BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 036343
(795 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|68161828|emb|CAJ09953.1| beta-galactosidase [Mangifera indica]
Length = 827
Score = 1251 bits (3236), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 590/826 (71%), Positives = 688/826 (83%), Gaps = 43/826 (5%)
Query: 10 AILLCLILQTLF-NLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAK 68
A LLCL+ Q +F +LS AY VSHDGRAI IDG+R++LLSGSIHYPRSTP MWPDLI+KAK
Sbjct: 5 AHLLCLLFQAVFISLSCAYNVSHDGRAIIIDGQRRVLLSGSIHYPRSTPEMWPDLIRKAK 64
Query: 69 EGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYG 128
EGGLDAIETYVFWNAHEP RRQYDF+G+LDLIRFIKTIQD+GLY +LRIGPYVCAEWNYG
Sbjct: 65 EGGLDAIETYVFWNAHEPARRQYDFSGHLDLIRFIKTIQDEGLYAVLRIGPYVCAEWNYG 124
Query: 129 GFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYG 188
GFPVWLHNMPG++E RT N+VFMNEMQNFTTLIVDM K+EKLFASQGGPII+AQIENEYG
Sbjct: 125 GFPVWLHNMPGVQEFRTVNEVFMNEMQNFTTLIVDMVKQEKLFASQGGPIIIAQIENEYG 184
Query: 189 NVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPN 237
N++S+YGDAGK YI+WCAKMA SLDIGVPWIMCQESDAP PM FTPN+PN
Sbjct: 185 NMISNYGDAGKVYIDWCAKMAESLDIGVPWIMCQESDAPQPMINTCNGWYCDSFTPNDPN 244
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
SPK+WTENWTGWFKSWGGKDP RTAEDLAF+VARFFQ GGTFQNYYMYHGGTNFGRTSGG
Sbjct: 245 SPKMWTENWTGWFKSWGGKDPHRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFGRTSGG 304
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS---- 353
PYLTTSYDYDAP+DE+G+LNQPKWGHL+ELH +LK+MEKTLT+GNV+ TD+GNSV+
Sbjct: 305 PYLTTSYDYDAPLDEFGNLNQPKWGHLKELHTVLKAMEKTLTHGNVSTTDFGNSVTATVY 364
Query: 354 ------------------------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVK 389
GS Y +PAWSVSILPDCKTE +NTAKVNTQT+V VK
Sbjct: 365 ATEEGSSCFFGNANTTGDATITFQGSDYVVPAWSVSILPDCKTEAYNTAKVNTQTSVIVK 424
Query: 390 RPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKD 449
+PNQA N+ + L+W WRPE I++ VV+GKG F+ + LIDQK ND SDYLWYMT+ DLK
Sbjct: 425 KPNQAENEPSSLKWVWRPEAIDEPVVQGKGSFSASFLIDQKVINDASDYLWYMTSVDLKP 484
Query: 450 DDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISL 509
DD I S NMTLR+N++G VLHA+VNG +V SQWTKYG D+F++ VKL GKNQISL
Sbjct: 485 DDIIW--SDNMTLRVNTTGIVLHAFVNGEHVGSQWTKYGVFKDVFQQQVKLNPGKNQISL 542
Query: 510 LSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFY 569
LS TVGLQNYG FDMV GI GPV L+G+ GDET+IKDLS HKWTY+VGL GL+D KFY
Sbjct: 543 LSVTVGLQNYGPMFDMVQAGITGPVELIGQKGDETVIKDLSCHKWTYEVGLTGLEDNKFY 602
Query: 570 NAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRY 629
+ + N GWS++NVP N +MTWYKTTF+APL NDPVVL+LQGMGKGFAWVNGYNLGRY
Sbjct: 603 SKASTNETCGWSAENVPSNSKMTWYKTTFKAPLGNDPVVLDLQGMGKGFAWVNGYNLGRY 662
Query: 630 WPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
WP+YLAE DGCS++ CDYRG Y ++KC NCG PSQ WYHVPRS+++DG NTLVLFEEFG
Sbjct: 663 WPSYLAEADGCSSDPCDYRGQYDNNKCVTNCGQPSQRWYHVPRSFLQDGENTLVLFEEFG 722
Query: 690 GNPSQINFQTVVVGTACGQAHENKTMELTCHGRRISEIKYASFGDPQGACGAFKKGSCEA 749
GNP Q+NFQT+VVG+ CG AHE KT+EL+C+GR IS IK+ASFGDPQG CG+F+ G+C+
Sbjct: 723 GNPWQVNFQTLVVGSVCGNAHEKKTLELSCNGRPISAIKFASFGDPQGTCGSFQAGTCQT 782
Query: 750 EIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
E D+LP+++++CVGK++CSI+ SE LG T+C + VK+L VEA+C
Sbjct: 783 EQDILPVLQQECVGKETCSIDISEDKLGKTNCGS-VVKKLAVEAVC 827
>gi|359484258|ref|XP_002276918.2| PREDICTED: beta-galactosidase 7-like [Vitis vinifera]
gi|297738528|emb|CBI27773.3| unnamed protein product [Vitis vinifera]
Length = 835
Score = 1140 bits (2950), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 547/834 (65%), Positives = 650/834 (77%), Gaps = 49/834 (5%)
Query: 1 MATLKHCSRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMW 60
+A+ K+ + AI C++ L L+ A VS+DGRA+ IDG+R++L SGSIHYPRSTP MW
Sbjct: 12 VASSKNATHAISFCVLFVLLNVLASAVEVSYDGRALIIDGKRRVLQSGSIHYPRSTPEMW 71
Query: 61 PDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPY 120
PDLI+KAK GGLDAIETYVFWN HEPLRR+YDF+GNLDLIRFI+TIQ +GLY +LRIGPY
Sbjct: 72 PDLIRKAKAGGLDAIETYVFWNVHEPLRREYDFSGNLDLIRFIQTIQAEGLYAVLRIGPY 131
Query: 121 VCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIIL 180
VCAEW YGGFP+WLHNMPGIE RT NKVFMNEMQNFTTLIVDMAK+EKLFASQGGPII+
Sbjct: 132 VCAEWTYGGFPMWLHNMPGIE-FRTANKVFMNEMQNFTTLIVDMAKQEKLFASQGGPIII 190
Query: 181 AQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM---------- 230
AQIENEYGN+M+ YGDAGK Y++WCA MA SLDIGVPWIMCQ+SDAP PM
Sbjct: 191 AQIENEYGNIMAPYGDAGKVYVDWCAAMANSLDIGVPWIMCQQSDAPQPMINTCNGWYCD 250
Query: 231 -FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGT 289
FTPNNPNSPK+WTENWTGWFK+WGGKDP RTAEDL+++VARFFQ GGTFQNYYMYHGGT
Sbjct: 251 SFTPNNPNSPKMWTENWTGWFKNWGGKDPHRTAEDLSYSVARFFQTGGTFQNYYMYHGGT 310
Query: 290 NFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG 349
NFGR +GGPY+TTSYDYDAP+DE+G+LNQPKWGHL++LH +LKSME+TLT GN+T D G
Sbjct: 311 NFGRVAGGPYITTSYDYDAPLDEFGNLNQPKWGHLKDLHTVLKSMEETLTEGNITTIDMG 370
Query: 350 NSVS----------------------------GSSYNLPAWSVSILPDCKTEEFNTAKVN 381
NSV G+ Y +PAWSVSILPDCK E +NTAKVN
Sbjct: 371 NSVEVTVYATQKVSSCFFSNSNTTNDATFTYGGTEYTVPAWSVSILPDCKKEVYNTAKVN 430
Query: 382 TQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWY 441
QT+V VK N+A + A L+W WRPEMI+D V GKG + N LIDQK+TND SDYLWY
Sbjct: 431 AQTSVMVKNKNEAEDQPASLKWSWRPEMIDDTAVLGKGQVSANRLIDQKTTNDRSDYLWY 490
Query: 442 MTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLT 501
M + DL +DD L + NMTLR+N++G +LHAYVNG Y+ SQW G N +FE VKL
Sbjct: 491 MNSVDLSEDD--LVWTDNMTLRVNATGHILHAYVNGEYLGSQWATNGIFNYVFEEKVKLK 548
Query: 502 RGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLY 561
GKN I+LLSAT+G QNYG+ +D+V +GI GPV +VGR GDETIIKDLSSHKW+YKVG++
Sbjct: 549 PGKNLIALLSATIGFQNYGAFYDLVQSGISGPVEIVGRKGDETIIKDLSSHKWSYKVGMH 608
Query: 562 GLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWV 621
G+ K Y+ + S W NVPLNR +TWYKTTF+APL D VV++LQG+GKG AWV
Sbjct: 609 GMA-MKLYDPE---SPYKWEEGNVPLNRNLTWYKTTFKAPLGTDAVVVDLQGLGKGEAWV 664
Query: 622 NGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNT 681
NG +LGRYWP+ +AE DGC+ +CDYRGPY + KC NCGNP+Q WYHVPRS++ NT
Sbjct: 665 NGQSLGRYWPSSIAE-DGCNA-TCDYRGPYTNTKCVRNCGNPTQRWYHVPRSFLTADENT 722
Query: 682 LVLFEEFGGNPSQINFQTVVVGTACGQAHENKTMELTCHGRRISEIKYASFGDPQGACGA 741
LVLFEEFGGNPS +NFQTV +GTACG A+EN +EL C R IS+IK+ASFGDPQG+CG+
Sbjct: 723 LVLFEEFGGNPSLVNFQTVTIGTACGNAYENNVLELACQNRPISDIKFASFGDPQGSCGS 782
Query: 742 FKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
F KGSCE D L +I+K CVGK+SCS++ SE G+TSC + KRL VEA+C
Sbjct: 783 FSKGSCEGNKDALDIIKKACVGKESCSLDVSEKAFGSTSCGS-IPKRLAVEAVC 835
>gi|357450109|ref|XP_003595331.1| Beta-galactosidase [Medicago truncatula]
gi|355484379|gb|AES65582.1| Beta-galactosidase [Medicago truncatula]
Length = 830
Score = 1132 bits (2927), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 533/825 (64%), Positives = 645/825 (78%), Gaps = 46/825 (5%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
LC I L+ + A VSHDGRAI IDG+R++L+SGSIHYPRSTP MWPDLIKKAKEG
Sbjct: 10 FFLCYIFLALYG-TYAVEVSHDGRAIKIDGKRRVLISGSIHYPRSTPQMWPDLIKKAKEG 68
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GLDAIETYVFWNAHEP+RR+YDF+GN DLIRF+KTIQD+GL+ +LRIGPYVCAEWNYGG
Sbjct: 69 GLDAIETYVFWNAHEPIRREYDFSGNNDLIRFLKTIQDEGLFAVLRIGPYVCAEWNYGGI 128
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
PVW++N+PG+E +RT NKVFMNEMQNFTTLIVDM +KEKLFASQGGPIIL+QIENEYGNV
Sbjct: 129 PVWVYNLPGVE-IRTANKVFMNEMQNFTTLIVDMVRKEKLFASQGGPIILSQIENEYGNV 187
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSP 239
MS YGD GK+YINWCA MA S +IGVPWIMCQ+ DAP PM F PNNPNSP
Sbjct: 188 MSAYGDEGKAYINWCANMADSFNIGVPWIMCQQPDAPQPMINTCNGWYCHDFEPNNPNSP 247
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
K+WTENW GWFK+WGGKDP RTAED+A++VARFF+ GGTFQNYYMYHGGTNFGRT+GGPY
Sbjct: 248 KMWTENWVGWFKNWGGKDPHRTAEDIAYSVARFFETGGTFQNYYMYHGGTNFGRTAGGPY 307
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSV------- 352
+TTSYDYDAP+DEYG++ QPKWGHL+ELH +LKSME +LT GNV+ D G+ V
Sbjct: 308 ITTSYDYDAPLDEYGNIAQPKWGHLKELHLVLKSMENSLTNGNVSKIDLGSYVKATVYAT 367
Query: 353 ---------------------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRP 391
G++YN+PAWSVSILPDC+TEE+NTAKVN QT++ VKR
Sbjct: 368 NDSSSCFLTNTNTTTDATVTFKGNTYNVPAWSVSILPDCQTEEYNTAKVNVQTSIMVKRE 427
Query: 392 NQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQK-STNDVSDYLWYMTNADLKDD 450
N+A ++ L+W WR E +++ ++ GK + NT++DQK + ND SDYLWYMT D+
Sbjct: 428 NKAEDEPEALKWVWRAENVHNSLI-GKSSVSKNTIVDQKIAANDSSDYLWYMTRLDINQK 486
Query: 451 DPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLL 510
DP+ ++N LRIN +G V+HA+VNG ++ S W YG ND FE +KL G+N ISLL
Sbjct: 487 DPVW--TNNTILRINGTGHVIHAFVNGEHIGSHWATYGIHNDQFETNIKLKHGRNDISLL 544
Query: 511 SATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYN 570
S TVGLQNYG ++D +G+ P+ L+G GDETIIKDLSSHKWTYKVGL+G ++K F
Sbjct: 545 SVTVGLQNYGKEYDKWQDGLVSPIELIGTKGDETIIKDLSSHKWTYKVGLHGWENKFFSQ 604
Query: 571 AKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
S W S +P+N+ +TWYKTTF+APLE+DP+V++LQGMGKG+AWVNG++LGRYW
Sbjct: 605 DTFFASSSKWESNELPINKMLTWYKTTFKAPLESDPIVVDLQGMGKGYAWVNGHSLGRYW 664
Query: 631 PTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGG 690
P+Y A+EDGCS + CDYRG Y KC NCG PSQ WYHVPR +I+DGVNTLVLFEE GG
Sbjct: 665 PSYNADEDGCSDDPCDYRGEYNDTKCVSNCGKPSQRWYHVPRDFIEDGVNTLVLFEEIGG 724
Query: 691 NPSQINFQTVVVGTACGQAHENKTMELTCHGRRISEIKYASFGDPQGACGAFKKGSCEAE 750
NPSQINFQTV+VG+AC A+ENKT+EL+CHGR IS+IK+ASFG+PQG CGAF KGSCE+
Sbjct: 725 NPSQINFQTVIVGSACANAYENKTLELSCHGRSISDIKFASFGNPQGTCGAFTKGSCESN 784
Query: 751 IDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+ L L++K CVGK+SCSI+ SE GAT+C VKRL VEA+C
Sbjct: 785 NEALSLVQKACVGKESCSIDVSEKTFGATNC-GNMVKRLAVEAVC 828
>gi|255550373|ref|XP_002516237.1| beta-galactosidase, putative [Ricinus communis]
gi|223544723|gb|EEF46239.1| beta-galactosidase, putative [Ricinus communis]
Length = 825
Score = 1125 bits (2911), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 534/836 (63%), Positives = 647/836 (77%), Gaps = 52/836 (6%)
Query: 1 MATLKHCSRAILLCLILQTL-FNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGM 59
MA+LK LL + F+L A +SHDGRAITIDG+R++LLSGSIHYPRSTP M
Sbjct: 1 MASLK-----FLLAISFSLFTFHLVSAAVISHDGRAITIDGKRRVLLSGSIHYPRSTPQM 55
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
WPDLIKK+KEGGLDAIETYVFWN HEP RRQYDF GNLDL+RFIK +QD+GLY +LRIGP
Sbjct: 56 WPDLIKKSKEGGLDAIETYVFWNVHEPSRRQYDFGGNLDLVRFIKAVQDEGLYAVLRIGP 115
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
YVCAEWNYGGFPVWLHNMPGIE LRT N +FMNEMQNFT+LIVDM K+E+LFASQGGPII
Sbjct: 116 YVCAEWNYGGFPVWLHNMPGIE-LRTANSIFMNEMQNFTSLIVDMMKQEQLFASQGGPII 174
Query: 180 LAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM--------- 230
+AQ+ENEYGNVMS YG AGK+YI+WCA MA SL+IGVPWIMCQ+SDAP PM
Sbjct: 175 IAQVENEYGNVMSSYGAAGKAYIDWCANMAESLNIGVPWIMCQQSDAPDPMINTCNGWYC 234
Query: 231 --FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGG 288
FTP+NPNSPK+WTENWTGWFKSWGGKDP RTAED+AFAVARFFQ GGTFQNYYMYHGG
Sbjct: 235 DQFTPSNPNSPKMWTENWTGWFKSWGGKDPHRTAEDVAFAVARFFQTGGTFQNYYMYHGG 294
Query: 289 TNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDY 348
TNFGRT+GGPY+TTSYDYDAP+DE+G+LNQPKWGHL++LH +L SME+ LT G V++ DY
Sbjct: 295 TNFGRTAGGPYITTSYDYDAPLDEFGNLNQPKWGHLKQLHDVLHSMEEILTSGTVSSVDY 354
Query: 349 GNSVS----------------------------GSSYNLPAWSVSILPDCKTEEFNTAKV 380
NSV+ G++Y +PAWSVSILPDC +NTAKV
Sbjct: 355 DNSVTATIYATDKESSCFLSNANETSDATIEFKGTTYTIPAWSVSILPDCANVGYNTAKV 414
Query: 381 NTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKST-NDVSDYL 439
TQT+V VKR N+A ++ L W WRPE ++ V+ G+GH ++DQK+ ND SDYL
Sbjct: 415 KTQTSVMVKRDNKAEDEPTSLNWSWRPENVDKTVLLGQGHIHAKQIVDQKAVANDASDYL 474
Query: 440 WYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVK 499
WYMT+ DLK DD I S +M++RIN SG +LHAYVNG Y+ SQW++Y SN +FE+ VK
Sbjct: 475 WYMTSVDLKKDDLIWS--KDMSIRINGSGHILHAYVNGEYLGSQWSEYSVSNYVFEKSVK 532
Query: 500 LTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVG 559
L G+N I+LLSATVGL NYG+ +D++ GI GPV LVGR GDETIIKDLS+++W+YKVG
Sbjct: 533 LKHGRNLITLLSATVGLANYGANYDLIQAGILGPVELVGRKGDETIIKDLSNNRWSYKVG 592
Query: 560 LYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFA 619
L GL+DK + + S+ W + +P N+ +TWYKTTF+APL DPVVL+LQG+GKG A
Sbjct: 593 LLGLEDKLYLSDSKHASK--WQEQELPTNKMLTWYKTTFKAPLGTDPVVLDLQGLGKGMA 650
Query: 620 WVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGV 679
W+NG ++GRYWP++LAE+DGCST+ CDYRGPY ++KC NCG P+Q WYHVPRS+++D
Sbjct: 651 WINGNSIGRYWPSFLAEDDGCSTDLCDYRGPYDNNKCVSNCGKPTQRWYHVPRSFLQDNE 710
Query: 680 NTLVLFEEFGGNPSQINFQTVVVGTACGQAHENKTMELTCHGRRISEIKYASFGDPQGAC 739
NTLVLFEEFGGNPSQ+NFQTVV G AC E + +E++C+G+ IS +++ASFGDPQG C
Sbjct: 711 NTLVLFEEFGGNPSQVNFQTVVTGVACVSGDEGEVVEISCNGQSISAVQFASFGDPQGTC 770
Query: 740 GAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G+ KGSCE D L +++K CVG +SCS+E S G+TSC G V RL VE LC
Sbjct: 771 GSSVKGSCEGTEDALLIVQKACVGNESCSLEVSHKLFGSTSCDNG-VNRLAVEVLC 825
>gi|255550411|ref|XP_002516256.1| beta-galactosidase, putative [Ricinus communis]
gi|223544742|gb|EEF46258.1| beta-galactosidase, putative [Ricinus communis]
Length = 848
Score = 1110 bits (2870), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 530/826 (64%), Positives = 633/826 (76%), Gaps = 50/826 (6%)
Query: 10 AILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
AI CL F A VSHDGRAITIDG+R++L+SGSIHYPRST MWPDLIKK+KE
Sbjct: 33 AIFFCL-----FTFVSATIVSHDGRAITIDGKRRVLISGSIHYPRSTAEMWPDLIKKSKE 87
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
GGLDAIETYVFWN+HEP RRQYDF+GNLDL+RFIKTIQ +GLY +LRIGPYVCAEWNYGG
Sbjct: 88 GGLDAIETYVFWNSHEPSRRQYDFSGNLDLVRFIKTIQAEGLYAVLRIGPYVCAEWNYGG 147
Query: 130 FPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGN 189
FP+WLHN+PG E LRT N VFMNEMQNFT+LIVDM K E LFASQGGPIILAQ+ENEYGN
Sbjct: 148 FPMWLHNLPGCE-LRTANSVFMNEMQNFTSLIVDMMKDENLFASQGGPIILAQVENEYGN 206
Query: 190 VMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNS 238
VMS YG AGK+YI+WC+ MA SLDIGVPWIMCQ+SDAP PM FTPNN NS
Sbjct: 207 VMSAYGAAGKTYIDWCSNMAESLDIGVPWIMCQQSDAPQPMINTCNGWYCDQFTPNNANS 266
Query: 239 PKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGP 298
PK+WTENWTGWFKSWGGKDP RTAED+AFAVARFFQ GGTFQNYYMYHGGTNFGRT+GGP
Sbjct: 267 PKMWTENWTGWFKSWGGKDPHRTAEDVAFAVARFFQTGGTFQNYYMYHGGTNFGRTAGGP 326
Query: 299 YLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS----- 353
Y+TTSYDYDAP+DEYG+LNQPKWGHL++LH +L SME TLT+GN++ DY NSV+
Sbjct: 327 YITTSYDYDAPLDEYGNLNQPKWGHLKQLHDILHSMEYTLTHGNISTIDYDNSVTATIYA 386
Query: 354 -----------------------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR 390
G+ YN+PAWSVSILPDC+ +NTAKV TQT + VK+
Sbjct: 387 TDKESACFFGNANETSDATIVFKGTEYNVPAWSVSILPDCENVGYNTAKVKTQTAIMVKQ 446
Query: 391 PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKST-NDVSDYLWYMTNADLKD 449
N+A + + L+W W PE + + GKGH LIDQK+ ND SDYLWYMT+ +K
Sbjct: 447 KNEAEDQPSSLKWSWIPENTHTTSLLGKGHAHARQLIDQKAAANDASDYLWYMTSLHIKK 506
Query: 450 DDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISL 509
DDP+ S S+M+LR+N SG VLHAYVNG ++ SQ+ KYG + +FE+ +KL GKN ISL
Sbjct: 507 DDPVWS--SDMSLRVNGSGHVLHAYVNGKHLGSQFAKYGVFSYVFEKSLKLRPGKNVISL 564
Query: 510 LSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFY 569
LSATVGLQNYG FD+V GIPGPV ++G GDE ++KDLSSHKW+Y VGL G ++ Y
Sbjct: 565 LSATVGLQNYGPMFDLVQTGIPGPVEIIGHRGDEKVVKDLSSHKWSYSVGLNGFHNE-LY 623
Query: 570 NAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRY 629
++ + ++ R W +++P N+ M WYKTTF+APL DPVVL+LQGMGKGFAWVNG N+GRY
Sbjct: 624 SSNSRHASR-WVEQDLPTNKMMIWYKTTFKAPLGKDPVVLDLQGMGKGFAWVNGNNIGRY 682
Query: 630 WPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
WP++LAEEDGCSTE CDYRG Y ++KC NCG P+Q WYHVPRS+ D NTLVLFEEFG
Sbjct: 683 WPSFLAEEDGCSTEVCDYRGAYDNNKCVTNCGKPTQRWYHVPRSFFNDYENTLVLFEEFG 742
Query: 690 GNPSQINFQTVVVGTACGQAHENKTMELTCHGRRISEIKYASFGDPQGACGAFKKGSCEA 749
GNP+ +NFQTV VG G A E +T+EL+C+G+ IS I++ASFGDPQG GA+ KG+CE
Sbjct: 743 GNPAGVNFQTVTVGKVSGSAGEGETIELSCNGKSISAIEFASFGDPQGTSGAYVKGTCEG 802
Query: 750 EIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
D +++K CVGK++C +EAS+ G TSC + V L V+A C
Sbjct: 803 SNDAFSIVQKACVGKETCKLEASKDVFGPTSCGSDVVNTLAVQATC 848
>gi|449452767|ref|XP_004144130.1| PREDICTED: beta-galactosidase 15-like [Cucumis sativus]
Length = 827
Score = 1105 bits (2859), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 520/808 (64%), Positives = 629/808 (77%), Gaps = 45/808 (5%)
Query: 28 RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPL 87
+VS+ R ITIDG+ KI LSGSIHYPRSTP MWPDLIKK+KEGGLD IETYVFWNAHEP+
Sbjct: 25 QVSYTNRGITIDGQPKIFLSGSIHYPRSTPQMWPDLIKKSKEGGLDTIETYVFWNAHEPV 84
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
RRQYDF+ NLDL+RFIKTIQ++GLY +LRIGPYVCAEWNYGGFPVWLHN+PGIEELRTTN
Sbjct: 85 RRQYDFSANLDLVRFIKTIQNEGLYAVLRIGPYVCAEWNYGGFPVWLHNLPGIEELRTTN 144
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
VFMNEMQNFTTLIVDM K+E LFASQGGPIILAQIENEYGNVM+ YGDAGK+Y+NWCA
Sbjct: 145 PVFMNEMQNFTTLIVDMMKQENLFASQGGPIILAQIENEYGNVMTSYGDAGKAYVNWCAN 204
Query: 208 MATSLDIGVPWIMCQESDAPSP-----------MFTPNNPNSPKIWTENWTGWFKSWGGK 256
MA S ++GVPWIMCQ+ DAP P FTPNN SPK+WTENWTGWFKSWGG+
Sbjct: 205 MADSQNVGVPWIMCQQDDAPEPTINTCNGWYCDQFTPNNAKSPKMWTENWTGWFKSWGGR 264
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHL 316
DP RT EDLAF+VARFFQ GGTFQNYYMYHGGTNF R +GGPY+TT+YDY+AP+DEYG+L
Sbjct: 265 DPVRTPEDLAFSVARFFQLGGTFQNYYMYHGGTNFDRMAGGPYITTTYDYNAPLDEYGNL 324
Query: 317 NQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS----------------------- 353
NQPK+GHL++LH LKS+EK L GNVT TD +SVS
Sbjct: 325 NQPKFGHLKQLHAALKSIEKALVSGNVTTTDLTDSVSITEYATDKGKSCFFSNINETTDA 384
Query: 354 -----GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
G +N+PAWSVSILPDC+ E +NTAKVNTQT+V VK+ N+A N+ L+W WRPE
Sbjct: 385 LVNYLGKDFNVPAWSVSILPDCQEEVYNTAKVNTQTSVMVKKENKAENEPEVLEWMWRPE 444
Query: 409 MINDFVVRGKGHFALNTLIDQK-STNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
I++ GKG N LIDQK + ND SDYLWYMT+ +LK DPI S + MTLRIN S
Sbjct: 445 NIDNTARLGKGQVTANKLIDQKDAANDASDYLWYMTSVNLKKKDPIWS--NEMTLRINVS 502
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G ++HA+VNG ++ SQW Y N +FE+ VKL GKN ISLLSAT+GL+NYG+++D++
Sbjct: 503 GHIVHAFVNGEHIGSQWASYDVYNYIFEQEVKLKPGKNIISLLSATIGLKNYGAQYDLIQ 562
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
+GI GPV L+GR GDETIIKDLS+HKW+Y+VGL+G +++ F + + W S N+P+
Sbjct: 563 SGIVGPVQLIGRHGDETIIKDLSNHKWSYEVGLHGFENRLF--SPESRFATKWQSGNLPV 620
Query: 588 NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDY 647
NR MTWYKTTF+ PL DPV L+LQG+GKG AWVNG+++GRYWP+++AE DGCS E CDY
Sbjct: 621 NRMMTWYKTTFKPPLGTDPVTLDLQGLGKGMAWVNGHSIGRYWPSFIAE-DGCSDEPCDY 679
Query: 648 RGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACG 707
RG Y + KC +CG P+Q WYHVPRSW+ +G NTLVLFEEFGGNPS +NF+T+ + ACG
Sbjct: 680 RGSYTNTKCVRDCGKPTQQWYHVPRSWLNEGDNTLVLFEEFGGNPSLVNFKTIAMEKACG 739
Query: 708 QAHENKTMELTCHGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSC 767
A+E K++EL+C G+ I+ IK+ASFGDP G+CG F KGSCE + D + ++E C+GK+SC
Sbjct: 740 HAYEKKSLELSCQGKEITGIKFASFGDPTGSCGNFSKGSCEGKNDAMKIVEDLCIGKESC 799
Query: 768 SIEASEANLGATSCAAGTVKRLVVEALC 795
I+ SE GAT+CA G VKRL VEA+C
Sbjct: 800 VIDISEDTFGATNCALGVVKRLAVEAVC 827
>gi|356564721|ref|XP_003550597.1| PREDICTED: beta-galactosidase 7-like [Glycine max]
Length = 831
Score = 1103 bits (2852), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 526/814 (64%), Positives = 623/814 (76%), Gaps = 45/814 (5%)
Query: 22 NLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFW 81
N +L VSHDGRAI IDG+R++L+SGSIHYPRSTP MWP+LI+KAKEGGLDAIETYVFW
Sbjct: 23 NKALHTNVSHDGRAIKIDGKRRVLISGSIHYPRSTPEMWPELIQKAKEGGLDAIETYVFW 82
Query: 82 NAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIE 141
NAHEP RR YDF+GN D+IRF+KTIQ+ GLY +LRIGPYVCAEWNYGG PVW+HN+P +E
Sbjct: 83 NAHEPSRRVYDFSGNNDIIRFLKTIQESGLYGVLRIGPYVCAEWNYGGIPVWVHNLPDVE 142
Query: 142 ELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSY 201
+RT N VFMNEMQNFTTLIVDM KKEKLFASQGGPIIL QIENEYGNV+S YGDAGK+Y
Sbjct: 143 -IRTANSVFMNEMQNFTTLIVDMLKKEKLFASQGGPIILTQIENEYGNVISQYGDAGKAY 201
Query: 202 INWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWF 250
+NWCA MA SL +GVPWIMCQESDAP PM F PN+ NSPK+WTENW GWF
Sbjct: 202 MNWCANMAESLKVGVPWIMCQESDAPQPMINTCNGWYCDNFEPNSFNSPKMWTENWIGWF 261
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPI 310
K+WGG+DP RTAED+AFAVARFFQ GGTFQNYYMYHGGTNFGRT+GGPY+TTSYDYDAP+
Sbjct: 262 KNWGGRDPHRTAEDVAFAVARFFQTGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPL 321
Query: 311 DEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS----------------- 353
DEYG++ QPKWGHL+ELH LK+ME+ LT GNV+ TD GNSV
Sbjct: 322 DEYGNIAQPKWGHLKELHSALKAMEEALTSGNVSETDLGNSVKVTIYATNGSSSCFLSNT 381
Query: 354 -----------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQ 402
G++Y +PAWSVSILPDC+ EE+NTAKV QT+V K ++A + A L+
Sbjct: 382 NTTADATLTFRGNNYTVPAWSVSILPDCQHEEYNTAKVKEQTSVMTKENSKAEKEAAILK 441
Query: 403 WKWRPEMINDFVVRGKGHFALNTLIDQK-STNDVSDYLWYMTNADLKDDDPILSGSSNMT 461
W WR E I D + GK + + + L+DQK + ND SDYLWYMT +K DDP+ S NMT
Sbjct: 442 WVWRSENI-DKALHGKSNVSAHRLLDQKDAANDASDYLWYMTKLHVKHDDPVW--SENMT 498
Query: 462 LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGS 521
LRIN SG V+HA+VNG Y+DS W YG ND FE +KL G N ISLLS TVGLQNYG+
Sbjct: 499 LRINGSGHVIHAFVNGEYIDSHWATYGIHNDKFEPKIKLKHGTNTISLLSVTVGLQNYGA 558
Query: 522 KFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWS 581
FD G+ GP+ LV G+ETIIK+LSSHKW+YK+GL+G D K F + ++ W
Sbjct: 559 FFDTWHAGLVGPIELVSVKGEETIIKNLSSHKWSYKIGLHGWDHKLFSDDSPFAAQSKWE 618
Query: 582 SKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCS 641
S+ +P NR +TWYKTTF+APL DPVV++LQGMGKG+AWVNG N+GR WP+Y AEEDGCS
Sbjct: 619 SEKLPTNRMLTWYKTTFKAPLGTDPVVVDLQGMGKGYAWVNGKNIGRIWPSYNAEEDGCS 678
Query: 642 TESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVV 701
E CDYRG Y KC NCG P+Q WYHVPRS++KDG NTLVLF E GGNPS +NFQTVV
Sbjct: 679 DEPCDYRGEYSDSKCVTNCGKPTQRWYHVPRSYLKDGANTLVLFAELGGNPSLVNFQTVV 738
Query: 702 VGTACGQAHENKTMELTCHGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQC 761
VG C A+ENKT+EL+C GR+IS IK+ASFGDP+G CGAF GSCE++ + LP+++K C
Sbjct: 739 VGNVCANAYENKTLELSCQGRKISAIKFASFGDPKGVCGAFTNGSCESKSNALPIVQKAC 798
Query: 762 VGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
VGK++CSI+ SE GAT+C KRL VEA+C
Sbjct: 799 VGKEACSIDLSEKTFGATAC-GNLAKRLAVEAVC 831
>gi|449529387|ref|XP_004171681.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 15-like [Cucumis sativus]
Length = 827
Score = 1102 bits (2851), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 519/808 (64%), Positives = 628/808 (77%), Gaps = 45/808 (5%)
Query: 28 RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPL 87
+VS+ R ITIDG+ KI LSGSIHYPRSTP MWPDLIKK+KEGGLD IETYVFWNAHEP+
Sbjct: 25 QVSYTNRGITIDGQPKIFLSGSIHYPRSTPQMWPDLIKKSKEGGLDTIETYVFWNAHEPV 84
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
RRQYDF+ NLDL+RFIKTIQ++GLY +LRIGPYVCAEWNYGGFPVWLHN+PGIEELRTTN
Sbjct: 85 RRQYDFSANLDLVRFIKTIQNEGLYAVLRIGPYVCAEWNYGGFPVWLHNLPGIEELRTTN 144
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
VFMNEMQNFTTLIVDM K+E LFASQGGPIILAQIENEYGNVM+ YGDAGK+Y+NWCA
Sbjct: 145 PVFMNEMQNFTTLIVDMMKQENLFASQGGPIILAQIENEYGNVMTSYGDAGKAYVNWCAN 204
Query: 208 MATSLDIGVPWIMCQESDAPSP-----------MFTPNNPNSPKIWTENWTGWFKSWGGK 256
MA S ++GVPWIMCQ+ DAP P FTPNN SPK+WTENWTGWFKSWGG+
Sbjct: 205 MADSQNVGVPWIMCQQDDAPEPTINTCNGWYCDQFTPNNAKSPKMWTENWTGWFKSWGGR 264
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHL 316
DP RT EDLAF+VARFFQ GGTFQNYYMYHGGTNF R +GGPY+TT+YDY+AP+DEYG+L
Sbjct: 265 DPVRTPEDLAFSVARFFQLGGTFQNYYMYHGGTNFDRMAGGPYITTTYDYNAPLDEYGNL 324
Query: 317 NQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS----------------------- 353
NQPK+GHL++LH LKS+EK L GNVT TD +SVS
Sbjct: 325 NQPKFGHLKQLHAALKSIEKALVSGNVTTTDLTDSVSITEYATDKGKSCFFSNINETTDA 384
Query: 354 -----GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
G +N+PAWSVSILPDC+ E +NTAKVNTQT+V VK+ N+A N+ L+W WRPE
Sbjct: 385 LVNYLGKDFNVPAWSVSILPDCQEEVYNTAKVNTQTSVMVKKENKAENEPEVLEWMWRPE 444
Query: 409 MINDFVVRGKGHFALNTLIDQK-STNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
I++ GKG N LIDQK + ND SDYLWYMT+ +LK DPI S + MTLRIN S
Sbjct: 445 NIDNTARLGKGQVTANKLIDQKDAANDASDYLWYMTSVNLKKKDPIWS--NEMTLRINVS 502
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G ++HA+VNG ++ SQW Y N + E+ VKL GKN ISLLSAT+GL+NYG+++D++
Sbjct: 503 GHIVHAFVNGEHIGSQWASYDVYNYIXEQEVKLKPGKNIISLLSATIGLKNYGAQYDLIQ 562
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
+GI GPV L+GR GDETIIKDLS+HKW+Y+VGL+G +++ F + + W S N+P+
Sbjct: 563 SGIVGPVQLIGRHGDETIIKDLSNHKWSYEVGLHGFENRLF--SPESRFATKWQSGNLPV 620
Query: 588 NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDY 647
NR MTWYKTTF+ PL DPV L+LQG+GKG AWVNG+++GRYWP+++AE DGCS E CDY
Sbjct: 621 NRMMTWYKTTFKPPLGTDPVTLDLQGLGKGMAWVNGHSIGRYWPSFIAE-DGCSDEPCDY 679
Query: 648 RGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACG 707
RG Y + KC +CG P+Q WYHVPRSW+ +G NTLVLFEEFGGNPS +NF+T+ + ACG
Sbjct: 680 RGSYTNTKCVRDCGKPTQQWYHVPRSWLNEGDNTLVLFEEFGGNPSLVNFKTIAMEKACG 739
Query: 708 QAHENKTMELTCHGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSC 767
A+E K++EL+C G+ I+ IK+ASFGDP G+CG F KGSCE + D + ++E C+GK+SC
Sbjct: 740 HAYEKKSLELSCQGKEITGIKFASFGDPTGSCGNFSKGSCEGKNDAMKIVEDLCIGKESC 799
Query: 768 SIEASEANLGATSCAAGTVKRLVVEALC 795
I+ SE GAT+CA G VKRL VEA+C
Sbjct: 800 VIDISEDTFGATNCALGVVKRLAVEAVC 827
>gi|356545784|ref|XP_003541315.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 826
Score = 1100 bits (2846), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 526/832 (63%), Positives = 627/832 (75%), Gaps = 46/832 (5%)
Query: 4 LKHCSRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDL 63
+ S ++ C ++ + S A VSHDGRAI IDG+R++LLSGSIHYPRSTP MWP+L
Sbjct: 1 MNFLSLSVWFCFVILSFIG-SNAVEVSHDGRAIIIDGKRRVLLSGSIHYPRSTPEMWPEL 59
Query: 64 IKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCA 123
I+KAKEGGLDAIETYVFWNAHEP RR YDF+GN D+IRF+KTIQ+ GLY +LRIGPYVCA
Sbjct: 60 IQKAKEGGLDAIETYVFWNAHEPSRRVYDFSGNNDIIRFLKTIQESGLYGVLRIGPYVCA 119
Query: 124 EWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQI 183
EWNYGG PVW+HN+P +E +RT N V+MNEMQNFTTLIVDM KKEKLFASQGGPIIL QI
Sbjct: 120 EWNYGGIPVWVHNLPDVE-IRTANSVYMNEMQNFTTLIVDMVKKEKLFASQGGPIILTQI 178
Query: 184 ENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FT 232
ENEYGNV+S YGDAGK+Y+NWCA MA SL++GVPWIMCQESDAP M F
Sbjct: 179 ENEYGNVISHYGDAGKAYMNWCANMAESLNVGVPWIMCQESDAPQSMINTCNGFYCDNFE 238
Query: 233 PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG 292
PNNP+SPK+WTENW GWFK+WGG+DP RTAED+AFAVARFFQ GGTFQNYYMYHGGTNF
Sbjct: 239 PNNPSSPKMWTENWVGWFKNWGGRDPHRTAEDVAFAVARFFQTGGTFQNYYMYHGGTNFD 298
Query: 293 RTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSV 352
RT+GGPY+TTSYDYDAP+DEYG++ QPKWGHL+ELH +LKSME+TLT GNV+ TD+GNSV
Sbjct: 299 RTAGGPYITTSYDYDAPLDEYGNIAQPKWGHLKELHNVLKSMEETLTSGNVSETDFGNSV 358
Query: 353 S----------------------------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQT 384
G +Y +PAWSVSILPDC+ EE+NTAKVN QT
Sbjct: 359 KATIYATNGSSSCFLSSTNTTTDATLTFRGKNYTVPAWSVSILPDCEHEEYNTAKVNVQT 418
Query: 385 NVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQK-STNDVSDYLWYMT 443
+V VK ++A + L+W WR E I D + GK + + N L+DQK + ND SDYLWYMT
Sbjct: 419 SVMVKENSKAEEEATALKWVWRSENI-DNALHGKSNVSANRLLDQKDAANDASDYLWYMT 477
Query: 444 NADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRG 503
+K DDP+ NMTLRINSSG V+HA+VNG ++ S W YG ND FE +KL G
Sbjct: 478 KLHVKHDDPVWG--ENMTLRINSSGHVIHAFVNGEHIGSHWATYGIHNDKFEPKIKLKHG 535
Query: 504 KNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGL 563
N ISLLS TVGLQNYG+ FD G+ P+ LV GDETIIK+LSS+KW+YKVGL+G
Sbjct: 536 TNTISLLSVTVGLQNYGAFFDTWHAGLVEPIELVSVKGDETIIKNLSSNKWSYKVGLHGW 595
Query: 564 DDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNG 623
D K F + + W S+ +P +R +TWYKTTF APL DPVV++LQGMGKG+AWVNG
Sbjct: 596 DHKLFSDDSPFAAPNKWESEKLPTDRMLTWYKTTFNAPLGTDPVVVDLQGMGKGYAWVNG 655
Query: 624 YNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLV 683
N+GR WP+Y AEEDGCS E CDYRG Y KC NCG P+Q WYHVPRS++KDG N LV
Sbjct: 656 QNIGRIWPSYNAEEDGCSDEPCDYRGEYTDSKCVTNCGKPTQRWYHVPRSYLKDGANNLV 715
Query: 684 LFEEFGGNPSQINFQTVVVGTACGQAHENKTMELTCHGRRISEIKYASFGDPQGACGAFK 743
LF E GGNPSQ+NFQTVVVGT C A+ENKT+EL+C GR+IS IK+ASFGDP+G CGAF
Sbjct: 716 LFAELGGNPSQVNFQTVVVGTVCANAYENKTLELSCQGRKISAIKFASFGDPEGVCGAFT 775
Query: 744 KGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
GSCE++ + L +++K CVGK++CS + SE G T+C KRL VEA+C
Sbjct: 776 NGSCESKSNALSIVQKACVGKQACSFDVSEKTFGPTAC-GNVAKRLAVEAVC 826
>gi|224053294|ref|XP_002297749.1| predicted protein [Populus trichocarpa]
gi|222845007|gb|EEE82554.1| predicted protein [Populus trichocarpa]
Length = 823
Score = 1085 bits (2807), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 514/831 (61%), Positives = 633/831 (76%), Gaps = 50/831 (6%)
Query: 6 HCSRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIK 65
H S+ +L L TL + A +V++DGRAI IDG+ ++L+SGSIHYPRST MWPDL+K
Sbjct: 2 HPSKVLLATLFFFTLAPWATASKVTYDGRAIIIDGKHRLLVSGSIHYPRSTAQMWPDLVK 61
Query: 66 KAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEW 125
K++EGGLDAIETYVFW++HEP RR+YDF+GNLDLIRF+KTIQD+GLY +LRIGPYVCAEW
Sbjct: 62 KSREGGLDAIETYVFWDSHEPARREYDFSGNLDLIRFLKTIQDEGLYAVLRIGPYVCAEW 121
Query: 126 NYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIEN 185
NYGGFPVWLHNMPG++ +RT N VFMNEM+NFTTLIV+M K+E LFASQGGP+ILAQIEN
Sbjct: 122 NYGGFPVWLHNMPGVQ-MRTANDVFMNEMRNFTTLIVNMVKQENLFASQGGPVILAQIEN 180
Query: 186 EYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPN 234
EYGNVMS YGD GK+YI WCA MA SL IGVPW+MCQ+SDAP PM FTPN
Sbjct: 181 EYGNVMSSYGDEGKAYIEWCANMAQSLHIGVPWLMCQQSDAPEPMINTCNGWYCDQFTPN 240
Query: 235 NPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRT 294
P SPK+WTENWTGWFKSWGGKDP RTAEDLAF+VARF+Q GGTFQNYYMYHGGTNFGRT
Sbjct: 241 RPTSPKMWTENWTGWFKSWGGKDPHRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFGRT 300
Query: 295 SGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSG 354
+GGPY+TTSYDYDAP+DEYG+LNQPKWGHL+ELH +L SME TLT GN+++ D+GNSVSG
Sbjct: 301 AGGPYITTSYDYDAPLDEYGNLNQPKWGHLKELHDVLHSMEDTLTRGNISSVDFGNSVSG 360
Query: 355 S----------------------------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNV 386
+ Y +PAWSVSILPDC+ +NTAKV+ QT+V
Sbjct: 361 TIYSTEKGSSCFLTNTDSRNDTTINFQGLDYEVPAWSVSILPDCQDVVYNTAKVSAQTSV 420
Query: 387 KVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQK-STNDVSDYLWYMTNA 445
VK+ N A ++ A L W WRPE + ++ GKG ++N ++DQK + ND+SDYL+YMT+
Sbjct: 421 MVKKKNVAEDEPAALTWSWRPETNDKSILFGKGEVSVNQILDQKDAANDLSDYLFYMTSV 480
Query: 446 DLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKN 505
LK+DDPI NMTLRI SGQVLH +VNG ++ SQW KYG + +FE+ +KL +GKN
Sbjct: 481 SLKEDDPIWG--DNMTLRITGSGQVLHVFVNGEFIGSQWAKYGVFDYVFEQQIKLNKGKN 538
Query: 506 QISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDD 565
I+LLSATVG NYG+ FD+ G+ GPV LVG DE IIKDLSSHKW+YKVGL GL
Sbjct: 539 TITLLSATVGFANYGANFDLTQAGVRGPVELVGYHDDEIIIKDLSSHKWSYKVGLEGLRQ 598
Query: 566 KKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYN 625
N +++S + W N P N+ TWYK TF+APL DPVV++L G+GKG AWVNG +
Sbjct: 599 ----NLYSSDSSK-WQQDNYPTNKMFTWYKATFKAPLGTDPVVVDLLGLGKGLAWVNGNS 653
Query: 626 LGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIK-DGVNTLVL 684
+GRYWP+++AE DGCS + CDYRG Y ++KC NCG P+Q WYHVPRS++ +G NTLVL
Sbjct: 654 IGRYWPSFIAE-DGCSLDPCDYRGSYDNNKCVTNCGKPTQRWYHVPRSFLNNEGDNTLVL 712
Query: 685 FEEFGGNPSQINFQTVVVGTACGQAHENKTMELTCHGRRISEIKYASFGDPQGACGAFKK 744
FEEFGG+PS +NFQT +G+AC A E K +EL+C GR IS IK+ASFG+P G CG+F K
Sbjct: 713 FEEFGGDPSSVNFQTTAIGSACVNAEEKKKIELSCQGRPISAIKFASFGNPLGTCGSFSK 772
Query: 745 GSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G+CEA D L +++K CVG++SC+I+ SE G+T+C +K L VEA+C
Sbjct: 773 GTCEASNDALSIVQKACVGQESCTIDVSEDTFGSTTCGDDVIKTLSVEAIC 823
>gi|297808143|ref|XP_002871955.1| beta-galactosidase 7 [Arabidopsis lyrata subsp. lyrata]
gi|297317792|gb|EFH48214.1| beta-galactosidase 7 [Arabidopsis lyrata subsp. lyrata]
Length = 826
Score = 1014 bits (2621), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 488/834 (58%), Positives = 607/834 (72%), Gaps = 52/834 (6%)
Query: 4 LKHCSRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDL 63
+KH +R + L IL T F+L+ + VSHD RAITI+G+R+ILLSGSIHYPRST MWPDL
Sbjct: 3 MKHFTRLLSLFFILITSFSLANSTIVSHDERAITINGKRRILLSGSIHYPRSTADMWPDL 62
Query: 64 IKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCA 123
I KAK+GGLDAIETYVFWNAHEP RR+YDF+GNLD++RFIKTIQD GLY +LRIGPYVCA
Sbjct: 63 INKAKDGGLDAIETYVFWNAHEPKRREYDFSGNLDVVRFIKTIQDAGLYSVLRIGPYVCA 122
Query: 124 EWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQI 183
EWNYGGFPVWLHNMP ++ RT N FMNEMQNFTT IV+M K+EKLFASQGGPIILAQI
Sbjct: 123 EWNYGGFPVWLHNMPNMK-FRTVNPSFMNEMQNFTTKIVEMMKEEKLFASQGGPIILAQI 181
Query: 184 ENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FT 232
ENEYGNV+S YG AGK+YI+WCA MA SLDIGVPW+MCQ+ +AP PM +
Sbjct: 182 ENEYGNVISSYGAAGKAYIDWCANMANSLDIGVPWLMCQQPNAPQPMLETCNGFYCDQYE 241
Query: 233 PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG 292
P NP++PK+WTENWTGWFK+WGGK P RTAEDLAF+VARFFQ GGTFQNYYMYHGGTNFG
Sbjct: 242 PTNPSTPKMWTENWTGWFKNWGGKHPYRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFG 301
Query: 293 RTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSV 352
R +GGPY+TTSYDY APIDE+G+LNQPKWGHL++LH++LKSMEK+LTYGN++ D GNS+
Sbjct: 302 RVAGGPYITTSYDYHAPIDEFGNLNQPKWGHLKQLHRVLKSMEKSLTYGNISRIDLGNSI 361
Query: 353 ----------------------------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQT 384
G Y++PAWSVS+LP+C E +NTAKVNTQT
Sbjct: 362 KATIYTTKEGSSCFIGNVNATANALVNFKGKDYHVPAWSVSVLPECDKEAYNTAKVNTQT 421
Query: 385 NVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMT 443
++ + ++ L+W WRPE +++ G L+DQK TND SDYLWYMT
Sbjct: 422 SIMTEDSSKP----EKLEWTWRPESAQKMILKSSGDLIAKGLVDQKDVTNDASDYLWYMT 477
Query: 444 NADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPV-KLTR 502
L DP+ S NMTLR++S+ VLHAYVNG YV +Q+ K G + FE+ V L
Sbjct: 478 RVHLDKKDPLW--SRNMTLRVHSNAHVLHAYVNGKYVGNQFVKDGKFDYRFEKKVNHLVH 535
Query: 503 GKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYG 562
G N ISLLS +VGLQNYG+ F+ P GI GPV LVG G+ETI KDLS H+W YK+GL G
Sbjct: 536 GTNHISLLSVSVGLQNYGAFFESGPTGINGPVSLVGYKGEETIEKDLSQHQWDYKIGLNG 595
Query: 563 LDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVN 622
++K F + + W+++ P +R +TWYK F+APL +PV+++ G+GKG AW+N
Sbjct: 596 YNNKLFSTKSVGHIK--WANEMFPTSRMLTWYKAKFKAPLGKEPVIVDFNGLGKGEAWIN 653
Query: 623 GYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIK-DGVNT 681
G ++GRYWP++ + +DGC E CDYRG YGSDKCA+ CG P+Q WYHVPRS++K G NT
Sbjct: 654 GQSIGRYWPSFNSSDDGCKDE-CDYRGEYGSDKCAFMCGEPTQRWYHVPRSFLKASGHNT 712
Query: 682 LVLFEEFGGNPSQINFQTVVVGTACGQAHENKTMELTCHGRRISEIKYASFGDPQGACGA 741
+ LFEE GGNPS +NF+TVVVGT C +AHE+ +EL+CH IS +K+ASFG+P G CG
Sbjct: 713 ITLFEEMGGNPSMVNFKTVVVGTVCARAHEHNKVELSCHNHPISAVKFASFGNPVGHCGT 772
Query: 742 FKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
F G+C+ + D + + K+CVGK +C+I S G+T + K+L VE C
Sbjct: 773 FAVGTCQGDKDAVKTVAKECVGKLNCTINVSSDTFGSTLDCGDSPKKLAVELEC 826
>gi|79517234|ref|NP_568399.4| beta-galactosidase 7 [Arabidopsis thaliana]
gi|152013363|sp|Q9SCV5.2|BGAL7_ARATH RecName: Full=Beta-galactosidase 7; Short=Lactase 7; Flags:
Precursor
gi|332005497|gb|AED92880.1| beta-galactosidase 7 [Arabidopsis thaliana]
Length = 826
Score = 1013 bits (2619), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 490/834 (58%), Positives = 607/834 (72%), Gaps = 52/834 (6%)
Query: 4 LKHCSRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDL 63
+KH +R + L IL T +L+ + VSHD RAITI+G+R+ILLSGSIHYPRST MWPDL
Sbjct: 3 MKHFTRLLSLFFILITSLSLAKSTIVSHDERAITINGKRRILLSGSIHYPRSTADMWPDL 62
Query: 64 IKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCA 123
I KAK+GGLDAIETYVFWNAHEP RR+YDF+GNLD++RFIKTIQD GLY +LRIGPYVCA
Sbjct: 63 INKAKDGGLDAIETYVFWNAHEPKRREYDFSGNLDVVRFIKTIQDAGLYSVLRIGPYVCA 122
Query: 124 EWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQI 183
EWNYGGFPVWLHNMP ++ RT N FMNEMQNFTT IV M K+EKLFASQGGPIILAQI
Sbjct: 123 EWNYGGFPVWLHNMPNMK-FRTVNPSFMNEMQNFTTKIVKMMKEEKLFASQGGPIILAQI 181
Query: 184 ENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FT 232
ENEYGNV+S YG GK+YI+WCA MA SLDIGVPW+MCQ+ +AP PM +
Sbjct: 182 ENEYGNVISSYGAEGKAYIDWCANMANSLDIGVPWLMCQQPNAPQPMLETCNGFYCDQYE 241
Query: 233 PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG 292
P NP++PK+WTENWTGWFK+WGGK P RTAEDLAF+VARFFQ GGTFQNYYMYHGGTNFG
Sbjct: 242 PTNPSTPKMWTENWTGWFKNWGGKHPYRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFG 301
Query: 293 RTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSV 352
R +GGPY+TTSYDY AP+DE+G+LNQPKWGHL++LH +LKSMEK+LTYGN++ D GNS+
Sbjct: 302 RVAGGPYITTSYDYHAPLDEFGNLNQPKWGHLKQLHTVLKSMEKSLTYGNISRIDLGNSI 361
Query: 353 ----------------------------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQT 384
G Y++PAWSVS+LPDC E +NTAKVNTQT
Sbjct: 362 KATIYTTKEGSSCFIGNVNATADALVNFKGKDYHVPAWSVSVLPDCDKEAYNTAKVNTQT 421
Query: 385 NVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMT 443
++ + ++ L+W WRPE +++G G L+DQK TND SDYLWYMT
Sbjct: 422 SIMTEDSSKP----ERLEWTWRPESAQKMILKGSGDLIAKGLVDQKDVTNDASDYLWYMT 477
Query: 444 NADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPV-KLTR 502
L DP+ S NMTLR++S+ VLHAYVNG YV +Q+ K G + FER V L
Sbjct: 478 RLHLDKKDPLW--SRNMTLRVHSNAHVLHAYVNGKYVGNQFVKDGKFDYRFERKVNHLVH 535
Query: 503 GKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYG 562
G N ISLLS +VGLQNYG F+ P GI GPV LVG G+ETI KDLS H+W YK+GL G
Sbjct: 536 GTNHISLLSVSVGLQNYGPFFESGPTGINGPVSLVGYKGEETIEKDLSQHQWDYKIGLNG 595
Query: 563 LDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVN 622
+DK F + K+ ++ W+++ +P R +TWYK F+APL +PV+++L G+GKG AW+N
Sbjct: 596 YNDKLF-SIKSVGHQK-WANEKLPTGRMLTWYKAKFKAPLGKEPVIVDLNGLGKGEAWIN 653
Query: 623 GYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIK-DGVNT 681
G ++GRYWP++ + +DGC E CDYRG YGSDKCA+ CG P+Q WYHVPRS++ G NT
Sbjct: 654 GQSIGRYWPSFNSSDDGCKDE-CDYRGAYGSDKCAFMCGKPTQRWYHVPRSFLNASGHNT 712
Query: 682 LVLFEEFGGNPSQINFQTVVVGTACGQAHENKTMELTCHGRRISEIKYASFGDPQGACGA 741
+ LFEE GGNPS +NF+TVVVGT C +AHE+ +EL+CH R IS +K+ASFG+P G CG+
Sbjct: 713 ITLFEEMGGNPSMVNFKTVVVGTVCARAHEHNKVELSCHNRPISAVKFASFGNPLGHCGS 772
Query: 742 FKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
F G+C+ + D + K+CVGK +C++ S G+T + K+L VE C
Sbjct: 773 FAVGTCQGDKDAAKTVAKECVGKLNCTVNVSSDTFGSTLDCGDSPKKLAVELEC 826
>gi|1352075|sp|P49676.1|BGAL_BRAOL RecName: Full=Beta-galactosidase; Short=Lactase; Flags: Precursor
gi|669059|emb|CAA59162.1| beta-galactosidase [Brassica oleracea]
Length = 828
Score = 1009 bits (2610), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 488/825 (59%), Positives = 595/825 (72%), Gaps = 49/825 (5%)
Query: 13 LCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGL 72
L LIL T F + + VSHD RAITIDG+R+ILLSGSIHYPRST MWPDLI KAK+GGL
Sbjct: 11 LFLILITSFGSANSTIVSHDERAITIDGQRRILLSGSIHYPRSTSDMWPDLISKAKDGGL 70
Query: 73 DAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPV 132
D IETYVFWNAHEP RRQYDF+GNLDL+RFIKTIQ GLY +LRIGPYVCAEWNYGGFPV
Sbjct: 71 DTIETYVFWNAHEPSRRQYDFSGNLDLVRFIKTIQSAGLYSVLRIGPYVCAEWNYGGFPV 130
Query: 133 WLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS 192
WLHNMP ++ RT N FMNEMQNFTT IV+M K+E LFASQGGPIILAQIENEYGNV+S
Sbjct: 131 WLHNMPDMK-FRTINPGFMNEMQNFTTKIVNMMKEESLFASQGGPIILAQIENEYGNVIS 189
Query: 193 DYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKI 241
YG GK+YI+WCA MA SLDIGVPWIMCQ+ AP PM + P+NP+SPK+
Sbjct: 190 SYGAEGKAYIDWCANMANSLDIGVPWIMCQQPHAPQPMIETCNGFYCDQYKPSNPSSPKM 249
Query: 242 WTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT 301
WTENWTGWFK+WGGK P RTAEDLAF+VARFFQ GGTFQNYYMYHGGTNFGR +GGPY+T
Sbjct: 250 WTENWTGWFKNWGGKHPYRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFGRVAGGPYIT 309
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS-------- 353
TSYDYDAP+DEYG+LNQPKWGHL++LH LLKSMEK LTYGN++ D GNSV+
Sbjct: 310 TSYDYDAPLDEYGNLNQPKWGHLKQLHTLLKSMEKPLTYGNISTIDLGNSVTATVYSTNE 369
Query: 354 --------------------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQ 393
G YN+PAWSVS+LPDC E +NTA+VNTQT++ +
Sbjct: 370 KSSCFIGNVNATADALVNFKGKDYNVPAWSVSVLPDCDKEAYNTARVNTQTSIITE---D 426
Query: 394 AGNDQAPLQWKWRPEMIND-FVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDD 451
+ ++ L+W WRPE +++G G L+DQK TND SDYLWYMT L D
Sbjct: 427 SCDEPEKLKWTWRPEFTTQKTILKGSGDLIAKGLVDQKDVTNDASDYLWYMTRVHLDKKD 486
Query: 452 PILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLS 511
PI S NM+LR++S+ VLHAYVNG YV +Q + + FE+ V L G N ++LLS
Sbjct: 487 PIW--SRNMSLRVHSNAHVLHAYVNGKYVGNQIVRDNKFDYRFEKKVNLVHGTNHLALLS 544
Query: 512 ATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNA 571
+VGLQNYG F+ P GI GPV LVG GDETI KDLS H+W YK+GL G + K F
Sbjct: 545 VSVGLQNYGPFFESGPTGINGPVKLVGYKGDETIEKDLSKHQWDYKIGLNGFNHKLFSMK 604
Query: 572 KAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWP 631
A + R WS++ +P +R ++WYK F+APL DPV+++L G+GKG W+NG ++GRYWP
Sbjct: 605 SAGHHHRKWSTEKLPADRMLSWYKANFKAPLGKDPVIVDLNGLGKGEVWINGQSIGRYWP 664
Query: 632 TYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKD-GVNTLVLFEEFGG 690
++ + ++GC TE CDYRG YGSDKCA+ CG P+Q WYHVPRS++ D G NT+ LFEE GG
Sbjct: 665 SFNSSDEGC-TEECDYRGEYGSDKCAFMCGKPTQRWYHVPRSFLNDKGHNTITLFEEMGG 723
Query: 691 NPSQINFQTVVVGTACGQAHENKTMELTCHGRRISEIKYASFGDPQGACGAFKKGSCEAE 750
+PS + F+TVV G C +AHE+ +EL+C+ R IS +K+ASFG+P G CG+F GSCE
Sbjct: 724 DPSMVKFKTVVTGRVCAKAHEHNKVELSCNNRPISAVKFASFGNPSGQCGSFAAGSCEGA 783
Query: 751 IDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
D + ++ K+CVGK +C++ S G+ + KRL VE C
Sbjct: 784 KDAVKVVAKECVGKLNCTMNVSSHKFGSNLDCGDSPKRLFVEVEC 828
>gi|297851602|ref|XP_002893682.1| Beta-galactosidase 15 precursor [Arabidopsis lyrata subsp. lyrata]
gi|297339524|gb|EFH69941.1| Beta-galactosidase 15 precursor [Arabidopsis lyrata subsp. lyrata]
Length = 780
Score = 1000 bits (2586), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 488/829 (58%), Positives = 602/829 (72%), Gaps = 93/829 (11%)
Query: 8 SRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKA 67
S LLC +L + + + A VSHDGRAITIDG R++LLSGSIHYPRST MWPDLIKK
Sbjct: 4 SLKFLLCCLLVS--SCAYATIVSHDGRAITIDGHRRVLLSGSIHYPRSTTEMWPDLIKKG 61
Query: 68 KEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNY 127
KEGGLDAIETYVFWNAHEP RRQYDF+GNLDLIRF+KTIQD+G+Y +LRIGPYVCAEWNY
Sbjct: 62 KEGGLDAIETYVFWNAHEPTRRQYDFSGNLDLIRFLKTIQDEGMYGVLRIGPYVCAEWNY 121
Query: 128 GGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEY 187
GGFPVWLHNMPG+E RTTN FMNEMQNFTT+IV+M KKEKLFASQGGPIILAQIENEY
Sbjct: 122 GGFPVWLHNMPGME-FRTTNTAFMNEMQNFTTMIVEMVKKEKLFASQGGPIILAQIENEY 180
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNP 236
GNV+ YG+AGK+YI WCA MA SLD+GVPWIMCQ+ DAP PM FTPNNP
Sbjct: 181 GNVIGSYGEAGKAYIKWCANMANSLDVGVPWIMCQQDDAPQPMLNTCNGYYCDNFTPNNP 240
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
N+PK+WTENWTGW+K+WGGKDP RT ED+AFAVARFFQ GGTFQNYYMYHGGTNF RT+G
Sbjct: 241 NTPKMWTENWTGWYKNWGGKDPHRTTEDVAFAVARFFQRGGTFQNYYMYHGGTNFDRTAG 300
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS--- 353
GPY+TT+YDYDAP+DE+G+LNQPK+GHL++LH +L +MEKTLTYGN++ D+GN V+
Sbjct: 301 GPYITTTYDYDAPLDEFGNLNQPKYGHLKQLHDVLHAMEKTLTYGNISTVDFGNLVTATV 360
Query: 354 -------------------------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKV 388
G+ Y++PAWSVSILPDCKTE +NTAK+NTQT+V V
Sbjct: 361 YKTEEGSSCFIGNVNETSDAKINFQGTFYDVPAWSVSILPDCKTETYNTAKINTQTSVMV 420
Query: 389 KRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQK-STNDVSDYLWYMTNADL 447
K+ N+A N+ + L+W WRPE I++ +++GKG + L DQK +ND SDYLWYMT ++
Sbjct: 421 KKANEAENEPSTLKWSWRPENIDNVLLKGKGESTMRQLFDQKVVSNDESDYLWYMTTVNI 480
Query: 448 KDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQI 507
K+ DP+ NM+LRINS+ VLHA+VNG ++ + + G + +FE+ K G N I
Sbjct: 481 KEQDPVW--GKNMSLRINSTAHVLHAFVNGQHIGNYRAENGKFHYVFEQDAKFNPGANVI 538
Query: 508 SLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKK 567
+LLS TVGL NYG+ F+ VP GI GPV ++GR GDETI+KDLS+HKW+YK GL G +++
Sbjct: 539 TLLSITVGLPNYGAFFENVPAGITGPVFIIGRNGDETIVKDLSTHKWSYKTGLSGFENQL 598
Query: 568 FYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLG 627
F ++ S WS APL ++PVV++L G+GKG AW+NG N+G
Sbjct: 599 F----SSESPSTWS------------------APLGSEPVVVDLLGLGKGTAWINGNNIG 636
Query: 628 RYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWI-KDGVNTLVLFE 686
RYWP +LA+ DGCS E YHVPRS++ DG NTLVLFE
Sbjct: 637 RYWPAFLADIDGCSAE------------------------YHVPRSFLNSDGDNTLVLFE 672
Query: 687 EFGGNPSQINFQTVVVGTACGQAHENKTMELTCHGRRISEIKYASFGDPQGACGAFKKGS 746
E GGNPS +NFQT+ VG C +E +EL+C+G+ IS IK+ASFG+P G CG+F+KG+
Sbjct: 673 EIGGNPSLVNFQTIGVGNVCANVYEKNVLELSCNGKPISSIKFASFGNPGGNCGSFEKGT 732
Query: 747 CEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
CEA D ++ ++CVGK+ CSI+ SE GA C G KRL VEA+C
Sbjct: 733 CEASNDAAAILTQECVGKEKCSIDVSEKKFGAADC-GGLAKRLAVEAIC 780
>gi|75169194|sp|Q9C6W4.1|BGL15_ARATH RecName: Full=Beta-galactosidase 15; Short=Lactase 15; Flags:
Precursor
gi|12597826|gb|AAG60136.1|AC074360_1 hypothetical protein [Arabidopsis thaliana]
Length = 779
Score = 991 bits (2562), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 481/829 (58%), Positives = 601/829 (72%), Gaps = 93/829 (11%)
Query: 8 SRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKA 67
S + +LC +L + + + A VSHDGRAITIDG R++LLSGSIHYPRST MWPDLIKK
Sbjct: 3 SLSFILCCVLVS--SCAYATIVSHDGRAITIDGHRRVLLSGSIHYPRSTTEMWPDLIKKG 60
Query: 68 KEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNY 127
KEG LDAIETYVFWNAHEP RRQYDF+GNLDLIRF+KTIQ++G+Y +LRIGPYVCAEWNY
Sbjct: 61 KEGSLDAIETYVFWNAHEPTRRQYDFSGNLDLIRFLKTIQNEGMYGVLRIGPYVCAEWNY 120
Query: 128 GGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEY 187
GGFPVWLHNMPG+E RTTN FMNEMQNFTT+IV+M KKEKLFASQGGPIILAQIENEY
Sbjct: 121 GGFPVWLHNMPGME-FRTTNTAFMNEMQNFTTMIVEMVKKEKLFASQGGPIILAQIENEY 179
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNP 236
GNV+ YG+AGK+YI WCA MA SLD+GVPWIMCQ+ DAP PM F+PNNP
Sbjct: 180 GNVIGSYGEAGKAYIQWCANMANSLDVGVPWIMCQQDDAPQPMLNTCNGYYCDNFSPNNP 239
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
N+PK+WTENWTGW+K+WGGKDP RT ED+AFAVARFFQ GTFQNYYMYHGGTNF RT+G
Sbjct: 240 NTPKMWTENWTGWYKNWGGKDPHRTTEDVAFAVARFFQKEGTFQNYYMYHGGTNFDRTAG 299
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS--- 353
GPY+TT+YDYDAP+DE+G+LNQPK+GHL++LH +L +MEKTLTYGN++ D+GN V+
Sbjct: 300 GPYITTTYDYDAPLDEFGNLNQPKYGHLKQLHDVLHAMEKTLTYGNISTVDFGNLVTATV 359
Query: 354 -------------------------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKV 388
G+SY++PAWSVSILPDCKTE +NTAK+NTQT+V V
Sbjct: 360 YQTEEGSSCFIGNVNETSDAKINFQGTSYDVPAWSVSILPDCKTETYNTAKINTQTSVMV 419
Query: 389 KRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQK-STNDVSDYLWYMTNADL 447
K+ N+A N+ + L+W WRPE I+ +++GKG + L DQK +ND SDYLWYMT +L
Sbjct: 420 KKANEAENEPSTLKWSWRPENIDSVLLKGKGESTMRQLFDQKVVSNDESDYLWYMTTVNL 479
Query: 448 KDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQI 507
K+ DP+L NM+LRINS+ VLHA+VNG ++ + + G + +FE+ K G N I
Sbjct: 480 KEQDPVL--GKNMSLRINSTAHVLHAFVNGQHIGNYRVENGKFHYVFEQDAKFNPGANVI 537
Query: 508 SLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKK 567
+LLS TVGL NYG+ F+ GI GPV ++GR GDETI+KDLS+HKW+YK GL G +++
Sbjct: 538 TLLSITVGLPNYGAFFENFSAGITGPVFIIGRNGDETIVKDLSTHKWSYKTGLSGFENQL 597
Query: 568 FYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLG 627
F ++ S WS APL ++PVV++L G+GKG AW+NG N+G
Sbjct: 598 F----SSESPSTWS------------------APLGSEPVVVDLLGLGKGTAWINGNNIG 635
Query: 628 RYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWI-KDGVNTLVLFE 686
RYWP +L++ DGCS E YHVPRS++ +G NTLVLFE
Sbjct: 636 RYWPAFLSDIDGCSAE------------------------YHVPRSFLNSEGDNTLVLFE 671
Query: 687 EFGGNPSQINFQTVVVGTACGQAHENKTMELTCHGRRISEIKYASFGDPQGACGAFKKGS 746
E GGNPS +NFQT+ VG+ C +E +EL+C+G+ IS IK+ASFG+P G CG+F+KG+
Sbjct: 672 EIGGNPSLVNFQTIGVGSVCANVYEKNVLELSCNGKPISAIKFASFGNPGGDCGSFEKGT 731
Query: 747 CEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
CEA + ++ ++CVGK+ CSI+ SE GA C A KRL VEA+C
Sbjct: 732 CEASNNAAAILTQECVGKEKCSIDVSEDKFGAAECGA-LAKRLAVEAIC 779
>gi|225441062|ref|XP_002284027.1| PREDICTED: beta-galactosidase-like [Vitis vinifera]
Length = 833
Score = 981 bits (2537), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 495/819 (60%), Positives = 594/819 (72%), Gaps = 61/819 (7%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A +++ D R I I+GERKIL+SGS+HYPRSTP MWPDLI+K+K+GGL+ I+TYVFW+ HE
Sbjct: 27 ADQITSDARGIMINGERKILISGSVHYPRSTPEMWPDLIQKSKDGGLNTIDTYVFWDLHE 86
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P RRQYDFTGN DL+RFIK IQ QGLY +LRIGPYVCAEW YGGFPVWLHN P I+ LRT
Sbjct: 87 PQRRQYDFTGNKDLVRFIKAIQAQGLYAVLRIGPYVCAEWTYGGFPVWLHNQPSIQ-LRT 145
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
N V+M+EMQ FTT+IVDM KKE+LFASQGGPII++QIENEYGNVM Y DAG YINWC
Sbjct: 146 NNTVYMSEMQTFTTMIVDMMKKEQLFASQGGPIIISQIENEYGNVMRAYHDAGVQYINWC 205
Query: 206 AKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWG 254
A+MA +LD GVPWIMCQ+ +AP PM FTPNNPNSPK+WTENW+GW+K+WG
Sbjct: 206 AQMAAALDTGVPWIMCQQDNAPQPMINTCNGYYCDQFTPNNPNSPKMWTENWSGWYKNWG 265
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
G DP RTAEDLAF+VARF+Q GGTFQNYYMYHGGTNFGRT+GGPY+TTSYDYDAP++EYG
Sbjct: 266 GSDPHRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLNEYG 325
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDY-------------------GNS---- 351
+ NQPKWGHLR+LH LL SMEK LTYG+V N DY GNS
Sbjct: 326 NKNQPKWGHLRDLHLLLLSMEKALTYGDVKNVDYETLTSATIYSYQGKSSCFFGNSNADR 385
Query: 352 -----VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
G +Y +PAWSVSILPDC E +NTAKVN+Q + VK+ ++A N+ LQW WR
Sbjct: 386 DVTINYGGVNYTIPAWSVSILPDCSNEVYNTAKVNSQYSTFVKKGSEAENEPNSLQWTWR 445
Query: 407 PEMINDFVVRGKGHFALNTLIDQKST-NDVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
E I G F + L+DQK+ D SDYL+YMT D+ +DDPI ++TL +N
Sbjct: 446 GETIQYIT---PGRFTASELLDQKTVAEDTSDYLYYMTTVDISNDDPIW--GKDLTLSVN 500
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
+SG +LHA+VNG ++ Q+ G F R V L GKN+I+LLSATVGL NYG FDM
Sbjct: 501 TSGHILHAFVNGEHIGYQYALLGQFEFQFRRSVTLQLGKNEITLLSATVGLTNYGPDFDM 560
Query: 526 VPNGIPGPVLLVGRAGDETIIKDLS-SHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
V GI GPV ++ G IIKDLS +++W YK GL G +DKK + +A ++ W S N
Sbjct: 561 VNQGIHGPVQIIASNGSADIIKDLSNNNQWAYKAGLNG-EDKKIFLGRARYNQ--WKSDN 617
Query: 585 VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTES 644
+P+NR WYK TF+AP DPVV++L G+GKG AWVNG++LGRYWP+Y+A +GCS E
Sbjct: 618 LPVNRSFVWYKATFDAPPGEDPVVVDLMGLGKGEAWVNGHSLGRYWPSYIARGEGCSPE- 676
Query: 645 CDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGT 704
CDYRGPY ++KC NCGNPSQ WYHVPRS++ N LVLFEEFGGNPS + FQTV VG
Sbjct: 677 CDYRGPYKAEKCNTNCGNPSQRWYHVPRSFLASTDNRLVLFEEFGGNPSSVTFQTVTVGN 736
Query: 705 ACGQAHENKTMELTCHGRRISEIKYASFGDPQGACG--------AFKKGSCEAEIDVLPL 756
AC A E T+EL+C GR IS IK+ASFGDPQG CG F+KG+CEA D L +
Sbjct: 737 ACANAREGYTLELSCQGRAISGIKFASFGDPQGTCGKPFATGSQVFEKGTCEAA-DSLSI 795
Query: 757 IEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
I+K CVGK SCSI+ SE LG C A T KRL VEA+C
Sbjct: 796 IQKLCVGKYSCSIDVSEQILGPAGCTADT-KRLAVEAIC 833
>gi|6686886|emb|CAB64743.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 788
Score = 981 bits (2536), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 473/798 (59%), Positives = 583/798 (73%), Gaps = 52/798 (6%)
Query: 40 GERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDL 99
G+R+ILLSGSIHYPRST MWPDLI KAK+GGLDAIETYVFWNAHEP RR+YDF+GNLD+
Sbjct: 1 GKRRILLSGSIHYPRSTADMWPDLINKAKDGGLDAIETYVFWNAHEPKRREYDFSGNLDV 60
Query: 100 IRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTT 159
+RFIKTIQD GLY +LRIGPYVCAEWNYGGFPVWLHNMP ++ RT N FMNEMQNFTT
Sbjct: 61 VRFIKTIQDAGLYSVLRIGPYVCAEWNYGGFPVWLHNMPNMK-FRTVNPSFMNEMQNFTT 119
Query: 160 LIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWI 219
IV M K+EKLFASQGGPIILAQIENEYGNV+S YG GK+YI+WCA MA SLDIGVPW+
Sbjct: 120 KIVKMMKEEKLFASQGGPIILAQIENEYGNVISSYGAEGKAYIDWCANMANSLDIGVPWL 179
Query: 220 MCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFA 268
MCQ+ +AP PM + P NP++PK+WTENWTGWFK+WGGK P RTAEDLAF+
Sbjct: 180 MCQQPNAPQPMLETCNGFYCDQYEPTNPSTPKMWTENWTGWFKNWGGKHPYRTAEDLAFS 239
Query: 269 VARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELH 328
VARFFQ GGTFQNYYMYHGGTNFGR +GGPY+TTSYDY AP+DE+G+LNQPKWGHL++LH
Sbjct: 240 VARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTSYDYHAPLDEFGNLNQPKWGHLKQLH 299
Query: 329 KLLKSMEKTLTYGNVTNTDYGNSV----------------------------SGSSYNLP 360
+LKSMEK+LTYGN++ D GNS+ G Y++P
Sbjct: 300 TVLKSMEKSLTYGNISRIDLGNSIKATIYTTKEGSSCFIGNVNATADALVNFKGKDYHVP 359
Query: 361 AWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGH 420
AWSVS+LPDC E +NTAKVNTQT++ + ++ L+W WRPE +++G G
Sbjct: 360 AWSVSVLPDCDKEAYNTAKVNTQTSIMTEDSSKP----ERLEWTWRPESAQKMILKGSGD 415
Query: 421 FALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNY 479
L+DQK TND SDYLWYMT L DP+ S NMTLR++S+ VLHAYVNG Y
Sbjct: 416 LIAKGLVDQKDVTNDASDYLWYMTRLHLDKKDPLW--SRNMTLRVHSNAHVLHAYVNGKY 473
Query: 480 VDSQWTKYGASNDLFERPV-KLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVG 538
V +Q+ K G + FER V L G N ISLLS +VGLQNYG F+ P GI GPV LVG
Sbjct: 474 VGNQFVKDGKFDYRFERKVNHLVHGTNHISLLSVSVGLQNYGPFFESGPTGINGPVSLVG 533
Query: 539 RAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTF 598
G+ETI KDLS H+W YK+GL G +DK F + K+ ++ W+++ +P R +TWYK F
Sbjct: 534 YKGEETIEKDLSQHQWDYKIGLNGYNDKLF-SIKSVGHQK-WANEKLPTGRMLTWYKAKF 591
Query: 599 EAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAY 658
+APL +PV+++L G+GKG AW+NG ++GRYWP++ + +DGC E CDYRG YGSDKCA+
Sbjct: 592 KAPLGKEPVIVDLNGLGKGEAWINGQSIGRYWPSFNSSDDGCKDE-CDYRGAYGSDKCAF 650
Query: 659 NCGNPSQIWYHVPRSWIK-DGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHENKTMEL 717
CG P+Q WYHVPRS++ G NT+ LFEE GGNPS +NF+TVVVGT C +AHE+ +EL
Sbjct: 651 MCGKPTQRWYHVPRSFLNASGHNTITLFEEMGGNPSMVNFKTVVVGTVCARAHEHNKVEL 710
Query: 718 TCHGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLG 777
+CH R IS +K+ASFG+P G CG+F G+C+ + D + K+CVGK +C++ S G
Sbjct: 711 SCHNRPISAVKFASFGNPLGHCGSFAVGTCQGDKDAAKTVAKECVGKLNCTVNVSSDTFG 770
Query: 778 ATSCAAGTVKRLVVEALC 795
+T + K+L VE C
Sbjct: 771 STLDCGDSPKKLAVELEC 788
>gi|297740029|emb|CBI30211.3| unnamed protein product [Vitis vinifera]
Length = 829
Score = 972 bits (2512), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 494/819 (60%), Positives = 592/819 (72%), Gaps = 65/819 (7%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A +++ D R I I+GERKIL+SGS+HYPRSTP MWPDLI+K+K+GGL+ I+TYVFW+ HE
Sbjct: 27 ADQITSDARGIMINGERKILISGSVHYPRSTPEMWPDLIQKSKDGGLNTIDTYVFWDLHE 86
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P RRQYDFTGN DL+RFIK IQ QGLY +LRIGPYVCAEW YGGFPVWLHN P I+ LRT
Sbjct: 87 PQRRQYDFTGNKDLVRFIKAIQAQGLYAVLRIGPYVCAEWTYGGFPVWLHNQPSIQ-LRT 145
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
N V+M+EMQ FTT+IVDM KKE+LFASQGGPII++QIENEYGNVM Y DAG YINWC
Sbjct: 146 NNTVYMSEMQTFTTMIVDMMKKEQLFASQGGPIIISQIENEYGNVMRAYHDAGVQYINWC 205
Query: 206 AKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWG 254
A+MA +LD GVPWIMCQ+ +AP PM FTPNNPNSPK+WTENW+GW+K+WG
Sbjct: 206 AQMAAALDTGVPWIMCQQDNAPQPMINTCNGYYCDQFTPNNPNSPKMWTENWSGWYKNWG 265
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
G DP RTAEDLAF+VARF+Q GGTFQNYYMYHGGTNFGRT+GGPY+TTSYDYDAP++EYG
Sbjct: 266 GSDPHRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLNEYG 325
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDY-------------------GNS---- 351
+ NQPKWGHLR+LH LL SMEK LTYG+V N DY GNS
Sbjct: 326 NKNQPKWGHLRDLHLLLLSMEKALTYGDVKNVDYETLTSATIYSYQGKSSCFFGNSNADR 385
Query: 352 -----VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
G +Y +PAWSVSILPDC E +NTAKVN+Q + VK+ ++A N+ LQW WR
Sbjct: 386 DVTINYGGVNYTIPAWSVSILPDCSNEVYNTAKVNSQYSTFVKKGSEAENEPNSLQWTWR 445
Query: 407 PEMINDFVVRGKGHFALNTLIDQKST-NDVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
E I G F + L+DQK+ D SDYL+YMT +DDPI ++TL +N
Sbjct: 446 GETIQYIT---PGRFTASELLDQKTVAEDTSDYLYYMTT----NDDPIW--GKDLTLSVN 496
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
+SG +LHA+VNG ++ Q+ G F R V L GKN+I+LLSATVGL NYG FDM
Sbjct: 497 TSGHILHAFVNGEHIGYQYALLGQFEFQFRRSVTLQLGKNEITLLSATVGLTNYGPDFDM 556
Query: 526 VPNGIPGPVLLVGRAGDETIIKDLS-SHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
V GI GPV ++ G IIKDLS +++W YK GL G +DKK + +A ++ W S N
Sbjct: 557 VNQGIHGPVQIIASNGSADIIKDLSNNNQWAYKAGLNG-EDKKIFLGRARYNQ--WKSDN 613
Query: 585 VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTES 644
+P+NR WYK TF+AP DPVV++L G+GKG AWVNG++LGRYWP+Y+A +GCS E
Sbjct: 614 LPVNRSFVWYKATFDAPPGEDPVVVDLMGLGKGEAWVNGHSLGRYWPSYIARGEGCSPE- 672
Query: 645 CDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGT 704
CDYRGPY ++KC NCGNPSQ WYHVPRS++ N LVLFEEFGGNPS + FQTV VG
Sbjct: 673 CDYRGPYKAEKCNTNCGNPSQRWYHVPRSFLASTDNRLVLFEEFGGNPSSVTFQTVTVGN 732
Query: 705 ACGQAHENKTMELTCHGRRISEIKYASFGDPQGACG--------AFKKGSCEAEIDVLPL 756
AC A E T+EL+C GR IS IK+ASFGDPQG CG F+KG+CEA D L +
Sbjct: 733 ACANAREGYTLELSCQGRAISGIKFASFGDPQGTCGKPFATGSQVFEKGTCEAA-DSLSI 791
Query: 757 IEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
I+K CVGK SCSI+ SE LG C A T KRL VEA+C
Sbjct: 792 IQKLCVGKYSCSIDVSEQILGPAGCTADT-KRLAVEAIC 829
>gi|22329897|ref|NP_683341.1| beta-galactosidase 15 [Arabidopsis thaliana]
gi|332193266|gb|AEE31387.1| beta-galactosidase 15 [Arabidopsis thaliana]
Length = 786
Score = 966 bits (2496), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 471/828 (56%), Positives = 588/828 (71%), Gaps = 107/828 (12%)
Query: 8 SRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKA 67
S + +LC +L + + + A VSHDGRAITIDG R++LLSGSIHYPRST MWPDLIKK
Sbjct: 26 SLSFILCCVLVS--SCAYATIVSHDGRAITIDGHRRVLLSGSIHYPRSTTEMWPDLIKKG 83
Query: 68 KEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNY 127
KEG LDAIETYVFWNAHEP RRQYDF+GNLDLIRF+KTIQ++G+Y +LRIGPYVCAEWNY
Sbjct: 84 KEGSLDAIETYVFWNAHEPTRRQYDFSGNLDLIRFLKTIQNEGMYGVLRIGPYVCAEWNY 143
Query: 128 GGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEY 187
GGFPVWLHNMPG+E RTTN FMNEMQNFTT+IV+M KKEKLFASQGGPIILAQIENEY
Sbjct: 144 GGFPVWLHNMPGME-FRTTNTAFMNEMQNFTTMIVEMVKKEKLFASQGGPIILAQIENEY 202
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNP 236
GNV+ YG+AGK+YI WCA MA SLD+GVPWIMCQ+ DAP PM F+PNNP
Sbjct: 203 GNVIGSYGEAGKAYIQWCANMANSLDVGVPWIMCQQDDAPQPMLNTCNGYYCDNFSPNNP 262
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
N+PK+WTENWTGW+K+WGGKDP RT ED+AFAVARFFQ GTFQNYYMYHGGTNF RT+G
Sbjct: 263 NTPKMWTENWTGWYKNWGGKDPHRTTEDVAFAVARFFQKEGTFQNYYMYHGGTNFDRTAG 322
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS--- 353
GPY+TT+YDYDAP+DE+G+LNQPK+GHL++LH +L +MEKTLTYGN++ D+GN V+
Sbjct: 323 GPYITTTYDYDAPLDEFGNLNQPKYGHLKQLHDVLHAMEKTLTYGNISTVDFGNLVTATV 382
Query: 354 -------------------------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKV 388
G+SY++PAWSVSILPDCKTE +NTAK+NTQT+V V
Sbjct: 383 YQTEEGSSCFIGNVNETSDAKINFQGTSYDVPAWSVSILPDCKTETYNTAKINTQTSVMV 442
Query: 389 KRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQK-STNDVSDYLWYMTNADL 447
K+ N+A N+ + L+W WRPE I+ +++GKG + L DQK +ND SDYLWYMT +L
Sbjct: 443 KKANEAENEPSTLKWSWRPENIDSVLLKGKGESTMRQLFDQKVVSNDESDYLWYMTTVNL 502
Query: 448 KDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQI 507
K+ DP+L NM+LRINS+ VLHA+VNG ++ + + G + +FE+ K G N I
Sbjct: 503 KEQDPVL--GKNMSLRINSTAHVLHAFVNGQHIGNYRVENGKFHYVFEQDAKFNPGANVI 560
Query: 508 SLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKK 567
+LLS TVGL NYG+ F+ GI GPV ++GR GDETI+KDLS+HKW+YK GL G +++
Sbjct: 561 TLLSITVGLPNYGAFFENFSAGITGPVFIIGRNGDETIVKDLSTHKWSYKTGLSGFENQL 620
Query: 568 FYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLG 627
F ++ S WS APL ++PVV++L G+GKG AW+NG N+G
Sbjct: 621 F----SSESPSTWS------------------APLGSEPVVVDLLGLGKGTAWINGNNIG 658
Query: 628 RYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEE 687
RYWP +L++ DG NTLVLFEE
Sbjct: 659 RYWPAFLSD---------------------------------------IDGDNTLVLFEE 679
Query: 688 FGGNPSQINFQTVVVGTACGQAHENKTMELTCHGRRISEIKYASFGDPQGACGAFKKGSC 747
GGNPS +NFQT+ VG+ C +E +EL+C+G+ IS IK+ASFG+P G CG+F+KG+C
Sbjct: 680 IGGNPSLVNFQTIGVGSVCANVYEKNVLELSCNGKPISAIKFASFGNPGGDCGSFEKGTC 739
Query: 748 EAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
EA + ++ ++CVGK+ CSI+ SE GA C A KRL VEA+C
Sbjct: 740 EASNNAAAILTQECVGKEKCSIDVSEDKFGAAECGA-LAKRLAVEAIC 786
>gi|356529081|ref|XP_003533125.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 832
Score = 939 bits (2428), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 460/833 (55%), Positives = 592/833 (71%), Gaps = 62/833 (7%)
Query: 13 LCLILQTLFNLSL-AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGG 71
L L+ TL NL++ A+ VS+D RAITIDG+RK+L SGSIHYPRST MWP LI KAKEGG
Sbjct: 5 LLLLSFTLVNLAINAFEVSYDSRAITIDGKRKVLFSGSIHYPRSTAEMWPSLINKAKEGG 64
Query: 72 LDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFP 131
LD IETYVFWNAHEP RQYDF+GNLDL++FIKTIQ +GLY +LRIGPYVCAEWNYGGFP
Sbjct: 65 LDVIETYVFWNAHEPQPRQYDFSGNLDLVKFIKTIQKEGLYAMLRIGPYVCAEWNYGGFP 124
Query: 132 VWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVM 191
VWLHNMP +E RT N +MNEMQ FTTLIVD + E LFASQGGPIILAQIENEYGN+M
Sbjct: 125 VWLHNMPNME-FRTNNTAYMNEMQTFTTLIVDKMRHENLFASQGGPIILAQIENEYGNIM 183
Query: 192 SDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPK 240
S+YG+ GK Y+ WCA++A S IGVPW+MCQ+SDAP P+ F+PN+ + PK
Sbjct: 184 SEYGENGKQYVQWCAQLAESYKIGVPWVMCQQSDAPDPIINTCNGWYCDQFSPNSKSKPK 243
Query: 241 IWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL 300
+WTENWTGWFK+WGG P RTA D+A+AVARFFQ+GGTFQNYYMYHGGTNFGRTSGGPY+
Sbjct: 244 MWTENWTGWFKNWGGPIPHRTARDVAYAVARFFQYGGTFQNYYMYHGGTNFGRTSGGPYI 303
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYN-- 358
TTSYDYDAP+DEYG+ NQPKWGHL++LH+LLKSME LT G +TDYGN ++ + YN
Sbjct: 304 TTSYDYDAPLDEYGNKNQPKWGHLKQLHELLKSMEDVLTQGTTNHTDYGNLLTATVYNYS 363
Query: 359 --------------------------LPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPN 392
+PAWSVSILP+C E +NTAK+N QT++ V + N
Sbjct: 364 GKSACFLGNANSSNDATIMFQSTQYIVPAWSVSILPNCVNEVYNTAKINAQTSIMVMKDN 423
Query: 393 QAGNDQAP---LQWKWRPE---MINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNA 445
++ N++ P L W+W E + D V G L+DQK TND SDYLWY+T+
Sbjct: 424 KSDNEEEPHSTLNWQWMHEPHVQMKDGQVLGSVSRKAAQLLDQKVVTNDTSDYLWYITSV 483
Query: 446 DLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKN 505
D+ ++DPI S +R++++G VLH +VNG Q+ + G + +E +KL +G N
Sbjct: 484 DISENDPIWS-----KIRVSTNGHVLHVFVNGAQAGYQYGQNGKYSFTYEAKIKLKKGTN 538
Query: 506 QISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDD 565
+ISLLS TVGL NYG+ F V G+ GPV LV + ++KD++++ W YKVGL+G +
Sbjct: 539 EISLLSGTVGLPNYGAHFSNVSVGVCGPVQLVALQNNTEVVKDITNNTWNYKVGLHG-EI 597
Query: 566 KKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYN 625
K Y + +GW++ +P NR WYKT F++P DPVV++L+G+ KG AWVNG N
Sbjct: 598 VKLY---CPENNKGWNTNGLPTNRVFVWYKTLFKSPKGTDPVVVDLKGLKKGQAWVNGNN 654
Query: 626 LGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIK-DGVNTLVL 684
+GRYW YLA+++GC T +C+YRGPY SDKC CG P+Q WYHVPRS+++ D NTLVL
Sbjct: 655 IGRYWTRYLADDNGC-TATCNYRGPYSSDKCITKCGRPTQRWYHVPRSFLRQDNQNTLVL 713
Query: 685 FEEFGGNPSQINFQTVVVGTACGQAHENKTMELTCHGRR-ISEIKYASFGDPQGACGAFK 743
FEEFGG+P+++ F TV+V C ++E +EL+C + IS+IK+ASFG P+G CG+FK
Sbjct: 714 FEEFGGHPNEVKFATVMVEKICANSYEGNVLELSCREEQVISKIKFASFGVPEGECGSFK 773
Query: 744 KGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSC-AAGTVKRLVVEALC 795
K CE+ + L ++ K C+GK+SCS++ S+ LG T C +L +EA+C
Sbjct: 774 KSQCESP-NALSILSKSCLGKQSCSVQVSQRMLGPTGCRMPQNQNKLAIEAVC 825
>gi|356522904|ref|XP_003530082.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 923
Score = 935 bits (2417), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 467/840 (55%), Positives = 592/840 (70%), Gaps = 65/840 (7%)
Query: 8 SRAILLCLILQTLFNLSL-AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKK 66
S + LLCL +L ++++ A VS+D RA+TIDG+R+IL S SIHYPRSTP MWP LI+K
Sbjct: 9 SASFLLCL---SLISIAINALEVSYDERALTIDGKRRILFSASIHYPRSTPEMWPYLIRK 65
Query: 67 AKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWN 126
AKEGGLD IETYVFWNAHEP RRQY+F+ NLDL+RFI+TIQ +GLY ++RIGPY+ +EWN
Sbjct: 66 AKEGGLDVIETYVFWNAHEPQRRQYEFSENLDLVRFIRTIQKEGLYAMIRIGPYISSEWN 125
Query: 127 YGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENE 186
YGG PVWLHN+P +E RT N+ FM EM+ FTT IVDM + E LFA QGGPII+AQIENE
Sbjct: 126 YGGLPVWLHNIPNME-FRTHNRAFMEEMKTFTTKIVDMMQDETLFAVQGGPIIIAQIENE 184
Query: 187 YGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNN 235
YGNVM YG+ G Y+ WCA++A S + GVPW+M Q+S+AP M F PN+
Sbjct: 185 YGNVMHAYGNNGTQYLKWCAQLADSFETGVPWVMSQQSNAPQFMIDSCDGYYCDQFQPND 244
Query: 236 PNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTS 295
+ PKIWTENWTG +K+WG ++P R AED+A+AVARFFQFGGTFQNYYMYHGGTNF RT+
Sbjct: 245 NHKPKIWTENWTGGYKNWGTQNPHRPAEDVAYAVARFFQFGGTFQNYYMYHGGTNFKRTA 304
Query: 296 GGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGS 355
GGPY+TTSYDYDAP+DEYG+LNQPKWGHLR+LH LLKS E LT G+ NTDYGN V+ +
Sbjct: 305 GGPYVTTSYDYDAPLDEYGNLNQPKWGHLRQLHNLLKSKENILTQGSSQNTDYGNMVTAT 364
Query: 356 ----------------------------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK 387
Y +PAWSVSILP+C +E +NTAKVNTQT +
Sbjct: 365 VYTYDGKSTCFIGNAHQSKDATINFRNNEYTIPAWSVSILPNCSSEAYNTAKVNTQTTIM 424
Query: 388 VKRPNQAGNDQAPLQWKWRPE---MINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMT 443
VK+ N+ + + L+W+WR E + D + G L+DQK TND SDYLWY+T
Sbjct: 425 VKKDNE--DLEYALRWQWRQEPFVQMKDGQITGIIDLTAPKLLDQKVVTNDFSDYLWYIT 482
Query: 444 NADLK-DDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTR 502
+ D+K DDDP S + LR+++SG VLH +VNG +V +Q K G + E +KLT
Sbjct: 483 SIDIKGDDDP--SWTKEFRLRVHTSGHVLHVFVNGKHVGTQHAKNGQFKFVHESKIKLTT 540
Query: 503 GKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGD-----ETIIKDLSSHKWTYK 557
GKN+ISLLS TVGL NYG FD + G+ GPV LV GD + I+KDLS ++W+YK
Sbjct: 541 GKNEISLLSTTVGLPNYGPFFDNIEVGVLGPVQLVAAVGDYDYDDDEIVKDLSKNQWSYK 600
Query: 558 VGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKG 617
VGL+G + + NS + W + VP +R + WYKTTF++P+ +DPVV++L G+GKG
Sbjct: 601 VGLHGEHEMHY---SYENSLKTWYTDAVPTDRILVWYKTTFKSPIGDDPVVVDLSGLGKG 657
Query: 618 FAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKD 677
AWVNG ++GRYW +YLA+E+GCS + CDYRGPY S+KC C PSQ WYHVPRS+++D
Sbjct: 658 HAWVNGNSIGRYWSSYLADENGCSPK-CDYRGPYTSNKCLSMCAQPSQRWYHVPRSFLRD 716
Query: 678 G-VNTLVLFEEFGGNPSQINFQTVVVGTACGQAHENKTMELTCHGRR-ISEIKYASFGDP 735
NTLVLFEE GG P +NF TV VG C A+E T+EL C+ + ISEIK+ASFG P
Sbjct: 717 NDQNTLVLFEELGGQPYYVNFLTVTVGKVCANAYEGNTLELACNKNQVISEIKFASFGLP 776
Query: 736 QGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+G CG+F+KG+CE+ + L I+ QC+GK CSI+ SE LG T C +RL VEA+C
Sbjct: 777 KGECGSFQKGNCESS-EALSAIKAQCIGKDKCSIQVSERTLGPTRCRVAEDRRLAVEAVC 835
>gi|356502277|ref|XP_003519946.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 835
Score = 934 bits (2413), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 461/815 (56%), Positives = 572/815 (70%), Gaps = 59/815 (7%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A VS+DGRAITIDG+RKIL SGSIHYPRST MWP LI+K+KEGGLD IETYVFWN HE
Sbjct: 24 AIDVSYDGRAITIDGKRKILFSGSIHYPRSTAEMWPSLIEKSKEGGLDVIETYVFWNVHE 83
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P QYDF+GNLDL+RFIKTIQ+QGLY +LRIGPYVCAEWNYGGFPVWLHN+P IE RT
Sbjct: 84 PHPGQYDFSGNLDLVRFIKTIQNQGLYAVLRIGPYVCAEWNYGGFPVWLHNIPNIE-FRT 142
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
N +F +EM+ FTTLIVDM + EKLFASQGGPIILAQIENEYGN+M YG GK Y+ WC
Sbjct: 143 NNAIFEDEMKKFTTLIVDMMRHEKLFASQGGPIILAQIENEYGNIMGSYGQNGKEYVQWC 202
Query: 206 AKMATSLDIGVPWIMCQESDAPSPMFT-----------PNNPNSPKIWTENWTGWFKSWG 254
A++A S IGVPWIMCQ+SDAP P+ PN+ N PK+WTE+WTGWF WG
Sbjct: 203 AQLAQSYQIGVPWIMCQQSDAPDPLINTCNGFYCDQWHPNSNNKPKMWTEDWTGWFMHWG 262
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
G P RTAED+AFAV RFFQ+GGTFQNYYMYHGGTNFGRTSGGPY+TTSYDYDAP++EYG
Sbjct: 263 GPTPHRTAEDVAFAVGRFFQYGGTFQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLNEYG 322
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGS------------------- 355
LNQPKWGHL+ LH++LKS+E TLT G+ N DYGN ++ +
Sbjct: 323 DLNQPKWGHLKRLHEVLKSVETTLTMGSSRNIDYGNQMTATIFSYAGQSVCFLGNAHPSM 382
Query: 356 ---------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
Y +PAWSVSILPDC TE +NTAKVN QT++ + L W+W
Sbjct: 383 DANINFQNTQYTIPAWSVSILPDCYTEVYNTAKVNAQTSIMTIN----NENSYALDWQWM 438
Query: 407 PE----MINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTL 462
PE + D V G L+DQK ND SDYLWY+T+ D+K DPILS ++ +
Sbjct: 439 PETHLEQMKDGKVLGSVAITAPRLLDQKVANDTSDYLWYITSVDVKQGDPILS--HDLKI 496
Query: 463 RINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSK 522
R+N+ G VLH +VNG ++ SQ+ YG FE +KL GKN+ISL+S TVGL NYG+
Sbjct: 497 RVNTKGHVLHVFVNGAHIGSQYATYGKYTFTFEADIKLKLGKNEISLVSGTVGLPNYGAY 556
Query: 523 FDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSS 582
FD + G+ G L+ G E + KD+S++ W YKVG++G ++ K Y+ + E W +
Sbjct: 557 FDNIHVGVTGVQLVSQNDGSE-VTKDISTNVWHYKVGMHG-ENVKLYSPSRSTEE--WFT 612
Query: 583 KNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCST 642
+ ++ WYKTTF P+ D VVL+L+G+GKG AWVNG N+GRYW +YLA EDGCS+
Sbjct: 613 NGLQAHKIFMWYKTTFRTPVGTDSVVLDLKGLGKGQAWVNGNNIGRYWVSYLAGEDGCSS 672
Query: 643 ESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGV-NTLVLFEEFGGNPSQINFQTVV 701
+CDYRG Y S+KC NCGNP+Q WYHVP S+++DG+ NTLV+FEE GGNP Q+ TV
Sbjct: 673 -TCDYRGTYRSNKCTTNCGNPTQRWYHVPDSFLRDGLDNTLVVFEEQGGNPFQVKIATVT 731
Query: 702 VGTACGQAHENKTMELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQ 760
+ AC +A+E +EL C + ISEIK+ASFG P+G CG+FKKG CE+ D L ++++
Sbjct: 732 IAKACAKAYEGHELELACKENQVISEIKFASFGVPEGECGSFKKGHCESS-DTLSIVKRL 790
Query: 761 CVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
C+GK+ CSI+ +E LG T C RL ++ALC
Sbjct: 791 CLGKQQCSIQVNEKMLGPTGCRVPE-NRLAIDALC 824
>gi|356522906|ref|XP_003530083.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 846
Score = 932 bits (2408), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 466/844 (55%), Positives = 589/844 (69%), Gaps = 61/844 (7%)
Query: 3 TLKHCSRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPD 62
T + CS + + L L + A VS+D RA+TIDG+R+IL SGSIHYPRSTP MWP
Sbjct: 2 TGRKCSLSAMFLLCLSLISIAINALEVSYDERALTIDGKRRILFSGSIHYPRSTPEMWPY 61
Query: 63 LIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVC 122
LI+KAKEGGLD IETYVFWNAHEP RRQYDF+ NLDL+RFI+TIQ +GLY ++RIGPY+
Sbjct: 62 LIRKAKEGGLDVIETYVFWNAHEPQRRQYDFSENLDLVRFIRTIQKEGLYAMIRIGPYIS 121
Query: 123 AEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQ 182
+EWNYGG PVWLHN+P +E RT N+ FM EM+ FT IVDM + E LFA QGGPII+AQ
Sbjct: 122 SEWNYGGLPVWLHNIPNME-FRTHNRAFMEEMKTFTRKIVDMMQDETLFAVQGGPIIIAQ 180
Query: 183 IENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------F 231
IENEYGNVM YG+ G Y+ WCA++A S + GVPW+M Q+S+AP M F
Sbjct: 181 IENEYGNVMHAYGNNGTQYLKWCAQLADSFETGVPWVMSQQSNAPQFMIDSCDGYYCDQF 240
Query: 232 TPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNF 291
PN+ + PKIWTENWTG +K+WG ++P R AED+A+AVARFFQFGGTFQNYYMYHGGTNF
Sbjct: 241 QPNDNHKPKIWTENWTGGYKNWGTQNPHRPAEDVAYAVARFFQFGGTFQNYYMYHGGTNF 300
Query: 292 GRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNS 351
RT+GGPY+TTSYDYDAP+DEYG+LNQPKWGHLR+LH LLKS E LT G+ +TDYGN
Sbjct: 301 KRTAGGPYVTTSYDYDAPLDEYGNLNQPKWGHLRQLHNLLKSKENILTQGSSQHTDYGNM 360
Query: 352 VSGS----------------------------SYNLPAWSVSILPDCKTEEFNTAKVNTQ 383
V+ + Y +PAWSVSILP+C +E +NTAKVNTQ
Sbjct: 361 VTATVYTYDGKSTCFIGNAHQSKDATINFRNNEYTIPAWSVSILPNCSSEAYNTAKVNTQ 420
Query: 384 TNVKVKRPNQAGNDQAPLQWKWRPE---MINDFVVRGKGHFALNTLIDQKS-TNDVSDYL 439
T + VK+ N+ + + L+W+WR E + D + G L+DQK TND SDYL
Sbjct: 421 TTIMVKKDNE--DLEYALRWQWRQEPFVQMKDGQITGIIDLTAPKLLDQKVVTNDFSDYL 478
Query: 440 WYMTNADLK-DDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPV 498
WY+T+ D+K DDDP S + LR+++SG VLH +VNG +V +Q K G + E +
Sbjct: 479 WYITSIDIKGDDDP--SWTKEFRLRVHTSGHVLHVFVNGKHVGTQHAKNGQFKFVHESKI 536
Query: 499 KLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGD-----ETIIKDLSSHK 553
KLT GKN+ISLLS TVGL NYG FD + G+ GPV LV GD + I+KDLS ++
Sbjct: 537 KLTTGKNEISLLSTTVGLPNYGPFFDNIEVGVLGPVQLVAAVGDYDYDDDEIVKDLSKNQ 596
Query: 554 WTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQG 613
W+YKVGL+G + + NS + W + VP +R + WYKTTF++P+ +DPVV++L G
Sbjct: 597 WSYKVGLHGEHEMHY---SYENSLKTWYTDAVPTDRILVWYKTTFKSPIGDDPVVVDLSG 653
Query: 614 MGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRS 673
+GKG AWVNG ++GRYW +YLA+E+GCS + CDYRGPY S+KC C PSQ WYHVPRS
Sbjct: 654 LGKGHAWVNGNSIGRYWSSYLADENGCSPK-CDYRGPYTSNKCLSMCAQPSQRWYHVPRS 712
Query: 674 WIK-DGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHENKTMELTCHGRR-ISEIKYAS 731
+++ D NTLVLFEE GG P +NF TV VG C A+E T+EL C+ + ISEIK+AS
Sbjct: 713 FLRDDDQNTLVLFEELGGQPYYVNFLTVTVGKVCANAYEGNTLELACNKNQVISEIKFAS 772
Query: 732 FGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVV 791
FG P+G CG+F+KG+CE+ + L I+ QC+GK CSI+ SE LG T C +RL V
Sbjct: 773 FGLPKGECGSFQKGNCESS-EALSAIKAQCIGKDKCSIQVSERALGPTRCRVAEDRRLAV 831
Query: 792 EALC 795
EA+C
Sbjct: 832 EAVC 835
>gi|356502275|ref|XP_003519945.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 835
Score = 930 bits (2404), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 463/841 (55%), Positives = 583/841 (69%), Gaps = 63/841 (7%)
Query: 1 MATLKHCSRAILLCLILQTLFNLSL-AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGM 59
M + + +LLC L ++++ A VS+DGRAITIDG+RKIL SGSIHYPRST M
Sbjct: 1 MGKMGSITTLLLLC---SALISIAIEAIDVSYDGRAITIDGKRKILFSGSIHYPRSTAEM 57
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
WP LI+K+KEGGLD IETYVFWN HEP QYDF+GNLDL+RFIKTIQ+QGL+ +LRIGP
Sbjct: 58 WPSLIEKSKEGGLDVIETYVFWNVHEPHPGQYDFSGNLDLVRFIKTIQNQGLHAVLRIGP 117
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
YVCAEWNYGGFPVWLHN+P IE RT N +F +EM+ FTTLIVDM + EKLFASQGGPII
Sbjct: 118 YVCAEWNYGGFPVWLHNIPNIE-FRTNNAIFEDEMKKFTTLIVDMMRHEKLFASQGGPII 176
Query: 180 LAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFT------- 232
LAQIENEYGN+M YG GK Y+ WCA++A S IGVPWIMCQ+SD P P+
Sbjct: 177 LAQIENEYGNIMGSYGQNGKEYVQWCAQLAQSYQIGVPWIMCQQSDTPDPLINTCNGFYC 236
Query: 233 ----PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGG 288
PN+ N PK+WTE+WTGWF WGG P RTAED+AFAV RFFQ+GGTFQNYYMYHGG
Sbjct: 237 DQWHPNSNNKPKMWTEDWTGWFMHWGGPTPHRTAEDVAFAVGRFFQYGGTFQNYYMYHGG 296
Query: 289 TNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDY 348
TNFGRTSGGPY+TTSYDYDAP++EYG LNQPKWGHL+ LH++LKS+E TLT G+ N DY
Sbjct: 297 TNFGRTSGGPYITTSYDYDAPLNEYGDLNQPKWGHLKRLHEVLKSVETTLTMGSSRNIDY 356
Query: 349 GNSVSGS----------------------------SYNLPAWSVSILPDCKTEEFNTAKV 380
GN ++ + Y +PAWSVSILPDC TE +NTAKV
Sbjct: 357 GNQMTATIFSYAGQSVCFLGNAHPSMDANINFQNTQYTIPAWSVSILPDCYTEVYNTAKV 416
Query: 381 NTQTNVKVKRPNQAGNDQAPLQWKWRPE----MINDFVVRGKGHFALNTLIDQKSTNDVS 436
N QT++ + L W+W PE + D V G L+DQK ND S
Sbjct: 417 NAQTSIMTIN----NENSYALDWQWMPETHLEQMKDGKVLGSVAITAPRLLDQKVANDTS 472
Query: 437 DYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFER 496
DYLWY+T+ D+K DPILS ++ +R+N+ G VLH +VNG ++ SQ+ YG FE
Sbjct: 473 DYLWYITSVDVKQGDPILS--HDLKIRVNTKGHVLHVFVNGAHIGSQYATYGKYPFTFEA 530
Query: 497 PVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTY 556
+KL GKN+ISL+S TVGL NYG+ FD + G+ G L+ G E + KD+S++ W Y
Sbjct: 531 DIKLKLGKNEISLVSGTVGLPNYGAYFDNIHVGVTGVQLVSQNDGSE-VTKDISTNVWHY 589
Query: 557 KVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGK 616
KVG++G ++ K Y+ ++ E W + + ++ WYKTTF P+ D VVL+L+G+GK
Sbjct: 590 KVGMHG-ENVKLYSPSRSSEE--WFTNGLQAHKIFMWYKTTFRTPVGTDSVVLDLKGLGK 646
Query: 617 GFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIK 676
G AWVNG N+GRYW +YLA EDGCS+ +CDYRG Y S+KC NCGNP+Q WYHVP S+++
Sbjct: 647 GQAWVNGNNIGRYWVSYLAGEDGCSS-TCDYRGTYRSNKCTTNCGNPTQRWYHVPDSFLR 705
Query: 677 DGV-NTLVLFEEFGGNPSQINFQTVVVGTACGQAHENKTMELTC-HGRRISEIKYASFGD 734
DG+ NTLV+FEE GGNP Q+ TV + AC +A+E +EL C + ISEI++ASFG
Sbjct: 706 DGLDNTLVVFEEQGGNPFQVKIATVTIAKACAKAYEGHELELACKENQVISEIRFASFGV 765
Query: 735 PQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEAL 794
P+G CG+FKKG CE+ D L ++++ C+GK+ CSI +E LG T C RL ++AL
Sbjct: 766 PEGECGSFKKGHCESS-DTLSIVKRLCLGKQQCSIHVNEKMLGPTGCRVPE-NRLAIDAL 823
Query: 795 C 795
C
Sbjct: 824 C 824
>gi|449529435|ref|XP_004171705.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 826
Score = 914 bits (2361), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 454/813 (55%), Positives = 581/813 (71%), Gaps = 62/813 (7%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D AI I+GER+I+ SGSIHYPRST MWPDLI+KAK+GGLDAIETY+FW+ HEP R
Sbjct: 27 VSYDSNAIIINGERRIIFSGSIHYPRSTEEMWPDLIQKAKDGGLDAIETYIFWDRHEPHR 86
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
R+YDF+G+L+ I++ + IQ+ GLYV++RIGPYVCAEWNYGGFP+WLHNMPGI+ LRT N+
Sbjct: 87 RKYDFSGHLNFIKYFQLIQEAGLYVVMRIGPYVCAEWNYGGFPLWLHNMPGIQ-LRTNNQ 145
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
V+ NEMQ FTT IV+M K+ LFASQGGPIILAQIENEYGNVM+ YG+AGK+YINWCA+M
Sbjct: 146 VYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGEAGKTYINWCAQM 205
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A SL+IG+PWIMCQ+SDAP P+ FTPNNPNSPK++TENW GWFK WG KD
Sbjct: 206 AESLNIGIPWIMCQQSDAPQPIINTCNGFYCDNFTPNNPNSPKMFTENWVGWFKKWGDKD 265
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P RTAED+AF+VARFFQ GG NYYMYHGGTNFGRTSGGP++TTSYDYDAP+DEYG+LN
Sbjct: 266 PHRTAEDVAFSVARFFQSGGILNNYYMYHGGTNFGRTSGGPFITTSYDYDAPLDEYGNLN 325
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSS--------------------- 356
QPKWGHL++LH +K EK LT ++ D+G+SV+ +
Sbjct: 326 QPKWGHLKQLHASIKLGEKILTNSTRSDQDFGSSVTFTKFSNLETGEKFCFLSNADENND 385
Query: 357 ----------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
Y LPAWSVSIL C E FNTAKV++QT++ K+ N+ N A L W W
Sbjct: 386 AIVDMLGDRKYFLPAWSVSILDGCNKEIFNTAKVSSQTSLFFKKQNEKEN--AKLSWNWA 443
Query: 407 PEMINDFVVRGKGHFALNTLIDQK-STNDVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
E + D ++G G F N L++QK +T D SDYLWYMTN + S N+TL++N
Sbjct: 444 SEPMRD-TLQGYGTFKANLLLEQKGATIDSSDYLWYMTNVN----SNTTSSLQNLTLQVN 498
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
+ G VLHA++N Y+ SQW G S +FE+P++L G N I+LLSATVGL+NY + +D
Sbjct: 499 TKGHVLHAFINRRYIGSQWGSNGQS-FVFEKPIQLKLGTNTITLLSATVGLKNYDAFYDT 557
Query: 526 VPNGIPG-PVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
VP GI G P+ L+G D + DLSS+ W+YKVGL G + K+ YN +N + WS+ N
Sbjct: 558 VPTGIDGGPIYLIG---DGNVTTDLSSNLWSYKVGLNG-ERKQLYNPMFSNRTK-WSTLN 612
Query: 585 V-PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTE 643
+ RRMTW+K TF+ P DPVVL++QGMGKG AWVNG ++GR+WP+++A D CS E
Sbjct: 613 KKSIGRRMTWFKATFKTPSGTDPVVLDMQGMGKGQAWVNGRSIGRFWPSFIASNDSCS-E 671
Query: 644 SCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVG 703
+CDY+G Y +KC NCGN SQ WYH+PRS++ D +NTL+LFEE GGNP ++ QT+ +G
Sbjct: 672 TCDYKGSYNPNKCVRNCGNSSQRWYHIPRSFMNDSINTLILFEEIGGNPQMVSVQTITIG 731
Query: 704 TACGQAHENKTMELTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCV 762
T CG A+E T+EL+C G ISEI++AS+G P+G CG+F+ G + ++EK C+
Sbjct: 732 TICGNANEGSTLELSCQGGHVISEIQFASYGHPEGKCGSFQSGLWDVTKSTTIIVEKACI 791
Query: 763 GKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G K+CSI+ S NL S A +L V+ALC
Sbjct: 792 GMKNCSIDIS-PNLFKLSKVAYPYAKLAVQALC 823
>gi|449442765|ref|XP_004139151.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 7-like [Cucumis
sativus]
Length = 803
Score = 909 bits (2350), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 454/813 (55%), Positives = 581/813 (71%), Gaps = 63/813 (7%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D AI I+GER+++ SGSIHYPRST MWPDLI+KAK+GGLDAIETY+FW+ HEP R
Sbjct: 5 VSYDSNAIIINGERRVIFSGSIHYPRSTDAMWPDLIQKAKDGGLDAIETYIFWDRHEPQR 64
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
++YDF+G+L+ I+F + +QD GLY+++RIGPYVCAEWNYGGFP+WLHNMPGI+ LRT N+
Sbjct: 65 QKYDFSGHLNFIKFFQLVQDAGLYIVMRIGPYVCAEWNYGGFPLWLHNMPGIQ-LRTDNQ 123
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
V+ NEM FTT IV+M K+ LFASQGGPIILAQIENEYGNVM+ YG+AGK+YINWCA+M
Sbjct: 124 VYKNEMLTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGNAGKAYINWCAQM 183
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A S +IGVPWIMCQ+SDAP P+ F+PNNP SPK++TENW GWFK WG KD
Sbjct: 184 AESFNIGVPWIMCQQSDAPQPIINTCNGFYCDSFSPNNPKSPKMFTENWVGWFKKWGDKD 243
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R+AED+AF+VARFFQ GG F NYYMYHGGTNFGRTSGGP++TTSYDY+AP+DEYG+LN
Sbjct: 244 PYRSAEDVAFSVARFFQSGGVFNNYYMYHGGTNFGRTSGGPFITTSYDYNAPLDEYGNLN 303
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSS--------------------- 356
QPKWGHL++LH +K EK LT G +N +G+ V+ +
Sbjct: 304 QPKWGHLKQLHSSIKLGEKILTNGTHSNKTFGSFVTFKTFGSFVTLTKFSNPTTKERFCF 363
Query: 357 ----------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
Y +PAWSVSI+ CK E FNTAK+N+QT++ VK N+ N L W W
Sbjct: 364 LSNTXKADGKYFVPAWSVSIIDGCKKEVFNTAKINSQTSIFVKVQNEKEN--VKLSWVWA 421
Query: 407 PEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
PE ++D ++GKG F N L++QK T D SDYLWYMTN + I N+TL++N
Sbjct: 422 PEAMSD-TLQGKGTFKENLLLEQKGTTIDSSDYLWYMTNVETNGTSSI----HNVTLQVN 476
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
+ G VLHA+VN Y+ SQW G S +FE+P+ L G N I+LLSATVGL+NY + +D
Sbjct: 477 TKGHVLHAFVNTRYIGSQWGNNGQS-FVFEKPILLKAGTNIITLLSATVGLKNYDAFYDT 535
Query: 526 VPNGIPG-PVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
+P GI G P+ L+G D + DLSS+ W+YKVGL G + K+ YN + E W++ N
Sbjct: 536 LPTGIDGGPIYLIG---DGNVKIDLSSNLWSYKVGLNG-EIKQLYNP-VFSQETSWNTLN 590
Query: 585 V-PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTE 643
+ RRMTWYKT+F+ P DPV L++QGMGKG AW+NG ++GR+WP+++A D CS E
Sbjct: 591 KNSIGRRMTWYKTSFKTPSGIDPVTLDMQGMGKGEAWINGQSIGRFWPSFIAGNDNCS-E 649
Query: 644 SCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVG 703
+CDYRG Y KC NCGNPSQ WYH+PRS++ + NTLVLFEE GG+P Q++ QT+ +G
Sbjct: 650 TCDYRGAYDPSKCVGNCGNPSQRWYHIPRSFLSNNTNTLVLFEEIGGSPQQVSVQTITIG 709
Query: 704 TACGQAHENKTMELTCHGRR-ISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCV 762
T CG A+E T+EL+C G ISEI++AS+G+P+G CG+FK+GS + L L+EK C
Sbjct: 710 TICGNANEGSTLELSCQGEYIISEIQFASYGNPKGKCGSFKQGSWDVTNSAL-LLEKTCK 768
Query: 763 GKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G KSCS++ S A L A RLVV+ALC
Sbjct: 769 GMKSCSVDVS-AKLFGLGDAVNLSARLVVQALC 800
>gi|449436000|ref|XP_004135782.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 838
Score = 908 bits (2347), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 458/835 (54%), Positives = 581/835 (69%), Gaps = 71/835 (8%)
Query: 15 LILQTLFNLSLAY-----RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
L+ + L+ Y VS+D AI I+GER+++LSGS+HYPRST MWPDLI+KAK+
Sbjct: 18 LVFSLVVTLACFYFCKGDNVSYDSNAIIINGERRVILSGSMHYPRSTEAMWPDLIQKAKD 77
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
GGLDAIETY+FW+ HEP RR+YDFTG LD I+F + +QD GLYV++RIGPYVCAEWNYGG
Sbjct: 78 GGLDAIETYIFWDRHEPQRRKYDFTGRLDFIKFFQLVQDAGLYVVMRIGPYVCAEWNYGG 137
Query: 130 FPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGN 189
FP+WLHN+PGI + RT N+V+ NEMQ FTT IV+M K+ LFASQGGPIILAQIENEYGN
Sbjct: 138 FPLWLHNLPGI-QFRTDNQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGN 196
Query: 190 VMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM------------FTPNNPN 237
VM+ YG+AGKSYINWCA+MA SL+IG+PWIMCQ++DAP P+ F+PNNP
Sbjct: 197 VMTPYGNAGKSYINWCAQMAESLNIGIPWIMCQQNDAPQPIINTCNGFYCDYDFSPNNPK 256
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
SPK++TENW GWFK WG KDP R+ ED+AFAVARFFQ GG F NYYMYHGGTNFGRT+GG
Sbjct: 257 SPKMFTENWVGWFKKWGDKDPYRSPEDVAFAVARFFQSGGVFNNYYMYHGGTNFGRTAGG 316
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGN---------VTNTDY 348
P++TTSYDY+AP+DEYG+LNQPKWGHL++LH +K EK LT VT T +
Sbjct: 317 PFITTSYDYNAPLDEYGNLNQPKWGHLKQLHASIKMGEKILTNSTRSDQKISSFVTLTKF 376
Query: 349 GNSVSGSSY------------------------NLPAWSVSILPDCKTEEFNTAKVNTQT 384
N SG + +PAWSVSIL C E FNTAK+N+QT
Sbjct: 377 SNPTSGERFCFLSNTDNKNDATIDLQADGKYFVPVPAWSVSILDGCNKEVFNTAKINSQT 436
Query: 385 NVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMT 443
++ VK N+ N Q W W PE + D ++GKG F N L++QK T D SDYLWYMT
Sbjct: 437 SMFVKVQNKKENAQ--FSWVWAPEPMRD-TLQGKGTFKANLLLEQKGTTVDFSDYLWYMT 493
Query: 444 NADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRG 503
N D S N+TL++N+ G +LHA+VN Y+ SQW G S +FE+P+ + G
Sbjct: 494 NI----DSNATSSLQNVTLQVNTKGHMLHAFVNRRYIGSQWRSNGQS-FVFEKPILIKPG 548
Query: 504 KNQISLLSATVGLQNYGSKFDMVPNGIP-GPVLLVGRAGDETIIKDLSSHKWTYKVGLYG 562
N I+LLSATVGL+NY + +D VP GI GP+ L+ GD + DLSS+ W+YKVGL G
Sbjct: 549 TNTITLLSATVGLKNYDAFYDTVPTGIDGGPIYLI---GDGNVKIDLSSNLWSYKVGLNG 605
Query: 563 LDDKKFYNAKAANSERGWSSKNV-PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWV 621
+ K+ YN + WS+ N + RRMTWYKT+F+ P D V L++QGMGKG AWV
Sbjct: 606 -EMKQLYNP-VFSQRTNWSTINQKSIGRRMTWYKTSFKTPSGIDRVTLDMQGMGKGQAWV 663
Query: 622 NGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNT 681
NG ++GR+WP+++A D CST +CDYRG Y KC NCGNPSQ WYH+PRS++ D NT
Sbjct: 664 NGQSIGRFWPSFIASNDSCST-TCDYRGAYNPSKCVENCGNPSQRWYHIPRSFLSDDTNT 722
Query: 682 LVLFEEFGGNPSQINFQTVVVGTACGQAHENKTMELTCHGRR-ISEIKYASFGDPQGACG 740
LVLFEE GGNP Q++ QT+ +GT CG A+E T+EL+C G ISEI++AS+G+P+G CG
Sbjct: 723 LVLFEEIGGNPQQVSVQTITIGTICGNANEGSTLELSCQGGHIISEIQFASYGNPEGKCG 782
Query: 741 AFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+FK+GS I+ L+EK C+G++SCSI+ S + G RL ++ALC
Sbjct: 783 SFKQGSWHV-INSAILVEKLCIGRESCSIDVSAKSFGLGD-VTNLSARLAIQALC 835
>gi|449476344|ref|XP_004154711.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 803
Score = 907 bits (2344), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 453/813 (55%), Positives = 580/813 (71%), Gaps = 63/813 (7%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D AI I+GER+++ SGSIHYPRST MWPDLI+KAK+GGLDAIETY+FW+ HEP R
Sbjct: 5 VSYDSNAIIINGERRVIFSGSIHYPRSTDAMWPDLIQKAKDGGLDAIETYIFWDRHEPQR 64
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
++YDF+G+L+ I+F + +QD GLY+++RIGPYVCAEWNYGGFP+WLHNMPGI+ LRT N+
Sbjct: 65 QKYDFSGHLNFIKFFQLVQDAGLYIVMRIGPYVCAEWNYGGFPLWLHNMPGIQ-LRTDNQ 123
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
V+ NEM FTT IV+M K+ LFASQGGPIILAQIENEYGNVM+ YG+AGK+YINWCA+M
Sbjct: 124 VYKNEMLTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGNAGKAYINWCAQM 183
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A SL+IGVPWIMCQ+SDAP P+ F+PNNP SPK++TENW GWFK WG KD
Sbjct: 184 AESLNIGVPWIMCQQSDAPQPIINTCNGFYCDSFSPNNPKSPKMFTENWVGWFKKWGDKD 243
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R+AED+AF+VARFFQ GG F NYYMYHGGTNFGRTSGGP++TTSYDY+AP+DEYG+LN
Sbjct: 244 PYRSAEDVAFSVARFFQSGGVFNNYYMYHGGTNFGRTSGGPFITTSYDYNAPLDEYGNLN 303
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS------------------------ 353
QPKWGHL++LH +K EK LT G +N +G+ V+
Sbjct: 304 QPKWGHLKQLHSSIKLGEKILTNGTHSNKTFGSFVTLTKFSNPTTKERFCFLSNTDDTND 363
Query: 354 -------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
Y +PAWSVSI+ CK E FNTAK+N+QT++ VK N+ N L W W
Sbjct: 364 ATIDLQADGKYFVPAWSVSIIDGCKKEVFNTAKINSQTSMFVKVQNEKEN--VKLSWVWA 421
Query: 407 PEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
PE ++D ++GKG F N L++QK T D SDYLWYMTN + I N+TL++N
Sbjct: 422 PEAMSD-TLQGKGTFKENLLLEQKGTTIDSSDYLWYMTNVETNGTSSI----HNVTLQVN 476
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
+ G VLHA+VN Y+ SQW G S +FE+P+ L G N I+LLSATVGL+NY + +D
Sbjct: 477 TKGHVLHAFVNTRYIGSQWGNNGQS-FVFEKPILLKAGTNIITLLSATVGLKNYDAFYDT 535
Query: 526 VPNGIPG-PVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
+P GI G P+ L+G D + +LSS+ W+YKVGL G + K+ YN + E W++ N
Sbjct: 536 LPTGIDGGPIYLIG---DGNVTTNLSSNLWSYKVGLNG-EIKQLYNP-VFSQETSWNTLN 590
Query: 585 V-PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTE 643
+ RRMTWYKT+F+ P DPV L++QGMGKG AW+NG ++GR+WP+++A D CS E
Sbjct: 591 KNSIGRRMTWYKTSFKTPSGIDPVTLDMQGMGKGEAWINGQSIGRFWPSFIAGNDNCS-E 649
Query: 644 SCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVG 703
+CDYRG Y KC NCGNPSQ WYH+PRS++ + NTLVLFEE GG+P Q++ QT+ +G
Sbjct: 650 TCDYRGAYDPSKCVGNCGNPSQRWYHIPRSFLSNNTNTLVLFEEIGGSPQQVSVQTITIG 709
Query: 704 TACGQAHENKTMELTCHGRR-ISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCV 762
T CG A+E T+EL+C G ISEI++AS+G+P+G CG+FK+GS + L L+EK C
Sbjct: 710 TICGNANEGSTLELSCQGEYIISEIQFASYGNPKGKCGSFKQGSWDVTNSAL-LLEKTCK 768
Query: 763 GKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
KSCS++ S A L A RLVV+ALC
Sbjct: 769 DMKSCSVDVS-AKLFGLGDAVNLSARLVVQALC 800
>gi|449485873|ref|XP_004157296.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 7-like [Cucumis
sativus]
Length = 813
Score = 904 bits (2337), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 456/814 (56%), Positives = 572/814 (70%), Gaps = 64/814 (7%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D AI I+GER+++LSGS+HYPRST MWPDLI+KAK+GGLDAIETY+FW+ HEP R
Sbjct: 12 VSYDSNAIIINGERRVILSGSMHYPRSTEAMWPDLIQKAKDGGLDAIETYIFWDRHEPQR 71
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
R+YDFTG LD I+F + +QD GLYV++RIGPYVCAEWNYGGFP+WLHN+PGI + RT N+
Sbjct: 72 RKYDFTGRLDFIKFFQLVQDAGLYVVMRIGPYVCAEWNYGGFPLWLHNLPGI-QFRTDNQ 130
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
V+ NEMQ FTT IV+M K+ LFASQGGPIILAQIENEYGNVM+ YG+AGKSYINWCA+M
Sbjct: 131 VYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGNAGKSYINWCAQM 190
Query: 209 ATSLDIGVPWIMCQESDAPSPM------------FTPNNPNSPKIWTENWTGWFKSWGGK 256
A SL+IG+PWIMCQ+SDAP P+ F+PNNP SPK++TENW GWFK WG K
Sbjct: 191 AESLNIGIPWIMCQQSDAPQPIINTCNGFYCDYDFSPNNPKSPKMFTENWVGWFKKWGDK 250
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHL 316
DP R+ ED+AFAVARFFQ GG F NYYMYHGGTNFGRT+GGP++TTSYDY+AP+DEYG+L
Sbjct: 251 DPYRSPEDVAFAVARFFQSGGVFNNYYMYHGGTNFGRTAGGPFITTSYDYNAPLDEYGNL 310
Query: 317 NQPKWGHLRELHKLLKSMEKTLTYGN---------VTNTDYGNSVSGS------------ 355
NQPKWGHL++LH +K EK LT VT T + N SG
Sbjct: 311 NQPKWGHLKQLHASIKMGEKILTNSTRSDQKLXSFVTLTKFSNPTSGERFCFLSNTDNKN 370
Query: 356 ----------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKW 405
Y +PAWSVSIL C E FNTAK+N+QT++ VK N+ N Q W W
Sbjct: 371 DATIDLQADGKYFVPAWSVSILDGCNKEVFNTAKINSQTSMFVKVQNKKENAQ--FSWVW 428
Query: 406 RPEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTLRI 464
PE + D ++GKG F N L++QK T D SDYLWYMTN D S N+TL++
Sbjct: 429 APEPMRD-TLQGKGTFKANLLLEQKGTTVDFSDYLWYMTNI----DSNATSSLQNVTLQV 483
Query: 465 NSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFD 524
N+ G +LHA+VN Y+ SQW G S +F +P+ + G N I+LLSATVGL+NY + +D
Sbjct: 484 NTKGHMLHAFVNRRYIGSQWRSNGQS-FVFXKPILIKPGTNTITLLSATVGLKNYDAFYD 542
Query: 525 MVPNGIP-GPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSK 583
VP GI GP+ L+ GD + DLSS+ W+YKVGL G + K+ YN + WS+
Sbjct: 543 TVPTGIDGGPIYLI---GDGNVKIDLSSNLWSYKVGLNG-EMKQLYNP-VFSQRTNWSTI 597
Query: 584 NV-PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCST 642
N + RRMT YKT F+ P DPV L++QGMGKG AWVNG ++GR+WP+++A D CST
Sbjct: 598 NQKSIGRRMTLYKTNFKTPSGIDPVTLDMQGMGKGQAWVNGQSIGRFWPSFIAGNDSCST 657
Query: 643 ESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVV 702
+CDYRG Y KC NCGNPSQ WYH+PRS++ D NTLVLFEE GGNP Q++ QT+ +
Sbjct: 658 -TCDYRGAYNPSKCVENCGNPSQRWYHIPRSFLSDDTNTLVLFEEIGGNPQQVSVQTITI 716
Query: 703 GTACGQAHENKTMELTCHGRR-ISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQC 761
GT CG A+E T+EL+C G ISEI++AS+G+P+G CG+FK+GS I+ L+EK C
Sbjct: 717 GTICGNANEGSTLELSCQGGHIISEIQFASYGNPEGKCGSFKQGSWHV-INSAILVEKLC 775
Query: 762 VGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+G +SCSI+ S + G RL ++ALC
Sbjct: 776 IGMESCSIDVSAKSFGLGD-VTNISARLAIQALC 808
>gi|255575455|ref|XP_002528629.1| beta-galactosidase, putative [Ricinus communis]
gi|223531918|gb|EEF33732.1| beta-galactosidase, putative [Ricinus communis]
Length = 822
Score = 891 bits (2303), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 453/826 (54%), Positives = 567/826 (68%), Gaps = 57/826 (6%)
Query: 23 LSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWN 82
+ Y V++D RAI IDG RK++LSGSIHYPRSTP MWP LI+KAKEGGL+ IETYVFWN
Sbjct: 1 MGFGYEVTYDNRAIKIDGARKLILSGSIHYPRSTPEMWPQLIRKAKEGGLNTIETYVFWN 60
Query: 83 AHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEE 142
AHEP +RQYDF+GNLDLIRFIKTI+D+GLY ILRIGPYVCAEWNYGGFPVWLHN+PGI +
Sbjct: 61 AHEPHQRQYDFSGNLDLIRFIKTIRDEGLYAILRIGPYVCAEWNYGGFPVWLHNLPGI-Q 119
Query: 143 LRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYI 202
+RT N+V+ NEM+ FTTLIV+M K KLFASQGGPIIL+QIENEYGNV S YGD GK Y+
Sbjct: 120 IRTNNEVYKNEMEIFTTLIVNMMKDGKLFASQGGPIILSQIENEYGNVQSSYGDEGKEYV 179
Query: 203 NWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFK 251
WCA +A S +GVPWIMCQ+SDAPSPM + NN + PKIWTENWTGWF+
Sbjct: 180 KWCANLAESFKVGVPWIMCQQSDAPSPMIDSCNGFYCDQYYSNNKSLPKIWTENWTGWFQ 239
Query: 252 SWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPID 311
WG K+P R+AED+AFAVARFFQ GG+ NYYMYHGGTNFG T GGPY+T SYDYDAP+D
Sbjct: 240 DWGQKNPHRSAEDVAFAVARFFQLGGSVMNYYMYHGGTNFGTTGGGPYITASYDYDAPLD 299
Query: 312 EYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------------------- 350
EYG+L QPKWGHLR+LH +L SME+TLTYG N++Y +
Sbjct: 300 EYGNLRQPKWGHLRDLHSVLNSMEQTLTYGESKNSNYPDNNNIFITIFAYQGKRSCFFSS 359
Query: 351 --------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAP-- 400
S G+ Y LPAWSVSILPDC TE +NTA VN QT++ + N A + + P
Sbjct: 360 IDYKDQTISFEGTDYFLPAWSVSILPDCFTEVYNTATVNVQTSIMENKANAADSFREPNS 419
Query: 401 LQWKWRPEMINDFVVRGK---GHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSG 456
LQWKWRPE I ++G N L+DQK+ TN SDYLW MTN D +D +
Sbjct: 420 LQWKWRPEKIRGLSLQGDFVGNTLVANELMDQKAVTNGTSDYLWIMTNYDHNMNDSLWGA 479
Query: 457 SSNMTLRINSSGQVLHAYVNGNYVDSQWT--KYGASNDLFERPVKLTRGKNQISLLSATV 514
++ L+++++G V+HA+VNG +V SQ + G + +FE +KL RG N+ISL+S +V
Sbjct: 480 GKDIILQVHTNGHVVHAFVNGKHVGSQSASIESGRFDFVFESKIKLKRGINRISLVSVSV 539
Query: 515 GLQNYGSKFDMVPNGIPGPVLLVGRA---GDETIIKDLSSHKWTYKVGLYGLDDKKFYNA 571
GLQNYG+ FD P GI GP+ ++GR+ + D+SS++W YK GL+G D + A
Sbjct: 540 GLQNYGANFDTAPTGINGPITIIGRSKLGNQPDVTVDISSNRWVYKTGLHGEDQG--FQA 597
Query: 572 KAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWP 631
R + +K+V +N+ WYKT+F APL DPVV++L G+GKG AWVNG N+GR+WP
Sbjct: 598 VRPRHRRQFYTKHVLINQPFVWYKTSFNAPLGQDPVVVDLLGLGKGTAWVNGRNIGRFWP 657
Query: 632 TYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGN 691
LA +DG C Y G Y +C CG P+Q +YH+PR W+K N LVLFEE GG
Sbjct: 658 KALAPDDGTCNAPCSYIGTYEPKQCVTGCGEPTQRYYHIPRDWLKPEDNKLVLFEELGGT 717
Query: 692 PSQINFQTVVVGTACGQAHENKTMELTC-HGRRISEIKYASFGDPQGACGAFK-KGSCEA 749
P ++ QTV VG C +E T+EL+C HGR+ S+I +ASFG PQG CG+F + +
Sbjct: 718 PDFVSVQTVTVGKVCVHGYEGHTVELSCQHGRKFSKITFASFGLPQGKCGSFTPSNNHDC 777
Query: 750 EIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
DV ++EK CVGK+ CSI+ SE L C A + RL VEA+C
Sbjct: 778 HADVSTIVEKACVGKERCSIDISEKALAPIHCDA-RIYRLAVEAVC 822
>gi|356558952|ref|XP_003547766.1| PREDICTED: beta-galactosidase 7-like [Glycine max]
Length = 826
Score = 887 bits (2292), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 447/831 (53%), Positives = 578/831 (69%), Gaps = 64/831 (7%)
Query: 13 LCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGL 72
+ L A V++D R++ I+GER+++ SG++HYPRST MWPD+I+KAK+GGL
Sbjct: 12 IALFFLAFTASCFATEVTYDARSLIINGERRVIFSGAVHYPRSTVQMWPDIIQKAKDGGL 71
Query: 73 DAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPV 132
DAIE+YVFW+ HEP+RR+YDF+GNLD I+F + IQ+ GLY ILRIGPYVCAEWN+GGFP+
Sbjct: 72 DAIESYVFWDRHEPVRREYDFSGNLDFIKFFQIIQEAGLYAILRIGPYVCAEWNFGGFPL 131
Query: 133 WLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS 192
WLHNMPGIE LRT N ++ NEMQ FTT IV+MAK+ KLFASQGGPIILAQIENEYGN+M+
Sbjct: 132 WLHNMPGIE-LRTDNPIYKNEMQIFTTKIVNMAKEAKLFASQGGPIILAQIENEYGNIMT 190
Query: 193 DYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKI 241
DYG+AGK+YI WCA+MA + +IGVPWIMCQ+ DAP PM F PNNP SPK+
Sbjct: 191 DYGEAGKTYIKWCAQMALAQNIGVPWIMCQQHDAPQPMINTCNGHYCDSFQPNNPKSPKM 250
Query: 242 WTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT 301
+TENW GWF+ WG + P R+AED AF+VARFFQ GG NYYMYHGGTNFGRT+GGPY+T
Sbjct: 251 FTENWIGWFQKWGERVPHRSAEDSAFSVARFFQNGGILNNYYMYHGGTNFGRTAGGPYMT 310
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYN--- 358
TSY+YDAP+DEYG+LNQPKWGHL++LH +K EK +T G T+ D+GN V+ ++Y
Sbjct: 311 TSYEYDAPLDEYGNLNQPKWGHLKQLHAAIKLGEKIITNGTRTDKDFGNEVTLTTYTHTN 370
Query: 359 ---------------------------LPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRP 391
LPAWSV+IL C E FNTAKVN+QT++ VK+
Sbjct: 371 GERFCFLSNTNDSKDANVDLQQDGNYFLPAWSVTILDGCNKEVFNTAKVNSQTSIMVKKS 430
Query: 392 NQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDD 450
+ A N L W W PE D + GKG+F +N L++QK T DVSDYLWYMT+ D+ D
Sbjct: 431 DDASNK---LTWAWIPEKKKD-TMHGKGNFKVNQLLEQKELTFDVSDYLWYMTSVDIND- 485
Query: 451 DPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLL 510
S SN TLR+N+ G L AYVNG +V +++++G N +E+ V L +G N I+LL
Sbjct: 486 ---TSIWSNATLRVNTRGHTLRAYVNGRHVGYKFSQWGG-NFTYEKYVSLKKGLNVITLL 541
Query: 511 SATVGLQNYGSKFDMVPNGIPG-PVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFY 569
SATVGL NYG+KFD + GI G PV L+G +ETI DLS++ W+YK+GL G + K+ Y
Sbjct: 542 SATVGLPNYGAKFDKIKTGIAGGPVQLIGN-NNETI--DLSTNLWSYKIGLNG-EKKRLY 597
Query: 570 NAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRY 629
+ + ++ P+ R +TWYK F AP NDPVV++L G+GKG AWVNG ++GRY
Sbjct: 598 DPQPRIGVSWRTNSPYPIGRSLTWYKADFVAPSGNDPVVVDLLGLGKGEAWVNGQSIGRY 657
Query: 630 WPTYLAEEDGCSTESCDYRGPY-GSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEF 688
W +++ +GCS ++CDYRG Y + KC NCGNPSQ WYHVPRS++K+ NTLVLFEE
Sbjct: 658 WTSWITATNGCS-DTCDYRGKYVPAQKCNTNCGNPSQRWYHVPRSFLKNDKNTLVLFEEI 716
Query: 689 GGNPSQINFQTVVVGTACGQAHENKTMELTCH-GRRISEIKYASFGDPQGACGAFKKGSC 747
GGNP ++FQTV+ GT C Q E +EL+C G+ IS+I+++SFG+P G CG+FKKG+
Sbjct: 717 GGNPQNVSFQTVITGTICAQVQEGALLELSCQGGKTISQIQFSSFGNPTGNCGSFKKGTW 776
Query: 748 EAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGT---VKRLVVEALC 795
EA D ++E CVG+ SC ++ G V RL V+A C
Sbjct: 777 EA-TDGQSVVEAACVGRNSCGFMVTKEAFGVAIGPMNVDERVARLAVQATC 826
>gi|224068510|ref|XP_002326135.1| predicted protein [Populus trichocarpa]
gi|222833328|gb|EEE71805.1| predicted protein [Populus trichocarpa]
Length = 824
Score = 886 bits (2290), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 458/815 (56%), Positives = 563/815 (69%), Gaps = 69/815 (8%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V +D A+ I+G+RKI+LSGSIHYPRST MW DLI+KAKEGGLD IETY+FWNAHE R
Sbjct: 30 VEYDSSAVIINGQRKIILSGSIHYPRSTVEMWSDLIQKAKEGGLDTIETYIFWNAHERRR 89
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
R+Y+FTGNLD ++F + +Q+ GLY ILRIGPY CAEWNYGGFPVWLHN+P I+ RT N+
Sbjct: 90 REYNFTGNLDFVKFFQKVQEAGLYGILRIGPYACAEWNYGGFPVWLHNIPEIK-FRTDNE 148
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
+F NEMQ FTT IV+MAK+ KLFASQGGPIILAQIENEYGNVM YG+AGKSY+ WCA+M
Sbjct: 149 IFKNEMQTFTTKIVNMAKEAKLFASQGGPIILAQIENEYGNVMGPYGEAGKSYVQWCAQM 208
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A + +IGVPWIMCQ+SDAPS + FTPN+P SPK+WTENWTGW+K WG KD
Sbjct: 209 AVAQNIGVPWIMCQQSDAPSSVINTCNGFYCDTFTPNSPKSPKMWTENWTGWYKKWGQKD 268
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P RTAEDLAF+VARFFQ+ G QNYYMY+GGTNFGRTSGGP++ TSYDYDAP+DEYG+LN
Sbjct: 269 PHRTAEDLAFSVARFFQYNGVLQNYYMYYGGTNFGRTSGGPFIATSYDYDAPLDEYGNLN 328
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN----------SVSGS------------ 355
QPKWGHL+ LH LK EK LT V T Y + ++ G
Sbjct: 329 QPKWGHLKNLHAALKLGEKILTNSTVKTTKYSDGWVELTTYTSNIDGERLCFLSNTKMDG 388
Query: 356 ---------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQA-PLQWKW 405
Y +PAWSVSIL DC E +NTAKVN QT++ VK+ ++ ND L W+W
Sbjct: 389 LDVDLQQDGKYFVPAWSVSILQDCNKETYNTAKVNVQTSLIVKKLHE--NDTPLKLSWEW 446
Query: 406 RPEMINDFVVRGKGHFALNTLIDQK-STNDVSDYLWYMTNADLKDDDPILSGSSNMTLRI 464
PE + G+G F L++QK +T D SDYLWYMT+ D + S N+TLR+
Sbjct: 447 APEPTKA-PLHGQGGFKATQLLEQKAATYDESDYLWYMTSVDNNG-----TASKNVTLRV 500
Query: 465 NSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFD 524
SGQ LHA+VNG + SQ FE+P L G N ISLLSATVGLQNYG FD
Sbjct: 501 KYSGQFLHAFVNGKEIGSQ----HGYTFTFEKPALLKPGTNIISLLSATVGLQNYGEFFD 556
Query: 525 MVPNGIP-GPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSK 583
P GI GPV L+ T DLSS++W+YKVGL G + +FY+ + ++ W S
Sbjct: 557 EGPEGIAGGPVELIDSGNTTT---DLSSNEWSYKVGLNG-EGGRFYDPTSGRAK--WVSG 610
Query: 584 NVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTE 643
N+ + R MTWYKTTF+AP +PVV++LQGMGKG AWVNG +LGR+WP A+ +GC +
Sbjct: 611 NLRVGRAMTWYKTTFQAPSGTEPVVVDLQGMGKGHAWVNGNSLGRFWPILTADPNGCDGK 670
Query: 644 SCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVG 703
CDYRG Y KC NCGNP+Q WYHVPRS++ +G NTL+LFEE GGNPS ++FQ
Sbjct: 671 -CDYRGQYKEGKCLSNCGNPTQRWYHVPRSFLNNGSNTLILFEEIGGNPSDVSFQITATE 729
Query: 704 TACGQAHENKTMELTCHGRR--ISEIKYASFGDPQG-ACGAFKKGSCEAEIDVLPLIEKQ 760
T CG +E T+EL+C+G R IS+I+YASFGDPQG +CG+F++GS EA +EK
Sbjct: 730 TICGNTYEGTTLELSCNGGRRIISDIQYASFGDPQGSSCGSFQRGSVEASRS-FSAVEKA 788
Query: 761 CVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
C+GK+SCSI S+A G RLVV+A+C
Sbjct: 789 CMGKESCSINVSKATFGVEDSFGVDNNRLVVQAVC 823
>gi|147843477|emb|CAN82062.1| hypothetical protein VITISV_016430 [Vitis vinifera]
Length = 773
Score = 874 bits (2257), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 455/818 (55%), Positives = 549/818 (67%), Gaps = 115/818 (14%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A +++ D R I I+GERKIL+SGS+HYPRSTP MWPDLI+K+K+GGL+ I+TYVFW+ HE
Sbjct: 23 ADQITSDARGIMINGERKILISGSVHYPRSTPEMWPDLIQKSKDGGLNTIDTYVFWDLHE 82
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P RRQYDFTGN DL+RFIK IQ QGLY +LRIGPYVCAEW YGGFPVWLHN P I +LRT
Sbjct: 83 PQRRQYDFTGNKDLVRFIKAIQAQGLYAVLRIGPYVCAEWTYGGFPVWLHNQPSI-QLRT 141
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
N V+M IENEYGNVM Y DAG YINWC
Sbjct: 142 NNTVYM-------------------------------IENEYGNVMRAYHDAGVQYINWC 170
Query: 206 AKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWG 254
A+MA +LD GVPWIMCQ+ +AP PM FTPNNPNSPK+WTENW+GW+K+WG
Sbjct: 171 AQMAAALDTGVPWIMCQQDNAPQPMINTCNGYYCDQFTPNNPNSPKMWTENWSGWYKNWG 230
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
G DP RTAEDLAF+VARF+Q GGTFQNYYMYHGGTNFGRT+GGPY+TTSYDYDAP++EYG
Sbjct: 231 GSDPHRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLNEYG 290
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDY-------------------GNS---- 351
+ NQPKWGHLR+LH LL SMEK LTYG+V N DY GNS
Sbjct: 291 NKNQPKWGHLRDLHLLLLSMEKALTYGDVKNVDYETLTSATIYSYQGKSSCFFGNSNADR 350
Query: 352 -----VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
G +Y +PAWSVSILPDC E +NTAKVN+Q + VK+ ++A N+ LQW WR
Sbjct: 351 DVTINYGGVNYTIPAWSVSILPDCSNEVYNTAKVNSQYSTFVKKGSEAENEPNSLQWTWR 410
Query: 407 PEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINS 466
E I ++ G + D+ +DDPI ++TL +N+
Sbjct: 411 GETI-QYITPG--------------------------SVDISNDDPIW--GKDLTLSVNT 441
Query: 467 SGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMV 526
SG +LHA+VNG ++ Q+ G F R + L GKN+I+LLS TVGL NYG FDMV
Sbjct: 442 SGHILHAFVNGEHIGYQYALLGQFEFQFRRSITLQLGKNEITLLSVTVGLTNYGPDFDMV 501
Query: 527 PNGIPGPVLLVGRAGDETIIKDLS-SHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNV 585
GI GPV ++ G IIKDLS +++W YK GL G +DKK + +A ++ W S N+
Sbjct: 502 NQGIHGPVQIIASNGSADIIKDLSNNNQWAYKAGLNG-EDKKIFLGRARYNQ--WKSDNL 558
Query: 586 PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESC 645
P+NR WYK TF+AP DPVV++L G+GKG AWVNG++LGRYWP+Y+A +GCS E C
Sbjct: 559 PVNRSFVWYKATFDAPPGEDPVVVDLMGLGKGEAWVNGHSLGRYWPSYIARGEGCSPE-C 617
Query: 646 DYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTA 705
DYRGPY ++KC NCGNPSQ WYHVPRS++ N LVLFEEF GNPS + FQTV VG A
Sbjct: 618 DYRGPYKAEKCNTNCGNPSQRWYHVPRSFLASTDNRLVLFEEFXGNPSSVTFQTVTVGNA 677
Query: 706 CGQAHENKTMELTCHGRRISEIKYASFGDPQGACG--------AFKKGSCEAEIDVLPLI 757
C A E T+EL+C GR IS IK+ASFGDPQG CG F+KG+CEA D L +I
Sbjct: 678 CANAREGYTLELSCQGRAISXIKFASFGDPQGTCGKPFATGSQVFEKGTCEAA-DSLSII 736
Query: 758 EKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+K CVGK SCSI+ SE LG C A T KRL VEA+C
Sbjct: 737 QKLCVGKYSCSIDVSEQILGPAGCTADT-KRLAVEAIC 773
>gi|449433325|ref|XP_004134448.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 7-like [Cucumis
sativus]
Length = 803
Score = 848 bits (2191), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 421/815 (51%), Positives = 549/815 (67%), Gaps = 57/815 (6%)
Query: 20 LFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYV 79
LF+ + V++DGR++ I+GERKI++SG+IHYPRS+PGMWP L+KKAK GGL+AIETYV
Sbjct: 7 LFSSAKKISVTYDGRSLKINGERKIIISGAIHYPRSSPGMWPMLMKKAKNGGLNAIETYV 66
Query: 80 FWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPG 139
FWNAHEP R QYDF+GN DL++FIK +Q + LY ILRIGPYVCAEWNYGGFPVWLHN+PG
Sbjct: 67 FWNAHEPQRGQYDFSGNNDLVQFIKAVQKERLYAILRIGPYVCAEWNYGGFPVWLHNLPG 126
Query: 140 IEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGK 199
I+ RT N+V+ F L ++ K +F + IENE+GNV YG GK
Sbjct: 127 IK-FRTNNQVYKVTFX-FFFLTKNLKKINNMF-------LKNXIENEFGNVEGSYGQEGK 177
Query: 200 SYINWCAKMATSLDIGVPWIMCQESDAPSPM------FTPNNPNSPKIWTENWTGWFKSW 253
Y+ WCA++A S ++ PWIMCQ+ DAP P+ F PNN NSPK+WTE+W GWFK W
Sbjct: 178 EYVKWCAELAQSYNLSEPWIMCQQGDAPQPIVCNCDQFKPNNKNSPKMWTESWAGWFKGW 237
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEY 313
G +DP RTAEDLAFAVARFFQ+GG+ NYYMYHGGTNFGR++GGPY+TTSYDY+AP+DEY
Sbjct: 238 GERDPYRTAEDLAFAVARFFQYGGSLHNYYMYHGGTNFGRSAGGPYITTSYDYNAPLDEY 297
Query: 314 GHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSS----------------- 356
G++NQPKWGHL++LH+L++SMEK LTYG+V + D G+S + +S
Sbjct: 298 GNMNQPKWGHLKQLHELIRSMEKVLTYGDVKHIDTGHSTTATSYTYKGKSSCFFGNPENS 357
Query: 357 ----------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
Y +P WSV++LPDCKTE +NTAKVNTQT ++ P+ G + PL+W+WR
Sbjct: 358 DREITFQERKYTVPGWSVTVLPDCKTEVYNTAKVNTQTTIREMVPSLVGKHKKPLKWQWR 417
Query: 407 PEMINDFVVRGK---GHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTL 462
E I G N+LIDQK TND SDYLWY+T L +DP+ +TL
Sbjct: 418 NEKIEHLTHEGDISGSAITANSLIDQKMVTNDSSDYLWYLTGFHLNGNDPLF--GKRVTL 475
Query: 463 RINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVK-LTRGKNQISLLSATVGLQNYGS 521
R+ + G +LHA+VN ++ +Q+ YG + E+ V+ L G NQI+LLSATVGL NYG+
Sbjct: 476 RVKTRGHILHAFVNNKHIGTQFGPYGKYSFTLEKKVRNLRHGFNQIALLSATVGLPNYGA 535
Query: 522 KFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWS 581
++ V GI GPV L+ D I+DLS+++W YKVGL G + +F++ + W
Sbjct: 536 YYENVEVGIYGPVELI---ADGKTIRDLSTNEWIYKVGLDG-EKYEFFDPDHK-FRKPWL 590
Query: 582 SKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCS 641
S N+PLN+ TWYKT+F P + VV++L GMGKG AWVNG ++GRYWP+YLA E+GCS
Sbjct: 591 SNNLPLNQNFTWYKTSFSTPKGREGVVVDLMGMGKGQAWVNGKSIGRYWPSYLATENGCS 650
Query: 642 TESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGV-NTLVLFEEFGGNPSQINFQTV 700
+ SCDYRG Y KCA NCG P+Q WYH+PRS++ DG NTL+LFEEFGG P I +T
Sbjct: 651 S-SCDYRGAYYGSKCATNCGKPTQRWYHIPRSYMNDGKENTLILFEEFGGMPLNIEIKTT 709
Query: 701 VVGTACGQAHENKTMELTCHGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQ 760
V C + +ELTCH R + I + FG+P+G C F KGSC + + +IEK+
Sbjct: 710 RVKKVCAKVDLGSKLELTCHDRTVKRIIFVGFGNPKGNCNNFHKGSCHSS-EAFSVIEKE 768
Query: 761 CVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
C+ K+ CSIE ++ LG T C L V+ C
Sbjct: 769 CLWKRKCSIEVTKDKLGLTGCKNPKDNWLAVQVSC 803
>gi|357484129|ref|XP_003612351.1| Beta-galactosidase [Medicago truncatula]
gi|355513686|gb|AES95309.1| Beta-galactosidase [Medicago truncatula]
Length = 806
Score = 844 bits (2181), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 422/819 (51%), Positives = 559/819 (68%), Gaps = 66/819 (8%)
Query: 25 LAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAH 84
A V++D A+ I+GER+++ SG+IHYPRST MWPDLI+KAK+GGLDAIETY+FW+ H
Sbjct: 6 FATEVTYDSNALIINGERRLIFSGAIHYPRSTVEMWPDLIQKAKDGGLDAIETYIFWDRH 65
Query: 85 EPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELR 144
EP+RR+Y+F+GNLD ++F + IQ GLY I+RIGPY CAEWN+GGFP WLHNMPGIE LR
Sbjct: 66 EPVRREYNFSGNLDFVKFFQLIQKAGLYAIMRIGPYACAEWNFGGFPSWLHNMPGIE-LR 124
Query: 145 TTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW 204
T N V+ NEMQNFTT IV++ K+ KLFASQGGPIILAQIENEYG++M +Y DAGK+Y+ W
Sbjct: 125 TNNSVYKNEMQNFTTEIVNVVKEAKLFASQGGPIILAQIENEYGDIMWNYKDAGKAYVQW 184
Query: 205 CAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSW 253
A+MA + +IGVPWIMCQ+ DAP P+ F PNNP SPKI+TENW GWF+ W
Sbjct: 185 AAQMALAQNIGVPWIMCQQQDAPQPIINTCNGYYCHNFQPNNPKSPKIFTENWIGWFQKW 244
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEY 313
G + P R+AED AF+VARFFQ GG NYYMYHGGTNFGRT+GGPY+TTSYDYDAPIDEY
Sbjct: 245 GERVPHRSAEDSAFSVARFFQNGGVLNNYYMYHGGTNFGRTAGGPYITTSYDYDAPIDEY 304
Query: 314 GHLNQPKWGHLRELHKLLKSMEKTLT-YGNVTNTDYGNSVSGSSYN-------------- 358
G+LNQPKWGHL+ LH +K E LT Y + D GN ++ ++Y
Sbjct: 305 GNLNQPKWGHLKNLHAAIKLGENVLTNYSARKDEDLGNGLTLTTYTNSSGARFCFLSNNN 364
Query: 359 -----------------LPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPL 401
+PAWSVSI+ C E FNTAKVN+QT++ VK+ + + L
Sbjct: 365 NTDLGARVDLKNDGVYIVPAWSVSIINGCNQEVFNTAKVNSQTSMMVKKSDNVSSTN--L 422
Query: 402 QWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNM 460
W+W+ E D + G G L++QK T D SDYLWYMT+AD+ D S SN
Sbjct: 423 TWEWKVEPKRD-TIHGNGSLKAQKLLEQKELTLDASDYLWYMTSADIND----TSIWSNA 477
Query: 461 TLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYG 520
TLR+N+SG LH YVN YV Q+++YG + +E+ V L G N I+LLSATVGL NYG
Sbjct: 478 TLRVNTSGHSLHGYVNQRYVGYQFSQYG-NQFTYEKQVSLKNGTNIITLLSATVGLANYG 536
Query: 521 SKFDMVPNGIPG-PVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERG 579
+ FD GI G PV L+G+ + DLS++ W+YK+GL G + + Y+A+ N
Sbjct: 537 AWFDDKKTGISGGPVELIGK---NNVTMDLSTNLWSYKIGLNG-ERRHLYDAQQ-NVSVA 591
Query: 580 W--SSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEE 637
W +S +P+ + + WY+ F++P +P+V++LQG+GKG AWVNG+++GRYW ++++
Sbjct: 592 WHTNSSYIPIGKPLIWYRAKFKSPFGTNPIVVDLQGLGKGHAWVNGHSIGRYWSSWISPS 651
Query: 638 DGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
DGCS ++CDYRG Y KC NCG+PSQ WYHVPRS++ +NTLVLFEE GGNP + F
Sbjct: 652 DGCS-DTCDYRGNYVPVKCNTNCGSPSQRWYHVPRSFLNHDMNTLVLFEEIGGNPQSVQF 710
Query: 698 QTVVVGTACGQAHENKTMELTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPL 756
QTV GT C +E EL+C G+ +S+I++AS+G+P+G CG+FKKG+ +A + +
Sbjct: 711 QTVTTGTICANVYEGAQFELSCQSGQVMSQIQFASYGNPEGQCGSFKKGNFDAA-NSQSV 769
Query: 757 IEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+E CVGK +C ++ G T+ + ++ RL V+ C
Sbjct: 770 VEASCVGKNNCGFNVTKEMFGVTNVS--SIPRLAVQVTC 806
>gi|224142776|ref|XP_002324727.1| predicted protein [Populus trichocarpa]
gi|222866161|gb|EEF03292.1| predicted protein [Populus trichocarpa]
Length = 749
Score = 831 bits (2146), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 415/745 (55%), Positives = 529/745 (71%), Gaps = 64/745 (8%)
Query: 59 MWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIG 118
MWP+L +KAKEGG+DAIETY+FW+ HEP+RRQY F+GN D+++F K Q+ GL+VILRIG
Sbjct: 1 MWPELFQKAKEGGIDAIETYIFWDRHEPVRRQYYFSGNQDIVKFCKLAQEAGLHVILRIG 60
Query: 119 PYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPI 178
PYVCAEW+YGGFP+WLHN+PGIE LRT N+++ NEMQ FTT IVD+ K+ KLFA QGGPI
Sbjct: 61 PYVCAEWSYGGFPMWLHNIPGIE-LRTDNEIYKNEMQIFTTKIVDVCKEAKLFAPQGGPI 119
Query: 179 ILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-------- 230
ILAQIENEYGNVM YGDAG+ Y+NWCA+MA ++GVPWIMCQ+S+AP PM
Sbjct: 120 ILAQIENEYGNVMGPYGDAGRRYVNWCAQMAVGQNVGVPWIMCQQSNAPQPMINTCNGFY 179
Query: 231 ---FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHG 287
F PNNP SPK+WTENW+GWFK WGG+DP RTAEDLAF+VARF Q GG +YYMYHG
Sbjct: 180 CDQFKPNNPKSPKMWTENWSGWFKLWGGRDPYRTAEDLAFSVARFIQNGGVLNSYYMYHG 239
Query: 288 GTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTN-- 345
GTNFGRT+GGPY+TTSYDY+AP+DEYG+LNQPKWGHL++LH+ +K E+ LT G VT+
Sbjct: 240 GTNFGRTAGGPYITTSYDYNAPLDEYGNLNQPKWGHLKQLHEAIKQGERILTNGTVTSKN 299
Query: 346 -------TDYGNSVSGS---------------------SYNLPAWSVSILPDCKTEEFNT 377
T Y N +G Y+LPAWSV+IL DC E +NT
Sbjct: 300 FWGGVDQTTYTNQGTGERFCFLSNTNMEEANVDLGQDGKYSLPAWSVTILQDCNKEIYNT 359
Query: 378 AKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTN-DVS 436
AKVNTQT++ VK+ ++ + L W W PE + V++GKG F L++QK T D +
Sbjct: 360 AKVNTQTSIMVKKLHEE-DKPVQLSWTWAPEPMKG-VLQGKGRFRATELLEQKETTVDTT 417
Query: 437 DYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASND---- 492
DYLWYMT+ +L ++ L +N+TLR+ + G LHAYVN + +Q++K +
Sbjct: 418 DYLWYMTSVNL--NETTLKKWTNVTLRVGTRGHTLHAYVNKKEIGTQFSKQANAQQSVKG 475
Query: 493 -----LFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIP-GPVLLVGRAGDETII 546
LFE+PV LT G N ISLLSATVGL NYG +D P GI GPV LV
Sbjct: 476 DDYSFLFEKPVTLTSGTNTISLLSATVGLANYGQYYDKKPVGIAEGPVQLVANGKP---F 532
Query: 547 KDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDP 606
DL+S++W+YK+GL G + K++ + + ++ + +S N+P R MTWYKTTF +P +P
Sbjct: 533 MDLTSYQWSYKIGLSG-EAKRYNDPNSPHASKFTASDNLPTGRAMTWYKTTFASPSGTEP 591
Query: 607 VVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQI 666
VV++L GMGKG AWVNG +LGR+WPT +A+ GC ++CDYRG Y DKC NCGNPSQ
Sbjct: 592 VVVDLLGMGKGHAWVNGKSLGRFWPTQIADAKGCP-DTCDYRGSYNGDKCVTNCGNPSQR 650
Query: 667 WYHVPRSWI-KDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHENKTMELTCH-GRRI 724
WYH+PRS++ KDG NTL+LFEE GGNP+ ++FQ V V T CG A+E T+EL+C GR I
Sbjct: 651 WYHIPRSYLNKDGQNTLILFEEVGGNPTNVSFQIVAVETICGNAYEGSTLELSCEGGRTI 710
Query: 725 SEIKYASFGDPQGACGAFKKGSCEA 749
S+I++AS+GDP+G CGAF KGS A
Sbjct: 711 SDIQFASYGDPEGTCGAFMKGSFYA 735
>gi|116787095|gb|ABK24373.1| unknown [Picea sitchensis]
Length = 861
Score = 825 bits (2132), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/868 (49%), Positives = 536/868 (61%), Gaps = 93/868 (10%)
Query: 9 RAILLCLILQTLFN------LSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPD 62
R + L LI LFN A V++D R++ IDG+R++L+SGSIHYPRSTP MWPD
Sbjct: 5 RNLRLVLIYAFLFNGFYYWKHVSAANVTYDHRSLLIDGQRRVLISGSIHYPRSTPEMWPD 64
Query: 63 LIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVC 122
+I+KAK+GGLD IE+YVFWN HEP + +Y F DL++F+K +Q GL V LRIGPY C
Sbjct: 65 IIQKAKDGGLDVIESYVFWNMHEPKQNEYYFEDRFDLVKFVKIVQQAGLLVHLRIGPYAC 124
Query: 123 AEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQ 182
AEWNYGGFPVWLH +PGI RT N+ F NEMQ FT IVDM K+EKLFASQGGPIILAQ
Sbjct: 125 AEWNYGGFPVWLHLIPGIH-FRTDNEPFKNEMQRFTAKIVDMMKQEKLFASQGGPIILAQ 183
Query: 183 IENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------F 231
IENEYGN+ YG AGKSY+ W A MA L+ GVPW+MCQ++DAP P+ F
Sbjct: 184 IENEYGNIDGPYGAAGKSYVKWAASMAVGLNTGVPWVMCQQADAPDPIINTCNGFYCDAF 243
Query: 232 TPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNF 291
TPN+PN PK+WTENW+GWF S+GG+ P R EDLAF+VARFFQ GGTFQNYYMYHGGTNF
Sbjct: 244 TPNSPNKPKMWTENWSGWFLSFGGRLPFRPTEDLAFSVARFFQRGGTFQNYYMYHGGTNF 303
Query: 292 GRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNS 351
GRT+GGP++ TSYDYDAPIDEYG + QPKWGHL+ELHK +K E L T G+
Sbjct: 304 GRTTGGPFIATSYDYDAPIDEYGIVRQPKWGHLKELHKAIKLCEAALVNAESNYTSLGSG 363
Query: 352 V-----------------------------SGSSYNLPAWSVSILPDCKTEEFNTAKVNT 382
+ +G+SY+LPAWSVSILPDCK FNTAK+ +
Sbjct: 364 LEAHVYSPGSGTCAAFLANSNTQSDATVKFNGNSYHLPAWSVSILPDCKNVVFNTAKIGS 423
Query: 383 QTNVKVKRP---------NQAGNDQA-PLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KS 431
QT P + G D A W W E I + G F+ L++Q +
Sbjct: 424 QTTSVQMNPANLILAGSNSMKGTDSANAASWSWLHEQIG---IGGSNTFSKPGLLEQINT 480
Query: 432 TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASN 491
T D SDYLWY T+ + D++P L + L + S G LH ++NG + +S
Sbjct: 481 TVDSSDYLWYTTSIQVDDNEPFLHNGTQPVLHVQSLGHALHVFINGEFAGRGAGSSSSSK 540
Query: 492 DLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSS 551
+ P+ L GKN I LLS TVGLQNYGS FD GI GPV+L G E DLS+
Sbjct: 541 IALQTPITLKSGKNNIDLLSITVGLQNYGSFFDTWGAGITGPVILQGFKDGE---HDLST 597
Query: 552 HKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNL 611
+WTY++GL G + Y+ S + + ++P + M WYKT F+AP NDPV LNL
Sbjct: 598 QQWTYQIGLTG-EQLGIYSGDTKASAQWVAGSDLPTKQPMIWYKTNFDAPSGNDPVALNL 656
Query: 612 QGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVP 671
GMGKG AWVNG ++GRYWP+Y+A + GC T+SCDYRG Y S KC NCG PSQ YHVP
Sbjct: 657 LGMGKGVAWVNGQSIGRYWPSYIASQSGC-TDSCDYRGAYSSTKCQTNCGQPSQKLYHVP 715
Query: 672 RSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHENK------------------ 713
RSWI+ N LVLFEE GG+P+QI+F T VG+ C Q E
Sbjct: 716 RSWIQPTGNVLVLFEELGGDPTQISFMTRSVGSLCAQVSETHLPPVDSWKSSATSGLEVN 775
Query: 714 ----TMELTCHGRR--ISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSC 767
++L C R I IK+ASFG +G+CG+F G C + ++E+ C+G++SC
Sbjct: 776 KPKAELQLHCPSSRHLIKSIKFASFGTSKGSCGSFTYGHCNTN-STMSIVEEACIGRESC 834
Query: 768 SIEASEANLGATSCAAGTVKRLVVEALC 795
S+E S G GTVK L VEA C
Sbjct: 835 SVEVSIEKFGDP--CKGTVKNLAVEASC 860
>gi|359478691|ref|XP_002285084.2| PREDICTED: beta-galactosidase 8-like [Vitis vinifera]
gi|297746241|emb|CBI16297.3| unnamed protein product [Vitis vinifera]
Length = 846
Score = 825 bits (2131), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/852 (50%), Positives = 530/852 (62%), Gaps = 79/852 (9%)
Query: 10 AILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
+L +L + S A V++D RA+ IDG+R++L+SGSIHYPRSTP MWPDLI+K+K+
Sbjct: 7 VFVLVSLLGAIATTSFASTVTYDHRALVIDGKRRVLISGSIHYPRSTPDMWPDLIQKSKD 66
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
GGLD IETYVFWN HEP+RRQYDF G DL++F+KT+ + GLYV LRIGPYVCAEWNYGG
Sbjct: 67 GGLDVIETYVFWNLHEPVRRQYDFKGRNDLVKFVKTVAEAGLYVHLRIGPYVCAEWNYGG 126
Query: 130 FPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGN 189
FP+WLH +PGI + RT N F EMQ FT IVDM KKE L+ASQGGPIIL+QIENEYGN
Sbjct: 127 FPLWLHFIPGI-QFRTDNGPFKEEMQIFTAKIVDMMKKENLYASQGGPIILSQIENEYGN 185
Query: 190 VMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNS 238
+ S YG A KSYI W A MATSLD GVPW+MCQ++DAP PM FTPN+
Sbjct: 186 IDSAYGSAAKSYIQWAASMATSLDTGVPWVMCQQADAPDPMINTCNGFYCDQFTPNSVKK 245
Query: 239 PKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGP 298
PK+WTENWTGWF S+GG P R ED+AFAVARFFQ GGTFQNYYMYHGGTNFGRT+GGP
Sbjct: 246 PKMWTENWTGWFLSFGGAVPYRPVEDIAFAVARFFQLGGTFQNYYMYHGGTNFGRTTGGP 305
Query: 299 YLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLT------------------- 339
++ TSYDYDAPIDEYG L QPKWGHL++LHK +K E L
Sbjct: 306 FIATSYDYDAPIDEYGLLRQPKWGHLKDLHKAIKLCEAALIATDPTITSLGTNLEASVYK 365
Query: 340 ---------YGNV-TNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVK 389
NV TN+D + SG+SY+LPAWSVSILPDCK NTA++N+ +
Sbjct: 366 TGTGSCAAFLANVRTNSDATVNFSGNSYHLPAWSVSILPDCKNVALNTAQINSMAVMPRF 425
Query: 390 RPNQAGNDQAPLQ-----WKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTN 444
ND W W E + + + L L T D SDYLWY +
Sbjct: 426 MQQSLKNDIDSSDGFQSGWSWVDEPVG--ISKNNAFTKLGLLEQINITADKSDYLWYSLS 483
Query: 445 ADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGK 504
+++ D+P L S L + S G LHA++NG S G + + PV L GK
Sbjct: 484 TEIQGDEPFLEDGSQTVLHVESLGHALHAFINGKLAGSGTGNSGNAKVTVDIPVTLIHGK 543
Query: 505 NQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLD 564
N I LLS TVGLQNYG+ +D GI GP+ L G A T+ DLSS +WTY+VGL G +
Sbjct: 544 NTIDLLSLTVGLQNYGAFYDKQGAGITGPIKLKGLANGTTV--DLSSQQWTYQVGLQGEE 601
Query: 565 DKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGY 624
+ +S + + +P + + WYKTTF+AP NDPV L+ GMGKG AWVNG
Sbjct: 602 ----LGLPSGSSSKWVAGSTLPKKQPLIWYKTTFDAPAGNDPVALDFMGMGKGEAWVNGQ 657
Query: 625 NLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVL 684
++GRYWP Y++ GC T SC+YRGPY S+KC NCG PSQ YHVPRSW++ NTLVL
Sbjct: 658 SIGRYWPAYVSSNGGC-TSSCNYRGPYSSNKCLKNCGKPSQQLYHVPRSWLQPSGNTLVL 716
Query: 685 FEEFGGNPSQINFQTVVVGTACGQAHE-------------------NKTMELTC--HGRR 723
FEE GG+P+QI+F T V + C + E + + L C +
Sbjct: 717 FEEIGGDPTQISFATKQVESLCSRVSEYHPLPVDMWGSDLTTGRKSSPMLSLECPFPNQV 776
Query: 724 ISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAA 783
IS IK+ASFG P+G CG+F C + L ++++ C+G KSCSI S G +
Sbjct: 777 ISSIKFASFGTPRGTCGSFSHSKCSSRT-ALSIVQEACIGSKSCSIGVSIDTFGDP--CS 833
Query: 784 GTVKRLVVEALC 795
G K L VEA C
Sbjct: 834 GIAKSLAVEASC 845
>gi|255578884|ref|XP_002530296.1| beta-galactosidase, putative [Ricinus communis]
gi|223530194|gb|EEF32103.1| beta-galactosidase, putative [Ricinus communis]
Length = 842
Score = 810 bits (2091), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 419/854 (49%), Positives = 540/854 (63%), Gaps = 83/854 (9%)
Query: 9 RAILLCLILQT-LFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKA 67
+ IL+ L S A V++D RA+ IDG+R++L+SGSIHYPRSTP MWP LI+K+
Sbjct: 4 KEILVVFFFSVVLAETSFAANVTYDHRALLIDGKRRVLISGSIHYPRSTPEMWPGLIQKS 63
Query: 68 KEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNY 127
K+GGLD IETYVFWN HEP+R QY+F G DL++F+K + + GLYV +RIGPYVCAEWNY
Sbjct: 64 KDGGLDVIETYVFWNGHEPVRNQYNFEGRYDLVKFVKLVAEAGLYVHIRIGPYVCAEWNY 123
Query: 128 GGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEY 187
GGFP+WLH +PGI+ RT N+ F EMQ FT IVDM K+EKL+ASQGGPIIL+QIENEY
Sbjct: 124 GGFPLWLHFIPGIK-FRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEY 182
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNP 236
GN+ S +G A K+YINW A MA SLD GVPW+MCQ++DAP P+ FTPN+
Sbjct: 183 GNIDSAFGPAAKTYINWAAGMAISLDTGVPWVMCQQADAPDPVINTCNGFYCDQFTPNSK 242
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
N PK+WTENW+GWF+S+GG P R EDLAFAVARF+Q GTFQNYYMYHGGTNFGRT+G
Sbjct: 243 NKPKMWTENWSGWFQSFGGAVPYRPVEDLAFAVARFYQLSGTFQNYYMYHGGTNFGRTTG 302
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLT----------------- 339
GP+++TSYDYDAP+DEYG L QPKWGHL+++HK +K E+ L
Sbjct: 303 GPFISTSYDYDAPLDEYGLLRQPKWGHLKDVHKAIKLCEEALIATDPTTTSLGSNLEATV 362
Query: 340 ----------YGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV-KV 388
N+ TD + +G+SYNLPAWSVSILPDCK NTAK+N+ T V
Sbjct: 363 YKTGSLCAAFLANIATTDKTVTFNGNSYNLPAWSVSILPDCKNVALNTAKINSVTIVPSF 422
Query: 389 KRPNQAGNDQAPLQ----WKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMT 443
R + G+ + W W E + + F + L++Q +T D SDYLWY
Sbjct: 423 ARQSLVGDVDSSKAIGSGWSWINEPVG---ISKNDAFVKSGLLEQINTTADKSDYLWYSL 479
Query: 444 NADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRG 503
+ ++K D+P L S L + S G LHA++NG S K + + P+ LT G
Sbjct: 480 STNIKGDEPFLEDGSQTVLHVESLGHALHAFINGKLAGSGTGKSSNAKVTVDIPITLTPG 539
Query: 504 KNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGL 563
KN I LLS TVGLQNYG+ +++ GI GPV L + G+ DLSS +WTY++GL G
Sbjct: 540 KNTIDLLSLTVGLQNYGAFYELTGAGITGPVKLKAQNGNTV---DLSSQQWTYQIGLKGE 596
Query: 564 DDKKFYNAKAANSERGWSSK-NVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVN 622
D + ++ S W S+ +P N+ + WYKT+F+AP NDPV ++ GMGKG AWVN
Sbjct: 597 D-----SGISSGSSSEWVSQPTLPKNQPLIWYKTSFDAPAGNDPVAIDFTGMGKGEAWVN 651
Query: 623 GYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTL 682
G ++GRYWPT ++ GC+ +SC+YRG Y S+KC NCG PSQ +YH+PRSWIK N L
Sbjct: 652 GQSIGRYWPTNVSPSSGCA-DSCNYRGGYSSNKCLKNCGKPSQTFYHIPRSWIKSSGNIL 710
Query: 683 VLFEEFGGNPSQINFQTVVVGTACGQAHENK-------------------TMELTC--HG 721
VL EE GG+P+QI F T VG+ C E+ + L C
Sbjct: 711 VLLEEIGGDPTQIAFATRQVGSLCSHVSESHPQPVDMWNTDSEGGKRSGPVLSLQCPHPD 770
Query: 722 RRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSC 781
+ IS IK+ASFG P G+CG++ G C + L +++K CVG KSC++ S G
Sbjct: 771 KVISSIKFASFGTPHGSCGSYSHGKC-SSTSALSIVQKACVGSKSCNVGVSINTFGDP-- 827
Query: 782 AAGTVKRLVVEALC 795
G K L VEA C
Sbjct: 828 CRGVKKSLAVEASC 841
>gi|357484445|ref|XP_003612510.1| Beta-galactosidase [Medicago truncatula]
gi|355513845|gb|AES95468.1| Beta-galactosidase [Medicago truncatula]
Length = 828
Score = 809 bits (2090), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/823 (49%), Positives = 553/823 (67%), Gaps = 69/823 (8%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A V +D A+ I+GER+++ SG+IHYPRST MWPDL++KAK+GGLDAIETY+FW+ HE
Sbjct: 22 ALEVKYDSNALIINGERRLIFSGAIHYPRSTVDMWPDLVQKAKDGGLDAIETYIFWDRHE 81
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
+R +Y+F+GNLD ++F KTIQ+ GLY I+RIGPY CAEWNYGGFPVWLH +PGI E+RT
Sbjct: 82 QVRGRYNFSGNLDFVKFFKTIQEAGLYGIIRIGPYSCAEWNYGGFPVWLHQIPGI-EMRT 140
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
N + NEMQ F T I+++AK+ LFASQGGPIILAQIENEYG++M ++ + GK+YI W
Sbjct: 141 DNAAYKNEMQIFVTKIINVAKEANLFASQGGPIILAQIENEYGDIMWNFKEPGKAYIKWA 200
Query: 206 AKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWG 254
A+MA + +IGVPW MCQ++DAP P+ F PNNP SPK++TENW GWF+ WG
Sbjct: 201 AQMALAQNIGVPWFMCQQNDAPQPIINTCNGYYCHNFKPNNPKSPKMFTENWIGWFQKWG 260
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
+ P RTAED A+AVARFFQ GG F NYYMYHGGTNFGRTSGGPY+ TSYDYDAPI+EYG
Sbjct: 261 ERAPHRTAEDSAYAVARFFQNGGVFNNYYMYHGGTNFGRTSGGPYIITSYDYDAPINEYG 320
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLT-YGNVTNTDYGNSVSGSSYN--------------- 358
+LNQPK+GHL+ LH+ +K EK LT Y + + D GN ++ ++Y
Sbjct: 321 NLNQPKYGHLKFLHEAIKLGEKVLTNYTSRNDKDLGNGITLTTYTNSVGARFCFLSNDKD 380
Query: 359 ---------------LPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQW 403
+PAWSV+IL C E FNTAKVN+QT++ K+ + + ++ L W
Sbjct: 381 NTDGNVDLQNDGKYFVPAWSVTILDGCNKEVFNTAKVNSQTSIMEKKIDNSSTNK--LTW 438
Query: 404 KWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTL 462
W E D + G+G + L++QK T D SDYLWYMT+ D+ D S SN L
Sbjct: 439 AWIMEPKKD-TMNGRGSIKAHQLLEQKELTLDASDYLWYMTSVDIND----TSNWSNANL 493
Query: 463 RINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSK 522
+ +SG LH YVN Y+ +++G +N +E+ V L G N I+LLSATVGL NYG++
Sbjct: 494 HVETSGHTLHGYVNKRYIGYGHSQFG-NNFTYEKQVSLKNGTNIITLLSATVGLANYGAR 552
Query: 523 FDMVPNGIP-GPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWS 581
FD + GI GPV LVG+ ++ DLS+ W++KVGL G + ++FY+ + S W+
Sbjct: 553 FDEIKTGISDGPVKLVGQ---NSVTIDLSTGNWSFKVGLNG-EKRRFYDLQ-PRSGVAWN 607
Query: 582 SKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCS 641
+ + P + +TWYKT F++PL +P+V++LQG+GKG AWVNG ++GRYW +++ GCS
Sbjct: 608 TSSYPTGKPLTWYKTQFKSPLGPNPIVVDLQGLGKGHAWVNGKSIGRYWTSWITSTAGCS 667
Query: 642 TESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVV 701
++CDYRG Y +KC C +PSQ WYHVPRS++ D +NTL+LFEE GGNP ++F T
Sbjct: 668 -DTCDYRGNYKKEKCNTGCASPSQRWYHVPRSFLNDDMNTLILFEEIGGNPQNVSFLTET 726
Query: 702 VGTACGQAHENKTMELTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQ 760
T C +E +EL+C G+ I+ I +ASFG+PQG CG+FKKGS E+ ++ ++E
Sbjct: 727 TKTICANVYEGGKLELSCQNGQVITSINFASFGNPQGQCGSFKKGSWES-LNSQSMMETS 785
Query: 761 CVGKKSCSIEASE----ANLGATSCAAGTVK----RLVVEALC 795
C+GK C + NL S + +VK RL V+A C
Sbjct: 786 CIGKTGCGFTVTRDMFGVNLDPLSASKASVKDGIPRLAVQATC 828
>gi|297822423|ref|XP_002879094.1| beta-glactosidase 8 [Arabidopsis lyrata subsp. lyrata]
gi|297324933|gb|EFH55353.1| beta-glactosidase 8 [Arabidopsis lyrata subsp. lyrata]
Length = 846
Score = 805 bits (2080), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/855 (50%), Positives = 537/855 (62%), Gaps = 88/855 (10%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
++L LILQ + + A V++D RA+ IDG+RK+L+SGSIHYPRSTP MWP+LIKK+K+G
Sbjct: 9 MILLLILQIMM-AATAVNVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIKKSKDG 67
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GLD IETYVFW+ HEP + +Y+F G DL++F+K +++ GLYV LRIGPYVCAEWNYGGF
Sbjct: 68 GLDVIETYVFWSGHEPEKNKYNFEGRYDLVKFVKLVEEAGLYVHLRIGPYVCAEWNYGGF 127
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
PVWLH +PGI + RT N+ F EMQ FTT IVD+ K+EKL+ASQGGPIIL+QIENEYGN+
Sbjct: 128 PVWLHFVPGI-KFRTDNEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNI 186
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSP 239
S YG A K YI W A MA SLD GVPW MCQ++DAP PM FTPN+ + P
Sbjct: 187 DSAYGAAAKIYIKWSASMALSLDTGVPWNMCQQADAPDPMINTCNGFYCDQFTPNSNSKP 246
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
K+WTENW+GWF +G P R EDLAFAVARF+Q GGTFQNYYMYHGGTNF RTSGGP
Sbjct: 247 KMWTENWSGWFLGFGDPSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPL 306
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLT-------------------- 339
++TSYDYDAPIDEYG L QPKWGHLR+LHK +K E L
Sbjct: 307 ISTSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKLCEDALIATDPTISSLGSNLEAAVYKT 366
Query: 340 --------YGNV-TNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTN----- 385
NV T +D S +G SY+LPAWSVSILPDCK FNTAK+N+ T
Sbjct: 367 ASGSCAAFLANVGTKSDATVSFNGESYHLPAWSVSILPDCKNVAFNTAKINSATEPTAFA 426
Query: 386 VKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNA 445
+ +P+ + + +W + E I + + L +T D SDYLWY
Sbjct: 427 RQSLKPDGGSSAELGSEWSYIKEPIG--ISKADAFLKPGLLEQINTTADKSDYLWYSLRM 484
Query: 446 DLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKN 505
D+K D+ L S L I S GQV++A++NG S K S D+ P+ L GKN
Sbjct: 485 DIKGDETFLDEGSKAVLHIESLGQVVYAFINGKLAGSGHGKQKISLDI---PINLAAGKN 541
Query: 506 QISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDD 565
+ LLS TVGL NYG+ FD+V GI GPV L G +I DL+S +WTY+VGL G D
Sbjct: 542 TVDLLSVTVGLANYGAFFDLVGAGITGPVTLKSAKGGSSI--DLASQQWTYQVGLKGED- 598
Query: 566 KKFYNAKAANSERGWSSKN-VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGY 624
A W SK+ +P + + WYKTTF+AP ++PV ++ G GKG AWVNG
Sbjct: 599 ----TGLATVDSSEWVSKSPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQ 654
Query: 625 NLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVL 684
++GRYWPT +A GC T+SCDYRG Y ++KC NCG PSQ YHVPRSW+K NTLVL
Sbjct: 655 SIGRYWPTSIAGNGGC-TDSCDYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNTLVL 713
Query: 685 FEEFGGNPSQINFQTVVVGT----ACGQAH---------------ENKT---MELTC--H 720
FEE GG+P+QI+F T G+ Q+H N+T + L C
Sbjct: 714 FEEMGGDPTQISFGTKQTGSNLCLMVSQSHPPPVDTWTSDSKISNRNRTRPVLSLKCPVS 773
Query: 721 GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATS 780
+ IS IK+ASFG PQG CG+F G C + L +++K C+G +SC++E S G
Sbjct: 774 TQVISSIKFASFGTPQGTCGSFTHGHCNSSRS-LSVVQKACIGSRSCNVEVSTRVFGEP- 831
Query: 781 CAAGTVKRLVVEALC 795
G +K L VEA C
Sbjct: 832 -CRGVIKSLAVEASC 845
>gi|14970841|emb|CAC44501.1| beta-galactosidase [Fragaria x ananassa]
Length = 840
Score = 803 bits (2075), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/834 (50%), Positives = 528/834 (63%), Gaps = 91/834 (10%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D RA+ IDG+R++L+SGSIHYPRSTP MWPDLI+K+K+GGLD IETYVFWN HEP+R
Sbjct: 30 VSYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 89
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QY+F G DL+ F+K + + GLYV LRIGPYVCAEWNYGGFP+WLH +PGI +LRT N+
Sbjct: 90 GQYNFEGRNDLVGFVKAVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGI-KLRTDNE 148
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
+ EM FT IV+M K EKL+ASQGGPIIL+QIENEYGN+ YG A K+YINW A M
Sbjct: 149 PYKAEMHRFTAKIVEMMKNEKLYASQGGPIILSQIENEYGNIDKAYGPAAKTYINWAANM 208
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A SLD GVPW+MCQ++DAPS + F+PN+ ++PKIWTENW+GWF S+GG
Sbjct: 209 AVSLDTGVPWVMCQQADAPSSVINTCNGFYCDQFSPNSNSTPKIWTENWSGWFLSFGGAV 268
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P+R EDLAFAVARF+Q GGTFQNYYMYHGGTNFGR+SGGP++ TSYDYDAP+DEYG L
Sbjct: 269 PQRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSSGGPFIATSYDYDAPLDEYGLLR 328
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSV------------------------- 352
QPKWGHL+++HK +K E + + T + G ++
Sbjct: 329 QPKWGHLKDVHKAIKLCEPAMVATDPTISSLGQNIEAAVYKTGSVCSAFLANVDTKSDAT 388
Query: 353 ---SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQ-----WK 404
+G+SY LPAWSVSILPDCK NTAK+NT T V D P + W
Sbjct: 389 VTFNGNSYQLPAWSVSILPDCKNVVINTAKINTATMVPSFTRQSISADVEPTEAVGSGWS 448
Query: 405 WRPEMINDFVVRGKGH-FALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTL 462
W IN+ V KG F L++Q +T D SDYLWY T+ D+K G L
Sbjct: 449 W----INEPVGISKGDAFTRVGLLEQINTTADKSDYLWYSTSIDVK-------GGYKADL 497
Query: 463 RINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSK 522
+ S G LHA+VNG S G + E PV+ GKN I LLS TVGLQNYG+
Sbjct: 498 HVQSLGHALHAFVNGKLAGSGTGNSGNAKVSVEIPVEFASGKNTIDLLSLTVGLQNYGAF 557
Query: 523 FDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSS 582
FD+V GI GPV L G A TI DLSS +WTY++GL G D+ + S + S
Sbjct: 558 FDLVGAGITGPVQLKGSANGTTI--DLSSQQWTYQIGLKGEDED-----LPSGSSQWISQ 610
Query: 583 KNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCST 642
+P N+ +TWYKT F+AP ++PV L+ GMGKG AWVNG ++GRYWPT +A + GC+
Sbjct: 611 PTLPKNQPLTWYKTQFDAPGGSNPVALDFTGMGKGEAWVNGQSIGRYWPTNVAPKTGCT- 669
Query: 643 ESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVV 702
C+YRG Y +DKC NCG PSQ YHVPRSW+K NTLVLFEE GG+P+Q++F T V
Sbjct: 670 -DCNYRGAYSADKCRKNCGMPSQKLYHVPRSWMKSSGNTLVLFEEVGGDPTQLSFATRQV 728
Query: 703 GTACGQAHENK---------------------TMELTCHGRRISEIKYASFGDPQGACGA 741
+ C E+ ++E + IS IK+AS+G P G CG+
Sbjct: 729 ESLCSHVSESHPSPVDMWSSDSKAGSKSRPRLSLECPFPNQVISSIKFASYGRPSGTCGS 788
Query: 742 FKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
F GSC + L +++K CVG KSCSIE S G G K L VEA C
Sbjct: 789 FSHGSCRSS-RALSIVQKACVGSKSCSIEVSTHTFGDP--CKGLAKSLAVEASC 839
>gi|61614851|gb|AAQ21371.2| beta-galactosidase [Sandersonia aurantiaca]
Length = 818
Score = 802 bits (2072), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/834 (51%), Positives = 524/834 (62%), Gaps = 92/834 (11%)
Query: 37 TIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGN 96
IDG R++L+SGSIHYPRSTP MWPDLI K+K GGLD IETYVFW+ HEPL+ QYDF G
Sbjct: 1 VIDGTRRVLISGSIHYPRSTPEMWPDLIDKSKSGGLDIIETYVFWDLHEPLQGQYDFQGR 60
Query: 97 LDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQN 156
DL+RFIKT+ + GLYV LRIGPY CAEWNYGGFP+WLH +PGI + RT NK F +EMQ
Sbjct: 61 KDLVRFIKTVGEAGLYVHLRIGPYACAEWNYGGFPLWLHFIPGI-KFRTDNKPFKDEMQR 119
Query: 157 FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGV 216
FTT IVD+ K+E L+ASQGGPIIL+QIENEYGN+ YG A KSYINW A MATSLD GV
Sbjct: 120 FTTKIVDLMKQENLYASQGGPIILSQIENEYGNIDFAYGAAAKSYINWAASMATSLDTGV 179
Query: 217 PWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDL 265
PW+MCQ++DAP P+ F+PN+ N PKIWTENW+GWF S+GG P+R EDL
Sbjct: 180 PWVMCQQTDAPDPIINTCNGFYCDQFSPNSNNKPKIWTENWSGWFLSFGGPVPQRPVEDL 239
Query: 266 AFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLR 325
AFAVARFFQ GGTFQNYYMY G NFG TSGGP++ TSYDYDAPIDEYG QPKWGHL+
Sbjct: 240 AFAVARFFQRGGTFQNYYMYTWGNNFGHTSGGPFIATSYDYDAPIDEYGITRQPKWGHLK 299
Query: 326 ELHKLLKSMEKTLT----------------------------YGNV-TNTDYGNSVSGSS 356
ELHK +K E L N+ T +D + +G S
Sbjct: 300 ELHKAIKLCEPALVATDHHTLRLGPNLEAHVYKTASGVCAAFLANIGTQSDATVTFNGKS 359
Query: 357 YNLPAWSVSILPDCKTEEFNTAKVNTQT--------NVKVKRPNQAGNDQAPLQWKWRPE 408
Y+LPAWSVSILPDC+T FNTA++N+Q N + +Q Q W
Sbjct: 360 YSLPAWSVSILPDCRTVVFNTAQINSQAIHSEMKYLNSESLTSDQQIGSSEVFQSDW--- 416
Query: 409 MINDFVVRGKGHFALNT-----LIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTL 462
FV+ G N L++Q +T DVSDYLWY + + D+P LS + L
Sbjct: 417 ---SFVIEPVGISKSNAIRKTGLLEQINTTADVSDYLWYSISIAIDGDEPFLSNGTQSNL 473
Query: 463 RINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSK 522
S G VLHA+VNG S G + +FE+ + LT G N I LLSATVGLQNYG+
Sbjct: 474 HAESLGHVLHAFVNGKLAGSGIGNSGNAKIIFEKLIMLTPGNNSIDLLSATVGLQNYGAF 533
Query: 523 FDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSS 582
FD++ GI GPV L G+ G DLSS+ WTY++GL G D N + + + S
Sbjct: 534 FDLMGAGITGPVKLKGQNG----TLDLSSNAWTYQIGLKGEDLSLHEN--SGDVSQWISE 587
Query: 583 KNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCST 642
+P N+ + WYKTTF AP NDPV ++ GMGKG AWVNG ++GRYWPTY + ++GCST
Sbjct: 588 STLPKNQPLIWYKTTFNAPDGNDPVAIDFTGMGKGEAWVNGQSIGRYWPTYSSPQNGCST 647
Query: 643 ESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVV 702
+C+YRGPY + KC NCG PSQI YHVPRS+I+ NTLVLFEE GG+P+QI+ T +
Sbjct: 648 -ACNYRGPYSASKCIKNCGKPSQILYHVPRSFIQSESNTLVLFEEMGGDPTQISLATKQM 706
Query: 703 GTACGQAHENK-------------------TMELTC--HGRRISEIKYASFGDPQGACGA 741
+ C E+ T++L C + IS IK+ASFG P G CG+
Sbjct: 707 TSLCAHVSESHPAPVDTWLSLQQKGKKSGPTIQLECPYPNQVISSIKFASFGTPSGMCGS 766
Query: 742 FKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
F C + VL +++K CVG K CS+ S LG G +K L VEA C
Sbjct: 767 FNHSQCSSA-SVLAVVQKACVGSKRCSVGISSKTLGDP--CRGVIKSLAVEAAC 817
>gi|357453873|ref|XP_003597217.1| Beta-galactosidase [Medicago truncatula]
gi|355486265|gb|AES67468.1| Beta-galactosidase [Medicago truncatula]
Length = 833
Score = 800 bits (2065), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/856 (50%), Positives = 543/856 (63%), Gaps = 94/856 (10%)
Query: 9 RAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAK 68
RA + L+L V +D RA+ IDG+R++L+SGSIHYPRSTP MWPDLI+K+K
Sbjct: 2 RAFEIVLVLLWFLPKMFCTNVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSK 61
Query: 69 EGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYG 128
+GGLD IETYVFWN HEP++ QYDF G DL++F+K + + GLYV LRIGPYVCAEWNYG
Sbjct: 62 DGGLDVIETYVFWNLHEPVKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCAEWNYG 121
Query: 129 GFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYG 188
GFP+WLH +PGI+ RT N+ F EM+ FT IVD+ K+EKL+ASQGGPIIL+QIENEYG
Sbjct: 122 GFPLWLHFIPGIK-FRTDNEPFKAEMKRFTAKIVDLMKQEKLYASQGGPIILSQIENEYG 180
Query: 189 NVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPN 237
N+ S YG AGKSYINW AKMATSLD GVPW+MCQ+ DAP P+ FTPN+
Sbjct: 181 NIDSHYGSAGKSYINWAAKMATSLDTGVPWVMCQQGDAPDPIINTCNGFYCDQFTPNSNT 240
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
PK+WTENW+GWF S+GG P R EDLAFAVARFFQ GGTFQNYYMYHGGTNF R++GG
Sbjct: 241 KPKMWTENWSGWFLSFGGAVPHRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRSTGG 300
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGN--------------- 342
P++ TSYDYDAPIDEYG + Q KWGHL+++HK +K E+ L +
Sbjct: 301 PFIATSYDYDAPIDEYGIIRQQKWGHLKDVHKAIKLCEEALIATDPKISSLGQNLEAAVY 360
Query: 343 ---------VTNTDYGN----SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVK 389
+ N D N + SG+SY+LPAWSVSILPDCK NTAK+N+ + +
Sbjct: 361 KTGSVCAAFLANVDTKNDKTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKINSASAIS-- 418
Query: 390 RPNQAGNDQAPLQ-----WKWRPEMINDFVVRGKGHFALNT-LIDQ-KSTNDVSDYLWYM 442
N D + L+ W W IN+ V K T L++Q +T D SDYLWY
Sbjct: 419 --NFVTEDISSLETSSSKWSW----INEPVGISKDDILSKTGLLEQINTTADRSDYLWYS 472
Query: 443 TNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTR 502
+ DL DD S L I S G LHA++NG +Q S + P+ L
Sbjct: 473 LSLDLADDP-----GSQTVLHIESLGHALHAFINGKLAGNQAGNSDKSKLNVDIPIALVS 527
Query: 503 GKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVG-RAGDETIIKDLSSHKWTYKVGLY 561
GKN+I LLS TVGLQNYG+ FD V GI GPV+L G + G+ T+ DLSS KWTY++GL
Sbjct: 528 GKNKIDLLSLTVGLQNYGAFFDTVGAGITGPVILKGLKNGNNTL--DLSSRKWTYQIGLK 585
Query: 562 GLDDKKFYNAKAANSERGWSSKNV-PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAW 620
G D ++ S GW+S++ P N+ + WYKT F+AP ++PV ++ GMGKG AW
Sbjct: 586 GED-----LGLSSGSSGGWNSQSTYPKNQPLVWYKTNFDAPSGSNPVAIDFTGMGKGEAW 640
Query: 621 VNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVN 680
VNG ++GRYWPTY+A GC T+SC+YRGPY S KC NCG PSQ YHVPRS++K N
Sbjct: 641 VNGQSIGRYWPTYVASNAGC-TDSCNYRGPYTSSKCRKNCGKPSQTLYHVPRSFLKPNGN 699
Query: 681 TLVLFEEFGGNPSQINFQTVVVGTACGQAHENK-------------------TMELTC-- 719
TLVLFEE GG+P+QI+F T + + C ++ + L+C
Sbjct: 700 TLVLFEENGGDPTQISFATKQLESVCSHVSDSHPPQIDLWNQDTESGGKVGPALLLSCPN 759
Query: 720 HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGAT 779
H + IS IK+AS+G P G CG F +G C + L +++K C+G +SCS+ S G
Sbjct: 760 HNQVISSIKFASYGTPLGTCGNFYRGRCSSN-KALSIVKKACIGSRSCSVGVSTDTFGDP 818
Query: 780 SCAAGTVKRLVVEALC 795
G K L VEA C
Sbjct: 819 --CRGVPKSLAVEATC 832
>gi|334184536|ref|NP_001189624.1| beta-galactosidase 8 [Arabidopsis thaliana]
gi|330253034|gb|AEC08128.1| beta-galactosidase 8 [Arabidopsis thaliana]
Length = 846
Score = 799 bits (2064), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/841 (50%), Positives = 528/841 (62%), Gaps = 89/841 (10%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A V++D RA+ IDG+RK+L+SGSIHYPRSTP MWP+LI+K+K+GGLD IETYVFW+ HE
Sbjct: 23 AANVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHE 82
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P + +Y+F G DL++F+K GLYV LRIGPYVCAEWNYGGFPVWLH +PGI+ RT
Sbjct: 83 PEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIK-FRT 141
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
N+ F EMQ FTT IVD+ K+EKL+ASQGGPIIL+QIENEYGN+ S YG A KSYI W
Sbjct: 142 DNEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWS 201
Query: 206 AKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWG 254
A MA SLD GVPW MCQ++DAP PM FTPN+ N PK+WTENW+GWF +G
Sbjct: 202 ASMALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFG 261
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
P R EDLAFAVARF+Q GGTFQNYYMYHGGTNF RTSGGP ++TSYDYDAPIDEYG
Sbjct: 262 DPSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYG 321
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSV---------------------- 352
L QPKWGHLR+LHK +K E L + T T G+++
Sbjct: 322 LLRQPKWGHLRDLHKAIKLCEDALIATDPTITSLGSNLEAAVYKTESGSCAAFLANVDTK 381
Query: 353 -------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV-----KVKRPNQAGNDQAP 400
+G SYNLPAWSVSILPDCK FNTAK+N+ T + +P+ + +
Sbjct: 382 SDATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKINSATESTAFARQSLKPDGGSSAELG 441
Query: 401 LQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSN 459
QW + E I + F L++Q +T D SDYLWY D+K D+ L S
Sbjct: 442 SQWSYIKEPIG---ISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLDEGSK 498
Query: 460 MTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNY 519
L I S GQV++A++NG S K S D+ P+ L G N I LLS TVGL NY
Sbjct: 499 AVLHIESLGQVVYAFINGKLAGSGHGKQKISLDI---PINLVTGTNTIDLLSVTVGLANY 555
Query: 520 GSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERG 579
G+ FD+V GI GPV L G +I DL+S +WTY+VGL G D A
Sbjct: 556 GAFFDLVGAGITGPVTLKSAKGGSSI--DLASQQWTYQVGLKGED-----TGLATVDSSE 608
Query: 580 WSSKN-VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEED 638
W SK+ +P + + WYKTTF+AP ++PV ++ G GKG AWVNG ++GRYWPT +A
Sbjct: 609 WVSKSPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNG 668
Query: 639 GCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQ 698
GC TESCDYRG Y ++KC NCG PSQ YHVPRSW+K N LVLFEE GG+P+QI+F
Sbjct: 669 GC-TESCDYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQISFA 727
Query: 699 TVVVGT----ACGQAH---------------ENKT---MELTC--HGRRISEIKYASFGD 734
T G+ Q+H N+T + L C + I IK+ASFG
Sbjct: 728 TKQTGSNLCLTVSQSHPPPVDTWTSDSKISNRNRTRPVLSLKCPISTQVIFSIKFASFGT 787
Query: 735 PQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEAL 794
P+G CG+F +G C + L L++K C+G +SC++E S G G VK L VEA
Sbjct: 788 PKGTCGSFTQGHCNSSRS-LSLVQKACIGLRSCNVEVSTRVFGEP--CRGVVKSLAVEAS 844
Query: 795 C 795
C
Sbjct: 845 C 845
>gi|30683905|ref|NP_850121.1| beta-galactosidase 8 [Arabidopsis thaliana]
gi|152013364|sp|Q9SCV4.2|BGAL8_ARATH RecName: Full=Beta-galactosidase 8; Short=Lactase 8; AltName:
Full=Protein AR782; Flags: Precursor
gi|330253033|gb|AEC08127.1| beta-galactosidase 8 [Arabidopsis thaliana]
Length = 852
Score = 799 bits (2063), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/841 (50%), Positives = 528/841 (62%), Gaps = 89/841 (10%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A V++D RA+ IDG+RK+L+SGSIHYPRSTP MWP+LI+K+K+GGLD IETYVFW+ HE
Sbjct: 29 AANVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHE 88
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P + +Y+F G DL++F+K GLYV LRIGPYVCAEWNYGGFPVWLH +PGI+ RT
Sbjct: 89 PEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIK-FRT 147
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
N+ F EMQ FTT IVD+ K+EKL+ASQGGPIIL+QIENEYGN+ S YG A KSYI W
Sbjct: 148 DNEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWS 207
Query: 206 AKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWG 254
A MA SLD GVPW MCQ++DAP PM FTPN+ N PK+WTENW+GWF +G
Sbjct: 208 ASMALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFG 267
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
P R EDLAFAVARF+Q GGTFQNYYMYHGGTNF RTSGGP ++TSYDYDAPIDEYG
Sbjct: 268 DPSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYG 327
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSV---------------------- 352
L QPKWGHLR+LHK +K E L + T T G+++
Sbjct: 328 LLRQPKWGHLRDLHKAIKLCEDALIATDPTITSLGSNLEAAVYKTESGSCAAFLANVDTK 387
Query: 353 -------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV-----KVKRPNQAGNDQAP 400
+G SYNLPAWSVSILPDCK FNTAK+N+ T + +P+ + +
Sbjct: 388 SDATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKINSATESTAFARQSLKPDGGSSAELG 447
Query: 401 LQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSN 459
QW + E I + F L++Q +T D SDYLWY D+K D+ L S
Sbjct: 448 SQWSYIKEPIG---ISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLDEGSK 504
Query: 460 MTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNY 519
L I S GQV++A++NG S K S D+ P+ L G N I LLS TVGL NY
Sbjct: 505 AVLHIESLGQVVYAFINGKLAGSGHGKQKISLDI---PINLVTGTNTIDLLSVTVGLANY 561
Query: 520 GSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERG 579
G+ FD+V GI GPV L G +I DL+S +WTY+VGL G D A
Sbjct: 562 GAFFDLVGAGITGPVTLKSAKGGSSI--DLASQQWTYQVGLKGED-----TGLATVDSSE 614
Query: 580 WSSKN-VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEED 638
W SK+ +P + + WYKTTF+AP ++PV ++ G GKG AWVNG ++GRYWPT +A
Sbjct: 615 WVSKSPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNG 674
Query: 639 GCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQ 698
GC TESCDYRG Y ++KC NCG PSQ YHVPRSW+K N LVLFEE GG+P+QI+F
Sbjct: 675 GC-TESCDYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQISFA 733
Query: 699 TVVVGT----ACGQAH---------------ENKT---MELTC--HGRRISEIKYASFGD 734
T G+ Q+H N+T + L C + I IK+ASFG
Sbjct: 734 TKQTGSNLCLTVSQSHPPPVDTWTSDSKISNRNRTRPVLSLKCPISTQVIFSIKFASFGT 793
Query: 735 PQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEAL 794
P+G CG+F +G C + L L++K C+G +SC++E S G G VK L VEA
Sbjct: 794 PKGTCGSFTQGHCNSSRS-LSLVQKACIGLRSCNVEVSTRVFGEP--CRGVVKSLAVEAS 850
Query: 795 C 795
C
Sbjct: 851 C 851
>gi|6686888|emb|CAB64744.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 852
Score = 798 bits (2060), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 426/841 (50%), Positives = 528/841 (62%), Gaps = 89/841 (10%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A V++D RA+ IDG+RK+L+SGSIHYPRSTP MWP+LI+K+K+GGLD IETYVFW+ HE
Sbjct: 29 AANVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHE 88
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P + +Y+F G DL++F+K GLYV LRIGPYVCAEWNYGGFPVWLH +PGI+ RT
Sbjct: 89 PEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIK-FRT 147
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
N+ F EMQ FTT IVD+ K+EKL+ASQGGPIIL+QIENEYGN+ S YG A KSYI W
Sbjct: 148 DNEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWS 207
Query: 206 AKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWG 254
A MA SLD GVPW MCQ++DAP PM FTPN+ N PK+WTENW+GWF +G
Sbjct: 208 ASMALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFG 267
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
P R EDLAFAVARF+Q GGTFQNYYMYHGGTNF RTSGGP ++TSYDYDAPIDEYG
Sbjct: 268 DPSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYG 327
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSV---------------------- 352
L QPKWGHLR+LHK +K E L + T T G+++
Sbjct: 328 LLRQPKWGHLRDLHKAIKLCEDALIATDPTITSLGSNLEAAVYKTESGSCAAFLANVDTK 387
Query: 353 -------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV-----KVKRPNQAGNDQAP 400
+G SYNLPAWSVSILPDCK FNTAK+N+ T + +P+ + +
Sbjct: 388 SDATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKINSATESTAFARQSLKPDGGSSAELG 447
Query: 401 LQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSN 459
QW + E I + F L++Q +T D SDYLWY D+K D+ L S
Sbjct: 448 SQWSYIKEPIG---ISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLDEGSK 504
Query: 460 MTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNY 519
L I S GQV++A++NG S K S D+ P+ L G N I LLS TVGL NY
Sbjct: 505 AVLHIESLGQVVYAFINGKLAGSGHGKQKISLDI---PINLVTGTNTIDLLSVTVGLANY 561
Query: 520 GSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERG 579
G+ FD++ GI GPV L G +I DL+S +WTY+VGL G D A
Sbjct: 562 GAFFDLMGAGITGPVTLKSAKGGSSI--DLASQQWTYQVGLKGED-----TGLATVDSSE 614
Query: 580 WSSKN-VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEED 638
W SK+ +P + + WYKTTF+AP ++PV ++ G GKG AWVNG ++GRYWPT +A
Sbjct: 615 WVSKSPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNG 674
Query: 639 GCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQ 698
GC TESCDYRG Y ++KC NCG PSQ YHVPRSW+K N LVLFEE GG+P+QI+F
Sbjct: 675 GC-TESCDYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQISFA 733
Query: 699 TVVVGT----ACGQAH---------------ENKT---MELTC--HGRRISEIKYASFGD 734
T G+ Q+H N+T + L C + I IK+ASFG
Sbjct: 734 TKQTGSNLCLTVSQSHPPPVDTWTSDSKISNRNRTRPVLSLKCPISTQVIFSIKFASFGT 793
Query: 735 PQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEAL 794
P+G CG+F +G C + L L++K C+G +SC++E S G G VK L VEA
Sbjct: 794 PKGTCGSFTQGHCNSSRS-LSLVQKACIGLRSCNVEVSTRVFGEP--CRGVVKSLAVEAS 850
Query: 795 C 795
C
Sbjct: 851 C 851
>gi|224106752|ref|XP_002314274.1| predicted protein [Populus trichocarpa]
gi|222850682|gb|EEE88229.1| predicted protein [Populus trichocarpa]
Length = 849
Score = 798 bits (2060), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/847 (49%), Positives = 527/847 (62%), Gaps = 84/847 (9%)
Query: 16 ILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAI 75
+L TL S V++D RA+ IDG+R++L+SGSIHYPRST MW DLI+K+K+GGLD I
Sbjct: 19 VLLTLATTSYGVNVTYDHRALLIDGKRRVLVSGSIHYPRSTVEMWADLIQKSKDGGLDVI 78
Query: 76 ETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLH 135
ETYVFWNAHEP++ QY+F G DL++FIK + + GLY LRIGPYVCAEWNYGGFP+WLH
Sbjct: 79 ETYVFWNAHEPVQNQYNFEGRYDLVKFIKLVGEAGLYAHLRIGPYVCAEWNYGGFPLWLH 138
Query: 136 NMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYG 195
+PGI+ RT N+ F EMQ FT IVDM K+EKL+ASQGGPIIL+QIENEYGN+ S YG
Sbjct: 139 FVPGIK-FRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSSYG 197
Query: 196 DAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTE 244
A KSYINW A MA SLD GVPW+MCQ++DAP P+ FTPN+ N PK+WTE
Sbjct: 198 PAAKSYINWAASMAVSLDTGVPWVMCQQADAPDPIINTCNGFYCDQFTPNSKNKPKMWTE 257
Query: 245 NWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSY 304
NW+GWF S+GG P R EDLAFAVARF+Q GGTFQNYYMYHGGTNFGR++GGP+++TSY
Sbjct: 258 NWSGWFLSFGGAVPYRPVEDLAFAVARFYQLGGTFQNYYMYHGGTNFGRSTGGPFISTSY 317
Query: 305 DYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLT------------------------- 339
DYDAP+DEYG QPKWGHL++LHK +K E+ L
Sbjct: 318 DYDAPLDEYGLTRQPKWGHLKDLHKSIKLCEEALVATDPVTSSLGQNLEATVYKTGTGLC 377
Query: 340 ---YGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK--VKRPNQA 394
N +D + +G+SYNLP WSVSILPDCK NTAK+N+ T + V +
Sbjct: 378 SAFLANFGTSDKTVNFNGNSYNLPGWSVSILPDCKNVALNTAKINSMTVIPNFVHQSLIG 437
Query: 395 GNDQAPL---QWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDD 450
D A W W E + + F L++Q +T D SDYLWY + +KD+
Sbjct: 438 DADSADTLGSSWSWIYEPVG---ISKNDAFVKPGLLEQINTTADKSDYLWYSLSTVIKDN 494
Query: 451 DPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLL 510
+P L S L + S G LHA+VNG S G + E PV L GKN I LL
Sbjct: 495 EPFLEDGSQTVLHVESLGHALHAFVNGKLAGSGTGNAGNAKVAVEIPVTLLPGKNTIDLL 554
Query: 511 SATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYN 570
S T GLQNYG+ F++ GI GPV L G T+ DLSS +WTY++GL G +
Sbjct: 555 SLTAGLQNYGAFFELEGAGITGPVKLEGLKNGTTV--DLSSLQWTYQIGLKGEE----LG 608
Query: 571 AKAANSERGWSSK-NVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRY 629
+ NS+ W ++ +P + + WYKT+F AP NDP+ ++ GMGKG AWVNG ++GRY
Sbjct: 609 LSSGNSQ--WVTQPALPTKQPLIWYKTSFNAPAGNDPIAIDFSGMGKGEAWVNGQSIGRY 666
Query: 630 WPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
WPT ++ GCS +C+YRG Y S KC NC PSQ YHVPRSW++ NTLVLFEE G
Sbjct: 667 WPTKVSPTSGCS--NCNYRGSYSSSKCLKNCAKPSQTLYHVPRSWVESSGNTLVLFEEIG 724
Query: 690 GNPSQINFQTVVVGTACGQAHENK---------------------TMELTCHGRRISEIK 728
G+P+QI F T + C E+ ++E + IS IK
Sbjct: 725 GDPTQIAFATKQSASLCSHVSESHPLPVDMWSSNSEAERKAGPVLSLECPFPNQVISSIK 784
Query: 729 YASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKR 788
+ASFG P+G CG+F G C++ L +++K C+G KSCSI AS + G G K
Sbjct: 785 FASFGTPRGTCGSFSHGQCKS-TRALSIVQKACIGSKSCSIGASASTFGDP--CRGVAKS 841
Query: 789 LVVEALC 795
L VEA C
Sbjct: 842 LAVEASC 848
>gi|4510395|gb|AAD21482.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 839
Score = 796 bits (2057), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/836 (51%), Positives = 526/836 (62%), Gaps = 86/836 (10%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A V++D RA+ IDG+RK+L+SGSIHYPRSTP MWP+LI+K+K+GGLD IETYVFW+ HE
Sbjct: 23 AANVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHE 82
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P + +Y+F G DL++F+K GLYV LRIGPYVCAEWNYGGFPVWLH +PGI+ RT
Sbjct: 83 PEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIK-FRT 141
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
N+ F EMQ FTT IVD+ K+EKL+ASQGGPIIL+QIENEYGN+ S YG A KSYI W
Sbjct: 142 DNEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWS 201
Query: 206 AKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWG 254
A MA SLD GVPW MCQ++DAP PM FTPN+ N PK+WTENW+GWF +G
Sbjct: 202 ASMALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFG 261
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
P R EDLAFAVARF+Q GGTFQNYYMYHGGTNF RTSGGP ++TSYDYDAPIDEYG
Sbjct: 262 DPSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYG 321
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSV---------------------- 352
L QPKWGHLR+LHK +K E L + T T G+++
Sbjct: 322 LLRQPKWGHLRDLHKAIKLCEDALIATDPTITSLGSNLEAAVYKTESGSCAAFLANVDTK 381
Query: 353 -------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKW 405
+G SYNLPAWSVSILPDCK FNTAKV + N K P+ + + QW +
Sbjct: 382 SDATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKV--KFNSISKTPDGGSSAELGSQWSY 439
Query: 406 RPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRI 464
E I + F L++Q +T D SDYLWY D+K D+ L S L I
Sbjct: 440 IKEPIG---ISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLDEGSKAVLHI 496
Query: 465 NSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFD 524
S GQV++A++NG S K S D+ P+ L G N I LLS TVGL NYG+ FD
Sbjct: 497 ESLGQVVYAFINGKLAGSGHGKQKISLDI---PINLVTGTNTIDLLSVTVGLANYGAFFD 553
Query: 525 MVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
+V GI GPV L G +I DL+S +WTY+VGL G D A W SK+
Sbjct: 554 LVGAGITGPVTLKSAKGGSSI--DLASQQWTYQVGLKGED-----TGLATVDSSEWVSKS 606
Query: 585 -VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTE 643
+P + + WYKTTF+AP ++PV ++ G GKG AWVNG ++GRYWPT +A GC TE
Sbjct: 607 PLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNGGC-TE 665
Query: 644 SCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVG 703
SCDYRG Y ++KC NCG PSQ YHVPRSW+K N LVLFEE GG+P+QI+F T G
Sbjct: 666 SCDYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQISFATKQTG 725
Query: 704 T----ACGQAH---------------ENKT---MELTC--HGRRISEIKYASFGDPQGAC 739
+ Q+H N+T + L C + I IK+ASFG P+G C
Sbjct: 726 SNLCLTVSQSHPPPVDTWTSDSKISNRNRTRPVLSLKCPISTQVIFSIKFASFGTPKGTC 785
Query: 740 GAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G+F +G C + L L++K C+G +SC++E S G G VK L VEA C
Sbjct: 786 GSFTQGHCNSSRS-LSLVQKACIGLRSCNVEVSTRVFGEP--CRGVVKSLAVEASC 838
>gi|385203117|gb|ADO34790.3| beta-galactosidase STBG5 [Solanum lycopersicum]
Length = 852
Score = 795 bits (2052), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 421/860 (48%), Positives = 541/860 (62%), Gaps = 85/860 (9%)
Query: 6 HCSRAILLC---LILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPD 62
+C I+L + L L S A V++D RA+ +DG R++L+SGSIHYPRSTP MWPD
Sbjct: 7 YCLSVIMLVFGVVFLHCLVMTSFAANVTYDHRALVVDGRRRVLISGSIHYPRSTPDMWPD 66
Query: 63 LIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVC 122
LI+K+K+GGLD IETYVFWN HEP+R QYDF G DLI F+K ++ GL+V +RIGPYVC
Sbjct: 67 LIQKSKDGGLDVIETYVFWNLHEPVRNQYDFEGRKDLINFVKLVEKAGLFVHIRIGPYVC 126
Query: 123 AEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQ 182
AEWNYGGFP+WLH +PGI E RT N+ F EM+ FT IVDM K+E L+ASQGGP+IL+Q
Sbjct: 127 AEWNYGGFPLWLHFIPGI-EFRTDNEPFKAEMKRFTAKIVDMIKQENLYASQGGPVILSQ 185
Query: 183 IENEYGN--VMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM---------- 230
IENEYGN + S YG K Y+NW A MATSL+ GVPW+MCQ+ DAP +
Sbjct: 186 IENEYGNGDIESRYGPRAKPYVNWAASMATSLNTGVPWVMCQQPDAPPSVINTCNGFYCD 245
Query: 231 -FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGT 289
F N+ +PK+WTENWTGWF S+GG P R ED+AFAVARFFQ GGTFQNYYMYHGGT
Sbjct: 246 QFKQNSDKTPKMWTENWTGWFLSFGGPVPYRPVEDIAFAVARFFQRGGTFQNYYMYHGGT 305
Query: 290 NFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTL--TYGNVTN-- 345
NFGRTSGGP++ TSYDYDAP+DEYG +NQPKWGHL++LHK +K E + T N+T+
Sbjct: 306 NFGRTSGGPFIATSYDYDAPLDEYGLINQPKWGHLKDLHKAIKLCEAAMVATEPNITSLG 365
Query: 346 ------------------------TDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVN 381
+D S +G+SY+LP WSVSILPDCK F+TAK+N
Sbjct: 366 SNIEVSVYKTDSQCAAFLANTATQSDAAVSFNGNSYHLPPWSVSILPDCKNVAFSTAKIN 425
Query: 382 TQTNVK--VKRPNQAGNDQAPLQ-WKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSD 437
+ + + V R ++A L W E + + + F L++Q +T D SD
Sbjct: 426 SASTISTFVTRSSEADASGGSLSGWTSVNEPVG---ISNENAFTRMGLLEQINTTADKSD 482
Query: 438 YLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERP 497
YLWY + ++K+D+P L S L + + G VLHAY+NG S SN E P
Sbjct: 483 YLWYSLSVNIKNDEPFLQDGSATVLHVKTLGHVLHAYINGKLSGSGKGNSRHSNFTIEVP 542
Query: 498 VKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYK 557
V L G+N+I LLSATVGLQNYG+ FD+ GI GPV L G T DLSS +WTY+
Sbjct: 543 VTLVPGENKIDLLSATVGLQNYGAFFDLKGAGITGPVQLKGFKNGSTT--DLSSKQWTYQ 600
Query: 558 VGLYGLDDKKFYNAKAANSERGWSSKN-VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGK 616
VGL G +D N + W S+ +P N+ + WYK +F+AP + P+ ++ GMGK
Sbjct: 601 VGLKG-EDLGLSNGGSTL----WKSQTALPTNQPLIWYKASFDAPAGDTPLSMDFTGMGK 655
Query: 617 GFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIK 676
G AWVNG ++GR+WP Y+A DGC T+ C+YRG Y ++KC NCG PSQ+ YHVPRSW+K
Sbjct: 656 GEAWVNGQSIGRFWPAYIAPNDGC-TDPCNYRGGYNAEKCLKNCGKPSQLLYHVPRSWLK 714
Query: 677 DGVNTLVLFEEFGGNPSQINFQTVVVGTACGQ---AH----------------ENKTMEL 717
N LVLFEE GG+P++++F T + + C + AH T+ L
Sbjct: 715 SSGNVLVLFEEMGGDPTKLSFATREIQSVCSRISDAHPLPIDMWASEDDARKKSGPTLSL 774
Query: 718 TC--HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEAN 775
C + IS IK+ASFG PQG CG+F G C + + L +++K C+G KSCS+ S
Sbjct: 775 ECPHPNQVISSIKFASFGTPQGTCGSFIHGRCSSS-NALSIVKKACIGSKSCSLGVSINA 833
Query: 776 LGATSCAAGTVKRLVVEALC 795
G G K L VEA C
Sbjct: 834 FGDP--CKGVAKSLAVEASC 851
>gi|350537827|ref|NP_001234312.1| TBG5 protein precursor [Solanum lycopersicum]
gi|7939623|gb|AAF70824.1|AF154423_1 putative beta-galactosidase [Solanum lycopersicum]
Length = 852
Score = 795 bits (2052), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 422/860 (49%), Positives = 541/860 (62%), Gaps = 85/860 (9%)
Query: 6 HCSRAILLC---LILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPD 62
+C I+L + L L S A V++D RA+ +DG R++L+SGSIHYPRSTP MWPD
Sbjct: 7 YCLSVIMLVFGVVFLHCLVMTSFAANVTYDHRALVVDGRRRVLISGSIHYPRSTPDMWPD 66
Query: 63 LIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVC 122
LI+K+K+GGLD IETYVFWN HEP+R QYDF G DLI F+K ++ GL+V +RIGPYVC
Sbjct: 67 LIQKSKDGGLDVIETYVFWNLHEPVRNQYDFEGRKDLINFVKLVERAGLFVHIRIGPYVC 126
Query: 123 AEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQ 182
AEWNYGGFP+WLH +PGI E RT N+ F EM+ FT IVDM K+E L+ASQGGP+IL+Q
Sbjct: 127 AEWNYGGFPLWLHFIPGI-EFRTDNEPFKAEMKRFTAKIVDMIKQENLYASQGGPVILSQ 185
Query: 183 IENEYGN--VMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM---------- 230
IENEYGN + S YG K Y+NW A MATSL+ GVPW+MCQ+ DAP +
Sbjct: 186 IENEYGNGDIESRYGPRAKPYVNWAASMATSLNTGVPWVMCQQPDAPPSVINTCNGFYCD 245
Query: 231 -FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGT 289
F N+ +PK+WTENWTGWF S+GG P R ED+AFAVARFFQ GGTFQNYYMYHGGT
Sbjct: 246 QFKQNSDKTPKMWTENWTGWFLSFGGPVPYRPVEDIAFAVARFFQRGGTFQNYYMYHGGT 305
Query: 290 NFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTL--TYGNVTN-- 345
NFGRTSGGP++ TSYDYDAP+DEYG +NQPKWGHL++LHK +K E + T NVT+
Sbjct: 306 NFGRTSGGPFIATSYDYDAPLDEYGLINQPKWGHLKDLHKAIKLCEAAMVATEPNVTSLG 365
Query: 346 ------------------------TDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVN 381
+D S +G+SY+LP WSVSILPDCK F+TAK+N
Sbjct: 366 SNIEVSVYKTDSQCAAFLANTATQSDAAVSFNGNSYHLPPWSVSILPDCKNVAFSTAKIN 425
Query: 382 TQTNVK--VKRPNQAGNDQAPLQ-WKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSD 437
+ + + V R ++A L W E + + + F L++Q +T D SD
Sbjct: 426 SASTISTFVTRSSEADASGGSLSGWTSVNEPVG---ISNENAFTRMGLLEQINTTADKSD 482
Query: 438 YLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERP 497
YLWY + ++K+D+P L S L + + G VLHAY+NG S SN E P
Sbjct: 483 YLWYSLSVNIKNDEPFLQDGSATVLHVKTLGHVLHAYINGRLSGSGKGNSRHSNFTIEVP 542
Query: 498 VKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYK 557
V L G+N+I LLSATVGLQNYG+ FD+ GI GPV L G T DLSS +WTY+
Sbjct: 543 VTLVPGENKIDLLSATVGLQNYGAFFDLKGAGITGPVQLKGFKNGSTT--DLSSKQWTYQ 600
Query: 558 VGLYGLDDKKFYNAKAANSERGWSSKN-VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGK 616
VGL G +D N + W S+ +P N+ + WYK +F+AP + P+ ++ GMGK
Sbjct: 601 VGLKG-EDLGLSNGGSTL----WKSQTALPTNQPLIWYKASFDAPAGDTPLSMDFTGMGK 655
Query: 617 GFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIK 676
G AWVNG ++GR+WP Y+A DGC T+ C+YRG Y ++KC NCG PSQ+ YHVPRSW+K
Sbjct: 656 GEAWVNGQSIGRFWPAYIAPNDGC-TDPCNYRGGYNAEKCLKNCGKPSQLLYHVPRSWLK 714
Query: 677 DGVNTLVLFEEFGGNPSQINFQTVVVGTACGQ---AH----------------ENKTMEL 717
N LVLFEE GG+P++++F T + + C + AH T+ L
Sbjct: 715 SSGNVLVLFEEMGGDPTKLSFATREIQSVCSRTSDAHPLPIDMWASEDDARKKSGPTLSL 774
Query: 718 TC--HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEAN 775
C + IS IK+ASFG PQG CG+F G C + + L +++K C+G KSCS+ S
Sbjct: 775 ECPHPNQVISSIKFASFGTPQGTCGSFIHGRCSSS-NALSIVKKACIGSKSCSLGVSINA 833
Query: 776 LGATSCAAGTVKRLVVEALC 795
G G K L VEA C
Sbjct: 834 FGDP--CKGVAKSLAVEASC 851
>gi|449462081|ref|XP_004148770.1| PREDICTED: beta-galactosidase 8-like [Cucumis sativus]
Length = 844
Score = 793 bits (2049), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 413/842 (49%), Positives = 532/842 (63%), Gaps = 86/842 (10%)
Query: 22 NLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFW 81
+ SLA V++D RA+ IDG+RK+L+SGS+HYPRSTP MWP +I+K+K+GGLD IETYVFW
Sbjct: 20 SFSLAVNVTYDHRALVIDGKRKVLVSGSLHYPRSTPEMWPGIIQKSKDGGLDVIETYVFW 79
Query: 82 NAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIE 141
N HEP+R QYDF G DL++FIK + GLYV +RIGPYVCAEWNYGGFPVWLH +PG+
Sbjct: 80 NLHEPVRNQYDFEGRKDLVKFIKLVGAAGLYVHVRIGPYVCAEWNYGGFPVWLHFVPGV- 138
Query: 142 ELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSY 201
+ RT N+ F EM+ FT IVD+ K+EKL+ASQGGPIIL+QIENEYGNV S +G A KSY
Sbjct: 139 QFRTDNEPFKAEMKRFTAKIVDVLKQEKLYASQGGPIILSQIENEYGNVQSSFGSAAKSY 198
Query: 202 INWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWF 250
+ W A MATSL+ GVPW+MC + DAP P+ FTPN+ N PK+WTENW+GWF
Sbjct: 199 VQWAATMATSLNTGVPWVMCNQPDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWF 258
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPI 310
S+GG P R EDLAFAVARF+Q GG+ QNYYMYHGGTNFGRTSGGP++ TSYDYDAPI
Sbjct: 259 LSFGGALPYRPVEDLAFAVARFYQTGGSLQNYYMYHGGTNFGRTSGGPFIATSYDYDAPI 318
Query: 311 DEYGHLNQPKWGHLRELHKLLKSMEKTLT---------------------------YGNV 343
DEYG + QPKWGHLR++HK +K E+ L NV
Sbjct: 319 DEYGLVRQPKWGHLRDVHKAIKMCEEALVSTDPAVTSLGPNLEATVYKSGSQCSAFLANV 378
Query: 344 -TNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTN--------VKVKRPNQA 394
T +D + +G+SY+LPAWSVSILPDCK NTAK+N+ T +KV
Sbjct: 379 DTQSDKTVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSVTTRPSFSNQPLKVDVSASE 438
Query: 395 GNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPI 453
D W W E I + FA L +Q +T D SDYLWY + D+K D+P
Sbjct: 439 AFDSG---WSWIDEPIG---ISKNNSFANLGLSEQINTTADKSDYLWYSLSTDIKGDEPY 492
Query: 454 LSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSAT 513
L+ SN L ++S G VLH ++N S G+S + P+ L GKN I LLS T
Sbjct: 493 LANGSNTVLHVDSLGHVLHVFINKKLAGSGKGSGGSSKVSLDIPITLVPGKNTIDLLSLT 552
Query: 514 VGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKA 573
VGLQNYG+ F++ G+ GPV L + + T+ DLSS +WTY++GL G D +
Sbjct: 553 VGLQNYGAFFELRGAGVTGPVKLENQKNNITV--DLSSGQWTYQIGLEGED----LGLPS 606
Query: 574 ANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTY 633
++ + S N+P N+ +TWYKTTF+AP +DP+ L+ G GKG AW+NG+++GRYWP+Y
Sbjct: 607 GSTSQWLSQPNLPKNKPLTWYKTTFDAPAGSDPLALDFTGFGKGEAWINGHSIGRYWPSY 666
Query: 634 LAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPS 693
+A G T CDY+G Y ++KC NCG PSQ YHVP+SW+K NTLVLFEE G +P+
Sbjct: 667 IAS--GQCTSYCDYKGAYSANKCLRNCGKPSQTLYHVPQSWLKPTGNTLVLFEEIGSDPT 724
Query: 694 QINFQTVVVGTACGQAHENK--------------------TMELTCHGRRISEIKYASFG 733
++ F + +G+ C E+ ++E + IS IK+ASFG
Sbjct: 725 RLTFASKQLGSLCSHVSESHPPPVEMWSSDSKQQKTGPVLSLECPSPSQVISSIKFASFG 784
Query: 734 DPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEA 793
P+G CG+F G C + L +++K C+G KSCSI+ S G G K L VEA
Sbjct: 785 TPRGTCGSFSHGQCSTR-NALSIVQKACIGSKSCSIDVSIKAFGDP--CRGKTKSLAVEA 841
Query: 794 LC 795
C
Sbjct: 842 YC 843
>gi|56201401|dbj|BAD20774.2| beta-galactosidase [Raphanus sativus]
Length = 851
Score = 793 bits (2048), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/854 (50%), Positives = 528/854 (61%), Gaps = 87/854 (10%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
I+ L+L + + A V++D RA+ IDG+RKIL+SGSIHYPRSTP MWPDLI+K+K+G
Sbjct: 15 IVSLLVLVMMTAAATAASVTYDHRALVIDGKRKILISGSIHYPRSTPEMWPDLIQKSKDG 74
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GLD IETYVFWN HEP + +Y+F G DL++F+K GLYV LRIGPY CAEWNYGGF
Sbjct: 75 GLDVIETYVFWNGHEPEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYACAEWNYGGF 134
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
PVWLH +PGI + RT N+ F EMQ FT IVD+ K+EKL+ASQGGPIIL+QIENEYGN+
Sbjct: 135 PVWLHFVPGI-KFRTDNEPFKAEMQRFTAKIVDLMKQEKLYASQGGPIILSQIENEYGNI 193
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSP 239
S YG AGKSY+ W A MA SLD GVPW MCQ+ DAP P+ FTPN+ N P
Sbjct: 194 DSSYGAAGKSYMKWSASMALSLDTGVPWNMCQQGDAPDPIINTCNGFYCDQFTPNSNNKP 253
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
K+WTENW+GWF +G P R EDLAFAVARFFQ GGTFQNYYMYHGGTNF RTSGGP
Sbjct: 254 KMWTENWSGWFLGFGEPSPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFERTSGGPL 313
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLT-------------------- 339
++TSYDYDAPIDEYG L QPKWGHLR+LHK +K E L
Sbjct: 314 ISTSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKLCEDALIATDPKITSLGSNLEAAVYKT 373
Query: 340 --------YGNV-TNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV---- 386
N+ T +D + +G SY LPAWSVSILPDCK FNTAK+N+ T
Sbjct: 374 STGSCAAFLANIGTKSDATVTFNGKSYRLPAWSVSILPDCKNVAFNTAKINSATESTAFA 433
Query: 387 -KVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTN 444
+ +PN + + QW + E + + F L++Q +T D SDYLWY
Sbjct: 434 RQSLKPNADSSAELGSQWSYIKEPVG---ISKADAFVKPGLLEQINTTADKSDYLWYSLR 490
Query: 445 ADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGK 504
D+K D+ L S L + S GQ+++A++NG S K S D+ P+ L GK
Sbjct: 491 MDIKGDETFLDEGSKAVLHVQSIGQLVYAFINGKLAGSGNGKQKISLDI---PINLVTGK 547
Query: 505 NQISLLSATVGLQNYGSKFDMVPNGIPGPV-LLVGRAGDETIIKDLSSHKWTYKVGLYGL 563
N I LLS TVGL NYG FD+ GI GPV L + G T DLSS +WTY+VGL G
Sbjct: 548 NTIDLLSVTVGLANYGPFFDLTGAGITGPVSLKSAKTGSST---DLSSQQWTYQVGLKGE 604
Query: 564 DDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNG 623
D + +S S+ +P ++ + WYKTTF+AP +DPV ++ G GKG AWVNG
Sbjct: 605 DK----GLGSGDSSEWVSNSPLPTSQPLIWYKTTFDAPSGSDPVAIDFTGTGKGIAWVNG 660
Query: 624 YNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLV 683
++GRYWPT +A DGC SCDYRG Y S+KC NCG PSQ YHVPRSWIK NTLV
Sbjct: 661 QSIGRYWPTSIARTDGC-VGSCDYRGSYRSNKCLKNCGKPSQTLYHVPRSWIKPSGNTLV 719
Query: 684 LFEEFGGNPSQINFQTVVVGT----ACGQAH-------------ENKT---MELTC--HG 721
L EE GG+P++I+F T G+ Q+H N+T + L C
Sbjct: 720 LLEEMGGDPTKISFATKQTGSNLCLTVSQSHPAPVDTWISDSKFSNRTSPVLSLKCPVST 779
Query: 722 RRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSC 781
+ IS I++ASFG P G CG+F G C + L +++K CVG +SC +E S G
Sbjct: 780 QVISSIRFASFGTPTGTCGSFSYGHCSSARS-LSVVQKACVGSRSCKVEVSTRVFGEP-- 836
Query: 782 AAGTVKRLVVEALC 795
G VK L VEA C
Sbjct: 837 CRGVVKSLAVEASC 850
>gi|449525184|ref|XP_004169598.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 8-like [Cucumis
sativus]
Length = 844
Score = 793 bits (2048), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 413/842 (49%), Positives = 531/842 (63%), Gaps = 86/842 (10%)
Query: 22 NLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFW 81
+ SLA V++D RA+ IDG+RK+L+SGS+HYPRSTP MWP +I+K+K+GGLD IETYVFW
Sbjct: 20 SFSLAVNVTYDHRALVIDGKRKVLVSGSLHYPRSTPEMWPGIIQKSKDGGLDVIETYVFW 79
Query: 82 NAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIE 141
N HEP+R QYDF G DL++FIK + GLYV +RIGPYVCAEWNYGGFPVWLH +PG+
Sbjct: 80 NLHEPVRNQYDFEGRKDLVKFIKLVGAAGLYVHVRIGPYVCAEWNYGGFPVWLHFVPGV- 138
Query: 142 ELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSY 201
+ RT N+ F EM+ FT IVD+ K+EKL+ASQGGPIIL+QIENEYGNV S +G A KSY
Sbjct: 139 QFRTDNEPFKAEMKRFTAKIVDVLKQEKLYASQGGPIILSQIENEYGNVQSSFGSAAKSY 198
Query: 202 INWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWF 250
+ W A MATSL+ GVPW+MC + DAP P+ FTPN+ N PK+WTENW+GWF
Sbjct: 199 VQWAATMATSLNTGVPWVMCNQPDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWF 258
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPI 310
S+GG P R EDLAFAVARF+Q GG+ QNYYMYHGGTNFGRTSGGP++ TSYDYDAPI
Sbjct: 259 LSFGGALPYRPVEDLAFAVARFYQTGGSLQNYYMYHGGTNFGRTSGGPFIATSYDYDAPI 318
Query: 311 DEYGHLNQPKWGHLRELHKLLKSMEKTLT---------------------------YGNV 343
DEYG + QPKWGHLR++HK +K E+ L NV
Sbjct: 319 DEYGLVRQPKWGHLRDVHKAIKMCEEALVSTDPAVTSLGPNLEATVYKSGSQCSAFLANV 378
Query: 344 -TNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTN--------VKVKRPNQA 394
T +D + +G+SY+LPAWSVSILPDCK NTAK+N+ T +KV
Sbjct: 379 DTQSDKTVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSVTTRPSFSNQPLKVDVSASE 438
Query: 395 GNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPI 453
D W W E I + FA L +Q +T D SDYLWY + D+K D+P
Sbjct: 439 AFDSG---WSWIDEPIG---ISKNNSFANLGLSEQINTTADKSDYLWYSLSTDIKGDEPY 492
Query: 454 LSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSAT 513
L+ SN L ++S G VLH ++N S G+S + P+ L GKN I LLS T
Sbjct: 493 LANGSNTVLHVDSLGHVLHVFINKKLAGSGKGSGGSSKVSLDIPITLVPGKNTIDLLSLT 552
Query: 514 VGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKA 573
VGLQNYG+ F++ G+ GPV L + T+ DLSS +WTY++GL G D +
Sbjct: 553 VGLQNYGAFFELRGAGVTGPVKLENXKNNITV--DLSSGQWTYQIGLEGED----LGLPS 606
Query: 574 ANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTY 633
++ + S N+P N+ +TWYKTTF+AP +DP+ L+ G GKG AW+NG+++GRYWP+Y
Sbjct: 607 GSTSQWLSQPNLPKNKPLTWYKTTFDAPAGSDPLALDFTGFGKGEAWINGHSIGRYWPSY 666
Query: 634 LAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPS 693
+A G T CDY+G Y ++KC NCG PSQ YHVP+SW+K NTLVLFEE G +P+
Sbjct: 667 IAS--GQCTSYCDYKGAYSANKCLRNCGKPSQTLYHVPQSWLKPTGNTLVLFEEIGSDPT 724
Query: 694 QINFQTVVVGTACGQAHENK--------------------TMELTCHGRRISEIKYASFG 733
++ F + +G+ C E+ ++E + IS IK+ASFG
Sbjct: 725 RLTFASKQLGSLCSHVSESHPPPVEMWSSDSKQQKTGPVLSLECPSPSQVISSIKFASFG 784
Query: 734 DPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEA 793
P+G CG+F G C + L +++K C+G KSCSI+ S G G K L VEA
Sbjct: 785 TPRGTCGSFSHGQCSTR-NALSIVQKACIGSKSCSIDVSIKAFGDP--CRGKTKSLAVEA 841
Query: 794 LC 795
C
Sbjct: 842 YC 843
>gi|356550171|ref|XP_003543462.1| PREDICTED: beta-galactosidase 8-like isoform 1 [Glycine max]
Length = 840
Score = 789 bits (2038), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/851 (50%), Positives = 532/851 (62%), Gaps = 91/851 (10%)
Query: 12 LLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGG 71
LLC+ TLF ++ Y D RA+ IDG+R++L+SGSIHYPRSTP MWPDLI+K+K+GG
Sbjct: 13 LLCIHSPTLFCANVEY----DHRALVIDGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGG 68
Query: 72 LDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFP 131
LD IETYVFWN +EP+R QYDF G DL++F+KT+ GLYV LRIGPYVCAEWNYGGFP
Sbjct: 69 LDVIETYVFWNLNEPVRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAEWNYGGFP 128
Query: 132 VWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVM 191
+WLH +PGI+ RT N+ F EM+ FT IVDM K+E L+ASQGGP+IL+QIENEYGN+
Sbjct: 129 LWLHFIPGIK-FRTDNEPFKAEMKRFTAKIVDMIKEENLYASQGGPVILSQIENEYGNID 187
Query: 192 SDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPK 240
S YG AGKSYI W A MATSLD GVPW+MCQ++DAP P+ FTPN+ PK
Sbjct: 188 SAYGAAGKSYIKWAATMATSLDTGVPWVMCQQADAPDPIINTCNGFYCDQFTPNSNTKPK 247
Query: 241 IWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL 300
+WTENW+GWF +GG P R EDLAFAVARFFQ GGTFQNYYMYHGGTNF RTSGGP++
Sbjct: 248 MWTENWSGWFLPFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFI 307
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSV-------- 352
TSYDYDAPIDEYG + QPKWGHL+E+HK +K E+ L + T T G ++
Sbjct: 308 ATSYDYDAPIDEYGIIRQPKWGHLKEVHKAIKLCEEALIATDPTITSLGPNLEAAVYKTG 367
Query: 353 --------------------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV-----K 387
SG+SY+LPAWSVSILPDCK NTAK+N+ + + +
Sbjct: 368 SVCAAFLANVDTKSDVTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKINSASAISSFTTE 427
Query: 388 VKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNAD 446
+ + ++ + W W E + + F L++Q +T D SDYLWY + D
Sbjct: 428 SLKEDIGSSEASSTGWSWISEPVG---ISKADSFPQTGLLEQINTTADKSDYLWYSLSID 484
Query: 447 LKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQ 506
K D S L I S G LHA++NG SQ G + PV L GKN
Sbjct: 485 YKGD-----AGSQTVLHIESLGHALHAFINGKLAGSQTGNSGKYKFTVDIPVTLVAGKNT 539
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDK 566
I LLS TVGLQNYG+ FD GI GPV+L G A T+ DLS KWTY+VGL G D
Sbjct: 540 IDLLSLTVGLQNYGAFFDTWGAGITGPVILKGLANGNTL--DLSYQKWTYQVGLKGED-- 595
Query: 567 KFYNAKAANSERGWSSKNV-PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYN 625
++ S W+S++ P N+ + WYKTTF AP +DPV ++ GMGKG AWVNG +
Sbjct: 596 ---LGLSSGSSGQWNSQSTFPKNQPLIWYKTTFAAPSGSDPVAIDFTGMGKGEAWVNGQS 652
Query: 626 LGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLF 685
+GRYWPTY+A + GC T+SC+YRGPY + KC NCG PSQ YHVPRSW+K N LVLF
Sbjct: 653 IGRYWPTYVASDAGC-TDSCNYRGPYSASKCRRNCGKPSQTLYHVPRSWLKPSGNILVLF 711
Query: 686 EEFGGNPSQINFQTVVVGTACGQAHENK-------------------TMELTC--HGRRI 724
EE GG+P+QI+F T + C ++ + LTC + I
Sbjct: 712 EEKGGDPTQISFVTKQTESLCAHVSDSHPPPVDLWNSDTESGRKVGPVLSLTCPHDNQVI 771
Query: 725 SEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAG 784
S IK+AS+G P G CG F G C + L +++K C+G SCS+ S G + G
Sbjct: 772 SSIKFASYGTPLGTCGNFYHGRCSSN-KALSIVQKACIGSSSCSVGVSSETFG--NPCRG 828
Query: 785 TVKRLVVEALC 795
K L VEA C
Sbjct: 829 VAKSLAVEATC 839
>gi|356539454|ref|XP_003538213.1| PREDICTED: beta-galactosidase 8-like [Glycine max]
Length = 838
Score = 789 bits (2038), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 411/837 (49%), Positives = 525/837 (62%), Gaps = 86/837 (10%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
S V++D RA+ IDG+R++L+SGSIHYPRSTP MWPDLI+K+K+GGLD IETYVFWN
Sbjct: 22 SFCANVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNL 81
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
HEP++ QY+F G DL++F+K + GLYV LRIGPY CAEWNYGGFP+WLH +PGI+
Sbjct: 82 HEPVQGQYNFEGRADLVKFVKAVAAAGLYVHLRIGPYACAEWNYGGFPLWLHFIPGIQ-F 140
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYIN 203
RT NK F EM+ FT IVDM K+E L+ASQGGPIIL+Q+ENEYGN+ + YG A KSYI
Sbjct: 141 RTDNKPFEAEMKRFTVKIVDMMKQESLYASQGGPIILSQVENEYGNIDAAYGPAAKSYIK 200
Query: 204 WCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKS 252
W A MATSLD GVPW+MCQ++DAP P+ FTPN+ PK+WTENW+GWF S
Sbjct: 201 WAASMATSLDTGVPWVMCQQADAPDPIINTCNGFYCDQFTPNSNAKPKMWTENWSGWFLS 260
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDE 312
+GG P R EDLAFAVARF+Q GGTFQNYYMYHGGTNFGRT+GGP+++TSYDYDAPID+
Sbjct: 261 FGGAVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRTTGGPFISTSYDYDAPIDQ 320
Query: 313 YGHLNQPKWGHLRELHKLLKSMEKTLT---------------------------YGNVTN 345
YG + QPKWGHL+++HK +K E+ L N+
Sbjct: 321 YGIIRQPKWGHLKDVHKAIKLCEEALIATDPTITSPGPNIEAAVYKTGSICAAFLANIAT 380
Query: 346 TDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV-----KVKRPNQAGNDQAP 400
+D + +G+SY+LPAWSVSILPDCK NTAK+N+ + + + + D +
Sbjct: 381 SDATVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSASMISSFTTESFKEEVGSLDDSG 440
Query: 401 LQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSN 459
W W E I + F+ L++Q +T D SDYLWY + D++ D S
Sbjct: 441 SGWSWISEPIG---ISKSDSFSKFGLLEQINTTADKSDYLWYSISIDVEGDS-----GSQ 492
Query: 460 MTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNY 519
L I S G LHA++NG S G + + PV L GKN I LLS TVGLQNY
Sbjct: 493 TVLHIESLGHALHAFINGKIAGSGTGNSGKAKVNVDIPVTLVAGKNSIDLLSLTVGLQNY 552
Query: 520 GSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERG 579
G+ FD GI GPV+L G T+ DLSS +WTY+VGL K+ + +N G
Sbjct: 553 GAFFDTWGAGITGPVILKGLKNGSTV--DLSSQQWTYQVGL------KYEDLGPSNGSSG 604
Query: 580 -WSSKN-VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEE 637
W+S++ +P N+ + WYKT F AP ++PV ++ GMGKG AWVNG ++GRYWPTY++
Sbjct: 605 QWNSQSTLPTNQSLIWYKTNFVAPSGSNPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSPN 664
Query: 638 DGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
GC T+SC+YRG Y S KC NCG PSQ YH+PRSW++ NTLVLFEE GG+P+QI+F
Sbjct: 665 GGC-TDSCNYRGAYSSSKCLKNCGKPSQTLYHIPRSWLQPDSNTLVLFEESGGDPTQISF 723
Query: 698 QTVVVGTACGQAHENK-------------------TMELTCHGRRISEIKYASFGDPQGA 738
T +G+ C E+ ++E + IS IK+ASFG P G
Sbjct: 724 ATKQIGSMCSHVSESHPPPVDLWNSDKGRKVGPVLSLECPYPNQLISSIKFASFGTPYGT 783
Query: 739 CGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
CG FK G C + L +++K C+G SC I S G G K L VEA C
Sbjct: 784 CGNFKHGRCRSN-KALSIVQKACIGSSSCRIGISINTFGDP--CKGVTKSLAVEASC 837
>gi|356543464|ref|XP_003540180.1| PREDICTED: beta-galactosidase 8-like isoform 1 [Glycine max]
Length = 840
Score = 787 bits (2033), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 426/851 (50%), Positives = 530/851 (62%), Gaps = 91/851 (10%)
Query: 12 LLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGG 71
LLC+ LF ++ Y D RA+ IDG+R++L+SGSIHYPRSTP MWPDLI+K+K+GG
Sbjct: 13 LLCIHTPKLFCANVEY----DHRALVIDGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGG 68
Query: 72 LDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFP 131
LD IETYVFWN HEP+R QYDF G DL++F+KT+ GLYV LRIGPYVCAEWNYGGFP
Sbjct: 69 LDVIETYVFWNLHEPVRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAEWNYGGFP 128
Query: 132 VWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVM 191
VWLH +PGI+ RT N+ F EM+ FT IVDM K+EKL+ASQGGP+IL+QIENEYGN+
Sbjct: 129 VWLHFIPGIK-FRTDNEPFKAEMKRFTAKIVDMIKQEKLYASQGGPVILSQIENEYGNID 187
Query: 192 SDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPK 240
+ YG AGKSYI W A MATSLD GVPW+MC ++DAP P+ FTPN+ PK
Sbjct: 188 TAYGAAGKSYIKWAATMATSLDTGVPWVMCLQADAPDPIINTWNGFYGDEFTPNSNTKPK 247
Query: 241 IWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL 300
+WTENW+GWF +GG P R EDLAFAVARFFQ GGTFQNYYMYHGGTNF R SGGP++
Sbjct: 248 MWTENWSGWFLVFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRASGGPFI 307
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSV-------- 352
TSYDYDAPIDEYG + QPKWGHL+E+HK +K E+ L + T T G ++
Sbjct: 308 ATSYDYDAPIDEYGIIRQPKWGHLKEVHKAIKLCEEALIATDPTITSLGPNLEAAVYKTG 367
Query: 353 --------------------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV-----K 387
SG+SY+LPAWSVSILPDCK+ NTAK+N+ + + +
Sbjct: 368 SVCAAFLANVGTKSDVTVNFSGNSYHLPAWSVSILPDCKSVVLNTAKINSASAISSFTTE 427
Query: 388 VKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNAD 446
+ + ++ + W W E + + F+ L++Q +T D SDYLWY + D
Sbjct: 428 SSKEDIGSSEASSTGWSWISEPVG---ISKTDSFSQTGLLEQINTTADKSDYLWYSLSID 484
Query: 447 LKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQ 506
K D SS L I S G LHA++NG SQ G + PV L GKN
Sbjct: 485 YKAD-----ASSQTVLHIESLGHALHAFINGKLAGSQPGNSGKYKFTVDIPVTLVAGKNT 539
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDK 566
I LLS TVGLQNYG+ FD GI GPV+L G A T+ DLSS KWTY+VGL G D
Sbjct: 540 IDLLSLTVGLQNYGAFFDTWGVGITGPVILKGFANGNTL--DLSSQKWTYQVGLQGED-- 595
Query: 567 KFYNAKAANSERGWSSKNV-PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYN 625
++ S W+ ++ P N+ +TWYKTTF AP +DPV ++ GMGKG AWVNG
Sbjct: 596 ---LGLSSGSSGQWNLQSTFPKNQPLTWYKTTFSAPSGSDPVAIDFTGMGKGEAWVNGQR 652
Query: 626 LGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLF 685
+GRYWPTY+A + C T+SC+YRGPY + KC NC PSQ YHVPRSW+K N LVLF
Sbjct: 653 IGRYWPTYVASDASC-TDSCNYRGPYSASKCRKNCEKPSQTLYHVPRSWLKPSGNILVLF 711
Query: 686 EEFGGNPSQINFQTVVVGTACGQAHENK-------------------TMELTC--HGRRI 724
EE GG+P+QI+F T + C ++ + LTC + I
Sbjct: 712 EERGGDPTQISFVTKQTESLCAHVSDSHPPPVDLWNSETESGRKVGPVLSLTCPHDNQVI 771
Query: 725 SEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAG 784
S IK+AS+G P G CG F G C + L +++K C+G SCS+ S G G
Sbjct: 772 SSIKFASYGTPLGTCGNFYHGRCSSN-KALSIVQKACIGSSSCSVGVSSDTFG--DPCRG 828
Query: 785 TVKRLVVEALC 795
K L VEA C
Sbjct: 829 MAKSLAVEATC 839
>gi|115437264|ref|NP_001043252.1| Os01g0533400 [Oryza sativa Japonica Group]
gi|75158475|sp|Q8RUV9.1|BGAL1_ORYSJ RecName: Full=Beta-galactosidase 1; Short=Lactase 1; Flags:
Precursor
gi|20146357|dbj|BAB89138.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|20161405|dbj|BAB90329.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113532783|dbj|BAF05166.1| Os01g0533400 [Oryza sativa Japonica Group]
gi|215767421|dbj|BAG99649.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 827
Score = 786 bits (2029), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/820 (50%), Positives = 530/820 (64%), Gaps = 76/820 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D R++ IDG+R+I+LSGSIHYPRSTP MWPDLIKKAKEGGLDAIETY+FWN HEP R
Sbjct: 31 VSYDDRSLVIDGQRRIILSGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYIFWNGHEPHR 90
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
RQY+F GN D++RF K IQ+ G+Y ILRIGPY+C EWNYGG P WL ++PG++ R N+
Sbjct: 91 RQYNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQ-FRLHNE 149
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGD--AGKSYINWCA 206
F NEM+ FTTLIV+ K K+FA QGGPIILAQIENEYGN+M + + YI+WCA
Sbjct: 150 PFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIHWCA 209
Query: 207 KMATSLDIGVPWIMCQESD-APSPMFT-----------PNNPNSPKIWTENWTGWFKSWG 254
MA ++GVPWIMCQ+ D P + PN PKIWTENWTGWFK+W
Sbjct: 210 DMANKQNVGVPWIMCQQDDDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWD 269
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
D R+AED+AFAVA FFQ G+ QNYYMYHGGTNFGRTSGGPY+TTSYDYDAP+DEYG
Sbjct: 270 KPDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYG 329
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS--------------------- 353
+L QPK+GHL+ELH +LKSMEKTL +G +T+YG++++
Sbjct: 330 NLRQPKYGHLKELHSVLKSMEKTLVHGEYFDTNYGDNITVTKYTLDSSSACFINNRFDDK 389
Query: 354 -------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
G+++ LPAWSVSILPDCKT FN+AK+ TQT+V VK+PN A +Q L+W W
Sbjct: 390 DVNVTLDGATHLLPAWSVSILPDCKTVAFNSAKIKTQTSVMVKKPNTAEQEQESLKWSWM 449
Query: 407 PEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
PE ++ F+ KG+F N L++Q T+ D SDYLWY T+ + K G + L +N
Sbjct: 450 PENLSPFMTDEKGNFRKNELLEQIVTSTDQSDYLWYRTSLNHK-------GEGSYKLYVN 502
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
++G L+A+VNG + + G E PVKL GKN ISLLSATVGL+NYG F+
Sbjct: 503 TTGHELYAFVNGKLIGKNHSADGDFVFQLESPVKLHDGKNYISLLSATVGLKNYGPSFEK 562
Query: 526 VPNGI-PGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
+P GI GPV L+ G DLS+ W+YK GL ++ W+ N
Sbjct: 563 MPTGIVGGPVKLIDSNGTAI---DLSNSSWSYKAGL----ASEYRQIHLDKPGYKWNGNN 615
Query: 585 --VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEE-DGCS 641
+P+NR TWYK TFEAP D VV++L G+ KG AWVNG NLGRYWP+Y A E GC
Sbjct: 616 GTIPINRPFTWYKATFEAPSGEDAVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMAGC- 674
Query: 642 TESCDYRGPYGSD----KCAYNCGNPSQIWYHVPRSWIKDGV-NTLVLFEEFGGNPSQIN 696
CDYRG + ++ +C CG PSQ +YHVPRS++ G NTL+LFEE GG+PS +
Sbjct: 675 -HRCDYRGAFQAEGDGTRCLTGCGEPSQRYYHVPRSFLAAGEPNTLLLFEEAGGDPSGVA 733
Query: 697 FQTVVVGTACGQAHENKTMELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLP 755
+TVV G C + L+C G +S + ASFG +G CG + +G CE++
Sbjct: 734 LRTVVPGAVCTSGEAGDAVTLSCGGGHAVSSVDVASFGVGRGRCGGY-EGGCESKA-AYE 791
Query: 756 LIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
CVGK+SC++E + A GA C +G L V+A C
Sbjct: 792 AFTAACVGKESCTVEITGAFAGA-GCLSGV---LTVQATC 827
>gi|115451981|ref|NP_001049591.1| Os03g0255100 [Oryza sativa Japonica Group]
gi|108707232|gb|ABF95027.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113548062|dbj|BAF11505.1| Os03g0255100 [Oryza sativa Japonica Group]
gi|215695246|dbj|BAG90437.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 956
Score = 784 bits (2025), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 419/846 (49%), Positives = 524/846 (61%), Gaps = 90/846 (10%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
S A V++D RA+ IDG R++L+SGSIHYPRSTP MWP LI+K+K+GGLD IETYVFW+
Sbjct: 126 SRAANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFWDI 185
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
HE +R QYDF G DL+RF+K + D GLYV LRIGPYVCAEWNYGGFPVWLH +PGI+
Sbjct: 186 HEAVRGQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIK-F 244
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYIN 203
RT N+ F EMQ FT +VD K L+ASQGGPIIL+QIENEYGN+ S YG AGK+Y+
Sbjct: 245 RTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMR 304
Query: 204 WCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKS 252
W A MA SLD GVPW+MCQ+SDAP P+ FTPN+ + PK+WTENW+GWF S
Sbjct: 305 WAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLS 364
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDE 312
+GG P R AEDLAFAVARF+Q GGTFQNYYMYHGGTNFGR++GGP++ TSYDYDAPIDE
Sbjct: 365 FGGAVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDE 424
Query: 313 YGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG------------NSV-------- 352
YG + QPKWGHLR++HK +K E L + + G NS+
Sbjct: 425 YGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSICAAFLANV 484
Query: 353 ----------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRP----NQAGNDQ 398
+G++Y LPAWSVSILPDCK NTA++N+Q R Q +D
Sbjct: 485 DAQSDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDTDDS 544
Query: 399 ------APLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDD 451
A W + E + + + L++Q +T D SD+LWY T+ +K D+
Sbjct: 545 LITPELATAGWSYAIEPVG---ITKENALTKPGLMEQINTTADASDFLWYSTSIVVKGDE 601
Query: 452 PILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLS 511
P L+GS + L +NS G VL Y+NG S +S + PV L GKN+I LLS
Sbjct: 602 PYLNGSQS-NLLVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLS 660
Query: 512 ATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNA 571
TVGL NYG+ FD+V G+ GPV L G G +LSS WTY++GL G +D YN
Sbjct: 661 TTVGLSNYGAFFDLVGAGVTGPVKLSGPNG----ALNLSSTDWTYQIGLRG-EDLHLYNP 715
Query: 572 KAANSERGWSSKNV-PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
A+ E W S N P N+ + WYKT F AP +DPV ++ GMGKG AWVNG ++GRYW
Sbjct: 716 SEASPE--WVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYW 773
Query: 631 PTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGG 690
PT LA + GC SC+YRG Y S+KC CG PSQ YHVPRS+++ G N LVLFE+FGG
Sbjct: 774 PTNLAPQSGC-VNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGG 832
Query: 691 NPSQINFQTVVVGTACGQAHE-------------------NKTMELTC--HGRRISEIKY 729
+PS I+F T + C E + L C G+ IS IK+
Sbjct: 833 DPSMISFTTRQTSSICAHVSEMHPAQIDSWISPQQTSQTQGPALRLECPREGQVISNIKF 892
Query: 730 ASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRL 789
ASFG P G CG + G C + L ++++ CVG +CS+ S N G +G K L
Sbjct: 893 ASFGTPSGTCGNYNHGECSSS-QALAVVQEACVGMTNCSVPVSSNNFGDP--CSGVTKSL 949
Query: 790 VVEALC 795
VVEA C
Sbjct: 950 VVEAAC 955
>gi|152013362|sp|Q10NX8.2|BGAL6_ORYSJ RecName: Full=Beta-galactosidase 6; Short=Lactase 6; Flags:
Precursor
Length = 858
Score = 783 bits (2023), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 419/846 (49%), Positives = 524/846 (61%), Gaps = 90/846 (10%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
S A V++D RA+ IDG R++L+SGSIHYPRSTP MWP LI+K+K+GGLD IETYVFW+
Sbjct: 28 SRAANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFWDI 87
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
HE +R QYDF G DL+RF+K + D GLYV LRIGPYVCAEWNYGGFPVWLH +PGI+
Sbjct: 88 HEAVRGQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIK-F 146
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYIN 203
RT N+ F EMQ FT +VD K L+ASQGGPIIL+QIENEYGN+ S YG AGK+Y+
Sbjct: 147 RTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMR 206
Query: 204 WCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKS 252
W A MA SLD GVPW+MCQ+SDAP P+ FTPN+ + PK+WTENW+GWF S
Sbjct: 207 WAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLS 266
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDE 312
+GG P R AEDLAFAVARF+Q GGTFQNYYMYHGGTNFGR++GGP++ TSYDYDAPIDE
Sbjct: 267 FGGAVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDE 326
Query: 313 YGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG------------NSV-------- 352
YG + QPKWGHLR++HK +K E L + + G NS+
Sbjct: 327 YGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSICAAFLANV 386
Query: 353 ----------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRP----NQAGNDQ 398
+G++Y LPAWSVSILPDCK NTA++N+Q R Q +D
Sbjct: 387 DAQSDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDTDDS 446
Query: 399 ------APLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDD 451
A W + E + + + L++Q +T D SD+LWY T+ +K D+
Sbjct: 447 LITPELATAGWSYAIEPVG---ITKENALTKPGLMEQINTTADASDFLWYSTSIVVKGDE 503
Query: 452 PILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLS 511
P L+GS + L +NS G VL Y+NG S +S + PV L GKN+I LLS
Sbjct: 504 PYLNGSQS-NLLVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLS 562
Query: 512 ATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNA 571
TVGL NYG+ FD+V G+ GPV L G G +LSS WTY++GL G +D YN
Sbjct: 563 TTVGLSNYGAFFDLVGAGVTGPVKLSGPNG----ALNLSSTDWTYQIGLRG-EDLHLYNP 617
Query: 572 KAANSERGWSSKNV-PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
A+ E W S N P N+ + WYKT F AP +DPV ++ GMGKG AWVNG ++GRYW
Sbjct: 618 SEASPE--WVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYW 675
Query: 631 PTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGG 690
PT LA + GC SC+YRG Y S+KC CG PSQ YHVPRS+++ G N LVLFE+FGG
Sbjct: 676 PTNLAPQSGC-VNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGG 734
Query: 691 NPSQINFQTVVVGTACGQAHE-------------------NKTMELTC--HGRRISEIKY 729
+PS I+F T + C E + L C G+ IS IK+
Sbjct: 735 DPSMISFTTRQTSSICAHVSEMHPAQIDSWISPQQTSQTQGPALRLECPREGQVISNIKF 794
Query: 730 ASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRL 789
ASFG P G CG + G C + L ++++ CVG +CS+ S N G +G K L
Sbjct: 795 ASFGTPSGTCGNYNHGECSSS-QALAVVQEACVGMTNCSVPVSSNNFGDP--CSGVTKSL 851
Query: 790 VVEALC 795
VVEA C
Sbjct: 852 VVEAAC 857
>gi|356550173|ref|XP_003543463.1| PREDICTED: beta-galactosidase 8-like isoform 2 [Glycine max]
Length = 830
Score = 783 bits (2021), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/848 (50%), Positives = 526/848 (62%), Gaps = 95/848 (11%)
Query: 12 LLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGG 71
LLC+ TLF V +D RA+ IDG+R++L+SGSIHYPRSTP MWPDLI+K+K+GG
Sbjct: 13 LLCIHSPTLF----CANVEYDHRALVIDGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGG 68
Query: 72 LDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFP 131
LD IETYVFWN +EP+R QYDF G DL++F+KT+ GLYV LRIGPYVCAEWNYGGFP
Sbjct: 69 LDVIETYVFWNLNEPVRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAEWNYGGFP 128
Query: 132 VWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVM 191
+WLH +PGI+ RT N+ F EM+ FT IVDM K+E L+ASQGGP+IL+QIENEYGN+
Sbjct: 129 LWLHFIPGIK-FRTDNEPFKAEMKRFTAKIVDMIKEENLYASQGGPVILSQIENEYGNID 187
Query: 192 SDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPK 240
S YG AGKSYI W A MATSLD GVPW+MCQ++DAP P+ FTPN+ PK
Sbjct: 188 SAYGAAGKSYIKWAATMATSLDTGVPWVMCQQADAPDPIINTCNGFYCDQFTPNSNTKPK 247
Query: 241 IWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL 300
+WTENW+GWF +GG P R EDLAFAVARFFQ GGTFQNYYMYHGGTNF RTSGGP++
Sbjct: 248 MWTENWSGWFLPFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFI 307
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSV-------- 352
TSYDYDAPIDEYG + QPKWGHL+E+HK +K E+ L + T T G ++
Sbjct: 308 ATSYDYDAPIDEYGIIRQPKWGHLKEVHKAIKLCEEALIATDPTITSLGPNLEAAVYKTG 367
Query: 353 --------------------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR-- 390
SG+SY+LPAWSVSILPDCK NTAKV + +
Sbjct: 368 SVCAAFLANVDTKSDVTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKVCLTNFISMFMWL 427
Query: 391 PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKD 449
P+ G W W E + + F L++Q +T D SDYLWY + D K
Sbjct: 428 PSSTG-------WSWISEPVG---ISKADSFPQTGLLEQINTTADKSDYLWYSLSIDYKG 477
Query: 450 DDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISL 509
D S L I S G LHA++NG SQ G + PV L GKN I L
Sbjct: 478 D-----AGSQTVLHIESLGHALHAFINGKLAGSQTGNSGKYKFTVDIPVTLVAGKNTIDL 532
Query: 510 LSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFY 569
LS TVGLQNYG+ FD GI GPV+L G A T+ DLS KWTY+VGL G D
Sbjct: 533 LSLTVGLQNYGAFFDTWGAGITGPVILKGLANGNTL--DLSYQKWTYQVGLKGED----- 585
Query: 570 NAKAANSERGWSSKNV-PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGR 628
++ S W+S++ P N+ + WYKTTF AP +DPV ++ GMGKG AWVNG ++GR
Sbjct: 586 LGLSSGSSGQWNSQSTFPKNQPLIWYKTTFAAPSGSDPVAIDFTGMGKGEAWVNGQSIGR 645
Query: 629 YWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEF 688
YWPTY+A + GC T+SC+YRGPY + KC NCG PSQ YHVPRSW+K N LVLFEE
Sbjct: 646 YWPTYVASDAGC-TDSCNYRGPYSASKCRRNCGKPSQTLYHVPRSWLKPSGNILVLFEEK 704
Query: 689 GGNPSQINFQTVVVGTACGQAHENK-------------------TMELTC--HGRRISEI 727
GG+P+QI+F T + C ++ + LTC + IS I
Sbjct: 705 GGDPTQISFVTKQTESLCAHVSDSHPPPVDLWNSDTESGRKVGPVLSLTCPHDNQVISSI 764
Query: 728 KYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVK 787
K+AS+G P G CG F G C + L +++K C+G SCS+ S G + G K
Sbjct: 765 KFASYGTPLGTCGNFYHGRCSSN-KALSIVQKACIGSSSCSVGVSSETFG--NPCRGVAK 821
Query: 788 RLVVEALC 795
L VEA C
Sbjct: 822 SLAVEATC 829
>gi|326506982|dbj|BAJ95568.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 853
Score = 782 bits (2020), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/844 (49%), Positives = 521/844 (61%), Gaps = 88/844 (10%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
S A V++D RA+ IDG R++L+SGSIHYPRSTP MWP L++KAK+GGLD +ETYVFW+
Sbjct: 25 SAATNVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLMQKAKDGGLDVVETYVFWDV 84
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
HEP+R QYDF G DL+RF+K D GLYV LRIGPYVCAEWNYGGFP+WLH +PGI +L
Sbjct: 85 HEPVRGQYDFEGRNDLVRFVKAAADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGI-KL 143
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYIN 203
RT N+ F EMQ FT +V K L+ASQGGPIIL+QIENEYGN+ + YG AGKSYI
Sbjct: 144 RTDNEPFKTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIAASYGAAGKSYIR 203
Query: 204 WCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKS 252
W A MA +LD GVPW+MCQ++DAP P+ FTP+ P+ PK+WTENW+GWF S
Sbjct: 204 WAAGMAVALDTGVPWVMCQQTDAPEPLINTCNGFYCDQFTPSLPSRPKLWTENWSGWFLS 263
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDE 312
+GG P R EDLAFAVARF+Q GGT QNYYMYHGGTNFGR+SGGP+++TSYDYDAPIDE
Sbjct: 264 FGGAVPYRPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDE 323
Query: 313 YGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN---------------------- 350
YG + QPKWGHLR++HK +K E L + + G
Sbjct: 324 YGLVRQPKWGHLRDVHKAIKMCEPALIATDPSYMSLGQNAEAHVYKSGSLCAAFLANIDD 383
Query: 351 ------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR----PNQAGN---- 396
+ +G +Y LPAWSVSILPDCK NTA++N+Q R QA +
Sbjct: 384 QSDKTVTFNGKAYKLPAWSVSILPDCKNVVLNTAQINSQVASTQMRNLGFSTQASDGSSV 443
Query: 397 --DQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPI 453
+ A W + E + + + L++Q +T D SD+LWY T+ + +P
Sbjct: 444 EAELAASSWSYAVEPVG---ITKENALTKPGLMEQINTTADASDFLWYSTSIVVAGGEPY 500
Query: 454 LSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSAT 513
L+GS + L +NS G VL ++NG S +S PV L GKN+I LLSAT
Sbjct: 501 LNGSQS-NLLVNSLGHVLQVFINGKLAGSSKGSASSSLISLTTPVTLVTGKNKIDLLSAT 559
Query: 514 VGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKA 573
VGL NYG+ FD+V GI GPV L G G DLSS +WTY++GL G +D YN
Sbjct: 560 VGLTNYGAFFDLVGAGITGPVKLTGPKG----TLDLSSAEWTYQIGLRG-EDLHLYNPSE 614
Query: 574 ANSERGWSSKN-VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPT 632
A+ E W S N P N +TWYK+ F AP +DPV ++ GMGKG AWVNG ++GRYWPT
Sbjct: 615 ASPE--WVSDNSYPTNNPLTWYKSKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPT 672
Query: 633 YLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNP 692
+A + GC SC+YRG Y + KC CG PSQI YHVPRS+++ G N +VLFE+FGGNP
Sbjct: 673 NIAPQSGC-VNSCNYRGSYSATKCLKKCGQPSQILYHVPRSFLQPGSNDIVLFEQFGGNP 731
Query: 693 SQINFQTVVVGTACGQAHENK-------------------TMELTC--HGRRISEIKYAS 731
S+I+F T + C E+ + L C G+ IS IK+AS
Sbjct: 732 SKISFTTKQTESVCAHVSEDHPDQIDSWVSSQQKLQRSGPALRLECPKEGQVISSIKFAS 791
Query: 732 FGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVV 791
FG P G CG++ G C + L + ++ CVG SCS+ S N G G K LVV
Sbjct: 792 FGTPSGTCGSYSHGECSSS-QALAVAQEACVGVSSCSVPVSAKNFGDP--CRGVTKSLVV 848
Query: 792 EALC 795
EA C
Sbjct: 849 EAAC 852
>gi|125543160|gb|EAY89299.1| hypothetical protein OsI_10800 [Oryza sativa Indica Group]
Length = 861
Score = 782 bits (2020), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 419/849 (49%), Positives = 526/849 (61%), Gaps = 93/849 (10%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
S A V++D RA+ IDG R++L+SGSIHYPRSTP MWP LI+K+K+GGLD IETYVFW+
Sbjct: 28 SRAANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFWDI 87
Query: 84 HEPLR---RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGI 140
HEP+R +QYDF G DL+RF+K + D GLYV LRIGPYVCAEWNYGGFPVWLH +PGI
Sbjct: 88 HEPVRGQAQQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGI 147
Query: 141 EELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKS 200
+ RT N+ F EMQ FT +VD K L+ASQGGPIIL+QIENEYGN+ S YG AGK+
Sbjct: 148 K-FRTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKA 206
Query: 201 YINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGW 249
Y+ W A MA SLD GVPW+MCQ+SDAP P+ FTPN+ + PK+WTENW+GW
Sbjct: 207 YMRWAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGW 266
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAP 309
F S+GG P R AEDLAFAVARF+Q GGTFQNYYMYHGGTNFGR++GGP++ TSYDYDAP
Sbjct: 267 FLSFGGAVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAP 326
Query: 310 IDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG------------NSV----- 352
IDEYG + QPKWGHLR++HK +K E L + + G NS+
Sbjct: 327 IDEYGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSICAAFL 386
Query: 353 -------------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRP----NQAG 395
+G++Y LPAWSVSILPDCK NTA++N+Q R Q
Sbjct: 387 ANVDAQSDKAVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDT 446
Query: 396 NDQ------APLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLK 448
+D A W + E + + + L++Q +T D SD+LWY T+ +K
Sbjct: 447 DDSLITPELATAGWSYAIEPVG---ITKENALTKPGLMEQINTTADASDFLWYSTSIVVK 503
Query: 449 DDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQIS 508
D+P L+GS + L +NS G VL Y+NG S +S + PV L GKN+I
Sbjct: 504 GDEPYLNGSQS-NLLVNSLGHVLQVYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKID 562
Query: 509 LLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKF 568
LLS TVGL NYG+ FD++ G+ GPV L G G +LSS WTY++GL G +D
Sbjct: 563 LLSTTVGLSNYGAFFDLIGAGVTGPVKLSGPNG----ALNLSSTDWTYQIGLRG-EDLHL 617
Query: 569 YNAKAANSERGWSSKNV-PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLG 627
YN A+ E W S N P N+ + WYKT F AP +DPV ++ GMGKG AWVNG ++G
Sbjct: 618 YNPSEASPE--WVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIG 675
Query: 628 RYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEE 687
RYWPT LA + GC SC+YRG Y S+KC CG PSQ YHVPRS+++ G N LVLFE+
Sbjct: 676 RYWPTNLAPQSGC-VNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQ 734
Query: 688 FGGNPSQINFQTVVVGTACGQAHE-------------------NKTMELTC--HGRRISE 726
FGG+PS I+F T + C E + L C G+ IS
Sbjct: 735 FGGDPSMISFTTRQTSSICAHVSEMHPAQIDSWISPQQTSQTPGPALRLECPREGQVISN 794
Query: 727 IKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTV 786
IK+ASFG P G CG + G C + L ++++ CVG +CS+ S N G +G
Sbjct: 795 IKFASFGTPSGTCGNYNHGECSSS-QALAVVQEACVGMTNCSVPVSSNNFG--DPCSGVT 851
Query: 787 KRLVVEALC 795
K LVVEA C
Sbjct: 852 KSLVVEAAC 860
>gi|357130214|ref|XP_003566745.1| PREDICTED: beta-galactosidase 13-like [Brachypodium distachyon]
Length = 829
Score = 780 bits (2015), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/846 (48%), Positives = 548/846 (64%), Gaps = 74/846 (8%)
Query: 3 TLKHCSRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPD 62
T+ S A++L LI + + V+++ RA+ IDG+R+I+LSGSIHYPRSTP MWPD
Sbjct: 5 TMARASLALVLLLITAAV-GAANCTTVAYNDRALVIDGQRRIVLSGSIHYPRSTPEMWPD 63
Query: 63 LIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVC 122
LIKKAKEGGLDAIETYVFWN HEP RQY+F GN D++RF K IQ+ G+Y ILRIGPY+C
Sbjct: 64 LIKKAKEGGLDAIETYVFWNGHEPRPRQYNFAGNYDIVRFFKEIQNAGMYAILRIGPYIC 123
Query: 123 AEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQ 182
EWNYGG P WL ++PG++ R N+ F +EM+ FTTLIV+ K +FA QGGPIIL+Q
Sbjct: 124 GEWNYGGLPAWLRDIPGMQ-FRMHNQPFEHEMETFTTLIVNKLKDANMFAGQGGPIILSQ 182
Query: 183 IENEYGNVMSDYGDA--GKSYINWCAKMATSLDIGVPWIMCQE-SDAPSPMFT------- 232
IENEYGN+M++ DA YI+WCA MA ++GVPWIMCQ+ +D P +
Sbjct: 183 IENEYGNIMANLTDAQSASEYIHWCAAMANKQNVGVPWIMCQQDADVPPNVINTCNGFYC 242
Query: 233 ----PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGG 288
P + PKIWTENWTGWFK+W D R+A+D+AFAVA FFQ G+ QNYYMYHGG
Sbjct: 243 HDWFPKRTDIPKIWTENWTGWFKAWDKPDFHRSAQDIAFAVAMFFQKRGSLQNYYMYHGG 302
Query: 289 TNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDY 348
TNFGRT+GGPY+TTSYDYDAP+DEYG++ +PK+GHL++LH +LKSMEK L +G+ ++ +Y
Sbjct: 303 TNFGRTAGGPYITTSYDYDAPLDEYGNIREPKYGHLKDLHAVLKSMEKILVHGDFSDINY 362
Query: 349 GNSVS----------------------------GSSYNLPAWSVSILPDCKTEEFNTAKV 380
G +V+ G+++ +PAWSVS+LPDCK +NTAK+
Sbjct: 363 GRNVTVTKYTLDGSSVCFISNQFDDRDANATIDGTTHVVPAWSVSVLPDCKAVAYNTAKI 422
Query: 381 NTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYL 439
QT+V VK+PN + L+W W PE + F+ KG F N L++Q +T+ D SDYL
Sbjct: 423 KAQTSVMVKKPNTVEQEPENLKWSWMPEHLKPFMTDEKGSFRKNELLEQITTSTDQSDYL 482
Query: 440 WYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVK 499
WY T+ + K G + L +N++G ++A+VNG Q + GA E PVK
Sbjct: 483 WYRTSFEHK-------GEAKYKLSVNTTGHQIYAFVNGKLAGRQHSPNGAFIFQLESPVK 535
Query: 500 LTRGKNQISLLSATVGLQNYGSKFDMVPNGIP-GPVLLVGRAGDETIIKDLSSHKWTYKV 558
L GKN +SLLSAT+GL+NYG+ F+++P GI GPV LV G DLS+ W+YK
Sbjct: 536 LHDGKNYLSLLSATMGLKNYGALFELMPAGIVGGPVKLVDNNGSTI---DLSNSSWSYKA 592
Query: 559 GLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGF 618
GL G + ++ + K G + +P+NR TWYK TF+AP + VV +L G+ KG
Sbjct: 593 GLAG-EHRQIHLDKPGYKWHG-DNGTIPINRAFTWYKATFQAPAGEEAVVADLMGLNKGV 650
Query: 619 AWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSD----KCAYNCGNPSQIWYHVPRSW 674
AWVNG NLGRYWP+Y+A E G CDYRG + ++ KC C P+Q +YHVPR +
Sbjct: 651 AWVNGNNLGRYWPSYVAAEMG-GCHHCDYRGAFKAEGDGLKCLTGCNEPAQRFYHVPRVF 709
Query: 675 IKDGV-NTLVLFEEFGGNPSQINFQTVVVGTACGQAHEN-KTMELTC---HGRRISEIKY 729
++ G NT+VLFEE GG+PS++ F TV VG C +A E + L+C GR IS +
Sbjct: 710 LRAGEPNTVVLFEEAGGDPSRVGFHTVAVGPVCVEAAEKGDNVTLSCGQHKGRTISSVDL 769
Query: 730 ASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRL 789
AS+G +G CGA+ +G CE++ E CVGK+SC+++ ++A GA C +G L
Sbjct: 770 ASYGVTRGQCGAY-QGGCESKAAYEAFAEA-CVGKESCTVQHTDAFSGA-GCQSGV---L 823
Query: 790 VVEALC 795
V+A C
Sbjct: 824 TVQATC 829
>gi|356543466|ref|XP_003540181.1| PREDICTED: beta-galactosidase 8-like isoform 2 [Glycine max]
Length = 848
Score = 780 bits (2015), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/859 (49%), Positives = 530/859 (61%), Gaps = 99/859 (11%)
Query: 12 LLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGG 71
LLC+ LF ++ Y D RA+ IDG+R++L+SGSIHYPRSTP MWPDLI+K+K+GG
Sbjct: 13 LLCIHTPKLFCANVEY----DHRALVIDGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGG 68
Query: 72 LDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFP 131
LD IETYVFWN HEP+R QYDF G DL++F+KT+ GLYV LRIGPYVCAEWNYGGFP
Sbjct: 69 LDVIETYVFWNLHEPVRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAEWNYGGFP 128
Query: 132 VWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVM 191
VWLH +PGI+ RT N+ F EM+ FT IVDM K+EKL+ASQGGP+IL+QIENEYGN+
Sbjct: 129 VWLHFIPGIK-FRTDNEPFKAEMKRFTAKIVDMIKQEKLYASQGGPVILSQIENEYGNID 187
Query: 192 SDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPK 240
+ YG AGKSYI W A MATSLD GVPW+MC ++DAP P+ FTPN+ PK
Sbjct: 188 TAYGAAGKSYIKWAATMATSLDTGVPWVMCLQADAPDPIINTWNGFYGDEFTPNSNTKPK 247
Query: 241 IWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL 300
+WTENW+GWF +GG P R EDLAFAVARFFQ GGTFQNYYMYHGGTNF R SGGP++
Sbjct: 248 MWTENWSGWFLVFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRASGGPFI 307
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSV-------- 352
TSYDYDAPIDEYG + QPKWGHL+E+HK +K E+ L + T T G ++
Sbjct: 308 ATSYDYDAPIDEYGIIRQPKWGHLKEVHKAIKLCEEALIATDPTITSLGPNLEAAVYKTG 367
Query: 353 --------------------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV-----K 387
SG+SY+LPAWSVSILPDCK+ NTAK+N+ + + +
Sbjct: 368 SVCAAFLANVGTKSDVTVNFSGNSYHLPAWSVSILPDCKSVVLNTAKINSASAISSFTTE 427
Query: 388 VKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNAD 446
+ + ++ + W W E + + F+ L++Q +T D SDYLWY + D
Sbjct: 428 SSKEDIGSSEASSTGWSWISEPVG---ISKTDSFSQTGLLEQINTTADKSDYLWYSLSID 484
Query: 447 LKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKY--------GASNDLFERPV 498
K D SS L I S G LHA++NG K+ G + PV
Sbjct: 485 YKAD-----ASSQTVLHIESLGHALHAFINGKLAGKYKLKHSQLIICNSGKYKFTVDIPV 539
Query: 499 KLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKV 558
L GKN I LLS TVGLQNYG+ FD GI GPV+L G A T+ DLSS KWTY+V
Sbjct: 540 TLVAGKNTIDLLSLTVGLQNYGAFFDTWGVGITGPVILKGFANGNTL--DLSSQKWTYQV 597
Query: 559 GLYGLDDKKFYNAKAANSERGWSSKNV-PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKG 617
GL G D ++ S W+ ++ P N+ +TWYKTTF AP +DPV ++ GMGKG
Sbjct: 598 GLQGED-----LGLSSGSSGQWNLQSTFPKNQPLTWYKTTFSAPSGSDPVAIDFTGMGKG 652
Query: 618 FAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKD 677
AWVNG +GRYWPTY+A + C T+SC+YRGPY + KC NC PSQ YHVPRSW+K
Sbjct: 653 EAWVNGQRIGRYWPTYVASDASC-TDSCNYRGPYSASKCRKNCEKPSQTLYHVPRSWLKP 711
Query: 678 GVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHENK-------------------TMELT 718
N LVLFEE GG+P+QI+F T + C ++ + LT
Sbjct: 712 SGNILVLFEERGGDPTQISFVTKQTESLCAHVSDSHPPPVDLWNSETESGRKVGPVLSLT 771
Query: 719 C--HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANL 776
C + IS IK+AS+G P G CG F G C + L +++K C+G SCS+ S
Sbjct: 772 CPHDNQVISSIKFASYGTPLGTCGNFYHGRCSSN-KALSIVQKACIGSSSCSVGVSSDTF 830
Query: 777 GATSCAAGTVKRLVVEALC 795
G G K L VEA C
Sbjct: 831 GDP--CRGMAKSLAVEATC 847
>gi|61162203|dbj|BAD91083.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 842
Score = 780 bits (2015), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 413/840 (49%), Positives = 519/840 (61%), Gaps = 83/840 (9%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
S +V++D RA+ IDG+R++L+SGSIHYPRSTP MWPDLI+K+K+GGLD IETYVFWN
Sbjct: 17 SYCAKVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNL 76
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
HE +R QYDF G DL++F+KT+ + GLYV LRIGPYVCAEWNYGGFP+WLH +PGI +L
Sbjct: 77 HEAVRGQYDFGGRKDLVKFVKTVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGI-QL 135
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYIN 203
RT N+ F EMQ FT IVDM KKEKL+ASQGGPIIL+QIENEYGN+ YG A ++YI
Sbjct: 136 RTDNEPFKAEMQRFTAKIVDMMKKEKLYASQGGPIILSQIENEYGNIDRAYGAAAQTYIK 195
Query: 204 WCAKMATSLDIGVPWIMCQESDAPSPMF-----------TPNNPNS-PKIWTENWTGWFK 251
W A MA SLD GVPW+MCQ+ DAP + TP P PK+WTENW+GWF
Sbjct: 196 WAADMAVSLDTGVPWVMCQQDDAPPSVISTCNGFYCDQWTPRLPEKRPKMWTENWSGWFL 255
Query: 252 SWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPID 311
S+GG P+R EDLAFAVARFFQ GGTFQNYYMYHGGTNFGR++GGP++ TSYDYDAPID
Sbjct: 256 SFGGAVPQRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPID 315
Query: 312 EYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSV------------------- 352
EYG L QPKWGHL+++HK +K E+ + + + +G +V
Sbjct: 316 EYGLLRQPKWGHLKDVHKAIKLCEEAMVATDPKYSSFGPNVEATVYKTGSACAAFLANSD 375
Query: 353 ---------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQ- 402
+G+SY+LPAWSVSILPDCK NTAK+N+ + + +D +
Sbjct: 376 TKSDATVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSAAMIPSFMHHSVLDDIDSSEA 435
Query: 403 ----WKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGS 457
W W E + + K F L++Q +T D SDYLWY + D+ D L
Sbjct: 436 LGSGWSWINEPVG---ISKKDAFTRVGLLEQINTTADKSDYLWYSLSIDVTSSDTFLQDG 492
Query: 458 SNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQ 517
S L + S G LHA++NG + PV GKN I LLS T+GLQ
Sbjct: 493 SQTILHVESLGHALHAFINGKPAGRGIITANNGKISVDIPVTFASGKNTIDLLSLTIGLQ 552
Query: 518 NYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSE 577
NYG+ FD GI GPV L G T DLSS +WTY++GL G D + ++ S
Sbjct: 553 NYGAFFDKSGAGITGPVQLKGLKNGTTT--DLSSQRWTYQIGLQGED-----SGFSSGSS 605
Query: 578 RGWSSK-NVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAE 636
W S+ +P + +TWYK TF AP ++PV L+ GMGKG AWVNG ++GRYWPT A
Sbjct: 606 SQWISQPTLPKKQPLTWYKATFNAPDGSNPVALDFTGMGKGEAWVNGQSIGRYWPTNNAP 665
Query: 637 EDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQIN 696
GC +SC++RGPY S+KC NCG PSQ YHVPRSW+K NTLVLFEE GG+P+QI+
Sbjct: 666 TSGCP-DSCNFRGPYDSNKCRKNCGKPSQELYHVPRSWLKPSGNTLVLFEEIGGDPTQIS 724
Query: 697 FQTVVVGTACGQAHENK---------------------TMELTCHGRRISEIKYASFGDP 735
F T + + C E+ ++E + IS IK+AS+G P
Sbjct: 725 FATRQIESLCSHVSESHPSPVDTWSSDSKAGRKLGPVLSLECPFPNQVISSIKFASYGKP 784
Query: 736 QGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
QG CG+F G C++ L +++K CVG KSCSIE S G G K L VEA C
Sbjct: 785 QGTCGSFSHGQCKS-TSALSIVQKACVGSKSCSIEVSVKTFGDP--CKGVAKSLAVEASC 841
>gi|357113057|ref|XP_003558321.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 6-like
[Brachypodium distachyon]
Length = 852
Score = 780 bits (2013), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/844 (48%), Positives = 523/844 (61%), Gaps = 88/844 (10%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
S A V++D RA+ IDG R++L+SGSIHYPRSTP MWP L++KAK+GGLD +ETYVFW+
Sbjct: 24 SSATNVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLMQKAKDGGLDVVETYVFWDI 83
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
HE QYDF G DL+RF+K D GLYV LRIGPYVCAEWNYGGFP+WLH +PGI+
Sbjct: 84 HETATXQYDFEGRKDLVRFVKAAADTGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIK-F 142
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYIN 203
RT N+ F EMQ FT +V K L+ASQGGPIIL+QIENEYGN+ S YG AGKSYI
Sbjct: 143 RTDNEPFKTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIR 202
Query: 204 WCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKS 252
W A MA +LD GVPW+MCQ++DAP P+ FTPN+ + PK+WTENW+GWF S
Sbjct: 203 WAAGMAVALDTGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSNSKPKLWTENWSGWFLS 262
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDE 312
+GG P R EDLAFAVARF+Q GGT QNYYMYHGGTNFGR+SGGP+++TSYDYDAPIDE
Sbjct: 263 FGGAVPYRPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDE 322
Query: 313 YGHLNQPKWGHLRELHKLLKSMEKTLT-------------------YGNV---------T 344
YG + QPKWGHL+++HK +K E L G+V T
Sbjct: 323 YGLVRQPKWGHLKDVHKAIKQCEPALIATDPSYMSMGQNAEAHVYKAGSVCAAFLANMDT 382
Query: 345 NTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR---PNQAGNDQAPL 401
+D + +G++Y LPAWSVSILPDCK NTA++N+QT R + +D + +
Sbjct: 383 QSDKTVTFNGNAYKLPAWSVSILPDCKNVVLNTAQINSQTTTSEMRSLGSSTKASDGSSI 442
Query: 402 Q-------WKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPI 453
+ W + E + + + L++Q +T D SD+LWY T+ +K +P
Sbjct: 443 ETELALSGWSYAIEPVG---ITTENALTKPGLMEQINTTADASDFLWYSTSVVVKGGEPY 499
Query: 454 LSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSAT 513
L+GS + L +NS G VL AY+NG + S +S + P+ L GKN+I LLS T
Sbjct: 500 LNGSQS-NLLVNSLGHVLQAYINGKFAGSAKGSATSSLISLQTPITLVPGKNKIDLLSGT 558
Query: 514 VGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKA 573
VGL NYG+ FD+V GI GPV L G G + DLSS WTY+VGL G + YN
Sbjct: 559 VGLSNYGAFFDLVGAGITGPVKLSGPKG----VLDLSSTDWTYQVGLRG-EGLHLYNPSE 613
Query: 574 ANSERGW-SSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPT 632
A+ E W S K P N+ + WYK+ F P +DPV ++ GMGKG AWVNG ++GRYWPT
Sbjct: 614 ASPE--WVSDKAYPTNQPLIWYKSKFTTPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPT 671
Query: 633 YLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNP 692
LA + GC SC+YRGPY S KC CG PSQ YHVPRS+++ G N +VLFE+FGG+P
Sbjct: 672 NLAPQSGC-VNSCNYRGPYSSSKCLKKCGQPSQTLYHVPRSFLQPGSNDIVLFEQFGGDP 730
Query: 693 SQINFQTVVVGTACGQAHENK-------------------TMELTC--HGRRISEIKYAS 731
S+I+F T + C E+ + L C G+ IS IK+AS
Sbjct: 731 SKISFTTKQTASVCAHVSEDHPDQIDSWISPQQKVQRSGPALRLECPKAGQVISSIKFAS 790
Query: 732 FGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVV 791
FG P G CG + G C + L + ++ C+G SCS+ S N G G K LVV
Sbjct: 791 FGTPSGTCGNYNHGECSSP-QALAVAQEACIGVSSCSVPVSTKNFGDP--CTGVTKSLVV 847
Query: 792 EALC 795
EA C
Sbjct: 848 EAAC 851
>gi|125583741|gb|EAZ24672.1| hypothetical protein OsJ_08441 [Oryza sativa Japonica Group]
Length = 861
Score = 780 bits (2013), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 419/849 (49%), Positives = 525/849 (61%), Gaps = 93/849 (10%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
S A V++D RA+ IDG R++L+SGSIHYPRSTP MWP LI+K+K+GGLD IETYVFW+
Sbjct: 28 SRAANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFWDI 87
Query: 84 HEPLR---RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGI 140
HE +R +QYDF G DL+RF+K + D GLYV LRIGPYVCAEWNYGGFPVWLH +PGI
Sbjct: 88 HEAVRGQAQQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGI 147
Query: 141 EELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKS 200
+ RT N+ F EMQ FT +VD K L+ASQGGPIIL+QIENEYGN+ S YG AGK+
Sbjct: 148 K-FRTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKA 206
Query: 201 YINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGW 249
Y+ W A MA SLD GVPW+MCQ+SDAP P+ FTPN+ + PK+WTENW+GW
Sbjct: 207 YMRWAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGW 266
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAP 309
F S+GG P R AEDLAFAVARF+Q GGTFQNYYMYHGGTNFGR++GGP++ TSYDYDAP
Sbjct: 267 FLSFGGAVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAP 326
Query: 310 IDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG------------NSV----- 352
IDEYG + QPKWGHLR++HK +K E L + + G NS+
Sbjct: 327 IDEYGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSICAAFL 386
Query: 353 -------------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRP----NQAG 395
+G++Y LPAWSVSILPDCK NTA++N+Q R Q
Sbjct: 387 ANVDAQSDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDT 446
Query: 396 NDQ------APLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLK 448
+D A W + E + + + L++Q +T D SD+LWY T+ +K
Sbjct: 447 DDSLITPELATAGWSYAIEPVG---ITKENALTKPGLMEQINTTADASDFLWYSTSIVVK 503
Query: 449 DDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQIS 508
D+P L+GS + L +NS G VL Y+NG S +S + PV L GKN+I
Sbjct: 504 GDEPYLNGSQS-NLLVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKID 562
Query: 509 LLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKF 568
LLS TVGL NYG+ FD+V G+ GPV L G G +LSS WTY++GL G +D
Sbjct: 563 LLSTTVGLSNYGAFFDLVGAGVTGPVKLSGPNG----ALNLSSTDWTYQIGLRG-EDLHL 617
Query: 569 YNAKAANSERGWSSKNV-PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLG 627
YN A+ E W S N P N+ + WYKT F AP +DPV ++ GMGKG AWVNG ++G
Sbjct: 618 YNPSEASPE--WVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIG 675
Query: 628 RYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEE 687
RYWPT LA + GC SC+YRG Y S+KC CG PSQ YHVPRS+++ G N LVLFE+
Sbjct: 676 RYWPTNLAPQSGC-VNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQ 734
Query: 688 FGGNPSQINFQTVVVGTACGQAHE-------------------NKTMELTC--HGRRISE 726
FGG+PS I+F T + C E + L C G+ IS
Sbjct: 735 FGGDPSMISFTTRQTSSICAHVSEMHPAQIDSWISPQQTSQTQGPALRLECPREGQVISN 794
Query: 727 IKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTV 786
IK+ASFG P G CG + G C + L ++++ CVG +CS+ S N G +G
Sbjct: 795 IKFASFGTPSGTCGNYNHGECSSS-QALAVVQEACVGMTNCSVPVSSNNFGDP--CSGVT 851
Query: 787 KRLVVEALC 795
K LVVEA C
Sbjct: 852 KSLVVEAAC 860
>gi|357455519|ref|XP_003598040.1| Beta-galactosidase [Medicago truncatula]
gi|355487088|gb|AES68291.1| Beta-galactosidase [Medicago truncatula]
Length = 812
Score = 778 bits (2008), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 408/830 (49%), Positives = 534/830 (64%), Gaps = 87/830 (10%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
++ CL L L+ S A V +D AI ++GERK+++SG+IHYPRST MWPDLI KAK+G
Sbjct: 10 LIACLAL--LYTCSSATTVEYDSSAIILNGERKLIISGAIHYPRSTSQMWPDLIMKAKDG 67
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
LDAIETY+FW+ HEP+RR+YDF+GNLD I+F+K Q+QGLYV+LRIGPYVCAEWNYGGF
Sbjct: 68 DLDAIETYIFWDLHEPVRRKYDFSGNLDFIKFLKIAQEQGLYVVLRIGPYVCAEWNYGGF 127
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
P+WLHNMPGI+ LRT N VF EM+ FTT IV M K+ LFA QGGPIILAQIENEYG+V
Sbjct: 128 PMWLHNMPGIQ-LRTDNAVFKEEMKIFTTKIVTMCKEAGLFAPQGGPIILAQIENEYGDV 186
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSP 239
+S YG+AG SYI WCA+MA + +IGVPWIMC++ +AP+ + F PNNP SP
Sbjct: 187 ISHYGEAGNSYIKWCAEMALAQNIGVPWIMCKQKNAPATIIDTCNGYYCDTFKPNNPKSP 246
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
KI+TENW GWF+ WG + P RTAED AF+VARFFQ GG QNYY+YHGGTNFGRT+GGP+
Sbjct: 247 KIFTENWVGWFQKWGERRPHRTAEDSAFSVARFFQNGGALQNYYLYHGGTNFGRTAGGPF 306
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYN- 358
+ T+YDYDAP+DEYG+L +PK+GHL+ LH +K EK LT G T +G+S+ ++Y
Sbjct: 307 IITTYDYDAPLDEYGNLIEPKYGHLKRLHAAIKLGEKVLTNGTATWESHGDSLWMTTYTN 366
Query: 359 ------------------------------LPAWSVSILPDCKTEEFNTAKVNTQTNVKV 388
+PAWS+S+L DC E +NTAK QTN+ +
Sbjct: 367 KGTGQKFCFLSNSHTSKDAEVDLQQDGKYYVPAWSMSLLQDCNKEVYNTAKTEAQTNIYM 426
Query: 389 KRPNQA-GNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNAD 446
K+ +Q GN +W W + + D +GKG F + L+DQKS T SDYLWYMT
Sbjct: 427 KQLDQKLGNSP---EWSWTSDPMED-TFQGKGTFTASQLLDQKSVTVGASDYLWYMTEVV 482
Query: 447 LKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQ 506
+ D + +++N++G +L+ ++NG +Q + E + L +G N
Sbjct: 483 VNDTNTW----GKAKVQVNTTGHILYLFINGFLTGTQHGTVSQPGFIHEGNISLNQGTNI 538
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPG-PVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDD 565
ISLLS TVG NYG+ FDM GI G PV L ++ DLS W+YKVG+ G+
Sbjct: 539 ISLLSVTVGHANYGAFFDMQETGIVGGPVKLFSIENPNNVL-DLSKSTWSYKVGINGMT- 596
Query: 566 KKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYN 625
KKFY+ K + W + NV + MTWYKTTF+ P +PVVL+L G+ KG AWVNG +
Sbjct: 597 KKFYDPKTTIGVQ-WKTNNVSIGVPMTWYKTTFKTPDGTNPVVLDLIGLQKGEAWVNGQS 655
Query: 626 LGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLF 685
+GRYWP LAE GCS ++CDYRG Y +DKC CG PSQ +YHVPRS++ + VNTLVLF
Sbjct: 656 IGRYWPAMLAENKGCS-DTCDYRGEYNADKCLSGCGEPSQRFYHVPRSFLNNDVNTLVLF 714
Query: 686 EEFGGNPSQINFQTVVVGTACGQAHENKTMELTCHGRRISEIKYASFGDPQGACGAFKKG 745
EE G + + N G+ +SEI++AS+GDP+G+CG+FK G
Sbjct: 715 EEMGFDATPFN------------------------GKTMSEIQFASYGDPEGSCGSFKIG 750
Query: 746 SCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
E+ ++EK C+GK+SCSI + + GT +L V+ C
Sbjct: 751 EWESRYSK-TVVEKACIGKQSCSINVTSSTFRLKK--GGTNGQLAVQLSC 797
>gi|218184317|gb|EEC66744.1| hypothetical protein OsI_33101 [Oryza sativa Indica Group]
Length = 824
Score = 776 bits (2004), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 413/820 (50%), Positives = 527/820 (64%), Gaps = 75/820 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V+++ R++ IDGER+I++SGSIHYPRSTP MWPDLIKKAKEGGLDAIETYVFWN HEP R
Sbjct: 27 VAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 86
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
RQY+F GN D+IRF K IQ+ GLY ILRIGPY+C EWNYGG P WL ++P ++ R N
Sbjct: 87 RQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQ-FRMHNA 145
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGD--AGKSYINWCA 206
F NEM+NFTTLI++ K +FA QGGPIILAQIENEYGNVM + + YI+WCA
Sbjct: 146 PFENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEYIHWCA 205
Query: 207 KMATSLDIGVPWIMCQE-SDAPSPMFT-----------PNNPNSPKIWTENWTGWFKSWG 254
MA ++GVPWIMCQ+ SD P + PN PKIWTENWTGWFK+W
Sbjct: 206 DMANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWD 265
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
D R+AED+AFAVA FFQ G+ QNYYMYHGGTNFGRTSGGPY+TTSYDYDAP+DEYG
Sbjct: 266 KPDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYG 325
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS--------------------- 353
+L QPK+GHL++LH ++KS+EK L +G +T+Y ++V+
Sbjct: 326 NLRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDNVTVTKYTLGSTSACFINNRNDNK 385
Query: 354 -------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
G+++ LPAWSVSILPDCKT FN+AK+ QT + VK+ N + L+W W
Sbjct: 386 DLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTTIMVKKANMVEKEPENLKWSWM 445
Query: 407 PEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
E + F+ KG + N L++Q T+ D SDYLWY T+ D K G ++ TL +N
Sbjct: 446 RENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDHK-------GEASYTLFVN 498
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
++G L+A+VNG V + G E VKL GKN ISLLSAT+GL+NYG F+
Sbjct: 499 TTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKNYGPLFEK 558
Query: 526 VPNGIP-GPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
+P GI GPV L+ G DLS+ W+YK GL G ++ W + N
Sbjct: 559 MPAGIVGGPVKLIDNNGTGI---DLSNSSWSYKAGLAG----EYRQIHLDKPGYRWDNNN 611
Query: 585 --VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCST 642
VP+NR TWYKTTF+AP D VV++L G+ KG AWVNG NLGRYWP+Y A E G
Sbjct: 612 GTVPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMG-GC 670
Query: 643 ESCDYRGPYGSD----KCAYNCGNPSQIWYHVPRSWIKDGV-NTLVLFEEFGGNPSQINF 697
CDYRG + ++ KC CG PSQ +YHVPRS++K+G NTL+LFEE GG+PSQ+ F
Sbjct: 671 HHCDYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPSQVIF 730
Query: 698 QTVVVGTACGQAHENKTMELTC--HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLP 755
+VV G+ C A + L+C H + IS I SFG +G CGA+ +G CE++
Sbjct: 731 HSVVAGSVCVSAEVGDAITLSCGQHSKTISTIDVTSFGVARGQCGAY-EGGCESKAAYKA 789
Query: 756 LIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
E C+GK+SC+++ A L + C +G L V+A C
Sbjct: 790 FTEA-CLGKESCTVQIINA-LTGSGCLSGV---LTVQASC 824
>gi|125556152|gb|EAZ01758.1| hypothetical protein OsI_23787 [Oryza sativa Indica Group]
Length = 828
Score = 776 bits (2004), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 411/820 (50%), Positives = 530/820 (64%), Gaps = 75/820 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V+++ R++ IDGER+I++SGSIHYPRSTP MWPDLIKKAKEGGLDAIETYVFWN HEP R
Sbjct: 31 VTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
RQY+F GN D++RF K IQ+ GLY ILRIGPY+C EWNYGG P WL ++PG++ R N
Sbjct: 91 RQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQ-FRLHNA 149
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGD--AGKSYINWCA 206
F NEM+ FTTLIV+ K +FA QGGPIILAQIENEYGN+M + + YI+WCA
Sbjct: 150 PFENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEYIHWCA 209
Query: 207 KMATSLDIGVPWIMCQE-SDAPSPMFT-----------PNNPNSPKIWTENWTGWFKSWG 254
MA ++GVPWIMCQ+ SD P + PN PKIWTENWTGWFK+W
Sbjct: 210 DMANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWD 269
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
D R+AED+AFAVA FFQ G+ QNYYMYHGGTNFGRTSGGPY+TTSYDYDAP+DEYG
Sbjct: 270 KPDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYG 329
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS--------------------- 353
+L QPK+GHL++LH ++KS+EK L +G +T+Y + V+
Sbjct: 330 NLRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDKVTVTKYTLDSTSACFINNRNDNM 389
Query: 354 -------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
G+++ LPAWSVSILPDCKT FN+AK+ QT V V + N + L+W W
Sbjct: 390 DVNVTLDGTTHLLPAWSVSILPDCKTVAFNSAKIKAQTTVMVNKANMVEKEPESLKWSWM 449
Query: 407 PEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
E + F+ KG + N L++Q T+ D SDYLWY T+ + K G ++ TL +N
Sbjct: 450 RENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSINHK-------GEASYTLFVN 502
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
++G L+A+VNG V + G E P KL GKN ISLLSAT+GL+NYG F+
Sbjct: 503 TTGHELYAFVNGMLVGQNHSPNGHFVFQLESPAKLHDGKNYISLLSATIGLKNYGPLFEK 562
Query: 526 VPNGIP-GPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
+P GI GPV L+ G DLS+ W+YK GL G + ++ + K + W + N
Sbjct: 563 MPAGIVGGPVKLIDNNGKGI---DLSNSSWSYKAGLAG-EYRQIHLDKPGCT---WDNNN 615
Query: 585 --VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCST 642
VP+N+ TWYKTTF+AP D VV++L G+ KG AWVNG NLGRYWP+Y A E G
Sbjct: 616 GTVPINKPFTWYKTTFQAPAGEDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMG-GC 674
Query: 643 ESCDYRGPYGSD----KCAYNCGNPSQIWYHVPRSWIKDGV-NTLVLFEEFGGNPSQINF 697
CDYRG + ++ KC CG PSQ +YHVPRS++K+G NTL+LFEE GG+PS ++F
Sbjct: 675 HHCDYRGVFQAEGDGQKCLTGCGEPSQRFYHVPRSFLKNGEPNTLILFEEAGGDPSHVSF 734
Query: 698 QTVVVGTACGQAHENKTMELTC--HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLP 755
+TV G+ C A T+ L+C H + IS I SFG +G CGA+ KG CE++
Sbjct: 735 RTVAAGSVCASAEVGDTITLSCGQHSKTISAINMTSFGVARGQCGAY-KGGCESKAAYKA 793
Query: 756 LIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
E C+GK+SC+++ + A G + C + L V+A C
Sbjct: 794 FTEA-CLGKESCTVQITNAVTG-SGCLSNV---LTVQASC 828
>gi|156106159|gb|ABU49386.1| beta-galactosidase 15 [Oryza sativa Indica Group]
Length = 828
Score = 775 bits (2002), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/820 (50%), Positives = 527/820 (64%), Gaps = 75/820 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V+++ R++ IDGER+I++SGSIHYPRSTP MWPDLIKKAKEGGLDAIETYVFWN HEP R
Sbjct: 31 VTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
RQY+F GN D++RF K IQ+ GLY ILRIGPY+C EWNYGG P WL ++PG++ R N
Sbjct: 91 RQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQ-FRLHNA 149
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGD--AGKSYINWCA 206
F NEM+ FTTLIV+ K +FA QGGPIILAQIENEYGN+M + + YI+WCA
Sbjct: 150 PFENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEYIHWCA 209
Query: 207 KMATSLDIGVPWIMCQE-SDAPSPMFT-----------PNNPNSPKIWTENWTGWFKSWG 254
MA ++GVPWIMCQ+ SD P + PN PKIWTENWTGWFK+W
Sbjct: 210 DMANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWD 269
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
D R+AED+AFAVA FFQ G+ QNYYMYHGGTNFGRTSGGPY+TTSYDYDAP+DEYG
Sbjct: 270 KPDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYG 329
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS--------------------- 353
+L QPK+GHL++LH ++KS+EK L +G +T+Y ++V+
Sbjct: 330 NLRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDNVTVTKYTLGSTSACFINNRNDNK 389
Query: 354 -------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
G+++ LPAWSVSILPDCKT FN+AK+ QT + VK+ N + L+W W
Sbjct: 390 DLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTTIMVKKANMVEKEPENLKWSWM 449
Query: 407 PEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
E + F+ KG + N L++Q T+ D SDYLWY T+ D K G ++ TL +N
Sbjct: 450 RENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDHK-------GEASYTLFVN 502
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
++G L+A+VNG V + G E VKL GKN ISLLSAT+GL+NYG F+
Sbjct: 503 TTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKNYGPLFEK 562
Query: 526 VPNGIP-GPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
+P GI GPV L+ G DLS+ W+YK GL G ++ W + N
Sbjct: 563 MPAGIVGGPVKLIDNNGTGI---DLSNSSWSYKAGLAG----EYRQIHLDKPGYRWDNNN 615
Query: 585 --VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCST 642
VP+NR TWYKTTF+AP D VV++L G+ KG AWVNG NLGRYWP+Y A E G
Sbjct: 616 GTVPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMG-GC 674
Query: 643 ESCDYRGPYGSD----KCAYNCGNPSQIWYHVPRSWIKDGV-NTLVLFEEFGGNPSQINF 697
CDYRG + ++ KC CG PSQ +YHVPRS++K+G NTL+LFEE GG+PSQ+ F
Sbjct: 675 HHCDYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPSQVIF 734
Query: 698 QTVVVGTACGQAHENKTMELTC--HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLP 755
+VV G+ C A + L+C H + IS I SFG +G CGA+ +G CE++
Sbjct: 735 HSVVAGSVCVSAEVGDAITLSCGQHSKTISTIDVTSFGVARGQCGAY-EGGCESKAAYKA 793
Query: 756 LIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
E C+GK+SC+++ A L + C +G L V+A C
Sbjct: 794 FTEA-CLGKESCTVQIINA-LTGSGCLSGV---LTVQASC 828
>gi|115481546|ref|NP_001064366.1| Os10g0330600 [Oryza sativa Japonica Group]
gi|122249227|sp|Q7G3T8.1|BGL13_ORYSJ RecName: Full=Beta-galactosidase 13; Short=Lactase 13; Flags:
Precursor
gi|110288895|gb|AAP53027.2| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113638975|dbj|BAF26280.1| Os10g0330600 [Oryza sativa Japonica Group]
Length = 828
Score = 775 bits (2002), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/820 (50%), Positives = 526/820 (64%), Gaps = 75/820 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V+++ R++ IDGER+I++SGSIHYPRSTP MWPDLIKKAKEGGLDAIETYVFWN HEP R
Sbjct: 31 VAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
RQY+F GN D+IRF K IQ+ GLY ILRIGPY+C EWNYGG P WL ++P ++ R N
Sbjct: 91 RQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQ-FRMHNA 149
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGD--AGKSYINWCA 206
F NEM+NFTTLI++ K +FA QGGPIILAQIENEYGNVM + + YI+WCA
Sbjct: 150 PFENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEYIHWCA 209
Query: 207 KMATSLDIGVPWIMCQE-SDAPSPMFT-----------PNNPNSPKIWTENWTGWFKSWG 254
MA ++GVPWIMCQ+ SD P + PN PKIWTENWTGWFK+W
Sbjct: 210 DMANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWD 269
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
D R+AED+AFAVA FFQ G+ QNYYMYHGGTNFGRTSGGPY+TTSYDYDAP+DEYG
Sbjct: 270 KPDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYG 329
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS--------------------- 353
+L QPK+GHL++LH ++KS+EK L +G + +Y ++V+
Sbjct: 330 NLRQPKYGHLKDLHSVIKSIEKILVHGEYVDANYSDNVTVTKYTLGSTSACFINNRNDNK 389
Query: 354 -------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
G+++ LPAWSVSILPDCKT FN+AK+ QT + VK+ N + L+W W
Sbjct: 390 DLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTTIMVKKANMVEKEPESLKWSWM 449
Query: 407 PEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
E + F+ KG + N L++Q T+ D SDYLWY T+ D K G ++ TL +N
Sbjct: 450 RENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDHK-------GEASYTLFVN 502
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
++G L+A+VNG V + G E VKL GKN ISLLSAT+GL+NYG F+
Sbjct: 503 TTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKNYGPLFEK 562
Query: 526 VPNGIP-GPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
+P GI GPV L+ G DLS+ W+YK GL G ++ W + N
Sbjct: 563 MPAGIVGGPVKLIDNNGTGI---DLSNSSWSYKAGLAG----EYRQIHLDKPGYRWDNNN 615
Query: 585 --VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCST 642
VP+NR TWYKTTF+AP D VV++L G+ KG AWVNG NLGRYWP+Y A E G
Sbjct: 616 GTVPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMG-GC 674
Query: 643 ESCDYRGPYGSD----KCAYNCGNPSQIWYHVPRSWIKDGV-NTLVLFEEFGGNPSQINF 697
CDYRG + ++ KC CG PSQ +YHVPRS++K+G NTL+LFEE GG+PSQ+ F
Sbjct: 675 HHCDYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPSQVIF 734
Query: 698 QTVVVGTACGQAHENKTMELTC--HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLP 755
+VV G+ C A + L+C H + IS I SFG +G CGA+ +G CE++
Sbjct: 735 HSVVAGSVCVSAEVGDAITLSCGQHSKTISTIDVTSFGVARGQCGAY-EGGCESKAAYKA 793
Query: 756 LIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
E C+GK+SC+++ A L + C +G L V+A C
Sbjct: 794 FTEA-CLGKESCTVQIINA-LTGSGCLSGV---LTVQASC 828
>gi|16905220|gb|AAL31090.1|AC091749_19 putative beta-galactosidase [Oryza sativa Japonica Group]
gi|22655745|gb|AAN04162.1| Putative beta-galactosidase [Oryza sativa Japonica Group]
Length = 824
Score = 775 bits (2001), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/820 (50%), Positives = 526/820 (64%), Gaps = 75/820 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V+++ R++ IDGER+I++SGSIHYPRSTP MWPDLIKKAKEGGLDAIETYVFWN HEP R
Sbjct: 27 VAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 86
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
RQY+F GN D+IRF K IQ+ GLY ILRIGPY+C EWNYGG P WL ++P ++ R N
Sbjct: 87 RQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQ-FRMHNA 145
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGD--AGKSYINWCA 206
F NEM+NFTTLI++ K +FA QGGPIILAQIENEYGNVM + + YI+WCA
Sbjct: 146 PFENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEYIHWCA 205
Query: 207 KMATSLDIGVPWIMCQE-SDAPSPMFT-----------PNNPNSPKIWTENWTGWFKSWG 254
MA ++GVPWIMCQ+ SD P + PN PKIWTENWTGWFK+W
Sbjct: 206 DMANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWD 265
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
D R+AED+AFAVA FFQ G+ QNYYMYHGGTNFGRTSGGPY+TTSYDYDAP+DEYG
Sbjct: 266 KPDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYG 325
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS--------------------- 353
+L QPK+GHL++LH ++KS+EK L +G + +Y ++V+
Sbjct: 326 NLRQPKYGHLKDLHSVIKSIEKILVHGEYVDANYSDNVTVTKYTLGSTSACFINNRNDNK 385
Query: 354 -------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
G+++ LPAWSVSILPDCKT FN+AK+ QT + VK+ N + L+W W
Sbjct: 386 DLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTTIMVKKANMVEKEPESLKWSWM 445
Query: 407 PEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
E + F+ KG + N L++Q T+ D SDYLWY T+ D K G ++ TL +N
Sbjct: 446 RENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDHK-------GEASYTLFVN 498
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
++G L+A+VNG V + G E VKL GKN ISLLSAT+GL+NYG F+
Sbjct: 499 TTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKNYGPLFEK 558
Query: 526 VPNGIP-GPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
+P GI GPV L+ G DLS+ W+YK GL G ++ W + N
Sbjct: 559 MPAGIVGGPVKLIDNNGTGI---DLSNSSWSYKAGLAG----EYRQIHLDKPGYRWDNNN 611
Query: 585 --VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCST 642
VP+NR TWYKTTF+AP D VV++L G+ KG AWVNG NLGRYWP+Y A E G
Sbjct: 612 GTVPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMG-GC 670
Query: 643 ESCDYRGPYGSD----KCAYNCGNPSQIWYHVPRSWIKDGV-NTLVLFEEFGGNPSQINF 697
CDYRG + ++ KC CG PSQ +YHVPRS++K+G NTL+LFEE GG+PSQ+ F
Sbjct: 671 HHCDYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPSQVIF 730
Query: 698 QTVVVGTACGQAHENKTMELTC--HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLP 755
+VV G+ C A + L+C H + IS I SFG +G CGA+ +G CE++
Sbjct: 731 HSVVAGSVCVSAEVGDAITLSCGQHSKTISTIDVTSFGVARGQCGAY-EGGCESKAAYKA 789
Query: 756 LIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
E C+GK+SC+++ A L + C +G L V+A C
Sbjct: 790 FTEA-CLGKESCTVQIINA-LTGSGCLSGV---LTVQASC 824
>gi|218184335|gb|EEC66762.1| hypothetical protein OsI_33138 [Oryza sativa Indica Group]
Length = 828
Score = 775 bits (2000), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/819 (49%), Positives = 530/819 (64%), Gaps = 73/819 (8%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+DGR++ +DGER+I++SGSIHYPRSTP MWPDLIKKAKEGGL+AIETYVFWN HEP R
Sbjct: 31 VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
R+++F GN D++RF K IQ+ G+Y ILRIGPY+C EWNYGG PVWL ++PGI+ R NK
Sbjct: 91 REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPVWLRDIPGIK-FRLHNK 149
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVM--SDYGDAGKSYINWCA 206
F NEM+ FTTLIV K +FA QGGPIILAQIENEYG M + + YI+WCA
Sbjct: 150 PFENEMEAFTTLIVKKMKDANMFAGQGGPIILAQIENEYGYTMLQPENIQSAHEYIHWCA 209
Query: 207 KMATSLDIGVPWIMCQESDAPSPMFT------------PNNPNSPKIWTENWTGWFKSWG 254
MA ++GVPWIMCQ+ + P N + PK+WTENWTGW++ W
Sbjct: 210 DMANKQNVGVPWIMCQQDNDVPPNVVNTCNGFYCHEWFSNRTSIPKMWTENWTGWYRDWD 269
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
+ +R ED+AFAVA FFQ G+ QNYYMYHGGTNFGRT+GGPY+TTSYDYDAP+DEYG
Sbjct: 270 QPEFRRPTEDIAFAVAMFFQMRGSLQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYG 329
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS--------------------- 353
+L QPK+GHL+ELH +L SMEK L +G+ +T+YG++V+
Sbjct: 330 NLRQPKYGHLKELHSVLMSMEKILLHGDYIDTNYGDNVTVTKYTLNATSACFINNRFDDR 389
Query: 354 -------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
G+++ LPAWSVSILPDCKT FN+AK+ TQT V V + + +W W
Sbjct: 390 DVNVTLDGTTHFLPAWSVSILPDCKTVAFNSAKIKTQTTVMVNKTSMVEQQTEHFKWSWM 449
Query: 407 PEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
PE + F+ KG+F N L++Q +T D SDYLWY T+ + K G + L +N
Sbjct: 450 PENLRPFMTDEKGNFRKNELLEQIVTTTDQSDYLWYRTSLEHK-------GEGSYVLYVN 502
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
++G L+A+VNG V Q++ + PVKL GKN ISLLS TVGL+NYG F++
Sbjct: 503 TTGHELYAFVNGKLVGQQYSPNENFTFQLKSPVKLHDGKNYISLLSGTVGLRNYGGSFEL 562
Query: 526 VPNGIP-GPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
+P GI GPV L+ +G DLS++ W+YK GL G + +K Y K N R +S
Sbjct: 563 LPAGIVGGPVKLIDSSGSAI---DLSNNSWSYKAGLAG-EYRKIYLDKPGNKWRSHNS-T 617
Query: 585 VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEE-DGCSTE 643
+P+NR TWYKTTF+AP D VV++L G+ KG AWVNG +LGRYWP+Y+A + GC
Sbjct: 618 IPINRPFTWYKTTFQAPAGEDSVVVDLHGLNKGVAWVNGNSLGRYWPSYVAADMPGC--H 675
Query: 644 SCDYRGPYGSD----KCAYNCGNPSQIWYHVPRSWIKDGV-NTLVLFEEFGGNPSQINFQ 698
CDYRG + ++ KC CG PSQ YHVPRS++ G NTL+LFEE GG+PS++ +
Sbjct: 676 HCDYRGVFKAEVEAQKCLTGCGEPSQQLYHVPRSFLHKGEPNTLILFEEAGGDPSEVAVR 735
Query: 699 TVVVGTACGQAHENKTMELTC--HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPL 756
TVV G+ C A T+ L+C HGR IS + ASFG +G CG++ G C++++
Sbjct: 736 TVVEGSVCASAELGDTVTLSCGAHGRTISSVDVASFGVARGRCGSY-DGGCDSKV-AYDA 793
Query: 757 IEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
CVGK+SC++ ++A C +G L V+A C
Sbjct: 794 FAAACVGKESCTVLVTDA-FANAGCVSGV---LTVQATC 828
>gi|326520505|dbj|BAK07511.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 830
Score = 774 bits (1999), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 413/825 (50%), Positives = 534/825 (64%), Gaps = 76/825 (9%)
Query: 28 RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPL 87
V +D RA+ IDGER++L+SGSIHYPRSTP MWPDLI+KAKEGGLDAIETYVFWN HEP
Sbjct: 25 EVGYDDRALVIDGERRLLISGSIHYPRSTPEMWPDLIRKAKEGGLDAIETYVFWNGHEPR 84
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
RRQY+F G+ D++RF K +QD G+Y ILRIGPY+C EWNYGG P WL ++ G++ R N
Sbjct: 85 RRQYNFEGSYDIVRFFKEVQDAGMYAILRIGPYICGEWNYGGLPAWLRDISGMQ-FRMHN 143
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS--DYGDAGKSYINWC 205
F EM+ FTTLIVD K+ K+FA QGGPIIL+QIENEYGN+M + ++ YI+WC
Sbjct: 144 HPFEQEMETFTTLIVDKLKEAKMFAGQGGPIILSQIENEYGNIMGKLNNNESASEYIHWC 203
Query: 206 AKMATSLDIGVPWIMCQESD-APSPMFT-----------PNNPNSPKIWTENWTGWFKSW 253
A MA ++GVPWIMCQ+ D PS + P + PKIWTENWTGWFK+W
Sbjct: 204 AAMANKQNVGVPWIMCQQDDDVPSNVINTWNGFYCHDWFPKRTDIPKIWTENWTGWFKAW 263
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEY 313
D R+AED+AF+VA FFQ G+ QNYYMYHGGTNFGRTSGGPY+TTSYDYDAP+DEY
Sbjct: 264 DKPDFHRSAEDIAFSVAMFFQTRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEY 323
Query: 314 GHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNS---------------------- 351
G++ QPK+GHL++LH +LKSMEK L +G+ +T GN+
Sbjct: 324 GNIRQPKYGHLKDLHNVLKSMEKILLHGDYKDTTMGNTNVTVTKYTLDNSSACFISNKFD 383
Query: 352 --------VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQW 403
+G+++ +PAWSVSILPDCKT +N+AK+ TQT+V VKRP A L W
Sbjct: 384 DKEVNVTLDNGATHTVPAWSVSILPDCKTVAYNSAKIKTQTSVMVKRPG-AETVTDGLAW 442
Query: 404 KWRPEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTL 462
W PE + F+ KG+F N L++Q +T+ D SDYLWY T+ + K G SN L
Sbjct: 443 SWMPENLQPFMTDEKGNFRKNELLEQIATSGDQSDYLWYRTSFEHK-------GESNYKL 495
Query: 463 RINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSK 522
+N++G L+A+VNG V ++ G E PVKL GKN ISLLSAT+GL+NYG+
Sbjct: 496 HVNTTGHELYAFVNGKLVGRHYSPNGGFAFQMETPVKLHSGKNYISLLSATIGLKNYGAL 555
Query: 523 FDMVPNGIP-GPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWS 581
F+M+P GI GPV LV + T DLS+ W+YK GL G + + + AN WS
Sbjct: 556 FEMMPAGIVGGPVKLVDTVTNTTAY-DLSNSSWSYKAGLAG--EYRETHLDKANDRSQWS 612
Query: 582 ---SKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEE- 637
+ +P++R TWYK TFEAP +PVV +L G+GKG WVNG NLGRYWP+Y+A +
Sbjct: 613 GGLNGTIPVHRPFTWYKATFEAPAGEEPVVADLLGLGKGVVWVNGNNLGRYWPSYVAADM 672
Query: 638 DGCSTESCDYRGPYGSD----KCAYNCGNPSQIWYHVPRSWIKDGV-NTLVLFEEFGGNP 692
DGC + CDYRG + ++ KC C PSQ +YHVPRS+IK G NT+VLFEE GG+P
Sbjct: 673 DGC--QRCDYRGTFKAEGDGQKCLTGCNEPSQRFYHVPRSFIKAGEPNTMVLFEEAGGDP 730
Query: 693 SQINFQT-VVVGTACGQAHENKTMELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAE 750
++++F T V A + L C HGR IS + AS G +G CGA+ +G CE++
Sbjct: 731 TRVSFHTVAVGAACAEAAEVGDEVALACSHGRTISSVDVASLGVARGKCGAY-QGGCESK 789
Query: 751 IDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+ CVGK+SC++ +E + C +G L V+A C
Sbjct: 790 AALA-AFTAACVGKESCTVRHTEDFRAGSGCDSGV---LTVQATC 830
>gi|125574401|gb|EAZ15685.1| hypothetical protein OsJ_31098 [Oryza sativa Japonica Group]
Length = 824
Score = 773 bits (1997), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/820 (50%), Positives = 525/820 (64%), Gaps = 75/820 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V+++ R++ IDGER+I++SGSIHYPRSTP MWPDLIKKAKEGGLDAIETYVFWN HEP R
Sbjct: 27 VAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 86
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
RQY+F GN D+IRF K IQ+ GLY ILRIGPY+C EWNYGG P WL ++P ++ R N
Sbjct: 87 RQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQ-FRMHNA 145
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGD--AGKSYINWCA 206
F NEM+NFTTLI++ K +FA QGGPIILAQIENEYGNVM + + YI+WCA
Sbjct: 146 PFENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEYIHWCA 205
Query: 207 KMATSLDIGVPWIMCQE-SDAPSPMFT-----------PNNPNSPKIWTENWTGWFKSWG 254
MA ++GVPWIMCQ+ SD P + PN PKIWTENWTGWFK+W
Sbjct: 206 DMANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWD 265
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
D R+AED+AFAVA FFQ G+ QNYYMYHGGTNFGRTSGGPY+TTSYDYDAP+DEYG
Sbjct: 266 KPDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYG 325
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS--------------------- 353
+L QPK+GHL++LH ++KS+EK L +G + +Y ++V+
Sbjct: 326 NLRQPKYGHLKDLHSVIKSIEKILVHGEYVDANYSDNVTVTKYTLGSTSACFINNRNDNK 385
Query: 354 -------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
G+++ LPAWSVSILPDCKT FN+AK+ QT + VK+ N + L+W W
Sbjct: 386 DLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTTIMVKKANMVEKEPESLKWSWM 445
Query: 407 PEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
E + F+ KG + N L++Q T+ D SDYLWY T+ D K G ++ TL +N
Sbjct: 446 RENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDHK-------GEASYTLFVN 498
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
++G L+A+VNG V + G E VKL GKN ISLLSAT+GL+NYG F+
Sbjct: 499 TTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKNYGPLFEK 558
Query: 526 VPNGIP-GPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
+P GI GPV L+ G DLS+ W+YK GL G ++ W + N
Sbjct: 559 MPAGIVGGPVKLIDNNGTGI---DLSNSSWSYKAGLAG----EYRQIHLDKPGYRWDNNN 611
Query: 585 --VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCST 642
VP+NR TWYKTTF+AP D VV++L G+ KG AWVNG NLGRYWP+Y A E G
Sbjct: 612 GTVPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMG-GC 670
Query: 643 ESCDYRGPYGSD----KCAYNCGNPSQIWYHVPRSWIKDGV-NTLVLFEEFGGNPSQINF 697
CDYRG + ++ KC CG PSQ +YHVPRS++K+G NTL+LFEE GG+PSQ+ F
Sbjct: 671 HHCDYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPSQVIF 730
Query: 698 QTVVVGTACGQAHENKTMELTC--HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLP 755
+VV G+ C A + L+C H + IS I SFG +G CGA+ +G CE++
Sbjct: 731 HSVVAGSVCVSAEVGDAITLSCGQHSKTISTIDVTSFGVARGQCGAY-EGGCESKAAYKA 789
Query: 756 LIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
E C+GK+SC+++ + A + + G L V+A C
Sbjct: 790 FTEA-CLGKESCTVQI----INALTGSGGLSGVLTVQASC 824
>gi|293332691|ref|NP_001168270.1| beta-galactosidase precursor [Zea mays]
gi|223947135|gb|ACN27651.1| unknown [Zea mays]
gi|414880417|tpg|DAA57548.1| TPA: beta-galactosidase [Zea mays]
Length = 822
Score = 773 bits (1996), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/839 (48%), Positives = 539/839 (64%), Gaps = 78/839 (9%)
Query: 12 LLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGG 71
L L L + ++ A V+++ RA+ IDG+R+I+LSGSIHYPRSTP MWPDLI KAKEGG
Sbjct: 6 FLLLALVAVTQVASATTVTYNDRALVIDGQRRIILSGSIHYPRSTPQMWPDLINKAKEGG 65
Query: 72 LDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFP 131
L+ IETYVFWN HEP RRQY+F G+ D+IRF K IQ+ G++ ILRIGPY+C EWNYGG P
Sbjct: 66 LNTIETYVFWNGHEPRRRQYNFEGSYDIIRFFKEIQNAGMHAILRIGPYICGEWNYGGLP 125
Query: 132 VWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVM 191
WL ++PG++ R N F EM+ FTTLIV+ K +FA QGGPIILAQIENEYGN+M
Sbjct: 126 AWLRDIPGMQ-FRLHNAPFEREMETFTTLIVNKMKDVNMFAGQGGPIILAQIENEYGNIM 184
Query: 192 SDYGD--AGKSYINWCAKMATSLDIGVPWIMCQE-SDAPSPMFT-----------PNNPN 237
+ + YI+WCA MA ++GVPWIMCQ+ +D P + PN
Sbjct: 185 GQLKNNQSASQYIHWCADMANKQEVGVPWIMCQQDNDVPHNVINTCNGFYCHDWFPNRTG 244
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
PKIWTENWTGWFK+W D R+AED+AFAVA FFQ G+ NYYMYHGGTNFGRTSGG
Sbjct: 245 IPKIWTENWTGWFKAWDKPDFHRSAEDIAFAVAMFFQKRGSVHNYYMYHGGTNFGRTSGG 304
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS---- 353
PY+TTSYDYDAP+DEYG++ QPK+GHL++LH L++SMEK L +G +T YG +V+
Sbjct: 305 PYITTSYDYDAPLDEYGNIRQPKYGHLKDLHDLIRSMEKILVHGKYNDTSYGKNVTVTKY 364
Query: 354 ------------------------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVK 389
G ++ +PAWSVSILP+CKT +NTAK+ TQT+V VK
Sbjct: 365 MYGGSSVCFINNQFVDRDMKVTLGGETHLVPAWSVSILPNCKTVAYNTAKIKTQTSVMVK 424
Query: 390 RPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLK 448
+ N + ++W W PE + F+ +G F + L++Q +T+ D SDYLWY T+ + K
Sbjct: 425 KANSVEKEPETMRWSWMPENLKPFMTDHRGSFRQSQLLEQIATSTDQSDYLWYRTSLEHK 484
Query: 449 DDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQIS 508
G + TL +N+SG ++A+VNG V + GA + PVKL GKN +S
Sbjct: 485 -------GEGSYTLYVNTSGHEMYAFVNGRLVGQNHSADGAFVFQLQSPVKLHSGKNYVS 537
Query: 509 LLSATVGLQNYGSKFDMVPNGIP-GPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKK 567
LLS TVGL+NYG F++VP GI GPV LVG G DL+ W+YK GL G + ++
Sbjct: 538 LLSGTVGLKNYGPSFELVPAGIAGGPVKLVGTNGTAI---DLTKSSWSYKSGLAG-ELRQ 593
Query: 568 FYNAKAANSERGWSSKN--VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYN 625
+ K W S N +P+NR TWYKTTFEAP + VV++L G+ KG AWVNG +
Sbjct: 594 IHLDKPGYK---WQSHNGTIPVNRPFTWYKTTFEAPAGEEAVVVDLLGLNKGVAWVNGNS 650
Query: 626 LGRYWPTYLAEE-DGCSTESCDYRGPYGSD----KCAYNCGNPSQIWYHVPRSWIKDGV- 679
LGRYWP+Y A E GC CDYRG + ++ +C CG P+Q +YHVPRS+++ G
Sbjct: 651 LGRYWPSYTAAEMPGCHV--CDYRGKFIAEGDGIRCLTGCGEPAQRFYHVPRSFLRAGEP 708
Query: 680 NTLVLFEEFGGNPSQINFQTVVVGTACGQAHE-NKTMELTC--HGRRISEIKYASFGDPQ 736
NTL+LFEE GG+P++ F TV VG C A E + L+C HGR ++ + ASFG +
Sbjct: 709 NTLILFEEAGGDPTRAAFHTVAVGPVCVAAVELGDDVTLSCGGHGRVVASVDVASFGVAR 768
Query: 737 GACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G+CGA+ KG CE++ L CVG++SC+++ + A GA C +G L V+A C
Sbjct: 769 GSCGAY-KGGCESKA-ALKAFTDACVGRESCTVKYTAAFAGA-GCQSGA---LTVQATC 821
>gi|222612650|gb|EEE50782.1| hypothetical protein OsJ_31141 [Oryza sativa Japonica Group]
Length = 828
Score = 773 bits (1995), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 404/819 (49%), Positives = 529/819 (64%), Gaps = 73/819 (8%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+DGR++ +DGER+I++SGSIHYPRSTP MWPDLIKKAKEGGL+AIETYVFWN HEP R
Sbjct: 31 VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
R+++F GN D++RF K IQ+ G+Y ILRIGPY+C EWNYGG PVWL ++PGI + R NK
Sbjct: 91 REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPVWLRDIPGI-KFRLHNK 149
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVM--SDYGDAGKSYINWCA 206
F N M+ FTTLIV K +FA QGGPIILAQIENEYG M + + YI+WCA
Sbjct: 150 PFENGMEAFTTLIVKKMKDANMFAGQGGPIILAQIENEYGYTMLQPENIQSAHEYIHWCA 209
Query: 207 KMATSLDIGVPWIMCQESDAPSPMFT------------PNNPNSPKIWTENWTGWFKSWG 254
MA ++GVPWIMCQ+ + P N + PK+WTENWTGW++ W
Sbjct: 210 DMANKQNVGVPWIMCQQDNDVPPNVVNTCNGFYCHEWFSNRTSIPKMWTENWTGWYRDWD 269
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
+ +R ED+AFAVA FFQ G+ QNYYMYHGGTNFGRT+GGPY+TTSYDYDAP+DEYG
Sbjct: 270 QPEFRRPTEDIAFAVAMFFQMRGSLQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYG 329
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS--------------------- 353
+L QPK+GHL+ELH +L SMEK L +G+ +T+YG++V+
Sbjct: 330 NLRQPKYGHLKELHSVLMSMEKILLHGDYIDTNYGDNVTVTKYTLNATSACFINNRFDDR 389
Query: 354 -------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
G+++ LPAWSVSILP+CKT FN+AK+ TQT V V + + +W W
Sbjct: 390 DVNVTLDGTTHFLPAWSVSILPNCKTVAFNSAKIKTQTTVMVNKTSMVEQQTEHFKWSWM 449
Query: 407 PEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
PE + F+ KG+F N L++Q +T D SDYLWY T+ + K G + L +N
Sbjct: 450 PENLRPFMTDEKGNFRKNELLEQIVTTTDQSDYLWYRTSLEHK-------GEGSYVLYVN 502
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
++G L+A+VNG V Q++ + PVKL GKN ISLLS TVGL+NYG F++
Sbjct: 503 TTGHELYAFVNGKLVGQQYSPNENFTFQLKSPVKLHDGKNYISLLSGTVGLRNYGGSFEL 562
Query: 526 VPNGIP-GPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
+P GI GPV L+ +G DLS++ W+YK GL G + +K Y K N R +S
Sbjct: 563 LPAGIVGGPVKLIDSSGSAI---DLSNNSWSYKAGLAG-EYRKIYLDKPGNKWRSHNS-T 617
Query: 585 VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEE-DGCSTE 643
+P+NR TWYKTTF+AP D VV++L G+ KG AWVNG +LGRYWP+Y+A + GC
Sbjct: 618 IPINRPFTWYKTTFQAPAGEDSVVVDLHGLNKGVAWVNGNSLGRYWPSYVAADMPGC--H 675
Query: 644 SCDYRGPYGSD----KCAYNCGNPSQIWYHVPRSWIKDGV-NTLVLFEEFGGNPSQINFQ 698
CDYRG + ++ KC CG PSQ YHVPRS++ G NTL+LFEE GG+PS++ +
Sbjct: 676 HCDYRGVFKAEVEAQKCLTGCGEPSQQLYHVPRSFLNKGEPNTLILFEEAGGDPSEVAVR 735
Query: 699 TVVVGTACGQAHENKTMELTC--HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPL 756
TVV G+ C A T+ L+C HGR IS + ASFG +G CG++ G CE+++
Sbjct: 736 TVVEGSVCASAEVGDTVTLSCGAHGRTISSVDVASFGVARGRCGSY-DGGCESKV-AYDA 793
Query: 757 IEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
CVGK+SC++ ++A C +G L V+A C
Sbjct: 794 FAAACVGKESCTVLVTDA-FANAGCVSGV---LTVQATC 828
>gi|356539132|ref|XP_003538054.1| PREDICTED: beta-galactosidase 8-like [Glycine max]
Length = 836
Score = 772 bits (1993), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/855 (47%), Positives = 519/855 (60%), Gaps = 89/855 (10%)
Query: 9 RAILLCLILQTLFNL----SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLI 64
R + L+L F + S V++D RA+ IDG+R++L+SGSIHYPRSTP MWPDLI
Sbjct: 2 RTSQILLVLLWFFCIYAPSSFGANVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLI 61
Query: 65 KKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAE 124
+K+K+GGLD IETYVFWN HEP+R QY+F G DL++F+K + GLYV LRIGPY CAE
Sbjct: 62 QKSKDGGLDVIETYVFWNLHEPVRGQYNFEGRGDLVKFVKVVAAAGLYVHLRIGPYACAE 121
Query: 125 WNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIE 184
WNYGGFP+WLH +PGI + RT NK F EM+ FT IVD+ K+E L+ASQGGPIIL+QIE
Sbjct: 122 WNYGGFPLWLHFIPGI-QFRTDNKPFEAEMKQFTAKIVDLMKQENLYASQGGPIILSQIE 180
Query: 185 NEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTP 233
NEYGN+ +DYG A KSYI W A MATSL GVPW+MCQ+ +AP P+ F P
Sbjct: 181 NEYGNIEADYGPAAKSYIKWAASMATSLGTGVPWVMCQQQNAPDPIINACNGFYCDQFKP 240
Query: 234 NNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGR 293
N+ PKIWTE +TGWF ++G P R EDLAFAVARF+Q GGTFQNYYMYHGGTNFGR
Sbjct: 241 NSNTKPKIWTEGYTGWFLAFGDAVPHRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFGR 300
Query: 294 TSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLT-------------- 339
SGGP++ +SYDYDAPIDEYG + QPKWGHL+++HK +K E+ L
Sbjct: 301 ASGGPFVASSYDYDAPIDEYGFIRQPKWGHLKDVHKAIKLCEEALIATDPTITSLGPNIE 360
Query: 340 -------------YGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV 386
N+ +D + +G+SY+LPAWSVSILPDCK NTAK+ + + +
Sbjct: 361 AAVYKTGVVCAAFLANIATSDATVTFNGNSYHLPAWSVSILPDCKNVVLNTAKITSASMI 420
Query: 387 K---VKRPNQAGN-DQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWY 441
+ G+ D + +W W E I + F+ L++Q +T D SDYLWY
Sbjct: 421 SSFTTESLKDVGSLDDSGSRWSWISEPIG---ISKADSFSTFGLLEQINTTADRSDYLWY 477
Query: 442 MTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLT 501
+ DL + L I S G LHA++NG S + +N + P+ L
Sbjct: 478 SLSIDLD-------AGAQTFLHIKSLGHALHAFINGKLAGSGTGNHEKANVEVDIPITLV 530
Query: 502 RGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLY 561
GKN I LLS TVGLQNYG+ FD GI GPV+L + DLSS +WTY+VGL
Sbjct: 531 SGKNTIDLLSLTVGLQNYGAFFDTWGAGITGPVILKCLKNGSNV--DLSSKQWTYQVGLK 588
Query: 562 GLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWV 621
D + S + S +P N+ +TWYKT F AP N+PV ++ GMGKG AWV
Sbjct: 589 NED----LGLSSGCSGQWNSQSTLPTNQPLTWYKTNFVAPSGNNPVAIDFTGMGKGEAWV 644
Query: 622 NGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNT 681
NG ++GRYWPTY + + GC T+SC+YRG Y + KC NCG PSQ YHVPRSW++ NT
Sbjct: 645 NGQSIGRYWPTYASPKGGC-TDSCNYRGAYDASKCLKNCGKPSQTLYHVPRSWLRPDRNT 703
Query: 682 LVLFEEFGGNPSQINFQTVVVGTACGQAHENK---------------------TMELTCH 720
LVLFEE GGNP QI+F T +G+ C E+ ++E
Sbjct: 704 LVLFEESGGNPKQISFATKQIGSVCSHVSESHPPPVDSWNSNTESGRKVVPVVSLECPYP 763
Query: 721 GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATS 780
+ +S IK+ASFG P G CG FK G C + L +++K C+G SC IE S G
Sbjct: 764 NQVVSSIKFASFGTPLGTCGNFKHGLCSSN-KALSIVQKACIGSSSCRIELSVNTFGDP- 821
Query: 781 CAAGTVKRLVVEALC 795
G K L VEA C
Sbjct: 822 -CKGVAKSLAVEASC 835
>gi|357472237|ref|XP_003606403.1| Beta-galactosidase [Medicago truncatula]
gi|355507458|gb|AES88600.1| Beta-galactosidase [Medicago truncatula]
Length = 839
Score = 769 bits (1986), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 406/851 (47%), Positives = 522/851 (61%), Gaps = 84/851 (9%)
Query: 10 AILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
+L L S V++D RA+ IDG+R++L+SGSIHYPRSTP MWPDLI+K+K+
Sbjct: 7 VFVLLWFLGVYVPASFCSNVTYDHRALVIDGKRRVLMSGSIHYPRSTPQMWPDLIQKSKD 66
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
GG+D IETYVFWN HEP+R QY+F G DL+ F+K + GLYV LRIGPYVCAEWNYGG
Sbjct: 67 GGIDVIETYVFWNLHEPVRGQYNFEGRGDLVGFVKAVAAAGLYVHLRIGPYVCAEWNYGG 126
Query: 130 FPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGN 189
FP+WLH + GI+ RT N+ F EM+ FT IVDM K+E L+ASQGGPIIL+QIENEYGN
Sbjct: 127 FPLWLHFIAGIK-FRTNNEPFKAEMKRFTAKIVDMMKQENLYASQGGPIILSQIENEYGN 185
Query: 190 VMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNS 238
+ + A KSYI+W A MATSLD GVPWIMCQ+++AP P+ FTPN+ N
Sbjct: 186 IDTHDARAAKSYIDWAASMATSLDTGVPWIMCQQANAPDPIINTCNSFYCDQFTPNSDNK 245
Query: 239 PKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGP 298
PK+WTENW+GWF ++GG P R EDLAFAVARFFQ GGTFQNYYMYHGGTNFGRT+GGP
Sbjct: 246 PKMWTENWSGWFLAFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFGRTTGGP 305
Query: 299 YLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLT------------------- 339
+++TSYDYDAPIDEYG + QPKWGHL++LHK +K E+ L
Sbjct: 306 FISTSYDYDAPIDEYGDIRQPKWGHLKDLHKAIKLCEEALIASDPTITSPGPNLETAVYK 365
Query: 340 --------YGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK--VK 389
N+ +D + +G+SY+LP WSVSILPDCK NTAKVNT + +
Sbjct: 366 TGAVCSAFLANIGMSDATVTFNGNSYHLPGWSVSILPDCKNVVLNTAKVNTASMISSFAT 425
Query: 390 RPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLK 448
+ D + + F + L++Q +T D SDYLWY + +
Sbjct: 426 ESLKEKVDSLDSSSSGWSWISEPVGISTPDAFTKSGLLEQINTTADRSDYLWYSLSIVYE 485
Query: 449 D---DDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKN 505
D D P+ L I S G LHA+VNG S+ G + + P+ L GKN
Sbjct: 486 DNAGDQPV--------LHIESLGHALHAFVNGKLAGSKAGSSGNAKVNVDIPITLVTGKN 537
Query: 506 QISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDD 565
I LLS TVGLQNYG+ +D V GI GPV+L G ++ DL+S +WTY+VGL G
Sbjct: 538 TIDLLSLTVGLQNYGAFYDTVGAGITGPVILKGLKNGSSV--DLTSQQWTYQVGLQG--- 592
Query: 566 KKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYN 625
+F + N + S N+P N+ +TWYKT F AP ++PV ++ GMGKG AWVNG +
Sbjct: 593 -EFVGLSSGNVGQWNSQSNLPANQPLTWYKTNFVAPSGSNPVAIDFTGMGKGEAWVNGQS 651
Query: 626 LGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLF 685
+GRYWPTY++ GC T+SC+YRG Y + KC NCG PSQ YHVPR+W+K NT VLF
Sbjct: 652 IGRYWPTYISPNSGC-TDSCNYRGTYSASKCLKNCGKPSQTLYHVPRAWLKPDSNTFVLF 710
Query: 686 EEFGGNPSQINFQTVVVGTACGQAHENK---------------------TMELTCHGRRI 724
EE GG+P++I+F T + + C E+ ++E + I
Sbjct: 711 EESGGDPTKISFGTKQIESVCSHVTESHPPPVDTWNSNAESERKVGPVLSLECPYPNQAI 770
Query: 725 SEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAG 784
S IK+ASFG P+G CG + GSC + L +++K C+G SC+I S G + G
Sbjct: 771 SSIKFASFGTPRGTCGNYNHGSCSSN-RALSIVQKACIGSSSCNIGVSINTFG--NPCRG 827
Query: 785 TVKRLVVEALC 795
K L VEA C
Sbjct: 828 VTKSLAVEAAC 838
>gi|357453869|ref|XP_003597215.1| Beta-galactosidase [Medicago truncatula]
gi|355486263|gb|AES67466.1| Beta-galactosidase [Medicago truncatula]
Length = 866
Score = 768 bits (1983), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/878 (47%), Positives = 534/878 (60%), Gaps = 111/878 (12%)
Query: 9 RAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAK 68
RA + L+L V +D RA+ IDG+R++L+SGSIHYPRSTP MWPDLI+K+K
Sbjct: 2 RAFEIVLVLLWFLPKMFCTNVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSK 61
Query: 69 EGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYG 128
+GGLD IETYVFWN HEP++ QYDF G DL++F+K + + GLYV LRIGPYVCAEWNYG
Sbjct: 62 DGGLDVIETYVFWNLHEPVKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCAEWNYG 121
Query: 129 GFPVWLHNMPGIEELRTTNKVFM--NEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENE 186
GFP+WLH +PGI + RT N+ F EM+ FT IVD+ K+EKL+ASQGGPIIL+QIENE
Sbjct: 122 GFPLWLHFIPGI-KFRTDNEPFKVEAEMKRFTAKIVDLMKQEKLYASQGGPIILSQIENE 180
Query: 187 YGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNN 235
YG++ S YG AGKSYINW AKMATSLD GVPW+MCQ+ DAP + FTPN+
Sbjct: 181 YGDIDSAYGSAGKSYINWAAKMATSLDTGVPWVMCQQEDAPDSIINTCNGFYCDQFTPNS 240
Query: 236 PNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYM----------- 284
PK+WTENW+ W+ +GG P R EDLAFAVARFFQ GGTFQNYYM
Sbjct: 241 NTKPKMWTENWSAWYLLFGGGFPHRPVEDLAFAVARFFQRGGTFQNYYMVLQPEMFFTSS 300
Query: 285 ----------YHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSM 334
YHGGTNF R++GGP++ TSYD+DAPIDEYG + QPKWGHL++LHK +K
Sbjct: 301 IYYMVLFLRPYHGGTNFDRSTGGPFIATSYDFDAPIDEYGIIRQPKWGHLKDLHKAVKLC 360
Query: 335 EKTLT---------------------------YGNV-TNTDYGNSVSGSSYNLPAWSVSI 366
E+ L NV T +D + SG+SY+LPAWSVSI
Sbjct: 361 EEALIATEPKITSLGPNLEAAVYKTGSVCAAFLANVDTKSDKTVNFSGNSYHLPAWSVSI 420
Query: 367 LPDCKTEEFNTAKVNTQTNV-----KVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHF 421
LPDCK NTAK+N+ + + K + + + + + +W W E + + F
Sbjct: 421 LPDCKNVVLNTAKINSASAISNFVTKSSKEDISSLETSSSKWSWINEPVG---ISKDDIF 477
Query: 422 ALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYV 480
+ L++Q + T D SDYLWY + DLKDD S L I S G LHA+VNG
Sbjct: 478 SKTGLLEQINITADRSDYLWYSLSVDLKDD-----LGSQTVLHIESLGHALHAFVNGKLA 532
Query: 481 DSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVG-R 539
S + P+K+ G NQI LLS TVGLQNYG+ FD GI GPV L G +
Sbjct: 533 GSHTGNKDKPKLNVDIPIKVIYGNNQIDLLSLTVGLQNYGAFFDRWGAGITGPVTLKGLK 592
Query: 540 AGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNV-PLNRRMTWYKTTF 598
G+ T+ DLSS KWTY+VGL G D ++ S GW+S++ P N+ + WYKT F
Sbjct: 593 NGNNTL--DLSSQKWTYQVGLKGED-----LGLSSGSSEGWNSQSTFPKNQPLIWYKTNF 645
Query: 599 EAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAY 658
+AP ++PV ++ GMGKG AWVNG ++GRYWPTY+A C T+SC+YRGP+ KC
Sbjct: 646 DAPSGSNPVAIDFTGMGKGEAWVNGQSIGRYWPTYVASNADC-TDSCNYRGPFTQTKCHM 704
Query: 659 NCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHENK----- 713
NCG PSQ YHVPRS++K NTLVLFEE GG+P+QI F T + + C ++
Sbjct: 705 NCGKPSQTLYHVPRSFLKPNGNTLVLFEENGGDPTQIAFATKQLESLCAHVSDSHPPQID 764
Query: 714 --------------TMELTC--HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLI 757
+ L C H + I IK+AS+G P G CG F +G C + L ++
Sbjct: 765 LWNQDTTSWGKVGPALLLNCPNHNQVIFSIKFASYGTPLGTCGNFYRGRCSSN-KALSIV 823
Query: 758 EKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+K C+G +SCSI S G G K L VEA C
Sbjct: 824 KKACIGSRSCSIGVSTDTFGDP--CRGVPKSLAVEATC 859
>gi|226503159|ref|NP_001146370.1| uncharacterized protein LOC100279948 precursor [Zea mays]
gi|219886857|gb|ACL53803.1| unknown [Zea mays]
gi|414865885|tpg|DAA44442.1| TPA: beta-galactosidase [Zea mays]
Length = 852
Score = 768 bits (1983), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 411/841 (48%), Positives = 517/841 (61%), Gaps = 87/841 (10%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A V++D RA+ IDG R++L+SGSIHYPRSTP MWP LI+KAK+GGLD IETYVFW+ HE
Sbjct: 27 AANVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKAKDGGLDVIETYVFWDIHE 86
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P+R QYDF G DL F+KT+ D GLYV LRIGPYVCAEWNYGGFP+WLH +PGI + RT
Sbjct: 87 PVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGI-KFRT 145
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
N+ F EMQ FT +VD K L+ASQGGPIIL+QIENEYGN+ S YG GK+Y+ W
Sbjct: 146 DNEPFKAEMQRFTAKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAPGKAYMRWA 205
Query: 206 AKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWG 254
A MA SLD GVPW+MCQ++DAP P+ FTPN+ PK+WTENW+GWF S+G
Sbjct: 206 AGMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGWFLSFG 265
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
G P R EDLAFAVARF+Q GGTFQNYYMYHGGTN R+SGGP++ TSYDYDAPIDEYG
Sbjct: 266 GAVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAPIDEYG 325
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSV---------------------- 352
+ QPKWGHLR++HK +K E L + + T G +V
Sbjct: 326 LVRQPKWGHLRDVHKAIKLCEPALIATDPSYTSLGPNVEAAVYKVGSVCAAFLANIDGQS 385
Query: 353 ------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGN---------- 396
+G Y LPAWSVSILPDCK NTA++N+QT R ++ N
Sbjct: 386 DKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQINSQTTGSEMRYLESSNVASDGSFVTP 445
Query: 397 DQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILS 455
+ A W + E + + L++Q +T D SD+LWY T+ +K D+P L+
Sbjct: 446 ELAVSDWSYAIEPVG---ITKDNALTKAGLMEQINTTADASDFLWYSTSITVKGDEPYLN 502
Query: 456 GSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVG 515
GS + L +NS G VL Y+NG S +S +++P++L GKN+I LLSATVG
Sbjct: 503 GSQS-NLAVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSATVG 561
Query: 516 LQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAAN 575
L NYG+ FD+V GI GPV L G G DLSS +WTY++GL G +D Y+ A+
Sbjct: 562 LSNYGAFFDLVGAGITGPVKLSGLNG----ALDLSSAEWTYQIGLRG-EDLHLYDPSEAS 616
Query: 576 SERGWSSKNV-PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYL 634
E W S N P+N + WYKT F P +DPV ++ GMGKG AWVNG ++GRYWPT L
Sbjct: 617 PE--WVSANAYPINHPLIWYKTKFTPPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNL 674
Query: 635 AEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQ 694
A + GC SC+YRG Y S KC CG PSQ YHVPRS+++ G N LVLFE FGG+PS+
Sbjct: 675 APQSGC-VNSCNYRGAYSSSKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEHFGGDPSK 733
Query: 695 INFQTVVVGTACGQAHE------------------NKTMELTC--HGRRISEIKYASFGD 734
I+F G+ C Q E + L C G+ IS +K+ASFG
Sbjct: 734 ISFVMRQTGSVCAQVSEAHPAQIDSWSSQQPMQRYGPALRLECPKEGQVISSVKFASFGT 793
Query: 735 PQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEAL 794
P G CG++ G C + L ++++ C+G S +N C G K L VEA
Sbjct: 794 PSGTCGSYSHGEC-SSTQALSIVQEACIGVSS-CSVPVSSNYFGNPC-TGVTKSLAVEAA 850
Query: 795 C 795
C
Sbjct: 851 C 851
>gi|242036283|ref|XP_002465536.1| hypothetical protein SORBIDRAFT_01g040750 [Sorghum bicolor]
gi|241919390|gb|EER92534.1| hypothetical protein SORBIDRAFT_01g040750 [Sorghum bicolor]
Length = 860
Score = 766 bits (1977), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/842 (48%), Positives = 525/842 (62%), Gaps = 88/842 (10%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A V++D RA+ IDG R++L+SGSIHYPRSTP MWP +I+KAK+GGLD IETYVFW+ HE
Sbjct: 34 ATNVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGIIQKAKDGGLDVIETYVFWDIHE 93
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P+R QYDF G DL F+KT+ D GLYV LRIGPYVCAEWNYGGFP+WLH +PGI + RT
Sbjct: 94 PVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGI-KFRT 152
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
N+ F EMQ FT +VD K L+ASQGGPIIL+QIENEYGN+ S YG AGK+Y+ W
Sbjct: 153 DNEPFKTEMQRFTAKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWA 212
Query: 206 AKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWG 254
A MA SLD GVPW+MCQ++DAP P+ FTPN+ PK+WTENW+GWF S+G
Sbjct: 213 AGMAISLDTGVPWVMCQQTDAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGWFLSFG 272
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
G P R EDLAFAVARF+Q GGTFQNYYMYHGGTN R+SGGP++ TSYDYDAPIDEYG
Sbjct: 273 GAVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAPIDEYG 332
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN------------------------ 350
+ +PKWGHLR++HK +K E L + + T G
Sbjct: 333 LVREPKWGHLRDVHKAIKLCEPALIATDPSYTSLGQNAEAAVYKTGSVCAAFLANIDGQS 392
Query: 351 ----SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGN---------- 396
+ +G Y LPAWSVSILPDCK NTA++N+Q R ++ N
Sbjct: 393 DKTVTFNGRMYRLPAWSVSILPDCKNVVLNTAQINSQVTSSEMRYLESSNMASDGSFITP 452
Query: 397 DQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILS 455
+ A W + E + + L++Q +T D SD+LWY T+ +K D+P L+
Sbjct: 453 ELAVSGWSYAIEPVG---ITKDNALTKAGLMEQINTTADASDFLWYSTSITVKGDEPYLN 509
Query: 456 GSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVG 515
GS + L +NS G VL Y+NG S +S +++P++L GKN+I LLSATVG
Sbjct: 510 GSQS-NLVVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSATVG 568
Query: 516 LQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAAN 575
L NYG+ FD+V GI GPV L G G DLSS +WTY++GL G +D Y+ A+
Sbjct: 569 LSNYGAFFDLVGAGITGPVKLSGTNG----ALDLSSAEWTYQIGLRG-EDLHLYDPSEAS 623
Query: 576 SERGWSSKNV-PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYL 634
E W S N P+N+ + WYKT F P +DPV ++ GMGKG AWVNG ++GRYWPT L
Sbjct: 624 PE--WVSANAYPINQPLIWYKTKFTPPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNL 681
Query: 635 AEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQ 694
A + GC SC+YRG Y S+KC CG PSQ YHVPRS+++ G N +VLFE+FGG+PS+
Sbjct: 682 APQSGC-VNSCNYRGSYNSNKCLKKCGQPSQTLYHVPRSFLQPGSNDIVLFEQFGGDPSK 740
Query: 695 INFQTVVVGTACGQAHE------------NKTME-------LTC--HGRRISEIKYASFG 733
I+F G+ C Q E +TM+ L C G+ IS IK+ASFG
Sbjct: 741 ISFVIRQTGSVCAQVSEEHPAQIDSWNSSQQTMQRYGPELRLECPKDGQVISSIKFASFG 800
Query: 734 DPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEA 793
P G CG++ G C + L ++++ C+G SCS+ S G + G K L VEA
Sbjct: 801 TPSGTCGSYSHGEC-SSTQALSVVQEACIGVSSCSVPVSSNYFG--NPCTGVTKSLAVEA 857
Query: 794 LC 795
C
Sbjct: 858 AC 859
>gi|242057631|ref|XP_002457961.1| hypothetical protein SORBIDRAFT_03g023500 [Sorghum bicolor]
gi|241929936|gb|EES03081.1| hypothetical protein SORBIDRAFT_03g023500 [Sorghum bicolor]
Length = 830
Score = 759 bits (1959), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 406/823 (49%), Positives = 525/823 (63%), Gaps = 80/823 (9%)
Query: 31 HDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQ 90
++ RA+ IDG+R+I+LSGSIHYPRSTP MWPDLI KAKEGGL+ IETYVFWN HEP RRQ
Sbjct: 30 YNDRAVVIDGQRRIILSGSIHYPRSTPQMWPDLINKAKEGGLNTIETYVFWNGHEPRRRQ 89
Query: 91 YDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVF 150
Y+F GN D++RF K IQ+ G++ ILRIGPY+C EWNYGG P WL ++PG++ R N F
Sbjct: 90 YNFEGNYDIVRFFKEIQNAGMHAILRIGPYICGEWNYGGLPAWLRDIPGMQ-FRLHNDPF 148
Query: 151 MNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS--DYGDAGKSYINWCAKM 208
EM+ FTTLIV+ K +FA QGGPIILAQIENEYGN+M + + YI+WCA M
Sbjct: 149 EREMETFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGKLENNQSASQYIHWCADM 208
Query: 209 ATSLDIGVPWIMCQE-SDAPSPMFT-----------PNNPNSPKIWTENWTGWFKSWGGK 256
A IGVPWIMCQ+ +D P + PN PKIWTENWTGWFK+W
Sbjct: 209 ANKQKIGVPWIMCQQDNDVPHNVINTCNGFYCYDWFPNRTGIPKIWTENWTGWFKAWDKP 268
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHL 316
D R+AED+AFAVA FFQ G+ NYYMYHGGTNFGRTSGGPY+TTSYDYDAP+DEYG++
Sbjct: 269 DFHRSAEDIAFAVAMFFQKRGSVHNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNI 328
Query: 317 NQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYN------------------ 358
QPK+GHL++LH LLKSMEK L +G +T +G +V+ + Y
Sbjct: 329 RQPKYGHLKDLHNLLKSMEKILVHGEYKDTSHGKNVTVTKYTYGGSSVCFISNQFDDRDV 388
Query: 359 ---------LPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEM 409
+PAWSVSILPDCKT +NTAK+ TQT+V VK+ N + L+W W PE
Sbjct: 389 NVTLAGTHLVPAWSVSILPDCKTVAYNTAKIKTQTSVMVKKANSVEKEPEALRWSWMPEN 448
Query: 410 INDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSG 468
+ F+ G F + L++Q +T+ D SDYLWY T+ + K G + TL +N++G
Sbjct: 449 LKPFMTDDHGSFRQSRLLEQIATSTDQSDYLWYRTSLEHK-------GEGSYTLYVNTTG 501
Query: 469 QVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPN 528
++A+VNG V + GA + PVKL GKN +SLLS TVGL+NYG F++VP
Sbjct: 502 HKIYAFVNGKLVGQNQSSNGAFVFQLQSPVKLHSGKNYVSLLSGTVGLKNYGPLFELVPA 561
Query: 529 GIP-GPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLD-----DKKFYNAKAANSERGWSS 582
GI GPV LVG A D I DL+ W+YK GL G DK Y ++ N S
Sbjct: 562 GIAGGPVKLVG-ANDTAI--DLTHSSWSYKSGLAGEHRQIHLDKPGYKWRSHN-----GS 613
Query: 583 KNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCST 642
++P+NR TWYKTTF AP ++ VV++L G+ KG AWVNG +LGRYWP+Y A E G
Sbjct: 614 GSIPVNRPFTWYKTTFAAPAGDEAVVVDLLGLNKGAAWVNGNSLGRYWPSYTAAEMGGCH 673
Query: 643 ESCDYRGPYGSD----KCAYNCGNPSQIWYHVPRSWIKDGV-NTLVLFEEFGGNPSQINF 697
+CDYRG + ++ +C CG PSQ +YHVPRS+++ G NTLVLFEE GG+P++ F
Sbjct: 674 GACDYRGKFKAEGDGIRCLTGCGEPSQRFYHVPRSFLRAGEPNTLVLFEEAGGDPARAAF 733
Query: 698 QTVVVGTACGQAHE-NKTMELTCHGRR----ISEIKYASFGDPQGACGAFKKGSCEAEID 752
TV VG C A E + L+C G ++ + ASFG +G CG + +G CE++
Sbjct: 734 HTVAVGHVCVAAAEVGDDVTLSCGGGLGGGVVASVDVASFGVTRGGCGDY-QGGCESKA- 791
Query: 753 VLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
L CVG++SC+++ + A G C +G +L V+A C
Sbjct: 792 ALKAFRDACVGRESCTVKYTPAFAGP-GCQSG---KLTVQATC 830
>gi|357142911|ref|XP_003572734.1| PREDICTED: beta-galactosidase 1-like [Brachypodium distachyon]
Length = 831
Score = 757 bits (1954), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 408/823 (49%), Positives = 516/823 (62%), Gaps = 79/823 (9%)
Query: 28 RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPL 87
VS+D RA+ IDG+R+I+LSGSIHYPRSTP MWPDLI+KAK+GGL+ IETYVFWN HEP
Sbjct: 32 EVSYDERALVIDGQRRIILSGSIHYPRSTPEMWPDLIQKAKDGGLNTIETYVFWNGHEPR 91
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
RQY+F GN D++RF K +Q G+Y ILRIGPY+C EWNYGG P WL ++P ++ R N
Sbjct: 92 PRQYNFEGNYDIMRFFKEVQKAGMYAILRIGPYICGEWNYGGLPAWLRDIPDMQ-FRLHN 150
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGD--AGKSYINWC 205
+ F EM+ FTTLIV+ K +FA QGGPIIL QIENEYGNV S+ D + YI+WC
Sbjct: 151 EPFEREMETFTTLIVNKMKDANMFAGQGGPIILTQIENEYGNVQSNLPDQESATKYIHWC 210
Query: 206 AKMATSLDIGVPWIMCQESDAPSPM------------FTPNNPNSPKIWTENWTGWFKSW 253
A MA ++GVPWIMCQ+S+ P F P N PKIWTENWTGWFK+W
Sbjct: 211 ADMANKQNVGVPWIMCQQSNDVPPNVIETCNGFYCHDFKPKGSNMPKIWTENWTGWFKAW 270
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEY 313
D R AED+A+AVA FFQ G+ QNYYMYHGGTNFGRTSGGPY+TT+YDYDAP+DEY
Sbjct: 271 DKPDYHRPAEDVAYAVAMFFQNRGSVQNYYMYHGGTNFGRTSGGPYITTTYDYDAPLDEY 330
Query: 314 GHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSV--------------------- 352
G++ QPK+GHL+ LH +L SMEK L YG T+ + V
Sbjct: 331 GNIRQPKYGHLKALHTVLTSMEKHLVYGQQNETNLDDKVKATKYTLDDGSSACFISNSHD 390
Query: 353 --------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWK 404
GS+Y +PAWSVS+LPDCKT +NTAKV TQT+V VK+ + A + L+W
Sbjct: 391 NKDVNVTFEGSAYQVPAWSVSVLPDCKTVAYNTAKVKTQTSVMVKKESAA---KGGLKWS 447
Query: 405 WRPEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTLR 463
W PE + G F N L++Q T D SDYLWY T+ + TL
Sbjct: 448 WLPEFLRPSFTDSYGSFKSNELLEQIVTGADESDYLWYKTSLTRGPKE-------QFTLY 500
Query: 464 INSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF 523
+N++G L+A+VNG + G FE PV L GKN ISLLSATVGL+NYG+ F
Sbjct: 501 VNTTGHELYAFVNGELAGYKHAVNGPYLFQFEAPVTLKPGKNYISLLSATVGLKNYGASF 560
Query: 524 DMVPNGIPG-PVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSS 582
+++P GI G PV LV G+ DLS++ WTYK GL+G + K+ + K WS
Sbjct: 561 ELMPAGIVGGPVKLVSAHGNTI---DLSNNTWTYKTGLFG-EQKQIHLDKPGLR---WSP 613
Query: 583 KNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLA-EEDGCS 641
VP NR TWYK TF+AP + VV++L G+ KG +VNG+NLGRYWP+Y+A + DGC
Sbjct: 614 FAVPTNRPFTWYKATFQAPAGTEAVVVDLVGLNKGVVYVNGHNLGRYWPSYVAGDMDGC- 672
Query: 642 TESCDYRGPY----GSDKCAYNCGNPSQIWYHVPRSWI---KDGVNTLVLFEEFGGNPSQ 694
CDYRG Y +KC CG Q +YHVPRS++ NT+VLFEE GG+P++
Sbjct: 673 -HRCDYRGEYVTWNNQEKCLTGCGEVGQRFYHVPRSFLNAAHGAPNTVVLFEEAGGDPAK 731
Query: 695 INFQTVVVGTACGQAHENKTMELTC-HGRRISEIKYASFGDPQGACGAFKKGS-CEAEID 752
+NF+TV VG C A + + L C HGR IS + ASFG G CGA++ GS CE++
Sbjct: 732 VNFRTVAVGPVCADAEKGDAVTLACAHGRTISSVDTASFGVSGGQCGAYEGGSGCESK-P 790
Query: 753 VLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
L I CVGKK C++ ++A + C V L V+A C
Sbjct: 791 ALEAITAACVGKKWCTVSYTDA-FDSADCKGSGV--LTVQATC 830
>gi|255550371|ref|XP_002516236.1| beta-galactosidase, putative [Ricinus communis]
gi|223544722|gb|EEF46238.1| beta-galactosidase, putative [Ricinus communis]
Length = 775
Score = 753 bits (1943), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 415/820 (50%), Positives = 527/820 (64%), Gaps = 107/820 (13%)
Query: 15 LILQTLFNLSL--AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGL 72
+++ TL LSL A V +D A+ I+GERKI+ SG+IHYPRSTP MWP+LI KAK+GGL
Sbjct: 9 VLISTLALLSLCSATTVEYDSNALIINGERKIIFSGAIHYPRSTPEMWPELINKAKDGGL 68
Query: 73 DAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPV 132
DAIETYVFW+ HEP+RRQYDF+GNLD+++F + IQ+ GLYVILRIGPYVCAEWNYGGFP+
Sbjct: 69 DAIETYVFWDRHEPVRRQYDFSGNLDIVKFFRVIQEAGLYVILRIGPYVCAEWNYGGFPM 128
Query: 133 WLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS 192
WLHN PG+E LRT N+++ + F F S I+
Sbjct: 129 WLHNTPGVE-LRTDNEIYKVPLLIF-------------FVSNNVRIV------------- 161
Query: 193 DYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFTPNNPNSPKIWTENWTGWFKS 252
S IN C C F PNNP SPK++TENW+GW+K
Sbjct: 162 -------SQINTCNGY-----------YCD-------TFKPNNPKSPKMFTENWSGWYKL 196
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDE 312
WGGK RTAED+AF+VARF Q GG F NYYMY+GGTNFGRT+GGPY+T SYDYD+P+DE
Sbjct: 197 WGGKTSYRTAEDMAFSVARFVQAGGVFNNYYMYYGGTNFGRTAGGPYITASYDYDSPLDE 256
Query: 313 YGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYN-------------- 358
YG+LNQPKWGHL++LH +K EK +T G VT ++ V ++Y
Sbjct: 257 YGNLNQPKWGHLKQLHASIKLGEKIITNGTVTIKNFQAGVDLTAYTNNATRERFCFLSNI 316
Query: 359 ----------------LPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAP-L 401
+PAWSVSIL +C E FNTAKVNTQT++ VK+ + ND+ L
Sbjct: 317 NIADAHIDLQQDGNYTIPAWSVSILQNCSKEIFNTAKVNTQTSLMVKKLYE--NDKPTNL 374
Query: 402 QWKWRPEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNM 460
W W PE + D ++ GKG F + L+DQK T D SDYLWYMT+ D+ + + N+
Sbjct: 375 SWVWAPEPMKDTLL-GKGRFRTSQLLDQKETTVDASDYLWYMTSFDMNKNTLQWT---NV 430
Query: 461 TLRINSSGQVLHAYVNGNY-VDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNY 519
TLR+ S G VLHAYVN V SQ G FE+PV L G N ISLLSATVGL NY
Sbjct: 431 TLRVTSRGHVLHAYVNKKLIVGSQLVIQGEFT--FEKPVTLKPGNNVISLLSATVGLANY 488
Query: 520 GSKFDMVPNGI-PGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSER 578
GS FD P GI GPV L+ + DLSS+ W+YK+GL G + K+FY+ + +++
Sbjct: 489 GSFFDKTPVGIVDGPVQLMANGKP---VMDLSSNLWSYKIGLNG-EAKRFYDPTSRHNK- 543
Query: 579 GWSSKN-VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEE 637
WS+ N V R MTWYKTTF +P DPVV++LQGMGKG AW NG +LGRYWP+ +A
Sbjct: 544 -WSAANGVSTARPMTWYKTTFSSPSGTDPVVVDLQGMGKGHAWANGKSLGRYWPSQIANA 602
Query: 638 DGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWI-KDGVNTLVLFEEFGGNPSQIN 696
+GCS +CDYRGPY + KC NCG P+Q WYHVPRS++ +G NTL+LFEE GG+PS I+
Sbjct: 603 NGCSG-TCDYRGPYNAGKCTRNCGIPTQRWYHVPRSFLNSNGKNTLILFEEVGGDPSGIS 661
Query: 697 FQTVVVGTACGQAHENKTMELTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLP 755
FQ V T CG A+E T+EL+C GR ISEI++AS+G+PQG C +FKKGS +A ++ +
Sbjct: 662 FQIVTTETICGNAYEGSTLELSCQGGRTISEIQFASYGNPQGTCSSFKKGSFDA-MNSVQ 720
Query: 756 LIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+++K+CVGK SCSI AS+ + KRL V+A C
Sbjct: 721 MVQKECVGKDSCSIIASDETFMVNEPQGISNKRLAVQAHC 760
>gi|218188392|gb|EEC70819.1| hypothetical protein OsI_02284 [Oryza sativa Indica Group]
Length = 837
Score = 748 bits (1930), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/745 (52%), Positives = 490/745 (65%), Gaps = 69/745 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D R++ IDG+R+I+LSGSIHYPRSTP MWPDLIKKAKEGGLDAIETY+FWN HEP R
Sbjct: 31 VSYDDRSLVIDGQRRIILSGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYIFWNGHEPHR 90
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
RQY+F GN D++RF K IQ+ G+Y ILRIGPY+C EWNYGG P WL ++PG++ R N+
Sbjct: 91 RQYNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQ-FRLHNE 149
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGD--AGKSYINWCA 206
F NEM+ FTTLIV+ K K+FA QGGPIILAQIENEYGN+M + + YI+WCA
Sbjct: 150 PFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIHWCA 209
Query: 207 KMATSLDIGVPWIMCQESD-APSPMFT-----------PNNPNSPKIWTENWTGWFKSWG 254
MA ++GVPWIMCQ+ D P + PN PKIWTENWTGWFK+W
Sbjct: 210 DMANKQNVGVPWIMCQQDDDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWD 269
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
D R+AED+AFAVA FFQ G+ QNYYMYHGGTNFGRTSGGPY+TTSYDYDAP+DEYG
Sbjct: 270 KPDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYG 329
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS--------------------- 353
+L QPK+GHL+ELH +LKSMEKTL +G +T+YG++++
Sbjct: 330 NLRQPKYGHLKELHSVLKSMEKTLVHGEYFDTNYGDNITVTKYTLDSSSACFINNRFDDK 389
Query: 354 -------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
G+++ LPAWSVSILPDCKT FN+AK+ TQT+V VK+PN A +Q L+W W
Sbjct: 390 DVNVTLDGATHLLPAWSVSILPDCKTVAFNSAKIKTQTSVMVKKPNTAEQEQESLKWSWM 449
Query: 407 PEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
PE ++ F+ KG+F N L++Q T+ D SDYLWY T+ + K G + L +N
Sbjct: 450 PENLSPFMTDEKGNFRKNELLEQIVTSTDQSDYLWYRTSLNHK-------GEGSYKLYVN 502
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
++G L+A+VNG + + G E PVKL GKN ISLLSATVGL+NYG F+
Sbjct: 503 TTGHELYAFVNGKLIGKNHSADGDFVFQLESPVKLHDGKNYISLLSATVGLKNYGPSFEK 562
Query: 526 VPNGIP-GPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
+P GI GPV L+ G DLS+ W+YK GL ++ W+ N
Sbjct: 563 MPTGIVGGPVKLIDSNGTAI---DLSNSSWSYKAGL----ASEYRQIHLDKPGYKWNGNN 615
Query: 585 --VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEE-DGCS 641
+P+NR TWYK TFEAP D VV++L G+ KG AWVNG NLGRYWP+Y A E GC
Sbjct: 616 GTIPINRPFTWYKATFEAPSGEDAVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMAGC- 674
Query: 642 TESCDYRGPYGSD----KCAYNCGNPSQIWYHVPRSWIKDG-VNTLVLFEEFGGNPSQIN 696
CDYRG + ++ +C CG PSQ +YHVPRS++ G NTL+LFEE GG+PS +
Sbjct: 675 -HRCDYRGAFQAEGDGTRCLTGCGEPSQRYYHVPRSFLAAGEPNTLLLFEEAGGDPSGVA 733
Query: 697 FQTVVVGTACGQAHENKTMELTCHG 721
+TVV G C + L+C G
Sbjct: 734 LRTVVPGPVCTSGEAGDAVTLSCGG 758
>gi|255560830|ref|XP_002521428.1| beta-galactosidase, putative [Ricinus communis]
gi|223539327|gb|EEF40918.1| beta-galactosidase, putative [Ricinus communis]
Length = 841
Score = 745 bits (1923), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 408/858 (47%), Positives = 519/858 (60%), Gaps = 96/858 (11%)
Query: 10 AILLCLILQTLFNLSLAYR-----VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLI 64
+++L L+ ++F S R VS+D RA+ IDG+R++L SGSIHYPR+TP +WPD+I
Sbjct: 6 SLVLILLFVSIFACSYLERGWSGKVSYDHRALVIDGKRRVLQSGSIHYPRTTPEVWPDII 65
Query: 65 KKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAE 124
+K+KEGGLD IETYVFWN HEP++ QY F G DL+RF+KTIQ+ GL V LRIGPY CAE
Sbjct: 66 RKSKEGGLDVIETYVFWNYHEPVKGQYYFEGRFDLVRFVKTIQEAGLLVHLRIGPYACAE 125
Query: 125 WNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIE 184
WNYGGFP+WLH +PGI+ RTTN++F EM+ F T IV+M K+E LFASQGGPIILAQ+E
Sbjct: 126 WNYGGFPLWLHFIPGIQ-FRTTNELFKEEMKLFLTKIVNMMKEENLFASQGGPIILAQVE 184
Query: 185 NEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTP 233
NEYGNV YG AG+ Y+ W A+ A SL+ VPW+MC + DAP P+ F+P
Sbjct: 185 NEYGNVEWAYGAAGELYVKWAAETAVSLNTSVPWVMCAQVDAPDPIINTCNGFYCDRFSP 244
Query: 234 NNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGR 293
N+P+ PK+WTEN++GWF S+G P R EDLAFAVARFF+ GGTFQNYYMY GGTNFGR
Sbjct: 245 NSPSKPKMWTENYSGWFLSFGYAIPYRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR 304
Query: 294 TSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSV- 352
T+GGP + TSYDYDAPIDEYG + QPKWGHLR+LHK +K E+ L + + GN++
Sbjct: 305 TAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRDLHKAIKQCEEHLISSDPIHQQLGNNLE 364
Query: 353 ----------------------------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQT 384
+G+ Y LPAWSVSILPDCK FNTAKV
Sbjct: 365 AHIYYKSSNDCAAFLANYDSSSDANVTFNGNIYFLPAWSVSILPDCKNVIFNTAKVLILN 424
Query: 385 NVKVKRPNQAGNDQAPLQ---WKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLW 440
+ ++ PL+ W W E + + G F L++Q +T D+SD+LW
Sbjct: 425 LGDDFFAHSTSVNEIPLEQIVWSWYKEEVG---IWGNNSFTAPGLLEQINTTKDISDFLW 481
Query: 441 YMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKL 500
Y T+ + D ++ L I S G +VN V AS L E+ + L
Sbjct: 482 YSTSISVNADQ-----VKDIILNIESLGHAALVFVNKVLVGKYGNHDDASFSLTEK-ISL 535
Query: 501 TRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGL 560
G N + LLS +G+QNYG FD+ GI VLLVG++ + DLSS KWTY+VGL
Sbjct: 536 IEGNNTLDLLSMMIGVQNYGPWFDVQGAGIYA-VLLVGQS---KVKIDLSSEKWTYQVGL 591
Query: 561 ----YGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGK 616
+GLD ANS + P+N+ + WYK TF AP P+ LNL GMGK
Sbjct: 592 EGEYFGLD-----KVSLANSSLWTQGASPPINKSLIWYKGTFVAPEGKGPLALNLAGMGK 646
Query: 617 GFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIK 676
G AWVNG ++GRYWP YL+ GC+ +SCDYRG Y S KC CG P+Q YH+PR+W+
Sbjct: 647 GQAWVNGQSIGRYWPAYLSPSTGCN-DSCDYRGAYDSFKCLKKCGQPAQTLYHIPRTWVH 705
Query: 677 DGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHE------------------NKTMELT 718
G N LVL EE GG+PS+I+ T C E N + LT
Sbjct: 706 PGENLLVLHEELGGDPSKISVLTRTGHEICSIVSEDDPPPADSWKSSSEFKSQNPEVRLT 765
Query: 719 C-HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLG 777
C G I I +ASFG P G CG F GSC A D+L +++K C+G++ CSI S ANLG
Sbjct: 766 CEQGWHIKSINFASFGTPAGICGTFNPGSCHA--DMLDIVQKACIGQEGCSISISAANLG 823
Query: 778 ATSCAAGTVKRLVVEALC 795
G +KR VEA C
Sbjct: 824 DP--CPGVLKRFAVEARC 839
>gi|108707233|gb|ABF95028.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 796
Score = 736 bits (1901), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/811 (48%), Positives = 496/811 (61%), Gaps = 90/811 (11%)
Query: 59 MWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIG 118
MWP LI+K+K+GGLD IETYVFW+ HE +R QYDF G DL+RF+K + D GLYV LRIG
Sbjct: 1 MWPGLIQKSKDGGLDVIETYVFWDIHEAVRGQYDFEGRKDLVRFVKAVADAGLYVHLRIG 60
Query: 119 PYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPI 178
PYVCAEWNYGGFPVWLH +PGI+ RT N+ F EMQ FT +VD K L+ASQGGPI
Sbjct: 61 PYVCAEWNYGGFPVWLHFVPGIK-FRTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPI 119
Query: 179 ILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-------- 230
IL+QIENEYGN+ S YG AGK+Y+ W A MA SLD GVPW+MCQ+SDAP P+
Sbjct: 120 ILSQIENEYGNIDSAYGAAGKAYMRWAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFY 179
Query: 231 ---FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHG 287
FTPN+ + PK+WTENW+GWF S+GG P R AEDLAFAVARF+Q GGTFQNYYMYHG
Sbjct: 180 CDQFTPNSKSKPKMWTENWSGWFLSFGGAVPYRPAEDLAFAVARFYQRGGTFQNYYMYHG 239
Query: 288 GTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTD 347
GTNFGR++GGP++ TSYDYDAPIDEYG + QPKWGHLR++HK +K E L + +
Sbjct: 240 GTNFGRSTGGPFIATSYDYDAPIDEYGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYSS 299
Query: 348 YG------------NSV------------------SGSSYNLPAWSVSILPDCKTEEFNT 377
G NS+ +G++Y LPAWSVSILPDCK NT
Sbjct: 300 LGQNTEATVYQTADNSICAAFLANVDAQSDKTVKFNGNTYKLPAWSVSILPDCKNVVLNT 359
Query: 378 AKVNTQTNVKVKRP----NQAGNDQ------APLQWKWRPEMINDFVVRGKGHFALNTLI 427
A++N+Q R Q +D A W + E + + + L+
Sbjct: 360 AQINSQVTTSEMRSLGSSIQDTDDSLITPELATAGWSYAIEPVG---ITKENALTKPGLM 416
Query: 428 DQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTK 486
+Q +T D SD+LWY T+ +K D+P L+GS + L +NS G VL Y+NG S
Sbjct: 417 EQINTTADASDFLWYSTSIVVKGDEPYLNGSQS-NLLVNSLGHVLQIYINGKLAGSAKGS 475
Query: 487 YGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETII 546
+S + PV L GKN+I LLS TVGL NYG+ FD+V G+ GPV L G G
Sbjct: 476 ASSSLISLQTPVTLVPGKNKIDLLSTTVGLSNYGAFFDLVGAGVTGPVKLSGPNG----A 531
Query: 547 KDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNV-PLNRRMTWYKTTFEAPLEND 605
+LSS WTY++GL G +D YN A+ E W S N P N+ + WYKT F AP +D
Sbjct: 532 LNLSSTDWTYQIGLRG-EDLHLYNPSEASPE--WVSDNAYPTNQPLIWYKTKFTAPAGDD 588
Query: 606 PVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQ 665
PV ++ GMGKG AWVNG ++GRYWPT LA + GC SC+YRG Y S+KC CG PSQ
Sbjct: 589 PVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSGC-VNSCNYRGAYSSNKCLKKCGQPSQ 647
Query: 666 IWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHE-------------- 711
YHVPRS+++ G N LVLFE+FGG+PS I+F T + C E
Sbjct: 648 TLYHVPRSFLQPGSNDLVLFEQFGGDPSMISFTTRQTSSICAHVSEMHPAQIDSWISPQQ 707
Query: 712 -----NKTMELTC--HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGK 764
+ L C G+ IS IK+ASFG P G CG + G C + L ++++ CVG
Sbjct: 708 TSQTQGPALRLECPREGQVISNIKFASFGTPSGTCGNYNHGECSSS-QALAVVQEACVGM 766
Query: 765 KSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+CS+ S N G +G K LVVEA C
Sbjct: 767 TNCSVPVSSNNFGDP--CSGVTKSLVVEAAC 795
>gi|75141878|sp|Q7XFK2.1|BGL14_ORYSJ RecName: Full=Beta-galactosidase 14; Short=Lactase 14; Flags:
Precursor
gi|15451595|gb|AAK98719.1|AC090483_9 Putative beta-galactosidase [Oryza sativa Japonica Group]
gi|31431327|gb|AAP53122.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 808
Score = 735 bits (1898), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/819 (47%), Positives = 516/819 (63%), Gaps = 93/819 (11%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+DGR++ +DGER+I++SGSIHYPRSTP MWPDLIKKAKEGGL+AIETYVFWN HEP R
Sbjct: 31 VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
R+++F GN D++RF K IQ+ G+Y ILRIGPY+C EWNYGG PVWL ++PGI + R NK
Sbjct: 91 REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPVWLRDIPGI-KFRLHNK 149
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVM--SDYGDAGKSYINWCA 206
F N M+ FTTLIV K +FA QGGPIILAQIENEYG M + + YI+WCA
Sbjct: 150 PFENGMEAFTTLIVKKMKDANMFAGQGGPIILAQIENEYGYTMLQPENIQSAHEYIHWCA 209
Query: 207 KMATSLDIGVPWIMCQESDAPSPMFT------------PNNPNSPKIWTENWTGWFKSWG 254
MA ++GVPWIMCQ+ + P N + PK+WTENWTGW++ W
Sbjct: 210 DMANKQNVGVPWIMCQQDNDVPPNVVNTCNGFYCHEWFSNRTSIPKMWTENWTGWYRDWD 269
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
+ +R ED+AFAVA FFQ G+ QNYYMYHGGTNFGRT+GGPY+TTSYDYDAP+DEYG
Sbjct: 270 QPEFRRPTEDIAFAVAMFFQMRGSLQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYG 329
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS--------------------- 353
+L QPK+GHL+ELH +L SMEK L +G+ +T+YG++V+
Sbjct: 330 NLRQPKYGHLKELHSVLMSMEKILLHGDYIDTNYGDNVTVTKYTLNATSACFINNRFDDR 389
Query: 354 -------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
G+++ LPAWSVSILP+CKT FN+AK+ TQT V V + + +W W
Sbjct: 390 DVNVTLDGTTHFLPAWSVSILPNCKTVAFNSAKIKTQTTVMVNKTSMVEQQTEHFKWSWM 449
Query: 407 PEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
PE + F+ KG+F N L++Q +T D SDYLWY T+ + K G + L +N
Sbjct: 450 PENLRPFMTDEKGNFRKNELLEQIVTTTDQSDYLWYRTSLEHK-------GEGSYVLYVN 502
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
++G L+A+VNG V Q++ N+ F +K NYG F++
Sbjct: 503 TTGHELYAFVNGKLVGQQYSP----NENFTFQLKSP----------------NYGGSFEL 542
Query: 526 VPNGIP-GPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
+P GI GPV L+ +G DLS++ W+YK GL G + +K Y K N R +S
Sbjct: 543 LPAGIVGGPVKLIDSSGSAI---DLSNNSWSYKAGLAG-EYRKIYLDKPGNKWRSHNS-T 597
Query: 585 VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEE-DGCSTE 643
+P+NR TWYKTTF+AP D VV++L G+ KG AWVNG +LGRYWP+Y+A + GC
Sbjct: 598 IPINRPFTWYKTTFQAPAGEDSVVVDLHGLNKGVAWVNGNSLGRYWPSYVAADMPGC--H 655
Query: 644 SCDYRGPYGSD----KCAYNCGNPSQIWYHVPRSWIKDGV-NTLVLFEEFGGNPSQINFQ 698
CDYRG + ++ KC CG PSQ YHVPRS++ G NTL+LFEE GG+PS++ +
Sbjct: 656 HCDYRGVFKAEVEAQKCLTGCGEPSQQLYHVPRSFLNKGEPNTLILFEEAGGDPSEVAVR 715
Query: 699 TVVVGTACGQAHENKTMELTC--HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPL 756
TVV G+ C A T+ L+C HGR IS + ASFG +G CG++ G CE+++
Sbjct: 716 TVVEGSVCASAEVGDTVTLSCGAHGRTISSVDVASFGVARGRCGSY-DGGCESKV-AYDA 773
Query: 757 IEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
CVGK+SC++ ++A C +G L V+A C
Sbjct: 774 FAAACVGKESCTVLVTDA-FANAGCVSGV---LTVQATC 808
>gi|148906967|gb|ABR16628.1| unknown [Picea sitchensis]
Length = 836
Score = 734 bits (1896), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/829 (46%), Positives = 497/829 (59%), Gaps = 80/829 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D +A+ I+GER+IL+SGSIHYPRST MWPDL +KAK+GGLD I+TYVFWN HEP
Sbjct: 25 VTYDHKALVINGERRILISGSIHYPRSTAEMWPDLFRKAKDGGLDVIQTYVFWNMHEPSP 84
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y+F G DL++F+K Q+ GLYV LRIGPYVCAEWN+GGFPVWL +PGI RT N+
Sbjct: 85 GNYNFEGRFDLVKFVKLAQEAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIS-FRTDNE 143
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F N M+ FT +VD+ K E LF SQGGPIILAQ+ENEY +YG AG Y+NW A+M
Sbjct: 144 PFKNAMEGFTKKVVDLMKSEGLFESQGGPIILAQVENEYKPEEMEYGLAGAQYMNWAAQM 203
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A +D GVPW+MC++ DAP P+ F PN P P +WTE W+GW+ +GG
Sbjct: 204 AVGMDTGVPWVMCKQDDAPDPVINTCNGFYCDNFVPNKPYKPTMWTEAWSGWYTEFGGAS 263
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R EDLAFAVARFF GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAPIDEYG +
Sbjct: 264 PHRPVEDLAFAVARFFVKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLIR 323
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG------------------------NSV- 352
QPKWGHL+ELHK +K E L G+ T G NSV
Sbjct: 324 QPKWGHLKELHKAIKLCEPALVSGDPVVTSLGHFQQAYVYSAGAGNCAAFIVNYDSNSVG 383
Query: 353 ----SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
+G Y + WSVSILPDC+ FNTAKV+ QT+ P W+ E
Sbjct: 384 RVIFNGQRYKIAPWSVSILPDCRNVVFNTAKVDVQTSQMKMTPVGG------FGWESIDE 437
Query: 409 MINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSG 468
I F A+ L T D +DYLWY+T+ ++ +D+P + L + S+G
Sbjct: 438 NIASF--EDNSISAVGLLEQINITRDNTDYLWYITSVEVDEDEPFIKNGGLPVLTVQSAG 495
Query: 469 QVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPN 528
LH ++N + SQ+ + F V+L G N+ISLLS TVGLQN G F+M
Sbjct: 496 DALHVFINDDLAGSQYGRKENPKVRFSSGVRLNVGTNKISLLSMTVGLQNIGPHFEMANA 555
Query: 529 GIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLN 588
G+ GP+ L +G + +DLSS +W+Y++GL G + + N+ VP +
Sbjct: 556 GVLGPITL---SGFKDGTRDLSSQRWSYQIGLKG--ETMNLHTSGDNTVEWMKGVAVPQS 610
Query: 589 RRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYR 648
+ + WYK F+AP DP+ L+L MGKG AWVNG ++GRYWP+YLAE G ++ C Y
Sbjct: 611 QPLRWYKAEFDAPAGEDPLGLDLSSMGKGQAWVNGQSIGRYWPSYLAE--GVCSDGCSYE 668
Query: 649 GPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQ 708
G Y KC NCG SQ WYHVPRSW++ NTLVLFEE GGNPS ++ T V + C
Sbjct: 669 GTYRPHKCDTNCGQSSQRWYHVPRSWLQPSGNTLVLFEEIGGNPSGVSLVTRSVDSVCAH 728
Query: 709 AHENKT---------------------MELTC-HGRRISEIKYASFGDPQGACGAFKKGS 746
E+ + + L C G+RIS IK+ASFG PQG CG+F++G
Sbjct: 729 VSESHSQSINFWRLESTDQVQKLHIPKVHLQCSKGQRISAIKFASFGTPQGLCGSFQQGD 788
Query: 747 CEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
C + + + I+K+C+G + CS+ SE G C G K + +EA+C
Sbjct: 789 CHSP-NSVATIQKKCMGLRKCSLSVSEKIFGGDPC-PGVRKGVAIEAVC 835
>gi|225433463|ref|XP_002263385.1| PREDICTED: beta-galactosidase 9-like [Vitis vinifera]
Length = 882
Score = 732 bits (1890), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/884 (45%), Positives = 516/884 (58%), Gaps = 119/884 (13%)
Query: 10 AILLCLILQTLFNLSLA-YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAK 68
A LLC L +S A + VS+D RA+ IDG+R++L+S IHYPR+TP MWPDLI K+K
Sbjct: 9 AALLCFSLTIQLGVSFAPFNVSYDHRALLIDGKRRMLVSAGIHYPRATPEMWPDLIAKSK 68
Query: 69 EGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYG 128
EGG D I+TYVFWN HEP+RRQY+F G D+++F+K + GLY+ LRIGPYVCAEWN+G
Sbjct: 69 EGGADVIQTYVFWNGHEPVRRQYNFEGRYDIVKFVKLVGSSGLYLHLRIGPYVCAEWNFG 128
Query: 129 GFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYG 188
GFPVWL ++PGIE RT N F +EMQ F IVD+ +KE LF+ QGGPII+ QIENEYG
Sbjct: 129 GFPVWLRDIPGIE-FRTDNAPFKDEMQRFVKKIVDLMQKEMLFSWQGGPIIMLQIENEYG 187
Query: 189 NVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPN 237
NV S +G GK Y+ W A+MA LD GVPW+MCQ++DAP + F PN+ N
Sbjct: 188 NVESSFGQRGKDYVKWAARMALELDAGVPWVMCQQADAPDIIINACNGFYCDAFWPNSAN 247
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
PK+WTE+W GWF SWGG+ PKR ED+AFAVARFFQ GG+F NYYMY GGTNFGR+SGG
Sbjct: 248 KPKLWTEDWNGWFASWGGRTPKRPVEDIAFAVARFFQRGGSFHNYYMYFGGTNFGRSSGG 307
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGN--------------- 342
P+ TSYDYDAPIDEYG L+QPKWGHL+ELH +K E L +
Sbjct: 308 PFYVTSYDYDAPIDEYGLLSQPKWGHLKELHAAIKLCEPALVAVDSPQYIKLGPMQEAHV 367
Query: 343 ------VTNTDYGNSVS-------------------GSSYNLPAWSVSILPDCKTEEFNT 377
+ +T GN S G Y LP WSVSILPDC+T FNT
Sbjct: 368 YRVKESLYSTQSGNGSSCSAFLANIDEHKTASVTFLGQIYKLPPWSVSILPDCRTTVFNT 427
Query: 378 AKVNTQTNVK-------------VKRPNQAGN--DQAPLQWKWRPEMINDFVVRGKGHFA 422
AKV QT++K V +P N P W E I+ V + +F
Sbjct: 428 AKVGAQTSIKTVEFDLPLVRNISVTQPLMVQNKISYVPKTWMTLKEPIS---VWSENNFT 484
Query: 423 LNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNM--TLRINSSGQVLHAYVNGNY 479
+ +++ T D SDYLW +T ++ +D + + TL I+S +LH +VNG
Sbjct: 485 IQGVLEHLNVTKDHSDYLWRITRINVSAEDISFWEENQVSPTLSIDSMRDILHIFVNGQL 544
Query: 480 VDS---QWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLL 536
+ S W K +P++L +G N + LLS TVGLQNYG+ + G G V L
Sbjct: 545 IGSVIGHWVK-------VVQPIQLLQGYNDLVLLSQTVGLQNYGAFLEKDGAGFKGQVKL 597
Query: 537 VGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLN-RRMTWYK 595
G E DLS + WTY+VGL G K + ++ +E W+ + TWYK
Sbjct: 598 TGFKNGEI---DLSEYSWTYQVGLRGEFQKIYMIDESEKAE--WTDLTPDASPSTFTWYK 652
Query: 596 TTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDK 655
T F+AP +PV L+L MGKG AWVNG+++GRYW T +A +DGC CDYRG Y + K
Sbjct: 653 TFFDAPNGENPVALDLGSMGKGQAWVNGHHIGRYW-TRVAPKDGCG--KCDYRGHYHTSK 709
Query: 656 CAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHENK-- 713
CA NCGNP+QIWYH+PRSW++ N LVLFEE GG P +I+ ++ T C + E+
Sbjct: 710 CATNCGNPTQIWYHIPRSWLQASNNLLVLFEETGGKPFEISVKSRSTQTICAEVSESHYP 769
Query: 714 ---------------------TMELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEI 751
M L C G IS I++AS+G PQG+C F +G C A
Sbjct: 770 SLQNWSPSDFIDQNSKNKMTPEMHLQCDDGHTISSIEFASYGTPQGSCQMFSQGQCHAP- 828
Query: 752 DVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+ L L+ K C GK SC I + G C G VK L VEA C
Sbjct: 829 NSLALVSKACQGKGSCVIRILNSAFGGDPC-RGIVKTLAVEAKC 871
>gi|224116208|ref|XP_002317239.1| predicted protein [Populus trichocarpa]
gi|222860304|gb|EEE97851.1| predicted protein [Populus trichocarpa]
Length = 849
Score = 731 bits (1886), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/835 (46%), Positives = 505/835 (60%), Gaps = 91/835 (10%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D +A+ IDG+R++L SGSIHYPR+TP +WP++I+K+KEGGLD IETYVFWN HEP+R
Sbjct: 36 VTYDHKALVIDGKRRVLQSGSIHYPRTTPEVWPEIIRKSKEGGLDVIETYVFWNYHEPVR 95
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QY F G DL+RF+KT+Q+ GL+V LRIGPY CAEWNYGGFP+WLH +PG+ + RT+N
Sbjct: 96 GQYYFEGRFDLVRFVKTVQEAGLFVHLRIGPYACAEWNYGGFPLWLHFIPGV-QFRTSND 154
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
+F N M++F T IVD+ K + LFASQGGPIILAQ+ENEYGNV YG G+ Y+ W A+
Sbjct: 155 IFKNAMKSFLTKIVDLMKDDNLFASQGGPIILAQVENEYGNVQWAYGVGGELYVKWAAET 214
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A SL+ VPW+MC + DAP P+ FTPN+P+ PK+WTEN++GWF ++G
Sbjct: 215 AISLNTTVPWVMCVQEDAPDPVINTCNGFYCDQFTPNSPSKPKMWTENYSGWFLAFGYAV 274
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R EDLAFAVARFF++GG+FQNYYMY GGTNFGRT+GGP + TSYDYDAPIDEYG +
Sbjct: 275 PYRPVEDLAFAVARFFEYGGSFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 334
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------------------------- 350
QPKWGHLR+LH +K E+ L + + GN
Sbjct: 335 QPKWGHLRDLHSAIKQCEEYLVSSDPVHQQLGNKLEAHVYYKHSNDCAAFLANYDSGSDA 394
Query: 351 --SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK----VKRPNQAGNDQAPLQWK 404
+ +G++Y LPAWSVSIL DCK FNTAKV TQ ++ + GN A W
Sbjct: 395 NVTFNGNTYFLPAWSVSILADCKNVIFNTAKVVTQRHIGDALFSRSTTVDGNLVAASPWS 454
Query: 405 WRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLR 463
W E + + G F L++Q +T D SD+LWY T+ ++ L
Sbjct: 455 WYKEEVG---IWGNNSFTKPGLLEQINTTKDTSDFLWYSTSLYVEAGQ-----DKEHLLN 506
Query: 464 INSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF 523
I S G +VN +V + + ++ R + L G N + +LS +G+QNYG F
Sbjct: 507 IESLGHAALVFVNKRFVAFGYGNHDDASFSLTREISLEEGNNTLDVLSMLIGVQNYGPWF 566
Query: 524 DMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGL----YGLDDKKFYNAKAANSERG 579
D+ GI V LV + KDLSS KWTY+VGL GLD N ANS
Sbjct: 567 DVQGAGIHS-VFLVDLHKSK---KDLSSGKWTYQVGLEGEYLGLD-----NVSLANSSLW 617
Query: 580 WSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDG 639
++P+N+ + WYK T AP N P+ LNL MGKG AW+NG ++GRYW YL+ G
Sbjct: 618 SQGTSLPVNKSLIWYKATIIAPEGNGPLALNLASMGKGQAWINGQSIGRYWSAYLSPSAG 677
Query: 640 CSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
C T++CDYRG Y S KC CG P+Q YH+PR+W+ G N LVL EE GG+PSQI+ T
Sbjct: 678 C-TDNCDYRGAYNSFKCQKKCGQPAQTLYHIPRTWVHPGENLLVLHEELGGDPSQISLLT 736
Query: 700 VVVGTACGQAHENK------------------TMELTC-HGRRISEIKYASFGDPQGACG 740
C E+ + LTC HG I+ I +ASFG P+G CG
Sbjct: 737 RTGQDICSIVSEDDPPPADSWKPNLEFMSQSPEVRLTCEHGWHIAAINFASFGTPEGKCG 796
Query: 741 AFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
F G+C A D+L +++K C+G + CSI S A LG G VKR VVEALC
Sbjct: 797 TFTPGNCHA--DMLTIVQKACIGHERCSIPISAAKLGDP--CPGVVKRFVVEALC 847
>gi|359480881|ref|XP_003632537.1| PREDICTED: beta-galactosidase 3-like [Vitis vinifera]
gi|296082595|emb|CBI21600.3| unnamed protein product [Vitis vinifera]
Length = 847
Score = 731 bits (1886), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/841 (46%), Positives = 497/841 (59%), Gaps = 81/841 (9%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
SLA V++D R++ IDG+RK+L+S SIHYPRS PGMWP L+K AKEGG+D IETYVFWN
Sbjct: 18 SLAANVTYDRRSLIIDGQRKLLISASIHYPRSVPGMWPGLVKTAKEGGIDVIETYVFWNG 77
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
HE Y F G DL++F+K +Q +Y+ILR+GP+V AEWN+GG PVWLH +PG
Sbjct: 78 HELSPDNYYFGGRYDLLKFVKIVQQARMYLILRVGPFVAAEWNFGGVPVWLHYVPGTV-F 136
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYIN 203
RT ++ F MQ F TLIV++ KKEKLFASQGGPIILAQ+ENEYG+ YGD GK Y
Sbjct: 137 RTNSEPFKYHMQKFMTLIVNIMKKEKLFASQGGPIILAQVENEYGDTERIYGDGGKPYAM 196
Query: 204 WCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKS 252
W A MA S +IGVPWIMCQ+ DAP P+ FTPN+PN PK+WTENW GWFK+
Sbjct: 197 WAANMALSQNIGVPWIMCQQYDAPDPVINTCNSFYCDQFTPNSPNKPKMWTENWPGWFKT 256
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDE 312
+G DP R ED+AF+VARFFQ GG+ QNYYMYHGGTNFGRTSGGP++TTSYDY+APIDE
Sbjct: 257 FGAPDPHRPHEDIAFSVARFFQKGGSLQNYYMYHGGTNFGRTSGGPFITTSYDYNAPIDE 316
Query: 313 YGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNS--------------------- 351
YG PKWGHL+ELH+ +KS E L YG N G S
Sbjct: 317 YGLARLPKWGHLKELHRAIKSCEHVLLYGEPINLSLGPSQEVDVYTDSSGGCAAFISNVD 376
Query: 352 --------VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQ-------AGN 396
SY++PAWSVSILPDCK FNTAKV +QT+ P + +
Sbjct: 377 EKEDKIIVFQNVSYHVPAWSVSILPDCKNVVFNTAKVGSQTSQVEMVPEELQPSLVPSNK 436
Query: 397 DQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILS 455
D LQW+ + + G+ F N +D +T D +DYLWY + + + + L
Sbjct: 437 DLKGLQWE---TFVEKAGIWGEADFVKNGFVDHINTTKDTTDYLWYTVSLTVGESENFLK 493
Query: 456 GSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVG 515
S L + S G LHA+VN S S FE P+ L GKN I+LLS TVG
Sbjct: 494 EISQPVLLVESKGHALHAFVNQKLQGSASGNGSHSPFKFECPISLKAGKNDIALLSMTVG 553
Query: 516 LQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAAN 575
LQN G ++ V G+ V G I DLS++ WTYK+GL G + Y + N
Sbjct: 554 LQNAGPFYEWVGAGLTS----VKIKGLNNGIMDLSTYTWTYKIGLQG-EHLLIYKPEGLN 608
Query: 576 SERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLA 635
S + S+ P + +TWYK + P N+P+ L++ MGKG AW+NG +GRYWP +
Sbjct: 609 SVKWLSTPEPPKQQPLTWYKAVVDPPSGNEPIGLDMVHMGKGLAWLNGEEIGRYWPRKSS 668
Query: 636 EEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQI 695
D C E CDYRG + +KC+ CG P+Q WYHVPRSW K N LV+FEE GG+P++I
Sbjct: 669 IHDKCVQE-CDYRGKFMPNKCSTGCGEPTQRWYHVPRSWFKPSGNILVIFEEKGGDPTKI 727
Query: 696 NFQTVVVGTACG----------------QAHENKTMELTCHGR-----RISEIKYASFGD 734
F C A+EN + T H + IS +K+AS+G
Sbjct: 728 RFSRRKTTGVCALVSEDHPTYELESWHKDANENNKNKATIHLKCPENTHISSVKFASYGT 787
Query: 735 PQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEAL 794
P G CG++ +G C + + ++EK C+ K C+IE +E N C + T K+L VEA+
Sbjct: 788 PTGKCGSYSQGDCH-DPNSASVVEKLCIRKNDCAIELAEKNFSKDLCPS-TTKKLAVEAV 845
Query: 795 C 795
C
Sbjct: 846 C 846
>gi|414865886|tpg|DAA44443.1| TPA: hypothetical protein ZEAMMB73_968467 [Zea mays]
Length = 830
Score = 729 bits (1883), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 398/841 (47%), Positives = 502/841 (59%), Gaps = 109/841 (12%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A V++D RA+ IDG R++L+SGSIHYPRSTP MWP LI+KAK+GGLD IETYVFW+ HE
Sbjct: 27 AANVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKAKDGGLDVIETYVFWDIHE 86
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P+R QYDF G DL F+KT+ D GLYV LRIGPYVCAEWNYGGFP+WLH +PGI + RT
Sbjct: 87 PVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGI-KFRT 145
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
N+ F EMQ FT A+IENEYGN+ S YG GK+Y+ W
Sbjct: 146 DNEPFKAEMQRFT----------------------AKIENEYGNIDSAYGAPGKAYMRWA 183
Query: 206 AKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWG 254
A MA SLD GVPW+MCQ++DAP P+ FTPN+ PK+WTENW+GWF S+G
Sbjct: 184 AGMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGWFLSFG 243
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
G P R EDLAFAVARF+Q GGTFQNYYMYHGGTN R+SGGP++ TSYDYDAPIDEYG
Sbjct: 244 GAVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAPIDEYG 303
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSV---------------------- 352
+ QPKWGHLR++HK +K E L + + T G +V
Sbjct: 304 LVRQPKWGHLRDVHKAIKLCEPALIATDPSYTSLGPNVEAAVYKVGSVCAAFLANIDGQS 363
Query: 353 ------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGN---------- 396
+G Y LPAWSVSILPDCK NTA++N+QT R ++ N
Sbjct: 364 DKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQINSQTTGSEMRYLESSNVASDGSFVTP 423
Query: 397 DQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILS 455
+ A W + E + + L++Q +T D SD+LWY T+ +K D+P L+
Sbjct: 424 ELAVSDWSYAIEPVG---ITKDNALTKAGLMEQINTTADASDFLWYSTSITVKGDEPYLN 480
Query: 456 GSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVG 515
GS + L +NS G VL Y+NG S +S +++P++L GKN+I LLSATVG
Sbjct: 481 GSQS-NLAVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSATVG 539
Query: 516 LQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAAN 575
L NYG+ FD+V GI GPV L G G DLSS +WTY++GL G +D Y+ A+
Sbjct: 540 LSNYGAFFDLVGAGITGPVKLSGLNG----ALDLSSAEWTYQIGLRG-EDLHLYDPSEAS 594
Query: 576 SERGWSSKNV-PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYL 634
E W S N P+N + WYKT F P +DPV ++ GMGKG AWVNG ++GRYWPT L
Sbjct: 595 PE--WVSANAYPINHPLIWYKTKFTPPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNL 652
Query: 635 AEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQ 694
A + GC SC+YRG Y S KC CG PSQ YHVPRS+++ G N LVLFE FGG+PS+
Sbjct: 653 APQSGC-VNSCNYRGAYSSSKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEHFGGDPSK 711
Query: 695 INFQTVVVGTACGQAHE------------------NKTMELTC--HGRRISEIKYASFGD 734
I+F G+ C Q E + L C G+ IS +K+ASFG
Sbjct: 712 ISFVMRQTGSVCAQVSEAHPAQIDSWSSQQPMQRYGPALRLECPKEGQVISSVKFASFGT 771
Query: 735 PQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEAL 794
P G CG++ G C + L ++++ C+G S +N C G K L VEA
Sbjct: 772 PSGTCGSYSHGEC-SSTQALSIVQEACIGVSS-CSVPVSSNYFGNPC-TGVTKSLAVEAA 828
Query: 795 C 795
C
Sbjct: 829 C 829
>gi|449459196|ref|XP_004147332.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
gi|449497145|ref|XP_004160325.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 844
Score = 729 bits (1881), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/855 (45%), Positives = 507/855 (59%), Gaps = 90/855 (10%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
++LCL L L LA V++D R++ IDG RK+L+S SIHYPRS P MWP LI+ AKEG
Sbjct: 8 LVLCLFLP----LCLAANVTYDRRSLIIDGHRKLLISASIHYPRSVPAMWPSLIQNAKEG 63
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
G+D IETYVFWN HE Y F G DL++FI + + GLY+ILRIGP+V AEWN+GG
Sbjct: 64 GVDVIETYVFWNGHELSPDNYHFDGRFDLVKFINIVHNAGLYLILRIGPFVAAEWNFGGV 123
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
PVWLH +P RT N F MQ FTT IV + KKEKLFASQGGPIIL+Q+ENEYG++
Sbjct: 124 PVWLHYIPNTV-FRTDNASFKFYMQKFTTYIVSLMKKEKLFASQGGPIILSQVENEYGDI 182
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSP 239
YG+ GK Y W A+MA S +IGVPWIMCQ+ DAP P+ FTPN+PN P
Sbjct: 183 ERVYGEGGKPYAMWAAQMAVSQNIGVPWIMCQQYDAPDPVINTCNSFYCDQFTPNSPNKP 242
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
K+WTENW GWFK++G +DP R ED+AF+VARFFQ GG+ QNYYMYHGGTNFGRT+GGP+
Sbjct: 243 KMWTENWPGWFKTFGARDPHRPPEDIAFSVARFFQKGGSLQNYYMYHGGTNFGRTAGGPF 302
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGS---- 355
+TTSYDYDAPIDEYG PKWGHL+ELH+ +K E+ L T G S+
Sbjct: 303 ITTSYDYDAPIDEYGLPRLPKWGHLKELHRAIKLTERVLLNSEPTYVSLGPSLEADVYTD 362
Query: 356 -------------------------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR 390
SY+LPAWSVSILPDCK FNTA + +QT +
Sbjct: 363 SSGACAAFIANIDEKDDKTVQFRNISYHLPAWSVSILPDCKNVVFNTAMIRSQTAMVEMV 422
Query: 391 PNQ-------AGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYM 442
P + D L+W+ + + GK F N L+D +T D +DYLWY
Sbjct: 423 PEELQPSADATNKDLKALKWE---VFVEQPGIWGKADFVKNVLVDHLNTTKDTTDYLWYT 479
Query: 443 TNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDL---FERPVK 499
T+ + +++ L GS + L + S G LHA++N Q + G +D+ F++ +
Sbjct: 480 TSIFVNENEKFLKGSQPV-LVVESKGHALHAFINKKL---QVSATGNGSDITFKFKQAIS 535
Query: 500 LTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVG 559
L GKN+I+LLS TVGLQN G ++ V G+ V+ G DLSS+ W+YK+G
Sbjct: 536 LKAGKNEIALLSMTVGLQNAGPFYEWVGAGLSKVVI----EGFNNGPVDLSSYAWSYKIG 591
Query: 560 LYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFA 619
L G + Y + + SS+ P + +TWYK + P N+PV L++ MGKG A
Sbjct: 592 LQG-EHLGIYKPDGIKNVKWLSSREPPKQQPLTWYKVILDPPSGNEPVGLDMVHMGKGLA 650
Query: 620 WVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGV 679
W+NG +GRYWPT + D C + CDYRG + DKC CG P+Q WYHVPRSW K
Sbjct: 651 WLNGEEIGRYWPTKSSIHDVC-VQKCDYRGKFRPDKCLTGCGEPTQRWYHVPRSWFKPSG 709
Query: 680 NTLVLFEEFGGNPSQINFQTVVVGTAC---GQAH---------------ENKTMELTCHG 721
N LV+FEE GG+P+QI V C G+ H T++L C
Sbjct: 710 NILVIFEEKGGDPTQIRLSKRKVLGICAHLGEGHPSIESWSEAENVERKSKATVDLKCPD 769
Query: 722 R-RISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATS 780
RI++IK+ASFG PQG+CG++ G C + + + L+EK C+ + C IE E
Sbjct: 770 NGRIAKIKFASFGTPQGSCGSYSIGDCH-DPNSISLVEKVCLNRNECRIELGEEGFNKGL 828
Query: 781 CAAGTVKRLVVEALC 795
C + K+L VEA+C
Sbjct: 829 CPTAS-KKLAVEAMC 842
>gi|356508931|ref|XP_003523206.1| PREDICTED: beta-galactosidase 10-like [Glycine max]
Length = 843
Score = 727 bits (1876), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/841 (46%), Positives = 502/841 (59%), Gaps = 78/841 (9%)
Query: 21 FNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVF 80
F ++L+ VS+DGR++ IDG+RK+L+S SIHYPRS P MWP L++ AKEGG+D IETYVF
Sbjct: 14 FTVALSGNVSYDGRSLLIDGQRKLLISASIHYPRSVPAMWPGLVQTAKEGGVDVIETYVF 73
Query: 81 WNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGI 140
WN HE Y F G DL++F KT+Q G+Y+ILRIGP+V AEWN+GG PVWLH +PG
Sbjct: 74 WNGHELSPGNYYFGGRFDLVKFAKTVQQAGMYLILRIGPFVAAEWNFGGVPVWLHYVPGT 133
Query: 141 EELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKS 200
RT N+ FM MQ FTT IV++ K+EKLFASQGGPIIL+QIENEYG + Y + GK
Sbjct: 134 V-FRTYNQPFMYHMQKFTTYIVNLMKQEKLFASQGGPIILSQIENEYGYYENFYKEDGKK 192
Query: 201 YINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGW 249
Y W AKMA S + GVPWIMCQ+ DAP P+ FTP +PN PKIWTENW GW
Sbjct: 193 YALWAAKMAVSQNTGVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPNRPKIWTENWPGW 252
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAP 309
FK++GG+DP R AED+AF+VARFFQ GG+ NYYMYHGGTNFGRT+GGP++TTSYDYDAP
Sbjct: 253 FKTFGGRDPHRPAEDVAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYDAP 312
Query: 310 IDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSG--------------- 354
+DEYG PKWGHL+ELH+ +K E L G N G SV
Sbjct: 313 VDEYGLPRLPKWGHLKELHRAIKLCEHVLLNGKSVNISLGPSVEADVYTDSSGACAAFIS 372
Query: 355 --------------SSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRP---NQAGND 397
+SY+LPAWSVSILPDCK FNTAKV +QTNV P Q+
Sbjct: 373 NVDDKNDKTVEFRNASYHLPAWSVSILPDCKNVVFNTAKVTSQTNVVAMIPESLQQSDKG 432
Query: 398 QAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSG 456
L+W E + GK F + +D +T D +DYLW+ T+ + +++ L
Sbjct: 433 VNSLKWDIVKEKPG---IWGKADFVKSGFVDLINTTKDTTDYLWHTTSIFVSENEEFLKK 489
Query: 457 SSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGL 516
S L I S+G LHA+VN Y + S F+ P+ L GKN+I+LL TVGL
Sbjct: 490 GSKPVLLIESTGHALHAFVNQEYQGTGTGNGTHSPFSFKNPISLRAGKNEIALLCLTVGL 549
Query: 517 QNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANS 576
Q G +D + G+ V G + DLSS+ WTYK+G+ G + + Y N
Sbjct: 550 QTAGPFYDFIGAGLTS----VKIKGLKNGTIDLSSYAWTYKIGVQG-EYLRLYQGNGLN- 603
Query: 577 ERGWSSKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLA 635
+ W+S + P + +TWYK +AP ++PV L++ MGKG AW+NG +GRYWP
Sbjct: 604 KVNWTSTSEPQKMQPLTWYKAIVDAPPGDEPVGLDMLHMGKGLAWLNGEEIGRYWPRKSE 663
Query: 636 EEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQI 695
+ + CDYRG + DKC CG P+Q WYHVPRSW K N LVLFEE GG+P +I
Sbjct: 664 FKSEDCVKECDYRGKFNPDKCDTGCGEPTQRWYHVPRSWFKPSGNILVLFEEKGGDPEKI 723
Query: 696 NFQTVVVGTACGQAHE-----------------NKTM---ELTC-HGRRISEIKYASFGD 734
F V AC E NK + LTC RIS +K+ASFG
Sbjct: 724 KFVRRKVSGACALVAEDYPSVGLLSQGEDKIQNNKNVPFAHLTCPSNTRISAVKFASFGT 783
Query: 735 PQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEAL 794
P G+CG++ KG C + + ++EK C+ K C I+ +E N C G ++L VEA+
Sbjct: 784 PSGSCGSYLKGDCH-DPNSSTIVEKACLNKNDCVIKLTEENFKTNLC-PGLSRKLAVEAV 841
Query: 795 C 795
C
Sbjct: 842 C 842
>gi|61162206|dbj|BAD91084.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 852
Score = 725 bits (1871), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/855 (46%), Positives = 502/855 (58%), Gaps = 84/855 (9%)
Query: 6 HCSRAILLCLILQTLF---NLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPD 62
H IL+ + TLF L V++D +AI I+G+R++L+SGSIHYPRSTP MW
Sbjct: 4 HSVSKILVLFLTMTLFMASELIHCTTVTYDKKAILINGQRRLLISGSIHYPRSTPEMWEG 63
Query: 63 LIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVC 122
LI+KAK+GGLD I+TYVFWN HEP Y F G DL+RFIKT+Q GL++ LRIGPYVC
Sbjct: 64 LIQKAKDGGLDVIDTYVFWNGHEPSPGNYYFEGRYDLVRFIKTVQKAGLFLHLRIGPYVC 123
Query: 123 AEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQ 182
AEWN+GGFPVWL +PGI RT N F MQ FT IV M K EKLFASQGGPIIL+Q
Sbjct: 124 AEWNFGGFPVWLKYVPGIS-FRTDNGPFKVAMQGFTQKIVQMMKNEKLFASQGGPIILSQ 182
Query: 183 IENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------F 231
IENEYG G G++YINW AKMA LD GVPW+MC+E DAP PM F
Sbjct: 183 IENEYGPERKALGAPGQNYINWAAKMAVGLDTGVPWVMCKEDDAPDPMINACNGFYCDGF 242
Query: 232 TPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNF 291
TPN P P +WTE W+GWF +GG R +DLAFAVARF Q GG++ NYYMYHGGTNF
Sbjct: 243 TPNKPYKPTMWTEAWSGWFLEFGGTIHHRPVQDLAFAVARFIQRGGSYVNYYMYHGGTNF 302
Query: 292 GRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN- 350
GRT+GGP++TTSYDYDAPIDEYG + QPK+GHL+ELHK +K E +L T T G
Sbjct: 303 GRTAGGPFITTSYDYDAPIDEYGLIRQPKYGHLKELHKAIKLCEHSLLSSEPTVTSLGTY 362
Query: 351 ---------------------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQ 383
+ + Y+LP WSVSILPDC+ E +NTAKV Q
Sbjct: 363 HQAYVFNSGPRRCAAFLSNFHSVEARVTFNNKHYDLPPWSVSILPDCRNEVYNTAKVGVQ 422
Query: 384 TNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMT 443
T+ P + W+ E I+ R A+ L T D SDYLWYMT
Sbjct: 423 TSHVQMIPT----NSRLFSWQTYDEDISSVHERSSIP-AIGLLEQINVTRDTSDYLWYMT 477
Query: 444 NADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRG 503
N D+ D LSG TL + S+G LH +VNG + S + F PV L G
Sbjct: 478 NVDISSSD--LSGGKKPTLTVQSAGHALHVFVNGQFSGSAFGTREQRQFTFADPVNLHAG 535
Query: 504 KNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGL 563
N+I+LLS VGL N G ++ GI GPV L G + KDL+ HKW KVGL G
Sbjct: 536 INRIALLSIAVGLPNVGLHYESWKTGIQGPVFLDGLGNGK---KDLTLHKWFNKVGLKG- 591
Query: 564 DDKKFYNAKAANSERGWSSKNVPLNRRMT--WYKTTFEAPLENDPVVLNLQGMGKGFAWV 621
+ + A+S GW +++ + T WYK F AP N+P+ L+++ MGKG W+
Sbjct: 592 EAMNLVSPNGASSV-GWIRRSLATQTKQTLKWYKAYFNAPGGNEPLALDMRRMGKGQVWI 650
Query: 622 NGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNT 681
NG ++GRYW Y A+ D CS SC Y G + KC +CG P+Q WYHVPRSW+K N
Sbjct: 651 NGQSIGRYWMAY-AKGD-CS--SCSYIGTFRPTKCQLHCGRPTQRWYHVPRSWLKPTQNL 706
Query: 682 LVLFEEFGGNPSQINFQTVVVGTACGQAHEN---------------KTM-----ELTCH- 720
+V+FEE GG+PS+I V CG HEN KT+ L C
Sbjct: 707 VVVFEELGGDPSKITLVRRSVAGVCGDLHENHPNAENFDVDGNEDSKTLHQAQVHLHCAP 766
Query: 721 GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATS 780
G+ IS IK+ASFG P G CG+F++G+C A + ++EK C+G++SCS+ S +
Sbjct: 767 GQSISSIKFASFGTPSGTCGSFQQGTCHA-TNSHAVVEKNCIGRESCSVAVSNSTFETDP 825
Query: 781 CAAGTVKRLVVEALC 795
C +KRL VEA+C
Sbjct: 826 C-PNVLKRLSVEAVC 839
>gi|75116245|sp|Q67VU7.1|BGL10_ORYSJ RecName: Full=Putative beta-galactosidase 10; Short=Lactase 10;
Flags: Precursor
gi|51535501|dbj|BAD37397.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|51535704|dbj|BAD37722.1| putative beta-galactosidase [Oryza sativa Japonica Group]
Length = 809
Score = 725 bits (1871), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/820 (47%), Positives = 511/820 (62%), Gaps = 94/820 (11%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V+++ R++ IDGER+I++SGSIHYPRSTP MWPDLIKKAKEGGLDAIETYVFWN HEP R
Sbjct: 31 VTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
RQY+F GN D++RF K IQ+ GLY ILRIGPY+C EWNYGG P WL ++PG+ + R N
Sbjct: 91 RQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGM-QFRLHNA 149
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGD--AGKSYINWCA 206
F NEM+ FTTLIV+ K +FA QGGPIILAQIENEYGN+M + + YI+WCA
Sbjct: 150 PFENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEYIHWCA 209
Query: 207 KMATSLDIGVPWIMCQE-SDAPSPMFT-----------PNNPNSPKIWTENWTGWFKSWG 254
MA ++GVPWIMCQ+ SD P + PN PKIWTENWTGWFK+W
Sbjct: 210 DMANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWD 269
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
D R+AED+AFAVA FFQ GGPY+TTSYDYDAP+DEYG
Sbjct: 270 KPDFHRSAEDIAFAVAMFFQ-------------------KRGGPYITTSYDYDAPLDEYG 310
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS--------------------- 353
+L QPK+GHL++LH ++KS+EK L +G +T+Y + V+
Sbjct: 311 NLRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDKVTVTKYTLDSTSACFINNRNDNM 370
Query: 354 -------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
G+++ LPAWSVSILPDCKT FN+AK+ QT V V + + L+W W
Sbjct: 371 DVNVTLDGTTHLLPAWSVSILPDCKTVAFNSAKIKAQTTVMVNKAKMVEKEPESLKWSWM 430
Query: 407 PEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
E + F+ KG + N L++Q T+ D SDYLWY T+ + K G ++ TL +N
Sbjct: 431 RENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSINHK-------GEASYTLFVN 483
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
++G L+A+VNG V + G E P KL GKN ISLLSAT+GL+NYG F+
Sbjct: 484 TTGHELYAFVNGMLVGQNHSPNGHFVFQLESPAKLHDGKNYISLLSATIGLKNYGPLFEK 543
Query: 526 VPNGIP-GPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
+P GI GPV L+ G DLS+ W+YK GL G + ++ + K + W + N
Sbjct: 544 MPAGIVGGPVKLIDNNGKGI---DLSNSSWSYKAGLAG-EYRQIHLDKPGCT---WDNNN 596
Query: 585 --VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCST 642
VP+N+ TWYKTTF+AP D VV++L G+ KG AWVNG NLGRYWP+Y A E G
Sbjct: 597 GTVPINKPFTWYKTTFQAPAGEDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMG-GC 655
Query: 643 ESCDYRGPYGSD----KCAYNCGNPSQIWYHVPRSWIKDGV-NTLVLFEEFGGNPSQINF 697
CDYRG + ++ KC CG PSQ +YHVPRS++K+G NT++LFEE GG+PS ++F
Sbjct: 656 HHCDYRGVFQAEGDGQKCLTGCGEPSQRFYHVPRSFLKNGEPNTVILFEEAGGDPSHVSF 715
Query: 698 QTVVVGTACGQAHENKTMELTC--HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLP 755
+TV G+ C A T+ L+C H + IS I SFG +G CGA+ KG CE++
Sbjct: 716 RTVAAGSVCASAEVGDTITLSCGQHSKTISAINVTSFGVARGQCGAY-KGGCESKAAYKA 774
Query: 756 LIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
E C+GK+SC+++ + A G + C + L V+A C
Sbjct: 775 FTEA-CLGKESCTVQITNAVTG-SGCLSNV---LTVQASC 809
>gi|356518796|ref|XP_003528063.1| PREDICTED: beta-galactosidase 10-like [Glycine max]
Length = 898
Score = 723 bits (1867), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/839 (46%), Positives = 501/839 (59%), Gaps = 74/839 (8%)
Query: 21 FNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVF 80
F ++ + VS+DGR++ ID +RK+L+S SIHYPRS P MWP L++ AKEGG+D IETYVF
Sbjct: 69 FTVASSANVSYDGRSLIIDAQRKLLISASIHYPRSVPAMWPGLVQTAKEGGVDVIETYVF 128
Query: 81 WNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGI 140
WN HE Y F G DL++F +T+Q G+Y+ILRIGP+V AEWN+GG PVWLH +PG
Sbjct: 129 WNGHELSPGNYYFGGRFDLVKFAQTVQQAGMYLILRIGPFVAAEWNFGGVPVWLHYVPGT 188
Query: 141 EELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKS 200
RT N+ FM MQ FTT IV++ K+EKLFASQGGPIILAQIENEYG + Y + GK
Sbjct: 189 V-FRTYNQPFMYHMQKFTTYIVNLMKQEKLFASQGGPIILAQIENEYGYYENFYKEDGKK 247
Query: 201 YINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGW 249
Y W AKMA S + GVPWIMCQ+ DAP P+ FTP +PN PKIWTENW GW
Sbjct: 248 YALWAAKMAVSQNTGVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPNRPKIWTENWPGW 307
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAP 309
FK++GG+DP R AED+AF+VARFFQ GG+ NYYMYHGGTNFGRT+GGP++TTSYDYDAP
Sbjct: 308 FKTFGGRDPHRPAEDVAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYDAP 367
Query: 310 IDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSG--------------- 354
+DEYG PKWGHL+ELH+ +K E L G N G SV
Sbjct: 368 VDEYGLPRLPKWGHLKELHRAIKLCEHVLLNGKSVNISLGPSVEADVYTDSSGACAAFIS 427
Query: 355 --------------SSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQ-AGNDQA 399
+S++LPAWSVSILPDCK FNTAKV +QT+V P +D+
Sbjct: 428 NVDDKNDKTVEFRNASFHLPAWSVSILPDCKNVVFNTAKVTSQTSVVAMVPESLQQSDKV 487
Query: 400 PLQWKWRPEMINDFV-VRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGS 457
+KW +++ + + GK F N +D +T D +DYLW+ T+ + +++ L
Sbjct: 488 VNSFKW--DIVKEKPGIWGKADFVKNGFVDLINTTKDTTDYLWHTTSIFVSENEEFLKKG 545
Query: 458 SNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQ 517
+ L I S+G LHA+VN Y + + F+ P+ L GKN+I+LL TVGLQ
Sbjct: 546 NKPVLLIESTGHALHAFVNQEYEGTGSGNGTHAPFTFKNPISLRAGKNEIALLCLTVGLQ 605
Query: 518 NYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSE 577
G +D V G+ V G DLSS+ WTYK+G+ G + + Y N+
Sbjct: 606 TAGPFYDFVGAGLTS----VKIKGLNNGTIDLSSYAWTYKIGVQG-EYLRLYQGNGLNNV 660
Query: 578 RGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEE 637
S+ P + +TWYK +AP ++PV L++ MGKG AW+NG +GRYWP +
Sbjct: 661 NWTSTSEPPKMQPLTWYKAIVDAPPGDEPVGLDMLHMGKGLAWLNGEEIGRYWPRKSEFK 720
Query: 638 DGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
+ CDYRG + DKC CG P+Q WYHVPRSW K N LVLFEE GG+P +I F
Sbjct: 721 SEDCVKECDYRGKFNPDKCDTGCGEPTQRWYHVPRSWFKPSGNILVLFEEKGGDPEKIKF 780
Query: 698 QTVVVGTACGQAHE-----------------NKTM---ELTCHGR-RISEIKYASFGDPQ 736
V AC E NK + L C G RIS +K+ASFG P
Sbjct: 781 VRRKVSGACALVAEDYPSVALVSQGEDKIQSNKNIPFARLACPGNTRISAVKFASFGSPS 840
Query: 737 GACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G CG++ KG C + + ++EK C+ K C I+ +E N + C G ++L VEA+C
Sbjct: 841 GTCGSYLKGDCH-DPNSSTIVEKACLNKNDCVIKLTEENFKSNLC-PGLSRKLAVEAVC 897
>gi|350537729|ref|NP_001234307.1| beta-galactosidase, chloroplastic precursor [Solanum lycopersicum]
gi|7939621|gb|AAF70823.1|AF154422_1 beta-galactosidase [Solanum lycopersicum]
Length = 870
Score = 719 bits (1857), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/839 (46%), Positives = 499/839 (59%), Gaps = 87/839 (10%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D R++ I+G+RK+L+S SIHYPRS P MWP L++ AKEGG+D IETYVFWN HEP
Sbjct: 46 VTYDRRSLIINGQRKLLISASIHYPRSVPAMWPGLVRLAKEGGVDVIETYVFWNGHEPSP 105
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y F G DL++F K IQ G+Y+ILRIGP+V AEWN+GG PVWLH +PG RT ++
Sbjct: 106 GNYYFGGRFDLVKFCKIIQQAGMYMILRIGPFVAAEWNFGGLPVWLHYVPGTT-FRTDSE 164
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ F T V++ K+E+LFASQGGPIIL+Q+ENEYG + YG+ GK Y W AKM
Sbjct: 165 PFKYHMQKFMTYTVNLMKRERLFASQGGPIILSQVENEYGYYENAYGEGGKRYALWAAKM 224
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A S + GVPWIMCQ+ DAP P+ F P +PN PKIWTENW GWFK++G +D
Sbjct: 225 ALSQNTGVPWIMCQQYDAPDPVIDTCNSFYCDQFKPISPNKPKIWTENWPGWFKTFGARD 284
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R AED+A++VARFFQ GG+ QNYYMYHGGTNFGRT+GGP++TTSYDYDAPIDEYG
Sbjct: 285 PHRPAEDVAYSVARFFQKGGSVQNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLPR 344
Query: 318 QPKWGHLRELHKLLKSMEKT--------LTYGNVTNTDYGNSVSGS-------------- 355
PKWGHL+ELHK++KS E L+ G + D SG+
Sbjct: 345 FPKWGHLKELHKVIKSCEHALLNNDPTLLSLGPLQEADVYEDASGACAAFLANMDDKNDK 404
Query: 356 -------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRP-------NQAGNDQAPL 401
SY+LPAWSVSILPDCK FNTAKV QT++ P + D L
Sbjct: 405 VVQFRHVSYHLPAWSVSILPDCKNVAFNTAKVGCQTSIVNMAPIDLHPTASSPKRDIKSL 464
Query: 402 QWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNM 460
QW+ E V G F N +D +T D +DYLWY T+ + ++ L
Sbjct: 465 QWEVFKETAG---VWGVADFTKNGFVDHINTTKDATDYLWYTTSIFVHAEEDFLRNRGTA 521
Query: 461 TLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYG 520
L + S G +H ++N S F P+ L GKN+ISLLS TVGLQ G
Sbjct: 522 MLFVESKGHAMHVFINKKLQASASGNGTVPQFKFGTPIALKAGKNEISLLSMTVGLQTAG 581
Query: 521 SKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYG--LDDKKFYNAKAANSER 578
+ ++ + G P V + AG +T DL++ WTYK+GL G L +K YN K+ +
Sbjct: 582 AFYEWIGAG-PTSVKV---AGFKTGTMDLTASAWTYKIGLQGEHLRIQKSYNLKS----K 633
Query: 579 GWSSKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEE 637
W+ + P ++ +TWYK +AP N+PV L++ MGKG AW+NG +GRYWP ++
Sbjct: 634 IWAPTSQPPKQQPLTWYKAVVDAPPGNEPVALDMIHMGKGMAWLNGQEIGRYWPRRTSKY 693
Query: 638 DGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
+ C T+ CDYRG + DKC CG P+Q WYHVPRSW K N L++FEE GG+PSQI F
Sbjct: 694 ENCVTQ-CDYRGKFNPDKCVTGCGQPTQRWYHVPRSWFKPSGNVLIIFEEIGGDPSQIRF 752
Query: 698 QTVVVGTACG--------------QAHE------NKTMELTC-HGRRISEIKYASFGDPQ 736
V ACG Q E T+ L C IS +K+ASFG+P
Sbjct: 753 SMRKVSGACGHLSVDHPSFDVENLQGSEIENDKNRPTLSLKCPTNTNISSVKFASFGNPN 812
Query: 737 GACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G CG++ G C + + L+EK C+ + C++E S AN C + TVK+L VE C
Sbjct: 813 GTCGSYMLGDCHDQ-NSAALVEKVCLNQNECALEMSSANFNMQLCPS-TVKKLAVEVNC 869
>gi|308550956|gb|ADO34792.1| beta-galactosidase STBG7 [Solanum lycopersicum]
Length = 870
Score = 719 bits (1857), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/839 (45%), Positives = 502/839 (59%), Gaps = 87/839 (10%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D R++ I+G+RK+L+S SIHYPRS P MWP L++ AKEGG+D IETYVFWN HEP
Sbjct: 46 VTYDRRSLIINGQRKLLISASIHYPRSVPAMWPGLVRLAKEGGVDVIETYVFWNGHEPSP 105
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y F G DL++F K IQ G+Y+ILRIGP+V AEWN+GG PVWLH +PG RT ++
Sbjct: 106 GNYYFGGRFDLVKFCKIIQQAGMYMILRIGPFVAAEWNFGGLPVWLHYVPGTT-FRTDSE 164
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ F T V++ K+E+LFASQGGPIIL+Q+ENEYG + YG+ GK Y W AKM
Sbjct: 165 PFKYHMQKFMTYTVNLMKRERLFASQGGPIILSQVENEYGYYENAYGEGGKRYALWAAKM 224
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A S + GVPWIMCQ+ DAP P+ F P +PN PKIWTENW GWFK++G +D
Sbjct: 225 ALSQNTGVPWIMCQQYDAPDPVIDTCNSFYCDQFKPISPNKPKIWTENWPGWFKTFGARD 284
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R AED+A++VARFFQ GG+ QNYYMYHGGTNFGRT+GGP++TTSYDYDAPIDEYG
Sbjct: 285 PHRPAEDVAYSVARFFQKGGSVQNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLPR 344
Query: 318 QPKWGHLRELHKLLKSMEKT--------LTYGNVTNTDYGNSVSGS-------------- 355
PKWGHL+ELHK++KS E L+ G + D SG+
Sbjct: 345 FPKWGHLKELHKVIKSCEHALLNNDPTLLSLGPLQEADVYEDASGACAAFLANMDDKNDK 404
Query: 356 -------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRP-------NQAGNDQAPL 401
SY+LPAWSVSILPDCK FNTAKV QT++ P + D L
Sbjct: 405 VVQFRHVSYHLPAWSVSILPDCKNVAFNTAKVGCQTSIVNMAPIDLHPTASSPKRDIKSL 464
Query: 402 QWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNM 460
QW+ E V G F N +D +T D +DYLWY T+ + ++ L
Sbjct: 465 QWEVFKETAG---VWGVADFTKNGFVDHINTTKDATDYLWYTTSIFVHAEEDFLRNRGTA 521
Query: 461 TLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYG 520
L + S G +H ++N S F P+ L GKN+I+LLS TVGLQ G
Sbjct: 522 MLFVESKGHAMHVFINKKLQASASGNGTVPQFKFGTPIALKAGKNEIALLSMTVGLQTAG 581
Query: 521 SKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYG--LDDKKFYNAKAANSER 578
+ ++ + G P V + AG +T DL++ WTYK+GL G L +K YN K+ +
Sbjct: 582 AFYEWIGAG-PTSVKV---AGFKTGTMDLTASAWTYKIGLQGEHLRIQKSYNLKS----K 633
Query: 579 GWSSKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEE 637
W+ + P ++ +TWYK +AP N+PV L++ MGKG AW+NG +GRYWP ++
Sbjct: 634 IWAPTSQPPKQQPLTWYKAVVDAPPGNEPVALDMIHMGKGMAWLNGQEIGRYWPRRTSKY 693
Query: 638 DGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
+ C T+ CDYRG + DKC CG P+Q WYHVPRSW K N L++FEE GG+PSQI F
Sbjct: 694 ENCVTQ-CDYRGKFNPDKCVTGCGQPTQRWYHVPRSWFKPSGNVLIIFEEIGGDPSQIRF 752
Query: 698 QTVVVGTACG-------------------QAHENK-TMELTC-HGRRISEIKYASFGDPQ 736
V ACG ++ +N+ T+ L C IS +K+ASFG+P
Sbjct: 753 SMRKVSGACGHLSVDHPSFDVENLQGSEIESDKNRPTLSLKCPTNTNISSVKFASFGNPN 812
Query: 737 GACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G CG++ G C + + L+EK C+ + C++E S AN C + TVK+L VE C
Sbjct: 813 GTCGSYMLGDCHDQ-NSAALVEKVCLNQNECALEMSSANFNMQLCPS-TVKKLAVEVNC 869
>gi|1168654|sp|P45582.1|BGAL_ASPOF RecName: Full=Beta-galactosidase; Short=Lactase; Flags: Precursor
gi|452712|emb|CAA54525.1| beta-galactosidase [Asparagus officinalis]
Length = 832
Score = 719 bits (1855), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/826 (47%), Positives = 494/826 (59%), Gaps = 80/826 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D +++ I+G+R+IL+SGSIHYPRSTP MWPDLI+KAK+GGLD I+TYVFWN HEP
Sbjct: 27 VTYDHKSVIINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPSP 86
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QY F G DL+RF+K ++ GLY LRIGPYVCAEWN+GGFPVWL +PGI RT N
Sbjct: 87 GQYYFGGRYDLVRFLKLVKQAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGIH-FRTDNG 145
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F M FT IV M K E L+ +QGGPIIL+QIENEYG V G AGKSY NW AKM
Sbjct: 146 PFKAAMGKFTEKIVSMMKAEGLYETQGGPIILSQIENEYGPVEYYDGAAGKSYTNWAAKM 205
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A L+ GVPW+MC++ DAP P+ F+PN N PK+WTE WTGWF +GG
Sbjct: 206 AVGLNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKDNKPKMWTEAWTGWFTGFGGAV 265
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P+R AED+AFAVARF Q GG+F NYYMYHGGTNFGRT+GGP+++TSYDYDAPIDEYG L
Sbjct: 266 PQRPAEDMAFAVARFIQKGGSFINYYMYHGGTNFGRTAGGPFISTSYDYDAPIDEYGLLR 325
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------------------------- 350
QPKWGHLR+LHK +K E L G T T G
Sbjct: 326 QPKWGHLRDLHKAIKLCEPALVSGEPTITSLGQNQESYVYRSKSSCAAFLANFNSRYYAT 385
Query: 351 -SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEM 409
+ +G YNLP WSVSILPDCKT FNTA+V QT +K G WK E
Sbjct: 386 VTFNGMHYNLPPWSVSILPDCKTTVFNTARVGAQTTT-MKMQYLGG-----FSWKAYTE- 438
Query: 410 INDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSG 468
D F + L++Q ST D SDYLWY T D+ ++ L L + S+G
Sbjct: 439 --DTDALNDNTFTKDGLVEQLSTTWDRSDYLWYTTYVDIAKNEEFLKTGKYPYLTVMSAG 496
Query: 469 QVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPN 528
+H ++NG + + + KL G N+IS+LS +VGL N G+ F+
Sbjct: 497 HAVHVFINGQLSGTAYGSLDNPKLTYSGSAKLWAGSNKISILSVSVGLPNVGNHFETWNT 556
Query: 529 GIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLN 588
G+ GPV L G + +DLS KWTY++GL+G ++N E G +S+ PL
Sbjct: 557 GVLGPVTLTGLNEGK---RDLSLQKWTYQIGLHGETLSLHSLTGSSNVEWGEASQKQPL- 612
Query: 589 RRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYR 648
TWYKT F AP N+P+ L++ MGKG W+NG ++GRYWP Y A S SCDYR
Sbjct: 613 ---TWYKTFFNAPPGNEPLALDMNTMGKGQIWINGQSIGRYWPAYKASG---SCGSCDYR 666
Query: 649 GPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQ 708
G Y KC NCG SQ WYHVPRSW+ N LV+ EE+GG+P+ I+ V + C +
Sbjct: 667 GTYNEKKCLSNCGEASQRWYHVPRSWLIPTGNFLVVLEEWGGDPTGISMVKRSVASVCAE 726
Query: 709 AHE-NKTME-------------LTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDV 753
E TM+ L+C G+++S+IK+ASFG PQG CG+F +GSC A
Sbjct: 727 VEELQPTMDNWRTKAYGRPKVHLSCDPGQKMSKIKFASFGTPQGTCGSFSEGSCHAHKSY 786
Query: 754 LPL----IEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+ + CVG++ CS+ + G C GT+K+L VEA+C
Sbjct: 787 DAFEQEGLMQNCVGQEFCSVNVAPEVFGGDPC-PGTMKKLAVEAIC 831
>gi|297829920|ref|XP_002882842.1| hypothetical protein ARALYDRAFT_897617 [Arabidopsis lyrata subsp.
lyrata]
gi|297328682|gb|EFH59101.1| hypothetical protein ARALYDRAFT_897617 [Arabidopsis lyrata subsp.
lyrata]
Length = 847
Score = 717 bits (1852), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/853 (45%), Positives = 504/853 (59%), Gaps = 84/853 (9%)
Query: 8 SRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKA 67
+ A + L L S++ VS+D RAITI+G+R+IL+SGSIHYPRSTP MWPDLI+KA
Sbjct: 13 AMAAVSALFLLGFLVCSVSGSVSYDSRAITINGKRRILISGSIHYPRSTPEMWPDLIRKA 72
Query: 68 KEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNY 127
KEGGLD I+TYVFWN HEP +Y F GN DL+RF+K +Q GLY+ LRIGPYVCAEWN+
Sbjct: 73 KEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVRFVKLVQQSGLYLHLRIGPYVCAEWNF 132
Query: 128 GGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEY 187
GGFPVWL +PGI RT N F +MQ FTT IV+M K E+LF SQGGPIIL+QIENEY
Sbjct: 133 GGFPVWLKYIPGI-SFRTDNGPFKAQMQRFTTKIVNMMKAERLFESQGGPIILSQIENEY 191
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNP 236
G + + G G+SY NW AKMA L GVPW+MC++ DAP P+ F+PN
Sbjct: 192 GPMEYELGAPGRSYTNWAAKMAVGLGTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKA 251
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
PK+WTE WTGWF +GG P R AED+AF+VARF Q GG+F NYYMYHGGTNFGRT+G
Sbjct: 252 YKPKMWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAG 311
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN------ 350
GP++ TSYDYDAP+DEYG QPKWGHL++LH+ +K E L G T GN
Sbjct: 312 GPFIATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTRMPLGNYQEAHV 371
Query: 351 -----------------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTN-- 385
S + YNLP WS+SILPDCK +NTA+V QT+
Sbjct: 372 YKAKSGACSAFLANYNPKSYAKVSFGSNHYNLPPWSISILPDCKNTVYNTARVGAQTSRM 431
Query: 386 VKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTN 444
V+ P G L W+ E + ++ F + L++Q +T D SDYLWYMT+
Sbjct: 432 KMVRVPVHGG-----LSWQAYNEDPSTYIDES---FTMVGLVEQINTTRDTSDYLWYMTD 483
Query: 445 ADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGK 504
+ ++ L TL + S+G +H ++NG S + + F + V L G
Sbjct: 484 VKIDANEGFLRNGDLPTLTVLSAGHAMHVFINGQLSGSAYGSLDSPKLTFRKGVNLRAGF 543
Query: 505 NQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLD 564
N+I++LS VGL N G F+ G+ GPV L G +G +DLS KWTYKVGL G
Sbjct: 544 NKIAILSIAVGLPNVGPHFETWNAGVLGPVSLNGLSGGR---RDLSWQKWTYKVGLKGES 600
Query: 565 DKKFYNAKAANSERGWSSKN-VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNG 623
+ +++ E W+ V + +TWYKTTF AP + P+ +++ MGKG W+NG
Sbjct: 601 LSLHSLSGSSSVE--WAEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWING 658
Query: 624 YNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLV 683
+LGR+WP Y A S C Y G + DKC NCG SQ WYHVPRSW+K N LV
Sbjct: 659 QSLGRHWPAYKAVG---SCSECSYTGTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLV 715
Query: 684 LFEEFGGNPSQINFQTVVVGTACGQAHE----------------NKTMELTCH-----GR 722
+FEE+GG+P+ I+ V + C +E NK + H G+
Sbjct: 716 VFEEWGGDPNGISLVRREVDSVCADIYEWQSTLVNYQLHASGKVNKPLHPKVHLQCGPGQ 775
Query: 723 RISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCA 782
+I+ +K+ASFG P+G CG++++GSC + K CVG+ CS+ + G C
Sbjct: 776 KITTVKFASFGTPEGTCGSYRQGSCH-DHHSYDAFNKLCVGQNWCSVTVAPEMFGGDPC- 833
Query: 783 AGTVKRLVVEALC 795
+K+L VEA+C
Sbjct: 834 PNVMKKLAVEAVC 846
>gi|15231354|ref|NP_187988.1| beta galactosidase 1 [Arabidopsis thaliana]
gi|75274602|sp|Q9SCW1.1|BGAL1_ARATH RecName: Full=Beta-galactosidase 1; Short=Lactase 1; Flags:
Precursor
gi|6686874|emb|CAB64737.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|9294020|dbj|BAB01923.1| beta-galactosidase [Arabidopsis thaliana]
gi|332641886|gb|AEE75407.1| beta galactosidase 1 [Arabidopsis thaliana]
Length = 847
Score = 716 bits (1849), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/853 (45%), Positives = 502/853 (58%), Gaps = 84/853 (9%)
Query: 8 SRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKA 67
+ A + L L S++ VS+D RAITI+G+R+IL+SGSIHYPRSTP MWPDLI+KA
Sbjct: 13 AMAAVSALFLLGFLVCSVSGSVSYDSRAITINGKRRILISGSIHYPRSTPEMWPDLIRKA 72
Query: 68 KEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNY 127
KEGGLD I+TYVFWN HEP +Y F GN DL++F+K +Q GLY+ LRIGPYVCAEWN+
Sbjct: 73 KEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFVKLVQQSGLYLHLRIGPYVCAEWNF 132
Query: 128 GGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEY 187
GGFPVWL +PGI RT N F +MQ FTT IV+M K E+LF SQGGPIIL+QIENEY
Sbjct: 133 GGFPVWLKYIPGI-SFRTDNGPFKAQMQRFTTKIVNMMKAERLFESQGGPIILSQIENEY 191
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNP 236
G + + G G+SY NW AKMA L GVPW+MC++ DAP P+ F+PN
Sbjct: 192 GPMEYELGAPGRSYTNWAAKMAVGLGTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKA 251
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
PK+WTE WTGWF +GG P R AED+AF+VARF Q GG+F NYYMYHGGTNFGRT+G
Sbjct: 252 YKPKMWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAG 311
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN------ 350
GP++ TSYDYDAP+DEYG QPKWGHL++LH+ +K E L G T GN
Sbjct: 312 GPFIATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTRMPLGNYQEAHV 371
Query: 351 -----------------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTN-- 385
S + YNLP WS+SILPDCK +NTA+V QT+
Sbjct: 372 YKSKSGACSAFLANYNPKSYAKVSFGNNHYNLPPWSISILPDCKNTVYNTARVGAQTSRM 431
Query: 386 VKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTN 444
V+ P G L W+ E + ++ F + L++Q +T D SDYLWYMT+
Sbjct: 432 KMVRVPVHGG-----LSWQAYNEDPSTYIDES---FTMVGLVEQINTTRDTSDYLWYMTD 483
Query: 445 ADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGK 504
+ ++ L TL + S+G +H ++NG S + + F + V L G
Sbjct: 484 VKVDANEGFLRNGDLPTLTVLSAGHAMHVFINGQLSGSAYGSLDSPKLTFRKGVNLRAGF 543
Query: 505 NQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLD 564
N+I++LS VGL N G F+ G+ GPV L G G +DLS KWTYKVGL G
Sbjct: 544 NKIAILSIAVGLPNVGPHFETWNAGVLGPVSLNGLNGGR---RDLSWQKWTYKVGLKGES 600
Query: 565 DKKFYNAKAANSERGWSSKN-VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNG 623
+ +++ E W+ V + +TWYKTTF AP + P+ +++ MGKG W+NG
Sbjct: 601 LSLHSLSGSSSVE--WAEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWING 658
Query: 624 YNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLV 683
+LGR+WP Y A S C Y G + DKC NCG SQ WYHVPRSW+K N LV
Sbjct: 659 QSLGRHWPAYKAVG---SCSECSYTGTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLV 715
Query: 684 LFEEFGGNPSQINFQTVVVGTACGQAHE----------------NKTMELTCH-----GR 722
+FEE+GG+P+ I V + C +E NK + H G+
Sbjct: 716 VFEEWGGDPNGITLVRREVDSVCADIYEWQSTLVNYQLHASGKVNKPLHPKAHLQCGPGQ 775
Query: 723 RISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCA 782
+I+ +K+ASFG P+G CG++++GSC A K CVG+ CS+ + G C
Sbjct: 776 KITTVKFASFGTPEGTCGSYRQGSCHAH-HSYDAFNKLCVGQNWCSVTVAPEMFGGDPC- 833
Query: 783 AGTVKRLVVEALC 795
+K+L VEA+C
Sbjct: 834 PNVMKKLAVEAVC 846
>gi|302789848|ref|XP_002976692.1| hypothetical protein SELMODRAFT_268001 [Selaginella moellendorffii]
gi|300155730|gb|EFJ22361.1| hypothetical protein SELMODRAFT_268001 [Selaginella moellendorffii]
Length = 802
Score = 716 bits (1848), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/810 (47%), Positives = 507/810 (62%), Gaps = 72/810 (8%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D R++ ++G+R+ILLSGS+HYPR+TP MWP +I+KAKEGGLD IETYVFW+ HEP
Sbjct: 20 VSYDHRSLILNGKRRILLSGSVHYPRATPEMWPGIIQKAKEGGLDVIETYVFWDRHEPSP 79
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QY F G DL++F+K +Q GL V LRIGPYVCAEWN GGFP+WL ++P I RT N+
Sbjct: 80 GQYYFEGRYDLVKFVKLVQQAGLLVNLRIGPYVCAEWNLGGFPIWLRDIPHI-VFRTDNE 138
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ+F T IV+M K+E LFASQGGPIILAQ+ENEYGNV S YG+AG YINW A+M
Sbjct: 139 PFKKYMQSFLTKIVNMMKEENLFASQGGPIILAQVENEYGNVDSHYGEAGVRYINWAAEM 198
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A + + GVPWIMC +S P + + P P +WTE++TGWF +G
Sbjct: 199 AQAQNTGVPWIMCAQSKVPEYIIDTCNGMYCDGWNPTLYKKPTMWTESYTGWFTYYGWPL 258
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R ED+AFAVARFF+ GG+F NYYMY GGTNFGRTSGGPY+ +SYDYDAP+DEYG +
Sbjct: 259 PHRPVEDIAFAVARFFERGGSFHNYYMYFGGTNFGRTSGGPYVASSYDYDAPLDEYGMQH 318
Query: 318 QPKWGHLRELHKLLKSMEKTL-------------------TYGN-----VTNTDYGNSV- 352
PKWGHL++LH+ LK E+ + +YGN + N D N
Sbjct: 319 LPKWGHLKDLHETLKLGEEVILSSEGQHSELGPNQEAHVYSYGNGCVAFLANVDSMNDTV 378
Query: 353 ---SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEM 409
SY+LPAWSVSI+ DCKT FN+AKV +Q+ V P+++ L W E
Sbjct: 379 VEFRNVSYSLPAWSVSIVLDCKTVAFNSAKVKSQSAVVSMNPSKSS-----LSWTSFDEP 433
Query: 410 INDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQ 469
+ + G A L ++T D SDYLWY T +G+ + L I S
Sbjct: 434 VG---ISGSSFKAKQLLEQMETTKDTSDYLWYTTR--------YATGTGSTWLSIESMRD 482
Query: 470 VLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNG 529
V+H +VNG + S T + E P+KL G N I+LLSATVGLQN+G+ + G
Sbjct: 483 VVHIFVNGQFQSSWHTSKSVLYNSVEAPIKLAPGSNTIALLSATVGLQNFGAFIETWSAG 542
Query: 530 IPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNR 589
+ G ++L G G + ++LS +WTY+VGL G +D K + + + S WS+ V +
Sbjct: 543 LSGSLILKGLPGGD---QNLSKQEWTYQVGLKG-EDLKLFTVEGSRSVN-WSA--VSTKK 595
Query: 590 RMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRG 649
+TWY T F+AP +DPV L+L MGKG AWVNG ++GRYWP Y A + C ESCDYRG
Sbjct: 596 PLTWYMTEFDAPPGDDPVALDLASMGKGQAWVNGQSIGRYWPAYKAADSVCP-ESCDYRG 654
Query: 650 PYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQA 709
Y +KC CG SQ WYHVPRSW+K N LVLFEE GG+PS I+F T C +
Sbjct: 655 SYDQNKCLTGCGQSSQRWYHVPRSWMKPRGNLLVLFEETGGDPSSIDFVTRSTNVICARV 714
Query: 710 HENK--TMELTCHGRR--ISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKK 765
+E+ +++L C G + IS+I++AS G+P+G+CG+FK+GSC D+ +EK CVG++
Sbjct: 715 YESHPASVKLWCPGEKQVISQIRFASLGNPEGSCGSFKEGSCHTN-DLSNTVEKACVGQR 773
Query: 766 SCSIEASEANLGATSCAAGTVKRLVVEALC 795
SCS+ + ++C K L VEALC
Sbjct: 774 SCSL---APDFTTSACPGVREKFLAVEALC 800
>gi|20260596|gb|AAM13196.1| galactosidase, putative [Arabidopsis thaliana]
Length = 847
Score = 716 bits (1848), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/853 (45%), Positives = 502/853 (58%), Gaps = 84/853 (9%)
Query: 8 SRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKA 67
+ A + L L S++ VS+D RAITI+G+R+IL+SGSIHYPRSTP MWPDLI+KA
Sbjct: 13 AMAAVSALFLLGFLVCSVSGSVSYDSRAITINGKRRILISGSIHYPRSTPEMWPDLIRKA 72
Query: 68 KEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNY 127
KEGGLD I+TYVFWN HEP +Y F GN DL++F+K +Q GLY+ LRIGPYVCAEWN+
Sbjct: 73 KEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFVKLVQQSGLYLHLRIGPYVCAEWNF 132
Query: 128 GGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEY 187
GGFPVWL +PGI RT N F +MQ FTT IV+M K E+LF SQGGPIIL+QIENEY
Sbjct: 133 GGFPVWLKYIPGI-SFRTDNGPFKAQMQRFTTKIVNMMKAERLFESQGGPIILSQIENEY 191
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNP 236
G + + G G+SY NW AKMA L GVPW+MC++ DAP P+ F+PN
Sbjct: 192 GPMEYELGAPGRSYTNWAAKMAVGLGTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKA 251
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
PK+WTE WTGWF +GG P R AED+AF+VARF Q GG+F NYYMYHGGTNFGRT+G
Sbjct: 252 YKPKMWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAG 311
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN------ 350
GP++ TSYDYDAP+DEYG QPKWGHL++LH+ +K E L G T GN
Sbjct: 312 GPFIATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTRMPLGNYQEAHV 371
Query: 351 -----------------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTN-- 385
S + YNLP WS+SILPDCK +NTA+V QT+
Sbjct: 372 YKSKSGACSAFLANYNPKSYAKVSFGNNHYNLPPWSISILPDCKNTVYNTARVGAQTSRM 431
Query: 386 VKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTN 444
V+ P G L W+ E + ++ F + L++Q +T D SDYLWYMT+
Sbjct: 432 KMVRVPVHGG-----LSWQAYNEDPSTYIDES---FTMVGLVEQINTTRDTSDYLWYMTD 483
Query: 445 ADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGK 504
+ ++ L TL + S+G +H ++NG S + + F + V L G
Sbjct: 484 VKVDANEGFLRNGDLPTLTVLSAGHAMHLFINGQLSGSAYGSLDSPKLTFRKGVNLRAGF 543
Query: 505 NQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLD 564
N+I++LS VGL N G F+ G+ GPV L G G +DLS KWTYKVGL G
Sbjct: 544 NKIAILSIAVGLPNVGPHFETWNAGVLGPVSLNGLNGGR---RDLSWQKWTYKVGLKGES 600
Query: 565 DKKFYNAKAANSERGWSSKN-VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNG 623
+ +++ E W+ V + +TWYKTTF AP + P+ +++ MGKG W+NG
Sbjct: 601 LSLHSLSGSSSVE--WAEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWING 658
Query: 624 YNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLV 683
+LGR+WP Y A S C Y G + DKC NCG SQ WYHVPRSW+K N LV
Sbjct: 659 QSLGRHWPAYKAVG---SCSECSYTGTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLV 715
Query: 684 LFEEFGGNPSQINFQTVVVGTACGQAHE----------------NKTMELTCH-----GR 722
+FEE+GG+P+ I V + C +E NK + H G+
Sbjct: 716 VFEEWGGDPNGITLVRREVDSVCADIYEWQSTLVNYQLHASGKVNKPLHPKAHLQCGPGQ 775
Query: 723 RISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCA 782
+I+ +K+ASFG P+G CG++++GSC A K CVG+ CS+ + G C
Sbjct: 776 KITTVKFASFGTPEGTCGSYRQGSCHAH-HSYDAFNKLCVGQNWCSVTVAPEMFGGDPC- 833
Query: 783 AGTVKRLVVEALC 795
+K+L VEA+C
Sbjct: 834 PNVMKKLAVEAVC 846
>gi|125597922|gb|EAZ37702.1| hypothetical protein OsJ_22044 [Oryza sativa Japonica Group]
Length = 811
Score = 714 bits (1842), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/821 (47%), Positives = 508/821 (61%), Gaps = 94/821 (11%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V+++ R++ IDGER+I++SGSIHYPRSTP MWPDLIKKAKEGGLDAIETYVFWN HEP R
Sbjct: 31 VTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
RQY+F GN D++RF K IQ+ GLY ILRIGPY+C EWNYGG P WL ++PG+ + R N
Sbjct: 91 RQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGM-QFRLHNA 149
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGD--AGKSYINWCA 206
F NEM+ FTTLIV+ K +FA QGGPIILAQIENEYGN+M + + YI+WCA
Sbjct: 150 PFENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEYIHWCA 209
Query: 207 KMATSLDIGVPWIMCQE-SDAPSPMFT-----------PNNPNSPKIWTENWTGWFKSWG 254
MA ++GVPWIMCQ+ SD P + PN PKIWTENWTGWFK+W
Sbjct: 210 DMANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWD 269
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
D R+AED+AFAVA FFQ GGPY+TTSYDYDAP+DEYG
Sbjct: 270 KPDFHRSAEDIAFAVAMFFQ-------------------KRGGPYITTSYDYDAPLDEYG 310
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS--------------------- 353
+L QPK+GHL++LH ++KS+EK L +G +T+Y + V+
Sbjct: 311 NLRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDKVTVTKYTLDSTSACFINNRNDNM 370
Query: 354 -------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
G+++ LPAWSVSILPDCKT FN+AK+ QT V V + + L+W W
Sbjct: 371 DVNVTLDGTTHLLPAWSVSILPDCKTVAFNSAKIKAQTTVMVNKAKMVEKEPESLKWSWM 430
Query: 407 PEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
E + F+ KG + N L++Q T+ D SDYLWY T+ + K G ++ TL +N
Sbjct: 431 RENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSINHK-------GEASYTLFVN 483
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
++G L+A+VNG V + G E P KL GKN ISLLSAT+GL+NYG F+
Sbjct: 484 TTGHELYAFVNGMLVGQNHSPNGHFVFQLESPAKLHDGKNYISLLSATIGLKNYGPLFEK 543
Query: 526 VPNGIP-GPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
+P GI GPV L+ G DLS+ W+YK GL G + ++ + K + W + N
Sbjct: 544 MPAGIVGGPVKLIDNNGKGI---DLSNSSWSYKAGLAG-EYRQIHLDKPGCT---WDNNN 596
Query: 585 --VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCST 642
VP+N+ TWYKTTF+AP D VV++L G+ KG AWVNG NLGRYWP+Y A
Sbjct: 597 GTVPINKPFTWYKTTFQAPAGEDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAARSMRRL 656
Query: 643 -ESCDYRGPYGSD----KCAYNCGNPSQIWYHVPRSWIKDGV-NTLVLFEEFGGNPSQIN 696
+ YRG + ++ KC CG PSQ +YHVPRS++K+G NT++LFEE GG+PS ++
Sbjct: 657 PTTAHYRGVFQAEGDGQKCLTGCGEPSQRFYHVPRSFLKNGEPNTVILFEEAGGDPSHVS 716
Query: 697 FQTVVVGTACGQAHENKTMELTC--HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVL 754
F+TV G+ C A T+ L+C H + IS I SFG +G CGA+ KG CE++
Sbjct: 717 FRTVAAGSVCASAEVGDTITLSCGQHSKTISAINVTSFGVARGQCGAY-KGGCESKAAYK 775
Query: 755 PLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
E C+GK+SC+++ + A G + C + L V+A C
Sbjct: 776 AFTEA-CLGKESCTVQITNAVTG-SGCLSNV---LTVQASC 811
>gi|255546097|ref|XP_002514108.1| beta-galactosidase, putative [Ricinus communis]
gi|223546564|gb|EEF48062.1| beta-galactosidase, putative [Ricinus communis]
Length = 840
Score = 713 bits (1840), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/837 (47%), Positives = 488/837 (58%), Gaps = 87/837 (10%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
S+ VS+D RAITI+G+R+IL+SGSIHYPRSTP MWPDLI+KAK+GGLD I+TYVFWN
Sbjct: 25 SILATVSYDHRAITINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNG 84
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
HEP Y F DL++FIK +Q GLYV LRIGPY+CAEWN+GGFPVWL +PGIE
Sbjct: 85 HEPSPGNYYFEDRYDLVKFIKVVQAAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIE-F 143
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYIN 203
RT N F MQ FT IV M K EKLF SQGGPIIL+QIENE+G V + G GK+Y
Sbjct: 144 RTDNGPFKAAMQKFTEKIVSMMKSEKLFESQGGPIILSQIENEFGPVEWEIGAPGKAYTK 203
Query: 204 WCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKS 252
W A MA L GVPW+MC++ DAP P+ F PN PK+WTENWTGW+
Sbjct: 204 WAADMAVKLGTGVPWVMCKQDDAPDPVINTCNGFYCENFKPNKDYKPKLWTENWTGWYTE 263
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDE 312
+GG P R AEDLAF+VARF Q GG+F NYYMYHGGTNFGRTS G ++ TSYDYDAP+DE
Sbjct: 264 FGGAVPYRPAEDLAFSVARFIQNGGSFMNYYMYHGGTNFGRTSAGLFIATSYDYDAPLDE 323
Query: 313 YGHLNQPKWGHLRELHKLLKSMEKTLTYGNVT--------------------------NT 346
YG PKWGHLR+LHK +K E L + T +T
Sbjct: 324 YGLTRDPKWGHLRDLHKAIKLCEPALVSVDPTVKSLGSNQEAHVFQSKSSCAAFLANYDT 383
Query: 347 DYGNSVS--GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWK 404
Y V+ Y+LP WS+SILPDCKT FNTA++ Q++ P L W+
Sbjct: 384 KYSVKVTFGNGQYDLPPWSISILPDCKTAVFNTARLGAQSSQMKMTPVGGA-----LSWQ 438
Query: 405 -WRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTL 462
+ E + L L +Q T D SDYLWYMTN ++ D+ L + L
Sbjct: 439 SYIEEAATGYT---DDTTTLEGLWEQINVTRDASDYLWYMTNVNIDSDEGFLKNGDSPVL 495
Query: 463 RINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSK 522
I S+G LH ++NG + + F + VKLT G N+ISLLS VGL N G
Sbjct: 496 TIFSAGHSLHVFINGQLAGTVYGSLENPKLTFSQNVKLTAGINKISLLSVAVGLPNVGVH 555
Query: 523 FDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANS----ER 578
F+ GI GPV L G +DLS KW+YK+GL G + + ++S E
Sbjct: 556 FEKWNAGILGPVTLKGL---NEGTRDLSGWKWSYKIGLKG-EALSLHTVTGSSSVEWVEG 611
Query: 579 GWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEED 638
S+K PL TWYK TF+AP NDPV L++ MGKG WVNG ++GR+WP Y A
Sbjct: 612 SLSAKKQPL----TWYKATFDAPEGNDPVALDMSSMGKGQIWVNGQSIGRHWPAYTARG- 666
Query: 639 GCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQ 698
S +C+Y G Y KC NCG PSQ WYHVPRSW+ N LV+FEE+GG PS I+
Sbjct: 667 --SCSACNYAGTYDDKKCRSNCGEPSQRWYHVPRSWLNPSGNLLVVFEEWGGEPSGISLV 724
Query: 699 TVVVGTACGQAHENK-------------------TMELTC-HGRRISEIKYASFGDPQGA 738
G+ C E + L C HG++IS+IK+AS+G PQG
Sbjct: 725 KRTTGSVCADIFEGQPALKNWQMIALGRLDHLQPKAHLWCPHGQKISKIKFASYGSPQGT 784
Query: 739 CGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
CG+FK GSC A EK+C+GK+SCS+ + G C + K+L VEA+C
Sbjct: 785 CGSFKAGSCHAH-KSYDAFEKKCIGKQSCSVTVAAEVFGGDPCPDSS-KKLSVEAVC 839
>gi|302799737|ref|XP_002981627.1| hypothetical protein SELMODRAFT_421090 [Selaginella moellendorffii]
gi|300150793|gb|EFJ17442.1| hypothetical protein SELMODRAFT_421090 [Selaginella moellendorffii]
Length = 874
Score = 711 bits (1836), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/870 (44%), Positives = 506/870 (58%), Gaps = 128/870 (14%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A +S+D RAI I G+R+IL+SG IHYPR++P MWP LI+ AKEGGLD I+TYVFW+ HE
Sbjct: 20 ATNISYDHRAIIIGGQRRILISGCIHYPRASPQMWPALIRNAKEGGLDMIDTYVFWDGHE 79
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P Y+F G DLIRF+K + GLYV LRIGPYVCAEWN+GGFP WL +PGI+ RT
Sbjct: 80 PSPGIYNFQGRYDLIRFLKLVHQAGLYVNLRIGPYVCAEWNFGGFPAWLLKLPGIQ-FRT 138
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
N+ F ++M+ F IVDM K E+LFASQGGP++ +QIENEYGNV YG GK+Y+ W
Sbjct: 139 HNRAFEDKMEEFVRKIVDMVKSEQLFASQGGPVLFSQIENEYGNVQGSYGINGKTYMLWA 198
Query: 206 AKMATSLDIGVPWIMCQESDAPSPMFT-----------PNNPNSPKIWTENWTGWFKSWG 254
A+MA L+ GVPWIMC++ DAP + PN+ + P +WTENW+GW++SWG
Sbjct: 199 ARMAKDLETGVPWIMCKQPDAPDYIINTCNGYYCDGWKPNSRDKPAMWTENWSGWYQSWG 258
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYM------------------YHGGTNFGRTSG 296
P RT ED+AFAVARFFQ GG QNYYM Y GGTNFGRTSG
Sbjct: 259 EAAPYRTVEDVAFAVARFFQRGGVAQNYYMVRTLHDLEQRLLMPERCQYFGGTNFGRTSG 318
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLT----------------- 339
GP++TTSYDYDAP+DE+G L QPKWGHL+ELH LK E LT
Sbjct: 319 GPFITTSYDYDAPLDEFGMLRQPKWGHLKELHAALKLCETALTSNDPVYYTLGRMQEMVQ 378
Query: 340 -----------------------YGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFN 376
N+ + G YNLP WSVSILPDC+ FN
Sbjct: 379 AHVYSDGSLEANFSNLATPCAAFLANIDTSSASVKFGGKVYNLPPWSVSILPDCRNVVFN 438
Query: 377 TAKVNTQTNV----KVKRPN---QAGNDQAP-----LQWKWRPEMINDFVVRGKGHFALN 424
TA+V+ QT+V V++P+ + P L W+W E + G +
Sbjct: 439 TAQVSAQTSVTKMVAVQKPSLIEEVSGSYTPGLVEQLAWEWFQEPVGG---SGINKILAH 495
Query: 425 TLIDQKST-NDVSDYLWYMT-----NADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGN 478
L++Q ST ND +DY+WY T + +LK DP+L I S ++H +VNG
Sbjct: 496 ALLEQISTTNDSTDYMWYSTRFEILDQELKGGDPVLV--------ITSMRDMVHIFVNGE 547
Query: 479 YVDSQWT-KYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLV 537
+ S T K G ++P+ L G N +++LSATVGLQNYG+ + GI G + +
Sbjct: 548 FAGSTSTLKSGGLYARVQQPIHLKAGVNHLAILSATVGLQNYGAHLETHGAGITGSIWIQ 607
Query: 538 GRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTT 597
G + T ++L+S W ++VGL G D ++ S+ ++P + + WYK
Sbjct: 608 GLS---TGTRNLTSALWLHQVGLNGEHDAITWS----------STTSLPFFQPLVWYKAN 654
Query: 598 FEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCA 657
F P +DPV ++L MGKG AWVNG++LGR+WP A GCS + CDYRG Y S KC
Sbjct: 655 FNIPDGDDPVAIHLGSMGKGQAWVNGHSLGRFWPVITAPSTGCS-DRCDYRGTYYSSKCL 713
Query: 658 YNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHENK---- 713
+CG PSQ WYHVPR W+ + NTLVL EE GGN S ++F + VV C Q E
Sbjct: 714 SSCGLPSQEWYHVPREWLVNEKNTLVLLEEIGGNVSGVSFASRVVDRVCAQVSEYSLPPV 773
Query: 714 -------TMELTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKK 765
+ L+C G+ IS I +ASFG+P+G CGAF+KGSC A ++ ++EK C+G++
Sbjct: 774 AQFSSLPELGLSCSPGQFISSIFFASFGNPKGRCGAFQKGSCHA-LESETIVEKACIGRQ 832
Query: 766 SCSIEASEANLGATSCAAGTVKRLVVEALC 795
SCS E N G C G K L VEA C
Sbjct: 833 SCSFEIFWKNFGTDPC-PGKAKTLAVEAAC 861
>gi|449460229|ref|XP_004147848.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
gi|449476862|ref|XP_004154857.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 844
Score = 711 bits (1836), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/852 (45%), Positives = 495/852 (58%), Gaps = 75/852 (8%)
Query: 1 MATLKHCSRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMW 60
M T S + L F + V++D +AI I+G+R+IL+SGSIHYPRSTP MW
Sbjct: 1 METFSVSSFLFFVFLAALLGFRSTQCTTVTYDKKAILINGQRRILISGSIHYPRSTPEMW 60
Query: 61 PDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPY 120
DL++KAK+GGLD ++TYVFWN HEP YDF G DL+RFIKT Q GLYV LRIGPY
Sbjct: 61 DDLMQKAKDGGLDVVDTYVFWNVHEPSPGNYDFEGRYDLVRFIKTAQRVGLYVHLRIGPY 120
Query: 121 VCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIIL 180
VCAEWN+GGFPVWL +PGI RT N F MQ FT IV M K EKLFASQGGPIIL
Sbjct: 121 VCAEWNFGGFPVWLKYVPGI-SFRTDNGPFKMAMQGFTQKIVQMMKSEKLFASQGGPIIL 179
Query: 181 AQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM---------- 230
+QIENEYG G AG +Y+NW AKMA L+ GVPW+MC+E DAP P+
Sbjct: 180 SQIENEYGPQSKALGAAGHAYMNWAAKMAVGLNTGVPWVMCKEDDAPDPVINSCNGFYCD 239
Query: 231 -FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGT 289
F+PN P P +WTE W+GWF +GG R +DLAFAVARF Q GG+ NYYMYHGGT
Sbjct: 240 YFSPNKPYKPTLWTEAWSGWFTEFGGPVYGRPVQDLAFAVARFVQKGGSLFNYYMYHGGT 299
Query: 290 NFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG 349
NFGRT+GGP++TTSYDYDAP+DEYG L QPK+GHL+ LH+ +K E L + T T G
Sbjct: 300 NFGRTAGGPFITTSYDYDAPLDEYGMLRQPKYGHLKNLHRAIKLCEHALVSSDPTVTSLG 359
Query: 350 ------------------------NSVSGSSYN-----LPAWSVSILPDCKTEEFNTAKV 380
NS + +N LPAWS+SILPDCK FNTA+V
Sbjct: 360 AYEQAHVFSSGPGRCAAFLANYHTNSAATVVFNNMRYALPAWSISILPDCKRVVFNTAQV 419
Query: 381 NTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYL 439
P + L W+ E + + + G + L++Q T D SDYL
Sbjct: 420 GVHIAQTQMLPT-----ISKLSWETYNE--DTYSLGGSSRMTVAGLLEQINVTRDTSDYL 472
Query: 440 WYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVK 499
WYMT+ + + L G TL + S+G +H ++NG + S + + P+
Sbjct: 473 WYMTSVGISSSEAFLRGGQKPTLSVRSAGHAVHVFINGQFSGSAYGSREHPAFTYTGPIN 532
Query: 500 LTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVG 559
L G N+I+LLS VGL N G F+ GI GP+ + G G + KDL+ KW+Y+VG
Sbjct: 533 LRAGMNKIALLSIAVGLPNVGLHFEKWQTGILGPISISGLNGGK---KDLTWQKWSYQVG 589
Query: 560 LYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFA 619
L G + + A S + R +TWYK +F AP N+P+ L+L+ MGKG A
Sbjct: 590 LKG-EAMNLVSPTEATSVDWIKGSLLQGQRPLTWYKASFNAPRGNEPLALDLRSMGKGQA 648
Query: 620 WVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGV 679
W+NG ++GRYW Y + GCS C Y G Y C CG P+Q WYHVPRSW+K
Sbjct: 649 WINGQSIGRYWMAY--AKGGCS--RCTYAGTYRPPTCENGCGQPTQRWYHVPRSWLKPTN 704
Query: 680 NTLVLFEEFGGNPSQINFQTVVVGTACGQA---------------HENKTMELTCH-GRR 723
N LVLFEE GG+ S+I+ V CG+A E ++ L C+ G+
Sbjct: 705 NVLVLFEELGGDASKISLMRRSVTGLCGEAVEYHAKNDSYIIESNEELDSLHLQCNPGQV 764
Query: 724 ISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAA 783
IS IK+ASFG P G CG+++KG+C A D +IEK+C+G KSCS+ + N G C
Sbjct: 765 ISAIKFASFGTPSGTCGSYQKGTCHAP-DSHAIIEKKCIGLKSCSVSTTRDNFGVDPC-P 822
Query: 784 GTVKRLVVEALC 795
+K+L+VE C
Sbjct: 823 NELKQLLVEVDC 834
>gi|356556730|ref|XP_003546676.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 840
Score = 711 bits (1836), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/858 (45%), Positives = 510/858 (59%), Gaps = 92/858 (10%)
Query: 11 ILLCLILQTLFNLSLAY----------RVSHDGRAITIDGERKILLSGSIHYPRSTPGMW 60
+++CL L ++N++L VS+D +AITI+G+R+IL+SGSIHYPRSTP MW
Sbjct: 1 MVICLKLIIMWNVALLLVFSLIGSAKASVSYDSKAITINGQRRILISGSIHYPRSTPEMW 60
Query: 61 PDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPY 120
PDLI+KAK+GGLD I+TYVFWN HEP +Y F GN DL++FIK +Q GLYV LRIGPY
Sbjct: 61 PDLIQKAKDGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFIKLVQQAGLYVHLRIGPY 120
Query: 121 VCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIIL 180
VCAEWN+GGFPVWL +PGI RT N+ F ++MQ FTT IVD+ K E+L+ SQGGPII+
Sbjct: 121 VCAEWNFGGFPVWLKYIPGI-SFRTDNEPFKHQMQKFTTKIVDLMKAERLYESQGGPIIM 179
Query: 181 AQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM---------- 230
+QIENEYG + + G AGK+Y W A+MA L GVPW+MC++ D P P+
Sbjct: 180 SQIENEYGPMEYEIGAAGKAYTKWAAEMAMGLGTGVPWVMCKQDDTPDPLINTCNGFYCD 239
Query: 231 -FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGT 289
F+PN PK+WTE WTGWF +GG P R AEDLAF+VARF Q GG+F NYYMYHGGT
Sbjct: 240 YFSPNKAYKPKMWTEAWTGWFTEFGGPVPHRPAEDLAFSVARFIQKGGSFINYYMYHGGT 299
Query: 290 NFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG 349
NFGRT+GGP++ TSYDYDAP+DEYG L QPKWGHL++LH+ +K E L G+ T T G
Sbjct: 300 NFGRTAGGPFIATSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGDPTVTKIG 359
Query: 350 N--------SVSGSS---------------------YNLPAWSVSILPDCKTEEFNTAKV 380
N S SG+ YNLP WS+SILPDCK +NTA+V
Sbjct: 360 NYQEAHVFKSKSGACAAFLANYNPKSYATVAFGNMHYNLPPWSISILPDCKNTVYNTARV 419
Query: 381 NTQT-NVKVKR-PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSD 437
+Q+ +K+ R P G W + F + L++Q +T D+SD
Sbjct: 420 GSQSAQMKMTRVPIHGG-----FSWL---SFNEETTTTDDSSFTMTGLLEQLNTTRDLSD 471
Query: 438 YLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERP 497
YLWY T+ L ++ L + L + S+G LH ++NG + + F
Sbjct: 472 YLWYSTDVVLDPNEGFLRNGKDPVLTVFSAGHALHVFINGQLSGTAYGSLEFPKLTFNEG 531
Query: 498 VKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYK 557
VKL G N+ISLLS VGL N G F+ G+ GP+ L G +DLS KW+YK
Sbjct: 532 VKLRAGVNKISLLSVAVGLPNVGPHFETWNAGVLGPISLSGLNEGR---RDLSWQKWSYK 588
Query: 558 VGLYGLDDKKFYNAKAANSERGWSSKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGK 616
VGL G + +++ E W ++ R+ +TWYKTTF+AP P+ L++ MGK
Sbjct: 589 VGLKGEILSLHSLSGSSSVE--WIQGSLVSQRQPLTWYKTTFDAPAGTAPLALDMDSMGK 646
Query: 617 GFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIK 676
G W+NG NLGRYWP Y A + + CDY G Y +KC NCG SQ WYHVP+SW+K
Sbjct: 647 GQVWLNGQNLGRYWPAYKASG---TCDYCDYAGTYNENKCRSNCGEASQRWYHVPQSWLK 703
Query: 677 DGVNTLVLFEEFGGNPSQINFQTVVVGTAC------------------GQAHENKTMELT 718
N LV+FEE GG+P+ I + + C G+A + L+
Sbjct: 704 PTGNLLVVFEELGGDPNGIFLVRRDIDSVCADIYEWQPNLISYQMQTSGKAPVRPKVHLS 763
Query: 719 CH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLG 777
C G++IS IK+ASFG P G+CG F +GSC A E+ CVG+ C++ S N G
Sbjct: 764 CSPGQKISSIKFASFGTPAGSCGNFHEGSCHAH-KSYDAFERNCVGQNWCTVTVSPENFG 822
Query: 778 ATSCAAGTVKRLVVEALC 795
C +K+L VEA+C
Sbjct: 823 GDPC-PNVLKKLSVEAIC 839
>gi|312283357|dbj|BAJ34544.1| unnamed protein product [Thellungiella halophila]
Length = 856
Score = 711 bits (1834), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/860 (45%), Positives = 495/860 (57%), Gaps = 83/860 (9%)
Query: 1 MATLKHCSRAILLCLILQTLFNLSLAY-RVSHDGRAITIDGERKILLSGSIHYPRSTPGM 59
M T SR IL C + + + V++D +A+ I+G+R+IL SGSIHYPRSTP M
Sbjct: 4 MGTGDSASRLILWCCLGLLILGVGFVQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDM 63
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W LI+KAK+GG+D IETYVFWN HEP +YDF G DL+RF+K I GLY LRIGP
Sbjct: 64 WEGLIQKAKDGGIDVIETYVFWNLHEPSPGKYDFEGRNDLVRFVKAIHKAGLYAHLRIGP 123
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
YVCAEWN+GGFPVWL +PGI RT N+ F M+ FT IV++ K E LF SQGGPII
Sbjct: 124 YVCAEWNFGGFPVWLKYVPGIS-FRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPII 182
Query: 180 LAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM--------- 230
L+QIENEYG G G +Y+ W AKMA + + GVPW+MC+E DAP P+
Sbjct: 183 LSQIENEYGRQGQILGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVISTCNGFYC 242
Query: 231 --FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGG 288
F PN P P IWTE W+GWF +GG R +DLAFAVARF Q GG+F NYYMYHGG
Sbjct: 243 DSFAPNKPYKPTIWTEAWSGWFTEFGGPMHHRPVQDLAFAVARFIQKGGSFVNYYMYHGG 302
Query: 289 TNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDY 348
TNFGRT+GGP++TTSYDYDAPIDEYG + QPK+GHL+ELH+ +K EK L + T
Sbjct: 303 TNFGRTAGGPFVTTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCEKALVSTDPVVTSL 362
Query: 349 GN--------SVSGSS---------------------YNLPAWSVSILPDCKTEEFNTAK 379
GN S SG YNLP WS+SILPDC+ FNTAK
Sbjct: 363 GNKQQAHVYSSESGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNTAK 422
Query: 380 VNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDY 438
V QT+ P G+ Q W+ + + + F L++Q T D SDY
Sbjct: 423 VGVQTSQMEMLPTSTGSFQ------WQSYLEDLSSLDDSSTFTTQGLLEQINVTRDTSDY 476
Query: 439 LWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPV 498
LWYMT+ D+ + + L G TL I S+G +H +VNG S + ++ +
Sbjct: 477 LWYMTSVDIGETESFLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYKGKI 536
Query: 499 KLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKV 558
L G N+I+LLS VGL N G F+ GI GPV L G + + +DLS KWTY+V
Sbjct: 537 NLHSGTNRIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGK---RDLSWQKWTYQV 593
Query: 559 GLYGLDDKKFYNAKAANSERGW--SSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGK 616
GL G Y + GW +S V + +TW+KT F+AP N+P+ L+++GMGK
Sbjct: 594 GLKGEAMNLAYPTNTPSF--GWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGK 651
Query: 617 GFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIK 676
G WVNG ++GRYW + + G C Y G Y +KC CG P+Q WYHVPRSW+K
Sbjct: 652 GQIWVNGESIGRYWTAFATGDCG----HCSYTGTYKPNKCNSGCGQPTQKWYHVPRSWLK 707
Query: 677 DGVNTLVLFEEFGGNPSQINFQTVVVGTAC--------------------GQAHENKTME 716
N LV+FEE GGNPS ++ V C GQ +
Sbjct: 708 PSQNLLVIFEELGGNPSTVSLVKRSVSGVCAEVSEYHPNIKNWQIESYGKGQTFRRPKVH 767
Query: 717 LTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEAN 775
L C G+ IS IK+ASFG P G CG++++G C A ++E++CVGK C++ S +N
Sbjct: 768 LKCSPGQAISAIKFASFGTPLGTCGSYQQGDCHAATS-YAILERKCVGKARCAVTISNSN 826
Query: 776 LGATSCAAGTVKRLVVEALC 795
G C +KRL VEA+C
Sbjct: 827 FGKDPC-PNVLKRLTVEAVC 845
>gi|302782774|ref|XP_002973160.1| hypothetical protein SELMODRAFT_413650 [Selaginella moellendorffii]
gi|300158913|gb|EFJ25534.1| hypothetical protein SELMODRAFT_413650 [Selaginella moellendorffii]
Length = 805
Score = 710 bits (1833), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/812 (47%), Positives = 508/812 (62%), Gaps = 73/812 (8%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D R++ ++G+R+ILLSGS+HYPR+TP MWP +I+KAKEGGLD IETYVFW+ HEP
Sbjct: 20 VSYDHRSLILNGKRRILLSGSVHYPRATPEMWPGIIQKAKEGGLDVIETYVFWDRHEPSP 79
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QY F G DL++F+K +Q GL + LRIGPYVCAEWN GGFP+WL ++P I RT N+
Sbjct: 80 GQYYFEGRYDLVKFVKLVQQAGLLMNLRIGPYVCAEWNLGGFPIWLRDIPHI-VFRTDNE 138
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ+F T IV+M K+E LFASQGGPIILAQ+ENEYGNV S YG+AG YINW A+M
Sbjct: 139 PFKKYMQSFLTKIVNMMKEENLFASQGGPIILAQVENEYGNVDSHYGEAGVRYINWAAEM 198
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A + + GVPWIMC +S P + + P P +WTE++TGWF +G
Sbjct: 199 AQAQNTGVPWIMCAQSKVPEYIIDTCNGMYCDGWNPILYKKPTMWTESYTGWFTYYGWPI 258
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYM--YHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
P R ED+AFAVARFF+ GG+F NYYM Y GGTNFGRTSGGPY+ +SYDYDAP+DEYG
Sbjct: 259 PHRPVEDIAFAVARFFERGGSFHNYYMVWYFGGTNFGRTSGGPYVASSYDYDAPLDEYGM 318
Query: 316 LNQPKWGHLRELHKLLKSMEKTL-------------------TYGN-----VTNTDYGNS 351
+ PKWGHL++LH+ LK E+ + +YGN + N D N
Sbjct: 319 QHLPKWGHLKDLHETLKLGEEVILSSEGQHSELGPNQEAHVYSYGNGCVAFLANVDSMND 378
Query: 352 V----SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRP 407
SY+LPAWSVSIL DCKT FN+AKV +Q+ V P+ ++ L W
Sbjct: 379 TVVEFRNVSYSLPAWSVSILLDCKTVAFNSAKVKSQSAVVSMSPS-----KSTLSWTSFD 433
Query: 408 EMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
E + + G A L ++T D SDYLWY T+ + +G+ + L I S
Sbjct: 434 EPVG---ISGSSFKAKQLLEQMETTKDTSDYLWYTTSVE-------ATGTGSTWLSIESM 483
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
V+H +VNG + S T + E P+ L G N I+LLSATVGLQN+G+ +
Sbjct: 484 RDVVHIFVNGQFQSSWHTSKSVLYNSVEAPITLAPGSNTIALLSATVGLQNFGAFIETWS 543
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
G+ G ++L G G + ++LS +WTY+VGL G +D K + + + S WS+ V
Sbjct: 544 AGLSGSLILKGLPGGD---QNLSKQEWTYQVGLKG-EDLKLFTVEGSRSVN-WSA--VST 596
Query: 588 NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDY 647
+ +TWY T F+AP +DPV L+L MGKG AWVNG ++GRYWP Y A + C ESCDY
Sbjct: 597 EKPLTWYMTEFDAPPGDDPVALDLASMGKGQAWVNGQSIGRYWPAYKAADSVCP-ESCDY 655
Query: 648 RGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACG 707
RG Y +KC CG SQ WYHVPRSW+K N LVLFEE GG+PS I+F T C
Sbjct: 656 RGSYDQNKCLTGCGQSSQRWYHVPRSWMKPRGNLLVLFEETGGDPSSIDFVTRSTNVICA 715
Query: 708 QAHENK--TMELTCHGRR--ISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVG 763
+ +E+ +++L C G + IS+I++AS G+P+G+CG+FK+GSC D+ +EK CVG
Sbjct: 716 RVYESHPASVKLWCPGEKQVISQIRFASLGNPEGSCGSFKEGSCHTN-DLSNTVEKACVG 774
Query: 764 KKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
++SCS+ + ++C K L VEALC
Sbjct: 775 QRSCSL---APDFTISACPGVREKFLAVEALC 803
>gi|302759477|ref|XP_002963161.1| hypothetical protein SELMODRAFT_404798 [Selaginella moellendorffii]
gi|300168429|gb|EFJ35032.1| hypothetical protein SELMODRAFT_404798 [Selaginella moellendorffii]
Length = 874
Score = 710 bits (1832), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/865 (44%), Positives = 505/865 (58%), Gaps = 118/865 (13%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A +S+D RAI I G+R+IL+SG +HYPR++P MWP LI+ AKEGGLD I+TYVFW+ HE
Sbjct: 20 ATNISYDHRAIIIGGQRRILISGCLHYPRASPQMWPALIRNAKEGGLDMIDTYVFWDGHE 79
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P Y+F G DLIRF+K + GLYV LRIGPYVCAEWN+GGFP WL +PGI+ RT
Sbjct: 80 PSPGIYNFQGRYDLIRFLKLVHQAGLYVNLRIGPYVCAEWNFGGFPAWLLKLPGIQ-FRT 138
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
N+ F ++M+ F IVDM K E+LFASQGGP++ +QIENEYGNV YG GK+Y+ W
Sbjct: 139 HNRAFEDKMEEFVRKIVDMVKSEQLFASQGGPVLFSQIENEYGNVQGSYGTNGKTYMLWA 198
Query: 206 AKMATSLDIGVPWIMCQESDAPSPMFT-----------PNNPNSPKIWTENWTGWFKSWG 254
A+MA L+ GVPWIMC++ DAP + PN+ + P +WTENW+GW++ WG
Sbjct: 199 ARMAKDLETGVPWIMCKQPDAPDYIINTCNGYYCDGWKPNSRDKPAMWTENWSGWYQLWG 258
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYM------------------YHGGTNFGRTSG 296
P RT ED+AFAVARFFQ GG QNYYM Y GGTNFGRTSG
Sbjct: 259 EAAPYRTVEDVAFAVARFFQRGGVAQNYYMVRMLHDLEQHLLMPERCQYFGGTNFGRTSG 318
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLT----------------- 339
GP++TTSYDYDAP+DE+G L QPKWGHL+ELH LK E LT
Sbjct: 319 GPFITTSYDYDAPLDEFGMLRQPKWGHLKELHAALKLCETALTSNDPLYYTLGRMQEMVQ 378
Query: 340 -----------------------YGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFN 376
N+ + G+ YNLP WSVSILPDC+ FN
Sbjct: 379 AHVYSDGSLEANFSNLATPCAAFLANIDTSSASVKFGGNVYNLPPWSVSILPDCRNVVFN 438
Query: 377 TAKVNTQTNV----KVKRPN---QAGNDQAP-----LQWKWRPEMINDFVVRGKGHFALN 424
TA+V+ QT+V V++P+ + P L W+W E + G +
Sbjct: 439 TAQVSAQTSVTKMVAVQKPSLIEEVSGSYTPGLVEQLAWEWFQEPVGG---SGINKILAH 495
Query: 425 TLIDQKST-NDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQ 483
L++Q ST ND +DYLWY T ++ D + L G + L I S ++H +VNG + S
Sbjct: 496 ALLEQISTTNDSTDYLWYSTRFEISDQE--LKGG-DPVLVITSMRDMVHIFVNGEFAGST 552
Query: 484 WT-KYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGD 542
T K G ++P+ L G N +++LSATVGLQNYG+ + GI G V + G +
Sbjct: 553 STLKSGGLYARVQQPIHLKAGVNHLAILSATVGLQNYGAHLETHGAGITGSVWIQGLS-- 610
Query: 543 ETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPL 602
T ++L+S W ++VGL G D ++ S+ ++P + + WYK F P
Sbjct: 611 -TGTRNLTSALWLHQVGLNGEHDAITWS----------STTSLPFFQPLVWYKANFNIPD 659
Query: 603 ENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGN 662
+DPV ++L MGKG AWVNG++LGR+WP A GCS + CDYRG Y S KC CG
Sbjct: 660 GDDPVAIHLGSMGKGQAWVNGHSLGRFWPAITAPSTGCS-DRCDYRGTYYSSKCLSGCGL 718
Query: 663 PSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHENK--------- 713
PSQ WYHVPR W+ + NTLVL EE GGN S ++F + VV C Q E
Sbjct: 719 PSQEWYHVPREWLVNEKNTLVLLEEIGGNVSGVSFASRVVDRVCAQVSEYSLPPVAQFSS 778
Query: 714 --TMELTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIE 770
+ L+C G+ IS I +ASFG+P+G CGAF+KGSC A ++ ++EK C+G++SCS E
Sbjct: 779 LPELGLSCSPGQFISSIFFASFGNPKGRCGAFQKGSCHA-LESETIVEKACIGRQSCSFE 837
Query: 771 ASEANLGATSCAAGTVKRLVVEALC 795
N G C G K L VEA C
Sbjct: 838 IFWKNFGTDPC-PGKAKTLAVEAAC 861
>gi|414881557|tpg|DAA58688.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 830
Score = 709 bits (1829), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/825 (46%), Positives = 489/825 (59%), Gaps = 84/825 (10%)
Query: 30 SHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRR 89
++D +A+ ++G+R+IL+SGSIHYPRS P MWPDLI+KAK+GGLD ++TYVFWN HEP RR
Sbjct: 30 TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89
Query: 90 QYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKV 149
QY F G DL+ FIK ++ GLYV LRIGPYVCAEWN+GGFPVWL +PGI RT N+
Sbjct: 90 QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGI-SFRTDNEP 148
Query: 150 FMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMA 209
F EMQNFTT IVDM K E LF QGGPIIL+QIENE+G + D G+ K+Y +W A MA
Sbjct: 149 FKAEMQNFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMA 208
Query: 210 TSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDP 258
+L+ VPW+MC+E DAP P+ F+PN P+ P +WTE WT W+ +G P
Sbjct: 209 VALNTSVPWVMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVP 268
Query: 259 KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQ 318
R EDLA+ VA+F Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAPIDEYG L +
Sbjct: 269 HRPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLRE 328
Query: 319 PKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN---------------------------- 350
PKWGHL+ELHK +K E L G+ T GN
Sbjct: 329 PKWGHLKELHKAIKLCEPALVAGDPIVTSLGNAQQASVFRSSTDACVAFLENKDKVSYAR 388
Query: 351 -SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEM 409
S +G Y+LP WS+SILPDCKT +NTA V +Q + + AG W+ E
Sbjct: 389 VSFNGMHYDLPPWSISILPDCKTTVYNTASVGSQ--ISQMKMEWAGG----FTWQSYNED 442
Query: 410 INDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSG 468
IN G FA L++Q T D +DYLWY T D+ D+ LS N L + S+G
Sbjct: 443 INSL---GDESFATVGLLEQINVTRDNTDYLWYTTYVDIAQDEQFLSNGKNPMLTVMSAG 499
Query: 469 QVLHAYVNGNYVDSQWTKYGASND---LFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
LH +VNG T YG+ D + VKL G N IS LS VGL N G F+
Sbjct: 500 HALHIFVNGQLTG---TVYGSVEDPKLTYSGNVKLWSGSNTISCLSIAVGLPNVGEHFET 556
Query: 526 VPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNV 585
GI GPV L G +DL+ KWTYKVGL G + +++ E G +
Sbjct: 557 WNAGILGPVTLDGLNEGR---RDLTWQKWTYKVGLKGEALSLHSLSGSSSVEWGEPVQKQ 613
Query: 586 PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESC 645
PL +WYK F AP ++P+ L++ MGKG W+NG +GRYWP Y A + C
Sbjct: 614 PL----SWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKASG---TCGIC 666
Query: 646 DYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTA 705
DYRG Y KC NCG+ SQ WYHVPRSW+ N LV+FEE+GG+P+ I+ + G+
Sbjct: 667 DYRGEYDEKKCQTNCGDSSQRWYHVPRSWLNPTGNLLVIFEEWGGDPTGISMVKRIAGSI 726
Query: 706 CG--------------QAHENKTMELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAE 750
C + +E + L C HGR+++ IK+ASFG PQG+CG++ +G C A
Sbjct: 727 CADVSEWQPSMANWRTKGYEKAKVHLQCDHGRKMTHIKFASFGTPQGSCGSYSEGGCHAH 786
Query: 751 IDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+ K C+G++ C + G C GT+KR VVEA+C
Sbjct: 787 -KSYDIFWKSCIGQERCGVSVVPDAFGGDPC-PGTMKRAVVEAIC 829
>gi|4006924|emb|CAB16852.1| beta-galactosidase like protein [Arabidopsis thaliana]
gi|7270584|emb|CAB80302.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 853
Score = 709 bits (1829), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/861 (45%), Positives = 498/861 (57%), Gaps = 85/861 (9%)
Query: 1 MATLKHCSRAIL-LCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGM 59
M T SR IL CL L + V++D +A+ I+G+R+IL SGSIHYPRSTP M
Sbjct: 1 MGTGDSASRLILWFCLGFLILGVGFVQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDM 60
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W DLI+KAK+GG+D IETYVFWN HEP +YDF G DL+RF+KTI GLY LRIGP
Sbjct: 61 WEDLIQKAKDGGIDVIETYVFWNLHEPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGP 120
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
YVCAEWN+GGFPVWL +PGI RT N+ F M+ FT IV++ K E LF SQGGPII
Sbjct: 121 YVCAEWNFGGFPVWLKYVPGI-SFRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPII 179
Query: 180 LAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM--------- 230
L+QIENEYG G G +Y+ W AKMA + + GVPW+MC+E DAP P+
Sbjct: 180 LSQIENEYGRQGQLLGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYC 239
Query: 231 --FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGG 288
F PN P P IWTE W+GWF +GG R +DLAF VARF Q GG+F NYYMYHGG
Sbjct: 240 DSFAPNKPYKPLIWTEAWSGWFTEFGGPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGG 299
Query: 289 TNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDY 348
TNFGRT+GGP++TTSYDYDAPIDEYG + QPK+GHL+ELH+ +K EK L + T
Sbjct: 300 TNFGRTAGGPFVTTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCEKALVSADPVVTSI 359
Query: 349 GNS-----------------------------VSGSSYNLPAWSVSILPDCKTEEFNTAK 379
GN + YNLP WS+SILPDC+ FNTAK
Sbjct: 360 GNKQQAHVYSAESGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNTAK 419
Query: 380 VNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDY 438
V QT+ P N QW+ E ++ + F + L++Q T D SDY
Sbjct: 420 VGVQTSQMEMLPTDTKN----FQWESYLEDLSS--LDDSSTFTTHGLLEQINVTRDTSDY 473
Query: 439 LWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPV 498
LWYMT+ D+ D + L G TL I S+G +H +VNG S + ++ +
Sbjct: 474 LWYMTSVDIGDSESFLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKI 533
Query: 499 KLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKV 558
L G N+I+LLS VGL N G F+ GI GPV L G + + DLS KWTY+V
Sbjct: 534 NLHSGTNRIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGKM---DLSWQKWTYQV 590
Query: 559 GLYGLDDKKFYNAKAANSER-GW--SSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMG 615
GL G + A N+ GW +S V + +TW+KT F+AP N+P+ L+++GMG
Sbjct: 591 GLKG---EAMNLAFPTNTPSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMG 647
Query: 616 KGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWI 675
KG WVNG ++GRYW T A D CS C Y G Y +KC CG P+Q WYHVPR+W+
Sbjct: 648 KGQIWVNGESIGRYW-TAFATGD-CS--HCSYTGTYKPNKCQTGCGQPTQRWYHVPRAWL 703
Query: 676 KDGVNTLVLFEEFGGNPSQINFQTVVVGTAC--------------------GQAHENKTM 715
K N LV+FEE GGNPS ++ V C GQ +
Sbjct: 704 KPSQNLLVIFEELGGNPSTVSLVKRSVSGVCAEVSEYHPNIKNWQIESYGKGQTFHRPKV 763
Query: 716 ELTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEA 774
L C G+ I+ IK+ASFG P G CG++++G C A ++E++CVGK C++ S +
Sbjct: 764 HLKCSPGQAIASIKFASFGTPLGTCGSYQQGECHAATS-YAILERKCVGKARCAVTISNS 822
Query: 775 NLGATSCAAGTVKRLVVEALC 795
N G C +KRL VEA+C
Sbjct: 823 NFGKDPC-PNVLKRLTVEAVC 842
>gi|18419821|ref|NP_568001.1| beta-galactosidase 3 [Arabidopsis thaliana]
gi|75202767|sp|Q9SCV9.1|BGAL3_ARATH RecName: Full=Beta-galactosidase 3; Short=Lactase 3; Flags:
Precursor
gi|6686878|emb|CAB64739.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|15810493|gb|AAL07134.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|20259271|gb|AAM14371.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332661246|gb|AEE86646.1| beta-galactosidase 3 [Arabidopsis thaliana]
Length = 856
Score = 709 bits (1829), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/861 (45%), Positives = 498/861 (57%), Gaps = 85/861 (9%)
Query: 1 MATLKHCSRAIL-LCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGM 59
M T SR IL CL L + V++D +A+ I+G+R+IL SGSIHYPRSTP M
Sbjct: 4 MGTGDSASRLILWFCLGFLILGVGFVQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDM 63
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W DLI+KAK+GG+D IETYVFWN HEP +YDF G DL+RF+KTI GLY LRIGP
Sbjct: 64 WEDLIQKAKDGGIDVIETYVFWNLHEPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGP 123
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
YVCAEWN+GGFPVWL +PGI RT N+ F M+ FT IV++ K E LF SQGGPII
Sbjct: 124 YVCAEWNFGGFPVWLKYVPGI-SFRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPII 182
Query: 180 LAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM--------- 230
L+QIENEYG G G +Y+ W AKMA + + GVPW+MC+E DAP P+
Sbjct: 183 LSQIENEYGRQGQLLGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYC 242
Query: 231 --FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGG 288
F PN P P IWTE W+GWF +GG R +DLAF VARF Q GG+F NYYMYHGG
Sbjct: 243 DSFAPNKPYKPLIWTEAWSGWFTEFGGPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGG 302
Query: 289 TNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDY 348
TNFGRT+GGP++TTSYDYDAPIDEYG + QPK+GHL+ELH+ +K EK L + T
Sbjct: 303 TNFGRTAGGPFVTTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCEKALVSADPVVTSI 362
Query: 349 GNS-----------------------------VSGSSYNLPAWSVSILPDCKTEEFNTAK 379
GN + YNLP WS+SILPDC+ FNTAK
Sbjct: 363 GNKQQAHVYSAESGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNTAK 422
Query: 380 VNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDY 438
V QT+ P N QW+ E ++ + F + L++Q T D SDY
Sbjct: 423 VGVQTSQMEMLPTDTKN----FQWESYLEDLSS--LDDSSTFTTHGLLEQINVTRDTSDY 476
Query: 439 LWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPV 498
LWYMT+ D+ D + L G TL I S+G +H +VNG S + ++ +
Sbjct: 477 LWYMTSVDIGDSESFLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKI 536
Query: 499 KLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKV 558
L G N+I+LLS VGL N G F+ GI GPV L G + + DLS KWTY+V
Sbjct: 537 NLHSGTNRIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGKM---DLSWQKWTYQV 593
Query: 559 GLYGLDDKKFYNAKAANSER-GW--SSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMG 615
GL G + A N+ GW +S V + +TW+KT F+AP N+P+ L+++GMG
Sbjct: 594 GLKG---EAMNLAFPTNTPSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMG 650
Query: 616 KGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWI 675
KG WVNG ++GRYW T A D CS C Y G Y +KC CG P+Q WYHVPR+W+
Sbjct: 651 KGQIWVNGESIGRYW-TAFATGD-CS--HCSYTGTYKPNKCQTGCGQPTQRWYHVPRAWL 706
Query: 676 KDGVNTLVLFEEFGGNPSQINFQTVVVGTAC--------------------GQAHENKTM 715
K N LV+FEE GGNPS ++ V C GQ +
Sbjct: 707 KPSQNLLVIFEELGGNPSTVSLVKRSVSGVCAEVSEYHPNIKNWQIESYGKGQTFHRPKV 766
Query: 716 ELTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEA 774
L C G+ I+ IK+ASFG P G CG++++G C A ++E++CVGK C++ S +
Sbjct: 767 HLKCSPGQAIASIKFASFGTPLGTCGSYQQGECHAATS-YAILERKCVGKARCAVTISNS 825
Query: 775 NLGATSCAAGTVKRLVVEALC 795
N G C +KRL VEA+C
Sbjct: 826 NFGKDPC-PNVLKRLTVEAVC 845
>gi|356550446|ref|XP_003543598.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 841
Score = 709 bits (1829), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/859 (45%), Positives = 511/859 (59%), Gaps = 93/859 (10%)
Query: 11 ILLCLILQTLF---NLSLAYR--------VSHDGRAITIDGERKILLSGSIHYPRSTPGM 59
+++CL L+ + L LA+ VS+D +AITI+G+R+IL+SGSIHYPRSTP M
Sbjct: 1 MVMCLKLKLIMWNVALLLAFSLIGSAKASVSYDSKAITINGQRRILISGSIHYPRSTPEM 60
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
WPDLI+KAK+GGLD I+TYVFWN HEP +Y F GN DL++FIK +Q GLYV LRIGP
Sbjct: 61 WPDLIQKAKDGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFIKLVQQAGLYVHLRIGP 120
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
YVCAEWN+GGFPVWL +PGI RT N+ F +MQ FTT IVD+ K E+L+ SQGGPII
Sbjct: 121 YVCAEWNFGGFPVWLKYIPGI-SFRTDNEPFKVQMQKFTTKIVDLMKAERLYESQGGPII 179
Query: 180 LAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM--------- 230
++QIENEYG + + G AGK+Y W A+MA L GVPWIMC++ D P P+
Sbjct: 180 MSQIENEYGPMEYEIGAAGKAYTKWAAEMAMELGTGVPWIMCKQDDTPDPLINTCNGFYC 239
Query: 231 --FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGG 288
F+PN PK+WTE WTGWF +GG P R AEDLAF+VARF Q GG+F NYYMYHGG
Sbjct: 240 DYFSPNKAYKPKMWTEAWTGWFTEFGGPVPHRPAEDLAFSVARFIQKGGSFINYYMYHGG 299
Query: 289 TNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDY 348
TNFGRT+GGP++ TSYDYDAP+DEYG L QPKWGHL++LH+ +K E L G+ T T
Sbjct: 300 TNFGRTAGGPFIATSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGDPTVTKI 359
Query: 349 GN--------SVSGSS---------------------YNLPAWSVSILPDCKTEEFNTAK 379
GN S+SG+ YNLP WS+SILP+CK +NTA+
Sbjct: 360 GNYQEAHVFKSMSGACAAFLANYNPKSYATVAFGNMHYNLPPWSISILPNCKNTVYNTAR 419
Query: 380 VNTQT-NVKVKR-PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVS 436
V +Q+ +K+ R P G L W + F + L++Q +T D+S
Sbjct: 420 VGSQSAQMKMTRVPIHGG-----LSWL---SFNEETTTTDDSSFTMTGLLEQLNTTRDLS 471
Query: 437 DYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFER 496
DYLWY T+ L ++ L + L + S+G LH ++NG + + F
Sbjct: 472 DYLWYSTDVVLDPNEGFLRNGKDPVLTVFSAGHALHVFINGQLSGTAYGSLEFPKLTFNE 531
Query: 497 PVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTY 556
VKL G N+ISLLS VGL N G F+ G+ GP+ L G +DLS KW+Y
Sbjct: 532 GVKLRTGVNKISLLSVAVGLPNVGPHFETWNAGVLGPISLSGLNEGR---RDLSWQKWSY 588
Query: 557 KVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMG 615
KVGL G +++ E W ++ R+ +TWYKTTF+AP P+ L++ MG
Sbjct: 589 KVGLKGETLSLHSLGGSSSVE--WIQGSLVSQRQPLTWYKTTFDAPDGTAPLALDMNSMG 646
Query: 616 KGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWI 675
KG W+NG NLGRYWP Y A + + CDY G Y +KC NCG SQ WYHVP+SW+
Sbjct: 647 KGQVWLNGQNLGRYWPAYKASG---TCDYCDYAGTYNENKCRSNCGEASQRWYHVPQSWL 703
Query: 676 KDGVNTLVLFEEFGGNPSQINFQTVVVGTAC------------------GQAHENKTMEL 717
K N LV+FEE GG+ + I+ + + C G+A + L
Sbjct: 704 KPTGNLLVVFEELGGDLNGISLVRRDIDSVCADIYEWQPNLISYQMQTSGKAPVRPKVHL 763
Query: 718 TCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANL 776
+C G++IS IK+ASFG P G+CG F +GSC A + E+ CVG+ C++ S N
Sbjct: 764 SCSPGQKISSIKFASFGTPVGSCGNFHEGSCHAHMS-YDAFERNCVGQNLCTVAVSPENF 822
Query: 777 GATSCAAGTVKRLVVEALC 795
G C +K+L VEA+C
Sbjct: 823 GGDPC-PNVLKKLSVEAIC 840
>gi|118488890|gb|ABK96254.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 846
Score = 707 bits (1824), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/836 (44%), Positives = 494/836 (59%), Gaps = 82/836 (9%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
S+ VS+D +AITI+G+R+IL+SGSIHYPRS+P MWPDLI+KAKEGGLD I+TYVFWN
Sbjct: 28 SVTASVSYDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNG 87
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
HEP +Y F GN DL++F+K ++ GLYV LRIGPY+CAEWN+GGFPVWL +PGI
Sbjct: 88 HEPSPGKYYFEGNYDLVKFVKLAKEAGLYVHLRIGPYICAEWNFGGFPVWLKYIPGIN-F 146
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYIN 203
RT N F +MQ FTT IV+M K E+LF +QGGPIIL+QIENEYG + + G GK+Y
Sbjct: 147 RTDNGPFKAQMQKFTTKIVNMMKAERLFETQGGPIILSQIENEYGPMEYEIGSPGKAYTK 206
Query: 204 WCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKS 252
W A+MA L GVPW+MC++ DAP P+ F+PN PK+WTE WTGWF
Sbjct: 207 WAAEMAVGLRTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTQ 266
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDE 312
+GG P R AED+AF+VARF Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAP+DE
Sbjct: 267 FGGPVPHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDE 326
Query: 313 YGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN---------------------- 350
YG L QPKWGHL++LH+ +K E L G+ T GN
Sbjct: 327 YGLLRQPKWGHLKDLHRAIKLCEPALVSGDATVIPLGNYQEAHVFNYKAGGCAAFLANYH 386
Query: 351 -------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQW 403
S YNLP WS+SILPDCK +NTA+V Q+ P P+
Sbjct: 387 QRSFAKVSFRNMHYNLPPWSISILPDCKNTVYNTARVGAQSARMKMTP-------VPMHG 439
Query: 404 KWRPEMINDF-VVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMT 461
+ + N+ G F + L++Q +T DVSDYLWYMT+ + + L
Sbjct: 440 GFSWQAYNEEPSASGDSTFTMVGLLEQINTTRDVSDYLWYMTDVHIDPSEGFLRSGKYPV 499
Query: 462 LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGS 521
L + S+G LH ++NG + + F + VKL G N+ISLLS VGL N G
Sbjct: 500 LGVLSAGHALHVFINGQLSGTAYGSLDFPKLTFTQGVKLRAGVNKISLLSIAVGLPNVGP 559
Query: 522 KFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWS 581
F+ GI GPV L G +DLS KW+YK+GL+G + ++ + +S W+
Sbjct: 560 HFETWNAGILGPVTLNGLNEGR---RDLSWQKWSYKIGLHG--EALGLHSISGSSSVEWA 614
Query: 582 SKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGC 640
++ R+ ++WYKTTF AP N P+ L++ MGKG W+NG ++GR+WP Y A
Sbjct: 615 EGSLVAQRQPLSWYKTTFNAPAGNSPLALDMGSMGKGQIWINGQHVGRHWPAYKASG--- 671
Query: 641 STESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTV 700
+ C Y G Y KC+ NCG SQ WYHVP+SW+K N LV+FEE+GG+P+ I+
Sbjct: 672 TCGDCSYIGTYNEKKCSTNCGEASQRWYHVPQSWLKPTGNLLVVFEEWGGDPNGISLVRR 731
Query: 701 VVGTACGQAHE----------------NKTMELTCH-----GRRISEIKYASFGDPQGAC 739
V + C +E NK + H G++I IK+ASFG P+G C
Sbjct: 732 DVDSVCADIYEWQPTLMNYQMQASGKVNKPLRPKAHLSCGPGQKIRSIKFASFGTPEGVC 791
Query: 740 GAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G++++GSC A CVG+ SCS+ + G C +K+L VEA+C
Sbjct: 792 GSYRQGSCHA-FHSYDAFNNLCVGQNSCSVTVAPEMFGGDPC-LNVMKKLAVEAIC 845
>gi|224087947|ref|XP_002308268.1| predicted protein [Populus trichocarpa]
gi|222854244|gb|EEE91791.1| predicted protein [Populus trichocarpa]
Length = 838
Score = 706 bits (1822), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/832 (44%), Positives = 483/832 (58%), Gaps = 77/832 (9%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
++ VS+D +A+ I+G+R+IL+SGSIHYPRSTP MWPDLI+KAK+GG+D I+TYVFWN
Sbjct: 23 TVTASVSYDHKAVIINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGVDVIQTYVFWNG 82
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
HEP Y F DL++FIK +Q GLY+ LRIGPY+CAEWN+GGFPVWL +PGIE
Sbjct: 83 HEPSPGNYYFEDRYDLVKFIKLVQQAGLYLHLRIGPYICAEWNFGGFPVWLKYVPGIE-F 141
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYIN 203
RT N F MQ FT IV M K EKLF +QGGPIIL+QIENEYG V + G GK+Y
Sbjct: 142 RTDNGPFKAAMQKFTEKIVGMMKSEKLFENQGGPIILSQIENEYGPVEWEIGAPGKAYTK 201
Query: 204 WCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKS 252
W A MA L GVPWIMC++ DAP PM F PN PKIWTE WTGW+
Sbjct: 202 WAADMAVKLGTGVPWIMCKQEDAPDPMIDTCNGFYCENFKPNKDYKPKIWTEAWTGWYTE 261
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDE 312
+GG P R AED+AF+VARF Q GG++ NYYMYHGGTNFGRT+GGP++ TSYDYDAP+DE
Sbjct: 262 FGGAVPHRPAEDMAFSVARFIQNGGSYINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDE 321
Query: 313 YGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN---------------------- 350
+G +PKWGHLR+LHK +K E L + T T G+
Sbjct: 322 FGLPREPKWGHLRDLHKAIKLCEPALVSVDPTVTSLGSNQEAHVFKSKSVCAAFLANYDT 381
Query: 351 ------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWK 404
+ Y LP WSVSILPDCKT +NTA++ +Q++ P A +
Sbjct: 382 KYSVKVTFGNGQYELPPWSVSILPDCKTAVYNTARLGSQSSQMKMVP-------ASSSFS 434
Query: 405 WRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLR 463
W+ +N L +Q T D +DYLWY+T+ + D+ L N L
Sbjct: 435 WQSYNEETASADDDDTTTMNGLWEQINVTRDATDYLWYLTDVKIDADEGFLKSGQNPLLT 494
Query: 464 INSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF 523
I S+G LH ++NG + + F + +KLT G N+ISLLS VGL N G F
Sbjct: 495 IFSAGHALHVFINGQLAGTAYGGLSNPKLTFSQNIKLTEGINKISLLSVAVGLPNVGLHF 554
Query: 524 DMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSK 583
+ G+ GP+ L G +DLS KW+YK+GL G + + A + S
Sbjct: 555 ETWNAGVLGPITLKGL---NEGTRDLSGQKWSYKIGLKG-ESLSLHTASGSESVEWVEGS 610
Query: 584 NVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTE 643
+ + +TWYKT F+AP NDP+ L++ MGKG W+NG N+GR+WP Y+A S
Sbjct: 611 LLAQKQALTWYKTAFDAPQGNDPLALDMSSMGKGQMWINGQNIGRHWPGYIAHG---SCG 667
Query: 644 SCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVG 703
C+Y G + KC NCG PSQ WYHVPRSW+K N L +FEE+GG+P+ I+F
Sbjct: 668 DCNYAGTFDDKKCRTNCGEPSQRWYHVPRSWLKPSGNLLAVFEEWGGDPTGISFVKRTTA 727
Query: 704 TACGQAHENK-------------------TMELTC-HGRRISEIKYASFGDPQGACGAFK 743
+ C E + L C G++IS+IK+ASFG PQG CG+F+
Sbjct: 728 SVCADIFEGQPALKNWQAIASGKVISPQPKAHLWCPTGQKISQIKFASFGMPQGTCGSFR 787
Query: 744 KGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+GSC A E+ CVGK+SCS+ + G C + K+L VEA+C
Sbjct: 788 EGSCHAH-KSYDAFERNCVGKQSCSVTVAPEVFGGDPC-PDSAKKLSVEAVC 837
>gi|183238710|gb|ACC60981.1| beta-galactosidase 1 precursor [Petunia x hybrida]
Length = 842
Score = 706 bits (1821), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/866 (44%), Positives = 508/866 (58%), Gaps = 96/866 (11%)
Query: 1 MATLKHCSRAIL--LCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPG 58
M + SR ++ + L+L + S VS+D +AI ++G+R+IL+SGSIHYPRSTP
Sbjct: 1 MMVINMVSRLVMWNVLLVLLSSCVFSGLASVSYDHKAIIVNGQRRILISGSIHYPRSTPE 60
Query: 59 MWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIG 118
MWPDLI+KAKEGG+D I+TYVFWN HEP + +Y F DL++FIK + GLYV LR+G
Sbjct: 61 MWPDLIQKAKEGGVDVIQTYVFWNGHEPEQGKYYFEERYDLVKFIKLVHQAGLYVNLRVG 120
Query: 119 PYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPI 178
PY CAEWN+GGFPVWL +PGI RT N+ F MQ FTT IV+M K E+L+ SQGGPI
Sbjct: 121 PYACAEWNFGGFPVWLKYVPGI-SFRTDNEPFKAAMQKFTTKIVNMMKAERLYESQGGPI 179
Query: 179 ILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-------- 230
IL+QIENEYG + +G+ GKSY W AKMA L GVPW+MC++ DAP P+
Sbjct: 180 ILSQIENEYGPLEVRFGEQGKSYAEWAAKMALDLGTGVPWLMCKQDDAPDPVINTCNGFY 239
Query: 231 ---FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHG 287
F PN PKIWTE WT WF +G P R EDLAF VA F Q GG+F NYYMYHG
Sbjct: 240 CDYFYPNKAYKPKIWTEAWTAWFTEFGSPVPYRPVEDLAFGVANFIQTGGSFINYYMYHG 299
Query: 288 GTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTD 347
GTNFGRT+GGP++ TSYDYDAP+DE+G L QPKWGHL++LH+ +K E L G+ T T
Sbjct: 300 GTNFGRTAGGPFVATSYDYDAPLDEFGLLRQPKWGHLKDLHRAIKLCEPALVSGDPTVTA 359
Query: 348 YGN--------SVSGSS---------------------YNLPAWSVSILPDCKTEEFNTA 378
GN S SG+ YNLP WS+SILPDCK +NTA
Sbjct: 360 LGNYQKAHVFRSTSGACAAFLANNDPNSFATVAFGNKHYNLPPWSISILPDCKHTVYNTA 419
Query: 379 KVNTQTNVKVKRPNQAG------NDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KS 431
+V Q+ + P G NDQ N F V G L++Q +
Sbjct: 420 RVGAQSALMKMTPANEGYSWQSYNDQTAFY------DDNAFTVVG--------LLEQLNT 465
Query: 432 TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASN 491
T DVSDYLWYMT+ + + L + L ++S+G LH +VNG + +
Sbjct: 466 TRDVSDYLWYMTDVKIDPSEGFLRSGNWPWLTVSSAGDALHVFVNGQLAGTVYGSLKKQK 525
Query: 492 DLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSS 551
F + V L G N+ISLLS VGL N G F+ G+ GPV L G DE +DL+
Sbjct: 526 ITFSKAVNLRAGVNKISLLSIAVGLPNIGPHFETWNTGVLGPVSLSGL--DEG-KRDLTW 582
Query: 552 HKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRR-MTWYKTTFEAPLENDPVVLN 610
KW+YKVGL G + ++ + +S W ++ R+ +TWYKTTF AP N+P+ L+
Sbjct: 583 QKWSYKVGLKG--EALNLHSLSGSSSVEWVEGSLVAQRQPLTWYKTTFNAPAGNEPLALD 640
Query: 611 LQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHV 670
+ MGKG W+NG ++GRYWP Y A + ++C+Y GP+ KC NCG+ SQ WYHV
Sbjct: 641 MNSMGKGQVWINGQSIGRYWPGYKASG---TCDACNYAGPFNEKKCLSNCGDASQRWYHV 697
Query: 671 PRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHE----------------NKT 714
PRSW+ N LV+FEE+GG+P+ I+ + + C +E +K
Sbjct: 698 PRSWLHPTGNLLVVFEEWGGDPNGISLVKRELASVCADINEWQPQLVNWQLQASGKVDKP 757
Query: 715 MELTCH-----GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSI 769
+ H G++I+ IK+ASFG PQG CG+F +GSC A EK C+G++SC++
Sbjct: 758 LRPKAHLSCTSGQKITSIKFASFGTPQGVCGSFSEGSCHAH-HSYDAFEKYCIGQESCTV 816
Query: 770 EASEANLGATSCAAGTVKRLVVEALC 795
+ G C + +K+L VEA+C
Sbjct: 817 PVTPEIFGGDPCPS-VMKKLSVEAVC 841
>gi|449457508|ref|XP_004146490.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
gi|449500002|ref|XP_004160975.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 846
Score = 706 bits (1821), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/854 (45%), Positives = 496/854 (58%), Gaps = 80/854 (9%)
Query: 1 MATLKHCSRAILLCLILQTLFNLSLAYR-VSHDGRAITIDGERKILLSGSIHYPRSTPGM 59
MAT +C L+ +L N L + V++D +AI I+G+R+IL SGSIHYPRSTP M
Sbjct: 1 MATHYYCFPLFLIAFLLA---NSHLIHSTVTYDRKAILINGQRRILFSGSIHYPRSTPEM 57
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W DLI KAK GGLD +ETYVFWN HEP Y+F G DL+RFIKTIQ GLY LRIGP
Sbjct: 58 WEDLILKAKNGGLDVVETYVFWNVHEPYPGIYNFEGRFDLVRFIKTIQKAGLYANLRIGP 117
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
YVCAEWN+GGFPVWL +PGI RT N+ F N MQ FT IV + K E LF SQGGPII
Sbjct: 118 YVCAEWNFGGFPVWLKYVPGI-SFRTDNEAFKNAMQGFTEKIVALMKSENLFESQGGPII 176
Query: 180 LAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM--------- 230
LAQIENEYG +G+AG +Y+ W A MA L GVPW+MC+E+DAP P+
Sbjct: 177 LAQIENEYGTESKLFGEAGYNYMTWAANMAVGLQTGVPWVMCKEADAPDPVINTCNGFYC 236
Query: 231 --FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGG 288
F+PN P P +WTE WTGWF +GG +R +DLAFAVARF Q GG+ NYYMYHGG
Sbjct: 237 DTFSPNKPYKPTMWTEAWTGWFSEFGGPLHQRPVQDLAFAVARFIQRGGSLVNYYMYHGG 296
Query: 289 TNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDY 348
TNFGRT+GGP++TTSYDYDAPIDEYG L QPK+GHL+ELH+ +K E L + T
Sbjct: 297 TNFGRTAGGPFITTSYDYDAPIDEYGLLRQPKYGHLKELHRAIKMCEPALVSADPIVTSL 356
Query: 349 GN--------SVSGSS---------------------YNLPAWSVSILPDCKTEEFNTAK 379
G+ S SG YNLP WS+SILPDCK FNTAK
Sbjct: 357 GDYQQAHVYSSESGGCAAFLSNYDTKSFARVLFNNRHYNLPPWSISILPDCKNAVFNTAK 416
Query: 380 VNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDY 438
V QT P ++ L W+ E I+ + + L++Q T D SDY
Sbjct: 417 VGVQTAQMGMLPAESTT----LSWESYFEDIS--ALDDRSMMTSPGLLEQINVTRDTSDY 470
Query: 439 LWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPV 498
LWY+T+ D+ +P L G TL + S+G +H ++NG S + + V
Sbjct: 471 LWYITSVDISSSEPFLHGGELPTLLVQSTGHAVHVFINGQLSGSVSGSRKSRRFTYSGKV 530
Query: 499 KLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVG-RAGDETIIKDLSSHKWTYK 557
L G N+I LLS VGL N G F+ GI GPV+L G R G DLSS KWTYK
Sbjct: 531 NLHAGTNKIGLLSVAVGLPNVGGHFETWNTGILGPVVLYGLRQGKW----DLSSQKWTYK 586
Query: 558 VGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKG 617
VGL G + + E +S + +TW+K F+AP +P+ L+++GMGKG
Sbjct: 587 VGLKGEAMNLISPSGFSPVEWMQASLAAQTPQPLTWHKAYFDAPEGEEPLALDMEGMGKG 646
Query: 618 FAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKD 677
W+NG ++GRYW Y CS C+Y + KC CG P+Q WYHVPRSW++
Sbjct: 647 QIWINGQSIGRYWTAY--ARGNCS--RCNYATAFRPPKCQLGCGQPTQRWYHVPRSWLRP 702
Query: 678 GVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHE---------------NKTMELTCH-G 721
N LV+FEE GGNPS+I+ +V + C E + L+C G
Sbjct: 703 EQNLLVVFEEVGGNPSRISIVKRLVTSVCADVSEFHPTFKNWHITAKFITPKVHLSCDPG 762
Query: 722 RRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSC 781
+ IS IK+ASFG P G CG++++G+C A ++EK+CVGK+ C++ S +N
Sbjct: 763 QYISSIKFASFGTPLGTCGSYQQGTCHAPSSS-GILEKKCVGKQRCAVTVSNSNF--EDP 819
Query: 782 AAGTVKRLVVEALC 795
+KRL VEA+C
Sbjct: 820 CPNMMKRLSVEAVC 833
>gi|224134551|ref|XP_002327432.1| predicted protein [Populus trichocarpa]
gi|222835986|gb|EEE74407.1| predicted protein [Populus trichocarpa]
Length = 839
Score = 705 bits (1820), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/836 (44%), Positives = 494/836 (59%), Gaps = 82/836 (9%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
S+ VS+D +AITI+G+R+IL+SGSIHYPRS+P MWPDLI+KAKEGGLD I+TYVFWN
Sbjct: 21 SVTASVSYDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNG 80
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
HEP +Y F GN DL++F+K ++ GLYV LRIGPY+CAEWN+GGFPVWL +PGI
Sbjct: 81 HEPSPGKYYFEGNYDLVKFVKLAKEAGLYVHLRIGPYICAEWNFGGFPVWLKYIPGIN-F 139
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYIN 203
RT N F +MQ FTT +V+M K E+LF +QGGPIIL+QIENEYG + + G GK+Y
Sbjct: 140 RTDNGPFKAQMQKFTTKVVNMMKAERLFETQGGPIILSQIENEYGPMEYEIGSPGKAYTK 199
Query: 204 WCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKS 252
W A+MA L GVPW+MC++ DAP P+ F+PN PK+WTE WTGWF
Sbjct: 200 WAAEMAVGLRTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTQ 259
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDE 312
+GG P R AED+AF+VARF Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAP+DE
Sbjct: 260 FGGPVPHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDE 319
Query: 313 YGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN---------------------- 350
YG L QPKWGHL++LH+ +K E L G+ T GN
Sbjct: 320 YGLLRQPKWGHLKDLHRAIKLCEPALVSGDATVIPLGNYQEAHVFNYKAGGCAAFLANYH 379
Query: 351 -------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQW 403
S YNLP WS+SILPDCK +NTA+V Q+ P P+
Sbjct: 380 QRSFAKVSFRNMHYNLPPWSISILPDCKNTVYNTARVGAQSARMKMTP-------VPMHG 432
Query: 404 KWRPEMINDF-VVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMT 461
+ + N+ G F + L++Q +T DVSDYLWYMT+ + + L
Sbjct: 433 GFSWQAYNEEPSASGDSTFTMVGLLEQINTTRDVSDYLWYMTDVHIDPSEGFLRSGKYPV 492
Query: 462 LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGS 521
L + S+G LH ++NG + + F + VKL G N+ISLLS VGL N G
Sbjct: 493 LGVLSAGHALHVFINGQLSGTAYGSLDFPKLTFTQGVKLRAGVNKISLLSIAVGLPNVGP 552
Query: 522 KFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWS 581
F+ GI GPV L G +DLS KW+YK+GL+G + ++ + +S W+
Sbjct: 553 HFETWNAGILGPVTLNGLNEGR---RDLSWQKWSYKIGLHG--EALGLHSISGSSSVEWA 607
Query: 582 SKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGC 640
++ R+ ++WYKTTF AP N P+ L++ MGKG W+NG ++GR+WP Y A
Sbjct: 608 EGSLVAQRQPLSWYKTTFNAPAGNSPLALDMGSMGKGQIWINGQHVGRHWPAYKASG--- 664
Query: 641 STESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTV 700
+ C Y G Y KC+ NCG SQ WYHVP+SW+K N LV+FEE+GG+P+ I+
Sbjct: 665 TCGDCSYIGTYNEKKCSTNCGEASQRWYHVPQSWLKPTGNLLVVFEEWGGDPNGISLVRR 724
Query: 701 VVGTACGQAHE----------------NKTMELTCH-----GRRISEIKYASFGDPQGAC 739
V + C +E NK + H G++I IK+ASFG P+G C
Sbjct: 725 DVDSVCADIYEWQPTLMNYQMQASGKVNKPLRPKAHLSCGPGQKIRSIKFASFGTPEGVC 784
Query: 740 GAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G++++GSC A CVG+ SCS+ + G C +K+L VEA+C
Sbjct: 785 GSYRQGSCHA-FHSYDAFNNLCVGQNSCSVTVAPEMFGGDPC-LNVMKKLAVEAIC 838
>gi|356496697|ref|XP_003517202.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 849
Score = 705 bits (1820), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/857 (46%), Positives = 502/857 (58%), Gaps = 93/857 (10%)
Query: 10 AILLCLILQTLFNLSLAY-RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAK 68
A CL L F L + V++D +AI I+G+R+IL SGSIHYPRSTP MW DLI KAK
Sbjct: 12 AAFFCLALWLGFQLEQVHCSVTYDRKAILINGQRRILFSGSIHYPRSTPDMWEDLIYKAK 71
Query: 69 EGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYG 128
EGGLD IETYVFWN HEP R Y+F G DL+RF+KTIQ GLY LRIGPYVCAEWN+G
Sbjct: 72 EGGLDVIETYVFWNVHEPSRGNYNFEGRYDLVRFVKTIQKAGLYANLRIGPYVCAEWNFG 131
Query: 129 GFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYG 188
GFPVWL +PGI RT N+ F MQ FT IV M K E+L+ SQGGPIIL+QIENEYG
Sbjct: 132 GFPVWLKYVPGIS-FRTDNEPFKKAMQGFTEKIVGMMKSERLYESQGGPIILSQIENEYG 190
Query: 189 NVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPN 237
G AG++Y+NW AKMA GVPW+MC+E DAP P+ FTPN P
Sbjct: 191 AQSKLLGSAGQNYVNWAAKMAVETGTGVPWVMCKEDDAPDPVINTCNGFYCDYFTPNKPY 250
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
P IWTE W+GWF +GG + +R +DLAF VARF Q GG+F NYYMYHGGTNFGRT+GG
Sbjct: 251 KPSIWTEAWSGWFSEFGGPNHERPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGG 310
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTL--------TYGN------- 342
P++TTSYDYDAP+DEYG + QPK+GHL+ELHK +K E+ L + GN
Sbjct: 311 PFITTSYDYDAPLDEYGLIRQPKYGHLKELHKAIKMCERALVSTDPAVTSLGNFQQAHVY 370
Query: 343 ----------VTNTDYGNSV----SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKV 388
++N D +SV + YNLP WS+SILPDC+ FNTAKV QT+
Sbjct: 371 SAKSGDCAAFLSNFDTKSSVRVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSQMQ 430
Query: 389 KRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADL 447
P + W+ E I+ + L++Q T D SDYLWY+T+ D+
Sbjct: 431 MLPT----NTRMFSWESFDEDISSLDDGSSITTTTSGLLEQINVTRDTSDYLWYITSVDI 486
Query: 448 KDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQI 507
+ L G TL + S+G +H ++NG S + + V L G N+I
Sbjct: 487 GSSESFLRGGKLPTLIVQSTGHAVHVFINGQLSGSAYGTREDRRFTYTGTVNLRAGTNRI 546
Query: 508 SLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKK 567
+LLS VGL N G F+ GI GPV+L R D+ + DLS KWTY+VGL G +
Sbjct: 547 ALLSVAVGLPNVGGHFETWNTGILGPVVL--RGFDQGKL-DLSWQKWTYQVGLKG----E 599
Query: 568 FYNAKAAN--SERGW------SSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFA 619
N + N S W S KN PL TW+KT F+AP ++P+ L+++GMGKG
Sbjct: 600 AMNLASPNGISSVEWMQSALVSDKNQPL----TWHKTYFDAPDGDEPLALDMEGMGKGQI 655
Query: 620 WVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGV 679
W+NG ++GRYW T LA + C Y G + KC CG P+Q WYHVPRSW+K
Sbjct: 656 WINGLSIGRYW-TALAAGN---CNGCSYAGTFRPPKCQVGCGQPTQRWYHVPRSWLKPDH 711
Query: 680 NTLVLFEEFGGNPSQINFQTVVVGTAC------------------GQAHENKTMELTCH- 720
N LV+FEE GG+PS+I+ V + C G++ E ++ H
Sbjct: 712 NLLVVFEELGGDPSKISLVKRSVSSVCADVSEYHPNIRNWHIDSYGKSEEFHPPKVHLHC 771
Query: 721 --GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGA 778
G+ IS IK+ASFG P G CG ++KG C + L EK+C+GK C++ S +N G
Sbjct: 772 SPGQTISSIKFASFGTPLGTCGNYEKGVCHSSTSHATL-EKKCIGKPRCTVTVSNSNFGQ 830
Query: 779 TSCAAGTVKRLVVEALC 795
C +KRL VEA+C
Sbjct: 831 DPC-PNVLKRLSVEAVC 846
>gi|30690633|ref|NP_849506.1| beta-galactosidase 3 [Arabidopsis thaliana]
gi|332661247|gb|AEE86647.1| beta-galactosidase 3 [Arabidopsis thaliana]
Length = 855
Score = 705 bits (1820), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/861 (45%), Positives = 494/861 (57%), Gaps = 86/861 (9%)
Query: 1 MATLKHCSRAIL-LCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGM 59
M T SR IL CL L + V++D +A+ I+G+R+IL SGSIHYPRSTP M
Sbjct: 4 MGTGDSASRLILWFCLGFLILGVGFVQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDM 63
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W DLI+KAK+GG+D IETYVFWN HEP +YDF G DL+RF+KTI GLY LRIGP
Sbjct: 64 WEDLIQKAKDGGIDVIETYVFWNLHEPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGP 123
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
YVCAEWN+GGFPVWL +PGI RT N+ F M+ FT IV++ K E LF SQGGPII
Sbjct: 124 YVCAEWNFGGFPVWLKYVPGI-SFRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPII 182
Query: 180 LAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM--------- 230
L+QIENEYG G G +Y+ W AKMA + + GVPW+MC+E DAP P+
Sbjct: 183 LSQIENEYGRQGQLLGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYC 242
Query: 231 --FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGG 288
F PN P P IWTE W+GWF +GG R +DLAF VARF Q GG+F NYYMYHGG
Sbjct: 243 DSFAPNKPYKPLIWTEAWSGWFTEFGGPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGG 302
Query: 289 TNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDY 348
TNFGRT+GGP++TTSYDYDAPIDEYG + QPK+GHL+ELH+ +K EK L + T
Sbjct: 303 TNFGRTAGGPFVTTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCEKALVSADPVVTSI 362
Query: 349 GNS-----------------------------VSGSSYNLPAWSVSILPDCKTEEFNTAK 379
GN + YNLP WS+SILPDC+ FNTAK
Sbjct: 363 GNKQQAHVYSAESGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNTAK 422
Query: 380 VNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDY 438
V QT+ P N Q W + + + F + L++Q T D SDY
Sbjct: 423 VGVQTSQMEMLPTDTKNFQ------WESYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDY 476
Query: 439 LWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPV 498
LWYMT+ D+ D + L G TL I S+G +H +VNG S + ++ +
Sbjct: 477 LWYMTSVDIGDSESFLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKI 536
Query: 499 KLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKV 558
L G N+I+LLS VGL N G F+ GI GPV L G + + DLS KWTY+V
Sbjct: 537 NLHSGTNRIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGKM---DLSWQKWTYQV 593
Query: 559 GLYGLDDKKFYNAKAANSER-GW--SSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMG 615
GL G + A N+ GW +S V + +TW+KT F+AP N+P+ L+++GMG
Sbjct: 594 GLKG---EAMNLAFPTNTPSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMG 650
Query: 616 KGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWI 675
KG WVNG ++GRYW T A D CS C Y G Y +KC CG P+Q WYHVPR+W+
Sbjct: 651 KGQIWVNGESIGRYW-TAFATGD-CS--HCSYTGTYKPNKCQTGCGQPTQRWYHVPRAWL 706
Query: 676 KDGVNTLVLFEEFGGNPSQINFQTVVVGTAC--------------------GQAHENKTM 715
K N LV+FEE GGNPS ++ V C GQ +
Sbjct: 707 KPSQNLLVIFEELGGNPSTVSLVKRSVSGVCAEVSEYHPNIKNWQIESYGKGQTFHRPKV 766
Query: 716 ELTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEA 774
L C G+ I+ IK+ASFG P G CG++++G C A I ++CVGK C++ S +
Sbjct: 767 HLKCSPGQAIASIKFASFGTPLGTCGSYQQGECHAATSY--AILERCVGKARCAVTISNS 824
Query: 775 NLGATSCAAGTVKRLVVEALC 795
N G C +KRL VEA+C
Sbjct: 825 NFGKDPC-PNVLKRLTVEAVC 844
>gi|449451942|ref|XP_004143719.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 613
Score = 705 bits (1819), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/628 (57%), Positives = 445/628 (70%), Gaps = 68/628 (10%)
Query: 59 MWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIG 118
MWPDLI+KAK+GGLDAIETY+FW+ HEP RR+YDF+G LD I+F + IQD GLYV++RIG
Sbjct: 1 MWPDLIQKAKDGGLDAIETYIFWDRHEPQRRKYDFSGRLDFIKFFQLIQDAGLYVVMRIG 60
Query: 119 PYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPI 178
PYVCAEWNYGGFPVWLHNMPGI+ LRT N+V+ NEMQ FTT IV+M K+ LFASQGGPI
Sbjct: 61 PYVCAEWNYGGFPVWLHNMPGIQ-LRTNNQVYKNEMQTFTTKIVNMCKQANLFASQGGPI 119
Query: 179 ILAQIENEYGNVMSD-YGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM------- 230
ILAQIENEYGNVM+ YGDAGK+YINWCA+MA SL+IGVPWIMCQ+SDAP PM
Sbjct: 120 ILAQIENEYGNVMTPAYGDAGKAYINWCAQMAESLNIGVPWIMCQQSDAPQPMINTCNGF 179
Query: 231 ----FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYH 286
FTPNNP SPK++TENW GWFK WG KDP RTAED+AF+VARFFQ GG F NYYMYH
Sbjct: 180 YCDNFTPNNPKSPKMFTENWVGWFKKWGDKDPYRTAEDVAFSVARFFQSGGVFNNYYMYH 239
Query: 287 GGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNT 346
GGTNFGRTSGGP++TTSYDY+AP+DEYG+LNQPKWGHL++LH +K EK LT +N
Sbjct: 240 GGTNFGRTSGGPFITTSYDYNAPLDEYGNLNQPKWGHLKQLHASIKLGEKILTNSTRSNQ 299
Query: 347 DYGNSVS-------------------------------GSSYNLPAWSVSILPDCKTEEF 375
++G+SV+ Y +PAWSVSIL C E +
Sbjct: 300 NFGSSVTLTKFSNPTTGERFCFLSNTDGKNDATIDLQEDGKYFVPAWSVSILDGCNKEVY 359
Query: 376 NTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TND 434
NTAKVN+QT++ VK N+ N Q L W W PE + D ++G G FA N L++QK T D
Sbjct: 360 NTAKVNSQTSMFVKEQNEKENAQ--LSWAWAPEPMKD-TLQGNGKFAANLLLEQKRVTVD 416
Query: 435 VSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLF 494
SDY WYMT D S N+TL++N+ G VLHA+VN Y+ S+W G S +F
Sbjct: 417 FSDYFWYMTKVDTNG----TSSLQNVTLQVNTKGHVLHAFVNKRYIGSKWGSNGQS-FVF 471
Query: 495 ERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPG-PVLLVGRAGDETIIKDLSSHK 553
E+P+ L G N I+LLSATVGL+NY + +DMVP GI G P+ L+G D + DLSS+
Sbjct: 472 EKPILLKSGINTITLLSATVGLKNYDAFYDMVPTGIDGGPIYLIG---DGNVTTDLSSNL 528
Query: 554 WTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLN-----RRMTWYKTTFEAPLENDPVV 608
W+YKVGL G + K+ YN + W +PLN RRMTWYKT+F+ P DPVV
Sbjct: 529 WSYKVGLNG-EMKQIYNP-VFSQRTNW----IPLNQKSIGRRMTWYKTSFKTPAGIDPVV 582
Query: 609 LNLQGMGKGFAWVNGYNLGRYWPTYLAE 636
L++QGMGKG AWVNG ++GR+WP+++ +
Sbjct: 583 LDMQGMGKGQAWVNGQSIGRFWPSFIXK 610
>gi|297798272|ref|XP_002867020.1| hypothetical protein ARALYDRAFT_491000 [Arabidopsis lyrata subsp.
lyrata]
gi|297312856|gb|EFH43279.1| hypothetical protein ARALYDRAFT_491000 [Arabidopsis lyrata subsp.
lyrata]
Length = 853
Score = 705 bits (1819), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/862 (45%), Positives = 496/862 (57%), Gaps = 87/862 (10%)
Query: 1 MATLKHCSRAIL-LCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGM 59
M + SR IL CL L L + V++D +A+ I+G+R+IL SGSIHYPRSTP M
Sbjct: 1 MGSGDSASRLILWFCLGLLILGVGFVQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDM 60
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W LI+KAK+GG+D IETYVFWN HEP +YDF G DL+RF+KTI GLY LRIGP
Sbjct: 61 WEGLIQKAKDGGIDVIETYVFWNLHEPTPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGP 120
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
YVCAEWN+GGFPVWL +PGI RT N+ F M+ FT IV++ K E LF SQGGPII
Sbjct: 121 YVCAEWNFGGFPVWLKYVPGI-SFRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPII 179
Query: 180 LAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM--------- 230
L+QIENEYG G G +Y+ W AKMA + + GVPW+MC+E DAP P+
Sbjct: 180 LSQIENEYGRQGQLLGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYC 239
Query: 231 --FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGG 288
F PN P P IWTE W+GWF +GG R +DLAF VARF Q GG+F NYYMYHGG
Sbjct: 240 DSFAPNKPYKPLIWTEAWSGWFTEFGGPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGG 299
Query: 289 TNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDY 348
TNFGRT+GGP++TTSYDYDAPIDEYG + +PK+GHL+ELH+ +K EK L + T
Sbjct: 300 TNFGRTAGGPFVTTSYDYDAPIDEYGLIREPKYGHLKELHRAIKMCEKALVSADPVVTSI 359
Query: 349 GNS-----------------------------VSGSSYNLPAWSVSILPDCKTEEFNTAK 379
GN + YNLP WS+SILPDC+ FNTAK
Sbjct: 360 GNKQQAHVYSAESGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNTAK 419
Query: 380 VNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDY 438
V QT+ P N Q W+ + + + F L++Q T D SDY
Sbjct: 420 VGVQTSQMEMLPTDTKNFQ------WQSYLEDLSSLDDSSTFTTQGLLEQINVTRDTSDY 473
Query: 439 LWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPV 498
LWYMT+ D+ D + L G TL I S+G +H +VNG S + ++ +
Sbjct: 474 LWYMTSVDIGDTESFLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKI 533
Query: 499 KLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKV 558
L G N+I+LLS VGL N G F+ GI GPV L G + + +DLS KWTY+V
Sbjct: 534 NLHSGTNRIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGK---RDLSWQKWTYQV 590
Query: 559 GLYGLDDKKFYNAKAANSER--GW--SSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGM 614
GL G + N + R GW +S V + +TW+KT F+AP N+P+ L+++GM
Sbjct: 591 GLKG----EAMNLAFPTNTRSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGM 646
Query: 615 GKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSW 674
GKG WVNG ++GRYW T A D CS C Y G Y +KC CG P+Q +YHVPRSW
Sbjct: 647 GKGQIWVNGESIGRYW-TAFATGD-CS--QCSYTGTYKPNKCQTGCGQPTQRYYHVPRSW 702
Query: 675 IKDGVNTLVLFEEFGGNPSQINFQTVVVGTAC--------------------GQAHENKT 714
+K N LV+FEE GGNPS ++ V C GQ
Sbjct: 703 LKPSQNLLVIFEELGGNPSSVSLVKRSVSGVCAEVSEYHPNIKNWQIESYGKGQTFHRPK 762
Query: 715 MELTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASE 773
+ L C G+ I+ IK+ASFG P G CG++++G C A ++E++CVGK C++ S
Sbjct: 763 VHLKCSPGQAIASIKFASFGTPLGTCGSYQQGECHAATS-YAILERKCVGKARCAVTISN 821
Query: 774 ANLGATSCAAGTVKRLVVEALC 795
N G C +KRL VEA+C
Sbjct: 822 TNFGKDPC-PNVLKRLTVEAVC 842
>gi|61162194|dbj|BAD91079.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 903
Score = 704 bits (1817), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/898 (43%), Positives = 512/898 (57%), Gaps = 136/898 (15%)
Query: 9 RAILLCLILQTLFNLSLAY----RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLI 64
R + LCL +Q + Y VS+D RA+ IDG+R++L+S IHYPR+TP MWPDLI
Sbjct: 12 RCLFLCLAVQFALEAAAEYFKPFNVSYDHRALIIDGKRRMLVSAGIHYPRATPEMWPDLI 71
Query: 65 KKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAE 124
K+KEGG+D I+TY FW+ HEP+R QY+F G D+++F + GLY+ LRIGPYVCAE
Sbjct: 72 AKSKEGGVDVIQTYAFWSGHEPVRGQYNFEGRYDIVKFANLVGASGLYLHLRIGPYVCAE 131
Query: 125 WNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIE 184
WN+GGFPVWL ++PGIE RT N +F EMQ F +VD+ ++E+L + QGGPII+ QIE
Sbjct: 132 WNFGGFPVWLRDIPGIE-FRTNNALFKEEMQRFVKKMVDLMQEEELLSWQGGPIIMMQIE 190
Query: 185 NEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTP 233
NEYGN+ +G GK YI W A+MA L GVPW+MC++ DAP + + P
Sbjct: 191 NEYGNIEGQFGQKGKEYIKWAAEMALGLGAGVPWVMCKQVDAPGSIIDACNGYYCDGYKP 250
Query: 234 NNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGR 293
N+ N P +WTE+W GW+ SWGG+ P R EDLAFAVARF+Q GG+FQNYYMY GGTNFGR
Sbjct: 251 NSYNKPTLWTEDWDGWYASWGGRLPHRPVEDLAFAVARFYQRGGSFQNYYMYFGGTNFGR 310
Query: 294 TSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTN-------- 345
TSGGP+ TSYDYDAPIDEYG L++PKWGHL++LH +K E L + N
Sbjct: 311 TSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSPNYIKLGPKQ 370
Query: 346 ----------------TDYGNSVS-------------------GSSYNLPAWSVSILPDC 370
T YG+ +S G YNLP WSVSILPDC
Sbjct: 371 EAHVYRVNSHTEGLNITSYGSQISCSAFLANIDEHKAASVTFLGQKYNLPPWSVSILPDC 430
Query: 371 KTEEFNTAKVNTQTNVKVKR---PNQAG----------NDQAPLQWKW----RPEMI--- 410
+ +NTAKV QT++K P +G ND + W P +
Sbjct: 431 RNVVYNTAKVGAQTSIKTVEFDLPLYSGISSQQQFITKNDDLFITKSWMTVKEPVGVWSE 490
Query: 411 NDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMT--LRINSSG 468
N+F V+G L T D SDYLW++T + +DD +N++ + I+S
Sbjct: 491 NNFTVQG-------ILEHLNVTKDQSDYLWHITRIFVSEDDISFWEKNNISAAVSIDSMR 543
Query: 469 QVLHAYVNGNYVD----SQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFD 524
VL +VNG + W K E+PVK +G N + LL+ TVGLQNYG+ +
Sbjct: 544 DVLRVFVNGQLTEGSVIGHWVK-------VEQPVKFLKGYNDLVLLTQTVGLQNYGAFLE 596
Query: 525 MVPNGIPGPVLLVG-RAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSK 583
G G + L G + GD DLS WTY+VGL G + K Y + N + GW+
Sbjct: 597 KDGAGFRGQIKLTGFKNGD----IDLSKLLWTYQVGLKG-EFFKIYTIE-ENEKAGWAEL 650
Query: 584 NVPLN-RRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCST 642
+ + WYKT F++P DPV L+L MGKG AWVNG+++GRYW T +A EDGC
Sbjct: 651 SPDDDPSTFIWYKTYFDSPAGTDPVALDLGSMGKGQAWVNGHHIGRYW-TLVAPEDGCP- 708
Query: 643 ESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVV 702
E CDYRG Y SDKC++NCG P+Q YHVPRSW++ N LV+ EE GGNP I+ +
Sbjct: 709 EICDYRGAYNSDKCSFNCGKPTQTLYHVPRSWLQSSSNLLVILEETGGNPFDISIKLRSA 768
Query: 703 GTACGQAHENK------------------------TMELTCH-GRRISEIKYASFGDPQG 737
G C Q E+ M L C G IS I++AS+G PQG
Sbjct: 769 GVLCAQVSESHYPPVQKWFNPDSVDEKITVNDLTPEMHLQCQDGFTISSIEFASYGTPQG 828
Query: 738 ACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+C F G+C A + ++ K C+GK SCS+E S + G C G VK L VEA C
Sbjct: 829 SCQKFSMGNCHA-TNSSSIVSKSCLGKNSCSVEISNNSFGGDPC-RGIVKTLAVEARC 884
>gi|356561185|ref|XP_003548865.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 848
Score = 704 bits (1816), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/837 (45%), Positives = 488/837 (58%), Gaps = 82/837 (9%)
Query: 23 LSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWN 82
LS V++D +AI I+G+R+IL SGSIHYPRSTP MW DLI KAKEGGLD +ETYVFWN
Sbjct: 21 LSSHASVTYDRKAILINGQRRILFSGSIHYPRSTPDMWEDLILKAKEGGLDVVETYVFWN 80
Query: 83 AHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEE 142
HEP Y+F G DL+RF+KTIQ GLY LRIGPYVCAEWN+GGFPVWL +PGI
Sbjct: 81 VHEPSPGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGIS- 139
Query: 143 LRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYI 202
RT N+ F MQ FT IV M K E+LF SQGGPIIL+QIENEYG GDAG++Y+
Sbjct: 140 FRTDNEPFKTAMQGFTEKIVGMMKSERLFESQGGPIILSQIENEYGAQSKLQGDAGQNYV 199
Query: 203 NWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFK 251
NW AKMA + GVPW+MC+E DAP P+ FTPN P P IWTE W+GWF
Sbjct: 200 NWAAKMAVEMGTGVPWVMCKEDDAPDPVINTCNGFYCDKFTPNRPYKPMIWTEAWSGWFT 259
Query: 252 SWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPID 311
+GG KR +DLAFAVARF GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAP+D
Sbjct: 260 EFGGPIHKRPVQDLAFAVARFIIRGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLD 319
Query: 312 EYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNS-------------------- 351
EYG + QPK+GHL+ELH+ +K E+ L + T G S
Sbjct: 320 EYGLIRQPKYGHLKELHRAIKMCERALVSTDPIITSLGESQQAHVYTTESGDCAAFLSNY 379
Query: 352 ---------VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQ 402
+ YNLP WSVSILPDC+ FNTAKV QT+ P N Q
Sbjct: 380 DSKSSARVMFNNMHYNLPPWSVSILPDCRNVVFNTAKVGVQTSQMQMLPT---NTQL--- 433
Query: 403 WKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMT 461
+ W + + V L++Q T D SDYLWY+T+ D+ + L G T
Sbjct: 434 FSWESFDEDVYSVDDSSAIMAPGLLEQINVTKDASDYLWYITSVDIGSSESFLRGGELPT 493
Query: 462 LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGS 521
L + S G +H ++NG S + ++ V L G N+I+LLS +GL N G
Sbjct: 494 LIVQSRGHAVHVFINGQLSGSAYGTREYRRFMYTGKVNLRAGINRIALLSVAIGLPNVGE 553
Query: 522 KFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGW- 580
F+ GI GPV L G + DLS KWTY+VGL G + + S W
Sbjct: 554 HFESWSTGILGPVALHGLDQGKW---DLSGQKWTYQVGLKG--EAMDLASPNGISSVAWM 608
Query: 581 -SSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDG 639
S+ V N+ +TW+KT F+AP ++P+ L+++GMGKG W+NG ++GRYW T+
Sbjct: 609 QSAIVVQRNQPLTWHKTHFDAPEGDEPLALDMEGMGKGQIWINGQSIGRYWTTFATG--- 665
Query: 640 CSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
+ C+Y G + KC CG P+Q WYHVPRSW+K N LV+FEE GGNPS+I+
Sbjct: 666 -NCNDCNYAGSFRPPKCQLGCGQPTQRWYHVPRSWLKPTQNLLVIFEELGGNPSKISLVK 724
Query: 700 VVVGTAC------------------GQAHENKTMELTCH---GRRISEIKYASFGDPQGA 738
V + C G++ E ++ H G+ IS IK+ASFG P G
Sbjct: 725 RSVSSVCADVSEYHPNIKNWHIESYGKSEEFHPPKVHLHCSPGQTISSIKFASFGTPLGT 784
Query: 739 CGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
CG +++G+C + ++EK+C+GK C++ S +N G C +KRL VEA+C
Sbjct: 785 CGNYEQGACHSPAS-YAILEKRCIGKPRCTVTVSNSNFGQDPCPK-VLKRLSVEAVC 839
>gi|357454655|ref|XP_003597608.1| Beta-galactosidase [Medicago truncatula]
gi|124360385|gb|ABN08398.1| D-galactoside/L-rhamnose binding SUEL lectin; Galactose-binding
like [Medicago truncatula]
gi|355486656|gb|AES67859.1| Beta-galactosidase [Medicago truncatula]
Length = 841
Score = 703 bits (1815), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/837 (46%), Positives = 496/837 (59%), Gaps = 84/837 (10%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
S+ VS+D +AITI+G+ +IL+SGSIHYPRSTP MWPDLI+KAKEGGLD I+TYVFWN
Sbjct: 23 SVIASVSYDSKAITINGQSRILISGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNG 82
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
HEP +Y F GN DL++FIK +Q GLYV LRIGPYVCAEWN+GGFPVWL +PGI
Sbjct: 83 HEPSPGKYYFEGNYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGI-SF 141
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYIN 203
RT N+ F +MQ FT IVDM K ++LF SQGGPII++QIENEYG + + G GKSY
Sbjct: 142 RTDNEPFKFQMQKFTEKIVDMMKADRLFESQGGPIIMSQIENEYGPMEYEIGAPGKSYTK 201
Query: 204 WCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKS 252
W A MA L GVPWIMC++ DAP P+ F+PN PK+WTE WTGWF
Sbjct: 202 WAADMAVGLGTGVPWIMCKQDDAPDPVINTCNGFYCDYFSPNKDYKPKMWTEAWTGWFTE 261
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDE 312
+GG P R AED+AF+VARF Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAP+DE
Sbjct: 262 FGGPVPHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDE 321
Query: 313 YGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------SVSGSS-------- 356
YG L QPKWGHL++LH+ +K E L G+ T T GN S SG+
Sbjct: 322 YGLLQQPKWGHLKDLHRAIKLSEPALISGDPTVTRIGNYQEAHVFKSKSGACAAFLGNYN 381
Query: 357 -------------YNLPAWSVSILPDCKTEEFNTAKVNTQT-NVKVKR-PNQAGNDQAPL 401
YNLP WS+SILPDCK +NTA+V +Q+ +K+ R P G L
Sbjct: 382 PKAFATVAFGNMHYNLPPWSISILPDCKNTVYNTARVGSQSAQMKMTRVPIHGG-----L 436
Query: 402 QWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNM 460
W+ E F + L++Q +T D++DYLWY T+ + ++ L +
Sbjct: 437 SWQVFTEQT---ASTDDSSFTMTGLLEQLNTTRDLTDYLWYSTDVVIDPNEGFLRSGKDP 493
Query: 461 TLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYG 520
L + S+G LH ++N + + F + VKL G N+ISLLS VGL N G
Sbjct: 494 VLTVLSAGHALHVFINSQLSGTIYGSLEFPKLTFSQNVKLIPGVNKISLLSVAVGLPNVG 553
Query: 521 SKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGW 580
F+ G+ GP+ L G DE +DLS KW+YKVGL+G +++ E W
Sbjct: 554 PHFETWNAGVLGPITLNGL--DEG-RRDLSWQKWSYKVGLHGEALSLHSLGGSSSVE--W 608
Query: 581 SSKN-VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDG 639
+ V + +TWYKTTF+AP P L++ MGKG W+NG NLGRYWP Y A
Sbjct: 609 VQGSLVSRMQPLTWYKTTFDAPDGIAPFALDMGSMGKGQVWLNGQNLGRYWPAYKASG-- 666
Query: 640 CSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
+ ++CDY G Y +KC NCG SQ WYHVP SW+ N LV+FEE GG+P+ I
Sbjct: 667 -TCDNCDYAGTYNENKCRSNCGEASQRWYHVPHSWLIPTGNLLVVFEELGGDPNGIFLVR 725
Query: 700 VVVGTACGQAHE----------------NKTMELTCH-----GRRISEIKYASFGDPQGA 738
+ + C +E NK + H G++IS IK+ASFG P G+
Sbjct: 726 RDIDSVCADIYEWQPNLISYQMQTSGKTNKPVRPKAHLSCGPGQKISSIKFASFGTPVGS 785
Query: 739 CGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
CG F +GSC A EK CVG+ SC + S N G C +K+L VEA+C
Sbjct: 786 CGNFHEGSCHAH-KSYNTFEKNCVGQNSCKVTVSPENFGGDPC-PNVLKKLSVEAIC 840
>gi|297738667|emb|CBI27912.3| unnamed protein product [Vitis vinifera]
Length = 833
Score = 703 bits (1814), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/849 (44%), Positives = 497/849 (58%), Gaps = 81/849 (9%)
Query: 9 RAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAK 68
R L+ IL + + V++D R+ I+G+RKIL+SGSIHYPRSTP MWPDLI+KAK
Sbjct: 3 RGSLVVFILIFSWVSHGSASVTYDKRSFIINGQRKILISGSIHYPRSTPEMWPDLIQKAK 62
Query: 69 EGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYG 128
+GGLD I+TYVFWN HEP R +Y F G DL+RFIK +Q GLYV LRIGPY+CAEWN+G
Sbjct: 63 DGGLDVIQTYVFWNGHEPSRGKYYFEGRYDLVRFIKVVQAAGLYVHLRIGPYICAEWNFG 122
Query: 129 GFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYG 188
GFPVWL +PGI RT N F MQ FT IVDM K EKLF QGGPII++QIENEYG
Sbjct: 123 GFPVWLKYVPGIA-FRTDNGPFKVAMQGFTQKIVDMMKSEKLFQPQGGPIIMSQIENEYG 181
Query: 189 NVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPN 237
V + G GK+Y W A+MA L GVPW+MC++ DAP P+ F PN
Sbjct: 182 PVEYEIGAPGKAYTKWAAEMAVQLGTGVPWVMCKQEDAPDPVIDACNGFYCENFFPNKDY 241
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
PK++TE WTGW+ +GG P R AEDLA++VARF Q G+F NYYMYHGGTNFGRT+GG
Sbjct: 242 KPKMFTEAWTGWYTEFGGAIPNRPAEDLAYSVARFIQNRGSFINYYMYHGGTNFGRTAGG 301
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSG--- 354
P+++TSYDYDAPIDEYG ++PKWGHLR+LHK +K E L + T T G ++
Sbjct: 302 PFISTSYDYDAPIDEYGLPSEPKWGHLRDLHKAIKLCEPALVSADPTVTYLGTNLEAHVY 361
Query: 355 --------------------------SSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKV 388
+ Y+LP WSVSILPDCK FNTA++ Q++
Sbjct: 362 KAKSGACAAFLANYDPKSSAKVTFGNTQYDLPPWSVSILPDCKNVVFNTARIGAQSSQMK 421
Query: 389 KRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADL 447
P + W+ + ++ L++Q + T D +DYLWYMT +
Sbjct: 422 MNPVST--------FSWQSYNEETASAYTEDTTTMDGLLEQINITRDTTDYLWYMTEVHI 473
Query: 448 KDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQI 507
K D+ L L + S+G LH ++NG + + + F VKLT G N+I
Sbjct: 474 KPDEGFLKTGQYPVLTVMSAGHALHVFINGQLSGTVYGELSNPKVTFSDNVKLTVGTNKI 533
Query: 508 SLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKK 567
SLLS +GL N G F+ G+ GPV L G +E + D+SS KW+YK+GL G +
Sbjct: 534 SLLSVAMGLPNVGLHFETWNAGVLGPVTLKGL--NEGTV-DMSSWKWSYKIGLKG--EAL 588
Query: 568 FYNAKAANSERGWSSKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNL 626
A +S W ++ ++ +TWYKTTF AP NDP+ L++ MGKG W+NG ++
Sbjct: 589 NLQAITGSSSDEWVEGSLLAQKQPLTWYKTTFNAPGGNDPLALDMSSMGKGQIWINGESI 648
Query: 627 GRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
GR+WP Y A + C+Y G + KC CG PSQ WYHVPRSW+K N L++FE
Sbjct: 649 GRHWPAYTAHGN---CNGCNYAGIFNDKKCQTGCGGPSQRWYHVPRSWLKPSGNQLIVFE 705
Query: 687 EFGGNPSQINFQTVVVGTACGQAHENK---------------TMELTCH-----GRRISE 726
E GGNP+ I + C E + +++ H G +IS+
Sbjct: 706 ELGGNPAGITLVKRTMDRVCADIFEGQPSLKNSQIIGSSKVNSLQSKAHLWCAPGLKISK 765
Query: 727 IKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTV 786
I++ASFG PQG CG+F++GSC A L ++ C+GK+SCS+ + G C G++
Sbjct: 766 IQFASFGVPQGTCGSFREGSCHAHKSYDAL-QRNCIGKQSCSVSVAPEVFGGDPC-PGSM 823
Query: 787 KRLVVEALC 795
K+L VEALC
Sbjct: 824 KKLSVEALC 832
>gi|225444920|ref|XP_002282132.1| PREDICTED: beta-galactosidase [Vitis vinifera]
Length = 836
Score = 702 bits (1813), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/849 (44%), Positives = 497/849 (58%), Gaps = 81/849 (9%)
Query: 9 RAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAK 68
R L+ IL + + V++D R+ I+G+RKIL+SGSIHYPRSTP MWPDLI+KAK
Sbjct: 6 RGSLVVFILIFSWVSHGSASVTYDKRSFIINGQRKILISGSIHYPRSTPEMWPDLIQKAK 65
Query: 69 EGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYG 128
+GGLD I+TYVFWN HEP R +Y F G DL+RFIK +Q GLYV LRIGPY+CAEWN+G
Sbjct: 66 DGGLDVIQTYVFWNGHEPSRGKYYFEGRYDLVRFIKVVQAAGLYVHLRIGPYICAEWNFG 125
Query: 129 GFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYG 188
GFPVWL +PGI RT N F MQ FT IVDM K EKLF QGGPII++QIENEYG
Sbjct: 126 GFPVWLKYVPGIA-FRTDNGPFKVAMQGFTQKIVDMMKSEKLFQPQGGPIIMSQIENEYG 184
Query: 189 NVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPN 237
V + G GK+Y W A+MA L GVPW+MC++ DAP P+ F PN
Sbjct: 185 PVEYEIGAPGKAYTKWAAEMAVQLGTGVPWVMCKQEDAPDPVIDACNGFYCENFFPNKDY 244
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
PK++TE WTGW+ +GG P R AEDLA++VARF Q G+F NYYMYHGGTNFGRT+GG
Sbjct: 245 KPKMFTEAWTGWYTEFGGAIPNRPAEDLAYSVARFIQNRGSFINYYMYHGGTNFGRTAGG 304
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSG--- 354
P+++TSYDYDAPIDEYG ++PKWGHLR+LHK +K E L + T T G ++
Sbjct: 305 PFISTSYDYDAPIDEYGLPSEPKWGHLRDLHKAIKLCEPALVSADPTVTYLGTNLEAHVY 364
Query: 355 --------------------------SSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKV 388
+ Y+LP WSVSILPDCK FNTA++ Q++
Sbjct: 365 KAKSGACAAFLANYDPKSSAKVTFGNTQYDLPPWSVSILPDCKNVVFNTARIGAQSSQMK 424
Query: 389 KRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADL 447
P + W+ + ++ L++Q + T D +DYLWYMT +
Sbjct: 425 MNPVST--------FSWQSYNEETASAYTEDTTTMDGLLEQINITRDTTDYLWYMTEVHI 476
Query: 448 KDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQI 507
K D+ L L + S+G LH ++NG + + + F VKLT G N+I
Sbjct: 477 KPDEGFLKTGQYPVLTVMSAGHALHVFINGQLSGTVYGELSNPKVTFSDNVKLTVGTNKI 536
Query: 508 SLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKK 567
SLLS +GL N G F+ G+ GPV L G +E + D+SS KW+YK+GL G +
Sbjct: 537 SLLSVAMGLPNVGLHFETWNAGVLGPVTLKGL--NEGTV-DMSSWKWSYKIGLKG--EAL 591
Query: 568 FYNAKAANSERGWSSKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNL 626
A +S W ++ ++ +TWYKTTF AP NDP+ L++ MGKG W+NG ++
Sbjct: 592 NLQAITGSSSDEWVEGSLLAQKQPLTWYKTTFNAPGGNDPLALDMSSMGKGQIWINGESI 651
Query: 627 GRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
GR+WP Y A + C+Y G + KC CG PSQ WYHVPRSW+K N L++FE
Sbjct: 652 GRHWPAYTAHGN---CNGCNYAGIFNDKKCQTGCGGPSQRWYHVPRSWLKPSGNQLIVFE 708
Query: 687 EFGGNPSQINFQTVVVGTACGQAHENK---------------TMELTCH-----GRRISE 726
E GGNP+ I + C E + +++ H G +IS+
Sbjct: 709 ELGGNPAGITLVKRTMDRVCADIFEGQPSLKNSQIIGSSKVNSLQSKAHLWCAPGLKISK 768
Query: 727 IKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTV 786
I++ASFG PQG CG+F++GSC A L ++ C+GK+SCS+ + G C G++
Sbjct: 769 IQFASFGVPQGTCGSFREGSCHAHKSYDAL-QRNCIGKQSCSVSVAPEVFGGDPC-PGSM 826
Query: 787 KRLVVEALC 795
K+L VEALC
Sbjct: 827 KKLSVEALC 835
>gi|255572957|ref|XP_002527409.1| beta-galactosidase, putative [Ricinus communis]
gi|223533219|gb|EEF34975.1| beta-galactosidase, putative [Ricinus communis]
Length = 845
Score = 702 bits (1812), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/848 (45%), Positives = 501/848 (59%), Gaps = 80/848 (9%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
IL+ +L L+ S++ VS+D +AITI+G+R+IL+SGSIHYPRS+P MWPDLI+KAKEG
Sbjct: 14 ILVVFLLLGLWVCSVSSSVSYDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEG 73
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GLD I+TYVFWN HEP +Y F GN DL++FIK ++ GLYV LRIGPYVCAEWN+GGF
Sbjct: 74 GLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFIKLVKQAGLYVHLRIGPYVCAEWNFGGF 133
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
PVWL +PGI RT N F +MQ FTT IV+M K E+LF SQGGPIIL+QIENEYG +
Sbjct: 134 PVWLKYVPGI-NFRTDNGPFKAQMQRFTTKIVNMMKAERLFESQGGPIILSQIENEYGPM 192
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSP 239
+ G G++Y W AKMA L GVPW+MC++ DAP P+ F+PN P P
Sbjct: 193 EYELGAPGQAYSKWAAKMAVGLGTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKPYKP 252
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
K+WTE WTGWF +GG P R AEDLAF+VARF Q GG F NYYMYHGGTNFGRT+GGP+
Sbjct: 253 KMWTEAWTGWFTEFGGAVPYRPAEDLAFSVARFIQKGGAFINYYMYHGGTNFGRTAGGPF 312
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------S 351
+ TSYDYDAP+DEYG L QPKWGHL++LH+ +K E L G + GN S
Sbjct: 313 IATSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGAPSVMPLGNYQEAHVFKS 372
Query: 352 VSGSS---------------------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR 390
SG+ YNLP WS+SILPDCK +NTA++ Q+
Sbjct: 373 KSGACAAFLANYNQRSFAKVSFGNMHYNLPPWSISILPDCKNTVYNTARIGAQSARMKMS 432
Query: 391 PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKD 449
P + W+ E + G F + L++Q +T DVSDYLWY T+ +
Sbjct: 433 PIPM---RGGFSWQAYSE---EASTEGDNTFMMVGLLEQINTTRDVSDYLWYSTDVRIDS 486
Query: 450 DDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISL 509
++ L L + S+G LH +VNG + + + F + VK+ G N+I L
Sbjct: 487 NEGFLRSGKYPVLTVLSAGHALHVFVNGQLSGTAYGSLESPKLTFSQGVKMRAGINRIYL 546
Query: 510 LSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFY 569
LS VGL N G F+ G+ GPV L G +DLS KWTYK+GL+G
Sbjct: 547 LSIAVGLPNVGPHFETWNAGVLGPVTLNGLNEGR---RDLSWQKWTYKIGLHGEALSLHS 603
Query: 570 NAKAANSERGWSSKN-VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGR 628
+ +++ E W+ + V + + WYKTTF AP N P+ L++ MGKG W+NG ++GR
Sbjct: 604 LSGSSSVE--WAQGSFVSRKQPLMWYKTTFNAPAGNSPLALDMGSMGKGQVWINGQSVGR 661
Query: 629 YWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEF 688
YWP Y A + C C+Y G + KC NCG SQ WYHVPRSW+ N LV+FEE+
Sbjct: 662 YWPAYKASGN-CGV--CNYAGTFNEKKCLTNCGEASQRWYHVPRSWLNTAGNLLVVFEEW 718
Query: 689 GGNPSQINFQTVVVGTACGQAHE----------------NKTMELTCH-----GRRISEI 727
GG+P+ I+ V + C +E NK + H G++IS I
Sbjct: 719 GGDPNGISLVRREVDSVCADIYEWQPTLMNYMMQSSGKVNKPLRPKVHLQCGAGQKISLI 778
Query: 728 KYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVK 787
K+ASFG P+G CG++++GSC A + CVG+ CS+ + G C +K
Sbjct: 779 KFASFGTPEGVCGSYRQGSCHA-FHSYDAFNRLCVGQNWCSVTVAPEMFGGDPC-PNVMK 836
Query: 788 RLVVEALC 795
+L VEA+C
Sbjct: 837 KLAVEAVC 844
>gi|84579373|dbj|BAE72075.1| pear beta-galactosidase3 [Pyrus communis]
Length = 894
Score = 702 bits (1812), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/897 (43%), Positives = 510/897 (56%), Gaps = 135/897 (15%)
Query: 9 RAILLCLILQTLFNLSLAY----RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLI 64
R + LCL +Q + Y VS+D RA+ IDG+R++L+S IHYPR+TP MWPDLI
Sbjct: 12 RCLFLCLAVQFALEAAAEYFKPFNVSYDHRALIIDGKRRMLVSAGIHYPRATPEMWPDLI 71
Query: 65 KKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAE 124
K+KEGG+D I+TY FW+ HEP+R QY+F G D+++F + GLY+ LRIGPYVCAE
Sbjct: 72 AKSKEGGVDVIQTYAFWSGHEPVRGQYNFEGRYDIVKFANLVGASGLYLHLRIGPYVCAE 131
Query: 125 WNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIE 184
WN+GGFPVWL ++PGIE RT N +F EMQ F +VD+ ++E+L + QGGPII+ QIE
Sbjct: 132 WNFGGFPVWLRDIPGIE-FRTNNALFKEEMQRFVKKMVDLMQEEELLSWQGGPIIMLQIE 190
Query: 185 NEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTP 233
NEYGN+ +G GK YI W A+MA L GVPW+MC++ DAP + + P
Sbjct: 191 NEYGNIEGQFGQKGKEYIKWAAEMALGLGAGVPWVMCKQVDAPGSIIDACNGYYCDGYKP 250
Query: 234 NNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGR 293
N+ N P +WTE+W GW+ SWGG+ P R EDLAFAVARF+Q GG+FQNYYMY GGTNFGR
Sbjct: 251 NSYNKPTMWTEDWDGWYASWGGRLPHRPVEDLAFAVARFYQRGGSFQNYYMYFGGTNFGR 310
Query: 294 TSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTN-------- 345
TSGGP+ TSYDYDAPIDEYG L++PKWGHL++LH +K E L + N
Sbjct: 311 TSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSPNYIKLGPKQ 370
Query: 346 ----------------TDYGNSVS-------------------GSSYNLPAWSVSILPDC 370
T YG+ +S G YNLP WSVSILPDC
Sbjct: 371 EAHVYRMNSHTEGLNITSYGSQISCSAFLANIDEHKAASVTFLGQKYNLPPWSVSILPDC 430
Query: 371 KTEEFNTAKVNTQTNVKVKR---PNQAG----------NDQAPLQWKW----RPEMI--- 410
+ +NTAKV QT++K P +G ND + W P +
Sbjct: 431 RNVVYNTAKVGAQTSIKTVEFDLPLYSGISSQQQFITKNDDLFITKSWMTVKEPVGVWSE 490
Query: 411 NDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMT--LRINSSG 468
N+F V+G L T D SDYLW++T + +DD +N++ + I+S
Sbjct: 491 NNFTVQG-------ILEHLNVTKDQSDYLWHITRIFVSEDDISFWEKNNISAAVSIDSMR 543
Query: 469 QVLHAYVNGNYVDS---QWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
VL +VNG S W K E+PVK +G N + LL+ TVGLQNYG+ +
Sbjct: 544 DVLRVFVNGQLTGSVIGHWVK-------VEQPVKFLKGYNDLVLLTQTVGLQNYGAFLEK 596
Query: 526 VPNGIPGPVLLVG-RAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
G G + L G + GD D S WTY+VGL G + K Y + N + W+ +
Sbjct: 597 DGAGFRGQIKLTGFKNGD----IDFSKLLWTYQVGLKG-EFLKIYTIE-ENEKASWAELS 650
Query: 585 VPLN-RRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTE 643
+ WYKT F++P DPV L+L MGKG AWVNG+++GRYW T +A EDGC E
Sbjct: 651 PDDDPSTFIWYKTYFDSPAGTDPVALDLGSMGKGQAWVNGHHIGRYW-TLVAPEDGCP-E 708
Query: 644 SCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVG 703
CDYRG Y SDKC++NCG P+Q YHVPRSW++ N LV+ EE GGNP I+ + G
Sbjct: 709 ICDYRGAYDSDKCSFNCGKPTQTLYHVPRSWLQSSSNLLVILEETGGNPFDISIKLRSAG 768
Query: 704 TACGQAHENK------------------------TMELTCH-GRRISEIKYASFGDPQGA 738
C Q E+ M L C G IS I++AS+G PQG+
Sbjct: 769 VLCAQVSESHYPPVQKWFNPDSVDEKITVNDLTPEMHLQCQDGFTISSIEFASYGTPQGS 828
Query: 739 CGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
C F G+C A + ++ K C+GK SCS+E S + G C G VK L VEA C
Sbjct: 829 CQKFSMGNCHA-TNSSSIVSKSCLGKNSCSVEISNISFGGDPC-RGVVKTLAVEARC 883
>gi|255554022|ref|XP_002518051.1| beta-galactosidase, putative [Ricinus communis]
gi|223542647|gb|EEF44184.1| beta-galactosidase, putative [Ricinus communis]
Length = 897
Score = 702 bits (1811), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/880 (43%), Positives = 507/880 (57%), Gaps = 132/880 (15%)
Query: 22 NLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFW 81
N + VS+D RA+ IDG R++L+SG IHYPR+TP MWPDLI K+KEGG+D I+TYVFW
Sbjct: 33 NFFKPFNVSYDHRALIIDGHRRMLISGGIHYPRATPQMWPDLIAKSKEGGVDVIQTYVFW 92
Query: 82 NAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIE 141
N HEP++ QY F G DL++F+K + GLY+ LRIGPYVCAEWN+GGFPVWL ++PGI
Sbjct: 93 NGHEPVKGQYIFEGQYDLVKFVKLVGVSGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGI- 151
Query: 142 ELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSY 201
RT N FM EMQ F IVD+ ++E LF+ QGGPII+ QIENEYGN+ +G GK Y
Sbjct: 152 VFRTDNSPFMEEMQQFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNIEHSFGPGGKEY 211
Query: 202 INWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWF 250
+ W A+MA L GVPW+MC+++DAP + + PN+ P +WTE+W GW+
Sbjct: 212 VKWAARMALGLGAGVPWVMCRQTDAPGSIIDACNEYYCDGYKPNSNKKPILWTEDWDGWY 271
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPI 310
+WGG P R EDLAFAVARFFQ GG+FQNYYMY GGTNF RT+GGP+ TSYDYDAPI
Sbjct: 272 TTWGGSLPHRPVEDLAFAVARFFQRGGSFQNYYMYFGGTNFARTAGGPFYITSYDYDAPI 331
Query: 311 DEYGHLNQPKWGHLRELHKLLKSMEKTLTYGN---------------------------- 342
DEYG L++PKWGHL++LH +K E L +
Sbjct: 332 DEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGSKQEAHVYRANVHAEGQNLT 391
Query: 343 -----------VTNTDYGNSVS----GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK 387
+ N D +V+ G SY LP WSVS+LPDC+ FNTAKV QT++K
Sbjct: 392 QHGSQSKCSAFLANIDEHKAVTVRFLGQSYTLPPWSVSVLPDCRNAVFNTAKVAAQTSIK 451
Query: 388 -----------VKRPNQ--AGNDQAPLQWKW----RPEMI---NDFVVRGKGHFALNTLI 427
+ P Q A N+ + + W P + N+F V G L
Sbjct: 452 SMELALPQFSGISAPKQLMAQNEGSYMSSSWMTVKEPISVWSGNNFTVEG-------ILE 504
Query: 428 DQKSTNDVSDYLWYMTNADLKDDDPILSGSSNM--TLRINSSGQVLHAYVNGNYVDS--- 482
T D SDYLWY T + DDD +N+ ++I+S VL ++NG S
Sbjct: 505 HLNVTKDHSDYLWYFTRIYVSDDDIAFWEENNVHPAIKIDSMRDVLRVFINGQLTGSVIG 564
Query: 483 QWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVG-RAG 541
+W K +PV+ +G N++ LLS TVGLQNYG+ + G G L G R G
Sbjct: 565 RWIK-------VVQPVQFQKGYNELVLLSQTVGLQNYGAFLERDGAGFRGHTKLTGFRDG 617
Query: 542 DETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNV-PLNRRMTWYKTTFEA 600
D DLS+ +WTY+VGL G +++K Y + N + W+ + + TWYKT F+A
Sbjct: 618 D----IDLSNLEWTYQVGLQG-ENQKIYTTE-NNEKAEWTDLTLDDIPSTFTWYKTYFDA 671
Query: 601 PLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNC 660
P DPV L+L MGKG AWVN +++GRYW T +A E+GC + CDYRG Y S+KC NC
Sbjct: 672 PSGADPVALDLGSMGKGQAWVNDHHIGRYW-TLVAPEEGC--QKCDYRGAYNSEKCRTNC 728
Query: 661 GNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHENK------- 713
G P+QIWYH+PRSW++ N LV+FEE GGNP +I+ + C Q E
Sbjct: 729 GKPTQIWYHIPRSWLQPSNNLLVIFEETGGNPFEISIKLRSASVVCAQVSETHYPPLQRW 788
Query: 714 -----------------TMELTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLP 755
++L C G IS I++AS+G PQG+C F +G+C A + L
Sbjct: 789 IHTDFIYGNVSGKDMTPEIQLRCQDGYVISSIEFASYGTPQGSCQKFSRGNCHAP-NSLS 847
Query: 756 LIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
++ K C G+ +C+I S A G C G VK L VEA C
Sbjct: 848 VVSKACQGRDTCNIAISNAVFGGDPC-RGIVKTLAVEAKC 886
>gi|297735069|emb|CBI17431.3| unnamed protein product [Vitis vinifera]
Length = 845
Score = 702 bits (1811), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/851 (45%), Positives = 497/851 (58%), Gaps = 87/851 (10%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
++LC++LQ L + + V++D +AI I+G+R+IL+SGSIHYPRSTP MW D+I+KAK+G
Sbjct: 11 LVLCMVLQ-LGSQLIQCSVTYDRKAIVINGQRRILISGSIHYPRSTPDMWEDIIQKAKDG 69
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GLD +ETYVFWN HEP Y+F G DL+RFI+T+Q GLY LRIGPYVCAEWN+GGF
Sbjct: 70 GLDVVETYVFWNVHEPSPGSYNFEGRYDLVRFIRTVQKAGLYAHLRIGPYVCAEWNFGGF 129
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
PVWL +PGI RT N+ F MQ FT IV + K E+LF SQGGPIIL+QIENEYG
Sbjct: 130 PVWLKYVPGI-SFRTDNEPFKRAMQGFTEKIVGLMKSERLFESQGGPIILSQIENEYGVQ 188
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSP 239
GDAG Y+ W A MA L GVPW+MC+E DAP P+ F+PN P P
Sbjct: 189 SKLLGDAGHDYMTWAANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDAFSPNKPYKP 248
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
IWTE W+GWF +GG +R +DLAFAVARF Q GG+F NYYMYHGGTNFGRT+GGP+
Sbjct: 249 TIWTEAWSGWFNEFGGPLHQRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPF 308
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLT-------------YGNVTNT 346
+TTSYDYDAPIDEYG + QPK+GHL+ELH+ +K E+ L +V ++
Sbjct: 309 ITTSYDYDAPIDEYGLVRQPKYGHLKELHRSIKLCERALVSADPIVSSLGSFQQAHVYSS 368
Query: 347 DYGNSVSGSS----------------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR 390
D G+ + S YNLP WS+SILPDC+ FNTAKV QT
Sbjct: 369 DAGDCAAFLSNYDTKSSARVMFNNMHYNLPPWSISILPDCRNAVFNTAKVGVQTAHMEML 428
Query: 391 PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKD 449
P A L W+ E I+ + F L++Q T D SDYLWY+T D+
Sbjct: 429 PTNA----EMLSWESYDEDISS--LDDSSTFTTLGLLEQINVTRDASDYLWYITRIDIGS 482
Query: 450 DDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISL 509
+ L G TL + ++G +H ++NG S + F V L G N I+L
Sbjct: 483 SESFLRGGELPTLILQTTGHAVHVFINGQLTGSAFGTREYRRFTFTEKVNLHAGTNTIAL 542
Query: 510 LSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFY 569
LS VGL N G F+ GI GPV L G + DLS +WTYKVGL G +
Sbjct: 543 LSVAVGLPNVGGHFETWNTGILGPVALHGLNQGKW---DLSWQRWTYKVGLKG----EAM 595
Query: 570 NAKAAN--SERGWSSKNVPLNRR--MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYN 625
N + N S W ++ R+ +TW+K F AP ++P+ L+++GMGKG W+NG +
Sbjct: 596 NLVSPNGISSVDWMQGSLAAQRQQPLTWHKAFFNAPEGDEPLALDMEGMGKGQVWINGQS 655
Query: 626 LGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLF 685
+GRYW Y C + C Y G Y KC CG P+Q WYHVPRSW+K N LV+F
Sbjct: 656 IGRYWTAY--ANGNC--QGCSYSGTYRPPKCQLGCGQPTQRWYHVPRSWLKPTQNLLVVF 711
Query: 686 EEFGGNPSQINFQTVVVGTACGQAHE-------------NKTMEL---TCH-----GRRI 724
EE GG+PS+I+ + + C E KT EL H G+ I
Sbjct: 712 EELGGDPSRISLVRRSMTSVCADVFEYHPNIKNWHIESYGKTEELHKPKVHLRCGPGQSI 771
Query: 725 SEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAG 784
S IK+AS+G P G CG+F++G C A D ++EK+C+G++ C++ S N C
Sbjct: 772 SSIKFASYGTPLGTCGSFEQGPCHAP-DSYAIVEKRCIGRQRCAVTISNTNFAQDPC-PN 829
Query: 785 TVKRLVVEALC 795
+KRL VEA+C
Sbjct: 830 VLKRLSVEAVC 840
>gi|359476858|ref|XP_002274449.2| PREDICTED: beta-galactosidase 3 [Vitis vinifera]
Length = 898
Score = 701 bits (1810), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/851 (45%), Positives = 497/851 (58%), Gaps = 87/851 (10%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
++LC++LQ L + + V++D +AI I+G+R+IL+SGSIHYPRSTP MW D+I+KAK+G
Sbjct: 64 LVLCMVLQ-LGSQLIQCSVTYDRKAIVINGQRRILISGSIHYPRSTPDMWEDIIQKAKDG 122
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GLD +ETYVFWN HEP Y+F G DL+RFI+T+Q GLY LRIGPYVCAEWN+GGF
Sbjct: 123 GLDVVETYVFWNVHEPSPGSYNFEGRYDLVRFIRTVQKAGLYAHLRIGPYVCAEWNFGGF 182
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
PVWL +PGI RT N+ F MQ FT IV + K E+LF SQGGPIIL+QIENEYG
Sbjct: 183 PVWLKYVPGI-SFRTDNEPFKRAMQGFTEKIVGLMKSERLFESQGGPIILSQIENEYGVQ 241
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSP 239
GDAG Y+ W A MA L GVPW+MC+E DAP P+ F+PN P P
Sbjct: 242 SKLLGDAGHDYMTWAANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDAFSPNKPYKP 301
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
IWTE W+GWF +GG +R +DLAFAVARF Q GG+F NYYMYHGGTNFGRT+GGP+
Sbjct: 302 TIWTEAWSGWFNEFGGPLHQRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPF 361
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLT-------------YGNVTNT 346
+TTSYDYDAPIDEYG + QPK+GHL+ELH+ +K E+ L +V ++
Sbjct: 362 ITTSYDYDAPIDEYGLVRQPKYGHLKELHRSIKLCERALVSADPIVSSLGSFQQAHVYSS 421
Query: 347 DYGNSVSGSS----------------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR 390
D G+ + S YNLP WS+SILPDC+ FNTAKV QT
Sbjct: 422 DAGDCAAFLSNYDTKSSARVMFNNMHYNLPPWSISILPDCRNAVFNTAKVGVQTAHMEML 481
Query: 391 PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKD 449
P A L W+ E I+ + F L++Q T D SDYLWY+T D+
Sbjct: 482 PTNA----EMLSWESYDEDISS--LDDSSTFTTLGLLEQINVTRDASDYLWYITRIDIGS 535
Query: 450 DDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISL 509
+ L G TL + ++G +H ++NG S + F V L G N I+L
Sbjct: 536 SESFLRGGELPTLILQTTGHAVHVFINGQLTGSAFGTREYRRFTFTEKVNLHAGTNTIAL 595
Query: 510 LSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFY 569
LS VGL N G F+ GI GPV L G + DLS +WTYKVGL G +
Sbjct: 596 LSVAVGLPNVGGHFETWNTGILGPVALHGLNQGKW---DLSWQRWTYKVGLKG----EAM 648
Query: 570 NAKAAN--SERGWSSKNVPLNRR--MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYN 625
N + N S W ++ R+ +TW+K F AP ++P+ L+++GMGKG W+NG +
Sbjct: 649 NLVSPNGISSVDWMQGSLAAQRQQPLTWHKAFFNAPEGDEPLALDMEGMGKGQVWINGQS 708
Query: 626 LGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLF 685
+GRYW Y C + C Y G Y KC CG P+Q WYHVPRSW+K N LV+F
Sbjct: 709 IGRYWTAY--ANGNC--QGCSYSGTYRPPKCQLGCGQPTQRWYHVPRSWLKPTQNLLVVF 764
Query: 686 EEFGGNPSQINFQTVVVGTACGQAHE-------------NKTMEL---TCH-----GRRI 724
EE GG+PS+I+ + + C E KT EL H G+ I
Sbjct: 765 EELGGDPSRISLVRRSMTSVCADVFEYHPNIKNWHIESYGKTEELHKPKVHLRCGPGQSI 824
Query: 725 SEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAG 784
S IK+AS+G P G CG+F++G C A D ++EK+C+G++ C++ S N C
Sbjct: 825 SSIKFASYGTPLGTCGSFEQGPCHAP-DSYAIVEKRCIGRQRCAVTISNTNFAQDPC-PN 882
Query: 785 TVKRLVVEALC 795
+KRL VEA+C
Sbjct: 883 VLKRLSVEAVC 893
>gi|115437888|ref|NP_001043405.1| Os01g0580200 [Oryza sativa Japonica Group]
gi|75272679|sp|Q8W0A1.1|BGAL2_ORYSJ RecName: Full=Beta-galactosidase 2; Short=Lactase 2; Flags:
Precursor
gi|18461259|dbj|BAB84455.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113532936|dbj|BAF05319.1| Os01g0580200 [Oryza sativa Japonica Group]
gi|215736924|dbj|BAG95853.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 827
Score = 701 bits (1809), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/825 (46%), Positives = 492/825 (59%), Gaps = 84/825 (10%)
Query: 30 SHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRR 89
++D +A+ ++G+R+IL+SGSIHYPRSTP MWPDLI+KAK+GGLD ++TYVFWN HEP
Sbjct: 27 TYDRKAVVVNGQRRILISGSIHYPRSTPEMWPDLIEKAKDGGLDVVQTYVFWNGHEPSPG 86
Query: 90 QYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKV 149
QY F G DL+ FIK ++ GLYV LRIGPYVCAEWN+GGFPVWL +PGI RT N+
Sbjct: 87 QYYFEGRYDLVHFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGI-SFRTDNEP 145
Query: 150 FMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMA 209
F EMQ FTT IV+M K E LF QGGPIIL+QIENE+G + D G+ K+Y +W A MA
Sbjct: 146 FKAEMQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMA 205
Query: 210 TSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDP 258
+L+ VPWIMC+E DAP P+ F+PN P+ P +WTE WT W+ +G P
Sbjct: 206 VALNTSVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTAWYTGFGIPVP 265
Query: 259 KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQ 318
R EDLA+ VA+F Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAPIDEYG L +
Sbjct: 266 HRPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLRE 325
Query: 319 PKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN---------------------------- 350
PKWGHL++LHK +K E L G+ T GN
Sbjct: 326 PKWGHLKQLHKAIKLCEPALVAGDPIVTSLGNAQKSSVFRSSTGACAAFLENKDKVSYAR 385
Query: 351 -SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEM 409
+ +G Y+LP WS+SILPDCKT FNTA+V +Q + + AG W+ E
Sbjct: 386 VAFNGMHYDLPPWSISILPDCKTTVFNTARVGSQ--ISQMKMEWAGG----FAWQSYNEE 439
Query: 410 INDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSG 468
IN F G+ L++Q T D +DYLWY T D+ D+ LS N+ L + S+G
Sbjct: 440 INSF---GEDPLTTVGLLEQINVTRDNTDYLWYTTYVDVAQDEQFLSNGENLKLTVMSAG 496
Query: 469 QVLHAYVNGNYVDSQWTKYGASND---LFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
LH ++NG + T YG+ +D + VKL G N IS LS VGL N G F+
Sbjct: 497 HALHIFINGQL---KGTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIAVGLPNVGEHFET 553
Query: 526 VPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNV 585
GI GPV L G +DL+ KWTY+VGL G + ++ E G +
Sbjct: 554 WNAGILGPVTLDGLNEGR---RDLTWQKWTYQVGLKGESMSLHSLSGSSTVEWGEPVQKQ 610
Query: 586 PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESC 645
PL TWYK F AP ++P+ L++ MGKG W+NG +GRYWP Y A + C T C
Sbjct: 611 PL----TWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKASGN-CGT--C 663
Query: 646 DYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTA 705
DYRG Y KC NCG+ SQ WYHVPRSW+ N LV+FEE+GG+P+ I+ +G+
Sbjct: 664 DYRGEYDETKCQTNCGDSSQRWYHVPRSWLSPTGNLLVIFEEWGGDPTGISMVKRSIGSV 723
Query: 706 CGQA--------------HENKTMELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAE 750
C +E + L C +G++I+EIK+ASFG PQG+CG++ +G C A
Sbjct: 724 CADVSEWQPSMKNWHTKDYEKAKVHLQCDNGQKITEIKFASFGTPQGSCGSYTEGGCHAH 783
Query: 751 IDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+ K CVG++ C + G C GT+KR VVEA+C
Sbjct: 784 -KSYDIFWKNCVGQERCGVSVVPEIFGGDPC-PGTMKRAVVEAIC 826
>gi|147818153|emb|CAN78072.1| hypothetical protein VITISV_013292 [Vitis vinifera]
Length = 854
Score = 701 bits (1809), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/847 (44%), Positives = 497/847 (58%), Gaps = 78/847 (9%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
I + L L + + V++D +AI I+G+R+IL+SGSIHYPRSTP MW DLI+KAK+G
Sbjct: 11 IFFFVPLMFLHSQLIQCSVTYDKKAIVINGQRRILISGSIHYPRSTPDMWEDLIRKAKDG 70
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GLD I+TY+FWN HEP Y+F G DL+RFIKT+Q GLYV LRIGPYVCAEWN+GGF
Sbjct: 71 GLDVIDTYIFWNVHEPSPGNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGF 130
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
PVWL +PGI RT N+ F MQ FT IV M K E LFASQGGPIIL+QIENEYG
Sbjct: 131 PVWLKFVPGI-SFRTNNEPFKMAMQGFTQKIVHMMKSENLFASQGGPIILSQIENEYGPE 189
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSP 239
+ G AG +YINW AKMA LD GVPW+MC+E DAP P+ F+PN P P
Sbjct: 190 SRELGAAGHAYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDAFSPNKPYKP 249
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
+IWTE W+GWF +GG +R +DLAF VARF Q GG+F NYYMYHGGTNFGR++GGP+
Sbjct: 250 RIWTEAWSGWFTEFGGTIHRRPVQDLAFGVARFIQNGGSFVNYYMYHGGTNFGRSAGGPF 309
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLT-------------YGNVTNT 346
+TTSYDYDAPIDEYG + QPK+GHL+ELHK +K E + +V ++
Sbjct: 310 ITTSYDYDAPIDEYGLIRQPKYGHLKELHKAIKLCEHAVVSADPTVISLGSYQQAHVFSS 369
Query: 347 DYGNSVSGSS----------------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR 390
GN + S Y+LPAWS+SILPDC+T FNTA+V QT+
Sbjct: 370 GRGNCAAFLSNYNPKSSARVIFNNVHYDLPAWSISILPDCRTVVFNTARVGVQTSHMRMF 429
Query: 391 PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKD 449
P + W+ E I+ + G L++Q + T D +DYLWYMT+ ++
Sbjct: 430 PTNSKLH----SWETYGEDISS--LGSSGTMTAGGLLEQINITRDSTDYLWYMTSVNIDS 483
Query: 450 DDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISL 509
+ L TL + S G +H ++NG Y S + + L G N+I+L
Sbjct: 484 SESFLRRGQTPTLTVQSKGHAVHVFINGQYSGSAYGTRENRKFTYTGAANLHAGTNRIAL 543
Query: 510 LSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFY 569
LS VGL N G F+ GI GPVLL G + +DLS KW+Y+VGL G
Sbjct: 544 LSIAVGLPNVGLHFETWKTGILGPVLL---HGIDQGKRDLSWQKWSYQVGLKGEAMNLVS 600
Query: 570 NAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRY 629
+ E S + + WYK F AP ++P+ L+++ MGKG W+NG ++GRY
Sbjct: 601 PNGVSAVEWVRGSLAAQGQQPLKWYKAYFNAPEGDEPLALDMRSMGKGQVWINGQSIGRY 660
Query: 630 WPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
W Y A+ D C+ C Y G Y KC + CG+P+Q WYHVPRSW+K N L++FEE G
Sbjct: 661 WMAY-AKGD-CNV--CSYSGTYRPPKCQHGCGHPTQRWYHVPRSWLKPTQNLLIIFEELG 716
Query: 690 GNPSQINFQTVVVGTACGQAHENK--------------------TMELTCH-GRRISEIK 728
G+ S+I + + C A+E+ ++ L C G+ IS I
Sbjct: 717 GDASKIALMKRAMKSVCADANEHHPTLENWHTESPSESEELHZASVHLQCAPGQSISTIM 776
Query: 729 YASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKR 788
+ASFG P G CG+F+KG+C A + ++EK C+G++ CS+ S + GA C +KR
Sbjct: 777 FASFGTPSGTCGSFQKGTCHAP-NSQAILEKNCIGQEKCSVPISNSYFGADPC-PNVLKR 834
Query: 789 LVVEALC 795
L VEA C
Sbjct: 835 LSVEAAC 841
>gi|225458151|ref|XP_002280715.1| PREDICTED: beta-galactosidase 3 [Vitis vinifera]
gi|302142564|emb|CBI19767.3| unnamed protein product [Vitis vinifera]
Length = 854
Score = 701 bits (1808), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/847 (44%), Positives = 497/847 (58%), Gaps = 78/847 (9%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
I + L L + + V++D +AI I+G+R+IL+SGSIHYPRSTP MW DLI+KAK+G
Sbjct: 11 IFFFVPLMFLHSQLIQCSVTYDKKAIVINGQRRILISGSIHYPRSTPDMWEDLIRKAKDG 70
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GLD I+TY+FWN HEP Y+F G DL+RFIKT+Q GLYV LRIGPYVCAEWN+GGF
Sbjct: 71 GLDVIDTYIFWNVHEPSPGNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGF 130
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
PVWL +PGI RT N+ F MQ FT IV M K E LFASQGGPIIL+QIENEYG
Sbjct: 131 PVWLKFVPGI-SFRTNNEPFKMAMQGFTQKIVHMMKSENLFASQGGPIILSQIENEYGPE 189
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSP 239
+ G AG +YINW AKMA LD GVPW+MC+E DAP P+ F+PN P P
Sbjct: 190 SRELGAAGHAYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDAFSPNKPYKP 249
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
+IWTE W+GWF +GG +R +DLAF VARF Q GG+F NYYMYHGGTNFGR++GGP+
Sbjct: 250 RIWTEAWSGWFTEFGGTIHRRPVQDLAFGVARFIQNGGSFVNYYMYHGGTNFGRSAGGPF 309
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLT-------------YGNVTNT 346
+TTSYDYDAPIDEYG + QPK+GHL+ELHK +K E + +V ++
Sbjct: 310 ITTSYDYDAPIDEYGLIRQPKYGHLKELHKAIKLCEHAVVSADPTVISLGSYQQAHVFSS 369
Query: 347 DYGNSVSGSS----------------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR 390
GN + S Y+LPAWS+SILPDC+T FNTA+V QT+
Sbjct: 370 GRGNCAAFLSNYNPKSSARVIFNNVHYDLPAWSISILPDCRTVVFNTARVGVQTSHMRMF 429
Query: 391 PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKD 449
P + W+ E I+ + G L++Q + T D +DYLWYMT+ ++
Sbjct: 430 PTNSKLH----SWETYGEDISS--LGSSGTMTAGGLLEQINITRDSTDYLWYMTSVNIDS 483
Query: 450 DDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISL 509
+ L TL + S G +H ++NG Y S + + L G N+I+L
Sbjct: 484 SESFLRRGQTPTLTVQSKGHAVHVFINGQYSGSAYGTRENRKFTYTGAANLHAGTNRIAL 543
Query: 510 LSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFY 569
LS VGL N G F+ GI GPVLL G + +DLS KW+Y+VGL G
Sbjct: 544 LSIAVGLPNVGLHFETWKTGILGPVLL---HGIDQGKRDLSWQKWSYQVGLKGEAMNLVS 600
Query: 570 NAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRY 629
+ E S + + WYK F AP ++P+ L+++ MGKG W+NG ++GRY
Sbjct: 601 PNGVSAVEWVRGSLAAQGQQPLKWYKAYFNAPEGDEPLALDMRSMGKGQVWINGQSIGRY 660
Query: 630 WPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
W Y A+ D C+ C Y G Y KC + CG+P+Q WYHVPRSW+K N L++FEE G
Sbjct: 661 WMAY-AKGD-CNV--CSYSGTYRPPKCQHGCGHPTQRWYHVPRSWLKPTQNLLIIFEELG 716
Query: 690 GNPSQINFQTVVVGTACGQAHENK--------------------TMELTCH-GRRISEIK 728
G+ S+I + + C A+E+ ++ L C G+ IS I
Sbjct: 717 GDASKIALMKRAMKSVCADANEHHPTLENWHTESPSESEELHEASVHLQCAPGQSISTIM 776
Query: 729 YASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKR 788
+ASFG P G CG+F+KG+C A + ++EK C+G++ CS+ S + GA C +KR
Sbjct: 777 FASFGTPSGTCGSFQKGTCHAP-NSQAILEKNCIGQEKCSVPISNSYFGADPC-PNVLKR 834
Query: 789 LVVEALC 795
L VEA C
Sbjct: 835 LSVEAAC 841
>gi|356540789|ref|XP_003538867.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 853
Score = 701 bits (1808), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/852 (45%), Positives = 491/852 (57%), Gaps = 83/852 (9%)
Query: 10 AILLCLILQTLFNLSLAY-RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAK 68
A CL L F L + V++D +AI I+G+R+IL SGSIHYPRSTP MW DLI KAK
Sbjct: 12 AAFFCLALWLGFQLEQVHCSVTYDRKAILINGQRRILFSGSIHYPRSTPDMWEDLIYKAK 71
Query: 69 EGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYG 128
EGGLD IETY+FWN HEP R Y+F G DL+RF+KTIQ GLY LRIGPYVCAEWN+G
Sbjct: 72 EGGLDVIETYIFWNVHEPSRGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFG 131
Query: 129 GFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYG 188
GFPVWL +PGI RT N+ F MQ FT IV M K E+L+ SQGGPIIL+QIENEYG
Sbjct: 132 GFPVWLKYVPGIS-FRTDNEPFKKAMQGFTEKIVGMMKSERLYESQGGPIILSQIENEYG 190
Query: 189 NVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPN 237
G AG++Y+NW AKMA GVPW+MC+E DAP P+ FTPN P
Sbjct: 191 AQSKLLGPAGQNYVNWAAKMAVETGTGVPWVMCKEDDAPDPVINTCNGFYCDYFTPNKPY 250
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
P IWTE W+GWF +GG + +R +DLAF VARF Q GG+F NYYMYHGGTNFGRT+GG
Sbjct: 251 KPSIWTEAWSGWFSEFGGPNHERPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGG 310
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN------- 350
P++TTSYDYDAP+DEYG + QPK+GHL+ELHK +K E+ L + T GN
Sbjct: 311 PFITTSYDYDAPLDEYGLIRQPKYGHLKELHKAIKMCERALVSADPAVTSMGNFQQAHVY 370
Query: 351 -SVSGSS---------------------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKV 388
+ SG YNLP WS+SILPDC+ FNTAKV QT+
Sbjct: 371 TTKSGDCAAFLSNFDTKSSVRVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSQMQ 430
Query: 389 KRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADL 447
P + W+ E I+ + L++Q T D SDYLWY+T+ D+
Sbjct: 431 MLPT----NTHMFSWESFDEDISSLDDGSAITITTSGLLEQINVTRDTSDYLWYITSVDI 486
Query: 448 KDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFER---PVKLTRGK 504
+ L G TL + S+G +H ++NG S YG D R V L G
Sbjct: 487 GSSESFLRGGKLPTLIVQSTGHAVHVFINGQLSGS---AYGTREDRRFRYTGTVNLRAGT 543
Query: 505 NQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLD 564
N+I+LLS VGL N G F+ GI GPV+L G + DLS KWTY+VGL G
Sbjct: 544 NRIALLSVAVGLPNVGGHFETWNTGILGPVVLRGLNQGKL---DLSWQKWTYQVGLKGEA 600
Query: 565 DKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGY 624
++ E S+ N+ +TW+KT F+AP ++P+ L+++GMGKG W+NG
Sbjct: 601 MNLASPNGISSVEWMQSALVSEKNQPLTWHKTYFDAPDGDEPLALDMEGMGKGQIWINGL 660
Query: 625 NLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVL 684
++GRYW A C Y G + KC CG P+Q WYHVPRSW+K N LV+
Sbjct: 661 SIGRYWTAPAAG----ICNGCSYAGTFRPPKCQVGCGQPTQRWYHVPRSWLKPNHNLLVV 716
Query: 685 FEEFGGNPSQINFQTVVVGTAC------------------GQAHENKTMELTCH---GRR 723
FEE GG+PS+I+ V + C G++ E ++ H +
Sbjct: 717 FEELGGDPSKISLVKRSVSSICADVSEYHPNIRNWHIDSYGKSEEFHPPKVHLHCSPSQA 776
Query: 724 ISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAA 783
IS IK+ASFG P G CG ++KG C + L EK+C+GK C++ S +N G C
Sbjct: 777 ISSIKFASFGTPLGTCGNYEKGVCHSPTSYATL-EKKCIGKPRCTVTVSNSNFGQDPC-P 834
Query: 784 GTVKRLVVEALC 795
+KRL VEA+C
Sbjct: 835 NVLKRLSVEAVC 846
>gi|15081596|gb|AAK81874.1| putative beta-galactosidase BG1 [Vitis vinifera]
Length = 854
Score = 701 bits (1808), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/847 (44%), Positives = 497/847 (58%), Gaps = 78/847 (9%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
I + L L + + V++D +AI I+G+R+IL+SGSIHYPRSTP MW DLI+KAK+G
Sbjct: 11 IFFFVPLMFLHSQLIQCSVTYDKKAIVINGQRRILISGSIHYPRSTPDMWEDLIRKAKDG 70
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GLD I+TY+FWN HEP Y+F G DL+RFIKT+Q GLYV LRIGPYVCAEWN+GGF
Sbjct: 71 GLDVIDTYIFWNVHEPSPGNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGF 130
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
PVWL +PGI RT N+ F MQ FT IV M K E LFASQGGPIIL+QIENEYG
Sbjct: 131 PVWLKFVPGI-SFRTNNEPFKMAMQGFTQKIVHMMKSENLFASQGGPIILSQIENEYGPE 189
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSP 239
+ G AG +YINW AKMA LD GVPW+MC+E DAP P+ F+PN P P
Sbjct: 190 SRELGAAGHAYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDAFSPNKPYKP 249
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
+IWTE W+GWF +GG +R +DLAF VARF Q GG+F NYYMYHGGTNFGR++GGP+
Sbjct: 250 RIWTEAWSGWFTEFGGTIHRRPVQDLAFGVARFIQNGGSFVNYYMYHGGTNFGRSAGGPF 309
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLT-------------YGNVTNT 346
+TTSYDYDAPIDEYG + QPK+GHL+ELHK +K E + +V ++
Sbjct: 310 ITTSYDYDAPIDEYGLIRQPKYGHLKELHKAIKLCEHAVVSADPTVISLGSYQQAHVFSS 369
Query: 347 DYGNSVSGSS----------------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR 390
GN + S Y+LPAWS+SILPDC+T FNTA+V QT+
Sbjct: 370 GRGNCAAFLSNYNPKSSARVIFNNVHYDLPAWSISILPDCRTVVFNTARVGVQTSHMRMF 429
Query: 391 PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKD 449
P + W+ E I+ + G L++Q + T D +DYLWYMT+ ++
Sbjct: 430 PTNSKLH----SWETYGEDISS--LGSSGTMTAGGLLEQINITRDSTDYLWYMTSVNIDS 483
Query: 450 DDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISL 509
+ L TL + S G +H ++NG Y S + + L G N+I+L
Sbjct: 484 SESFLRRGQTPTLTVQSKGHAVHVFINGQYSGSAYGTRENRKFTYTGAANLHAGTNRIAL 543
Query: 510 LSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFY 569
LS VGL N G F+ GI GPVLL G + +DLS KW+Y+VGL G
Sbjct: 544 LSIAVGLPNVGLHFETWKTGILGPVLL---HGIDQGKRDLSWQKWSYQVGLKGEAMNLVS 600
Query: 570 NAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRY 629
+ E S + + WYK F AP ++P+ L+++ MGKG W+NG ++GRY
Sbjct: 601 PNGVSAVEWVRGSLAAQGQQPLKWYKAYFNAPEGDEPLALDMRSMGKGQVWINGQSIGRY 660
Query: 630 WPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
W Y A+ D C+ C Y G Y KC + CG+P+Q WYHVPRSW+K N L++FEE G
Sbjct: 661 WMAY-AKGD-CNV--CSYSGTYRPPKCQHGCGHPTQRWYHVPRSWLKPTQNLLIIFEELG 716
Query: 690 GNPSQINFQTVVVGTACGQAHENK--------------------TMELTCH-GRRISEIK 728
G+ S+I + + C A+E+ ++ L C G+ IS I
Sbjct: 717 GDASKIALMKRAMKSVCADANEHHPTLENWHTESPSESEELHQASVHLQCAPGQSISTIM 776
Query: 729 YASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKR 788
+ASFG P G CG+F+KG+C A + ++EK C+G++ CS+ S + GA C +KR
Sbjct: 777 FASFGTPSGTCGSFQKGTCHAP-NSQAILEKNCIGQEKCSVPISNSYFGADPC-PNVLKR 834
Query: 789 LVVEALC 795
L VEA C
Sbjct: 835 LSVEAAC 841
>gi|157313304|gb|ABV32545.1| beta-galactosidase protein 2 [Prunus persica]
Length = 841
Score = 700 bits (1807), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/831 (45%), Positives = 492/831 (59%), Gaps = 82/831 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D +AI I+G+R+IL+SGSIHYPRS+P MWPDLI+KAKEGGLD I+TYVFWN HEP
Sbjct: 28 VSYDSKAIVINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPSP 87
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y F N DL++FIK IQ GLYV LRIGPYVCAEWN+GGFPVWL +PGI + RT N
Sbjct: 88 GKYYFEDNYDLVKFIKLIQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGI-QFRTDNG 146
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F +MQ FTT IV+M K E+LF SQGGPIIL+QIENEYG + + G GK Y +W A M
Sbjct: 147 PFKAQMQRFTTKIVNMMKAERLFQSQGGPIILSQIENEYGPMEYELGAPGKVYTDWAAHM 206
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A L GVPW+MC++ DAP P+ F+PN PK+WTE WTGW+ +GG
Sbjct: 207 ALGLGTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKMWTEAWTGWYTEFGGAV 266
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R AEDLAF+VARF Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAP+DEYG L
Sbjct: 267 PSRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLLR 326
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------SVSGSS------------- 356
QPKWGHL++LH+ +K E L + T T G S SG+
Sbjct: 327 QPKWGHLKDLHRAIKLCEPALVSADPTVTPLGTYQEAHVFKSKSGACAAFLANYNPRSFA 386
Query: 357 --------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
YNLP WS+SILPDCK +NTA+V Q+ ++K P + PL + +
Sbjct: 387 KVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQS-AQMKMP------RVPLHGAFSWQ 439
Query: 409 MIND-FVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINS 466
ND F L++Q +T D SDYLWY+T+ + ++ L L I S
Sbjct: 440 AYNDETATYADTSFTTAGLLEQINTTRDSSDYLWYLTDVKIDPNEEFLRSGKYPVLTILS 499
Query: 467 SGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMV 526
+G L ++NG + + F + V L G NQI+LLS VGL N G F+
Sbjct: 500 AGHALRVFINGQLAGTSYGSLEFPKLTFSQGVNLRAGINQIALLSIAVGLPNVGPHFETW 559
Query: 527 PNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVP 586
G+ GPV+L G +DLS KW+YKVGL G + +++ E W ++
Sbjct: 560 NAGVLGPVILNGLNEGR---RDLSWQKWSYKVGLKGEALSLHSLSGSSSVE--WIQGSLV 614
Query: 587 LNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESC 645
R+ +TWYKTTF AP N P+ L++ MGKG W+NG ++GRYWP Y A S +C
Sbjct: 615 TRRQPLTWYKTTFNAPAGNSPLALDMGSMGKGQVWINGRSIGRYWPAYKASG---SCGAC 671
Query: 646 DYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTA 705
+Y G Y KC NCG SQ WYHVPR+W+ N LV+ EE+GG+P+ I + +
Sbjct: 672 NYAGSYHEKKCLSNCGEASQRWYHVPRTWLNPTGNLLVVLEEWGGDPNGIFLVRREIDSI 731
Query: 706 CGQAHE--------------------NKTMELTC-HGRRISEIKYASFGDPQGACGAFKK 744
C +E L+C G++IS IK+ASFG P+G CG+F++
Sbjct: 732 CADIYEWQPNLMSWQMQASGKVKKPVRPKAHLSCGPGQKISSIKFASFGTPEGGCGSFRE 791
Query: 745 GSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
GSC A + ++ C+G+ SCS+ + N G C +K+L VEA+C
Sbjct: 792 GSCHAH-NSYDAFQRSCIGQNSCSVTVAPENFGGDPC-PNVMKKLSVEAIC 840
>gi|359482511|ref|XP_002279310.2| PREDICTED: beta-galactosidase-like [Vitis vinifera]
Length = 828
Score = 699 bits (1805), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/833 (45%), Positives = 488/833 (58%), Gaps = 82/833 (9%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A+ VS+D RAI I+G+R+IL+SGSIHYPRS+P MWPDLI+KAKEGGLD I+TYVFWN HE
Sbjct: 14 AWNVSYDRRAIVINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHE 73
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P + +Y F G DL+RFIK ++ GLYV LRIGPYVCAEWN+GGFPVWL + GI RT
Sbjct: 74 PSQGKYYFEGRYDLVRFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVQGI-NFRT 132
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
N+ F MQ FT IVDM K E LF SQGGPIIL+QIENEYG + + G G++Y W
Sbjct: 133 NNEPFKWHMQRFTKKIVDMMKSEGLFESQGGPIILSQIENEYGPMEYEIGAPGRAYTEWA 192
Query: 206 AKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWG 254
AKMA L GVPW+MC++ DAP P+ F+PN PK+WTE WTGWF +G
Sbjct: 193 AKMAVGLGTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFG 252
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
G P R AEDLAF+VARF Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAP+DE+G
Sbjct: 253 GAVPHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEFG 312
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------SVSGS----------- 355
L QPKWGHL++LH+ +K E L G+ T T GN S SG+
Sbjct: 313 LLRQPKWGHLKDLHRAIKLCEPALISGDPTVTSLGNYEEAHVFHSKSGACAAFLANYNPR 372
Query: 356 ----------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKW 405
YNLP WS+SILPDCK +NTA++ Q+ P W+
Sbjct: 373 SYAKVSFRNMHYNLPPWSISILPDCKNTVYNTARLGAQSATMKMTP-----VSGRFGWQS 427
Query: 406 RPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRI 464
E + FA L++Q +T DVSDYLWY T+ + ++ L L +
Sbjct: 428 YNEETASY---DDSSFAAVGLLEQINTTRDVSDYLWYSTDVKIGYNEGFLKSGRYPVLTV 484
Query: 465 NSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFD 524
S+G LH ++NG + + F + VKL G N I+LLS VGL N G F+
Sbjct: 485 LSAGHALHVFINGRLSGTAYGSLENPKLTFSQGVKLRAGVNTIALLSIAVGLPNVGPHFE 544
Query: 525 MVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
G+ GPV L G +DLS KW+YKVGL G + +++ E W +
Sbjct: 545 TWNAGVLGPVSLNGLNEGR---RDLSWQKWSYKVGLKGEALSLHSLSGSSSVE--WVEGS 599
Query: 585 V-PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTE 643
+ + +TWYKTTF AP N P+ L++ MGKG W+NG N+GRYWP Y A GC
Sbjct: 600 LMARGQPLTWYKTTFNAPGGNTPLALDMGSMGKGQIWINGQNVGRYWPAYKA-TGGCG-- 656
Query: 644 SCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVG 703
C+Y G Y KC NCG PSQ WYHVP SW+ N LV+FEE GGNP+ I+ +
Sbjct: 657 DCNYAGTYSEKKCLSNCGEPSQRWYHVPHSWLSPTGNLLVVFEESGGNPAGISLVEREIE 716
Query: 704 TACGQAHE----------------NKTMELTCH-----GRRISEIKYASFGDPQGACGAF 742
+ C +E NK + H G++IS IK+ASFG P+G CG++
Sbjct: 717 SVCADIYEWQPTLMNYEMQASGKVNKPLRPKAHLWCAPGQKISSIKFASFGTPEGVCGSY 776
Query: 743 KKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
++GSC A E+ C+G SCS+ + G C + +K+L VEA+C
Sbjct: 777 REGSCHAH-KSYDAFERSCIGMNSCSVTVAPEIFGGDPCPS-VMKKLSVEAIC 827
>gi|356526021|ref|XP_003531618.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 843
Score = 699 bits (1804), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/856 (45%), Positives = 499/856 (58%), Gaps = 83/856 (9%)
Query: 4 LKHCSRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDL 63
LK + +LL + +L + A VS+D +AI I+G+R+ILLSGSIHYPRSTP MWPDL
Sbjct: 6 LKVWNVPLLLVVFACSLLGQASA-SVSYDHKAIIINGQRRILLSGSIHYPRSTPEMWPDL 64
Query: 64 IKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCA 123
I+KAKEGGLD I+TYVFWN HEP +Y F GN DL+RFIK +Q GLYV LRIGPYVCA
Sbjct: 65 IQKAKEGGLDVIQTYVFWNGHEPSPGKYYFGGNYDLVRFIKLVQQAGLYVNLRIGPYVCA 124
Query: 124 EWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQI 183
EWN+GGFPVWL +PGI RT N F +M+ FT IVDM K E+LF SQGGPIIL+QI
Sbjct: 125 EWNFGGFPVWLKYIPGI-SFRTDNGPFKFQMEKFTKKIVDMMKAERLFESQGGPIILSQI 183
Query: 184 ENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FT 232
ENEYG + + G G+SY W A MA L GVPWIMC++ DAP P+ F+
Sbjct: 184 ENEYGPMEYEIGAPGRSYTQWAAHMAVGLGTGVPWIMCKQDDAPDPIINTCNGFYCDYFS 243
Query: 233 PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG 292
PN PK+WTE WTGWF +GG P R AEDLAF++ARF Q GG+F NYYMYHGGTNFG
Sbjct: 244 PNKAYKPKMWTEAWTGWFTEFGGAVPHRPAEDLAFSIARFIQKGGSFVNYYMYHGGTNFG 303
Query: 293 RTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN-- 350
RT+GGP++ TSYDYDAP+DEYG QPKWGHL++LH+ +K E L G+ T GN
Sbjct: 304 RTAGGPFIATSYDYDAPLDEYGLARQPKWGHLKDLHRAIKLCEPALVSGDSTVQRLGNYE 363
Query: 351 ------SVSGSS---------------------YNLPAWSVSILPDCKTEEFNTAKVNTQ 383
S SG+ YNLP WS+SILP+CK +NTA+V +Q
Sbjct: 364 EAHVFRSKSGACAAFLANYNPQSYATVAFGNQHYNLPPWSISILPNCKHTVYNTARVGSQ 423
Query: 384 -TNVKVKR-PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLW 440
T +K+ R P G L WK E + F + L++Q +T D+SDYLW
Sbjct: 424 STTMKMTRVPIHGG-----LSWKAFNE---ETTTTDDSSFTVTGLLEQINATRDLSDYLW 475
Query: 441 YMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKL 500
Y T+ + ++ L N L + S+G LH ++N + + A F V+L
Sbjct: 476 YSTDVVINSNEGFLRNGKNPVLTVLSAGHALHVFINNQLSGTAYGSLEAPKLTFSESVRL 535
Query: 501 TRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGL 560
G N+ISLLS VGL N G F+ G+ GP+ L G +DL+ KW+YKVGL
Sbjct: 536 RAGVNKISLLSVAVGLPNVGPHFERWNAGVLGPITLSGLNEGR---RDLTWQKWSYKVGL 592
Query: 561 YGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAW 620
G + ++ ++S V + +TWYKTTF+AP P+ L++ MGKG W
Sbjct: 593 KG-EALNLHSLSGSSSVEWLQGFLVSRRQPLTWYKTTFDAPAGVAPLALDMGSMGKGQVW 651
Query: 621 VNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVN 680
+NG +LGRYWP Y A S C+Y G Y KC NCG SQ WYHVP SW+K N
Sbjct: 652 INGQSLGRYWPAYKASG---SCGYCNYAGTYNEKKCGSNCGEASQRWYHVPHSWLKPSGN 708
Query: 681 TLVLFEEFGGNPSQINFQTVVVGTACGQAHE--------------------NKTMELTC- 719
LV+FEE GG+P+ I + + C +E L+C
Sbjct: 709 LLVVFEELGGDPNGIFLVRRDIDSVCADIYEWQPNLVSYEMQASGKVRSPVRPKAHLSCG 768
Query: 720 HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGAT 779
G++IS IK+ASFG P G+CG++++GSC A + K CVG+ C++ S G
Sbjct: 769 PGQKISSIKFASFGTPVGSCGSYREGSCHAHKSYDAFL-KNCVGQSWCTVTVSPEIFGGD 827
Query: 780 SCAAGTVKRLVVEALC 795
C +K+L VEA+C
Sbjct: 828 PCPR-VMKKLSVEAIC 842
>gi|357131396|ref|XP_003567324.1| PREDICTED: beta-galactosidase 3-like [Brachypodium distachyon]
Length = 916
Score = 699 bits (1804), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/829 (44%), Positives = 487/829 (58%), Gaps = 77/829 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DGR++ I G R++L+S SIHYPRS P MWP L+ +AK+GG D IETYVFWN HE
Sbjct: 102 VTYDGRSLIISGRRRLLISTSIHYPRSVPAMWPKLVAEAKDGGADCIETYVFWNGHETAP 161
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y F DL+RF K ++D GLY++LRIGP+V AEWN+GG PVWLH +PG RT N+
Sbjct: 162 GEYYFEDRFDLVRFAKVVKDAGLYLMLRIGPFVAAEWNFGGVPVWLHYIPG-AVFRTNNE 220
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F + M++FTT IVDM K+E+ FASQGG IILAQIENEYG+ YG GK+Y W A M
Sbjct: 221 PFKSHMKSFTTKIVDMMKRERFFASQGGHIILAQIENEYGDTEQAYGADGKAYAMWAASM 280
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A + + GVPWIMCQ+ DAP + F N+P PKIWTENW GWF+++G +
Sbjct: 281 ALAQNTGVPWIMCQQYDAPEHVINTCNSFYCDQFKTNSPTKPKIWTENWPGWFQTFGESN 340
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R ED+AF+VARFFQ GG+ QNYY+YHGGTNFGRT+GGP++TTSYDYDAPIDEYG
Sbjct: 341 PHRPPEDVAFSVARFFQKGGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGLTR 400
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------------------------- 350
PKW HLR+LHK +K E +L YGN+T+ G
Sbjct: 401 LPKWAHLRDLHKSIKLCEHSLLYGNLTSLSLGTKQEADVYTDHSGGCVAFLANIDPENDT 460
Query: 351 --SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
+ Y+LPAWSVSILPDCK FNTAKV +QT + V + P +W E
Sbjct: 461 VVTFRSRQYDLPAWSVSILPDCKNAVFNTAKVQSQT-LMVDMVPETLQSTKPDRWSIFRE 519
Query: 409 MINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
+ K F N +D +T D +DYLW+ T+ ++ P + + L I+S
Sbjct: 520 KTG---IWDKNDFIRNGFVDHINTTKDSTDYLWHTTSFNVDRSYP--TNGNRELLSIDSK 574
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G +HA++N + S + S+ P+KL GKN+I+LLS TVGLQN G ++ V
Sbjct: 575 GHAVHAFLNNELIGSAYGNGSKSSFNVHMPIKLKPGKNEIALLSMTVGLQNAGPHYEWVG 634
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
G+ V +G + DLSS+ W YK+GL G + + N++R P
Sbjct: 635 AGLTS----VNISGMKNGSIDLSSNNWAYKIGLEG-EHYGLFKPDQGNNQRWSPQSEPPK 689
Query: 588 NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDY 647
+ +TWYK + P +DPV +++Q MGKG AW+NG +GRYWP + +D C T SC+Y
Sbjct: 690 GQPLTWYKVNVDVPQGDDPVGIDMQSMGKGLAWLNGNAIGRYWPRTSSSDDRC-TPSCNY 748
Query: 648 RGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACG 707
RGP+ KC CG P+Q WYHVPRSW NTLV+FEE GG+P++I F V C
Sbjct: 749 RGPFNPSKCRTGCGKPTQRWYHVPRSWFHPSGNTLVVFEEQGGDPTKITFSRRVATKVCS 808
Query: 708 QAHEN--------------------KTMELTC-HGRRISEIKYASFGDPQGACGAFKKGS 746
EN ++L+C G+ IS +K+ASFGDP G C ++++G
Sbjct: 809 FVSENYPSIDLESWDKSISDDGKDTAKVQLSCPKGKNISSVKFASFGDPSGTCRSYQQGR 868
Query: 747 CEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
C L ++EK C+ SC++ S+ G C G K L +EA C
Sbjct: 869 CH-HPSSLSVVEKACLNINSCTVSLSDEGFGKDLC-PGVAKTLAIEADC 915
>gi|226494417|ref|NP_001151478.1| LOC100285111 precursor [Zea mays]
gi|195647054|gb|ACG42995.1| beta-galactosidase precursor [Zea mays]
Length = 844
Score = 699 bits (1803), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/833 (45%), Positives = 491/833 (58%), Gaps = 78/833 (9%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A V++D R++ I G R++++S SIHYPRS P MWP L+ +AK+GG D IETYVFWN HE
Sbjct: 26 ASNVTYDHRSLIISGRRRLVISTSIHYPRSVPEMWPKLVAEAKDGGADCIETYVFWNGHE 85
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
QY F DL+RF+K ++D GL +ILRIGPYV AEWNYGG PVWLH +PG RT
Sbjct: 86 IAPGQYYFEDRFDLVRFVKVVRDAGLLLILRIGPYVAAEWNYGGVPVWLHYVPGTV-FRT 144
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSD-YGDAGKSYINW 204
N+ F N M++FTT IVDM KKE+LFASQGG IILAQIENEYG+ YG GK Y W
Sbjct: 145 NNEPFKNHMKSFTTYIVDMMKKEQLFASQGGNIILAQIENEYGDYYEQAYGAGGKPYAMW 204
Query: 205 CAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSW 253
A MA + + GVPWIMCQESDAP P+ F PN+P PKIWTENW GWF+++
Sbjct: 205 AASMALAQNTGVPWIMCQESDAPDPVINSCNGFYCDGFQPNSPTKPKIWTENWPGWFQTF 264
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEY 313
G +P R ED+AFAVARFF+ GG+ QNYY+YHGGTNFGRT+GGP++TTSYDYDAPIDEY
Sbjct: 265 GESNPHRPPEDVAFAVARFFEKGGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEY 324
Query: 314 GHLNQPKWGHLRELHKLLKSMEKTLTYGNVT-------------------------NTDY 348
G PKW HLRELHK ++ E TL YGN T N D
Sbjct: 325 GLRRFPKWAHLRELHKSIRLCEHTLLYGNTTFLSLGPKQEADIYSDQSGGCVAFLANIDS 384
Query: 349 GN----SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWK 404
N + Y+LPAWSVSILPDC+ FNTAKV +QT++ P ++ P +W
Sbjct: 385 ANDKVVTFRNRQYDLPAWSVSILPDCRNVVFNTAKVQSQTSMVTMVP-ESLQASKPERWS 443
Query: 405 WRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLR 463
E + GK F N +D +T D +DYLWY T+ + D S S+ L
Sbjct: 444 IFRERTG---IWGKNDFVRNGFVDHINTTKDSTDYLWYTTSFSV--DGSYSSKGSHAVLN 498
Query: 464 INSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF 523
I+S+G +HA++N + S + S + + L GKN+++LLS TVGLQN G +
Sbjct: 499 IDSNGHGVHAFLNNVLIGSAYGNGSQSRFSVKLTINLRTGKNELALLSMTVGLQNAGFAY 558
Query: 524 DMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSK 583
+ + G V +G T I DLSS+ W YK+GL G + + N++R
Sbjct: 559 EWIGAGFTN----VNISGVRTGIIDLSSNNWAYKIGLEG-EYYNLFKPDQTNNQRWIPQS 613
Query: 584 NVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTE 643
P N+ +TWYK + P +DPV +++Q MGKG AW+NG +GRYWP + D C T
Sbjct: 614 EPPKNQPLTWYKVNVDVPQGDDPVGIDMQSMGKGLAWLNGNAIGRYWPRTSSINDRC-TP 672
Query: 644 SCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVG 703
SC+YRG + DKC CG P+Q WYH+PRSW N LV+FEE GG+P++I F V
Sbjct: 673 SCNYRGTFIPDKCRTGCGQPTQRWYHIPRSWFHPSGNILVVFEEKGGDPTKITFSRRAVT 732
Query: 704 TACGQAHEN--------------------KTMELTC-HGRRISEIKYASFGDPQGACGAF 742
+ C E+ +L+C G+ IS +K+AS G+P G C ++
Sbjct: 733 SVCSFVSEHFPSIDLESWDESAMNEGTPPAKAQLSCPEGKSISSVKFASLGNPSGTCRSY 792
Query: 743 KKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+ G C + L ++EK C+ SC++ ++ + G C G K L +EA C
Sbjct: 793 QMGRCH-HPNSLSVVEKACLNTNSCTVSLTDESFGKDLC-HGVTKTLAIEADC 843
>gi|218189464|gb|EEC71891.1| hypothetical protein OsI_04635 [Oryza sativa Indica Group]
Length = 851
Score = 698 bits (1802), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/829 (44%), Positives = 483/829 (58%), Gaps = 78/829 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D R++ I G R++L+S SIHYPRS P MWP L+ +AK+GG D +ETYVFWN HEP +
Sbjct: 38 VTYDQRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHEPAQ 97
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QY F DL+RF K ++D GLY+ILRIGP+V AEW +GG PVWLH PG RT N+
Sbjct: 98 GQYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWTFGGVPVWLHYAPGTV-FRTNNE 156
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F + M+ FTT IVDM KKE+ FASQGG IILAQ+ENEYG++ YG K Y W A M
Sbjct: 157 PFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEYGDMEQAYGAGAKPYAMWAASM 216
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A + + GVPWIMCQ+ DAP P+ F PN+P PK WTENW GWF+++G +
Sbjct: 217 ALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQFKPNSPTKPKFWTENWPGWFQTFGESN 276
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R ED+AF+VARFF GG+ QNYY+YHGGTNFGRT+GGP++TTSYDYDAPIDEYG
Sbjct: 277 PHRPPEDVAFSVARFFGKGGSLQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGLRR 336
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------------------------- 350
PKW HLR+LHK +K E TL YGN + G
Sbjct: 337 LPKWAHLRDLHKSIKLGEHTLLYGNSSFVSLGPQQEADVYTDQSGGCVAFLSNVDSEKDK 396
Query: 351 --SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
+ SY+LPAWSVSILPDCK FNTAKV +QT + P + + +R +
Sbjct: 397 VVTFQSRSYDLPAWSVSILPDCKNVAFNTAKVRSQTLMMDMVPANLESSKVDGWSIFREK 456
Query: 409 MINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
+ + G N +D +T D +DYLWY T+ D+ D L+G N L I S
Sbjct: 457 ----YGIWGNIDLVRNGFVDHINTTKDSTDYLWYTTSFDV--DGSHLAGG-NHVLHIESK 509
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G + A++N + S + SN E PV L GKN++SLLS TVGLQN G ++
Sbjct: 510 GHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAGKNKLSLLSMTVGLQNGGPMYEWAG 569
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
GI V +G E I DLSS+KW YK+GL G + + A R P
Sbjct: 570 AGITS----VKISGMENRIIDLSSNKWEYKIGLEG-EYYSLFKADKGKDIRWMPQSEPPK 624
Query: 588 NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDY 647
N+ MTWYK + P +DPV L++Q MGKG AW+NG +GRYWP D C T SCDY
Sbjct: 625 NQPMTWYKVNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWPRISPVSDRC-TSSCDY 683
Query: 648 RGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACG 707
RG + +KC CG P+Q WYHVPRSW NTLV+FEE GG+P++I F V + C
Sbjct: 684 RGTFSPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDPTKITFSRRTVASVCS 743
Query: 708 QAHEN--------------------KTMELTC-HGRRISEIKYASFGDPQGACGAFKKGS 746
E+ ++L+C G+ IS +K+ASFG+P G C ++++GS
Sbjct: 744 FVSEHYPSIDLESWDRNTQNDGRDAAKVQLSCPKGKSISSVKFASFGNPSGTCRSYQQGS 803
Query: 747 CEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
C + + ++EK C+ C++ S+ G C G K L +EA C
Sbjct: 804 CH-HPNSISVVEKACLNMNGCTLSLSDEGFGEDLC-PGVTKTLAIEADC 850
>gi|356522482|ref|XP_003529875.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 845
Score = 698 bits (1802), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/831 (45%), Positives = 489/831 (58%), Gaps = 82/831 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D +AITI+G+R+ILLSGSIHYPRSTP MWPDLI+KAKEGGLD I+TYVFWN HEP
Sbjct: 32 VSYDHKAITINGQRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPSP 91
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y F GN DL+RFIK +Q GLYV LRIGPYVCAEWN+GGFPVWL +PGI RT N
Sbjct: 92 GKYYFGGNYDLVRFIKLVQQAGLYVNLRIGPYVCAEWNFGGFPVWLKYIPGI-SFRTDNG 150
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F +M+ FT IVDM K E+LF SQGGPIIL+QIENEYG + + G G++Y W A M
Sbjct: 151 PFKFQMEKFTKKIVDMMKAERLFESQGGPIILSQIENEYGPMEYEIGAPGRAYTQWAAHM 210
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A L GVPWIMC++ DAP P+ F+PN PK+WTE WTGWF +GG
Sbjct: 211 AVGLGTGVPWIMCKQEDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGAV 270
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R AEDLAF++ARF Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAP+DEYG
Sbjct: 271 PHRPAEDLAFSIARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLPR 330
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------SVSGS-------------- 355
QPKWGHL++LH+ +K E L G+ T GN S SG+
Sbjct: 331 QPKWGHLKDLHRAIKLCEPALVSGDPTVQQLGNYEEAHVFRSKSGACAAFLANYNPQSYA 390
Query: 356 -------SYNLPAWSVSILPDCKTEEFNTAKVNTQ-TNVKVKR-PNQAGNDQAPLQWKWR 406
YNLP WS+SILP+CK +NTA+V +Q T +K+ R P G L WK
Sbjct: 391 TVAFGNQRYNLPPWSISILPNCKHTVYNTARVGSQSTTMKMTRVPIHGG-----LSWKAF 445
Query: 407 PEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
E + F + L++Q +T D+SDYLWY T+ + ++ L N L +
Sbjct: 446 NE---ETTTTDDSSFTVTGLLEQINATRDLSDYLWYSTDVVINSNEGFLRNGKNPVLTVL 502
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
S+G LH ++N + + A F V+L G N+ISLLS VGL N G F+
Sbjct: 503 SAGHALHVFINNQLSGTAYGSLEAPKLTFSESVRLRAGVNKISLLSVAVGLPNVGPHFER 562
Query: 526 VPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNV 585
G+ GP+ L G +DL+ KW+YKVGL G + ++ ++S V
Sbjct: 563 WNAGVLGPITLSGLNEGR---RDLTWQKWSYKVGLKG-EALNLHSLSGSSSVEWLQGFLV 618
Query: 586 PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESC 645
+ +TWYKTTF+AP P+ L++ MGKG W+NG +LGRYWP Y A S C
Sbjct: 619 SRRQPLTWYKTTFDAPAGVAPLALDMGSMGKGQVWINGQSLGRYWPAYKASG---SCGYC 675
Query: 646 DYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTA 705
+Y G Y KC NCG SQ WYHVP SW+K N LV+FEE GG+P+ I + +
Sbjct: 676 NYAGTYNEKKCGSNCGQASQRWYHVPHSWLKPTGNLLVVFEELGGDPNGIFLVRRDIDSV 735
Query: 706 CGQAHE--------------------NKTMELTC-HGRRISEIKYASFGDPQGACGAFKK 744
C +E L+C G++IS IK+ASFG P G+CG +++
Sbjct: 736 CADIYEWQPNLVSYDMQASGKVRSPVRPKAHLSCGPGQKISSIKFASFGTPVGSCGNYRE 795
Query: 745 GSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
GSC A +K CVG+ C++ S G C + +K+L VEA+C
Sbjct: 796 GSCHAH-KSYDAFQKNCVGQSWCTVTVSPEIFGGDPCPS-VMKKLSVEAIC 844
>gi|115441369|ref|NP_001044964.1| Os01g0875500 [Oryza sativa Japonica Group]
gi|75103778|sp|Q5N8X6.1|BGAL3_ORYSJ RecName: Full=Beta-galactosidase 3; Short=Lactase 3; Flags:
Precursor
gi|56784847|dbj|BAD82087.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113534495|dbj|BAF06878.1| Os01g0875500 [Oryza sativa Japonica Group]
gi|222619622|gb|EEE55754.1| hypothetical protein OsJ_04267 [Oryza sativa Japonica Group]
Length = 851
Score = 698 bits (1802), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/829 (44%), Positives = 482/829 (58%), Gaps = 78/829 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D R++ I G R++L+S SIHYPRS P MWP L+ +AK+GG D +ETYVFWN HEP +
Sbjct: 38 VTYDHRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHEPAQ 97
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QY F DL+RF K ++D GLY+ILRIGP+V AEW +GG PVWLH PG RT N+
Sbjct: 98 GQYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWTFGGVPVWLHYAPGTV-FRTNNE 156
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F + M+ FTT IVDM KKE+ FASQGG IILAQ+ENEYG++ YG K Y W A M
Sbjct: 157 PFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEYGDMEQAYGAGAKPYAMWAASM 216
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A + + GVPWIMCQ+ DAP P+ F PN+P PK WTENW GWF+++G +
Sbjct: 217 ALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQFKPNSPTKPKFWTENWPGWFQTFGESN 276
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R ED+AF+VARFF GG+ QNYY+YHGGTNFGRT+GGP++TTSYDYDAPIDEYG
Sbjct: 277 PHRPPEDVAFSVARFFGKGGSLQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGLRR 336
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------------------------- 350
PKW HLR+LHK +K E TL YGN + G
Sbjct: 337 LPKWAHLRDLHKSIKLGEHTLLYGNSSFVSLGPQQEADVYTDQSGGCVAFLSNVDSEKDK 396
Query: 351 --SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
+ SY+LPAWSVSILPDCK FNTAKV +QT + P + + +R +
Sbjct: 397 VVTFQSRSYDLPAWSVSILPDCKNVAFNTAKVRSQTLMMDMVPANLESSKVDGWSIFREK 456
Query: 409 MINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
+ + G N +D +T D +DYLWY T+ D+ D L+G N L I S
Sbjct: 457 ----YGIWGNIDLVRNGFVDHINTTKDSTDYLWYTTSFDV--DGSHLAGG-NHVLHIESK 509
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G + A++N + S + SN E PV L GKN++SLLS TVGLQN G ++
Sbjct: 510 GHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAGKNKLSLLSMTVGLQNGGPMYEWAG 569
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
GI V +G E I DLSS+KW YK+GL G + + A R P
Sbjct: 570 AGITS----VKISGMENRIIDLSSNKWEYKIGLEG-EYYSLFKADKGKDIRWMPQSEPPK 624
Query: 588 NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDY 647
N+ MTWYK + P +DPV L++Q MGKG AW+NG +GRYWP D C T SCDY
Sbjct: 625 NQPMTWYKVNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWPRISPVSDRC-TSSCDY 683
Query: 648 RGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACG 707
RG + +KC CG P+Q WYHVPRSW NTLV+FEE GG+P++I F V + C
Sbjct: 684 RGTFSPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDPTKITFSRRTVASVCS 743
Query: 708 QAHEN--------------------KTMELTC-HGRRISEIKYASFGDPQGACGAFKKGS 746
E+ ++L+C G+ IS +K+ SFG+P G C ++++GS
Sbjct: 744 FVSEHYPSIDLESWDRNTQNDGRDAAKVQLSCPKGKSISSVKFVSFGNPSGTCRSYQQGS 803
Query: 747 CEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
C + + ++EK C+ C++ S+ G C G K L +EA C
Sbjct: 804 CH-HPNSISVVEKACLNMNGCTVSLSDEGFGEDLC-PGVTKTLAIEADC 850
>gi|215734965|dbj|BAG95687.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 919
Score = 698 bits (1801), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/829 (44%), Positives = 482/829 (58%), Gaps = 78/829 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D R++ I G R++L+S SIHYPRS P MWP L+ +AK+GG D +ETYVFWN HEP +
Sbjct: 106 VTYDHRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHEPAQ 165
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QY F DL+RF K ++D GLY+ILRIGP+V AEW +GG PVWLH PG RT N+
Sbjct: 166 GQYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWTFGGVPVWLHYAPGTV-FRTNNE 224
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F + M+ FTT IVDM KKE+ FASQGG IILAQ+ENEYG++ YG K Y W A M
Sbjct: 225 PFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEYGDMEQAYGAGAKPYAMWAASM 284
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A + + GVPWIMCQ+ DAP P+ F PN+P PK WTENW GWF+++G +
Sbjct: 285 ALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQFKPNSPTKPKFWTENWPGWFQTFGESN 344
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R ED+AF+VARFF GG+ QNYY+YHGGTNFGRT+GGP++TTSYDYDAPIDEYG
Sbjct: 345 PHRPPEDVAFSVARFFGKGGSLQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGLRR 404
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------------------------- 350
PKW HLR+LHK +K E TL YGN + G
Sbjct: 405 LPKWAHLRDLHKSIKLGEHTLLYGNSSFVSLGPQQEADVYTDQSGGCVAFLSNVDSEKDK 464
Query: 351 --SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
+ SY+LPAWSVSILPDCK FNTAKV +QT + P + + +R +
Sbjct: 465 VVTFQSRSYDLPAWSVSILPDCKNVAFNTAKVRSQTLMMDMVPANLESSKVDGWSIFREK 524
Query: 409 MINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
+ + G N +D +T D +DYLWY T+ D+ D L+G N L I S
Sbjct: 525 ----YGIWGNIDLVRNGFVDHINTTKDSTDYLWYTTSFDV--DGSHLAGG-NHVLHIESK 577
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G + A++N + S + SN E PV L GKN++SLLS TVGLQN G ++
Sbjct: 578 GHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAGKNKLSLLSMTVGLQNGGPMYEWAG 637
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
GI V +G E I DLSS+KW YK+GL G + + A R P
Sbjct: 638 AGITS----VKISGMENRIIDLSSNKWEYKIGLEG-EYYSLFKADKGKDIRWMPQSEPPK 692
Query: 588 NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDY 647
N+ MTWYK + P +DPV L++Q MGKG AW+NG +GRYWP D C T SCDY
Sbjct: 693 NQPMTWYKVNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWPRISPVSDRC-TSSCDY 751
Query: 648 RGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACG 707
RG + +KC CG P+Q WYHVPRSW NTLV+FEE GG+P++I F V + C
Sbjct: 752 RGTFSPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDPTKITFSRRTVASVCS 811
Query: 708 QAHEN--------------------KTMELTC-HGRRISEIKYASFGDPQGACGAFKKGS 746
E+ ++L+C G+ IS +K+ SFG+P G C ++++GS
Sbjct: 812 FVSEHYPSIDLESWDRNTQNDGRDAAKVQLSCPKGKSISSVKFVSFGNPSGTCRSYQQGS 871
Query: 747 CEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
C + + ++EK C+ C++ S+ G C G K L +EA C
Sbjct: 872 CH-HPNSISVVEKACLNMNGCTVSLSDEGFGEDLC-PGVTKTLAIEADC 918
>gi|297743077|emb|CBI35944.3| unnamed protein product [Vitis vinifera]
Length = 841
Score = 698 bits (1801), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/830 (45%), Positives = 486/830 (58%), Gaps = 82/830 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D RAI I+G+R+IL+SGSIHYPRS+P MWPDLI+KAKEGGLD I+TYVFWN HEP +
Sbjct: 30 VSYDRRAIVINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPSQ 89
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y F G DL+RFIK ++ GLYV LRIGPYVCAEWN+GGFPVWL + GI RT N+
Sbjct: 90 GKYYFEGRYDLVRFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVQGI-NFRTNNE 148
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ FT IVDM K E LF SQGGPIIL+QIENEYG + + G G++Y W AKM
Sbjct: 149 PFKWHMQRFTKKIVDMMKSEGLFESQGGPIILSQIENEYGPMEYEIGAPGRAYTEWAAKM 208
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A L GVPW+MC++ DAP P+ F+PN PK+WTE WTGWF +GG
Sbjct: 209 AVGLGTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGAV 268
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R AEDLAF+VARF Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAP+DE+G L
Sbjct: 269 PHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEFGLLR 328
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------SVSGSS------------- 356
QPKWGHL++LH+ +K E L G+ T T GN S SG+
Sbjct: 329 QPKWGHLKDLHRAIKLCEPALISGDPTVTSLGNYEEAHVFHSKSGACAAFLANYNPRSYA 388
Query: 357 --------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
YNLP WS+SILPDCK +NTA++ Q+ P W+ E
Sbjct: 389 KVSFRNMHYNLPPWSISILPDCKNTVYNTARLGAQSATMKMTP-----VSGRFGWQSYNE 443
Query: 409 MINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
+ FA L++Q +T DVSDYLWY T+ + ++ L L + S+
Sbjct: 444 ETASY---DDSSFAAVGLLEQINTTRDVSDYLWYSTDVKIGYNEGFLKSGRYPVLTVLSA 500
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G LH ++NG + + F + VKL G N I+LLS VGL N G F+
Sbjct: 501 GHALHVFINGRLSGTAYGSLENPKLTFSQGVKLRAGVNTIALLSIAVGLPNVGPHFETWN 560
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNV-P 586
G+ GPV L G +DLS KW+YKVGL G + +++ E W ++
Sbjct: 561 AGVLGPVSLNGLNEGR---RDLSWQKWSYKVGLKGEALSLHSLSGSSSVE--WVEGSLMA 615
Query: 587 LNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCD 646
+ +TWYKTTF AP N P+ L++ MGKG W+NG N+GRYWP Y A GC C+
Sbjct: 616 RGQPLTWYKTTFNAPGGNTPLALDMGSMGKGQIWINGQNVGRYWPAYKA-TGGCG--DCN 672
Query: 647 YRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTAC 706
Y G Y KC NCG PSQ WYHVP SW+ N LV+FEE GGNP+ I+ + + C
Sbjct: 673 YAGTYSEKKCLSNCGEPSQRWYHVPHSWLSPTGNLLVVFEESGGNPAGISLVEREIESVC 732
Query: 707 GQAHE----------------NKTMELTCH-----GRRISEIKYASFGDPQGACGAFKKG 745
+E NK + H G++IS IK+ASFG P+G CG++++G
Sbjct: 733 ADIYEWQPTLMNYEMQASGKVNKPLRPKAHLWCAPGQKISSIKFASFGTPEGVCGSYREG 792
Query: 746 SCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
SC A E+ C+G SCS+ + G C + +K+L VEA+C
Sbjct: 793 SCHAH-KSYDAFERSCIGMNSCSVTVAPEIFGGDPCPS-VMKKLSVEAIC 840
>gi|350539595|ref|NP_001234465.1| beta-galactosidase precursor [Solanum lycopersicum]
gi|1352077|sp|P48980.1|BGAL_SOLLC RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; AltName:
Full=Exo-(1-->4)-beta-D-galactanase; Flags: Precursor
gi|6649906|gb|AAF21626.1|AF023847_1 beta-galactosidase precursor [Solanum lycopersicum]
gi|971485|emb|CAA58734.1| putative beta-galactosidase/galactanase [Solanum lycopersicum]
gi|4138139|emb|CAA10174.1| ss-galactosidase [Solanum lycopersicum]
Length = 835
Score = 698 bits (1801), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/829 (45%), Positives = 486/829 (58%), Gaps = 80/829 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D +AI ++G+RKIL+SGSIHYPRSTP MWPDLI+KAKEGG+D I+TYVFWN HEP
Sbjct: 24 VSYDHKAIIVNGQRKILISGSIHYPRSTPEMWPDLIQKAKEGGVDVIQTYVFWNGHEPEE 83
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y F DL++FIK +Q+ GLYV LRIGPY CAEWN+GGFPVWL +PGI RT N+
Sbjct: 84 GKYYFEERYDLVKFIKVVQEAGLYVHLRIGPYACAEWNFGGFPVWLKYVPGIS-FRTNNE 142
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ FTT IVDM K EKL+ +QGGPIIL+QIENEYG + + G+ GK Y W AKM
Sbjct: 143 PFKAAMQKFTTKIVDMMKAEKLYETQGGPIILSQIENEYGPMEWELGEPGKVYSEWAAKM 202
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A L GVPWIMC++ D P P+ FTPN N PK+WTE WT WF +GG
Sbjct: 203 AVDLGTGVPWIMCKQDDVPDPIINTCNGFYCDYFTPNKANKPKMWTEAWTAWFTEFGGPV 262
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R AED+AFAVARF Q GG+F NYYMYHGGTNFGRTSGGP++ TSYDYDAP+DE+G L
Sbjct: 263 PYRPAEDMAFAVARFIQTGGSFINYYMYHGGTNFGRTSGGPFIATSYDYDAPLDEFGSLR 322
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------SVSGSS------------- 356
QPKWGHL++LH+ +K E L + T T GN S SG+
Sbjct: 323 QPKWGHLKDLHRAIKLCEPALVSVDPTVTSLGNYQEARVFKSESGACAAFLANYNQHSFA 382
Query: 357 --------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
YNLP WS+SILPDCK +NTA+V Q+ P G W+
Sbjct: 383 KVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQSAQMKMTPVSRG-----FSWE---S 434
Query: 409 MINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
D F + L++Q + T DVSDYLWYMT+ ++ + L+ + L + S+
Sbjct: 435 FNEDAASHEDDTFTVVGLLEQINITRDVSDYLWYMTDIEIDPTEGFLNSGNWPWLTVFSA 494
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G LH +VNG + + F + L G N+ISLLS VGL N G F+
Sbjct: 495 GHALHVFVNGQLAGTVYGSLENPKLTFSNGINLRAGVNKISLLSIAVGLPNVGPHFETWN 554
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
G+ GPV L G +DL+ KW YKVGL G + ++ + S V
Sbjct: 555 AGVLGPVSLNGL---NEGTRDLTWQKWFYKVGLKG-EALSLHSLSGSPSVEWVEGSLVAQ 610
Query: 588 NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDY 647
+ ++WYKTTF AP N+P+ L++ MGKG W+NG +LGR+WP Y CS C+Y
Sbjct: 611 KQPLSWYKTTFNAPDGNEPLALDMNTMGKGQVWINGQSLGRHWPAY-KSSGSCSV--CNY 667
Query: 648 RGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACG 707
G + KC NCG SQ WYHVPRSW+ N LV+FEE+GG+P I +G+ C
Sbjct: 668 TGWFDEKKCLTNCGEGSQRWYHVPRSWLYPTGNLLVVFEEWGGDPYGITLVKREIGSVCA 727
Query: 708 QAHE----------------NKTMELTCH-----GRRISEIKYASFGDPQGACGAFKKGS 746
+E ++ + H G++IS IK+ASFG P+G CG F++GS
Sbjct: 728 DIYEWQPQLLNWQRLVSGKFDRPLRPKAHLKCAPGQKISSIKFASFGTPEGVCGNFQQGS 787
Query: 747 CEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
C A +K CVGK+SCS++ + N G C +K+L VEA+C
Sbjct: 788 CHAPRS-YDAFKKNCVGKESCSVQVTPENFGGDPC-RNVLKKLSVEAIC 834
>gi|14970839|emb|CAC44500.1| beta-galactosidase [Fragaria x ananassa]
Length = 843
Score = 697 bits (1798), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/836 (44%), Positives = 492/836 (58%), Gaps = 82/836 (9%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
S+ VS+D +AI I+G+R+IL+SGSIHYPRSTP MWPDLI++AK+GGLD I+TYVFWN
Sbjct: 25 SVRASVSYDSKAIVINGQRRILISGSIHYPRSTPEMWPDLIQRAKDGGLDVIQTYVFWNG 84
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
HEP +Y F N DL++FIK +Q GLYV LRIGPYVCAEWN+GGFPVWL +PGI +
Sbjct: 85 HEPSPGKYYFEDNYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGI-QF 143
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYIN 203
RT N F ++MQ FTT IV+M K E+LF S GGPIIL+QIENEYG + + G GK+Y +
Sbjct: 144 RTDNGPFKDQMQRFTTKIVNMMKAERLFESHGGPIILSQIENEYGPMEYEIGAPGKAYTD 203
Query: 204 WCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKS 252
W A+MA L GVPW+MC++ DAP P+ F+PN PK+WTE WTGWF
Sbjct: 204 WAAQMAVGLGTGVPWVMCKQDDAPDPVINACNGFYCDYFSPNKAYKPKMWTEAWTGWFTE 263
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDE 312
+GG P R AEDLAF+VA+F Q GG F NYYMYHGGTNFGRT+GGP++ TSYDYDAP+DE
Sbjct: 264 FGGAVPYRPAEDLAFSVAKFLQKGGAFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDE 323
Query: 313 YGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------SVSGSS-------- 356
YG L QPKWGHL++LH+ +K E L + T T G S SG+
Sbjct: 324 YGLLRQPKWGHLKDLHRAIKLCEPALVSSDPTVTPLGTYQEAHVFKSNSGACAAFLANYN 383
Query: 357 -------------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQW 403
YNLP WS+SILPDCK +NTA++ QT ++K P + P+
Sbjct: 384 RKSFAKVAFGNMHYNLPPWSISILPDCKNTVYNTARIGAQT-ARMKMP------RVPIHG 436
Query: 404 KWRPEMIND-FVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMT 461
+ + ND F L++Q + T D +DYLWYMT+ + + L +
Sbjct: 437 GFSWQAYNDETATYSDTSFTTAGLLEQINITRDATDYLWYMTDVKIDPSEDFLRSGNYPV 496
Query: 462 LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGS 521
L + S+G L ++NG + + F++ V L G NQI+LLS VGL N G
Sbjct: 497 LTVLSAGHALRVFINGQLAGTAYGSLETPKLTFKQGVNLRAGINQIALLSIAVGLPNVGP 556
Query: 522 KFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWS 581
F+ GI GPV+L G +DLS KW+YK+GL G + ++ +S W+
Sbjct: 557 HFETWNAGILGPVILNGLNEGR---RDLSWQKWSYKIGLKG--EALSLHSLTGSSSVEWT 611
Query: 582 SKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGC 640
+ R+ +TWYKTTF P N P+ L++ MGKG W+N ++GRYWP Y A
Sbjct: 612 EGSFVAQRQPLTWYKTTFNRPAGNSPLALDMGSMGKGQVWINDRSIGRYWPAYKASG--- 668
Query: 641 STESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTV 700
+ C+Y G + KC NCG SQ WYHVPRSW+ N LV+ EE+GG+P+ I
Sbjct: 669 TCGECNYAGTFSEKKCLSNCGEASQRWYHVPRSWLNPTGNLLVVLEEWGGDPNGIFLVRR 728
Query: 701 VVGTACGQAHE----------------NKTMELTCH-----GRRISEIKYASFGDPQGAC 739
V + C +E NK + H G++IS IK+ASFG P+G C
Sbjct: 729 EVDSVCADIYEWQPNLMSWQMQVSGRVNKPLRPKAHLSCGPGQKISSIKFASFGTPEGVC 788
Query: 740 GAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G+F++G C A E+ C+G+ SCS+ S N G C +K+L VEA+C
Sbjct: 789 GSFREGGCHAH-KSYNAFERSCIGQNSCSVTVSPENFGGDPC-PNVMKKLSVEAIC 842
>gi|224082924|ref|XP_002306893.1| predicted protein [Populus trichocarpa]
gi|222856342|gb|EEE93889.1| predicted protein [Populus trichocarpa]
Length = 853
Score = 697 bits (1798), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/850 (44%), Positives = 496/850 (58%), Gaps = 78/850 (9%)
Query: 8 SRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKA 67
S+ + L L++ + + + V++D +AI IDG+R+IL+SGSIHYPRSTP MW DL++KA
Sbjct: 7 SKFLTLFLMVLIVGSKLIHCTVTYDKKAIIIDGQRRILISGSIHYPRSTPDMWEDLVQKA 66
Query: 68 KEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNY 127
K+GGLD I+TYVFWN HEP Y+F G DL+RFIKT+Q GLYV LRIGPYVCAEWN+
Sbjct: 67 KDGGLDVIDTYVFWNVHEPSPGNYNFEGRFDLVRFIKTVQKGGLYVHLRIGPYVCAEWNF 126
Query: 128 GGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEY 187
GGFPVWL +PGI RT N F MQ FT IV M K E+LF SQGGPII +QIENEY
Sbjct: 127 GGFPVWLKYVPGI-SFRTDNGPFKAAMQGFTQKIVQMMKDERLFQSQGGPIIFSQIENEY 185
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNP 236
G +G AG SYINW A+MA L GVPW+MC+E DAP P+ F+PN P
Sbjct: 186 GPESRAFGAAGHSYINWAAQMAVGLKTGVPWVMCKEDDAPDPVINTCNGFYCDAFSPNKP 245
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
P +WTE W+GWF +GG R +DLAFAVARF Q GG+F NYYMYHGGTNFGR++G
Sbjct: 246 YKPTMWTEAWSGWFTEFGGAFHHRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRSAG 305
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN------ 350
GP++TTSYDYDAPIDEYG + +PK+GHL+ELH+ +K E L + T T G
Sbjct: 306 GPFITTSYDYDAPIDEYGLIREPKYGHLKELHRAIKLCEHELVSSDPTITLLGTYQQAHV 365
Query: 351 -----------------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK 387
+ Y LP WS+SILPDC+ FNTAKV QT+
Sbjct: 366 FSSGKRSCSAFLANYHTQSAARVMFNNMHYVLPPWSISILPDCRNVVFNTAKVGVQTSHV 425
Query: 388 VKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADL 447
P + W+ E I+ + AL + T D +DYLWY+T+ ++
Sbjct: 426 QMLPTGS----RFFSWESYDEDISSLGASSR-MTALGLMEQINVTRDTTDYLWYITSVNI 480
Query: 448 KDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQI 507
+ L G TL + S+G LH ++NG + S + F PV L G N+I
Sbjct: 481 NPSESFLRGGQWPTLTVESAGHALHVFINGQFSGSAFGTRENREFTFTGPVNLRAGTNRI 540
Query: 508 SLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKK 567
+LLS VGL N G ++ GI GPV+L G KDL+ +W+Y+VGL G +
Sbjct: 541 ALLSIAVGLPNVGVHYETWKTGILGPVMLHGLNQGN---KDLTWQQWSYQVGLKG-EAMN 596
Query: 568 FYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLG 627
+ A+S + + WYK F+AP N+P+ L+++ MGKG W+NG ++G
Sbjct: 597 LVSPNRASSVDWIQGSLATRQQPLKWYKAYFDAPGGNEPLALDMRSMGKGQVWINGQSIG 656
Query: 628 RYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEE 687
RYW +Y A+ D CS SC Y G + KC CG P+Q WYHVPRSW+K N LV+FEE
Sbjct: 657 RYWLSY-AKGD-CS--SCGYSGTFRPPKCQLGCGQPTQRWYHVPRSWLKPKQNLLVIFEE 712
Query: 688 FGGNPSQINFQTVVVGTACGQAHENK---------------------TMELTCH-GRRIS 725
GG+ S+I+ + C A E+ + L C G+ IS
Sbjct: 713 LGGDASKISLVKRSTTSVCADAFEHHPTIENYNTESNGESERNLHQAKVHLRCAPGQSIS 772
Query: 726 EIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGT 785
I +ASFG P G CG+F++G+C A + ++EK+C+G++SC + S +N GA C +
Sbjct: 773 AINFASFGTPTGTCGSFQEGTCHAP-NSHSVVEKKCIGRESCMVAISNSNFGADPCPS-K 830
Query: 786 VKRLVVEALC 795
+K+L VEA+C
Sbjct: 831 LKKLSVEAVC 840
>gi|359474925|ref|XP_002263382.2| PREDICTED: beta-galactosidase 3-like [Vitis vinifera]
gi|297744764|emb|CBI38026.3| unnamed protein product [Vitis vinifera]
Length = 846
Score = 696 bits (1797), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/850 (44%), Positives = 494/850 (58%), Gaps = 80/850 (9%)
Query: 8 SRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKA 67
S+ LLC+ + L+ V++D +A+ I+G+R+IL SGSIHYPRSTP MW LI+KA
Sbjct: 7 SKWFLLCMWVFLCIQLTQC-SVTYDRKALIINGQRRILFSGSIHYPRSTPQMWEGLIQKA 65
Query: 68 KEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNY 127
K+GGLDAI+TYVFWN HEP +Y+F G DL+RFIK IQ GLYV LRIGPY+CAEWN+
Sbjct: 66 KDGGLDAIDTYVFWNLHEPSPGKYNFEGRYDLVRFIKLIQKAGLYVHLRIGPYICAEWNF 125
Query: 128 GGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEY 187
GGFPVWL +PG+ RT N+ F MQ FT IV M K EKLF SQGGPII++QIENEY
Sbjct: 126 GGFPVWLKFVPGV-SFRTDNEPFKMAMQRFTQKIVQMMKNEKLFESQGGPIIISQIENEY 184
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNP 236
G+ +G G +Y+ W AKMA ++D GVPW+MC+E DAP P+ F+PN P
Sbjct: 185 GHESRAFGAPGYAYLTWAAKMAVAMDTGVPWVMCKEDDAPDPVINTCNGFYCDYFSPNKP 244
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
N P +WTE W+GWF + G +R EDL+FAV RF Q GG+F NYYMYHGGTNFGRT+G
Sbjct: 245 NKPTLWTEAWSGWFTEFAGPIQQRPVEDLSFAVTRFIQKGGSFVNYYMYHGGTNFGRTAG 304
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN------ 350
GP++TTSYDYDAPIDEYG + QPK+GHL+ELHK +K E+ L + T G
Sbjct: 305 GPFITTSYDYDAPIDEYGLIRQPKYGHLKELHKAIKLCERALLSADPAETSLGTYAKAQV 364
Query: 351 --SVSGSS---------------------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVK 387
S SG YNL WS+SILPDCK FNTA V QT+
Sbjct: 365 FYSESGGCAAFLSNYNPTSAARVTFNSMHYNLAPWSISILPDCKNVVFNTATVGVQTSQM 424
Query: 388 VKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNAD 446
P + L W+ E I+ + L++Q T D SDYLWY T D
Sbjct: 425 QMLP----TNSELLSWETFNEDISS--ADDDSTITVVGLLEQLNVTRDTSDYLWYSTRID 478
Query: 447 LKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQ 506
+ + L G + TL + S+G +H ++NG+ S + F V L G N
Sbjct: 479 ISSSESFLHGGQHPTLIVQSTGHAMHVFINGHLSGSAFGTREDRRFTFTGDVNLQTGSNI 538
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDK 566
IS+LS VGL N G F+ G+ GPV+L G DE KDLS KW+Y+VGL G
Sbjct: 539 ISVLSIAVGLPNNGPHFETWSTGVLGPVVLHGL--DEG-KKDLSWQKWSYQVGLKGEAMN 595
Query: 567 KFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNL 626
+N + S + +TWYK F+AP ++P+ L++ MGKG W+NG ++
Sbjct: 596 LVSPNVISNIDWMKGSLFAQKQQPLTWYKAYFDAPDGDEPLALDMGSMGKGQVWINGQSI 655
Query: 627 GRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
GRYW Y + CS C Y G + + KC + CG P+Q WYHVPRSW+K N LVLFE
Sbjct: 656 GRYWTAY--AKGNCS--GCSYSGTFRTTKCQFGCGQPTQRWYHVPRSWLKPTQNLLVLFE 711
Query: 687 EFGGNPSQINFQTVVVGTACGQAHENK--------------------TMELTC-HGRRIS 725
E GG+ S+I+F V T C + E+ + L C G+ IS
Sbjct: 712 ELGGDASKISFMKRSVTTVCAEVSEHHPNIKNWHIESQERPEEMSKPKVHLHCASGQSIS 771
Query: 726 EIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGT 785
IK+ASFG P G CG F+KG+C A ++EK+C+G++ CS+ S +N A C
Sbjct: 772 AIKFASFGTPSGTCGNFQKGTCHAPTS-QAVLEKKCIGQQKCSVAVSSSNF-ANPC-PNM 828
Query: 786 VKRLVVEALC 795
K+L VEA+C
Sbjct: 829 FKKLSVEAVC 838
>gi|308550948|gb|ADO34788.1| beta-galactosidase STBG3 [Solanum lycopersicum]
Length = 838
Score = 696 bits (1795), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/830 (45%), Positives = 486/830 (58%), Gaps = 82/830 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D RAI ++G+R+IL+SGS+HYPRSTP MWP +I+KAKEGG+D I+TYVFWN HEP +
Sbjct: 27 VSYDHRAIIVNGQRRILISGSVHYPRSTPEMWPGIIQKAKEGGVDVIQTYVFWNGHEPQQ 86
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y F G DL++FIK + GLYV LR+GPY CAEWN+GGFPVWL +PGI RT N
Sbjct: 87 GKYYFEGRYDLVKFIKLVHQAGLYVHLRVGPYACAEWNFGGFPVWLKYVPGIS-FRTDNG 145
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ FT IV+M K E+L+ +QGGPIIL+QIENEYG + + G GKSY W AKM
Sbjct: 146 PFKAAMQKFTAKIVNMMKAERLYETQGGPIILSQIENEYGPMEWELGAPGKSYAQWAAKM 205
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A LD GVPW+MC++ DAP P+ F+PN PKIWTE WT WF +G
Sbjct: 206 AVGLDTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKIWTEAWTAWFTGFGNPV 265
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R AEDLAF+VA+F Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAP+DEYG L
Sbjct: 266 PYRPAEDLAFSVAKFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLLR 325
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------------------------- 350
QPKWGHL++LH+ +K E L G+ T G+
Sbjct: 326 QPKWGHLKDLHRAIKLCEPALVSGDPAVTALGHQQEAHVFRSKAGSCAAFLANYDQHSFA 385
Query: 351 --SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
S + YNLP WS+SILPDCK FNTA++ Q+ P G L W+ E
Sbjct: 386 TVSFANRHYNLPPWSISILPDCKNTVFNTARIGAQSAQMKMTPVSRG-----LPWQSFNE 440
Query: 409 MINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
+ + F + L++Q +T DVSDYLWY T+ + + L G L I S+
Sbjct: 441 ETSSYE---DSSFTVVGLLEQINTTRDVSDYLWYSTDVKIDSREKFLRGGKWPWLTIMSA 497
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G LH +VNG + + F + V L G N+ISLLS VGL N G F+
Sbjct: 498 GHALHVFVNGQLAGTAYGSLEKPKLTFSKAVNLRAGVNKISLLSIAVGLPNIGPHFETWN 557
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
G+ GPV L G DE +DL+ KW+YKVGL G + +++ E W ++
Sbjct: 558 AGVLGPVSLTGL--DEG-KRDLTWQKWSYKVGLKGEALSLHSLSGSSSVE--WVEGSLVA 612
Query: 588 NRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCD 646
R+ +TWYK+TF AP NDP+ L+L MGKG W+NG +LGRYWP Y A + C +C+
Sbjct: 613 QRQPLTWYKSTFNAPAGNDPLALDLNTMGKGQVWINGQSLGRYWPGYKASGN-CG--ACN 669
Query: 647 YRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTAC 706
Y G + KC NCG SQ WYHVPRSW+ N LVLFEE+GG P I+ V + C
Sbjct: 670 YAGWFNEKKCLSNCGEASQRWYHVPRSWLYPTGNLLVLFEEWGGEPHGISLVKREVASVC 729
Query: 707 GQAHE----------------NKTMELTCH-----GRRISEIKYASFGDPQGACGAFKKG 745
+E +K + H G++I+ IK+ASFG PQG CG+F++G
Sbjct: 730 ADINEWQPQLVNWQMQASGKVDKPLRPKAHLSCAPGQKITSIKFASFGTPQGVCGSFREG 789
Query: 746 SCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
SC A E+ C+G+ SCS+ + G C +K+L VE +C
Sbjct: 790 SCHA-FHSYDAFERYCIGQNSCSVPVTPEIFGGDPCPH-VMKKLSVEVIC 837
>gi|350537661|ref|NP_001234303.1| beta-galactosidase precursor [Solanum lycopersicum]
gi|7939619|gb|AAF70822.1|AF154421_1 beta-galactosidase [Solanum lycopersicum]
gi|4138137|emb|CAA10173.1| ss-galactosidase [Solanum lycopersicum]
Length = 838
Score = 696 bits (1795), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/830 (45%), Positives = 488/830 (58%), Gaps = 82/830 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D RAI ++G+R+IL+SGS+HYPRSTP MWP +I+KAKEGG+D I+TYVFWN HEP +
Sbjct: 27 VSYDHRAIIVNGQRRILISGSVHYPRSTPEMWPGIIQKAKEGGVDVIQTYVFWNGHEPQQ 86
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y F G DL++FIK + GLYV LR+GPY CAEWN+GGFPVWL +PGI RT N
Sbjct: 87 GKYYFEGRYDLVKFIKLVHQAGLYVHLRVGPYACAEWNFGGFPVWLKYVPGIS-FRTDNG 145
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ FT IV+M K E+L+ +QGGPIIL+QIENEYG + + G GKSY W AKM
Sbjct: 146 PFKAAMQKFTAKIVNMMKAERLYETQGGPIILSQIENEYGPMEWELGAPGKSYAQWAAKM 205
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A LD GVPW+MC++ DAP P+ F+PN PKIWTE WT WF +G
Sbjct: 206 AVGLDTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKIWTEAWTAWFTGFGNPV 265
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R AEDLAF+VA+F Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAP+DEYG L
Sbjct: 266 PYRPAEDLAFSVAKFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLLR 325
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------------------------- 350
QPKWGHL++LH+ +K E L G+ T G+
Sbjct: 326 QPKWGHLKDLHRAIKLCEPALVSGDPAVTALGHQQEAHVFRSKAGSCAAFLANYDQHSFA 385
Query: 351 --SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
S + YNLP WS+SILPDCK FNTA++ Q+ P G L W+ E
Sbjct: 386 TVSFANRHYNLPPWSISILPDCKNTVFNTARIGAQSAQMKMTPVSRG-----LPWQSFNE 440
Query: 409 MINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
+ + F + L++Q +T DVSDYLWY T+ + + L G L I S+
Sbjct: 441 ETSSYE---DSSFTVVGLLEQINTTRDVSDYLWYSTDVKIDSREKFLRGGKWPWLTIMSA 497
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G LH +VNG + + F + V L G N+ISLLS VGL N G F+
Sbjct: 498 GHALHVFVNGQLAGTAYGSLEKPKLTFSKAVNLRAGVNKISLLSIAVGLPNIGPHFETWN 557
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
G+ GPV L G DE +DL+ KW+YKVGL G + +++ E W ++
Sbjct: 558 AGVLGPVSLTGL--DEG-KRDLTWQKWSYKVGLKGEALSLHSLSGSSSVE--WVEGSLVA 612
Query: 588 NRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCD 646
R+ +TWYK+TF AP NDP+ L+L MGKG W+NG +LGRYWP Y A + C +C+
Sbjct: 613 QRQPLTWYKSTFNAPAGNDPLALDLNTMGKGQVWINGQSLGRYWPGYKASGN-CG--ACN 669
Query: 647 YRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTAC 706
Y G + KC NCG SQ WYHVPRSW+ N LVLFEE+GG P I+ V + C
Sbjct: 670 YAGWFNEKKCLSNCGEASQRWYHVPRSWLYPTGNLLVLFEEWGGEPHGISLVKREVASVC 729
Query: 707 GQAHE------NKTME--------------LTC-HGRRISEIKYASFGDPQGACGAFKKG 745
+E N M+ L+C G++I+ IK+ASFG PQG CG+F++G
Sbjct: 730 ADINEWQPQLVNWQMQASGKVDKPLRPKAHLSCASGQKITSIKFASFGTPQGVCGSFREG 789
Query: 746 SCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
SC A E+ C+G+ SCS+ + G C +K+L VE +C
Sbjct: 790 SCHA-FHSYDAFERYCIGQNSCSVPVTPEIFGGDPCPH-VMKKLSVEVIC 837
>gi|227053553|gb|ACP18875.1| beta-galactosidase pBG(a) [Carica papaya]
Length = 836
Score = 696 bits (1795), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/838 (45%), Positives = 489/838 (58%), Gaps = 94/838 (11%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D +AITI+G+R+ILLSGSIHYPRSTP MWPDLI+KAKEGGLD I+TYVFWN HEP
Sbjct: 21 VSYDHKAITINGKRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPSP 80
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y F GN DL+RFIK ++ GLYV LRIGPYVCAEWN+GGFPVWL +PGI RT N
Sbjct: 81 GKYYFGGNYDLVRFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGI-AFRTNNG 139
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ FT IVDM K E LF SQGGPIIL+QIENEYG + + G AG++Y W A+M
Sbjct: 140 PFKAYMQRFTKKIVDMMKAEGLFESQGGPIILSQIENEYGPMEYELGAAGRAYSQWAAQM 199
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A L GVPW+MC++ DAP P+ F+PN PK+WTE WTGWF +GG
Sbjct: 200 AVGLGTGVPWVMCKQDDAPDPIINSCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGAV 259
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R EDLAF+VARF Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAP+DEYG +
Sbjct: 260 PYRPVEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLVR 319
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGN-------------VTNTDYGNSVS----------- 353
QPKWGHL++LH+ +K E L G+ V + YG+ +
Sbjct: 320 QPKWGHLKDLHRAIKLCEPALVSGDPSVMPLGRFQEAHVFKSKYGHCAAFLANYNPRSFA 379
Query: 354 -----GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPN--------QAGNDQAP 400
YNLP WS+SILPDCK +NTA+V Q+ P QA N++AP
Sbjct: 380 KVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQSARMKMVPVPIHGAFSWQAYNEEAP 439
Query: 401 LQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSN 459
G+ F L++Q +T DVSDYLWY T+ + D+ L
Sbjct: 440 SS-------------NGERSFTTVGLVEQINTTRDVSDYLWYSTDVKIDPDEGFLKTGKY 486
Query: 460 MTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNY 519
TL + S+G LH +VN + + F + V L G N+IS+LS VGL N
Sbjct: 487 PTLTVLSAGHALHVFVNDQLSGTAYGSLEFPKITFSKGVNLRAGINKISILSIAVGLPNV 546
Query: 520 GSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERG 579
G F+ G+ GPV L G +DLS KW+YKVG+ G + +++ E
Sbjct: 547 GPHFETWNAGVLGPVTLNGLNEGR---RDLSWQKWSYKVGVEGEAMSLHSLSGSSSVE-- 601
Query: 580 WSSKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEED 638
W++ + R+ +TW+KTTF AP N P+ L++ MGKG W+NG ++GR+WP Y A
Sbjct: 602 WTAGSFVARRQPLTWFKTTFNAPAGNSPLALDMNSMGKGQIWINGKSIGRHWPAYKASG- 660
Query: 639 GCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQ 698
S CDY G + KC NCG SQ WYHVPRSW N LV+FEE+GG+P+ I+
Sbjct: 661 --SCGWCDYAGTFNEKKCLSNCGEASQRWYHVPRSWPNPTGNLLVVFEEWGGDPNGISLV 718
Query: 699 TVVVGTACGQAHE----------------NKTMELTCH-----GRRISEIKYASFGDPQG 737
V + C +E NK + H G++IS +K+ASFG P+G
Sbjct: 719 RREVDSVCADIYEWQPTLMNYQMQASGKVNKPLRPKAHLQCGPGQKISSVKFASFGTPEG 778
Query: 738 ACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
ACG++++GSC A E+ CVG+ CS+ N+ A +K+L VE +C
Sbjct: 779 ACGSYREGSCHAH-HSYDAFERLCVGQNWCSVTVVPRNVSGEIPAPSVMKKLAVEVVC 835
>gi|414879448|tpg|DAA56579.1| TPA: beta-galactosidase isoform 1 [Zea mays]
gi|414879449|tpg|DAA56580.1| TPA: beta-galactosidase isoform 2 [Zea mays]
Length = 844
Score = 695 bits (1794), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/833 (44%), Positives = 490/833 (58%), Gaps = 78/833 (9%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A V++D R++ I G R++++S SIHYPRS P MWP L+ +AK+GG D IETYVFWN HE
Sbjct: 26 ASNVTYDHRSLIISGRRRLVISTSIHYPRSVPEMWPKLVAEAKDGGADCIETYVFWNGHE 85
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
QY F DL+RF+K ++D GL +ILRIGPYV AEWNYGG PVWLH +PG RT
Sbjct: 86 IAPGQYYFEDRFDLVRFVKVVRDAGLLLILRIGPYVAAEWNYGGVPVWLHYVPGTV-FRT 144
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSD-YGDAGKSYINW 204
N+ F N +++FTT IVDM KKE+LFASQGG IILAQIENEYG+ YG GK Y W
Sbjct: 145 NNEPFKNHVKSFTTYIVDMMKKEQLFASQGGNIILAQIENEYGDYYEQAYGAGGKPYAMW 204
Query: 205 CAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSW 253
A MA + + GVPWIMCQESDAP P+ F PN+P PKIWTENW GWF+++
Sbjct: 205 AASMALAQNTGVPWIMCQESDAPDPVINSCNGFYCDGFQPNSPTKPKIWTENWPGWFQTF 264
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEY 313
G +P R ED+AFAVARFF+ GG+ QNYY+YHGGTNFGRT+GGP++TTSYDYDAPIDEY
Sbjct: 265 GESNPHRPPEDVAFAVARFFEKGGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEY 324
Query: 314 GHLNQPKWGHLRELHKLLKSMEKTLTYGNVT-------------------------NTDY 348
G PKW HLR+LHK ++ E TL YGN T N D
Sbjct: 325 GLRRFPKWAHLRDLHKSIRLCEHTLLYGNTTFLSLGPKQEADIYSDQSGGCVAFLANIDS 384
Query: 349 GN----SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWK 404
N + Y+LPAWSVSILPDC+ FNTAKV +QT++ P ++ P +W
Sbjct: 385 ANDKVVTFRNRQYDLPAWSVSILPDCRNVVFNTAKVQSQTSMVTMVP-ESLQASKPERWS 443
Query: 405 WRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLR 463
E + GK F N +D +T D +DYLWY T+ + D S S+ L
Sbjct: 444 IFRERTG---IWGKNDFVRNGFVDHINTTKDSTDYLWYTTSFSV--DGSYSSKGSHAVLN 498
Query: 464 INSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF 523
I+S+G +HA++N + S + S + P+ L GKN+++LLS TVGLQN G +
Sbjct: 499 IDSNGHGVHAFLNNVLIGSAYGNGSQSRFSVKLPINLRTGKNELALLSMTVGLQNAGFAY 558
Query: 524 DMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSK 583
+ + G V +G T DLSS+ W YK+GL G + + N++R
Sbjct: 559 EWIGAGFTN----VNISGVRTGTIDLSSNNWAYKIGLEG-EYYNLFKPDQTNNQRWIPQS 613
Query: 584 NVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTE 643
P N+ +TWYK + P +DPV +++Q MGKG AW+NG +GRYWP + D C T
Sbjct: 614 EPPKNQPLTWYKVNVDVPQGDDPVGIDMQSMGKGLAWLNGNAIGRYWPRTSSINDRC-TP 672
Query: 644 SCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVG 703
SC+YRG + DKC CG P+Q WYH+PRSW N LV+FEE GG+P++I F V
Sbjct: 673 SCNYRGTFIPDKCRTGCGQPTQRWYHIPRSWFHPSGNILVVFEEKGGDPTKITFSRRAVT 732
Query: 704 TACGQAHEN--------------------KTMELTC-HGRRISEIKYASFGDPQGACGAF 742
+ C E+ +L C G+ IS +K+AS G+P G C ++
Sbjct: 733 SVCSFVSEHFPSIDLESWDESAMTEGTPPAKAQLFCPEGKSISSVKFASLGNPSGTCRSY 792
Query: 743 KKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+ G C + L ++EK C+ SC++ ++ + G C G K L +EA C
Sbjct: 793 QMGRCH-HPNSLSVVEKACLNTNSCTVSLTDESFGKDLC-PGVTKTLAIEADC 843
>gi|449433177|ref|XP_004134374.1| PREDICTED: beta-galactosidase 9-like [Cucumis sativus]
Length = 890
Score = 695 bits (1794), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/902 (43%), Positives = 506/902 (56%), Gaps = 140/902 (15%)
Query: 9 RAILLCLILQTLFNLSL---------AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGM 59
R +L+ ++ + L + VS+D RA+ IDG+R++L+S +HYPR++P M
Sbjct: 4 RGVLIVQLMSLTLTIHLLVVSGEFFKPFNVSYDHRALIIDGKRRMLISAGVHYPRASPEM 63
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
WPD+I+K+KEGG D I++YVFWN HEP + QY+F G DL++FI+ + GLY+ LRIGP
Sbjct: 64 WPDIIEKSKEGGADVIQSYVFWNGHEPTKGQYNFDGRYDLVKFIRLVGSSGLYLHLRIGP 123
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
YVCAEWN+GGFP+WL ++PGIE RT N F EMQ F IVD+ + EKLF QGGP+I
Sbjct: 124 YVCAEWNFGGFPLWLRDVPGIE-FRTDNAPFKEEMQRFVKKIVDLLRDEKLFCWQGGPVI 182
Query: 180 LAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM--------- 230
+ Q+ENEYGN+ S YG G+ YI W MA L VPW+MCQ+ DAPS +
Sbjct: 183 MLQVENEYGNIESSYGKRGQEYIKWVGNMALGLGAEVPWVMCQQKDAPSTIINSCNGYYC 242
Query: 231 --FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGG 288
F N+P+ P WTENW GWF SWG + P R EDLAF+VARFFQ G+FQNYYMY GG
Sbjct: 243 DGFKANSPSKPIFWTENWNGWFTSWGERSPHRPVEDLAFSVARFFQREGSFQNYYMYFGG 302
Query: 289 TNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGN------ 342
TNFGRT+GGP+ TSYDYD+PIDEYG + +PKWGHL++LH LK E L +
Sbjct: 303 TNFGRTAGGPFYITSYDYDSPIDEYGLIREPKWGHLKDLHTALKLCEPALVSADSPQYIK 362
Query: 343 ---------------------------------VTNTDYGNSVS----GSSYNLPAWSVS 365
+ N D +V+ G +YNLP WSVS
Sbjct: 363 LGPKQEAHVYHMKSQTDDLTLSKLGTLRNCSAFLANIDERKAVAVKFNGQTYNLPPWSVS 422
Query: 366 ILPDCKTEEFNTAKVNTQTNVKVKR---PNQA-------GNDQAPLQ-----WKWRPEMI 410
ILPDC+ FNTAKV QT++K+ P A DQ L W E I
Sbjct: 423 ILPDCQNVVFNTAKVAAQTSIKILELYAPLSANVSLKLHATDQNELSIIANSWMTVKEPI 482
Query: 411 -----NDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMT--LR 463
+F V+G L T D SDYLWYMT + +DD N+T +
Sbjct: 483 GIWSDQNFTVKG-------ILEHLNVTKDRSDYLWYMTRIHVSNDDIRFWKERNITPTIT 535
Query: 464 INSSGQVLHAYVNGNYVDS---QWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYG 520
I+S V +VNG S QW K F +PV+ G N + LLS +GLQN G
Sbjct: 536 IDSVRDVFRVFVNGKLTGSAIGQWVK-------FVQPVQFLEGYNDLLLLSQAMGLQNSG 588
Query: 521 SKFDMVPNGIPGPVLLVG-RAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERG 579
+ + GI G + L G + GD DLS WTY+VGL G + FY+ + N +
Sbjct: 589 AFIEKDGAGIRGRIKLTGFKNGD----IDLSKSLWTYQVGLKG-EFLNFYSLEE-NEKAD 642
Query: 580 WSSKNV-PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEED 638
W+ +V + TWYK F +P DPV +NL MGKG AWVNG+++GRYW + ++ +D
Sbjct: 643 WTELSVDAIPSTFTWYKAYFSSPDGTDPVAINLGSMGKGQAWVNGHHIGRYW-SVVSPKD 701
Query: 639 GCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQ 698
GC + CDYRG Y S KCA NCG P+Q WYH+PRSW+K+ N LVLFEE GGNP +I +
Sbjct: 702 GCPRK-CDYRGAYNSGKCATNCGRPTQSWYHIPRSWLKESSNLLVLFEETGGNPLEIVVK 760
Query: 699 TVVVGTACGQAHE------------------------NKTMELTC-HGRRISEIKYASFG 733
G CGQ E N M L C G IS +++AS+G
Sbjct: 761 LYSTGVICGQVSESHYPSLRKLSNDYISDGETLSNRANPEMFLHCDDGHVISSVEFASYG 820
Query: 734 DPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEA 793
PQG+C F +G C A + L ++ + C+GK SC++E S + G C + VK L VEA
Sbjct: 821 TPQGSCNKFSRGPCHA-TNSLSVVSQACLGKNSCTVEISNSAFGGDPCHS-IVKTLAVEA 878
Query: 794 LC 795
C
Sbjct: 879 RC 880
>gi|357130338|ref|XP_003566806.1| PREDICTED: beta-galactosidase 2-like [Brachypodium distachyon]
Length = 831
Score = 695 bits (1794), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/851 (44%), Positives = 490/851 (57%), Gaps = 77/851 (9%)
Query: 1 MATLKHCSRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMW 60
MA+ + A+L + L S V++D +A+ ++G+R+ILLSGSIHYPRS P MW
Sbjct: 1 MASSAPPAPAVLAVALTVALLASSAWAAVTYDRKAVVVNGQRRILLSGSIHYPRSVPEMW 60
Query: 61 PDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPY 120
PDLI+KAK+GGLD ++TYVFWN HEP QY F G DL+ FIK ++ GLYV LRIGPY
Sbjct: 61 PDLIQKAKDGGLDVVQTYVFWNGHEPSPGQYHFEGRYDLVHFIKLVKQAGLYVHLRIGPY 120
Query: 121 VCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIIL 180
VCAEWN+GGFP+WL +PGI RT N+ F EMQ FTT IV M K E+LF QGGPIIL
Sbjct: 121 VCAEWNFGGFPIWLKYVPGI-SFRTDNEPFKAEMQKFTTKIVQMMKSERLFEWQGGPIIL 179
Query: 181 AQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM---------- 230
+QIENE+G + D G+ K Y +W A MA +L+ GVPWIMC+E DAP P+
Sbjct: 180 SQIENEFGPLEWDQGEPAKDYASWAANMAMALNTGVPWIMCKEDDAPDPIINTCNGFYCD 239
Query: 231 -FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGT 289
F+PN P+ P +WTE WT W+ +G P R EDLA+ VA+F Q GG+F NYYMYHGGT
Sbjct: 240 WFSPNKPHKPTMWTEAWTAWYTGFGIPVPHRPVEDLAYGVAKFIQKGGSFVNYYMYHGGT 299
Query: 290 NFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG 349
NF RT+GGP++ TSYDYDAP+DEYG L +PKWGHL+ELH+ +K E L + + G
Sbjct: 300 NFERTAGGPFIATSYDYDAPLDEYGLLREPKWGHLKELHRAIKLCEPALVAADPILSSLG 359
Query: 350 N-----------------------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKV 380
N S +G Y+LP WS+SILPDCKT FNTA+V
Sbjct: 360 NAQKASVFRSSTGACAAFLENKHKLSYARVSFNGMHYDLPPWSISILPDCKTTVFNTARV 419
Query: 381 NTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYL 439
+Q + + AG L W+ E IN F F L++Q T D +DYL
Sbjct: 420 GSQ--ISQMKMEWAGG----LTWQSYNEEINSFSELES--FTTVGLLEQINMTRDNTDYL 471
Query: 440 WYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVK 499
WY T D+ D+ L+ N L + S+G LH ++NG + + + VK
Sbjct: 472 WYTTYVDVAKDEQFLTSGKNPKLTVMSAGHALHVFINGQLSGTVYGSVENPKLTYTGKVK 531
Query: 500 LTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVG 559
L G N IS LS VGL N G F+ GI GPV L G + +DL+ KWTY+VG
Sbjct: 532 LWSGSNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNEGK---RDLTWQKWTYQVG 588
Query: 560 LYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFA 619
L G + +++ E G + PL TWYK F AP ++P+ L++ MGKG
Sbjct: 589 LKGEAMSLHSLSGSSSVEWGEPVQKQPL----TWYKAFFNAPDGDEPLALDMNSMGKGQI 644
Query: 620 WVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGV 679
W+NG +GRYWP Y A + CDYRG Y KC NCG+PSQ WYHVPR W+
Sbjct: 645 WINGQGIGRYWPGYKASG---TCGHCDYRGEYNETKCQTNCGDPSQRWYHVPRPWLNPTG 701
Query: 680 NTLVLFEEFGGNPSQINFQTVVVGTACGQA--------------HENKTMELTC-HGRRI 724
N LV+FEE+GG+P+ I+ G+ C +E + L C HGR+I
Sbjct: 702 NLLVIFEEWGGDPTGISMVKRTTGSVCADVSEWQPSIKNWRTKDYEKAEVHLQCDHGRKI 761
Query: 725 SEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAG 784
+EIK+ASFG PQG+CG + +G C A + +K C+ ++ C + G C G
Sbjct: 762 TEIKFASFGTPQGSCGNYSEGGCHAHRS-YDIFKKNCINQEWCGVSVVPEAFGGDPC-PG 819
Query: 785 TVKRLVVEALC 795
T+KR VVE C
Sbjct: 820 TMKRAVVEVTC 830
>gi|18403090|ref|NP_565755.1| beta galactosidase 9 [Arabidopsis thaliana]
gi|75265632|sp|Q9SCV3.1|BGAL9_ARATH RecName: Full=Beta-galactosidase 9; Short=Lactase 9; Flags:
Precursor
gi|6686890|emb|CAB64745.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|20197062|gb|AAC04500.2| putative beta-galactosidase [Arabidopsis thaliana]
gi|330253650|gb|AEC08744.1| beta galactosidase 9 [Arabidopsis thaliana]
Length = 887
Score = 695 bits (1793), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/881 (44%), Positives = 503/881 (57%), Gaps = 115/881 (13%)
Query: 12 LLCLILQTLFNLSL-------AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLI 64
+L LI+ L + + VS+D RA+ I G+R++L+S IHYPR+TP MW DLI
Sbjct: 14 ILSLIIALLVYFPILSGSYFKPFNVSYDHRALIIAGKRRMLVSAGIHYPRATPEMWSDLI 73
Query: 65 KKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAE 124
K+KEGG D ++TYVFWN HEP++ QY+F G DL++F+K I GLY+ LRIGPYVCAE
Sbjct: 74 AKSKEGGADVVQTYVFWNGHEPVKGQYNFEGRYDLVKFVKLIGSSGLYLHLRIGPYVCAE 133
Query: 125 WNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIE 184
WN+GGFPVWL ++PGIE RT N+ F EMQ F T IVD+ ++ KLF QGGPII+ QIE
Sbjct: 134 WNFGGFPVWLRDIPGIE-FRTDNEPFKKEMQKFVTKIVDLMREAKLFCWQGGPIIMLQIE 192
Query: 185 NEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTP 233
NEYG+V YG GK Y+ W A MA L GVPW+MC+++DAP + F P
Sbjct: 193 NEYGDVEKSYGQKGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGFKP 252
Query: 234 NNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGR 293
N+ P +WTE+W GW+ WGG P R AEDLAFAVARF+Q GG+FQNYYMY GGTNFGR
Sbjct: 253 NSRTKPVLWTEDWDGWYTKWGGSLPHRPAEDLAFAVARFYQRGGSFQNYYMYFGGTNFGR 312
Query: 294 TSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGN----------- 342
TSGGP+ TSYDYDAP+DEYG ++PKWGHL++LH +K E L +
Sbjct: 313 TSGGPFYITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAADAPQYRKLGSKQ 372
Query: 343 -------------------VTNTDYGNSV----SGSSYNLPAWSVSILPDCKTEEFNTAK 379
+ N D S +G SY LP WSVSILPDC+ FNTAK
Sbjct: 373 EAHIYHGDGETGGKVCAAFLANIDEHKSAHVKFNGQSYTLPPWSVSILPDCRHVAFNTAK 432
Query: 380 VNTQTNVK-VKRPNQAGNDQAPLQWKWRPEMIN-----------DFVVRGKGHFALNTLI 427
V QT+VK V+ + + LQ R + ++ + G+ +F L+
Sbjct: 433 VGAQTSVKTVESARPSLGSMSILQKVVRQDNVSYISKSWMALKEPIGIWGENNFTFQGLL 492
Query: 428 DQ-KSTNDVSDYLWYMTNADLKDDDPIL--SGSSNMTLRINSSGQVLHAYVNGNYVDS-- 482
+ T D SDYLW+ T + +DD N T+ I+S VL +VN S
Sbjct: 493 EHLNVTKDRSDYLWHKTRISVSEDDISFWKKNGPNSTVSIDSMRDVLRVFVNKQLAGSIV 552
Query: 483 -QWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVG-RA 540
W K +PV+ +G N + LL+ TVGLQNYG+ + G G L G +
Sbjct: 553 GHWVKA-------VQPVRFIQGNNDLLLLTQTVGLQNYGAFLEKDGAGFRGKAKLTGFKN 605
Query: 541 GDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRM-TWYKTTFE 599
GD DLS WTY+VGL G DK + N + WS+ + + WYKT F+
Sbjct: 606 GD----LDLSKSSWTYQVGLKGEADKIY--TVEHNEKAEWSTLETDASPSIFMWYKTYFD 659
Query: 600 APLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYN 659
P DPVVLNL+ MG+G AWVNG ++GRYW ++++DGC +CDYRG Y SDKC N
Sbjct: 660 PPAGTDPVVLNLESMGRGQAWVNGQHIGRYW-NIISQKDGCD-RTCDYRGAYNSDKCTTN 717
Query: 660 CGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHE-------- 711
CG P+Q YHVPRSW+K N LVLFEE GGNP +I+ +TV G CGQ E
Sbjct: 718 CGKPTQTRYHVPRSWLKPSSNLLVLFEETGGNPFKISVKTVTAGILCGQVSESHYPPLRK 777
Query: 712 -------NKTM---------ELTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVL 754
N TM L C G IS I++AS+G P+G+C F G C A + L
Sbjct: 778 WSTPDYINGTMSINSVAPEVHLHCEDGHVISSIEFASYGTPRGSCDGFSIGKCHAS-NSL 836
Query: 755 PLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
++ + C G+ SC IE S + C +GT+K L V + C
Sbjct: 837 SIVSEACKGRNSCFIEVSNTAFISDPC-SGTLKTLAVMSRC 876
>gi|356502950|ref|XP_003520277.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 848
Score = 694 bits (1791), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/831 (45%), Positives = 489/831 (58%), Gaps = 82/831 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D +A+ I+G+R+IL SGSIHYPRSTP MW DLI KAKEGG+D +ETYVFWN HEP
Sbjct: 27 VTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLILKAKEGGIDVVETYVFWNVHEPSP 86
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y+F G DL+RF+KTIQ GLY LRIGPYVCAEWN+GGFPVWL +PGI RT N+
Sbjct: 87 GNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGIS-FRTDNE 145
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ FT IV M K E+LF SQGGPIIL+QIENEYG G AG++Y+NW AKM
Sbjct: 146 PFKRAMQGFTEKIVGMMKSERLFESQGGPIILSQIENEYGAQSKLQGAAGQNYVNWAAKM 205
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A + GVPW+MC+E DAP P+ FTPN P P IWTE W+GWF +GG
Sbjct: 206 AVEMGTGVPWVMCKEDDAPDPVINTCNGFYCDKFTPNRPYKPMIWTEAWSGWFTEFGGPI 265
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
KR +DLAFA ARF GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAP+DEYG +
Sbjct: 266 HKRPVQDLAFAAARFIIRGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLIR 325
Query: 318 QPKWGHLRELHKLLKSMEKTLT-------------YGNVTNTDYGNSVSGSS-------- 356
QPK+GHL+ELH+ +K E+ L +V T+ G+ + S
Sbjct: 326 QPKYGHLKELHRAIKMCERALVSTDPIVTSLGEFQQAHVYTTESGDCAAFLSNYDSKSSA 385
Query: 357 --------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
Y+LP WSVSILPDC+ FNTAKV QT+ P N Q W+ E
Sbjct: 386 RVMFNNMHYSLPPWSVSILPDCRNVVFNTAKVGVQTSQMQMLPT---NTQL-FSWESFDE 441
Query: 409 MINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
I + V L++Q T D SDYLWY+T+ D+ + L G TL + S+
Sbjct: 442 DI--YSVDESSAITAPGLLEQINVTKDASDYLWYITSVDIGSSESFLRGGELPTLIVQST 499
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G +H ++NG S + + V L G N+I+LLS +GL N G F+
Sbjct: 500 GHAVHVFINGQLSGSAFGTREYRRFTYTGKVNLLAGINRIALLSVAIGLPNVGEHFESWS 559
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGW--SSKNV 585
GI GPV L G + DLS KWTY+VGL G + + S W S+ V
Sbjct: 560 TGILGPVALHGLDKGKW---DLSGQKWTYQVGLKG--EAMDLASPNGISSVAWMQSAIVV 614
Query: 586 PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESC 645
N+ +TW+KT F+AP ++P+ L+++GMGKG W+NG ++GRYW + + C
Sbjct: 615 QRNQPLTWHKTYFDAPEGDEPLALDMEGMGKGQIWINGQSIGRYWTAFATG----NCNDC 670
Query: 646 DYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTA 705
+Y G + KC CG P+Q WYHVPRSW+K N LV+FEE GGNPS+I+ V +
Sbjct: 671 NYAGSFRPPKCQLGCGQPTQRWYHVPRSWLKTTQNLLVIFEELGGNPSKISLVKRSVSSV 730
Query: 706 C------------------GQAHENKTMELTCH---GRRISEIKYASFGDPQGACGAFKK 744
C G++ E + ++ H G+ IS IK+ASFG P G CG +++
Sbjct: 731 CADVSEYHPNIKNWHIESYGKSEEFRPPKVHLHCSPGQTISSIKFASFGTPLGTCGNYEQ 790
Query: 745 GSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G+C + + ++EK+C+GK C++ S +N G C +KRL VEA+C
Sbjct: 791 GACHSPASYV-ILEKRCIGKPRCTVTVSNSNFGQDPCPK-VLKRLSVEAVC 839
>gi|242055159|ref|XP_002456725.1| hypothetical protein SORBIDRAFT_03g041450 [Sorghum bicolor]
gi|241928700|gb|EES01845.1| hypothetical protein SORBIDRAFT_03g041450 [Sorghum bicolor]
Length = 843
Score = 694 bits (1790), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/837 (44%), Positives = 493/837 (58%), Gaps = 87/837 (10%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A V++D R++ I G R++++S SIHYPRS P MWP L+ +AK+GG D IETYVFWN HE
Sbjct: 26 ASNVTYDHRSLIISGRRRLIISTSIHYPRSVPEMWPKLVAEAKDGGADCIETYVFWNGHE 85
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
QY F DL+RF+K ++D GL +ILRIGP+V AEWN+GG PVWLH +PG RT
Sbjct: 86 IAPGQYYFEDRFDLVRFVKVVKDAGLLLILRIGPFVAAEWNFGGVPVWLHYVPGTV-FRT 144
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSD-YGDAGKSYINW 204
N+ F + M++FTT IV+M KKE+LFASQGG IILAQIENEYG+ Y GK Y W
Sbjct: 145 DNEPFKSHMKSFTTYIVNMMKKEQLFASQGGNIILAQIENEYGDYYEQAYAPGGKPYAMW 204
Query: 205 CAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSW 253
A MA + + GVPWIMCQESDAP P+ F PN+P PK+WTENW GWF+++
Sbjct: 205 AASMAVAQNTGVPWIMCQESDAPDPVINSCNGFYCDGFQPNSPTKPKLWTENWPGWFQTF 264
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEY 313
G +P R ED+AFAVARFF+ GG+ QNYY+YHGGTNFGRT+GGP++TTSYDYDAPIDEY
Sbjct: 265 GESNPHRPPEDVAFAVARFFEKGGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEY 324
Query: 314 GHLNQPKWGHLRELHKLLKSMEKTLTYGNVT-------------------------NTDY 348
G PKW HLR+LHK ++ E TL YGN T N D
Sbjct: 325 GLRRFPKWAHLRDLHKSIRLCEHTLLYGNTTFLSLGPKQEADIYSDQSGGCVAFLANIDS 384
Query: 349 GN----SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWK 404
N + Y+LPAWSVSILPDC+ FNTAKV +QT++ P QA
Sbjct: 385 ANDKVVTFRNRQYDLPAWSVSILPDCRNVVFNTAKVQSQTSMVAMVPESL---QAS---- 437
Query: 405 WRPEMINDFVVR----GKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSN 459
+PE N F R GK F N +D +T D +DYLWY T+ + D S S+
Sbjct: 438 -KPERWNIFRERTGIWGKNDFVRNGFVDHINTTKDSTDYLWYTTSFSV---DESYSKGSH 493
Query: 460 MTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNY 519
+ L I+S G +HA++N ++ S + S+ + P+ L GKN+++LLS TVGLQN
Sbjct: 494 VVLNIDSKGHGVHAFLNNEFIGSAYGNGSQSSFSVKLPINLRTGKNELALLSMTVGLQNA 553
Query: 520 GSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERG 579
G ++ + G V +G +LSS+ W YK+GL G + + N++R
Sbjct: 554 GFSYEWIGAGFTN----VNISGVRNGTINLSSNNWAYKIGLEG-EYYSLFKPDQRNNQRW 608
Query: 580 WSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDG 639
P N+ +TWYK + P +DPV +++Q MGKG W+NG +GRYWP + +D
Sbjct: 609 IPQSEPPKNQPLTWYKVNVDVPQGDDPVGIDMQSMGKGLVWLNGNAIGRYWPRTSSIDDR 668
Query: 640 CSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
C T SCDYRG + +KC CG P+Q WYH+PRSW N LV+FEE GG+P++I F
Sbjct: 669 C-TPSCDYRGEFNPNKCRTGCGQPTQRWYHIPRSWFHPSGNILVIFEEKGGDPTKITFSR 727
Query: 700 VVVGTACGQAHEN--------------------KTMELTCH-GRRISEIKYASFGDPQGA 738
V + C E+ +L+C G+ IS +K+AS G P G
Sbjct: 728 RAVTSVCSFVSEHFPSIDLESWDGSATNEGTSPAKAQLSCPIGKNISSLKFASLGTPSGT 787
Query: 739 CGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
C +++KGSC + L ++EK C+ SC++ S+ + G C G K L +EA C
Sbjct: 788 CRSYQKGSCH-HPNSLSVVEKACLNTNSCTVSLSDESFGKDLC-PGVTKTLAIEADC 842
>gi|449458175|ref|XP_004146823.1| PREDICTED: beta-galactosidase 1-like [Cucumis sativus]
gi|449515710|ref|XP_004164891.1| PREDICTED: beta-galactosidase 1-like [Cucumis sativus]
Length = 841
Score = 693 bits (1789), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/835 (45%), Positives = 487/835 (58%), Gaps = 78/835 (9%)
Query: 23 LSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWN 82
LS+ VS+D +AI I+G R+IL+SGSIHYPRST MWPDLI+KAKEGGLD IETYVFWN
Sbjct: 22 LSVQASVSYDSKAIIINGHRRILISGSIHYPRSTSEMWPDLIQKAKEGGLDVIETYVFWN 81
Query: 83 AHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEE 142
HEP +Y F GN DL+RF+K + GLYV LRIGPYVCAEWN+GGFPVWL +PGI
Sbjct: 82 GHEPEPGKYYFEGNYDLVRFVKLVHQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGI-S 140
Query: 143 LRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYI 202
RT N F +M+ FT IV+M K E+L+ SQGGPIIL+QIENEYG + + G GK+Y
Sbjct: 141 FRTDNAPFKFQMERFTRKIVNMMKAERLYESQGGPIILSQIENEYGPMEYELGAPGKAYS 200
Query: 203 NWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFK 251
W A+MA L GVPW+MC++ DAP P+ F+PN PK+WTE WTGWF
Sbjct: 201 KWAAQMALGLGTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFT 260
Query: 252 SWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPID 311
+GG P R AED+AFAVARF Q GG NYYMYHGGTNFGRT+GGP++ TSYDYDAPID
Sbjct: 261 QFGGAVPHRPAEDMAFAVARFIQKGGALINYYMYHGGTNFGRTAGGPFIATSYDYDAPID 320
Query: 312 EYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------SVSGSS------- 356
EYG L QPKWGHL++L++ +K E L G+ T GN S SG+
Sbjct: 321 EYGLLRQPKWGHLKDLNRAIKLCEPALVSGDPIVTRLGNYQEAHVFKSKSGACAAFLSNY 380
Query: 357 --------------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQ 402
YN+P WS+SILPDCK FNTA+V QT + P +
Sbjct: 381 NPRSYATVAFGNMHYNIPPWSISILPDCKNTVFNTARVGAQTAIMKMSPVPMHESFSWQA 440
Query: 403 WKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTL 462
+ P N+ K + L +T D +DYLWY T+ + ++ L L
Sbjct: 441 YNEEPASYNE-----KAFTTVGLLEQINTTRDATDYLWYTTDVHIDANEGFLRSGKYPVL 495
Query: 463 RINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSK 522
+ S+G +H +VNG + + F R V L G N+I+LLS VGL N G
Sbjct: 496 TVLSAGHAMHVFVNGQLAGTAYGSLDFPKLTFSRGVNLRAGNNKIALLSIAVGLPNVGPH 555
Query: 523 FDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSS 582
F+M GI GPV L G DE +DL+ KWTYK+GL G + +++ E W
Sbjct: 556 FEMWNAGILGPVNLNGL--DEG-RRDLTWQKWTYKIGLDGEAMSLHSLSGSSSVE--WIQ 610
Query: 583 KNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCS 641
++ ++ +TW+KTTF AP N P+ L++ MGKG W+NG +LGRYWP Y + S
Sbjct: 611 GSLVAQKQPLTWFKTTFNAPAGNSPLALDMGSMGKGQIWLNGQSLGRYWPAY---KSTGS 667
Query: 642 TESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVV 701
SCDY G Y KC+ NCG SQ WYHVPRSW+ N LV+FEE+GG+P+ I+
Sbjct: 668 CGSCDYTGTYNEKKCSSNCGEASQRWYHVPRSWLNPTGNLLVVFEEWGGDPNGIHLVRRD 727
Query: 702 VGTACGQAHE----------------NKTMELTCH-----GRRISEIKYASFGDPQGACG 740
V + C +E NK + H G++IS +K+ASFG P+G CG
Sbjct: 728 VDSVCVNINEWQPTLMNWQMQSSGKVNKPLRPKAHLSCGPGQKISSVKFASFGTPEGECG 787
Query: 741 AFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+F++GSC A ++ CVG+ C++ + G C +K+L VE +C
Sbjct: 788 SFREGSCHAH-HSYDAFQRTCVGQNFCTVTVAPEMFGGDPC-PNVMKKLSVEVIC 840
>gi|15027869|gb|AAK76465.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 621
Score = 693 bits (1788), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/630 (53%), Positives = 434/630 (68%), Gaps = 51/630 (8%)
Query: 208 MATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGK 256
MA SLDIGVPW+MCQ+ +AP PM + P NP++PK+WTENWTGWFK+WGGK
Sbjct: 1 MANSLDIGVPWLMCQQPNAPQPMLETCNGFYCDQYEPTNPSTPKMWTENWTGWFKNWGGK 60
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHL 316
P RTAEDLAF+VARFFQ GGTFQNYYMYHGGTNFGR +GGPY+TTSYDY AP+DE+G+L
Sbjct: 61 HPYRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTSYDYHAPLDEFGNL 120
Query: 317 NQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSV------------------------ 352
NQPKWGHL++LH +LKSMEK+LTYGN++ D GNS+
Sbjct: 121 NQPKWGHLKQLHTVLKSMEKSLTYGNISRIDLGNSIKATIYTTKEGSSCFIGNVNATADA 180
Query: 353 ----SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
G Y++PAWSVS+LPDC E +NTAKVNTQT++ + ++ L+W WRPE
Sbjct: 181 LVNFKGKDYHVPAWSVSVLPDCDKEAYNTAKVNTQTSIMTEDSSKP----ERLEWTWRPE 236
Query: 409 MINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
+++G G L+DQK TND SDYLWYMT L DP+ S NMTLR++S+
Sbjct: 237 SAQKMILKGSGDLIAKGLVDQKDVTNDASDYLWYMTRLHLDKKDPLW--SRNMTLRVHSN 294
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPV-KLTRGKNQISLLSATVGLQNYGSKFDMV 526
VLHAYVNG YV +Q+ K G + FER V L G N ISLLS +VGLQNYG F+
Sbjct: 295 AHVLHAYVNGKYVGNQFVKDGKFDYRFERKVNHLVHGTNHISLLSVSVGLQNYGPFFESG 354
Query: 527 PNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVP 586
P GI GPV LVG G+ETI KDLS H+W YK+GL G +DK F + K+ ++ W+++ +P
Sbjct: 355 PTGINGPVSLVGYKGEETIEKDLSQHQWDYKIGLNGYNDKLF-SIKSVGHQK-WANEKLP 412
Query: 587 LNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCD 646
R +TWYK F+APL +PV+++L G+GKG AW+NG ++GRYWP++ + +DGC + CD
Sbjct: 413 TGRMLTWYKAKFKAPLGKEPVIVDLNGLGKGEAWINGQSIGRYWPSFNSSDDGCK-DKCD 471
Query: 647 YRGPYGSDKCAYNCGNPSQIWYHVPRSWIK-DGVNTLVLFEEFGGNPSQINFQTVVVGTA 705
YRG YGSDKCA+ CG P+Q WYHVPRS++ G NT+ LFEE GGNPS +NF+TVVVGT
Sbjct: 472 YRGAYGSDKCAFMCGKPTQRWYHVPRSFLNASGHNTITLFEEMGGNPSMVNFKTVVVGTV 531
Query: 706 CGQAHENKTMELTCHGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKK 765
C +AHE+ +EL+CH R IS +K+ASFG+P G CG+F G+C+ + D + K+CVGK
Sbjct: 532 CARAHEHNKVELSCHNRPISAVKFASFGNPLGHCGSFAVGTCQGDKDAAKTVAKECVGKL 591
Query: 766 SCSIEASEANLGATSCAAGTVKRLVVEALC 795
+C++ S G+T + K+L VE C
Sbjct: 592 NCTVNVSSDTFGSTLDCGDSPKKLAVELEC 621
>gi|168001886|ref|XP_001753645.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695052|gb|EDQ81397.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 929
Score = 693 bits (1788), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/857 (43%), Positives = 492/857 (57%), Gaps = 105/857 (12%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D RA+ I+G+R++L+S IHYPR+TP MWP L++K+KEGG D +++YVFWN HEP +
Sbjct: 35 VTYDQRALIINGQRRMLISAGIHYPRATPEMWPSLVQKSKEGGADVVQSYVFWNGHEPKQ 94
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QY+F G DL++FIK +Q GLY LRIGPYVCAEWN+GGFP WL ++PGI RT N+
Sbjct: 95 GQYNFEGRYDLVKFIKVVQQAGLYFHLRIGPYVCAEWNFGGFPYWLKDIPGI-VFRTDNE 153
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F M+ F + IV++ K+ +LFA QGGPII+AQIENEYGN+ +GD GK Y W A++
Sbjct: 154 PFKVAMEGFVSKIVNLMKENQLFAWQGGPIIMAQIENEYGNIEWAFGDGGKRYAMWAAEL 213
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A LD GVPW+MCQ+ DAP + F N P WTE+W GWF+ WG
Sbjct: 214 ALGLDAGVPWVMCQQDDAPGNIINTCNGYYCDGFKANTATKPAFWTEDWNGWFQYWGQSV 273
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R ED AFA+ARFFQ GG+FQNYYMY GGTNF RT+GGP++TTSYDYDAP+DEYG +
Sbjct: 274 PHRPVEDNAFAIARFFQRGGSFQNYYMYFGGTNFARTAGGPFMTTSYDYDAPLDEYGLIR 333
Query: 318 QPKWGHLRELHKLLKSMEKTLTY-----------GNVTNTDYGN---------------- 350
QPKWGHLR+LH +K E LT NV Y
Sbjct: 334 QPKWGHLRDLHAAIKLCEPALTAVDEVPLSTWLGPNVEAHVYSGRGQCAAFLANIDSWKI 393
Query: 351 ---SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV------------KVKRPNQAG 395
G +Y LP WSVSILPDCK FNTA+V QT + +V P+
Sbjct: 394 ATVQFKGKAYVLPPWSVSILPDCKNVVFNTAQVGAQTTLTRMTIVRSKLEGEVVMPSNML 453
Query: 396 NDQAP-------LQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADL 447
AP L+W+ E + +RG N L++Q + T D +DYLWY + +
Sbjct: 454 RKHAPESIVGSGLKWEASVEPVG---IRGAATLVSNRLLEQLNITKDSTDYLWYSISIKV 510
Query: 448 KDD--DPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKN 505
+ + S L + S +H +VN V S S+ +PV L GKN
Sbjct: 511 SVEAVTALSKTKSQAILVLGSMRDAVHIFVNRQLVGSAM----GSDVQVVQPVPLKEGKN 566
Query: 506 QISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDD 565
I LLS TVGLQNYG+ + GI G LL G + DLS+ +W+Y+VG+ G ++
Sbjct: 567 DIDLLSMTVGLQNYGAYLETWGAGIRGSALLRGLPSG---VLDLSTERWSYQVGIQG-EE 622
Query: 566 KKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYN 625
K+ + A+ + SS + P +TWYKTTF+AP DPV L+L MGKG AWVNG++
Sbjct: 623 KRLFETGTADGIQWDSSSSFPNASALTWYKTTFDAPKGTDPVALDLGSMGKGQAWVNGHH 682
Query: 626 LGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIW-----YHVPRSWIKDGVN 680
+GRYWP+ LA + GCST CDYRG Y +DKC NCG PSQ W YH+PR+W++ N
Sbjct: 683 MGRYWPSVLASQSGCST--CDYRGAYDADKCRTNCGKPSQRWQYVDMYHIPRAWLQLSNN 740
Query: 681 TLVLFEEFGGNPSQINFQTVVVGTACGQAHE-----------NKTME----------LTC 719
LVLFEE GG+ S+++ T C HE N +M+ L C
Sbjct: 741 LLVLFEEIGGDVSKVSLVTRSAPAVCTHVHESQPPPVLFWPANSSMDAMSSRSGEAVLEC 800
Query: 720 -HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGA 778
G+ I IK+ASFG+P+G+CG F++G+C A + L + K C+G CSI G
Sbjct: 801 IAGQHIRHIKFASFGNPKGSCGNFQRGTCHA-MKSLEVARKACMGMHRCSIPVQWQTFGE 859
Query: 779 TSCAAGTVKRLVVEALC 795
K L V+ C
Sbjct: 860 FDPCPDVSKSLAVQVFC 876
>gi|255538780|ref|XP_002510455.1| beta-galactosidase, putative [Ricinus communis]
gi|223551156|gb|EEF52642.1| beta-galactosidase, putative [Ricinus communis]
Length = 846
Score = 693 bits (1788), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/854 (44%), Positives = 497/854 (58%), Gaps = 85/854 (9%)
Query: 8 SRAILLCLILQTLFNLS--LAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIK 65
S + LL L L S + V++D +AI I+G+R+IL+SGSIHYPRSTP MW DLI+
Sbjct: 5 SVSKLLTFFLMVLLMGSKLVQCTVTYDKKAIIINGQRRILISGSIHYPRSTPEMWEDLIQ 64
Query: 66 KAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEW 125
KAK+GGLD I+TYVFW+ HE Y+F G DL+RFIKT+Q GLY LRIGPYVCAEW
Sbjct: 65 KAKDGGLDVIDTYVFWDVHETSPGNYNFDGRYDLVRFIKTVQKVGLYAHLRIGPYVCAEW 124
Query: 126 NYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIEN 185
N+GGFPVWL +PGI RT N+ F MQ FT IV M K E LFASQGGPIIL+QIEN
Sbjct: 125 NFGGFPVWLKYVPGIS-FRTDNEPFKAAMQGFTQKIVQMMKNENLFASQGGPIILSQIEN 183
Query: 186 EYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPN 234
EYG G AG+SYINW AKMA LD GVPW+MC+E DAP PM F PN
Sbjct: 184 EYGPESRALGAAGRSYINWAAKMAVGLDTGVPWVMCKEDDAPDPMINTCNGFYCDAFAPN 243
Query: 235 NPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRT 294
P P +WTE W+GWF +GG +R EDLAFAVARF Q GG++ NYYMYHGGTNFGR+
Sbjct: 244 KPYKPTLWTEAWSGWFTEFGGPIHQRPVEDLAFAVARFIQKGGSYFNYYMYHGGTNFGRS 303
Query: 295 SGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG----- 349
+GGP++TTSYDYDAPIDEYG + +PK+GHL+ LHK +K E L + + T G
Sbjct: 304 AGGPFITTSYDYDAPIDEYGLIREPKYGHLKALHKAIKLCEHALVSSDPSITSLGTYQQA 363
Query: 350 ----------------NSVSGSS-------YNLPAWSVSILPDCKTEEFNTAKVNTQTNV 386
N+ S + Y+LP WS+SILPDC+ FNTA+V QT
Sbjct: 364 HVFSSGRSCAAFLANYNAKSAARVMFNNMHYDLPPWSISILPDCRNVVFNTARVGAQT-- 421
Query: 387 KVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNAD 446
R W+ E I+ + AL L T D SDYLWY+T+ D
Sbjct: 422 --LRMQMLPTGSELFSWETYDEEISSLTDSSR-ITALGLLEQINVTRDTSDYLWYLTSVD 478
Query: 447 LKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQ 506
+ + L +L + S+G LH ++NG + S + F PV L G N+
Sbjct: 479 ISPSEAFLRNGQKPSLTVQSAGHGLHVFINGQFSGSAFGTRENRQLTFTGPVNLRAGTNR 538
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDK 566
I+LLS VGL N G ++ G+ GPVLL G + KDL+ KW+Y+VGL G
Sbjct: 539 IALLSIAVGLPNVGLHYETWKTGVQGPVLLNGLNQGK---KDLTWQKWSYQVGLKG---- 591
Query: 567 KFYNAKAAN--SERGWSSKNVPLN--RRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVN 622
+ N + N S W ++ + + + W+K F+AP N+P+ L+++ MGKG W+N
Sbjct: 592 EAMNLVSPNGVSSVDWIEGSLASSQGQALKWHKAYFDAPRGNEPLALDMRSMGKGQVWIN 651
Query: 623 GYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTL 682
G ++GRYW Y A+ D SC Y + KC CG P+Q WYHVPRSW+K N L
Sbjct: 652 GQSIGRYWMAY-AKGD---CNSCSYIWTFRPSKCQLGCGEPTQRWYHVPRSWLKPTKNLL 707
Query: 683 VLFEEFGGNPSQINFQTVVVGTACGQAHENK--------------------TMELTCH-G 721
V+FEE GG+ S+I+ + C A+E+ + L C G
Sbjct: 708 VVFEELGGDASKISLVKRSIEGVCADAYEHHPATKNYNTGGNDESSKLHQAKIHLRCAPG 767
Query: 722 RRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSC 781
+ I+ IK+ASFG P G CG+F++G+C A + +IEK+C+G++SC + S +N GA C
Sbjct: 768 QFIAAIKFASFGTPSGTCGSFQQGTCHAP-NTHSVIEKKCIGQESCMVTISNSNFGADPC 826
Query: 782 AAGTVKRLVVEALC 795
+K+L VEA+C
Sbjct: 827 -PNVLKKLSVEAVC 839
>gi|222618730|gb|EEE54862.1| hypothetical protein OsJ_02342 [Oryza sativa Japonica Group]
Length = 839
Score = 692 bits (1787), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/837 (45%), Positives = 492/837 (58%), Gaps = 96/837 (11%)
Query: 30 SHDGRAITIDGERKILLSGSIHYPRSTP------------GMWPDLIKKAKEGGLDAIET 77
++D +A+ ++G+R+IL+SGSIHYPRSTP MWPDLI+KAK+GGLD ++T
Sbjct: 27 TYDRKAVVVNGQRRILISGSIHYPRSTPEARRTRFPFLLLTMWPDLIEKAKDGGLDVVQT 86
Query: 78 YVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNM 137
YVFWN HEP QY F G DL+ FIK ++ GLYV LRIGPYVCAEWN+GGFPVWL +
Sbjct: 87 YVFWNGHEPSPGQYYFEGRYDLVHFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYV 146
Query: 138 PGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDA 197
PGI RT N+ F EMQ FTT IV+M K E LF QGGPIIL+QIENE+G + D G+
Sbjct: 147 PGIS-FRTDNEPFKAEMQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEP 205
Query: 198 GKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENW 246
K+Y +W A MA +L+ VPWIMC+E DAP P+ F+PN P+ P +WTE W
Sbjct: 206 AKAYASWAANMAVALNTSVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAW 265
Query: 247 TGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDY 306
T W+ +G P R EDLA+ VA+F Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDY
Sbjct: 266 TAWYTGFGIPVPHRPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDY 325
Query: 307 DAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN---------------- 350
DAPIDEYG L +PKWGHL++LHK +K E L G+ T GN
Sbjct: 326 DAPIDEYGLLREPKWGHLKQLHKAIKLCEPALVAGDPIVTSLGNAQKSSVFRSSTGACAA 385
Query: 351 -------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGND 397
+ +G Y+LP WS+SILPDCKT FNTA+V +Q + + AG
Sbjct: 386 FLENKDKVSYARVAFNGMHYDLPPWSISILPDCKTTVFNTARVGSQ--ISQMKMEWAGG- 442
Query: 398 QAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSG 456
W+ E IN F G+ L++Q T D +DYLWY T D+ D+ LS
Sbjct: 443 ---FAWQSYNEEINSF---GEDPLTTVGLLEQINVTRDNTDYLWYTTYVDVAQDEQFLSN 496
Query: 457 SSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASND---LFERPVKLTRGKNQISLLSAT 513
N+ L + S+G LH ++NG + T YG+ +D + VKL G N IS LS
Sbjct: 497 GENLKLTVMSAGHALHIFINGQL---KGTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIA 553
Query: 514 VGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKA 573
VGL N G F+ GI GPV L G +DL+ KWTY+VGL G + +
Sbjct: 554 VGLPNVGEHFETWNAGILGPVTLDGLNEGR---RDLTWQKWTYQVGLKGESMSLHSLSGS 610
Query: 574 ANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTY 633
+ E G + PL TWYK F AP ++P+ L++ MGKG W+NG +GRYWP Y
Sbjct: 611 STVEWGEPVQKQPL----TWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWPGY 666
Query: 634 LAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPS 693
A + C T CDYRG Y KC NCG+ SQ WYHVPRSW+ N LV+FEE+GG+P+
Sbjct: 667 KASGN-CGT--CDYRGEYDETKCQTNCGDSSQRWYHVPRSWLSPTGNLLVIFEEWGGDPT 723
Query: 694 QINFQTVVVGTACGQA--------------HENKTMELTC-HGRRISEIKYASFGDPQGA 738
I+ +G+ C +E + L C +G++I+EIK+ASFG PQG+
Sbjct: 724 GISMVKRSIGSVCADVSEWQPSMKNWHTKDYEKAKVHLQCDNGQKITEIKFASFGTPQGS 783
Query: 739 CGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
CG++ +G C A + K CVG++ C + G C GT+KR VVEA+C
Sbjct: 784 CGSYTEGGCHAH-KSYDIFWKNCVGQERCGVSVVPEIFGGDPC-PGTMKRAVVEAIC 838
>gi|2961390|emb|CAA18137.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 853
Score = 692 bits (1785), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/871 (44%), Positives = 493/871 (56%), Gaps = 108/871 (12%)
Query: 1 MATLKHCSRAIL-LCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGM 59
M T SR IL CL L + V++D +A+ I+G+R+IL SGSIHYPRSTP M
Sbjct: 4 MGTGDSASRLILWFCLGFLILGVGFVQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDM 63
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W DLI+KAK+GG+D IETYVFWN HEP +YDF G DL+RF+KTI GLY LRIGP
Sbjct: 64 WEDLIQKAKDGGIDVIETYVFWNLHEPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGP 123
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
YVCAEWN+GGFPVWL +PGI RT N+ F M+ FT IV++ K E LF SQGGPII
Sbjct: 124 YVCAEWNFGGFPVWLKYVPGI-SFRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPII 182
Query: 180 LAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM--------- 230
L+QIENEYG G G +Y+ W AKMA + + GVPW+MC+E DAP P+
Sbjct: 183 LSQIENEYGRQGQLLGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYC 242
Query: 231 --FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGG 288
F PN P P IWTE W+GWF +GG R +DLAF VARF Q GG+F NYYMYHGG
Sbjct: 243 DSFAPNKPYKPLIWTEAWSGWFTEFGGPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGG 302
Query: 289 TNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDY 348
TNFGRT+GGP++TTSYDYDAPIDEYG + QPK+GHL+ELH+ +K EK L + T
Sbjct: 303 TNFGRTAGGPFVTTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCEKALVSADPVVTSI 362
Query: 349 GNS-------------------------------------VSGSSYNLPAWSVSILPDCK 371
GN + YNLP WS+SILPDC+
Sbjct: 363 GNKQQVWIYYERFAHVYSAESGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPDCR 422
Query: 372 TEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-K 430
FNTAKV+ QW+ E ++ + F + L++Q
Sbjct: 423 NAVFNTAKVSN------------------FQWESYLEDLSS--LDDSSTFTTHGLLEQIN 462
Query: 431 STNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGAS 490
T D SDYLWYMT+ D+ D + L G TL I S+G +H +VNG S +
Sbjct: 463 VTRDTSDYLWYMTSVDIGDSESFLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNR 522
Query: 491 NDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLS 550
++ + L G N+I+LLS VGL N G F+ GI GPV L G + + DLS
Sbjct: 523 RFTYQGKINLHSGTNRIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGKM---DLS 579
Query: 551 SHKWTYKVGLYGLDDKKFYNAKAANSER-GW--SSKNVPLNRRMTWYKTTFEAPLENDPV 607
KWTY+VGL G + A N+ GW +S V + +TW+KT F+AP N+P+
Sbjct: 580 WQKWTYQVGLKG---EAMNLAFPTNTPSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEPL 636
Query: 608 VLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIW 667
L+++GMGKG WVNG ++GRYW T A D CS C Y G Y +KC CG P+Q W
Sbjct: 637 ALDMEGMGKGQIWVNGESIGRYW-TAFATGD-CS--HCSYTGTYKPNKCQTGCGQPTQRW 692
Query: 668 YHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTAC--------------------G 707
YHVPR+W+K N LV+FEE GGNPS ++ V C G
Sbjct: 693 YHVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSGVCAEVSEYHPNIKNWQIESYGKG 752
Query: 708 QAHENKTMELTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEID--VLPLIEKQCVGK 764
Q + L C G+ I+ IK+ASFG P G CG++++G C A +L ++CVGK
Sbjct: 753 QTFHRPKVHLKCSPGQAIASIKFASFGTPLGTCGSYQQGECHAATSYAILERYMQKCVGK 812
Query: 765 KSCSIEASEANLGATSCAAGTVKRLVVEALC 795
C++ S +N G C +KRL VEA+C
Sbjct: 813 ARCAVTISNSNFGKDPC-PNVLKRLTVEAVC 842
>gi|316995681|emb|CAA07236.2| beta-galactosidase precursor [Cicer arietinum]
Length = 839
Score = 691 bits (1782), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/832 (45%), Positives = 492/832 (59%), Gaps = 84/832 (10%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D +AITI+G+RKILLSGSIHYPRSTP MWPDLI+KAKEGGLD I+TYVFWN HEP
Sbjct: 26 VSYDYKAITINGQRKILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPSP 85
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y F GN DL++FI+ +Q GLYV LRIGPY CAEWN+GGFPVWL +PGI RT N
Sbjct: 86 GKYYFEGNYDLVKFIRLVQQAGLYVHLRIGPYACAEWNFGGFPVWLKYIPGI-SFRTDNG 144
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F +MQ FTT IV++ K E+L+ SQGGPIIL+QIENEYG + + G GK+Y W A M
Sbjct: 145 PFKFQMQKFTTKIVNIMKAERLYESQGGPIILSQIENEYGPMEYELGAPGKAYAQWAAHM 204
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A L GVPW+MC++ DAP P+ F+PN PK+WTE WTGWF +GG
Sbjct: 205 AIGLGTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTGFGGTV 264
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R AEDLAF+VARF Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAP+DEYG L
Sbjct: 265 PHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLLR 324
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------SVSGS-------------- 355
QPKWGHL++LH+ +K E L + T T GN S SG+
Sbjct: 325 QPKWGHLKDLHRAIKLCEPALVSADPTVTRLGNYQEAHVFKSKSGACAAFLANYNPHSYS 384
Query: 356 -------SYNLPAWSVSILPDCKTEEFNTAKVNTQT-NVKVKR-PNQAGNDQAPLQWKWR 406
YNLP WS+SILP+CK +NTA++ +Q+ +K+ R P G L WK
Sbjct: 385 TVAFGNQHYNLPPWSISILPNCKHTVYNTARLGSQSAQMKMTRVPIHGG-----LSWKAF 439
Query: 407 PEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
E + F + L++Q +T D+SDYLWY T+ + D+ N L +
Sbjct: 440 NE---ETTTTDDSSFTVTGLLEQINATRDLSDYLWYSTDVVINPDEGYFRNGKNPVLTVL 496
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
S+G LH ++NG + + F V L G N+ISLLS VGL N G F+
Sbjct: 497 SAGHALHVFINGQLSGTVYGSLDFPKLTFSESVNLRAGVNKISLLSVAVGLPNVGPHFET 556
Query: 526 VPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNV 585
G+ GP+ L G +DL+ KW+YKVGL G D + +++ + W +
Sbjct: 557 WNAGVLGPITLNGLNEGR---RDLTWQKWSYKVGLKGEDLSLHSLSGSSSVD--WLQGYL 611
Query: 586 PLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTES 644
R+ +TWYKTTF+AP P+ L++ MGKG W+NG +LGRYWP Y A S +
Sbjct: 612 VSRRQPLTWYKTTFDAPAGVAPLALDMNSMGKGQVWLNGQSLGRYWPAYKATG---SCDY 668
Query: 645 CDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGT 704
C+Y G Y KC NCG SQ WYHVP SW+K N LV+FEE GG+P+ + + +
Sbjct: 669 CNYAGTYNEKKCGTNCGEASQRWYHVPHSWLKPTGNLLVMFEELGGDPNGVFLVRRDIDS 728
Query: 705 ACGQAHE--------------------NKTMELTC-HGRRISEIKYASFGDPQGACGAFK 743
C +E + L+C G++IS IK+ASFG P G+CG ++
Sbjct: 729 VCADIYEWQPNLVSYQMQASGKVSRPVSPKAHLSCGPGQKISSIKFASFGTPVGSCGNYR 788
Query: 744 KGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+GSC A ++ CVG+ SC++ S G C +K+L VEA+C
Sbjct: 789 EGSCHAH-KSYDAFQRNCVGQSSCTVTVSPEIFGGDPC-PNVMKKLSVEAIC 838
>gi|356564794|ref|XP_003550633.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 839
Score = 691 bits (1782), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/843 (44%), Positives = 500/843 (59%), Gaps = 79/843 (9%)
Query: 13 LCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGL 72
C +L L+ ++ V++D +AI ++G+R+IL+SGSIHYPRSTP MWPDLI+KAK+GGL
Sbjct: 15 FCTLLLVLWVCAVTASVTYDHKAIVVNGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGL 74
Query: 73 DAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPV 132
D I+TYVFWN HEP +Y F DL++FIK +Q GLYV LRIGPY+CAEWN+GGFPV
Sbjct: 75 DVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLYVHLRIGPYICAEWNFGGFPV 134
Query: 133 WLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS 192
WL +PGI RT N+ F MQ FT IV + K+EKLF +QGGPII++QIENEYG V
Sbjct: 135 WLKYVPGI-AFRTDNEPFKAAMQKFTEKIVSIMKEEKLFQTQGGPIIMSQIENEYGPVEW 193
Query: 193 DYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKI 241
+ G GK+Y W ++MA LD GVPWIMC++ D P P+ FTPN PK+
Sbjct: 194 EIGAPGKAYTKWFSQMAVGLDTGVPWIMCKQQDTPDPLIDTCNGYYCENFTPNKKYKPKM 253
Query: 242 WTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT 301
WTENWTGW+ +GG P+R AED+AF+VARF Q GG+F NYYMYHGGTNF RTS G ++
Sbjct: 254 WTENWTGWYTEFGGAVPRRPAEDMAFSVARFVQNGGSFVNYYMYHGGTNFDRTSSGLFIA 313
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSV-------SG 354
TSYDYD PIDEYG LN+PKWGHLR+LHK +K E L + T T GN++ SG
Sbjct: 314 TSYDYDGPIDEYGLLNEPKWGHLRDLHKAIKLCEPALVSVDPTVTWPGNNLEVHVFKTSG 373
Query: 355 S---------------------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNV-KVKRPN 392
+ Y+LP WS+SILPDCKT FNTA++ Q+++ K+ N
Sbjct: 374 ACAAFLANYDTKSSASVKFGNGQYDLPPWSISILPDCKTAVFNTARLGAQSSLMKMTAVN 433
Query: 393 QAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDD 451
A + Q+ + P N+ L +Q T D +DYLWYMT+ ++ ++
Sbjct: 434 SAFDWQS---YNEEPASSNE-----DDSLTAYALWEQINVTRDSTDYLWYMTDVNIDANE 485
Query: 452 PILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLS 511
+ + L + S+G VLH +N + + + F VKL G N+ISLLS
Sbjct: 486 GFIKNGQSPVLTVMSAGHVLHVLINDQLSGTVYGGLDSHKLTFSDSVKLRVGNNKISLLS 545
Query: 512 ATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNA 571
VGL N G F+ G+ GPV L G +DLS KW+YK+GL G + N
Sbjct: 546 IAVGLPNVGPHFETWNAGVLGPVTLKGL---NEGTRDLSKQKWSYKIGLKG--EALNLNT 600
Query: 572 KAANSERGWSSKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
+ +S W ++ ++ + WYKTTF P NDP+ L++ MGKG AW+NG ++GR+W
Sbjct: 601 VSGSSSVEWVQGSLLAKQQPLAWYKTTFSTPAGNDPLALDMISMGKGQAWINGRSIGRHW 660
Query: 631 PTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGG 690
P Y+A + C C Y G Y KC NCG PSQ WYH+PRSW+ N LV+FEE+GG
Sbjct: 661 PGYIARGN-CG--DCYYAGTYTDKKCRTNCGEPSQRWYHIPRSWLNPSGNYLVVFEEWGG 717
Query: 691 NPSQINFQTVVVGTACGQAHE------NKTM-----------ELTC-HGRRISEIKYASF 732
+P+ I + C ++ N+ M L C G+ IS+IK+AS+
Sbjct: 718 DPTGITLVKRTTASVCADIYQGQPTLKNRQMLDSGKVVRPKAHLWCPPGKNISQIKFASY 777
Query: 733 GDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVE 792
G PQG CG F++GSC A +K C+GK+SC + + G C G K+L +E
Sbjct: 778 GLPQGTCGNFREGSCHAH-KSYDAPQKNCIGKQSCLVTVAPEVFGGDPC-PGIAKKLSLE 835
Query: 793 ALC 795
ALC
Sbjct: 836 ALC 838
>gi|255546099|ref|XP_002514109.1| beta-galactosidase, putative [Ricinus communis]
gi|223546565|gb|EEF48063.1| beta-galactosidase, putative [Ricinus communis]
Length = 827
Score = 690 bits (1780), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/841 (45%), Positives = 492/841 (58%), Gaps = 85/841 (10%)
Query: 15 LILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDA 74
L+ T + ++ V +D +AITI+ +R+IL+SGSIHYPRSTP MWP LI+KAKEGG++
Sbjct: 11 LLFVTAWVCNVTATVWYDHKAITINNQRRILISGSIHYPRSTPEMWPGLIQKAKEGGIEV 70
Query: 75 IETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWL 134
I+TYVFWN HEP QY F DL++FIK +Q GLYV LRIGPYVCAEWN+GGFP+WL
Sbjct: 71 IQTYVFWNGHEPSPGQYYFQDRYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPMWL 130
Query: 135 HNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDY 194
+PGI E RT N F MQ F TLIV+M K++KLF +QGGPIIL+QIENEYG V
Sbjct: 131 KYVPGI-EFRTDNGPFKAAMQKFVTLIVNMMKEQKLFQTQGGPIILSQIENEYGPVEWTI 189
Query: 195 GDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWT 243
G GK+Y W A MAT L+ GVPWIMC++ DAP P + PNN N PK+WT
Sbjct: 190 GAPGKAYTKWAAAMATGLNTGVPWIMCKQEDAPDPTIDTCNGFYCEGYKPNNYNKPKVWT 249
Query: 244 ENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTS 303
ENWTGW+ WG P R ED AF+VARF G+F NYYMYHGGTNF RT+ G ++ TS
Sbjct: 250 ENWTGWYTEWGASVPYRPPEDTAFSVARFIAASGSFVNYYMYHGGTNFDRTA-GLFMATS 308
Query: 304 YDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVT------------------- 344
YDYDAP+DEYG + PKWGHLR+LH+ +K E+ L + T
Sbjct: 309 YDYDAPLDEYGLTHDPKWGHLRDLHRAIKQSERALVSADPTVISLGKNQEAHVFQSKMGC 368
Query: 345 -------NTDYGNSVS--GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAG 395
+T Y V+ Y+LP WS+S+LPDCKT +NTAK++ Q+ K P +G
Sbjct: 369 AAFLANYDTQYSARVNFWNKPYSLPRWSISVLPDCKTVVYNTAKISAQSTQKWMMPVASG 428
Query: 396 NDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPIL 454
+ W+ + V G F L +QK T D +DYLWYMT+ + ++ L
Sbjct: 429 -------FSWQSHIDEVPVGYSAGTFTKVGLWEQKYLTGDKTDYLWYMTDVTINSNEGFL 481
Query: 455 SGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATV 514
N L + S+G VLH ++NG+ S + F + VKL G N+I+LLSATV
Sbjct: 482 RSGKNPFLTVASAGHVLHVFINGHLAGSAYGSLENPKLTFSQNVKLVGGVNKIALLSATV 541
Query: 515 GLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAA 574
GL N G +D G+ GPV L G D++ KW+YK+GL G D K F +
Sbjct: 542 GLANVGVHYDTWNVGVLGPVTLQGLNQGTL---DMTKWKWSYKIGLKGEDLKLF----SG 594
Query: 575 NSERGWS-----SKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRY 629
+ GW+ +K PL TWYKT AP NDPV L + MGKG ++NG ++GR+
Sbjct: 595 GANVGWAQGAQLAKKTPL----TWYKTFINAPPGNDPVALYMGSMGKGQMYINGRSIGRH 650
Query: 630 WPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
WP Y A+ + + CDY G Y KC CG P Q WYHVPRSW+K N LV+FEE G
Sbjct: 651 WPAYTAKGN---CKDCDYAGYYDDQKCRSGCGQPPQQWYHVPRSWLKPTGNLLVVFEEMG 707
Query: 690 GNPSQINFQTVVVGTACGQAH----------ENKTMELTCH-----GRRISEIKYASFGD 734
G+P+ I+ VVG+ C EN + H G++ S+I +AS+G
Sbjct: 708 GDPTGISLVKRVVGSVCADIDDDQPEMKSWTENIPVTPKAHLWCPPGQKFSKIVFASYGW 767
Query: 735 PQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEAL 794
PQG CGA+++G C A P +K C+GK +C I+ + A G C G+ KRL V+
Sbjct: 768 PQGRCGAYRQGKCHALKSWDPF-QKYCIGKGACDIDVAPATFGGDPC-PGSAKRLSVQLQ 825
Query: 795 C 795
C
Sbjct: 826 C 826
>gi|357483611|ref|XP_003612092.1| Beta-galactosidase [Medicago truncatula]
gi|355513427|gb|AES95050.1| Beta-galactosidase [Medicago truncatula]
Length = 843
Score = 689 bits (1779), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/850 (45%), Positives = 490/850 (57%), Gaps = 86/850 (10%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
L + TLF L++ V++D +AI I+G+R+IL SGSIHYPRSTP MW DLI KAKEG
Sbjct: 9 FLFLFVSLTLF-LAVYSDVTYDRKAIIINGQRRILFSGSIHYPRSTPDMWEDLIYKAKEG 67
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GLD IETYVFWN HEP Y+F G DL+RFI+T+ GLY LRIGPYVCAEWN+GGF
Sbjct: 68 GLDVIETYVFWNVHEPSPGNYNFEGRNDLVRFIQTVHKAGLYAHLRIGPYVCAEWNFGGF 127
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
PVWL +PGI R N+ F MQ FT IV M K E+L+ SQGGPIIL+QIENEYG
Sbjct: 128 PVWLKYVPGIS-FRQDNEPFKKAMQGFTEKIVGMMKSERLYESQGGPIILSQIENEYGAQ 186
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSP 239
G G +Y++W AKMA + GVPWIMC+E DAP P+ FTPN P P
Sbjct: 187 SKMLGPVGYNYMSWAAKMAVEMGTGVPWIMCKEDDAPDPVINTCNGFYCDKFTPNKPYKP 246
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
+WTE W+GWF +GG KR +DLAFAVARF Q GG+F NYYMYHGGTNFGRT+GGP+
Sbjct: 247 TMWTEAWSGWFSEFGGPIHKRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPF 306
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------S 351
+TTSYDYDAP+DEYG + QPK+GHL+ELHK +K EK L + T GN +
Sbjct: 307 ITTSYDYDAPLDEYGLIRQPKYGHLKELHKAIKMCEKALISTDPVVTSLGNFQQAYVYTT 366
Query: 352 VSGSS---------------------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR 390
SG YNLP WSVSILPDC+ FNTAKV QT+
Sbjct: 367 ESGDCSAFLSNYDSKSSARVMFNNMHYNLPPWSVSILPDCRNAVFNTAKVGVQTSQMQML 426
Query: 391 PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKD 449
P + W+ E D + L++Q T D SDYLWY+T+ D+
Sbjct: 427 PTNSER----FSWESFEE---DTSSSSATTITASGLLEQINVTRDTSDYLWYITSVDVGS 479
Query: 450 DDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFER---PVKLTRGKNQ 506
+ L G +L + S+G +H ++NG S YG D R V L G N
Sbjct: 480 SESFLHGGKLPSLIVQSTGHAVHVFINGRLSGS---AYGTREDRRFRYTGDVNLRAGTNT 536
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDK 566
I+LLS VGL N G F+ GI GPV++ G + DLS KWTY+VGL G
Sbjct: 537 IALLSVAVGLPNVGGHFETWNTGILGPVVIHGLDKGKL---DLSWQKWTYQVGLKGEAMN 593
Query: 567 KFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNL 626
++ E S+ V N+ +TW+KT F+AP +P+ L++ GMGKG W+NG ++
Sbjct: 594 LASPDGISSVEWMQSAVVVQRNQPLTWHKTFFDAPEGEEPLALDMDGMGKGQIWINGISI 653
Query: 627 GRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
GRYW T +A S C+Y G + KC CG P+Q WYHVPRSW+K N LV+FE
Sbjct: 654 GRYW-TAIATG---SCNDCNYAGSFRPPKCQLGCGQPTQRWYHVPRSWLKQNHNLLVVFE 709
Query: 687 EFGGNPSQINFQTVVVGTACGQAHENK--------------------TMELTCH-GRRIS 725
E GG+PS+I+ V + C E + L C+ G+ IS
Sbjct: 710 ELGGDPSKISLAKRSVSSVCADVSEYHPNLKNWHIDSYGKSENFRPPKVHLHCNPGQAIS 769
Query: 726 EIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGT 785
IK+ASFG P G CG++++G+C + ++E++C+GK C + S +N G C
Sbjct: 770 SIKFASFGTPLGTCGSYEQGACHSS-SSYDILEQKCIGKPRCIVTVSNSNFGRDPC-PNV 827
Query: 786 VKRLVVEALC 795
+KRL VEA+C
Sbjct: 828 LKRLSVEAVC 837
>gi|61162201|dbj|BAD91082.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 854
Score = 689 bits (1778), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/832 (44%), Positives = 491/832 (59%), Gaps = 84/832 (10%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D +AI I+G+R+IL+SGSIHYPRSTP MW DLI+KAK+GGLD +ETYVFWN HEP
Sbjct: 28 VTYDRKAIVINGQRRILISGSIHYPRSTPEMWEDLIQKAKDGGLDVVETYVFWNVHEPTP 87
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y+F G DL+RF+KTIQ GLY LRIGPYVCAEWN+GGFPVWL +PGI RT N+
Sbjct: 88 GNYNFEGRYDLVRFLKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGIS-FRTDNE 146
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ FT IV + K E LF SQGGPIIL+QIENEYG +G AG +YI W A+M
Sbjct: 147 PFKRAMQGFTQKIVGLMKSESLFESQGGPIILSQIENEYGAQSKLFGAAGHNYITWAAEM 206
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A LD GVPW+MC+E DAP P+ F+PN P P IWTE W+GWF +GG
Sbjct: 207 AVGLDTGVPWVMCKEEDAPDPVINTCNGFYCDSFSPNRPYKPTIWTETWSGWFTEFGGPI 266
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
+R +DLA+AVA F Q GG+F NYYMYHGGTNFGRT+GGP++TTSYDYDAP+DEYG +
Sbjct: 267 HQRPVQDLAYAVATFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLIR 326
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------SVSGSS------------- 356
QPK+GHL+ELHK +K E+ L + T GN S SG
Sbjct: 327 QPKYGHLKELHKAIKMCERALVSADPIITSLGNFQQAYVYTSESGDCSAFLSNHDSKSAA 386
Query: 357 --------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
YNLP WS+SILPDC+ FNTAKV QT+ P + L W+ E
Sbjct: 387 RVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSQMQMLPT----NIPMLSWESYDE 442
Query: 409 MINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
+ + L++Q T D +DYLWY+T+ D+ + L G TL + S+
Sbjct: 443 DLTS--MDDSSTMTAPGLLEQINVTRDSTDYLWYITSVDIDSSESFLHGGELPTLIVQST 500
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G +H ++NG S + + + V L G N+I+LLS VGL N G F+
Sbjct: 501 GHAVHIFINGQLTGSAFGTRESRRFTYTGKVNLRAGTNKIALLSVAVGLPNVGGHFEAWN 560
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
GI GPV L G + DLS KWTY+VGL G + ++ A S W S ++
Sbjct: 561 TGILGPVALHGLNQGKW---DLSWQKWTYQVGLKG--EAMNLVSQNAFSSVEWISGSLIA 615
Query: 588 NRR---MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTES 644
++ +TW+KT F P ++P+ L+++GMGKG W+NG ++GRYW + C+
Sbjct: 616 QKKQQPLTWHKTIFNEPEGSEPLALDMEGMGKGQIWINGQSIGRYWTAF--ANGNCN--G 671
Query: 645 CDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGT 704
C Y G + KC CG P+Q +YHVPRSW+K N LVLFEE GG+PS+I+ V +
Sbjct: 672 CSYAGGFRPTKCQSGCGKPTQRYYHVPRSWLKPTQNLLVLFEELGGDPSRISLVKRAVSS 731
Query: 705 ACGQAHE--------------------NKTMELTCH-GRRISEIKYASFGDPQGACGAFK 743
C + E + + L C+ G+ IS IK+ASFG P G CG+++
Sbjct: 732 VCSEVAEYHPTIKNWHIESYGKVEDFHSPKVHLRCNPGQAISSIKFASFGTPLGTCGSYQ 791
Query: 744 KGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+G+C A +++K+C+GK+ C++ S +N G C +KRL VEA+C
Sbjct: 792 EGTCHATTS-YSVVQKKCIGKQRCAVTISNSNFG-DPCPK-VLKRLSVEAVC 840
>gi|242053381|ref|XP_002455836.1| hypothetical protein SORBIDRAFT_03g025990 [Sorghum bicolor]
gi|241927811|gb|EES00956.1| hypothetical protein SORBIDRAFT_03g025990 [Sorghum bicolor]
Length = 785
Score = 688 bits (1775), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/809 (46%), Positives = 474/809 (58%), Gaps = 84/809 (10%)
Query: 46 LSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKT 105
+SGS+HYPRS P MWPDLI+KAK+GGLD ++TYVFWN HEP R QY F G DL+ FIK
Sbjct: 1 MSGSVHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRGQYYFEGRYDLVHFIKL 60
Query: 106 IQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMA 165
++ GLYV LRIGPYVCAEWN+GGFPVWL +PGI RT N+ F EMQ FTT IVDM
Sbjct: 61 VKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIS-FRTDNEPFKAEMQKFTTKIVDMM 119
Query: 166 KKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESD 225
K E LF QGGPIIL+QIENE+G + D G+ K+Y +W A MA +L+ VPW+MC+E D
Sbjct: 120 KSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAVALNTSVPWVMCKEDD 179
Query: 226 APSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQ 274
AP P+ F+PN P+ P +WTE WT W+ +G P R EDLA+ VA+F Q
Sbjct: 180 APDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPHRPVEDLAYGVAKFIQ 239
Query: 275 FGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSM 334
GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAPIDEYG L +PKWGHL+ELHK +K
Sbjct: 240 KGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLREPKWGHLKELHKAIKLC 299
Query: 335 EKTLTYGNVTNTDYGN-----------------------------SVSGSSYNLPAWSVS 365
E L G+ T GN S +G YNLP WS+S
Sbjct: 300 EPALVAGDPIVTSLGNAQQASVFRSSTDACVAFLENKDKVSYARVSFNGMHYNLPPWSIS 359
Query: 366 ILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNT 425
ILPDCKT +NTA+V +Q + + AG W+ E IN G F
Sbjct: 360 ILPDCKTTVYNTARVGSQ--ISQMKMEWAGG----FTWQSYNEDINSL---GDESFVTVG 410
Query: 426 LIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQW 484
L++Q T D +DYLWY T D+ D+ LS N L + S+G LH +VNG
Sbjct: 411 LLEQINVTRDNTDYLWYTTYVDVAQDEQFLSNGKNPVLTVMSAGHALHIFVNGQLTG--- 467
Query: 485 TKYGASND---LFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAG 541
T YG+ +D + VKL G N IS LS VGL N G F+ GI GPV L G
Sbjct: 468 TVYGSVDDPKLTYRGNVKLWPGSNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNE 527
Query: 542 DETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAP 601
+DL+ KWTYKVGL G D + +++ E G + PL TWYK F AP
Sbjct: 528 GR---RDLTWQKWTYKVGLKGEDLSLHSLSGSSSVEWGEPMQKQPL----TWYKAFFNAP 580
Query: 602 LENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCG 661
++P+ L++ MGKG W+NG +GRYWP Y A + CDYRG Y KC NCG
Sbjct: 581 DGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKASG---TCGICDYRGEYDEKKCQTNCG 637
Query: 662 NPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQA------------ 709
+ SQ WYHVPRSW+ N LV+FEE+GG+P+ I+ G+ C
Sbjct: 638 DSSQRWYHVPRSWLNPTGNLLVIFEEWGGDPTGISMVKRTTGSICADVSEWQPSMTNWRT 697
Query: 710 --HENKTMELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKS 766
+E + L C HGR++++IK+ASFG PQG+CG++ +G C A + K C+G++
Sbjct: 698 KDYEKAKIHLQCDHGRKMTDIKFASFGTPQGSCGSYSEGGCHAH-KSYDIFWKNCIGQER 756
Query: 767 CSIEASEANLGATSCAAGTVKRLVVEALC 795
C + G C GT+KR VVEA+C
Sbjct: 757 CGVSVVPNVFGGDPC-PGTMKRAVVEAIC 784
>gi|297826725|ref|XP_002881245.1| hypothetical protein ARALYDRAFT_902346 [Arabidopsis lyrata subsp.
lyrata]
gi|297327084|gb|EFH57504.1| hypothetical protein ARALYDRAFT_902346 [Arabidopsis lyrata subsp.
lyrata]
Length = 887
Score = 687 bits (1774), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/881 (43%), Positives = 506/881 (57%), Gaps = 115/881 (13%)
Query: 12 LLCLILQTLFNLSLA-------YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLI 64
+L LI+ L + + VS+D RA+ I +R++L+S IHYPR+TP MW DLI
Sbjct: 14 ILSLIIALLVYFPIVSGSFFKPFNVSYDHRALIIADKRRMLVSAGIHYPRATPEMWSDLI 73
Query: 65 KKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAE 124
+K+KEGG D I+TYVFW+ HEP++ QY+F G DL++F+K I GLY+ LRIGPYVCAE
Sbjct: 74 EKSKEGGADVIQTYVFWSGHEPVKGQYNFEGRYDLVKFVKLIGSSGLYLHLRIGPYVCAE 133
Query: 125 WNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIE 184
WN+GGFPVWL ++PGI + RT N+ F EMQ F T IVD+ + KLF QGGPII+ QIE
Sbjct: 134 WNFGGFPVWLRDIPGI-QFRTDNEPFKKEMQKFVTKIVDLMRDAKLFCWQGGPIIMLQIE 192
Query: 185 NEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTP 233
NEYG+V YG GK Y+ W A MA L GVPW+MC+++DAP + F P
Sbjct: 193 NEYGDVEKSYGQKGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGFKP 252
Query: 234 NNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGR 293
N+ P +WTE+W GW+ WGG P R AEDLAFAVARF+Q GG+FQNYYMY GGTNFGR
Sbjct: 253 NSQMKPILWTEDWDGWYTKWGGSLPHRPAEDLAFAVARFYQRGGSFQNYYMYFGGTNFGR 312
Query: 294 TSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGN----------- 342
TSGGP+ TSYDYDAP+DEYG ++PKWGHL++LH +K E L +
Sbjct: 313 TSGGPFYITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAADAPQYRKLGSNQ 372
Query: 343 -------------------VTNTDYGNSV----SGSSYNLPAWSVSILPDCKTEEFNTAK 379
+ N D S +G SY LP WSVSILPDC+ FNTAK
Sbjct: 373 EAHIYRGDGETGGKVCAAFLANIDEHKSAHVKFNGQSYTLPPWSVSILPDCRHVAFNTAK 432
Query: 380 VNTQTNVK-VKRPNQAGNDQAPLQWKWRPEMIN-----------DFVVRGKGHFALNTLI 427
V QT+VK V+ + ++ LQ R + ++ + G+ +F L+
Sbjct: 433 VGAQTSVKTVESARPSLGSKSILQKVVRQDNVSYISKSWMALKEPIGIWGENNFTFQGLL 492
Query: 428 DQ-KSTNDVSDYLWYMTNADLKDDDPIL--SGSSNMTLRINSSGQVLHAYVNGNY---VD 481
+ T D SDYLW+ T + +DD +N T+ I+S VL +VN V
Sbjct: 493 EHLNVTKDRSDYLWHKTRITVSEDDISFWKKNGANPTVSIDSMRDVLRVFVNKQLSGSVV 552
Query: 482 SQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVG-RA 540
W K +PV+ +G N + LL+ TVGLQNYG+ + G G L G +
Sbjct: 553 GHWVKA-------VQPVRFMQGNNDLLLLTQTVGLQNYGAFLEKDGAGFRGKAKLTGFKN 605
Query: 541 GDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRM-TWYKTTFE 599
GD DL+ WTY+VGL G + +K Y + N + WS+ + + WYKT F+
Sbjct: 606 GD----MDLAKSSWTYQVGLKG-EAEKIYTVE-HNEKAEWSTLETDASPSIFMWYKTYFD 659
Query: 600 APLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYN 659
P DPVVL+L+ MGKG AWVNG+++GRYW ++++DGC +CDYRG Y SDKC N
Sbjct: 660 TPAGTDPVVLDLESMGKGQAWVNGHHIGRYW-NIISQKDGCE-RTCDYRGAYYSDKCTTN 717
Query: 660 CGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHE-------- 711
CG P+Q YHVPRSW+K N LVLFEE GGNP I+ +TV G CGQ E
Sbjct: 718 CGKPTQTRYHVPRSWLKPSSNLLVLFEETGGNPFNISVKTVTAGILCGQVLESHYPPLRK 777
Query: 712 -------NKTME---------LTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVL 754
N TM L C G IS I++AS+G P+G+C F G C A + L
Sbjct: 778 WSTPDYINGTMSINSVAPEVYLHCEDGHVISSIEFASYGTPRGSCDRFSIGKCHAS-NSL 836
Query: 755 PLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
++ + C G+ SC IE S + C +GT+K L V A C
Sbjct: 837 SIVSEACKGRTSCFIEVSNTAFRSDPC-SGTLKTLAVMARC 876
>gi|57232107|gb|AAW47739.1| beta-galactosidase [Prunus persica]
Length = 853
Score = 687 bits (1774), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/852 (44%), Positives = 496/852 (58%), Gaps = 90/852 (10%)
Query: 13 LCLILQTLFNLSLAY---RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
LCL L + L V++D RAI I+G+R+IL+SGSIHYPRSTP MW DLI+KAK+
Sbjct: 9 LCLFLGLVCFLGFQLVQCTVTYDRRAIVINGQRRILISGSIHYPRSTPEMWEDLIQKAKD 68
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
GGLD +ETYVFWN HEP Y+F G DL+RF+KTIQ GLY LRIGPYVCAEWN+GG
Sbjct: 69 GGLDVVETYVFWNVHEPSPGNYNFKGRYDLVRFLKTIQKAGLYAHLRIGPYVCAEWNFGG 128
Query: 130 FPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGN 189
FPVWL +PGI RT N+ F MQ FT IV + K EKLF SQGGPIIL+QIENEYG
Sbjct: 129 FPVWLKYVPGIS-FRTDNEPFKRAMQGFTEKIVGLMKSEKLFESQGGPIILSQIENEYGA 187
Query: 190 VMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNS 238
+G AG +Y+ W A MA L GVPW+MC+E DAP P+ F PN P
Sbjct: 188 QSKLFGAAGHNYMTWAANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDSFAPNKPYK 247
Query: 239 PKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGP 298
P IWTE W+GWF +GG +R +DLA+AVARF Q GG+F NYYMYHGGTNFGRT+GGP
Sbjct: 248 PTIWTEAWSGWFSEFGGPIHQRPVQDLAYAVARFIQKGGSFVNYYMYHGGTNFGRTAGGP 307
Query: 299 YLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN-------- 350
++TTSYDYDAP+DEYG + QPK+GHL+ELH+ +K E+ L + T GN
Sbjct: 308 FITTSYDYDAPLDEYGLIRQPKYGHLKELHRAIKMCERALVSADPIITSLGNFQQAYVYT 367
Query: 351 SVSGSS---------------------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVK 389
S SG YNLP WS+SILPDC+ FNTAKV QT+
Sbjct: 368 SESGDCSAFLSNHDSKSAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSQMGM 427
Query: 390 RPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLK 448
P N Q L W+ E I + L++Q T D +DYLWY T+ D+
Sbjct: 428 LPT---NIQM-LSWESYDEDITS--LDDSSTITAPGLLEQINVTRDSTDYLWYKTSVDIG 481
Query: 449 DDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQIS 508
+ L G TL + S+G +H ++NG S + + + V L G N+I+
Sbjct: 482 SSESFLRGGELPTLIVQSTGHAVHIFINGQLSGSSFGTRESRRFTYTGKVNLHAGTNRIA 541
Query: 509 LLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKF 568
LLS VGL N G F+ GI GPV L G + DLS KWTY+VGL G +
Sbjct: 542 LLSVAVGLPNVGGHFEAWNTGILGPVALHGLDQGKW---DLSWQKWTYQVGLKG----EA 594
Query: 569 YNAKAAN--SERGWSSKNVPLNRR--MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGY 624
N + N S W ++ ++ +TW+KT F AP ++P+ L+++GMGKG W+NG
Sbjct: 595 MNLVSPNSISSVDWMRGSLAAQKQQPLTWHKTLFNAPEGDEPLALDMEGMGKGQIWINGQ 654
Query: 625 NLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVL 684
++GRYW + C+ C Y G + KC CG P+Q YHVPRSW+K N LV+
Sbjct: 655 SIGRYWTAF--ANGNCN--GCSYAGGFRPPKCQVGCGQPTQRVYHVPRSWLKPMQNLLVI 710
Query: 685 FEEFGGNPSQINFQTVVVGTACGQAHE--------------------NKTMELTCH-GRR 723
FEEFGG+PS+I+ V + C + E + + L C+ G+
Sbjct: 711 FEEFGGDPSRISLVKRSVSSVCAEVAEYHPTIKNWHIESYGKAEDFHSPKVHLRCNPGQA 770
Query: 724 ISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAA 783
IS IK+ASFG P G CG++++G+C A +++K+C+GK+ C++ S +N G C
Sbjct: 771 ISSIKFASFGTPLGTCGSYQEGTCHAATS-YSVLQKKCIGKQRCAVTISNSNFG-DPCPK 828
Query: 784 GTVKRLVVEALC 795
+KRL VEA+C
Sbjct: 829 -VLKRLSVEAVC 839
>gi|61162196|dbj|BAD91080.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 851
Score = 687 bits (1773), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/834 (44%), Positives = 483/834 (57%), Gaps = 79/834 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D R++ IDG+RK+L+S +IHYPRS P MWP L++ AKEGG+D IETYVFWN HEP
Sbjct: 29 VSYDSRSLIIDGQRKLLISAAIHYPRSVPEMWPKLVQTAKEGGVDVIETYVFWNGHEPSP 88
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y F G DL++F+K ++ G+++ILRIGP+V AEW +GG PVWLH +PG RT NK
Sbjct: 89 GNYYFGGRYDLVKFVKIVEQAGMHLILRIGPFVAAEWYFGGIPVWLHYVPGTV-FRTENK 147
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ FTT IVD+ K+EK FASQGGPIILAQ+ENEYG DYG+ GK Y W A M
Sbjct: 148 PFKYHMQKFTTFIVDLMKQEKFFASQGGPIILAQVENEYGYYEKDYGEGGKQYAMWAASM 207
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A S +IGVPWIMCQ+ DAP + FTP N PKIWTENW GWFK++GG +
Sbjct: 208 AVSQNIGVPWIMCQQFDAPESVINTCNSFYCDQFTPIYQNKPKIWTENWPGWFKTFGGWN 267
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R AED+AF+VARFFQ GG+ NYYMYHGGTNFGRTSGGP++TTSYDY+APIDEYG
Sbjct: 268 PHRPAEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGLPR 327
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGS---------------------- 355
PKWGHL++LH+ +K E + TN G S+
Sbjct: 328 LPKWGHLKQLHRAIKLCEHIMLNSQPTNVSLGPSLEADVFTNSSGACAAFIANMDDKNDK 387
Query: 356 -------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQ-----AGNDQAPLQW 403
SY+LPAWSVSILPDCK FNTAKV +Q++V P D++
Sbjct: 388 TVEFRNMSYHLPAWSVSILPDCKNVVFNTAKVGSQSSVVEMLPESLQLSVGSADKSLKDL 447
Query: 404 KWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTL 462
KW + + G+ F + L+D +T +DYLWY T+ + +++ L S+ L
Sbjct: 448 KWD-VFVEKAGIWGEADFVKSGLVDHINTTKFTTDYLWYTTSILVGENEEFLKKGSSPVL 506
Query: 463 RINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSK 522
I S G +HA+VN S + P+ L GKN I+LLS TVGLQN GS
Sbjct: 507 LIESKGHAVHAFVNQELQASAAGNGTHFPFKLKAPISLKEGKNDIALLSMTVGLQNAGSF 566
Query: 523 FDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGW-S 581
++ V G+ V G DLS++ WTYK+GL G + + + + W S
Sbjct: 567 YEWVGAGLTS----VKIQGFNNGTIDLSAYNWTYKIGLEG--EHQGLDKEEGFGNVNWIS 620
Query: 582 SKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCS 641
+ P + +TWYK + P +DPV L++ MGKG AW+NG +GRYWP GC
Sbjct: 621 ASEPPKEQPLTWYKVIVDPPPGDDPVGLDMIHMGKGLAWLNGEEIGRYWPRK-GPLHGCV 679
Query: 642 TESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVV 701
E C+YRG + DKC CG P+Q WYHVPRSW K N LV+FEE GG+PS+I F
Sbjct: 680 KE-CNYRGKFDPDKCNTGCGEPTQRWYHVPRSWFKQSGNVLVIFEEKGGDPSKIEFSRRK 738
Query: 702 VGTACGQAHEN-------------------KTMELTC-HGRRISEIKYASFGDPQGACGA 741
+ C EN T+ L C IS +K+ASFG+P GAC +
Sbjct: 739 ITGVCALVAENYPSIDLESWNDGSGSNKTVATIHLGCPEDTHISSVKFASFGNPTGACRS 798
Query: 742 FKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+ +G C + + + ++EK C+ K C IE + N SC + K+L VE C
Sbjct: 799 YTQGDCH-DPNSISVVEKVCLNKNRCDIELTGENFNKGSCLS-EPKKLAVEVQC 850
>gi|114217393|dbj|BAF31232.1| beta-D-galactosidase [Persea americana]
Length = 889
Score = 686 bits (1770), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/886 (43%), Positives = 499/886 (56%), Gaps = 121/886 (13%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
+++ L + + VS+D RA+ IDG+R++L+S IHYPR+TP MWPDLI K+KEG
Sbjct: 13 VVMTLQIAACTEFFKPFNVSYDHRALIIDGKRRMLISSGIHYPRATPEMWPDLIAKSKEG 72
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
G D I+TY FWN HEP+R QY+F G D+++FIK GLY LRIGPYVCAEWN+GGF
Sbjct: 73 GADLIQTYAFWNGHEPIRGQYNFEGRYDIVKFIKLAGSAGLYFHLRIGPYVCAEWNFGGF 132
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
PVWL ++PGIE RT N + +EMQ F IVD+ ++E LF+ QGGPIIL QIENEYGN+
Sbjct: 133 PVWLRDIPGIE-FRTDNAPYKDEMQRFVKKIVDLMRQEMLFSWQGGPIILLQIENEYGNI 191
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSP 239
YG GK Y+ W A MA L GVPW+MC+++DAP + F PN+ P
Sbjct: 192 ERLYGQRGKDYVKWAADMAIGLGAGVPWVMCRQTDAPENIIDACNAFYCDGFKPNSYRKP 251
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
+WTE+W GW+ SWGG+ P R ED AFAVARFFQ GG++ NYYM+ GGTNFGRTSGGP+
Sbjct: 252 ALWTEDWNGWYTSWGGRVPHRPVEDNAFAVARFFQRGGSYHNYYMFFGGTNFGRTSGGPF 311
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTL--------------------- 338
TSYDYDAPIDEYG L+QPKWGHL++LH +K E L
Sbjct: 312 YVTSYDYDAPIDEYGLLSQPKWGHLKDLHSAIKLCEPALVAVDDAPQYIRLGPMQEAHVY 371
Query: 339 ------------TYGN-------VTNTDYGNSVS----GSSYNLPAWSVSILPDCKTEEF 375
T GN + N D NS + G Y+LP WSVSILPDCK F
Sbjct: 372 RHSSYVEDQSSSTLGNGTLCSAFLANIDEHNSANVKFLGQVYSLPPWSVSILPDCKNVAF 431
Query: 376 NTAKVNTQTNVKVKRPN---------------QAGNDQAPLQWKWRPEMINDFVVRGKGH 420
NTAKV +Q +VK + G W E I ++ G +
Sbjct: 432 NTAKVASQISVKTVEFSSPFIENTTEPGYLLLHDGVHHISTNWMILKEPIGEW---GGNN 488
Query: 421 FALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMT--LRINSSGQVLHAYVNG 477
F +++ T D SDYLWY+ + D+D +S ++ L I+S V+ +VNG
Sbjct: 489 FTAEGILEHLNVTKDTSDYLWYIMRLHISDEDISFWEASEVSPKLIIDSMRDVVRIFVNG 548
Query: 478 NYVDS---QWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPV 534
S +W + E+PV L +G N++++LS TVGLQNYG+ + G G +
Sbjct: 549 QLAGSHVGRWVR-------VEQPVDLVQGYNELAILSETVGLQNYGAFLEKDGAGFKGQI 601
Query: 535 LLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWY 594
L G E DL++ W Y+VGL G + K ++ + S N + TWY
Sbjct: 602 KLTGLKSGEY---DLTNSLWVYQVGLRG-EFMKIFSLEEHESADWVDLPNDSVPSAFTWY 657
Query: 595 KTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSD 654
KT F+AP DPV L L MGKG AWVNG+++GRYW + +A DGC +SCDYRG Y
Sbjct: 658 KTFFDAPQGKDPVSLYLGSMGKGQAWVNGHSIGRYW-SLVAPVDGC--QSCDYRGAYHES 714
Query: 655 KCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHENK- 713
KCA NCG P+Q WYH+PRSW++ N LV+FEE GGNP +I+ + + C + E+
Sbjct: 715 KCATNCGKPTQSWYHIPRSWLQPSKNLLVIFEETGGNPLEISVKLHSTSSICTKVSESHY 774
Query: 714 -----------------------TMELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEA 749
+ L C +G+RIS I +ASFG PQG+C F +G C A
Sbjct: 775 PPLHLWSHKDIVNGKVSISNAVPEIHLQCDNGQRISSIMFASFGTPQGSCQRFSQGDCHA 834
Query: 750 EIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+ ++ + C G+ +CSI S G C G VK L VEA C
Sbjct: 835 P-NSFSVVSEACQGRNNCSIGVSNKVFGGDPC-RGVVKTLAVEAKC 878
>gi|168045621|ref|XP_001775275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673356|gb|EDQ59880.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 916
Score = 686 bits (1769), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/860 (43%), Positives = 499/860 (58%), Gaps = 112/860 (13%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D RA+ IDGER++L+S IHYPR+TP MWP +I+ AK+GG D ++TYVFWN HEP +
Sbjct: 32 VTYDQRAVLIDGERRMLISAGIHYPRATPEMWPSIIQHAKDGGADVVQTYVFWNGHEPEQ 91
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QY+F G DL++FIK ++ GLY LRIGPYVCAEWN+GGFP WL +PGI RT N+
Sbjct: 92 GQYNFEGRYDLVKFIKLVKQAGLYFHLRIGPYVCAEWNFGGFPYWLKEIPGIV-FRTDNE 150
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ FT+ IV++ K+ +LF+ QGGPII+AQIENEYG++ S +GD GK Y+ W A M
Sbjct: 151 PFKVAMQGFTSKIVNLMKENELFSWQGGPIIMAQIENEYGDIESQFGDGGKRYVQWAADM 210
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT-----------PNNPNSPKIWTENWTGWFKSWGGKD 257
A SLD VPWIMC++ DAP+ + PN P +WTE+W GWF++WG
Sbjct: 211 ALSLDTRVPWIMCKQEDAPANIINTCNGFYCDGWKPNTALKPILWTEDWNGWFQNWGQAA 270
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R ED AFAVARFFQ GG+FQNYYMY GGTNF RT+GGP++TT+YDYDAPIDEYG +
Sbjct: 271 PHRPVEDNAFAVARFFQRGGSFQNYYMYFGGTNFARTAGGPFMTTTYDYDAPIDEYGLIR 330
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGN--------------------------VTNTDYGNS 351
QPKWGHL++LH +K E LT + + N D NS
Sbjct: 331 QPKWGHLKDLHAAIKLCEPALTAVDTVPQSTWIGSNQEAHEYSANGHCAAFLANIDSENS 390
Query: 352 VS----GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV---KVKRPNQAGNDQAP---- 400
V+ G SY LPAWSVSILPDCK FNTA++ QT V ++ N G+ P
Sbjct: 391 VTVQFQGESYVLPAWSVSILPDCKNVAFNTAQIGAQTTVTRMRIAPSNSRGDIFLPSNTL 450
Query: 401 -------------LQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNAD 446
L+W+ E F +RG G N+L++Q + T D SDYLWY T+
Sbjct: 451 VHDHISDGGVFANLKWQASAE---PFGIRGSGTTVSNSLLEQLNITKDTSDYLWYSTSIT 507
Query: 447 LKDDDPILSGS-SNMTLRINSSGQVLHAYVNGNYVDSQ--WTKYGASNDLFERPVKLTRG 503
+ + S + L + + +H +VNG S W N +P+ L G
Sbjct: 508 ITSEGVTSDVSGTEANLVLGTMRDAVHIFVNGKLAGSAMGW------NIQVVQPITLKDG 561
Query: 504 KNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGL 563
KN I LLS T+GLQNYG+ + GI G V + G LS+ +W+Y+VGL G
Sbjct: 562 KNSIDLLSMTLGLQNYGAYLETWGAGIRGSVSVTGLPYGNL---SLSTAEWSYQVGLRGE 618
Query: 564 DDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNG 623
+ K F+N A W S + +TWYKTTF+AP DPV L+L MGKG AW+NG
Sbjct: 619 ELKLFHNGTADGFS--WDSSSFTNASYLTWYKTTFDAPGGTDPVALDLGSMGKGQAWING 676
Query: 624 YNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIW-------YHVPRSWIK 676
++LGRY+ +A + GC E+CDYRG Y ++KC NCG PSQ W YH+PR+W++
Sbjct: 677 HHLGRYF-LMVAPQSGC--ETCDYRGAYNTNKCRTNCGEPSQRWQVIHFQMYHIPRAWLQ 733
Query: 677 DGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHENK--------------------TME 716
N LVLFEE GG+ S+++ T C +E++ M
Sbjct: 734 ATGNLLVLFEEIGGDISKVSVVTRSAHAVCAHINESQPPPIRTWRPHRSIDAFNNPAEML 793
Query: 717 LTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEAN 775
L C G+ I++IK+ASFG+P+G+CG F+ G+C A + + K C+GK+ C I
Sbjct: 794 LECAAGQHITKIKFASFGNPRGSCGHFQHGTCHAN-KSMEAVRKVCIGKQQCYIPVQRKF 852
Query: 776 LGATSCAAGTVKRLVVEALC 795
G+ G K L V+ C
Sbjct: 853 FGSIDPCPGVSKSLAVQVHC 872
>gi|218188525|gb|EEC70952.1| hypothetical protein OsI_02561 [Oryza sativa Indica Group]
Length = 822
Score = 685 bits (1768), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/826 (45%), Positives = 488/826 (59%), Gaps = 86/826 (10%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
+++D +A+ ++G+R+IL+SGSIHYPRSTP MWPDLI+KAK+GGLD ++TYVFWN HEP
Sbjct: 23 LTYDRKAVVVNGQRRILISGSIHYPRSTPEMWPDLIEKAKDGGLDVVQTYVFWNGHEPSP 82
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QY F G DL+ FIK ++ GLYV LRIGPYVCAEWN+GGFPVWL +PGI RT N+
Sbjct: 83 GQYYFEGRYDLVHFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGI-SFRTDNE 141
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F EMQ FTT IV+M K E LF QGGPIIL+QIENE+G + D G+ K+Y +W A M
Sbjct: 142 PFKAEMQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANM 201
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A +L+ GVPWIMC+E DAP P+ F+PN P+ P +WTE WT W+ +G
Sbjct: 202 AVALNTGVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTAWYTGFGIPV 261
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R EDLA+ VA+F Q GG+F NYYM+HGGTNFGRT+GGP++ TSYDYDAPIDEYG L
Sbjct: 262 PHRPVEDLAYGVAKFIQKGGSFVNYYMFHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLR 321
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------------------------- 350
+PKWGHL++LHK +K E L G+ T GN
Sbjct: 322 EPKWGHLKQLHKAIKLCEPALVAGDPIVTSLGNAQKSSVFRSSTGACAAFLDNKDKVSYA 381
Query: 351 --SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
+ +G Y+LP WS+SILPDCKT FNTA+V +Q + + AG W+ E
Sbjct: 382 RVAFNGMHYDLPPWSISILPDCKTTVFNTARVGSQ--ISQMKMEWAGG----FAWQSYNE 435
Query: 409 MINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
IN F G+ F L++Q T D +DYLWY T D+ DD LS N L +
Sbjct: 436 EINSF---GEDPFTTVGLLEQINVTRDNTDYLWYTTYVDVAQDDQFLSNGENPKLTV--- 489
Query: 468 GQVLHAYVNGNYVDSQWTKYGASND---LFERPVKLTRGKNQISLLSATVGLQNYGSKFD 524
+ +N + T YG+ +D + VKL G N IS LS VGL N G F+
Sbjct: 490 --MCFLILNILFNLLAGTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIAVGLPNVGEHFE 547
Query: 525 MVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
GI GPV L G +DL+ KWTY+VGL G + ++ E G +
Sbjct: 548 TWNAGILGPVTLDGLNEGR---RDLTWQKWTYQVGLKGESMSLHSLSGSSTVEWGEPVQK 604
Query: 585 VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTES 644
PL TWYK F AP ++P+ L++ MGKG W+NG +GRYWP Y A + C T
Sbjct: 605 QPL----TWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKASGN-CGT-- 657
Query: 645 CDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGT 704
CDYRG Y KC NCG+ SQ WYHVPRSW+ N LV+FEE+GG+P+ I+ +G+
Sbjct: 658 CDYRGEYDETKCQTNCGDSSQRWYHVPRSWLSPTGNLLVIFEEWGGDPTGISMVKRSIGS 717
Query: 705 ACGQA--------------HENKTMELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEA 749
C +E + L C +G++I+EIK+ASFG PQG+CG++ +G C A
Sbjct: 718 VCADVSEWQPSMKNWHTKDYEKAKVHLQCDNGQKITEIKFASFGTPQGSCGSYSEGGCHA 777
Query: 750 EIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+ K CVG++ C + G C GT+KR VVEA+C
Sbjct: 778 H-KSYDIFWKNCVGQERCGVSVVPEIFGGDPC-PGTMKRAVVEAIC 821
>gi|114217395|dbj|BAF31233.1| beta-D-galactosidase [Persea americana]
Length = 849
Score = 684 bits (1765), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/830 (44%), Positives = 483/830 (58%), Gaps = 83/830 (10%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D +AI I+G+R+IL+SGSIHYPRSTP MWPDLI+KAK+GGLD I+TYVFWN HEP
Sbjct: 39 VSYDHKAIIINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPSP 98
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y F G DL++FIK +++ GLYV LRIGPY CAEWN+GGFPVWL +PGI RT N+
Sbjct: 99 GEYYFEGRYDLVKFIKLVKEAGLYVHLRIGPYACAEWNFGGFPVWLKYIPGI-SFRTDNE 157
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F M FT IVDM K+E+LF +QGGPIIL+QIENEYG V + G G++Y W A M
Sbjct: 158 PFKTAMAGFTKKIVDMMKEEELFETQGGPIILSQIENEYGPVEWEIGAPGQAYTKWAANM 217
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A L GVPW+MC++ DAP P+ F+PN P +WTE WT WF ++GG
Sbjct: 218 AVGLGTGVPWVMCKQDDAPDPIINTCNDHYCDWFSPNKNYKPTMWTEAWTSWFTAFGGPV 277
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R AED+AFA+A+F Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAPIDEYG +
Sbjct: 278 PYRPAEDMAFAIAKFIQRGGSFINYYMYHGGTNFGRTAGGPFVATSYDYDAPIDEYGLIR 337
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNS-------------------------- 351
QPKWGHL++LHK +K E L G+ T G+S
Sbjct: 338 QPKWGHLKDLHKAIKMCEAALVSGDPIVTSLGSSQESHVFKSESGDCAAFLANYDEKSFA 397
Query: 352 ---VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQT-NVKVKRPNQAGNDQAPLQWKWRP 407
G YNLP WS+SILPDC FNTA+V QT ++ + N G W+
Sbjct: 398 KVAFQGMHYNLPPWSISILPDCVNTVFNTARVGAQTSSMTMTSVNPDG-----FSWETYN 452
Query: 408 EMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINS 466
E + + L++Q T DV+DYLWY T+ + ++ L L + S
Sbjct: 453 EETASY---DDASITMEGLLEQINVTRDVTDYLWYTTDITIDPNEGFLKNGEYPVLTVMS 509
Query: 467 SGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMV 526
+G LH ++NG + + + VKL G N+IS+LS VGL N G+ F+
Sbjct: 510 AGHALHIFINGELSGTVYGSVDNPKLTYTGSVKLLAGNNKISVLSIAVGLPNIGAHFETW 569
Query: 527 PNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVP 586
G+ GPV+L G +DLS W+YK+GL G + ++ +S WSS +
Sbjct: 570 NTGVLGPVVLNGLNEGR---RDLSWQNWSYKIGLKG--EALQLHSLTGSSSVEWSSL-IA 623
Query: 587 LNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCD 646
+ +TWYKTTF AP N P L++ MGKG W+NG ++GRYWP Y A + C C
Sbjct: 624 QKQPLTWYKTTFNAPEGNGPFALDMSMMGKGQIWINGQSIGRYWPAYKAYGN-CG--ECS 680
Query: 647 YRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTAC 706
Y G Y KC NCG SQ WYHVP SW+ N LV+FEE+GG+P+ I+ G+AC
Sbjct: 681 YTGRYNEKKCLANCGEASQRWYHVPSSWLYPTANLLVVFEEWGGDPTGISLVRRTTGSAC 740
Query: 707 ------------------GQAHENK--TMELTC-HGRRISEIKYASFGDPQGACGAFKKG 745
G+A + L+C G++IS IK+ASFG PQG CG F +G
Sbjct: 741 AFISEWHPTLRKWHIKDYGRAERPRRPKAHLSCADGQKISSIKFASFGTPQGVCGNFTEG 800
Query: 746 SCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
SC A + EK CVG++ CS+ S G C +K L VEA+C
Sbjct: 801 SCHAH-KSYDIFEKNCVGQQWCSVTISPDVFGGDPC-PNVMKNLAVEAIC 848
>gi|449491392|ref|XP_004158882.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 854
Score = 683 bits (1763), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/858 (43%), Positives = 490/858 (57%), Gaps = 80/858 (9%)
Query: 1 MATLKHCSRAILLCLILQTLFNLSLAY-RVSHDGRAITIDGERKILLSGSIHYPRSTPGM 59
MAT S+ +L L L L + V++D +AI I+G+R++L SGSIHYPRSTP M
Sbjct: 1 MAT-NSVSKLSMLVLGLFWLLGVQFVQCSVTYDRKAILINGQRRVLFSGSIHYPRSTPEM 59
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W LI+KAKEGGLD +ETYVFWN HEP Y+F G DL RFIKTIQ GLY LRIGP
Sbjct: 60 WEGLIQKAKEGGLDVVETYVFWNVHEPSPGNYNFEGRYDLARFIKTIQKAGLYANLRIGP 119
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
YVCAEWN+GGFPVWL +PGI RT N+ F MQ FT IV + K E LF SQGGPII
Sbjct: 120 YVCAEWNFGGFPVWLKYVPGIS-FRTDNEPFKRAMQGFTEKIVGLMKSENLFESQGGPII 178
Query: 180 LAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM--------- 230
L+QIENEYG +G AG++Y+ W AKMA L GVPW+MC+E DAP P+
Sbjct: 179 LSQIENEYGVQSKLFGAAGQNYMTWAAKMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYC 238
Query: 231 --FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGG 288
F+PN P P +WTE W+GWF +GG +R +DLAFAVARF Q GG+F NYYMYHGG
Sbjct: 239 DAFSPNRPYKPTMWTEAWSGWFNEFGGPIHQRPVQDLAFAVARFIQKGGSFINYYMYHGG 298
Query: 289 TNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDY 348
TNFGRT+GGP++TTSYDYDAPIDEYG + QPK+GHL+ELH+ +K EK L + T
Sbjct: 299 TNFGRTAGGPFITTSYDYDAPIDEYGLIRQPKYGHLKELHRAVKMCEKALVSADPIVTSL 358
Query: 349 GNS-----------------------------VSGSSYNLPAWSVSILPDCKTEEFNTAK 379
G+S + YNLP WS+SILPDC+ FNTAK
Sbjct: 359 GSSQQAYVYTSESGNCAAFLSNYDTDSAARVMFNNMHYNLPPWSISILPDCRNVVFNTAK 418
Query: 380 VNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDY 438
V QT+ P + L W+ E ++ + L++Q T D SDY
Sbjct: 419 VGVQTSQLEMLPT----NSPMLLWESYNEDVS--AEDDSTTMTASGLLEQINVTKDTSDY 472
Query: 439 LWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPV 498
LWY+T+ D+ + L G TL + S+G +H ++NG S + + V
Sbjct: 473 LWYITSVDIGSTESFLHGGELPTLIVQSTGHAVHIFINGRLSGSAFGSRENRRFTYTGKV 532
Query: 499 KLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKV 558
G+N I+LLS VGL N G F+ GI GPV L G D+ + DLS KWTYKV
Sbjct: 533 NFRAGRNTIALLSVAVGLPNVGGHFETWNTGILGPVALHGL--DQGKL-DLSWAKWTYKV 589
Query: 559 GLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGF 618
GL G ++ E S + +TW+K+ F+AP ++P+ ++++GMGKG
Sbjct: 590 GLKGEAMNLVSPNGISSVEWMEGSLAAQAPQPLTWHKSNFDAPEGDEPLAIDMRGMGKGQ 649
Query: 619 AWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDG 678
W+NG ++GRYW Y + + C+Y G + KC CG P+Q WYHVPR+W+K
Sbjct: 650 IWINGVSIGRYWTAYATG----NCDKCNYAGTFRPPKCQQGCGQPTQRWYHVPRAWLKPK 705
Query: 679 VNTLVLFEEFGGNPSQINFQTVVVGTACGQAHE--------------------NKTMELT 718
N LV+FEE GGNP+ I+ V C E + L
Sbjct: 706 DNLLVVFEELGGNPTSISLVKRSVTGVCADVSEYHPTLKNWHIESYGKSEDLHRPKVHLK 765
Query: 719 CH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLG 777
C G I+ IK+ASFG P G CG++++G+C A + ++EK+C+GK+ C++ S N G
Sbjct: 766 CSAGYSITSIKFASFGTPLGTCGSYQQGTCHAPMS-YDILEKRCIGKQRCAVTISNTNFG 824
Query: 778 ATSCAAGTVKRLVVEALC 795
C +KRL VE +C
Sbjct: 825 QDPC-PNVLKRLSVEVVC 841
>gi|326503960|dbj|BAK02766.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 845
Score = 683 bits (1763), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/831 (43%), Positives = 487/831 (58%), Gaps = 81/831 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D R++ I G R++L+S SIHYPRS P MWP L+ +AKEGG D IETYVFWN HE
Sbjct: 31 VTYDHRSLVISGRRRLLISASIHYPRSVPAMWPKLVAEAKEGGADCIETYVFWNGHETAP 90
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y F DL++F + ++D GL+++LRIGP+V AEWN+GG P WLH +PG RT N+
Sbjct: 91 GKYYFEDRFDLVQFARVVKDAGLFLMLRIGPFVAAEWNFGGVPAWLHYIPGTV-FRTNNE 149
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F + M++FTT IVDM K+++ FASQGG IILAQIENEYG YG GK+Y W M
Sbjct: 150 PFKSHMKSFTTKIVDMMKEQRFFASQGGHIILAQIENEYGYYQQAYGAGGKAYAMWAGSM 209
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A + + GVPWIMCQ+ D P + F PN+P PKIWTENW GWF+++G +
Sbjct: 210 AQAQNTGVPWIMCQQYDVPDRVINTCNSFYCDQFKPNSPTQPKIWTENWPGWFQTFGESN 269
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R ED+AF+VARFF GG+ QNYY+YHGGTNF RT+GGP++TTSYDYDAPIDEYG
Sbjct: 270 PHRPPEDVAFSVARFFGKGGSVQNYYVYHGGTNFDRTAGGPFITTSYDYDAPIDEYGLRR 329
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVT------------NTDYGN--------------- 350
PKW HL+ELH+ +K E +L +GN T TD+
Sbjct: 330 LPKWAHLKELHQSIKLCEHSLLFGNSTLLSLGPQQEADVYTDHSGGCVAFLANIDSEKDR 389
Query: 351 --SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQA--PLQWKWR 406
+ Y+LPAWSVSILPDCK FNTAKV +QT + P G QA P QW
Sbjct: 390 VVTFRNRQYDLPAWSVSILPDCKNVVFNTAKVRSQTLMVDMVP---GTLQASKPDQWSIF 446
Query: 407 PEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
E I V K F N +D +T D +DYLW+ T+ D+ + P S ++ L I+
Sbjct: 447 TERIG---VWDKNDFVRNEFVDHINTTKDSTDYLWHTTSFDVDRNYP--SSGNHPVLNID 501
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
S G +HA++N + S + S+ P+ L GKN+I++LS TVGL++ G ++
Sbjct: 502 SKGHAVHAFLNNMLIGSAYGNGSESSFSAHMPINLKAGKNEIAILSMTVGLKSAGPYYEW 561
Query: 526 VPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNV 585
V G+ V +G + DLSS+ W YKVGL G + + N++R
Sbjct: 562 VGAGLTS----VNISGMKNGTTDLSSNNWAYKVGLEG-EHYGLFKHDQGNNQRWRPQSQP 616
Query: 586 PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESC 645
P ++ +TWYK + P +DPV L++Q MGKG W+NG +GRYWP D C+T SC
Sbjct: 617 PKHQPLTWYKVNVDVPQGDDPVGLDMQSMGKGLVWLNGNAIGRYWPRTSPTNDRCTT-SC 675
Query: 646 DYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTA 705
DYRG + +KC CG P+Q WYHVPRSW NTLV+FEE GG+P++I F V +
Sbjct: 676 DYRGKFSPNKCRVGCGKPTQRWYHVPRSWFHPSGNTLVVFEEQGGDPTKITFSRRVATSV 735
Query: 706 CGQAHEN--------------------KTMELTC-HGRRISEIKYASFGDPQGACGAFKK 744
C EN ++L+C G+ IS +K+ASFGDP G C ++++
Sbjct: 736 CSFVSENYPSIDLESWDKSISDDGRVAAKVQLSCPKGKNISSVKFASFGDPSGTCRSYQQ 795
Query: 745 GSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
GSC D + ++EK C+ SC++ S+ G C G K L +EA C
Sbjct: 796 GSCH-HPDSVSVVEKACMNMNSCTVSLSDEGFGEDPC-PGVTKTLAIEADC 844
>gi|115450935|ref|NP_001049068.1| Os03g0165400 [Oryza sativa Japonica Group]
gi|122247496|sp|Q10RB4.1|BGAL5_ORYSJ RecName: Full=Beta-galactosidase 5; Short=Lactase 5; Flags:
Precursor
gi|108706354|gb|ABF94149.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113547539|dbj|BAF10982.1| Os03g0165400 [Oryza sativa Japonica Group]
gi|215717073|dbj|BAG95436.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 841
Score = 683 bits (1762), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/831 (43%), Positives = 489/831 (58%), Gaps = 84/831 (10%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D +A+ +DG+R+IL SGSIHYPRSTP MW LI+KAK+GGLD I+TYVFWN HEP
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y+F G DL+RFIKT+Q G++V LRIGPY+C EWN+GGFPVWL +PGI RT N+
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGI-SFRTDNE 145
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F N MQ FT IV M K E LFASQGGPIIL+QIENEYG ++G AGK+YINW AKM
Sbjct: 146 PFKNAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGKAYINWAAKM 205
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A LD GVPW+MC+E DAP P+ F+PN P P +WTE W+GWF +GG
Sbjct: 206 AVGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSGWFTEFGGTI 265
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
+R EDLAF VARF Q GG+F NYYMYHGGTNFGRT+GGP++TTSYDYDAP+DEYG
Sbjct: 266 RQRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLAR 325
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------SVSGSS------------- 356
+PK+GHL+ELH+ +K E+ L + T T G+ S SG +
Sbjct: 326 EPKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVFRSSSGCAAFLANYNSNSYAK 385
Query: 357 -------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEM 409
Y+LP WS+SILPDCK FNTA V QTN + + + + W+ E
Sbjct: 386 VIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTN----QMQMWADGASSMMWEKYDEE 441
Query: 410 INDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSG 468
++ L++Q T D SDYLWY+T+ ++ + L G + ++L + S+G
Sbjct: 442 VDSLAA--APLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPSEKFLQGGTPLSLTVQSAG 499
Query: 469 QVLHAYVNGNYVDSQWTKYGASNDL---FERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
LH ++NG Q + YG D + L G N+++LLS GL N G ++
Sbjct: 500 HALHVFINGQL---QGSAYGTREDRKISYSGNANLRAGTNKVALLSVACGLPNVGVHYET 556
Query: 526 VPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNV 585
G+ GPV++ G DE +DL+ W+Y+VGL G + + + + S V
Sbjct: 557 WNTGVVGPVVIHGL--DEG-SRDLTWQTWSYQVGLKG-EQMNLNSLEGSGSVEWMQGSLV 612
Query: 586 PLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTES 644
N++ + WY+ F+ P ++P+ L++ MGKG W+NG ++GRYW Y AE D +
Sbjct: 613 AQNQQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAY-AEGD---CKG 668
Query: 645 CDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGT 704
C Y G Y + KC CG P+Q WYHVPRSW++ N LV+FEE GG+ S+I V
Sbjct: 669 CHYTGSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALAKRTVSG 728
Query: 705 ACGQAHE-------------------NKTMELTCH-GRRISEIKYASFGDPQGACGAFKK 744
C E + L C G+ IS IK+ASFG P G CG F++
Sbjct: 729 VCADVSEYHPNIKNWQIESYGEPEFHTAKVHLKCAPGQTISAIKFASFGTPLGTCGTFQQ 788
Query: 745 GSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G C + I+ ++EK+C+G + C + S +N G C +KR+ VEA+C
Sbjct: 789 GECHS-INSNSVLEKKCIGLQRCVVAISPSNFGGDPCPE-VMKRVAVEAVC 837
>gi|224129140|ref|XP_002328900.1| predicted protein [Populus trichocarpa]
gi|222839330|gb|EEE77667.1| predicted protein [Populus trichocarpa]
Length = 891
Score = 682 bits (1761), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/882 (43%), Positives = 502/882 (56%), Gaps = 121/882 (13%)
Query: 15 LILQ-TLF--NLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGG 71
LI+Q TL N + V++D RA+ IDG R+IL S IHYPR+TP MWPDLI K+KEGG
Sbjct: 19 LIIQFTLISSNFFEPFNVTYDHRALIIDGRRRILNSAGIHYPRATPEMWPDLIAKSKEGG 78
Query: 72 LDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFP 131
D ++TYVFW HEP++ QY F G DL++F+K + + GLY+ LRIGPYVCAEWN+GGFP
Sbjct: 79 ADVVQTYVFWGGHEPVKGQYYFEGRYDLVKFVKLVGESGLYLHLRIGPYVCAEWNFGGFP 138
Query: 132 VWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVM 191
VWL ++PG+ RT N F EMQ F T IVD+ ++E L + QGGPII+ QIENEYGN+
Sbjct: 139 VWLRDVPGVV-FRTDNAPFKEEMQKFVTKIVDLMREEMLLSWQGGPIIMFQIENEYGNIE 197
Query: 192 SDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPK 240
+G GK Y+ W A MA +LD GVPW+MC+++DAP + F PN+P P
Sbjct: 198 HSFGQGGKEYMKWAAGMALALDAGVPWVMCKQTDAPENIIDACNGYYCDGFKPNSPKKPI 257
Query: 241 IWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL 300
WTE+W GW+ +WGG+ P R EDLAFAVARFFQ GG+FQNYYMY GGTNFGRTSGGP+
Sbjct: 258 FWTEDWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFQNYYMYFGGTNFGRTSGGPFY 317
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTN----------TDYGN 350
TSYDYDAPIDEYG L++PKWGHL++LH +K E L + YG
Sbjct: 318 ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGPKQEAHVYGG 377
Query: 351 SVS---------------------------------GSSYNLPAWSVSILPDCKTEEFNT 377
S+S G S+ LP WSVSILPDC+ FNT
Sbjct: 378 SLSIQGMNFSQYGSQSKCSAFLANIDERQAATVRFLGQSFTLPPWSVSILPDCRNTVFNT 437
Query: 378 AKVNTQTNVK-----VKRPNQAGNDQAPLQWKWRPEMINDFVVR------GKGHFALNTL 426
AKV QT++K + N + Q +Q + P+ + + + + +F + +
Sbjct: 438 AKVAAQTHIKTVEFVLPLSNSSLLPQFIVQNEDSPQSTSWLIAKEPITLWSEENFTVKGI 497
Query: 427 IDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMT--LRINSSGQVLHAYVNGNYVDS- 482
++ T D SDYLWY T + DDD + ++ + I+S VL ++NG S
Sbjct: 498 LEHLNVTKDESDYLWYFTRIYVSDDDIAFWEKNKVSPAVSIDSMRDVLRVFINGQLTGSV 557
Query: 483 --QWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVG-R 539
W K +PV+ +G N++ LLS TVGLQNYG+ + G G + L G +
Sbjct: 558 VGHWVKA-------VQPVQFQKGYNELVLLSQTVGLQNYGAFLERDGAGFKGQIKLTGFK 610
Query: 540 AGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLN-RRMTWYKTTF 598
GD DLS+ WTY+VGL G K + + N + WS V TWYKT F
Sbjct: 611 NGD----IDLSNLSWTYQVGLKGEFLKVY--STGDNEKFEWSELAVDATPSTFTWYKTFF 664
Query: 599 EAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAY 658
+AP DPV L+L MGKG AWVNG+++GRYW T ++ +DGC SCDYRG Y S KC
Sbjct: 665 DAPSGVDPVALDLGSMGKGQAWVNGHHIGRYW-TVVSPKDGCG--SCDYRGAYSSGKCRT 721
Query: 659 NCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHENK----- 713
NCGNP+Q WYHVPR+W++ N LV+FEE GGNP +I+ + C Q E+
Sbjct: 722 NCGNPTQTWYHVPRAWLEASNNLLVVFEETGGNPFEISVKLRSAKVICAQVSESHYPPLR 781
Query: 714 -------------------TMELTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDV 753
M L C G +S I++AS+G P G+C F +G+C A +
Sbjct: 782 KWSRADLTGGNISRNDMTPEMHLKCQDGHIMSSIEFASYGTPNGSCQKFSRGNCHAS-NS 840
Query: 754 LPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
++ + C GK C I S A G G +K L VEA C
Sbjct: 841 SSVVTEACQGKNKCDIAISNAVFGDP--CRGVIKTLAVEARC 880
>gi|224094887|ref|XP_002310279.1| predicted protein [Populus trichocarpa]
gi|222853182|gb|EEE90729.1| predicted protein [Populus trichocarpa]
Length = 847
Score = 682 bits (1761), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/857 (43%), Positives = 489/857 (57%), Gaps = 84/857 (9%)
Query: 2 ATLKHCSRAILLCLI-LQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMW 60
+ K CS L+ + L S+ Y D +AI I+G+R+IL SGSIHYPRSTP MW
Sbjct: 5 SAYKLCSLVFLVVFLGCSELIQCSVTY----DRKAIMINGQRRILFSGSIHYPRSTPDMW 60
Query: 61 PDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPY 120
DLI+KAK+GG+D IETYVFWN HEP Y F G D++RF+KTIQ GLY LRIGPY
Sbjct: 61 EDLIQKAKDGGIDVIETYVFWNVHEPTPGNYHFEGRYDIVRFMKTIQRAGLYAHLRIGPY 120
Query: 121 VCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIIL 180
VCAEWN+GGFPVWL +PGI RT N+ F MQ FT IV + K E LF SQGGPIIL
Sbjct: 121 VCAEWNFGGFPVWLKYVPGI-SFRTDNEPFKRAMQGFTEKIVGLMKAENLFESQGGPIIL 179
Query: 181 AQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM---------- 230
+QIENEYG +G AG +Y+ W A MA GVPW+MC+E DAP P+
Sbjct: 180 SQIENEYGVQSKLFGAAGYNYMTWAANMAIQTGTGVPWVMCKEDDAPDPVINTCNGFYCD 239
Query: 231 -FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGT 289
F PN P P IWTE W+GWF +GG +R +DLAFAVA+F Q GG+F NYYM+HGGT
Sbjct: 240 SFAPNKPYKPTIWTEAWSGWFSEFGGTIHQRPVQDLAFAVAKFIQKGGSFINYYMFHGGT 299
Query: 290 NFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTL----------- 338
NFGR++GGP++TTSYDYDAPIDEYG + QPK+GHL+ELH+ +K E+ L
Sbjct: 300 NFGRSAGGPFITTSYDYDAPIDEYGLIRQPKYGHLKELHRSIKMCERALVSVDPIVTQLG 359
Query: 339 TYG--NVTNTDYGNSVS----------------GSSYNLPAWSVSILPDCKTEEFNTAKV 380
TY +V +T+ G+ + YNLP WS+SILPDC+ FNTAKV
Sbjct: 360 TYQQVHVYSTESGDCAAFLANYDTKSAARVLFNNMHYNLPPWSISILPDCRNVVFNTAKV 419
Query: 381 NTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYL 439
QT+ P W+ E I+ + F L++Q T D SDYL
Sbjct: 420 GVQTSQMEMLPT-----NGIFSWESYDEDISS--LDDSSTFTTAGLLEQINVTRDASDYL 472
Query: 440 WYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVK 499
WYMT+ D+ + L G TL I S+G +H ++NG S + + V
Sbjct: 473 WYMTSVDIGSSESFLHGGELPTLIIQSTGHAVHIFINGQLSGSAFGTRENRRFTYTGKVN 532
Query: 500 LTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVG 559
L G N+I+LLS VGL N G ++ GI GPV L G + DLS KWTY+VG
Sbjct: 533 LRPGTNRIALLSVAVGLPNVGGHYESWNTGILGPVALHGLDQGKW---DLSWQKWTYQVG 589
Query: 560 LYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFA 619
L G + E SS + +TW+K F AP ++P+ L+++GMGKG
Sbjct: 590 LKGEAMNLLSPDSVTSVEWMQSSLAAQRPQPLTWHKAYFNAPEGDEPLALDMEGMGKGQI 649
Query: 620 WVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGV 679
W+NG ++GRYW Y + + C Y G + KC CG P+Q WYHVPRSW+K
Sbjct: 650 WINGQSIGRYWTAYASG----NCNGCSYAGTFRPTKCQLGCGQPTQRWYHVPRSWLKPTN 705
Query: 680 NTLVLFEEFGGNPSQINFQTVVVGTACGQAHE--------------------NKTMELTC 719
N LV+FEE GG+PS+I+ + + C + E + + L C
Sbjct: 706 NLLVVFEELGGDPSRISLVKRSLASVCAEVSEFHPTIKNWQIESYGRAEEFHSPKVHLRC 765
Query: 720 H-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGA 778
G+ I+ IK+ASFG P G CG++++G+C A ++EK+C+GK+ C++ S +N G
Sbjct: 766 SGGQSITSIKFASFGTPLGTCGSYQQGACHASTS-YAILEKKCIGKQRCAVTISNSNFGQ 824
Query: 779 TSCAAGTVKRLVVEALC 795
C +K+L VEA+C
Sbjct: 825 DPC-PNVMKKLSVEAVC 840
>gi|326534200|dbj|BAJ89450.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 763
Score = 682 bits (1759), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/778 (47%), Positives = 465/778 (59%), Gaps = 88/778 (11%)
Query: 90 QYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKV 149
QYDF G DL+RF+K D GLYV LRIGPYVCAEWNYGGFP+WLH +PGI+ LRT N+
Sbjct: 1 QYDFEGRNDLVRFVKAAADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIK-LRTDNEP 59
Query: 150 FMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMA 209
F EMQ FT +V K L+ASQGGPIIL+QIENEYGN+ + YG AGKSYI W A MA
Sbjct: 60 FKTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIAASYGAAGKSYIRWAAGMA 119
Query: 210 TSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDP 258
+LD GVPW+MCQ++DAP P+ FTP+ P+ PK+WTENW+GWF S+GG P
Sbjct: 120 VALDTGVPWVMCQQTDAPEPLINTCNGFYCDQFTPSLPSRPKLWTENWSGWFLSFGGAVP 179
Query: 259 KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQ 318
R EDLAFAVARF+Q GGT QNYYMYHGGTNFGR+SGGP+++TSYDYDAPIDEYG + Q
Sbjct: 180 YRPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDEYGLVRQ 239
Query: 319 PKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN---------------------------- 350
PKWGHLR++HK +K E L + + G
Sbjct: 240 PKWGHLRDVHKAIKMCEPALIATDPSYMSLGQNAEAHVYKSGSLCAAFLANIDDQSDKTV 299
Query: 351 SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR----PNQAGN------DQAP 400
+ +G +Y LPAWSVSILPDCK NTA++N+Q R QA + + A
Sbjct: 300 TFNGKAYKLPAWSVSILPDCKNVVLNTAQINSQVASTQMRNLGFSTQASDGSSVEAELAA 359
Query: 401 LQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSN 459
W + E + + + L++Q +T D SD+LWY T+ + +P L+GS +
Sbjct: 360 SSWSYAVEPVG---ITKENALTKPGLMEQINTTADASDFLWYSTSIVVAGGEPYLNGSQS 416
Query: 460 MTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNY 519
L +NS G VL ++NG S +S PV L GKN+I LLSATVGL NY
Sbjct: 417 -NLPVNSLGHVLQVFINGKLAGSSKGSASSSLISLTTPVTLVTGKNKIDLLSATVGLTNY 475
Query: 520 GSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERG 579
G+ FD+V GI GPV L G G DLSS +WTY++GL G +D YN A+ E
Sbjct: 476 GAFFDLVGAGITGPVKLTGPKG----TLDLSSAEWTYQIGLRG-EDLHLYNPSEASPE-- 528
Query: 580 WSSKN-VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEED 638
W S N P N +TWYK+ F AP +DPV ++ GMGKG AWVNG ++GRYWPT +A +
Sbjct: 529 WVSDNSYPTNNPLTWYKSKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNIAPQS 588
Query: 639 GCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQ 698
C SC+YRG Y + KC CG PSQI YHVPRS+++ G N +VLFE+FGGNPS+I+F
Sbjct: 589 DC-VNSCNYRGSYSATKCLKKCGQPSQILYHVPRSFLQPGSNDIVLFEQFGGNPSKISFT 647
Query: 699 TVVVGTACGQAHENK-------------------TMELTC--HGRRISEIKYASFGDPQG 737
T + C E+ + L C G+ IS IK+ASFG P G
Sbjct: 648 TKQTESVCAHVSEDHPDQIDSWVSSQQKLQRSGPALRLECPKEGQVISSIKFASFGTPSG 707
Query: 738 ACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
CG++ G C + L + ++ CVG SCS+ S N G G K LVVEA C
Sbjct: 708 TCGSYSHGECSSS-QALAVAQEACVGVSSCSVPVSAKNFGDP--CRGVTKSLVVEAAC 762
>gi|449464526|ref|XP_004149980.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 854
Score = 681 bits (1758), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/858 (43%), Positives = 490/858 (57%), Gaps = 80/858 (9%)
Query: 1 MATLKHCSRAILLCLILQTLFNLSLAY-RVSHDGRAITIDGERKILLSGSIHYPRSTPGM 59
MAT S+ +L L L L + V++D +AI I+G+R++L SGSIHYPRSTP M
Sbjct: 1 MAT-NSVSKLSMLVLGLFWLLGVQFVQCSVTYDRKAILINGQRRVLFSGSIHYPRSTPEM 59
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W LI+KAKEGGLD +ETYVFWN HEP Y+F G DL+RFIKTIQ GLY LRIGP
Sbjct: 60 WEGLIQKAKEGGLDVVETYVFWNVHEPSPGNYNFEGRYDLVRFIKTIQKAGLYANLRIGP 119
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
YVCAEWN+GGFPVWL +PGI RT N+ F MQ FT IV + K E LF SQGGPII
Sbjct: 120 YVCAEWNFGGFPVWLKYVPGIS-FRTDNEPFKRAMQGFTEKIVGLMKSENLFESQGGPII 178
Query: 180 LAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM--------- 230
L+QIENEYG +G AG++Y+ W AKMA L GVPW+MC+E DAP P+
Sbjct: 179 LSQIENEYGVQSKLFGAAGQNYMTWAAKMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYC 238
Query: 231 --FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGG 288
F+PN P P +WTE W+GWF +GG +R +DLAFAVA F Q GG+F NYYMYHGG
Sbjct: 239 DAFSPNRPYKPTMWTEAWSGWFNEFGGPIHQRPVQDLAFAVALFIQKGGSFINYYMYHGG 298
Query: 289 TNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDY 348
TNFGRT+GGP++TTSYDYDAPIDEYG + QPK+GHL+ELH+ +K EK L + T
Sbjct: 299 TNFGRTAGGPFITTSYDYDAPIDEYGLIRQPKYGHLKELHRAVKMCEKALVSADPIVTSL 358
Query: 349 GNS-----------------------------VSGSSYNLPAWSVSILPDCKTEEFNTAK 379
G+S + YNLP WS+SILPDC+ FNTAK
Sbjct: 359 GSSQQAYVYTSESGNCAAFLSNYDTDSAARVMFNNMHYNLPPWSISILPDCRNVVFNTAK 418
Query: 380 VNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDY 438
V QT+ P + L W+ E ++ + L++Q T D SDY
Sbjct: 419 VGVQTSQLEMLPT----NSPMLLWESYNEDVS--AEDDSTTMTASGLLEQINVTKDTSDY 472
Query: 439 LWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPV 498
LWY+T+ D+ + L G TL + S+G +H ++NG S + + V
Sbjct: 473 LWYITSVDIGSTESFLHGGELPTLIVQSTGHAVHIFINGRLSGSAFGSRENRRFTYTGKV 532
Query: 499 KLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKV 558
G+N I+LLS VGL N G F+ GI GPV L G D+ + DLS KWTYKV
Sbjct: 533 NFRAGRNTIALLSVAVGLPNVGGHFETWNTGILGPVALHGL--DQGKL-DLSWAKWTYKV 589
Query: 559 GLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGF 618
GL G ++ E S + +TW+K+ F+AP ++P+ ++++GMGKG
Sbjct: 590 GLKGEAMNLVSPNGISSVEWMEGSLAAQAPQPLTWHKSNFDAPEGDEPLAIDMRGMGKGQ 649
Query: 619 AWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDG 678
W+NG ++GRYW Y + + C+Y G + KC CG P+Q WYHVPR+W+K
Sbjct: 650 IWINGVSIGRYWTAYATG----NCDKCNYAGTFRPPKCQQGCGQPTQRWYHVPRAWLKPK 705
Query: 679 VNTLVLFEEFGGNPSQINFQTVVVGTACGQAHE--------------------NKTMELT 718
N LV+FEE GGNP+ I+ V C E + L
Sbjct: 706 DNLLVVFEELGGNPTSISLVKRSVTGVCADVSEYHPTLKNWHIESYGKSEDLHRPKVHLK 765
Query: 719 CH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLG 777
C G I+ IK+ASFG P G CG++++G+C A + ++EK+C+GK+ C++ S N G
Sbjct: 766 CSAGYSITSIKFASFGTPLGTCGSYQQGTCHAPMS-YDILEKRCIGKQRCAVTISNTNFG 824
Query: 778 ATSCAAGTVKRLVVEALC 795
C +KRL VE +C
Sbjct: 825 QDPC-PNVLKRLSVEVVC 841
>gi|357153898|ref|XP_003576603.1| PREDICTED: beta-galactosidase 15-like [Brachypodium distachyon]
Length = 908
Score = 681 bits (1757), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/865 (43%), Positives = 487/865 (56%), Gaps = 115/865 (13%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ VS+D RA+ + GER++L+S +HYPR+TP MWP +I K KEGG D IETY+FWN HEP
Sbjct: 50 FNVSYDHRAVRVGGERRMLVSAGVHYPRATPEMWPSIIAKCKEGGADVIETYIFWNGHEP 109
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+ QY F DL+RFIK + +GL++ LRIGPY CAEWN+GGFPVWL ++PGIE RT
Sbjct: 110 AKGQYYFEERFDLVRFIKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIE-FRTD 168
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
N+ + EMQ F T IVDM K EKL++ QGGPIIL QIENEYGN+ YG AGK Y+ W A
Sbjct: 169 NEPYKAEMQTFVTKIVDMMKDEKLYSWQGGPIILQQIENEYGNIQGKYGQAGKRYMQWAA 228
Query: 207 KMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGG 255
+MA LD G+PW+MC+++DAP + F PN+ N P IWTE+W GW+ WGG
Sbjct: 229 QMALGLDTGIPWVMCRQTDAPEQILDTCNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGG 288
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
P R AED AFAVARF+Q GG+ QNYYMY GGTNF RT+GGP TSYDYDAPI+EYG
Sbjct: 289 PLPHRPAEDSAFAVARFYQRGGSLQNYYMYFGGTNFARTAGGPLQITSYDYDAPINEYGM 348
Query: 316 LNQPKWGHLRELHKLLKSMEKTL-------------------------------TYGN-- 342
L QPKWGHL++LH +K E L T GN
Sbjct: 349 LRQPKWGHLKDLHTAIKLCEPALIAVDGSPQYVKLGSMQEAHIYSSAKVHTNGSTAGNAQ 408
Query: 343 -----VTNTDYGNSVS----GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV------- 386
+ N D VS G SYNLP WSVSILPDC+ FNTA+V QT+V
Sbjct: 409 ICSAFLANIDEHKYVSVWIFGKSYNLPPWSVSILPDCENVAFNTARVGAQTSVFTFESGS 468
Query: 387 ---------KVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVS 436
V P G+ + W + E I + G G FA +++ T D+S
Sbjct: 469 PSHSSRREPSVLLPGVRGSYLSSTWWTSK-ETIGTW---GDGSFATQGILEHLNVTKDIS 524
Query: 437 DYLWYMTNADLKDDDPILSGSSNM--TLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLF 494
DYLWY T+ ++ D+D S + +L I+ V +VNG SQ + +
Sbjct: 525 DYLWYTTSVNISDEDVAFWSSKGVLPSLIIDQIRDVARVFVNGKLAGSQVGHWVS----L 580
Query: 495 ERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKW 554
++P++ RG N+++LLS VGLQNYG+ + G G V L G + +T DL++ W
Sbjct: 581 KQPIQFVRGLNELTLLSEIVGLQNYGAFLEKDGAGFKGQVKLTGLSNGDT---DLTNSAW 637
Query: 555 TYKVGLYGLDDKKFYNAKAANSERGWSSKNVP-LNRRMTWYKTTFEAPLENDPVVLNLQG 613
TY+VGL G + K +E WS+ + TWYKT +AP DPV ++L
Sbjct: 638 TYQVGLKGEFSMIYTPEKQECAE--WSAMQTDNIQSPFTWYKTMVDAPEGTDPVAIDLGS 695
Query: 614 MGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRS 673
MGKG AWVNG +GRYW + +A E GC + SC+Y G Y KC NCG P+Q WYH+PR
Sbjct: 696 MGKGQAWVNGRLIGRYW-SLVAPESGCPS-SCNYPGAYSETKCQSNCGMPTQSWYHIPRE 753
Query: 674 WIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHEN--------------------- 712
W+++ N LVLFEE GG+PS+I+ + T C + EN
Sbjct: 754 WLQESNNLLVLFEETGGDPSKISLEVHYTKTICSRISENYYPPLSAWSWLDTGRVSVDSV 813
Query: 713 -KTMELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIE 770
+ L C G IS I +AS+G P G C F KG C A L + + CVGK C+I
Sbjct: 814 APELLLRCDDGYEISRITFASYGTPSGGCQNFSKGKCHAA-STLDFVTEACVGKNKCAIS 872
Query: 771 ASEANLGATSCAAGTVKRLVVEALC 795
S G G +K L VEA C
Sbjct: 873 VSNDVFGDP--CRGVLKDLAVEAEC 895
>gi|350537913|ref|NP_001234317.1| TBG6 protein precursor [Solanum lycopersicum]
gi|7939625|gb|AAF70825.1|AF154424_1 putative beta-galactosidase [Solanum lycopersicum]
Length = 845
Score = 680 bits (1755), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/856 (43%), Positives = 490/856 (57%), Gaps = 95/856 (11%)
Query: 10 AILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
+L C++L L + V++D +AI I+G+R++L SGSIHYPRSTP MW DLI KAKE
Sbjct: 10 VLLWCIVLFISSGL-VHCDVTYDRKAIVINGQRRLLFSGSIHYPRSTPEMWEDLINKAKE 68
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
GGLD +ETYVFWN HEP Y+F G DL+RF+KTIQ GLY LRIGPYVCAEWN+GG
Sbjct: 69 GGLDVVETYVFWNVHEPSPGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGG 128
Query: 130 FPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGN 189
FPVWL +PGI R N+ F N M+ + IV++ K LF SQGGPIIL+QIENEYG
Sbjct: 129 FPVWLKYVPGIS-FRADNEPFKNAMKGYAEKIVNLMKSHNLFESQGGPIILSQIENEYGP 187
Query: 190 VMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNS 238
G G Y W A MA LD GVPW+MC+E DAP P+ F PN P
Sbjct: 188 QAKVLGAPGHQYSTWAANMAVGLDTGVPWVMCKEEDAPDPVINTCNGFYCDNFFPNKPYK 247
Query: 239 PKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGP 298
P IWTE W+GWF +GG +R +DLAFAVA+F Q GG+F NYYMYHGGTNFGRT+GGP
Sbjct: 248 PAIWTEAWSGWFSEFGGPLHQRPVQDLAFAVAQFIQRGGSFVNYYMYHGGTNFGRTAGGP 307
Query: 299 YLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN-------- 350
++TTSYDYDAPIDEYG + QPK+GHL+ELH+ +K EK++ + T GN
Sbjct: 308 FITTSYDYDAPIDEYGLIRQPKYGHLKELHRAVKMCEKSIVSADPAITSLGNLQQAYVYS 367
Query: 351 SVSGSS---------------------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVK 389
S +G YNLP WS+SILPDC+ FNTAKV QT+
Sbjct: 368 SETGGCAAFLSNNDWKSAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSKMEM 427
Query: 390 RPNQAGNDQAPLQWKWRPEMINDF----VVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTN 444
P + L W+ E I+ +R G L++Q T D SDYLWY+T+
Sbjct: 428 LPT----NSEMLSWETYSEDISALDDSSSIRSFG------LLEQINVTRDTSDYLWYITS 477
Query: 445 ADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGK 504
D+ + L G TL + ++G +H ++NG S + +F+ V L G
Sbjct: 478 VDIGSTESFLHGGELPTLIVETTGHAMHVFINGQLSGSAFGTRKNRRFVFKGKVNLRAGS 537
Query: 505 NQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLD 564
N+I+LLS VGL N G F+ G+ GPV + G + DLS KWTY+VGL G
Sbjct: 538 NRIALLSVAVGLPNIGGHFETWSTGVLGPVAIQGLDHGKW---DLSWAKWTYQVGLKG-- 592
Query: 565 DKKFYNAKAAN--SERGWSSKNVPLNRR--MTWYKTTFEAPLENDPVVLNLQGMGKGFAW 620
+ N + N S W ++ ++ +TW+K F P ++P+ L++ MGKG W
Sbjct: 593 --EAMNLVSTNGISAVDWMQGSLIAQKQQPLTWHKAYFNTPEGDEPLALDMSSMGKGQVW 650
Query: 621 VNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVN 680
+NG ++GRYW Y + C Y G + KC CG P+Q WYHVPRSW+K N
Sbjct: 651 INGQSIGRYWTAYATGD----CNGCQYSGVFRPPKCQLGCGEPTQKWYHVPRSWLKPTQN 706
Query: 681 TLVLFEEFGGNPSQINFQTVVVGTAC------------------GQAHENKTMELTCH-- 720
LVLFEE GG+P++I+ V C G+ E ++ H
Sbjct: 707 LLVLFEELGGDPTRISLVKRSVTNVCSNVAEYHPNIKNWQIENYGKTEEFHLPKVRIHCA 766
Query: 721 -GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGAT 779
G+ IS IK+ASFG P G CG+FK+G+C A D ++EK+C+G+++C++ S +N G
Sbjct: 767 PGQSISSIKFASFGTPLGTCGSFKQGTCHAP-DSHAVVEKKCLGRQTCAVTISNSNFGED 825
Query: 780 SCAAGTVKRLVVEALC 795
C +KRL VEA C
Sbjct: 826 PC-PNVLKRLSVEAHC 840
>gi|224096113|ref|XP_002310540.1| predicted protein [Populus trichocarpa]
gi|222853443|gb|EEE90990.1| predicted protein [Populus trichocarpa]
Length = 827
Score = 680 bits (1755), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/851 (44%), Positives = 484/851 (56%), Gaps = 91/851 (10%)
Query: 10 AILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
A+ L F L A VS+D R++ I+GERK+L+S +IHYPRS P MWP+L+K AKE
Sbjct: 2 ALGLIFFFSLCFTLCFAGNVSYDSRSLIINGERKLLISAAIHYPRSVPAMWPELVKTAKE 61
Query: 70 GGLDAIETYVFWNAHEPLR-RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYG 128
GG+D IETYVFWN H+P +Y F G DL++FI +Q+ G+Y+ILRIGP+V AEWN+G
Sbjct: 62 GGVDVIETYVFWNVHQPTSPSEYHFDGRFDLVKFINIVQEAGMYLILRIGPFVAAEWNFG 121
Query: 129 GFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQ--IENE 186
G PVWLH + G RT N F M+ FTT IV + KKEKLFASQGGPIIL+Q +ENE
Sbjct: 122 GIPVWLHYVNGTV-FRTDNYNFKYYMEEFTTYIVKLMKKEKLFASQGGPIILSQAKVENE 180
Query: 187 YGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNN 235
YG YG+ GK Y W A+MA S + GVPWIMCQ+ DAP + F P
Sbjct: 181 YGYYEGAYGEGGKRYAAWAAQMAVSQNTGVPWIMCQQFDAPPSVINTCNSFYCDQFKPIF 240
Query: 236 PNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTS 295
P+ PKIWTENW GWF+++G +P R AED+AF+VARFFQ GG+ QNYYMYHGGTNFGRT+
Sbjct: 241 PDKPKIWTENWPGWFQTFGAPNPHRPAEDVAFSVARFFQKGGSVQNYYMYHGGTNFGRTA 300
Query: 296 GGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNS---- 351
GGP++TTSYDY+APIDEYG PKWGHL+ELHK +K E L N G S
Sbjct: 301 GGPFITTSYDYEAPIDEYGLPRLPKWGHLKELHKAIKLCEHVLLNSKPVNLSLGPSQEAD 360
Query: 352 ----VSGS---------------------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNV 386
SG SY LPAWSVSILPDCK +NTAK
Sbjct: 361 VYADASGGCVAFLANIDDKNDKTVDFQNVSYKLPAWSVSILPDCKNVVYNTAK------- 413
Query: 387 KVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNA 445
Q +A L+W+ + + G+ F N +D +T D +DYLWY T+
Sbjct: 414 ------QKDGSKA-LKWE---VFVEKAGIWGEPDFMKNGFVDHINTTKDTTDYLWYTTSI 463
Query: 446 DLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKN 505
+ +++ L + L I S G LHA+VN S S F+ P+ L G N
Sbjct: 464 VVGENEEFLKEGRHPVLLIESMGHALHAFVNQELQGSASGNGSHSPFKFKNPISLKAGNN 523
Query: 506 QISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDD 565
+I+LLS TVGL N GS ++ V G+ V G DLS W YK+GL G +
Sbjct: 524 EIALLSMTVGLPNAGSFYEWVGAGLTS----VRIEGFNNGTVDLSHFNWIYKIGLQG-EK 578
Query: 566 KKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYN 625
Y + NS ++ P + +TWYK + P N+PV L++ MGKG AW+NG
Sbjct: 579 LGIYKPEGVNSVSWVATSEPPKKQPLTWYKVVLDPPAGNEPVGLDMLHMGKGLAWLNGEE 638
Query: 626 LGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLF 685
+GRYWP + + C TE CDYRG + DKC CG P+Q WYHVPRSW K N LV+F
Sbjct: 639 IGRYWPRKSSVHEKCVTE-CDYRGKFMPDKCFTGCGQPTQRWYHVPRSWFKPSGNLLVIF 697
Query: 686 EEFGGNPSQINFQTVVVGTACGQAHEN--------------------KTMELTC-HGRRI 724
EE GG+P +I F + + C E+ ++ L C I
Sbjct: 698 EEKGGDPEKITFSRRKMSSICALIAEDYPSADRKSLQEAGSKNSNSKASVHLGCPQNAVI 757
Query: 725 SEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAG 784
S +K+ASFG P G CG++ +G C + + + ++EK C+ K C+IE +E N C
Sbjct: 758 SAVKFASFGTPTGKCGSYSEGECH-DPNSISVVEKACLNKTECTIELTEENFNKGLCPDF 816
Query: 785 TVKRLVVEALC 795
T +RL VEA+C
Sbjct: 817 T-RRLAVEAVC 826
>gi|61162208|dbj|BAD91085.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 848
Score = 680 bits (1754), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/850 (43%), Positives = 503/850 (59%), Gaps = 84/850 (9%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
+ C++ +++ V +D +A+ IDG+R++L SGSIHYPRSTP MW LI+KAK+G
Sbjct: 13 LCCCIVWSSVYVEVTKCNVVYDRKALVIDGQRRLLFSGSIHYPRSTPEMWEGLIQKAKDG 72
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GLDAI+TYVFWN HEP Y+F G DL+RFIKT+ GLYV LRIGPY+C+EWN+GGF
Sbjct: 73 GLDAIDTYVFWNLHEPSPGNYNFEGRNDLVRFIKTVHKAGLYVHLRIGPYICSEWNFGGF 132
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
PVWL +PGI RT N+ F + MQ FT +V + K EKLF SQGGPIIL+QIENEY
Sbjct: 133 PVWLKFVPGI-SFRTDNEPFKSAMQKFTQKVVQLMKNEKLFESQGGPIILSQIENEYEPE 191
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSP 239
+G +G +Y+ W AKMA + GVPW+MC+E DAP P+ F+PN P P
Sbjct: 192 SKAFGASGYAYMTWAAKMAVGMGTGVPWVMCKEDDAPDPVINTCNGFYCDYFSPNKPYKP 251
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
+WTE W+GWF +GG +R EDL FAVARF Q GG+F NYYMYHGGTNFGRT+GGP+
Sbjct: 252 TMWTEAWSGWFTEFGGPIYQRPVEDLTFAVARFIQKGGSFINYYMYHGGTNFGRTAGGPF 311
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG--------NS 351
+TTSYDYDAPIDEYG + +PK+GHL+ELHK +K E L + T T G +S
Sbjct: 312 ITTSYDYDAPIDEYGLIRRPKYGHLKELHKAVKLCELALLNADPTVTTLGSYEQAHVFSS 371
Query: 352 VSGS---------------------SYNLPAWSVSILPDCKTEEFNTAKVNTQTN-VKVK 389
SGS +++LP WS+SILPDCK FNTA+V QT+ ++
Sbjct: 372 KSGSGAVFLSNFNTKSATKVTFNNMNFHLPPWSISILPDCKNVAFNTARVGVQTSQTQLL 431
Query: 390 RPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLK 448
R N + W E ++ V G + L+DQ + T D SDYLWY T+ D+
Sbjct: 432 RTNSELHS-----WGIFNEDVSS--VAGDTTITVTGLLDQLNITRDSSDYLWYTTSVDID 484
Query: 449 DDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQIS 508
+ L G + +L + S+G +H ++N S F V L G N+IS
Sbjct: 485 PSESFLGGGQHPSLTVQSAGDAMHVFINDQLSGSASGTREHRRFTFTGNVNLHAGLNKIS 544
Query: 509 LLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKF 568
LLS VGL N G F+ G+ GPV L G + +DLS KW+Y+VGL G +
Sbjct: 545 LLSIAVGLANNGPHFETRNTGVLGPVALHGL---DHGTRDLSWQKWSYQVGLKG--EATN 599
Query: 569 YNAKAANSERGWSSKNVPLNRR--MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNL 626
++ + S W + ++ ++ +TWYK F+ P ++P+ L++ MGKG W+NG ++
Sbjct: 600 LDSPNSISAVDWMTGSLVAQKQQPLTWYKAYFDEPNGDEPLALDMGSMGKGQVWINGQSI 659
Query: 627 GRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
GRYW Y A+ D CS +C Y G + KC + C +P+Q WYHVPRSW+K N LV+FE
Sbjct: 660 GRYWTIY-ADSD-CS--ACTYSGTFRPKKCQFGCQHPTQQWYHVPRSWLKPSKNLLVVFE 715
Query: 687 EFGGNPSQINFQTVVVGTACGQAHEN------------------KTMELTCH---GRRIS 725
E GG+ S++ V + C + EN + E++ H G IS
Sbjct: 716 EIGGDVSKVALVKKSVTSVCAEVSENHPRITNWHTESHGQTEVQQKPEISLHCTDGHSIS 775
Query: 726 EIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGT 785
IK++SFG P G+CG F+ G+C A + +++K+C+GK+ CS+ S N GA C +
Sbjct: 776 AIKFSSFGTPSGSCGKFQHGTCHAP-NSNAVLQKECLGKQKCSVTISNTNFGADPCPS-K 833
Query: 786 VKRLVVEALC 795
+K+L VEA+C
Sbjct: 834 LKKLSVEAVC 843
>gi|157313306|gb|ABV32546.1| beta-galactosidase protein 1 [Prunus persica]
Length = 836
Score = 679 bits (1752), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/829 (44%), Positives = 488/829 (58%), Gaps = 83/829 (10%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D +AI I+G+++IL+SGSIHYPRSTP MWPDLI+K+K+GGLD I+TYVFWN HEP
Sbjct: 28 VSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKSKDGGLDVIQTYVFWNGHEPSP 87
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y F DL++FIK + GLYV LRIGPYVCAEWN+GGFPVWL +PGI RT N+
Sbjct: 88 GKYYFEDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGIV-FRTDNE 146
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ FT IV M K E+LF SQGGPIIL+QIENE+G V + G GK+Y W A+M
Sbjct: 147 PFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAAQM 206
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A L+ GVPWIMC++ DAP P+ FTPN PK+WTE WTGW+ +GG
Sbjct: 207 AVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCENFTPNKNYKPKMWTEVWTGWYTEFGGAV 266
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R AEDLAF++ARF Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAP+DEYG
Sbjct: 267 PTRPAEDLAFSIARFIQKGGSFVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGLPR 326
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------------------------- 350
+PKWGHLR+LHK +KS E L + T GN
Sbjct: 327 EPKWGHLRDLHKAIKSSESALVSAEPSVTSLGNGQEAHVFKSKSGCAAFLANYDTKSSAK 386
Query: 351 -SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEM 409
S Y LP W +SILPDCKT +NTA++ +Q++ P ++ L W+
Sbjct: 387 VSFGNGQYELPPWPISILPDCKTAVYNTARLGSQSSQMKMTPVKSA-----LPWQ---SF 438
Query: 410 INDFVVRGKGH-FALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
+ + + L+ L +Q T D +DYLWYMT+ + D+ + + L I S+
Sbjct: 439 VEESASSDESDTTTLDGLWEQINVTRDTTDYLWYMTDITISPDEGFIKRGESPLLTIYSA 498
Query: 468 GQVLHAYVNGNYVDSQWTKYGASND---LFERPVKLTRGKNQISLLSATVGLQNYGSKFD 524
G LH ++NG T YGA + F + VK G N+++LLS +VGL N G F+
Sbjct: 499 GHALHVFINGQLSG---TVYGALENPKLTFSQNVKPRSGINKLALLSISVGLPNVGLHFE 555
Query: 525 MVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
G+ GPV L G + D+S KWTYK+GL G + + ++S +
Sbjct: 556 TWNAGVLGPVTLKGL---NSGTWDMSRWKWTYKIGLKG-EALGLHTVSGSSSVEWAEGPS 611
Query: 585 VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTES 644
+ + +TWYK TF AP N P+ L++ MGKG W+NG ++GR+WP Y A + C +
Sbjct: 612 MAQKQPLTWYKATFNAPPGNGPLALDMSSMGKGQIWINGQSIGRHWPAYTARGN-CG--N 668
Query: 645 CDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGT 704
C Y G Y KC +CG PSQ WYHVPRSW+ N LV+FEE+GG+P++I+ +
Sbjct: 669 CYYAGTYDDKKCRTHCGEPSQRWYHVPRSWLTPSGNLLVVFEEWGGDPTKISLVERRTSS 728
Query: 705 ACGQAHENK-----------------TMELTC-HGRRISEIKYASFGDPQGACGAFKKGS 746
C E + L C G+ IS+IK+AS+G PQG CG+F++GS
Sbjct: 729 VCADIFEGQPTLTNSQKLASGKLNRPKAHLWCPPGQVISDIKFASYGLPQGTCGSFQEGS 788
Query: 747 CEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
C A ++ C+GK+SCS+ + G C G+ K+L VEA+C
Sbjct: 789 CHAH-KSYDAPKRNCIGKQSCSVAVAPEVFGGDPC-PGSTKKLSVEAVC 835
>gi|114217397|dbj|BAF31234.1| beta-D-galactosidase [Persea americana]
Length = 849
Score = 679 bits (1752), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/851 (43%), Positives = 482/851 (56%), Gaps = 84/851 (9%)
Query: 10 AILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
+ + L L L + V++D +AI I+G+RKIL+SGSIHYPRSTP MW L++KAK+
Sbjct: 11 SFFISLFLLVLHFQLIQCSVTYDRKAIIINGQRKILISGSIHYPRSTPDMWEGLMQKAKD 70
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
GGLD I+TYVFWN HEP Y+F G DL+RF+KT+Q GLY+ LRIGPYVCAEWN+GG
Sbjct: 71 GGLDVIQTYVFWNVHEPSPGNYNFEGRYDLVRFVKTVQKAGLYMHLRIGPYVCAEWNFGG 130
Query: 130 FPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGN 189
FPVWL +PGI RT N+ F MQ FT IV M K E LF SQGGPIIL+QIENEYG+
Sbjct: 131 FPVWLKYVPGI-SFRTDNEPFKMAMQGFTEKIVQMMKSESLFESQGGPIILSQIENEYGS 189
Query: 190 VMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNS 238
G G +Y+ W AKMA L GVPW+MC+E DAP P+ FTPN P
Sbjct: 190 ESKALGAPGHAYMTWAAKMAVGLRTGVPWVMCKEDDAPDPVINTCNGFYCDAFTPNKPYK 249
Query: 239 PKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGP 298
P +WTE W+GWF +GG +R EDLAFAVARF Q GG+F NYYMYHGGTNFGRT+GGP
Sbjct: 250 PTMWTEAWSGWFTEFGGTVHERPVEDLAFAVARFIQKGGSFINYYMYHGGTNFGRTAGGP 309
Query: 299 YLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG--------- 349
++TTSYDYDAPIDEYG + QPK+GHL+ELH+ +K E L + T G
Sbjct: 310 FITTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKLCEPALISADPIVTSLGPYQQSHVFS 369
Query: 350 ---------------NSVS-----GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVK 389
NSV+ Y+LP WS+SILPDC+ FNTAKV QT+
Sbjct: 370 SGTGGCAAFLSNYNPNSVARVMFNNMHYSLPPWSISILPDCRNVVFNTAKVGVQTS---- 425
Query: 390 RPNQAGNDQAPLQWKWRPEMINDFVVRGKGHF--ALNTLIDQKSTNDVSDYLWYMTNADL 447
+ + + + L W+ E D G A+ L T D SDYLWYMT+ D+
Sbjct: 426 QMHMSAGETKLLSWEMYDE---DIASLGDNSMITAVGLLEQLNVTRDTSDYLWYMTSVDI 482
Query: 448 KDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQI 507
+ L G L + S+G LH Y+NG S F V + G N+I
Sbjct: 483 SPSESSLRGGRPPVLTVQSAGHALHVYINGQLSGSAHGSRENRRFTFTGDVNMRAGINRI 542
Query: 508 SLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKK 567
+LLS V L N G ++ G+ GPV+L G + +DL+ KW+Y+VGL G +
Sbjct: 543 ALLSIAVELPNVGLHYESTNTGVLGPVVLHGLDQGK---RDLTWQKWSYQVGLKG--EAM 597
Query: 568 FYNAKAANSERGWSSKNVPLNRR--MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYN 625
A + S W + + +TWYK F AP ++P+ L+L MGKG W+NG +
Sbjct: 598 NLVAPSGISYVEWMQASFATQKLQPLTWYKAYFNAPGGDEPLALDLGSMGKGQVWINGES 657
Query: 626 LGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLF 685
+GRYW T A D C Y G Y + KC CG P+Q WYHVPRSW++ N LV+F
Sbjct: 658 IGRYW-TAAANGD---CNHCSYAGTYRAPKCQTGCGQPTQRWYHVPRSWLQPTKNLLVIF 713
Query: 686 EEFGGNPSQINFQTVVVGTACG---------------------QAHENKTMELTCHGRRI 724
EE GG+ S I+ V + C + H K G+ I
Sbjct: 714 EEIGGDASGISLVKRSVSSVCADVSEWHPTIKNWHIESYGRSEELHRPKVHLRCAMGQSI 773
Query: 725 SEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAG 784
S IK+ASFG P G CG+F++G C + + ++EK+C+G++ C++ S N G C
Sbjct: 774 SAIKFASFGTPLGTCGSFQQGPCHSP-NSHAILEKKCIGQQRCAVTISMNNFGGDPC-PN 831
Query: 785 TVKRLVVEALC 795
+KR+ VEA+C
Sbjct: 832 VMKRVAVEAIC 842
>gi|165906266|gb|ABY71826.1| beta-galactosidase [Prunus salicina]
Length = 836
Score = 678 bits (1749), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/829 (44%), Positives = 488/829 (58%), Gaps = 83/829 (10%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D +AI I+G+++IL+SGSIHYPRSTP MWPDLI+K+K+GGLD I+TYVFWN HEP
Sbjct: 28 VSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKSKDGGLDVIQTYVFWNGHEPSP 87
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y F DL++FIK + GLYV LRIGPYVCAEWN+GGFPVWL +PGI RT N+
Sbjct: 88 GKYYFEDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGIV-FRTDNE 146
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ FT IV M K E+LF SQGGPIIL+QIENE+G V + G GK+Y W A+M
Sbjct: 147 PFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAAQM 206
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A L+ GVPWIMC++ DAP P+ FTPN PK+WTE WTGW+ +GG
Sbjct: 207 AVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCENFTPNKNYKPKMWTEVWTGWYTEFGGAV 266
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R AEDLAF++ARF Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAP+DEYG
Sbjct: 267 PTRPAEDLAFSIARFIQKGGSFVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGLPR 326
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNS-------------------------- 351
+PKWGHLR+LHK +KS E L + T GNS
Sbjct: 327 EPKWGHLRDLHKAIKSSESALVSAEPSVTSLGNSQEAHVFKSKSGCAAFLANYDTKSSAK 386
Query: 352 --VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEM 409
Y LP WS+SILPDC+T +NTA++ +Q++ P ++ L W+
Sbjct: 387 VSFGNGQYELPPWSISILPDCRTAVYNTARLGSQSSQMKMTPVKSA-----LPWQ---SF 438
Query: 410 INDFVVRGKGHFA-LNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
I + + L+ L +Q T D +DY WYMT+ + D+ + + L I S+
Sbjct: 439 IEESASSDESDTTTLDGLWEQINVTRDTTDYSWYMTDITISPDEGFIKRGESPLLTIYSA 498
Query: 468 GQVLHAYVNGNYVDSQWTKYGASND---LFERPVKLTRGKNQISLLSATVGLQNYGSKFD 524
G LH ++NG T YGA + F + VKL G N+++LLS +VGL N G F+
Sbjct: 499 GHALHVFINGQLSG---TVYGALENPKLTFSQNVKLRSGINKLALLSISVGLPNVGLHFE 555
Query: 525 MVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
G+ GPV L G + D+S KWTYKVGL G + + ++S +
Sbjct: 556 TWNAGVLGPVTLKGL---NSGTWDMSRWKWTYKVGLKG-EALGLHTVSGSSSVEWAEGPS 611
Query: 585 VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTES 644
+ + +TWY+ TF AP N P+ L++ MGKG W+NG ++GR+WP Y A + C +
Sbjct: 612 MAQKQPLTWYRATFNAPPGNGPLALDMSSMGKGQIWINGQSIGRHWPAYTARGN-CG--N 668
Query: 645 CDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGT 704
C Y G Y KC +CG PSQ WYHVPRSW+ N LV+FEE+GG+P++I+ +
Sbjct: 669 CYYAGTYDDKKCRTHCGEPSQRWYHVPRSWLTTSGNLLVVFEEWGGDPTKISLVERRTSS 728
Query: 705 ACGQAHENK-----------------TMELTC-HGRRISEIKYASFGDPQGACGAFKKGS 746
C E + L C G+ IS+IK+AS+G QG CG+F++GS
Sbjct: 729 VCADIFEGQPTLTNSQKLASGKLNRPKAHLWCPPGQVISDIKFASYGLSQGTCGSFQEGS 788
Query: 747 CEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
C A ++ C+GK+SCS+ + G C G+ K+L VEA+C
Sbjct: 789 CHAH-KSYDAPKRNCIGKQSCSVTVAPEVFGGDPC-PGSTKKLSVEAVC 835
>gi|20514290|gb|AAM22973.1|AF499737_1 beta-galactosidase [Oryza sativa Japonica Group]
gi|21070357|gb|AAM34271.1|AF508799_1 beta-galactosidase [Oryza sativa Japonica Group]
Length = 843
Score = 677 bits (1748), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/833 (43%), Positives = 488/833 (58%), Gaps = 86/833 (10%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D +A+ +DG+R+IL SGSIHYPRSTP MW LI+KAK+GGLD I+TYVFWN HEP
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y+F G DL+RFIKT+Q G++V LRIGPY+C EWN+GGFPVWL +PGI RT N+
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGI-SFRTDNE 145
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F N MQ FT IV M K E LFASQGGPIIL+QIENEYG ++G AGK+YINW AKM
Sbjct: 146 PFKNAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGKAYINWAAKM 205
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A LD GVPW+MC+E DAP P+ F+PN P P +WTE W+GWF +GG
Sbjct: 206 AVGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSGWFTEFGGTI 265
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
+R EDLAF VARF Q GG+F NYYMYHGGTNFGRT+GGP++TTSYDYDAP+DEYG
Sbjct: 266 RQRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLAR 325
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------SVSGSS------------- 356
+PK+GHL+ELH+ +K E+ L + T T G+ S SG +
Sbjct: 326 EPKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVFRSSSGCAAFLANYNSNSYAK 385
Query: 357 -------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEM 409
Y+LP WS+SILPDCK FNTA V QTN + + + + W+ E
Sbjct: 386 VIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTN----QMQMWADGASSMMWEKYDEE 441
Query: 410 INDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSG 468
++ L++Q T D SDYLWY+T ++ + L G + ++L + S+G
Sbjct: 442 VDSLAA--APLLTSTGLLEQLNVTRDTSDYLWYITRVEVDPSEKFLQGGTPLSLTVQSAG 499
Query: 469 QVLHAYVNGNYVDSQWTKYGASNDL---FERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
LH ++NG Q + YG D + L G N+++LLS GL N G ++
Sbjct: 500 HALHVFINGQL---QGSAYGTREDRKISYSGNANLRAGTNKVALLSVACGLPNVGVHYET 556
Query: 526 VPNGIPGPVLLVGRAGDETIIKDLSSHKWTY--KVGLYGLDDKKFYNAKAANSERGWSSK 583
G+ GPV++ G DE +DL+ W+Y +VGL G + + + + S
Sbjct: 557 WNTGVVGPVVIHGL--DEG-SRDLTWQTWSYQFQVGLKG-EQMNLNSLEGSGSVEWMQGS 612
Query: 584 NVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCST 642
V N++ + WY+ F+ P ++P+ L++ MGKG W+NG ++GRYW Y AE D
Sbjct: 613 LVAQNQQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAY-AEGD---C 668
Query: 643 ESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVV 702
+ C Y G Y + KC CG P+Q WYHVPRSW++ N LV+FEE GG+ S+I V
Sbjct: 669 KGCHYTGSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALAKRTV 728
Query: 703 GTACGQAHE-------------------NKTMELTCH-GRRISEIKYASFGDPQGACGAF 742
C E + L C G+ IS IK+ASFG P G CG F
Sbjct: 729 SGVCADVSEYHPNIKNWQIESYGEPEFHTAKVHLKCAPGQTISAIKFASFGTPLGTCGTF 788
Query: 743 KKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
++G C + I+ ++EK+C+G + C + S +N G C +KR+ VEA+C
Sbjct: 789 QQGECHS-INSNSVLEKKCIGLQRCVVAISPSNFGGDPCPE-VMKRVAVEAVC 839
>gi|308550954|gb|ADO34791.1| beta-galactosidase STBG6 [Solanum lycopersicum]
Length = 845
Score = 677 bits (1747), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/856 (43%), Positives = 488/856 (57%), Gaps = 95/856 (11%)
Query: 10 AILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
+L C++L L + V++D AI I+G+R++L SGSIHYPRSTP MW DLI KAKE
Sbjct: 10 VLLWCIVLFISSGL-VHCDVTYDREAIVINGQRRLLFSGSIHYPRSTPEMWEDLINKAKE 68
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
GGLD +ETYVFWN HEP Y+F G DL+RF+KTIQ GLY LRIGPYVCAEWN+GG
Sbjct: 69 GGLDVVETYVFWNVHEPSPGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGG 128
Query: 130 FPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGN 189
FPVWL +PGI R N+ F N M+ + IV++ K LF SQGGPIIL+QIENEYG
Sbjct: 129 FPVWLKYVPGIS-FRADNEPFKNAMKGYAEKIVNLMKSHNLFESQGGPIILSQIENEYGP 187
Query: 190 VMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNS 238
G G Y W A MA LD GVPW+MC+E DAP P+ F PN P
Sbjct: 188 QAKVLGAPGHQYSTWAANMAVGLDTGVPWVMCKEEDAPDPVINTCNGFYCDNFFPNKPYK 247
Query: 239 PKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGP 298
P WTE W+GWF +GG +R +DLAFAVA+F Q GG+F NYYMYHGGTNFGRT+GGP
Sbjct: 248 PATWTEAWSGWFSEFGGPLHQRPVQDLAFAVAQFIQRGGSFVNYYMYHGGTNFGRTAGGP 307
Query: 299 YLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN-------- 350
++TTSYDYDAPIDEYG + QPK+GHL+ELH+ +K EK++ + T GN
Sbjct: 308 FITTSYDYDAPIDEYGLIRQPKYGHLKELHRAVKMCEKSIVSADPAITSLGNLQQAYVYS 367
Query: 351 SVSGSS---------------------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVK 389
S +G YNLP WS+SILPDC+ FNTAKV QT+
Sbjct: 368 SETGGCAAFLSNNDWKSAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSKMEM 427
Query: 390 RPNQAGNDQAPLQWKWRPEMINDF----VVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTN 444
P + L W+ E I+ +R G L++Q T D SDYLWY+T+
Sbjct: 428 LPT----NSEMLSWETYSEDISALDDSSSIRSFG------LLEQINVTRDTSDYLWYITS 477
Query: 445 ADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGK 504
D+ + L G TL + ++G +H ++NG S + +F+ V L G
Sbjct: 478 VDIGSTESFLHGGELPTLIVETTGHAMHVFINGQLSGSAFGTRKNRRFVFKGKVNLRAGS 537
Query: 505 NQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLD 564
N+I+LLS VGL N G F+ G+ GPV + G + DLS KWTY+VGL G
Sbjct: 538 NRIALLSVAVGLPNIGGHFETWSTGVLGPVAIQGLDHGKW---DLSWAKWTYQVGLKG-- 592
Query: 565 DKKFYNAKAAN--SERGWSSKNVPLNRR--MTWYKTTFEAPLENDPVVLNLQGMGKGFAW 620
+ N + N S W ++ ++ +TW+K F P ++P+ L++ MGKG W
Sbjct: 593 --EAMNLVSTNGISAVDWMQGSLIAQKQQPLTWHKAYFNTPEGDEPLALDMSSMGKGQVW 650
Query: 621 VNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVN 680
+NG ++GRYW Y + C Y G + KC CG P+Q WYHVPRSW+K N
Sbjct: 651 INGQSIGRYWTAYATGD----CNGCQYSGVFRPPKCQLGCGEPTQKWYHVPRSWLKPTQN 706
Query: 681 TLVLFEEFGGNPSQINFQTVVVGTAC------------------GQAHENKTMELTCH-- 720
LVLFEE GG+P++I+ V C G+ E ++ H
Sbjct: 707 LLVLFEELGGDPTRISLVKRSVTNVCSNVAEYHPNIKNWQIENYGKTEEFHLPKVRIHCA 766
Query: 721 -GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGAT 779
G+ IS IK+ASFG P G CG+FK+G+C A D ++EK+C+G+++C++ S +N G
Sbjct: 767 PGQSISSIKFASFGTPLGTCGSFKQGTCHAP-DSHAVVEKKCLGRQTCAVTISNSNFGED 825
Query: 780 SCAAGTVKRLVVEALC 795
C +KRL VEA C
Sbjct: 826 PC-PNVLKRLSVEAHC 840
>gi|302814772|ref|XP_002989069.1| hypothetical protein SELMODRAFT_269483 [Selaginella moellendorffii]
gi|300143170|gb|EFJ09863.1| hypothetical protein SELMODRAFT_269483 [Selaginella moellendorffii]
Length = 722
Score = 675 bits (1742), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/722 (48%), Positives = 443/722 (61%), Gaps = 58/722 (8%)
Query: 22 NLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFW 81
++ L+ V++D R + I+G+ ++L+S SIHYPR+ P MW LI AK GG+D IETYVFW
Sbjct: 17 HVGLSDTVAYDHRGLIINGQHRMLISASIHYPRAAPQMWSQLISNAKAGGIDVIETYVFW 76
Query: 82 NAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIE 141
+ H+P R Y+F G DL+ F+K + + GLY LRIGPYVCAEWN GGFPVWL ++PGIE
Sbjct: 77 DGHQPTRDTYNFEGRFDLVSFVKLVHEAGLYANLRIGPYVCAEWNLGGFPVWLKDVPGIE 136
Query: 142 ELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSY 201
RT N+ F EMQ F IV M K +KLFA QGGPIILAQIENEYGN+ + YG AGK Y
Sbjct: 137 -FRTNNQPFKAEMQAFVEKIVAMMKHDKLFAPQGGPIILAQIENEYGNIDAAYGAAGKEY 195
Query: 202 INWCAKMATSLDIGVPWIMCQESDAPSPMF-----------TPNNPNSPKIWTENWTGWF 250
+ W A MA L GVPWIMCQ+SDAP + PNN PK+WTENW+GWF
Sbjct: 196 MEWAANMAQGLGTGVPWIMCQQSDAPDYILDTCNGFYCDAWAPNNKKKPKMWTENWSGWF 255
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPI 310
+ WG P R ED+AFAVARFFQ GG+FQNYYMY GGTNFGR+SGGPY+TTSYDYDAPI
Sbjct: 256 QKWGEASPHRPVEDVAFAVARFFQRGGSFQNYYMYFGGTNFGRSSGGPYVTTSYDYDAPI 315
Query: 311 DEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTD---------YGNSVSGS------ 355
DE+G + QPKWGHL++LH +K E L + T YG++ SG+
Sbjct: 316 DEFGVIRQPKWGHLKQLHAAIKLCEAALGSNDPTYISLGQLQEAHVYGSTSSGACAAFLA 375
Query: 356 ---------------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAP 400
+Y LPAWSVSILPDCKT NTAKV+ QT + +P+ G
Sbjct: 376 NIDSSSDATVKFNSRTYLLPAWSVSILPDCKTVSHNTAKVHVQTAMPTMKPSITG----- 430
Query: 401 LQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNM 460
L W+ PE + V G A L +T D SDYLWY T+ D+ D + S
Sbjct: 431 LAWESYPEPVG--VWSDSGIVASALLEQINTTKDTSDYLWYTTSLDISQAD---AASGKA 485
Query: 461 TLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYG 520
L + S V+H +VNG S TK E+P++L G N +++L ATVGLQNYG
Sbjct: 486 LLSLESMRDVVHVFVNGKLAGSASTKGTQLYAAVEQPIELASGHNSLAILCATVGLQNYG 545
Query: 521 SKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGW 580
+ GI G V++ G + DL++ +W ++VGL G F + S+R
Sbjct: 546 PFIETWGAGINGSVIVKGLPSGQI---DLTAEEWIHQVGLKGESLAIF---TESGSQRVR 599
Query: 581 SSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGC 640
S VP + + WYK F++P NDPV L+L+ MGKG AW+NG ++GR+WP+ A +
Sbjct: 600 WSSAVPQGQALVWYKAHFDSPSGNDPVALDLESMGKGQAWINGQSIGRFWPSLRAPDTAG 659
Query: 641 STESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTV 700
++CDYRG Y S KC CG PSQ WYHVPRSW++D N +VLFEE GG PS ++F T
Sbjct: 660 CPQTCDYRGSYSSSKCRSGCGQPSQRWYHVPRSWLQDSGNLVVLFEEEGGKPSGVSFVTR 719
Query: 701 VV 702
V
Sbjct: 720 TV 721
>gi|357113908|ref|XP_003558743.1| PREDICTED: beta-galactosidase 5-like [Brachypodium distachyon]
Length = 839
Score = 674 bits (1740), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/827 (43%), Positives = 480/827 (58%), Gaps = 78/827 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D +A+ IDG+R+IL SGSIHYPRSTP MW L +KAK+GGLD I+TYVFWN HEP
Sbjct: 27 VTYDKKAVLIDGQRRILFSGSIHYPRSTPEMWEGLFQKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y+F G DL++FIKT Q GL+V LRIGPY+C EWN+GGFPVWL +PGI RT N+
Sbjct: 87 GNYNFEGRYDLVKFIKTAQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGI-SFRTDNE 145
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ FT IV M K E+LFASQGGPIIL+QIENEYG +G AGKSY NW AKM
Sbjct: 146 PFKTAMQGFTEKIVGMMKSEELFASQGGPIILSQIENEYGPEGKSFGAAGKSYSNWAAKM 205
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A LD GVPW+MC++ DAP P+ F+PN P P +WTE WTGWF +GG
Sbjct: 206 AVGLDTGVPWVMCKQDDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWTGWFTEFGGTI 265
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
KR EDL+FAVARF Q GG+F NYYMYHGGTNFGRT+GGP++TTSYDYDAP+DEYG
Sbjct: 266 RKRPVEDLSFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLAR 325
Query: 318 QPKWGHLRELHKLLKSMEKTLT---------------------------YGNVTNTDYGN 350
+PK+GHL+ELH+ +K E L N + + N
Sbjct: 326 EPKYGHLKELHRAVKLCEPALVSVDPAVTTLGSMQEAHVFRSPSSCAAFLANYNSNSHAN 385
Query: 351 SV-SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEM 409
V + Y+LP WS+SILPDCKT FNTA V QT+ + + ++ + W+ E
Sbjct: 386 VVFNNEHYSLPPWSISILPDCKTVVFNTATVGVQTS----QMQMWADGESSMMWERYDEE 441
Query: 410 INDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSG 468
+ L++Q T D SDYLWY+T+ D+ + L G ++L + S+G
Sbjct: 442 VGSLAA--APLLTTTGLLEQLNVTRDSSDYLWYITSVDVSPSEKFLQGGEPLSLTVQSAG 499
Query: 469 QVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPN 528
LH ++NG S A ++ L G N+I+LLS GL N G ++
Sbjct: 500 HALHIFINGQLQGSASGTREAKKFSYKGNANLRAGTNKIALLSIACGLPNVGVHYETWNT 559
Query: 529 GIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLN 588
GI GPV+L G + +DL+ W+Y+VGL G ++ N+ S W ++
Sbjct: 560 GIVGPVVLHGL---DVGSRDLTWQTWSYQVGLKG--EQMNLNSLEGASSVEWMQGSLLAQ 614
Query: 589 RRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYR 648
++WY+ F+ P ++P+ L++ MGKG W+NG ++GRY +Y + + ++C Y
Sbjct: 615 APLSWYRAYFDTPTGDEPLALDMGSMGKGQIWINGQSIGRYSTSYASGD----CKACSYA 670
Query: 649 GPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQ 708
G Y + KC CG P+Q WYHVP+SW++ N LV+FEE GG+ S+I+ V + C
Sbjct: 671 GSYRAPKCQAGCGQPTQRWYHVPKSWLQPSRNLLVVFEELGGDSSKISLVKRSVSSVCAD 730
Query: 709 AHENKT-------------------MELTCH-GRRISEIKYASFGDPQGACGAFKKGSCE 748
E T + L C G+ IS IK+ASFG P G CG F++G C
Sbjct: 731 VSEYHTNIKNWQIENAGEVEFHRPKVHLRCAPGQTISAIKFASFGTPLGTCGNFQQGDCH 790
Query: 749 AEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+ ++EK C+G++ C++ S N G C +K++ VEA+C
Sbjct: 791 S-TKSHAVLEKNCIGQQRCAVTISPDNFGGDPCPK-EMKKVAVEAVC 835
>gi|222624250|gb|EEE58382.1| hypothetical protein OsJ_09539 [Oryza sativa Japonica Group]
Length = 851
Score = 674 bits (1739), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/841 (43%), Positives = 489/841 (58%), Gaps = 94/841 (11%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D +A+ +DG+R+IL SGSIHYPRSTP MW LI+KAK+GGLD I+TYVFWN HEP
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y+F G DL+RFIKT+Q G++V LRIGPY+C EWN+GGFPVWL +PGI RT N+
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGI-SFRTDNE 145
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQ----------IENEYGNVMSDYGDAG 198
F N MQ FT IV M K E LFASQGGPIIL+Q IENEYG ++G AG
Sbjct: 146 PFKNAMQGFTEKIVGMMKSENLFASQGGPIILSQASAKLCFPCHIENEYGPEGKEFGAAG 205
Query: 199 KSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWT 247
K+YINW AKMA LD GVPW+MC+E DAP P+ F+PN P P +WTE W+
Sbjct: 206 KAYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWS 265
Query: 248 GWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYD 307
GWF +GG +R EDLAF VARF Q GG+F NYYMYHGGTNFGRT+GGP++TTSYDYD
Sbjct: 266 GWFTEFGGTIRQRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYD 325
Query: 308 APIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------SVSGSS--- 356
AP+DEYG +PK+GHL+ELH+ +K E+ L + T T G+ S SG +
Sbjct: 326 APLDEYGLAREPKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVFRSSSGCAAFL 385
Query: 357 -----------------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQA 399
Y+LP WS+SILPDCK FNTA V QTN + + +
Sbjct: 386 ANYNSNSYAKVIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTN----QMQMWADGAS 441
Query: 400 PLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSS 458
+ W+ E ++ L++Q T D SDYLWY+T+ ++ + L G +
Sbjct: 442 SMMWEKYDEEVDSLAA--APLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPSEKFLQGGT 499
Query: 459 NMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDL---FERPVKLTRGKNQISLLSATVG 515
++L + S+G LH ++NG Q + YG D + L G N+++LLS G
Sbjct: 500 PLSLTVQSAGHALHVFINGQL---QGSAYGTREDRKISYSGNANLRAGTNKVALLSVACG 556
Query: 516 LQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAAN 575
L N G ++ G+ GPV++ G DE +DL+ W+Y+VGL G + + + +
Sbjct: 557 LPNVGVHYETWNTGVVGPVVIHGL--DEG-SRDLTWQTWSYQVGLKG-EQMNLNSLEGSG 612
Query: 576 SERGWSSKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYL 634
S V N++ + WY+ F+ P ++P+ L++ MGKG W+NG ++GRYW Y
Sbjct: 613 SVEWMQGSLVAQNQQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAY- 671
Query: 635 AEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQ 694
AE D + C Y G Y + KC CG P+Q WYHVPRSW++ N LV+FEE GG+ S+
Sbjct: 672 AEGD---CKGCHYTGSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSK 728
Query: 695 INFQTVVVGTACGQAHE-------------------NKTMELTCH-GRRISEIKYASFGD 734
I V C E + L C G+ IS IK+ASFG
Sbjct: 729 IALAKRTVSGVCADVSEYHPNIKNWQIESYGEPEFHTAKVHLKCAPGQTISAIKFASFGT 788
Query: 735 PQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEAL 794
P G CG F++G C + I+ ++EK+C+G + C + S +N G C +KR+ VEA+
Sbjct: 789 PLGTCGTFQQGECHS-INSNSVLEKKCIGLQRCVVAISPSNFGGDPCPE-VMKRVAVEAV 846
Query: 795 C 795
C
Sbjct: 847 C 847
>gi|326512146|dbj|BAJ96054.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 847
Score = 674 bits (1739), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/831 (43%), Positives = 483/831 (58%), Gaps = 83/831 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D +A+ I+G+R+IL SGSIHYPRSTP MW LI+KAK+GGLD I+TYVFWN HEP
Sbjct: 32 VTYDRKAVLINGQRRILFSGSIHYPRSTPEMWEGLIQKAKDGGLDVIQTYVFWNGHEPTP 91
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y+F G DL++FIKT Q GL+V LRIGPY+C EWN+GGFPVWL +PGI RT N+
Sbjct: 92 GSYNFEGRYDLVKFIKTAQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGIS-FRTDNE 150
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ FT IV M K E+LFASQGGPIIL+QIENEYG ++G AGKSY +W AKM
Sbjct: 151 PFKAAMQGFTEKIVGMMKSEELFASQGGPIILSQIENEYGPEEKEFGAAGKSYSDWAAKM 210
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A LD GVPW+MC++ DAP P+ FTPN P+ P +WTE WTGWF +GG
Sbjct: 211 AVGLDTGVPWVMCKQEDAPDPVINACNGFYCDAFTPNTPSKPTMWTEAWTGWFTEFGGTI 270
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
KR EDL+FAVARF Q GG+F NYYMYHGGTNFGRT+GGP++TTSYDYDAP+DEYG
Sbjct: 271 RKRPVEDLSFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLAR 330
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------SVSGSS------------- 356
+PK+GHL+ELHK +K E+ L + T T G+ S SG +
Sbjct: 331 EPKYGHLKELHKAIKLCEQALVSVDPTVTSLGSMQEAHVYRSPSGCAAFLANYNSNSHAK 390
Query: 357 -------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEM 409
Y+LP WS+SILPDCKT +NTA V QT+ + + + + W+ E
Sbjct: 391 IVFDNEHYSLPPWSISILPDCKTVVYNTATVGVQTS----QMQMWSDGASSMMWERYDEE 446
Query: 410 INDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSG 468
+ L++Q +T D SDYLWYMT+ D+ + L G ++L + S+G
Sbjct: 447 VGSLAA--APLLTTTGLLEQLNATRDTSDYLWYMTSVDVSPSEKSLQGGKPLSLTVQSAG 504
Query: 469 QVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPN 528
LH +VNG S ++ VKL G N+ISLLS GL N G ++
Sbjct: 505 HALHIFVNGQLQGSASGTREDKRISYKGDVKLRAGTNKISLLSVACGLPNIGVHYETWNT 564
Query: 529 GIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLN 588
G+ GPV+L G DE +DL+ WTY+VGL G ++ N+ S W ++
Sbjct: 565 GVNGPVVLHGL--DEG-SRDLTWQTWTYQVGLKG--EQMNLNSLEGASSVEWMQGSLIAQ 619
Query: 589 RRM--TWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCD 646
+M WY+ F+ P ++P+ L++ MGKG W+NG ++GRY Y + + C
Sbjct: 620 NQMPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYSLAYATGD----CKDCS 675
Query: 647 YRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTAC 706
Y G + + KC CG P+Q WYHVP+SW++ N LV+FEE GG+ S+I+ V C
Sbjct: 676 YTGSFRAIKCQAGCGQPTQRWYHVPKSWLQPTRNLLVVFEELGGDTSKISLVKRSVSNVC 735
Query: 707 GQAHE---------------------NKTMELTCH-GRRISEIKYASFGDPQGACGAFKK 744
E + L C G+ IS IK+ASFG P G CG+F++
Sbjct: 736 ADVSEFHPSIKNWQTENSGEAKPELRRSKVHLRCAPGQSISAIKFASFGTPLGTCGSFEQ 795
Query: 745 GSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G C + L + C+GK+ C++ S N G C +KR+ VEA+C
Sbjct: 796 GQCHSTKSQTVL--ENCIGKQRCAVTISPDNFGGDPC-PNVMKRVAVEAVC 843
>gi|357518749|ref|XP_003629663.1| Beta-galactosidase [Medicago truncatula]
gi|355523685|gb|AET04139.1| Beta-galactosidase [Medicago truncatula]
Length = 912
Score = 674 bits (1739), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/879 (43%), Positives = 490/879 (55%), Gaps = 134/879 (15%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ V++D RA+ IDG R++L+S IHYPR+TP MWPDLI KAKEGG+D IETYVFWN H+P
Sbjct: 48 FNVTYDHRALIIDGHRRMLISAGIHYPRATPEMWPDLIAKAKEGGVDVIETYVFWNGHQP 107
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
++ QY+F G DL++F K + GLY LRIGPY CAEWN+GGFPVWL ++PGI E RT
Sbjct: 108 VKGQYNFEGRYDLVKFAKLVASNGLYFFLRIGPYACAEWNFGGFPVWLRDIPGI-EFRTN 166
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQ------IENEYGNVMSDYGDAGKS 200
N F EM+ F + +V++ ++E LF+ QGGPIIL Q IENEYGN+ S YG+ GK
Sbjct: 167 NAPFKEEMKRFVSKVVNLMREEMLFSWQGGPIILLQVRREYGIENEYGNLESSYGNEGKE 226
Query: 201 YINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGW 249
Y+ W A MA SL GVPW+MC++ DAP + F PN+ N P WTENW GW
Sbjct: 227 YVKWAASMALSLGAGVPWVMCKQPDAPYDIIDTCNAYYCDGFKPNSRNKPIFWTENWDGW 286
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAP 309
+ WG + P R EDLAFAVARFFQ GG+ QNYYMY GGTNFGRT+GGP TSYDYDAP
Sbjct: 287 YTQWGERLPHRPVEDLAFAVARFFQRGGSLQNYYMYFGGTNFGRTAGGPLQITSYDYDAP 346
Query: 310 IDEYGHLNQPKWGHLRELHKLLKSMEKTLTYG--------------------------NV 343
IDEYG LN+PKWGHL++LH LK E L N+
Sbjct: 347 IDEYGLLNEPKWGHLKDLHAALKLCEPALVAADSPTYIKLGSKQEAHVYQENVHREGLNL 406
Query: 344 TNTDYGNSVS-----------------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV 386
+ + N S G +Y LP WSVSILPDC++ FNTAKV QT+V
Sbjct: 407 SISQISNKCSAFLANIDERKAATVTFRGQTYTLPPWSVSILPDCRSAIFNTAKVGAQTSV 466
Query: 387 KVKRPN---------------QAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-K 430
K+ N G W E IN ++ F + +
Sbjct: 467 KLVGSNLPLTSNLLLSQQSIDHNGISHISKSWMTTKEPINIWI---NSSFTAEGIWEHLN 523
Query: 431 STNDVSDYLWYMTNADLKDDDPIL--SGSSNMTLRINSSGQVLHAYVNGNYVDS---QWT 485
T D SDYLWY T + D D + +++ L I+S +L +VNG + + W
Sbjct: 524 VTKDQSDYLWYSTRIYVSDGDILFWKENAAHPKLAIDSVRDILRVFVNGQLIGNVVGHWV 583
Query: 486 KYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETI 545
K A L +P G N ++LL+ TVGLQNYG+ + GI G + + G
Sbjct: 584 K--AVQTLQFQP-----GYNDLTLLTQTVGLQNYGAFIEKDGAGIRGTIKITGFENGHI- 635
Query: 546 IKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGW---SSKNVPLNRRMTWYKTTFEAPL 602
DLS WTY+VGL G + KFYN ++ N+ GW + +P TWYKT F+ P
Sbjct: 636 --DLSKPLWTYQVGLQG-EFLKFYNEESENA--GWVELTPDAIP--STFTWYKTYFDVPG 688
Query: 603 ENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGN 662
NDPV L+L+ MGKG AWVNG+++GRYW T ++ + GC + CDYRG Y SDKC NCG
Sbjct: 689 GNDPVALDLESMGKGQAWVNGHHIGRYW-TRVSPKTGC--QVCDYRGAYDSDKCTTNCGK 745
Query: 663 PSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHEN---------- 712
P+Q YHVPRSW+K N LV+ EE GGNP I+ + C Q ++
Sbjct: 746 PTQTLYHVPRSWLKASNNFLVILEETGGNPLGISVKLHSASIVCAQVSQSYYPPMQKLLN 805
Query: 713 ---------------KTMELTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPL 756
M L C G IS I +ASFG P G+C +F +G+C A +
Sbjct: 806 ASLLGQQEVSSNDMIPEMNLRCRDGNIISSITFASFGTPGGSCQSFSRGNCHAP-SSKSI 864
Query: 757 IEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+ K C+GK+SCSI+ S G C VK L VEA C
Sbjct: 865 VSKACLGKRSCSIKISSDVFGGDPC-QDVVKTLSVEARC 902
>gi|218192153|gb|EEC74580.1| hypothetical protein OsI_10152 [Oryza sativa Indica Group]
Length = 851
Score = 673 bits (1737), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/841 (43%), Positives = 489/841 (58%), Gaps = 94/841 (11%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D +A+ +DG+R+IL SGSIHYPRSTP MW LI+KAK+GGLD I+TYVFWN HEP
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y+F G DL+RFIKT+Q G++V LRIGPY+C EWN+GGFPVWL +PGI RT N+
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGI-SFRTDNE 145
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQ----------IENEYGNVMSDYGDAG 198
F N MQ FT IV M K E LFASQGGPIIL+Q IENEYG ++G AG
Sbjct: 146 PFKNAMQGFTEKIVGMMKSENLFASQGGPIILSQASAKLCFPCHIENEYGPEGKEFGAAG 205
Query: 199 KSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWT 247
K+YINW AKMA LD GVPW+MC+E DAP P+ F+PN P P +WTE W+
Sbjct: 206 KAYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWS 265
Query: 248 GWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYD 307
GWF +GG +R EDLAF VARF Q GG+F NYYMYHGGTNFGRT+GGP++TTSYDYD
Sbjct: 266 GWFTEFGGTIRQRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYD 325
Query: 308 APIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------SVSGSS--- 356
AP+DEYG +PK+GHL+ELH+ +K E+ L + T T G+ S SG +
Sbjct: 326 APLDEYGLAREPKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVFRSSSGCAAFL 385
Query: 357 -----------------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQA 399
Y+LP WS+SILPDCK FNTA V QTN + + +
Sbjct: 386 ANYNSNSYAKVIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTN----QMQMWADGAS 441
Query: 400 PLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSS 458
+ W+ E ++ L++Q T D SDYLWY+T+ ++ + L G +
Sbjct: 442 SMMWEKYDEEVDSLAA--APLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPSEKFLQGGT 499
Query: 459 NMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDL---FERPVKLTRGKNQISLLSATVG 515
++L + S+G LH ++NG Q + YG D + L G N+++LLS G
Sbjct: 500 PLSLTVQSAGHALHVFINGQL---QGSAYGTREDRKISYSGNANLRAGTNKVALLSVACG 556
Query: 516 LQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAAN 575
L N G ++ G+ GPV++ G DE +DL+ W+Y+VGL G + + + +
Sbjct: 557 LPNVGVHYETWNTGVVGPVVIHGL--DEG-SRDLTWQTWSYQVGLKG-EQMNLNSLEGSG 612
Query: 576 SERGWSSKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYL 634
S V N++ + WY+ F+ P ++P+ L++ MGKG W+NG ++GRYW Y
Sbjct: 613 SVEWMQGSLVAQNQQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAY- 671
Query: 635 AEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQ 694
AE D + C Y G Y + KC CG P+Q WYHVPRSW++ N LV+FEE GG+ S+
Sbjct: 672 AEGD---CKGCHYTGSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSK 728
Query: 695 INFQTVVVGTACGQAHE-------------------NKTMELTCH-GRRISEIKYASFGD 734
I V C E + L C G+ IS IK+ASFG
Sbjct: 729 IALAKRTVSGVCADVSEYHPNIKNWQIESYGEPEFHTAKVHLKCAPGQTISAIKFASFGT 788
Query: 735 PQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEAL 794
P G CG F++G C + I+ ++E++C+G + C + S +N G C +KR+ VEA+
Sbjct: 789 PLGTCGTFQQGECHS-INSNSVLERKCIGLERCVVAISPSNFGGDPCPE-VMKRVAVEAV 846
Query: 795 C 795
C
Sbjct: 847 C 847
>gi|34148077|gb|AAQ62586.1| putative beta-galactosidase [Glycine max]
Length = 909
Score = 673 bits (1737), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/872 (43%), Positives = 489/872 (56%), Gaps = 126/872 (14%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ VS+D RA+ ++G+R+ L+S IHYPR+TP MWPDLI K+KEGG D IETYVFWN HEP
Sbjct: 45 FNVSYDHRALILNGKRRFLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNGHEP 104
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+R QY+F G DL++F++ GLY LRIGPY CAEWN+GGFPVWL ++PGI E RT
Sbjct: 105 VRGQYNFEGRYDLVKFVRLAASHGLYFFLRIGPYACAEWNFGGFPVWLRDIPGI-EFRTN 163
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
N F EM+ F + +V++ ++E+LF+ QGGPIIL QIENEYGN+ + YG GK Y+ W A
Sbjct: 164 NAPFKEEMKRFVSKVVNLMREERLFSWQGGPIILLQIENEYGNIENSYGKGGKEYMKWAA 223
Query: 207 KMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGG 255
KMA SL GVPW+MC++ DAP + F PN+ N P +WTENW GW+ WG
Sbjct: 224 KMALSLGAGVPWVMCRQQDAPYDIIDTCNAYYCDGFKPNSHNKPTMWTENWDGWYTQWGE 283
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
+ P R EDLAFAVARFFQ GG+FQNYYMY GGTNFGRT+GGP TSYDYDAPIDEYG
Sbjct: 284 RLPHRPVEDLAFAVARFFQRGGSFQNYYMYFGGTNFGRTAGGPLQITSYDYDAPIDEYGL 343
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYGN--------------------------------- 342
L +PKWGHL++LH LK E L +
Sbjct: 344 LREPKWGHLKDLHAALKLCEPALVATDSPTYIKLGPKQEAHVYQANVHLEGLNLSMFESS 403
Query: 343 ------VTNTDYGNSVS----GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR-- 390
+ N D + G Y +P WSVS+LPDC+ FNTAKV QT+VK+
Sbjct: 404 SICSAFLANIDEWKEATVTFRGQRYTIPPWSVSVLPDCRNTVFNTAKVRAQTSVKLVESY 463
Query: 391 --------PNQAGNDQAPL-----QWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVS 436
P Q Q W E +N + K F + + + T D S
Sbjct: 464 LPTVSNIFPAQQLRHQNDFYYISKSWMTTKEPLN---IWSKSSFTVEGIWEHLNVTKDQS 520
Query: 437 DYLWYMTNADLKDDDPILSGSSNM--TLRINSSGQVLHAYVNGNYVDS---QWTKYGASN 491
DYLWY T + D D + +++ L I+ +L ++NG + + W K
Sbjct: 521 DYLWYSTRVYVSDSDILFWEENDVHPKLTIDGVRDILRVFINGQLIGNVVGHWIK----- 575
Query: 492 DLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSS 551
+ ++ G N ++LL+ TVGLQNYG+ + GI G + + G + DLS
Sbjct: 576 --VVQTLQFLPGYNDLTLLTQTVGLQNYGAFLEKDGAGIRGKIKITGFENGDI---DLSK 630
Query: 552 HKWTYKVGLYGLDDKKFYNAKAANSERGW---SSKNVPLNRRMTWYKTTFEAPLENDPVV 608
WTY+VGL G + KFY+ + NSE W + +P TWYKT F+ P DPV
Sbjct: 631 SLWTYQVGLQG-EFLKFYSEENENSE--WVELTPDAIP--STFTWYKTYFDVPGGIDPVA 685
Query: 609 LNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWY 668
L+ + MGKG AWVNG ++GRYW T ++ + GC + CDYRG Y SDKC+ NCG P+Q Y
Sbjct: 686 LDFKSMGKGQAWVNGQHIGRYW-TRVSPKSGCQ-QVCDYRGAYNSDKCSTNCGKPTQTLY 743
Query: 669 HVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHE----------------- 711
HVPRSW+K N LV+ EE GGNP +I+ + C Q E
Sbjct: 744 HVPRSWLKATNNLLVILEETGGNPFEISVKLHSSRIICAQVSESNYPPLQKLVNADLIGE 803
Query: 712 -----NKTMELTCH---GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVG 763
N EL H G IS + +ASFG P G+C F +G+C A + ++ + C G
Sbjct: 804 EVSANNMIPELHLHCQQGHTISSVAFASFGTPGGSCQNFSRGNCHAP-SSMSIVSEACQG 862
Query: 764 KKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
K+SCSI+ S++ G C G VK L VEA C
Sbjct: 863 KRSCSIKISDSAFGVDPC-PGVVKTLSVEARC 893
>gi|115488372|ref|NP_001066673.1| Os12g0429200 [Oryza sativa Japonica Group]
gi|122234131|sp|Q0INM3.1|BGL15_ORYSJ RecName: Full=Beta-galactosidase 15; Short=Lactase 15; Flags:
Precursor
gi|113649180|dbj|BAF29692.1| Os12g0429200 [Oryza sativa Japonica Group]
Length = 919
Score = 672 bits (1734), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/868 (42%), Positives = 486/868 (55%), Gaps = 112/868 (12%)
Query: 22 NLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFW 81
N + V++D RA+ I G+R++L+S +HYPR+TP MWP LI K KEGG D IETYVFW
Sbjct: 57 NFFEPFNVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGADVIETYVFW 116
Query: 82 NAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIE 141
N HEP + QY F DL++F K + +GL++ LRIGPY CAEWN+GGFPVWL ++PGIE
Sbjct: 117 NGHEPAKGQYYFEERFDLVKFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIE 176
Query: 142 ELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSY 201
RT N+ F EMQ F T IV + K+EKL++ QGGPIIL QIENEYGN+ +YG AGK Y
Sbjct: 177 -FRTDNEPFKAEMQTFVTKIVTLMKEEKLYSWQGGPIILQQIENEYGNIQGNYGQAGKRY 235
Query: 202 INWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWF 250
+ W A+MA LD G+PW+MC+++DAP + F PN+ N P IWTE+W GW+
Sbjct: 236 MQWAAQMAIGLDTGIPWVMCRQTDAPEEIIDTCNAFYCDGFKPNSYNKPTIWTEDWDGWY 295
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPI 310
WGG P R AED AFAVARF+Q GG+ QNYYMY GGTNF RT+GGP TSYDYDAPI
Sbjct: 296 ADWGGALPHRPAEDSAFAVARFYQRGGSLQNYYMYFGGTNFARTAGGPLQITSYDYDAPI 355
Query: 311 DEYGHLNQPKWGHLRELHKLLK-------------------SMEKTLTY----------- 340
DEYG L QPKWGHL++LH +K SM++ Y
Sbjct: 356 DEYGILRQPKWGHLKDLHTAIKLCEPALIAVDGSPQYIKLGSMQEAHVYSTGEVHTNGSM 415
Query: 341 -----------GNVTNTDYGNS-VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTN--- 385
N+ Y + + G SY+LP WSVSILPDC+ FNTA++ QT+
Sbjct: 416 AGNAQICSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCENVAFNTARIGAQTSVFT 475
Query: 386 VKVKRPNQAGNDQAPL------------QWKWRPEMINDFVVRGKGHFALNTLIDQ-KST 432
V+ P+++ + + W E I + G +FA+ +++ T
Sbjct: 476 VESGSPSRSSRHKPSILSLTSGGPYLSSTWWTSKETIGTW---GGNNFAVQGILEHLNVT 532
Query: 433 NDVSDYLWYMTNADLKDDDPILSGSSNM--TLRINSSGQVLHAYVNGNYVDSQWTKYGAS 490
D+SDYLWY T ++ D D S + +L I+ V +VNG SQ + +
Sbjct: 533 KDISDYLWYTTRVNISDADVAFWSSKGVLPSLTIDKIRDVARVFVNGKLAGSQVGHWVS- 591
Query: 491 NDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLS 550
++P++L G N+++LLS VGLQNYG+ + G G V L G + + DL+
Sbjct: 592 ---LKQPIQLVEGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVTLTGLSDGDV---DLT 645
Query: 551 SHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLN 610
+ WTY+VGL G + A GWS + TWYKT F P DPV ++
Sbjct: 646 NSLWTYQVGLKG--EFSMIYAPEKQGCAGWSRMQKDSVQPFTWYKTMFSTPKGTDPVAID 703
Query: 611 LQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHV 670
L MGKG AWVNG+ +GRYW + +A E GCS+ SC Y G Y KC NCG P+Q WYH+
Sbjct: 704 LGSMGKGQAWVNGHLIGRYW-SLVAPESGCSS-SCYYPGAYNERKCQSNCGMPTQNWYHI 761
Query: 671 PRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHEN------------------ 712
PR W+K+ N LVLFEE GG+PS I+ + T C + EN
Sbjct: 762 PREWLKESDNLLVLFEETGGDPSLISLEAHYAKTVCSRISENYYPPLSAWSHLSSGRASV 821
Query: 713 ----KTMELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSC 767
+ L C G ISEI +AS+G P G C F KG+C A L L+ + CVG C
Sbjct: 822 NAATPELRLQCDDGHVISEITFASYGTPSGGCLNFSKGNCHAS-STLDLVTEACVGNTKC 880
Query: 768 SIEASEANLGATSCAAGTVKRLVVEALC 795
+I S G G +K L VEA C
Sbjct: 881 AISVSNDVFGDP--CRGVLKDLAVEAKC 906
>gi|326515822|dbj|BAK07157.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 847
Score = 672 bits (1734), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/831 (43%), Positives = 482/831 (58%), Gaps = 83/831 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D +A+ I+G+R+IL SGSIHYPRSTP MW LI+KAK+GGLD I+TYVFWN HEP
Sbjct: 32 VTYDRKAVLINGQRRILFSGSIHYPRSTPEMWEGLIQKAKDGGLDVIQTYVFWNGHEPTP 91
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y+F G DL++FIKT Q GL+V LRIGPY+C EWN+GGFPVWL +PGI RT N+
Sbjct: 92 GSYNFEGRYDLVKFIKTAQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGIS-FRTDNE 150
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ FT IV M K E+LFASQGGPIIL+QIENEYG ++G AGKSY +W AKM
Sbjct: 151 PFKAAMQGFTEKIVGMMKSEELFASQGGPIILSQIENEYGPEEKEFGAAGKSYSDWAAKM 210
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A LD GVPW+MC++ DAP P+ FTPN P+ P +WTE WTGWF +GG
Sbjct: 211 AVGLDTGVPWVMCKQEDAPDPVINACNGFYCDAFTPNTPSKPTMWTEAWTGWFTEFGGTI 270
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
KR EDL+FAVARF Q GG+F NYYMYHGGTNFGRT+GGP++TTSYDYDAP+DEYG
Sbjct: 271 RKRPVEDLSFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLAR 330
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------SVSGSS------------- 356
+PK+GHL+ELHK +K E+ L + T T G+ S SG +
Sbjct: 331 EPKYGHLKELHKAIKLCEQALVSVDPTVTSLGSMQEAHVYRSPSGCAAFLANYNSNSHAK 390
Query: 357 -------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEM 409
Y+LP WS+SILPDCKT +NTA V QT+ + + + + W+ E
Sbjct: 391 IVFDNEHYSLPPWSISILPDCKTVVYNTATVGVQTS----QMQMWSDGASSMMWERYDEE 446
Query: 410 INDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSG 468
+ L++Q +T D SDYLWYMT+ D+ + L G ++L + S+G
Sbjct: 447 VGSLAA--APLLTTTGLLEQLNATRDTSDYLWYMTSVDVSPSEKSLQGGKPLSLTVQSAG 504
Query: 469 QVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPN 528
LH +VNG S ++ VKL G N+ISLLS GL N G ++
Sbjct: 505 HALHIFVNGQLQGSASGTREDKRISYKGDVKLRAGTNKISLLSVACGLPNIGVHYETWNT 564
Query: 529 GIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLN 588
G+ GPV+L G DE +DL+ WTY+VGL G ++ N+ S W ++
Sbjct: 565 GVNGPVVLHGL--DEG-SRDLTWQTWTYQVGLKG--EQMNLNSLEGASSVEWMQGSLIAQ 619
Query: 589 RRM--TWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCD 646
+M WY+ F+ P ++P+ L++ MGKG W+NG ++GRY Y + + C
Sbjct: 620 NQMPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYSLAYATGD----CKDCS 675
Query: 647 YRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTAC 706
Y G + + KC CG P+Q WYHVP+ W++ N LV+FEE GG+ S+I+ V C
Sbjct: 676 YTGSFRAIKCQAGCGQPTQRWYHVPKPWLQPTRNLLVVFEELGGDTSKISLVKRSVSNVC 735
Query: 707 GQAHE---------------------NKTMELTCH-GRRISEIKYASFGDPQGACGAFKK 744
E + L C G+ IS IK+ASFG P G CG+F++
Sbjct: 736 ADVSEFHPSIKNWQTENSGEAKPELRRSKVHLRCAPGQSISAIKFASFGTPLGTCGSFEQ 795
Query: 745 GSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G C + L + C+GK+ C++ S N G C +KR+ VEA+C
Sbjct: 796 GQCHSTKSQTVL--ENCIGKQRCAVTISPDNFGGDPC-PNVMKRVAVEAVC 843
>gi|334184642|ref|NP_001189660.1| beta galactosidase 9 [Arabidopsis thaliana]
gi|330253651|gb|AEC08745.1| beta galactosidase 9 [Arabidopsis thaliana]
Length = 859
Score = 671 bits (1732), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/850 (44%), Positives = 486/850 (57%), Gaps = 115/850 (13%)
Query: 12 LLCLILQTLFNLSL-------AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLI 64
+L LI+ L + + VS+D RA+ I G+R++L+S IHYPR+TP MW DLI
Sbjct: 14 ILSLIIALLVYFPILSGSYFKPFNVSYDHRALIIAGKRRMLVSAGIHYPRATPEMWSDLI 73
Query: 65 KKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAE 124
K+KEGG D ++TYVFWN HEP++ QY+F G DL++F+K I GLY+ LRIGPYVCAE
Sbjct: 74 AKSKEGGADVVQTYVFWNGHEPVKGQYNFEGRYDLVKFVKLIGSSGLYLHLRIGPYVCAE 133
Query: 125 WNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIE 184
WN+GGFPVWL ++PGIE RT N+ F EMQ F T IVD+ ++ KLF QGGPII+ QIE
Sbjct: 134 WNFGGFPVWLRDIPGIE-FRTDNEPFKKEMQKFVTKIVDLMREAKLFCWQGGPIIMLQIE 192
Query: 185 NEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTP 233
NEYG+V YG GK Y+ W A MA L GVPW+MC+++DAP + F P
Sbjct: 193 NEYGDVEKSYGQKGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGFKP 252
Query: 234 NNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGR 293
N+ P +WTE+W GW+ WGG P R AEDLAFAVARF+Q GG+FQNYYMY GGTNFGR
Sbjct: 253 NSRTKPVLWTEDWDGWYTKWGGSLPHRPAEDLAFAVARFYQRGGSFQNYYMYFGGTNFGR 312
Query: 294 TSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGN----------- 342
TSGGP+ TSYDYDAP+DEYG ++PKWGHL++LH +K E L +
Sbjct: 313 TSGGPFYITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAADAPQYRKLGSKQ 372
Query: 343 -------------------VTNTDYGNSV----SGSSYNLPAWSVSILPDCKTEEFNTAK 379
+ N D S +G SY LP WSVSILPDC+ FNTAK
Sbjct: 373 EAHIYHGDGETGGKVCAAFLANIDEHKSAHVKFNGQSYTLPPWSVSILPDCRHVAFNTAK 432
Query: 380 VNTQTNVK-VKRPNQAGNDQAPLQWKWRPEMIN-----------DFVVRGKGHFALNTLI 427
V QT+VK V+ + + LQ R + ++ + G+ +F L+
Sbjct: 433 VGAQTSVKTVESARPSLGSMSILQKVVRQDNVSYISKSWMALKEPIGIWGENNFTFQGLL 492
Query: 428 DQ-KSTNDVSDYLWYMTNADLKDDDPIL--SGSSNMTLRINSSGQVLHAYVNGNYVDS-- 482
+ T D SDYLW+ T + +DD N T+ I+S VL +VN S
Sbjct: 493 EHLNVTKDRSDYLWHKTRISVSEDDISFWKKNGPNSTVSIDSMRDVLRVFVNKQLAGSIV 552
Query: 483 -QWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVG-RA 540
W K +PV+ +G N + LL+ TVGLQNYG+ + G G L G +
Sbjct: 553 GHWVKA-------VQPVRFIQGNNDLLLLTQTVGLQNYGAFLEKDGAGFRGKAKLTGFKN 605
Query: 541 GDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRM-TWYKTTFE 599
GD DLS WTY+VGL G DK + N + WS+ + + WYKT F+
Sbjct: 606 GD----LDLSKSSWTYQVGLKGEADKIY--TVEHNEKAEWSTLETDASPSIFMWYKTYFD 659
Query: 600 APLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYN 659
P DPVVLNL+ MG+G AWVNG ++GRYW ++++DGC +CDYRG Y SDKC N
Sbjct: 660 PPAGTDPVVLNLESMGRGQAWVNGQHIGRYW-NIISQKDGCD-RTCDYRGAYNSDKCTTN 717
Query: 660 CGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHE-------- 711
CG P+Q YHVPRSW+K N LVLFEE GGNP +I+ +TV G CGQ E
Sbjct: 718 CGKPTQTRYHVPRSWLKPSSNLLVLFEETGGNPFKISVKTVTAGILCGQVSESHYPPLRK 777
Query: 712 -------NKTM---------ELTCH-GRRISEIKYASFGDPQGACGAFKKGSCEA--EID 752
N TM L C G IS I++AS+G P+G+C F G C A +
Sbjct: 778 WSTPDYINGTMSINSVAPEVHLHCEDGHVISSIEFASYGTPRGSCDGFSIGKCHASNSLS 837
Query: 753 VLPLIEKQCV 762
++ ++ C+
Sbjct: 838 IVSEVKLYCL 847
>gi|357464797|ref|XP_003602680.1| Beta-galactosidase [Medicago truncatula]
gi|355491728|gb|AES72931.1| Beta-galactosidase [Medicago truncatula]
Length = 781
Score = 671 bits (1730), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/834 (45%), Positives = 479/834 (57%), Gaps = 107/834 (12%)
Query: 11 ILLCLILQTLFNLSLAY-----RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIK 65
+LCL+ +L +L Y VS+DGR++ IDG+RK+L+S SIHYPRS P MWP LI+
Sbjct: 5 FILCLVSTSL-TFTLVYGGVGSNVSYDGRSLIIDGQRKLLISASIHYPRSVPAMWPALIQ 63
Query: 66 KAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEW 125
AKEGG+D IETYVFWN HE Y F G DL++F K +QD G+Y+ILRIGP+V AEW
Sbjct: 64 TAKEGGIDVIETYVFWNGHELSPGNYYFGGRFDLVQFAKVVQDAGMYLILRIGPFVAAEW 123
Query: 126 NYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIEN 185
N+GG PVWLH +PG RT N+ FM+ M+ FTT IV++ KKEKLFASQGGPIIL+QIEN
Sbjct: 124 NFGGVPVWLHYIPG-TVFRTYNQPFMHHMEKFTTYIVNLMKKEKLFASQGGPIILSQIEN 182
Query: 186 EYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPN 234
EYG + Y + GK Y W AKMA S + VPWIMCQ+ DAP P+ FTP
Sbjct: 183 EYGYYENYYKEDGKKYALWAAKMAVSQNTSVPWIMCQQWDAPDPVIDTCNSFYCDQFTPT 242
Query: 235 NPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRT 294
+P PK+WTENW GWFK++GG+DP R ED+AF+VARFFQ GG+ NYYMYHGGTNFGRT
Sbjct: 243 SPKRPKMWTENWPGWFKTFGGRDPHRPVEDVAFSVARFFQKGGSLNNYYMYHGGTNFGRT 302
Query: 295 SGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSG 354
+GGP++TTSYDYDAPIDEYG PKWGHL+ELHK +K E L YG N G SV
Sbjct: 303 AGGPFITTSYDYDAPIDEYGLPRLPKWGHLKELHKAIKLCEHVLLYGKSVNISLGPSVEA 362
Query: 355 -----------------------------SSYNLPAWSVSILPDCKTEEFNTAKVNTQTN 385
+SY+LPAWSVSILPDCK FNTAKV++ TN
Sbjct: 363 DIYTDSSGACAAFISNVDDKNDKKVVFRNASYHLPAWSVSILPDCKNVVFNTAKVSSPTN 422
Query: 386 VKVKRP---NQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWY 441
+ P Q+ Q L+W E + GK F N +D +T D +DYLW+
Sbjct: 423 IVAMIPEHLQQSDKGQKTLKWDVFKENPG---IWGKADFVKNGFVDHINTTKDTTDYLWH 479
Query: 442 MTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLT 501
T+ + ++ L S L I S G LHA+VN Y + S F+ P+ L
Sbjct: 480 TTSILIDANEEFLKKGSKPALLIESKGHTLHAFVNQKYQGTGTGNGSHSAFTFKNPISLR 539
Query: 502 RGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLY 561
GKN+I++LS TVGLQ G +D + G+ V ++G + TI DLSS+ W YK+G+
Sbjct: 540 AGKNEIAILSLTVGLQTAGPFYDFIGAGVTS-VKIIG-LNNRTI--DLSSNAWAYKIGVL 595
Query: 562 GLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWV 621
G + Y + NS + S+ P + +TWYK +AP ++PV L++ MGKG AW+
Sbjct: 596 G-EHLSIYQGEGMNSVKWTSTSEPPKGQALTWYKAIVDAPSGDEPVGLDMLYMGKGLAWL 654
Query: 622 NGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNT 681
NG +GRYWP + + CDYRG + DKC CG PSQ WYHVPRSW K N
Sbjct: 655 NGEEIGRYWPRISEFKKEDCVQECDYRGKFNPDKCDTGCGEPSQKWYHVPRSWFKPSGNV 714
Query: 682 LVLFEEFGGNPSQINFQTVVVGTACGQAHENKTMELTCHGRRISEIKYASFGDPQGACGA 741
LV+FEE GG+P++I F CH Y+S
Sbjct: 715 LVIFEEKGGDPTKITFVR------------------HCHN------PYSSI--------- 741
Query: 742 FKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
++EK CV K I+ E N C G +L VEA+C
Sbjct: 742 --------------VVEKVCVNKNDRVIKVIEDNFKTNLC-HGLSMKLAVEAIC 780
>gi|242084926|ref|XP_002442888.1| hypothetical protein SORBIDRAFT_08g004410 [Sorghum bicolor]
gi|241943581|gb|EES16726.1| hypothetical protein SORBIDRAFT_08g004410 [Sorghum bicolor]
Length = 923
Score = 668 bits (1724), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/864 (43%), Positives = 499/864 (57%), Gaps = 105/864 (12%)
Query: 22 NLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFW 81
N + V++D RA+ + G+R++L+S +HYPR+TP MWP LI KAKEGG+D IETY+FW
Sbjct: 62 NFFEPFNVTYDHRALILGGKRRMLVSAGLHYPRATPEMWPSLIAKAKEGGVDVIETYIFW 121
Query: 82 NAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIE 141
N HEP + QY F G D++RF K + +GL++ LRIGPY CAEWN+GGFPVWL ++PGIE
Sbjct: 122 NGHEPAKGQYYFEGRFDIVRFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIE 181
Query: 142 ELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSY 201
RT N+ + EMQNF T IVD+ K+EKL++ QGGPIIL QIENEYGN+ YG AGK Y
Sbjct: 182 -FRTDNEPYKAEMQNFVTKIVDIMKEEKLYSWQGGPIILQQIENEYGNIQGKYGQAGKRY 240
Query: 202 INWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWF 250
+ W A+MA +LD GVPW+MC+++DAP + F PN+ N P IWTE+W GW+
Sbjct: 241 MQWAAQMALALDTGVPWVMCRQTDAPEQILDTCNAFYCDGFKPNSYNKPTIWTEDWDGWY 300
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPI 310
WG P R A+D AFAVARF+Q GG+FQNYYMY GGTNF RT+GGP TSYDYDAPI
Sbjct: 301 ADWGEALPHRPAQDSAFAVARFYQRGGSFQNYYMYFGGTNFERTAGGPLQITSYDYDAPI 360
Query: 311 DEYGHLNQPKWGHLRELHKLLKSMEKTLT------------------------------- 339
DEYG L QPKWGHL++LH +K E LT
Sbjct: 361 DEYGILRQPKWGHLKDLHAAIKLCEPALTAVDGSPRYIKLGPMQEAHVYSSENVHTNGSI 420
Query: 340 ----------YGNVTNTDYGNS-VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQT---N 385
N+ Y + + G SY+LP WSVSILPDC+T FNTA+V TQT N
Sbjct: 421 SGNAQFCSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCETVAFNTARVGTQTSFFN 480
Query: 386 VKVKRPNQAGNDQAPLQWKWRPEMINDF-------VVRGKGHFALNTLIDQ-KSTNDVSD 437
V+ P+ + + + P + + + + + FA +++ T D+SD
Sbjct: 481 VESGSPSYSSRHKPRILSLGGPYLSSTWWASKEPVGIWSEDIFAAQGILEHLNVTKDISD 540
Query: 438 YLWYMTNADLKDDDPILSGSSNM--TLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFE 495
YL Y T ++ D+D + S + +L I+ V+ +VNG SQ + + N
Sbjct: 541 YLSYTTRVNISDEDVLYWNSEGLLPSLTIDQIRDVVRIFVNGKLAGSQVGHWVSLN---- 596
Query: 496 RPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWT 555
+P++L +G N+++LLS VGLQNYG+ + G G V L G + + DL++ WT
Sbjct: 597 QPLQLVQGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVKLTGLSNGDI---DLTNSLWT 653
Query: 556 YKVGLYGLDDKKFYNAKAANSERGWSS-KNVPLNRRMTWYKTTFEAPLENDPVVLNLQGM 614
Y++GL G + + Y+ + S GWSS +N TW+KTTF+AP N PV ++L M
Sbjct: 654 YQIGLKG-EFSRIYSPEKQGSA-GWSSMQNDDTLSPFTWFKTTFDAPEGNGPVAIDLGSM 711
Query: 615 GKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSW 674
GKG AWVNG+ +GRYW + +A E GC + SC+Y G YG KC NCG +Q WYH+PR W
Sbjct: 712 GKGQAWVNGHLIGRYW-SLVAPESGCPS-SCNYAGNYGDSKCRSNCGIATQSWYHIPREW 769
Query: 675 IKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHEN---------------------- 712
+++ N LVLFEE GG+PSQI+ + T C + E
Sbjct: 770 LQESDNLLVLFEETGGDPSQISLEVHYTKTICSKISETYYPPLSAWSRAANGRPSVNTVA 829
Query: 713 KTMELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEA 771
+ L C G IS+I +AS+G P G C F G+C A L L+ + C GK C+I
Sbjct: 830 PELRLQCDEGHVISKITFASYGTPTGDCQNFSVGNCHAST-TLDLVAEACEGKNRCAISV 888
Query: 772 SEANLGATSCAAGTVKRLVVEALC 795
+ G C VK L V A C
Sbjct: 889 TNDVFG-DPCRK-VVKDLAVVAEC 910
>gi|224128630|ref|XP_002329051.1| predicted protein [Populus trichocarpa]
gi|222839722|gb|EEE78045.1| predicted protein [Populus trichocarpa]
Length = 830
Score = 668 bits (1723), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/852 (42%), Positives = 486/852 (57%), Gaps = 94/852 (11%)
Query: 10 AILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
A ++ L+ S+ VS+D +AITI+G+R+IL+SGSIHYPRS+P MWPDLI+KAKE
Sbjct: 6 ASVVFLVFLASLVCSVTASVSYDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKE 65
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
GGLD I+TYVFWN HEP +Y F GN DL++F+K +++ GLYV LRIGPY+CAEWN+G
Sbjct: 66 GGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFVKLVKEAGLYVNLRIGPYICAEWNFG- 124
Query: 130 FPVWLHNMPGIEELRTTNKVFMNE---MQNFTTLIVDMAKKEKLFASQGGPIILAQIENE 186
+ + F E M+ FTT IV+M K E+LF SQGGPIIL+QIENE
Sbjct: 125 -----------HQFQNGQWPFQGEAAQMRKFTTKIVNMMKAERLFESQGGPIILSQIENE 173
Query: 187 YGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNN 235
YG + + G G++Y W A+MA L GVPW+MC++ DAP P+ F+PN
Sbjct: 174 YGPMEYELGSPGQAYTKWAAQMAVGLRTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNK 233
Query: 236 PNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTS 295
PK+WTE WTGWF +GG P R AED+AF+VARF Q GG+F NYYMYHGGTNFGRT+
Sbjct: 234 AYKPKMWTEAWTGWFTQFGGPVPHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTA 293
Query: 296 GGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN----- 350
GGP++ TSYDYDAP+DEYG L QPKWGHL++LH+ +K E L G+ T GN
Sbjct: 294 GGPFIATSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGDATVIPLGNYQEAH 353
Query: 351 ------------------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV 386
S YNLP WS+SILPDCK +NTA+V Q+
Sbjct: 354 VFNYKAGGCAAFLANYHQRSFAKVSFRNMHYNLPPWSISILPDCKNTVYNTARVGAQSAT 413
Query: 387 KVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNA 445
P L W+ E + G F + L++Q +T DVSDYLWYMT+
Sbjct: 414 IKMTPVPM---HGGLSWQTYNEEPSS---SGDNTFTMVGLLEQINTTRDVSDYLWYMTDV 467
Query: 446 DLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKN 505
+ + L L + S+G LH ++NG + + F + V L G N
Sbjct: 468 HIDPSEGFLKSGKYPVLTVLSAGHALHVFINGQLSGTAYGSLDFPKLTFSQGVSLRAGVN 527
Query: 506 QISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDD 565
+ISLLS VGL N G F+ GI GPV L G DLS KW+YK+GL+G
Sbjct: 528 KISLLSIAVGLPNVGPHFETWNAGILGPVTLNGLNEGRM---DLSWQKWSYKIGLHGEAL 584
Query: 566 KKFYNAKAANSERGWSSKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGY 624
+ +++ E W+ ++ ++ ++WYKTTF AP N P+ L++ MGKG W+NG
Sbjct: 585 SLHSISGSSSVE--WAEGSLVAQKQPLSWYKTTFNAPAGNSPLALDMGSMGKGQIWINGQ 642
Query: 625 NLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVL 684
++GR+WP Y A G E C Y G Y +KC+ NCG SQ WYHVP+SW+K N LV+
Sbjct: 643 HVGRHWPAYKA--SGTCGE-CTYIGTYNENKCSTNCGEASQRWYHVPQSWLKPTGNLLVV 699
Query: 685 FEEFGGNPSQINFQTVVVGTACGQAHE----------------NKTMELTCH-----GRR 723
FEE+GG+P+ ++ V + C +E NK + H G++
Sbjct: 700 FEEWGGDPNGVSLVRREVDSVCADIYEWQPTLMNYQMQASGKVNKPLRPKAHLSCGPGQK 759
Query: 724 ISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAA 783
I IK+ASFG P+G CG++ +GSC A CVG+ SCS+ + G C +
Sbjct: 760 IRSIKFASFGTPEGVCGSYNQGSCHA-FHSYDAFNNLCVGQNSCSVTVAPEMFGGDPCPS 818
Query: 784 GTVKRLVVEALC 795
+K+L EA+C
Sbjct: 819 -VMKKLAAEAIC 829
>gi|414864995|tpg|DAA43552.1| TPA: hypothetical protein ZEAMMB73_935084 [Zea mays]
Length = 845
Score = 667 bits (1720), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/833 (43%), Positives = 480/833 (57%), Gaps = 88/833 (10%)
Query: 30 SHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRR 89
++D +A+ IDG+R+IL SGSIHYPRSTP MW LI+KAK+GGLD I+TYVFWN HEP
Sbjct: 30 TYDKKAVLIDGQRRILFSGSIHYPRSTPDMWEGLIQKAKDGGLDVIQTYVFWNGHEPTPG 89
Query: 90 QYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKV 149
Y F DL+RF+KT+Q GL+V LRIGPY+C EWN+GGFPVWL +PGI RT N+
Sbjct: 90 NYYFEERYDLVRFVKTVQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGI-SFRTDNEP 148
Query: 150 FMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMA 209
F MQ FT IV M K E LFASQGGPIIL+QIENEYG ++G AG++YINW AKMA
Sbjct: 149 FKTAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGQAYINWAAKMA 208
Query: 210 TSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDP 258
LD GVPW+MC+E DAP P+ F+PN P P +WTE W+GWF +GG
Sbjct: 209 VGLDTGVPWVMCKEEDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWSGWFTEFGGTIR 268
Query: 259 KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQ 318
+R EDLAFAVARF Q GG+F NYYMYHGGTNFGRT+GGP++TTSYDYDAPIDEYG + +
Sbjct: 269 QRPVEDLAFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIRE 328
Query: 319 PKWGHLRELHKLLKSMEKTL--------TYGNVTNTDYGNSVSGSS-------------- 356
PK HL+ELH+ +K E+ L T G + S SG +
Sbjct: 329 PKHSHLKELHRAVKLCEQALVSVDPTITTLGTMQEAHVFRSPSGCAAFLANYNSNSHAKV 388
Query: 357 ------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMI 410
Y+LP WS+SILPDCK FN+A V QT+ + G+ + W+ E +
Sbjct: 389 VFNNEQYSLPPWSISILPDCKNVVFNSATVGVQTS----QMQMWGDGATSMMWERYDEEV 444
Query: 411 NDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSN-MTLRINSSG 468
+ L++Q T D SDYLWY+T+ D+ + L G +L + S+G
Sbjct: 445 DSLAA--APLLTTTGLLEQLNVTRDSSDYLWYITSVDISPSENFLQGGGKPPSLSVQSAG 502
Query: 469 QVLHAYVNGNYVDSQWTKYGASNDL---FERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
LH +VNG Q + YG D + V L G N+I+LLS GL N G ++
Sbjct: 503 HALHVFVNGQL---QGSSYGTREDRRIKYNGNVNLRAGTNKIALLSVACGLPNVGVHYET 559
Query: 526 VPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNV 585
G+ GPV+L G +DL+ W+Y+VGL G ++ N+ + W ++
Sbjct: 560 WNTGVGGPVVLHGLNEGS---RDLTWQTWSYQVGLKG--EQMNLNSVEGSGSVEWMQGSL 614
Query: 586 PLNRR--MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTE 643
++ + WYK FE P ++P+ L++ MGKG W+NG ++GRYW Y DG +
Sbjct: 615 IAQKQQPLAWYKAYFETPSGDEPLALDMGSMGKGQVWINGQSIGRYWTAY---ADG-DCK 670
Query: 644 SCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEF-GGNPSQINFQTVVV 702
C Y G + + KC CG P+Q WYHVPRSW++ N LV+ EE GG+ S+I V
Sbjct: 671 GCSYTGTFRAPKCQAGCGQPTQRWYHVPRSWLQPSRNLLVVLEELGGGDSSKIALAKRSV 730
Query: 703 GTACG-------------------QAHENKTMELTC-HGRRISEIKYASFGDPQGACGAF 742
+ C + H + L C HG+ IS I++ASFG P G CG F
Sbjct: 731 SSVCADVSEDHPNIKKWQIESYGEREHRRAKVHLRCAHGQSISAIRFASFGTPVGTCGNF 790
Query: 743 KKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
++G C + ++EK+C+G + C + S N G C + T KR+ VEA+C
Sbjct: 791 QQGGCHSA-SSHAVLEKRCIGLQRCVVAISPDNFGGDPCPSVT-KRVAVEAVC 841
>gi|33521214|gb|AAQ21369.1| beta-galactosidase [Sandersonia aurantiaca]
Length = 826
Score = 666 bits (1719), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/825 (45%), Positives = 488/825 (59%), Gaps = 83/825 (10%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V +D RAITI+G+R+IL+SGSIHYPRSTP MWPDLI+KAK+GGLD I+TYVFWN HEP
Sbjct: 26 VWYDSRAITINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPSP 85
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y F GN DL+RFIK +Q GLY+ LRIGPYVCAEWN+GGFPVWL +PGI RT N+
Sbjct: 86 GKYYFEGNYDLVRFIKLVQQGGLYLHLRIGPYVCAEWNFGGFPVWLKYVPGI-HFRTDNE 144
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F EM+ FT+ IV+M K EKLF QGGPIIL+QIENE+G + D G K+Y W AKM
Sbjct: 145 PFKAEMEKFTSHIVNMMKAEKLFHWQGGPIILSQIENEFGPLEYDQGAPAKAYAAWAAKM 204
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A L+ GVPW+MC+E DAP P+ F PN P +WTENWTGWF +G
Sbjct: 205 AVDLETGVPWVMCKEDDAPDPVINTWNGFYADGFYPNKRYKPMMWTENWTGWFTGYGVPV 264
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R EDLAF+VA+F Q GG++ NYYMYHGGTNFGRT+GGP++ TSYDYDAP+DEYG L
Sbjct: 265 PHRPVEDLAFSVAKFVQKGGSYVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGMLR 324
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------------------------- 350
QPK+GHL +LHK +K E L G T GN
Sbjct: 325 QPKYGHLTDLHKAIKLCEPALVSGYPVVTSLGNNQESNVFRSNSGACAAFLANYDTKYYA 384
Query: 351 --SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
+ +G YNLP WS+SILPDCKT FNTA+V QT +++ G + + P
Sbjct: 385 TVTFNGMRYNLPPWSISILPDCKTTVFNTARVGAQT-TQMQMTTVGGFSW--VSYNEDPN 441
Query: 409 MINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
I+D G F L++Q S T D +DYLWY T ++ ++ L L S+
Sbjct: 442 SIDD------GSFTKLGLVEQISMTRDSTDYLWYTTYVNIDQNEQFLKNGQYPVLTAQSA 495
Query: 468 GQVLHAYVNGNYVDSQWTKYGASND---LFERPVKLTRGKNQISLLSATVGLQNYGSKFD 524
G LH ++NG + T YG+ D + VKL G N+IS LS VGL N G F+
Sbjct: 496 GHSLHVFINGQLIG---TAYGSVEDPRLTYTGNVKLFAGSNKISFLSIAVGLPNVGEHFE 552
Query: 525 MVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
G+ GPV L G + +DL+ KWTYK+GL G + ++N E G +S+
Sbjct: 553 TWNTGLLGPVTLNGLNEGK---RDLTWQKWTYKIGLKGEALSLHTLSGSSNVEWGDASRK 609
Query: 585 VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTES 644
PL WYK F AP ++P+ L++ MGKG W+NG ++GRYWP Y A S
Sbjct: 610 QPL----AWYKGFFNAPGGSEPLALDMSTMGKGQVWINGQSIGRYWPAYKARG---SCPK 662
Query: 645 CDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGT 704
CDY G Y KC NCG+ SQ WYHVPRSW+ N +V+FEE+GG P+ I+ + +
Sbjct: 663 CDYEGTYEETKCQSNCGDSSQRWYHVPRSWLNPTGNLIVVFEEWGGEPTGISLVKRSMRS 722
Query: 705 AC-----GQAHEN--------KTMELTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAE 750
AC GQ N + L+C G ++++IK+AS+G PQGAC ++ +G C A
Sbjct: 723 ACAYVSQGQPSMNNWHTKYAESKVHLSCDPGLKMTQIKFASYGTPQGACESYSEGRCHAH 782
Query: 751 IDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+ +K C+G++ CS+ G C G +K + V+A C
Sbjct: 783 -KSYDIFQKNCIGQQVCSVTVVPEVFGGDPC-PGIMKSVAVQASC 825
>gi|222618606|gb|EEE54738.1| hypothetical protein OsJ_02090 [Oryza sativa Japonica Group]
Length = 713
Score = 665 bits (1715), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/671 (51%), Positives = 433/671 (64%), Gaps = 96/671 (14%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D R++ IDG+R+I+LSGSIHYPRSTP MWPDLIKKAKEGGLDAIETY+FWN HEP R
Sbjct: 31 VSYDDRSLVIDGQRRIILSGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYIFWNGHEPHR 90
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
RQY+F GN D++RF K IQ+ G+Y ILRIGPY+C EWNYGG P WL ++PG++ R N+
Sbjct: 91 RQYNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQ-FRLHNE 149
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGD--AGKSYINWCA 206
F NEM+ FTTLIV+ K K+FA QGGPIILAQIENEYGN+M + + YI+WCA
Sbjct: 150 PFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIHWCA 209
Query: 207 KMATSLDIGVPWIMCQESD-APSPMFT-----------PNNPNSPKIWTENWTGWFKSWG 254
MA ++GVPWIMCQ+ D P + PN PKIWTENWTGWFK+W
Sbjct: 210 DMANKQNVGVPWIMCQQDDDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWD 269
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
D R+AED+AFAVA FFQ G+ QNYYMYHGGTNFGRTSGGPY+TTSYDYDAP+DEYG
Sbjct: 270 KPDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYG 329
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS--------------------- 353
+L QPK+GHL+ELH +LKSMEKTL +G +T+YG++++
Sbjct: 330 NLRQPKYGHLKELHSVLKSMEKTLVHGEYFDTNYGDNITVTKYTLDSSSACFINNRFDDK 389
Query: 354 -------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
G+++ LPAWSVSILPDCKT FN+AK+ TQT+V VK+PN A +Q L+W W
Sbjct: 390 DVNVTLDGATHLLPAWSVSILPDCKTVAFNSAKIKTQTSVMVKKPNTAEQEQESLKWSWM 449
Query: 407 PEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
PE ++ F+ KG+F N L++Q T+ D SDYLWY T+ + K G + L +N
Sbjct: 450 PENLSPFMTDEKGNFRKNELLEQIVTSTDQSDYLWYRTSLNHK-------GEGSYKLYVN 502
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
++G L+A+VNG + + G E PVKL GKN ISLLSATVGL+NYG F+
Sbjct: 503 TTGHELYAFVNGKLIGKNHSADGDFVFQLESPVKLHDGKNYISLLSATVGLKNYGPSFEK 562
Query: 526 VPNGIP-GPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
+P GI GPV L+ G DLS+ W+YK
Sbjct: 563 MPTGIVGGPVKLIDSNGTAI---DLSNSSWSYKA-------------------------- 593
Query: 585 VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEE-DGCSTE 643
TFEAP DPVV++L G+ KG AWVNG NLGRYWP+Y A E GC
Sbjct: 594 ------------TFEAPSGEDPVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMAGC--H 639
Query: 644 SCDYRGPYGSD 654
CDYRG + ++
Sbjct: 640 RCDYRGAFQAE 650
Score = 40.4 bits (93), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 36/65 (55%), Gaps = 6/65 (9%)
Query: 731 SFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLV 790
SFG +G CG ++ G CE++ CVGK+SC++E + A GA C +G L
Sbjct: 655 SFGVGRGRCGGYE-GGCESKA-AYEAFTAACVGKESCTVEITGAFAGA-GCLSGV---LT 708
Query: 791 VEALC 795
V+A C
Sbjct: 709 VQATC 713
>gi|242036825|ref|XP_002465807.1| hypothetical protein SORBIDRAFT_01g046160 [Sorghum bicolor]
gi|241919661|gb|EER92805.1| hypothetical protein SORBIDRAFT_01g046160 [Sorghum bicolor]
Length = 842
Score = 664 bits (1713), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/852 (42%), Positives = 492/852 (57%), Gaps = 87/852 (10%)
Query: 10 AILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
A+L C + + ++ V++D +A+ IDG+R+IL SGSIHYPRSTP MW LI+KAK+
Sbjct: 8 ALLGCAVAVAVLAAAVECAVTYDKKAVLIDGQRRILFSGSIHYPRSTPDMWEGLIQKAKD 67
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
GGLD I+TYVFWN HEP Y F DL+RFIKT+Q GL+V LRIGPY+C EWN+GG
Sbjct: 68 GGLDVIQTYVFWNGHEPTPGNYYFEERYDLVRFIKTVQKAGLFVHLRIGPYICGEWNFGG 127
Query: 130 FPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGN 189
FPVWL +PGI RT N+ F MQ FT IV M K EKLFASQGGPIIL+QIENEYG
Sbjct: 128 FPVWLKYVPGI-SFRTDNEPFKTAMQGFTEKIVGMMKSEKLFASQGGPIILSQIENEYGP 186
Query: 190 VMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNS 238
+ G AG++YINW AKMA L GVPW+MC+E DAP P+ F+PN P
Sbjct: 187 EGKELGAAGQAYINWAAKMAIGLGTGVPWVMCKEEDAPDPVINACNGFYCDAFSPNKPYK 246
Query: 239 PKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGP 298
P +WTE W+GWF +GG +R EDLAFAVARF Q GG+F NYYMYHGGTNFGRT+GGP
Sbjct: 247 PTMWTEAWSGWFTEFGGTIRQRPVEDLAFAVARFVQKGGSFINYYMYHGGTNFGRTAGGP 306
Query: 299 YLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTL--------TYGNVTNTDYGN 350
++TTSYDYDAPIDEYG + +PK HL+ELH+ +K E+ L T G +
Sbjct: 307 FITTSYDYDAPIDEYGLVREPKHSHLKELHRAVKLCEQALVSVDPAITTLGTMQEAHVFR 366
Query: 351 SVSGSS--------------------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR 390
S SG + Y+LP WS+SILPDCK FN+A V QT+ +
Sbjct: 367 SPSGCAAFLANYNSNSYAKVVFNNEQYSLPPWSISILPDCKNVVFNSATVGVQTS----Q 422
Query: 391 PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKD 449
G+ + + W+ E ++ L++Q T D SDYLWY+T+ D+
Sbjct: 423 MQMWGDGASSMMWERYDEEVDSLAA--APLLTTTGLLEQLNVTRDSSDYLWYITSVDISP 480
Query: 450 DDPILSGSSN-MTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDL---FERPVKLTRGKN 505
+ L G ++L + S+G LH +VNG + Q + YG D + L G N
Sbjct: 481 SENFLQGGGKPLSLSVLSAGHALHVFVNG---ELQGSAYGTREDRRIKYNGNANLRAGTN 537
Query: 506 QISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDD 565
+I+LLS GL N G ++ G+ GP VG G +DL+ W+Y+VGL G +
Sbjct: 538 KIALLSVACGLPNVGVHYETWNTGVGGP---VGLHGLNEGSRDLTWQTWSYQVGLKG--E 592
Query: 566 KKFYNAKAANSERGWSSKNVPLNRR--MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNG 623
+ N+ ++ W ++ + ++WY+ FE P ++P+ L++ MGKG W+NG
Sbjct: 593 QMNLNSLEGSTSVEWMQGSLIAQNQQPLSWYRAYFETPSGDEPLALDMGSMGKGQIWING 652
Query: 624 YNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLV 683
++GRYW Y DG E C Y G + + KC CG P+Q WYHVPRSW++ N LV
Sbjct: 653 QSIGRYWTAY---ADGDCKE-CSYTGTFRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLV 708
Query: 684 LFEEFGGNPSQINFQTVVVGTACGQAHEN----KTMELTCHGRR---------------- 723
+FEE GG+ S+I V + C E+ K ++ +G R
Sbjct: 709 VFEELGGDSSKIALVKRSVSSVCADVSEDHPNIKNWQIESYGEREYHRAKVHLRCSPGQS 768
Query: 724 ISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAA 783
IS IK+ASFG P G CG F++G C + + ++EK+C+G + C++ S + G C
Sbjct: 769 ISAIKFASFGTPMGTCGNFQQGDCHSA-NSHTVLEKKCIGLQRCAVAISPESFGGDPCPR 827
Query: 784 GTVKRLVVEALC 795
T KR+ VEA+C
Sbjct: 828 VT-KRVAVEAVC 838
>gi|332105893|gb|AEE01408.1| beta-galactosidase STBG2 [Solanum lycopersicum]
Length = 892
Score = 663 bits (1711), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/868 (42%), Positives = 487/868 (56%), Gaps = 121/868 (13%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ V++D RA+ I G+R++L+S IHYPR+TP MWP LI ++KEGG D IETY FWN HEP
Sbjct: 35 FNVTYDNRALIIGGKRRMLISAGIHYPRATPEMWPTLIARSKEGGADVIETYTFWNGHEP 94
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
R QY+F G D+++F K + GL++ +RIGPY CAEWN+GGFP+WL ++PGIE RT
Sbjct: 95 TRGQYNFEGRYDIVKFAKLVGSHGLFLFIRIGPYACAEWNFGGFPIWLRDIPGIE-FRTD 153
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
N F EM+ + IVD+ E LF+ QGGPIIL QIENEYGNV S +G GK Y+ W A
Sbjct: 154 NAPFKEEMERYVKKIVDLMISESLFSWQGGPIILLQIENEYGNVESTFGPKGKLYMKWAA 213
Query: 207 KMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGG 255
+MA L GVPW+MC+++DAP + FTPN+ PKIWTENW GWF WG
Sbjct: 214 EMAVGLGAGVPWVMCRQTDAPEYIIDTCNAYYCDGFTPNSEKKPKIWTENWNGWFADWGE 273
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
+ P R +ED+AFA+ARFFQ GG+ QNYYMY GGTNFGRT+GGP TSYDYDAP+DEYG
Sbjct: 274 RLPYRPSEDIAFAIARFFQRGGSLQNYYMYFGGTNFGRTAGGPTQITSYDYDAPLDEYGL 333
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYGN--------------------------------- 342
L QPKWGHL++LH +K E L +
Sbjct: 334 LRQPKWGHLKDLHAAIKLCEPALVAADSPQYIKLGPKQEAHVYRGTSNNIGQYMSLNEGI 393
Query: 343 ----VTNTDYGNSVS----GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPN-- 392
+ N D S + G + LP WSVSILPDC+ FNTAKV QT++K +
Sbjct: 394 CAAFIANIDEHESATVKFYGQEFTLPPWSVSILPDCRNTAFNTAKVGAQTSIKTVGSDSV 453
Query: 393 QAGNDQAPLQ-------------WKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDY 438
GN+ LQ W E + V G +F +++ T D SDY
Sbjct: 454 SVGNNSLFLQVITKSKLESFSQSWMTLKEPLG---VWGDKNFTSKGILEHLNVTKDQSDY 510
Query: 439 LWYMTNADLKDDDPILSGSSNM--TLRINSSGQVLHAYVNGNY---VDSQWTKYGASNDL 493
LWY+T + DDD +++ T+ I+S + +VNG V +W K
Sbjct: 511 LWYLTRIYISDDDISFWEENDVSPTIDIDSMRDFVRIFVNGQLAGSVKGKWIK------- 563
Query: 494 FERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVG-RAGDETIIKDLSSH 552
+PVKL +G N I LLS TVGLQNYG+ + G G + L G ++GD +L++
Sbjct: 564 VVQPVKLVQGYNDILLLSETVGLQNYGAFLEKDGAGFKGQIKLTGCKSGD----INLTTS 619
Query: 553 KWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRM-TWYKTTFEAPLENDPVVLNL 611
WTY+VGL G + + Y+ + S GW+ + +WYKT F+AP DPV L+
Sbjct: 620 LWTYQVGLRG-EFLEVYDVNSTESA-GWTEFPTGTTPSVFSWYKTKFDAPGGTDPVALDF 677
Query: 612 QGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVP 671
MGKG AWVNG+++GRYW T +A +GC +CDYRG Y SDKC NCG +Q WYH+P
Sbjct: 678 SSMGKGQAWVNGHHVGRYW-TLVAPNNGCG-RTCDYRGAYHSDKCRTNCGEITQAWYHIP 735
Query: 672 RSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHE-------------------- 711
RSW+K N LV+FEE P I+ T T C Q E
Sbjct: 736 RSWLKTLNNVLVIFEEIDKTPFDISISTRSTETICAQVSEKHYPPLHKWSHSEFDRKLSL 795
Query: 712 -NKT--MELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSC 767
+KT M L C G IS I++AS+G P G+C F +G C A + L ++ + C+G+ SC
Sbjct: 796 MDKTPEMHLQCDEGHTISSIEFASYGSPNGSCQKFSQGKCHAA-NSLSVVSQACIGRTSC 854
Query: 768 SIEASEANLGATSCAAGTVKRLVVEALC 795
SI S G VK L V+A C
Sbjct: 855 SIGISNGVFGDP--CRHVVKSLAVQAKC 880
>gi|302824860|ref|XP_002994069.1| hypothetical protein SELMODRAFT_187747 [Selaginella moellendorffii]
gi|300138075|gb|EFJ04856.1| hypothetical protein SELMODRAFT_187747 [Selaginella moellendorffii]
Length = 741
Score = 662 bits (1709), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/739 (46%), Positives = 443/739 (59%), Gaps = 75/739 (10%)
Query: 22 NLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFW 81
++ L+ V++D R + I+G+ ++L+S SIHYPR+ P MW LI AK GG+D IETYVFW
Sbjct: 19 HVGLSDTVAYDHRGLIINGQHRMLISASIHYPRAAPQMWSQLISNAKAGGIDVIETYVFW 78
Query: 82 NAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIE 141
+ H+P R Y+F G DL+ F+K + + GLY LRIGPYVCAEWN GGFPVWL ++ GIE
Sbjct: 79 DGHQPTRDTYNFEGRFDLVSFVKLVHEAGLYANLRIGPYVCAEWNLGGFPVWLKDVAGIE 138
Query: 142 ELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSY 201
RT N+ F EMQ F IV M K +KLFA QGGPIILAQIENEYGN+ + YG AGK Y
Sbjct: 139 -FRTNNQPFKAEMQTFVEKIVAMMKHDKLFAPQGGPIILAQIENEYGNIDAAYGAAGKEY 197
Query: 202 INWCAKMATSLDIGVPWIMCQESDAPSPMF-----------TPNNPNSPKIWTENWTGWF 250
+ W A M+ L GVPWIMCQ+SDAP + PNN PK+WTENW+GWF
Sbjct: 198 MVWAANMSQGLGTGVPWIMCQQSDAPDYILDTCNGFYCDAWAPNNKKKPKMWTENWSGWF 257
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPI 310
+ WG P R ED+AFAVARFFQ GG+FQNYYMY GGTNFGR+SGGPY+TTSYDYDAPI
Sbjct: 258 QKWGEASPHRPVEDVAFAVARFFQRGGSFQNYYMYFGGTNFGRSSGGPYVTTSYDYDAPI 317
Query: 311 DEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTD---------YGNSVSGS------ 355
DE+G + QPKWGHL++LH +K E L + T YG++ SG+
Sbjct: 318 DEFGVIRQPKWGHLKQLHAAIKLCEAALGSNDPTYISLGQLQEAHVYGSTSSGACAAFLA 377
Query: 356 ---------------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAP 400
+Y LPAWSVSILPDCKT NTAKV+ QT + +P+ G
Sbjct: 378 NIDSSSDATVKFNSRTYLLPAWSVSILPDCKTVSHNTAKVDVQTAMPTMKPSITG----- 432
Query: 401 LQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNM 460
L W+ PE + V G A L +T D SDYLWY T+ D+ D + S
Sbjct: 433 LAWESYPEPVG--VWSDSGIVASALLEQINTTKDTSDYLWYTTSLDISQAD---AASGKA 487
Query: 461 TLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYG 520
L + S V+H +VNG S TK E+P++L G N +++L ATVGLQNYG
Sbjct: 488 LLYLESMRDVVHVFVNGKLAGSASTKGTQLYAAVEQPIELASGHNSLAILCATVGLQNYG 547
Query: 521 SKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGW 580
+ GI G V++ G + DL++ +W ++VGL G F + S+R
Sbjct: 548 PFIETWGAGINGSVIVKGLPSGQI---DLTAEEWIHQVGLKGESLAIF---TESGSQRVR 601
Query: 581 SSKNVPLNRRMTWYKTTFE-----------------APLENDPVVLNLQGMGKGFAWVNG 623
S VP + + WYK F+ +P NDPV L+L+ MGKG AW+NG
Sbjct: 602 WSSAVPQGQALVWYKVIFQHHGITCIVWIAMQAHFDSPSGNDPVALDLESMGKGQAWING 661
Query: 624 YNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLV 683
++GR+WP+ A + ++CDYRG Y S KC CG PSQ WYHVPRSW++DG N +V
Sbjct: 662 QSIGRFWPSLRAPDTAGCPQTCDYRGSYSSSKCRSGCGQPSQRWYHVPRSWLQDGGNLVV 721
Query: 684 LFEEFGGNPSQINFQTVVV 702
LFEE GG PS ++F T V
Sbjct: 722 LFEEEGGKPSGVSFVTRTV 740
>gi|449464712|ref|XP_004150073.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 848
Score = 657 bits (1694), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/850 (42%), Positives = 483/850 (56%), Gaps = 85/850 (10%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
LC L F L+ V++DG+A+ I+G+RKIL SGSIHYPRS P MW LI+KAK G
Sbjct: 14 FFLCWSLH--FQLTNCENVTYDGKALIINGQRKILFSGSIHYPRSVPDMWESLIEKAKMG 71
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GLD ++TYVFWN HEP YDF G DL++FIK ++ GLYV LRIGPY+C EWN+GGF
Sbjct: 72 GLDVVDTYVFWNLHEPSPGIYDFEGRNDLVKFIKLVEKAGLYVHLRIGPYICGEWNFGGF 131
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
P WL +PGI RT N+ F M FT IV M K E+LF SQGGPIIL+QIENEY
Sbjct: 132 PAWLKFVPGI-SFRTDNEPFKLAMAKFTKKIVQMMKDERLFQSQGGPIILSQIENEYETE 190
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSP 239
+G+AG +Y+NW AKMA +D GVPW+MC++ DAP PM F+PN P P
Sbjct: 191 DKVFGEAGFAYMNWAAKMAVQMDTGVPWVMCKQDDAPDPMINTCNGFYCDYFSPNKPYKP 250
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
WTE WT WF ++GG + KR EDLAF VARF Q GG+ NYYMYHGGTNFGRT+GGP+
Sbjct: 251 NFWTEAWTAWFNNFGGPNHKRPVEDLAFGVARFIQKGGSLVNYYMYHGGTNFGRTAGGPF 310
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYG-------------NVTNT 346
+TTSYDYDAPIDEYG + QPK+GHL+ LH +K EK L G V ++
Sbjct: 311 ITTSYDYDAPIDEYGLIRQPKFGHLKRLHDAVKLCEKALLTGEPHDYTLATYQKAKVFSS 370
Query: 347 DYGN----------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR 390
G+ + +G Y LP WS+SILPDCK+ +NTA+V QTN
Sbjct: 371 SSGDCAAFLSNYHSNNTARVTFNGRHYTLPPWSISILPDCKSVIYNTAQVQVQTNQLSFL 430
Query: 391 PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKD 449
P + + W+ E I+ + + + L++Q + T D SDYLWY T+ ++
Sbjct: 431 PTKVES----FSWETYNENISS--IEEDSSMSYDGLLEQLTITKDNSDYLWYTTSVNVDP 484
Query: 450 DDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISL 509
++ L G TL S G +H ++NG S + + S F + L G N++SL
Sbjct: 485 NESYLRGGKFPTLTATSKGHGMHVFINGKLAGSSFGTHDNSKFTFTGRINLQAGVNKVSL 544
Query: 510 LSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFY 569
LS GL N G ++ G+ GPV + G + DLS KW+YKVGL G +
Sbjct: 545 LSIAGGLPNNGPHYEEREMGVLGPVAIHGLDKGKM---DLSRQKWSYKVGLKG--ENMNL 599
Query: 570 NAKAANSERGWSSKNVPLN--RRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLG 627
+ ++ W+ ++ + +TWYK F+AP ++P+ L++ M KG W+NG N+G
Sbjct: 600 GSPSSVQAVDWAKDSLKQENAQPLTWYKAYFDAPEGDEPLALDMGSMQKGQVWINGQNVG 659
Query: 628 RYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEE 687
RYW + C+ C Y G Y KC + CG P+Q WYHVPRSW+ N +V+FEE
Sbjct: 660 RYWT--ITANGNCT--DCSYSGTYRPRKCQFGCGQPTQQWYHVPRSWLMPTKNLIVVFEE 715
Query: 688 FGGNPSQINFQTVVVGTAC-------------------GQAHENKTMELTCH---GRRIS 725
GGNPS+I+ V + C G+ +E +++ H G+ IS
Sbjct: 716 VGGNPSRISLVKRSVTSICTEASQYRPVIKNVHMHQNNGELNEQNVLKINLHCAAGQFIS 775
Query: 726 EIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGT 785
IK+ASFG P GACG+ K+G+C + +++K CVG++ C + G C
Sbjct: 776 AIKFASFGTPSGACGSHKQGTCHSPKSDY-VLQKLCVGRQRCLATIPTSIFGEDPC-PNL 833
Query: 786 VKRLVVEALC 795
K+L E +C
Sbjct: 834 RKKLSAEVVC 843
>gi|414878434|tpg|DAA55565.1| TPA: hypothetical protein ZEAMMB73_938277 [Zea mays]
Length = 918
Score = 656 bits (1692), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/862 (42%), Positives = 490/862 (56%), Gaps = 110/862 (12%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ V++D RA+ + G+R++L+S +HYPR+TP MWP LI K KEGG+DAIETYVFWN HEP
Sbjct: 61 FNVTYDHRALILGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGVDAIETYVFWNGHEP 120
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+ QY F G D++RF K + +GL++ LRIGPY CAEWN+GGFPVWL ++PGIE RT
Sbjct: 121 AKGQYYFEGRFDIVRFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDVPGIE-FRTD 179
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
N+ + EMQ F T IVD+ K+EKL++ QGGPIIL QIENEYGN+ YG AGK Y+ W A
Sbjct: 180 NEPYKAEMQIFVTKIVDIMKEEKLYSWQGGPIILQQIENEYGNIQGHYGQAGKRYMLWAA 239
Query: 207 KMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGG 255
+MA +LD GVPW+MC+++DAP + F PN+ N P IWTE+W GW+ WG
Sbjct: 240 QMALALDTGVPWVMCRQTDAPEQILNTCNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGE 299
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
P R A+D AFAVARF+Q GG+ QNYYMY GGTNF RT+GGP TSYDYDAPIDEYG
Sbjct: 300 SLPHRPAQDSAFAVARFYQRGGSLQNYYMYFGGTNFERTAGGPLQITSYDYDAPIDEYGI 359
Query: 316 LNQPKWGHLRELHKLLKSMEKTLT------------------------------------ 339
L QPKWGHL++LH +K E LT
Sbjct: 360 LRQPKWGHLKDLHAAIKLCESALTAVDGSPHYVKLGPMQEAHVYSSENVHTNGSISGNSQ 419
Query: 340 -----YGNVTNTDYGNS-VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQT---NVKVKR 390
N+ Y + + G SY+LP WSVSILPDC+T FNTA+V TQT NV+
Sbjct: 420 FCSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCETVAFNTARVGTQTSFFNVESGS 479
Query: 391 PNQAGNDQAPL-----------QWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDY 438
P+ + + + W E + + G+G F +++ T D+SDY
Sbjct: 480 PSYSSRHKPRILSLIGVPYLSTTWWTFKEPVG---IWGEGIFTAQGILEHLNVTKDISDY 536
Query: 439 LWYMTNADLKDDDPILSGSSNM--TLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFER 496
L Y T ++ ++D + S +L I+ V +VNG S+ + + N +
Sbjct: 537 LSYTTRVNISEEDVLYWNSKGFLPSLTIDQIRDVARVFVNGKLAGSKVGHWVSLN----Q 592
Query: 497 PVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTY 556
P++L +G N+++LLS VGLQNYG+ + G G V L G + + DL++ WTY
Sbjct: 593 PLQLVQGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVKLTGLSNGDI---DLTNSLWTY 649
Query: 557 KVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGK 616
++GL G + + Y+ + S S +N TW+KT F+AP N PV ++L MGK
Sbjct: 650 QIGLKG-EFSRIYSPEYQGSAEWSSMQNDDTVSPFTWFKTMFDAPEGNGPVTIDLGSMGK 708
Query: 617 GFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIK 676
G AWVNG+ +GRYW + +A E GC + SC+Y G Y KC NCG +Q WYH+PR W++
Sbjct: 709 GQAWVNGHLIGRYW-SLVAPESGCPS-SCNYAGTYSDSKCRSNCGIATQSWYHIPREWLQ 766
Query: 677 DGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHEN----------------------KT 714
+ N LVLFEE GG+PSQI+ + T C + E
Sbjct: 767 ESGNLLVLFEETGGDPSQISLEVHYTKTICSKISETYYPPLSAWSRAANGRPSVNTVAPE 826
Query: 715 MELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASE 773
+ L C G IS+I +AS+G P G C F G+C A L L+ + C GK C+I +
Sbjct: 827 LRLQCDDGHVISKITFASYGTPTGGCQNFSVGNCHAST-TLDLVVEACEGKNRCAISVTN 885
Query: 774 ANLGATSCAAGTVKRLVVEALC 795
G C VK L VEA C
Sbjct: 886 EVFG-DPCRK-VVKDLAVEAEC 905
>gi|108706355|gb|ABF94150.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 819
Score = 654 bits (1686), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/795 (44%), Positives = 468/795 (58%), Gaps = 83/795 (10%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D +A+ +DG+R+IL SGSIHYPRSTP MW LI+KAK+GGLD I+TYVFWN HEP
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y+F G DL+RFIKT+Q G++V LRIGPY+C EWN+GGFPVWL +PGI RT N+
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGI-SFRTDNE 145
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F N MQ FT IV M K E LFASQGGPIIL+QIENEYG ++G AGK+YINW AKM
Sbjct: 146 PFKNAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGKAYINWAAKM 205
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A LD GVPW+MC+E DAP P+ F+PN P P +WTE W+GWF +GG
Sbjct: 206 AVGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSGWFTEFGGTI 265
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
+R EDLAF VARF Q GG+F NYYMYHGGTNFGRT+GGP++TTSYDYDAP+DEYG
Sbjct: 266 RQRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLAR 325
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------SVSGSS------------- 356
+PK+GHL+ELH+ +K E+ L + T T G+ S SG +
Sbjct: 326 EPKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVFRSSSGCAAFLANYNSNSYAK 385
Query: 357 -------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEM 409
Y+LP WS+SILPDCK FNTA V QTN + + + + W+ E
Sbjct: 386 VIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTN----QMQMWADGASSMMWEKYDEE 441
Query: 410 INDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSG 468
++ L++Q T D SDYLWY+T+ ++ + L G + ++L + S+G
Sbjct: 442 VDSLAA--APLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPSEKFLQGGTPLSLTVQSAG 499
Query: 469 QVLHAYVNGNYVDSQWTKYGASNDL---FERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
LH ++NG Q + YG D + L G N+++LLS GL N G ++
Sbjct: 500 HALHVFINGQL---QGSAYGTREDRKISYSGNANLRAGTNKVALLSVACGLPNVGVHYET 556
Query: 526 VPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNV 585
G+ GPV++ G DE +DL+ W+Y+VGL G + + + + S V
Sbjct: 557 WNTGVVGPVVIHGL--DEG-SRDLTWQTWSYQVGLKG-EQMNLNSLEGSGSVEWMQGSLV 612
Query: 586 PLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTES 644
N++ + WY+ F+ P ++P+ L++ MGKG W+NG ++GRYW Y AE D +
Sbjct: 613 AQNQQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAY-AEGD---CKG 668
Query: 645 CDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGT 704
C Y G Y + KC CG P+Q WYHVPRSW++ N LV+FEE GG+ S+I V
Sbjct: 669 CHYTGSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALAKRTVSG 728
Query: 705 ACGQAHE-------------------NKTMELTCH-GRRISEIKYASFGDPQGACGAFKK 744
C E + L C G+ IS IK+ASFG P G CG F++
Sbjct: 729 VCADVSEYHPNIKNWQIESYGEPEFHTAKVHLKCAPGQTISAIKFASFGTPLGTCGTFQQ 788
Query: 745 GSCEAEIDVLPLIEK 759
G C + I+ ++EK
Sbjct: 789 GECHS-INSNSVLEK 802
>gi|3299896|gb|AAC25984.1| beta-galactosidase [Solanum lycopersicum]
Length = 724
Score = 652 bits (1682), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/719 (49%), Positives = 443/719 (61%), Gaps = 58/719 (8%)
Query: 21 FNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVF 80
F S+ VS+D RAI I+G+RKIL+SGSIHYPRSTP MWPDLI+KAK+GGLD IETYVF
Sbjct: 17 FFSSVKASVSYDDRAIIINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVF 76
Query: 81 WNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGI 140
WN HEP +Y+F G DL+RFIK +Q GLYV LRIGPYVCAEWN+GGFPVWL +PG+
Sbjct: 77 WNGHEPSPGKYNFEGRYDLVRFIKMVQRAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGM 136
Query: 141 EELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKS 200
E RT N+ F MQ F IV+M K E LF SQGGPII+AQIENEYG V + G GK+
Sbjct: 137 -EFRTNNQPFKVAMQGFVQKIVNMMKSENLFESQGGPIIMAQIENEYGPVEWEIGAPGKA 195
Query: 201 YINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGW 249
Y W A+MA L GVPWIMC++ DAP P+ F PN P PK+WTE WTGW
Sbjct: 196 YTKWAAQMAVGLKTGVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKPYKPKMWTEVWTGW 255
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAP 309
+ +GG P+R AED+AF+VARF Q G+F NYYMYHGGTNFGRTS G ++ TSYDYDAP
Sbjct: 256 YTKFGGPIPQRPAEDIAFSVARFVQNNGSFFNYYMYHGGTNFGRTSSGLFIATSYDYDAP 315
Query: 310 IDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------SVSGS------ 355
+DEYG LN+PK+GHLR+LHK +K E L T G+ S SG+
Sbjct: 316 LDEYGLLNEPKYGHLRDLHKAIKLSEPALVSSYAAVTSLGSNQEAHVYRSKSGACAAFLS 375
Query: 356 ---------------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAP 400
YNLP WS+SILPDCKT +NTA+VN+Q++ P G
Sbjct: 376 NYDSRYSVKVTFQNRPYNLPPWSISILPDCKTAVYNTAQVNSQSSSIKMTPAGGG----- 430
Query: 401 LQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSN 459
L W+ E N L +QK+ T D SDYLWYMTN ++ ++ L +
Sbjct: 431 LSWQSYNEETP--TADDSDTLTANGLWEQKNVTRDSSDYLWYMTNVNIASNEGFLKNGKD 488
Query: 460 MTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNY 519
L + S+G VLH +VNG + + + VKL G N+ISLLS +VGL N
Sbjct: 489 PYLTVMSAGHVLHVFVNGKLSGTVYGTLDNPKLTYSGNVKLRAGINKISLLSVSVGLPNV 548
Query: 520 GSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERG 579
G +D G+ GPV L G ++L+ KW+YKVGL G + +++ E
Sbjct: 549 GVHYDTWNAGVLGPVTLSGLNEGS---RNLAKQKWSYKVGLKGESLSLHSLSGSSSVE-- 603
Query: 580 WSSKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEED 638
W ++ ++ +TWYK TF AP NDP+ L++ MGKG W+NG +GR+WP Y+A+ D
Sbjct: 604 WVRGSLMAQKQPLTWYKATFNAPGGNDPLALDMASMGKGQIWINGEGVGRHWPGYIAQGD 663
Query: 639 GCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
CS C Y G + KC NCG PSQ WYHVPRSW+K N LV+FEE+GGNP+ I+
Sbjct: 664 -CS--KCSYAGTFNEKKCQTNCGQPSQRWYHVPRSWLKPSGNLLVVFEEWGGNPTGISL 719
>gi|449526237|ref|XP_004170120.1| PREDICTED: beta-galactosidase 7-like, partial [Cucumis sativus]
Length = 706
Score = 650 bits (1677), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/656 (49%), Positives = 427/656 (65%), Gaps = 53/656 (8%)
Query: 183 IENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------F 231
IENE+GNV YG GK Y+ WCA++A S ++ PWIMCQ+ DAP P+ F
Sbjct: 1 IENEFGNVEGSYGQEGKEYVKWCAELAQSYNLSEPWIMCQQGDAPQPIINTCNGFYCDQF 60
Query: 232 TPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNF 291
PNN NSPK+WTE+W GWFK WG +DP RTAEDLAFAVARFFQ+GG+ NYYMYHGGTNF
Sbjct: 61 KPNNKNSPKMWTESWAGWFKGWGERDPYRTAEDLAFAVARFFQYGGSLHNYYMYHGGTNF 120
Query: 292 GRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNS 351
GR++GGPY+TTSYDY+AP+DEYG++NQPKWGHL++LH+L++SMEK LTYG+V + D G+S
Sbjct: 121 GRSAGGPYITTSYDYNAPLDEYGNMNQPKWGHLKQLHELIRSMEKVLTYGDVKHIDTGHS 180
Query: 352 VSGSS---------------------------YNLPAWSVSILPDCKTEEFNTAKVNTQT 384
+ +S Y +P WSV++LPDCKTE +NTAKVNTQT
Sbjct: 181 TTATSYTYKGKSSCFFGNPENSDREITFQERKYTVPGWSVTVLPDCKTEVYNTAKVNTQT 240
Query: 385 NVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGK---GHFALNTLIDQKS-TNDVSDYLW 440
++ P+ G + PL+W+WR E I G N+LIDQK TND SDYLW
Sbjct: 241 TIREMVPSLVGKHKKPLKWQWRNEKIEHLTHEGDISGSAITANSLIDQKMVTNDSSDYLW 300
Query: 441 YMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVK- 499
Y+T L +DP+ +TLR+ + G +LHA+VN ++ +Q+ YG + E+ V+
Sbjct: 301 YLTGFHLNGNDPLF--GKRVTLRVKTRGHILHAFVNNKHIGTQFGPYGKYSFTLEKKVRN 358
Query: 500 LTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVG 559
L G NQI+LLSATVGL NYG+ ++ V GI GPV L+ D I+DLS+++W YKVG
Sbjct: 359 LRHGFNQIALLSATVGLPNYGAYYENVEVGIYGPVELI---ADGKTIRDLSTNEWIYKVG 415
Query: 560 LYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFA 619
L G + +F++ + W S N+PLN+ TWYKT+F P + VV++L GMGKG A
Sbjct: 416 LDG-EKYEFFDPD-HKFRKPWLSNNLPLNQNFTWYKTSFSTPKGREGVVVDLMGMGKGQA 473
Query: 620 WVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGV 679
WVNG ++GRYWP+YLA E+GCS+ SCDYRG Y KCA NCG P+Q WYH+PRS++ DG
Sbjct: 474 WVNGKSIGRYWPSYLATENGCSS-SCDYRGAYYGSKCATNCGKPTQRWYHIPRSYMNDGK 532
Query: 680 -NTLVLFEEFGGNPSQINFQTVVVGTACGQAHENKTMELTCHGRRISEIKYASFGDPQGA 738
NTL+LFEEFGG P I +T V C + +ELTCH R + I + FG+P+G
Sbjct: 533 ENTLILFEEFGGMPLNIEIKTTRVKKVCAKVDLGSKLELTCHDRTVKRIIFVGFGNPKGN 592
Query: 739 CGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEAL 794
C F KGSC + + +IEK+C+ K+ CSIE ++ LG T C L V+
Sbjct: 593 CNNFHKGSCHSS-EAFSVIEKECLWKRKCSIEVTKDKLGLTGCKNPKDNWLAVQPF 647
>gi|308550950|gb|ADO34789.1| beta-galactosidase STBG4 [Solanum lycopersicum]
Length = 724
Score = 649 bits (1675), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/719 (48%), Positives = 441/719 (61%), Gaps = 58/719 (8%)
Query: 21 FNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVF 80
F S+ VS+D RAI I+G+RKIL+SGSIHYPRSTP MWPDLI+KAK+GGLD IETYVF
Sbjct: 17 FFSSVKASVSYDDRAIIINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVF 76
Query: 81 WNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGI 140
WN HEP +Y+F G DL+RFIK +Q GLYV LRIGPYVCAEWN+GGFPVWL +PG+
Sbjct: 77 WNGHEPSPGKYNFEGRYDLVRFIKMVQRAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGM 136
Query: 141 EELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKS 200
E RT N+ F MQ F IV+M K E LF SQGGPII+AQIENEYG V + G GK+
Sbjct: 137 -EFRTNNQPFKVAMQGFVQKIVNMMKSENLFESQGGPIIMAQIENEYGPVEWEIGAPGKA 195
Query: 201 YINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGW 249
Y W A+MA L GVPWIMC+ DAP P+ F PN P PK+WTE WTGW
Sbjct: 196 YTKWAAQMAVGLKTGVPWIMCKREDAPDPVIDTCNGFYCEGFRPNKPYKPKMWTEVWTGW 255
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAP 309
+ +GG P+R AED+AF+VARF Q G+F NYYMYHGGTNFGRTS G ++ TSYDYDAP
Sbjct: 256 YTKFGGPIPQRPAEDIAFSVARFVQNNGSFFNYYMYHGGTNFGRTSSGLFIATSYDYDAP 315
Query: 310 IDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------SVSGS------ 355
+DEYG LN+PK+GHLR+LHK +K E L T G+ S SG+
Sbjct: 316 LDEYGLLNEPKYGHLRDLHKAIKLSEPALVSSYAAVTSLGSNQEAHVYRSKSGACAAFLS 375
Query: 356 ---------------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAP 400
YNLP WS+SILPDCKT +NTA+VN+Q++ P G
Sbjct: 376 NYDSRYSVKVTFQNRPYNLPPWSISILPDCKTAVYNTAQVNSQSSSIKMTPAGGG----- 430
Query: 401 LQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSN 459
L W+ E N L +QK+ T D SDYLWYMTN ++ ++ L +
Sbjct: 431 LSWQSYNEETP--TADDSDTLTANGLWEQKNVTRDSSDYLWYMTNVNIASNEGFLRNGKD 488
Query: 460 MTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNY 519
L + S+G VLH +VNG + + + VKL G N+ISLLS +VGL N
Sbjct: 489 PYLTVMSAGHVLHVFVNGKLSGTVYGTLDNPKLTYSGNVKLRAGINKISLLSVSVGLPNV 548
Query: 520 GSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERG 579
G +D G+ GPV L G ++L+ KW+YKVGL G + +++ E
Sbjct: 549 GVHYDTWNAGVLGPVTLSGLNEGS---RNLAKQKWSYKVGLKGESLSLHSLSGSSSVE-- 603
Query: 580 WSSKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEED 638
W ++ ++ +TWYK TF AP NDP+ L + MGKG W+NG +GR+WP Y+A+ D
Sbjct: 604 WVRGSLVAQKQPLTWYKATFNAPGGNDPLALGMASMGKGQIWINGEGVGRHWPGYIAQGD 663
Query: 639 GCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
CS C Y G + KC NCG PSQ W+HVPRSW+K N LV+FEE+GGNP+ I+
Sbjct: 664 -CS--KCSYAGTFNEKKCQTNCGQPSQRWHHVPRSWLKPSGNLLVVFEEWGGNPTGISL 719
>gi|54111247|dbj|BAC10578.2| beta-galactosidase [Capsicum annuum]
Length = 724
Score = 648 bits (1672), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/734 (48%), Positives = 447/734 (60%), Gaps = 62/734 (8%)
Query: 8 SRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKA 67
S +LL +++ +L + VS+D RAI I+G+RKIL+SGSIHYPRSTP MWPDLI+KA
Sbjct: 4 SNNVLLVVLVICSLDLLVKANVSYDDRAIVINGKRKILISGSIHYPRSTPQMWPDLIQKA 63
Query: 68 KEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNY 127
K+GGLD IETYVFWN HEP +Y+F G DL++FIK +Q GLYV LRIGPY+CAEWN+
Sbjct: 64 KDGGLDVIETYVFWNGHEPSPGKYNFEGRYDLVKFIKLVQGAGLYVNLRIGPYICAEWNF 123
Query: 128 GGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEY 187
GG PVWL + G+ E RT N+ F MQ F IV M K EKLF QGGPII+AQIENEY
Sbjct: 124 GGLPVWLKYVSGM-EFRTDNQPFKVAMQGFVQKIVSMMKSEKLFEPQGGPIIMAQIENEY 182
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNP 236
G V + G GK+Y W A+MA L VPWIMC++ DAP P+ F PN P
Sbjct: 183 GPVEWEIGAPGKAYTKWAAQMAVGLKTDVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKP 242
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
PK+WTE WTGWF +GG P+R AED+AF+VARF Q G++ NYYMYHGGTNFGRTS
Sbjct: 243 YKPKMWTEVWTGWFTKFGGPIPQRPAEDIAFSVARFVQNNGSYFNYYMYHGGTNFGRTSS 302
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN------ 350
G ++ TSYDYDAPIDEYG LN+PK+GHLRELHK +K E L T T G+
Sbjct: 303 GLFIATSYDYDAPIDEYGLLNEPKYGHLRELHKAIKQCEPALVSSYPTVTSLGSNQEAHV 362
Query: 351 --SVSGS---------------------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK 387
S SG+ Y+LP WS+SILPDCKT +NTAKV++Q +
Sbjct: 363 YRSKSGACAAFLSNYDAKYSVRVSFQNLPYDLPPWSISILPDCKTVVYNTAKVSSQGSSI 422
Query: 388 VKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNAD 446
P G L W+ E + N L +Q++ T D SDYLWYMT+ +
Sbjct: 423 KMTPAGGG-----LSWQSYNE--DTPTADDSDTLRANGLWEQRNVTRDSSDYLWYMTDIN 475
Query: 447 LKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASND---LFERPVKLTRG 503
+ ++ L + L + S+G VLH +VNG T YGA ++ + VKL G
Sbjct: 476 IASNEGFLKSGKDPYLTVMSAGHVLHVFVNGKLAG---TVYGALDNPKLTYSGNVKLNAG 532
Query: 504 KNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGL 563
N+ISLLS +VGL N G +D G+ GPV L G +DL+ KW+YKVGL G
Sbjct: 533 INKISLLSVSVGLPNVGVHYDTWNAGVLGPVTLSGLNEGS---RDLAKQKWSYKVGLKG- 588
Query: 564 DDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNG 623
+ + ++S V + +TWYK TF AP N+P+ L++ MGKG W+NG
Sbjct: 589 ESLSLHTLSGSSSVEWVQGSLVARTQPLTWYKATFSAPGGNEPLALDMASMGKGQIWING 648
Query: 624 YNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLV 683
+GR+WP Y A+ D CS C Y G + KC NCG PSQ WYHVPRSW+K N LV
Sbjct: 649 EGVGRHWPGYAAQGD-CS--KCSYAGTFNEKKCQTNCGQPSQRWYHVPRSWLKTSGNLLV 705
Query: 684 LFEEFGGNPSQINF 697
+FEE+GG+P+ I+
Sbjct: 706 VFEEWGGDPTGISL 719
>gi|13936236|gb|AAK40304.1| beta-galactosidase [Capsicum annuum]
Length = 724
Score = 648 bits (1672), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/734 (48%), Positives = 447/734 (60%), Gaps = 62/734 (8%)
Query: 8 SRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKA 67
S +LL +++ +L + VS+D RAI I+G+RKIL+SGSIHYPRSTP MWPDLI+KA
Sbjct: 4 SNNVLLVVLVICSLDLLVKANVSYDDRAIVINGKRKILISGSIHYPRSTPQMWPDLIEKA 63
Query: 68 KEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNY 127
K+GGLD IETYVFWN HEP +Y+F G DL++FIK +Q GLYV LRIGPY+CAEWN+
Sbjct: 64 KDGGLDVIETYVFWNGHEPSPGKYNFEGRYDLVKFIKLVQGAGLYVNLRIGPYICAEWNF 123
Query: 128 GGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEY 187
GG PVWL + G+ E RT N+ F MQ F IV M K EKLF QGGPII+AQIENEY
Sbjct: 124 GGLPVWLKYVSGM-EFRTDNQPFKVAMQGFVQKIVSMMKSEKLFEPQGGPIIMAQIENEY 182
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNP 236
G V + G GK+Y W A+MA L VPWIMC++ DAP P+ F PN P
Sbjct: 183 GPVEWEIGAPGKAYTKWAAQMAVGLKTDVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKP 242
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
PK+WTE WTGWF +GG P+R AED+AF+VARF Q G++ NYYMYHGGTNFGRTS
Sbjct: 243 YKPKMWTEVWTGWFTKFGGPIPQRPAEDIAFSVARFVQNNGSYFNYYMYHGGTNFGRTSS 302
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN------ 350
G ++ TSYDYDAPIDEYG LN+PK+GHLRELHK +K E L T T G+
Sbjct: 303 GLFIATSYDYDAPIDEYGLLNEPKYGHLRELHKAIKQCEPALVSSYPTVTSLGSNQEAHV 362
Query: 351 --SVSGS---------------------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK 387
S SG+ Y+LP WS+SILPDCKT +NTAKV++Q +
Sbjct: 363 YRSKSGACAAFLSNYDAKYSVRVSFQNLPYDLPPWSISILPDCKTVVYNTAKVSSQGSSI 422
Query: 388 VKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNAD 446
P G L W+ E + N L +Q++ T D SDYLWYMT+ +
Sbjct: 423 KMTPAGGG-----LSWQSYNE--DTPTADDSDTLRANGLWEQRNVTRDSSDYLWYMTDVN 475
Query: 447 LKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASND---LFERPVKLTRG 503
+ ++ L + L + S+G VLH +VNG T YGA ++ + VKL G
Sbjct: 476 IASNEGFLKSGKDPYLTVMSAGHVLHVFVNGKLAG---TVYGALDNPKLTYSGNVKLNAG 532
Query: 504 KNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGL 563
N+ISLLS +VGL N G +D G+ GPV L G +DL+ KW+YKVGL G
Sbjct: 533 INKISLLSVSVGLPNVGVHYDTWNAGVLGPVTLSGLNEGS---RDLAKQKWSYKVGLKG- 588
Query: 564 DDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNG 623
+ + ++S V + +TWYK TF AP N+P+ L++ MGKG W+NG
Sbjct: 589 ESLSLHTLSGSSSVEWVQGSLVARTQPLTWYKATFSAPGGNEPLALDMASMGKGQIWING 648
Query: 624 YNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLV 683
+GR+WP Y A+ D CS C Y G + KC NCG PSQ WYHVPRSW+K N LV
Sbjct: 649 EGVGRHWPGYAAQGD-CS--KCSYAGTFNEKKCQTNCGQPSQRWYHVPRSWLKTSGNLLV 705
Query: 684 LFEEFGGNPSQINF 697
+FEE+GG+P+ I+
Sbjct: 706 VFEEWGGDPTGISL 719
>gi|350538173|ref|NP_001234842.1| ss-galactosidase precursor [Solanum lycopersicum]
gi|4138141|emb|CAA10175.1| ss-galactosidase [Solanum lycopersicum]
Length = 724
Score = 648 bits (1671), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/719 (48%), Positives = 442/719 (61%), Gaps = 58/719 (8%)
Query: 21 FNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVF 80
F S+ VS+D RAI I+G+RKIL+SGSIHYPRSTP MWPDLI+KAK+GGLD IETYVF
Sbjct: 17 FFSSVKASVSYDDRAIIINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVF 76
Query: 81 WNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGI 140
WN H P +Y+F G DL+RFIK +Q GLYV LRIGPYVCAEWN+GGFPVWL +PG+
Sbjct: 77 WNGHGPSPGKYNFEGRYDLVRFIKMVQRAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGM 136
Query: 141 EELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKS 200
E RT N+ F M+ F IV+M K E LF SQGGPII+AQIENEYG V + G GK+
Sbjct: 137 -EFRTNNQPFKVAMRGFVQKIVNMMKSENLFESQGGPIIMAQIENEYGPVEWEIGAPGKA 195
Query: 201 YINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGW 249
Y W A+MA L GVPWIMC++ DAP P+ F PN P PK+WTE WTGW
Sbjct: 196 YTKWAAQMAVGLKTGVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKPYKPKMWTEVWTGW 255
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAP 309
+ +GG P+R AED+AF+VARF Q G+F NYYMYHGGTNFGRTS G ++ TSYDYDAP
Sbjct: 256 YTKFGGPIPQRPAEDIAFSVARFVQNNGSFFNYYMYHGGTNFGRTSSGLFIATSYDYDAP 315
Query: 310 IDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------SVSGS------ 355
+DEYG LN+PK+GHLR+LHK +K E L T G+ S SG+
Sbjct: 316 LDEYGLLNEPKYGHLRDLHKAIKLSEPALVSSYAAVTSLGSNQEAHVYRSKSGACAAFLS 375
Query: 356 ---------------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAP 400
YNLP WS+SILPDCKT +NTA+VN+Q++ P G
Sbjct: 376 NYDSRYSVKVTFQNRPYNLPPWSISILPDCKTAVYNTAQVNSQSSSIKMTPAGGG----- 430
Query: 401 LQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSN 459
L W+ E N L +QK+ T D SDYLWYMTN ++ ++ L +
Sbjct: 431 LSWQSYNEETP--TADDSDTLTANGLWEQKNVTRDSSDYLWYMTNVNIASNEGFLKNGKD 488
Query: 460 MTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNY 519
L + S+G VLH +VNG + + + VKL G N+ISLLS +VGL N
Sbjct: 489 PYLTVMSAGHVLHVFVNGKLSGTVYGTLDNPKLTYSGNVKLRAGINKISLLSVSVGLPNV 548
Query: 520 GSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERG 579
G +D G+ GPV L G ++L+ KW+YKVGL G + +++ E
Sbjct: 549 GVHYDTWNAGVLGPVTLSGLNEGS---RNLAKQKWSYKVGLKGESLSLHSLSGSSSVE-- 603
Query: 580 WSSKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEED 638
W ++ ++ +TWYK TF AP NDP+ L++ MGKG W+NG +GR+WP Y+A+ D
Sbjct: 604 WVRGSLVAQKQPLTWYKATFNAPGGNDPLALDMASMGKGQIWINGEGVGRHWPGYIAQGD 663
Query: 639 GCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
CS C Y G + KC NCG PSQ WYHVPRSW+K N LV+FEE+GGNP+ I+
Sbjct: 664 -CS--KCSYAGTFNEKKCQTNCGQPSQRWYHVPRSWLKPSGNLLVVFEEWGGNPTGISL 719
>gi|3860321|emb|CAA10128.1| beta-galactosidase [Cicer arietinum]
Length = 745
Score = 646 bits (1667), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/721 (47%), Positives = 446/721 (61%), Gaps = 63/721 (8%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D +AI I+G+R+IL+SGSIHYPRSTP MW DLI+KAK GGLD I+TYVFWN HEP
Sbjct: 28 VTYDRKAIIINGQRRILISGSIHYPRSTPEMWEDLIQKAKVGGLDVIDTYVFWNVHEPSP 87
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y+F G DL+RFIKT+Q GLYV LRIGPYVCAEWN+GGFPVWL +PGI RT N
Sbjct: 88 SNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIS-FRTDNG 146
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ FT IV M K EKLF SQGGPIIL+QIENEYG G G +Y NW AKM
Sbjct: 147 PFKAAMQGFTQKIVQMMKNEKLFQSQGGPIILSQIENEYGPQGRALGAVGHAYSNWAAKM 206
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A L GVPW+MC+E DAP P+ F+PN P PK+WTE+W+GWF +GG
Sbjct: 207 AVGLGTGVPWVMCKEDDAPDPVINSCNGFYCDDFSPNKPYKPKLWTESWSGWFSEFGGPV 266
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P+R A+DLAFAVARF Q GG+F NYYMYHGGTNFGR++GGP++TTSYDYDAPIDEYG L
Sbjct: 267 PQRPAQDLAFAVARFIQKGGSFFNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGLLR 326
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG------------------------NSVS 353
+PK+GHL++LHK +K E L + T T G NS +
Sbjct: 327 EPKYGHLKDLHKAIKQCEHALVSSDPTVTSLGAYEQAHVFSSGTQTCAAFLANYHSNSAA 386
Query: 354 GSSYN-----LPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
++N LP WS+SILPDCKT+ FNTA+V Q N K++ ++ L W+ E
Sbjct: 387 RVTFNNRHYDLPPWSISILPDCKTDVFNTARVRFQ-NSKIQ---MLPSNSKLLSWETYDE 442
Query: 409 MINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
++ + + L++Q +T D SDYLWY+T+ D+ + L G + ++ ++SS
Sbjct: 443 DVSSLAESSR--ITASGLLEQINATRDTSDYLWYITSVDISPSESFLRGGNKPSISVHSS 500
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G +H ++NG + S + + F P+ L G N+I+LLS VGL N G F+
Sbjct: 501 GDAVHVFINGKFSGSAFGTREQRSCTFNGPINLHAGTNKIALLSVAVGLPNGGIHFESWK 560
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANS----ERGWSSK 583
GI GP+LL G + KDL+ KW+Y+VGL G + + +S +S+
Sbjct: 561 TGITGPILLHGLDHGQ---KDLTWQKWSYQVGLKG-EAMNLVSPNGVSSVDWVRESLASQ 616
Query: 584 NVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTE 643
N P ++ W+K F AP N+ + L++ GMGKG W+NG ++GRYW Y + C+
Sbjct: 617 NQP---QLKWHKAYFNAPDGNEALALDMSGMGKGQVWINGQSIGRYWLVY--AKGNCN-- 669
Query: 644 SCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVG 703
SC+Y G Y KC CG P+Q WYHVPRSW+K N +V+FEE GGNP +I+ +
Sbjct: 670 SCNYAGTYRQAKCQLGCGQPTQRWYHVPRSWLKPTNNLMVVFEELGGNPWKISLVKRTIH 729
Query: 704 T 704
T
Sbjct: 730 T 730
>gi|193850557|gb|ACF22882.1| beta-galactosidase [Glycine max]
Length = 721
Score = 645 bits (1665), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/723 (47%), Positives = 436/723 (60%), Gaps = 56/723 (7%)
Query: 13 LCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGL 72
+ L++ L+ + V++D +AI +DG+R+IL+SGSIHYPRSTP MWPDLI+KAK+GGL
Sbjct: 9 VVLMMLCLWVCGVTASVTYDHKAIVVDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGL 68
Query: 73 DAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPV 132
D I+TYVFWN HEP QY F DL++F+K Q GLYV LRIGPY+CAEWN GGFPV
Sbjct: 69 DVIQTYVFWNGHEPSPGQYYFEDRFDLVKFVKLAQQAGLYVHLRIGPYICAEWNLGGFPV 128
Query: 133 WLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS 192
WL +PGI RT N+ F MQ FT IV + K+ +LF SQGGPIIL+QIENEYG V
Sbjct: 129 WLKYVPGIA-FRTDNEPFKAAMQKFTAKIVSLMKENRLFQSQGGPIILSQIENEYGPVEW 187
Query: 193 DYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKI 241
+ G GK+Y W A+MA LD GVPW+MC++ DAP P+ F PN PK+
Sbjct: 188 EIGAPGKAYTKWAAQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGFYCENFKPNKNTKPKM 247
Query: 242 WTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT 301
WTENWTGW+ +GG P+R AEDLAF+VARF Q GG+F NYYMYHGGTNFGRTSGG ++
Sbjct: 248 WTENWTGWYTDFGGAVPRRPAEDLAFSVARFIQNGGSFVNYYMYHGGTNFGRTSGGLFIA 307
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLT---------------------- 339
TSYDYDAP+DEYG N+PK+ HLR LHK +K E L
Sbjct: 308 TSYDYDAPLDEYGLENEPKYEHLRALHKAIKQSEPALVATDPKVQSLGYNLEAHVFSAPG 367
Query: 340 -----YGNVTNTDYGNSVSGS-SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQ 393
N Y + G+ Y+LP WS+SILPDCKT +NTAKV K+ N
Sbjct: 368 ACAAFIANYDTKSYAKAKFGNGQYDLPPWSISILPDCKTVVYNTAKVGYGWLKKMTPVNS 427
Query: 394 AGNDQAPLQWK-WRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDP 452
A W+ + E + +AL ++ T D SDYLWYMT+ ++ ++
Sbjct: 428 A------FAWQSYNEEPASSSQADSIAAYALWEQVN--VTRDSSDYLWYMTDVNVNANEG 479
Query: 453 ILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSA 512
L + L + S+G VLH ++NG + W G F VKL G N++SLLS
Sbjct: 480 FLKNGQSPLLTVMSAGHVLHVFINGQLAGTVWGGLGNPKLTFSDNVKLRAGNNKLSLLSV 539
Query: 513 TVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAK 572
VGL N G F+ G+ GPV L G +DLS KW+YKVGL G + +
Sbjct: 540 AVGLPNVGVHFETWNAGVLGPVTLKGL---NEGTRDLSRQKWSYKVGLKG-ESLSLHTES 595
Query: 573 AANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPT 632
++S V + +TWYKTTF AP NDP+ L+L MGKG WVNG ++GR+WP
Sbjct: 596 GSSSVEWIQGSLVAKKQPLTWYKTTFSAPAGNDPLALDLGSMGKGEVWVNGRSIGRHWPG 655
Query: 633 YLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNP 692
Y+A S +C+Y G Y KC NCG PSQ WYHVPRSW+ G N+LV+FEE+GG+P
Sbjct: 656 YIAHG---SCNACNYAGYYTDTKCRTNCGQPSQRWYHVPRSWLSSGGNSLVVFEEWGGDP 712
Query: 693 SQI 695
+ I
Sbjct: 713 NGI 715
>gi|168008096|ref|XP_001756743.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691981|gb|EDQ78340.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 836
Score = 645 bits (1664), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/843 (43%), Positives = 475/843 (56%), Gaps = 97/843 (11%)
Query: 22 NLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFW 81
+LS+A VS+D RA+ +DG+R++L+SGSIHYPRSTP MWP LI KAKEGGLD I+TYVFW
Sbjct: 21 HLSVAVTVSYDHRALKLDGQRRMLVSGSIHYPRSTPLMWPGLIAKAKEGGLDVIQTYVFW 80
Query: 82 NAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIE 141
N HEP R Y++ G +L +FI+ + + G+YV LRIGPYVCAEWN GGFP WL +PGI
Sbjct: 81 NGHEPTRGVYNYAGRYNLPKFIRLVYEAGMYVNLRIGPYVCAEWNSGGFPAWLRFIPGI- 139
Query: 142 ELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSY 201
E RT N+ F NE Q F +V K+EKLFA QGGPII+AQIENEYGN+ + YG+AG+ Y
Sbjct: 140 EFRTDNEPFKNETQRFVNHLVRKLKREKLFAWQGGPIIMAQIENEYGNIDASYGEAGQRY 199
Query: 202 INWCAKMATSLDIGVPWIMCQESDAPSPMFT-----------PNNPNSPKIWTENWTGWF 250
+NW A MA + + VPWIMCQ+ +AP + PN+ + P WTENWTGWF
Sbjct: 200 LNWIANMAVATNTSVPWIMCQQPEAPQLVINTCNGFYCDGWRPNSEDKPAFWTENWTGWF 259
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPI 310
+SWGG P R +D+AF+VARFF+ GG+F NYYMYHGGTNF RT G +TTSYDYDAPI
Sbjct: 260 QSWGGGAPTRPVQDIAFSVARFFEKGGSFMNYYMYHGGTNFERT-GVESVTTSYDYDAPI 318
Query: 311 DEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNT------------------------ 346
DEY + QPKWGHL++LH LK E L + T
Sbjct: 319 DEY-DVRQPKWGHLKDLHAALKLCEPALVEVDTVPTGISLGPNQEAHVYQSSSGTCAAFL 377
Query: 347 ------DYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAP 400
D + G Y+LPAWSVSILPDCK+ FNTAKV Q+ + + P
Sbjct: 378 ASWDTNDSLVTFQGQPYDLPAWSVSILPDCKSVVFNTAKVGAQSVIMTMQ------GAVP 431
Query: 401 L-QWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSS 458
+ W E + + F+ N L++Q +T D +DYLWYMTN + + D + + S+
Sbjct: 432 VTNWVSYHEPLGPW----GSVFSTNGLLEQIATTKDTTDYLWYMTNVQVAESD-VRNISA 486
Query: 459 NMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLF---ERPVKLTRGKNQISLLSATVG 515
TL ++S H +VNG Y G S+ F +P+ L G N I++LS T+G
Sbjct: 487 QATLVMSSLRDAAHTFVNGFYT-------GTSHQQFMHARQPISLRPGSNNITVLSMTMG 539
Query: 516 LQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAAN 575
LQ YG + GI V + +L WTY+VGL G + K+ + +
Sbjct: 540 LQGYGPFLENEKAGIQYGVRIEDLPSGTI---ELGGSTWTYQVGLQG-ESKQLFEVNGSL 595
Query: 576 SERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLA 635
+ + V + W KT F+ P N + L+L MGKG WVNG NLGRYW ++ A
Sbjct: 596 TAEWNTISEVSDQNFLFWIKTRFDMPAGNGSIALDLSSMGKGVVWVNGVNLGRYWSSFTA 655
Query: 636 EEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQI 695
+ DGC SCDYRG Y KC C PSQ WYH+PR W+ N +VLFEE GGNP I
Sbjct: 656 QRDGCDA-SCDYRGSYTQSKCLTKCNQPSQNWYHIPRQWLLPKNNFIVLFEEKGGNPKDI 714
Query: 696 NFQTVVVGTAC---GQAH------------ENKT-------MELTC-HGRRISEIKYASF 732
+ T + C Q+H +N T + L C G++IS I +AS+
Sbjct: 715 SIATRMPQQICSHISQSHPFPFSLTSWTKRDNLTSTLLRAPLTLECAEGQQISRICFASY 774
Query: 733 GDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVE 792
G P G C F SC A ++ K CVG++ CS+ + G C G K L
Sbjct: 775 GTPSGDCEGFVLSSCHANTS-YDVLTKACVGRQKCSVPIVSSIFGDDPC-PGLSKSLAAT 832
Query: 793 ALC 795
A C
Sbjct: 833 AEC 835
>gi|357438127|ref|XP_003589339.1| Beta-galactosidase [Medicago truncatula]
gi|355478387|gb|AES59590.1| Beta-galactosidase [Medicago truncatula]
Length = 745
Score = 645 bits (1663), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/720 (46%), Positives = 447/720 (62%), Gaps = 62/720 (8%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D +AI I+G+R+IL+SGSIHYPRSTP MW DLI+KAK+GGLD I+TYVFWN HEP
Sbjct: 29 VTYDRKAIIINGQRRILISGSIHYPRSTPEMWEDLIQKAKDGGLDVIDTYVFWNVHEPSP 88
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y+F G DL++FIKT+Q +GLYV LRIGPYVCAEWN+GGFPVWL +PGI RT N
Sbjct: 89 GNYNFEGRYDLVQFIKTVQKKGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIS-FRTDNG 147
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ FT IV M K EKLF SQGGPIIL+QIENEYG G +G +Y NW AKM
Sbjct: 148 PFKAAMQGFTQKIVQMMKNEKLFQSQGGPIILSQIENEYGPQGRALGASGHAYSNWAAKM 207
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A L GVPW+MC+E DAP P+ F+PN P PK+WTE+W+GWF +GG +
Sbjct: 208 AVGLGTGVPWVMCKEDDAPDPVINACNGFYCDDFSPNKPYKPKLWTESWSGWFSEFGGSN 267
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P+R EDLAFAVARF Q GG+F NYYMYHGGTNFGR++GGP++TTSYDYDAPIDEYG L
Sbjct: 268 PQRPVEDLAFAVARFIQKGGSFFNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGLLR 327
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG-----------------------NSVSG 354
+PK+GHL++LHK +K E L + T T G NS +
Sbjct: 328 EPKYGHLKDLHKAIKQCEHALVSSDPTVTSLGAYEQAHVFSSGTTCAAFLANYHSNSAAR 387
Query: 355 SSYN-----LPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEM 409
++N LP WS+SILPDC+T+ FNTA++ Q + P ++ L W+ E
Sbjct: 388 VTFNNRHYDLPPWSISILPDCRTDVFNTARMRFQPSQIQMLP----SNSKLLSWETYDED 443
Query: 410 INDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSG 468
++ + + L++Q +T D SDYLWY+T+ D+ + L G + ++ ++SSG
Sbjct: 444 VSSLAESSR--ITASRLLEQIDATRDTSDYLWYITSVDISSSESFLRGRNKPSISVHSSG 501
Query: 469 QVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPN 528
+H ++NG + S + + F P+ L G N+I+LLS VGL N G F+ +
Sbjct: 502 DAVHVFINGKFSGSAFGTREDRSFTFNGPIDLRAGTNKIALLSVAVGLPNGGIHFESWKS 561
Query: 529 GIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANS----ERGWSSKN 584
GI GPVLL + KDL+ KW+Y+VGL G + + +S +S+N
Sbjct: 562 GITGPVLLHDLDHGQ---KDLTGQKWSYQVGLKG-EAMNLVSPNGVSSVDWVSESLASQN 617
Query: 585 VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTES 644
P ++ W+K F AP +P+ L++ MGKG W+NG ++GRYW Y + C+ S
Sbjct: 618 QP---QLKWHKAHFNAPNGVEPLALDMSSMGKGQVWINGQSIGRYWMVY--AKGNCN--S 670
Query: 645 CDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGT 704
C+Y G Y KC CG P+Q WYHVPRSW+K N +V+FEE GGNP +I+ ++ T
Sbjct: 671 CNYAGTYRQAKCQVGCGQPTQRWYHVPRSWLKPKNNLMVVFEELGGNPWKISLVKRIIHT 730
>gi|7682677|gb|AAF67341.1| beta galactosidase [Vigna radiata]
Length = 721
Score = 644 bits (1661), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/722 (47%), Positives = 434/722 (60%), Gaps = 56/722 (7%)
Query: 14 CLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLD 73
L+L + + V++D +AI IDG+R+IL+SGSIHYPRSTP MWPDLI+KAK+GGLD
Sbjct: 10 VLMLLFFWVCGVTASVTYDHKAIVIDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGLD 69
Query: 74 AIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVW 133
I+TYVFWN HEP +Y F DL+RF+K Q GLYV LRIGPY+CAEWN+GGFPVW
Sbjct: 70 VIQTYVFWNGHEPSPGKYYFEDRYDLVRFVKLAQQAGLYVHLRIGPYICAEWNFGGFPVW 129
Query: 134 LHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSD 193
L +PGI RT N+ F MQ FT IV + K+E+LF SQGGPIIL+QIENEYG V +
Sbjct: 130 LKYVPGIA-FRTDNEPFKAAMQKFTAKIVSLMKEERLFQSQGGPIILSQIENEYGPVEWE 188
Query: 194 YGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIW 242
G GKSY W A+MA LD GVPW+MC++ DAP P+ F PN PK+W
Sbjct: 189 IGAPGKSYTKWAAQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGFYCENFKPNKNTKPKMW 248
Query: 243 TENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTT 302
TENWTGW+ +GG P R AEDLAF+VARF Q GG+F NYYMYHGGTNFGRTSGG ++ T
Sbjct: 249 TENWTGWYTDFGGASPIRPAEDLAFSVARFIQNGGSFVNYYMYHGGTNFGRTSGGLFIAT 308
Query: 303 SYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGN-------------------- 342
SYDYDAP+DEYG N+PKWGHLR LHK +K E L +
Sbjct: 309 SYDYDAPLDEYGLQNEPKWGHLRALHKAIKQSEPALVSTDPKVTSLGYNLEAHVFSTPGA 368
Query: 343 ----VTNTDYGNSVSGS----SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQA 394
+ N D +S + Y+LP WS+SILPDCKT +NTA+V VK P +
Sbjct: 369 CAAFIANYDTKSSAKATFGSGQYDLPPWSISILPDCKTVVYNTARVGNGW-VKKMTPVNS 427
Query: 395 GNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPI 453
G + W+ A L +Q + T D SDYLWYMT+ + ++
Sbjct: 428 G-------FAWQSYNEEPASSSQDDSIAAEALWEQVNVTRDSSDYLWYMTDVYINGNEGF 480
Query: 454 LSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSAT 513
L + L + S+G +LH ++NG + + G F V L G N++SLLS
Sbjct: 481 LKNGRSPVLTVMSAGHLLHVFINGQLSGTVYGGLGNPKLTFSDNVNLRVGNNKLSLLSVA 540
Query: 514 VGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKA 573
VGL N G F+ G+ GPV L G +DLS KW+YKVGL G + +
Sbjct: 541 VGLPNVGVHFETWNAGVLGPVTLKGL---NEGTRDLSRQKWSYKVGLKG-EALNLHTESG 596
Query: 574 ANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTY 633
++S V + +TWYK TF AP NDP+ L+L MGKG WVNG ++GR+WP Y
Sbjct: 597 SSSVEWIQGSLVAKKQPLTWYKATFSAPAGNDPLALDLGSMGKGEVWVNGRSIGRHWPGY 656
Query: 634 LAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPS 693
+A S +C+Y G Y KC NCG PSQ WYHVPRSW+ G N+LV+FEE+GG+P+
Sbjct: 657 IAH---GSCNACNYAGYYTDQKCRTNCGKPSQRWYHVPRSWLNSGGNSLVVFEEWGGDPN 713
Query: 694 QI 695
I
Sbjct: 714 GI 715
>gi|414864994|tpg|DAA43551.1| TPA: beta-galactosidase [Zea mays]
Length = 897
Score = 643 bits (1659), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/885 (40%), Positives = 480/885 (54%), Gaps = 140/885 (15%)
Query: 30 SHDGRAITIDGERKILLSGSIHYPRSTPG------------------------------- 58
++D +A+ IDG+R+IL SGSIHYPRSTP
Sbjct: 30 TYDKKAVLIDGQRRILFSGSIHYPRSTPDVISCILQNLSFFFSPLLPRGGGEFMAVVSCV 89
Query: 59 ---------------------MWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
MW LI+KAK+GGLD I+TYVFWN HEP Y F
Sbjct: 90 LDAMLSKANCFPTLAVPLYSTMWEGLIQKAKDGGLDVIQTYVFWNGHEPTPGNYYFEERY 149
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL+RF+KT+Q GL+V LRIGPY+C EWN+GGFPVWL +PGI RT N+ F MQ F
Sbjct: 150 DLVRFVKTVQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGIS-FRTDNEPFKTAMQGF 208
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
T IV M K E LFASQGGPIIL+QIENEYG ++G AG++YINW AKMA LD GVP
Sbjct: 209 TEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGQAYINWAAKMAVGLDTGVP 268
Query: 218 WIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLA 266
W+MC+E DAP P+ F+PN P P +WTE W+GWF +GG +R EDLA
Sbjct: 269 WVMCKEEDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLA 328
Query: 267 FAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRE 326
FAVARF Q GG+F NYYMYHGGTNFGRT+GGP++TTSYDYDAPIDEYG + +PK HL+E
Sbjct: 329 FAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIREPKHSHLKE 388
Query: 327 LHKLLKSMEKTL--------TYGNVTNTDYGNSVSGSS--------------------YN 358
LH+ +K E+ L T G + S SG + Y+
Sbjct: 389 LHRAVKLCEQALVSVDPTITTLGTMQEAHVFRSPSGCAAFLANYNSNSHAKVVFNNEQYS 448
Query: 359 LPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGK 418
LP WS+SILPDCK FN+A V QT+ + G+ + W+ E ++
Sbjct: 449 LPPWSISILPDCKNVVFNSATVGVQTS----QMQMWGDGATSMMWERYDEEVDSLAA--A 502
Query: 419 GHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSN-MTLRINSSGQVLHAYVN 476
L++Q T D SDYLWY+T+ D+ + L G +L + S+G LH +VN
Sbjct: 503 PLLTTTGLLEQLNVTRDSSDYLWYITSVDISPSENFLQGGGKPPSLSVQSAGHALHVFVN 562
Query: 477 GNYVDSQWTKYGASNDL---FERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGP 533
G Q + YG D + V L G N+I+LLS GL N G ++ G+ GP
Sbjct: 563 GQL---QGSSYGTREDRRIKYNGNVNLRAGTNKIALLSVACGLPNVGVHYETWNTGVGGP 619
Query: 534 VLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRR--M 591
V+L G +DL+ W+Y+VGL G ++ N+ + W ++ ++ +
Sbjct: 620 VVLHGLNEGS---RDLTWQTWSYQVGLKG--EQMNLNSVEGSGSVEWMQGSLIAQKQQPL 674
Query: 592 TWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPY 651
WYK FE P ++P+ L++ MGKG W+NG ++GRYW Y DG + C Y G +
Sbjct: 675 AWYKAYFETPSGDEPLALDMGSMGKGQVWINGQSIGRYWTAY---ADG-DCKGCSYTGTF 730
Query: 652 GSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEF-GGNPSQINFQTVVVGTACG--- 707
+ KC CG P+Q WYHVPRSW++ N LV+ EE GG+ S+I V + C
Sbjct: 731 RAPKCQAGCGQPTQRWYHVPRSWLQPSRNLLVVLEELGGGDSSKIALAKRSVSSVCADVS 790
Query: 708 ----------------QAHENKTMELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAE 750
+ H + L C HG+ IS I++ASFG P G CG F++G C +
Sbjct: 791 EDHPNIKKWQIESYGEREHRRAKVHLRCAHGQSISAIRFASFGTPVGTCGNFQQGGCHSA 850
Query: 751 IDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
++EK+C+G + C + S N G C + T KR+ VEA+C
Sbjct: 851 -SSHAVLEKRCIGLQRCVVAISPDNFGGDPCPSVT-KRVAVEAVC 893
>gi|356556286|ref|XP_003546457.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 721
Score = 642 bits (1657), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/723 (47%), Positives = 434/723 (60%), Gaps = 56/723 (7%)
Query: 13 LCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGL 72
+ L+ L+ + V++D +AI +DG+R+IL+SGSIHYPRSTP MWPDLI+KAK+GGL
Sbjct: 9 VVLMSLCLWVCGVTASVTYDHKAIVVDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGL 68
Query: 73 DAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPV 132
D I+TYVFWN HEP QY F DL++F+K +Q GLYV LRIGPY+CAEWN+GGFPV
Sbjct: 69 DVIQTYVFWNGHEPSPGQYYFEDRFDLVKFVKLVQQAGLYVHLRIGPYICAEWNFGGFPV 128
Query: 133 WLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS 192
WL +PGI RT N+ F MQ FT IV + K+ +LF SQGGPII++QIENEYG V
Sbjct: 129 WLKYVPGIA-FRTDNEPFKAAMQKFTAKIVSLMKENRLFQSQGGPIIMSQIENEYGPVEW 187
Query: 193 DYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKI 241
+ G GK+Y W A+MA LD GVPW+MC++ DAP P+ F PN PK+
Sbjct: 188 EIGAPGKAYTKWAAQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGYYCENFKPNKNTKPKM 247
Query: 242 WTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT 301
WTENWTGW+ +GG P+R AEDLAF+VARF Q GG+F NYYMYHGGTNFGRTSGG ++
Sbjct: 248 WTENWTGWYTDFGGAVPRRPAEDLAFSVARFIQNGGSFVNYYMYHGGTNFGRTSGGLFIA 307
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLT---------------------- 339
TSYDYDAP+DEYG N+PK+ HLR LHK +K E L
Sbjct: 308 TSYDYDAPLDEYGLQNEPKYEHLRNLHKAIKQCEPALVATDPKVQSLGYNLEAHVFSTPG 367
Query: 340 -----YGNVTNTDYGNSVSGS-SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQ 393
N Y + G+ Y+LP WS+SILPDCKT +NTAKV K+ N
Sbjct: 368 ACAAFIANYDTKSYAKATFGNGQYDLPPWSISILPDCKTVVYNTAKVGNSWLKKMTPVNS 427
Query: 394 AGNDQAPLQWK-WRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDP 452
A W+ + E + +AL ++ T D SDYLWYMT+ + ++
Sbjct: 428 A------FAWQSYNEEPASSSQADSIAAYALWEQVN--VTRDSSDYLWYMTDVYINANEG 479
Query: 453 ILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSA 512
L + L S+G VLH ++N + W F VKL G N++SLLS
Sbjct: 480 FLKNGQSPVLTAMSAGHVLHVFINDQLAGTVWGGLANPKLTFSDNVKLRVGNNKLSLLSV 539
Query: 513 TVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAK 572
VGL N G F+ G+ GPV L G +DLSS KW+YKVGL G + +
Sbjct: 540 AVGLPNVGVHFETWNAGVLGPVTLKGL---NEGTRDLSSQKWSYKVGLKG-ESLSLHTES 595
Query: 573 AANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPT 632
++S V + +TWYKTTF AP NDP+ L+L MGKG WVNG ++GR+WP
Sbjct: 596 GSSSVEWIRGSLVAKKQPLTWYKTTFSAPAGNDPLALDLGSMGKGEVWVNGRSIGRHWPG 655
Query: 633 YLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNP 692
Y+A S +C+Y G Y KC NCG PSQ WYHVPRSW+ G N+LV+FEE+GG+P
Sbjct: 656 YIAHG---SCNACNYAGFYTDTKCRTNCGQPSQRWYHVPRSWLSSGGNSLVVFEEWGGDP 712
Query: 693 SQI 695
+ I
Sbjct: 713 NGI 715
>gi|7682680|gb|AAF67342.1| beta galactosidase [Vigna radiata]
Length = 739
Score = 642 bits (1657), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/756 (45%), Positives = 457/756 (60%), Gaps = 64/756 (8%)
Query: 4 LKHCSRAILLCLILQTLFNLS--LAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWP 61
++ S + LL L+ LF S + V++D +AI I+G+R+IL+SGSIHYPRSTP MW
Sbjct: 1 METISVSKLLVLVFTILFLGSELIHCSVTYDRKAIIINGQRRILISGSIHYPRSTPEMWE 60
Query: 62 DLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYV 121
DLI+KAK GGLDAI+TYVFWN HEP Y+F G DL+RFIKT+Q GLYV LRIGPYV
Sbjct: 61 DLIRKAKGGGLDAIDTYVFWNVHEPSPGIYNFEGRYDLVRFIKTVQRVGLYVHLRIGPYV 120
Query: 122 CAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILA 181
CAEWN+GGFPVWL +PGI RT N F MQ FT IV M K EKLF SQGGPIIL+
Sbjct: 121 CAEWNFGGFPVWLKYVPGI-SFRTDNGPFKAAMQGFTQKIVQMMKNEKLFQSQGGPIILS 179
Query: 182 QIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM----------- 230
QIENEYG+ G AG +Y NW AKMA L+ GVPW+MC++ DAP P+
Sbjct: 180 QIENEYGSESKQLGGAGYAYTNWAAKMAVGLNTGVPWVMCKQDDAPDPVINACNGFYCDY 239
Query: 231 FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTN 290
F+PN P P +WTE+W+GWF +GG +R +DLAFAVARF Q GG++ NYYMYHGGTN
Sbjct: 240 FSPNKPYKPTLWTESWSGWFTEFGGPIYQRPVQDLAFAVARFIQKGGSYINYYMYHGGTN 299
Query: 291 FGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG- 349
FGR++GGP++TTSYDYDAPIDEYG + +PK+GHL +LHK +K E+ L + T T G
Sbjct: 300 FGRSAGGPFITTSYDYDAPIDEYGLIREPKYGHLMDLHKAIKQCERALVSSDPTVTSLGA 359
Query: 350 -----------------------NSVSGSSYN-----LPAWSVSILPDCKTEEFNTAKVN 381
NS + ++N LP WS+SILPDCKT+ FNTA+V
Sbjct: 360 YEQAHVFSSKNGACAAFLANYHSNSAARVTFNNRKYDLPPWSISILPDCKTDVFNTARVR 419
Query: 382 TQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLW 440
QT P ++ W+ E ++ K + L++Q +T D SDYLW
Sbjct: 420 FQTTKIQMLP----SNSKLFSWETYDEDVSSLSESSK--ITASGLLEQLNATRDTSDYLW 473
Query: 441 YMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASND---LFERP 497
Y+T+ D+ + L G + ++ ++S+G +H ++NG ++ S +G S D F P
Sbjct: 474 YITSVDISSSESFLRGGNKPSISVHSAGHAVHVFINGQFLGS---AFGTSEDRSCTFNGP 530
Query: 498 VKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYK 557
V L G N+I+LLS VGL N G F+ GI G VLL G + KDL+ KW+Y+
Sbjct: 531 VNLRAGTNKIALLSVAVGLPNVGFHFETWKAGITG-VLLYGLDHGQ---KDLTWQKWSYQ 586
Query: 558 VGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKG 617
+GL G ++ + S +V ++ W+K F AP +P+ L+L MGKG
Sbjct: 587 IGLKGEAMNLVSPNGVSSVDWVRDSLDVRSQSQLKWHKAYFNAPDGVEPLALDLSSMGKG 646
Query: 618 FAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKD 677
W+NG ++GRYW Y + C+ SC+Y G Y KC CG P+Q WYHVPRSW+K
Sbjct: 647 QVWINGQSIGRYWMVY--AKGACN--SCNYAGTYRPAKCQLGCGQPTQQWYHVPRSWLKP 702
Query: 678 GVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHENK 713
N +VL EE GGNP +I+ Q ++ T +K
Sbjct: 703 TNNLIVLLEELGGNPWKISLQKRIIHTPASSEPNSK 738
>gi|15242897|ref|NP_201186.1| beta-galactosidase 10 [Arabidopsis thaliana]
gi|75171772|sp|Q9FN08.1|BGL10_ARATH RecName: Full=Beta-galactosidase 10; Short=Lactase 10; Flags:
Precursor
gi|10177669|dbj|BAB11029.1| beta-galactosidase [Arabidopsis thaliana]
gi|20260438|gb|AAM13117.1| unknown protein [Arabidopsis thaliana]
gi|34098797|gb|AAQ56781.1| At5g63810 [Arabidopsis thaliana]
gi|332010417|gb|AED97800.1| beta-galactosidase 10 [Arabidopsis thaliana]
Length = 741
Score = 640 bits (1650), Expect = e-180, Method: Compositional matrix adjust.
Identities = 339/737 (45%), Positives = 449/737 (60%), Gaps = 59/737 (8%)
Query: 7 CSRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKK 66
S AIL+ ++ + A VS+D R++TI R++++S +IHYPRS P MWP L++
Sbjct: 10 ASTAILVVMVFLFSWRSIEAANVSYDHRSLTIGNRRQLIISAAIHYPRSVPAMWPSLVQT 69
Query: 67 AKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWN 126
AKEGG +AIE+YVFWN HEP +Y F G ++++FIK +Q G+++ILRIGP+V AEWN
Sbjct: 70 AKEGGCNAIESYVFWNGHEPSPGKYYFGGRYNIVKFIKIVQQAGMHMILRIGPFVAAEWN 129
Query: 127 YGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENE 186
YGG PVWLH +PG R N+ + + M++FTT IV++ K+EKLFA QGGPIIL+Q+ENE
Sbjct: 130 YGGVPVWLHYVPGTV-FRADNEPWKHYMESFTTYIVNLLKQEKLFAPQGGPIILSQVENE 188
Query: 187 YGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNN 235
YG DYG+ GK Y W A MA S +IGVPW+MCQ+ DAP + FTPN
Sbjct: 189 YGYYEKDYGEGGKRYAQWSASMAVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQFTPNT 248
Query: 236 PNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTS 295
P+ PKIWTENW GWFK++GG+DP R AED+A++VARFF GG+ NYYMYHGGTNFGRTS
Sbjct: 249 PDKPKIWTENWPGWFKTFGGRDPHRPAEDVAYSVARFFGKGGSVHNYYMYHGGTNFGRTS 308
Query: 296 GGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSG- 354
GGP++TTSYDY+APIDEYG PKWGHL++LHK + E L G N G+S+
Sbjct: 309 GGPFITTSYDYEAPIDEYGLPRLPKWGHLKDLHKAIMLSENLLISGEHQNFTLGHSLEAD 368
Query: 355 ----------------------------SSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV 386
+SY+LPAWSVSILPDCKTE FNTAKV ++++
Sbjct: 369 VYTDSSGTCAAFLSNLDDKNDKAVMFRNTSYHLPAWSVSILPDCKTEVFNTAKVTSKSS- 427
Query: 387 KVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNA 445
KV+ + + L+W+ E + G F N L+D +T D +DYLWY T+
Sbjct: 428 KVEMLPEDLKSSSGLKWEVFSEKPG---IWGAADFVKNELVDHINTTKDTTDYLWYTTSI 484
Query: 446 DLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKN 505
+ +++ L S+ L I S G LH ++N Y+ + ++PV L G+N
Sbjct: 485 TVSENEAFLKKGSSPVLFIESKGHTLHVFINKEYLGTATGNGTHVPFKLKKPVALKAGEN 544
Query: 506 QISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDD 565
I LLS TVGL N GS ++ V G+ V G +L++ KW+YK+G+ G
Sbjct: 545 NIDLLSMTVGLANAGSFYEWVGAGLTS----VSIKGFNKGTLNLTNSKWSYKLGVEGEHL 600
Query: 566 KKFYNAKAANS-ERGWSSKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNG 623
+ F K NS W+ P ++ +TWYK E P ++PV L++ MGKG AW+NG
Sbjct: 601 ELF---KPGNSGAVKWTVTTKPPKKQPLTWYKVVIEPPSGSEPVGLDMISMGKGMAWLNG 657
Query: 624 YNLGRYWPTYL---AEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVN 680
+GRYWP + D C E CDYRG + DKC CG PSQ WYHVPRSW K N
Sbjct: 658 EEIGRYWPRIARKNSPNDECVKE-CDYRGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGN 716
Query: 681 TLVLFEEFGGNPSQINF 697
LV+FEE GGNP +I
Sbjct: 717 ELVIFEEKGGNPMKIKL 733
>gi|357449771|ref|XP_003595162.1| Beta-galactosidase [Medicago truncatula]
gi|124360798|gb|ABN08770.1| Galactose-binding like [Medicago truncatula]
gi|355484210|gb|AES65413.1| Beta-galactosidase [Medicago truncatula]
Length = 726
Score = 640 bits (1650), Expect = e-180, Method: Compositional matrix adjust.
Identities = 342/727 (47%), Positives = 445/727 (61%), Gaps = 59/727 (8%)
Query: 14 CLILQTLFNLS-LAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGL 72
C IL F + + V++D +AI I+G+R+IL+SGSIHYPRSTP MWPDLI+KAK+GG+
Sbjct: 12 CYILFLCFFVCYVTASVTYDHKAIVINGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGV 71
Query: 73 DAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPV 132
D IETYVFWN HEP + +Y F DL++FIK +Q GLYV LRIGPYVCAEWN+GGFPV
Sbjct: 72 DVIETYVFWNGHEPSQGKYYFEDRFDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPV 131
Query: 133 WLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS 192
WL +PG+ RT N+ F MQ FTT IV + K E LF SQGGPIIL+QIENEYG V
Sbjct: 132 WLKYVPGV-AFRTDNEPFKAAMQKFTTKIVSIMKSENLFQSQGGPIILSQIENEYGPVEW 190
Query: 193 DYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKI 241
+ G GKSY W ++MA L+ GVPW+MC++ DAP P+ F+PN PK+
Sbjct: 191 EIGAPGKSYTKWFSQMAVGLNTGVPWVMCKQEDAPDPIIDTCNGYYCENFSPNKNYKPKM 250
Query: 242 WTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT 301
WTENWTGW+ +G P R AEDLAF+VARF Q G++ NYYMYHGGTNFGRTS G ++
Sbjct: 251 WTENWTGWYTDFGTAVPYRPAEDLAFSVARFVQNRGSYVNYYMYHGGTNFGRTSSGLFIA 310
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVT----------------- 344
TSYDYDAPIDEYG +++PKWGHLR+LHK +K E L + T
Sbjct: 311 TSYDYDAPIDEYGLISEPKWGHLRDLHKAIKQCESALVSVDPTVSWPGKNLEVHLYKTSF 370
Query: 345 --------NTDYGN----SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRP- 391
N D G+ + Y+LP WS+SILPDCKTE FNTAKV + P
Sbjct: 371 GACAAFLANYDTGSWAKVAFGNGHYDLPPWSISILPDCKTEVFNTAKVRAPRVHRSMTPA 430
Query: 392 NQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDD 450
N A N Q+ + +P + G + N L++Q S T D SDYLWYMT+ ++ +
Sbjct: 431 NSAFNWQS---YNEQPAFSGE-----SGSWTANGLLEQLSQTWDKSDYLWYMTDVNISPN 482
Query: 451 DPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLL 510
+ + N L S+G VLH ++NG + + + F VKL G N+ISLL
Sbjct: 483 EGFIKNGQNPVLTAMSAGHVLHVFINGQFWGTAYGSLDNPKLTFSNSVKLRVGNNKISLL 542
Query: 511 SATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYN 570
S VGL N G ++ G+ GPV L G +DLS KW+YK+GL G + +
Sbjct: 543 SVAVGLSNVGVHYEKWNVGVLGPVTL---KGLNEGTRDLSKQKWSYKIGLKG-ESLNLHT 598
Query: 571 AKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
++S + + + +TWYKTTF AP NDP+ L++ MGKG WVNG ++GR+W
Sbjct: 599 TSGSSSVKWTQGSFLSKKQPLTWYKTTFNAPAGNDPLALDMSSMGKGEIWVNGQSIGRHW 658
Query: 631 PTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGG 690
P Y+A + C SC+Y G + KC NCG P+Q WYH+PRSW+ N LV+ EE+GG
Sbjct: 659 PAYIARGN-CG--SCNYAGTFTDKKCRTNCGQPTQKWYHIPRSWLNPSGNVLVVLEEWGG 715
Query: 691 NPSQINF 697
+P+ I+
Sbjct: 716 DPTGISL 722
>gi|3860420|emb|CAA09467.1| exo galactanase [Lupinus angustifolius]
Length = 730
Score = 639 bits (1649), Expect = e-180, Method: Compositional matrix adjust.
Identities = 349/729 (47%), Positives = 442/729 (60%), Gaps = 61/729 (8%)
Query: 9 RAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAK 68
R + L+L + + V++D +AI I+G+R+IL+SGSIHYPRSTP MWPDLI+KAK
Sbjct: 15 RNFHMVLLLLFFWVCYVTASVTYDHKAIMINGQRRILISGSIHYPRSTPQMWPDLIQKAK 74
Query: 69 EGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYG 128
+GGLD IETYVFWN HEP +Y F DL+ FIK +Q GL+V LRIGP++CAEWN+G
Sbjct: 75 DGGLDVIETYVFWNGHEPSPGKYYFEDRFDLVGFIKLVQQAGLFVHLRIGPFICAEWNFG 134
Query: 129 GFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYG 188
GFPVWL +PGI RT N+ F MQ FT IV++ K EKLF SQGGPIIL+QIENEYG
Sbjct: 135 GFPVWLKYVPGI-AFRTDNEPFKEAMQKFTEKIVNIMKAEKLFQSQGGPIILSQIENEYG 193
Query: 189 NVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPN 237
V + G GK+Y W A+MA LD GVPW+MC++ DAP P+ FTPN
Sbjct: 194 PVEWEIGAPGKAYTKWAAQMAVGLDTGVPWVMCKQEDAPDPIIDTCNGFYCENFTPNKNY 253
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
PK+WTENWTGW+ ++GG P R AED+AF+VARF Q G+ NYYMYHGGTNFGRTS G
Sbjct: 254 KPKLWTENWTGWYTAFGGATPYRPAEDIAFSVARFIQNRGSLFNYYMYHGGTNFGRTSNG 313
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVT------------- 344
++ TSYDYDAPIDEYG LN+PKWGHLRELH+ +K E L + T
Sbjct: 314 LFVATSYDYDAPIDEYGLLNEPKWGHLRELHRAIKQCESALVSVDPTVSWPGKNLEVHLY 373
Query: 345 -------------NTDYGNSVS--GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVK 389
NTDY V Y+LP WS+SILPDCKTE FNTAKVN+ +
Sbjct: 374 KTESACAAFLANYNTDYSTQVKFGNGQYDLPPWSISILPDCKTEVFNTAKVNSPRLHRKM 433
Query: 390 RPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKD 449
P + + ND V +AL + T D SDYLWY+T+ ++
Sbjct: 434 TPVNSAFAWQSYNEEPASSSENDPVT----GYALWEQVG--VTRDSSDYLWYLTDVNIGP 487
Query: 450 DDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASND---LFERPVKLTRGKNQ 506
+D + L S+G VL+ ++NG Y T YG+ +D F + V L G N+
Sbjct: 488 ND--IKDGKWPVLTAMSAGHVLNVFINGQYAG---TAYGSLDDPRLTFSQSVNLRVGNNK 542
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDK 566
ISLLS +VGL N G+ F+ G+ GPV L G + DLS KW+YK+GL G +
Sbjct: 543 ISLLSVSVGLANVGTHFETWNTGVLGPVTLTGLSSGTW---DLSKQKWSYKIGLKG-ESL 598
Query: 567 KFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNL 626
+ +NS V + + WYKTTF AP NDP+ L+L MGKG WVNG ++
Sbjct: 599 SLHTEAGSNSVEWVQGSLVAKKQPLAWYKTTFSAPAGNDPLALDLGSMGKGEVWVNGQSI 658
Query: 627 GRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
GR+WP A + C +C+Y G Y KC NCG PSQ WYHVPRSW++ G N LV+ E
Sbjct: 659 GRHWPGNKARGN-CG--NCNYAGTYTDTKCLANCGQPSQRWYHVPRSWLRSGGNYLVVLE 715
Query: 687 EFGGNPSQI 695
E+GG+P+ I
Sbjct: 716 EWGGDPNGI 724
>gi|12583687|dbj|BAB21492.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 731
Score = 639 bits (1647), Expect = e-180, Method: Compositional matrix adjust.
Identities = 337/733 (45%), Positives = 446/733 (60%), Gaps = 58/733 (7%)
Query: 13 LCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGL 72
+ L+ +F+ + A VS+D +AI I+G+++IL+SGSIHYPRSTP MWPDLI+KAK+GGL
Sbjct: 11 ILLLFSCIFSAASA-SVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKAKDGGL 69
Query: 73 DAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPV 132
D I+TYVFWN HEP +Y F DL++FIK +Q GL+V LRIGPYVCAEWN+GGFPV
Sbjct: 70 DVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGFPV 129
Query: 133 WLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS 192
WL +PGI RT N+ F MQ FT IV M K EKLF +QGGPIIL+QIENE+G V
Sbjct: 130 WLKYVPGIA-FRTDNEPFKAAMQKFTEKIVSMMKAEKLFQTQGGPIILSQIENEFGPVEW 188
Query: 193 DYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKI 241
+ G GK+Y W A+MA LD GVPWIMC++ DAP P+ F PN PK+
Sbjct: 189 EIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDYKPKM 248
Query: 242 WTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT 301
WTE WTGW+ +GG P R AED+AF+VARF Q GG+F NYYMYHGGTNFGRT+GGP++
Sbjct: 249 WTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQSGGSFLNYYMYHGGTNFGRTAGGPFMA 308
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN----------- 350
TSYDYDAP+DEYG L +PKWGHLR+LHK +KS E L + + T G+
Sbjct: 309 TSYDYDAPLDEYGLLREPKWGHLRDLHKAIKSCESALVSVDPSVTKLGSNQEAHVFKSES 368
Query: 351 -----------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQ 393
S G Y+LP WS+SILPDCKTE ++TAKV +Q++ P
Sbjct: 369 DCAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYSTAKVGSQSSQVQMTPVH 428
Query: 394 AGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDP 452
+G + W+ + L+ L +Q + T D +DYLWYMT+ + D+
Sbjct: 429 SG-------FPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWYMTDITIGSDEA 481
Query: 453 ILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSA 512
L + L I S+G L+ ++NG + + F + V L G N+++LLS
Sbjct: 482 FLKNGKSPLLTIFSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNLRSGINKLALLSI 541
Query: 513 TVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAK 572
+VGL N G+ F+ G+ GP+ L G + D+S KWTYK GL G + +
Sbjct: 542 SVGLPNVGTHFETWNAGVLGPITLKGL---NSGTWDMSGWKWTYKTGLKG-EALGLHTVT 597
Query: 573 AANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPT 632
++S ++ + +TWYK TF AP + P+ L++ MGKG W+NG ++GR+WP
Sbjct: 598 GSSSVEWVEGPSMAKKQPLTWYKATFNAPPGDAPLALDMGSMGKGQIWINGQSVGRHWPG 657
Query: 633 YLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNP 692
Y+A S C Y G Y KC +CG PSQ WYH+PRSW+ N LV+FEE+GG+P
Sbjct: 658 YIARG---SCGDCSYAGTYDDKKCRTHCGEPSQRWYHIPRSWLTPNGNLLVVFEEWGGDP 714
Query: 693 SQINFQTVVVGTA 705
S+I+ V GTA
Sbjct: 715 SRISL--VERGTA 725
>gi|449435860|ref|XP_004135712.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 723
Score = 637 bits (1644), Expect = e-180, Method: Compositional matrix adjust.
Identities = 344/731 (47%), Positives = 433/731 (59%), Gaps = 57/731 (7%)
Query: 8 SRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKA 67
S+ +++ L L S+ V++D +A+ IDG+R+IL+SGSIHYPRSTP MWPDLI+KA
Sbjct: 5 SKIMVVFLGLVLWVCSSVMASVTYDHKALVIDGKRRILISGSIHYPRSTPQMWPDLIQKA 64
Query: 68 KEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNY 127
K+GGLD IETYVFWN HEP QY F +L+RF+K +Q GLYV LRIGPYVCAEWN+
Sbjct: 65 KDGGLDVIETYVFWNGHEPSPGQYYFEDRYELVRFVKLVQQAGLYVHLRIGPYVCAEWNF 124
Query: 128 GGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEY 187
GGFPVWL +PGI RT N F MQ FT IV M K EKL+ SQGGPIIL+QIENEY
Sbjct: 125 GGFPVWLKYVPGI-AFRTDNGPFKAAMQKFTAKIVSMMKGEKLYHSQGGPIILSQIENEY 183
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNP 236
G V + G GKSY W A+MA LD GVPW+MC++ DAP PM F PN
Sbjct: 184 GPVEWEIGAPGKSYTKWAAQMALGLDTGVPWVMCKQEDAPDPMIDTCNGFYCENFEPNKA 243
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
PK+WTE WTGWF +GG P R EDLA+AVARF Q G+ NYYMYHGGTNFGRT+G
Sbjct: 244 YKPKMWTEAWTGWFTEFGGPVPYRPVEDLAYAVARFIQNRGSLINYYMYHGGTNFGRTAG 303
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG------- 349
GP++ TSYDYDAPIDEYG + QPKWGHLR+LHK +K E L + T + G
Sbjct: 304 GPFIATSYDYDAPIDEYGLIRQPKWGHLRDLHKAIKLCEPALVSVDPTVSSLGSKQEAHV 363
Query: 350 -NSVSGS---------------------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK 387
N+ SG Y+LP WSVSILPDCKT FNTAKVN +
Sbjct: 364 YNTRSGECAAFLANYDPSTSVRVTFGNHPYDLPPWSVSILPDCKTVVFNTAKVNAPSYWP 423
Query: 388 VKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNAD 446
P + + W + L++Q S T D +DYLWYMT+
Sbjct: 424 KMTPISS--------FSWHSYNEETASAYADDTTTMAGLVEQISITRDATDYLWYMTDIR 475
Query: 447 LKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQ 506
+ ++ L L I S+G LH ++NG + + F + V L G N+
Sbjct: 476 IDSNEGFLKSGQWPLLTIFSAGHALHVFINGQLSGTVYGGLDNPKLTFSKYVNLRPGVNK 535
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDK 566
+S+LS VGL N G F+ GI GPV L G +D+S +KW+YKVGL G +
Sbjct: 536 LSMLSVAVGLPNVGVHFETWNAGILGPVTL---KGLNEGTRDMSGYKWSYKVGLKG-EAL 591
Query: 567 KFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNL 626
+ ++S + V + +TWYKTTF AP N+P+ L++ MGKG W+NG ++
Sbjct: 592 NLHTVSGSSSVEWMTGSLVSQKQPLTWYKTTFNAPGGNEPLALDMGSMGKGQVWINGESI 651
Query: 627 GRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
GR+WP Y A S C Y G + KC ++CG PSQ WYHVPR+W+K N LV+FE
Sbjct: 652 GRHWPAYTARG---SCGKCYYGGIFTEKKCHFSCGEPSQRWYHVPRAWLKPSGNILVIFE 708
Query: 687 EFGGNPSQINF 697
E+GGNP I+
Sbjct: 709 EWGGNPDGISL 719
>gi|6686892|emb|CAB64746.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 741
Score = 637 bits (1644), Expect = e-180, Method: Compositional matrix adjust.
Identities = 338/737 (45%), Positives = 448/737 (60%), Gaps = 59/737 (8%)
Query: 7 CSRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKK 66
S AIL+ ++ + A VS+D R++TI R++++S +IHYPRS P MWP L++
Sbjct: 10 ASTAILVVMVFLFSWRSIEAANVSYDHRSLTIGNRRQLIISAAIHYPRSVPAMWPSLVQT 69
Query: 67 AKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWN 126
AKEGG +AIE+YVFWN HEP +Y F G ++++FIK +Q G+++ILRIGP+V AEWN
Sbjct: 70 AKEGGCNAIESYVFWNGHEPSPGKYYFGGRYNIVKFIKIVQQAGMHMILRIGPFVAAEWN 129
Query: 127 YGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENE 186
YGG PVWLH +PG R N+ + + M++FTT IV++ K+EKLFA QGGPIIL+Q+ENE
Sbjct: 130 YGGVPVWLHYVPGTV-FRADNEPWKHYMESFTTYIVNLLKQEKLFAPQGGPIILSQVENE 188
Query: 187 YGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNN 235
YG DYG+ GK Y W A MA S +IGVPW+MCQ+ DAP + FTPN
Sbjct: 189 YGYYEKDYGEGGKRYAQWSASMAVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQFTPNT 248
Query: 236 PNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTS 295
P+ PKIWTENW GWFK++GG+DP R AED+A++VARFF GG+ NYYMYHGGTNFGRTS
Sbjct: 249 PDKPKIWTENWPGWFKTFGGRDPHRPAEDVAYSVARFFGKGGSVHNYYMYHGGTNFGRTS 308
Query: 296 GGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSG- 354
GGP++TTSYDY+APIDEYG PKWGHL++LHK + E L G N G+S+
Sbjct: 309 GGPFITTSYDYEAPIDEYGLPRLPKWGHLKDLHKAIMLSENLLISGEHQNFTLGHSLEAD 368
Query: 355 ----------------------------SSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV 386
+SY+LPAWSVSILPDCKTE FNTAKV ++++
Sbjct: 369 VYTDSSGTCAAFLSNLDDKNDKAVMFRNTSYHLPAWSVSILPDCKTEVFNTAKVTSKSS- 427
Query: 387 KVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNA 445
KV+ + + L+W+ E + G F N L+D +T D +DYLWY T+
Sbjct: 428 KVEMLPEDLKSSSGLKWEVFSEKPG---IWGAADFVKNELVDHINTTKDTTDYLWYTTSI 484
Query: 446 DLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKN 505
+ +++ L S+ L I S G LH ++N Y+ + ++PV L G+
Sbjct: 485 TVSENEAFLKKGSSPVLFIESKGHTLHVFINKEYLGTATGNGTHVPFKLKKPVALKAGET 544
Query: 506 QISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDD 565
I LLS TVGL N GS ++ V G+ V G +L++ KW+YK+G+ G
Sbjct: 545 NIDLLSMTVGLANAGSFYEWVGAGLTS----VSIKGFNKGTLNLTNSKWSYKLGVEGEHL 600
Query: 566 KKFYNAKAANS-ERGWSSKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNG 623
+ F K NS W+ P ++ +TWYK E P ++PV L++ MGKG AW+NG
Sbjct: 601 ELF---KPGNSGAVKWTVTTKPPKKQPLTWYKVVIEPPSGSEPVGLDMISMGKGMAWLNG 657
Query: 624 YNLGRYWPTYL---AEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVN 680
+GRYWP + D C E CDYRG + DKC CG PSQ WYHVPRSW K N
Sbjct: 658 EEIGRYWPRIARKNSPNDECVKE-CDYRGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGN 716
Query: 681 TLVLFEEFGGNPSQINF 697
LV+FEE GGNP +I
Sbjct: 717 ELVIFEEKGGNPMKIKL 733
>gi|1352078|sp|P48981.1|BGAL_MALDO RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; AltName:
Full=Exo-(1-->4)-beta-D-galactanase; Flags: Precursor
gi|507278|gb|AAA62324.1| b-galactosidase-related protein; putative [Malus x domestica]
Length = 731
Score = 637 bits (1644), Expect = e-180, Method: Compositional matrix adjust.
Identities = 337/733 (45%), Positives = 445/733 (60%), Gaps = 58/733 (7%)
Query: 13 LCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGL 72
+ L+ +F+ + A VS+D +AI I+G+++IL+SGSIHYPRSTP MWPDLI+KAK+GGL
Sbjct: 11 ILLLFSCIFSAASA-SVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKAKDGGL 69
Query: 73 DAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPV 132
D I+TYVFWN HEP Y F DL++FIK +Q +GL+V LRIGPYVCAEWN+GGFPV
Sbjct: 70 DVIQTYVFWNGHEPSPGNYYFEERYDLVKFIKLVQQEGLFVNLRIGPYVCAEWNFGGFPV 129
Query: 133 WLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS 192
WL +PGI RT N+ F MQ FT IV M K EKLF +QGGPIIL+QIENE+G V
Sbjct: 130 WLKYVPGIA-FRTDNEPFKAAMQKFTEKIVSMMKAEKLFQTQGGPIILSQIENEFGPVEW 188
Query: 193 DYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKI 241
+ G GK+Y W A+MA LD GVPWIMC++ DAP P+ F PN PK+
Sbjct: 189 EIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDYKPKM 248
Query: 242 WTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT 301
WTE WTGW+ +GG P R AED+AF+VARF Q GG+F NYYMYHGGTNFGRT+GGP++
Sbjct: 249 WTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQSGGSFLNYYMYHGGTNFGRTAGGPFMA 308
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN----------- 350
TSYDYDAP+DEYG +PKWGHLR+LHK +KS E L + + T G+
Sbjct: 309 TSYDYDAPLDEYGLPREPKWGHLRDLHKAIKSCESALVSVDPSVTKLGSNQEAHVFKSES 368
Query: 351 -----------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQ 393
S G Y+LP WS+SILPDCKTE +NTAKV +Q++ P
Sbjct: 369 DCAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGSQSSQVQMTPVH 428
Query: 394 AGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDP 452
+G + W+ + L+ L +Q + T D +DYLWYMT+ + D+
Sbjct: 429 SG-------FPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWYMTDITIGSDEA 481
Query: 453 ILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSA 512
L + L I S+G L+ ++NG + + F + V L G N+++LLS
Sbjct: 482 FLKNGKSPLLTIFSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNLRSGINKLALLSI 541
Query: 513 TVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAK 572
+VGL N G+ F+ G+ GP+ L G + D+S KWTYK GL G + +
Sbjct: 542 SVGLPNVGTHFETWNAGVLGPITLKGL---NSGTWDMSGWKWTYKTGLKG-EALGLHTVT 597
Query: 573 AANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPT 632
++S ++ + +TWYK TF AP + P+ L++ MGKG W+NG ++GR+WP
Sbjct: 598 GSSSVEWVEGPSMAEKQPLTWYKATFNAPPGDAPLALDMGSMGKGQIWINGQSVGRHWPG 657
Query: 633 YLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNP 692
Y+A S C Y G Y KC +CG PSQ WYH+PRSW+ N LV+FEE+GG+P
Sbjct: 658 YIAR---GSCGDCSYAGTYDDKKCRTHCGEPSQRWYHIPRSWLTPTGNLLVVFEEWGGDP 714
Query: 693 SQINFQTVVVGTA 705
S+I+ V GTA
Sbjct: 715 SRISL--VERGTA 725
>gi|84579369|dbj|BAE72073.1| pear beta-galactosidase1 [Pyrus communis]
Length = 731
Score = 636 bits (1641), Expect = e-179, Method: Compositional matrix adjust.
Identities = 337/733 (45%), Positives = 444/733 (60%), Gaps = 58/733 (7%)
Query: 13 LCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGL 72
+ L+ +F+ + A VS+D +AI I+G+++IL+SGSIHYPRSTP MWPDLI+KAK+GGL
Sbjct: 11 ILLLFSCIFSAASA-SVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKAKDGGL 69
Query: 73 DAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPV 132
D I+TYVFWN HEP +Y F DL++FIK +Q GL+V LRIGPYVCAEWN+GGFPV
Sbjct: 70 DVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGFPV 129
Query: 133 WLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS 192
WL +PGI RT N+ F MQ FT IV M K EKLF SQGGPIIL+QIENE+G V
Sbjct: 130 WLKYVPGIA-FRTDNEPFKAAMQKFTEKIVSMMKAEKLFQSQGGPIILSQIENEFGPVEW 188
Query: 193 DYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKI 241
+ G GK+Y W A+MA LD GVPWIMC++ DAP P+ F PN PK+
Sbjct: 189 EIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDYKPKM 248
Query: 242 WTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT 301
WTE WTGW+ +GG P R AED+AF+VARF Q GG+F NYYMYHGGTNFGRT+GGP++
Sbjct: 249 WTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQSGGSFLNYYMYHGGTNFGRTAGGPFMA 308
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN----------- 350
TSYDYDAP+DEYG +PKWGHLR+LHK +K E L + + T G+
Sbjct: 309 TSYDYDAPLDEYGLPREPKWGHLRDLHKAIKPCESALVSVDPSVTKLGSNQEAHVFKSES 368
Query: 351 -----------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQ 393
S G Y+LP WS+SILPDCKTE +NTAKV +Q++ P
Sbjct: 369 DCAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGSQSSQVQMTPVH 428
Query: 394 AGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDP 452
+G + W+ + L+ L +Q + T D +DYLWYMT+ + D+
Sbjct: 429 SG-------FPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWYMTDITIGSDEA 481
Query: 453 ILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSA 512
L + L I+S+G L+ ++NG + + F + V L G N+++LLS
Sbjct: 482 FLKNGKSPLLTISSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNLRSGINKLALLSI 541
Query: 513 TVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAK 572
+VGL N G+ F+ G+ GP+ L G + D+S KWTYK GL G + +
Sbjct: 542 SVGLPNVGTHFETWNAGVLGPITLKGL---NSGTWDMSGWKWTYKTGLKG-EALGLHTVT 597
Query: 573 AANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPT 632
++S ++ + +TWYK TF AP + P+ L++ MGKG W+NG ++GR+WP
Sbjct: 598 GSSSVEWVEGPSMAKKQPLTWYKATFNAPPGDAPLALDMGSMGKGQIWINGQSVGRHWPG 657
Query: 633 YLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNP 692
Y+A S C Y G Y KC +CG PSQ WYH+PRSW+ N LV+FEE+GG+P
Sbjct: 658 YIARG---SCGDCSYAGTYDDKKCRTHCGEPSQRWYHIPRSWLTPTGNLLVVFEEWGGDP 714
Query: 693 SQINFQTVVVGTA 705
S I+ V GTA
Sbjct: 715 SGISL--VERGTA 725
>gi|224077880|ref|XP_002305449.1| predicted protein [Populus trichocarpa]
gi|222848413|gb|EEE85960.1| predicted protein [Populus trichocarpa]
Length = 731
Score = 636 bits (1641), Expect = e-179, Method: Compositional matrix adjust.
Identities = 344/733 (46%), Positives = 438/733 (59%), Gaps = 60/733 (8%)
Query: 13 LCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGL 72
L ++L T L + V++D +A+ I+G+RK+L SGSIHYPRSTP MW LI+KAK+GGL
Sbjct: 13 LSVVLLTSLQL-IQCNVTYDKKALIINGQRKVLFSGSIHYPRSTPEMWEGLIQKAKDGGL 71
Query: 73 DAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPV 132
D I+TYVFWN HEP Y+F G DL+RFIK + + GLYV LRIGPY+CAEWN+GGFPV
Sbjct: 72 DVIDTYVFWNLHEPSPGNYNFDGRYDLVRFIKLVHEAGLYVHLRIGPYICAEWNFGGFPV 131
Query: 133 WLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS 192
WL +PGI RT N+ F + MQ FT IV M K E LF SQGGPIIL+QIENEY
Sbjct: 132 WLKYVPGIS-FRTDNEPFKSAMQKFTQKIVQMMKDENLFESQGGPIILSQIENEYEPESK 190
Query: 193 DYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKI 241
+G G +Y+ W A MA S+D GVPW+MC+E DAP P+ F+PN P P +
Sbjct: 191 AFGSPGHAYMTWAAHMAISMDTGVPWVMCKEFDAPDPVINTCNGFYCDYFSPNKPYKPTM 250
Query: 242 WTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT 301
WTE WTGWF +GG + +R AEDLAFAVARF Q GG+ NYYMYHGGTNFGRTSGGP++T
Sbjct: 251 WTEAWTGWFTDFGGPNHQRPAEDLAFAVARFIQKGGSLVNYYMYHGGTNFGRTSGGPFIT 310
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG--------NSVS 353
TSYDYDAPIDEYG + QPK+GHL+ELHK +K EK L + T T G +S S
Sbjct: 311 TSYDYDAPIDEYGLIRQPKYGHLKELHKAIKLCEKALLAADSTVTSLGSYEQAHVFSSDS 370
Query: 354 GS---------------------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPN 392
G Y+LP WS+SILPDCK FNTA V QT+ P
Sbjct: 371 GGCAAFLSNYNTKQAARVKFNNIQYSLPPWSISILPDCKNVVFNTAHVGVQTSQVHMLP- 429
Query: 393 QAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDD 451
D L W+ E I+ V + L++Q + T D SDYLWY T+ + +
Sbjct: 430 ---TDSELLSWETFNEDISS--VDDDKMITVAGLLEQLNITRDTSDYLWYTTSVHISSSE 484
Query: 452 PILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLS 511
L G L + S+G LH ++NG S F +K GKN+ISLLS
Sbjct: 485 SFLRGGRLPVLTVQSAGHALHVFINGELSGSAHGTREQRRFTFTEDMKFHAGKNRISLLS 544
Query: 512 ATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNA 571
VGL N G +F+ GI GPV L G + +DL+ KW+YKVGL G D +
Sbjct: 545 VAVGLPNNGPRFETWNTGILGPVTLHGLDEGQ---RDLTWQKWSYKVGLKGEDMN--LRS 599
Query: 572 KAANSERGWSSKNVPLNRR--MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRY 629
+ + S W ++ + ++ +TWYK F +P +DP+ L++ MGKG W+NG+++GRY
Sbjct: 600 RKSVSLVDWIQGSLMVGKQQPLTWYKAYFNSPKGDDPLALDMGSMGKGQVWINGHSIGRY 659
Query: 630 WPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
W Y E CS C Y + +C CG P+Q WYHVPRSW+K N LVLFEE G
Sbjct: 660 WTLY--AEGNCS--GCSYSATFRPARCQLGCGQPTQKWYHVPRSWLKSTRNLLVLFEEIG 715
Query: 690 GNPSQINFQTVVV 702
G+ S+I+ +V
Sbjct: 716 GDASRISLVKRLV 728
>gi|61162199|dbj|BAD91081.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 725
Score = 636 bits (1641), Expect = e-179, Method: Compositional matrix adjust.
Identities = 335/725 (46%), Positives = 440/725 (60%), Gaps = 56/725 (7%)
Query: 13 LCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGL 72
+ L+ +F+ + A V +D +AI I+G+R+IL+SGSIHYPRSTPGMWPDLI+KAK GGL
Sbjct: 11 ILLLFSCIFSAASA-SVGYDHKAIIINGQRRILISGSIHYPRSTPGMWPDLIQKAKAGGL 69
Query: 73 DAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPV 132
D I+TYVFWN HEP +Y F DL++FIK +Q GL+V LRIGPYVCAEWN+GGFP+
Sbjct: 70 DVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGFPI 129
Query: 133 WLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS 192
WL +PGI RT N+ F MQ FT IV+M K EKLF +QGGPIIL+QIENE+G V
Sbjct: 130 WLKYVPGIA-FRTDNEPFKAAMQKFTEKIVNMMKAEKLFQTQGGPIILSQIENEFGPVEW 188
Query: 193 DYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKI 241
+ G GK+Y W A+MA LD GVPWIMC++ DAP P+ F PN PK+
Sbjct: 189 EIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGYYCENFKPNKVYKPKM 248
Query: 242 WTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT 301
WTE WTGW+ +GG P R AEDLAF+VARF Q GG+F NYYMYHGGTNFGRT+GGP++
Sbjct: 249 WTEVWTGWYTEFGGAIPTRPAEDLAFSVARFIQSGGSFFNYYMYHGGTNFGRTAGGPFMA 308
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN----------- 350
TSYDYDAP+DEYG L QPKWGHLR+LHK +KS E L + + T GN
Sbjct: 309 TSYDYDAPLDEYGLLQQPKWGHLRDLHKAIKSCEHALVAVDPSVTKLGNNQEAHVFNSKS 368
Query: 351 -----------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQ 393
S Y+LP WS+SILPDCKT FNTAKV + + +P
Sbjct: 369 GCAAFLANHDTKYSVRVSFGHGQYDLPPWSISILPDCKTAVFNTAKVAWKASEVQMKPVY 428
Query: 394 AGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDP 452
+ + W+ + G L+ L +Q T D +DYLWYMT+ + D+
Sbjct: 429 S-------RLPWQSFIEETTTSDETGTTTLDGLYEQIYMTRDATDYLWYMTDITIGSDEA 481
Query: 453 ILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSA 512
L L I S+G LH ++NG + + F + VKL G N+++LLS
Sbjct: 482 FLKNGKFPLLTIFSAGHALHVFINGQLSGTVYGSLENPKLTFSQNVKLRPGINKLALLSI 541
Query: 513 TVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAK 572
+VGL N G+ F+ G+ GP+ L G T D+S KWTYK+G+ G + +
Sbjct: 542 SVGLPNVGTHFETWNTGVLGPISLKGL---NTGTWDMSRWKWTYKIGMKG-ESLGLHTVT 597
Query: 573 AANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPT 632
++S ++ + +TWYK TF+AP + P+ L++ MGKG W+NG ++GR+WP
Sbjct: 598 GSSSVDWAEGPSMAQKQPLTWYKATFDAPPGHAPLALDMGSMGKGQIWINGQSVGRHWPG 657
Query: 633 YLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNP 692
Y+A+ S +C Y G + KC CG PSQ WYH+PRSW+ N LV+FEE+GG+P
Sbjct: 658 YIAQG---SCGNCYYAGTFNDKKCRTYCGKPSQRWYHIPRSWLTPTGNLLVVFEEWGGDP 714
Query: 693 SQINF 697
S ++
Sbjct: 715 SWMSL 719
>gi|449489943|ref|XP_004158465.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 1225
Score = 635 bits (1639), Expect = e-179, Method: Compositional matrix adjust.
Identities = 344/731 (47%), Positives = 433/731 (59%), Gaps = 57/731 (7%)
Query: 8 SRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKA 67
S+ +++ L L S+ V++D +A+ IDG+R+IL+SGSIHYPRSTP MWPDLI+KA
Sbjct: 5 SKIMVVFLGLVLWVCSSVMASVTYDHKALVIDGKRRILISGSIHYPRSTPQMWPDLIQKA 64
Query: 68 KEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNY 127
K+GGLD IETYVFWN HEP QY F +L+RF+K +Q GLYV LRIGPYVCAEWN+
Sbjct: 65 KDGGLDVIETYVFWNGHEPSPGQYYFEDRYELVRFVKLVQQAGLYVHLRIGPYVCAEWNF 124
Query: 128 GGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEY 187
GGFPVWL +PGI RT N F MQ FT IV M K EKL+ SQGGPIIL+QIENEY
Sbjct: 125 GGFPVWLKYVPGI-AFRTDNGPFKAAMQKFTAKIVSMMKGEKLYHSQGGPIILSQIENEY 183
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNP 236
G V + G GKSY W A+MA LD GVPW+MC++ DAP PM F PN
Sbjct: 184 GPVEWEIGAPGKSYTKWAAQMALGLDTGVPWVMCKQEDAPDPMIDTCNGFYCENFEPNKA 243
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
PK+WTE WTGWF +GG P R EDLA+AVARF Q G+ NYYMYHGGTNFGRT+G
Sbjct: 244 YKPKMWTEAWTGWFTEFGGPVPYRPVEDLAYAVARFIQNRGSLINYYMYHGGTNFGRTAG 303
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG------- 349
GP++ TSYDYDAPIDEYG + QPKWGHLR+LHK +K E L + T + G
Sbjct: 304 GPFIATSYDYDAPIDEYGLIRQPKWGHLRDLHKAIKLCEPALVSVDPTVSSLGSKQEAHV 363
Query: 350 -NSVSGS---------------------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK 387
N+ SG Y+LP WSVSILPDCKT FNTAKVN +
Sbjct: 364 YNTRSGECAAFLANYDPSTSVRVTFGNHPYDLPPWSVSILPDCKTVVFNTAKVNAPSYWP 423
Query: 388 VKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNAD 446
P + + W + L++Q S T D +DYLWYMT+
Sbjct: 424 KMTPISS--------FSWHSYNEETASAYADDTTTMAGLVEQISITRDATDYLWYMTDIR 475
Query: 447 LKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQ 506
+ ++ L L I S+G LH ++NG + + F + V L G N+
Sbjct: 476 IDSNEGFLKSGQWPLLTIFSAGHALHVFINGQLSGTVYGGLDNPKLTFSKYVNLRPGVNK 535
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDK 566
+S+LS VGL N G F+ GI GPV L G +D+S +KW+YKVGL G +
Sbjct: 536 LSMLSVAVGLPNVGVHFETWNAGILGPVTL---KGLNEGTRDMSGYKWSYKVGLKG-EAL 591
Query: 567 KFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNL 626
+ ++S + V + +TWYKTTF AP N+P+ L++ MGKG W+NG ++
Sbjct: 592 NLHTVSGSSSVEWMTGSLVSQKQPLTWYKTTFNAPGGNEPLALDMGSMGKGQVWINGESI 651
Query: 627 GRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
GR+WP Y A S C Y G + KC ++CG PSQ WYHVPR+W+K N LV+FE
Sbjct: 652 GRHWPAYTARG---SCGKCYYGGIFTEKKCHFSCGEPSQRWYHVPRAWLKPSGNILVIFE 708
Query: 687 EFGGNPSQINF 697
E+GGNP I+
Sbjct: 709 EWGGNPDGISL 719
Score = 364 bits (935), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 211/498 (42%), Positives = 279/498 (56%), Gaps = 43/498 (8%)
Query: 231 FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTN 290
F PN PKIWTENW+GW+ ++GG P R ED+AF+VARF Q GG+ NYYMYHGGTN
Sbjct: 734 FKPNQIYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQNGGSLVNYYMYHGGTN 793
Query: 291 FGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG- 349
FGRTS G ++TTSYD+DAPIDEYG L +PKWGHLR+LHK +K E L + T+T G
Sbjct: 794 FGRTS-GLFVTTSYDFDAPIDEYGLLREPKWGHLRDLHKAIKLCEPALVSADPTSTWLGK 852
Query: 350 -------NSVSGS---------------------SYNLPAWSVSILPDCKTEEFNTAKVN 381
S SG+ Y+LP WS+SILPDCKT FNTA+V
Sbjct: 853 DQEARVFKSSSGACAAFLANYDTSAFVRVNFWNHPYDLPPWSISILPDCKTVTFNTARV- 911
Query: 382 TQTNVKVKRPNQAGNDQAPL-QWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYL 439
+ + K+ PN P+ + W K + L++Q S T D +DYL
Sbjct: 912 -RRDPKLFIPNLLMAKMTPISSFWWLSYKEEPASAYAKDTTTKDGLVEQVSVTWDTTDYL 970
Query: 440 WYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVK 499
WYMT+ + + L L +NS+G +LH ++NG S + F + V
Sbjct: 971 WYMTDIRIDSTEGFLKSGQWPLLTVNSAGHILHVFINGQLSGSVYGSLEDPRITFSKYVN 1030
Query: 500 LTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVG 559
L +G N++S+LS TVGL N G FD G+ GPV L G +D+S +KW+YKVG
Sbjct: 1031 LKQGVNKLSMLSVTVGLPNVGLHFDTWNAGVLGPVTL---KGLNEGTRDMSKYKWSYKVG 1087
Query: 560 LYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFA 619
L G + Y+ K +NS + W K + +TWYKTTF P N+P+ L++ M KG
Sbjct: 1088 LRG-EILNLYSVKGSNSVQ-W-MKGSFQKQPLTWYKTTFNTPAGNEPLALDMSSMSKGQI 1144
Query: 620 WVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGV 679
WVNG ++GRY+P Y+A C Y G + KC +NCG PSQ WYH+PR W+
Sbjct: 1145 WVNGRSIGRYFPGYIASG---KCNKCSYTGFFTEKKCLWNCGGPSQKWYHIPRDWLSPNG 1201
Query: 680 NTLVLFEEFGGNPSQINF 697
N L++ EE GGNP I+
Sbjct: 1202 NLLIILEEIGGNPQGISL 1219
>gi|14970843|emb|CAC44502.1| beta-galactosidase [Fragaria x ananassa]
Length = 722
Score = 635 bits (1638), Expect = e-179, Method: Compositional matrix adjust.
Identities = 339/731 (46%), Positives = 442/731 (60%), Gaps = 60/731 (8%)
Query: 8 SRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKA 67
SR + L L + + +LA V +D RAI ++G+R+IL+SGSIHYPRSTP MWPDL++KA
Sbjct: 7 SRNMFFLLFLVSWLSSALA-SVGYDHRAIIVNGKRRILISGSIHYPRSTPEMWPDLLQKA 65
Query: 68 KEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNY 127
K+GGLD ++TYVFWN HEP +Y F DL++FIK Q GLYV LRIGPY+CAEWN+
Sbjct: 66 KDGGLDVLQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLAQQHGLYVHLRIGPYICAEWNF 125
Query: 128 GGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEY 187
GGFPVWL +PGI RT N+ FM M+ FT IV M K E+LF +QGGPIIL+QIENEY
Sbjct: 126 GGFPVWLKYVPGIA-FRTDNRPFMAAMEKFTQKIVYMMKAERLFQTQGGPIILSQIENEY 184
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNP 236
G V + G GKSY W AKMA L+ GVPW+MC++ DAP P+ FTPN
Sbjct: 185 GPVEWEIGAPGKSYTQWAAKMAVGLNTGVPWVMCKQEDAPDPIIDTCNGFYCENFTPNKN 244
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
PK+WTE WTGW+ +GG P R A+DLAF+VARF Q GG+F NYYMYHGGTNFGRT+G
Sbjct: 245 YKPKMWTEIWTGWYTEFGGAVPTRPAQDLAFSVARFIQNGGSFANYYMYHGGTNFGRTAG 304
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN------ 350
GP++ TSYDYDAP+DEYG +PK+ HL+ +HK +K E L + + GN
Sbjct: 305 GPFIATSYDYDAPLDEYGLPREPKYSHLKYMHKAIKMAEPALLATDAAVSKLGNNQEAHV 364
Query: 351 --SVSGSS--------------------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKV 388
S SG + YNLP WS+SILPDCKTE FNTA+V K+
Sbjct: 365 YQSRSGCAAFLANYDTKYPVRVTFWNKQYNLPPWSISILPDCKTEVFNTARVGQSPPTKM 424
Query: 389 KRPNQAGNDQAPLQWKWRPEMINDFVVRGKGH-FALNTLIDQKS-TNDVSDYLWYMTNAD 446
A L W+ I D + F L +Q S T D +DYLWYMT+
Sbjct: 425 -------TPVAHLSWQ---AYIEDVATSADDNAFTSVGLREQISLTWDNTDYLWYMTDIT 474
Query: 447 LKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQ 506
+ ++ L TL+++S+G LH ++NG S + F + VKL G N+
Sbjct: 475 IGPNEQFLRTGKYPTLKVDSAGHALHVFINGQLSGSAYGTLAFPKLEFNQGVKLRAGINK 534
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDK 566
++LLS +VGL N G F+ G+ GPV L AG + D++ +WTYK+G+ G +D
Sbjct: 535 LALLSVSVGLANVGLHFETWNTGVLGPVTL---AGVNSGTWDMTRWQWTYKIGMRG-EDM 590
Query: 567 KFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNL 626
+ ++S + R +TWYK AP N P+ L++ MGKG W+NG ++
Sbjct: 591 SLHTVSGSSSVEWVQGSLLAQYRPLTWYKAILNAPPGNAPLALDMGSMGKGQMWINGQSI 650
Query: 627 GRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
GR+WP Y A S +C Y G Y +KC NCG PSQ WYHVPRSW+K N LV+FE
Sbjct: 651 GRHWPAYKAH---GSCGACYYAGTYTENKCRTNCGQPSQRWYHVPRSWLKSSGNLLVVFE 707
Query: 687 EFGGNPSQINF 697
E+GG+P++I+
Sbjct: 708 EWGGDPTKISL 718
>gi|15219534|ref|NP_175127.1| beta-galactosidase 5 [Arabidopsis thaliana]
gi|75192251|sp|Q9MAJ7.1|BGAL5_ARATH RecName: Full=Beta-galactosidase 5; Short=Lactase 5; Flags:
Precursor
gi|7767665|gb|AAF69162.1|AC007915_14 F27F5.20 [Arabidopsis thaliana]
gi|17979002|gb|AAL47461.1| At1g45130/F27F5_20 [Arabidopsis thaliana]
gi|20334754|gb|AAM16238.1| At1g45130/F27F5_20 [Arabidopsis thaliana]
gi|332193961|gb|AEE32082.1| beta-galactosidase 5 [Arabidopsis thaliana]
Length = 732
Score = 635 bits (1638), Expect = e-179, Method: Compositional matrix adjust.
Identities = 347/742 (46%), Positives = 442/742 (59%), Gaps = 63/742 (8%)
Query: 1 MATLKHCSRAILLCLILQTLFNLSL--AYRVSHDGRAITIDGERKILLSGSIHYPRSTPG 58
M T IL L+ L S+ V++D +AI I+G R+ILLSGSIHYPRSTP
Sbjct: 1 MGTTILVLSKILTFLLTTMLIGSSVIQCSSVTYDKKAIVINGHRRILLSGSIHYPRSTPE 60
Query: 59 MWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIG 118
MW DLIKKAK+GGLD I+TYVFWN HEP Y+F G DL+RFIKTIQ+ GLYV LRIG
Sbjct: 61 MWEDLIKKAKDGGLDVIDTYVFWNGHEPSPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIG 120
Query: 119 PYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPI 178
PYVCAEWN+GGFPVWL + GI RT N F + MQ FT IV M K+ + FASQGGPI
Sbjct: 121 PYVCAEWNFGGFPVWLKYVDGIS-FRTDNGPFKSAMQGFTEKIVQMMKEHRFFASQGGPI 179
Query: 179 ILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-------- 230
IL+QIENE+ + G AG SY+NW AKMA L+ GVPW+MC+E DAP P+
Sbjct: 180 ILSQIENEFEPDLKGLGPAGHSYVNWAAKMAVGLNTGVPWVMCKEDDAPDPIINTCNGFY 239
Query: 231 ---FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHG 287
FTPN P P +WTE W+GWF +GG PKR EDLAF VARF Q GG++ NYYMYHG
Sbjct: 240 CDYFTPNKPYKPTMWTEAWSGWFTEFGGTVPKRPVEDLAFGVARFIQKGGSYINYYMYHG 299
Query: 288 GTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTD 347
GTNFGRT+GGP++TTSYDYDAPIDEYG + +PK+ HL++LH+ +K E L + T
Sbjct: 300 GTNFGRTAGGPFITTSYDYDAPIDEYGLVQEPKYSHLKQLHQAIKQCEAALVSSDPHVTK 359
Query: 348 YGN-----------------------------SVSGSSYNLPAWSVSILPDCKTEEFNTA 378
GN + Y LPAWS+SILPDC+ FNTA
Sbjct: 360 LGNYEEAHVFTAGKGSCVAFLTNYHMNAPAKVVFNNRHYTLPAWSISILPDCRNVVFNTA 419
Query: 379 KVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRG-KGHFALNTLIDQKS-TNDVS 436
V +T+ P +G+ + D G +G L++Q + T D +
Sbjct: 420 TVAAKTSHVQMVP--SGSILYSVA-----RYDEDIATYGNRGTITARGLLEQVNVTRDTT 472
Query: 437 DYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFER 496
DYLWY T+ D+K + L G TL ++S+G +H +VNG++ S + F
Sbjct: 473 DYLWYTTSVDIKASESFLRGGKWPTLTVDSAGHAVHVFVNGHFYGSAFGTRENRKFSFSS 532
Query: 497 PVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTY 556
V L G N+I+LLS VGL N G F+ GI G V+L G DE KDLS KWTY
Sbjct: 533 QVNLRGGANKIALLSVAVGLPNVGPHFETWATGIVGSVVLHGL--DEG-NKDLSWQKWTY 589
Query: 557 KVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRR--MTWYKTTFEAPLENDPVVLNLQGM 614
+ GL G + + +S W ++ + +TWYK F+AP N+P+ L+L+ M
Sbjct: 590 QAGLRG--ESMNLVSPTEDSSVDWIKGSLAKQNKQPLTWYKAYFDAPRGNEPLALDLKSM 647
Query: 615 GKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSW 674
GKG AW+NG ++GRYW + + G SC+Y G Y +KC CG P+Q WYHVPRSW
Sbjct: 648 GKGQAWINGQSIGRYWMAFAKGDCG----SCNYAGTYRQNKCQSGCGEPTQRWYHVPRSW 703
Query: 675 IKDGVNTLVLFEEFGGNPSQIN 696
+K N LVLFEE GG+ S+++
Sbjct: 704 LKPKGNLLVLFEELGGDISKVS 725
>gi|297846860|ref|XP_002891311.1| hypothetical protein ARALYDRAFT_473836 [Arabidopsis lyrata subsp.
lyrata]
gi|297337153|gb|EFH67570.1| hypothetical protein ARALYDRAFT_473836 [Arabidopsis lyrata subsp.
lyrata]
Length = 732
Score = 635 bits (1637), Expect = e-179, Method: Compositional matrix adjust.
Identities = 348/742 (46%), Positives = 440/742 (59%), Gaps = 63/742 (8%)
Query: 1 MATLKHCSRAILLCLILQTLFNLSL--AYRVSHDGRAITIDGERKILLSGSIHYPRSTPG 58
M T IL L+ L S+ V++D +AI I+G R+ILLSGSIHYPRSTP
Sbjct: 1 MGTTILVLSKILTFLLTTMLIGSSMIQCSSVTYDKKAIVINGHRRILLSGSIHYPRSTPE 60
Query: 59 MWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIG 118
MW DLIKKAK+GGLD I+TYVFWN HEP Y+F G DL+RFIKTIQ+ GLYV LRIG
Sbjct: 61 MWEDLIKKAKDGGLDVIDTYVFWNGHEPSPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIG 120
Query: 119 PYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPI 178
PYVCAEWN+GGFPVWL + GI RT N F MQ FT IV M K+ + FASQGGPI
Sbjct: 121 PYVCAEWNFGGFPVWLKYVDGIS-FRTDNGPFKAAMQGFTEKIVQMMKEHRFFASQGGPI 179
Query: 179 ILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-------- 230
IL+QIENE+ + G AG SY+NW AKMA L+ GVPW+MC+E DAP P+
Sbjct: 180 ILSQIENEFEPELKGLGPAGHSYVNWAAKMAVGLNTGVPWVMCKEDDAPDPIINSCNGFY 239
Query: 231 ---FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHG 287
FTPN P P +WTE W+GWF +GG PKR EDLAF VARF Q GG++ NYYMYHG
Sbjct: 240 CDYFTPNKPYKPTMWTEAWSGWFTEFGGTIPKRPVEDLAFGVARFIQKGGSYINYYMYHG 299
Query: 288 GTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTD 347
GTNFGRT+GGP++TTSYDYDAPIDEYG + +PK+ HL++LH+ +K E L + T
Sbjct: 300 GTNFGRTAGGPFITTSYDYDAPIDEYGLVQEPKYSHLKQLHQAIKQCEAALVSSDPHVTK 359
Query: 348 YGN-----------------------------SVSGSSYNLPAWSVSILPDCKTEEFNTA 378
GN + Y LPAWS+SILPDC+ FNTA
Sbjct: 360 LGNYEEAHVFTAGKGSCVAFLTNYHMNAPAKVVFNNRHYTLPAWSISILPDCRNVVFNTA 419
Query: 379 KVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRG-KGHFALNTLIDQKS-TNDVS 436
V +T+ P +G+ + D G +G L++Q + T D +
Sbjct: 420 TVAAKTSHVQMMP--SGSILYSVA-----RYDEDIATYGDRGTITARGLLEQVNVTRDTT 472
Query: 437 DYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFER 496
DYLWY T+ D+K + L G TL ++S+G +H +VNG++ S + F
Sbjct: 473 DYLWYTTSVDIKASESFLRGGKWPTLTVDSAGHAVHVFVNGHFYGSAFGTRENRKFSFSS 532
Query: 497 PVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTY 556
V L G N+I+LLS VGL N G F+ GI G V+L G DE KDLS KWTY
Sbjct: 533 QVNLRGGANRIALLSVAVGLPNVGPHFETWATGIVGSVVLHGL--DEG-NKDLSWQKWTY 589
Query: 557 KVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRR--MTWYKTTFEAPLENDPVVLNLQGM 614
+ GL G K + +S W ++ + +TWYK F+AP N+P+ L+L+ M
Sbjct: 590 QAGLRGEAMKLV--SPTEDSSVDWIKGSLAKQNKQPLTWYKAYFDAPRGNEPLALDLKSM 647
Query: 615 GKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSW 674
GKG AW+NG ++GRYW + G SC+Y G Y +KC CG P+Q WYHVPRSW
Sbjct: 648 GKGQAWINGQSIGRYWMAFAKGNCG----SCNYAGTYRQNKCQSGCGEPTQRWYHVPRSW 703
Query: 675 IKDGVNTLVLFEEFGGNPSQIN 696
+K N LVLFEE GG+ S+++
Sbjct: 704 LKPRGNLLVLFEELGGDISKVS 725
>gi|20384648|gb|AAK31801.1| beta-galactosidase [Citrus sinensis]
Length = 737
Score = 634 bits (1636), Expect = e-179, Method: Compositional matrix adjust.
Identities = 341/727 (46%), Positives = 435/727 (59%), Gaps = 60/727 (8%)
Query: 15 LILQTLFNLSLAY---RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGG 71
L+L + + +++ VS+D +A+ I+G+++IL+SGSIHYPRSTP MWPDLI+KAK+GG
Sbjct: 22 LVLLSFCSWEISFVKASVSYDHKAVIINGQKRILISGSIHYPRSTPEMWPDLIQKAKDGG 81
Query: 72 LDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFP 131
LD I+TYVFWN HEP + Y F DL+RFIK +Q GLYV LRIGPYVCAEWNYGGFP
Sbjct: 82 LDVIQTYVFWNGHEPTQGNYYFQDRYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYGGFP 141
Query: 132 VWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVM 191
VWL +PGI E RT N F M FT IV M K EKLF +QGGPIIL+QIENE+G V
Sbjct: 142 VWLKYVPGI-EFRTDNGPFKAAMHKFTEKIVSMMKAEKLFQTQGGPIILSQIENEFGPVE 200
Query: 192 SDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPK 240
D G GK+Y W A+MA L+ GVPW+MC++ DAP P+ F PN PK
Sbjct: 201 WDIGAPGKAYAKWAAQMAVGLNTGVPWVMCKQDDAPDPVINTCNGFYCEKFVPNQNYKPK 260
Query: 241 IWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL 300
+WTE WTGWF +G P R AEDL F+VARF Q GG+F NYYMYHGGTNFGRTSGG ++
Sbjct: 261 MWTEAWTGWFTEFGSAVPTRPAEDLVFSVARFIQSGGSFINYYMYHGGTNFGRTSGG-FV 319
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG--------NSV 352
TSYDYDAPIDEYG LN+PKWGHLR LHK +K E L + T G NS+
Sbjct: 320 ATSYDYDAPIDEYGLLNEPKWGHLRGLHKAIKLCEPALVSVDPTVKSLGENQEAHVFNSI 379
Query: 353 SG---------------------SSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRP 391
SG + Y+LP WS+S+LPDCKT FNTA+V Q++ K P
Sbjct: 380 SGKCAAFLANYDTTFSAKVSFGNAQYDLPPWSISVLPDCKTAVFNTARVGVQSSQKKFVP 439
Query: 392 NQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDD 450
+ W+ + F + L +Q T D SDYLWYMT+ ++ +
Sbjct: 440 -------VINAFSWQSYIEETASSTDDNTFTKDGLWEQVYLTADASDYLWYMTDVNIGSN 492
Query: 451 DPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLL 510
+ L + L I S+G L ++NG + + F + VKL G N+ISLL
Sbjct: 493 EGFLKNGQDPLLTIWSAGHALQVFINGQLSGTVYGSLENPKLTFSKNVKLRAGVNKISLL 552
Query: 511 SATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYN 570
S +VGL N G+ F+ G+ GPV L G +D+S KWTYK+GL G + +
Sbjct: 553 STSVGLPNVGTHFEKWNAGVLGPVTLKGL---NEGTRDISKQKWTYKIGLKG-EALSLHT 608
Query: 571 AKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
++S ++ + MTWYKTTF P NDP+ L++ MGKG W+NG ++GR+W
Sbjct: 609 VSGSSSVEWAQGASLAQKQPMTWYKTTFNVPPGNDPLALDMGAMGKGMVWINGQSIGRHW 668
Query: 631 PTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGG 690
P Y+ + C C+Y G Y KC CG PSQ WYHVPRS +K N LV+FEE+GG
Sbjct: 669 PGYIGNGN-CG--GCNYAGTYTEKKCRTYCGKPSQRWYHVPRSRLKPSGNLLVVFEEWGG 725
Query: 691 NPSQINF 697
P I+
Sbjct: 726 EPHWISL 732
>gi|51507377|emb|CAH18936.1| beta-galactosidase [Pyrus communis]
Length = 724
Score = 634 bits (1634), Expect = e-179, Method: Compositional matrix adjust.
Identities = 335/733 (45%), Positives = 444/733 (60%), Gaps = 58/733 (7%)
Query: 13 LCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGL 72
+ L+ +F+ + A VS+D +AI I+G+++IL+SGSIHYPRSTP MWPDLI+KAK+GGL
Sbjct: 4 ILLLFSCIFSAASA-SVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKAKDGGL 62
Query: 73 DAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPV 132
D I+TYVFWN HEP +Y F DL++FIK +Q GL+V LRIGPYVCAEWN+GGFPV
Sbjct: 63 DVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGFPV 122
Query: 133 WLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS 192
WL +PGI RT N+ F MQ FT IV M K EKLF SQGGPIIL+QIENE+G V
Sbjct: 123 WLKYVPGIA-FRTDNEPFKAAMQKFTEKIVSMMKAEKLFQSQGGPIILSQIENEFGPVEW 181
Query: 193 DYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKI 241
+ G GK+Y W A+MA LD GVPWIMC++ DAP P+ F PN PK+
Sbjct: 182 EIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDYKPKM 241
Query: 242 WTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT 301
WTE WTGW+ +GG P R AED+AF+VARF Q GG+F NYYMYHGGTNFGRT+GGP++
Sbjct: 242 WTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQSGGSFLNYYMYHGGTNFGRTAGGPFMA 301
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN----------- 350
TSYDYDAP+DEYG +PKWGHLR+LHK +K E L + + T G+
Sbjct: 302 TSYDYDAPLDEYGLPREPKWGHLRDLHKAIKPCESALVSVDPSVTKLGSNQEAHVFKSES 361
Query: 351 -----------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQ 393
S G Y+LP WS+SILPDCKTE +NTAKV +Q++ P
Sbjct: 362 DCAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGSQSSQVQMTPVH 421
Query: 394 AGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDP 452
+G + W+ + ++ L +Q + T D +DYLWYMT+ + D+
Sbjct: 422 SG-------FPWQSFIEETTSSDETDTTYMDGLYEQINITRDTTDYLWYMTDITIGSDEA 474
Query: 453 ILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSA 512
L + L I+S+G L+ ++NG + + F + V L G N+++LLS
Sbjct: 475 FLKNGKSPLLTISSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNLRSGINKLALLSI 534
Query: 513 TVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAK 572
+VGL N G+ F+ G+ GP+ L G + D+S KWTYK GL G + +
Sbjct: 535 SVGLPNVGTHFETWNAGVLGPITLKGL---NSGTWDMSGWKWTYKTGLKG-EALGLHTVT 590
Query: 573 AANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPT 632
++S ++ + +TW+K TF AP + P+ L++ MGKG W+NG ++GR+WP
Sbjct: 591 GSSSVEWVEGPSMAKKQPLTWHKATFNAPPGDAPLALDMGSMGKGQIWINGQSVGRHWPG 650
Query: 633 YLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNP 692
Y+A S C Y G Y KC +CG PSQ WYH+PRSW+ N LV+FEE+GG+P
Sbjct: 651 YIAR---GSCGDCSYAGTYDDKKCRTHCGEPSQRWYHIPRSWLTPTGNLLVVFEEWGGDP 707
Query: 693 SQINFQTVVVGTA 705
S I+ V GTA
Sbjct: 708 SGISL--VERGTA 718
>gi|16604400|gb|AAL24206.1| At1g45130/F27F5_20 [Arabidopsis thaliana]
Length = 732
Score = 633 bits (1633), Expect = e-178, Method: Compositional matrix adjust.
Identities = 346/742 (46%), Positives = 441/742 (59%), Gaps = 63/742 (8%)
Query: 1 MATLKHCSRAILLCLILQTLFNLSL--AYRVSHDGRAITIDGERKILLSGSIHYPRSTPG 58
M T IL L+ L S+ V++D +AI I+G R+ILLSGSIHYPRSTP
Sbjct: 1 MGTTILVLSKILTFLLTTMLIGSSVIQCSSVTYDKKAIVINGHRRILLSGSIHYPRSTPE 60
Query: 59 MWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIG 118
MW DLIKKAK+GGLD I+TYVFWN HEP Y+F G DL+RFIKTIQ+ GLYV LRIG
Sbjct: 61 MWEDLIKKAKDGGLDVIDTYVFWNGHEPSPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIG 120
Query: 119 PYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPI 178
PYVCAEWN+GGFPVWL + GI RT N F + MQ FT IV M K+ + FASQGGPI
Sbjct: 121 PYVCAEWNFGGFPVWLKYVDGIS-FRTDNGPFKSAMQGFTEKIVQMMKEHRFFASQGGPI 179
Query: 179 ILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-------- 230
IL+QIENE+ + G AG SY+NW AKMA L+ GVPW+MC+E DAP P+
Sbjct: 180 ILSQIENEFEPDLKGLGPAGHSYVNWAAKMAVGLNTGVPWVMCKEDDAPDPIINTCNGFY 239
Query: 231 ---FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHG 287
FTPN P P +WTE W+GWF +GG PKR EDLAF VARF Q GG++ NYYMYHG
Sbjct: 240 CDYFTPNKPYKPTMWTEAWSGWFTEFGGTVPKRPVEDLAFGVARFIQKGGSYINYYMYHG 299
Query: 288 GTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTD 347
GTNFGRT+GGP++TTSYDYDAPIDEYG + +PK+ HL++LH+ +K E L + T
Sbjct: 300 GTNFGRTAGGPFITTSYDYDAPIDEYGLVQEPKYSHLKQLHQAIKQCEAALVSSDPHVTK 359
Query: 348 YGN-----------------------------SVSGSSYNLPAWSVSILPDCKTEEFNTA 378
GN + Y LPAWS+SILPDC+ FNTA
Sbjct: 360 LGNYEEAHVFTAGKGSCVAFLTNYHMNAPAKVVFNNRHYTLPAWSISILPDCRNVVFNTA 419
Query: 379 KVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRG-KGHFALNTLIDQKS-TNDVS 436
V +T+ P +G+ + D G +G L++Q + T D +
Sbjct: 420 TVAAKTSHVQMVP--SGSILYSVA-----RYDEDIATYGNRGTITARGLLEQVNVTRDTT 472
Query: 437 DYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFER 496
DYLWY T+ D+K + L G TL ++S+G +H +VNG++ S + F
Sbjct: 473 DYLWYTTSVDIKASESFLRGGKWPTLTVDSAGHAVHVFVNGHFYGSAFGTRENRKFSFSS 532
Query: 497 PVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTY 556
V L G N+I+LLS VGL N G F+ GI G V+L G DE KDLS KWTY
Sbjct: 533 QVNLRGGANKIALLSVAVGLPNVGPHFETWATGIVGSVVLHGL--DEG-NKDLSWQKWTY 589
Query: 557 KVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRR--MTWYKTTFEAPLENDPVVLNLQGM 614
+ GL G + + +S W ++ + +TWYK F+ P N+P+ L+L+ M
Sbjct: 590 QAGLRG--ESMNLVSPTEDSSVDWIKGSLAKQNKQPLTWYKAYFDVPRGNEPLALDLKSM 647
Query: 615 GKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSW 674
GKG AW+NG ++GRYW + + G SC+Y G Y +KC CG P+Q WYHVPRSW
Sbjct: 648 GKGQAWINGQSIGRYWMAFAKGDCG----SCNYAGTYRQNKCQSGCGEPTQRWYHVPRSW 703
Query: 675 IKDGVNTLVLFEEFGGNPSQIN 696
+K N LVLFEE GG+ S+++
Sbjct: 704 LKPKGNLLVLFEELGGDISKVS 725
>gi|6686882|emb|CAB64741.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 732
Score = 633 bits (1632), Expect = e-178, Method: Compositional matrix adjust.
Identities = 347/742 (46%), Positives = 440/742 (59%), Gaps = 63/742 (8%)
Query: 1 MATLKHCSRAILLCLILQTLFNLSL--AYRVSHDGRAITIDGERKILLSGSIHYPRSTPG 58
M T IL L+ L S+ V++D +AI I+G R+ILLSGSIHYPRSTP
Sbjct: 1 MGTTILVLSKILTFLLTTMLIGSSVIQCSSVTYDKKAIVINGHRRILLSGSIHYPRSTPE 60
Query: 59 MWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIG 118
MW DLIKKAK+GGLD I+TYVFWN HEP Y+F G DL+RFIKTIQ+ GLYV LRIG
Sbjct: 61 MWEDLIKKAKDGGLDVIDTYVFWNGHEPSPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIG 120
Query: 119 PYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPI 178
PYVCAEWN+GGFPVWL + GI RT N F + MQ FT IV M K+ + FASQGGPI
Sbjct: 121 PYVCAEWNFGGFPVWLKYVDGIS-FRTDNGPFKSAMQGFTEKIVQMMKEHRFFASQGGPI 179
Query: 179 ILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-------- 230
IL+QIENE+ + G AG SY+NW AKMA L+ GVPW+MC+E DAP P+
Sbjct: 180 ILSQIENEFEPDLKGLGPAGHSYVNWAAKMAVGLNTGVPWVMCKEDDAPDPIINTCNGFY 239
Query: 231 ---FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHG 287
FTPN P P +WTE W+GWF +GG PKR EDLAF VARF Q GG++ NYYMYHG
Sbjct: 240 CDYFTPNKPYKPTMWTEAWSGWFTEFGGTVPKRPVEDLAFGVARFIQKGGSYINYYMYHG 299
Query: 288 GTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTD 347
GTNFGRT+GGP++TTSYDYDAPIDEYG + +PK+ HL++LH+ +K E L + T
Sbjct: 300 GTNFGRTAGGPFITTSYDYDAPIDEYGLVQEPKYSHLKQLHQAIKQCEAALVSSDPHVTK 359
Query: 348 YGN-----------------------------SVSGSSYNLPAWSVSILPDCKTEEFNTA 378
GN + Y LPAWS+SILPDC+ FNTA
Sbjct: 360 LGNYEEAHVFTAGKGSCVAFLTNYHMNAPAKVVFNNRHYTLPAWSISILPDCRNVVFNTA 419
Query: 379 KVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGK-GHFALNTLIDQKS-TNDVS 436
V +T+ P +G+ + D G G L++Q + T D +
Sbjct: 420 TVAAKTSHVQMVP--SGSILYSVA-----RYDEDIATYGNPGTITARGLLEQVNVTRDTT 472
Query: 437 DYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFER 496
DYLWY T+ D+K + L G TL ++S+G +H +VNG++ S + F
Sbjct: 473 DYLWYTTSVDIKASESFLRGGKWPTLTVDSAGHAVHVFVNGHFYGSAFGTRENRKFSFSS 532
Query: 497 PVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTY 556
V L G N+I+LLS VGL N G F+ GI G V L G DE KDLS KWTY
Sbjct: 533 QVNLRGGANKIALLSVAVGLPNVGPHFETWATGIVGSVALHGL--DEG-NKDLSWQKWTY 589
Query: 557 KVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRR--MTWYKTTFEAPLENDPVVLNLQGM 614
+ GL G + + +S W ++ + +TWYK F+AP N+P+ L+L+ M
Sbjct: 590 QAGLRG--ESMNLVSPTEDSSVDWIKGSLAKQNKQPLTWYKAYFDAPRGNEPLALDLKSM 647
Query: 615 GKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSW 674
GKG AW+NG ++GRYW + + G SC+Y G Y +KC CG P+Q WYHVPRSW
Sbjct: 648 GKGQAWINGQSIGRYWMAFAKGDCG----SCNYAGTYRQNKCQSGCGEPTQRWYHVPRSW 703
Query: 675 IKDGVNTLVLFEEFGGNPSQIN 696
+K N LVLFEE GG+ S+++
Sbjct: 704 LKPKGNLLVLFEELGGDISKVS 725
>gi|186510990|ref|NP_190852.2| beta-galactosidase 2 [Arabidopsis thaliana]
gi|332278160|sp|Q9LFA6.2|BGAL2_ARATH RecName: Full=Beta-galactosidase 2; Short=Lactase 2; Flags:
Precursor
gi|13605857|gb|AAK32914.1|AF367327_1 AT3g52840/F8J2_10 [Arabidopsis thaliana]
gi|6686876|emb|CAB64738.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|23308221|gb|AAN18080.1| At3g52840/F8J2_10 [Arabidopsis thaliana]
gi|332645478|gb|AEE78999.1| beta-galactosidase 2 [Arabidopsis thaliana]
Length = 727
Score = 633 bits (1632), Expect = e-178, Method: Compositional matrix adjust.
Identities = 338/727 (46%), Positives = 440/727 (60%), Gaps = 57/727 (7%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
IL L +L + + A V++D +A+ I+G+R+IL+SGSIHYPRSTP MWPDLIKKAKEG
Sbjct: 12 ILAILCFSSLIHSTEAV-VTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKAKEG 70
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GLD I+TYVFWN HEP Y F DL++F K + GLY+ LRIGPYVCAEWN+GGF
Sbjct: 71 GLDVIQTYVFWNGHEPSPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNFGGF 130
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
PVWL +PG+ RT N+ F MQ FT IVDM K+EKLF +QGGPIIL+QIENEYG +
Sbjct: 131 PVWLKYVPGMV-FRTDNEPFKIAMQKFTKKIVDMMKEEKLFETQGGPIILSQIENEYGPM 189
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSP 239
+ G AGK+Y W A+MA L GVPWIMC++ DAP P+ F PN+ N P
Sbjct: 190 QWEMGAAGKAYSKWTAEMALGLSTGVPWIMCKQEDAPYPIIDTCNGFYCEGFKPNSDNKP 249
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
K+WTENWTGWF +GG P R ED+AF+VARF Q GG+F NYYMY+GGTNF RT+G +
Sbjct: 250 KLWTENWTGWFTEFGGAIPNRPVEDIAFSVARFIQNGGSFMNYYMYYGGTNFDRTAG-VF 308
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNS-------- 351
+ TSYDYDAPIDEYG L +PK+ HL+ELHK++K E L + T T G+
Sbjct: 309 IATSYDYDAPIDEYGLLREPKYSHLKELHKVIKLCEPALVSVDPTITSLGDKQEIHVFKS 368
Query: 352 --------------------VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRP 391
G Y+LP WSVSILPDCKTE +NTAK+ T + P
Sbjct: 369 KTSCAAFLSNYDTSSAARVMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRAPTILMKMIP 428
Query: 392 NQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDD 450
++ W G F + L++Q S T D +DY WY T+ + D
Sbjct: 429 TST-------KFSWESYNEGSPSSNEAGTFVKDGLVEQISMTRDKTDYFWYFTDITIGSD 481
Query: 451 DPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLL 510
+ L N L I S+G LH +VNG + + S F + +KL+ G N+++LL
Sbjct: 482 ESFLKTGDNPLLTIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQNIKLSVGINKLALL 541
Query: 511 SATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYN 570
S VGL N G ++ GI GPV L G + D+S KW+YK+GL G + +
Sbjct: 542 STAVGLPNAGVHYETWNTGILGPVTL---KGVNSGTWDMSKWKWSYKIGLRG-EAMSLHT 597
Query: 571 AKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
+++ + W V + +TWYK++F+ P N+P+ L++ MGKG WVNG+N+GR+W
Sbjct: 598 LAGSSAVKWWIKGFVVKKQPLTWYKSSFDTPRGNEPLALDMNTMGKGQVWVNGHNIGRHW 657
Query: 631 PTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGG 690
P Y A + C C+Y G Y KC +CG PSQ WYHVPRSW+K N LV+FEE+GG
Sbjct: 658 PAYTARGN-CG--RCNYAGIYNEKKCLSHCGEPSQRWYHVPRSWLKPFGNLLVIFEEWGG 714
Query: 691 NPSQINF 697
+PS I+
Sbjct: 715 DPSGISL 721
>gi|318136780|gb|ADV41669.1| beta-D-galactosidase [Actinidia deliciosa var. deliciosa]
Length = 728
Score = 632 bits (1631), Expect = e-178, Method: Compositional matrix adjust.
Identities = 343/730 (46%), Positives = 436/730 (59%), Gaps = 55/730 (7%)
Query: 9 RAILLCL-ILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKA 67
+ + +C+ + L S+ V++DG+AI I+G+R+IL SGSIHYPRSTP MWP LI+KA
Sbjct: 8 KVLFVCVGLFFLLCCCSVTASVTYDGKAIKINGQRRILFSGSIHYPRSTPEMWPGLIQKA 67
Query: 68 KEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNY 127
KEGGLD I+TYVFWN HEP QY F G DL+RFIK Q GLYV LRIG YVCAEWN+
Sbjct: 68 KEGGLDVIQTYVFWNGHEPSPGQYYFEGRYDLVRFIKLAQQAGLYVHLRIGLYVCAEWNF 127
Query: 128 GGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEY 187
GGFPVWL +PGI RT N F MQ FT IV++ K EKLF SQGGPII++QIENEY
Sbjct: 128 GGFPVWLKYVPGI-AFRTDNGPFKAAMQKFTEKIVNLMKSEKLFESQGGPIIMSQIENEY 186
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNP 236
G V + G GK+Y W A+MA LD GVPWIMC++ DAP P+ FTPN
Sbjct: 187 GPVEWEIGAPGKAYTKWAAEMAVGLDTGVPWIMCKQEDAPDPIIDTCNGFYCEGFTPNKN 246
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
PK+WTE WTGW+ +GG R EDLA++VARF Q G+F NYYMYHGGTNFGRT+
Sbjct: 247 YKPKMWTEAWTGWYTEFGGPIHNRPVEDLAYSVARFIQNNGSFVNYYMYHGGTNFGRTAA 306
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSV---- 352
G ++ TSYDYDAPIDEYG +PKWGHLR+LHK +K E +L T T G ++
Sbjct: 307 GLFVATSYDYDAPIDEYGLPREPKWGHLRDLHKAIKLCEPSLVSAYPTVTWPGKNLEVHV 366
Query: 353 --SGSS----------------------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKV 388
S SS Y+LP WS+SILPDCK FNTA+V+++++
Sbjct: 367 FKSKSSCAAFLANYDPSSPAKVTFQNMQYDLPPWSISILPDCKNAVFNTARVSSKSSQMK 426
Query: 389 KRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADL 447
P G + W+ + A N L +Q S T D SDYLWY+T+ ++
Sbjct: 427 MTPVSGG------AFSWQSYIEETVSADDSDTIAKNGLWEQISITRDGSDYLWYLTDVNI 480
Query: 448 KDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQI 507
++ L + L + S+G LH ++NG + + F VKL G N+I
Sbjct: 481 HPNEGFLKNGQSPVLTVMSAGHALHVFINGQLAGTVYGSLENPKLTFSNNVKLRAGINKI 540
Query: 508 SLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKK 567
SLLSA VGL N G F+ G+ GPV L G +DL+ KW+YKVGL G +D
Sbjct: 541 SLLSAAVGLPNVGLHFETWNTGVLGPVTLKGL---NEGTRDLTKQKWSYKVGLKG-EDLS 596
Query: 568 FYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLG 627
+ ++S + + +TWYK TF AP NDP+ L++ MGKG W+NG ++G
Sbjct: 597 LHTLSGSSSVEWVQGSLLAQKQPLTWYKATFNAPEGNDPLALDMNTMGKGQIWINGESIG 656
Query: 628 RYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEE 687
R+WP Y A + C C Y G Y KC NCG SQ WYHVPRSW+K N LV+FEE
Sbjct: 657 RHWPEYKASGN-CG--GCSYAGIYTEKKCLSNCGEASQRWYHVPRSWLKPSGNFLVVFEE 713
Query: 688 FGGNPSQINF 697
GG+P+ I+F
Sbjct: 714 LGGDPTGISF 723
>gi|297793967|ref|XP_002864868.1| beta-galactosidase 10 [Arabidopsis lyrata subsp. lyrata]
gi|297310703|gb|EFH41127.1| beta-galactosidase 10 [Arabidopsis lyrata subsp. lyrata]
Length = 740
Score = 632 bits (1631), Expect = e-178, Method: Compositional matrix adjust.
Identities = 334/736 (45%), Positives = 446/736 (60%), Gaps = 57/736 (7%)
Query: 7 CSRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKK 66
S AIL+ L+ + A VS+D R+++I R++++S +IHYPRS P MWP L++
Sbjct: 9 ASTAILVGLVFLFSWRSIDAANVSYDHRSLSIGNRRQLIISAAIHYPRSVPAMWPSLVQT 68
Query: 67 AKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWN 126
AKEGG +AIE+YVFWN HEP R+Y F G ++++FIK +Q G+++ILRIGP+V AEWN
Sbjct: 69 AKEGGCNAIESYVFWNGHEPSPRKYYFGGRYNIVKFIKIVQQAGMHMILRIGPFVAAEWN 128
Query: 127 YGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENE 186
YGG PVWLH +PG R N+ + + M++FTT IV++ KKEKLFA QGGPIIL+Q+ENE
Sbjct: 129 YGGVPVWLHYVPGTV-FRADNEPWKHYMESFTTYIVNLLKKEKLFAPQGGPIILSQVENE 187
Query: 187 YGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNN 235
YG DYG+ GK Y W A MA S +IGVPW+MCQ+ DAP + FTPN
Sbjct: 188 YGYYEKDYGEGGKRYAQWSASMAVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQFTPNT 247
Query: 236 PNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTS 295
P+ PKIWTENW GWFK++GG+DP R AED+A++VARFF GG+ NYYMYHGGTNFGRTS
Sbjct: 248 PDKPKIWTENWPGWFKTFGGRDPHRPAEDVAYSVARFFGKGGSVHNYYMYHGGTNFGRTS 307
Query: 296 GGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSG- 354
GGP++TTSYDY+APIDEYG PKWGHL++LHK + E L G N G+S+
Sbjct: 308 GGPFITTSYDYEAPIDEYGLPRLPKWGHLKDLHKAIMLSENLLINGEHQNFTLGHSLEAD 367
Query: 355 ----------------------------SSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV 386
+SY+LPAWSVSILPDCK E FNTAKV ++ +
Sbjct: 368 VYTDSSGTCAAFLSNLDDKNDKTVMFRNTSYHLPAWSVSILPDCKNEVFNTAKVTSKFS- 426
Query: 387 KVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNA 445
KV+ + + L+W+ E + G+ F N L+D +T D +DYLWY T+
Sbjct: 427 KVEMLPEDLRSSSGLKWEVFSEKPG---IWGEADFVKNELVDHINTTKDTTDYLWYTTSI 483
Query: 446 DLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKN 505
+ ++ L S L I S G LH ++N Y+ + ++ V L G+N
Sbjct: 484 TVSTNEEFLKKGSPPVLFIESKGHTLHVFINKEYLGTATGNGTHVPFKLKKSVALKAGEN 543
Query: 506 QISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDD 565
I LLS TVGL N GS ++ V G+ V G +L++ KW+YK+G+ G+
Sbjct: 544 NIDLLSMTVGLSNAGSFYEWVGAGLTS----VSIKGFNKGTLNLTNSKWSYKLGVQGVHL 599
Query: 566 KKFYNAKAANSERGWSSKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGY 624
+ F + + W+ P ++ +TWYK + P ++PV L++ MGKG AW+NG
Sbjct: 600 ELFKPGDSGAVK--WTVTTKPPKKQPLTWYKVVIDPPSGSEPVGLDMMSMGKGMAWLNGE 657
Query: 625 NLGRYWPTYLAEE---DGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNT 681
+GRYWP + D C E CDYRG + DKC CG PSQ WYHVPRSW K N
Sbjct: 658 EIGRYWPRIARKSTPNDECVKE-CDYRGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGNE 716
Query: 682 LVLFEEFGGNPSQINF 697
LV+FEE GG+P +I
Sbjct: 717 LVIFEEKGGDPMKITL 732
>gi|84579371|dbj|BAE72074.1| pear beta-galactosidase2 [Pyrus communis]
Length = 725
Score = 632 bits (1631), Expect = e-178, Method: Compositional matrix adjust.
Identities = 334/725 (46%), Positives = 439/725 (60%), Gaps = 56/725 (7%)
Query: 13 LCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGL 72
+ L+ +F+ + A V +D +AI I+G+R+IL+SGSIHYPRSTPGMWPDLI+KAK GGL
Sbjct: 11 ILLLFSCIFSAASA-SVGYDHKAIIINGQRRILISGSIHYPRSTPGMWPDLIQKAKAGGL 69
Query: 73 DAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPV 132
D I+TYVFWN HEP +Y F DL++FIK +Q GL+V LRIGPYVCAEWN+GGFP+
Sbjct: 70 DVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGFPI 129
Query: 133 WLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS 192
WL +PGI RT N+ F MQ FT IV+M K EKLF +QGGPIIL+QIENE+G V
Sbjct: 130 WLKYVPGIA-FRTDNEPFKAAMQKFTEKIVNMMKAEKLFQTQGGPIILSQIENEFGPVEW 188
Query: 193 DYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKI 241
+ G GK+Y W A+MA LD GVPWIMC++ DAP P+ F PN PK+
Sbjct: 189 EIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGYYCENFKPNKVYKPKM 248
Query: 242 WTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT 301
WTE WTGW+ +GG P R AEDLAF+VARF Q GG+F NYYMYHGGTNFGRT+GGP++
Sbjct: 249 WTEVWTGWYTEFGGAIPTRPAEDLAFSVARFIQSGGSFFNYYMYHGGTNFGRTAGGPFMA 308
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN----------- 350
TSYDYDAP+DEYG L QPKWGHLR+LHK +KS E L + + T GN
Sbjct: 309 TSYDYDAPLDEYGLLQQPKWGHLRDLHKAIKSCEHALVAVDPSVTKLGNNQEAHVFNSKS 368
Query: 351 -----------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQ 393
S Y+LP WS+SILPDCKT FNTAKV + + +P
Sbjct: 369 GCAAFLANYDTKYSVRVSFGHGQYDLPPWSISILPDCKTAVFNTAKVAWKASEVQMKPVY 428
Query: 394 AGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDP 452
+ + W+ + G L+ L +Q T D +DYLWYMT+ + D+
Sbjct: 429 S-------RLPWQSFIEETTTSDETGTTTLDGLYEQIYMTRDATDYLWYMTDITIGSDEA 481
Query: 453 ILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSA 512
L L I S+G LH ++NG + + F + VKL G N+++LLS
Sbjct: 482 FLKNGKFPLLTIFSAGHALHVFINGQLSGTVYGSLENPKLTFSQNVKLRPGINKLALLSI 541
Query: 513 TVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAK 572
+VGL N G+ F+ G+ GP+ L G T D+S KWTYK+G+ G + +
Sbjct: 542 SVGLPNVGTHFETWNTGVLGPISLKGL---NTGTWDMSRWKWTYKIGMKG-ESLGLHTVT 597
Query: 573 AANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPT 632
++S ++ + +TWYK TF+AP + P+ L++ MGKG W+NG ++GR+WP
Sbjct: 598 GSSSVDWAEGPSMAQKQPLTWYKATFDAPPGHAPLALDMGSMGKGQIWINGQSVGRHWPG 657
Query: 633 YLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNP 692
Y+A+ S +C Y G + KC CG PSQ W H+PRSW+ N LV+FEE+GG+P
Sbjct: 658 YIAQG---SCGNCYYAGTFNDKKCRTYCGKPSQRWCHIPRSWLTPTGNLLVVFEEWGGDP 714
Query: 693 SQINF 697
S ++
Sbjct: 715 SWMSL 719
>gi|7529708|emb|CAB86888.1| beta-galactosidase precursor-like protein [Arabidopsis thaliana]
Length = 727
Score = 629 bits (1623), Expect = e-177, Method: Compositional matrix adjust.
Identities = 337/727 (46%), Positives = 439/727 (60%), Gaps = 57/727 (7%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
IL L +L + + A V++D +A+ I+G+R+IL+SGSIHYPRSTP MWPDLIKKAKEG
Sbjct: 12 ILAILCFSSLIHSTEAV-VTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKAKEG 70
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GLD I+TYVFWN HEP Y F DL++F K + GLY+ LRIGPYVCAEWN+GGF
Sbjct: 71 GLDVIQTYVFWNGHEPSPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNFGGF 130
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
PVWL +PG+ RT N+ F MQ FT IVDM K+EKLF +QGGPIIL+QIENEYG +
Sbjct: 131 PVWLKYVPGMV-FRTDNEPFKIAMQKFTKKIVDMMKEEKLFETQGGPIILSQIENEYGPM 189
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSP 239
+ G AGK+Y W A+MA L GVPWIM ++ DAP P+ F PN+ N P
Sbjct: 190 QWEMGAAGKAYSKWTAEMALGLSTGVPWIMSKQEDAPYPIIDTCNGFYCEGFKPNSDNKP 249
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
K+WTENWTGWF +GG P R ED+AF+VARF Q GG+F NYYMY+GGTNF RT+G +
Sbjct: 250 KLWTENWTGWFTEFGGAIPNRPVEDIAFSVARFIQNGGSFMNYYMYYGGTNFDRTAG-VF 308
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNS-------- 351
+ TSYDYDAPIDEYG L +PK+ HL+ELHK++K E L + T T G+
Sbjct: 309 IATSYDYDAPIDEYGLLREPKYSHLKELHKVIKLCEPALVSVDPTITSLGDKQEIHVFKS 368
Query: 352 --------------------VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRP 391
G Y+LP WSVSILPDCKTE +NTAK+ T + P
Sbjct: 369 KTSCAAFLSNYDTSSAARVMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRAPTILMKMIP 428
Query: 392 NQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDD 450
++ W G F + L++Q S T D +DY WY T+ + D
Sbjct: 429 TST-------KFSWESYNEGSPSSNEAGTFVKDGLVEQISMTRDKTDYFWYFTDITIGSD 481
Query: 451 DPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLL 510
+ L N L I S+G LH +VNG + + S F + +KL+ G N+++LL
Sbjct: 482 ESFLKTGDNPLLTIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQNIKLSVGINKLALL 541
Query: 511 SATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYN 570
S VGL N G ++ GI GPV L G + D+S KW+YK+GL G + +
Sbjct: 542 STAVGLPNAGVHYETWNTGILGPVTL---KGVNSGTWDMSKWKWSYKIGLRG-EAMSLHT 597
Query: 571 AKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
+++ + W V + +TWYK++F+ P N+P+ L++ MGKG WVNG+N+GR+W
Sbjct: 598 LAGSSAVKWWIKGFVVKKQPLTWYKSSFDTPRGNEPLALDMNTMGKGQVWVNGHNIGRHW 657
Query: 631 PTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGG 690
P Y A + C C+Y G Y KC +CG PSQ WYHVPRSW+K N LV+FEE+GG
Sbjct: 658 PAYTARGN-CG--RCNYAGIYNEKKCLSHCGEPSQRWYHVPRSWLKPFGNLLVIFEEWGG 714
Query: 691 NPSQINF 697
+PS I+
Sbjct: 715 DPSGISL 721
>gi|297816572|ref|XP_002876169.1| AT3g52840/F8J2_10 [Arabidopsis lyrata subsp. lyrata]
gi|297322007|gb|EFH52428.1| AT3g52840/F8J2_10 [Arabidopsis lyrata subsp. lyrata]
Length = 728
Score = 629 bits (1622), Expect = e-177, Method: Compositional matrix adjust.
Identities = 333/728 (45%), Positives = 441/728 (60%), Gaps = 57/728 (7%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
I L ++ + S V++D +A+ I+G+R+IL+SGSIHYPRSTP MWPDLIKKAKEG
Sbjct: 11 IFLAILCFSSLIWSTEAVVTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKAKEG 70
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GLD I+TYVFWN HEP Y F DL++F K + GLY+ LRIGPYVCAEWN+GGF
Sbjct: 71 GLDVIQTYVFWNGHEPSPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNFGGF 130
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
PVWL +PGI RT N+ F MQ FT IVDM K+EKLF +QGGPIIL+QIENEYG +
Sbjct: 131 PVWLKYVPGIV-FRTDNEPFKIAMQRFTKKIVDMMKEEKLFETQGGPIILSQIENEYGPM 189
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSP 239
+ G AGK+Y W A+MA L GVPWIMC++ DAP P+ F PN+ N P
Sbjct: 190 EWEMGAAGKAYSKWTAEMALGLSTGVPWIMCKQEDAPYPIIDTCNGFYCEGFKPNSDNKP 249
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
K+WTENWTGWF +GG P R ED+AF+VARF Q GG+F NYYMY+GGTNF RT+G +
Sbjct: 250 KLWTENWTGWFTEFGGAIPNRPVEDIAFSVARFIQNGGSFLNYYMYYGGTNFDRTAG-VF 308
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNS-------- 351
+ TSYDYDAP+DEYG L +PK+ HL+ELHK++K E L + T T G+
Sbjct: 309 IATSYDYDAPLDEYGLLREPKYSHLKELHKVIKLCEPALVSVDPTITSLGDKQEVHVFKS 368
Query: 352 --------------------VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRP 391
G Y+LP WSVSILPDCKTE +NTAK+ T + P
Sbjct: 369 KTSCAAFLSNYDTSSAARIMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRAPTILMKMVP 428
Query: 392 NQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDD 450
++ W G F + L++Q S T D +DY WY+T+ + D
Sbjct: 429 TST-------KFSWESYNEGSPSSNDDGTFVKDGLVEQISMTRDKTDYFWYLTDITIGSD 481
Query: 451 DPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLL 510
+ L + L I S+G LH +VNG + + S F + +KL+ G N+++LL
Sbjct: 482 ESFLKTGDDPLLTIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQKIKLSVGINKLALL 541
Query: 511 SATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYN 570
S VGL N G ++ G+ GPV L G + D+S KW+YK+G+ G + F+
Sbjct: 542 STAVGLPNAGVHYETWNTGVLGPVTL---KGVNSGTWDMSKWKWSYKIGIRG-EAMSFHT 597
Query: 571 AKAANSERGWSSKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRY 629
+++ + W + + + +TWYK++F+ P N+P+ L++ MGKG WVNG+N+GR+
Sbjct: 598 IAGSSAVKWWIKGSFVVKKEPLTWYKSSFDTPKGNEPLALDMNTMGKGQVWVNGHNIGRH 657
Query: 630 WPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
WP Y A + C C+Y G Y KC +CG PSQ WYHVPRSW+K N LV+FEE+G
Sbjct: 658 WPAYTARGN-CG--RCNYAGIYNEKKCLSHCGEPSQRWYHVPRSWLKPFGNLLVIFEEWG 714
Query: 690 GNPSQINF 697
G+PS I+
Sbjct: 715 GDPSGISL 722
>gi|3641863|emb|CAA06309.1| beta-galactosidase [Cicer arietinum]
Length = 730
Score = 629 bits (1622), Expect = e-177, Method: Compositional matrix adjust.
Identities = 332/729 (45%), Positives = 435/729 (59%), Gaps = 57/729 (7%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
++ ++ LF S+ V++D +AI I+G+R+IL+SGSIHYPRSTP MWPDLI+KAK+G
Sbjct: 13 VIGLVLFLCLFVFSVTASVTYDHKAIVINGQRRILISGSIHYPRSTPQMWPDLIQKAKDG 72
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
G+D I+TYVFWN HEP Y F DL++F+K +Q GLYV LRIGPYVCAEWN+GGF
Sbjct: 73 GVDVIQTYVFWNGHEPSPGNYYFEDRFDLVKFVKVVQQAGLYVNLRIGPYVCAEWNFGGF 132
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
PVWL +PG+ RT N+ F MQ FT IV M K E LF SQGGPII++QIENEYG V
Sbjct: 133 PVWLKYVPGVA-FRTDNEPFKAAMQKFTAKIVSMMKAENLFESQGGPIIMSQIENEYGPV 191
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSP 239
+ G GK+Y W ++MA LD GVPWIMC++ DAP P+ FTPN P
Sbjct: 192 EWEIGAPGKAYTKWFSQMAIGLDTGVPWIMCKQEDAPDPIIDTCNGYYCENFTPNKNYKP 251
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
K+WTENW+GW+ +G P R A+D+AF+VARF Q G++ NYYMYHGGTNFGRTS G +
Sbjct: 252 KMWTENWSGWYTDFGSAVPYRPAQDVAFSVARFIQNRGSYVNYYMYHGGTNFGRTSAGLF 311
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS------ 353
+ TSYDYDAPIDEYG L++PKWGHLR LHK +K E L + T + G ++
Sbjct: 312 IATSYDYDAPIDEYGLLSEPKWGHLRNLHKAIKQCEPILVSVDPTVSWPGKNLEVHVYKT 371
Query: 354 -----------------------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR 390
Y+LP WS+SILPDCKT FNTAKV T + K
Sbjct: 372 STGACAAFLANYDTTSPAKVTFGNGQYDLPPWSISILPDCKTAVFNTAKVGTVPSFHRKM 431
Query: 391 -PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLK 448
P + D I+D N L++Q K T D SDYLWYMT+ ++
Sbjct: 432 TPVSSAFDWQSYNEAPASSGIDDSTTA-------NALLEQIKVTRDSSDYLWYMTDVNIS 484
Query: 449 DDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQIS 508
++ + L S+G VLH +VNG + + + F VKL G N+IS
Sbjct: 485 PNEGFIKNGQYPVLTAMSAGHVLHVFVNGQFSGTAYGGLENPKLTFSNSVKLRVGNNKIS 544
Query: 509 LLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKF 568
LLS VGL N G ++ G+ GPV L G +DLS KW+YK+GL G +
Sbjct: 545 LLSVAVGLSNVGLHYETWNVGVLGPVTLKGL---NEGTRDLSGQKWSYKIGLKG-ETLNL 600
Query: 569 YNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGR 628
+ ++S + ++ + +TWYK TF+AP NDP+ L++ MGKG WVNG ++GR
Sbjct: 601 HTLIGSSSVQWTKGSSLVKKQPLTWYKATFDAPAGNDPLALDMSSMGKGEIWVNGESIGR 660
Query: 629 YWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEF 688
+WP Y+A S C+Y G + KC +CG P+Q WYH+PRSW+ N LV+ EE+
Sbjct: 661 HWPAYIARG---SCGGCNYAGTFTDKKCRTSCGQPTQKWYHIPRSWVNPRGNFLVVLEEW 717
Query: 689 GGNPSQINF 697
GG+PS I+
Sbjct: 718 GGDPSGISL 726
>gi|242093394|ref|XP_002437187.1| hypothetical protein SORBIDRAFT_10g022620 [Sorghum bicolor]
gi|241915410|gb|EER88554.1| hypothetical protein SORBIDRAFT_10g022620 [Sorghum bicolor]
Length = 725
Score = 629 bits (1621), Expect = e-177, Method: Compositional matrix adjust.
Identities = 336/712 (47%), Positives = 429/712 (60%), Gaps = 61/712 (8%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D RA+ I+G+R+IL+SGSIHYPRSTP MWPDL++KAK+GGLD ++TYVFWN HEP +
Sbjct: 31 VSYDHRAVVINGQRRILISGSIHYPRSTPEMWPDLLQKAKDGGLDVVQTYVFWNGHEPQQ 90
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QY F DL+RF+K + GL+V LRIGPYVCAEWN+GGFPVWL +PG+ RT N
Sbjct: 91 GQYYFGDRYDLVRFVKLAKQAGLFVHLRIGPYVCAEWNFGGFPVWLKYVPGVS-FRTDNA 149
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ F IV M K E LF QGGPIILAQ+ENEYG + S G K Y NW AKM
Sbjct: 150 PFKAAMQAFVEKIVSMMKAEGLFEWQGGPIILAQVENEYGPMESVMGGGAKPYANWAAKM 209
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A + GVPW+MC++ DAP P+ F+PN+ + P +WTE WTGWF ++GG
Sbjct: 210 AVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNSNSKPTMWTEAWTGWFTAFGGAV 269
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R ED+AFAVARF Q GG+F NYYMYHGGTNF RTSGGP++ TSYDYDAPIDEYG L
Sbjct: 270 PHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGLLR 329
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------------------------- 350
QPKWGHLR+LHK +K E L G+ T GN
Sbjct: 330 QPKWGHLRDLHKAIKQAEPALVSGDPTIQTIGNYEKAYVYKSSSGACAAFLSNYHTNAAA 389
Query: 351 --SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
+G Y+LPAWS+S+LPDC+T FNTA V++ + P AG W+ E
Sbjct: 390 RVVFNGRRYDLPAWSISVLPDCRTAVFNTATVSSPSAPARMTP--AGG----FSWQSYSE 443
Query: 409 MINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
N R F + L++Q S T D SDYLWY T ++ ++ L L I S+
Sbjct: 444 ATNSLDDRA---FTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQLTIYSA 500
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G L +VNG + + Y + + VK+ +G N+IS+LSA VGL N G+ ++
Sbjct: 501 GHALQVFVNGQSYGAAYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGTHYEAWN 560
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
G+ GPV L G + +DLS+ KWTY++GL+G A +++ E G ++ PL
Sbjct: 561 VGVLGPVTLSGLNEGK---RDLSNQKWTYQIGLHGESLGVHSVAGSSSVEWGSAAGKQPL 617
Query: 588 NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDY 647
TW+K F AP N PV L++ MGKG AWVNG+++GRYW +Y A C C Y
Sbjct: 618 ----TWHKAYFNAPSGNAPVALDMSSMGKGQAWVNGHHIGRYW-SYKATGGSCG--GCSY 670
Query: 648 RGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
G Y KC CG+ SQ +YHVPRSW+ N LV+ EEFGG+ S + T
Sbjct: 671 AGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVVLEEFGGDLSGVKLVT 722
>gi|449489867|ref|XP_004158444.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cucumis
sativus]
Length = 725
Score = 629 bits (1621), Expect = e-177, Method: Compositional matrix adjust.
Identities = 337/734 (45%), Positives = 443/734 (60%), Gaps = 63/734 (8%)
Query: 8 SRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKA 67
S+ +++ L L S+ V++D +AI I+G R+IL+SGSIHYPRS P MWPDLI+KA
Sbjct: 5 SKIMVVFLGLFLWVCSSVMASVTYDHKAIIINGRRRILISGSIHYPRSIPQMWPDLIQKA 64
Query: 68 KEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNY 127
K+GGLD IETYVFWN HEP QY+F DL+RF+K + GLYV LRIGPYVCAEWN+
Sbjct: 65 KDGGLDVIETYVFWNGHEPSPGQYNFEDRYDLVRFVKLVHQAGLYVHLRIGPYVCAEWNF 124
Query: 128 GGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEY 187
GGFPVWL +PGI RT N F MQ FT IV + K EKL+ SQGGPIIL+QIENEY
Sbjct: 125 GGFPVWLKYVPGI-AFRTDNGPFKAAMQKFTEKIVGLMKGEKLYESQGGPIILSQIENEY 183
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNP 236
G V + G GKSY W A+MA L+ GVPW+MC++ DAP P+ F PN
Sbjct: 184 GPVEWEIGAPGKSYTKWAAQMALGLNTGVPWVMCKQDDAPDPVIDTCNGFYCENFKPNKV 243
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
PK+WTE WTGWF +GG P R ED+A++VARF Q GG+F NYYMYHGGTNFGRT+G
Sbjct: 244 YKPKMWTEAWTGWFTEFGGPAPYRPVEDMAYSVARFIQNGGSFINYYMYHGGTNFGRTAG 303
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLK-------SMEKTLTY--------- 340
GP++ TSYDYDAPIDEYG L +PKW HLR+LHK +K S++ T++Y
Sbjct: 304 GPFIATSYDYDAPIDEYGLLREPKWSHLRDLHKAIKLCEPALVSVDPTVSYLGSNQEAHV 363
Query: 341 ---------GNVTNTDYGNSVS----GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK 387
+ N D +S + + Y+LP WSVSILPDCK+ FNTAKV T+
Sbjct: 364 FKTRSGSCAAFLANYDASSSATVTFGNNQYDLPPWSVSILPDCKSVIFNTAKVGAPTSQP 423
Query: 388 VKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNAD 446
P + + W + + L++Q S T D +DYLWYMT+
Sbjct: 424 KMTPVSS--------FSWLSYNEETASAYTEDTTTMAGLVEQISVTRDSTDYLWYMTDIR 475
Query: 447 LKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDL---FERPVKLTRG 503
+ ++ L L + S+G LH ++NG T YG S + F + V L G
Sbjct: 476 IDPNEGFLKSGQWPLLTVFSAGHALHVFINGQLSG---TTYGGSENYKLTFSKYVNLRAG 532
Query: 504 KNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGL 563
N++S+LS VGL N G ++ G+ GPV L G D +D+S +KW+YK+GL G
Sbjct: 533 INKLSILSVAVGLPNGGLHYETWNTGVLGPVTLKGLNED---TRDMSGYKWSYKIGLKG- 588
Query: 564 DDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNG 623
+ ++ ++S + V + +TWYKTTF++P N+P+ L++ MGKG W+NG
Sbjct: 589 EALNLHSVSGSSSVEWVTGSLVAQKQPLTWYKTTFDSPKGNEPLALDMSSMGKGQIWING 648
Query: 624 YNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLV 683
++GR+WP Y A+ S C+Y G + KC CG PSQ WYHVPR+W+K N LV
Sbjct: 649 QSIGRHWPAYTAK---GSCGKCNYGGIFNEKKCHSXCGEPSQRWYHVPRAWLKSSGNVLV 705
Query: 684 LFEEFGGNPSQINF 697
+FEE+GGNP I+
Sbjct: 706 IFEEWGGNPEGISL 719
>gi|357139090|ref|XP_003571118.1| PREDICTED: beta-galactosidase 4-like [Brachypodium distachyon]
Length = 787
Score = 629 bits (1621), Expect = e-177, Method: Compositional matrix adjust.
Identities = 339/712 (47%), Positives = 429/712 (60%), Gaps = 62/712 (8%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D R++ I+G R+IL+SGSIHYPRSTP MWP LI+KAK+GGLD ++TYVFWN HEP++
Sbjct: 94 VSYDHRSLVINGRRRILISGSIHYPRSTPEMWPGLIQKAKDGGLDVVQTYVFWNGHEPVK 153
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QY F+ DLIRF+K ++ GLYV LRIGPYVCAEWN+GGFPVWL +PGI RT N
Sbjct: 154 GQYYFSDRYDLIRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIS-FRTDNG 212
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F EMQ F IV M K E+LF QGGPII++Q+ENE+G + S G K Y NW AKM
Sbjct: 213 PFKAEMQRFVEKIVSMMKSERLFEWQGGPIIMSQVENEFGPMESAGGVGAKPYANWAAKM 272
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A + + GVPW+MC++ DAP P+ FTPN N P +WTE WTGWF S+GG
Sbjct: 273 AVATNTGVPWVMCKQEDAPDPVINTCNGFYCDYFTPNKKNKPAMWTEAWTGWFTSFGGAV 332
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R ED+AFAVARF Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAPIDE+G L
Sbjct: 333 PHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVATSYDYDAPIDEFGLLR 392
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------------------------- 350
QPKWGHLR+LHK +K E TL G+ T GN
Sbjct: 393 QPKWGHLRDLHKAIKQAEPTLVSGDPTIQSLGNYEKAYVFKSKNGACAAFLSNYHMNSAV 452
Query: 351 --SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
+G Y+LPAWS+SILPDCKT FNTA V T + P W+ E
Sbjct: 453 KVRFNGRHYDLPAWSISILPDCKTVVFNTATVKEPTLLPKMHP------VVRFTWQSYSE 506
Query: 409 MINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
N F + L++Q S T D SDYLWY T ++ + +G L + S+
Sbjct: 507 DTNSL---DDSAFTKDGLVEQLSMTWDKSDYLWYTTFVNIGPGELSKNGQWPQ-LTVYSA 562
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G + +VNG S + + ++ VK+ +G N+IS+LS+ VGL N G F+
Sbjct: 563 GHSMQVFVNGKSYGSVYGGFENPKLTYDGHVKMWQGSNKISILSSAVGLPNVGDHFERWN 622
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
G+ GPV L G + + +DLS KWTY+VGL G + ++ E G PL
Sbjct: 623 VGVLGPVTLSGLSEGK---RDLSHQKWTYQVGLKGESLGIHTVSGSSAVEWGGPGSKQPL 679
Query: 588 NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDY 647
TW+K F AP +DPV L++ MGKG WVNG+++GRYW +Y A GC C Y
Sbjct: 680 ----TWHKALFNAPSGSDPVALDMGSMGKGQMWVNGHHVGRYW-SYKAPSRGCG--GCSY 732
Query: 648 RGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
G Y DKC +CG SQ WYHVPRSW+K G N LV+ EE+GG+ + + T
Sbjct: 733 AGTYREDKCRSSCGELSQRWYHVPRSWLKPGGNLLVVLEEYGGDVAGVTLAT 784
>gi|350537549|ref|NP_001234298.1| beta-galactosidase precursor [Solanum lycopersicum]
gi|7939617|gb|AAF70821.1|AF154420_1 beta-galactosidase [Solanum lycopersicum]
Length = 892
Score = 628 bits (1620), Expect = e-177, Method: Compositional matrix adjust.
Identities = 360/871 (41%), Positives = 477/871 (54%), Gaps = 127/871 (14%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ V++D RA+ I G+R++L+S IHYPR+TP MWP LI ++KEGG D IETY FWN HEP
Sbjct: 35 FNVTYDNRALIIGGKRRMLISAGIHYPRATPEMWPTLIARSKEGGADVIETYTFWNGHEP 94
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
R QY+F G D+++F K + GL++ +RIGPY CAEWN+GGFP+WL ++PGIE RT
Sbjct: 95 TRGQYNFEGRYDIVKFAKLVGSHGLFLFIRIGPYACAEWNFGGFPIWLRDIPGIE-FRTD 153
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
N F EM+ + IVD+ E LF+ QGGPIIL QIENEYGNV S +G GK Y+ W A
Sbjct: 154 NAPFKEEMERYVKKIVDLMISESLFSWQGGPIILLQIENEYGNVESSFGPKGKLYMKWAA 213
Query: 207 KMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGG 255
+MA L GVPW+MC+++DAP + FTPN+ PKIWTENW GWF WG
Sbjct: 214 EMAVGLGAGVPWVMCRQTDAPEYIIDTCNAYYCDGFTPNSEKKPKIWTENWNGWFADWGE 273
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
+ P R +ED+AFA+ARFFQ GG+ QNYYMY GGTNFGRT+GGP TSYDYDAP+DEYG
Sbjct: 274 RLPYRPSEDIAFAIARFFQRGGSLQNYYMYFGGTNFGRTAGGPTQITSYDYDAPLDEYGL 333
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYGN--------------------------------- 342
L QPKWGHL++LH +K E L +
Sbjct: 334 LRQPKWGHLKDLHAAIKLCEPALVAADSPQYIKLGPKQEAHVYRGTSNNIGQYMSLNEGI 393
Query: 343 ----VTNTDYGNSVS----GSSYNLPAWSVSILPDCKTEEFNT---------------AK 379
+ N D S + G + LP WSV C+ E A+
Sbjct: 394 CAAFIANIDEHESATVKFYGQEFTLPPWSVVF---CQIAEIQLSTQLRWGHKLQSKQWAQ 450
Query: 380 VNTQTNVKV---KRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDV 435
+ Q + + K +A ++ W E + V G +F +++ T D
Sbjct: 451 ILFQLGIILCFYKLSLKASSESFSQSWMTLKEPLG---VWGDKNFTSKGILEHLNVTKDQ 507
Query: 436 SDYLWYMTNADLKDDDPILSGSSNM--TLRINSSGQVLHAYVNGNY---VDSQWTKYGAS 490
SDYLWY+T + DDD +++ T+ I+S + +VNG V +W K
Sbjct: 508 SDYLWYLTRIYISDDDISFWEENDVSPTIDIDSMRDFVRIFVNGQLAGSVKGKWIK---- 563
Query: 491 NDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVG-RAGDETIIKDL 549
+PVKL +G N I LLS TVGLQNYG+ + G G + L G ++GD +L
Sbjct: 564 ---VVQPVKLVQGYNDILLLSETVGLQNYGAFLEKDGAGFKGQIKLTGCKSGD----INL 616
Query: 550 SSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRM-TWYKTTFEAPLENDPVV 608
++ WTY+VGL G + + Y+ + S GW+ + +WYKT F+AP DPV
Sbjct: 617 TTSLWTYQVGLRG-EFLEVYDVNSTESA-GWTEFPTGTTPSVFSWYKTKFDAPGGTDPVA 674
Query: 609 LNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWY 668
L+ MGKG AWVNG+++GRYW T +A +GC +CDYRG Y SDKC NCG +Q WY
Sbjct: 675 LDFSSMGKGQAWVNGHHVGRYW-TLVAPNNGCG-RTCDYRGAYHSDKCRTNCGEITQAWY 732
Query: 669 HVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHE----------------- 711
H+PRSW+K N LV+FEE P I+ T T C Q E
Sbjct: 733 HIPRSWLKTLNNVLVIFEETDKTPFDISISTRSTETICAQVSEKHYPPLHKWSHSEFDRK 792
Query: 712 ----NKT--MELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGK 764
+KT M L C G IS I++AS+G P G+C F +G C A + L ++ + C+G+
Sbjct: 793 LSLMDKTPEMHLQCDEGHTISSIEFASYGSPNGSCQKFSQGKCHAA-NSLSVVSQACIGR 851
Query: 765 KSCSIEASEANLGATSCAAGTVKRLVVEALC 795
SCSI S G VK L V+A C
Sbjct: 852 TSCSIGISNGVFGDP--CRHVVKSLAVQAKC 880
>gi|356509960|ref|XP_003523710.1| PREDICTED: beta-galactosidase 3-like isoform 1 [Glycine max]
Length = 736
Score = 628 bits (1620), Expect = e-177, Method: Compositional matrix adjust.
Identities = 332/717 (46%), Positives = 431/717 (60%), Gaps = 57/717 (7%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D +++ I+G+R+IL+SGSIHYPRSTP MW DLI KAK GGLD I+TYVFW+ HEP
Sbjct: 30 VTYDRKSLLINGQRRILISGSIHYPRSTPEMWEDLIWKAKHGGLDVIDTYVFWDVHEPSP 89
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
YDF G DL+RFIKT+Q GLY LRIGPYVCAEWN+GG PVWL +PG+ RT N+
Sbjct: 90 GNYDFEGRYDLVRFIKTVQKVGLYANLRIGPYVCAEWNFGGIPVWLKYVPGV-SFRTDNE 148
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ FT IV M K EKLF SQGGPIIL+QIENEYG G AG++Y+NW A M
Sbjct: 149 PFKAAMQGFTQKIVQMMKSEKLFQSQGGPIILSQIENEYGP--ESRGAAGRAYVNWAASM 206
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A L GVPW+MC+E+DAP P+ F+PN P P +WTE W+GWF +GG
Sbjct: 207 AVGLGTGVPWVMCKENDAPDPVINSCNGFYCDDFSPNKPYKPSMWTETWSGWFTEFGGPI 266
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
+R EDL+FAVARF Q GG++ NYYMYHGGTNFGR++GGP++TTSYDYDAPIDEYG +
Sbjct: 267 HQRPVEDLSFAVARFIQKGGSYVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGLIR 326
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG----------------------NSVSGS 355
QPK+ HL+ELHK +K E L + T G N+ S +
Sbjct: 327 QPKYSHLKELHKAIKRCEHALVSLDPTVLSLGTLLQAHVFSSGTGTCAAFLANYNAQSAA 386
Query: 356 S-------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
+ Y+LP WS+SILPDCK + FNTAKV Q + P + P + W
Sbjct: 387 TVTFNNRHYDLPPWSISILPDCKIDVFNTAKVRVQPSQVKMLPVK------PKLFSWESY 440
Query: 409 MINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
+ + L++Q T D SDYLWY+T+ D+ + L G ++ + S+
Sbjct: 441 DEDLSSLAESSRITAPGLLEQLNVTRDTSDYLWYITSVDISSSESFLRGGQKPSINVQSA 500
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G +H +VNG + S + + + PV L G N+I+LLS TVGLQN G ++
Sbjct: 501 GHAVHVFVNGQFSGSAFGTREQRSCTYNGPVDLRAGANKIALLSVTVGLQNVGRHYETWE 560
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
GI GPVLL G + KDL+ +KW+YKVGL G ++ + S+
Sbjct: 561 AGITGPVLLHGLDQGQ---KDLTWNKWSYKVGLRGEAMNLVSPNGVSSVDWVQESQATQS 617
Query: 588 NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDY 647
++ WYK F+AP +P+ L+L+ MGKG W+NG ++GRYW Y A+ D SC Y
Sbjct: 618 RSQLKWYKAYFDAPGGKEPLALDLESMGKGQVWINGQSIGRYWMAY-AKGD---CNSCTY 673
Query: 648 RGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGT 704
G + KC CG P+Q WYHVPRSW+K N +V+FEE GGNP +I+ V T
Sbjct: 674 SGTFRPVKCQLGCGQPTQRWYHVPRSWLKPTKNLIVVFEELGGNPWKISLVKRVAHT 730
>gi|212274513|ref|NP_001130532.1| uncharacterized protein LOC100191631 precursor [Zea mays]
gi|194689400|gb|ACF78784.1| unknown [Zea mays]
gi|224030521|gb|ACN34336.1| unknown [Zea mays]
gi|413922054|gb|AFW61986.1| beta-galactosidase isoform 1 [Zea mays]
gi|413922055|gb|AFW61987.1| beta-galactosidase isoform 2 [Zea mays]
gi|413954366|gb|AFW87015.1| beta-galactosidase isoform 1 [Zea mays]
gi|413954367|gb|AFW87016.1| beta-galactosidase isoform 2 [Zea mays]
Length = 722
Score = 628 bits (1619), Expect = e-177, Method: Compositional matrix adjust.
Identities = 338/712 (47%), Positives = 427/712 (59%), Gaps = 61/712 (8%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D RA+ I+G+R+IL+SGSIHYPRSTP MWP L++KAK+GGLD ++TYVFWN HEP+R
Sbjct: 28 VSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTYVFWNGHEPVR 87
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QY F DL+RF+K + GLYV LRIGPYVCAEWN+GGFPVWL +PGI RT N
Sbjct: 88 GQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIS-FRTDNG 146
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ F IV M K E LF QGGPIILAQ+ENEYG + S G K Y NW AKM
Sbjct: 147 PFKAAMQAFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGAGAKPYANWAAKM 206
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A + GVPW+MC++ DAP P+ F+PN+ + P +WTE WTGWF ++GG
Sbjct: 207 AVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNSNSKPTMWTEAWTGWFTAFGGAV 266
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R ED+AFAVARF Q GG+F NYYMYHGGTNF RTSGGP++ TSYDYDAPIDEYG L
Sbjct: 267 PHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGLLR 326
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------------------------- 350
QPKWGHLR+LHK +K E L G+ T GN
Sbjct: 327 QPKWGHLRDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSSGGACAAFLSNYHTSAAA 386
Query: 351 --SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
+G Y+LPAWS+S+LPDCK FNTA V+ + R + AG W+ E
Sbjct: 387 RVVFNGRRYDLPAWSISVLPDCKAAVFNTATVSEPS--APARMSPAGG----FSWQSYSE 440
Query: 409 MINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
N R F + L++Q S T D SDYLWY T ++ ++ L L I S+
Sbjct: 441 ATNSLDGRA---FTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQLTIYSA 497
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G L +VNG + + Y + + VK+ +G N+IS+LSA VGL N G+ ++
Sbjct: 498 GHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGTHYETWN 557
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
G+ GPV L G + +DLS KWTY++GL+G A +++ E G ++ PL
Sbjct: 558 VGVLGPVTLSGLNEGK---RDLSDQKWTYQIGLHGESLGVQSVAGSSSVEWGSAAGKQPL 614
Query: 588 NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDY 647
TW+K F AP + PV L++ MGKG AWVNG ++GRYW +Y A GC C Y
Sbjct: 615 ----TWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYW-SYKASSSGCG--GCSY 667
Query: 648 RGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
G Y KC CG+ SQ +YHVPRSW+ N LV+ EEFGG+ S + T
Sbjct: 668 AGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVMLEEFGGDLSGVKLVT 719
>gi|115468642|ref|NP_001057920.1| Os06g0573600 [Oryza sativa Japonica Group]
gi|75112285|sp|Q5Z7L0.1|BGAL9_ORYSJ RecName: Full=Beta-galactosidase 9; Short=Lactase 9; Flags:
Precursor
gi|54291174|dbj|BAD61846.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113595960|dbj|BAF19834.1| Os06g0573600 [Oryza sativa Japonica Group]
Length = 715
Score = 628 bits (1619), Expect = e-177, Method: Compositional matrix adjust.
Identities = 335/711 (47%), Positives = 433/711 (60%), Gaps = 62/711 (8%)
Query: 30 SHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRR 89
++D R++TI+G+R+IL+SGSIHYPRSTP MWPDLI+KAK+GGLD I+TYVFWN HEP++
Sbjct: 23 TYDHRSLTINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQG 82
Query: 90 QYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKV 149
QY F+ DL+RF+K ++ GLYV LRIGPYVCAEWNYGGFPVWL +PGI RT N
Sbjct: 83 QYYFSDRYDLVRFVKLVKQAGLYVNLRIGPYVCAEWNYGGFPVWLKYVPGI-SFRTDNGP 141
Query: 150 FMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMA 209
F MQ F IV M K E LF QGGPIILAQ+ENEYG + S G KSY++W AKMA
Sbjct: 142 FKAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGSGAKSYVDWAAKMA 201
Query: 210 TSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDP 258
+ + GVPWIMC++ DAP P+ FTPN+ N P +WTE W+GWF ++GG P
Sbjct: 202 VATNAGVPWIMCKQDDAPDPVINTCNGFYCDDFTPNSKNKPSMWTEAWSGWFTAFGGTVP 261
Query: 259 KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQ 318
+R EDLAFAVARF Q GG+F NYYMYHGGTNF RT+GGP++ TSYDYDAPIDEYG L Q
Sbjct: 262 QRPVEDLAFAVARFIQKGGSFINYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQ 321
Query: 319 PKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN---------------------------- 350
PKWGHL LHK +K E L G+ T + GN
Sbjct: 322 PKWGHLTNLHKAIKQAETALVAGDPTVQNIGNYEKAYVFRSSSGDCAAFLSNFHTSAAAR 381
Query: 351 -SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEM 409
+ +G Y+LPAWS+S+LPDC+T +NTA V ++ + N AG W+ E
Sbjct: 382 VAFNGRRYDLPAWSISVLPDCRTAVYNTATVTAASS--PAKMNPAGG----FTWQSYGEA 435
Query: 410 INDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSG 468
N + F + L++Q S T D SDYLWY T ++ + L L + S+G
Sbjct: 436 TNSL---DETAFTKDGLVEQLSMTWDKSDYLWYTTYVNIDSGEQFLKSGQWPQLTVYSAG 492
Query: 469 QVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPN 528
+ +VNG Y + + Y + VK+ +G N+IS+LS+ VGL N G+ ++
Sbjct: 493 HSVQVFVNGQYFGNAYGGYDGPKLTYSGYVKMWQGSNKISILSSAVGLPNVGTHYETWNI 552
Query: 529 GIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLN 588
G+ GPV L G + +DLS KWTY++GL G + +++ E G ++ P+
Sbjct: 553 GVLGPVTLSGLNEGK---RDLSKQKWTYQIGLKGEKLGVHSVSGSSSVEWGGAAGKQPV- 608
Query: 589 RRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYR 648
TW++ F AP PV L+L MGKG AWVNG+ +GRYW +Y A + C C Y
Sbjct: 609 ---TWHRAYFNAPAGGAPVALDLGSMGKGQAWVNGHLIGRYW-SYKASGN-CG--GCSYA 661
Query: 649 GPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
G Y KC NCG+ SQ WYHVPRSW+ N +VL EEFGG+ S + T
Sbjct: 662 GTYSEKKCQANCGDASQRWYHVPRSWLNPSGNLVVLLEEFGGDLSGVTLMT 712
>gi|125555810|gb|EAZ01416.1| hypothetical protein OsI_23450 [Oryza sativa Indica Group]
Length = 717
Score = 627 bits (1618), Expect = e-177, Method: Compositional matrix adjust.
Identities = 335/711 (47%), Positives = 433/711 (60%), Gaps = 62/711 (8%)
Query: 30 SHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRR 89
++D R++TI+G+R+IL+SGSIHYPRSTP MWPDLI+KAK+GGLD I+TYVFWN HEP++
Sbjct: 25 TYDHRSLTINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQG 84
Query: 90 QYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKV 149
QY F+ DL+RF+K ++ GLYV LRIGPYVCAEWNYGGFPVWL +PGI RT N
Sbjct: 85 QYYFSDRYDLVRFVKLVKQAGLYVNLRIGPYVCAEWNYGGFPVWLKYVPGI-SFRTDNGP 143
Query: 150 FMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMA 209
F MQ F IV M K E LF QGGPIILAQ+ENEYG + S G KSY++W AKMA
Sbjct: 144 FKAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGSGAKSYVDWAAKMA 203
Query: 210 TSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDP 258
+ + GVPWIMC++ DAP P+ FTPN+ N P +WTE W+GWF ++GG P
Sbjct: 204 VATNAGVPWIMCKQDDAPDPVINTCNGFYCDDFTPNSKNKPSMWTEAWSGWFTAFGGTVP 263
Query: 259 KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQ 318
+R EDLAFAVARF Q GG+F NYYMYHGGTNF RT+GGP++ TSYDYDAPIDEYG L Q
Sbjct: 264 QRPVEDLAFAVARFIQKGGSFINYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQ 323
Query: 319 PKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN---------------------------- 350
PKWGHL LHK +K E L G+ T + GN
Sbjct: 324 PKWGHLTNLHKAIKQAEPALVAGDPTVQNIGNYEKAYVFRSSSGDCAAFLSNFHTSAAAR 383
Query: 351 -SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEM 409
+ +G Y+LPAWS+S+LPDC+T +NTA V ++ + N AG W+ E
Sbjct: 384 VAFNGRRYDLPAWSISVLPDCRTAVYNTATVTAASS--PAKMNPAGG----FTWQSYGEA 437
Query: 410 INDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSG 468
N + F + L++Q S T D SDYLWY T ++ + L L + S+G
Sbjct: 438 TNSL---DETAFTKDGLVEQLSMTWDKSDYLWYTTYVNIDSGEQFLKSGQWPQLTVYSAG 494
Query: 469 QVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPN 528
+ +VNG Y + + Y + VK+ +G N+IS+LS+ VGL N G+ ++
Sbjct: 495 HSVQVFVNGQYFGNAYGGYDGPKLTYSGYVKMWQGSNKISILSSAVGLPNVGTHYETWNI 554
Query: 529 GIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLN 588
G+ GPV L G + +DLS KWTY++GL G + +++ E G ++ P+
Sbjct: 555 GVLGPVTLSGLNEGK---RDLSKQKWTYQIGLKGEKLGVHSVSGSSSVEWGGAAGKQPV- 610
Query: 589 RRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYR 648
TW++ F AP PV L+L MGKG AWVNG+ +GRYW +Y A + C C Y
Sbjct: 611 ---TWHRAYFNAPAGGAPVALDLGSMGKGQAWVNGHLIGRYW-SYKASGN-CG--GCSYA 663
Query: 649 GPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
G Y KC NCG+ SQ WYHVPRSW+ N +VL EEFGG+ S + T
Sbjct: 664 GTYSEKKCQANCGDASQRWYHVPRSWLNPSGNLVVLLEEFGGDLSGVTLMT 714
>gi|448278449|gb|AGE44111.1| beta-galactosidase 101 [Malus x domestica]
Length = 725
Score = 627 bits (1617), Expect = e-177, Method: Compositional matrix adjust.
Identities = 332/725 (45%), Positives = 439/725 (60%), Gaps = 56/725 (7%)
Query: 13 LCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGL 72
+ L+L +F+ + A V +D +AI I+G+R+IL+SGSIHYPRSTP MWPDLI+KAK GGL
Sbjct: 11 ILLLLSCIFSAASA-SVGYDHKAIIINGQRRILISGSIHYPRSTPEMWPDLIQKAKAGGL 69
Query: 73 DAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPV 132
D I+TYVFWN HEP +Y F DL++FIK +Q GL+V LRIGPYVCAEWN+GGFP+
Sbjct: 70 DVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGFPI 129
Query: 133 WLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS 192
WL +PGI RT N+ F MQ FT IV+M K EKLF ++GGPIIL+QIENEYG V
Sbjct: 130 WLKYVPGIA-FRTDNEPFKAAMQKFTEKIVNMMKAEKLFQTEGGPIILSQIENEYGPVEW 188
Query: 193 DYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKI 241
+ G GK+Y W A+MA L+ GVPWIMC++ DAP P+ F PN PK+
Sbjct: 189 EIGAPGKAYTKWAAQMAVGLNTGVPWIMCKQEDAPDPVIDTCNGYYCENFKPNKVYKPKM 248
Query: 242 WTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT 301
WTE WTGW+ +GG P R EDLAF+VARF Q GG+F NYYMYHGGTNFGRT+GGP++
Sbjct: 249 WTEVWTGWYTEFGGAIPTRPVEDLAFSVARFIQSGGSFFNYYMYHGGTNFGRTAGGPFMA 308
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN----------- 350
TSYDYDAP+DEYG L QPKWGHL++LHK +KS E L + + T GN
Sbjct: 309 TSYDYDAPLDEYGLLQQPKWGHLKDLHKAIKSCEYALVAVDPSVTKLGNNQEAHVFNTKS 368
Query: 351 -----------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQ 393
S Y+LP WS+SILPDCKT FNTAKV +T+ +P
Sbjct: 369 GCAAFLANYDTKYPVRVSFGQGQYDLPPWSISILPDCKTAVFNTAKVTWKTSQVQMKPVY 428
Query: 394 AGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDP 452
+ + W+ + G L+ L +Q T D +DYLWYMT+ + D+
Sbjct: 429 S-------RLPWQSFIEETTTSDESGTTTLDGLYEQIYMTRDATDYLWYMTDITIGSDEA 481
Query: 453 ILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSA 512
L+ L I S+ LH ++NG + + F + VKL G N+++LLS
Sbjct: 482 FLNNGKFPLLTIFSACHALHVFINGQLSGTVYGSLENPKLTFSQNVKLRPGINKLALLSI 541
Query: 513 TVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAK 572
+VGL N G+ F+ G+ GP+ L G T D+S KWTYK+G+ G + +
Sbjct: 542 SVGLPNVGTHFETWNAGVLGPISLKGL---NTGTWDMSRWKWTYKIGMKG-EALGLHTVT 597
Query: 573 AANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPT 632
++S ++ + +TWYK TF AP + P+ L++ MGKG W+NG ++GR+WP
Sbjct: 598 GSSSVDWAEGPSMAKKQPLTWYKATFNAPPGHAPLALDMGSMGKGQIWINGQSVGRHWPG 657
Query: 633 YLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNP 692
Y+A+ C T C+Y G + KC CG PSQ WYH+PRSW+ N LV+FEE+GG+P
Sbjct: 658 YIAQ-GSCGT--CNYAGTFYDKKCRTYCGKPSQRWYHIPRSWLTPTGNLLVVFEEWGGDP 714
Query: 693 SQINF 697
++
Sbjct: 715 QWMSL 719
>gi|357124049|ref|XP_003563719.1| PREDICTED: beta-galactosidase 9-like isoform 2 [Brachypodium
distachyon]
Length = 721
Score = 625 bits (1613), Expect = e-176, Method: Compositional matrix adjust.
Identities = 340/712 (47%), Positives = 424/712 (59%), Gaps = 60/712 (8%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D +AI I+G+R+IL+SGSIHYPRSTP MWPDLI+KAK+GGLD I+TYVFWN HEP++
Sbjct: 26 VSYDHKAIVINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQ 85
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QY F DL+RF+K + GLYV LRIGPYVCAEWN+GGFPVWL +PGI RT N
Sbjct: 86 GQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIS-FRTDNG 144
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ F IV M K E LF QGGPIILAQ+ENEYG + S G K Y NW AKM
Sbjct: 145 PFKAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGGGAKPYANWAAKM 204
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A + GVPW+MC++ DAP P+ FTPN+ P +WTE W+GWF ++GG
Sbjct: 205 AVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNSNGKPNMWTEAWSGWFTAFGGAV 264
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R EDLAFAVARF Q GG+F NYYMYHGGTNF RT+GGP++ TSYDYDAPIDEYG L
Sbjct: 265 PHRPVEDLAFAVARFVQKGGSFVNYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLLR 324
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------------------------- 350
QPKWGHLR+LHK +K E + G+ T GN
Sbjct: 325 QPKWGHLRDLHKAIKQAEPAMVSGDPTIQSIGNYEKAYVFKSSTGACAAFLSNYHTSSPA 384
Query: 351 --SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
+G Y LPAWS+SILPDCKT +NTA V + K N AG W+ E
Sbjct: 385 KVVYNGRRYELPAWSISILPDCKTAVYNTATVRQKWKEKKLWMNPAGG----FSWQSYSE 440
Query: 409 MINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
N F + L++Q S T D SD+LWY T ++ + L L INS+
Sbjct: 441 DTNSL---DDSAFTKDGLVEQLSMTWDKSDFLWYTTYVNIDSSEQFLKSGQWPQLTINSA 497
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G L +VNG + + Y + + + VK+ +G N+IS+LS+ VGL N G+ ++
Sbjct: 498 GHTLQVFVNGQSYGAGYGGYDSPKLSYSKYVKMWQGSNKISILSSAVGLANQGTHYENWN 557
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
G+ GPV L G + +DLS+ KWTY++GL G + ++ +S W S N
Sbjct: 558 VGVLGPVTLSGLNQGK---RDLSNQKWTYQIGLKG--ESLGVHSITGSSSVEWGSANGA- 611
Query: 588 NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDY 647
+ +TW+K F AP PV L++ MGKG WVNG N GRYW +Y A S SC Y
Sbjct: 612 -QPLTWHKAYFSAPAGGAPVALDMGSMGKGQIWVNGRNAGRYW-SYKAS---GSCGSCSY 666
Query: 648 RGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
G Y KC NCG+ SQ WYHVPRSW+ N LV+ EEFGG+ S + T
Sbjct: 667 TGTYSETKCQTNCGDISQRWYHVPRSWLNPSGNLLVVLEEFGGDLSGVKLMT 718
>gi|326497687|dbj|BAK05933.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 716
Score = 625 bits (1612), Expect = e-176, Method: Compositional matrix adjust.
Identities = 337/711 (47%), Positives = 422/711 (59%), Gaps = 62/711 (8%)
Query: 30 SHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRR 89
S+D RA+ I+G+R+IL+SGSIHYPRSTP MWPDLI+KAK+GGLD I+TYVFWN HEP R
Sbjct: 24 SYDHRAVVINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPARG 83
Query: 90 QYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKV 149
QY F DL+RF+K + GLYV LRIGPYVCAEWN+GGFPVWL +PGI RT N
Sbjct: 84 QYHFADRYDLVRFVKLARQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIS-FRTDNGP 142
Query: 150 FMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMA 209
F EMQ F IV M K E LF QGGPIILAQ+ENEYG + S G K Y NW A MA
Sbjct: 143 FKAEMQRFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESAMGAGAKPYANWAANMA 202
Query: 210 TSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDP 258
+ D GVPW+MC++ DAP P+ FTPN+ + P +WTE WTGWF ++GG P
Sbjct: 203 VATDAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNSNSKPTMWTEAWTGWFTAFGGPVP 262
Query: 259 KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQ 318
R ED+AFAVARF Q GG+F NYYMYHGGTNF RT+GGP++ TSYDYDAPIDEYG + Q
Sbjct: 263 HRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLIRQ 322
Query: 319 PKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN---------------------------- 350
PKWGHLR+LHK +K E L G+ T GN
Sbjct: 323 PKWGHLRDLHKAIKQAEPALVSGDPTIQRIGNYEKAYVFKSSTGACAAFLSNYHTSSAAR 382
Query: 351 -SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEM 409
+G Y+LPAWS+SILPDCKT FNTA V T + N AG W+ E
Sbjct: 383 IVYNGRRYDLPAWSISILPDCKTAVFNTATVKEPT--APAKMNPAGG----FAWQSYSED 436
Query: 410 INDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSG 468
N F + L++Q S T D SDYLWY T ++ + L L INS+G
Sbjct: 437 TNAL---DSSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIDSSEQFLKTGQWPQLTINSAG 493
Query: 469 QVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPN 528
+ +VNG + Y + + +PVK+ +G N+IS+LS+ +GL N G+ ++
Sbjct: 494 HSVQVFVNGQSFGVAYGGYNSPKLTYSKPVKMWQGSNKISILSSAMGLPNQGTHYEAWNV 553
Query: 529 GIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLN 588
G+ GPV L G + +DLS+ KWTY++GL G + + + +
Sbjct: 554 GVLGPVTLSGLNQGK---RDLSNQKWTYQIGLKG----ESLGVNSISGSSSVEWSSASGA 606
Query: 589 RRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYR 648
+ +TW+K F AP + PV L++ MGKG WVNG N GRYW +Y A S C Y
Sbjct: 607 QPLTWHKAYFAAPAGSAPVALDMGSMGKGQIWVNGNNAGRYW-SYRAS---GSCGGCSYA 662
Query: 649 GPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
G + KC NCG+ SQ WYHVPRSW+K N LV+ EEFGG+ S + T
Sbjct: 663 GTFSEAKCQTNCGDISQRWYHVPRSWLKPSGNLLVVLEEFGGDLSGVTLMT 713
>gi|293332101|ref|NP_001168664.1| uncharacterized protein LOC100382452 [Zea mays]
gi|223950023|gb|ACN29095.1| unknown [Zea mays]
Length = 815
Score = 625 bits (1611), Expect = e-176, Method: Compositional matrix adjust.
Identities = 343/804 (42%), Positives = 455/804 (56%), Gaps = 88/804 (10%)
Query: 59 MWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIG 118
MW LI+KAK+GGLD I+TYVFWN HEP Y F DL+RF+KT+Q GL+V LRIG
Sbjct: 29 MWEGLIQKAKDGGLDVIQTYVFWNGHEPTPGNYYFEERYDLVRFVKTVQKAGLFVHLRIG 88
Query: 119 PYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPI 178
PY+C EWN+GGFPVWL +PGI RT N+ F MQ FT IV M K E LFASQGGPI
Sbjct: 89 PYICGEWNFGGFPVWLKYVPGIS-FRTDNEPFKTAMQGFTEKIVGMMKSENLFASQGGPI 147
Query: 179 ILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-------- 230
IL+QIENEYG ++G AG++YINW AKMA LD GVPW+MC+E DAP P+
Sbjct: 148 ILSQIENEYGPEGKEFGAAGQAYINWAAKMAVGLDTGVPWVMCKEEDAPDPVINACNGFY 207
Query: 231 ---FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHG 287
F+PN P P +WTE W+GWF +GG +R EDLAFAVARF Q GG+F NYYMYHG
Sbjct: 208 CDAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAFAVARFVQKGGSFINYYMYHG 267
Query: 288 GTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTL--------T 339
GTNFGRT+GGP++TTSYDYDAPIDEYG + +PK HL+ELH+ +K E+ L T
Sbjct: 268 GTNFGRTAGGPFITTSYDYDAPIDEYGLIREPKHSHLKELHRAVKLCEQALVSVDPTITT 327
Query: 340 YGNVTNTDYGNSVSGSS--------------------YNLPAWSVSILPDCKTEEFNTAK 379
G + S SG + Y+LP WS+SILPDCK FN+A
Sbjct: 328 LGTMQEAHVFRSPSGCAAFLANYNSNSHAKVVFNNEQYSLPPWSISILPDCKNVVFNSAT 387
Query: 380 VNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDY 438
V QT+ + G+ + W+ E ++ L++Q T D SDY
Sbjct: 388 VGVQTS----QMQMWGDGATSMMWERYDEEVDSLAA--APLLTTTGLLEQLNVTRDSSDY 441
Query: 439 LWYMTNADLKDDDPILSGSSN-MTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDL---F 494
LWY+T+ D+ + L G +L + S+G LH +VNG Q + YG D +
Sbjct: 442 LWYITSVDISPSENFLQGGGKPPSLSVQSAGHALHVFVNGQL---QGSSYGTREDRRIKY 498
Query: 495 ERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKW 554
V L G N+I+LLS GL N G ++ G+ GPV+L G +DL+ W
Sbjct: 499 NGNVNLRAGTNKIALLSVACGLPNVGVHYETWNTGVGGPVVLHGLNEGS---RDLTWQTW 555
Query: 555 TYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRR--MTWYKTTFEAPLENDPVVLNLQ 612
+Y+VGL G ++ N+ + W ++ ++ + WYK FE P ++P+ L++
Sbjct: 556 SYQVGLKG--EQMNLNSVEGSGSVEWMQGSLIAQKQQPLAWYKAYFETPSGDEPLALDMG 613
Query: 613 GMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPR 672
MGKG W+NG ++GRYW Y DG + C Y G + + KC CG P+Q WYHVPR
Sbjct: 614 SMGKGQVWINGQSIGRYWTAY---ADG-DCKGCSYTGTFRAPKCQAGCGQPTQRWYHVPR 669
Query: 673 SWIKDGVNTLVLFEEF-GGNPSQINFQTVVVGTACG-------------------QAHEN 712
SW++ N LV+ EE GG+ S+I V + C + H
Sbjct: 670 SWLQPSRNLLVVLEELGGGDSSKIALAKRSVSSVCADVSEDHPNIKKWQIESYGEREHRR 729
Query: 713 KTMELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEA 771
+ L C HG+ IS I++ASFG P G CG F++G C + ++EK+C+G + C +
Sbjct: 730 AKVHLRCAHGQSISAIRFASFGTPVGTCGNFQQGGCHSA-SSHAVLEKRCIGLQRCVVAI 788
Query: 772 SEANLGATSCAAGTVKRLVVEALC 795
S N G C + T KR+ VEA+C
Sbjct: 789 SPDNFGGDPCPSVT-KRVAVEAVC 811
>gi|3641865|emb|CAA09457.1| beta-galactosidase [Cicer arietinum]
Length = 723
Score = 625 bits (1611), Expect = e-176, Method: Compositional matrix adjust.
Identities = 336/716 (46%), Positives = 426/716 (59%), Gaps = 58/716 (8%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
++ V++D + I IDG+R+IL+SGSIHYPRSTP MWP L +KAKEGGLD I+TYVFWN
Sbjct: 20 AVTASVTYDHKTIVIDGQRRILISGSIHYPRSTPEMWPALFQKAKEGGLDVIQTYVFWNG 79
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
HEP +Y F DL++FIK Q GLYV LRIGPYVCAEWN+GGFPVWL +PGI
Sbjct: 80 HEPSPGKYYFEDRFDLVKFIKLAQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIS-F 138
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYIN 203
RT N+ F MQ FTT IV M K E LF +QGGPII++QIENEYG V + G GK+Y N
Sbjct: 139 RTDNEPFKAAMQKFTTKIVSMMKAENLFQNQGGPIIMSQIENEYGPVEWNIGAPGKAYTN 198
Query: 204 WCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKS 252
W A+MA LD GVPW MC++ DAP P+ FTPN PK+WTENW+GW+
Sbjct: 199 WAAQMAVGLDTGVPWDMCKQEDAPDPVIDTCNGYYCENFTPNKNYKPKMWTENWSGWYTD 258
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDE 312
+G R EDLA++VARF Q G+F NYYMYHGGTNFGRTS G ++ TSYDYDAPIDE
Sbjct: 259 FGNAICYRPVEDLAYSVARFIQNRGSFVNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDE 318
Query: 313 YGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSV------SGSS---------- 356
YG N+PKW HLR+LHK +K E L + T T GN + +G+S
Sbjct: 319 YGLTNEPKWSHLRDLHKAIKQCEPALVSVDPTITSLGNKLEAHVYSTGTSVCAAFLANYD 378
Query: 357 -------------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQW 403
Y+LP WSVSILPDCKT+ FNTAKV Q++ K + D
Sbjct: 379 TKSAATVTFGNGKYDLPPWSVSILPDCKTDVFNTAKVGAQSSQKTMISTNSTFD------ 432
Query: 404 KWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTL 462
W+ + L +Q T D SDYLWY+T+ ++ ++ + L
Sbjct: 433 -WQSYIEEPAFSSEDDSITAEALWEQINVTRDSSDYLWYLTDVNISPNEDFIKNGQYPIL 491
Query: 463 RINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSK 522
+ S+G VLH +VNG + + F V LT G N+ISLLS VGL N G
Sbjct: 492 NVMSAGHVLHVFVNGQLSGTVYGVLDNPKLTFSNSVNLTVGNNKISLLSVAVGLPNVGLH 551
Query: 523 FDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSS 582
F+ G+ GPV L G +DLS KW+YKVGL G + + S W+
Sbjct: 552 FETWNVGVLGPVTLKGL---NEGTRDLSWQKWSYKVGLKG--ESLSLHTITGGSSVDWTQ 606
Query: 583 KNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCS 641
++ ++ +TWYK TF AP NDP+ L++ MGKG WVN ++GR+WP Y+A S
Sbjct: 607 GSLLAKKQPLTWYKATFNAPAGNDPLGLDMSSMGKGEIWVNDQSIGRHWPGYIAHG---S 663
Query: 642 TESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
CDY G + + KC NCGNP+Q WYH+PRSW+ N LV+ EE+GG+PS I+
Sbjct: 664 CGDCDYAGTFTNTKCRTNCGNPTQTWYHIPRSWLNPTGNVLVVLEEWGGDPSGISL 719
>gi|356509962|ref|XP_003523711.1| PREDICTED: beta-galactosidase 3-like isoform 2 [Glycine max]
Length = 729
Score = 624 bits (1609), Expect = e-176, Method: Compositional matrix adjust.
Identities = 332/717 (46%), Positives = 431/717 (60%), Gaps = 64/717 (8%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D +++ I+G+R+IL+SGSIHYPRSTP MW DLI KAK GGLD I+TYVFW+ HEP
Sbjct: 30 VTYDRKSLLINGQRRILISGSIHYPRSTPEMWEDLIWKAKHGGLDVIDTYVFWDVHEPSP 89
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
YDF G DL+RFIKT+Q GLY LRIGPYVCAEWN+GG PVWL +PG+ RT N+
Sbjct: 90 GNYDFEGRYDLVRFIKTVQKVGLYANLRIGPYVCAEWNFGGIPVWLKYVPGV-SFRTDNE 148
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ FT IV M K EKLF SQGGPIIL+QIENEYG G AG++Y+NW A M
Sbjct: 149 PFKAAMQGFTQKIVQMMKSEKLFQSQGGPIILSQIENEYGP--ESRGAAGRAYVNWAASM 206
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A L GVPW+MC+E+DAP P+ F+PN P P +WTE W+GWF +GG
Sbjct: 207 AVGLGTGVPWVMCKENDAPDPVINSCNGFYCDDFSPNKPYKPSMWTETWSGWFTEFGGPI 266
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
+R EDL+FAVARF Q GG++ NYYMYHGGTNFGR++GGP++TTSYDYDAPIDEYG +
Sbjct: 267 HQRPVEDLSFAVARFIQKGGSYVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGLIR 326
Query: 318 QPKWGHLRELHKLLKSMEK--------TLTYGNVTNTDYGNSVSGS-------------- 355
QPK+ HL+ELHK +K E L+ G + +S +G+
Sbjct: 327 QPKYSHLKELHKAIKRCEHALVSLDPTVLSLGTLLQAHVFSSGTGTCAAFLANYNAQSAA 386
Query: 356 -------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
Y+LP WS+SILPDCK + FNTAKV VK K W+ E
Sbjct: 387 TVTFNNRHYDLPPWSISILPDCKIDVFNTAKVK-MLPVKPKL----------FSWESYDE 435
Query: 409 MINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
++ + L++Q T D SDYLWY+T+ D+ + L G ++ + S+
Sbjct: 436 DLSSLAESSR--ITAPGLLEQLNVTRDTSDYLWYITSVDISSSESFLRGGQKPSINVQSA 493
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G +H +VNG + S + + + PV L G N+I+LLS TVGLQN G ++
Sbjct: 494 GHAVHVFVNGQFSGSAFGTREQRSCTYNGPVDLRAGANKIALLSVTVGLQNVGRHYETWE 553
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
GI GPVLL G + KDL+ +KW+YKVGL G ++ + S+
Sbjct: 554 AGITGPVLLHGLDQGQ---KDLTWNKWSYKVGLRGEAMNLVSPNGVSSVDWVQESQATQS 610
Query: 588 NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDY 647
++ WYK F+AP +P+ L+L+ MGKG W+NG ++GRYW Y A+ D SC Y
Sbjct: 611 RSQLKWYKAYFDAPGGKEPLALDLESMGKGQVWINGQSIGRYWMAY-AKGD---CNSCTY 666
Query: 648 RGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGT 704
G + KC CG P+Q WYHVPRSW+K N +V+FEE GGNP +I+ V T
Sbjct: 667 SGTFRPVKCQLGCGQPTQRWYHVPRSWLKPTKNLIVVFEELGGNPWKISLVKRVAHT 723
>gi|75134155|sp|Q6Z6K4.1|BGAL4_ORYSJ RecName: Full=Beta-galactosidase 4; Short=Lactase 4; Flags:
Precursor
gi|46805855|dbj|BAD17189.1| putative beta-galactosidase precursor [Oryza sativa Japonica Group]
Length = 729
Score = 623 bits (1606), Expect = e-175, Method: Compositional matrix adjust.
Identities = 341/712 (47%), Positives = 424/712 (59%), Gaps = 64/712 (8%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D R++ I+G R+ILLSGSIHYPRSTP MWP LI+KAK+GGLD I+TYVFWN HEP++
Sbjct: 38 VSYDRRSLVINGRRRILLSGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHEPVQ 97
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QY F+ DL+RF+K ++ GLYV LRIGPYVCAEWN+GGFPVWL +PG+ RT N
Sbjct: 98 GQYYFSDRYDLVRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGV-SFRTDNG 156
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F EMQ F IV M K E LF QGGPII++Q+ENE+G + S G K Y NW AKM
Sbjct: 157 PFKAEMQKFVEKIVSMMKSEGLFEWQGGPIIMSQVENEFGPMESVGGSGAKPYANWAAKM 216
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A + GVPW+MC++ DAP P+ F+PN P +WTE WTGWF S+GG
Sbjct: 217 AVGTNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKNYKPSMWTEAWTGWFTSFGGGV 276
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R EDLAFAVARF Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAPIDE+G L
Sbjct: 277 PHRPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLLR 336
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVT-----------------------------NTDY 348
QPKWGHLR+LH+ +K E L + T NT
Sbjct: 337 QPKWGHLRDLHRAIKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNTAV 396
Query: 349 GNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
+G YNLPAWS+SILPDCKT FNTA V T + P W+ E
Sbjct: 397 KVRFNGQQYNLPAWSISILPDCKTAVFNTATVKEPTLMPKMNP------VVRFAWQSYSE 450
Query: 409 MINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
N F + L++Q S T D SDYLWY T ++ +D L + L + S+
Sbjct: 451 DTNSL---SDSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTND--LRSGQSPQLTVYSA 505
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G + +VNG S + Y + VK+ +G N+IS+LS+ VGL N G+ F+
Sbjct: 506 GHSMQVFVNGKSYGSVYGGYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFENWN 565
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
G+ GPV L G KDLS KWTY+VGL G ++ E G PL
Sbjct: 566 VGVLGPVTLSSLNGG---TKDLSHQKWTYQVGLKGETLGLHTVTGSSAVEWGGPGGYQPL 622
Query: 588 NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDY 647
TW+K F AP NDPV L++ MGKG WVNG+++GRYW +Y A GC C Y
Sbjct: 623 ----TWHKAFFNAPAGNDPVALDMGSMGKGQLWVNGHHVGRYW-SYKA-SGGCG--GCSY 674
Query: 648 RGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
G Y DKC NCG+ SQ WYHVPRSW+K G N LV+ EE+GG+ + ++ T
Sbjct: 675 AGTYHEDKCRSNCGDLSQRWYHVPRSWLKPGGNLLVVLEEYGGDLAGVSLAT 726
>gi|357124047|ref|XP_003563718.1| PREDICTED: beta-galactosidase 9-like isoform 1 [Brachypodium
distachyon]
Length = 719
Score = 623 bits (1606), Expect = e-175, Method: Compositional matrix adjust.
Identities = 339/712 (47%), Positives = 424/712 (59%), Gaps = 62/712 (8%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D +AI I+G+R+IL+SGSIHYPRSTP MWPDLI+KAK+GGLD I+TYVFWN HEP++
Sbjct: 26 VSYDHKAIVINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQ 85
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QY F DL+RF+K + GLYV LRIGPYVCAEWN+GGFPVWL +PGI RT N
Sbjct: 86 GQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGI-SFRTDNG 144
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ F IV M K E LF QGGPIILAQ+ENEYG + S G K Y NW AKM
Sbjct: 145 PFKAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGGGAKPYANWAAKM 204
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A + GVPW+MC++ DAP P+ FTPN+ P +WTE W+GWF ++GG
Sbjct: 205 AVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNSNGKPNMWTEAWSGWFTAFGGAV 264
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R EDLAFAVARF Q GG+F NYYMYHGGTNF RT+GGP++ TSYDYDAPIDEYG L
Sbjct: 265 PHRPVEDLAFAVARFVQKGGSFVNYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLLR 324
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------------------------- 350
QPKWGHLR+LHK +K E + G+ T GN
Sbjct: 325 QPKWGHLRDLHKAIKQAEPAMVSGDPTIQSIGNYEKAYVFKSSTGACAAFLSNYHTSSPA 384
Query: 351 --SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
+G Y LPAWS+SILPDCKT +NTA V + + N AG W+ E
Sbjct: 385 KVVYNGRRYELPAWSISILPDCKTAVYNTATVKEPS--APAKMNPAGG----FSWQSYSE 438
Query: 409 MINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
N F + L++Q S T D SD+LWY T ++ + L L INS+
Sbjct: 439 DTNSL---DDSAFTKDGLVEQLSMTWDKSDFLWYTTYVNIDSSEQFLKSGQWPQLTINSA 495
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G L +VNG + + Y + + + VK+ +G N+IS+LS+ VGL N G+ ++
Sbjct: 496 GHTLQVFVNGQSYGAGYGGYDSPKLSYSKYVKMWQGSNKISILSSAVGLANQGTHYENWN 555
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
G+ GPV L G + +DLS+ KWTY++GL G + ++ +S W S N
Sbjct: 556 VGVLGPVTLSGLNQGK---RDLSNQKWTYQIGLKG--ESLGVHSITGSSSVEWGSANGA- 609
Query: 588 NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDY 647
+ +TW+K F AP PV L++ MGKG WVNG N GRYW +Y A S SC Y
Sbjct: 610 -QPLTWHKAYFSAPAGGAPVALDMGSMGKGQIWVNGRNAGRYW-SYKAS---GSCGSCSY 664
Query: 648 RGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
G Y KC NCG+ SQ WYHVPRSW+ N LV+ EEFGG+ S + T
Sbjct: 665 TGTYSETKCQTNCGDISQRWYHVPRSWLNPSGNLLVVLEEFGGDLSGVKLMT 716
>gi|334305536|gb|AEG76892.1| putative beta-galactosidase [Linum usitatissimum]
gi|334305538|gb|AEG76893.1| putative beta-galactosidase [Linum usitatissimum]
Length = 731
Score = 623 bits (1606), Expect = e-175, Method: Compositional matrix adjust.
Identities = 329/715 (46%), Positives = 430/715 (60%), Gaps = 68/715 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DG+AI ++G+R+IL++GSIHYPRSTP MWPDLI+KAK+GGLD I+TYVFWN HEP
Sbjct: 31 VTYDGKAIIVNGQRRILIAGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPSP 90
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y F DL++F+K +Q GLYV LRIGPY CAEWN+GGFPVWL +PG+ RT N+
Sbjct: 91 GNYYFEDRFDLVKFVKVVQQAGLYVNLRIGPYACAEWNFGGFPVWLKYVPGMS-FRTDNE 149
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ FT IV+M K+E+LF QGGPIIL+QIENEYG + + GK+Y W A+M
Sbjct: 150 PFKAAMQKFTEKIVNMMKQEQLFEPQGGPIILSQIENEYGPIEWELKAPGKAYAQWAAQM 209
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A L+ GVPWI C++ DAP P+ FTPN PK+WTE WT WF SWG
Sbjct: 210 AVGLNTGVPWIACKQEDAPDPLIDTCNAYYCEKFTPNKSYKPKMWTEAWTAWFTSWGNPV 269
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
R AED AF+V +F Q GG++ NYYMYHGGTNFGRT+GGP++ TSYDYDAP+DEYG N
Sbjct: 270 LYRPAEDQAFSVLKFIQSGGSYANYYMYHGGTNFGRTAGGPFVATSYDYDAPLDEYGLTN 329
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG--------NSVSGSS------------- 356
PK+ HL+ +HK +K EK L + T T G +S SG +
Sbjct: 330 DPKYTHLKHMHKAIKQSEKALVSADATVTSLGTNQEAHVYSSSSGCAAFLANYDVSYSVK 389
Query: 357 -------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEM 409
Y+LPAWS+SILPDCKTE +NTAKV R ++ W
Sbjct: 390 VNFGSGQYDLPAWSISILPDCKTEVYNTAKV------LAPRVHKKMTPLGGFTW------ 437
Query: 410 INDFVVRGKGHFALNTLIDQK------STNDVSDYLWYMTNADLKDDDPILSGSSNMTLR 463
+ ++ FA +T + T D SDYLWYM + + D+ L+ + L
Sbjct: 438 -DSYIDEVASGFASDTTTEDGLWEQLYMTKDSSDYLWYMQDVKIGSDEAFLTNGKDPFLN 496
Query: 464 INSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF 523
+ S+G L+ +VNG + S + F + VKL G N+I+LLSA+VGL N G F
Sbjct: 497 VQSAGHFLNVFVNGKLIGSAYGSNDNPKLTFSQSVKLNVGVNKIALLSASVGLANVGLHF 556
Query: 524 DMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSK 583
+ G+ GPV L G D++ KW+YKVG+ G +K N A +S W
Sbjct: 557 ENYNVGVLGPVTLTGLNQGTV---DMTKWKWSYKVGVQG--EKLQLNTVAGSSSVEWVKG 611
Query: 584 NVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCST 642
++ ++ +TWYK+TF AP NDPV L++ MGKG W+NG +GRYWP Y A+ + C
Sbjct: 612 SMLAKKQPLTWYKSTFNAPEGNDPVALDMISMGKGQIWINGQGIGRYWPAYTAQGN-CG- 669
Query: 643 ESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
C Y G + KC CG P+Q WYHVPRSW+K N LV+FEE+GG+P+ I+
Sbjct: 670 -GCSYGGYFTEKKCLTGCGQPTQRWYHVPRSWLKPTGNLLVVFEEWGGDPTGISM 723
>gi|152013361|sp|A2X2H7.1|BGAL4_ORYSI RecName: Full=Beta-galactosidase 4; Short=Lactase 4; Flags:
Precursor
gi|125538642|gb|EAY85037.1| hypothetical protein OsI_06394 [Oryza sativa Indica Group]
Length = 729
Score = 622 bits (1605), Expect = e-175, Method: Compositional matrix adjust.
Identities = 341/712 (47%), Positives = 424/712 (59%), Gaps = 64/712 (8%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D R++ I+G R+ILLSGSIHYPRSTP MWP LI+KAK+GGLD I+TYVFWN HEP++
Sbjct: 38 VSYDRRSLVINGRRRILLSGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHEPVQ 97
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QY F+ DL+RF+K ++ GLYV LRIGPYVCAEWN+GGFPVWL +PG+ RT N
Sbjct: 98 GQYYFSDRYDLVRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGV-SFRTDNG 156
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F EMQ F IV M K E LF QGGPII++Q+ENE+G + S G K Y NW AKM
Sbjct: 157 PFKAEMQKFVEKIVSMMKSEGLFEWQGGPIIMSQVENEFGPMESVGGSGAKPYANWAAKM 216
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A + GVPW+MC++ DAP P+ F+PN P +WTE WTGWF S+GG
Sbjct: 217 AVRTNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKNYKPSMWTEAWTGWFTSFGGGV 276
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R EDLAFAVARF Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAPIDE+G L
Sbjct: 277 PHRPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLLR 336
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVT-----------------------------NTDY 348
QPKWGHLR+LH+ +K E L + T NT
Sbjct: 337 QPKWGHLRDLHRAIKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNTAV 396
Query: 349 GNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
+G YNLPAWS+SILPDCKT FNTA V T + P W+ E
Sbjct: 397 KVRFNGQQYNLPAWSISILPDCKTAVFNTATVKEPTLMPKMNP------VVRFAWQSYSE 450
Query: 409 MINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
N F + L++Q S T D SDYLWY T ++ +D L + L + S+
Sbjct: 451 DTNSL---SDSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTND--LRSGQSPQLTVYSA 505
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G + +VNG S + Y + VK+ +G N+IS+LS+ VGL N G+ F+
Sbjct: 506 GHSMQVFVNGKSYGSVYGGYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFENWN 565
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
G+ GPV L G KDLS KWTY+VGL G ++ E G PL
Sbjct: 566 VGVLGPVTLSSLNGG---TKDLSHQKWTYQVGLKGETLGLHTVTGSSAVEWGGPGGYQPL 622
Query: 588 NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDY 647
TW+K F AP NDPV L++ MGKG WVNG+++GRYW +Y A GC C Y
Sbjct: 623 ----TWHKAFFNAPAGNDPVALDMGSMGKGQLWVNGHHVGRYW-SYKA-SGGCG--GCSY 674
Query: 648 RGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
G Y DKC NCG+ SQ WYHVPRSW+K G N LV+ EE+GG+ + ++ T
Sbjct: 675 AGTYHEDKCRSNCGDLSQRWYHVPRSWLKPGGNLLVVLEEYGGDLAGVSLAT 726
>gi|30687121|ref|NP_849553.1| beta-galactosidase 12 [Arabidopsis thaliana]
gi|75265630|sp|Q9SCV0.1|BGL12_ARATH RecName: Full=Beta-galactosidase 12; Short=Lactase 12; Flags:
Precursor
gi|6686896|emb|CAB64748.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332659762|gb|AEE85162.1| beta-galactosidase 12 [Arabidopsis thaliana]
Length = 728
Score = 622 bits (1603), Expect = e-175, Method: Compositional matrix adjust.
Identities = 332/731 (45%), Positives = 442/731 (60%), Gaps = 63/731 (8%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
ILL ++ + S+ V++D +A+ I+G+R+ILLSGSIHYPRSTP MWPDLI+KAK+G
Sbjct: 11 ILLGILCCSSLICSVKAIVTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKDG 70
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GLD I+TYVFWN HEP QY F DL++FIK +Q GLYV LRIGPYVCAEWN+GGF
Sbjct: 71 GLDVIQTYVFWNGHEPSPGQYYFEDRYDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGF 130
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
PVWL +PG+ RT N+ F MQ FT IV M K+EKLF +QGGPIIL+QIENEYG +
Sbjct: 131 PVWLKYVPGM-VFRTDNEPFKAAMQKFTEKIVRMMKEEKLFETQGGPIILSQIENEYGPI 189
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSP 239
+ G GK+Y W A+MA L GVPWIMC++ DAP+ + F PN+ N P
Sbjct: 190 EWEIGAPGKAYTKWVAEMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNSDNKP 249
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
K+WTENWTGWF +GG P R AED+A +VARF Q GG+F NYYMYHGGTNF RT+ G +
Sbjct: 250 KMWTENWTGWFTEFGGAVPYRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDRTA-GEF 308
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNS-------- 351
+ TSYDYDAP+DEYG +PK+ HL+ LHK++K E L + T T G+
Sbjct: 309 IATSYDYDAPLDEYGLPREPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQEAHVFKS 368
Query: 352 --------------------VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQT---NVKV 388
GS+Y+LP WSVSILPDCKTE +NTAKV +T ++K+
Sbjct: 369 KSSCAAFLSNYNTSSAARVLFGGSTYDLPPWSVSILPDCKTEYYNTAKVQVRTSSIHMKM 428
Query: 389 KRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADL 447
N P W E I G F+ + L++Q S T D +DY WY+T+ +
Sbjct: 429 VPTN------TPFSWGSYNEEIPS--ANDNGTFSQDGLVEQISITRDKTDYFWYLTDITI 480
Query: 448 KDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQI 507
D+ L+G + L I S+G LH +VNG + + F + +KL G N++
Sbjct: 481 SPDEKFLTGEDPL-LTIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVNKL 539
Query: 508 SLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKK 567
+LLS GL N G ++ G+ GPV L G + D++ KW+YK+G G +
Sbjct: 540 ALLSTAAGLPNVGVHYETWNTGVLGPVTL---NGVNSGTWDMTKWKWSYKIGTKG--EAL 594
Query: 568 FYNAKAANSERGWSSKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNL 626
+ A +S W ++ ++ +TWYK+TF++P N+P+ L++ MGKG W+NG N+
Sbjct: 595 SVHTLAGSSTVEWKEGSLVAKKQPLTWYKSTFDSPTGNEPLALDMNTMGKGQMWINGQNI 654
Query: 627 GRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
GR+WP Y A E C Y G + KC NCG SQ WYHVPRSW+K N +++ E
Sbjct: 655 GRHWPAYTAR---GKCERCSYAGTFTEKKCLSNCGEASQRWYHVPRSWLKPTNNLVIVLE 711
Query: 687 EFGGNPSQINF 697
E+GG P+ I+
Sbjct: 712 EWGGEPNGISL 722
>gi|255563853|ref|XP_002522927.1| beta-galactosidase, putative [Ricinus communis]
gi|223537854|gb|EEF39470.1| beta-galactosidase, putative [Ricinus communis]
Length = 803
Score = 621 bits (1602), Expect = e-175, Method: Compositional matrix adjust.
Identities = 351/830 (42%), Positives = 453/830 (54%), Gaps = 119/830 (14%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
+++D R++ IDG+RK+L+S +IHYPRS PGMWP+L++ AKEGG+D IETYVFWN HEP
Sbjct: 29 ITYDSRSLIIDGQRKLLISAAIHYPRSVPGMWPELVQTAKEGGVDVIETYVFWNGHEPSP 88
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y F DL++F+K +Q G+Y+ILRIGP+V AEWN+GG PVWLH +PG RT N
Sbjct: 89 SNYYFEKRYDLVKFVKIVQQAGMYLILRIGPFVAAEWNFGGVPVWLHYVPGTV-FRTDNY 147
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ F T IV++ KKEKLFASQGGPIILAQ+ENEYG S YG+ GK Y W A+M
Sbjct: 148 NFKYHMQKFMTYIVNLMKKEKLFASQGGPIILAQVENEYGFYESAYGEGGKRYAMWAAQM 207
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A S +IGVPWIMCQ+ DAP+ + F P P+ PKIWTENW GWF+++G +
Sbjct: 208 AVSQNIGVPWIMCQQFDAPNSVINTCNSFYCDQFKPIFPDKPKIWTENWPGWFQTFGAPN 267
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R AED+AF+VARFFQ GG+ QNYYMYHGGTNFGRTSGGP++TTSYDY+APIDEYG
Sbjct: 268 PHRPAEDIAFSVARFFQKGGSVQNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGLAR 327
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGS---------------------- 355
PKW HL+ELHK +K E TL N G S
Sbjct: 328 LPKWAHLKELHKAIKLCELTLLNSVPVNLSLGPSQEADVYAEESGACAAFLANMDEKNDK 387
Query: 356 -------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQA-GNDQAPLQWKWRP 407
SY+LPAWSVSILPDCK FNTAKVN+QT++ P+ +D+ KW
Sbjct: 388 TVVFRNMSYHLPAWSVSILPDCKNVVFNTAKVNSQTSIVEMVPDDLRSSDKGTKALKWE- 446
Query: 408 EMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINS 466
+ + + G N +D +T D +DYLWY T+ + +++ L L I S
Sbjct: 447 TFVENAGIWGTSDLVKNGFVDHINTTKDTTDYLWYTTSIFVGENEEFLKKGGRPVLLIES 506
Query: 467 SGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMV 526
G LHA+VN + S F++PV L GKN I+LLS TVGLQN GS ++ V
Sbjct: 507 KGHALHAFVNQELQGTASGNGTHSPFKFKKPVSLVAGKNDIALLSMTVGLQNAGSFYEWV 566
Query: 527 PNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVP 586
G+ V G DLS+ WTYK+GL G + YN A + ++ P
Sbjct: 567 GAGLTS----VKMKGFNNGTIDLSTFNWTYKIGLQG-EKLGMYNGIAVETVNWVATSKPP 621
Query: 587 LNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCD 646
++ +TWYK A +LN + W + W Y
Sbjct: 622 KDQPLTWYKRQIHARQ-----MLN-------WMWRINSEMILVWTRY------------- 656
Query: 647 YRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTAC 706
HVPRSW K N LV+FEE GG+P++I F + C
Sbjct: 657 ----------------------HVPRSWFKPSGNILVIFEEKGGDPTKITFSRRKISGVC 694
Query: 707 GQAHENKTM--------------------ELTC-HGRRISEIKYASFGDPQGACGAFKKG 745
E+ M L C IS IK+ASFG P GACG++ +G
Sbjct: 695 ALVAEDYPMANLESLENAGSGSSNYKASVHLKCPKSSIISAIKFASFGSPAGACGSYSEG 754
Query: 746 SCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
C + + ++EK C+ K C +E +E N C G +K+L VEA+C
Sbjct: 755 ECH-DPKSISVVEKVCLNKNQCVVEVTEENFSKGLC-PGKMKKLAVEAVC 802
>gi|358348424|ref|XP_003638247.1| hypothetical protein MTR_122s1070, partial [Medicago truncatula]
gi|355504182|gb|AES85385.1| hypothetical protein MTR_122s1070, partial [Medicago truncatula]
Length = 771
Score = 621 bits (1601), Expect = e-175, Method: Compositional matrix adjust.
Identities = 361/828 (43%), Positives = 454/828 (54%), Gaps = 135/828 (16%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
L+S SIHYPRS P MWP LI+ AKEGG+D IETYVFWN HE Y F G DL++F K
Sbjct: 1 LISASIHYPRSVP-MWPALIQTAKEGGIDVIETYVFWNGHELSPGNYYFGGRFDLVQFAK 59
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGG---------------------------------FP 131
+QD G+Y+ILRIGP+V AEWN+GG P
Sbjct: 60 VVQDAGMYLILRIGPFVAAEWNFGGEKNGVLICEDGEERGYRERADKNNQGNSRVLCGVP 119
Query: 132 VWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVM 191
VWLH +PG RT N+ FM+ M+ FTT IV++ KKEKLFASQGGPIIL+QIENEYG
Sbjct: 120 VWLHYIPGTV-FRTYNQPFMHHMEKFTTYIVNLMKKEKLFASQGGPIILSQIENEYGYYE 178
Query: 192 SDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPK 240
+ Y + GK Y W AKMA S + VPWIMCQ+ DAP P+ FTP +P PK
Sbjct: 179 NYYKEDGKKYALWAAKMAVSQNTSVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPKRPK 238
Query: 241 IWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL 300
+WTENW GWFK++GG+DP R ED+AF+VARFFQ GG+ NYYMYHGGTNFGRT+GGP++
Sbjct: 239 MWTENWPGWFKTFGGRDPHRPVEDVAFSVARFFQKGGSLNNYYMYHGGTNFGRTAGGPFI 298
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSG------ 354
TTSYDYDAPIDEYG PKWGHL+ELHK +K E L YG N G SV
Sbjct: 299 TTSYDYDAPIDEYGLPRLPKWGHLKELHKAIKLCEHVLLYGKSVNISLGPSVEADIYTDS 358
Query: 355 -----------------------SSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRP 391
+SY+LPAWSVSILPDCK FNTAKV++ TN+ P
Sbjct: 359 SGACAAFISNVDDKNDKKVVFRNASYHLPAWSVSILPDCKNVVFNTAKVSSPTNIVAMIP 418
Query: 392 ---NQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADL 447
Q+ Q L+W E + GK F N +D +T D +DYLW+ T+ +
Sbjct: 419 EHLQQSDKGQKTLKWDVFKENPG---IWGKADFVKNGFVDHINTTKDTTDYLWHTTSILI 475
Query: 448 KDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQI 507
++ L S L I S G LHA+VN Y + S F+ P+ L GKN+I
Sbjct: 476 DANEEFLKKGSKPALLIESKGHTLHAFVNQKYQGTGTGNGSHSAFTFKNPISLRAGKNEI 535
Query: 508 SLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKK 567
++LS TVGLQ G +D + G+ V ++G + TI DLSS+ W YK+G+ G +
Sbjct: 536 AILSLTVGLQTAGPFYDFIGAGVTS-VKIIG-LNNRTI--DLSSNAWAYKIGVLG-EHLS 590
Query: 568 FYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLG 627
Y + NS + S+ P + +TWYK +AP ++PV L++ MGKG AW+NG +G
Sbjct: 591 IYQGEGMNSVKWTSTSEPPKGQALTWYKAIVDAPSGDEPVGLDMLYMGKGLAWLNGEEIG 650
Query: 628 RYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEE 687
RYWP + + CDYRG + DKC CG PSQ WYHVPRSW K N LV+FEE
Sbjct: 651 RYWPRISEFKKEDCVQECDYRGKFNPDKCDTGCGEPSQKWYHVPRSWFKPSGNVLVIFEE 710
Query: 688 FGGNPSQINFQTVVVGTACGQAHENKTMELTCHGRRISEIKYASFGDPQGACGAFKKGSC 747
GG+P++I F CH Y+S
Sbjct: 711 KGGDPTKITFVR------------------HCHN------PYSSI--------------- 731
Query: 748 EAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
++EK CV K I+ E N C G +L VEA+C
Sbjct: 732 --------VVEKVCVNKNDRVIKVIEDNFKTNLC-HGLSMKLAVEAIC 770
>gi|125581329|gb|EAZ22260.1| hypothetical protein OsJ_05915 [Oryza sativa Japonica Group]
Length = 754
Score = 620 bits (1600), Expect = e-175, Method: Compositional matrix adjust.
Identities = 340/704 (48%), Positives = 419/704 (59%), Gaps = 64/704 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D R++ I+G R+ILLSGSIHYPRSTP MWP LI+KAK+GGLD I+TYVFWN HEP++
Sbjct: 38 VSYDRRSLVINGRRRILLSGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHEPVQ 97
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QY F+ DL+RF+K ++ GLYV LRIGPYVCAEWN+GGFPVWL +PG+ RT N
Sbjct: 98 GQYYFSDRYDLVRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGV-SFRTDNG 156
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F EMQ F IV M K E LF QGGPII++Q+ENE+G + S G K Y NW AKM
Sbjct: 157 PFKAEMQKFVEKIVSMMKSEGLFEWQGGPIIMSQVENEFGPMESVGGSGAKPYANWAAKM 216
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A + GVPW+MC++ DAP P+ F+PN P +WTE WTGWF S+GG
Sbjct: 217 AVGTNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKNYKPSMWTEAWTGWFTSFGGGV 276
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R EDLAFAVARF Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAPIDE+G L
Sbjct: 277 PHRPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLLR 336
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVT-----------------------------NTDY 348
QPKWGHLR+LH+ +K E L + T NT
Sbjct: 337 QPKWGHLRDLHRAIKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNTAV 396
Query: 349 GNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
+G YNLPAWS+SILPDCKT FNTA V T + P W+ E
Sbjct: 397 KVRFNGQQYNLPAWSISILPDCKTAVFNTATVKEPTLMPKMNP------VVRFAWQSYSE 450
Query: 409 MINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
N F + L++Q S T D SDYLWY T ++ +D L + L + S+
Sbjct: 451 DTNSL---SDSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTND--LRSGQSPQLTVYSA 505
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G + +VNG S + Y + VK+ +G N+IS+LS+ VGL N G+ F+
Sbjct: 506 GHSMQVFVNGKSYGSVYGGYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFENWN 565
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
G+ GPV L G KDLS KWTY+VGL G ++ E G PL
Sbjct: 566 VGVLGPVTLSSLNGG---TKDLSHQKWTYQVGLKGETLGLQTVTGSSAVEWGGPGGYQPL 622
Query: 588 NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDY 647
TW+K F AP NDPV L++ MGKG WVNG+++GRYW +Y A GC C Y
Sbjct: 623 ----TWHKAFFNAPAGNDPVALDMGSMGKGQLWVNGHHVGRYW-SYKA-SGGCG--GCSY 674
Query: 648 RGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGN 691
G Y DKC NCG+ SQ WYHVPRSW+K G N LV+ EE+G N
Sbjct: 675 AGTYHEDKCRSNCGDLSQRWYHVPRSWLKPGGNLLVVLEEYGAN 718
>gi|186461094|gb|ACC78255.1| beta-galactosidase [Carica papaya]
Length = 721
Score = 620 bits (1600), Expect = e-175, Method: Compositional matrix adjust.
Identities = 339/733 (46%), Positives = 441/733 (60%), Gaps = 59/733 (8%)
Query: 12 LLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGG 71
L+ +L + S+ VS+D +AI I+G R+IL+SGSIHYPRSTP MWPDLI+ AKEGG
Sbjct: 6 LVLFLLFCSWLWSVEATVSYDHKAIIINGRRRILISGSIHYPRSTPQMWPDLIQNAKEGG 65
Query: 72 LDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFP 131
LD I+TYVFWN HEP Y F DL++FIK + GLYV LRIGPY+C EWN+GGFP
Sbjct: 66 LDVIQTYVFWNGHEPSPGNYYFEDRYDLVKFIKLVHQAGLYVHLRIGPYICGEWNFGGFP 125
Query: 132 VWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVM 191
VWL +PGI + RT N F +MQ FT IV+M K EKLF QGGPII++QIENEYG +
Sbjct: 126 VWLKYVPGI-QFRTDNGPFKAQMQKFTEKIVNMMKAEKLFEPQGGPIIMSQIENEYGPIE 184
Query: 192 SDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPK 240
+ G GK+Y W A+MA L GVPWIMC++ DAP P+ F PN PK
Sbjct: 185 WEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQEDAPDPIIDTCNGFYCENFMPNANYKPK 244
Query: 241 IWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL 300
++TE WTGW+ +GG P R AED+A++VARF Q G+F NYYMYHGGTNFGRT+GGP++
Sbjct: 245 MFTEAWTGWYTEFGGPVPYRPAEDMAYSVARFIQNRGSFINYYMYHGGTNFGRTAGGPFI 304
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN---------- 350
TSYDYDAP+DEYG +PKWGHLR+LHK +K E +L + T G+
Sbjct: 305 ATSYDYDAPLDEYGLRREPKWGHLRDLHKTIKLCEPSLVSVDPKVTSLGSNQEAHVFWTK 364
Query: 351 ------------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV-KVKRP 391
+ Y+LP WSVSILPDCKT FNTAKV +Q ++ K+
Sbjct: 365 TSCAAFLANYDLKYSVRVTFQNLPYDLPPWSVSILPDCKTVVFNTAKVVSQGSLAKMIAV 424
Query: 392 NQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDD 450
N A + Q+ + P D V F + L +Q S T D +DYLWYMT+ + D
Sbjct: 425 NSAFSWQSYNEET--PSANYDAV------FTKDGLWEQISVTRDATDYLWYMTDVTIGPD 476
Query: 451 DPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLL 510
+ L + L + S+G LH +VNG + + + F VKL G N++SLL
Sbjct: 477 EAFLKNGQDPILTVMSAGHALHVFVNGQLSGTVYGQLENPKLAFSGKVKLRAGVNKVSLL 536
Query: 511 SATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYN 570
S VGL N G F+ G+ GPV L G + D+S KW+YK+GL G + +
Sbjct: 537 SIAVGLPNVGLHFETWNAGVLGPVTL---KGVNSGTWDMSKWKWSYKIGLKG--EALSLH 591
Query: 571 AKAANSERGWSSKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRY 629
+ +S W ++ R+ + WYKTTF AP+ NDP+ L++ MGKG W+NG ++GR+
Sbjct: 592 TVSGSSSVEWVEGSLLAQRQPLIWYKTTFNAPVGNDPLALDMNSMGKGQIWINGQSIGRH 651
Query: 630 WPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
WP Y A S +C+Y G Y KC NCG SQ WYHVPRSW+ N LV+FEE+G
Sbjct: 652 WPGYKARG---SCGACNYAGIYDEKKCHSNCGKASQRWYHVPRSWLNPTANLLVVFEEWG 708
Query: 690 GNPSQINFQTVVV 702
G+P++I+ VV
Sbjct: 709 GDPTKISLVKRVV 721
>gi|18148449|dbj|BAB83260.1| beta-D-galactosidase [Persea americana]
Length = 766
Score = 620 bits (1599), Expect = e-175, Method: Compositional matrix adjust.
Identities = 337/727 (46%), Positives = 434/727 (59%), Gaps = 64/727 (8%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D +AI I+G+R+IL+SGSIHYPRSTP MWPDLI+KAKEGGLD I+TYVFW+ HEP
Sbjct: 37 VTYDRKAIVINGQRRILISGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWDGHEPSP 96
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y F G DL++FIK ++ GLYV LRIGPY+CAEWN GGFPVWL +PGI RT N+
Sbjct: 97 GKYYFEGRYDLVKFIKLVKQAGLYVNLRIGPYICAEWNLGGFPVWLKYIPGI-SFRTDNE 155
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F M FT IV+M K E LF QGGPII++QIENEYG V + G GK Y W A M
Sbjct: 156 PFKRYMAGFTKKIVEMMKAESLFEPQGGPIIMSQIENEYGPVEWEIGAIGKVYTRWAASM 215
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A +L+ GVPWIMC++ + P P+ F PN P +WTE WTGWF ++GG
Sbjct: 216 AVNLNTGVPWIMCKQDEVPDPIINTCNGFYCDWFKPNKDYKPIMWTELWTGWFTAFGGPV 275
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R ED+A+AV +F Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAP+DEYG
Sbjct: 276 PYRPVEDVAYAVVKFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLKR 335
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNS-------------------------- 351
+PKWGHLR+LH+ +K E L + T T G+S
Sbjct: 336 EPKWGHLRDLHRAIKMCEPALVSNDPTVTKIGDSQEAHVFKFESGACSAFLENKDETNFV 395
Query: 352 ---VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
G Y LP WS+SILPDC +NT +V TQT++ A N++ W E
Sbjct: 396 KVTFQGMQYELPPWSISILPDCVNVVYNTGRVGTQTSMMTML--SASNNE--FSWASYNE 451
Query: 409 MINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
D + + L +Q S T D +DYL Y T+ + ++ L L +NS+
Sbjct: 452 ---DTASYNEESMTIEGLSEQISITKDSTDYLRYTTDVTIGQNEGFLKNGEYPVLTVNSA 508
Query: 468 GQVLHAYVNGNYVDSQWTKYGASND---LFERPVKLTRGKNQISLLSATVGLQNYGSKFD 524
G L +VNG T YG+ ND F VKL G N+ISLLS+ VGL N G+ F+
Sbjct: 509 GHALQVFVNGQLSG---TAYGSVNDPRLTFSGKVKLWAGNNKISLLSSAVGLPNVGTHFE 565
Query: 525 MVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
G+ GPV L G + +DLS KW+YKVG+ G + +++ E G S+
Sbjct: 566 TWNYGVLGPVTLNGLNEGK---RDLSLQKWSYKVGVIGEALQLHSPTGSSSVEWGSSTSK 622
Query: 585 VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTES 644
+ + TWYKTTF AP NDP+ L++ MGKG W+NG ++GRYWP Y A CS +
Sbjct: 623 I---QPFTWYKTTFNAPGGNDPLALDMNTMGKGQIWINGQSIGRYWPAYKANGK-CS--A 676
Query: 645 CDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGT 704
C Y G Y KC +NCG SQ WYH+PRSW+ N LV+FEE+GG+P+ I +G+
Sbjct: 677 CHYTGWYDEKKCGFNCGEASQRWYHIPRSWLNPTGNLLVVFEEWGGDPTGITLVRRTIGS 736
Query: 705 ACGQAHE 711
AC +E
Sbjct: 737 ACAYINE 743
>gi|3869280|gb|AAC77377.1| beta-galactosidase precursor [Carica papaya]
Length = 721
Score = 618 bits (1593), Expect = e-174, Method: Compositional matrix adjust.
Identities = 335/733 (45%), Positives = 436/733 (59%), Gaps = 59/733 (8%)
Query: 12 LLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGG 71
L+ +L + S+ VS+D +AI I+G R+IL+SGSIHYPRSTP MWPDLI+ AKEGG
Sbjct: 6 LVLFLLFCSWLWSVEATVSYDHKAIIINGRRRILISGSIHYPRSTPQMWPDLIQNAKEGG 65
Query: 72 LDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFP 131
LD I+TYVFWN HEP Y F DL++FIK + GLYV LRI PY+C EWN+GGFP
Sbjct: 66 LDVIQTYVFWNGHEPSPGNYYFEDRYDLVKFIKLVHQAGLYVHLRISPYICGEWNFGGFP 125
Query: 132 VWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVM 191
VWL +PGI + RT N F +MQ FT IV+M K EKLF QGGPII++QIENEYG +
Sbjct: 126 VWLKYVPGI-QFRTDNGPFKAQMQKFTEKIVNMMKAEKLFEPQGGPIIMSQIENEYGPIE 184
Query: 192 SDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPK 240
+ G GK+Y W A+MA L GVPWIMC++ DAP P+ F PN PK
Sbjct: 185 WEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQEDAPDPIIDTCNGFYCENFMPNANYKPK 244
Query: 241 IWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL 300
++TE WTGW+ +GG P R AED+A++VARF Q G+F NYYMYHGGTNFGRT+GGP++
Sbjct: 245 MFTEAWTGWYTEFGGPVPYRPAEDMAYSVARFIQNRGSFINYYMYHGGTNFGRTAGGPFI 304
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN---------- 350
TSYDYDAP+DEYG +PKWGHLR+LHK +K E +L + T G+
Sbjct: 305 ATSYDYDAPLDEYGLRREPKWGHLRDLHKTIKLCEPSLVSVDPKVTSLGSNQEAHVFWTK 364
Query: 351 ------------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV-KVKRP 391
+ Y+LP WSVSILPDCKT FNTAKV +Q ++ K+
Sbjct: 365 TSCAAFLANYDLKYSVRVTFQNLPYDLPPWSVSILPDCKTVVFNTAKVVSQGSLAKMIAV 424
Query: 392 NQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDD 450
N A + W+ F + L +Q S T D +DYLWYMT+ + D
Sbjct: 425 NSA--------FSWQSYNEETPSANYDAVFTKDGLWEQISVTRDATDYLWYMTDVTIGPD 476
Query: 451 DPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLL 510
+ L + L + S+G LH +VNG + + + F VKL G N++SLL
Sbjct: 477 EAFLKNGQDPILTVMSAGHALHVFVNGQLSGTVYGQLENPKLAFSGKVKLRAGVNKVSLL 536
Query: 511 SATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYN 570
S VGL N G F+ G+ GPV L G + D+S KW+YK+GL G + +
Sbjct: 537 SIAVGLPNVGLHFETWNAGVLGPVTL---KGVNSGTWDMSKWKWSYKIGLKG--EALSLH 591
Query: 571 AKAANSERGWSSKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRY 629
+ +S W ++ R+ + WYKTTF AP+ NDP+ L++ MGKG W+NG ++GR+
Sbjct: 592 TVSGSSSVEWVEGSLLAQRQPLIWYKTTFNAPVGNDPLALDMNSMGKGQIWINGQSIGRH 651
Query: 630 WPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
WP Y A S +C+Y G Y KC NCG SQ WYHVPRSW+ N LV+FEE+G
Sbjct: 652 WPGYKARG---SCGACNYAGIYDEKKCHSNCGKASQRWYHVPRSWLNPTANLLVVFEEWG 708
Query: 690 GNPSQINFQTVVV 702
G+P++I+ VV
Sbjct: 709 GDPTKISLVKRVV 721
>gi|4538943|emb|CAB39679.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|7269465|emb|CAB79469.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 729
Score = 618 bits (1593), Expect = e-174, Method: Compositional matrix adjust.
Identities = 332/732 (45%), Positives = 442/732 (60%), Gaps = 64/732 (8%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
ILL ++ + S+ V++D +A+ I+G+R+ILLSGSIHYPRSTP MWPDLI+KAK+G
Sbjct: 11 ILLGILCCSSLICSVKAIVTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKDG 70
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GLD I+TYVFWN HEP QY F DL++FIK +Q GLYV LRIGPYVCAEWN+GGF
Sbjct: 71 GLDVIQTYVFWNGHEPSPGQYYFEDRYDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGF 130
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
PVWL +PG+ RT N+ F MQ FT IV M K+EKLF +QGGPIIL+QIENEYG +
Sbjct: 131 PVWLKYVPGM-VFRTDNEPFKAAMQKFTEKIVRMMKEEKLFETQGGPIILSQIENEYGPI 189
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSP 239
+ G GK+Y W A+MA L GVPWIMC++ DAP+ + F PN+ N P
Sbjct: 190 EWEIGAPGKAYTKWVAEMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNSDNKP 249
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
K+WTENWTGWF +GG P R AED+A +VARF Q GG+F NYYMYHGGTNF RT+ G +
Sbjct: 250 KMWTENWTGWFTEFGGAVPYRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDRTA-GEF 308
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNS-------- 351
+ TSYDYDAP+DEYG +PK+ HL+ LHK++K E L + T T G+
Sbjct: 309 IATSYDYDAPLDEYGLPREPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQEAHVFKS 368
Query: 352 --------------------VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQT---NVKV 388
GS+Y+LP WSVSILPDCKTE +NTAKV +T ++K+
Sbjct: 369 KSSCAAFLSNYNTSSAARVLFGGSTYDLPPWSVSILPDCKTEYYNTAKVQVRTSSIHMKM 428
Query: 389 KRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADL 447
N P W E I G F+ + L++Q S T D +DY WY+T+ +
Sbjct: 429 VPTN------TPFSWGSYNEEIPS--ANDNGTFSQDGLVEQISITRDKTDYFWYLTDITI 480
Query: 448 KDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQI 507
D+ L+G + L I S+G LH +VNG + + F + +KL G N++
Sbjct: 481 SPDEKFLTGEDPL-LTIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVNKL 539
Query: 508 SLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYK-VGLYGLDDK 566
+LLS GL N G ++ G+ GPV L G + D++ KW+YK +G G +
Sbjct: 540 ALLSTAAGLPNVGVHYETWNTGVLGPVTL---NGVNSGTWDMTKWKWSYKQIGTKG--EA 594
Query: 567 KFYNAKAANSERGWSSKNVPLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYN 625
+ A +S W ++ ++ +TWYK+TF++P N+P+ L++ MGKG W+NG N
Sbjct: 595 LSVHTLAGSSTVEWKEGSLVAKKQPLTWYKSTFDSPTGNEPLALDMNTMGKGQMWINGQN 654
Query: 626 LGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLF 685
+GR+WP Y A E C Y G + KC NCG SQ WYHVPRSW+K N +++
Sbjct: 655 IGRHWPAYTAR---GKCERCSYAGTFTEKKCLSNCGEASQRWYHVPRSWLKPTNNLVIVL 711
Query: 686 EEFGGNPSQINF 697
EE+GG P+ I+
Sbjct: 712 EEWGGEPNGISL 723
>gi|297799386|ref|XP_002867577.1| beta-galactosidase 12 [Arabidopsis lyrata subsp. lyrata]
gi|297313413|gb|EFH43836.1| beta-galactosidase 12 [Arabidopsis lyrata subsp. lyrata]
Length = 728
Score = 618 bits (1593), Expect = e-174, Method: Compositional matrix adjust.
Identities = 331/730 (45%), Positives = 437/730 (59%), Gaps = 61/730 (8%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
ILL ++ + S+ V++D +A+ I+G+R+ILLSGSIHYPRSTP MWPDLI+KAK+G
Sbjct: 11 ILLGILWCSSLIYSVKAMVTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKDG 70
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GLD I+TYVFWN HEP QY F DL++FIK +Q GLYV LRIGPYVCAEWN+GGF
Sbjct: 71 GLDVIQTYVFWNGHEPSPGQYYFEDRYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGF 130
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
PVWL +P + RT N+ F MQ FT IV M K+EKLF +QGGPIIL+QIENEYG +
Sbjct: 131 PVWLKYVPDM-VFRTDNEPFKAAMQKFTEKIVGMMKEEKLFETQGGPIILSQIENEYGPI 189
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSP 239
+ G GK+Y W AKMA L GVPWIMC++ DAP+ + F PN+ P
Sbjct: 190 EWEIGAPGKAYTKWVAKMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNSDKKP 249
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
K+WTENWTGWF +GG P R AED+A +VARF Q GG+F NYYMYHGGTNF RT+ G +
Sbjct: 250 KMWTENWTGWFTEFGGAVPYRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDRTA-GEF 308
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------- 350
+ TSYDYDAP+DEYG +PK+ HL+ LHK++K E L + T T G+
Sbjct: 309 IATSYDYDAPLDEYGLPREPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQEAQVFKS 368
Query: 351 -------------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQT---NVKV 388
S GS+Y+LP WSVSILPDCKTE +NTAKV +T ++K+
Sbjct: 369 QSSCAAFLSNYNTSSAARVSFGGSTYDLPPWSVSILPDCKTEYYNTAKVQVRTSSIHMKM 428
Query: 389 KRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADL 447
N W E I G F+ + L++Q S T D +DY WY+T+ +
Sbjct: 429 VPTNTL------FSWGSYNEEIPS--ANDNGTFSQDGLVEQISITRDKTDYFWYLTDITI 480
Query: 448 KDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQI 507
D+ L+G + L I S+G LH +VNG + + F + +KL G N++
Sbjct: 481 SPDEKFLTGEDPL-LNIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVNKL 539
Query: 508 SLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKK 567
+LLS GL N G ++ G+ GPV L G + D+S KW+YK+G G +
Sbjct: 540 ALLSIAAGLPNVGVHYETWNTGVLGPVTL---KGVNSGTWDMSQWKWSYKIGTKG-EALS 595
Query: 568 FYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLG 627
+ +++ V + +TWYK+TF+ P N+P+ L++ MGKG W+NG N+G
Sbjct: 596 IHTVTGSSTVEWKQGSLVATKQPLTWYKSTFDTPAGNEPLALDMNTMGKGQTWINGQNIG 655
Query: 628 RYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEE 687
R+WP Y A E C Y G + +KC NCG SQ WYHVPRSW+K N +V+ EE
Sbjct: 656 RHWPAYTAR---GKCERCSYAGTFTENKCLSNCGEASQRWYHVPRSWLKPTNNLVVVLEE 712
Query: 688 FGGNPSQINF 697
+GG P+ I+
Sbjct: 713 WGGEPNGISL 722
>gi|449452747|ref|XP_004144120.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 782
Score = 617 bits (1592), Expect = e-174, Method: Compositional matrix adjust.
Identities = 336/716 (46%), Positives = 430/716 (60%), Gaps = 60/716 (8%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
S + V++D +AI I+G+R+IL+SGSIHYPRSTP MWPDLI+KAK+GGLD IETYVFWN
Sbjct: 79 SASRSVTYDHKAIIINGQRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDIIETYVFWNG 138
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
HEP +Y F DL+RFIK +Q GLYV LRIGPYVCAEWNYGGFP+WL +PGI
Sbjct: 139 HEPSPGKYYFEERYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYGGFPLWLKFVPGI-AF 197
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYIN 203
RT N F MQ F IVDM K EKLF +QGGPIIL+QIENEYG V + G GKSY
Sbjct: 198 RTDNAPFKAAMQKFVYKIVDMMKWEKLFHTQGGPIILSQIENEYGPVEWEIGAPGKSYTK 257
Query: 204 WCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKS 252
W A+MA L GVPW+MC++ DAP P+ F PN PKIWTENW+GW+ +
Sbjct: 258 WAAQMAVGLKTGVPWVMCKQEDAPDPLIDTCNGFYCENFKPNQIYKPKIWTENWSGWYTA 317
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDE 312
+GG P R ED+AF+VARF Q GG+ NYYMYHGGTNFGRTS G ++TTSYD+DAPIDE
Sbjct: 318 FGGPTPYRPPEDVAFSVARFIQNGGSLVNYYMYHGGTNFGRTS-GLFVTTSYDFDAPIDE 376
Query: 313 YGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG--------NSVSGS--------- 355
YG L +PKWGHLR+LHK +K E L + T+T G S SG+
Sbjct: 377 YGLLREPKWGHLRDLHKAIKLCEPALVSADPTSTWLGKNQEARVFKSSSGACAAFLANYD 436
Query: 356 ------------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQW 403
Y+LP WS+SILPDCKT FNT + + VK + W
Sbjct: 437 TSAFVRVNFWNHPYDLPPWSISILPDCKTVTFNTGSL----QIGVKSYEAKMTPISSFWW 492
Query: 404 -KWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMT 461
++ E + + + + L++Q S T D +DYLWY+ + + + L
Sbjct: 493 LSYKEEPASAY---AQDTTTKDGLVEQVSVTWDTTDYLWYILSIRIDSTEGFLKSGQWPL 549
Query: 462 LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGS 521
L +NS+G +LH ++NG S + F + V L +G N++S+LS TVGL N G
Sbjct: 550 LTVNSAGHILHVFINGQLSGSVYGSLEDPRITFSKYVNLKQGVNKLSMLSVTVGLPNVGL 609
Query: 522 KFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWS 581
FD G+ GPV L G +D+S +KW+YKVGL G + Y+ K +NS + W
Sbjct: 610 HFDTWNAGVLGPVTL---KGLNEGTRDMSKYKWSYKVGLRG-EILNLYSVKGSNSVQ-W- 663
Query: 582 SKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCS 641
K + +TWYKTTF P N+P+ L++ M KG WVNG ++GRY+P Y+A
Sbjct: 664 MKGSFQKQPLTWYKTTFNTPAGNEPLALDMSSMSKGQIWVNGRSIGRYFPGYIARGK--- 720
Query: 642 TESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
C Y G + KC +NCG PSQ WYH+PR W+ N L++ EE GGNP I+
Sbjct: 721 CNKCSYTGFFTEKKCLWNCGGPSQKWYHIPRDWLSPNGNLLIILEEIGGNPQGISL 776
>gi|449527779|ref|XP_004170887.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cucumis
sativus]
Length = 716
Score = 617 bits (1592), Expect = e-174, Method: Compositional matrix adjust.
Identities = 334/729 (45%), Positives = 434/729 (59%), Gaps = 59/729 (8%)
Query: 9 RAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAK 68
+ +LL L L T ++ V++D +AI I+ +R+IL+SGSIHYPRSTP MWPDLI+KAK
Sbjct: 3 KTVLLFLSLLTWVGSTIG-AVTYDEKAIIINDQRRILISGSIHYPRSTPQMWPDLIQKAK 61
Query: 69 EGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYG 128
+GGLD IETYVFWN HEP +Y F DL+ FIK +Q GLYV LRIGPYVCAEWNYG
Sbjct: 62 DGGLDIIETYVFWNGHEPSEGKYYFEERYDLVGFIKLVQKAGLYVHLRIGPYVCAEWNYG 121
Query: 129 GFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYG 188
GFP+WL +PGI RT N+ F MQ F T IVDM K EKL+ +QGGPIIL+QIENEYG
Sbjct: 122 GFPIWLKFVPGI-AFRTDNEPFKAAMQKFVTKIVDMMKLEKLYHTQGGPIILSQIENEYG 180
Query: 189 NVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPN 237
V G GKSY W A+MA L GVPW+MC++ DAP P+ F PN
Sbjct: 181 PVEWQIGAPGKSYTKWFAQMAVDLKTGVPWVMCKQEDAPDPLIDTCNGFYCENFKPNQIY 240
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
PKIWTENW+GW+ ++GG P R ED+AF+VARF Q G+ NYY+YHGGTNFGRTS G
Sbjct: 241 KPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQNNGSLVNYYVYHGGTNFGRTS-G 299
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNT----------- 346
++ TSYD+DAPIDEYG + +PKWGHLR+LHK +KS E L + T T
Sbjct: 300 LFIATSYDFDAPIDEYGLIREPKWGHLRDLHKAIKSCEPALVSADPTITWLGKNQEARVF 359
Query: 347 -----------DYGNSVS------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVK 389
+Y S S + Y+LP WS+SILPDC T FNTA+V ++
Sbjct: 360 KSSSACAAFLANYDTSASVKVNFWNNPYDLPPWSISILPDCXTVTFNTAQVGVKSYQAKM 419
Query: 390 RPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLK 448
P + + W K L++Q S T D +DYLWYM + +
Sbjct: 420 MPISS--------FGWLSYKEEPASAYAKDTTTKAGLVEQVSITWDTTDYLWYMQDISID 471
Query: 449 DDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQIS 508
+ L L +NS+G +LH ++NG S + F + V L +G N++S
Sbjct: 472 STEGFLKSGKWPLLSVNSAGHLLHVFINGQLSGSVYGSLEDPAITFSKNVDLKQGVNKLS 531
Query: 509 LLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKF 568
+LS TVGL N G FD G+ GPV L G +D+S +KW+YKVGL G +
Sbjct: 532 MLSVTVGLPNVGLHFDTWNAGVLGPVTLEGL---NEGTRDMSKYKWSYKVGLSG-ESLNL 587
Query: 569 YNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGR 628
Y+ K +NS + W+ ++ + +TWYKTTF+ P N+P+ L++ M KG W+NG ++GR
Sbjct: 588 YSDKGSNSVQ-WTKGSLTQKQPLTWYKTTFKTPAGNEPLGLDMSSMSKGQIWINGQSIGR 646
Query: 629 YWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEF 688
Y+P Y+A + C Y G + KC NCG PSQ WYH+PR W+ N LV+FEE
Sbjct: 647 YFPGYIAN---GKCDKCSYAGLFTEKKCLGNCGEPSQKWYHIPRDWLSPSDNLLVIFEEI 703
Query: 689 GGNPSQINF 697
GG+P I+
Sbjct: 704 GGSPDGISL 712
>gi|15241969|ref|NP_200498.1| beta-galactosidase 4 [Arabidopsis thaliana]
gi|75265636|sp|Q9SCV8.1|BGAL4_ARATH RecName: Full=Beta-galactosidase 4; Short=Lactase 4; Flags:
Precursor
gi|6686880|emb|CAB64740.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|8809655|dbj|BAA97206.1| beta-galactosidase [Arabidopsis thaliana]
gi|332009434|gb|AED96817.1| beta-galactosidase 4 [Arabidopsis thaliana]
Length = 724
Score = 617 bits (1590), Expect = e-173, Method: Compositional matrix adjust.
Identities = 336/728 (46%), Positives = 432/728 (59%), Gaps = 61/728 (8%)
Query: 10 AILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
AIL CL L + S VS+D +A+ I+G+R+ILLSGSIHYPRSTP MWP LI+KAKE
Sbjct: 14 AILCCLSLSCIVKAS----VSYDRKAVIINGQRRILLSGSIHYPRSTPEMWPGLIQKAKE 69
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
GGLD IETYVFWN HEP QY F DL++FIK + GLYV LRIGPYVCAEWN+GG
Sbjct: 70 GGLDVIETYVFWNGHEPSPGQYYFGDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGG 129
Query: 130 FPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGN 189
FPVWL +PG+ RT N+ F M+ FT IV M K EKLF +QGGPIILAQIENEYG
Sbjct: 130 FPVWLKFVPGMA-FRTDNEPFKAAMKKFTEKIVWMMKAEKLFQTQGGPIILAQIENEYGP 188
Query: 190 VMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNS 238
V + G GK+Y W A+MA L GVPWIMC++ DAP P+ F PN+ N
Sbjct: 189 VEWEIGAPGKAYTKWVAQMALGLSTGVPWIMCKQEDAPGPIIDTCNGYYCEDFKPNSINK 248
Query: 239 PKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGP 298
PK+WTENWTGW+ +GG P R ED+A++VARF Q GG+ NYYMYHGGTNF RT+G
Sbjct: 249 PKMWTENWTGWYTDFGGAVPYRPVEDIAYSVARFIQKGGSLVNYYMYHGGTNFDRTAG-E 307
Query: 299 YLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG--------- 349
++ +SYDYDAP+DEYG +PK+ HL+ LHK +K E L + T T G
Sbjct: 308 FMASSYDYDAPLDEYGLPREPKYSHLKALHKAIKLSEPALLSADATVTSLGAKQEAYVFW 367
Query: 350 --------------NSVS-----GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR 390
NS + G Y+LP WSVSILPDCKTE +NTAKVN + +
Sbjct: 368 SKSSCAAFLSNKDENSAARVLFRGFPYDLPPWSVSILPDCKTEVYNTAKVNAPSVHRNMV 427
Query: 391 PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKD 449
P ++ W G FA N L++Q S T D SDY WY+T+ +
Sbjct: 428 PTGT-------KFSWGSFNEATPTANEAGTFARNGLVEQISMTWDKSDYFWYITDITIGS 480
Query: 450 DDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISL 509
+ L + L + S+G LH +VNG + + F + +KL G N+I+L
Sbjct: 481 GETFLKTGDSPLLTVMSAGHALHVFVNGQLSGTAYGGLDHPKLTFSQKIKLHAGVNKIAL 540
Query: 510 LSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFY 569
LS VGL N G+ F+ G+ GPV L G + D+S KW+YK+G+ G + +
Sbjct: 541 LSVAVGLPNVGTHFEQWNKGVLGPVTL---KGVNSGTWDMSKWKWSYKIGVKG-EALSLH 596
Query: 570 NAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRY 629
++ R V + +TWYK+TF P N+P+ L++ MGKG W+NG N+GR+
Sbjct: 597 TNTESSGVRWTQGSFVAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQVWINGRNIGRH 656
Query: 630 WPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
WP Y A+ S C+Y G + + KC NCG SQ WYHVPRSW+K N +V+FEE G
Sbjct: 657 WPAYKAQG---SCGRCNYAGTFDAKKCLSNCGEASQRWYHVPRSWLKSQ-NLIVVFEELG 712
Query: 690 GNPSQINF 697
G+P+ I+
Sbjct: 713 GDPNGISL 720
>gi|15451018|gb|AAK96780.1| beta-galactosidase [Arabidopsis thaliana]
gi|17978799|gb|AAL47393.1| beta-galactosidase [Arabidopsis thaliana]
Length = 724
Score = 616 bits (1589), Expect = e-173, Method: Compositional matrix adjust.
Identities = 336/728 (46%), Positives = 432/728 (59%), Gaps = 61/728 (8%)
Query: 10 AILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
AIL CL L + S VS+D +A+ I+G+R+ILLSGSIHYPRSTP MWP LI+KAKE
Sbjct: 14 AILCCLSLSCIVKAS----VSYDRKAVIINGQRRILLSGSIHYPRSTPEMWPGLIQKAKE 69
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
GGLD IETYVFWN HEP QY F DL++FIK + GLYV LRIGPYVCAEWN+GG
Sbjct: 70 GGLDVIETYVFWNGHEPSPGQYYFGDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGG 129
Query: 130 FPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGN 189
FPVWL +PG+ RT N+ F M+ FT IV M K EKLF +QGGPIILAQIENEYG
Sbjct: 130 FPVWLKFVPGMA-FRTDNEPFKAAMKKFTEKIVWMMKAEKLFQTQGGPIILAQIENEYGP 188
Query: 190 VMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNS 238
V + G GK+Y W A+MA L GVPWIMC++ DAP P+ F PN+ N
Sbjct: 189 VEWEIGAPGKAYTKWVAQMALGLSTGVPWIMCKQEDAPGPIIDTCNGYYCEDFKPNSINK 248
Query: 239 PKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGP 298
PK+WTENWTGW+ +GG P R ED+A++VARF Q GG+ NYYMYHGGTNF RT+G
Sbjct: 249 PKMWTENWTGWYTDFGGAVPYRPVEDIAYSVARFIQKGGSLINYYMYHGGTNFDRTAG-E 307
Query: 299 YLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG--------- 349
++ +SYDYDAP+DEYG +PK+ HL+ LHK +K E L + T T G
Sbjct: 308 FMASSYDYDAPLDEYGLPREPKYSHLKALHKAIKLSEPALLSADATVTSLGAKQEAYVFW 367
Query: 350 --------------NSVS-----GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR 390
NS + G Y+LP WSVSILPDCKTE +NTAKVN + +
Sbjct: 368 SKSSCAAFLSNKDENSAARVLFRGFPYDLPPWSVSILPDCKTEVYNTAKVNAPSVHRNMV 427
Query: 391 PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKD 449
P ++ W G FA N L++Q S T D SDY WY+T+ +
Sbjct: 428 PTGT-------KFSWGSFNEATPTANEAGTFARNGLVEQISMTWDKSDYFWYITDITIGS 480
Query: 450 DDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISL 509
+ L + L + S+G LH +VNG + + F + +KL G N+I+L
Sbjct: 481 GETFLKTGDSPLLTVMSAGHALHVFVNGQLSGTAYGGLDHPKLTFSQKIKLHAGVNKIAL 540
Query: 510 LSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFY 569
LS VGL N G+ F+ G+ GPV L G + D+S KW+YK+G+ G + +
Sbjct: 541 LSVAVGLPNVGTHFEQWNKGVLGPVTL---KGVNSGTWDMSKWKWSYKIGVKG-EALSLH 596
Query: 570 NAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRY 629
++ R V + +TWYK+TF P N+P+ L++ MGKG W+NG N+GR+
Sbjct: 597 TNTESSGVRWTQGSFVAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQVWINGRNIGRH 656
Query: 630 WPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
WP Y A+ S C+Y G + + KC NCG SQ WYHVPRSW+K N +V+FEE G
Sbjct: 657 WPAYKAQG---SCGRCNYAGTFDAKKCLSNCGEASQRWYHVPRSWLKSQ-NLIVVFEELG 712
Query: 690 GNPSQINF 697
G+P+ I+
Sbjct: 713 GDPNGISL 720
>gi|380450408|gb|AFD54987.1| beta-galactosidase [Momordica charantia]
Length = 719
Score = 615 bits (1587), Expect = e-173, Method: Compositional matrix adjust.
Identities = 331/731 (45%), Positives = 437/731 (59%), Gaps = 59/731 (8%)
Query: 8 SRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKA 67
S+ +LL L L + ++A V++D +AI I+G+R+IL+SGSIHYPRSTP MWP LI+ A
Sbjct: 2 SKCVLLFLGLLSWVCYAMA-TVTYDQKAIIINGKRRILVSGSIHYPRSTPQMWPSLIQNA 60
Query: 68 KEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNY 127
K+GGLD IETYVFWN HEP + +Y F DL+RFIK +Q GLYV LRIGPYVCAEWNY
Sbjct: 61 KDGGLDIIETYVFWNGHEPTQGKYYFEDRYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNY 120
Query: 128 GGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEY 187
GGFP+WL ++PGI RT N+ F MQ FT IV M K EKL+ SQGGPIIL+QIENEY
Sbjct: 121 GGFPIWLKHVPGIV-FRTENEPFKAAMQKFTEKIVGMMKSEKLYESQGGPIILSQIENEY 179
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNP 236
G V + G GKSY W A+MA LD GVPW+MC++ DAP P+ F PN
Sbjct: 180 GPVEWEIGAPGKSYTKWAAQMALGLDTGVPWVMCKQEDAPDPVIDTCNGFYCENFKPNRE 239
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
N PKIWTE W+GW+ ++GG P R AEDLAF+VARF Q GG+ NYYMYHGGTNFGR+SG
Sbjct: 240 NKPKIWTEVWSGWYTAFGGAVPYRPAEDLAFSVARFVQNGGSLFNYYMYHGGTNFGRSSG 299
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSG-- 354
++ SYD+DAPIDEYG +PKW HLR+LHK +K E L + T G ++
Sbjct: 300 -LFIANSYDFDAPIDEYGLKREPKWEHLRDLHKAIKLCEPALVSADPNVTWLGKNLEARV 358
Query: 355 ---------------------------SSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK 387
+ Y+LP WS+SIL DCK+ FNTA++ Q+
Sbjct: 359 FKSSSGACAAFLANYDISTSSKVSFWNTQYDLPPWSISILSDCKSAIFNTARIGAQS--- 415
Query: 388 VKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNAD 446
P + + ++ E+ + + + L++Q + T D +DYLWYMT+
Sbjct: 416 --APMKMMLVSSFWWLSYKEEVASGYATDTTTK---DGLVEQVNFTWDSTDYLWYMTDIQ 470
Query: 447 LKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQ 506
+ ++ + L I+S+G VLH +VNG + + F + V L G N+
Sbjct: 471 IDPNEAFIKSGQWPLLNISSAGHVLHVFVNGQLSGTVYGSLENPKVAFSKYVNLKAGVNK 530
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDK 566
+S+LS TVGL N G F+ G+ GPV L G I+D+S +KW++KVGL G ++
Sbjct: 531 LSMLSVTVGLPNVGLHFESWNAGVLGPVTL---KGLNEGIRDMSGYKWSHKVGLKG-ENM 586
Query: 567 KFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNL 626
+ +NS + + + +TWYKT F P N+P+ L++ MGKG W+NG ++
Sbjct: 587 NLHTIGGSNSVQWAKGSGLVQKQPLTWYKTNFNTPAGNEPLALDMSSMGKGQIWINGRSI 646
Query: 627 GRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
GRYWP Y A S C Y G + KC NCG PSQ WYHVPR W++ N LV+FE
Sbjct: 647 GRYWPAYAASG---SCGKCSYAGIFTEKKCLSNCGQPSQKWYHVPREWLESKGNFLVVFE 703
Query: 687 EFGGNPSQINF 697
E GGNP I+
Sbjct: 704 ELGGNPGGISL 714
>gi|242064502|ref|XP_002453540.1| hypothetical protein SORBIDRAFT_04g007660 [Sorghum bicolor]
gi|241933371|gb|EES06516.1| hypothetical protein SORBIDRAFT_04g007660 [Sorghum bicolor]
Length = 740
Score = 615 bits (1586), Expect = e-173, Method: Compositional matrix adjust.
Identities = 330/701 (47%), Positives = 415/701 (59%), Gaps = 61/701 (8%)
Query: 31 HDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQ 90
+D R++ I+G R+IL+SGSIHYPRSTP MWP LI+KAK+GGLD I+TYVFWN HEP++ Q
Sbjct: 47 YDHRSLVINGRRRILISGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHEPVQGQ 106
Query: 91 YDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVF 150
Y F DL+RF+K ++ GLYV LRIGPYVCAEWN+GGFPVWL +PGI RT N F
Sbjct: 107 YHFADRYDLVRFVKLVRQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIR-FRTDNGPF 165
Query: 151 MNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMAT 210
MQ F IV M K E LF QGGPII+AQ+ENE+G + S G K Y +W A+MA
Sbjct: 166 KAAMQKFVEKIVSMMKSEGLFEWQGGPIIMAQVENEFGPMESVVGSGAKPYAHWAAQMAV 225
Query: 211 SLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDPK 259
+ GVPW+MC++ DAP P+ FTPN P +WTE WTGWF +GG P
Sbjct: 226 GTNTGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNRKYKPTMWTEAWTGWFTKFGGALPH 285
Query: 260 RTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQP 319
R EDLAFAVARF Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAPIDE+G L QP
Sbjct: 286 RPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLLRQP 345
Query: 320 KWGHLRELHKLLKSMEKTLTYGNVTNTDYGN----------------------------- 350
KWGHLR+LH+ +K E L G+ T GN
Sbjct: 346 KWGHLRDLHRAIKQAEPALISGDPTIQSIGNYEKAYIFKSKNGACAAFLSNYHMKTAVKI 405
Query: 351 SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMI 410
G Y+LPAWS+SILPDCKT FNTA V T + P L + W+
Sbjct: 406 RFDGRHYDLPAWSISILPDCKTAVFNTATVKEPTLLPKMNP--------VLHFAWQ-SYS 456
Query: 411 NDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQ 469
D F N L++Q S T D SDYLWY T+ + ++ L L + S+G
Sbjct: 457 EDTNSLDDSAFTRNGLVEQLSLTWDKSDYLWYTTHVSIGGNEQFLKSGQWPQLTVYSAGH 516
Query: 470 VLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNG 529
+ +VNG S + Y F VK+ +G N+IS+LS+ VGL N G+ F++ G
Sbjct: 517 SMQVFVNGRSYGSVYGGYDNPKLTFNGHVKMWQGSNKISILSSAVGLPNNGNHFELWNVG 576
Query: 530 IPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNR 589
+ GPV L G + +DLS KWTY+VGL G + + +S W+ +
Sbjct: 577 VLGPVTLSGLNEGK---RDLSHQKWTYQVGLKG--ESLGLHTVTGSSAVEWAGPGG--KQ 629
Query: 590 RMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRG 649
+TW+K F AP +DPV L++ MGKG WVNG++ GRYW +Y A C C Y G
Sbjct: 630 PLTWHKALFNAPAGSDPVALDMGSMGKGQIWVNGHHAGRYW-SYRAYSGSC--RRCSYAG 686
Query: 650 PYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGG 690
Y D+C NCG+ SQ WYHVPRSW+K N LV+ EE+GG
Sbjct: 687 TYREDQCLSNCGDISQRWYHVPRSWLKPSGNLLVVLEEYGG 727
>gi|195617466|gb|ACG30563.1| beta-galactosidase precursor [Zea mays]
Length = 723
Score = 615 bits (1585), Expect = e-173, Method: Compositional matrix adjust.
Identities = 335/712 (47%), Positives = 425/712 (59%), Gaps = 60/712 (8%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D RA+ I+G+R+IL+SGSIHYPRSTP MWP L++KAK+GGLD ++TYVFWN HEP+R
Sbjct: 28 VSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTYVFWNGHEPVR 87
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QY F DL+RF+K + GLYV LRIGPYVCAEWN+GGFPVWL +PGI RT N
Sbjct: 88 GQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIS-FRTDNG 146
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ F IV M K E LF QGGPIILAQ+ENEYG + S G K Y NW AKM
Sbjct: 147 PFKAAMQAFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGAGAKPYANWAAKM 206
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A + GVPW+MC++ DAP P+ F+PN+ + P +WTE WTGWF ++GG
Sbjct: 207 AVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNSNSKPTMWTEAWTGWFTAFGGAV 266
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R ED+AFAVARF Q GG+F NYYMYHGGTNF RTSGGP++ TSYDYDAPIDEYG L
Sbjct: 267 PHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGLLR 326
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------------------------- 350
QPKWGHLR+LHK +K E L G+ T GN
Sbjct: 327 QPKWGHLRDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSSGGACAAFLSNYHTSAAA 386
Query: 351 --SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
+G Y+LPAWS+S+LPDCK FNTA V+ + R + AG W+ E
Sbjct: 387 RVVFNGRRYDLPAWSISVLPDCKAAVFNTATVSEPS--APARMSPAGG----FSWQSYSE 440
Query: 409 MINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
N R F + L++Q S T D SDYLWY T ++ ++ L L + S+
Sbjct: 441 ATNSLDGRA---FTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQLTVYSA 497
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G L +VNG + + Y + + VK+ +G N+IS+LSA VGL N G+ ++
Sbjct: 498 GHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGTHYETWN 557
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
G+ GPV L G + +DLS+ KWTY++GL+G A +++ E G ++ PL
Sbjct: 558 VGVLGPVTLSGLNEGK---RDLSNQKWTYQIGLHGESLGVQSVAGSSSVEWGSAAGKQPL 614
Query: 588 NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDY 647
TW+K F AP + PV L++ MGKG AWVNG ++GRYW +Y A C Y
Sbjct: 615 ----TWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYW-SYKASSS-GGCGGCSY 668
Query: 648 RGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
G Y KC CG+ SQ +YHVPRSW+ N LVL EEFGG+ + T
Sbjct: 669 AGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVLLEEFGGDLPGVKLVT 720
>gi|297793199|ref|XP_002864484.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297310319|gb|EFH40743.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 726
Score = 609 bits (1571), Expect = e-171, Method: Compositional matrix adjust.
Identities = 334/730 (45%), Positives = 430/730 (58%), Gaps = 63/730 (8%)
Query: 10 AILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
IL CL L + S VS+D +A+ I+G+R+ILLSGSIHYPRSTP MWP LI+KAKE
Sbjct: 14 VILCCLSLVCIVKAS----VSYDRKAVIINGQRRILLSGSIHYPRSTPEMWPGLIQKAKE 69
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
GGLD IETYVFWN HEP QY F DL++FIK + GLYV LRIGPYVCAEWN+GG
Sbjct: 70 GGLDVIETYVFWNGHEPSPGQYYFGDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGG 129
Query: 130 FPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQ--IENEY 187
FPVWL +PG+ RT N+ F M+ FT IV M K EKLF +QGGPIILAQ IENEY
Sbjct: 130 FPVWLKFVPGMA-FRTDNEPFKAAMKKFTEKIVWMMKAEKLFQTQGGPIILAQGQIENEY 188
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNP 236
G V + G GK+Y W A+MA L GVPWIMC++ DAPSP+ F PN+
Sbjct: 189 GPVEWEIGAPGKAYTKWVAQMALGLSTGVPWIMCKQEDAPSPIIDTCNGYYCEDFKPNSS 248
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
N PK+WTENWTGW+ +GG P R ED+A++VARF Q GG+F NYYMYHGGTNF RT+G
Sbjct: 249 NKPKMWTENWTGWYTEFGGAVPYRPVEDIAYSVARFIQKGGSFVNYYMYHGGTNFDRTAG 308
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNS----- 351
++ +SYDYDAP+DEYG +PK+ HL+ LHK++K E L + T T G
Sbjct: 309 -EFMASSYDYDAPLDEYGLPREPKYSHLKALHKVIKLSEPALLSADATVTSLGAKQEAYV 367
Query: 352 -----------------------VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKV 388
G Y LP WSVSILPDCKTE +NTAKVN + +
Sbjct: 368 FWSKSSCAAFLSNKDESSAARVMFRGFPYVLPPWSVSILPDCKTEFYNTAKVNAPSVHRN 427
Query: 389 KRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADL 447
P A ++ W G FA N L++Q S T D SDY WY+T+ +
Sbjct: 428 MVPTGA-------RFSWGSFNEATPTANEAGTFARNGLVEQISMTWDKSDYFWYLTDITI 480
Query: 448 KDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQI 507
+ L + S+G LH +VNG + + F + +KL G N++
Sbjct: 481 GSGETFLKTGDFPLFTVMSAGHALHVFVNGQLSGTAYGGLDHPKLTFTQKIKLHAGVNKL 540
Query: 508 SLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKK 567
+LLS VGL N G+ F+ G+ GPV L G + D+S KW+YK+G+ G +
Sbjct: 541 ALLSVAVGLPNVGTHFEQWNKGVLGPVTL---KGVNSGTWDMSKWKWSYKIGVKG-EALS 596
Query: 568 FYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLG 627
+ ++ R V + +TWYK+TF P N+P+ L++ MGKG W+NG N+G
Sbjct: 597 LHTDTESSGVRWTQGSFVAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQVWINGRNIG 656
Query: 628 RYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEE 687
R+WP Y A+ S C+Y G + + KC NCG SQ WYHVPRSW+K N +V+FEE
Sbjct: 657 RHWPAYKAQG---SCGRCNYAGTFNAKKCLSNCGEASQRWYHVPRSWLKSQ-NLIVVFEE 712
Query: 688 FGGNPSQINF 697
+GG+P+ I+
Sbjct: 713 WGGDPNGISL 722
>gi|30699255|ref|NP_177866.2| beta-galactosidase 16 [Arabidopsis thaliana]
gi|152013367|sp|Q8GX69.2|BGL16_ARATH RecName: Full=Beta-galactosidase 16; Short=Lactase 16; Flags:
Precursor
gi|332197854|gb|AEE35975.1| beta-galactosidase 16 [Arabidopsis thaliana]
Length = 815
Score = 606 bits (1562), Expect = e-170, Method: Compositional matrix adjust.
Identities = 344/852 (40%), Positives = 463/852 (54%), Gaps = 113/852 (13%)
Query: 12 LLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGG 71
L+ L+L + V++DGR++ IDGE KIL SGSIHY RSTP MWP LI KAK GG
Sbjct: 8 LVFLVLMAVIVAGDVANVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGG 67
Query: 72 LDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFP 131
+D ++TYVFWN HEP + Q+DF+G+ D+++FIK +++ GLYV LRIGP++ EW+YGG P
Sbjct: 68 IDVVDTYVFWNVHEPQQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLP 127
Query: 132 VWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVM 191
WLHN+ GI RT N+ F M+ + +IV + K E L+ASQGGPIIL+QIENEYG V
Sbjct: 128 FWLHNVQGI-VFRTDNEPFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVG 186
Query: 192 SDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNS 238
+ GKSY+ W AK+A LD GVPW+MC++ DAP P+ PN+PN
Sbjct: 187 RAFRQEGKSYVKWTAKLAVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNK 246
Query: 239 PKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGP 298
P IWTENWT +++++G + R+AED+AF VA F G+F NYYMYHGGTNFGR +
Sbjct: 247 PAIWTENWTSFYQTYGEEPLIRSAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNASQF 306
Query: 299 YLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN-------- 350
+T+ YD AP+DEYG L QPKWGHL+ELH +K E+ L G T G
Sbjct: 307 VITSYYD-QAPLDEYGLLRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFG 365
Query: 351 --------------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR 390
SSY L SVS+LPDCK FNTAKVN Q N + ++
Sbjct: 366 KKANLCAAILVNQDKCESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYNTRTRK 425
Query: 391 PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDD 450
Q N +P W+ E + F +L L +T D SDYLW T +
Sbjct: 426 ARQ--NLSSPQMWEEFTETVPSFSETSIRSESL--LEHMNTTQDTSDYLWQTTRFQQSEG 481
Query: 451 DPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLL 510
P + L++N G LHA+VNG ++ S + A L E+ + L G N ++LL
Sbjct: 482 APSV-------LKVNHLGHALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLALL 534
Query: 511 SATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYN 570
S VGL N G+ + G + GR +++ W Y+VGL G +K
Sbjct: 535 SVMVGLPNSGAHLERRVVGSRSVKIWNGRYQLY-----FNNYSWGYQVGLKG--EKFHVY 587
Query: 571 AKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
+ +++ W ++ +TWYK +F+ P DPV LNL MGKG AWVNG ++GRYW
Sbjct: 588 TEDGSAKVQWKQYRDSKSQPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIGRYW 647
Query: 631 PTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLF-EEFG 689
++ + GNPSQIWYH+PRS++K N LV+ EE
Sbjct: 648 VSFHTYK-----------------------GNPSQIWYHIPRSFLKPNSNLLVILEEERE 684
Query: 690 GNPSQINFQTVVVGTACGQA-----------------HENKT--------MELTC-HGRR 723
GNP I TV V CG +N T ++L C GR+
Sbjct: 685 GNPLGITIDTVSVTEVCGHVSNTNPHPVISPRKKGLNRKNLTYRYDRKPKVQLQCPTGRK 744
Query: 724 ISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAA 783
IS+I +ASFG P G+CG++ GSC + + L +++K C+ K CS+ G SC
Sbjct: 745 ISKILFASFGTPNGSCGSYSIGSCHSP-NSLAVVQKACLKKSRCSVPVWSKTFGGDSCPH 803
Query: 784 GTVKRLVVEALC 795
TVK L+V A C
Sbjct: 804 -TVKSLLVRAQC 814
>gi|224066807|ref|XP_002302225.1| predicted protein [Populus trichocarpa]
gi|222843951|gb|EEE81498.1| predicted protein [Populus trichocarpa]
Length = 798
Score = 606 bits (1562), Expect = e-170, Method: Compositional matrix adjust.
Identities = 345/836 (41%), Positives = 458/836 (54%), Gaps = 115/836 (13%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D R++ I+G+ KI+ SGSIHYPRSTP MWP LI KA+ GGLDAI+TYVFWN HEP +
Sbjct: 8 VTYDSRSLVINGKHKIIFSGSIHYPRSTPQMWPYLISKARAGGLDAIDTYVFWNLHEPQQ 67
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QYDF+G DL+RFIK + QGLYV LRIGP++ +EW YGG P WLH++PGI R+ NK
Sbjct: 68 GQYDFSGRKDLVRFIKEVHAQGLYVCLRIGPFIESEWTYGGLPFWLHDVPGIV-FRSDNK 126
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F M+ + +IV M K EKL+ASQGGPIIL+QIENEYGNV + + + G Y+ W AKM
Sbjct: 127 PFKYHMERYAKMIVKMLKAEKLYASQGGPIILSQIENEYGNVEAAFHEKGPPYVKWAAKM 186
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGG 255
A L GVPW+MC++ DAP P+ PN+P P IWTENWT ++++G
Sbjct: 187 AVGLHTGVPWVMCKQDDAPDPVINACNGLRCGETFSGPNSPRKPAIWTENWTSVYQTYGK 246
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
+ R+AED+AF A F GG+F NYYMYHGGTNFGRT+ Y+ TSY AP+DEYG
Sbjct: 247 ETRSRSAEDIAFHAALFIAKGGSFVNYYMYHGGTNFGRTAA-EYVPTSYYDQAPLDEYGL 305
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN------------------------- 350
L QPK GHL+ELH +K K L N G
Sbjct: 306 LRQPKHGHLKELHAAIKLCRKPLLSRKWINFSLGQLQEAFAFERNSDECAAFLVNHDGRS 365
Query: 351 ----SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
GSSY LP S+SILP CKT FNTA+V+TQ ++ + QWK
Sbjct: 366 NATVHFKGSSYKLPPKSISILPHCKTVAFNTAQVSTQYGTRLATRRHKFDSIE--QWKEY 423
Query: 407 PEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
E I F K NTL++ +T D SDYLWY S +++ L +N
Sbjct: 424 KEYIPSF---DKSSLRANTLLEHMNTTKDSSDYLWYTFRFHQN------SSNAHSVLTVN 474
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
S G LHA+VNG ++ S + + +R + L RG N +SLLS GL + G+ +
Sbjct: 475 SLGHNLHAFVNGEFIGSAHGSHDNKSFTLQRSLPLKRGTNYVSLLSVMTGLPDAGAYLER 534
Query: 526 VPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNV 585
G+ + + R + + D +++ W YKVGL G + + N S + + S+
Sbjct: 535 RVAGLR--RVTIQRQHE---LHDFTTYLWGYKVGLSGENIQLHRNNA---SVKAYWSRYA 586
Query: 586 PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESC 645
+R +TWYK+ F+AP NDPV LNL MGKG AWVNG ++GRYW ++L +
Sbjct: 587 SSSRPLTWYKSIFDAPAGNDPVALNLASMGKGEAWVNGRSIGRYWVSFLDSD-------- 638
Query: 646 DYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTA 705
GNP Q W H+PRS++K N LV+ EE GNP I+ T+ +
Sbjct: 639 ---------------GNPYQTWNHIPRSFLKPSGNLLVILEEERGNPLGISLGTMSITKV 683
Query: 706 CGQAH------------ENKT-------------MELTC-HGRRISEIKYASFGDPQGAC 739
CG EN+ ++L C GR+IS + ++SFG P G C
Sbjct: 684 CGHVSISHPPPVISWQGENQINGTRKRKYGRRPKVQLRCPRGRKISSVLFSSFGTPSGDC 743
Query: 740 GAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+ GSC A + +EK C+GK+ CSI S N C G K L+V+A C
Sbjct: 744 ETYAIGSCHAS-NSRATVEKACLGKERCSIPVSSKNFKGDPC-PGIAKSLLVDAKC 797
>gi|224082320|ref|XP_002306647.1| predicted protein [Populus trichocarpa]
gi|222856096|gb|EEE93643.1| predicted protein [Populus trichocarpa]
Length = 764
Score = 605 bits (1561), Expect = e-170, Method: Compositional matrix adjust.
Identities = 344/813 (42%), Positives = 462/813 (56%), Gaps = 97/813 (11%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DGR++ I+G+ KIL SGSIHYPRSTP MW LI KAK GG+D I+TYVFWN HEP +
Sbjct: 2 VTYDGRSLIINGQHKILFSGSIHYPRSTPDMWSSLISKAKAGGIDVIQTYVFWNLHEPQQ 61
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Q+ F G DL+RF+K IQ QGLY LRIGP++ +EW YGG P WLH++PG+ R+ N+
Sbjct: 62 GQFYFNGRADLVRFVKEIQAQGLYACLRIGPFIESEWTYGGLPFWLHDIPGMV-YRSDNQ 120
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F M+ F + IV M K EKL+ASQGGPIIL+Q+ENEY NV + + + G SY+ W A M
Sbjct: 121 PFKYHMKRFVSRIVSMMKSEKLYASQGGPIILSQVENEYKNVEAAFHEKGPSYVRWAALM 180
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGG 255
A +L GVPW+MC++ DAP P+ PN+PN P IWTE+WT +++ +G
Sbjct: 181 AVNLQTGVPWVMCKQDDAPDPVINSCNGMRCGETFAGPNSPNKPSIWTEDWTSFYQVYGE 240
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
+ R+A+D+AF VA F G++ NYYMYHGGTNFGRT+ +T+ YD AP+DEYG
Sbjct: 241 ETYMRSAQDIAFHVALFIAKTGSYVNYYMYHGGTNFGRTASAFTITSYYD-QAPLDEYGL 299
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYGN-------------------------VTNTDYGN 350
+ QPKWGHL+ELH +KS K L +G + N D
Sbjct: 300 IRQPKWGHLKELHAAIKSCSKLLLHGAHKTFSLGPLQQAYVFQGNSGQCAAFLVNNDGKQ 359
Query: 351 SV----SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
V +SY LP S+SILPDCKT FNTAKVN Q + +PNQ N +W+
Sbjct: 360 EVEVLFQSNSYKLPQKSISILPDCKTMTFNTAKVNAQYTTRSMKPNQKFNSVG--KWEEY 417
Query: 407 PEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
E I +F K N L++ ST D SDYLWY + + P ++
Sbjct: 418 NEPIPEF---DKTSLRANRLLEHMSTTKDTSDYLWYTFR--FQQNLP----NAQSVFNAQ 468
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
S G VLHAYVNG + + ++ + V+L G N ++LLSATVGL + G+ +
Sbjct: 469 SHGHVLHAYVNGVHAGFGHGSHQNTSFSLQTTVRLKNGTNSVALLSATVGLPDSGAYLER 528
Query: 526 VPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNV 585
G+ R + KD +++ W Y+VGL G + + Y +N + W+ +
Sbjct: 529 RVAGL-------RRVRIQN--KDFTTYTWGYQVGLLG-ERLQIYTENGSNKVK-WN--KL 575
Query: 586 PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESC 645
NR + WYKT F+AP NDPV LNL MGKG AWVNG ++GRYW ++ +
Sbjct: 576 GTNRPLMWYKTLFDAPAGNDPVALNLGSMGKGEAWVNGQSIGRYWVSFHTSQ-------- 627
Query: 646 DYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTA 705
G+PSQ WY++PR+++K N LVL EE G P I TV V
Sbjct: 628 ---------------GSPSQTWYNIPRAFLKPTGNLLVLLEEEKGYPPGITVDTVSVTKV 672
Query: 706 CGQAHEN--KTMELTCHGRR-ISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCV 762
CG A E+ ++L+C +R IS I +ASFG P G C ++ G+C + +EK C+
Sbjct: 673 CGYASESHLSAVQLSCPLKRNISSIIFASFGTPSGNCESYAIGNCHSSSSKAN-VEKACI 731
Query: 763 GKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
GK+SCSI S G C G K L+VEA C
Sbjct: 732 GKRSCSIPQSNHFFGGDPC-PGIPKVLLVEAKC 763
>gi|168045683|ref|XP_001775306.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673387|gb|EDQ59911.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 831
Score = 605 bits (1559), Expect = e-170, Method: Compositional matrix adjust.
Identities = 347/841 (41%), Positives = 468/841 (55%), Gaps = 99/841 (11%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
S+A VS+D RA+ +DG R++L+SGSIHYPRSTP MWP LI KAK+GGLD I+TYVFW+
Sbjct: 20 SVAVTVSYDQRALKLDGNRRMLVSGSIHYPRSTPTMWPGLIAKAKKGGLDVIQTYVFWSG 79
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
HEP + Y+F G DL +F++ + + G+YV LRIGPYVCAEWN+GGFP WL +PGI E
Sbjct: 80 HEPTQGVYNFAGRYDLPKFLRLVHEAGMYVNLRIGPYVCAEWNFGGFPGWLRFLPGI-EF 138
Query: 144 RTTNKVFMNEM-QNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYI 202
RT N+ F + +FT+ ++ + + F Q +I AQIENEYG++ + YG+AG+ Y+
Sbjct: 139 RTDNESFKVHLSHSFTSSLISVYSRS--FNIQ--LVICAQIENEYGSIDAVYGEAGQKYL 194
Query: 203 NWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFK 251
NW A MA + +I VPWIMC + DAP + F PN+ P +WTENWTGWF+
Sbjct: 195 NWIANMAVATNISVPWIMCNQPDAPPSVIDTCNGFYCDGFRPNSEGKPALWTENWTGWFQ 254
Query: 252 SWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPID 311
SWG P R +D+AFAVARFFQ GG+F +YYMYHGGTNF R S +TT+YDYDAPID
Sbjct: 255 SWGEGAPTRPVQDIAFAVARFFQKGGSFMHYYMYHGGTNFER-SAMEGVTTNYDYDAPID 313
Query: 312 EYGHLNQPKWGHLRELHKLLKSMEKTLT---------------YGNVTNTDYGNSVS--- 353
EYG + QPKWGHL++LH LK E L +V N+ G +
Sbjct: 314 EYGDVRQPKWGHLKDLHAALKLCELCLVGVDTVPSEISLGPYQEAHVYNSSTGACAAFLA 373
Query: 354 ------------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPL 401
G SY+LPAWSVSILPDCK+ FNTAKV Q+ + P+
Sbjct: 374 SWGTDDSTVLFQGQSYDLPAWSVSILPDCKSVVFNTAKVGVQSMTMTMQ------SAIPV 427
Query: 402 -QWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSN 459
W E + + F+ N L++Q +T D +DYLWY TN ++ + D +G +
Sbjct: 428 TNWVSYREPLEPW----GSTFSTNELVEQIATTKDTTDYLWYTTNVEVAESDAP-NGLAQ 482
Query: 460 MTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNY 519
TL ++ H +VN ++ + +G+ + + L G N + +LS T GLQ
Sbjct: 483 ATLVMSYLRDAAHIFVN-KWLTGTKSAHGSEAS---QSISLRPGINSVKVLSMTTGLQGT 538
Query: 520 GSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERG 579
G + GI + + G I++ + WTY+VGL G ++ + + + + S
Sbjct: 539 GPFLEKEKAGIQFGIRVEGLPSGAIIMQ---RNTWTYQVGLQG-ENNRLFESNGSLSAVW 594
Query: 580 WSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDG 639
+S +V ++W+KTTF+ P N V L+L MGKG WVNG NLGRYW + +A DG
Sbjct: 595 STSTDVSNQMSLSWFKTTFDMPERNGTVALDLSSMGKGQVWVNGINLGRYWSSCIAHTDG 654
Query: 640 CSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
C ++CDYRG + KC CG PSQ WYHVPR W+ N LVLFEE GNP I
Sbjct: 655 C-VDNCDYRGSHSESKCLTKCGQPSQSWYHVPREWLLSKQNLLVLFEEQEGNPEAITIAP 713
Query: 700 VVVGTACGQAHENK----------------------TMELTC-HGRRISEIKYASFGDPQ 736
+ C + E+ + L C G+ IS I +AS+G P
Sbjct: 714 RIPQHICSRMSESHPFPIPLSSSTKRGSQTSTPPIAPLALECADGQHISRISFASYGTPS 773
Query: 737 GACGAFKKGSCEA--EIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEAL 794
G CG FK SC A DVL K CVG++ C + + G C G +K L A
Sbjct: 774 GDCGDFKLSSCHANSSKDVL---SKACVGRQKCLVPIVSSICGGDPC-PGMIKSLAATAE 829
Query: 795 C 795
C
Sbjct: 830 C 830
>gi|255543793|ref|XP_002512959.1| beta-galactosidase, putative [Ricinus communis]
gi|223547970|gb|EEF49462.1| beta-galactosidase, putative [Ricinus communis]
Length = 732
Score = 601 bits (1549), Expect = e-169, Method: Compositional matrix adjust.
Identities = 330/713 (46%), Positives = 427/713 (59%), Gaps = 64/713 (8%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D +A+ I+G+++IL SGSIHYPRSTP MW LI+KAK+GGLD I+TYVFWN HEP
Sbjct: 28 VTYDKKALIINGQKRILFSGSIHYPRSTPQMWEGLIQKAKDGGLDVIDTYVFWNLHEPSP 87
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y+F G DL++FIK + GLYV LRIGPY+C EWN+GGFPVWL +PG+ RT N+
Sbjct: 88 GNYNFEGRNDLVQFIKLVHKAGLYVHLRIGPYICGEWNFGGFPVWLKYIPGM-IFRTDNE 146
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F +MQ FT IV M K E+L+ SQGGPIIL+QIENEY +G AG +Y+ W A M
Sbjct: 147 PFKLQMQKFTQKIVQMMKDEQLYESQGGPIILSQIENEYEPEDKAFGAAGHAYMTWAAHM 206
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A SL+ GVPW+MC+E DAP P+ F+PN P +WTE WTGWF +GG
Sbjct: 207 AVSLNTGVPWVMCKEFDAPDPVVNTCNGFYCDYFSPNKAYKPTMWTEAWTGWFTDFGGPI 266
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
+R EDLAFAVARF Q GG+F NYYMYHGGTNFGRT+GGP++TTSYDYDAPIDEYG +
Sbjct: 267 HQRPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIR 326
Query: 318 QPKWGHLRELHKLLKSMEKTL--------TYGNVTNTDYGNSVSGS-------------- 355
QPK+GHL++LHK +K E+ L T G+ +S SG
Sbjct: 327 QPKYGHLKDLHKAIKLCERALLSSDPVVTTLGSYEQAHVFSSNSGDCAAFLANYNPKATA 386
Query: 356 -------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPE 408
YNLP WSVSILPDCK FNTA+V Q + P +A L W+ E
Sbjct: 387 KVTFNNMHYNLPPWSVSILPDCKNVVFNTAEVGVQPSKIQMLPTEA----RFLSWEALSE 442
Query: 409 MINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
I+ G A L++Q T D SDYLWY T + + L G L++ S+
Sbjct: 443 DISSVDDDKIGTVA--GLLEQINVTRDASDYLWYTTGVHISSSETFLDGGQPPILKVISA 500
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVK-LTRGKNQISLLSATVGLQNYGSKFDMV 526
G +H +VNG S + G F +K L G+N+ISLLS VGL N G +F+
Sbjct: 501 GHGIHVFVNGQLSGSVYGTRGNRRISFSGELKQLHAGRNRISLLSVAVGLPNNGPRFETW 560
Query: 527 PNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANS--ERGWSSKN 584
G+ GPV++ G +DL+ KW+YKVGL G D N + NS W ++
Sbjct: 561 NTGVLGPVVIHGLDQGH---RDLTWQKWSYKVGLKGED----LNLGSPNSIPSINWMQES 613
Query: 585 VPLNRR--MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCST 642
+ R +TW++ F+AP +DP+ L++ M KG W+NG ++GRYW Y DG T
Sbjct: 614 AMVAERQPLTWHRAFFDAPRGDDPLALDMSSMVKGQVWINGNSIGRYWTVY---ADGNCT 670
Query: 643 ESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQI 695
+C Y G + C + CG P+Q WYH+PRS +K N LV+FEE GG+ S+I
Sbjct: 671 -ACSYSGTFRPSTCQFGCGQPTQKWYHIPRSLLKPTENLLVVFEEIGGDVSKI 722
>gi|414865884|tpg|DAA44441.1| TPA: hypothetical protein ZEAMMB73_968467 [Zea mays]
Length = 641
Score = 599 bits (1545), Expect = e-168, Method: Compositional matrix adjust.
Identities = 319/625 (51%), Positives = 399/625 (63%), Gaps = 63/625 (10%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A V++D RA+ IDG R++L+SGSIHYPRSTP MWP LI+KAK+GGLD IETYVFW+ HE
Sbjct: 27 AANVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKAKDGGLDVIETYVFWDIHE 86
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P+R QYDF G DL F+KT+ D GLYV LRIGPYVCAEWNYGGFP+WLH +PGI + RT
Sbjct: 87 PVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGI-KFRT 145
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
N+ F EMQ FT +VD K L+ASQGGPIIL+QIENEYGN+ S YG GK+Y+ W
Sbjct: 146 DNEPFKAEMQRFTAKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAPGKAYMRWA 205
Query: 206 AKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWG 254
A MA SLD GVPW+MCQ++DAP P+ FTPN+ PK+WTENW+GWF S+G
Sbjct: 206 AGMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGWFLSFG 265
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
G P R EDLAFAVARF+Q GGTFQNYYMYHGGTN R+SGGP++ TSYDYDAPIDEYG
Sbjct: 266 GAVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAPIDEYG 325
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSV---------------------- 352
+ QPKWGHLR++HK +K E L + + T G +V
Sbjct: 326 LVRQPKWGHLRDVHKAIKLCEPALIATDPSYTSLGPNVEAAVYKVGSVCAAFLANIDGQS 385
Query: 353 ------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGN---------- 396
+G Y LPAWSVSILPDCK NTA++N+QT R ++ N
Sbjct: 386 DKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQINSQTTGSEMRYLESSNVASDGSFVTP 445
Query: 397 DQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILS 455
+ A W + E + + L++Q +T D SD+LWY T+ +K D+P L+
Sbjct: 446 ELAVSDWSYAIEPVG---ITKDNALTKAGLMEQINTTADASDFLWYSTSITVKGDEPYLN 502
Query: 456 GSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVG 515
GS + L +NS G VL Y+NG S +S +++P++L GKN+I LLSATVG
Sbjct: 503 GSQS-NLAVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSATVG 561
Query: 516 LQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAAN 575
L NYG+ FD+V GI GPV L G G DLSS +WTY++GL G +D Y+ A+
Sbjct: 562 LSNYGAFFDLVGAGITGPVKLSGLNG----ALDLSSAEWTYQIGLRG-EDLHLYDPSEAS 616
Query: 576 SERGWSSKNV-PLNRRMTWYKTTFE 599
E W S N P+N + WYK + E
Sbjct: 617 PE--WVSANAYPINHPLIWYKVSME 639
>gi|267026|sp|Q00662.1|BGAL_DIACA RecName: Full=Putative beta-galactosidase; Short=Lactase; AltName:
Full=SR12 protein; Flags: Precursor
gi|18328|emb|CAA40459.1| CARSR12 [Dianthus caryophyllus]
Length = 731
Score = 598 bits (1543), Expect = e-168, Method: Compositional matrix adjust.
Identities = 332/738 (44%), Positives = 436/738 (59%), Gaps = 75/738 (10%)
Query: 11 ILLCLILQTLFNLSLAY-RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
++L + + +S Y V +D RAI I+ +R+ILLSGSIHYPRSTP MWPD+I+KAK+
Sbjct: 12 MMLVYVFVLITLISCVYGNVWYDYRAIKINDQRRILLSGSIHYPRSTPEMWPDIIEKAKD 71
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
LD I+TYVFWN HEP +Y F G DL++FIK I GL+V LRIGP+ CAEWN+GG
Sbjct: 72 SQLDVIQTYVFWNGHEPSEGKYYFEGRYDLVKFIKLIHQAGLFVHLRIGPFACAEWNFGG 131
Query: 130 FPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGN 189
FPVWL +PGI E RT N F +MQ FTT IVDM K EKLF QGGPIIL QIENEYG
Sbjct: 132 FPVWLKYVPGI-EFRTDNGPFKEKMQVFTTKIVDMMKAEKLFHWQGGPIILNQIENEYGP 190
Query: 190 VMSDYGDAGKSYINWCAKMATSLDIGVPWIMC-QESDAPSPM-----------FTPNNPN 237
V + G GK+Y +W A+MA SL+ GVPWIMC Q+SD P + F P + +
Sbjct: 191 VEWEIGAPGKAYTHWAAQMAQSLNAGVPWIMCKQDSDVPDNVIDTCNGFYCEGFVPKDKS 250
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
PK+WTENWTGW+ +G P R AED+AF+VARF Q GG+F NYYM+HGGTNF T+ G
Sbjct: 251 KPKMWTENWTGWYTEYGKPVPYRPAEDVAFSVARFIQNGGSFMNYYMFHGGTNF-ETTAG 309
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN------- 350
+++TSYDYDAP+DEYG +PK+ HL+ LHK +K E L + T+ G+
Sbjct: 310 RFVSTSYDYDAPLDEYGLPREPKYTHLKNLHKAIKMCEPALVSSDAKVTNLGSNQEAHVY 369
Query: 351 ----------------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVN------- 381
+ SG + LPAWS+SILPDCK E +NTA+VN
Sbjct: 370 SSNSGSCAAFLANYDPKWSVKVTFSGMEFELPAWSISILPDCKKEVYNTARVNEPSPKLH 429
Query: 382 -TQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYL 439
T V Q+ +D+ P G F L +Q T D SDYL
Sbjct: 430 SKMTPVISNLNWQSYSDEVP-------------TADSPGTFREKKLYEQINMTWDKSDYL 476
Query: 440 WYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVK 499
WYMT+ L ++ L L +NS+G VLH +VNG + F + VK
Sbjct: 477 WYMTDVVLDGNEGFLKKGDEPWLTVNSAGHVLHVFVNGQLQGHAYGSLAKPQLTFSQKVK 536
Query: 500 LTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVG 559
+T G N+ISLLSA VGL N G F+ G+ GPV L +G +DL+ W+YK+G
Sbjct: 537 MTAGVNRISLLSAVVGLANVGWHFERYNQGVLGPVTL---SGLNEGTRDLTWQYWSYKIG 593
Query: 560 LYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFA 619
G + + + + +++ + G + PL WYKTTF+AP NDP+ L+L MGKG A
Sbjct: 594 TKGEEQQVYNSGGSSHVQWGPPAWKQPL----VWYKTTFDAPGGNDPLALDLGSMGKGQA 649
Query: 620 WVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGV 679
W+NG ++GR+W +A+ G ++C+Y G Y KC +CG SQ WYHVPRSW++
Sbjct: 650 WINGQSIGRHWSNNIAK--GSCNDNCNYAGTYTETKCLSDCGKSSQKWYHVPRSWLQPRG 707
Query: 680 NTLVLFEEFGGNPSQINF 697
N LV+FEE+GG+ ++
Sbjct: 708 NLLVVFEEWGGDTKWVSL 725
>gi|413926109|gb|AFW66041.1| hypothetical protein ZEAMMB73_706783 [Zea mays]
Length = 785
Score = 598 bits (1543), Expect = e-168, Method: Compositional matrix adjust.
Identities = 331/761 (43%), Positives = 421/761 (55%), Gaps = 108/761 (14%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D R++ I+G R+IL+SGSIHYPRS P MWP LI+KAK+GGLD ++TYVFWN HEP +
Sbjct: 40 VSYDHRSLVINGRRRILISGSIHYPRSAPEMWPGLIQKAKDGGLDVVQTYVFWNGHEPAQ 99
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QY F DL+RF+K ++ GLYV LR+GPYVCAEWN+GGFPVWL +PGI RT N
Sbjct: 100 GQYYFADRYDLVRFVKLVRQAGLYVHLRVGPYVCAEWNFGGFPVWLKYVPGI-RFRTDNG 158
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ F IV M K E LF QGGPII+AQ+ENE+G + S G GK Y +W A+M
Sbjct: 159 PFKAAMQKFVEKIVSMMKSEGLFEWQGGPIIMAQVENEFGPMESVVGSGGKPYAHWAAQM 218
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A + GVPW+MC++ DAP P+ FTPNN + P +WTE WTGWF +GG
Sbjct: 219 AVGTNAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNNKHKPTMWTEAWTGWFTKFGGAA 278
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH-- 315
P R EDLAFAVARF Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAPIDE+G
Sbjct: 279 PHRPVEDLAFAVARFVQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGMQW 338
Query: 316 -----------------------------------------------LNQPKWGHLRELH 328
L QPKWGHLR +H
Sbjct: 339 LLPSLINLNSHRLPRDICRKSSQCGFYLSVVHTWNFWGGGWVYIAGLLRQPKWGHLRNMH 398
Query: 329 KLLKSMEKTLTYGNVTNTDYGN-----------------------------SVSGSSYNL 359
+ +K E L G+ T GN G Y+L
Sbjct: 399 RAIKQAEPALVSGDPTIRSIGNYEKAYVFKSKNGACAAFLSNYHVKSAVRIRFDGRHYDL 458
Query: 360 PAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKG 419
PAWS+SILPDCKT FNTA V T + P W+ E N
Sbjct: 459 PAWSISILPDCKTAVFNTATVKEPTLLPKMSPV-----MHRFAWQSYSEDTNSL---DDS 510
Query: 420 HFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGN 478
FA + LI+Q S T D SDYLWY T+ ++ ++ L L + S+G + +VNG
Sbjct: 511 AFARDGLIEQLSLTWDKSDYLWYTTHVNIGSNERFLKSGQWPQLSVYSAGHSMQVFVNGR 570
Query: 479 YVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVG 538
S + Y F VK+ +G N+IS+LS+ VGL N G F++ G+ GPV L G
Sbjct: 571 SYGSVYGGYDNPKLTFSGYVKMWQGSNKISILSSAVGLPNNGDHFELWNVGVLGPVTLSG 630
Query: 539 RAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTF 598
+ +DLS +W Y+VGL G + + +S W+ + +TW+K F
Sbjct: 631 LNEGK---RDLSHQRWIYQVGLKG--ESLGLHTVTGSSAVEWAGPGGG-TQPLTWHKALF 684
Query: 599 EAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAY 658
AP +DPV L++ MGKG WVNG + GRYW +Y A GC C Y G Y D+C
Sbjct: 685 NAPAGSDPVALDMGSMGKGQVWVNGRHAGRYW-SYRAHSRGCG--RCSYAGTYREDQCTS 741
Query: 659 NCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
NCG+ SQ WYHVPRSW+K N LV+ EE+GG+ + ++ T
Sbjct: 742 NCGDLSQRWYHVPRSWLKPSGNLLVVLEEYGGDLAGVSLAT 782
>gi|297842521|ref|XP_002889142.1| hypothetical protein ARALYDRAFT_476906 [Arabidopsis lyrata subsp.
lyrata]
gi|297334983|gb|EFH65401.1| hypothetical protein ARALYDRAFT_476906 [Arabidopsis lyrata subsp.
lyrata]
Length = 818
Score = 598 bits (1541), Expect = e-168, Method: Compositional matrix adjust.
Identities = 340/855 (39%), Positives = 460/855 (53%), Gaps = 116/855 (13%)
Query: 12 LLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGG 71
L +L + A V++DGR++ IDG+ KIL SGSIHY RSTP MWP LI KAK GG
Sbjct: 8 LAFFVLMAVIVARDAANVTYDGRSLIIDGQHKILFSGSIHYTRSTPQMWPSLIAKAKSGG 67
Query: 72 LDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFP 131
+D I+TYVFWN HEP + Q+DF+G D+++FIK ++ GLYV LRIGP++ EW+YGG P
Sbjct: 68 IDVIDTYVFWNIHEPQQGQFDFSGRRDIVKFIKEVKAHGLYVCLRIGPFIQGEWSYGGLP 127
Query: 132 VWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVM 191
WLHN+ GI RT N+ F M+ + +IV + K E L+ASQGGPIIL+QIENEYG V
Sbjct: 128 FWLHNVQGI-VFRTDNEPFKYHMKRYAQMIVKLMKSENLYASQGGPIILSQIENEYGMVA 186
Query: 192 SDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNS 238
+ GKSY+ W AK+A LD GVPW+MC++ DAP P+ PN+PN
Sbjct: 187 RAFRQDGKSYVKWAAKLAVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNK 246
Query: 239 PKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGP 298
P IWTENWT +++++G + R+AED+AF VA F G+F NYYMYHGGTNFGR +
Sbjct: 247 PAIWTENWTSFYQTYGEEPLIRSAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNASQF 306
Query: 299 YLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN-------- 350
+T+ YD AP+DEYG L QPKWGHL+ELH +K E+ L G T G
Sbjct: 307 VITSYYD-QAPLDEYGLLRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFG 365
Query: 351 --------------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR 390
SSY L S+S+LPDCK FNTAKVN Q N + ++
Sbjct: 366 KKANLCAALLVNQDKCDCTVQFRNSSYRLSPKSISVLPDCKNVAFNTAKVNAQYNTRTRK 425
Query: 391 PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDD 450
P Q N +P W+ E + F +L L +T D SDYLW T + +
Sbjct: 426 PRQ--NLSSPHMWEKFTETVPSFSETSIRSESL--LEHMNTTQDTSDYLWQTTRFEQSEG 481
Query: 451 DPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLL 510
P + L++N G VLHA+VN ++ S + A + L E+ + L G N ++LL
Sbjct: 482 APSV-------LKVNHLGHVLHAFVNERFIGSMHGTFKAHSFLLEKNMSLNNGTNNMALL 534
Query: 511 SATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYN 570
S VGL N G+ + + G + G + +++ W Y+VGL G + Y
Sbjct: 535 SVMVGLPNSGAHLE---RRVVGSRSVNIWNGSYQLF--FNNYSWGYQVGLKG-EKYHVYT 588
Query: 571 AKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
A + W ++ +TWYK +F+ P DPV LNL MGKG AWVNG ++GRYW
Sbjct: 589 EDGAKKVQ-WKQYRDSKSQPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIGRYW 647
Query: 631 PTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLF-EEFG 689
++ + GNPSQIWYH+PRS++K N LV+ EE
Sbjct: 648 VSFYTSK-----------------------GNPSQIWYHIPRSFLKPNSNLLVILEEERE 684
Query: 690 GNPSQINFQTVVVGTACG----------------------QAH------ENKTMELTC-H 720
G P I TV V CG Q H ++L C
Sbjct: 685 GYPLGITIDTVSVTEVCGHVSNTHPHPVISPRKKGHNRNEQRHLKYRYDRKPKVQLQCPT 744
Query: 721 GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATS 780
GR+IS++ +A+FG+P G+CG++ GSC + + L +++K C+ K CS+ G
Sbjct: 745 GRKISKVLFATFGNPNGSCGSYSVGSCHSP-NSLAVVQKACLRKSRCSVPVWSKTFGGDL 803
Query: 781 CAAGTVKRLVVEALC 795
C TVK L+V A C
Sbjct: 804 CPQ-TVKSLLVRAQC 817
>gi|302141788|emb|CBI18991.3| unnamed protein product [Vitis vinifera]
Length = 821
Score = 597 bits (1539), Expect = e-168, Method: Compositional matrix adjust.
Identities = 357/837 (42%), Positives = 462/837 (55%), Gaps = 118/837 (14%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DGR++ I+G+R++L SGSIHYPRSTP MWP LI KAKEGG+D IETY FWN HEP +
Sbjct: 32 VTYDGRSLIINGQRRLLFSGSIHYPRSTPEMWPSLISKAKEGGIDVIETYAFWNQHEPKQ 91
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QYDF+G LD+++F K +Q QGLY LRIGP++ +EWNYGG P WLH++PGI R+ N+
Sbjct: 92 GQYDFSGRLDIVKFFKEVQAQGLYACLRIGPFIESEWNYGGLPFWLHDVPGII-YRSDNE 150
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQNFTT IV++ K E L+ASQGGPIIL+QIENEY NV + + + G Y+ W AKM
Sbjct: 151 PFKFYMQNFTTKIVNLMKSENLYASQGGPIILSQIENEYKNVEAAFHEKGPPYVRWAAKM 210
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGG 255
A L GVPW+MC++ DAP P+ PN PN P IWTENWT ++ +G
Sbjct: 211 AVDLQTGVPWVMCKQDDAPDPVINACNGMKCGETFAGPNKPNKPAIWTENWTSVYEVYGE 270
Query: 256 KDPKRTAEDLAFAVARFF-QFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
R AEDLAF VA F + G+F NYYMYHGGTNFGRTS LT YD AP+DEYG
Sbjct: 271 DKRGRAAEDLAFQVALFIAKKNGSFINYYMYHGGTNFGRTSSSYVLTAYYD-QAPLDEYG 329
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGN-------------------------VTNTDYG 349
+ QPKWGHL+ELH ++K TL +G + N D
Sbjct: 330 LIRQPKWGHLKELHAVIKLCSDTLLHGVQYNYSLGQLQEAYLFKRPSGQCAAFLVNNDKR 389
Query: 350 NSVS----GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK-VKRPNQAGNDQAPLQWK 404
+V+ ++Y L A S+SILPDCK FNTAKV+TQ N + V+ G+ + QW
Sbjct: 390 RNVTVLFQNTNYELAANSISILPDCKKIAFNTAKVSTQFNTRSVQTRATFGSTK---QWS 446
Query: 405 WRPEMINDFVVRGKGHFALNTLIDQK-STNDVSDYLWYMTNADLKDDDPILSGSSNM--T 461
E I F G + L++ +T D SDYLWY + SSN
Sbjct: 447 EYREGIPSF---GGTPLKASMLLEHMGTTKDASDYLWYTLR--------FIQNSSNAQPV 495
Query: 462 LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGS 521
LR++S VLHA+VNG Y+ S + + V L G N+ISLLS VGL + G
Sbjct: 496 LRVDSLAHVLHAFVNGKYIASAHGSHQNGSFSLVNKVPLNSGLNRISLLSVMVGLPDAGP 555
Query: 522 KFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWS 581
+ GI + + GD KD S H W Y+VGL G + + Y + + + W
Sbjct: 556 YLEHKVAGIR--RVEIQDGGDS---KDFSKHPWGYQVGLMG-EKSQIYTSPGSQKVQ-WH 608
Query: 582 SKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCS 641
+TWYKT F+AP NDPVVL MGKG AWVNG ++GRYW +YL
Sbjct: 609 GLGSHGRGPLTWYKTLFDAPPGNDPVVLFFGSMGKGEAWVNGQSIGRYWVSYLTPS---- 664
Query: 642 TESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVV 701
G PSQ WY+VPR+++ N LV+ EE G+P +I+ TV
Sbjct: 665 -------------------GEPSQTWYNVPRAFLNPKGNLLVVQEEESGDPLKISIGTVS 705
Query: 702 VGTACG--------------------QAHENKT--MELTC-HGRRISEIKYASFGDPQGA 738
V CG ++H K ++L C IS+I +ASFG P G
Sbjct: 706 VTNVCGHVTDSHPPPIISWTTSDDGNESHHGKIPKVQLRCPPSSNISKITFASFGTPVGG 765
Query: 739 CGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
C ++ GSC + + L + EK C+GK CSI S + G C GT K L+V A C
Sbjct: 766 CESYAIGSCHSP-NSLAVAEKACLGKNMCSIPHSLKSFGDDPC-PGTPKALLVAAQC 820
>gi|225459613|ref|XP_002284529.1| PREDICTED: beta-galactosidase 16-like [Vitis vinifera]
Length = 813
Score = 597 bits (1538), Expect = e-167, Method: Compositional matrix adjust.
Identities = 357/837 (42%), Positives = 462/837 (55%), Gaps = 118/837 (14%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DGR++ I+G+R++L SGSIHYPRSTP MWP LI KAKEGG+D IETY FWN HEP +
Sbjct: 24 VTYDGRSLIINGQRRLLFSGSIHYPRSTPEMWPSLISKAKEGGIDVIETYAFWNQHEPKQ 83
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QYDF+G LD+++F K +Q QGLY LRIGP++ +EWNYGG P WLH++PGI R+ N+
Sbjct: 84 GQYDFSGRLDIVKFFKEVQAQGLYACLRIGPFIESEWNYGGLPFWLHDVPGII-YRSDNE 142
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQNFTT IV++ K E L+ASQGGPIIL+QIENEY NV + + + G Y+ W AKM
Sbjct: 143 PFKFYMQNFTTKIVNLMKSENLYASQGGPIILSQIENEYKNVEAAFHEKGPPYVRWAAKM 202
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGG 255
A L GVPW+MC++ DAP P+ PN PN P IWTENWT ++ +G
Sbjct: 203 AVDLQTGVPWVMCKQDDAPDPVINACNGMKCGETFAGPNKPNKPAIWTENWTSVYEVYGE 262
Query: 256 KDPKRTAEDLAFAVARFF-QFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
R AEDLAF VA F + G+F NYYMYHGGTNFGRTS LT YD AP+DEYG
Sbjct: 263 DKRGRAAEDLAFQVALFIAKKNGSFINYYMYHGGTNFGRTSSSYVLTAYYD-QAPLDEYG 321
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGN-------------------------VTNTDYG 349
+ QPKWGHL+ELH ++K TL +G + N D
Sbjct: 322 LIRQPKWGHLKELHAVIKLCSDTLLHGVQYNYSLGQLQEAYLFKRPSGQCAAFLVNNDKR 381
Query: 350 NSVS----GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK-VKRPNQAGNDQAPLQWK 404
+V+ ++Y L A S+SILPDCK FNTAKV+TQ N + V+ G+ + QW
Sbjct: 382 RNVTVLFQNTNYELAANSISILPDCKKIAFNTAKVSTQFNTRSVQTRATFGSTK---QWS 438
Query: 405 WRPEMINDFVVRGKGHFALNTLIDQK-STNDVSDYLWYMTNADLKDDDPILSGSSNM--T 461
E I F G + L++ +T D SDYLWY + SSN
Sbjct: 439 EYREGIPSF---GGTPLKASMLLEHMGTTKDASDYLWYTLR--------FIQNSSNAQPV 487
Query: 462 LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGS 521
LR++S VLHA+VNG Y+ S + + V L G N+ISLLS VGL + G
Sbjct: 488 LRVDSLAHVLHAFVNGKYIASAHGSHQNGSFSLVNKVPLNSGLNRISLLSVMVGLPDAGP 547
Query: 522 KFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWS 581
+ GI + + GD KD S H W Y+VGL G + + Y + + + W
Sbjct: 548 YLEHKVAGIR--RVEIQDGGDS---KDFSKHPWGYQVGLMG-EKSQIYTSPGSQKVQ-WH 600
Query: 582 SKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCS 641
+TWYKT F+AP NDPVVL MGKG AWVNG ++GRYW +YL
Sbjct: 601 GLGSHGRGPLTWYKTLFDAPPGNDPVVLFFGSMGKGEAWVNGQSIGRYWVSYLTPS---- 656
Query: 642 TESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVV 701
G PSQ WY+VPR+++ N LV+ EE G+P +I+ TV
Sbjct: 657 -------------------GEPSQTWYNVPRAFLNPKGNLLVVQEEESGDPLKISIGTVS 697
Query: 702 VGTACG--------------------QAHENKT--MELTC-HGRRISEIKYASFGDPQGA 738
V CG ++H K ++L C IS+I +ASFG P G
Sbjct: 698 VTNVCGHVTDSHPPPIISWTTSDDGNESHHGKIPKVQLRCPPSSNISKITFASFGTPVGG 757
Query: 739 CGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
C ++ GSC + + L + EK C+GK CSI S + G C GT K L+V A C
Sbjct: 758 CESYAIGSCHSP-NSLAVAEKACLGKNMCSIPHSLKSFGDDPC-PGTPKALLVAAQC 812
>gi|449445172|ref|XP_004140347.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 493
Score = 596 bits (1536), Expect = e-167, Method: Compositional matrix adjust.
Identities = 300/499 (60%), Positives = 355/499 (71%), Gaps = 58/499 (11%)
Query: 10 AILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
A L CL L VS+D AI I+GER+I+ SGSIHYPRST MWPDLI+KAK+
Sbjct: 9 ATLACLTF------CLGDNVSYDSNAIIINGERRIIFSGSIHYPRSTEAMWPDLIQKAKD 62
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
GGLDAIETY+FW+ HEP RR+YDF+G LD I+F + IQD GLYV++RIGPYVCAEWNYGG
Sbjct: 63 GGLDAIETYIFWDRHEPQRRKYDFSGRLDFIKFFQLIQDAGLYVVMRIGPYVCAEWNYGG 122
Query: 130 FPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGN 189
FPVWLHNMPGI+ LRT N+V+ NEMQ FTT IV+M K+ LFASQGGPIILAQIENEYGN
Sbjct: 123 FPVWLHNMPGIQ-LRTNNQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGN 181
Query: 190 VMSD-YGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPN 237
VM+ YGDAGK+YINWCA+MA SL+IGVPWIMCQ+SDAP P+ FTPNNP
Sbjct: 182 VMTPAYGDAGKAYINWCAQMAESLNIGVPWIMCQQSDAPQPIINTCNGFYCDNFTPNNPK 241
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
SPK++TENW GWFK WG KDP RTAED+AF+VARFFQ GG F NYYMYHGGTNFGRTSGG
Sbjct: 242 SPKMFTENWVGWFKKWGDKDPYRTAEDVAFSVARFFQSGGVFNNYYMYHGGTNFGRTSGG 301
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS---- 353
P++TTSYDY+AP+DEYG+LNQPKWGHL++LH +K EK LT G TN ++G+SV+
Sbjct: 302 PFITTSYDYNAPLDEYGNLNQPKWGHLKQLHASIKLGEKILTNGTHTNQNFGSSVTLTKF 361
Query: 354 ---------------------------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV 386
Y +PAWSVSIL C E +NTAKVN+QT++
Sbjct: 362 FNPTTGERFCFLSNTDGKNDATIDLQADGKYFVPAWSVSILDGCNKEVYNTAKVNSQTSM 421
Query: 387 KVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNA 445
VK N+ N Q L W W PE + D ++G G FA N ++QK T D SDY WYMTN
Sbjct: 422 FVKEQNEKENAQ--LSWAWAPEPMKD-TLQGNGKFAANLFLEQKRVTADFSDYFWYMTNV 478
Query: 446 DLKDDDPILSGSSNMTLRI 464
D S N+TL++
Sbjct: 479 DTSG----TSSLQNVTLQV 493
>gi|2209358|gb|AAB61470.1| beta-D-galactosidase [Mangifera indica]
Length = 663
Score = 595 bits (1534), Expect = e-167, Method: Compositional matrix adjust.
Identities = 312/663 (47%), Positives = 406/663 (61%), Gaps = 55/663 (8%)
Query: 7 CSRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKK 66
+ +L L+L + + + VS+D +AI IDG+R+IL+SGSIHYPRSTP MWPDLI+K
Sbjct: 12 ITNMFMLLLMLFSSWVCFVEATVSYDHKAIIIDGQRRILISGSIHYPRSTPQMWPDLIQK 71
Query: 67 AKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWN 126
AK+G +D I+TYVFWN HEP +Y F DL+RFIK +Q GLYV LRIGPYVCAEWN
Sbjct: 72 AKDG-VDVIQTYVFWNGHEPSPGKYYFEDRYDLVRFIKLVQQAGLYVHLRIGPYVCAEWN 130
Query: 127 YGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENE 186
+GGFPVWL +PGIE RT N+ F MQ FT IV M K EKLF +QGGPIIL+QIENE
Sbjct: 131 FGGFPVWLKYVPGIE-FRTDNEPFKAAMQKFTEKIVSMMKAEKLFETQGGPIILSQIENE 189
Query: 187 YGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNN 235
+G V + G GK+Y W A+MA LD GVPW+MC++ DAP P+ F PN
Sbjct: 190 FGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWVMCKQDDAPDPVINTCNGFYCENFVPNQ 249
Query: 236 PNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTS 295
N PK+WTENWTGWF ++GG P+R AED+AF+VARF Q GG+F NYYMYHGGTNFGRT+
Sbjct: 250 KNKPKMWTENWTGWFTAFGGPTPQRPAEDVAFSVARFIQNGGSFVNYYMYHGGTNFGRTA 309
Query: 296 GGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSV--- 352
GGP++ TSYDYDAP+DEYG L +PKWGHLR+LHK +K E L + T T GN+
Sbjct: 310 GGPFIATSYDYDAPLDEYGLLREPKWGHLRDLHKAIKLCESALVSTDPTVTSLGNNQEVH 369
Query: 353 -----SGS---------------------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNV 386
SGS Y LP WS+SILPDCKT FNTA++ Q+++
Sbjct: 370 VFNPKSGSCAAFLANYDTTSSAKVNFKIMQYELPPWSISILPDCKTAVFNTARLGAQSSL 429
Query: 387 KVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNA 445
K P + W+ + F + L +Q T D SDYLWYMTN
Sbjct: 430 KQMTPVST--------FSWQSYIEESASSSDDKTFTTDGLWEQLNVTRDASDYLWYMTNI 481
Query: 446 DLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKN 505
++ ++ L + L I S+G LH ++NG + + F + VK+ G N
Sbjct: 482 NIDSNEGFLKNGQDPLLTIWSAGHALHVFINGQLSGTVYGGVDNPKLTFSQNVKMRVGVN 541
Query: 506 QISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDD 565
Q+SLLS +VGLQN G+ F+ G+ GPV L G +DLS +W+YK+GL G +D
Sbjct: 542 QLSLLSISVGLQNVGTHFEQWNTGVLGPVTLRGLNEG---TRDLSKQQWSYKIGLKG-ED 597
Query: 566 KKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYN 625
+ ++S ++ + +TWYKTTF AP N+P+ L++ MGKG W+N +
Sbjct: 598 LSLHTVSGSSSVEWVEGSSLAQKQPLTWYKTTFNAPAGNEPLALDMSTMGKGLIWINSQS 657
Query: 626 LGR 628
+GR
Sbjct: 658 IGR 660
>gi|108707234|gb|ABF95029.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|108707235|gb|ABF95030.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 702
Score = 594 bits (1532), Expect = e-167, Method: Compositional matrix adjust.
Identities = 332/716 (46%), Positives = 420/716 (58%), Gaps = 89/716 (12%)
Query: 154 MQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLD 213
MQ FT +VD K L+ASQGGPIIL+QIENEYGN+ S YG AGK+Y+ W A MA SLD
Sbjct: 1 MQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWAAGMAVSLD 60
Query: 214 IGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTA 262
GVPW+MCQ+SDAP P+ FTPN+ + PK+WTENW+GWF S+GG P R A
Sbjct: 61 TGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLSFGGAVPYRPA 120
Query: 263 EDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWG 322
EDLAFAVARF+Q GGTFQNYYMYHGGTNFGR++GGP++ TSYDYDAPIDEYG + QPKWG
Sbjct: 121 EDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEYGMVRQPKWG 180
Query: 323 HLRELHKLLKSMEKTLTYGNVTNTDYG------------NSV------------------ 352
HLR++HK +K E L + + G NS+
Sbjct: 181 HLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSICAAFLANVDAQSDKTVKF 240
Query: 353 SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRP----NQAGNDQ------APLQ 402
+G++Y LPAWSVSILPDCK NTA++N+Q R Q +D A
Sbjct: 241 NGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDTDDSLITPELATAG 300
Query: 403 WKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMT 461
W + E + + + L++Q +T D SD+LWY T+ +K D+P L+GS +
Sbjct: 301 WSYAIEPVG---ITKENALTKPGLMEQINTTADASDFLWYSTSIVVKGDEPYLNGSQS-N 356
Query: 462 LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGS 521
L +NS G VL Y+NG S +S + PV L GKN+I LLS TVGL NYG+
Sbjct: 357 LLVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLSTTVGLSNYGA 416
Query: 522 KFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWS 581
FD+V G+ GPV L G G +LSS WTY++GL G +D YN A+ E W
Sbjct: 417 FFDLVGAGVTGPVKLSGPNG----ALNLSSTDWTYQIGLRG-EDLHLYNPSEASPE--WV 469
Query: 582 SKNV-PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGC 640
S N P N+ + WYKT F AP +DPV ++ GMGKG AWVNG ++GRYWPT LA + GC
Sbjct: 470 SDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSGC 529
Query: 641 STESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTV 700
SC+YRG Y S+KC CG PSQ YHVPRS+++ G N LVLFE+FGG+PS I+F T
Sbjct: 530 -VNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMISFTTR 588
Query: 701 VVGTACGQAHE-------------------NKTMELTC--HGRRISEIKYASFGDPQGAC 739
+ C E + L C G+ IS IK+ASFG P G C
Sbjct: 589 QTSSICAHVSEMHPAQIDSWISPQQTSQTQGPALRLECPREGQVISNIKFASFGTPSGTC 648
Query: 740 GAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G + G C + L ++++ CVG +CS+ S N G +G K LVVEA C
Sbjct: 649 GNYNHGECSSS-QALAVVQEACVGMTNCSVPVSSNNFGDP--CSGVTKSLVVEAAC 701
>gi|26451843|dbj|BAC43014.1| unknown protein [Arabidopsis thaliana]
gi|29029060|gb|AAO64909.1| At1g77410 [Arabidopsis thaliana]
Length = 820
Score = 593 bits (1530), Expect = e-167, Method: Compositional matrix adjust.
Identities = 335/835 (40%), Positives = 453/835 (54%), Gaps = 112/835 (13%)
Query: 12 LLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGG 71
L+ L+L + V++DGR++ IDGE KIL SGSIHY RSTP MWP LI KAK GG
Sbjct: 8 LVFLVLMAVIVAGDVANVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGG 67
Query: 72 LDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFP 131
+D ++TYVFWN HEP + Q+DF+G+ D+++FIK +++ GLYV LRIGP++ EW+YGG P
Sbjct: 68 IDVVDTYVFWNVHEPQQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLP 127
Query: 132 VWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVM 191
WLHN+ GI RT N+ F M+ + +IV + K E L+ASQGGPIIL+QIENEYG V
Sbjct: 128 FWLHNVQGI-VFRTDNEPFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVG 186
Query: 192 SDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNS 238
+ GKSY+ W AK+A LD GVPW+MC++ DAP P+ PN+PN
Sbjct: 187 RAFRQEGKSYVKWTAKLAVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNK 246
Query: 239 PKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGP 298
P IWTENWT +++++G + R+AED+AF VA F G+F NYYMYHGGTNFGR +
Sbjct: 247 PAIWTENWTSFYQTYGEEPLIRSAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNASQF 306
Query: 299 YLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN-------- 350
+T+ YD AP+DEYG L QPKWGHL+ELH +K E+ L G T G
Sbjct: 307 VITSYYD-QAPLDEYGLLRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFG 365
Query: 351 --------------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR 390
SSY L SVS+LPDCK FNTAKVN Q N + ++
Sbjct: 366 KKANLCAAILVNQDKCESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYNTRTRK 425
Query: 391 PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDD 450
Q N +P W+ E + F +L L +T D SDYLW T +
Sbjct: 426 ARQ--NLSSPQMWEEFTETVPSFSETSIRSESL--LEHMNTTQDTSDYLWQTTRFQQSEG 481
Query: 451 DPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLL 510
P + L++N G LHA+VNG ++ S + A L E+ + L G N ++LL
Sbjct: 482 APSV-------LKVNHLGHALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLALL 534
Query: 511 SATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYN 570
S VGL N G+ + G + GR +++ W Y+VGL G +K
Sbjct: 535 SVMVGLPNSGAHLERRVVGSRSVKIWNGRYQ-----LYFNNYSWGYQVGLKG--EKFHVY 587
Query: 571 AKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
+ +++ W ++ +TWYK +F+ P DPV LNL MGKG AWVNG ++GRYW
Sbjct: 588 TEDGSAKVQWKQYRDSKSQPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIGRYW 647
Query: 631 PTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLF-EEFG 689
++ + GNPSQIWYH+PRS++K N LV+ EE
Sbjct: 648 VSFHTYK-----------------------GNPSQIWYHIPRSFLKPNSNLLVILEEERE 684
Query: 690 GNPSQINFQTVVVGTACGQA-----------------HENKT--------MELTC-HGRR 723
GNP I TV V CG +N T ++L C GR+
Sbjct: 685 GNPLGITIDTVSVTEVCGHVSNTNPHPVISPRKKGLNRKNLTYRYDRKPKVQLQCPTGRK 744
Query: 724 ISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGA 778
IS+I +ASFG P G+CG++ GSC + + L +++K C+ K CS+ G
Sbjct: 745 ISKILFASFGTPNGSCGSYSIGSCHSP-NSLAVVQKACLKKSRCSVPVWSKTFGV 798
>gi|302141787|emb|CBI18990.3| unnamed protein product [Vitis vinifera]
Length = 817
Score = 593 bits (1528), Expect = e-166, Method: Compositional matrix adjust.
Identities = 346/837 (41%), Positives = 458/837 (54%), Gaps = 118/837 (14%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DGR++ I+G+RKIL SGSIHYPRSTP MWP LI +AK+GG+D IETYVFWN HEP
Sbjct: 28 VTYDGRSLIINGQRKILFSGSIHYPRSTPEMWPSLISQAKQGGIDVIETYVFWNQHEPKP 87
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QYDF+G D++RFI+ +Q QGLY LRIGP++ AEWNYGGFP WLH++PGI RT N+
Sbjct: 88 GQYDFSGRRDIVRFIREVQAQGLYACLRIGPFIQAEWNYGGFPFWLHDVPGIV-YRTDNE 146
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F M+NFTT IV++ K E L+ASQGGPIIL QIENEY V +++G+AGK Y+ W A M
Sbjct: 147 PFKFYMRNFTTKIVEIMKSENLYASQGGPIILQQIENEYKTVEANFGEAGKRYVLWAANM 206
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGG 255
A L+ GVPW+MC++ DAP P+ PN+PN P IWTENWT + +G
Sbjct: 207 AVGLETGVPWVMCKQDDAPDPVINSCNGRLCGETFAGPNSPNKPAIWTENWTSSYPLFGE 266
Query: 256 KDPKRTAEDLAFAVARFF-QFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
R ED+AF VA F + G+F NYYMYHGGTNFGRT+ Y+ T+Y +AP+DEYG
Sbjct: 267 DARPRPVEDIAFHVALFVAKMNGSFINYYMYHGGTNFGRTASA-YVQTAYYDEAPLDEYG 325
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS--------------------- 353
+ QP WGHL+ELH +K +TL G +N G +
Sbjct: 326 LIQQPTWGHLKELHAAVKLCSETLLQGAQSNLSLGTKLQEAYVFRGQSGKCAAFLVNNDS 385
Query: 354 ---------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWK 404
+SY LP S+SILPDCK E FNTAK + + + + N QW+
Sbjct: 386 RTDVTVVFQNTSYELPRKSISILPDCKNEAFNTAKASFRPGLISIQTVTKFNSTE--QWE 443
Query: 405 WRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLR 463
E I +F NTL++ +T D SDYLWY ++DP + L
Sbjct: 444 EYKESILNF---DDTSSRANTLLEHMNTTKDASDYLWYTFR---YNNDP---SNGQSVLS 494
Query: 464 INSSGQVLHAYVNGNYVDSQWTKYGASNDL---FERPVKLTRGKNQISLLSATVGLQNYG 520
NS LHA++NG + SQ +G+S++L + V G N +SLLS VGL + G
Sbjct: 495 TNSRAHALHAFINGRHTGSQ---HGSSSNLSFSLDNTVSFRAGINNVSLLSVMVGLPDSG 551
Query: 521 SKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGW 580
+ + G L R +KD +++ W Y+VGL G + + + + + W
Sbjct: 552 AYLERRVAG-----LRRVRIQSNGSLKDFTNNPWGYQVGLLGEKLQIYTDVGSQKVQ--W 604
Query: 581 SSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGC 640
S + +TWYKT F+AP N+PV LNL M KG WVNG ++GRYW ++L
Sbjct: 605 SKFGSSTSGLLTWYKTVFDAPAGNEPVALNLVSMRKGEVWVNGQSIGRYWVSFLTPS--- 661
Query: 641 STESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTV 700
G PSQIWYH+PRS++K N LVL EE G+P I+ V
Sbjct: 662 --------------------GKPSQIWYHIPRSFLKPTGNLLVLLEEETGHPVGISIGKV 701
Query: 701 VVGTACGQA----------------HENK-----TMELTC-HGRRISEIKYASFGDPQGA 738
+ CG HEN ++L C R IS I +ASFG P G
Sbjct: 702 SIPKICGHVSESHLPPVISRVIYKKHENHHGRRPKVQLRCPSNRNISRILFASFGTPSGD 761
Query: 739 CGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
C ++ GSC + + +EK C+GK CS+ S G C GT K L+V+ C
Sbjct: 762 CQSYAVGSCHSS-NSRSNVEKACLGKGMCSVPLSYKRFGGDPC-PGTPKALLVDVQC 816
>gi|255561536|ref|XP_002521778.1| beta-galactosidase, putative [Ricinus communis]
gi|223538991|gb|EEF40588.1| beta-galactosidase, putative [Ricinus communis]
Length = 828
Score = 587 bits (1512), Expect = e-164, Method: Compositional matrix adjust.
Identities = 349/842 (41%), Positives = 465/842 (55%), Gaps = 113/842 (13%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DGR++ +DG+RK+L SGSIHYPRSTP MW LI KAKEGGLD I+TYVFWN HEP
Sbjct: 24 VTYDGRSLIVDGQRKLLFSGSIHYPRSTPEMWQSLIAKAKEGGLDVIDTYVFWNLHEPQP 83
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QYDF+G D++RFIK +Q QGLYV LRIGP++ EW+YGG P WLH++PGI R+ N+
Sbjct: 84 GQYDFSGRRDIVRFIKEVQAQGLYVCLRIGPFIQGEWSYGGLPFWLHDIPGI-VFRSDNE 142
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F +MQ FTT IV M + EKL+ SQGGPIIL+QIENEYG V Y + G +Y+ W A+M
Sbjct: 143 PFKVQMQGFTTKIVTMMQSEKLYVSQGGPIILSQIENEYGTVEEAYHEKGPAYVKWAAQM 202
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGG 255
A L+ GVPW+MC+++DAP P+ PN+PN P IWTENWT + G
Sbjct: 203 AVGLNTGVPWVMCKQNDAPDPVINACNGLRCAETFVGPNSPNKPAIWTENWTTRYVITGE 262
Query: 256 KDPKRTAEDLAFAVARFF-QFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
R+ ED+AF V +F G+F NYYMYHGGTNFGRT+ ++ TSY APIDEYG
Sbjct: 263 NIRIRSVEDIAFQVTQFIVAKKGSFVNYYMYHGGTNFGRTASA-FVPTSYYDQAPIDEYG 321
Query: 315 HLNQPKWGHLRELHKLLK-SMEKTLTYGNVT------------------------NTDYG 349
+ QPKWGHL+E+H +K + L+ G VT N D
Sbjct: 322 LIRQPKWGHLKEMHAAIKLCLTPLLSGGQVTISLGQQQQAFVFTGLSGECAAFLLNNDTA 381
Query: 350 NSVS----GSSYNLPAWSVSILPDCKTEEFNTAKVNTQ--TNVKVKRPNQAGNDQAPLQW 403
N+ S +SY+LP S+SILPDCKT FNTAKV+TQ T + G D +W
Sbjct: 382 NTASVQFRNASYDLPPNSISILPDCKTVAFNTAKVSTQYTTRSMTRSKLLDGED----KW 437
Query: 404 KWRPEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTL 462
E I +F + +++Q ST D SDYLWY + D + L
Sbjct: 438 VQYQEAIVNF---DETSVKSEAILEQMSTTKDASDYLWYTFRFQQESSD------TQAVL 488
Query: 463 RINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSK 522
+ S G VLHA+VNG V + + V L+ G N +SLLS VG+ + G+
Sbjct: 489 NVRSLGHVLHAFVNGQAVGYAQGSHKNPQFTLQSTVSLSEGVNNVSLLSVMVGMPDSGAY 548
Query: 523 FDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSS 582
+ G+ V + + G+ K+ +++ W Y+VGL G + F + ++ + S
Sbjct: 549 MERRAAGLR-KVKIQEKEGN----KEFTNYSWGYQVGLLGEKLQIFTDQGSSQVQWANFS 603
Query: 583 KNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCST 642
KN LN +TWYKT F+APLE+ PV LNL MGKG AWVNG ++GRYWP+Y A +
Sbjct: 604 KNA-LN-PLTWYKTLFDAPLEDAPVALNLGSMGKGEAWVNGQSIGRYWPSYRASD----- 656
Query: 643 ESCDYRGPYGSDKCAYNCGNPSQIW----YHVPRSWIKDGVNTLVLFEEFGGNPSQINFQ 698
GS + Y N I+ Y+VPRS++K N LV+ EE GGNP QI+
Sbjct: 657 ---------GSSQIWYAYFNTGAIFRAVRYNVPRSFLKPKGNLLVVLEESGGNPLQISVD 707
Query: 699 TVVVGTACGQA-----------------------HENKTMELTC-HGRRISEIKYASFGD 734
T + C ++L C +IS I +AS+G
Sbjct: 708 TASISKICSHVTASHLPLVSSWSKRTNTDNNNSLQARPRVKLDCPSNTKISNILFASYGT 767
Query: 735 PQGACG-AFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEA 793
P+G CG A+ G C + +++K C+G+ CSI S G C+A K L+V A
Sbjct: 768 PEGTCGDAYAVGMCHSS-SSEAIVQKACLGQMRCSIPVSSKYFGGDPCSANE-KSLLVVA 825
Query: 794 LC 795
C
Sbjct: 826 EC 827
>gi|449464182|ref|XP_004149808.1| PREDICTED: beta-galactosidase 16-like [Cucumis sativus]
Length = 801
Score = 585 bits (1507), Expect = e-164, Method: Compositional matrix adjust.
Identities = 342/836 (40%), Positives = 459/836 (54%), Gaps = 120/836 (14%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
++DGR++ ++GE K+L SGSIHYPRSTP MWP LI KAKEGG+D I+TYVFWN HEP +
Sbjct: 16 ATYDGRSLIVNGEHKLLFSGSIHYPRSTPDMWPSLIAKAKEGGIDVIQTYVFWNLHEPQQ 75
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y+F+G D++RF+K IQ QGLY LRIGP++ AEW+YGG P WLH++ GI R+ N+
Sbjct: 76 GTYEFSGRRDIVRFVKEIQAQGLYACLRIGPFIEAEWSYGGLPFWLHDVLGI-VYRSDNE 134
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQNFTT IV+M K E L+ASQGGPIIL+QIENEY V + +G+ G Y+ W AKM
Sbjct: 135 PFKLHMQNFTTKIVNMMKSEGLYASQGGPIILSQIENEYTLVEAAFGEKGPPYVQWAAKM 194
Query: 209 ATSLDIGVPWIMCQESDAPSPM------------FT-PNNPNSPKIWTENWTGWFKSWGG 255
A SL GVPW MC+++DAP P+ FT PN+PN P IWTENWT +++++G
Sbjct: 195 AVSLQTGVPWSMCKQNDAPDPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGE 254
Query: 256 KDPKRTAEDLAFAVARFFQF-GGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
+ R+AE++AF VA F GT+ NYYMYHGGTNFGR++ +T YD +P+DEYG
Sbjct: 255 EPYIRSAEEIAFHVALFIAAKNGTYVNYYMYHGGTNFGRSASAFMITGYYD-QSPLDEYG 313
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSG-------------------- 354
+PKWGHL+ELH +K L G +N G SV
Sbjct: 314 LTREPKWGHLKELHAAVKLCSTPLLTGTKSNFSLGQSVEAIVFKTESNECAAFLVNRGAI 373
Query: 355 --------SSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
+Y LP S+SILPDCK FNT +V+ Q N R A L+W+
Sbjct: 374 DSNVLFQNVTYELPLGSISILPDCKNVAFNTRRVSVQHNT---RSMMAVQKFDLLEWEEF 430
Query: 407 PEMINDFVVRGKGHFALNTLIDQK-STNDVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
E I + N L++ +T D SDYLWY ++ D P S TL ++
Sbjct: 431 KEPIPNI---DDTELRANELLEHMGTTKDRSDYLWYTFR--VQQDSP----DSQQTLEVD 481
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
S LHA+VNG+Y S Y + + L G N ISLLS VGL + G+ +
Sbjct: 482 SRAHALHAFVNGDYAGSAHGIYKEKGFSLAKNITLRNGINNISLLSVMVGLPDSGAFLET 541
Query: 526 VPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNV 585
G+ VG G+ D S W YKVGL G + F + ++N + WS
Sbjct: 542 RVAGL----RRVGIQGE-----DFSEQHWGYKVGLSGEQSQIFLDTGSSNVQ--WSRLGN 590
Query: 586 PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESC 645
++ +TWYKT F+AP +DP+ LNL MGKG WVNG +GRYW ++L +
Sbjct: 591 S-SQPLTWYKTQFDAPPGDDPIALNLGSMGKGAVWVNGRGIGRYWVSFLTPK-------- 641
Query: 646 DYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTA 705
G PSQ WY+VPRS++K N LV+ EE GNP +I+ +V++
Sbjct: 642 ---------------GEPSQKWYNVPRSFLKPTDNQLVILEEETGNPVEISLDSVLITKT 686
Query: 706 CGQAHE---------------------NKT----MELTC-HGRRISEIKYASFGDPQGAC 739
CGQ E N+T ++L+C ++IS I +ASFG P G C
Sbjct: 687 CGQVSESHYPLVASWMGAKKQKVRRVKNRTRRPKVQLSCPSKKKISNILFASFGTPSGDC 746
Query: 740 GAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
++ G C + + ++E C+G+ CSI S N C T K L+V+A C
Sbjct: 747 QSYAIGLCHSP-NSRAIVEHACLGRAKCSIPISNLNFRGDPCPHVT-KTLLVDAQC 800
>gi|326517964|dbj|BAK07234.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 616
Score = 581 bits (1498), Expect = e-163, Method: Compositional matrix adjust.
Identities = 313/628 (49%), Positives = 389/628 (61%), Gaps = 64/628 (10%)
Query: 90 QYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKV 149
QYDF G DL+RF+K D GLYV LRIGPYVCAEWNYGGFP+WLH +PGI+ LRT N+
Sbjct: 1 QYDFEGRNDLVRFVKAAADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIK-LRTDNEP 59
Query: 150 FMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMA 209
F EMQ FT +V K L+ASQGGPIIL+QIENEYGN+ + YG AGKSYI W A MA
Sbjct: 60 FKTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIAASYGAAGKSYIRWAAGMA 119
Query: 210 TSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDP 258
+LD GVPW+MCQ++DAP P+ FTP+ P+ PK+WTENW+GWF S+GG P
Sbjct: 120 VALDTGVPWVMCQQTDAPEPLINTCNGFYCDQFTPSLPSRPKLWTENWSGWFLSFGGAVP 179
Query: 259 KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQ 318
R EDLAFAVARF+Q GGT QNYYMYHGGTNFGR+SGGP+++TSYDYDAPIDEYG + Q
Sbjct: 180 YRPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDEYGLVRQ 239
Query: 319 PKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN---------------------------- 350
PKWGHLR++HK +K E L + + G
Sbjct: 240 PKWGHLRDVHKAIKMCEPALIATDPSYMSLGQNAEAHVYKSGSLCAAFLANIDDQSDKTV 299
Query: 351 SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR----PNQAGN------DQAP 400
+ +G +Y LPAWSVSILPDCK NTA++N+Q R QA + + A
Sbjct: 300 TFNGKAYKLPAWSVSILPDCKNVVLNTAQINSQVASTQMRNLGFSTQASDGSSVEAELAA 359
Query: 401 LQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSN 459
W + E + + + L++Q +T D SD+LWY T+ + +P L+GS +
Sbjct: 360 SSWSYAVEPVG---ITKENALTKPGLMEQINTTADASDFLWYSTSIVVAGGEPYLNGSQS 416
Query: 460 MTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNY 519
L +NS G VL ++NG S +S PV L GKN+I LLSATVGL NY
Sbjct: 417 -NLLVNSLGHVLQVFINGKLAGSSKGSASSSLISLTTPVTLVTGKNKIDLLSATVGLTNY 475
Query: 520 GSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERG 579
G+ FD+V GI GPV L G G DLSS +WTY++GL G +D YN A+ E
Sbjct: 476 GAFFDLVGAGITGPVKLTGPKG----TLDLSSAEWTYQIGLRG-EDLHLYNPSEASPE-- 528
Query: 580 WSSKN-VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEED 638
W S N P N +TWYK+ F AP +DPV ++ GMGKG AWVNG ++GRYWPT +A +
Sbjct: 529 WVSDNSYPTNNPLTWYKSKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNIAPQS 588
Query: 639 GCSTESCDYRGPYGSDKCAYNCGNPSQI 666
GC SC+YRG Y + KC CG PSQI
Sbjct: 589 GC-VNSCNYRGSYSATKCLKKCGQPSQI 615
>gi|225428017|ref|XP_002278545.1| PREDICTED: beta-galactosidase 13 [Vitis vinifera]
gi|297744615|emb|CBI37877.3| unnamed protein product [Vitis vinifera]
Length = 833
Score = 580 bits (1496), Expect = e-163, Method: Compositional matrix adjust.
Identities = 335/862 (38%), Positives = 467/862 (54%), Gaps = 118/862 (13%)
Query: 12 LLCLILQTLFNLSLAY-------RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLI 64
L+ +L L + + A+ V++DGR++ ++G R++L SGSIHYPRSTP MWPD++
Sbjct: 8 LIAAVLSLLVSYAAAHGIAKGAKTVTYDGRSLIVNGRRELLFSGSIHYPRSTPEMWPDIL 67
Query: 65 KKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAE 124
+KAK GGL+ I+TYVFWN HEP+ Q++F GN DL++FIK I D GLY LRIGP++ AE
Sbjct: 68 QKAKHGGLNLIQTYVFWNIHEPVEGQFNFEGNYDLVKFIKLIGDYGLYATLRIGPFIEAE 127
Query: 125 WNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIE 184
WN+GGFP WL +P I R+ N+ F M+ ++ +I++M K+ KLFA QGGPIILAQIE
Sbjct: 128 WNHGGFPYWLREVPDI-IFRSYNEPFKYHMEKYSRMIIEMMKEAKLFAPQGGPIILAQIE 186
Query: 185 NEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM------------FT 232
NEY ++ Y + G Y+ W KMA L GVPWIMC++ DAP P+ FT
Sbjct: 187 NEYNSIQLAYRELGVQYVQWAGKMAVGLGAGVPWIMCKQKDAPDPVINTCNGRHCGDTFT 246
Query: 233 -PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNF 291
PN PN P +WTENWT ++ +G +R AEDLAF+VARF GT NYYMYHGGTNF
Sbjct: 247 GPNRPNKPSLWTENWTAQYRVFGDPPSQRAAEDLAFSVARFISKNGTLANYYMYHGGTNF 306
Query: 292 GRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGN--------- 342
GRT G ++TT Y +AP+DEYG +PKWGHL++LH L+ +K L G+
Sbjct: 307 GRT-GSSFVTTRYYDEAPLDEYGLQREPKWGHLKDLHSALRLCKKALFTGSPGVEKLGKD 365
Query: 343 -----------------VTNTDYGNSVS----GSSYNLPAWSVSILPDCKTEEFNTAKVN 381
+TN + + G Y LP S+SILPDCKT +NT +V
Sbjct: 366 KEVRFYEKPGTHICAAFLTNNHSREAATLTFRGEEYFLPPHSISILPDCKTVVYNTQRVV 425
Query: 382 TQTNVKVKRPNQAGNDQAPLQWKWRPE---MINDFVVRGKGHFALNTLIDQKSTNDVSDY 438
Q N + ++ N L+W+ E ++ D + K L + D SDY
Sbjct: 426 AQHNARNFVKSKIANKN--LKWEMSQEPIPVMTDMKILTKSPMELYNFL-----KDRSDY 478
Query: 439 LWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPV 498
W++T+ +L + D + L+I++ G + A+VNGN++ S N +F +PV
Sbjct: 479 AWFVTSIELSNYDLPMKKDIIPVLQISNLGHAMLAFVNGNFIGSAHGSNVEKNFVFRKPV 538
Query: 499 KLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKV 558
K G N I+LL TVGL N G+ + GI +L G T D++++ W +V
Sbjct: 539 KFKAGTNYIALLCMTVGLPNSGAYMEHRYAGIHSVQIL----GLNTGTLDITNNGWGQQV 594
Query: 559 GLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGF 618
G+ G K + + + + P MTWYKT F+ P NDPV+L + M KG
Sbjct: 595 GVNGEHVKAYTQGGSHRVQWTAAKGKGPA---MTWYKTYFDMPEGNDPVILRMTSMAKGM 651
Query: 619 AWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDG 678
AWVNG N+GRYW +YL+ + PSQ YHVPR+W+K
Sbjct: 652 AWVNGKNIGRYWLSYLSPLE-----------------------KPSQSEYHVPRAWLKPS 688
Query: 679 VNTLVLFEEFGGNPSQINFQTVVVGTACG-------------QAHENKTM---------- 715
N LV+FEE GGNP +I + V T C Q H++K
Sbjct: 689 DNLLVIFEETGGNPEEIEVELVNRDTICSIVTEYHPPHVKSWQRHDSKIRAVVDEVKPKG 748
Query: 716 ELTCHGRR-ISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEA 774
L C + I ++ +ASFG+P GACG F+ G+C A + ++E+ C+GK +C I
Sbjct: 749 HLKCPNYKVIVKVDFASFGNPLGACGDFEMGNCTAP-NSKKVVEQHCMGKTTCEIPMEAG 807
Query: 775 NLGATSCAAGTV-KRLVVEALC 795
S A + K L V+ C
Sbjct: 808 IFDGNSGACSDITKTLAVQVRC 829
>gi|183238712|gb|ACC60982.1| beta-galactosidase 2 precursor [Petunia x hybrida]
Length = 830
Score = 579 bits (1493), Expect = e-162, Method: Compositional matrix adjust.
Identities = 333/847 (39%), Positives = 467/847 (55%), Gaps = 128/847 (15%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DGR++ ++GER++L SGSIHYPR P MWP++I+KAKEGGL+ I+TYVFWN HEP++
Sbjct: 28 VTYDGRSMIVNGERELLFSGSIHYPRMPPEMWPEIIRKAKEGGLNVIQTYVFWNIHEPVQ 87
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Q++F GN DL++FIK I +QGLYV LRIGPY+ AEWN GGFP WL +P I R+ N+
Sbjct: 88 GQFNFEGNYDLVKFIKAIGEQGLYVTLRIGPYIEAEWNQGGFPYWLREVPNI-TFRSYNE 146
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F++ M+ ++ +++D+ KKEKLFA QGGPII+AQIENEY NV Y D GK YI W A M
Sbjct: 147 PFIHHMKKYSEMVIDLVKKEKLFAPQGGPIIMAQIENEYNNVQLAYRDNGKKYIEWAANM 206
Query: 209 ATSLDIGVPWIMCQESDAPSPM------------FT-PNNPNSPKIWTENWTGWFKSWGG 255
ATSL GVPWIMC++ DAP + FT PN PN P +WTENWT ++++G
Sbjct: 207 ATSLYNGVPWIMCKQKDAPPQVINTCNGRHCADTFTGPNGPNKPSLWTENWTAQYRTFGD 266
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
+R AED+AF+VARFF GT NYYMY+GGTN+GRTS ++TT Y +AP+DE+G
Sbjct: 267 PPSQRAAEDIAFSVARFFAKNGTLTNYYMYYGGTNYGRTSSS-FVTTRYYDEAPLDEFGL 325
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYGNVT----NTDYGNSV------------------- 352
+PKW HLR+LH+ L+ + L +G T N D +V
Sbjct: 326 YREPKWSHLRDLHRALRLSRRALLWGTPTVQKINQDLEITVFEKPGSTDCAAFLTNNHTT 385
Query: 353 -------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKW 405
G Y LP SVSILPDCKT +NT + +Q N + ++ + L+W+
Sbjct: 386 QPSTIKFRGKDYYLPEKSVSILPDCKTVVYNTQTIVSQHNSRNFITSEKSKN---LKWEM 442
Query: 406 RPE---MINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTL 462
E I D ++ + L +L T D SDY WY T+ L+ D + L
Sbjct: 443 YQEKVPTIADLPLKNREPLELYSL-----TKDTSDYAWYSTSITLERHDLPMRPDILPVL 497
Query: 463 RINSSGQVLHAYVNGNYVDSQWTKYGASNDL-----FERPVKLTRGKNQISLLSATVGLQ 517
+I S G L A+VNG YV +G N++ F++P+ L G N I++L+ TVG
Sbjct: 498 QIASMGHALAAFVNGEYVG-----FGHGNNIEKSFVFQKPIILKPGTNTITILAETVGFP 552
Query: 518 NYGSKFDMV---PNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAA 574
N G+ + P G+ L+ G D++ + W ++VG++G + F A
Sbjct: 553 NSGAYMEKRFAGPRGVTIQGLMAGTL-------DITQNNWGHEVGVFGEKQELFTEEGAK 605
Query: 575 NSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYL 634
+ W+ P +TWYKT F+AP N+PV L + M KG WVNG +LGRYW ++L
Sbjct: 606 KVQ--WTPVTGPPKGAVTWYKTYFDAPEGNNPVALKMDKMEKGMMWVNGKSLGRYWTSFL 663
Query: 635 AEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQ 694
+ G P+Q YH+PR+++K N LV+FEE GG+P+
Sbjct: 664 SP-----------------------LGQPTQAEYHIPRAYLKPTNNLLVIFEETGGHPTN 700
Query: 695 INFQTVVVGTACGQAHE-----------------------NKTMELTCHGRRISE-IKYA 730
I QTV T C E LTC +I E +++A
Sbjct: 701 IEVQTVNRDTICSIITEYHPPHVKSWERSGTDFVAVVEDLKSGAHLTCPDNKIIEKVEFA 760
Query: 731 SFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCS--IEASEANLGATSCAAGTVKR 788
S+G+P GACG G+C + + L ++E+ C+GK +C+ IE + + K
Sbjct: 761 SYGNPDGACGNLFNGNCNSA-NSLKVVEQHCLGKNTCTIPIEREIYDEPSKDPCPNIFKT 819
Query: 789 LVVEALC 795
L V+ C
Sbjct: 820 LAVQVKC 826
>gi|356518798|ref|XP_003528064.1| PREDICTED: beta-galactosidase 6-like [Glycine max]
Length = 717
Score = 579 bits (1493), Expect = e-162, Method: Compositional matrix adjust.
Identities = 327/720 (45%), Positives = 426/720 (59%), Gaps = 85/720 (11%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A V++DGR++ IDG+RKIL SGSIHYPRSTP MWPDLI KAK+GGLD I+TYVFWN HE
Sbjct: 24 AEEVTYDGRSLIIDGQRKILFSGSIHYPRSTPQMWPDLIAKAKQGGLDVIQTYVFWNLHE 83
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P YDF+G DL+ FIK IQ QGLYV LRIGP++ +EW YGGFP WLH++PGI RT
Sbjct: 84 PQPGMYDFSGRYDLVGFIKEIQAQGLYVCLRIGPFIESEWTYGGFPFWLHDVPGI-VYRT 142
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
N+ F MQNFTT IV+M K+E L+ASQGGPIIL+QIENEY N+ +G AG Y+ W
Sbjct: 143 DNEPFKFYMQNFTTKIVNMMKEEGLYASQGGPIILSQIENEYQNIQKAFGTAGSQYVQWA 202
Query: 206 AKMATSLDIGVPWIMCQESDAPSPM------------FT-PNNPNSPKIWTENWTGWFKS 252
AKMA LD GVPWIMC+++DAP P+ FT PN+PN P +WTENWT +++
Sbjct: 203 AKMAVGLDTGVPWIMCKQTDAPDPVINTCNGMRCGETFTGPNSPNKPALWTENWTSFYQV 262
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDE 312
+GG R+AED+AF V F G++ NYYMYHGGTNFGRT G Y+ T Y AP+DE
Sbjct: 263 YGGLPYIRSAEDIAFHVTLFIARNGSYVNYYMYHGGTNFGRT-GSAYVITGYYDQAPLDE 321
Query: 313 YGHLNQPKWGHLRELHKLLKSMEKTLTYG---NVT----------------------NTD 347
YG L QPKWGHL++LH+++KS TL G N T N D
Sbjct: 322 YGLLRQPKWGHLKQLHEVIKSCSTTLLQGVQRNFTLGQLLEVYVFEEEKGECVAFLINND 381
Query: 348 YGNSVS----GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQW 403
N + SSY L S+SILPDC+ F+TA VNT +N ++ P Q N + W
Sbjct: 382 RDNKATVQFRNSSYELLPKSISILPDCQNVTFSTANVNTTSNRRIISPKQ--NFSSVDDW 439
Query: 404 KWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTL 462
+ ++I++F ++L++Q +T D SDYLWY + S TL
Sbjct: 440 QQFQDVISNF---DNTSLKSDSLLEQMNTTKDKSDYLWYTLRFEYN------LSCSKPTL 490
Query: 463 RINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSK 522
+ S+ V HA+VN Y+ + + + E PV + +G N +S+LS VGL + G+
Sbjct: 491 SVQSAAHVAHAFVNNTYIGGEHGNHDVKSFTLELPVTVNQGTNNLSILSVMVGLPDSGAF 550
Query: 523 FDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSS 582
+ G+ + V E +L++ W Y+VGL G + + Y + NS+ GWS
Sbjct: 551 LERRFAGL----ISVELQCSEQESLNLTNSTWGYQVGLMG-EQLQVYKEQ-NNSDTGWSQ 604
Query: 583 KNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCST 642
+ + + WYKTTF+ P +DPVVL+L MGKG AWVNG ++GRYW + +
Sbjct: 605 LGNVMEQTLFWYKTTFDTPEGDDPVVLDLSSMGKGEAWVNGESIGRYWILFHDSK----- 659
Query: 643 ESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVV 702
GNPSQ YHVPRS++KD N LVL EE GGNP I+ TV V
Sbjct: 660 ------------------GNPSQSLYHVPRSFLKDSGNVLVLLEEGGGNPLGISLDTVSV 701
>gi|357437609|ref|XP_003589080.1| Beta-galactosidase [Medicago truncatula]
gi|355478128|gb|AES59331.1| Beta-galactosidase [Medicago truncatula]
Length = 718
Score = 576 bits (1485), Expect = e-161, Method: Compositional matrix adjust.
Identities = 318/711 (44%), Positives = 406/711 (57%), Gaps = 63/711 (8%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D +A+ IDG+R+IL+SGSIHYPRSTP MWPDL +KAK+GGLD I+TYVFWN HEP
Sbjct: 25 VSYDHKALVIDGQRRILISGSIHYPRSTPEMWPDLFQKAKDGGLDVIQTYVFWNGHEPSP 84
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y LD ++ K Q L V LR+ P + GFPVWL +PG+ RT N+
Sbjct: 85 GNYTLKDRLDWVKLSKLAQQAVLNVHLRMVP------TFVGFPVWLKYVPGMA-FRTDNE 137
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ FTT IV M K E LF +QGGPII++QIENEYG V + G GK+Y W A+M
Sbjct: 138 PFKAAMQKFTTKIVTMMKAESLFQTQGGPIIMSQIENEYGPVEWEIGAPGKAYTKWAAQM 197
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A LD GVPW MC++ DAP P+ FTPN PK+WTENW+GW+ +GG
Sbjct: 198 AVGLDTGVPWDMCKQEDAPDPVIDTCNGYYCENFTPNENFKPKMWTENWSGWYTDFGGAI 257
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
R EDLA++VA F Q G+F NYYMYHGGTNFGRTS G ++ TSYDYDAPIDEYG N
Sbjct: 258 SHRPTEDLAYSVATFIQNRGSFVNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYGLPN 317
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------------------------- 350
+PKW HL+ LHK +K E L + T T GN
Sbjct: 318 EPKWSHLKNLHKAIKQCEPALISVDPTVTWLGNKNLEAHVYYVNTSICAAFLANYDTKSA 377
Query: 351 ---SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRP 407
+ Y+LP WSVSILPDCKT FNTA VN + K P + D W+
Sbjct: 378 ATVTFGNGQYDLPPWSVSILPDCKTVVFNTATVNGHSFHKRMTPVETTFD-------WQS 430
Query: 408 EMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINS 466
N L +Q T D SDYLWY+T+ ++ + + TL INS
Sbjct: 431 YSEEPAYSSDDDSIIANALWEQINVTRDSSDYLWYLTDVNISPSESFIKNGQFPTLTINS 490
Query: 467 SGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMV 526
+G VLH +VNG + + F V L G N+ISLLS VGL N G F+
Sbjct: 491 AGHVLHVFVNGQLSGTVYGGLDNPKVTFSESVNLKVGNNKISLLSVAVGLPNVGLHFETW 550
Query: 527 PNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVP 586
G+ GPV L G DE +DLS KW+YKVGL G + + ++S ++
Sbjct: 551 NVGVLGPVRLKGL--DEGT-RDLSWQKWSYKVGLKG-ESLSLHTITGSSSIDWTQGSSLA 606
Query: 587 LNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCD 646
+ +TWYKTTF+AP NDPV L++ MGKG W+N ++GR+WP Y+A + + C+
Sbjct: 607 KKQPLTWYKTTFDAPSGNDPVALDMSSMGKGEIWINDQSIGRHWPAYIAHGN---CDECN 663
Query: 647 YRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
Y G + + KC NCG P+Q WYH+PRSW+ N LV+ EE+GG+P+ I+
Sbjct: 664 YAGTFTNPKCRTNCGEPTQKWYHIPRSWLSSSGNVLVVLEEWGGDPTGISL 714
>gi|320170852|gb|EFW47751.1| beta-galactosidase [Capsaspora owczarzaki ATCC 30864]
Length = 851
Score = 572 bits (1474), Expect = e-160, Method: Compositional matrix adjust.
Identities = 326/860 (37%), Positives = 467/860 (54%), Gaps = 102/860 (11%)
Query: 1 MATLKHCSRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMW 60
+A + C AIL+ L + N V++D RA+ +DG+R++L++G IHYPRSTP MW
Sbjct: 28 VAAVAMCCSAILVALPSTSAMN------VTYDSRALLLDGQRRLLIAGCIHYPRSTPEMW 81
Query: 61 PDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPY 120
P+L +AK GLD I+TY+FW+ ++P ++ T D +RFIK Q GL V RIGPY
Sbjct: 82 PELFARAKANGLDVIQTYLFWDVNQPTPGEFVMTDRFDYVRFIKLAQQAGLMVNFRIGPY 141
Query: 121 VCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIIL 180
VCAEWNYGGFP WL + GI R +K +++ + + T V + K KL A+ GGP+IL
Sbjct: 142 VCAEWNYGGFPAWLRQISGIV-FRDNDKPWLDVVGPYITKTVQVLKDNKLLAADGGPVIL 200
Query: 181 AQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFTPNN----- 235
QIENEYGN+ Y G +Y+ WC ++A SL+ G WIMCQ+ DAP+ N
Sbjct: 201 LQIENEYGNIEDSYA-GGPAYVQWCGQLAASLNAGAQWIMCQQDDAPANTIATCNGFYCD 259
Query: 236 -----PNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTN 290
P +WTENW GWF++WG P R A+D+AFA ARF+ GGT+ +YYMYHGGTN
Sbjct: 260 NYVPHKGQPMMWTENWPGWFQTWGQPSPHRPAQDVAFAAARFYAKGGTYMSYYMYHGGTN 319
Query: 291 FGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNV------- 343
FGRT+GGP +TTSYDYD +DEYG ++PK+ HL LH +L + E + NV
Sbjct: 320 FGRTAGGPGITTSYDYDVALDEYGMPSEPKYSHLGSLHAVLHANEHIIMSMNVPAPISLG 379
Query: 344 ----------------------TNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVN 381
++ D +G ++ LPAWSVSIL +C +NTA V+
Sbjct: 380 KNLEAHVFNSSSGCVAFLSNIDSSVDAEVQFNGRTFELPAWSVSILHNCAFAIYNTAAVS 439
Query: 382 TQTNVKVKRP--------NQAGNDQAPLQWKWRPEMINDFVV----------RGKGHFAL 423
N + P + A + + L E + F R +
Sbjct: 440 APLNARRMTPLVVHEDAVSDAADHRRSLSKGEGQERVGAFSTFASYAETIGRRAEEAVYF 499
Query: 424 NTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDS 482
+ +Q +TND +DYLWY T + + +++ L I++ V++ YVN +V
Sbjct: 500 TSPQEQINTTNDTTDYLWYTTTYN-------SASATSQVLSISNVNDVVYVYVNRQFVTM 552
Query: 483 QWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGD 542
W+ G+ N + V L G N I +LS T GLQNYG+ + V GI G V L
Sbjct: 553 SWS--GSVN----KAVPLMAGTNVIDVLSTTFGLQNYGTFLEQVTRGIQGTVKLGS---- 602
Query: 543 ETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPL 602
DL+ + W ++VGL G + F A+N W++ NR +TWY+++F+ P
Sbjct: 603 ----TDLTQNGWWHQVGLLGEELGIFLPQNASNVP--WATPAT-TNRGLTWYRSSFDLPQ 655
Query: 603 END-PVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCG 661
+ P+ L++ GMGKGF WVNG+NLGRYWP+ +A+ C + CDYRG Y +C C
Sbjct: 656 SSQAPLALDMTGMGKGFVWVNGHNLGRYWPSRIADSMAC--DDCDYRGAYDDSRCRQGCN 713
Query: 662 NPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHENK-----TME 716
PSQ +YHVPR W++ N +V+ EE GGNP+ I+ +CG E+ ++
Sbjct: 714 IPSQRYYHVPREWLQPTNNLIVMLEEIGGNPALISLVEREEDISCGAVGEDYPADDLSVV 773
Query: 717 LTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEAN 775
L C + I +++ASFG P G C F GSC A + ++E C+G+++C + + +
Sbjct: 774 LGCGLHQTIRRVEFASFGTPVGTCRQFSLGSCNAA-NSTAIVESLCLGRQACHVPVAINH 832
Query: 776 LGATSCAAGTVKRLVVEALC 795
G T KRL V+ C
Sbjct: 833 FGDP--CPDTTKRLFVQVSC 850
>gi|183604889|gb|ACC64531.1| beta-galactosidase 6 [Oryza sativa Indica Group]
Length = 811
Score = 571 bits (1471), Expect = e-160, Method: Compositional matrix adjust.
Identities = 334/827 (40%), Positives = 443/827 (53%), Gaps = 97/827 (11%)
Query: 25 LAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAH 84
L +++DGRA+ + G R++ SG +HY RSTP MWP LI KAK GGLD I+TYVFWN H
Sbjct: 25 LGREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVH 84
Query: 85 EPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELR 144
EP++ QY+F G DL++FI+ IQ QGLYV LRIGP+V AEW YGGFP WLH++P I R
Sbjct: 85 EPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSI-TFR 143
Query: 145 TTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW 204
+ N+ F MQNF T IV M K E L+ QGGPII++QIENEY + +G +G Y+ W
Sbjct: 144 SDNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRW 203
Query: 205 CAKMATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFK 251
A MA L GVPW+MC+++DAP P+ PN+PN P +WTENWT +
Sbjct: 204 AAAMAVGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRYP 263
Query: 252 SWGGKDPKRTAEDLAFAVARFF-QFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPI 310
+G R ED+AFAVA + + G+F +YYMYHGGTNFGR + Y+TTSY AP+
Sbjct: 264 IYGNDTKLRDPEDIAFAVALYIARKKGSFVSYYMYHGGTNFGRFAAS-YVTTSYYDGAPL 322
Query: 311 DEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNS----------------VSG 354
DEYG + QP WGHLRELH +K + L +G+ +N G V+
Sbjct: 323 DEYGLIWQPTWGHLRELHCAVKQSSEPLLFGSYSNFSLGQQQEAHVFETDFKCVAFLVNF 382
Query: 355 SSYNLPAW------------SVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQ 402
+N P S+S+L DC+ F TAKVN Q + Q+ ND
Sbjct: 383 DQHNTPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNAQHGSRTANAVQSLNDIN--N 440
Query: 403 WKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMT 461
WK E + + K + N L +Q +T D +DYLWY+ + + D G+
Sbjct: 441 WKAFIEPVPQDL--SKSTYTGNQLFEQLPTTKDETDYLWYIVSYKNRASD----GNQIAR 494
Query: 462 LRINSSGQVLHAYVNGNYVDSQWTKY-GASNDLFERPVKLTRGKNQISLLSATVGLQNYG 520
L + S +LHA+VN YV S + G N + + L G N ISLLS VG + G
Sbjct: 495 LYVKSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEGDNTISLLSVMVGSPDSG 554
Query: 521 SKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGW 580
+ + GI VG + + L++ W Y+VGL+G D Y + NS R W
Sbjct: 555 AYMERRTFGIQ----TVGIQQGQQPMHLLNNDLWGYQVGLFGEKD-SIYTQEGPNSVR-W 608
Query: 581 SSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGC 640
N + +TWYKTTF P ND V LNL MGKG WVNG ++GRYW ++ A
Sbjct: 609 MDINNLIYHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEVWVNGESIGRYWVSFKAPS--- 665
Query: 641 STESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTV 700
G PSQ YH+PR ++ N LVL EE GG+P QI T+
Sbjct: 666 --------------------GQPSQSLYHIPRGFLTPKDNLLVLVEEMGGDPLQITVNTM 705
Query: 701 VVGTACGQAHENKTMELTCH------------GRRISEIKYASFGDPQGACGAFKKGSCE 748
V T CG E L G+RIS I++AS+G+P G C +F+ GSC
Sbjct: 706 SVTTVCGNVDEFSVPPLQSRGKVPKVRIWCQGGKRISSIEFASYGNPVGDCRSFRIGSCH 765
Query: 749 AEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
AE ++++ C+G++ CSI A G C G K L+V A C
Sbjct: 766 AE-SSESVVKQSCIGRRGCSIPVMAAKFGGDPC-PGIQKSLLVVADC 810
>gi|357464801|ref|XP_003602682.1| Beta-galactosidase [Medicago truncatula]
gi|355491730|gb|AES72933.1| Beta-galactosidase [Medicago truncatula]
Length = 719
Score = 570 bits (1470), Expect = e-160, Method: Compositional matrix adjust.
Identities = 325/738 (44%), Positives = 429/738 (58%), Gaps = 89/738 (12%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
++L IL+ F + A V++DGR++ I+G+R IL SGSIHYPRSTP MWP LI KAK+G
Sbjct: 9 MMLVAILELSFGVKGAEEVTYDGRSLIINGQRNILFSGSIHYPRSTPQMWPGLIAKAKQG 68
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GLD I+TYVFWN HEP +YDF+G DL+ FIK I QGLYV LRIGP++ +EWNYGGF
Sbjct: 69 GLDVIQTYVFWNLHEPQPGKYDFSGRNDLVGFIKEIHAQGLYVSLRIGPFIESEWNYGGF 128
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
P WLH++PGI RT N+ F MQNFTT IV+M K+E L+ASQGGPIIL+QIENEYGN+
Sbjct: 129 PFWLHDVPGI-VYRTDNEPFKFYMQNFTTKIVNMMKEEGLYASQGGPIILSQIENEYGNI 187
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM------------FT-PNNPN 237
+G AG Y+ W AKMA L+ GVPW+MC++ DAP P+ FT PN+PN
Sbjct: 188 QKAFGTAGSQYVEWAAKMAVGLNTGVPWVMCKQPDAPDPVINTCNGMRCGETFTGPNSPN 247
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
P +WTENWT +++ +GG R+AED+AF V F G+F NYYMYHGGTNFGRTS
Sbjct: 248 KPAMWTENWTSFYQVYGGVPYIRSAEDIAFHVTLFVARNGSFVNYYMYHGGTNFGRTSSA 307
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGN--------------- 342
Y+ T Y AP+DEYG QPKWGHL+ELH +KS TL G
Sbjct: 308 -YMITGYYDQAPLDEYGLFRQPKWGHLKELHAAIKSCSTTLLQGVQRNFSLGELQEGYVF 366
Query: 343 ----------VTNTDYGNSVS----GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKV 388
+ N D GN+V+ SSY L S+SILPDC+ FNTA +NT +N ++
Sbjct: 367 EEENGKCAAFLINNDKGNTVTVQFNNSSYKLLPKSISILPDCQNVAFNTAHLNTTSNRRI 426
Query: 389 KRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNAD- 446
Q N + WK ++I +F ++L++Q +T D SDYLWY +
Sbjct: 427 ITSRQ--NFSSVDDWKQFQDVIPNF---DDTSLRSDSLLEQMNTTKDKSDYLWYTLRLEN 481
Query: 447 -LKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKN 505
L +DPI L + SS V +A+VN Y+ + + + E P+ L N
Sbjct: 482 NLSCNDPI--------LHVQSSAHVAYAFVNNTYIGGEHGNHDVKSFTLELPITLNERTN 533
Query: 506 QISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDD 565
IS+LS VGL + G+ + G+ V E +L++ W Y+VGL G +
Sbjct: 534 NISILSGMVGLPDSGAFLEKRFAGLNN----VELQCSEQESLNLNNSTWGYQVGLLG-EQ 588
Query: 566 KKFYNAKAANSERGWSSKNVPLNR-RMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGY 624
K Y + + + N+ ++ +TWYKTTF+ P +DP+ L+L M KG AWVNG
Sbjct: 589 LKVYTEQNSTDIKWTQLGNITIDEVTLTWYKTTFDTPKGDDPIALDLSSMAKGEAWVNGQ 648
Query: 625 NLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVL 684
++GRYW +L + GNPSQ YHVPRS++KD N+LVL
Sbjct: 649 SIGRYWILFLDSK-----------------------GNPSQSLYHVPRSFLKDSENSLVL 685
Query: 685 FEEFGGNPSQINFQTVVV 702
+E GGNP I+ TV V
Sbjct: 686 LDEGGGNPLDISLNTVSV 703
>gi|356507439|ref|XP_003522474.1| PREDICTED: beta-galactosidase 6-like [Glycine max]
Length = 717
Score = 570 bits (1468), Expect = e-159, Method: Compositional matrix adjust.
Identities = 323/722 (44%), Positives = 421/722 (58%), Gaps = 89/722 (12%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A V++DGR++ IDG+RKIL SG IHYPRSTP MWPDLI KAK+GGLD I+TYVFWN HE
Sbjct: 24 AEEVTYDGRSLIIDGQRKILFSGLIHYPRSTPQMWPDLIAKAKQGGLDVIQTYVFWNLHE 83
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P YDF G DL+ FIK IQ QGLYV LRIGP++ +EW YGGFP WLH++PGI RT
Sbjct: 84 PQPGMYDFRGRYDLVGFIKEIQAQGLYVCLRIGPFIQSEWKYGGFPFWLHDVPGIV-YRT 142
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
N+ F MQNFTT IV+M K+E L+ASQGGPIIL+QIENEY N+ +G AG Y+ W
Sbjct: 143 DNESFKFYMQNFTTKIVNMMKEEGLYASQGGPIILSQIENEYQNIQKAFGTAGSQYVQWA 202
Query: 206 AKMATSLDIGVPWIMCQESDAPSPM------------FT-PNNPNSPKIWTENWTGWFKS 252
AKMA L+ GVPW+MC+++DAP P+ FT PN+PN P +WTENWT +++
Sbjct: 203 AKMAVGLNTGVPWVMCKQTDAPDPVINTCNGMRCGETFTGPNSPNKPALWTENWTSFYQV 262
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDE 312
+GG R+AED+AF V F G++ NYYMYHGGTNFGRT+ +T YD AP+DE
Sbjct: 263 YGGLPYIRSAEDIAFHVTLFIARNGSYVNYYMYHGGTNFGRTASAYVITGYYD-QAPLDE 321
Query: 313 YGHLNQPKWGHLRELHKLLKSMEKTLTYGN-------------------------VTNTD 347
YG L QPKWGHL++LH+++KS TL G + N D
Sbjct: 322 YGLLRQPKWGHLKQLHEVIKSCSTTLLQGVQRNFSLGQLQEGYVFEEEKGECVAFLKNND 381
Query: 348 YGNSVS----GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQW 403
N V+ SY L S+SILPDC+ FNTA VNT +N ++ P Q N + W
Sbjct: 382 RDNKVTVQFRNRSYELLPRSISILPDCQNVAFNTANVNTTSNRRIISPKQ--NFSSLDDW 439
Query: 404 KWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNAD--LKDDDPILSGSSNM 460
K ++I F ++L++Q +T D SDYLWY + L P
Sbjct: 440 KQFQDVIPYF---DNTSLRSDSLLEQMNTTKDKSDYLWYTLRFEYNLSCRKP-------- 488
Query: 461 TLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYG 520
TL + S+ V HA++N Y+ + + + E PV + +G N +S+LSA VGL + G
Sbjct: 489 TLSVQSAAHVAHAFINNTYIGGEHGNHDVKSFTLELPVTVNQGTNNLSILSAMVGLPDSG 548
Query: 521 SKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGW 580
+ + G+ + V E +L++ W Y+VGL G + + K NS+ GW
Sbjct: 549 AFLERRFAGL----ISVELQCSEQESLNLTNSTWGYQVGLLGEQLQVY--KKQNNSDIGW 602
Query: 581 SSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGC 640
S + + + WYKTTF+ P +DPVVL+L MGKG AWVN ++GRYW + +
Sbjct: 603 SQLGNIMEQLLIWYKTTFDTPEGDDPVVLDLSSMGKGEAWVNEQSIGRYWILFHDSK--- 659
Query: 641 STESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTV 700
GNPSQ YHVPRS++KD N LVL EE GGNP I+ TV
Sbjct: 660 --------------------GNPSQSLYHVPRSFLKDTGNVLVLVEEGGGNPLGISLDTV 699
Query: 701 VV 702
V
Sbjct: 700 SV 701
>gi|356527530|ref|XP_003532362.1| PREDICTED: beta-galactosidase 6-like [Glycine max]
Length = 673
Score = 569 bits (1467), Expect = e-159, Method: Compositional matrix adjust.
Identities = 320/716 (44%), Positives = 419/716 (58%), Gaps = 94/716 (13%)
Query: 28 RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPL 87
V++DGR++ IDG+RKIL SGSIHYPRSTP MWP LI KAKEGGLD I+TYVFWN HEP
Sbjct: 3 EVTYDGRSLIIDGQRKILFSGSIHYPRSTPQMWPALISKAKEGGLDVIQTYVFWNLHEPQ 62
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
QYDF+G DL+RFIK IQ QGLYV LRIGPY+ +EW YGGFP WLH++P I RT N
Sbjct: 63 FGQYDFSGRYDLVRFIKEIQVQGLYVCLRIGPYIESEWTYGGFPFWLHDVPAIV-YRTDN 121
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
+ F MQNFTT IV M + E L+ASQGGPIIL+QIENEY NV +G+ G Y+ W A+
Sbjct: 122 QPFKLYMQNFTTKIVSMMQSEGLYASQGGPIILSQIENEYQNVEKAFGEDGSRYVQWAAE 181
Query: 208 MATSLDIGVPWIMCQESDAPSPM------------FT-PNNPNSPKIWTENWTGWFKSWG 254
MA L GVPW+MC+++DAP P+ FT PN+PN P WTENWT +++ +G
Sbjct: 182 MAVGLKTGVPWLMCKQTDAPDPLINTCNGMRCGETFTGPNSPNKPAFWTENWTSFYQVYG 241
Query: 255 GKDPKRTAEDLAFAVARFF-QFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEY 313
G+ R+AED+AF V F + G++ NYYMYHGGTN GRTS Y+ TSY AP+DEY
Sbjct: 242 GEPYIRSAEDIAFHVTLFIARKNGSYVNYYMYHGGTNLGRTSSS-YVITSYYDQAPLDEY 300
Query: 314 GHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSG------------------- 354
G L QPKWGHL+ELH +KS TL G +N G G
Sbjct: 301 GLLRQPKWGHLKELHAAIKSCSTTLLEGKQSNFSLGQLQEGYVFEEEGKCVAFLVNNDHV 360
Query: 355 ---------SSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKW 405
SY LP+ S+SILPDC+ FNTA VNT++N ++ Q + +W+
Sbjct: 361 KMFTVQFRNRSYELPSKSISILPDCQNVTFNTATVNTKSNRRMTSTIQTFSSAD--KWEQ 418
Query: 406 RPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRI 464
++I +F + N+L++Q + T D SDYLWY S L
Sbjct: 419 FQDVIPNF---DQTTLISNSLLEQMNVTKDKSDYLWYTL--------------SESKLTA 461
Query: 465 NSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFD 524
S+ V HA+ +G Y+ + + + P+KL G N IS+LS VGL + G+ +
Sbjct: 462 QSAAHVTHAFADGTYLGGAHGSHDVKSFTTQVPLKLNEGTNNISILSVMVGLPDAGAFLE 521
Query: 525 MVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
G+ + + +E+ DL++ W Y+VGL G + + Y K+ NS WS
Sbjct: 522 RRFAGLTAVEI---QCSEESY--DLTNSTWGYQVGLLG-EQLEIYEEKS-NSSIQWSPLG 574
Query: 585 VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTES 644
N+ +TWYKT F++P ++PV LNL+ MGKG AWVNG ++GRYW ++ +
Sbjct: 575 NTCNQTLTWYKTAFDSPKGDEPVALNLESMGKGQAWVNGESIGRYWISFHDSK------- 627
Query: 645 CDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTV 700
G PSQ YHVPRS++KD N+LVLFEE GGNP I+ T+
Sbjct: 628 ----------------GQPSQTLYHVPRSFLKDIGNSLVLFEEEGGNPLHISLDTI 667
>gi|357133576|ref|XP_003568400.1| PREDICTED: beta-galactosidase 7-like [Brachypodium distachyon]
Length = 821
Score = 563 bits (1450), Expect = e-157, Method: Compositional matrix adjust.
Identities = 320/824 (38%), Positives = 445/824 (54%), Gaps = 97/824 (11%)
Query: 28 RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPL 87
V++DGRA+ ++G R++L SG +HY RSTP MWP +I KA++GG+D I+TYVFWN HEP+
Sbjct: 38 EVTYDGRALLLNGTRRMLFSGEMHYTRSTPEMWPKIIAKARKGGIDVIQTYVFWNVHEPV 97
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
+ +Y+F G ++++FI+ IQ QGLYV LRIGP++ AEW YGGFP WLH +P I RT N
Sbjct: 98 QGKYNFEGRYNIVKFIREIQAQGLYVSLRIGPFIEAEWKYGGFPFWLHEVPNIT-FRTDN 156
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
+ F MQ F T +V+M K E L+ QGGPII++QIENEY V +G G Y+ W A
Sbjct: 157 EPFKQHMQGFVTHMVNMMKNEGLYYPQGGPIIISQIENEYQMVEPAFGPGGPRYVQWAAS 216
Query: 208 MATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWG 254
+A L GVPW+MC+++DAP P+ PN+PN P +WTENWT + +G
Sbjct: 217 LAVGLQTGVPWMMCKQNDAPDPIINTCNGLICGETFVGPNSPNKPALWTENWTTRYPIYG 276
Query: 255 GKDPKRTAEDLAFAVARFF-QFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEY 313
R+ D+ FAVA F + GG+F +YYMYHGGTNFGR + Y+TTSY AP+DEY
Sbjct: 277 NDTKLRSTGDITFAVALFIARKGGSFVSYYMYHGGTNFGRFASS-YVTTSYYDGAPLDEY 335
Query: 314 GHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGS------------------ 355
G + QP WGHL+ELH +K + L YG +N G
Sbjct: 336 GLIWQPTWGHLKELHAAVKLSSEPLLYGTYSNFSLGEDQEAHVFETKLKCVAFLVNFDKH 395
Query: 356 ----------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKW 405
S L S+SIL DC+T F T KVN Q + Q+ ND WK
Sbjct: 396 QRPTVIFRNISLQLAPKSISILSDCRTVVFETGKVNAQHGSRTAEVVQSLNDTH--TWKA 453
Query: 406 RPEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTLRI 464
E I + K + L + ST D +DYLWY+ + + + D S + L +
Sbjct: 454 FKESIPQDI--SKAAYTGKQLFEHLSTTKDETDYLWYIASYEYRPSD----DSHLVLLNV 507
Query: 465 NSSGQVLHAYVNGNYVDSQWTKYGASN-DLFERPVKLTRGKNQISLLSATVGLQNYGSKF 523
S +LHA+VNG +V S +GA + + L G+N ISLL+ VG + G+
Sbjct: 508 ESQAHILHAFVNGEFVGSVHGSHGARGYIILNMTISLKEGQNTISLLNVMVGSPDSGAHM 567
Query: 524 DMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSK 583
+ GI + G+ + L++ W Y+VGL+G + + Y + ++S W+
Sbjct: 568 ERRSFGIHKVSIQQGQHA----LHLLNNELWGYQVGLFG-EGNRIYTQEGSHSVE-WTDV 621
Query: 584 NVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTE 643
N +TWY+TTF P+ ND V LNL MGKG W+NG ++GRYW
Sbjct: 622 NNLTYLPLTWYQTTFATPMGNDAVTLNLTSMGKGEVWINGESIGRYW------------- 668
Query: 644 SCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVG 703
++ P G PSQ YH+P+ ++K+ N LVL EE GGNP QI TV +
Sbjct: 669 -VSFKTP---------SGQPSQSLYHIPQHFLKNTDNLLVLVEEMGGNPLQITVNTVSIT 718
Query: 704 TACGQAHE-----------NKTMELTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEI 751
T C +E + + L C G+ IS +++AS+G+P G C F GSC AE
Sbjct: 719 TVCSSVNELSAPPVQSQGKDPEVRLRCQKGKHISAVEFASYGNPAGDCRTFTIGSCHAE- 777
Query: 752 DVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
++++ C+GK+SCSI + G C G K L+V A C
Sbjct: 778 SSESVVKQACIGKRSCSIPVGPGSFGGDPC-PGIQKSLLVVAHC 820
>gi|356507642|ref|XP_003522573.1| PREDICTED: beta-galactosidase 16-like [Glycine max]
Length = 696
Score = 563 bits (1450), Expect = e-157, Method: Compositional matrix adjust.
Identities = 321/716 (44%), Positives = 417/716 (58%), Gaps = 91/716 (12%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DGR++ IDG+ KIL SGSIHYPRSTP MWP+LI KAKEGGLD I+TYVFWN HEP +
Sbjct: 27 VTYDGRSLIIDGQHKILFSGSIHYPRSTPQMWPNLIAKAKEGGLDVIQTYVFWNLHEPQQ 86
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QYDF G +++RFIK IQ QGLYV LRIGPY+ +E YGG P+WLH++PGI R+ N+
Sbjct: 87 GQYDFRGMRNIVRFIKEIQAQGLYVTLRIGPYIESECTYGGLPLWLHDIPGI-VFRSDNE 145
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ FT IV++ K LFASQGGPIIL+QIENEYGNV + + G SYI W A+M
Sbjct: 146 QFKFHMQRFTAKIVNLMKSANLFASQGGPIILSQIENEYGNVEGAFHEKGLSYIRWAAQM 205
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGG 255
A L GVPW+MC++ +AP P+ PN+PN P +WTENWT +++ +G
Sbjct: 206 AVGLQTGVPWVMCKQDNAPDPVINTCNGMQCGKTFKGPNSPNKPSLWTENWTSFYQVFGE 265
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
R+AED+A+ VA F G++ NYYMYHGGTNF R + +T YD +AP+DEYG
Sbjct: 266 VPYIRSAEDIAYNVALFIAKRGSYVNYYMYHGGTNFDRIASAFVVTAYYD-EAPLDEYGL 324
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG----------NSVSGSS--------- 356
+ +PKWGHL+ELH+ +KS +L YG T+ G +S+ ++
Sbjct: 325 VREPKWGHLKELHEAIKSCSNSLLYGTQTSFSLGTQQNAYVFRRSSIECAAFLENTEDRS 384
Query: 357 ---------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRP 407
Y LP S+SILPDCK FNTAKV Q +K Q + + +WK
Sbjct: 385 VTIQFQNIPYQLPPNSISILPDCKNVAFNTAKVRAQNARAMKSQLQFNSAE---KWKVYR 441
Query: 408 EMINDFVVRGKGHFALNTLIDQKST-NDVSDYLWYMTNADLKDDDPILSGSSNMTLRINS 466
E I F NTL+DQ ST D SDYLWY L D+ S ++ L S
Sbjct: 442 EAIPSF---ADTSLRANTLLDQISTAKDTSDYLWYTFR--LYDN----SANAQSILSAYS 492
Query: 467 SGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMV 526
G VLHA+VNGN V S+ + + + E + L G N IS LSATVGL N G+ +
Sbjct: 493 HGHVLHAFVNGNLVGSKHGSHKNVSFVMENKLNLISGMNNISFLSATVGLPNSGAYLEGR 552
Query: 527 PNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVP 586
G+ + + GR D ++ W Y+VGL G + + Y A + +S+ W S +
Sbjct: 553 VAGLRS-LKVQGR--------DFTNQAWGYQVGLLG-EKLQIYTA-SGSSKVKWESF-LS 600
Query: 587 LNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCD 646
+ +TWYKTTF+AP+ NDPVVLNL MGKG+ WVNG +GRYW ++ +
Sbjct: 601 STKPLTWYKTTFDAPVGNDPVVLNLGSMGKGYTWVNGQGIGRYWVSFHTPQ--------- 651
Query: 647 YRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVV 702
G PSQ WYH+PRS +K N LVL EE GNP I TV +
Sbjct: 652 --------------GTPSQKWYHIPRSLLKSTGNLLVLLEEETGNPLGITLDTVYI 693
>gi|11079481|gb|AAG29193.1|AC078898_3 beta-galactosidase, putative [Arabidopsis thaliana]
Length = 780
Score = 561 bits (1446), Expect = e-157, Method: Compositional matrix adjust.
Identities = 329/835 (39%), Positives = 437/835 (52%), Gaps = 135/835 (16%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DGR++ IDGE KIL SGSIHY RSTP MWP LI KAK GG+D ++TYVFWN HEP +
Sbjct: 12 VTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVVDTYVFWNVHEPQQ 71
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Q+DF+G+ D+++FIK +++ GLYV LRIGP++ EW+YGG P WLHN+ GI RT N+
Sbjct: 72 GQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGI-VFRTDNE 130
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F M+ + +IV + K E L+ASQGGPIIL+QIENEYG V + GKSY+ W AK+
Sbjct: 131 PFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVGRAFRQEGKSYVKWTAKL 190
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGG 255
A LD GVPW+MC++ DAP P+ PN+PN P IWTENWT
Sbjct: 191 AVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSL------ 244
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
+AED+AF VA F G+F NYYMYHGGTNFGR + +T+ YD AP+DEYG
Sbjct: 245 -----SAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNASQFVITSYYD-QAPLDEYGL 298
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN------------------------- 350
L QPKWGHL+ELH +K E+ L G T G
Sbjct: 299 LRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFGKKANLCAAILVNQDKCE 358
Query: 351 ---SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRP 407
SSY L SVS+LPDCK FNTAKVN Q N + ++ Q N +P W+
Sbjct: 359 STVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYNTRTRKARQ--NLSSPQMWEEFT 416
Query: 408 EMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
E + F +L L +T D SDYLW T + P + L++N
Sbjct: 417 ETVPSFSETSIRSESL--LEHMNTTQDTSDYLWQTTRFQQSEGAPSV-------LKVNHL 467
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G LHA+VNG ++ S + A L E+ + L G N ++LLS VGL N G+ +
Sbjct: 468 GHALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLALLSVMVGLPNSGAHLERRV 527
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
G + GR +++ W Y+VGL G +K + +++ W
Sbjct: 528 VGSRSVKIWNGRYQLY-----FNNYSWGYQVGLKG--EKFHVYTEDGSAKVQWKQYRDSK 580
Query: 588 NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDY 647
++ +TWYK +F+ P DPV LNL MGKG AWVNG ++ +
Sbjct: 581 SQPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIAMF------------------ 622
Query: 648 RGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLF-EEFGGNPSQINFQTVVVGTAC 706
S YH+PRS++K N LV+ EE GNP I TV V C
Sbjct: 623 ----------------SYFRYHIPRSFLKPNSNLLVILEEEREGNPLGITIDTVSVTEVC 666
Query: 707 GQA-----------------HENKT--------MELTC-HGRRISEIKYASFGDPQGACG 740
G +N T ++L C GR+IS+I +ASFG P G+CG
Sbjct: 667 GHVSNTNPHPVISPRKKGLNRKNLTYRYDRKPKVQLQCPTGRKISKILFASFGTPNGSCG 726
Query: 741 AFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
++ GSC + + L +++K C+ K CS+ G SC TVK L+V A C
Sbjct: 727 SYSIGSCHSP-NSLAVVQKACLKKSRCSVPVWSKTFGGDSCPH-TVKSLLVRAQC 779
>gi|356518551|ref|XP_003527942.1| PREDICTED: beta-galactosidase 16-like [Glycine max]
Length = 697
Score = 557 bits (1436), Expect = e-156, Method: Compositional matrix adjust.
Identities = 319/731 (43%), Positives = 421/731 (57%), Gaps = 93/731 (12%)
Query: 16 ILQTLFNLSLAY--RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLD 73
+ T+F + Y V++DGR++ IDG+ KIL SGSIHYPRSTP MWP+LI KAKEGGLD
Sbjct: 13 FISTVFIGTTVYGGNVTYDGRSLIIDGQHKILFSGSIHYPRSTPQMWPNLIAKAKEGGLD 72
Query: 74 AIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVW 133
I+TYVFWN HEP + QYDF G +++RFIK IQ QGLYV LRIGPY+ +E YGG P+W
Sbjct: 73 VIQTYVFWNLHEPQQGQYDFRGMRNIVRFIKEIQAQGLYVTLRIGPYIESECTYGGLPLW 132
Query: 134 LHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSD 193
LH++PGI R+ N+ F MQ F+ IV++ K LFASQGGPIIL+QIENEYGNV
Sbjct: 133 LHDIPGI-VFRSDNEQFKFHMQKFSAKIVNLMKSANLFASQGGPIILSQIENEYGNVEGA 191
Query: 194 YGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPK 240
+ + G SYI W A+MA L GVPW+MC++ +AP P+ PN+PN P
Sbjct: 192 FHEKGLSYIRWAAQMAVGLQTGVPWVMCKQDNAPDPVINTCNGMQCGKTFKGPNSPNKPS 251
Query: 241 IWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL 300
+WTENWT +++ +G R+AED+A+ VA F G++ NYYMYHGGTNF R + +
Sbjct: 252 LWTENWTSFYQVFGEVPYIRSAEDIAYNVALFIAKRGSYVNYYMYHGGTNFDRIASAFVI 311
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG----------N 350
T YD +AP+DEYG + +PKWGHL+ELH +KS ++ +G T+ G +
Sbjct: 312 TAYYD-EAPLDEYGLVREPKWGHLKELHAAIKSCSNSILHGTQTSFSLGTQQNAYVFKRS 370
Query: 351 SVSGSS------------------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPN 392
S+ ++ Y LP S+SILPDCK FNTAKV+ Q +K
Sbjct: 371 SIECAAFLENTEDQSVTIQFQNIPYQLPPNSISILPDCKNVAFNTAKVSIQNARAMKSQL 430
Query: 393 QAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDD 451
+ + + WK E I F G NTL+DQ ST D SDYLWY L D+
Sbjct: 431 EFNSAET---WKVYKEAIPSF---GDTSLRANTLLDQISTTKDTSDYLWY--TFRLYDNS 482
Query: 452 PILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLS 511
P ++ L S G VLHA+VNGN V S + + + E + L G N IS LS
Sbjct: 483 P----NAQSILSAYSHGHVLHAFVNGNLVGSIHGSHKNLSFVMENKLNLINGMNNISFLS 538
Query: 512 ATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNA 571
ATVGL N G+ + G+ + + GR D ++ W Y++GL G + + Y A
Sbjct: 539 ATVGLPNSGAYLERRVAGLRS-LKVQGR--------DFTNQAWGYQIGLLG-EKLQIYTA 588
Query: 572 KAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWP 631
+ +S+ W S + +TWYKTTF+AP+ NDPVVLNL MGKG+ W+NG +GRYW
Sbjct: 589 -SGSSKVQWESFQSS-TKPLTWYKTTFDAPVGNDPVVLNLGSMGKGYTWINGQGIGRYWV 646
Query: 632 TYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGN 691
++ + G PSQ WYH+PRS +K N LVL EE GN
Sbjct: 647 SFHTPQ-----------------------GTPSQKWYHIPRSLLKSTGNLLVLLEEETGN 683
Query: 692 PSQINFQTVVV 702
P I TV +
Sbjct: 684 PLGITLDTVYI 694
>gi|449435864|ref|XP_004135714.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cucumis
sativus]
Length = 712
Score = 557 bits (1436), Expect = e-156, Method: Compositional matrix adjust.
Identities = 314/730 (43%), Positives = 419/730 (57%), Gaps = 65/730 (8%)
Query: 9 RAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAK 68
+ +LL L L T ++ V++D +AI I+ +R+IL+SGSIHYPRSTP MWPDLI+KAK
Sbjct: 3 KTVLLFLSLLTWVGSTIG-AVTYDEKAIIINDQRRILISGSIHYPRSTPQMWPDLIQKAK 61
Query: 69 EGGLDAIETYVFWNAHEPLRRQYDFTGNL-DLIRFIKTIQDQGLYVILRIGPYVCAEWNY 127
+GGLD IETYVFWN HEP + + L + I +I +V L P +
Sbjct: 62 DGGLDIIETYVFWNGHEPSEGKVTWEDFLYEQILYINC-----FHVALFXFPPYFXFQKF 116
Query: 128 GGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEY 187
GFP+WL +PGI RT N+ F MQ F T IVDM K EKL+ +QGGPIIL+QIENEY
Sbjct: 117 SGFPIWLKFVPGIA-FRTDNEPFKAAMQKFVTKIVDMMKLEKLYHTQGGPIILSQIENEY 175
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNP 236
G V G GKSY W A+MA L GVPW+MC++ DAP P+ F PN
Sbjct: 176 GPVEWQIGAPGKSYTKWFAQMAVDLKTGVPWVMCKQEDAPDPLIDTCNGFYCENFKPNQI 235
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
PKIWTENW+GW+ ++GG P R ED+AF+VARF Q G+ NYY+YHGGTNFGRTS
Sbjct: 236 YKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQNNGSLVNYYVYHGGTNFGRTS- 294
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNT---------- 346
G ++ TSYD+DAPIDEYG + +PKWGHLR+LHK +K E L + T+T
Sbjct: 295 GLFIATSYDFDAPIDEYGLIREPKWGHLRDLHKAIKLCEPALVSADPTSTWLGKNQEARV 354
Query: 347 ------------DYGNSVS------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKV 388
+Y S S + Y+LP WS+SILPDCKT FNTA++ ++
Sbjct: 355 FKSSSACAAFLANYDTSASVKVNFWNNPYDLPPWSISILPDCKTVTFNTAQIGVKSYEAK 414
Query: 389 KRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADL 447
P + + W K + L++Q S T D +DYLWYM + +
Sbjct: 415 MMPISS--------FGWLSYKEEPASAYAKDTTTKDGLVEQVSVTWDTTDYLWYMQDISI 466
Query: 448 KDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQI 507
+ L L +NS+G +LH ++NG S + F + V L +G N++
Sbjct: 467 DSTEGFLKSGKWPLLSVNSAGHLLHVFINGQLSGSVYGSLEDPRITFSKYVNLKQGVNKL 526
Query: 508 SLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKK 567
S+LS TVGL N G FD G+ GPV L G +D+S +KW+YKVGL G +
Sbjct: 527 SMLSVTVGLPNVGLHFDTWNAGVLGPVTLKGLNEG---TRDMSKYKWSYKVGLSG-ESLN 582
Query: 568 FYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLG 627
Y+ K +NS + W+ ++ + +TWYKTTF+ P N+P+ L++ M KG WVNG ++G
Sbjct: 583 LYSDKGSNSVQ-WTKGSLTQKQPLTWYKTTFKTPAGNEPLGLDMSSMSKGQIWVNGRSIG 641
Query: 628 RYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEE 687
RY+P Y+A + C Y G + KC NCG PSQ WYH+PR W+ N LV+FEE
Sbjct: 642 RYFPGYIAN---GKCDKCSYAGLFTEKKCLGNCGEPSQKWYHIPRDWLSPSDNLLVIFEE 698
Query: 688 FGGNPSQINF 697
GG+P I+
Sbjct: 699 IGGSPDGISL 708
>gi|357463559|ref|XP_003602061.1| Beta-galactosidase [Medicago truncatula]
gi|355491109|gb|AES72312.1| Beta-galactosidase [Medicago truncatula]
Length = 694
Score = 557 bits (1435), Expect = e-155, Method: Compositional matrix adjust.
Identities = 321/736 (43%), Positives = 409/736 (55%), Gaps = 91/736 (12%)
Query: 9 RAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAK 68
R +L LIL V++D ++ I+G KIL SGSIHYPRSTP MWPDLI KAK
Sbjct: 6 RFLLHALILTVSLCTVHGANVTYDRTSLVINGHHKILFSGSIHYPRSTPQMWPDLISKAK 65
Query: 69 EGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYG 128
EGGLD I+TYVFWN HEP + QY+F G DL+ FIK IQ QGLYV LRIGPY+ +E YG
Sbjct: 66 EGGLDVIQTYVFWNLHEPQQGQYEFNGRFDLVGFIKEIQAQGLYVTLRIGPYIESECTYG 125
Query: 129 GFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYG 188
G P+WLH++PGI RT N F MQ FTT IV+M K LFASQGGPIIL+QIENEYG
Sbjct: 126 GLPLWLHDVPGIV-FRTDNDQFKFHMQRFTTKIVNMMKSANLFASQGGPIILSQIENEYG 184
Query: 189 NVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFT-------------PNN 235
++ S + G YI+W A+MA L GVPW+MC++ DAP P+ PN+
Sbjct: 185 SIQSKFRANGLPYIHWAAQMAVGLQTGVPWMMCKQDDAPDPVINACNGMQCGRNFKGPNS 244
Query: 236 PNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTS 295
PN P +WTENWT + +++GG R+A D+A+ VA F G++ NYYMYHGGTNF R +
Sbjct: 245 PNKPSLWTENWTSFLQAFGGAPYMRSASDIAYNVALFIAKKGSYVNYYMYHGGTNFDRLA 304
Query: 296 GGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN----- 350
+T YD +AP+DEYG + QPKWGHL+ELH +KS + L G T G+
Sbjct: 305 SAFIITAYYD-EAPLDEYGLVRQPKWGHLKELHASIKSCSQPLLDGTQTTFSLGSEQQAY 363
Query: 351 ----------------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKV 388
SY LP S+SILP CK FNT KV+ Q NV+
Sbjct: 364 VFRSSTECAAFLENSGPRDVTIQFQNISYELPGKSISILPGCKNVVFNTGKVSIQNNVRA 423
Query: 389 KRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKST-NDVSDYLWYMTNADL 447
+P N WK E I +F K +TL+DQ ST D SDY+WY +
Sbjct: 424 MKPRLQFNSAE--NWKVYTEAIPNFAHTSK---RADTLLDQISTAKDTSDYMWYTFRFNN 478
Query: 448 KDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQI 507
K S ++ L I S G VLH+++NG S + ++ V L G N I
Sbjct: 479 K------SPNAKSVLSIYSQGDVLHSFINGVLTGSAHGSRNNTQVTMKKNVNLINGMNNI 532
Query: 508 SLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKK 567
S+LSATVGL N G+ + G+ V + GR D SS+ W Y+VGL G +
Sbjct: 533 SILSATVGLPNSGAFLESRVAGLR-KVEVQGR--------DFSSYSWGYQVGLLGEKLQI 583
Query: 568 FYNAKAANSE-RGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNL 626
F + ++ + + + S PL TWY+TTF AP NDPVV+NL MGKG AWVNG +
Sbjct: 584 FTVSGSSKVQWKSFQSSTKPL----TWYQTTFHAPAGNDPVVVNLGSMGKGLAWVNGQGI 639
Query: 627 GRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
GRYW ++ + G PSQ WYH+PRS++K N LV+ E
Sbjct: 640 GRYWVSFHKPD-----------------------GTPSQQWYHIPRSFLKSTGNLLVILE 676
Query: 687 EFGGNPSQINFQTVVV 702
E GNP I TV +
Sbjct: 677 EETGNPLGITLDTVYI 692
>gi|357464799|ref|XP_003602681.1| Beta-galactosidase [Medicago truncatula]
gi|355491729|gb|AES72932.1| Beta-galactosidase [Medicago truncatula]
Length = 628
Score = 555 bits (1429), Expect = e-155, Method: Compositional matrix adjust.
Identities = 305/632 (48%), Positives = 387/632 (61%), Gaps = 59/632 (9%)
Query: 11 ILLCLILQTLFNLSLAY-----RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIK 65
+LCL+ +L +L Y VS+DGR++ IDG+RK+L+S SIHYPRS P MWP LI+
Sbjct: 5 FILCLVSTSL-TFTLVYGGVGSNVSYDGRSLIIDGQRKLLISASIHYPRSVPAMWPALIQ 63
Query: 66 KAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEW 125
AKEGG+D IETYVFWN HE Y F G DL++F K +QD G+Y+ILRIGP+V AEW
Sbjct: 64 TAKEGGIDVIETYVFWNGHELSPGNYYFGGRFDLVQFAKVVQDAGMYLILRIGPFVAAEW 123
Query: 126 NYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIEN 185
N+GG PVWLH +PG RT N+ FM+ M+ FTT IV++ KKEKLFASQGGPIIL+QIEN
Sbjct: 124 NFGGVPVWLHYIPG-TVFRTYNQPFMHHMEKFTTYIVNLMKKEKLFASQGGPIILSQIEN 182
Query: 186 EYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPN 234
EYG + Y + GK Y W AKMA S + VPWIMCQ+ DAP P+ FTP
Sbjct: 183 EYGYYENYYKEDGKKYALWAAKMAVSQNTSVPWIMCQQWDAPDPVIDTCNSFYCDQFTPT 242
Query: 235 NPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRT 294
+P PK+WTENW GWFK++GG+DP R ED+AF+VARFFQ GG+ NYYMYHGGTNFGRT
Sbjct: 243 SPKRPKMWTENWPGWFKTFGGRDPHRPVEDVAFSVARFFQKGGSLNNYYMYHGGTNFGRT 302
Query: 295 SGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSG 354
+GGP++TTSYDYDAPIDEYG PKWGHL+ELHK +K E L YG N G SV
Sbjct: 303 AGGPFITTSYDYDAPIDEYGLPRLPKWGHLKELHKAIKLCEHVLLYGKSVNISLGPSVEA 362
Query: 355 -----------------------------SSYNLPAWSVSILPDCKTEEFNTAKVNTQTN 385
+SY+LPAWSVSILPDCK FNTAKV++ TN
Sbjct: 363 DIYTDSSGACAAFISNVDDKNDKKVVFRNASYHLPAWSVSILPDCKNVVFNTAKVSSPTN 422
Query: 386 VKVKRP---NQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWY 441
+ P Q+ Q L+W E + GK F N +D +T D +DYLW+
Sbjct: 423 IVAMIPEHLQQSDKGQKTLKWDVFKENPG---IWGKADFVKNGFVDHINTTKDTTDYLWH 479
Query: 442 MTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLT 501
T+ + ++ L S L I S G LHA+VN Y + S F+ P+ L
Sbjct: 480 TTSILIDANEEFLKKGSKPALLIESKGHTLHAFVNQKYQGTGTGNGSHSAFTFKNPISLR 539
Query: 502 RGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLY 561
GKN+I++LS TVGLQ G +D + G+ V ++G + TI DLSS+ W YK+G+
Sbjct: 540 AGKNEIAILSLTVGLQTAGPFYDFIGAGVTS-VKIIG-LNNRTI--DLSSNAWAYKIGVL 595
Query: 562 GLDDKKFYNAKAANSERGWSSKNVPLNRRMTW 593
G + Y + NS + S+ P + +TW
Sbjct: 596 G-EHLSIYQGEGMNSVKWTSTSEPPKGQALTW 626
>gi|45758292|gb|AAS76480.1| beta-galactosidase [Gossypium hirsutum]
Length = 843
Score = 554 bits (1428), Expect = e-155, Method: Compositional matrix adjust.
Identities = 323/839 (38%), Positives = 457/839 (54%), Gaps = 111/839 (13%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A V++D R++ I+G+R++L SG+IHYPRSTP MWPDLIKKAK+GG++AIETYVFWN HE
Sbjct: 46 ALGVTYDARSLIINGKRELLFSGAIHYPRSTPDMWPDLIKKAKQGGINAIETYVFWNGHE 105
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P+ QY+F G DL++FIK I + LY ++R+GP++ AEWN+GG P WL +PGI R+
Sbjct: 106 PVEGQYNFEGEFDLVKFIKLIHEHKLYAVVRVGPFIQAEWNHGGLPYWLREVPGI-IFRS 164
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
N+ F M+ F TLIVD K+EKLFA QGGPIILAQIENEY + + + G SY+ W
Sbjct: 165 DNEPFKKHMKRFVTLIVDKLKQEKLFAPQGGPIILAQIENEYNTIQRAFREKGDSYVQWA 224
Query: 206 AKMATSLDIGVPWIMCQESDAPSPM-------------FTPNNPNSPKIWTENWTGWFKS 252
K+A SL+ VPWIMC++ DAP P+ + PN N P +WTENWT ++
Sbjct: 225 GKLALSLNANVPWIMCKQRDAPDPIINTCNGRHCGDTFYGPNKRNKPALWTENWTAQYRV 284
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDE 312
+G +R+AEDLA++VARFF G+ NYYM++GGTNFGRTS + TT Y + P+DE
Sbjct: 285 FGDPPSQRSAEDLAYSVARFFSKNGSMVNYYMHYGGTNFGRTSAS-FTTTRYYDEGPLDE 343
Query: 313 YGHLNQPKWGHLRELHKLLKSMEKTLTYGNVT---------------------------- 344
+G +PKWGHL+++H+ L ++ L +G T
Sbjct: 344 FGLQREPKWGHLKDVHRALSLCKRALFWGFPTTLKLGPDQQAIVWQQPGTSACAAFLANN 403
Query: 345 NTDYGNSVS--GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQ 402
NT V+ G LPA S+S+LPDCKT FNT V TQ N + N ++ A
Sbjct: 404 NTRLAQHVNFRGQDIRLPARSISVLPDCKTVVFNTQLVTTQHNSR----NFVRSEIANKN 459
Query: 403 WKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMT 461
+ W EM + G G F + + T D +DY WY T+ L D + +
Sbjct: 460 FNW--EMCREVPPVGLG-FKFDVPRELFHLTKDTTDYAWYTTSLLLGRRDLPMKKNVRPV 516
Query: 462 LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGS 521
LR+ S G +HAYVNG Y S + + +R V L G+N I+LL VGL + G+
Sbjct: 517 LRVASLGHGIHAYVNGEYAGSAHGSKVEKSFVLQRAVSLKEGENHIALLGYLVGLPDSGA 576
Query: 522 KFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWS 581
+ G P + ++G T D+S + W ++VG+ G + KK + + + S + W+
Sbjct: 577 YMEKRFAG-PRSITILGL---NTGTLDISQNGWGHQVGIDG-EKKKLFTEEGSKSVQ-WT 630
Query: 582 SKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCS 641
+ +TWYK F+AP ++PV + + GMGKG WVNG ++GRYW YL+
Sbjct: 631 KPD--QGGPLTWYKGYFDAPEGDNPVAIVMTGMGKGMVWVNGRSIGRYWNNYLSP----- 683
Query: 642 TESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVV 701
P+Q YH+PR+++K N +VL EE GGNP ++ TV
Sbjct: 684 ------------------LKKPTQSEYHIPRAYLKPK-NLIVLLEEEGGNPKDVHIVTVN 724
Query: 702 VGTACGQAHE-----------------------NKTMELTCHGRR-ISEIKYASFGDPQG 737
T C E EL C G++ I +++AS+GDP G
Sbjct: 725 RDTICSAVSEIHPPSPRLFETKNGSLQAKVNDLKPRAELKCPGKKQIVAVEFASYGDPFG 784
Query: 738 ACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKR-LVVEALC 795
ACGA+ G+C A + ++EK C+GK SC I + A +++ L V+ C
Sbjct: 785 ACGAYFIGNCTAP-ESKQVVEKYCLGKPSCQIPLDSIPFSNQNDACTHLRKTLAVQLKC 842
>gi|356541034|ref|XP_003538988.1| PREDICTED: beta-galactosidase 13-like, partial [Glycine max]
Length = 806
Score = 553 bits (1425), Expect = e-154, Method: Compositional matrix adjust.
Identities = 313/839 (37%), Positives = 444/839 (52%), Gaps = 114/839 (13%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DGR++ I+G R++L SGSIHYPRSTP W ++ KA++GG++ ++TYVFWN HE +
Sbjct: 9 VTYDGRSLIINGRRELLFSGSIHYPRSTPEEWAGILDKARQGGINVVQTYVFWNIHETEK 68
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y D I+FIK IQ +G+YV LR+GP++ AEWN+GG P WL +P I R+ N+
Sbjct: 69 GKYSIEPQYDYIKFIKLIQKKGMYVTLRVGPFIQAEWNHGGLPYWLREVPEI-IFRSNNE 127
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F M+ + + ++ K LFA QGGPIILAQIENEY ++ + + G +Y+ W AKM
Sbjct: 128 PFKKHMKKYVSTVIKTVKDANLFAPQGGPIILAQIENEYNHIQRAFREEGDNYVQWAAKM 187
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGG 255
A SLDIGVPWIMC+++DAP P+ PN P P IWTENWT ++ +G
Sbjct: 188 AVSLDIGVPWIMCKQTDAPDPVINACNGRHCGDTFSGPNKPYKPAIWTENWTAQYRVFGD 247
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
+R+AED+AF+VARFF G+ NYYMYHGGTNFGRTS + TT Y +AP+DEYG
Sbjct: 248 PPSQRSAEDIAFSVARFFSKNGSLVNYYMYHGGTNFGRTSSA-FTTTRYYDEAPLDEYGM 306
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN------------------------- 350
+PKW HLR++H+ L ++ L G T T
Sbjct: 307 QREPKWSHLRDVHRALSLCKRALFNGASTVTKMSQHHEVIVFEKPGSNLCAAFITNNHTK 366
Query: 351 -----SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKW 405
S G+ Y +P S+SILPDCKT FNT + +Q + + + + A ND +W+
Sbjct: 367 VPTTISFRGTDYYMPPRSISILPDCKTVVFNTQCIASQHSSRNFKRSMAANDH---KWEV 423
Query: 406 RPEMI---NDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTL 462
E I K L +L+ D SDY WY T+ +L+ +D L
Sbjct: 424 YSETIPTTKQIPTHEKNPIELYSLL-----KDTSDYAWYTTSVELRPEDLPKKNDIPTIL 478
Query: 463 RINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSK 522
RI S G L A+VNG ++ S + F++PV L G NQI++L++TVGL + G+
Sbjct: 479 RIMSLGHSLLAFVNGEFIGSNHGSHEEKGFEFQKPVTLKVGVNQIAILASTVGLPDSGAY 538
Query: 523 FDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSS 582
+ G P + ++G + DL+S+ W ++VG+ G +K + + + W
Sbjct: 539 MEHRFAG-PKSIFILGLNSGKM---DLTSNGWGHEVGIKG--EKLGIFTEEGSKKVQWKE 592
Query: 583 KNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCST 642
P ++WYKT F P DPV + + GMGKG W+NG ++GR+W +YL+
Sbjct: 593 AKGP-GPAVSWYKTNFATPEGTDPVAIRMTGMGKGMVWINGKSIGRHWMSYLSP------ 645
Query: 643 ESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVV 702
G P+Q YH+PR++ N LV+FEE NP ++ TV
Sbjct: 646 -----------------LGQPTQSEYHIPRTYFNPKDNLLVVFEEEIANPEKVEILTVNR 688
Query: 703 GTACGQAHENK-----------------------TMELTC-HGRRISEIKYASFGDPQGA 738
T C EN + L C H R I +++ASFGDP GA
Sbjct: 689 DTICSFVTENHPPNVKSWAIKSEKFQAVVNDLVPSASLKCPHQRTIKAVEFASFGDPAGA 748
Query: 739 CGAFKKGSCEAEIDVLPLIEKQCVGKKSC--SIEASEANLGATSCAAGTVKRLVVEALC 795
CGAF G C A + ++EKQC+GK SC I+ G +C T K L ++ C
Sbjct: 749 CGAFALGKCNAPA-IKQIVEKQCLGKASCLVPIDKDAFTKGQDACPNVT-KALAIQVRC 805
>gi|357520325|ref|XP_003630451.1| Beta-galactosidase [Medicago truncatula]
gi|355524473|gb|AET04927.1| Beta-galactosidase [Medicago truncatula]
Length = 706
Score = 553 bits (1425), Expect = e-154, Method: Compositional matrix adjust.
Identities = 321/747 (42%), Positives = 409/747 (54%), Gaps = 101/747 (13%)
Query: 9 RAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAK 68
R +L LIL V++D ++ I+G KIL SGSIHYPRSTP MWPDLI KAK
Sbjct: 6 RFLLHALILTVSLCTVHGANVTYDRTSLVINGHHKILFSGSIHYPRSTPQMWPDLISKAK 65
Query: 69 EGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYG 128
EGGLD I+TYVFWN HEP + QY+F G DL+ FIK IQ QGLYV LRIGPY+ +E YG
Sbjct: 66 EGGLDVIQTYVFWNLHEPQQGQYEFNGRFDLVGFIKEIQAQGLYVTLRIGPYIESECTYG 125
Query: 129 GFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYG 188
G P+WLH++PGI RT N F MQ FTT IV+M K LFASQGGPIIL+QIENEYG
Sbjct: 126 GLPLWLHDVPGI-VFRTDNDQFKFHMQRFTTKIVNMMKSANLFASQGGPIILSQIENEYG 184
Query: 189 NVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFT-------------PNN 235
++ S + G YI+W A+MA L GVPW+MC++ DAP P+ PN+
Sbjct: 185 SIQSKFRANGLPYIHWAAQMAVGLQTGVPWMMCKQDDAPDPVINACNGMQCGRNFKGPNS 244
Query: 236 PNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTS 295
PN P +WTENWT + +++GG R+A D+A+ VA F G++ NYYMYHGGTNF R +
Sbjct: 245 PNKPSLWTENWTSFLQAFGGAPYMRSASDIAYNVALFIAKKGSYVNYYMYHGGTNFDRLA 304
Query: 296 GGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNS---- 351
+T YD +AP+DEYG + QPKWGHL+ELH +KS + L G T G+
Sbjct: 305 SAFIITAYYD-EAPLDEYGLVRQPKWGHLKELHASIKSCSQPLLDGTQTTFSLGSEQQVI 363
Query: 352 -------------------------VSGS----------SYNLPAWSVSILPDCKTEEFN 376
+SG SY LP S+SILP CK FN
Sbjct: 364 KNESSWTYFPLMFSEVPQNVLLSWKISGPRDVTIQFQNISYELPGKSISILPGCKNVVFN 423
Query: 377 TAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKST-NDV 435
T KV+ Q NV+ +P N WK E I +F K +TL+DQ ST D
Sbjct: 424 TGKVSIQNNVRAMKPRLQFNSAE--NWKVYTEAIPNFAHTSK---RADTLLDQISTAKDT 478
Query: 436 SDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFE 495
SDY+WY + K S ++ L I S G VLH+++NG S + +
Sbjct: 479 SDYMWYTFRFNNK------SPNAKSVLSIYSQGDVLHSFINGVLTGSAHGSRNNTQVTMK 532
Query: 496 RPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWT 555
+ V L G N IS+LSATVGL N G+ + + G E +D SS+ W
Sbjct: 533 KNVNLINGMNNISILSATVGLPNSGAFLES---------RVAGLRKVEVQGRDFSSYSWG 583
Query: 556 YKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMG 615
Y+VGL G + F + +S+ W S + +TWY+TTF AP NDPVV+NL MG
Sbjct: 584 YQVGLLGEKLQIF--TVSGSSKVQWKSFQSS-TKPLTWYQTTFHAPAGNDPVVVNLGSMG 640
Query: 616 KGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWI 675
KG AWVNG +GRYW ++ + G PSQ WYH+PRS++
Sbjct: 641 KGLAWVNGQGIGRYWVSFHKPD-----------------------GTPSQQWYHIPRSFL 677
Query: 676 KDGVNTLVLFEEFGGNPSQINFQTVVV 702
K N LV+ EE GNP I TV +
Sbjct: 678 KSTGNLLVILEEETGNPLGITLDTVYI 704
>gi|357473809|ref|XP_003607189.1| Beta-galactosidase [Medicago truncatula]
gi|355508244|gb|AES89386.1| Beta-galactosidase [Medicago truncatula]
Length = 825
Score = 553 bits (1424), Expect = e-154, Method: Compositional matrix adjust.
Identities = 320/857 (37%), Positives = 463/857 (54%), Gaps = 112/857 (13%)
Query: 10 AILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
+I L I+ + + A +++DGR++ +DG+ ++ SGSIHYPRSTP MWPD++ KA+
Sbjct: 9 SITLFSIITIVCAQNAAQTITYDGRSLLLDGKGELFFSGSIHYPRSTPDMWPDILDKARR 68
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
GGL+ I+TYVFWN HEP + + +F G DL++F+K +Q++G+YV LRIGP++ AEWN+GG
Sbjct: 69 GGLNLIQTYVFWNGHEPEKDKVNFEGRYDLVKFLKLVQEKGMYVTLRIGPFIQAEWNHGG 128
Query: 130 FPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGN 189
P WL +P I R+ N+ F M+ + +++++ K+EKLFA QGGPIILAQIENEY +
Sbjct: 129 LPYWLREVPDII-FRSNNEPFKKYMKEYVSIVINRMKEEKLFAPQGGPIILAQIENEYNH 187
Query: 190 VMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM------------FT-PNNP 236
+ Y G +Y+ W AKMA SL GVPW+MC++ DAP P+ FT PN P
Sbjct: 188 IQLAYEADGDNYVQWAAKMAVSLYNGVPWVMCKQKDAPDPVINACNGRHCGDTFTGPNKP 247
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
P IWTENWT ++ +G +R+AED+AF+VARFF G+ NYYMYHGGTNFGRT+
Sbjct: 248 YKPFIWTENWTAQYRVFGDPPSQRSAEDIAFSVARFFSKHGSLVNYYMYHGGTNFGRTTS 307
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYG--------------- 341
+ TT Y +AP+DE+G +PKW HLR+ HK + +K+L G
Sbjct: 308 A-FTTTRYYDEAPLDEFGLQREPKWSHLRDAHKAVNLCKKSLLNGVPTTQKISQYHEVIV 366
Query: 342 ---------------NVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV 386
N T T S GS Y LP S+SILPDCKT FNT + +Q +
Sbjct: 367 YEKKESNLCAAFITNNHTQTAKTLSFRGSDYFLPPRSISILPDCKTVVFNTQNIASQHSS 426
Query: 387 KVKRPNQAGNDQAPLQWKWRPEMI---NDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMT 443
+ ++ GND +W+ E I + + K L +L+ D +DY WY T
Sbjct: 427 RHFEKSKTGND---FKWEVFSEPIPSAKELPSKQKLPAELYSLL-----KDKTDYGWYTT 478
Query: 444 NADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRG 503
+ +L +D LRI S G L A+VNG Y+ S+ + F++PV G
Sbjct: 479 SVELGPEDIPKKSDVAPVLRILSLGHSLQAFVNGEYIGSKHGSHEEKGFEFQKPVNFKVG 538
Query: 504 KNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGL 563
NQI++L+ VGL + G+ + G P + ++G DL+S+ W ++VGL G
Sbjct: 539 VNQIAILANLVGLPDSGAYMEHRYAG-PKTITILGLMSGTI---DLTSNGWGHQVGLQGE 594
Query: 564 DDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNG 623
+D F + E W ++WYKT F+ P +PV + ++GM KG WVNG
Sbjct: 595 NDSIFTEKGSKKVE--WKDGKGK-GSTISWYKTNFDTPEGTNPVAIGMEGMAKGMIWVNG 651
Query: 624 YNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLV 683
++GR+W +YL+ G P+Q YH+PRS++K N LV
Sbjct: 652 ESIGRHWMSYLSP-----------------------LGKPTQSEYHIPRSFLKPKDNLLV 688
Query: 684 LFEEFGGNPSQINFQTVVVGTACG---------------------QAHENKTME--LTC- 719
+FEE +P +I TV T C + EN T E +TC
Sbjct: 689 IFEEEAISPDKIAILTVNRDTICSFITENHPPNIRSFASKNQKLERVGENLTPEAFITCP 748
Query: 720 HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANL-GA 778
++I+ +++ASFGDP G CG+F G C A ++E+ C+GK +CS+ +A G
Sbjct: 749 DQKKITAVEFASFGDPSGFCGSFIMGKCNAP-SSKKIVEQLCLGKPTCSVPMVKATFTGG 807
Query: 779 TSCAAGTVKRLVVEALC 795
VK L ++ C
Sbjct: 808 NDGCPDVVKTLAIQVKC 824
>gi|19386854|dbj|BAB86232.1| putative beta-D-galactosidase [Oryza sativa Japonica Group]
Length = 774
Score = 551 bits (1421), Expect = e-154, Method: Compositional matrix adjust.
Identities = 324/829 (39%), Positives = 430/829 (51%), Gaps = 155/829 (18%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D R++ I G R++L+S SIHYPRS P MWP L+ +AK+GG D +ETYVFWN HEP +
Sbjct: 38 VTYDHRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHEPAQ 97
Query: 89 RQ--------------------YDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYG 128
Q Y F DL+RF K ++D GLY+ILRIGP+V AEW +G
Sbjct: 98 GQVRAASPKFVMDLACSIRDKPYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWTFG 157
Query: 129 GFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYG 188
G PVWLH PG RT N+ F + M+ FTT IVDM KKE+ FASQGG IILAQ+ENEYG
Sbjct: 158 GVPVWLHYAPGTV-FRTNNEPFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEYG 216
Query: 189 NVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPN 237
++ YG K Y W A MA + + GVPWIMCQ+ DAP P+ F PN+P
Sbjct: 217 DMEQAYGAGAKPYAMWAASMALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQFKPNSPT 276
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
PK WTENW GWF+++G +P R ED+AF+VARFF GG+ QNYY+
Sbjct: 277 KPKFWTENWPGWFQTFGESNPHRPPEDVAFSVARFFGKGGSLQNYYVA------------ 324
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSY 357
D D+ G G + L + +K +T+ SY
Sbjct: 325 ---------DVYTDQSG-------GCVAFLSNVDSEKDKVVTF------------QSRSY 356
Query: 358 NLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRG 417
+LPAWSVSILPDCK FNTAKV +QT + P + + +R + + + G
Sbjct: 357 DLPAWSVSILPDCKNVAFNTAKVRSQTLMMDMVPANLESSKVDGWSIFREK----YGIWG 412
Query: 418 KGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVN 476
N +D +T D +DYLWY T+ D+ D L+G N L I S G + A++N
Sbjct: 413 NIDLVRNGFVDHINTTKDSTDYLWYTTSFDV--DGSHLAGG-NHVLHIESKGHAVQAFLN 469
Query: 477 GNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLL 536
+ S + SN E PV L GKN++SLLS TVGLQN G ++ GI
Sbjct: 470 NELIGSAYGNGSKSNFSVEMPVNLRAGKNKLSLLSMTVGLQNGGPMYEWAGAGITS---- 525
Query: 537 VGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKT 596
V +G E I DLSS+KW YKV +
Sbjct: 526 VKISGMENRIIDLSSNKWEYKVNV------------------------------------ 549
Query: 597 TFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKC 656
+ P +DPV L++Q MGKG AW+NG +GRYWP D C T SCDYRG + +KC
Sbjct: 550 --DVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWPRISPVSDRC-TSSCDYRGTFSPNKC 606
Query: 657 AYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHEN---- 712
CG P+Q WYHVPRSW NTLV+FEE GG+P++I F V + C E+
Sbjct: 607 RRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDPTKITFSRRTVASVCSFVSEHYPSI 666
Query: 713 ----------------KTMELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLP 755
++L+C G+ IS +K+ SFG+P G C ++++GSC + +
Sbjct: 667 DLESWDRNTQNDGRDAAKVQLSCPKGKSISSVKFVSFGNPSGTCRSYQQGSCH-HPNSIS 725
Query: 756 LIEK---------QCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
++EK C+ C++ S+ G C G K L +EA C
Sbjct: 726 VVEKGTLGWAHRRACLNMNGCTVSLSDEGFGEDLC-PGVTKTLAIEADC 773
>gi|356509519|ref|XP_003523495.1| PREDICTED: beta-galactosidase 13-like [Glycine max]
Length = 844
Score = 551 bits (1420), Expect = e-154, Method: Compositional matrix adjust.
Identities = 317/843 (37%), Positives = 456/843 (54%), Gaps = 115/843 (13%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A V++DG+++ I+G R+IL SGS+HY RSTP MWPD++ KA+ GGL+ I+TYVFWNAHE
Sbjct: 43 ARNVTYDGKSLFINGRREILFSGSVHYTRSTPDMWPDILDKARRGGLNVIQTYVFWNAHE 102
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P +++F GN DL++FI+ +Q +G++V LR+GP++ AEWN+GG P WL +PGI R+
Sbjct: 103 PEPGKFNFQGNYDLVKFIRLVQAKGMFVTLRVGPFIQAEWNHGGLPYWLREVPGII-FRS 161
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
N+ + M+ F + I+ M K EKLFA QGGPIILAQIENEY ++ Y + G SY+ W
Sbjct: 162 DNEPYKFHMKAFVSKIIQMMKDEKLFAPQGGPIILAQIENEYNHIQLAYEEKGDSYVQWA 221
Query: 206 AKMATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKS 252
A MA + DIGVPW+MC++ DAP P+ PN P P IWTENWT ++
Sbjct: 222 ANMAVATDIGVPWLMCKQRDAPDPVINACNGRHCGDTFAGPNKPYKPAIWTENWTAQYRV 281
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDE 312
G +R+AED+AF+VARFF G NYYMYHGGTNFGRTS + TT Y +AP+DE
Sbjct: 282 HGDPPSQRSAEDIAFSVARFFSKNGNLVNYYMYHGGTNFGRTSS-VFSTTRYYDEAPLDE 340
Query: 313 YGHLNQPKWGHLRELHKLLKSMEKTLTYG--NVTNTDYGNSVS----------------- 353
YG +PKW HLR++HK L + + G +V ++ + V
Sbjct: 341 YGLPREPKWSHLRDVHKALLLCRRAILGGVPSVQKLNHFHEVRTFERVGTNMCAAFITNN 400
Query: 354 -----------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQ 402
G++Y LP S+SILPDCKT FNT ++ +Q N + + A N+
Sbjct: 401 HTMEPATINFRGTNYFLPPHSISILPDCKTVVFNTQQIVSQHNSRNYERSPAANN---FH 457
Query: 403 WKWRPEMINDFVVRGKGHFALNTLIDQKS---TNDVSDYLWYMTNADLKDDDPILSGSSN 459
W EM N+ + K +N + + D +DY WY T+ +L +D +
Sbjct: 458 W----EMFNEAIPTAK-KMPINLPVPAELYSLLKDTTDYAWYTTSFELSQEDMSMKPGVL 512
Query: 460 MTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNY 519
LR+ S G + A+VNG+ V + + + F+ PV L G N ISLLS+TVGL +
Sbjct: 513 PVLRVMSLGHSMVAFVNGDIVGTAHGTHEEKSFEFQTPVLLRVGTNYISLLSSTVGLPDS 572
Query: 520 GSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERG 579
G+ + G P + ++G DL+ + W ++VGL G + KK ++ + + S +
Sbjct: 573 GAYMEHRYAG-PKSINILGLNRGTL---DLTRNGWGHRVGLKG-EGKKVFSEEGSTSVKW 627
Query: 580 WSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDG 639
VP R ++WY+T F P PV + + GM KG WVNG N+GRYW +YL+
Sbjct: 628 KPLGAVP--RALSWYRTRFGTPEGTGPVAIRMSGMAKGMVWVNGNNIGRYWMSYLSP--- 682
Query: 640 CSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
G P+Q YH+PRS++ N LV+FEE P+Q+
Sbjct: 683 --------------------LGKPTQSEYHIPRSFLNPQDNLLVIFEEEARVPAQVEILN 722
Query: 700 VVVGTACGQAHEN-----------------------KTMELTCH-GRRISEIKYASFGDP 735
V T C E + C G+RI +++ASFG+P
Sbjct: 723 VNRDTICSVVGERDPANVNSWVSRRGNFHPVVKSVGAAASMACATGKRIVAVEFASFGNP 782
Query: 736 QGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEA---NLGATSCAAGTVKRLVVE 792
G CG F GSC A ++E++C+G+++C++ A N G +C VK+L V+
Sbjct: 783 SGYCGDFAMGSCNAAAS-KQIVERECLGQEACTLALDRAVFNNNGVDAC-PDLVKQLAVQ 840
Query: 793 ALC 795
C
Sbjct: 841 VRC 843
>gi|449517114|ref|XP_004165591.1| PREDICTED: beta-galactosidase 9-like, partial [Cucumis sativus]
Length = 763
Score = 549 bits (1415), Expect = e-153, Method: Compositional matrix adjust.
Identities = 328/778 (42%), Positives = 418/778 (53%), Gaps = 132/778 (16%)
Query: 125 WNY-GGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQI 183
W+Y GFP+WL ++PGIE RT N F EMQ F IVD+ + EKLF QGGP+I+ Q+
Sbjct: 1 WDYCRGFPLWLRDVPGIE-FRTDNAPFKEEMQRFVKKIVDLLRDEKLFCWQGGPVIMLQV 59
Query: 184 ENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FT 232
ENEYGN+ S YG G+ YI W MA L VPW+MCQ+ DAPS + F
Sbjct: 60 ENEYGNIESSYGKRGQEYIKWVGNMALGLGAEVPWVMCQQKDAPSTIINSCNGYYCDGFK 119
Query: 233 PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG 292
N+P+ P WTENW GWF SWG + P R EDLAF+VARFFQ G+FQNYYMY GGTNFG
Sbjct: 120 ANSPSKPIFWTENWNGWFTSWGERSPHRPVEDLAFSVARFFQREGSFQNYYMYFGGTNFG 179
Query: 293 RTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGN---------- 342
RT+GGP+ TSYDYD+PIDEYG + +PKWGHL++LH LK E L +
Sbjct: 180 RTAGGPFYITSYDYDSPIDEYGLIREPKWGHLKDLHTALKLCEPALVSADSPQYIKLGPK 239
Query: 343 -----------------------------VTNTDYGNSVS----GSSYNLPAWSVSILPD 369
+ N D +V+ G +YNLP WSVSILPD
Sbjct: 240 QEAHVYHMKSQTDDLTLSKLGTLRNCSAFLANIDERKAVAVKFNGQTYNLPPWSVSILPD 299
Query: 370 CKTEEFNTAKVNTQTNVKVKR---PNQA-------GNDQAPLQ-----WKWRPEMI---- 410
C+ FNTAKV QT++K+ P A DQ L W E I
Sbjct: 300 CQNVVFNTAKVAAQTSIKILELYAPLSANVSLKLHATDQNELSIIANSWMTVKEPIGIWS 359
Query: 411 -NDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMT--LRINSS 467
+F V+G L T D SDYLWYMT + +DD N+T + I+S
Sbjct: 360 DQNFTVKG-------ILEHLNVTKDRSDYLWYMTRIHVSNDDIRFWKERNITPTITIDSV 412
Query: 468 GQVLHAYVNGNYVDS---QWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFD 524
V +VNG S QW K F +PV+ G N + LLS +GLQN G+ +
Sbjct: 413 RDVFRVFVNGKLTGSAIGQWVK-------FVQPVQFLEGYNDLLLLSQAMGLQNSGAFIE 465
Query: 525 MVPNGIPGPVLLVG-RAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSK 583
GI G + L G + GD DLS WTY+VGL G + FY+ + N + W+
Sbjct: 466 KDGAGIRGRIKLTGFKNGD----IDLSKSLWTYQVGLKG-EFLNFYSLEE-NEKADWTEL 519
Query: 584 NV-PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCST 642
+V + TWYK F +P DPV +NL MGKG AWVNG+++GRYW + ++ +DGC
Sbjct: 520 SVDAIPSTFTWYKAYFSSPDGTDPVAINLGSMGKGQAWVNGHHIGRYW-SVVSPKDGCPR 578
Query: 643 ESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVV 702
+ CDYRG Y S KCA NCG P+Q WYH+PRSW+K+ N LVLFEE GGNP +I +
Sbjct: 579 K-CDYRGAYNSGKCATNCGRPTQSWYHIPRSWLKESSNLLVLFEETGGNPLEIVVKLYST 637
Query: 703 GTACGQAHE------------------------NKTMELTC-HGRRISEIKYASFGDPQG 737
G CGQ E N M L C G IS +++AS+G PQG
Sbjct: 638 GVICGQVSESHYPSLRKLSNDYISDGETLSNRANPEMFLHCDDGHVISSVEFASYGTPQG 697
Query: 738 ACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+C F +G C A + L ++ + C+GK SC++E S + G C + VK L VEA C
Sbjct: 698 SCNKFSRGPCHA-TNSLSVVSQACLGKNSCTVEISNSAFGGDPCHS-IVKTLAVEARC 753
>gi|297793965|ref|XP_002864867.1| beta-galactosidase 6 [Arabidopsis lyrata subsp. lyrata]
gi|297310702|gb|EFH41126.1| beta-galactosidase 6 [Arabidopsis lyrata subsp. lyrata]
Length = 716
Score = 548 bits (1412), Expect = e-153, Method: Compositional matrix adjust.
Identities = 322/755 (42%), Positives = 424/755 (56%), Gaps = 92/755 (12%)
Query: 6 HCSRAILLCLILQTLFNL-----SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMW 60
+R LCLIL +F + + A V++DGR++ IDG+RK+L SGSIHYPRSTP MW
Sbjct: 2 EAARVFGLCLILVGMFLVFPGGATAAKGVTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMW 61
Query: 61 PDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPY 120
P LIKK KEGG+D I+TYVFWN HEP QYDF+G DL++FIK I+ QGLYV LRIGP+
Sbjct: 62 PSLIKKTKEGGIDVIQTYVFWNLHEPKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPF 121
Query: 121 VCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIIL 180
+ AEWNYGG P WL ++PG+ RT N+ F MQ FTT IV++ K E L+ASQGGPIIL
Sbjct: 122 IEAEWNYGGLPFWLRDVPGM-VYRTDNEPFKFHMQKFTTKIVNLMKSEGLYASQGGPIIL 180
Query: 181 AQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFT-------- 232
+QIENEY NV + + + G SYI W +MA L GVPWIMC+ DAP P+
Sbjct: 181 SQIENEYANVEAAFHEKGASYIKWAGQMAVGLKTGVPWIMCKSPDAPDPVINTCNGMRCG 240
Query: 233 -----PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHG 287
PN+PN PK+WTE+WT +F+ +G + R+AED+AF F G++ NYYMYHG
Sbjct: 241 ETFPGPNSPNKPKMWTEDWTSFFQVYGTEPYIRSAEDIAFHAVLFIAKNGSYINYYMYHG 300
Query: 288 GTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTD 347
GTNFGRTS ++T YD AP+DEYG L QPK+GHL+ELH +KS L G T
Sbjct: 301 GTNFGRTSSSYFITGYYD-QAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQTILS 359
Query: 348 YG---------NSVSG-------------------SSYNLPAWSVSILPDCKTEEFNTAK 379
G ++ SG SSY+L S+ IL +CK + TAK
Sbjct: 360 LGPMQQAYVFEDASSGCVAFLVNNDAKVSQIQFRKSSYSLSPKSIGILQNCKNLIYETAK 419
Query: 380 VNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDY 438
VN + N +V P Q N P +W+ E I F N L++ + T D +DY
Sbjct: 420 VNVEKNKRVTTPVQVFN--VPEKWEGFRETIPAF---SGTSLKANALLEHTNLTKDKTDY 474
Query: 439 LWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPV 498
LWY + K D P +N ++ I SSG V+H +VN S + P
Sbjct: 475 LWY--TSSFKPDSP----CTNPSIYIESSGHVVHVFVNNALAGSGHGSRDIKVVKLQVPA 528
Query: 499 KLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKV 558
LT G+N IS+LS VGL + G+ + G+ + G G + I DLS +W Y V
Sbjct: 529 SLTNGQNSISILSGMVGLPDSGAYMERKSYGLTKVQISCG--GTKPI--DLSGSQWGYSV 584
Query: 559 GLYGLDDKKFYNAKAANSERGWSSKNVPL--NRRMTWYKTTFEAPLENDPVVLNLQGMGK 616
GL G + + + N + WS N L NR + WYKT F+ P + PV LN+ MGK
Sbjct: 585 GLLG-EKVRLQQWRNLNRVK-WSMNNAGLIKNRPLIWYKTIFDGPNGDGPVGLNMSSMGK 642
Query: 617 GFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIK 676
G WVNG ++GRYW ++L G+PSQ YH+PR ++K
Sbjct: 643 GEIWVNGESIGRYWVSFLTP-----------------------SGHPSQSIYHIPREFLK 679
Query: 677 DGVNTLVLFEEFGGNPSQINFQTV-VVGTACGQAH 710
N LV+FEE GG+P I+ T+ V+G+ Q+
Sbjct: 680 PSGNLLVVFEEEGGDPLGISLNTISVIGSNRAQSQ 714
>gi|330689960|gb|AEC33272.1| beta-galactosidase [Ziziphus jujuba]
Length = 730
Score = 547 bits (1410), Expect = e-153, Method: Compositional matrix adjust.
Identities = 316/731 (43%), Positives = 409/731 (55%), Gaps = 81/731 (11%)
Query: 128 GGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEY 187
GGFPVWL +PGI RT N F MQ FT IV M K E LFASQGGPIIL+QIENEY
Sbjct: 1 GGFPVWLKYVPGIS-FRTDNGPFKTAMQGFTQKIVQMLKSENLFASQGGPIILSQIENEY 59
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNP 236
G G AG+SYINW AKMA L+ GVPW+MC+E DAP P+ F+PN P
Sbjct: 60 GPESKALGAAGRSYINWAAKMAVGLNTGVPWVMCKEDDAPDPVINACNGFYCDGFSPNKP 119
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
P +WTE W+GWF +GG +R +DLAFAVARF Q GG++ NYYMYHGGTNFGRT+G
Sbjct: 120 YKPILWTEAWSGWFTEFGGTVHQRPVQDLAFAVARFIQKGGSYFNYYMYHGGTNFGRTAG 179
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG------- 349
GP++TTSYDYDAPIDEYG +PK+ HL+ELHK +K E L T T G
Sbjct: 180 GPFVTTSYDYDAPIDEYGLTREPKYSHLKELHKAIKLSEDALVSAGPTITSLGTYEQAYI 239
Query: 350 ---------------NSVSGSS-------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVK 387
NS S + YNLP WS+SILPDC+ +NTA V QT+
Sbjct: 240 YNSGPRKCAAFLANYNSKSAARVLFNNRHYNLPPWSISILPDCRNVAYNTALVGVQTSHV 299
Query: 388 VKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADL 447
P + L W+ E+I+ R + A+ L T D SDYLWYMT+ D+
Sbjct: 300 HMLPT----GTSLLSWETYDEVISSLDERAR-MTAVGLLEQINVTRDTSDYLWYMTSVDI 354
Query: 448 KDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQI 507
+ L G TL + S+G + ++NG + S + F PV L G N+I
Sbjct: 355 SSSESFLRGGQKPTLNVQSAGHAVRVFINGQFSGSAFGTREHRQFTFTGPVNLRAGSNKI 414
Query: 508 SLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYG--LDD 565
SLLS VGL N G +++ G+ GPV L G + +DL+ KW+Y+VGL G ++
Sbjct: 415 SLLSIAVGLPNVGFHYELWETGVLGPVFLNGLDNGK---RDLTWQKWSYQVGLKGEAMNL 471
Query: 566 KKFYNAKAANSERG-WSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGY 624
A +A+ RG ++++V + +TWYK F AP N+P+ L+L+ MGKG +NG
Sbjct: 472 VTPEGASSADWVRGSLAARSV---QPLTWYKAYFNAPNGNEPLALDLRSMGKGQVRINGQ 528
Query: 625 NLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVL 684
++GRYW Y A+ D E+C Y G G +P+Q WYHVPRSW+K N LV+
Sbjct: 529 SIGRYWTAY-AKGD---CEACSYTGHSGRQNVNLVVASPTQRWYHVPRSWLKPKQNLLVI 584
Query: 685 FEEFGGNPSQINFQTVVVGTACGQAHENK-------------------TMELTCH-GRRI 724
FEE GG+ S+I + C A EN T+ L C G+ I
Sbjct: 585 FEELGGDASKIALLRRSLTNVCANAFENHPSMAKYSTSSQDGSKVKEATVNLQCGPGQSI 644
Query: 725 SEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAG 784
S I++ASFG P G CG+F G+C A + +IEK+CVG+KSCS+ S + GA C
Sbjct: 645 SAIEFASFGTPSGTCGSFHIGTCHAP-NSRSIIEKKCVGQKSCSVTISNSIFGADPC-PN 702
Query: 785 TVKRLVVEALC 795
+KRL VEA+C
Sbjct: 703 VLKRLTVEAVC 713
>gi|238009208|gb|ACR35639.1| unknown [Zea mays]
Length = 677
Score = 546 bits (1407), Expect = e-152, Method: Compositional matrix adjust.
Identities = 307/686 (44%), Positives = 395/686 (57%), Gaps = 86/686 (12%)
Query: 181 AQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM---------- 230
A+IENEYGN+ S YG GK+Y+ W A MA SLD GVPW+MCQ++DAP P+
Sbjct: 6 AKIENEYGNIDSAYGAPGKAYMRWAAGMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCD 65
Query: 231 -FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGT 289
FTPN+ PK+WTENW+GWF S+GG P R EDLAFAVARF+Q GGTFQNYYMYHGGT
Sbjct: 66 QFTPNSAAKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGT 125
Query: 290 NFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG 349
N R+SGGP++ TSYDYDAPIDEYG + QPKWGHLR++HK +K E L + + T G
Sbjct: 126 NLDRSSGGPFIATSYDYDAPIDEYGLVRQPKWGHLRDVHKAIKLCEPALIATDPSYTSLG 185
Query: 350 NSV----------------------------SGSSYNLPAWSVSILPDCKTEEFNTAKVN 381
+V +G Y LPAWSVSILPDCK NTA++N
Sbjct: 186 PNVEAAVYKVGSVCAAFLANIDGQSDKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQIN 245
Query: 382 TQTNVKVKRPNQAGN----------DQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-K 430
+QT R ++ N + A W + E + + L++Q
Sbjct: 246 SQTTGSEMRYLESSNVASDGSFVTPELAVSDWSYAIEPVG---ITKDNALTKAGLMEQIN 302
Query: 431 STNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGAS 490
+T D SD+LWY T+ +K D+P L+GS + L +NS G VL Y+NG S +S
Sbjct: 303 TTADASDFLWYSTSITVKGDEPYLNGSQS-NLAVNSLGHVLQVYINGKIAGSAQGSASSS 361
Query: 491 NDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLS 550
+++P++L GKN+I LLSATVGL NYG+ FD+V GI GPV L G G DLS
Sbjct: 362 LISWQKPIELVPGKNKIDLLSATVGLSNYGAFFDLVGAGITGPVKLSGLNG----ALDLS 417
Query: 551 SHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNV-PLNRRMTWYKTTFEAPLENDPVVL 609
S +WTY++GL G +D Y+ A+ E W S N P+N + WYKT F P +DPV +
Sbjct: 418 SAEWTYQIGLRG-EDLHLYDPSEASPE--WVSANAYPINHPLIWYKTKFTPPAGDDPVAI 474
Query: 610 NLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYH 669
+ GMGKG AWVNG ++GRYWPT LA + GC SC+YRG Y S KC CG PSQ YH
Sbjct: 475 DFTGMGKGEAWVNGQSIGRYWPTNLAPQSGC-VNSCNYRGAYSSSKCLKKCGQPSQTLYH 533
Query: 670 VPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHE------------------ 711
VPRS+++ G N LVLFE FGG+PS+I+F G+ C Q E
Sbjct: 534 VPRSFLQPGSNDLVLFEHFGGDPSKISFVMRQTGSVCAQVSEAHPAQIDSWSSQQPMQRY 593
Query: 712 NKTMELTC--HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSI 769
+ L C G+ IS +K+ASFG P G CG++ G C + L ++++ C+G S
Sbjct: 594 GPALRLECPKEGQVISSVKFASFGTPSGTCGSYSHGEC-SSTQALSIVQEACIGVSS-CS 651
Query: 770 EASEANLGATSCAAGTVKRLVVEALC 795
+N C G K L VEA C
Sbjct: 652 VPVSSNYFGNPC-TGVTKSLAVEAAC 676
>gi|10862896|emb|CAC13966.1| putative beta-galactosidase [Nicotiana tabacum]
Length = 715
Score = 546 bits (1406), Expect = e-152, Method: Compositional matrix adjust.
Identities = 301/725 (41%), Positives = 416/725 (57%), Gaps = 100/725 (13%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DGR++ ++GER++L SGSIHYPR P MWPD+I+KAKEGGL+ I+TYVFWN HEP++
Sbjct: 28 VTYDGRSMIVNGERELLFSGSIHYPRMPPEMWPDIIRKAKEGGLNLIQTYVFWNIHEPVQ 87
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Q++F GN D+++FIKTI +QGLYV LRIGPY+ AEWN GGFP WL +P I R+ N+
Sbjct: 88 GQFNFEGNYDVVKFIKTIGEQGLYVTLRIGPYIEAEWNQGGFPYWLREVPNI-TFRSYNE 146
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F++ M+ ++ +++D+ KKEKLFA QGGPII+AQIENEY NV Y D GK Y+ W A M
Sbjct: 147 PFIHHMKKYSEMVIDLMKKEKLFAPQGGPIIMAQIENEYNNVQLAYRDNGKKYVEWAANM 206
Query: 209 ATSLDIGVPWIMCQESDAPSPM------------FT-PNNPNSPKIWTENWTGWFKSWGG 255
AT L GVPWIMC++ DAP+ + FT PN PN P +WTENWT ++++G
Sbjct: 207 ATGLYNGVPWIMCKQKDAPAQVINTCNGRHCADTFTGPNGPNKPSLWTENWTAQYRTFGD 266
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
+R AED+AF+VARFF GT NYYMY+GGTN+GRT G ++TT Y +AP+DE+G
Sbjct: 267 PPSQRAAEDIAFSVARFFAKNGTLTNYYMYYGGTNYGRT-GSSFVTTRYYDEAPLDEFGL 325
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYG-----------------------------NVTNT 346
+PKW HLR+LH+ L+ + L +G N T
Sbjct: 326 YREPKWSHLRDLHRALRLSRRALLWGTPSVQKINQHLEITVYEKPGTDCAAFLTNNHTTL 385
Query: 347 DYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
G Y LP SVSILPDCK NT + +Q N + P++ + L+W+
Sbjct: 386 PATIKFRGREYYLPEKSVSILPDCKLLSTNTQTIVSQHNSRNFLPSEKAKN---LKWEMY 442
Query: 407 PE---MINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLR 463
E I+D ++ + L +L T D SDY WY T+ + D + L+
Sbjct: 443 QEKVPTISDLSLKNREPLELYSL-----TKDTSDYAWYSTSINFDRHDLPMRPDILPVLQ 497
Query: 464 INSSGQVLHAYVNGNYVDSQWTKYGASNDL-----FERPVKLTRGKNQISLLSATVGLQN 518
I S G L A+VNG +V +G N++ F++PV L G N IS+L+ TVG N
Sbjct: 498 IASMGHALSAFVNGEFVG-----FGHGNNIEKSFVFQKPVILKPGTNTISILAETVGFPN 552
Query: 519 YGSKFDMV---PNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAAN 575
G+ + P GI L+ G D++ + W ++VG++G ++ F A
Sbjct: 553 SGAYMEKRFAGPRGITVQGLMAGTL-------DITQNNWGHEVGVFGEKEQLFTEEGAKK 605
Query: 576 SERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLA 635
+ W+ N P +TWYKT F+AP N+PV L + M KG WVNG +LGRYW ++L+
Sbjct: 606 VK--WTPVNGPTKGAVTWYKTYFDAPEGNNPVALKMDKMQKGMMWVNGNSLGRYWSSFLS 663
Query: 636 EEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQI 695
G P+Q YH+PR+++K N LV+FEE GG+P I
Sbjct: 664 P-----------------------LGQPTQFEYHIPRAFLKPTNNLLVIFEETGGHPETI 700
Query: 696 NFQTV 700
Q V
Sbjct: 701 EVQIV 705
>gi|224135691|ref|XP_002327281.1| predicted protein [Populus trichocarpa]
gi|222835651|gb|EEE74086.1| predicted protein [Populus trichocarpa]
Length = 788
Score = 545 bits (1404), Expect = e-152, Method: Compositional matrix adjust.
Identities = 336/863 (38%), Positives = 446/863 (51%), Gaps = 154/863 (17%)
Query: 9 RAILLCLILQTLFNLSLAYR---VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIK 65
R + L + + A R V++DGR++ IDG+RKI+ SGSIHYPRSTP MWP LI
Sbjct: 3 RVLFLVAAVLAVIGSGSAVRGGDVTYDGRSLIIDGQRKIVFSGSIHYPRSTPEMWPSLIA 62
Query: 66 KAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEW 125
KAKEGGLDAIETYVFWN HEP YDF+G D++RFIK +Q QGLY LRIGP++ +EW
Sbjct: 63 KAKEGGLDAIETYVFWNVHEPQPGHYDFSGGHDIVRFIKEVQAQGLYACLRIGPFIQSEW 122
Query: 126 NYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIEN 185
+YGG P WLH++PGI R+ N+ F MQNFT +V M + E L+ASQGGPIIL+QIEN
Sbjct: 123 SYGGLPFWLHDIPGI-VFRSDNEPFKVYMQNFTAKVVSMMQSENLYASQGGPIILSQIEN 181
Query: 186 EYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFT------------- 232
EYG V YG G +Y+ W A+MA L GVPW+MC++++AP +
Sbjct: 182 EYGTVQKAYGQEGLAYVQWAAQMAEGLQTGVPWVMCKQNNAPGHVINSCNGMKCGQTFVG 241
Query: 233 PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQF-GGTFQNYYMYHGGTNF 291
PN+PN P IWTENWT ++AED+AF V F G+F NYYMYHGGTNF
Sbjct: 242 PNSPNKPSIWTENWT-----------TQSAEDIAFHVTLFIAAKKGSFVNYYMYHGGTNF 290
Query: 292 GRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG-- 349
GRT+ ++TTSY AP+DEYG QPKWGHL+ELH +K L G N G
Sbjct: 291 GRTASA-FVTTSYYDQAPLDEYGLTTQPKWGHLKELHAAIKLCSTPLLSGVQVNLYLGPQ 349
Query: 350 ------NSVSG---------------------SSYNLPAWSVSILPDCKTEEFNTAKVNT 382
N+VSG +SY+LP S+SILPDCK N
Sbjct: 350 QQAYIFNAVSGECAAFLINNDSSNAASVPFRNASYDLPPMSISILPDCK---------NV 400
Query: 383 QTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWY 441
T + + A W+ E I +F TL++Q +T D SDYLWY
Sbjct: 401 STQYTTRTMGRGEVLDAADVWQEFTEAIPNF---DSTSTRSETLLEQMNTTKDSSDYLWY 457
Query: 442 MTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLT 501
+ D + L ++S G LHA+VNG V S FE V L+
Sbjct: 458 TFRFQHESSD------TQAILDVSSLGHALHAFVNGQAVGSVQGSRKNPRFKFETSVSLS 511
Query: 502 RGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLY 561
+G N +SLLS VG+ + G+ + G+ ++ D+ D +++ W Y++GL
Sbjct: 512 KGINNVSLLSVMVGMPDSGAFLENRAAGLRTVMIR-----DKQDNNDFTNYSWGYQIGLQ 566
Query: 562 GLDDKKFYNAKAANSE--RGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFA 619
G + + Y + ++ + +S+ PL TWYKT +AP + PV LNL MGKG A
Sbjct: 567 G-ETLQIYTEQGSSQVQWKKFSNAGNPL----TWYKTQVDAPPGDVPVGLNLASMGKGEA 621
Query: 620 WVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGV 679
WVNG ++GRYWP+ YHVPRS++K
Sbjct: 622 WVNGQSIGRYWPS-----------------------------------YHVPRSFLKPTG 646
Query: 680 NTLVLFEEFGGNPSQINFQTVVVGTACGQAHENKTMELT--------------CHGRR-- 723
N LVL EE GGNP Q++ TV + CG + ++ GRR
Sbjct: 647 NLLVLQEEEGGNPLQVSLDTVTISQVCGHVTASHLAPVSSWIEHNQRYKNPAKVSGRRPK 706
Query: 724 ----------ISEIKYASFGDPQGAC-GAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEAS 772
IS I +AS+G P G C + G+C ++ + ++E+ C+GK CSI S
Sbjct: 707 VLLACPSKSKISRISFASYGTPLGNCRNSMAVGTCHSQ-NSKAVVEEACLGKMKCSIPVS 765
Query: 773 EANLGATSCAAGTVKRLVVEALC 795
G C A K L+V A C
Sbjct: 766 VRQFGGDPCPA-KAKSLMVVAEC 787
>gi|357453875|ref|XP_003597218.1| Beta-galactosidase [Medicago truncatula]
gi|355486266|gb|AES67469.1| Beta-galactosidase [Medicago truncatula]
Length = 2260
Score = 545 bits (1403), Expect = e-152, Method: Compositional matrix adjust.
Identities = 276/520 (53%), Positives = 350/520 (67%), Gaps = 57/520 (10%)
Query: 9 RAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAK 68
RA + L+L V +D RA+ IDG+R++L+SGSIHYPRSTP MWPDLI+K+K
Sbjct: 2 RATEIVLVLLWFLPTMFCTNVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSK 61
Query: 69 EGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYG 128
+GGLD IETYVFWN HEP++ QYDF G DL++F+K + + GLYV LRIGPYVC+EWNYG
Sbjct: 62 DGGLDVIETYVFWNLHEPVKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCSEWNYG 121
Query: 129 GFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYG 188
GFP+WLH +PGI + RT N+ F EM+ FTT IVD+ K+EKL+ASQGGPIIL+QIENEYG
Sbjct: 122 GFPLWLHFIPGI-KFRTDNEPFKVEMKRFTTKIVDLMKQEKLYASQGGPIILSQIENEYG 180
Query: 189 NVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM------------FTPNNP 236
++ S YG AGKSYINW AKMATSLD GVPW+MCQ++DAP P+ FTPN+
Sbjct: 181 DIDSAYGSAGKSYINWAAKMATSLDTGVPWVMCQQADAPDPIVINTCNGFYCDQFTPNSK 240
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
PK+WTENW+ W+ +GG P R EDLAFAVARFFQ GGTFQNYYMYHGGTNF R++G
Sbjct: 241 TKPKLWTENWSAWYLLFGGGFPHRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRSTG 300
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTL-------TY--------- 340
GP++ TSYD+DAPIDEYG + QPKWGHL+++HK +K E+ L TY
Sbjct: 301 GPFIATSYDFDAPIDEYGVIRQPKWGHLKDVHKAIKLCEEALIAAEPKITYLGPNLEAAV 360
Query: 341 ---GNV---------TNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV-- 386
G+V +D + SG+SY+LPAWSVSILPDCK NTAK+N+ + +
Sbjct: 361 YKTGSVCAAFLANVDAKSDKTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKINSASTISN 420
Query: 387 ---KVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNT-LIDQKS-TNDVSDYLWY 441
+ + + + ++ + +W W IN+ V K T L++Q + T D SDYLWY
Sbjct: 421 FVTESLKEDISSSETSRSKWSW----INEPVGISKDDILSKTGLLEQINITADRSDYLWY 476
Query: 442 MTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVD 481
+ DLKDD S L I S G LHA++NG D
Sbjct: 477 SLSVDLKDDP-----GSQTVLHIESLGHALHAFINGKLAD 511
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 146/322 (45%), Positives = 193/322 (59%), Gaps = 34/322 (10%)
Query: 497 PVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVG-RAGDETIIKDLSSHKWT 555
P+ + GKN+I LLS TVGLQNYG+ FD GI GPV+L G + G++T+ DLSS KWT
Sbjct: 1949 PITVLSGKNKIDLLSLTVGLQNYGAFFDTWGAGITGPVILKGLKNGNKTL--DLSSRKWT 2006
Query: 556 YKVGLYGLDDKKFYNAKAANSERGWSSKNV-PLNRRMTWYKTTFEAPLENDPVVLNLQGM 614
Y+VGL G D ++ S W+SK P + + WYKT F+AP ++PVV++ GM
Sbjct: 2007 YQVGLKGED-----LGLSSGSSGAWNSKTTFPKKQPLIWYKTNFDAPSGSNPVVIDFTGM 2061
Query: 615 GKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSW 674
GKG AWVNG ++GRYWPTY+A C T+SC+YRGP+ KC NCG PSQ YHVP+S+
Sbjct: 2062 GKGEAWVNGQSIGRYWPTYVASNVDC-TDSCNYRGPFTQTKCHMNCGKPSQTLYHVPQSF 2120
Query: 675 IKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHENK-------------------TM 715
+K NTLVLFEE GG+P+QI+F T +G+ C ++ +
Sbjct: 2121 LKPNGNTLVLFEESGGDPTQISFATKQIGSVCAHVSDSHPPQIDLWNQDTESGGKVGPAL 2180
Query: 716 ELTC--HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASE 773
L C H + IS IK+AS+G P G CG F +G C + L +++K C+G +SCSI S
Sbjct: 2181 LLNCPNHNQVISSIKFASYGTPLGTCGNFYRGRCSSN-KTLSIVKKACIGSRSCSIGVST 2239
Query: 774 ANLGATSCAAGTVKRLVVEALC 795
G G K L VEA C
Sbjct: 2240 DTFGDP--CKGVPKSLAVEATC 2259
>gi|224080622|ref|XP_002306183.1| predicted protein [Populus trichocarpa]
gi|222849147|gb|EEE86694.1| predicted protein [Populus trichocarpa]
Length = 838
Score = 543 bits (1399), Expect = e-151, Method: Compositional matrix adjust.
Identities = 323/838 (38%), Positives = 452/838 (53%), Gaps = 113/838 (13%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DGR++ I+G+R++L SGSIHYPRSTP MWP+LI+KAK GGL+ I+TYVFWN HEP +
Sbjct: 31 VTYDGRSLIINGKRELLFSGSIHYPRSTPEMWPELIQKAKRGGLNVIQTYVFWNIHEPEQ 90
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+++F G+ DL++FIKTI + G+ +R+GP++ AEWN+GG P WL +P I R+ N
Sbjct: 91 GKFNFEGSYDLVKFIKTIGENGMSATIRLGPFIQAEWNHGGLPYWLREIPDI-IFRSDNA 149
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F M+ F T+I++ K+EKLFASQGGPIILAQIENEY V Y + G SY+ W M
Sbjct: 150 PFKLHMERFVTMIINKLKEEKLFASQGGPIILAQIENEYNTVQLAYRNLGVSYVQWAGNM 209
Query: 209 ATSLDIGVPWIMCQESDAPSPM------------FT-PNNPNSPKIWTENWTGWFKSWGG 255
A L GVPW+MC++ DAP P+ FT PN+P+ P +WTENWT F+ +G
Sbjct: 210 ALGLKTGVPWVMCKQKDAPGPVINTCNGRHCGDTFTGPNSPDKPSLWTENWTAQFRVFGD 269
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
+R+AED AF+VAR+F G+ NYYMYHGGTNF RT+ ++TT Y +AP+DEYG
Sbjct: 270 PPSQRSAEDTAFSVARWFSKNGSLVNYYMYHGGTNFDRTAAS-FVTTRYYDEAPLDEYGL 328
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYG--NV--------------------------TNTD 347
+PKWGHL++LH+ L +K L +G NV NT
Sbjct: 329 QREPKWGHLKDLHRALNLCKKALLWGTPNVQRLSADVEARFFEQPRTNDCAAFLANNNTK 388
Query: 348 YGNSVS--GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK--VKRPNQAGNDQAPLQW 403
+V+ G Y LPA S+SILPDCKT +NT V +Q N + VK G L+W
Sbjct: 389 DPETVTFRGKKYYLPAKSISILPDCKTVVYNTMTVVSQHNSRNFVKSRKTDGK----LEW 444
Query: 404 KWRPEMI-NDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTL 462
K E I ++ +V + L L T D +DY W+ T ++ +D N L
Sbjct: 445 KMFSETIPSNLLVDSRIPRELYNL-----TKDKTDYAWFTTTINVDRNDLSARKDINPVL 499
Query: 463 RINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSK 522
R+ S G + A++NG ++ S + + + VKL G N ++LL + VGL + G+
Sbjct: 500 RVASLGHAMVAFINGEFIGSAHGSQIEKSFVLQHSVKLKPGINFVTLLGSLVGLPDSGAY 559
Query: 523 FDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSS 582
+ G G +L G T DLSS+ W ++V L G K F K + W+
Sbjct: 560 MEHRYAGPRGVSIL----GLNTGTLDLSSNGWGHQVALSGETAKVF--TKEGGRKVTWTK 613
Query: 583 KNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCST 642
N +TWYKT F+AP PV + + GM KG W+NG ++GRYW Y++
Sbjct: 614 VNKD-GPPVTWYKTRFDAPEGKSPVAVRMTGMKKGMIWINGKSIGRYWMNYISP------ 666
Query: 643 ESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVV 702
G P+Q YH+PRS++K N +V+ EE G +P +I TV
Sbjct: 667 -----------------LGEPTQSEYHIPRSYLKPTNNLMVILEEEGASPEKIEILTVNR 709
Query: 703 GTACGQAHE------------NKTM-----------ELTC-HGRRISEIKYASFGDPQGA 738
T C E NK L C + ++I +++ASFGDP G
Sbjct: 710 DTICSYVTEYHPPNVRSWERKNKKFTPVADDAKPAARLKCPNKKKIVAVQFASFGDPSGT 769
Query: 739 CGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANL-GATSCAAGTVKRLVVEALC 795
CG F G+C++ I ++E+ C+GK SC I + G K L V+ C
Sbjct: 770 CGNFAVGTCDSPISK-QVVEQHCLGKTSCDIPMDKGLFNGKKDNCPNLTKNLAVQVKC 826
>gi|224103199|ref|XP_002312963.1| predicted protein [Populus trichocarpa]
gi|222849371|gb|EEE86918.1| predicted protein [Populus trichocarpa]
Length = 835
Score = 542 bits (1397), Expect = e-151, Method: Compositional matrix adjust.
Identities = 321/842 (38%), Positives = 449/842 (53%), Gaps = 122/842 (14%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D R++ I+G+R++L SGSIHYPRSTP MWP+LI KAK GGL+ I+TYVFWN HEP +
Sbjct: 31 VTYDERSLIINGKRELLFSGSIHYPRSTPDMWPELILKAKRGGLNVIQTYVFWNIHEPEQ 90
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+++F G DL++FIKTI + G++ LR+GP++ AEWN+GG P WL +P I R+ N
Sbjct: 91 GKFNFEGPYDLVKFIKTIGENGMFATLRLGPFIQAEWNHGGLPYWLREIPDI-IFRSDNA 149
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F + M+ F T I+DM K+EKLFASQGGPIIL+QIENEY V Y + G SYI W M
Sbjct: 150 PFKHHMEKFVTKIIDMMKEEKLFASQGGPIILSQIENEYNTVQLAYKNLGVSYIQWAGNM 209
Query: 209 ATSLDIGVPWIMCQESDAPSPM------------FT-PNNPNSPKIWTENWTGWFKSWGG 255
A L+ GVPW+MC++ DAP P+ FT PN PN P +WTENWT F+ +G
Sbjct: 210 ALGLNTGVPWVMCKQKDAPGPVINTCNGRHCGDTFTGPNKPNKPSLWTENWTAQFRVFGD 269
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
+R+AED AF+VAR+F G+ NYYMYHGGTNF RT+ ++TT Y +AP+DEYG
Sbjct: 270 PPSQRSAEDTAFSVARWFSKNGSLVNYYMYHGGTNFDRTAAS-FVTTRYYDEAPLDEYGL 328
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYGN----------------------------VTNTD 347
+PKWGHL++LH+ L +K L +GN N+
Sbjct: 329 QREPKWGHLKDLHRALNLCKKALLWGNPNVQKLSADVEARFYEQPGTKVCAAFLASNNSK 388
Query: 348 YGNSVS--GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTN----VKVKRPNQAGNDQAPL 401
+V G Y LPA S+SILPDCKT +NT V +Q N VK ++ N+ L
Sbjct: 389 EAETVKFRGQEYYLPARSISILPDCKTVVYNTMTVVSQHNSRNFVKSRKTNK-------L 441
Query: 402 QWKWRPEMINDFVVRGKGHFALNTLIDQK---STNDVSDYLWYMTNADLKDDDPILSGSS 458
+W E I +++ + ++ T D +DY+W+ T ++ D
Sbjct: 442 EWNMYSETI-------PAQLQVDSSLPKELYNLTKDKTDYVWFTTTINVDRRDMNERKRI 494
Query: 459 NMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQN 518
N LR+ S G + A+VNG ++ S + + + V L G N ++LL VGL +
Sbjct: 495 NPVLRVASLGHAMVAFVNGEFIGSAHGSQIEKSFVLQHSVDLKPGINFVTLLGTLVGLPD 554
Query: 519 YGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSER 578
G+ + G G +L G T DL+S+ W ++VGL G K F K +
Sbjct: 555 SGAYMEHRYAGPRGVSIL----GLNTGTLDLTSNGWGHQVGLSGETAKLF--TKEGGGKV 608
Query: 579 GWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEED 638
W +K +TWYKT F+AP PV + + GM KG W+NG ++GRYW TY++
Sbjct: 609 TW-TKVQKAGPPVTWYKTHFDAPEGKSPVAVRMTGMNKGMIWINGKSIGRYWMTYVSP-- 665
Query: 639 GCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQ 698
G P+Q YH+PRS++K N +V+FEE NP +I
Sbjct: 666 ---------------------LGEPTQSEYHIPRSYLKPTDNLMVIFEEEEANPEKIEIL 704
Query: 699 TVVVGTACGQAHE-------------NK----------TMELTC-HGRRISEIKYASFGD 734
TV T C E NK L C + ++I +++ASFGD
Sbjct: 705 TVNRDTICSYVTEYHPPSVKSWERKNNKFTPVVDNAKPAAHLKCPNQKKIIAVQFASFGD 764
Query: 735 PQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANL-GATSCAAGTVKRLVVEA 793
P G CG + G+C + + ++E+ C+GK SC I + G G K L V+
Sbjct: 765 PLGTCGDYAVGTCHSLVSK-QVVEEHCLGKTSCDIPIDKGLFAGKKDDCPGISKTLAVQV 823
Query: 794 LC 795
C
Sbjct: 824 KC 825
>gi|110739416|dbj|BAF01618.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 718
Score = 542 bits (1397), Expect = e-151, Method: Compositional matrix adjust.
Identities = 321/750 (42%), Positives = 417/750 (55%), Gaps = 91/750 (12%)
Query: 10 AILLCLILQTLFNLS----LAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIK 65
+ L LI+ T S A V++DGR++ IDG+RK+L SGSIHYPRSTP MWP LIK
Sbjct: 9 GLCLILIVGTFLEFSGGATAAKGVTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPSLIK 68
Query: 66 KAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEW 125
KAKEGG+D I+TYVFWN HEP QYDF+G DL++FIK I+ QGLYV LRIGP++ AEW
Sbjct: 69 KAKEGGIDVIQTYVFWNLHEPKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIEAEW 128
Query: 126 NYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIEN 185
NYGG P WL ++PG+ RT N+ F MQ FT IVD+ K E L+ASQGGPIIL+QIEN
Sbjct: 129 NYGGLPFWLRDVPGM-VYRTDNEPFKFHMQKFTAKIVDLMKSEGLYASQGGPIILSQIEN 187
Query: 186 EYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFT------------- 232
EY NV + + G SYI W +MA L GVPWIMC+ DAP P+
Sbjct: 188 EYANVEGAFHEKGASYIKWAGQMAVGLKTGVPWIMCKSPDAPDPVINTCNGMKCGETFPG 247
Query: 233 PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG 292
PN+PN PK+WTE+WT +F+ +G + R+AED+AF A F G++ NYYMYHGGTNFG
Sbjct: 248 PNSPNKPKMWTEDWTSFFQVYGKEPYIRSAEDIAFHAALFVAKNGSYINYYMYHGGTNFG 307
Query: 293 RTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG--- 349
RTS ++T YD AP+DEYG L QPK+GHL+ELH +KS L G T G
Sbjct: 308 RTSSSYFITGYYD-QAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQTILSLGPMQ 366
Query: 350 ------------------NSVSGS-------SYNLPAWSVSILPDCKTEEFNTAKVNTQT 384
N S +Y+L S+ IL +CK + TAKVN +
Sbjct: 367 QAYVFEDANNGCVAFLVNNDAKASQIQFRNNAYSLSPKSIGILQNCKNLIYETAKVNVKM 426
Query: 385 NVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMT 443
N +V P Q N P W E I F N L++ + T D +DYLWY +
Sbjct: 427 NTRVTTPVQVFN--VPDNWNLFRETIPAFP---GTSLKTNALLEHTNLTKDKTDYLWYTS 481
Query: 444 NADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRG 503
+ K D P +N ++ SSG V+H +VN S + PV L G
Sbjct: 482 S--FKLDSP----CTNPSIYTESSGHVVHVFVNNALAGSGHGSRDIRVVKLQAPVSLING 535
Query: 504 KNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGL 563
+N IS+LS VGL + G+ + G+ + G G + I DLS +W Y VGL G
Sbjct: 536 QNNISILSGMVGLPDSGAYMERRSYGLTKVQISCG--GTKPI--DLSRSQWGYSVGLLG- 590
Query: 564 DDKKFYNAKAANSERGWSSKNVPL--NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWV 621
+ + Y K N + WS L NR + WYKTTF+ P + PV L++ MGKG WV
Sbjct: 591 EKVRLYQWKNLNRVK-WSMNKAGLIKNRPLAWYKTTFDGPNGDGPVGLHMSSMGKGEIWV 649
Query: 622 NGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNT 681
NG ++GRYW ++L G PSQ YH+PR+++K N
Sbjct: 650 NGESIGRYWVSFLTP-----------------------AGQPSQSIYHIPRAFLKPSGNL 686
Query: 682 LVLFEEFGGNPSQINFQTV-VVGTACGQAH 710
LV+FEE GG+P I+ T+ VVG++ Q+
Sbjct: 687 LVVFEEEGGDPLGISLNTISVVGSSQAQSQ 716
>gi|357142200|ref|XP_003572492.1| PREDICTED: beta-galactosidase 11-like [Brachypodium distachyon]
Length = 823
Score = 541 bits (1395), Expect = e-151, Method: Compositional matrix adjust.
Identities = 320/846 (37%), Positives = 447/846 (52%), Gaps = 125/846 (14%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
++ D R++ +DG R + SGSIHYPRS P MWPDLI +AKEGGL+ IE+YVFWN HEP
Sbjct: 15 ITFDRRSLMVDGRRDLFFSGSIHYPRSPPHMWPDLIARAKEGGLNVIESYVFWNGHEPEM 74
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y+F G D+I+F K +Q+ ++ ++RIGP+V AEWN+GG P WL +P I RT N+
Sbjct: 75 GVYNFEGRYDMIKFFKLVQEHEMFAMVRIGPFVQAEWNHGGLPYWLREVPDII-FRTNNE 133
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ F T+IV+ K KLFASQGGPIILAQIENEY ++ + + + G +YI+W AKM
Sbjct: 134 PFKKHMQKFVTMIVNKLKDAKLFASQGGPIILAQIENEYQHLEAAFKENGTTYIHWAAKM 193
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGG 255
A+ L+IGVPWIMC+++ AP + P + N P +WTENWT ++ +G
Sbjct: 194 ASDLNIGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPTDKNKPLLWTENWTAQYRVFGD 253
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
+R+AED+AFAVARF+ GGT NYYMYHGGTNFGRT G ++ Y +AP+DE+G
Sbjct: 254 PPSQRSAEDIAFAVARFYSVGGTMVNYYMYHGGTNFGRT-GASFVMPRYYDEAPLDEFGL 312
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN------------------------- 350
+PKWGHLR+LH L+ +K + +GN +N G
Sbjct: 313 YKEPKWGHLRDLHHALRLCKKAILWGNPSNQPLGKLYEARLFEIPEQKICVAFLSNHNTK 372
Query: 351 -----SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKW 405
+ G Y +P SVSIL DCKT F+T VN+Q N + +DQ W
Sbjct: 373 EDGTVTFRGQQYFVPRRSVSILADCKTVVFSTQHVNSQHNQRTFH----FSDQTVQGNVW 428
Query: 406 RPEMINDFVVRGKGHFALNTLIDQKS------TNDVSDYLWYMTNADLKDDDPILSGSSN 459
+D V + + QK T D +DY+WY T+ L+ +D
Sbjct: 429 EMYTESDKVPT----YKFTNIRTQKPLEAYNLTKDKTDYVWYTTSFKLEAEDLPFRKDIW 484
Query: 460 MTLRINSSGQVLHAYVNGNYVDS-QWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQN 518
L ++S G + A+VNG YV + TK + + E+P+++ G N +S+LS T+G+Q+
Sbjct: 485 PVLEVSSHGHAMVAFVNGKYVGAGHGTKINKAFTM-EKPIEVRTGINHVSILSTTLGMQD 543
Query: 519 YGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSER 578
G + GI G V G T DL+S+ W + VGL G + A++E+
Sbjct: 544 SGVYLEHRQAGIDG----VTIQGLNTGTLDLTSNGWGHLVGLEG-------ERRNAHTEK 592
Query: 579 GWSSKN-VP--LNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLA 635
G VP +R +TWY+ F+ P +DPVV+++ MGKG +VNG LGRYW +Y
Sbjct: 593 GGDGVQWVPAVFDRPLTWYRRRFDIPTGDDPVVIDMSPMGKGVLYVNGEGLGRYWSSY-- 650
Query: 636 EEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEF-GGNPSQ 694
+ G PSQ YHVPR ++K N + +FEE GG P
Sbjct: 651 ---------------------KHALGRPSQYLYHVPRCFLKPTGNVMTIFEEEGGGQPDG 689
Query: 695 INFQTVVVGTACGQAHENKTME------------------------LTCHGRR-ISEIKY 729
I TV C E L+C ++ I ++ +
Sbjct: 690 IMILTVKRDNICSFISEKNPAHVKSWERKDSHLKSVADADLKPQAVLSCPEKKLIQQVVF 749
Query: 730 ASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRL 789
AS+G+P G CG + G+C A ++EK CVGKKSC ++ S GA G+ L
Sbjct: 750 ASYGNPLGICGNYTVGNCHAP-KAKEIVEKACVGKKSCVLQVSHEVYGADLNCPGSTGTL 808
Query: 790 VVEALC 795
V+A C
Sbjct: 809 AVQAKC 814
>gi|449529068|ref|XP_004171523.1| PREDICTED: beta-galactosidase 16-like [Cucumis sativus]
Length = 756
Score = 541 bits (1395), Expect = e-151, Method: Compositional matrix adjust.
Identities = 324/806 (40%), Positives = 434/806 (53%), Gaps = 120/806 (14%)
Query: 59 MWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIG 118
MWP LI KAKEGG+D I+TYVFWN HEP + Y+F+G D++RF+K IQ QGLY LRIG
Sbjct: 1 MWPSLIAKAKEGGIDVIQTYVFWNLHEPQQGTYEFSGRRDIVRFVKEIQAQGLYACLRIG 60
Query: 119 PYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPI 178
P++ AEW+YGG P WLH++ GI R+ N+ F MQNFTT IV+M K E L+ASQGGPI
Sbjct: 61 PFIEAEWSYGGLPFWLHDVLGI-VYRSDNEPFKLHMQNFTTKIVNMMKSEGLYASQGGPI 119
Query: 179 ILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-------- 230
IL+QIENEY V + +G+ G Y+ W AKMA SL GVPW MC+++DAP P+
Sbjct: 120 ILSQIENEYTLVEAAFGEKGPPYVQWAAKMAVSLQTGVPWSMCKQNDAPDPVINTCNGMR 179
Query: 231 ----FT-PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQF-GGTFQNYYM 284
FT PN+PN P IWTENWT +++++G + R+AE++AF VA F GT+ NYYM
Sbjct: 180 CGETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIAAKNGTYVNYYM 239
Query: 285 YHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVT 344
YHGGTNFGR++ +T YD +P+DEYG +PKWGHL+ELH +K L G +
Sbjct: 240 YHGGTNFGRSASAFMITGYYD-QSPLDEYGLTREPKWGHLKELHAAVKLCSTPLLTGTKS 298
Query: 345 NTDYGNSVSG----------------------------SSYNLPAWSVSILPDCKTEEFN 376
N G SV +Y LP S+SILPDCK FN
Sbjct: 299 NFSLGQSVEAIVFKTESNECAAFLVNRGAIDSNVLFQNVTYELPLGSISILPDCKNVAFN 358
Query: 377 TAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQK-STNDV 435
T +V+ Q N R A L+W+ E I + N L++ +T D
Sbjct: 359 TRRVSVQHNT---RSMMAVQKFDLLEWEEFKEPIPNI---DDTELRANELLEHMGTTKDR 412
Query: 436 SDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFE 495
SDYLWY ++ D P S TL ++S LHA+VNG+Y S Y
Sbjct: 413 SDYLWYTFR--VQQDSP----DSQQTLEVDSRAHALHAFVNGDYAGSAHGIYKEKGFSLA 466
Query: 496 RPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWT 555
+ + L G N ISLLS VGL + G+ + G+ VG G+ D S W
Sbjct: 467 KNITLRNGINNISLLSVMVGLPDSGAFLETRVAGL----RRVGIQGE-----DFSEQHWG 517
Query: 556 YKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMG 615
YKVGL G + F + ++N + WS ++ +TWYKT F+AP +DP+ LNL MG
Sbjct: 518 YKVGLSGEQSQIFLDTGSSNVQ--WSRLGNS-SQPLTWYKTQFDAPPGDDPIALNLGSMG 574
Query: 616 KGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWI 675
KG WVNG +GRYW ++L + G PSQ WY+VPRS++
Sbjct: 575 KGAVWVNGRGIGRYWVSFLTPK-----------------------GEPSQKWYNVPRSFL 611
Query: 676 KDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHE---------------------NKT 714
K N LV+ EE GNP +I+ +V++ CGQ E N+T
Sbjct: 612 KPTDNQLVILEEETGNPVEISLDSVLITKTCGQVSESHYPLVASWMGAKKQKVRRVKNRT 671
Query: 715 ----MELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSI 769
++L+C ++IS I +ASFG P G C ++ G C + + ++E C+G+ CSI
Sbjct: 672 RRPKVQLSCPSKKKISNILFASFGTPSGDCQSYAIGLCHSP-NSRAIVEHACLGRAKCSI 730
Query: 770 EASEANLGATSCAAGTVKRLVVEALC 795
S N C T K L+V+A C
Sbjct: 731 PISNLNFRGDPCPHVT-KTLLVDAQC 755
>gi|222631666|gb|EEE63798.1| hypothetical protein OsJ_18622 [Oryza sativa Japonica Group]
Length = 765
Score = 541 bits (1395), Expect = e-151, Method: Compositional matrix adjust.
Identities = 321/799 (40%), Positives = 427/799 (53%), Gaps = 87/799 (10%)
Query: 25 LAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAH 84
L +++DGRA+ + G R++ SG +HY RSTP MWP LI KAK GGLD I+TYVFWN H
Sbjct: 25 LGREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVH 84
Query: 85 EPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELR 144
EP++ QY+F G DL++FI+ IQ QGLYV LRIGP+V AEW YGGFP WLH++P I R
Sbjct: 85 EPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSI-TFR 143
Query: 145 TTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW 204
+ N+ F MQNF T IV M K E L+ QGGPII++QIENEY + +G +G Y+ W
Sbjct: 144 SDNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRW 203
Query: 205 CAKMATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFK 251
A MA L GVPW+MC+++DAP P+ PN+PN P +WTENWT +
Sbjct: 204 AAAMAVGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRYP 263
Query: 252 SWGGKDPKRTAEDLAFAVARFF-QFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPI 310
+G R ED+AFAVA F + G+F +YYMYHGGTNFGR + Y+TTSY AP+
Sbjct: 264 IYGNDTKLRAPEDIAFAVALFIARKKGSFVSYYMYHGGTNFGRFAAS-YVTTSYYDGAPL 322
Query: 311 DEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDC 370
DEY + L + ++ N ++ N S L S+S+L DC
Sbjct: 323 DEYDF----------KCVAFLVNFDQH----NTPKVEFRN----ISLELAPKSISVLSDC 364
Query: 371 KTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ- 429
+ F TAKVN Q + Q+ ND WK E + + K + N L +Q
Sbjct: 365 RNVVFETAKVNAQHGSRTANAVQSLNDIN--NWKAFIEPVPQDL--SKSTYTGNQLFEQL 420
Query: 430 KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKY-G 488
+T D +DYLWY+ + + D G+ L + S +LHA+VN YV S + G
Sbjct: 421 TTTKDETDYLWYIVSYKNRASD----GNQIAHLYVKSLAHILHAFVNNEYVGSVHGSHDG 476
Query: 489 ASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKD 548
N + + L G N ISLLS VG + G+ + GI VG + +
Sbjct: 477 PRNIVLNTHMSLKEGDNTISLLSVMVGSPDSGAYMERRTFGIQ----TVGIQQGQQPMHL 532
Query: 549 LSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVV 608
L++ W Y+VGL+G D Y + NS R W N + +TWYKTTF P ND V
Sbjct: 533 LNNDLWGYQVGLFGEKD-SIYTQEGTNSVR-WMDINNLIYHPLTWYKTTFSTPPGNDAVT 590
Query: 609 LNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWY 668
LNL MGKG WVNG ++GRYW ++ A G PSQ Y
Sbjct: 591 LNLTSMGKGEVWVNGESIGRYWVSFKAPS-----------------------GQPSQSLY 627
Query: 669 HVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHENKTMELTCHGR------ 722
H+PR ++ N LVL EE GG+P QI T+ V T CG E L G+
Sbjct: 628 HIPRGFLTPKDNLLVLVEEMGGDPLQITVNTMSVTTVCGNVDEFSVPPLQSRGKVPKVRI 687
Query: 723 ------RISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANL 776
RIS I++AS+G+P G C +F+ GSC AE ++++ C+G++ CSI A
Sbjct: 688 WCQGGNRISSIEFASYGNPVGDCRSFRIGSCHAE-SSESVVKQSCIGRRGCSIPVMAAKF 746
Query: 777 GATSCAAGTVKRLVVEALC 795
G C G K L+V A C
Sbjct: 747 GGDPC-PGIQKSLLVVADC 764
>gi|320170654|gb|EFW47553.1| beta-D-galactosidase [Capsaspora owczarzaki ATCC 30864]
Length = 830
Score = 541 bits (1393), Expect = e-151, Method: Compositional matrix adjust.
Identities = 319/841 (37%), Positives = 441/841 (52%), Gaps = 102/841 (12%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
+ A V++D RA+ IDG R++L+SGSIHYPRSTP MWP+L +AK G+D I+TY+FWN
Sbjct: 22 AYAMNVTYDSRALLIDGRRRLLVSGSIHYPRSTPDMWPELFARAKANGIDVIQTYLFWNT 81
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
+ P ++ + D +RF++ Q+ GLYV RIGP+VCAEW YGG P WL +P I
Sbjct: 82 NVPTPGEFVMSDRFDYVRFVQLAQEAGLYVNFRIGPFVCAEWTYGGLPAWLRQIPDI-MF 140
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYIN 203
R ++ ++ + T V + K +L A QGGPIIL QIENEYG S Y G Y+
Sbjct: 141 RDYDQPWLQVAGEYITKTVQILKDNRLLAGQGGPIILLQIENEYGGTESRYA-GGPQYVE 199
Query: 204 WCAKMATSLDIGVPWIMCQESDAPSPMFTPNN----------PNSPKIWTENWTGWFKSW 253
WC ++A +L WIMC + DAP+ + N P P +WTENW GWF+ W
Sbjct: 200 WCGQLAANLTDAAQWIMCSQPDAPANIIATCNAFYCDDFVPHPGQPSMWTENWPGWFQKW 259
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEY 313
G P R A+D+A+AV R++ GG++ NYYMYHGGTNF RT+GGP++TT+YDYDA +DEY
Sbjct: 260 GDPTPHRPAQDVAYAVTRYYIKGGSYMNYYMYHGGTNFERTAGGPFITTNYDYDASLDEY 319
Query: 314 GHLNQPKWGHLRELHKLLKSME---------KTLTYG--------------------NVT 344
G N+PK+ HL +H +L E K ++ G N
Sbjct: 320 GMPNEPKYSHLGSMHAVLHDNEAIMMAVPAPKPISLGTNLEAHIYNSSVGCVAFLSNNNN 379
Query: 345 NTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQT--------------NVKVKR 390
TD +G +Y LPAWSVS+L C T +NTA V +
Sbjct: 380 KTDVEVQFNGRTYELPAWSVSVLHGCVTAIYNTAVCRAHQRAPHDAACCARESRRVCDRL 439
Query: 391 PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNT---------LIDQKSTNDVSDYLWY 441
P +AP Q R + V+ G A T IDQ T D +DYLWY
Sbjct: 440 PPLRPKARAPCQ-SGRIRHLCLVVLTSIGPQAPATKYWNKTPLEQIDQ--TLDHTDYLWY 496
Query: 442 MTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLT 501
T+ + S ++ L + V + YVNG +V W+ ++ V L
Sbjct: 497 STSY-------VSSSATYAQLSLPQITDVAYVYVNGKFVTVSWSGNVSAT------VSLV 543
Query: 502 RGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLY 561
G N I +LS T+GL N G G+ G V L +L+ + W ++ G+
Sbjct: 544 AGPNTIDILSLTMGLDNGGDILSEYNCGLLGGVYLGS--------VNLTENGWWHQTGVV 595
Query: 562 GLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLEND-PVVLNLQGMGKGFAW 620
G + F + W++ V LN +TWYK++F+ P ++ P+ L+L GMGKG+ W
Sbjct: 596 GERNAIFLPENL--KKVAWTTPAV-LNTGLTWYKSSFDVPRDSQAPLALDLTGMGKGYVW 652
Query: 621 VNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVN 680
VNG+NLGRYWPT LA C + CDYRG Y + C C PSQ YHVPR W++ N
Sbjct: 653 VNGHNLGRYWPTILATNWPC--DVCDYRGTYDAPHCKQGCNMPSQTHYHVPREWLQAENN 710
Query: 681 TLVLFEEFGGNPSQINFQTVVVGTACGQAHENKTME-----LTCHGRR-ISEIKYASFGD 734
LVL EE GGNPS+I +CG E+ + L C + I+ + +AS+G
Sbjct: 711 VLVLLEEMGGNPSKIALVEREEYVSCGVVGEDYPADDLAVVLGCGTHQTIAGVDFASYGT 770
Query: 735 PQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEAL 794
P G+C ++++GSC A + ++ C GK++CSI S A G C T KRL V+
Sbjct: 771 PMGSCRSYQQGSCHAS-NSTEIVLSLCHGKQACSIPVSAAMFG-NPCPDVTNKRLAVQVA 828
Query: 795 C 795
C
Sbjct: 829 C 829
>gi|30697899|ref|NP_568978.2| beta-galactosidase 6 [Arabidopsis thaliana]
gi|75170268|sp|Q9FFN4.1|BGAL6_ARATH RecName: Full=Beta-galactosidase 6; Short=Lactase 6; Flags:
Precursor
gi|10177061|dbj|BAB10473.1| beta-galactosidase [Arabidopsis thaliana]
gi|332010416|gb|AED97799.1| beta-galactosidase 6 [Arabidopsis thaliana]
Length = 718
Score = 540 bits (1392), Expect = e-151, Method: Compositional matrix adjust.
Identities = 320/750 (42%), Positives = 416/750 (55%), Gaps = 91/750 (12%)
Query: 10 AILLCLILQTLFNLS----LAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIK 65
+ L LI+ T S A V++DGR++ IDG+RK+L SGSIHYPRSTP MWP LIK
Sbjct: 9 GLCLILIVGTFLEFSGGATAAKGVTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPSLIK 68
Query: 66 KAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEW 125
K KEGG+D I+TYVFWN HEP QYDF+G DL++FIK I+ QGLYV LRIGP++ AEW
Sbjct: 69 KTKEGGIDVIQTYVFWNLHEPKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIEAEW 128
Query: 126 NYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIEN 185
NYGG P WL ++PG+ RT N+ F MQ FT IVD+ K E L+ASQGGPIIL+QIEN
Sbjct: 129 NYGGLPFWLRDVPGM-VYRTDNEPFKFHMQKFTAKIVDLMKSEGLYASQGGPIILSQIEN 187
Query: 186 EYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFT------------- 232
EY NV + + G SYI W +MA L GVPWIMC+ DAP P+
Sbjct: 188 EYANVEGAFHEKGASYIKWAGQMAVGLKTGVPWIMCKSPDAPDPVINTCNGMKCGETFPG 247
Query: 233 PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG 292
PN+PN PK+WTE+WT +F+ +G + R+AED+AF A F G++ NYYMYHGGTNFG
Sbjct: 248 PNSPNKPKMWTEDWTSFFQVYGKEPYIRSAEDIAFHAALFVAKNGSYINYYMYHGGTNFG 307
Query: 293 RTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG--- 349
RTS ++T YD AP+DEYG L QPK+GHL+ELH +KS L G T G
Sbjct: 308 RTSSSYFITGYYD-QAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQTILSLGPMQ 366
Query: 350 ------------------NSVSGS-------SYNLPAWSVSILPDCKTEEFNTAKVNTQT 384
N S +Y+L S+ IL +CK + TAKVN +
Sbjct: 367 QAYVFEDANNGCVAFLVNNDAKASQIQFRNNAYSLSPKSIGILQNCKNLIYETAKVNVKM 426
Query: 385 NVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMT 443
N +V P Q N P W E I F N L++ + T D +DYLWY +
Sbjct: 427 NTRVTTPVQVFN--VPDNWNLFRETIPAFP---GTSLKTNALLEHTNLTKDKTDYLWYTS 481
Query: 444 NADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRG 503
+ K D P +N ++ SSG V+H +VN S + PV L G
Sbjct: 482 S--FKLDSP----CTNPSIYTESSGHVVHVFVNNALAGSGHGSRDIRVVKLQAPVSLING 535
Query: 504 KNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGL 563
+N IS+LS VGL + G+ + G+ + G G + I DLS +W Y VGL G
Sbjct: 536 QNNISILSGMVGLPDSGAYMERRSYGLTKVQISCG--GTKPI--DLSRSQWGYSVGLLG- 590
Query: 564 DDKKFYNAKAANSERGWSSKNVPL--NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWV 621
+ + Y K N + WS L NR + WYKTTF+ P + PV L++ MGKG WV
Sbjct: 591 EKVRLYQWKNLNRVK-WSMNKAGLIKNRPLAWYKTTFDGPNGDGPVGLHMSSMGKGEIWV 649
Query: 622 NGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNT 681
NG ++GRYW ++L G PSQ YH+PR+++K N
Sbjct: 650 NGESIGRYWVSFLTP-----------------------AGQPSQSIYHIPRAFLKPSGNL 686
Query: 682 LVLFEEFGGNPSQINFQTV-VVGTACGQAH 710
LV+FEE GG+P I+ T+ VVG++ Q+
Sbjct: 687 LVVFEEEGGDPLGISLNTISVVGSSQAQSQ 716
>gi|357467507|ref|XP_003604038.1| Beta-galactosidase [Medicago truncatula]
gi|355493086|gb|AES74289.1| Beta-galactosidase [Medicago truncatula]
Length = 847
Score = 540 bits (1392), Expect = e-150, Method: Compositional matrix adjust.
Identities = 323/859 (37%), Positives = 452/859 (52%), Gaps = 140/859 (16%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DG+++ ++G R++L SGSIHY RSTP WPD++ KA+ GGL+ I+TYVFWNAHEP +
Sbjct: 35 VTYDGKSLFVNGRRELLFSGSIHYTRSTPDAWPDILDKARHGGLNVIQTYVFWNAHEPEQ 94
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+++F GN DL++FI+ +Q +G+YV LR+GP++ AEWN+GG P WL +PGI R+ N+
Sbjct: 95 GKFNFEGNNDLVKFIRLVQSKGMYVTLRVGPFIQAEWNHGGLPYWLREVPGI-IFRSDNE 153
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
+ M+ + + I+ M K EKLFA QGGPIILAQIENEY ++ Y + G SY+ W A M
Sbjct: 154 PYKKYMKAYVSKIIQMMKDEKLFAPQGGPIILAQIENEYNHIQLAYEEKGDSYVQWAANM 213
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGG 255
A +LDIGVPWIMC++ DAP P+ PN P P +WTENWT ++ +G
Sbjct: 214 AVALDIGVPWIMCKQKDAPDPVINACNGRHCGDTFSGPNKPYKPSLWTENWTAQYRVFGD 273
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
+R+AED+AF+VARFF G NYYMYHGGTNFGRT+ + TT Y +AP+DEYG
Sbjct: 274 PVSQRSAEDIAFSVARFFSKNGNLVNYYMYHGGTNFGRTTSA-FTTTRYYDEAPLDEYGM 332
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYG------------------------------NVTN 345
QPKW HLR+ HK L K + G N TN
Sbjct: 333 ERQPKWSHLRDAHKALLLCRKAILGGVPTVQKLNDYHEVRIFEKPGTSTCSAFITNNHTN 392
Query: 346 TDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQ------------TNVKVKRPNQ 393
S GS+Y LPA S+S+LPDCKT +NT V Q + V + N+
Sbjct: 393 QAATISFRGSNYFLPAHSISVLPDCKTVVYNTQNVMNQLVYYKLISSHLIIKLIVSQHNK 452
Query: 394 AGNDQAP----LQWKWRPEMI---NDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNAD 446
++ L+W+ E I K L TL+ D +DY WY T+ +
Sbjct: 453 RNFVKSAVANNLKWELFLEAIPSSKKLESNQKIPLELYTLL-----KDTTDYGWYTTSFE 507
Query: 447 LKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQ 506
L +D L S + LRI S G L A+VNG Y+ + + + FE+P G N
Sbjct: 508 LGPED--LPKKSAI-LRIMSLGHTLSAFVNGQYIGTDHGTHEEKSFEFEQPANFKVGTNY 564
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDK 566
IS+L+ TVGL + G+ + G P + ++G + +L+ + W ++VGL G K
Sbjct: 565 ISILATTVGLPDSGAYMEHRYAG-PKSISILGLNKGKL---ELTKNGWGHRVGLRGEQLK 620
Query: 567 KFYNAKAANSERGWSSKNV---PL---NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAW 620
F +E G SK V P+ R ++W KT F P PV + + GMGKG W
Sbjct: 621 VF-------TEEG--SKKVQWDPVTGETRALSWLKTRFATPEGRGPVAIRMTGMGKGMIW 671
Query: 621 VNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVN 680
VNG ++GR+W ++L+ G PSQ YH+PR ++ N
Sbjct: 672 VNGKSIGRHWMSFLSP-----------------------LGQPSQEEYHIPRDYLNAKDN 708
Query: 681 TLVLFEEFGGNPSQINFQTVVVGTACGQAHENK-----------------------TMEL 717
LV+ EE G+P +I V T C EN L
Sbjct: 709 LLVVLEEEKGSPEKIEIMIVDRDTICSYITENSPANVNSWGSKNGEFRSVGKNSGPQASL 768
Query: 718 TC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANL 776
C G++I +++ASFG+P G CG F G+C ++EK C+GK+ C +E + AN
Sbjct: 769 KCPSGKKIVAVEFASFGNPSGYCGDFALGNCNGGA-AKGVVEKACLGKEECLVEVNRANF 827
Query: 777 GATSCAAGTVKRLVVEALC 795
C AG+V L ++A C
Sbjct: 828 NGQGC-AGSVNTLAIQAKC 845
>gi|218196839|gb|EEC79266.1| hypothetical protein OsI_20049 [Oryza sativa Indica Group]
Length = 761
Score = 540 bits (1391), Expect = e-150, Method: Compositional matrix adjust.
Identities = 320/799 (40%), Positives = 427/799 (53%), Gaps = 87/799 (10%)
Query: 25 LAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAH 84
L +++DGRA+ + G R++ SG +HY RSTP MWP LI KAK GGLD I+TYVFWN H
Sbjct: 21 LGREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVH 80
Query: 85 EPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELR 144
EP++ QY+F G DL++FI+ IQ QGLYV LRIGP+V AEW YGGFP WLH++P I R
Sbjct: 81 EPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSI-TFR 139
Query: 145 TTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW 204
+ N+ F MQNF T IV M K E L+ QGGPII++QIENEY + +G +G Y+ W
Sbjct: 140 SDNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRW 199
Query: 205 CAKMATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFK 251
A MA L GVPW+MC+++DAP P+ PN+PN P +WTENWT +
Sbjct: 200 AAAMAVGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRYP 259
Query: 252 SWGGKDPKRTAEDLAFAVARFF-QFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPI 310
+G R ED+AFAVA + + G+F +YYMYHGGTNFGR + Y+TTSY AP+
Sbjct: 260 IYGNDTKLRDPEDIAFAVALYIARKKGSFVSYYMYHGGTNFGRFAAS-YVTTSYYDGAPL 318
Query: 311 DEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDC 370
DEY + L + ++ N ++ N S L S+S+L DC
Sbjct: 319 DEYDF----------KCVAFLVNFDQH----NTPKVEFRN----ISLELAPKSISVLSDC 360
Query: 371 KTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ- 429
+ F TAKVN Q + Q+ ND WK E + + K + N L +Q
Sbjct: 361 RNVVFETAKVNAQHGSRTANAVQSLNDIN--NWKAFIEPVPQDL--SKSTYTGNQLFEQL 416
Query: 430 KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKY-G 488
+T D +DYLWY+ + + D G+ L + S +LHA+VN YV S + G
Sbjct: 417 TTTKDETDYLWYIVSYKNRASD----GNQIARLYVKSLAHILHAFVNNEYVGSVHGSHDG 472
Query: 489 ASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKD 548
N + + L G N ISLLS VG + G+ + GI VG + +
Sbjct: 473 PRNIVLNTHMSLKEGDNTISLLSVMVGSPDSGAYMERRTFGIQ----TVGIQQGQQPMHL 528
Query: 549 LSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVV 608
L++ W Y+VGL+G D Y + NS R W N + +TWYKTTF P ND V
Sbjct: 529 LNNDLWGYQVGLFGEKD-SIYTQEGPNSVR-WMDINNLIYHPLTWYKTTFSTPPGNDAVT 586
Query: 609 LNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWY 668
LNL MGKG WVNG ++GRYW ++ A G PSQ Y
Sbjct: 587 LNLTSMGKGEVWVNGESIGRYWVSFKAPS-----------------------GQPSQSLY 623
Query: 669 HVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHENKTMELTCH-------- 720
H+PR ++ N LVL EE GG+P QI T+ V T CG E L
Sbjct: 624 HIPRGFLTPKDNLLVLVEEMGGDPLQITVNTMSVTTVCGNVDEFSVPPLQSRGKVPKVRI 683
Query: 721 ----GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANL 776
G+RIS I++AS+G+P G C +F+ GSC AE ++++ C+G++ CSI A
Sbjct: 684 WCQGGKRISSIEFASYGNPVGDCRSFRIGSCHAE-SSESVVKQSCIGRRGCSIPVMAAKF 742
Query: 777 GATSCAAGTVKRLVVEALC 795
G C G K L+V A C
Sbjct: 743 GGDPC-PGIQKSLLVVADC 760
>gi|6686884|emb|CAB64742.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 718
Score = 540 bits (1391), Expect = e-150, Method: Compositional matrix adjust.
Identities = 318/751 (42%), Positives = 418/751 (55%), Gaps = 93/751 (12%)
Query: 10 AILLCLILQTLFNLS----LAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIK 65
+ L LI+ T S A V++DGR++ IDG+RK+L SGSIHYPRSTP MWP LIK
Sbjct: 9 GLCLILIVGTFLEFSGGATAAKGVTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPSLIK 68
Query: 66 KAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEW 125
K KEGG+D I+TYVFWN HEP QYDF+G DL++FIK I+ QGLYV LRIGP++ AEW
Sbjct: 69 KTKEGGIDVIQTYVFWNLHEPKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIEAEW 128
Query: 126 NYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIEN 185
NYGG P WL ++PG+ RT N+ F MQ FT IVD+ K E L+ASQGGPIIL+QIEN
Sbjct: 129 NYGGLPFWLRDVPGM-VYRTDNEPFKFHMQKFTAKIVDLMKSEGLYASQGGPIILSQIEN 187
Query: 186 EYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFT------------- 232
EY NV + + G SYI W +MA L GVPWIMC+ DAP P+
Sbjct: 188 EYANVEGAFHEKGASYIKWAGQMAVGLKTGVPWIMCKSPDAPDPVINTCNGMKCGETFPG 247
Query: 233 PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG 292
PN+PN PK+WTE+WT +F+ +G + R+AED+AF A F G++ NYYMYHGGTNFG
Sbjct: 248 PNSPNKPKMWTEDWTSFFQVYGKEPYIRSAEDIAFHAALFVAKNGSYINYYMYHGGTNFG 307
Query: 293 RTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG--- 349
RTS ++T YD AP+DEYG L QPK+GHL+ELH +KS L G T G
Sbjct: 308 RTSSSYFITGYYD-QAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQTILSLGPMQ 366
Query: 350 ------------------NSVSGS-------SYNLPAWSVSILPDCKTEEFNTAKVNTQT 384
N S +Y+L S+ IL +CK + TAKVN +
Sbjct: 367 QAYVFEDANNGCVAFLVNNDAKASQIQFRNNAYSLSPKSIGILQNCKNLIYETAKVNVKM 426
Query: 385 NVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGH-FALNTLIDQKS-TNDVSDYLWYM 442
N +V P Q N P W + + + + H N L++ + T D +DYLWY
Sbjct: 427 NTRVTTPVQVFN--VPDNW----NLFRETIPASQAHLLKTNALLEHTNLTKDKTDYLWYT 480
Query: 443 TNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTR 502
++ K D P +N ++ SSG V+H +VN S + PV L
Sbjct: 481 SS--FKLDSP----CTNPSIYTESSGHVVHVFVNNALAGSGHGSRDIRVVKLQAPVSLIN 534
Query: 503 GKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYG 562
G+N IS+LS VGL + G+ + G+ + G G + I DLS +W Y VGL G
Sbjct: 535 GQNNISILSGMVGLPDSGAYMERRSYGLTKVQISCG--GTKPI--DLSRSQWGYSVGLLG 590
Query: 563 LDDKKFYNAKAANSERGWSSKNVPL--NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAW 620
+ + Y K N + WS L NR + WYKTTF+ P + PV L++ MGKG W
Sbjct: 591 -EKVRLYQWKNLNRVK-WSMNKAGLIKNRPLAWYKTTFDGPNGDGPVGLHMSSMGKGEIW 648
Query: 621 VNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVN 680
VNG ++GRYW ++L G PSQ YH+PR+++K N
Sbjct: 649 VNGESIGRYWVSFLTP-----------------------AGQPSQSIYHIPRAFLKPSGN 685
Query: 681 TLVLFEEFGGNPSQINFQTV-VVGTACGQAH 710
LV+FEE GG+P I+ T+ VVG++ Q+
Sbjct: 686 LLVVFEEEGGDPLGISLNTISVVGSSQAQSQ 716
>gi|225438369|ref|XP_002274012.1| PREDICTED: beta-galactosidase 6-like [Vitis vinifera]
Length = 758
Score = 539 bits (1389), Expect = e-150, Method: Compositional matrix adjust.
Identities = 304/716 (42%), Positives = 406/716 (56%), Gaps = 85/716 (11%)
Query: 28 RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPL 87
+V++DGR++ IDG RKIL SGSIHYPRSTP MW LI KAKEGG+D I+TYVFWN HEP
Sbjct: 61 QVTYDGRSLIIDGHRKILFSGSIHYPRSTPQMWASLIAKAKEGGVDVIQTYVFWNRHEPQ 120
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
QYDF G DL +FIK IQ QGLY LRIGP++ +EW+YGG P WLH++ GI RT N
Sbjct: 121 PGQYDFNGRYDLAKFIKEIQAQGLYACLRIGPFIESEWSYGGLPFWLHDVHGI-VYRTDN 179
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
+ F MQNFTT IV++ K E L+ASQGGPIIL+QIENEY N+ + + + G SY+ W AK
Sbjct: 180 EPFKFYMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYQNIEAAFNEKGPSYVRWAAK 239
Query: 208 MATSLDIGVPWIMCQESDAPSPM------------FT-PNNPNSPKIWTENWTGWFKSWG 254
MA L GVPW+MC++SDAP P+ FT PN+PN P +WTENWT +++ +G
Sbjct: 240 MAVELQTGVPWVMCKQSDAPDPVINTCNGMRCGQTFTGPNSPNKPSMWTENWTSFYEVFG 299
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
G+ R+AED+AF VA F G++ NYYMYHGGTNFGR S Y+ TSY AP+DEYG
Sbjct: 300 GETYLRSAEDIAFHVALFIARNGSYVNYYMYHGGTNFGRASSA-YIKTSYYDQAPLDEYG 358
Query: 315 HLNQPKWGHLRELHK--------LLKSMEKTLTYGN-----------------VTNTDYG 349
+ QPKWGHL+ELH LL ++ ++ G + N D G
Sbjct: 359 LIRQPKWGHLKELHAAITLCSTPLLNGVQSNISLGQLQEAYVFQEEMGGCVAFLVNNDEG 418
Query: 350 NSVS----GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKW 405
N+ + S L S+SILPDCK FNTAK+NT N ++ +Q+ + A +W+
Sbjct: 419 NNSTVLFQNVSIELLPKSISILPDCKNVIFNTAKINTGYNERIATSSQSFD--AVDRWEE 476
Query: 406 RPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRI 464
+ I +F+ N +++ + T D SDYLWY S + L I
Sbjct: 477 YKDAIPNFL---DTSLKSNMILEHMNMTKDESDYLWYTFRFQPN------SSCTEPLLHI 527
Query: 465 NSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFD 524
S +HA+VN YV + + F+ P+ L N IS+LS VG + G+ +
Sbjct: 528 ESLAHAVHAFVNNIYVGATHGSHDMKGFTFKSPISLNNEMNNISILSVMVGFPDSGAYLE 587
Query: 525 MVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
G+ + G I D +++ W Y+VGL G + +N E W
Sbjct: 588 SRFAGLTRVEIQCTEKG----IYDFANYTWGYQVGLSGEKLHIYKEENLSNVE--WRKTE 641
Query: 585 VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTES 644
+ N+ +TWYK F P +DPV LNL MGKG AWVNG ++GRYW ++ +
Sbjct: 642 ISTNQPLTWYKIVFNTPSGDDPVALNLSTMGKGEAWVNGQSIGRYWVSFHNSK------- 694
Query: 645 CDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTV 700
G+PSQ YHVPR+++K N LVL EE G+P I+ +T+
Sbjct: 695 ----------------GDPSQTLYHVPRAFLKTSENLLVLLEEANGDPLHISLETI 734
>gi|297836382|ref|XP_002886073.1| beta-galactosidase 13 [Arabidopsis lyrata subsp. lyrata]
gi|297331913|gb|EFH62332.1| beta-galactosidase 13 [Arabidopsis lyrata subsp. lyrata]
Length = 848
Score = 539 bits (1388), Expect = e-150, Method: Compositional matrix adjust.
Identities = 316/840 (37%), Positives = 451/840 (53%), Gaps = 112/840 (13%)
Query: 28 RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPL 87
V++DG ++ I+G R++L SGSIHYPRSTP MWP++IK+AK+GGL+ I+TYVFWN HEP
Sbjct: 43 EVTYDGTSLIINGNRELLYSGSIHYPRSTPEMWPNIIKRAKQGGLNTIQTYVFWNVHEPE 102
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
+ +++F+G DL++FIK I+ G+YV LR+GP++ AEW +GG P WL +PGI RT N
Sbjct: 103 QGKFNFSGRADLVKFIKLIEKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIF-FRTDN 161
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
F + + +I+D K+EKLFASQGGPIIL QIENEY V Y + G +YI W +K
Sbjct: 162 TPFKEHTERYVKVILDKMKEEKLFASQGGPIILGQIENEYSAVQRAYKEDGLNYIKWASK 221
Query: 208 MATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWG 254
+ S+D+G+PW+MC+++DAP PM PN N P +WTENWT F+ +G
Sbjct: 222 LVHSMDLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKENKPSLWTENWTTQFRVYG 281
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
+R+ ED+A++VARFF GT NYYMYHGGTNFGRTS Y+TT Y DAP+DEYG
Sbjct: 282 DPPAQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYDDAPLDEYG 340
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYG--------NVTNTDY----GNSV---------- 352
+PK+GHL+ LH L +K L +G N T Y G V
Sbjct: 341 LEREPKYGHLKHLHNALNLCKKALLWGQPRVEKPSNETEIRYYEQPGTKVCAAFLANNNT 400
Query: 353 --------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWK 404
G Y +P S+SILPDCKT +NT ++ + + ++ N +K
Sbjct: 401 ESAEKIKFKGKEYIIPHRSISILPDCKTVVYNTGEIISHHTSRNFMKSKKANKN--FDFK 458
Query: 405 WRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRI 464
E + ++G + + T D +DY WY T+ + D+D S TLRI
Sbjct: 459 VFTETVPS-KIKGDSYIPVELY---GLTKDETDYGWYTTSFKIDDNDLSKKKGSKPTLRI 514
Query: 465 NSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFD 524
S G LH ++NG Y+ + + + +F++P+ L G+N +++L G + GS +
Sbjct: 515 ASLGHALHVWLNGEYLGNGHGSHEEKSFVFQKPISLKEGENHLTMLGVLTGFPDSGSYME 574
Query: 525 MVPNGIPGPVLLVGRAGDETIIKDLS-SHKWTYKVGLYGLDDKKFYNAKAANSERGWSSK 583
G P V ++G G T+ DL+ +KW KVG+ G +K +A+ + W K
Sbjct: 575 HRYTG-PRSVSILG-LGSGTL--DLTEENKWGNKVGMEG--EKLGIHAEEGLKKVKW-QK 627
Query: 584 NVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTE 643
+TWY+T F+AP + + GMGKG WVNG +GRYW ++L+
Sbjct: 628 FSGKEPGLTWYQTYFDAPESQSAAAIRMNGMGKGLIWVNGEGVGRYWMSFLSP------- 680
Query: 644 SCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGG-NPSQINFQTVVV 702
G P+QI YH+PRS++K N LV+FEE P I+F +
Sbjct: 681 ----------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELIDFVIINR 724
Query: 703 GTACGQAHENK-----------------------TMELTCHG-RRISEIKYASFGDPQGA 738
T C EN T L C G ++ISE+++ASFG+P G
Sbjct: 725 DTVCSHIGENYTPSVRHWTRKNDQVQAITDDVHLTASLKCSGTKKISEVEFASFGNPNGT 784
Query: 739 CGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANL---GATSCAAGTVKRLVVEALC 795
CG F G+C A + ++EK C+GK C I +++ SC K+L V+ C
Sbjct: 785 CGNFTLGTCNAPVSK-KVVEKYCLGKAECVIPVNKSTFQQDKKDSCPK-VEKKLAVQVKC 842
>gi|4581116|gb|AAD24606.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 832
Score = 536 bits (1382), Expect = e-149, Method: Compositional matrix adjust.
Identities = 318/846 (37%), Positives = 452/846 (53%), Gaps = 112/846 (13%)
Query: 22 NLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFW 81
+ S A +++DG ++ I+G R++L SGSIHYPRSTP MWP++IK+AK+GGL+ I+TYVFW
Sbjct: 21 SFSGALSITYDGTSLIINGNRELLYSGSIHYPRSTPEMWPNIIKRAKQGGLNTIQTYVFW 80
Query: 82 NAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIE 141
N HEP + +++F+G DL++FIK I+ GLYV LR+GP++ AEW +GG P WL +PGI
Sbjct: 81 NVHEPEQGKFNFSGRADLVKFIKLIEKNGLYVTLRLGPFIQAEWTHGGLPYWLREVPGI- 139
Query: 142 ELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSY 201
RT N+ F + + +++DM K+EKLFASQGGPIIL QIENEY V Y + G +Y
Sbjct: 140 FFRTDNEPFKEHTERYVKVVLDMMKEEKLFASQGGPIILGQIENEYSAVQRAYKEDGLNY 199
Query: 202 INWCAKMATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTG 248
I W +K+ S+D+G+PW+MC+++DAP PM PN N P +WTENWT
Sbjct: 200 IKWASKLVHSMDLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKDNKPSLWTENWTT 259
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDA 308
F+ +G +R+ ED+A++VARFF GT NYYMYHGGTNFGRTS Y+TT Y DA
Sbjct: 260 QFRVFGDPPAQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYDDA 318
Query: 309 PIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYG--------NVTNTDY----GNSV---- 352
P+DE+G +PK+GHL+ LH L +K L +G N T Y G V
Sbjct: 319 PLDEFGLEREPKYGHLKHLHNALNLCKKALLWGQPRVEKPSNETEIRYYEQPGTKVCAAF 378
Query: 353 --------------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQ 398
G Y +P S+SILPDCKT +NT ++ + + ++ N
Sbjct: 379 LANNNTEAAEKIKFRGKEYLIPHRSISILPDCKTVVYNTGEIISHHTSRNFMKSKKANKN 438
Query: 399 APLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSS 458
+K E + ++G + T D SDY WY T+ + D+D
Sbjct: 439 --FDFKVFTESVPS-KIKGDSFIPVELY---GLTKDESDYGWYTTSFKIDDNDLSKKKGG 492
Query: 459 NMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQN 518
LRI S G LH ++NG Y+ + + + +F++PV L G+N +++L G +
Sbjct: 493 KPNLRIASLGHALHVWLNGEYLGNGHGSHEEKSFVFQKPVTLKEGENHLTMLGVLTGFPD 552
Query: 519 YGSKFDMVPNGIPGPVLLVGRAGDETIIKDLS-SHKWTYKVGLYGLDDKKFYNAKAANSE 577
GS + G P V ++G G T+ DL+ +KW KVG+ G ++ +A+ +
Sbjct: 553 SGSYMEHRYTG-PRSVSILG-LGSGTL--DLTEENKWGNKVGMEG--ERLGIHAEEGLKK 606
Query: 578 RGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEE 637
W K MTWY+T F+AP + + GMGKG WVNG +GRYW ++L+
Sbjct: 607 VKW-EKASGKEPGMTWYQTYFDAPESQSAAAIRMNGMGKGLIWVNGEGVGRYWMSFLSP- 664
Query: 638 DGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGG-NPSQIN 696
G P+QI YH+PRS++K N LV+FEE P I+
Sbjct: 665 ----------------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELID 702
Query: 697 FQTVVVGTACGQAHENK-----------------------TMELTCHG-RRISEIKYASF 732
F V T C EN T L C G ++IS +++ASF
Sbjct: 703 FVIVNRDTVCSYIGENYTPSVRHWTRKNDQVQAITDDVHLTANLKCSGTKKISAVEFASF 762
Query: 733 GDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANL---GATSCAAGTVKRL 789
G+P G CG F GSC A + ++EK C+GK C I +++ SC K+L
Sbjct: 763 GNPNGTCGNFTLGSCNAPVSK-KVVEKYCLGKAECVIPVNKSTFEQDKKDSCPK-VEKKL 820
Query: 790 VVEALC 795
V+ C
Sbjct: 821 AVQVKC 826
>gi|255558624|ref|XP_002520337.1| beta-galactosidase, putative [Ricinus communis]
gi|223540556|gb|EEF42123.1| beta-galactosidase, putative [Ricinus communis]
Length = 771
Score = 536 bits (1381), Expect = e-149, Method: Compositional matrix adjust.
Identities = 320/820 (39%), Positives = 433/820 (52%), Gaps = 142/820 (17%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DGR++ I+GE +IL SGSIHYPRSTP
Sbjct: 40 VTYDGRSLIINGEHRILFSGSIHYPRSTP------------------------------- 68
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+YDF G DL++F+ +Q QGLY LRIGP++ EW YGG P WLH++ GI R+ N+
Sbjct: 69 -EYDFDGRKDLVKFLLEVQAQGLYAALRIGPFIEGEWTYGGLPFWLHDVSGIV-FRSDNE 126
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ F T IV+M K +L+ASQGGPII++QIENEY NV + + + G Y++W A M
Sbjct: 127 PFKKHMQRFVTKIVNMMKYNQLYASQGGPIIISQIENEYQNVETAFHEKGSRYVHWAANM 186
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGG 255
A L+ GVPW+MC+++DAP P+ PN+PN P +WTENWT +++ +GG
Sbjct: 187 AVRLNTGVPWVMCKQTDAPDPVINTCNGMRCGETFAGPNSPNKPSMWTENWTSFYQVFGG 246
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
+ RTAED+AF VA F G++ NYYMYHGGTNFGRT G ++TTSY AP+DEYG
Sbjct: 247 EPYIRTAEDIAFHVALFIARNGSYVNYYMYHGGTNFGRT-GSAFVTTSYYDQAPLDEYGL 305
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYGN-------------------------VTNTDYGN 350
+ QPKWGHL++LH +KS KTL G + N D
Sbjct: 306 IRQPKWGHLKDLHAKIKSCSKTLIRGTHQTFPLGRLQEAYVFREKSGDCVAFLVNNDGRR 365
Query: 351 SVS----GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
V+ SY LP S+SILPDCK+ FNTAKVNTQ + +Q + +W+
Sbjct: 366 DVTVRFQNRSYELPHKSISILPDCKSITFNTAKVNTQYATRSATLSQEFSSVG--KWEEY 423
Query: 407 PEMINDF---VVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTL 462
E + F +R K TL+D ST D SDYLWY P TL
Sbjct: 424 KETVATFDSTSLRAK------TLLDHLSTTKDTSDYLWYTFRFQNHFSRP------QSTL 471
Query: 463 RINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSK 522
R S G VLHAYVNG Y S + +++ E V+L G N ++LLS TVGL + G+
Sbjct: 472 RAYSRGHVLHAYVNGVYAGSAHGSHESTSFTLENSVRLKNGTNNVALLSVTVGLPDSGAY 531
Query: 523 FDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKF----YNAKAANSER 578
+ G+ R + KD +++ W Y+VGL G + + N + N R
Sbjct: 532 LERRVAGL-------HRVRIQN--KDFTTYSWGYQVGLLGEKLQIYTDNGLNKVSWNEFR 582
Query: 579 GWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEED 638
G + +TWYKT F+AP +DP+ LNL MGKG AWVNG ++GRYW ++ +
Sbjct: 583 G-------TTQPLTWYKTQFDAPAGSDPIALNLHSMGKGEAWVNGQSIGRYWVSFSTSK- 634
Query: 639 GCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQ 698
GNPSQ YH+P+S++K N LVL EE G P I
Sbjct: 635 ----------------------GNPSQTRYHIPQSFVKPTGNLLVLLEEEKGYPPGITVD 672
Query: 699 TVVVGTACGQAHEN--KTMELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLP 755
++ + CG E+ ++L+C R IS I ++SFG P+G C + G C + +
Sbjct: 673 SISISKVCGHVSESHKSVVQLSCPPNRNISRILFSSFGTPEGNCNQYAIGKCHSS-NSRA 731
Query: 756 LIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
++EK C+GK C I S G C G K L+V+A C
Sbjct: 732 IVEKACIGKTKCIILRSNRFFGGDPC-PGIRKGLLVDAKC 770
>gi|449436074|ref|XP_004135819.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 643
Score = 536 bits (1381), Expect = e-149, Method: Compositional matrix adjust.
Identities = 291/654 (44%), Positives = 384/654 (58%), Gaps = 63/654 (9%)
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
+ Y+F DL+RF+K + GLYV LRIGPYVCAEWN+GGFPVWL +PGI RT N
Sbjct: 3 KIMYNFEDRYDLVRFVKLVHQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIA-FRTDN 61
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
F MQ FT IV + K EKL+ SQGGPIIL+QIENEYG V + G GKSY W A+
Sbjct: 62 GPFKAAMQKFTEKIVGLMKGEKLYESQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAAQ 121
Query: 208 MATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGK 256
MA LD GVPW+MC++ DAP P+ F PN PK+WTE WTGWF +GG
Sbjct: 122 MALGLDTGVPWVMCKQDDAPDPVIDTCNGFYCENFKPNKVYKPKMWTEAWTGWFTEFGGP 181
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHL 316
P R ED+A++VARF Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAPIDEYG L
Sbjct: 182 APYRPVEDMAYSVARFIQNGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLL 241
Query: 317 NQPKWGHLRELHKLLK-------SMEKTLTY------------------GNVTNTDYGNS 351
+PKW HLR+LHK +K S++ T++Y + N D +S
Sbjct: 242 REPKWSHLRDLHKAIKLCEPALVSVDPTVSYLGSNQEAHVFKTRSGSCAAFLANYDASSS 301
Query: 352 VS----GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRP 407
+ + Y+LP WSVSILPDCK+ FNTAKV T+ P + + W
Sbjct: 302 ATVTFGNNQYDLPPWSVSILPDCKSVIFNTAKVGAPTSQPKMTPVSS--------FSWLS 353
Query: 408 EMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINS 466
+ + L++Q S T D +DYLWYMT+ + ++ L L + S
Sbjct: 354 YNEETASAYTEDTTTMAGLVEQISVTRDSTDYLWYMTDIRIDPNEGFLKSGQWPLLTVFS 413
Query: 467 SGQVLHAYVNGNYVDSQWTKYGASNDL---FERPVKLTRGKNQISLLSATVGLQNYGSKF 523
+G LH ++NG T YG S + F + V L G N++S+LS VGL N G +
Sbjct: 414 AGHALHVFINGQLSG---TTYGGSENYKLTFSKYVNLRAGINKLSILSVAVGLPNGGLHY 470
Query: 524 DMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSK 583
+ G+ GPV L G D +D+S +KW+YK+GL G + ++ ++S +
Sbjct: 471 ETWNTGVLGPVTLKGLNED---TRDMSGYKWSYKIGLKG-EALNLHSVSGSSSVEWVTGS 526
Query: 584 NVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTE 643
V + +TWYKTTF++P N+P+ L++ MGKG W+NG ++GR+WP Y A+ S
Sbjct: 527 LVAQKQPLTWYKTTFDSPKGNEPLALDMSSMGKGQIWINGQSIGRHWPAYTAK---GSCG 583
Query: 644 SCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
C+Y G + KC NCG PSQ WYHVPR+W+K N LV+FEE+GGNP I+
Sbjct: 584 KCNYGGIFNEKKCHSNCGEPSQRWYHVPRAWLKSSGNVLVIFEEWGGNPEGISL 637
>gi|30679742|ref|NP_179264.2| beta-galactosidase 13 [Arabidopsis thaliana]
gi|75265629|sp|Q9SCU9.1|BGL13_ARATH RecName: Full=Beta-galactosidase 13; Short=Lactase 13; Flags:
Precursor
gi|6686898|emb|CAB64749.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|330251438|gb|AEC06532.1| beta-galactosidase 13 [Arabidopsis thaliana]
Length = 848
Score = 536 bits (1381), Expect = e-149, Method: Compositional matrix adjust.
Identities = 317/840 (37%), Positives = 449/840 (53%), Gaps = 112/840 (13%)
Query: 28 RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPL 87
V++DG ++ I+G R++L SGSIHYPRSTP MWP++IK+AK+GGL+ I+TYVFWN HEP
Sbjct: 43 EVTYDGTSLIINGNRELLYSGSIHYPRSTPEMWPNIIKRAKQGGLNTIQTYVFWNVHEPE 102
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
+ +++F+G DL++FIK I+ GLYV LR+GP++ AEW +GG P WL +PGI RT N
Sbjct: 103 QGKFNFSGRADLVKFIKLIEKNGLYVTLRLGPFIQAEWTHGGLPYWLREVPGI-FFRTDN 161
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
+ F + + +++DM K+EKLFASQGGPIIL QIENEY V Y + G +YI W +K
Sbjct: 162 EPFKEHTERYVKVVLDMMKEEKLFASQGGPIILGQIENEYSAVQRAYKEDGLNYIKWASK 221
Query: 208 MATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWG 254
+ S+D+G+PW+MC+++DAP PM PN N P +WTENWT F+ +G
Sbjct: 222 LVHSMDLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKDNKPSLWTENWTTQFRVFG 281
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
+R+ ED+A++VARFF GT NYYMYHGGTNFGRTS Y+TT Y DAP+DE+G
Sbjct: 282 DPPAQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYDDAPLDEFG 340
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYG--------NVTNTDY----GNSV---------- 352
+PK+GHL+ LH L +K L +G N T Y G V
Sbjct: 341 LEREPKYGHLKHLHNALNLCKKALLWGQPRVEKPSNETEIRYYEQPGTKVCAAFLANNNT 400
Query: 353 --------SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWK 404
G Y +P S+SILPDCKT +NT ++ + + ++ N +K
Sbjct: 401 EAAEKIKFRGKEYLIPHRSISILPDCKTVVYNTGEIISHHTSRNFMKSKKANKN--FDFK 458
Query: 405 WRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRI 464
E + ++G + T D SDY WY T+ + D+D LRI
Sbjct: 459 VFTESVPS-KIKGDSFIPVELY---GLTKDESDYGWYTTSFKIDDNDLSKKKGGKPNLRI 514
Query: 465 NSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFD 524
S G LH ++NG Y+ + + + +F++PV L G+N +++L G + GS +
Sbjct: 515 ASLGHALHVWLNGEYLGNGHGSHEEKSFVFQKPVTLKEGENHLTMLGVLTGFPDSGSYME 574
Query: 525 MVPNGIPGPVLLVGRAGDETIIKDLS-SHKWTYKVGLYGLDDKKFYNAKAANSERGWSSK 583
G P V ++G G T+ DL+ +KW KVG+ G ++ +A+ + W K
Sbjct: 575 HRYTG-PRSVSILG-LGSGTL--DLTEENKWGNKVGMEG--ERLGIHAEEGLKKVKW-EK 627
Query: 584 NVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTE 643
MTWY+T F+AP + + GMGKG WVNG +GRYW ++L+
Sbjct: 628 ASGKEPGMTWYQTYFDAPESQSAAAIRMNGMGKGLIWVNGEGVGRYWMSFLSP------- 680
Query: 644 SCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGG-NPSQINFQTVVV 702
G P+QI YH+PRS++K N LV+FEE P I+F V
Sbjct: 681 ----------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELIDFVIVNR 724
Query: 703 GTACGQAHENK-----------------------TMELTCHG-RRISEIKYASFGDPQGA 738
T C EN T L C G ++IS +++ASFG+P G
Sbjct: 725 DTVCSYIGENYTPSVRHWTRKNDQVQAITDDVHLTANLKCSGTKKISAVEFASFGNPNGT 784
Query: 739 CGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANL---GATSCAAGTVKRLVVEALC 795
CG F GSC A + ++EK C+GK C I +++ SC K+L V+ C
Sbjct: 785 CGNFTLGSCNAPVSK-KVVEKYCLGKAECVIPVNKSTFEQDKKDSCPK-VEKKLAVQVKC 842
>gi|413925747|gb|AFW65679.1| hypothetical protein ZEAMMB73_601729 [Zea mays]
Length = 846
Score = 536 bits (1380), Expect = e-149, Method: Compositional matrix adjust.
Identities = 317/838 (37%), Positives = 441/838 (52%), Gaps = 112/838 (13%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D R++ IDG R+I SGSIHYPRS P MWP+LI KAKEGGL+ IETY+FWN HEP +
Sbjct: 41 VSYDRRSLIIDGRREIFFSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYIFWNIHEPEK 100
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Q+DF G D++RF K IQ+ +Y ++R+GP++ AEWN+GG P WL +P I RT N+
Sbjct: 101 GQFDFEGRYDIVRFFKLIQEHNMYAMVRLGPFIQAEWNHGGLPYWLREIPDI-VFRTNNE 159
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
+ M+ F +I+ K LFASQGGPIILAQIENEY ++ + + + G YI W A M
Sbjct: 160 PYKMHMETFVKIIIKRLKDANLFASQGGPIILAQIENEYQHLEAAFKNDGTKYIKWAANM 219
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGG 255
A S ++G+PWIMC+++ APS + P N + P +WTENWT ++ +G
Sbjct: 220 AISTNVGIPWIMCKQTKAPSDVIPTCNGRNCGDTWPGPMNKSMPLLWTENWTAQYRVFGD 279
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
+R+AED+AFAVARFF GGT NYYMYHGGTNFGRTS + YD +AP+DE+G
Sbjct: 280 PPSQRSAEDIAFAVARFFSVGGTMTNYYMYHGGTNFGRTSAAFVMPKYYD-EAPLDEFGL 338
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN------------------------- 350
+PKWGHLR+LH LK +K L +G + G
Sbjct: 339 YKEPKWGHLRDLHLALKLCKKALLWGKTSTEKLGKQFEARVFEIPEQKVCVAFLSNHNTK 398
Query: 351 -----SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKW 405
+ G SY +P S+SIL DCKT F T VN Q N + DQ W
Sbjct: 399 DDVTLTFRGQSYFVPRHSISILADCKTVVFGTQHVNAQHNQRTFH----FADQTTQNNVW 454
Query: 406 RPEMINDFVV--RGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTL 462
+M ++ V + L D + T D +DY+WY ++ L+ DD + L
Sbjct: 455 --QMFDEEKVPKYKQSKIRLRKAGDLYNLTKDKTDYVWYTSSFKLEADDMPIRRDIKTVL 512
Query: 463 RINSSGQVLHAYVNGNYVD-SQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGS 521
+NS G A+VN +V TK + L E+P+ L +G N +++L++T+G+ + G+
Sbjct: 513 EVNSHGHASVAFVNTKFVGCGHGTKMNKAFTL-EKPMDLKKGVNHVAVLASTMGMMDSGA 571
Query: 522 KFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWS 581
+ G+ + AG DL+++ W + VGL G + K+ Y K S
Sbjct: 572 YLEHRLAGVDRVQIKGLNAG----TLDLTNNGWGHIVGLVG-EQKQIYTDKGMGSVTWKP 626
Query: 582 SKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCS 641
+ N +R +TWYK F+ P DP+VL++ MGKG +VNG +GRYW +Y
Sbjct: 627 AVN---DRPLTWYKRHFDMPSGEDPIVLDMSTMGKGLMFVNGQGIGRYWISY-------- 675
Query: 642 TESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVV 701
+ G PSQ YH+PRS+++ N LVLFEE G P I TV
Sbjct: 676 ---------------KHALGRPSQQLYHIPRSFLRQKDNVLVLFEEEFGRPDAIMILTVK 720
Query: 702 VGTAC------GQAH----ENKTME-------------LTCHGRR-ISEIKYASFGDPQG 737
C AH E K + LTC ++ I ++ +AS+G+P G
Sbjct: 721 RDNICTFISERNPAHIKSWERKDSQITVTAADLKPRATLTCSPKKLIQQVVFASYGNPMG 780
Query: 738 ACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
CG + GSC L+EK C+GK+ C++ S G GT L V+A C
Sbjct: 781 ICGNYTIGSCHTP-RAKELVEKACLGKRICTLPVSADVYGGDVNCPGTTATLAVQAKC 837
>gi|414888321|tpg|DAA64335.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
Length = 837
Score = 536 bits (1380), Expect = e-149, Method: Compositional matrix adjust.
Identities = 313/836 (37%), Positives = 430/836 (51%), Gaps = 110/836 (13%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DGR++ IDG+R + SG+IHYPRS P +WP LI++AKEGGL+ IETY+FWNAHEP
Sbjct: 36 VTYDGRSLMIDGKRDLFFSGAIHYPRSPPEVWPKLIERAKEGGLNTIETYIFWNAHEPEP 95
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y+F G DLI+++K IQ+ +Y I+RIGP++ AEWN+GG P WL + I R N
Sbjct: 96 GKYNFEGRFDLIKYLKMIQEHDMYAIVRIGPFIQAEWNHGGLPYWLREIDHII-FRANND 154
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
+ EM+ F IV K +LFASQGGPIIL QIENEYGN+ D+ G Y+ W A+M
Sbjct: 155 PYKKEMEKFVRFIVQKLKDAELFASQGGPIILTQIENEYGNIKKDHATDGDKYLEWAAQM 214
Query: 209 ATSLDIGVPWIMCQESDAPSPMF------------TPNNPNSPKIWTENWTGWFKSWGGK 256
A S GVPWIMC++S AP + T + N P +WTENWT F+++G +
Sbjct: 215 ALSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWTLRDKNKPMLWTENWTQQFRAYGDQ 274
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHL 316
R+AED+A+AV RFF GG+ NYYMYHGGTNFGRT G Y+ T Y +AP+DEYG
Sbjct: 275 VAMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMY 333
Query: 317 NQPKWGHLRELHKLLKSMEKTLTYG------------------------------NVTNT 346
+PK+GHLR+LH +++S +K G N T
Sbjct: 334 KEPKFGHLRDLHNVIRSYQKAFLLGKHSSEILGHGYEAHIFELPEENLCLSFLSNNNTGE 393
Query: 347 DYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
D G + +P+ SVSIL CK +NT +V Q N + ++ + QW+
Sbjct: 394 DGTVIFRGEKHYVPSRSVSILAGCKNVVYNTKRVFVQHNERSYHTSEVTSKNN--QWEMY 451
Query: 407 PEMI---NDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLR 463
E I D VR K L T D SDYLWY T+ L+ DD L+
Sbjct: 452 SEKIPKYRDTKVRMK-----EPLEQFNQTKDASDYLWYTTSFRLESDDLPFRNDIRPVLQ 506
Query: 464 INSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF 523
+ SS + + N +V +FE+PV L G N + LLS+T+G+++ G +
Sbjct: 507 VKSSAHSMMGFANDAFVGCARGSKQVKGFMFEKPVDLKVGVNHVVLLSSTMGMKDSGGEL 566
Query: 524 DMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSK 583
V +GI ++ G T DL + W +K L G +DK+ Y+ K + ++
Sbjct: 567 AEVKSGIQECLI----QGLNTGTLDLQVNGWGHKAALEG-EDKEIYSEKGVGKVQWKPAE 621
Query: 584 NVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTE 643
N R TWYK F+ P +DPVVL++ M KG +VNG +GRYW +Y
Sbjct: 622 N---GRAATWYKRYFDEPDGDDPVVLDMSSMDKGMIFVNGEGVGRYWVSYRTL------- 671
Query: 644 SCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVG 703
G PSQ YH+PR ++K N LV+FEE G P I QTV
Sbjct: 672 ----------------AGTPSQALYHIPRPFLKSKDNLLVVFEEEMGKPDGILVQTVTRD 715
Query: 704 TACGQAHENKTMELTC--------------HGRR----------ISEIKYASFGDPQGAC 739
C E+ ++ H RR I E+ +ASFG+P+G C
Sbjct: 716 DICLFISEHNPGQIKTWDTDGDKIKLIAEDHSRRGTLMCPPEKTIQEVVFASFGNPEGMC 775
Query: 740 GAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G F G+C + ++EK+C+GK SC + GA T L V+ C
Sbjct: 776 GNFTVGTCHTP-NAKQIVEKECLGKPSCMLPVDHTVYGADINCQSTTATLGVQVRC 830
>gi|224083510|ref|XP_002307056.1| predicted protein [Populus trichocarpa]
gi|222856505|gb|EEE94052.1| predicted protein [Populus trichocarpa]
Length = 715
Score = 535 bits (1379), Expect = e-149, Method: Compositional matrix adjust.
Identities = 301/716 (42%), Positives = 406/716 (56%), Gaps = 85/716 (11%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DGR++ IDG+RKIL SGSIHYPRSTP MWP L+ KA+EGG+D I+TYVFWN HEP
Sbjct: 25 VTYDGRSLIIDGQRKILFSGSIHYPRSTPEMWPSLVAKAREGGVDVIQTYVFWNLHEPRP 84
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+YDF+G DL+RFIK IQ QGLYV LRIGP++ +EW YGGFP WLH++P I R+ N+
Sbjct: 85 GEYDFSGRNDLVRFIKEIQAQGLYVCLRIGPFIESEWTYGGFPFWLHDVPDIV-YRSDNE 143
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQNFTT IV+M K E L+ASQGGPIIL+QIENEY NV + + D G Y+ W AKM
Sbjct: 144 PFKFYMQNFTTKIVNMMKSEGLYASQGGPIILSQIENEYQNVEAAFRDKGPPYVIWAAKM 203
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGG 255
A L GVPW+MC+++DAP P+ PN+P P +WTENWT +++ +GG
Sbjct: 204 AVELQTGVPWVMCKQTDAPDPVINTCNGMRCGETFGGPNSPTKPSLWTENWTSFYQVYGG 263
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
+ R+AED+AF V F G++ NYYM+HGGTNFGRT+ Y+ TSY AP+DEYG
Sbjct: 264 EPYIRSAEDIAFHVTLFIAKNGSYINYYMFHGGTNFGRTASA-YVITSYYDQAPLDEYGL 322
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN------------------------- 350
+ QPKWGHL+ELH +KS T+ G +N G
Sbjct: 323 IRQPKWGHLKELHAAIKSCSSTILEGVQSNFSLGQLQQAYIFEEEGAGCAAFLVNNDQKN 382
Query: 351 ----SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
++ L S+S+LPDC+ FNTAKVN + N + +Q +D +W+
Sbjct: 383 NATVEFRNITFELLPKSISVLPDCENIIFNTAKVNAKGNEITRTSSQLFDDAD--RWEAY 440
Query: 407 PEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
++I +F + +TL++ +T D SDYLWY T + L + S + L +
Sbjct: 441 TDVIPNF---ADTNLKSDTLLEHMNTTKDKSDYLWY-TFSFLPN-----SSCTEPILHVE 491
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDL-FERPVKLTRGKNQISLLSATVGLQNYGSKFD 524
S V A+VN Y S A E P+ L N IS+LS VGLQ+ G+ +
Sbjct: 492 SLAHVASAFVNNKYAGSAHGSKDAKGPFTMEAPIVLNDQMNTISILSTMVGLQDSGAFLE 551
Query: 525 MVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
G+ + R + I ++++W Y+ GL G + N E WS
Sbjct: 552 RRYAGLTRVEI---RCAQQEIYNFTNNYEWGYQAGLSGESLNIYMREHLDNIE--WSEVV 606
Query: 585 VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTES 644
++ ++W+K F+AP NDPVVLNL MGKG AWVNG ++GRYW ++L +
Sbjct: 607 SATDQPLSWFKIEFDAPTGNDPVVLNLSTMGKGEAWVNGQSIGRYWLSFLTSK------- 659
Query: 645 CDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTV 700
G PSQ YH+PR+++ N LVL EE GG+P I+ TV
Sbjct: 660 ----------------GQPSQTLYHIPRAFLNSSGNLLVLLEESGGDPLHISLDTV 699
>gi|115477689|ref|NP_001062440.1| Os08g0549200 [Oryza sativa Japonica Group]
gi|75136208|sp|Q6ZJJ0.1|BGL11_ORYSJ RecName: Full=Beta-galactosidase 11; AltName: Full=Lactase 115;
Flags: Precursor
gi|42407808|dbj|BAD08952.1| putative glycosyl hydrolase family 35 (beta-galactosidase) [Oryza
sativa Japonica Group]
gi|113624409|dbj|BAF24354.1| Os08g0549200 [Oryza sativa Japonica Group]
Length = 848
Score = 535 bits (1379), Expect = e-149, Method: Compositional matrix adjust.
Identities = 328/848 (38%), Positives = 448/848 (52%), Gaps = 125/848 (14%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
+++D R++ IDG R+I SGSIHYPRS P WPDLI KAKEGGL+ IE+YVFWN HEP +
Sbjct: 33 ITYDRRSLIIDGHREIFFSGSIHYPRSPPDTWPDLISKAKEGGLNVIESYVFWNGHEPEQ 92
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y+F G DLI+F K IQ++ +Y I+RIGP+V AEWN+GG P WL +P I RT N+
Sbjct: 93 GVYNFEGRYDLIKFFKLIQEKEMYAIVRIGPFVQAEWNHGGLPYWLREIPDII-FRTNNE 151
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F M+ F TLIV+ K+ KLFASQGGPIILAQIENEY ++ + +AG YINW AKM
Sbjct: 152 PFKKYMKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWAAKM 211
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGG 255
A + + GVPWIMC+++ AP + P + P +WTENWT ++ +G
Sbjct: 212 AIATNTGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPADKKKPLLWTENWTAQYRVFGD 271
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
+R+AED+AF+VARFF GGT NYYMYHGGTNFGR +G ++ Y +AP+DE+G
Sbjct: 272 PPSQRSAEDIAFSVARFFSVGGTMANYYMYHGGTNFGR-NGAAFVMPRYYDEAPLDEFGL 330
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYGNVT----------------------------NTD 347
+PKWGHLR+LH L+ +K L +GN + NT
Sbjct: 331 YKEPKWGHLRDLHHALRHCKKALLWGNPSVQPLGKLYEARVFEMKEKNVCVAFLSNHNTK 390
Query: 348 YGNSVS--GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKW 405
+V+ G Y + S+SIL DCKT F+T VN+Q N + DQ W
Sbjct: 391 EDGTVTFRGQKYFVARRSISILADCKTVVFSTQHVNSQHNQRTFH----FADQTVQDNVW 446
Query: 406 RPEMINDFVVRGKGHFALNT---LIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTL 462
EM ++ + ++ T L T D +DYLWY T+ L+ DD L
Sbjct: 447 --EMYSEEKIPRYSKTSIRTQRPLEQYNQTKDKTDYLWYTTSFRLETDDLPYRKEVKPVL 504
Query: 463 RINSSGQVLHAYVNGNYVD-SQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGS 521
++S G + A+VN +V TK + + E+ + L G N +++LS+T+GL + GS
Sbjct: 505 EVSSHGHAIVAFVNDAFVGCGHGTKINKAFTM-EKAMDLKVGVNHVAILSSTLGLMDSGS 563
Query: 522 KFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWS 581
+ G V V G T DL+++ W + VGL G + +SE+G
Sbjct: 564 YLEHRMAG----VYTVTIRGLNTGTLDLTTNGWGHVVGLDG-------ERRRVHSEQGMG 612
Query: 582 S---KNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEED 638
+ K N+ +TWY+ F+ P DPVV++L MGKGF +VNG LGRYW +Y
Sbjct: 613 AVAWKPGKDNQPLTWYRRRFDPPSGTDPVVIDLTPMGKGFLFVNGEGLGRYWVSY----- 667
Query: 639 GCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQ 698
+ G PSQ YHVPRS ++ NTL+ FEE GG P I
Sbjct: 668 ------------------HHALGKPSQYLYHVPRSLLRPKGNTLMFFEEEGGKPDAIMIL 709
Query: 699 TVVVGTAC------GQAH-----ENK-------------------TMELTCHGRR-ISEI 727
TV C AH E+K T L+C ++ I +
Sbjct: 710 TVKRDNICTFMTEKNPAHVRWSWESKDSQPKAVAGAGAGAGGLKPTAVLSCPTKKTIQSV 769
Query: 728 KYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVK 787
+AS+G+P G CG + GSC A ++EK C+G+K+CS+ S G GT
Sbjct: 770 VFASYGNPLGICGNYTVGSCHAP-RTKEVVEKACIGRKTCSLVVSSEVYGGDVHCPGTTG 828
Query: 788 RLVVEALC 795
L V+A C
Sbjct: 829 TLAVQAKC 836
>gi|222640983|gb|EEE69115.1| hypothetical protein OsJ_28192 [Oryza sativa Japonica Group]
Length = 848
Score = 535 bits (1377), Expect = e-149, Method: Compositional matrix adjust.
Identities = 328/848 (38%), Positives = 447/848 (52%), Gaps = 125/848 (14%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
+++D R++ IDG R+I SGSIHYPRS P WPDLI KAKEGGL+ IE+YVFWN HEP +
Sbjct: 33 ITYDRRSLIIDGHREIFFSGSIHYPRSPPDTWPDLISKAKEGGLNVIESYVFWNGHEPEQ 92
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y+F G DLI+F K IQ++ +Y I+RIGP+V AEWN+GG P WL +P I RT N+
Sbjct: 93 GVYNFEGRYDLIKFFKLIQEKEMYAIVRIGPFVQAEWNHGGLPYWLREIPDII-FRTNNE 151
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F M+ F TLIV+ K+ KLFASQGGPIILAQIENEY ++ + +AG YINW AKM
Sbjct: 152 PFKKYMKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWAAKM 211
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGG 255
A + + GVPWIMC+++ AP + P + P +WTENWT ++ +G
Sbjct: 212 AIATNTGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPADKKKPLLWTENWTAQYRVFGD 271
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
+R+AED+AF+VARFF GGT NYYMYHGGTNFGR +G ++ Y +AP DE+G
Sbjct: 272 PPSQRSAEDIAFSVARFFSVGGTMANYYMYHGGTNFGR-NGAAFVMPRYYDEAPFDEFGL 330
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYGNVT----------------------------NTD 347
+PKWGHLR+LH L+ +K L +GN + NT
Sbjct: 331 YKEPKWGHLRDLHHALRHCKKALLWGNPSVQPLGKLYEARVFEMKEKNVCVAFLSNHNTK 390
Query: 348 YGNSVS--GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKW 405
+V+ G Y + S+SIL DCKT F+T VN+Q N + DQ W
Sbjct: 391 EDGTVTFRGQKYFVARRSISILADCKTVVFSTQHVNSQHNQRTFH----FADQTVQDNVW 446
Query: 406 RPEMINDFVVRGKGHFALNT---LIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTL 462
EM ++ + ++ T L T D +DYLWY T+ L+ DD L
Sbjct: 447 --EMYSEEKIPRYSKTSIRTQRPLEQYNQTKDKTDYLWYTTSFRLETDDLPYRKEVKPVL 504
Query: 463 RINSSGQVLHAYVNGNYVD-SQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGS 521
++S G + A+VN +V TK + + E+ + L G N +++LS+T+GL + GS
Sbjct: 505 EVSSHGHAIVAFVNDAFVGCGHGTKINKAFTM-EKAMDLKVGVNHVAILSSTLGLMDSGS 563
Query: 522 KFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWS 581
+ G V V G T DL+++ W + VGL G + +SE+G
Sbjct: 564 YLEHRMAG----VYTVTIRGLNTGTLDLTTNGWGHVVGLDG-------ERRRVHSEQGMG 612
Query: 582 S---KNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEED 638
+ K N+ +TWY+ F+ P DPVV++L MGKGF +VNG LGRYW +Y
Sbjct: 613 AVAWKPGKDNQPLTWYRRRFDPPSGTDPVVIDLTPMGKGFLFVNGEGLGRYWVSY----- 667
Query: 639 GCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQ 698
+ G PSQ YHVPRS ++ NTL+ FEE GG P I
Sbjct: 668 ------------------HHALGKPSQYLYHVPRSLLRPKGNTLMFFEEEGGKPDAIMIL 709
Query: 699 TVVVGTAC------GQAH-----ENK-------------------TMELTCHGRR-ISEI 727
TV C AH E+K T L+C ++ I +
Sbjct: 710 TVKRDNICTFMTEKNPAHVRWSWESKDSQPKAVAGAGAGAGGFKPTAVLSCPTKKTIQSV 769
Query: 728 KYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVK 787
+AS+G+P G CG + GSC A ++EK C+G+K+CS+ S G GT
Sbjct: 770 VFASYGNPLGICGNYTVGSCHAP-RTKEVVEKACIGRKTCSLVVSSEVYGGDVHCPGTTG 828
Query: 788 RLVVEALC 795
L V+A C
Sbjct: 829 TLAVQAKC 836
>gi|357154419|ref|XP_003576777.1| PREDICTED: beta-galactosidase 12-like [Brachypodium distachyon]
Length = 835
Score = 534 bits (1376), Expect = e-149, Method: Compositional matrix adjust.
Identities = 314/836 (37%), Positives = 436/836 (52%), Gaps = 110/836 (13%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D R++ IDG+R + SG+IHYPRS P MWP L+ +AK+GGL+ IETYVFWNAHEP
Sbjct: 33 VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWPKLLDRAKDGGLNTIETYVFWNAHEPEP 92
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y+F G DLI+F+K IQD +Y ++RIGP++ AEWN+GG P WL +P I R N+
Sbjct: 93 GKYNFEGRCDLIKFLKLIQDNDMYAVIRIGPFIQAEWNHGGLPYWLREIPHII-FRANNE 151
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
+ EM+ F IV K +FASQGGPIILAQIENEYGN+ D+ G Y+ W A+M
Sbjct: 152 PYKKEMEKFVRFIVQKLKDADMFASQGGPIILAQIENEYGNIKKDHITDGDKYLEWAAEM 211
Query: 209 ATSLDIGVPWIMCQESDAPSPMF------------TPNNPNSPKIWTENWTGWFKSWGGK 256
A S +IG+PWIMC+++ AP + T + N P++WTENWT F+++G +
Sbjct: 212 ALSTNIGIPWIMCKQTTAPGVVIPTCNGRHCGDTWTLRDKNKPRLWTENWTAQFRAFGDQ 271
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHL 316
R+AED+A++V RFF GGT NYYMY+GGTNFGRT G Y+ T Y +APIDEYG
Sbjct: 272 AAVRSAEDIAYSVLRFFAKGGTLVNYYMYYGGTNFGRT-GASYVLTGYYDEAPIDEYGLN 330
Query: 317 NQPKWGHLRELHKLLKSMEKTLTYGN--------------------------VTNTDYGN 350
+PK+GHLR+LHKL+KS K G ++N + G
Sbjct: 331 KEPKFGHLRDLHKLIKSYHKAFLVGKQSFELLGHGYEAHNYELPEENLCLAFISNNNTGE 390
Query: 351 S----VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
G Y +P+ SVSIL DC +NT +V Q + + + A W+
Sbjct: 391 DGTVMFRGKKYYIPSRSVSILADCNHVVYNTKRVFVQHSER--SFHTADESTKNNVWEMY 448
Query: 407 PEMINDF---VVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLR 463
E I + VR K L T D SDYLWY T+ L+ DD ++
Sbjct: 449 SEPIPRYKVTSVRTKEPLEQYNL-----TKDKSDYLWYTTSFRLEADDLPFRRDIRPVVQ 503
Query: 464 INSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF 523
+ SS + +VN + S LFE+P+ L G N ++LLS+++G+++ G +
Sbjct: 504 VKSSAHAMMGFVNDAFAGSGRGSKKDKGFLFEKPIDLRIGINHLALLSSSMGMKDSGGEL 563
Query: 524 DMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSK 583
V GI ++ G T DL + W +K+ L G +DK+ Y K + + ++
Sbjct: 564 VEVKGGIQDCMI----QGLNTGTLDLQGNGWGHKINLDG-EDKEIYTEKGMGTVKWKPAE 618
Query: 584 NVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTE 643
N +TWY+ F+ P +DPVVL++ M KG +VNG +GRYW +Y
Sbjct: 619 N---GHAVTWYRRYFDEPDGDDPVVLDMSSMSKGMIFVNGEGVGRYWTSYKT-------- 667
Query: 644 SCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVG 703
G PSQ YH+PR ++K N LV+FEE G P I QTV
Sbjct: 668 ---------------IAGLPSQSLYHIPRPFLKSKKNLLVVFEEEIGKPEGILIQTVRRD 712
Query: 704 TACGQAHENKTME-----------------------LTC-HGRRISEIKYASFGDPQGAC 739
C E+ + LTC H + I E+ +ASFG+P+GAC
Sbjct: 713 DICFLMSEHNPAQVKTWDADGGQIKLIAEDHSSRGILTCPHKKTIEEVVFASFGNPEGAC 772
Query: 740 GAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G F G+C + + K+C+GKKSC + GA T L V+ C
Sbjct: 773 GNFTAGTCHTP-NAKEFVAKECLGKKSCVLPLIHTLYGADINCPTTTATLAVQVRC 827
>gi|219887949|gb|ACL54349.1| unknown [Zea mays]
gi|414870186|tpg|DAA48743.1| TPA: beta-galactosidase [Zea mays]
Length = 850
Score = 533 bits (1373), Expect = e-148, Method: Compositional matrix adjust.
Identities = 314/840 (37%), Positives = 442/840 (52%), Gaps = 114/840 (13%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D R++ DG R+I LSGSIHYPRS P MWP+LI KAKEGGL+ IETYVFWN HEP +
Sbjct: 43 VSYDRRSLMFDGHREIFLSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYVFWNIHEPEK 102
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+++F G D++RF + IQ+ +Y ++R+GP++ AEWN+GG P WL +P I RT N+
Sbjct: 103 GEFNFEGQNDVVRFFQLIQEHDMYAMVRLGPFIQAEWNHGGLPYWLREIPDI-VFRTNNE 161
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
+ M+ F +I+ K LFASQGGPIILAQIENEY ++ + + D G YINW AKM
Sbjct: 162 PYKMHMETFVKIIIKRLKDANLFASQGGPIILAQIENEYQHMEAAFKDEGTKYINWAAKM 221
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGG 255
A S +IG+PWIMC+++ APS + P N + P +WTENWT ++ +G
Sbjct: 222 AISTNIGIPWIMCKQTKAPSDVIPTCNGRNCGDTWPGPTNKSMPLLWTENWTAQYRVFGD 281
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
+R+AED+AFAVARFF GGT NYYMYHGGTNFGRTS + YD +AP+DE+G
Sbjct: 282 PPSQRSAEDIAFAVARFFSVGGTLANYYMYHGGTNFGRTSAAFVMPKYYD-EAPLDEFGL 340
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS---------------------- 353
+PKWGHLR+LH+ LK +K L +G + G +
Sbjct: 341 YKEPKWGHLRDLHQALKLCKKALLWGTPSTEKLGKQLEARVFEMPEQKVCVAFLSNHNTK 400
Query: 354 --------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKW 405
G Y +P S+S+L DC+T F T VN Q N + DQ W
Sbjct: 401 DDATMTFRGRPYFVPRHSISVLADCETVVFGTQHVNAQHNQRTFH----FADQTAQNNVW 456
Query: 406 RPEMINDFVV--RGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTL 462
EM + V + L D + T D +DY+WY ++ L+ DD + L
Sbjct: 457 --EMFDGENVPKYKQAKIRLRKAGDLYNLTKDKTDYVWYTSSFKLEADDMPIRSDIKTVL 514
Query: 463 RINSSGQVLHAYVNGNYVD-SQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGS 521
+NS G A+VN +V TK + L E+P+ L +G N +++L++++G+ + G+
Sbjct: 515 EVNSHGHASVAFVNNKFVGCGHGTKMNKAFTL-EKPMDLKKGVNHVAVLASSMGMTDSGA 573
Query: 522 KFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWS 581
+ G+ + AG DL+++ W + VGL G + K+ Y K S
Sbjct: 574 YMEHRLAGVDRVQITGLNAG----TLDLTNNGWGHIVGLVG-ERKQIYTDKGMGSVTWKP 628
Query: 582 SKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCS 641
+ N +R +TWYK F+ P DPVVL++ MGKG +VNG +GRYW +Y
Sbjct: 629 AMN---DRPLTWYKRHFDMPSGEDPVVLDMSTMGKGMMFVNGQGIGRYWISY-------- 677
Query: 642 TESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVV 701
+ G PSQ YHVPRS+++ N LVLFEE G P I TV
Sbjct: 678 ---------------KHALGRPSQQLYHVPRSFLRQKDNMLVLFEEEFGRPDAIMILTVK 722
Query: 702 VGTAC------GQAH----ENKTMELTCHG----------------RRISEIKYASFGDP 735
C AH E K ++T + I ++ +AS+G+P
Sbjct: 723 RDNICTFISERNPAHIMSWERKDSQITAKANADDLRARAALACPPKKLIQQVVFASYGNP 782
Query: 736 QGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G CG + GSC ++EK C+GK+ C++ + G + +GT L V+A C
Sbjct: 783 AGICGNYTVGSCHTP-RAKEVVEKACLGKRVCTLPVAADVYGGDANCSGTTATLAVQAKC 841
>gi|297724143|ref|NP_001174435.1| Os05g0428100 [Oryza sativa Japonica Group]
gi|75137607|sp|Q75HQ3.1|BGAL7_ORYSJ RecName: Full=Beta-galactosidase 7; Short=Lactase 7; Flags:
Precursor
gi|46391137|gb|AAS90664.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|53981746|gb|AAV25023.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|255676388|dbj|BAH93163.1| Os05g0428100 [Oryza sativa Japonica Group]
Length = 775
Score = 533 bits (1373), Expect = e-148, Method: Compositional matrix adjust.
Identities = 321/809 (39%), Positives = 427/809 (52%), Gaps = 97/809 (11%)
Query: 25 LAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAH 84
L +++DGRA+ + G R++ SG +HY RSTP MWP LI KAK GGLD I+TYVFWN H
Sbjct: 25 LGREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVH 84
Query: 85 EPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELR 144
EP++ QY+F G DL++FI+ IQ QGLYV LRIGP+V AEW YGGFP WLH++P I R
Sbjct: 85 EPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSI-TFR 143
Query: 145 TTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW 204
+ N+ F MQNF T IV M K E L+ QGGPII++QIENEY + +G +G Y+ W
Sbjct: 144 SDNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRW 203
Query: 205 CAKMATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGW-- 249
A MA L GVPW+MC+++DAP P+ PN+PN P +WTENWT
Sbjct: 204 AAAMAVGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRSN 263
Query: 250 --------FKSWGGKDPKRTAEDLAFAVARFF-QFGGTFQNYYMYHGGTNFGRTSGGPYL 300
+ +G R ED+AFAVA F + G+F +YYMYHGGTNFGR + Y+
Sbjct: 264 GQNNSAFSYPIYGNDTKLRAPEDIAFAVALFIARKKGSFVSYYMYHGGTNFGRFAAS-YV 322
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLP 360
TTSY AP+DEY + L + ++ N ++ N S L
Sbjct: 323 TTSYYDGAPLDEYDF----------KCVAFLVNFDQH----NTPKVEFRN----ISLELA 364
Query: 361 AWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGH 420
S+S+L DC+ F TAKVN Q + Q+ ND WK E + + K
Sbjct: 365 PKSISVLSDCRNVVFETAKVNAQHGSRTANAVQSLNDIN--NWKAFIEPVPQDL--SKST 420
Query: 421 FALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNY 479
+ N L +Q +T D +DYLWY+ + + D G+ L + S +LHA+VN Y
Sbjct: 421 YTGNQLFEQLTTTKDETDYLWYIVSYKNRASD----GNQIAHLYVKSLAHILHAFVNNEY 476
Query: 480 VDSQWTKY-GASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVG 538
V S + G N + + L G N ISLLS VG + G+ + GI VG
Sbjct: 477 VGSVHGSHDGPRNIVLNTHMSLKEGDNTISLLSVMVGSPDSGAYMERRTFGIQ----TVG 532
Query: 539 RAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTF 598
+ + L++ W Y+VGL+G D Y + NS R W N + +TWYKTTF
Sbjct: 533 IQQGQQPMHLLNNDLWGYQVGLFGEKD-SIYTQEGTNSVR-WMDINNLIYHPLTWYKTTF 590
Query: 599 EAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAY 658
P ND V LNL MGKG WVNG ++GRYW ++ A
Sbjct: 591 STPPGNDAVTLNLTSMGKGEVWVNGESIGRYWVSFKAPS--------------------- 629
Query: 659 NCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHENKTMELT 718
G PSQ YH+PR ++ N LVL EE GG+P QI T+ V T CG E L
Sbjct: 630 --GQPSQSLYHIPRGFLTPKDNLLVLVEEMGGDPLQITVNTMSVTTVCGNVDEFSVPPLQ 687
Query: 719 CHGR------------RISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKS 766
G+ RIS I++AS+G+P G C +F+ GSC AE ++++ C+G++
Sbjct: 688 SRGKVPKVRIWCQGGNRISSIEFASYGNPVGDCRSFRIGSCHAE-SSESVVKQSCIGRRG 746
Query: 767 CSIEASEANLGATSCAAGTVKRLVVEALC 795
CSI A G C G K L+V A C
Sbjct: 747 CSIPVMAAKFGGDPC-PGIQKSLLVVADC 774
>gi|296082606|emb|CBI21611.3| unnamed protein product [Vitis vinifera]
Length = 729
Score = 531 bits (1369), Expect = e-148, Method: Compositional matrix adjust.
Identities = 303/721 (42%), Positives = 407/721 (56%), Gaps = 88/721 (12%)
Query: 28 RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPL 87
+V++DGR++ IDG RKIL SGSIHYPRSTP MW LI KAKEGG+D I+TYVFWN HEP
Sbjct: 25 QVTYDGRSLIIDGHRKILFSGSIHYPRSTPQMWASLIAKAKEGGVDVIQTYVFWNRHEPQ 84
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
QYDF G DL +FIK IQ QGLY LRIGP++ +EW+YGG P WLH++ GI RT N
Sbjct: 85 PGQYDFNGRYDLAKFIKEIQAQGLYACLRIGPFIESEWSYGGLPFWLHDVHGI-VYRTDN 143
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
+ F MQNFTT IV++ K E L+ASQGGPIIL+QIENEY N+ + + + G SY+ W AK
Sbjct: 144 EPFKFYMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYQNIEAAFNEKGPSYVRWAAK 203
Query: 208 MATSLDIGVPWIMCQESDAPSPM------------FT-PNNPNSPKIWTENWTGWFKSWG 254
MA L GVPW+MC++SDAP P+ FT PN+PN P +WTENWT +++ +G
Sbjct: 204 MAVELQTGVPWVMCKQSDAPDPVINTCNGMRCGQTFTGPNSPNKPSMWTENWTSFYEVFG 263
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
G+ R+AED+AF VA F G++ NYYMYHGGTNFGR S Y+ TSY AP+DEYG
Sbjct: 264 GETYLRSAEDIAFHVALFIARNGSYVNYYMYHGGTNFGRASSA-YIKTSYYDQAPLDEYG 322
Query: 315 HLNQPKWGHLRELHK--------LLKSMEKTLTYGN-----------------VTNTDYG 349
+ QPKWGHL+ELH LL ++ ++ G + N D G
Sbjct: 323 LIRQPKWGHLKELHAAITLCSTPLLNGVQSNISLGQLQEAYVFQEEMGGCVAFLVNNDEG 382
Query: 350 NSVS----GSSYNLPAWSVSILPDCKTEEFNTAKV---NTQTNVKVKRPNQA--GNDQAP 400
N+ + S L S+SILPDCK FNTAKV + Q+ K++ +++ + A
Sbjct: 383 NNSTVLFQNVSIELLPKSISILPDCKNVIFNTAKVCSSSRQSAYKIQELSRSCIQSFDAV 442
Query: 401 LQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSN 459
+W+ + I +F+ N +++ + T D SDYLWY S +
Sbjct: 443 DRWEEYKDAIPNFL---DTSLKSNMILEHMNMTKDESDYLWYTFRFQPN------SSCTE 493
Query: 460 MTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNY 519
L I S +HA+VN YV + + F+ P+ L N IS+LS VG +
Sbjct: 494 PLLHIESLAHAVHAFVNNIYVGATHGSHDMKGFTFKSPISLNNEMNNISILSVMVGFPDS 553
Query: 520 GSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERG 579
G+ + G+ + G I D +++ W Y+VGL G + +N E
Sbjct: 554 GAYLESRFAGLTRVEIQCTEKG----IYDFANYTWGYQVGLSGEKLHIYKEENLSNVE-- 607
Query: 580 WSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDG 639
W + N+ +TWYK F P +DPV LNL MGKG AWVNG ++GRYW ++ +
Sbjct: 608 WRKTEISTNQPLTWYKIVFNTPSGDDPVALNLSTMGKGEAWVNGQSIGRYWVSFHNSK-- 665
Query: 640 CSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
G+PSQ YHVPR+++K N LVL EE G+P I+ +T
Sbjct: 666 ---------------------GDPSQTLYHVPRAFLKTSENLLVLLEEANGDPLHISLET 704
Query: 700 V 700
+
Sbjct: 705 I 705
>gi|326520333|dbj|BAK07425.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 841
Score = 531 bits (1367), Expect = e-148, Method: Compositional matrix adjust.
Identities = 320/839 (38%), Positives = 433/839 (51%), Gaps = 114/839 (13%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
+++D R++ IDG R+I SGSIHYPRS WPDLI +AKEGGL+ IE+YVFWN HEP
Sbjct: 36 ITYDRRSLMIDGRREIFFSGSIHYPRSPFHEWPDLIARAKEGGLNVIESYVFWNIHEPEM 95
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y+F G D+I+F K IQ+ ++ ++RIGP+V AEWN+GG P WL +P I RT N+
Sbjct: 96 GVYNFEGRYDMIKFFKLIQEHEMFAMVRIGPFVQAEWNHGGLPYWLREVPDIV-FRTDNE 154
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
+ MQ F TL+V+ K KLFASQGGPIILAQIENEY ++ + + + G YI+W AKM
Sbjct: 155 PYKKLMQKFVTLVVNKLKDAKLFASQGGPIILAQIENEYQHMEAAFKENGTRYIDWAAKM 214
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGG 255
A S GVPWIMC+++ AP+ + P + N P +WTENWT ++ +G
Sbjct: 215 AISTSTGVPWIMCKQTKAPAEVIPTCNGRHCGDTWPGPTDKNKPLLWTENWTAQYRVFGD 274
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
+R+AED+AFAVARFF GG+ NYYMYHGGTNFGRT G ++ Y +AP+DE+G
Sbjct: 275 PPSQRSAEDIAFAVARFFSVGGSMVNYYMYHGGTNFGRT-GASFVMPRYYDEAPLDEFGM 333
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN------------------------- 350
+PKWGHLR+LH L+ +K L GN + G
Sbjct: 334 YKEPKWGHLRDLHHALRLCKKALLRGNPSTQPLGKLYEARLFEIPEQKVCVAFLSNHNTK 393
Query: 351 -----SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKW 405
+ G Y +P SVSIL DCKT F+T VN Q N + DQ W
Sbjct: 394 EDGTVTFRGQQYFVPRRSVSILADCKTVVFSTQHVNAQHNQRTFHLT----DQTLQNNVW 449
Query: 406 RPEMINDFVVRGK--GHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLR 463
D V K + L T D +DYLWY T+ L+ +D L
Sbjct: 450 EMYTEGDKVPTYKFTTDRSEKPLEAYNMTKDKTDYLWYTTSFKLEAEDLPFRQDIKPVLE 509
Query: 464 INSSGQVLHAYVNGNYVDS-QWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSK 522
+S G + A+VNG V + TK + L E+P+++ G N +S+LS+T+GLQ+ G+
Sbjct: 510 ASSHGHAMVAFVNGKLVGAAHGTKMNKAFSL-EKPIEVRAGINHVSILSSTLGLQDSGAY 568
Query: 523 FDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSS 582
+ G+ V G T DLSS+ W + VGL G K A+ ++G
Sbjct: 569 LEHRQAGVHS----VTIQGLNTGTLDLSSNGWGHIVGLDG-------ERKQAHMDKGGEV 617
Query: 583 KNVP--LNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGC 640
+ P + +TWY+ F+ P DPVV++L MGKG +VNG LGRYW +Y
Sbjct: 618 QWKPAVFDLPLTWYRRRFDMPSGEDPVVIDLNPMGKGILFVNGEGLGRYWSSY------- 670
Query: 641 STESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTV 700
+ G PSQ YHVPR ++K N L +FEE GG P I TV
Sbjct: 671 ----------------KHALGRPSQYLYHVPRCFLKPTGNVLTIFEEEGGRPDAIMILTV 714
Query: 701 VVGTACG----------QAHENKTMELTC--------------HGRRISEIKYASFGDPQ 736
C ++ E K +LT + I ++ +AS+G+P
Sbjct: 715 KRDNICSFISEKNPGHVRSWERKDSQLTVVADDLKPRAVLTCPEKKTIQQVVFASYGNPL 774
Query: 737 GACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G CG + G+C ++EK CVGKKSC + S G GT L V+A C
Sbjct: 775 GICGNYTVGNCHTP-KAKEVVEKACVGKKSCVLAVSHEVYGGDLNCPGTTATLAVQAKC 832
>gi|413957070|gb|AFW89719.1| hypothetical protein ZEAMMB73_400203 [Zea mays]
Length = 809
Score = 530 bits (1364), Expect = e-147, Method: Compositional matrix adjust.
Identities = 317/799 (39%), Positives = 426/799 (53%), Gaps = 125/799 (15%)
Query: 10 AILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPG----------- 58
A+L C + ++ ++ V++D +A+ IDG+R+IL SGSIHYPRSTP
Sbjct: 8 ALLGCAVAVSVLVAAVECAVTYDKKAVLIDGQRRILFSGSIHYPRSTPDVTAFYKISSPP 67
Query: 59 ---------------MWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFI 103
MW LI+KAK+GGLD I+TYVFWN HEP
Sbjct: 68 TIPWRGLWLRIYGSEMWEGLIQKAKDGGLDVIQTYVFWNGHEP----------------- 110
Query: 104 KTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVD 163
T + + R Y E GFPVWL +PGI RT N+ F MQ FT IV
Sbjct: 111 -TPGNDSDGIFFRFEQYYFEE---SGFPVWLKYVPGIS-FRTDNEPFKTAMQGFTEKIVG 165
Query: 164 MAKKEKLFASQGGPIILAQ---------IENEYGNVMSDYGDAGKSYINWCAKMATSLDI 214
M K E LFASQGGPIIL+Q IENEYG ++G AG++YINW AKMA L
Sbjct: 166 MMKSENLFASQGGPIILSQASIIFSLDLIENEYGPEGREFGAAGQAYINWAAKMAVGLGT 225
Query: 215 GVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAE 263
GVPW+MC+E DAP P+ F+PN P P +WTE W+GWF +GG +R E
Sbjct: 226 GVPWVMCKEEDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQRPVE 285
Query: 264 DLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGH 323
DLAFAVARF Q GG+F NYYMYHGGTNFGRT+GGP++TTSYDYDAPIDEYG + +PK H
Sbjct: 286 DLAFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLVREPKHSH 345
Query: 324 LRELHKLLKSMEKTL--------TYGNVTNTDYGNSVSGSS------------------- 356
L+ELH+ +K E+ L T G + S SG +
Sbjct: 346 LKELHRAVKLCEQALVSVDPAITTLGTMQEARVFQSPSGCAAFLANYNSNSYAKVVFNNE 405
Query: 357 -YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVV 415
Y+LP WS+SILPDCK FN+A V QT+ + G+ + + W+ E ++
Sbjct: 406 QYSLPPWSISILPDCKNVVFNSATVGVQTS----QMQMWGDGASSMTWERYDEEVDSLAA 461
Query: 416 RGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSN-MTLRINSSGQVLHA 473
L++Q T D SDYLWY+T+ D+ + L G ++L + S+G LH
Sbjct: 462 --APLLTTTGLLEQLNVTRDSSDYLWYITSVDISSSENFLQGGGKPLSLSVQSAGHALHV 519
Query: 474 YVNGNYVDSQWTKYGASNDL---FERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGI 530
+VNG Q + YG D + L G N+I+LLS GL N G ++ G+
Sbjct: 520 FVNGQL---QGSAYGTREDRRIKYNGNASLRAGTNKIALLSVACGLPNVGVHYETWNTGV 576
Query: 531 PGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRR 590
GPV+L G DE +DL+ W+Y+VGL G ++ N+ +S W ++ +
Sbjct: 577 GGPVVLHGL--DEG-SRDLTWQTWSYQVGLKG--EQMNLNSIEGSSSVEWMQGSLIAQNQ 631
Query: 591 --MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYR 648
+ WY+ FE P ++P+ L++ MGKG W+NG ++GRYW Y DG E C Y
Sbjct: 632 QPLAWYRAYFETPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAY---ADGDCKE-CSYT 687
Query: 649 GPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQ 708
G + + KC CG P+Q WYHVP+SW++ N LV+FEE GG+ S+I V + C
Sbjct: 688 GTFRAPKCQSGCGQPTQRWYHVPKSWLQPTRNLLVVFEELGGDSSKIALVKRSVSSVCAD 747
Query: 709 AHEN----KTMELTCHGRR 723
E+ K ++ +G R
Sbjct: 748 VSEDHPNIKNWQIESYGER 766
>gi|222635782|gb|EEE65914.1| hypothetical protein OsJ_21762 [Oryza sativa Japonica Group]
Length = 579
Score = 528 bits (1361), Expect = e-147, Method: Compositional matrix adjust.
Identities = 276/570 (48%), Positives = 354/570 (62%), Gaps = 54/570 (9%)
Query: 30 SHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRR 89
++D R++TI+G+R+IL+SGSIHYPRSTP MWPDLI+KAK+GGLD I+TYVFWN HEP++
Sbjct: 23 TYDHRSLTINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQG 82
Query: 90 QYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKV 149
QY F+ DL+RF+K ++ GLYV LRIGPYVCAEWNYGGFPVWL +PGI RT N
Sbjct: 83 QYYFSDRYDLVRFVKLVKQAGLYVNLRIGPYVCAEWNYGGFPVWLKYVPGI-SFRTDNGP 141
Query: 150 FMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMA 209
F MQ F IV M K E LF QGGPIILAQ+ENEYG + S G KSY++W AKMA
Sbjct: 142 FKAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGSGAKSYVDWAAKMA 201
Query: 210 TSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDP 258
+ + GVPWIMC++ DAP P+ FTPN+ N P +WTE W+GWF ++GG P
Sbjct: 202 VATNAGVPWIMCKQDDAPDPVINTCNGFYCDDFTPNSKNKPSMWTEAWSGWFTAFGGTVP 261
Query: 259 KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQ 318
+R EDLAFAVARF Q GG+F NYYMYHGGTNF RT+GGP++ TSYDYDAPIDEYG L Q
Sbjct: 262 QRPVEDLAFAVARFIQKGGSFINYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQ 321
Query: 319 PKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN---------------------------- 350
PKWGHL LHK +K E L G+ T + GN
Sbjct: 322 PKWGHLTNLHKAIKQAETALVAGDPTVQNIGNYEKAYVFRSSSGDCAAFLSNFHTSAAAR 381
Query: 351 -SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEM 409
+ +G Y+LPAWS+S+LPDC+T +NTA V ++ + N AG W+ E
Sbjct: 382 VAFNGRRYDLPAWSISVLPDCRTAVYNTATVTAASS--PAKMNPAGG----FTWQSYGEA 435
Query: 410 INDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSG 468
N + F + L++Q S T D SDYLWY T ++ + L L + S+G
Sbjct: 436 TNSL---DETAFTKDGLVEQLSMTWDKSDYLWYTTYVNIDSGEQFLKSGQWPQLTVYSAG 492
Query: 469 QVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPN 528
+ +VNG Y + + Y + VK+ +G N+IS+LS+ VGL N G+ ++
Sbjct: 493 HSVQVFVNGQYFGNAYGGYDGPKLTYSGYVKMWQGSNKISILSSAVGLPNVGTHYETWNI 552
Query: 529 GIPGPVLLVGRAGDETIIKDLSSHKWTYKV 558
G+ GPV L G + +DLS KWTY+V
Sbjct: 553 GVLGPVTLSGLNEGK---RDLSKQKWTYQV 579
>gi|356532710|ref|XP_003534914.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 650
Score = 528 bits (1360), Expect = e-147, Method: Compositional matrix adjust.
Identities = 286/622 (45%), Positives = 365/622 (58%), Gaps = 53/622 (8%)
Query: 13 LCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGL 72
+ L++ L+ + V++D +AI +DG+R+IL+SGSIHYPRSTP MWPDLI+KAK+GGL
Sbjct: 9 VVLMMLCLWVCGVTASVTYDHKAIVVDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGL 68
Query: 73 DAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPV 132
D I+TYVFWN HEP QY F DL++F+K Q GLYV LRIGPY+CAEWN GGFPV
Sbjct: 69 DVIQTYVFWNGHEPSPGQYYFEDRFDLVKFVKLAQQAGLYVHLRIGPYICAEWNLGGFPV 128
Query: 133 WLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS 192
WL +PGI RT N+ F MQ FT IV + K+ +LF SQGGPIIL+QIENEYG V
Sbjct: 129 WLKYVPGIA-FRTDNEPFKAAMQKFTAKIVSLMKENRLFQSQGGPIILSQIENEYGPVEW 187
Query: 193 DYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKI 241
+ G GK+Y W A+MA LD GVPW+MC++ DAP P+ F PN PK+
Sbjct: 188 EIGAPGKAYTKWAAQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGFYCENFKPNKNTKPKM 247
Query: 242 WTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT 301
WTENWTGW+ +GG P+R AEDLAF+VARF Q GG+F NYYMYHGGTNFGRTSGG ++
Sbjct: 248 WTENWTGWYTDFGGAVPRRPAEDLAFSVARFIQNGGSFVNYYMYHGGTNFGRTSGGLFIA 307
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLT---------------------- 339
TSYDYDAP+DEYG N+PK+ HLR LHK +K E L
Sbjct: 308 TSYDYDAPLDEYGLENEPKYEHLRALHKAIKQSEPALVATDPKVQSLGYNLEAHVFSAPG 367
Query: 340 -----YGNVTNTDYGNSVSGS-SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQ 393
N Y + G+ Y+LP WS+SILPDCKT +NTAKV K+ N
Sbjct: 368 ACAAFIANYDTKSYAKAKFGNGQYDLPPWSISILPDCKTVVYNTAKVGYGWLKKMTPVNS 427
Query: 394 AGNDQAPLQWK-WRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDP 452
A W+ + E + +AL ++ T D SDYLWYMT+ ++ ++
Sbjct: 428 A------FAWQSYNEEPASSSQADSIAAYALWEQVN--VTRDSSDYLWYMTDVNVNANEG 479
Query: 453 ILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSA 512
L + L + S+G VLH ++NG + W G F VKL G N++SLLS
Sbjct: 480 FLKNGQSPLLTVMSAGHVLHVFINGQLAGTVWGGLGNPKLTFSDNVKLRAGNNKLSLLSV 539
Query: 513 TVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAK 572
VGL N G F+ G+ GPV L G +DLS KW+YKVGL G + +
Sbjct: 540 AVGLPNVGVHFETWNAGVLGPVTLKGL---NEGTRDLSRQKWSYKVGLKG-ESLSLHTES 595
Query: 573 AANSERGWSSKNVPLNRRMTWY 594
++S V + +TWY
Sbjct: 596 GSSSVEWIQGSLVAKKQPLTWY 617
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 19/29 (65%), Positives = 25/29 (86%)
Query: 667 WYHVPRSWIKDGVNTLVLFEEFGGNPSQI 695
WYHVPRSW+ G N+LV+FEE+GG+P+ I
Sbjct: 616 WYHVPRSWLSSGGNSLVVFEEWGGDPNGI 644
>gi|222642000|gb|EEE70132.1| hypothetical protein OsJ_30164 [Oryza sativa Japonica Group]
Length = 838
Score = 526 bits (1355), Expect = e-146, Method: Compositional matrix adjust.
Identities = 312/835 (37%), Positives = 435/835 (52%), Gaps = 108/835 (12%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D R++ IDG+R + SG+IHYPRS P MW L+K AK GGL+ IETYVFWN HEP
Sbjct: 36 VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWNGHEPEP 95
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y F G DLIRF+ I+D +Y I+RIGP++ AEWN+GG P WL + I R N+
Sbjct: 96 GKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHII-FRANNE 154
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F EM+ F IV K ++FA QGGPIIL+QIENEYGN+ D G Y+ W A+M
Sbjct: 155 PFKREMEKFVRFIVQKLKDAEMFAPQGGPIILSQIENEYGNIKKDRKVEGDKYLEWAAEM 214
Query: 209 ATSLDIGVPWIMCQESDAPSPMFTPNN------------PNSPKIWTENWTGWFKSWGGK 256
A S IGVPW+MC++S AP + N N P++WTENWT F+++G +
Sbjct: 215 AISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDKNKPRLWTENWTAQFRTFGDQ 274
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHL 316
+R+AED+A+AV RFF GGT NYYMYHGGTNFGRT G Y+ T Y +AP+DEYG
Sbjct: 275 LAQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMC 333
Query: 317 NQPKWGHLRELHKLLKSMEKTLTYG------------------------------NVTNT 346
+PK+GHLR+LH ++KS K +G N T
Sbjct: 334 KEPKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSNNNTGE 393
Query: 347 DYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQ-WKW 405
D G + +P+ SVSIL DCKT +NT +V Q + +R ++ + W+
Sbjct: 394 DGTVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHS---ERSFHTTDETSKNNVWEM 450
Query: 406 RPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
E I F R L T D SDYLWY T+ L+ DD ++I
Sbjct: 451 YSEAIPKF--RKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDIRPVIQIK 508
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
S+ + + N +V + + +FE+P+ L G N I++LS+++G+++ G +
Sbjct: 509 STAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKDSGGELVE 568
Query: 526 VPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKA-ANSERGWSSKN 584
V GI V+ G T DL + W +K L G +DK+ Y K A + + +
Sbjct: 569 VKGGIQDCVV----QGLNTGTLDLQGNGWGHKARLEG-EDKEIYTEKGMAQFQWKPAEND 623
Query: 585 VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTES 644
+P+ TWYK F+ P +DP+V+++ M KG +VNG +GRYW +++
Sbjct: 624 LPI----TWYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWTSFITL-------- 671
Query: 645 CDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGT 704
G+PSQ YH+PR+++K N L++FEE G P I QTV
Sbjct: 672 ---------------AGHPSQSVYHIPRAFLKPKGNLLIIFEEELGKPGGILIQTVRRDD 716
Query: 705 ACGQAHEN-----KTME------------------LTCHGRR-ISEIKYASFGDPQGACG 740
C E+ KT E L C +R I E+ +ASFG+P+GACG
Sbjct: 717 ICVFISEHNPAQIKTWESDGGQIKLIAEDTSTRGTLNCPPKRTIQEVVFASFGNPEGACG 776
Query: 741 AFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
F G+C D ++EK+C+GK+SC + GA T L V+ C
Sbjct: 777 NFTAGTCHTP-DAKAIVEKECLGKESCVLPVVNTVYGADINCPATTATLAVQVRC 830
>gi|147819335|emb|CAN64508.1| hypothetical protein VITISV_004610 [Vitis vinifera]
Length = 766
Score = 526 bits (1355), Expect = e-146, Method: Compositional matrix adjust.
Identities = 333/824 (40%), Positives = 432/824 (52%), Gaps = 139/824 (16%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DGR++ I+G+R++L SGSIHYPRSTP MWP LI KAKEGG+D IETY FWN HEP +
Sbjct: 24 VTYDGRSLIINGQRRLLFSGSIHYPRSTPEMWPSLISKAKEGGIDVIETYAFWNQHEPKQ 83
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QYDF+G LD+++F K +Q QGLY LRIGP++ +EWNYGG P WLH++PGI R+ N+
Sbjct: 84 GQYDFSGRLDIVKFFKEVQAQGLYACLRIGPFIESEWNYGGLPFWLHDVPGII-YRSDNE 142
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQNFTT IV++ K E L+ASQGGPIIL+QIENEY NV + + + G Y+ W AKM
Sbjct: 143 PFKFYMQNFTTKIVNLMKSENLYASQGGPIILSQIENEYKNVEAAFHEKGPPYVRWAAKM 202
Query: 209 ATSLDIGVPWIMCQESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFA 268
A L + + +G R AEDLAF
Sbjct: 203 AVDLQTAM----------------------------------RYYGEDKRGRAAEDLAFQ 228
Query: 269 VARFF-QFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLREL 327
VA F + G+F NYYMYHGGTNFGRTS LT YD AP+DEYG + QPKWGHL+EL
Sbjct: 229 VALFIAKKNGSFINYYMYHGGTNFGRTSSSYVLTAYYD-QAPLDEYGLIRQPKWGHLKEL 287
Query: 328 HKLLKSMEKTLTYGN-------------------------VTNTDYGNSVS----GSSYN 358
H ++K TL G + N D +V+ ++Y
Sbjct: 288 HAVIKLCSDTLLXGVQYNYSLGQLQEAYLFKRPSGQCAAFLVNNDKRRNVTVLFQNTNYE 347
Query: 359 LPAWSVSILPDCKTEEFNTAKVNTQTNVK-VKRPNQAGNDQAPLQWKWRPEMINDFVVRG 417
L A S+SILPDCK FNTAKV+TQ N + V+ G+ + QW E I F G
Sbjct: 348 LAANSISILPDCKKIAFNTAKVSTQFNTRSVQTRATFGSTK---QWSEYREGIPSF---G 401
Query: 418 KGHFALNTLIDQK-STNDVSDYLWYMTNADLKDDDPILSGSSNM--TLRINSSGQVLHAY 474
+ L++ +T D SDYLWY + SSN LR++S VL A+
Sbjct: 402 GTPLKASMLLEHMGTTKDASDYLWYTLR--------FIHNSSNAQPVLRVDSLAHVLLAF 453
Query: 475 VNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPV 534
VNG Y+ S + + V L G N+ISLLS VGL + G + GI
Sbjct: 454 VNGKYIASAHGSHQNGSFSLVNKVPLNSGLNRISLLSVMVGLPDAGPYLEHKVAGIRRVE 513
Query: 535 LLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWY 594
+ D KD S H W Y+VGL G + + Y + + + W +TWY
Sbjct: 514 IQ-----DGGXSKDFSKHPWGYQVGLMG-EKLQIYTSPGSQKVQ-WYGLGSHGRGPLTWY 566
Query: 595 KTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSD 654
KT F+AP NDPVVL MGKG AWVNG ++GRYW +YL
Sbjct: 567 KTLFDAPRGNDPVVLFFGSMGKGEAWVNGQSIGRYWVSYLTPS----------------- 609
Query: 655 KCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACG------- 707
G PSQ WY+VPR+++ N LV+ EE G+P +I+ TV V CG
Sbjct: 610 ------GEPSQTWYNVPRAFLNPKGNLLVVQEEESGDPLKISIGTVSVTNVCGHVTDSHP 663
Query: 708 -------------QAHENKT--MELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEI 751
++H K ++L C IS+I +ASFG P G C ++ GSC +
Sbjct: 664 PPIISWTTSDDGNESHHGKIPKVQLRCPPSSNISKITFASFGTPVGGCESYAIGSCHSP- 722
Query: 752 DVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+ L + EK C+GK CSI S + G C GT K L+V A C
Sbjct: 723 NSLAVAEKACLGKNXCSIPHSLKSFGDDPC-PGTPKALLVAAQC 765
>gi|152013365|sp|Q0IZZ8.2|BGL12_ORYSJ RecName: Full=Beta-galactosidase 12; Short=Lactase 12; Flags:
Precursor
Length = 911
Score = 524 bits (1350), Expect = e-146, Method: Compositional matrix adjust.
Identities = 310/833 (37%), Positives = 433/833 (51%), Gaps = 106/833 (12%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D R++ IDG+R + SG+IHYPRS P MW L+K AK GGL+ IETYVFWN HEP
Sbjct: 36 VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWNGHEPEP 95
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y F G DLIRF+ I+D +Y I+RIGP++ AEWN+GG P WL + I R N+
Sbjct: 96 GKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHII-FRANNE 154
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F EM+ F IV K ++FA QGGPIIL+QIENEYGN+ D G Y+ W A+M
Sbjct: 155 PFKREMEKFVRFIVQKLKDAEMFAPQGGPIILSQIENEYGNIKKDRKVEGDKYLEWAAEM 214
Query: 209 ATSLDIGVPWIMCQESDAPSPMFTPNN------------PNSPKIWTENWTGWFKSWGGK 256
A S IGVPW+MC++S AP + N N P++WTENWT F+++G +
Sbjct: 215 AISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDKNKPRLWTENWTAQFRTFGDQ 274
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHL 316
+R+AED+A+AV RFF GGT NYYMYHGGTNFGRT G Y+ T Y +AP+DEYG
Sbjct: 275 LAQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMC 333
Query: 317 NQPKWGHLRELHKLLKSMEKTLTYG------------------------------NVTNT 346
+PK+GHLR+LH ++KS K +G N T
Sbjct: 334 KEPKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSNNNTGE 393
Query: 347 DYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK-VKRPNQAGNDQAPLQWKW 405
D G + +P+ SVSIL DCKT +NT +V Q + + ++ + W+
Sbjct: 394 DGTVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHSERSFHTTDETSKNNV---WEM 450
Query: 406 RPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
E I F R L T D SDYLWY T+ L+ DD ++I
Sbjct: 451 YSEAIPKF--RKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDIRPVIQIK 508
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
S+ + + N +V + + +FE+P+ L G N I++LS+++G+++ G +
Sbjct: 509 STAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKDSGGELVE 568
Query: 526 VPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNV 585
V GI V+ G T DL + W +K L G +DK+ Y K + ++N
Sbjct: 569 VKGGIQDCVV----QGLNTGTLDLQGNGWGHKARLEG-EDKEIYTEKGMAQFQWKPAEN- 622
Query: 586 PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESC 645
+ +TWYK F+ P +DP+V+++ M KG +VNG +GRYW +++
Sbjct: 623 --DLPITWYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWTSFITL--------- 671
Query: 646 DYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTA 705
G+PSQ YH+PR+++K N L++FEE G P I QTV
Sbjct: 672 --------------AGHPSQSVYHIPRAFLKPKGNLLIIFEEELGKPGGILIQTVRRDDI 717
Query: 706 CGQAHEN-----KTME------------------LTCHGRR-ISEIKYASFGDPQGACGA 741
C E+ KT E L C +R I E+ +ASFG+P+GACG
Sbjct: 718 CVFISEHNPAQIKTWESDGGQIKLIAEDTSTRGTLNCPPKRTIQEVVFASFGNPEGACGN 777
Query: 742 FKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEAL 794
F G+C D ++EK+C+GK+SC + GA T L V+ L
Sbjct: 778 FTAGTCHTP-DAKAIVEKECLGKESCVLPVVNTVYGADINCPATTATLAVQLL 829
>gi|6686900|emb|CAB64750.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 887
Score = 524 bits (1349), Expect = e-146, Method: Compositional matrix adjust.
Identities = 309/873 (35%), Positives = 462/873 (52%), Gaps = 125/873 (14%)
Query: 4 LKHCSRAILLCLILQTLFNLSLAYR-------VSHDGRAITIDGERKILLSGSIHYPRST 56
+K +R ++ L++ +L + + ++ V++DG ++ I+G+R++ SGS+HYPRST
Sbjct: 9 MKSRTRYLIAILLVISLCSKASSHDDEKKKKGVTYDGTSLIINGKRELFFSGSVHYPRST 68
Query: 57 PGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILR 116
P MWP +I KA+ GGL+ I+TYVFWN HEP + +YDF G DL++FIK I ++GLYV LR
Sbjct: 69 PDMWPSIIDKARIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHEKGLYVTLR 128
Query: 117 IGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGG 176
+GP++ AEWN+GG P WL +P + RT N+ F + + I+ M K+EKLFASQGG
Sbjct: 129 LGPFIQAEWNHGGLPYWLREVPDV-YFRTNNEPFKEHTERYVRKILGMMKEEKLFASQGG 187
Query: 177 PIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFT---- 232
PIIL QIENEY V Y + G+ YI W A + S+++G+PW+MC+++DAP +
Sbjct: 188 PIILGQIENEYNAVQLAYKENGEKYIKWAANLVESMNLGIPWVMCKQNDAPGNLINACNG 247
Query: 233 ---------PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYY 283
PN + P +WTENWT F+ +G +RTAED+AF+VAR+F G+ NYY
Sbjct: 248 RHCGDTFPGPNRHDKPSLWTENWTTQFRVFGDPPTQRTAEDIAFSVARYFSKNGSHVNYY 307
Query: 284 MYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNV 343
MYHGGTNFGRTS ++TT Y DAP+DE+G PK+GHL+ +H+ L+ +K L +G +
Sbjct: 308 MYHGGTNFGRTSAH-FVTTRYYDDAPLDEFGLEKAPKYGHLKHVHRALRLCKKALFWGQL 366
Query: 344 ----------------------------TNTDYGNSV--SGSSYNLPAWSVSILPDCKTE 373
NT N++ G Y LP+ S+SILPDCKT
Sbjct: 367 RAQTLGPDTEVRYYEQPGTKVCAAFLSNNNTRDTNTIKFKGQDYVLPSRSISILPDCKTV 426
Query: 374 EFNTAKVNTQTNVK--VKRPNQAGNDQAPLQWKWRPEMIN-DFVVRGKGHFALNTLIDQK 430
+NTA++ Q + + VK + + + + P +++ D ++ G+ ++
Sbjct: 427 VYNTAQIVAQHSWRDFVKSEKTSKGLKFEMFSENIPSLLDGDSLIPGELYYL-------- 478
Query: 431 STNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGAS 490
T D +DY WY T+ + +DD LR+ S G L YVNG Y ++
Sbjct: 479 -TKDKTDYAWYTTSVKIDEDDFPDQKGLKTILRVASLGHALIVYVNGEYAGKAHGRHEMK 537
Query: 491 NDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVG-RAGDETIIKDL 549
+ F +PV G N+IS+L GL + GS + G P + ++G ++G + ++
Sbjct: 538 SFEFAKPVNFKTGDNRISILGVLTGLPDSGSYMEHRFAG-PRAISIIGLKSGTRDLTEN- 595
Query: 550 SSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVL 609
++W + GL G + K+ Y + + + W + +TWYKT FE P + V +
Sbjct: 596 --NEWGHLAGLEG-EKKEVYTEEGSKKVK-WEKDGE--RKPLTWYKTYFETPEGVNAVAI 649
Query: 610 NLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYH 669
++GMGKG WVNG +GRYW ++L+ G P+Q YH
Sbjct: 650 RMKGMGKGLIWVNGIGVGRYWMSFLSP-----------------------LGEPTQTEYH 686
Query: 670 VPRSWIK--DGVNTLVLFEEFGG-NPSQINFQTVVVGTACGQAHEN-------------- 712
+PRS++K N LV+ EE G I+F V T C E+
Sbjct: 687 IPRSFMKGEKKKNMLVILEEEPGVKLESIDFVLVNRDTICSNVGEDYPVSVKSWKREGPK 746
Query: 713 -----KTMELTCHGR-----RISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCV 762
K M L R ++ E+++ASFGDP G CG F G C A ++EK+C+
Sbjct: 747 IVSRSKDMRLKAVMRCPPEKQMVEVQFASFGDPTGTCGNFTMGKCSAS-KSKEVVEKECL 805
Query: 763 GKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G+ CSI + G C VK L V+ C
Sbjct: 806 GRNYCSIVVARETFGDKGCPE-IVKTLAVQVKC 837
>gi|357449773|ref|XP_003595163.1| Beta-galactosidase [Medicago truncatula]
gi|355484211|gb|AES65414.1| Beta-galactosidase [Medicago truncatula]
Length = 607
Score = 523 bits (1348), Expect = e-145, Method: Compositional matrix adjust.
Identities = 286/588 (48%), Positives = 362/588 (61%), Gaps = 55/588 (9%)
Query: 14 CLILQTLFNLS-LAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGL 72
C IL F + + V++D +AI I+G+R+IL+SGSIHYPRSTP MWPDLI+KAK+GG+
Sbjct: 12 CYILFLCFFVCYVTASVTYDHKAIVINGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGV 71
Query: 73 DAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPV 132
D IETYVFWN HEP + +Y F DL++FIK +Q GLYV LRIGPYVCAEWN+GGFPV
Sbjct: 72 DVIETYVFWNGHEPSQGKYYFEDRFDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPV 131
Query: 133 WLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS 192
WL +PG+ RT N+ F MQ FTT IV + K E LF SQGGPIIL+QIENEYG V
Sbjct: 132 WLKYVPGV-AFRTDNEPFKAAMQKFTTKIVSIMKSENLFQSQGGPIILSQIENEYGPVEW 190
Query: 193 DYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKI 241
+ G GKSY W ++MA L+ GVPW+MC++ DAP P+ F+PN PK+
Sbjct: 191 EIGAPGKSYTKWFSQMAVGLNTGVPWVMCKQEDAPDPIIDTCNGYYCENFSPNKNYKPKM 250
Query: 242 WTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT 301
WTENWTGW+ +G P R AEDLAF+VARF Q G++ NYYMYHGGTNFGRTS G ++
Sbjct: 251 WTENWTGWYTDFGTAVPYRPAEDLAFSVARFVQNRGSYVNYYMYHGGTNFGRTSSGLFIA 310
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVT-------------NTDY 348
TSYDYDAPIDEYG +++PKWGHLR+LHK +K E L + T T +
Sbjct: 311 TSYDYDAPIDEYGLISEPKWGHLRDLHKAIKQCESALVSVDPTVSWPGKNLEVHLYKTSF 370
Query: 349 G-------NSVSGS---------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRP- 391
G N +GS Y+LP WS+SILPDCKTE FNTAKV + P
Sbjct: 371 GACAAFLANYDTGSWAKVAFGNGHYDLPPWSISILPDCKTEVFNTAKVRAPRVHRSMTPA 430
Query: 392 NQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDD 450
N A N Q+ + +P + G + N L++Q S T D SDYLWYMT+ ++ +
Sbjct: 431 NSAFNWQS---YNEQPAFSGE-----SGSWTANGLLEQLSQTWDKSDYLWYMTDVNISPN 482
Query: 451 DPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLL 510
+ + N L S+G VLH ++NG + + + F VKL G N+ISLL
Sbjct: 483 EGFIKNGQNPVLTAMSAGHVLHVFINGQFWGTAYGSLDNPKLTFSNSVKLRVGNNKISLL 542
Query: 511 SATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKV 558
S VGL N G ++ G+ GPV L G +DLS KW+YKV
Sbjct: 543 SVAVGLSNVGVHYEKWNVGVLGPVTL---KGLNEGTRDLSKQKWSYKV 587
>gi|18418558|ref|NP_567973.1| beta-galactosidase 11 [Arabidopsis thaliana]
gi|75202765|sp|Q9SCV1.1|BGL11_ARATH RecName: Full=Beta-galactosidase 11; Short=Lactase 11; Flags:
Precursor
gi|6686894|emb|CAB64747.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332661046|gb|AEE86446.1| beta-galactosidase 11 [Arabidopsis thaliana]
Length = 845
Score = 523 bits (1347), Expect = e-145, Method: Compositional matrix adjust.
Identities = 309/839 (36%), Positives = 442/839 (52%), Gaps = 112/839 (13%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DG ++ IDG+R++L SGSIHYPRSTP MWP +IK+AK+GGL+ I+TYVFWN HEP +
Sbjct: 41 VTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHEPQQ 100
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+++F+G DL++FIK IQ G+YV LR+GP++ AEW +GG P WL +PGI RT NK
Sbjct: 101 GKFNFSGRADLVKFIKLIQKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIF-FRTDNK 159
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F + + +I+D K+E+LFASQGGPIIL QIENEY V Y G +YI W + +
Sbjct: 160 QFKEHTERYVRMILDKMKEERLFASQGGPIILGQIENEYSAVQRAYKQDGLNYIKWASNL 219
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGG 255
S+ +G+PW+MC+++DAP PM PN N P +WTENWT F+ +G
Sbjct: 220 VDSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNRENKPSLWTENWTTQFRVFGD 279
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
+R+ ED+A++VARFF GT NYYMYHGGTNFGRTS Y+TT Y DAP+DEYG
Sbjct: 280 PPTQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYDDAPLDEYGL 338
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYGNV----------------------------TNTD 347
+PK+GHL+ LH L +K L +G NT+
Sbjct: 339 EKEPKYGHLKHLHNALNLCKKPLLWGQPKTEKPGKDTEIRYYEQPGTKTCAAFLANNNTE 398
Query: 348 YGNSVS--GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKW 405
++ G Y + S+SILPDCKT +NTA++ +Q + ++ N + +K
Sbjct: 399 AAETIKFKGREYVIAPRSISILPDCKTVVYNTAQIVSQHTSRNFMKSKKANKK--FDFKV 456
Query: 406 RPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADL-KDDDPILSGSSNMTLRI 464
E + + G + + T D +DY WY T+ + K+ P G +RI
Sbjct: 457 FTETLPS-KLEGNSYIPVELY---GLTKDKTDYGWYTTSFKVHKNHLPTKKGVKTF-VRI 511
Query: 465 NSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFD 524
S G LHA++NG Y+ S + + +F++ V L G+N + +L G + GS +
Sbjct: 512 ASLGHALHAWLNGEYLGSGHGSHEEKSFVFQKQVTLKAGENHLVMLGVLTGFPDSGSYME 571
Query: 525 MVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
G G +L +G + + S KW K+G+ G +K + + + W K
Sbjct: 572 HRYTGPRGISILGLTSGTLDLTE---SSKWGNKIGMEG--EKLGIHTEEGLKKVEWK-KF 625
Query: 585 VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTES 644
+TWY+T F+AP + + GMGKG WVNG +GRYW ++L+
Sbjct: 626 TGKAPGLTWYQTYFDAPESVSAATIRMHGMGKGLIWVNGEGVGRYWQSFLSP-------- 677
Query: 645 CDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGG-NPSQINFQTVVVG 703
G P+QI YH+PRS++K N LV+FEE P ++F V
Sbjct: 678 ---------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELMDFAIVNRD 722
Query: 704 TACGQAHENK-----------------------TMELTCHG-RRISEIKYASFGDPQGAC 739
T C EN T L C G ++I+ +++ASFG+P G C
Sbjct: 723 TVCSYVGENYTPSVRHWTRKKDQVQAITDNVSLTATLKCSGTKKIAAVEFASFGNPIGVC 782
Query: 740 GAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANL---GATSCAAGTVKRLVVEALC 795
G F G+C A + +IEK C+GK C I +++ SC VK L V+ C
Sbjct: 783 GNFTLGTCNAPVSK-QVIEKHCLGKAECVIPVNKSTFQQDKKDSC-KNVVKMLAVQVKC 839
>gi|242081931|ref|XP_002445734.1| hypothetical protein SORBIDRAFT_07g024870 [Sorghum bicolor]
gi|241942084|gb|EES15229.1| hypothetical protein SORBIDRAFT_07g024870 [Sorghum bicolor]
Length = 844
Score = 522 bits (1345), Expect = e-145, Method: Compositional matrix adjust.
Identities = 300/838 (35%), Positives = 435/838 (51%), Gaps = 112/838 (13%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
+S+D R++ +DG R+I SGSIHYPRS P MWP+LI KAKEGGL+ IETYVFWN HEP +
Sbjct: 38 ISYDRRSLMVDGRREIFFSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYVFWNIHEPEK 97
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Q++F G D+++F K IQ+ ++ ++R+GP++ AEWN+GG P WL +P I RT N+
Sbjct: 98 GQFNFEGRYDMVKFFKLIQEHDMFAMVRLGPFIQAEWNHGGLPYWLREIPDIV-FRTNNE 156
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
+ M+ F +++ K LFASQGGPIILAQIENEY ++ + + + G YI+W A+M
Sbjct: 157 PYKMHMETFVKIVIKRLKDANLFASQGGPIILAQIENEYQHLEAAFKEEGTKYIHWAAQM 216
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGG 255
A +IG+PWIMC+++ AP + P N P +WTENWT ++ +G
Sbjct: 217 AIGTNIGIPWIMCKQTKAPGDVIPTCNGRNCGDTWPGPMNKTMPLLWTENWTAQYRVFGD 276
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
+R+AED+AFAVARFF GGT NYYMYHGGTNFGRT+ + YD +AP+DE+G
Sbjct: 277 PPSQRSAEDIAFAVARFFSVGGTMTNYYMYHGGTNFGRTAAAFVMPKYYD-EAPLDEFGL 335
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS---------------------- 353
+PKWGHLR+LH LK +K L +G + G +
Sbjct: 336 YKEPKWGHLRDLHLALKLCKKALLWGKPSTEKLGKQLEARVFEIPEQKVCVAFLSNHNTK 395
Query: 354 --------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKW 405
G Y +P S+SIL DCKT F T VN Q N + N Q +
Sbjct: 396 DDVTLTFRGQPYFVPRHSISILADCKTVVFGTQHVNAQHNQRTFHFADQTNQNNVWQM-F 454
Query: 406 RPEMINDFV---VRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTL 462
E + + +R + L L T D +DY+WY ++ L+ DD + +
Sbjct: 455 DEEKVPKYKQAKIRTRKAADLYNL-----TKDKTDYVWYTSSFKLEPDDMPIRRDIKTVV 509
Query: 463 RINSSGQVLHAYVNGNYVD-SQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGS 521
+NS G A+VN + TK + L E+P++L +G N +++L++++G+ + G+
Sbjct: 510 EVNSHGHASVAFVNNKFAGCGHGTKMNKAFTL-EKPMELKKGVNHVAVLASSMGMMDSGA 568
Query: 522 KFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWS 581
+ G+ + AG DL+++ W + VGL G + K+ Y K S
Sbjct: 569 YLEHRLAGVDRVQITGLNAG----TLDLTNNGWGHIVGLVG-EQKEIYTEKGMASVTWKP 623
Query: 582 SKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCS 641
+ N ++ +TWYK F+ P DP+VL++ MGKG +VNG +GRYW +Y
Sbjct: 624 AVN---DKPLTWYKRHFDMPSGEDPIVLDMSTMGKGMMYVNGQGIGRYWMSY-------- 672
Query: 642 TESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVV 701
+ G PSQ YH+PRS+++ N LVLFEE G P I TV
Sbjct: 673 ---------------KHALGRPSQQLYHIPRSFLRPKDNVLVLFEEEFGRPDAIMILTVK 717
Query: 702 VGTACGQAHENKTME-----------------------LTCHGRR-ISEIKYASFGDPQG 737
C E LTC ++ I ++ +AS+G+P G
Sbjct: 718 RDNICTYISERNPAHIKSWERKDSQITATADDLKARATLTCPPKKLIQQVVFASYGNPVG 777
Query: 738 ACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
CG + GSC ++EK C+GK++C++ S G GT L V+A C
Sbjct: 778 ICGNYTIGSCHTP-RAKEVVEKSCLGKRTCTLPVSADVYGGDVNCPGTTATLAVQAKC 834
>gi|152013366|sp|Q9SCU8.2|BGL14_ARATH RecName: Full=Beta-galactosidase 14; Short=Lactase 14; Flags:
Precursor
Length = 887
Score = 522 bits (1344), Expect = e-145, Method: Compositional matrix adjust.
Identities = 311/875 (35%), Positives = 462/875 (52%), Gaps = 129/875 (14%)
Query: 4 LKHCSRAILLCLILQTLFNLSLAYR-------VSHDGRAITIDGERKILLSGSIHYPRST 56
+K +R ++ L++ +L + + ++ V++DG ++ I+G+R++L SGS+HYPRST
Sbjct: 9 MKSRTRYLIAILLVISLCSKASSHDDEKKKKGVTYDGTSLIINGKRELLFSGSVHYPRST 68
Query: 57 PGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILR 116
P MWP +I KA+ GGL+ I+TYVFWN HEP + +YDF G DL++FIK I ++GLYV LR
Sbjct: 69 PHMWPSIIDKARIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHEKGLYVTLR 128
Query: 117 IGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGG 176
+GP++ AEWN+GG P WL +P + RT N+ F + + I+ M K+EKLFASQGG
Sbjct: 129 LGPFIQAEWNHGGLPYWLREVPDV-YFRTNNEPFKEHTERYVRKILGMMKEEKLFASQGG 187
Query: 177 PIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFT---- 232
PIIL QIENEY V Y + G+ YI W A + S+++G+PW+MC+++DAP +
Sbjct: 188 PIILGQIENEYNAVQLAYKENGEKYIKWAANLVESMNLGIPWVMCKQNDAPGNLINACNG 247
Query: 233 ---------PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYY 283
PN + P +WTENWT F+ +G +RT ED+AF+VAR+F G+ NYY
Sbjct: 248 RHCGDTFPGPNRHDKPSLWTENWTTQFRVFGDPPTQRTVEDIAFSVARYFSKNGSHVNYY 307
Query: 284 MYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNV 343
MYHGGTNFGRTS ++TT Y DAP+DE+G PK+GHL+ +H+ L+ +K L +G +
Sbjct: 308 MYHGGTNFGRTSAH-FVTTRYYDDAPLDEFGLEKAPKYGHLKHVHRALRLCKKALFWGQL 366
Query: 344 ----------------------------TNTDYGNSV--SGSSYNLPAWSVSILPDCKTE 373
NT N++ G Y LP+ S+SILPDCKT
Sbjct: 367 RAQTLGPDTEVRYYEQPGTKVCAAFLSNNNTRDTNTIKFKGQDYVLPSRSISILPDCKTV 426
Query: 374 EFNTAKVNTQTNVK--VKRPNQAGNDQAPLQWKWRPEMIN-DFVVRGKGHFALNTLIDQK 430
+NTA++ Q + + VK + + + + P +++ D ++ G+ ++
Sbjct: 427 VYNTAQIVAQHSWRDFVKSEKTSKGLKFEMFSENIPSLLDGDSLIPGELYYL-------- 478
Query: 431 STNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGAS 490
T D +DY WY T+ + +DD LR+ S G L YVNG Y ++
Sbjct: 479 -TKDKTDYAWYTTSVKIDEDDFPDQKGLKTILRVASLGHALIVYVNGEYAGKAHGRHEMK 537
Query: 491 NDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVG-RAGDETIIKDL 549
+ F +PV G N+IS+L GL + GS + G P + ++G ++G + ++
Sbjct: 538 SFEFAKPVNFKTGDNRISILGVLTGLPDSGSYMEHRFAG-PRAISIIGLKSGTRDLTEN- 595
Query: 550 SSHKWTYKVGLYGLDDKKFYNAKAANSERGWSS--KNVPLNRRMTWYKTTFEAPLENDPV 607
++W + GL G + K+ Y + + + W K PL TWYKT FE P + V
Sbjct: 596 --NEWGHLAGLEG-EKKEVYTEEGSKKVK-WEKDGKRKPL----TWYKTYFETPEGVNAV 647
Query: 608 VLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIW 667
+ ++ MGKG WVNG +GRYW ++L+ G P+Q
Sbjct: 648 AIRMKAMGKGLIWVNGIGVGRYWMSFLSP-----------------------LGEPTQTE 684
Query: 668 YHVPRSWIK--DGVNTLVLFEEFGG-NPSQINFQTVVVGTACGQAHEN------------ 712
YH+PRS++K N LV+ EE G I+F V T C E+
Sbjct: 685 YHIPRSFMKGEKKKNMLVILEEEPGVKLESIDFVLVNRDTICSNVGEDYPVSVKSWKREG 744
Query: 713 -------KTMELTCHGR-----RISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQ 760
K M L R ++ E+++ASFGDP G CG F G C A ++EK+
Sbjct: 745 PKIVSRSKDMRLKAVMRCPPEKQMVEVQFASFGDPTGTCGNFTMGKCSAS-KSKEVVEKE 803
Query: 761 CVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
C+G+ CSI + G C VK L V+ C
Sbjct: 804 CLGRNYCSIVVARETFGDKGCPE-IVKTLAVQVKC 837
>gi|413949218|gb|AFW81867.1| hypothetical protein ZEAMMB73_495459 [Zea mays]
Length = 759
Score = 521 bits (1342), Expect = e-145, Method: Compositional matrix adjust.
Identities = 316/808 (39%), Positives = 427/808 (52%), Gaps = 110/808 (13%)
Query: 28 RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPL 87
V+++ RA+ +DG R++L +G +HYPRSTP MWP LI KAKEGGLD I+TYVFWN HEP+
Sbjct: 17 EVTYEQRALVLDGARRMLFAGEMHYPRSTPEMWPKLIAKAKEGGLDVIQTYVFWNVHEPI 76
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
+ QY+F G DL+RFIK IQ QGLYV LRIGP++ +EW YGGFP WLH++P I R+ N
Sbjct: 77 QGQYNFEGRYDLVRFIKEIQAQGLYVSLRIGPFIESEWKYGGFPFWLHDVPNI-TFRSDN 135
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
+ F MQ F T IV+M K E L+ QGGPII +QIENEY V +G +G+ Y++W A
Sbjct: 136 EPFKQHMQRFVTDIVNMMKHEGLYYPQGGPIITSQIENEYQMVEPAFGSSGQRYVSWAAA 195
Query: 208 MATSLDIGVPWIMCQESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAF 267
MA L GVPW MC+++DAP P+ ++ P + +N + + +G R+ +D+ F
Sbjct: 196 MAVDLQTGVPWTMCKQNDAPDPVVGIHSYTIP-VNFQNDSRNYLIYGNDTKLRSPQDITF 254
Query: 268 AVARFF-QFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRE 326
AVA F + G++ +YYMYHGGTNFGR + Y+TTSY AP+DEYG + QP WGHLRE
Sbjct: 255 AVALFIARKNGSYVSYYMYHGGTNFGRFASS-YVTTSYYDGAPLDEYGLIWQPTWGHLRE 313
Query: 327 LHKLLKSMEKTLTYGNVTNTDYGNSVSGS----------------------------SYN 358
LH +K + L +G +N G S
Sbjct: 314 LHAAVKQSSEPLLFGTYSNLSIGQEQEAHIFETETQCVAFLVNFDQHHISEVVFRNISLE 373
Query: 359 LPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGK 418
L S+SIL DCK F TAKVN Q + Q+ +D + WK E I V K
Sbjct: 374 LAPKSISILLDCKQVVFETAKVNAQHGSRTAEEVQSFSDIS--TWKAFKEPIPQDV--SK 429
Query: 419 GHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNG 477
++ N L + ST D +DYLWY+ L N+ RI+ S H
Sbjct: 430 SAYSGNRLFEHLSTTKDATDYLWYIVGLFL-----------NILGRIHGS----HG---- 470
Query: 478 NYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLV 537
G +N +F + L G N ISLLSA VG + G+ + GI +
Sbjct: 471 ----------GPANIIFSTNISLQEGPNTISLLSAMVGSPDSGAHMERRVFGIRKVSIQQ 520
Query: 538 GRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTT 597
G+ + + +L W Y+VGL+G + Y + +E W++ + +TWYKTT
Sbjct: 521 GQEPENLLNNEL----WGYQVGLFG-ERNNIYTQDSKITE--WTTIDNLTYSPLTWYKTT 573
Query: 598 FEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCA 657
F P+ ND V LNL GMGKG WVNG ++GRYW ++ A
Sbjct: 574 FSTPVGNDAVTLNLTGMGKGEVWVNGESIGRYWVSFKAP--------------------- 612
Query: 658 YNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHE------ 711
GNPSQ YH+PR ++ NTLVLFEE GGNP I T+ V CG +E
Sbjct: 613 --SGNPSQSLYHIPREFLNPQDNTLVLFEEMGGNPQLITVNTMSVSRVCGNVNELSAPSL 670
Query: 712 -----NKTMELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKK 765
++L C G+ IS I++AS+G P G C F G C A ++++ C+GK
Sbjct: 671 QYKDKEPAVDLWCPEGKHISAIEFASYGGPTGDCKKFGFGRCHAG-SSESVVKQACLGKS 729
Query: 766 SCSIEASEANLGATSCAAGTVKRLVVEA 793
CS+ + G C G K L+V A
Sbjct: 730 GCSVPVTPIKFGGDPC-PGIQKSLLVVA 756
>gi|414888322|tpg|DAA64336.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
Length = 822
Score = 521 bits (1341), Expect = e-145, Method: Compositional matrix adjust.
Identities = 309/836 (36%), Positives = 422/836 (50%), Gaps = 125/836 (14%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DGR++ IDG+R + SG+IHYPRS P +WP LI++AKEGGL+ IETY+FWNAHEP
Sbjct: 36 VTYDGRSLMIDGKRDLFFSGAIHYPRSPPEVWPKLIERAKEGGLNTIETYIFWNAHEPEP 95
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y+F G DLI+++K IQ+ +Y I+RIGP++ AEWN+GG P WL + I R N
Sbjct: 96 GKYNFEGRFDLIKYLKMIQEHDMYAIVRIGPFIQAEWNHGGLPYWLREIDHII-FRANND 154
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
+ EM+ F IV K +LFASQGGPIIL QIENEYGN+ D+ G Y+ W A+M
Sbjct: 155 PYKKEMEKFVRFIVQKLKDAELFASQGGPIILTQIENEYGNIKKDHATDGDKYLEWAAQM 214
Query: 209 ATSLDIGVPWIMCQESDAPSPMF------------TPNNPNSPKIWTENWTGWFKSWGGK 256
A S GVPWIMC++S AP + T + N P +WTENWT F+++G +
Sbjct: 215 ALSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWTLRDKNKPMLWTENWTQQFRAYGDQ 274
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHL 316
R+AED+A+AV RFF GG+ NYYMYHGGTNFGRT G Y+ T Y +AP+DEYG
Sbjct: 275 VAMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMY 333
Query: 317 NQPKWGHLRELHKLLKSMEKTLTYG------------------------------NVTNT 346
+PK+GHLR+LH +++S +K G N T
Sbjct: 334 KEPKFGHLRDLHNVIRSYQKAFLLGKHSSEILGHGYEAHIFELPEENLCLSFLSNNNTGE 393
Query: 347 DYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
D G + +P+ SVSIL CK +NT +V Q N + ++ + QW+
Sbjct: 394 DGTVIFRGEKHYVPSRSVSILAGCKNVVYNTKRVFVQHNERSYHTSEVTSKNN--QWEMY 451
Query: 407 PEMI---NDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLR 463
E I D VR K L T D SDYLWY T+ L+ DD L+
Sbjct: 452 SEKIPKYRDTKVRMK-----EPLEQFNQTKDASDYLWYTTSFRLESDDLPFRNDIRPVLQ 506
Query: 464 INSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF 523
+ SS + + N +V +FE+PV L G N + LLS+T+G+++ G +
Sbjct: 507 VKSSAHSMMGFANDAFVGCARGSKQVKGFMFEKPVDLKVGVNHVVLLSSTMGMKDSGGEL 566
Query: 524 DMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSK 583
V +GI ++ G T DL + W +K L G +DK+ Y+ K + ++
Sbjct: 567 AEVKSGIQECLI----QGLNTGTLDLQVNGWGHKAALEG-EDKEIYSEKGVGKVQWKPAE 621
Query: 584 NVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTE 643
N R TWYK F+ P +DPVVL++ M KG +VNG +GRYW +Y
Sbjct: 622 N---GRAATWYKRYFDEPDGDDPVVLDMSSMDKGMIFVNGEGVGRYWVSYRTL------- 671
Query: 644 SCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVG 703
G PSQ YH+PR ++K N LV+FEE G P I QTV
Sbjct: 672 ----------------AGTPSQALYHIPRPFLKSKDNLLVVFEEEMGKPDGILVQTVTRD 715
Query: 704 TACGQAHENKTMELTC--------------HGRR----------ISEIKYASFGDPQGAC 739
C E+ ++ H RR I E+ +ASFG+P+G C
Sbjct: 716 DICLFISEHNPGQIKTWDTDGDKIKLIAEDHSRRGTLMCPPEKTIQEVVFASFGNPEGMC 775
Query: 740 GAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G F +C+GK SC + GA T L V+ C
Sbjct: 776 GNF----------------TECLGKPSCMLPVDHTVYGADINCQSTTATLGVQVRC 815
>gi|22328945|ref|NP_194344.2| beta-galactosidase 12 [Arabidopsis thaliana]
gi|20466292|gb|AAM20463.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|23198118|gb|AAN15586.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332659763|gb|AEE85163.1| beta-galactosidase 12 [Arabidopsis thaliana]
Length = 636
Score = 520 bits (1339), Expect = e-144, Method: Compositional matrix adjust.
Identities = 285/628 (45%), Positives = 377/628 (60%), Gaps = 58/628 (9%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
ILL ++ + S+ V++D +A+ I+G+R+ILLSGSIHYPRSTP MWPDLI+KAK+G
Sbjct: 11 ILLGILCCSSLICSVKAIVTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKDG 70
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GLD I+TYVFWN HEP QY F DL++FIK +Q GLYV LRIGPYVCAEWN+GGF
Sbjct: 71 GLDVIQTYVFWNGHEPSPGQYYFEDRYDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGF 130
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
PVWL +PG+ RT N+ F MQ FT IV M K+EKLF +QGGPIIL+QIENEYG +
Sbjct: 131 PVWLKYVPGM-VFRTDNEPFKAAMQKFTEKIVRMMKEEKLFETQGGPIILSQIENEYGPI 189
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSP 239
+ G GK+Y W A+MA L GVPWIMC++ DAP+ + F PN+ N P
Sbjct: 190 EWEIGAPGKAYTKWVAEMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNSDNKP 249
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
K+WTENWTGWF +GG P R AED+A +VARF Q GG+F NYYMYHGGTNF RT+ G +
Sbjct: 250 KMWTENWTGWFTEFGGAVPYRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDRTA-GEF 308
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNS-------- 351
+ TSYDYDAP+DEYG +PK+ HL+ LHK++K E L + T T G+
Sbjct: 309 IATSYDYDAPLDEYGLPREPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQEAHVFKS 368
Query: 352 --------------------VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQT-NVKVKR 390
GS+Y+LP WSVSILPDCKTE +NTAKV T + ++K+
Sbjct: 369 KSSCAAFLSNYNTSSAARVLFGGSTYDLPPWSVSILPDCKTEYYNTAKVRTSSIHMKMVP 428
Query: 391 PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKD 449
N P W E I G F+ + L++Q S T D +DY WY+T+ +
Sbjct: 429 TN------TPFSWGSYNEEIPS--ANDNGTFSQDGLVEQISITRDKTDYFWYLTDITISP 480
Query: 450 DDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISL 509
D+ L+G + L I S+G LH +VNG + + F + +KL G N+++L
Sbjct: 481 DEKFLTGEDPL-LTIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVNKLAL 539
Query: 510 LSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFY 569
LS GL N G ++ G+ GPV L G + D++ KW+YK+G G +
Sbjct: 540 LSTAAGLPNVGVHYETWNTGVLGPVTL---NGVNSGTWDMTKWKWSYKIGTKG--EALSV 594
Query: 570 NAKAANSERGWSSKNVPLNRR-MTWYKT 596
+ A +S W ++ ++ +TWYK
Sbjct: 595 HTLAGSSTVEWKEGSLVAKKQPLTWYKV 622
>gi|414878435|tpg|DAA55566.1| TPA: hypothetical protein ZEAMMB73_938277 [Zea mays]
Length = 774
Score = 516 bits (1329), Expect = e-143, Method: Compositional matrix adjust.
Identities = 306/760 (40%), Positives = 410/760 (53%), Gaps = 110/760 (14%)
Query: 129 GFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYG 188
GFPVWL ++PGIE RT N+ + EMQ F T IVD+ K+EKL++ QGGPIIL QIENEYG
Sbjct: 19 GFPVWLRDVPGIE-FRTDNEPYKAEMQIFVTKIVDIMKEEKLYSWQGGPIILQQIENEYG 77
Query: 189 NVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPN 237
N+ YG AGK Y+ W A+MA +LD GVPW+MC+++DAP + F PN+ N
Sbjct: 78 NIQGHYGQAGKRYMLWAAQMALALDTGVPWVMCRQTDAPEQILNTCNAFYCDGFKPNSYN 137
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
P IWTE+W GW+ WG P R A+D AFAVARF+Q GG+ QNYYMY GGTNF RT+GG
Sbjct: 138 KPTIWTEDWDGWYADWGESLPHRPAQDSAFAVARFYQRGGSLQNYYMYFGGTNFERTAGG 197
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLT------------------ 339
P TSYDYDAPIDEYG L QPKWGHL++LH +K E LT
Sbjct: 198 PLQITSYDYDAPIDEYGILRQPKWGHLKDLHAAIKLCESALTAVDGSPHYVKLGPMQEAH 257
Query: 340 -----------------------YGNVTNTDYGNS-VSGSSYNLPAWSVSILPDCKTEEF 375
N+ Y + + G SY+LP WSVSILPDC+T F
Sbjct: 258 VYSSENVHTNGSISGNSQFCSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCETVAF 317
Query: 376 NTAKVNTQT---NVKVKRPNQAGNDQAPL-----------QWKWRPEMINDFVVRGKGHF 421
NTA+V TQT NV+ P+ + + + W E + + G+G F
Sbjct: 318 NTARVGTQTSFFNVESGSPSYSSRHKPRILSLIGVPYLSTTWWTFKEPVG---IWGEGIF 374
Query: 422 ALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNM--TLRINSSGQVLHAYVNGN 478
+++ T D+SDYL Y T ++ ++D + S +L I+ V +VNG
Sbjct: 375 TAQGILEHLNVTKDISDYLSYTTRVNISEEDVLYWNSKGFLPSLTIDQIRDVARVFVNGK 434
Query: 479 YVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVG 538
S+ + + N +P++L +G N+++LLS VGLQNYG+ + G G V L G
Sbjct: 435 LAGSKVGHWVSLN----QPLQLVQGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVKLTG 490
Query: 539 RAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTF 598
+ + DL++ WTY++GL G + + Y+ + S S +N TW+KT F
Sbjct: 491 LSNGDI---DLTNSLWTYQIGLKG-EFSRIYSPEYQGSAEWSSMQNDDTVSPFTWFKTMF 546
Query: 599 EAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAY 658
+AP N PV ++L MGKG AWVNG+ +GRYW + +A E GC + SC+Y G Y KC
Sbjct: 547 DAPEGNGPVTIDLGSMGKGQAWVNGHLIGRYW-SLVAPESGCPS-SCNYAGTYSDSKCRS 604
Query: 659 NCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHEN------ 712
NCG +Q WYH+PR W+++ N LVLFEE GG+PSQI+ + T C + E
Sbjct: 605 NCGIATQSWYHIPREWLQESGNLLVLFEETGGDPSQISLEVHYTKTICSKISETYYPPLS 664
Query: 713 ----------------KTMELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLP 755
+ L C G IS+I +AS+G P G C F G+C A L
Sbjct: 665 AWSRAANGRPSVNTVAPELRLQCDDGHVISKITFASYGTPTGGCQNFSVGNCHAST-TLD 723
Query: 756 LIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
L+ + C GK C+I + G VK L VEA C
Sbjct: 724 LVVEACEGKNRCAISVTNEVFGDP--CRKVVKDLAVEAEC 761
>gi|297798422|ref|XP_002867095.1| beta-galactosidase 11 [Arabidopsis lyrata subsp. lyrata]
gi|297312931|gb|EFH43354.1| beta-galactosidase 11 [Arabidopsis lyrata subsp. lyrata]
Length = 844
Score = 515 bits (1327), Expect = e-143, Method: Compositional matrix adjust.
Identities = 302/838 (36%), Positives = 438/838 (52%), Gaps = 110/838 (13%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DG ++ IDG+R++L SGSIHYPRSTP MWP +IK+AK+GGL+ I+TYVFWN HEP +
Sbjct: 40 VTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHEPQQ 99
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+++F+G DL++FIK I+ G+YV LR+GP++ AEW +GG P WL +PGI RT NK
Sbjct: 100 GKFNFSGRADLVKFIKLIEKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIF-FRTDNK 158
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F + + +I+D K+E+LFASQGGPIIL QIENEY V Y G +YI W +K+
Sbjct: 159 PFKEHTERYVRMILDKMKEERLFASQGGPIILGQIENEYSAVQRAYKQDGLNYIKWASKL 218
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGG 255
S+ +G+PW+MC+++DAP PM PN N P +WTENWT F+ +G
Sbjct: 219 VDSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKENKPSLWTENWTTQFRVFGD 278
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
+R+ ED+A++VARFF G+ NYYMYHGGTNFGRTS Y+TT Y DAP+DEYG
Sbjct: 279 PPTQRSVEDIAYSVARFFSKNGSHVNYYMYHGGTNFGRTSAH-YVTTRYYDDAPLDEYGL 337
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYGNV----------------------------TNTD 347
+PK+GHL+ LH L +K L +G NT+
Sbjct: 338 EREPKYGHLKHLHSALNLCKKPLLWGQPKTEKPGKDTEIRYYEQPGTKTCAAFLANNNTE 397
Query: 348 YGNSVS--GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKW 405
++ G Y + S+SILPDCKT +NTA++ +Q + ++ N + +K
Sbjct: 398 AAETIKFKGREYVIAPRSISILPDCKTVVYNTAQIVSQHTSRNFMKSKKANKK--FDFKV 455
Query: 406 RPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
E + + G + + T D +DY WY T+ + + +RI
Sbjct: 456 FTETLPS-KLEGNSYIPVELY---GLTKDKTDYGWYTTSFKVHKNHLPTKKGVKTFVRIA 511
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
S G LH ++NG Y+ S + + +F++ V L G+N + +L G + GS +
Sbjct: 512 SLGHALHIWLNGEYLGSGHGSHEEKSFVFQKQVTLKAGENHLIMLGVLTGFPDSGSYMEH 571
Query: 526 VPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNV 585
G G +L +G + + S KW K+G+ G +K + + + W K
Sbjct: 572 RYTGPRGVSILGLTSGTLDLTE---SSKWGNKIGMEG--EKLGIHTEEGLKKVEWK-KFT 625
Query: 586 PLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESC 645
+TWY+ F+AP + + + GMGKG WVNG +GRYW ++L+
Sbjct: 626 GKAPGLTWYQAYFDAPESLNAAAIRMNGMGKGLIWVNGEGVGRYWQSFLSP--------- 676
Query: 646 DYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGG-NPSQINFQTVVVGT 704
G P+QI YH+PRS++K N LV+FEE P ++F V T
Sbjct: 677 --------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELMDFVIVNRDT 722
Query: 705 ACGQAHENK-----------------------TMELTCHG-RRISEIKYASFGDPQGACG 740
C EN T L C G ++I+ +++ASFG+P G CG
Sbjct: 723 VCSYVGENYTPSVRHWTRKQDQVQAITDNVSLTATLKCSGTKKIAAVEFASFGNPIGVCG 782
Query: 741 AFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANL---GATSCAAGTVKRLVVEALC 795
F G+C A + +IEK C+GK C I +++ SC K L V+ C
Sbjct: 783 NFTLGTCNAPVSK-QVIEKHCLGKAECVIPVNKSTFQQDKKDSC-KNVAKTLAVQVKC 838
>gi|449454199|ref|XP_004144843.1| PREDICTED: beta-galactosidase 13-like [Cucumis sativus]
gi|449506996|ref|XP_004162905.1| PREDICTED: beta-galactosidase 13-like [Cucumis sativus]
Length = 766
Score = 514 bits (1324), Expect = e-143, Method: Compositional matrix adjust.
Identities = 307/807 (38%), Positives = 423/807 (52%), Gaps = 113/807 (14%)
Query: 59 MWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIG 118
MW D++ KA+ GGL+ I+TYVFWN HEP+ Q++F GN DL++FIK I ++ +YV LR+G
Sbjct: 1 MWSDILDKARRGGLNVIQTYVFWNIHEPVEGQFNFEGNYDLVKFIKLIGEKQMYVTLRVG 60
Query: 119 PYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPI 178
P++ AEWN+GG P WL P I R+ N F + M+ + +IVDM K+ KLFASQGGPI
Sbjct: 61 PFIQAEWNHGGLPYWLREKPNI-IFRSYNSQFKHYMKKYVAMIVDMMKENKLFASQGGPI 119
Query: 179 ILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-------- 230
+LAQIENEY +V Y + G Y+ W A MA L +GVPWIMC++ DAP P+
Sbjct: 120 VLAQIENEYNHVQLAYDELGVQYVQWAANMAVGLGVGVPWIMCKQKDAPDPVINTCNGRH 179
Query: 231 ----FT-PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMY 285
FT PN P P +WTENWT ++ +G +R AED+AF+VARFF G+ NYYMY
Sbjct: 180 CGDTFTGPNKPYKPALWTENWTAQYRVFGDPPSQRAAEDIAFSVARFFSKNGSLVNYYMY 239
Query: 286 HGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYG---- 341
HGGTNFGRTS + TT Y +AP+DE+G +PKWGHLR++HK L +K L +G
Sbjct: 240 HGGTNFGRTS-AVFTTTRYYDEAPLDEFGLQREPKWGHLRDVHKALNLCKKPLLWGTPGI 298
Query: 342 --------------------------NVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEF 375
N T + + G + LP S+SILPDCKT F
Sbjct: 299 QVIGKGLEARFYEKPGTNICAAFLANNDTKSAQTINFRGREFLLPPRSISILPDCKTVVF 358
Query: 376 NTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMI---NDFVVRGKGHFALNTLIDQKST 432
NT + +Q N + P++ N L+WK PE I V K L +L+
Sbjct: 359 NTETIVSQHNARNFIPSKNANK---LKWKMSPESIPTVEQVPVNNKIPLELYSLL----- 410
Query: 433 NDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASND 492
D +DY WY T+ +L +D LRI S G + +VNG Y+ + + N
Sbjct: 411 KDTTDYGWYTTSIELDKEDVSKRPDILPVLRIASLGHAMLVFVNGEYIGTAHGSHEEKNF 470
Query: 493 LFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSH 552
+F+ V G N I+LL VGL + G+ + G P + ++G T D+S +
Sbjct: 471 VFQGSVPFKAGVNNIALLGILVGLPDSGAYMEHRFAG-PRSITILGL---NTGTLDISKN 526
Query: 553 KWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQ 612
W ++V L G K F + + WS +TWYKT F+AP NDPV + +
Sbjct: 527 GWGHQVALQGEKVKVF--TQGGSHRVDWSEIKEE-KSALTWYKTYFDAPEGNDPVAIRMN 583
Query: 613 GMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPR 672
GMGKG WVNG ++GRYW +YL+ ST+S YH+PR
Sbjct: 584 GMGKGQIWVNGKSIGRYWMSYLSPLK-LSTQS----------------------EYHIPR 620
Query: 673 SWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACG---QAH-------ENKTME------ 716
S+IK N LV+ EE P ++ V T C Q H E K +
Sbjct: 621 SFIKPSENLLVILEEENVTPEKVEILLVNRDTICSFITQYHPPNVKSWERKDKQFRAVVD 680
Query: 717 -------LTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCS 768
L C H ++I+ I++ASFGDP G CG F+ G C + D L+E+ C+GK++CS
Sbjct: 681 DVKTGAHLRCPHDKKITNIEFASFGDPSGVCGNFEHGKCHSSSDTKKLVEQHCLGKENCS 740
Query: 769 IEASEANLGATSCAAGTVKRLVVEALC 795
+ + C + K L ++A C
Sbjct: 741 VPMDAFDNFKNECDS---KTLAIQAKC 764
>gi|290782382|gb|ADD62393.1| beta-galactosidase 3 [Prunus persica]
Length = 683
Score = 513 bits (1321), Expect = e-142, Method: Compositional matrix adjust.
Identities = 294/688 (42%), Positives = 380/688 (55%), Gaps = 82/688 (11%)
Query: 171 FASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM 230
FASQGGPIIL+QIENEYG G AG +YINW AKMA +LD GVPW+MC+E DAP PM
Sbjct: 2 FASQGGPIILSQIENEYGPESKALGAAGHAYINWAAKMAVALDTGVPWVMCKEDDAPDPM 61
Query: 231 -----------FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTF 279
F+PN P P +WTE W+GWF +GG R +DLAF+VARF Q GG++
Sbjct: 62 INACNGFYCDGFSPNKPYKPTMWTEAWSGWFTEFGGTIHHRPVQDLAFSVARFIQKGGSY 121
Query: 280 QNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLT 339
NYYMYHGGTNFGRT+GGP++TTSYDYD PIDEYG + QPK+GHL+ELHK +K E L
Sbjct: 122 INYYMYHGGTNFGRTAGGPFITTSYDYDVPIDEYGLIRQPKYGHLKELHKAIKLCEHALV 181
Query: 340 YGNVTNTDYGN----------------------------SVSGSSYNLPAWSVSILPDCK 371
+ T T G + + Y+LPAWS+SILPDC+
Sbjct: 182 SSDPTVTSLGAYQQAYVFNSGPRRCAAFLSNFHSTGARMTFNNMHYDLPAWSISILPDCR 241
Query: 372 TEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-K 430
FNTAKV QT+ R + W+ E ++ R A L++Q
Sbjct: 242 NVVFNTAKVGVQTS----RVQMIPTNSRLFSWQTYDEDVSSLHERSS--IAAGGLLEQIN 295
Query: 431 STNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGAS 490
T D SDYLWYMTN D+ + L G TL + S+G LH +VNG + S +
Sbjct: 296 VTRDTSDYLWYMTNVDISSSE--LRGGKKPTLTVQSAGHALHVFVNGQFSGSAFGTREHR 353
Query: 491 NDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLS 550
F +PV L G N+I+LLS VGL N G ++ GI GPV L G KDL+
Sbjct: 354 QFTFAKPVHLRAGINKIALLSIAVGLPNVGLHYESWKTGILGPVFLDGLGQGR---KDLT 410
Query: 551 SHKWTYKVGLYG--LDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVV 608
KW KVGL G +D + + RG S + + WYK F AP ++P+
Sbjct: 411 MQKWFNKVGLKGEAMDLVSPNGGSSVDWIRG--SLATQTKQTLKWYKAYFNAPGGDEPLA 468
Query: 609 LNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWY 668
L+++ MGKG W+NG ++G+YW Y A D CS C Y G + KC CG P+Q WY
Sbjct: 469 LDMRSMGKGQVWINGQSIGKYWMAY-ANGD-CSL--CSYIGTFRPTKCQLGCGQPTQRWY 524
Query: 669 HVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACG--QAH-------------ENK 713
HVPRSW+K N +V+FEE GG+PS+I V C Q H E+K
Sbjct: 525 HVPRSWLKPTQNLVVVFEELGGDPSKITLVKRSVAGVCADLQEHHPNAEKLDIDSHEESK 584
Query: 714 TM-----ELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSC 767
T+ L C G+ IS IK+ASFG P G CG+F++G+C A + ++EK C+G++SC
Sbjct: 585 TLHQAQVHLQCVPGQSISSIKFASFGTPTGTCGSFQQGTCHA-TNSHAIVEKNCIGRESC 643
Query: 768 SIEASEANLGATSCAAGTVKRLVVEALC 795
+ S + G C +KRL VEA+C
Sbjct: 644 LVTVSNSIFGTDPC-PNVLKRLSVEAVC 670
>gi|125536446|gb|EAY82934.1| hypothetical protein OsI_38151 [Oryza sativa Indica Group]
Length = 705
Score = 512 bits (1318), Expect = e-142, Method: Compositional matrix adjust.
Identities = 278/645 (43%), Positives = 369/645 (57%), Gaps = 84/645 (13%)
Query: 22 NLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFW 81
N + V++D RA+ I G+R++L+S +HYPR+TP MWP LI K KEGG D IETYVFW
Sbjct: 57 NFFEPFNVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKFKEGGADVIETYVFW 116
Query: 82 NAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIE 141
N HEP + QY F DL++F K + +GL++ LRIGPY CAEWN+GGFPVWL ++PGIE
Sbjct: 117 NGHEPAKGQYYFEERFDLVKFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIE 176
Query: 142 ELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSY 201
RT N+ F EMQ F T IV + K+EKL++ QGGPIIL QIENEYGN+ +YG AGK Y
Sbjct: 177 -FRTDNEPFKAEMQTFVTKIVTLMKEEKLYSWQGGPIILQQIENEYGNIQGNYGQAGKRY 235
Query: 202 INWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWF 250
+ W A+MA LD G+PW+MC+++DAP + F PN+ N P IWTE+W GW+
Sbjct: 236 MQWAAQMAIGLDTGIPWVMCRQTDAPEEIIDTCNAFYCDGFKPNSYNKPTIWTEDWDGWY 295
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPI 310
WGG P R AED AFAVARF+Q GG+ QNYYMY GGTNF RT+GGP TSYDYDAPI
Sbjct: 296 ADWGGALPHRPAEDSAFAVARFYQRGGSLQNYYMYFGGTNFARTAGGPLQITSYDYDAPI 355
Query: 311 DEYGHLNQPKWGHLRELHKLLK-------------------SMEKTLTY----------- 340
DEYG L QPKWGHL++LH +K SM++ Y
Sbjct: 356 DEYGILRQPKWGHLKDLHTAIKLCEPALIAVVGSPQYIKLGSMQEAHVYSTGEVHTNGSM 415
Query: 341 -----------GNVTNTDYGNS-VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTN--- 385
N+ Y + + G SY+LP WSVSILPDC+ FNTA++ QT+
Sbjct: 416 AGNAQICSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCENVAFNTARIGAQTSVFT 475
Query: 386 VKVKRPNQAGNDQAPL------------QWKWRPEMINDFVVRGKGHFALNTLIDQ-KST 432
V+ P+++ + + W E I + G +FA+ +++ T
Sbjct: 476 VESGSPSRSSRHKPSILSLTSGGPYLSSTWWTSKETIGTW---GGNNFAVQGILEHLNVT 532
Query: 433 NDVSDYLWYMTNADLKDDDPILSGSSNM--TLRINSSGQVLHAYVNGNYVDSQWTKYGAS 490
D+SDYLWY T ++ D D S + +L I+ V +VNG SQ + +
Sbjct: 533 KDISDYLWYTTRVNISDADVAFWSSKGVLPSLTIDKIRDVARVFVNGKLAGSQVGHWVS- 591
Query: 491 NDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLS 550
++P++L G N+++LLS VGLQNYG+ + G G V L G + + DL+
Sbjct: 592 ---LKQPIQLVEGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVTLTGLSDGDV---DLT 645
Query: 551 SHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYK 595
+ WTY+VGL G + A GWS + TWYK
Sbjct: 646 NSLWTYQVGLKG--EFSMIYAPEKQGCAGWSRMQKDSVQPFTWYK 688
>gi|242090613|ref|XP_002441139.1| hypothetical protein SORBIDRAFT_09g021140 [Sorghum bicolor]
gi|241946424|gb|EES19569.1| hypothetical protein SORBIDRAFT_09g021140 [Sorghum bicolor]
Length = 784
Score = 508 bits (1309), Expect = e-141, Method: Compositional matrix adjust.
Identities = 316/810 (39%), Positives = 422/810 (52%), Gaps = 109/810 (13%)
Query: 28 RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPL 87
+VS D RA+ +DG R++L +G +HY RSTP MWP LI KAKEGGLD I+TYVFWN HEP+
Sbjct: 41 QVSLDARALVVDGTRRLLFAGEMHYTRSTPEMWPKLIAKAKEGGLDMIQTYVFWNVHEPV 100
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
+ QY+F G DL+RFIK IQ QGLYV LRIGP++ +EW YGGFP WLH++P I R+ N
Sbjct: 101 QGQYNFEGRYDLVRFIKEIQAQGLYVSLRIGPFIESEWKYGGFPFWLHDVPNI-TFRSDN 159
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
+ F MQ F T IV+M K E L+ QGGPII +QIENEY V +G +G+ Y++W A
Sbjct: 160 EPFKQHMQRFVTDIVNMMKHEGLYYPQGGPIITSQIENEYQMVEHAFGSSGQRYVSWAAA 219
Query: 208 MATSLDIGVPWIMCQESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAF 267
MA GVPW MC+++DAP P+ ++ P + N + + +G R+ ED+AF
Sbjct: 220 MAVDRQTGVPWTMCKQNDAPDPVVGIHSHTIPLDFP-NASRNYLIYGNDTKLRSPEDIAF 278
Query: 268 AVARFF-QFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRE 326
AV F + G++ +YYMYHGGTNFGR + Y+TTSY AP+DEYG + QP WGHLRE
Sbjct: 279 AVVYFIARKNGSYVSYYMYHGGTNFGRFASS-YVTTSYYDAAPLDEYGLIWQPTWGHLRE 337
Query: 327 LHKLLKSMEKTLTYGNVTNTDYGNSVSGS----------------------------SYN 358
LH +K + L +G + G S
Sbjct: 338 LHAAVKQSSEPLLFGTYSYLSLGQEQEAHIFETESQCVAFLVNFDRHHISEVVFRNISLE 397
Query: 359 LPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGK 418
L S+SIL DCK F TAKV Q + Q+ +D W E I V K
Sbjct: 398 LAPKSISILSDCKRVVFETAKVTAQHGSRTAEEVQSFSDIN--TWTAFKEPIPQDV--SK 453
Query: 419 GHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNG 477
++ N L + ST D +DYLWY I+ N+ RI+ S H
Sbjct: 454 AMYSGNRLFEHLSTTKDDTDYLWY-----------IVGLFHNILGRIHGS----HG---- 494
Query: 478 NYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLV 537
G +N + + L G N ISLLSA VG + G+ + G+ +
Sbjct: 495 ----------GPANIILNTNISLKEGPNTISLLSAMVGSPDSGAHMERRVFGLQKVSIQQ 544
Query: 538 GRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTT 597
G+ + + +L W Y+VGL+G + Y + + S W++ +TWYKTT
Sbjct: 545 GQEPENLLNNEL----WGYQVGLFG-ERNSIYTQEGSKSVE-WTTIYNLAYSPLTWYKTT 598
Query: 598 FEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCA 657
F P ND V LNL GMGKG WVNG ++GRYW ++ A
Sbjct: 599 FSTPAGNDAVTLNLTGMGKGEVWVNGESIGRYWVSFKAP--------------------- 637
Query: 658 YNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHE------ 711
GNPSQ YH+PR ++ N LVLFEE GGNP QI TV V C +E
Sbjct: 638 --SGNPSQSLYHIPRQFLNPQDNILVLFEEMGGNPQQITVNTVSVTRVCVNVNELSAPSL 695
Query: 712 ---NK--TMELTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKK 765
NK ++L C G++IS I++AS+G+P G C + GSC A ++++ C+GK
Sbjct: 696 QYKNKEPAVDLRCQEGKQISAIEFASYGNPIGDCKKIRFGSCHAG-SSESVVKQACLGKS 754
Query: 766 SCSIEASEANLGATSCAAGTVKRLVVEALC 795
CSI + G C G K L+V A C
Sbjct: 755 GCSIPITPIKFGGDPC-PGIKKSLLVVANC 783
>gi|108862584|gb|ABA97655.2| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 713
Score = 506 bits (1304), Expect = e-140, Method: Compositional matrix adjust.
Identities = 278/653 (42%), Positives = 370/653 (56%), Gaps = 92/653 (14%)
Query: 22 NLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFW 81
N + V++D RA+ I G+R++L+S +HYPR+TP MWP LI K KEGG D IETYVFW
Sbjct: 57 NFFEPFNVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGADVIETYVFW 116
Query: 82 NAHEPLRRQYDFT--------GNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVW 133
N HEP + QY F +DL++F K + +GL++ LRIGPY CAEWN+GGFPVW
Sbjct: 117 NGHEPAKGQYYFEERFDLVKFAKIDLVKFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVW 176
Query: 134 LHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSD 193
L ++PGIE RT N+ F EMQ F T IV + K+EKL++ QGGPIIL QIENEYGN+ +
Sbjct: 177 LRDIPGIE-FRTDNEPFKAEMQTFVTKIVTLMKEEKLYSWQGGPIILQQIENEYGNIQGN 235
Query: 194 YGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIW 242
YG AGK Y+ W A+MA LD G+PW+MC+++DAP + F PN+ N P IW
Sbjct: 236 YGQAGKRYMQWAAQMAIGLDTGIPWVMCRQTDAPEEIIDTCNAFYCDGFKPNSYNKPTIW 295
Query: 243 TENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTT 302
TE+W GW+ WGG P R AED AFAVARF+Q GG+ QNYYMY GGTNF RT+GGP T
Sbjct: 296 TEDWDGWYADWGGALPHRPAEDSAFAVARFYQRGGSLQNYYMYFGGTNFARTAGGPLQIT 355
Query: 303 SYDYDAPIDEYGHLNQPKWGHLRELHKLLK-------------------SMEKTLTY--- 340
SYDYDAPIDEYG L QPKWGHL++LH +K SM++ Y
Sbjct: 356 SYDYDAPIDEYGILRQPKWGHLKDLHTAIKLCEPALIAVDGSPQYIKLGSMQEAHVYSTG 415
Query: 341 -------------------GNVTNTDYGNS-VSGSSYNLPAWSVSILPDCKTEEFNTAKV 380
N+ Y + + G SY+LP WSVSILPDC+ FNTA++
Sbjct: 416 EVHTNGSMAGNAQICSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCENVAFNTARI 475
Query: 381 NTQTN---VKVKRPNQAGNDQAPL------------QWKWRPEMINDFVVRGKGHFALNT 425
QT+ V+ P+++ + + W E I + G +FA+
Sbjct: 476 GAQTSVFTVESGSPSRSSRHKPSILSLTSGGPYLSSTWWTSKETIGTW---GGNNFAVQG 532
Query: 426 LIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNM--TLRINSSGQVLHAYVNGNYVDS 482
+++ T D+SDYLWY T ++ D D S + +L I+ V +VNG S
Sbjct: 533 ILEHLNVTKDISDYLWYTTRVNISDADVAFWSSKGVLPSLTIDKIRDVARVFVNGKLAGS 592
Query: 483 QWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGD 542
Q + + ++P++L G N+++LLS VGLQNYG+ + G G V L G +
Sbjct: 593 QVGHWVS----LKQPIQLVEGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVTLTGLSDG 648
Query: 543 ETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYK 595
+ DL++ WTY+VGL G + A GWS + TWYK
Sbjct: 649 DV---DLTNSLWTYQVGLKG--EFSMIYAPEKQGCAGWSRMQKDSVQPFTWYK 696
>gi|24417238|gb|AAN60229.1| unknown [Arabidopsis thaliana]
Length = 569
Score = 506 bits (1303), Expect = e-140, Method: Compositional matrix adjust.
Identities = 269/568 (47%), Positives = 340/568 (59%), Gaps = 49/568 (8%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
I L ++ + S V++D +A+ I+G+R+IL+SGSIHYPRSTP MWPDLIKKAKEG
Sbjct: 11 IFLAILCFSSLIHSTEAVVTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKAKEG 70
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GLD I+TYVFWN HEP Y F DL++F K + GLY+ LRIGPYVCAEWN+GGF
Sbjct: 71 GLDVIQTYVFWNGHEPSPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNFGGF 130
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
PVWL +PG+ RT N+ F MQ FT IVDM K+EKLF +QGGPIIL+QIENEYG +
Sbjct: 131 PVWLKYVPGMV-FRTDNEPFKIAMQKFTKKIVDMMKEEKLFETQGGPIILSQIENEYGPM 189
Query: 191 MSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSP 239
+ G AGK+Y W A+MA L GVPWIMC++ DAP P+ F PN+ N P
Sbjct: 190 QWEMGAAGKAYSKWTAEMALGLSTGVPWIMCKQEDAPYPIIDTCNGFYCEGFKPNSDNKP 249
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
K+WTENWTGWF +GG P R ED+AF+VARF Q GG+F NYYMY GGTNF RT+G +
Sbjct: 250 KLWTENWTGWFTEFGGAIPNRPVEDIAFSVARFIQNGGSFMNYYMYXGGTNFDRTAG-VF 308
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNS-------- 351
+ TSYDYDAPIDEYG L +PK+ HL+ELHK++K E L + T T G+
Sbjct: 309 IATSYDYDAPIDEYGLLREPKYSHLKELHKVIKLCEPALVSVDPTITSLGDKQEIHVFKS 368
Query: 352 --------------------VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRP 391
G Y+LP WSVSILPDCKTE +NTAK+ T + P
Sbjct: 369 KTSCAAFLSNYDTSSAARVMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRAPTILMKMIP 428
Query: 392 NQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDD 450
++ W G F + L++Q S T D +DY WY T+ + D
Sbjct: 429 TST-------KFSWESYNEGSPSSNEAGTFVKDGLVEQISMTRDKTDYFWYFTDITIGSD 481
Query: 451 DPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLL 510
+ L N L I S+G LH +VNG + + S F + +KL+ G N+++LL
Sbjct: 482 ESFLKTGDNPLLTIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQNIKLSVGINKLALL 541
Query: 511 SATVGLQNYGSKFDMVPNGIPGPVLLVG 538
S VGL N G ++ GI GPV L G
Sbjct: 542 STAVGLPNAGVHYETWNTGILGPVTLKG 569
>gi|449468694|ref|XP_004152056.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 338
Score = 504 bits (1299), Expect = e-140, Method: Compositional matrix adjust.
Identities = 236/331 (71%), Positives = 273/331 (82%), Gaps = 19/331 (5%)
Query: 10 AILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
A L CL + VS+D A+ I+GER+I+ SGSIHYPRST MWPDLI+KAK+
Sbjct: 9 ATLACL------TFCIGDNVSYDSNALIINGERRIIFSGSIHYPRSTEAMWPDLIQKAKD 62
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
GGLDAIETY+FW+ HEP RR+YDF+G LD I+F + IQD GLYV++RIGPYVCAEWNYGG
Sbjct: 63 GGLDAIETYIFWDRHEPQRRKYDFSGRLDFIKFFQLIQDAGLYVVMRIGPYVCAEWNYGG 122
Query: 130 FPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGN 189
FPVWLHNMPGI +LRT N+V+ NEMQ FTT IV+M K+ LFASQGGPIILAQIENEYGN
Sbjct: 123 FPVWLHNMPGI-QLRTNNQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGN 181
Query: 190 VMSD-YGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPN 237
VM+ YGDAGK+YINWCA+MA SL+IGVPWIMCQ+SDAP PM FTPNNP
Sbjct: 182 VMTPAYGDAGKAYINWCAQMAESLNIGVPWIMCQQSDAPQPMINTCNGFYCDNFTPNNPK 241
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
SPK++TENW GWFK WG KDP RTAED+AF+VARFFQ GG F NYYMYHGGTNFGRTSGG
Sbjct: 242 SPKMFTENWVGWFKKWGDKDPYRTAEDVAFSVARFFQSGGVFNNYYMYHGGTNFGRTSGG 301
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELH 328
P++TTSYDY+AP+DEYG+LNQPKWGHL++LH
Sbjct: 302 PFITTSYDYNAPLDEYGNLNQPKWGHLKQLH 332
>gi|242045426|ref|XP_002460584.1| hypothetical protein SORBIDRAFT_02g031260 [Sorghum bicolor]
gi|241923961|gb|EER97105.1| hypothetical protein SORBIDRAFT_02g031260 [Sorghum bicolor]
Length = 803
Score = 503 bits (1294), Expect = e-139, Method: Compositional matrix adjust.
Identities = 298/836 (35%), Positives = 416/836 (49%), Gaps = 144/836 (17%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D R++ IDG+R + SG+IHYPRS P +WP L+ +AKEGGL+ IETY+FWNAHEP
Sbjct: 36 VTYDARSLLIDGKRDLFFSGAIHYPRSPPEVWPKLLDRAKEGGLNTIETYIFWNAHEPEP 95
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y+F G LDL++F+K IQ+ G+Y I+RIGP++ AEWN+GG P WL + I R N
Sbjct: 96 GKYNFEGRLDLVKFLKMIQEHGMYAIVRIGPFIQAEWNHGGLPYWLREIDHII-FRANND 154
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
+ EM+ +T +V K +LFASQGGP+IL QIENEYGN+ D+ G Y+ W A+M
Sbjct: 155 PYKKEMEKWTRFVVQKLKDAELFASQGGPVILTQIENEYGNIKKDHKIEGDKYLEWAAQM 214
Query: 209 ATSLDIGVPWIMCQESDAPSPMF------------TPNNPNSPKIWTENWTGWFKSWGGK 256
A S GVPWIMC++S AP + T + N P +WTENWT F+++G +
Sbjct: 215 ALSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWTLRDKNKPMLWTENWTQQFRAYGDQ 274
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHL 316
R+AED+A+AV RFF GG+ NYYMYHGGTNFGRTS LT YD +AP+DEYG
Sbjct: 275 LAMRSAEDIAYAVLRFFAKGGSMVNYYMYHGGTNFGRTSASYVLTGYYD-EAPLDEYGMY 333
Query: 317 NQPKWGHLRELHKLLKSMEKTLTYG------------------------------NVTNT 346
+PK+GHLR+LH +++S +K G N T
Sbjct: 334 KEPKFGHLRDLHNVIRSYQKAFLSGKHSSEILGHGYEAQIFELPEENLCLSFLSNNNTGE 393
Query: 347 DYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
D G + +P+ SVSIL CK +NT +V Q + + ++ + QW+
Sbjct: 394 DGTVIFRGVKHYVPSRSVSILAGCKDVVYNTKRVFVQHSERSYHTSEVTSKNN--QWEMY 451
Query: 407 PEMI---NDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLR 463
EM+ D +R K L T D SDYLWY T+ L+ DD G L+
Sbjct: 452 SEMVPKYKDTKIRTK-----EPLEQYNQTKDASDYLWYTTSFRLESDDLPFRGDIRPVLQ 506
Query: 464 INSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF 523
+ SS + + N +V S +FE+PV L G N + LLS+T+G+++ G +
Sbjct: 507 VKSSAHSMIGFANDAFVGSARGNKQVKGFMFEKPVDLKAGVNHVVLLSSTMGMKDSGGEL 566
Query: 524 DMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSK 583
V GI E +I+ L++ +V GW
Sbjct: 567 AEVKGGI-----------QECLIQGLNTGTLDLQV-------------------NGWG-- 594
Query: 584 NVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTE 643
+K F+ P +DP+VL++ M KG +VNG +GRYW ++
Sbjct: 595 ----------HKRYFDEPDGDDPIVLDMSSMSKGMIFVNGEGIGRYWVSFRTL------- 637
Query: 644 SCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVG 703
G PSQ YH+PR ++K N LV+FEE G P I QTV
Sbjct: 638 ----------------AGTPSQAVYHIPRPFLKPKDNLLVVFEEEMGKPDGILVQTVTRD 681
Query: 704 TACGQAHENKTMELTC--------------HGRR----------ISEIKYASFGDPQGAC 739
C E+ ++ H R I E+ +ASFG+P G C
Sbjct: 682 DICLLISEHNPGQIKTWDTDGVKIKLIAEDHSVRGTLMCPPEKIIQEVVFASFGNPDGMC 741
Query: 740 GAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G F G+C + ++EK+C+GK SC + GA T L V+ C
Sbjct: 742 GNFTVGTCHTP-NAKQIVEKECLGKPSCMLPVDHTVYGADINCQSTTGTLGVQVRC 796
>gi|12323389|gb|AAG51670.1|AC010704_14 putative beta-galactosidase, 3' partial; 3669-1 [Arabidopsis
thaliana]
Length = 636
Score = 501 bits (1290), Expect = e-139, Method: Compositional matrix adjust.
Identities = 273/649 (42%), Positives = 366/649 (56%), Gaps = 61/649 (9%)
Query: 12 LLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGG 71
L+ L+L + V++DGR++ IDGE KIL SGSIHY RSTP MWP LI KAK GG
Sbjct: 8 LVFLVLMAVIVAGDVANVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGG 67
Query: 72 LDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFP 131
+D ++TYVFWN HEP + Q+DF+G+ D+++FIK +++ GLYV LRIGP++ EW+YGG P
Sbjct: 68 IDVVDTYVFWNVHEPQQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLP 127
Query: 132 VWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVM 191
WLHN+ GI RT N+ F M+ + +IV + K E L+ASQGGPIIL+QIENEYG V
Sbjct: 128 FWLHNVQGI-VFRTDNEPFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVG 186
Query: 192 SDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNS 238
+ GKSY+ W AK+A LD GVPW+MC++ DAP P+ PN+PN
Sbjct: 187 RAFRQEGKSYVKWTAKLAVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNK 246
Query: 239 PKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGP 298
P IWTENWT +++++G + R+AED+AF VA F G+F NYYMYHGGTNFGR +
Sbjct: 247 PAIWTENWTSFYQTYGEEPLIRSAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNASQF 306
Query: 299 YLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN-------- 350
+T+ YD AP+DEYG L QPKWGHL+ELH +K E+ L G T G
Sbjct: 307 VITSYYD-QAPLDEYGLLRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFG 365
Query: 351 --------------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR 390
SSY L SVS+LPDCK FNTAKVN Q N + ++
Sbjct: 366 KKANLCAAILVNQDKCESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYNTRTRK 425
Query: 391 PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDD 450
Q N +P W+ E + F +L L +T D SDYLW T +
Sbjct: 426 ARQ--NLSSPQMWEEFTETVPSFSETSIRSESL--LEHMNTTQDTSDYLWQTTRFQQSEG 481
Query: 451 DPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLL 510
P + L++N G LHA+VNG ++ S + A L E+ + L G N ++LL
Sbjct: 482 APSV-------LKVNHLGHALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLALL 534
Query: 511 SATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYN 570
S VGL N G+ + G + GR +++ W Y+VGL G +K
Sbjct: 535 SVMVGLPNSGAHLERRVVGSRSVKIWNGRYQ-----LYFNNYSWGYQVGLKG--EKFHVY 587
Query: 571 AKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFA 619
+ +++ W ++ +TWYK +F+ P DPV LNL MGKG A
Sbjct: 588 TEDGSAKVQWKQYRDSKSQPLTWYKASFDTPEGEDPVALNLGSMGKGEA 636
>gi|57283683|emb|CAG30731.1| beta-galactosidase precursor [Triticum monococcum]
Length = 839
Score = 499 bits (1286), Expect = e-138, Method: Compositional matrix adjust.
Identities = 306/838 (36%), Positives = 429/838 (51%), Gaps = 114/838 (13%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D ++ IDG R++ SG+IHYPRS MWP L+K AKEGGL+ IETYVFWNAHEP
Sbjct: 38 VTYDKYSLMIDGRRELFFSGAIHYPRSPTQMWPKLLKTAKEGGLNTIETYVFWNAHEPEP 97
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+++F G D+I+F+K IQ G+Y I+RIGP++ EWN+G P WL +P I R N+
Sbjct: 98 GKFNFEGRNDMIKFLKLIQSFGMYAIVRIGPFIQGEWNHGALPYWLREIPHII-FRANNE 156
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
+ EM+ F IV M K E LFASQGG +ILAQIENEYGN+ D+ G Y+ W A+M
Sbjct: 157 PYKREMEKFVRFIVQMLKDENLFASQGGNVILAQIENEYGNIKKDHITEGDKYLEWAAEM 216
Query: 209 ATSLDIGVPWIMCQESDAPSPMFTPNN------------PNSPKIWTENWTGWFKSWGGK 256
A S +IGVPWIMC++S AP + N N P +WTENWT F+++G
Sbjct: 217 AISTNIGVPWIMCKQSTAPGVVIPTCNGRHCGDTWIMKDENKPHLWTENWTAQFRAFGND 276
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHL 316
+R+AED+A++V RFF GGT NYYMY+GGTNFGRT G Y+ T Y + PIDEYG
Sbjct: 277 LAQRSAEDIAYSVLRFFAKGGTLVNYYMYYGGTNFGRT-GASYVLTGYYDEGPIDEYGMP 335
Query: 317 NQPKWGHLRELHKLLKSM--------------------------EKTLTYGNVTNTDYGN 350
PK+GHLR+LH ++KS E+ L ++N + G
Sbjct: 336 KAPKYGHLRDLHNVIKSYSRAFLEGKQSFELLGQGYEARNFEIPEEKLCLAFISNNNTGE 395
Query: 351 S----VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK-VKRPNQAGNDQAPLQWKW 405
G Y +P+ SVSIL DCK +NT +V Q + + + +A + W
Sbjct: 396 DGTVIFRGDKYYIPSRSVSILADCKHVVYNTKRVFVQHSERSFHKAEKATKNNV-----W 450
Query: 406 RPEMINDFVVRGKGHFALNT--LIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLR 463
EM ++ + R K N L T D SDYLWY T+ L+ DD + G +
Sbjct: 451 --EMFSELIPRYKQTTIRNKEPLEQYNQTKDQSDYLWYTTSFRLEADDLPIRGDIRPVIA 508
Query: 464 INSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF 523
+ S+ + +VN + + FE P+ L G N ++LLS+++G+++ G +
Sbjct: 509 VKSTAHAMVGFVNDAFAGNGHGSKKEKFFTFETPISLRLGVNHLALLSSSMGMKDSGGEL 568
Query: 524 DMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSK 583
+ GI + G T DL + W +K L G + K+ Y K + + W
Sbjct: 569 VELKGGIQDCTI----QGLNTGTLDLQINGWGHKAKLEG-EVKEIYTEKGMGAVK-W--- 619
Query: 584 NVPL--NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCS 641
VP + +TWYK F+ P +DPVVL++ M KG +VNG +GRYW +Y
Sbjct: 620 -VPAVSGQAVTWYKRYFDEPDGDDPVVLDMTSMCKGMIFVNGEGMGRYWTSYKT------ 672
Query: 642 TESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVV 701
K A SQ YH+PR+++K N LV+FEE G P I QTV
Sbjct: 673 -----------PGKVA------SQAVYHIPRTFLKSKNNLLVVFEEELGKPEGILIQTVR 715
Query: 702 VGTACGQAHENKTME-----------------------LTCHGRR-ISEIKYASFGDPQG 737
C E+ + L C ++ I E+ +ASFG+P G
Sbjct: 716 RDDICVFISEHNPAQIKPWDEHGGQIKLIAEDHNTRGFLNCPPKKIIQEVVFASFGNPVG 775
Query: 738 ACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+C F G+C + ++EK+C+GKK C + GA T L V+ C
Sbjct: 776 SCANFTVGTCHTP-NAKEIVEKECLGKKGCVLPVLHTFYGADINCPTTTATLAVQVRC 832
>gi|413926110|gb|AFW66042.1| hypothetical protein ZEAMMB73_706783 [Zea mays]
Length = 700
Score = 499 bits (1284), Expect = e-138, Method: Compositional matrix adjust.
Identities = 275/624 (44%), Positives = 344/624 (55%), Gaps = 102/624 (16%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D R++ I+G R+IL+SGSIHYPRS P MWP LI+KAK+GGLD ++TYVFWN HEP +
Sbjct: 40 VSYDHRSLVINGRRRILISGSIHYPRSAPEMWPGLIQKAKDGGLDVVQTYVFWNGHEPAQ 99
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QY F DL+RF+K ++ GLYV LR+GPYVCAEWN+GGFPVWL +PGI RT N
Sbjct: 100 GQYYFADRYDLVRFVKLVRQAGLYVHLRVGPYVCAEWNFGGFPVWLKYVPGI-RFRTDNG 158
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ F IV M K E LF QGGPII+AQ+ENE+G + S G GK Y +W A+M
Sbjct: 159 PFKAAMQKFVEKIVSMMKSEGLFEWQGGPIIMAQVENEFGPMESVVGSGGKPYAHWAAQM 218
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A + GVPW+MC++ DAP P+ FTPNN + P +WTE WTGWF +GG
Sbjct: 219 AVGTNAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNNKHKPTMWTEAWTGWFTKFGGAA 278
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH-- 315
P R EDLAFAVARF Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAPIDE+G
Sbjct: 279 PHRPVEDLAFAVARFVQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGMQW 338
Query: 316 -----------------------------------------------LNQPKWGHLRELH 328
L QPKWGHLR +H
Sbjct: 339 LLPSLINLNSHRLPRDICRKSSQCGFYLSVVHTWNFWGGGWVYIAGLLRQPKWGHLRNMH 398
Query: 329 KLLKSMEKTLTYGNVTNTDYGN-----------------------------SVSGSSYNL 359
+ +K E L G+ T GN G Y+L
Sbjct: 399 RAIKQAEPALVSGDPTIRSIGNYEKAYVFKSKNGACAAFLSNYHVKSAVRIRFDGRHYDL 458
Query: 360 PAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKG 419
PAWS+SILPDCKT FNTA V T + P W+ E N
Sbjct: 459 PAWSISILPDCKTAVFNTATVKEPTLLPKMSP-----VMHRFAWQSYSEDTNSL---DDS 510
Query: 420 HFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGN 478
FA + LI+Q S T D SDYLWY T+ ++ ++ L L + S+G + +VNG
Sbjct: 511 AFARDGLIEQLSLTWDKSDYLWYTTHVNIGSNERFLKSGQWPQLSVYSAGHSMQVFVNGR 570
Query: 479 YVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVG 538
S + Y F VK+ +G N+IS+LS+ VGL N G F++ G+ GPV L G
Sbjct: 571 SYGSVYGGYDNPKLTFSGYVKMWQGSNKISILSSAVGLPNNGDHFELWNVGVLGPVTLSG 630
Query: 539 RAGDETIIKDLSSHKWTYKVGLYG 562
+ +DLS +W Y+VGL G
Sbjct: 631 LNEGK---RDLSHQRWIYQVGLKG 651
>gi|222424809|dbj|BAH20357.1| AT5G56870 [Arabidopsis thaliana]
Length = 620
Score = 498 bits (1282), Expect = e-138, Method: Compositional matrix adjust.
Identities = 276/632 (43%), Positives = 363/632 (57%), Gaps = 57/632 (9%)
Query: 106 IQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMA 165
+ GLYV LRIGPYVCAEWN+GGFPVWL +PG+ RT N+ F M+ FT IV M
Sbjct: 2 VHQAGLYVNLRIGPYVCAEWNFGGFPVWLKFVPGMA-FRTDNEPFKAAMKKFTEKIVWMM 60
Query: 166 KKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESD 225
K EKLF +QGGPIILAQIENEYG V + G GK+Y W A+MA L GVPWIMC++ D
Sbjct: 61 KAEKLFQTQGGPIILAQIENEYGPVEWEIGAPGKAYTKWVAQMALGLSTGVPWIMCKQED 120
Query: 226 APSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQ 274
AP P+ F PN+ N PK+WTENWTGW+ ++GG P R ED+A++VARF Q
Sbjct: 121 APGPIIDTCNGYYCEDFKPNSINKPKMWTENWTGWYTNFGGAVPYRPVEDIAYSVARFIQ 180
Query: 275 FGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSM 334
GG+ NYYMYHGGTNF RT+G ++ +SYDYDAP+DEYG +PK+ HL+ LHK +K
Sbjct: 181 KGGSLVNYYMYHGGTNFDRTAG-EFMASSYDYDAPLDEYGLPREPKYSHLKALHKAIKLS 239
Query: 335 EKTLTYGNVTNTDYG-----------------------NSVS-----GSSYNLPAWSVSI 366
E L + T T G NS + G Y+LP WSVSI
Sbjct: 240 EPALLSADATVTSLGAKQEAYVFWSKSSCAAFLSNKDENSAARVLFRGFPYDLPPWSVSI 299
Query: 367 LPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTL 426
LPDCKTE +NTAKVN + + P ++ W G FA N L
Sbjct: 300 LPDCKTEVYNTAKVNAPSVHRNMVPTGT-------KFSWGSFNEATPTANEAGTFARNGL 352
Query: 427 IDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWT 485
++Q S T D SDY WY+T+ + + L + L + S+G LH +VNG + +
Sbjct: 353 VEQISMTWDKSDYFWYITDITIGSGETFLKTGDSPLLTVMSAGHALHVFVNGQLSGTAYG 412
Query: 486 KYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETI 545
F + +KL G N+I+LLS VGL N G+ F+ G+ GPV L G +
Sbjct: 413 GLDHPKLTFSQKIKLHAGVNKIALLSVAVGLPNVGTHFEQWNKGVLGPVTL---KGVNSG 469
Query: 546 IKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLEND 605
D+S KW+YK+G+ G + + ++ R V + +TWYK+TF P N+
Sbjct: 470 TWDMSKWKWSYKIGVKG-EALSLHTNTESSGVRWTQGSFVAKKQPLTWYKSTFATPAGNE 528
Query: 606 PVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQ 665
P+ L++ MGKG W+NG N+GR+WP Y A+ S C+Y G + + KC NCG SQ
Sbjct: 529 PLALDMNTMGKGQVWINGRNIGRHWPAYKAQ---GSCGRCNYAGTFDAKKCLSNCGEASQ 585
Query: 666 IWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
WYHVPRSW+K N +V+FEE GG+P+ I+
Sbjct: 586 RWYHVPRSWLKS-QNLIVVFEELGGDPNGISL 616
>gi|227053532|gb|ACP18874.1| beta-galactosidase pBG(b) [Carica papaya]
Length = 514
Score = 495 bits (1274), Expect = e-137, Method: Compositional matrix adjust.
Identities = 256/497 (51%), Positives = 313/497 (62%), Gaps = 63/497 (12%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D +AITI+G+R+ILLSGSIHYPRSTP MWPDLI+KAKEGGLD I+TYVFWN HEP
Sbjct: 21 VSYDHKAITINGKRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPSP 80
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y F GN DL+RFIK ++ GLYV LRIGPYVCAEWN+GGFPVWL +PGI RT N
Sbjct: 81 GKYYFGGNYDLVRFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGI-AFRTNNG 139
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ FT IVDM K E LF SQGGPIIL+QIENEYG + + G AG++Y W A+M
Sbjct: 140 PFKAYMQRFTKKIVDMMKAEGLFESQGGPIILSQIENEYGPMEYELGAAGRAYSQWAAQM 199
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A L GVPW+MC++ DAP P+ F+PN PK+WTE WTGWF +GG
Sbjct: 200 AVGLGTGVPWVMCKQDDAPDPIINSCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGAV 259
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R EDLAF+VARF Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAP+DEYG +
Sbjct: 260 PYRPVEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLVR 319
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGN-------------VTNTDYGN-------------- 350
QPKWGHL++LH+ +K E L G+ V + YG+
Sbjct: 320 QPKWGHLKDLHRAIKLCEPALVSGDPSVMPLGRFQEAHVFKSKYGHCAAFLANYNPRSFA 379
Query: 351 --SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPN--------QAGNDQAP 400
+ YNLP WS+SILPDCK +NTA+V Q+ P QA N++AP
Sbjct: 380 KVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQSARMKMVPVPIHGAFSWQAYNEEAP 439
Query: 401 LQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSN 459
G+ F L++Q +T DVSDYLWY T+ + D+ L
Sbjct: 440 SS-------------NGERSFTTVGLVEQINTTRDVSDYLWYSTDVKIDPDEGFLKTGKY 486
Query: 460 MTLRINSSGQVLHAYVN 476
TL + S+G LH +VN
Sbjct: 487 PTLTVLSAGHALHVFVN 503
>gi|57283676|emb|CAG30724.1| putative beta-galactosidase precursor [Hordeum vulgare]
Length = 833
Score = 491 bits (1263), Expect = e-136, Method: Compositional matrix adjust.
Identities = 304/837 (36%), Positives = 422/837 (50%), Gaps = 116/837 (13%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D R++ IDG+R + SG+IHYPRS P MW L+K AK+GGL+ IETYVFWNAHEP
Sbjct: 35 VSYDERSLLIDGKRDLFFSGAIHYPRSPPDMWHKLLKTAKDGGLNTIETYVFWNAHEPEP 94
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y+F G DLI+F+K IQ +Y ++RIGP++ AEWN+GG P WL +P I R N+
Sbjct: 95 GKYNFEGRNDLIKFLKLIQSHDMYALVRIGPFIQAEWNHGGLPYWLREIPHI-IFRANNE 153
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
+ EM+ F IV K ++FASQGGP+ILAQIENEYGN+ D+ G Y+ W A+M
Sbjct: 154 PYKKEMEKFVRFIVQKLKDAEMFASQGGPVILAQIENEYGNIKKDHIVEGDKYLEWAAQM 213
Query: 209 ATSLDIGVPWIMCQESDAPSPMF------------TPNNPNSPKIWTENWTGWFKSWGGK 256
A S + GVPWIMC++S AP + T + N P++WTENWT F+++G +
Sbjct: 214 AISTNTGVPWIMCKQSTAPGEVIPTCNGRHCGDTWTLKDKNKPRLWTENWTAQFRAFGDQ 273
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYM-YHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
R+AED+A++V RFF GGT NYYM Y+GGTNFGRT G Y+ T Y + P+DE
Sbjct: 274 LALRSAEDIAYSVLRFFAKGGTLVNYYMQYYGGTNFGRT-GASYVLTGYYDEGPVDEC-M 331
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYG------------------------------NVTN 345
PK+GHLR+LH L+KS + G N T
Sbjct: 332 PKAPKYGHLRDLHNLIKSYSRAFLEGKQSFELLAHGYEAHNFEIPEEKLCLAFISNNNTG 391
Query: 346 TDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKW 405
D + G Y +P+ SVSIL DCK +NT +V Q + + Q W+
Sbjct: 392 EDGTVNFRGDKYYIPSRSVSILADCKHVVYNTKRVFVQHSERSFHTAQKLAKSN--AWEM 449
Query: 406 RPEMINDF---VVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTL 462
E I + +R K L T D SDYL + L+ DD G +
Sbjct: 450 YSEPIPRYKLTSIRNKEPMEQYNL-----TKDDSDYLCFR----LEADDLPFRGDIRPVV 500
Query: 463 RINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSK 522
++ S+ L +VN + + +FE P+ L G N ++LLS+++G+++ G +
Sbjct: 501 QVKSTSHALMGFVNDAFAGNGRGSKKEKGFMFETPINLRIGINHLALLSSSMGMKDSGGE 560
Query: 523 FDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSS 582
V GI + G T DL + W +KV L G + K+ Y K + + W
Sbjct: 561 LVEVKGGIQDCTI----QGLNTGTLDLQVNGWGHKVKLEG-EVKEIYTEKGMGAVK-WVP 614
Query: 583 KNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCST 642
R +TWYK F+ P DPVVL++ MGKG +VNG +GRYWP+Y
Sbjct: 615 ATT--GRAVTWYKRYFDEPDGEDPVVLDMTSMGKGMIFVNGEGMGRYWPSYRTV------ 666
Query: 643 ESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVV 702
G PSQ YH+PR ++K N LV+FEE G P I QTV
Sbjct: 667 -----------------GGVPSQAMYHIPRPFLKPKNNLLVIFEEELGKPEGILIQTVRR 709
Query: 703 GTACGQAHENKTME-----------------------LTCHGRR-ISEIKYASFGDPQGA 738
C E+ + L C ++ I E+ +ASFG+P+G+
Sbjct: 710 DDICVFISEHNPAQIKTWDKDGGQIKLIAEDHSTRGILKCPPKKTIQEVVFASFGNPEGS 769
Query: 739 CGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
C F G+C + ++ K+C+GKKSC + GA T L V+ C
Sbjct: 770 CANFTAGTCHTP-NAKDIVAKECLGKKSCVLPVLHTVYGADINCPTTTATLAVQVRC 825
>gi|147843186|emb|CAN82672.1| hypothetical protein VITISV_014349 [Vitis vinifera]
Length = 710
Score = 486 bits (1250), Expect = e-134, Method: Compositional matrix adjust.
Identities = 284/716 (39%), Positives = 384/716 (53%), Gaps = 112/716 (15%)
Query: 28 RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPL 87
+V++DGR++ IDG RKIL SGSIHYPRSTP MW LI KAKEGG+D I+TYVFWN HEP
Sbjct: 25 QVTYDGRSLIIDGHRKILFSGSIHYPRSTPQMWASLIAKAKEGGVDVIQTYVFWNRHEPQ 84
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
QYDF G DL +FIK IQ QGLY LRIGP++ +EW+YGG P WLH++ GI RT N
Sbjct: 85 PGQYDFNGRYDLXKFIKEIQAQGLYACLRIGPFIESEWSYGGLPFWLHDVHGI-VYRTDN 143
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
+ F MQNFTT IV++ K E L+ASQGGPIIL+QIENEY N+ + + + G SY+ W AK
Sbjct: 144 EPFKFYMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYQNIEAAFNEKGPSYVRWAAK 203
Query: 208 MATSLDIGVPWIMCQESDAPSPM------------FT-PNNPNSPKIWTENWTGWFKSWG 254
MA L GVPW+MC++SDAP P+ FT PN+PN P +WTENWT +++ +G
Sbjct: 204 MAVELQTGVPWVMCKQSDAPDPVINTCNGMRCGQTFTGPNSPNKPSMWTENWTSFYEVFG 263
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
G+ R+AED+AF VA F G++ NYYM
Sbjct: 264 GETYLRSAEDIAFHVALFIARNGSYVNYYM----------------------------VS 295
Query: 315 HLNQPKWGHLRELHK--------LLKSMEKTLTYGN-----------------VTNTDYG 349
+ QPKWGHL+ELH LL ++ ++ G + N D G
Sbjct: 296 LIRQPKWGHLKELHAAITLCSTPLLNGVQSNISLGQLQEAYVFQEEMGGCVAFLVNNDEG 355
Query: 350 NSVS----GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKW 405
N+ + S L S+SILPDCK FNTAK+NT N ++ +Q+ + A +W+
Sbjct: 356 NNSTVLFQNVSIELLPKSISILPDCKNVIFNTAKINTGYNERITTSSQSFD--AVDRWEE 413
Query: 406 RPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRI 464
+ I +F+ N +++ + T D SDYLWY S + L I
Sbjct: 414 YKDAIPNFL---DTSLKSNMILEHMNMTKDESDYLWYTFRFQPN------SSCTEPLLHI 464
Query: 465 NSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFD 524
S +HA+VN YV + + F+ P+ L N IS+LS VG + G+ +
Sbjct: 465 ESLAHAVHAFVNNIYVGATHGSHDMKGFTFKSPISLNNEMNNISILSVMVGFPDSGAYLE 524
Query: 525 MVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
G+ + G I D +++ W Y+VGL G + +N E W
Sbjct: 525 SRFAGLTRVEIQCTEKG----IYDFANYTWGYQVGLSGEKLHIYKEENLSNVE--WRKTE 578
Query: 585 VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTES 644
+ N+ +TWYK F P +DPV LNL MGKG AWVNG ++GRYW ++ +
Sbjct: 579 ISTNQPLTWYKIVFNTPSGDDPVALNLSTMGKGEAWVNGQSIGRYWVSFHNSK------- 631
Query: 645 CDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTV 700
G+PSQ YHVPR+++K N LVL EE G+P I+ +T+
Sbjct: 632 ----------------GDPSQTLYHVPRAFLKTSENLLVLLEEANGDPLHISLETI 671
>gi|326496501|dbj|BAJ94712.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 672
Score = 483 bits (1243), Expect = e-133, Method: Compositional matrix adjust.
Identities = 270/651 (41%), Positives = 368/651 (56%), Gaps = 62/651 (9%)
Query: 28 RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPL 87
V++DGRA+ ++G R++L SG +HY RSTP MWP LI AK+GGLD I+TYVFWN HEP+
Sbjct: 39 EVTYDGRALVVNGTRRMLFSGEMHYTRSTPEMWPKLIANAKKGGLDVIQTYVFWNVHEPV 98
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
+ QY+F G DL++FI+ IQ QGLYV LRIGP++ AEW YGGFP WLH++P I RT N
Sbjct: 99 QGQYNFQGRYDLVKFIREIQTQGLYVSLRIGPFIEAEWKYGGFPFWLHDVPNI-TFRTDN 157
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
+ F MQ F T IV+M K E L+ QGGPII++QIENEY V +G G Y+ W A+
Sbjct: 158 EPFKQHMQRFVTQIVNMMKHEGLYYPQGGPIIISQIENEYQMVEPAFGSGGPRYVRWAAE 217
Query: 208 MATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWG 254
MA L GVPW+MC+++DAP P+ PN+P P +WTENWT + +G
Sbjct: 218 MAVGLQTGVPWMMCKQNDAPDPIINTCNGLICGETFVGPNSPTKPALWTENWTTRYPIYG 277
Query: 255 GKDPKRTAEDLAFAVARFF-QFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEY 313
R+ ED+AFAVA F + G+F +YYMYHGGTNFGR + Y+TTSY AP+DEY
Sbjct: 278 NDTKLRSTEDIAFAVALFIARKKGSFVSYYMYHGGTNFGRFASS-YVTTSYYDGAPLDEY 336
Query: 314 GHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGS------------------ 355
G + +P WGHLRELH +K + L +G +N G
Sbjct: 337 GLIWRPTWGHLRELHAAVKLSSEALLFGRYSNFSLGPEQEAHIFETELKCVAFLVNFDKH 396
Query: 356 ----------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKW 405
+ L S+S+L +C+T F TA+VN Q + ++ ND WK
Sbjct: 397 QTPTVVFRNIYFQLAPKSISVLSECRTVVFETARVNAQYGSRTAEVVESLNDIH--TWKA 454
Query: 406 RPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNAD-LKDDDPILSGSSNMTLR 463
E I + + K + N L + S T D +DYLWY+ + + + DD L + L
Sbjct: 455 FKEPIPEDI--SKAVYTGNQLFEHLSMTKDETDYLWYIVSYEYIPSDDGQL-----VLLN 507
Query: 464 INSSGQVLHAYVNGNYVDSQWTKY-GASNDLFERPVKLTRGKNQISLLSATVGLQNYGSK 522
+ S VLHA+VN Y S + G N + + L G+N ISLLS VG + G+
Sbjct: 508 VESRAHVLHAFVNTEYAGSVHGSHDGPGNIILNTNISLNEGQNTISLLSVMVGSPDSGAH 567
Query: 523 FDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSS 582
+ GI + G+ + L++ W Y+VGLYG ++ + +++++E W+
Sbjct: 568 MERRSFGIHKVSIQQGQQP----LHLLNNELWAYQVGLYGEANRIYTQEESSSAE--WTE 621
Query: 583 KNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTY 633
N TWYKTTF P+ ND V LNL MGKG WVNG +LGRYW ++
Sbjct: 622 INNLTYHPFTWYKTTFATPVGNDVVALNLTSMGKGEVWVNGESLGRYWVSF 672
>gi|22329242|ref|NP_195571.2| beta-galactosidase 14 [Arabidopsis thaliana]
gi|332661551|gb|AEE86951.1| beta-galactosidase 14 [Arabidopsis thaliana]
Length = 988
Score = 480 bits (1236), Expect = e-132, Method: Compositional matrix adjust.
Identities = 291/813 (35%), Positives = 422/813 (51%), Gaps = 122/813 (15%)
Query: 59 MWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIG 118
MWP +I KA+ GGL+ I+TYVFWN HEP + +YDF G DL++FIK I ++GLYV LR+G
Sbjct: 1 MWPSIIDKARIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHEKGLYVTLRLG 60
Query: 119 PYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPI 178
P++ AEWN+GG P WL +P + RT N+ F + + I+ M K+EKLFASQGGPI
Sbjct: 61 PFIQAEWNHGGLPYWLREVPDV-YFRTNNEPFKEHTERYVRKILGMMKEEKLFASQGGPI 119
Query: 179 ILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFT------ 232
IL QIENEY V Y + G+ YI W A + S+++G+PW+MC+++DAP +
Sbjct: 120 ILGQIENEYNAVQLAYKENGEKYIKWAANLVESMNLGIPWVMCKQNDAPGNLINACNGRH 179
Query: 233 -------PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMY 285
PN + P +WTENWT F+ +G +RT ED+AF+VAR+F G+ NYYMY
Sbjct: 180 CGDTFPGPNRHDKPSLWTENWTTQFRVFGDPPTQRTVEDIAFSVARYFSKNGSHVNYYMY 239
Query: 286 HGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNV-- 343
HGGTNFGRTS ++TT Y DAP+DE+G PK+GHL+ +H+ L+ +K L +G +
Sbjct: 240 HGGTNFGRTSAH-FVTTRYYDDAPLDEFGLEKAPKYGHLKHVHRALRLCKKALFWGQLRA 298
Query: 344 --------------------------TNTDYGNSV--SGSSYNLPAWSVSILPDCKTEEF 375
NT N++ G Y LP+ S+SILPDCKT +
Sbjct: 299 QTLGPDTEVRYYEQPGTKVCAAFLSNNNTRDTNTIKFKGQDYVLPSRSISILPDCKTVVY 358
Query: 376 NTAKVNTQTNVK--VKRPNQAGNDQAPLQWKWRPEMIN-DFVVRGKGHFALNTLIDQKST 432
NTA++ Q + + VK + + + + P +++ D ++ G+ ++ T
Sbjct: 359 NTAQIVAQHSWRDFVKSEKTSKGLKFEMFSENIPSLLDGDSLIPGELYYL---------T 409
Query: 433 NDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASND 492
D +DY WY T+ + +DD LR+ S G L YVNG Y ++ +
Sbjct: 410 KDKTDYAWYTTSVKIDEDDFPDQKGLKTILRVASLGHALIVYVNGEYAGKAHGRHEMKSF 469
Query: 493 LFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLS-S 551
F +PV G N+IS+L GL + GS + G P + ++G ++ +DL+ +
Sbjct: 470 EFAKPVNFKTGDNRISILGVLTGLPDSGSYMEHRFAG-PRAISIIGL---KSGTRDLTEN 525
Query: 552 HKWTYKVGLYGLDDKKFYNAKAANSERGWSS--KNVPLNRRMTWYKTTFEAPLENDPVVL 609
++W + GL G + K+ Y + + + W K PL TWYKT FE P + V +
Sbjct: 526 NEWGHLAGLEG-EKKEVYTEEGSKKVK-WEKDGKRKPL----TWYKTYFETPEGVNAVAI 579
Query: 610 NLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYH 669
++ MGKG WVNG +GRYW ++L+ G P+Q YH
Sbjct: 580 RMKAMGKGLIWVNGIGVGRYWMSFLSP-----------------------LGEPTQTEYH 616
Query: 670 VPRSWIK--DGVNTLVLFEEFGG-NPSQINFQTVVVGTACGQAHEN-------------- 712
+PRS++K N LV+ EE G I+F V T C E+
Sbjct: 617 IPRSFMKGEKKKNMLVILEEEPGVKLESIDFVLVNRDTICSNVGEDYPVSVKSWKREGPK 676
Query: 713 -----KTMELTCHGR-----RISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCV 762
K M L R ++ E+++ASFGDP G CG F G C A ++EK+C+
Sbjct: 677 IVSRSKDMRLKAVMRCPPEKQMVEVQFASFGDPTGTCGNFTMGKCSAS-KSKEVVEKECL 735
Query: 763 GKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G+ CSI + G C VK L V+ C
Sbjct: 736 GRNYCSIVVARETFGDKGCPE-IVKTLAVQVKC 767
>gi|218202538|gb|EEC84965.1| hypothetical protein OsI_32205 [Oryza sativa Indica Group]
Length = 807
Score = 472 bits (1215), Expect = e-130, Method: Compositional matrix adjust.
Identities = 295/835 (35%), Positives = 415/835 (49%), Gaps = 139/835 (16%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D R++ IDG+R + SG+IHYPRS P MW L+K AK GGL+ IETYVFWN HEP
Sbjct: 36 VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWNGHEPEP 95
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y F G DLIRF+ I+D +Y I+RIGP++ AEWN+GG P WL + I R N+
Sbjct: 96 GKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHII-FRANNE 154
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F +IENEYGN+ D G Y+ W A+M
Sbjct: 155 PF-------------------------------KIENEYGNIKKDRKVEGDKYLEWAAEM 183
Query: 209 ATSLDIGVPWIMCQESDAPSPMFTPNN------------PNSPKIWTENWTGWFKSWGGK 256
A S IGVPW+MC++S AP + N N P++WTENWT F+++G +
Sbjct: 184 AISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDKNKPRLWTENWTAQFRTFGDQ 243
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHL 316
+R+AED+A+AV RFF GGT NYYMYHGGTNFGRT G Y+ T Y +AP+DEYG
Sbjct: 244 LAQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMC 302
Query: 317 NQPKWGHLRELHKLLKSMEKTLTYG------------------------------NVTNT 346
+PK+GHLR+LH ++KS K +G N T
Sbjct: 303 KEPKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSNNNTGE 362
Query: 347 DYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQ-WKW 405
D G + +P+ SVSIL DCKT +NT +V Q + +R ++ + W+
Sbjct: 363 DGTVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHS---ERSFHTTDETSKNNVWEM 419
Query: 406 RPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRIN 465
E I F R L T D SDYLWY T+ L+ DD ++I
Sbjct: 420 YSEAIPKF--RKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDIRPVIQIK 477
Query: 466 SSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDM 525
S+ + + N +V + + +FE+P+ L G N I++LS+++G+++ G +
Sbjct: 478 STAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKDSGGELVE 537
Query: 526 VPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKA-ANSERGWSSKN 584
V GI V+ G T DL + +K L G +DK+ Y K A + + +
Sbjct: 538 VKGGIQDCVV----QGLNTGTLDLQGNGRGHKARLEG-EDKEIYTEKGMAQFQWKPAEND 592
Query: 585 VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTES 644
+P+ TWYK F+ P +DP+V+++ M KG +VNG +GRYW +++
Sbjct: 593 LPI----TWYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWTSFITL-------- 640
Query: 645 CDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGT 704
G+PSQ YH+PR+++K N L++FEE G P I QTV
Sbjct: 641 ---------------AGHPSQSVYHIPRAFLKPKGNLLIIFEEELGKPGGILIQTVRRDD 685
Query: 705 ACGQAHEN-----KTME------------------LTCHGRR-ISEIKYASFGDPQGACG 740
C E+ KT E L C +R I E+ +ASFG+P+GACG
Sbjct: 686 ICVFISEHNPAQIKTWESDGGQIKLIAEDTSTRGTLNCPPQRTIQEVVFASFGNPEGACG 745
Query: 741 AFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
F G+C D ++EK+C+GK+SC + GA T L V+ C
Sbjct: 746 NFTAGTCHTP-DAKAVVEKECLGKESCVLPVVNTVYGADINCPATTATLAVQVRC 799
>gi|238481152|ref|NP_001154292.1| beta-galactosidase 14 [Arabidopsis thaliana]
gi|332661552|gb|AEE86952.1| beta-galactosidase 14 [Arabidopsis thaliana]
Length = 1052
Score = 470 bits (1209), Expect = e-129, Method: Compositional matrix adjust.
Identities = 300/883 (33%), Positives = 446/883 (50%), Gaps = 143/883 (16%)
Query: 4 LKHCSRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSG--------------- 48
+K +R ++ L++ +L + + ++ + +T DG + +
Sbjct: 1 MKSRTRYLIAILLVISLCSKASSHDDEKKKKGVTYDGSERNFIDHKWKKRASFLWFCSLP 60
Query: 49 SIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQD 108
S H R MWP +I KA+ GGL+ I+TYVFWN HEP + +YDF G DL++FIK I +
Sbjct: 61 SKHTSRKH--MWPSIIDKARIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHE 118
Query: 109 QGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKE 168
+GLYV LR+GP++ AEWN+GG P WL +P + RT N+ F + + I+ M K+E
Sbjct: 119 KGLYVTLRLGPFIQAEWNHGGLPYWLREVPDV-YFRTNNEPFKEHTERYVRKILGMMKEE 177
Query: 169 KLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPS 228
KLFASQGGPIIL QIENEY V Y + G+ YI W A + S+++G+PW+MC+++DAP
Sbjct: 178 KLFASQGGPIILGQIENEYNAVQLAYKENGEKYIKWAANLVESMNLGIPWVMCKQNDAPG 237
Query: 229 PMFT-------------PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQF 275
+ PN + P +WTENWT F+ +G +RT ED+AF+VAR+F
Sbjct: 238 NLINACNGRHCGDTFPGPNRHDKPSLWTENWTTQFRVFGDPPTQRTVEDIAFSVARYFSK 297
Query: 276 GGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSME 335
G+ NYYMYHGGTNFGRTS ++TT Y DAP+DE+G PK+GHL+ +H+ L+ +
Sbjct: 298 NGSHVNYYMYHGGTNFGRTSAH-FVTTRYYDDAPLDEFGLEKAPKYGHLKHVHRALRLCK 356
Query: 336 KTLTYGNV----------------------------TNTDYGNSV--SGSSYNLPAWSVS 365
K L +G + NT N++ G Y LP+ S+S
Sbjct: 357 KALFWGQLRAQTLGPDTEVRYYEQPGTKVCAAFLSNNNTRDTNTIKFKGQDYVLPSRSIS 416
Query: 366 ILPDCKTEEFNTAKVNTQTNVK--VKRPNQAGNDQAPLQWKWRPEMIN-DFVVRGKGHFA 422
ILPDCKT +NTA++ Q + + VK + + + + P +++ D ++ G+ ++
Sbjct: 417 ILPDCKTVVYNTAQIVAQHSWRDFVKSEKTSKGLKFEMFSENIPSLLDGDSLIPGELYYL 476
Query: 423 LNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDS 482
T D +DY + +DD P G + LR+ S G L YVNG Y
Sbjct: 477 ---------TKDKTDYACVKID---EDDFPDQKGLKTI-LRVASLGHALIVYVNGEYAGK 523
Query: 483 QWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGD 542
++ + F +PV G N+IS+L GL + GS + G P + ++G
Sbjct: 524 AHGRHEMKSFEFAKPVNFKTGDNRISILGVLTGLPDSGSYMEHRFAG-PRAISIIGL--- 579
Query: 543 ETIIKDLS-SHKWTYKVGLYGLDDKKFYNAKAANSERGWSS--KNVPLNRRMTWYKTTFE 599
++ +DL+ +++W + GL G + K+ Y + + + W K PL TWYKT FE
Sbjct: 580 KSGTRDLTENNEWGHLAGLEG-EKKEVYTEEGSKKVK-WEKDGKRKPL----TWYKTYFE 633
Query: 600 APLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYN 659
P + V + ++ MGKG WVNG +GRYW ++L+
Sbjct: 634 TPEGVNAVAIRMKAMGKGLIWVNGIGVGRYWMSFLSP----------------------- 670
Query: 660 CGNPSQIWYHVPRSWIK--DGVNTLVLFEEFGG-NPSQINFQTVVVGTACGQAHEN---- 712
G P+Q YH+PRS++K N LV+ EE G I+F V T C E+
Sbjct: 671 LGEPTQTEYHIPRSFMKGEKKKNMLVILEEEPGVKLESIDFVLVNRDTICSNVGEDYPVS 730
Query: 713 ---------------KTMELTCHGR-----RISEIKYASFGDPQGACGAFKKGSCEAEID 752
K M L R ++ E+++ASFGDP G CG F G C A
Sbjct: 731 VKSWKREGPKIVSRSKDMRLKAVMRCPPEKQMVEVQFASFGDPTGTCGNFTMGKCSAS-K 789
Query: 753 VLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
++EK+C+G+ CSI + G C VK L V+ C
Sbjct: 790 SKEVVEKECLGRNYCSIVVARETFGDKGCPE-IVKTLAVQVKC 831
>gi|414870185|tpg|DAA48742.1| TPA: hypothetical protein ZEAMMB73_126543 [Zea mays]
Length = 706
Score = 466 bits (1199), Expect = e-128, Method: Compositional matrix adjust.
Identities = 259/650 (39%), Positives = 364/650 (56%), Gaps = 60/650 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D R++ DG R+I LSGSIHYPRS P MWP+LI KAKEGGL+ IETYVFWN HEP +
Sbjct: 43 VSYDRRSLMFDGHREIFLSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYVFWNIHEPEK 102
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+++F G D++RF + IQ+ +Y ++R+GP++ AEWN+GG P WL +P I RT N+
Sbjct: 103 GEFNFEGQNDVVRFFQLIQEHDMYAMVRLGPFIQAEWNHGGLPYWLREIPDI-VFRTNNE 161
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
+ M+ F +I+ K LFASQGGPIILAQIENEY ++ + + D G YINW AKM
Sbjct: 162 PYKMHMETFVKIIIKRLKDANLFASQGGPIILAQIENEYQHMEAAFKDEGTKYINWAAKM 221
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGG 255
A S +IG+PWIMC+++ APS + P N + P +WTENWT ++ +G
Sbjct: 222 AISTNIGIPWIMCKQTKAPSDVIPTCNGRNCGDTWPGPTNKSMPLLWTENWTAQYRVFGD 281
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
+R+AED+AFAVARFF GGT NYYMYHGGTNFGRTS + YD +AP+DE+G
Sbjct: 282 PPSQRSAEDIAFAVARFFSVGGTLANYYMYHGGTNFGRTSAAFVMPKYYD-EAPLDEFGL 340
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS---------------------- 353
+PKWGHLR+LH+ LK +K L +G + G +
Sbjct: 341 YKEPKWGHLRDLHQALKLCKKALLWGTPSTEKLGKQLEARVFEMPEQKVCVAFLSNHNTK 400
Query: 354 --------GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKW 405
G Y +P S+S+L DC+T F T VN Q N + DQ W
Sbjct: 401 DDATMTFRGRPYFVPRHSISVLADCETVVFGTQHVNAQHNQRTFH----FADQTAQNNVW 456
Query: 406 RPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRI 464
+ + L D + T D +DY+WY ++ L+ DD + L +
Sbjct: 457 EMFDGENVPKYKQAKIRLRKAGDLYNLTKDKTDYVWYTSSFKLEADDMPIRSDIKTVLEV 516
Query: 465 NSSGQVLHAYVNGNYVD-SQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF 523
NS G A+VN +V TK + L E+P+ L +G N +++L++++G+ + G+
Sbjct: 517 NSHGHASVAFVNNKFVGCGHGTKMNKAFTL-EKPMDLKKGVNHVAVLASSMGMTDSGAYM 575
Query: 524 DMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSK 583
+ G+ + AG DL+++ W + VGL G + K+ Y K S +
Sbjct: 576 EHRLAGVDRVQITGLNAG----TLDLTNNGWGHIVGLVG-ERKQIYTDKGMGSVTWKPAM 630
Query: 584 NVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTY 633
N +R +TWYK F+ P DPVVL++ MGKG +VNG +GRYW +Y
Sbjct: 631 N---DRPLTWYKRHFDMPSGEDPVVLDMSTMGKGMMFVNGQGIGRYWISY 677
>gi|298205211|emb|CBI17270.3| unnamed protein product [Vitis vinifera]
Length = 1064
Score = 460 bits (1183), Expect = e-126, Method: Compositional matrix adjust.
Identities = 210/342 (61%), Positives = 255/342 (74%), Gaps = 13/342 (3%)
Query: 10 AILLCLILQTLFNLSLA-YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAK 68
A LLC L +S A + VS+D RA+ IDG+R++L+S IHYPR+TP MWPDLI K+K
Sbjct: 9 AALLCFSLTIQLGVSFAPFNVSYDHRALLIDGKRRMLVSAGIHYPRATPEMWPDLIAKSK 68
Query: 69 EGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYG 128
EGG D I+TYVFWN HEP+RRQY+F G D+++F+K + GLY+ LRIGPYVCAEWN+G
Sbjct: 69 EGGADVIQTYVFWNGHEPVRRQYNFEGRYDIVKFVKLVGSSGLYLHLRIGPYVCAEWNFG 128
Query: 129 GFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYG 188
GFPVWL ++PGIE RT N F +EMQ F IVD+ +KE LF+ QGGPII+ QIENEYG
Sbjct: 129 GFPVWLRDIPGIE-FRTDNAPFKDEMQRFVKKIVDLMQKEMLFSWQGGPIIMLQIENEYG 187
Query: 189 NVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPN 237
NV S +G GK Y+ W A+MA LD GVPW+MCQ++DAP + F PN+ N
Sbjct: 188 NVESSFGQRGKDYVKWAARMALELDAGVPWVMCQQADAPDIIINACNGFYCDAFWPNSAN 247
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
PK+WTE+W GWF SWGG+ PKR ED+AFAVARFFQ GG+F NYYMY GGTNFGR+SGG
Sbjct: 248 KPKLWTEDWNGWFASWGGRTPKRPVEDIAFAVARFFQRGGSFHNYYMYFGGTNFGRSSGG 307
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLT 339
P+ TSYDYDAPIDEYG L+QPKWGHL+ELH +K E L
Sbjct: 308 PFYVTSYDYDAPIDEYGLLSQPKWGHLKELHAAIKLCEPALV 349
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 188/473 (39%), Positives = 252/473 (53%), Gaps = 56/473 (11%)
Query: 354 GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDF 413
G Y LP WSVSILPDC+T FNTAKV QT++K + + P W E I+
Sbjct: 606 GQIYKLPPWSVSILPDCRTTVFNTAKVGAQTSIKTNKISYV-----PKTWMTLKEPIS-- 658
Query: 414 VVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNM--TLRINSSGQV 470
V + +F + +++ T D SDYLW +T ++ +D + + TL I+S +
Sbjct: 659 -VWSENNFTIQGVLEHLNVTKDHSDYLWRITRINVSAEDISFWEENQVSPTLSIDSMRDI 717
Query: 471 LHAYVNGNYVDS---QWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
LH +VNG + S W K +P++L +G N + LLS TVGLQNYG+ +
Sbjct: 718 LHIFVNGQLIGSVIGHWVK-------VVQPIQLLQGYNDLVLLSQTVGLQNYGAFLEKDG 770
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
G G V L G E DLS + WTY+VGL G K + ++ +E W+
Sbjct: 771 AGFKGQVKLTGFKNGEI---DLSEYSWTYQVGLRGEFQKIYMIDESEKAE--WTDLTPDA 825
Query: 588 N-RRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCD 646
+ TWYKT F+AP +PV L+L MGKG AWVNG+++GRYW T +A +DGC CD
Sbjct: 826 SPSTFTWYKTFFDAPNGENPVALDLGSMGKGQAWVNGHHIGRYW-TRVAPKDGCG--KCD 882
Query: 647 YRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTAC 706
YRG Y + KCA NCGNP+QIWYH+PRSW++ N LVLFEE GG P +I+ ++ T C
Sbjct: 883 YRGHYHTSKCATNCGNPTQIWYHIPRSWLQASNNLLVLFEETGGKPFEISVKSRSTQTIC 942
Query: 707 GQAHENK-----------------------TMELTC-HGRRISEIKYASFGDPQGACGAF 742
+ E+ M L C G IS I++AS+G PQG+C F
Sbjct: 943 AEVSESHYPSLQNWSPSDFIDQNSKNKMTPEMHLQCDDGHTISSIEFASYGTPQGSCQMF 1002
Query: 743 KKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+G C A + L L+ K C GK SC I + G C G VK L VEA C
Sbjct: 1003 SQGQCHAP-NSLALVSKACQGKGSCVIRILNSAFGGDPC-RGIVKTLAVEAKC 1053
>gi|357437611|ref|XP_003589081.1| Beta-galactosidase [Medicago truncatula]
gi|355478129|gb|AES59332.1| Beta-galactosidase [Medicago truncatula]
Length = 589
Score = 458 bits (1179), Expect = e-126, Method: Compositional matrix adjust.
Identities = 257/597 (43%), Positives = 330/597 (55%), Gaps = 56/597 (9%)
Query: 143 LRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYI 202
RT N+ F MQ FTT IV M K E LF +QGGPII++QIENEYG V + G GK+Y
Sbjct: 3 FRTDNEPFKAAMQKFTTKIVTMMKAESLFQTQGGPIIMSQIENEYGPVEWEIGAPGKAYT 62
Query: 203 NWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFK 251
W A+MA LD GVPW MC++ DAP P+ FTPN PK+WTENW+GW+
Sbjct: 63 KWAAQMAVGLDTGVPWDMCKQEDAPDPVIDTCNGYYCENFTPNENFKPKMWTENWSGWYT 122
Query: 252 SWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPID 311
+GG R EDLA++VA F Q G+F NYYMYHGGTNFGRTS G ++ TSYDYDAPID
Sbjct: 123 DFGGAISHRPTEDLAYSVATFIQNRGSFVNYYMYHGGTNFGRTSSGLFIATSYDYDAPID 182
Query: 312 EYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN--------------------- 350
EYG N+PKW HL+ LHK +K E L + T T GN
Sbjct: 183 EYGLPNEPKWSHLKNLHKAIKQCEPALISVDPTVTWLGNKNLEAHVYYVNTSICAAFLAN 242
Query: 351 ---------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPL 401
+ Y+LP WSVSILPDCKT FNTA VN + K P + D
Sbjct: 243 YDTKSAATVTFGNGQYDLPPWSVSILPDCKTVVFNTATVNGHSFHKRMTPVETTFD---- 298
Query: 402 QWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNM 460
W+ N L +Q T D SDYLWY+T+ ++ + +
Sbjct: 299 ---WQSYSEEPAYSSDDDSIIANALWEQINVTRDSSDYLWYLTDVNISPSESFIKNGQFP 355
Query: 461 TLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYG 520
TL INS+G VLH +VNG + + F V L G N+ISLLS VGL N G
Sbjct: 356 TLTINSAGHVLHVFVNGQLSGTVYGGLDNPKVTFSESVNLKVGNNKISLLSVAVGLPNVG 415
Query: 521 SKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGW 580
F+ G+ GPV L G DE +DLS KW+YKVGL G + + ++S
Sbjct: 416 LHFETWNVGVLGPVRLKGL--DEGT-RDLSWQKWSYKVGLKG-ESLSLHTITGSSSIDWT 471
Query: 581 SSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGC 640
++ + +TWYKTTF+AP NDPV L++ MGKG W+N ++GR+WP Y+A +
Sbjct: 472 QGSSLAKKQPLTWYKTTFDAPSGNDPVALDMSSMGKGEIWINDQSIGRHWPAYIAHGN-- 529
Query: 641 STESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
+ C+Y G + + KC NCG P+Q WYH+PRSW+ N LV+ EE+GG+P+ I+
Sbjct: 530 -CDECNYAGTFTNPKCRTNCGEPTQKWYHIPRSWLSSSGNVLVVLEEWGGDPTGISL 585
>gi|222616997|gb|EEE53129.1| hypothetical protein OsJ_35927 [Oryza sativa Japonica Group]
Length = 740
Score = 456 bits (1173), Expect = e-125, Method: Compositional matrix adjust.
Identities = 269/680 (39%), Positives = 358/680 (52%), Gaps = 119/680 (17%)
Query: 22 NLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFW 81
N + V++D RA+ I G+R++L+S +HYPR+TP MWP LI K KEGG D IETYVFW
Sbjct: 57 NFFEPFNVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGADVIETYVFW 116
Query: 82 NAHEPLRRQYDFTGNLDLIRF----------------IKTIQDQGLYVIL---------- 115
N HEP + QY F DL++F I ++ G VI
Sbjct: 117 NGHEPAKGQYYFEERFDLVKFAKIDLVKFAKLMWPSLIAKCKEGGADVIETYVFWNGHEP 176
Query: 116 ---------RIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAK 166
R P + GFPVWL ++PGIE RT N+ F EMQ F T IV + K
Sbjct: 177 AKGQYYFEERFDPVKFEKHVIFGFPVWLRDIPGIE-FRTDNEPFKAEMQTFVTKIVTLMK 235
Query: 167 KEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDA 226
+EKL++ QGGPIIL QIENEYGN+ +YG AGK Y+ W A+MA LD G+PW+MC+++DA
Sbjct: 236 EEKLYSWQGGPIILQQIENEYGNIQGNYGQAGKRYMQWAAQMAIGLDTGIPWVMCRQTDA 295
Query: 227 PSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQF 275
P + F PN+ N P IWTE+W GW+ WGG P R AED AFAVARF+Q
Sbjct: 296 PEEIIDTCNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGGALPHRPAEDSAFAVARFYQR 355
Query: 276 GGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLK--- 332
GG+ QNYYMY GGTNF RT+GGP TSYDYDAPIDEYG L QPKWGHL++LH +K
Sbjct: 356 GGSLQNYYMYFGGTNFARTAGGPLQITSYDYDAPIDEYGILRQPKWGHLKDLHTAIKLCE 415
Query: 333 ----------------SMEKTLTY----------------------GNVTNTDYGNS-VS 353
SM++ Y N+ Y + +
Sbjct: 416 PALIAVDGSPQYIKLGSMQEAHVYSTGEVHTNGSMAGNAQICSAFLANIDEHKYASVWIF 475
Query: 354 GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTN---VKVKRPNQAGNDQAPL--------- 401
G SY+LP WSVSILPDC+ FNTA++ QT+ V+ P+++ + +
Sbjct: 476 GKSYSLPPWSVSILPDCENVAFNTARIGAQTSVFTVESGSPSRSSRHKPSILSLTSGGPY 535
Query: 402 ---QWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGS 457
W E I + G +FA+ +++ T D+SDYLWY T ++ D D S
Sbjct: 536 LSSTWWTSKETIGTW---GGNNFAVQGILEHLNVTKDISDYLWYTTRVNISDADVAFWSS 592
Query: 458 SNM--TLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVG 515
+ +L I+ V +VNG SQ + + ++P++L G N+++LLS VG
Sbjct: 593 KGVLPSLTIDKIRDVARVFVNGKLAGSQVGHWVS----LKQPIQLVEGLNELTLLSEIVG 648
Query: 516 LQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAAN 575
LQNYG+ + G G V L G + + DL++ WTY+VGL G + A
Sbjct: 649 LQNYGAFLEKDGAGFRGQVTLTGLSDGDV---DLTNSLWTYQVGLKG--EFSMIYAPEKQ 703
Query: 576 SERGWSSKNVPLNRRMTWYK 595
GWS + TWYK
Sbjct: 704 GCAGWSRMQKDSVQPFTWYK 723
>gi|147768425|emb|CAN73625.1| hypothetical protein VITISV_026637 [Vitis vinifera]
Length = 767
Score = 453 bits (1165), Expect = e-124, Method: Compositional matrix adjust.
Identities = 293/862 (33%), Positives = 419/862 (48%), Gaps = 184/862 (21%)
Query: 12 LLCLILQTLFNLSLAY-------RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLI 64
L+ +L L + + A+ V++DGR++ ++G R++L SGSIHYPRSTP
Sbjct: 8 LIAAVLSLLVSYAAAHGIAKGAKTVTYDGRSLIVNGRRELLFSGSIHYPRSTP------- 60
Query: 65 KKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAE 124
+++F GN DL++FIK I D GLY LRIGP++ AE
Sbjct: 61 -------------------------EFNFEGNYDLVKFIKLIGDYGLYATLRIGPFIEAE 95
Query: 125 WNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIE 184
WN+GGFP WL +P I R+ N+ F M+ ++ +I++M K+ KLFA QGGPIILAQIE
Sbjct: 96 WNHGGFPYWLREVPDII-FRSYNEPFKYHMEKYSRMIIEMMKEAKLFAPQGGPIILAQIE 154
Query: 185 NEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM------------FT 232
NEY ++ Y + G Y+ W KMA L GVPWIMC++ DAP P+ FT
Sbjct: 155 NEYNSIQLAYKELGVQYVQWAGKMAVGLGAGVPWIMCKQKDAPDPVINTCNGRHCGDTFT 214
Query: 233 -PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNF 291
PN PN P +WTENWT ++ +G +R AEDLAF+VARF GT NYYMYHGGTNF
Sbjct: 215 GPNRPNKPSLWTENWTAQYRVFGDPPSQRAAEDLAFSVARFISKNGTLANYYMYHGGTNF 274
Query: 292 GRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGN--------- 342
GRT G ++TT Y +AP+DEYG +PKWGHL++LH L+ +K L G+
Sbjct: 275 GRT-GSSFVTTRYYDEAPLDEYGLQREPKWGHLKDLHSALRLCKKALFTGSPGVEKLGKD 333
Query: 343 -----------------VTNTDYGNSVS----GSSYNLPAWSVSILPDCKTEEFNTAKVN 381
+TN + + G Y LP S+SILPDCKT +NT +V
Sbjct: 334 KEVRFYEKPGTHICAAFLTNNHSREAATLTFRGEEYFLPPHSISILPDCKTVVYNTQRVV 393
Query: 382 TQTNVKVKRPNQAGNDQAPLQWKWRPE---MINDFVVRGKGHFALNTLIDQKSTNDVSDY 438
Q N + ++ N L+W+ E ++ D + K L + D SDY
Sbjct: 394 AQHNARNFVKSKIANKN--LKWEMSQEPIPVMTDMKILTKSPMELYXFL-----KDRSDY 446
Query: 439 LWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPV 498
W++T+ +L + D + L+I++ G + A+VNGN++ S N +F +PV
Sbjct: 447 AWFVTSIELSNYDLPMKKDIIPVLQISNLGHAMLAFVNGNFIGSAHGSNVEKNFVFRKPV 506
Query: 499 KLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKV 558
K +G+N++ + +D GI +L G T D++++ W +V
Sbjct: 507 KF-QGRNKLHCPAV----------YDSGTTGIHSVQIL----GLNTGTLDITNNGWGQQV 551
Query: 559 GLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGF 618
G+ G K + + + + P MTWYKT F+ P NDPV+L + M KG
Sbjct: 552 GVNGEHVKAYTQGGSHRVQWTAAKGKGPA---MTWYKTYFDMPEGNDPVILRMTSMAKG- 607
Query: 619 AWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDG 678
NG + YHVPR+W+K
Sbjct: 608 ---NG------------------------------------------LEYHVPRAWLKPS 622
Query: 679 VNTLVLFEEFGGNPSQINFQTVVVGTACG-------------QAHENKTM---------- 715
N LV+FEE GGNP +I + V T C Q H++K
Sbjct: 623 DNLLVIFEETGGNPEEIEXELVNRDTICSIVTEYHPPHVKSWQRHDSKIRAVVDEVKPKG 682
Query: 716 ELTCHGRR-ISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEA 774
L C + I ++ +ASFG+P GACG F+ G+C A + ++E+ C GK +C I
Sbjct: 683 HLKCPNYKVIVKVDFASFGNPLGACGDFEMGNCTAP-NSKKVVEQHCXGKTTCEIPMEAG 741
Query: 775 NLGATSCAAGTV-KRLVVEALC 795
S A + K L V+ C
Sbjct: 742 IFXGNSGACSDITKTLAVQVRC 763
>gi|222424922|dbj|BAH20412.1| AT3G13750 [Arabidopsis thaliana]
Length = 625
Score = 443 bits (1140), Expect = e-121, Method: Compositional matrix adjust.
Identities = 254/642 (39%), Positives = 345/642 (53%), Gaps = 83/642 (12%)
Query: 219 IMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAF 267
++C++ DAP P+ F+PN PK+WTE WTGWF +GG P R AED+AF
Sbjct: 1 VLCKQDDAPDPIINACNGFYCDYFSPNKAYKPKMWTEAWTGWFTKFGGPVPYRPAEDMAF 60
Query: 268 AVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLREL 327
+VARF Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAP+DEYG QPKWGHL++L
Sbjct: 61 SVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLERQPKWGHLKDL 120
Query: 328 HKLLKSMEKTLTYGNVTNTDYGN-----------------------------SVSGSSYN 358
H+ +K E L G T GN S + YN
Sbjct: 121 HRAIKLCEPALVSGEPTRMPLGNYQEAHVYKSKSGACSAFLANYNPKSYAKVSFGNNHYN 180
Query: 359 LPAWSVSILPDCKTEEFNTAKVNTQTN--VKVKRPNQAGNDQAPLQWKWRPEMINDFVVR 416
LP WS+SILPDCK +NTA+V QT+ V+ P G L W+ E + ++
Sbjct: 181 LPPWSISILPDCKNTVYNTARVGAQTSRMKMVRVPVHGG-----LSWQAYNEDPSTYIDE 235
Query: 417 GKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYV 475
F + L++Q +T D SDYLWYMT+ + ++ L TL + S+G +H ++
Sbjct: 236 S---FTMVGLVEQINTTRDTSDYLWYMTDVKVDANEGFLRNGDLPTLTVLSAGHAMHVFI 292
Query: 476 NGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVL 535
NG S + + F + V L G N+I++LS VGL N G F+ G+ GPV
Sbjct: 293 NGQLSGSAYGSLDSPKLTFRKGVNLRAGFNKIAILSIAVGLPNVGPHFETWNAGVLGPVS 352
Query: 536 LVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN-VPLNRRMTWY 594
L G G +DLS KWTYKVGL G + +++ E W+ V + +TWY
Sbjct: 353 LNGLNGGR---RDLSWQKWTYKVGLKGESLSLHSLSGSSSVE--WAEGAFVAQKQPLTWY 407
Query: 595 KTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSD 654
KTTF AP + P+ +++ MGKG W+NG +LGR+WP Y A S C Y G + D
Sbjct: 408 KTTFSAPAGDSPLAVDMGSMGKGQIWINGQSLGRHWPAYKAVG---SCSECSYTGTFRED 464
Query: 655 KCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHE--- 711
KC NCG SQ WYHVPRSW+K N LV+FEE+GG+P+ I V + C +E
Sbjct: 465 KCLRNCGEASQRWYHVPRSWLKPSGNLLVVFEEWGGDPNGITLVRREVDSVCADIYEWQS 524
Query: 712 -------------NKTMELTCH-----GRRISEIKYASFGDPQGACGAFKKGSCEAEIDV 753
NK + H G++I+ +K+ASFG P+G CG++++GSC A
Sbjct: 525 TLVNYQLHASGKVNKPLHPKAHLQCGPGQKITTVKFASFGTPEGTCGSYRQGSCHAH-HS 583
Query: 754 LPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
K CVG+ CS+ + G C +K+L VEA+C
Sbjct: 584 YDAFNKLCVGQNWCSVTVAPEMFGGDPC-PNVMKKLAVEAVC 624
>gi|4467146|emb|CAB37515.1| galactosidase like protein [Arabidopsis thaliana]
gi|7270842|emb|CAB80523.1| galactosidase like protein [Arabidopsis thaliana]
Length = 1036
Score = 441 bits (1133), Expect = e-120, Method: Compositional matrix adjust.
Identities = 273/782 (34%), Positives = 398/782 (50%), Gaps = 122/782 (15%)
Query: 90 QYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKV 149
QYDF G DL++FIK I ++GLYV LR+GP++ AEWN+GG P WL +P + RT N+
Sbjct: 80 QYDFKGRFDLVKFIKLIHEKGLYVTLRLGPFIQAEWNHGGLPYWLREVPDVY-FRTNNEP 138
Query: 150 FMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMA 209
F + + I+ M K+EKLFASQGGPIIL QIENEY V Y + G+ YI W A +
Sbjct: 139 FKEHTERYVRKILGMMKEEKLFASQGGPIILGQIENEYNAVQLAYKENGEKYIKWAANLV 198
Query: 210 TSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGGK 256
S+++G+PW+MC+++DAP + PN + P +WTENWT F+ +G
Sbjct: 199 ESMNLGIPWVMCKQNDAPGNLINACNGRHCGDTFPGPNRHDKPSLWTENWTTQFRVFGDP 258
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHL 316
+RT ED+AF+VAR+F G+ NYYMYHGGTNFGRTS ++TT Y DAP+DE+G
Sbjct: 259 PTQRTVEDIAFSVARYFSKNGSHVNYYMYHGGTNFGRTSAH-FVTTRYYDDAPLDEFGLE 317
Query: 317 NQPKWGHLRELHKLLKSMEKTLTYGNV----------------------------TNTDY 348
PK+GHL+ +H+ L+ +K L +G + NT
Sbjct: 318 KAPKYGHLKHVHRALRLCKKALFWGQLRAQTLGPDTEVRYYEQPGTKVCAAFLSNNNTRD 377
Query: 349 GNSV--SGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK--VKRPNQAGNDQAPLQWK 404
N++ G Y LP+ S+SILPDCKT +NTA++ Q + + VK + + + +
Sbjct: 378 TNTIKFKGQDYVLPSRSISILPDCKTVVYNTAQIVAQHSWRDFVKSEKTSKGLKFEMFSE 437
Query: 405 WRPEMIN-DFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLR 463
P +++ D ++ G+ ++ T D +DY WY T+ + +DD LR
Sbjct: 438 NIPSLLDGDSLIPGELYYL---------TKDKTDYAWYTTSVKIDEDDFPDQKGLKTILR 488
Query: 464 INSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF 523
+ S G L YVNG Y ++ + F +PV G N+IS+L GL + GS
Sbjct: 489 VASLGHALIVYVNGEYAGKAHGRHEMKSFEFAKPVNFKTGDNRISILGVLTGLPDSGSYM 548
Query: 524 DMVPNGIPGPVLLVGRAGDETIIKDLS-SHKWTYKVGLYGLDDKKFYNAKAANSERGWSS 582
+ G P + ++G ++ +DL+ +++W + GL G + K+ Y + + + W
Sbjct: 549 EHRFAG-PRAISIIGL---KSGTRDLTENNEWGHLAGLEG-EKKEVYTEEGSKKVK-WEK 602
Query: 583 --KNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGC 640
K PL TWYKT FE P + V + ++ MGKG WVNG +GRYW ++L+
Sbjct: 603 DGKRKPL----TWYKTYFETPEGVNAVAIRMKAMGKGLIWVNGIGVGRYWMSFLSP---- 654
Query: 641 STESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIK--DGVNTLVLFEEFGG-NPSQINF 697
G P+Q YH+PRS++K N LV+ EE G I+F
Sbjct: 655 -------------------LGEPTQTEYHIPRSFMKGEKKKNMLVILEEEPGVKLESIDF 695
Query: 698 QTVVVGTACGQAHEN-------------------KTMELTCHGR-----RISEIKYASFG 733
V T C E+ K M L R ++ E+++ASFG
Sbjct: 696 VLVNRDTICSNVGEDYPVSVKSWKREGPKIVSRSKDMRLKAVMRCPPEKQMVEVQFASFG 755
Query: 734 DPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEA 793
DP G CG F G C A ++EK+C+G+ CSI + G C VK L V+
Sbjct: 756 DPTGTCGNFTMGKCSAS-KSKEVVEKECLGRNYCSIVVARETFGDKGCPE-IVKTLAVQV 813
Query: 794 LC 795
C
Sbjct: 814 KC 815
>gi|281205901|gb|EFA80090.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
PN500]
Length = 727
Score = 440 bits (1132), Expect = e-120, Method: Compositional matrix adjust.
Identities = 270/713 (37%), Positives = 387/713 (54%), Gaps = 74/713 (10%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D R++ I+GERK+LLS SIHYPR+TP MW +++ K G+D IETY FWN HEP
Sbjct: 43 VSYDHRSLIINGERKLLLSASIHYPRATPSMWRPVLEATKAAGIDLIETYTFWNLHEPTP 102
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Y+F GN ++ F+ + GLYV +R GPYVCAEWNYGGFP WL + GI R N+
Sbjct: 103 GTYNFEGNANVTAFLDICAELGLYVTVRFGPYVCAEWNYGGFPFWLKEIDGI-VFRDYNQ 161
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
FM++M N+ T IV+ + +AS GGPIILAQ+ENEYG + + YG +G Y W A+
Sbjct: 162 PFMDQMSNWMTYIVNYLR--PYYASNGGPIILAQVENEYGWLEAAYGASGTKYALWAAQF 219
Query: 209 ATSLDIGVPWIMCQESDAPSPMFTPNN--------------PNSPKIWTENWTGWFKSWG 254
A SLDIG+PWIMC + D + + T N PN P WTENW GWF++W
Sbjct: 220 ANSLDIGIPWIMCSQDDIATVINTCNGFYCHDWIDVHWTAYPNQPAFWTENWPGWFQNWE 279
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
G P R +D+ ++VAR+ +GG+ NYYM+ GGT FGR +GGP++TTSYDYD IDEYG
Sbjct: 280 GGVPHRPVQDVLYSVARWIAYGGSMMNYYMWFGGTTFGRWTGGPFITTSYDYDGAIDEYG 339
Query: 315 HLNQPKWGHLRELHKLLKSME---------KTLTYG-NVTNTDYGNSVSGSSYNLPAWSV 364
+ +PK+ E H ++ + E K + G NV + + + +G S++ A
Sbjct: 340 YPYEPKYSQSLEFHTIIHAYEHIILSMNPPKPILLGENVEISHFYSVETGESFSFLANFG 399
Query: 365 SILPDCKTEEFNTAKVNTQT-NVKVKRPNQAGNDQ------APLQWKWRPEMINDFVVRG 417
+ +T ++N Q +V++ N + D +P+ ++ P + + +
Sbjct: 400 AT--GVQTVQWNGITFKVQPWSVQLLYNNVSIFDTSATPIGSPVPKQFTPIKSFENIGQW 457
Query: 418 KGHFALN------TLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQV 470
F L T ++Q S T D +DYLWY+T ++ L + + +
Sbjct: 458 SESFDLTFTNYSETPMEQLSLTRDQTDYLWYVTKIEVN--------RVGAQLSLPNISDM 509
Query: 471 LHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGI 530
+H +V+ Y+ T G +N + + G + + +L VGL NY + GI
Sbjct: 510 VHVFVDNQYIA---TGRGPTNITLNSTIGV--GGHTLQVLHTKVGLVNYAEHMEATVAGI 564
Query: 531 PGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRR 590
PV L D D+SS+ W+ K + G + + YN + S + W+ NV N
Sbjct: 565 FEPVTL-----DSV---DISSNGWSMKPFVQG-ETLQLYNPNHSGSVQ-WT--NVTGNPP 612
Query: 591 MTWYKTTFEAPL-ENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRG 649
+TWYK F L N + L++ GM KG +VNGYN+GRYW LA GC+ C Y+G
Sbjct: 613 LTWYKFNFNLELSSNMSLALDMLGMTKGMIFVNGYNIGRYW---LALAYGCN--PCTYQG 667
Query: 650 PYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVV 702
Y C CG PSQ +YHVP W+ +G N +V+FEE GNP I V+
Sbjct: 668 GYSPSMCQLGCGEPSQQYYHVPTDWLMNGENEIVIFEEVYGNPEAITLVQRVI 720
>gi|2924512|emb|CAA17766.1| beta-galactosidase-like protein [Arabidopsis thaliana]
gi|7270452|emb|CAB80218.1| beta-galactosidase-like protein [Arabidopsis thaliana]
Length = 831
Score = 439 bits (1130), Expect = e-120, Method: Compositional matrix adjust.
Identities = 284/850 (33%), Positives = 413/850 (48%), Gaps = 161/850 (18%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DG ++ IDG+R++L SGSIHYPRSTP MWP +IK+AK+GGL+ I+TYVFWN HEP +
Sbjct: 54 VTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHEPQQ 113
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLH-NMPGIEELRTTN 147
+++F+G DL++FIK IQ G+YV LR+GP++ AEW +G + H N+ G
Sbjct: 114 GKFNFSGRADLVKFIKLIQKNGMYVTLRLGPFIQAEWTHGYITRYDHKNIAGA------- 166
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
+IENEY V Y G +YI W +
Sbjct: 167 --------------------------------YRKIENEYSAVQRAYKQDGLNYIKWASN 194
Query: 208 MATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWG 254
+ S+ +G+PW+MC+++DAP PM PN N P +WTENWT F+ +G
Sbjct: 195 LVDSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNRENKPSLWTENWTTQFRVFG 254
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
+R+ ED+A++VARFF GT NYYMYHGGTNFGRTS Y+TT Y DAP+DEYG
Sbjct: 255 DPPTQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYDDAPLDEYG 313
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNV----------------------------TNT 346
+PK+GHL+ LH L +K L +G NT
Sbjct: 314 LEKEPKYGHLKHLHNALNLCKKPLLWGQPKTEKPGKDTEIRYYEQPGTKTCAAFLANNNT 373
Query: 347 DYGNSVS--GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWK 404
+ ++ G Y + S+SILPDCKT +NTA++ +Q + ++ N + +K
Sbjct: 374 EAAETIKFKGREYVIAPRSISILPDCKTVVYNTAQIVSQHTSRNFMKSKKANKK--FDFK 431
Query: 405 WRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADL-KDDDPILSGSSNMTLR 463
E + + G + + T D +DY WY T+ + K+ P G +R
Sbjct: 432 VFTETLPS-KLEGNSYIPVELY---GLTKDKTDYGWYTTSFKVHKNHLPTKKGVKTF-VR 486
Query: 464 INSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF 523
I S G LHA++NG Y+ S + + +F++ V L G+N + +L G + GS
Sbjct: 487 IASLGHALHAWLNGEYLGSGHGSHEEKSFVFQKQVTLKAGENHLVMLGVLTGFPDSGSYM 546
Query: 524 DMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSK 583
+ G G +L +G + + S KW K+G+ G +K + + + W K
Sbjct: 547 EHRYTGPRGISILGLTSGTLDLTE---SSKWGNKIGMEG--EKLGIHTEEGLKKVEWK-K 600
Query: 584 NVPLNRRMTWYK----------TTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTY 633
+TWY+ T F+AP + + GMGKG WVNG +GRYW ++
Sbjct: 601 FTGKAPGLTWYQKFSKECETLQTYFDAPESVSAATIRMHGMGKGLIWVNGEGVGRYWQSF 660
Query: 634 LAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGG-NP 692
L+ G P+QI YH+PRS++K N LV+FEE P
Sbjct: 661 LSP-----------------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKP 697
Query: 693 SQINFQTVVVGTACGQAHENK-----------------------TMELTCHG-RRISEIK 728
++F V T C EN T L C G ++I+ ++
Sbjct: 698 ELMDFAIVNRDTVCSYVGENYTPSVRHWTRKKDQVQAITDNVSLTATLKCSGTKKIAAVE 757
Query: 729 YASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANL---GATSCAAGT 785
+ASFG+P G CG F G+C A + +IEK C+GK C I +++ SC
Sbjct: 758 FASFGNPIGVCGNFTLGTCNAPVSK-QVIEKHCLGKAECVIPVNKSTFQQDKKDSC-KNV 815
Query: 786 VKRLVVEALC 795
VK L V+ C
Sbjct: 816 VKMLAVQVKC 825
>gi|326500386|dbj|BAK06282.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 846
Score = 439 bits (1129), Expect = e-120, Method: Compositional matrix adjust.
Identities = 273/777 (35%), Positives = 380/777 (48%), Gaps = 112/777 (14%)
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
RQ F G DLI+F+K IQ +Y ++RIGP++ AEWN+GG P WL +P I R N+
Sbjct: 104 RQVQFEGRNDLIKFLKLIQSHDMYALVRIGPFIQAEWNHGGLPYWLREIPHII-FRANNE 162
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
+ EM+ F IV K ++FASQGGP+ILAQIENEYGN+ D+ G Y+ W A+M
Sbjct: 163 PYKKEMEKFVRFIVQKLKDAEMFASQGGPVILAQIENEYGNIKKDHIVEGDKYLEWAAQM 222
Query: 209 ATSLDIGVPWIMCQESDAPSPMF------------TPNNPNSPKIWTENWTGWFKSWGGK 256
A S + GVPWIMC++S AP + T + N P++WTENWT F+++G +
Sbjct: 223 AISTNTGVPWIMCKQSTAPGEVIPTCNGRHCGDTWTLKDKNKPRLWTENWTAQFRAFGDQ 282
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHL 316
R+AED+A++V RFF GGT NYYMY+GGTNFGRT G Y+ T Y + P+DEYG
Sbjct: 283 LALRSAEDIAYSVLRFFAKGGTLVNYYMYYGGTNFGRT-GASYVLTGYYDEGPVDEYGMP 341
Query: 317 NQPKWGHLRELHKLLKSMEKTLTYG------------------------------NVTNT 346
PK+GHLR+LH L+KS + G N T
Sbjct: 342 KAPKYGHLRDLHNLIKSYSRAFLEGKQSFELLAHGYEAHNFEIPEEKLCLAFISNNNTGE 401
Query: 347 DYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
D + G Y +P+ SVSIL DCK +NT +V Q + + Q W+
Sbjct: 402 DGTVNFRGDKYYIPSRSVSILADCKHVVYNTKRVFVQHSERSFHTAQKLAKSN--AWEMY 459
Query: 407 PEMINDF---VVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLR 463
E I + +R K L T D SDYLWY T+ L+ DD G ++
Sbjct: 460 SEPIPRYKLTSIRNKEPMEQYNL-----TKDDSDYLWYTTSFRLEADDLPFRGDIRPVVQ 514
Query: 464 INSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF 523
+ S+ L +VN + + +FE P+ L G N ++LLS+++G+++ G +
Sbjct: 515 VKSTSHALMGFVNDAFAGNGRGSKKEKGFMFETPINLRIGINHLALLSSSMGMKDSGGEL 574
Query: 524 DMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSK 583
V GI + G T DL + W +KV L G + K+ Y K + + W
Sbjct: 575 VEVKGGIQDCTI----QGLNTGTLDLQVNGWGHKVKLEG-EVKEIYTEKGMGAVK-WVPA 628
Query: 584 NVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTE 643
R +TWYK F+ P DPVVL++ MGKG +VNG +GRYWP+Y
Sbjct: 629 TT--GRAVTWYKRYFDEPDGEDPVVLDMTSMGKGMIFVNGEGMGRYWPSYRTV------- 679
Query: 644 SCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVG 703
G PSQ YH+PR ++K N LV+FEE G P I QTV
Sbjct: 680 ----------------GGVPSQAMYHIPRPFLKPKNNLLVIFEEELGKPEGILIQTVRRD 723
Query: 704 TAC------------------------GQAHENKTMELTCHGRR-ISEIKYASFGDPQGA 738
C + H + + L C ++ I E+ +ASFG+P+G+
Sbjct: 724 DICVFISEHNPAQIKTWDKDGGQIKVIAEDHSTRGI-LKCPPKKTIQEVVFASFGNPEGS 782
Query: 739 CGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
C F GSC + ++ K+C+GKKSC + GA T L V+ C
Sbjct: 783 CANFTAGSCHTP-NAKDIVAKECLGKKSCVLPVLHTVYGADINCPTTTATLAVQVRC 838
>gi|330804272|ref|XP_003290121.1| hypothetical protein DICPUDRAFT_48969 [Dictyostelium purpureum]
gi|325079786|gb|EGC33370.1| hypothetical protein DICPUDRAFT_48969 [Dictyostelium purpureum]
Length = 735
Score = 427 bits (1097), Expect = e-116, Method: Compositional matrix adjust.
Identities = 257/731 (35%), Positives = 392/731 (53%), Gaps = 94/731 (12%)
Query: 28 RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP- 86
+++D R++ I+GERK+L+SGS+HYPR++ W +++K +K G+D IETY+FWN H+P
Sbjct: 41 NITYDHRSLIINGERKLLVSGSVHYPRASVSKWNEILKSSKLAGVDIIETYIFWNVHQPN 100
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
++ N ++ F+ ++ L+V LRIGPYVCAEWNYGGFP+WL N+ GI R
Sbjct: 101 TPNEFYLEDNANITLFLDLCKENELFVNLRIGPYVCAEWNYGGFPIWLKNIEGIV-FRDY 159
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
N+ FM+ M + T++VD K + FA GGPII+AQIENEYG + ++YG +G+ Y W
Sbjct: 160 NQPFMDAMSTWVTMVVD--KLQDYFAPNGGPIIIAQIENEYGWLENEYGASGREYALWAI 217
Query: 207 KMATSLDIGVPWIMCQESDAPSPMFTPNN--------------PNSPKIWTENWTGWFKS 252
A SL+IG+PWIMC + D S + T N P+ P WTENW GWF++
Sbjct: 218 NFAKSLNIGIPWIMCAQEDIDSAINTCNGFYCHDWIDRHWNAFPDQPAFWTENWVGWFEN 277
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDE 312
WG PKR +D+ F+ ARF +GG+ NYYM+ GGTNFGR+ GGP++ TSY+YDAP+DE
Sbjct: 278 WGQAVPKRPVQDMLFSSARFIAYGGSLFNYYMWFGGTNFGRSVGGPWIITSYEYDAPLDE 337
Query: 313 YGHLNQPKWGHLRELHKLLKSMEKTLT------------------YGN--VTNTDYGNSV 352
+G N+PK+ + H ++ E + YG V T++G +
Sbjct: 338 FGFPNEPKYSMSTQFHFVIHKYESIIMGMDPPTPVPLSNISEAHPYGEDLVFLTNFGLVI 397
Query: 353 -----SGSSYNLPAWSV------SILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPL 401
G++Y L WSV S++ D K +T+ K PN D
Sbjct: 398 DYIQWQGTNYTLQPWSVVIVYSGSVVFDTSYVPDEYIKPSTRDQFK-DVPNAINYDSILS 456
Query: 402 QWKW-RPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNM 460
+W + ++IND ++ + L TND +DYLWY TN L +
Sbjct: 457 FSEWGQSDIINDCIINNESPLEQINL-----TNDTTDYLWYTTNITLNE---------TT 502
Query: 461 TLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRG--KNQISLLSATVGLQN 518
TL I + H ++NG Y + W+ ++ T G Q+ +L+ T+GL+N
Sbjct: 503 TLTIENMYDFCHVFLNGAYQGNGWSPVAYIT------LEPTNGNINYQLQILTMTMGLEN 556
Query: 519 YGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSER 578
Y + + G+ G + L G+ ++++++W+ K G+ G + + YN + ++S+
Sbjct: 557 YAAHMESYSRGLLGSISL-GQT-------NITNNQWSMKPGILG-EKLQIYN-EYSSSKV 606
Query: 579 GWSSKNVPLNRRMTWYKTTFEAP-LENDP----VVLNLQGMGKGFAWVNGYNLGRYWPTY 633
W N + MTWY+ L +DP VLN+ M KGF +VNG+N+GRY+
Sbjct: 607 NWQPYNPSATQSMTWYQFNISLDGLSSDPSSNAYVLNMTSMNKGFVYVNGFNIGRYF-LM 665
Query: 634 LAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWI----KDGVNTLVLFEEFG 689
A + C+ + DY G Y +C PSQ YH+P W+ T++LFEE
Sbjct: 666 EATQSNCTLKQ-DYIGIYTPSNNRIDCNEPSQSLYHIPLDWLFLQQDKQYATVILFEEVN 724
Query: 690 GNPSQINFQTV 700
G+P++I ++
Sbjct: 725 GDPTKIQLLSL 735
>gi|110741385|dbj|BAF02242.1| putative galactosidase [Arabidopsis thaliana]
Length = 592
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 244/609 (40%), Positives = 329/609 (54%), Gaps = 72/609 (11%)
Query: 241 IWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL 300
+WTE WTGWF +GG P R AED+AF+VARF Q GG+F NYYMYHGGTNFGRT+GGP++
Sbjct: 1 MWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFI 60
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN---------- 350
TSYDYDAP+DEYG QPKWGHL++LH+ +K E L G T GN
Sbjct: 61 ATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTRMPLGNYQEAHVYKSK 120
Query: 351 -------------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTN--VKVK 389
S + YNLP WS+SILPDCK +NTA+V QT+ V+
Sbjct: 121 SGACSAFLANYNPKSYAKVSFGNNHYNLPPWSISILPDCKNTVYNTARVGAQTSRMKMVR 180
Query: 390 RPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLK 448
P G L W+ E + ++ F + L++Q +T D SDYLWYMT+ +
Sbjct: 181 VPVHGG-----LSWQAYNEDPSTYIDES---FTMVGLVEQINTTRDTSDYLWYMTDVKVD 232
Query: 449 DDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQIS 508
++ L TL + S+G +H ++NG S + + F + V L G N+I+
Sbjct: 233 ANEGFLRNGDLPTLTVLSAGHAMHVFINGQLSGSAYGSLDSPKLTFRKGVNLRAGFNKIA 292
Query: 509 LLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKF 568
+LS VGL N G F+ G+ GPV L G G +DLS KWTYKVGL G
Sbjct: 293 ILSIAVGLPNVGPHFETWNAGVLGPVSLNGLNGGR---RDLSWQKWTYKVGLKGESLSLH 349
Query: 569 YNAKAANSERGWSSKN-VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLG 627
+ +++ E W+ V + +TWYKTTF AP + P+ +++ MGKG W+NG +LG
Sbjct: 350 SLSGSSSVE--WAEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQSLG 407
Query: 628 RYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEE 687
R+WP Y A S C Y G + DKC NCG SQ WYHVPRSW+K N LV+FEE
Sbjct: 408 RHWPAYKAVG---SCSECSYTGTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVFEE 464
Query: 688 FGGNPSQINFQTVVVGTACGQAHE----------------NKTMELTCH-----GRRISE 726
+GG+P+ I V + C +E NK + H G++I+
Sbjct: 465 WGGDPNGITLVRREVDSVCADIYEWQSTLVNYQLHASGKVNKPLHPKAHLQCGPGQKITT 524
Query: 727 IKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTV 786
+K+ASFG P+G CG++++GSC A K CVG+ CS+ + G C +
Sbjct: 525 VKFASFGTPEGTCGSYRQGSCHAH-HSYDAFNKLCVGQNWCSVTVAPEMFGGDPC-PNVM 582
Query: 787 KRLVVEALC 795
K+L VEA+C
Sbjct: 583 KKLAVEAVC 591
>gi|66808929|ref|XP_638187.1| glycoside hydrolase family 35 protein [Dictyostelium discoideum
AX4]
gi|74853739|sp|Q54MV6.1|BGAL2_DICDI RecName: Full=Probable beta-galactosidase 2; Short=Lactase 2;
Flags: Precursor
gi|60466604|gb|EAL64656.1| glycoside hydrolase family 35 protein [Dictyostelium discoideum
AX4]
Length = 761
Score = 421 bits (1081), Expect = e-115, Method: Compositional matrix adjust.
Identities = 267/737 (36%), Positives = 394/737 (53%), Gaps = 94/737 (12%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DGR++ I+GERK+L SGSIHYPR++ MWP ++K++K+ G+D I+TY+FWN H+P
Sbjct: 40 VTYDGRSLIINGERKLLFSGSIHYPRTSEEMWPIILKQSKDAGIDIIDTYIFWNIHQPNS 99
Query: 89 -RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
+Y F GN ++ +F+ ++ LYV LRIGPYVCAEW YGGFP+WL +P I R N
Sbjct: 100 PSEYYFDGNANITKFLDLCKEFDLYVNLRIGPYVCAEWTYGGFPIWLKEIPNIV-YRDYN 158
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
+ +MNEM + +V + FA GGPIILAQ+ENEYG + +YG G Y W
Sbjct: 159 QQWMNEMSIWMEFVVKYL--DNYFAPNGGPIILAQVENEYGWLEQEYGINGTEYAKWSID 216
Query: 208 MATSLDIGVPWIMCQESDAPSPMFTPNN--------------PNSPKIWTENWTGWFKSW 253
A SL+IG+PWIMCQ++D S + T N PN P WTENW GWF++W
Sbjct: 217 FAKSLNIGIPWIMCQQNDIESAINTCNGYYCHDWISSHWEQFPNQPSFWTENWIGWFENW 276
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEY 313
G PKR +D+ ++ ARF +GG+ NYYM+ GGTNFGRTSGGP++ TSYDYDAP+DE+
Sbjct: 277 GQAKPKRPVQDILYSNARFIAYGGSLINYYMWFGGTNFGRTSGGPWIITSYDYDAPLDEF 336
Query: 314 GHLNQPKWGHLRELHKLLKSMEKTL-------------------TYG-NVTN-TDYGNSV 352
G N+PK+ + H++L ++E L YG N++ T+YG S
Sbjct: 337 GQPNEPKFSLSSKFHQVLHAIESDLLNNQPPKSPTFLSQFIEVHQYGINLSFITNYGTST 396
Query: 353 S-------GSSYNLPAWSVSILPDCKTEEFNTAKV--NTQTN---VKVKRPNQAGNDQAP 400
+ +Y + WSV I+ + + F+T+ + NT N + +P Q+
Sbjct: 397 TPKIIQWMNQTYTIQPWSVLIIYNNEI-LFDTSFIPPNTLFNNNTINNFKPINQNIIQSI 455
Query: 401 LQWKWRPEMINDFVVRGKGHF------ALNTL--IDQ-KSTNDVSDYLWYMTNADLKDDD 451
Q I+DF + G ++N++ I+Q T D SDY WY TN
Sbjct: 456 FQ-------ISDFNLNSGGGGGDGDGNSVNSVSPIEQLLITKDTSDYCWYSTNVTTTSLS 508
Query: 452 PILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLS 511
+ N+ L I +H +++ Y S ++ L P+ Q+ +LS
Sbjct: 509 --YNEKGNIFLTITEFYDYVHIFIDNEYQGSAFSPSLCQLQL--NPIN-NSTTFQLQILS 563
Query: 512 ATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNA 571
T+GL+NY S + GI G +L+ + +L++++W K GL G + K F N
Sbjct: 564 MTIGLENYASHMENYTRGILGSILIGSQ--------NLTNNQWLMKSGLIGENIKIFNND 615
Query: 572 KAANSERGWSSKNVPLNRR-MTWYKTTFE---APLENDPVV--LNLQGMGKGFAWVNGYN 625
N + SS + L ++ +TWYK P++ V L++ M KG WVNGY+
Sbjct: 616 NTINWQTSPSSSSSSLIQKPLTWYKLNISLVGLPIDISSTVYALDMSSMNKGMIWVNGYS 675
Query: 626 LGRYWPTYLAEE--DGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWI-----KDG 678
+GRYW + + + E+ Y G Y +C PSQ Y VP W+ +
Sbjct: 676 IGRYWLIEATQSICNQSAIENYSYIGEYDPSNYRIDCNKPSQSIYSVPIDWLFNNNYNNQ 735
Query: 679 VNTLVLFEEFGGNPSQI 695
T+++ EE GNP++I
Sbjct: 736 YATIIIIEELNGNPNEI 752
>gi|449519864|ref|XP_004166954.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 3-like, partial
[Cucumis sativus]
Length = 635
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 242/645 (37%), Positives = 340/645 (52%), Gaps = 82/645 (12%)
Query: 216 VPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAED 264
VPW+MC++ DAP PM F+PN P P WTE WT WF ++GG + KR ED
Sbjct: 3 VPWVMCKQDDAPDPMINTCNGFYCDYFSPNKPYKPNFWTEAWTAWFNNFGGPNHKRPVED 62
Query: 265 LAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHL 324
LAF VARF Q GG+ NYYMYHGGTNFGRT+GGP++TTSYDYDAPIDEYG + QPK+GHL
Sbjct: 63 LAFGVARFIQKGGSLVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIRQPKFGHL 122
Query: 325 RELHKLLKSMEKTLTYGN-------------------------VTNTDYGNSV----SGS 355
+ LH +K EK L G ++N N+ +G
Sbjct: 123 KRLHDAVKLCEKALLTGEPHDYTLATYQKAKVFSSSSGDCAAFLSNYHSNNTARVTFNGR 182
Query: 356 SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVV 415
Y LP WS+SILPDCK+ +NTA+V QTN P + + W+ E I+ +
Sbjct: 183 HYTLPPWSISILPDCKSVIYNTAQVQVQTNQLSFLPTKVES----FSWETYNENISS--I 236
Query: 416 RGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAY 474
+ + L++Q + T D SDYLWY T+ ++ ++ L G TL S G +H +
Sbjct: 237 EEDSSMSYDGLLEQLTITKDNSDYLWYTTSVNVDPNESYLRGGKFPTLTATSKGHGMHVF 296
Query: 475 VNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPV 534
+NG S + + S F + L G N++SLLS GL N G ++ G+ GPV
Sbjct: 297 INGKLAGSSFGTHDNSKFTFTGRINLQAGVNKVSLLSIAGGLPNNGPHYEEREMGVLGPV 356
Query: 535 LLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLN--RRMT 592
+ G + DLS KW+YKVGL G + + ++ W+ ++ + +T
Sbjct: 357 AIHGLDXGKM---DLSRQKWSYKVGLKG--ENMNLGSPSSVQAVDWAKDSLKQENAQPLT 411
Query: 593 WYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYG 652
WYK F+AP ++P+ L++ M KG W+NG N+GRYW + C+ C Y G Y
Sbjct: 412 WYKAYFDAPEGDEPLALDMGSMQKGQVWINGQNVGRYWT--ITANGNCT--DCSYSGTYR 467
Query: 653 SDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTAC------ 706
KC + CG P+Q WYHVPRSW+ N +V+FEE GGNPS+I+ V + C
Sbjct: 468 PRKCQFGCGQPTQQWYHVPRSWLMPTKNLIVVFEEVGGNPSRISLVKRSVTSICTEASQY 527
Query: 707 -------------GQAHENKTMELTCH---GRRISEIKYASFGDPQGACGAFKKGSCEAE 750
G+ +E +++ H G+ IS IK+ASFG P GACG+ K+G+C +
Sbjct: 528 RPVIKNVHMHQNNGELNEQNVLKINLHCAAGQFISAIKFASFGTPSGACGSHKQGTCHSP 587
Query: 751 IDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+++K CVG++ C + G C K+L E +C
Sbjct: 588 KSDY-VLQKLCVGRQRCLATIPTSIFGEDPC-PNLRKKLSAEVVC 630
>gi|110737487|dbj|BAF00686.1| beta-galactosidase [Arabidopsis thaliana]
Length = 532
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 235/537 (43%), Positives = 307/537 (57%), Gaps = 58/537 (10%)
Query: 208 MATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGK 256
MA S +IGVPW+MCQ+ DAP + FTPN P+ PKIWTENW GWFK++GG+
Sbjct: 1 MAVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQFTPNTPDKPKIWTENWPGWFKTFGGR 60
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHL 316
DP R AED+A++VARFF GG+ NYYMYHGGTNFGRTSGGP++TTSYDY+APIDEYG
Sbjct: 61 DPHRPAEDVAYSVARFFGKGGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGLP 120
Query: 317 NQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSG---------------------- 354
PKWGHL++LHK + E L G N G+S+
Sbjct: 121 RLPKWGHLKDLHKAIMLSENLLISGEHQNFTLGHSLEADVYTDSSGTCAAFLSNLDDKND 180
Query: 355 -------SSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRP 407
+SY+LPAWSVSILPDCKTE FNTAKV ++++ KV+ + + L+W+
Sbjct: 181 KAVMFRNTSYHLPAWSVSILPDCKTEVFNTAKVTSKSS-KVEMLPEDLKSSSGLKWEVFS 239
Query: 408 EMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINS 466
E + G F N L+D +T D +DYLWY T+ + +++ L S+ L I S
Sbjct: 240 EKPG---IWGAADFVKNELVDHINTTKDTTDYLWYTTSITVSENEAFLKKGSSPVLFIES 296
Query: 467 SGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMV 526
G LH ++N Y+ + ++PV L G+N I LLS TVGL N GS ++ V
Sbjct: 297 KGHTLHVFINKEYLGTATGNGTHVPFKLKKPVALKAGENNIDLLSMTVGLANAGSFYEWV 356
Query: 527 PNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANS-ERGWSSKNV 585
G+ V G +L++ KW+YK+G+ G + F K NS W+
Sbjct: 357 GAGLTS----VSIKGFNKGTLNLTNSKWSYKLGVEGEHLELF---KPGNSGAVKWTVTTK 409
Query: 586 PLNRR-MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYL---AEEDGCS 641
P ++ +TWYK E P ++PV L++ MGKG AW+NG +GRYWP + D C
Sbjct: 410 PPKKQPLTWYKVVIEPPSGSEPVGLDMISMGKGMAWLNGEEIGRYWPRIARKNSPNDECV 469
Query: 642 TESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQ 698
E CDYRG + DKC CG PSQ WYHVPRSW K N LV+FEE GGNP +I
Sbjct: 470 KE-CDYRGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGNELVIFEEKGGNPMKIKLS 525
>gi|414888319|tpg|DAA64333.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
gi|414888320|tpg|DAA64334.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
Length = 592
Score = 414 bits (1064), Expect = e-112, Method: Compositional matrix adjust.
Identities = 221/534 (41%), Positives = 299/534 (55%), Gaps = 54/534 (10%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DGR++ IDG+R + SG+IHYPRS P +WP LI++AKEGGL+ IETY+FWNAHEP
Sbjct: 36 VTYDGRSLMIDGKRDLFFSGAIHYPRSPPEVWPKLIERAKEGGLNTIETYIFWNAHEPEP 95
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y+F G DLI+++K IQ+ +Y I+RIGP++ AEWN+GG P WL + I R N
Sbjct: 96 GKYNFEGRFDLIKYLKMIQEHDMYAIVRIGPFIQAEWNHGGLPYWLREIDHII-FRANND 154
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
+ EM+ F IV K +LFASQGGPIIL QIENEYGN+ D+ G Y+ W A+M
Sbjct: 155 PYKKEMEKFVRFIVQKLKDAELFASQGGPIILTQIENEYGNIKKDHATDGDKYLEWAAQM 214
Query: 209 ATSLDIGVPWIMCQESDAPSPMF------------TPNNPNSPKIWTENWTGWFKSWGGK 256
A S GVPWIMC++S AP + T + N P +WTENWT F+++G +
Sbjct: 215 ALSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWTLRDKNKPMLWTENWTQQFRAYGDQ 274
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHL 316
R+AED+A+AV RFF GG+ NYYMYHGGTNFGRT G Y+ T Y +AP+DEYG
Sbjct: 275 VAMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMY 333
Query: 317 NQPKWGHLRELHKLLKSMEKTLTYG------------------------------NVTNT 346
+PK+GHLR+LH +++S +K G N T
Sbjct: 334 KEPKFGHLRDLHNVIRSYQKAFLLGKHSSEILGHGYEAHIFELPEENLCLSFLSNNNTGE 393
Query: 347 DYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
D G + +P+ SVSIL CK +NT +V Q N + ++ + QW+
Sbjct: 394 DGTVIFRGEKHYVPSRSVSILAGCKNVVYNTKRVFVQHNERSYHTSEVTSKNN--QWEMY 451
Query: 407 PEMI---NDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLR 463
E I D VR K L T D SDYLWY T+ L+ DD L+
Sbjct: 452 SEKIPKYRDTKVRMK-----EPLEQFNQTKDASDYLWYTTSFRLESDDLPFRNDIRPVLQ 506
Query: 464 INSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQ 517
+ SS + + N +V +FE+PV L G N + LLS+T+G++
Sbjct: 507 VKSSAHSMMGFANDAFVGCARGSKQVKGFMFEKPVDLKVGVNHVVLLSSTMGMK 560
>gi|359477955|ref|XP_003632046.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 10-like [Vitis
vinifera]
Length = 563
Score = 414 bits (1063), Expect = e-112, Method: Compositional matrix adjust.
Identities = 239/551 (43%), Positives = 311/551 (56%), Gaps = 58/551 (10%)
Query: 59 MWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIG 118
MW L+K AKEGG+D IETYVF N HE Y F G DL++F+K +Q G+Y+IL IG
Sbjct: 1 MWSGLVKTAKEGGIDVIETYVFQNGHELSPSNYYFGGWYDLLKFVKIVQQAGMYLILHIG 60
Query: 119 PYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPI 178
P+V EWN+GG P+WLH +P +T +K F MQ F TLIV++ KK+KLFASQGGPI
Sbjct: 61 PFVATEWNFGGVPIWLHYVPR-TIFQTNSKPFKYHMQKFMTLIVNIMKKDKLFASQGGPI 119
Query: 179 ILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-------- 230
IL Q+ENEYG+ Y D GK Y+ W A M S +IGVPWIMCQ + PM
Sbjct: 120 ILTQVENEYGDTKRIYEDGGKPYVMWAANMVLSHNIGVPWIMCQXYASSDPMINTCNSFY 179
Query: 231 ---FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHG 287
FTPN+P+ ++WTENW WFK++G + R ED+AF+VA FF F + NYYMYHG
Sbjct: 180 CDQFTPNSPSKAQMWTENWPRWFKTFGASNSHRLHEDIAFSVALFF-FPKS-XNYYMYHG 237
Query: 288 GTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTD 347
GTNFG TSGGP++TT+Y+Y+APIDEYG PK GHL+EL + +KS E L YG N
Sbjct: 238 GTNFGCTSGGPFITTTYNYNAPIDEYGLARLPKCGHLKELRRAIKSCEHVLLYGEPINLX 297
Query: 348 ---------YGNSVSG--------------------SSYNLPAWSVSILPDCKTEEFNTA 378
Y +S+ G SY++PAWSVSILPDCK FNTA
Sbjct: 298 LGPSQEVDVYADSLGGYAAFISNVDEKEDKMIVFQNXSYHVPAWSVSILPDCKNVVFNTA 357
Query: 379 KVNTQTN-----VKVKRPN--QAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-K 430
KV +Q + ++ +P+ + D L WK + + G+ F N +D
Sbjct: 358 KVVSQISQVEMVLEDLQPSLVPSNKDLKGLXWK---TFVEKAGIWGEADFVKNGFVDHIN 414
Query: 431 STNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGAS 490
+T D +D LWY + + + + L S L + S G LHA+VN S S
Sbjct: 415 TTKDTTDXLWYTVSITVGESENFLKEISQPILLVESKGHALHAFVNQKLQGSASGNGSHS 474
Query: 491 NDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLS 550
FE P+ L GKN+I +LS TVGLQN ++ V + V G I DLS
Sbjct: 475 PFKFECPISLKAGKNEIVVLSMTVGLQNEIPFYEWVGARLTS----VKIKGLNNGIMDLS 530
Query: 551 SHKWTYKVGLY 561
++ W YK L+
Sbjct: 531 TYPWIYKSLLH 541
>gi|33521216|gb|AAQ21370.1| beta-galactosidase [Sandersonia aurantiaca]
Length = 568
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 243/591 (41%), Positives = 327/591 (55%), Gaps = 77/591 (13%)
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLN 317
P R AED+AFAVARF Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAPIDEYG L
Sbjct: 1 PHRPAEDIAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLR 60
Query: 318 QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG--------------------NSVSGS-- 355
+PKWGHLR+LH+ +K E L G+ T T G N SGS
Sbjct: 61 EPKWGHLRDLHRAIKLCEPALVSGDPTVTSIGHYQQSHVFRSKAGACAAFLSNYDSGSYA 120
Query: 356 -------SYNLPAWSVSILPDCKTEEFNTAKVNTQTN-VKVKRPNQAGNDQAPLQWKWRP 407
Y++P WS+SILPDCKT FNTA++ QT+ +K++ + W+
Sbjct: 121 RVVFNGIHYDIPPWSISILPDCKTTVFNTARIGAQTSQLKMEWAGK-------FSWESYN 173
Query: 408 EMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINS 466
E N F R F L++Q S T D +DYLWY T ++ +++ L L +NS
Sbjct: 174 EDTNSFDDRS---FTKVGLVEQISMTRDNTDYLWYTTYVNIGENEGFLKNGHYPVLTVNS 230
Query: 467 SGQVLHAYVNGNYVDSQWTKYGASND---LFERPVKLTRGKNQISLLSATVGLQNYGSKF 523
+G +H Y+NG T YGA + + VKL G N+IS+LS VGL N G F
Sbjct: 231 AGHSMHIYINGQLTG---TIYGALENPKLTYTGSVKLWAGSNKISILSVAVGLPNIGGHF 287
Query: 524 DMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSK 583
+ G+ GPV L G + +DLS KW Y++GL G + +++ E G S+
Sbjct: 288 ETWNTGVLGPVTLSGLNEGK---RDLSWQKWIYQIGLKGEALNLHTLSGSSSVEWGGPSQ 344
Query: 584 NVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTE 643
+ +TWYKT+F AP NDP+ L++ MGKG W+NG ++GRYWP Y A S
Sbjct: 345 ----KQSLTWYKTSFNAPAGNDPLALDMGSMGKGQVWINGQSVGRYWPAYKASG---SCG 397
Query: 644 SCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVG 703
CDYRG Y KC NCG +Q WYHVPRSW+ N LV+FEE+GG+PS I+ V
Sbjct: 398 GCDYRGTYNEKKCQSNCGESTQRWYHVPRSWLNPTGNLLVVFEEWGGDPSGISMVRRKVE 457
Query: 704 TACGQAHE--------------NKTMELTCH-GRRISEIKYASFGDPQGACGAFKKGSCE 748
+ C + E L+C G++++ IK+ASFG PQG CGAF +G+C
Sbjct: 458 SVCAEIAEWQPNMDNVHTGNYGRSKAHLSCAPGQKMTNIKFASFGTPQGTCGAFSEGTCH 517
Query: 749 AEIDVLPL----IEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
A + + C+G++SC++ + G C GT+K+L VEA+C
Sbjct: 518 AHKSYDAFEKESLLQNCIGQQSCAVLVAPEVFGGDPC-PGTMKKLAVEAIC 567
>gi|110739914|dbj|BAF01862.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 578
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 242/585 (41%), Positives = 316/585 (54%), Gaps = 72/585 (12%)
Query: 265 LAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHL 324
LAF VARF Q GG+F NYYMYHGGTNFGRT+GGP++TTSYDYDAPIDEYG + QPK+GHL
Sbjct: 1 LAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYGLIRQPKYGHL 60
Query: 325 RELHKLLKSMEKTLTYGNVTNTDYGNSVSGS----------------------------- 355
+ELH+ +K EK L + T GN
Sbjct: 61 KELHRAIKMCEKALVSADPVVTSIGNKQQAHVYSAESGDCSAFLANYDTESAARVLFNNV 120
Query: 356 SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVV 415
YNLP WS+SILPDC+ FNTAKV QT+ P N QW+ E ++ +
Sbjct: 121 HYNLPPWSISILPDCRNAVFNTAKVGVQTSQMEMLPTDTKN----FQWESYLEDLSS--L 174
Query: 416 RGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAY 474
F + L++Q T D SDYLWYMT+ D+ D + L G TL I S+G +H +
Sbjct: 175 DDSSTFTTHGLLEQINVTRDTSDYLWYMTSVDIGDSESFLHGGELPTLIIQSTGHAVHIF 234
Query: 475 VNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPV 534
VNG S + ++ + L G N+I+LLS VGL N G F+ GI GPV
Sbjct: 235 VNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSVAVGLPNVGGHFESWNTGILGPV 294
Query: 535 LLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSER-GW--SSKNVPLNRRM 591
L G + + DLS KWTY+VGL G + A N+ GW +S V + +
Sbjct: 295 ALHGLSQGKM---DLSWQKWTYQVGLKG---EAMNLAFPTNTPSIGWMDASLTVQKPQPL 348
Query: 592 TWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPY 651
TW+KT F+AP N+P+ L+++GMGKG WVNG ++GRYW T A D CS C Y G Y
Sbjct: 349 TWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYW-TAFATGD-CS--HCSYTGTY 404
Query: 652 GSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTAC----- 706
+KC CG P+Q WYHVPR+W+K N LV+FEE GGNPS ++ V C
Sbjct: 405 KPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSGVCAEVSE 464
Query: 707 ---------------GQAHENKTMELTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAE 750
GQ + L C G+ I+ IK+ASFG P G CG++++G C A
Sbjct: 465 YHPNIKNWQIESYGKGQTFHRPKVHLKCSPGQAIASIKFASFGTPLGTCGSYQQGECHAA 524
Query: 751 IDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
++E++CVGK C++ S +N G C +KRL VEA+C
Sbjct: 525 TS-YAILERKCVGKARCAVTISNSNFGKDPC-PNVLKRLTVEAVC 567
>gi|328872959|gb|EGG21326.1| glycoside hydrolase family 35 protein [Dictyostelium fasciculatum]
Length = 759
Score = 410 bits (1054), Expect = e-111, Method: Compositional matrix adjust.
Identities = 264/741 (35%), Positives = 380/741 (51%), Gaps = 93/741 (12%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V +D R++ I+GERK+++SGSIHYPRSTP MWP LIKK+K+ G++ IETYVFWN H+P
Sbjct: 46 VEYDQRSLKINGERKLMISGSIHYPRSTPSMWPSLIKKSKDAGINMIETYVFWNLHQPNN 105
Query: 89 RQ-YDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
Q Y+F GN ++ F+ Q +GLYV LRIGPYVCAEWNYGG P WL N+PGI R N
Sbjct: 106 SQEYNFEGNANITHFLDLCQQEGLYVHLRIGPYVCAEWNYGGIPSWLRNIPGI-VFRDYN 164
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
+ +M EM ++ T IV+ K FAS GGPIILAQ+ENEYG + ++YGD+GK Y W
Sbjct: 165 QPWMTEMASWMTFIVNYLK--PYFASNGGPIILAQVENEYGWLENEYGDSGKLYAEWAIS 222
Query: 208 MATSLDIGVPWIMCQESDAPSPMFTPNN--------------PNSPKIWTENWTGWFKSW 253
A SL+IG+PW MCQ++D + T N PN P +TENW GW + +
Sbjct: 223 FAKSLNIGIPWTMCQQNDIDDAINTCNGFYCHDWIQYHFQVYPNQPAFFTENWAGWIQYY 282
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEY 313
P R EDL ++VAR+F GG+ NYYM+HGGT F R S +LT SYDYDA +DEY
Sbjct: 283 SEGVPHRPTEDLLYSVARWFSRGGSLMNYYMWHGGTTFARYS-STFLTNSYDYDAALDEY 341
Query: 314 GHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTE 373
G+ +PK+ L +LH +L L +S P ++S + C T
Sbjct: 342 GYEAEPKYSALAQLHSVLSQYSYIL-------------LSSGEVARPV-NISNITTCNTI 387
Query: 374 E---FNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMIN----------------DFV 414
E +NT +N N + AP+Q W + I D
Sbjct: 388 EIIQYNTT-INGTLETITFVTNFGVSSSAPVQLNWNGQTITVNPWSVLILYNNQTVIDTS 446
Query: 415 VRGKGHFALNTLIDQKSTNDVSDYLWY----------MTNADLKDDDPILSGSSNMTLRI 464
+ + A K +V W + A+L + L + + T +
Sbjct: 447 YVKQQYSAQKEFYQSKRVKNVLVSSWTEPIGVGNYSNVVTANLPSEQ--LDLTLDQTDYL 504
Query: 465 NSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFD 524
++ +++ Y++G Y W++ S F K G +++S+LS T+GL +YGS F+
Sbjct: 505 CNADDMIYIYIDGEY--QSWSR--GSPAHFVLDTKFGIGTHKLSILSLTMGLISYGSHFE 560
Query: 525 MVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
G+ G V L +D++++ W+ + L G + ++ WS N
Sbjct: 561 SYKRGLNGTVTLG--------TQDITNNGWSMRPYLVG----EMQGIQSNPHLTSWSINN 608
Query: 585 -VPLNRRMTWYKTTFEAPLE---NDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGC 640
+ +N+ +TWYK E L++ GM KGF VNG ++GRYW L GC
Sbjct: 609 ELSINQPLTWYKLNLIIQSEIQDTSSFALDMIGMNKGFIIVNGNSIGRYW---LTLGWGC 665
Query: 641 STESCDYRGP-YGSDKCAYNCGNPSQIWYHVPRSWI---KDGVNTLVLFEEFGGNPSQIN 696
+ C+Y G Y C CG PS+ +YHVP ++ + +N +++FEE G+P+ I
Sbjct: 666 GS-GCNYTGDGYQGYLCRTGCGEPSERYYHVPNDYLYLEPNQLNEIIVFEELSGDPNSIQ 724
Query: 697 FQTVVVGTACGQAHENKTMEL 717
V Q + +E
Sbjct: 725 LVQRYVPYQLDQTDYDNNLEF 745
>gi|238009746|gb|ACR35908.1| unknown [Zea mays]
Length = 346
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 187/312 (59%), Positives = 230/312 (73%), Gaps = 12/312 (3%)
Query: 30 SHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRR 89
++D +A+ ++G+R+IL+SGSIHYPRS P MWPDLI+KAK+GGLD ++TYVFWN HEP RR
Sbjct: 30 TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89
Query: 90 QYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKV 149
QY F G DL+ FIK ++ GLYV LRIGPYVCAEWN+GGFPVWL +PGI LRT N+
Sbjct: 90 QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGI-SLRTDNEP 148
Query: 150 FMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMA 209
F EMQNFTT IVDM K E LF QGGPIIL+QIENE+G + D G+ K+Y +W A MA
Sbjct: 149 FKAEMQNFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMA 208
Query: 210 TSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDP 258
+L+ VPW+MC+E DAP P+ F+PN P+ P +WTE WT W+ +G P
Sbjct: 209 VALNTSVPWVMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVP 268
Query: 259 KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQ 318
R EDLA+ VA+F Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAPIDEYG LN
Sbjct: 269 HRPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGELNT 328
Query: 319 PKWGHLRELHKL 330
+G L+ L
Sbjct: 329 FYFGKRHALYSL 340
>gi|226532830|ref|NP_001140495.1| uncharacterized protein LOC100272556 precursor [Zea mays]
gi|194699714|gb|ACF83941.1| unknown [Zea mays]
gi|195659509|gb|ACG49222.1| hypothetical protein [Zea mays]
gi|414881558|tpg|DAA58689.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 346
Score = 400 bits (1027), Expect = e-108, Method: Compositional matrix adjust.
Identities = 186/312 (59%), Positives = 229/312 (73%), Gaps = 12/312 (3%)
Query: 30 SHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRR 89
++D +A+ ++G+R+IL+SGSIHYPRS P MWPDLI+KAK+GGLD ++TYVFWN HEP RR
Sbjct: 30 TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89
Query: 90 QYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKV 149
QY F G DL+ FIK ++ GLYV LRIGPYVCAEWN+GGFPVWL +PGI RT N+
Sbjct: 90 QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGI-SFRTDNEP 148
Query: 150 FMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMA 209
F EMQNFTT IVDM K E LF QGGPIIL+QIENE+G + D G+ K+Y +W A MA
Sbjct: 149 FKAEMQNFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMA 208
Query: 210 TSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDP 258
+L+ VPW+MC+E DAP P+ F+PN P+ P +WTE WT W+ +G P
Sbjct: 209 VALNTSVPWVMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVP 268
Query: 259 KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQ 318
R EDLA+ VA+F Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAPIDEYG LN
Sbjct: 269 HRPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGELNT 328
Query: 319 PKWGHLRELHKL 330
+G L+ L
Sbjct: 329 FYFGKRHALYSL 340
>gi|449436076|ref|XP_004135820.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 486
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 199/341 (58%), Positives = 239/341 (70%), Gaps = 16/341 (4%)
Query: 9 RAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAK 68
+ +LL L L T ++ V++D +AI I+G R+IL+SGSIHYPRSTP MWPDLI+KAK
Sbjct: 3 KTVLLFLCLLTWVCSTIG-SVTYDHKAIIINGRRRILISGSIHYPRSTPQMWPDLIQKAK 61
Query: 69 EGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYG 128
+GGLD IETYVFWN HEP +Y F DL+RFIK +Q GLYV LRIGPYVCAEWNYG
Sbjct: 62 DGGLDIIETYVFWNGHEPSPGKYYFEERYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYG 121
Query: 129 GFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYG 188
GFP+WL +PGI RT N F MQ F IVDM K EKLF +QGGPIIL+QIENEYG
Sbjct: 122 GFPIWLKFVPGI-AFRTDNAPFKAAMQKFVYKIVDMMKWEKLFHTQGGPIILSQIENEYG 180
Query: 189 NVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPN 237
V + G GKSY W A+MA L GVPW+MC++ DAP P+ F PN
Sbjct: 181 PVEWEIGAPGKSYTKWAAQMAVGLKTGVPWVMCKQEDAPDPLIDTCNGFYCENFKPNQIY 240
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
PKIWTENW+GW+ ++GG P R ED+AF+VARF Q GG+ NYYMYHGGTNFGRTS G
Sbjct: 241 KPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQNGGSLVNYYMYHGGTNFGRTS-G 299
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWG--HLRELHKLLKSMEK 336
++TTSYD+DAPIDEYG L +P G L+ L++ + M K
Sbjct: 300 LFVTTSYDFDAPIDEYGLLREPILGPVTLKGLNEGTRDMSK 340
Score = 132 bits (333), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 71/176 (40%), Positives = 101/176 (57%), Gaps = 9/176 (5%)
Query: 522 KFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWS 581
++ ++ I GPV L G +D+S +KW+YKVGL G + Y+ K +NS + W
Sbjct: 314 EYGLLREPILGPVTL---KGLNEGTRDMSKYKWSYKVGLRG-EILNLYSVKGSNSVQ-WM 368
Query: 582 SKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCS 641
+ + +TWYKTTF P N+P+ L++ M KG WVNG ++GRY+P Y+A
Sbjct: 369 KGSFQ-KQPLTWYKTTFNTPAGNEPLALDMSSMSKGQIWVNGRSIGRYFPGYIARG---K 424
Query: 642 TESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
C Y G + KC +NCG PSQ WYH+PR W+ N L++ EE GGNP I+
Sbjct: 425 CNKCSYTGFFTEKKCLWNCGGPSQKWYHIPRDWLSPNGNLLIILEEIGGNPQGISL 480
>gi|356503083|ref|XP_003520341.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 15-like [Glycine
max]
Length = 482
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 184/318 (57%), Positives = 227/318 (71%), Gaps = 17/318 (5%)
Query: 25 LAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAH 84
A VS+D + I+ E+ I+ SG +HYP ST +WP + K+ K GGLDAIE+Y+FW+ H
Sbjct: 5 FATEVSYDAHSHIINEEKHIIFSGVVHYPXSTVDLWPAIFKRXKYGGLDAIESYIFWDRH 64
Query: 85 EPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELR 144
EP+RR+YD +GNLD I F+K IQ+ LY ILRIGPYVC WN+GGF +WLHNMP I ELR
Sbjct: 65 EPVRREYDCSGNLDFIDFLKLIQEAELYFILRIGPYVCEXWNFGGFSLWLHNMPEI-ELR 123
Query: 145 TTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW 204
N + NEMQ FTT IV+MAK+ KLFA GGPIIL IENEYGN+M+DY +A K YI W
Sbjct: 124 IDNPIXKNEMQIFTTKIVNMAKEAKLFAPXGGPIILTPIENEYGNIMTDYREARKPYIKW 183
Query: 205 CAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSW 253
CA+MA + +IGVPWIMC DAP PM F PNNP S K++ F+ W
Sbjct: 184 CAQMALTQNIGVPWIMCXXRDAPQPMINTCNGHYCDSFXPNNPKSSKMFRX-----FQKW 238
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEY 313
G + P ++AE+ F+VARFFQ GG NYYMYHGGTNFG GGPY+T SY+YDAP+DEY
Sbjct: 239 GERVPHKSAEESTFSVARFFQSGGILNNYYMYHGGTNFGHMVGGPYMTASYEYDAPLDEY 298
Query: 314 GHLNQPKWGHLRELHKLL 331
G+LN+PKW H ++LHK L
Sbjct: 299 GNLNKPKWEHFKQLHKEL 316
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 50/77 (64%), Gaps = 2/77 (2%)
Query: 701 VVGTACGQAHENKTMELTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEK 759
+ GT C Q +E ++ +C G+ IS+I++ASFG+P+G CG+FK G+ EA D ++E
Sbjct: 407 ITGTICTQVNEGAQLDPSCQIGKTISQIQFASFGNPEGNCGSFKGGTWEA-TDSQSVVEV 465
Query: 760 QCVGKKSCSIEASEANL 776
C+G+ SC ++ ++
Sbjct: 466 ACIGRNSCGFTVTKRHI 482
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 30/43 (69%)
Query: 598 FEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGC 640
FEAP DP+V++LQ GK AWVNG ++G YW +++ +GC
Sbjct: 363 FEAPFGIDPMVMDLQDSGKRQAWVNGKSIGCYWSSWITNTNGC 405
Score = 46.2 bits (108), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 36/55 (65%), Gaps = 4/55 (7%)
Query: 427 IDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVD 481
+ ++ T DVSD+LWYMT+ D+ D +S +N TLR+++ G L AYV+G D
Sbjct: 312 LHKELTFDVSDFLWYMTSIDIPD----ISLWNNSTLRVSTMGHTLRAYVSGRADD 362
>gi|414881559|tpg|DAA58690.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 342
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 183/312 (58%), Positives = 227/312 (72%), Gaps = 16/312 (5%)
Query: 30 SHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRR 89
++D +A+ ++G+R+IL+SGSIHYPRS P MWPDLI+KAK+GGLD ++TYVFWN HEP RR
Sbjct: 30 TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89
Query: 90 QYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKV 149
QY F G DL+ FIK ++ GLYV LRIGPYVCAEWN+GGFPVWL +PGI RT N+
Sbjct: 90 QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGI-SFRTDNEP 148
Query: 150 FMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMA 209
F +NFTT IVDM K E LF QGGPIIL+QIENE+G + D G+ K+Y +W A MA
Sbjct: 149 F----KNFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMA 204
Query: 210 TSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDP 258
+L+ VPW+MC+E DAP P+ F+PN P+ P +WTE WT W+ +G P
Sbjct: 205 VALNTSVPWVMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVP 264
Query: 259 KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQ 318
R EDLA+ VA+F Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAPIDEYG LN
Sbjct: 265 HRPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGELNT 324
Query: 319 PKWGHLRELHKL 330
+G L+ L
Sbjct: 325 FYFGKRHALYSL 336
>gi|413922056|gb|AFW61988.1| hypothetical protein ZEAMMB73_453254 [Zea mays]
Length = 326
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 183/298 (61%), Positives = 217/298 (72%), Gaps = 12/298 (4%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D RA+ I+G+R+IL+SGSIHYPRSTP MWP L++KAK+GGLD ++TYVFWN HEP+R
Sbjct: 28 VSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTYVFWNGHEPVR 87
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
QY F DL+RF+K + GLYV LRIGPYVCAEWN+GGFPVWL +PGI RT N
Sbjct: 88 GQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGI-SFRTDNG 146
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ F IV M K E LF QGGPIILAQ+ENEYG + S G K Y NW AKM
Sbjct: 147 PFKAAMQAFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGAGAKPYANWAAKM 206
Query: 209 ATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKD 257
A + GVPW+MC++ DAP P+ F+PN+ + P +WTE WTGWF ++GG
Sbjct: 207 AVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNSNSKPTMWTEAWTGWFTAFGGAV 266
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGH 315
P R ED+AFAVARF Q GG+F NYYMYHGGTNF RTSGGP++ TSYDYDAPIDEYG
Sbjct: 267 PHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGR 324
>gi|373853838|ref|ZP_09596637.1| glycoside hydrolase family 35 [Opitutaceae bacterium TAV5]
gi|372473365|gb|EHP33376.1| glycoside hydrolase family 35 [Opitutaceae bacterium TAV5]
Length = 744
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 258/767 (33%), Positives = 380/767 (49%), Gaps = 127/767 (16%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS D RA+ +DG R ++LSG++HYPRSTP MWP +++ ++ GL+ +ETY+FWN HE R
Sbjct: 3 VSFDHRALLLDGRRTLVLSGAVHYPRSTPAMWPRILRHMRQSGLNTVETYIFWNLHERRR 62
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
DF+G LDL+RF + Q +GL VILRIGPY+CAE NYGG P WL ++P I +RT N+
Sbjct: 63 GVLDFSGRLDLVRFCRLAQAEGLNVILRIGPYICAETNYGGLPGWLRDVPDI-RMRTDNE 121
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F E + L+ ++ + L A GGP+ILAQIENEY N+ + YG+ G+ Y+ W ++
Sbjct: 122 AFKREKARWVRLVAEVIR--PLCAPNGGPVILAQIENEYDNIAATYGEDGRRYLRWSVEL 179
Query: 209 ATSLDIGVPWIMCQE------------SDAPSPMFTPN--------------NPNSPKIW 242
A SL +G+PW+ C + A + T N +P P +W
Sbjct: 180 AQSLGLGIPWVTCAAGRAAEAGEKDAVASAGDSLETLNAFRAHEIIGQHFREHPEQPALW 239
Query: 243 TENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTT 302
TENW GW+++WGG PKR E+LA+A ARFF GG+ NY+++HGGTNFGR G LTT
Sbjct: 240 TENWAGWYQTWGGVLPKREPEELAYATARFFAAGGSGVNYFLWHGGTNFGR-DGMYLLTT 298
Query: 303 SYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGN----VTNTDYGNSVSGSSYN 358
+Y++ P+DEYG L K HL L+K L + + +T G S
Sbjct: 299 AYEFGGPLDEYG-LPTTKARHLARLNKALAACADKILASERPRAITGERNGLLKFQYSSG 357
Query: 359 LPAWSVSILPDCKTEEFNTAKV--NTQTNVKVKRPNQA-GNDQAPLQWKWRPE-MINDFV 414
L W + + N + ++ V+R +A G AP W WR E + +
Sbjct: 358 LTFWCDDVARTVRIVGKNGEVLYDSSARVAPVRRTWKASGVRFAP--WGWRAEPLPAAWP 415
Query: 415 VRGKGHFALNTLIDQ-KSTNDVSDYLWYMT----------------------------NA 445
+ ++Q T D +DY WY T
Sbjct: 416 AEAQSAVTARKPLEQLLLTKDETDYCWYETAIVVEGSGDVLVAGRDGSPAGLERGALARV 475
Query: 446 DLKDDDPILSGSSNM-------TLRINSSGQVLHAYVNGNYVDSQWTKY---------GA 489
+ P ++G ++ TLR+ ++H +++G +V + T G
Sbjct: 476 GRRGRRPSIAGLASEVPANTVNTLRLTRVADIVHVFIDGTFVATTPTPLRERRGKMDAGL 535
Query: 490 SNDLFE---RPVKLTRGKNQISLLSATVGLQNYG-----SKFDMVPNGIPGPVLLVGRAG 541
FE + +++T GK+++SLL +GL + G+ PV G
Sbjct: 536 FTQTFELDLKALRITPGKHRLSLLCCALGLIKGDWMIGYENMALEKKGLWAPVFWNG--- 592
Query: 542 DETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVP----LNRRMTWYKTT 597
K L +W ++ GL G ++ + AA S W + R + W++TT
Sbjct: 593 -----KKLEG-EWRHQPGLLG--ERCGFADPAAGSLLAWKTAKAATGRGARRPLRWWRTT 644
Query: 598 FEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPY-----G 652
F P + P L+L GMGKG AW+NG+ +GRYW LA+ D GP+ G
Sbjct: 645 FTRPKGHGPWALDLGGMGKGMAWINGHCIGRYW--LLADTDPM--------GPWMAWMKG 694
Query: 653 SDKCAYNCGNPSQIWYHVPRSWIKD--GVNTLVLFEEFGGNPSQINF 697
S A + G P+Q +YHVP W++ G +TLVLFEE GG+P+ +
Sbjct: 695 SLTAAPSSG-PTQRYYHVPDDWLRTDGGPDTLVLFEELGGDPATVRL 740
>gi|328873276|gb|EGG21643.1| hypothetical protein DFA_01529 [Dictyostelium fasciculatum]
Length = 827
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 256/750 (34%), Positives = 371/750 (49%), Gaps = 99/750 (13%)
Query: 10 AILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
+I L L++ + LS VS+D RAI I+GERK+L S SIHYPRST MWPD++K+ K
Sbjct: 13 SIFLILLIFPNYVLSDKLTVSYDNRAIIINGERKLLYSASIHYPRSTRTMWPDILKRTKA 72
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
G++ IETY+FWN H+P YDF G+ D+ F+ +++G +VI+R GPYVCAEWN GG
Sbjct: 73 AGINTIETYIFWNLHQPTPDTYDFEGSSDVKHFLDLCKEEGFHVIVRFGPYVCAEWNNGG 132
Query: 130 FPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGN 189
P WL +PGI RT N+ FM EM+ + IV +A GGPII+AQIENEYG
Sbjct: 133 LPSWLKAVPGI-VYRTHNEPFMREMKKWMDYIVHYLS--DYYAPNGGPIIMAQIENEYGW 189
Query: 190 VMSDYGD-AGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFTPNN------------- 235
+ +Y + G Y++W K+A S + G+PWIMCQ++ + T N
Sbjct: 190 LEYEYREQGGPEYVDWAVKLAKSYNTGIPWIMCQQNTRSDVINTCNGFYCHDWLQYHQRT 249
Query: 236 -PNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRT 294
P+ P +TE WTGW + + P R D+ ++ ARF+ GG NYYM+HGGT FGR
Sbjct: 250 FPDQPAFFTELWTGWPQYFEEGFPTRPTVDVLYSAARFYSRGGGMVNYYMWHGGTTFGRF 309
Query: 295 SGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDY---GNS 351
+ P+LTTSYDYDAP+DEYG +PK+ L +LH L+ + + Y N+
Sbjct: 310 T-SPFLTTSYDYDAPLDEYGFPQEPKYSMLTKLHVTLEKYSSVILHDPNVPPPYVFPDNT 368
Query: 352 VSGSSYNLPAWSVSILPD-----CKTEEFNTAKVNT-QTNVKVKRPNQAGNDQAPL---- 401
V Y A SV L + K + N V Q +V++ N+ D +
Sbjct: 369 VEMIEYKKDAESVVFLVNWDDTFAKQVDMNGKNVKINQWSVQIYYNNELVFDTFEIPANL 428
Query: 402 -----QWKWRPEMINDFVVRGKGHFALNTLIDQ---------------------KSTNDV 435
+K + D L L+ K T D
Sbjct: 429 TRPNPPFKPIAKTSLDATAAATSRTGLVNLVSSWNEPFSFLTYNASSQTPTAQLKLTGDN 488
Query: 436 SDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFE 495
SDY+WY T DL D I L + S + +V+G ++ W + F
Sbjct: 489 SDYIWYETEIDLTKTDEI--------LYLYKSYDFSYVFVDGQFL--YWHRGSPIQAYFN 538
Query: 496 RPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWT 555
K GK+ + +L A +G+ +YG+ + G+ G + L K+++ + W
Sbjct: 539 G--KFPVGKHTLQILCAAMGVPSYGAHIEQHERGLTGDIFLGS--------KNITDNGWK 588
Query: 556 YKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNR-----RMTWYKTTFEAP-LENDPV-V 608
+ L G + A+ S WS P+++ +TWYK + P E+ P
Sbjct: 589 MRPFLSG----ELLGLHASPSTVKWS----PVSKGTAGSGVTWYKFNVKTPSFEDGPAFA 640
Query: 609 LNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWY 668
L+L+ M KG +VNG ++GRYW G E C+ G Y + C NCG SQ +Y
Sbjct: 641 LDLKSMWKGLVFVNGNSIGRYWVA-----KGWCEEKCNQTGLYDNYGCRENCGESSQRYY 695
Query: 669 HVPRSWIKDGV-NTLVLFEEFGGNPSQINF 697
HVP+ ++K+ N +++FEE G+P I
Sbjct: 696 HVPKDFLKESSDNEVIIFEELQGDPYSIEL 725
>gi|391229102|ref|ZP_10265308.1| beta-galactosidase [Opitutaceae bacterium TAV1]
gi|391218763|gb|EIP97183.1| beta-galactosidase [Opitutaceae bacterium TAV1]
Length = 743
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 250/766 (32%), Positives = 374/766 (48%), Gaps = 126/766 (16%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS D RA+ +DG R ++LSG++HYPRSTP MWP +++ ++ GL+ +ETY+FWN HE R
Sbjct: 3 VSFDHRALLLDGRRTLVLSGAVHYPRSTPAMWPRILRHMRQSGLNTVETYIFWNLHERRR 62
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
DF+G LDL+RF + Q +GL VILRIGPY+CAE NYGG P WL ++P I +RT N+
Sbjct: 63 GVLDFSGRLDLVRFCRLAQAEGLNVILRIGPYICAETNYGGLPGWLRDVPDI-RMRTDNE 121
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F E + L+ ++ + L A GGP+ILAQIENEY N+ + YG+ G+ Y+ W ++
Sbjct: 122 AFKREKARWVRLVAEVIR--PLCAPNGGPVILAQIENEYDNIAATYGEDGRRYLRWSVEL 179
Query: 209 ATSLDIGVPWIMCQE------------SDAPSPMFTPN--------------NPNSPKIW 242
A SL +G+PW+ C + A + T N +P P +W
Sbjct: 180 AQSLGLGIPWVTCAAGRAAEAGEKDAVASAGDSLETLNAFRAHEIIGQHFREHPEQPALW 239
Query: 243 TENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTT 302
TENW GW+++WGG PKR E+LA+A ARFF GG+ NY+++HGGTNFGR G LTT
Sbjct: 240 TENWAGWYQTWGGVLPKREPEELAYATARFFAAGGSGVNYFLWHGGTNFGR-DGMYLLTT 298
Query: 303 SYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSS----YN 358
+Y++ P+DEYG + H + G + ++ V SS Y+
Sbjct: 299 AYEFGGPLDEYGLPTT------KARHLARLNAALAACAGELLASERPGVVEKSSGVVEYH 352
Query: 359 LPAWSVSILPDCKTEEF---NTAKVNTQTNVKVKRPNQA----GNDQAPLQWKWRPE-MI 410
+ V + D + +V ++V+V +A G AP W WR E +
Sbjct: 353 YDSGLVFVCDDTARAVRIVKKSGEVLYDSSVRVAPVRRAWKSSGVRFAP--WGWRAEPLP 410
Query: 411 NDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMT-------------------------- 443
+ + ++Q T D +DY WY T
Sbjct: 411 AAWPAEAQSAVTARKPLEQLLPTKDETDYCWYETAIVVEGSGDVLVAGRDGSPAGLERGA 470
Query: 444 --NADLKDDDPILSGSSNM-------TLRINSSGQVLHAYVNGNYVDSQWTKY------- 487
+ P ++G ++ TLR+ ++H +++G +V + T
Sbjct: 471 LARVGRRGRRPSIAGLASEVPANTVNTLRLTRVADIVHVFIDGTFVATTPTPLRERRGKM 530
Query: 488 --GASNDLFE---RPVKLTRGKNQISLLSATVGLQNYG-----SKFDMVPNGIPGPVLLV 537
G FE + +++T GK+++SLL +GL + G+ PV
Sbjct: 531 DAGLFTQTFELDLKALRITPGKHRLSLLCCALGLIKGDWMIGYENMALEKKGLWAPVFWN 590
Query: 538 GRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVP----LNRRMTW 593
G K L +W ++ GL G ++ + AA S W + R + W
Sbjct: 591 G--------KKLEG-EWRHQPGLLG--ERCGFADPAAGSLLAWKTAKAATGRGARRPLNW 639
Query: 594 YKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGS 653
++TTF P + P L+L GMGKGF W+NG+ +GRYW L + D +G
Sbjct: 640 WRTTFTRPKGHGPWALDLGGMGKGFCWINGHCIGRYW--LLPDTDPMGPWMAWMKGSL-- 695
Query: 654 DKCAYNCGNPSQIWYHVPRSWIKD--GVNTLVLFEEFGGNPSQINF 697
A G P+Q +YHVP W++ G +TLVLFEE GG+P+ +
Sbjct: 696 --TAAPSGGPTQRYYHVPDDWLRTDGGPDTLVLFEELGGDPATVRL 739
>gi|297789001|ref|XP_002862517.1| hypothetical protein ARALYDRAFT_333310 [Arabidopsis lyrata subsp.
lyrata]
gi|297308086|gb|EFH38775.1| hypothetical protein ARALYDRAFT_333310 [Arabidopsis lyrata subsp.
lyrata]
Length = 534
Score = 369 bits (948), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 227/542 (41%), Positives = 293/542 (54%), Gaps = 77/542 (14%)
Query: 314 GHLNQPKWGHLRELHKLLKSMEKTLT----------------------------YGNV-T 344
G L QPKWGHLR+LHK +K E L NV T
Sbjct: 9 GLLRQPKWGHLRDLHKAIKLCEDALIATDPTISSLGSNLEAAVYKTASGSCAAFLANVGT 68
Query: 345 NTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTN-----VKVKRPNQAGNDQA 399
+D S +G SY+LPAWSVSILPDCK FNTAK+N+ T + +P+ + +
Sbjct: 69 KSDATVSFNGESYHLPAWSVSILPDCKNVAFNTAKINSATEPTAFARQSLKPDGGSSAEL 128
Query: 400 PLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSS 458
+W + E I + F L++Q +T D SDYLWY D+K D+ L S
Sbjct: 129 GSEWSYIKEPIG---ISKADAFLKPGLLEQINTTADKSDYLWYSLRMDIKGDETFLDEGS 185
Query: 459 NMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQN 518
L I S GQV++A++NG S K S D+ P+ L GKN + LLS TVGL N
Sbjct: 186 KAVLHIESLGQVVYAFINGKLAGSGHGKQKISLDI---PINLVAGKNTVDLLSVTVGLAN 242
Query: 519 YGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSER 578
YG+ FD+V GI GPV L G +I DL+S +WTY+VGL G D A
Sbjct: 243 YGAFFDLVGAGITGPVTLKSAKGGSSI--DLASQQWTYQVGLKGED-----TGLGAVDSS 295
Query: 579 GWSSKN-VPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEE 637
W SK+ +P + + WYKTTF+AP ++PV ++ G KG AWVNG ++GRYWPT +A
Sbjct: 296 EWVSKSPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTVKGIAWVNGQSIGRYWPTSIAGN 355
Query: 638 DGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
GC T+SCDYRG Y ++KC NCG PSQ YHVPRSW+K NTLVLFEE GG+P+QI+F
Sbjct: 356 GGC-TDSCDYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNTLVLFEEMGGDPTQISF 414
Query: 698 QTVVVGT----ACGQAH---------------ENKT---MELTC--HGRRISEIKYASFG 733
T G+ Q+H N+T + L C + IS IK+ASFG
Sbjct: 415 GTKQTGSNLCLTVSQSHPPPVDTWTSDSKISNRNRTRPVLSLQCPVSTQVISSIKFASFG 474
Query: 734 DPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEA 793
P+G CG+F GSC + L L++K C+G +SC+IE S G G VK L VEA
Sbjct: 475 TPKGTCGSFTSGSCNSSRS-LSLVQKACIGSRSCNIEVSTRVFGEP--CRGVVKSLAVEA 531
Query: 794 LC 795
C
Sbjct: 532 SC 533
>gi|115445061|ref|NP_001046310.1| Os02g0219200 [Oryza sativa Japonica Group]
gi|113535841|dbj|BAF08224.1| Os02g0219200, partial [Oryza sativa Japonica Group]
Length = 500
Score = 369 bits (947), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 225/519 (43%), Positives = 283/519 (54%), Gaps = 63/519 (12%)
Query: 222 QESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVA 270
++ DAP P+ F+PN P +WTE WTGWF S+GG P R EDLAFAVA
Sbjct: 1 KQDDAPDPVINTCNGFYCDYFSPNKNYKPSMWTEAWTGWFTSFGGGVPHRPVEDLAFAVA 60
Query: 271 RFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKL 330
RF Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAPIDE+G L QPKWGHLR+LH+
Sbjct: 61 RFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLLRQPKWGHLRDLHRA 120
Query: 331 LKSMEKTLTYGNVT-----------------------------NTDYGNSVSGSSYNLPA 361
+K E L + T NT +G YNLPA
Sbjct: 121 IKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNTAVKVRFNGQQYNLPA 180
Query: 362 WSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHF 421
WS+SILPDCKT FNTA V T + P W+ E N F
Sbjct: 181 WSISILPDCKTAVFNTATVKEPTLMPKMNP------VVRFAWQSYSEDTNSL---SDSAF 231
Query: 422 ALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYV 480
+ L++Q S T D SDYLWY T ++ +D + SG S L + S+G + +VNG
Sbjct: 232 TKDGLVEQLSMTWDKSDYLWYTTYVNIGTND-LRSGQSPQ-LTVYSAGHSMQVFVNGKSY 289
Query: 481 DSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRA 540
S + Y + VK+ +G N+IS+LS+ VGL N G+ F+ G+ GPV L
Sbjct: 290 GSVYGGYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFENWNVGVLGPVTLSSLN 349
Query: 541 GDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEA 600
G KDLS KWTY+VGL G ++ E G PL TW+K F A
Sbjct: 350 GG---TKDLSHQKWTYQVGLKGETLGLHTVTGSSAVEWGGPGGYQPL----TWHKAFFNA 402
Query: 601 PLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNC 660
P NDPV L++ MGKG WVNG+++GRYW +Y A GC C Y G Y DKC NC
Sbjct: 403 PAGNDPVALDMGSMGKGQLWVNGHHVGRYW-SYKAS-GGCG--GCSYAGTYHEDKCRSNC 458
Query: 661 GNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
G+ SQ WYHVPRSW+K G N LV+ EE+GG+ + ++ T
Sbjct: 459 GDLSQRWYHVPRSWLKPGGNLLVVLEEYGGDLAGVSLAT 497
>gi|413954365|gb|AFW87014.1| beta-galactosidase [Zea mays]
Length = 473
Score = 361 bits (927), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 210/489 (42%), Positives = 272/489 (55%), Gaps = 49/489 (10%)
Query: 241 IWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL 300
+WTE WTGWF ++GG P R ED+AFAVARF Q GG+F NYYMYHGGTNF RTSGGP++
Sbjct: 1 MWTEAWTGWFTAFGGAVPHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFI 60
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN---------- 350
TSYDYDAPIDEYG L QPKWGHLR+LHK +K E L G+ T GN
Sbjct: 61 ATSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSS 120
Query: 351 -------------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRP 391
+G Y+LPAWS+S+LPDCK FNTA V+ + R
Sbjct: 121 GGACAAFLSNYHTSAAARVVFNGRRYDLPAWSISVLPDCKAAVFNTATVSEPS--APARM 178
Query: 392 NQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDD 450
+ AG W+ E N R F + L++Q S T D SDYLWY T ++ +
Sbjct: 179 SPAGG----FSWQSYSEATNSLDGRA---FTKDGLVEQLSMTWDKSDYLWYTTYVNINSN 231
Query: 451 DPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLL 510
+ L L I S+G L +VNG + + Y + + VK+ +G N+IS+L
Sbjct: 232 EQFLKSGQWPQLTIYSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISIL 291
Query: 511 SATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYN 570
SA VGL N G+ ++ G+ GPV L G + +DLS KWTY++GL+G
Sbjct: 292 SAAVGLPNQGTHYETWNVGVLGPVTLSGLNEGK---RDLSDQKWTYQIGLHGESLGVQSV 348
Query: 571 AKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
A +++ E G ++ PL TW+K F AP + PV L++ MGKG AWVNG ++GRYW
Sbjct: 349 AGSSSVEWGSAAGKQPL----TWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYW 404
Query: 631 PTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGG 690
+Y A GC C Y G Y KC CG+ SQ +YHVPRSW+ N LV+ EEFGG
Sbjct: 405 -SYKASSSGCG--GCSYAGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVMLEEFGG 461
Query: 691 NPSQINFQT 699
+ S + T
Sbjct: 462 DLSGVKLVT 470
>gi|115480419|ref|NP_001063803.1| Os09g0539200 [Oryza sativa Japonica Group]
gi|113632036|dbj|BAF25717.1| Os09g0539200 [Oryza sativa Japonica Group]
Length = 446
Score = 361 bits (926), Expect = 9e-97, Method: Compositional matrix adjust.
Identities = 184/394 (46%), Positives = 237/394 (60%), Gaps = 44/394 (11%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D R++ IDG+R + SG+IHYPRS P MW L+K AK GGL+ IETYVFWN HEP
Sbjct: 36 VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWNGHEPEP 95
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y F G DLIRF+ I+D +Y I+RIGP++ AEWN+GG P WL + I R N+
Sbjct: 96 GKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHI-IFRANNE 154
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F EM+ F IV K ++FA QGGPIIL+QIENEYGN+ D G Y+ W A+M
Sbjct: 155 PFKREMEKFVRFIVQKLKDAEMFAPQGGPIILSQIENEYGNIKKDRKVEGDKYLEWAAEM 214
Query: 209 ATSLDIGVPWIMCQESDAPSPMFTPNN------------PNSPKIWTENWTGWFKSWGGK 256
A S IGVPW+MC++S AP + N N P++WTENWT F+++G +
Sbjct: 215 AISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDKNKPRLWTENWTAQFRTFGDQ 274
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHL 316
+R+AED+A+AV RFF GGT NYYMYHGGTNFGRT G Y+ T Y +AP+DEYG
Sbjct: 275 LAQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMC 333
Query: 317 NQPKWGHLRELHKLLKSMEKTLTYG------------------------------NVTNT 346
+PK+GHLR+LH ++KS K +G N T
Sbjct: 334 KEPKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSNNNTGE 393
Query: 347 DYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKV 380
D G + +P+ SVSIL DCKT +NT +V
Sbjct: 394 DGTVVFRGEKFYVPSRSVSILADCKTVVYNTKRV 427
>gi|14517399|gb|AAK62590.1| At2g32810/F24L7.5 [Arabidopsis thaliana]
gi|25090389|gb|AAN72290.1| At2g32810/F24L7.5 [Arabidopsis thaliana]
Length = 585
Score = 358 bits (920), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 231/591 (39%), Positives = 303/591 (51%), Gaps = 96/591 (16%)
Query: 284 MYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGN- 342
MY GGTNFGRTSGGP+ TSYDYDAP+DEYG ++PKWGHL++LH +K E L +
Sbjct: 1 MYFGGTNFGRTSGGPFYITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAADA 60
Query: 343 -----------------------------VTNTDYGNSV----SGSSYNLPAWSVSILPD 369
+ N D S +G SY LP WSVSILPD
Sbjct: 61 PQYRKLGSKQEAHIYHGDGETGGKVCAAFLANIDEHKSAHVKFNGQSYTLPPWSVSILPD 120
Query: 370 CKTEEFNTAKVNTQTNVK-VKRPNQAGNDQAPLQWKWRPEMIN-----------DFVVRG 417
C+ FNTAKV QT+VK V+ + + LQ R + ++ + G
Sbjct: 121 CRHVAFNTAKVGAQTSVKTVESARPSLGSMSILQKVVRQDNVSYISKSWMALKEPIGIWG 180
Query: 418 KGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPIL--SGSSNMTLRINSSGQVLHAY 474
+ +F L++ T D SDYLW+ T + +DD N T+ I+S VL +
Sbjct: 181 ENNFTFQGLLEHLNVTKDRSDYLWHKTRISVSEDDISFWKKNGPNSTVSIDSMRDVLRVF 240
Query: 475 VNGNYVDS---QWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIP 531
VN S W K +PV+ +G N + LL+ TVGLQNYG+ + G
Sbjct: 241 VNKQLAGSIVGHWVKA-------VQPVRFIQGNNDLLLLTQTVGLQNYGAFLEKDGAGFR 293
Query: 532 GPVLLVG-RAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRR 590
G L G + GD DLS WTY+VGL G DK + N + WS+ +
Sbjct: 294 GKAKLTGFKNGD----LDLSKSSWTYQVGLKGEADKIY--TVEHNEKAEWSTLETDASPS 347
Query: 591 M-TWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRG 649
+ WYKT F+ P DPVVLNL+ MG+G AWVNG ++GRYW ++++DGC +CDYRG
Sbjct: 348 IFMWYKTYFDPPAGTDPVVLNLESMGRGQAWVNGQHIGRYW-NIISQKDGCD-RTCDYRG 405
Query: 650 PYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQA 709
Y SDKC NCG P+Q YHVPRSW+K N LVLFEE GGNP +I+ +TV G CGQ
Sbjct: 406 AYNSDKCTTNCGKPTQTRYHVPRSWLKPSSNLLVLFEETGGNPFKISVKTVTAGILCGQV 465
Query: 710 HE---------------NKTM---------ELTCH-GRRISEIKYASFGDPQGACGAFKK 744
E N TM L C G IS I++AS+G P+G+C F
Sbjct: 466 SESHYPPLRKWSTPDYINGTMSINSVAPEVHLHCEDGHVISSIEFASYGTPRGSCDGFSI 525
Query: 745 GSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G C A + L ++ + C G+ SC IE S + C +GT+K L V + C
Sbjct: 526 GKCHAS-NSLSIVSEACKGRNSCFIEVSNTAFISDPC-SGTLKTLAVMSRC 574
>gi|323371174|gb|ADX59436.1| beta-galactosidase [Coffea arabica]
Length = 338
Score = 354 bits (908), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 174/336 (51%), Positives = 219/336 (65%), Gaps = 40/336 (11%)
Query: 28 RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPL 87
+VS+DGR++ I+G+RK+L SGSIHYPRSTP MWP LI KAK GGLD IETYVFWN HEP
Sbjct: 27 QVSYDGRSLIIEGQRKLLFSGSIHYPRSTPDMWPSLISKAKHGGLDVIETYVFWNLHEPR 86
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
QYDF G +++RFI+ IQ GLY +RIGP++ AEW YGG P WLH++PGI R+ N
Sbjct: 87 HGQYDFKGRHNIVRFIREIQAHGLYAFIRIGPFIEAEWTYGGLPFWLHDVPGI-VYRSDN 145
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
+ F MQNFTT IV++ K E L+A QGGPIIL QIENEY N + + G Y+ W A
Sbjct: 146 EPFKYHMQNFTTKIVNLFKSEGLYAPQGGPIILQQIENEYKNAERAFHEKGPPYVQWAAA 205
Query: 208 MATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWG 254
MA L GVPW+MC++ DAP P+ PN+PN P IWT+NWT K+
Sbjct: 206 MAVGLQTGVPWVMCKQDDAPDPVINTCNGRTCGETFVGPNSPNKPAIWTDNWTS-LKN-- 262
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYG 314
G+F NYYMYHGGTNFGRT G ++ TSY +APIDEYG
Sbjct: 263 ----------------------GSFVNYYMYHGGTNFGRT-GSAFVLTSYYDEAPIDEYG 299
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN 350
+ QPKWGHL++LH ++KS +TL +G ++ + G
Sbjct: 300 LIRQPKWGHLKQLHSVIKSCSQTLLHGVISVSPLGQ 335
>gi|218201568|gb|EEC83995.1| hypothetical protein OsI_30162 [Oryza sativa Indica Group]
Length = 1078
Score = 353 bits (905), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 245/719 (34%), Positives = 336/719 (46%), Gaps = 155/719 (21%)
Query: 154 MQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLD 213
M+ F TLIV+ K+ KLFASQGGPIILAQIENEY ++ + +AG YINW AKMA + +
Sbjct: 426 MKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWAAKMAIATN 485
Query: 214 IGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGGKDPKR 260
GVPWIMC+++ AP + P + P +WTENWT ++ +G +R
Sbjct: 486 TGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPADKKKPLLWTENWTAQYRVFGDPPSQR 545
Query: 261 TAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPK 320
+AED+AF+VARFF GGT NYYMYHGGTNFGR +G ++ Y +AP+DE+G +PK
Sbjct: 546 SAEDIAFSVARFFSVGGTMANYYMYHGGTNFGR-NGAAFVMPRYYDEAPLDEFGLYKEPK 604
Query: 321 WGHLRELHKLLKSMEKTLTYGNVT----------------------------NTDYGNSV 352
WGHLR+LH L+ +K L +GN + NT +V
Sbjct: 605 WGHLRDLHHALRHCKKALLWGNPSVQPLGKLYEARVFEMKEKNVCVAFLSNHNTKEDGTV 664
Query: 353 S--GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMI 410
+ G Y + S+SIL DCKT F+T VN+Q N + DQ W EM
Sbjct: 665 TFRGQKYFVARRSISILADCKTVVFSTQHVNSQHNQRTFH----FADQTVQDNVW--EMY 718
Query: 411 NDFVVRGKGHFALNT---LIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS 467
++ + ++ T L T D +DYLWY T+ L+ DD L +
Sbjct: 719 SEEKIPRYSKTSIRTQRPLEQYNQTKDKTDYLWYTTSFRLETDDLPYRKEVKPVLEGAGT 778
Query: 468 GQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
G + + E+ + L G N +++LS+T+GL + GS +
Sbjct: 779 G-----------------RRSTRSFTMEKAMDLKVGVNHVAILSSTLGLMDSGSYLEHRM 821
Query: 528 NGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
G V V G T DL+++ W + G N PL
Sbjct: 822 AG----VYTVTIRGLNTGTLDLTTNGWGHVPG----------------------KDNQPL 855
Query: 588 NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDY 647
TWY+ F+ P DPVV++L MGKGF +VNG LGRYW +Y
Sbjct: 856 ----TWYRRRFDPPSGTDPVVIDLTPMGKGFLFVNGEGLGRYWVSY-------------- 897
Query: 648 RGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTAC- 706
+ G PSQ YHVPRS ++ NTL+ FEE GG P I TV C
Sbjct: 898 ---------HHALGKPSQYLYHVPRSLLRPKGNTLMFFEEEGGKPDAIMILTVKRDNICT 948
Query: 707 -----GQAH-----ENK-------------------TMELTCHGRR-ISEIKYASFGDPQ 736
AH E+K T L+C ++ I + +AS+G+P
Sbjct: 949 FMTEKNPAHVRWSWESKDSQPKAVAGAGAGAGGLKPTAVLSCPTKKTIQSVVFASYGNPL 1008
Query: 737 GACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
G CG + GSC A ++EK C+G+K+CS+ S G GT L V+A C
Sbjct: 1009 GICGNYTVGSCHAP-RTKEVVEKACIGRKTCSLVVSSEVYGGDVHCPGTTGTLAVQAKC 1066
Score = 350 bits (899), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 185/412 (44%), Positives = 244/412 (59%), Gaps = 54/412 (13%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
+++D R++ IDG R+I SGSIHYPRS P WPDLI KAKEGGL+ IE+YVFWN HEP +
Sbjct: 33 ITYDRRSLIIDGHREIFFSGSIHYPRSPPDTWPDLISKAKEGGLNVIESYVFWNGHEPEQ 92
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF-PVWLHNMPGIEELRTTN 147
Y+F G DLI+F K IQ++ +Y I+RIGP+V AEWN+G + +P I RT N
Sbjct: 93 GVYNFEGRYDLIKFFKLIQEKEMYAIVRIGPFVQAEWNHGFVCHIGSGEIPDI-IFRTNN 151
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
+ F M+ F TLIV+ K+ KLFASQGGPIILAQIENEY ++ + +AG YINW AK
Sbjct: 152 EPFKKYMKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWAAK 211
Query: 208 MATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWG 254
MA + + GVPWIMC+++ AP + P + P +WTENWT ++ +G
Sbjct: 212 MAIATNTGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPADKKKPLLWTENWTAQYRVFG 271
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYM------------------------------ 284
+R+AED+AF+VARFF GGT NYYM
Sbjct: 272 DPPSQRSAEDIAFSVARFFSVGGTMANYYMVVLNSNSNLFLTKKRDEISDRTDTGGFTCV 331
Query: 285 ----YHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTY 340
YHGGTNFGR +G ++ Y +AP+DE+G +PKWGHLR+LH L+ +K L +
Sbjct: 332 NNQQYHGGTNFGR-NGAAFVMPRYYDEAPLDEFGLYKEPKWGHLRDLHHALRHCKKALLW 390
Query: 341 GNVTNTDYGNSVSGSSYNLPAWSVSILPDCKT----EEFNTAKVNTQTNVKV 388
GN + G G Y + S+SIL DCKT ++F T VN K+
Sbjct: 391 GNPSVQPLGKLTRGQKYFVARRSISILADCKTVKYMKQFVTLIVNKLKEAKL 442
>gi|217075793|gb|ACJ86256.1| unknown [Medicago truncatula]
Length = 268
Score = 352 bits (904), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 166/268 (61%), Positives = 201/268 (75%), Gaps = 12/268 (4%)
Query: 9 RAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAK 68
RA + L+L V +D RA+ IDG+R++L+SGSIHYPRSTP MWPDLI+K+K
Sbjct: 2 RAFEIVLVLLWFLPKMFCTNVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSK 61
Query: 69 EGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYG 128
+GGLD IETYVFWN HEP++ QYDF G DL++F+K + + GLYV LRIGPYVCAEWNYG
Sbjct: 62 DGGLDVIETYVFWNLHEPVKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCAEWNYG 121
Query: 129 GFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYG 188
GFP+WLH +PGI + RT N+ F EM+ FT IVD+ K+EKL+ASQGGPIIL+QIENEYG
Sbjct: 122 GFPLWLHFIPGI-KFRTDNEPFKAEMKRFTAKIVDLMKQEKLYASQGGPIILSQIENEYG 180
Query: 189 NVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPN 237
N+ S YG AGKSYINW AKMATSLD GVPW+MCQ+ DAP P+ FTPN+
Sbjct: 181 NIDSHYGSAGKSYINWAAKMATSLDTGVPWVMCQQGDAPDPIINTCNGFYCDQFTPNSNT 240
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDL 265
PK+WTENW+GWF S+GG P R E L
Sbjct: 241 KPKMWTENWSGWFLSFGGAVPHRPVEIL 268
>gi|359476803|ref|XP_003631891.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 11-like [Vitis
vinifera]
Length = 722
Score = 352 bits (902), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 262/837 (31%), Positives = 376/837 (44%), Gaps = 211/837 (25%)
Query: 13 LCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGL 72
+CL++ L + + VS+DGR + ++G+R++L SGSIHYPRS P MWPD+I KA+ G
Sbjct: 41 VCLVVVRLSMVGVK-GVSYDGRPLIVNGKRELLFSGSIHYPRSIPEMWPDIIXKARHG-- 97
Query: 73 DAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPV 132
GL VI + A WN
Sbjct: 98 -------------------------------------GLNVI-----HTYAFWN------ 109
Query: 133 WLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS 192
LH + + M+ FT +I+DM KEK ASQGGPIILA +++
Sbjct: 110 -LH------------EPVQDHMKRFTRMIIDMMSKEKXIASQGGPIILALVDSAIA---- 152
Query: 193 DYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM------------FT-PNNPNSP 239
+ + G ++W MA L G+P +MC++ DAP P+ FT PN PN
Sbjct: 153 -FKEMGTRCVHWAGTMAVGLKTGIPXVMCKQKDAPDPVINTCKGRNCGDTFTGPNRPNKR 211
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
+ + + G ++ +G +R AEDLAF+ F GT NYYMY+ TNFGRT+ +
Sbjct: 212 SV-SNHXLGMYRVFGDPPSQRAAEDLAFSX--FISKNGTLANYYMYYSVTNFGRTTSS-F 267
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYG------------------ 341
TT Y +AP+DEYG + KWGHLR+LH L+ +K L +G
Sbjct: 268 ATTCYYDEAPLDEYGLPRETKWGHLRDLHAALRLSKKALLWGVTSAQKLGEDLEARIYEK 327
Query: 342 ------------NVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVK 389
N+T T ++ GS Y LP S+S LPDCKT FNT V +Q +V
Sbjct: 328 PGSNICATFLLNNITRTPTTTTLRGSKYYLPQHSISNLPDCKTVVFNTQTVVSQYSVNKN 387
Query: 390 RPNQAGNDQAPLQWKWRPEMINDF---VVRGKGHFALNTLIDQKSTNDVSDYLWYMTNAD 446
LQW + + + + K L T+ T D +DYLWY TN +
Sbjct: 388 -----------LQWXMSQDALPTYEECPTKTKSPVELMTM-----TKDTTDYLWYTTNIE 431
Query: 447 LKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQW--TKYGASND---LFERPVKLT 501
L ++++ G V+HA++NG Y++ T++G++ + +F +P+ L
Sbjct: 432 LARTGLPFRKDVLRVPQVSNLGHVMHAFLNGEYMEFYLTGTRHGSNVEKSFVFNKPITLK 491
Query: 502 RGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLY 561
G NQI+ L ATVGL + GS + G+ + V +
Sbjct: 492 AGLNQIAPLGATVGLPDSGSYMEHRLAGV-------------------------HNVAIQ 526
Query: 562 GLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWV 621
GL+ + K GW +K F+AP + PV L L M KG AW+
Sbjct: 527 GLNTRTIDLPK-----NGWG------------HKAYFDAPEGDVPVALELSTMAKGMAWI 569
Query: 622 NGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNT 681
NG ++ YW +YL+ G PSQ YHVPR+++K N
Sbjct: 570 NGKSIDXYWVSYLSP-----------------------LGKPSQSVYHVPRAFLKTSDNL 606
Query: 682 LVLFEEFGGNPSQINFQTVVVGTACGQAHENKTMELTCHGRRISEIKYASFGDPQGACGA 741
LVLFEE G NP I T+ T C E+ + R S+I+ FGDP G C
Sbjct: 607 LVLFEETGRNPDGIEILTLNRDTICCYISEHHPTHVRSWKREASDIQI--FGDPTGTCXE 664
Query: 742 FKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANL---GATSCAAGTVKRLVVEALC 795
F G+C A + ++EK C+GK SCSI + + G + +G K L V+ LC
Sbjct: 665 FIPGNCAAP-NSXKVVEKHCLGKSSCSIPVEQEIVSKDGISISGSGITKALAVQVLC 720
>gi|281209972|gb|EFA84140.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
PN500]
Length = 707
Score = 349 bits (896), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 209/558 (37%), Positives = 307/558 (55%), Gaps = 76/558 (13%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
+++Y+V++DGR++ I+GERK+ +SGS+HYPRSTP +W ++ +K G++ I+TYVFW+
Sbjct: 103 NVSYKVTYDGRSLLINGERKLFVSGSVHYPRSTPTIWKKVLALSKNSGINMIDTYVFWDL 162
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
HEP R Y+F GN +L F+ Q GL+V LRIGPY+CAEWNYGG P+WL ++PGI ++
Sbjct: 163 HEPQRGVYNFEGNANLKHFLDLCQQNGLFVNLRIGPYICAEWNYGGLPIWLKDIPGI-KM 221
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYIN 203
R N +M E++ + IVD FA QGGPI+LAQIENEY V Y ++G+ + +
Sbjct: 222 RDFNTQYMEEVERWMKFIVDYL--HGYFAPQGGPIVLAQIENEYNWVQWRYQESGRKFAH 279
Query: 204 WCAKMATSLDIGVPWIMCQESDAPSPMFTPNN--------------PNSPKIWTENWTGW 249
WCA +A LDIG+PWIMCQ+ D P+ + T N + P ++TENW+GW
Sbjct: 280 WCADLANRLDIGIPWIMCQQDDIPTVINTCNGYYCHEWINFHWNNFKDQPPLFTENWSGW 339
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAP 309
F +W R DL ++ AR+F GG NYYM+HGGTNFGR S GP + SYDYDAP
Sbjct: 340 FNNWVNAVRHRPVADLLYSAARWFASGGALMNYYMWHGGTNFGRKS-GPMIALSYDYDAP 398
Query: 310 IDEYGHLNQPKWGHLRELHKLLKSMEKTLT------------------YGN--------V 343
++EYG+ PK+ R+ +KL+ S+E L Y N +
Sbjct: 399 LNEYGNPRNPKYSQTRDFNKLILSLEDILLSQYPPTPIFLANNISVIHYRNGNNSASFII 458
Query: 344 TNTDYGNS---VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVK-RPNQAGNDQA 399
+ + GNS G SY A+SV IL + + ++ T+ V+ PN +
Sbjct: 459 NSNENGNSKVMFEGRSYFSYAYSVQILKNYVSVFDSSQNPRNYTDTVVESEPNIPFAN-- 516
Query: 400 PLQWKWRPEMINDFVVRGKGHFAL--NTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSG 456
+I+ V R +L N L++Q + T D +DY+WY T + D I
Sbjct: 517 --------SIISKHVERFDFEESLYDNRLMEQLNLTKDETDYIWYTTMINHDQDGEI--- 565
Query: 457 SSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGL 516
L++ + ++H +V+ YV + S+ L V L G + + LL +G+
Sbjct: 566 -----LKVINKTDIVHVFVDSYYVGTI-----MSDSLAITGVPL--GPSTLQLLHTKMGI 613
Query: 517 QNYGSKFDMVPNGIPGPV 534
Q+Y + GI GPV
Sbjct: 614 QHYELHMENTKAGILGPV 631
>gi|188501582|gb|ACD54708.1| beta-D-galactosidase-like protein [Adineta vaga]
Length = 735
Score = 341 bits (875), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 252/747 (33%), Positives = 368/747 (49%), Gaps = 108/747 (14%)
Query: 25 LAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAH 84
+ YRVS+D RAITI+G R +L SG IHYPRSTP MWP L+ KAKE GL+ I+TYVFWN H
Sbjct: 30 IPYRVSYDHRAITINGNRTLLFSGVIHYPRSTPAMWPYLMSKAKEQGLNTIQTYVFWNIH 89
Query: 85 EPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELR 144
E R YDF+G +L F++ + GL+V LR+GPYVCAEW+YG PVWL+N+P I R
Sbjct: 90 EQKRGTYDFSGRANLSLFLQEAANAGLFVNLRLGPYVCAEWDYGALPVWLNNIPNI-AFR 148
Query: 145 TTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW 204
++N + +EM+ F + I+ + A GGPIILAQIENEYG ++Y++W
Sbjct: 149 SSNDAWKSEMKRFLSDII--VYVDGFLAKNGGPIILAQIENEYGG-------NDRAYVDW 199
Query: 205 CAKMATS--LDIGVPWIMCQESDAPSPMFTPNN----------------PNSPKIWTENW 246
C + ++ +PWIMC A S + T N PN P ++TENW
Sbjct: 200 CGSLVSNDFASTQIPWIMCNGLAANSTIETCNGCNCFDDGWMDRHRRTYPNQPLLFTENW 259
Query: 247 TGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDY 306
GWF+ WG RT EDLA++VA +F GG + YYM+HGG ++GRT GG LTT+Y
Sbjct: 260 -GWFQGWGEGLGIRTPEDLAYSVAEWFANGGAYHAYYMWHGGNHYGRT-GGSGLTTAYSD 317
Query: 307 DAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLT----------YGNVTNTDYGNSVSGSS 356
D + G N+PK+ HL L +LL S + L Y N G S
Sbjct: 318 DVILRADGTPNEPKFTHLNRLQRLLASQAQVLLSQDSNRLSIPYWNGKQWTVGTQQMVYS 377
Query: 357 Y--------NLPAWSVSIL----------PDCKTEEFNTAKVNTQTNVK-VKRPNQ--AG 395
Y N A+S+ +L + ++N + +V + R N
Sbjct: 378 YPPSVQFVINQAAFSLFVLFNKQNISIAGQSVQIYDYNEHLLWNSADVSGISRNNTFLVP 437
Query: 396 NDQAPLQWKWRPE-MINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPI 453
PL W+ E +D V +T ++Q + TND + YLWY N L +
Sbjct: 438 IVVGPLDWQVYSEPFTSDLPV-----IVASTPLEQLNLTNDETIYLWYRRNVSLSQPS-V 491
Query: 454 LSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQI-SLLSA 512
+ T R NS + G + D T+ + ++ + + I +LS
Sbjct: 492 QTIVQVQTRRANSLLFFMDRQFVGYFDDHSHTQGTINVNITLNLSQFLPNQQYIFEILSV 551
Query: 513 TVGLQNYGSKFDMVP-----NGIPGPVLLVGRA--GDETIIKDLSSHKWTYKVGLYGLDD 565
++G+ N F++ P GI G V L G++ GDE I W ++ GL+G +
Sbjct: 552 SLGIDN----FNIGPGSFEYKGIVGNVSLGGQSLVGDEASI-------WEHQKGLFG-EA 599
Query: 566 KKFYNAKAANSERGWSSK-NVPLNRRMTWYKTTFE------APLENDPVVLNLQGMGKGF 618
+ Y + + + W+ K +N+ +TW++T F+ L +P++L+ G +G
Sbjct: 600 HQIYTEQGSKTVE-WNPKWTTVINKPVTWFQTRFDLNHLAREDLNANPILLDAFGFNRGH 658
Query: 619 AWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDG 678
A+VNG ++G YW E C C + NC PSQ +YH+ W+K
Sbjct: 659 AFVNGNDIGLYWLI----EGTCQNNLC------CCLQNQTNCQQPSQRYYHISSDWLKPT 708
Query: 679 VNTLVLFEEFGG-NPSQINFQTVVVGT 704
N L +FEE G +P + ++ T
Sbjct: 709 NNLLTVFEEIGASSPKSVGLVQRIINT 735
>gi|188501572|gb|ACD54699.1| beta-D-galactosidase [Adineta vaga]
Length = 735
Score = 340 bits (873), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 251/737 (34%), Positives = 357/737 (48%), Gaps = 115/737 (15%)
Query: 25 LAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAH 84
+ Y VS+D RAITI+G R +L SG IHYPRSTP MWP L+ KAKE GL+ I+TYVFWN H
Sbjct: 30 IPYHVSYDHRAITINGNRTLLFSGVIHYPRSTPAMWPYLMSKAKEQGLNTIQTYVFWNMH 89
Query: 85 EPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELR 144
E R YDF+G +L F++ + GL+V LR+GPYVCAEW+YG PVWL+N+P I R
Sbjct: 90 EQKRGTYDFSGRANLSLFLQEAANAGLFVNLRLGPYVCAEWDYGALPVWLNNIPNI-AFR 148
Query: 145 TTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW 204
++N + +EM+ F + I+ + A GGPIILAQIENEYG ++Y++W
Sbjct: 149 SSNDAWKSEMKRFLSDII--VYVDGFLAKNGGPIILAQIENEYGG-------NDRAYVDW 199
Query: 205 CAKMATS--LDIGVPWIMCQESDAPSPMFTPNN----------------PNSPKIWTENW 246
C + ++ +PWIMC A S + T N PN P ++TENW
Sbjct: 200 CGSLVSNDFASTQIPWIMCNGLAANSTIETCNGCNCFDDGWMDRHRRTYPNQPLLFTENW 259
Query: 247 TGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDY 306
GWF+ WG RT EDLA++VA +F GG + YYM+HGG ++GRT GG LTT+Y
Sbjct: 260 -GWFQGWGEGLGIRTPEDLAYSVAEWFANGGAYHAYYMWHGGNHYGRT-GGSGLTTAYSD 317
Query: 307 DAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLT----------YGNVTNTDYGNSVSGSS 356
D + G N+PK+ HL L +LL S + L Y + G S
Sbjct: 318 DVILRADGTPNEPKFTHLNRLQRLLASQAQVLLSQDSARLPIPYWDGKQWSVGTQQMVYS 377
Query: 357 Y--------NLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGND----------- 397
Y N A+S+ +L + + V N + N A
Sbjct: 378 YPPSIQFVINQAAFSLFVLFNKQNISIAGQSVQIYDNNEHLLWNSADVSGIFRNNTFLVP 437
Query: 398 --QAPLQWKWRPE-MINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPI 453
PL W+ E ++D V +T ++Q + TND + YLWY N L
Sbjct: 438 IVVGPLDWQVYSEPFLSDLPV-----IVASTPLEQLNLTNDETIYLWYRRNVSLSQPSA- 491
Query: 454 LSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQ------- 506
+ T R NS + G Y D G N V +T +Q
Sbjct: 492 QTIVQVQTRRANSLIFFMDRQFVG-YFDDHSHAQGTIN------VNITLNLSQFLPNQQY 544
Query: 507 -ISLLSATVGLQNYG---SKFDMVPNGIPGPVLLVGRA--GDETIIKDLSSHKWTYKVGL 560
+LS ++G+ N+ F+ GI G V L G++ GDE I W ++ GL
Sbjct: 545 LFEILSVSLGIDNFNIGPGSFEY--KGIVGNVSLGGQSLVGDEASI-------WEHQKGL 595
Query: 561 YGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFE------APLENDPVVLNLQGM 614
+G + + Y + + + +N+ +TW++T F+ L +PV+L+ G+
Sbjct: 596 FG-EAYQIYTEQGSKTVEWNPRWTTAINKSVTWFQTRFDLNHLVREDLNANPVLLDAFGL 654
Query: 615 GKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSW 674
+G A+VNG ++G YW E C + C + NC PSQ +YH+P W
Sbjct: 655 NRGHAFVNGNDIGLYWLI----EGTCQNKLC------CCLQNQTNCQQPSQRYYHIPSDW 704
Query: 675 IKDGVNTLVLFEEFGGN 691
+K N L +FEE G +
Sbjct: 705 LKPTNNLLTVFEEIGAS 721
>gi|348687417|gb|EGZ27231.1| hypothetical protein PHYSODRAFT_553859 [Phytophthora sojae]
Length = 825
Score = 332 bits (850), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 249/776 (32%), Positives = 370/776 (47%), Gaps = 136/776 (17%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
Y VS+ R IDG R +LL GSIHYPRS+ G W L++ AK GL+ IE YVFWN HE
Sbjct: 85 YSVSYSARGFEIDGRRTLLLGGSIHYPRSSEGEWETLLRAAKRDGLNHIEMYVFWNLHEQ 144
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
R ++F GN + RF + + GL++ +R GPYVCAEW+ GG P+WL+ +PG+ ++R++
Sbjct: 145 ERGVFNFAGNANATRFYELAAEVGLFLHVRFGPYVCAEWSNGGLPLWLNWIPGM-KVRSS 203
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
N + EM+ F T +V++++ A GGPII+AQIENE+ M D Y+ WC
Sbjct: 204 NAPWQWEMERFVTYMVELSR--PFLAKNGGPIIMAQIENEFA--MHD-----PEYVEWCG 254
Query: 207 KMATSLDIGVPWIMCQESDAPSPMFTPN--------------NPNSPKIWTENWTGWFKS 252
+ LD +PW+MC + A + + + N P+ P +WTE+ GWF++
Sbjct: 255 DLVKRLDTSIPWVMCYANAAENTILSCNGNDCVDFAVKHVKERPSDPLVWTED-EGWFQT 313
Query: 253 WG--GKDP----KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDY 306
W K+P +RTAED+A+AVAR+F GG NYYMYHGG NFGR + +TT Y
Sbjct: 314 WAKDKKNPLPNDQRTAEDMAYAVARWFAVGGAAHNYYMYHGGNNFGRAASAG-VTTKYAD 372
Query: 307 DAPIDEYGHLNQPKWGHLRELHK-------LLKSMEKTLTYGNVTNTDYGNSVSGSS--- 356
+ G N+PK HLR+LH+ +L ++ L + + +G + SS
Sbjct: 373 GVNLHSDGLSNEPKRSHLRKLHEALIDCNDILMRNDRQLLHPHELAPTHGETAEASSLQQ 432
Query: 357 ---------------------------------YNLPAWSVSILPDCKTEEFNTAKVNTQ 383
Y L S+ I+ D FNTA V
Sbjct: 433 RAFIYGAEDGPNQVAFLENQADKKVTVVFRDNKYELAPTSMMIIKDGAL-LFNTADVRKS 491
Query: 384 TNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMT 443
V R A LQW+ E+ + + A + + T D SDYL Y T
Sbjct: 492 FPGTVHRAYTPIVQAATLQWETWSELNVSSLTPRRRVVAERPVEQLRLTADRSDYLTYET 551
Query: 444 NADLKD-DDPILSGSSNMTLRINS-SGQVLHAYVNGNYVDSQWTKYGASN----DLFERP 497
+ D PI S T+++ S + A+V+G + + Y N F P
Sbjct: 552 TFTVDPADTPIDIDSDASTVKVTSCEASSIIAFVDGWLIGERNLAYPGGNCSKEFRFSLP 611
Query: 498 --VKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWT 555
+ +TR ++ + L+S ++G+ + GS G+ G V R G + + K H+W
Sbjct: 612 TNIDVTR-QHSLKLVSVSLGIYSLGSNHT---KGLTGKV----RVGRKNLAK---GHQWE 660
Query: 556 YKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRR--MTWYKTT-----FEAPLENDPV- 607
L G + + Y + +S V + R M+WY T+ FE P E DPV
Sbjct: 661 MYPTLVG-EQLEIYRPEWLSSVPWTPVPRVVASGRQLMSWYWTSFSYPAFELPAEADPVS 719
Query: 608 -----VLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGN 662
+L+ G+ +G A++NG++LGRYW L ++G
Sbjct: 720 EPFSILLDCIGLTRGRAYINGHDLGRYW---LVNDEGEFV-------------------- 756
Query: 663 PSQIWYHVPRSW-IKDGVNTLVLFEEFGGNPSQINF-QTVVVGTACGQAHENKTME 716
Q +YHVPR W +KD N LV+F+E GG+ + + + +V A G A K +E
Sbjct: 757 --QRYYHVPRDWLVKDQANVLVVFDELGGSVADVRLVSSSMVPDAVGDAAAAKFLE 810
>gi|227204157|dbj|BAH56930.1| AT4G35010 [Arabidopsis thaliana]
Length = 377
Score = 331 bits (848), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 153/293 (52%), Positives = 203/293 (69%), Gaps = 15/293 (5%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DG ++ IDG+R++L SGSIHYPRSTP MWP +IK+AK+GGL+ I+TYVFWN HEP +
Sbjct: 41 VTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHEPQQ 100
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+++F+G DL++FIK IQ G+YV LR+GP++ AEW +GG P WL +PGI RT NK
Sbjct: 101 GKFNFSGRADLVKFIKLIQKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGI-FFRTDNK 159
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F + + +I+D K+E+LFASQGGPIIL QIENEY V Y G +YI W + +
Sbjct: 160 QFKEHTERYVRMILDKMKEERLFASQGGPIILGQIENEYSAVQRAYKQDGLNYIKWASNL 219
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT-------------PNNPNSPKIWTENWTGWFKSWGG 255
S+ +G+PW+MC+++DAP PM PN N P +WTENWT F+ +G
Sbjct: 220 VDSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNRENKPSLWTENWTTQFRVFGD 279
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDA 308
+R+ ED+A++VARFF GT NYYMYHGGTNFGRTS Y+TT Y DA
Sbjct: 280 PPTQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYEDA 331
>gi|3850659|emb|CAA10064.1| beta galactosidase [Carica papaya]
Length = 347
Score = 323 bits (829), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 171/355 (48%), Positives = 212/355 (59%), Gaps = 48/355 (13%)
Query: 128 GGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEY 187
GGFPVWL +PGI RT N+ F MQ FT IV M K EKLF +QGGPIIL+QIENE+
Sbjct: 1 GGFPVWLKYVPGIA-FRTDNEPFKAAMQKFTEKIVSMMKAEKLFQTQGGPIILSQIENEF 59
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNP 236
G V + G GK+Y W A+MA LD GVPWIMC++ DAP P+ F PN
Sbjct: 60 GPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKD 119
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
PK+WTE WTGW+ +GG P R AED+AF+VARF Q GG+F NYYMYHGGTNFGRT+G
Sbjct: 120 YKPKMWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQGGGSFLNYYMYHGGTNFGRTAG 179
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN------ 350
GP++ TSYDYDAP+DEYG +PKWGHLR+LHK +KS E L + + T G+
Sbjct: 180 GPFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKSCESALVSVDPSVTKLGSNQEAHV 239
Query: 351 ----------------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKV 388
S G Y+LP WS+SILPDCKTE +NTAKV +Q++
Sbjct: 240 FKSESDCAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGSQSSQVQ 299
Query: 389 KRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYM 442
P +G + W+ + L+ L +Q + T D +DYLWYM
Sbjct: 300 MTPVHSG-------FPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWYM 347
>gi|414881560|tpg|DAA58691.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 655
Score = 322 bits (824), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 205/547 (37%), Positives = 273/547 (49%), Gaps = 72/547 (13%)
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN------ 350
G + Y D + G L +PKWGHL+ELHK +K E L G+ T GN
Sbjct: 132 GADVQMPYRLDHILVADGLLREPKWGHLKELHKAIKLCEPALVAGDPIVTSLGNAQQASV 191
Query: 351 -----------------------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK 387
S +G Y+LP WS+SILPDCKT +NTA V +Q +
Sbjct: 192 FRSSTDACVAFLENKDKVSYARVSFNGMHYDLPPWSISILPDCKTTVYNTASVGSQ--IS 249
Query: 388 VKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNAD 446
+ AG W+ E IN G FA L++Q T D +DYLWY T D
Sbjct: 250 QMKMEWAGG----FTWQSYNEDINSL---GDESFATVGLLEQINVTRDNTDYLWYTTYVD 302
Query: 447 LKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASND---LFERPVKLTRG 503
+ D+ LS N L + S+G LH +VNG T YG+ D + VKL G
Sbjct: 303 IAQDEQFLSNGKNPMLTVMSAGHALHIFVNGQLTG---TVYGSVEDPKLTYSGNVKLWSG 359
Query: 504 KNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGL 563
N IS LS VGL N G F+ GI GPV L G +DL+ KWTYKVGL G
Sbjct: 360 SNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNEGR---RDLTWQKWTYKVGLKGE 416
Query: 564 DDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNG 623
+ +++ E G + PL +WYK F AP ++P+ L++ MGKG W+NG
Sbjct: 417 ALSLHSLSGSSSVEWGEPVQKQPL----SWYKAFFNAPDGDEPLALDMSSMGKGQIWING 472
Query: 624 YNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLV 683
+GRYWP Y A + CDYRG Y KC NCG+ SQ WYHVPRSW+ N LV
Sbjct: 473 QGIGRYWPGYKASG---TCGICDYRGEYDEKKCQTNCGDSSQRWYHVPRSWLNPTGNLLV 529
Query: 684 LFEEFGGNPSQINFQTVVVGTACG--------------QAHENKTMELTC-HGRRISEIK 728
+FEE+GG+P+ I+ + G+ C + +E + L C HGR+++ IK
Sbjct: 530 IFEEWGGDPTGISMVKRIAGSICADVSEWQPSMANWRTKGYEKAKVHLQCDHGRKMTHIK 589
Query: 729 YASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKR 788
+ASFG PQG+CG++ +G C A + K C+G++ C + G C GT+KR
Sbjct: 590 FASFGTPQGSCGSYSEGGCHAH-KSYDIFWKSCIGQERCGVSVVPDAFGGDPC-PGTMKR 647
Query: 789 LVVEALC 795
VVEA+C
Sbjct: 648 AVVEAIC 654
>gi|195615772|gb|ACG29716.1| beta-galactosidase precursor [Zea mays]
Length = 450
Score = 315 bits (807), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 193/465 (41%), Positives = 252/465 (54%), Gaps = 48/465 (10%)
Query: 265 LAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHL 324
+AFAVARF Q GG+F NYYMYHGGTNF RTSGGP++ TSYDYDAPIDEYG L QPKWGHL
Sbjct: 1 MAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGLLRQPKWGHL 60
Query: 325 RELHKLLKSMEKTLTYGNVTNTDYGN-----------------------------SVSGS 355
R+LHK +K E L G+ T GN +G
Sbjct: 61 RDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSSGGACAAFLSNYHTSAAARVVFNGR 120
Query: 356 SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVV 415
Y+LPAWS+S+LPDCK FNTA V+ + P AG W+ E N
Sbjct: 121 RYDLPAWSISVLPDCKAAVFNTATVSEPSAPARMSP--AGG----FSWQSYSEATNSLDG 174
Query: 416 RGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAY 474
R F + L++Q S T D SDYLWY T ++ ++ L L + S+G L +
Sbjct: 175 RA---FTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQLTVYSAGHSLQVF 231
Query: 475 VNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPV 534
VNG + + Y + + VK+ +G N+IS+LSA VGL N G+ ++ G+ GPV
Sbjct: 232 VNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGTHYETWNVGVLGPV 291
Query: 535 LLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWY 594
L G + +DLS+ KWTY++GL+G A +++ E G ++ PL TW+
Sbjct: 292 TLSGLNEGK---RDLSNQKWTYQIGLHGESLGVQSVAGSSSVEWGSAAGKQPL----TWH 344
Query: 595 KTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSD 654
K F AP + PV L++ MGKG AWVNG ++GRYW +Y A C Y G Y
Sbjct: 345 KAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYW-SYKASSS-GGCGGCSYAGTYSET 402
Query: 655 KCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
KC CG+ SQ +YHVPRSW+ N LVL EEFGG+ + T
Sbjct: 403 KCQTGCGDVSQRYYHVPRSWLNPSGNLLVLLEEFGGDLPGVKLVT 447
>gi|34481809|emb|CAD44190.1| putative beta-galactosidase [Mangifera indica]
gi|34481811|emb|CAD44191.1| putative beta-galactosidase [Mangifera indica]
Length = 286
Score = 304 bits (778), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 156/287 (54%), Positives = 187/287 (65%), Gaps = 41/287 (14%)
Query: 124 EWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQI 183
EWN+GGFPVWL +PGI RT N+ F MQNFT IV M K EKLF SQGGPIIL+QI
Sbjct: 1 EWNFGGFPVWLKFVPGIS-FRTDNEPFKRAMQNFTQKIVQMMKDEKLFESQGGPIILSQI 59
Query: 184 ENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FT 232
ENEY +G AG++Y+NW A+MAT L+ GVPW+MC+E DAP P+ F+
Sbjct: 60 ENEYEPERMKFGSAGEAYMNWAAQMATGLNTGVPWVMCKEYDAPDPVINTCNGFYCDKFS 119
Query: 233 PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG 292
PN P PK+WTE WTGWF +GG +R EDLAFAVARF Q GG+F NYYMYHGGTNFG
Sbjct: 120 PNKPFKPKLWTEAWTGWFTEFGGPIYQRPVEDLAFAVARFIQAGGSFVNYYMYHGGTNFG 179
Query: 293 RTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTY--------GNVT 344
RT+GGP++TTSYDYDAPIDEYG + +PK+ HL+ELH+ +K E L Y GN
Sbjct: 180 RTAGGPFITTSYDYDAPIDEYGLIRRPKYDHLKELHQAVKLCETALLYADPYVMSLGNYE 239
Query: 345 NTDYGNSVSG------SSYN---------------LPAWSVSILPDC 370
+S SG S++N LP WS+SILPDC
Sbjct: 240 QAHVFSSTSGGCAAFLSNFNSKSSARVTFNRKHFYLPPWSISILPDC 286
>gi|84468366|dbj|BAE71266.1| putative beta-galactosidase [Trifolium pratense]
Length = 425
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 175/433 (40%), Positives = 231/433 (53%), Gaps = 43/433 (9%)
Query: 306 YDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSG----------- 354
YDAP+DEYG PKWGHL++LHK +K E L YG N G SV
Sbjct: 1 YDAPVDEYGLPRLPKWGHLKDLHKAIKLCEHVLLYGKSVNVSLGPSVEADVYTDSSGACA 60
Query: 355 ------------------SSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRP---NQ 393
+SY++PAWSVSILPDCK +NTAKV TQTN P Q
Sbjct: 61 AFIANVDDKNDKTVEFRNASYHIPAWSVSILPDCKNVVYNTAKVTTQTNKIAMIPEKLQQ 120
Query: 394 AGNDQAPLQWK-WRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDD 451
+ Q +W W+ + + GK F +N +D +T D +DYLW+ T+ + +++
Sbjct: 121 SDKGQKTFKWDVWK----ENPGIWGKPDFVINGFVDHINTTKDTTDYLWHTTSISIDENE 176
Query: 452 PILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLS 511
+L S L I S G LHA+VN Y + + S F+ P+ L GKN+I+LLS
Sbjct: 177 ELLKKGSKPVLVIESKGHALHAFVNQKYQGTAYGNGSHSAFTFKNPISLKAGKNEIALLS 236
Query: 512 ATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNA 571
TVGLQ G +D V G+ + + ++TI DLSS+ WTYK+G+ G + K Y
Sbjct: 237 LTVGLQTAGPFYDFVGAGVTS--VKIKGLNNKTI--DLSSNAWTYKIGVQG-EHLKIYQG 291
Query: 572 KAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWP 631
NS S+ P + +TWYK +AP ++PV L++ MGKGFAW+NG +GRYWP
Sbjct: 292 NGLNSVSWTSTSEPPKGQTLTWYKAIVDAPPGDEPVGLDMLYMGKGFAWLNGEGIGRYWP 351
Query: 632 TYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGN 691
+ E CDYRG + DKC CG PSQ WYHVPRSW K N LV FEE GG+
Sbjct: 352 RISEFKKEDCVEECDYRGKFNPDKCDTGCGEPSQKWYHVPRSWFKPSGNVLVFFEEKGGD 411
Query: 692 PSQINFQTVVVGT 704
P++I F V T
Sbjct: 412 PTKITFVRRKVST 424
>gi|325183103|emb|CCA17560.1| betagalactosidase putative [Albugo laibachii Nc14]
Length = 811
Score = 297 bits (761), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 225/732 (30%), Positives = 344/732 (46%), Gaps = 119/732 (16%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
Y V + R IDG+ ILL GSIHY RSTP W L+ KAKE GL+ ++ Y+FWN HEP
Sbjct: 97 YDVKYTKRGFVIDGKASILLGGSIHYARSTPDTWDSLLAKAKEDGLNLVQLYIFWNFHEP 156
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
R + F +L F + + GL+V LR GPYVCAEWN GG P+WL +PG+ ++R+
Sbjct: 157 RRGSFYFADRGNLTHFFERVVAHGLFVHLRFGPYVCAEWNRGGLPLWLDRIPGM-KVRSN 215
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
++ + EM +++++A+ F+ GGPII+AQIENEY +Y+ W +
Sbjct: 216 SESWRQEMNRIILIMINLAR--PYFSVNGGPIIMAQIENEYNG-------HDPTYVAWLS 266
Query: 207 KMATSLDIGVPWIMCQESDAPSPMFTPNN--------------PNSPKIWTENWTGWFKS 252
++ L IG+PW MC + A + + T N+ P+ P +WTEN W++
Sbjct: 267 QLVRKLGIGIPWTMCNGASAVNTISTCNDNDCFQFAEKNAKVFPSQPLVWTEN-EAWYEK 325
Query: 253 WG-------GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYD 305
W G++ +R+ E +A+ VAR+F GG NYYMYHGG NFGRT+ +TT Y
Sbjct: 326 WATKNIAQDGQNDQRSPEQVAYVVARWFAVGGAMHNYYMYHGGNNFGRTASAG-VTTMYA 384
Query: 306 YDAPIDEYGHLNQPKWGHLRELH-------KLLKSMEKTL-------------------T 339
A + G N+PK HLR+LH K L S E+ L
Sbjct: 385 DGAILHHDGLDNEPKRSHLRKLHHTLIRCNKALLSNERQLNHAKPLGPEGKNAYTQRAYI 444
Query: 340 YGNVTNTDYGNSVSGS-------SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPN 392
YGN + + +++ + Y LP ++ IL D +NT+ V+ + R
Sbjct: 445 YGNCSFLENTHAIHRACFRYQLKEYCLPPQTIVIL-DHNNVLYNTSDVSGTLGSRSTRSF 503
Query: 393 QAGNDQAPLQWK-WRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDD 450
WK W +N VR + ++ ++Q T D +DYL Y +
Sbjct: 504 SPLIRFRKSDWKIWSEWDVNPHNVRDQ--IVNDSPLEQLLVTQDTTDYLMYQNEVRWGSN 561
Query: 451 DPILSGSSNMTLR-INSSGQVLHAYVNGNYVDSQWTKYGASN--DLFER---PVKLTRGK 504
P + + L+ I+ ++NG ++ Q Y + ++F P+
Sbjct: 562 GPTKNKMKSSILKFISCDANSFLVFINGEFIGEQHLAYPGDDCSNIFRFDLGPLGKYGAN 621
Query: 505 NQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLD 564
+S+LS ++G+ + G K GI V + DE + +W GL G +
Sbjct: 622 LTLSILSISLGIHSLGEKHQ---KGIVSDVQI-----DERSLVYGPHERWVMFSGLIG-E 672
Query: 565 DKKFYNAKAANSERGWSSKNVPLNRRMT--WYKTTFE-APLEND---PVVLNLQGMGKGF 618
K Y+ +NS W + NV +R+ T WY T F L+ D V+L+ +GM +G
Sbjct: 673 LLKLYDPMWSNSV-PWRNLNVQTDRKRTSKWYMTKFVLKQLDWDTETSVLLDCKGMNRGR 731
Query: 619 AWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIK-- 676
++NG++LGRYW + DG Q +Y +P +W+
Sbjct: 732 IYLNGHDLGRYW--LIRRSDGAYV----------------------QRYYTIPVAWLHAA 767
Query: 677 DGVNTLVLFEEF 688
+ N LV+FEE
Sbjct: 768 NKSNYLVIFEEL 779
>gi|300121971|emb|CBK22545.2| unnamed protein product [Blastocystis hominis]
Length = 721
Score = 296 bits (758), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 226/753 (30%), Positives = 361/753 (47%), Gaps = 148/753 (19%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
Y+V++D R+ +DG+R I L+GS+HYPR+TP MW ++ +A E GL+ I+ Y FWN HEP
Sbjct: 33 YKVTYDERSFFLDGKRSIFLAGSVHYPRATPEMWDTILDQAVEDGLNLIQIYTFWNLHEP 92
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
++ QY++ G D+ F++ D+GL+V +RIGPYVCAEW+ GG PVW++ + G+ LR
Sbjct: 93 VKGQYNWEGIADIRLFLQKCADRGLFVNMRIGPYVCAEWDNGGIPVWVNYLDGV-RLRAN 151
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
N V+ EM ++ ++ D + FA +GGPII +QIENE +G A + YI+WC
Sbjct: 152 NDVWKKEMGDWMKVLTDYTR--DFFADRGGPIIFSQIENEL------WGGA-REYIDWCG 202
Query: 207 KMATSLDIGVPWIMCQESDAPSPMFTPNNPN------------------SPKIWTENWTG 248
+ A SL++ VPW+MC D N N P WTEN G
Sbjct: 203 EFAESLELNVPWMMCN-GDTSEKTINACNGNDCSSYLESHGQSGRILVDQPGCWTEN-EG 260
Query: 249 WFKSWGGKDPK---------RTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
WF+ G + R+AED F V +F GG++ NYYM+ GG ++G+ +G
Sbjct: 261 WFQIHGAASAERDDYEGWDARSAEDYTFNVLKFMDRGGSYHNYYMWFGGNHYGKWAGNG- 319
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTL----------TYGNVTNTD-- 347
+T Y I N+PK H ++H++L ++ + L + N N +
Sbjct: 320 MTNWYTNGVMIHSDTLPNEPKHSHTAKMHRMLANIAEVLLNDKAQVNNQKHLNCDNCNAF 379
Query: 348 ---YGNSV-------SGSS---------YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKV 388
YG+ + GS+ Y LPAWS+ +L + F T NVK
Sbjct: 380 EYRYGDRLVSFVENNKGSADKVIYRDIVYELPAWSMIVLDEYDNVLFET------NNVKP 433
Query: 389 KRPNQAGNDQAPLQWKWRPEMINDF-------VVRGKGHFALNTLIDQKSTNDVSDYLWY 441
++ + + L++++ E ++ VV K + LN T D++++L+Y
Sbjct: 434 VNKHRVYHCEEKLEFEYWNEPVSTLSQEAPRVVVSPKANEQLNM------TRDLTEFLYY 487
Query: 442 MTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYV--DSQWTKYGASNDLFERPVK 499
T + D+ LS + AYV+ ++V D + T + + + +K
Sbjct: 488 ETEVEFPQDECTLSIGG-------TDANAFVAYVDDHFVGSDDEHTHHDGWHTM-NINMK 539
Query: 500 LTRGKNQISLLSATVGLQNYGSKFDMVPN-------GIPGPVLLVGRAGDETIIKDLSSH 552
+GK+++ LLS ++G+ N G ++ P+ GI G + L G D+ +
Sbjct: 540 SGKGKHKLVLLSESLGVSN-GMDSNLDPSWASSRLKGICGWIKLCG--------NDIFNQ 590
Query: 553 KWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPL---ENDPVVL 609
+W + GL G + F + W S +V + WY++TF+ P V+L
Sbjct: 591 EWKHYPGLVGEAKQVFTDEGMKTVT--WKS-DVENADNLAWYRSTFKTPQGLKRGIEVLL 647
Query: 610 NLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYH 669
+GM +G A+VNG+N+GRYW +DG G +Q +YH
Sbjct: 648 RPEGMNRGQAYVNGHNIGRYWMI----KDG--------------------NGEYTQGYYH 683
Query: 670 VPRSWIK--DGVNTLVLFEEFGGNPSQINFQTV 700
+P+ W+K N LVL E G + + T
Sbjct: 684 IPKDWLKGEGEENVLVLGETLGASDPSVTICTT 716
>gi|281202334|gb|EFA76539.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
PN500]
Length = 611
Score = 295 bits (754), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 194/573 (33%), Positives = 290/573 (50%), Gaps = 73/573 (12%)
Query: 168 EKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAP 227
E+ FA+ GGPII++Q+ENEYG V YG++G Y W A++A SL++GVPWIMCQ+ D
Sbjct: 13 ERHFAANGGPIIMSQVENEYGWVQERYGESGTKYAQWSARLAQSLNVGVPWIMCQQDDID 72
Query: 228 SPMFTPNN--------------PNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFF 273
S + T N PN P +TENW GWF+ W P R ED+ +AV +F
Sbjct: 73 SVINTCNGFYCHDWIEGHWARYPNQPAFFTENWPGWFQQWKQSTPHRPVEDVLYAVGNWF 132
Query: 274 QFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKS 333
GG+ NYYM+HGGTNFGRTS P + SYDYDA +DEYG+ ++PK+ H + + LL+
Sbjct: 133 ARGGSLMNYYMWHGGTNFGRTSS-PMVVNSYDYDAALDEYGNPSEPKYSHAAKFNNLLQK 191
Query: 334 MEKT-LTYGNVTNTDY-GNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV-KVKR 390
L + ++Y G S S Y S+S L + N N Q ++ K
Sbjct: 192 YSHIFLNAPEIPRSEYLGGSSSIYHYTFGGESLSFLINNHESALNDIVWNGQNHIIKPWS 251
Query: 391 PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTL-----------------------I 427
+ N+ PE ++ + K +N+ +
Sbjct: 252 VHLLYNNHTVFDSAATPE-VSKLAMTSKRFSPVNSFNNAYISQWVEEIDMTDSTWSSKPL 310
Query: 428 DQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTK 486
+Q S T+D +DYLWY+T +L+ + G+ T ++ VLHAY++G Y + W
Sbjct: 311 EQLSLTHDKTDYLWYVTEINLQ-----VRGAEVFTTNVS---DVLHAYIDGKYQSTIW-- 360
Query: 487 YGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETII 546
S + F + G +++ +L++ +G+Q+Y + V G+ G + + G
Sbjct: 361 ---SANPFNIKSDIPLGWHKLQILNSKLGVQHYTVDMEKVTGGLLGNIWVGG-------- 409
Query: 547 KDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLE-ND 605
D++++ W+ K + G + YN + WSS + + + +TWYK F L N
Sbjct: 410 TDITNNGWSMKPYVNG-ERLAIYNPNNI-FKVDWSSFS-GVQQPLTWYKINFLHELSPNK 466
Query: 606 PVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQ 665
LN+ GM KG W+NG ++ RYW T G C Y+G Y C+ NCG PSQ
Sbjct: 467 HYSLNMSGMNKGMIWLNGKHVARYWIT-----KGWGCNGCSYQGGYTDQLCSTNCGEPSQ 521
Query: 666 IWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQ 698
I YH+P+ W+ +G N LV+FEE GGNP I +
Sbjct: 522 INYHLPQDWLIEGANLLVIFEEVGGNPKSIKLE 554
>gi|293331757|ref|NP_001169479.1| uncharacterized protein LOC100383352 [Zea mays]
gi|224029591|gb|ACN33871.1| unknown [Zea mays]
Length = 580
Score = 293 bits (750), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 199/612 (32%), Positives = 282/612 (46%), Gaps = 97/612 (15%)
Query: 241 IWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL 300
+WTENWT F+++G + R+AED+A+AV RFF GG+ NYYMYHGGTNFGRT G Y+
Sbjct: 2 LWTENWTQQFRAYGDQVAMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYV 60
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYG------------------- 341
T Y +AP+DEYG +PK+GHLR+LH +++S +K +G
Sbjct: 61 LTGYYDEAPMDEYGMYKEPKFGHLRDLHNVIRSYQKAFLWGQHSSEILGHGYEAHIFELP 120
Query: 342 -----------NVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR 390
N T D G + +P+ SVSIL CK +NT +V Q + +
Sbjct: 121 EEKLCLSFLSNNNTGEDGTVIFRGDKHYVPSRSVSILAGCKNVVYNTKRVFVQHSERSFH 180
Query: 391 PNQAGNDQAPLQWKWRPEMI---NDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADL 447
+ + QW+ E I D VR K L T D +DYLWY T+ L
Sbjct: 181 TSDVTSKNN--QWEMSSETIPKYRDTKVRTK-----EPLEQYNQTKDDTDYLWYTTSFRL 233
Query: 448 KDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQI 507
+ DD L++ SS + + N +V +FE+PV L G N +
Sbjct: 234 ESDDLPFRNDIRPVLQVKSSAHAMMGFANDAFVGCARGNKQVKGFMFEKPVDLKVGVNHV 293
Query: 508 SLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKK 567
LLS+T+G+++ G + V GI ++ G T DL + W +K L G + K+
Sbjct: 294 VLLSSTMGMKDSGGELAEVKGGIQECLI----QGLNTGTLDLQVNGWGHKAALEG-EYKE 348
Query: 568 FYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLG 627
Y+ K + ++N +R TWYK F+ P +DPVVL++ M KG +VNG +G
Sbjct: 349 IYSEKGLGKVQWKPAEN---DRAATWYKRYFDEPDGDDPVVLDMSSMSKGMIFVNGEGVG 405
Query: 628 RYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEE 687
RYW +Y G PSQ YH+PR ++K N LV+FEE
Sbjct: 406 RYWVSYRTL-----------------------AGTPSQAVYHIPRPFLKSKDNLLVIFEE 442
Query: 688 FGGNPSQINFQTVVVGTACGQAHENKTMELTC--------------HGRR---------- 723
G P I QTV C E+ ++ H RR
Sbjct: 443 EMGKPDGILVQTVTRDDICLFISEHNPGQIKTWDTDGDKIKLIAEDHSRRGTLTCPPEKT 502
Query: 724 ISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAA 783
I E+ +ASFG+P G CG F G+C + ++EK+C+GK SC + GA
Sbjct: 503 IQEVVFASFGNPDGMCGNFTVGTCHTP-NAKQIVEKECLGKPSCMLPVDHTVYGADINCQ 561
Query: 784 GTVKRLVVEALC 795
T L V+ C
Sbjct: 562 STTATLGVQVRC 573
>gi|414590082|tpg|DAA40653.1| TPA: hypothetical protein ZEAMMB73_851266 [Zea mays]
Length = 580
Score = 292 bits (748), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 199/612 (32%), Positives = 282/612 (46%), Gaps = 97/612 (15%)
Query: 241 IWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL 300
+WTENWT F+++G + R+AED+A+AV RFF GG+ NYYMYHGGTNFGRT G Y+
Sbjct: 2 LWTENWTQQFRAYGDQVAMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYV 60
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYG------------------- 341
T Y +AP+DEYG +PK+GHLR+LH +++S +K +G
Sbjct: 61 LTGYYDEAPMDEYGMYKEPKFGHLRDLHNVIRSYQKAFLWGQHSSEILGHGYEAHIFELP 120
Query: 342 -----------NVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKR 390
N T D G + +P+ SVSIL CK +NT +V Q + +
Sbjct: 121 EEKLCLSFLSNNNTGEDGTVIFRGDKHYVPSRSVSILAGCKNVVYNTKRVFVQHSERSFH 180
Query: 391 PNQAGNDQAPLQWKWRPEMI---NDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADL 447
+ + QW+ E I D VR K L T D +DYLWY T+ L
Sbjct: 181 TSDVTSKNN--QWEMFSETIPKYRDTKVRTK-----EPLEQYNQTKDDTDYLWYTTSFRL 233
Query: 448 KDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQI 507
+ DD L++ SS + + N +V +FE+PV L G N +
Sbjct: 234 ESDDLPFRNDIRPVLQVKSSAHAMMGFANDAFVGCARGNKQVKGFMFEKPVDLKVGVNHV 293
Query: 508 SLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKK 567
LLS+T+G+++ G + V GI ++ G T DL + W +K L G + K+
Sbjct: 294 VLLSSTMGMKDSGGELAEVKGGIQECLI----QGLNTGTLDLQVNGWGHKAALEG-EYKE 348
Query: 568 FYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLG 627
Y+ K + ++N +R TWYK F+ P +DPVVL++ M KG +VNG +G
Sbjct: 349 IYSEKGLGKVQWKPAEN---DRAATWYKRYFDEPDGDDPVVLDMSSMSKGMIFVNGEGVG 405
Query: 628 RYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEE 687
RYW +Y G PSQ YH+PR ++K N LV+FEE
Sbjct: 406 RYWVSYRTL-----------------------AGTPSQAVYHIPRPFLKSKDNLLVIFEE 442
Query: 688 FGGNPSQINFQTVVVGTACGQAHENKTMELTC--------------HGRR---------- 723
G P I QTV C E+ ++ H RR
Sbjct: 443 EMGKPDGILVQTVTRDDICLFISEHNPGQIKTWDTDGDKIKLIAEDHSRRGTLTCPPEKT 502
Query: 724 ISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAA 783
I E+ +ASFG+P G CG F G+C + ++EK+C+GK SC + GA
Sbjct: 503 IQEVVFASFGNPDGMCGNFTVGTCHTP-NAKQIVEKECLGKPSCMLPVDHTVYGADINCQ 561
Query: 784 GTVKRLVVEALC 795
T L V+ C
Sbjct: 562 STTATLGVQVRC 573
>gi|452821358|gb|EME28389.1| beta-galactosidase [Galdieria sulphuraria]
Length = 1171
Score = 281 bits (720), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 144/305 (47%), Positives = 189/305 (61%), Gaps = 20/305 (6%)
Query: 43 KILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRF 102
+IL SIHYPR P W LI+ AKE G++ IETYVFWN HE + YDF+G LDL F
Sbjct: 476 RILFPASIHYPRCQPSDWQQLIEFAKEAGINCIETYVFWNQHEKEKGVYDFSGRLDLFGF 535
Query: 103 IKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIV 162
I+TI GLY +LRIGPY+CAE ++GGFP WL ++ GI E RT N+ F E + +V
Sbjct: 536 IRTIAKAGLYALLRIGPYICAETHFGGFPHWLRDIDGI-EFRTQNEPFQRESSRWVRFLV 594
Query: 163 DMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQ 222
+ F SQGGPI++ Q ENEY + +YG+AG +Y+ WC+++A L + VP MC+
Sbjct: 595 EKLNSNNCFYSQGGPIVMVQFENEYKLIGQNYGEAGLNYLKWCSELAKDLQLPVPLFMCK 654
Query: 223 ESDAPSPMFTPNN--------------PNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFA 268
S + + T N+ PN P IWTE WTGW+ WG R +DL +A
Sbjct: 655 GS-IENVLETINDFYGHQEMENHHREYPNQPAIWTECWTGWYDVWGSAHHIRPCKDLFYA 713
Query: 269 VARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-TTSYDYDAPIDEYGHLNQPKWGHLREL 327
V RFF GG NYYM+HGGTN+ + + YL TTSYDYDAPIDEYG + +G L+ +
Sbjct: 714 VLRFFAQGGKGINYYMFHGGTNYDQLAM--YLQTTSYDYDAPIDEYGRKTKKYFG-LQYI 770
Query: 328 HKLLK 332
H+ L+
Sbjct: 771 HRQLE 775
>gi|34481839|emb|CAD44519.1| putative beta-galactosidase [Carica papaya]
Length = 285
Score = 281 bits (719), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 143/286 (50%), Positives = 177/286 (61%), Gaps = 40/286 (13%)
Query: 124 EWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQI 183
EWN+GGFPVWL +PGI+ RT N F +MQ FT IV+M K EKLF Q GPII++QI
Sbjct: 1 EWNFGGFPVWLKYVPGIQ-FRTDNGPFKAQMQKFTEKIVNMMKAEKLFEPQEGPIIMSQI 59
Query: 184 ENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FT 232
ENEYG + + G GK+Y W A+MA L GVPWIMC++ DAP P+ F
Sbjct: 60 ENEYGPIEWEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQEDAPDPIIDTCNGFYCENFM 119
Query: 233 PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG 292
PN PK++TE WTGW+ +GG P R AED+A++VARF Q G+F NYYMYHGGTNFG
Sbjct: 120 PNANYKPKMFTEAWTGWYTEFGGPVPYRPAEDMAYSVARFIQNRGSFINYYMYHGGTNFG 179
Query: 293 RTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN-- 350
RT+GGP++ TSYDYDAP+DEYG +PKWGHLR+LHK +K E +L + T G+
Sbjct: 180 RTAGGPFIATSYDYDAPLDEYGLGREPKWGHLRDLHKTIKLCEPSLVSVDPKVTSLGSNQ 239
Query: 351 --------------------------SVSGSSYNLPAWSVSILPDC 370
+ Y+LP WSVSILPDC
Sbjct: 240 EAHVFWTKTSCAAFLANYDLKYSVRVTFQNLPYDLPPWSVSILPDC 285
>gi|357483613|ref|XP_003612093.1| Beta-galactosidase [Medicago truncatula]
gi|355513428|gb|AES95051.1| Beta-galactosidase [Medicago truncatula]
Length = 504
Score = 280 bits (717), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 174/464 (37%), Positives = 240/464 (51%), Gaps = 44/464 (9%)
Query: 357 YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVR 416
YNLP WSVSILPDC+ FNTAKV QT+ P + W+ E D
Sbjct: 54 YNLPPWSVSILPDCRNAVFNTAKVGVQTSQMQMLPTNSER----FSWESFEE---DTSSS 106
Query: 417 GKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYV 475
+ L++Q T D SDYLWY+T+ D+ + L G +L + S+G +H ++
Sbjct: 107 SATTITASGLLEQINVTRDTSDYLWYITSVDVGSSESFLHGGKLPSLIVQSTGHAVHVFI 166
Query: 476 NGNYVDSQWTKYGASNDLFER---PVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPG 532
NG S YG D R V L G N I+LLS VGL N G F+ GI G
Sbjct: 167 NGRLSGS---AYGTREDRRFRYTGDVNLRAGTNTIALLSVAVGLPNVGGHFETWNTGILG 223
Query: 533 PVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMT 592
PV++ G + DLS KWTY+VGL G ++ E S+ V N+ +T
Sbjct: 224 PVVIHGLDKGKL---DLSWQKWTYQVGLKGEAMNLASPDGISSVEWMQSAVVVQRNQPLT 280
Query: 593 WYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYG 652
W+KT F+AP +P+ L++ GMGKG W+NG ++GRYW T +A S C+Y G +
Sbjct: 281 WHKTFFDAPEGEEPLALDMDGMGKGQIWINGISIGRYW-TAIATG---SCNDCNYAGSFR 336
Query: 653 SDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHEN 712
KC CG P+Q WYHVPRSW+K N LV+FEE GG+PS+I+ V + C E
Sbjct: 337 PPKCQLGCGQPTQRWYHVPRSWLKQNHNLLVVFEELGGDPSKISLAKRSVSSVCADVSEY 396
Query: 713 K--------------------TMELTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEI 751
+ L C+ G+ IS IK+ASFG P G CG++++G+C +
Sbjct: 397 HPNLKNWHIDSYGKSENFRPPKVHLHCNPGQAISSIKFASFGTPLGTCGSYEQGACHSS- 455
Query: 752 DVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
++E++C+GK C + S +N G C +KRL VEA+C
Sbjct: 456 SSYDILEQKCIGKPRCIVTVSNSNFGRDPC-PNVLKRLSVEAVC 498
>gi|255563859|ref|XP_002522930.1| beta-galactosidase, putative [Ricinus communis]
gi|223537857|gb|EEF39473.1| beta-galactosidase, putative [Ricinus communis]
Length = 450
Score = 277 bits (708), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 180/500 (36%), Positives = 253/500 (50%), Gaps = 102/500 (20%)
Query: 183 IENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFT---------- 232
IENEYGN+ + + + G SY++W AKMA L GVPWIMC++ DAP P+
Sbjct: 1 IENEYGNIEAAFHEKGSSYVHWAAKMAVDLQTGVPWIMCKQIDAPDPVINTCNGMKCGET 60
Query: 233 ---PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGT 289
PN+PN P +WTENWT +++ +GG+ R+A+D+AF VA F G++ NYYMYHGGT
Sbjct: 61 FGGPNSPNKPSLWTENWTSFYQVYGGEPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGT 120
Query: 290 NFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG 349
NFGRT+ Y+ T Y AP+DEYG + QPKWGHL+ELH ++KS TL G TN G
Sbjct: 121 NFGRTAAA-YVITGYYDQAPLDEYGLIRQPKWGHLKELHAVIKSCSTTLLEGVQTNLSVG 179
Query: 350 ----------------------NSVSGS------SYNLPAWSVSILPDCKTEEFNTAKVN 381
+SV+ + S+ L S+SILPDC FNTAKVN
Sbjct: 180 QLQQAYMFEAQGGGCVAFLVNNDSVNATVGFRNKSFELLPKSISILPDCDNIIFNTAKVN 239
Query: 382 TQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLW 440
+N ++ ++ N W+ ++I ++ +TL++ +T D SDYLW
Sbjct: 240 AGSNRRITTSSKKLN-----TWEKYIDVIPNY---SDSTIKSDTLLEHMNTTKDKSDYLW 291
Query: 441 YMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDS-QWTKYGASNDLFERPVK 499
Y + P LS + + L + S V +A+VN Y S +K G + E P+
Sbjct: 292 YTFSF-----QPNLSCTKPL-LHVESLAHVAYAFVNNKYSGSAHGSKNGKVPFIMEVPIV 345
Query: 500 LTRG--KNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYK 557
L N IS+LS VGL VG G+
Sbjct: 346 LDDDGLSNNISILSVLVGLS-------------------VGLLGE--------------T 372
Query: 558 VGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKG 617
+ LYG + + WS ++ + + +TW+K F+ P NDPVVLNL M KG
Sbjct: 373 LQLYGKEHLEMVK---------WSKADISIAQPLTWFKLEFDTPKGNDPVVLNLATMSKG 423
Query: 618 FAWVNGYNLGRYWPTYLAEE 637
AWVNG ++GRYW ++L +
Sbjct: 424 EAWVNGQSIGRYWISFLTSK 443
>gi|449018329|dbj|BAM81731.1| probable beta-galactosidase [Cyanidioschyzon merolae strain 10D]
Length = 777
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 224/749 (29%), Positives = 335/749 (44%), Gaps = 112/749 (14%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE--- 85
+++D R++ I+G+ LSG++HY RS P WP + + + GL+ +ETYVFW HE
Sbjct: 10 ITYDSRSLRINGKPFFCLSGAVHYVRSHPSAWPQIFRCMRRDGLNTVETYVFWGDHEFEP 69
Query: 86 ----PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNM---- 137
+ DF+G DL+RF++ + GL ILR+GPYVCAE NYGGFP WL +
Sbjct: 70 PEMPDAEPRADFSGPRDLVRFLRCAKLHGLNAILRLGPYVCAEVNYGGFPWWLRQVCEKG 129
Query: 138 -PGIEELRTTNKVFMNEMQNFTTLIVD-MAKKEKLFASQGGPIILAQIENEYGNVMSDYG 195
RT + + +++ + +VD + K ++FA QGGP+ILAQIENEY + YG
Sbjct: 130 SSKPVRFRTWDPAYCAQVERWLKYLVDHVLKPARVFAPQGGPVILAQIENEYAMIAESYG 189
Query: 196 DAGKSYINWCAKMATSLDIGVPWIMC---QESDAPSPMFTPN-----------------N 235
G+ Y++W A +A L +GVP +MC + ++ + T N N
Sbjct: 190 PDGQQYLDWIASLANQLALGVPLVMCYGASQRESGRVIETINAFYAHEHVESLRRAQGAN 249
Query: 236 PNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTS 295
P P +WTE WTGW+ WG +R A DLA+AV RF GG NYYMY GGTN+ R +
Sbjct: 250 PQ-PLLWTECWTGWYDVWGAPHHRRDAADLAYAVLRFLAAGGAGINYYMYFGGTNWRREN 308
Query: 296 GGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK----LLKSMEKTLTYGNVTNTDYGNS 351
TSYDYDAP++EY + K HLR LH+ L + L + +
Sbjct: 309 TMYLQATSYDYDAPLNEYV-METTKSRHLRRLHESIQPFLSDRDGVLDMSRLELKVFEGE 367
Query: 352 VSGSSYNLPAWSVSILPDCKTEE-----FNTAKVNTQTNVKVKR--PNQAGNDQAP-LQW 403
Y +VS D ++EE F++A + ++++ N A D L+W
Sbjct: 368 RRAILYERS--TVSGDADHRSEESVRCVFDSADIRVHLALELREIIVNAASRDTGQDLRW 425
Query: 404 KWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTL 462
+ PE L T+ D +T SDY WY+ P GS + L
Sbjct: 426 RMLPEPPPLRAALSDTSATLATIPDLVDATAGTSDYAWYILRC------PTAQGSGLLQL 479
Query: 463 RINSSGQVLH--AYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYG 520
+ G+V A G+ + Q ++ A+ E PV+ R N + G+ G
Sbjct: 480 EVADFGRVWRRKAVDQGDDAERQPLEWAAAGP--EPPVE-DRFPNAWNSTEYGYGIVEVG 536
Query: 521 ------------SKFDMVPNG---IPGPVLLVGRAG--------DETIIKDLSSHKWT-- 555
S MV PG + R G D T D +W
Sbjct: 537 AIDCHEEYVVLVSSLGMVKGDWQLPPGYGMARERKGLLRASYRSDVTFADD----EWRDA 592
Query: 556 ----YKVGLYGLDDKKFYNAKAANSERGWSSKNVPLN-RRMT---WYKTTFEAPL----E 603
+ GL G + A W+ + L+ RR + WY+ + P E
Sbjct: 593 LVVGFAAGLRGERIRSVIEGDADAYPYLWTPQKAALSGRRFSWPRWYRASLAIPPPNADE 652
Query: 604 NDPVVLNL--QGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCG 661
+ ++L+L G+ KG+ ++NG GR+W + D P G
Sbjct: 653 TEGIILDLYESGVEKGWIYMNGEPCGRHWRVHGTMPKNGFLRQGDQEAPIEQ----VGHG 708
Query: 662 NPSQIWYHVPRSW---IKDGVNTLVLFEE 687
P+Q ++++P W K +TLV+F+E
Sbjct: 709 QPTQRYFYIP-PWHLHAKGRPSTLVIFDE 736
>gi|320129049|gb|ADW19770.1| beta-galactosidase [Fragaria chiloensis]
Length = 219
Score = 269 bits (688), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 128/219 (58%), Positives = 156/219 (71%), Gaps = 12/219 (5%)
Query: 59 MWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIG 118
MWPDLI++AK+GGLD I+TYVFWN HEP +Y F N DL++FIK +Q GLYV LRIG
Sbjct: 2 MWPDLIQRAKDGGLDVIQTYVFWNGHEPSPGKYYFEDNYDLVKFIKLVQQAGLYVHLRIG 61
Query: 119 PYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPI 178
PYVCAEWN+GGFPVWL +PGI + RT N F ++MQ FTT IV+M K E+LF S GGPI
Sbjct: 62 PYVCAEWNFGGFPVWLKYIPGI-QFRTDNGPFKDQMQRFTTKIVNMMKAERLFESHGGPI 120
Query: 179 ILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-------- 230
IL+QIENEYG + + G GK+Y +W A+MA L GVPW+MC++ DAP P+
Sbjct: 121 ILSQIENEYGPMEYEIGAPGKAYTDWAAQMAVGLGTGVPWVMCKQDDAPDPVINACNGFY 180
Query: 231 ---FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLA 266
F+PN PK+WTE WTGWF +GG P R AEDLA
Sbjct: 181 CDYFSPNKAYKPKMWTEAWTGWFTEFGGAVPYRPAEDLA 219
>gi|301123859|ref|XP_002909656.1| beta-galactosidase, putative [Phytophthora infestans T30-4]
gi|262100418|gb|EEY58470.1| beta-galactosidase, putative [Phytophthora infestans T30-4]
Length = 706
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 180/551 (32%), Positives = 275/551 (49%), Gaps = 64/551 (11%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
Y V++ R IDG++ +LL GSIHYPRS+PG W L+++AK GL+ IE YVFWN HE
Sbjct: 83 YSVTYSPRGFEIDGKQTLLLGGSIHYPRSSPGEWEQLLREAKRDGLNHIEMYVFWNLHEQ 142
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
R ++F GN ++ RF + + GL++ +R GPYVCAEWN GG P+WL+ +PG+ E+R++
Sbjct: 143 ERGVFNFAGNANITRFYELAAEVGLFLHVRFGPYVCAEWNNGGLPLWLNWIPGM-EVRSS 201
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
N + EM+ F +V++++ A GGPII+AQIENE+ + D YI WC
Sbjct: 202 NAPWQREMERFIRYMVELSR--PFLAKNGGPIIMAQIENEFA-----WHD--PEYIAWCG 252
Query: 207 KMATSLDIGVPWIMCQESDAPSPMFTPNN--------------PNSPKIWTENWTGWFKS 252
+ LD +PW+MC + A + + + N+ P+ P +WTE+ GWF++
Sbjct: 253 NLVKQLDTSIPWVMCYANAAENTILSCNDDDCVDFAVKHVKERPSDPLVWTED-EGWFQT 311
Query: 253 W--GGKDP----KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDY 306
W K+P +R+ ED+A+AVAR+F GG NYYMYHGG N+GR + +TT Y
Sbjct: 312 WQKDKKNPLPNDQRSPEDVAYAVARWFAVGGAAHNYYMYHGGNNYGRAASAG-VTTMYAD 370
Query: 307 DAPIDEYGHLNQPKWGHLRELHK--------LLKSMEKTLTYGNVTNTDYGNSVSGSSYN 358
+ G N+PK HLR+LH+ LL++ + L + D + S
Sbjct: 371 GVNLHSDGLSNEPKRTHLRKLHEALIECNDVLLRNDRQVLNPRELPLVDEQTVKASSQQR 430
Query: 359 LPAWSVSILPDCKTE-EFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRG 417
+ P+ F+TA V + R + L WK E+
Sbjct: 431 AFVYGPEAEPNQDGAILFDTADVRKSFPGRQHRTYTPLVKASALAWKAWSELNVSSTTPR 490
Query: 418 KGHFALNTLIDQKSTNDVSDYLWYMTN------ADLKDDDPILSGSSNMTLRINS-SGQV 470
+ A + + T D SDYL Y T +D+ DD T+++ S
Sbjct: 491 RRVVADQPIEQLRLTADQSDYLTYETTFTPKQLSDVDDD--------MWTVKVTSCEASS 542
Query: 471 LHAYVNGNYVDSQWTKYGASN----DLFERPVKLTRGKNQ-ISLLSATVGLQNYGSKFDM 525
+ A V+G + + Y N F P + G+ + L+S ++G+ + GS
Sbjct: 543 IIALVDGWLIGERNLAYPGGNCSKEFSFHLPASIEVGRQHDLKLVSVSLGIYSLGSNHS- 601
Query: 526 VPNGIPGPVLL 536
G+ G V +
Sbjct: 602 --KGVTGSVRI 610
>gi|62869849|gb|AAY18075.1| beta-galactosidase, partial [Carica papaya]
Length = 263
Score = 267 bits (683), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 134/263 (50%), Positives = 160/263 (60%), Gaps = 39/263 (14%)
Query: 145 TTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW 204
T N+ F MQ FT IV M K E+LF SQGGPIIL+QIENE+G V + G GK+Y W
Sbjct: 1 TDNEPFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKW 60
Query: 205 CAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSW 253
A+MA L+ GVPWIMC++ DAP P+ FTPN PK+WTE WTGW+ +
Sbjct: 61 AARMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCENFTPNKNYKPKMWTEVWTGWYTEF 120
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEY 313
GG P R AEDLAF++AR Q GG+F NYYMYHGGTNFGRT+GGP++ TSYDYDAP+DEY
Sbjct: 121 GGAVPTRPAEDLAFSIARLIQKGGSFVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEY 180
Query: 314 GHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNS---------------------- 351
G +PKWGHLR+LHK +KS E L + T GNS
Sbjct: 181 GLPREPKWGHLRDLHKAIKSSESALVSAEPSVTSLGNSQEAHVFKSKSGCAAFLANYDTK 240
Query: 352 ------VSGSSYNLPAWSVSILP 368
Y LP WS+SILP
Sbjct: 241 SSAKVSFGNGQYELPPWSISILP 263
>gi|62869847|gb|AAY18074.1| beta-galactosidase [Carica papaya]
Length = 263
Score = 266 bits (679), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 134/263 (50%), Positives = 160/263 (60%), Gaps = 39/263 (14%)
Query: 145 TTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW 204
T N+ F MQ FT IV M K E+LF SQGGPIIL+QIENE+G V + G GK+Y W
Sbjct: 1 TDNEPFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKW 60
Query: 205 CAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSW 253
A+MA L+ GVPWIMC++ DAP P+ FTPN PK+WTE WTGW+ +
Sbjct: 61 AARMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCENFTPNKNYKPKMWTEVWTGWYTEF 120
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEY 313
GG P R AEDLAF++ARF Q GG+ NYYMYHGGTNFGRT+GGP++ TSYDYDAP+DEY
Sbjct: 121 GGAVPTRPAEDLAFSIARFIQKGGSSVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEY 180
Query: 314 GHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSS----------------- 356
G +PKWGHLR LHK +KS E L + T GNS +
Sbjct: 181 GLPREPKWGHLRNLHKAIKSSESALVSAEPSVTSLGNSQEAHAFKSKSGCAAFLANYDTK 240
Query: 357 -----------YNLPAWSVSILP 368
Y LP WS+SILP
Sbjct: 241 SSAKVSFGNGQYELPPWSISILP 263
>gi|452819191|gb|EME26260.1| beta-galactosidase [Galdieria sulphuraria]
Length = 652
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 169/506 (33%), Positives = 257/506 (50%), Gaps = 58/506 (11%)
Query: 28 RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPL 87
+V+ D RA+ IDG+R IL GS HYP+ WP ++ AK+ GL+ +E Y+FWN HE
Sbjct: 5 QVTFDKRAVVIDGKRTILYCGSYHYPKIHYEHWPQALELAKDCGLNCLEVYIFWNVHEKK 64
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
+ Y F ++ RF++ Q++GL VILR+GPY+CAE +YGGFP WL +PGI E RT N
Sbjct: 65 KGVYHFEREGNIFRFLQLAQERGLKVILRMGPYICAETSYGGFPYWLREIPGI-EFRTYN 123
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
+ FM EM+ + T I M K+ KL+ +GGPIIL QIENEY V S YG AG+ Y++WC +
Sbjct: 124 EPFMKEMKRWLTDINRMLKENKLYHQKGGPIILVQIENEYDIVSSIYGAAGQKYLHWCYE 183
Query: 208 MATSLDIGVPWIMCQESD------APSPMFTPNN--------------PNSPKIWTENWT 247
+ + W+ ++S+ + T N+ P+ P +WTE W
Sbjct: 184 LYK--EGASEWLTSKDSEYFRVASIDKSIETINDFYGHRRIDSLKALKPHQPLLWTEFWI 241
Query: 248 GWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYD 307
GW+ W G +R +D+ +A ARF GG+ NYYM+HGGT+FG + TT YD+D
Sbjct: 242 GWYNIWRGAQRQRPVDDVIYAAARFIAQGGSGMNYYMFHGGTHFGNLAMYG-QTTGYDFD 300
Query: 308 APIDEYGHLNQPKWGHLRELHKLLKSME-----------------------KTLTYGNVT 344
AP+D YG + K+ L++L+ L ++E K + G+
Sbjct: 301 APVDSYGRPTE-KFERLKQLNHCLSNLEYILLSQDEPEVQKLTPNVNVYRWKDIESGDEC 359
Query: 345 NTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKV--NTQTNVKVKRPNQAGNDQAPLQ 402
+ + S S + +V + P N +V ++Q + V + + D +
Sbjct: 360 SFVCNDQRSQSYVIVAERAVCLKPLSVKIYLNHEEVFDSSQNSYNVSQKSYHRLDYVCNE 419
Query: 403 WKWRPEMINDFVVRGKGHFALN--TLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSN 459
WK I + K HF + + D T D +DY+WY + P ++
Sbjct: 420 WKTMQIPIPSKEKKDKEHFEFSFPHIPDMLHITQDETDYMWYTGVGTIY--CPFKGENTP 477
Query: 460 MTLRIN---SSGQVLHAYVNGNYVDS 482
L+I+ + +H ++N YV S
Sbjct: 478 HCLKIHMELEAADYVHVFLNRKYVGS 503
>gi|217075721|gb|ACJ86220.1| unknown [Medicago truncatula]
Length = 208
Score = 263 bits (671), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 124/203 (61%), Positives = 153/203 (75%), Gaps = 1/203 (0%)
Query: 10 AILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
A +L L S V++D +A+ IDG+R++L+SGSIHYPRSTP MWPDLI+K+K+
Sbjct: 7 AFVLLWFLGVYVPASFCSNVTYDHKALVIDGKRRVLMSGSIHYPRSTPQMWPDLIQKSKD 66
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
GG+D IETYVFWN HEP+R QY+F G DL+ F+K + GLYV LRIGPYVCAEWNYGG
Sbjct: 67 GGIDVIETYVFWNLHEPVRGQYNFEGRGDLVGFVKVVAAAGLYVHLRIGPYVCAEWNYGG 126
Query: 130 FPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGN 189
FP+WLH + GI + RT N+ F EM+ FT IVDM K+E L+ASQGGPIIL+QIENEYGN
Sbjct: 127 FPLWLHFIAGI-KFRTNNEPFKAEMKRFTAKIVDMMKQENLYASQGGPIILSQIENEYGN 185
Query: 190 VMSDYGDAGKSYINWCAKMATSL 212
+ + A KSYI+W A MATSL
Sbjct: 186 IDTHDARAAKSYIDWAASMATSL 208
>gi|15228075|ref|NP_178493.1| glycosyl hydrolase family 35 protein [Arabidopsis thaliana]
gi|20198172|gb|AAM15443.1| predicted protein [Arabidopsis thaliana]
gi|330250699|gb|AEC05793.1| glycosyl hydrolase family 35 protein [Arabidopsis thaliana]
Length = 469
Score = 261 bits (668), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 149/373 (39%), Positives = 202/373 (54%), Gaps = 104/373 (27%)
Query: 284 MYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNV 343
MYHG TNF RT+GGP++TT+YDYDAP+DE+G+LNQPK+GHL++LH + +MEKTLTYGN+
Sbjct: 23 MYHGHTNFDRTAGGPFITTTYDYDAPLDEFGNLNQPKYGHLKQLHDVFHAMEKTLTYGNI 82
Query: 344 TNTDYGNSV------------------------SGSSYNLPAWSVSILPDCKTEEFNTAK 379
+ D+GN V G+SY++PAW VSILPDCKTE +NTAK
Sbjct: 83 STADFGNLVMTTVYQTEEGSSCFIGNVNAKINFQGTSYDVPAWYVSILPDCKTESYNTAK 142
Query: 380 -VNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDY 438
+ +T+++ K +ND SD+
Sbjct: 143 RMKLRTSLRFK----------------------------------------NVSNDESDF 162
Query: 439 LWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPV 498
LWYMT +LK+ DP + NM+LRINS+ VLH +VNG + + + G + +FE+
Sbjct: 163 LWYMTTVNLKEQDP--AWGKNMSLRINSTAHVLHGFVNGQHTGNYRVENGKFHYVFEQDA 220
Query: 499 KLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKV 558
K G N I+LLS TV L NYG+ F+ VP GI GPV ++GR GDET++K LS+H K+
Sbjct: 221 KFNPGVNVITLLSVTVDLPNYGAFFENVPAGITGPVFIIGRNGDETVVKYLSTHNGATKL 280
Query: 559 GLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGF 618
T F+APL ++PVV++L G GKG
Sbjct: 281 -------------------------------------TIFKAPLGSEPVVVDLLGFGKGK 303
Query: 619 AWVNGYNLGRYWP 631
A +N GRYWP
Sbjct: 304 ASINENYTGRYWP 316
>gi|414879451|tpg|DAA56582.1| TPA: hypothetical protein ZEAMMB73_811947 [Zea mays]
Length = 249
Score = 259 bits (663), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 115/204 (56%), Positives = 151/204 (74%), Gaps = 1/204 (0%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DGRA+ +DG R++L SG +HYPRSTP MWPDLI KAK+GGLD I+TYVFWNAHEP++
Sbjct: 38 VTYDGRALILDGARRMLFSGDMHYPRSTPEMWPDLIAKAKKGGLDVIQTYVFWNAHEPVQ 97
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Q++F G DL++FI+ I QGLYV LRIGP+V +EW YGG P WL +P I R+ N+
Sbjct: 98 GQFNFEGRYDLVKFIREIHAQGLYVSLRIGPFVESEWKYGGLPFWLRGIPNI-TFRSDNE 156
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F MQ F T IV++ K E+LF QGGPII++QIENEY V + + G SY++W A M
Sbjct: 157 PFKRHMQKFVTKIVNLMKDERLFYPQGGPIIISQIENEYKLVEAAFHSKGSSYVHWAAAM 216
Query: 209 ATSLDIGVPWIMCQESDAPSPMFT 232
A +L GVPW+MC++ DAP P+ +
Sbjct: 217 AVNLQTGVPWMMCKQDDAPDPIVS 240
>gi|183604891|gb|ACC64532.1| beta-galactosidase 6 inactive isoform [Oryza sativa Indica Group]
Length = 244
Score = 259 bits (661), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 115/207 (55%), Positives = 147/207 (71%), Gaps = 1/207 (0%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
L +++DGRA+ + G R++ SG +HY RSTP MWP LI KAK GGLD I+TYVFWN
Sbjct: 24 ELGREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNV 83
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
HEP++ QY+F G DL++FI+ IQ QGLYV LRIGP+V AEW YGGFP WLH++P I
Sbjct: 84 HEPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSI-TF 142
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYIN 203
R+ N+ F MQNF T IV M K E L+ QGGPII++QIENEY + +G +G Y+
Sbjct: 143 RSDNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVR 202
Query: 204 WCAKMATSLDIGVPWIMCQESDAPSPM 230
W A MA L GVPW+MC+++DAP P+
Sbjct: 203 WAAAMAVGLQTGVPWMMCKQNDAPDPV 229
>gi|255550369|ref|XP_002516235.1| beta-galactosidase, putative [Ricinus communis]
gi|223544721|gb|EEF46237.1| beta-galactosidase, putative [Ricinus communis]
Length = 451
Score = 253 bits (646), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 176/500 (35%), Positives = 233/500 (46%), Gaps = 107/500 (21%)
Query: 284 MYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELH-KLLKSMEKTLTYGN 342
MYHGGTNF R SGGP + TSYDYDAP+DEYG+LNQPKWGHLR+LH ++L + ++ G
Sbjct: 38 MYHGGTNFRRMSGGPMIVTSYDYDAPLDEYGNLNQPKWGHLRDLHVRILLHLSQSRGLGF 97
Query: 343 VTN-----TDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGND 397
T T Y N+ +G + + K N N+ ++ Q G
Sbjct: 98 ATVYALNLTTYINNATGERFCF---------------LSNTKTNEDANIDLQ---QDGIF 139
Query: 398 QAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGS 457
P + + +G+F K+T+D +DYL Y+T +
Sbjct: 140 FVPAWIYYYSSRVQ------QGNFQ-----QCKATSDETDYLRYITRYFDFFTVSVKDVH 188
Query: 458 SNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQ 517
S N+ L G +F LT GK Q
Sbjct: 189 SRCQQCNNTEEHDLACDFFGTSPACSCQSAARLQQVFHSIYNLTSGK------------Q 236
Query: 518 NYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSE 577
NYG FD P GI G DLSS++W YK+GL G + K+ Y+ + + +
Sbjct: 237 NYGEFFDEGPEGIAGAA-------------DLSSNQWAYKIGL-GGEAKRLYDPNSGHRD 282
Query: 578 RGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEE 637
+S +P+ R MTWYKTTF P DP+VLNLQGMGKG AWVNG++LGR+WP A+
Sbjct: 283 VFRTSAILPVGRAMTWYKTTFHVPSGTDPLVLNLQGMGKGHAWVNGHSLGRFWPMQSADP 342
Query: 638 DGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
G S SCDYRG Y DKC NCGNP+Q W H+
Sbjct: 343 TGYSG-SCDYRGKYDKDKCLTNCGNPTQRWKHI--------------------------- 374
Query: 698 QTVVVGTACGQAHENKTMELTCHGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLI 757
+GR IS I++ASFG+P+G CG+ +KG EA +
Sbjct: 375 -----------------ATFMPNGRIISVIQFASFGNPEGTCGSLQKGDFEAAYTAFA-V 416
Query: 758 EKQCVGKKSCSIEASEANLG 777
EK CVGK+SCS+ SE+ LG
Sbjct: 417 EKACVGKESCSLGVSESTLG 436
Score = 47.0 bits (110), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 20/26 (76%), Positives = 23/26 (88%)
Query: 164 MAKKEKLFASQGGPIILAQIENEYGN 189
MAK+ KLFAS GGPI+ AQIEN+YGN
Sbjct: 1 MAKEAKLFASSGGPIVFAQIENDYGN 26
>gi|56550179|emb|CAE51355.1| putative beta-galactosidase [Musa acuminata]
Length = 281
Score = 246 bits (629), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 134/287 (46%), Positives = 164/287 (57%), Gaps = 46/287 (16%)
Query: 124 EWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQI 183
EWN+GGFPVWL +PGI RT N F M FT IV M K E LF SQGGPIIL+QI
Sbjct: 1 EWNFGGFPVWLKYVPGIN-FRTDNGPFKAAMAKFTEKIVAMMKSEGLFESQGGPIILSQI 59
Query: 184 ENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FT 232
ENEYG V G A K+Y++W A+MA L+ VPW+MC++ DAP P+ F+
Sbjct: 60 ENEYGPVEYYGGTAAKNYLSWAAQMAVGLNTRVPWVMCKQDDAPDPVINACNGFYCDYFS 119
Query: 233 PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG 292
PN P P +WTE WTGWF + G + A V R + T + GTNFG
Sbjct: 120 PNKPYKPTMWTEAWTGWFTGFRGPVLTDCEDCFAVQVIRRWILVTTIVPW-----GTNFG 174
Query: 293 RTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN-- 350
RT+GGP+++TSYDYDAPIDEYG L QPKWGHLR+LHK +K E L G+ T T GN
Sbjct: 175 RTAGGPFISTSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKMCEPALVSGDPTVTKLGNYQ 234
Query: 351 ---------------------------SVSGSSYNLPAWSVSILPDC 370
+ +G YN+P+WS+SILPDC
Sbjct: 235 EAHVYRSKSGSCAAFLSNFNPHSYASVTFNGMKYNIPSWSISILPDC 281
>gi|217070894|gb|ACJ83807.1| unknown [Medicago truncatula]
Length = 283
Score = 246 bits (628), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 132/275 (48%), Positives = 165/275 (60%), Gaps = 41/275 (14%)
Query: 208 MATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGK 256
MATSLD GVPWIMCQ+++AP P+ FTPN+ N PK+WTENW+GWF ++GG
Sbjct: 1 MATSLDTGVPWIMCQQANAPDPIINTCNSFYCDQFTPNSDNKPKMWTENWSGWFLAFGGA 60
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHL 316
P R EDLAFAVARFFQ GGTFQNYYMYHGGTNFGRT+GGP+++TSYDYDAPIDEYG +
Sbjct: 61 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFGRTTGGPFISTSYDYDAPIDEYGDI 120
Query: 317 NQPKWGHLRELHKLLKSMEKTLT---------------------------YGNVTNTDYG 349
QPKWGHL++LHK +K E+ L N+ +D
Sbjct: 121 RQPKWGHLKDLHKAIKLCEEALIASDPTITSPGPNLETAVYKTGAVCSAFLANIGMSDAT 180
Query: 350 NSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK--VKRPNQAGNDQAPLQWKWRP 407
+ +G+SY+LP WSVSILPDCK NTAKVNT + + + D
Sbjct: 181 VTFNGNSYHLPGWSVSILPDCKNVVLNTAKVNTASMISSFATESLKEKVDSLDSSSSGWS 240
Query: 408 EMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWY 441
+ + F + L++Q +T D SDYLWY
Sbjct: 241 WISEPVGISTPDAFTKSGLLEQINTTADRSDYLWY 275
>gi|452825532|gb|EME32528.1| beta-galactosidase [Galdieria sulphuraria]
Length = 752
Score = 244 bits (623), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 128/326 (39%), Positives = 177/326 (54%), Gaps = 38/326 (11%)
Query: 25 LAYR--VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWN 82
+AY+ S D RAIT++G+R +LL GS+ YP+ W + +K AKE GL+ ++ YVFWN
Sbjct: 1 MAYQGVASFDSRAITLNGKRTLLLGGSLQYPKIHHTQWNNTLKLAKECGLNFLDIYVFWN 60
Query: 83 AHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEE 142
HE R + FT D+ RF++ GL V+LR+GPY+CAE +YGGFP WL +PGI +
Sbjct: 61 VHEKKRGIFTFTEEADIFRFLQMAHQHGLLVMLRLGPYICAETSYGGFPCWLREIPGI-Q 119
Query: 143 LRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYI 202
RT N FM E++ + I + K+++LF QGGPI+L Q+ENEY V G+ Y+
Sbjct: 120 FRTYNDPFMREVKRWLFYITTLLKEKRLFFPQGGPIVLVQLENEYDLVSKIQLSKGEQYL 179
Query: 203 NWCAKMATSLDIGVPWIMCQESDAPSPMFTP----------------------------- 233
NW ++ L VP IMC+ S F
Sbjct: 180 NWYNELYRELAFDVPLIMCRSSPEEVGEFCSCSKEPELSTIASVETCIETFNSFYGHKKI 239
Query: 234 -----NNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGG 288
P+ P +WTE W GW+ W KR+ ED+ +A RF GG +YYM+HGG
Sbjct: 240 ADLRRRKPHQPILWTEFWIGWYDIWTSAPRKRSTEDVIYAALRFIAQGGAGFSYYMFHGG 299
Query: 289 TNFGRTSGGPYLTTSYDYDAPIDEYG 314
T+F + TTSY +D+PIDEYG
Sbjct: 300 THFNNLAMYSQ-TTSYYFDSPIDEYG 324
>gi|116782829|gb|ABK22678.1| unknown [Picea sitchensis]
Length = 317
Score = 240 bits (613), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 135/327 (41%), Positives = 181/327 (55%), Gaps = 35/327 (10%)
Query: 493 LFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSH 552
+FE P+ L G N I+LLS VGL N G F+ GI L + G +DLS
Sbjct: 1 MFELPISLIPGTNDIALLSVMVGLPNSGGHFERKIAGISTVTLRGFKDG----TRDLSQE 56
Query: 553 KWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQ 612
WTY++GL G + + + W+S + P N +TWYK + P ++PV+L+L
Sbjct: 57 LWTYQIGLLGEMSTIYSDVGFISVN--WTSSSTP-NPPLTWYKAVIDVPDGDEPVILDLS 113
Query: 613 GMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPR 672
MGKG AW+NG ++GRYW ++LA CS CDYRG Y KCA NCG PSQ YHVPR
Sbjct: 114 SMGKGQAWINGEHIGRYWISFLAPLGDCS--KCDYRGNYSLHKCATNCGQPSQTLYHVPR 171
Query: 673 SWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHENK------------------- 713
SW++ N LVLFEE GG+PS+++ T + + C A E
Sbjct: 172 SWLRPTGNLLVLFEETGGDPSKVSLLTRSIDSVCAHAFETHPPSIQSWQKTKVNSEVLRE 231
Query: 714 ----TMELTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCS 768
+++L C GRRIS IK+ASFG+P+G CG F KG+C + ++ +EK C+G+ CS
Sbjct: 232 NVEPSLQLDCSVGRRISSIKFASFGNPKGVCGNFMKGTCHS-VESEKAVEKACLGQHGCS 290
Query: 769 IEASEANLGATSCAAGTVKRLVVEALC 795
I S G +C GTVK L VEA C
Sbjct: 291 ITNSPKEFGGDAC-VGTVKSLAVEATC 316
>gi|56550181|emb|CAE51356.1| putative beta-galactosidase [Musa AAB Group]
Length = 282
Score = 238 bits (608), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 134/285 (47%), Positives = 163/285 (57%), Gaps = 41/285 (14%)
Query: 124 EWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQI 183
EWN+GGFPVWL +PGI RT N F M FT IV M K E LF SQGGPIIL+QI
Sbjct: 1 EWNFGGFPVWLKYVPGIN-FRTDNGPFKAAMAKFTEKIVAMMKSEGLFESQGGPIILSQI 59
Query: 184 ENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFTPNN-------- 235
ENEYG V G A K+Y++W A+MA L+ GVPW+MC++ DAP P+ N
Sbjct: 60 ENEYGPVEYYGGAAAKNYLSWAAQMAVGLNTGVPWVMCKQDDAPDPVINAGNGFYCDYFS 119
Query: 236 PNSPKIWTENWT-GWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRT 294
PNS K + W G +T F V + + G F+NYYMYHGGTNFGRT
Sbjct: 120 PNSLKTFFGGLKLDWLVPVSGSSSSQTVRT-GFCVQVYTE-GWIFRNYYMYHGGTNFGRT 177
Query: 295 SGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN---- 350
+GG +++TSYDYDAPIDEY L QPKWGHLR+LHK +K E L G+ T T GN
Sbjct: 178 AGGLFISTSYDYDAPIDEYVLLRQPKWGHLRDLHKAIKMCEPALVSGDPTVTKLGNYQEA 237
Query: 351 -------------------------SVSGSSYNLPAWSVSILPDC 370
+ +G YN+P+WS+SILPDC
Sbjct: 238 HVYRSKSGSCAAFLSNFNPHSYASVTFNGMKYNIPSWSISILPDC 282
>gi|294948459|ref|XP_002785761.1| beta-galactosidase, putative [Perkinsus marinus ATCC 50983]
gi|239899809|gb|EER17557.1| beta-galactosidase, putative [Perkinsus marinus ATCC 50983]
Length = 770
Score = 238 bits (607), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 200/708 (28%), Positives = 313/708 (44%), Gaps = 152/708 (21%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
Y V++D RA IDG R +LL GSIHYPR W ++++ GL+ ++ YVFWN HEP
Sbjct: 49 YSVTYDSRAFKIDGVRTLLLGGSIHYPRVAVDEWEPMLEEMGRDGLNHVQLYVFWNYHEP 108
Query: 87 -----------LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLH 135
L +YDF+G DL+ FI+ + L+V LRIGPYVCAEW +GG P+WL
Sbjct: 109 RPPRYDQLKDRLEHKYDFSGRGDLLGFIRAAAKKDLFVSLRIGPYVCAEWAFGGLPLWLR 168
Query: 136 NMPGI--------------------EELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQG 175
++ G+ + R+ + + M +F I M K+ L A+QG
Sbjct: 169 DVEGMCFRSICGYNGSPGKCKPWEGGKFRSCDP-WRKYMADFVMEIGRMVKEANLMAAQG 227
Query: 176 GPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFTPNN 235
GP+IL Q+ENEYG+ + DAG++YI+W +++ L + VPW+MC A + N
Sbjct: 228 GPVILGQLENEYGH----HSDAGRAYIDWVGELSFGLGLDVPWVMCNGISANGTLNVCNG 283
Query: 236 ---------------PNSPKIWTENWTGWFKSWGGK--DPKRTAEDLAFAVARFFQFGGT 278
P+ P WTEN GWF +WGG + KR+AE++A+ +A++ GG+
Sbjct: 284 DDCADEYKTDHDKRWPDEPLGWTEN-EGWFDTWGGAVGNSKRSAEEMAYVLAKWVAVGGS 342
Query: 279 FQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTL 338
NYYM++GG + + G LT +Y G N+PK HL+ LH++L + L
Sbjct: 343 HHNYYMWYGGNHLAQW-GAASLTNAYADGVNFHSNGLPNEPKRSHLQRLHEVLGKLNGEL 401
Query: 339 TY----GNVTNTDYGNSV-----------------SGS---------SYNLPAWSVSIL- 367
+V N V SGS +Y++ V ++
Sbjct: 402 MQVEDRHSVMPVQLENGVEVYEWTAGLAFLHRPACSGSPVEVHYAKATYSIACREVLVVD 461
Query: 368 PDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLI 427
P T F TA V + + D +W R E + + +G + L
Sbjct: 462 PSSSTVLFATASVEPPPELVRRVVATLTAD----RWSMRKEELLHGMATVEGREPVEHL- 516
Query: 428 DQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSS-GQVLHAYVNG--------- 477
+ + +DY+ Y T + G +N++L I+S QV H V+
Sbjct: 517 --RVSGLDTDYVTYKTTVTATE------GVTNVSLEIDSRISQVFHVSVDNASSLAATVM 568
Query: 478 --NYVDSQWTKYGASNDLFERPVKLTRGKN-QISLLSATVGLQN---YGSKFDMVPN--- 528
N +++WT ++ LT G+ + +LS ++G++N YG+ P+
Sbjct: 569 DVNKGNTEWTAVAQLHN-------LTAGRTYDLWILSESLGVENGMLYGAPAATEPSLQK 621
Query: 529 GIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLN 588
GI G + R +++I K +W+ GL G D G +P
Sbjct: 622 GIFGDI----RLNEKSIRKG----RWSMVKGLDGEVDG------------GQGKAELPCC 661
Query: 589 RRM--TWYKTTFEAPLENDPVV-----LNLQGMGKGFAWVNGYNLGRY 629
+ W+ F + L L G W+NG ++GR+
Sbjct: 662 DSLGPAWFVAGFTLHSVRSKSISLTLPLGLPQQAGGHIWLNGVDIGRW 709
>gi|10047451|gb|AAG12249.1|AF184080_1 beta-galactosidase [Prunus armeniaca]
Length = 376
Score = 233 bits (595), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 145/376 (38%), Positives = 197/376 (52%), Gaps = 36/376 (9%)
Query: 443 TNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTR 502
TN D+ + L G TL + S+G LH +VNG + S + F +PV L
Sbjct: 1 TNVDISSSE--LHGGKKPTLTVQSAGHALHVFVNGQFSGSAFGTREQRQFTFAKPVHLRA 58
Query: 503 GKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYG 562
G N+I+LLS VGL N G ++ GI GPV L G KDL+ KW KVGL G
Sbjct: 59 GINKIALLSIAVGLPNVGLHYESWKTGILGPVFLDGLGQGR---KDLTMQKWFNKVGLKG 115
Query: 563 --LDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAW 620
+D + + RG S + + WYK F AP ++P+ L+++ MGKG W
Sbjct: 116 EAMDLVSPNGGSSVDWIRG--SLATQTKQTLKWYKAYFNAPGGDEPLALDMRSMGKGQVW 173
Query: 621 VNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVN 680
+NG ++GRYW Y A D CS C Y G + KC CG P+Q WYHVPRSW+K N
Sbjct: 174 INGQSIGRYWMAY-ANGD-CSL--CSYIGTFRPTKCQLGCGQPTQRWYHVPRSWLKPTKN 229
Query: 681 TLVLFEEFGGNPSQINFQTVVVGTACG--QAH-------------ENKTM-----ELTC- 719
+V+FEE GG+PS+I V C Q H E+KT+ L C
Sbjct: 230 LMVMFEELGGDPSKITLVKRSVAGVCADLQEHHPNAEKFDIDSHEESKTLHQAQVHLQCV 289
Query: 720 HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGAT 779
G+ IS IK+ASFG P G CG+F++G+C A + ++EK C+G++SC + S + G
Sbjct: 290 PGQSISSIKFASFGTPTGTCGSFQQGTCHA-TNSHAIVEKNCIGRESCLVTVSNSIFGTD 348
Query: 780 SCAAGTVKRLVVEALC 795
C +KRL VEA+C
Sbjct: 349 PC-PNVLKRLSVEAVC 363
>gi|68161830|emb|CAJ09952.1| beta-galactosidase [Mangifera indica]
Length = 362
Score = 231 bits (589), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 130/337 (38%), Positives = 184/337 (54%), Gaps = 22/337 (6%)
Query: 357 YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVR 416
Y LP WS+SILPDCKT FNTA++ Q+++K P + W+ +
Sbjct: 43 YELPPWSISILPDCKTAVFNTARLGAQSSLKQMTPVST--------FSWQSYIEESASSS 94
Query: 417 GKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYV 475
F + L +Q + T D SDYLWYMTN ++ ++ L + L I S+G LH ++
Sbjct: 95 DDKTFTTDGLWEQLNVTRDASDYLWYMTNINIDSNEGFLKNGQDPLLTIWSAGHALHVFI 154
Query: 476 NGNYVDSQWTKYGASND---LFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPG 532
NG T YG ++ F + VK+ G NQ+SLLS +VGLQN G+ F+ G+ G
Sbjct: 155 NGQLSG---TVYGGVDNPKLTFSQNVKMRVGVNQLSLLSISVGLQNVGTHFEQWNTGVLG 211
Query: 533 PVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMT 592
PV L G +DLS +W+YK+GL G +D + ++S ++ + +T
Sbjct: 212 PVTLRGL---NEGTRDLSKQQWSYKIGLKG-EDLSLHTVSGSSSVEWVEGSSLAQKQPLT 267
Query: 593 WYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYG 652
WYKTTF AP N+P+ L++ MGKG W+N ++GR+WP Y+A S C+Y G Y
Sbjct: 268 WYKTTFNAPAGNEPLALDMSTMGKGLIWINSQSIGRHWPGYIAHG---SCGECNYAGTYT 324
Query: 653 SDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
KC NCG PSQ WYHVPRSW+ N LV+ + G
Sbjct: 325 DKKCHTNCGQPSQRWYHVPRSWLNPTGNLLVVLKRVG 361
>gi|302144233|emb|CBI23471.3| unnamed protein product [Vitis vinifera]
Length = 315
Score = 231 bits (588), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 106/172 (61%), Positives = 131/172 (76%), Gaps = 1/172 (0%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
+LL LI +F V++D RA+ IDG+R++L SGSIHYPRS P +WP++I+K+KEG
Sbjct: 142 VLLVLIAVCVFEGCYCKTVTYDHRALVIDGKRRVLQSGSIHYPRSMPEVWPEIIRKSKEG 201
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GLD IETYVFWN HEP+R +Y F G DL+RF+KT+Q+ GL V LRIGPY CAEWNYGGF
Sbjct: 202 GLDVIETYVFWNNHEPVRGEYYFEGRFDLVRFVKTVQEAGLLVHLRIGPYACAEWNYGGF 261
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQ 182
PVWLH +PGI + RTTN +F NEM+ F IV + K+ LFA QGGPIILAQ
Sbjct: 262 PVWLHFIPGI-QFRTTNDLFKNEMKRFLAKIVSLMKEANLFAPQGGPIILAQ 312
>gi|297797852|ref|XP_002866810.1| hypothetical protein ARALYDRAFT_912308 [Arabidopsis lyrata subsp.
lyrata]
gi|297312646|gb|EFH43069.1| hypothetical protein ARALYDRAFT_912308 [Arabidopsis lyrata subsp.
lyrata]
Length = 448
Score = 231 bits (588), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 115/267 (43%), Positives = 162/267 (60%), Gaps = 16/267 (5%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++DG ++ I+G+R++L S S+HYPRSTP MWP +I KA+ GGL+ I+TYVFWN HEP
Sbjct: 42 VTYDGTSLIINGKRELLFSVSVHYPRSTPDMWPSIIDKARIGGLNTIQTYVFWNVHEPEH 101
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
R+YDF G DL+ FIK IQ++GLYV LR+GP++ AEWN+GG P WL +P + RT N+
Sbjct: 102 RKYDFKGRFDLVTFIKLIQEKGLYVTLRLGPFIQAEWNHGGLPYWLREVPEV-YFRTDNE 160
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F + + I+ M K+EKL ASQ L ENE V Y + G+ YI W A +
Sbjct: 161 PFKEHTERYVRKILGMMKEEKLLASQRRSHHLG-TENECNAVQLAYKENGERYIKWAANL 219
Query: 209 ATSLDIGVPWIMCQESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFA 268
S+ +G+PW+MC++++A + N F+ G +ED+AF+
Sbjct: 220 VESMKLGIPWVMCKQNNASDNLINACNGRH----------CFEFLGILQLIEQSEDIAFS 269
Query: 269 VARFFQFGGTFQNYYM----YHGGTNF 291
VAR+F G+ NYYM YH +F
Sbjct: 270 VARYFSKNGSHVNYYMMVDRYHIPRSF 296
Score = 45.8 bits (107), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 47/109 (43%), Gaps = 27/109 (24%)
Query: 668 YHVPRSWIKD--GVNTLVLFEEFGG-NPSQINFQTVVVGTACGQAHEN------------ 712
YH+PRS++K+ N LV+ EE G I+F V T C E+
Sbjct: 290 YHIPRSFMKEEKKKNMLVILEEEPGVKLEAIDFVLVNRDTICSYVGEDYPVSVKSWKRER 349
Query: 713 -------KTMELTC-----HGRRISEIKYASFGDPQGACGAFKKGSCEA 749
K M L +++ +++ASFGDP G CG F G C A
Sbjct: 350 PKIASRSKDMRLKAVMKCPPEKQMVAVEFASFGDPTGTCGNFTMGKCSA 398
>gi|3388167|gb|AAC28739.1| beta-galactosidase [Carica papaya]
Length = 203
Score = 230 bits (586), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 114/205 (55%), Positives = 135/205 (65%), Gaps = 13/205 (6%)
Query: 53 PRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLY 112
PRSTP MWPDLI+ AKEGGLD I+TYVFWN HEP Y F D ++FIK + GLY
Sbjct: 1 PRSTPEMWPDLIQNAKEGGLDVIQTYVFWNGHEPSPGNYYFEDRYDPVKFIKLVHQAGLY 60
Query: 113 VILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFA 172
V LRIGPY+C EWN+GGFPVWL +PGI + RT N F +MQ FT IV+M K EKLF
Sbjct: 61 VHLRIGPYICGEWNFGGFPVWLKYVPGI-QFRTDNGPFKAQMQKFTEKIVNMMKAEKLFE 119
Query: 173 SQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-- 230
QGGP I++QIE EYG + + G GK+Y W A+MA L GVPWIMC++ DAP P+
Sbjct: 120 PQGGP-IMSQIEIEYGPIGWEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQEDAPDPIID 178
Query: 231 ---------FTPNNPNSPKIWTENW 246
F PN PK+WTE W
Sbjct: 179 TCNGFYCENFMPNANYKPKMWTEAW 203
>gi|359496728|ref|XP_002268994.2| PREDICTED: beta-galactosidase 6-like, partial [Vitis vinifera]
Length = 177
Score = 229 bits (583), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 106/172 (61%), Positives = 131/172 (76%), Gaps = 1/172 (0%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
+LL LI +F V++D RA+ IDG+R++L SGSIHYPRS P +WP++I+K+KEG
Sbjct: 7 VLLVLIAVCVFEGCYCKTVTYDHRALVIDGKRRVLQSGSIHYPRSMPEVWPEIIRKSKEG 66
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GLD IETYVFWN HEP+R +Y F G DL+RF+KT+Q+ GL V LRIGPY CAEWNYGGF
Sbjct: 67 GLDVIETYVFWNNHEPVRGEYYFEGRFDLVRFVKTVQEAGLLVHLRIGPYACAEWNYGGF 126
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQ 182
PVWLH +PGI + RTTN +F NEM+ F IV + K+ LFA QGGPIILAQ
Sbjct: 127 PVWLHFIPGI-QFRTTNDLFKNEMKRFLAKIVSLMKEANLFAPQGGPIILAQ 177
>gi|351722837|ref|NP_001235722.1| lectin [Glycine max]
gi|217314871|gb|ACK36970.1| lectin [Glycine max]
Length = 447
Score = 225 bits (573), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 145/394 (36%), Positives = 203/394 (51%), Gaps = 54/394 (13%)
Query: 432 TNDVSDYLWYMTNADLKDDDPILSGSSNM--TLRINSSGQVLHAYVNGNYVDSQWTKYGA 489
T D SDYLWY T + D D + +++ L I+ +L ++NG +
Sbjct: 62 TKDQSDYLWYSTRVYVSDSDILFWEENDVHPKLTIDGVRDILRVFINGQLI--------V 113
Query: 490 SNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDL 549
++ F+ + ++ GKN + S + NYG+ + GI G + + G + DL
Sbjct: 114 KDEQFKAVISVSIGKNDCTAGS----INNYGAFLEKDGAGIRGKIKITGFENGDI---DL 166
Query: 550 SSHKWTYKVGLYGLDDKKFYNAKAANSERGW---SSKNVPLNRRMTWYKTTFEAPLENDP 606
S WTY+VGL G + KFY+ + NSE W + +P TWYKT F+ P DP
Sbjct: 167 SKSLWTYQVGLQG-EFLKFYSEENENSE--WVELTPDAIP--STFTWYKTYFDVPGGIDP 221
Query: 607 VVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQI 666
V L+ + MGKG AWVNG ++GRYW T ++ + GC + CDYRG Y SDKC+ NCG P+Q
Sbjct: 222 VALDFKSMGKGQAWVNGQHIGRYW-TRVSPKSGCQ-QVCDYRGAYNSDKCSTNCGKPTQT 279
Query: 667 WYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHE--------------- 711
YHVPRSW+K N LV+ EE GGNP +I+ + C Q E
Sbjct: 280 LYHVPRSWLKATNNLLVILEETGGNPFEISVKLHSSRIICAQVSESNYPPLQKLVNADLI 339
Query: 712 -------NKTMELTCH---GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQC 761
N EL H G IS + +ASFG P G+C F +G+C A + ++ + C
Sbjct: 340 GEEVSANNMIPELHLHCQQGHTISSVAFASFGTPGGSCQNFSRGNCHAP-SSMSIVSEAC 398
Query: 762 VGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
GK+SCSI+ S++ G C G VK L VEA C
Sbjct: 399 QGKRSCSIKISDSAFGVDPC-PGVVKTLSVEARC 431
>gi|16649045|gb|AAL24374.1| beta-galactosidase [Arabidopsis thaliana]
gi|20260008|gb|AAM13351.1| beta-galactosidase [Arabidopsis thaliana]
Length = 420
Score = 221 bits (562), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 163/459 (35%), Positives = 220/459 (47%), Gaps = 73/459 (15%)
Query: 284 MYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNV 343
MYHGGTNFGRTS ++T YD AP+DEYG L QPK+GHL+ELH +KS L G
Sbjct: 1 MYHGGTNFGRTSSSYFITGYYD-QAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQ 59
Query: 344 TNTDYG---------------------NSVSGS-------SYNLPAWSVSILPDCKTEEF 375
T G N S +Y+L S+ IL +CK +
Sbjct: 60 TILSLGPMQQAYVFEDANNGCVAFLVNNDAKASQIQFRNNAYSLSPKSIGILQNCKNLIY 119
Query: 376 NTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TND 434
TAKVN + N +V P Q N P W E I F N L++ + T D
Sbjct: 120 ETAKVNVKMNTRVTTPVQVFN--VPDNWNLFRETIPAFP---GTSLKTNALLEHTNLTKD 174
Query: 435 VSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLF 494
+DYLWY ++ K D P +N ++ SSG V+H +VN S
Sbjct: 175 KTDYLWYTSS--FKLDSPC----TNPSIYTESSGHVVHVFVNNALAGSGHGSRDIRVVKL 228
Query: 495 ERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKW 554
+ PV L G+N IS+LS VGL + G+ + G+ + G G + I DLS +W
Sbjct: 229 QAPVSLINGQNNISILSGMVGLPDSGAYMERRSYGLTKVQISCG--GTKPI--DLSRSQW 284
Query: 555 TYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL--NRRMTWYKTTFEAPLENDPVVLNLQ 612
Y VGL G + + Y K N + WS L NR + WYKTTF+ P + PV L++
Sbjct: 285 GYSVGLLG-EKVRLYQWKNLNRVK-WSMNKAGLIKNRPLAWYKTTFDGPNGDGPVGLHMS 342
Query: 613 GMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPR 672
MGKG WVNG ++GRYW ++L G PSQ YH+PR
Sbjct: 343 SMGKGEIWVNGESIGRYWVSFLTP-----------------------AGQPSQSIYHIPR 379
Query: 673 SWIKDGVNTLVLFEEFGGNPSQINFQTV-VVGTACGQAH 710
+++K N LV+FEE GG+P I+ T+ VVG++ Q+
Sbjct: 380 AFLKPSGNLLVVFEEEGGDPLGISLNTISVVGSSQAQSQ 418
>gi|449534351|ref|XP_004174126.1| PREDICTED: beta-galactosidase-like, partial [Cucumis sativus]
Length = 154
Score = 215 bits (547), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 103/154 (66%), Positives = 119/154 (77%), Gaps = 1/154 (0%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V++D +AI I+G+R+IL+SGSIHYPRSTP MWPDLI+KAK+GGLD IETYVFWN HEP
Sbjct: 2 VTYDHKAIIINGQRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDIIETYVFWNGHEPSP 61
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y F DL+RFIK +Q GLYV LRIGPYVCAEWNYGGFP+WL +PGI RT N
Sbjct: 62 DKYYFEERYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYGGFPLWLKFVPGI-AFRTDNA 120
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQ 182
F MQ F IVDM K EKLF +QGGPIIL+Q
Sbjct: 121 PFKAAMQKFVYKIVDMMKWEKLFHTQGGPIILSQ 154
>gi|414880685|tpg|DAA57816.1| TPA: putative RAN GTPase activating family protein [Zea mays]
Length = 598
Score = 213 bits (543), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 121/285 (42%), Positives = 166/285 (58%), Gaps = 46/285 (16%)
Query: 283 YMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGN 342
+ YHGGTNFGRTSGGPY+TTSYDYDAP+DEYG++ QPK+GHL++LH L++SMEK L +G
Sbjct: 308 FKYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNIRQPKYGHLKDLHDLIRSMEKILVHGK 367
Query: 343 VTNTDYGN-----------SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRP 391
+T YG ++SG ++ +PAWSVSILPDCKT +NTAK+ TQT+V VK+
Sbjct: 368 YNDTSYGKNAIFVDRDVKVTLSGGTHLVPAWSVSILPDCKTVAYNTAKIKTQTSVMVKKA 427
Query: 392 NQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDD 450
N + L+W W PE + F+ + F + L++Q +T+ D SDYLWY T+ + K
Sbjct: 428 NSVEKEPEALRWSWMPENLKPFMTDHRDSFRHSQLLEQITTSTDQSDYLWYRTSLEHK-- 485
Query: 451 DPILSGSSNMTLRINSSGQ-----------VLHAYVNGN-------------YVDSQWTK 486
G + TL +N+SG L A V+G + +Q
Sbjct: 486 -----GEGSYTLYVNTSGHEMAKLLGRWSVRLPAPVSGEAPLRKELRFSPQRHSRTQGQN 540
Query: 487 YGASNDL---FERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPN 528
Y A + PVKL GKN +SLLS TVGL++ + +V N
Sbjct: 541 YSADGAFVFQLQSPVKLHSGKNYVSLLSGTVGLKSAKTLVIVVEN 585
>gi|218188529|gb|EEC70956.1| hypothetical protein OsI_02569 [Oryza sativa Indica Group]
Length = 480
Score = 212 bits (539), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 132/340 (38%), Positives = 181/340 (53%), Gaps = 33/340 (9%)
Query: 474 YVNGNYVDSQWTKYGASND---LFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGI 530
+ N + S+ T YG+ +D + VKL G N IS LS VGL N G F+ GI
Sbjct: 155 HCNFTWKCSEGTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIAVGLPNVGEHFETWNAGI 214
Query: 531 PGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRR 590
GPV L G +DL+ KWTY+VGL G + ++ E G +N
Sbjct: 215 LGPVTLDGLNEGR---RDLTWQKWTYQVGLKGESTTLHSLSGSSTVEWGEPVQNA---SN 268
Query: 591 MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGP 650
M + F AP ++P+ L++ MGKG W+NG +GRYWP Y A + C T CDYRG
Sbjct: 269 MAF----FNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKASGN-CGT--CDYRGE 321
Query: 651 YGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQA- 709
Y KC NCG+ SQ WYHVPRSW+ N LV+FEE+GG+P+ I+ +G+ C
Sbjct: 322 YDETKCQTNCGDSSQRWYHVPRSWLSPTGNLLVIFEEWGGDPTGISMVKRSIGSVCADVS 381
Query: 710 -------------HENKTMELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLP 755
+E + L C +G++I+EIK+ASFG PQG+CG++ +G C A
Sbjct: 382 EWQPSMKNWHTKDYEKAKVHLQCDNGQKITEIKFASFGTPQGSCGSYTEGGCHAH-KSYD 440
Query: 756 LIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
+ K CVG++ C + G C GT+KR VVEA+C
Sbjct: 441 IFWKNCVGQERCGVSVVPEIFGGDPC-PGTMKRAVVEAIC 479
Score = 155 bits (393), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 74/143 (51%), Positives = 94/143 (65%), Gaps = 11/143 (7%)
Query: 154 MQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLD 213
MQ FTT IV+M K E LF QGGPIIL+QIENE+G + D G+ K+Y +W A MA +L+
Sbjct: 1 MQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAVALN 60
Query: 214 IGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTA 262
VPWIMC+E DAP P+ F+PN P+ P +WTE WT W+ +G P R
Sbjct: 61 TSVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTAWYTGFGIPVPHRPV 120
Query: 263 EDLAFAVARFFQFGGTFQNYYMY 285
EDLA+ VA+F Q GG+F NYYM+
Sbjct: 121 EDLAYGVAKFIQKGGSFVNYYMF 143
>gi|320536152|ref|ZP_08036203.1| glycosyl hydrolase family 35 [Treponema phagedenis F0421]
gi|320147005|gb|EFW38570.1| glycosyl hydrolase family 35 [Treponema phagedenis F0421]
Length = 857
Score = 210 bits (534), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 121/353 (34%), Positives = 185/353 (52%), Gaps = 32/353 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
+ D + IDG+RK ++S ++HY R W +I+KA+ GG +AIETY+ WN HE
Sbjct: 2 IQFDSNSWIIDGKRKFIISAAVHYFRLPRAEWAAVIRKARLGGCNAIETYIAWNYHETAE 61
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Q+DF+G+ DL F D+G+YVI+R GPY+CAEW++GG P +L+N GI E R +N
Sbjct: 62 EQWDFSGDKDLAAFFAICHDEGMYVIVRPGPYICAEWDFGGLPYYLNNTDGI-EYRCSNA 120
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
+ ++ + I+ + ++ +L GG II+ QIENEY +G ++I + ++
Sbjct: 121 AYEQAVRRYFERIMPIIRRYQL--GSGGSIIMVQIENEY----HAFGKKDLAHIRFLEEL 174
Query: 209 ATSLDIGVPWIMC--------------QESDAPSPMFTPNNPNSPKIWTENWTGWFKSWG 254
I VP + C ++ + + P E W GW + WG
Sbjct: 175 TRGFGITVPLVSCYGAGRNTVEMRNFWSGAERAAAVLRERQSGQPLGIMEFWIGWVEHWG 234
Query: 255 GKDPK-RTAEDLAFAVARFFQFGGTFQNYYMYHGGTNF----GRTSGGP--YLTTSYDYD 307
G+ K + AE + + G F NYYMY GG+NF GRT G ++T SYDYD
Sbjct: 235 GEPQKHKPAEAVLSHCFEALKSGFVFFNYYMYFGGSNFGSWGGRTIGAHKIFMTQSYDYD 294
Query: 308 APIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNV---TNTDYGNSVSGSSY 357
AP+DE+G K+ L LH + +E LT G++ ++ SV+ + Y
Sbjct: 295 APLDEFG-FETEKYRLLAVLHTFIAWLENDLTAGSLLIQEQAEHELSVTKAEY 346
>gi|357483853|ref|XP_003612213.1| Beta-galactosidase [Medicago truncatula]
gi|355513548|gb|AES95171.1| Beta-galactosidase [Medicago truncatula]
Length = 418
Score = 208 bits (530), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 120/307 (39%), Positives = 165/307 (53%), Gaps = 51/307 (16%)
Query: 48 GSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQ 107
GS+HYPR P MWPD+ KKAK Q++F GN DLI+FIK I
Sbjct: 11 GSVHYPRCPPEMWPDIFKKAK---------------------QFNFEGNYDLIKFIKMI- 48
Query: 108 DQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKK 167
G+ + ++ ++ + P+WL +P I R+ N+ FM M+ FT +I+ +
Sbjct: 49 --GIMICMQ---HLELVHSLKELPIWLREIPNII-FRSDNQPFMYHMEQFTKMIIKKMRD 102
Query: 168 EKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAP 227
EK F + QIENE+ V Y + G Y+ W MA LD GVPWIMC++ +A
Sbjct: 103 EKFFPRK-------QIENEHTAVQQAYKEHGMRYVQWEGNMAVGLDTGVPWIMCKQVNAL 155
Query: 228 SPMFT-------------PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQ 274
P+ PN + I ++ ++++G +RTAED+A AVARFF
Sbjct: 156 GPVMNTCNGRYCGDTFSGPNKNSHLNIHLRHYR--YRAFGDPPSERTAEDIAIAVARFFS 213
Query: 275 FGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSM 334
GT NYYMY+GGTNFGRTS ++TT Y +API EYG +PKWGH R+LH LK
Sbjct: 214 KKGTMANYYMYYGGTNFGRTSSS-FVTTQYYDEAPIVEYGLPREPKWGHFRDLHDALKLC 272
Query: 335 EKTLTYG 341
+K L +G
Sbjct: 273 QKALLWG 279
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 47/113 (41%), Gaps = 24/113 (21%)
Query: 661 GNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHEN-------- 712
G+ + YH PR+ ++ N LV+ EE GG I TV T C A E+
Sbjct: 298 GSYVSMLYHTPRAILQPKNNFLVVLEEMGGKLDGIEILTVNRDTICSIAGEHYPPNVETW 357
Query: 713 ---------------KTMELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEA 749
L C + I+++ +AS+GDP G CG F G C A
Sbjct: 358 SRYKGVIRTNVDTPKPAANLVCLDNKTITQVDFASYGDPVGNCGHFILGKCNA 410
>gi|356544613|ref|XP_003540743.1| PREDICTED: beta-galactosidase 8-like [Glycine max]
Length = 288
Score = 207 bits (527), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 122/292 (41%), Positives = 158/292 (54%), Gaps = 45/292 (15%)
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAP 309
F S+G P R EDLAFAVARF+Q GGTFQNYYM+HGGTNFGRT+GGP+++TSYD+D P
Sbjct: 6 FVSFGDVVPHRPVEDLAFAVARFYQRGGTFQNYYMFHGGTNFGRTTGGPFISTSYDFDTP 65
Query: 310 IDEYGHLNQPKWGHLRELHKLLKSMEK-------TLTY--------------------GN 342
IDEYG + QPKW HL+ +HK +K EK T+TY N
Sbjct: 66 IDEYGIIRQPKWDHLKNVHKAIKLCEKALLATGPTITYLGPNIEAAVYNIGAVSAAFLAN 125
Query: 343 VTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQ 402
+ TD S +G+SY+LPAW VS LPDCK+ NTAK+N+ + + + L
Sbjct: 126 IAKTDAKVSFNGNSYHLPAWYVSTLPDCKSVVLNTAKINSASMISSFTTESLKEEVGSLD 185
Query: 403 -----WKWRPEMINDFVVRGKGH-FALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILS 455
W W E I K H F+ L++Q +T D SDYLWY ++ DL
Sbjct: 186 DSGSGWSWISEPIG----ISKAHSFSKFWLLEQINTTADRSDYLWYSSSIDLD------- 234
Query: 456 GSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQI 507
++ L I S G LHA+VNG S + + + P+ L GKN I
Sbjct: 235 AATETVLHIESLGHALHAFVNGKLAGSGTGNHEKVSVKVDIPITLVYGKNTI 286
>gi|443684013|gb|ELT88070.1| hypothetical protein CAPTEDRAFT_181391 [Capitella teleta]
Length = 655
Score = 205 bits (521), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 182/638 (28%), Positives = 286/638 (44%), Gaps = 118/638 (18%)
Query: 35 AITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFT 94
A ++G++ +LLSG++HY R P W D + K K GL+ +ETYV WNAHE +R +DF+
Sbjct: 10 AFFLNGKKTLLLSGAVHYFRVVPEYWRDRLLKVKAAGLNCVETYVAWNAHEAVRGTFDFS 69
Query: 95 GNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEM 154
G LDL RFI+ QD GLYV+LR GPY+C+EW++GG P WL + P + ++RT+ ++ +
Sbjct: 70 GILDLRRFIQIAQDVGLYVLLRPGPYICSEWDFGGLPSWLLHDPEM-KVRTSYPPYLEAV 128
Query: 155 QNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSD----------YGDAGKSYINW 204
+ I+ + ++ S+GGPII Q+ENEYG+ D + G + +
Sbjct: 129 DAYLAKILPLVNDLQM--SKGGPIIAVQLENEYGSYGDDLDYKLFLKNQFIKYGIEELLF 186
Query: 205 CAKMATSLDIG-VPWIMC----QESDAPSPMF----TPNNPNSPKIWTENWTGWFKSWGG 255
+ T + G +P ++ QE + MF P P + E W+GWF WG
Sbjct: 187 TSDNGTGIQNGPIPGVLATTNFQEQEQGYLMFEYLRNIKQPGLPMMVMEFWSGWFDHWGE 246
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------------PYL-- 300
+ + V ++ G+ N+YM+HGGTNFG +G PY
Sbjct: 247 QHNLCHHAEF-IDVFKWILLEGSSVNFYMFHGGTNFGFMAGANEDFGATNEGGGEPYAAD 305
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLP 360
TTSYDYD P+ E G LN+ E+ +L M+ L G+ G V +++
Sbjct: 306 TTSYDYDCPVSESGQLNE----KFYEIRNILSEMKTLLPPGS------GGLVKKHFFSII 355
Query: 361 AWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGH 420
+ S D K E + + P A + + EM++ G+GH
Sbjct: 356 KFFAS---DLKME-----RCLPLEKLSSLAPCIASKEAVAM------EMLDINNHGGQGH 401
Query: 421 FALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYV 480
LI + + +D L + T ++ VL VNG V
Sbjct: 402 ----GLILYRKQINSADKL-------------------HFTEIVHDRAIVL---VNGQQV 435
Query: 481 ---DSQWTKYGASNDLFERPVKLTRGKNQI-SLLSATVGLQNYG----SKFDMVPNGIPG 532
D + + + DL +G N + ++ +G NY + F+ G+
Sbjct: 436 DVFDHRSADHVTTLDL--------KGNNHVLEIVVENMGRVNYSDFQKNIFNEQRKGLTS 487
Query: 533 PVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMT 592
PVLL G+ + I L +W F ++E ++N NR +
Sbjct: 488 PVLLDGQVMQDWEITPL---EWK----------SDFLERVRQSNEWTPCTENAFQNRPVL 534
Query: 593 WYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
Y+++ + + L+G GKGF VNG+N+GRYW
Sbjct: 535 -YESSLVVDGDPRDTFVQLKGWGKGFVIVNGFNIGRYW 571
>gi|300122832|emb|CBK23839.2| unnamed protein product [Blastocystis hominis]
Length = 601
Score = 202 bits (514), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 186/665 (27%), Positives = 298/665 (44%), Gaps = 148/665 (22%)
Query: 115 LRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQ 174
+RIGPYVCAEW+ GG PVW++ + G+ LR N V+ EM ++ ++ D + FA +
Sbjct: 1 MRIGPYVCAEWDNGGIPVWVNYLDGV-RLRANNDVWKKEMGDWMKVLTDYTR--DFFADR 57
Query: 175 GGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFTPN 234
GGPII +QIENE +G A + YI+WC + A SL++ VPW+MC D
Sbjct: 58 GGPIIFSQIENEL------WGGA-REYIDWCGEFAESLELNVPWMMCN-GDTSEKTINAC 109
Query: 235 NPN------------------SPKIWTENWTGWFKSWGGKDPK---------RTAEDLAF 267
N N P WTEN GWF+ G + R+AED F
Sbjct: 110 NGNDCSSYLESHGQSGRILVDQPGCWTEN-EGWFQIHGAASAERDDYEGWDARSAEDYTF 168
Query: 268 AVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLREL 327
V +F GG++ NYYM+ GG ++G+ +G +T Y I N+PK H ++
Sbjct: 169 NVLKFMDRGGSYHNYYMWFGGNHYGKWAGNG-MTNWYTNGVMIHSDTLPNEPKHSHTAKM 227
Query: 328 HKLLKSMEKTL----------TYGNVTNTD-----YGNSV-------SGSS--------- 356
H++L ++ + L + N N + YG+ + GS+
Sbjct: 228 HRMLANIAEVLLNDKAQVNNQKHLNCDNCNAFEYRYGDRLVSFVENSKGSADKVIYRDIV 287
Query: 357 YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDF--- 413
Y LPAWS+ +L + F T NVK ++ + + L++++ E ++
Sbjct: 288 YELPAWSMIVLDEYDNVLFET------NNVKPVNKHRVYHCEEKLEFEYWNEPVSTLSQE 341
Query: 414 ----VVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQ 469
VV K + LN T D++++L+Y T + D+ LS +
Sbjct: 342 APRVVVSPKANEQLNM------TRDLTEFLYYETEVEFPQDECTLSIGG-------TDAN 388
Query: 470 VLHAYVNGNYV--DSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVP 527
AYV+ ++V D + T + + + +K +GK+++ LLS ++G+ N G ++ P
Sbjct: 389 AFVAYVDDHFVGSDDEHTHHDGWHTM-NINMKSGKGKHKLVLLSESLGVSN-GMDSNLDP 446
Query: 528 N-------GIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGW 580
+ GI G + L G D+ + +W + GL G + F + W
Sbjct: 447 SWASSRLKGICGWIKLCG--------NDIFNQEWKHYPGLVGEAKQVFTDEGMKTVT--W 496
Query: 581 SSKNVPLNRRMTWYKTTFEAPL---ENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEE 637
S +V + WY++TF+ P V+L +GM +G A+ NG+N+GRYW +
Sbjct: 497 KS-DVENADNLAWYRSTFKTPQGLKRGIEVLLRPEGMNRGQAYANGHNIGRYWMI----K 551
Query: 638 DGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIK--DGVNTLVLFEEFGGNPSQI 695
DG G Y +Q +YH+P+ W+K N LVL E G + +
Sbjct: 552 DG--------NGEY------------TQGFYHIPKDWLKGEGEENVLVLGETLGASDPSV 591
Query: 696 NFQTV 700
T
Sbjct: 592 TICTT 596
>gi|3021342|emb|CAA06310.1| beta-galactosidase [Cicer arietinum]
Length = 307
Score = 202 bits (513), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 109/275 (39%), Positives = 156/275 (56%), Gaps = 8/275 (2%)
Query: 424 NTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDS 482
N L++Q K T D SDYLWYMT+ ++ ++ + L S+G VLH +VNG + +
Sbjct: 36 NALLEQIKVTRDSSDYLWYMTDVNISPNEGFIKNGQYPVLTAMSAGHVLHVFVNGQFSGT 95
Query: 483 QWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGD 542
+ F VKL G N+ISLLS VGL N G ++ G+ GPV L G
Sbjct: 96 AYGGLENPKLTFSNSVKLRVGNNKISLLSVAVGLSNVGLHYETWNVGVLGPVTLKGL--- 152
Query: 543 ETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPL 602
+DLS KW+YK+GL G + + ++S + ++ + +TWYK TF+AP
Sbjct: 153 NEGTRDLSGQKWSYKIGLKG-ETLNLHTLIGSSSVQWTKGSSLVEKQPLTWYKATFDAPA 211
Query: 603 ENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGN 662
NDP+ L++ MGKG WVNG ++GR+WP Y+A S C+Y G + KC +CG
Sbjct: 212 GNDPLALDMSSMGKGEIWVNGESIGRHWPAYIAR---GSCGGCNYAGTFTDKKCRTSCGQ 268
Query: 663 PSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
P+Q WYH+PRSW+ N LV+ EE+GG+PS I+
Sbjct: 269 PTQKWYHIPRSWVNPRGNFLVVLEEWGGDPSGISL 303
>gi|284030079|ref|YP_003380010.1| beta-galactosidase [Kribbella flavida DSM 17836]
gi|283809372|gb|ADB31211.1| Beta-galactosidase [Kribbella flavida DSM 17836]
Length = 582
Score = 200 bits (508), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 118/319 (36%), Positives = 158/319 (49%), Gaps = 39/319 (12%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DGE +LSG++HY R P +W D I KA+ GL+ IETYV WNAH P R +D G L
Sbjct: 13 LDGEPFRILSGALHYFRVHPDLWADRIDKARRMGLNTIETYVPWNAHSPRRGVFDTDGML 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF++ + GLY I+R GPY+CAEW+ GG P WL PG+ +R F+ ++ +
Sbjct: 73 DLGRFLEQVAAAGLYAIVRPGPYICAEWDNGGLPAWLFQEPGV-GVRRYEPRFLAAVEQY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++D+ + L QGGP++L Q+ENEYG +D Y+ A M I VP
Sbjct: 132 LEQVLDLVR--PLQVDQGGPVLLLQVENEYGAFGND-----PEYLEAVAGMIRKAGITVP 184
Query: 218 WIMCQE-----------------------SDAPSPMFTPNNPNSPKIWTENWTGWFKSWG 254
+ + S + P P + E W GWF WG
Sbjct: 185 LVTVDQPTGEMLAAGGLDGVLRTGSFGSRSAERLATLREHQPTGPLMCMEFWDGWFDHWG 244
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGP------YLTTSYDYDA 308
G + ED A + G + N YM+HGGTNFG TSG TSYDYDA
Sbjct: 245 GPHHTTSVEDAARELDALLAAGASV-NIYMFHGGTNFGLTSGADDKGVFRPTVTSYDYDA 303
Query: 309 PIDEYGHLNQPKWGHLREL 327
P+DE G K+ RE+
Sbjct: 304 PLDEAGRPTA-KYHAFREV 321
>gi|343963202|gb|AEM72517.1| beta-galactosidase [Diospyros kaki]
Length = 172
Score = 199 bits (505), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 99/173 (57%), Positives = 114/173 (65%), Gaps = 12/173 (6%)
Query: 128 GGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEY 187
GGFPVWL +PGI RT N+ F N MQ FT IV++ K E LF SQGGPIIL+QIENEY
Sbjct: 1 GGFPVWLKYVPGIS-FRTDNEPFKNAMQGFTEKIVNLMKSENLFESQGGPIILSQIENEY 59
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNP 236
G GDAG Y+ W A MA L GVPW+MC+E DAP P+ F+PN P
Sbjct: 60 GPQGKILGDAGHKYVTWAANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDSFSPNRP 119
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGT 289
P IWTE W+GWF +GG +R +DLAFAVARF Q GG+F NYYMYHGGT
Sbjct: 120 YKPTIWTEAWSGWFTEFGGPIHERPVQDLAFAVARFIQKGGSFFNYYMYHGGT 172
>gi|326331074|ref|ZP_08197372.1| beta-galactosidase [Nocardioidaceae bacterium Broad-1]
gi|325951115|gb|EGD43157.1| beta-galactosidase [Nocardioidaceae bacterium Broad-1]
Length = 586
Score = 198 bits (504), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 116/307 (37%), Positives = 164/307 (53%), Gaps = 38/307 (12%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DGE +LSG++HY R P W D I+KA+ GL+ IETYV WNAH P +D G L
Sbjct: 13 LDGEPFRILSGALHYFRVHPDQWADRIEKARLMGLNTIETYVPWNAHSPRPGVFDTDGIL 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF++ ++D G+Y I+R GP++CAEW+ GG P WL PG+ +R F++E++ +
Sbjct: 73 DLPRFLRLVKDAGMYAIVRPGPFICAEWDNGGLPPWLFREPGV-GIRRHEPRFLDEVEKY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + + ++ GGP++L Q+ENEYG YGD + Y+ A M I VP
Sbjct: 132 LHQVLALVRPHQV--DLGGPVLLVQVENEYGA----YGD-DRDYLQAVADMIRGAGIDVP 184
Query: 218 WIMCQE---------------------SDAPSPMFT--PNNPNSPKIWTENWTGWFKSWG 254
+ + SD+ + + T + P P + E W GWF WG
Sbjct: 185 LVTVDQPVDAMLAAGGLDGVLRTSSFGSDSANRLRTLRDHQPTGPLMCMEFWDGWFDHWG 244
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG----GPY--LTTSYDYDA 308
G+ E A + G + N YM+HGGTNFG TSG G Y TSYDYDA
Sbjct: 245 GRHHTTPVEQAAEELDALLAAGASV-NVYMFHGGTNFGLTSGANDKGIYRPTVTSYDYDA 303
Query: 309 PIDEYGH 315
P+DE G+
Sbjct: 304 PLDEAGN 310
>gi|229084352|ref|ZP_04216632.1| Beta-galactosidase [Bacillus cereus Rock3-44]
gi|228698892|gb|EEL51597.1| Beta-galactosidase [Bacillus cereus Rock3-44]
Length = 867
Score = 197 bits (502), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 116/339 (34%), Positives = 179/339 (52%), Gaps = 28/339 (8%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
+++D ++ I +R +LS +IHY R W D+++KAK GG + IETY+ WN HE
Sbjct: 2 ITYDKKSWKIHNKRIFILSAAIHYFRLPKAEWDDVLEKAKAGGCNTIETYIPWNFHEMKE 61
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
++DF+G+ DL F++ ++GLYVI R GPY+CAEW++GGFP WL I + R+
Sbjct: 62 GEWDFSGDKDLAHFLQLCANKGLYVIARPGPYICAEWDFGGFPWWLSTKKDI-QYRSAQP 120
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F++ + + ++ + + +L ++ G +I+ QIENE+ YG K Y+ +
Sbjct: 121 SFLHYVDQYFDQVISIIDEYQL--TKNGSVIMVQIENEF----QAYGKPDKKYMEYLRDG 174
Query: 209 ATSLDIGVPWIMC--------------QESDAPSPMFTPNNPNSPKIWTENWTGWFKSWG 254
+ I VP++ C ++ + + + PK E W GWF+ WG
Sbjct: 175 MIARGIEVPFVTCYGAVDGAVEFRNFWSGANRAAEILDERFADQPKGVMEFWIGWFEHWG 234
Query: 255 G-KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNF----GRT-SGGPYLTTSYDYDA 308
G K ++T E L + + G T NYYMY GGTNF GRT S + TT+YDYD
Sbjct: 235 GNKANQKTPEQLERECYQLLRNGFTTINYYMYFGGTNFDHWGGRTVSEQVFCTTTYDYDV 294
Query: 309 PIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTD 347
IDEY + K+ L+ H +K +E T N+D
Sbjct: 295 AIDEYLQPTR-KYEVLKRYHLFVKWLEPLFTNAEQANSD 332
>gi|323358527|ref|YP_004224923.1| beta-galactosidase [Microbacterium testaceum StLB037]
gi|323274898|dbj|BAJ75043.1| beta-galactosidase [Microbacterium testaceum StLB037]
Length = 574
Score = 197 bits (502), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 119/312 (38%), Positives = 163/312 (52%), Gaps = 48/312 (15%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG ++SG++HY R P W D I+ AK GL+ IETYV WNAHEP+R ++D TG
Sbjct: 13 LDGRPHQVISGTLHYFRIHPEHWADRIRTAKAMGLNTIETYVAWNAHEPVRGEWDATGWN 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+ I +GL+ I+R GPY+CAEW+ GG PVWL + PGI +R + F+ + +
Sbjct: 73 DLGRFLDLIAAEGLHAIVRPGPYICAEWHNGGLPVWLTSTPGI-GIRRSEPQFVEAVSEY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMA-------- 209
+ ++ ++ +GG ++L QIENEYG SD K Y+ ++
Sbjct: 132 LRRVYEIVAPRQI--DRGGNVVLVQIENEYGAYGSD-----KEYLRELVRVTKDAGITVP 184
Query: 210 -TSLDIGVPWIMCQESDAPSPMFT---------------PNNPNSPKIWTENWTGWFKSW 253
T++D +PW M + P T + P P + +E W GWF W
Sbjct: 185 LTTVDQPMPW-MLEAGSLPELHLTGSFGSRSAERLATLREHQPTGPLMCSEFWDGWFDWW 243
Query: 254 GG----KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG------PYLTTS 303
G DP +A DL +A G N YM HGGTNFG T+G + TS
Sbjct: 244 GSIHHTTDPAASAHDLDVLLA-----AGASVNIYMVHGGTNFGTTNGANDKGRFDPIVTS 298
Query: 304 YDYDAPIDEYGH 315
YDYDAPIDE GH
Sbjct: 299 YDYDAPIDESGH 310
>gi|340370414|ref|XP_003383741.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Amphimedon queenslandica]
Length = 689
Score = 197 bits (502), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 126/337 (37%), Positives = 176/337 (52%), Gaps = 45/337 (13%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
+S D + I G++ +LSGSIHY R P W D +KK K GL+ ++TYV WN HEP+
Sbjct: 71 LSLDEDSFYIRGKKTHILSGSIHYFRVVPDYWTDRLKKLKAMGLNTVDTYVSWNLHEPMP 130
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
++DF+G L++ FIK L VI+R GPY+C+EW+ GG P WL + P + ++R+ K
Sbjct: 131 GEFDFSGLLNIHEFIKIAHSLELNVIVRPGPYICSEWDNGGLPAWLLHDPNM-KIRSNYK 189
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGD---AGKSYINWC 205
+ + ++ F T + ++ L +S GGPII Q+ENEY + YG G+ ++ +
Sbjct: 190 PYQDAVKRFFTKLFEILT--PLQSSYGGPIIAFQVENEY----AAYGPRNATGRHHMQYL 243
Query: 206 AKMATSLDIGVPWIMC--QESDAPSPMFTPNN----------------------PNSPKI 241
A + SL +I Q S PNN PN P +
Sbjct: 244 ANLMRSLGAVELFITSDGQNDIKASSDMAPNNALLTVNFQNDPSEALNKLLLVQPNKPPL 303
Query: 242 WTENWTGWFKSWGGKDPKRT--AEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-----RT 294
E WTGWF WG + +RT L + Q GG+F N YM+HGGTNFG
Sbjct: 304 VMEYWTGWFDHWGRRHLERTLSPSQLIVNIGTILQMGGSF-NLYMFHGGTNFGFMNGANI 362
Query: 295 SGGPYL--TTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
GG Y TSYDYDAP+ E G + + K+ LREL K
Sbjct: 363 EGGEYRPDVTSYDYDAPLSEAGDITK-KYTLLRELLK 398
>gi|410456453|ref|ZP_11310314.1| beta-galactosidase [Bacillus bataviensis LMG 21833]
gi|409928122|gb|EKN65245.1| beta-galactosidase [Bacillus bataviensis LMG 21833]
Length = 867
Score = 197 bits (501), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 135/442 (30%), Positives = 207/442 (46%), Gaps = 49/442 (11%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
+++D ++ I ER +LS +IHY R W +++ KAK GG + IETY+ WN HE
Sbjct: 2 ITYDKKSWKIHNERVFILSAAIHYFRLPRAEWNEVLDKAKAGGCNTIETYIPWNFHEMNE 61
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
++DF+G+ DL F + D+ LYVI R GPY+CAEW++GGFP WL I + R+
Sbjct: 62 GEWDFSGDKDLAHFFQLCADKELYVIARPGPYICAEWDFGGFPWWLSTKKDI-QYRSAQP 120
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F++ + + ++ + + +L ++ G +I+ Q+ENE+ YG K Y+ +
Sbjct: 121 AFLHYVDQYFDRVIPIIDEYQL--TKNGTVIMVQVENEF----QAYGKPDKPYMEYIRDG 174
Query: 209 ATSLDIGVPWIMC--------------QESDAPSPMFTPNNPNSPKIWTENWTGWFKSWG 254
+ I VP + C S + + P+ PK E W GWF+ WG
Sbjct: 175 MKARGIDVPLVTCYGAVEGAVEFRNFWSHSKHAAAILDERFPDQPKGVMEFWIGWFEQWG 234
Query: 255 G-KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNF----GRTSGGPYL-TTSYDYDA 308
G K ++T E L + G T NYYMY GGTNF GRT G L TT+YDYD
Sbjct: 235 GNKADQKTPEQLERECYQLLSNGFTAINYYMYFGGTNFDHWGGRTVGEQTLCTTTYDYDV 294
Query: 309 PIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILP 368
IDEY + K+ L+ H +K +E T +D LP+
Sbjct: 295 AIDEYLQPTR-KYEVLKRYHSFVKWLEPLFTDAEKVASD---------MKLPS------- 337
Query: 369 DCKTEEFNTAK-----VNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFAL 423
D K+E + + N +++ + G DQ + + V+ HF +
Sbjct: 338 DLKSERIASPYGEVIFIENNRNERIQSHVKHGYDQILFTIEANTVLPIVRNVKVGNHFTI 397
Query: 424 NTLIDQKSTNDVSDYLWYMTNA 445
TL Q + D ++ + Y N
Sbjct: 398 KTLTGQTTGFDSNEAVIYHENG 419
Score = 39.3 bits (90), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 42/166 (25%), Positives = 65/166 (39%), Gaps = 40/166 (24%)
Query: 539 RAGDETIIKDLSSHKWTYKVGLYGLDDK------KFYNAKAANSERGWSSKNVPLNRRM- 591
+ G+ + D+ + + LY DK K + + E+ W + N + +
Sbjct: 708 KQGENVLDLDVQNISSIRRFDLYLFHDKEQIFDWKTKSFAELHEEKDWKTANCGDQQTIY 767
Query: 592 -TWYKTTFEAPLENDPVV-LNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRG 649
WYK+ F +N +V + L + KG WVNG LGRYW G
Sbjct: 768 PRWYKSHFTWNPDNGSIVKVRLNHLSKGCFWVNGECLGRYWNI----------------G 811
Query: 650 PYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQI 695
P Q Y +P S +KD N +V+F+E G P +
Sbjct: 812 P--------------QEDYKIPVSLLKDQ-NEIVIFDEEGYAPDDV 842
>gi|356554933|ref|XP_003545795.1| PREDICTED: beta-galactosidase 15-like [Glycine max]
Length = 288
Score = 197 bits (500), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 92/173 (53%), Positives = 112/173 (64%), Gaps = 11/173 (6%)
Query: 178 IILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM------- 230
++L + G + ++YG GK Y W AK A SL +GVPW+MC++ DAP +
Sbjct: 32 LVLGTVSLGVGAIENEYGKGGKEYRKWAAKKALSLGVGVPWVMCRQQDAPYDIIDTCNAY 91
Query: 231 ----FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYH 286
F PN+ N P +WTENW GW+ WG + P R EDLAFAVA FFQ GG+FQNYYMY
Sbjct: 92 YCDGFKPNSHNKPTMWTENWDGWYTQWGERLPHRPVEDLAFAVACFFQRGGSFQNYYMYF 151
Query: 287 GGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLT 339
G TNFGRT+GGP TSYDY A IDEYG L +PKWGHL++LH LK E L
Sbjct: 152 GRTNFGRTAGGPLQITSYDYVASIDEYGQLREPKWGHLKDLHAALKLCEPALV 204
>gi|84494646|ref|ZP_00993765.1| beta-galactosidase [Janibacter sp. HTCC2649]
gi|84384139|gb|EAQ00019.1| beta-galactosidase [Janibacter sp. HTCC2649]
Length = 592
Score = 196 bits (499), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 120/316 (37%), Positives = 163/316 (51%), Gaps = 46/316 (14%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+LSG+IHY R P +W D +++ GL+ +ETYV WN HE +R + DFTG DL RFI
Sbjct: 26 VLSGAIHYFRIHPDLWEDRLRRLAAMGLNTVETYVAWNFHERVRGEIDFTGPRDLARFIS 85
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
D GL VI+R GPY+CAEW++GG P WL PGI LRT++ F+ + ++ +V +
Sbjct: 86 LAGDLGLDVIVRPGPYICAEWDFGGLPAWLMTEPGI-ALRTSDPAFLAAVDDWFDAVVPV 144
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQES 224
+ L + GGP++ Q+ENEYG+ YGD +Y+ C K LD G+ ++ S
Sbjct: 145 IR--PLLTTAGGPVVAVQVENEYGS----YGD-DAAYLEHCRK--GLLDRGID-VLLFTS 194
Query: 225 DAPSPMFTPN--------------------------NPNSPKIWTENWTGWFKSWGGKDP 258
D P P + N P P + E W GWF WG
Sbjct: 195 DGPGPDWLDNGTIPGVLATVNFGSRTDEAFAELRKVQPAGPDMVMEYWNGWFDHWGEPHH 254
Query: 259 KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY-------LTTSYDYDAPID 311
R +D A + + GG+ N+YM HGGTNFG SG TSYDYDA +
Sbjct: 255 VRDVDDAAGVLDDVLRAGGSV-NFYMAHGGTNFGLWSGANVEDGKLQPTVTSYDYDAAVG 313
Query: 312 EYGHLNQPKWGHLREL 327
E G L PK+ RE+
Sbjct: 314 EAGELT-PKFHAFREV 328
>gi|183604893|gb|ACC64533.1| beta-galactosidase 11 [Oryza sativa Indica Group]
Length = 446
Score = 195 bits (496), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 142/469 (30%), Positives = 218/469 (46%), Gaps = 66/469 (14%)
Query: 354 GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQ-WKWRPEMIND 412
G + +P+ SVSIL DCKT +NT +V Q + +R ++ + W+ E I
Sbjct: 9 GEKFYVPSRSVSILADCKTVVYNTKRVFVQHS---ERSFHTTDETSKNNVWEMYSEAIPK 65
Query: 413 FVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLH 472
F R L T D SDYLWY T+ L+ DD ++I S+ +
Sbjct: 66 F--RKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDIRPVIQIKSTAHAMI 123
Query: 473 AYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPG 532
+ N +V + + +FE+P+ L G N I++LS+++G+++ G + V GI
Sbjct: 124 GFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKDSGGELVEVKGGIQD 183
Query: 533 PVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGW--SSKNVPLNRR 590
V+ G T DL + W +K L G +DK+ Y K ++ W + ++P+
Sbjct: 184 CVV----QGLNTGTLDLQGNGWGHKARLEG-EDKEIYTEKGM-AQFQWKPAENDLPI--- 234
Query: 591 MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGP 650
TWYK F+ P +DP+V+++ M KG +VNG +GRYW +++
Sbjct: 235 -TWYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWTSFITL-------------- 279
Query: 651 YGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAH 710
G+PSQ YH+PR+++K N L++FEE G P I QTV C
Sbjct: 280 ---------AGHPSQSVYHIPRAFLKPKGNLLIIFEEELGKPGGILIQTVRRDDICVFIS 330
Query: 711 EN-----KTME------------------LTCHGRR-ISEIKYASFGDPQGACGAFKKGS 746
E+ KT E L C +R I E+ +ASFG+P+GACG F G+
Sbjct: 331 EHNPAQIKTWESDGGQIKLIAEDTSTRGTLNCPPKRTIQEVVFASFGNPEGACGNFTAGT 390
Query: 747 CEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
C D ++EK+C+GK+SC + GA T L V+ C
Sbjct: 391 CHTP-DAKAIVEKECLGKESCVLPVVNTVYGADINCPATTATLAVQVRC 438
>gi|336319932|ref|YP_004599900.1| Beta-galactosidase [[Cellvibrio] gilvus ATCC 13127]
gi|336103513|gb|AEI11332.1| Beta-galactosidase [[Cellvibrio] gilvus ATCC 13127]
Length = 586
Score = 195 bits (496), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 118/306 (38%), Positives = 160/306 (52%), Gaps = 38/306 (12%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DGE +LSG++HY R P +W D I+KA+ GL+ IETYV WNAH P R +D TGNL
Sbjct: 13 LDGEPLQILSGALHYFRVHPDLWADRIRKARLMGLNTIETYVAWNAHAPERGVFDLTGNL 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+ + +GL+ I+R GPY+CAEW+ GG P WL PG+ +RT ++ + +
Sbjct: 73 DLGRFLDLVAAEGLHAIVRPGPYICAEWDNGGLPAWLMATPGV-GVRTAEPQYLEAIAGY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
I+ + ++ ++GGP+++ Q+ENEYG YGD Y+ M I VP
Sbjct: 132 YDEILAVVAPRQV--TRGGPVLMVQVENEYGA----YGD-DADYLRALVTMMRERGIEVP 184
Query: 218 WIMCQE---------------------SDAPSPMFT--PNNPNSPKIWTENWTGWFKSWG 254
C + S +P + T + P P + E W GWF SWG
Sbjct: 185 LTTCDQANDEMLGRGGLPELHKTATFGSRSPERLETLRRHQPTGPLMCMEYWDGWFDSWG 244
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG----GPYL--TTSYDYDA 308
+ T A A G N YM+HGGTN G T+G G YL TTSYDYDA
Sbjct: 245 EQH-HTTDAAEAAADLDLLLSQGASANLYMFHGGTNLGFTNGANDKGTYLPITTSYDYDA 303
Query: 309 PIDEYG 314
P+ E G
Sbjct: 304 PLAEDG 309
>gi|260813304|ref|XP_002601358.1| hypothetical protein BRAFLDRAFT_114709 [Branchiostoma floridae]
gi|229286653|gb|EEN57370.1| hypothetical protein BRAFLDRAFT_114709 [Branchiostoma floridae]
Length = 638
Score = 194 bits (493), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 119/324 (36%), Positives = 172/324 (53%), Gaps = 29/324 (8%)
Query: 32 DGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQY 91
+G T+DG+ +LSG+IHY R W D + K K GL+ +ETYV WN HEP + ++
Sbjct: 14 EGENFTLDGKPVQILSGAIHYFRVPREYWRDRMLKLKACGLNTLETYVCWNLHEPEKGKF 73
Query: 92 DFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFM 151
DFTG LD+ +++ + GL+VI R GPY+CAEW+YGG P WL P + ++RTT + +M
Sbjct: 74 DFTGMLDIAAYLREAANLGLWVIFRPGPYICAEWDYGGLPSWLLRDPNM-QVRTTYQPYM 132
Query: 152 NEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSD--YGDAGKSYIN--WCAK 207
++ F ++ + K + +GGPII Q+ENEYG+ D Y A K I +
Sbjct: 133 EAVERFFDALLPIVKPFQY--KEGGPIIAMQVENEYGSYARDDKYLTAVKQAIQKRGIEE 190
Query: 208 MATSLDIG---------VPWIMCQESDAPSP-----MFTPNNPNSPKIWTENWTGWFKSW 253
+ + D G +P ++ + +P PN P++ E W+GWF W
Sbjct: 191 LLLTSDGGQIERLERGCIPGVLMTANFNFNPKKQLGALKKLQPNRPQMVMEFWSGWFDHW 250
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYDYD 307
G K E + +F + N+YM+HGGTNFG +G Y+ TSYDYD
Sbjct: 251 GRDHHKLHVEKFEQLLGDILRFPSSV-NFYMFHGGTNFGFMNGANYINGYKPDVTSYDYD 309
Query: 308 APIDEYGHLNQPKWGHLRELHKLL 331
AP+ E G PK+ REL K L
Sbjct: 310 APLSEAGD-PTPKYYKTRELLKTL 332
>gi|384108880|ref|ZP_10009768.1| Beta-galactosidase [Treponema sp. JC4]
gi|383869584|gb|EID85195.1| Beta-galactosidase [Treponema sp. JC4]
Length = 592
Score = 192 bits (489), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 181/699 (25%), Positives = 278/699 (39%), Gaps = 164/699 (23%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ ++SGSIHY R P W D ++K K G + +ETY+ WN EP + ++ F G
Sbjct: 12 LDGKPFQIISGSIHYFRVVPEYWQDRLEKLKNMGCNTVETYIPWNITEPRKGEFCFDGLC 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D +F+ Q GLY I+R PY+CAEW GG P W+ +PG+E R N+ + ++++
Sbjct: 72 DFEKFLDLAQKLGLYAIVRPSPYICAEWELGGLPSWIFTVPGLEP-RCKNEPYYQNVRDY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+++ ++ +GG IIL QIENEYG D SY+++ + I VP
Sbjct: 131 YKVLLPRLVNHQI--DKGGNIILMQIENEYGYYGKDM-----SYMHFLEGLMREGGITVP 183
Query: 218 WIMCQ----------ESDAPSP-----------------MFTPNNPNSPKIWTENWTGWF 250
++ + D P M P + E W GWF
Sbjct: 184 FVTSDGPWGKMFIHGQCDGALPTGNFGSHARPLFANMKRMMKKTGNRGPLMCMEFWIGWF 243
Query: 251 KSWGGKDPK-----RTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT---- 301
+WG K+ K R +DL + + + G N+YM+HGGTNFG +G Y T
Sbjct: 244 DAWGNKEHKTSKLKRNIKDLNYMLKK----GNV--NFYMFHGGTNFGFMNGSNYFTKLTP 297
Query: 302 --TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNL 359
TSYDYDAP+ E G + + K+ + + K + E+ + YG +G
Sbjct: 298 DTTSYDYDAPLSEDGKITE-KYRTFQSIIKKYRDFEEMPLSTKIEQKAYGKVKAGK---- 352
Query: 360 PAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKG 419
S+ + T A T + K+ +G D
Sbjct: 353 ---SIKLFDILDT----LAVAKTSSVEKLTGMEASGQDYG-------------------- 385
Query: 420 HFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNY 479
Y+ Y T + +SN TL+I +H + NG
Sbjct: 386 ------------------YILYKTK---------VPAASN-TLKIEDGLDRIHEFKNGEL 417
Query: 480 VDSQWTKYGASNDLFERPVKLTRGK-NQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVG 538
+ K A +PV+LT ++++LL +G N+ +K GI G VL
Sbjct: 418 KAVLFDKETA------KPVELTLASGDELTLLVENLGRVNFATKIPFQRKGILGRVL--- 468
Query: 539 RAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTF 598
DE + D WTY LD + +E G + + T
Sbjct: 469 --ADEKPLTD-----WTYYN--LNLDKAQLSKIDWNKAEEGIAGTGKITSPSFTHMTLMV 519
Query: 599 EAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAY 658
+ + L+ G GKG ++NG+NLGR+W GP
Sbjct: 520 DKACD---TYLDFTGWGKGCIFLNGFNLGRFWEI----------------GP-------- 552
Query: 659 NCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
Q +VP +K+G N +++FE G I F
Sbjct: 553 ------QKRLYVPAPLLKEGENEIIIFETEGKTADSIEF 585
>gi|260804659|ref|XP_002597205.1| hypothetical protein BRAFLDRAFT_203307 [Branchiostoma floridae]
gi|229282468|gb|EEN53217.1| hypothetical protein BRAFLDRAFT_203307 [Branchiostoma floridae]
Length = 608
Score = 192 bits (489), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 125/331 (37%), Positives = 172/331 (51%), Gaps = 44/331 (13%)
Query: 32 DGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQY 91
DG TIDG+ LLSG++HY R P W D + K K GL+ +ETYV WN HEP + Y
Sbjct: 26 DGANFTIDGKPVRLLSGAMHYFRVVPEYWRDRMLKMKAAGLNTLETYVPWNLHEPEKYTY 85
Query: 92 DFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFM 151
+F G LDL R++ + GL+VILR GPY+CAEW +GG P WL + E +RTT +F+
Sbjct: 86 NFEGILDLGRYLDIAHEVGLWVILRPGPYICAEWEFGGIPGWLAYVK--EHVRTTRPMFI 143
Query: 152 NEMQN-FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMAT 210
+ ++ F L+ ++ ++ + GGPII QIENEYG + Y+ K+
Sbjct: 144 DPVEVWFGRLLAEVVPRQ---YTNGGPIIAVQIENEYGGFSNS-----TEYMERLKKILE 195
Query: 211 SLDI----------------GVPWIMCQ---ESDAPSPM--FTPNNPNSPKIWTENWTGW 249
S I G+P ++ +++A + P+ P + E WTGW
Sbjct: 196 SRGIVELLFTSDGKGALISGGIPGVLKTVNFQNNASDKLQKLKEIQPDRPMMVMEYWTGW 255
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQF-GGTFQNYYMYHGGTNFG--------RTSGGPYL 300
F WG E +F + F+ G N+YM+HGGTNFG SGG L
Sbjct: 256 FDHWGEDHHLYRLESESFVHSVFYILDAGASVNFYMFHGGTNFGFMNGANTRYKSGGRTL 315
Query: 301 --TTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
TSYDYDAPI E G L PK+ +RE+ K
Sbjct: 316 PTITSYDYDAPISETGDLT-PKYFKIREILK 345
>gi|62321607|dbj|BAD95183.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 275
Score = 192 bits (489), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 110/271 (40%), Positives = 149/271 (54%), Gaps = 31/271 (11%)
Query: 548 DLSSHKWTYKVGLYGLDDKKFYNAKAANSERGW--SSKNVPLNRRMTWYKTTFEAPLEND 605
DLS KWTY+VGL G + + GW +S V + +TW+KT F+AP N+
Sbjct: 2 DLSWQKWTYQVGLKGEAMNLAFPTNTPSI--GWMDASLTVQKPQPLTWHKTYFDAPEGNE 59
Query: 606 PVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQ 665
P+ L+++GMGKG WVNG ++GRYW T A D CS C Y G Y +KC CG P+Q
Sbjct: 60 PLALDMEGMGKGQIWVNGESIGRYW-TAFATGD-CS--HCSYTGTYKPNKCQTGCGQPTQ 115
Query: 666 IWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTAC------------------- 706
WYHVPR+W+K N LV+FEE GGNPS ++ V C
Sbjct: 116 RWYHVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSGVCAEVSEYHPNIKNWQIESYG 175
Query: 707 -GQAHENKTMELTCH-GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGK 764
GQ + L C G+ I+ IK+ASFG P G CG++++G C A ++E++CVGK
Sbjct: 176 KGQTFHRPKVHLKCSPGQAIASIKFASFGTPLGTCGSYQQGECHAATSYA-ILERKCVGK 234
Query: 765 KSCSIEASEANLGATSCAAGTVKRLVVEALC 795
C++ S +N G C +KRL VEA+C
Sbjct: 235 ARCAVTISNSNFGKDPC-PNVLKRLTVEAVC 264
>gi|253755017|ref|YP_003028157.1| beta-galactosidase [Streptococcus suis BM407]
gi|251817481|emb|CAZ55222.1| putative beta-galactosidase precursor [Streptococcus suis BM407]
Length = 590
Score = 192 bits (488), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 185/704 (26%), Positives = 293/704 (41%), Gaps = 168/704 (23%)
Query: 33 GRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYD 92
G +DGE +LSG+IHY R P W + K G + +ETYV WN HEP + ++
Sbjct: 7 GDQFYLDGEPFKILSGAIHYFRVHPDDWYHSLYNLKALGFNTVETYVPWNMHEPRKGEFC 66
Query: 93 FTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMN 152
+ G LD+ RF+K Q+ GLY I+R PY+CAEW +GG P WL M +R+++ V++
Sbjct: 67 YEGILDIERFLKLAQELGLYAIVRPSPYICAEWEWGGLPAWL--MKEELRVRSSDSVYLQ 124
Query: 153 EMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSL 212
+ + ++ K KL +QGG +++ Q+ENEYG+ YG+ K+Y+ A +
Sbjct: 125 HLDEYYVSLI--PKLAKLQLAQGGNVLMFQVENEYGS----YGEE-KAYLRAVAGLMRKH 177
Query: 213 DIGVPWIMCQ-------------ESDA----------------PSPMFTPNNPNSPKIWT 243
+ P E D + F + N P +
Sbjct: 178 GLTAPLFTSDGSWRATLRAGTLIEDDVFVTGNFGSKARENFANMTAFFNEHQKNWPLMCM 237
Query: 244 ENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL--- 300
E W GWF WG + +R E++ +V + G N YM+HGGTNFG +G
Sbjct: 238 EFWDGWFNRWGDEIIRREPEEMVDSVMECIELGSL--NLYMFHGGTNFGFMNGCSARGQI 295
Query: 301 ----TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSS 356
TSYDYDA +DE G N K ++ L + LK + L Y
Sbjct: 296 DLPQVTSYDYDAILDEAG--NPTKKFYI--LQQRLKEVYPELEYAE-------------- 337
Query: 357 YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVR 416
++ + K F+ ++ + ++ N ++D V
Sbjct: 338 --------PLVKEAKA--FSDVSLHDKVSLSATLEN-----------------VSDCV-- 368
Query: 417 GKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVN 476
KG + N +ST Y+ Y T +L+ D + R+ + + Y +
Sbjct: 369 -KGFYPKNMEELDQSTG----YILYRT--ELERDK-----TEAERFRVVDARDRIQIYAD 416
Query: 477 GNYVDSQW-TKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVL 535
G +V +Q+ T+ G +L + KLT + +L +G NYG K P G
Sbjct: 417 GKFVATQYQTEIGDDVELDFKDDKLT-----LDILVENMGRVNYGHKL-TAPTQSKG--- 467
Query: 536 LVGRAGDETIIKDLS--SHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTW 593
+GR + DL H TY + L ++D F +GW +
Sbjct: 468 -IGRGA----MADLHFIGHWETYPLHLESVEDLDF--------SKGWEEGQA------AF 508
Query: 594 YKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGS 653
Y+ FE E L++ G GKG +VN N+GR+W +GP
Sbjct: 509 YRYQFELD-ELADTYLDMTGFGKGVVFVNNVNIGRFWE----------------KGPI-- 549
Query: 654 DKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
++ ++P+ ++K G N +V+FE G +I+F
Sbjct: 550 ------------LYLYIPKGYLKKGANEIVVFETEGKYREKIHF 581
>gi|334134215|ref|ZP_08507725.1| putative beta-galactosidase [Paenibacillus sp. HGF7]
gi|333608023|gb|EGL19327.1| putative beta-galactosidase [Paenibacillus sp. HGF7]
Length = 940
Score = 192 bits (487), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 122/371 (32%), Positives = 185/371 (49%), Gaps = 30/371 (8%)
Query: 28 RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPL 87
RV +D + IDG R +LS ++HY R W +++ K+KE G + IETYV WN HE
Sbjct: 5 RVQYDRNSWIIDGRRVFILSAAVHYFRLPRAEWAEVLDKSKEAGCNCIETYVPWNWHEEE 64
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
Q+DF+G+ DL F+ ++GLYVI+R GPY+CAEW+ GG P WL P + + R +
Sbjct: 65 EGQWDFSGDKDLGAFLDLCAERGLYVIVRPGPYICAEWDMGGLPYWLERKPDM-QYRKFH 123
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
+ F++ + + +V + L S G +I+ Q+ENE+ + G K+Y+ +
Sbjct: 124 REFLHYVDLYWDRLVPVVLPRLL--SNSGTVIMVQVENEFQAL----GKPDKAYMEYLRD 177
Query: 208 MATSLDIGVPWIMC--------------QESDAPSPMFTPNNPNSPKIWTENWTGWFKSW 253
I VP + C ++ + + PK E W GWF+ W
Sbjct: 178 GLIERGIDVPLVTCYGAVDGAVEFRNFWSHAEEHARTLEERFADQPKGVLEFWIGWFEQW 237
Query: 254 GG-KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNF----GRTSG-GPYLTTSYDYD 307
GG + ++TA + + G T NYYM+ GGTNF GRT G ++TTSYDYD
Sbjct: 238 GGPRANQKTASQVERKTYELIREGFTAINYYMFFGGTNFGHWGGRTIGEHTFMTTSYDYD 297
Query: 308 APIDEYGHLNQPKWGHLRELHKLLKSMEKTL--TYGNVTNTDYGNSVSGSSYNLPAWSVS 365
A +DEY K+ L+ +H ++ ME L T G+ G S + P ++
Sbjct: 298 AALDEYLRPTA-KYKALKLVHDFVRWMEPLLTETTGSTAFIPLGKHSSAKKKSGPQGTIL 356
Query: 366 ILPDCKTEEFN 376
+ + TE N
Sbjct: 357 FIHNDDTERLN 367
>gi|255550379|ref|XP_002516240.1| beta-galactosidase, putative [Ricinus communis]
gi|223544726|gb|EEF46242.1| beta-galactosidase, putative [Ricinus communis]
Length = 216
Score = 191 bits (485), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 101/178 (56%), Positives = 118/178 (66%), Gaps = 30/178 (16%)
Query: 197 AGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTEN 245
AGK+Y++WC+ MA SLDIGVPWI+CQ+ DAP PM FTPN NSPK WTEN
Sbjct: 56 AGKAYLDWCSDMAESLDIGVPWIICQQRDAPQPMINTCYGWYCDQFTPNTANSPKKWTEN 115
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY-LTTSY 304
WTGWFKSWG KDP RTAE +AFAVARFFQ FQN YMYHGGTNFGRT+GGPY TTS+
Sbjct: 116 WTGWFKSWGDKDPHRTAEGVAFAVARFFQ----FQNCYMYHGGTNFGRTAGGPYSTTTSH 171
Query: 305 DYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNT-DYGNSVSGSSYNLPA 361
DYDAP+DE+ +H K E + +GN+ T D G+ Y +PA
Sbjct: 172 DYDAPLDEH-----------VTIHATEK--ESSCFFGNINETSDAVIEFRGAKYKIPA 216
>gi|256831356|ref|YP_003160083.1| beta-galactosidase [Jonesia denitrificans DSM 20603]
gi|256684887|gb|ACV07780.1| Beta-galactosidase [Jonesia denitrificans DSM 20603]
Length = 584
Score = 191 bits (485), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 115/323 (35%), Positives = 164/323 (50%), Gaps = 39/323 (12%)
Query: 34 RAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDF 93
R T+DGE ++SG+IHY R P W D I+KA+ GL+ IETYV WN H P R ++
Sbjct: 9 RDFTLDGEPFQIISGAIHYFRVHPDSWRDRIRKARLMGLNTIETYVAWNFHAPSRDEFHT 68
Query: 94 TGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNE 153
G DL RF+ IQ++GL I+R GPY+CAEW+ GG P WL P I +R+++ ++ E
Sbjct: 69 DGARDLGRFLDIIQEEGLRAIVRPGPYICAEWDNGGLPTWLTATPDI-VVRSSDPTYLTE 127
Query: 154 MQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLD 213
++ + + + + ++ + GGPIIL Q+ENEYG +D ++Y+ + +L
Sbjct: 128 VERYLEHLAPIVEPRQI--NHGGPIILMQVENEYGAYGND-----RAYLTHLTNVYRNLG 180
Query: 214 IGVPWIMCQES-----------------------DAPSPMFTPNNPNSPKIWTENWTGWF 250
VP + D + P + +E W GWF
Sbjct: 181 FVVPLTTVDQPMDDMLAHGTLPDLHTTGSFGSRIDERLATLREHQTTGPLMCSEFWIGWF 240
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG----GPY--LTTSY 304
WG D A A+ R G + N YM+HGGTNFG T+G G Y L TSY
Sbjct: 241 DHWGAHHHTTDVADAANALDRLLGAGASV-NIYMFHGGTNFGFTNGANDKGVYQPLVTSY 299
Query: 305 DYDAPIDEYGHLNQPKWGHLREL 327
DYDAP+ E G+ + W RE+
Sbjct: 300 DYDAPLAEDGYPTEKYWA-FREV 321
Score = 42.7 bits (99), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 36/81 (44%), Gaps = 30/81 (37%)
Query: 609 LNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWY 668
L+ G GKG WVNG+N+GRYW RGP Q
Sbjct: 515 LDTSGWGKGAVWVNGFNVGRYWS----------------RGP--------------QHTL 544
Query: 669 HVPRSWIKDGVNTLVLFEEFG 689
VP ++ GVN++++FE FG
Sbjct: 545 FVPAELLRPGVNSIMVFELFG 565
>gi|383114571|ref|ZP_09935333.1| hypothetical protein BSGG_1258 [Bacteroides sp. D2]
gi|382948460|gb|EFS30558.2| hypothetical protein BSGG_1258 [Bacteroides sp. D2]
Length = 775
Score = 191 bits (485), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 192/714 (26%), Positives = 292/714 (40%), Gaps = 166/714 (23%)
Query: 6 HCSRAILLCLILQTLFNL--SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDL 63
H + ++L +I+ L + S +V TI+G+ L+ G +HYPR W D
Sbjct: 5 HKTVLVILNIIVSFLISSCSSPKEQVRIGNGTFTIEGKDIQLICGEMHYPRIPHEYWRDR 64
Query: 64 IKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCA 123
+K+A+ GL+ + YVFWN HE ++DF+G D+ FI+T Q++GLYVILR GPYVCA
Sbjct: 65 LKRARAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGPYVCA 124
Query: 124 EWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKK-EKLFASQGGPIILAQ 182
EW++GG+P WL + R+ + F++ + + I ++ K+ L + GG II+ Q
Sbjct: 125 EWDFGGYPSWLLKEKDM-TYRSKDPRFLSYCERY---IKELGKQLSPLTINNGGNIIMVQ 180
Query: 183 IENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQ----------ESDAPS---- 228
+ENEYG+ +D K Y+ M VP C E P+
Sbjct: 181 VENEYGSYAAD-----KEYLAAIRDMIKEAGFNVPLFTCDGGGQVEAGHVEGALPTLNGV 235
Query: 229 ------PMFTPNNPNSPKIWTENWTGWFKSWGGKDP----KRTAEDLAFAVARFFQFGGT 278
+ P E + WF WG + +R AE L + ++ G
Sbjct: 236 FGEDIFKVVDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLSH-----GV 290
Query: 279 FQNYYMYHGGTNF----GRTSGGPY--LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLK 332
+ YM+HGGTNF G +GG Y TSYDYDAP+ E+G+ PK+ RE+
Sbjct: 291 SVSMYMFHGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWGNC-YPKYHAFREV----- 344
Query: 333 SMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPN 392
++K L G V LP + D T F T ++
Sbjct: 345 -IQKYLPVGTV---------------LP----EVPADNPTTTFATVEL------------ 372
Query: 393 QAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDP 452
+ APL+ + P ++ N L + D Y+ Y T
Sbjct: 373 ---KESAPLRTAFHPTTQSE-----------NVLSMEDLGVDFG-YIHYQTT-------- 409
Query: 453 ILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSA 512
L + L I ++G V S +Y ++ + +++ + +L
Sbjct: 410 -LQKAGKQKLVIQDLRDYAVILIDGKQVASLDRRYNQNS----VTLNVSKTPATLEILVE 464
Query: 513 TVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAK 572
G NYG GI VL G+E K+ + + Y K
Sbjct: 465 NTGRVNYGPDILFNRKGITSQVLW----GNE-------------KLTGWSITPLPLYKEK 507
Query: 573 AANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPT 632
+ E G + K VP ++K TF + D V ++ GKG WVNG +LGR+W
Sbjct: 508 VSEMEFGETIKGVP-----AFHKGTFTVEKKGDCFV-DMSQWGKGAVWVNGKSLGRFW-- 559
Query: 633 YLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
N G P Q Y +P W+K+G N +V+FE
Sbjct: 560 --------------------------NIG-PQQTLY-LPAPWLKEGENEIVVFE 585
>gi|300789308|ref|YP_003769599.1| beta-galactosidase [Amycolatopsis mediterranei U32]
gi|384152800|ref|YP_005535616.1| beta-galactosidase [Amycolatopsis mediterranei S699]
gi|399541188|ref|YP_006553850.1| beta-galactosidase [Amycolatopsis mediterranei S699]
gi|299798822|gb|ADJ49197.1| beta-galactosidase [Amycolatopsis mediterranei U32]
gi|340530954|gb|AEK46159.1| beta-galactosidase [Amycolatopsis mediterranei S699]
gi|398321958|gb|AFO80905.1| beta-galactosidase [Amycolatopsis mediterranei S699]
Length = 584
Score = 190 bits (483), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 115/322 (35%), Positives = 164/322 (50%), Gaps = 45/322 (13%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG +LSG++HY R P +W D I KA+ GL+ IETYV WNAH P +D +G L
Sbjct: 13 LDGRPFRILSGALHYFRVHPDLWADRIDKARRMGLNTIETYVAWNAHAPEPGTFDLSGGL 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF++ + D G+Y I+R GPY+CAEW+ GG P WL P + R K +++ ++ +
Sbjct: 73 DLDRFLRLVADAGMYAIVRPGPYICAEWDNGGLPAWLFRDPSVGVRRYEPK-YLDAVREY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
T + ++ ++ +GGP++L Q+ENEYG D K Y+ A+ + VP
Sbjct: 132 LTKVYEVVVPHQI--DRGGPVLLVQVENEYGAFGDD-----KRYLKALAEHTREAGVTVP 184
Query: 218 WIMCQESDAPSP--------------------------MFTPNNPNSPKIWTENWTGWFK 251
D P+P + + P P + +E W GWF
Sbjct: 185 LTTV---DQPTPEMLEAGSLDGLHRTASFGSGAEARLAILRAHQPTGPLMCSEFWNGWFD 241
Query: 252 SWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG----GPY--LTTSYD 305
WG +A D A + G + N YM+HGGTNFG T+G G Y L TSYD
Sbjct: 242 HWGAHHHTTSAADSAAELDALLAAGASV-NLYMFHGGTNFGLTNGANDKGVYQPLITSYD 300
Query: 306 YDAPIDEYGHLNQPKWGHLREL 327
YDAP+DE G PK+ R++
Sbjct: 301 YDAPLDEAGD-PTPKYHAFRDV 321
>gi|256376699|ref|YP_003100359.1| beta-galactosidase [Actinosynnema mirum DSM 43827]
gi|255921002|gb|ACU36513.1| Beta-galactosidase [Actinosynnema mirum DSM 43827]
Length = 579
Score = 190 bits (483), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 114/306 (37%), Positives = 160/306 (52%), Gaps = 38/306 (12%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG +L+G++HY R P +W D I+KA+ GL+ IETY WN HEP+ YDFTG L
Sbjct: 13 LDGRPHRVLAGALHYFRVHPDLWADRIEKARLMGLNTIETYTPWNLHEPVEGAYDFTGML 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF++ + D G++ I+R GPY+CAEW+ GG P WL+ P + +R + ++ + +
Sbjct: 73 DLERFLRLVADAGMHAIVRPGPYICAEWDNGGLPAWLYRDPEV-GVRRSEPRYLGAVSAY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ D+ ++ +GGP++L QIENEYG SD K Y+ + I VP
Sbjct: 132 LRRVYDVVTPLQI--DRGGPVVLVQIENEYGAYGSD-----KFYLRHLVDLTRECGITVP 184
Query: 218 WIMCQE---------------------SDAPSPMFT--PNNPNSPKIWTENWTGWFKSWG 254
+ S A + T + P P + +E W GWF WG
Sbjct: 185 LTTVDQPTDEMLSQGSLDCLHRTGSFGSRATERLATLRRHQPTGPLMCSEFWNGWFDHWG 244
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG----GPY--LTTSYDYDA 308
+ +AED A + G + N YM+HGGTNFG TSG G Y TSYDYDA
Sbjct: 245 DRHHTTSAEDSAAELDALLAAGASV-NIYMFHGGTNFGLTSGANDKGVYQPTITSYDYDA 303
Query: 309 PIDEYG 314
P+DE G
Sbjct: 304 PLDEAG 309
>gi|417092513|ref|ZP_11957129.1| Beta-galactosidase [Streptococcus suis R61]
gi|353532192|gb|EHC01864.1| Beta-galactosidase [Streptococcus suis R61]
Length = 590
Score = 190 bits (483), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 190/704 (26%), Positives = 291/704 (41%), Gaps = 168/704 (23%)
Query: 33 GRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYD 92
G +DGE +LSG+IHY R P W + K G + +ETYV WN HEP + ++
Sbjct: 7 GDQFYLDGEPFKILSGAIHYFRVHPDDWHHSLYNLKALGFNTVETYVPWNMHEPRKGEFC 66
Query: 93 FTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMN 152
+ G LD+ RF+K Q+ GLY I+R PY+CAEW +GG P WL M +R+++ V++
Sbjct: 67 YEGILDIERFLKLAQELGLYAIVRPSPYICAEWEWGGLPAWL--MKEELRVRSSDSVYLQ 124
Query: 153 EMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSL 212
+ + ++ K KL +QGG +++ Q+ENEYG+ YG+ K Y+ A +
Sbjct: 125 HLDEYYASLI--PKLAKLQLAQGGNVLMFQVENEYGS----YGEE-KEYLRSVAGLMRKH 177
Query: 213 DIGVPWIMCQ-------------ESDA----------------PSPMFTPNNPNSPKIWT 243
+ P E D + F + N P +
Sbjct: 178 GLTAPLFTSDGSWRATLRAGTLIEDDVFVTGNFGSKARENFANMTAFFNEHQKNWPLMCM 237
Query: 244 ENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL--- 300
E W GWF WG + +R E++ +V + G N YM+HGGTNFG +G
Sbjct: 238 EFWDGWFNRWGDEIIRREPEEMVDSVMECIELGSL--NLYMFHGGTNFGFMNGCSARGQI 295
Query: 301 ----TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSS 356
TSYDYDA +DE G N K ++ L + LK + L Y +
Sbjct: 296 DLPQVTSYDYDAILDEAG--NPTKKFYI--LQQRLKEVYPELEYAEPLVKE--------- 342
Query: 357 YNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVR 416
A+S +L D K F T E ++D V
Sbjct: 343 --AKAFSDVLLHD-KVSLFATL-----------------------------ENVSDCV-- 368
Query: 417 GKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVN 476
KG + N +ST Y+ Y T +L+ D + R+ + + Y +
Sbjct: 369 -KGFYPKNMEELDQSTG----YILYRT--ELERDK-----TEAERFRVVDARDRIQIYAD 416
Query: 477 GNYVDSQW-TKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVL 535
G +V +Q+ T+ G +L + KLT + +L +G NYG K P G
Sbjct: 417 GKFVATQYQTEIGDDVELDFKDDKLT-----LDILVENMGRVNYGHKL-TAPTQSKG--- 467
Query: 536 LVGRAGDETIIKDLS--SHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTW 593
+GR + DL H TY + L ++D F +GW +
Sbjct: 468 -LGRGA----MADLHFIGHWETYPLHLESVEDLDF--------SKGWEEGQA------AF 508
Query: 594 YKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGS 653
Y+ FE E L++ G GKG +VN N+GR+W +GP
Sbjct: 509 YRYQFELD-ELADTYLDMTGFGKGVVFVNNVNIGRFWE----------------KGPI-- 549
Query: 654 DKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
++ ++P+ ++K G N +V+FE G +I+F
Sbjct: 550 ------------LYLYIPKGYLKKGENEIVVFETEGKYREKISF 581
>gi|156382804|ref|XP_001632742.1| predicted protein [Nematostella vectensis]
gi|156219802|gb|EDO40679.1| predicted protein [Nematostella vectensis]
Length = 612
Score = 190 bits (483), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 131/333 (39%), Positives = 168/333 (50%), Gaps = 37/333 (11%)
Query: 32 DGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQY 91
+GR T+DG+ +LSG++HY R P W D I K K GL+ +ETYV WN HE ++ +
Sbjct: 45 NGRHFTMDGKPFTILSGAMHYFRIPPQYWEDRIVKLKAMGLNTVETYVSWNLHEEIQGDF 104
Query: 92 DFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFM 151
+F LD++ FIKT Q LYVI+R GPY+CAEW+ GG P WL + P I LR+ + +FM
Sbjct: 105 NFKDGLDIVEFIKTAQKHDLYVIMRPGPYICAEWDLGGLPSWLLHNPNI-YLRSLDPIFM 163
Query: 152 NE-MQNFTTLIVDMAKKEKLFASQGGPIILAQIENE---YGNVMSDYGDAGKSYINWCAK 207
++ F LI + + S GGPII QIENE Y N + + + K
Sbjct: 164 KATLRFFDELIPRLIDYQ---YSNGGPIIAWQIENEYLSYDNSSAYMRKLQQEMVIRGVK 220
Query: 208 MATSLDIGVPWIMCQES--DAPSPMFTPN---------------NPNSPKIWTENWTGWF 250
G+ W M E P + T N PN P + TE W+GWF
Sbjct: 221 ELLFTSDGI-WQMQIEKKYSLPGVLKTVNFQRNETNILKGLRKLQPNMPLMVTEFWSGWF 279
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG-----GPY--LTTS 303
WG T E A + + NYYM HGGTNFG +G G Y TS
Sbjct: 280 DHWGEDKHVLTVEKAAERTKNILKMESSI-NYYMLHGGTNFGFMNGANAENGKYKPTITS 338
Query: 304 YDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
YDYDAPI E G + PK+ LRE KLLK K
Sbjct: 339 YDYDAPISESGDIT-PKYRELRE--KLLKYAPK 368
>gi|410865123|ref|YP_006979734.1| Beta-galactosidase [Propionibacterium acidipropionici ATCC 4875]
gi|410821764|gb|AFV88379.1| Beta-galactosidase [Propionibacterium acidipropionici ATCC 4875]
Length = 591
Score = 190 bits (482), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 120/319 (37%), Positives = 157/319 (49%), Gaps = 39/319 (12%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG +LSG+IHY R P W D I KA+ GL+ IETYV WNAHEP+ Q+ + G L
Sbjct: 13 LDGRPHRILSGAIHYFRIHPDQWADRIHKARLMGLNTIETYVAWNAHEPVEGQWSWEGGL 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL F+K + D+G++ I+R PY+CAEW+ GG P WL +R VFM +Q +
Sbjct: 73 DLAAFLKAVADEGMHAIVRPAPYICAEWDNGGLPAWLFGEKA-AGVRRDEPVFMAAVQAY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ ++ E L GGP+IL QIENEYG SD Y+ + +S I VP
Sbjct: 132 LRRVYEVI--EPLQIHHGGPVILVQIENEYGAYGSD-----PEYLRKLVDITSSAGITVP 184
Query: 218 WIMCQE---------------------SDAPSPMFT--PNNPNSPKIWTENWTGWFKSWG 254
+ S +P + T + P P + E W GWF WG
Sbjct: 185 LTTVDQPEDGMLAAGSLPGLLRTGSFGSRSPERLATLRRHQPTGPLMCMEYWNGWFDDWG 244
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG----GPY--LTTSYDYDA 308
AE A + G + N YM GGTNFG T+G G Y + TSYDYDA
Sbjct: 245 TPHHTTDAEASAADLDALLGSGASV-NLYMLCGGTNFGLTNGANDKGTYEPIVTSYDYDA 303
Query: 309 PIDEYGHLNQPKWGHLREL 327
P+DE GH W RE+
Sbjct: 304 PLDEAGHPTAKYWA-FREV 321
>gi|153806012|ref|ZP_01958680.1| hypothetical protein BACCAC_00257 [Bacteroides caccae ATCC 43185]
gi|149130689|gb|EDM21895.1| glycosyl hydrolase family 35 [Bacteroides caccae ATCC 43185]
Length = 774
Score = 189 bits (480), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 192/708 (27%), Positives = 290/708 (40%), Gaps = 165/708 (23%)
Query: 11 ILLCLILQTLF-NLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
+L L++ LF S R+ DG +DG+ L+ G +HY R W D +K+A+
Sbjct: 10 FILGLLMPFLFLACSSKERIKIDGGTFNVDGKDVQLICGEMHYARIPHEYWRDRLKRARA 69
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
GL+ I YVFWN HE ++DF+G D+ F++ Q++GLYVILR GPY CAEW++GG
Sbjct: 70 MGLNTISVYVFWNFHERQPGEFDFSGQADVAEFVRLAQEEGLYVILRPGPYACAEWDFGG 129
Query: 130 FPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKK-EKLFASQGGPIILAQIENEYG 188
+P WL + R+ + F+ + + I + K+ L + GG I++ Q+ENEYG
Sbjct: 130 YPSWLLKEKDM-VYRSKDPRFLEYCERY---IKALGKQLAPLTVNNGGNILMVQVENEYG 185
Query: 189 NVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQ--------ESDAPSP----MFTPN-- 234
+ +D K Y+ M VP C D P +F+ +
Sbjct: 186 SYAAD-----KEYLAALRDMIKDAGFNVPLFTCDGGGQVEAGHIDGALPTLNGVFSEDIF 240
Query: 235 ------NPNSPKIWTENWTGWFKSWGGK----DPKRTAEDLAFAVARFFQFGGTFQNYYM 284
+P P E + WF WG + D KR AE L + + + G + YM
Sbjct: 241 KIIDKYHPGGPYFVAEFYPAWFDVWGQRHSTVDYKRPAEQLDWMLGQ-----GVSVSMYM 295
Query: 285 YHGGTNF----GRTSGGPYL--TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTL 338
+HGGTNF G + G Y TSYDYDAP+ E+G+ PK+ RE+ ++K L
Sbjct: 296 FHGGTNFWYMNGANTAGGYRPQPTSYDYDAPLGEWGNC-YPKYYAFREV------IQKHL 348
Query: 339 TYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQ 398
+G V LP + D T F T ++ ++ +Q +
Sbjct: 349 PHGTV---------------LP----EVPADNPTTTFATIELKESAPLQAAF-HQTTESE 388
Query: 399 APLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSS 458
L + +M DF G+ T I++ + DL+D IL
Sbjct: 389 NVLSME---DMGVDF-----GYIHYQTTINKAGKQK-------LIIQDLRDYAVIL---- 429
Query: 459 NMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQN 518
V+G V S +Y +N + + + + + +L G N
Sbjct: 430 ----------------VDGKQVASLDRRYNQNNVMLD----IQKAPATLEILVENTGRVN 469
Query: 519 YGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSER 578
YG GI VL GDE K+ + + Y K +
Sbjct: 470 YGPDILFNRKGITNQVL----CGDE-------------KLTGWSITPLPLYKEKVSEMNF 512
Query: 579 GWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEED 638
G S + P ++K F + D V ++ GKG WVNG +LGR+W
Sbjct: 513 GESIQGKP-----AFHKGIFTVRQKGDCFV-DMSRWGKGAVWVNGKSLGRFW-------- 558
Query: 639 GCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
N G P Q Y +P W+K+G N +V+FE
Sbjct: 559 --------------------NIG-PQQTLY-LPAPWLKEGENEIVVFE 584
>gi|423219555|ref|ZP_17206051.1| hypothetical protein HMPREF1061_02824 [Bacteroides caccae
CL03T12C61]
gi|392624760|gb|EIY18838.1| hypothetical protein HMPREF1061_02824 [Bacteroides caccae
CL03T12C61]
Length = 774
Score = 189 bits (480), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 192/714 (26%), Positives = 290/714 (40%), Gaps = 177/714 (24%)
Query: 11 ILLCLILQTLF-NLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
+L L++ LF S R+ DG +DG+ L+ G +HY R W D +K+A+
Sbjct: 10 FILGLLMPFLFLACSSKERIKIDGGTFNVDGKDVQLICGEMHYARIPHEYWRDRLKRARA 69
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
GL+ I YVFWN HE ++DF+G D+ F++ Q++GLYVILR GPY CAEW++GG
Sbjct: 70 MGLNTISVYVFWNFHERQPGEFDFSGQADVAEFVRLAQEEGLYVILRPGPYACAEWDFGG 129
Query: 130 FPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKK-EKLFASQGGPIILAQIENEYG 188
+P WL + R+ + F+ + + I + K+ L + GG I++ Q+ENEYG
Sbjct: 130 YPSWLLKEKDM-VYRSKDPRFLEYCERY---IKALGKQLAPLTVNNGGNILMVQVENEYG 185
Query: 189 NVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQ--------ESDAPSP----MFTPN-- 234
+ +D K Y+ M VP C D P +F+ +
Sbjct: 186 SYAAD-----KEYLAALRDMIKDAGFNVPLFTCDGGGQVEAGHIDGALPTLNGVFSEDIF 240
Query: 235 ------NPNSPKIWTENWTGWFKSWGGK----DPKRTAEDLAFAVARFFQFGGTFQNYYM 284
+P P E + WF WG + D KR AE L + + + G + YM
Sbjct: 241 KIIDKYHPGGPYFVAEFYPAWFDVWGQRHSTVDYKRPAEQLDWMLGQ-----GVSVSMYM 295
Query: 285 YHGGTNF----GRTSGGPYL--TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTL 338
+HGGTNF G + G Y TSYDYDAP+ E+G+ PK+ RE+ ++K L
Sbjct: 296 FHGGTNFWYMNGANTAGGYRPQPTSYDYDAPLGEWGNC-YPKYYAFREV------IQKHL 348
Query: 339 TYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQ 398
+G V LP + D T F T ++ +
Sbjct: 349 PHGTV---------------LP----EVPADNPTTTFATIEL---------------KES 374
Query: 399 APLQWKWRPEMINDFVVRGK------GHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDP 452
APLQ + ++ V+ + G+ T I++ + DL+D
Sbjct: 375 APLQAAFHQTTESENVLSMEDLGVDFGYIHYQTTINKAGKQK-------LIIQDLRDYAV 427
Query: 453 ILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSA 512
IL V+G V S +Y +N + + + + + +L
Sbjct: 428 IL--------------------VDGKQVASLDRRYNQNNVMLD----IQKAPATLEILVE 463
Query: 513 TVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAK 572
G NYG GI VL GDE K+ + + Y K
Sbjct: 464 NTGRVNYGPDILFNRKGITNQVL----CGDE-------------KLTGWSITPLPLYKEK 506
Query: 573 AANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPT 632
+ G S + P ++K F + D V ++ GKG WVNG +LGR+W
Sbjct: 507 VSEMNFGESIQGKP-----AFHKGIFTVRQKGDCFV-DMSRWGKGAVWVNGKSLGRFW-- 558
Query: 633 YLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
N G P Q Y +P W+K+G N +V+FE
Sbjct: 559 --------------------------NIG-PQQTLY-LPAPWLKEGENEIVVFE 584
>gi|269794634|ref|YP_003314089.1| beta-galactosidase [Sanguibacter keddieii DSM 10542]
gi|269096819|gb|ACZ21255.1| beta-galactosidase [Sanguibacter keddieii DSM 10542]
Length = 586
Score = 189 bits (479), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 114/322 (35%), Positives = 163/322 (50%), Gaps = 45/322 (13%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +LSG++HY R P +W D I KA+ GL+ IETYV WNAH P R ++ G L
Sbjct: 10 LDGKPFRILSGALHYFRVHPDLWADRIHKARLMGLNTIETYVPWNAHAPQRGEFRTDGAL 69
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF++ ++ +G+ I+R GPY+CAEW+ GG P WL P + +R ++M + +
Sbjct: 70 DLERFLRLVEAEGMLAIVRPGPYICAEWDNGGLPGWLFRDPAV-GVRRDEPLYMEAVSEY 128
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++D+ ++ +GGP++L Q+ENEYG SD+ Y+ + S I VP
Sbjct: 129 LGTVLDLVAPFQV--DRGGPVVLVQVENEYGAYGSDH-----VYLEKLMALTRSHGITVP 181
Query: 218 WIMCQESDAPS--------------------------PMFTPNNPNSPKIWTENWTGWFK 251
D PS + P P + E W GWF
Sbjct: 182 ---LTSIDQPSGTMLADGSIDGLHRTGSFGSRSAERLATLREHQPTGPLMCAEFWDGWFD 238
Query: 252 SWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG----GPY--LTTSYD 305
WG +A+D A + G + N YM+HGGTNFG TSG G Y TTSYD
Sbjct: 239 HWGAHHHTTSAQDAARELDELLAAGASV-NIYMFHGGTNFGFTSGANDKGVYQPTTTSYD 297
Query: 306 YDAPIDEYGHLNQPKWGHLREL 327
YDAP+ E G+ + K+ RE+
Sbjct: 298 YDAPLAEDGYPTE-KFFAFREV 318
>gi|254384398|ref|ZP_04999740.1| beta-galactosidase [Streptomyces sp. Mg1]
gi|194343285|gb|EDX24251.1| beta-galactosidase [Streptomyces sp. Mg1]
Length = 588
Score = 188 bits (478), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 115/315 (36%), Positives = 161/315 (51%), Gaps = 40/315 (12%)
Query: 28 RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPL 87
++ DG +DGE +LSG +HY R PG+W D + KA+ GL+ +ETYV WN H+P
Sbjct: 10 QIEDDG--FRLDGEPFRILSGGLHYFRVHPGLWRDRLHKARLMGLNTVETYVPWNLHQPR 67
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
++ G LDL RF+ +GL+V+LR GPY+CAEW GG P WL P + LR+ +
Sbjct: 68 PDEFRMDGGLDLPRFLDLAAAEGLHVLLRPGPYICAEWEGGGLPSWLLADPAM-RLRSRD 126
Query: 148 KVFMNEMQN-FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
F+ + + F L+ + + AS+GGP++ Q+ENEYG YGD +Y+ A
Sbjct: 127 PNFLAAVDDYFRRLLPPLHDR---LASRGGPVLAVQVENEYGA----YGD-DTAYLEHLA 178
Query: 207 KMATSLDIGVPWIMCQ-----ESDAPSPMFTPNN----------------PNSPKIWTEN 245
+ VP C E A + + N P++P + TE
Sbjct: 179 DSLRRHGVDVPLFTCDQPADLERGALAGVLATANFGSRPAAHLATLRTARPSAPLLCTEF 238
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG------PY 299
W GWF WGG R AE + + G + N+YM+HGGTNFG +G
Sbjct: 239 WIGWFDRWGGNHVVRDAEQASQELDELLATGASV-NFYMFHGGTNFGFMNGANDKHTYRP 297
Query: 300 LTTSYDYDAPIDEYG 314
TSYDYDAP+DE G
Sbjct: 298 TVTSYDYDAPLDEAG 312
Score = 44.3 bits (103), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 33/66 (50%), Gaps = 15/66 (22%)
Query: 579 GWSSKNVPLNRRM--------------TWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGY 624
GW+S+ +PL +Y+ TFEA D L+L G KG AWVNG+
Sbjct: 475 GWTSRPLPLTAPQDLPFGIGPATPTGPAFYRGTFEADRAAD-AFLHLDGWTKGSAWVNGF 533
Query: 625 NLGRYW 630
LGRYW
Sbjct: 534 ALGRYW 539
>gi|260912222|ref|ZP_05918774.1| beta-galactosidase [Prevotella sp. oral taxon 472 str. F0295]
gi|260633656|gb|EEX51794.1| beta-galactosidase [Prevotella sp. oral taxon 472 str. F0295]
Length = 627
Score = 188 bits (477), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 200/728 (27%), Positives = 301/728 (41%), Gaps = 164/728 (22%)
Query: 1 MATLKHCSRAILLCLILQTL-FNLSLAYRVSHDGRAIT-----IDGERKILLSGSIHYPR 54
M LK + A +L L T + + + + AIT +G+ L SG +HY R
Sbjct: 1 MKHLKCLAMATMLLLTATTAEAKQNKQTKTTRNTFAITDGQFVYNGKPMQLHSGEMHYAR 60
Query: 55 STPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDF-TGNLDLIRFIKTIQDQGLYV 113
W +K K GL+A+ TYVFWN HE ++D+ TGN +L +F+KT ++G+ V
Sbjct: 61 VPAPYWRHRMKMMKAMGLNAVATYVFWNYHETEPGKWDWKTGNRNLRQFVKTAAEEGMLV 120
Query: 114 ILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFAS 173
ILR GPY CAEW++GG+P WL G+ +R N+ F++ + + + + ++ +
Sbjct: 121 ILRPGPYCCAEWDFGGYPWWLSKAKGLV-IRADNQPFLDSCRVYINQLASQMRDLQI--T 177
Query: 174 QGGPIILAQIENEYGNVMSDYGDAG-KSYINWCAKMATSL-DIG--VPWIMCQES----- 224
+GGPII+ Q ENE+G+ ++ D +S+ + AK+ L D G VP S
Sbjct: 178 KGGPIIMVQAENEFGSYVAQRKDVPLESHRAYSAKIKQQLIDAGFDVPLFTSDGSWLFKG 237
Query: 225 -DAPSPMFTPNNPNS----------------PKIWTENWTGWFKSWGGKDPKRTAEDLAF 267
+ T N N P + E + GW W P+ + E +
Sbjct: 238 GTIEGALPTANGENDIEKLKKVVNEYNGGKGPYMVAEFYPGWLSHWAEPFPQVSTESIVK 297
Query: 268 AVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT--------TSYDYDAPIDEYGHLNQP 319
A++ + G +F NYYM HGGTNFG TSG Y T TSYDYDAPI E G N P
Sbjct: 298 QTAKYLENGVSF-NYYMVHGGTNFGFTSGANYTTATNLQSDLTSYDYDAPISEAG-WNTP 355
Query: 320 KWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAK 379
K+ L + L NV YN+PA I P +K
Sbjct: 356 KYDAL-----------RALMIKNV------------KYNVPAVPQRI-PVIAIPNIKLSK 391
Query: 380 VNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYL 439
N+ K +A + PL + E +N +G G+ +Q
Sbjct: 392 SADVLNLLTK--GKAVENDTPLTF----EDLN----QGHGYVLYRRHFNQP--------- 432
Query: 440 WYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVK 499
+SG T++I YVNG V + P
Sbjct: 433 --------------ISG----TMKIAGLADYALVYVNGQKVGELDRVSDVDSIEINMPFN 474
Query: 500 LTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVG 559
+ +L +G NYG++ GI GPV++ G +++ + YK+
Sbjct: 475 -----GVLDILVENMGRINYGARIPQSIKGINGPVVIDGN--------EITGNWQMYKLP 521
Query: 560 LYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFA 619
+ D N+ ++K +P T Y TF D LN++ GKG
Sbjct: 522 MNEAPD--------VNALPTANNKGLP-----TLYSGTFNLDTTGD-TFLNMETWGKGIV 567
Query: 620 WVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGV 679
++NG+NLGRYW RGP Q ++P ++K G
Sbjct: 568 FINGFNLGRYWK----------------RGP--------------QQTLYLPGCFLKKGE 597
Query: 680 NTLVLFEE 687
N +V+FE+
Sbjct: 598 NKIVVFEQ 605
>gi|294627330|ref|ZP_06705916.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 11122]
gi|292598412|gb|EFF42563.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 11122]
Length = 613
Score = 187 bits (476), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 176/669 (26%), Positives = 265/669 (39%), Gaps = 153/669 (22%)
Query: 9 RAILLCLILQTLFNLSLAYRVSH---------DGRAITIDGERKILLSGSIHYPRSTPGM 59
R L L+L F L + + G DG+ LLSG+IH+ R
Sbjct: 3 RTTLAPLVLALAFALPITGAAADTERWPNFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAY 62
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W D ++KA+ GL+ +ETYVFWN EP + Q+DF+GN D+ F++ QGL VILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVREAAAQGLNVILRPGP 122
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
Y CAEW GG+P WL I +R+ + F+ Q + + + + + L GGPII
Sbjct: 123 YACAEWEAGGYPAWLFGKGNIR-VRSRDPRFLAASQAYLDALAN--QVQPLLNHNGGPII 179
Query: 180 LAQIENEYGNVMSDYG---DAGKSYIN--------WCAKMATSLDIG-VPWIMCQESDAP 227
Q+ENEYG+ D+ D Y+ + + A L G +P + + AP
Sbjct: 180 AVQVENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAP 239
Query: 228 SPM------FTPNNPNSPKIWTENWTGWFKSWG----GKDPKRTAEDLAFAVARFFQFGG 277
P+ P++ E W GWF WG D ++ AE+ + + + G
Sbjct: 240 GEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGKPHAATDARQQAEEFEWILRQ-----G 294
Query: 278 TFQNYYMYHGGTNFGRTSGGPYL----------TTSYDYDAPIDEYGHLNQPKWGHLREL 327
N YM+ GGT+FG +G + TTSYDYDA +DE GH PK+ +R+
Sbjct: 295 HSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHPT-PKFALMRDA 353
Query: 328 HKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK 387
+ ++ + T LPD E + N +
Sbjct: 354 IARVTGIQPPALPATIATT-------------------TLPDTPLRESASLWDNLPAPIA 394
Query: 388 VKRPN---QAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTN 444
+ P Q G D Y+ Y T
Sbjct: 395 IDTPRPMEQFGQDYG--------------------------------------YILYRTT 416
Query: 445 ADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGK 504
++G L + V YV+ V S + + E P G+
Sbjct: 417 ---------VTGPRKGPLYLGDVRDVARVYVDQRPVGSVERRLQQVSLDVEIPA----GQ 463
Query: 505 NQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAG--DETIIKDLSSHKW-TYKVGLY 561
+ + +L G NY GP + GRAG D ++ + W + + +
Sbjct: 464 HTLDVLVENSGRINY------------GPRMADGRAGLVDPVVLDNQQLTGWQAFPLPM- 510
Query: 562 GLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWV 621
+ +S RGW+ K V + +++ T D L+++ GKGFAW
Sbjct: 511 ----------RTPDSIRGWTRKAV---QGPAFHRGTLRIGTPTD-TYLDMRAFGKGFAWA 556
Query: 622 NGYNLGRYW 630
NG NLGR+W
Sbjct: 557 NGVNLGRHW 565
>gi|379722393|ref|YP_005314524.1| beta-galactosidase [Paenibacillus mucilaginosus 3016]
gi|378571065|gb|AFC31375.1| beta-galactosidase [Paenibacillus mucilaginosus 3016]
Length = 591
Score = 187 bits (475), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 110/309 (35%), Positives = 155/309 (50%), Gaps = 38/309 (12%)
Query: 36 ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTG 95
T DGE L SG+IHY R P W D ++K K G + +ETYV WN HEP ++ F G
Sbjct: 11 FTYDGEEIRLYSGAIHYFRIVPEYWEDRLRKLKACGFNTVETYVPWNLHEPQEGRFVFEG 70
Query: 96 NLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQ 155
DL RFI+ GL+VI+R PY+CAEW +GG P WL PG+ +LR + ++++++
Sbjct: 71 MADLERFIRLAGRLGLHVIVRPSPYICAEWEFGGLPAWLLAEPGM-KLRCADPLYLSKVD 129
Query: 156 NFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIG 215
+ ++ + L + GGP+IL Q+ENEYG+ SD K+Y+ I
Sbjct: 130 AYYDELI--PRLVPLLCTSGGPVILVQVENEYGSYGSD-----KAYLEHLRDGLVRRGID 182
Query: 216 VPWI--------MCQESDAPSPMFTPN---------------NPNSPKIWTENWTGWFKS 252
VP M Q P + T N P P + E W GWF
Sbjct: 183 VPLFTSDGPTDSMLQGGSLPGVLATVNFGSRTAESFAKLREYQPQGPLMCMEYWNGWFDH 242
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYDY 306
W + +R A D A + G + N+YM+HGGTNFG +G ++ TSYDY
Sbjct: 243 WMEEHHQRDAADAARVFGEMLEAGASV-NFYMFHGGTNFGFHNGANHIKTYEPTITSYDY 301
Query: 307 DAPIDEYGH 315
D+P+ E+G
Sbjct: 302 DSPLTEWGE 310
>gi|337749468|ref|YP_004643630.1| beta-galactosidase [Paenibacillus mucilaginosus KNP414]
gi|336300657|gb|AEI43760.1| Beta-galactosidase [Paenibacillus mucilaginosus KNP414]
Length = 591
Score = 187 bits (475), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 110/309 (35%), Positives = 155/309 (50%), Gaps = 38/309 (12%)
Query: 36 ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTG 95
T DGE L SG+IHY R P W D ++K K G + +ETYV WN HEP ++ F G
Sbjct: 11 FTYDGEEIRLYSGAIHYFRIVPEYWEDRLRKLKACGFNTVETYVPWNLHEPQEGRFVFEG 70
Query: 96 NLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQ 155
DL RFI+ GL+VI+R PY+CAEW +GG P WL PG+ +LR + ++++++
Sbjct: 71 MADLERFIRLAGRLGLHVIVRPSPYICAEWEFGGLPAWLLAEPGM-KLRCADPLYLSKVD 129
Query: 156 NFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIG 215
+ ++ + L + GGP+IL Q+ENEYG+ SD K+Y+ I
Sbjct: 130 AYYDELI--PRLVPLLCTSGGPVILVQVENEYGSYGSD-----KAYLEHLRDGLVRRGID 182
Query: 216 VPWI--------MCQESDAPSPMFTPN---------------NPNSPKIWTENWTGWFKS 252
VP M Q P + T N P P + E W GWF
Sbjct: 183 VPLFTSDGPTDSMLQGGSLPGVLATVNFGSRTAESFAKLREYQPQGPLMCMEYWNGWFDH 242
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYDY 306
W + +R A D A + G + N+YM+HGGTNFG +G ++ TSYDY
Sbjct: 243 WMEEHHQRDAADAARVFGEMLEAGASV-NFYMFHGGTNFGFYNGANHIKTYEPTITSYDY 301
Query: 307 DAPIDEYGH 315
D+P+ E+G
Sbjct: 302 DSPLTEWGE 310
>gi|386725149|ref|YP_006191475.1| beta-galactosidase [Paenibacillus mucilaginosus K02]
gi|384092274|gb|AFH63710.1| beta-galactosidase [Paenibacillus mucilaginosus K02]
Length = 591
Score = 187 bits (474), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 110/309 (35%), Positives = 155/309 (50%), Gaps = 38/309 (12%)
Query: 36 ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTG 95
T DGE L SG+IHY R P W D ++K K G + +ETYV WN HEP ++ F G
Sbjct: 11 FTYDGEELRLYSGAIHYFRIVPEYWEDRLRKLKACGFNTVETYVPWNLHEPQEGRFVFEG 70
Query: 96 NLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQ 155
DL RFI+ GL+VI+R PY+CAEW +GG P WL PG+ +LR + ++++++
Sbjct: 71 MADLERFIRLAGRLGLHVIVRPSPYICAEWEFGGLPAWLLAEPGM-KLRCADPLYLSKVD 129
Query: 156 NFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIG 215
+ ++ + L + GGP+IL Q+ENEYG+ SD K+Y+ I
Sbjct: 130 AYYDELI--PRLVPLLCTSGGPVILVQVENEYGSYGSD-----KAYLEHLRDGLVRRGID 182
Query: 216 VPWI--------MCQESDAPSPMFTPN---------------NPNSPKIWTENWTGWFKS 252
VP M Q P + T N P P + E W GWF
Sbjct: 183 VPLFTSDGPTDAMLQGGSLPGVLATVNFGSRTAESFAKLREYQPQGPLMCMEYWNGWFDH 242
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYDY 306
W + +R A D A + G + N+YM+HGGTNFG +G ++ TSYDY
Sbjct: 243 WMEEHHQRDAADAARVFGEMLEAGASV-NFYMFHGGTNFGFYNGANHIKTYEPTITSYDY 301
Query: 307 DAPIDEYGH 315
D+P+ E+G
Sbjct: 302 DSPLTEWGE 310
>gi|2289790|dbj|BAA21669.1| beta-galactosidase [Bacillus circulans]
Length = 586
Score = 187 bits (474), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 111/317 (35%), Positives = 165/317 (52%), Gaps = 45/317 (14%)
Query: 35 AITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFT 94
+ +DG+ LLSG++HY R+ P W D + K K G + +ETYV WN HEP Q+ F
Sbjct: 9 SFLLDGKEIRLLSGAMHYFRTVPEYWEDRLLKLKACGFNTVETYVAWNLHEPEEGQFVFE 68
Query: 95 GNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEM 154
G D++RFIKT + GL+VI+R GP++CAEW +GGFP WL +P I +LR N+ ++ ++
Sbjct: 69 GIADIVRFIKTAEKVGLHVIVRPGPFICAEWEFGGFPYWLLTVPNI-KLRCFNQPYLEKV 127
Query: 155 QNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDI 214
+ ++ + + L +S GGPII QIENEYG+ +D + Y+ + + +
Sbjct: 128 DAYFDVLFERLR--PLLSSNGGPIIALQIENEYGSFGND-----QKYLQYL-RDGIKKRV 179
Query: 215 GVPWIMCQESDAPSP----------MFTPNN----------------PNSPKIWTENWTG 248
G + SD P P +F N PN+P + E W G
Sbjct: 180 GNELLFT--SDGPEPSMLSGGMIEGIFETVNFGSRAESAFAQLKQYQPNAPLMCMEFWHG 237
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY-------LT 301
WF WG + R+AE + + + G+ N+YM HGGTNFG +G +
Sbjct: 238 WFDHWGEEHHTRSAESVVETLEEILKQNGSV-NFYMAHGGTNFGFYNGANHNETDYQPTI 296
Query: 302 TSYDYDAPIDEYGHLNQ 318
TSYDYD + E G + +
Sbjct: 297 TSYDYDGLLTESGDVTE 313
>gi|125526285|gb|EAY74399.1| hypothetical protein OsI_02287 [Oryza sativa Indica Group]
Length = 255
Score = 187 bits (474), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 93/199 (46%), Positives = 118/199 (59%), Gaps = 49/199 (24%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+D R++ IDG+R+I+LSGSIHYPRSTP
Sbjct: 30 VSYDDRSLVIDGQRRIILSGSIHYPRSTP------------------------------- 58
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+ IQ+ G+Y ILRIGPY+C EWNYGG P WL ++PG++ R N+
Sbjct: 59 ---------------EEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQ-FRLHNE 102
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGD--AGKSYINWCA 206
F NEM+ FTTLIV+ K K+FA QGGPIILAQIENEYGN+M + + YI+WCA
Sbjct: 103 PFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIHWCA 162
Query: 207 KMATSLDIGVPWIMCQESD 225
MA ++GVPWIMCQ+ D
Sbjct: 163 DMANKQNVGVPWIMCQQDD 181
>gi|139439964|ref|ZP_01773301.1| Hypothetical protein COLAER_02339 [Collinsella aerofaciens ATCC
25986]
gi|133774730|gb|EBA38550.1| glycosyl hydrolase family 35 [Collinsella aerofaciens ATCC 25986]
Length = 598
Score = 186 bits (473), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 176/701 (25%), Positives = 289/701 (41%), Gaps = 166/701 (23%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+D E +LSG+IHY R P W + K G + +ETYV WN HEP +DF+G++
Sbjct: 12 LDDEPFTILSGAIHYMRVHPSDWHHSLYNLKALGFNTVETYVPWNLHEPKPGVFDFSGSI 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL F+ GLY I+R P++CAEW +GG P WL + + K + Q +
Sbjct: 72 DLAAFLDEAASLGLYAIVRPSPFICAEWEFGGMPAWLLREHDMRPRSSDPKFLAHVAQYY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
L+ + ++ +GG II+ Q+ENEYG+ D K Y+ ++ + VP
Sbjct: 132 DHLMPILVSRQ---IDKGGNIIMMQVENEYGSYCED-----KDYLRAIRRLMVERGVSVP 183
Query: 218 W--------------------IMC---------QESDAPSPMFTPNNPNSPKIWTENWTG 248
++C + +A S + P + E W G
Sbjct: 184 LCTSDGPWRGCLRAGTLIDDDVLCTGNFGSHAKENFEALSAFHKEHGKQWPLMCMELWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF +G +R EDLA V + GG+ N YM+HGGTNFG R + +
Sbjct: 244 WFNRYGENVIRRDPEDLASCVREVLELGGSL-NLYMFHGGTNFGFMNGCSARHTHDLHQV 302
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPA 361
TSYDYDAP+DE G+ + + R +H+L + ++ ++ ++++P
Sbjct: 303 TSYDYDAPLDEQGNPTEKYFAIQRTVHELYPDIAQS------------KPLTKKAFSMPD 350
Query: 362 WSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHF 421
SVS + FN + + P +A Q P+ + EM + G+
Sbjct: 351 ISVSE----RVSLFNVLDI-------LSEPIEA---QYPMPME---EMGQSY-----GYT 388
Query: 422 ALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVD 481
T +++ D D++ I R+ + +VNG+ V
Sbjct: 389 LYTTTVER----------------DRADEERI---------RVIDARDRAQMFVNGDKVA 423
Query: 482 SQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF--DMVPNGIPGPVLLVGR 539
+Q+ ++ + P + N++ +L+ +G NYG K D GI R
Sbjct: 424 TQYQEHIGEDIHCVLPCE----HNRLDVLTEDMGRVNYGHKLLADTQHKGI--------R 471
Query: 540 AGDETIIKDLSSHKWT-YKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTF 598
G + DL H T +++ LD N + GW + + ++Y+ F
Sbjct: 472 TG---VCVDL--HFVTGWEMRCLPLD-----NIDNLDYSAGW------VEGQPSFYRAKF 515
Query: 599 EAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAY 658
+ D ++ G GKG A+VNG N+GR+W +GP
Sbjct: 516 DISEPAD-TFIDTTGFGKGVAFVNGTNVGRFWD----------------KGPI------- 551
Query: 659 NCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
+ +VP + G N LV+FE G ++I+ ++
Sbjct: 552 -------MTLYVPHGLLHPGTNELVMFETEGVYDAKISLRS 585
>gi|224542300|ref|ZP_03682839.1| hypothetical protein CATMIT_01478 [Catenibacterium mitsuokai DSM
15897]
gi|224524842|gb|EEF93947.1| glycosyl hydrolase family 35 [Catenibacterium mitsuokai DSM 15897]
Length = 577
Score = 186 bits (473), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 116/347 (33%), Positives = 170/347 (48%), Gaps = 51/347 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
IDG++ ++SG++HY R P W D + K+ G +A+ETY+ WN HEP + ++DF G
Sbjct: 12 IDGQKTKIISGAVHYFRIVPEYWEDTLLDLKDMGCNAVETYIPWNLHEPYKGKFDFDGQK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+ F++ + GLYVI+R PY+C+EW GG P WL I LRT + V+M ++ +
Sbjct: 72 DVCAFLELAKKLGLYVIIRPSPYICSEWELGGLPAWLLKDSDI-RLRTNDSVYMKHLEEY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+++ M K ++ ++ G IILAQ+ENEYG+ D K Y+ KM I VP
Sbjct: 131 YAVLLPMIAKYQI--NREGTIILAQLENEYGSYNQD-----KDYLKALLKMMREYGIEVP 183
Query: 218 WI--------------MCQESDAPSPMFTPNNPN---------------SPKIWTENWTG 248
+ +E P+ F N +P + E W G
Sbjct: 184 IFTADGTWEEALEAGSLFEEDVFPTGNFGSNAKENIAVLKEFMKKHQIVAPIMCMEFWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF W + KR E+L + G N+YM+HGGTNFG R
Sbjct: 244 WFNRWNMEIVKRDPEELVQSAKEMIDLGSI--NFYMFHGGTNFGWMNGCSARKEHDLPQI 301
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLREL----HKLLKSMEKTLTYGNVT 344
TSYDYDA + EYG + K+ LR++ +L KT +YG V+
Sbjct: 302 TSYDYDAILTEYGAKTE-KYHLLRKMITGKQDILPDRRKTASYGRVS 347
>gi|223945899|gb|ACN27033.1| unknown [Zea mays]
Length = 296
Score = 186 bits (472), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 115/300 (38%), Positives = 160/300 (53%), Gaps = 14/300 (4%)
Query: 401 LQWKWRPEMINDFVVRGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSN 459
W+ E N R F + L++Q S T D SDYLWY T ++ ++ L
Sbjct: 7 FSWQSYSEATNSLDGRA---FTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQW 63
Query: 460 MTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNY 519
L I S+G L +VNG + + Y + + VK+ +G N+IS+LSA VGL N
Sbjct: 64 PQLTIYSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQ 123
Query: 520 GSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERG 579
G+ ++ G+ GPV L G + +DLS KWTY++GL+G A +++ E G
Sbjct: 124 GTHYETWNVGVLGPVTLSGLNEGK---RDLSDQKWTYQIGLHGESLGVQSVAGSSSVEWG 180
Query: 580 WSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDG 639
++ PL TW+K F AP + PV L++ MGKG AWVNG ++GRYW +Y A G
Sbjct: 181 SAAGKQPL----TWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYW-SYKASSSG 235
Query: 640 CSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
C C Y G Y KC CG+ SQ +YHVPRSW+ N LV+ EEFGG+ S + T
Sbjct: 236 CG--GCSYAGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVMLEEFGGDLSGVKLVT 293
>gi|313241117|emb|CBY33414.1| unnamed protein product [Oikopleura dioica]
Length = 608
Score = 186 bits (471), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 112/318 (35%), Positives = 161/318 (50%), Gaps = 43/318 (13%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+LSGS+HY R W D ++K K GL+ ++TY+ WN HEP + F LD+ F+K
Sbjct: 19 ILSGSLHYFRVPKEYWRDRLEKLKGAGLNTVQTYIGWNLHEPREGDFIFEDELDVSEFLK 78
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
+D GLYVI+R GPY+CAEW +GGFP WL + +T ++ ++ +QN+ T++
Sbjct: 79 IAKDVGLYVIMRPGPYICAEWEWGGFPAWLLTKENMIVRQTKSEAYLAAVQNWFTVLFSQ 138
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIM--CQ 222
+ + S+GGPII Q+ENEY + D Y+ W + T D+G +++
Sbjct: 139 LRDHQW--SRGGPIISIQVENEYASYNKD-----SEYLPWVKNLLT--DVGKCFLLKIIN 189
Query: 223 ESD--------APSPMFTPN--------------NPNSPKIWTENWTGWFKSWGGKDPKR 260
E++ P T N PN PK+ TE W GWF WG +
Sbjct: 190 ETNFFLKGAHLLPDTFLTANFQSVGNAFEVLDKLQPNRPKMVTEFWAGWFDHWGQQGHST 249
Query: 261 TAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL---------TTSYDYDAPID 311
+ R G+ N YM+HGGT+FG +G +L TTSYDYDAP+
Sbjct: 250 LSPTTFNKTMREILNAGSSVNQYMFHGGTSFGWMAGSNWLSKKQRGTSDTTSYDYDAPLS 309
Query: 312 EYGHLNQPKWGHLRELHK 329
E G L + KW RE+ K
Sbjct: 310 ESGDLTE-KWNVTREIIK 326
>gi|313238883|emb|CBY13879.1| unnamed protein product [Oikopleura dioica]
Length = 601
Score = 186 bits (471), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 112/318 (35%), Positives = 161/318 (50%), Gaps = 43/318 (13%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+LSGS+HY R W D ++K K GL+ ++TY+ WN HEP + F LD+ F+K
Sbjct: 19 ILSGSLHYFRVPKEYWRDRLEKLKGAGLNTVQTYIGWNLHEPREGDFIFEDELDVSEFLK 78
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
+D GLYVI+R GPY+CAEW +GGFP WL + +T ++ ++ +QN+ T++
Sbjct: 79 IAKDVGLYVIMRPGPYICAEWEWGGFPAWLLTKENMIVRQTKSEAYLAAVQNWFTVLFSQ 138
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIM--CQ 222
+ + S+GGPII Q+ENEY + D Y+ W + T D+G +++
Sbjct: 139 LRDHQW--SRGGPIISIQVENEYASYNKD-----SEYLPWVKNLLT--DVGKCFLLKIIN 189
Query: 223 ESD--------APSPMFTPN--------------NPNSPKIWTENWTGWFKSWGGKDPKR 260
E++ P T N PN PK+ TE W GWF WG +
Sbjct: 190 ETNFFLKGAHLLPDTFLTANFQSVGNAFEVLDKLQPNRPKMVTEFWAGWFDHWGQQGHSL 249
Query: 261 TAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL---------TTSYDYDAPID 311
+ R G+ N YM+HGGT+FG +G +L TTSYDYDAP+
Sbjct: 250 LSPTTFNKTMREILNAGSSVNQYMFHGGTSFGWMAGSNWLSKKQRGTSDTTSYDYDAPLS 309
Query: 312 EYGHLNQPKWGHLRELHK 329
E G L + KW RE+ K
Sbjct: 310 ESGDLTE-KWNVTREIIK 326
>gi|21243811|ref|NP_643393.1| beta-galactosidase [Xanthomonas axonopodis pv. citri str. 306]
gi|390989312|ref|ZP_10259611.1| beta-galactosidase [Xanthomonas axonopodis pv. punicae str. LMG
859]
gi|21109406|gb|AAM37929.1| beta-galactosidase [Xanthomonas axonopodis pv. citri str. 306]
gi|372556070|emb|CCF66586.1| beta-galactosidase [Xanthomonas axonopodis pv. punicae str. LMG
859]
Length = 613
Score = 185 bits (470), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 174/669 (26%), Positives = 266/669 (39%), Gaps = 153/669 (22%)
Query: 9 RAILLCLILQTLFNLSLAYRVSH---------DGRAITIDGERKILLSGSIHYPRSTPGM 59
R L L+L F L + + G DG+ LLSG+IH+ R
Sbjct: 3 RTTLAPLVLALAFALPITGTAAETERWPNFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAY 62
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W D ++KA+ GL+ +ETYVFWN EP + Q+DF+G+ D+ F++ QGL VILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGHNDVAAFVREAAAQGLNVILRPGP 122
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
Y CAEW GG+P WL I +R+ + F+ Q + + + + + L GGPII
Sbjct: 123 YACAEWEAGGYPAWLFGKGNIR-VRSRDPRFLAASQAYLDALAN--QVQPLLNHNGGPII 179
Query: 180 LAQIENEYGNVMSDYG---DAGKSYIN--------WCAKMATSLDIG-VPWIMCQESDAP 227
Q+ENEYG+ D+ D Y+ + + A L G +P + + AP
Sbjct: 180 AVQVENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAP 239
Query: 228 SPM------FTPNNPNSPKIWTENWTGWFKSWG----GKDPKRTAEDLAFAVARFFQFGG 277
P+ P++ E W GWF WG D ++ AE+ + + + G
Sbjct: 240 GEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGKPHAATDARQQAEEFEWILRQ-----G 294
Query: 278 TFQNYYMYHGGTNFGRTSGGPYL----------TTSYDYDAPIDEYGHLNQPKWGHLREL 327
N YM+ GGT+FG +G + TTSYDYDA +DE GH PK+ +R+
Sbjct: 295 HSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHPT-PKFALMRDA 353
Query: 328 HKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK 387
+ ++ + T LPD E + N +
Sbjct: 354 IARVTGVQPPALPAPIATT-------------------TLPDTPLRESASLWDNLPAPIA 394
Query: 388 VKRPN---QAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTN 444
+ P Q G D Y+ Y T
Sbjct: 395 IDTPQPMEQFGQDYG--------------------------------------YILYRTT 416
Query: 445 ADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGK 504
++G L + V YV+ V S + + E P G+
Sbjct: 417 ---------VTGPRKGPLYLGDVRDVARVYVDQRPVGSVERRLQQVSLDVEIPA----GQ 463
Query: 505 NQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAG--DETIIKDLSSHKW-TYKVGLY 561
+ + +L G NYG++ + GRAG D ++ + W + + +
Sbjct: 464 HTLDVLVENSGRINYGTR------------MADGRAGLVDPVVLDNRQLTGWQAFPLPM- 510
Query: 562 GLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWV 621
+ +S RGW+ K V + +++ T D L+++ GKGFAW
Sbjct: 511 ----------RTPDSIRGWTRKAV---QGPAFHRGTLRIGTPTD-TYLDMRAFGKGFAWA 556
Query: 622 NGYNLGRYW 630
NG NLGR+W
Sbjct: 557 NGVNLGRHW 565
>gi|387791561|ref|YP_006256626.1| beta-galactosidase [Solitalea canadensis DSM 3403]
gi|379654394|gb|AFD07450.1| beta-galactosidase [Solitalea canadensis DSM 3403]
Length = 619
Score = 185 bits (469), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 197/694 (28%), Positives = 286/694 (41%), Gaps = 159/694 (22%)
Query: 35 AITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDF- 93
A DG+ + SG +H+ R W +K K GL+++ TYVFWN HE +DF
Sbjct: 32 AFVYDGKPVQIHSGEMHFARVPQEYWRHRLKMMKAMGLNSVATYVFWNYHETAPGVWDFK 91
Query: 94 TGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNE 153
TGN ++ FIK ++GL VILR GPY CAEW YGG+P +L N+ G+ E+R N F+
Sbjct: 92 TGNKNISEFIKIAGEEGLMVILRPGPYACAEWEYGGYPWFLQNVEGL-EVRRNNPKFLAA 150
Query: 154 MQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGD----AGKSYINWCAKMA 209
+ + + K +++ ++GGPII+ Q ENE+G+ ++ D K+Y +
Sbjct: 151 CKEYIDHLAKEVKNQQI--TKGGPIIMVQAENEFGSYVAQRKDIPLAEHKAYSSAIKAQL 208
Query: 210 TSLDIGVP-------WIM---CQESDAPSPMFTPNNPN------------SPKIWTENWT 247
+ VP W+ E+ P+ N N P + E +
Sbjct: 209 LAAGFDVPLFTSDGSWLFEGGSIENCLPTANGEDNIENLKKVVDQYNGGKGPYMVAEFYP 268
Query: 248 GWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY-------- 299
GW W PK ED+ ++ Q +F NYYM HGGTNFG TSG Y
Sbjct: 269 GWLDHWAEPFPKVPTEDVVKQTEKYLQNNVSF-NYYMVHGGTNFGYTSGANYDKNHDIQP 327
Query: 300 LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNL 359
TSYDYDAPI E G PK+ +REL M+K ++Y + L
Sbjct: 328 DMTSYDYDAPISEAGWAT-PKYIAIREL------MKKHVSY----------KIPEVPQPL 370
Query: 360 PAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKG 419
P V +P+ K + T + +K Q + PL + E +N +G
Sbjct: 371 P---VIEIPEIKLTQ-------TAALLDLKNTIQPVVNDKPLTF----EELN------QG 410
Query: 420 HFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNY 479
H Y+ Y K + PI SG L +N YVNG
Sbjct: 411 H----------------GYVLY----SRKFNQPI-SGK----LELNGLRDYALVYVNGEK 445
Query: 480 VDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGR 539
V Y + + P T + + +G NYG+K GI PV++ G
Sbjct: 446 VAELNRYYKNYSCEIDVPFNAT-----LDIFVENMGRINYGAKITENNKGIISPVVINGT 500
Query: 540 AGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFE 599
++S + YK+ L ++ AK S+ P+ + T+ T
Sbjct: 501 --------EISGNWKMYKMPLEKQEEVASIKAKEVKSQ--------PVVLKGTFNLT--- 541
Query: 600 APLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYN 659
E L+++ GKG +VNGY+LGRYW N
Sbjct: 542 ---ETGDTFLDMEAWGKGIVFVNGYHLGRYW----------------------------N 570
Query: 660 CGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPS 693
G P Q Y +P W+K G N + + EF PS
Sbjct: 571 VG-PQQTLY-LPGCWLKKGANEITIV-EFNKVPS 601
>gi|423295092|ref|ZP_17273219.1| hypothetical protein HMPREF1070_01884 [Bacteroides ovatus
CL03T12C18]
gi|392673998|gb|EIY67449.1| hypothetical protein HMPREF1070_01884 [Bacteroides ovatus
CL03T12C18]
Length = 775
Score = 185 bits (469), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 189/714 (26%), Positives = 291/714 (40%), Gaps = 166/714 (23%)
Query: 6 HCSRAILLCLILQTLFNL--SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDL 63
H + ++L +I+ L + S +V TI+G+ L+ G +HYPR W D
Sbjct: 5 HKTVLVILNIIVSFLISSCSSPKEQVRIGNGTFTIEGKDIQLICGEMHYPRIPHEYWRDR 64
Query: 64 IKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCA 123
+K+A GL+ + YVFWN HE ++DF+G D+ FI+T Q++GLYVILR GPYVCA
Sbjct: 65 LKRASAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGPYVCA 124
Query: 124 EWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKK-EKLFASQGGPIILAQ 182
EW++GG+P WL + R+ + F++ + + I ++ K+ L + GG II+ Q
Sbjct: 125 EWDFGGYPSWLLKEKDMT-YRSKDPRFLSYCERY---IKELGKQLSPLTINNGGNIIMVQ 180
Query: 183 IENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQ----------ESDAPS---- 228
+ENEYG+ +D K Y+ M VP C E P+
Sbjct: 181 VENEYGSYAAD-----KEYLAAIRDMIKEAGFNVPLFTCDGGGQVEAGHVEGALPTLNGV 235
Query: 229 ------PMFTPNNPNSPKIWTENWTGWFKSWGGKDP----KRTAEDLAFAVARFFQFGGT 278
+ P E + WF WG + +R AE L + ++ G
Sbjct: 236 FGEDIFKVVDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLSH-----GV 290
Query: 279 FQNYYMYHGGTNF----GRTSGGPYLT--TSYDYDAPIDEYGHLNQPKWGHLRELHKLLK 332
+ YM+HGGTNF G +GG Y TSYDYDAP+ E+G+ PK+ H +
Sbjct: 291 SVSMYMFHGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWGNC-YPKY------HAFRE 343
Query: 333 SMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPN 392
++K L G ++LP+ + T T V++K
Sbjct: 344 VIQKYLPAG-----------------------TVLPEVPADNPTT----TFATVELK--- 373
Query: 393 QAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDP 452
+ APL+ + ++ N L + D Y+ Y T
Sbjct: 374 ----ESAPLRTAFHQTTQSE-----------NVLSMEDLGVDFG-YIHYQTT-------- 409
Query: 453 ILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSA 512
L + L I ++G V S +Y ++ + +++ + +L
Sbjct: 410 -LQKAGKQKLVIQDLRDYAVILIDGKQVASLDRRYNQNS----VTLNVSKTPATLEILVE 464
Query: 513 TVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAK 572
G NYG GI VL G+E K+ + + Y K
Sbjct: 465 NTGRVNYGPDILFNRKGITSQVLW----GNE-------------KLAGWSITPLPLYKEK 507
Query: 573 AANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPT 632
+ E G + K VP ++K TF + D V ++ GKG WVNG +LGR+W
Sbjct: 508 VSEMEFGETIKGVP-----AFHKGTFTVEKKGDCFV-DMSQWGKGAVWVNGKSLGRFW-- 559
Query: 633 YLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
N G P Q Y +P W+K+G N +V+FE
Sbjct: 560 --------------------------NIG-PQQTLY-LPAPWLKEGENEIVVFE 585
>gi|256393561|ref|YP_003115125.1| beta-galactosidase [Catenulispora acidiphila DSM 44928]
gi|256359787|gb|ACU73284.1| Beta-galactosidase [Catenulispora acidiphila DSM 44928]
Length = 584
Score = 185 bits (469), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 113/309 (36%), Positives = 161/309 (52%), Gaps = 36/309 (11%)
Query: 33 GRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYD 92
G ++DG+ ++SG +HY R P W D ++KA+ GL+ I+TY+ WN HE +D
Sbjct: 8 GDGFSLDGQPFRIVSGGLHYFRVHPAQWSDRLRKARLMGLNTIDTYIPWNLHERRPGTFD 67
Query: 93 FTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMN 152
F G LDL F+ +GL+V+LR GPY+C EW GG P WL P + LR+T+ F+
Sbjct: 68 FGGILDLAAFLDAAAAEGLHVLLRPGPYICGEWEGGGLPSWLLADPDL-ALRSTDPAFLQ 126
Query: 153 EMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSL 212
++ + I+ + ++GGP+I Q+ENEYG SD +Y+ + TS
Sbjct: 127 AVEAYLDAIMPIVLPR--LGTRGGPVIAVQVENEYGAYGSD-----TAYMERLYEALTSR 179
Query: 213 DIGVPWIMCQESD------APSPMFTPN---------------NPNSPKIWTENWTGWFK 251
I VP+ + + P + T N P P + E W GWF
Sbjct: 180 GIDVPFFTSDQPNDLADGALPGVLATANFGGKVTASLAALRAQQPTGPLMCAEFWNGWFD 239
Query: 252 SWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG----GPY--LTTSYD 305
WGG +R+AED A+ Q G + N+YM+HGGTNFG T+G G Y TSYD
Sbjct: 240 YWGGTHAQRSAEDAGAALEEMLQAGASV-NFYMFHGGTNFGFTNGANDKGTYRATVTSYD 298
Query: 306 YDAPIDEYG 314
YD+P+DE G
Sbjct: 299 YDSPLDEAG 307
Score = 44.7 bits (104), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 54/136 (39%), Gaps = 45/136 (33%)
Query: 579 GWSSKNVPLNRRMT--------------WYKTTFEAPLENDPVVLNLQGMGKGFAWVNGY 624
GW+S+ +PL+ +++ TF+ D L+L G KG AW+NG+
Sbjct: 472 GWTSRPLPLDDLTGLAYAELDGPAVGPGFHRGTFDLDRCAD-TYLHLPGWTKGVAWINGF 530
Query: 625 NLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVL 684
NLGRYW RGP GS +VP ++ G N LV+
Sbjct: 531 NLGRYW----------------SRGPQGS--------------LYVPGPVLRAGTNELVV 560
Query: 685 FEEFGGNPSQINFQTV 700
E G + + V
Sbjct: 561 LELHGARAAAAELRPV 576
>gi|399022099|ref|ZP_10724178.1| beta-galactosidase [Chryseobacterium sp. CF314]
gi|398085466|gb|EJL76124.1| beta-galactosidase [Chryseobacterium sp. CF314]
Length = 618
Score = 185 bits (469), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 123/354 (34%), Positives = 180/354 (50%), Gaps = 47/354 (13%)
Query: 16 ILQTLFNLSLAYRVS-----HDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
I+ + F+++L Y DG + + G+ + SG +HYPR W ++ K
Sbjct: 10 IILSFFSINLLYSQKGNFEIKDGHFL-LSGKPFTIYSGEMHYPRVPSEYWKHRLQMMKSM 68
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GL+ + TYVFWN HE +++F+G DL +FIKT Q+ GLYVI+R GPYVCAEW +GG+
Sbjct: 69 GLNTVTTYVFWNYHEEEPGKWNFSGEKDLKKFIKTAQEAGLYVIIRPGPYVCAEWEFGGY 128
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKK-EKLFASQGGPIILAQIENEYGN 189
P WL + E+RT NK F+ + +N+ I ++AK+ L + GGP+I+ Q ENE+G+
Sbjct: 129 PWWLQKDKNL-EIRTDNKAFLKQCENY---INELAKQIIPLQINNGGPVIMVQAENEFGS 184
Query: 190 VMSDYGDAG----KSYINWCAKMATSLDIGVPWI------MCQESDAPSPMFTP------ 233
++ D K Y + I VP+ + +E + T
Sbjct: 185 YVAQRKDISLEQHKKYSHKIKDFLVKSGITVPFFTSDGSWLFKEGSIEGALPTANGEGDV 244
Query: 234 ----------NNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYY 283
NN P + E + GW W K + ED+ + + G +F NYY
Sbjct: 245 DNLRKKINEFNNGKGPYMVAEYYPGWLDHWAEPFVKVSTEDVVKQTELYIKNGISF-NYY 303
Query: 284 MYHGGTNFGRTSGGPYLT--------TSYDYDAPIDEYGHLNQPKWGHLRELHK 329
M HGGTNFG TSG Y TSYDYDAPI+E G + PK+ LR++ +
Sbjct: 304 MIHGGTNFGFTSGANYDKNHDIQPDLTSYDYDAPINEAGWVT-PKFNALRDIFQ 356
>gi|327283884|ref|XP_003226670.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Anolis
carolinensis]
Length = 584
Score = 185 bits (469), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 119/312 (38%), Positives = 161/312 (51%), Gaps = 40/312 (12%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+L GS+HY R W D + K K GL+ + TYV WN HE +R ++DF+GNLDL FIK
Sbjct: 29 ILGGSLHYFRIPREYWKDRLMKMKACGLNTVTTYVPWNLHEAIRGKFDFSGNLDLQVFIK 88
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQN-FTTLIVD 163
++ GL+VILR GPY+C+EW+ GG P WL P + +LRTT + F + N F LI
Sbjct: 89 MAEEVGLWVILRPGPYICSEWDLGGLPSWLLQDPEM-QLRTTYRGFTEAVDNYFDRLIPQ 147
Query: 164 MAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATS------------ 211
+ + + GGPII Q+ENEYG+ D SY+ + TS
Sbjct: 148 VVPLQYKY---GGPIIAVQVENEYGSYAQD-----PSYMTYIKMALTSRKIVEMLMTSDN 199
Query: 212 --------LDIGVPWIMCQESDAPSPMF--TPNNPNSPKIWTENWTGWFKSWGGKDPKRT 261
+D + I Q+ D +F T PK+ E WTGWF SWGG
Sbjct: 200 HDGLVSGTVDGALATINFQKLDTAIMVFLSTDQRNKMPKMVMEYWTGWFDSWGGLHHVFD 259
Query: 262 AEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY------LTTSYDYDAPIDEYGH 315
A+D+ V + + G + N YM+HGGTNFG +G + TSYDYDA + E G
Sbjct: 260 ADDMVQTVGKVIKLGASI-NLYMFHGGTNFGFLNGAQHSNEYKSTITSYDYDAVLTESGD 318
Query: 316 LNQPKWGHLREL 327
K+ LR+L
Sbjct: 319 YTS-KFFKLRQL 329
>gi|58581392|ref|YP_200408.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae KACC 10331]
gi|58425986|gb|AAW75023.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae KACC 10331]
Length = 651
Score = 185 bits (469), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 174/664 (26%), Positives = 266/664 (40%), Gaps = 143/664 (21%)
Query: 9 RAILLCLILQTLFNLSLAYRVSH---------DGRAITIDGERKILLSGSIHYPRSTPGM 59
R L L+L F L + + G DG+ LLSG+IH+ R
Sbjct: 41 RTTLAPLVLALAFALPVTAAAADTERWPDFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAY 100
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W D ++KA+ GL+ +ETYVFWN EP + Q+DF+GN D+ F++ QGL VILR GP
Sbjct: 101 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVQEAAAQGLNVILRPGP 160
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
Y CAEW GG+P WL I +R+ + F+ Q + + + + L GGPII
Sbjct: 161 YACAEWEAGGYPAWLFGQGNIR-VRSRDPRFLAASQAYLDAVAK--QVQPLLNHNGGPII 217
Query: 180 LAQIENEYGNVMSDYG---DAGKSYIN--------WCAKMATSLDIG-VPWIMCQESDAP 227
Q+ENEYG+ D+ D Y+ + + A L G +P + + AP
Sbjct: 218 AVQVENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAP 277
Query: 228 SPM------FTPNNPNSPKIWTENWTGWFKSWG----GKDPKRTAEDLAFAVARFFQFGG 277
P+ P++ E W GWF WG D + AE+ + + + G
Sbjct: 278 GEAKSAFDKLIAFRPDQPRMVGEYWAGWFDHWGKPHAATDATQQAEEFEWILRQ-----G 332
Query: 278 TFQNYYMYHGGTNFGRTSGGPY----------LTTSYDYDAPIDEYGHLNQPKWGHLREL 327
N YM+ GGT+FG +G + TTSYDYDA +DE G K+ +R+
Sbjct: 333 HSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAIVDEAGRPTA-KFALMRDA 391
Query: 328 HKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK 387
+ ++ + T LPD E + N +
Sbjct: 392 IARVTGVQPPALPAPIATT-------------------TLPDTPLRESASLWDNLPAPIA 432
Query: 388 VKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADL 447
+ P P++ HF + Y+ Y T
Sbjct: 433 IDTPQ-------PME-----------------HFGQDY-----------GYILYRTT--- 454
Query: 448 KDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQI 507
++G L + V H Y++ V S + + + P G + +
Sbjct: 455 ------VTGPRKGPLYLGDVRDVAHVYLDQTPVGSVERRLQQVSTAVDIPA----GHHTL 504
Query: 508 SLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKW-TYKVGLYGLDDK 566
+L G NYG + G+ PVLL G++ + W + + +
Sbjct: 505 DVLVENSGRINYGPRMADGRAGLVDPVLL----GNQQVT------GWQAFPLPM------ 548
Query: 567 KFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNL 626
+A +S RGW+ K V + +++ T D L+++ GKGFAW NG NL
Sbjct: 549 -----RAPDSIRGWTRKAV---QGPAFHRGTVRIGTPAD-TYLDMRAFGKGFAWANGVNL 599
Query: 627 GRYW 630
GR+W
Sbjct: 600 GRHW 603
>gi|329962091|ref|ZP_08300102.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
gi|328530739|gb|EGF57597.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
Length = 632
Score = 184 bits (468), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 123/335 (36%), Positives = 171/335 (51%), Gaps = 50/335 (14%)
Query: 31 HDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQ 90
+DG+AI I +SG +HY R W +K K GL+A+ TYVFWN HEP +
Sbjct: 36 YDGKAIRI-------ISGEMHYARIPHQYWRHRMKMLKAMGLNAVATYVFWNLHEPEPGK 88
Query: 91 YDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVF 150
+DF+G+ +L +I+ ++GL VILR GPYVCAEW +GG+P WL N+ G+ ELR N+ F
Sbjct: 89 WDFSGDRNLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNVEGM-ELRRDNEQF 147
Query: 151 MNEMQNFTTLIVDMAKKE--KLFASQGGPIILAQIENEYGNVMSDYGD----AGKSYINW 204
+ +T L ++ KE KL +QGGPII+ Q ENE+G+ +S D ++Y
Sbjct: 148 L----KYTKLYLERLYKEVGKLQITQGGPIIMVQGENEFGSYVSQRKDITLEEHRAYNAK 203
Query: 205 CAKMATSLDIGVPWI------MCQESDAPSPMFTPNNPNS----------------PKIW 242
K + VP + + P + T N N+ P +
Sbjct: 204 IIKQLKEVGFDVPMFTSDGSWLFEGGYVPGALPTANGENNIENLKKVVNQYNGGQGPYMV 263
Query: 243 TENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT- 301
E + GW W P+ A +A ++ G +F NYYM HGGTNFG TSG Y
Sbjct: 264 AEFYPGWLAHWCEPHPQVKASTIARQTEKYLANGVSF-NYYMVHGGTNFGFTSGANYDKK 322
Query: 302 -------TSYDYDAPIDEYGHLNQPKWGHLRELHK 329
TSYDYDAPI E G + PK+ +R + K
Sbjct: 323 HDIQPDLTSYDYDAPISEAGWVT-PKFDSIRNVIK 356
Score = 40.8 bits (94), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 55/217 (25%), Positives = 89/217 (41%), Gaps = 48/217 (22%)
Query: 473 AYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPG 532
YV+G +V + +Y N + +++ N + +L +G NYGS+ GI
Sbjct: 439 VYVDGEFV-GRLNRY---NKKYSMDIEIPFNGN-LEILVENMGRINYGSEIVHNNKGIIS 493
Query: 533 PVLLVGRAGDETIIKDLSSHKWTY-KVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRM 591
PV + D+ I+ +W K+ + + + A S G SS N L +
Sbjct: 494 PVKI-----DDNFIEG----EWEMTKLPMSEVPAFEKMPANTVTSIMG-SSANA-LVGKP 542
Query: 592 TWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPY 651
+ YK TF D L+++ GKG +VNG N+GRYW
Sbjct: 543 SLYKGTFTLQETGD-TFLDMKDWGKGIVFVNGINIGRYWQV------------------- 582
Query: 652 GSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEF 688
P Q + VP W+K G+N +V+F++
Sbjct: 583 ----------GPQQTLF-VPGVWLKKGINEIVIFDQL 608
>gi|294665218|ref|ZP_06730516.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 10535]
gi|292605006|gb|EFF48359.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 10535]
Length = 613
Score = 184 bits (468), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 174/669 (26%), Positives = 265/669 (39%), Gaps = 153/669 (22%)
Query: 9 RAILLCLILQTLFNLSLAYRVSH---------DGRAITIDGERKILLSGSIHYPRSTPGM 59
R L L+L F L + + G DG+ LLSG+IH+ R
Sbjct: 3 RTTLAPLVLALAFALPITGAAADTERWPNFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAY 62
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W D ++KA+ GL+ +ETYVFWN EP + Q+DF+GN D+ F++ QGL +ILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVREAAAQGLNIILRPGP 122
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
Y CAEW GG+P WL I +R+ + F+ Q + + + + + L GGPII
Sbjct: 123 YACAEWEAGGYPAWLFGKGNIR-VRSRDPRFLAASQAYLDALAN--QVQPLLNHNGGPII 179
Query: 180 LAQIENEYGNVMSDYG---DAGKSYIN--------WCAKMATSLDIG-VPWIMCQESDAP 227
Q+ENEYG+ D+ D Y+ + + A L G +P + + AP
Sbjct: 180 AVQVENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAP 239
Query: 228 SPM------FTPNNPNSPKIWTENWTGWFKSWG----GKDPKRTAEDLAFAVARFFQFGG 277
P+ P++ E W GWF WG D ++ AE+ + + + G
Sbjct: 240 GEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGKPHAATDARQQAEEFEWILRQ-----G 294
Query: 278 TFQNYYMYHGGTNFGRTSGGPYL----------TTSYDYDAPIDEYGHLNQPKWGHLREL 327
+ YM+ GGT+FG +G + TTSYDYDA +DE GH PK+ +R+
Sbjct: 295 HSASLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHPT-PKFALMRDA 353
Query: 328 HKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK 387
+ ++ + T LPD E + N +
Sbjct: 354 IARVTGVQTPALPAPIATT-------------------TLPDTPLRESASLWDNLPAPIA 394
Query: 388 VKRPN---QAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTN 444
+ P Q G D Y+ Y T
Sbjct: 395 IDTPRPMEQFGQDYG--------------------------------------YILYRTT 416
Query: 445 ADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGK 504
++G L + V YV+ V S + + E P G+
Sbjct: 417 ---------VTGPRKGPLYLGDVRDVARVYVDQRPVGSVERRLQQVSLDVEIPA----GQ 463
Query: 505 NQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAG--DETIIKDLSSHKW-TYKVGLY 561
+ + +L G NY GP + GRAG D ++ + W + + +
Sbjct: 464 HTLDVLVENSGRINY------------GPRMADGRAGLVDPVVLDNQQLTGWQAFPLPM- 510
Query: 562 GLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWV 621
+ +S RGW+ K V + +++ T D L+++ GKGFAW
Sbjct: 511 ----------RTPDSIRGWTRKAV---QGPAFHRGTLRIGTPTD-TYLDMRAFGKGFAWA 556
Query: 622 NGYNLGRYW 630
NG NLGR+W
Sbjct: 557 NGVNLGRHW 565
>gi|160887166|ref|ZP_02068169.1| hypothetical protein BACOVA_05182 [Bacteroides ovatus ATCC 8483]
gi|156107577|gb|EDO09322.1| glycosyl hydrolase family 35 [Bacteroides ovatus ATCC 8483]
Length = 777
Score = 184 bits (468), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 189/717 (26%), Positives = 292/717 (40%), Gaps = 172/717 (23%)
Query: 6 HCSRAILLCLILQTLFNL--SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDL 63
H + ++L +I+ L + S +V TI+G+ L+ G +HYPR W D
Sbjct: 7 HKTVLVILNIIVSFLISSCSSPKEQVRIGNGTFTIEGKDIQLICGEMHYPRIPHEYWRDR 66
Query: 64 IKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCA 123
+K+A GL+ + YVFWN HE ++DF+G D+ FI+T Q++GLYVILR GPYVCA
Sbjct: 67 LKRASAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGPYVCA 126
Query: 124 EWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKK-EKLFASQGGPIILAQ 182
EW++GG+P WL + R+ + F++ + + I ++ K+ L + GG II+ Q
Sbjct: 127 EWDFGGYPSWLLKEKDMT-YRSKDPRFLSYCERY---IKELGKQLSPLTINNGGNIIMVQ 182
Query: 183 IENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQ----------ESDAPS---- 228
+ENEYG+ +D K Y+ M VP C E P+
Sbjct: 183 VENEYGSYAAD-----KEYLAAIRDMIKEAGFNVPLFTCDGGGQVEAGHVEGALPTLNGV 237
Query: 229 ------PMFTPNNPNSPKIWTENWTGWFKSWGGKDP----KRTAEDLAFAVARFFQFGGT 278
+ P E + WF WG + +R AE L + ++ G
Sbjct: 238 FGEDIFKVVDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLSH-----GV 292
Query: 279 FQNYYMYHGGTNF----GRTSGGPYLT--TSYDYDAPIDEYGHLNQPKWGHLRELHKLLK 332
+ YM+HGGTNF G +GG Y TSYDYDAP+ E+G+ PK+ H +
Sbjct: 293 SVSMYMFHGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWGNC-YPKY------HAFRE 345
Query: 333 SMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPN 392
++K L G ++LP+ + T T V++K
Sbjct: 346 VIQKYLPAG-----------------------TVLPEVPADNPTT----TFATVELK--- 375
Query: 393 QAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVS---DYLWYMTNADLKD 449
+ APL+ + ++ V+ S D+ Y+ Y T
Sbjct: 376 ----ESAPLRTAFHQTTQSENVL---------------SMEDLGVDFGYIHYQTT----- 411
Query: 450 DDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISL 509
L + L I ++G V S +Y ++ + +++ + +
Sbjct: 412 ----LQKAGKQKLVIQDLRDYAVILIDGKQVASLDRRYNQNS----VTLNVSKTPATLEI 463
Query: 510 LSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFY 569
L G NYG GI VL G+E K+ + + Y
Sbjct: 464 LVENTGRVNYGPDILFNRKGITSQVLW----GNE-------------KLAGWSITPLPLY 506
Query: 570 NAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRY 629
K + E G + K VP ++K TF + D V ++ GKG WVNG +LGR+
Sbjct: 507 KEKVSEMEFGETIKGVP-----AFHKGTFTVEKKGDCFV-DMSQWGKGAVWVNGKSLGRF 560
Query: 630 WPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
W N G P Q Y +P W+K+G N +V+FE
Sbjct: 561 W----------------------------NIG-PQQTLY-LPAPWLKEGENEIVVFE 587
>gi|84623327|ref|YP_450699.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae MAFF 311018]
gi|188577369|ref|YP_001914298.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae PXO99A]
gi|84367267|dbj|BAE68425.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae MAFF 311018]
gi|188521821|gb|ACD59766.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae PXO99A]
Length = 613
Score = 184 bits (468), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 174/666 (26%), Positives = 265/666 (39%), Gaps = 147/666 (22%)
Query: 9 RAILLCLILQTLFNLSLAYRVSH---------DGRAITIDGERKILLSGSIHYPRSTPGM 59
R L L+L F L + + G DG+ LLSG+IH+ R
Sbjct: 3 RTTLAPLVLALAFALPVTAAAADTERWPDFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAY 62
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W D ++KA+ GL+ +ETYVFWN EP + Q+DF+GN D+ F++ QGL VILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVQEAAAQGLNVILRPGP 122
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
Y CAEW GG+P WL I +R+ + F+ Q + + + + L GGPII
Sbjct: 123 YACAEWEAGGYPAWLFGQGNIR-VRSRDPRFLAASQAYLDAVAK--QVQPLLNHNGGPII 179
Query: 180 LAQIENEYGNVMSDYG---DAGKSYIN--------WCAKMATSLDIG-VPWIMCQESDAP 227
Q+ENEYG+ D+ D Y+ + + A L G +P + + AP
Sbjct: 180 AVQVENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAP 239
Query: 228 SPM------FTPNNPNSPKIWTENWTGWFKSWG----GKDPKRTAEDLAFAVARFFQFGG 277
P+ P++ E W GWF WG D + AE+ + + + G
Sbjct: 240 GEAKSAFDKLIAFRPDQPRMVGEYWAGWFDHWGKPHAATDATQQAEEFEWILRQ-----G 294
Query: 278 TFQNYYMYHGGTNFGRTSGGPY----------LTTSYDYDAPIDEYGHLNQPKWGHLREL 327
N YM+ GGT+FG +G + TTSYDYDA +DE G K+ +R+
Sbjct: 295 HSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAIVDEAGRPTA-KFALMRDA 353
Query: 328 HKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK 387
+ ++ + T LPD E + N +
Sbjct: 354 IARVTGVQPPALPAPIATT-------------------TLPDTPLRESASLWDNLPAPIA 394
Query: 388 VKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADL 447
+ P P++ HF + Y+ Y T
Sbjct: 395 IDTPQ-------PME-----------------HFGQD-----------YGYILYRTT--- 416
Query: 448 KDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQI 507
++G L + V H Y++ V S + + + P G + +
Sbjct: 417 ------VTGPRKGPLYLGDVRDVAHVYLDQTPVGSVERRLQQVSTAVDIPA----GHHTL 466
Query: 508 SLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAG--DETIIKDLSSHKW-TYKVGLYGLD 564
+L G NY GP + GRAG D ++ + W + + +
Sbjct: 467 DVLVENSGRINY------------GPRMADGRAGLVDPVLLGNQQVTGWQAFPLPM---- 510
Query: 565 DKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGY 624
+A +S RGW+ K V + +++ T D L+++ GKGFAW NG
Sbjct: 511 -------RAPDSIRGWTRKAV---QGPAFHRGTVRIGTPAD-TYLDMRAFGKGFAWANGV 559
Query: 625 NLGRYW 630
NLGR+W
Sbjct: 560 NLGRHW 565
>gi|212723424|ref|NP_001132807.1| uncharacterized protein LOC100194296 [Zea mays]
gi|194695440|gb|ACF81804.1| unknown [Zea mays]
Length = 467
Score = 184 bits (468), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 144/482 (29%), Positives = 217/482 (45%), Gaps = 69/482 (14%)
Query: 344 TNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQW 403
T D + G Y +P S+S+L DC+T F T VN Q N + DQ
Sbjct: 16 TKDDATMTFRGRPYFVPRHSISVLADCETVVFGTQHVNAQHNQRTFH----FADQTAQNN 71
Query: 404 KWRPEMINDFVV--RGKGHFALNTLIDQKS-TNDVSDYLWYMTNADLKDDDPILSGSSNM 460
W EM + V + L D + T D +DY+WY ++ L+ DD +
Sbjct: 72 VW--EMFDGENVPKYKQAKIRLRKAGDLYNLTKDKTDYVWYTSSFKLEADDMPIRSDIKT 129
Query: 461 TLRINSSGQVLHAYVNGNYVD-SQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNY 519
L +NS G A+VN +V TK + L E+P+ L +G N +++L++++G+ +
Sbjct: 130 VLEVNSHGHASVAFVNNKFVGCGHGTKMNKAFTL-EKPMDLKKGVNHVAVLASSMGMTDS 188
Query: 520 GSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERG 579
G+ + G+ + AG DL+++ W + VGL G + K+ Y K S
Sbjct: 189 GAYMEHRLAGVDRVQITGLNAG----TLDLTNNGWGHIVGLVG-ERKQIYTDKGMGSVTW 243
Query: 580 WSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDG 639
+ N +R +TWYK F+ P DPVVL++ MGKG +VNG +GRYW +Y
Sbjct: 244 KPAMN---DRPLTWYKRHFDMPSGEDPVVLDMSTMGKGMMFVNGQGIGRYWISY------ 294
Query: 640 CSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
+ G PSQ YHVPRS+++ N LVLFEE G P I T
Sbjct: 295 -----------------KHALGRPSQQLYHVPRSFLRQKDNMLVLFEEEFGRPDAIMILT 337
Query: 700 VVVGTAC------GQAH----ENKTMELTCHG----------------RRISEIKYASFG 733
V C AH E K ++T + I ++ +AS+G
Sbjct: 338 VKRDNICTFISERNPAHIMSWERKDSQITAKANADDLRARAALACPPKKLIQQVVFASYG 397
Query: 734 DPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEA 793
+P G CG + GSC ++EK C+GK+ C++ + G + +GT L V+A
Sbjct: 398 NPAGICGNYTVGSCHTP-RAKEVVEKACLGKRVCTLPVAADVYGGDANCSGTTATLAVQA 456
Query: 794 LC 795
C
Sbjct: 457 KC 458
>gi|333377694|ref|ZP_08469427.1| hypothetical protein HMPREF9456_01022 [Dysgonomonas mossii DSM
22836]
gi|332883714|gb|EGK03994.1| hypothetical protein HMPREF9456_01022 [Dysgonomonas mossii DSM
22836]
Length = 630
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 128/364 (35%), Positives = 184/364 (50%), Gaps = 49/364 (13%)
Query: 7 CSRAILLCLILQTLFNLSLAYRVSHDGRA----ITIDGERKILLSGSIHYPRSTPGMWPD 62
C + IL +F++S + H DG+ ++SG +HYPR W
Sbjct: 2 CKKICSTFFILLFVFSISSFSQKKHTFEIKNGDFVYDGKPVRIISGEMHYPRIPHQYWRH 61
Query: 63 LIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVC 122
++ K GL+A+ TYVFWN HEP ++DFTG+ +L +IK ++GL VILR GPYVC
Sbjct: 62 RMQMLKAMGLNAVATYVFWNIHEPEPGKWDFTGDKNLAEYIKIAGEEGLMVILRPGPYVC 121
Query: 123 AEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKE--KLFASQGGPIIL 180
AEW +GG+P WL N+ G+ ELR N+ F+ +T L ++ KE L ++GGPI++
Sbjct: 122 AEWEFGGYPWWLQNVEGL-ELRRDNEQFL----KYTQLYINRLYKEVGNLQITKGGPIVM 176
Query: 181 AQIENEYGNVMSDYGDAG-KSYINWCAKMATSL-DIG--VP-------WIMCQESDAPSP 229
Q ENE+G+ +S D + + + AK+ L D G VP W+ + P
Sbjct: 177 VQAENEFGSYVSQRKDIPLEEHRRYNAKIVQQLKDAGFDVPSFTSDGSWLF-EGGAVPGA 235
Query: 230 MFTPNNPNS----------------PKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFF 273
+ T N ++ P + E + GW W P+ +A +A ++
Sbjct: 236 LPTANGESNIENLKKAVDKYNGGQGPYMVAEFYPGWLAHWLEPHPQISATSIARQTEKYL 295
Query: 274 QFGGTFQNYYMYHGGTNFGRTSGGPYLT--------TSYDYDAPIDEYGHLNQPKWGHLR 325
Q + NYYM HGGTNFG TSG Y TSYDYDAPI E G + PK+ LR
Sbjct: 296 QNNVSI-NYYMVHGGTNFGFTSGANYDKKHDIQPDLTSYDYDAPISEAGWVT-PKYDSLR 353
Query: 326 ELHK 329
+ K
Sbjct: 354 NVIK 357
>gi|418518035|ref|ZP_13084189.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB1386]
gi|410705285|gb|EKQ63761.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB1386]
Length = 613
Score = 184 bits (467), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 174/669 (26%), Positives = 265/669 (39%), Gaps = 153/669 (22%)
Query: 9 RAILLCLILQTLFNLSLAYRVSH---------DGRAITIDGERKILLSGSIHYPRSTPGM 59
R L L+L F L + + G DG+ LLSG+IH+ R
Sbjct: 3 RTTLAPLVLALAFALPITGTAAETERWPNFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAY 62
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W D ++KA+ GL+ +ETYVFWN EP + Q+DF+G+ D+ F++ QGL VILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGHNDVAAFVREAAAQGLNVILRPGP 122
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
Y CAEW GG+P WL I +R+ + F+ Q + + + + + L GGPII
Sbjct: 123 YACAEWEAGGYPAWLFGKGNIR-VRSRDPRFLAASQAYLDALAN--QVQPLLNHNGGPII 179
Query: 180 LAQIENEYGNVMSDYG---DAGKSYIN--------WCAKMATSLDIG-VPWIMCQESDAP 227
Q+ENEYG+ D+ D Y+ + + A L G +P + + AP
Sbjct: 180 AVQVENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAP 239
Query: 228 SPM------FTPNNPNSPKIWTENWTGWFKSWG----GKDPKRTAEDLAFAVARFFQFGG 277
P+ P++ E W GWF WG D ++ AE+ + + + G
Sbjct: 240 GEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGKPHAATDARQQAEEFEWILRQ-----G 294
Query: 278 TFQNYYMYHGGTNFGRTSGGPYL----------TTSYDYDAPIDEYGHLNQPKWGHLREL 327
N YM+ GGT+FG +G + TTSYDYDA +DE GH PK+ +R+
Sbjct: 295 HSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHPT-PKFALMRDA 353
Query: 328 HKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK 387
+ ++ + T LPD E + N +
Sbjct: 354 IARVTGVQPPALPAPIATT-------------------TLPDTPLRESASLWDNLPAPIA 394
Query: 388 VKRPN---QAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTN 444
+ P Q G D Y+ Y T
Sbjct: 395 IDTPQPMEQFGQDYG--------------------------------------YILYRTT 416
Query: 445 ADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGK 504
++G L + V YV+ V S + + E P G+
Sbjct: 417 ---------VTGPRKGPLYLGDVRDVARVYVDQRPVGSVERRLQQVSLDVEIPA----GQ 463
Query: 505 NQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAG--DETIIKDLSSHKW-TYKVGLY 561
+ + +L G NYG++ + GRAG D ++ + W + + +
Sbjct: 464 HTLDVLVENSGRINYGTR------------MADGRAGLVDPVVLDNRQLTGWQAFPLPM- 510
Query: 562 GLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWV 621
+ +S RGW+ K V + +++ T D L+++ GKGFAW
Sbjct: 511 ----------RTPDSIRGWTRKAV---QGPAFHRGTLRIGTPTD-TYLDMRAFGKGFAWA 556
Query: 622 NGYNLGRYW 630
NG NLGR W
Sbjct: 557 NGVNLGRQW 565
>gi|365876141|ref|ZP_09415664.1| beta-galactosidase [Elizabethkingia anophelis Ag1]
gi|442588464|ref|ZP_21007275.1| putative exported beta-galactosidase [Elizabethkingia anophelis
R26]
gi|365756153|gb|EHM98069.1| beta-galactosidase [Elizabethkingia anophelis Ag1]
gi|442561698|gb|ELR78922.1| putative exported beta-galactosidase [Elizabethkingia anophelis
R26]
Length = 628
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 120/370 (32%), Positives = 182/370 (49%), Gaps = 60/370 (16%)
Query: 4 LKHCSRAILLCLILQ---TLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMW 60
+K+ R ++L ++ +F+ S ++G+ + SG +HYPR W
Sbjct: 1 MKNLQRLLVLFILFACNVLIFSQSRKSTFEIKNGHFLLNGKLFSIHSGEMHYPRIPQEYW 60
Query: 61 PDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPY 120
++ K GL+A+ TYVFWN HE +++++G DL +FIKT Q+ GLYVI+R GPY
Sbjct: 61 KHRLQMMKAMGLNAVTTYVFWNYHEENPGKWNWSGEKDLKKFIKTAQEVGLYVIIRPGPY 120
Query: 121 VCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIIL 180
VCAEW +GG+P WL N+ G+ ++R N +F+ E Q + T + + K ++ + GGP+I+
Sbjct: 121 VCAEWEFGGYPWWLQNIKGL-KIREDNNLFLAETQKYITQLYNQVKDLQI--TNGGPVIM 177
Query: 181 AQIENEYGNVMSDYGDAG-KSYINWCAKMATSLDIGVPWIMCQESDAPSPMFTP------ 233
Q ENE+G+ ++ D S+ + AK+ L +++ PMFT
Sbjct: 178 VQAENEFGSFVAQRKDIPLASHRTYNAKIVKQL---------KDAGFSVPMFTSDGSWLF 228
Query: 234 ----------------------------NNPNSPKIWTENWTGWFKSWGGKDPKRTAEDL 265
NN P + E + GW W K P+ A +
Sbjct: 229 EGGSVVGALPTANGEDNIENLKKIVNQYNNNQGPYMVAEFYPGWLAHWAEKFPRVDAGTV 288
Query: 266 AFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT--------TSYDYDAPIDEYGHLN 317
A ++ + +F NYYM HGGTNFG T+G Y TSYDYDAPI E G
Sbjct: 289 ARQTDKYLKNDVSF-NYYMVHGGTNFGFTNGANYDKNHDIQPDLTSYDYDAPITEAG-WR 346
Query: 318 QPKWGHLREL 327
PK+ LR +
Sbjct: 347 TPKYDSLRAV 356
>gi|374606374|ref|ZP_09679251.1| beta-galactosidase [Paenibacillus dendritiformis C454]
gi|374388019|gb|EHQ59464.1| beta-galactosidase [Paenibacillus dendritiformis C454]
Length = 583
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 124/354 (35%), Positives = 175/354 (49%), Gaps = 46/354 (12%)
Query: 29 VSHDGRAITIDGERKI-LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPL 87
+S+D T+ G+R I L+SG+IHY R P W D ++K K G + IETYV WN HEP
Sbjct: 4 LSYDQGQFTM-GDRPIQLISGAIHYFRVVPAYWEDRLRKIKAMGCNCIETYVAWNLHEPR 62
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
++ F G D+ F++ + GLYVI+R PY+CAEW +GG P WL + LR +
Sbjct: 63 EGEFHFEGMSDVAEFVRLAGELGLYVIVRPSPYICAEWEFGGLPAWL--LKDDMRLRCND 120
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
F+ ++ + + + + L A++GGPII QIENEYG+ +D ++Y+ A+
Sbjct: 121 PRFLEKVAAYYDAL--LPQLTPLLATKGGPIIAVQIENEYGSYGND-----QAYLQ--AQ 171
Query: 208 MATSLDIGVPWI----------MCQESDAPSPMFTPN---------------NPNSPKIW 242
A ++ GV + M Q A + T N P+ P +
Sbjct: 172 RAMLIERGVDVLLFTSDGPQDDMLQGGMAEGVLATVNFGSRPKEAFDKLKEYQPDGPLMC 231
Query: 243 TENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY--- 299
E W GWF W + R AED A + G + N+YM HGGTNFG SG +
Sbjct: 232 MEYWNGWFDHWFEQHHTRDAEDAARVLDDMLGMGASV-NFYMVHGGTNFGFGSGANHSDK 290
Query: 300 ---LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN 350
TSYDYDA I E G L PK+ RE+ S+ + N DYG+
Sbjct: 291 YEPTVTSYDYDAAISEAGDLT-PKYHAFREVIGKYVSLPEGDLPANTPKADYGS 343
>gi|62321782|dbj|BAD95407.1| galactosidase [Arabidopsis thaliana]
Length = 270
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 105/271 (38%), Positives = 146/271 (53%), Gaps = 29/271 (10%)
Query: 547 KDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN-VPLNRRMTWYKTTFEAPLEND 605
+DLS KWTYKVGL G + +++ E W+ V + +TWYKTTF AP +
Sbjct: 6 RDLSWQKWTYKVGLKGESLSLHSLSGSSSVE--WAEGAFVAQKQPLTWYKTTFSAPAGDS 63
Query: 606 PVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQ 665
P+ +++ MGKG W+NG +LGR+WP Y A S C Y G + DKC NCG SQ
Sbjct: 64 PLAVDMGSMGKGQIWINGQSLGRHWPAYKAVG---SCSECSYTGTFREDKCLRNCGEASQ 120
Query: 666 IWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHE-------------- 711
WYHVPRSW+K N LV+FEE+GG+P+ I V + C +E
Sbjct: 121 RWYHVPRSWLKPSGNLLVVFEEWGGDPNGITLVRREVDSVCADIYEWQSTLVNYQLHASG 180
Query: 712 --NKTMELTCH-----GRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGK 764
NK + H G++I+ +K+ASFG P+G CG++++GSC A K CVG+
Sbjct: 181 KVNKPLHPKAHLQCGPGQKITTVKFASFGTPEGTCGSYRQGSCHAH-HSYDAFNKLCVGQ 239
Query: 765 KSCSIEASEANLGATSCAAGTVKRLVVEALC 795
CS+ + G C +K+L VEA+C
Sbjct: 240 NWCSVTVAPEMFGGDPC-PNVMKKLAVEAVC 269
>gi|381169756|ref|ZP_09878919.1| beta-galactosidase [Xanthomonas citri pv. mangiferaeindicae LMG
941]
gi|380689774|emb|CCG35406.1| beta-galactosidase [Xanthomonas citri pv. mangiferaeindicae LMG
941]
Length = 613
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 174/669 (26%), Positives = 265/669 (39%), Gaps = 153/669 (22%)
Query: 9 RAILLCLILQTLFNLSLAYRVSH---------DGRAITIDGERKILLSGSIHYPRSTPGM 59
R L L+L F L + + G DG+ LLSG+IH+ R
Sbjct: 3 RTPLAPLVLALAFALPITGTAAETERWPNFGTQGTQFARDGKPYQLLSGAIHFQRIPRAY 62
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W D ++KA+ GL+ +ETYVFWN EP + Q+DF+G+ D+ F++ QGL VILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGHNDVAAFVREAAAQGLNVILRPGP 122
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
Y CAEW GG+P WL I +R+ + F+ Q + + + + + L GGPII
Sbjct: 123 YACAEWEAGGYPAWLFGKGNIR-VRSRDPRFLAASQAYLDALAN--QVQPLLNHNGGPII 179
Query: 180 LAQIENEYGNVMSDYG---DAGKSYIN--------WCAKMATSLDIG-VPWIMCQESDAP 227
Q+ENEYG+ D+ D Y+ + + A L G +P + + AP
Sbjct: 180 AVQVENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAP 239
Query: 228 SPM------FTPNNPNSPKIWTENWTGWFKSWG----GKDPKRTAEDLAFAVARFFQFGG 277
P+ P++ E W GWF WG D ++ AE+ + + + G
Sbjct: 240 GEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGKPHAATDARQQAEEFEWILRQ-----G 294
Query: 278 TFQNYYMYHGGTNFGRTSGGPYL----------TTSYDYDAPIDEYGHLNQPKWGHLREL 327
N YM+ GGT+FG +G + TTSYDYDA +DE GH PK+ +R+
Sbjct: 295 HSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHPT-PKFALMRDA 353
Query: 328 HKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK 387
+ ++ + T LPD E + N +
Sbjct: 354 IARVTGVQPPALPAPIATT-------------------TLPDTPLRESASLWDNLPAPIA 394
Query: 388 VKRPN---QAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTN 444
+ P Q G D Y+ Y T
Sbjct: 395 IDTPQPMEQFGQDYG--------------------------------------YILYRTT 416
Query: 445 ADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGK 504
++G L + V YV+ V S + + E P G+
Sbjct: 417 ---------VTGPRKGPLYLGDVRDVARVYVDQRPVGSVERRLQQVSLDVEIPA----GQ 463
Query: 505 NQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAG--DETIIKDLSSHKW-TYKVGLY 561
+ + +L G NYG++ + GRAG D ++ + W + + +
Sbjct: 464 HTLDVLVENSGRINYGTR------------MADGRAGLVDPVVLDNRQLTGWQAFPLPM- 510
Query: 562 GLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWV 621
+ +S RGW+ K V + +++ T D L+++ GKGFAW
Sbjct: 511 ----------RTPDSIRGWTRKAV---QGPAFHRGTLRIGTPTD-TYLDMRAFGKGFAWA 556
Query: 622 NGYNLGRYW 630
NG NLGR W
Sbjct: 557 NGVNLGRQW 565
>gi|329927236|ref|ZP_08281534.1| beta-galactosidase [Paenibacillus sp. HGF5]
gi|328938636|gb|EGG35019.1| beta-galactosidase [Paenibacillus sp. HGF5]
Length = 587
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 120/359 (33%), Positives = 172/359 (47%), Gaps = 54/359 (15%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+LSG+IHY R P W D + K + GL+ +ETY+ WN HEP Q+ F G DL RF++
Sbjct: 21 ILSGAIHYFRVVPEYWEDRLMKLRSCGLNTVETYIPWNLHEPKEGQFVFDGIADLERFVR 80
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
D GL+VILR PY+CAEW +GG P WL P I+ LR + V++ ++ + ++
Sbjct: 81 IAGDLGLHVILRPSPYICAEWEFGGLPSWLLQNPDIQ-LRCMDPVYLEKVDQYYDELI-- 137
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWI----- 219
+ L S+GGP+I QIENEYG+ +D +Y+ + + V
Sbjct: 138 PRLVPLLTSKGGPVIAMQIENEYGSYGND-----TAYLEYLKDGLIKRGVDVLLFTSDGP 192
Query: 220 ---MCQESDAPSPMFTPN---------------NPNSPKIWTENWTGWFKSWGGKDPKRT 261
M Q P + T N P P + E W GWF W R
Sbjct: 193 TDGMLQGGAVPGVLATVNFGSRTKEAFDKLREYRPEDPLMCMEYWNGWFDHWLKPHHTRD 252
Query: 262 AEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY------LTTSYDYDAPIDEYGH 315
AED A + N+YM+HGGTNFG +G + TSYDYDAP+ E G
Sbjct: 253 AEDAAAVFKEMLDLNASV-NFYMFHGGTNFGFYNGANFHEKYEPTLTSYDYDAPLSECGD 311
Query: 316 LN-----------QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWS 363
+ Q + L +L L + ++K ++YG+V+ T Y + + +LPA S
Sbjct: 312 VTAKFEAIRSAIAQHQGKELSDLPSLPQPVKK-ISYGSVSMTHYADLLE----HLPALS 365
Score = 40.8 bits (94), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 590 RMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
R T+Y+ F D + L G GKG WVNG+NLGRYW
Sbjct: 504 RPTFYRGEFLVDDIGD-TFIRLDGWGKGVVWVNGFNLGRYW 543
>gi|445497922|ref|ZP_21464777.1| beta-galactosidase BgaC [Janthinobacterium sp. HH01]
gi|444787917|gb|ELX09465.1| beta-galactosidase BgaC [Janthinobacterium sp. HH01]
Length = 624
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 116/326 (35%), Positives = 169/326 (51%), Gaps = 36/326 (11%)
Query: 32 DGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQY 91
DG +DG+ ++ SG +HYPR W + ++ A+ GL+ + TY FW+ HEP Q+
Sbjct: 36 DGAHFKLDGQPFVIRSGEMHYPRIPRAAWRERLRMARAMGLNTVTTYAFWSQHEPEPGQW 95
Query: 92 DFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFM 151
F+G DL FIKT ++GL V+LR GPYVCAE ++GGFP WL G+ +
Sbjct: 96 SFSGQNDLRTFIKTAAEEGLNVVLRPGPYVCAEVDFGGFPAWLMRTQGLRVRSMDARYLA 155
Query: 152 NEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS--DYGDAGKSYINWC---A 206
+ F L ++A L +S+GGPI++ Q+ENEYG+ DY A ++ + A
Sbjct: 156 ASARYFKRLAQEVA---DLQSSRGGPILMLQLENEYGSYGRDHDYLRAVRTQMRQAGFDA 212
Query: 207 KMATSLDIG------------VPWIM-----CQESDAPSPMFTPNNPNSPKIWTENWTGW 249
+ TS D G VP ++ ++ A P+ P++ E W GW
Sbjct: 213 PLFTS-DGGAGRLFEGGTLADVPAVVNFGGGADDAQASVQELAAWRPHGPRMAGEYWAGW 271
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL--------T 301
F WG + ++ E+ A V R G +F N YM+HGGT+FG +G Y T
Sbjct: 272 FDHWGEQHHTQSPEEAARTVERMLSQGVSF-NLYMFHGGTSFGWLAGANYSGSEPYQPDT 330
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLREL 327
TSYDYDA +DE G PK+ LR++
Sbjct: 331 TSYDYDAALDEAGR-PTPKYFALRDV 355
>gi|261407762|ref|YP_003244003.1| beta-galactosidase [Paenibacillus sp. Y412MC10]
gi|261284225|gb|ACX66196.1| Beta-galactosidase [Paenibacillus sp. Y412MC10]
Length = 587
Score = 183 bits (465), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 120/359 (33%), Positives = 172/359 (47%), Gaps = 54/359 (15%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+LSG+IHY R P W D + K + GL+ +ETY+ WN HEP Q+ F G DL RF++
Sbjct: 21 ILSGAIHYFRVVPEYWEDRLMKLRSCGLNTVETYIPWNLHEPKEGQFVFDGIADLERFVR 80
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
D GL+VILR PY+CAEW +GG P WL P I+ LR + V++ ++ + ++
Sbjct: 81 IAGDLGLHVILRPSPYICAEWEFGGLPSWLLQNPDIQ-LRCMDPVYLEKVDQYYDELI-- 137
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWI----- 219
+ L S+GGP+I QIENEYG+ +D +Y+ + + V
Sbjct: 138 PRLVPLLTSKGGPVIAMQIENEYGSYGND-----TAYLEYLKDGLIKRGVDVLLFTSDGP 192
Query: 220 ---MCQESDAPSPMFTPN---------------NPNSPKIWTENWTGWFKSWGGKDPKRT 261
M Q P + T N P P + E W GWF W R
Sbjct: 193 TDGMLQGGAVPGVLATVNFGSRTKEAFDKLREYRPEDPLMCMEYWNGWFDHWLKPHHTRD 252
Query: 262 AEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY------LTTSYDYDAPIDEYGH 315
AED A + N+YM+HGGTNFG +G + TSYDYDAP+ E G
Sbjct: 253 AEDAAAVFKEMLDLNASV-NFYMFHGGTNFGFYNGANFHEKYEPTLTSYDYDAPLSECGD 311
Query: 316 LN-----------QPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWS 363
+ Q + L +L L + ++K ++YG+V+ T Y + + +LPA S
Sbjct: 312 VTAKFEAIRSAIAQHQGKELSDLPSLPQPVKK-ISYGSVSMTHYADLLE----HLPALS 365
Score = 40.8 bits (94), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 590 RMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
R T+Y+ F D + L G GKG WVNG+NLGRYW
Sbjct: 504 RPTFYRGEFYVDDIGD-TFIRLDGWGKGVVWVNGFNLGRYW 543
>gi|402813167|ref|ZP_10862762.1| beta-galactosidase Bga [Paenibacillus alvei DSM 29]
gi|402509110|gb|EJW19630.1| beta-galactosidase Bga [Paenibacillus alvei DSM 29]
Length = 580
Score = 183 bits (464), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 120/376 (31%), Positives = 181/376 (48%), Gaps = 47/376 (12%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
+S++ + ++G+ L+SG++HY R P W D ++K K G + +ETY+ WN HEP
Sbjct: 4 LSYEDQHFMLEGKPIQLISGAVHYFRIVPEYWEDRLRKVKAMGCNCVETYIAWNVHEPRD 63
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Q++F G D++ FI+ Q L VI+R PY+CAEW +GG P WL + LR ++
Sbjct: 64 GQFNFDGIADVVEFIRIAQRVDLLVIVRPSPYICAEWEFGGMPAWL--LKEDIRLRCSDP 121
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F+ ++ + ++ K L ++ GGPII QIENEYG+ +D ++Y+ M
Sbjct: 122 RFLEKVSAYYDALIPQLK--PLLSTSGGPIIAVQIENEYGSYGND-----QAYLQALRNM 174
Query: 209 ATSLDIGVPWIMCQESDAPSP-----------MFTPN---------------NPNSPKIW 242
I V + SD P+ + T N PN+P +
Sbjct: 175 LVERGIDV---LLFTSDGPADDMLQGGMTEGVLATVNFGSRPKEAFGKLEEYQPNAPLMC 231
Query: 243 TENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY--- 299
E W GWF W + R+AED A + G + N+YM HGGTNFG +SG +
Sbjct: 232 MEYWNGWFDHWFEEHHTRSAEDAAQVLDEMLSMGASV-NFYMLHGGTNFGFSSGANHGGR 290
Query: 300 ---LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSS 356
TSYDYD+ I E G + PK+ R++ S+ + N YG S
Sbjct: 291 YKPTVTSYDYDSAISEAGDIT-PKYQLFRKVIGKYVSLSEDDMPQNTPKAAYGEVKVNRS 349
Query: 357 YNLPAWSVSILPDCKT 372
L ++S + D KT
Sbjct: 350 VKLFD-TLSSMTDVKT 364
>gi|334338180|ref|YP_004543332.1| glycoside hydrolase family protein [Isoptericola variabilis 225]
gi|334108548|gb|AEG45438.1| glycoside hydrolase family 35 [Isoptericola variabilis 225]
Length = 603
Score = 183 bits (464), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 110/310 (35%), Positives = 155/310 (50%), Gaps = 41/310 (13%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG ++SG++HY R P W D I+KA+ GL+ +ETYV WN H P R +D +G
Sbjct: 13 LDGRSLQIVSGALHYFRVHPDQWADRIRKARLLGLNTVETYVAWNVHSPERGVFDTSGRR 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+ + +GL+ I+R GPY+CAEW GG P WL P + R + + +
Sbjct: 73 DLARFLDLVAAEGLHAIVRPGPYICAEWTGGGLPAWLFADPEVGVRRAEPRFLEAIGEYY 132
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
L+ +A+++ ++GGP+++ Q+ENEYG D + Y+ A M + I VP
Sbjct: 133 AALLPIVAERQ---VTRGGPVLMVQVENEYGAYGDDPPVERERYLRALADMIRAQGIDVP 189
Query: 218 WIMCQESD--------APSPMFTPN---------------NPNSPKIWTENWTGWFKSWG 254
+++ P + T N P P + E W GWF S G
Sbjct: 190 LFTSDQANDHHLSRGSLPELLTTANFGSRATERLAILRKHQPTGPLMCMEFWDGWFDSAG 249
Query: 255 ----GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG----GPY--LTTSY 304
P+ A DL +A G N YM HGGTNFG TSG G Y +TTSY
Sbjct: 250 LHHHTTPPEANARDLDDLLA-----AGASVNLYMLHGGTNFGLTSGANDKGVYRPITTSY 304
Query: 305 DYDAPIDEYG 314
DYDAP+ E+G
Sbjct: 305 DYDAPLSEHG 314
>gi|380694789|ref|ZP_09859648.1| beta-galactosidase [Bacteroides faecis MAJ27]
Length = 781
Score = 183 bits (464), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 117/325 (36%), Positives = 162/325 (49%), Gaps = 39/325 (12%)
Query: 34 RAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDF 93
+ ++GE ++ + IHYPR W IK +K G++ I YVFWN HEP +YDF
Sbjct: 33 KTFLLNGEPFVVKAAEIHYPRIPKEYWEHRIKMSKALGMNTICLYVFWNFHEPEEGKYDF 92
Query: 94 TGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNE 153
TG D+ F + Q+ G+YVI+R GPYVCAEW GG P WL I +LR + +M
Sbjct: 93 TGQKDIAAFCRMAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKEDI-KLREQDPYYMER 151
Query: 154 MQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLD 213
++ F + + L S+GG II+ Q+ENEYG+ D K YI M
Sbjct: 152 VKLFMNEV--GKQLADLQISKGGNIIMVQVENEYGSFGID-----KPYIAAIRDMVKQAG 204
Query: 214 I-GVPWIMCQ-----ESDAPSPMFTPNN------------------PNSPKIWTENWTGW 249
GVP C E++A + N PN+P + +E W+GW
Sbjct: 205 FTGVPLFQCDWNSNFENNALDDLLWTVNFGTGANIDQQFERLKELRPNTPLMCSEFWSGW 264
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY-----LTTSY 304
F WG K R+AE+L + +F + YM HGGT+FG G + TSY
Sbjct: 265 FDHWGAKHETRSAEELVKGMKEMLDRNISF-SLYMTHGGTSFGHWGGANFPNFSPTCTSY 323
Query: 305 DYDAPIDEYGHLNQPKWGHLRELHK 329
DYDAPI+E G + PK+ +R+L K
Sbjct: 324 DYDAPINESGKVT-PKFLEVRDLLK 347
>gi|332879232|ref|ZP_08446929.1| putative beta-galactosidase [Capnocytophaga sp. oral taxon 329 str.
F0087]
gi|357048073|ref|ZP_09109651.1| putative beta-galactosidase [Paraprevotella clara YIT 11840]
gi|332682652|gb|EGJ55552.1| putative beta-galactosidase [Capnocytophaga sp. oral taxon 329 str.
F0087]
gi|355529138|gb|EHG98592.1| putative beta-galactosidase [Paraprevotella clara YIT 11840]
Length = 786
Score = 182 bits (463), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 133/404 (32%), Positives = 193/404 (47%), Gaps = 40/404 (9%)
Query: 3 TLKHCSRAILLCLILQTLFNLSLAYRVSHDG---RAITIDGERKILLSGSIHYPRSTPGM 59
++ H +A LL L A + G + ++G+ I+ + +HYPR
Sbjct: 7 SISHVLKASLLTAGLFLFTPTEAAAKTETFGVGNKTFLLNGKPFIIKAAEVHYPRIPRPY 66
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W IK K G++ + YVFWN HE ++DFTGN D+ FI+ Q+ GLYVI+R GP
Sbjct: 67 WEQRIKMCKALGMNTLCLYVFWNIHEQEEGKFDFTGNNDVAEFIRLAQENGLYVIVRPGP 126
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
YVCAEW GG P WL I LR + FM + F + + L +GGPII
Sbjct: 127 YVCAEWEMGGLPWWLLKKKDI-RLREQDPYFMERYRIFAQKLGEQIG--DLTIEKGGPII 183
Query: 180 LAQIENEYGNVMSD--YGDAGKSYI-------------NWCAKMATSLDIGVPWIMCQES 224
+ Q+ENEYG+ D Y A + I +W + + + W M +
Sbjct: 184 MVQVENEYGSYGEDKPYVSAIRDIIRDSGFDKVTLFQCDWSSNFTKNGLDDLVWTMNFGT 243
Query: 225 DA----PSPMFTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQ 280
A P SP++ +E W+GWF WGG+ R ++++ + G +F
Sbjct: 244 GANIENEFKKLGELRPESPQMCSEFWSGWFDKWGGRHETRGSKEMVGGLKEMLDKGISF- 302
Query: 281 NYYMYHGGTNFGRTSGG--PYLT---TSYDYDAPIDEYGHLNQPKWGHLREL-----HKL 330
+ YM HGGT++G +G P + TSYDYDAPI+E G + PK+ LRE+ K
Sbjct: 303 SLYMTHGGTSWGHWAGANSPGFSPDVTSYDYDAPINEAGQVT-PKYMELREMLAGYSDKK 361
Query: 331 LKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEE 374
L S+ K + NV + V+ NLPA S+ D +T E
Sbjct: 362 LPSIPKEIPVINVPKIQF-TEVAPLFENLPAPHASM--DIQTME 402
>gi|384420175|ref|YP_005629535.1| beta-galactosidase [Xanthomonas oryzae pv. oryzicola BLS256]
gi|353463088|gb|AEQ97367.1| beta-galactosidase [Xanthomonas oryzae pv. oryzicola BLS256]
Length = 613
Score = 182 bits (463), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 173/664 (26%), Positives = 266/664 (40%), Gaps = 143/664 (21%)
Query: 9 RAILLCLILQTLFNLSLAYRVSH---------DGRAITIDGERKILLSGSIHYPRSTPGM 59
R L L+L F L + + G DG+ LLSG+IH+ R
Sbjct: 3 RTTLAPLVLALTFALPVTAAAADTERWPDFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAY 62
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W D ++KA+ GL+ +ETYVFWN EP + Q+DF+GN D+ F++ QGL VILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVQEAAAQGLNVILRPGP 122
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
Y CAEW GG+P WL I +R+ + F+ Q + + + + L GGPII
Sbjct: 123 YACAEWEAGGYPAWLFGQGNIR-VRSRDPRFLAASQAYLDAVAK--QVQPLLNHNGGPII 179
Query: 180 LAQIENEYGNVMSDYG---DAGKSYIN--------WCAKMATSLDIG-VPWIMCQESDAP 227
Q+ENEYG+ D+ D Y+ + + A L G +P + + AP
Sbjct: 180 AVQVENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGAEMLANGTLPDTLAVVNFAP 239
Query: 228 SPM------FTPNNPNSPKIWTENWTGWFKSWG----GKDPKRTAEDLAFAVARFFQFGG 277
P+ P++ E W GWF WG D + AE+ + + + G
Sbjct: 240 GEAKSAFDKLIAFRPDQPRMVGEYWAGWFDHWGKPHAATDATQQAEEFEWILRQ-----G 294
Query: 278 TFQNYYMYHGGTNFGRTSGGPY----------LTTSYDYDAPIDEYGHLNQPKWGHLREL 327
N YM+ GGT+FG +G + TTSYDYDA +DE G K+ +R+
Sbjct: 295 HSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAIVDEAGRPTA-KFALMRDA 353
Query: 328 HKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVK 387
+ ++ + T LPD E + N +
Sbjct: 354 IARVTGVQPPALPAPIATT-------------------TLPDTPLRESASLWDNLPAPIA 394
Query: 388 VKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADL 447
+ P P++ HF + Y+ Y T
Sbjct: 395 IDTPQ-------PME-----------------HFGQD-----------YGYILYRTT--- 416
Query: 448 KDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQI 507
++G L + V H Y++ V S + + + P G + +
Sbjct: 417 ------VTGPRKGPLYLGDVRDVAHVYLDQTPVGSVERRLQQVSTTVDIPA----GHHTL 466
Query: 508 SLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKW-TYKVGLYGLDDK 566
+L G NYG++ G+ PVLL G++ + W + + +
Sbjct: 467 DVLVENSGRINYGTRMADGRAGLVDPVLL----GNQQLT------GWQAFPLPM------ 510
Query: 567 KFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNL 626
+ +S RGW+ K V + +++ T D L+++ GKGFAW NG NL
Sbjct: 511 -----RTPDSIRGWTRKAV---QGPAFHRGTVRIGTPAD-TYLDMRAFGKGFAWANGVNL 561
Query: 627 GRYW 630
GR+W
Sbjct: 562 GRHW 565
>gi|419799561|ref|ZP_14324899.1| glycosyl hydrolase family 35 [Streptococcus parasanguinis F0449]
gi|385697826|gb|EIG28233.1| glycosyl hydrolase family 35 [Streptococcus parasanguinis F0449]
Length = 595
Score = 182 bits (463), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 186/701 (26%), Positives = 284/701 (40%), Gaps = 188/701 (26%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+ G+ +LSG+IHY R P W + K G + +ETYV WN HEP + Q+DF+G L
Sbjct: 12 LKGQPFKILSGAIHYFRIDPADWYHSLFNLKALGFNTVETYVPWNVHEPRKGQFDFSGRL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEE---LRTTNKVFMNEM 154
DL RFI+T Q GLY+I+R P++CAEW +GG P WL +EE +R+++ VF+ +
Sbjct: 72 DLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWL-----LEEDMRIRSSDPVFIEAV 126
Query: 155 QNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDI 214
+ ++ + + ++ QGGPI++ Q+ENEYG+ D K+Y+ + +
Sbjct: 127 DRYYDHLLGLLTRYQV--DQGGPILMMQVENEYGSYGED-----KAYLRAIRDLMKEKGV 179
Query: 215 GVPWIMCQESDAP------------SPMFTPNNPNS--------------------PKIW 242
P SD P +F N S P +
Sbjct: 180 TCPLFT---SDGPWRATLRAGNLIEDDLFVTGNFGSKAAYNFGQMQEFFDEYGKKWPLMC 236
Query: 243 TENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-- 300
E W GWF W +R E+LA AV + G N YM+HGGTNFG +G
Sbjct: 237 MEFWDGWFTRWKEPVIQREPEELAEAVHEVLELGSI--NLYMFHGGTNFGFMNGCSARGT 294
Query: 301 -----TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGS 355
TSYDY A ++E GN T Y +
Sbjct: 295 LDLPQVTSYDYGALLNEQ---------------------------GNPTEKYYAVQKMMA 327
Query: 356 SYNLPAWSVS--ILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDF 413
+Y P + ++ +C E+ T ++ +T++ N A ++ PE + +
Sbjct: 328 TY-YPEYPQQEPLIKECLPEQ--TLQLAAKTSLFGNLDNLA-----QVETSLYPEKMEEL 379
Query: 414 VVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHA 473
+ YL Y T+ +L ++ LRI +
Sbjct: 380 -------------------GQTTGYLLYETDLELDAEEE--------RLRIIDGRDRVQI 412
Query: 474 YVNGNYVDSQW-TKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF--DMVPNGI 530
Y++ +V +Q+ T+ G DLF + K + + +L +G NYG K D GI
Sbjct: 413 YLDDQHVATQYQTEIG--EDLFIKGKK--KAVTNLKILLENMGRVNYGHKLLADSQHKGI 468
Query: 531 PGPVLLVGRAGDETIIKDLSSH-KW-TYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLN 588
R G + DL H W Y + L L F A +
Sbjct: 469 --------RTG---VCVDLHFHLHWKQYPLDLQDLSQLDFSKEWQAGAP----------- 506
Query: 589 RRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYR 648
+Y+ F+ D L++ G GKG A+VNG+NLGR+W
Sbjct: 507 ---AFYRYDFQLDQTLD-TYLDMTGFGKGVAFVNGHNLGRFWEV---------------- 546
Query: 649 GPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
GP S +VP ++K+G N+L++FE G
Sbjct: 547 GPTTS--------------LYVPHGFLKEGANSLIVFETEG 573
>gi|143955283|sp|A2RSQ1.1|GLBL3_MOUSE RecName: Full=Beta-galactosidase-1-like protein 3
gi|124297651|gb|AAI32201.1| Glb1l3 protein [Mus musculus]
gi|124297899|gb|AAI32203.1| Glb1l3 protein [Mus musculus]
Length = 649
Score = 182 bits (463), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 116/333 (34%), Positives = 176/333 (52%), Gaps = 38/333 (11%)
Query: 23 LSLAYRVSHDGRA-ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFW 81
+ L+ + + G+A T++G + +++ GSIHY R W D + K + G + + TY+ W
Sbjct: 42 VGLSTKTNALGKAYFTLEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPW 101
Query: 82 NAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIE 141
N HE R ++DF+ LDL ++ + GL+VILR GPY+CAE + GG P WL P +
Sbjct: 102 NLHEQERGKFDFSEILDLEAYVLLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNP-VT 160
Query: 142 ELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSY 201
+LRTTNK F+ + + ++ K L GGP+I Q+ENEYG+ D ++Y
Sbjct: 161 DLRTTNKGFIEAVDKYFDHLI--PKILPLQYRHGGPVIAVQVENEYGSFQKD-----RNY 213
Query: 202 INWCAKMATSLDIGVPWIMCQESD-----APSPMFTPNNPNS----------------PK 240
+N+ K I + + D + + T N NS P
Sbjct: 214 MNYLKKALLKRGIVELLLTSDDKDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPI 273
Query: 241 IWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY- 299
+ E WTGW+ SWG K +++AE++ V +F +G +F N YM+HGGTNFG +GG Y
Sbjct: 274 MIMEYWTGWYDSWGSKHIEKSAEEIRHTVYKFISYGLSF-NMYMFHGGTNFGFINGGRYE 332
Query: 300 -----LTTSYDYDAPIDEYGHLNQPKWGHLREL 327
+ TSYDYDA + E G + K+ LR+L
Sbjct: 333 NHHISVVTSYDYDAVLSEAGDYTE-KYFKLRKL 364
>gi|443697452|gb|ELT97928.1| hypothetical protein CAPTEDRAFT_112460 [Capitella teleta]
Length = 651
Score = 182 bits (463), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 110/308 (35%), Positives = 159/308 (51%), Gaps = 43/308 (13%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+D + +LSG++HY R P W D + + K GL+ +ETYV WN HE + ++ FTG L
Sbjct: 65 LDNKELRILSGAMHYFRIVPEYWLDRLTRMKAAGLNTVETYVPWNLHEEIHGEFVFTGML 124
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+ RF+ + GL VILR GP++C+EW +GG P WL P + ++R+T + FM+ +++
Sbjct: 125 DIRRFVAIAEKVGLLVILRPGPFICSEWEFGGLPSWLLRDPQM-DVRSTYRPFMDAARSY 183
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSL-DIGV 216
++ ++ E + GGPII QIENEYG+ D +N+ ++ + D GV
Sbjct: 184 MRSLI--SELEDMQYQYGGPIIAMQIENEYGSYSDD--------VNYMQELKNIMTDSGV 233
Query: 217 PWIM--------CQESDAPSPMFTPN-----------------NPNSPKIWTENWTGWFK 251
I+ Q P T N P P + E W+GWF
Sbjct: 234 IEILFTSDNKHGLQPGRVPGVFMTTNFKNTNEGGRMFDKLHELQPGKPLMVMEFWSGWFD 293
Query: 252 SWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG---PYL--TTSYDY 306
W K + E+ A AV Q G + N YM+HGGTNFG +G PYL TSYDY
Sbjct: 294 HWEEKHHTMSLEEYASAVEYILQQGSSI-NLYMFHGGTNFGFLNGANTEPYLPTVTSYDY 352
Query: 307 DAPIDEYG 314
D+P+ E G
Sbjct: 353 DSPLSEAG 360
>gi|164519028|ref|NP_001106794.1| beta-galactosidase-1-like protein 3 precursor [Mus musculus]
Length = 662
Score = 182 bits (463), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 116/333 (34%), Positives = 176/333 (52%), Gaps = 38/333 (11%)
Query: 23 LSLAYRVSHDGRA-ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFW 81
+ L+ + + G+A T++G + +++ GSIHY R W D + K + G + + TY+ W
Sbjct: 55 VGLSTKTNALGKAYFTLEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPW 114
Query: 82 NAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIE 141
N HE R ++DF+ LDL ++ + GL+VILR GPY+CAE + GG P WL P +
Sbjct: 115 NLHEQERGKFDFSEILDLEAYVLLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNP-VT 173
Query: 142 ELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSY 201
+LRTTNK F+ + + ++ K L GGP+I Q+ENEYG+ D ++Y
Sbjct: 174 DLRTTNKGFIEAVDKYFDHLI--PKILPLQYRHGGPVIAVQVENEYGSFQKD-----RNY 226
Query: 202 INWCAKMATSLDIGVPWIMCQESD-----APSPMFTPNNPNS----------------PK 240
+N+ K I + + D + + T N NS P
Sbjct: 227 MNYLKKALLKRGIVELLLTSDDKDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPI 286
Query: 241 IWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY- 299
+ E WTGW+ SWG K +++AE++ V +F +G +F N YM+HGGTNFG +GG Y
Sbjct: 287 MIMEYWTGWYDSWGSKHIEKSAEEIRHTVYKFISYGLSF-NMYMFHGGTNFGFINGGRYE 345
Query: 300 -----LTTSYDYDAPIDEYGHLNQPKWGHLREL 327
+ TSYDYDA + E G + K+ LR+L
Sbjct: 346 NHHISVVTSYDYDAVLSEAGDYTE-KYFKLRKL 377
>gi|217075791|gb|ACJ86255.1| unknown [Medicago truncatula]
Length = 267
Score = 182 bits (462), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 112/274 (40%), Positives = 150/274 (54%), Gaps = 40/274 (14%)
Query: 284 MYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGN- 342
MYHGGTNF R++GGP++ TSYDYDAPIDEYG + Q KWGHL++++K +K E+ L +
Sbjct: 1 MYHGGTNFDRSTGGPFIATSYDYDAPIDEYGIIRQQKWGHLKDVYKAIKLCEEALITTDP 60
Query: 343 -----------------------VTNTDYGN----SVSGSSYNLPAWSVSILPDCKTEEF 375
+ N D N + SG+SY+LPAWSVS+LPDCK
Sbjct: 61 KISSLGQNLEAAVYKTGSVCAAFLANVDTKNDKTVNFSGNSYHLPAWSVSMLPDCKNVVL 120
Query: 376 NTAKVNTQTNVK-VKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNT-LIDQ-KST 432
NTAK+N+ + + + + + + +W W IN+ V K T L++Q +T
Sbjct: 121 NTAKINSASAISNFVTEDISSLETSSSKWSW----INEPVGISKDDILSKTGLLEQINTT 176
Query: 433 NDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASND 492
D SDYLWY + DL DD S L I S G LHA++NG +Q S
Sbjct: 177 ADRSDYLWYSLSLDLADDP-----GSQTVLHIESLGHTLHAFINGKLAGNQAGNSDKSKL 231
Query: 493 LFERPVKLTRGKNQISLLSATVGLQNYGSKFDMV 526
+ P+ L GKN+I LLS TVGLQNYG+ FD V
Sbjct: 232 NVDIPIALVSGKNKIDLLSLTVGLQNYGAFFDTV 265
>gi|148693363|gb|EDL25310.1| mCG125130, isoform CRA_b [Mus musculus]
Length = 688
Score = 182 bits (462), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 116/333 (34%), Positives = 176/333 (52%), Gaps = 38/333 (11%)
Query: 23 LSLAYRVSHDGRA-ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFW 81
+ L+ + + G+A T++G + +++ GSIHY R W D + K + G + + TY+ W
Sbjct: 81 VGLSTKTNALGKAYFTLEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPW 140
Query: 82 NAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIE 141
N HE R ++DF+ LDL ++ + GL+VILR GPY+CAE + GG P WL P +
Sbjct: 141 NLHEQERGKFDFSEILDLEAYVLLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNP-VT 199
Query: 142 ELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSY 201
+LRTTNK F+ + + ++ K L GGP+I Q+ENEYG+ D ++Y
Sbjct: 200 DLRTTNKGFIEAVDKYFDHLI--PKILPLQYRHGGPVIAVQVENEYGSFQKD-----RNY 252
Query: 202 INWCAKMATSLDIGVPWIMCQESD-----APSPMFTPNNPNS----------------PK 240
+N+ K I + + D + + T N NS P
Sbjct: 253 MNYLKKALLKRGIVELLLTSDDKDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPI 312
Query: 241 IWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY- 299
+ E WTGW+ SWG K +++AE++ V +F +G +F N YM+HGGTNFG +GG Y
Sbjct: 313 MIMEYWTGWYDSWGSKHIEKSAEEIRHTVYKFISYGLSF-NMYMFHGGTNFGFINGGRYE 371
Query: 300 -----LTTSYDYDAPIDEYGHLNQPKWGHLREL 327
+ TSYDYDA + E G + K+ LR+L
Sbjct: 372 NHHISVVTSYDYDAVLSEAGDYTE-KYFKLRKL 403
>gi|395846556|ref|XP_003795969.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Otolemur
garnettii]
Length = 633
Score = 182 bits (461), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 188/673 (27%), Positives = 265/673 (39%), Gaps = 154/673 (22%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+ GSIHY R W D + K K GL+ + TYV WN HEP R ++DF+GNLDL F+
Sbjct: 63 IFGGSIHYFRVPKEYWRDRLLKMKACGLNTLTTYVPWNLHEPQRGKFDFSGNLDLEAFVL 122
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
+ GL+VILR GPY+C+E + GG P WL PG+ LRTT K F + + + M
Sbjct: 123 LAAEIGLWVILRPGPYICSEIDLGGLPSWLLQDPGM-RLRTTYKGFTEAVDLYFDHL--M 179
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSD----------YGDAGKSYINWCAKMATSLDI 214
++ L GGPII Q+ENEYG+ D D G + + + L
Sbjct: 180 SRVVPLQYKHGGPIIAVQVENEYGSYYKDPAYMPYVKKALEDRGIVELLFTSDNKDGLRK 239
Query: 215 GVPWIMCQESDAPSPMFTPNNPN--------SPKIWTENWTGWFKSWGGKDPKRTAEDLA 266
G+ + + SP PK+ TE WTGWF SWGG + ++
Sbjct: 240 GIIHGVLATINLQSPQELQLLTTLLVSIQGVQPKMVTEYWTGWFDSWGGPHNILDSSEVL 299
Query: 267 FAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYDYDAPIDEYGHLNQPK 320
V+ G + N YM+HGGTNFG +G + TSYDYDA + E G PK
Sbjct: 300 KTVSAIVDTGSSI-NLYMFHGGTNFGFINGAMHFQDYRSDITSYDYDAVLTEAGDYT-PK 357
Query: 321 WGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKT--EEFNTA 378
+ LR+ L + T Y V P + +S+ K E +
Sbjct: 358 YIKLRDFFDSLSDGPLPPPPDPLPKTVYEPMV-------PVFYLSLWDALKYIGEPIKSE 410
Query: 379 KVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDY 438
K N+ P GN Q+ G+ T I
Sbjct: 411 KPINMENL----PVNEGNGQS------------------FGYVLYETTITSSG------- 441
Query: 439 LWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPV 498
+LSG + GQV V+ ++D + TK +
Sbjct: 442 --------------VLSGHA------RDRGQVFVNTVSIGFLDYKNTKI---------VI 472
Query: 499 KLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKV 558
L +G + +L G NYG D G+ G + L D++ +K+
Sbjct: 473 PLVQGHTVLRILVENCGRVNYGYNIDEQRKGLIGNLYL-----DDSPLKNFR-------- 519
Query: 559 GLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDP----VVLNLQGM 614
+Y LD K K+ GW+S VP + + F L P L L+G
Sbjct: 520 -IYSLDMK-----KSVFQRFGWNS--VPEAPALPAF---FLGGLSVGPSPADTFLKLEGW 568
Query: 615 GKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSW 674
KG ++NG NLGRYW G K Y +P W
Sbjct: 569 EKGVVFINGQNLGRYWSI-------------------GPQKTLY-----------LPGPW 598
Query: 675 IKDGVNTLVLFEE 687
+ G+N +++FEE
Sbjct: 599 LDRGINQVIIFEE 611
>gi|340372779|ref|XP_003384921.1| PREDICTED: beta-galactosidase-like [Amphimedon queenslandica]
Length = 659
Score = 182 bits (461), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 197/716 (27%), Positives = 290/716 (40%), Gaps = 144/716 (20%)
Query: 6 HCSRAILLCLILQTLFNL---SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPD 62
HC +LL L + S ++ + +D + + DG+ +SGS+HY R W D
Sbjct: 11 HCKVFLLLFLCSGASLFIGVDSRSFTIDYDSNSFSKDGQPFRYISGSMHYSRVPSYYWRD 70
Query: 63 LIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVC 122
+ K GL+A++TYV WN HEP Y+F G+ DL+ F+KT QD GL VILR GPY+C
Sbjct: 71 RLSKMYYAGLNAVQTYVPWNFHEPFPGVYNFEGDHDLVGFLKTAQDVGLLVILRAGPYIC 130
Query: 123 AEWNYGGFPVW-LHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILA 181
EW GGFP W L N P LR+++ +++ + + ++ + K GGPII
Sbjct: 131 GEWEMGGFPSWTLRNQPP-PTLRSSDPSYLSLVDAWMGKLLPLVKPLLY--ENGGPIITV 187
Query: 182 QIENEYGNV-------MSDYGDAGKSYINWCAKMATSLDIGVPWIMC------------Q 222
Q+ENEYG+ M+ + Y+ + T+ G ++ C
Sbjct: 188 QVENEYGSFYTCDQKYMNHLESTFRQYLGPNVVLFTTDGAGDGYLKCGTIPSLYATVDFG 247
Query: 223 ESDAPSPMFTPN---NPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTF 279
+D P F P P + +E +TGW WG R + +A ++ + +
Sbjct: 248 ATDNPEGYFAFQRKYEPKGPLVNSEFYTGWLDHWGQAHQTRNGDQIASSLDKILALNASV 307
Query: 280 QNYYMYHGGTNFG-----RTSGGPY--LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLK 332
N YM+ GGTNFG G Y TSYDYDAP++E G + K+G LR +
Sbjct: 308 -NMYMFEGGTNFGFWNGANCGGQSYQPQPTSYDYDAPLNERGEMTD-KFGLLRSV----- 360
Query: 333 SMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPN 392
++K ++ + SV G S I D + F + KV R
Sbjct: 361 -IKKYHPVPSIPPIESDVSVYGDK------SGHIYYDEYADLFESLKVLGTKQTTADR-- 411
Query: 393 QAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDP 452
PL ++ +M DF F L T + DD
Sbjct: 412 -------PLTFE---DMEQDF------GFILYTPV----------------------DDI 433
Query: 453 ILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSA 512
LS S +TL I+ Y N V + G + ++ T K +++L
Sbjct: 434 TLSSSDQVTLTIDELHDRATIYWNRQLVGTLLRSAGLTKNM-SVSFNATSSKGSLAILVE 492
Query: 513 TVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAK 572
+G NYGS + GI V L G E + H T + L ++ +F
Sbjct: 493 NMGRVNYGS-YIADKKGILNGVYL---NGVEVL------HWTTTSLPLNNTNELQFTQVG 542
Query: 573 AANSERGWSSKNVPLNRRMTWYKTTF--EAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
+ +YK +F + ND +L Q KG A VN +NLGRYW
Sbjct: 543 STTPP-----------TSAVFYKASFTIDGSTLNDTYLLTDQ-WTKGVAIVNDFNLGRYW 590
Query: 631 PTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
P +GP Q +VP S +K G N +VLFE
Sbjct: 591 PV---------------KGP--------------QKTLYVPASVLKKGTNGVVLFE 617
>gi|330997880|ref|ZP_08321714.1| putative beta-galactosidase [Paraprevotella xylaniphila YIT 11841]
gi|329569484|gb|EGG51254.1| putative beta-galactosidase [Paraprevotella xylaniphila YIT 11841]
Length = 786
Score = 181 bits (460), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 134/407 (32%), Positives = 194/407 (47%), Gaps = 41/407 (10%)
Query: 3 TLKHCSRAILLCLILQTLFNLSLAYRVSHDG---RAITIDGERKILLSGSIHYPRSTPGM 59
++ H +A LL L A + G + ++G+ I+ + +HYPR
Sbjct: 7 SISHVLKASLLTAGLFLFTPTEAAAKTETFGVGNKTFLLNGKPFIIKAAEVHYPRIPRPY 66
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W IK K G++ + YVFWN HE ++DFTGN D+ FI+ Q+ GLYVI+R GP
Sbjct: 67 WEQRIKMCKALGMNTLCLYVFWNIHEQEEGKFDFTGNNDVAEFIRLAQENGLYVIVRPGP 126
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
YVCAEW GG P WL I LR + FM + F + + L +GGPII
Sbjct: 127 YVCAEWEMGGLPWWLLKKKDI-RLREQDPYFMERYRIFAKKLGEQIG--DLTIEKGGPII 183
Query: 180 LAQIENEYGNVMSD----------YGDAGKSYI-----NWCAKMATSLDIGVPWIMCQES 224
+ Q+ENEYG+ D D+G + +W + + + W M +
Sbjct: 184 MVQVENEYGSYGEDKPYVSGIRDIIRDSGFDKVTLFQCDWSSNFTKNGLDDLVWTMNFGT 243
Query: 225 DA----PSPMFTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQ 280
A P SP++ +E W+GWF WGG+ R ++++ + G +F
Sbjct: 244 GANIENEFKKLGELRPESPQMCSEFWSGWFDKWGGRHETRGSKEMVGGLKEMLDKGISF- 302
Query: 281 NYYMYHGGTNFGRTSGG--PYLT---TSYDYDAPIDEYGHLNQPKWGHLREL-----HKL 330
+ YM HGGT++G +G P + TSYDYDAPI+E G + PK+ LRE+ K
Sbjct: 303 SLYMTHGGTSWGHWAGANSPGFSPDVTSYDYDAPINEAGQVT-PKYMELREMLSGYSDKK 361
Query: 331 LKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKT-EEFN 376
L S+ K NV + V+ NLPA S+ D +T E FN
Sbjct: 362 LPSIPKEFPVINVPKIQF-TEVAPLFENLPAPHASM--DIQTMEAFN 405
>gi|413925746|gb|AFW65678.1| hypothetical protein ZEAMMB73_601729 [Zea mays]
Length = 402
Score = 181 bits (460), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 129/387 (33%), Positives = 188/387 (48%), Gaps = 50/387 (12%)
Query: 281 NYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTY 340
NYYMYHGGTNFGRTS + YD +AP+DE+G +PKWGHLR+LH LK +K L +
Sbjct: 3 NYYMYHGGTNFGRTSAAFVMPKYYD-EAPLDEFGLYKEPKWGHLRDLHLALKLCKKALLW 61
Query: 341 GNVTNTDYGN------------------------------SVSGSSYNLPAWSVSILPDC 370
G + G + G SY +P S+SIL DC
Sbjct: 62 GKTSTEKLGKQFEARVFEIPEQKVCVAFLSNHNTKDDVTLTFRGQSYFVPRHSISILADC 121
Query: 371 KTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVV--RGKGHFALNTLID 428
KT F T VN Q N + DQ W +M ++ V + L D
Sbjct: 122 KTVVFGTQHVNAQHNQRTFHFA----DQTTQNNVW--QMFDEEKVPKYKQSKIRLRKAGD 175
Query: 429 QKS-TNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVD-SQWTK 486
+ T D +DY+WY ++ L+ DD + L +NS G A+VN +V TK
Sbjct: 176 LYNLTKDKTDYVWYTSSFKLEADDMPIRRDIKTVLEVNSHGHASVAFVNTKFVGCGHGTK 235
Query: 487 YGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETII 546
+ L E+P+ L +G N +++L++T+G+ + G+ + G+ + AG
Sbjct: 236 MNKAFTL-EKPMDLKKGVNHVAVLASTMGMMDSGAYLEHRLAGVDRVQIKGLNAG----T 290
Query: 547 KDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDP 606
DL+++ W + VGL G + K+ Y K S + N +R +TWYK F+ P DP
Sbjct: 291 LDLTNNGWGHIVGLVG-EQKQIYTDKGMGSVTWKPAVN---DRPLTWYKRHFDMPSGEDP 346
Query: 607 VVLNLQGMGKGFAWVNGYNLGRYWPTY 633
+VL++ MGKG +VNG +GRYW +Y
Sbjct: 347 IVLDMSTMGKGLMFVNGQGIGRYWISY 373
>gi|86142033|ref|ZP_01060557.1| putative exported beta-galactosidase [Leeuwenhoekiella blandensis
MED217]
gi|85831596|gb|EAQ50052.1| putative exported beta-galactosidase [Leeuwenhoekiella blandensis
MED217]
Length = 620
Score = 181 bits (459), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 195/730 (26%), Positives = 293/730 (40%), Gaps = 179/730 (24%)
Query: 6 HCSRAILLCLILQTLFNLSLAYRVSHDGRAITID-------GERKILLSGSIHYPRSTPG 58
R L+L L S + + D + I+ G+ + SG +HY R
Sbjct: 2 QVVRTNFFALVLIVL---SFGFAQAQDDASFKIENGSFVYNGKPTPIYSGEMHYERIPKE 58
Query: 59 MWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDF-TGNLDLIRFIKTIQDQGLYVILRI 117
W I+ K GL+ I TYVFWN H P +DF +GN ++ FIK +++ ++VILR
Sbjct: 59 YWRHRIQMMKAMGLNTIATYVFWNYHNPAPGVWDFESGNRNVAEFIKIAKEEEMFVILRP 118
Query: 118 GPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKK-EKLFASQGG 176
GPY C EW +GG+P +L N+PG+ ++R N F+ + + I ++AK+ L + GG
Sbjct: 119 GPYACGEWEFGGYPWFLQNIPGL-KVRENNAQFLAACKEY---INELAKQVAPLQVNNGG 174
Query: 177 PIILAQIENEYGNVMSDYGDAG----KSYINWCAKMATSLDIGVPWIMCQ---------- 222
II+ Q+ENE+G+ ++ D K+Y KM P+
Sbjct: 175 NIIMTQVENEFGSYVAQREDIAPEDHKAYKEAIFKMLKDAGFQAPFFTSDGAWLFEGGSL 234
Query: 223 ESDAPSP------------MFTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVA 270
E P+ + NN P + E + GW W K +A D+A
Sbjct: 235 EGVLPTANGEGNIDNLKKVVNKFNNNEGPYMVAEFYPGWLDHWAEPFVKISASDIAKQTE 294
Query: 271 RFFQFGGTFQNYYMYHGGTNFGRTSGGPY--------LTTSYDYDAPIDEYGHLNQPKWG 322
+ + G F N+YM HGGTNFG TSG Y TSYDYDAPI E G + PK+
Sbjct: 295 VYLKNGVNF-NFYMAHGGTNFGFTSGANYNDEHDIQPDITSYDYDAPISEAGWVT-PKYD 352
Query: 323 HLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAK-VN 381
+R L M+K + Y +PA I P + + AK +
Sbjct: 353 SIRAL------MQKY-----------------APYEIPAVPEQI-PVIEIPQIQLAKTTD 388
Query: 382 TQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWY 441
T +K ++P + +PL ++ + +G G+ L ++ T ++
Sbjct: 389 ALTFIKKQKPVTS---DSPLTFEQ--------LEQGFGY----VLYKKRFTQPITG---- 429
Query: 442 MTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLF---ERPV 498
TL++ YVNG K G N +F E P+
Sbjct: 430 -------------------TLKVPGLRDFATVYVNGK-------KVGELNRVFNSYEMPI 463
Query: 499 KLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKV 558
K+ + +L +G NYG++ GI PV + D I + W
Sbjct: 464 KIPF-NGSLEILVENMGRINYGAEIVNNLKGITAPVSI----NDYEI-----TGGW---- 509
Query: 559 GLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGF 618
+ Y A A +S V R + Y +F+ + D LN+ MGKG
Sbjct: 510 --------EMYKAPFAEVPEVINSTEVKTGRPVV-YSGSFDLKKQGD-TFLNMSEMGKGI 559
Query: 619 AWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDG 678
+VNG+NLGRYW P Q Y VP W+K
Sbjct: 560 VFVNGHNLGRYWKV-----------------------------GPQQTLY-VPGCWLKKK 589
Query: 679 VNTLVLFEEF 688
NT+ +FE+
Sbjct: 590 GNTITIFEQL 599
>gi|322390566|ref|ZP_08064082.1| beta-galactosidase [Streptococcus parasanguinis ATCC 903]
gi|321142719|gb|EFX38181.1| beta-galactosidase [Streptococcus parasanguinis ATCC 903]
Length = 595
Score = 181 bits (459), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 184/703 (26%), Positives = 281/703 (39%), Gaps = 186/703 (26%)
Query: 35 AITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFT 94
A + G+ +LSG+IHY R P W + K G + +ETYV WNAHEP + Q+DF+
Sbjct: 9 AFYLKGQPFKILSGAIHYFRIDPADWYHSLYNLKALGFNTVETYVPWNAHEPRKGQFDFS 68
Query: 95 GNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEE---LRTTNKVFM 151
G LDL RFI+T Q GLY+I+R P++CAEW +GG P WL +EE +R+++ F+
Sbjct: 69 GRLDLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWL-----LEEDLRIRSSDPAFI 123
Query: 152 NEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATS 211
+ + ++ + ++ QGGPI++ Q+ENEYG+ D K Y+ +
Sbjct: 124 EAVDRYYDRLLGLLTPYQV--DQGGPILMMQVENEYGSYGED-----KDYLRAIRDLMKE 176
Query: 212 LDIGVPWIMCQESDAP------------SPMFTPNNPNS--------------------P 239
+ P SD P +F N S P
Sbjct: 177 KGVTCPLFT---SDGPWRATLRAGTLIEEDLFVTGNFGSKAAYNFGQMKEFFDEYGKRWP 233
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
+ E W GWF W +R E+LA AV + G N YM+HGGTNFG +G
Sbjct: 234 LMCMEFWDGWFTRWKEPVIQRDPEELAEAVHEVLELGSI--NLYMFHGGTNFGFMNGCSA 291
Query: 300 L-------TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG-NS 351
TSYDY A ++E GN T Y
Sbjct: 292 RGTLDLPQVTSYDYGALLNE---------------------------QGNPTEKYYAIQK 324
Query: 352 VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMIN 411
+ + Y+ ++ +C E+ T ++ +T++ N A ++ PE +
Sbjct: 325 MMATYYSEYPQQEPLIKECLPEQ--TLQLAAKTSLFGNLDN-----LAQVETSLYPEKME 377
Query: 412 DFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVL 471
+ + YL Y T+ +L ++ LRI +
Sbjct: 378 EL-------------------GQTTGYLLYETDLELDAEEE--------RLRIIDGRDRV 410
Query: 472 HAYVNGNYVDSQW-TKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF--DMVPN 528
Y++ +V +Q+ T+ G DLF + K + + +L +G NYG K D
Sbjct: 411 QIYLDDQHVATQYQTEIG--EDLFIKGKK--KAVTNLKILLENMGRVNYGHKLLADSQHK 466
Query: 529 GIPGPVLLVGRAGDETIIKDLSSH-KW-TYKVGLYGLDDKKFYNAKAANSERGWSSKNVP 586
GI R G + DL H W Y + L L F A +
Sbjct: 467 GI--------RTG---VCVDLHFHLHWKQYPLDLQDLSQLDFTKEWQAGAP--------- 506
Query: 587 LNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCD 646
+Y+ F+ D L++ G GKG ++NG+NLGR+W
Sbjct: 507 -----AFYRYDFQLDHTLD-TYLDMTGFGKGVVFINGHNLGRFWEV-------------- 546
Query: 647 YRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
GP S +VP ++K+G N+L++FE G
Sbjct: 547 --GPTTS--------------LYVPHGFLKEGANSLIVFETEG 573
>gi|251795198|ref|YP_003009929.1| beta-galactosidase [Paenibacillus sp. JDR-2]
gi|247542824|gb|ACS99842.1| Beta-galactosidase [Paenibacillus sp. JDR-2]
Length = 584
Score = 181 bits (459), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 108/327 (33%), Positives = 160/327 (48%), Gaps = 45/327 (13%)
Query: 33 GRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYD 92
G+ + ++ +++G+IHY R P W D + K K G + +ETYV WN HEP ++
Sbjct: 8 GKQLMLNDRPFRIIAGAIHYFRVVPEYWRDRLLKLKACGFNTVETYVPWNFHEPEEGRFV 67
Query: 93 FTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMN 152
F G DL +FI + GLY I+R PY+CAEW +GG P WL PG+ LR + K F++
Sbjct: 68 FEGMADLEKFIALAGELGLYAIVRPSPYICAEWEFGGLPAWLLKDPGM-RLRCSYKPFLD 126
Query: 153 EMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSL 212
+ + ++ + +++GGP+I QIENEYG+ +D K+Y+N+ +
Sbjct: 127 KADAYYDELI--PRLTPFLSTKGGPLIAMQIENEYGSYGND-----KTYLNYLKEALVKR 179
Query: 213 DIGVPWIMCQESDAPSPMFTPN--------------------------NPNSPKIWTENW 246
+ V + SD P P+ P + E W
Sbjct: 180 GVDV---LLFTSDGPEDFMLQGGMVEGVWETVNFGSRSAEAFAKLQEYQPDQPLMCMEFW 236
Query: 247 TGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY------L 300
GWF WG R A D+A + G + N+YM+HGGTNFG SG Y
Sbjct: 237 NGWFDHWGETHHTRGAADVALVLDEMLAAGASV-NFYMFHGGTNFGFFSGANYTDRLLPT 295
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLREL 327
TSYDYD+P+ E G L + K+ +RE+
Sbjct: 296 VTSYDYDSPLSESGELTE-KYYAVREV 321
>gi|325914137|ref|ZP_08176490.1| beta-galactosidase [Xanthomonas vesicatoria ATCC 35937]
gi|325539640|gb|EGD11283.1| beta-galactosidase [Xanthomonas vesicatoria ATCC 35937]
Length = 635
Score = 181 bits (459), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 170/632 (26%), Positives = 261/632 (41%), Gaps = 136/632 (21%)
Query: 33 GRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYD 92
G DG+ +LSG+IH+ R W D ++KA+ GL+ +ETYVFWN EP + Q+D
Sbjct: 58 GTQFVRDGKPYQILSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFD 117
Query: 93 FTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMN 152
F+ N D+ F++ QGL VILR GPY CAEW GG+P WL I +R+ + F+
Sbjct: 118 FSANNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKDNIR-VRSRDPRFLA 176
Query: 153 EMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYG---DAGKSYIN------ 203
Q + + + + L GGPII Q+ENEYG+ D+ D ++
Sbjct: 177 ASQAYLDAVAK--QVQPLLNHNGGPIIAVQVENEYGSYDDDHAYMADNRAMFVKAGFDKA 234
Query: 204 --WCAKMATSLDIG-VPWIMCQESDAPSPM------FTPNNPNSPKIWTENWTGWFKSWG 254
+ + A L G +P + + AP P P++ E W GWF WG
Sbjct: 235 LLFTSDGADMLANGTLPGTLAVVNFAPGEAKSAFDKLIKFRPEQPRMVGEYWAGWFDHWG 294
Query: 255 ----GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL---------- 300
D K+ E+L + + + G N YM+ GGT+FG +G +
Sbjct: 295 TPHASTDAKQQTEELEWILRQ-----GHSANLYMFIGGTSFGFMNGANFQGNPSDHYAPQ 349
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLP 360
TTSYDYDA +DE GH PK+ +R+ VT T LP
Sbjct: 350 TTSYDYDAILDEAGHPT-PKFALMRD------------AIARVTGT--------QPPALP 388
Query: 361 A-WSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKG 419
A +++ LPD + E + N + + P P++
Sbjct: 389 APIAMAALPDTQLRESASLWDNLPAPIAIDTPQ-------PME----------------- 424
Query: 420 HFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNY 479
HF + Y+ Y T ++G +L + V YV+
Sbjct: 425 HFGQDY-----------GYILYRTT---------ITGPRKGSLYLGEVRDVARVYVDQKP 464
Query: 480 VDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGR 539
V S + V + G++ + +L G NYG + G+ PVLL
Sbjct: 465 VGSVERRL----QQVATDVDIPAGQHTLDVLVENSGRINYGPRMADGRAGLVDPVLL--- 517
Query: 540 AGDETIIKDLSSHKW-TYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTF 598
G++ + W + + + ++ +S RGW+ K V + +++
Sbjct: 518 -GNQQLT------GWQAFPLPM-----------RSPDSLRGWTRKAV---QGPAFHRGNL 556
Query: 599 EAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
D L+++ GKG AW NG NLGR+W
Sbjct: 557 RIGTPTD-TYLDMRAFGKGIAWANGVNLGRHW 587
>gi|300775043|ref|ZP_07084906.1| beta-galactosidase [Chryseobacterium gleum ATCC 35910]
gi|300506858|gb|EFK37993.1| beta-galactosidase [Chryseobacterium gleum ATCC 35910]
Length = 621
Score = 181 bits (458), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 122/363 (33%), Positives = 174/363 (47%), Gaps = 41/363 (11%)
Query: 1 MATLKHCSRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMW 60
M ILL L T+F+ + + DG + ++G+ + SG IHYPR W
Sbjct: 5 MRNFNRYFSIILLFFSLNTVFSQKGKFEI-RDGHFL-LNGKPFTIYSGEIHYPRVPSAYW 62
Query: 61 PDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPY 120
++ K GL+ + TYVFWN HE +++F+G DL +FIKT Q+ GLYVI+R GPY
Sbjct: 63 KHRLEMMKAMGLNTVTTYVFWNYHEEAPGKWNFSGEKDLQKFIKTAQETGLYVIIRPGPY 122
Query: 121 VCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIIL 180
VCAEW +GG+P WL + E+R NK F E + + + ++ + GGP+I+
Sbjct: 123 VCAEWEFGGYPWWLQKNKEL-EIRRDNKAFSEECWKYISQLAKQITPMQI--TNGGPVIM 179
Query: 181 AQIENEYGNVMSDYGD----AGKSYINWCAKMATSLDIGVPWIMCQ-------------- 222
Q ENE+G+ ++ D + Y + +M I VP
Sbjct: 180 VQAENEFGSYVAQRKDIPLEEHRKYSHKIKEMLLKSGISVPLFTSDGSSLFKGGSVEGAL 239
Query: 223 -----ESDAPSPMFTPNNPN---SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQ 274
ESD + N N P + E + GW W K + E++ + +
Sbjct: 240 PTANGESDIDVLKKSINEYNGGKGPYMIAEYYPGWLDHWAEPFVKVSTEEVVKQTNLYIE 299
Query: 275 FGGTFQNYYMYHGGTNFGRTSGGPYLT--------TSYDYDAPIDEYGHLNQPKWGHLRE 326
G +F NYYM HGGTNFG TSG Y TSYDYDAPI E G PK+ LR+
Sbjct: 300 NGVSF-NYYMIHGGTNFGFTSGANYDKDHDIQPDLTSYDYDAPISEAG-WATPKYNALRK 357
Query: 327 LHK 329
+ +
Sbjct: 358 IFQ 360
>gi|334330512|ref|XP_001374407.2| PREDICTED: beta-galactosidase-1-like protein 2 [Monodelphis
domestica]
Length = 673
Score = 181 bits (458), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 118/322 (36%), Positives = 166/322 (51%), Gaps = 39/322 (12%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G R + GSIHY R W D + K K GL+ + TY+ WN HEP R +++F+GNL
Sbjct: 92 LEGSRFRIFGGSIHYFRVPREYWKDRLLKLKACGLNTLTTYIPWNLHEPERGKFNFSGNL 151
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+ F++ D GL+VILR GPY+C+EW+ GG P WL + ELRTT F+ + +
Sbjct: 152 DVEAFVQMAADIGLWVILRPGPYICSEWDLGGLPSWLLQDSSM-ELRTTYVGFIKAVDLY 210
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + L +QGGPII Q+ENEYG+ D +Y+ + KMA V
Sbjct: 211 FNQLI--PRVVPLQYTQGGPIIAVQVENEYGSY-----DKDPNYMPYI-KMALLKRGIVE 262
Query: 218 WIMCQES-------------------DAPSPMFT---PNNPNSPKIWTENWTGWFKSWGG 255
+M ++ + S +F N P + TE WTGWF +WGG
Sbjct: 263 LLMTSDNKDGLSGGYVEGVLATINLKNVDSIIFNYLQSFQDNKPTMVTEFWTGWFDTWGG 322
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT------TSYDYDAP 309
A+D+ +V+ Q G + N YM+HGGTNFG +G + T TSYDYDA
Sbjct: 323 PHHIVDADDVMVSVSSIIQMGASL-NLYMFHGGTNFGFMNGAQHFTDYQADVTSYDYDAI 381
Query: 310 IDEYGHLNQPKWGHLRELHKLL 331
+ E G PK+ LRE L
Sbjct: 382 LTEAGDYT-PKFFKLREYFSTL 402
>gi|313231409|emb|CBY08524.1| unnamed protein product [Oikopleura dioica]
Length = 493
Score = 181 bits (458), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 112/342 (32%), Positives = 166/342 (48%), Gaps = 44/342 (12%)
Query: 17 LQTLFNLSLAYRVSHDGR--AITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDA 74
L LF+L+ A R + R A +DGE+ L+SGSIHY R W D + K K GL+
Sbjct: 42 LGLLFSLAFAERSPLEARDGAFWLDGEKITLVSGSIHYFRVPNEYWLDRLTKLKYAGLNT 101
Query: 75 IETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWL 134
+E YV WN HEP +++F+G+LD++RFI+ + GL+V+ R GPY+CAEW +GG P WL
Sbjct: 102 VELYVSWNLHEPYSGEFNFSGDLDVVRFIEMAGELGLHVLFRPGPYICAEWEWGGHPYWL 161
Query: 135 HNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDY 194
+ + ++RTT ++ ++ F + + + L GGPII QIENEY +
Sbjct: 162 LHDTDM-KVRTTYPGYLEAVEKFYSEL--FGRVNHLMYRNGGPIIAVQIENEYAGFADAF 218
Query: 195 --GDAGKSYINW---------CAKMATSLDIGVPWIMCQESDAPSPM------------- 230
G ++ W C ++ + D G + + P +
Sbjct: 219 EIGPLDPGFLTWLRQTIKDQQCEELLFTSDGGWDFYKYELEGDPYGLNFDDVLRANYWLN 278
Query: 231 -FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGT 289
N P PK+ E W+GWF WG TA+ + + NYYM+HGGT
Sbjct: 279 ILENNQPGKPKMVMEWWSGWFDFWGYHHQGTTADSFEENLRAILSQNASV-NYYMFHGGT 337
Query: 290 NFGRTSGGPY-------------LTTSYDYDAPIDEYGHLNQ 318
NFG +G + + TSYDYD P+ E G + +
Sbjct: 338 NFGYMNGANFNTNDQTNDLEYQPVVTSYDYDCPLSEEGRITK 379
>gi|358341339|dbj|GAA31081.2| beta-galactosidase [Clonorchis sinensis]
Length = 657
Score = 181 bits (458), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 107/318 (33%), Positives = 163/318 (51%), Gaps = 32/318 (10%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
++ + D DG + ++GS HY R W D ++KAK GLDAI+ Y+ WN HE
Sbjct: 39 SFTIDPDTHTFLKDGAQFQYIAGSFHYFRIPTLYWRDRLEKAKAAGLDAIQLYIPWNFHE 98
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P +Y+F + DL FI IQ + I+R GPY+CAEW +GG P WL ++R+
Sbjct: 99 PEEGEYNFADDRDLEYFIDIIQQLDMLAIVRAGPYICAEWAFGGLPPWLLRKNPYMKIRS 158
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGN-------VMSDYGDAG 198
++ + E+ N+ ++ + K K ++GGPII+ Q+ENEYG+ M++ D
Sbjct: 159 SDPAYYQEVVNWFNVL--LPKLRKHLYTEGGPIIMVQMENEYGSYGLCDRTYMTNLYDLA 216
Query: 199 KSYINWCAKMATSLDIGVPWIMCQESD---------APSPMFTPN---------NPNSPK 240
+S++ + T+ + ++ C D P+ M P+ P P
Sbjct: 217 RSHLGQDVILFTTDGCALSYLRCGVLDPRYLATIDFGPTTM-PPDLSFSSVEQFRPGQPL 275
Query: 241 IWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQ-NYYMYHGGTNFGRTSGGPY 299
+ +E ++GWF WGGK + AE L ++ + N YM+HGGTNFG +G P+
Sbjct: 276 VNSEFYSGWFDGWGGKHARTGAEFLRNSLMNLMNYSKRVNVNMYMFHGGTNFGLWNGKPH 335
Query: 300 ---LTTSYDYDAPIDEYG 314
TSYDYDAPI E G
Sbjct: 336 NIPAITSYDYDAPISEAG 353
>gi|417918764|ref|ZP_12562312.1| glycosyl hydrolase family 35 [Streptococcus parasanguinis SK236]
gi|342827747|gb|EGU62128.1| glycosyl hydrolase family 35 [Streptococcus parasanguinis SK236]
Length = 595
Score = 180 bits (457), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 185/703 (26%), Positives = 281/703 (39%), Gaps = 186/703 (26%)
Query: 35 AITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFT 94
A + G+ +LSG+IHY R P W + K G + +ETYV WNAHEP + Q+DF+
Sbjct: 9 AFYLKGQPFKILSGAIHYFRIDPADWYHSLYNLKALGFNTVETYVPWNAHEPRKGQFDFS 68
Query: 95 GNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEE---LRTTNKVFM 151
G LDL RFI+T Q GLY+I+R P++CAEW +GG P WL +EE +R+++ VF+
Sbjct: 69 GRLDLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWL-----LEEDLRIRSSDPVFI 123
Query: 152 NEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATS 211
+ + ++ + ++ +GGPI++ Q+ENEYG+ D K Y+ +
Sbjct: 124 EAVDRYYDRLLGLLTPYQV--DRGGPILMMQVENEYGSYGED-----KDYLRAIRDLMKE 176
Query: 212 LDIGVPWIMCQESDAP------------SPMFTPNNPNS--------------------P 239
+ P SD P +F N S P
Sbjct: 177 KGVTCPLFT---SDGPWRATLRAGTLIEEDLFVTGNFGSKATYNFGQMKEFFDEYGKRWP 233
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
+ E W GWF W +R E+LA AV + G N YM+HGGTNFG +G
Sbjct: 234 LMCMEFWDGWFTRWKEPVIQRDPEELAEAVHEVLELGSI--NLYMFHGGTNFGFMNGCSA 291
Query: 300 L-------TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG-NS 351
TSYDY A ++E GN T Y
Sbjct: 292 RGTLDLPQVTSYDYGALLNEQ---------------------------GNPTEKYYAIQK 324
Query: 352 VSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMIN 411
+ + Y ++ +C E+ T ++ +T++ N A ++ PE +
Sbjct: 325 MMATYYPEYPQQEPLIKECLPEQ--TLQLAAKTSLFGNLDNLA-----QVETSLYPEKME 377
Query: 412 DFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVL 471
+ + YL Y T+ +L ++ LRI +
Sbjct: 378 EL-------------------GQTTGYLLYETDLELDAEEE--------KLRIIDGRDRV 410
Query: 472 HAYVNGNYVDSQW-TKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF--DMVPN 528
Y++ +V +Q+ T+ G DLF + K + + +L +G NYG K D
Sbjct: 411 QIYLDDQHVATQYQTEIG--EDLFIKGKK--KAITNLKILLENMGRVNYGHKLLADSQHK 466
Query: 529 GIPGPVLLVGRAGDETIIKDLSSH-KW-TYKVGLYGLDDKKFYNAKAANSERGWSSKNVP 586
GI R G + DL H W Y + L L F A +
Sbjct: 467 GI--------RTG---VCVDLHFHLHWKQYPLDLQDLSQLDFSKEWQAGAP--------- 506
Query: 587 LNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCD 646
+Y+ F+ D L++ G GKG +VNG+NLGR+W
Sbjct: 507 -----AFYRYDFQLDHTLD-TYLDMTGFGKGVVFVNGHNLGRFWEV-------------- 546
Query: 647 YRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
GP S +VP ++K+G N+L++FE G
Sbjct: 547 --GPTTS--------------LYVPHGFLKEGANSLIVFETEG 573
>gi|325297293|ref|YP_004257210.1| glycoside hydrolase family protein [Bacteroides salanitronis DSM
18170]
gi|324316846|gb|ADY34737.1| glycoside hydrolase family 35 [Bacteroides salanitronis DSM 18170]
Length = 784
Score = 180 bits (457), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 112/312 (35%), Positives = 154/312 (49%), Gaps = 28/312 (8%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++GE ++ + +HYPR W IK+ K G++ I YVFWN HE ++DFTG
Sbjct: 42 LNGEPFVVKAAELHYPRIPRAYWEHRIKQCKALGMNTICLYVFWNFHEEKPGEFDFTGQK 101
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL F + Q +YVILR GPYVCAEW GG P WL I LR + F+ + F
Sbjct: 102 DLAEFCRLCQKNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDI-RLREDDPYFLERVAIF 160
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGN--------------VMSDYGDAGKSYIN 203
+ + L +GGPII+ Q+ENEYG+ V ++GD +
Sbjct: 161 EKEVANQVA--GLTIQKGGPIIMVQVENEYGSYGESKEYVAKIRDIVRGNFGDVTLFQCD 218
Query: 204 WCAKMATSLDIGVPWIMCQESDAP-SPMFTP---NNPNSPKIWTENWTGWFKSWGGKDPK 259
W + + + W M + A F P P+SP + +E W+GWF WG
Sbjct: 219 WASNFQLNALDDLVWTMNFGTGANIDEQFAPLKKVRPDSPLMCSEFWSGWFDKWGANHET 278
Query: 260 RTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYL---TTSYDYDAPIDEYG 314
R A+D+ + G +F + YM HGGTN+G +G P TSYDYDAPI E G
Sbjct: 279 RAADDMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAPISESG 337
Query: 315 HLNQPKWGHLRE 326
+ PK+ LRE
Sbjct: 338 KIT-PKYEKLRE 348
Score = 39.7 bits (91), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 53/229 (23%), Positives = 82/229 (35%), Gaps = 47/229 (20%)
Query: 461 TLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYG 520
TL + + +++G Y+ K N + + Q+ +L +G N+G
Sbjct: 422 TLTVTEAHDYAQIFIDGKYIG----KLDRRNGEKQLDIPACAEGAQLDILVEAMGRINFG 477
Query: 521 SKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGW 580
GI V L G T +K W +Y L+D+ + K E
Sbjct: 478 RAIKDF-KGITEKVEL-KNGGRTTELKG-----WK----VYNLEDR-YEGYKGLKFEPLK 525
Query: 581 SSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGC 640
S K+ R Y+ TF D LN + GKG +VNGY +GR W
Sbjct: 526 SVKDAQGQRVPGCYRATFHVEKPGD-TFLNFETWGKGLVYVNGYGIGRIWEI-------- 576
Query: 641 STESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
P Q Y +P W+K+G N +++F+ G
Sbjct: 577 ---------------------GPQQTLY-MPGCWLKEGENEILVFDIVG 603
>gi|445062232|ref|ZP_21374649.1| beta-galactosidase [Brachyspira hampsonii 30599]
gi|444506390|gb|ELV06735.1| beta-galactosidase [Brachyspira hampsonii 30599]
Length = 592
Score = 179 bits (455), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 184/691 (26%), Positives = 281/691 (40%), Gaps = 170/691 (24%)
Query: 36 ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTG 95
++G+ +LSG+IHY R W D + K G + +ETY+ WN HE +DF+G
Sbjct: 10 FILNGKPIKILSGAIHYFRFVREYWEDCLYNLKAAGFNTVETYIPWNIHEIDEGFFDFSG 69
Query: 96 NLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQ 155
N D+ FIKT Q L VILR PY+CAEW +GG P WL I+ +RT ++F++++
Sbjct: 70 NKDIASFIKTAQKLDLLVILRPTPYICAEWEFGGLPAWLLRYDNIK-VRTNTQLFLSKVD 128
Query: 156 NFTTLIVDMAKKEKLFASQGGPIILAQIENEYG-------------NVMSDYGDAGKSYI 202
+ + + L ++ GP+I+ QIENEYG N+M +G +
Sbjct: 129 AYYKEL--FKHIDDLQITRNGPVIMMQIENEYGSFGNDKEYLRALKNLMIKHGAEVPLFT 186
Query: 203 N---WCAKM--ATSLDIGVPWIM-----CQES-DAPSPMFTPNNPNSPKIWTENWTGWFK 251
+ W A + T +D G+ + +ES D F P + E W GWF
Sbjct: 187 SDGAWDAVLEAGTLIDDGILATVNFGSKAKESFDDTEKFFARKGIKKPLMCMEFWDGWFN 246
Query: 252 SWGGKDP--KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-------TT 302
W KDP KR A+D V + G N YM+ GGTNFG +G T
Sbjct: 247 LW--KDPIIKRDADDFIMEVKEILKRGSI--NLYMFIGGTNFGFYNGTSVTGYTDFPQIT 302
Query: 303 SYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAW 362
SYDYDA + E+G + +L KL+ + P
Sbjct: 303 SYDYDAVLTEWGEPTE----KFYKLQKLINEL------------------------FPEI 334
Query: 363 SVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFA 422
D K +F+ AK+ +T +
Sbjct: 335 KTFEPRDHKRLDFSEAKLKNKT-------------------------------------S 357
Query: 423 LNTLIDQKSTNDVSDYLWYMTNAD-----LKDDDPILSGSSNMTLRINSSGQVLHAYVNG 477
L ++ID+ S SD+ M A + + ++NM +R + +H Y+NG
Sbjct: 358 LFSVIDKISKCQKSDFPITMEKAGSGYGYMLYRTKVKGFNNNMNVRAVGASDRVHFYLNG 417
Query: 478 NYVDSQWTKYGASNDLFERPVKL--TRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVL 535
Y + KY ++L E P+++ G N + LL VG NYG K + G +
Sbjct: 418 EY---KGVKY--QDELIE-PIEMHFNDGDNILELLVENVGRVNYGYKLQECSQ-VKG--I 468
Query: 536 LVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYK 595
+G D + Y + L ++D F W +N P ++Y+
Sbjct: 469 RIGVMAD----IHFETGFEQYALSLDNIEDVDF--------SADW-IENTP-----SFYR 510
Query: 596 TTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDK 655
FE E L+ +GKG A++NG+NLGRYW +E C
Sbjct: 511 YEFEVK-EAADTFLDCSKLGKGVAFINGFNLGRYW----SEGPAC--------------- 550
Query: 656 CAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
+ ++P +K GVN +++FE
Sbjct: 551 -----------YLYIPAPLLKIGVNEIIVFE 570
>gi|343963204|gb|AEM72518.1| beta-galactosidase [Diospyros kaki]
Length = 173
Score = 179 bits (455), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 92/171 (53%), Positives = 107/171 (62%), Gaps = 12/171 (7%)
Query: 129 GFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYG 188
GF +PGI RT N F MQ FT IV+M K EKLF QGGPII++QIENEYG
Sbjct: 3 GFSCLAQYVPGIA-FRTDNGPFKAAMQKFTEKIVNMMKSEKLFEPQGGPIIMSQIENEYG 61
Query: 189 NVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPM-----------FTPNNPN 237
V + G GKSY W A+MA L+ GVPWIMC++ DAP P+ F PN
Sbjct: 62 PVEWEIGAPGKSYTKWAAQMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKNY 121
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGG 288
PK+WTENWTGW+ +GG P R EDLAF+VARF Q G+F NYYMYHG
Sbjct: 122 KPKMWTENWTGWYTKFGGPAPYRPVEDLAFSVARFIQNNGSFVNYYMYHGA 172
>gi|348529664|ref|XP_003452333.1| PREDICTED: beta-galactosidase-like [Oreochromis niloticus]
Length = 651
Score = 179 bits (454), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 124/347 (35%), Positives = 175/347 (50%), Gaps = 35/347 (10%)
Query: 14 CLILQTLFNLSL----AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
L+L LF SL ++ V + DGE+ +SGSIHY R W D + K
Sbjct: 9 VLLLLMLFGRSLGESPSFTVDYQNDCFRKDGEKFQYISGSIHYNRIPRVYWKDRLLKMYM 68
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
GL+AI+TYV WN HE + Y+F+G+ DL F+K QD GL VILR GPY+CAEW+ GG
Sbjct: 69 AGLNAIQTYVPWNYHEEVPGLYNFSGDRDLEHFLKLAQDVGLLVILRPGPYICAEWDMGG 128
Query: 130 FPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYG- 188
P WL I LR+T+ ++ + + ++ M K GGPII Q+ENEYG
Sbjct: 129 LPAWLLKKKDI-VLRSTDPDYIAAVDKWMGKLLPMIK--PYLYQNGGPIITVQVENEYGS 185
Query: 189 ------NVMSDYGDAGKSYINWCAKMATSLDIGVPWIMC---QESDAP---------SPM 230
N M +SY+ + T+ G+ ++ C Q+ A +
Sbjct: 186 YFACDYNYMRHLSKLFRSYLGDEVVLFTTDGAGLGYLKCGSIQDLYATVDFGPGANVTAA 245
Query: 231 FTPN---NPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHG 287
F P P+ P + +E +TGW WG + + +A A++ G N YM+ G
Sbjct: 246 FEPQRQVQPHGPLVNSEFYTGWLDHWGSRHSVVSPTQVAKALSEMLLMGANV-NLYMFIG 304
Query: 288 GTNFGRTSGG--PYLT--TSYDYDAPIDEYGHLNQPKWGHLRELHKL 330
GTNFG +G PY TSYDYDAP+ E G L + K+ +RE+ K+
Sbjct: 305 GTNFGYWNGANTPYAAQPTSYDYDAPLTEAGDLTE-KYFAIREVIKM 350
>gi|164519029|ref|NP_001019529.2| beta-galactosidase-1-like protein 3 precursor [Rattus norvegicus]
Length = 644
Score = 179 bits (454), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 113/326 (34%), Positives = 173/326 (53%), Gaps = 39/326 (11%)
Query: 30 SHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRR 89
+H T++G + +++ GSIHY R W D + K + G + + TY+ WN HE R
Sbjct: 63 AHGQAYFTLEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERG 122
Query: 90 QYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKV 149
++DF+ LDL ++ + GL+VILR GPY+CAE + GG P WL PG LRTTNK
Sbjct: 123 KFDFSEILDLEAYVLLAKTLGLWVILRPGPYICAEVDLGGLPSWLLRNPG-SNLRTTNKD 181
Query: 150 FMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMA 209
F+ + + ++ K L +GGP+I Q+ENEYG+ +D K+Y+ + K
Sbjct: 182 FIEAVDKYFDHLI--PKILPLQYRRGGPVIAVQVENEYGSFRND-----KNYMEYIKKAL 234
Query: 210 TSLDIGVPWIMCQESDAPSPM------FTPNNPNS----------------PKIWTENWT 247
+ I V ++ ++++ + N NS P + E WT
Sbjct: 235 LNRGI-VELLLTSDNESGIRIGSVKGALATINVNSFIKDSFVKLHRMQNDKPIMIMEYWT 293
Query: 248 GWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY------LT 301
GW+ SWG K +++A ++ + RFF +G +F N YM+HGGTNFG +GG + +
Sbjct: 294 GWYDSWGSKHTEKSANEIRRTIYRFFSYGLSF-NVYMFHGGTNFGFINGGYHENGHTNVV 352
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLREL 327
TSYDYDA + E G + K+ LR+L
Sbjct: 353 TSYDYDAVLSEAGDYTE-KYFKLRKL 377
>gi|81889875|sp|Q5XIL5.1|GLBL3_RAT RecName: Full=Beta-galactosidase-1-like protein 3
gi|53734228|gb|AAH83665.1| Galactosidase, beta 1-like 3 [Rattus norvegicus]
Length = 631
Score = 179 bits (453), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 113/326 (34%), Positives = 173/326 (53%), Gaps = 39/326 (11%)
Query: 30 SHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRR 89
+H T++G + +++ GSIHY R W D + K + G + + TY+ WN HE R
Sbjct: 50 AHGQAYFTLEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERG 109
Query: 90 QYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKV 149
++DF+ LDL ++ + GL+VILR GPY+CAE + GG P WL PG LRTTNK
Sbjct: 110 KFDFSEILDLEAYVLLAKTLGLWVILRPGPYICAEVDLGGLPSWLLRNPG-SNLRTTNKD 168
Query: 150 FMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMA 209
F+ + + ++ K L +GGP+I Q+ENEYG+ +D K+Y+ + K
Sbjct: 169 FIEAVDKYFDHLI--PKILPLQYRRGGPVIAVQVENEYGSFRND-----KNYMEYIKKAL 221
Query: 210 TSLDIGVPWIMCQESDAPSPM------FTPNNPNS----------------PKIWTENWT 247
+ I V ++ ++++ + N NS P + E WT
Sbjct: 222 LNRGI-VELLLTSDNESGIRIGSVKGALATINVNSFIKDSFVKLHRMQNDKPIMIMEYWT 280
Query: 248 GWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY------LT 301
GW+ SWG K +++A ++ + RFF +G +F N YM+HGGTNFG +GG + +
Sbjct: 281 GWYDSWGSKHTEKSANEIRRTIYRFFSYGLSF-NVYMFHGGTNFGFINGGYHENGHTNVV 339
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLREL 327
TSYDYDA + E G + K+ LR+L
Sbjct: 340 TSYDYDAVLSEAGDYTE-KYFKLRKL 364
>gi|320106923|ref|YP_004182513.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
gi|319925444|gb|ADV82519.1| glycoside hydrolase family 35 [Terriglobus saanensis SP1PR4]
Length = 633
Score = 179 bits (453), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 117/341 (34%), Positives = 170/341 (49%), Gaps = 50/341 (14%)
Query: 22 NLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFW 81
+++ +RV+ G ++GE LLSG +HY R W ++ AK GL+ + TY+FW
Sbjct: 37 SVTHTFRVA--GDHFELNGEPVQLLSGEMHYARIPREYWRARLQMAKAMGLNTVATYIFW 94
Query: 82 NAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGI- 140
N HEP YDF+GN D+ F+K Q++GL VILR GPY CAEW +GG+P WL P +
Sbjct: 95 NVHEPKPGVYDFSGNHDVAAFVKMAQEEGLNVILRAGPYACAEWEFGGYPSWLMKDPKMG 154
Query: 141 EELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYG----- 195
LR+ ++V+M ++ + + + L S GGPI+ Q+ENEYG+ D
Sbjct: 155 SALRSNDEVYMAPVERWIKRLGQ--EMVPLLISNGGPIVAVQVENEYGDFGGDKKYLAHM 212
Query: 196 -----------------DAGKSYINWCAK-MATSLDIGVPWIMCQESDAPSPMFTPNNPN 237
D K+ +N + + + ++ GV ++ P
Sbjct: 213 LEIFQNAGFKDSFLYTVDPSKALVNGSLEGLPSGVNFGV-----GNAERGLTALAHLRPG 267
Query: 238 SPKIWTENWTGWFKSWGGKDPKR----TAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGR 293
P +E W GWF WG R +D+A+ + + N YM+HGGT+FG
Sbjct: 268 QPLFASEYWPGWFDHWGHPHETRPIPPQLKDIAYTLDH-----KSSINIYMFHGGTSFGF 322
Query: 294 TS-----GGPYL--TTSYDYDAPIDEYGHLNQPKWGHLREL 327
S GG YL TSYDYDAP+DE GH PK+ R+L
Sbjct: 323 MSGASWTGGEYLPDVTSYDYDAPLDEAGH-PTPKFYAYRDL 362
>gi|397498227|ref|XP_003819886.1| PREDICTED: beta-galactosidase-1-like protein 3 [Pan paniscus]
Length = 653
Score = 179 bits (453), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 119/351 (33%), Positives = 175/351 (49%), Gaps = 49/351 (13%)
Query: 20 LFNLSLAYRVSHDGRA---ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIE 76
L N S+ GR T++G + ++ GSIHY R W D + K K G + +
Sbjct: 61 LKNRSVGLGTESTGRGKPHFTLEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVT 120
Query: 77 TYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHN 136
TYV WN HEP R ++DF+GNLDL F+ + GL+VILR GPY+C+E + GG P WL
Sbjct: 121 TYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQ 180
Query: 137 MPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGD 196
P + LRTTNK F+ ++ + ++ + L QGGP+I Q+ENEYG+ D
Sbjct: 181 DPRL-LLRTTNKSFIEAVEKYFDHLI--PRVIPLQYRQGGPVIAVQVENEYGSFNKD--- 234
Query: 197 AGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFTPNNP-------------------- 236
K+Y+ + K L G+ ++ SD + + +
Sbjct: 235 --KTYMPYLHKAL--LRRGIVELLLT-SDGEKHVLSGHTKGVLAAINLQKLHQDTFNQLH 289
Query: 237 ----NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG 292
+ P + E W GWF WG K + A+++ AV+ F ++ +F N YM+HGGTNFG
Sbjct: 290 KIQRDKPLLIMEYWVGWFDRWGDKHHVKDAKEVEHAVSEFIKYEISF-NVYMFHGGTNFG 348
Query: 293 RTSGGPY------LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKT 337
+G Y + TSYDYDA + E G + +L KL +S+ T
Sbjct: 349 FMNGATYFGKHSGIVTSYDYDAVLTEAGDYTE----KYLKLQKLFQSVSAT 395
>gi|261880887|ref|ZP_06007314.1| family 35 glycosyl hydrolase [Prevotella bergensis DSM 17361]
gi|270332394|gb|EFA43180.1| family 35 glycosyl hydrolase [Prevotella bergensis DSM 17361]
Length = 789
Score = 179 bits (453), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 109/311 (35%), Positives = 155/311 (49%), Gaps = 32/311 (10%)
Query: 44 ILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFI 103
++ + +HYPR W IK K G++ I YVFWN HE ++DF+GN D+ F
Sbjct: 47 VVKAAELHYPRIPRAYWDHRIKMCKALGMNTICLYVFWNIHEQREGEFDFSGNSDVAAFC 106
Query: 104 KTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVD 163
+ Q G+Y+I+R GPYVCAEW GG P WL I LR ++ FM ++ F + +
Sbjct: 107 RLTQKNGMYIIVRPGPYVCAEWEMGGLPWWLLKKKDI-RLRESDPYFMERVEIFEQKVAE 165
Query: 164 MAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKS--------YIN----------WC 205
+ L GGPII+ Q+ENEYG+ D G+ Y N W
Sbjct: 166 --QLAPLTIQNGGPIIMVQVENEYGSYGEDKKYVGQIRDVLRKYWYTNGRGPALFQCDWA 223
Query: 206 AKMATSLDIGVPWIM----CQESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGKDPKRT 261
+ + + W M DA P++PK+ +E W+GWF WG + R
Sbjct: 224 SNFEKNGLEDLIWTMNFGTGANIDAQFMRLGELRPDAPKMCSEFWSGWFDKWGARHETRP 283
Query: 262 AEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYL---TTSYDYDAPIDEYGHL 316
A+D+ + G +F + YM HGGT+FG +G P TSYDYDAPI+EYG +
Sbjct: 284 AKDMVAGIDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYGQV 342
Query: 317 NQPKWGHLREL 327
PK+ LR++
Sbjct: 343 T-PKFWELRKM 352
>gi|410972395|ref|XP_003992645.1| PREDICTED: beta-galactosidase-1-like protein 3 [Felis catus]
Length = 664
Score = 179 bits (453), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 116/321 (36%), Positives = 165/321 (51%), Gaps = 41/321 (12%)
Query: 36 ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTG 95
T+ G + ++ GSIHY R W D + K K G + + TYV WN HEP R ++DF+G
Sbjct: 91 FTLGGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTLTTYVPWNLHEPQRGKFDFSG 150
Query: 96 NLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQ 155
NLDL F+ + GL+VILR GPY+C+E + GG P WL P + LRTT K F+ +
Sbjct: 151 NLDLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPKM-ILRTTYKGFVEAVN 209
Query: 156 NFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIG 215
+ ++ ++ L + GPII Q+ENEYG+ D K Y+ + K L+ G
Sbjct: 210 KYFDHLI--SRVVPLQYRKRGPIIAVQVENEYGSFAED-----KDYMPYIQK--ALLERG 260
Query: 216 VPWIMCQESDAPSPM---------------FTPNN--------PNSPKIWTENWTGWFKS 252
+ ++ DA + F N+ N P + E W GWF +
Sbjct: 261 IVELLMTSDDAKHMLKGYIEGVLATINMNTFQINDFKQLSQVQRNKPIMVMEFWVGWFDT 320
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY------LTTSYDY 306
WGGK + AED+ V++F +F N YM+HGGTNFG +G Y + TSYDY
Sbjct: 321 WGGKHMIKNAEDVEDTVSKFITSEISF-NVYMFHGGTNFGFMNGATYFGKHRGVVTSYDY 379
Query: 307 DAPIDEYGHLNQPKWGHLREL 327
DA + E G + K+ LR+L
Sbjct: 380 DAVLTEAGDYTE-KYFKLRKL 399
>gi|257143787|emb|CAZ44333.1| beta-D-galactosidase [Paenibacillus thiaminolyticus]
Length = 583
Score = 179 bits (453), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 119/343 (34%), Positives = 167/343 (48%), Gaps = 45/343 (13%)
Query: 40 GERKI-LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLD 98
G+R I L+SG+IHY R P W D ++K K G + IETYV WN HEP ++ F D
Sbjct: 14 GDRPIQLISGAIHYFRIVPAYWEDRLRKIKAMGCNCIETYVAWNVHEPREGEFHFERMAD 73
Query: 99 LIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFT 158
+ F++ + GLYVI+R PY+CAEW +GG P WL + LR + F+ ++ +
Sbjct: 74 VAEFVRLAGELGLYVIVRPSPYICAEWEFGGLPAWL--LKDDMRLRCNDPRFLEKVSAYY 131
Query: 159 TLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPW 218
+ + + L A++GGPII QIENEYG+ +D ++Y+ A+ A ++ GV
Sbjct: 132 DAL--LPQLTPLLATKGGPIIAVQIENEYGSYGND-----QAYLQ--AQRAMLIERGVDV 182
Query: 219 I----------MCQESDAPSPMFTPN---------------NPNSPKIWTENWTGWFKSW 253
+ M Q A + T N P+ P + E W GWF W
Sbjct: 183 LLFTSDGPQDDMLQGGMAEGVLATVNFGSRPKEAFDKLKEYQPDGPLMCMEYWNGWFDHW 242
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY------LTTSYDYD 307
R A+D A + G + N+YM HGGTNFG SG + TSYDYD
Sbjct: 243 FEPHHTRDAKDAARVLDDMLGMGASV-NFYMVHGGTNFGFGSGANHSDKYEPTVTSYDYD 301
Query: 308 APIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN 350
A I E G L PK+ RE+ S+ + N DYG+
Sbjct: 302 AAISEAGDLT-PKYHAFREVIGKYVSLPEGELPANTPKADYGS 343
>gi|421767985|ref|ZP_16204697.1| Beta-galactosidase 3 [Lactobacillus rhamnosus LRHMDP2]
gi|421773235|ref|ZP_16209883.1| Beta-galactosidase 3 [Lactobacillus rhamnosus LRHMDP3]
gi|411182327|gb|EKS49478.1| Beta-galactosidase 3 [Lactobacillus rhamnosus LRHMDP3]
gi|411186672|gb|EKS53794.1| Beta-galactosidase 3 [Lactobacillus rhamnosus LRHMDP2]
Length = 656
Score = 178 bits (452), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 174/652 (26%), Positives = 266/652 (40%), Gaps = 156/652 (23%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + H+ +DG+ +LSG+IHY R P W + K G + +ETYV WN HE
Sbjct: 67 FSIDHE---FMLDGKPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEY 123
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
++DF+G LD+ RF+KT +D GLY I+R PY+CAEW +GGFP WL + LRT
Sbjct: 124 REGEFDFSGILDIERFLKTAEDLGLYAIVRPSPYICAEWEFGGFPAWL--LTKKMRLRTD 181
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
+ ++ + + T ++ ++ + GG +I+ Q+ENEYG+ D + Y+ A
Sbjct: 182 DPAYLVAIDRYYTALMPHLVDHQV--THGGNVIMMQVENEYGSYGED-----QDYLAAVA 234
Query: 207 KMATSLDIGVPWIMCQESDAPSP------------MFTPNNPNS---------------- 238
K+ + VP SD P P + N S
Sbjct: 235 KLMQQHGVDVPLFT---SDGPWPATLNAGSMIDAGILATGNFGSAADKNFDRLAAFHQEH 291
Query: 239 ----PKIWTENWTGWFKSWGG----KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTN 290
P + E W GWF WG +DP TAEDL + R G+ N YM+HGGTN
Sbjct: 292 GRDWPLMCMEFWDGWFNRWGEPIIRRDPDETAEDLRAVIKR-----GSV-NLYMFHGGTN 345
Query: 291 FGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN 350
FG +G + D+D P VT+ DY
Sbjct: 346 FGFMNG---TSARKDHDLP--------------------------------QVTSYDYDA 370
Query: 351 SVSGSSYNLPAW-SVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEM 409
++ P + ++ + + E AK + +P A PL K
Sbjct: 371 PLNEQGNPTPKYFAIQKMIHEELPEVQQAK-------PLVKPTMAPASH-PLTAK----- 417
Query: 410 INDFVVRGKGHFALNTLIDQ------KSTNDVSDYLWYMTNADLKDDDPILSGSSNMT-- 461
+L ++DQ S ++L T L P++SG+ T
Sbjct: 418 -----------VSLFAVLDQLAKPIAASYPQTQEFLGQYTGYTLYRTQPLISGTDKGTPA 466
Query: 462 -LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYG 520
LR+ + + AY++ ++ +Q+ + +D+ V+ G +Q+ LL + NYG
Sbjct: 467 KLRVIDARDRVQAYLDQKWLATQYQE-AIGDDILLPEVE---GHHQLDLLVENMSRVNYG 522
Query: 521 SKFDMVPN--GIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSER 578
SK + + GI V++ D IK Y LD + A
Sbjct: 523 SKIEAITQFKGIRTGVMV-----DLHFIKGYQQ---------YPLDLNR---ASQLTFTE 565
Query: 579 GWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
GW +YK TF+ D L+ +G GKG VNG N+GR+W
Sbjct: 566 GWQPATP------AFYKYTFDLTAPQD-TYLDCRGFGKGVMLVNGVNVGRFW 610
>gi|258507331|ref|YP_003170082.1| beta-galactosidase (GH35) [Lactobacillus rhamnosus GG]
gi|385827042|ref|YP_005864814.1| beta-galactosidase [Lactobacillus rhamnosus GG]
gi|257147258|emb|CAR86231.1| Beta-galactosidase (GH35) [Lactobacillus rhamnosus GG]
gi|259648687|dbj|BAI40849.1| beta-galactosidase [Lactobacillus rhamnosus GG]
Length = 593
Score = 178 bits (452), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 174/652 (26%), Positives = 266/652 (40%), Gaps = 156/652 (23%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + H+ +DG+ +LSG+IHY R P W + K G + +ETYV WN HE
Sbjct: 4 FSIDHE---FMLDGKPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEY 60
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
++DF+G LD+ RF+KT +D GLY I+R PY+CAEW +GGFP WL + LRT
Sbjct: 61 REGEFDFSGILDIERFLKTAEDLGLYAIVRPSPYICAEWEFGGFPAWL--LTKKMRLRTD 118
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
+ ++ + + T ++ ++ + GG +I+ Q+ENEYG+ D + Y+ A
Sbjct: 119 DPAYLAAIDRYYTALMPHLVDHQV--THGGNVIMMQVENEYGSYGED-----QDYLAAVA 171
Query: 207 KMATSLDIGVPWIMCQESDAPSP------------MFTPNNPNS---------------- 238
K+ + VP SD P P + N S
Sbjct: 172 KLMQQHGVDVPLFT---SDGPWPATLNAGSMIDAGILATGNFGSAADKNFDRLAAFHQEH 228
Query: 239 ----PKIWTENWTGWFKSWGG----KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTN 290
P + E W GWF WG +DP TAEDL + R G+ N YM+HGGTN
Sbjct: 229 GRDWPLMCVEFWDGWFNRWGEPIIRRDPDETAEDLRAVIKR-----GSV-NLYMFHGGTN 282
Query: 291 FGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN 350
FG +G + D+D P VT+ DY
Sbjct: 283 FGFMNG---TSARKDHDLP--------------------------------QVTSYDYDA 307
Query: 351 SVSGSSYNLPAW-SVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEM 409
++ P + ++ + + E AK + +P A PL K
Sbjct: 308 PLNEQGNPTPKYFAIQKMIHEELPEVQQAK-------PLVKPTMAPASH-PLTAK----- 354
Query: 410 INDFVVRGKGHFALNTLIDQ------KSTNDVSDYLWYMTNADLKDDDPILSGSSNMT-- 461
+L ++DQ S ++L T L P++SG+ T
Sbjct: 355 -----------VSLFAVLDQLTKPIAASYPQTQEFLGQYTGYTLYRTQPLISGTDKGTPA 403
Query: 462 -LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYG 520
LR+ + + AY++ ++ +Q+ + +D+ V+ G +Q+ LL + NYG
Sbjct: 404 KLRVIDARDRVQAYLDQKWLATQYQE-AIGDDILLPEVE---GHHQLDLLVENMSRVNYG 459
Query: 521 SKFDMVPN--GIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSER 578
SK + + GI V++ D IK Y LD + A
Sbjct: 460 SKIEAITQFKGIRTGVMV-----DLHFIKGYQQ---------YPLDLNR---ASRLTFTE 502
Query: 579 GWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
GW +YK TF+ D L+ +G GKG VNG N+GR+W
Sbjct: 503 GWQPATP------AFYKYTFDLTAPQD-TYLDCRGFGKGVMLVNGVNVGRFW 547
>gi|332838248|ref|XP_001156615.2| PREDICTED: galactosidase, beta 1-like 3 [Pan troglodytes]
Length = 653
Score = 178 bits (452), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 119/351 (33%), Positives = 175/351 (49%), Gaps = 49/351 (13%)
Query: 20 LFNLSLAYRVSHDGRA---ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIE 76
L N S+ GR T++G + ++ GSIHY R W D + K K G + +
Sbjct: 61 LKNRSVGLGTESTGRGKPHFTLEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVT 120
Query: 77 TYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHN 136
TYV WN HEP R ++DF+GNLDL F+ + GL+VILR GPY+C+E + GG P WL
Sbjct: 121 TYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQ 180
Query: 137 MPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGD 196
P + LRTTNK F+ ++ + ++ + L QGGP+I Q+ENEYG+ D
Sbjct: 181 DPRL-LLRTTNKSFIEAVEKYFDHLI--PRVIPLQYRQGGPVIAVQVENEYGSFNKD--- 234
Query: 197 AGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFTPNNP-------------------- 236
K+Y+ + K L G+ ++ SD + + +
Sbjct: 235 --KTYMPYLHKAL--LRRGIVELLLT-SDGEKHVLSGHTKGVLAAINLQKLHQDTFNQLH 289
Query: 237 ----NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG 292
+ P + E W GWF WG K + A+++ AV+ F ++ +F N YM+HGGTNFG
Sbjct: 290 KVQRDKPLLIMEYWVGWFDRWGDKHHVKDAKEVEHAVSEFIKYEISF-NVYMFHGGTNFG 348
Query: 293 RTSGGPY------LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKT 337
+G Y + TSYDYDA + E G + +L KL +S+ T
Sbjct: 349 FMNGATYFGKHSGIVTSYDYDAVLTEAGDYTE----KYLKLQKLFQSVSAT 395
>gi|257869131|ref|ZP_05648784.1| 35 glycosylhydrolase [Enterococcus gallinarum EG2]
gi|257803295|gb|EEV32117.1| 35 glycosylhydrolase [Enterococcus gallinarum EG2]
Length = 584
Score = 178 bits (452), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 114/338 (33%), Positives = 169/338 (50%), Gaps = 43/338 (12%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
++SGSIHY R P W D ++K + G + +ETYV WN HEP ++DF+ NLDL RFI+
Sbjct: 19 IISGSIHYFRVVPAYWRDRLEKLRLMGCNTVETYVPWNMHEPQEGKFDFSDNLDLRRFIQ 78
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
Q+ GLYVILR PY+CAEW +GG P WL P + ++R FM ++ + T +
Sbjct: 79 LAQEVGLYVILRPAPYICAEWEFGGLPYWLLKDPFM-KIRFDYPPFMEKIARYFTQL--F 135
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGV-------P 217
++ L +Q GPI++ Q+ENEYG+ +D KSY+ A++ I V P
Sbjct: 136 SQVSDLQITQEGPILMMQVENEYGSYGND-----KSYLRKSAELMRHNGIDVSLFTSDGP 190
Query: 218 WIMCQESDAPSPMFTP------------------NNPNSPKIWTENWTGWFKSWG-GKDP 258
W+ E+ + + P + P + E W GWF +WG K
Sbjct: 191 WLDMLENGSIKDIALPTINCGSDIQENFRKLQEFHGKKQPLMVMEFWIGWFDAWGDDKHH 250
Query: 259 KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYDYDAPIDE 312
+ D A + + G N YM+HGGTNFG +G Y TSYDYDA + E
Sbjct: 251 TTSVTDAANELRDCLEAGSV--NIYMFHGGTNFGFMNGANYYEKLSPDVTSYDYDALLSE 308
Query: 313 YGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN 350
+G + PK+ +++ + + +T YG+
Sbjct: 309 WGDVT-PKYEAFQQVIGEITEIPSFPLTTKITKRAYGS 345
Score = 39.7 bits (91), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 48/204 (23%), Positives = 78/204 (38%), Gaps = 54/204 (26%)
Query: 495 ERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKW 554
++ + LT N++ +L +G NY + + GI V++ G E I
Sbjct: 427 KKTLTLTEESNELGILVENMGRVNYSVQMNHQYKGIKDGVIVNGAFQSEWEI-------- 478
Query: 555 TYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGM 614
Y + + LD F + + + G S + K +F+ D V L G
Sbjct: 479 -YSLPMDNLDQVDF----SGHWQTGQPS----------FSKVSFQVDECADTFV-ELPGW 522
Query: 615 GKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSW 674
GKGF +NG+N+GR+W RGP Q ++P +
Sbjct: 523 GKGFIVINGHNIGRFWE----------------RGP--------------QRRLYIPAPY 552
Query: 675 IKDGVNTLVLFEEFGGNPSQINFQ 698
+++G N V+FE G I F
Sbjct: 553 LREGNNEAVIFESDGRVSDMIIFH 576
>gi|357050010|ref|ZP_09111224.1| hypothetical protein HMPREF9478_01207 [Enterococcus saccharolyticus
30_1]
gi|355382493|gb|EHG29591.1| hypothetical protein HMPREF9478_01207 [Enterococcus saccharolyticus
30_1]
Length = 584
Score = 178 bits (452), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 114/338 (33%), Positives = 169/338 (50%), Gaps = 43/338 (12%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
++SGSIHY R P W D ++K + G + +ETYV WN HEP ++DF+ NLDL RFI+
Sbjct: 19 IISGSIHYFRVVPAYWRDRLEKLRLMGCNTVETYVPWNMHEPQEGKFDFSDNLDLRRFIQ 78
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
Q+ GLYVILR PY+CAEW +GG P WL P + ++R FM ++ + T +
Sbjct: 79 LAQEVGLYVILRPAPYICAEWEFGGLPYWLLKDPFM-KIRFDYPPFMEKIARYFTQL--F 135
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGV-------P 217
++ L +Q GPI++ Q+ENEYG+ +D KSY+ A++ I V P
Sbjct: 136 SQVSDLQITQEGPILMMQVENEYGSYGND-----KSYLRKSAELMRHNGIDVPLFTSDGP 190
Query: 218 WIMCQESDAPSPMFTP------------------NNPNSPKIWTENWTGWFKSWG-GKDP 258
W+ E+ + + P + P + E W GWF +WG K
Sbjct: 191 WLDMLENGSIKDIALPTINCGSDIQENFRKLQEFHGKKQPLMVMEFWIGWFDAWGDDKHH 250
Query: 259 KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYDYDAPIDE 312
+ D A + + G N YM+HGGTNFG +G Y TSYDYDA + E
Sbjct: 251 TTSVTDAANELRDCLEAGSV--NIYMFHGGTNFGFMNGANYYEKLLPDVTSYDYDALLSE 308
Query: 313 YGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN 350
+G + PK+ +++ + + +T YG+
Sbjct: 309 WGDVT-PKYEAFQQVIGEITEIPSFPLTTKITKRAYGS 345
Score = 39.7 bits (91), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 48/204 (23%), Positives = 78/204 (38%), Gaps = 54/204 (26%)
Query: 495 ERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKW 554
++ + LT N++ +L +G NY + + GI V++ G E I
Sbjct: 427 KKTLTLTEESNELGILVENMGRVNYSVQMNHQYKGIKDGVIVNGAFQSEWEI-------- 478
Query: 555 TYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGM 614
Y + + LD F + + + G S + K +F+ D V L G
Sbjct: 479 -YSLPMDNLDQVDF----SGHWQTGQPS----------FSKVSFQVDECADTFV-ELPGW 522
Query: 615 GKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSW 674
GKGF +NG+N+GR+W RGP Q ++P +
Sbjct: 523 GKGFIVINGHNIGRFWE----------------RGP--------------QRRLYIPAPY 552
Query: 675 IKDGVNTLVLFEEFGGNPSQINFQ 698
+++G N V+FE G I F
Sbjct: 553 LREGNNEAVIFESDGRVSDMIIFH 576
>gi|414156558|ref|ZP_11412859.1| hypothetical protein HMPREF9186_01279 [Streptococcus sp. F0442]
gi|410869551|gb|EKS17511.1| hypothetical protein HMPREF9186_01279 [Streptococcus sp. F0442]
Length = 595
Score = 178 bits (452), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 185/699 (26%), Positives = 280/699 (40%), Gaps = 184/699 (26%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+ G+ +LSG+IHY R P W + K G + +ETYV WNAHEP + Q+DF+G L
Sbjct: 12 LKGQPFKILSGAIHYFRIDPTDWYHSLYNLKALGFNTVETYVPWNAHEPKKGQFDFSGRL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEE---LRTTNKVFMNEM 154
DL RFI+T Q GLY+I+R P++CAEW +GG P WL +EE +R+++ F+ +
Sbjct: 72 DLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWL-----LEEDLRIRSSDPAFIEAI 126
Query: 155 QNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDI 214
+ ++ + ++ +GGPI++ Q+ENEYG+ D K Y+ + +
Sbjct: 127 DRYYDRLLGLLTPYQV--DRGGPILMMQVENEYGSYGED-----KDYLRAIRDLMKEKGV 179
Query: 215 GVPWIMCQESDAP------------SPMFTPNNPNS--------------------PKIW 242
P SD P +F N S P +
Sbjct: 180 TCPLFT---SDGPWRATLRTGTLIEEDLFVTGNFGSKAAYNFGQMKEFFNEYGKKWPLMC 236
Query: 243 TENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-- 300
E W GWF W +R E+LA AV + G N YM+HGGTNFG +G
Sbjct: 237 MEFWDGWFTRWKEPVIQRDPEELAEAVHEVLELGSI--NLYMFHGGTNFGFMNGCSARGT 294
Query: 301 -----TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGS 355
TSYDY A ++E G+ E + ++ M T +Y
Sbjct: 295 LDLPQVTSYDYGALLNEQGNPT--------EKYDAIQKMMATYY------PEYPQQEPLI 340
Query: 356 SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVV 415
LP S+ + AK + N+ ++ A L+ PE + +
Sbjct: 341 KECLPEQSLQL----------AAKTSLFANL---------DNLAQLKTSLYPEKMEEL-- 379
Query: 416 RGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYV 475
+ YL Y T+ +L ++ LRI + Y+
Sbjct: 380 -----------------GQTTGYLLYETDLELDAEEE--------KLRIIDGRDRVQIYL 414
Query: 476 NGNYVDSQW-TKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF--DMVPNGIPG 532
+ +V +Q+ T+ G DLF + K + + +L +G NYG K D GI
Sbjct: 415 DDQHVATQYQTEIG--EDLFIKGKK--KAVTNLKILLENMGRVNYGHKLLADSQHKGI-- 468
Query: 533 PVLLVGRAGDETIIKDLSSH-KW-TYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRR 590
R G + DL H W Y + L L F A +
Sbjct: 469 ------RTG---VCVDLHFHLHWKQYPLDLQDLSQLDFSKEWQAGAP------------- 506
Query: 591 MTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGP 650
+Y+ F+ D L++ G GKG +VNG+NLGR+W GP
Sbjct: 507 -AFYRYDFQLDHTLD-TYLDMTGFGKGVVFVNGHNLGRFWEV----------------GP 548
Query: 651 YGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
S +VP ++K+G N+L++FE G
Sbjct: 549 TTS--------------LYVPHGFLKEGANSLIVFETEG 573
>gi|196002910|ref|XP_002111322.1| hypothetical protein TRIADDRAFT_1215 [Trichoplax adhaerens]
gi|190585221|gb|EDV25289.1| hypothetical protein TRIADDRAFT_1215, partial [Trichoplax
adhaerens]
Length = 543
Score = 178 bits (452), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 120/345 (34%), Positives = 176/345 (51%), Gaps = 44/345 (12%)
Query: 33 GRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYD 92
G+ T+ + + SG+IHY R P W D + K K GL+ +ETYV WN HEP+ Q+D
Sbjct: 1 GKTFTLLDKPIHIRSGAIHYFRVVPEYWRDRLLKMKAFGLNTVETYVPWNLHEPVPGQFD 60
Query: 93 FTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMN 152
+TG L++ +FI Q+ G YVILR GPY+CAEW +GG P WL + + ++R+T K F +
Sbjct: 61 YTGILNVRKFILLAQELGFYVILRPGPYICAEWEFGGMPSWLLSDKNM-QVRSTYKPFKD 119
Query: 153 EMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSL 212
+ F + K L AS+GGPII Q+ENEYG+ SD + Y+ + +
Sbjct: 120 AVNRFFDGFIPEIK--SLQASKGGPIIAVQVENEYGSYGSD-----EEYMQFIRDALINR 172
Query: 213 DIGVPWIMCQESD------APSPMFTPNN--------------PNSPKIWTENWTGWFKS 252
I + S+ AP + T N ++P I E W+GWF
Sbjct: 173 GIVELLVTSDNSEGIKHGGAPGVLKTYNFQGHAKSHLSILERLQDAPSIVMEFWSGWFDH 232
Query: 253 WGGKDPK-RTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL---------TT 302
WG K+ + T + +F N+Y++HGGTNFG +G ++ T
Sbjct: 233 WGEKNHQVHTIAHVTNTFKDILDCDASF-NFYVFHGGTNFGFMNGANFIDFFSYYLPTVT 291
Query: 303 SYDYDAPIDEYGHLNQPKWGHLRE--LHKL--LKSMEKTLTYGNV 343
SYDYDAP+ E G + + K+ LR+ + KL + K YGN+
Sbjct: 292 SYDYDAPLSEAGDITE-KYMELRKIMIDKLPEIPPSSKKFHYGNI 335
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 57/126 (45%), Gaps = 19/126 (15%)
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDK 566
+ +L G N+GS + GI G V +K+L WT ++ LD
Sbjct: 408 VDILVENCGRVNFGSILNTERKGILGSVY--------ANMKELKG--WT----IFSLDFD 453
Query: 567 KFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPV--VLNLQGMGKGFAWVNGY 624
Y K N+ +G +++ NR E + ++P L L+G GKG ++NG+
Sbjct: 454 TAYVNKVRNTLKGGKTQS---NRSFVPSIYRGELEISDNPYDSFLTLEGWGKGICFINGF 510
Query: 625 NLGRYW 630
N+GRYW
Sbjct: 511 NVGRYW 516
>gi|357014284|ref|ZP_09079283.1| beta-galactosidase [Paenibacillus elgii B69]
Length = 591
Score = 178 bits (451), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 109/309 (35%), Positives = 160/309 (51%), Gaps = 42/309 (13%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DGE L+SG+IHY R P W D + K K G + +ETY+ WN HEP Q+ F G
Sbjct: 13 LDGESIRLVSGAIHYFRVVPEYWRDRLLKLKACGFNTVETYIPWNLHEPKPGQFRFDGLA 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D++RF++ + GL+VI+R PY+CAEW +GG P WL PG+ +R ++ +++ + +
Sbjct: 73 DVVRFVEIAGEVGLHVIVRPSPYICAEWEFGGLPAWLLADPGM-RVRCMHRPYLDRVDAY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC--AKMATSLDI- 214
V + + L + GGPII QIENEYG+ +D ++Y+ + A + +D+
Sbjct: 132 YD--VLLPLLKPLLCTNGGPIIAMQIENEYGSYGND-----RAYLVYLKDAMLQRGMDVL 184
Query: 215 -----GVPWIMCQESDAPSPMFTPN---------------NPNSPKIWTENWTGWFKSWG 254
G M Q P + T N P+ P + E W GWF WG
Sbjct: 185 LFTSDGPEHFMLQGGMIPGVLETVNFGSRAEEAFEMLRKYQPDGPIMCMEYWNGWFDHWG 244
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG---------PYLTTSYD 305
+ R A+D+A + G + N+YM+HGGTNFG SG P + TSYD
Sbjct: 245 EQHHTRDAKDVADVFDDMLRLGASV-NFYMFHGGTNFGYMSGANCPQRDHYEPTI-TSYD 302
Query: 306 YDAPIDEYG 314
YD P++E G
Sbjct: 303 YDVPLNESG 311
Score = 39.3 bits (90), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 40/99 (40%), Gaps = 32/99 (32%)
Query: 601 PLENDPV--VLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAY 658
P+E P L L G KG +VNG++LGRYW RGP
Sbjct: 509 PIEGQPADTFLRLDGWNKGIVYVNGFHLGRYWK----------------RGP-------- 544
Query: 659 NCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
Q ++P ++ G N +V+FE G ++ F
Sbjct: 545 ------QQTLYIPAPMLRQGDNEIVVFELHGTEKRELTF 577
>gi|301763006|ref|XP_002916929.1| PREDICTED: beta-galactosidase-1-like protein 3-like [Ailuropoda
melanoleuca]
Length = 1209
Score = 178 bits (451), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 122/356 (34%), Positives = 173/356 (48%), Gaps = 43/356 (12%)
Query: 36 ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTG 95
T+ G + ++ GSIHY R W D + K K G + + TYV WN HEP R ++DF+
Sbjct: 497 FTLGGHKFLIFGGSIHYFRVPREYWRDRLMKLKACGFNTLTTYVPWNLHEPERGKFDFSE 556
Query: 96 NLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQ 155
NLDL F+ + GL+VILR GPY+C+E + GG P WL P + LRTT K F+ +
Sbjct: 557 NLDLEAFVLMAAEIGLWVILRPGPYICSEIDLGGLPSWLLQDPEM-ILRTTYKGFVEAVD 615
Query: 156 NFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIG 215
+ ++ ++ L +GGPII Q+ENEYG+ D K Y+ + K L+ G
Sbjct: 616 KYFDHLI--SRVVPLQYHKGGPIIAVQVENEYGSFAVD-----KDYMPYVRKAL--LERG 666
Query: 216 VPWIMCQESDAPS-----------------------PMFTPNNPNSPKIWTENWTGWFKS 252
+ ++ DA + + N P + E W GWF +
Sbjct: 667 IVELLVTSDDAENLQKGYLEGVLATINMNTFEKSAFEQLSQLQRNKPIMVMEYWVGWFDT 726
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY------LTTSYDY 306
WGGK AED+ V++F +F N YM+HGGTNFG +G Y + TSYDY
Sbjct: 727 WGGKHMVNNAEDVEETVSKFITSEISF-NVYMFHGGTNFGFMNGATYFGIHRAVVTSYDY 785
Query: 307 DAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAW 362
DA + E G + K+ L+ L + + +M Y SV S Y LP W
Sbjct: 786 DALLTEAGDYTK-KYFKLQRLFRSVLAMPLPPLPELTPKAKY-PSVKPSLY-LPLW 838
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 55/178 (30%), Positives = 85/178 (47%), Gaps = 31/178 (17%)
Query: 32 DGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQY 91
+G + T+DG ++++G+IHY R W D + K K G + + T
Sbjct: 52 EGSSFTLDGSPFLIIAGTIHYFRVPREYWRDRLMKLKACGFNTVTT-------------- 97
Query: 92 DFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFM 151
F+ D GL+VIL GPY+ ++ + GG P WL P + +LRTT + F
Sbjct: 98 ---------AFVAMASDVGLWVILCPGPYIGSDLDLGGLPSWLLRDPKM-KLRTTYRGFT 147
Query: 152 NEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMA 209
+ + I+ K +L +GGPII Q+ENEYG+ D K Y+ + K+A
Sbjct: 148 KAVNLYFDKII--PKIVQLQYGKGGPIIALQVENEYGSYHQD-----KRYMPYIKKLA 198
>gi|147778844|emb|CAN67049.1| hypothetical protein VITISV_001154 [Vitis vinifera]
Length = 317
Score = 178 bits (451), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 116/311 (37%), Positives = 154/311 (49%), Gaps = 47/311 (15%)
Query: 510 LSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFY 569
+ + NYG+ + G G V L G E DLS + WTY+VGL G K +
Sbjct: 18 IHGEIAAGNYGAFLEKDGAGFKGQVKLTGFKNGEI---DLSEYSWTYQVGLRGEFQKIYM 74
Query: 570 NAKAANSERGWSSKNVPLN-RRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGR 628
++ +E W+ + TWYKT F+AP +PV L+L MGKG AWVNG+++GR
Sbjct: 75 IDESEKAE--WTDLTPDASPSTFTWYKTFFDAPNGENPVALDLGSMGKGQAWVNGHHIGR 132
Query: 629 YWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEF 688
YW T +A +DGC CDYRG Y + K YH+PRSW++ N LVLFEE
Sbjct: 133 YW-TRVAPKDGCG--KCDYRGHYHTSK------------YHIPRSWLQASNNLLVLFEET 177
Query: 689 GGNPSQINFQTVVVGTACGQAHENK-----------------------TMELTC-HGRRI 724
GG P +I+ ++ T C + E+ M L C G I
Sbjct: 178 GGKPFEISVKSRSTQTICAEVSESHYPSLQNWSPSDFIDQNSKNKMTPEMHLQCDDGHTI 237
Query: 725 SEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEANLGATSCAAG 784
S I++AS+G PQG+C F +G C A + L L+ K C GK SC I + G C G
Sbjct: 238 SSIEFASYGTPQGSCQMFSQGQCHAP-NSLALVSKACQGKGSCVIRILNSAFGGDPC-RG 295
Query: 785 TVKRLVVEALC 795
VK L VEA C
Sbjct: 296 IVKTLAVEAKC 306
>gi|258538519|ref|YP_003173018.1| beta-galactosidase [Lactobacillus rhamnosus Lc 705]
gi|385834266|ref|YP_005872040.1| beta-galactosidase family protein [Lactobacillus rhamnosus ATCC
8530]
gi|257150195|emb|CAR89167.1| Beta-galactosidase (GH35) [Lactobacillus rhamnosus Lc 705]
gi|355393757|gb|AER63187.1| beta-galactosidase family protein [Lactobacillus rhamnosus ATCC
8530]
Length = 593
Score = 178 bits (451), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 173/652 (26%), Positives = 262/652 (40%), Gaps = 156/652 (23%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + H+ +DG+ +LSG+IHY R P W + K G + +ETYV WN HE
Sbjct: 4 FSIDHE---FMLDGKPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEY 60
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
++DF+G LD+ RF+KT +D GLY I+R PY+CAEW +GGFP WL + LRT
Sbjct: 61 REGEFDFSGILDIERFLKTAEDLGLYAIVRPSPYICAEWEFGGFPAWL--LTKKMRLRTD 118
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
+ ++ + + T ++ ++ + GG +I+ Q+ENEYG+ D + Y+ A
Sbjct: 119 DPAYLAAIDRYYTALMPHLVDHQV--THGGNVIMMQVENEYGSYGED-----QDYLAAVA 171
Query: 207 KMATSLDIGVPWIMCQESDAPSP------------MFTPNNPNS---------------- 238
K+ + VP SD P P + N S
Sbjct: 172 KLMQQHGVDVPLFT---SDGPWPATLNAGSMIDAGILATGNFGSAADKNFDRLAAFHQEH 228
Query: 239 ----PKIWTENWTGWFKSWGG----KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTN 290
P + E W GWF WG +DP TAEDL + R G+ N YM+HGGTN
Sbjct: 229 GRDWPLMCMEFWDGWFNRWGEPIIRRDPDETAEDLRAVIKR-----GSV-NLYMFHGGTN 282
Query: 291 FGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN 350
FG +G + D+D P VT+ DY
Sbjct: 283 FGFMNG---TSARKDHDLP--------------------------------QVTSYDYDA 307
Query: 351 SVSGSSYNLPAW-SVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEM 409
++ P + ++ + + E AK + +P A PL K
Sbjct: 308 PLNEQGNPTPKYFAIQKMIHEELPEVQQAK-------PLVKPTMAPASH-PLTAK----- 354
Query: 410 INDFVVRGKGHFALNTLIDQ------KSTNDVSDYLWYMTNADLKDDDPILSGSSNMT-- 461
+L ++DQ S ++L T L P++SG+ T
Sbjct: 355 -----------VSLFAVLDQLAKPIAASYPQTQEFLGQYTGYTLYRTQPLISGTDKGTPA 403
Query: 462 -LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYG 520
LR+ + + AY++ ++ +Q+ + + L G +Q+ LL + NYG
Sbjct: 404 KLRVIDARDRVQAYLDQKWLATQYQEAIGDDILLPE----VEGHHQLDLLVENMSRVNYG 459
Query: 521 SKFDMVPN--GIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSER 578
SK + + GI V++ D IK Y LD + A
Sbjct: 460 SKIEAITQFKGIRTGVMV-----DLHFIKGYQQ---------YPLDLNR---ASRLTFTE 502
Query: 579 GWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
GW +YK TF+ D L+ G GKG VNG N+GR+W
Sbjct: 503 GWQPATP------AFYKYTFDLTAPQD-TYLDCHGFGKGVMLVNGVNVGRFW 547
>gi|297194215|ref|ZP_06911613.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
gi|197722531|gb|EDY66439.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
Length = 590
Score = 178 bits (451), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 110/337 (32%), Positives = 156/337 (46%), Gaps = 44/337 (13%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+RVS +G ++DG LLSG++HY R P WP ++ + GLD +ETYV WN HEP
Sbjct: 2 FRVSTEG--FSLDGRPLRLLSGALHYFRVLPEQWPHRLRMLRAMGLDTVETYVPWNLHEP 59
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+YDF G DL RF+ ++ GL+ I+R PY+CAEW GG P WL P + LR
Sbjct: 60 RPGEYDFDGIADLDRFLHATREAGLHAIVRPSPYICAEWENGGLPWWLLADPEVGALRCQ 119
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
+ ++ + + ++ + ++ S+GG +++ Q+ENEYG+ +D G Y+ A
Sbjct: 120 DPAYLAHVDRWFDRLIPVVAAHQV--SRGGNVLMVQVENEYGSYGTDTG-----YLEHLA 172
Query: 207 KMATSLDIGVPWIMCQESD--------APSPMFTPN---------------NPNSPKIWT 243
+ I VP D P + T N P+ P +
Sbjct: 173 AGLRARGIDVPLFTSDGPDDFFLTGGALPGHLATVNFGSRPKEALADLARLRPDDPAMCM 232
Query: 244 ENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY---- 299
E W GWF WG R D A + G + N YM HGGTNF +G
Sbjct: 233 EFWCGWFDHWGTDHVVRDPADAAGVLEELLAAGASV-NVYMAHGGTNFSTWAGANTEDPA 291
Query: 300 -------LTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
TSYDYDAP+DE G + W L +
Sbjct: 292 AGTGYRPTVTSYDYDAPVDERGAATEKFWAFREVLER 328
>gi|387878583|ref|YP_006308886.1| Beta-galactosidase 3 [Streptococcus parasanguinis FW213]
gi|386792040|gb|AFJ25075.1| Beta-galactosidase 3 [Streptococcus parasanguinis FW213]
Length = 595
Score = 178 bits (451), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 184/702 (26%), Positives = 278/702 (39%), Gaps = 190/702 (27%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+ G+ +LSG+IHY R P W + K G + +ETYV WN HEP + Q+DF+G L
Sbjct: 12 LKGQPFKILSGAIHYFRIDPADWYHSLFNLKALGFNTVETYVPWNVHEPRKGQFDFSGRL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEE---LRTTNKVFMNEM 154
DL RFI+ Q GLY+I+R P++CAEW +GG P WL +EE +R+++ F+ +
Sbjct: 72 DLERFIQIAQSLGLYMIVRPSPFICAEWEFGGLPAWL-----LEEDMRIRSSDPAFIEAV 126
Query: 155 QNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDI 214
+ ++ + + ++ QGGPI++ Q+ENEYG+ D K Y+ + +
Sbjct: 127 DRYYDHLLGLLTRYQV--DQGGPILMMQVENEYGSYGED-----KVYLRAIRDLMKKKGV 179
Query: 215 GVPWIMCQESDAP------------SPMFTPNNPNS--------------------PKIW 242
P SD P +F N S P +
Sbjct: 180 TCPLFT---SDGPWRATLRAGTLIEDDLFVTGNFGSKAAYNFGQMQEFFDEYGKKWPLMC 236
Query: 243 TENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-- 300
E W GWF W +R E+LA AV + G N YM+HGGTNFG +G
Sbjct: 237 MEFWDGWFTRWKEPVIQREPEELAEAVHEVLELGSI--NLYMFHGGTNFGFMNGCSARGT 294
Query: 301 -----TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG-NSVSG 354
TSYDY A ++E GN T Y +
Sbjct: 295 LDLPQVTSYDYGALLNEQ---------------------------GNPTEKYYAIQKMMA 327
Query: 355 SSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGN--DQAPLQWKWRPEMIND 412
+ Y ++ +C E QT V + + GN + A ++ PE + +
Sbjct: 328 TYYPEYPQQEPLIKECLPE---------QTLQLVAKTSLFGNLDNLAQVETSLYPEKMEE 378
Query: 413 FVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLH 472
+ YL Y T+ +L ++ LR+ +
Sbjct: 379 L-------------------GQTTGYLLYETDLELDAEEE--------RLRVIDGRDRVQ 411
Query: 473 AYVNGNYVDSQW-TKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF--DMVPNG 529
Y++ +V +Q+ T+ G DLF + K + + +L +G NYG K D G
Sbjct: 412 IYLDDRHVATQYQTEIG--EDLFIKGKK--KAVTNLKILLENMGRVNYGHKLLADSQHKG 467
Query: 530 IPGPVLLVGRAGDETIIKDLSSH-KW-TYKVGLYGLDDKKFYNAKAANSERGWSSKNVPL 587
I R G + DL H W Y + L L F A +
Sbjct: 468 I--------RTG---VCVDLHFHLHWKQYPLDLQDLSQLDFSKEWQAGAP---------- 506
Query: 588 NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDY 647
+Y+ F+ D L++ G GKG A+VNG+NLGR+W
Sbjct: 507 ----AFYRYDFQLDQTLD-TYLDMTGFGKGVAFVNGHNLGRFWEV--------------- 546
Query: 648 RGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
GP S +VP ++K+G N+L++FE G
Sbjct: 547 -GPTTS--------------LYVPHGFLKEGANSLIVFETEG 573
>gi|294633111|ref|ZP_06711670.1| beta-galactosidase [Streptomyces sp. e14]
gi|292830892|gb|EFF89242.1| beta-galactosidase [Streptomyces sp. e14]
Length = 606
Score = 177 bits (450), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 154/324 (47%), Gaps = 41/324 (12%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A ++H G + G +LSGS+HY R PG W D + + GL+ ++TYV WN HE
Sbjct: 14 AATLTHAGGTLLRAGRPHRILSGSLHYFRVHPGQWADRLARLAALGLNTVDTYVPWNFHE 73
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
F G DL RF++ Q+ GL VI+R GPY+CAEW+ GG P WL PG+ RT
Sbjct: 74 RTPGDVRFDGWRDLDRFVRLAQETGLDVIVRPGPYICAEWDNGGLPAWLTGTPGMRP-RT 132
Query: 146 TNKVFMNEMQN-FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW 204
++ F+ + F LI +A L A +GGP++ QIENEYG+ YGD G Y+ W
Sbjct: 133 SHPPFLAAVARWFDQLIPRIA---ALQAGRGGPVVAVQIENEYGS----YGDDG-DYVRW 184
Query: 205 CAKMATS-----------------LDIG------VPWIMCQESDAPSPMFTPNNPNSPKI 241
T+ LD G + + + P P
Sbjct: 185 VRDALTARGVTELLYTADGPTELMLDAGAVEGELAAATFGSRPEQAARLLRSRRPEEPFF 244
Query: 242 WTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY-- 299
E W GWF WG + R A A V R GG+ + YM HGGTNFG +G +
Sbjct: 245 CAEFWNGWFDHWGEQHHVRPARSAADDVGRILGAGGSL-SLYMAHGGTNFGLWAGANHDG 303
Query: 300 -----LTTSYDYDAPIDEYGHLNQ 318
TSYD DAP+ E+G L +
Sbjct: 304 DRLQPTVTSYDSDAPVAEHGALTE 327
>gi|337283005|ref|YP_004622476.1| beta-galactosidase [Streptococcus parasanguinis ATCC 15912]
gi|335370598|gb|AEH56548.1| beta-galactosidase [Streptococcus parasanguinis ATCC 15912]
Length = 595
Score = 177 bits (450), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 183/700 (26%), Positives = 279/700 (39%), Gaps = 186/700 (26%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+ G+ +LSG+IHY R P W + K G + +ETYV WN HEP + Q+DF+G L
Sbjct: 12 LKGQPFKILSGAIHYFRIDPADWYHSLFNLKALGFNTVETYVPWNVHEPRKGQFDFSGRL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEE---LRTTNKVFMNEM 154
DL RFI+T Q GLY+I+R P++CAEW +GG P WL +EE +R+++ F+ +
Sbjct: 72 DLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWL-----LEEDMRIRSSDPAFIEAV 126
Query: 155 QNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDI 214
+ ++ + ++ QGGPI++ Q+ENEYG+ D K+Y+ + +
Sbjct: 127 DRYYDHLLGLLTPYQV--DQGGPILMMQVENEYGSYGED-----KAYLRAIRDLMKKKGV 179
Query: 215 GVPWIMCQESDAP------------SPMFTPNNPNS--------------------PKIW 242
P SD P +F N S P +
Sbjct: 180 TCPLFT---SDGPWRAALRAGTLIEEDLFVTGNFGSKAAYNFGQMQEFFDEYGKKWPLMC 236
Query: 243 TENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-- 300
E W GWF W +R E+LA AV + G N YM+HGGTNFG +G
Sbjct: 237 MEFWDGWFTRWKEPVIQREPEELAEAVHEVLELGSI--NLYMFHGGTNFGFMNGCSARGT 294
Query: 301 -----TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG-NSVSG 354
TSYDY A ++E GN T Y +
Sbjct: 295 LDLPQVTSYDYGALLNEQ---------------------------GNPTEKYYAIQKMMA 327
Query: 355 SSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFV 414
+ Y ++ +C E+ T ++ +T++ N A ++ PE + +
Sbjct: 328 TYYPEYPQQEPLIKECLPEQ--TLQLAAKTSLFGNLDNLA-----QVETSLYPEKMEEL- 379
Query: 415 VRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAY 474
+ YL Y T+ +L ++ LRI + Y
Sbjct: 380 ------------------GQTTGYLLYETDLELDAEEE--------KLRIIDGRDRVQIY 413
Query: 475 VNGNYVDSQW-TKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF--DMVPNGIP 531
++ +V +Q+ T+ G DLF + K + + +L +G NYG K D GI
Sbjct: 414 LDDRHVATQYQTEIG--EDLFIKGKK--KAVTNLKILLENMGRVNYGHKLLADSQHKGI- 468
Query: 532 GPVLLVGRAGDETIIKDLSSH-KW-TYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNR 589
R G + DL H W Y + L L F A +
Sbjct: 469 -------RTG---VCVDLHFHLHWKQYPLDLQDLSQLDFSKEWQAGAP------------ 506
Query: 590 RMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRG 649
+Y+ F+ D L++ G GKG +VNG+NLGR+W G
Sbjct: 507 --AFYRYDFQLDHTLD-TYLDMTGFGKGVVFVNGHNLGRFWEV----------------G 547
Query: 650 PYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
P S +VP ++K+G N+L++FE G
Sbjct: 548 PTTS--------------LYVPHGFLKEGANSLIVFETEG 573
>gi|187736173|ref|YP_001878285.1| beta-galactosidase [Akkermansia muciniphila ATCC BAA-835]
gi|187426225|gb|ACD05504.1| Beta-galactosidase [Akkermansia muciniphila ATCC BAA-835]
Length = 780
Score = 177 bits (450), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 163/630 (25%), Positives = 252/630 (40%), Gaps = 142/630 (22%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ ++SG +HYPR W D ++ K G++ + TY+FWN HEP ++DF+GNL
Sbjct: 43 MDGKPVKIISGEMHYPRVPRQHWKDRFQRIKAMGMNTVCTYLFWNVHEPEPGKWDFSGNL 102
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D + FIK Q GL+VI+R GPYVCAEW +GGFP WL + ++R+ + F+ +
Sbjct: 103 DFVEFIKEAQKAGLWVIVRPGPYVCAEWEFGGFPGWLLKDEDL-KVRSQDPRFLEPAMAY 161
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ M E L ++GGPII+AQ+ENEYG+ SD K Y+ K + +P
Sbjct: 162 LKKVCSML--EPLQITKGGPIIMAQVENEYGSYGSD-----KDYVK---KHLDVIRKELP 211
Query: 218 WIMCQESDAPS---------PMFTP-----------------NNPNSPKIWTENWTGWFK 251
++ SD P+ P P + +P+I E W GWF
Sbjct: 212 GVVPFTSDGPNDWMIKNGTLPGVVPAMNFGGGAKGAFANLEKHKGKTPRINGEFWVGWFD 271
Query: 252 SWGGKDPKRTAEDLAFAV-ARFFQFGGTFQNYYMYHGGTNFGRTSG----GPYL--TTSY 304
WG PK F ++ N +M HGGT+FG +G G Y T+Y
Sbjct: 272 HWG--KPKNGGSTEGFNRDLKWMLENNVSPNLFMAHGGTSFGFMNGANWEGAYTPDVTNY 329
Query: 305 DYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSV 364
DY API E G L ++ T+ YG+ +Y LP
Sbjct: 330 DYGAPISENGTLT-----------------DRYRTFRQTIQDYYGD-----TYKLPE--- 364
Query: 365 SILPDCKTEEFNTAKVN-TQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFAL 423
P + E + T+T R Q + P
Sbjct: 365 ---PPAQPEMMELPPITFTETAGMFSRLPQPVIRKEP----------------------- 398
Query: 424 NTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQ 483
+ ++ ++ Y T + G L++N+ YV+G
Sbjct: 399 ---VHMEALGQSLGFILYRTKVN---------GPVKGELKMNNMQDRAIVYVDGK----- 441
Query: 484 WTKYGASNDLFER---PVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRA 540
+ GA++ +++ + + G + + + +G N+G + GI GP+ L G+
Sbjct: 442 --RQGAADRRYKQDSCDIVIPSGLHTVDIFVENMGRINFGGQIQGERKGIRGPITLDGKK 499
Query: 541 GDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEA 600
+ +I YN E S P + +++ F
Sbjct: 500 LENFLI----------------------YNFPCKGVELIPFSGKKPAGDQPVFHRGYFNV 537
Query: 601 PLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
D + G KG WVNG NLGR+W
Sbjct: 538 SNPKDTYLDMRDGWKKGVVWVNGRNLGRFW 567
>gi|229553373|ref|ZP_04442098.1| beta-galactosidase [Lactobacillus rhamnosus LMS2-1]
gi|229313254|gb|EEN79227.1| beta-galactosidase [Lactobacillus rhamnosus LMS2-1]
Length = 583
Score = 177 bits (450), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 172/641 (26%), Positives = 258/641 (40%), Gaps = 153/641 (23%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +LSG+IHY R P W + K G + +ETYV WN HE ++DF+G L
Sbjct: 2 LDGKPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYREGEFDFSGIL 61
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+ RF+KT +D GLY I+R PY+CAEW +GGFP WL + LRT + ++ + +
Sbjct: 62 DIERFLKTAEDLGLYAIVRPSPYICAEWEFGGFPAWL--LTKKMRLRTDDPAYLAAIDRY 119
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
T ++ ++ + GG +I+ Q+ENEYG+ D + Y+ AK+ + VP
Sbjct: 120 YTALMPHLVDHQV--THGGNVIMMQVENEYGSYGED-----QDYLAAVAKLMQQHGVDVP 172
Query: 218 WIMCQESDAPSP------------MFTPNNPNS--------------------PKIWTEN 245
SD P P + N S P + E
Sbjct: 173 LFT---SDGPWPATLNAGSMIDAGILATGNFGSAADKNFDRLAAFHQEHGRDWPLMCMEF 229
Query: 246 WTGWFKSWGG----KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT 301
W GWF WG +DP TAEDL + R G+ N YM+HGGTNFG +G +
Sbjct: 230 WDGWFNRWGEPIIRRDPDETAEDLRAVIKR-----GSV-NLYMFHGGTNFGFMNG---TS 280
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPA 361
D+D P VT+ DY ++ P
Sbjct: 281 ARKDHDLP--------------------------------QVTSYDYDAPLNEQGNPTPK 308
Query: 362 W-SVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGH 420
+ ++ + + E AK + +P A PL K
Sbjct: 309 YFAIQKMIHEELPEVQQAK-------PLVKPTMAPASH-PLTAK---------------- 344
Query: 421 FALNTLIDQ------KSTNDVSDYLWYMTNADLKDDDPILSGSSNMT---LRINSSGQVL 471
+L ++DQ S ++L T L P++SG+ T LR+ + +
Sbjct: 345 VSLFAVLDQLAKPIAASYPQTQEFLGQYTGYTLYRTQPLISGTDKGTPAKLRVIDARDRV 404
Query: 472 HAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPN--G 529
AY++ ++ +Q+ + + L G +Q+ LL + NYGSK + + G
Sbjct: 405 QAYLDQKWLATQYQEAIGDDILLPE----VEGHHQLDLLVENMSRVNYGSKIEAITQFKG 460
Query: 530 IPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNR 589
I V++ D IK Y LD + A GW
Sbjct: 461 IRTGVMV-----DLHFIKGYQQ---------YPLDLNR---ASRLTFTEGWQPATP---- 499
Query: 590 RMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
+YK TF+ D L+ G GKG VNG N+GR+W
Sbjct: 500 --AFYKYTFDLTAPQD-TYLDCHGFGKGVMLVNGVNVGRFW 537
>gi|332264040|ref|XP_003281056.1| PREDICTED: beta-galactosidase-1-like protein 3 [Nomascus
leucogenys]
Length = 655
Score = 177 bits (450), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 113/332 (34%), Positives = 170/332 (51%), Gaps = 46/332 (13%)
Query: 36 ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTG 95
T++G + ++ GSIHY R W D + K K G + + TYV WN HEP R ++DF+G
Sbjct: 80 FTLEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSG 139
Query: 96 NLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQ 155
N+DL F+ + GL+VILR GPY+C+E + GG P WL P + LRTTNK F+ ++
Sbjct: 140 NMDLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPQL-LLRTTNKGFIEAVE 198
Query: 156 NFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIG 215
+ ++ + L QGGP+I Q+ENEYG+ D K+Y+ + K L G
Sbjct: 199 KYFDHLI--PRVIPLQYRQGGPVIAVQVENEYGSFNKD-----KTYMPYLHKAL--LRRG 249
Query: 216 VPWIMCQESDAPSPMFTPNNP------------------------NSPKIWTENWTGWFK 251
+ ++ SD + + + + P + E W GWF
Sbjct: 250 IVELLLT-SDGEKHVLSGHTKGVLAAINLQKLHQNTFSQLHKVQRDKPLLIMEYWVGWFD 308
Query: 252 SWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY------LTTSYD 305
WG K + A+++ AV+ F ++ +F N YM+HGGTNFG +G Y + TSYD
Sbjct: 309 RWGDKHHVKDAKEVEHAVSEFIKYEISF-NVYMFHGGTNFGFMNGATYFGKHTGIVTSYD 367
Query: 306 YDAPIDEYGHLNQPKWGHLRELHKLLKSMEKT 337
YDA + E G + + +L KL +S+ T
Sbjct: 368 YDAVLTEAGDYTEKYF----KLQKLFESVSAT 395
>gi|322437493|ref|YP_004219583.1| glycoside hydrolase family protein [Granulicella tundricola
MP5ACTX9]
gi|321165386|gb|ADW71089.1| glycoside hydrolase family 35 [Granulicella tundricola MP5ACTX9]
Length = 607
Score = 177 bits (450), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 109/333 (32%), Positives = 162/333 (48%), Gaps = 39/333 (11%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+R++ D + +DG+ L+SG +HYPR W D ++KA+ GL+A+ Y FWN HE
Sbjct: 24 HRLTTDPQHFLLDGQPFQLISGEMHYPRIPRAAWRDRLRKARAMGLNAVTVYAFWNFHEE 83
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+DFTG D+ F++ Q +GL+VILR GPYVCAEW+ GG+P WL P + LR+
Sbjct: 84 EEGHFDFTGQRDIAEFVRIAQQEGLFVILRPGPYVCAEWDLGGYPSWLLKSPAV-NLRSL 142
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
+ ++ + + + L A++GGPI+ Q+ENEYG+ ++Y++
Sbjct: 143 DSRYIAAADKWMKAL--GQQLAPLQAAKGGPILAVQVENEYGSFPDSAQPNAQAYLDRVH 200
Query: 207 KMATSLDIGVPWIMCQESDAPS-------------------------PMFTPNNPNSPKI 241
+M LD G + D ++ PN+
Sbjct: 201 QMV--LDAGFKDSLLYTGDGADVLARGTFADLTAGIDYGTGDSARSIALYKKFRPNTNIY 258
Query: 242 WTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY-- 299
E W GWF WG K A V GG+ + YM HGGT+FG +G
Sbjct: 259 TAEYWDGWFDHWGAKHEVVDASIHLKEVHDVLTSGGSI-SLYMLHGGTSFGWMNGANIDH 317
Query: 300 -----LTTSYDYDAPIDEYGHLNQPKWGHLREL 327
TSYDYDAPIDE G L +P++ +R++
Sbjct: 318 NHYEPDVTSYDYDAPIDEAGQL-RPEYFAMRKV 349
>gi|334138027|ref|ZP_08511451.1| beta-galactosidase [Paenibacillus sp. HGF7]
gi|333604560|gb|EGL15950.1| beta-galactosidase [Paenibacillus sp. HGF7]
Length = 601
Score = 177 bits (450), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 179/672 (26%), Positives = 275/672 (40%), Gaps = 147/672 (21%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
++SG++HY R P W D + K K G + +ETYV WN HEP ++DF G D+I F++
Sbjct: 20 IISGALHYFRVVPEYWRDRLLKMKACGCNTVETYVAWNVHEPEEGKFDFGGIADVIAFVE 79
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
+ GL+VI+R PY+CAEW +GG P WL ++ LR ++ F+ ++ + ++ +
Sbjct: 80 LAGELGLHVIVRPSPYICAEWEFGGLPAWLLKDSEMQ-LRCSDPKFLAKVDAYYDVL--L 136
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC--AKMATSLDI------GV 216
K L + GGPII Q+ENEYG+ +D K+Y+ + +A +D+ G
Sbjct: 137 PKFVPLLCTNGGPIIAMQVENEYGSYGND-----KAYLGYLRDGMIARGIDVLLFTSDGP 191
Query: 217 PWIMCQESDAPSPMFTPN---------------NPNSPKIWTENWTGWFKSWGGKDPKRT 261
M Q P + T N P+ P + E W GWF W + R
Sbjct: 192 TDEMLQGGTLPDVLATVNFGSRPEESFAKFREYRPDEPLMCMEFWNGWFDHWMEEHHTRD 251
Query: 262 AEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKW 321
ED A + G + N+YM+HGGTNFG SG ++ T Y+ + Y + + P
Sbjct: 252 GEDAARVLDDMLGAGASV-NFYMFHGGTNFGFYSGANHIKT---YEPTVTSYDY-DAP-- 304
Query: 322 GHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVN 381
L E L E + V + G S S LP S + ++ E
Sbjct: 305 --LTERGDLTAKYE---AFREVISKHEGESGSALPEPLPVRSYGEVKMTESAELFA---- 355
Query: 382 TQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWY 441
Q G P++ + PE + G+ + ++ Y
Sbjct: 356 -----------QLGKLSQPVR-RVTPEPMEKL---GQNY----------------GFILY 384
Query: 442 MTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLT 501
T+ + L I +++G+Y+ GA RP+KL
Sbjct: 385 STH--------VTGPRRGQELHIQDVRDRAQVFLDGSYI-------GAVERWDVRPLKLD 429
Query: 502 --RGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETI-IKDLSSHKW-TYK 557
G ++ +L +G NYG P+L + E + + + + W Y
Sbjct: 430 VPAGGARLDILVENMGRVNYG------------PLLRDHKGITEGVRLDNQFQYGWDIYP 477
Query: 558 VGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKG 617
+ L L+ +F AA E + P +Y+ FEA D L L+G KG
Sbjct: 478 LPLDSLEGLEF--GTAAGPEDADVTGERP-----AFYRGFFEAEEAAD-TFLRLEGWTKG 529
Query: 618 FAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKD 677
A+VNG+NLGRYW RGP S +VP ++
Sbjct: 530 VAYVNGFNLGRYWE----------------RGPQKS--------------LYVPGPLLRK 559
Query: 678 GVNTLVLFEEFG 689
G N +VLFE G
Sbjct: 560 GTNEIVLFELHG 571
>gi|384209874|ref|YP_005595594.1| beta-galactosidase [Brachyspira intermedia PWS/A]
gi|343387524|gb|AEM23014.1| beta-galactosidase [Brachyspira intermedia PWS/A]
Length = 592
Score = 177 bits (449), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 176/642 (27%), Positives = 268/642 (41%), Gaps = 137/642 (21%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
+ V D ++G+ LLSG+IHY R W D + K G + +ETY+ WN HE
Sbjct: 3 VFEVKED---FILNGKPIKLLSGAIHYFRFVEEYWEDCLYNLKAAGFNTVETYIPWNIHE 59
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
+DF+GN D+ FIK Q L VILR PY+CAEW +GG P WL ++ +RT
Sbjct: 60 IDEGVFDFSGNKDIASFIKLAQKMDLLVILRPTPYICAEWEFGGLPAWLLRYDNMK-VRT 118
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYG-------------NVMS 192
++F++++ + + + L ++ GP+I+ QIENEYG N+M
Sbjct: 119 NTELFLSKVDAYYKEL--FKQIADLQITRNGPVIMMQIENEYGSFGNDKEYLKALKNLMV 176
Query: 193 DYGDAGKSYIN---WCAKM--ATSLDIGVPWIM-----CQES-DAPSPMFTPNNPNSPKI 241
+G + + W A + T +D G+ + +ES DA F +P +
Sbjct: 177 KHGAEVPLFTSDGAWDAVLEAGTLVDDGILATVNFGSQAKESFDATEKFFERKGIKNPLM 236
Query: 242 WTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL- 300
E W GWF W KR A+D V + G N YM+ GGTNFG +G
Sbjct: 237 CMEFWDGWFNLWKEPIIKRDADDFIMEVKEIIKRGSI--NLYMFIGGTNFGFYNGTSVTG 294
Query: 301 ------TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSG 354
TSYDYDA + E+G + +L KL+ +
Sbjct: 295 YTDFPQITSYDYDAVLTEWGEPTE----KFYKLQKLINEL-------------------- 330
Query: 355 SSYNLPAWSVSILPDCKTEEFNTAKVNTQTNV--KVKRPNQAGNDQAPLQWKWRPEMIND 412
P D K +F AK+ +T++ + + ++ AP+ +
Sbjct: 331 ----FPEIKTFEPRDHKRADFGEAKLKDKTSLFSVIDKISKCQKSDAPITME-------- 378
Query: 413 FVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLH 472
G+G+ Y+ Y T D+ NM +R + +H
Sbjct: 379 --KAGRGY----------------GYMLYRTTVKGFDN--------NMNVRAVGASDRVH 412
Query: 473 AYVNGNYVDSQWTKYGASNDLFERPVKL--TRGKNQISLLSATVGLQNYGSKFDMVPN-- 528
Y+NG Y + KY ++L E P+++ G N + LL VG NYG K
Sbjct: 413 FYLNGEY---KGVKY--QDELIE-PIEMHFNNGDNVLELLVENVGRVNYGYKLQECSQVK 466
Query: 529 GIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLN 588
GI R G ++ D+ + L LD N K + W +N P
Sbjct: 467 GI--------RIG---VMADIHFETGWEQYAL-PLD-----NIKDVDFSSKW-IENTP-- 506
Query: 589 RRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
++Y+ F+ D L+ +GKG A++NG+NLGRYW
Sbjct: 507 ---SFYRYEFDVKEPAD-TFLDCSKLGKGAAFINGFNLGRYW 544
>gi|398787680|ref|ZP_10550020.1| beta-galactosidase [Streptomyces auratus AGR0001]
gi|396992782|gb|EJJ03876.1| beta-galactosidase [Streptomyces auratus AGR0001]
Length = 603
Score = 177 bits (449), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 119/366 (32%), Positives = 184/366 (50%), Gaps = 57/366 (15%)
Query: 33 GRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYD 92
G+ +DG+ +LSG+ HY R+ P W D + + + GL+ +ETYV WN H+P ++ D
Sbjct: 31 GKEFLLDGKPFRILSGAFHYFRTHPQDWRDRLMRMRAMGLNTVETYVAWNFHQPDEKEAD 90
Query: 93 FTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWL---HNMPGIEELRTTNKV 149
FTG D++ F++T + GL VI+R GPY+CAEW++GG P WL + P LR ++
Sbjct: 91 FTGWRDVVAFVRTADEVGLKVIVRPGPYICAEWDFGGLPAWLLKDKDAP----LRRSDPA 146
Query: 150 FMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYG----------DAGK 199
F + + + + + L A++GGPII Q+ENEYG+ D+ G
Sbjct: 147 FERAVDAWFAEL--LPRFVDLQATRGGPIIAMQVENEYGSYGDDHAYLEHLRDTMRAQGI 204
Query: 200 SYINWCAKMAT--SLDIG-VPWIMCQ-----ESDAPSPMFTPNNPNSPKIWTENWTGWFK 251
+ +C+ AT +L G +P ++ + P P+ P TE W GWF
Sbjct: 205 DGLLFCSNGATQEALKAGSLPDLLSTVNFGGDPTGPFAELRAFQPDKPLFCTEFWDGWFD 264
Query: 252 SWGGK----DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY-------L 300
WG + DP +TA D V + + G + N+YM GGTNFG ++G
Sbjct: 265 HWGERHRTTDPAQTAAD----VEKMLEAGASI-NFYMAVGGTNFGWSAGANLSGSGYQPT 319
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLP 360
TSYDYD+PI E G L + + HK+ + K Y + NT + + + +P
Sbjct: 320 VTSYDYDSPISESGELTE-------KFHKVRDVLGK---YTTLPNT----PLPATPHRMP 365
Query: 361 AWSVSI 366
A V++
Sbjct: 366 AQRVAV 371
>gi|12852936|dbj|BAB29584.1| unnamed protein product [Mus musculus]
Length = 586
Score = 177 bits (448), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 111/311 (35%), Positives = 163/311 (52%), Gaps = 37/311 (11%)
Query: 44 ILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFI 103
+++ GSIHY R W D + K + G + + TY+ WN HE R ++DF+ LDL ++
Sbjct: 1 MIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEILDLEAYV 60
Query: 104 KTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVD 163
+ GL+VILR GPY+CAE + GG P WL P + +LRTTNK F+ + + ++
Sbjct: 61 LLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNP-VTDLRTTNKGFIEAVDKYFDHLI- 118
Query: 164 MAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQE 223
K L GGP+I Q+ENEYG+ D ++Y+N+ K I + +
Sbjct: 119 -PKILPLQYRHGGPVIAVQVENEYGSFQKD-----RNYMNYLKKALLKRGIVELLLTSDD 172
Query: 224 SD-----APSPMFTPNNPNS----------------PKIWTENWTGWFKSWGGKDPKRTA 262
D + + T N NS P + E WTGW+ SWG K +++A
Sbjct: 173 KDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYDSWGSKHIEKSA 232
Query: 263 EDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY------LTTSYDYDAPIDEYGHL 316
E++ V +F +G +F N YM+HGGTNFG +GG Y + TSYDYDA + E G
Sbjct: 233 EEIRHTVYKFISYGLSF-NMYMFHGGTNFGFINGGRYENHHISVVTSYDYDAVLSEAGDY 291
Query: 317 NQPKWGHLREL 327
+ K+ LR+L
Sbjct: 292 TE-KYFKLRKL 301
>gi|427385726|ref|ZP_18882033.1| hypothetical protein HMPREF9447_03066 [Bacteroides oleiciplenus YIT
12058]
gi|425726765|gb|EKU89628.1| hypothetical protein HMPREF9447_03066 [Bacteroides oleiciplenus YIT
12058]
Length = 1106
Score = 177 bits (448), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 112/316 (35%), Positives = 150/316 (47%), Gaps = 29/316 (9%)
Query: 35 AITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFT 94
+ ++G+ ++ + +HYPR W IK K G++ + YVFWN+HEP YDFT
Sbjct: 356 SFLLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGTYDFT 415
Query: 95 GNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEM 154
DL F + Q +YVILR GPYVCAEW GG P WL I LR ++ F+ +
Sbjct: 416 EQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDI-RLRESDPYFIERV 474
Query: 155 QNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYG---------------DAGK 199
F + K L + GGPII+ Q+ENEYG+ +D G D
Sbjct: 475 NLFEEAVAKQVK--DLTIANGGPIIMVQVENEYGSYGADKGYVSQIRDIVRTHFGNDIAL 532
Query: 200 SYINWCAKMATSLDIGVPWIM----CQESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGG 255
+W + + + W M D PNSP + +E W+GWF WG
Sbjct: 533 FQCDWASNFTLNGLDDLIWTMNFGTGANVDQQFAKLKKLRPNSPLMCSEFWSGWFDKWGA 592
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYL---TTSYDYDAPI 310
R AED+ + G +F + YM HGGTN+G +G P TSYDYDAPI
Sbjct: 593 NHETRPAEDMIKGIDDMLSRGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAPI 651
Query: 311 DEYGHLNQPKWGHLRE 326
E G PK+ LRE
Sbjct: 652 SESGQ-TTPKYWKLRE 666
>gi|403528012|ref|YP_006662899.1| beta-galactosidase GLB [Arthrobacter sp. Rue61a]
gi|403230439|gb|AFR29861.1| beta-galactosidase GLB [Arthrobacter sp. Rue61a]
Length = 598
Score = 177 bits (448), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 117/319 (36%), Positives = 158/319 (49%), Gaps = 43/319 (13%)
Query: 40 GERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQY-DFTGNLD 98
GE +L+G+IHY R P +W D +++ K G + ++TYV WN H+P R + DF+G D
Sbjct: 17 GEPYRILAGAIHYFRVHPDLWQDRLRRLKAMGANTVDTYVAWNFHQPKRDEAPDFSGWQD 76
Query: 99 LIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQN-F 157
L RF+ ++GL VI+R GPY+CAEW+ GGFP WL +PGI LR + VF ++ F
Sbjct: 77 LGRFMDLAAEEGLDVIVRPGPYICAEWDNGGFPSWLTGIPGI-GLRCMDPVFTAAIEEWF 135
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW---------CAKM 208
L+ +A ++ S GGP++ QIENEYG+ YGD YI W ++
Sbjct: 136 DHLLPIVASRQ---TSAGGPVVAVQIENEYGS----YGDD-HEYIRWNRRALEERGITEL 187
Query: 209 ATSLDIGVPWI--------------MCQESDAPSPMFTPNNPNSPKIWTENWTGWFKSWG 254
+ D G + + D + P P E W GWF WG
Sbjct: 188 LFTADGGTDYFLDGGAVEGTWATATLGSRGDEAVATWQRRRPGEPFFNVEFWGGWFDHWG 247
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY-------LTTSYDYD 307
R AED A + GG+ YM HGGTNFG SG + TSYD D
Sbjct: 248 EHHHGRDAEDAALEARKMLDLGGSL-CAYMAHGGTNFGLRSGSNHDGTMLQPTVTSYDSD 306
Query: 308 APIDEYGHLNQPKWGHLRE 326
API E G L PK+ R+
Sbjct: 307 APIAENGALT-PKFHAFRK 324
>gi|194213013|ref|XP_001503036.2| PREDICTED: LOW QUALITY PROTEIN: galactosidase, beta 1-like 2 [Equus
caballus]
Length = 663
Score = 176 bits (447), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 183/675 (27%), Positives = 269/675 (39%), Gaps = 156/675 (23%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+ GS+HY R W D + K K GL+ + TYV WN HEP R ++DF+GNLDL F+
Sbjct: 91 IFGGSVHYFRVPKEYWRDRLLKMKACGLNTLTTYVPWNLHEPERGRFDFSGNLDLEAFVL 150
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
T + GL+VILR GPY+C+E + GG P WL G+ LRTT K F N + + + M
Sbjct: 151 TAAEIGLWVILRPGPYICSEIDLGGLPSWLLQDSGM-RLRTTYKGFTNAVDLYFDHL--M 207
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQES 224
+ L GGPII Q+ENEYG+ D +Y+ + K I +
Sbjct: 208 PRVVPLQYKHGGPIIAVQVENEYGSYNKD-----PTYMPYIKKALEDRGIEELLLTSDNK 262
Query: 225 DAPSP------------------------MFTPNNPNSPKIWTENWTGWFKSWGGKDPKR 260
D S +FT PK+ E WTGWF SWGG
Sbjct: 263 DGLSSGAVDGVLATINLQSQHDLQLLSTFLFTVQGAR-PKMVMEYWTGWFDSWGGTHNIL 321
Query: 261 TAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYDYDAPIDEYG 314
+ ++ V+ G + N YM+HGGTNFG +G + TSYDYDA + E G
Sbjct: 322 DSSEVLKTVSAIIDAGSSI-NLYMFHGGTNFGFINGAMHYYDYKSHVTSYDYDAVLTEAG 380
Query: 315 HLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEE 374
K+ LR+ + + T Y SV+ PA+ +S+ K E
Sbjct: 381 DYT-AKYLQLRDFFGSISGTPLPPPPDPLPKTAY-ESVT------PAFYLSLWDALKYME 432
Query: 375 FNTAKVNTQTNVKVKR-PNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTN 433
A +N++ V ++ P GN Q+
Sbjct: 433 ---APINSEQPVNMENLPVNNGNGQS---------------------------------- 455
Query: 434 DVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDL 493
Y Y T ++ S ++ + GQV V+ ++D +
Sbjct: 456 --FGYTLYETT---------IASSGVLSAFVRDRGQVFVNTVSIGFLDYKRK-------- 496
Query: 494 FERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHK 553
E + L +G + +L G NYG + D G+ G + L D +
Sbjct: 497 -EINIPLIQGYTTLRILVENCGRVNYG-EIDNQRKGLIGNIYL----NDSPL-------- 542
Query: 554 WTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRM-TWYKTTFEAPLENDPVVLNLQ 612
K +Y LD KK + + + E W+ VP ++ L + L+
Sbjct: 543 --SKFRIYSLDMKKSFFQRFSFDE--WN--KVPEAPTFPAFFLGALSVALSPSDTFMKLE 596
Query: 613 GMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPR 672
G KG ++NG NLGRYW N G P + Y +P
Sbjct: 597 GWEKGVVFINGQNLGRYW----------------------------NIG-PQETLY-LPG 626
Query: 673 SWIKDGVNTLVLFEE 687
+W+ G+N +++FEE
Sbjct: 627 TWLDQGINQVIVFEE 641
>gi|119588243|gb|EAW67839.1| hCG1729998, isoform CRA_d [Homo sapiens]
Length = 653
Score = 176 bits (447), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 118/351 (33%), Positives = 174/351 (49%), Gaps = 49/351 (13%)
Query: 20 LFNLSLAYRVSHDGRA---ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIE 76
L N S+ GR T++G + ++ GSIHY R W D + K K G + +
Sbjct: 61 LKNRSVGLGTESTGRGKPHFTLEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVT 120
Query: 77 TYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHN 136
TYV WN HEP R ++DF+GNLDL F+ + GL+VILR GPY+C+E + GG P WL
Sbjct: 121 TYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQ 180
Query: 137 MPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGD 196
P + LRTTNK F+ ++ + ++ + L Q GP+I Q+ENEYG+ D
Sbjct: 181 DPRL-LLRTTNKSFIEAVEKYFDHLI--PRVIPLQYRQAGPVIAVQVENEYGSFNKD--- 234
Query: 197 AGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFTPNNP-------------------- 236
K+Y+ + K L G+ ++ SD + + +
Sbjct: 235 --KTYMPYLHKAL--LRRGIVELLLT-SDGEKHVLSGHTKGVLAAINLQKLHQDTFNQLH 289
Query: 237 ----NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG 292
+ P + E W GWF WG K + A+++ AV+ F ++ +F N YM+HGGTNFG
Sbjct: 290 KVQRDKPLLIMEYWVGWFDRWGDKHHVKDAKEVEHAVSEFIKYEISF-NVYMFHGGTNFG 348
Query: 293 RTSGGPY------LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKT 337
+G Y + TSYDYDA + E G + +L KL +S+ T
Sbjct: 349 FMNGATYFGKHSGIVTSYDYDAVLTEAGDYTE----KYLKLQKLFQSVSAT 395
>gi|148273884|ref|YP_001223445.1| putative beta-galactosidase [Clavibacter michiganensis subsp.
michiganensis NCPPB 382]
gi|147831814|emb|CAN02784.1| putative beta-galactosidase [Clavibacter michiganensis subsp.
michiganensis NCPPB 382]
Length = 599
Score = 176 bits (447), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 113/320 (35%), Positives = 159/320 (49%), Gaps = 40/320 (12%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
S +R+ D +DG +++G++HY R P W D I+KA+ GLD IETYV WNA
Sbjct: 8 STRFRIGTDD--FELDGRPHRVIAGALHYFRVHPDQWADRIRKARLMGLDTIETYVAWNA 65
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
H P R +D + LDL RF+ + +G++ I+R GPY+CAEW+ GG P WL P + +
Sbjct: 66 HSPERGAFDTSAGLDLGRFLDLVHAEGMHAIVRPGPYICAEWDGGGLPGWLFEDPAV-GV 124
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYIN 203
R + +++ + F + ++ ++ GGP+IL QIENEYG YGD Y+
Sbjct: 125 RRSEPLYLAAVDEFLRRVYEIVAPRQI--DMGGPVILVQIENEYGA----YGD-DADYLR 177
Query: 204 WCAKMATSLDIGVPWIMCQE---------------------SDAPSPMFT--PNNPNSPK 240
+ I VP + S A + T + P P
Sbjct: 178 HLVDLTRESGIIVPLTTVDQPTDEMLSRGSLDELHRTGSFGSRATERLATLRRHQPTGPL 237
Query: 241 IWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG---- 296
+ +E W GWF W G+ T+ A A G N YM+HGGTNFG T+G
Sbjct: 238 MCSEFWDGWFDHW-GEHHHTTSAADAAAELDALLAAGASVNIYMFHGGTNFGFTNGANHK 296
Query: 297 GPYLT--TSYDYDAPIDEYG 314
G Y + TSYDYDAP+DE G
Sbjct: 297 GTYQSHVTSYDYDAPLDETG 316
>gi|29345700|ref|NP_809203.1| beta-galactosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|383123143|ref|ZP_09943828.1| hypothetical protein BSIG_0114 [Bacteroides sp. 1_1_6]
gi|29337593|gb|AAO75397.1| beta-galactosidase precursor [Bacteroides thetaiotaomicron
VPI-5482]
gi|251841761|gb|EES69841.1| hypothetical protein BSIG_0114 [Bacteroides sp. 1_1_6]
Length = 779
Score = 176 bits (447), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 114/319 (35%), Positives = 157/319 (49%), Gaps = 39/319 (12%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++GE ++ + IHYPR W IK K G++ I YVFWN HEP +YDF G
Sbjct: 37 LNGEPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMNTICLYVFWNFHEPEEGRYDFAGQK 96
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+ F + Q+ G+YVI+R GPYVCAEW GG P WL I +LR + +M ++ F
Sbjct: 97 DIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDI-KLREQDPYYMERVKLF 155
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDI-GV 216
+ + L S+GG II+ Q+ENEYG D K YI+ M GV
Sbjct: 156 LNEV--GKQLADLQISKGGNIIMVQVENEYGAFGID-----KPYISEIRDMVKQAGFTGV 208
Query: 217 PWIMCQ-----ESDAPSPMFTPNN------------------PNSPKIWTENWTGWFKSW 253
P C E++A + N P++P + +E W+GWF W
Sbjct: 209 PLFQCDWNSNFENNALDDLLWTINFGTGANIDEQFKRLKELRPDTPLMCSEFWSGWFDHW 268
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY-----LTTSYDYDA 308
G K R+AE+L + +F + YM HGGT+FG G + TSYDYDA
Sbjct: 269 GAKHETRSAEELVKGMKEMLDRNISF-SLYMTHGGTSFGHWGGANFPNFSPTCTSYDYDA 327
Query: 309 PIDEYGHLNQPKWGHLREL 327
PI+E G + PK+ +R L
Sbjct: 328 PINESGKVT-PKYLEVRNL 345
>gi|423295816|ref|ZP_17273943.1| hypothetical protein HMPREF1070_02608 [Bacteroides ovatus
CL03T12C18]
gi|392671544|gb|EIY65016.1| hypothetical protein HMPREF1070_02608 [Bacteroides ovatus
CL03T12C18]
Length = 782
Score = 176 bits (447), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 114/323 (35%), Positives = 158/323 (48%), Gaps = 39/323 (12%)
Query: 34 RAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDF 93
+ ++G+ ++ + IHYPR W IK K G++ I YVFWN HEP +YDF
Sbjct: 33 KTFLLNGKPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMNTICLYVFWNFHEPEEGKYDF 92
Query: 94 TGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNE 153
TG D+ F + Q+ G+YVI+R GPYVCAEW GG P WL I +LR + +M
Sbjct: 93 TGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDI-KLREQDPYYMER 151
Query: 154 MQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLD 213
++ F + + L S+GG II+ Q+ENEYG+ D K YI +
Sbjct: 152 VKLFMNEV--GKQLADLQISKGGNIIMVQVENEYGSFGID-----KPYIAEIRDIVKQAG 204
Query: 214 I-GVPWIMCQ-----ESDAPSPMFTPNN------------------PNSPKIWTENWTGW 249
GVP C E++A + N P+ P + +E W+GW
Sbjct: 205 FTGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDDQFKRLQELRPDIPLMCSEFWSGW 264
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY-----LTTSY 304
F WG K R+AEDL + +F + YM HGGT+FG G + TSY
Sbjct: 265 FDHWGAKHETRSAEDLVKGMKEMLDRNISF-SLYMTHGGTSFGHWGGANFPNFSPTCTSY 323
Query: 305 DYDAPIDEYGHLNQPKWGHLREL 327
DYDAPI+E G + PK+ +R L
Sbjct: 324 DYDAPINESGKVT-PKYFEVRNL 345
Score = 39.7 bits (91), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 47/127 (37%), Gaps = 34/127 (26%)
Query: 592 TWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPY 651
+Y+ TF D LN+ KG WVNGY +GRYW
Sbjct: 530 AYYRGTFTLDKTGD-TFLNMTNWSKGMVWVNGYAIGRYWEI------------------- 569
Query: 652 GSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGG-NPSQINFQTVVVGTAC--GQ 708
P Q Y VP W+K G N +++ + G P Q ++ C G
Sbjct: 570 ----------GPQQTLY-VPGCWLKKGENEVIILDMAGSVQPQTEGLQQPILDNLCVHGA 618
Query: 709 AHENKTM 715
A+ ++ +
Sbjct: 619 AYAHRKV 625
>gi|220914306|ref|YP_002489615.1| beta-galactosidase [Arthrobacter chlorophenolicus A6]
gi|219861184|gb|ACL41526.1| Beta-galactosidase [Arthrobacter chlorophenolicus A6]
Length = 586
Score = 176 bits (447), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 111/311 (35%), Positives = 153/311 (49%), Gaps = 38/311 (12%)
Query: 34 RAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDF 93
R +DGE +LSG+IHY R P +W D I+KA+ GL+ IETYV WN H +
Sbjct: 9 RDFLLDGEPFRILSGAIHYFRVHPDLWADRIRKARLMGLNTIETYVPWNEHSSTPGAFRT 68
Query: 94 TGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNE 153
G LDL RF+ + +G+ I+R GPY+CAEW+ GG P WL P I +R++ ++
Sbjct: 69 DGGLDLGRFLDLVAAEGMQGIVRPGPYICAEWDNGGLPAWLFTDPSI-GVRSSEPGYLAA 127
Query: 154 MQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLD 213
+ F ++ + + ++ ++GGP+IL QIENEYG SD K+Y+ AT
Sbjct: 128 VDGFMDRLLPIVVERQI--TRGGPVILFQIENEYGAYGSD-----KAYLQHLVDTATRAG 180
Query: 214 IGVPWIMCQE-----------------------SDAPSPMFTPNNPNSPKIWTENWTGWF 250
+ VP C + +D P+ P + E W GWF
Sbjct: 181 VEVPLFTCDQPFETMIEDGSLPGLHKTGTFGSRADERLAFLRERQPDGPLMCAEFWNGWF 240
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG----GPY--LTTSY 304
+WG T + A G N YM+HGGTNFG T+G G Y TSY
Sbjct: 241 DNWGTHH-HTTDAAASAAELDALLAAGASVNIYMFHGGTNFGFTNGANDKGIYEPTITSY 299
Query: 305 DYDAPIDEYGH 315
DYDAP+ E GH
Sbjct: 300 DYDAPLSEDGH 310
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 57/208 (27%), Positives = 80/208 (38%), Gaps = 47/208 (22%)
Query: 490 SNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDL 549
S +L ER + L RG + +L G NYG++ G+ GP LL G +
Sbjct: 415 SRELGERSIVLPRG-GLLQVLVEDQGRVNYGTRIGEA-KGLTGPALLDG----------V 462
Query: 550 SSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVL 609
W+ + L + A A G SS +++ TFEA D L
Sbjct: 463 ELQDWSVRP--VDLSSLAPFRAAAGELPAGSSSAGGVAGPSVSF--ATFEADGPGD-RHL 517
Query: 610 NLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYH 669
L G KG A++NG+NLGRYW RGP Q +
Sbjct: 518 RLDGWTKGNAFINGFNLGRYW----------------SRGP--------------QRTLY 547
Query: 670 VPRSWIKDGVNTLVLFEEFGGNPSQINF 697
VP I++G N L + E G ++ F
Sbjct: 548 VPGPLIREGANELAVLELQGSTTREVRF 575
>gi|383112460|ref|ZP_09933253.1| hypothetical protein BSGG_0667 [Bacteroides sp. D2]
gi|313693132|gb|EFS29967.1| hypothetical protein BSGG_0667 [Bacteroides sp. D2]
Length = 782
Score = 176 bits (446), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 114/323 (35%), Positives = 158/323 (48%), Gaps = 39/323 (12%)
Query: 34 RAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDF 93
+ ++G+ ++ + IHYPR W IK K G++ I YVFWN HEP +YDF
Sbjct: 33 KTFLLNGKPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMNTICLYVFWNFHEPEEGKYDF 92
Query: 94 TGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNE 153
TG D+ F + Q+ G+YVI+R GPYVCAEW GG P WL I +LR + +M
Sbjct: 93 TGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDI-KLREQDPYYMER 151
Query: 154 MQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLD 213
++ F + + L S+GG II+ Q+ENEYG+ D K YI +
Sbjct: 152 VKLFMNEV--GKQLTDLQISKGGNIIMVQVENEYGSFGID-----KPYIAEIRDIVKQAG 204
Query: 214 I-GVPWIMCQ-----ESDAPSPMFTPNN------------------PNSPKIWTENWTGW 249
GVP C E++A + N P+ P + +E W+GW
Sbjct: 205 FTGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDDQFKRLQELRPDIPLMCSEFWSGW 264
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY-----LTTSY 304
F WG K R+AEDL + +F + YM HGGT+FG G + TSY
Sbjct: 265 FDHWGAKHETRSAEDLVKGMKEMLDRNISF-SLYMTHGGTSFGHWGGANFPNFSPTCTSY 323
Query: 305 DYDAPIDEYGHLNQPKWGHLREL 327
DYDAPI+E G + PK+ +R L
Sbjct: 324 DYDAPINESGKVT-PKYFEVRNL 345
>gi|170034404|ref|XP_001845064.1| beta-galactosidase [Culex quinquefasciatus]
gi|167875697|gb|EDS39080.1| beta-galactosidase [Culex quinquefasciatus]
Length = 650
Score = 176 bits (446), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 189/727 (25%), Positives = 296/727 (40%), Gaps = 152/727 (20%)
Query: 5 KHCSRAILLCLILQTLFNL---SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWP 61
+H + + + Q LF+L ++ + +D +DG+ +SGS HY R+ P W
Sbjct: 10 QHDASDVQITPDAQDLFDLKNEERSFYIDYDRDTFVMDGKDFRYVSGSFHYFRALPQTWR 69
Query: 62 DLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYV 121
++ + GGL+A++ YV W+ H P QY + G ++ I+ ++ LYVILR GPY+
Sbjct: 70 SKLRTMRAGGLNAVDLYVQWSLHNPKDNQYVWDGIANITDVIEAAIEEDLYVILRPGPYI 129
Query: 122 CAEWNYGGFPVWLHN-MPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIIL 180
CAE + GG P WL N PGI+ +R ++ ++ E++ + + M++ GGPII+
Sbjct: 130 CAEIDNGGLPYWLFNKYPGIQ-VRISDANYIKEVKIWYEKL--MSQLTPYMYGNGGPIIM 186
Query: 181 AQIENEYGNVMSDYGDAGKSYIN--------WCAKMATSLDIGVPW---IMC-------- 221
Q+ENEYG +G K Y+N + A + P+ ++C
Sbjct: 187 VQLENEYGA----FGKCDKQYLNVLKEETEKYTQGKAVLFTVDRPYDDELVCGQIPGVFI 242
Query: 222 ---------QESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARF 272
E D + P P + TE +TGW W K+ +R A LA + +
Sbjct: 243 TTDFGLMTDDEVDTHAAKVRSIQPKGPLVNTEFYTGWLTHWQEKNQRRPAGPLAATLRKM 302
Query: 273 FQFGGTFQNYYMYHGGTNFGRTSG------GPYLT--TSYDYDAPIDEYGHLNQPKWGHL 324
+ G ++YMY GGTNFG +G G Y+ TSYDYDAP+DE G
Sbjct: 303 LKDGWNV-DFYMYFGGTNFGFWAGANDWGLGKYMADITSYDYDAPMDEAGD--------- 352
Query: 325 RELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSV-SILPDCKTEEFNTAKVNTQ 383
SM+ T+ + G LPA V P K E + V +
Sbjct: 353 -------PSMKYTIF----------RDIIGEYIPLPAVRVPDRAPKMKHEPVLLSSVESI 395
Query: 384 TNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMT 443
+ + + G + L+ + N S ++ Y T
Sbjct: 396 LSTSSRN------------------------LLGTPALKSDKLLTFEELNQNSGFVLYET 431
Query: 444 NADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRG 503
DP L L IN YV+ Y+ + N + P+ G
Sbjct: 432 TLPKFTRDPSL-------LTINDLRDRAQIYVDEFYLGT----LSRENAISSLPISAGWG 480
Query: 504 KNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGL 563
+++S+L G N+ D GI G V + + DE ++LS WT + Y
Sbjct: 481 -SKLSILVENQGRINFDVLDDY--KGILGNVTI--QIYDEPYTQELSD--WT--ITGYPF 531
Query: 564 DD-KKFYNAKAANSE---RGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFA 619
D KF A +E G + K ++ +T+ E ++ G GKGF
Sbjct: 532 DSYDKFTQLFATLNEGAGHGVNGKGAAIHGPVTFKGELVIETSEIHDTYFDMTGWGKGFI 591
Query: 620 WVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGV 679
++NG+NLGRYWP GP Q+ ++P+ +K G
Sbjct: 592 FINGFNLGRYWPV---------------AGP--------------QVTMYLPKELLKSGA 622
Query: 680 NTLVLFE 686
N +VL E
Sbjct: 623 NEIVLVE 629
>gi|298205259|emb|CBI17318.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 176 bits (446), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 87/166 (52%), Positives = 106/166 (63%), Gaps = 12/166 (7%)
Query: 59 MWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIG 118
MW L+K AKEGG+D IETYVF N HE Y F G DL++F+K +Q G+Y+IL IG
Sbjct: 1 MWSGLVKTAKEGGIDVIETYVFQNGHELSPSNYYFGGWYDLLKFVKIVQQAGMYLILHIG 60
Query: 119 PYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPI 178
P+V EWN+G +T +K F MQ F TLIV++ KK+KLFASQGGPI
Sbjct: 61 PFVATEWNFGTI------------FQTNSKPFKYHMQKFMTLIVNIMKKDKLFASQGGPI 108
Query: 179 ILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQES 224
IL Q +NEYG+ Y D GK Y+ W A M S +IGVPWIMCQ S
Sbjct: 109 ILTQAKNEYGDTKRIYEDGGKPYVMWAANMVLSHNIGVPWIMCQYS 154
Score = 69.7 bits (169), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 30/40 (75%), Positives = 35/40 (87%)
Query: 281 NYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPK 320
NYYMYHGGTNFG TSGGP++TT+Y+Y+APIDEYG PK
Sbjct: 238 NYYMYHGGTNFGCTSGGPFITTTYNYNAPIDEYGLARLPK 277
>gi|323449959|gb|EGB05843.1| hypothetical protein AURANDRAFT_66064 [Aureococcus anophagefferens]
Length = 1630
Score = 176 bits (446), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 115/362 (31%), Positives = 168/362 (46%), Gaps = 64/362 (17%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
Y ++ DGR++ ++G R +LLSGSIHYPRSTP MWP L +A+ GL+AIE+Y FWN H
Sbjct: 1036 YSIARDGRSLLVNGSRVLLLSGSIHYPRSTPAMWPKLFAEARANGLNAIESYAFWNKHSA 1095
Query: 87 LR---RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFP------------ 131
R Y F G++DL F+ + L+V+ R GPYVCAEW GG P
Sbjct: 1096 TRYGAYDYGFNGDVDL--FLSLAAEHDLFVLWRFGPYVCAEWPAGGIPARAPRRAVFASN 1153
Query: 132 VWLHNMPGIEELRTTNKVFMNE----MQNFTTLIVDMAKKEKLFASQGGPIILAQIENEY 187
W+H++PG+ + RT N ++NE M++ +I E + G +IENEY
Sbjct: 1154 AWIHDVPGM-KTRTNNTAWLNETGRWMRDHFAVI------EPHLSRNGAS---NRIENEY 1203
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMC-------------------QESDAPS 228
G SD A + ++ W+MC + A +
Sbjct: 1204 GGSKSDAAAVAYVDALDALADAVAPEL--VWMMCGFVSLVAPDALHTGNGCPHDQGPASA 1261
Query: 229 PMFTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGG 288
+ P P + W W+ +WG R D+A+ VA + GG N+YM+HGG
Sbjct: 1262 HVVVPPAPGADPAWYTEDELWYDAWGLPSLARPPADVAYGVASYVATGGAMHNFYMWHGG 1321
Query: 289 TNFGRTS------GG------PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
++G S GG P Y AP+ G ++P + HL +H L + +
Sbjct: 1322 NHYGNWSTATPDLGGASSPEPPASQVRYANAAPLRSDGSRHEPLFSHLAAVHGTLDAYAE 1381
Query: 337 TL 338
L
Sbjct: 1382 VL 1383
>gi|193695178|ref|XP_001948549.1| PREDICTED: beta-galactosidase-like [Acyrthosiphon pisum]
Length = 640
Score = 176 bits (445), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 119/334 (35%), Positives = 164/334 (49%), Gaps = 52/334 (15%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
+ V ++ DGE +SG +HY R W D I+K K GL+AI TYV W+ HE
Sbjct: 28 TFIVDYEKNEFLKDGEVFRYVSGDLHYFRVPKSYWKDRIQKIKAAGLNAITTYVEWSLHE 87
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P Y+F G DL FIK IQD+G+Y++LR GPY+CAE ++GGFP WL N+ LRT
Sbjct: 88 PFPGTYNFEGMADLEYFIKLIQDEGMYLLLRPGPYICAERDFGGFPYWLLNVTPKGSLRT 147
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVM---SDYG----DAG 198
+ + + + +++ M K + GG II+ Q+ENEYG+ SDY D
Sbjct: 148 NDSSYKKYVSQWFSVL--MKKMQPHLYGNGGNIIMVQVENEYGSYYACDSDYKLWLRDLL 205
Query: 199 KSYINWCAKMATSLDIGVPWIMCQESD---APSPM------------------FTPN-NP 236
K Y+ A + T +DI C++ D P P F N
Sbjct: 206 KGYVEDKALLYT-IDI------CRQRDFDCGPIPEVYATVDFGISVNAATCFDFLKNYQK 258
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
P + +E + GW W PK ++D+ + +F ++YM+HGGTNFG TSG
Sbjct: 259 GGPSVNSEFYPGWLAHWQEPHPKVNSDDVVNHMKSMLSLNASF-SFYMFHGGTNFGFTSG 317
Query: 297 G------------PYLTTSYDYDAPIDEYGHLNQ 318
P L TSYDYDAPI E G L +
Sbjct: 318 ANTNESDANIGYLPQL-TSYDYDAPITEAGDLTE 350
>gi|320109257|ref|YP_004184847.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
gi|319927778|gb|ADV84853.1| glycoside hydrolase family 35 [Terriglobus saanensis SP1PR4]
Length = 640
Score = 176 bits (445), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 126/369 (34%), Positives = 176/369 (47%), Gaps = 58/369 (15%)
Query: 9 RAILLCLILQTLFNLSLAYRVSHDGRAI----TIDGERKILLSGSIHYPRSTPGMWPDLI 64
RAI + TL ++A +V H A+ +DG+ +L+G +HY R W D +
Sbjct: 4 RAIATLALAFTL--PAVAQQVPHSFAAVGDHFELDGKPFRILTGEMHYARIPRARWDDAM 61
Query: 65 KKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAE 124
+KAK GL+AI TYVFWN HEP YDFTG DL ++ Q GL VILR GPY CAE
Sbjct: 62 QKAKALGLNAITTYVFWNVHEPRPGVYDFTGQNDLGEYLAAAQRAGLKVILRPGPYACAE 121
Query: 125 WNYGGFPVWLHNMPGIEELRTTNKVFMNEMQN-FTTLIVDMAKKEKLFASQGGPIILAQI 183
W +GG+P WL P + +R+++ FM + F L ++ + A+ GGPII Q+
Sbjct: 122 WEFGGYPAWLIKDPTV-VVRSSDPKFMKPVAKWFHRLGQEV---QPYLAANGGPIIAVQV 177
Query: 184 ENEYGNVMSDYG------------------------DAGKSYINWCAKMATSLDIGVPWI 219
ENEYG+ +D+ + GK+ M + D GV
Sbjct: 178 ENEYGSFGNDHAYMEQMKDLVISSGIGGKNPKKAVDEDGKNVPQDTGTMLYTADGGVQLP 237
Query: 220 MCQESDAPSPM-------------FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLA 266
+ P+ + + PN P++ E W GWF WG K A +
Sbjct: 238 NGTLPELPAVVNFGGGQAKSELARYEAFRPNGPRMVGEYWAGWFDHWGNNHQKTNAAEQV 297
Query: 267 FAVARFFQFGGTFQNYYMYHGGTNFGRTSG------GPYL--TTSYDYDAPIDEYGHLNQ 318
+ G + + YM +GGT+FG +G PY TSYDYDAPIDE G+
Sbjct: 298 AEYEYMLKRGYSV-SLYMLYGGTSFGWMAGANSGDKAPYEPDVTSYDYDAPIDERGN-PT 355
Query: 319 PKWGHLREL 327
PK+ LRE+
Sbjct: 356 PKYFALREV 364
>gi|255691973|ref|ZP_05415648.1| glycosyl hydrolase [Bacteroides finegoldii DSM 17565]
gi|260622382|gb|EEX45253.1| glycosyl hydrolase family 35 [Bacteroides finegoldii DSM 17565]
Length = 782
Score = 176 bits (445), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 114/323 (35%), Positives = 157/323 (48%), Gaps = 39/323 (12%)
Query: 34 RAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDF 93
+ ++G ++ + IHYPR W IK K G++ I YVFWN HEP +YDF
Sbjct: 33 KTFLLNGNPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMNTICLYVFWNFHEPEEGKYDF 92
Query: 94 TGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNE 153
TG D+ F + Q+ G+YVI+R GPYVCAEW GG P WL I +LR + +M
Sbjct: 93 TGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDI-KLREQDPYYMER 151
Query: 154 MQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLD 213
++ F + + L S+GG II+ Q+ENEYG+ D K YI +
Sbjct: 152 VKLFMNEV--GKQLTDLQISKGGNIIMVQVENEYGSFGID-----KPYIAEIRDIVKQAG 204
Query: 214 I-GVPWIMCQ-----ESDAPSPMFTPNN------------------PNSPKIWTENWTGW 249
GVP C E++A + N P+ P + +E W+GW
Sbjct: 205 FTGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDDQFKRLQELRPDIPLMCSEFWSGW 264
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY-----LTTSY 304
F WG K R+AEDL + +F + YM HGGT+FG G + TSY
Sbjct: 265 FDHWGAKHETRSAEDLVKGMKEMLDRNISF-SLYMTHGGTSFGHWGGANFPNFSPTCTSY 323
Query: 305 DYDAPIDEYGHLNQPKWGHLREL 327
DYDAPI+E G + PK+ +R L
Sbjct: 324 DYDAPINESGKVT-PKYFEVRNL 345
>gi|402895880|ref|XP_003911040.1| PREDICTED: beta-galactosidase-1-like protein 3 [Papio anubis]
Length = 653
Score = 176 bits (445), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 119/351 (33%), Positives = 176/351 (50%), Gaps = 49/351 (13%)
Query: 20 LFNLSLAYRVSHDGRA---ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIE 76
L N S+ + GR T++G R ++ GSIHY R W D + K + G + +
Sbjct: 61 LKNRSVGLGTASTGRGKPHFTLEGRRFLICGGSIHYFRVPRAYWRDRLLKLRACGFNTVT 120
Query: 77 TYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHN 136
TYV WN HEP R ++DF+GNLDL F+ + GL+VILR GPY+C+E + GG P WL
Sbjct: 121 TYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQ 180
Query: 137 MPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGD 196
P + LRTTNK F ++ + ++ + L QGGP+I Q+ENEYG+ D
Sbjct: 181 DPRL-LLRTTNKGFTEAVEKYFDHLI--PRVIPLQYRQGGPVIAVQVENEYGSFNKD--- 234
Query: 197 AGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFTPNNP-------------------- 236
K+Y+ + K L G+ ++ SD + + +
Sbjct: 235 --KTYMPYLHKAL--LRRGIVELLLT-SDGEKNVLSGHTKGVLAAINLQKVQRNTFNQLH 289
Query: 237 ----NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG 292
+ P + E W GWF WG K + A+++ AV+ F ++ +F N YM+HGGTNFG
Sbjct: 290 KVQRDKPLLVMEYWVGWFDRWGDKHHVKDAKEVERAVSEFIKYEISF-NVYMFHGGTNFG 348
Query: 293 RTSGGPY------LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKT 337
+G + TSYDYDA + E G + + +L KLL+S+ T
Sbjct: 349 FMNGATNFGKHTGIVTSYDYDAVLTEAGDYTEKYF----KLQKLLESVSAT 395
>gi|333384209|ref|ZP_08475850.1| hypothetical protein HMPREF9455_04016 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826788|gb|EGJ99602.1| hypothetical protein HMPREF9455_04016 [Dysgonomonas gadei ATCC
BAA-286]
Length = 632
Score = 176 bits (445), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 118/331 (35%), Positives = 168/331 (50%), Gaps = 43/331 (12%)
Query: 33 GRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYD 92
G DG+ ++SG +HYPR W ++ K GL+A+ TYVFWNAHEP ++D
Sbjct: 34 GGDFVYDGKPVRIISGEMHYPRIPHQYWRHRMQMLKAMGLNAVATYVFWNAHEPEPGKWD 93
Query: 93 FTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMN 152
FT + +L +IK ++GL VILR GPYVCAEW +GG+P WL N+ + ELR N+ F+
Sbjct: 94 FTEDKNLAEYIKIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNVEEM-ELRRDNEQFL- 151
Query: 153 EMQNFTTLIVDMAKKE--KLFASQGGPIILAQIENEYGNVMSDYGDAG-KSYINWCAKMA 209
+T L ++ +E L ++GGPII+ Q ENE+G+ +S D + + + AK+
Sbjct: 152 ---KYTQLYINRLYQEVGNLQITKGGPIIMVQAENEFGSYVSQRKDIPLEEHRRYNAKIV 208
Query: 210 TSLDIGVPWIMCQESD---------APSPMFTPNNPNS----------------PKIWTE 244
L I SD P + T N ++ P + E
Sbjct: 209 QQLKTAGFDIPSFTSDGSWLFEGGAVPGALPTANGESNIDNLKKVVNRYNGGQGPYMVAE 268
Query: 245 NWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT--- 301
+ GW W P+ +A +A ++ Q + NYYM HGGTNFG TSG Y
Sbjct: 269 FYPGWLAHWVEPHPQVSATSVARQTEKYLQNDVSI-NYYMVHGGTNFGFTSGANYDKKHD 327
Query: 302 -----TSYDYDAPIDEYGHLNQPKWGHLREL 327
TSYDYDAP+ E G + PK+ LR +
Sbjct: 328 IQPDLTSYDYDAPVSEAGWVT-PKFDSLRNV 357
Score = 42.7 bits (99), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 48/188 (25%), Positives = 70/188 (37%), Gaps = 45/188 (23%)
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIK--DLSSHKWTYKVGLYGLD 564
+ +L +G NYGS+ GI PV R D I + S +D
Sbjct: 469 LEILVENMGRINYGSEIIHNTKGIISPV----RINDMEIEGGWQMISIPMDKAPDFSKMD 524
Query: 565 DKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGY 624
Y+ N+E S L + YK TF E +N++ GKG ++NG
Sbjct: 525 QASVYD----NNESAIKS----LAGKPVLYKGTFNL-TETGDTFINMEDWGKGIIFINGK 575
Query: 625 NLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVL 684
N+GRYW Y GP Q ++P W+K G N +++
Sbjct: 576 NIGRYW----------------YVGP--------------QQTLYIPGVWLKKGENKIII 605
Query: 685 FEEFGGNP 692
FE+ P
Sbjct: 606 FEQLNDKP 613
>gi|219847209|ref|YP_002461642.1| beta-galactosidase [Chloroflexus aggregans DSM 9485]
gi|219541468|gb|ACL23206.1| Beta-galactosidase [Chloroflexus aggregans DSM 9485]
Length = 898
Score = 176 bits (445), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 125/374 (33%), Positives = 184/374 (49%), Gaps = 38/374 (10%)
Query: 28 RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPL 87
RV G I +D LLSG IHY R W L+++A+ GL+ I+T + WN HEP
Sbjct: 6 RVGRQG--IELDSRPFYLLSGCIHYFRWPRAEWRPLLEQARWAGLNTIDTVIPWNRHEPQ 63
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
+DF DL F+ D GL VI+R GPY+CAEW GG P WL G LRT +
Sbjct: 64 PGVFDFADEADLGAFLDLCHDLGLKVIVRPGPYICAEWENGGLPAWL-TANGDLRLRTND 122
Query: 148 KVFMNE-MQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
VF++ ++ F TL+ + ++ ++GGPIIL QIENE+ D + + A
Sbjct: 123 PVFLSAVLRWFDTLMPILVPRQH---TRGGPIILCQIENEHWASGVYGADEHQQTL---A 176
Query: 207 KMATSLDIGVPWIMCQESDAPSPMFTPN--------------NPNSPKIWTENWTGWFKS 252
+ A I VP C + P F P++P I +E W+GWF +
Sbjct: 177 RAAFERGIEVPQYTCMGATPGYPEFRNGWSGIAEKLVQTRQLWPDNPLIVSELWSGWFDN 236
Query: 253 WGG-KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNF----GRTSGGP--YLTTSYD 305
WGG + +++A L + + G +++M+ GGTNF GRT GG ++TT YD
Sbjct: 237 WGGHRQTRKSAAKLDMILHQLTAVGCAGFSHWMWAGGTNFGYWGGRTVGGDLIHMTTGYD 296
Query: 306 YDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVS 365
YDAPIDEYG L + R H L S +G ++ ++V G +P +++
Sbjct: 297 YDAPIDEYGRLTEKALVARR--HHLFLS-----CFGAELSSVLADAVPGGITVIPPAAIA 349
Query: 366 ILPDCKTEEFNTAK 379
+ + + T +
Sbjct: 350 GRSEGGVQPYRTVR 363
>gi|449493221|ref|XP_002196735.2| PREDICTED: beta-galactosidase [Taeniopygia guttata]
Length = 636
Score = 175 bits (444), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 117/350 (33%), Positives = 168/350 (48%), Gaps = 31/350 (8%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
++ + +D DG+ +SGSIHY R P W D + K K GLDAI+TYV WN HE
Sbjct: 8 SFGIDYDSNCFVKDGKPFRYISGSIHYSRVPPYYWKDRLLKMKMAGLDAIQTYVPWNYHE 67
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P YDF G DL F++ D GL VILR GPY+CAEW+ GG P WL I LR+
Sbjct: 68 PQMGTYDFFGGKDLQYFLQLANDTGLLVILRAGPYICAEWDMGGLPAWLLEKKSI-VLRS 126
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS-DYG--------- 195
++ ++ ++ + ++ + K GGPII+ Q+ENEYG+ + DY
Sbjct: 127 SDSDYLEAVERWMGVL--LPKMRPYLYQNGGPIIMVQVENEYGSYFACDYNYLRFLLKLF 184
Query: 196 --DAGKSYINWCAKMATSLDI---GVPWIMCQESDAPSPMFTP-------NNPNSPKIWT 243
G + + A+ + + + AP T + P P + +
Sbjct: 185 RLHLGDEVVLFTTDGASQFHLKCGALQGLYATVDFAPGANVTAAFLAQRSSEPKGPLVNS 244
Query: 244 ENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYL- 300
E +TGW WG A+ +A + G N YM+ GGTNF +G PY+
Sbjct: 245 EFYTGWLDHWGHHHSVVPAQTIAKTLNEILASGANV-NLYMFIGGTNFAYWNGANMPYMP 303
Query: 301 -TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG 349
TSYDYDAP+ E G L + K+ LR++ + K + + LT YG
Sbjct: 304 QPTSYDYDAPLSEAGDLTE-KYFALRKVIGMYKQLPEGLTPPTTPKFAYG 352
>gi|414160019|ref|ZP_11416290.1| hypothetical protein HMPREF9310_00664 [Staphylococcus simulans
ACS-120-V-Sch1]
gi|410878669|gb|EKS26539.1| hypothetical protein HMPREF9310_00664 [Staphylococcus simulans
ACS-120-V-Sch1]
Length = 597
Score = 175 bits (444), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 110/336 (32%), Positives = 167/336 (49%), Gaps = 46/336 (13%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +LSG+IHY R P W + K G +A+ETYV WN HE + ++DF+G
Sbjct: 12 LDGKPLKILSGAIHYFRVLPEDWEHSLYNLKALGFNAVETYVPWNFHETVEGEFDFSGTK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+ RFI T + GLYVI+R PY+CAEW +GG P WL P + +R+ + F+ ++ +
Sbjct: 72 DIKRFIHTAEAIGLYVIIRPSPYICAEWEFGGLPAWLLTKPNL-RVRSRDPQFLEYVERY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ ++ ++ GPI++ Q+ENEYG+ D K+Y++ A+M + VP
Sbjct: 131 YDRLFEILTPLQI--DHHGPILMMQVENEYGSYGED-----KTYLSALARMMRDRGVTVP 183
Query: 218 -------WIMCQE--SDAPSPMFTPNNPNS--------------------PKIWTENWTG 248
W C E S A + + N S P + E W G
Sbjct: 184 LFTSDGSWQQCLEAGSLAEADIIPTGNFGSKSQKRLDNLHKFHQQFGKTWPLMSMEFWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-------T 301
WF WG + R +++L + + G N YM+HGGTNFG +G
Sbjct: 244 WFNRWGDRIITRQSDELIDEIGEVLKRGSI--NLYMFHGGTNFGFWNGCSARGRIDLPQV 301
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKT 337
TSYDYDAP+DE G+ + + +HKL +++T
Sbjct: 302 TSYDYDAPLDEAGNPTVKYYKIQQLVHKLHPEIQQT 337
Score = 42.7 bits (99), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 57/239 (23%), Positives = 93/239 (38%), Gaps = 62/239 (25%)
Query: 462 LRINSSGQVLHAYVNGNYVDSQWTKYGAS-NDLFERPVKLTRGKNQISLLSATVGLQNYG 520
LRI + + +++ V +T Y D FE V L + Q +L +G NYG
Sbjct: 402 LRIVDARDRVQLFLDNEKV---YTAYQEEIGDKFE--VALKQPVVQADVLVEHMGRVNYG 456
Query: 521 SKFDMVPNGIPGPVLLVGRAGDETIIKDLS-SHKW-TYKVGLYGLDDKKFYNAKAANSER 578
K + P G +G+ +++DL +W + + L+DK F E+
Sbjct: 457 YKL-VAPTQRKG----LGQG----LMQDLHFVQQWEQFDIDFDLLEDKHF--------EQ 499
Query: 579 GWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEED 638
W + R Y+ E P + L++ G GKG VNG+N+GRYW
Sbjct: 500 AWEADQPSFYR----YQFDIETP---ESTYLDVSGFGKGVVLVNGFNIGRYW-------- 544
Query: 639 GCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
N G ++ +P + +K G N +++FE G +I
Sbjct: 545 --------------------NIGPTLSLY--IPGALLKQGQNEIIIFETEGQYSEEIRL 581
>gi|350588684|ref|XP_003130139.3| PREDICTED: galactosidase, beta 1-like 3 [Sus scrofa]
Length = 656
Score = 175 bits (444), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 116/323 (35%), Positives = 162/323 (50%), Gaps = 41/323 (12%)
Query: 36 ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTG 95
T++G ++L GSIHY R W D + K K G + + TYV WN HEP R ++DF+G
Sbjct: 82 FTLEGHEFLILGGSIHYFRVPRESWRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSG 141
Query: 96 NLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQ 155
NLD+ FI + GL+VILR GPY+C+E + GG P L P +LRTTN F+ +
Sbjct: 142 NLDMEAFILLAAEVGLWVILRPGPYICSEIDLGGLPSRLLQDP-TSQLRTTNHSFIEAVD 200
Query: 156 NFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIG 215
+ ++ A+ L +GGPII Q+ENEYG+ D ++Y+ + K L G
Sbjct: 201 EYLDHLI--ARVVPLQYRKGGPIIAVQVENEYGSFHKD-----EAYMPYLHKAL--LKRG 251
Query: 216 VPWIMCQESDA--------PSPMFTPN---------------NPNSPKIWTENWTGWFKS 252
+ ++ + + T N N P + E W GWF +
Sbjct: 252 IVELLLTSDNTNEVLKGHIKGVLATVNMKSFKEGEFKDLYQVQSNKPILIMEFWVGWFDT 311
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY------LTTSYDY 306
WG K R A D+ + F + +F N YM+HGGTNFG +G Y + TSYDY
Sbjct: 312 WGNKHAVRDAIDVENTIFDFIRLEISF-NVYMFHGGTNFGFMNGATYFEQHRGVVTSYDY 370
Query: 307 DAPIDEYGHLNQPKWGHLRELHK 329
DA + E G PK+ LREL K
Sbjct: 371 DAVLTEAGDYT-PKFFKLRELFK 392
>gi|53715303|ref|YP_101295.1| beta-galactosidase [Bacteroides fragilis YCH46]
gi|52218168|dbj|BAD50761.1| beta-galactosidase precursor [Bacteroides fragilis YCH46]
Length = 628
Score = 175 bits (444), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 117/327 (35%), Positives = 169/327 (51%), Gaps = 43/327 (13%)
Query: 39 DGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLD 98
+G+ +LSG +HY R W ++ K GL+ + TYVFWN HEP ++DFTG+ +
Sbjct: 37 NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 99 LIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFT 158
L FIKT ++G+ VILR GPYVCAEW +GG+P WL N+ G+ E+R N F+ +T
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGM-EIRRDNPEFL----KYT 151
Query: 159 TLIVDMAKKE--KLFASQGGPIILAQIENEYGNVMSDYGDAG-KSYINWCAKMATSL-DI 214
+D KE L ++GGPI++ Q ENE+G+ ++ D + + + AK+ L D+
Sbjct: 152 KAYIDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADV 211
Query: 215 G--VPWI------MCQESDAPSPMFTPNNPNS----------------PKIWTENWTGWF 250
G VP + + P + T N + P + E + GW
Sbjct: 212 GFNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQYHDGKGPYMVAEFYPGWL 271
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY--------LTT 302
W P+ A +A ++ Q +F N+YM HGGTNFG TSG Y T
Sbjct: 272 SHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDMT 330
Query: 303 SYDYDAPIDEYGHLNQPKWGHLRELHK 329
SYDYDAPI E G + PK+ +R + K
Sbjct: 331 SYDYDAPISEAGWVT-PKYDSIRNVIK 356
Score = 46.2 bits (108), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 47/194 (24%), Positives = 74/194 (38%), Gaps = 43/194 (22%)
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDK 566
+ +L +G NYGS+ GI PV + G+ I+ ++ +D+
Sbjct: 468 LQILVENMGRINYGSEIVHNTKGIISPVQIAGKE----IVGGWDMYQLP-------MDEM 516
Query: 567 KFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNL 626
A++ + S+ L Y+ TF D ++++ GKG +VNG N+
Sbjct: 517 PDLTKLKADTHKNVPSEVAKLKGCPVLYEGTFTLDKVGD-TFMDMESWGKGIVFVNGVNI 575
Query: 627 GRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
GRYW P Q Y VP W+K G N +V+FE
Sbjct: 576 GRYWKV-----------------------------GPQQTLY-VPGVWLKKGENKIVIFE 605
Query: 687 EFGGNPSQINFQTV 700
+ P Q +TV
Sbjct: 606 QLNETP-QTEVKTV 618
>gi|355567243|gb|EHH23622.1| hypothetical protein EGK_07120 [Macaca mulatta]
Length = 653
Score = 175 bits (444), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 119/351 (33%), Positives = 176/351 (50%), Gaps = 49/351 (13%)
Query: 20 LFNLSLAYRVSHDGRA---ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIE 76
L N S+ + GR T++G R ++ GSIHY R W D + K + G + +
Sbjct: 61 LKNRSVGLGTASTGRGKPHFTLEGHRFLICGGSIHYFRVPREYWRDRLLKLRACGFNTVT 120
Query: 77 TYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHN 136
TYV WN HEP R ++DF+GNLDL F+ + GL+VILR GPY+C+E + GG P WL
Sbjct: 121 TYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQ 180
Query: 137 MPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGD 196
P + LRTTNK F ++ + ++ + L QGGP+I Q+ENEYG+ D
Sbjct: 181 DPRL-LLRTTNKGFTEAVEKYFDHLI--PRVIPLQYRQGGPVIAVQVENEYGSFNKD--- 234
Query: 197 AGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFTPNNP-------------------- 236
K+Y+ + K L G+ ++ SD + + +
Sbjct: 235 --KTYMPYLHKAL--LRRGIVELLLT-SDGEKNVLSGHTKGVLAAINLQKVQRNTFNQLH 289
Query: 237 ----NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG 292
+ P + E W GWF WG K + A+++ AV+ F ++ +F N YM+HGGTNFG
Sbjct: 290 KVQRDKPLLVMEYWVGWFDRWGDKHHVKDAKEVEHAVSEFIKYEISF-NVYMFHGGTNFG 348
Query: 293 RTSGGPY------LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKT 337
+G + TSYDYDA + E G + + +L KLL+S+ T
Sbjct: 349 FMNGATNFGKHTGIVTSYDYDAVLTEAGDYTEKYF----KLQKLLESVSAT 395
>gi|384939972|gb|AFI33591.1| beta-galactosidase-1-like protein 3 [Macaca mulatta]
gi|387541294|gb|AFJ71274.1| beta-galactosidase-1-like protein 3 [Macaca mulatta]
Length = 653
Score = 175 bits (444), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 119/351 (33%), Positives = 176/351 (50%), Gaps = 49/351 (13%)
Query: 20 LFNLSLAYRVSHDGRA---ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIE 76
L N S+ + GR T++G R ++ GSIHY R W D + K + G + +
Sbjct: 61 LKNRSVGLGTASTGRGKPHFTLEGHRFLICGGSIHYFRVPREYWRDRLLKLRACGFNTVT 120
Query: 77 TYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHN 136
TYV WN HEP R ++DF+GNLDL F+ + GL+VILR GPY+C+E + GG P WL
Sbjct: 121 TYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQ 180
Query: 137 MPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGD 196
P + LRTTNK F ++ + ++ + L QGGP+I Q+ENEYG+ D
Sbjct: 181 DPRL-LLRTTNKGFTEAVEKYFDHLI--PRVIPLQYRQGGPVIAVQVENEYGSFNKD--- 234
Query: 197 AGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFTPNNP-------------------- 236
K+Y+ + K L G+ ++ SD + + +
Sbjct: 235 --KTYMPYLHK--ALLRRGIVELLLT-SDGEKNVLSGHTKGVLAAINLQKVQRNTFNQLH 289
Query: 237 ----NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG 292
+ P + E W GWF WG K + A+++ AV+ F ++ +F N YM+HGGTNFG
Sbjct: 290 KVQRDKPLLVMEYWVGWFDRWGDKHHVKDAKEVEHAVSEFIKYEISF-NVYMFHGGTNFG 348
Query: 293 RTSGGPY------LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKT 337
+G + TSYDYDA + E G + + +L KLL+S+ T
Sbjct: 349 FMNGATNFGKHTGIVTSYDYDAVLTEAGDYTEKYF----KLQKLLESVSAT 395
>gi|199599299|ref|ZP_03212698.1| glycosyl hydrolase, family 35 [Lactobacillus rhamnosus HN001]
gi|199589801|gb|EDY97908.1| glycosyl hydrolase, family 35 [Lactobacillus rhamnosus HN001]
Length = 593
Score = 175 bits (443), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 173/652 (26%), Positives = 265/652 (40%), Gaps = 156/652 (23%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + H+ +DG+ +LSG+IHY R P W + K G + +ETYV WN HE
Sbjct: 4 FSIDHE---FMLDGKPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEY 60
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
++DF+G LD+ RF+KT ++ GLY I+R PY+CAEW +GGFP WL + LRT
Sbjct: 61 REGEFDFSGILDIERFLKTAEELGLYAIVRPSPYICAEWEFGGFPAWL--LTKKMRLRTD 118
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
+ ++ + + T ++ ++ + GG +I+ Q+ENEYG+ D + Y+ A
Sbjct: 119 DPTYLAAIDRYYTALMPHLVDHQV--THGGNVIMMQVENEYGSYGED-----QDYLAVVA 171
Query: 207 KMATSLDIGVPWIMCQESDAPSP------------MFTPNNPNS---------------- 238
K+ + VP SD P P + N S
Sbjct: 172 KLMQQHGVDVPLFT---SDGPWPATLNAGSMIDAGILATGNFGSAADKNFDRLAAFHQEH 228
Query: 239 ----PKIWTENWTGWFKSWGG----KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTN 290
P + E W GWF WG +DP TAEDL + R G+ N YM+HGGTN
Sbjct: 229 GRDWPLMCMEFWDGWFNRWGEPIIRRDPDETAEDLRAVIKR-----GSV-NLYMFHGGTN 282
Query: 291 FGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN 350
FG +G + D+D P VT+ DY
Sbjct: 283 FGFMNG---TSARKDHDLP--------------------------------QVTSYDYDA 307
Query: 351 SVSGSSYNLPAW-SVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEM 409
++ P + ++ + + E AK + +P A PL K
Sbjct: 308 PLNEQGNPTPKYFAIQKMIHEELPEVQQAK-------PLVKPTMAPASH-PLTAK----- 354
Query: 410 INDFVVRGKGHFALNTLIDQ------KSTNDVSDYLWYMTNADLKDDDPILSGSSNMT-- 461
+L ++DQ S ++L T L P++SG+ T
Sbjct: 355 -----------VSLFAVLDQLAKPIAASYPQTQEFLGQYTGYTLYRTQPLISGTDKGTPA 403
Query: 462 -LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYG 520
LR+ + + AY++ ++ +Q+ + +D+ V+ G +Q+ LL + NYG
Sbjct: 404 KLRVIDARDRVQAYLDQKWLATQYQE-AIGDDILLPEVE---GHHQLDLLVENMSRVNYG 459
Query: 521 SKFDMVPN--GIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSER 578
SK + + GI V++ D IK Y LD + A
Sbjct: 460 SKIEAITQFKGIRTGVMV-----DLHFIKGYQQ---------YPLDLNR---ASRLTFTE 502
Query: 579 GWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
GW +YK TF D L+ +G GKG VNG N+GR+W
Sbjct: 503 GWQPATP------AFYKYTFGLTAPQD-TYLDCRGFGKGVMLVNGVNVGRFW 547
>gi|297194972|ref|ZP_06912370.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
gi|297152570|gb|EFH31854.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
Length = 599
Score = 175 bits (443), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 110/325 (33%), Positives = 158/325 (48%), Gaps = 48/325 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG LLSG++HY R G W + + GL+ +ETYV WN HEP +Y G L
Sbjct: 20 LDGRPVRLLSGALHYFRVHEGQWGHRLAMLRAMGLNCVETYVPWNLHEPEPGRYADDGAL 79
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQN- 156
RF+ + G++ I+R GPY+CAEW GG P WL G +RT + ++ ++
Sbjct: 80 G--RFLDAVHAAGMWAIVRPGPYICAEWENGGLPFWLTGRVG-RRVRTEDPEYLGHVERW 136
Query: 157 FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGV 216
FT L+ + ++E ++GGP+++ Q+ENEYG+ SD G Y+ ++ S +GV
Sbjct: 137 FTRLLPQVVERE---ITRGGPVVMVQVENEYGSYGSDGG-----YLRQLVELLRSCGVGV 188
Query: 217 PWI--------MCQESDAPSPMFTPN---------------NPNSPKIWTENWTGWFKSW 253
P M P + T N P P + E W GWF+ W
Sbjct: 189 PLFTSDGPEDHMLSGGSVPGVLATVNFGSGAGEAFAALRRHRPTGPLMCMEFWCGWFEHW 248
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-----------PYLTT 302
G + +R AED A A+ + G + N YM HGGT+FG +G T
Sbjct: 249 GAEPARRDAEDAARALREILEAGASV-NVYMAHGGTSFGGWAGANRSGELHDGVLEPTVT 307
Query: 303 SYDYDAPIDEYGHLNQPKWGHLREL 327
SYDYDAP+DE G + W RE+
Sbjct: 308 SYDYDAPVDEAGRPTEKFW-RFREV 331
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 1/37 (2%)
Query: 594 YKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
Y+ TFE D L L G +GF WVNG+NLGRYW
Sbjct: 515 YRGTFEVAEPGD-AGLELPGWTRGFVWVNGFNLGRYW 550
>gi|375146511|ref|YP_005008952.1| glycoside hydrolase family protein [Niastella koreensis GR20-10]
gi|361060557|gb|AEV99548.1| glycoside hydrolase family 35 [Niastella koreensis GR20-10]
Length = 920
Score = 175 bits (443), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 109/313 (34%), Positives = 159/313 (50%), Gaps = 42/313 (13%)
Query: 35 AITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFT 94
A +DG+ ++SG +HYPR W D ++KAK GL+ I TYVFWN HEP + +YDF+
Sbjct: 346 AFLLDGQPFQIISGEMHYPRVPREAWRDRMRKAKAMGLNTIGTYVFWNLHEPQKGKYDFS 405
Query: 95 GNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEM 154
GN D+ F+KT Q++GL+VILR PYVCAEW +GG+P WL N+ G+ E+R+ ++
Sbjct: 406 GNNDIAAFVKTAQEEGLWVILRPSPYVCAEWEFGGYPYWLQNIKGL-EVRSKEPQYLQAY 464
Query: 155 QNFTTLIVDMAKK-EKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATS-- 211
+N+ I+ + K+ L + GG I++ Q+ENEYG SD + Y++ ++
Sbjct: 465 KNY---IMQVGKQLAPLQVNHGGNILMVQVENEYGAYGSD-----REYLDINRRLFIEAG 516
Query: 212 ----LDIGVPWIMCQESDAPSPMFTP-----------------NNPNSPKIWTENWTGWF 250
L P + + P +FT N P E + WF
Sbjct: 517 FDGLLYTCDPEPFLAKGNLPGKLFTSINGLDKPARIKQLIKQNNEGKGPYFVAEWYPAWF 576
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY--------LTT 302
WG + K AE + G + N YM+HGGT +G Y +
Sbjct: 577 DWWGTQHHKVPAEKYTPGLDSVLSAGMSV-NMYMFHGGTTRDFMNGANYNDQNPYEPQIS 635
Query: 303 SYDYDAPIDEYGH 315
SYDYDAP+DE G+
Sbjct: 636 SYDYDAPLDEAGN 648
Score = 43.5 bits (101), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 37/92 (40%), Gaps = 32/92 (34%)
Query: 609 LNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWY 668
LNL GKG WVNG+NLGRYW N G P Q Y
Sbjct: 853 LNLGNWGKGVVWVNGHNLGRYW----------------------------NIG-PQQTLY 883
Query: 669 HVPRSWIKDGVNTLVLFEEFGGNPSQINFQTV 700
VP W+K G N +++ E P Q Q V
Sbjct: 884 -VPVEWLKKGGNEIIVLELL--KPEQSQLQAV 912
>gi|281422858|ref|ZP_06253857.1| beta-galactosidase [Prevotella copri DSM 18205]
gi|281403124|gb|EFB33804.1| beta-galactosidase [Prevotella copri DSM 18205]
Length = 788
Score = 175 bits (443), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 109/318 (34%), Positives = 158/318 (49%), Gaps = 29/318 (9%)
Query: 34 RAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDF 93
+ ++G+ ++ + +HYPR W IK K G++ + YVFWN HE ++DF
Sbjct: 37 KTFLLNGKPFVVKAAELHYPRIPRAYWEHRIKMCKALGMNTVCLYVFWNIHEQEEGKFDF 96
Query: 94 TGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNE 153
TGN D+ F + Q G+YVI+R GPYVCAEW GG P WL I LR + FM
Sbjct: 97 TGNNDVAAFCRLAQKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDI-RLREQDPYFMQR 155
Query: 154 MQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSD--YGDAGKSYI--------- 202
++ F + + L GGPII+ Q+ENEYG+ D Y A + +
Sbjct: 156 VEIFEKEV--GKQLAPLTIQNGGPIIMVQVENEYGSYGKDKPYVSAIRDIVRKSGFDKVS 213
Query: 203 ----NWCAKMATSLDIGVPWIM----CQESDAPSPMFTPNNPNSPKIWTENWTGWFKSWG 254
+W + + + W M D PN+PK+ +E W+GWF WG
Sbjct: 214 LFQCDWSSNFLNNGLDDLTWTMNFGTGANIDQQFKRLGEVRPNAPKMCSEFWSGWFDKWG 273
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYL---TTSYDYDAP 309
+ R A+D+ + G +F + YM HGGT+FG +G P TSYDYDAP
Sbjct: 274 ARHETRPAKDMVEGMDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFQPDVTSYDYDAP 332
Query: 310 IDEYGHLNQPKWGHLREL 327
I+E+G L PK+ L+++
Sbjct: 333 INEWG-LATPKFYELQKM 349
>gi|163790001|ref|ZP_02184436.1| glycosyl hydrolase, family 35 [Carnobacterium sp. AT7]
gi|159874701|gb|EDP68770.1| glycosyl hydrolase, family 35 [Carnobacterium sp. AT7]
Length = 595
Score = 175 bits (443), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 112/329 (34%), Positives = 168/329 (51%), Gaps = 49/329 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++GE ++SG+IHY R P W + K G + +ETY+ WN HE R+YDF+G L
Sbjct: 12 LNGEPFKIISGAIHYFRILPEDWYHSLYNLKALGFNTVETYIPWNVHETKEREYDFSGQL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+ RF++T ++ GL+VILR PY+CAEW +GG P WL + +R+++ F+ ++ ++
Sbjct: 72 DIQRFVQTAKELGLFVILRPSPYICAEWEFGGLPAWLLTYKNM-RIRSSDPQFIEKVSSY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ + L + GGP+I+ Q+ENEYG+ YG+ K Y+ ++ L + VP
Sbjct: 131 YKKLFEQIV--PLQVTSGGPVIMMQLENEYGS----YGE-DKEYLKTLYELMLELGVTVP 183
Query: 218 -------WIMCQESDAPSPM--FTPNN--------------------PNSPKIWTENWTG 248
W QE+ + + T N N P + E W G
Sbjct: 184 IFTSDGAWKATQEAGTMTDLDILTTGNFGSQSKENFKNLKEFHESKGKNWPLMCMEYWGG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF W KR A+DL V + G N YM+HGGTNFG +G P L
Sbjct: 244 WFNRWNDPIIKRDAQDLTNDVKEALKIGSL--NLYMFHGGTNFGFMNGCSARLGKDLPQL 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
TSYDYDAP++E G+ K+ L+++ K
Sbjct: 302 -TSYDYDAPLNEQGNPTN-KYDSLQKMMK 328
Score = 44.3 bits (103), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 54/229 (23%), Positives = 84/229 (36%), Gaps = 60/229 (26%)
Query: 463 RINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSK 522
R+ +H ++N + +Q+ + ++ P+ G NQ+ +L +G NYG K
Sbjct: 403 RVIDGSDRVHFFLNEEKIATQYQE-EIGEKIYGSPIA---GSNQLDVLVENMGRVNYGHK 458
Query: 523 F--DMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGW 580
D GI V+ D I D Y LD F + W
Sbjct: 459 LLADTQQKGIRRGVM-----SDLHFITDWEQ---------YSLD---FLKPLTIDFNEEW 501
Query: 581 SSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGC 640
+N P YK T + P + +N++ GKG VNG+N+GR+W
Sbjct: 502 K-ENAP---SFYQYKVTIDTP---EDTFINMELFGKGIVLVNGFNIGRFWNV-------- 546
Query: 641 STESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
GP S + P+S K G N +++FE G
Sbjct: 547 --------GPTLS--------------LYAPKSLFKKGENEIIVFETEG 573
>gi|395803570|ref|ZP_10482814.1| beta-galactosidase [Flavobacterium sp. F52]
gi|395434124|gb|EJG00074.1| beta-galactosidase [Flavobacterium sp. F52]
Length = 617
Score = 175 bits (443), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 123/396 (31%), Positives = 180/396 (45%), Gaps = 58/396 (14%)
Query: 10 AILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
+L+CL+ F + S DG+ + SG +HY R W ++ K
Sbjct: 10 VVLICLM--PFFTKAQTKGFSISNGEFQKDGKIIKIHSGEMHYERIPKEYWRHRLQMLKA 67
Query: 70 GGLDAIETYVFWNAHEPLRRQYDF-TGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYG 128
GL+ + TYVFWN HE +DF TGN DL F++ + +GLYVILR GPY C EW +G
Sbjct: 68 MGLNTVATYVFWNYHEIEPGVWDFKTGNRDLAEFLRIAKSEGLYVILRPGPYACGEWEFG 127
Query: 129 GFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYG 188
G+P WL N P + +RT NK F++ + + + + K FA+QGGPII+ Q ENE+G
Sbjct: 128 GYPWWLQNNPDL-VIRTNNKAFLDACKTYLEHLYAVVKGN--FANQGGPIIMVQAENEFG 184
Query: 189 NVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFTPNNP------------ 236
+ +S D A+ + + + + +E+ P P FT +
Sbjct: 185 SYVSQRTDI-------SAEDHKAYKTAI-YNILKETGFPEPFFTSDGSWLFEGGMVEGVL 236
Query: 237 ----------------------NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQ 274
P + E + GW W K +E++A ++
Sbjct: 237 PTANGESNIENLKKQVDKYHKGQGPYMVAEFYPGWLDHWAEPFVKIGSEEIASQTKKYLD 296
Query: 275 FGGTFQNYYMYHGGTNFGRTSGGPY--------LTTSYDYDAPIDEYGHLNQPKWGHLRE 326
G +F NYYM HGGTNFG TSG Y TSYDYDAPI E G PK+ +R+
Sbjct: 297 AGVSF-NYYMAHGGTNFGFTSGANYNEESDIQPDITSYDYDAPISEAGWAT-PKFMAIRD 354
Query: 327 LHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAW 362
+ + + + Y N SS ++ +W
Sbjct: 355 VMQKYSKTKLAAIPEKIPVVKYPNQPVKSSMDVLSW 390
>gi|332672111|ref|YP_004455119.1| beta-galactosidase [Cellulomonas fimi ATCC 484]
gi|332341149|gb|AEE47732.1| Beta-galactosidase [Cellulomonas fimi ATCC 484]
Length = 583
Score = 175 bits (443), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 111/319 (34%), Positives = 154/319 (48%), Gaps = 42/319 (13%)
Query: 39 DGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLD 98
DG +LSG++HY R P W D + +A+E GL+ IETY+ WNAH P R ++ G LD
Sbjct: 14 DGTPVRILSGALHYFRHHPDQWRDRLTRARELGLNTIETYIPWNAHSPARGEFRTDGILD 73
Query: 99 LIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFT 158
L RF+ + QG++ I+R GPY+CAEW GG P WL +R ++ +Q++
Sbjct: 74 LGRFLDEVAAQGMWAIVRPGPYICAEWTGGGLPGWLFTAGA--AVRRHEPTYLAAIQDYY 131
Query: 159 TLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM---------A 209
+ + ++ +GGP++L Q+ENEYG YGD K Y+ K+
Sbjct: 132 EAVAGIVAPRQV--DRGGPVVLVQVENEYGA----YGD-DKDYLRALVKLLRESGITTPL 184
Query: 210 TSLDIGVPWIMCQESDAPS---------------PMFTPNNPNSPKIWTENWTGWFKSWG 254
T++D PW M + P + P P + E W GWF SWG
Sbjct: 185 TTIDQPEPW-MLENGSLPELHKTGSFGSRAAERLATLREHQPTGPLMCAEFWDGWFDSWG 243
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG----GPY--LTTSYDYDA 308
A A + G + N YM GGTNFG T+G G Y + TSYDYDA
Sbjct: 244 LHHHTTDAAASAHELDTLLAAGASV-NLYMVCGGTNFGFTNGANDKGTYVPIVTSYDYDA 302
Query: 309 PIDEYGHLNQPKWGHLREL 327
P+DE G W RE+
Sbjct: 303 PLDEAGRPTAKYWA-FREV 320
Score = 43.1 bits (100), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 58/144 (40%), Gaps = 25/144 (17%)
Query: 490 SNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDL 549
S DL E L G + +L G NYG + P G+ GP R G E + +
Sbjct: 415 SRDLGEHRAVLPHG-GALEVLVEDQGRVNYGPRIGE-PKGLIGP----ARVGAEAVTR-- 466
Query: 550 SSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMT---WYKTTFEAPLENDP 606
+ V LDD A ++ P++ + + TF+ P +
Sbjct: 467 ------WGVRPLALDDVTALTAHVRDA--------APVDGVVAGPAFAHATFDTPDPDAD 512
Query: 607 VVLNLQGMGKGFAWVNGYNLGRYW 630
L+ G GKG WVNG+ LGR+W
Sbjct: 513 HFLDTAGWGKGVVWVNGFCLGRFW 536
>gi|336417631|ref|ZP_08597952.1| hypothetical protein HMPREF1017_05060 [Bacteroides ovatus
3_8_47FAA]
gi|335935372|gb|EGM97326.1| hypothetical protein HMPREF1017_05060 [Bacteroides ovatus
3_8_47FAA]
Length = 782
Score = 175 bits (443), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 113/323 (34%), Positives = 158/323 (48%), Gaps = 39/323 (12%)
Query: 34 RAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDF 93
+ ++G+ ++ + IHYPR W IK K G++ I YVFWN HEP +YDF
Sbjct: 33 KTFLLNGKPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMNTICLYVFWNFHEPEEGKYDF 92
Query: 94 TGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNE 153
TG D+ F + Q+ G+YVI+R GPYVCAEW GG P WL I +LR + +M
Sbjct: 93 TGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDI-KLREQDPYYMER 151
Query: 154 MQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLD 213
++ F + + L ++GG II+ Q+ENEYG+ D K YI +
Sbjct: 152 VKLFMNEV--GKQLTDLQINKGGNIIMVQVENEYGSFGID-----KPYIAEIRDIVKQAG 204
Query: 214 I-GVPWIMCQ-----ESDAPSPMFTPNN------------------PNSPKIWTENWTGW 249
GVP C E++A + N P+ P + +E W+GW
Sbjct: 205 FTGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDDQFKRLQELRPDIPLMCSEFWSGW 264
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY-----LTTSY 304
F WG K R+AEDL + +F + YM HGGT+FG G + TSY
Sbjct: 265 FDHWGAKHETRSAEDLVKGMKEMLDRNISF-SLYMTHGGTSFGHWGGANFPNFSPTCTSY 323
Query: 305 DYDAPIDEYGHLNQPKWGHLREL 327
DYDAPI+E G + PK+ +R L
Sbjct: 324 DYDAPINESGKVT-PKYFEVRNL 345
>gi|315606512|ref|ZP_07881527.1| family 35 glycosyl hydrolase [Prevotella buccae ATCC 33574]
gi|315251918|gb|EFU31892.1| family 35 glycosyl hydrolase [Prevotella buccae ATCC 33574]
Length = 787
Score = 174 bits (442), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 116/354 (32%), Positives = 170/354 (48%), Gaps = 41/354 (11%)
Query: 3 TLKH--CSRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMW 60
T KH + A+L+ +L + + + ++G+ ++ + +HYPR W
Sbjct: 2 TFKHFIATVALLVTAMLPPVSAARKGGTFTVGDKTFLLNGKPFVVKAAELHYPRIPRPYW 61
Query: 61 PDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPY 120
IK K G++ + YVFWN HE ++DFTGN D+ F + Q GLYVI+R GPY
Sbjct: 62 EHRIKMCKALGMNTVCLYVFWNIHEQQEGRFDFTGNNDVAEFCRLAQRNGLYVIVRPGPY 121
Query: 121 VCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIIL 180
VCAEW GG P WL I LR + FM ++ F + + + L GGPII+
Sbjct: 122 VCAEWEMGGLPWWLLKKKDI-RLREPDPYFMERVKLFERKVGE--QLASLTIQNGGPIIM 178
Query: 181 AQIENEYGNVMSDYGDAGKSYI--------------------NWCAKMATSLDIGVPWIM 220
Q+ENEYG+ YG+ K+Y+ +W + + + W M
Sbjct: 179 VQVENEYGS----YGE-NKAYVSAIRDIVRQSGFDKVTLFQCDWASNFEKNGLDDLVWTM 233
Query: 221 ----CQESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFG 276
+ D PN+P++ +E W+GWF WG + R A+ + + G
Sbjct: 234 NFGTGADIDQQFRRLGELRPNAPQMCSEFWSGWFDKWGARHETRPAKAMVEGIDEMLSKG 293
Query: 277 GTFQNYYMYHGGTNFGRTSGG--PYL---TTSYDYDAPIDEYGHLNQPKWGHLR 325
+F + YM HGGT+FG +G P TSYDYDAPI+EYG PK+ LR
Sbjct: 294 ISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYGQAT-PKYWELR 345
>gi|365118603|ref|ZP_09337115.1| hypothetical protein HMPREF1033_00461 [Tannerella sp.
6_1_58FAA_CT1]
gi|363649320|gb|EHL88436.1| hypothetical protein HMPREF1033_00461 [Tannerella sp.
6_1_58FAA_CT1]
Length = 823
Score = 174 bits (442), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 109/323 (33%), Positives = 150/323 (46%), Gaps = 27/323 (8%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ I+ + +HYPR W IK K G++ I YVFWN HEP ++DFTG
Sbjct: 77 LNGKPFIIRAAELHYPRIPKPYWEQRIKLCKALGMNTICLYVFWNLHEPRPGEFDFTGQN 136
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL F + Q +YVILR GPYVCAEW GG P WL I LR + F+ + F
Sbjct: 137 DLAAFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDI-RLREADPYFIERVNIF 195
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGN--------------VMSDYGDAGKSYIN 203
+ L GGPII+ Q+ENEYG+ V +++GD +
Sbjct: 196 EQEVARQVG--GLTIQNGGPIIMVQVENEYGSYGESKEYVSLIRDIVRTNFGDVTLFQCD 253
Query: 204 WCAKMATSLDIGVPWIM----CQESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGKDPK 259
W + + + W + D P+SP + +E W+GWF WG
Sbjct: 254 WASNFTKNALPDLLWTINFGTGANIDQQFAGLKKLRPDSPLMCSEFWSGWFDKWGANHET 313
Query: 260 RTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYL---TTSYDYDAPIDEYG 314
R A D+ + G +F + YM HGGTN+G +G P TSYDYDAPI E G
Sbjct: 314 RPASDMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAPISESG 372
Query: 315 HLNQPKWGHLRELHKLLKSMEKT 337
W + L K + ++T
Sbjct: 373 QTTPKYWALRKTLGKYMNGEKQT 395
Score = 43.1 bits (100), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 57/234 (24%), Positives = 86/234 (36%), Gaps = 45/234 (19%)
Query: 457 SSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGL 516
+S+ L +N + +VNG Y+ + G K T Q+ +L +G
Sbjct: 453 TSSAQLTVNEAHDYAQIFVNGKYIGKLDRRNGEKQLTLPACPKGT----QLDILVEAMGR 508
Query: 517 QNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDK-KFYNAKAAN 575
N+G GI V L I DL + + ++ ++D +FY +
Sbjct: 509 INFGRAIKDY-KGITENVELSINIDGYPFICDLKNWE------VFNIEDSYEFYKKMKFH 561
Query: 576 SERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLA 635
R S K+ R Y+ TF+ D LN + GKG +VNGY LGR W
Sbjct: 562 PIR--SLKDKYGQRIPGCYRATFQVKKPGD-TFLNFETWGKGLVYVNGYALGRIWEI--- 615
Query: 636 EEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
P Q Y VP W+K G N +++F+ G
Sbjct: 616 --------------------------GPQQTLY-VPGCWLKKGENEILVFDIIG 642
>gi|422852505|ref|ZP_16899175.1| beta-galactosidase [Streptococcus sanguinis SK150]
gi|325693831|gb|EGD35750.1| beta-galactosidase [Streptococcus sanguinis SK150]
Length = 592
Score = 174 bits (442), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 111/336 (33%), Positives = 162/336 (48%), Gaps = 48/336 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +LSG+I Y R P W D + K G + +ETY+ W HEP Q+ G L
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D + K +++ GLY+I+R PY+CAE+++GG P WL P + LR + +F+ ++ +F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSM-RLRVNHPLFLEKVSHF 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ K + QGGPI++ Q+ENEYG+ D K+Y+ A+M + VP
Sbjct: 131 YDWL--FPKLLPYQSDQGGPILMMQVENEYGSYAED-----KAYMRSIAQMMKVRGVTVP 183
Query: 218 -------WIMCQESDA--PSPMFTPNNPNS--------------------PKIWTENWTG 248
WI ES +F N S P + TE W G
Sbjct: 184 LFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMECYGKKWPLMCTEFWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF W + +R AEDLA V Q G N ++ GGTNFG SG P +
Sbjct: 244 WFSRWSEEIVRREAEDLAQGVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
TSYD+DAPI E+G + + R H++ +E+
Sbjct: 302 -TSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELEQ 336
>gi|228918502|ref|ZP_04081945.1| Beta-galactosidase [Bacillus thuringiensis serovar pulsiensis BGSC
4CC1]
gi|228841118|gb|EEM86317.1| Beta-galactosidase [Bacillus thuringiensis serovar pulsiensis BGSC
4CC1]
Length = 591
Score = 174 bits (442), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 105/319 (32%), Positives = 153/319 (47%), Gaps = 46/319 (14%)
Query: 33 GRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYD 92
G+ +DGE ++SG++HY R P W + K G + +ETYV WN HEP ++
Sbjct: 7 GKDFMLDGEPIKIISGALHYFRIVPEYWDHSLYNLKALGCNTVETYVPWNMHEPKEGVFN 66
Query: 93 FTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMN 152
F G DL+++++ Q GL VILR PY+CAEW +GG P WL I +R+ +F+N
Sbjct: 67 FEGIADLVKYVQLAQKYGLMVILRPTPYICAEWEFGGLPAWLLKYRDI-RVRSNTNLFLN 125
Query: 153 EMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSL 212
+++NF +++ + L GGPII+ Q+ENEYG+ +D K Y+ K+ L
Sbjct: 126 KVENFYKVLLPLVT--SLQVENGGPIIMMQVENEYGSFGND-----KEYVRSIKKLMRDL 178
Query: 213 DIGVP-------WIMCQES----------------------DAPSPMFTPNNPNSPKIWT 243
+ VP W ES +A N P +
Sbjct: 179 GVTVPLFTSDGAWQEALESGSLIDDDVLVTGNFGSRSNENLNALESFIKENKKEWPLMCM 238
Query: 244 ENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL--- 300
E W GWF WG + +R + +LA V + N+YM+ GGTNFG +G
Sbjct: 239 EFWDGWFNRWGMEIIRRDSSELAEEVKELLKRASI--NFYMFQGGTNFGFMNGCSSRENV 296
Query: 301 ----TTSYDYDAPIDEYGH 315
TSYDYDA + E+G
Sbjct: 297 DLPQITSYDYDALLTEWGE 315
Score = 44.7 bits (104), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 55/246 (22%), Positives = 93/246 (37%), Gaps = 61/246 (24%)
Query: 453 ILSGSSNM-TLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLS 511
L G N+ ++ + +H ++N +D+Q+ E + LT+ +N + +L
Sbjct: 392 FLKGPKNIEKCKVVDARDRVHLFLNEQLIDTQYRDEIGR----EVSLDLTKEENTLDILV 447
Query: 512 ATVGLQNYGSKF--DMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFY 569
+G NYG++ GI V+ I L S+ Y + LD+ F
Sbjct: 448 ENMGRVNYGARLLSQTQRKGISSGVM---------IDIHLQSNWEHYALEFDNLDEIDF- 497
Query: 570 NAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRY 629
G N P ++Y+ TF D L+ +GKGF +NG+NLG+Y
Sbjct: 498 --------NGQWEPNTP-----SFYEYTFNVQELKD-TFLDCSKLGKGFVVLNGFNLGKY 543
Query: 630 WPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
W GP G + ++P + G N L++FE G
Sbjct: 544 WDV----------------GPTG--------------YLYIPAPLLIKGENKLIVFETEG 573
Query: 690 GNPSQI 695
++
Sbjct: 574 NYEEEL 579
>gi|21224660|ref|NP_630439.1| beta-galactosidase [Streptomyces coelicolor A3(2)]
gi|3367753|emb|CAA20078.1| beta-galactosidase [Streptomyces coelicolor A3(2)]
Length = 595
Score = 174 bits (442), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 114/322 (35%), Positives = 158/322 (49%), Gaps = 48/322 (14%)
Query: 39 DGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLD 98
+G LL+GS+HY R PG W D +++ GL+A++TYV WN HE F G D
Sbjct: 16 NGRPHRLLAGSLHYFRVHPGHWADRLRRLAALGLNAVDTYVPWNFHERTAGDIRFDGPRD 75
Query: 99 LIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQN-F 157
L RFI+ Q++GL V++R GPY+CAEW+ GG P WL PG+ LRT++ ++ + F
Sbjct: 76 LARFIRLAQEEGLDVVVRPGPYICAEWDNGGLPAWLTGTPGM-RLRTSHGPYLEAVDRWF 134
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
L+ +A +L A +GGP++ QIENEYG+ YGD ++Y+ + I
Sbjct: 135 DALVPRIA---ELQAGRGGPVVAVQIENEYGS----YGD-DRAYVRHIRDALVARGITE- 185
Query: 218 WIMCQESDAPSP--------------------------MFTPNNPNSPKIWTENWTGWFK 251
+ +D P+P + P P E W GWF
Sbjct: 186 --LLYTADGPTPLMQDGGALPGELAAATFGSRPDRAAALLRSRRPAEPFFCAEFWNGWFD 243
Query: 252 SWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY-------LTTSY 304
WG K R A A + GG+ + YM HGGTNFG +G + TSY
Sbjct: 244 HWGDKHHVRPAPSAAEDLGGILDEGGSV-SLYMAHGGTNFGLWAGANHEGGTIRPTVTSY 302
Query: 305 DYDAPIDEYGHLNQPKWGHLRE 326
D DAPI E G L PK+ LR+
Sbjct: 303 DSDAPIAENGALT-PKFFALRD 323
>gi|228950355|ref|ZP_04112522.1| Beta-galactosidase [Bacillus thuringiensis serovar monterrey BGSC
4AJ1]
gi|228809313|gb|EEM55767.1| Beta-galactosidase [Bacillus thuringiensis serovar monterrey BGSC
4AJ1]
Length = 591
Score = 174 bits (442), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 108/341 (31%), Positives = 162/341 (47%), Gaps = 48/341 (14%)
Query: 33 GRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYD 92
G+ +DGE ++SG++HY R P W + K G + +ETYV WN HEP ++
Sbjct: 7 GKDFMLDGEPIKIISGALHYFRIVPEYWDHSLYNLKALGCNTVETYVPWNIHEPKEGVFN 66
Query: 93 FTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMN 152
F G DL+++++ Q GL VILR PY+CAEW +GG P WL I +R+ +F++
Sbjct: 67 FEGIADLVKYVQLAQKYGLMVILRPTPYICAEWEFGGLPAWLLKYKDI-RVRSNTNLFLD 125
Query: 153 EMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSL 212
+++NF +++ M L GGPII+ Q+ENEYG+ +D K Y+ K+ L
Sbjct: 126 KVENFYKVLLPMVT--PLQVENGGPIIMMQVENEYGSFGND-----KEYVRSIKKIMRDL 178
Query: 213 DIGVP-------WIMCQES----------------------DAPSPMFTPNNPNSPKIWT 243
D+ VP W ES + N P +
Sbjct: 179 DVTVPLFTSDGAWQEALESGSLIDDDVLVTGNFGSRSNENLNELESFIKENKKEWPLMCM 238
Query: 244 ENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG------ 297
E W GWF WG + +R +LA V + N+YM+ GGTNFG +G
Sbjct: 239 EFWDGWFNRWGMEIIRRDGSELAEEVKELLKRASI--NFYMFQGGTNFGFMNGCSSRENV 296
Query: 298 --PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
P + TSYDYDA + E+G + R + ++ +E+
Sbjct: 297 DLPQI-TSYDYDALLTEWGEPTPKYYAVQRVIKEVCSDVEQ 336
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 55/244 (22%), Positives = 94/244 (38%), Gaps = 57/244 (23%)
Query: 453 ILSGSSNMT-LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLS 511
L G N+ ++ + +H ++N +D+Q+ E + LT+ +N + +L
Sbjct: 392 FLKGPKNIEKCKVVDARDRVHMFLNEQLIDTQYRDEIGR----EVSLDLTKEENTLDILV 447
Query: 512 ATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNA 571
+G NYG++ + P G + I L S+ Y + LD+ F
Sbjct: 448 ENMGRVNYGAR-------LLSPTQRKGISSGVMIDIHLQSNWEHYALEFDNLDEIDF--- 497
Query: 572 KAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWP 631
G N P ++Y+ TF ND L+ +GKGF +NG+NLG+YW
Sbjct: 498 ------NGQWEPNTP-----SFYEYTFNVQELND-TFLDCSKLGKGFVVLNGFNLGKYWD 545
Query: 632 TYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGN 691
GP G + ++P + G N L++FE G
Sbjct: 546 V----------------GPTG--------------YLYIPAPLLIKGENNLIVFETEGNY 575
Query: 692 PSQI 695
++
Sbjct: 576 EEEL 579
>gi|289768016|ref|ZP_06527394.1| beta-galactosidase [Streptomyces lividans TK24]
gi|289698215|gb|EFD65644.1| beta-galactosidase [Streptomyces lividans TK24]
Length = 595
Score = 174 bits (442), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 114/322 (35%), Positives = 158/322 (49%), Gaps = 48/322 (14%)
Query: 39 DGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLD 98
+G LL+GS+HY R PG W D +++ GL+A++TYV WN HE F G D
Sbjct: 16 NGRPHRLLAGSLHYFRVHPGHWADRLRRLAALGLNAVDTYVPWNFHERTAGDIRFDGPRD 75
Query: 99 LIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQN-F 157
L RFI+ Q++GL V++R GPY+CAEW+ GG P WL PG+ LRT++ ++ + F
Sbjct: 76 LARFIRLAQEEGLDVVVRPGPYICAEWDNGGLPAWLTGTPGM-RLRTSHGPYLEAVDRWF 134
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
L+ +A +L A +GGP++ QIENEYG+ YGD ++Y+ + I
Sbjct: 135 DALVPRIA---ELQAGRGGPVVAVQIENEYGS----YGD-DRAYVRHIRDALVARGITE- 185
Query: 218 WIMCQESDAPSP--------------------------MFTPNNPNSPKIWTENWTGWFK 251
+ +D P+P + P P E W GWF
Sbjct: 186 --LLYTADGPTPLMQDGGALPGELAAATFGSRPDRAAALLRSRRPAEPFFCAEFWNGWFD 243
Query: 252 SWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY-------LTTSY 304
WG K R A A + GG+ + YM HGGTNFG +G + TSY
Sbjct: 244 HWGDKHHVRPAPSAAEDLGGILDEGGSV-SLYMAHGGTNFGLWAGANHEGGTIRPTVTSY 302
Query: 305 DYDAPIDEYGHLNQPKWGHLRE 326
D DAPI E G L PK+ LR+
Sbjct: 303 DSDAPIAENGALT-PKFFALRD 323
>gi|227518994|ref|ZP_03949043.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|227553614|ref|ZP_03983663.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|293383402|ref|ZP_06629315.1| beta-galactosidase [Enterococcus faecalis R712]
gi|293388945|ref|ZP_06633430.1| beta-galactosidase [Enterococcus faecalis S613]
gi|312907770|ref|ZP_07766761.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|312910388|ref|ZP_07769235.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|422714384|ref|ZP_16771110.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|422715641|ref|ZP_16772357.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|424676529|ref|ZP_18113400.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|424681657|ref|ZP_18118444.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|424683847|ref|ZP_18120597.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|424686250|ref|ZP_18122918.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|424690479|ref|ZP_18127014.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|424695572|ref|ZP_18131955.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|424696689|ref|ZP_18133030.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|424699924|ref|ZP_18136135.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|424703062|ref|ZP_18139196.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|424707441|ref|ZP_18143425.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|424716899|ref|ZP_18146197.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|424720477|ref|ZP_18149578.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|424724025|ref|ZP_18152974.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|424733616|ref|ZP_18162171.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|424744084|ref|ZP_18172389.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|424750408|ref|ZP_18178472.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
gi|227073566|gb|EEI11529.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|227177262|gb|EEI58234.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|291079193|gb|EFE16557.1| beta-galactosidase [Enterococcus faecalis R712]
gi|291081726|gb|EFE18689.1| beta-galactosidase [Enterococcus faecalis S613]
gi|310626798|gb|EFQ10081.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|311289661|gb|EFQ68217.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|315575986|gb|EFU88177.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|315580706|gb|EFU92897.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|402350756|gb|EJU85654.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|402356541|gb|EJU91272.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|402364212|gb|EJU98655.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|402364322|gb|EJU98764.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|402367784|gb|EJV02121.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|402368267|gb|EJV02587.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|402375423|gb|EJV09410.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|402377018|gb|EJV10929.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|402385039|gb|EJV18580.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|402385067|gb|EJV18607.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|402386247|gb|EJV19753.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|402391229|gb|EJV24540.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|402392948|gb|EJV26178.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|402396006|gb|EJV29081.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|402399507|gb|EJV32379.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|402406707|gb|EJV39253.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
Length = 604
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 22 LNGQSFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 81
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 82 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 139
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 192
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 193 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 249
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 250 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 307
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 308 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|194213011|ref|XP_001503026.2| PREDICTED: beta-galactosidase-1-like protein 3-like [Equus
caballus]
Length = 880
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 118/358 (32%), Positives = 177/358 (49%), Gaps = 43/358 (12%)
Query: 36 ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTG 95
T++G + ++ GSIHY R W D + K K G + + TYV WN HEP R ++DF+G
Sbjct: 248 FTLEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPERGRFDFSG 307
Query: 96 NLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQ 155
NLDL F+ T + GL+VILR GPY+C+E + GG P L P + LRTT+K F+ +
Sbjct: 308 NLDLEAFVLTAAEIGLWVILRPGPYICSEIDLGGLPSRLLQDPQV-NLRTTDKGFVEAVD 366
Query: 156 NFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIG 215
+ ++ ++ L +GGPII Q+ENEYG+ D K Y+ + + I
Sbjct: 367 KYFDHLI--SRVVHLQYRKGGPIIAVQVENEYGSFYKD-----KDYMPYLQQALLKRGI- 418
Query: 216 VPWIMCQES-----------------------DAPSPMFTPNNPNSPKIWTENWTGWFKS 252
V ++ ++ DA ++ + P + E W GWF +
Sbjct: 419 VELLLTSDNVDDVLKGYIKGVLATINMKKFRKDAFQHLYKVQR-DKPIMIMEYWVGWFDT 477
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG------PYLTTSYDY 306
WG K + A D+ V+ F +F +F N YM+HGGTNFG +G + TSYDY
Sbjct: 478 WGSKHEVKDAGDVKNTVSEFIKFEISF-NVYMFHGGTNFGFINGAINFVKHAGVVTSYDY 536
Query: 307 DAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSV 364
DA + E G + K+ LR+L + ++ Y + + SS+ LP W V
Sbjct: 537 DAVLTEAGDYTK-KYFKLRKLFGSILAVPLPPLPELTPKAVYPS--TRSSHYLPLWDV 591
>gi|363742521|ref|XP_003642647.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
2-like [Gallus gallus]
Length = 637
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 191/713 (26%), Positives = 286/713 (40%), Gaps = 171/713 (23%)
Query: 15 LILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDA 74
L+ L+ +L + H ++G + GS+HY R W D + K K GL+
Sbjct: 34 LVPLRLWGRTLGLQTEHS--QFLLEGMPFRIFGGSVHYFRVPREYWEDRMLKMKACGLNT 91
Query: 75 IETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWL 134
+ TYV WN HE R ++DF+ NLDL F+ GL+VILR GPY+C+EW+ GG P WL
Sbjct: 92 LTTYVPWNLHEQTRGKFDFSENLDLQAFLSLAAKNGLWVILRPGPYICSEWDLGGLPSWL 151
Query: 135 HNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDY 194
P + +LRTT K F + + ++ + L +GGPII Q+ENEYG+ D
Sbjct: 152 LQDPEM-QLRTTYKGFTEAVDAYFDHLMPIVV--PLQYKRGGPIIAVQVENEYGSYAKD- 207
Query: 195 GDAGKSYINWCAKMATSLDIGVPWIMCQES-------------------DAPSPMFT--- 232
+Y+ + + S I V +M ++ + P + T
Sbjct: 208 ----PNYMAYVKRALLSRGI-VELLMTSDNKNGLSFGLVEGALATVNFQNLPLSILTLFL 262
Query: 233 -PNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNF 291
+ PK+ E WTGWF +WGG A+++ VA + G + N YM+HGGTNF
Sbjct: 263 FXVQRDQPKMVMEYWTGWFDNWGGPHYVFDADEMVNTVASILKLGASI-NLYMFHGGTNF 321
Query: 292 GRTSGGPYL------TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTN 345
G +G TSYDYDA + E G K+ LR+L + L +
Sbjct: 322 GFMNGALKTDEYKSDVTSYDYDAVLTEAGDYTS-KFFKLRQLFSTIIGQPLPLPPMIESK 380
Query: 346 TDYGNSVSGSSYNLPAWSV--SILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQW 403
YG + +L W V S++ K+ EF N Q N + +G
Sbjct: 381 ASYGAILLHQYISL--WDVLPSLVQPIKS-EFPVNMENLQLN------DSSGQSYG---- 427
Query: 404 KWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLR 463
++ + V+ G GH S + V D
Sbjct: 428 ----YVLYETVIFGGGHL--------HSRDHVRD-------------------------- 449
Query: 464 INSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF 523
QV +VN YV + E + +G Q+ LL G NYG
Sbjct: 450 ---RAQV---FVNTMYVGE------LDYNTVELSLPEGQGFRQLRLLVENRGRVNYGLAL 497
Query: 524 DMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSK 583
+ G+ G + L ++T +++ +Y L+ K + + + GWS+
Sbjct: 498 NEQRKGLIGDIFL-----NKTPLRNFK---------IYSLEMKPDFLKRFVGTA-GWSA- 541
Query: 584 NVP-------LNRRMTWYKTTFEAPLENDP--VVLNLQGMGKGFAWVNGYNLGRYWPTYL 634
VP R W +E+ P L LQG KG +VNG+NLGRYW
Sbjct: 542 -VPDYFVGPAFFRGRLW--------IEHQPQDTFLKLQGWEKGVVFVNGHNLGRYWKI-- 590
Query: 635 AEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEE 687
GP Q ++P W++ G N +++FEE
Sbjct: 591 --------------GP--------------QETLYLPGPWLQKGSNEIIIFEE 615
>gi|443689405|gb|ELT91801.1| hypothetical protein CAPTEDRAFT_23316, partial [Capitella teleta]
Length = 596
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 108/333 (32%), Positives = 162/333 (48%), Gaps = 49/333 (14%)
Query: 35 AITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFT 94
+ +DG R + SGS HY R+ P +W D + + K GL+ + TYV WN HEP + Q+
Sbjct: 8 SFYLDGRRFKIFSGSFHYFRTHPLLWGDRLLRMKAAGLNTVMTYVPWNFHEPRKGQFTLG 67
Query: 95 GNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEM 154
G DL+ F++ +Q GLY+I+R GPY+CAEW +GGFP WL P + ++ ++NE+
Sbjct: 68 GLYDLVSFMEQVQKVGLYLIVRPGPYICAEWEFGGFPSWLLRDPKMNLRTSSYTPYLNEV 127
Query: 155 QNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDI 214
+ + + + A K GGPII Q+ENE+G+ G Y+ + +S ++
Sbjct: 128 KQYLSQL--FAVLTKFTYKHGGPIIAFQVENEFGSK----GVHDPEYLQFLVTQYSSWNL 181
Query: 215 GVPWIMCQESDA---------PSPMFTPN---------------NPNSPKIWTENWTGWF 250
+ SD P + T N P P + TE W GWF
Sbjct: 182 NE---LLFTSDGKKYLSNGTLPDVLATINLNDHAKEDLEELKEFQPERPLMVTEFWAGWF 238
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT--------- 301
WG + +L + + N+YM+ GGTNFG +G YL+
Sbjct: 239 DHWGEEHHHYGTTELERELEAILSLNASV-NFYMFIGGTNFGFWNGANYLSYNKDKEASL 297
Query: 302 -----TSYDYDAPIDEYGHLNQPKWGHLRELHK 329
TSYDYDA + E+GH+ +PK+ +R L K
Sbjct: 298 LGPTVTSYDYDAAVSEWGHV-KPKYNVIRNLLK 329
>gi|426371159|ref|XP_004052521.1| PREDICTED: beta-galactosidase-1-like protein 3 [Gorilla gorilla
gorilla]
Length = 653
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 118/351 (33%), Positives = 174/351 (49%), Gaps = 49/351 (13%)
Query: 20 LFNLSLAYRVSHDGRA---ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIE 76
L N S+ GR T++G + ++ GSIH R W D + K K G + +
Sbjct: 61 LKNRSVGLGTESPGRGKPHFTLEGHKFLIFGGSIHCFRVPREYWRDRLLKLKACGFNTVT 120
Query: 77 TYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHN 136
TYV WN HEP R ++DF+GNLDL F+ + GL+VILR GPY+C+E + GG P WL
Sbjct: 121 TYVPWNLHEPERGKFDFSGNLDLEAFVLMGAEIGLWVILRPGPYICSEMDLGGLPSWLLQ 180
Query: 137 MPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGD 196
P + LRTTNK F+ ++ + ++ + L QGGP+I Q+ENEYG+ D
Sbjct: 181 DPRL-LLRTTNKSFIEAVEKYFDHLI--PRVIPLQYRQGGPVIAVQVENEYGSFKKD--- 234
Query: 197 AGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFTPNNP-------------------- 236
K+Y+ + K L G+ ++ SD + + +
Sbjct: 235 --KTYMLYLHKAL--LRRGIVELLLT-SDGEKHVLSGHTKGVLAAINLQKLHQDTFNQLH 289
Query: 237 ----NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG 292
+ P + E W GWF WG K + A+++ AV+ F ++ +F N YM+HGGTNFG
Sbjct: 290 KVQRDKPLLIMEYWVGWFDRWGDKHHVKDAKEVEHAVSEFIKYEISF-NVYMFHGGTNFG 348
Query: 293 RTSGGPY------LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKT 337
+G Y + TSYDYDA + E G + +L KL +S+ T
Sbjct: 349 FMNGATYFGKHSGIVTSYDYDAVLTEAGDYTE----KYLKLQKLFQSVSAT 395
>gi|422861007|ref|ZP_16907651.1| beta-galactosidase [Streptococcus sanguinis SK330]
gi|327468658|gb|EGF14137.1| beta-galactosidase [Streptococcus sanguinis SK330]
Length = 592
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 111/336 (33%), Positives = 162/336 (48%), Gaps = 48/336 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +LSG+I Y R P W D + K G + +ETY+ W HEP Q+ G L
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D + K +++ GLY+I+R PY+CAE+++GG P WL P + LR + +F+ ++ +F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSM-RLRVNHPLFLEKVSHF 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ K + QGGPI++ Q+ENEYG+ D K+Y+ A+M + VP
Sbjct: 131 YDWL--FPKLLPYQSDQGGPILMMQVENEYGSYAED-----KAYMRSIAQMMKVRGVSVP 183
Query: 218 -------WIMCQESDA--PSPMFTPNNPNS--------------------PKIWTENWTG 248
WI ES +F N S P + TE W G
Sbjct: 184 LFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF W + +R AEDLA V Q G N ++ GGTNFG SG P +
Sbjct: 244 WFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
TSYD+DAPI E+G + + R H++ +E+
Sbjct: 302 -TSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELEQ 336
>gi|422698394|ref|ZP_16756303.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1346]
gi|315173078|gb|EFU17095.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1346]
Length = 604
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 107/329 (32%), Positives = 160/329 (48%), Gaps = 49/329 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 22 LNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 81
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 82 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 139
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 192
Query: 218 WIMC-----------------------------QESDAPSPMFTPNNPNSPKIWTENWTG 248
+ + D F + P + E W G
Sbjct: 193 FFTSDGPWRATLRAGSMIEDDILVTGNFGSKAKENFDMMQAFFEEHGKKWPLMCMEFWDG 252
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF W KR ++LA +V G N YM+HGGTNFG +G P +
Sbjct: 253 WFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI 310
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
TSYDYDAP+DE G+ + + + LH+
Sbjct: 311 -TSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|29376349|ref|NP_815503.1| glycosyl hydrolase [Enterococcus faecalis V583]
gi|256961697|ref|ZP_05565868.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|257419527|ref|ZP_05596521.1| beta-galactosidase [Enterococcus faecalis T11]
gi|29343812|gb|AAO81573.1| glycosyl hydrolase, family 35 [Enterococcus faecalis V583]
gi|256952193|gb|EEU68825.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|257161355|gb|EEU91315.1| beta-galactosidase [Enterococcus faecalis T11]
Length = 594
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 12 LNGQSFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 72 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 182
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 183 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 239
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 240 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 297
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 298 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|401681814|ref|ZP_10813709.1| glycosyl hydrolase family 35 [Streptococcus sp. AS14]
gi|400185120|gb|EJO19350.1| glycosyl hydrolase family 35 [Streptococcus sp. AS14]
Length = 592
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 111/336 (33%), Positives = 162/336 (48%), Gaps = 48/336 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +LSG+I Y R P W D + K G + +ETY+ W HEP Q+ G L
Sbjct: 12 LDGQPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D + K +++ GLY+I+R PY+CAE+++GG P WL P + LR + +F+ ++ +F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSM-RLRVNHPLFLEKVSHF 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ K + QGGPI++ Q+ENEYG+ D K+Y+ A+M + VP
Sbjct: 131 YDWL--FPKLLPYQSDQGGPILMMQVENEYGSYAED-----KAYMRSIAQMMKVRGVTVP 183
Query: 218 -------WIMCQESDA--PSPMFTPNNPNS--------------------PKIWTENWTG 248
WI ES +F N S P + TE W G
Sbjct: 184 LFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRSFMERYGKKWPLMCTEFWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF W + +R AEDLA V Q G N ++ GGTNFG SG P +
Sbjct: 244 WFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
TSYD+DAPI E+G + + R H++ +E+
Sbjct: 302 -TSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELEQ 336
>gi|423212381|ref|ZP_17198910.1| hypothetical protein HMPREF1074_00442 [Bacteroides xylanisolvens
CL03T12C04]
gi|392694827|gb|EIY88053.1| hypothetical protein HMPREF1074_00442 [Bacteroides xylanisolvens
CL03T12C04]
Length = 725
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 182/670 (27%), Positives = 271/670 (40%), Gaps = 168/670 (25%)
Query: 50 IHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQ 109
+HYPR W D +K+A+ GL+ + YVFWN HE ++DFTG D+ F++T Q++
Sbjct: 1 MHYPRIPHEYWRDRLKRARAMGLNTVSAYVFWNFHERQPGEFDFTGQADIAEFVRTAQEE 60
Query: 110 GLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKK-E 168
GLYVILR GPYVCAEW++GG+P WL + R+ + F++ + + I ++ K+
Sbjct: 61 GLYVILRPGPYVCAEWDFGGYPSWLLKEKDM-IYRSKDPRFLSYCERY---IKELGKQLS 116
Query: 169 KLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQ------ 222
L + GG II+ Q+ENEYG+ +D K Y+ M VP C
Sbjct: 117 SLTINNGGNIIMVQVENEYGSYAAD-----KEYLAAIRDMIKEAGFNVPLFTCDGGGQVE 171
Query: 223 ----ESDAPS----------PMFTPNNPNSPKIWTENWTGWFKSWGGKDP----KRTAED 264
E P+ + + P E + WF WG + +R AE
Sbjct: 172 AGHIEGALPTLNGVFGEDIFKVVDNYHKGGPYFVAEFYPAWFDEWGKRHSSVAYERPAEQ 231
Query: 265 LAFAVARFFQFGGTFQNYYMYHGGTNF----GRTSGGPY--LTTSYDYDAPIDEYGHLNQ 318
L + ++ G + YM+HGGTNF G +GG Y TSYDYDAP+ E+G+
Sbjct: 232 LDWMLSH-----GVSVSMYMFHGGTNFWYTNGANTGGGYQPQPTSYDYDAPLGEWGNC-Y 285
Query: 319 PKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTA 378
PK+ RE+ ++K L G V LP + D T F T
Sbjct: 286 PKYHAFREV------IQKYLPEGTV---------------LP----EVPADNPTTTFATV 320
Query: 379 KVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDY 438
++ +K + ++ ++ DF G+ T I +
Sbjct: 321 ELKESAPLKTAFHHTTQSENVLSM----EDLGVDF-----GYIHYQTTIQKAGKQK---- 367
Query: 439 LWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPV 498
+ DL+D IL ++G V S +Y ++ V
Sbjct: 368 ---LVIQDLRDYAVIL--------------------IDGKQVASLDRRYNQNS------V 398
Query: 499 KLTRGKNQISL--LSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTY 556
L K SL L G NYG GI VL GDE
Sbjct: 399 TLNVAKTPASLEILVENTGRVNYGPDILFNRKGITNQVLW----GDE------------- 441
Query: 557 KVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGK 616
K+ + + Y ++ G + K+VP ++K TF + D V ++ GK
Sbjct: 442 KLTGWSITPLPLYKENVSDINFGGTIKDVP-----AFHKGTFTIQKKGDCFV-DMSRWGK 495
Query: 617 GFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIK 676
G WVNG +LGR+W N G P Q Y +P W+K
Sbjct: 496 GAVWVNGKSLGRFW----------------------------NIG-PQQTLY-LPAPWLK 525
Query: 677 DGVNTLVLFE 686
+G N +++FE
Sbjct: 526 EGENEIIVFE 535
>gi|297483826|ref|XP_002693891.1| PREDICTED: galactosidase, beta 1-like 3 [Bos taurus]
gi|296479482|tpg|DAA21597.1| TPA: galactosidase, beta 1-like [Bos taurus]
Length = 899
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 113/323 (34%), Positives = 163/323 (50%), Gaps = 41/323 (12%)
Query: 36 ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTG 95
T++G ++L GS+HY R W D + K + G + + TYV WN HEP R +DF+G
Sbjct: 321 FTLEGHEFLILGGSVHYFRVPRASWRDRLLKLRACGFNTVTTYVPWNLHEPERGTFDFSG 380
Query: 96 NLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQ 155
NLDL FI ++ GL+VILR GPY+C+E + GG P WL P +LRTTN+ F+N +
Sbjct: 381 NLDLEAFILLAEEVGLWVILRPGPYICSEMDLGGLPSWLLQDP-TSQLRTTNRSFVNAVN 439
Query: 156 N-FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDI 214
F LI +A + L QGGPII Q+ENEYG D ++Y+ + + I
Sbjct: 440 KYFDHLIPRVALLQYL---QGGPIIAVQVENEYGFFYKD-----EAYMPYLLQALQQRGI 491
Query: 215 GVPWIMCQES----------------------DAPSPMFTPNNPNSPKIWTENWTGWFKS 252
G + + D+ ++ + P + E W GWF +
Sbjct: 492 GGLLLTADSTEEVMRGHIKGVLASINMKGFKVDSFKHLYKLQR-HKPILIMEFWVGWFDT 550
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY------LTTSYDY 306
WG ++ +V+ F ++G +F N YM+HGGTNFG +G +TTSYDY
Sbjct: 551 WGIDHRVMGVNEVEKSVSEFIRYGISF-NVYMFHGGTNFGFMNGATSFEKHRGVTTSYDY 609
Query: 307 DAPIDEYGHLNQPKWGHLRELHK 329
DA + E G K+ LR L +
Sbjct: 610 DAVLTEAGDYTA-KYFMLRSLFE 631
>gi|319934802|ref|ZP_08009247.1| beta-galactosidase [Coprobacillus sp. 29_1]
gi|319810179|gb|EFW06541.1| beta-galactosidase [Coprobacillus sp. 29_1]
Length = 589
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 113/351 (32%), Positives = 167/351 (47%), Gaps = 53/351 (15%)
Query: 36 ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTG 95
+DG+ +LSG+IHY R P W D + K G + +ETY+ WN HEP ++DF G
Sbjct: 10 FIVDGKPIKILSGAIHYFRIVPKHWEDSLYNLKALGFNTVETYIPWNLHEPKEGEFDFQG 69
Query: 96 NLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQ 155
D++ FIK Q+ L VI+R PY+CAEW +GG P WL + LR+ ++ +++
Sbjct: 70 IKDVVSFIKKAQEMELMVIVRPSPYICAEWEFGGLPAWLLTYDNL-HLRSDCPRYLEKVK 128
Query: 156 NFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIG 215
N+ +++ M L ++QGGPII+ Q+ENE+G+ ++ K+Y+ K+ L +
Sbjct: 129 NYYEVLLPMLT--SLQSTQGGPIIMMQVENEFGSFSNN-----KTYLKKLKKIMLDLGVE 181
Query: 216 VP-------WIMCQES----------------------DAPSPMFTPNNPNSPKIWTENW 246
VP W ES D + P + E W
Sbjct: 182 VPLFTSDGSWQQALESGSLIDDDVLVTANFGSHSHENLDVLEQFMANHQKKWPLMSMEFW 241
Query: 247 TGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------ 300
GWF WG + R A+DLA V G N YM+HGGTNFG +G
Sbjct: 242 DGWFNRWGEEIITRDAQDLANCVKELLTRGSI--NLYMFHGGTNFGFMNGCSARGQKDLP 299
Query: 301 -TTSYDYDAPIDEYGHLN---QPKWGHLRELHKLLKSMEKTL----TYGNV 343
TSYDYDA + E G + Q ++EL ++ ME + +YG +
Sbjct: 300 QVTSYDYDALLTEAGDITEKYQCVKKVMKELFPDIQQMEPRMREKKSYGTI 350
Score = 39.3 bits (90), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 31/128 (24%), Positives = 54/128 (42%), Gaps = 37/128 (28%)
Query: 570 NAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRY 629
N + N ++GW +N P ++Y+ F E ++ +GKG ++NG++LGRY
Sbjct: 490 NIENINFDKGWQ-ENTP-----SFYEFVFNVD-ECQDTFIDCHQLGKGCIFINGFHLGRY 542
Query: 630 WPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
W RGP + ++P +K G+N +++FE G
Sbjct: 543 WS----------------RGPIE--------------YLYLPGPLLKKGMNQIIVFETEG 572
Query: 690 GNPSQINF 697
+ I F
Sbjct: 573 VAMNNITF 580
>gi|395520729|ref|XP_003764476.1| PREDICTED: beta-galactosidase-1-like protein 2 [Sarcophilus
harrisii]
Length = 704
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 117/323 (36%), Positives = 160/323 (49%), Gaps = 37/323 (11%)
Query: 32 DGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQY 91
+G ++G + GSIHY R W D + K K GL+ + TY+ WN HEP R ++
Sbjct: 118 EGPNFLLEGSHFQIFGGSIHYFRVPREYWRDRLLKLKACGLNTLTTYIPWNLHEPERGKF 177
Query: 92 DFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFM 151
+F+GNLD+ F++ D GL+VILR GPY+C+EW+ GG P WL +E LRTT F+
Sbjct: 178 NFSGNLDVEAFVQMAADIGLWVILRPGPYICSEWDLGGLPSWLLQDSSME-LRTTYAGFL 236
Query: 152 NEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATS 211
+ + ++ + L QGGPII Q+ENEYG+ D +Y+ + K S
Sbjct: 237 KAVDRYFNHLI--PRVVPLQYKQGGPIIAVQVENEYGSY-----DKDSNYMPYIKKALMS 289
Query: 212 LDIGVPWIMCQESDAPSPMFTPN---------------------NPNSPKIWTENWTGWF 250
I + D S + N P + TE WTGWF
Sbjct: 290 RGINELLMTSDNKDGLSGGYLEGVLATVNLKHVDSMIFNYLHSFQENKPTMVTEYWTGWF 349
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG----GPYL--TTSY 304
+WGG A+D+ V+ Q G + N YM+HGGTNFG +G G YL TSY
Sbjct: 350 DTWGGPHNIVDADDVVVTVSSIIQMGASL-NLYMFHGGTNFGFMNGAQHFGEYLADVTSY 408
Query: 305 DYDAPIDEYGHLNQPKWGHLREL 327
DYDA + E G PK+ LRE
Sbjct: 409 DYDAILTEAGDYT-PKFFKLREF 430
Score = 41.2 bits (95), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 48/195 (24%), Positives = 80/195 (41%), Gaps = 55/195 (28%)
Query: 497 PVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTY 556
P+ +G ++S+L G NYG K + G+ G + L +E+ +++
Sbjct: 539 PIPEYQGHRKLSILVENRGRVNYGQKLNEQRKGLIGDIYL-----NESPLRNFK------ 587
Query: 557 KVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLN----LQ 612
+Y L+ K+ N + S W+ VP + F L D +VL+ L+
Sbjct: 588 ---IYSLEMKE--NFFQSLSSIKWN--QVPEEATGPAF---FRGTLHIDSIVLDTFLKLE 637
Query: 613 GMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPR 672
G KG ++NG NLGR+W N G P + Y +P
Sbjct: 638 GWFKGVVFINGQNLGRFW----------------------------NIG-PQETLY-LPG 667
Query: 673 SWIKDGVNTLVLFEE 687
W++ G N +++FEE
Sbjct: 668 PWLRPGNNEIIVFEE 682
>gi|225872227|ref|YP_002753682.1| glycosyl hydrolase [Acidobacterium capsulatum ATCC 51196]
gi|225791474|gb|ACO31564.1| glycosyl hydrolase, family 35 [Acidobacterium capsulatum ATCC
51196]
Length = 664
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 113/309 (36%), Positives = 158/309 (51%), Gaps = 35/309 (11%)
Query: 36 ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTG 95
+DG+ ++SG +HY R W ++ AK GL+ I TYVFWN HEP ++DF+G
Sbjct: 37 FVLDGQPFQIISGEMHYERIPRAYWKARLQMAKAMGLNTIATYVFWNLHEPEPGKFDFSG 96
Query: 96 NLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIE-ELRTTNKVFMNEM 154
N DL +FI+ Q GL V+LR GPY CAEW +GGFP WL P ++ LR+ + FM
Sbjct: 97 NADLAQFIRDAQQTGLKVLLRAGPYSCAEWEFGGFPAWLMKNPKMQTALRSNDPEFMKPA 156
Query: 155 QNFTTLIVDMAKK-EKLFASQGGPIILAQIENEYGNVMSD----------YGDAG-KSYI 202
+ + I+ + ++ L GGPII QIENEYG+ D + AG +
Sbjct: 157 EQW---ILRLGREVAPLQVGYGGPIIGVQIENEYGDFGGDAAYLEHLKKIFLKAGFTQSL 213
Query: 203 NWCAKMATSLDIG-VPWIMCQESDAPSPM------FTPNNPNSPKIWTENWTGWFKSWGG 255
+ A + +L G +P + + AP P + +E WTGWF WG
Sbjct: 214 LYTANPSRALVRGSIPGVYSAVNFAPGHAAQALDSLAQLRAGQPLLSSEYWTGWFDHWG- 272
Query: 256 KDPKRTAEDLAFAVARF--FQFGGTFQNYYMYHGGTNFGRTSGGPYL-------TTSYDY 306
+P ++ + L+ V F G N YM+HGGT+FG SG + TSYDY
Sbjct: 273 -EPHQS-KPLSLQVKDFNYILRHGAGVNLYMFHGGTSFGMMSGSSWTKHQFLPDVTSYDY 330
Query: 307 DAPIDEYGH 315
AP+DE GH
Sbjct: 331 GAPLDEAGH 339
Score = 43.1 bits (100), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 34/139 (24%), Positives = 63/139 (45%), Gaps = 31/139 (22%)
Query: 504 KNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKW-TYKVGL-- 560
K ++ +L G N G+ GPV+L GRA H W TY++ +
Sbjct: 460 KTRLDILVENSGRINSTRMMLHANKGLMGPVMLAGRA----------LHGWKTYRLPMKP 509
Query: 561 ------YGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPV---VLNL 611
G+ + +N K+ ++ + + P +Y+ TF ++ + L++
Sbjct: 510 DTIADPLGMPQETHFNEKSTPAQ----AMSGP-----AFYRGTFRVETKSKQIPDTFLDI 560
Query: 612 QGMGKGFAWVNGYNLGRYW 630
+G+GKG W++G+ +GRYW
Sbjct: 561 RGLGKGAVWIDGHPIGRYW 579
>gi|358415935|ref|XP_600640.6| PREDICTED: uncharacterized protein LOC522360 [Bos taurus]
Length = 1360
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 113/323 (34%), Positives = 163/323 (50%), Gaps = 41/323 (12%)
Query: 36 ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTG 95
T++G ++L GS+HY R W D + K + G + + TYV WN HEP R +DF+G
Sbjct: 321 FTLEGHEFLILGGSVHYFRVPRASWRDRLLKLRACGFNTVTTYVPWNLHEPERGTFDFSG 380
Query: 96 NLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQ 155
NLDL FI ++ GL+VILR GPY+C+E + GG P WL P +LRTTN+ F+N +
Sbjct: 381 NLDLEAFILLAEEVGLWVILRPGPYICSEMDLGGLPSWLLQDP-TSQLRTTNRSFVNAVN 439
Query: 156 N-FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDI 214
F LI +A + L QGGPII Q+ENEYG D ++Y+ + + I
Sbjct: 440 KYFDHLIPRVALLQYL---QGGPIIAVQVENEYGFFYKD-----EAYMPYLLQALQQRGI 491
Query: 215 GVPWIMCQES----------------------DAPSPMFTPNNPNSPKIWTENWTGWFKS 252
G + + D+ ++ + P + E W GWF +
Sbjct: 492 GGLLLTADSTEEVMRGHIKGVLASINMKGFKVDSFKHLYKLQR-HKPILIMEFWVGWFDT 550
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY------LTTSYDY 306
WG ++ +V+ F ++G +F N YM+HGGTNFG +G +TTSYDY
Sbjct: 551 WGIDHRVMGVNEVEKSVSEFIRYGISF-NVYMFHGGTNFGFMNGATSFEKHRGVTTSYDY 609
Query: 307 DAPIDEYGHLNQPKWGHLRELHK 329
DA + E G K+ LR L +
Sbjct: 610 DAVLTEAGDYTA-KYFMLRSLFE 631
>gi|300861196|ref|ZP_07107283.1| putative beta-galactosidase [Enterococcus faecalis TUSoD Ef11]
gi|428767294|ref|YP_007153405.1| beta-galactosidase [Enterococcus faecalis str. Symbioflor 1]
gi|300850235|gb|EFK77985.1| putative beta-galactosidase [Enterococcus faecalis TUSoD Ef11]
gi|427185467|emb|CCO72691.1| beta-galactosidase [Enterococcus faecalis str. Symbioflor 1]
Length = 594
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 12 LNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 72 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 182
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 183 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 239
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 240 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 297
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 298 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|67078211|ref|YP_245831.1| beta-galactosidase [Bacillus cereus E33L]
gi|66970517|gb|AAY60493.1| beta-galactosidase [Bacillus cereus E33L]
Length = 598
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 108/341 (31%), Positives = 161/341 (47%), Gaps = 48/341 (14%)
Query: 33 GRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYD 92
G+ +DGE ++SG++HY R P W + K G + +ETYV WN HEP ++
Sbjct: 7 GKDFMLDGEPIKIISGALHYFRIVPEYWDHSLYNLKALGCNTVETYVPWNMHEPKEGIFN 66
Query: 93 FTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMN 152
F G DL+++++ Q GL VILR PY+CAEW +GG P WL I +R+ +F+N
Sbjct: 67 FEGIADLVKYVQLAQKYGLMVILRPTPYICAEWEFGGLPAWLLKYKDI-RVRSNTNLFLN 125
Query: 153 EMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSL 212
+++NF +++ M L GGPII+ Q+ENEYG+ +D K Y+ K+ L
Sbjct: 126 KVENFYKVLLPMVT--PLQVENGGPIIMMQVENEYGSFGND-----KEYVRNIKKLMRDL 178
Query: 213 DIGVP-------WIMCQES----------------------DAPSPMFTPNNPNSPKIWT 243
+ VP W ES + N P +
Sbjct: 179 GVTVPLFTSDGAWQEALESGSLIDDDVLVTGNFGSRSNENLNELESFIKENKKEWPLMCM 238
Query: 244 ENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG------ 297
E W GWF WG + +R +LA V + N+YM+ GGTNFG +G
Sbjct: 239 EFWDGWFNRWGMEIIRRDGSELAEEVKELLKRASI--NFYMFQGGTNFGFMNGCSSRENV 296
Query: 298 --PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
P + TSYDYDA + E+G + R + ++ +E+
Sbjct: 297 DLPQI-TSYDYDALLTEWGEPTSKYYAVQRAIKEVCSDVEQ 336
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 56/244 (22%), Positives = 94/244 (38%), Gaps = 57/244 (23%)
Query: 453 ILSGSSNMT-LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLS 511
L G N+ ++ + +H ++N VD+Q+ E + LT+ +N + +L
Sbjct: 392 FLKGPKNIEKCKVVDARDRVHLFLNEQLVDTQYRDEIGR----EVSLDLTKEENTLDILV 447
Query: 512 ATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNA 571
+G NYG++ + P G + I L S+ Y + LD+ F
Sbjct: 448 ENMGRVNYGAR-------LLSPTQRKGISSGVMIDIHLQSNWEHYALEFDNLDEIDF--- 497
Query: 572 KAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWP 631
G N P ++Y+ TF ND L+ +GKGF +NG+NLG+YW
Sbjct: 498 ------NGQWEPNTP-----SFYEYTFNVQELND-TFLDCSKLGKGFVVLNGFNLGKYWD 545
Query: 632 TYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGN 691
GP G + ++P + G N L++FE G
Sbjct: 546 V----------------GPTG--------------YLYIPAPLLIKGENNLIVFETEGNY 575
Query: 692 PSQI 695
++
Sbjct: 576 EEEL 579
>gi|417985674|ref|ZP_12626256.1| beta-galactosidase 3 [Lactobacillus casei 32G]
gi|410527574|gb|EKQ02437.1| beta-galactosidase 3 [Lactobacillus casei 32G]
Length = 598
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 172/651 (26%), Positives = 266/651 (40%), Gaps = 154/651 (23%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + H+ +DG+ +LSG+IHY R P W + K G + +ETYV WN HE
Sbjct: 4 FSIDHE---FMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEY 60
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+DF+G LD+ RF+ T +D GLY I+R PY+CAEW +GGFP WL + LRT
Sbjct: 61 NEGDFDFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL--LTKKMRLRTD 118
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
+ ++ + + T ++ ++ + GG +I+ Q+ENEYG+ D K Y+ A
Sbjct: 119 DPAYLQAIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSYGED-----KDYLAAVA 171
Query: 207 KMATSLDIGVPWIMCQESDAPSP------------MFTPNNPNS---------------- 238
++ + VP SD P P + T N S
Sbjct: 172 ELMKKHGVDVPLFT---SDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAH 228
Query: 239 ----PKIWTENWTGWFKSWGG----KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTN 290
P + E W GWF WG +DP+ TAEDL + R G+ N YM+HGGTN
Sbjct: 229 GHDWPLMCMEFWDGWFNRWGEPIIRRDPEETAEDLRAVIQR-----GSV-NLYMFHGGTN 282
Query: 291 FGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN 350
FG +G + D+D P VT+ DY
Sbjct: 283 FGFMNG---TSARKDHDLP--------------------------------QVTSYDYDA 307
Query: 351 SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMI 410
++ P + Q + P+QA PL +
Sbjct: 308 PLNEQGNPTPKY-----------------FAIQKMIHEVLPSQA--QTTPLVKPAMRQAD 348
Query: 411 NDFVVRGKGHFALNTLIDQ------KSTNDVSDYLWYMTNADLKDDDPILSGSSNMT--- 461
N + +L +++DQ S ++L T L +P++SG+ T
Sbjct: 349 NPLTAK----VSLFSVLDQLAQPVAASYPQTQEFLGQYTGYTLYRTNPLISGTDKGTPAK 404
Query: 462 LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGS 521
LR+ + + A+ +G + +Q+ + +D+ V+ G++Q+ LL + NYGS
Sbjct: 405 LRVIDARDRVQAFFDGKSLATQYQE-AIGDDILLPEVE---GRHQLDLLVENMSRVNYGS 460
Query: 522 KFDMVPN--GIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERG 579
K + + GI V++ D IKD Y LD K A +
Sbjct: 461 KIEAITQFKGIRTGVMV-----DLHFIKDYLQ---------YPLDLNK---APQLDFTGD 503
Query: 580 WSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
W + +Y+ F+ D L+ +G GKG VNG N+GR+W
Sbjct: 504 WQAGTP------AFYQYGFDVVKPQD-TYLDCRGFGKGVMLVNGVNIGRFW 547
>gi|301065438|ref|YP_003787461.1| glycosyl hydrolase, family 35 [Lactobacillus casei str. Zhang]
gi|300437845|gb|ADK17611.1| glycosyl hydrolase, family 35 [Lactobacillus casei str. Zhang]
Length = 598
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 116/354 (32%), Positives = 170/354 (48%), Gaps = 64/354 (18%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + H+ +DG+ +LSG+IHY R P W + K G + +ETYV WN HE
Sbjct: 4 FSIDHE---FMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEY 60
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+DF+G LD+ RF+ T +D GLY I+R PY+CAEW +GGFP WL + LRT
Sbjct: 61 NEGDFDFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL--LTKKMRLRTD 118
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
+ ++ + + T ++ ++ + GG +I+ Q+ENEYG+ D K Y+ A
Sbjct: 119 DSAYLQAIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSYGED-----KDYLAAVA 171
Query: 207 KMATSLDIGVPWIMCQESDAPSP------------MFTPNNPNS---------------- 238
++ + VP SD P P + T N S
Sbjct: 172 ELMKKHGVDVPLFT---SDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAH 228
Query: 239 ----PKIWTENWTGWFKSWGG----KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTN 290
P + E W GWF WG +DP+ TAEDL + R G+ N YM+HGGTN
Sbjct: 229 GHDWPLMCMEFWDGWFNRWGEPIIRRDPEETAEDLRAVIQR-----GSV-NLYMFHGGTN 282
Query: 291 FGRTSGGPYL-------TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKT 337
FG +G TSYDYDAP++E G+ + + +H++L S +T
Sbjct: 283 FGFMNGTSARKDHDLPQVTSYDYDAPLNEQGNPTPKYFAIQKMIHEVLPSQAQT 336
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 86/200 (43%), Gaps = 33/200 (16%)
Query: 436 SDYLWYMTNADLKDDDPILSGSSNMT---LRINSSGQVLHAYVNGNYVDSQWTKYGASND 492
++L T L +P++SG+ T LR+ + + A+ +G + +Q+ + +D
Sbjct: 376 QEFLGQYTGYTLYRTNPLISGTDKGTPAKLRVIDARDRVQAFFDGKSLATQYQE-AIGDD 434
Query: 493 LFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPN--GIPGPVLLVGRAGDETIIKDLS 550
+ V+ G++Q+ LL + NYGSK + + GI V++ D IKD
Sbjct: 435 ILLPEVE---GRHQLDLLVENMSRVNYGSKIEAITQFKGIRTGVMV-----DLHFIKDYL 486
Query: 551 SHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLN 610
Y LD K A + W + +Y+ F+ D L+
Sbjct: 487 Q---------YPLDLNK---APQLDFTGDWQAGTP------AFYQYGFDVVKPQD-TYLD 527
Query: 611 LQGMGKGFAWVNGYNLGRYW 630
+G GKG VNG N+GR+W
Sbjct: 528 CRGFGKGVMLVNGVNIGRFW 547
>gi|422864548|ref|ZP_16911173.1| beta-galactosidase [Streptococcus sanguinis SK1058]
gi|327490742|gb|EGF22523.1| beta-galactosidase [Streptococcus sanguinis SK1058]
Length = 592
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 111/336 (33%), Positives = 162/336 (48%), Gaps = 48/336 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +LSG+I Y R P W D + K G + +ETY+ W HEP Q+ G L
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D + K +++ GLY+I+R PY+CAE+++GG P WL P + LR + +F+ ++ +F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSM-RLRVNHPLFLEKVSHF 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ K + QGGPI++ Q+ENEYG+ D K+Y+ A+M + VP
Sbjct: 131 YDWL--FPKLLPYQSDQGGPILMMQVENEYGSYAED-----KAYMRSIAQMMKVRGVTVP 183
Query: 218 -------WIMCQESDA--PSPMFTPNNPNS--------------------PKIWTENWTG 248
WI ES +F N S P + TE W G
Sbjct: 184 LFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF W + +R AEDLA V Q G N ++ GGTNFG SG P +
Sbjct: 244 WFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
TSYD+DAPI E+G + + R H++ +E+
Sbjct: 302 -TSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELEQ 336
>gi|317504905|ref|ZP_07962857.1| family 35 glycosyl hydrolase [Prevotella salivae DSM 15606]
gi|315663982|gb|EFV03697.1| family 35 glycosyl hydrolase [Prevotella salivae DSM 15606]
Length = 784
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 119/355 (33%), Positives = 168/355 (47%), Gaps = 41/355 (11%)
Query: 12 LLCLILQTLFNLSLAYRVSHDG------RAITIDGERKILLSGSIHYPRSTPGMWPDLIK 65
L +I LF+LS ++ G ++G+ ++ + +HYPR W IK
Sbjct: 7 LKTIITTLLFSLSTLTALARGGDFTAGKNTFLLNGQPFVVKAAELHYPRIPRPYWDQRIK 66
Query: 66 KAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEW 125
K G++ I YVFWN HE +YDFTGN D+ F + Q G+YVI+R GPYVCAEW
Sbjct: 67 MCKALGMNTICLYVFWNIHEQQESKYDFTGNNDVAAFCRLAQKNGMYVIVRPGPYVCAEW 126
Query: 126 NYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIEN 185
GG P WL I LR + F+ ++ F + + L GGPII+ Q+EN
Sbjct: 127 EMGGLPWWLLKKKDIR-LREDDPYFLARVKAFEAEV--GRQLAPLTIQNGGPIIMVQVEN 183
Query: 186 EYGNV------MSDYGDAGKS---------YINWCAKMATSLDIGVPWIM----CQESDA 226
EYG+ +S D K+ +W + + + W M DA
Sbjct: 184 EYGSYGVNKQYVSQIRDIVKASGFDKVTLFQCDWASNFEKNGLDDLLWTMNFGTGSNIDA 243
Query: 227 PSPMFTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYH 286
P +P + +E W+GWF WG + R A+ + + +F + YM H
Sbjct: 244 QFKRLKQLRPETPLMCSEFWSGWFDKWGARHETRPAKAMVEGINEMLSKNISF-SLYMTH 302
Query: 287 GGTNFGRTSGG--PYL---TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
GGT+FG +G P TSYDYDAPI+EYGH PK+ LR K+M+K
Sbjct: 303 GGTSFGHWAGANSPGFAPDVTSYDYDAPINEYGHAT-PKFWELR------KTMQK 350
>gi|422880263|ref|ZP_16926727.1| beta-galactosidase [Streptococcus sanguinis SK1059]
gi|422930132|ref|ZP_16963071.1| beta-galactosidase [Streptococcus sanguinis ATCC 29667]
gi|422930724|ref|ZP_16963655.1| beta-galactosidase [Streptococcus sanguinis SK340]
gi|332364839|gb|EGJ42608.1| beta-galactosidase [Streptococcus sanguinis SK1059]
gi|339614112|gb|EGQ18823.1| beta-galactosidase [Streptococcus sanguinis ATCC 29667]
gi|339620700|gb|EGQ25268.1| beta-galactosidase [Streptococcus sanguinis SK340]
Length = 592
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 111/336 (33%), Positives = 162/336 (48%), Gaps = 48/336 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +LSG+I Y R P W D + K G + +ETY+ W HEP Q+ G L
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D + K +++ GLY+I+R PY+CAE+++GG P WL P + LR + +F+ ++ +F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSM-RLRVNHPLFLEKVSHF 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ K + QGGPI++ Q+ENEYG+ D K+Y+ A+M + VP
Sbjct: 131 YDWL--FPKLLPYQSDQGGPILMMQVENEYGSYAED-----KAYMRSIAQMMKVRGVTVP 183
Query: 218 -------WIMCQESDA--PSPMFTPNNPNS--------------------PKIWTENWTG 248
WI ES +F N S P + TE W G
Sbjct: 184 LFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF W + +R AEDLA V Q G N ++ GGTNFG SG P +
Sbjct: 244 WFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
TSYD+DAPI E+G + + R H++ +E+
Sbjct: 302 -TSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELEQ 336
>gi|256959208|ref|ZP_05563379.1| beta-galactosidase [Enterococcus faecalis DS5]
gi|256949704|gb|EEU66336.1| beta-galactosidase [Enterococcus faecalis DS5]
Length = 594
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 12 LNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 72 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 182
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 183 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 239
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 240 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 297
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 298 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|418004004|ref|ZP_12644053.1| beta-galactosidase 3 [Lactobacillus casei UW1]
gi|410551057|gb|EKQ25134.1| beta-galactosidase 3 [Lactobacillus casei UW1]
Length = 598
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 116/354 (32%), Positives = 170/354 (48%), Gaps = 64/354 (18%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + H+ +DG+ +LSG+IHY R P W + K G + +ETYV WN HE
Sbjct: 4 FSIDHE---FMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEY 60
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+DF+G LD+ RF+ T +D GLY I+R PY+CAEW +GGFP WL + LRT
Sbjct: 61 NEGDFDFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL--LTKKMRLRTD 118
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
+ ++ + + T ++ ++ + GG +I+ Q+ENEYG+ D K Y+ A
Sbjct: 119 DSAYLQAIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSYGED-----KDYLAAVA 171
Query: 207 KMATSLDIGVPWIMCQESDAPSP------------MFTPNNPNS---------------- 238
++ + VP SD P P + T N S
Sbjct: 172 ELMKKHGVDVPLFT---SDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAH 228
Query: 239 ----PKIWTENWTGWFKSWGG----KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTN 290
P + E W GWF WG +DP+ TAEDL + R G+ N YM+HGGTN
Sbjct: 229 GHDWPLMCMEFWDGWFNRWGEPIIRRDPEETAEDLRAVIQR-----GSV-NLYMFHGGTN 282
Query: 291 FGRTSGGPYL-------TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKT 337
FG +G TSYDYDAP++E G+ + + +H++L S +T
Sbjct: 283 FGFMNGTSARKDHDLPQVTSYDYDAPLNEQGNPTPKYFAIQKMIHEVLPSQAQT 336
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 86/200 (43%), Gaps = 33/200 (16%)
Query: 436 SDYLWYMTNADLKDDDPILSGSSNMT---LRINSSGQVLHAYVNGNYVDSQWTKYGASND 492
++L T L +P++SG+ T LR+ + + A+ +G + +Q+ + +D
Sbjct: 376 QEFLGQYTGYTLYRTNPLISGTDKGTPAKLRVIDARDRVQAFFDGKSLATQYQE-AIGDD 434
Query: 493 LFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPN--GIPGPVLLVGRAGDETIIKDLS 550
+ V+ G++Q+ LL + NYGSK + + GI V++ D IKD
Sbjct: 435 ILLPEVE---GRHQLDLLVENMSRVNYGSKIEAITQFKGIRTGVMV-----DLHFIKDYL 486
Query: 551 SHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLN 610
Y LD K A + W + +Y+ F+ D L+
Sbjct: 487 Q---------YPLDLNK---APQLDFTGDWQAGTP------AFYQYGFDVVKPQD-TYLD 527
Query: 611 LQGMGKGFAWVNGYNLGRYW 630
+G GKG VNG N+GR+W
Sbjct: 528 CRGFGKGVMLVNGVNIGRFW 547
>gi|325567414|ref|ZP_08144081.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
gi|325158847|gb|EGC70993.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
Length = 591
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 114/324 (35%), Positives = 155/324 (47%), Gaps = 54/324 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ L+SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 12 LDGKPIKLISGAIHYFRMTPAQWTDSLYNLKALGANTVETYIPWNLHEPREGVYDFEGMK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+ F+K Q GL VILR Y+CAEW +GG P WL N P LR+T+ FM +++N+
Sbjct: 72 DICAFVKQAQTLGLMVILRPSVYICAEWEFGGLPAWLLNEP--MRLRSTDPRFMAKVRNY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L + GGP+I+ Q+ENEYG+ YG K+Y+ ++ I VP
Sbjct: 130 FQVL--LPKLVPLQITHGGPVIMMQVENEYGS----YG-MEKAYLRQTKELMEEYGIDVP 182
Query: 218 WIM----------------------------CQESDAPSPMFTPNN-PNSPKIWTENWTG 248
+E+ A F + N P + E W G
Sbjct: 183 LFTSDGAWEEVLDAGTLIEDDIFVTGNFGSRSKENAAVMKEFMAKHGKNWPIMCMEYWDG 242
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-------T 301
WF WG KR +DLA V G N YM+HGGTNFG +G
Sbjct: 243 WFNRWGEPIIKRDGQDLANEVKEMLAVGSL--NLYMFHGGTNFGFYNGCSARGALDLPQV 300
Query: 302 TSYDYDA-------PIDEYGHLNQ 318
+SYDYDA P D+Y H+ +
Sbjct: 301 SSYDYDALLTEAGEPTDKYYHVQK 324
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 58/237 (24%), Positives = 92/237 (38%), Gaps = 54/237 (22%)
Query: 462 LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFE-RPVKLTRGKNQISLLSATVGLQNYG 520
L++ + LH + +G Q+ + L + P K T ++ +L +G NYG
Sbjct: 401 LKVVEASDRLHIFTDGQLQAIQYQETLGEELLIQGTPDKETI---ELDVLVENLGRVNYG 457
Query: 521 SKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGW 580
K + GP G G I++D+ H+ L +A+ +
Sbjct: 458 FKLN-------GPTQAKGIRGG--IMQDIHFHQGYRHYPL-------MLSAEQLQAIDYQ 501
Query: 581 SSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGC 640
+ KN ++Y+TTF D ++ +G GKG VNG NLGRYW
Sbjct: 502 AGKN---PTHPSFYQTTFRLTEVGD-TFIDCRGYGKGVVIVNGINLGRYWQ--------- 548
Query: 641 STESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
RGP S C P+ ++K G N +V+FE G ++ F
Sbjct: 549 -------RGPVHSLYC--------------PKEFLKKGSNEVVVFETDGVEIKELVF 584
>gi|299148656|ref|ZP_07041718.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
gi|298513417|gb|EFI37304.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
Length = 778
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 117/355 (32%), Positives = 175/355 (49%), Gaps = 48/355 (13%)
Query: 6 HCSRAILLCLILQTLFNL--SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDL 63
H + ++L +I+ L + S +V TI+G+ L+ G +HYPR W D
Sbjct: 7 HKTVLVILNIIVSFLISSCSSPKEQVRIGNGTFTIEGKDIQLICGEMHYPRIPHEYWRDR 66
Query: 64 IKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCA 123
+K+A+ GL+ + YVFWN HE ++DF+G D+ FI+T Q++GLYVILR GPYVCA
Sbjct: 67 LKRARAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGPYVCA 126
Query: 124 EWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKK-EKLFASQGGPIILAQ 182
EW++GG+P WL + R+ + F++ + + I ++ K+ L + GG II+ Q
Sbjct: 127 EWDFGGYPSWLLKEKDM-TYRSKDPRFLSYCERY---IKELGKQLSPLTINNGGNIIMVQ 182
Query: 183 IENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQ----------ESDAPS---- 228
+ENEYG+ +D K Y+ M VP C E P+
Sbjct: 183 VENEYGSYAAD-----KGYLAAIRDMIKEAGFNVPLFTCDGGGQVEAGHTEGALPTLNGV 237
Query: 229 ------PMFTPNNPNSPKIWTENWTGWFKSWGGKDP----KRTAEDLAFAVARFFQFGGT 278
+ P E + WF WG + +R AE L + ++ G
Sbjct: 238 FGEDIFKVIDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLSH-----GV 292
Query: 279 FQNYYMYHGGTNF----GRTSGGPY--LTTSYDYDAPIDEYGHLNQPKWGHLREL 327
+ YM+HGGTNF G +GG Y TSYDYDAP+ E+G+ PK+ RE+
Sbjct: 293 SVSMYMFHGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWGNC-YPKYHAFREV 346
Score = 44.7 bits (104), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 50/119 (42%), Gaps = 36/119 (30%)
Query: 568 FYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLG 627
Y K + E G + K VP ++K TF + D V ++ GKG WVNG +LG
Sbjct: 505 LYKEKVSEMEFGETIKGVP-----AFHKGTFTVEKKGDCFV-DMSQWGKGAVWVNGKSLG 558
Query: 628 RYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
R+W N G P Q Y +P W+K+G N +V+FE
Sbjct: 559 RFW----------------------------NIG-PQQTLY-LPAPWLKEGENEIVVFE 587
>gi|255972505|ref|ZP_05423091.1| beta-galactosidase [Enterococcus faecalis T1]
gi|257422333|ref|ZP_05599323.1| glycosyl hydrolase [Enterococcus faecalis X98]
gi|255963523|gb|EET95999.1| beta-galactosidase [Enterococcus faecalis T1]
gi|257164157|gb|EEU94117.1| glycosyl hydrolase [Enterococcus faecalis X98]
Length = 594
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 12 LNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 72 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 182
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 183 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 239
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 240 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 297
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 298 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|375360076|ref|YP_005112848.1| putative exported beta-galactosidase [Bacteroides fragilis 638R]
gi|383119863|ref|ZP_09940600.1| hypothetical protein BSHG_4164 [Bacteroides sp. 3_2_5]
gi|251944025|gb|EES84544.1| hypothetical protein BSHG_4164 [Bacteroides sp. 3_2_5]
gi|301164757|emb|CBW24316.1| putative exported beta-galactosidase [Bacteroides fragilis 638R]
Length = 628
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 117/327 (35%), Positives = 168/327 (51%), Gaps = 43/327 (13%)
Query: 39 DGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLD 98
+G+ +LSG +HY R W ++ K GL+ + TYVFWN HEP ++DFTG+ +
Sbjct: 37 NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 99 LIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFT 158
L FIKT ++G+ VILR GPYVCAEW +GG+P WL N+ G+ E+R N F+ +T
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGM-EIRRDNPEFL----KYT 151
Query: 159 TLIVDMAKKE--KLFASQGGPIILAQIENEYGNVMSDYGDAG-KSYINWCAKMATSL-DI 214
+D KE L ++GGPI++ Q ENE+G+ ++ D + + + AK+ L D
Sbjct: 152 KAYIDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADA 211
Query: 215 G--VPWI------MCQESDAPSPMFTPNNPNS----------------PKIWTENWTGWF 250
G VP + + P + T N + P + E + GW
Sbjct: 212 GFNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQYHDGKGPYMVAEFYPGWL 271
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY--------LTT 302
W P+ A +A ++ Q +F N+YM HGGTNFG TSG Y T
Sbjct: 272 SHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDMT 330
Query: 303 SYDYDAPIDEYGHLNQPKWGHLRELHK 329
SYDYDAPI E G + PK+ +R + K
Sbjct: 331 SYDYDAPISEAGWVT-PKYDSIRNVIK 356
Score = 46.2 bits (108), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 47/194 (24%), Positives = 74/194 (38%), Gaps = 43/194 (22%)
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDK 566
+ +L +G NYGS+ GI PV + G+ I+ ++ +D+
Sbjct: 468 LQILVENMGRINYGSEIVHNTKGIISPVQIAGKE----IVGGWDMYQLP-------MDEM 516
Query: 567 KFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNL 626
A++ + S+ L Y+ TF D ++++ GKG +VNG N+
Sbjct: 517 PDLTKLKADTHKNVPSEVAKLKGCPVLYEGTFTLDKVGD-TFMDMESWGKGIVFVNGVNI 575
Query: 627 GRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
GRYW P Q Y VP W+K G N +V+FE
Sbjct: 576 GRYWKV-----------------------------GPQQTLY-VPGVWLKKGENKIVIFE 605
Query: 687 EFGGNPSQINFQTV 700
+ P Q +TV
Sbjct: 606 QLNETP-QTEVKTV 618
>gi|422845798|ref|ZP_16892481.1| beta-galactosidase [Streptococcus sanguinis SK72]
gi|325688586|gb|EGD30603.1| beta-galactosidase [Streptococcus sanguinis SK72]
Length = 592
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 111/336 (33%), Positives = 162/336 (48%), Gaps = 48/336 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +LSG+I Y R P W D + K G + +ETY+ W HEP Q+ G L
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D + K +++ GLY+I+R PY+CAE+++GG P WL P + LR + +F+ ++ +F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSM-RLRVNHPLFLEKVSHF 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ K + QGGPI++ Q+ENEYG+ D K+Y+ A+M + VP
Sbjct: 131 YDWL--FPKLLPYQSDQGGPILMMQVENEYGSYAED-----KAYMRSIAQMMKVRGVTVP 183
Query: 218 -------WIMCQESDA--PSPMFTPNNPNS--------------------PKIWTENWTG 248
WI ES +F N S P + TE W G
Sbjct: 184 LFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF W + +R AEDLA V Q G N ++ GGTNFG SG P +
Sbjct: 244 WFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
TSYD+DAPI E+G + + R H++ +E+
Sbjct: 302 -TSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELEQ 336
>gi|255975619|ref|ZP_05426205.1| beta-galactosidase [Enterococcus faecalis T2]
gi|256619294|ref|ZP_05476140.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|256853354|ref|ZP_05558724.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
gi|421514060|ref|ZP_15960775.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
gi|255968491|gb|EET99113.1| beta-galactosidase [Enterococcus faecalis T2]
gi|256598821|gb|EEU17997.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|256711813|gb|EEU26851.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
gi|401672857|gb|EJS79300.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
Length = 594
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 12 LNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 72 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 182
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 183 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 239
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 240 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 297
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 298 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|422849537|ref|ZP_16896213.1| beta-galactosidase [Streptococcus sanguinis SK115]
gi|325689511|gb|EGD31516.1| beta-galactosidase [Streptococcus sanguinis SK115]
Length = 592
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 111/336 (33%), Positives = 162/336 (48%), Gaps = 48/336 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +LSG+I Y R P W D + K G + +ETY+ W HEP Q+ G L
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D + K +++ GLY+I+R PY+CAE+++GG P WL P + LR + +F+ ++ +F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSM-RLRVNHPLFLEKVSHF 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ K + QGGPI++ Q+ENEYG+ D K+Y+ A+M + VP
Sbjct: 131 YDWL--FPKLLPYQSDQGGPILMMQVENEYGSYAED-----KAYMRSIAQMMKVRGVTVP 183
Query: 218 -------WIMCQESDA--PSPMFTPNNPNS--------------------PKIWTENWTG 248
WI ES +F N S P + TE W G
Sbjct: 184 LFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF W + +R AEDLA V Q G N ++ GGTNFG SG P +
Sbjct: 244 WFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
TSYD+DAPI E+G + + R H++ +E+
Sbjct: 302 -TSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELEQ 336
>gi|422708708|ref|ZP_16766236.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
gi|315036693|gb|EFT48625.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
Length = 604
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 22 LNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 81
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 82 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 139
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 192
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 193 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 249
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 250 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 307
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 308 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|336412039|ref|ZP_08592497.1| hypothetical protein HMPREF1018_04515 [Bacteroides sp. 2_1_56FAA]
gi|423261296|ref|ZP_17242197.1| hypothetical protein HMPREF1055_04474 [Bacteroides fragilis
CL07T00C01]
gi|423267821|ref|ZP_17246801.1| hypothetical protein HMPREF1056_04488 [Bacteroides fragilis
CL07T12C05]
gi|423272270|ref|ZP_17251238.1| hypothetical protein HMPREF1079_04320 [Bacteroides fragilis
CL05T00C42]
gi|423276726|ref|ZP_17255658.1| hypothetical protein HMPREF1080_04311 [Bacteroides fragilis
CL05T12C13]
gi|423283105|ref|ZP_17261990.1| hypothetical protein HMPREF1204_01528 [Bacteroides fragilis HMW
615]
gi|335939211|gb|EGN01088.1| hypothetical protein HMPREF1018_04515 [Bacteroides sp. 2_1_56FAA]
gi|387774329|gb|EIK36442.1| hypothetical protein HMPREF1055_04474 [Bacteroides fragilis
CL07T00C01]
gi|392695462|gb|EIY88674.1| hypothetical protein HMPREF1079_04320 [Bacteroides fragilis
CL05T00C42]
gi|392695591|gb|EIY88799.1| hypothetical protein HMPREF1056_04488 [Bacteroides fragilis
CL07T12C05]
gi|392696055|gb|EIY89256.1| hypothetical protein HMPREF1080_04311 [Bacteroides fragilis
CL05T12C13]
gi|404581379|gb|EKA86078.1| hypothetical protein HMPREF1204_01528 [Bacteroides fragilis HMW
615]
Length = 628
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 117/327 (35%), Positives = 168/327 (51%), Gaps = 43/327 (13%)
Query: 39 DGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLD 98
+G+ +LSG +HY R W ++ K GL+ + TYVFWN HEP ++DFTG+ +
Sbjct: 37 NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 99 LIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFT 158
L FIKT ++G+ VILR GPYVCAEW +GG+P WL N+ G+ E+R N F+ +T
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGM-EIRRDNPEFL----KYT 151
Query: 159 TLIVDMAKKE--KLFASQGGPIILAQIENEYGNVMSDYGDAG-KSYINWCAKMATSL-DI 214
+D KE L ++GGPI++ Q ENE+G+ ++ D + + + AK+ L D
Sbjct: 152 KAYIDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADA 211
Query: 215 G--VPWI------MCQESDAPSPMFTPNNPNS----------------PKIWTENWTGWF 250
G VP + + P + T N + P + E + GW
Sbjct: 212 GFNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQYHDGKGPYMVAEFYPGWL 271
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY--------LTT 302
W P+ A +A ++ Q +F N+YM HGGTNFG TSG Y T
Sbjct: 272 SHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDMT 330
Query: 303 SYDYDAPIDEYGHLNQPKWGHLRELHK 329
SYDYDAPI E G + PK+ +R + K
Sbjct: 331 SYDYDAPISEAGWVT-PKYDSIRNVIK 356
Score = 45.8 bits (107), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 46/194 (23%), Positives = 74/194 (38%), Gaps = 43/194 (22%)
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDK 566
+ +L +G NYGS+ GI PV + G+ I+ ++ +D+
Sbjct: 468 LQILVENMGRINYGSEIVHNTKGIISPVQIAGKE----IVGGWDMYQLP-------MDEM 516
Query: 567 KFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNL 626
A++ + S+ L Y+ TF D ++++ GKG +VNG N+
Sbjct: 517 PDLTKLKADTHKNVPSEVAKLKGCPVLYEGTFTLDKVGD-TFMDMESWGKGIVFVNGVNI 575
Query: 627 GRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
GRYW P Q Y +P W+K G N +V+FE
Sbjct: 576 GRYWKV-----------------------------GPQQTLY-IPGVWLKKGENKIVIFE 605
Query: 687 EFGGNPSQINFQTV 700
+ P Q +TV
Sbjct: 606 QLNETP-QTEVKTV 618
>gi|265767790|ref|ZP_06095322.1| beta-galactosidase [Bacteroides sp. 2_1_16]
gi|263252462|gb|EEZ23990.1| beta-galactosidase [Bacteroides sp. 2_1_16]
Length = 628
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 117/327 (35%), Positives = 168/327 (51%), Gaps = 43/327 (13%)
Query: 39 DGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLD 98
+G+ +LSG +HY R W ++ K GL+ + TYVFWN HEP ++DFTG+ +
Sbjct: 37 NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 99 LIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFT 158
L FIKT ++G+ VILR GPYVCAEW +GG+P WL N+ G+ E+R N F+ +T
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGM-EIRRDNPEFL----KYT 151
Query: 159 TLIVDMAKKE--KLFASQGGPIILAQIENEYGNVMSDYGDAG-KSYINWCAKMATSL-DI 214
+D KE L ++GGPI++ Q ENE+G+ ++ D + + + AK+ L D
Sbjct: 152 KAYIDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADA 211
Query: 215 G--VPWI------MCQESDAPSPMFTPNNPNS----------------PKIWTENWTGWF 250
G VP + + P + T N + P + E + GW
Sbjct: 212 GFNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQYHDGKGPYMVAEFYPGWL 271
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY--------LTT 302
W P+ A +A ++ Q +F N+YM HGGTNFG TSG Y T
Sbjct: 272 SHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDMT 330
Query: 303 SYDYDAPIDEYGHLNQPKWGHLRELHK 329
SYDYDAPI E G + PK+ +R + K
Sbjct: 331 SYDYDAPISEAGWVT-PKYDSIRNVIK 356
Score = 46.2 bits (108), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 47/194 (24%), Positives = 74/194 (38%), Gaps = 43/194 (22%)
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDK 566
+ +L +G NYGS+ GI PV + G+ I+ ++ +D+
Sbjct: 468 LQILVENMGRINYGSEIVHNTKGIISPVQIAGKE----IVGGWDMYQLP-------MDEM 516
Query: 567 KFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNL 626
A++ + S+ L Y+ TF D ++++ GKG +VNG N+
Sbjct: 517 PDLTKLKADTHKNVPSEVAKLKGCPVLYEGTFTLDKVGD-TFMDMESWGKGIVFVNGVNI 575
Query: 627 GRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
GRYW P Q Y VP W+K G N +V+FE
Sbjct: 576 GRYWKV-----------------------------GPQQTLY-VPGVWLKKGENKIVIFE 605
Query: 687 EFGGNPSQINFQTV 700
+ P Q +TV
Sbjct: 606 QLNETP-QTEVKTV 618
>gi|227552575|ref|ZP_03982624.1| possible beta-galactosidase [Enterococcus faecium TX1330]
gi|257896912|ref|ZP_05676565.1| glycosyl hydrolase [Enterococcus faecium Com12]
gi|293379016|ref|ZP_06625170.1| glycosyl hydrolase family 35 [Enterococcus faecium PC4.1]
gi|431750982|ref|ZP_19539676.1| beta-galactosidase [Enterococcus faecium E2620]
gi|227178324|gb|EEI59296.1| possible beta-galactosidase [Enterococcus faecium TX1330]
gi|257833477|gb|EEV59898.1| glycosyl hydrolase [Enterococcus faecium Com12]
gi|292642358|gb|EFF60514.1| glycosyl hydrolase family 35 [Enterococcus faecium PC4.1]
gi|430616240|gb|ELB53164.1| beta-galactosidase [Enterococcus faecium E2620]
Length = 595
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 118/357 (33%), Positives = 174/357 (48%), Gaps = 52/357 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG ++SG+IHY R P W + K G + +ETY+ WN HEP +DF+G
Sbjct: 12 VDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGFK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
+++RF+K Q+ L VILR Y+CAEW +GG P WL P I +R+T+ FM +++N+
Sbjct: 72 NVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNI-RVRSTDPRFMEKLKNY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG KSY+ ++ + I VP
Sbjct: 131 YQVL--LPKLAPLQITQGGPVIMMQLENEYGS----YG-MEKSYLRQTKELMLAHSIDVP 183
Query: 218 -------WIMC---------------------QESDAPSPMFTPNN-PNSPKIWTENWTG 248
W+ +E+ F N+ N P + E W G
Sbjct: 184 LFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF WG R E+LA V + G N YM+HGGTNFG +G P +
Sbjct: 244 WFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSM----EKTLTYGNVTNTDYGNSVS 353
TSYDYDA ++E G + + R + ++ S+ +T T N+ SVS
Sbjct: 302 -TSYDYDALLNEAGQPTEKYYAVQRVIKEVCPSVWQAEPRTKTLKNLGTYPVNRSVS 357
>gi|229549776|ref|ZP_04438501.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|312950913|ref|ZP_07769823.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|422692785|ref|ZP_16750800.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|422706430|ref|ZP_16764128.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
gi|422727290|ref|ZP_16783733.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0312]
gi|229305045|gb|EEN71041.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|310631062|gb|EFQ14345.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|315152244|gb|EFT96260.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|315156045|gb|EFU00062.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
gi|315157806|gb|EFU01823.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0312]
Length = 604
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 22 LNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 81
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 82 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 139
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 192
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 193 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 249
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 250 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 307
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 308 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|307275736|ref|ZP_07556876.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
gi|307277830|ref|ZP_07558914.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|307291757|ref|ZP_07571629.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
gi|422685752|ref|ZP_16743965.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
gi|422720681|ref|ZP_16777290.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|422739238|ref|ZP_16794421.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
gi|306497209|gb|EFM66754.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
gi|306505227|gb|EFM74413.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|306507612|gb|EFM76742.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
gi|315029464|gb|EFT41396.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
gi|315032072|gb|EFT44004.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|315144900|gb|EFT88916.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
Length = 604
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 22 LNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 81
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 82 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 139
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 192
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 193 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 249
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 250 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 307
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 308 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|297727459|ref|NP_001176093.1| Os10g0340600 [Oryza sativa Japonica Group]
gi|255679317|dbj|BAH94821.1| Os10g0340600 [Oryza sativa Japonica Group]
Length = 143
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 74/104 (71%), Positives = 91/104 (87%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
VS+DGR++ +DGER+I++SGSIHYPRSTP MWPDLIKKAKEGGL+AIETYVFWN HEP R
Sbjct: 31 VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPV 132
R+++F GN D++RF K IQ+ G+Y ILRIGPY+C EWNYG P+
Sbjct: 91 REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGYMPM 134
>gi|156408171|ref|XP_001641730.1| predicted protein [Nematostella vectensis]
gi|156228870|gb|EDO49667.1| predicted protein [Nematostella vectensis]
Length = 647
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 109/338 (32%), Positives = 166/338 (49%), Gaps = 41/338 (12%)
Query: 23 LSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWN 82
+SL++ + +D DG+ +SG +HY R W D + K K G++ ++TYV WN
Sbjct: 16 ISLSFSIDYDNNCFMKDGKPFRYISGGMHYFRVPQYYWKDRLLKLKASGMNTVQTYVPWN 75
Query: 83 AHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEE 142
HEP+ +QY+F GN +L F++ Q L VILR GPY+CAEW++GG P WL P I
Sbjct: 76 LHEPIPKQYNFAGNANLTSFLEIAQSLDLLVILRPGPYICAEWDFGGLPGWLLKDPSIVI 135
Query: 143 LRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYI 202
+ K +M + + ++++ + K GGP+I+ Q+ENEYG DY Y+
Sbjct: 136 RSSQGKAYMEAVDAWMSVLLPLVK--PFLYENGGPVIMVQVENEYG----DYIHCDHQYM 189
Query: 203 NWCAKM----------ATSLDIGVPWIMCQESDAPSPMFT--------PNNP-------- 236
++ + D G + PS T P+ P
Sbjct: 190 LHLQQLFRYHLTDDIILFTTDDGSNLTAIECGTLPSLYTTVDFGANTDPSIPFANQRKLQ 249
Query: 237 -NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTS 295
P + +E +TGW WG RT++ +A A+ + + N YM+ GGTNFG S
Sbjct: 250 QKGPLVNSEFYTGWLDYWGTPHQTRTSKVVADALDKILALNASV-NLYMFEGGTNFGFWS 308
Query: 296 GGPY------LTTSYDYDAPIDEYGHLNQPKWGHLREL 327
G + + TSYDYDAP+ E G L + K+ +RE+
Sbjct: 309 GADFHGQYQPVPTSYDYDAPLTEAGDLTE-KYHAIREV 345
>gi|60683238|ref|YP_213382.1| beta-galactosidase [Bacteroides fragilis NCTC 9343]
gi|60494672|emb|CAH09473.1| putative exported beta-galactosidase [Bacteroides fragilis NCTC
9343]
Length = 628
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 117/327 (35%), Positives = 168/327 (51%), Gaps = 43/327 (13%)
Query: 39 DGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLD 98
+G+ +LSG +HY R W ++ K GL+ + TYVFWN HEP ++DFTG+ +
Sbjct: 37 NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 99 LIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFT 158
L FIKT ++G+ VILR GPYVCAEW +GG+P WL N+ G+ E+R N F+ +T
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGM-EIRRDNPEFL----KYT 151
Query: 159 TLIVDMAKKE--KLFASQGGPIILAQIENEYGNVMSDYGDAG-KSYINWCAKMATSL-DI 214
+D KE L ++GGPI++ Q ENE+G+ ++ D + + + AK+ L D
Sbjct: 152 KAYIDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADA 211
Query: 215 G--VPWI------MCQESDAPSPMFTPNNPNS----------------PKIWTENWTGWF 250
G VP + + P + T N + P + E + GW
Sbjct: 212 GFNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQYHDGKGPYMVAEFYPGWL 271
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY--------LTT 302
W P+ A +A ++ Q +F N+YM HGGTNFG TSG Y T
Sbjct: 272 SHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDMT 330
Query: 303 SYDYDAPIDEYGHLNQPKWGHLRELHK 329
SYDYDAPI E G + PK+ +R + K
Sbjct: 331 SYDYDAPISEAGWVT-PKYDSIRNVIK 356
Score = 46.2 bits (108), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 47/194 (24%), Positives = 74/194 (38%), Gaps = 43/194 (22%)
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDK 566
+ +L +G NYGS+ GI PV + G+ I+ ++ +D+
Sbjct: 468 LQILVENMGRINYGSEIVHNTKGIISPVQIAGKE----IVGGWDMYQLP-------MDEM 516
Query: 567 KFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNL 626
A++ + S+ L Y+ TF D ++++ GKG +VNG N+
Sbjct: 517 PDLTKLKADTHKNVPSEVAKLKGCPVLYEGTFTLDKVGD-TFMDMESWGKGIVFVNGVNI 575
Query: 627 GRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
GRYW P Q Y VP W+K G N +V+FE
Sbjct: 576 GRYWKV-----------------------------GPQQTLY-VPGVWLKKGENKIVIFE 605
Query: 687 EFGGNPSQINFQTV 700
+ P Q +TV
Sbjct: 606 QLNETP-QTEVKTV 618
>gi|257899628|ref|ZP_05679281.1| glycosyl hydrolase [Enterococcus faecium Com15]
gi|257837540|gb|EEV62614.1| glycosyl hydrolase [Enterococcus faecium Com15]
Length = 595
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 118/357 (33%), Positives = 174/357 (48%), Gaps = 52/357 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG ++SG+IHY R P W + K G + +ETY+ WN HEP +DF+G
Sbjct: 12 VDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGFK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+++F+K Q+ L VILR Y+CAEW +GG P WL P I +R+T+ FM +++N+
Sbjct: 72 DIVQFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDI-RVRSTDPRFMEKLKNY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG KSY+ ++ + I VP
Sbjct: 131 YQVL--LPKLAPLQITQGGPVIMMQLENEYGS----YG-MEKSYLRQTKELMLAHSIDVP 183
Query: 218 -------WIMC---------------------QESDAPSPMFTPNN-PNSPKIWTENWTG 248
W+ +E+ F N+ N P + E W G
Sbjct: 184 LFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF WG R E+LA V + G N YM+HGGTNFG +G P +
Sbjct: 244 WFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSM----EKTLTYGNVTNTDYGNSVS 353
TSYDYDA ++E G + + R + ++ S+ +T T N+ SVS
Sbjct: 302 -TSYDYDALLNEAGQPTEKYYAVQRVIKEVCPSVWQAEPRTKTLKNLGTYPVNKSVS 357
>gi|345800024|ref|XP_546385.3| PREDICTED: galactosidase, beta 1-like 3 [Canis lupus familiaris]
Length = 808
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 124/390 (31%), Positives = 182/390 (46%), Gaps = 57/390 (14%)
Query: 18 QTLFNLSLAYRVSHDGRA--------------ITIDGERKILLSGSIHYPRSTPGMWPDL 63
Q+LFN S + + RA T+ G + + GSIHY R W D
Sbjct: 203 QSLFNWSHLTPLELEDRAAGLEPQSPGGRKPCFTLGGHKFQVFGGSIHYFRVPRAYWGDR 262
Query: 64 IKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCA 123
++K K G + + TYV WN HEP R ++DF+GNLD+ F+ + GL+VILR GPY+C+
Sbjct: 263 LRKLKACGFNTVTTYVPWNLHEPERGKFDFSGNLDMEAFVLLAAEMGLWVILRPGPYICS 322
Query: 124 EWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQI 183
E + GG P WL P + LRTT F+ + + ++ ++ L +GGPII Q+
Sbjct: 323 EIDLGGLPSWLLQDPKM-VLRTTYSGFVKAVDKYFDHLI--SRVVPLQYRRGGPIIAVQV 379
Query: 184 ENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPS--------------- 228
ENEYG+ D G Y+ + K L+ G+ ++ DA +
Sbjct: 380 ENEYGSFAEDRG-----YMPYLQKAL--LERGIVELLVTSDDAENLLKGHIKGVLATINM 432
Query: 229 --------PMFTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQ 280
+ + N P + E W GWF +WG + + +D+ V +F +F
Sbjct: 433 NSFQESDFKLLSYVQSNKPIMVMEFWVGWFDTWGSEHKVKNPKDVEETVTKFIASEISF- 491
Query: 281 NYYMYHGGTNFGRTSGGPY------LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSM 334
N YM+HGGTNFG +G + TSYDYDA + E G + K+ LR L + ++
Sbjct: 492 NVYMFHGGTNFGFMNGATDFGIHRGVVTSYDYDAVLTEAGDYTE-KYFKLRRLFGSVSAI 550
Query: 335 EKTLTYGNVTNTDYGNSVSGSSYNLPAWSV 364
+Y SV S Y LP W V
Sbjct: 551 PLPPLPELTPKAEY-PSVKPSLY-LPLWDV 578
>gi|257888197|ref|ZP_05667850.1| glycosyl hydrolase [Enterococcus faecium 1,141,733]
gi|431040248|ref|ZP_19492755.1| beta-galactosidase [Enterococcus faecium E1590]
gi|431763679|ref|ZP_19552228.1| beta-galactosidase [Enterococcus faecium E3548]
gi|257824251|gb|EEV51183.1| glycosyl hydrolase [Enterococcus faecium 1,141,733]
gi|430562100|gb|ELB01353.1| beta-galactosidase [Enterococcus faecium E1590]
gi|430622052|gb|ELB58793.1| beta-galactosidase [Enterococcus faecium E3548]
Length = 595
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 118/357 (33%), Positives = 174/357 (48%), Gaps = 52/357 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG ++SG+IHY R P W + K G + +ETY+ WN HEP +DF+G
Sbjct: 12 VDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGFK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
+++RF+K Q+ L VILR Y+CAEW +GG P WL P I +R+T+ FM +++N+
Sbjct: 72 NVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNI-RVRSTDPRFMEKLKNY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG KSY+ ++ + I VP
Sbjct: 131 YQVL--LPKLAPLQITQGGPVIMMQLENEYGS----YG-MEKSYLRQTKELMLAHSIDVP 183
Query: 218 -------WIMC---------------------QESDAPSPMFTPNN-PNSPKIWTENWTG 248
W+ +E+ F N+ N P + E W G
Sbjct: 184 LFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF WG R E+LA V + G N YM+HGGTNFG +G P +
Sbjct: 244 WFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSM----EKTLTYGNVTNTDYGNSVS 353
TSYDYDA ++E G + + R + ++ S+ +T T N+ SVS
Sbjct: 302 -TSYDYDALLNEAGQPTEKYYAVQRVIKEVCPSVWQAEPRTKTLKNLGTYPVNRSVS 357
>gi|320162379|ref|YP_004175604.1| beta-galactosidase [Anaerolinea thermophila UNI-1]
gi|319996233|dbj|BAJ65004.1| beta-galactosidase [Anaerolinea thermophila UNI-1]
Length = 583
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 114/350 (32%), Positives = 173/350 (49%), Gaps = 41/350 (11%)
Query: 32 DGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQY 91
+G +DGE +L+G++HY R P W D + K K GL+ +ETYV WN HEP ++
Sbjct: 7 EGDHFELDGEPFRILAGAMHYFRVHPAYWKDRLLKLKAMGLNTVETYVAWNLHEPHEGEF 66
Query: 92 DFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFM 151
F L++ R+I+ + GLYVI+R GPY+CAEW GG P WL P + +LR + ++
Sbjct: 67 HFGDWLNIERYIELAGELGLYVIVRPGPYICAEWEMGGLPAWLLKDPQM-KLRCMYQPYL 125
Query: 152 NEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATS 211
+ + + + + M + L +++GGPII Q+ENEYG+ +D Y+ + ++
Sbjct: 126 DAVGEYFSQL--MHRLVPLQSTRGGPIIAMQVENEYGSYGND-----TRYLKYLEELLRQ 178
Query: 212 LDI--------GVPWIMCQESDAPSPMFTPNNPNSPK---------------IWTENWTG 248
+ GV M Q P N N P + E W G
Sbjct: 179 CGVDVLLFTADGVADEMMQYGSLPHLFKAVNFGNRPGDAFEKLREYQTGGPLLVAEFWDG 238
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-----PYLT-- 301
WF WG + R+A ++A + G + N YM+HGGTNFG +G P+ T
Sbjct: 239 WFDHWGERHHTRSAGEVARVLDDLLSEGASV-NLYMFHGGTNFGFMNGANAFPSPHYTPT 297
Query: 302 -TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN 350
TSYDYDAP+ E G++ PK+ +RE+ + + + V YG
Sbjct: 298 VTSYDYDAPLSECGNIT-PKYEAMREVIGKYVDLPEMPDFPPVVRHAYGK 346
>gi|431593417|ref|ZP_19521746.1| beta-galactosidase [Enterococcus faecium E1861]
gi|430591294|gb|ELB29332.1| beta-galactosidase [Enterococcus faecium E1861]
Length = 595
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 118/357 (33%), Positives = 174/357 (48%), Gaps = 52/357 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG ++SG+IHY R P W + K G + +ETY+ WN HEP +DF+G
Sbjct: 12 VDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGFK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
+++RF+K Q+ L VILR Y+CAEW +GG P WL P I +R+T+ FM +++N+
Sbjct: 72 NVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDI-RVRSTDPRFMEKLKNY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG KSY+ ++ + I VP
Sbjct: 131 YQVL--LPKLAPLQITQGGPVIMMQLENEYGS----YG-MEKSYLRQTKELMLAHSIDVP 183
Query: 218 -------WIMC---------------------QESDAPSPMFTPNN-PNSPKIWTENWTG 248
W+ +E+ F N+ N P + E W G
Sbjct: 184 LFTSDGAWLEVLDAGILIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF WG R E+LA V + G N YM+HGGTNFG +G P +
Sbjct: 244 WFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSM----EKTLTYGNVTNTDYGNSVS 353
TSYDYDA ++E G + + R + ++ S+ +T T N+ SVS
Sbjct: 302 -TSYDYDALLNEAGQPTEKYYAVQRVIKEVCPSVWQAEPRTKTLKNLGTYPVNKSVS 357
>gi|227538632|ref|ZP_03968681.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33300]
gi|227241551|gb|EEI91566.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33300]
Length = 638
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 191/719 (26%), Positives = 292/719 (40%), Gaps = 152/719 (21%)
Query: 13 LCL-ILQTLFNLSLAYRVSHDGRAITI---------DGERKILLSGSIHYPRSTPGMWPD 62
LC +L T F ++A++ + T DG+ +LSG +HY R W
Sbjct: 8 LCYAVLTTTFMSAIAFQDVQAQKKHTFEIKDGNFVYDGKTTRILSGEMHYARIPHQYWKH 67
Query: 63 LIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVC 122
++ K GL+ + TYVFWN HE ++F G+ DL FIKT + GL+VILR GPY C
Sbjct: 68 RLQMVKSMGLNTVATYVFWNFHEESPGNWNFEGDHDLAAFIKTAGEVGLHVILRPGPYAC 127
Query: 123 AEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKE--KLFASQGGPIIL 180
AEW++GG+P WL + G+E +R N F+ +T +D KE L + GGPII+
Sbjct: 128 AEWDFGGYPWWLQKIDGLE-IRRDNAKFLE----YTKKYIDRLAKEVGSLQITNGGPIIM 182
Query: 181 AQIENEYGNVMSDYGD----AGKSYINWCAKMATSLDIGVPWIMCQES------DAPSPM 230
Q ENE+G+ +S D K+Y K VP S P +
Sbjct: 183 VQAENEFGSYVSQRKDIPLEEHKAYNAKIKKQLEEAGFNVPLFTSDGSWLFEGGAIPGAL 242
Query: 231 FTPNNPNS----------------PKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQ 274
T N N+ P + E + GW W K A +A ++ Q
Sbjct: 243 PTANGENNISNLKKVVDQYNNNQGPYMVAEFYPGWLDHWAEPFAKVDAGRIARQTEKYLQ 302
Query: 275 FGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSM 334
+F NYYM HGGTNFG TSG +Y+ + I QP
Sbjct: 303 NDISF-NYYMVHGGTNFGFTSGA-----NYNNKSDI-------QP--------------- 334
Query: 335 EKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQA 394
++T+ DY +S + + P + SI + T + N ++ P+
Sbjct: 335 -------DITSYDYDAPISEAGWATPKYD-SIRTVIQKYADYTVPAVPKANPVIEIPSIK 386
Query: 395 GNDQAPLQWKWRPEMINDFVVRGKGHFALN-TLIDQKSTNDVSDYLWYMTNADLKDDDPI 453
A N F G +N T ++ + N + Y+ Y + PI
Sbjct: 387 LTAVA-----------NVFDYAKSGKTTINETPLNFEQLNQANGYVLYSKQFN----QPI 431
Query: 454 LSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLF---ERPVKLTRGKNQISLL 510
N L+I+ Y++G TK G N +F E + + + + +L
Sbjct: 432 -----NGKLKIDGLRDFAVVYIDG-------TKVGELNRVFKNYEMDIDIPFN-STLQIL 478
Query: 511 SATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYK-VGLYGLDDKKFY 569
+G NYGS+ GI PVL+ D I D WT + + + + D
Sbjct: 479 VENMGRINYGSEMIHNHKGIISPVLI----NDMEITGD-----WTMQQLPMDKVPDLAGK 529
Query: 570 NAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRY 629
A + + +SK L + Y+ TF+ D ++++ GKG ++NG N+GRY
Sbjct: 530 QTAAIQNTKTNASKIAALTGQPVLYQGTFDLKEIGD-TFIDMEKWGKGIVFINGINIGRY 588
Query: 630 WPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEF 688
W T GP Q ++P ++K G N++V+FE+
Sbjct: 589 WKT----------------GP--------------QHTLYIPAPYLKKGSNSIVIFEQL 617
>gi|237721434|ref|ZP_04551915.1| beta-galactosidase [Bacteroides sp. 2_2_4]
gi|293370839|ref|ZP_06617384.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
gi|229449230|gb|EEO55021.1| beta-galactosidase [Bacteroides sp. 2_2_4]
gi|292634055|gb|EFF52599.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
Length = 777
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 117/355 (32%), Positives = 175/355 (49%), Gaps = 48/355 (13%)
Query: 6 HCSRAILLCLILQTLFNL--SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDL 63
H + ++L +I+ L + S +V TI+G+ L+ G +HYPR W D
Sbjct: 7 HKTVLVILNIIVSFLISSCSSPKEQVRIGNGTFTIEGKDIQLICGEMHYPRIPHEYWRDR 66
Query: 64 IKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCA 123
+K+A+ GL+ + YVFWN HE ++DF+G D+ FI+T Q++GLYVILR GPYVCA
Sbjct: 67 LKRARAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGPYVCA 126
Query: 124 EWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKK-EKLFASQGGPIILAQ 182
EW++GG+P WL + R+ + F++ + + I ++ K+ L + GG II+ Q
Sbjct: 127 EWDFGGYPSWLLKEKDM-TYRSKDPRFLSYCERY---IKELGKQLSPLTINNGGNIIMVQ 182
Query: 183 IENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQ----------ESDAPS---- 228
+ENEYG+ +D K Y+ M VP C E P+
Sbjct: 183 VENEYGSYAAD-----KGYLAAIRDMIKEAGFNVPLFTCDGGGQVEAGHTEGALPTLNGV 237
Query: 229 ------PMFTPNNPNSPKIWTENWTGWFKSWGGKDP----KRTAEDLAFAVARFFQFGGT 278
+ P E + WF WG + +R AE L + ++ G
Sbjct: 238 FGEDIFKVIDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLSH-----GV 292
Query: 279 FQNYYMYHGGTNF----GRTSGGPY--LTTSYDYDAPIDEYGHLNQPKWGHLREL 327
+ YM+HGGTNF G +GG Y TSYDYDAP+ E+G+ PK+ RE+
Sbjct: 293 SVSMYMFHGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWGNC-YPKYHAFREV 346
Score = 44.7 bits (104), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 50/119 (42%), Gaps = 36/119 (30%)
Query: 568 FYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLG 627
Y K + E G + K VP ++K TF + D V ++ GKG WVNG +LG
Sbjct: 505 LYKEKVSEMEFGETIKGVP-----AFHKGTFTVEKKGDCFV-DMSQWGKGAVWVNGKSLG 558
Query: 628 RYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
R+W N G P Q Y +P W+K+G N +V+FE
Sbjct: 559 RFW----------------------------NIG-PQQTLY-LPAPWLKEGENEIVVFE 587
>gi|431741495|ref|ZP_19530400.1| beta-galactosidase [Enterococcus faecium E2039]
gi|430601673|gb|ELB39267.1| beta-galactosidase [Enterococcus faecium E2039]
Length = 595
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 118/357 (33%), Positives = 174/357 (48%), Gaps = 52/357 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG ++SG+IHY R P W + K G + +ETY+ WN HEP +DF+G
Sbjct: 12 VDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGFK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
+++RF+K Q+ L VILR Y+CAEW +GG P WL P I +R+T+ FM +++N+
Sbjct: 72 NVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDI-RVRSTDPRFMEKLKNY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG KSY+ ++ + I VP
Sbjct: 131 YQVL--LPKLAPLQITQGGPVIMMQLENEYGS----YG-MEKSYLRQTKELMLAHSIDVP 183
Query: 218 -------WIMC---------------------QESDAPSPMFTPNN-PNSPKIWTENWTG 248
W+ +E+ F N+ N P + E W G
Sbjct: 184 LFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF WG R E+LA V + G N YM+HGGTNFG +G P +
Sbjct: 244 WFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSM----EKTLTYGNVTNTDYGNSVS 353
TSYDYDA ++E G + + R + ++ S+ +T T N+ SVS
Sbjct: 302 -TSYDYDALLNEAGQPTEKYYAVQRVIKEVCPSVWQAEPRTKTLKNLGTYPVNKSVS 357
>gi|71275091|ref|ZP_00651378.1| Beta-galactosidase [Xylella fastidiosa Dixon]
gi|170731075|ref|YP_001776508.1| beta-galactosidase [Xylella fastidiosa M12]
gi|71163900|gb|EAO13615.1| Beta-galactosidase [Xylella fastidiosa Dixon]
gi|71730559|gb|EAO32637.1| Beta-galactosidase [Xylella fastidiosa Ann-1]
gi|167965868|gb|ACA12878.1| Beta-galactosidase [Xylella fastidiosa M12]
Length = 612
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 117/355 (32%), Positives = 167/355 (47%), Gaps = 51/355 (14%)
Query: 14 CLILQTLFNLSLAYRVSH--------DGRAITIDGERKILLSGSIHYPRSTPGMWPDLIK 65
L L +F ++L VS G DG L+SG+IH+ R W D ++
Sbjct: 5 LLTLPLIFAIALPIGVSAAPWPAFSTRGTQFIRDGRPYQLISGAIHFQRIPRAYWKDRLQ 64
Query: 66 KAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEW 125
KA+ GL+ +ETYVFWN E Q+DFTGN D+ F++ QGL VILR GPYVCAEW
Sbjct: 65 KARAMGLNTVETYVFWNLVELREGQFDFTGNNDIGAFVREAASQGLNVILRPGPYVCAEW 124
Query: 126 NYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIEN 185
GGFP WL P + +R+ + F++ Q + + + L S GGPII Q+EN
Sbjct: 125 EAGGFPAWLFADPTLR-VRSQDPRFLDASQRYLEALGTQVRP--LLNSNGGPIIAMQVEN 181
Query: 186 EYGNVMSDYGDAGKSYINWCAKMATSLDIG-----------------VPWIMCQESDAPS 228
EYG+ D+G Y+ + +G +P ++ + AP
Sbjct: 182 EYGSYGDDHG-----YLQAVRALFIKAGLGGALLFTSDGAQMLGNGTLPDVLAAVNVAPG 236
Query: 229 PM------FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNY 282
+P P++ E W GWF WG + A+ A + + G + N
Sbjct: 237 EAKQALDKLATFHPGQPQLVGEYWAGWFDQWGKPHAQTDAKQQADEIEWMLRQGHSI-NL 295
Query: 283 YMYHGGTNFGRTSGGPY----------LTTSYDYDAPIDEYGHLNQPKWGHLREL 327
YM+ GGT+FG +G + TTSYDYDA +DE G PK+ R++
Sbjct: 296 YMFVGGTSFGFMNGANFQGGPGDHYSPQTTSYDYDAALDEAGR-PMPKFALFRDV 349
>gi|418000981|ref|ZP_12641151.1| beta-galactosidase 3 [Lactobacillus casei UCD174]
gi|418009807|ref|ZP_12649594.1| beta-galactosidase 3 [Lactobacillus casei Lc-10]
gi|410548851|gb|EKQ23035.1| beta-galactosidase 3 [Lactobacillus casei UCD174]
gi|410554934|gb|EKQ28899.1| beta-galactosidase 3 [Lactobacillus casei Lc-10]
Length = 598
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 116/354 (32%), Positives = 170/354 (48%), Gaps = 64/354 (18%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + H+ +DG+ +LSG+IHY R P W + K G + +ETYV WN HE
Sbjct: 4 FSIDHE---FMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEY 60
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+DF+G LD+ RF+ T +D GLY I+R PY+CAEW +GGFP WL + LRT
Sbjct: 61 NEGDFDFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL--LTKKMRLRTD 118
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
+ ++ + + T ++ ++ + GG +I+ Q+ENEYG+ D K Y+ A
Sbjct: 119 DPAYLQAIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSYGED-----KDYLAAVA 171
Query: 207 KMATSLDIGVPWIMCQESDAPSP------------MFTPNNPNS---------------- 238
++ + VP SD P P + T N S
Sbjct: 172 ELMKKHGVDVPLFT---SDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAH 228
Query: 239 ----PKIWTENWTGWFKSWGG----KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTN 290
P + E W GWF WG +DP+ TAEDL + R G+ N YM+HGGTN
Sbjct: 229 GHDWPLMCMEFWDGWFNRWGEPIIRRDPEETAEDLRAVIQR-----GSV-NLYMFHGGTN 282
Query: 291 FGRTSGGPYL-------TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKT 337
FG +G TSYDYDAP++E G+ + + +H++L S +T
Sbjct: 283 FGFMNGTSARKDHDLPQVTSYDYDAPLNEQGNPTPKYFAIQKMIHEVLPSQAQT 336
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 50/200 (25%), Positives = 83/200 (41%), Gaps = 33/200 (16%)
Query: 436 SDYLWYMTNADLKDDDPILSGSSNMT---LRINSSGQVLHAYVNGNYVDSQWTKYGASND 492
++L T L +P++SG+ T LR+ + + A+ +G + +Q+ + +
Sbjct: 376 QEFLGQYTGYTLYRTNPLISGTDKGTPAKLRVIDARDRVQAFFDGKSLATQYQEAIGDDI 435
Query: 493 LFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPN--GIPGPVLLVGRAGDETIIKDLS 550
L G++Q+ LL + NYGSK + + GI V++ D IKD
Sbjct: 436 LLPE----VEGRHQLDLLVENMSRVNYGSKIEAITQFKGIRTGVMV-----DLHFIKDYL 486
Query: 551 SHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLN 610
Y LD K A + W + +Y+ F+ D L+
Sbjct: 487 Q---------YPLDLNK---APQLDFTGDWQAGTP------AFYQYGFDVVKPQD-TYLD 527
Query: 611 LQGMGKGFAWVNGYNLGRYW 630
+G GKG VNG N+GR+W
Sbjct: 528 CRGFGKGVMLVNGVNIGRFW 547
>gi|293570811|ref|ZP_06681858.1| beta-galactosidase [Enterococcus faecium E980]
gi|430840422|ref|ZP_19458347.1| beta-galactosidase [Enterococcus faecium E1007]
gi|431064256|ref|ZP_19493603.1| beta-galactosidase [Enterococcus faecium E1604]
gi|431124630|ref|ZP_19498626.1| beta-galactosidase [Enterococcus faecium E1613]
gi|431738579|ref|ZP_19527522.1| beta-galactosidase [Enterococcus faecium E1972]
gi|291609079|gb|EFF38354.1| beta-galactosidase [Enterococcus faecium E980]
gi|430495187|gb|ELA71394.1| beta-galactosidase [Enterococcus faecium E1007]
gi|430566915|gb|ELB06003.1| beta-galactosidase [Enterococcus faecium E1613]
gi|430568897|gb|ELB07927.1| beta-galactosidase [Enterococcus faecium E1604]
gi|430597307|gb|ELB35110.1| beta-galactosidase [Enterococcus faecium E1972]
Length = 595
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 118/357 (33%), Positives = 174/357 (48%), Gaps = 52/357 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG ++SG+IHY R P W + K G + +ETY+ WN HEP +DF+G
Sbjct: 12 VDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGFK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
+++RF+K Q+ L VILR Y+CAEW +GG P WL P I +R+T+ FM +++N+
Sbjct: 72 NVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDI-RVRSTDPRFMEKLKNY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG KSY+ ++ + I VP
Sbjct: 131 YQVL--LPKLAPLQITQGGPVIMMQLENEYGS----YG-MEKSYLRQTKELMLAHSIDVP 183
Query: 218 -------WIMC---------------------QESDAPSPMFTPNN-PNSPKIWTENWTG 248
W+ +E+ F N+ N P + E W G
Sbjct: 184 LFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF WG R E+LA V + G N YM+HGGTNFG +G P +
Sbjct: 244 WFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSM----EKTLTYGNVTNTDYGNSVS 353
TSYDYDA ++E G + + R + ++ S+ +T T N+ SVS
Sbjct: 302 -TSYDYDALLNEAGQPTEKYYAVQRVIKEVCPSVWQAEPRTKTLKNLGTYPVNKSVS 357
>gi|257876100|ref|ZP_05655753.1| glycosyl hydrolase [Enterococcus casseliflavus EC20]
gi|257810266|gb|EEV39086.1| glycosyl hydrolase [Enterococcus casseliflavus EC20]
Length = 591
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 112/314 (35%), Positives = 151/314 (48%), Gaps = 47/314 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ L+SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 12 LDGKPIKLISGAIHYFRMTPAQWTDSLYNLKALGANTVETYIPWNLHEPREGVYDFEGMK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+ F+K Q GL VILR Y+CAEW +GG P WL N P LR+T+ FM +++N+
Sbjct: 72 DICAFVKQAQALGLMVILRPSVYICAEWEFGGLPAWLLNEP--MRLRSTDPRFMAKVRNY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L + GGP+I+ Q+ENEYG+ YG K+Y+ ++ I VP
Sbjct: 130 FQVL--LPKLVPLQITHGGPVIMMQVENEYGS----YG-MEKAYLRQTKELMEEYGIDVP 182
Query: 218 WIM----------------------------CQESDAPSPMFTPNN-PNSPKIWTENWTG 248
+E+ A F + N P + E W G
Sbjct: 183 LFTSDGAWEEVLDAGTLIEDDVFVTGNFGSRSKENAAVMKEFMAKHGKNWPIMCMEYWDG 242
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-------T 301
WF WG KR +DLA V G N YM+HGGTNFG +G
Sbjct: 243 WFNRWGEPIIKRAGQDLANEVKEMLAVGSL--NLYMFHGGTNFGFYNGCSARGALDLPQV 300
Query: 302 TSYDYDAPIDEYGH 315
+SYDYDA + E G
Sbjct: 301 SSYDYDALLTEAGE 314
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 58/237 (24%), Positives = 96/237 (40%), Gaps = 54/237 (22%)
Query: 462 LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFE-RPVKLTRGKNQISLLSATVGLQNYG 520
L++ + LH + +G Q+ + L + P K T ++ +L +G NYG
Sbjct: 401 LKVVEASDRLHIFTDGQLQAIQYQETLGEELLIQGTPDKETI---ELDVLVENLGRVNYG 457
Query: 521 SKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGW 580
K + GP G G I++D+ H+ Y+ L ++ +A + + G
Sbjct: 458 FKLN-------GPTQAKGIRGG--IMQDIHFHQ-GYRHYPLTLSAEQL---QAIDYQAGK 504
Query: 581 SSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGC 640
+ + ++Y+TTF D ++ +G GKG VNG NLGRYW
Sbjct: 505 NPTHP------SFYQTTFTLTEVGD-TFIDCRGYGKGVVIVNGINLGRYWQ--------- 548
Query: 641 STESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
RGP S C P+ ++K G N +V+FE G ++ F
Sbjct: 549 -------RGPVHSLYC--------------PKEFLKKGSNEVVVFETDGVEIKELVF 584
>gi|126347898|emb|CAJ89618.1| putative beta-galactosidase [Streptomyces ambofaciens ATCC 23877]
Length = 615
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 114/332 (34%), Positives = 162/332 (48%), Gaps = 48/332 (14%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
++H A G +LSGS+HY R P W D + + GL+ ++TYV WN HE
Sbjct: 25 LTHTHGAFLRRGRPHRVLSGSLHYFRVHPEQWADRLDRLAALGLNTVDTYVPWNFHERRP 84
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+ F G DL RF++ Q GL V++R GPY+CAEW+ GG P WL PG+ LR ++
Sbjct: 85 GEARFDGWRDLARFVRLAQRAGLDVMVRPGPYICAEWDNGGLPAWLTGTPGMR-LRAGHQ 143
Query: 149 VFMNEMQN-FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
+++ + F L+ +A+ L A GGP++ QIENEYG+ YGD +Y+ W
Sbjct: 144 PYLDAVARWFDALVPRVAE---LQAVHGGPVVAVQIENEYGS----YGD-DHAYVRWVRD 195
Query: 208 MATSLDIGVPWIMCQESDAPSPM--------------------------FTPNNPNSPKI 241
+D G+ ++ +D P+P+ P P +
Sbjct: 196 --ALVDRGITELLYT-ADGPTPLMLDGGTVPGELAAATFGSRAAEAAALLRSRRPGEPFL 252
Query: 242 WTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY-- 299
E W GWF WG K R+ + A V GG+ + YM HGGTNFG +G +
Sbjct: 253 CAEFWNGWFDHWGEKHHVRSRDGAAQEVEEILDAGGSV-SLYMAHGGTNFGLWAGANHDG 311
Query: 300 -----LTTSYDYDAPIDEYGHLNQPKWGHLRE 326
TSYD DAP+ E+G L PK+ LRE
Sbjct: 312 GVLRPTVTSYDSDAPVSEHGALT-PKFHALRE 342
>gi|313149603|ref|ZP_07811796.1| glycoside hydrolase family 35 [Bacteroides fragilis 3_1_12]
gi|313138370|gb|EFR55730.1| glycoside hydrolase family 35 [Bacteroides fragilis 3_1_12]
Length = 628
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 116/325 (35%), Positives = 167/325 (51%), Gaps = 43/325 (13%)
Query: 39 DGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLD 98
+G+ +LSG +HY R W ++ K GL+ + TYVFWN HEP ++DFTG+ +
Sbjct: 37 NGKVTPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 99 LIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFT 158
L FIKT ++G+ VILR GPYVCAEW +GG+P WL N+ G+ E+R N F+ +T
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGM-EIRRDNPEFL----KYT 151
Query: 159 TLIVDMAKKE--KLFASQGGPIILAQIENEYGNVMSDYGDAG-KSYINWCAKMATSL-DI 214
+D KE L ++GGPI++ Q ENE+G+ ++ D + + + AK+ L D
Sbjct: 152 KAYIDRLYKEVGNLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADA 211
Query: 215 G--VPWI------MCQESDAPSPMFTPNNPNS----------------PKIWTENWTGWF 250
G VP + + P + T N + P + E + GW
Sbjct: 212 GFNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVNQYHDGKGPYMVAEFYPGWL 271
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT--------T 302
W P+ A +A ++ Q +F N+YM HGGTNFG TSG Y T
Sbjct: 272 SHWAEPFPQVGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDLT 330
Query: 303 SYDYDAPIDEYGHLNQPKWGHLREL 327
SYDYDAPI E G + PK+ +R +
Sbjct: 331 SYDYDAPISEAGWVT-PKYDSIRNV 354
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 48/194 (24%), Positives = 76/194 (39%), Gaps = 43/194 (22%)
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDK 566
+ +L +G NYGS+ GI PV + G K+++ Y++ + + D
Sbjct: 468 LQILVENMGRINYGSEIVHNTKGIISPVKIAG--------KEITGEWDMYQLPMSEMPDL 519
Query: 567 KFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNL 626
A A + ++K L Y+ TF D ++++ GKG +VNG N+
Sbjct: 520 AKLKADAHANVPAEAAK---LKGCPVLYEGTFTLDNVGD-TFIDMENWGKGIIFVNGVNI 575
Query: 627 GRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
GRYW P Q Y +P W+K G N +V+FE
Sbjct: 576 GRYWKV-----------------------------GPQQTLY-IPGVWLKKGTNKIVIFE 605
Query: 687 EFGGNPSQINFQTV 700
+ P Q +TV
Sbjct: 606 QLNEVP-QAEVKTV 618
>gi|257416321|ref|ZP_05593315.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
gi|257158149|gb|EEU88109.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
Length = 594
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 12 LNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 72 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 182
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 183 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 239
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 240 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 297
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 298 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|257079244|ref|ZP_05573605.1| beta-galactosidase [Enterococcus faecalis JH1]
gi|294780244|ref|ZP_06745615.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
gi|397700110|ref|YP_006537898.1| beta-galactosidase [Enterococcus faecalis D32]
gi|256987274|gb|EEU74576.1| beta-galactosidase [Enterococcus faecalis JH1]
gi|294452672|gb|EFG21103.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
gi|397336749|gb|AFO44421.1| beta-galactosidase [Enterococcus faecalis D32]
Length = 594
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 12 LNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 72 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 182
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 183 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 239
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 240 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 297
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 298 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|257087085|ref|ZP_05581446.1| beta-galactosidase [Enterococcus faecalis D6]
gi|256995115|gb|EEU82417.1| beta-galactosidase [Enterococcus faecalis D6]
Length = 594
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 12 LNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 72 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 182
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 183 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 239
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 240 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 297
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 298 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|227533108|ref|ZP_03963157.1| beta-galactosidase 3, partial [Lactobacillus paracasei subsp.
paracasei ATCC 25302]
gi|227189289|gb|EEI69356.1| beta-galactosidase 3 [Lactobacillus paracasei subsp. paracasei ATCC
25302]
Length = 578
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 116/354 (32%), Positives = 170/354 (48%), Gaps = 64/354 (18%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + H+ +DG+ +LSG+IHY R P W + K G + +ETYV WN HE
Sbjct: 11 FSIDHE---FMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEY 67
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+DF+G LD+ RF+ T +D GLY I+R PY+CAEW +GGFP WL + LRT
Sbjct: 68 NEGDFDFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL--LTKKMRLRTD 125
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
+ ++ + + T ++ ++ + GG +I+ Q+ENEYG+ D K Y+ A
Sbjct: 126 DPAYLQAIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSYGED-----KDYLAAVA 178
Query: 207 KMATSLDIGVPWIMCQESDAPSP------------MFTPNNPNS---------------- 238
++ + VP SD P P + T N S
Sbjct: 179 ELMKKHGVDVPLFT---SDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAH 235
Query: 239 ----PKIWTENWTGWFKSWGG----KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTN 290
P + E W GWF WG +DP+ TAEDL + R G+ N YM+HGGTN
Sbjct: 236 GHDWPLMCMEFWDGWFNRWGEPIIRRDPEETAEDLRAVIQR-----GSV-NLYMFHGGTN 289
Query: 291 FGRTSGGPYL-------TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKT 337
FG +G TSYDYDAP++E G+ + + +H++L S +T
Sbjct: 290 FGFMNGTSARKDHDLPQVTSYDYDAPLNEQGNPTPKYFAIQKMIHEVLPSQPQT 343
Score = 45.4 bits (106), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 50/200 (25%), Positives = 83/200 (41%), Gaps = 33/200 (16%)
Query: 436 SDYLWYMTNADLKDDDPILSGSSNMT---LRINSSGQVLHAYVNGNYVDSQWTKYGASND 492
++L T L +P++SG+ T LR+ + + A+ +G + +Q+ + +
Sbjct: 383 QEFLGQYTGYTLYRTNPLISGTDKGTPAKLRVIDARDRVQAFFDGKSLATQYQEAIGDDI 442
Query: 493 LFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPN--GIPGPVLLVGRAGDETIIKDLS 550
L G++Q+ LL + NYGSK + + GI V++ D IKD
Sbjct: 443 LLPE----VEGRHQLDLLVENMSRVNYGSKIEAITQFKGIRTGVMV-----DLHFIKDYL 493
Query: 551 SHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLN 610
Y LD K A + W + +Y+ F+ D L+
Sbjct: 494 Q---------YPLDLNK---APQLDFTGDWQAGTP------AFYQYGFDVVKPQD-TYLD 534
Query: 611 LQGMGKGFAWVNGYNLGRYW 630
+G GKG VNG N+GR+W
Sbjct: 535 CRGFGKGVMLVNGVNIGRFW 554
>gi|239629323|ref|ZP_04672354.1| glycosyl hydrolase [Lactobacillus paracasei subsp. paracasei
8700:2]
gi|417979668|ref|ZP_12620358.1| beta-galactosidase 3 [Lactobacillus casei 12A]
gi|417982493|ref|ZP_12623148.1| beta-galactosidase 3 [Lactobacillus casei 21/1]
gi|239528009|gb|EEQ67010.1| glycosyl hydrolase [Lactobacillus paracasei subsp. paracasei
8700:2]
gi|410526941|gb|EKQ01818.1| beta-galactosidase 3 [Lactobacillus casei 12A]
gi|410529717|gb|EKQ04508.1| beta-galactosidase 3 [Lactobacillus casei 21/1]
Length = 598
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 116/354 (32%), Positives = 170/354 (48%), Gaps = 64/354 (18%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + H+ +DG+ +LSG+IHY R P W + K G + +ETYV WN HE
Sbjct: 4 FSIDHE---FMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEY 60
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+DF+G LD+ RF+ T +D GLY I+R PY+CAEW +GGFP WL + LRT
Sbjct: 61 NEGDFDFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL--LTKKMRLRTD 118
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
+ ++ + + T ++ ++ + GG +I+ Q+ENEYG+ D K Y+ A
Sbjct: 119 DPAYLQAIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSYGED-----KDYLAAVA 171
Query: 207 KMATSLDIGVPWIMCQESDAPSP------------MFTPNNPNS---------------- 238
++ + VP SD P P + T N S
Sbjct: 172 ELMKKHGVDVPLFT---SDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAH 228
Query: 239 ----PKIWTENWTGWFKSWGG----KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTN 290
P + E W GWF WG +DP+ TAEDL + R G+ N YM+HGGTN
Sbjct: 229 GHDWPLMCMEFWDGWFNRWGEPIIRRDPEETAEDLRAVIQR-----GSV-NLYMFHGGTN 282
Query: 291 FGRTSGGPYL-------TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKT 337
FG +G TSYDYDAP++E G+ + + +H++L S +T
Sbjct: 283 FGFMNGTSARKDHDLPQVTSYDYDAPLNEQGNPTPKYFAIQKMIHEVLPSQPQT 336
Score = 47.4 bits (111), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 52/200 (26%), Positives = 87/200 (43%), Gaps = 33/200 (16%)
Query: 436 SDYLWYMTNADLKDDDPILSGSSNMT---LRINSSGQVLHAYVNGNYVDSQWTKYGASND 492
++L T L +P++SG+ T LR+ + + A+ +GN + +Q+ + +D
Sbjct: 376 QEFLGQYTGYTLYRTNPLISGTDKGTPAKLRVIDARDRVQAFFDGNSLATQYQE-AIGDD 434
Query: 493 LFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPN--GIPGPVLLVGRAGDETIIKDLS 550
+ V+ G++Q+ LL + NYGSK + + GI V++ D IKD
Sbjct: 435 ILLPEVE---GRHQLDLLVENMSRVNYGSKIEAITQFKGIRTGVMV-----DLHFIKDYL 486
Query: 551 SHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLN 610
Y LD K A + W + +Y+ F+ D L+
Sbjct: 487 Q---------YPLDLNK---APRLDFTGDWQAGTP------AFYQYGFDVVKPQD-TYLD 527
Query: 611 LQGMGKGFAWVNGYNLGRYW 630
+G GKG VNG N+GR+W
Sbjct: 528 CRGFGKGVMLVNGVNIGRFW 547
>gi|429739263|ref|ZP_19273023.1| glycosyl hydrolase family 35 [Prevotella saccharolytica F0055]
gi|429157228|gb|EKX99829.1| glycosyl hydrolase family 35 [Prevotella saccharolytica F0055]
Length = 786
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 116/346 (33%), Positives = 164/346 (47%), Gaps = 38/346 (10%)
Query: 16 ILQTLFNLSLAYRVSHDGRAITIDGERKILLSG--------SIHYPRSTPGMWPDLIKKA 67
IL L + ++ G I + G++ LL+G +HYPR W I+
Sbjct: 10 ILFALLTVFTSFGAPKRG-GIFVAGDKTFLLNGKPFVIKAAELHYPRIPRPYWEHRIRMC 68
Query: 68 KEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNY 127
K G++ I YVFWN HE +++FTGN D+ F + Q GLYVI+R GPYVCAEW
Sbjct: 69 KALGMNTICLYVFWNIHEQQEGKFNFTGNNDVAAFCRLAQKHGLYVIVRPGPYVCAEWEM 128
Query: 128 GGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEY 187
GG P WL I LR + FM ++ F + + + L +GGPII+ Q+ENEY
Sbjct: 129 GGLPWWLLKKKDIR-LRERDPYFMERVKVFEQQVGN--QLAPLTIDKGGPIIMVQVENEY 185
Query: 188 GNVMSD---------------YGDAGKSYINWCAKMATSLDIGVPWIM----CQESDAPS 228
G+ D + +W + + + W M D
Sbjct: 186 GSYGVDKEYVSQIRDIVRSSGFDKVALFQCDWASNFEKNGLDDLIWTMNFGTGANIDEQF 245
Query: 229 PMFTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGG 288
P SPK+ +E W+GWF WG + R A+++ + G +F + YM HGG
Sbjct: 246 KRLGELRPQSPKMCSEFWSGWFDKWGARHETRPAKNMVAGIDEMLTKGISF-SLYMTHGG 304
Query: 289 TNFGRTSGG--PYL---TTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
T+FG +G P TSYDYDAPI+EYG L PK+ LR + +
Sbjct: 305 TSFGHWAGANSPGFAPDVTSYDYDAPINEYG-LATPKYYELRAMMQ 349
>gi|340346435|ref|ZP_08669560.1| family 35 glycosyl hydrolase [Prevotella dentalis DSM 3688]
gi|339611892|gb|EGQ16709.1| family 35 glycosyl hydrolase [Prevotella dentalis DSM 3688]
Length = 859
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 116/344 (33%), Positives = 163/344 (47%), Gaps = 66/344 (19%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++ + +HYPR W IK K G++ + YVFWN HE Q+DFTG
Sbjct: 103 LNGKPFVVKAAEVHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQREGQFDFTGQN 162
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+ F + Q G+YVI+R GPYVCAEW GG P WL I LR + FM ++ F
Sbjct: 163 DVAAFCRLAQQNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDI-RLREQDPYFMERVELF 221
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ + + L +GGPII+ Q+ENEYG+ D K+Y++ +
Sbjct: 222 EQKVAE--QLAPLTIRRGGPIIMVQVENEYGSYGED-----KAYVSQIRDVLRRY----- 269
Query: 218 WIMCQ----ESDAPSPM---------FTPN---------------------------NPN 237
W + +A SP+ FT N P+
Sbjct: 270 WSLSPTGEGRGEAASPLMFQCDWSSNFTRNGLDDLVWTMNFGTGANINDQFRRLGELRPD 329
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
+PK+ +E W+GWF WG + R A D+ + G +F + YM HGGT+FG +G
Sbjct: 330 APKMCSEFWSGWFDKWGARHETRPARDMVAGIDEMLSKGISF-SLYMTHGGTSFGHWAGA 388
Query: 298 --PYL---TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
P TSYDYDAPI+EYG PK+ LR K+MEK
Sbjct: 389 NSPGFAPDVTSYDYDAPINEYGQAT-PKFWELR------KTMEK 425
>gi|257082326|ref|ZP_05576687.1| beta-galactosidase [Enterococcus faecalis E1Sol]
gi|256990356|gb|EEU77658.1| beta-galactosidase [Enterococcus faecalis E1Sol]
Length = 594
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 12 LNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 72 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 182
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 183 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 239
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 240 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 297
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 298 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|425056292|ref|ZP_18459750.1| putative beta-galactosidase [Enterococcus faecium 505]
gi|403032128|gb|EJY43702.1| putative beta-galactosidase [Enterococcus faecium 505]
Length = 595
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 118/357 (33%), Positives = 174/357 (48%), Gaps = 52/357 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG ++SG+IHY R P W + K G + +ETY+ WN HEP +DF+G
Sbjct: 12 VDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGFK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+++F+K Q+ L VILR Y+CAEW +GG P WL P I +R+T+ FM +++N+
Sbjct: 72 DVVQFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNI-RVRSTDPRFMEKLKNY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG KSY+ ++ + I VP
Sbjct: 131 YQVL--LPKLAPLQITQGGPVIMMQLENEYGS----YG-MEKSYLRQTKELMLAHSIDVP 183
Query: 218 -------WIMC---------------------QESDAPSPMFTPNN-PNSPKIWTENWTG 248
W+ +E+ F N+ N P + E W G
Sbjct: 184 LFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF WG R E+LA V + G N YM+HGGTNFG +G P +
Sbjct: 244 WFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSM----EKTLTYGNVTNTDYGNSVS 353
TSYDYDA ++E G + + R + ++ S+ +T T N+ SVS
Sbjct: 302 -TSYDYDALLNEAGQPTEKYYAVQRIIKEVCPSVWQAEPRTKTLKNLGTYPVNRSVS 357
>gi|257084951|ref|ZP_05579312.1| beta-galactosidase [Enterococcus faecalis Fly1]
gi|256992981|gb|EEU80283.1| beta-galactosidase [Enterococcus faecalis Fly1]
Length = 594
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 12 LNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 72 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 182
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 183 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 239
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 240 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 297
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 298 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|404372285|ref|ZP_10977584.1| hypothetical protein CSBG_00400 [Clostridium sp. 7_2_43FAA]
gi|226911573|gb|EEH96774.1| hypothetical protein CSBG_00400 [Clostridium sp. 7_2_43FAA]
Length = 593
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 111/333 (33%), Positives = 163/333 (48%), Gaps = 50/333 (15%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
ID + +LSG++HY R P W D + K G + +ETY+ WN HEP ++DF G
Sbjct: 12 IDDNKFKILSGAVHYFRIHPSQWGDTLFNLKALGFNTVETYIPWNIHEPYEGKFDFEGIK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+ +FIK + GLYVILR PY+CAEW +GG P WL I +LR+++ F+ +++N+
Sbjct: 72 DIEKFIKISEKLGLYVILRPTPYICAEWEFGGLPAWLLKDKEI-KLRSSDDNFIEKLRNY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ + + K ++GGP+++ Q+ENEYG+ YG+ K Y+ A + + VP
Sbjct: 131 YNDL--LPRLVKYQVTKGGPVLMMQVENEYGS----YGNE-KEYLRIVASIMKENGVDVP 183
Query: 218 -------WIMCQE----------------------SDAPSPMFTPNNPNSPKIWTENWTG 248
WI E D N P + E W G
Sbjct: 184 LFTSDGTWIEALECGSLIEDDIFVSGNFGSKSKENCDMLKDFILKNGKEWPIMCMEYWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-------T 301
WF WG +R + DLA V + G N YM+ GGTNFG +G
Sbjct: 244 WFNRWGEDIIRRDSIDLAEDVKEMLKIGSI--NLYMFRGGTNFGFMNGCSARGNNDLPQV 301
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSM 334
TSYDYDA + E+G+ + + EL K++KS+
Sbjct: 302 TSYDYDAILTEWGNPSDKYY----ELQKVMKSL 330
>gi|384518826|ref|YP_005706131.1| beta-galactosidase [Enterococcus faecalis 62]
gi|323480959|gb|ADX80398.1| beta-galactosidase [Enterococcus faecalis 62]
Length = 594
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 12 LNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 72 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 182
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 183 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 239
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 240 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 297
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 298 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|256762786|ref|ZP_05503366.1| beta-galactosidase [Enterococcus faecalis T3]
gi|256684037|gb|EEU23732.1| beta-galactosidase [Enterococcus faecalis T3]
Length = 594
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 12 LNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 72 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 182
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 183 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 239
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 240 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 297
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 298 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|326933328|ref|XP_003212758.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Meleagris
gallopavo]
Length = 656
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 191/712 (26%), Positives = 281/712 (39%), Gaps = 170/712 (23%)
Query: 15 LILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDA 74
L+ L+ +L + H ++G + GS+HY R W D + K K GL+
Sbjct: 54 LVPLRLWGRTLGLQTEHS--QFLLEGMPFRIFGGSMHYFRVPREYWEDRMLKMKACGLNT 111
Query: 75 IETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWL 134
+ TYV WN HE R ++DF+ NLDL F+ GL+VILR GPY+C+EW+ GG P WL
Sbjct: 112 LTTYVPWNLHEQTRGKFDFSENLDLEAFLSLAAKNGLWVILRPGPYICSEWDLGGLPSWL 171
Query: 135 HNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDY 194
P + +LRTT K F + + ++ + L +GGPII Q+ENEYG+ D
Sbjct: 172 LQDPEM-QLRTTYKGFTEAVDAYFDHLMPIVV--PLQYKRGGPIIAVQVENEYGSYAKD- 227
Query: 195 GDAGKSYINWCAKMATSLDIGVPWIMCQ-----------ESDAPSPMFTPNNP------- 236
+Y+ + S I V +M E + F P
Sbjct: 228 ----PNYMAYVKMALLSRGI-VELLMTSDNKNGLSFGLVEGALATVNFQKLEPGVLKYLD 282
Query: 237 ----NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG 292
+ PK+ E WTGWF +WGG A+++ VA + G + N YM+HGGTNFG
Sbjct: 283 TVQRDQPKMVMEYWTGWFDNWGGPHYVFDADEMVNTVASILKLGASI-NLYMFHGGTNFG 341
Query: 293 RTSGGPYL------TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNT 346
+G TSYDYDA + E G K+ LR+L + L +
Sbjct: 342 FMNGALKTDEYKSDVTSYDYDAVLTEAGDYTS-KFFKLRQLFSTIIGQPLPLPPMIESKA 400
Query: 347 DYGNSVSGSSYNLPAWSV--SILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWK 404
YG + +L W V S++ K+E V N ND + +
Sbjct: 401 SYGAILLHQYISL--WDVLPSLVQPIKSE------------FPVNMENLQLNDSSGQSYG 446
Query: 405 WRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRI 464
+ ++ + V+ G GH S + V D
Sbjct: 447 Y---VLYETVIFGGGHL--------HSRDHVRD--------------------------- 468
Query: 465 NSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFD 524
QV +VN YV + E + +G Q+ LL G NYG +
Sbjct: 469 --RAQV---FVNTMYVGE------LDYNTVELSLPEGQGFRQLRLLVENRGRVNYGLALN 517
Query: 525 MVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKN 584
G+ G + L ++T +++ +Y L+ K + K+ GWS+
Sbjct: 518 EQRKGLIGDIFL-----NKTPLRNFK---------IYSLEMKPDF-LKSLRQTAGWSA-- 560
Query: 585 VP-------LNRRMTWYKTTFEAPLENDP--VVLNLQGMGKGFAWVNGYNLGRYWPTYLA 635
VP R W +E+ P L LQG KG +VNG+NLGRYW
Sbjct: 561 VPDYFVGPAFFRGRLW--------IEHQPQDTFLKLQGWEKGVVFVNGHNLGRYWKI--- 609
Query: 636 EEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEE 687
P + Y +P W+ G N +++FEE
Sbjct: 610 --------------------------GPQETLY-LPGPWLWKGSNEIIIFEE 634
>gi|307269354|ref|ZP_07550702.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
gi|306514322|gb|EFM82889.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
Length = 604
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 22 LNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 81
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 82 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 139
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 192
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 193 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 249
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 250 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 307
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 308 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|125717147|ref|YP_001034280.1| glycosyl hydrolase family protein [Streptococcus sanguinis SK36]
gi|125497064|gb|ABN43730.1| Glycosylhydrolase, family 35, putative [Streptococcus sanguinis
SK36]
Length = 592
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 111/336 (33%), Positives = 162/336 (48%), Gaps = 48/336 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +LSG+I Y R P W D + K G + +ETY+ W HEP Q+ G L
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFKAEGML 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D + K +++ GLY+I+R PY+CAE+++GG P WL P + LR + +F+ ++ +F
Sbjct: 72 DFEAYFKLVKEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSM-RLRVNHPLFLEKVSHF 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ K + QGGPI++ Q+ENEYG+ D K+Y+ A+M + VP
Sbjct: 131 YDWL--FPKLLPYQSDQGGPILMMQVENEYGSYAED-----KAYMRSIAQMMKVRGVTVP 183
Query: 218 -------WIMCQESDA--PSPMFTPNNPNS--------------------PKIWTENWTG 248
WI ES +F N S P + TE W G
Sbjct: 184 LFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF W + +R AEDLA V Q G N ++ GGTNFG SG P +
Sbjct: 244 WFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
TSYD+DAPI E+G + + R H++ +E+
Sbjct: 302 -TSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELEQ 336
>gi|422866702|ref|ZP_16913314.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
gi|329578150|gb|EGG59560.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
Length = 604
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 22 LNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 81
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 82 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 139
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 192
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 193 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 249
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 250 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 307
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 308 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|294672870|ref|YP_003573486.1| beta-galactosidase [Prevotella ruminicola 23]
gi|294473700|gb|ADE83089.1| putative beta-galactosidase [Prevotella ruminicola 23]
Length = 787
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 112/351 (31%), Positives = 165/351 (47%), Gaps = 32/351 (9%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
+L+ +L T + A + + ++GE ++ + +HYPR W IK K
Sbjct: 6 LLITALLLTFAQFASAGDFTVGNKTFLLNGEPFVVKAAEVHYPRIPRPYWEHRIKMCKAL 65
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
G++ + YVFWN HE Q+DFT N D+ F + Q G+YVI+R GPYVCAEW GG
Sbjct: 66 GMNTLCIYVFWNIHEQREGQFDFTDNNDVAEFCRLAQKNGMYVIVRPGPYVCAEWEMGGL 125
Query: 131 PVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV 190
P WL I LR + F+ ++ F + + + L GGPII+ Q+ENEYG+
Sbjct: 126 PWWLLKKKDI-RLRERDPYFLERVKIFEQKVGE--QLAPLTIQNGGPIIMVQVENEYGSY 182
Query: 191 MSD--------------YGDAGKSY-INWCAKMATSLDIGVPWIM----CQESDAPSPMF 231
D YG+ + +W + + + W M D
Sbjct: 183 GEDKPYVSEIRDCLRGIYGEKLTLFQCDWSSNFERNGLDDLVWTMNFGTGANIDHEFARL 242
Query: 232 TPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNF 291
PN+P + +E W+GWF WG R A+D+ + +F + YM HGGT+F
Sbjct: 243 KQLRPNAPLMCSEFWSGWFDKWGANHETRPAKDMVDGMDEMLSKNISF-SLYMTHGGTSF 301
Query: 292 GRTSGG--PYL---TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKT 337
G +G P TSYDYDAPI+EYG + +L K+++ KT
Sbjct: 302 GHWAGANSPGFAPDVTSYDYDAPINEYGGTTE----KFFQLRKMMQKYSKT 348
>gi|449664450|ref|XP_002165261.2| PREDICTED: beta-galactosidase-like [Hydra magnipapillata]
Length = 589
Score = 173 bits (438), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 102/337 (30%), Positives = 164/337 (48%), Gaps = 39/337 (11%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEG 70
I L ++ + + +++ ++ DG +SGSIHY R W D + K ++
Sbjct: 7 ICLLIVFAKISSSERTFKIDYENNKFLKDGTEFRYISGSIHYMRVPEDYWEDRLSKIRKA 66
Query: 71 GLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGF 130
GL+AI+TY+ WN HEP + F G ++ +F+K Q L VILR GPY+CAEW +GGF
Sbjct: 67 GLNAIQTYIPWNFHEPTEGNFQFGGQQNVFKFLKLAQKYDLLVILRPGPYICAEWEFGGF 126
Query: 131 PVWLHNMPG--IEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYG 188
P WL G +LRT++ +++ +++N+ +++ ++ GGPII Q+ENEYG
Sbjct: 127 PYWLLKKVGNKTMQLRTSDNLYLQKVENYMSVL--LSGLRPYLYENGGPIITVQVENEYG 184
Query: 189 NVMSDY------GDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFTPNN------- 235
+ D+ + Y+ + T+ G ++ C P+F +
Sbjct: 185 SYGCDHEYMYKLESIFRKYLGENVILFTTDGAGDSYLKC---GTIKPLFATVDFGPTAEP 241
Query: 236 -----------PNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYM 284
P P + +E +TGW WGG+ + ED+ + + + N YM
Sbjct: 242 KLYFDIQRKYQPLGPLVNSEFYTGWLDHWGGQHAHTSLEDVTDTLDKMLSLNASV-NMYM 300
Query: 285 YHGGTNFGRTSGGPYLT-------TSYDYDAPIDEYG 314
+ GGTNFG +G + TSYDYDAP+ E G
Sbjct: 301 FEGGTNFGFMNGANQDSNSLQPQPTSYDYDAPLSEAG 337
>gi|28199702|ref|NP_780016.1| beta-galactosidase [Xylella fastidiosa Temecula1]
gi|182682446|ref|YP_001830606.1| beta-galactosidase [Xylella fastidiosa M23]
gi|386083781|ref|YP_006000063.1| Beta-galactosidase [Xylella fastidiosa subsp. fastidiosa GB514]
gi|417557800|ref|ZP_12208811.1| Beta-galactosidase [Xylella fastidiosa EB92.1]
gi|28057823|gb|AAO29665.1| beta-galactosidase [Xylella fastidiosa Temecula1]
gi|182632556|gb|ACB93332.1| Beta-galactosidase [Xylella fastidiosa M23]
gi|307578728|gb|ADN62697.1| Beta-galactosidase [Xylella fastidiosa subsp. fastidiosa GB514]
gi|338179583|gb|EGO82518.1| Beta-galactosidase [Xylella fastidiosa EB92.1]
Length = 612
Score = 173 bits (438), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 118/353 (33%), Positives = 173/353 (49%), Gaps = 34/353 (9%)
Query: 4 LKHC-SRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPD 62
L+H + +++ ++L + + S G DG L+SG+IH+ R W D
Sbjct: 2 LRHLLTLSLIFAIVLPIGVSAAPWPAFSTRGTQFIRDGRPYQLISGAIHFQRIPRAYWKD 61
Query: 63 LIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVC 122
++KA+ GL+ +ETYVFWN E Q+DFTGN D+ F++ QGL VILR GPYVC
Sbjct: 62 RLQKARAMGLNTVETYVFWNLVELREGQFDFTGNNDIGAFVREAASQGLNVILRPGPYVC 121
Query: 123 AEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQ 182
AEW GGFP WL P + +R+ + F++ Q + + + L GGPII Q
Sbjct: 122 AEWEAGGFPAWLFADPTL-RVRSQDPRFLDASQRYLEALGTQVR--PLLNGNGGPIIAVQ 178
Query: 183 IENEYGNVMSDYG----------DAG-KSYINWCAKMATSLDIG-VPWIMCQESDAPSPM 230
+ENEYG+ D+G AG + + A A L G +P ++ + AP
Sbjct: 179 VENEYGSYGDDHGYLQAVRALFIKAGLGGALLFTADGAQMLGNGTLPDVLAAVNVAPGEA 238
Query: 231 ------FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYM 284
+P P++ E W GWF WG + A+ A + + G + N YM
Sbjct: 239 KQALDKLATFHPGQPQLVGEYWAGWFDQWGKPHAQTDAKQQADEIEWMLRQGHSI-NLYM 297
Query: 285 YHGGTNFGRTSGGPY----------LTTSYDYDAPIDEYGHLNQPKWGHLREL 327
+ GGT+FG +G + TTSYDYDA +DE G PK+ R++
Sbjct: 298 FVGGTSFGFMNGANFQGGPSDHYSPQTTSYDYDAVLDEAGR-PMPKFALFRDV 349
>gi|422859360|ref|ZP_16906010.1| beta-galactosidase [Streptococcus sanguinis SK1057]
gi|327459140|gb|EGF05488.1| beta-galactosidase [Streptococcus sanguinis SK1057]
Length = 592
Score = 173 bits (438), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 111/336 (33%), Positives = 161/336 (47%), Gaps = 48/336 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +LSG+I Y R P W D + K G + +ETY+ W HEP Q+ G L
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D + K +++ GLY+I+R PY+CAE+++GG P WL P + LR + +F+ ++ +F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSM-RLRVNHPLFLEKVSHF 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ K + QGGPI++ Q+ENEYG+ D K+Y+ A+M + VP
Sbjct: 131 YDWL--FPKLLPYQSDQGGPILMMQVENEYGSYAED-----KAYMRSIAQMMKVRGVTVP 183
Query: 218 -------WIMCQESDA--PSPMFTPNNPNS--------------------PKIWTENWTG 248
WI ES +F N S P + TE W G
Sbjct: 184 LFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKEWPLMCTEFWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF W + +R AEDLA V Q G N ++ GGTNFG SG P +
Sbjct: 244 WFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
TSYD+DAPI E+G + + R H++ E+
Sbjct: 302 -TSYDFDAPITEWGQPTEKYYAVQRVTHEVFPEFEQ 336
>gi|424764212|ref|ZP_18191655.1| putative beta-galactosidase [Enterococcus faecium TX1337RF]
gi|402420907|gb|EJV53177.1| putative beta-galactosidase [Enterococcus faecium TX1337RF]
Length = 595
Score = 173 bits (438), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 118/357 (33%), Positives = 174/357 (48%), Gaps = 52/357 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG ++SG+IHY R P W + K G + +ETY+ WN HEP +DF+G
Sbjct: 12 VDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGFK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
+++RF+K Q+ L VILR Y+CAEW +GG P WL P I +R+T+ FM +++N+
Sbjct: 72 NVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNI-RVRSTDPRFMEKLKNY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG KSY+ ++ + I VP
Sbjct: 131 YQVL--LPKLAPLQITQGGPVIMMQLENEYGS----YG-MEKSYLRQTKELMLAHSIDVP 183
Query: 218 -------WIMC---------------------QESDAPSPMFTPNN-PNSPKIWTENWTG 248
W+ +E+ F N+ N P + E W G
Sbjct: 184 LFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF WG R E+LA V + G N YM+HGGTNFG +G P +
Sbjct: 244 WFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSM----EKTLTYGNVTNTDYGNSVS 353
TSYDYDA ++E G + + R + ++ S+ +T T N+ SVS
Sbjct: 302 -TSYDYDALLNEAGQPTEKYYAVQRVIKEVCPSVWQAEPRTKTLKNLGTYPVNRSVS 357
>gi|422695218|ref|ZP_16753206.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
gi|315147501|gb|EFT91517.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
Length = 604
Score = 173 bits (438), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 22 LNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 81
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 82 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 139
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 192
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 193 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 249
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 250 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 307
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 308 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|433651261|ref|YP_007277640.1| beta-galactosidase [Prevotella dentalis DSM 3688]
gi|433301794|gb|AGB27610.1| beta-galactosidase [Prevotella dentalis DSM 3688]
Length = 797
Score = 173 bits (438), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 116/344 (33%), Positives = 163/344 (47%), Gaps = 66/344 (19%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++ + +HYPR W IK K G++ + YVFWN HE Q+DFTG
Sbjct: 41 LNGKPFVVKAAEVHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQREGQFDFTGQN 100
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+ F + Q G+YVI+R GPYVCAEW GG P WL I LR + FM ++ F
Sbjct: 101 DVAAFCRLAQQNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDI-RLREQDPYFMERVELF 159
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ + + L +GGPII+ Q+ENEYG+ D K+Y++ +
Sbjct: 160 EQKVAE--QLAPLTIRRGGPIIMVQVENEYGSYGED-----KAYVSQIRDVLRRY----- 207
Query: 218 WIMCQ----ESDAPSPM---------FTPN---------------------------NPN 237
W + +A SP+ FT N P+
Sbjct: 208 WSLSPTGEGRGEAASPLMFQCDWSSNFTRNGLDDLVWTMNFGTGANINDQFRRLGELRPD 267
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
+PK+ +E W+GWF WG + R A D+ + G +F + YM HGGT+FG +G
Sbjct: 268 APKMCSEFWSGWFDKWGARHETRPARDMVAGIDEMLSKGISF-SLYMTHGGTSFGHWAGA 326
Query: 298 --PYL---TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
P TSYDYDAPI+EYG PK+ LR K+MEK
Sbjct: 327 NSPGFAPDVTSYDYDAPINEYGQAT-PKFWELR------KTMEK 363
>gi|312901788|ref|ZP_07761056.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
gi|311291123|gb|EFQ69679.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
Length = 604
Score = 173 bits (438), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 22 LNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 81
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 82 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 139
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 192
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 193 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 249
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 250 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 307
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 308 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|422722062|ref|ZP_16778639.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2137]
gi|424672983|ref|ZP_18109926.1| putative beta-galactosidase [Enterococcus faecalis 599]
gi|315027959|gb|EFT39891.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2137]
gi|402352793|gb|EJU87629.1| putative beta-galactosidase [Enterococcus faecalis 599]
Length = 604
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 22 LNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 81
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 82 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 139
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 192
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 193 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 249
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 250 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 307
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 308 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|71731106|gb|EAO33173.1| Beta-galactosidase [Xylella fastidiosa subsp. sandyi Ann-1]
Length = 612
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 130/401 (32%), Positives = 188/401 (46%), Gaps = 46/401 (11%)
Query: 14 CLILQTLFNLSLAYRVSH--------DGRAITIDGERKILLSGSIHYPRSTPGMWPDLIK 65
L L +F ++L VS G DG L+SG+IH+ R W D ++
Sbjct: 5 LLTLSLIFAIALPIGVSAAPWPAFSTRGTQFIRDGRPYQLISGAIHFQRIPRAYWKDRLQ 64
Query: 66 KAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEW 125
KA+ GL+ +ETYVFWN E Q+DFTGN D+ F++ QGL VILR GPYVCAEW
Sbjct: 65 KARAMGLNTVETYVFWNLVELREGQFDFTGNNDIGAFVREAASQGLNVILRPGPYVCAEW 124
Query: 126 NYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIEN 185
GGFP WL P + +R+ + F++ Q + + + L GGPII Q+EN
Sbjct: 125 EAGGFPAWLFADPTL-RVRSQDPRFLDASQRYLEALGTQVR--PLLNGNGGPIIAVQVEN 181
Query: 186 EYGNVMSDYG----------DAG-KSYINWCAKMATSLDIG-VPWIMCQESDAPSPM--- 230
EYG+ D+G AG + + A A L G +P ++ + AP
Sbjct: 182 EYGSYGDDHGYLQAVHALFIKAGLGGALLFTADGAQMLGNGTLPDVLAAVNFAPGEAKQA 241
Query: 231 ---FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHG 287
+P P++ E W GWF WG + A+ A + + G + N YM+ G
Sbjct: 242 LDKLATFHPGQPQLVGEYWAGWFDQWGKPHAQTDAKQQADEIEWMLRQGHSI-NLYMFVG 300
Query: 288 GTNFGRTSGGPY----------LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKT 337
GT+FG +G + TTSYDYDA +DE G PK+ R++ + ++
Sbjct: 301 GTSFGFMNGANFQGGPGDHYSPQTTSYDYDAVLDEAGR-PMPKFALFRDVITRVTGLQPP 359
Query: 338 LTYGNVTNTDYGNSVSGSSY----NLPAWSVSILPDCKTEE 374
G D ++ +S NLPA +V+ D + E
Sbjct: 360 PLPGASRFIDLPDTPLRASASLWDNLPA-AVATTADPQPME 399
>gi|422864131|ref|ZP_16910760.1| beta-galactosidase [Streptococcus sanguinis SK408]
gi|327472954|gb|EGF18381.1| beta-galactosidase [Streptococcus sanguinis SK408]
Length = 592
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 111/336 (33%), Positives = 162/336 (48%), Gaps = 48/336 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +LSG+I Y R P W D + K G + +ETY+ W HEP Q+ G L
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFKAEGML 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D + K +++ GLY+I+R PY+CAE+++GG P WL P + LR + +F+ ++ +F
Sbjct: 72 DFEAYFKLVKETGLYLIVRPTPYICAEFDFGGLPAWLLRYPSM-RLRVNHPLFLEKVSHF 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ K + QGGPI++ Q+ENEYG+ D K+Y+ A+M + VP
Sbjct: 131 YDWL--FPKLLPYQSDQGGPILMMQVENEYGSYAED-----KAYMRSIAQMMKVRGVTVP 183
Query: 218 -------WIMCQESDA--PSPMFTPNNPNS--------------------PKIWTENWTG 248
WI ES +F N S P + TE W G
Sbjct: 184 LFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF W + +R AEDLA V Q G N ++ GGTNFG SG P +
Sbjct: 244 WFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
TSYD+DAPI E+G + + R H++ +E+
Sbjct: 302 -TSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELEQ 336
>gi|422824944|ref|ZP_16873129.1| beta-galactosidase [Streptococcus sanguinis SK405]
gi|422827211|ref|ZP_16875390.1| beta-galactosidase [Streptococcus sanguinis SK678]
gi|422857055|ref|ZP_16903709.1| beta-galactosidase [Streptococcus sanguinis SK1]
gi|324992224|gb|EGC24146.1| beta-galactosidase [Streptococcus sanguinis SK405]
gi|324994315|gb|EGC26229.1| beta-galactosidase [Streptococcus sanguinis SK678]
gi|327459541|gb|EGF05887.1| beta-galactosidase [Streptococcus sanguinis SK1]
Length = 592
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 111/336 (33%), Positives = 162/336 (48%), Gaps = 48/336 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +LSG+I Y R P W D + K G + +ETY+ W HEP Q+ G L
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFKAEGML 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D + K +++ GLY+I+R PY+CAE+++GG P WL P + LR + +F+ ++ +F
Sbjct: 72 DFEAYFKLVKETGLYLIVRPTPYICAEFDFGGLPAWLLRYPSM-RLRVNHPLFLEKVSHF 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ K + QGGPI++ Q+ENEYG+ D K+Y+ A+M + VP
Sbjct: 131 YDWL--FPKLLPYQSDQGGPILMMQVENEYGSYAED-----KAYMRSIAQMMKVRGVTVP 183
Query: 218 -------WIMCQESDA--PSPMFTPNNPNS--------------------PKIWTENWTG 248
WI ES +F N S P + TE W G
Sbjct: 184 LFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF W + +R AEDLA V Q G N ++ GGTNFG SG P +
Sbjct: 244 WFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
TSYD+DAPI E+G + + R H++ +E+
Sbjct: 302 -TSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELEQ 336
>gi|307289344|ref|ZP_07569299.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
gi|422704713|ref|ZP_16762523.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
gi|306499711|gb|EFM69073.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
gi|315163744|gb|EFU07761.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
Length = 604
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 22 LNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 81
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 82 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 139
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 192
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 193 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 249
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 250 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 307
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 308 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|15837442|ref|NP_298130.1| beta-galactosidase [Xylella fastidiosa 9a5c]
gi|9105744|gb|AAF83650.1|AE003923_8 beta-galactosidase [Xylella fastidiosa 9a5c]
Length = 612
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 129/404 (31%), Positives = 189/404 (46%), Gaps = 39/404 (9%)
Query: 4 LKHC-SRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPD 62
L+H + ++ + L N + S G DG L+SG+IH+ R W D
Sbjct: 2 LRHLLTLPLIFVIALPIGVNAAPWPAFSTRGTQFIRDGRPYQLISGAIHFQRIPRAYWKD 61
Query: 63 LIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVC 122
++KA+ GL+ +ETYVFWN E Q+DFTGN D+ F++ QGL VILR GPYVC
Sbjct: 62 RLQKARAMGLNTVETYVFWNLVELREGQFDFTGNNDISAFVREAASQGLNVILRPGPYVC 121
Query: 123 AEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQ 182
AEW GGFP WL P + +R+ + F++ Q + + + L GGPII Q
Sbjct: 122 AEWEAGGFPAWLFADPTL-RVRSQDPRFLDASQRYLEALGTQVR--PLLNGNGGPIIAVQ 178
Query: 183 IENEYGNVMSDYG----------DAG-KSYINWCAKMATSLDIG-VPWIMCQESDAPSPM 230
+ENEYG+ D+G AG + + A A L G +P ++ + AP
Sbjct: 179 VENEYGSYGDDHGYLQAVRALFIKAGLGGALLFTADGAQMLGNGTLPDVLAAVNVAPGEA 238
Query: 231 ------FTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYM 284
+P P++ E W GWF WG + A+ A + + G + N YM
Sbjct: 239 KQALDKLATFHPGQPQLVGEYWAGWFDQWGKPHAQTDAKQQADEIEWMLRQGHSI-NLYM 297
Query: 285 YHGGTNFGRTSGGPY----------LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSM 334
+ GGT+FG +G + TTSYDYDA +DE G PK+ R++ + +
Sbjct: 298 FVGGTSFGFMNGANFQGGPSDHYSPQTTSYDYDAALDEAGR-PMPKFVLFRDVITRVTGL 356
Query: 335 EKTLTYGNVTNTDYGNSVSGSSY----NLPAWSVSILPDCKTEE 374
+ D N+ +S NLPA +V+ D + E
Sbjct: 357 QPPPLPAATRFIDLPNTPLRASASLWDNLPA-AVATTADPQPME 399
>gi|164519026|ref|NP_001073876.2| beta-galactosidase-1-like protein 3 [Homo sapiens]
gi|269849685|sp|Q8NCI6.3|GLBL3_HUMAN RecName: Full=Beta-galactosidase-1-like protein 3
Length = 653
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 117/351 (33%), Positives = 173/351 (49%), Gaps = 49/351 (13%)
Query: 20 LFNLSLAYRVSHDGRA---ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIE 76
L N S+ GR T++G + ++ GSIHY R W D + K K G + +
Sbjct: 61 LKNRSVGLGTESTGRGKPHFTLEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVT 120
Query: 77 TYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHN 136
TYV WN HEP R ++DF+GNLDL F+ + GL+VILR G Y+C+E + GG P WL
Sbjct: 121 TYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGRYICSEMDLGGLPSWLLQ 180
Query: 137 MPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGD 196
P + LRTTNK F+ ++ + ++ + L Q GP+I Q+ENEYG+ D
Sbjct: 181 DPRL-LLRTTNKSFIEAVEKYFDHLI--PRVIPLQYRQAGPVIAVQVENEYGSFNKD--- 234
Query: 197 AGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFTPNNP-------------------- 236
K+Y+ + K L G+ ++ SD + + +
Sbjct: 235 --KTYMPYLHKAL--LRRGIVELLLT-SDGEKHVLSGHTKGVLAAINLQKLHQDTFNQLH 289
Query: 237 ----NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG 292
+ P + E W GWF WG K + A+++ AV+ F ++ +F N YM+HGGTNFG
Sbjct: 290 KVQRDKPLLIMEYWVGWFDRWGDKHHVKDAKEVEHAVSEFIKYEISF-NVYMFHGGTNFG 348
Query: 293 RTSGGPY------LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKT 337
+G Y + TSYDYDA + E G + +L KL +S+ T
Sbjct: 349 FMNGATYFGKHSGIVTSYDYDAVLTEAGDYTE----KYLKLQKLFQSVSAT 395
>gi|423280524|ref|ZP_17259436.1| hypothetical protein HMPREF1203_03653 [Bacteroides fragilis HMW
610]
gi|404583731|gb|EKA88404.1| hypothetical protein HMPREF1203_03653 [Bacteroides fragilis HMW
610]
Length = 628
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 116/325 (35%), Positives = 167/325 (51%), Gaps = 43/325 (13%)
Query: 39 DGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLD 98
+G+ +LSG +HY R W ++ K GL+ + TYVFWN HEP ++DFTG+ +
Sbjct: 37 NGKVTPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 99 LIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFT 158
L FIKT ++G+ VILR GPYVCAEW +GG+P WL N+ G+ E+R N F+ +T
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGM-EIRRDNPEFL----KYT 151
Query: 159 TLIVDMAKKE--KLFASQGGPIILAQIENEYGNVMSDYGDAG-KSYINWCAKMATSL-DI 214
+D KE L ++GGPI++ Q ENE+G+ ++ D + + + AK+ L D
Sbjct: 152 KAYIDRLYKEVGDLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADA 211
Query: 215 G--VPWI------MCQESDAPSPMFTPNNPNS----------------PKIWTENWTGWF 250
G VP + + P + T N + P + E + GW
Sbjct: 212 GFNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVNQYHDGKGPYMVAEFYPGWL 271
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT--------T 302
W P+ A +A ++ Q +F N+YM HGGTNFG TSG Y T
Sbjct: 272 SHWAEPFPQVGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDLT 330
Query: 303 SYDYDAPIDEYGHLNQPKWGHLREL 327
SYDYDAPI E G + PK+ +R +
Sbjct: 331 SYDYDAPISEAGWVT-PKYDSIRNV 354
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 48/194 (24%), Positives = 76/194 (39%), Gaps = 43/194 (22%)
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDK 566
+ +L +G NYGS+ GI PV + G K+++ Y++ + + D
Sbjct: 468 LQILVENMGRINYGSEIVHNTKGIISPVKIAG--------KEITGEWDMYQLPMSEMPDL 519
Query: 567 KFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNL 626
A A + ++K L Y+ TF D ++++ GKG +VNG N+
Sbjct: 520 AKLKADAHANVPAEAAK---LKGCPVLYEGTFTLDNVGD-TFIDMENWGKGIIFVNGVNI 575
Query: 627 GRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
GRYW P Q Y +P W+K G N +V+FE
Sbjct: 576 GRYWKV-----------------------------GPQQTLY-IPGVWLKKGTNKIVIFE 605
Query: 687 EFGGNPSQINFQTV 700
+ P Q +TV
Sbjct: 606 QLNEVP-QAEVKTV 618
>gi|307272985|ref|ZP_07554232.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
gi|306510599|gb|EFM79622.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
Length = 604
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 160/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 22 LNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 81
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 82 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 139
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 140 YDVLMEKIVPHQLV--NGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 192
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 193 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 249
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 250 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 307
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 308 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|298384202|ref|ZP_06993762.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
gi|383123627|ref|ZP_09944306.1| hypothetical protein BSIG_3219 [Bacteroides sp. 1_1_6]
gi|251839745|gb|EES67828.1| hypothetical protein BSIG_3219 [Bacteroides sp. 1_1_6]
gi|298262481|gb|EFI05345.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
Length = 624
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 116/324 (35%), Positives = 169/324 (52%), Gaps = 43/324 (13%)
Query: 40 GERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDL 99
GE +LSG +HY R W ++ K GL+ + TYVFWN HE ++DF+G+ +L
Sbjct: 35 GEEIPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94
Query: 100 IRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTT 159
+I+ ++G+ VILR GPYVCAEW +GG+P WL N+PG+ E+R N F+ +T
Sbjct: 95 AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGM-EIRRDNTEFL----KYTK 149
Query: 160 LIVDMAKKE--KLFASQGGPIILAQIENEYGNVMSDYGDAG-KSYINWCAKMATSL-DIG 215
+D +E L ++GGPII+ Q ENE+G+ +S D + + ++ AK+ L D G
Sbjct: 150 KYIDRLYEEVGDLQCTKGGPIIMVQCENEFGSYVSQRKDIPLEEHRSYNAKIKGQLADAG 209
Query: 216 --VP-------WIM---CQESDAPSPMFTPNNPN------------SPKIWTENWTGWFK 251
+P W+ C P+ + N P + E ++GW
Sbjct: 210 FTIPLFTSDGSWLFEGGCVAGALPTANGESDIANLKKVVNQYHGDKGPYMVAEFYSGWLS 269
Query: 252 SWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT--------TS 303
WG P+ +A ++A + Q +F N+YM HGGTNFG TSG Y TS
Sbjct: 270 HWGEPFPQVSASEIARQTEAYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDLTS 328
Query: 304 YDYDAPIDEYGHLNQPKWGHLREL 327
YDYDAPI E G L PK+ +R +
Sbjct: 329 YDYDAPISEAGWLT-PKYDSIRSV 351
>gi|170782982|ref|YP_001711316.1| beta-galactosidase [Clavibacter michiganensis subsp. sepedonicus]
gi|169157552|emb|CAQ02748.1| beta-galactosidase [Clavibacter michiganensis subsp. sepedonicus]
Length = 615
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 112/320 (35%), Positives = 159/320 (49%), Gaps = 40/320 (12%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
S +R+ D +DG +++G++HY R P W D I+KA+ GLD IETYV WNA
Sbjct: 25 SARFRIGADD--FELDGRPHRVIAGALHYFRVHPDQWADRIRKARLMGLDTIETYVAWNA 82
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
H P R +D + LDL RF+ + +G++ I+R GPY+CAEW+ GG P WL P + +
Sbjct: 83 HSPERGTFDTSAGLDLGRFLDLVHAEGMHAIVRPGPYICAEWDGGGLPGWLFGDPAV-GV 141
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYIN 203
R + +++ + F + ++ ++ GGP+IL QIENEYG YGD + Y+
Sbjct: 142 RRSEPLYLAAVDEFLRRVYEIVAPRQI--DMGGPVILVQIENEYGA----YGDDAE-YLR 194
Query: 204 WCAKMATSLDIGVPWIMCQE---------------------SDAPSPMFT--PNNPNSPK 240
+ I VP + S A + T + P
Sbjct: 195 HLVDLTRESGIIVPLTTVDQPTDEMLSRGSLDELHRTGSFGSRAAERLETLRRHQRTGPL 254
Query: 241 IWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG---- 296
+ +E W GWF W G+ T+ A A G N YM+HGGTNFG T+G
Sbjct: 255 MCSEFWDGWFDHW-GEHHHTTSAADAAAELDALLAAGASVNIYMFHGGTNFGFTNGANHK 313
Query: 297 GPYLT--TSYDYDAPIDEYG 314
G Y + TSYDYDAP+DE G
Sbjct: 314 GTYQSHVTSYDYDAPLDETG 333
>gi|424665121|ref|ZP_18102157.1| hypothetical protein HMPREF1205_00996 [Bacteroides fragilis HMW
616]
gi|404574985|gb|EKA79730.1| hypothetical protein HMPREF1205_00996 [Bacteroides fragilis HMW
616]
Length = 628
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 116/325 (35%), Positives = 167/325 (51%), Gaps = 43/325 (13%)
Query: 39 DGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLD 98
+G+ +LSG +HY R W ++ K GL+ + TYVFWN HEP ++DFTG+ +
Sbjct: 37 NGKVTPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 99 LIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFT 158
L FIKT ++G+ VILR GPYVCAEW +GG+P WL N+ G+ E+R N F+ +T
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGM-EIRRDNPEFL----KYT 151
Query: 159 TLIVDMAKKE--KLFASQGGPIILAQIENEYGNVMSDYGDAG-KSYINWCAKMATSL-DI 214
+D KE L ++GGPI++ Q ENE+G+ ++ D + + + AK+ L D
Sbjct: 152 KAYIDRLYKEVGDLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADA 211
Query: 215 G--VPWI------MCQESDAPSPMFTPNNPNS----------------PKIWTENWTGWF 250
G VP + + P + T N + P + E + GW
Sbjct: 212 GFNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVNQYHDGKGPYMVAEFYPGWL 271
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT--------T 302
W P+ A +A ++ Q +F N+YM HGGTNFG TSG Y T
Sbjct: 272 SHWAEPFPQVGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDLT 330
Query: 303 SYDYDAPIDEYGHLNQPKWGHLREL 327
SYDYDAPI E G + PK+ +R +
Sbjct: 331 SYDYDAPISEAGWVT-PKYDSIRNV 354
Score = 45.1 bits (105), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 48/194 (24%), Positives = 76/194 (39%), Gaps = 43/194 (22%)
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDK 566
+ +L +G NYGS+ GI PV + G K+++ Y++ + + D
Sbjct: 468 LQILVENMGRINYGSEIVHNTKGIISPVKIAG--------KEITGEWDMYQLPMSEMPDL 519
Query: 567 KFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNL 626
A A + ++K L Y+ TF D ++++ GKG +VNG N+
Sbjct: 520 AKLKADAHANVPAEAAK---LKGCPVLYEGTFTLDNVGD-TFIDMENWGKGIIFVNGVNI 575
Query: 627 GRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
GRYW P Q Y +P W+K G N +V+FE
Sbjct: 576 GRYWKV-----------------------------GPQQTLY-IPGVWLKKGTNKIVIFE 605
Query: 687 EFGGNPSQINFQTV 700
+ P Q +TV
Sbjct: 606 QLNEVP-QAEVKTV 618
>gi|256964894|ref|ZP_05569065.1| beta-galactosidase [Enterococcus faecalis HIP11704]
gi|256955390|gb|EEU72022.1| beta-galactosidase [Enterococcus faecalis HIP11704]
Length = 594
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 160/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 12 LNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 72 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 130 YDVLMEKIVPHQLV--NGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 182
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 183 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 239
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 240 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 297
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 298 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|306832839|ref|ZP_07465973.1| beta-galactosidase [Streptococcus bovis ATCC 700338]
gi|304424978|gb|EFM28110.1| beta-galactosidase [Streptococcus bovis ATCC 700338]
Length = 595
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 112/318 (35%), Positives = 155/318 (48%), Gaps = 47/318 (14%)
Query: 34 RAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDF 93
+ +DG+ +LSGSIHY R P W + K G + +ETYV WN HEP ++DF
Sbjct: 8 ESFFLDGKPFKILSGSIHYFRIHPDDWYQSLYNLKALGFNTVETYVPWNLHEPREGEFDF 67
Query: 94 TGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNE 153
TG LDL RF+ Q+ GLY I+R PY+CAEW +GG P WL G+ +R+ +K F+
Sbjct: 68 TGILDLERFLTIAQELGLYAIVRPSPYICAEWEFGGLPAWLLE-KGV-RVRSQDKGFLQV 125
Query: 154 MQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLD 213
++ + +++ K +L QGG I++ Q+ENEYG+ D K Y+ +M L
Sbjct: 126 VKRYYEVLIPRLIKHQL--DQGGNILMFQVENEYGSYGED-----KVYLRELKQMMLELG 178
Query: 214 IGVPWIM----------------------------CQESDAPSPMFTPNNPNS-PKIWTE 244
+ P+ +E+ A MF P + E
Sbjct: 179 LEEPFFTSDGPWHTALRAGSLIEDDVLVTGNFGSKAKENFASMEMFFQQYGKKWPLMCME 238
Query: 245 NWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL---- 300
W GWF WG KR E+LA AV + G N YM+HGGTNFG +G
Sbjct: 239 FWDGWFNRWGEPVIKRDPEELADAVMEAIEIGSI--NLYMFHGGTNFGFMNGCSARKQTD 296
Query: 301 ---TTSYDYDAPIDEYGH 315
TSYDYDA +DE G+
Sbjct: 297 LPQVTSYDYDAILDEAGN 314
Score = 47.8 bits (112), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 51/207 (24%), Positives = 98/207 (47%), Gaps = 41/207 (19%)
Query: 428 DQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMT--LRINSSGQVLHAYVNGNYVDSQWT 485
+ ++ N + Y++Y T L+G T +R+ + ++NGN++ +Q+
Sbjct: 375 NMEALNQSTGYIFYRTK---------LNGYQGNTEKVRLIDTRDRAQVFLNGNHIVTQYQ 425
Query: 486 KYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETI 545
+ +D+ V T ++Q+ +L +G NYG K P+ G +GR +
Sbjct: 426 E-EIGDDI---QVNFTSEESQLDILVENMGRVNYGHKL-TAPSQHKG----IGRG----V 472
Query: 546 IKDLS-SHKW-TYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLE 603
+ DL ++W TY + + + + K+ + W + VP ++Y+ F L
Sbjct: 473 MLDLHFVNQWETYPLSMNSIKNLKYSSP--------WR-EGVP-----SFYEFKFHC-LN 517
Query: 604 NDPVVLNLQGMGKGFAWVNGYNLGRYW 630
+ +++ G GKG A++NGYNLGR+W
Sbjct: 518 PEDTYMDMSGFGKGVAFINGYNLGRFW 544
>gi|424759896|ref|ZP_18187551.1| putative beta-galactosidase [Enterococcus faecalis R508]
gi|402403967|gb|EJV36601.1| putative beta-galactosidase [Enterococcus faecalis R508]
Length = 604
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 108/325 (33%), Positives = 157/325 (48%), Gaps = 55/325 (16%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+LSG+IHY R P W + K G + +ETYV WN HEP + + F G LDL RF+K
Sbjct: 29 ILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLK 88
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + + ++++
Sbjct: 89 LAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEYYDVLMEK 146
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQES 224
+L + GG I++ QIENEYG+ +G+ K+Y+ + + + P+ S
Sbjct: 147 IVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAPFFT---S 196
Query: 225 DAP--------------------------------SPMFTPNNPNSPKIWTENWTGWFKS 252
D P F + P + E W GWF
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYLTTSY 304
W KR ++LA +V G N YM+HGGTNFG +G P + TSY
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 313
Query: 305 DYDAPIDEYGHLNQPKWGHLRELHK 329
DYDAP+DE G+ + + + LH+
Sbjct: 314 DYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|422701998|ref|ZP_16759838.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
gi|315169479|gb|EFU13496.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
Length = 604
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 22 LNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 81
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 82 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 139
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 192
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 193 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQVFFEEHGKKWPLMCMEF 249
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 250 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 307
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 308 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|224536014|ref|ZP_03676553.1| hypothetical protein BACCELL_00878 [Bacteroides cellulosilyticus
DSM 14838]
gi|224522370|gb|EEF91475.1| hypothetical protein BACCELL_00878 [Bacteroides cellulosilyticus
DSM 14838]
Length = 1106
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 110/313 (35%), Positives = 147/313 (46%), Gaps = 29/313 (9%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++ + +HYPR W IK K G++ + YVFWN+HEP YDFT
Sbjct: 359 LNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGVYDFTEQN 418
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL F + Q +YVILR GPYVCAEW GG P WL + LR ++ F+ + F
Sbjct: 419 DLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDV-RLRESDPYFIERVALF 477
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYG---------------DAGKSYI 202
+ K L + GGPII+ Q+ENEYG+ D G D
Sbjct: 478 EEAVAKQVK--NLTIANGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANFGNDIALFQC 535
Query: 203 NWCAKMATSLDIGVPWIM----CQESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGKDP 258
+W + + + W M D PNSP + +E W+GWF WG
Sbjct: 536 DWASNFTLNGLDDLIWTMNFGTGANVDQQFAKLKQLRPNSPLMCSEFWSGWFDKWGANHE 595
Query: 259 KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYL---TTSYDYDAPIDEY 313
R A D+ + G +F + YM HGGTN+G +G P TSYDYDAPI E
Sbjct: 596 TRPAADMIKGIDDMLSRGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAPISES 654
Query: 314 GHLNQPKWGHLRE 326
G PK+ LRE
Sbjct: 655 GQ-TTPKYWALRE 666
>gi|189463987|ref|ZP_03012772.1| hypothetical protein BACINT_00322 [Bacteroides intestinalis DSM
17393]
gi|189438560|gb|EDV07545.1| glycosyl hydrolase family 35 [Bacteroides intestinalis DSM 17393]
Length = 1106
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 110/313 (35%), Positives = 147/313 (46%), Gaps = 29/313 (9%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++ + +HYPR W IK K G++ + YVFWN+HEP YDFT
Sbjct: 359 LNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGVYDFTEQN 418
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL F + Q +YVILR GPYVCAEW GG P WL + LR ++ F+ + F
Sbjct: 419 DLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDV-RLRESDPYFIERVALF 477
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYG---------------DAGKSYI 202
+ K L + GGPII+ Q+ENEYG+ D G D
Sbjct: 478 EEAVAKQVK--DLTIANGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANFGNDIALFQC 535
Query: 203 NWCAKMATSLDIGVPWIM----CQESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGKDP 258
+W + + + W M D PNSP + +E W+GWF WG
Sbjct: 536 DWASNFTLNGLDDLIWTMNFGTGANVDQQFAKLKQLRPNSPLMCSEFWSGWFDKWGANHE 595
Query: 259 KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYL---TTSYDYDAPIDEY 313
R A D+ + G +F + YM HGGTN+G +G P TSYDYDAPI E
Sbjct: 596 TRPAADMIKGIDDMLSRGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAPISES 654
Query: 314 GHLNQPKWGHLRE 326
G PK+ LRE
Sbjct: 655 GQ-TTPKYWALRE 666
>gi|348508362|ref|XP_003441723.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Oreochromis
niloticus]
Length = 605
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 115/320 (35%), Positives = 155/320 (48%), Gaps = 31/320 (9%)
Query: 32 DGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQY 91
D T++G+ +L GS+HY R W D + K K GL+ + TYV WN HEP R +
Sbjct: 10 DSSQFTLEGKPFRILGGSVHYFRVPRAYWEDRLLKMKACGLNTLTTYVPWNLHEPERGTF 69
Query: 92 DFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFM 151
+F LDL ++ GL+VILR GPY+CAEW+ GG P WL + +LRTT F+
Sbjct: 70 NFQDQLDLKAYVSLAAQLGLWVILRPGPYICAEWDLGGLPSWLLQDEEM-QLRTTYPGFV 128
Query: 152 NEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK---M 208
N + + ++ + K L GGPII Q+ENEYG+ D D +I C + +
Sbjct: 129 NAVNLYFDKLISVIK--PLMFEGGGPIIAVQVENEYGSFAKD--DKYMPFIKNCLQSRGI 184
Query: 209 ATSLDIGVPWIMCQESDAPSPMFTPN---------------NPNSPKIWTENWTGWFKSW 253
L W + + T N P P + E W+GWF W
Sbjct: 185 KELLMTSDNWEGLRCGGVEGALKTVNLQRLSFGAIQHLADIQPQKPLMVMEYWSGWFDVW 244
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG----GPYLT--TSYDYD 307
G AED+ V+ G + N YM+HGGT FG +G G Y + TSYDYD
Sbjct: 245 GEHHHVFYAEDMLAVVSEILDRGVSI-NLYMFHGGTTFGFMNGAMDFGTYKSQVTSYDYD 303
Query: 308 APIDEYGHLNQPKWGHLREL 327
AP+ E G PK+ HLR L
Sbjct: 304 APLSEAGDCT-PKYHHLRNL 322
>gi|354581347|ref|ZP_09000251.1| Beta-galactosidase [Paenibacillus lactis 154]
gi|353201675|gb|EHB67128.1| Beta-galactosidase [Paenibacillus lactis 154]
Length = 587
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 121/367 (32%), Positives = 175/367 (47%), Gaps = 55/367 (14%)
Query: 40 GERKI-LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLD 98
GE I +LSG+IHY R P W D + K K GL+ +ETY+ WN HEP +++F+G D
Sbjct: 14 GEEAIQILSGAIHYFRVVPEYWEDRLLKLKACGLNTVETYIPWNWHEPDEGRFNFSGMAD 73
Query: 99 LIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFT 158
+ FI GL+VI+R PY+CAEW +GG P WL P + +LR + F+ ++ +
Sbjct: 74 IEAFITLAGKLGLHVIVRPSPYICAEWEFGGLPAWLLQDPHM-QLRCLDPKFLKKVDAYY 132
Query: 159 TLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC--AKMATSLDI-- 214
++ + L ++ GGPII QIENEYG+ +D +Y+ + A +A +D+
Sbjct: 133 DELI--PRLVPLLSTNGGPIIAVQIENEYGSYGND-----TAYLQYLQEALIARGVDVLL 185
Query: 215 ----GVPWIMCQESDAPSPMFTPN---------------NPNSPKIWTENWTGWFKSWGG 255
G M Q P T N P + E W GWF W
Sbjct: 186 FTSDGPTDGMLQGGTVPGVTATVNFGSRPSEAFAKLREYRSEDPLMCMEYWNGWFDHWMK 245
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY------LTTSYDYDAP 309
R +ED A A G + N+YM+HGGTNFG +G Y TSYDYDAP
Sbjct: 246 PHHTRDSEDAASVFAEMLALGASV-NFYMFHGGTNFGFYNGANYHDKYEPTITSYDYDAP 304
Query: 310 IDEYGHLNQPKWGHLRELHKLLKSME-----------KTLTYGNVTNTDYGNSVSGSSYN 358
+ E G + K+ +R++ + +E + YG V+ T Y + + N
Sbjct: 305 LSECGDVTT-KYEAVRQVIAKHQGVELGDLPALPDPVRKKAYGTVSMTSYADLLE----N 359
Query: 359 LPAWSVS 365
LP + S
Sbjct: 360 LPVLASS 366
>gi|288926246|ref|ZP_06420171.1| beta-galactosidase (Lactase) [Prevotella buccae D17]
gi|288336937|gb|EFC75298.1| beta-galactosidase (Lactase) [Prevotella buccae D17]
Length = 791
Score = 172 bits (437), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 115/354 (32%), Positives = 169/354 (47%), Gaps = 41/354 (11%)
Query: 3 TLKH--CSRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMW 60
T KH + A+L+ +L + + + ++G+ ++ + +HYPR W
Sbjct: 6 TFKHFIATVALLVTAMLSPVSAARKGGTFTVGDKTFLLNGKPFVVKAAELHYPRIPRPYW 65
Query: 61 PDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPY 120
IK K G++ + YVFWN HE ++DFT N D+ F + Q GLYVI+R GPY
Sbjct: 66 EHRIKMCKALGMNTVCLYVFWNIHEQQEGKFDFTDNNDVAEFCRLAQRNGLYVIVRPGPY 125
Query: 121 VCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIIL 180
VCAEW GG P WL I LR + FM ++ F + + + L GGPII+
Sbjct: 126 VCAEWEMGGLPWWLLKKKDI-RLREPDPYFMERVKLFERKVGE--QLASLTIQNGGPIIM 182
Query: 181 AQIENEYGNVMSDYGDAGKSYI--------------------NWCAKMATSLDIGVPWIM 220
Q+ENEYG+ YG+ K+Y+ +W + + + W M
Sbjct: 183 VQVENEYGS----YGE-NKAYVSAIRDIVRQSGFDKVTLFQCDWASNFEKNGLDDLVWTM 237
Query: 221 ----CQESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFG 276
+ D PN+P++ +E W+GWF WG + R A+ + + G
Sbjct: 238 NFGTGADIDQQFRRLGELRPNAPQMCSEFWSGWFDKWGARHETRPAKTMVEGIDEMLSKG 297
Query: 277 GTFQNYYMYHGGTNFGRTSGG--PYL---TTSYDYDAPIDEYGHLNQPKWGHLR 325
+F + YM HGGT+FG +G P TSYDYDAPI+EYG PK+ LR
Sbjct: 298 ISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYGQAT-PKYWELR 349
>gi|424665378|ref|ZP_18102414.1| hypothetical protein HMPREF1205_01253 [Bacteroides fragilis HMW
616]
gi|404574622|gb|EKA79370.1| hypothetical protein HMPREF1205_01253 [Bacteroides fragilis HMW
616]
Length = 624
Score = 172 bits (437), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 116/324 (35%), Positives = 168/324 (51%), Gaps = 43/324 (13%)
Query: 40 GERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDL 99
GE +LSG +HY R W ++ K GL+ + TYVFWN HE ++DF+G+ +L
Sbjct: 35 GETTPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94
Query: 100 IRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTT 159
+I+ ++G+ VILR GPYVCAEW +GG+P WL N+PG+ E+R N F+ +T
Sbjct: 95 AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGM-EIRRDNTEFL----KYTK 149
Query: 160 LIVDMAKKE--KLFASQGGPIILAQIENEYGNVMSDYGDAG-KSYINWCAKMATSL-DIG 215
+D +E L ++GGPII+ Q ENE+G+ +S D + + ++ AK+ L D G
Sbjct: 150 KYIDRLYQEVGPLQCTKGGPIIMVQCENEFGSYVSQRKDISFEEHRSYNAKIKGQLADAG 209
Query: 216 --VP-------WIM---CQESDAPSPMFTPNNPN------------SPKIWTENWTGWFK 251
VP W+ C P+ + N P + E + GW
Sbjct: 210 FTVPLFTSDGSWLFEGGCVAGALPTANGESDIANLKKVVNQYHGGKGPYMVAEFYPGWLS 269
Query: 252 SWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT--------TS 303
WG P+ +A ++A + Q +F N+YM HGGTNFG TSG Y TS
Sbjct: 270 HWGEPFPQVSASEIARQTEAYLQNNVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDLTS 328
Query: 304 YDYDAPIDEYGHLNQPKWGHLREL 327
YDYDAPI E G + PK+ +R +
Sbjct: 329 YDYDAPISEAGWIT-PKYDSIRSV 351
>gi|422871792|ref|ZP_16918285.1| beta-galactosidase [Streptococcus sanguinis SK1087]
gi|328945306|gb|EGG39459.1| beta-galactosidase [Streptococcus sanguinis SK1087]
Length = 592
Score = 172 bits (437), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 114/338 (33%), Positives = 163/338 (48%), Gaps = 51/338 (15%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +LSG+I Y R P W D + K G + +ETY+ W HEP Q+ G L
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D + K +++ GLY+I+R PY+CAE+++GG P WL P + LR + +F+ ++ +F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSM-RLRVNHPLFLEKVSHF 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ K + QGGPI++ Q+ENEYG+ D K+Y+ A+M + VP
Sbjct: 131 YDWL--FPKLLPYQSDQGGPILMMQVENEYGSYAED-----KAYMRSIAQMMKVRGVTVP 183
Query: 218 -------WIMCQESDA--PSPMFTPNNPNS--------------------PKIWTENWTG 248
WI ES +F N S P + TE W G
Sbjct: 184 LFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF W + +R AEDLA V Q G N ++ GGTNFG SG P +
Sbjct: 244 WFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKL---LKSME 335
TSYD+DAPI E+G + + R H++ LK ME
Sbjct: 302 -TSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELKQME 338
>gi|445495533|ref|ZP_21462577.1| beta-galactosidase Bga [Janthinobacterium sp. HH01]
gi|444791694|gb|ELX13241.1| beta-galactosidase Bga [Janthinobacterium sp. HH01]
Length = 586
Score = 172 bits (437), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 111/317 (35%), Positives = 156/317 (49%), Gaps = 45/317 (14%)
Query: 33 GRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYD 92
G ++G+ +LSG++HY R P +W D + K K GL+ +ETYV WN HEP Q+
Sbjct: 12 GDQFHLNGQPFRVLSGALHYFRVLPELWEDRLLKLKAMGLNTVETYVAWNLHEPAAGQFR 71
Query: 93 FTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMN 152
+ G LDL FI+ + GLYVI+R GP++CAEW +GG P WL P + E+R + ++
Sbjct: 72 YEGGLDLAAFIRLAESLGLYVIVRPGPFICAEWEFGGLPAWLLADPYM-EVRCCYQPYLE 130
Query: 153 EMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSL 212
++ F ++ ++ +GGPI+ Q+ENEYG+ SD + Y+ W ++ L
Sbjct: 131 AVRRFYDDLLPRLLPLQI--QRGGPILAMQVENEYGSYGSD-----QLYLTWLRRLM--L 181
Query: 213 DIGVPWIMCQESDAPSPMFTPN-------------------------NPNSPKIWTENWT 247
D GV ++ A M P+ P + E W
Sbjct: 182 DGGVETLLFTSDGATDHMLKHGTLAQVWKSANFGSRAEEEFAKLREYQPDGPLMCMEFWN 241
Query: 248 GWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------PYL 300
GWF WG R A D A A+ R G N YM+HGGTNFG +G Y
Sbjct: 242 GWFDHWGEPHHTRDAADAADALERIMACGAHV-NVYMFHGGTNFGFMNGANTDLLTRDYQ 300
Query: 301 TT--SYDYDAPIDEYGH 315
T SYDYDAP+DE G
Sbjct: 301 PTVNSYDYDAPLDETGQ 317
>gi|431758215|ref|ZP_19546843.1| beta-galactosidase [Enterococcus faecium E3083]
gi|430617878|gb|ELB54742.1| beta-galactosidase [Enterococcus faecium E3083]
Length = 595
Score = 172 bits (437), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 117/357 (32%), Positives = 174/357 (48%), Gaps = 52/357 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG ++SG+IHY R P W + K G + +ETY+ WN HEP +DF+G
Sbjct: 12 VDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGFK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
+++RF+K Q+ L VILR Y+CAEW +GG P WL P I +R+T+ FM +++N+
Sbjct: 72 NVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNI-RVRSTDPRFMEKLKNY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG KSY+ ++ + I +P
Sbjct: 131 YQVL--LPKLAPLQITQGGPVIMMQLENEYGS----YG-MEKSYLRQTKELMLAHSIDIP 183
Query: 218 -------WIMC---------------------QESDAPSPMFTPNN-PNSPKIWTENWTG 248
W+ +E+ F N+ N P + E W G
Sbjct: 184 LFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF WG R E+LA V + G N YM+HGGTNFG +G P +
Sbjct: 244 WFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSM----EKTLTYGNVTNTDYGNSVS 353
TSYDYDA ++E G + + R + ++ S+ +T T N+ SVS
Sbjct: 302 -TSYDYDALLNEAGQPTEKYYAVQRVIKEVCPSVWQAEPRTKTLKNLGTYPVNRSVS 357
>gi|191637109|ref|YP_001986275.1| beta-galactosidase 3 [Lactobacillus casei BL23]
gi|385818812|ref|YP_005855199.1| galactosidase, beta 1-like protein [Lactobacillus casei LC2W]
gi|385821988|ref|YP_005858330.1| galactosidase, beta 1-like protein [Lactobacillus casei BD-II]
gi|409995961|ref|YP_006750362.1| beta-galactosidase 17 [Lactobacillus casei W56]
gi|190711411|emb|CAQ65417.1| Beta-galactosidase 3 [Lactobacillus casei BL23]
gi|327381139|gb|AEA52615.1| galactosidase, beta 1-like protein [Lactobacillus casei LC2W]
gi|327384315|gb|AEA55789.1| galactosidase, beta 1-like protein [Lactobacillus casei BD-II]
gi|406356973|emb|CCK21243.1| Beta-galactosidase 17 [Lactobacillus casei W56]
Length = 598
Score = 172 bits (437), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 116/354 (32%), Positives = 170/354 (48%), Gaps = 64/354 (18%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + H+ +DG+ +LSG+IHY R P W + K G + +ETYV WN HE
Sbjct: 4 FSIDHE---FMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEY 60
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+DF+G LD+ RF+ T +D GLY I+R PY+CAEW +GGFP WL + LRT
Sbjct: 61 NEGDFDFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL--LTKKMRLRTD 118
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
+ ++ + + T ++ ++ + GG +I+ Q+ENEYG+ D K Y+ A
Sbjct: 119 DPAYLQAIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSYGED-----KDYLAAVA 171
Query: 207 KMATSLDIGVPWIMCQESDAPSP------------MFTPNNPNS---------------- 238
++ + VP SD P P + T N S
Sbjct: 172 ELMKKHGVDVPLFT---SDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAH 228
Query: 239 ----PKIWTENWTGWFKSWGG----KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTN 290
P + E W GWF WG +DP+ TAEDL + R G+ N YM+HGGTN
Sbjct: 229 GHDWPLMCMEFWDGWFNRWGEPIIRRDPEETAEDLRAVIQR-----GSV-NLYMFHGGTN 282
Query: 291 FGRTSGGPYL-------TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKT 337
FG +G TSYDYDAP++E G+ + + +H++L S +T
Sbjct: 283 FGFMNGTSARKDHDLPQVTSYDYDAPLNEQGNPTPKYFTIQKMIHEVLPSQAQT 336
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 86/200 (43%), Gaps = 33/200 (16%)
Query: 436 SDYLWYMTNADLKDDDPILSGSSNMT---LRINSSGQVLHAYVNGNYVDSQWTKYGASND 492
++L T L +P++SG+ T LR+ + + A+ +G + +Q+ + +D
Sbjct: 376 QEFLGQYTGYTLYRTNPLISGTDKGTPAKLRVIDARDRVQAFFDGKSLATQYQE-AIGDD 434
Query: 493 LFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPN--GIPGPVLLVGRAGDETIIKDLS 550
+ V+ G++Q+ LL + NYGSK + + GI V++ D IKD
Sbjct: 435 ILLPEVE---GRHQLDLLVENMSRVNYGSKIEAITQFKGIRTGVMV-----DLHFIKDYL 486
Query: 551 SHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLN 610
Y LD K A + W + +Y+ F+ D L+
Sbjct: 487 Q---------YPLDLNK---APQLDFTGDWQAGTP------AFYQYGFDVVKPQD-TYLD 527
Query: 611 LQGMGKGFAWVNGYNLGRYW 630
+G GKG VNG N+GR+W
Sbjct: 528 CRGFGKGVMLVNGVNIGRFW 547
>gi|53715536|ref|YP_101528.1| beta-galactosidase [Bacteroides fragilis YCH46]
gi|60683489|ref|YP_213633.1| beta-galactosidase [Bacteroides fragilis NCTC 9343]
gi|375360299|ref|YP_005113071.1| putative beta-galactosidase [Bacteroides fragilis 638R]
gi|423280737|ref|ZP_17259649.1| hypothetical protein HMPREF1203_03866 [Bacteroides fragilis HMW
610]
gi|52218401|dbj|BAD50994.1| beta-galactosidase precursor [Bacteroides fragilis YCH46]
gi|60494923|emb|CAH09735.1| putative beta-galactosidase [Bacteroides fragilis NCTC 9343]
gi|301164980|emb|CBW24544.1| putative beta-galactosidase [Bacteroides fragilis 638R]
gi|404583944|gb|EKA88617.1| hypothetical protein HMPREF1203_03866 [Bacteroides fragilis HMW
610]
Length = 624
Score = 172 bits (437), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 116/324 (35%), Positives = 168/324 (51%), Gaps = 43/324 (13%)
Query: 40 GERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDL 99
GE +LSG +HY R W ++ K GL+ + TYVFWN HE ++DF+G+ +L
Sbjct: 35 GETTPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94
Query: 100 IRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTT 159
+I+ ++G+ VILR GPYVCAEW +GG+P WL N+PG+ E+R N F+ +T
Sbjct: 95 AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGM-EIRRDNTEFL----KYTK 149
Query: 160 LIVDMAKKE--KLFASQGGPIILAQIENEYGNVMSDYGDAG-KSYINWCAKMATSL-DIG 215
+D +E L ++GGPII+ Q ENE+G+ +S D + + ++ AK+ L D G
Sbjct: 150 KYIDRLYQEVGPLQCTKGGPIIMVQCENEFGSYVSQRKDISFEEHRSYNAKIKGQLADAG 209
Query: 216 --VP-------WIM---CQESDAPSPMFTPNNPN------------SPKIWTENWTGWFK 251
VP W+ C P+ + N P + E + GW
Sbjct: 210 FTVPLFTSDGSWLFEGGCVAGALPTANGESDIANLKKVVNQYHGGKGPYMVAEFYPGWLS 269
Query: 252 SWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT--------TS 303
WG P+ +A ++A + Q +F N+YM HGGTNFG TSG Y TS
Sbjct: 270 HWGEPFPQVSASEIARQTEAYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDLTS 328
Query: 304 YDYDAPIDEYGHLNQPKWGHLREL 327
YDYDAPI E G + PK+ +R +
Sbjct: 329 YDYDAPISEAGWIT-PKYDSIRSV 351
>gi|423260402|ref|ZP_17241324.1| hypothetical protein HMPREF1055_03601 [Bacteroides fragilis
CL07T00C01]
gi|423266536|ref|ZP_17245538.1| hypothetical protein HMPREF1056_03225 [Bacteroides fragilis
CL07T12C05]
gi|387774956|gb|EIK37065.1| hypothetical protein HMPREF1055_03601 [Bacteroides fragilis
CL07T00C01]
gi|392699768|gb|EIY92937.1| hypothetical protein HMPREF1056_03225 [Bacteroides fragilis
CL07T12C05]
Length = 624
Score = 172 bits (437), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 116/324 (35%), Positives = 168/324 (51%), Gaps = 43/324 (13%)
Query: 40 GERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDL 99
GE +LSG +HY R W ++ K GL+ + TYVFWN HE ++DF+G+ +L
Sbjct: 35 GETTPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94
Query: 100 IRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTT 159
+I+ ++G+ VILR GPYVCAEW +GG+P WL N+PG+ E+R N F+ +T
Sbjct: 95 AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGM-EIRRDNTEFL----KYTK 149
Query: 160 LIVDMAKKE--KLFASQGGPIILAQIENEYGNVMSDYGDAG-KSYINWCAKMATSL-DIG 215
+D +E L ++GGPII+ Q ENE+G+ +S D + + ++ AK+ L D G
Sbjct: 150 KYIDRLYQEVGPLQCTKGGPIIMVQCENEFGSYVSQRKDISFEEHRSYNAKIKGQLADAG 209
Query: 216 --VP-------WIM---CQESDAPSPMFTPNNPN------------SPKIWTENWTGWFK 251
VP W+ C P+ + N P + E + GW
Sbjct: 210 FTVPLFTSDGSWLFEGGCVAGALPTANGESDIANLKKVVNQYHGGKGPYMVAEFYPGWLS 269
Query: 252 SWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT--------TS 303
WG P+ +A ++A + Q +F N+YM HGGTNFG TSG Y TS
Sbjct: 270 HWGEPFPQVSASEIARQTEAYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDLTS 328
Query: 304 YDYDAPIDEYGHLNQPKWGHLREL 327
YDYDAPI E G + PK+ +R +
Sbjct: 329 YDYDAPISEAGWIT-PKYDSIRSV 351
>gi|440800373|gb|ELR21412.1| lysosomal betagalactosidase, partial [Acanthamoeba castellanii str.
Neff]
Length = 604
Score = 172 bits (437), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 107/310 (34%), Positives = 150/310 (48%), Gaps = 34/310 (10%)
Query: 39 DGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLD 98
DG+ ++SGSIHY RS P WP ++ + GL+ + TYV WN HEP QYDF+G LD
Sbjct: 36 DGQEFRIVSGSIHYFRSLPEQWPARLRTLRSCGLNTVTTYVPWNLHEPTPGQYDFSGRLD 95
Query: 99 LIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFT 158
++RFI+ Q +G VI+R PY+CAE +GG P WL N G+ +LR ++ ++ + +F
Sbjct: 96 IVRFIEAAQQEGFLVIVRPPPYICAELEFGGLPAWLLNEEGL-QLRCSDPKYLKRVDSFL 154
Query: 159 TLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSD--------------------YGDAG 198
+ M + S+GGPII Q+ENEYG+ +D + G
Sbjct: 155 DHFLPMLATYQY--SRGGPIIAMQVENEYGSYGNDHLYLRHLELKFRQHQIDAILFSSNG 212
Query: 199 KSYINWCAKMATSLDIGVPWIMCQESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGKDP 258
+ SL V + + + + P+ P TE W GWF WG +
Sbjct: 213 AGDQMFVGGALPSLLRTVNFGTGADVEGNLKVLRKYQPSGPLFVTEFWDGWFDHWGEEHH 272
Query: 259 KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PY--LTTSYDYDA 308
T + + N YM GGTNFG T+G PY TTSYDYDA
Sbjct: 273 TTTPTQSMKTLEAILSNNASV-NLYMAFGGTNFGFTNGANKGYGETDPYQPTTTSYDYDA 331
Query: 309 PIDEYGHLNQ 318
P++E G Q
Sbjct: 332 PVNESGDATQ 341
>gi|422852902|ref|ZP_16899566.1| beta-galactosidase [Streptococcus sanguinis SK160]
gi|325697836|gb|EGD39720.1| beta-galactosidase [Streptococcus sanguinis SK160]
Length = 592
Score = 172 bits (437), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 110/336 (32%), Positives = 162/336 (48%), Gaps = 48/336 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +LSG+I Y R P W D + K G + +ETY+ W HEP Q+ G L
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFKAEGML 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D + K +++ GLY+I+R PY+CAE+++GG P WL P + LR + +F+ ++ +F
Sbjct: 72 DFEAYFKLVKETGLYLIVRPTPYICAEFDFGGLPAWLLRYPSM-RLRVNHPLFLEKVSHF 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ K + QGGPI++ Q+ENEYG+ D K+Y+ A+M + +P
Sbjct: 131 YDWL--FPKLLPYQSDQGGPILMMQVENEYGSYAED-----KAYMRSIAQMMKVRGVTIP 183
Query: 218 -------WIMCQESDA--PSPMFTPNNPNS--------------------PKIWTENWTG 248
WI ES +F N S P + TE W G
Sbjct: 184 LFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF W + +R AEDLA V Q G N ++ GGTNFG SG P +
Sbjct: 244 WFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
TSYD+DAPI E+G + + R H++ +E+
Sbjct: 302 -TSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELEQ 336
>gi|18410234|ref|NP_565051.1| beta-galactosidase 17 [Arabidopsis thaliana]
gi|75163694|sp|Q93Z24.1|BGL17_ARATH RecName: Full=Beta-galactosidase 17; Short=Lactase 17; Flags:
Precursor
gi|16648842|gb|AAL25611.1| At1g72990/F3N23_19 [Arabidopsis thaliana]
gi|22655360|gb|AAM98272.1| At1g72990/F3N23_19 [Arabidopsis thaliana]
gi|332197279|gb|AEE35400.1| beta-galactosidase 17 [Arabidopsis thaliana]
Length = 697
Score = 172 bits (437), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 120/352 (34%), Positives = 172/352 (48%), Gaps = 49/352 (13%)
Query: 39 DGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLD 98
DG R ++ G +HY R P W D + +A GL+ I+ YV WN HEP + F G D
Sbjct: 73 DGNRFQIIGGDLHYFRVLPEYWEDRLLRANALGLNTIQVYVPWNLHEPKPGKMVFEGIGD 132
Query: 99 LIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFT 158
L+ F+K + V+LR GPY+C EW+ GGFP WL + +LRT++ V++ ++ +
Sbjct: 133 LVSFLKLCEKLDFLVMLRAGPYICGEWDLGGFPAWLLAVKPRLQLRTSDPVYLKLVERWW 192
Query: 159 TLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMA--------- 209
++ + K L S GGP+I+ QIENEYG+ +D K+Y+ MA
Sbjct: 193 DVL--LPKVFPLLYSNGGPVIMVQIENEYGSYGND-----KAYLRKLVSMARGHLGDDII 245
Query: 210 ---------TSLDIG-VPW------IMCQESDAPSPMFTP----NNP-NSPKIWTENWTG 248
+LD G VP + D P P+F N P SP + +E +TG
Sbjct: 246 VYTTDGGTKETLDKGTVPVADVYSAVDFSTGDDPWPIFKLQKKFNAPGRSPPLSSEFYTG 305
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG----------P 298
W WG K K AE A ++ + G+ YM HGGTNFG +G P
Sbjct: 306 WLTHWGEKITKTDAEFTAASLEKILSRNGS-AVLYMVHGGTNFGFYNGANTGSEESDYKP 364
Query: 299 YLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN 350
LT SYDYDAPI E G ++ PK+ L+ + K + ++ N YG+
Sbjct: 365 DLT-SYDYDAPIKESGDIDNPKFQALQRVIKKYNASPHPISPSNKQRKAYGS 415
>gi|281337336|gb|EFB12920.1| hypothetical protein PANDA_005061 [Ailuropoda melanoleuca]
Length = 655
Score = 172 bits (436), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 122/362 (33%), Positives = 173/362 (47%), Gaps = 49/362 (13%)
Query: 36 ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTG 95
T+ G + ++ GSIHY R W D + K K G + + TYV WN HEP R ++DF+
Sbjct: 76 FTLGGHKFLIFGGSIHYFRVPREYWRDRLMKLKACGFNTLTTYVPWNLHEPERGKFDFSE 135
Query: 96 NLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQ 155
NLDL F+ + GL+VILR GPY+C+E + GG P WL P + LRTT K F+ +
Sbjct: 136 NLDLEAFVLMAAEIGLWVILRPGPYICSEIDLGGLPSWLLQDPEM-ILRTTYKGFVEAVD 194
Query: 156 NFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIG 215
+ ++ ++ L +GGPII Q+ENEYG+ D K Y+ + K L+ G
Sbjct: 195 KYFDHLI--SRVVPLQYHKGGPIIAVQVENEYGSFAVD-----KDYMPYVRK--ALLERG 245
Query: 216 VPWIMCQESDAPS-----------------------PMFTPNNPNSPKIWTENWTGWFKS 252
+ ++ DA + + N P + E W GWF +
Sbjct: 246 IVELLVTSDDAENLQKGYLEGVLATINMNTFEKSAFEQLSQLQRNKPIMVMEYWVGWFDT 305
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY------LTTS--- 303
WGGK AED+ V++F +F N YM+HGGTNFG +G Y + TS
Sbjct: 306 WGGKHMVNNAEDVEETVSKFITSEISF-NVYMFHGGTNFGFMNGATYFGIHRAVVTSYGK 364
Query: 304 ---YDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLP 360
YDYDA + E G + K+ L+ L + + +M Y SV S Y LP
Sbjct: 365 CLLYDYDALLTEAGDYTK-KYFKLQRLFRSVLAMPLPPLPELTPKAKY-PSVKPSLY-LP 421
Query: 361 AW 362
W
Sbjct: 422 LW 423
>gi|1669595|dbj|BAA13685.1| AR782 [Arabidopsis thaliana]
Length = 206
Score = 172 bits (436), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 96/207 (46%), Positives = 121/207 (58%), Gaps = 28/207 (13%)
Query: 613 GMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPR 672
G GKG AWVNG ++GRYWPT +A GC TESCDYRG Y ++KC NCG PSQ YHVPR
Sbjct: 3 GTGKGIAWVNGQSIGRYWPTSIAGNGGC-TESCDYRGSYRANKCLKNCGKPSQTLYHVPR 61
Query: 673 SWIKDGVNTLVLFEEFGGNPSQINFQTVVVGT----ACGQAH---------------ENK 713
SW+K N LVLFEE GG+P+QI+F T G+ Q+H N+
Sbjct: 62 SWLKPSGNILVLFEEMGGDPTQISFATKQTGSNLCLTVSQSHPPPVDTWTSDSKISNRNR 121
Query: 714 T---MELTC--HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCS 768
T + L C + I IK+ASFG P+G CG+F +G C + L L++K C+G +SC+
Sbjct: 122 TRPVLSLKCPISTQVIFSIKFASFGTPKGTCGSFTQGHCNSSRS-LSLVQKACIGLRSCN 180
Query: 769 IEASEANLGATSCAAGTVKRLVVEALC 795
+E S G G VK L VEA C
Sbjct: 181 VEVSTRVFGEP--CRGVVKSLAVEASC 205
>gi|354466872|ref|XP_003495895.1| PREDICTED: beta-galactosidase-1-like protein 3-like [Cricetulus
griseus]
Length = 761
Score = 172 bits (436), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 111/318 (34%), Positives = 168/318 (52%), Gaps = 35/318 (11%)
Query: 36 ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTG 95
T+DG + +++ GSIHY R W D + K + G + + TY+ WN HE R +DF+
Sbjct: 186 FTLDGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQNRGTFDFSE 245
Query: 96 NLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQ 155
LDL ++ GL+VILR GPY+CAE + GG P WL P + +LRTT + F++ +
Sbjct: 246 ILDLEAYVSLAATLGLWVILRPGPYICAEVDLGGLPSWLLGYPEL-QLRTTQQEFLDAVD 304
Query: 156 N-FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDI 214
F LI + + L +GGP+I QIENEYG+ D GD YI + +++
Sbjct: 305 KYFDHLIPRILPLQYL---RGGPVIAVQIENEYGSFSKD-GDY-MEYIKEALQKRGIVEL 359
Query: 215 GVPW-------------------IMCQESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGG 255
+ + E D+ + N + P + E WTGWF +WG
Sbjct: 360 LLTSDNHKGIQTGSVKGALTTINMASFEKDSFIKLLQMQN-DKPIMVMEYWTGWFDTWGR 418
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY------LTTSYDYDAP 309
+ ++AE++ + V+RF ++G +F N YM+HGGTNFG +G + + TSYDYDA
Sbjct: 419 EHNVKSAEEIRYTVSRFIKYGISF-NMYMFHGGTNFGFINGAFHYDKHSSVVTSYDYDAV 477
Query: 310 IDEYGHLNQPKWGHLREL 327
+ E G + K+ LR+L
Sbjct: 478 LTEAGDYTE-KYFKLRKL 494
>gi|237734327|ref|ZP_04564808.1| beta-galactosidase [Mollicutes bacterium D7]
gi|365831197|ref|ZP_09372750.1| hypothetical protein HMPREF1021_01514 [Coprobacillus sp. 3_3_56FAA]
gi|374624872|ref|ZP_09697289.1| hypothetical protein HMPREF0978_00609 [Coprobacillus sp.
8_2_54BFAA]
gi|229382557|gb|EEO32648.1| beta-galactosidase [Coprobacillus sp. D7]
gi|365262188|gb|EHM92085.1| hypothetical protein HMPREF1021_01514 [Coprobacillus sp. 3_3_56FAA]
gi|373916155|gb|EHQ47903.1| hypothetical protein HMPREF0978_00609 [Coprobacillus sp.
8_2_54BFAA]
Length = 584
Score = 172 bits (436), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 112/350 (32%), Positives = 165/350 (47%), Gaps = 51/350 (14%)
Query: 34 RAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDF 93
+ I+G + ++SG++HY R P W D + K G + +ETYV WN HEP + +YDF
Sbjct: 8 KEFFINGNKVKIISGAVHYFRIVPEYWRDTLLDLKAMGCNTVETYVPWNLHEPYQGKYDF 67
Query: 94 TGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNE 153
+G D+ F+K ++ L+VILR PY+CAEW GG P WL P I LRT +K ++
Sbjct: 68 SGIKDIETFLKLAEELELFVILRASPYICAEWEMGGLPAWLLKYPRI-RLRTNDKQYLKC 126
Query: 154 MQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLD 213
+ + +++ + K K +Q GPIILAQ+ENEYG+ D K Y+ +M
Sbjct: 127 LDQYFSIL--LPKLSKYQITQNGPIILAQLENEYGSYGED-----KEYLLAVYQMMRKYG 179
Query: 214 IGVPWI--------------MCQESDAPSPMF---------------TPNNPNSPKIWTE 244
I VP + ++ P+ F + +P + E
Sbjct: 180 IEVPLFTADGTWHEALNAGSLLEKKVFPTGNFGSQAKENITVLKKFMESHQITAPLMCME 239
Query: 245 NWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGG 297
W GWF W + KR ++ + G N+YM+ GGTNFG R
Sbjct: 240 FWDGWFNRWNQEIIKRDPQEFVNSAQEMLSLGSV--NFYMFQGGTNFGWMNGCSARKEHD 297
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLREL----HKLLKSMEKTLTYGNV 343
TSYDYDA + EYG + K+ LRE+ + L +T YG +
Sbjct: 298 LPQITSYDYDAILTEYGAKTE-KYHLLREVITGKKERLPERRQTKNYGQI 346
>gi|119962102|ref|YP_948531.1| beta-galactosidase [Arthrobacter aurescens TC1]
gi|119948961|gb|ABM07872.1| beta-galactosidase [Arthrobacter aurescens TC1]
Length = 598
Score = 172 bits (436), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 116/319 (36%), Positives = 157/319 (49%), Gaps = 43/319 (13%)
Query: 40 GERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQY-DFTGNLD 98
GE +L+G+IHY R P +W D +++ K G + ++TYV WN H+P R + DF+G D
Sbjct: 17 GEPYRILAGAIHYFRVHPDLWQDRLRRLKAMGANTVDTYVAWNFHQPKRDEAPDFSGWRD 76
Query: 99 LIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQN-F 157
L RF+ ++GL VI+R GPY+CAEW+ GGFP L +PGI LR + VF ++ F
Sbjct: 77 LGRFMDLAAEEGLDVIVRPGPYICAEWDNGGFPSCLTGIPGI-GLRCMDPVFTAAIEEWF 135
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW---------CAKM 208
L+ +A ++ S GGP++ QIENEYG+ YGD YI W ++
Sbjct: 136 DHLLPIVASRQ---TSAGGPVVAVQIENEYGS----YGDD-HEYIRWNRRALEERGITEL 187
Query: 209 ATSLDIGVPWI--------------MCQESDAPSPMFTPNNPNSPKIWTENWTGWFKSWG 254
+ D G + + D + P P E W GWF WG
Sbjct: 188 LFTADGGTDYFLDGGAVEGTWATATLGSRGDEAVATWQRRRPGEPFFNVEFWGGWFDHWG 247
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY-------LTTSYDYD 307
R AED A + GG+ YM HGGTNFG SG + TSYD D
Sbjct: 248 EHHHGRDAEDAALEARKMLDLGGSL-CAYMAHGGTNFGLRSGSNHDGTMLQPTVTSYDSD 306
Query: 308 APIDEYGHLNQPKWGHLRE 326
API E G L PK+ R+
Sbjct: 307 APIAENGALT-PKFHAFRK 324
>gi|336063700|ref|YP_004558559.1| beta-galactosidase [Streptococcus pasteurianus ATCC 43144]
gi|334281900|dbj|BAK29473.1| beta-galactosidase precursor [Streptococcus pasteurianus ATCC
43144]
Length = 595
Score = 172 bits (436), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 112/318 (35%), Positives = 154/318 (48%), Gaps = 47/318 (14%)
Query: 34 RAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDF 93
+ +DG+ +LSGSIHY R P W + K G + +ETYV WN HEP ++DF
Sbjct: 8 ESFFLDGKPFKILSGSIHYFRIHPDDWYQSLYNLKALGFNTVETYVPWNLHEPREGEFDF 67
Query: 94 TGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNE 153
TG LDL RF+ Q+ GLY I+R PY+CAEW +GG P WL G+ +R+ +K F+
Sbjct: 68 TGILDLERFLTIAQELGLYAIVRPSPYICAEWEFGGLPAWLLE-KGV-RVRSQDKDFLQV 125
Query: 154 MQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLD 213
++ + ++ K +L QGG I++ Q+ENEYG+ D K Y+ +M L
Sbjct: 126 VKRYYEALIPRLIKHQL--DQGGNILMFQVENEYGSYGED-----KVYLRELKQMMLELG 178
Query: 214 IGVPWIM----------------------------CQESDAPSPMFTPNNPNS-PKIWTE 244
+ P+ +E+ A MF P + E
Sbjct: 179 LEEPFFTSDGPWHTALRAGSLIEDDVLVTGNFGSKAKENFASMEMFFQQYGKKWPLMCME 238
Query: 245 NWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL---- 300
W GWF WG KR E+LA AV + G N YM+HGGTNFG +G
Sbjct: 239 FWDGWFNRWGEPVIKRDPEELADAVMEAIEIGSI--NLYMFHGGTNFGFMNGCSARKQTD 296
Query: 301 ---TTSYDYDAPIDEYGH 315
TSYDYDA +DE G+
Sbjct: 297 LPQVTSYDYDAILDEAGN 314
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 51/207 (24%), Positives = 98/207 (47%), Gaps = 41/207 (19%)
Query: 428 DQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMT--LRINSSGQVLHAYVNGNYVDSQWT 485
+ ++ N + Y++Y T L+G T +R+ + ++NGN++ +Q+
Sbjct: 375 NMEALNQSTGYIFYRTK---------LNGYQGNTEKVRLIDTRDRAQVFLNGNHIVTQYQ 425
Query: 486 KYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETI 545
+ +D+ V T ++Q+ +L +G NYG K P+ G +GR +
Sbjct: 426 E-EIGDDI---QVNFTSEESQLDILVENMGRVNYGHKL-TAPSQHKG----IGRG----V 472
Query: 546 IKDLS-SHKW-TYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLE 603
+ DL ++W TY + + + + K+ + W + VP ++Y+ F L
Sbjct: 473 MLDLHFVNQWETYPLSMNSIKNLKYSSP--------WR-EGVP-----SFYEFKFHC-LN 517
Query: 604 NDPVVLNLQGMGKGFAWVNGYNLGRYW 630
+ +++ G GKG A++NGYNLGR+W
Sbjct: 518 PEDTYMDMSGFGKGVAFINGYNLGRFW 544
>gi|417988603|ref|ZP_12629136.1| beta-galactosidase 3 [Lactobacillus casei A2-362]
gi|417997907|ref|ZP_12638140.1| beta-galactosidase 3 [Lactobacillus casei T71499]
gi|418015108|ref|ZP_12654689.1| beta-galactosidase 3 [Lactobacillus casei Lpc-37]
gi|410541233|gb|EKQ15720.1| beta-galactosidase 3 [Lactobacillus casei A2-362]
gi|410542248|gb|EKQ16704.1| beta-galactosidase 3 [Lactobacillus casei T71499]
gi|410552187|gb|EKQ26219.1| beta-galactosidase 3 [Lactobacillus casei Lpc-37]
Length = 598
Score = 172 bits (436), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 115/354 (32%), Positives = 170/354 (48%), Gaps = 64/354 (18%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + H+ +DG+ +LSG+IHY R P W + K G + +ETYV WN HE
Sbjct: 4 FSIDHE---FMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEY 60
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+DF+G LD+ RF+ T +D GLY I+R PY+CAEW +GGFP WL + LRT
Sbjct: 61 SEGDFDFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL--LTKKMRLRTD 118
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
+ ++ + + T ++ ++ + GG +I+ Q+ENEYG+ D K Y+ A
Sbjct: 119 DPAYLQAIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSYGED-----KDYLAAVA 171
Query: 207 KMATSLDIGVPWIMCQESDAPSP------------MFTPNNPNS---------------- 238
++ + VP SD P P + T N S
Sbjct: 172 ELMKKHGVDVPLFT---SDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAH 228
Query: 239 ----PKIWTENWTGWFKSWGG----KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTN 290
P + E W GWF WG +DP+ TAE+L + R G+ N YM+HGGTN
Sbjct: 229 GHDWPLMCMEFWDGWFNRWGEPIIRRDPEETAENLRAVIQR-----GSV-NLYMFHGGTN 282
Query: 291 FGRTSGGPYL-------TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKT 337
FG +G TSYDYDAP++E G+ + + +H++L S +T
Sbjct: 283 FGFMNGTSARKDHDLPQVTSYDYDAPLNEQGNPTPKYFAIQKMIHEVLPSQAQT 336
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 86/200 (43%), Gaps = 33/200 (16%)
Query: 436 SDYLWYMTNADLKDDDPILSGSSNMT---LRINSSGQVLHAYVNGNYVDSQWTKYGASND 492
++L T L +P++SG+ T LR+ + + A+ +G + +Q+ + +D
Sbjct: 376 QEFLGQYTGYTLYRTNPLISGTDKGTPAKLRVIDARDRVQAFFDGKSLATQYQE-AIGDD 434
Query: 493 LFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPN--GIPGPVLLVGRAGDETIIKDLS 550
+ V+ G++Q+ LL + NYGSK + + GI V++ D IKD
Sbjct: 435 ILLPEVE---GRHQLDLLVENMSRVNYGSKIEAITQFKGIRTGVMV-----DLHFIKDYL 486
Query: 551 SHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLN 610
Y LD K A + W + +Y+ F+ D L+
Sbjct: 487 Q---------YPLDLNK---APQLDFTGDWQAGTP------AFYQYGFDVVKPQD-TYLD 527
Query: 611 LQGMGKGFAWVNGYNLGRYW 630
+G GKG VNG N+GR+W
Sbjct: 528 CRGFGKGVMLVNGVNIGRFW 547
>gi|146318103|ref|YP_001197815.1| beta-galactosidase [Streptococcus suis 05ZYH33]
gi|146320284|ref|YP_001199995.1| Beta-galactosidase [Streptococcus suis 98HAH33]
gi|253751293|ref|YP_003024434.1| beta-galactosidase precursor [Streptococcus suis SC84]
gi|253753194|ref|YP_003026334.1| beta-galactosidase precursor [Streptococcus suis P1/7]
gi|386577401|ref|YP_006073806.1| beta-galactosidase [Streptococcus suis GZ1]
gi|386579383|ref|YP_006075788.1| beta-galactosidase [Streptococcus suis JS14]
gi|386581447|ref|YP_006077851.1| beta-galactosidase [Streptococcus suis SS12]
gi|386587678|ref|YP_006084079.1| beta-galactosidase [Streptococcus suis A7]
gi|403061087|ref|YP_006649303.1| beta-galactosidase [Streptococcus suis S735]
gi|145688909|gb|ABP89415.1| Beta-galactosidase [Streptococcus suis 05ZYH33]
gi|145691090|gb|ABP91595.1| Beta-galactosidase [Streptococcus suis 98HAH33]
gi|251815582|emb|CAZ51165.1| putative beta-galactosidase precursor [Streptococcus suis SC84]
gi|251819439|emb|CAR44926.1| putative beta-galactosidase precursor [Streptococcus suis P1/7]
gi|292557863|gb|ADE30864.1| Beta-galactosidase [Streptococcus suis GZ1]
gi|319757575|gb|ADV69517.1| Beta-galactosidase [Streptococcus suis JS14]
gi|353733593|gb|AER14603.1| Beta-galactosidase [Streptococcus suis SS12]
gi|354984839|gb|AER43737.1| Beta-galactosidase [Streptococcus suis A7]
gi|402808413|gb|AFQ99904.1| beta-galactosidase [Streptococcus suis S735]
Length = 590
Score = 172 bits (436), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 173/697 (24%), Positives = 284/697 (40%), Gaps = 154/697 (22%)
Query: 33 GRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYD 92
G +DGE +LSG+IHY R P W + K G + +ETYV WN HEP + ++
Sbjct: 7 GDQFYLDGEPFKILSGAIHYFRVHPDDWYHSLYNLKALGFNTVETYVPWNMHEPRKGEFC 66
Query: 93 FTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMN 152
+ G LD+ RF+K Q+ GLY I+R PY+CAEW +GG P WL M +R+++ V++
Sbjct: 67 YEGILDIERFLKLAQELGLYAIVRPSPYICAEWEWGGLPAWL--MKEELRVRSSDSVYLQ 124
Query: 153 EMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSL 212
+ + ++ K KL +QGG +++ Q+ENEYG+ YG+ K+Y+ A +
Sbjct: 125 HLDEYYVSLI--PKLAKLQLAQGGNVLMFQVENEYGS----YGEE-KAYLRAVAGLMRKH 177
Query: 213 DIGVPWIMCQ-------------ESDA----------------PSPMFTPNNPNSPKIWT 243
+ P E D + F + N P +
Sbjct: 178 GLTAPLFTSDGSWRATLRAGTLIEDDVFVTGNFGSKARENFANMTAFFNEHQKNWPLMCM 237
Query: 244 ENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTS 303
E W GWF WG + +R E++ +V + G N YM+HGG T+
Sbjct: 238 EFWDGWFNRWGDEIIRREPEEMVDSVMECIELGSL--NLYMFHGG-------------TN 282
Query: 304 YDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWS 363
+ + G ++ P+ +Y D + + Y L
Sbjct: 283 FGFMNGCSARGQIDLPQ----------------VTSYDYDAILDEAGNPTKKFYILQQRL 326
Query: 364 VSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFAL 423
+ P+ + E + ++V + +D+ L E ++D V KG +
Sbjct: 327 KEVYPELEYAEPLVKEAKAFSDVSL-------HDKVSLFATL--ENVSDCV---KGFYPK 374
Query: 424 NTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQ 483
N +ST Y+ Y T +L+ D + R+ + + Y +G +V +Q
Sbjct: 375 NMEELDQSTG----YILYRT--ELERDK-----TEAERFRVVDARDRIQIYADGKFVATQ 423
Query: 484 W-TKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGD 542
+ T+ G +L + KLT + +L +G NYG K P G +GR
Sbjct: 424 YQTEIGDDVELDFKDDKLT-----LDILVENMGRVNYGHKL-TAPTQSKG----IGRGA- 472
Query: 543 ETIIKDLS--SHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEA 600
+ DL H TY + L ++D F +GW +Y+ FE
Sbjct: 473 ---MADLHFIGHWETYPLHLESVEDLDF--------SKGWEEGQA------AFYRYQFEL 515
Query: 601 PLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNC 660
E L++ G GKG +VN N+GR+W +GP
Sbjct: 516 D-ELADTYLDMTGFGKGVVFVNNVNIGRFWE----------------KGPI--------- 549
Query: 661 GNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
++ ++P+ ++K G N +V+FE G +I+F
Sbjct: 550 -----LYLYIPKGYLKKGANEIVVFETEGKYREKIHF 581
>gi|402304595|ref|ZP_10823662.1| glycosyl hydrolase family 35 [Prevotella sp. MSX73]
gi|400380871|gb|EJP33679.1| glycosyl hydrolase family 35 [Prevotella sp. MSX73]
Length = 778
Score = 172 bits (436), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 110/321 (34%), Positives = 157/321 (48%), Gaps = 39/321 (12%)
Query: 34 RAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDF 93
+ ++G+ ++ + +HYPR W IK K G++ + YVFWN HE ++DF
Sbjct: 26 KTFLLNGKPFVVKAAELHYPRIPRPYWEHRIKMCKALGMNTVCLYVFWNIHEQQEGKFDF 85
Query: 94 TGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNE 153
TGN D+ F + Q GLYVI+R GPYVCAEW GG P WL I LR + FM
Sbjct: 86 TGNNDVAEFCRLAQRNGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDI-RLREPDPYFMER 144
Query: 154 MQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYI----------- 202
++ F + + + L GGPII+ Q+ENEYG+ YG K+Y+
Sbjct: 145 VKLFERKVGE--QLASLTIQNGGPIIMVQVENEYGS----YG-KNKAYVSAIRDIVRRSG 197
Query: 203 ---------NWCAKMATSLDIGVPWIM----CQESDAPSPMFTPNNPNSPKIWTENWTGW 249
+W + + + W M + D PN+P++ +E W+GW
Sbjct: 198 FDKVTLFQCDWASNFEKNGLDDLVWTMNFGTGADIDQQFRRLGELRPNAPQMCSEFWSGW 257
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYL---TTSY 304
F WG + R A+ + + G +F + YM HGGT+FG +G P TSY
Sbjct: 258 FDKWGARHETRPAKAMVEGIDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSY 316
Query: 305 DYDAPIDEYGHLNQPKWGHLR 325
DYDAPI+EYG PK+ LR
Sbjct: 317 DYDAPINEYGQAT-PKYWELR 336
>gi|417994975|ref|ZP_12635282.1| beta-galactosidase 3 [Lactobacillus casei M36]
gi|410539221|gb|EKQ13758.1| beta-galactosidase 3 [Lactobacillus casei M36]
Length = 598
Score = 172 bits (436), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 115/354 (32%), Positives = 170/354 (48%), Gaps = 64/354 (18%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + H+ +DG+ +LSG+IHY R P W + K G + +ETYV WN HE
Sbjct: 4 FSIDHE---FMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEY 60
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+DF+G LD+ RF+ T +D GLY I+R PY+CAEW +GGFP WL + LRT
Sbjct: 61 SEGDFDFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL--LTKKMRLRTD 118
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
+ ++ + + T ++ ++ + GG +I+ Q+ENEYG+ D K Y+ A
Sbjct: 119 DPAYLQAIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSYGED-----KDYLAAVA 171
Query: 207 KMATSLDIGVPWIMCQESDAPSP------------MFTPNNPNS---------------- 238
++ + VP SD P P + T N S
Sbjct: 172 ELMKKHGVDVPLFT---SDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAH 228
Query: 239 ----PKIWTENWTGWFKSWGG----KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTN 290
P + E W GWF WG +DP+ TAE+L + R G+ N YM+HGGTN
Sbjct: 229 GHDWPLMCMEFWDGWFNRWGEPIIRRDPEETAENLRAVIQR-----GSV-NLYMFHGGTN 282
Query: 291 FGRTSGGPYL-------TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKT 337
FG +G TSYDYDAP++E G+ + + +H++L S +T
Sbjct: 283 FGFMNGTSARKDHDLPQVTSYDYDAPLNEQGNPTPKYFAIQKMIHEVLPSQAQT 336
Score = 45.4 bits (106), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 86/200 (43%), Gaps = 33/200 (16%)
Query: 436 SDYLWYMTNADLKDDDPILSGSSNMT---LRINSSGQVLHAYVNGNYVDSQWTKYGASND 492
++L T L +P++SG+ T LR+ + + A+ +G + +Q+ + +D
Sbjct: 376 QEFLGQYTGYTLYRTNPLISGTDKGTPAKLRVIDARDRVQAFFDGKSLATQYQE-AIGDD 434
Query: 493 LFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPN--GIPGPVLLVGRAGDETIIKDLS 550
+ V+ G++Q+ LL + NYGSK + + GI V++ D IKD
Sbjct: 435 ILLPEVE---GRHQLDLLVENMSRVNYGSKIEAITQFKGIRTGVMV-----DLHFIKDYL 486
Query: 551 SHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLN 610
Y LD K A + W + +Y+ F+ D L+
Sbjct: 487 Q---------YPLDLNK---APQLDFTGDWQAGTP------AFYQYGFDVVKPQD-TYLD 527
Query: 611 LQGMGKGFAWVNGYNLGRYW 630
+G GKG VNG N+GR+W
Sbjct: 528 CRGFGKGVMLVNGVNIGRFW 547
>gi|257866484|ref|ZP_05646137.1| glycosyl hydrolase [Enterococcus casseliflavus EC30]
gi|257873001|ref|ZP_05652654.1| glycosyl hydrolase [Enterococcus casseliflavus EC10]
gi|257800442|gb|EEV29470.1| glycosyl hydrolase [Enterococcus casseliflavus EC30]
gi|257807165|gb|EEV35987.1| glycosyl hydrolase [Enterococcus casseliflavus EC10]
Length = 591
Score = 172 bits (436), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 111/314 (35%), Positives = 151/314 (48%), Gaps = 47/314 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ L+SG+IHY R T W D + K G + +ETY+ WN HEP YDF G
Sbjct: 12 LDGKPIKLISGAIHYFRMTSAQWADSLYNLKALGANTVETYIPWNLHEPREGVYDFEGMK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+ F+K Q GL VILR Y+CAEW +GG P WL N P LR+T+ FM +++N+
Sbjct: 72 DIFAFVKQAQALGLMVILRPSVYICAEWEFGGLPAWLLNEP--MRLRSTDPRFMAKVRNY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L + GGP+I+ Q+ENEYG+ YG K+Y+ ++ I VP
Sbjct: 130 FQVL--LPKLVPLQITHGGPVIMMQVENEYGS----YG-MEKAYLRQTKELMEECGIDVP 182
Query: 218 WIM----------------------------CQESDAPSPMFTPNN-PNSPKIWTENWTG 248
+E+ A F + N P + E W G
Sbjct: 183 LFTSDGAWEEVLDAGTLIEDDVFVTGNFGSRSKENAAVMKEFMAKHGKNWPIMCMEYWDG 242
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-------T 301
WF WG KR +DLA V G N YM+HGGTNFG ++G
Sbjct: 243 WFNRWGEPIIKRDGQDLANEVKEMLAVGSL--NLYMFHGGTNFGFSNGCSARGALDLPQV 300
Query: 302 TSYDYDAPIDEYGH 315
+SYDYDA + E G
Sbjct: 301 SSYDYDALLTEAGE 314
Score = 45.4 bits (106), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 58/237 (24%), Positives = 93/237 (39%), Gaps = 54/237 (22%)
Query: 462 LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFE-RPVKLTRGKNQISLLSATVGLQNYG 520
L++ + LH + +G Q+ + L + P K T ++ +L +G NYG
Sbjct: 401 LKVVEASDRLHIFTDGQLQAIQYQETLGEELLIQGAPDKETI---ELDVLVENLGRVNYG 457
Query: 521 SKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGW 580
K + GP G G I++D+ H+ + L +A+ +
Sbjct: 458 FKLN-------GPTQAKGIRGG--IMQDIHFHQGYHHYPLT-------LSAEQLQAIDYQ 501
Query: 581 SSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGC 640
+ KN ++Y+TTF D + + +G GKG VNG NLGRYW
Sbjct: 502 AGKN---PTHPSFYQTTFTLTEVGDTFI-DCRGYGKGVVIVNGINLGRYWQ--------- 548
Query: 641 STESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
RGP S C P+ ++K G N +V+FE G ++ F
Sbjct: 549 -------RGPVHSLYC--------------PKEFLKKGSNEVVVFETDGVEIKELVF 584
>gi|78048770|ref|YP_364945.1| beta-galactosidase [Xanthomonas campestris pv. vesicatoria str.
85-10]
gi|78037200|emb|CAJ24945.1| beta-galactosidase [Xanthomonas campestris pv. vesicatoria str.
85-10]
Length = 650
Score = 172 bits (435), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 113/326 (34%), Positives = 163/326 (50%), Gaps = 41/326 (12%)
Query: 33 GRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYD 92
G DG+ LLSG+IH+ R W D ++KA+ GL+ +ETYVFWN EP + Q+D
Sbjct: 73 GTQFVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFD 132
Query: 93 FTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMN 152
F+GN D+ F++ QGL VILR GPY CAEW GG+P WL I +R+ + F+
Sbjct: 133 FSGNNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIR-VRSRDPRFLA 191
Query: 153 EMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYG---DAGKSYIN------ 203
Q++ + + + L GGPII Q+ENEYG+ D+ D Y+
Sbjct: 192 ASQSYLDALAK--QVQPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKA 249
Query: 204 --WCAKMATSLDIG-VPWIMCQESDAPSPM------FTPNNPNSPKIWTENWTGWFKSWG 254
+ + A L G +P + + AP P+ P++ E W GWF WG
Sbjct: 250 LLFTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWG 309
Query: 255 ----GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL---------- 300
D ++ AE+ + + + G N YM+ GGT+FG +G +
Sbjct: 310 KPHAATDARQQAEEFEWILRQ-----GHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQ 364
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRE 326
TTSYDYDA +DE GH PK+ +R+
Sbjct: 365 TTSYDYDAILDEAGHPT-PKFALMRD 389
Score = 43.1 bits (100), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 34/136 (25%), Positives = 60/136 (44%), Gaps = 30/136 (22%)
Query: 498 VKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAG--DETIIKDLSSHKW- 554
V++ G++ + +L G NYG++ + GRAG D ++ + W
Sbjct: 494 VEIPAGQHTLDVLVENSGRINYGTR------------MADGRAGLVDPVLLDNQQLTGWQ 541
Query: 555 TYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGM 614
+ + + + +S RGW+ K V + +++ D L+++
Sbjct: 542 AFPLPM-----------RTPDSIRGWTRKAV---QGPAFHRGALRIGTPTD-TYLDMRAF 586
Query: 615 GKGFAWVNGYNLGRYW 630
GKGFAW NG NLGR+W
Sbjct: 587 GKGFAWANGVNLGRHW 602
>gi|420261585|ref|ZP_14764229.1| glycosyl hydrolase [Enterococcus sp. C1]
gi|394771519|gb|EJF51280.1| glycosyl hydrolase [Enterococcus sp. C1]
Length = 591
Score = 172 bits (435), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 112/314 (35%), Positives = 151/314 (48%), Gaps = 47/314 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ L+SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 12 LDGKPIKLISGAIHYFRMTPVQWTDSLYNLKALGANTVETYIPWNLHEPREGVYDFEGMK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+ F+K Q GL VILR Y+CAEW +GG P WL N P LR+T+ FM +++N+
Sbjct: 72 DICAFVKQAQTIGLMVILRPSVYICAEWEFGGLPAWLLNEP--MRLRSTDPRFMAKVRNY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L + GGP+I+ Q+ENEYG+ YG K+Y+ ++ I VP
Sbjct: 130 FQVL--LPKLVPLQITHGGPVIMMQVENEYGS----YG-MEKAYLRQTKELMEEYGIDVP 182
Query: 218 WIM----------------------------CQESDAPSPMFTPNN-PNSPKIWTENWTG 248
+E+ A F + N P + E W G
Sbjct: 183 LFTSDGAWEEVLDAGTLIEDDIFVTGNFGSRSKENAAVMKEFMAKHGKNWPIMCMEYWDG 242
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-------T 301
WF WG KR +DLA V G N YM+HGGTNFG +G
Sbjct: 243 WFNRWGEPIIKRDGQDLANEVKEMLAVGSL--NLYMFHGGTNFGFYNGCSARGALDLPQV 300
Query: 302 TSYDYDAPIDEYGH 315
+SYDYDA + E G
Sbjct: 301 SSYDYDALLTEAGE 314
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 58/237 (24%), Positives = 96/237 (40%), Gaps = 54/237 (22%)
Query: 462 LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFE-RPVKLTRGKNQISLLSATVGLQNYG 520
L++ + LH + +G Q+ + L + P K T ++ +L +G NYG
Sbjct: 401 LKVVEASDRLHIFTDGQLQAIQYQETLGEELLIQGTPDKETI---ELDVLVENLGRVNYG 457
Query: 521 SKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGW 580
K + GP G G I++D+ H+ Y+ L ++ +A + + G
Sbjct: 458 FKLN-------GPTQAKGIRGG--IMQDIHFHQ-GYRHYPLTLSAEQL---QAIDYQAGK 504
Query: 581 SSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGC 640
+ + ++Y+TTF D ++ +G GKG VNG NLGRYW
Sbjct: 505 NPTHP------SFYQTTFTLTEVGD-TFIDCRGYGKGVVIVNGINLGRYWQ--------- 548
Query: 641 STESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
RGP S C P+ ++K G N +V+FE G ++ F
Sbjct: 549 -------RGPVHSLYC--------------PKEFLKKGSNEVVVFETDGVEIKELVF 584
>gi|217070908|gb|ACJ83814.1| unknown [Medicago truncatula]
Length = 200
Score = 172 bits (435), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 92/203 (45%), Positives = 119/203 (58%), Gaps = 25/203 (12%)
Query: 614 MGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRS 673
MGKG AWVNG ++GRYWPTY+A GC T+SC+YRGPY S KC NCG PSQ YHVPRS
Sbjct: 1 MGKGEAWVNGQSIGRYWPTYVASNAGC-TDSCNYRGPYTSSKCRKNCGKPSQTLYHVPRS 59
Query: 674 WIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHENK-------------------T 714
++K NTLVLFEE GG+P+QI+F T + + C ++
Sbjct: 60 FLKPNGNTLVLFEENGGDPTQISFATKQLESVCSHVSDSHPPQIDLWNQDTESGGKVGPA 119
Query: 715 MELTC--HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEAS 772
+ L+C H + IS IK+AS+G P G CG F +G C + L +++K C+G +SCS+ S
Sbjct: 120 LLLSCPNHNQVISSIKFASYGTPLGTCGNFYRGRCSSN-KALSIVKKACIGSRSCSVGVS 178
Query: 773 EANLGATSCAAGTVKRLVVEALC 795
G G K L VEA C
Sbjct: 179 TDTFGDP--CRGVPKSLAVEATC 199
>gi|255652865|ref|NP_001157373.1| beta-galactosidase [Bombyx mori]
gi|239938036|gb|ACS36117.1| beta-galactosidase [Bombyx mori]
Length = 606
Score = 172 bits (435), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 116/343 (33%), Positives = 171/343 (49%), Gaps = 38/343 (11%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
+ +S G IDG+ ++SGS+HY R W D + K K GL+ + TYV W+ HE
Sbjct: 3 GHNISIVGDKFMIDGKPLHIISGSLHYFRVPAVYWRDRLHKFKAAGLNTVATYVEWSYHE 62
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVW-LHNMPGIEELR 144
P +QY+F G+ DL+RF++T + GL+V+LR+GPY+CAE + GG P W L P I +LR
Sbjct: 63 PEEKQYNFEGDRDLVRFVQTAAEVGLHVLLRVGPYICAERDLGGLPYWLLGKYPNI-KLR 121
Query: 145 TTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSD----------- 193
TT+K F+ E + + + L GGPIIL Q+ENEYG+ SD
Sbjct: 122 TTDKDFIAESDIWLKKLFEQVS--HLLFGNGGPIILVQVENEYGSYDSDLAYKEKMRDLI 179
Query: 194 ----------YGDAGKSYI--NWCAKMATSLDIGVPWIMCQESDAPSPMFTPNNPNSPKI 241
Y G S + + ++D GV Q ++ +F P +
Sbjct: 180 SAHVGDKALLYTTDGPSLVGAGMIPGVHATIDFGV---TSQPTEQFDSLFHLRPAPGPLM 236
Query: 242 WTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG----G 297
+E + GW WG + + D+ + R N+Y++ GG+NF TSG G
Sbjct: 237 NSEFYPGWLTHWGERMARVGTNDIVLTL-RNMIVNKIHVNFYVFFGGSNFEFTSGANFDG 295
Query: 298 PYL--TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTL 338
Y TSYDYDAP+ E G PK+ +RE K L +++ +
Sbjct: 296 TYQPDITSYDYDAPLSEAGD-PTPKYYAIRETLKQLNFVDEKI 337
Score = 44.7 bits (104), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 27/42 (64%), Gaps = 2/42 (4%)
Query: 592 TWYKTTFEAPLENDPV--VLNLQGMGKGFAWVNGYNLGRYWP 631
T+Y+ TF P P+ L+ G KG+ WVNG+NLGRYWP
Sbjct: 507 TFYEGTFVLPEGQKPLDTFLDTTGWDKGYVWVNGHNLGRYWP 548
>gi|1352080|sp|P48982.1|BGAL_XANMN RecName: Full=Beta-galactosidase; Short=Lactase; Flags: Precursor
gi|1045034|gb|AAC41485.1| beta-galactosidase [Xanthomonas axonopodis pv. manihotis]
Length = 598
Score = 172 bits (435), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 114/326 (34%), Positives = 162/326 (49%), Gaps = 41/326 (12%)
Query: 33 GRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYD 92
G DG+ LLSG+IH+ R W D ++KA+ GL+ +ETYVFWN EP + Q+D
Sbjct: 34 GTQFVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFD 93
Query: 93 FTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMN 152
F+GN D+ F+K QGL VILR GPY CAEW GG+P WL I +R+ + F+
Sbjct: 94 FSGNNDVAAFVKEAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIR-VRSRDPRFLA 152
Query: 153 EMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYG---DAGKSYIN------ 203
Q + + + + L GGPII Q+ENEYG+ D+ D Y+
Sbjct: 153 ASQAYLDALAK--QVQPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKA 210
Query: 204 --WCAKMATSLDIG-VPWIMCQESDAPSPM------FTPNNPNSPKIWTENWTGWFKSWG 254
+ + A L G +P + + AP P+ P++ E W GWF WG
Sbjct: 211 LLFTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWG 270
Query: 255 ----GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL---------- 300
D ++ AE+ + + + G N YM+ GGT+FG +G +
Sbjct: 271 KPHAATDARQQAEEFEWILRQ-----GHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQ 325
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRE 326
TTSYDYDA +DE GH PK+ +R+
Sbjct: 326 TTSYDYDAILDEAGHPT-PKFALMRD 350
Score = 44.3 bits (103), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 57/133 (42%), Gaps = 24/133 (18%)
Query: 498 VKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYK 557
V++ G++ + +L G NYG++ G+ PVLL D + + +
Sbjct: 455 VEIPAGQHTLDVLVENSGRINYGTRMADGRAGLVDPVLL-----DSQQLTGWQAFPLPMR 509
Query: 558 VGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKG 617
+S RGW+ K V + +++ T D L+++ GKG
Sbjct: 510 T---------------PDSIRGWTGKAV---QGPAFHRGTLRIGTPTD-TYLDMRAFGKG 550
Query: 618 FAWVNGYNLGRYW 630
FAW NG NLGR+W
Sbjct: 551 FAWANGVNLGRHW 563
>gi|348508360|ref|XP_003441722.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Oreochromis
niloticus]
Length = 648
Score = 172 bits (435), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 113/316 (35%), Positives = 156/316 (49%), Gaps = 29/316 (9%)
Query: 36 ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTG 95
T++ + ++L GSIHY R W D + K K GL+ + TYV WN HEP R + F
Sbjct: 64 FTLERKPFLILGGSIHYFRVPRAYWEDRLLKMKACGLNTLTTYVPWNLHEPERGVFKFDD 123
Query: 96 NLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQ 155
LDL +++ GL+VILR GPY+CAEW+ GG P WL P + +LRTT F +
Sbjct: 124 QLDLEAYLRLAASLGLWVILRPGPYICAEWDLGGLPSWLLRDPQM-KLRTTYSGFTYAVN 182
Query: 156 NFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYG----------DAGKSYINWC 205
+F ++ A + S+GGPII Q+ENEYG+ +D G + +
Sbjct: 183 SFFDEVIKKAVPHQY--SKGGPIIAVQVENEYGSYATDENYMPFIKEALLSRGITELLLT 240
Query: 206 AKMATSLDIG-----VPWIMCQESDAPSPMFTPN-NPNSPKIWTENWTGWFKSWGGKDPK 259
+ L +G + I Q+ D + P PK+ E W+GWF WGG
Sbjct: 241 SDNKDGLKLGGVKGALETINFQKLDPDEIKYLEQIQPQQPKMVMEYWSGWFDLWGGLHHV 300
Query: 260 RTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYLTTSYDYDAPID 311
TAE++ V + + N YM+HGGTNFG SG + TSYDYDAP+
Sbjct: 301 YTAEEMIPVVTEILKLDMSI-NLYMFHGGTNFGFMSGAFAVGLPAPKPMVTSYDYDAPLS 359
Query: 312 EYGHLNQPKWGHLREL 327
E G K+ LR L
Sbjct: 360 EAGDYTT-KYHLLRNL 374
Score = 43.5 bits (101), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 49/190 (25%), Positives = 66/190 (34%), Gaps = 55/190 (28%)
Query: 502 RGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLY 561
+GK + LL G NYG D G+ G +LL +E ++D + H K
Sbjct: 488 KGKRTLGLLVENCGRVNYGKTLDEQRKGLVGDILL-----NEHPLRDFNIHSLDMK---- 538
Query: 562 GLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPL----ENDPVVLNLQGMGKG 617
F N +A W S R + F+ L + L G KG
Sbjct: 539 ----PAFVNRFSAGH---WMSM-----RHQPSFPGFFQGRLYVNGSPQDTFIKLPGWSKG 586
Query: 618 FAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKD 677
++NG NLGRYW T P Q Y VP W+
Sbjct: 587 VVFINGKNLGRYWST-----------------------------GPQQTLY-VPGPWLHR 616
Query: 678 GVNTLVLFEE 687
G N + +FEE
Sbjct: 617 GDNQVTVFEE 626
>gi|299142590|ref|ZP_07035721.1| beta-galactosidase (Lactase) [Prevotella oris C735]
gi|298576025|gb|EFI47900.1| beta-galactosidase (Lactase) [Prevotella oris C735]
Length = 823
Score = 172 bits (435), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 116/348 (33%), Positives = 165/348 (47%), Gaps = 35/348 (10%)
Query: 13 LCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGL 72
L L L TL + + ++G+ ++ + +HYPR W IK K G+
Sbjct: 53 LVLSLATLTAPARGGDFTVGKNTFLLNGQPFVVKAAELHYPRIPRPYWEQRIKMCKSLGM 112
Query: 73 DAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPV 132
+ + YVFWN HE ++DFTGN D+ F + Q G+YVI+R GPYVCAEW GG P
Sbjct: 113 NTVCLYVFWNIHEQQEGKFDFTGNNDVAAFCRLAQKNGMYVIVRPGPYVCAEWEMGGLPW 172
Query: 133 WLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV-- 190
WL I LR + FM ++ F + + L GGPII+ Q+ENEYG+
Sbjct: 173 WLLKKKDIR-LREDDPYFMARVKAFEAEV--GRQLAPLTIQNGGPIIMVQVENEYGSYGV 229
Query: 191 ----MSDYGDAGKS---------YINWCAKMATSLDIGVPWIM----CQESDAPSPMFTP 233
+S D K+ +W + + + W M DA
Sbjct: 230 NKKYVSQIRDIVKASGFDKVTLFQCDWASNFENNGLDDLVWTMNFGTGSNIDAQFKRLKQ 289
Query: 234 NNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGR 293
P++P + +E W+GWF WG + R A+ + + +F + YM HGGT+FG
Sbjct: 290 LRPDAPLMCSEFWSGWFDKWGARHETRPAKAMVEGIDEMLSKNISF-SLYMTHGGTSFGH 348
Query: 294 TSGG--PYL---TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
+G P TSYDYDAPI+EYGH PK+ LR K+M+K
Sbjct: 349 WAGANSPGFAPDVTSYDYDAPINEYGHAT-PKFWELR------KTMQK 389
>gi|423252157|ref|ZP_17233159.1| hypothetical protein HMPREF1066_04169 [Bacteroides fragilis
CL03T00C08]
gi|423252477|ref|ZP_17233408.1| hypothetical protein HMPREF1067_00052 [Bacteroides fragilis
CL03T12C07]
gi|392647903|gb|EIY41596.1| hypothetical protein HMPREF1066_04169 [Bacteroides fragilis
CL03T00C08]
gi|392660553|gb|EIY54162.1| hypothetical protein HMPREF1067_00052 [Bacteroides fragilis
CL03T12C07]
Length = 628
Score = 172 bits (435), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 116/327 (35%), Positives = 167/327 (51%), Gaps = 43/327 (13%)
Query: 39 DGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLD 98
+G+ +LSG +HY R W ++ K GL+ + TYVFWN HEP ++DFTG+ +
Sbjct: 37 NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 99 LIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFT 158
L FIK ++G+ VILR GPYVCAEW +GG+P WL N+ G+ E+R N F+ +T
Sbjct: 97 LAEFIKIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGM-EIRRDNPEFL----KYT 151
Query: 159 TLIVDMAKKE--KLFASQGGPIILAQIENEYGNVMSDYGDAG-KSYINWCAKMATSL-DI 214
+D KE L ++GGPI++ Q ENE+G+ ++ D + + + AK+ L D
Sbjct: 152 KAYIDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADA 211
Query: 215 G--VPWI------MCQESDAPSPMFTPNNPNS----------------PKIWTENWTGWF 250
G VP + + P + T N + P + E + GW
Sbjct: 212 GFNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQYHDGKGPYMVAEFYPGWL 271
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY--------LTT 302
W P+ A +A ++ Q +F N+YM HGGTNFG TSG Y T
Sbjct: 272 SHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDMT 330
Query: 303 SYDYDAPIDEYGHLNQPKWGHLRELHK 329
SYDYDAPI E G + PK+ +R + K
Sbjct: 331 SYDYDAPISEAGWVT-PKYDSIRNVIK 356
Score = 46.2 bits (108), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 47/194 (24%), Positives = 74/194 (38%), Gaps = 43/194 (22%)
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDK 566
+ +L +G NYGS+ GI PV + G+ I+ ++ +D+
Sbjct: 468 LQILVENMGRINYGSEIVHNTKGIISPVQIAGKE----IVGGWDMYQLP-------MDEM 516
Query: 567 KFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNL 626
A++ + S+ L Y+ TF D ++++ GKG +VNG N+
Sbjct: 517 PDLTKLKADTHKNVPSEVAKLKGCPVLYEGTFTLDKVGD-TFMDMESWGKGIVFVNGVNI 575
Query: 627 GRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
GRYW P Q Y VP W+K G N +V+FE
Sbjct: 576 GRYWKV-----------------------------GPQQTLY-VPGVWLKKGENKIVIFE 605
Query: 687 EFGGNPSQINFQTV 700
+ P Q +TV
Sbjct: 606 QLNETP-QTEVKTV 618
>gi|373953405|ref|ZP_09613365.1| glycoside hydrolase family 35 [Mucilaginibacter paludis DSM 18603]
gi|373890005|gb|EHQ25902.1| glycoside hydrolase family 35 [Mucilaginibacter paludis DSM 18603]
Length = 608
Score = 172 bits (435), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 125/389 (32%), Positives = 187/389 (48%), Gaps = 38/389 (9%)
Query: 16 ILQTLFNLSLAYRVSHD----GRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGG 71
+L LF +V+H A +DG+ ++SG +HYPR W +K AK G
Sbjct: 10 LLMLLFVFPAVGQVNHTFALGDEAFLLDGKPFQMISGEMHYPRVPRESWRARMKMAKAMG 69
Query: 72 LDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFP 131
L+ I TYVFWN HEP + ++DFTGN D+ F++ + +GL+VILR PYVCAEW +GG+P
Sbjct: 70 LNTIGTYVFWNLHEPQKGKFDFTGNNDVAEFVRIAKQEGLWVILRPSPYVCAEWEFGGYP 129
Query: 132 VWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKK-EKLFASQGGPIILAQIENEYGNV 190
WL N G+ +R+ ++ E +++ I ++ K+ L + GG I++ QIENEYG+
Sbjct: 130 YWLQNEKGL-VVRSKEAQYLKEYESY---IKEVGKQLAPLQINHGGNILMVQIENEYGSY 185
Query: 191 MSD----------YGDAG-KSYINWCAKMATSLDIGVPWIM-----CQESDAPSPMFTPN 234
SD + +AG + C A ++ +P ++ D + + N
Sbjct: 186 GSDKDYLAINQKLFKEAGFDGLLYTCDPAADLVNGHLPGLLPAVNGIDNPDKVKQIISQN 245
Query: 235 -NPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGR 293
N P E + WF WG K A + + G + N YM+HGGT G
Sbjct: 246 HNGKGPYYIAEWYPAWFDWWGTKHHTVPAAEYTGRLDSVLAAGISI-NMYMFHGGTTRGF 304
Query: 294 TSGGPYLTT--------SYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTN 345
+G Y T SYDYDAP+DE G+ PK+ R + + K + +T V
Sbjct: 305 MNGANYKDTSPYEPQVSSYDYDAPLDEAGNAT-PKFMAFRSV--IEKHLPAGVTLPPVPA 361
Query: 346 TDYGNSVSGSSYNLPAWSVSILPDCKTEE 374
SV+ A + ILP K +
Sbjct: 362 AKPAISVAAIKLTRSAGILDILPQAKVSK 390
Score = 41.2 bits (95), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 33/139 (23%), Positives = 51/139 (36%), Gaps = 47/139 (33%)
Query: 562 GLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVV------------- 608
G+ +K +N + N+ W ++P N + + + PV+
Sbjct: 482 GITEKVLFNTQQVNN---WQMYSLPFNHAEAINLKSGSSTMGTAPVIKSGYFNLQKTGDT 538
Query: 609 -LNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIW 667
L+++ GKG WVNG+NLGRYW P Q
Sbjct: 539 YLDMRKWGKGLVWVNGHNLGRYWQV-----------------------------GPQQTL 569
Query: 668 YHVPRSWIKDGVNTLVLFE 686
Y VP W+K G N + + E
Sbjct: 570 Y-VPAEWLKKGQNEVRVLE 587
>gi|29376389|ref|NP_815543.1| glycosyl hydrolase [Enterococcus faecalis V583]
gi|227519038|ref|ZP_03949087.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|227553661|ref|ZP_03983710.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|256961654|ref|ZP_05565825.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|293383358|ref|ZP_06629271.1| beta-galactosidase [Enterococcus faecalis R712]
gi|293388990|ref|ZP_06633475.1| beta-galactosidase [Enterococcus faecalis S613]
gi|312907816|ref|ZP_07766806.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|312910433|ref|ZP_07769280.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|422714340|ref|ZP_16771066.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|422715597|ref|ZP_16772313.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|424676484|ref|ZP_18113355.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|424681702|ref|ZP_18118489.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|424685588|ref|ZP_18122282.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|424686206|ref|ZP_18122874.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|424690524|ref|ZP_18127059.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|424694932|ref|ZP_18131318.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|424696643|ref|ZP_18132984.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|424700339|ref|ZP_18136532.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|424703758|ref|ZP_18139884.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|424712611|ref|ZP_18144783.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|424718249|ref|ZP_18147501.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|424721894|ref|ZP_18150963.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|424723972|ref|ZP_18152924.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|424733572|ref|ZP_18162127.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|424741709|ref|ZP_18170052.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|424751990|ref|ZP_18179997.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
gi|29343852|gb|AAO81613.1| glycosyl hydrolase, family 35 [Enterococcus faecalis V583]
gi|227073538|gb|EEI11501.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|227177203|gb|EEI58175.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|256952150|gb|EEU68782.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|291079149|gb|EFE16513.1| beta-galactosidase [Enterococcus faecalis R712]
gi|291081771|gb|EFE18734.1| beta-galactosidase [Enterococcus faecalis S613]
gi|310626177|gb|EFQ09460.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|311289706|gb|EFQ68262.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|315575942|gb|EFU88133.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|315580774|gb|EFU92965.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|402350621|gb|EJU85522.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|402356496|gb|EJU91227.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|402358329|gb|EJU93003.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|402364102|gb|EJU98549.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|402367740|gb|EJV02077.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|402369105|gb|EJV03397.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|402374029|gb|EJV08075.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|402377412|gb|EJV11319.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|402379869|gb|EJV13650.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|402382152|gb|EJV15835.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|402384002|gb|EJV17579.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|402390099|gb|EJV23464.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|402391584|gb|EJV24885.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|402396442|gb|EJV29504.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|402401146|gb|EJV33935.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|402404973|gb|EJV37581.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
Length = 611
Score = 172 bits (435), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 114/357 (31%), Positives = 168/357 (47%), Gaps = 53/357 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ L+SG+IHY R TP W D + K G + IETY+ WN HEP+ YDF G
Sbjct: 12 VDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEGMK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D++ F+ Q+ GL VILR Y+CAEW +GG P WL + LR+T+ F+ +++ +
Sbjct: 72 DIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWL--LKEHVRLRSTDPRFIAKVRTY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+++ + K L + GGP+I+ Q+ENEYG+ YG K Y+ ++ I VP
Sbjct: 130 FSVL--LPKLVPLQVTHGGPVIMMQVENEYGS----YG-MEKEYLRQTKQVMEEFGIDVP 182
Query: 218 WIMC-----------------------------QESDAPSPMFTPNNPNSPKIWTENWTG 248
+ + ++ P + E W G
Sbjct: 183 LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEYWDG 242
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-------T 301
WF WG KR +DLA V G N YM+HGGTNFG +G
Sbjct: 243 WFNRWGEPIIKRDGQDLANEVKDMLALGSL--NLYMFHGGTNFGFYNGCSARGVLDLPQV 300
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHK-----LLKSMEKTLTYGNVTNTDYGNSVS 353
TSYDYDA + E G + K+ H++ K + ++ + T+G++ NSVS
Sbjct: 301 TSYDYDALLTEAGEPTE-KYFHVQRAIKEVCPEVWQAEPRRKTFGSLGTFPVQNSVS 356
Score = 43.5 bits (101), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 94/251 (37%), Gaps = 68/251 (27%)
Query: 462 LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISL--LSATVGLQNY 519
L++ + LH + +G+ Q+ + N E +K T K I L L +G NY
Sbjct: 401 LKVVEASDRLHLFADGSLQTIQYQE----NLREEVMIKGTPEKEWIELDVLVENLGRVNY 456
Query: 520 GSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHK--WTYKVGLYGLDDKKFYNAKAANSE 577
G K + GP + G G I++D+ H+ Y + L KK N
Sbjct: 457 GFKLN-------GPTQVKGIRGG--IMQDIHFHQGYRQYALTLSADQLKKIDYTAGKNPA 507
Query: 578 RGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEE 637
+ ++Y+ F D + + + GKG VNG NLGRYW
Sbjct: 508 QP------------SFYQAEFTLTDLADTFI-DCRSYGKGVVIVNGINLGRYWQ------ 548
Query: 638 DGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
RGP S C P+ ++K G N +V+FE G +++ F
Sbjct: 549 ----------RGPIHSLYC--------------PKEFLKKGTNEIVIFETEGIEINELIF 584
Query: 698 QTVVVGTACGQ 708
CGQ
Sbjct: 585 --------CGQ 587
>gi|418519416|ref|ZP_13085468.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB2388]
gi|410704860|gb|EKQ63339.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB2388]
Length = 613
Score = 172 bits (435), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 118/359 (32%), Positives = 172/359 (47%), Gaps = 50/359 (13%)
Query: 9 RAILLCLILQTLFNLSLAYRVSH---------DGRAITIDGERKILLSGSIHYPRSTPGM 59
R L L+L F L + + G DG+ LLSG+IH+ R
Sbjct: 3 RTTLAPLVLALAFALPITGTAAETERWPNFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAY 62
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W D ++KA+ GL+ +ETYVFWN EP + Q+DF+G+ D+ F++ QGL VILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGHNDVAAFVREAAAQGLNVILRPGP 122
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
Y CAEW GG+P WL I +R+ + F+ Q + + + + + L GGPII
Sbjct: 123 YACAEWEAGGYPAWLFGKGNIR-VRSRDPRFLAASQAYLDALAN--QVQPLLNHNGGPII 179
Query: 180 LAQIENEYGNVMSDYG---DAGKSYIN--------WCAKMATSLDIG-VPWIMCQESDAP 227
Q+ENEYG+ D+ D Y+ + + A L G +P + + AP
Sbjct: 180 AVQVENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAP 239
Query: 228 SPM------FTPNNPNSPKIWTENWTGWFKSWG----GKDPKRTAEDLAFAVARFFQFGG 277
P+ P++ E W GWF WG D ++ AE+ + + + G
Sbjct: 240 GEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGKPHAATDARQQAEEFEWILRQ-----G 294
Query: 278 TFQNYYMYHGGTNFGRTSGGPYL----------TTSYDYDAPIDEYGHLNQPKWGHLRE 326
N YM+ GGT+FG +G + TTSYDYDA +DE GH PK+ +R+
Sbjct: 295 HSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHPT-PKFALMRD 352
Score = 44.7 bits (104), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 35/136 (25%), Positives = 61/136 (44%), Gaps = 30/136 (22%)
Query: 498 VKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAG--DETIIKDLSSHKW- 554
V++ G++ + +L G NYG++ + GRAG D ++ + W
Sbjct: 457 VEIPAGQHTLDVLVENSGRINYGTR------------MADGRAGLVDPVVLDNRQLTGWQ 504
Query: 555 TYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGM 614
+ + + + +S RGW+ K V + +++ T D L+++
Sbjct: 505 AFPLPM-----------RTPDSIRGWTRKAV---QGPAFHRGTLRIGTPTD-TYLDMRAF 549
Query: 615 GKGFAWVNGYNLGRYW 630
GKGFAW NG NLGR+W
Sbjct: 550 GKGFAWANGVNLGRHW 565
>gi|257090118|ref|ZP_05584479.1| beta-galactosidase [Enterococcus faecalis CH188]
gi|256998930|gb|EEU85450.1| beta-galactosidase [Enterococcus faecalis CH188]
Length = 594
Score = 172 bits (435), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 108/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV W+ HEP + + F G L
Sbjct: 12 LNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWDLHEPQKGTFHFEGIL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 72 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 182
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 183 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 239
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 240 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 297
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 298 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|417991864|ref|ZP_12632235.1| beta-galactosidase 3 [Lactobacillus casei CRF28]
gi|410534805|gb|EKQ09440.1| beta-galactosidase 3 [Lactobacillus casei CRF28]
Length = 598
Score = 172 bits (435), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 115/354 (32%), Positives = 169/354 (47%), Gaps = 64/354 (18%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + H+ +DG+ +LSG+IHY R P W + K G + +ETYV WN HE
Sbjct: 4 FSIDHE---FMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEY 60
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+DF+G LD+ F+ T +D GLY I+R PY+CAEW +GGFP WL + LRT
Sbjct: 61 NEGDFDFSGILDIEHFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL--LTKKMRLRTD 118
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
+ ++ + + T ++ ++ + GG +I+ Q+ENEYG+ D K Y+ A
Sbjct: 119 DSAYLQAIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSYGED-----KDYLAAVA 171
Query: 207 KMATSLDIGVPWIMCQESDAPSP------------MFTPNNPNS---------------- 238
++ + VP SD P P + T N S
Sbjct: 172 ELMKKHGVDVPLFT---SDGPWPATLNAGSMADAGILTTGNFGSHADMNFDRLAAFNQAH 228
Query: 239 ----PKIWTENWTGWFKSWGG----KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTN 290
P + E W GWF WG +DP+ TAEDL + R G+ N YM+HGGTN
Sbjct: 229 GHDWPLMCMEFWDGWFNRWGEPIIRRDPEETAEDLRAVIQR-----GSV-NLYMFHGGTN 282
Query: 291 FGRTSGGPYL-------TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKT 337
FG +G TSYDYDAP++E G+ + + +H++L S +T
Sbjct: 283 FGFMNGTSARKDHDLPQVTSYDYDAPLNEQGNPTPKYFAIQKMIHEVLPSQAQT 336
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 86/200 (43%), Gaps = 33/200 (16%)
Query: 436 SDYLWYMTNADLKDDDPILSGSSNMT---LRINSSGQVLHAYVNGNYVDSQWTKYGASND 492
++L T L +P++SG+ T LR+ + + A+ +G + +Q+ + +D
Sbjct: 376 QEFLGQYTGYTLYRTNPLISGTDKGTPAKLRVIDARDRVQAFFDGKSLATQYQE-AIGDD 434
Query: 493 LFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPN--GIPGPVLLVGRAGDETIIKDLS 550
+ V+ G++Q+ LL + NYGSK + + GI V++ D IKD
Sbjct: 435 ILLPEVE---GRHQLDLLVENMSRVNYGSKIEAITQFKGIRTGVMV-----DLHFIKDYL 486
Query: 551 SHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLN 610
Y LD K A + W + +Y+ F+ D L+
Sbjct: 487 Q---------YPLDLNK---APQLDFTGDWQAGTP------AFYQYGFDVVKPQD-TYLD 527
Query: 611 LQGMGKGFAWVNGYNLGRYW 630
+G GKG VNG N+GR+W
Sbjct: 528 CRGFGKGVMLVNGVNIGRFW 547
>gi|297204198|ref|ZP_06921595.1| beta-galactosidase [Streptomyces sviceus ATCC 29083]
gi|197714112|gb|EDY58146.1| beta-galactosidase [Streptomyces sviceus ATCC 29083]
Length = 588
Score = 172 bits (435), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 116/331 (35%), Positives = 164/331 (49%), Gaps = 41/331 (12%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A S DG + GE ++SG++HY R P W D ++KA+ GL+ IETY+ WN HE
Sbjct: 6 ALTTSSDG--FLLHGEPFRIISGAMHYFRIHPDQWTDRLRKARLMGLNTIETYLPWNLHE 63
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P G LDL R+++ QD+GL+V+LR GP++CAEW+ GG P WL P I LR+
Sbjct: 64 PEPGTLVLDGFLDLPRWLRLAQDEGLHVLLRPGPFICAEWDDGGLPAWLLADPDI-RLRS 122
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
++ F + ++ + A+ GGP+I Q+ENEYG YGD +Y+
Sbjct: 123 SDPRFTGAFDGYLDQLLPALR--PFMAAHGGPVIAVQVENEYGA----YGD-DTAYLKHV 175
Query: 206 AKMATSLDIGVPWIMCQESDA--------PSPMFTP---------------NNPNSPKIW 242
+ + C ++ A P + T + P P +
Sbjct: 176 HQALRDRGVEELLYTCDQASAEHLAAGTLPGTLATATFGSRVEENLAALRTHQPEGPLMC 235
Query: 243 TENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY--- 299
+E W GWF WGG R+A D A + R G + N YM+HGGTNFG T+G +
Sbjct: 236 SEFWVGWFDHWGGPHHVRSAADAAADLDRLLSAGASV-NIYMFHGGTNFGFTNGANHKHA 294
Query: 300 ---LTTSYDYDAPIDEYGHLNQPKWGHLREL 327
TSYDYDAP+ E G PK+ RE+
Sbjct: 295 YEPTVTSYDYDAPLTESGDPG-PKYHAFREV 324
Score = 43.5 bits (101), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 48/124 (38%), Gaps = 46/124 (37%)
Query: 578 RGWSSKNVPLNRRMT---------------WYKTTFEAPLENDPVVLNLQGMGKGFAWVN 622
RGW + VPL+ +++ TFE D L+L G KG AWVN
Sbjct: 474 RGWECRPVPLDDLAAVPFGPSTATTDAVPAFHRGTFEVDSPAD-TFLSLPGWTKGQAWVN 532
Query: 623 GYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTL 682
G++LGRYW RGP Q +VP ++ G N L
Sbjct: 533 GFHLGRYW----------------NRGP--------------QHTLYVPAPVLRPGANEL 562
Query: 683 VLFE 686
VL E
Sbjct: 563 VLLE 566
>gi|222526932|ref|YP_002571403.1| beta-galactosidase [Chloroflexus sp. Y-400-fl]
gi|222450811|gb|ACM55077.1| Beta-galactosidase [Chloroflexus sp. Y-400-fl]
Length = 917
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 115/313 (36%), Positives = 164/313 (52%), Gaps = 33/313 (10%)
Query: 28 RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPL 87
RV +G I +DG+ LLSG +HY R W L+++A+ GL+ I+T + WN HEP
Sbjct: 26 RVHRNG--IELDGKPFYLLSGCVHYFRWPRAEWRPLLEQARWAGLNTIDTVIPWNRHEPQ 83
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
++DF+ DL F+ + GL I+R GPY+CAEW GG P WL G LR+ +
Sbjct: 84 PGEFDFSEEADLGAFLDLCHELGLKAIVRPGPYICAEWENGGLPAWL-TASGDMRLRSDD 142
Query: 148 KVFMNE-MQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
F + ++ F TL+ + ++ GGPIIL QIENE+ YG ++ A
Sbjct: 143 PAFRDAVLRWFDTLMPILVPRQY---PHGGPIILCQIENEHW-ASGVYG--ADTHQQTLA 196
Query: 207 KMATSLDIGVPWIMCQESDAPSPMFTPNN---------------PNSPKIWTENWTGWFK 251
+ A I VP C + P F N P++P I +E W+GWF
Sbjct: 197 QAALERGIVVPQYTCVGAMPGYPEFR-NGWSGIAEKLVQTRQLWPDNPLIVSELWSGWFD 255
Query: 252 SWGG-KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNF----GRTSGGP--YLTTSY 304
+WGG + ++TA L + + G +++M+ GGTNF GRT GG ++TTSY
Sbjct: 256 NWGGHRQTRKTAAKLDMTLHQLTAVGCAGFSHWMWAGGTNFGFWGGRTVGGDLIHMTTSY 315
Query: 305 DYDAPIDEYGHLN 317
DYDAP+DEYG L
Sbjct: 316 DYDAPVDEYGRLT 328
>gi|312903586|ref|ZP_07762766.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
gi|310633462|gb|EFQ16745.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
Length = 611
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 114/357 (31%), Positives = 168/357 (47%), Gaps = 53/357 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ L+SG+IHY R TP W D + K G + IETY+ WN HEP+ YDF G
Sbjct: 12 VDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEGMK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D++ F+ Q+ GL VILR Y+CAEW +GG P WL + LR+T+ F+ +++ +
Sbjct: 72 DIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWL--LKEHVRLRSTDPRFIAKVRTY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+++ + K L + GGP+I+ Q+ENEYG+ YG K Y+ ++ I VP
Sbjct: 130 FSVL--LPKLVPLQVTHGGPVIMMQVENEYGS----YG-MEKEYLRQTKQVMEEFGIDVP 182
Query: 218 WIMC-----------------------------QESDAPSPMFTPNNPNSPKIWTENWTG 248
+ + ++ P + E W G
Sbjct: 183 LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEYWDG 242
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-------T 301
WF WG KR +DLA V G N YM+HGGTNFG +G
Sbjct: 243 WFNRWGEPIIKRDGQDLANEVKDMLALGSL--NLYMFHGGTNFGFYNGCSARGVLDLPQV 300
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLL-----KSMEKTLTYGNVTNTDYGNSVS 353
TSYDYDA + E G + K+ H++ K + ++ + T+G++ NSVS
Sbjct: 301 TSYDYDALLTEAGEPTE-KYFHVQRAIKEVCPEVWQAEPRRKTFGSLGTFPVQNSVS 356
>gi|229545563|ref|ZP_04434288.1| possible beta-galactosidase [Enterococcus faecalis TX1322]
gi|256619317|ref|ZP_05476163.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|256853375|ref|ZP_05558745.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
gi|256964870|ref|ZP_05569041.1| beta-galactosidase [Enterococcus faecalis HIP11704]
gi|257090147|ref|ZP_05584508.1| beta-galactosidase [Enterococcus faecalis CH188]
gi|294614275|ref|ZP_06694194.1| glycosyl hydrolase, family 35 [Enterococcus faecium E1636]
gi|307272958|ref|ZP_07554205.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
gi|307277803|ref|ZP_07558888.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|307291733|ref|ZP_07571605.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
gi|384518848|ref|YP_005706153.1| beta-galactosidase [Enterococcus faecalis 62]
gi|422685728|ref|ZP_16743941.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
gi|422689100|ref|ZP_16747212.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
gi|422720655|ref|ZP_16777264.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|422731066|ref|ZP_16787446.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
gi|422739263|ref|ZP_16794446.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
gi|430849460|ref|ZP_19467237.1| glycosyl hydrolase [Enterococcus faecium E1185]
gi|229309303|gb|EEN75290.1| possible beta-galactosidase [Enterococcus faecalis TX1322]
gi|256598844|gb|EEU18020.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|256711834|gb|EEU26872.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
gi|256955366|gb|EEU71998.1| beta-galactosidase [Enterococcus faecalis HIP11704]
gi|256998959|gb|EEU85479.1| beta-galactosidase [Enterococcus faecalis CH188]
gi|291592934|gb|EFF24524.1| glycosyl hydrolase, family 35 [Enterococcus faecium E1636]
gi|306497185|gb|EFM66730.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
gi|306505543|gb|EFM74728.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|306510572|gb|EFM79595.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
gi|315029440|gb|EFT41372.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
gi|315032046|gb|EFT43978.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|315144925|gb|EFT88941.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
gi|315162898|gb|EFU06915.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
gi|315577862|gb|EFU90053.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
gi|323480981|gb|ADX80420.1| beta-galactosidase [Enterococcus faecalis 62]
gi|430537598|gb|ELA77922.1| glycosyl hydrolase [Enterococcus faecium E1185]
Length = 611
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 114/357 (31%), Positives = 168/357 (47%), Gaps = 53/357 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ L+SG+IHY R TP W D + K G + IETY+ WN HEP+ YDF G
Sbjct: 12 VDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEGMK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D++ F+ Q+ GL VILR Y+CAEW +GG P WL + LR+T+ F+ +++ +
Sbjct: 72 DIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWL--LKEHVRLRSTDPRFIAKVRTY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+++ + K L + GGP+I+ Q+ENEYG+ YG K Y+ ++ I VP
Sbjct: 130 FSVL--LPKLVPLQVTHGGPVIMMQVENEYGS----YG-MEKEYLRQTKQVMEEFGIDVP 182
Query: 218 WIMC-----------------------------QESDAPSPMFTPNNPNSPKIWTENWTG 248
+ + ++ P + E W G
Sbjct: 183 LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEYWDG 242
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-------T 301
WF WG KR +DLA V G N YM+HGGTNFG +G
Sbjct: 243 WFNRWGEPIIKRDGQDLANEVKDMLALGSL--NLYMFHGGTNFGFYNGCSARGVLDLPQV 300
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLL-----KSMEKTLTYGNVTNTDYGNSVS 353
TSYDYDA + E G + K+ H++ K + ++ + T+G++ NSVS
Sbjct: 301 TSYDYDALLTEAGEPTE-KYFHVQRAIKEVCPEVWQAEPRRKTFGSLGTFPVQNSVS 356
Score = 42.4 bits (98), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 94/251 (37%), Gaps = 68/251 (27%)
Query: 462 LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISL--LSATVGLQNY 519
L++ + LH + +G+ Q+ + N E +K T K I L L +G NY
Sbjct: 401 LKVVEASDRLHLFADGSLQTIQYQE----NLGEEVMIKGTPEKEWIELDVLVENLGRVNY 456
Query: 520 GSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHK--WTYKVGLYGLDDKKFYNAKAANSE 577
G K + GP + G G I++D+ H+ Y + L KK N
Sbjct: 457 GFKLN-------GPTQVKGIRGG--IMQDIHFHQGYRQYALTLSADQLKKIDYTAGKNPA 507
Query: 578 RGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEE 637
+ ++Y+ F D + + + GKG VNG NLGRYW
Sbjct: 508 QP------------SFYQAEFTLTDLADTFI-DCRSYGKGVVIVNGINLGRYWQ------ 548
Query: 638 DGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
RGP S C P+ ++K G N +V+FE G +++ F
Sbjct: 549 ----------RGPIHSLYC--------------PKEFLKKGTNEIVIFETEGIEINELIF 584
Query: 698 QTVVVGTACGQ 708
CGQ
Sbjct: 585 --------CGQ 587
>gi|421514041|ref|ZP_15960756.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
gi|401672838|gb|EJS79281.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
Length = 611
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 114/357 (31%), Positives = 168/357 (47%), Gaps = 53/357 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ L+SG+IHY R TP W D + K G + IETY+ WN HEP+ YDF G
Sbjct: 12 VDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEGMK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D++ F+ Q+ GL VILR Y+CAEW +GG P WL + LR+T+ F+ +++ +
Sbjct: 72 DIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWL--LKEHVRLRSTDPRFIAKVRTY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+++ + K L + GGP+I+ Q+ENEYG+ YG K Y+ ++ I VP
Sbjct: 130 FSVL--LPKLVPLQVTHGGPVIMMQVENEYGS----YG-MEKEYLRQTKQVMEEFGIDVP 182
Query: 218 WIMC-----------------------------QESDAPSPMFTPNNPNSPKIWTENWTG 248
+ + ++ P + E W G
Sbjct: 183 LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEYWDG 242
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-------T 301
WF WG KR +DLA V G N YM+HGGTNFG +G
Sbjct: 243 WFNRWGEPIIKRDGQDLANEVKDMLALGSL--NLYMFHGGTNFGFYNGCSARGVLDLPQV 300
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHK-----LLKSMEKTLTYGNVTNTDYGNSVS 353
TSYDYDA + E G + K+ H++ K + ++ + T+G++ NSVS
Sbjct: 301 TSYDYDALLTEAGEPTE-KYFHVQRAIKEVCPEVWQAEPRRKTFGSLGTFPVQNSVS 356
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 94/251 (37%), Gaps = 68/251 (27%)
Query: 462 LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISL--LSATVGLQNY 519
L++ + LH + +G+ Q+ + N E +K T K I L L +G NY
Sbjct: 401 LKVVEASDRLHLFADGSLQTIQYQE----NLGEEVMIKGTPEKEWIELDVLVENLGRVNY 456
Query: 520 GSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHK--WTYKVGLYGLDDKKFYNAKAANSE 577
G K + GP + G G I++D+ H+ Y + L KK N
Sbjct: 457 GFKLN-------GPTQVKGIRGG--IMQDIHFHQGYRQYALTLSADQLKKIDYTAGKNPA 507
Query: 578 RGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEE 637
+ ++Y+ F D + + + GKG VNG NLGRYW
Sbjct: 508 QP------------SFYQAEFTLTDLADTFI-DCRSYGKGVVIVNGINLGRYWQ------ 548
Query: 638 DGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
RGP S C P+ ++K G N +V+FE G +++ F
Sbjct: 549 ----------RGPIHSLYC--------------PKEFLKKGTNEIVIFETEGIEINELIF 584
Query: 698 QTVVVGTACGQ 708
CGQ
Sbjct: 585 --------CGQ 587
>gi|307275710|ref|ZP_07556850.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
gi|306507586|gb|EFM76716.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
Length = 611
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 114/357 (31%), Positives = 168/357 (47%), Gaps = 53/357 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ L+SG+IHY R TP W D + K G + IETY+ WN HEP+ YDF G
Sbjct: 12 VDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEGMK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D++ F+ Q+ GL VILR Y+CAEW +GG P WL + LR+T+ F+ +++ +
Sbjct: 72 DIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWL--LKEHVRLRSTDPRFIAKVRTY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+++ + K L + GGP+I+ Q+ENEYG+ YG K Y+ ++ I VP
Sbjct: 130 FSVL--LPKLVPLQVTHGGPVIMMQVENEYGS----YG-MEKEYLRQTKQVMEEFGIDVP 182
Query: 218 WIMC-----------------------------QESDAPSPMFTPNNPNSPKIWTENWTG 248
+ + ++ P + E W G
Sbjct: 183 LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEYWDG 242
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-------T 301
WF WG KR +DLA V G N YM+HGGTNFG +G
Sbjct: 243 WFNRWGEPIIKRDGQDLANEVKDMLALGSL--NLYMFHGGTNFGFYNGCSARGVLDLPQV 300
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHK-----LLKSMEKTLTYGNVTNTDYGNSVS 353
TSYDYDA + E G + K+ H++ K + ++ + T+G++ NSVS
Sbjct: 301 TSYDYDALLTEAGEPTE-KYFHVQRAIKEVCPEVWQAEPRRKTFGSLGTFPVQNSVS 356
Score = 43.5 bits (101), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 60/251 (23%), Positives = 94/251 (37%), Gaps = 68/251 (27%)
Query: 462 LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISL--LSATVGLQNY 519
L++ + LH + +G+ Q+ + ++ +K T K I L L +G NY
Sbjct: 401 LKVVEASDRLHLFADGSLQTIQYQENLGEEEM----IKGTPEKEWIELDVLVENLGRVNY 456
Query: 520 GSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHK--WTYKVGLYGLDDKKFYNAKAANSE 577
G K + GP + G G I++D+ H+ Y + L KK N
Sbjct: 457 GFKLN-------GPTQVKGIRGG--IMQDIHFHQGYRQYALTLSADQLKKIDYTAGKNPA 507
Query: 578 RGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEE 637
+ ++Y+ F D + + + GKG VNG NLGRYW
Sbjct: 508 QP------------SFYQAEFTLTDLADTFI-DCRSYGKGVVIVNGINLGRYWQ------ 548
Query: 638 DGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
RGP S C P+ ++K G N +V+FE G +++ F
Sbjct: 549 ----------RGPIHSLYC--------------PKEFLKKGTNEIVIFETEGIEINELIF 584
Query: 698 QTVVVGTACGQ 708
CGQ
Sbjct: 585 --------CGQ 587
>gi|167755577|ref|ZP_02427704.1| hypothetical protein CLORAM_01091 [Clostridium ramosum DSM 1402]
gi|167704516|gb|EDS19095.1| glycosyl hydrolase family 35 [Clostridium ramosum DSM 1402]
Length = 584
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 112/350 (32%), Positives = 164/350 (46%), Gaps = 51/350 (14%)
Query: 34 RAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDF 93
+ I+G + ++SG++HY R P W D + K G + +ETYV WN HEP + +YDF
Sbjct: 8 KEFFINGNKVKIISGAVHYFRIVPEYWRDTLLDLKAMGCNTVETYVPWNLHEPYQGKYDF 67
Query: 94 TGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNE 153
+G D+ F+K ++ L+VILR PY+CAEW GG P WL P I LRT +K ++
Sbjct: 68 SGIKDIETFLKLAEELELFVILRASPYICAEWEMGGLPAWLLKYPRI-RLRTNDKQYLKC 126
Query: 154 MQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLD 213
+ + +++ + K K +Q GPIILAQ+ENEYG+ D K Y+ +M
Sbjct: 127 LDQYFSIL--LPKLSKYQITQNGPIILAQLENEYGSYGED-----KEYLLAVYQMMRKYG 179
Query: 214 IGVPWI--------------MCQESDAPSPMFTPN---------------NPNSPKIWTE 244
I VP + ++ P+ F +P + E
Sbjct: 180 IEVPLFTADGTWHEALNAGSLLEKKVFPTGNFGSQAKENITVLKKFMESYQITAPLMCME 239
Query: 245 NWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGG 297
W GWF W + KR ++ + G N+YM+ GGTNFG R
Sbjct: 240 FWDGWFNRWNQEIIKRDPQEFVNSAQEMLSLGSV--NFYMFQGGTNFGWMNGCSARKEHD 297
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLREL----HKLLKSMEKTLTYGNV 343
TSYDYDA + EYG + K+ LRE+ + L +T YG +
Sbjct: 298 LPQITSYDYDAILTEYGAKTE-KYHLLREVITGKKERLPERRQTKNYGQI 346
>gi|312903555|ref|ZP_07762735.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
gi|422689128|ref|ZP_16747240.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
gi|422731840|ref|ZP_16788189.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
gi|310633431|gb|EFQ16714.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
gi|315162138|gb|EFU06155.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
gi|315577890|gb|EFU90081.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
Length = 604
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 108/332 (32%), Positives = 161/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV W+ HEP + + F G L
Sbjct: 22 LNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWDLHEPQKGTFHFEGIL 81
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 82 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 139
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 192
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 193 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 249
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 250 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 307
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 308 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|163848976|ref|YP_001637020.1| beta-galactosidase [Chloroflexus aurantiacus J-10-fl]
gi|163670265|gb|ABY36631.1| Beta-galactosidase [Chloroflexus aurantiacus J-10-fl]
Length = 897
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 115/313 (36%), Positives = 164/313 (52%), Gaps = 33/313 (10%)
Query: 28 RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPL 87
RV +G I +DG+ LLSG +HY R W L+++A+ GL+ I+T + WN HEP
Sbjct: 6 RVHRNG--IELDGKPFYLLSGCVHYFRWPRAEWRPLLEQARWAGLNTIDTVIPWNRHEPQ 63
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
++DF+ DL F+ + GL I+R GPY+CAEW GG P WL G LR+ +
Sbjct: 64 PGEFDFSEEADLGAFLDLCHELGLKAIVRPGPYICAEWENGGLPAWL-TASGDMRLRSDD 122
Query: 148 KVFMNE-MQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
F + ++ F TL+ + ++ GGPIIL QIENE+ YG ++ A
Sbjct: 123 PAFRDAVLRWFDTLMPILVPRQY---PHGGPIILCQIENEHW-ASGVYG--ADTHQQTLA 176
Query: 207 KMATSLDIGVPWIMCQESDAPSPMFTPNN---------------PNSPKIWTENWTGWFK 251
+ A I VP C + P F N P++P I +E W+GWF
Sbjct: 177 QAALERGIVVPQYTCVGAMPGYPEFR-NGWSGIAEKLVQTRQLWPDNPLIVSELWSGWFD 235
Query: 252 SWGG-KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNF----GRTSGGP--YLTTSY 304
+WGG + ++TA L + + G +++M+ GGTNF GRT GG ++TTSY
Sbjct: 236 NWGGHRQTRKTAAKLDMTLHQLTAVGCAGFSHWMWAGGTNFGFWGGRTVGGDLIHMTTSY 295
Query: 305 DYDAPIDEYGHLN 317
DYDAP+DEYG L
Sbjct: 296 DYDAPVDEYGRLT 308
>gi|422735885|ref|ZP_16792151.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
gi|315167420|gb|EFU11437.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
Length = 604
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 108/332 (32%), Positives = 160/332 (48%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 22 LNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 81
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 82 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 139
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L + GG I++ QIENEYG+ +G+ K+Y+ + + + P
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAP 192
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 193 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 249
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGG NFG +G
Sbjct: 250 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGINFGFMNGCSARGTIDL 307
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 308 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|395541292|ref|XP_003772579.1| PREDICTED: beta-galactosidase [Sarcophilus harrisii]
Length = 673
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 115/327 (35%), Positives = 166/327 (50%), Gaps = 31/327 (9%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
+ + ++G DG+ +SGSIHY R W D + K K GL+AIETYV WN HE
Sbjct: 60 TFTIDYEGDQFLKDGKPFRYISGSIHYSRIPRFYWKDRLFKMKMAGLNAIETYVPWNFHE 119
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P QY F+G DL F++ + + GL VILR GPY+CAEW+ GG PVWL I LR+
Sbjct: 120 PFPGQYQFSGEQDLEYFLQLVHEVGLLVILRPGPYICAEWDMGGLPVWLLEKKSIF-LRS 178
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS-DYG------DAG 198
++ ++ + + L V + K + GGPII Q+ENEYG+ + DY
Sbjct: 179 SDPDYLKAVDKW--LEVLLPKMKPYLYQNGGPIITVQVENEYGSYFACDYNYLRFLLKVF 236
Query: 199 KSYINWCAKMATSLDIGVPWIMC---QESDAPSPMFTPNN------------PNSPKIWT 243
+ ++ + T+ G ++ C Q+ A T +N P P + +
Sbjct: 237 RQHLGEEVVLFTTDGAGENYLKCGTLQDLYATVDFGTSSNITQAFMIQRKVEPKGPLVNS 296
Query: 244 ENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYLT 301
E +TGW WG + +++ ++ G N YM+ GGTNFG +G PYL
Sbjct: 297 EFYTGWLDHWGESHQTVSTKNIVASLTDMLSRGANV-NLYMFIGGTNFGFWNGANMPYLP 355
Query: 302 --TSYDYDAPIDEYGHLNQPKWGHLRE 326
TSYDYDAP+ E G L + K+ +RE
Sbjct: 356 QPTSYDYDAPLSEAGDLTE-KYYAVRE 381
>gi|195342884|ref|XP_002038028.1| GM17976 [Drosophila sechellia]
gi|194132878|gb|EDW54446.1| GM17976 [Drosophila sechellia]
Length = 672
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 109/365 (29%), Positives = 174/365 (47%), Gaps = 55/365 (15%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + H+ +DG+ +SGS HY R+ P W ++ + GL+A++TYV W+ H P
Sbjct: 46 FTIDHEANTFLLDGQPFRYVSGSFHYFRAVPESWRSRLRTMRASGLNALDTYVEWSLHNP 105
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+Y++ G DL++F++ Q++ Y+ILR GPY+CAE + GG P WL ++RT
Sbjct: 106 HDGEYNWEGIADLVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFTKYPSIKMRTN 165
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW-- 204
+ +++E+ + + M + + LF GG II+ Q+ENEYG+ D+ Y+NW
Sbjct: 166 DPNYISEVGKWYAEL--MPRLQHLFVGNGGKIIMVQVENEYGDYACDH-----DYLNWLR 218
Query: 205 ------------------------CAKMA---TSLDIGVPWIMCQESDAPSPMFTPNNPN 237
C K+ + D G+ I E D M P
Sbjct: 219 DETEKYVSGKALLFTVDIPNEKMSCGKIENVFATTDFGIDRI--NEIDKIWAMLRALQPT 276
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
P + +E + GW W ++ +R +++A A+ + + N YM+ GGTNFG T+G
Sbjct: 277 GPLVNSEFYPGWLTHWQEQNQRRDGQEVANALRTILSYNASV-NLYMFFGGTNFGFTAGA 335
Query: 298 PY----------LTTSYDYDAPIDEYG------HLNQPKWGHLRELHKLLKSMEKTLTYG 341
Y TSYDYDA +DE G +L + G L ++ + K L YG
Sbjct: 336 NYNLDGGIGYAADITSYDYDAVMDEAGGVTTKYNLVKAVIGEFLPLPEITLNPAKRLAYG 395
Query: 342 NVTNT 346
V T
Sbjct: 396 RVELT 400
Score = 43.9 bits (102), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 36/78 (46%), Gaps = 29/78 (37%)
Query: 609 LNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWY 668
LN+ G GKG A+VNG+NLGRYWP GP Q+
Sbjct: 590 LNMAGWGKGVAYVNGFNLGRYWPV---------------AGP--------------QVTL 620
Query: 669 HVPRSWIKDGVNTLVLFE 686
+VP +K G N+L++ E
Sbjct: 621 YVPNEILKVGENSLMILE 638
>gi|318077940|ref|ZP_07985272.1| beta-galactosidase [Streptomyces sp. SA3_actF]
Length = 588
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 113/323 (34%), Positives = 154/323 (47%), Gaps = 46/323 (14%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
++VS +G ++DG LLSG++HY R P WP ++ + GL+ +ETYV WN HEP
Sbjct: 2 FQVSPEG--FSLDGRPLRLLSGALHYFRVLPEQWPHRLRMLRALGLNTVETYVPWNFHEP 59
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+DFTG DL F+ +D GL+ I+R PY+CAEW GG P WL P + LR
Sbjct: 60 RPGHHDFTGQADLDAFLHATRDAGLHAIVRPSPYICAEWENGGLPWWLLADPEVRALRCQ 119
Query: 147 NKVFMNEMQN-FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
+ ++ + + LI +A + +QGG +++ Q+ENEYG+ +D G Y+
Sbjct: 120 DPAYLAHVDRWYDALIPRLAAHQ---VTQGGNVVMMQVENEYGSYGTDTG-----YLEHL 171
Query: 206 AKMATSLDIGVPWIMCQESD--------APSPMFTPN---------------NPNSPKIW 242
A I VP D P + T N P+ P +
Sbjct: 172 ADGMRRRGIDVPLFTSDGPDDFFLTGGTLPGHLATVNFGSRPAQAFAGLKRLRPHDPPMC 231
Query: 243 TENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGR--------- 293
E W GWF WG R A + +A GG+ N YM HGGTNF
Sbjct: 232 AEFWCGWFDHWGAPRTVRDAAEATEELAATLGAGGSV-NVYMAHGGTNFSTWAGANTEDP 290
Query: 294 TSGGPYL--TTSYDYDAPIDEYG 314
+G YL TSYDYDAPIDE G
Sbjct: 291 ATGAGYLPTVTSYDYDAPIDERG 313
>gi|318059605|ref|ZP_07978328.1| beta-galactosidase [Streptomyces sp. SA3_actG]
Length = 588
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 113/323 (34%), Positives = 154/323 (47%), Gaps = 46/323 (14%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
++VS +G ++DG LLSG++HY R P WP ++ + GL+ +ETYV WN HEP
Sbjct: 2 FQVSPEG--FSLDGRPLRLLSGALHYFRVLPEQWPHRLRMLRALGLNTVETYVPWNFHEP 59
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+DFTG DL F+ +D GL+ I+R PY+CAEW GG P WL P + LR
Sbjct: 60 RPGHHDFTGQADLDAFLHATRDAGLHAIVRPSPYICAEWENGGLPWWLLADPEVRALRCQ 119
Query: 147 NKVFMNEMQN-FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
+ ++ + + LI +A + +QGG +++ Q+ENEYG+ +D G Y+
Sbjct: 120 DPAYLAHVDRWYDALIPRLAAHQ---VTQGGNVVMMQVENEYGSYGTDTG-----YLEHL 171
Query: 206 AKMATSLDIGVPWIMCQESD--------APSPMFTPN---------------NPNSPKIW 242
A I VP D P + T N P+ P +
Sbjct: 172 ADGMRRRGIDVPLFTSDGPDDFFLTGGTLPGHLATVNFGSRPAQAFAGLKRLRPHDPPMC 231
Query: 243 TENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGR--------- 293
E W GWF WG R A + +A GG+ N YM HGGTNF
Sbjct: 232 AEFWCGWFDHWGAPRTVRDAAEATEELAATLGAGGSV-NVYMAHGGTNFSTWAGANTEDP 290
Query: 294 TSGGPYL--TTSYDYDAPIDEYG 314
+G YL TSYDYDAPIDE G
Sbjct: 291 ATGAGYLPTVTSYDYDAPIDERG 313
>gi|288928311|ref|ZP_06422158.1| beta-galactosidase (Lactase) [Prevotella sp. oral taxon 317 str.
F0108]
gi|288331145|gb|EFC69729.1| beta-galactosidase (Lactase) [Prevotella sp. oral taxon 317 str.
F0108]
Length = 674
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 118/331 (35%), Positives = 170/331 (51%), Gaps = 41/331 (12%)
Query: 32 DGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQY 91
DG+ + +G+ L SG +HY R W +K K GL+A+ TYVFWN HE ++
Sbjct: 86 DGQFV-YNGKPMQLHSGEMHYARVPAPYWRHRMKMMKAMGLNAVATYVFWNYHETEPGKW 144
Query: 92 DF-TGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVF 150
D+ TGN +L +F+KT ++G+ VILR GPY CAEW +GG+P WL G+ +R N+ F
Sbjct: 145 DWKTGNRNLRQFVKTAAEEGMLVILRPGPYCCAEWEFGGYPWWLSKAKGL-VIRADNQPF 203
Query: 151 MNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAG-KSYINWCAKMA 209
++ + + + + ++ ++GGPII+ Q ENE+G+ ++ D +++ + AK+
Sbjct: 204 LDSCRVYINQLASQMRDLQI--TKGGPIIMVQAENEFGSYVAQRKDIPLETHRAYSAKIK 261
Query: 210 TS-LDIG--VPWIMCQ-------------------ESDAPSPMFTPNNPN---SPKIWTE 244
LD G VP ESD N N P + E
Sbjct: 262 QQLLDAGFDVPLFTSDGSWLFKGGTIEGALPTANGESDIEKLKKVVNEYNGGKGPYMVAE 321
Query: 245 NWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT--- 301
+ GW W P+ + E + A++ + G +F NYYM HGGTNFG TSG Y T
Sbjct: 322 FYPGWLSHWAEPFPQVSTESIVKQTAKYLENGISF-NYYMVHGGTNFGFTSGANYTTATN 380
Query: 302 -----TSYDYDAPIDEYGHLNQPKWGHLREL 327
TSYDYDAPI E G N PK+ LR L
Sbjct: 381 LQPDLTSYDYDAPISEAG-WNTPKYDALRAL 410
>gi|423226297|ref|ZP_17212763.1| hypothetical protein HMPREF1062_04949 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392629725|gb|EIY23731.1| hypothetical protein HMPREF1062_04949 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 1106
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 109/313 (34%), Positives = 151/313 (48%), Gaps = 29/313 (9%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++ + +HYPR W IK K G++ + YVFWN+HEP YDFT
Sbjct: 359 LNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGVYDFTEQN 418
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL F + Q +YVILR GPYVCAEW GG P WL + LR ++ F+ + F
Sbjct: 419 DLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDVR-LRESDPYFIERVALF 477
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGN--------------VMSDYGDAGKSY-I 202
+ K L + GGPII+ Q+ENEYG+ V +++G+ +
Sbjct: 478 EEAVAKQVKD--LTIANGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANFGNGIALFQC 535
Query: 203 NWCAKMATSLDIGVPWIM----CQESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGKDP 258
+W + + + W M D PNSP + +E W+GWF WG
Sbjct: 536 DWASNFTLNGLDDLIWTMNFGTGANVDQQFAKLKQLRPNSPLMCSEFWSGWFDKWGANHE 595
Query: 259 KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYL---TTSYDYDAPIDEY 313
R A D+ + G +F + YM HGGTN+G +G P TSYDYDAPI E
Sbjct: 596 TRPAADMIKGIDDMLSRGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAPISES 654
Query: 314 GHLNQPKWGHLRE 326
G PK+ LRE
Sbjct: 655 GQ-TTPKYWALRE 666
>gi|348573621|ref|XP_003472589.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
3-like [Cavia porcellus]
Length = 679
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 117/339 (34%), Positives = 165/339 (48%), Gaps = 39/339 (11%)
Query: 17 LQTLFNLSLAYRVSHDGRA-ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAI 75
LQ F L L + GRA T++G + ++ GSIHY R W D + K K G + +
Sbjct: 83 LQRRF-LGLGTASTTKGRAHFTLEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTV 141
Query: 76 ETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLH 135
TY+ WN HEP R ++ F+GNLDL F+ + GL+VILR GPY+CAE + GG P WL
Sbjct: 142 TTYIPWNLHEPQRGKFVFSGNLDLEAFVLLAAEIGLWVILRPGPYICAEIDLGGLPSWLL 201
Query: 136 NMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYG 195
P +LRTT + F++ + + + M + L GGP+I Q+ENEYG+ D
Sbjct: 202 QNPK-TQLRTTERTFVDAVDAYFDHL--MRRMVPLQYHHGGPVIAVQVENEYGSFNRD-- 256
Query: 196 DAGKSYINWCAKMATSLDIGVPWIMCQ------ESDAPSPMFTPN--------------- 234
Y+ + + I C + T N
Sbjct: 257 ---GQYMAYLKEALLKRGIVELLFTCDYYKDVVNGSLKGVLATVNLGSLGKNSFYQLLQV 313
Query: 235 NPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGR- 293
+ P + E W GW+ SWG ++A ++A V+ F + G +F N YM+HGGTNFG
Sbjct: 314 QSHKPILIMEYWVGWYDSWGLPHANKSAAEVAHTVSTFIKNGISF-NVYMFHGGTNFGFI 372
Query: 294 -----TSGGPYLTTSYDYDAPIDEYGHLNQPKWGHLREL 327
G +TTSYDYDA + E G + K+ LREL
Sbjct: 373 NAAGIVEGRRSVTTSYDYDAVLSEAGDYTE-KYFKLREL 410
>gi|323353539|ref|ZP_08088072.1| beta-galactosidase [Streptococcus sanguinis VMC66]
gi|322121485|gb|EFX93248.1| beta-galactosidase [Streptococcus sanguinis VMC66]
Length = 592
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 110/336 (32%), Positives = 162/336 (48%), Gaps = 48/336 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +LSG+I Y R P W D + K G + +ETY+ W HEP Q+ G L
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D + K +++ GLY+I+R PY+CAE+++GG P WL P + LR + +F+ ++ +F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSM-RLRVNHPLFLEKVSHF 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ K + QGG I++ Q+ENEYG+ D K+Y+ A+M + VP
Sbjct: 131 YDWL--FPKLLPYQSDQGGTILMMQVENEYGSYAED-----KAYMRSIAQMMKVRGVTVP 183
Query: 218 -------WIMCQESDA--PSPMFTPNNPNS--------------------PKIWTENWTG 248
WI ES +F N S P + TE W G
Sbjct: 184 LFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF W + +R AEDLA V + Q G N ++ GGTNFG SG P +
Sbjct: 244 WFSRWSEEIVRREAEDLAQDVKKMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
TSYD+DAPI E+G + + R H++ +E+
Sbjct: 302 -TSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELEQ 336
>gi|219117911|ref|XP_002179741.1| beta-galactosidase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408794|gb|EEC48727.1| beta-galactosidase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 951
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 116/394 (29%), Positives = 173/394 (43%), Gaps = 83/394 (21%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAH---- 84
VS+D RAI I+ +R +LLSGS+H R+T G W + +A GL+ I Y+FW AH
Sbjct: 150 VSYDERAIRINDKRVLLLSGSMHPVRATRGTWEHALDEAVYNGLNMITVYIFWGAHQSFR 209
Query: 85 -EPLRRQYDFTG------NLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNM 137
EPL D + +L +++ ++GL++ +RIGPY C E+ YGG P WL
Sbjct: 210 DEPLNWSLDGSSIGPKESQWELADALRSAANRGLFIHVRIGPYACGEYTYGGIPEWLPLQ 269
Query: 138 PGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENE----------- 186
+R N+ +++ M+ F + L+A QGGPI++AQIENE
Sbjct: 270 SSTMRMRRLNRPWLDAMEGFVAATITYLSSFNLWAHQGGPILIAQIENELGSGVDGSAAA 329
Query: 187 -------------------------YGNVMSDYGDAG----------KSYINWCAKMATS 211
YG+++ + G + Y +WC +
Sbjct: 330 NYVVLERDEFNDDKHEDSHLLQLDRYGHILENASSRGMDSELRNATVQDYADWCGNLVAR 389
Query: 212 LDIGVPWIMCQESDAPSPMFTPNNPN-----------------SPKIWTENWTGWFKSWG 254
L V W MC A + + T N N P IWTE+ G F+ WG
Sbjct: 390 LAPNVIWTMCNGLSAENTISTFNGNNGIDWLEKYGDSGRIQVDQPAIWTED-EGGFQLWG 448
Query: 255 GKDPK-------RTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYD 307
+ K RT+ +A ++F GGT NYYM+ GG N GR+S + +Y D
Sbjct: 449 DQPSKPSDYFWGRTSRAMATDALQWFARGGTHLNYYMWWGGYNRGRSSAAG-IMNAYATD 507
Query: 308 APIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYG 341
A + G PK+ H LH ++ + L +
Sbjct: 508 AFLCSSGQRRHPKYDHFLALHLVIADIAAILLHA 541
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 71/278 (25%), Positives = 119/278 (42%), Gaps = 60/278 (21%)
Query: 436 SDYLWYMTNADLKDDDPILSGSSNMTLRINSS-GQVLHAYVNGNYV-DSQWTKYGASNDL 493
SDY WY T+ + D +LS + L I + L +++G ++ ++ ++ +
Sbjct: 686 SDYAWYGTDVKI---DVVLS---QVKLYIGTEKATALAVFIDGAFIGEANNHQHAEGPTV 739
Query: 494 FERPVK-LTRGKNQISLLSATVGLQN----YGSKFDMVPNGIPGPVLLVGRAGDETI-IK 547
++ L G +++++L ++G N +G+ P GI G VL+ E I +
Sbjct: 740 LSIEIESLAAGTHRLAILCESLGYHNLIGRWGAITTAKPKGITGNVLIGSPLLSENISLV 799
Query: 548 DLSSHKWTY-------KVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEA 600
D W+ K +GL + F +A A +E G PL W F +
Sbjct: 800 DGRQMWWSLPGLSVERKAARHGLRRESFEDA--AQAEAGLH----PL-----WSSVLFTS 848
Query: 601 PLENDPV---VLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCA 657
P + V L+L G+G W+NG +LGRYW RG +D
Sbjct: 849 PQFDSTVHSLFLDLTS-GRGHLWLNGKDLGRYWNI--------------TRGNSWNDY-- 891
Query: 658 YNCGNPSQIWYHVPRSWIK-DG-VNTLVLFEEFGGNPS 693
SQ +Y +P ++ DG +N L+LF+ GG+ S
Sbjct: 892 ------SQRYYFLPADFLHLDGQLNELILFDMLGGDHS 923
>gi|290956543|ref|YP_003487725.1| glycosyl hydrolase family 42 [Streptomyces scabiei 87.22]
gi|260646069|emb|CBG69162.1| putative glycosyl hydrolase (family 42) [Streptomyces scabiei
87.22]
Length = 591
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 112/333 (33%), Positives = 168/333 (50%), Gaps = 43/333 (12%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A S DG ++GE ++SG++HY R P +W D ++KA+ GL+ +ETYV WN H+
Sbjct: 5 ALTTSSDG--FLLNGEPFRIVSGAMHYFRIHPDLWADRLRKARLMGLNTVETYVPWNLHQ 62
Query: 86 P-LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELR 144
P G LDL R++ + +GL+V+LR GPY+CAEW+ GG P WL + PGI LR
Sbjct: 63 PDPDSPLVLDGLLDLPRYLSLARAEGLHVLLRPGPYICAEWDGGGLPSWLTSDPGI-RLR 121
Query: 145 TTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW 204
+++ F + + + L + + A+ GGP+I Q+ENEYG YGD +Y+
Sbjct: 122 SSDPRFTDALDGY--LDILLPPLLPYMAANGGPVIAVQVENEYGA----YGD-DTAYLKH 174
Query: 205 CAKMATSLDIGVPWIMCQESDA---------PSPMFT---------------PNNPNSPK 240
+ + + C ++ + P + T + P P
Sbjct: 175 VHQALRARGVEELLFTCDQAGSGHHLAAGSLPGVLSTATFGGKIEESLAALRAHMPEGPL 234
Query: 241 IWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY- 299
+ +E W GWF WG + R AE A + + G + N YM+HGGTNFG T+G +
Sbjct: 235 MCSEFWIGWFDHWGEEHHVRDAESAAADLDKLLAAGASV-NIYMFHGGTNFGFTNGANHD 293
Query: 300 -----LTTSYDYDAPIDEYGHLNQPKWGHLREL 327
+ TSYDYDA + E G PK+ RE+
Sbjct: 294 QCYAPIVTSYDYDAALTESGDPG-PKYHAFREV 325
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 57/205 (27%), Positives = 80/205 (39%), Gaps = 55/205 (26%)
Query: 497 PVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKW-T 555
P+++ R + +L +G NYG + G+ GPV G A H W T
Sbjct: 433 PLRVPRVGATLDVLVENMGGVNYGPRIGAA-KGLLGPVTFNGTA----------LHGWDT 481
Query: 556 YKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMG 615
+++ L L F A+AA VP +++ TFE D L+L G
Sbjct: 482 HRLPLADLSAVPFAPAEAA-------PVTVP-----AFHRGTFEIDTPAD-TFLSLPGWT 528
Query: 616 KGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWI 675
KG AW+NG++LGRYW RGP Q +VP +
Sbjct: 529 KGQAWINGFHLGRYW----------------NRGP--------------QRTLYVPGPVL 558
Query: 676 KDGVNTLVLFEEFGGNPSQINFQTV 700
+ G N LVL E S+ F V
Sbjct: 559 RPGANELVLLELNATTSSRAEFTDV 583
>gi|422881390|ref|ZP_16927846.1| beta-galactosidase [Streptococcus sanguinis SK355]
gi|332364328|gb|EGJ42102.1| beta-galactosidase [Streptococcus sanguinis SK355]
Length = 592
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 110/336 (32%), Positives = 161/336 (47%), Gaps = 48/336 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +LSG+I Y R P W D + K G + +ETY+ W HEP Q+ G L
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D + K +++ GLY+I+R PY+CAE+++GG P WL P + LR + +F+ ++ +F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSM-RLRVNHPLFLEKVSHF 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ K + Q GPI++ Q+ENEYG+ D K+Y+ A+M + VP
Sbjct: 131 YDWL--FPKLLPYQSDQDGPILMMQVENEYGSYAED-----KAYMRSIAQMMKVRGVTVP 183
Query: 218 -------WIMCQESDA--PSPMFTPNNPNS--------------------PKIWTENWTG 248
WI ES +F N S P + TE W G
Sbjct: 184 LFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF W + +R AEDLA V Q G N ++ GGTNFG SG P +
Sbjct: 244 WFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
TSYD+DAPI E+G + + R H++ +E+
Sbjct: 302 -TSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELEQ 336
>gi|422877900|ref|ZP_16924370.1| beta-galactosidase [Streptococcus sanguinis SK1056]
gi|332358593|gb|EGJ36417.1| beta-galactosidase [Streptococcus sanguinis SK1056]
Length = 592
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 111/336 (33%), Positives = 161/336 (47%), Gaps = 48/336 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +LSG+I Y R P W D + K G + +ETY+ W HEP Q+ G L
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D + K +++ GLY+I+R PY+CAE+++GG P WL P + LR + +F+ ++ +F
Sbjct: 72 DFEAYFKLVKEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSM-RLRVNHPLFLEKVSHF 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ K + QGGPI++ Q+ENEYG+ D K+Y+ A+M + VP
Sbjct: 131 YDWL--FPKLLPYQSDQGGPILMMQVENEYGSYAED-----KAYMRSIAQMMKVRGVTVP 183
Query: 218 -------WIMCQESDA--PSPMFTPNNPNS--------------------PKIWTENWTG 248
WI ES +F N S P + TE W G
Sbjct: 184 LFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF W + R AEDLA V Q G N ++ GGTNFG SG P +
Sbjct: 244 WFSRWSEEIVWREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
TSYD+DAPI E+G + + R H++ +E+
Sbjct: 302 -TSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELEQ 336
>gi|449489521|ref|XP_004174618.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein 2
[Taeniopygia guttata]
Length = 635
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 190/690 (27%), Positives = 276/690 (40%), Gaps = 171/690 (24%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G + GS+HY R W D + K + GL+ + TYV WN HE R ++DF+ NL
Sbjct: 55 LEGMPFRIFGGSMHYFRVPREYWEDRMLKMRACGLNTLTTYVPWNLHEKERGKFDFSKNL 114
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL +T GL+VILR GPY+C+EW+ GG P WL P + +LRTT K F + +
Sbjct: 115 DLRYVAQTALXNGLWVILRPGPYICSEWDLGGLPSWLLQDPEM-QLRTTYKGFTEAVDAY 173
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + L +GGPII Q+ENEYG+ D +Y+ + KMA L+ G+
Sbjct: 174 FDRLMRVVV--PLQYKKGGPIIAVQVENEYGSYAKD-----PNYMTYV-KMAL-LNRGIV 224
Query: 218 WIMCQ------------ESDAPSPMFTPNNP-----------NSPKIWTENWTGWFKSWG 254
++ E + F P + PK+ E WTGWF +WG
Sbjct: 225 ELLMTSDNKNGLSFGLVEGALATVNFQKLEPGLLKYLDTVQKDQPKMVMEYWTGWFDNWG 284
Query: 255 GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYDYDA 308
G A+++ VA + G + N YM+HGGTNFG SG TSYDYDA
Sbjct: 285 GPHYVFDADEMVNTVASILKTGASI-NLYMFHGGTNFGFMSGALEADEYKSDVTSYDYDA 343
Query: 309 PIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSV--SI 366
+ E G K+ LR+L ++ L + YG + +L W V ++
Sbjct: 344 VLTEAGDYTS-KFFKLRQLFSMVIGQPLPLPPMIESKASYGAILLHQYISL--WDVLPAL 400
Query: 367 LPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTL 426
L K+ EF N N V +P ++ + V+ G GH
Sbjct: 401 LQPIKS-EFPVNMENLPLNASVGQPYGY--------------VLYETVIFGGGHL----- 440
Query: 427 IDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTK 486
T D + QV +VN YV
Sbjct: 441 ----HTRD----------------------------HVRDRAQV---FVNTVYVGE---- 461
Query: 487 YGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETII 546
+ E + +G Q+ +L G NYG + G+ G V L ++T +
Sbjct: 462 --LDYNTVELSIPEGQGFRQLRILVENRGRVNYGLALNEQRKGLIGDVFL-----NKTPL 514
Query: 547 KDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVP-------LNRRMTWYKTTFE 599
++ +Y L+ K + + S GWS+ VP R W
Sbjct: 515 RNFK---------IYSLEMKPSFMKRFHVS--GWST--VPDYFVGPAFFRGRLW------ 555
Query: 600 APLENDP--VVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCA 657
+E P L LQG KG +VNG NLGRYW GP
Sbjct: 556 --IEQQPQDTFLKLQGWEKGVVFVNGQNLGRYWKI----------------GP------- 590
Query: 658 YNCGNPSQIWYHVPRSWIKDGVNTLVLFEE 687
Q ++P W++ G N +V+FEE
Sbjct: 591 -------QETLYLPGPWLRRGGNEIVIFEE 613
>gi|300795929|ref|NP_001178947.1| beta-galactosidase-1-like protein 2 [Rattus norvegicus]
Length = 652
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 190/677 (28%), Positives = 271/677 (40%), Gaps = 159/677 (23%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+L GSIHY R W D + K K GL+ + TYV WN HEP R ++DF+GNLDL FI
Sbjct: 79 ILGGSIHYFRVPREYWRDRLLKLKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFIW 138
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
GL+VILR GPY+C+E + GG P WL P + +LRTT F + + + M
Sbjct: 139 LAAKIGLWVILRPGPYICSEIDLGGLPSWLLQDPDM-KLRTTYPGFTKAVDLYFDHL--M 195
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYG----------DAGKSYINWCAKMATSLDI 214
++ L GGPII Q+ENEYG+ D+ D G + + L+
Sbjct: 196 SRVVPLQYKHGGPIIAVQVENEYGSYNGDHAYMPYIKKALEDRGIIEMLLTSDNKDGLEK 255
Query: 215 GVPWIMC--------QESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLA 266
GV + QE A + + PK+ E WTGWF SWGG + ++
Sbjct: 256 GVVDGVLATINLQSQQELVALNSILLSIQGIQPKMVMEYWTGWFDSWGGSHNILDSSEVL 315
Query: 267 FAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYDYDAPIDEYGHLNQPK 320
V+ + G + N YM+HGGTNFG +G + TSYDYDA + E G K
Sbjct: 316 QTVSAIIKDGSSI-NLYMFHGGTNFGFINGAMHFGDYKADVTSYDYDAILTEAGDYTA-K 373
Query: 321 WGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKV 380
+ LREL + Y V+ S Y L W + D +
Sbjct: 374 YTKLRELFGTFSGVPPPPPPELTAKMVY-EPVTPSFY-LSLWDALLYMD--------KPI 423
Query: 381 NTQTNVKVKR-PNQAGNDQAPLQWKWRPEMINDFV----VRGKGHFALNTLIDQKSTNDV 435
++ + ++ P GN QA + + + V VR +G LN +
Sbjct: 424 TSEIPINMENLPVNNGNGQAFGYVLYETTIFSSGVLSGLVRDRGQVFLNRV--------S 475
Query: 436 SDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFE 495
+L Y T K P+ G + + + + + G+V + GN +DSQ + L
Sbjct: 476 IGFLDYTTT---KITIPLTQGYTILRILVENRGRVNY----GNNIDSQRKGLIGNLYLNN 528
Query: 496 RPVKLTRGKNQISLLSATVGLQNYGSKFDM-----VPNGIPGPVLLVGRAGDETIIKDLS 550
+P+K +I L T + + +FDM VP I P +G
Sbjct: 529 KPLK----NFKIYSLDMT---KQFFQRFDMDKWSVVPKEITFPAFFLG------------ 569
Query: 551 SHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLN 610
T VG+Y D TF L
Sbjct: 570 ----TLSVGIYPSD--------------------------------TF----------LK 583
Query: 611 LQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHV 670
L+G KG VN +NLGRYW N G P + Y +
Sbjct: 584 LEGWVKGVVLVNDHNLGRYW----------------------------NVG-PQETLY-L 613
Query: 671 PRSWIKDGVNTLVLFEE 687
P W+ G+N +++FEE
Sbjct: 614 PGVWLDKGLNKVIIFEE 630
>gi|359496328|ref|XP_003635211.1| PREDICTED: beta-galactosidase 6-like [Vitis vinifera]
gi|296080974|emb|CBI18606.3| unnamed protein product [Vitis vinifera]
Length = 198
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 89/201 (44%), Positives = 117/201 (58%), Gaps = 24/201 (11%)
Query: 614 MGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRS 673
MGKG AWVNG ++GRYWP YLA GC+T +CDYRG Y + KC NCG P+Q YH+PR+
Sbjct: 1 MGKGQAWVNGQSIGRYWPAYLAPSTGCTT-NCDYRGAYDASKCLRNCGQPAQTLYHIPRT 59
Query: 674 WIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHE------------------NKTM 715
W+ G N LVL EE GG+PS+I+ T C E + +
Sbjct: 60 WVHSGKNLLVLHEELGGDPSKISLLTRTGQEVCAHVSEADPPPADSWQPNLEFMSQSSQV 119
Query: 716 ELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEIDVLPLIEKQCVGKKSCSIEASEA 774
LTC G IS I +ASFG P+G CG F G+C A +VL ++++ C+G++ C+I S A
Sbjct: 120 RLTCEQGWHISMINFASFGTPRGHCGTFNPGNCHA--NVLSVVQQACIGQEGCAIPVSTA 177
Query: 775 NLGATSCAAGTVKRLVVEALC 795
LG G +K L +EALC
Sbjct: 178 RLGDP--CPGVLKSLAIEALC 196
>gi|325925751|ref|ZP_08187124.1| beta-galactosidase [Xanthomonas perforans 91-118]
gi|325543808|gb|EGD15218.1| beta-galactosidase [Xanthomonas perforans 91-118]
Length = 611
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 112/326 (34%), Positives = 163/326 (50%), Gaps = 41/326 (12%)
Query: 33 GRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYD 92
G DG+ +LSG+IH+ R W D ++KA+ GL+ +ETYVFWN EP + Q+D
Sbjct: 34 GTQFVRDGKPYQVLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFD 93
Query: 93 FTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMN 152
F+GN D+ F++ QGL VILR GPY CAEW GG+P WL I +R+ + F+
Sbjct: 94 FSGNNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIR-VRSRDPRFLA 152
Query: 153 EMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYG---DAGKSYIN------ 203
Q++ + + + L GGPII Q+ENEYG+ D+ D Y+
Sbjct: 153 ASQSYLDALAK--QVQPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKA 210
Query: 204 --WCAKMATSLDIG-VPWIMCQESDAPSPM------FTPNNPNSPKIWTENWTGWFKSWG 254
+ + A L G +P + + AP P+ P++ E W GWF WG
Sbjct: 211 LLFTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWG 270
Query: 255 ----GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL---------- 300
D ++ AE+ + + + G N YM+ GGT+FG +G +
Sbjct: 271 KPHAATDARQQAEEFEWILRQ-----GHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQ 325
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRE 326
TTSYDYDA +DE GH PK+ +R+
Sbjct: 326 TTSYDYDAILDEAGHPT-PKFALMRD 350
Score = 42.7 bits (99), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 34/136 (25%), Positives = 60/136 (44%), Gaps = 30/136 (22%)
Query: 498 VKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAG--DETIIKDLSSHKW- 554
V++ G++ + +L G NYG++ + GRAG D ++ + W
Sbjct: 455 VEIPAGQHTLDVLVENSGRINYGTR------------MADGRAGLVDPVLLDNQQLTGWQ 502
Query: 555 TYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGM 614
+ + + + +S RGW+ K V + +++ D L+++
Sbjct: 503 AFPLPM-----------RTPDSIRGWTRKAV---QGPAFHRGALRIGTPTD-TYLDMRAF 547
Query: 615 GKGFAWVNGYNLGRYW 630
GKGFAW NG NLGR+W
Sbjct: 548 GKGFAWANGVNLGRHW 563
>gi|325845662|ref|ZP_08168945.1| putative beta-galactosidase [Turicibacter sp. HGF1]
gi|325488263|gb|EGC90689.1| putative beta-galactosidase [Turicibacter sp. HGF1]
Length = 589
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 114/349 (32%), Positives = 168/349 (48%), Gaps = 49/349 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ ++SG+IHY R P W + K G + +ETYV WN HE Q+DFTG
Sbjct: 12 VDGKPTRIMSGAIHYFRIMPDHWEHSLYNLKALGFNTVETYVPWNLHEMREGQFDFTGGK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL+ F+K ++ GL VILR GPY+CAEW GG P WL N + ++R +++F+ +++N+
Sbjct: 72 DLVSFVKKAEEIGLMVILRPGPYICAEWENGGLPAWLLNYHDM-KIRCDDELFLEKVENY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+++ + L ++GGP+I+ Q+ENEYG+ +D K Y+ KM I VP
Sbjct: 131 FKVLLPLIV--PLQVTKGGPVIMVQVENEYGSFSND-----KLYLRALKKMIEDAGIDVP 183
Query: 218 WI--------------MCQES---------------DAPSPMFTPNNPNSPKIWTENWTG 248
+ +E D ++ P + E W G
Sbjct: 184 LFTSDGAWEQALMSGTLIEEEVLVTANFGSRGNENFDVLQSFMEKHDKKWPLMCMEFWCG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF W R A+++ + Q G N YM+HGGTNFG +G P +
Sbjct: 244 WFNRWNEDIILRDADEVMTCMKELLQRGSL--NLYMFHGGTNFGFMNGSCAGKIGNLPQV 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG 349
TSYDYDA + E+G + K+ +EL K L T T DYG
Sbjct: 302 -TSYDYDAFLTEWGDPTK-KYEAAQELLKELFPDMIQQTPKLRTKKDYG 348
Score = 47.4 bits (111), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 49/197 (24%), Positives = 76/197 (38%), Gaps = 52/197 (26%)
Query: 502 RGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLY 561
+G++++SLL +G NYG++ + P G G + + Y +
Sbjct: 437 QGEHELSLLVENMGRNNYGAR-------LLAPTQRKGIRGGVMVDHHFETEWVQYALSFE 489
Query: 562 GLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWV 621
+ D F +GW N P +Y+ FEA E + L+ +GKG A++
Sbjct: 490 TIGDVDF--------AKGWIP-NTP-----AFYEYEFEAH-ECEDTFLDCSTLGKGVAFI 534
Query: 622 NGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNT 681
N +NLGRYW P Q Y +P +K G+N
Sbjct: 535 NDFNLGRYWSV-----------------------------GPIQYLY-IPGPLLKVGINK 564
Query: 682 LVLFEEFGGNPSQINFQ 698
LVLFE G +I +
Sbjct: 565 LVLFETEGVVAERIALK 581
>gi|195116355|ref|XP_002002721.1| GI11295 [Drosophila mojavensis]
gi|193913296|gb|EDW12163.1| GI11295 [Drosophila mojavensis]
Length = 678
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 177/705 (25%), Positives = 279/705 (39%), Gaps = 154/705 (21%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
++ + H ++G+ ++GS HY R+ P W + ++ + GL+A++TYV W+ H
Sbjct: 51 SFSIDHQANTFLLNGKPFRYVAGSFHYFRALPEAWRNRLRTMRAAGLNALDTYVEWSLHN 110
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P +Y++ G DL++F++ Q++ Y++LR GPY+CAE + GG P WL ++RT
Sbjct: 111 PHDGEYNWEGIADLVKFLEIAQEEDFYIVLRPGPYICAERDNGGLPHWLFTKYPDIKVRT 170
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW- 204
+ ++ E+ + + M + + L GG II+ Q+ENEY + Y Y+NW
Sbjct: 171 NDPRYIAEVSKWYAEL--MPRLKHLLIGNGGKIIMVQVENEY----AAYYACDHDYLNWL 224
Query: 205 -------------------------CAKMA---TSLDIGVPWIMCQESDAPSPMFTPNNP 236
C K+ + D G+ I E D P
Sbjct: 225 RDETDKYVENKALLFTVDIPNERMHCGKIDNVFATTDFGIDRI--HEIDQIWKYLRSVQP 282
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
P + +E + GW W + +R +++A A+ + + N YM+ GGTNFG T+G
Sbjct: 283 TGPLVNSEFYPGWLTHWQEMNQRRDPQEVASALKTILSYNASV-NLYMFFGGTNFGFTAG 341
Query: 297 GPYL----------TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNT 346
Y TSYDYDA +DE G + + ++L+K
Sbjct: 342 ANYDLDGSIGYTADITSYDYDAVMDEAGGVTKK--------YELVKQ------------- 380
Query: 347 DYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWR 406
V G LP +L K + ++ T + A AP++ +
Sbjct: 381 -----VIGEVLELPN---IVLNPAKRLAYGKVELTPTTELLSAEGRAALAKGAPVK-STK 431
Query: 407 PEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINS 466
P+ + +DQ S L Y T D DP L L++
Sbjct: 432 PKSFEE--------------MDQ-----YSGLLLYETPLPSLDLDPTL-------LQVED 465
Query: 467 SGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRG-KNQISLLSATVGLQNYGSKFDM 525
H +V+ V + S + + L++G + + LL G NY D
Sbjct: 466 LRDRAHVFVDQQLVGT------LSREARIYALPLSKGWGSTLQLLVENQGRINYDRANDT 519
Query: 526 VPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNV 585
GI G V L G ++D WT Y L+ N + E ++
Sbjct: 520 --KGIFGKVTLQLHNGGALPLED-----WTTTA--YPLEAITIENWRQKLPENAALDSSI 570
Query: 586 PLNRRM----TWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCS 641
R + Y +F+ D LN G GKG A+VNG+NLGRYWP
Sbjct: 571 AKQRLLRSGPILYTGSFQVSEVGD-TYLNPAGWGKGVAYVNGFNLGRYWP---------- 619
Query: 642 TESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
G P QI +VP +K G N+LVL E
Sbjct: 620 ------------------LGGP-QITLYVPNELLKVGSNSLVLIE 645
>gi|293376766|ref|ZP_06622988.1| glycosyl hydrolase family 35 [Turicibacter sanguinis PC909]
gi|292644632|gb|EFF62720.1| glycosyl hydrolase family 35 [Turicibacter sanguinis PC909]
Length = 589
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 114/349 (32%), Positives = 168/349 (48%), Gaps = 49/349 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ ++SG+IHY R P W + K G + +ETYV WN HE Q+DFTG
Sbjct: 12 VDGKPTRIMSGAIHYFRIMPDHWEHSLYNLKALGFNTVETYVPWNLHEMREGQFDFTGGK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL+ F+K ++ GL VILR GPY+CAEW GG P WL N + ++R +++F+ +++N+
Sbjct: 72 DLVSFVKKAEEIGLMVILRPGPYICAEWENGGLPAWLLNYHDM-KIRCDDELFLEKVENY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+++ + L ++GGP+I+ Q+ENEYG+ +D K Y+ KM I VP
Sbjct: 131 FKVLLPLIV--PLQVTKGGPVIMVQVENEYGSFSND-----KLYLRALKKMIEDAGIDVP 183
Query: 218 WI--------------MCQES---------------DAPSPMFTPNNPNSPKIWTENWTG 248
+ +E D ++ P + E W G
Sbjct: 184 LFTSDGAWEQALMSGTLIEEEVLVTANFGSRGNENFDVLQSFMEKHDKKWPLMCMEFWCG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF W R A+++ + Q G N YM+HGGTNFG +G P +
Sbjct: 244 WFNRWNEDIILRDADEVMTCMKELLQRGSL--NLYMFHGGTNFGFMNGSCAGKIGNLPQV 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG 349
TSYDYDA + E+G + K+ +EL K L T T DYG
Sbjct: 302 -TSYDYDAFLTEWGDPTK-KYEAAQELLKELFPDMIQQTPKLRTKKDYG 348
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 49/197 (24%), Positives = 76/197 (38%), Gaps = 52/197 (26%)
Query: 502 RGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLY 561
+G++++SLL +G NYG++ + P G G + + Y +
Sbjct: 437 QGEHELSLLVENMGRNNYGAR-------LLAPTQRKGIRGGVMVDHHFETEWVQYALSFE 489
Query: 562 GLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWV 621
+ D F +GW N P +Y+ FEA E + L+ +GKG A++
Sbjct: 490 TIGDVDF--------TKGWIP-NTP-----AFYEYEFEAH-ECEDTFLDCSTLGKGVAFI 534
Query: 622 NGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNT 681
N +NLGRYW P Q Y +P +K G+N
Sbjct: 535 NDFNLGRYWSV-----------------------------GPIQYLY-IPGPLLKVGINK 564
Query: 682 LVLFEEFGGNPSQINFQ 698
LVLFE G +I +
Sbjct: 565 LVLFETEGVVAERIALK 581
>gi|300770171|ref|ZP_07080050.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33861]
gi|300762647|gb|EFK59464.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33861]
Length = 638
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 117/325 (36%), Positives = 155/325 (47%), Gaps = 43/325 (13%)
Query: 39 DGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLD 98
DG+ +LSG +HY R W ++ K GL+ + TYVFWN HE ++F G+ D
Sbjct: 44 DGKATRILSGEMHYARIPHQYWKHRLQMVKSMGLNTVATYVFWNFHEESPGNWNFEGDHD 103
Query: 99 LIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFT 158
L FIKT + GL+VILR GPY CAEW++GG+P WL + G+ E+R N F+ +T
Sbjct: 104 LAAFIKTAGEVGLHVILRPGPYACAEWDFGGYPWWLQKIDGL-EIRRDNAKFL----EYT 158
Query: 159 TLIVDMAKKE--KLFASQGGPIILAQIENEYGNVMSDYGD----AGKSYINWCAKMATSL 212
+D KE L + GGPII+ Q ENE+G+ +S D K+Y K
Sbjct: 159 KKYIDRLAKEVGSLQITNGGPIIMVQAENEFGSYVSQRKDIPLEEHKAYNAKIKKQLEEA 218
Query: 213 DIGVPWIMCQES------DAPSPMFTPNNPNS----------------PKIWTENWTGWF 250
VP S P + T N N+ P + E + GW
Sbjct: 219 GFNVPLFTSDGSWLFEGGAIPGALPTANGENNISNLKKVVDQYNNNQGPYMVAEFYPGWL 278
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY--------LTT 302
W K A +A ++ Q +F NYYM HGGTNFG TSG Y T
Sbjct: 279 DHWAEPFAKVDAGRIARQTEKYLQNDISF-NYYMVHGGTNFGFTSGANYNNKSDIQPDIT 337
Query: 303 SYDYDAPIDEYGHLNQPKWGHLREL 327
SYDYDAPI E G PK+ +R +
Sbjct: 338 SYDYDAPISEAG-WTTPKYDSIRTV 361
Score = 44.3 bits (103), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 56/234 (23%), Positives = 94/234 (40%), Gaps = 52/234 (22%)
Query: 459 NMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLF---ERPVKLTRGKNQISLLSATVG 515
N L+I+ Y++G TK G N +F E + + + + +L +G
Sbjct: 432 NGKLKIDGLRDFAVVYIDG-------TKVGELNRVFKNYEMDIDIPFN-STLQILVENMG 483
Query: 516 LQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYK-VGLYGLDDKKFYNAKAA 574
NYGS+ GI PVL+ D I D WT + + + + D
Sbjct: 484 RINYGSEIIHNHKGIISPVLI----NDMEITGD-----WTMQQLPMDKVPDLAGKQTATI 534
Query: 575 NSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYL 634
+ + +SK L + Y+ TF+ D ++++ GKG ++NG N+GRYW T
Sbjct: 535 QNTKVNTSKIATLKGQPVLYQGTFDLKEIGD-TFIDMEKWGKGIVFINGINIGRYWKT-- 591
Query: 635 AEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEF 688
GP Q ++P ++K G N++V+FE+
Sbjct: 592 --------------GP--------------QHTLYIPGPYLKKGSNSIVIFEQL 617
>gi|389856131|ref|YP_006358374.1| beta-galactosidase [Streptococcus suis ST1]
gi|353739849|gb|AER20856.1| Beta-galactosidase [Streptococcus suis ST1]
Length = 590
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 172/697 (24%), Positives = 284/697 (40%), Gaps = 154/697 (22%)
Query: 33 GRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYD 92
G +DGE +LSG+IHY R P W + K G + +ETYV WN HEP + ++
Sbjct: 7 GDQFYLDGEPFKILSGAIHYFRVHPDDWYHSLYNLKALGFNTVETYVPWNMHEPRKGEFC 66
Query: 93 FTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMN 152
+ G LD+ RF+K Q+ GLY I+R PY+CAEW +GG P WL M +R+++ V++
Sbjct: 67 YEGILDIERFLKLAQELGLYAIVRPSPYICAEWEWGGLPAWL--MKEELRVRSSDSVYLQ 124
Query: 153 EMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSL 212
+ + ++ K KL +QGG +++ Q+ENEYG+ YG+ K+Y+ A +
Sbjct: 125 HLDEYYASLI--PKLAKLQLAQGGNVLMFQVENEYGS----YGEE-KAYLRAVAGLMRKH 177
Query: 213 DIGVPWIMCQ-------------ESDA----------------PSPMFTPNNPNSPKIWT 243
+ P E D + F + N P +
Sbjct: 178 GLTAPLFTSDGSWRATLRAGTLIEDDVFVTGNFGSKARENFANMTAFFNEHQKNWPLMCM 237
Query: 244 ENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTS 303
E W GWF WG + +R E++ +V + G N YM+HGG T+
Sbjct: 238 EFWDGWFNRWGDEIIRREPEEMVDSVMECIELGSL--NLYMFHGG-------------TN 282
Query: 304 YDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWS 363
+ + G ++ P+ +Y D + + Y L
Sbjct: 283 FGFMNGCSARGQIDLPQ----------------VTSYDYDAILDEAGNPTKKFYILQQRL 326
Query: 364 VSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFAL 423
+ P+ + E + ++V + +D+ L E ++D V KG +
Sbjct: 327 KEVYPELEYAEPLVKEAKAFSDVSL-------HDKVSLFATL--ENVSDCV---KGFYPK 374
Query: 424 NTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQ 483
N +ST Y+ Y T +L+ D + R+ + + Y +G +V +Q
Sbjct: 375 NMEELDQSTG----YILYRT--ELERDK-----TEAERFRVVDARDRIQIYADGKFVATQ 423
Query: 484 W-TKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGD 542
+ T+ G +L + KLT + +L +G NYG K P G +GR
Sbjct: 424 YQTEIGDDVELDFKDDKLT-----LDILVENMGRVNYGHKL-TAPTQSKG----LGRGA- 472
Query: 543 ETIIKDLS--SHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEA 600
+ DL H TY + L ++D F +GW +Y+ FE
Sbjct: 473 ---MADLHFIGHWATYPLHLESVEDLDF--------SKGWEEGQA------AFYRYQFEL 515
Query: 601 PLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNC 660
E L++ G GKG +VN N+GR+W +GP
Sbjct: 516 D-ELADTYLDMTGFGKGVVFVNNVNIGRFWE----------------KGPI--------- 549
Query: 661 GNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
++ ++P+ ++K G N +++FE G +I+F
Sbjct: 550 -----LYLYIPKGYLKKGANEIIVFETEGKYREKIHF 581
>gi|422822094|ref|ZP_16870287.1| beta-galactosidase [Streptococcus sanguinis SK353]
gi|324990399|gb|EGC22337.1| beta-galactosidase [Streptococcus sanguinis SK353]
Length = 592
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 110/336 (32%), Positives = 161/336 (47%), Gaps = 48/336 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +LSG+I Y R P W D + K G + +ETY+ W HEP Q+ L
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEEML 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D + K +++ GLY+I+R PY+CAE+++GG P WL P + LR + +F+ ++ +F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSM-RLRVNHPLFLEKVSHF 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ K + QGGPI++ Q+ENEYG+ D K+Y+ A+M + VP
Sbjct: 131 YDWL--FPKLLPYQSDQGGPILMMQVENEYGSYAED-----KAYMRSIAQMMKVRGVTVP 183
Query: 218 -------WIMCQESDA--PSPMFTPNNPNS--------------------PKIWTENWTG 248
WI ES +F N S P + TE W G
Sbjct: 184 LFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF W + +R AEDLA V Q G N ++ GGTNFG SG P +
Sbjct: 244 WFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
TSYD+DAPI E+G + + R H++ +E+
Sbjct: 302 -TSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELEQ 336
>gi|270295887|ref|ZP_06202087.1| beta-galactosidase [Bacteroides sp. D20]
gi|270273291|gb|EFA19153.1| beta-galactosidase [Bacteroides sp. D20]
Length = 1106
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 108/317 (34%), Positives = 149/317 (47%), Gaps = 27/317 (8%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++ + +HYPR W IK K G++ I YVFWN+HE +DFTG
Sbjct: 360 LNGKPFVIKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHESQPGVFDFTGQN 419
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL F + Q +YVILR GPYVCAEW GG P WL I LR ++ FM + F
Sbjct: 420 DLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDI-RLRESDPYFMERVGIF 478
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGN--------------VMSDYGDAGKSYIN 203
+ + + GGPII+ Q+ENEYG+ V ++Y +
Sbjct: 479 EKAVAEQVA--GMTIQNGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVALFQCD 536
Query: 204 WCAKMATSLDIGVPWIMCQESDAP-SPMFTPNN---PNSPKIWTENWTGWFKSWGGKDPK 259
W + + + W M + A F P P+SP + +E W+GWF WG
Sbjct: 537 WASNFTKNGLHDLVWTMNFGTGANIDQQFAPLKKLRPDSPLMCSEFWSGWFDKWGANHET 596
Query: 260 RTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYL---TTSYDYDAPIDEYG 314
R A D+ + G +F + YM HGGTN+G +G P TSYDYDAPI E G
Sbjct: 597 RPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAPISESG 655
Query: 315 HLNQPKWGHLRELHKLL 331
W + L K +
Sbjct: 656 QTTPKYWELRKALSKYM 672
>gi|160890905|ref|ZP_02071908.1| hypothetical protein BACUNI_03350 [Bacteroides uniformis ATCC 8492]
gi|156859904|gb|EDO53335.1| glycosyl hydrolase family 35 [Bacteroides uniformis ATCC 8492]
Length = 1106
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 108/317 (34%), Positives = 149/317 (47%), Gaps = 27/317 (8%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++ + +HYPR W IK K G++ I YVFWN+HE +DFTG
Sbjct: 360 LNGKPFVIKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHESQPGVFDFTGQN 419
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL F + Q +YVILR GPYVCAEW GG P WL I LR ++ FM + F
Sbjct: 420 DLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDI-RLRESDPYFMERVGIF 478
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGN--------------VMSDYGDAGKSYIN 203
+ + + GGPII+ Q+ENEYG+ V ++Y +
Sbjct: 479 EKAVAEQVA--GMTIQNGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVALFQCD 536
Query: 204 WCAKMATSLDIGVPWIMCQESDAP-SPMFTPNN---PNSPKIWTENWTGWFKSWGGKDPK 259
W + + + W M + A F P P+SP + +E W+GWF WG
Sbjct: 537 WASNFTKNGLHDLVWTMNFGTGANIDQQFAPLKKLRPDSPLMCSEFWSGWFDKWGANHET 596
Query: 260 RTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYL---TTSYDYDAPIDEYG 314
R A D+ + G +F + YM HGGTN+G +G P TSYDYDAPI E G
Sbjct: 597 RPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAPISESG 655
Query: 315 HLNQPKWGHLRELHKLL 331
W + L K +
Sbjct: 656 QTTPKYWELRKALSKYM 672
>gi|317479674|ref|ZP_07938798.1| glycosyl hydrolase family 35 [Bacteroides sp. 4_1_36]
gi|316904175|gb|EFV26005.1| glycosyl hydrolase family 35 [Bacteroides sp. 4_1_36]
Length = 1106
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 108/317 (34%), Positives = 149/317 (47%), Gaps = 27/317 (8%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++ + +HYPR W IK K G++ I YVFWN+HE +DFTG
Sbjct: 360 LNGKPFVIKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHESQPGVFDFTGQN 419
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL F + Q +YVILR GPYVCAEW GG P WL I LR ++ FM + F
Sbjct: 420 DLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDI-RLRESDPYFMERVGIF 478
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGN--------------VMSDYGDAGKSYIN 203
+ + + GGPII+ Q+ENEYG+ V ++Y +
Sbjct: 479 EKAVAEQVA--GMTIQNGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVALFQCD 536
Query: 204 WCAKMATSLDIGVPWIMCQESDAP-SPMFTPNN---PNSPKIWTENWTGWFKSWGGKDPK 259
W + + + W M + A F P P+SP + +E W+GWF WG
Sbjct: 537 WASNFTKNGLHDLVWTMNFGTGANIDQQFAPLKKLRPDSPLMCSEFWSGWFDKWGANHET 596
Query: 260 RTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYL---TTSYDYDAPIDEYG 314
R A D+ + G +F + YM HGGTN+G +G P TSYDYDAPI E G
Sbjct: 597 RPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAPISESG 655
Query: 315 HLNQPKWGHLRELHKLL 331
W + L K +
Sbjct: 656 QTTPKYWELRKALSKYM 672
>gi|182439300|ref|YP_001827019.1| beta-galactosidase [Streptomyces griseus subsp. griseus NBRC 13350]
gi|178467816|dbj|BAG22336.1| putative beta-galactosidase [Streptomyces griseus subsp. griseus
NBRC 13350]
Length = 630
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 110/325 (33%), Positives = 155/325 (47%), Gaps = 48/325 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ LLSG++HY R W + GL+ +ETYV WN HEP + G L
Sbjct: 13 LDGKPVRLLSGALHYFRVHEAQWEHRLAMLAAMGLNCVETYVPWNLHEPREGEVRDVGAL 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQN- 156
RF+ ++ GL+ I+R GPY+CAEW GG PVW+ G +RT + + ++
Sbjct: 73 G--RFLDAVERAGLWAIVRPGPYICAEWENGGLPVWVTGRFG-RRVRTRDAAYRAVVERW 129
Query: 157 FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGV 216
F L+ + +++ S+GGP++L Q ENEYG+ SD Y+ W A + + V
Sbjct: 130 FRELLPQVVRRQ---VSRGGPVVLVQAENEYGSYGSD-----AVYLEWLAGLLRQCGVTV 181
Query: 217 PWI--------MCQESDAPSPMFTPN---------------NPNSPKIWTENWTGWFKSW 253
P M P + T N P P + E W GWF W
Sbjct: 182 PLFTSDGPEDHMLTGGSVPGLLATANFGSGAREGFAVLRRHQPGGPLMCMEFWCGWFDHW 241
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG----GPY-------LTT 302
G + +R E A A+ + G + N YM HGGTNFG +G GP+ T
Sbjct: 242 GAEPVRRDPEQAAGALREILECGASV-NVYMAHGGTNFGGWAGANRSGPHQDESFQPTVT 300
Query: 303 SYDYDAPIDEYGHLNQPKWGHLREL 327
SYDYDAP+DEYG + K+ RE+
Sbjct: 301 SYDYDAPVDEYGRATE-KFRLFREV 324
>gi|423303842|ref|ZP_17281841.1| hypothetical protein HMPREF1072_00781 [Bacteroides uniformis
CL03T00C23]
gi|423307438|ref|ZP_17285428.1| hypothetical protein HMPREF1073_00178 [Bacteroides uniformis
CL03T12C37]
gi|392687173|gb|EIY80470.1| hypothetical protein HMPREF1072_00781 [Bacteroides uniformis
CL03T00C23]
gi|392690047|gb|EIY83318.1| hypothetical protein HMPREF1073_00178 [Bacteroides uniformis
CL03T12C37]
Length = 1106
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 108/317 (34%), Positives = 149/317 (47%), Gaps = 27/317 (8%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++ + +HYPR W IK K G++ I YVFWN+HE +DFTG
Sbjct: 360 LNGKPFVIKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHESQPGVFDFTGQN 419
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL F + Q +YVILR GPYVCAEW GG P WL I LR ++ FM + F
Sbjct: 420 DLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDI-RLRESDPYFMERVGIF 478
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGN--------------VMSDYGDAGKSYIN 203
+ + + GGPII+ Q+ENEYG+ V ++Y +
Sbjct: 479 EKAVAEQVA--GMTIQNGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVALFQCD 536
Query: 204 WCAKMATSLDIGVPWIMCQESDAP-SPMFTPNN---PNSPKIWTENWTGWFKSWGGKDPK 259
W + + + W M + A F P P+SP + +E W+GWF WG
Sbjct: 537 WASNFTKNGLHDLVWTMNFGTGANIDQQFAPLKKLRPDSPLMCSEFWSGWFDKWGANHET 596
Query: 260 RTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYL---TTSYDYDAPIDEYG 314
R A D+ + G +F + YM HGGTN+G +G P TSYDYDAPI E G
Sbjct: 597 RPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAPISESG 655
Query: 315 HLNQPKWGHLRELHKLL 331
W + L K +
Sbjct: 656 QTTPKYWELRKALSKYM 672
>gi|326779952|ref|ZP_08239217.1| glycoside hydrolase family 35 [Streptomyces griseus XylebKG-1]
gi|326660285|gb|EGE45131.1| glycoside hydrolase family 35 [Streptomyces griseus XylebKG-1]
Length = 648
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 110/325 (33%), Positives = 155/325 (47%), Gaps = 48/325 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ LLSG++HY R W + GL+ +ETYV WN HEP + G L
Sbjct: 13 LDGKPVRLLSGALHYFRVHEAQWEHRLAMLAAMGLNCVETYVPWNLHEPREGEVRDVGAL 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQN- 156
RF+ ++ GL+ I+R GPY+CAEW GG PVW+ G +RT + + ++
Sbjct: 73 G--RFLDAVERAGLWAIVRPGPYICAEWENGGLPVWVTGRFG-RRVRTRDAAYRAVVERW 129
Query: 157 FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGV 216
F L+ + +++ S+GGP++L Q ENEYG+ SD Y+ W A + + V
Sbjct: 130 FRELLPQVVRRQ---VSRGGPVVLVQAENEYGSYGSD-----AVYLEWLAGLLRQCGVTV 181
Query: 217 PWI--------MCQESDAPSPMFTPN---------------NPNSPKIWTENWTGWFKSW 253
P M P + T N P P + E W GWF W
Sbjct: 182 PLFTSDGPEDHMLTGGSVPGLLATANFGSGAREGFKVLRRHQPGGPLMCMEFWCGWFDHW 241
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG----GPY-------LTT 302
G + +R E A A+ + G + N YM HGGTNFG +G GP+ T
Sbjct: 242 GAEPVRRDPEQAAGALREILECGASV-NVYMAHGGTNFGGWAGANRSGPHQDESFQPTVT 300
Query: 303 SYDYDAPIDEYGHLNQPKWGHLREL 327
SYDYDAP+DEYG + K+ RE+
Sbjct: 301 SYDYDAPVDEYGRATE-KFRLFREV 324
>gi|431919325|gb|ELK17922.1| Beta-galactosidase-1-like protein 3 [Pteropus alecto]
Length = 1113
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 111/321 (34%), Positives = 160/321 (49%), Gaps = 41/321 (12%)
Query: 36 ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTG 95
T+ G + + GSIHY R W D + K K G + + TYV WN HEP R +DF+
Sbjct: 629 FTLGGHKFRIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPQRGAFDFSE 688
Query: 96 NLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQ 155
NLDL F+ + GL+VILR GPY+C+E + GG P WL + LRTT++ F+ +
Sbjct: 689 NLDLEAFVLMAAEIGLWVILRPGPYICSEIDLGGLPSWLLQDSNV-RLRTTDQGFVEAVD 747
Query: 156 NFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIG 215
+ ++ A+ L QGGPII Q+ENEYG+ D K Y+ + + I
Sbjct: 748 KYFDHLI--ARVVPLQYRQGGPIIAVQVENEYGSF-----DKDKYYMPYIQQALLKRGI- 799
Query: 216 VPWIMCQES-----------------------DAPSPMFTPNNPNSPKIWTENWTGWFKS 252
V ++ ++ DA P++ N P + E W GWF
Sbjct: 800 VELLLTSDAKTEVLKGYIKGVLAAINIEKFQNDAFEPLYNIQK-NKPILVMEYWVGWFDK 858
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG------PYLTTSYDY 306
WG + + A+D+ V+ F +F +F N YM+HGGTNFG +G + TSYDY
Sbjct: 859 WGDEHNVKDAQDVENTVSEFIKFEISF-NVYMFHGGTNFGFINGATNFGKHKSIATSYDY 917
Query: 307 DAPIDEYGHLNQPKWGHLREL 327
DA + E G + K+ LR+L
Sbjct: 918 DAVLTEAGDYTE-KYFKLRKL 937
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 98/310 (31%), Positives = 141/310 (45%), Gaps = 35/310 (11%)
Query: 32 DGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQY 91
+G T+DG ++++G+IHY R W D + K K G + + +V W+ HEP R ++
Sbjct: 52 EGSNFTLDGFPFLIIAGTIHYFRVPREYWKDRLLKLKACGFNTVTMHVPWSHHEPQRHKF 111
Query: 92 DFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFM 151
FTG+LDL FI ++GL+VIL GPY+ ++ + GG P WL P + +LRTT K F
Sbjct: 112 YFTGDLDLRAFISIASNEGLWVILCPGPYIGSDLDLGGLPSWLLQDPKM-KLRTTYKGFT 170
Query: 152 NEM-QNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK--- 207
+ Q F LI +A + GPII Q+ENEYG+ D K Y+++ K
Sbjct: 171 KAVNQYFDQLIPRIAPFQ---YENYGPIIAVQVENEYGSYHLD-----KRYMSYVKKALV 222
Query: 208 ------MATSLDIGVPWIMCQESDAPSPMFTPNNPN------------SPKIWTENWTGW 249
M + D G I + + + N SP + T
Sbjct: 223 KRGIKAMLMTADDGQEIIRGYLNKVIATVHMKNIKKETYKNLFSIQGLSPILMMVYTTSS 282
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAP 309
SWG + L V F +F N+YM+HGGTNFG G L + Y
Sbjct: 283 SDSWGHSHHTLDSHVLMKNVHEMFNLRFSF-NFYMFHGGTNFGFIGGASSLNS---YLPV 338
Query: 310 IDEYGHLNQP 319
+ YG P
Sbjct: 339 VTSYGKYLYP 348
>gi|256840666|ref|ZP_05546174.1| glycoside hydrolase, family 35 [Parabacteroides sp. D13]
gi|256737938|gb|EEU51264.1| glycoside hydrolase, family 35 [Parabacteroides sp. D13]
Length = 768
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 114/326 (34%), Positives = 165/326 (50%), Gaps = 43/326 (13%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG +HYPR W ++ + GL+ + TYVFWN HE ++DF G+
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
+L +I+ ++GL VILR GPYVCAEW +GG+P WL N+PG+ E+R N F+
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGM-EIRRDNPEFLKR---- 153
Query: 158 TTLIVDMAKKE--KLFASQGGPIILAQIENEYGNVMSDYGDAG-KSYINWCAKMATSL-D 213
T L +D ++ L S+GGPII+ Q ENE+G+ ++ D + + + AK+ L D
Sbjct: 154 TKLYIDKLYEQVGDLQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLAD 213
Query: 214 IG--VPWIMCQES------DAPSPMFTPNNPNS----------------PKIWTENWTGW 249
G VP S P + T N ++ P + E + GW
Sbjct: 214 AGFNVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPGW 273
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT-------- 301
W P + +A + Q +F N+YM HGGTNFG TSG Y
Sbjct: 274 LMHWAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHDIQPDL 332
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLREL 327
TSYDYDAPI E G + PK+ +R +
Sbjct: 333 TSYDYDAPISEAGWVT-PKFDSIRNV 357
>gi|298376422|ref|ZP_06986377.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_19]
gi|298266300|gb|EFI07958.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_19]
Length = 768
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 114/326 (34%), Positives = 165/326 (50%), Gaps = 43/326 (13%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG +HYPR W ++ + GL+ + TYVFWN HE ++DF G+
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
+L +I+ ++GL VILR GPYVCAEW +GG+P WL N+PG+ E+R N F+
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGM-EIRRDNPEFLKR---- 153
Query: 158 TTLIVDMAKKE--KLFASQGGPIILAQIENEYGNVMSDYGDAG-KSYINWCAKMATSL-D 213
T L +D ++ L S+GGPII+ Q ENE+G+ ++ D + + + AK+ L D
Sbjct: 154 TKLYIDKLYEQVGDLQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLAD 213
Query: 214 IG--VPWIMCQES------DAPSPMFTPNNPNS----------------PKIWTENWTGW 249
G VP S P + T N ++ P + E + GW
Sbjct: 214 AGFNVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPGW 273
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT-------- 301
W P + +A + Q +F N+YM HGGTNFG TSG Y
Sbjct: 274 LMHWAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHDIQPDL 332
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLREL 327
TSYDYDAPI E G + PK+ +R +
Sbjct: 333 TSYDYDAPISEAGWVT-PKFDSIRNV 357
>gi|194761012|ref|XP_001962726.1| GF14288 [Drosophila ananassae]
gi|190616423|gb|EDV31947.1| GF14288 [Drosophila ananassae]
Length = 661
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 107/365 (29%), Positives = 172/365 (47%), Gaps = 55/365 (15%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + H+ + +DGE +SGS HY R+ P W ++ + GL+A++TYV W+ H P
Sbjct: 35 FTIDHEANSFMLDGEPFRYVSGSFHYFRAVPEAWRSRLRTMRASGLNALDTYVEWSLHNP 94
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+Y++ G D+++F++ Q++ Y+ILR GPY+CAE + GG P WL ++RT
Sbjct: 95 HEDEYNWEGIADVVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFKKYPSIKMRTN 154
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW-- 204
+ ++ E+ + + M + + L GG II+ Q+ENEYG+ D+ Y+NW
Sbjct: 155 DPDYIAEVGKWYAQL--MPRLQHLLVGNGGKIIMVQVENEYGDYACDH-----DYLNWLR 207
Query: 205 ------------------------CAKMA---TSLDIGVPWIMCQESDAPSPMFTPNNPN 237
C K+ + D G+ I E D M P
Sbjct: 208 DETEKYVSGKALLFTVDIPNEKMSCGKIDNVFATTDFGIDRI--NEIDEIWKMLRVQQPT 265
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
P + +E + GW W ++ +R + +A A+ + + N YM+ GGTNFG T+G
Sbjct: 266 GPLVNSEFYPGWLTHWQEQNQRRDGQVVADALKTILSYNASV-NLYMFFGGTNFGFTAGA 324
Query: 298 PY----------LTTSYDYDAPIDEYGHLN------QPKWGHLRELHKLLKSMEKTLTYG 341
Y TSYDYDA +DE G + + G EL ++ + K + YG
Sbjct: 325 NYDLDGGIGYAADITSYDYDAVMDEAGGVTSKYDKVKAVIGEFLELPEITLNPAKRIAYG 384
Query: 342 NVTNT 346
V T
Sbjct: 385 KVEVT 389
Score = 44.3 bits (103), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 36/78 (46%), Gaps = 29/78 (37%)
Query: 609 LNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWY 668
LN+ G GKG A+VNG+NLGRYWP GP Q+
Sbjct: 579 LNMAGWGKGVAYVNGFNLGRYWPV---------------AGP--------------QVTL 609
Query: 669 HVPRSWIKDGVNTLVLFE 686
+VP +K G N++V+ E
Sbjct: 610 YVPNELLKVGENSVVILE 627
>gi|423331257|ref|ZP_17309041.1| hypothetical protein HMPREF1075_01054 [Parabacteroides distasonis
CL03T12C09]
gi|409230553|gb|EKN23415.1| hypothetical protein HMPREF1075_01054 [Parabacteroides distasonis
CL03T12C09]
Length = 768
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 113/326 (34%), Positives = 166/326 (50%), Gaps = 43/326 (13%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG +HYPR W ++ + GL+ + TYVFWN HE ++DF G+
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
+L +I+ ++GL VILR GPYVCAEW +GG+P WL N+PG+ E+R N F+
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGM-EIRRDNPEFLKR---- 153
Query: 158 TTLIVDMAKKE--KLFASQGGPIILAQIENEYGNVMSDYGDAG-KSYINWCAKMATSL-D 213
T L +D ++ L S+GGPII+ Q ENE+G+ ++ D + + + AK+ L D
Sbjct: 154 TKLYIDKLYEQVGDLQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLAD 213
Query: 214 IG--VPWI------MCQESDAPSPMFTPNNPNS----------------PKIWTENWTGW 249
G VP + + P + T N ++ P + E + GW
Sbjct: 214 AGFNVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPGW 273
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT-------- 301
W P + +A + Q +F N+YM HGGTNFG TSG Y
Sbjct: 274 LMHWAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHDIQPDL 332
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLREL 327
TSYDYDAPI E G + PK+ +R +
Sbjct: 333 TSYDYDAPISEAGWVT-PKFDSIRNV 357
>gi|346725882|ref|YP_004852551.1| beta-galactosidase [Xanthomonas axonopodis pv. citrumelo F1]
gi|346650629|gb|AEO43253.1| beta-galactosidase [Xanthomonas axonopodis pv. citrumelo F1]
Length = 611
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 110/314 (35%), Positives = 159/314 (50%), Gaps = 41/314 (13%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
LLSG+IH+ R W D ++KA+ GL+ +ETYVFWN EP + Q+DF+GN D+ F++
Sbjct: 46 LLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVR 105
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
QGL VILR GPY CAEW GG+P WL I +R+ + F+ Q++ +
Sbjct: 106 EAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIR-VRSRDPRFLAASQSYLDALAK- 163
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYG---DAGKSYIN--------WCAKMATSLD 213
+ + L GGPII Q+ENEYG+ D+ D Y+ + + A L
Sbjct: 164 -QVQPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLA 222
Query: 214 IG-VPWIMCQESDAPSPM------FTPNNPNSPKIWTENWTGWFKSWG----GKDPKRTA 262
G +P + + AP P+ P++ E W GWF WG D ++ A
Sbjct: 223 NGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGKPHAATDARQQA 282
Query: 263 EDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL----------TTSYDYDAPIDE 312
E+ + + + G N YM+ GGT+FG +G + TTSYDYDA +DE
Sbjct: 283 EEFEWILRQ-----GHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDE 337
Query: 313 YGHLNQPKWGHLRE 326
GH PK+ +R+
Sbjct: 338 AGHPT-PKFALMRD 350
Score = 42.7 bits (99), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 34/133 (25%), Positives = 56/133 (42%), Gaps = 24/133 (18%)
Query: 498 VKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYK 557
V++ G++ + +L G NYG++ G+ PVLL D + + +
Sbjct: 455 VEIPAGQHTLDVLVENSGRINYGTRMADGRAGLVDPVLL-----DNQQLTGWQAFPLPMR 509
Query: 558 VGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKG 617
+S RGW+ K V + +++ D L+++ GKG
Sbjct: 510 T---------------PDSIRGWTRKAV---QGPAFHRGALRIGTPTD-TYLDMRAFGKG 550
Query: 618 FAWVNGYNLGRYW 630
FAW NG NLGR+W
Sbjct: 551 FAWANGVNLGRHW 563
>gi|24582088|ref|NP_608978.2| beta galactosidase, isoform A [Drosophila melanogaster]
gi|21430516|gb|AAM50936.1| LP09580p [Drosophila melanogaster]
gi|22945722|gb|AAF52321.2| beta galactosidase, isoform A [Drosophila melanogaster]
Length = 672
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 108/365 (29%), Positives = 174/365 (47%), Gaps = 55/365 (15%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + H+ +DG+ +SGS HY R+ P W ++ + GL+A++TYV W+ H P
Sbjct: 46 FTIDHEANTFMLDGQPFRYVSGSFHYFRAVPESWRSRLRTMRASGLNALDTYVEWSLHNP 105
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+Y++ G D+++F++ Q++ Y+ILR GPY+CAE + GG P WL ++RT
Sbjct: 106 HDGEYNWEGIADVVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFTKYPSIKMRTN 165
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW-- 204
+ +++E+ + + M + + LF GG II+ Q+ENEYG+ D+ Y+NW
Sbjct: 166 DPNYISEVGKWYAEL--MPRLQHLFVGNGGKIIMVQVENEYGDYACDH-----DYLNWLR 218
Query: 205 ------------------------CAKMA---TSLDIGVPWIMCQESDAPSPMFTPNNPN 237
C K+ + D G+ I E D M P
Sbjct: 219 DETEKYVSGKALLFTVDIPNEKMSCGKIENVFATTDFGIDRI--NEIDKIWAMLRALQPT 276
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
P + +E + GW W ++ +R +++A A+ + + N YM+ GGTNFG T+G
Sbjct: 277 GPLVNSEFYPGWLTHWQEQNQRRDGQEVANALRTILSYNASV-NLYMFFGGTNFGFTAGA 335
Query: 298 PY----------LTTSYDYDAPIDEYG------HLNQPKWGHLRELHKLLKSMEKTLTYG 341
Y TSYDYDA +DE G +L + G L ++ + K L YG
Sbjct: 336 NYNLDGGIGYAADITSYDYDAVMDEAGGVTTKYNLVKAVIGEFLPLPEITLNPAKRLAYG 395
Query: 342 NVTNT 346
V T
Sbjct: 396 RVELT 400
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 36/78 (46%), Gaps = 29/78 (37%)
Query: 609 LNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWY 668
LN+ G GKG A+VNG+NLGRYWP GP Q+
Sbjct: 590 LNMAGWGKGVAYVNGFNLGRYWPV---------------AGP--------------QVTL 620
Query: 669 HVPRSWIKDGVNTLVLFE 686
+VP +K G N+LV+ E
Sbjct: 621 YVPNEILKVGENSLVILE 638
>gi|302765290|ref|XP_002966066.1| hypothetical protein SELMODRAFT_61485 [Selaginella moellendorffii]
gi|300166880|gb|EFJ33486.1| hypothetical protein SELMODRAFT_61485 [Selaginella moellendorffii]
Length = 620
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 182/655 (27%), Positives = 274/655 (41%), Gaps = 132/655 (20%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
S ++ + +D A DGE +L G IHY R P W D I++AK GL+ I+TYV WN
Sbjct: 1 SRSFSIEND--AFYKDGEPFRILGGEIHYFRIVPEYWKDRIQRAKAMGLNTIQTYVPWNV 58
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
HEP ++ F ++L F+K Q+ + V+LR+GPYVC EW+ GGFP WL + +L
Sbjct: 59 HEPSEGEFFFGDPVNLEAFLKLAQELEVLVMLRMGPYVCGEWDLGGFPSWLLSKKPQLKL 118
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSD-------YGD 196
RT++ ++ + + ++ + K S+GGP+I+ Q+ENEYG+ SD D
Sbjct: 119 RTSDSSYLKLVDQWWNVL--LPKLVPFLYSRGGPLIMLQVENEYGSFGSDKQYLHHLVSD 176
Query: 197 A----GKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFTP---------------NNP- 236
A G I + AT D + D + + P N+P
Sbjct: 177 AREYLGNEIILYTTDGAT--DDALQRGTISRDDVYAAVDFPTGWDPVAAFALQKNYNSPG 234
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
SP + TE +TGW WG + A + + G+ YM HGG+NFG SG
Sbjct: 235 KSPALSTEFYTGWLTHWGENLATTSPYVAAAELDKLLSANGSVV-LYMAHGGSNFGFFSG 293
Query: 297 G----------PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNT 346
P + TSYDYDAPI E G L + W RE
Sbjct: 294 ANTGGKETIYQPDI-TSYDYDAPIGEGGDLGEKFW-RFRE-------------------- 331
Query: 347 DYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGN-----DQAPL 401
V S N P LP + NT V Q + + Q+ + QAP+
Sbjct: 332 -----VLSSYVNFPLPDPPQLPSRR----NTGTVVLQKLANLFQVLQSLSHEFYLQQAPV 382
Query: 402 QWKWRPEMINDFVVRGK--GHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSN 459
+ + V R + H ++++ K +D + + G S+
Sbjct: 383 AMELLNQSFGFIVYRSRLPSHAKPGSILEIKKIHDRAQ---------------VYVGKSS 427
Query: 460 MTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNY 519
+LR+ + Q +W SN + P + G +I +L +G NY
Sbjct: 428 QSLRLVGTLQ-------------RW-----SNSSLQLPDGSSAGL-EIYILVENMGRINY 468
Query: 520 GSKFDMVPNGIPGPVLL--VGRAGDETIIKDLSSHKWTYKVGL-YGLDDKKFYNAKAANS 576
G F GI V+L V G L+ +V + + + K F+ + AA
Sbjct: 469 G-PFIFDQKGILSSVILDNVPMLGWRAYTLSLADVAENQEVLINFAIHSKFFFLSLAA-- 525
Query: 577 ERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWP 631
P +Y TFE+ + D ++ +G KG A+VNG+NLGR+WP
Sbjct: 526 ---------PHCDGPAFYAATFESEAQMD-TFISFKGWSKGVAFVNGFNLGRFWP 570
>gi|300726558|ref|ZP_07060002.1| beta-galactosidase [Prevotella bryantii B14]
gi|299776172|gb|EFI72738.1| beta-galactosidase [Prevotella bryantii B14]
Length = 781
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 106/314 (33%), Positives = 157/314 (50%), Gaps = 30/314 (9%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
++ + H + ++G+ + + +HYPR W IK K G++AI YVFWN HE
Sbjct: 30 SFDIGH--KTFLLNGKPFTVKAAELHYPRIPRPYWEHRIKMCKALGMNAICIYVFWNIHE 87
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
+++FTGN D+ F + Q G+YVI+R GPYVCAEW GG P WL I+ LR
Sbjct: 88 QKEGEFNFTGNNDVAEFCRLAQKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIK-LRE 146
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSD------------ 193
+ FM ++ F + + + L +GGPII+ Q+ENEYG+ D
Sbjct: 147 RDPYFMERVKIFEDKVAE--QLAPLTIQRGGPIIMVQVENEYGSYGIDKQYVGEIRDMLR 204
Query: 194 --YGDAGKSY-INWCAKMATSLDIGVPWIM----CQESDAPSPMFTPNNPNSPKIWTENW 246
+G+ K + +W + + + W M D P++P + +E W
Sbjct: 205 QGWGNDVKMFQCDWSSNFTHNGLDDLIWTMNFGTGANIDNQFKKLKSLRPDAPLMCSEFW 264
Query: 247 TGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYL---T 301
+GWF WG + R A+D+ + G +F + YM HGGT+FG +G P
Sbjct: 265 SGWFDKWGARHETRPAQDMVNNIDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFQPDV 323
Query: 302 TSYDYDAPIDEYGH 315
TSYDYDAPI+EYG
Sbjct: 324 TSYDYDAPINEYGQ 337
Score = 42.0 bits (97), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 44/203 (21%), Positives = 78/203 (38%), Gaps = 40/203 (19%)
Query: 495 ERPVKL--TRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSH 552
E+ ++L + + +++L +G N+G GI V + + G I L
Sbjct: 451 EKSIELPAVKAGSTLTILVEAMGRINFGRAIKDF-KGITNDVTITTQQGKHEIQYTLKGW 509
Query: 553 KWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQ 612
+ +Y +DD +A ++ R + K+ P+ + +Y+ F D L+ +
Sbjct: 510 QSSY------IDDSYETAVRALSAARKHTEKDTPIMGKRGYYRGYFNLKKVGD-TFLDFE 562
Query: 613 GMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPR 672
GKG +VNG+ +GR W P Q Y VP
Sbjct: 563 TWGKGQVYVNGHAMGRIWSI-----------------------------GPQQTLY-VPG 592
Query: 673 SWIKDGVNTLVLFEEFGGNPSQI 695
W+K G N +V+ + G S +
Sbjct: 593 CWLKKGRNEVVVLDITGPQKSHV 615
>gi|442626280|ref|NP_001260120.1| beta galactosidase, isoform B [Drosophila melanogaster]
gi|440213416|gb|AGB92656.1| beta galactosidase, isoform B [Drosophila melanogaster]
Length = 670
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 108/365 (29%), Positives = 174/365 (47%), Gaps = 55/365 (15%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + H+ +DG+ +SGS HY R+ P W ++ + GL+A++TYV W+ H P
Sbjct: 44 FTIDHEANTFMLDGQPFRYVSGSFHYFRAVPESWRSRLRTMRASGLNALDTYVEWSLHNP 103
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+Y++ G D+++F++ Q++ Y+ILR GPY+CAE + GG P WL ++RT
Sbjct: 104 HDGEYNWEGIADVVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFTKYPSIKMRTN 163
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW-- 204
+ +++E+ + + M + + LF GG II+ Q+ENEYG+ D+ Y+NW
Sbjct: 164 DPNYISEVGKWYAEL--MPRLQHLFVGNGGKIIMVQVENEYGDYACDH-----DYLNWLR 216
Query: 205 ------------------------CAKMA---TSLDIGVPWIMCQESDAPSPMFTPNNPN 237
C K+ + D G+ I E D M P
Sbjct: 217 DETEKYVSGKALLFTVDIPNEKMSCGKIENVFATTDFGIDRI--NEIDKIWAMLRALQPT 274
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
P + +E + GW W ++ +R +++A A+ + + N YM+ GGTNFG T+G
Sbjct: 275 GPLVNSEFYPGWLTHWQEQNQRRDGQEVANALRTILSYNASV-NLYMFFGGTNFGFTAGA 333
Query: 298 PY----------LTTSYDYDAPIDEYG------HLNQPKWGHLRELHKLLKSMEKTLTYG 341
Y TSYDYDA +DE G +L + G L ++ + K L YG
Sbjct: 334 NYNLDGGIGYAADITSYDYDAVMDEAGGVTTKYNLVKAVIGEFLPLPEITLNPAKRLAYG 393
Query: 342 NVTNT 346
V T
Sbjct: 394 RVELT 398
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 36/78 (46%), Gaps = 29/78 (37%)
Query: 609 LNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWY 668
LN+ G GKG A+VNG+NLGRYWP GP Q+
Sbjct: 588 LNMAGWGKGVAYVNGFNLGRYWPV---------------AGP--------------QVTL 618
Query: 669 HVPRSWIKDGVNTLVLFE 686
+VP +K G N+LV+ E
Sbjct: 619 YVPNEILKVGENSLVILE 636
>gi|327282153|ref|XP_003225808.1| PREDICTED: beta-galactosidase-like [Anolis carolinensis]
Length = 649
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 112/324 (34%), Positives = 166/324 (51%), Gaps = 31/324 (9%)
Query: 39 DGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLD 98
DG+ +SGSIHY R W D + K K GLDAI+TYV WN HEP R Y+FTG+ D
Sbjct: 42 DGQPFRYISGSIHYSRIPRYYWKDRLLKMKMAGLDAIQTYVPWNFHEPERGVYNFTGDRD 101
Query: 99 LIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFT 158
L F++ Q+ GL VILR GPY+CAEW+ GG P WL I LR+++ ++ + ++
Sbjct: 102 LEYFLQLAQEVGLLVILRAGPYICAEWDMGGLPAWLLEKESI-VLRSSDPDYLTAVGSWM 160
Query: 159 TLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS---DY----GDAGKSYINWCAKMATS 211
+ + K + GGPII+ Q+ENEYG+ + DY + + Y+ + T+
Sbjct: 161 GIF--LPKMKPHLYQNGGPIIMVQVENEYGSYFACDFDYLRYLQNLFRQYLGDEVVLFTT 218
Query: 212 LDIGVPWIMC---------------QESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGK 256
+ ++ C + A P P + +E +TGW WG +
Sbjct: 219 DGASMFYLRCGALQGLYSTVDFGPGRNVTAAFSTQRHTEPKGPLVNSEFYTGWLDHWGHR 278
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYLT--TSYDYDAPIDE 312
A +A +++ G N YM+ GGTNFG +G PY+ TSYDYDAP+ E
Sbjct: 279 HITVPASIVAKSLSEILASGANV-NMYMFIGGTNFGYWNGANMPYMAQPTSYDYDAPLSE 337
Query: 313 YGHLNQPKWGHLRELHKLLKSMEK 336
G L + K+ +RE+ + K + +
Sbjct: 338 AGDLTE-KYFAIREVIGMFKKLPE 360
>gi|301309736|ref|ZP_07215675.1| beta-galactosidase (Lactase) [Bacteroides sp. 20_3]
gi|423340209|ref|ZP_17317948.1| hypothetical protein HMPREF1059_03873 [Parabacteroides distasonis
CL09T03C24]
gi|300831310|gb|EFK61941.1| beta-galactosidase (Lactase) [Bacteroides sp. 20_3]
gi|409227644|gb|EKN20540.1| hypothetical protein HMPREF1059_03873 [Parabacteroides distasonis
CL09T03C24]
Length = 765
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 114/326 (34%), Positives = 165/326 (50%), Gaps = 43/326 (13%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG +HYPR W ++ + GL+ + TYVFWN HE ++DF G+
Sbjct: 36 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 95
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
+L +I+ ++GL VILR GPYVCAEW +GG+P WL N+PG+ E+R N F+
Sbjct: 96 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGM-EIRRDNPEFLKR---- 150
Query: 158 TTLIVDMAKKE--KLFASQGGPIILAQIENEYGNVMSDYGDAG-KSYINWCAKMATSL-D 213
T L +D ++ L S+GGPII+ Q ENE+G+ ++ D + + + AK+ L D
Sbjct: 151 TKLYIDKLYEQVGDLQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLAD 210
Query: 214 IG--VPWIMCQES------DAPSPMFTPNNPNS----------------PKIWTENWTGW 249
G VP S P + T N ++ P + E + GW
Sbjct: 211 AGFNVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPGW 270
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT-------- 301
W P + +A + Q +F N+YM HGGTNFG TSG Y
Sbjct: 271 LMHWAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHDIQPDL 329
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLREL 327
TSYDYDAPI E G + PK+ +R +
Sbjct: 330 TSYDYDAPISEAGWVT-PKFDSIRNV 354
>gi|334347175|ref|XP_003341899.1| PREDICTED: beta-galactosidase-1-like protein [Monodelphis
domestica]
Length = 646
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 114/322 (35%), Positives = 158/322 (49%), Gaps = 51/322 (15%)
Query: 34 RAITIDGERKILL---------SGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAH 84
R+ +D +R I L SGSIHY R +W D + K + GL+A++ YV WN H
Sbjct: 45 RSFEVDRQRGIFLLDGVPFRYVSGSIHYSRVPSPLWSDRLHKMRMSGLNAVQVYVPWNYH 104
Query: 85 EPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELR 144
EP Y+F GN DL+ F+K ++ L VILR GPY+CAEW GG P WL P I LR
Sbjct: 105 EPQPGVYNFQGNRDLVAFLKAAANEDLLVILRPGPYICAEWEMGGLPAWLLQNPEI-VLR 163
Query: 145 TTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW 204
T++ F+ + ++ +++ M + GG II Q+ENEYG+ Y Y+
Sbjct: 164 TSDPDFLAAVDSWFHVLMPMV--QPWLYHNGGNIISVQVENEYGS----YFACDFRYMRH 217
Query: 205 CAKMATSLDIGVPWIMCQESDAPSPM-------------FTPNN-------------PNS 238
A + +L +G I +D P F P++ PN
Sbjct: 218 LAGLFRAL-LG-DQIFLFTTDGPRGFSCGTLQGLYSTVDFGPDDNMTEIFAMQQKYEPNG 275
Query: 239 PKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGP 298
P + +E +TGW WGG K + LA + + G N YM+HGGTNFG SG
Sbjct: 276 PLVNSEYYTGWLDYWGGNHSKWDTKTLANGLQNMLELGANV-NMYMFHGGTNFGYWSGAD 334
Query: 299 Y------LTTSYDYDAPIDEYG 314
+ +TTSYDYDAP+ E G
Sbjct: 335 FKKIYQPVTTSYDYDAPLSEAG 356
>gi|395775444|ref|ZP_10455959.1| glycosyl hydrolase family 42 [Streptomyces acidiscabies 84-104]
Length = 587
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 108/322 (33%), Positives = 156/322 (48%), Gaps = 39/322 (12%)
Query: 35 AITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFT 94
+ ++GE ++SG++HY R P W D ++KA+ GL+ +ETYV WN H+P
Sbjct: 10 SFELNGEPFRIISGALHYFRVHPDQWADRLRKARLMGLNTVETYVPWNLHQPEPGTLVLD 69
Query: 95 GNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEM 154
G LDL RF++ +GL V+LR GPY+CAEW+ GG P WL + ++ LR+++ F +
Sbjct: 70 GLLDLPRFLRLAHAEGLKVLLRPGPYICAEWDGGGLPHWLMSESDVQ-LRSSDPKFTAII 128
Query: 155 QNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDI 214
+ L++ A GGP+I Q+ENEYG +D Y+ + + S I
Sbjct: 129 DRYLDLLLPPLLPH--MAESGGPVIAVQVENEYGAYGND-----AEYLKYLVEAFRSRGI 181
Query: 215 GVPWIMC--------QESDAPSPMFT---------------PNNPNSPKIWTENWTGWFK 251
C Q P + T + P P + E W GWF
Sbjct: 182 EELLFTCDQVNPEHQQAGSIPGVLSTGTFGGKIETALATLRAHQPEGPLMCAEFWIGWFD 241
Query: 252 SWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY------LTTSYD 305
WGG R D+A + + G + N YM+HGGTNFG T+G + TSYD
Sbjct: 242 HWGGPHHTRDTADVAADLDKLLAAGASV-NIYMFHGGTNFGLTNGANHHHTYAPTITSYD 300
Query: 306 YDAPIDEYGHLNQPKWGHLREL 327
YDAP+ E G PK+ RE+
Sbjct: 301 YDAPLTENGDPG-PKYHAFREV 321
Score = 43.1 bits (100), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 50/192 (26%), Positives = 73/192 (38%), Gaps = 56/192 (29%)
Query: 497 PVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTY 556
PV++ R + +L +G NYG + P G+ GPV G + L
Sbjct: 428 PVQVHRRGAVLEVLVENMGRVNYGPRIG-APKGLLGPVTFDGMPVTGWECRPLPM----- 481
Query: 557 KVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPV--VLNLQGM 614
D A A++E ++ +++ TFE DP L+L G
Sbjct: 482 --------DAPLGAALYADAETEACAEPA-------FHRGTFEV---TDPADTFLSLPGW 523
Query: 615 GKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSW 674
KG AWVNG++LGRYW RGP Q +VP
Sbjct: 524 TKGQAWVNGFSLGRYW----------------NRGP--------------QQTLYVPGPV 553
Query: 675 IKDGVNTLVLFE 686
++ G NTL++ E
Sbjct: 554 LRPGANTLIVLE 565
>gi|150008152|ref|YP_001302895.1| beta-glycosidase [Parabacteroides distasonis ATCC 8503]
gi|149936576|gb|ABR43273.1| glycoside hydrolase family 35, candidate beta-glycosidase
[Parabacteroides distasonis ATCC 8503]
Length = 768
Score = 170 bits (430), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 113/326 (34%), Positives = 166/326 (50%), Gaps = 43/326 (13%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG +HYPR W ++ + GL+ + TYVFWN HE ++DF G+
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
+L +I+ ++GL VILR GPYVCAEW +GG+P WL N+PG+ E+R N F+
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGM-EIRRDNPEFLKR---- 153
Query: 158 TTLIVDMAKKE--KLFASQGGPIILAQIENEYGNVMSDYGDAG-KSYINWCAKMATSL-D 213
T L +D ++ L S+GGPII+ Q ENE+G+ ++ D + + + AK+ L D
Sbjct: 154 TKLYIDKLYEQVGDLQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLAD 213
Query: 214 IG--VPWI------MCQESDAPSPMFTPNNPNS----------------PKIWTENWTGW 249
G VP + + P + T N ++ P + E + GW
Sbjct: 214 AGFNVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPGW 273
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT-------- 301
W P + +A + Q +F N+YM HGGTNFG TSG Y
Sbjct: 274 LMHWAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHDIQPDL 332
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLREL 327
TSYDYDAPI E G + PK+ +R +
Sbjct: 333 TSYDYDAPISEAGWVT-PKFDSIRNV 357
>gi|422729668|ref|ZP_16786066.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
gi|315149788|gb|EFT93804.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
Length = 604
Score = 170 bits (430), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 107/325 (32%), Positives = 156/325 (48%), Gaps = 55/325 (16%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+LSG+IHY R P W + K G + +ETYV WN HEP + + F G LDL RF+K
Sbjct: 29 ILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLK 88
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + + ++++
Sbjct: 89 LAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEYYDVLMEK 146
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQES 224
+L + GG I++ QIENEYG+ +G+ K+Y+ + + + P+ S
Sbjct: 147 IVPHQL--ANGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAPFFT---S 196
Query: 225 DAP--------------------------------SPMFTPNNPNSPKIWTENWTGWFKS 252
D P F + P + E W GWF
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYLTTSY 304
W KR ++LA +V G N YM+HGGTNF +G P + TSY
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFEFMNGCSARGTIDLPQI-TSY 313
Query: 305 DYDAPIDEYGHLNQPKWGHLRELHK 329
DYDAP+DE G+ + + + LH+
Sbjct: 314 DYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|255015104|ref|ZP_05287230.1| beta-glycosidase [Bacteroides sp. 2_1_7]
gi|410104527|ref|ZP_11299440.1| hypothetical protein HMPREF0999_03212 [Parabacteroides sp. D25]
gi|409234336|gb|EKN27166.1| hypothetical protein HMPREF0999_03212 [Parabacteroides sp. D25]
Length = 768
Score = 170 bits (430), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 113/326 (34%), Positives = 166/326 (50%), Gaps = 43/326 (13%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG +HYPR W ++ + GL+ + TYVFWN HE ++DF G+
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
+L +I+ ++GL VILR GPYVCAEW +GG+P WL N+PG+ E+R N F+
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGM-EIRRDNPEFLKR---- 153
Query: 158 TTLIVDMAKKE--KLFASQGGPIILAQIENEYGNVMSDYGDAG-KSYINWCAKMATSL-D 213
T L +D ++ L S+GGPII+ Q ENE+G+ ++ D + + + AK+ L D
Sbjct: 154 TKLYIDKLYEQVGDLQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLAD 213
Query: 214 IG--VPWI------MCQESDAPSPMFTPNNPNS----------------PKIWTENWTGW 249
G VP + + P + T N ++ P + E + GW
Sbjct: 214 AGFNVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPGW 273
Query: 250 FKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT-------- 301
W P + +A + Q +F N+YM HGGTNFG TSG Y
Sbjct: 274 LMHWAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHDIQPDL 332
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLREL 327
TSYDYDAPI E G + PK+ +R +
Sbjct: 333 TSYDYDAPISEAGWVT-PKFDSIRNV 357
>gi|302776532|ref|XP_002971424.1| hypothetical protein SELMODRAFT_61474 [Selaginella moellendorffii]
gi|300160556|gb|EFJ27173.1| hypothetical protein SELMODRAFT_61474 [Selaginella moellendorffii]
Length = 620
Score = 170 bits (430), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 181/659 (27%), Positives = 273/659 (41%), Gaps = 140/659 (21%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
S ++ + +D A DGE +L G IHY R P W D I++AK GL+ I+TYV WN
Sbjct: 1 SRSFSIEND--AFYKDGEPFRILGGEIHYFRIVPEYWKDRIQRAKAMGLNTIQTYVPWNV 58
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
HEP ++ F ++L F+K Q+ + V+LR+GPYVC EW+ GGFP WL + +L
Sbjct: 59 HEPSEGEFFFGDPVNLEAFLKLAQELEVLVMLRMGPYVCGEWDLGGFPSWLLSKQPQLKL 118
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYIN 203
RT++ ++ + + ++ + K S+GGP+I+ Q+ENEYG+ SD K Y++
Sbjct: 119 RTSDSSYLKLVDQWWNVL--LPKLVPFLYSRGGPVIMLQVENEYGSFGSD-----KQYLH 171
Query: 204 WCAKMATSLDIGVPWIMCQESDA--------------------------PSPMFTP---- 233
A +G I+ A P F
Sbjct: 172 HLVSEAREY-LGNEIILYTTDGATEDALQRGTISRDDVYAAVDFPTGWDPVAAFALQKNY 230
Query: 234 NNP-NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG 292
N+P SP + TE +TGW WG + A + + G+ YM HGG+NFG
Sbjct: 231 NSPGKSPALSTEFYTGWLTHWGENLATTSPYVAAAELDKLLSANGSVV-LYMAHGGSNFG 289
Query: 293 RTSGG----------PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGN 342
SG P + TSYDYDAPI E G L + W RE
Sbjct: 290 FFSGANTGGKETIYQPDI-TSYDYDAPIGEGGDLGEKFW-RFRE---------------- 331
Query: 343 VTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGN-----D 397
V S N P LP + NT V Q + + Q+ +
Sbjct: 332 ---------VLSSYVNFPLPDPPQLPSRR----NTGTVVLQKLANLFQVLQSLSHEFYLQ 378
Query: 398 QAPLQWKWRPEMINDFVVRGK--GHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILS 455
QAP+ + + V R + H ++++ K +D + +
Sbjct: 379 QAPVTMELLNQSFGFIVYRSRLPSHAKPGSILEIKKIHDRAQ---------------VYV 423
Query: 456 GSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVG 515
G S+ +LR+ + Q +W SN + P + G +I +L +G
Sbjct: 424 GKSSQSLRLVGTLQ-------------RW-----SNSSLQLPDGSSAGM-EIYILVENMG 464
Query: 516 LQNYGSKFDMVPNGIPGPVLL--VGRAGDETIIKDLSSHKWTYKVGL-YGLDDKKFYNAK 572
NYG F GI V+L V G L+ +V + + + K F+ +
Sbjct: 465 RINYG-PFIFDQKGILSSVILDNVPMLGWRAYTLSLADVAENQEVLINFAIHSKFFFLSL 523
Query: 573 AANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWP 631
AA P +Y TFE+ + D ++ +G KG A+VNG+NLGR+WP
Sbjct: 524 AA-----------PHCDGPAFYAATFESEAQMD-TFISFKGWSKGVAFVNGFNLGRFWP 570
>gi|198433885|ref|XP_002127100.1| PREDICTED: similar to galactosidase, beta 1-like 2 [Ciona
intestinalis]
Length = 658
Score = 170 bits (430), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 107/313 (34%), Positives = 157/313 (50%), Gaps = 28/313 (8%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
++ G+ +DG+ ++SG++HY R W D + K K GL+ IETYV WN HEP+
Sbjct: 58 LTAQGKTFKLDGKPMTIISGAVHYFRMPREYWRDRLMKMKACGLNTIETYVPWNLHEPIP 117
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+Y+FTG+LDL+ FI YV+LR GPY+C+EW +GG P WL P + ++RT
Sbjct: 118 GKYNFTGDLDLVHFILLAHKLEFYVLLRPGPYICSEWEFGGLPSWLLRDPKM-KVRTMYP 176
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVM--SDYGDAGKSYINWCA 206
++ + + ++ K L GGPII Q++NEYG+ +DY K ++
Sbjct: 177 PYIAAVTKYFNYLLPFVK--PLQYQYGGPIIAFQLDNEYGSYFKDADYLPYLKEFLQ-NK 233
Query: 207 KMATSLDIGVPWIMCQESDAPSPMFTPN--------------NPNSPKIWTENWTGWFKS 252
+ L I ++ P + T N P++P + E WTGWF
Sbjct: 234 GIIELLFISDSIEGLRQQTIPGVLKTVNFKRMENHFTDLSNMQPDAPLMVMEFWTGWFDW 293
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------PYLTTSYD 305
WG K T ++ + F GG+ N+YM+ GGTNFG +G TSYD
Sbjct: 294 WGEKHHILTVQEFGETLNEIFSQGGSV-NFYMFFGGTNFGFMNGAYKDGTGFHADITSYD 352
Query: 306 YDAPIDEYGHLNQ 318
YDA I E G L +
Sbjct: 353 YDALIAENGDLTE 365
>gi|195030628|ref|XP_001988170.1| GH10713 [Drosophila grimshawi]
gi|193904170|gb|EDW03037.1| GH10713 [Drosophila grimshawi]
Length = 680
Score = 169 bits (429), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 178/695 (25%), Positives = 279/695 (40%), Gaps = 137/695 (19%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A+ + H ++G+ +SGS HY R+ P W ++ + GL+A++TYV W+ H
Sbjct: 55 AFSIDHVANTFLMNGKPFRYVSGSFHYFRALPDAWRSRLRTMRASGLNALDTYVEWSLHN 114
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P +YD+ G D++RF++ Q++ Y++LR GPY+CAE + GG P WL ++RT
Sbjct: 115 PHDGEYDWEGIADIVRFLEIAQEEDFYIVLRPGPYICAERDNGGLPHWLFTKYPDIKVRT 174
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW- 204
+ ++ E+ + + M + + L GG II+ Q+ENEYG Y Y+NW
Sbjct: 175 NDPNYIAEVGKWYAQL--MPRLKHLLFGNGGKIIMVQVENEYGA----YHACDHDYLNWL 228
Query: 205 -------------------------CAKMA---TSLDIGVPWIMCQESDAPSPMFTPNNP 236
C K+ + D G+ I E D + P
Sbjct: 229 RDETDKYVENKALLFTVDIPNERMHCGKIDNVFATTDFGIDRIF--EIDKIWELLRGIQP 286
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
P + +E + GW W + +R +++A A+ + +G + N YM+ GGTNFG T+G
Sbjct: 287 TGPLVNSEFYPGWLTHWQEMNQRRDGKEVADALKKILSYGASV-NLYMFFGGTNFGFTAG 345
Query: 297 GPYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNT-DYGNSVSGS 355
+YD D I + + + + G VTN + V G
Sbjct: 346 -----ANYDLDGGIGYAADITSYDYDAVMD------------EAGGVTNKYELVKQVIGE 388
Query: 356 SYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVV 415
LP +++ P K + T ++ + A + P++ +P+ +
Sbjct: 389 VLELP--DITLNP-AKRLSYGTVELTPALELLSAEGRAALSKGTPVKSD-QPKSFEE--- 441
Query: 416 RGKGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYV 475
+DQ S L Y T D DP L L++ H +V
Sbjct: 442 -----------MDQ-----YSGLLLYETTLPSMDLDPSL-------LKVEELRDRAHVFV 478
Query: 476 NGNYVDSQWTKYGASNDLFERPVKLTRG-KNQISLLSATVGLQNYGSKFDMVPNGIPGPV 534
+ V + S + + L++G + + LL G NY D GI G V
Sbjct: 479 DQQLVGT------LSREARIYALPLSKGWGSTLQLLVENQGRINYDRANDT--KGIFGKV 530
Query: 535 LLVGRAGDETIIKDLSSHKWT---YKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRM 591
L G LS WT Y + Y ++ + K A + +K L
Sbjct: 531 TLQLHNGGA-----LSLDGWTTTGYPLEAYAIETWR----KDAAALDPSIAKQRLLRTGP 581
Query: 592 TWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPY 651
Y +FE D LN G GKG A+VNG+NLGRYWP
Sbjct: 582 IAYTGSFEVTEVGD-TYLNTAGWGKGVAYVNGFNLGRYWP-------------------- 620
Query: 652 GSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
G P QI +VP +K G N++VL E
Sbjct: 621 --------LGGP-QITLYVPNELLKVGQNSVVLLE 646
>gi|302523005|ref|ZP_07275347.1| beta-galactosidase [Streptomyces sp. SPB78]
gi|302431900|gb|EFL03716.1| beta-galactosidase [Streptomyces sp. SPB78]
Length = 588
Score = 169 bits (429), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 112/323 (34%), Positives = 154/323 (47%), Gaps = 46/323 (14%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
++VS +G ++DG LLSG++HY R P WP ++ + GL+ +ETYV WN HEP
Sbjct: 2 FQVSPEG--FSLDGRPLRLLSGALHYFRVLPEQWPHRLRMLRALGLNTVETYVPWNFHEP 59
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+DFTG DL F+ +D GL+ I+R PY+CAEW GG P WL P + LR
Sbjct: 60 RPGHHDFTGQADLDAFLHATRDAGLHAIVRPSPYICAEWENGGLPWWLLADPEVRALRCQ 119
Query: 147 NKVFMNEMQN-FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
+ ++ + + LI +A + ++GG +++ Q+ENEYG+ +D G Y+
Sbjct: 120 DPAYLAHVDRWYDALIPRLAAHQ---VTRGGNVVMMQVENEYGSYGTDTG-----YLEHL 171
Query: 206 AKMATSLDIGVPWIMCQESD--------APSPMFTPN---------------NPNSPKIW 242
A I VP D P + T N P+ P +
Sbjct: 172 ADGMRRRGIDVPLFTSDGPDDFFLTGGTLPGHLATVNFGSRPAQAFAGLKRLRPHDPPMC 231
Query: 243 TENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGR--------- 293
E W GWF WG R A + +A GG+ N YM HGGTNF
Sbjct: 232 AEFWCGWFDHWGAPRTVRDAAEATEELAATLGAGGSV-NVYMAHGGTNFSTWAGANTEDP 290
Query: 294 TSGGPYL--TTSYDYDAPIDEYG 314
+G YL TSYDYDAPIDE G
Sbjct: 291 ATGAGYLPTVTSYDYDAPIDERG 313
>gi|365860016|ref|ZP_09399844.1| putative beta-galactosidase [Streptomyces sp. W007]
gi|364010544|gb|EHM31456.1| putative beta-galactosidase [Streptomyces sp. W007]
Length = 645
Score = 169 bits (429), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 111/325 (34%), Positives = 155/325 (47%), Gaps = 48/325 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ LLSG++HY R W + GL+ +ETYV WN HEP + G L
Sbjct: 13 LDGKPVRLLSGALHYFRVHEAQWEHRLAMLAAMGLNCVETYVPWNLHEPREGEVRDVGAL 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQN- 156
RF+ ++ GL+ I+R GPY+CAEW GG PVW+ G +RT + + ++
Sbjct: 73 G--RFLDAVERAGLWAIVRPGPYICAEWENGGLPVWVTGRFG-RRVRTRDAAYRAVVERW 129
Query: 157 FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGV 216
F L+ + +++ S+GGP+IL Q ENEYG+ SD Y+ W A + + V
Sbjct: 130 FRELLPQVVQRQ---VSRGGPVILVQAENEYGSYGSD-----AVYLEWLAGLLRQCGVTV 181
Query: 217 PWI--------MCQESDAPSPMFTPN---------------NPNSPKIWTENWTGWFKSW 253
P M P + T N P P + E W GWF W
Sbjct: 182 PLFTSDGPEDHMLTGGSVPGLLATANFGSGAREGFEVLLRHQPRGPLMCMEFWCGWFDHW 241
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG----GPY-------LTT 302
G + +R E A A+ + G + N YM HGGTNFG +G GP+ T
Sbjct: 242 GAEPVRRDPEQAAGALREVLECGASV-NIYMAHGGTNFGGWAGANRSGPHQDESFQPTVT 300
Query: 303 SYDYDAPIDEYGHLNQPKWGHLREL 327
SYDYDAP+DEYG + K+ RE+
Sbjct: 301 SYDYDAPVDEYGRATE-KFRLFREV 324
>gi|329960218|ref|ZP_08298660.1| beta-galactosidase domain protein [Bacteroides fluxus YIT 12057]
gi|328532891|gb|EGF59668.1| beta-galactosidase domain protein [Bacteroides fluxus YIT 12057]
Length = 1104
Score = 169 bits (429), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 107/317 (33%), Positives = 150/317 (47%), Gaps = 27/317 (8%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++ + +HYPR W IK K G++ I YVFWN+HEP +DFTG
Sbjct: 358 LNGKPFVVKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHEPQPGVFDFTGQN 417
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL F + + +YVILR GPYVCAEW GG P WL I LR ++ F+ + F
Sbjct: 418 DLAEFCRLCRQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDI-RLRESDPYFIERVGIF 476
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGN--------------VMSDYGDAGKSYIN 203
+ + + GGPII+ Q+ENEYG+ V ++Y +
Sbjct: 477 EKAVAEQVA--DMTIQNGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVTLFQCD 534
Query: 204 WCAKMATSLDIGVPWIMCQESDAP-SPMFTPNN---PNSPKIWTENWTGWFKSWGGKDPK 259
W + + + W M + A F P P+SP + +E W+GWF WG
Sbjct: 535 WASNFTKNGLHDLVWTMNFGTGANIDQQFAPLKKLRPDSPLMCSEFWSGWFDKWGANHET 594
Query: 260 RTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYL---TTSYDYDAPIDEYG 314
R A D+ + G +F + YM HGGTN+G +G P TSYDYDAPI E G
Sbjct: 595 RPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAPISESG 653
Query: 315 HLNQPKWGHLRELHKLL 331
W + L K +
Sbjct: 654 QTTPKYWELRKTLSKYM 670
>gi|225872977|ref|YP_002754436.1| beta-galactosidase [Acidobacterium capsulatum ATCC 51196]
gi|225792973|gb|ACO33063.1| beta-galactosidase [Acidobacterium capsulatum ATCC 51196]
Length = 619
Score = 169 bits (429), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 115/349 (32%), Positives = 173/349 (49%), Gaps = 61/349 (17%)
Query: 11 ILLCLILQTLFNLSLAY-RVSH-----DGRAITIDGERKILLSGSIHYPRSTPGMWPDLI 64
+ L L + LS + R +H DG I +DG+ ++SGSIH+ R W D +
Sbjct: 15 LFAVLPLHAVPALSETHTRAAHTATVGDGHFI-LDGKPVQIISGSIHFARVPRAEWGDRL 73
Query: 65 KKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAE 124
+KA+ GL+AI YVFWN EP R Q+DF+G D+ RFI+ Q GLYVILR GPY CAE
Sbjct: 74 RKARAMGLNAISVYVFWNVQEPHRGQWDFSGQYDVARFIRMAQQAGLYVILRPGPYACAE 133
Query: 125 WNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIE 184
W+ GG+P WL G ++R+++ +++ Q++ + K L + GGPII Q+E
Sbjct: 134 WSMGGYPAWLWK-DGRVKIRSSDPAYLHAAQDYMDHLGQQLK--PLLWTHGGPIIAVQVE 190
Query: 185 NEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPS---------------- 228
NEYG+ +G + ++Y+ +M +G ++ +D P
Sbjct: 191 NEYGS----FGKS-RAYLEEVRRMVAGAGLGG--VVLYTADGPGLWSGSLPELPEAIDVG 243
Query: 229 --------PMFTPNNPNSPKIWT-ENWTGWFKSWG-----GKDPKRTAEDLAFAVARFFQ 274
P+S ++ E + GWF WG G K +DL + ++R +
Sbjct: 244 PGGVENGVKQLLAYRPHSKLVYVAEYYPGWFDQWGQPHHHGAPLKEQLKDLRWILSRGYS 303
Query: 275 FGGTFQNYYMYHGGTNFGRTSGG---------PYLTTSYDYDAPIDEYG 314
N YM+HGGT++G +G TTSYDY AP++E G
Sbjct: 304 V-----NLYMFHGGTDWGFMNGANDNAADTDYAPQTTSYDYAAPLNEAG 347
Score = 40.4 bits (93), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 33/125 (26%), Positives = 49/125 (39%), Gaps = 43/125 (34%)
Query: 575 NSER--GWSSKNVPLNR--RMTWYKTTFEAPL---------ENDPVVLNLQGMGKGFAWV 621
N ER GW + ++P+ R R++W T P + LN+ +GKG WV
Sbjct: 503 NGERLTGWKNYSLPMQRVPRLSWKTQTAAGPAFHRGTFTIEKTGDTYLNVSEIGKGLLWV 562
Query: 622 NGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNT 681
NG+ +GR W GP SD +VP W+ G NT
Sbjct: 563 NGHAIGRIWNI----------------GPQQSD--------------YVPACWLHKGKNT 592
Query: 682 LVLFE 686
+ + +
Sbjct: 593 VTVLD 597
>gi|195473731|ref|XP_002089146.1| GE18961 [Drosophila yakuba]
gi|194175247|gb|EDW88858.1| GE18961 [Drosophila yakuba]
Length = 672
Score = 169 bits (429), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 110/366 (30%), Positives = 175/366 (47%), Gaps = 57/366 (15%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + H+ +DG+ +SGS HY R+ P W ++ + GL+A++TYV W+ H P
Sbjct: 46 FAIDHEANTFMLDGQPFRYVSGSFHYFRAVPESWRSRLRTMRASGLNALDTYVEWSLHNP 105
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLH-NMPGIEELRT 145
+Y++ G D+++F++ Q++ Y+ILR GPY+CAE + GG P WL P I ++RT
Sbjct: 106 HDGEYNWEGIADVVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFAKYPSI-KMRT 164
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW- 204
+ +++E+ + + M + + LF GG II+ Q+ENEYG+ D+ Y+NW
Sbjct: 165 NDPNYISEVGKWYAEL--MPRLQHLFVGNGGKIIMVQVENEYGDYACDH-----DYLNWL 217
Query: 205 -------------------------CAKMA---TSLDIGVPWIMCQESDAPSPMFTPNNP 236
C K+ + D G+ I E D M P
Sbjct: 218 RDETEKYVSGKALLFTVDIPNEKMSCGKIENVFATTDFGIDRI--NEIDKIWAMLRALQP 275
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
P + +E + GW W ++ +R +++A A+ + + N YM+ GGTNFG T+G
Sbjct: 276 TGPLVNSEFYPGWLTHWQEQNQRRDGQEVANALRTILSYNASV-NLYMFFGGTNFGFTAG 334
Query: 297 GPY----------LTTSYDYDAPIDEYG------HLNQPKWGHLRELHKLLKSMEKTLTY 340
Y TSYDYDA +DE G L + G L ++ + K L Y
Sbjct: 335 ANYNLDGGIGYAADITSYDYDAVMDEAGGVTTKYDLVKAVIGEFLPLPEITLNPAKRLAY 394
Query: 341 GNVTNT 346
G V T
Sbjct: 395 GRVELT 400
Score = 43.9 bits (102), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 36/78 (46%), Gaps = 29/78 (37%)
Query: 609 LNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWY 668
LN+ G GKG A+VNG+NLGRYWP GP Q+
Sbjct: 590 LNMAGWGKGVAYVNGFNLGRYWPV---------------AGP--------------QVTL 620
Query: 669 HVPRSWIKDGVNTLVLFE 686
+VP ++ G N+LV+ E
Sbjct: 621 YVPNEILQVGENSLVILE 638
>gi|433679946|ref|ZP_20511609.1| beta-galactosidase [Xanthomonas translucens pv. translucens DSM
18974]
gi|430814938|emb|CCP42238.1| beta-galactosidase [Xanthomonas translucens pv. translucens DSM
18974]
Length = 615
Score = 169 bits (429), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 116/330 (35%), Positives = 164/330 (49%), Gaps = 41/330 (12%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V+ G T DG+ ++SG+IH+ R W D ++KA+ GL+ +ETYVFWN EP +
Sbjct: 33 VATQGDHFTRDGKPYQIISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVEPRQ 92
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
Q+DF+GN DL FI QGL VILR GPYVCAEW GG+P WL PG+ +R+ +
Sbjct: 93 GQFDFSGNNDLAAFIDAAAAQGLNVILRPGPYVCAEWEAGGYPAWLFAQPGL-RVRSQDP 151
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSD--YGDAGKSY----- 201
F+ Q + + K + GGP+I Q+ENEYG+ D Y A ++
Sbjct: 152 RFLAASQAYLDAVAAQVKPK--LNRNGGPVIAVQVENEYGSYDDDHVYMQANRTMFVKAG 209
Query: 202 ----INWCAKMATSLDIG-VPWIMCQESDAPS------PMFTPNNPNSPKIWTENWTGWF 250
+ + A A L G +P + + P + P P++ E W GWF
Sbjct: 210 FDKALLFTADGADVLANGTLPDTLAVVNFGPGDAEKAFQTLSKFRPGQPQMVGEYWAGWF 269
Query: 251 KSWGGK----DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY------- 299
WG K D K+ A + + + + G N YM+ GGT+FG +G +
Sbjct: 270 DQWGDKHANTDAKKQASEFEWILRQ-----GHSANIYMFVGGTSFGFMNGANFQKNASDH 324
Query: 300 ---LTTSYDYDAPIDEYGHLNQPKWGHLRE 326
TTSYDYDA +DE G PK+ R+
Sbjct: 325 YAPQTTSYDYDAVLDEAGRPT-PKFALFRD 353
Score = 42.4 bits (98), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 38/84 (45%), Gaps = 15/84 (17%)
Query: 559 GLYGLDDKKFYNAKAANSERGWSSKNVPLN---RRMTWYKTTFEAPLENDPVV------- 608
G GL D N K GW + ++P++ + W E P + V
Sbjct: 487 GRAGLVDPVLLNGKPLT---GWQTFSLPMDDPSKLTGWTTAKVEGPAFHRGTVKIATPTD 543
Query: 609 --LNLQGMGKGFAWVNGYNLGRYW 630
L++Q GKG AW NG+NLGR+W
Sbjct: 544 TFLDMQAFGKGVAWANGHNLGRHW 567
>gi|296216696|ref|XP_002807336.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
3-like [Callithrix jacchus]
Length = 652
Score = 169 bits (428), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 107/307 (34%), Positives = 155/307 (50%), Gaps = 38/307 (12%)
Query: 36 ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTG 95
T++G + ++ GSIHY R W D + K K G + + TYV WN HEP R ++DF+G
Sbjct: 79 FTLEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPERGRFDFSG 138
Query: 96 NLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQ 155
NLDL F+ + GL+VILR GPY+C+E + GG P WL P + LRTTNK F+ ++
Sbjct: 139 NLDLEAFVLMASEIGLWVILRPGPYICSEIDLGGLPSWLLQDPQL-LLRTTNKGFIEAVE 197
Query: 156 NFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIG 215
+ ++ + L QGGP+I Q+ENEYG+ D K Y+ + K I
Sbjct: 198 KYFDHLI--PRVIPLQYRQGGPVIAVQVENEYGSFNKD-----KKYMPYLHKAMLRRGI- 249
Query: 216 VPWIMCQESD-------APSPMFTPN---------------NPNSPKIWTENWTGWFKSW 253
V ++ + + + T N + P + E W GWF W
Sbjct: 250 VELLLTSDGEKNVLSGHTKGVLATINLQKLHRNTFSQLHKVQRDKPLLNMEYWVGWFDRW 309
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY------LTTSYDYD 307
K A+++ V+ F ++ +F N YM+HGGTNFG +G Y + TSYDYD
Sbjct: 310 XDKHHVTDAKEIEHTVSEFIKYEISF-NVYMFHGGTNFGFLNGATYFGKHAGVVTSYDYD 368
Query: 308 APIDEYG 314
A + E G
Sbjct: 369 AVLTEAG 375
>gi|333023172|ref|ZP_08451236.1| putative beta-galactosidase [Streptomyces sp. Tu6071]
gi|332743024|gb|EGJ73465.1| putative beta-galactosidase [Streptomyces sp. Tu6071]
Length = 588
Score = 169 bits (428), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 112/323 (34%), Positives = 154/323 (47%), Gaps = 46/323 (14%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
++VS +G ++DG LLSG++HY R P WP ++ + GL+ +ETYV WN HEP
Sbjct: 2 FQVSPEG--FSLDGRPLRLLSGALHYFRVLPEQWPHRLRMLRALGLNTVETYVPWNFHEP 59
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+DFTG DL F+ +D GL+ I+R PY+CAEW GG P WL P + LR
Sbjct: 60 RPGHHDFTGQADLDAFLHATRDAGLHAIVRPSPYICAEWENGGLPWWLLADPEVRALRCQ 119
Query: 147 NKVFMNEMQN-FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
+ ++ + + LI +A + ++GG +++ Q+ENEYG+ +D G Y+
Sbjct: 120 DPAYLAHVDRWYDALIPRLAAHQ---VTRGGNVVMMQVENEYGSYGTDTG-----YLEHL 171
Query: 206 AKMATSLDIGVPWIMCQESD--------APSPMFTPN---------------NPNSPKIW 242
A I VP D P + T N P+ P +
Sbjct: 172 ADGLRRRGIDVPLFTSDGPDDFFLTGGTLPGHLATVNFGSRPAQAFAGLKRLRPHDPPMC 231
Query: 243 TENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGR--------- 293
E W GWF WG R A + +A GG+ N YM HGGTNF
Sbjct: 232 AEFWCGWFDHWGAPRTVRDAAEATEELAATLGAGGSV-NVYMAHGGTNFSTWAGANTEDP 290
Query: 294 TSGGPYL--TTSYDYDAPIDEYG 314
+G YL TSYDYDAPIDE G
Sbjct: 291 ATGAGYLPTVTSYDYDAPIDERG 313
>gi|453049630|gb|EME97211.1| beta-galactosidase [Streptomyces mobaraensis NBRC 13819 = DSM
40847]
Length = 584
Score = 169 bits (428), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 111/323 (34%), Positives = 147/323 (45%), Gaps = 47/323 (14%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ V+ DG IDG LLSG++HY R G WP + + GL+ +ETYV WN HEP
Sbjct: 4 FTVAEDG--FRIDGREVRLLSGALHYFRVHEGHWPHRLAMLRAMGLNCVETYVPWNRHEP 61
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+ + G L RF+ GLY I+R GPYVCAEW GG P WL G +RT+
Sbjct: 62 VEGRLHDVGELG--RFLDAAGAAGLYAIVRPGPYVCAEWENGGLPHWLTGRLG-RRVRTS 118
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCA 206
+ F+ + + + + +GGP++L Q+ENEYG+ SD + Y+
Sbjct: 119 DPEFLRAVDGWLEAVGAELTGRQF--GRGGPVVLVQVENEYGSYGSD-----QPYLEHLV 171
Query: 207 KMATSLDIGVPWI--------MCQESDAPSPMFTPN---------------NPNSPKIWT 243
+ VP + M P T N P P +
Sbjct: 172 GRLRDSGVVVPLVTSDGPEDHMLTGGTVPGATATVNFGSGAREAFRVLRRHRPAGPLMCM 231
Query: 244 ENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL--- 300
E W GWF WGG R A + A A+ + G + N YM HGGTNFG +G
Sbjct: 232 EFWCGWFAHWGGAPAARDAGEAAEALREVLECGASV-NVYMAHGGTNFGGWAGANRAGAE 290
Query: 301 --------TTSYDYDAPIDEYGH 315
TTSYDYDAP+DEYG
Sbjct: 291 HRGALRPTTTSYDYDAPVDEYGR 313
Score = 42.0 bits (97), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 34/78 (43%), Gaps = 5/78 (6%)
Query: 553 KWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQ 612
++ + V GL F A R W S R Y+ F P D VL L
Sbjct: 470 QYVHGVRARGLRLAAFEEEGALAVVR-WRSAG---GREPGLYRGAFRVPEAGD-AVLRLP 524
Query: 613 GMGKGFAWVNGYNLGRYW 630
G +GF WVNG+ LGRYW
Sbjct: 525 GWERGFVWVNGFCLGRYW 542
>gi|432894411|ref|XP_004075980.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Oryzias
latipes]
Length = 640
Score = 169 bits (428), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 117/329 (35%), Positives = 156/329 (47%), Gaps = 47/329 (14%)
Query: 32 DGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQY 91
D T++ + ++L GSIHY R W D + K K GL+ + TYV WN HEP R +
Sbjct: 51 DSSNFTLERKPFLILGGSIHYFRVPKAYWEDRLLKLKACGLNTLTTYVPWNLHEPERGVF 110
Query: 92 DFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWL---HNMPGIEELRTTNK 148
DF G LDL ++ G++VILR GPY+CAEW+ GG P WL NM LRTT
Sbjct: 111 DFEGELDLEAYLGLAASLGIWVILRPGPYICAEWDLGGLPSWLLRDQNM----RLRTTYP 166
Query: 149 VFMNEMQN-FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
F + + F LI +A + S+GGPII Q+ENEYG+ D + Y+ + +
Sbjct: 167 GFTAAVDSYFDHLIKKVAPYQ---YSRGGPIIAVQVENEYGSYAMD-----EEYMPFIKE 218
Query: 208 MATSLDI--------------------GVPWIMCQESDAPSPMFTPN-NPNSPKIWTENW 246
S I + I Q+ D + P PK+ E W
Sbjct: 219 ALLSRGITELLVTSDNKDGLKLGGVKGALETINFQKLDPEEIKYLEKIQPQKPKMVMEYW 278
Query: 247 TGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------P 298
+GWF WGG AE++ V + + N YM+HGGTNFG SG
Sbjct: 279 SGWFDLWGGLHHVFPAEEMMAVVTEILKLDMSI-NLYMFHGGTNFGFMSGAFAVGRPSPA 337
Query: 299 YLTTSYDYDAPIDEYGHLNQPKWGHLREL 327
+ TSYDYDAP+ E G K+ LR L
Sbjct: 338 PMVTSYDYDAPLSEAGDYTT-KYHLLRNL 365
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 46/187 (24%), Positives = 68/187 (36%), Gaps = 46/187 (24%)
Query: 502 RGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLY 561
+GK + LL G NYG D G+ G + L + I++D H
Sbjct: 479 KGKRTLGLLVENCGRVNYGKTLDEQRKGLVGDIQL-----NANILRDFMIH--------- 524
Query: 562 GLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWV 621
LD K + ++ +S + S + P +++T L L G KG +V
Sbjct: 525 SLDMKPDFVSRLQSSAQWKSMREKP--SFPAFFQTKLYLSSSPKDTFLKLPGWSKGVVFV 582
Query: 622 NGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNT 681
NG NLGRYW P Q Y VP +W+ N
Sbjct: 583 NGKNLGRYWSV-----------------------------GPQQTLY-VPGAWLNRWDNE 612
Query: 682 LVLFEEF 688
+++FEE
Sbjct: 613 IIVFEEL 619
>gi|339640120|ref|ZP_08661564.1| glycosyl hydrolase family 35 [Streptococcus sp. oral taxon 056 str.
F0418]
gi|339453389|gb|EGP66004.1| glycosyl hydrolase family 35 [Streptococcus sp. oral taxon 056 str.
F0418]
Length = 595
Score = 169 bits (428), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 109/336 (32%), Positives = 162/336 (48%), Gaps = 48/336 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +LSG+I Y R P W + + K G + +ETY+ W+ HEP Q+ G L
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRETLHNLKALGYNTVETYIPWSLHEPQEGQFVTDGLL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D + +Q+ GL++I+R PY+CAE+++GG P WL N PG+ R + +F+ ++ F
Sbjct: 72 DFEAYFDLVQEMGLHLIVRPTPYICAEFDFGGMPPWLLNYPGM-RFRVNDALFLEKVSRF 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ + ++GGPI++ Q+ENEYG+ D K Y+ AKM + VP
Sbjct: 131 YDWLFPKLLPYQF--TEGGPILMMQVENEYGSYAED-----KEYMRNIAKMMRDRGVSVP 183
Query: 218 -------WIMCQESDA--PSPMFTPNNPNS--------------------PKIWTENWTG 248
WI ES +F N S P + TE W G
Sbjct: 184 LFTSDGTWIEALESGTLIEDDIFVTGNFGSQAKENTDNLRAFMERHGKKWPLMCTEFWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYL 300
WF WG + +R AEDLA V + G N ++ GGTNFG SG P +
Sbjct: 244 WFSRWGEEIVRRDAEDLAQDVKEMMRIGSM--NLFLLRGGTNFGFISGCSARKTRDLPQI 301
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
TSYD+DAP+ E+G + + R H+L +E+
Sbjct: 302 -TSYDFDAPVTEWGVPTEKYYAVQRVTHELFPELEQ 336
>gi|427392896|ref|ZP_18886799.1| hypothetical protein HMPREF9698_00605 [Alloiococcus otitis ATCC
51267]
gi|425730982|gb|EKU93810.1| hypothetical protein HMPREF9698_00605 [Alloiococcus otitis ATCC
51267]
Length = 597
Score = 169 bits (428), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 183/706 (25%), Positives = 283/706 (40%), Gaps = 174/706 (24%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DGE LSG+IHY R W + K G + +ETYV WN HEP +DF+GNL
Sbjct: 12 LDGEPFQFLSGAIHYFRIPRADWHHSLYNLKALGFNTVETYVPWNVHEPEPGHFDFSGNL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL--RTTNKVFMNEMQ 155
D+ FIK ++ GLYVILR PY+CAEW YGG P W+ N E+L R+++ F+ +
Sbjct: 72 DVKAFIKEAEELGLYVILRPSPYICAEWEYGGLPGWIIN----EDLHPRSSDPAFLELVD 127
Query: 156 NFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIG 215
F + + L + GGPI++ QIENEYG+ D K Y+ +
Sbjct: 128 KFFARL--FKEVGDLQFTHGGPILMMQIENEYGSYGED-----KDYLKGVYDSMKAHGAD 180
Query: 216 VP-------WIMCQE----SDAPSPMFTPNNPNS--------------------PKIWTE 244
VP W+ +D + N S P + E
Sbjct: 181 VPLCTSDGAWLATLRAGTLTDIDEDILITGNFGSKAKENFGNLKDFHDKIGKEWPLMVME 240
Query: 245 NWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGG 297
W GWF WG R ++L A+ Q G N YM+ GGTNFG R +
Sbjct: 241 FWCGWFNRWGEPIVTRETDELVEALREAVQLGSV--NLYMFQGGTNFGFMNGCSARGTHD 298
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSY 357
+ TSYDY AP+DE G+ + + + K++K + D + S
Sbjct: 299 LHQITSYDYGAPLDEQGNPTEKYYA----IQKMIKE--------EFPDIDQAEPLVKES- 345
Query: 358 NLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRG 417
TA+ N Q KV + DQ + ++ R
Sbjct: 346 -------------------TAQENVQLEAKVNLVDSL--DQV-------ADRVDSLYTRS 377
Query: 418 KGHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNG 477
+ Y+ Y T+ +KD D LR+ H ++N
Sbjct: 378 MDELGQHY-----------GYILYQTDF-VKDVD------EEERLRVIDGRDRAHVFLND 419
Query: 478 NYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF--DMVPNGIPGPVL 535
++ +Q+ + D+ P++ + N++ +L +G NYG K D GI
Sbjct: 420 QHLATQYQE-EIGEDITTGPLEES---NKLDVLVENMGRVNYGHKLLADTQEKGI----- 470
Query: 536 LVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYK 595
R G + + L++ W + + D+ Y+ + E +P ++YK
Sbjct: 471 ---RQGVTSDLHFLTN--WRQYLIDFDRVDQIDYSLEKDFKE------GLP-----SFYK 514
Query: 596 --TTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGS 653
TF+ LE+ ++L GKG VNG+NLGR+W GP S
Sbjct: 515 FNVTFD-DLED--TYIDLSDFGKGIVLVNGHNLGRFWDL----------------GPTLS 555
Query: 654 DKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
++P++++K+GVN + +FE G ++F+
Sbjct: 556 --------------LYLPKAFLKEGVNEVTIFETEGKYAPNLSFKA 587
>gi|62319263|dbj|BAD94489.1| beta-galactosidase [Arabidopsis thaliana]
Length = 172
Score = 169 bits (428), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 80/153 (52%), Positives = 101/153 (66%), Gaps = 12/153 (7%)
Query: 208 MATSLDIGVPWIMCQESDAPSPM-----------FTPNNPNSPKIWTENWTGWFKSWGGK 256
MA L GVPWIMC++ DAP P+ F PN+ N PK+WTENWTGW+ +GG
Sbjct: 1 MALGLSTGVPWIMCKQEDAPGPIIDTCNGYYCEDFKPNSINKPKMWTENWTGWYTDFGGA 60
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPIDEYGHL 316
P R ED+A++VARF Q GG+ NYYMYHGGTNF RT+ G ++ +SYDYDAP+DEYG
Sbjct: 61 VPYRPVEDIAYSVARFIQKGGSLVNYYMYHGGTNFDRTA-GEFMASSYDYDAPLDEYGLP 119
Query: 317 NQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG 349
+PK+ HL+ LHK +K E L + T T G
Sbjct: 120 REPKYSHLKALHKAIKLSEPALLSADATVTSLG 152
>gi|319900291|ref|YP_004160019.1| Beta-galactosidase [Bacteroides helcogenes P 36-108]
gi|319415322|gb|ADV42433.1| Beta-galactosidase [Bacteroides helcogenes P 36-108]
Length = 629
Score = 169 bits (427), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 113/326 (34%), Positives = 167/326 (51%), Gaps = 39/326 (11%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G++ +LSG +HY R W ++ K GL+A+ TYVFWN HE ++DFTG+
Sbjct: 38 LNGKQTPILSGEMHYARIPHQYWRHRLQMMKGMGLNAVATYVFWNHHETEPGKWDFTGDK 97
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
+L +IKT ++G+ VILR GPYVCAEW +GG+P WL N+PG+ E+R N F+ + +
Sbjct: 98 NLAEYIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVPGM-EIRRDNPQFLKHTEAY 156
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAG-KSYINWCAKMATSL-DIG 215
+ L ++GGPI++ Q ENE+G+ ++ D + + + AK+ L D G
Sbjct: 157 IQRLYKEVG--HLQCTKGGPIVMVQCENEFGSYVAQRKDITLQEHRAYNAKIKQQLADAG 214
Query: 216 --VP-------WIM---CQESDAPSPMFTPNNPN------------SPKIWTENWTGWFK 251
VP W+ E P+ + N P + E + GW
Sbjct: 215 FDVPLFTSDGSWLFEGGSTEGALPTANGETDIANLKKVVNQYHGGQGPYMVAEFYPGWLS 274
Query: 252 SWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT--------TS 303
W P+ +A +A + + +F N YM HGGTNFG TSG Y TS
Sbjct: 275 HWAEPFPQVSASSVARTTESYLKNDVSF-NVYMVHGGTNFGFTSGANYDKKRDIQPDLTS 333
Query: 304 YDYDAPIDEYGHLNQPKWGHLRELHK 329
YDYDAPI E G + PK+ +R + K
Sbjct: 334 YDYDAPISEAGWVT-PKYDSIRAVIK 358
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 47/188 (25%), Positives = 70/188 (37%), Gaps = 46/188 (24%)
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDK 566
+ +L +G NYGS+ GI PV + G+ + W +Y L
Sbjct: 470 LQILVENMGRINYGSEIVHNTKGIISPVTIGGKE---------ITGGWN----MYPLPMS 516
Query: 567 KFYNAKAA--NSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGY 624
K A A N+ S++ L Y+ TF D ++++ GKG +VNG
Sbjct: 517 KAPEAAKAGRNAYPNTSAQAGKLKGSPVAYEGTFTLNRTGD-TFIDMEDWGKGIIFVNGI 575
Query: 625 NLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVL 684
N+GRYW P Q Y +P W+K G N +V+
Sbjct: 576 NIGRYW-----------------------------QAGPQQTLY-IPGVWLKKGENKIVI 605
Query: 685 FEEFGGNP 692
FE+ P
Sbjct: 606 FEQLNEKP 613
>gi|433461907|ref|ZP_20419504.1| beta-galactosidase [Halobacillus sp. BAB-2008]
gi|432189486|gb|ELK46587.1| beta-galactosidase [Halobacillus sp. BAB-2008]
Length = 579
Score = 169 bits (427), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 107/301 (35%), Positives = 153/301 (50%), Gaps = 36/301 (11%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+LSG+IHY R+ P W D ++K K GL+ +ETYV WN HEP R +++F+G D+ FI+
Sbjct: 18 ILSGAIHYFRTVPEHWEDRLEKLKALGLNTVETYVPWNLHEPRRGEFEFSGLADIEGFIQ 77
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
T D GLYVI+R PY+CAEW GG P WL + +R+++ V+++ ++++ + +
Sbjct: 78 TAADLGLYVIVRPAPYICAEWEMGGLPSWLLKDKDV-VMRSSDPVYLSYVESYYKEL--L 134
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK------MATSLDIGVPW 218
K GGPII QIENEYG +D + Y+ + K + T L
Sbjct: 135 PKFVPHLYQNGGPIIAMQIENEYGAYGND-----QKYLTFLKKQYEQHGLDTFLFTSDGP 189
Query: 219 IMCQESDAPSPMFTPN---------------NPNSPKIWTENWTGWFKSWGGKDPKRTAE 263
++ P T N SPK+ E W GWF W G+ R A
Sbjct: 190 DFIEQGSLPDVTTTLNFGSKVEQAFERLDAFKTGSPKMVAEFWIGWFDYWTGEHHTRDAG 249
Query: 264 DLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYDYDAPIDEYGHLN 317
D A AV R N+YM+HGGTNFG +G + TSYDYD+ + E G +
Sbjct: 250 DAA-AVFRELMERKASVNFYMFHGGTNFGFMNGANHYDVYYPTITSYDYDSLLTESGAIT 308
Query: 318 Q 318
+
Sbjct: 309 E 309
Score = 40.0 bits (92), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 30/45 (66%), Gaps = 1/45 (2%)
Query: 588 NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPT 632
+R +++ TF+AP +D + + +G KG +VNG+NLGRYW T
Sbjct: 488 SRYPKFFRGTFDAPGRHDTYI-DSEGFTKGNLFVNGFNLGRYWNT 531
>gi|384513478|ref|YP_005708571.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|430361754|ref|ZP_19426831.1| putative beta-galactosidase [Enterococcus faecalis OG1X]
gi|327535367|gb|AEA94201.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|429512307|gb|ELA01915.1| putative beta-galactosidase [Enterococcus faecalis OG1X]
Length = 604
Score = 169 bits (427), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 108/332 (32%), Positives = 159/332 (47%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 22 LNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 81
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 82 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 139
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L GG I++ QIENEYG+ +G+ K+Y+ + + +
Sbjct: 140 YDVLMEKIVPHQLV--NGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAL 192
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 193 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 249
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 250 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 307
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 308 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|224152391|ref|XP_002337230.1| predicted protein [Populus trichocarpa]
gi|222838524|gb|EEE76889.1| predicted protein [Populus trichocarpa]
Length = 144
Score = 169 bits (427), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 77/142 (54%), Positives = 99/142 (69%), Gaps = 2/142 (1%)
Query: 10 AILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
A+ L F L A VS+D R++ I+GERK+L+S +IHYPRS P MWP+L+K AKE
Sbjct: 2 ALGLIFFFSLCFTLCFAGNVSYDSRSLIINGERKLLISAAIHYPRSVPAMWPELVKTAKE 61
Query: 70 GGLDAIETYVFWNAHEPLR-RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYG 128
GG+D IETYVFWN H+P +Y F G DL++FI +Q+ G+Y+ILRIGP+V AEWN+G
Sbjct: 62 GGVDVIETYVFWNVHQPTSPSEYHFDGRFDLVKFINIVQEAGMYLILRIGPFVAAEWNFG 121
Query: 129 GFPVWLHNMPGIEELRTTNKVF 150
G PVWLH + G RT N F
Sbjct: 122 GIPVWLHYVNG-TVFRTDNYNF 142
>gi|430368510|ref|ZP_19428251.1| beta-galactosidase [Enterococcus faecalis M7]
gi|429516266|gb|ELA05760.1| beta-galactosidase [Enterococcus faecalis M7]
Length = 594
Score = 169 bits (427), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 108/332 (32%), Positives = 159/332 (47%), Gaps = 55/332 (16%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ +LSG+IHY R P W + K G + +ETYV WN HEP + + F G L
Sbjct: 12 LNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGIL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RF+K Q+ GLY I+R PY+CAEW +GGFP WL N PG +R+ N ++ + +
Sbjct: 72 DLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG--RMRSNNPTYLKHVAEY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++++ +L GG I++ QIENEYG+ +G+ K+Y+ + + +
Sbjct: 130 YDVLMEKIVPHQLV--NGGNILMIQIENEYGS----FGEE-KAYLRAIRDLMIARGVTAL 182
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
+ SD P F + P + E
Sbjct: 183 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEF 239
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF W KR ++LA +V G N YM+HGGTNFG +G
Sbjct: 240 WDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDL 297
Query: 298 PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHK 329
P + TSYDYDAP+DE G+ + + + LH+
Sbjct: 298 PQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|170034400|ref|XP_001845062.1| beta-galactosidase [Culex quinquefasciatus]
gi|167875695|gb|EDS39078.1| beta-galactosidase [Culex quinquefasciatus]
Length = 611
Score = 168 bits (426), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 116/339 (34%), Positives = 174/339 (51%), Gaps = 42/339 (12%)
Query: 25 LAYRVSHDGR------AITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETY 78
+++R HD +DGE +SGS HY R+ PG W +++ + GL+A+ TY
Sbjct: 1 MSFRYQHDHSIDYERDTFLLDGEPFRFISGSFHYFRALPGSWRHILRAMRAAGLNAVMTY 60
Query: 79 VFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVW-LHNM 137
+ W+ HEP Y + DL +FI+ +++ LYVILR GPY+CAE + GGFP W L
Sbjct: 61 IEWSTHEPTEGDYRWNEIADLEQFIRIAEEENLYVILRPGPYICAERDMGGFPYWLLTKF 120
Query: 138 PGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNV------- 190
P I +LRT + +M E+Q + +++ M + +K +GGP+I+ IENEYG+
Sbjct: 121 PNI-KLRTQDSDYMREVQKWYSVL--MPRIQKYLYGRGGPVIMVSIENEYGSFSACDKTY 177
Query: 191 MSDYGDAGKSYINWCAKMATS-----LDIG-VPWIMCQ----ESDAPS---PMFTPNNPN 237
+ + +SYI + A + T+ L+ G +P I+ + +P P
Sbjct: 178 LKFLKNMTESYIQYDAVLFTNDGPEQLNCGRIPGILATLDFGSTGSPERYWQKLRKVQPK 237
Query: 238 SPKIWTENWTGWFKSWGGKDP-KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
P + E + GW W +P RTA R G N+YM+ GGTNF T+G
Sbjct: 238 GPLVNAEFYPGWLTHW--MEPMARTATGPVVDTLRLMLNQGANVNFYMFFGGTNFAFTAG 295
Query: 297 ------GPYLT--TSYDYDAPIDEYGHLNQPKWGHLREL 327
G + T TSYDYDAP+DE G PK+ LR++
Sbjct: 296 ANDGGPGKFNTDITSYDYDAPLDEAGD-PTPKYFALRDV 333
Score = 46.2 bits (108), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 63/217 (29%), Positives = 87/217 (40%), Gaps = 51/217 (23%)
Query: 472 HAYVNGNYVDSQWTK-YGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGI 530
AYV +VD+Q+ N + P+ L +GK Q+ LL G NYG D GI
Sbjct: 422 RAYV---HVDNQFIGVLSRENAIDTLPISLGQGK-QLQLLVENQGRINYGIANDF--KGI 475
Query: 531 PGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRR 590
GPV L G+E + WT + + LDD + N G+ S+
Sbjct: 476 LGPVTL---DGNELL-------NWT--MTGFPLDDYSLL-SNYLNQFSGYDSEQA-RQAS 521
Query: 591 MTWYKTTFEAPLEN-DPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRG 649
+ ++ F E L+ G GKG A +NG+NLGRYWP G
Sbjct: 522 VRIFRGHFTITNEEIHDTYLDPSGWGKGLAIINGFNLGRYWPL---------------AG 566
Query: 650 PYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
P Q+ +VPR + G N LV+ E
Sbjct: 567 P--------------QVTLYVPRHILMQGKNELVVIE 589
>gi|167856235|ref|ZP_02478970.1| beta-galactosidase [Haemophilus parasuis 29755]
gi|167852655|gb|EDS23934.1| beta-galactosidase [Haemophilus parasuis 29755]
Length = 596
Score = 168 bits (426), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 106/319 (33%), Positives = 158/319 (49%), Gaps = 48/319 (15%)
Query: 34 RAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDF 93
+ ++G+ +LSG++HY R P W + K G + +ETYV WN H+P Q++F
Sbjct: 8 KDFLLNGKPFKILSGAVHYFRIVPEYWYKTLYNLKAMGCNTVETYVPWNLHQPQPDQFNF 67
Query: 94 TGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNE 153
+ DL++F++T +D GLYVILR PY+CAEW +GG P WL N+P I LR + +F+ E
Sbjct: 68 SKRADLVKFLQTAKDLGLYVILRPTPYICAEWEFGGLPAWLLNIPNI-RLRQNDPLFIAE 126
Query: 154 MQN-FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSL 212
+ F L+ +A + +QGG I++ QIENEYG+ +D K+Y+ +
Sbjct: 127 IDRYFQELLPRIAPYQ---ITQGGNILMMQIENEYGSFGND-----KNYLRAILALMLIH 178
Query: 213 DIGVPWI--------------MCQESDAPSPMF-TPNNPN--------------SPKIWT 243
+ VP + ++ P+ F + +N N P +
Sbjct: 179 GVNVPLFTSDGAWQNALEAGALIEDDILPTGNFGSRSNENLDELQRYIDKHGKSYPLMCM 238
Query: 244 ENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSG 296
E W GWF W +R A+DLA + N+YM+ GGTNFG R
Sbjct: 239 EFWDGWFNRWKEPVIRRDAQDLADCTKELLERASI--NFYMFQGGTNFGFWNGCSARLDT 296
Query: 297 GPYLTTSYDYDAPIDEYGH 315
TSYDYDAP+ E+G
Sbjct: 297 DLPQVTSYDYDAPVHEWGE 315
>gi|373460889|ref|ZP_09552639.1| hypothetical protein HMPREF9944_00903 [Prevotella maculosa OT 289]
gi|371954714|gb|EHO72523.1| hypothetical protein HMPREF9944_00903 [Prevotella maculosa OT 289]
Length = 780
Score = 168 bits (426), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 107/312 (34%), Positives = 153/312 (49%), Gaps = 29/312 (9%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G ++ + +HYPR W IK K G++ + YVFWN HE Q+DFTGN
Sbjct: 37 LNGRPFVIKAAELHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQREGQFDFTGNN 96
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+ F + G+YVI+R GPYVCAEW GG P WL + LR + FM ++ F
Sbjct: 97 DVAAFCRLAHKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDV-RLREDDPYFMARVKAF 155
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGN------VMSDYGDAGKS---------YI 202
+ + L GGPII+ Q+ENEYG+ +S+ D K+
Sbjct: 156 EAEV--GRQLAPLTIQNGGPIIMVQVENEYGSYGINKKYVSEIRDIVKASGFDKVTLFQC 213
Query: 203 NWCAKMATSLDIGVPWIM----CQESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGKDP 258
+W + + + W M D P +P + +E W+GWF WG +
Sbjct: 214 DWASNFEHNGLDDLVWTMNFGTGANIDEQFRRLKQLRPEAPLMCSEFWSGWFDKWGARHE 273
Query: 259 KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYL---TTSYDYDAPIDEY 313
R A+D+ + + G +F + YM HGGT+FG +G P TSYDYDAPI+EY
Sbjct: 274 TRPAKDMVEGIDEMLRKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEY 332
Query: 314 GHLNQPKWGHLR 325
G + PK+ LR
Sbjct: 333 G-MPTPKFFALR 343
>gi|257870316|ref|ZP_05649969.1| glycosyl hydrolase [Enterococcus gallinarum EG2]
gi|257804480|gb|EEV33302.1| glycosyl hydrolase [Enterococcus gallinarum EG2]
Length = 593
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 112/325 (34%), Positives = 153/325 (47%), Gaps = 51/325 (15%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
LLSG+IHY R P W + K G + +ETYV WN HEP + + F G LDL RF+
Sbjct: 19 LLSGAIHYFRVHPDDWEHSLYNLKALGFNTVETYVPWNLHEPHKGLFQFEGILDLERFLS 78
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
Q+ GLYVILR PY+CAEW +GG P WL G LR + ++ + + +++
Sbjct: 79 LAQELGLYVILRPSPYICAEWEFGGLPAWLLKESG--RLRACDPSYLAHVAEYYDVLLPK 136
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQ-- 222
+L S GG I++ Q+ENEYG+ YG+ K+Y+ +M + I +P
Sbjct: 137 IIPYQL--SHGGNILMIQVENEYGS----YGEE-KAYLRAIKEMLINRGIDMPLFTSDGP 189
Query: 223 -----------ESD----------------APSPMFTPNNPNSPKIWTENWTGWFKSWGG 255
E D A F +N P + E W GWF W
Sbjct: 190 WQAALRAGSLIEDDVLVTGNFGSRAKENFAAMQDFFDQHNKKWPLMCMEFWDGWFNRWNE 249
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-------TTSYDYDA 308
+R +DLA +V + G N YM+HGGTNFG +G TSYDYDA
Sbjct: 250 PIIRRDPDDLAESVKEALEIGSV--NLYMFHGGTNFGFMNGCSARGAVDLPQVTSYDYDA 307
Query: 309 PIDEYGHLNQPKWGHLRELHKLLKS 333
P+DE G+ + L K+LK
Sbjct: 308 PLDEQGNPTAKYYA----LQKMLKE 328
>gi|195434721|ref|XP_002065351.1| GK15404 [Drosophila willistoni]
gi|194161436|gb|EDW76337.1| GK15404 [Drosophila willistoni]
Length = 673
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 107/365 (29%), Positives = 173/365 (47%), Gaps = 54/365 (14%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + H+ +DGE +SGS HY R+ P W ++ + GL+A++TY+ W+ H P
Sbjct: 48 FTIDHEANTFLLDGEPFQYVSGSFHYFRALPDAWRSRLQTMRASGLNALDTYIEWSLHNP 107
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
Y++ G D+++F++ Q++G Y++LR GPY+CAE + GG P WL ++RT
Sbjct: 108 HDGVYNWEGIADVVKFLEMAQEEGFYIVLRPGPYICAERDNGGLPHWLFTKYPNIKVRTN 167
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW-- 204
+ ++ E++ + ++ M + + LF GG II+ Q+ENEYG + Y+NW
Sbjct: 168 DSNYLAEVEKWYDIL--MPRIQHLFIGNGGKIIMVQVENEYGA----FDACDHDYLNWLR 221
Query: 205 ------------------------CAKMA---TSLDIGVPWIMCQESDAPSPMFTPNNPN 237
C K+ + D G+ I E D M P
Sbjct: 222 DETEKHVSGNALLFTVDIPNERMSCGKIENVFATTDFGIDRI--HEIDEIWKMLRNLQPT 279
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
P + +E + GW W ++ +R + +A A+ + + N YM+ GGTNFG T+G
Sbjct: 280 GPLVNSEFYPGWLTHWQEENQRRDGQVVADALKTILSYNASV-NLYMFFGGTNFGFTAGA 338
Query: 298 PY----------LTTSYDYDAPIDEYG------HLNQPKWGHLRELHKLLKSMEKTLTYG 341
+ TSYDYDA +DE G L + G L L ++ + K L+YG
Sbjct: 339 NWNLDGGIGYAADVTSYDYDAVMDEAGGVTSKYELVKKAIGELWTLPEITLTPAKRLSYG 398
Query: 342 NVTNT 346
V T
Sbjct: 399 KVELT 403
Score = 42.7 bits (99), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 17/24 (70%), Positives = 20/24 (83%)
Query: 609 LNLQGMGKGFAWVNGYNLGRYWPT 632
LN+ G GKG A+VNG+NLGRYWP
Sbjct: 592 LNMAGWGKGVAYVNGFNLGRYWPV 615
>gi|257875465|ref|ZP_05655118.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC20]
gi|257809631|gb|EEV38451.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC20]
Length = 585
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 173/695 (24%), Positives = 273/695 (39%), Gaps = 164/695 (23%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+D + ++SG+IHY R P W D ++K + G + +ETYV WN HE Y F G L
Sbjct: 12 LDNKPFKVISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFDGIL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RFI+T Q+ GLYVILR PY+CAEW +GG P WL P + +LR FM ++ +
Sbjct: 72 DLRRFIQTAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDP-MMKLRFDYPPFMEKITRY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSL----- 212
+ + ++ +QGGPII+ Q+ENEYG+ +D K Y+ KM ++
Sbjct: 131 FAHLFPQVRDLQI--TQGGPIIMMQVENEYGSYAND-----KEYLR---KMVAAMRQHGV 180
Query: 213 ---------------------DIGVPWIMCQES--DAPSPMFTPNNPNSPKIWTENWTGW 249
D+ +P I C + + + + P + E W GW
Sbjct: 181 ETPLVTSDGPWHDMLENGSIKDLALPTINCGSNIKENFEKLRKFHGEKRPLMVMEFWIGW 240
Query: 250 FKSWGGKDPKRTA-EDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDA 308
F +WG T+ +D + G N YM+HGGTNFG +G Y Y+ A
Sbjct: 241 FDAWGDDQHHTTSIQDAVKELQDCLALGSV--NIYMFHGGTNFGFMNGSNY----YERLA 294
Query: 309 PIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILP 368
P +VT+ DY
Sbjct: 295 P--------------------------------DVTSYDY-------------------- 302
Query: 369 DCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLID 428
D E+ Q KV + A + PL K + F VR + +L + ID
Sbjct: 303 DALLTEWGEPTAKYQAFKKVI-ADYAEIPEFPLSMKIERKAYGTFSVRER--VSLFSTID 359
Query: 429 QKSTNDVSDYLWYMTNAD-----LKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQ 483
S +S+Y M + + I R+ ++ H ++N +
Sbjct: 360 TISQPIISNYPLSMEACNQATGYIYYRSLIGPARKIADYRLINTMDRAHTFINQELLRID 419
Query: 484 WTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDE 543
+ + F+ L+ KN++ +L +G NY K + GI V++ G
Sbjct: 420 YDQEIGQTYSFD----LSESKNELGILVENMGRVNYSVKMNHQHKGIKDGVIING----- 470
Query: 544 TIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLE 603
+ + +++ +D N A + + W +R ++ F+ E
Sbjct: 471 -------AFQSNWEIYPLPMD-----NLHAIDFQGKWQKGQPSFSR----FECVFD---E 511
Query: 604 NDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNP 663
+ L G GKGF VNG+ +GR+W +GP
Sbjct: 512 CADTFIELPGWGKGFVQVNGHTIGRFWE----------------KGP------------- 542
Query: 664 SQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQ 698
Q +VP ++K G+N +++FE G +I F
Sbjct: 543 -QQRLYVPAPFLKTGMNEIIVFESDGKIADEIVFH 576
>gi|219870459|ref|YP_002474834.1| beta-galactosidase [Haemophilus parasuis SH0165]
gi|219690663|gb|ACL31886.1| beta-galactosidase, glucosyl hydrolase family protein [Haemophilus
parasuis SH0165]
Length = 596
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 106/319 (33%), Positives = 158/319 (49%), Gaps = 48/319 (15%)
Query: 34 RAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDF 93
+ ++G+ +LSG++HY R P W + K G + +ETYV WN H+P Q++F
Sbjct: 8 KDFLLNGKPFKILSGAVHYFRIVPEYWYKTLYNLKAMGCNTVETYVPWNLHQPQPDQFNF 67
Query: 94 TGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNE 153
+ DL++F++T +D GLYVILR PY+CAEW +GG P WL N+P I LR + +F+ E
Sbjct: 68 SKRADLVKFLQTAKDLGLYVILRPTPYICAEWEFGGLPAWLLNIPNI-RLRQNDPLFIAE 126
Query: 154 MQN-FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSL 212
+ F L+ +A + +QGG I++ QIENEYG+ +D K+Y+ +
Sbjct: 127 IDRYFQELLPRIAPYQ---ITQGGNILMMQIENEYGSFGND-----KNYLRAIRALMLIH 178
Query: 213 DIGVPWI--------------MCQESDAPSPMF-TPNNPN--------------SPKIWT 243
+ VP + ++ P+ F + +N N P +
Sbjct: 179 GVNVPLFTSDGAWQNALEAGALIEDDILPTGNFGSRSNENLDELQRYIDKHGKSYPLMCM 238
Query: 244 ENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSG 296
E W GWF W +R A+DLA + N+YM+ GGTNFG R
Sbjct: 239 EFWDGWFNRWKEPVIRRDAQDLANCTKELLERASI--NFYMFQGGTNFGFWNGCSARLDT 296
Query: 297 GPYLTTSYDYDAPIDEYGH 315
TSYDYDAP+ E+G
Sbjct: 297 DLPQVTSYDYDAPVHEWGE 315
>gi|257067624|ref|YP_003153879.1| beta-galactosidase [Brachybacterium faecium DSM 4810]
gi|256558442|gb|ACU84289.1| beta-galactosidase [Brachybacterium faecium DSM 4810]
Length = 631
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 110/328 (33%), Positives = 161/328 (49%), Gaps = 34/328 (10%)
Query: 40 GERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDL 99
G+ +++SG++HY R P W D +++ G + +ETYV WN H+P R F G DL
Sbjct: 16 GDPHLIVSGALHYFRIHPEQWRDRLRRLVVMGCNTVETYVAWNIHQPSREVTTFEGFADL 75
Query: 100 IRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQN-FT 158
RF+ ++GL I+R GPY+CAEW GGFP W+ + LR N ++ + F
Sbjct: 76 GRFLDIAAEEGLDAIVRPGPYICAEWENGGFPGWILADRNL-RLRNRNAAYLQLVDAWFD 134
Query: 159 TLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK-----MATSLD 213
LI +A+++ A +GG +++ Q+ENEYG+ D A+ + TS
Sbjct: 135 QLIPVIAQRQ---AGRGGNVVMVQVENEYGSFGDDTAYLAHLRDGLVARGIEELLVTSDG 191
Query: 214 IGVPWIMCQESDAP-------------SPMFTPNNPNSPKIWTENWTGWFKSWGGKDPKR 260
W+ D M P+ P++ E W GWF WG + +R
Sbjct: 192 PARMWLTGGTVDGALGTVNFGSRTLEVLAMAERELPDQPQMCMEFWNGWFDHWGEEHHER 251
Query: 261 TAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY------LTTSYDYDAPIDEYG 314
T D A +A + G + N+YM HGGTNFG +G + TTSYDYDAPI E G
Sbjct: 252 TGGDAAGELADMLEHGMSV-NFYMAHGGTNFGMQAGANHDGTLQPTTTSYDYDAPIAENG 310
Query: 315 HLNQPKWGHLREL---HKLLKSMEKTLT 339
L K+ RE+ H+ L + E+ L
Sbjct: 311 ALTD-KFRAFREVVAAHRELPAYEEHLA 337
>gi|148231352|ref|NP_001080304.1| galactosidase, beta 1-like 2 [Xenopus laevis]
gi|28422231|gb|AAH46858.1| Loc89944-prov protein [Xenopus laevis]
Length = 634
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 111/311 (35%), Positives = 154/311 (49%), Gaps = 39/311 (12%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+L GS+HY R W D +KK K G++ + TYV WN HEP + ++DF+ +LD+ F+
Sbjct: 60 ILGGSMHYFRVPMPYWRDRMKKMKACGINTLTTYVPWNLHEPRKGKFDFSKDLDISEFLA 119
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFT-TLIVD 163
+ GL+VILR GPY+CAEW+ GG P WL + +LRTT + F + + LI
Sbjct: 120 IASEMGLWVILRPGPYICAEWDLGGLPSWLLRDKDM-KLRTTYRGFTEATEAYLDELIPR 178
Query: 164 MAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQE 223
+AK + S GGPII Q+ENEYG+ D +Y+ + I +
Sbjct: 179 IAKYQ---YSNGGPIIAVQVENEYGSYAKD-----ANYMEFIKNALVEKGIVELLLTSDN 230
Query: 224 SDAPSP------------------MFTPNN---PNSPKIWTENWTGWFKSWGGKDPKRTA 262
D S +F+ N N P + E WTGWF WGGK
Sbjct: 231 KDGLSSGSLENVLATVNFQKIEPVLFSYLNSIQSNKPVMVMEFWTGWFDYWGGKHHIFDV 290
Query: 263 EDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYDYDAPIDEYGHL 316
+++ V+ G + N YM+HGGTNFG +G + TSYDYDAP+ E G
Sbjct: 291 DEMISTVSEVLNRGASI-NLYMFHGGTNFGFMNGALHFHEYRPDITSYDYDAPLTEAGDY 349
Query: 317 NQPKWGHLREL 327
K+ LREL
Sbjct: 350 TS-KYFKLREL 359
>gi|16973314|emb|CAC84109.1| putative galactosidae, partial [Gossypium hirsutum]
Length = 383
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 130/419 (31%), Positives = 193/419 (46%), Gaps = 70/419 (16%)
Query: 308 APIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVT----------------------- 344
P+DE+G +PKWGHL+++H+ L ++ L +G T
Sbjct: 4 GPLDEFGLQREPKWGHLKDVHRALSLCKRALFWGFPTTLKLGPDQQAIVWQQPGTSACAA 63
Query: 345 -----NTDYGNSVS--GSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGND 397
NT V+ G LPA S+S+LPDCKT FNT V TQ N + N ++
Sbjct: 64 LLANNNTRLAQHVNFRGQDIRLPARSISVLPDCKTVVFNTQLVTTQHNSR----NFVRSE 119
Query: 398 QAPLQWKWRPEMINDFVVRGKGHFALNTLIDQ-KSTNDVSDYLWYMTNADLKDDDPILSG 456
A + W EM + G G F + + T D +DY WY T+ L D +
Sbjct: 120 IANKNFNW--EMYREVPPVGLG-FKFDVPRELFHLTKDTTDYAWYTTSLLLGRRDLPMKK 176
Query: 457 SSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGL 516
+ LR+ S G +HAYVNG Y S + + L G+N I+LL VGL
Sbjct: 177 NVRPVLRVASLGHGIHAYVNGEYAGSAHGSKVEKSFVCRELSSLKEGENHIALLGYLVGL 236
Query: 517 QNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANS 576
+ G+ + G P + ++G T D+S + W ++VG G + KK + + + S
Sbjct: 237 PDSGAYMEKRFAG-PRSITILGL---NTGTLDISQNGWGHQVGTDG-EKKKLFTEEGSKS 291
Query: 577 ERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAE 636
+ W+ + +TWYK F+AP ++PV + + GMGKG WVNG ++GRYW YL+
Sbjct: 292 VQ-WTKPDQ--GGPLTWYKGYFDAPEGDNPVAIVMTGMGKGMVWVNGRSIGRYWNNYLSP 348
Query: 637 EDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQI 695
P+Q YH+PR+++K N +VL EE GGNP +
Sbjct: 349 -----------------------LKKPTQSEYHIPRAYLKPK-NLIVLLEEEGGNPKDV 383
>gi|390336578|ref|XP_792349.2| PREDICTED: beta-galactosidase-like [Strongylocentrotus purpuratus]
Length = 671
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 116/364 (31%), Positives = 171/364 (46%), Gaps = 50/364 (13%)
Query: 1 MATLKHCSRAILLCL----ILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRST 56
M L+ +L+C+ + Q L + S + + +D DG+ +SGS HY R
Sbjct: 1 MGRLRSYVSKLLICMAVLAVKQALPDRS--FTIDYDSNTFLKDGQPFRYVSGSFHYSRVP 58
Query: 57 PGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILR 116
W D + K K GL+A++TYV WN HE +++F G+ D++ F+K D GL VILR
Sbjct: 59 AFYWQDRLDKMKMAGLNAVQTYVIWNFHELKPGEFNFDGDHDILSFLKKANDTGLAVILR 118
Query: 117 IGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGG 176
GPY+C EW+ GG P WL N+PGI LR++N ++M + + + K GG
Sbjct: 119 PGPYICGEWDLGGLPAWLLNIPGI-VLRSSNDLYMAHVTEWMNFF--LPKLRPYLYVNGG 175
Query: 177 PIILAQIENEYG-----------------------NVMSDYGDAGKSYINWCA---KMAT 210
PII+ Q+ENEYG +V+ D ++ C M
Sbjct: 176 PIIMVQVENEYGSYQTCDHQYQRQLYHLFRANLGPDVVLFTTDGPGDHLLQCGTLQDMYA 235
Query: 211 SLDIGVPWIMCQESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAF--A 268
++D G S P P + +E +TGW W + P +T + A +
Sbjct: 236 TIDFGA----GSNSTGMFQEMRKFEPKGPLVNSEYYTGWLDHW--EHPHQTVKTAAVCTS 289
Query: 269 VARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT-----TSYDYDAPIDEYGHLNQPKWGH 323
+ + G N YM+ GGTNFG +G Y T TSYDYDAP+ E G PK+
Sbjct: 290 LDQMLALGANV-NMYMFEGGTNFGFWNGANYPTFNPQPTSYDYDAPLTEAGD-PTPKYMA 347
Query: 324 LREL 327
+R +
Sbjct: 348 IRNV 351
>gi|223982755|ref|ZP_03632983.1| hypothetical protein HOLDEFILI_00257 [Holdemania filiformis DSM
12042]
gi|223965255|gb|EEF69539.1| hypothetical protein HOLDEFILI_00257 [Holdemania filiformis DSM
12042]
Length = 592
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 104/325 (32%), Positives = 160/325 (49%), Gaps = 46/325 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ L+SG++HY R P W D ++K K G + +ETY+ WN HEP + Q+DF+G
Sbjct: 12 LDGQPVKLISGALHYFRIVPEYWQDRLEKLKNMGCNCVETYIPWNYHEPKKGQFDFSGRK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+ RF++ Q GL+VILR PY+CAEW +GG P WL + +R+T + +++ + +
Sbjct: 72 DVARFVRKAQALGLWVILRPTPYICAEWEFGGLPAWLLADDSM-RVRSTYQPYLDAVDAY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+ + + LF + GGP+++ QIENEYG+ +D K Y+ ++ VP
Sbjct: 131 YAELFKVIR--PLFFTHGGPVLMCQIENEYGSFGND-----KQYLKAIKRLMEKHGCDVP 183
Query: 218 WI--------------MCQESDAPSPMF---------------TPNNPNSPKIWTENWTG 248
+ E P+ F N+ + P + E W G
Sbjct: 184 MFTSDGGWREVLDAGTLLNEGVLPTANFGSRTDEQIGALRQFMNDNDIHGPLMCMEFWIG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY------LTT 302
WF +WG R A++ A + + G N YM+HGGTN +G Y T
Sbjct: 244 WFNNWGSPLKTRDAKEAADELDAMLRQGSV--NIYMFHGGTNPEFYNGCSYHNGMDPQIT 301
Query: 303 SYDYDAPIDEYGHLNQPKWGHLREL 327
SYDY AP+ E+G K+ RE+
Sbjct: 302 SYDYAAPLTEWG-TEAEKYAAFREV 325
>gi|157106609|ref|XP_001649402.1| beta-galactosidase [Aedes aegypti]
gi|108879821|gb|EAT44046.1| AAEL004575-PA [Aedes aegypti]
Length = 648
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 112/359 (31%), Positives = 179/359 (49%), Gaps = 38/359 (10%)
Query: 3 TLKHCSRAILLCLIL---QTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGM 59
+L + A++LC + + L N + + ++ +DG ++GS HY R+ P
Sbjct: 7 SLLFTAIAVVLCYHVNGQRLLDNRQRTFTIDYENNTFLLDGAPFQYIAGSFHYFRALPQA 66
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W ++K + GL+A+ TYV W+ H P + Y++ G D+ RF++ Q++ L VILR GP
Sbjct: 67 WGPILKSMRAAGLNAVTTYVEWSLHNPKKGVYNWDGMADIERFVQLAQNEDLLVILRPGP 126
Query: 120 YVCAEWNYGGFPVWLHN-MPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPI 178
Y+CAE + GGFP WL N PGI +LRT + ++ E++ + + ++ E F GGPI
Sbjct: 127 YICAERDMGGFPYWLLNKYPGI-QLRTADVAYLREVRTWYAEL--FSRLEPYFYGNGGPI 183
Query: 179 ILAQIENEYGNVMS-DYG------DAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMF 231
I+ Q+ENEYG+ + DY D + Y+ A + T+ G+ + + F
Sbjct: 184 IMVQVENEYGSFFACDYKYMKWLRDETERYVRGKAVLFTNNGPGLTQCGGIDGVLSTLDF 243
Query: 232 TPN---------------NPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFG 276
P P P + E + GW W + R+ + R+
Sbjct: 244 GPGTALEIDGYWKDLRKLQPKGPLVNAEYYPGWLTHWQEQQMARSPIEPVVTSLRYMLSS 303
Query: 277 GTFQNYYMYHGGTNFGRTSG------GPYL--TTSYDYDAPIDEYGHLNQPKWGHLREL 327
N YM++GGTNFG T+G G ++ TSYDYDAP+DE G PK+ +R++
Sbjct: 304 KVNVNIYMFYGGTNFGFTAGANEQGPGRFIPDITSYDYDAPLDESGD-PTPKYEAIRKV 361
Score = 43.1 bits (100), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 34/122 (27%), Positives = 52/122 (42%), Gaps = 30/122 (24%)
Query: 566 KKFYNAKAANSERGWSSKNVPLNR-RMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGY 624
+K + + +S+R S+ P+ M +Y A ++ LN G GKG ++NG+
Sbjct: 536 QKHLSDRKKSSKRPKISQGTPMRHGPMVFYGNFDIARIDILDTYLNPSGWGKGVVFINGF 595
Query: 625 NLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVL 684
NLGRYWP GP QI +VPR +K +N ++L
Sbjct: 596 NLGRYWPRV---------------GP--------------QITLYVPRHILKSTMNEIIL 626
Query: 685 FE 686
E
Sbjct: 627 IE 628
>gi|319893645|ref|YP_004150520.1| beta-galactosidase 3 [Staphylococcus pseudintermedius HKU10-03]
gi|386318129|ref|YP_006014292.1| glycosyl hydrolase [Staphylococcus pseudintermedius ED99]
gi|317163341|gb|ADV06884.1| Beta-galactosidase 3 [Staphylococcus pseudintermedius HKU10-03]
gi|323463300|gb|ADX75453.1| glycosyl hydrolase, family 35 [Staphylococcus pseudintermedius
ED99]
Length = 590
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 109/321 (33%), Positives = 158/321 (49%), Gaps = 47/321 (14%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+LSG+IHY R W D + K G + +ETYV WN HE + +YDF G+ DL FI+
Sbjct: 19 ILSGAIHYFRIPKDDWEDSLYNLKALGFNTVETYVPWNFHETIENEYDFKGHKDLKHFIE 78
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
GLYVI+R PY+CAEW +GGFP WL N + +R+ ++ ++ +++ + + +
Sbjct: 79 LAAKLGLYVIVRPSPYICAEWEFGGFPAWLLNDRTM-RIRSRDEKYLEKVKKYYHELFKI 137
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP------- 217
++ QGGPII+ Q+ENEYG+ D+ Y+ A M + VP
Sbjct: 138 LTPLQI--DQGGPIIMMQVENEYGSFGQDH-----DYLRSLAHMMREEGVTVPFFTSDGA 190
Query: 218 WIMC-------QESDAPSPMFTPNNPNS---------------PKIWTENWTGWFKSWGG 255
W C ++ P+ F + P + E W GWF WG
Sbjct: 191 WDQCLRAGSLIEDDILPTGNFGSRTVQNFENLKTFQQEFSKKWPLMCMEFWDGWFNRWGE 250
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLTTSYDYDA 308
KR ++DLA V + G N YM+HGGTNFG R + TSYDY A
Sbjct: 251 PVIKRDSDDLAEEVRDAVKLGSL--NLYMFHGGTNFGFWNGCSARGTKDLPQVTSYDYHA 308
Query: 309 PIDEYGHLNQPKWGHLRELHK 329
P+DE G+ + K+ L+E+ K
Sbjct: 309 PLDEAGNPTE-KYFALQEMLK 328
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 64/246 (26%), Positives = 104/246 (42%), Gaps = 58/246 (23%)
Query: 453 ILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGAS-NDLFERPVKLTRGKNQISLLS 511
I + LRI + +H +V+ +V +T Y D FE V LT + QI +L
Sbjct: 393 IHKATEQEKLRIVDARDRVHCFVDQQHV---YTAYQEEIGDQFE--VTLTSDQPQIDVLI 447
Query: 512 ATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNA 571
+G NYG K + P G +G+ +++DL + V + D F
Sbjct: 448 ENMGRVNYGYKL-LAPTQRKG----LGQG----LMQDL------HFVQGWEQFDIDFDRL 492
Query: 572 KAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWP 631
A + +R WS + + +YK TF+ E++ +++ G GKG VNG+N+GRYW
Sbjct: 493 TANHFKREWSEQ------QPAFYKYTFDLA-ESNNTHIDVSGFGKGVVLVNGFNIGRYWE 545
Query: 632 TYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGN 691
PSQ Y +P++++K G N +++F+ G
Sbjct: 546 I-----------------------------GPSQSLY-IPKAFLKQGQNEIIVFDSEGKY 575
Query: 692 PSQINF 697
P I
Sbjct: 576 PESIQL 581
>gi|386585602|ref|YP_006082004.1| beta-galactosidase [Streptococcus suis D12]
gi|353737748|gb|AER18756.1| Beta-galactosidase [Streptococcus suis D12]
Length = 590
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 113/346 (32%), Positives = 164/346 (47%), Gaps = 51/346 (14%)
Query: 33 GRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYD 92
G +DGE +LSG+IHY R P W + K G + +ETYV WN HEP + ++
Sbjct: 7 GDQFYLDGEPFKILSGAIHYFRVHPDDWYHSLYNLKALGFNTVETYVPWNMHEPRKGEFC 66
Query: 93 FTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMN 152
+ G LD+ RF+K Q+ GLY I+R PY+CAEW +GG P WL M +R+++ V++
Sbjct: 67 YEGILDIERFLKLAQELGLYAIVRPSPYICAEWEWGGLPAWL--MKEELRVRSSDSVYLQ 124
Query: 153 EMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSL 212
+ + ++ K KL +QGG +++ Q+ENEYG+ YG+ K Y+ A +
Sbjct: 125 HLDEYYASLI--PKLAKLQLAQGGNVLMFQVENEYGS----YGEE-KEYLRSVAGLMRKH 177
Query: 213 DIGVPWIMCQ-------------ESDA----------------PSPMFTPNNPNSPKIWT 243
+ P E D + F + N P +
Sbjct: 178 GLTAPLFTSDGSWRATLRAGTLIEDDVFVTGNFGSKARENFANMTAFFNEHQKNWPLMCM 237
Query: 244 ENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL--- 300
E W GWF WG + +R E++ +V + G N YM+HGGTNFG +G
Sbjct: 238 EFWDGWFNRWGDEIIRREPEEMVDSVMECIELGSL--NLYMFHGGTNFGFMNGCSARGQI 295
Query: 301 ----TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGN 342
TSYDYDA +DE G N K +L L + LK + L Y
Sbjct: 296 DLPQVTSYDYDAILDEAG--NPTKKFYL--LQQRLKEVYPELEYAE 337
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 56/239 (23%), Positives = 96/239 (40%), Gaps = 62/239 (25%)
Query: 462 LRINSSGQVLHAYVNGNYVDSQW-TKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYG 520
R+ + + Y +G +V +Q+ T+ G +L + KL ++ +L +G NYG
Sbjct: 402 FRVVDARDRIQIYADGKFVATQYQTEIGDDVELDFKDDKL-----KLDILVENMGRVNYG 456
Query: 521 SKFDMVPNGIPGPVLLVGRAGDETIIKDLS--SHKWTYKVGLYGLDDKKFYNAKAANSER 578
K P G +GR + DL H TY + L ++D F +
Sbjct: 457 HKL-TAPTQSKG----LGRGA----MADLHFIGHWETYPLHLESVEDLDF--------SK 499
Query: 579 GWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEED 638
GW +Y+ FE E L++ G GKG +VN N+GR+W
Sbjct: 500 GWEEGQA------AFYRYQFELD-ELADTYLDMTGFGKGVVFVNNVNIGRFWE------- 545
Query: 639 GCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
+GP ++ ++P+ ++K G N +++FE G +I+F
Sbjct: 546 ---------KGPI--------------LYLYIPKGYLKKGANEIIVFETEGKYREKIHF 581
>gi|384512509|ref|YP_005707602.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|430358961|ref|ZP_19425649.1| beta-galactosidase [Enterococcus faecalis OG1X]
gi|327534398|gb|AEA93232.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|429513519|gb|ELA03099.1| beta-galactosidase [Enterococcus faecalis OG1X]
Length = 592
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 107/313 (34%), Positives = 152/313 (48%), Gaps = 46/313 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 12 LNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGMK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
++ F++ + L VILR Y+CAEW +GG P WL G+ LR+T+ +FM +++N+
Sbjct: 72 NIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKKKGV-RLRSTDPIFMTKVRNY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG K+Y+ ++ L I VP
Sbjct: 131 FQVL--LPKLAPLQITQGGPVIMMQVENEYGS----YG-MEKAYLRQTKQIMEELGIEVP 183
Query: 218 WIM----------------------------CQESDAPSPMF-TPNNPNSPKIWTENWTG 248
+E+ A F T + P + E W G
Sbjct: 184 LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF WG +R DLA V G N YM+HGGTNFG R +
Sbjct: 244 WFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLPQV 301
Query: 302 TSYDYDAPIDEYG 314
TSYDYDA + E G
Sbjct: 302 TSYDYDALLTEAG 314
>gi|256072678|ref|XP_002572661.1| beta-galactosidase [Schistosoma mansoni]
gi|360044217|emb|CCD81764.1| putative beta-galactosidase [Schistosoma mansoni]
Length = 420
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/338 (34%), Positives = 163/338 (48%), Gaps = 43/338 (12%)
Query: 46 LSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKT 105
+SGSIHY R W D + K K GLDAI+ Y+ WN H+P + YDF G+ +L +F++
Sbjct: 10 VSGSIHYFRIPEEYWHDRLSKMKAAGLDAIQIYIPWNFHQPEKGVYDFDGDRNLEKFLEL 69
Query: 106 IQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQN-FTTLIVDM 164
L VI R+GPY+CAEW++GG PVWL + + +LR+++ +M + F L+ M
Sbjct: 70 ATSLDLLVIARVGPYICAEWDFGGLPVWLLRINPLMKLRSSDPEYMKFVTTWFNVLLPSM 129
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIM---- 220
++ GGPII+ Q+ENEYG+ Y ++Y+ +A L +G I+
Sbjct: 130 ---KRFLYENGGPIIMVQLENEYGS----YSTCDETYLKELYNLA-RLHLGENVIIFTSD 181
Query: 221 --------CQESD-------------APSP----MFTPNNPNSPKIWTENWTGWFKSWGG 255
C SD AP P + N P + +E + GW WGG
Sbjct: 182 GPSNGLLKCGSSDKRYLATVNFGPTTAPVPKVFKVLEDFRQNQPWVNSEYYVGWLDVWGG 241
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQ-NYYMYHGGTNFGRTSGGPY---LTTSYDYDAPID 311
K E + R + N YM+ GGTNFG +GG TSYDYDAPI
Sbjct: 242 DHHKTNPEWAVDGLNRLISYSMRVNVNMYMFQGGTNFGFWNGGARPESSITSYDYDAPIS 301
Query: 312 EYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG 349
E G + + K+ +R+L K E N T YG
Sbjct: 302 EAGDITR-KYMIIRDLLFRRKGTEPPKLPRNTTKISYG 338
>gi|297842039|ref|XP_002888901.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297334742|gb|EFH65160.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 686
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 118/351 (33%), Positives = 170/351 (48%), Gaps = 49/351 (13%)
Query: 39 DGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLD 98
DG ++ G +HY R P W D + +AK GL+ I+ YV WN HEP + F G D
Sbjct: 72 DGNHFQIIGGDLHYFRVLPEYWEDRLLRAKALGLNTIQVYVPWNLHEPKPGKMVFEGIGD 131
Query: 99 LIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFT 158
L+ F+K V+LR GPY+C EW+ GGFP WL ++ +LRT++ ++ ++ +
Sbjct: 132 LVSFLKLCDKLDFMVMLRAGPYICGEWDLGGFPAWLLSVKPRLQLRTSDPAYLKLVERWW 191
Query: 159 TLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMA--------- 209
++ + K L S GGP+I+ QIENEYG+ +D K+Y+ MA
Sbjct: 192 GVL--LPKIFPLIYSNGGPVIMVQIENEYGSYGND-----KAYLRKLVSMARGHLGDDII 244
Query: 210 ---------TSLDIG-VPW------IMCQESDAPSPMFTP----NNP-NSPKIWTENWTG 248
+L+ G VP + D P P+F N P +SP + +E +TG
Sbjct: 245 VYTTDGGTKETLEKGTVPVDDVYSAVDFTTGDDPWPIFELQKKFNAPGSSPPLSSEFYTG 304
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG----------P 298
W WG K K AE A ++ + G+ YM HGGTNFG +G P
Sbjct: 305 WLTHWGEKIAKTDAEFTATSLEKILSRNGS-AVLYMVHGGTNFGFYNGANTGSEESDYKP 363
Query: 299 YLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG 349
LT SYDYDAPI E G ++ PK+ L+ + K ++ N YG
Sbjct: 364 DLT-SYDYDAPIKESGDIDNPKFRALQRVIKKYNVASHSIIPSNKQRKAYG 413
>gi|297198988|ref|ZP_06916385.1| beta-galactosidase [Streptomyces sviceus ATCC 29083]
gi|297147253|gb|EDY55124.2| beta-galactosidase [Streptomyces sviceus ATCC 29083]
Length = 601
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 121/364 (33%), Positives = 166/364 (45%), Gaps = 64/364 (17%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG LLSG++HY R W + GL+ +ETYV WN HEP D
Sbjct: 19 LDGRPVRLLSGALHYFRVHEAQWGHRLAMLGAMGLNCVETYVPWNLHEP--HPGDVRDVE 76
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQN- 156
L RF+ ++ GL+ I+R GPY+CAEW GG P WL RT+++V++ +++
Sbjct: 77 ALGRFLDAAREAGLWAIVRPGPYICAEWENGGLPHWLKG-----HARTSDEVYLGQVERW 131
Query: 157 FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGV 216
F L+ + +++ +GGP+I+ Q ENEYG+ SD +Y+ ++ + I V
Sbjct: 132 FGRLLPQVVERQ---IDRGGPVIMVQAENEYGSYGSD-----AAYLLRLTELLRAQGITV 183
Query: 217 PWI--------MCQESDAPSPMFTPN---------------NPNSPKIWTENWTGWFKSW 253
P M P + T N P+ P + E W GWF+ W
Sbjct: 184 PLFTSDGPEDHMLTGGSVPGVLATVNFGSGARTAFEALRRYRPDGPLMCMEFWCGWFEHW 243
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNF----GRTSGGPYL--------T 301
GG+ R AED A A+ + G + N YM HGGTNF G GG L
Sbjct: 244 GGEPVVRDAEDAAEALREILECGASV-NLYMAHGGTNFAGWAGANRGGGALHDGPLEPDV 302
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSG--SSYNL 359
TSYDYDAPIDEYG R K + E YG V V G S +L
Sbjct: 303 TSYDYDAPIDEYG----------RPTEKFWRFREVLSAYGPVAELPPAPEVLGAVSDVDL 352
Query: 360 PAWS 363
AW+
Sbjct: 353 TAWA 356
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 40/153 (26%), Positives = 60/153 (39%), Gaps = 20/153 (13%)
Query: 479 YVDSQWTKYGASNDL-FERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLV 537
YVD + D+ + PV G ++ L ++G NYG + GI G +L
Sbjct: 415 YVDGERAGVLTEADVQLKEPVA---GYARVELWVESLGRVNYGPRSGEA-KGITGGLL-- 468
Query: 538 GRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTT 597
++ + V GL + + +S G +P + Y+
Sbjct: 469 ------------HERQFLHGVRARGLRLDALDSLDSLDSRTGIGFGELPGDGSAGLYRGE 516
Query: 598 FEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
D VL L G +GF WVNG+NLGRYW
Sbjct: 517 VSVRGAGD-AVLELPGWTRGFVWVNGFNLGRYW 548
>gi|257418414|ref|ZP_05595408.1| beta-galactosidase [Enterococcus faecalis T11]
gi|257160242|gb|EEU90202.1| beta-galactosidase [Enterococcus faecalis T11]
Length = 592
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 113/356 (31%), Positives = 169/356 (47%), Gaps = 50/356 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 12 LNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGMK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
++ F++ + L VILR Y+CAEW +GG P WL G+ LR+T+ +FM +++N+
Sbjct: 72 NIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGV-RLRSTDPIFMTKVRNY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG K+Y+ ++ L I VP
Sbjct: 131 FQVL--LPKLAPLQITQGGPVIMMQVENEYGS----YG-MEKAYLRQTKQIMEELGIEVP 183
Query: 218 WIM----------------------------CQESDAPSPMF-TPNNPNSPKIWTENWTG 248
+E+ A F T + P + E W G
Sbjct: 184 LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF WG +R DLA V G N YM+HGGTNFG R +
Sbjct: 244 WFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLPQV 301
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLL----KSMEKTLTYGNVTNTDYGNSVS 353
TSYDYDA + E G + + + + ++ ++ +T GN+ + SVS
Sbjct: 302 TSYDYDALLTEAGEPTEKYYAVQKAIKEVCPEVWQAQPRTKKLGNLGSFSVTASVS 357
>gi|291535092|emb|CBL08204.1| Beta-galactosidase [Roseburia intestinalis M50/1]
gi|291539606|emb|CBL12717.1| Beta-galactosidase [Roseburia intestinalis XB6B4]
Length = 581
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 108/343 (31%), Positives = 165/343 (48%), Gaps = 41/343 (11%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ ++SG+IHY R P W D ++K K G + +ETY+ WN HEP + ++ F G L
Sbjct: 12 LDGKPFQIISGAIHYFRIVPEYWQDRLEKLKAMGCNTVETYIPWNMHEPKKGEFHFEGML 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+ RF+KT Q+ GLYVILR PY+CAEW +GG P WL G+ +LR + F+ +Q++
Sbjct: 72 DIERFVKTAQELGLYVILRPSPYICAEWEFGGLPAWLLAEDGM-KLRVSYPPFLKHVQDY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+++ ++ + GGP+IL Q+ENEYG +D + Y+ + VP
Sbjct: 131 YDVLLKKIVPYQI--NYGGPVILMQVENEYGYYAND-----REYLLAMRDKMQKGGVVVP 183
Query: 218 WIMCQ------------ESDAPSPMFTPNNPNS-----------PKIWTENWTGWFKSWG 254
+ E P+ F P + TE W GWF WG
Sbjct: 184 LVTSDGPFEENLNGGHLEGALPTGNFGSKTEERFEVLKKYTDGGPLMCTEFWVGWFDHWG 243
Query: 255 -GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYDYD 307
G E+ + + + G N YM+ GGTNFG +G Y TSYDYD
Sbjct: 244 NGGHMTGNLEESVKDLDKMLELGHV--NIYMFEGGTNFGFMNGSNYYDELTPDVTSYDYD 301
Query: 308 APIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN 350
A + E G + + K+ R++ + + + + YG+
Sbjct: 302 ALLTEDGQITE-KYRRYRDVIAKYREIPEVTFTTEIKRKAYGS 343
>gi|62955063|ref|NP_001017547.1| beta-galactosidase precursor [Danio rerio]
gi|62089564|gb|AAH92166.1| Galactosidase, beta 1 [Danio rerio]
gi|182890870|gb|AAI65636.1| Glb1 protein [Danio rerio]
Length = 651
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 114/349 (32%), Positives = 168/349 (48%), Gaps = 31/349 (8%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ V + DGE +SGSIHY R W D + K GL+AI+TYV WN HE
Sbjct: 26 FSVDYHRNCFLKDGEPFRYISGSIHYSRIPRVYWKDRLLKMYMAGLNAIQTYVPWNFHEA 85
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+ QYDF+G+ DL +F++ QD GL VI+R GPY+CAEW+ GG P WL I LR++
Sbjct: 86 VPGQYDFSGDRDLEQFLQLCQDIGLLVIMRPGPYICAEWDMGGLPAWLLKKKDI-VLRSS 144
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYG-------NVMSDYGDAGK 199
+ ++ + + ++ + K + GGPII Q+ENEYG N M +
Sbjct: 145 DPDYLAAVDKWMGKLLPIIK--RYLYQNGGPIITVQVENEYGSYFACDFNYMRHLSQLFR 202
Query: 200 SYINWCAKMATSLDIGVPWIMCQESDA--PSPMFTPN-------------NPNSPKIWTE 244
Y+ A + T+ G+ ++ C + F P P P + +E
Sbjct: 203 FYLGEEAVLFTTDGAGLGYLKCGSLQGLYATVDFGPGANVTAAFEAQRHVEPRGPLVNSE 262
Query: 245 NWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PY--L 300
+ GW WG K + + + G N YM+ GGTNFG +G PY
Sbjct: 263 FYPGWLDHWGEKHSVVPTSAVVKTLNEILEIGANV-NLYMFIGGTNFGYWNGANTPYGPQ 321
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG 349
TSYDYD+P+ E G L + K+ +RE+ K+ K + + + + YG
Sbjct: 322 PTSYDYDSPLTEAGDLTE-KYFAIREVIKMYKDVPEGILPPSTPKFAYG 369
>gi|336424850|ref|ZP_08604882.1| hypothetical protein HMPREF0994_00888 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336013315|gb|EGN43197.1| hypothetical protein HMPREF0994_00888 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 596
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 113/340 (33%), Positives = 165/340 (48%), Gaps = 61/340 (17%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGN- 96
++GE ++SG IHY R P W D ++K KE G + +ETY+ WN HEP++ ++DF G
Sbjct: 16 LNGEPFQIISGGIHYFRILPEYWEDRLQKLKELGCNTVETYIPWNMHEPVKGKFDFYGEH 75
Query: 97 ----LDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIE-ELRTTNKVFM 151
LD++ F++T Q GL+VILR PY+CAEW++GG P WL M G E +LRT+++ ++
Sbjct: 76 VHGMLDVVSFVRTAQRLGLWVILRPSPYICAEWDFGGLPFWL--MAGEEMDLRTSDERYL 133
Query: 152 NEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATS 211
++++ ++ + ++ QGGP+++ Q+ENEYG+ +D K Y+ M
Sbjct: 134 RHVRDYYDRLMPLLAPLQI--DQGGPVLMLQVENEYGSFGND-----KKYLESLRDMMRE 186
Query: 212 LDIGVPWIMCQESDAPSPMFTPNNPNS--------------------------PKIWTEN 245
I VP SD P N P + TE
Sbjct: 187 RGITVPLFA---SDGPDHNMLANTKTEGIFPTANFGSGASKAFSILEEYTDGGPCMCTEF 243
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARF---FQFGGTFQNYYMYHGGTNFGRTSGGPY--- 299
W GWF +W D D AV + G N YM+ GGTNFG +G Y
Sbjct: 244 WIGWFDAW--HDEVHHEGDTETAVKELENILELGNV--NIYMFEGGTNFGFMNGSNYSDH 299
Query: 300 LT---TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
LT TSYDYDA + E G + R K++ K
Sbjct: 300 LTADVTSYDYDALLTEDGQITD----KYRRFQKVISQFSK 335
>gi|326922161|ref|XP_003207320.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Meleagris
gallopavo]
Length = 643
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 113/350 (32%), Positives = 169/350 (48%), Gaps = 31/350 (8%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + +D DG +SGSIHY R W D + K K GLDAI+TYV WN HE
Sbjct: 16 FGIDYDCNCFVKDGRPFRYISGSIHYSRVPRYYWKDRLLKMKMAGLDAIQTYVPWNYHET 75
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
YDF+G+ DL F++ + GL VILR GPY+CAEW+ GG P WL I LR++
Sbjct: 76 QMGVYDFSGDRDLEYFLQLASETGLLVILRAGPYICAEWDMGGLPAWLLEKESI-VLRSS 134
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS-DYG---------- 195
+ ++ ++ + ++ + K + GGPII+ Q+ENEYG+ + DY
Sbjct: 135 DSDYLTAVEKWMGVL--LPKMKPHLYQNGGPIIMVQVENEYGSYFACDYDYLRSLLKIFR 192
Query: 196 -DAGKSYINWCAKMATSLDI---GVPWIMCQESDAPSPMFTP-------NNPNSPKIWTE 244
G + + A+ + + + AP T + P P + +E
Sbjct: 193 QHLGDEVVLFTTDGASQFHLKCGALQGLYATVDFAPGGNVTAAFLAQRSSEPTGPLVNSE 252
Query: 245 NWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYLT- 301
+TGW WG + ++ +A + G N YM+ GGTNF +G PY++
Sbjct: 253 FYTGWLDHWGHRHAVVPSQTIAKTLNEILARGANV-NLYMFIGGTNFAYWNGANMPYMSQ 311
Query: 302 -TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN 350
TSYDYDAP+ E G L + K+ LRE+ + + + L + YGN
Sbjct: 312 PTSYDYDAPLSEAGDLTE-KYFALREVIGMYNQLPEGLIPPTTSKFAYGN 360
>gi|188990653|ref|YP_001902663.1| beta-galactosidase [Xanthomonas campestris pv. campestris str.
B100]
gi|167732413|emb|CAP50607.1| exported beta-galactosidase [Xanthomonas campestris pv. campestris]
Length = 680
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 119/361 (32%), Positives = 169/361 (46%), Gaps = 54/361 (14%)
Query: 9 RAILLCLILQTLFNLSLAYRVSHD---------GRAITIDGERKILLSGSIHYPRSTPGM 59
R L L+L L + + D G DG+ +LSG+IH+ R
Sbjct: 70 RTTLAPLVLALAIALPITATAASDDQWPTFATQGTQFVRDGKPYQVLSGAIHFQRIPRAY 129
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W D ++KA+ GL+ +ETYVFWN EP + Q+DF N D+ F++ QGL VILR GP
Sbjct: 130 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFNANNDVAAFVREAAAQGLNVILRPGP 189
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKE--KLFASQGGP 177
Y CAEW GG+P WL I +R+ + F+ Q + +D K+ L GGP
Sbjct: 190 YACAEWETGGYPAWLFGKDNIR-VRSRDPRFLAASQAY----LDAVSKQVHPLLNHNGGP 244
Query: 178 IILAQIENEYGNVMSDYG---DAGKSYIN--------WCAKMATSLDIG-VPWIMCQESD 225
II Q+ENEYG+ D+ D Y+ + + A L G +P + +
Sbjct: 245 IIAVQVENEYGSYDDDHAYMADNRAMYVKAGFDDALLFTSDGADMLANGTLPDTLAVVNF 304
Query: 226 APSPM------FTPNNPNSPKIWTENWTGWFKSWG----GKDPKRTAEDLAFAVARFFQF 275
AP P+ P++ E W GWF WG D K+ E+L + + +
Sbjct: 305 APGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGKPHASTDAKQQTEELEWILRQ---- 360
Query: 276 GGTFQNYYMYHGGTNFGRTSGGPY----------LTTSYDYDAPIDEYGHLNQPKWGHLR 325
G N YM+ GGT+FG +G + TTSYDYDA +DE G PK+ +R
Sbjct: 361 -GHSANLYMFIGGTSFGFMNGANFQGNPSDHYAPQTTSYDYDAILDEAGRAT-PKFALMR 418
Query: 326 E 326
+
Sbjct: 419 D 419
>gi|312901648|ref|ZP_07760918.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
gi|311291259|gb|EFQ69815.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
Length = 593
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 107/313 (34%), Positives = 152/313 (48%), Gaps = 46/313 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 13 LNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGMK 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
++ F++ + L VILR Y+CAEW +GG P WL G+ LR+T+ +FM +++N+
Sbjct: 73 NIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGV-RLRSTDPIFMTKVRNY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG K+Y+ ++ L I VP
Sbjct: 132 FQVL--LPKLAPLQITQGGPVIMMQVENEYGS----YG-MEKAYLRQTKQIMEELGIEVP 184
Query: 218 WIM----------------------------CQESDAPSPMF-TPNNPNSPKIWTENWTG 248
+E+ A F T + P + E W G
Sbjct: 185 LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYWDG 244
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF WG +R DLA V G N YM+HGGTNFG R +
Sbjct: 245 WFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLPQV 302
Query: 302 TSYDYDAPIDEYG 314
TSYDYDA + E G
Sbjct: 303 TSYDYDALLTEAG 315
>gi|256761574|ref|ZP_05502154.1| beta-galactosidase [Enterococcus faecalis T3]
gi|422736227|ref|ZP_16792491.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
gi|256682825|gb|EEU22520.1| beta-galactosidase [Enterococcus faecalis T3]
gi|315166978|gb|EFU10995.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
Length = 593
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 107/313 (34%), Positives = 152/313 (48%), Gaps = 46/313 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 13 LNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGMK 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
++ F++ + L VILR Y+CAEW +GG P WL G+ LR+T+ +FM +++N+
Sbjct: 73 NIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGV-RLRSTDPIFMTKVRNY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG K+Y+ ++ L I VP
Sbjct: 132 FQVL--LPKLAPLQITQGGPVIMMQVENEYGS----YG-MEKAYLRQTKQIMEELGIEVP 184
Query: 218 WIM----------------------------CQESDAPSPMF-TPNNPNSPKIWTENWTG 248
+E+ A F T + P + E W G
Sbjct: 185 LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYWDG 244
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF WG +R DLA V G N YM+HGGTNFG R +
Sbjct: 245 WFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLPQV 302
Query: 302 TSYDYDAPIDEYG 314
TSYDYDA + E G
Sbjct: 303 TSYDYDALLTEAG 315
>gi|257413247|ref|ZP_04742461.2| beta-galactosidase [Roseburia intestinalis L1-82]
gi|257204151|gb|EEV02436.1| beta-galactosidase [Roseburia intestinalis L1-82]
Length = 588
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 108/343 (31%), Positives = 165/343 (48%), Gaps = 41/343 (11%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ ++SG+IHY R P W D ++K K G + +ETY+ WN HEP + ++ F G L
Sbjct: 19 LDGKPFQIISGAIHYFRIVPEYWQDRLEKLKAMGCNTVETYIPWNMHEPKKGEFHFEGML 78
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+ RF+KT Q+ GLYVILR PY+CAEW +GG P WL G+ +LR + F+ +Q++
Sbjct: 79 DIERFVKTAQELGLYVILRPSPYICAEWEFGGLPAWLLAEDGM-KLRVSYPPFLKHVQDY 137
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+++ ++ + GGP+IL Q+ENEYG +D + Y+ + VP
Sbjct: 138 YDVLLKKIVPYQI--NYGGPVILMQVENEYGYYAND-----REYLLAMRDKMQKGGVVVP 190
Query: 218 WIMCQ------------ESDAPSPMFTPNNPNS-----------PKIWTENWTGWFKSWG 254
+ E P+ F P + TE W GWF WG
Sbjct: 191 LVTSDGPFEENLNGGHLEGALPTGNFGSKTEERFEVLKKYTDGGPLMCTEFWVGWFDHWG 250
Query: 255 -GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYDYD 307
G E+ + + + G N YM+ GGTNFG +G Y TSYDYD
Sbjct: 251 NGGHMTGNLEESVKDLDKMLELGHV--NIYMFEGGTNFGFMNGSNYYDELTPDVTSYDYD 308
Query: 308 APIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN 350
A + E G + + K+ R++ + + + + YG+
Sbjct: 309 ALLTEDGQITE-KYRRYRDVIAKYREIPEVTFTTEIKRKAYGS 350
>gi|29375402|ref|NP_814556.1| glycosyl hydrolase [Enterococcus faecalis V583]
gi|29342862|gb|AAO80626.1| glycosyl hydrolase, family 35 [Enterococcus faecalis V583]
Length = 592
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 107/313 (34%), Positives = 152/313 (48%), Gaps = 46/313 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 12 LNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGMK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
++ F++ + L VILR Y+CAEW +GG P WL G+ LR+T+ +FM +++N+
Sbjct: 72 NIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGV-RLRSTDPIFMTKVRNY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG K+Y+ ++ L I VP
Sbjct: 131 FQVL--LPKLAPLQITQGGPVIMMQVENEYGS----YG-MEKAYLRQTKQIMEELGIEVP 183
Query: 218 WIM----------------------------CQESDAPSPMF-TPNNPNSPKIWTENWTG 248
+E+ A F T + P + E W G
Sbjct: 184 LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF WG +R DLA V G N YM+HGGTNFG R +
Sbjct: 244 WFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLPQV 301
Query: 302 TSYDYDAPIDEYG 314
TSYDYDA + E G
Sbjct: 302 TSYDYDALLTEAG 314
>gi|332187631|ref|ZP_08389367.1| glycosyl hydrolases 35 family protein [Sphingomonas sp. S17]
gi|332012379|gb|EGI54448.1| glycosyl hydrolases 35 family protein [Sphingomonas sp. S17]
Length = 613
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 116/323 (35%), Positives = 157/323 (48%), Gaps = 44/323 (13%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A+ + G DG+ ++S +HY R W D ++KAK GL+ I TY FWNAHE
Sbjct: 27 AHSFTVQGNGFLKDGKPYQVISAEMHYTRIPRAYWRDRLRKAKAMGLNTITTYSFWNAHE 86
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P YDFTG D+ FI+ Q +GL VILR GPYVCAEW GG+P WL + LR+
Sbjct: 87 PRPGTYDFTGQNDIAAFIRDAQAEGLDVILRPGPYVCAEWELGGYPSWLLKDRNL-LLRS 145
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSD----------YG 195
T+ + + + + K L GGPI+ Q+ENEYG SD Y
Sbjct: 146 TDPKYTAAVDRWLARLGQEVK--PLLLRNGGPIVAIQLENEYGAFGSDKAYLEGLKASYQ 203
Query: 196 DAG-KSYINWCAKMATSLDIG----VPWIM------CQESDAPSPMFTPNNPNSPKIWTE 244
AG + + + A L G VP ++ Q + A F P+ ++ E
Sbjct: 204 RAGLADGVLFTSNQAGDLAKGSLPEVPSVVNFGSGGAQNAVAKLEAF---RPDGLRMVGE 260
Query: 245 NWTGWFKSWGGK----DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL 300
W GWF WG D K+ AE+L F + R + + YM+HGGT FG +G
Sbjct: 261 YWAGWFDKWGEDHHETDGKKEAEELGFMLKRGYSV-----SLYMFHGGTTFGWMNGADSH 315
Query: 301 --------TTSYDYDAPIDEYGH 315
TTSYDY+AP+DE G+
Sbjct: 316 TGTDYHPDTTSYDYNAPLDEAGN 338
>gi|295689222|ref|YP_003592915.1| beta-galactosidase [Caulobacter segnis ATCC 21756]
gi|295431125|gb|ADG10297.1| Beta-galactosidase [Caulobacter segnis ATCC 21756]
Length = 617
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 114/322 (35%), Positives = 158/322 (49%), Gaps = 44/322 (13%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
A+R G DG ++S +HY R W D ++KAK GL+ I TY FWN HE
Sbjct: 31 AHRFEVSGAGFLKDGAPHQVISAEMHYVRIPRAYWRDRLQKAKTMGLNTITTYAFWNVHE 90
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P YDFTG DL FI+ Q +GL VILR GPYVC+EW GG+P WL + LR+
Sbjct: 91 PRPGVYDFTGQNDLAAFIRAAQAEGLDVILRPGPYVCSEWELGGYPSWLLKDRNV-LLRS 149
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSD----------YG 195
T + ++ + + K L GGPI+ Q+ENEYG D Y
Sbjct: 150 TEPQYAAAVERWMARLGREVK--PLLLKNGGPIVAIQLENEYGAFGDDKAYLEGLEATYR 207
Query: 196 DAGKSY-INWCAKMATSLDIG----VPWIM------CQESDAPSPMFTPNNPNSPKIWTE 244
AG + + + + A+ L G +P ++ ++S A F P+ ++ E
Sbjct: 208 RAGLADGVLFTSNQASDLAKGSLPHLPSMVNFGSGGAEKSVAQLETF---RPDGLRMVGE 264
Query: 245 NWTGWFKSWGGK----DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL 300
W GWF WG + D ++ AE+L F + R + + YM+HGGT+FG +G
Sbjct: 265 YWAGWFDKWGEEHHETDGRKEAEELRFMLQRGYSV-----SLYMFHGGTSFGWMNGADSH 319
Query: 301 --------TTSYDYDAPIDEYG 314
TTSYDYDAP+DE G
Sbjct: 320 TGKDYHPDTTSYDYDAPLDEAG 341
>gi|330832298|ref|YP_004401123.1| beta-galactosidase [Streptococcus suis ST3]
gi|329306521|gb|AEB80937.1| Beta-galactosidase [Streptococcus suis ST3]
Length = 590
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 105/319 (32%), Positives = 154/319 (48%), Gaps = 47/319 (14%)
Query: 33 GRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYD 92
G +DGE +LSG+IHY R P W + K G + +ETYV WN HEP + ++
Sbjct: 7 GDQFYLDGEPFKILSGAIHYFRVHPDDWYHSLYNLKALGFNTVETYVPWNMHEPRKGEFC 66
Query: 93 FTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMN 152
+ G LD+ RF+K Q+ GLY I+R PY+CAEW +GG P WL M +R+++ V++
Sbjct: 67 YEGILDIERFLKLAQELGLYAIVRPSPYICAEWEWGGLPAWL--MKEELRVRSSDSVYLQ 124
Query: 153 EMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSL 212
+ + ++ K KL +QGG +++ Q+ENEYG+ YG+ K Y+ A +
Sbjct: 125 HLDEYYASLI--PKLAKLQLAQGGNVLMFQVENEYGS----YGEE-KEYLRSVAGLMRKH 177
Query: 213 DIGVPWIMCQ-------------ESDA----------------PSPMFTPNNPNSPKIWT 243
+ P E D + F + N P +
Sbjct: 178 GLTAPLFTSDGSWRATLRAGTLIEDDVFVTGNFGSKARENFANMTAFFNEHQKNWPLMCM 237
Query: 244 ENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL--- 300
E W GWF WG + +R E++ +V + G N YM+HGGTNFG +G
Sbjct: 238 EFWDGWFNRWGDEIIRREPEEMVDSVMECIELGSL--NLYMFHGGTNFGFMNGCSARGQI 295
Query: 301 ----TTSYDYDAPIDEYGH 315
TSYDYDA +DE G+
Sbjct: 296 DLPQVTSYDYDAILDEAGN 314
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 57/239 (23%), Positives = 99/239 (41%), Gaps = 62/239 (25%)
Query: 462 LRINSSGQVLHAYVNGNYVDSQW-TKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYG 520
R+ + + Y +G +V +Q+ T+ G +L + KLT + +L +G NYG
Sbjct: 402 FRVVDARDRIQIYADGKFVATQYQTEIGDDVELEFKDDKLT-----LDILVENMGRVNYG 456
Query: 521 SKFDMVPNGIPGPVLLVGRAGDETIIKDLS--SHKWTYKVGLYGLDDKKFYNAKAANSER 578
K P G +GR + DL H TY + L ++D F + + E
Sbjct: 457 HKL-TAPTQSKG----LGRGA----MADLHFIGHWETYPLHLESVEDLDF----SKDWEE 503
Query: 579 GWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEED 638
G ++ +Y+ FE E L++ G GKG +VN N+GR+W
Sbjct: 504 GQAA----------FYRYQFELD-ELADTYLDMTGFGKGVVFVNNVNIGRFWE------- 545
Query: 639 GCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
+GP ++ ++P+ ++K G N +++FE G +I+F
Sbjct: 546 ---------KGPI--------------LYLYIPKGYLKKGANEIIVFETEGKYREKIHF 581
>gi|315647882|ref|ZP_07900983.1| Beta-galactosidase [Paenibacillus vortex V453]
gi|315276528|gb|EFU39871.1| Beta-galactosidase [Paenibacillus vortex V453]
Length = 587
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 100/299 (33%), Positives = 146/299 (48%), Gaps = 38/299 (12%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+LSG++HY R P W D + K K G + +ETY+ WN HEP Q+ F G DL F++
Sbjct: 21 ILSGAVHYFRIVPEYWEDRLMKLKACGFNTVETYIPWNLHEPKEGQFTFDGIADLEGFVQ 80
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
GL+VILR PY+CAEW +GG P WL P I LR + V++ ++ ++ ++
Sbjct: 81 KAGHLGLHVILRPSPYICAEWEFGGLPAWLLQYPDI-HLRCMDPVYLEKVDHYYDELI-- 137
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWI----- 219
+ L S+GGP+I QIENEYG+ +D +Y+ + ++ + V
Sbjct: 138 PRIVPLLTSKGGPVIAIQIENEYGSYGND-----TAYLEYLKDGLSARGVDVLLFTSDGP 192
Query: 220 ---MCQESDAPSPMFTPN---------------NPNSPKIWTENWTGWFKSWGGKDPKRT 261
M Q P+ + T N P + E W GWF W R+
Sbjct: 193 TDGMLQGGTVPNVLATVNFGSRPGEAFAKLREYRTEDPLMCMEYWNGWFDHWLKPHHTRS 252
Query: 262 AEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG------PYLTTSYDYDAPIDEYG 314
+E++A + + N+YM+HGGTNFG +G TSYDYDAP+ E G
Sbjct: 253 SEEVAQVFEEMLRLNASV-NFYMFHGGTNFGFYNGANDQEKYEPTVTSYDYDAPLSECG 310
Score = 39.7 bits (91), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 23/41 (56%), Gaps = 1/41 (2%)
Query: 590 RMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
R T+Y F D + L G GKG W+NG+NLGRYW
Sbjct: 504 RPTFYTGEFTVDEIGD-TFIRLDGWGKGVVWINGFNLGRYW 543
>gi|227554928|ref|ZP_03984975.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|422713751|ref|ZP_16770500.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|422716430|ref|ZP_16773136.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|227175936|gb|EEI56908.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|315575268|gb|EFU87459.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|315581351|gb|EFU93542.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
Length = 593
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 107/313 (34%), Positives = 152/313 (48%), Gaps = 46/313 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 13 LNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGMK 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
++ F++ + L VILR Y+CAEW +GG P WL G+ LR+T+ +FM +++N+
Sbjct: 73 NIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGV-RLRSTDPIFMTKVRNY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG K+Y+ ++ L I VP
Sbjct: 132 FQVL--LPKLAPLQITQGGPVIMMQVENEYGS----YG-MEKAYLRQTKQIMEELGIEVP 184
Query: 218 WIM----------------------------CQESDAPSPMF-TPNNPNSPKIWTENWTG 248
+E+ A F T + P + E W G
Sbjct: 185 LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYWDG 244
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF WG +R DLA V G N YM+HGGTNFG R +
Sbjct: 245 WFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLPQV 302
Query: 302 TSYDYDAPIDEYG 314
TSYDYDA + E G
Sbjct: 303 TSYDYDALLTEAG 315
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 53/241 (21%), Positives = 96/241 (39%), Gaps = 52/241 (21%)
Query: 462 LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGS 521
L++ + LH YV+G+ +Q+ + L + + + +L +G NYG
Sbjct: 403 LKVVEASDRLHIYVDGDLAATQYQETVGEELLISGQTE--KDTLALDILVENLGRVNYGF 460
Query: 522 KFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWS 581
K + P G G +++D+ H+ G ++ ++ ++
Sbjct: 461 KLN-------NPTQSKGIRGG--VMQDIHFHQ--------GYQHYPLTFSQEQLAKIDYT 503
Query: 582 SKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCS 641
+ PL + ++Y+ TFE D + + +G GKGF VNG++LGRYW
Sbjct: 504 AGKNPL--QPSFYQVTFELEQLADTYI-DCRGYGKGFVVVNGHHLGRYWEI--------- 551
Query: 642 TESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVV 701
GP S C P+ +++ G N +V+FE G + F V
Sbjct: 552 -------GPIHSLYC--------------PKEFLQQGQNEVVIFETEGIEIEYLKFTNQV 590
Query: 702 V 702
+
Sbjct: 591 I 591
>gi|223932593|ref|ZP_03624593.1| Beta-galactosidase [Streptococcus suis 89/1591]
gi|302023447|ref|ZP_07248658.1| beta-galactosidase precursor [Streptococcus suis 05HAS68]
gi|386583558|ref|YP_006079961.1| beta-galactosidase [Streptococcus suis D9]
gi|223898703|gb|EEF65064.1| Beta-galactosidase [Streptococcus suis 89/1591]
gi|353735704|gb|AER16713.1| Beta-galactosidase [Streptococcus suis D9]
Length = 590
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 105/319 (32%), Positives = 154/319 (48%), Gaps = 47/319 (14%)
Query: 33 GRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYD 92
G +DGE +LSG+IHY R P W + K G + +ETYV WN HEP + ++
Sbjct: 7 GDQFYLDGEPFKILSGAIHYFRVHPDDWYHSLYNLKALGFNTVETYVPWNMHEPRKGEFC 66
Query: 93 FTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMN 152
+ G LD+ RF+K Q+ GLY I+R PY+CAEW +GG P WL M +R+++ V++
Sbjct: 67 YEGILDIERFLKLAQELGLYAIVRPSPYICAEWEWGGLPAWL--MKEELRVRSSDSVYLQ 124
Query: 153 EMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSL 212
+ + ++ K KL +QGG +++ Q+ENEYG+ YG+ K Y+ A +
Sbjct: 125 HLDEYYASLI--PKLAKLQLAQGGNVLMFQVENEYGS----YGEE-KEYLRSVAGLMRKH 177
Query: 213 DIGVPWIMCQ-------------ESDA----------------PSPMFTPNNPNSPKIWT 243
+ P E D + F + N P +
Sbjct: 178 GLTAPLFTSDGSWRATLRAGTLIEDDVFVTGNFGSKARENFANMTAFFNEHQKNWPLMCM 237
Query: 244 ENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL--- 300
E W GWF WG + +R E++ +V + G N YM+HGGTNFG +G
Sbjct: 238 EFWDGWFNRWGDEIIRREPEEMVDSVMECIELGSL--NLYMFHGGTNFGFMNGCSARGQI 295
Query: 301 ----TTSYDYDAPIDEYGH 315
TSYDYDA +DE G+
Sbjct: 296 DLPQVTSYDYDAILDEAGN 314
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 58/239 (24%), Positives = 96/239 (40%), Gaps = 62/239 (25%)
Query: 462 LRINSSGQVLHAYVNGNYVDSQW-TKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYG 520
R+ + + Y +G +V +Q+ T+ G +L + KLT + +L +G NYG
Sbjct: 402 FRVVDARDRIQIYADGKFVATQYQTEIGDDVELDFKDDKLT-----LDILVENMGRVNYG 456
Query: 521 SKFDMVPNGIPGPVLLVGRAGDETIIKDLS--SHKWTYKVGLYGLDDKKFYNAKAANSER 578
K P G +GR + DL H TY + L ++D F +
Sbjct: 457 HKL-TAPTQSKG----LGRGA----MADLHFIGHWETYPLPLDSVEDLDF--------SK 499
Query: 579 GWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEED 638
GW +Y+ FE E L++ G GKG +VN N+GR+W
Sbjct: 500 GWEEGQA------AFYRYQFELD-ELADTYLDMTGFGKGVVFVNNVNIGRFWE------- 545
Query: 639 GCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
+GP ++ ++P+ ++K G N +V+FE G +I+F
Sbjct: 546 ---------KGPI--------------LYLYIPKGYLKKGANEIVVFETEGKYREKIHF 581
>gi|153807689|ref|ZP_01960357.1| hypothetical protein BACCAC_01971 [Bacteroides caccae ATCC 43185]
gi|149130051|gb|EDM21263.1| glycosyl hydrolase family 35 [Bacteroides caccae ATCC 43185]
Length = 775
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/350 (32%), Positives = 169/350 (48%), Gaps = 48/350 (13%)
Query: 11 ILLCLILQTLFNLSLAYR--VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAK 68
I+L LI+ + + R V + I+G+ L+ G +HYPR W D + +A+
Sbjct: 10 IMLNLIVSFFISACSSPREQVKIENGTFNINGKDVQLICGEMHYPRIPHEYWRDRLHRAR 69
Query: 69 EGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYG 128
GL+ + YVFWN HE +DF+G D+ F++ Q++GLYVILR GPYVCAEW++G
Sbjct: 70 AMGLNTVSAYVFWNFHERQPGVFDFSGQADIAEFVRIAQEEGLYVILRPGPYVCAEWDFG 129
Query: 129 GFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKK-EKLFASQGGPIILAQIENEY 187
G+P WL + R+ + FM+ + + I ++ K+ L + GG II+ Q+ENEY
Sbjct: 130 GYPSWLLKEKDL-TYRSKDPRFMSYCERY---IKELGKQLAPLTINNGGNIIMVQVENEY 185
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMC------QESDAPSPMFTPN------- 234
G+ +D K Y+ M VP C + + T N
Sbjct: 186 GSYAAD-----KEYLAAIRDMLQEAGFNVPLFTCDGGGQVEAGHIAGALPTLNGVFGEDI 240
Query: 235 -------NPNSPKIWTENWTGWFKSWGGKDP----KRTAEDLAFAVARFFQFGGTFQNYY 283
+P P E + WF WG + +R AE L + + G + Y
Sbjct: 241 FKIVDKYHPGGPYFVAEFYPAWFDEWGKRHSSVAYERPAEQLDWMLGH-----GVSVSMY 295
Query: 284 MYHGGTNF-----GRTSGGPY-LTTSYDYDAPIDEYGHLNQPKWGHLREL 327
M+HGGTNF TSGG TSYDYDAP+ E+G+ PK+ RE+
Sbjct: 296 MFHGGTNFWYMNGANTSGGFRPQPTSYDYDAPLGEWGNC-YPKYHAFREI 344
>gi|16611713|gb|AAL27306.1|AF376481_1 BgaC [Carnobacterium maltaromaticum]
Length = 586
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 112/319 (35%), Positives = 154/319 (48%), Gaps = 51/319 (15%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
++SG+IHY R P W +K K G + +ETYV WN HEP + QY F+ LDL RFI+
Sbjct: 19 IISGAIHYFRVVPEYWEHRLKLLKNMGCNTVETYVAWNQHEPKKGQYVFSDALDLRRFIQ 78
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
GL VILR PY+CAE+ +GG P WL + +R+T FM ++ + +
Sbjct: 79 LADSLGLKVILRPSPYICAEFEFGGLPAWLLKDRHMR-VRSTYPPFMERVRLYYREL--F 135
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIM---- 220
+ L + GGPIIL Q+ENEYG S+ K Y+ M + VP +
Sbjct: 136 KEVIDLQITSGGPIILMQVENEYGGYGSE-----KKYLQELVTMMKENGVTVPLVTSDGP 190
Query: 221 ---------CQESDAPS------------PMFTPNNPNSPKIWTENWTGWFKSWGGK--- 256
QES P+ + P + E W GWF +W K
Sbjct: 191 WGDMLENGSLQESALPTVNCGSAIPEHFDRLAAFKQKKGPLMVMEYWIGWFDAWQDKKHH 250
Query: 257 --DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYDYDA 308
D K + E L + R G+ N+YM+HGGTNFG +G Y TTSYDYDA
Sbjct: 251 TTDVKSSVESLEEILKR-----GSV-NFYMFHGGTNFGFMNGANYYGKLLPDTTSYDYDA 304
Query: 309 PIDEYGHLNQPKWGHLREL 327
P++EYG + K+ +E+
Sbjct: 305 PLNEYGEQTE-KYKAFKEV 322
>gi|397699203|ref|YP_006536991.1| beta-galactosidase [Enterococcus faecalis D32]
gi|397335842|gb|AFO43514.1| beta-galactosidase [Enterococcus faecalis D32]
Length = 593
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 108/313 (34%), Positives = 152/313 (48%), Gaps = 46/313 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 13 LNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGMK 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
++ F++ + L VILR Y+CAEW +GG P WL G+ LR+T+ +FM +++N+
Sbjct: 73 NIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGV-RLRSTDPIFMTKVRNY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG K+Y+ ++ L I VP
Sbjct: 132 FQVL--LPKLAPLQITQGGPVIMMQVENEYGS----YG-MEKAYLRQTKQIMEELGIEVP 184
Query: 218 WIMCQES-----DA----PSPMFTPNNPNS--------------------PKIWTENWTG 248
+ DA +F N S P + E W G
Sbjct: 185 LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLRKFMTRHGKKWPLMCMEYWDG 244
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF WG +R DLA V G N YM+HGGTNFG R +
Sbjct: 245 WFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLPQV 302
Query: 302 TSYDYDAPIDEYG 314
TSYDYDA + E G
Sbjct: 303 TSYDYDALLTEAG 315
>gi|348573619|ref|XP_003472588.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Cavia
porcellus]
Length = 880
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 113/307 (36%), Positives = 152/307 (49%), Gaps = 29/307 (9%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+ GSIHY R W D + K K GL+ + TYV WN HEP R ++DF+GNLDL F+
Sbjct: 307 IFGGSIHYFRVPREYWRDRLLKLKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVL 366
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
+ GL+VILR GPY+CAE + GG P WL PG+ +LRTT + F + + + M
Sbjct: 367 LAAEIGLWVILRPGPYICAEIDLGGLPSWLLQDPGM-KLRTTYQGFTEAVDLYFDHL--M 423
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYG----------DAGKSYINWCAKMATSLDI 214
++ L GGPII Q+ENEYG+ D D G + + L
Sbjct: 424 SRVVPLQYKHGGPIIAVQVENEYGSYNRDPAYMPYIKKALEDRGIIELLLTSDNKDGLQK 483
Query: 215 GVPWIMC--------QESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLA 266
GV + QE + + N PK+ E WTGWF SWGG + ++
Sbjct: 484 GVVHGVLATINLQSQQELQSLTTSLLSVQGNQPKMVMEYWTGWFDSWGGPHNILDSSEVL 543
Query: 267 FAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYDYDAPIDEYGHLNQPK 320
V+ G + N YM+HGGTNFG +G + TSYDYDA + E G K
Sbjct: 544 DTVSAITNAGSSI-NLYMFHGGTNFGFINGAMHFNDYKSDVTSYDYDAVLTEAGDYTA-K 601
Query: 321 WGHLREL 327
+G LR+
Sbjct: 602 YGKLRDF 608
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 57/230 (24%), Positives = 89/230 (38%), Gaps = 58/230 (25%)
Query: 464 INSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKF 523
+ GQV V+ ++D + T E + LT+G + +L G NYG+
Sbjct: 691 VRDRGQVFVNTVSIGFLDYKTT---------EIVIPLTQGYTVLRILVENRGRVNYGNNI 741
Query: 524 DMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKK-FYNAKAANSERGWSS 582
D G+ G + L + + +K+ +Y LD KK F+ +A+ WS
Sbjct: 742 DDQRKGLIGDLYL-----NNSPLKNFR---------IYSLDMKKSFFQRFSADK---WSP 784
Query: 583 KNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCST 642
+ P +D L L+G KG +VNG+NLGRYW
Sbjct: 785 VPEAPALPAFFLGVLSILPSPSD-TFLKLEGWEKGVVFVNGHNLGRYW------------ 831
Query: 643 ESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNP 692
N G P + Y +P +W+ G N +++FEE P
Sbjct: 832 ----------------NIG-PQETLY-LPGAWLNSGANQVIVFEETMAGP 863
>gi|123788298|sp|Q3UPY5.1|GLBL2_MOUSE RecName: Full=Beta-galactosidase-1-like protein 2; Flags: Precursor
gi|74224567|dbj|BAE25259.1| unnamed protein product [Mus musculus]
Length = 636
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 113/314 (35%), Positives = 156/314 (49%), Gaps = 43/314 (13%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+L GSIHY R W D + K K GL+ + TYV WN HEP R ++DF+GNLDL FI+
Sbjct: 63 ILGGSIHYFRVPREYWRDRLLKLKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFIQ 122
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
GL+VILR GPY+C+E + GG P WL P + +LRTT F ++ + + M
Sbjct: 123 LAAKIGLWVILRPGPYICSEIDLGGLPSWLLQDPDM-KLRTTYHGFTKAVELYFDHL--M 179
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMC--- 221
++ L GGPII Q+ENEYG+ D ++Y+ + K D G+ ++
Sbjct: 180 SRVVPLQYKHGGPIIAVQVENEYGSYNKD-----RAYMPYIKKALE--DRGIIEMLLTSD 232
Query: 222 ----------------------QESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGKDPK 259
QE A + + PK+ E WTGWF SWGG
Sbjct: 233 NKDGLEKGVVDGVLATINLQSQQELMALNTVLLSIQGIQPKMVMEYWTGWFDSWGGSHNI 292
Query: 260 RTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYDYDAPIDEY 313
+ ++ V+ + G + N YM+HGGTNFG +G + TSYDYDA + E
Sbjct: 293 LDSSEVLQTVSAIIKDGSSI-NLYMFHGGTNFGFINGAMHFNDYKADVTSYDYDAILTEA 351
Query: 314 GHLNQPKWGHLREL 327
G K+ LREL
Sbjct: 352 GDYT-AKYTKLREL 364
>gi|408677368|ref|YP_006877195.1| Beta-galactosidase, partial [Streptomyces venezuelae ATCC 10712]
gi|328881697|emb|CCA54936.1| Beta-galactosidase, partial [Streptomyces venezuelae ATCC 10712]
Length = 611
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 108/325 (33%), Positives = 155/325 (47%), Gaps = 48/325 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG LLSG++HY R W + + GL+ +ETYV WN HEP +Y L
Sbjct: 13 LDGRPVRLLSGALHYFRVREEQWEHRLGMLRAMGLNCVETYVPWNLHEPEPGRYADVAAL 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQN- 156
RF+ + G++ I+R GPY+CAEW GG P WL G +R+ + F+ ++
Sbjct: 73 G--RFLDAVARAGMWAIVRPGPYICAEWENGGLPHWLTGPLG-RRVRSFDPEFLAPVEAW 129
Query: 157 FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGV 216
F L+ + +++ +GGP++L Q+ENEYG+ SD ++Y+ W A++ + V
Sbjct: 130 FRRLLPQVVERQ---IDRGGPVVLVQVENEYGSYGSD-----RAYLEWLAELLRGCGVAV 181
Query: 217 PWI--------MCQESDAPSPMFTPN---------------NPNSPKIWTENWTGWFKSW 253
P M P + T N P+ P + E W GWF W
Sbjct: 182 PLFTSDGPEDHMLTGGSVPGVLATANFGSGAREGFATLRRHQPSGPLMCMEFWCGWFDHW 241
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG---------GPY--LTT 302
G + R A D A A+ + G + N YM HGGTNFG +G GP T
Sbjct: 242 GTEHAVRDAADAAEALREILECGASV-NVYMAHGGTNFGGFAGANRAGELHDGPLRATVT 300
Query: 303 SYDYDAPIDEYGHLNQPKWGHLREL 327
SYDYDAP+DE G + W RE+
Sbjct: 301 SYDYDAPVDEAGRPTEKFW-RFREV 324
>gi|229548754|ref|ZP_04437479.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|257421063|ref|ZP_05598053.1| glycosyl hydrolase [Enterococcus faecalis X98]
gi|312951816|ref|ZP_07770707.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|422691033|ref|ZP_16749073.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|422707894|ref|ZP_16765431.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
gi|229306094|gb|EEN72090.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|257162887|gb|EEU92847.1| glycosyl hydrolase [Enterococcus faecalis X98]
gi|310630219|gb|EFQ13502.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|315154243|gb|EFT98259.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|315154885|gb|EFT98901.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
Length = 593
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 113/356 (31%), Positives = 169/356 (47%), Gaps = 50/356 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 13 LNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGMK 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
++ F++ + L VILR Y+CAEW +GG P WL G+ LR+T+ +FM +++N+
Sbjct: 73 NIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGV-RLRSTDPIFMTKVRNY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG K+Y+ ++ L I VP
Sbjct: 132 FQVL--LPKLAPLQITQGGPVIMMQVENEYGS----YG-MEKAYLRQTKQIMEELGIEVP 184
Query: 218 WIM----------------------------CQESDAPSPMF-TPNNPNSPKIWTENWTG 248
+E+ A F T + P + E W G
Sbjct: 185 LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYWDG 244
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF WG +R DLA V G N YM+HGGTNFG R +
Sbjct: 245 WFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLPQV 302
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLL----KSMEKTLTYGNVTNTDYGNSVS 353
TSYDYDA + E G + + + + ++ ++ +T GN+ + SVS
Sbjct: 303 TSYDYDALLTEAGEPTEKYYAVQKAIKEVCPEVWQAQPRTKKLGNLGSFSVTASVS 358
>gi|424760912|ref|ZP_18188500.1| putative beta-galactosidase [Enterococcus faecalis R508]
gi|402402633|gb|EJV35336.1| putative beta-galactosidase [Enterococcus faecalis R508]
Length = 593
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 113/356 (31%), Positives = 169/356 (47%), Gaps = 50/356 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 13 LNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGMK 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
++ F++ + L VILR Y+CAEW +GG P WL G+ LR+T+ +FM +++N+
Sbjct: 73 NIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGV-RLRSTDPIFMTKVRNY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG K+Y+ ++ L I VP
Sbjct: 132 FQVL--LPKLAPLQITQGGPVIMMQVENEYGS----YG-MEKAYLRQTKQIMEELGIEVP 184
Query: 218 WIM----------------------------CQESDAPSPMF-TPNNPNSPKIWTENWTG 248
+E+ A F T + P + E W G
Sbjct: 185 LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYWDG 244
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF WG +R DLA V G N YM+HGGTNFG R +
Sbjct: 245 WFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAKDLPQV 302
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLL----KSMEKTLTYGNVTNTDYGNSVS 353
TSYDYDA + E G + + + + ++ ++ +T GN+ + SVS
Sbjct: 303 TSYDYDALLTEAGEPTEKYYAVQKAIKEVCPEVWQAQPRTKKLGNLGSFSVTASVS 358
>gi|32709094|gb|AAP86763.1| beta-galactosidase Gal35I [Xanthomonas campestris pv. campestris]
Length = 613
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 119/361 (32%), Positives = 169/361 (46%), Gaps = 54/361 (14%)
Query: 9 RAILLCLILQTLFNLSLAYRVSHD---------GRAITIDGERKILLSGSIHYPRSTPGM 59
R L L+L L + + D G DG+ +LSG+IH+ R
Sbjct: 3 RTTLAPLVLALAIALPITATAASDDQWPTFATQGTQFVRDGKPYQVLSGAIHFQRIPRAY 62
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W D ++KA+ GL+ +ETYVFWN EP + Q+DF N D+ F++ QGL VILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFNANNDVAAFVREAAAQGLNVILRPGP 122
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKE--KLFASQGGP 177
Y CAEW GG+P WL I +R+ + F+ Q + +D K+ L GGP
Sbjct: 123 YACAEWETGGYPAWLFGKDNIR-VRSRDPRFLAASQAY----LDAVSKQVHPLLNHNGGP 177
Query: 178 IILAQIENEYGNVMSDYG---DAGKSYIN--------WCAKMATSLDIG-VPWIMCQESD 225
II Q+ENEYG+ D+ D Y+ + + A L G +P + +
Sbjct: 178 IIAVQVENEYGSYDDDHAYMADNRAMYVKAGFDDALLFTSDGADMLANGTLPDTLAVVNF 237
Query: 226 APSPM------FTPNNPNSPKIWTENWTGWFKSWG----GKDPKRTAEDLAFAVARFFQF 275
AP P+ P++ E W GWF WG D K+ E+L + + +
Sbjct: 238 APGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGKPHASTDAKQQTEELEWILRQ---- 293
Query: 276 GGTFQNYYMYHGGTNFGRTSGGPY----------LTTSYDYDAPIDEYGHLNQPKWGHLR 325
G N YM+ GGT+FG +G + TTSYDYDA +DE G PK+ +R
Sbjct: 294 -GHSANLYMFIGGTSFGFMNGANFQGNPSDHYAPQTTSYDYDAILDEAGRAT-PKFALMR 351
Query: 326 E 326
+
Sbjct: 352 D 352
>gi|134096920|ref|YP_001102581.1| beta-galactosidase [Saccharopolyspora erythraea NRRL 2338]
gi|291006638|ref|ZP_06564611.1| beta-galactosidase [Saccharopolyspora erythraea NRRL 2338]
gi|133909543|emb|CAL99655.1| beta-galactosidase [Saccharopolyspora erythraea NRRL 2338]
Length = 594
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 111/322 (34%), Positives = 166/322 (51%), Gaps = 34/322 (10%)
Query: 33 GRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYD 92
G +DGE +++G +HY R+ P W + + + + GL++++TYV WN HEP R + D
Sbjct: 21 GNEFLLDGEPFRIIAGEMHYFRTHPDQWRNRLDRMRALGLNSVDTYVAWNFHEPRRGEVD 80
Query: 93 FTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMN 152
FTG D++RF++T + GL VI+R GPY+CAEW++GG P WL G LR ++ +
Sbjct: 81 FTGWRDVVRFVETAAEAGLKVIIRPGPYICAEWDFGGLPAWLLES-GNPPLRCSDPAYTE 139
Query: 153 -EMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATS 211
++ F L+ +A L A++GGP++ Q+ENEYG+ +D + + S
Sbjct: 140 LTLRWFDELLPRLA---PLQATRGGPVLAFQVENEYGSYGNDQTHLEQLRAGMLERGIDS 196
Query: 212 LDI---GVPWIMCQESDAPSPMFTPN---NPNSP-----------KIW-TENWTGWFKSW 253
L G M + + P + T N +P +P +W TE W GWF W
Sbjct: 197 LLFCSNGPSDYMLRGGNLPDTLATVNFAGDPTAPFEALREYQPEGPLWCTEFWDGWFDHW 256
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYLT---------TSY 304
G + + A V R G + + YM GGTNFG +G Y T TSY
Sbjct: 257 GEEHHTTDPVETAGHVDRMLAAGASV-SLYMAVGGTNFGWWAGANYDTSKDQYQPTITSY 315
Query: 305 DYDAPIDEYGHLNQPKWGHLRE 326
DYD+PI E G L + K+ +RE
Sbjct: 316 DYDSPIGEAGELTE-KFQRIRE 336
>gi|411007376|ref|ZP_11383705.1| beta-galactosidase [Streptomyces globisporus C-1027]
Length = 606
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 111/327 (33%), Positives = 152/327 (46%), Gaps = 49/327 (14%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ V DG DG+ LLSG++HY R W + GL+ +ETYV WN HEP
Sbjct: 4 FTVGDDG--FRSDGKPVRLLSGALHYFRVHEEQWEHRLAMLAAMGLNCVETYVPWNLHEP 61
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+ G L RF+ ++ GL+ I+R GPY+CAEW GG PVW+ G +RT
Sbjct: 62 REGEVRDVGALG--RFLDAVERAGLWAIVRPGPYICAEWENGGLPVWVTGRFG-RRVRTR 118
Query: 147 NKVFMNEMQN-FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
+ + ++ F L+ + +++ + +GGP+IL Q ENEYG+ SD Y+ W
Sbjct: 119 DAEYRAVVERWFRELLPQVVQRQVV---RGGPVILVQAENEYGSFGSD-----AVYLEWL 170
Query: 206 AKMATSLDIGVPWI--------MCQESDAPSPMFTPN---------------NPNSPKIW 242
A + + VP M P + T N P P +
Sbjct: 171 AGLLRECGVTVPLFTSDGPEDHMLTGGSVPGLLATANFGSGAREGFEVLRRHQPKGPLMC 230
Query: 243 TENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNF----GRTSGGP 298
E W GWF WG + R AE+ A A+ + G + N YM HGGTNF G GGP
Sbjct: 231 MEFWCGWFDHWGAEPVLRDAEEAAGALREILECGASV-NVYMAHGGTNFAGWAGANRGGP 289
Query: 299 Y-------LTTSYDYDAPIDEYGHLNQ 318
TSYDYDAP+DEYG +
Sbjct: 290 LQDGEFQPTVTSYDYDAPVDEYGRATE 316
>gi|257083732|ref|ZP_05578093.1| beta-galactosidase [Enterococcus faecalis Fly1]
gi|256991762|gb|EEU79064.1| beta-galactosidase [Enterococcus faecalis Fly1]
Length = 593
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 107/313 (34%), Positives = 153/313 (48%), Gaps = 46/313 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 13 LNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGMK 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
++ F++ + L VILR Y+CAEW +GG P WL G+ LR+T+ +FM +++N+
Sbjct: 73 NIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGV-RLRSTDPIFMTKVRNY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG K+Y+ ++ L I VP
Sbjct: 132 FQVL--LPKLSPLQITQGGPVIMMQVENEYGS----YG-MEKAYLQQTKQIMEELGIEVP 184
Query: 218 -------W--IMCQESDAPSPMFTPNNPNS--------------------PKIWTENWTG 248
W ++ + +F N S P + E W G
Sbjct: 185 LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLRKFMTRHGKKWPLMCMEYWDG 244
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF WG +R DLA V G N YM+HGGTNFG R +
Sbjct: 245 WFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLPQV 302
Query: 302 TSYDYDAPIDEYG 314
TSYDYDA + E G
Sbjct: 303 TSYDYDALLTEAG 315
>gi|335430223|ref|ZP_08557118.1| beta-galactosidase Bga35A [Haloplasma contractile SSD-17B]
gi|334888639|gb|EGM26936.1| beta-galactosidase Bga35A [Haloplasma contractile SSD-17B]
Length = 587
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 108/306 (35%), Positives = 147/306 (48%), Gaps = 42/306 (13%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+++G +HY R+ W D + K K G + +ETYV WN HE + Y F GNLD+ FI+
Sbjct: 20 IIAGGMHYFRTMKDSWKDRLIKLKAMGCNTVETYVPWNMHEAKKGVYAFNGNLDIKAFIE 79
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
Q L+VI+R PY+CAEW +GG P WL PG++ +RT K FM ++ + ++ +
Sbjct: 80 LAQSLELFVIVRPSPYICAEWEFGGLPAWLLKDPGMK-VRTVYKPFMKHVKEYFEVLFKI 138
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMC--- 221
L Q GPIIL QIENEYG +D K Y++ K+ VP +
Sbjct: 139 LAP--LQIDQDGPIILMQIENEYGYYGND-----KEYLSTLLKIMRDFGTTVPVVTSDGP 191
Query: 222 --QESDAPSPM--------------------FTPNNPNSPKIWTENWTGWFKSWGG-KDP 258
+ DA S + F N P + E W GWF +WG +
Sbjct: 192 WGEALDAGSLLADVSLPTMNFGTGAKEHIENFKEKYVNKPVMCMEFWVGWFDAWGDDRHH 251
Query: 259 KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYDYDAPIDE 312
R A D A + G N YM+HGGTNFG +G L TSYDYDA + E
Sbjct: 252 TRDASDAANELRDILNEGSV--NIYMFHGGTNFGFMNGANDLEELKPDVTSYDYDAILTE 309
Query: 313 YGHLNQ 318
G L +
Sbjct: 310 CGDLTE 315
>gi|281207977|gb|EFA82155.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
PN500]
Length = 626
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 118/363 (32%), Positives = 182/363 (50%), Gaps = 63/363 (17%)
Query: 10 AILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
AI L LI TL +S A +G + DGE ++SGS HY RS P +W D ++K K
Sbjct: 15 AISLTLIALTLATVS-ASSFYIEGNSFLKDGESFQIISGSFHYFRSHPLLWRDRLQKMKA 73
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
GL+ ++TY+ WN H+ + Q+DFT ++ +F + Q++GL V++R GPY+C EW YGG
Sbjct: 74 AGLNTVQTYIAWNVHQSIDMQFDFT-TYNITQFFEIAQEEGLLVVVRAGPYICGEWEYGG 132
Query: 130 FPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGN 189
FP ++ I R+++ ++ + + +++ M E+L+ + GGPII+ Q+ENEYG+
Sbjct: 133 FPAFIDQTVAI---RSSDPAYLTYVTQYFNVLLPMLN-EQLY-TNGGPIIMVQVENEYGS 187
Query: 190 VMSDYGDAGKSYINWCAKM-------ATSLDIGVPWIMCQES------------------ 224
SD K Y+N + A + GV + S
Sbjct: 188 YGSD-----KLYLNTLLSLYEKYFGTARGQESGVVFYSTDGSGDLYLYGSQIAGVYQTID 242
Query: 225 ----DAPSPMFTPN---NPNSPKIWTENWTGWFKSW-----GGKDPKRTAEDLAFAVARF 272
D P F P P + +E +TGW W G D K A+ L+ A+ +
Sbjct: 243 FGPTDDPESNFKIQRKFEPTGPLMNSEYYTGWLTHWLDSSPAGADTKSVADGLS-AILKL 301
Query: 273 FQFGGTFQNYYMYHGGTNF----GRTSGGP--YLTT--SYDYDAPIDEYGHLNQPKWGHL 324
G N YM++GG+NF G SGG Y T SYDYD+P++E G + K+ +
Sbjct: 302 ----GASVNMYMFYGGSNFGFMNGANSGGANDYEITIQSYDYDSPLNEAGDITN-KYLAI 356
Query: 325 REL 327
R++
Sbjct: 357 RQV 359
>gi|313241555|emb|CBY33800.1| unnamed protein product [Oikopleura dioica]
Length = 571
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 101/313 (32%), Positives = 152/313 (48%), Gaps = 29/313 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
++ DG +DG+ +LSG+IHY R W ++ + GL+ I+ Y+ WN HE R
Sbjct: 8 LTADGDTFKLDGKDFRILSGAIHYFRIPKQSWKHRLQSVVDCGLNTIDVYIPWNLHEKER 67
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+DF G LDL+ F + GL V+ R GPY+C+EW++GG P WL P + +R+
Sbjct: 68 GNFDFGGELDLVEFFTIAAEMGLKVLCRPGPYICSEWDWGGLPSWLLKDPKM-HIRSNYC 126
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
+ + ++ + ++ + L S GGPII Q+ENEYG DY D ++ W A +
Sbjct: 127 GYQAAVSSYFSKLLPLLA--PLQHSNGGPIIAFQVENEYG----DYVDKDNEHLPWLADL 180
Query: 209 ATSLDIGVPWI------------MCQESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGK 256
S + + M + + + PN P + TE W GWF WG
Sbjct: 181 MKSHGLFELFFISDGGHTIRKANMLKLTKSTPISLKSLQPNKPMLVTEFWAGWFDYWGHG 240
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL--------TTSYDYDA 308
+ + + G + N+YM+HGGTNFG +G L TSYDYD
Sbjct: 241 RNLLNNDVFEKTLKEILKRGASV-NFYMFHGGTNFGFMNGAIELEKGYYTADVTSYDYDC 299
Query: 309 PIDEYGHLNQPKW 321
P+DE G+ + KW
Sbjct: 300 PVDESGNRTE-KW 311
>gi|422727867|ref|ZP_16784288.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
gi|315151617|gb|EFT95633.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
Length = 593
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 107/313 (34%), Positives = 152/313 (48%), Gaps = 46/313 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 13 LNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGMK 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
++ F++ + L VILR Y+CAEW +GG P WL G+ LR+T+ +FM +++N+
Sbjct: 73 NIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGV-RLRSTDPIFMTKVRNY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG K+Y+ ++ L I VP
Sbjct: 132 FQVL--LPKLAPLQITQGGPVIMMQVENEYGS----YG-MEKAYLRQTKQIMEELGIEVP 184
Query: 218 WIM----------------------------CQESDAPSPMF-TPNNPNSPKIWTENWTG 248
+E+ A F T + P + E W G
Sbjct: 185 LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYWDG 244
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF WG +R DLA V G N YM+HGGTNFG R +
Sbjct: 245 WFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLPQV 302
Query: 302 TSYDYDAPIDEYG 314
TSYDYDA + E G
Sbjct: 303 TSYDYDALLTEAG 315
>gi|354490770|ref|XP_003507529.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
2-like [Cricetulus griseus]
Length = 689
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 112/307 (36%), Positives = 152/307 (49%), Gaps = 29/307 (9%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+ GS+HY R W D + K K GL+ + TYV WN HEP R ++DF+GNLDL FI+
Sbjct: 116 IFGGSVHYFRVPKEYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFIQ 175
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
GL+VILR GPY+C+E + GG P WL P + +LRTT F + + + M
Sbjct: 176 LAAKIGLWVILRPGPYICSEIDLGGLPSWLLQDPNM-KLRTTYYGFTKAVDLYFDHL--M 232
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYG----------DAGKSYINWCAKMATSLDI 214
++ L GGPII Q+ENEYG+ D+ D G + + L
Sbjct: 233 SRVVPLQYKHGGPIIAVQVENEYGSYYKDHAYMPYIKKALEDRGIIEMLLTSDNKDGLQK 292
Query: 215 GVPWIMC--------QESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLA 266
GV + QE A S + PK+ E WTGWF SWGG + ++
Sbjct: 293 GVVSGVLATINLQSQQELKALSSVLLSIQGIQPKMVMEYWTGWFDSWGGPHNILDSSEVL 352
Query: 267 FAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYDYDAPIDEYGHLNQPK 320
V+ + G + N YM+HGGTNFG +G + TSYDYDA + E G K
Sbjct: 353 QTVSAIIKSGSSI-NLYMFHGGTNFGFINGAMHFNDYKADVTSYDYDAVLTEAGDYTA-K 410
Query: 321 WGHLREL 327
+ LR+L
Sbjct: 411 YTKLRDL 417
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 47/192 (24%), Positives = 74/192 (38%), Gaps = 51/192 (26%)
Query: 498 VKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYK 557
+ LT+G + +L G NYG+ D G+ G + L D +K K
Sbjct: 525 IPLTQGYTTLRILVENRGRVNYGNNIDTQRKGLIGNLYL-----DNNPLK---------K 570
Query: 558 VGLYGLD-DKKFYNAKAANSERGWSSKNVPLNRRM-TWYKTTFEAPLENDPVVLNLQGMG 615
+Y LD K+F+ + W+ +P R ++ + L L+G
Sbjct: 571 FRIYSLDMTKRFFERLDTDK---WNF--IPKQRTFPAFFLGALSVGIYPSDTFLKLEGWT 625
Query: 616 KGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWI 675
KG +VN +NLGRYW N G P + Y +P W+
Sbjct: 626 KGVVFVNDHNLGRYW----------------------------NIG-PQETLY-LPGVWL 655
Query: 676 KDGVNTLVLFEE 687
G+N +++FEE
Sbjct: 656 DKGLNKVIIFEE 667
>gi|157824103|ref|NP_001101662.1| beta-galactosidase precursor [Rattus norvegicus]
gi|149018351|gb|EDL76992.1| galactosidase, beta 1 (mapped) [Rattus norvegicus]
Length = 647
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 159/324 (49%), Gaps = 31/324 (9%)
Query: 39 DGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLD 98
DG+ +SGSIHY R W D + K K GLDAI+TYV WN HEP QYDF+G+ D
Sbjct: 45 DGQPFRYISGSIHYFRIPRFYWEDRLLKMKMAGLDAIQTYVPWNFHEPQPGQYDFSGDRD 104
Query: 99 LIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFT 158
+ FI+ GL VILR GPY+CAEW+ GG P WL I LR+++ ++ + +
Sbjct: 105 VEHFIQLAHQLGLLVILRPGPYICAEWDMGGLPAWLLEKESI-VLRSSDPDYLAAVDKW- 162
Query: 159 TLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS-DYG-----------DAGKSYINWCA 206
L V + K ++L GGPII Q+ENEYG+ + DY G I +
Sbjct: 163 -LAVLLPKMKRLLYQNGGPIITVQVENEYGSYFACDYNYLRFLEHRFRYHLGNDIILFTT 221
Query: 207 --------KMATSLDIGVPWIMCQESDAPSPMFTPNN--PNSPKIWTENWTGWFKSWGGK 256
K T D+ + N P P I +E +TGW WG
Sbjct: 222 DGAAEKLLKCGTLQDLYATVDFGTTGNITRAFLIQRNFEPKGPLINSEFYTGWLDHWGQP 281
Query: 257 DPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYL--TTSYDYDAPIDE 312
K + L ++ +G + N YM+ GGTNF +G PY TSYDYDAP+ E
Sbjct: 282 HSKVNTKKLVASLYNLLAYGASV-NLYMFIGGTNFAYWNGANMPYAPQPTSYDYDAPLSE 340
Query: 313 YGHLNQPKWGHLRELHKLLKSMEK 336
G L + K+ +R++ + K + +
Sbjct: 341 AGDLTE-KYFAVRDVIRKFKEVPE 363
>gi|424687003|ref|ZP_18123658.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|402366194|gb|EJV00591.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
Length = 593
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 107/313 (34%), Positives = 152/313 (48%), Gaps = 46/313 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 13 LNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGMK 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
++ F++ + L VILR Y+CAEW +GG P WL G+ LR+T+ +FM +++N+
Sbjct: 73 NIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGV-RLRSTDPIFMTKVRNY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG K+Y+ ++ L I VP
Sbjct: 132 FQVL--LPKLAPLQITQGGPVIMMQVENEYGS----YG-MEKAYLRQTKQIMEELGIEVP 184
Query: 218 WIM----------------------------CQESDAPSPMF-TPNNPNSPKIWTENWTG 248
+E+ A F T + P + E W G
Sbjct: 185 LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYWDG 244
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF WG +R DLA V G N YM+HGGTNFG R +
Sbjct: 245 WFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAKDLPQV 302
Query: 302 TSYDYDAPIDEYG 314
TSYDYDA + E G
Sbjct: 303 TSYDYDALLTEAG 315
>gi|257415380|ref|ZP_05592374.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
gi|257157208|gb|EEU87168.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
Length = 593
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 107/313 (34%), Positives = 152/313 (48%), Gaps = 46/313 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 13 LNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGMK 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
++ F++ + L VILR Y+CAEW +GG P WL G+ LR+T+ +FM +++N+
Sbjct: 73 NIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGV-RLRSTDPIFMTKVRNY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG K+Y+ ++ L I VP
Sbjct: 132 FQVL--LPKLAPLQITQGGPVIMMQVENEYGS----YG-MEKAYLRQTKQIMEELGIEVP 184
Query: 218 WIM----------------------------CQESDAPSPMF-TPNNPNSPKIWTENWTG 248
+E+ A F T + P + E W G
Sbjct: 185 LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYWDG 244
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF WG +R DLA V G N YM+HGGTNFG R +
Sbjct: 245 WFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLPQV 302
Query: 302 TSYDYDAPIDEYG 314
TSYDYDA + E G
Sbjct: 303 TSYDYDALLTEAG 315
>gi|256959941|ref|ZP_05564112.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|293384307|ref|ZP_06630193.1| beta-galactosidase [Enterococcus faecalis R712]
gi|293388457|ref|ZP_06632963.1| beta-galactosidase [Enterococcus faecalis S613]
gi|312907112|ref|ZP_07766105.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|312979309|ref|ZP_07791007.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|256950437|gb|EEU67069.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|291078380|gb|EFE15744.1| beta-galactosidase [Enterococcus faecalis R712]
gi|291082147|gb|EFE19110.1| beta-galactosidase [Enterococcus faecalis S613]
gi|310626889|gb|EFQ10172.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|311287903|gb|EFQ66459.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
Length = 593
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 107/313 (34%), Positives = 153/313 (48%), Gaps = 46/313 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 13 LNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGMK 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
++ F++ + L VILR Y+CAEW +GG P WL G+ LR+T+ +FM +++N+
Sbjct: 73 NIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGV-RLRSTDPIFMTKVRNY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG K+Y+ ++ L I VP
Sbjct: 132 FQVL--LPKLAPLQITQGGPVIMMQVENEYGS----YG-MEKAYLQQTKQIMEELGIEVP 184
Query: 218 -------W--IMCQESDAPSPMFTPNNPNS--------------------PKIWTENWTG 248
W ++ + +F N S P + E W G
Sbjct: 185 LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLRKFMTRHGKKWPLMCMEYWDG 244
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF WG +R DLA V G N YM+HGGTNFG R +
Sbjct: 245 WFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLPQV 302
Query: 302 TSYDYDAPIDEYG 314
TSYDYDA + E G
Sbjct: 303 TSYDYDALLTEAG 315
>gi|256423546|ref|YP_003124199.1| beta-galactosidase [Chitinophaga pinensis DSM 2588]
gi|256038454|gb|ACU61998.1| Beta-galactosidase [Chitinophaga pinensis DSM 2588]
Length = 610
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 113/340 (33%), Positives = 166/340 (48%), Gaps = 42/340 (12%)
Query: 11 ILLCLILQTLFNLSLAYRVSHDG-RAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
I L ++ LF+++ G A +DG+ ++SG IHYPR W D +K AK
Sbjct: 7 ITLLIVFSYLFSIAQQQHTFTLGDTAFLLDGKPLQMISGEIHYPRVPRECWRDRMKMAKA 66
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
GL+ I TYVFWN HEP + QYDF+GN D+ F+K +++ L+V+LR PYVCAEW +GG
Sbjct: 67 MGLNTIGTYVFWNVHEPEKGQYDFSGNNDIAAFVKMAKEEDLWVVLRPSPYVCAEWEFGG 126
Query: 130 FPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKK-EKLFASQGGPIILAQIENEYG 188
+P WL + G+ ++R+ ++ +N+ I+ + K+ L + GG I++ QIENEYG
Sbjct: 127 YPYWLQEIKGL-KVRSKEPQYLEAYRNY---IMAVGKQLSPLLVTHGGNILMVQIENEYG 182
Query: 189 NVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDA------PSPMFTPNNPNSP--- 239
+ D K Y++ KM C A P + N + P
Sbjct: 183 SYSDD-----KDYLDINRKMFVEAGFDGLLYTCDPKAAIKNGHLPGLLPAINGVDDPLQV 237
Query: 240 -KIWTENWTG------------WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYH 286
++ EN +G WF WG K + G + N YM+H
Sbjct: 238 KQLINENHSGKGPYYIAEWYPAWFDWWGTKHHTVPYRQYLGKLDSVLAAGISI-NMYMFH 296
Query: 287 GGTNFGRTSGG------PY--LTTSYDYDAPIDEYGHLNQ 318
GGT G +G PY +SYDYDAP+DE G+ +
Sbjct: 297 GGTTRGFMNGANANDADPYEPQISSYDYDAPLDEAGNATE 336
Score = 44.7 bits (104), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 53/191 (27%), Positives = 72/191 (37%), Gaps = 57/191 (29%)
Query: 498 VKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAG-DETIIKDLSSHK-WT 555
+ L GK ++ LL +G N+G P LL R G E ++ D K W
Sbjct: 450 LDLPAGKVKLDLLVENLGRINFG------------PYLLSNRKGITEKVLFDRQELKGWQ 497
Query: 556 YKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMG 615
YGL K A + G NVP T+ + TF D L++ G
Sbjct: 498 Q----YGLPFDKLPAVAAKGIKAG---ANVP-----TYRQGTFTLDKTGD-TWLDMSNWG 544
Query: 616 KGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWI 675
KG W+NG++LGRYW P Q Y VP W+
Sbjct: 545 KGAVWINGHHLGRYWQV-----------------------------GPQQTIY-VPAEWL 574
Query: 676 KDGVNTLVLFE 686
K G+N +V+ E
Sbjct: 575 KKGMNDIVIME 585
>gi|227517783|ref|ZP_03947832.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|424678087|ref|ZP_18114931.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|424681129|ref|ZP_18117923.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|424685648|ref|ZP_18122340.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|424689662|ref|ZP_18126226.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|424693525|ref|ZP_18129955.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|424698239|ref|ZP_18134537.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|424701365|ref|ZP_18137539.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|424702750|ref|ZP_18138894.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|424711867|ref|ZP_18144074.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|424717978|ref|ZP_18147248.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|424722429|ref|ZP_18151489.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|424723619|ref|ZP_18152577.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|424733091|ref|ZP_18161660.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|424746203|ref|ZP_18174452.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|424755204|ref|ZP_18183090.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
gi|227074744|gb|EEI12707.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|402351976|gb|EJU86842.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|402352513|gb|EJU87362.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|402358223|gb|EJU92905.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|402367111|gb|EJV01460.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|402371797|gb|EJV05943.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|402373001|gb|EJV07093.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|402373959|gb|EJV08006.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|402382684|gb|EJV16335.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|402383232|gb|EJV16843.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|402386182|gb|EJV19689.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|402388743|gb|EJV22170.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|402392403|gb|EJV25665.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|402397550|gb|EJV30559.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|402397571|gb|EJV30579.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|402401167|gb|EJV33955.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
Length = 593
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 107/313 (34%), Positives = 152/313 (48%), Gaps = 46/313 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 13 LNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGMK 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
++ F++ + L VILR Y+CAEW +GG P WL G+ LR+T+ +FM +++N+
Sbjct: 73 NIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGV-RLRSTDPIFMTKVRNY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG K+Y+ ++ L I VP
Sbjct: 132 FQVL--LPKLAPLQITQGGPVIMMQVENEYGS----YG-MEKAYLRQTKQIMEELGIEVP 184
Query: 218 WIM----------------------------CQESDAPSPMF-TPNNPNSPKIWTENWTG 248
+E+ A F T + P + E W G
Sbjct: 185 LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYWDG 244
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF WG +R DLA V G N YM+HGGTNFG R +
Sbjct: 245 WFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAKDLPQV 302
Query: 302 TSYDYDAPIDEYG 314
TSYDYDA + E G
Sbjct: 303 TSYDYDALLTEAG 315
>gi|251798103|ref|YP_003012834.1| beta-galactosidase [Paenibacillus sp. JDR-2]
gi|247545729|gb|ACT02748.1| Beta-galactosidase [Paenibacillus sp. JDR-2]
Length = 919
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 104/333 (31%), Positives = 168/333 (50%), Gaps = 31/333 (9%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
V ++ + I+GE+ L S +IHY R W +++ KAK G++ ++TY WN HEP
Sbjct: 18 VQYNAFSYNINGEQVFLNSAAIHYFRMPKEEWREVLVKAKLAGMNCVDTYFAWNVHEPEE 77
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+++F G+ D F+ + GL+VI R GP++CAEW++GGFP WL+ + + R +
Sbjct: 78 GEWNFEGDNDCGAFLDLCHELGLWVIARPGPFICAEWDFGGFPYWLNTKKDM-KFRAFDM 136
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
++ + + I+ + + ++ A GG +IL Q+ENEYG + SD + + Y+ +
Sbjct: 137 QYLTYVDRYMDRIIPIIRDREINA--GGSVILVQVENEYGYLASD--EVARDYMLHLRDV 192
Query: 209 ATSLDIGVPWIMC--------------QESDAPSPMFTPNNPNSPKIWTENWTGWFKSWG 254
+ VP I C +D P++PKI TE WTGWF+ WG
Sbjct: 193 MLDRGVMVPLITCVGGAEGTVEGANFWSGADHHYNNLVQKQPDTPKIVTEFWTGWFEHWG 252
Query: 255 GKDPKRTAEDLAFAVARFFQ---FGGTFQNYYM----YHGGTNFGRTSGGP--YLTTSYD 305
P T + A R + G T ++YM + G GRT G ++ TSYD
Sbjct: 253 A--PAATQKTAALYEKRMLESLRAGFTGVSHYMFFGGTNFGGYGGRTVGASDIFMVTSYD 310
Query: 306 YDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTL 338
YDAP+ EYG + K+ + + +++ E L
Sbjct: 311 YDAPLSEYGRVTD-KYNTAKRMSYFVQATESVL 342
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 47/110 (42%), Gaps = 35/110 (31%)
Query: 592 TWYKTTFEAPL----ENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDY 647
W+ F+ P N + L L GM KG W+NG +LGRYW
Sbjct: 826 VWHTVQFDKPELPADVNAKLKLRLTGMSKGTLWLNGIDLGRYWQV--------------- 870
Query: 648 RGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
GP Q Y +P +W+KD N LVLF+E G +PS++
Sbjct: 871 -GP--------------QEDYKIPMAWLKDR-NELVLFDENGASPSKVRL 904
>gi|357132771|ref|XP_003568002.1| PREDICTED: beta-galactosidase 8-like [Brachypodium distachyon]
Length = 674
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 113/360 (31%), Positives = 172/360 (47%), Gaps = 57/360 (15%)
Query: 32 DGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQY 91
+G A DGER ++ G +HY R P W D + +AK GL+ ++TYV WN HEP + +
Sbjct: 36 EGDAFRKDGERFQIVGGDVHYFRIVPEYWKDRLLRAKALGLNTVQTYVPWNLHEPEPQSW 95
Query: 92 DFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFM 151
+F G D+ +++ + + V+LR+GPY+C EW+ GGFP WL + +LR+++ ++
Sbjct: 96 EFNGFADIESYLRLAHELEMLVMLRVGPYICGEWDLGGFPPWLLTIEPALKLRSSDSAYL 155
Query: 152 NEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATS 211
+ ++ + ++ + K L S GGPII+ QIENE+G+ D K+Y+++ +A
Sbjct: 156 SLVERWWKVL--LPKVAPLLYSNGGPIIMVQIENEFGSFGDD-----KNYLHYLVLLARR 208
Query: 212 L-------------------------DIGVPWIMCQESDAPSPMFTP----NNP-NSPKI 241
D + D P P+F N P S +
Sbjct: 209 YLGNDIILYTTDGGTIGTLKNGSIHQDDVFAAVDFSTGDDPWPIFRLQKEYNFPGKSAPL 268
Query: 242 WTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG---- 297
E +TGW WG A A A+ G+ YM HGGTNFG +G
Sbjct: 269 TAEFYTGWLTHWGESIATTDASSTAKALKSILCRNGS-AVLYMAHGGTNFGFYNGANTGQ 327
Query: 298 -----PYLTTSYDYDAPIDEYGHLNQPKWGHLRE---------LHKLLKSMEKTLTYGNV 343
TSYDYDAPI E+G ++ PK+ LR LH L ++E++ YG V
Sbjct: 328 NESAYKADLTSYDYDAPIKEHGDVHNPKYKALRSVIHECTGTPLHPLPANIERS-NYGLV 386
>gi|255971270|ref|ZP_05421856.1| beta-galactosidase [Enterococcus faecalis T1]
gi|255962288|gb|EET94764.1| beta-galactosidase [Enterococcus faecalis T1]
Length = 593
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 107/313 (34%), Positives = 152/313 (48%), Gaps = 46/313 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 13 LNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGMK 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
++ F++ + L VILR Y+CAEW +GG P WL G+ LR+T+ +FM +++N+
Sbjct: 73 NIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGV-RLRSTDPIFMTKVRNY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG K+Y+ ++ L I VP
Sbjct: 132 FQVL--LPKLAPLQITQGGPVIMMQVENEYGS----YG-MEKAYLRQTRQIMEELGIEVP 184
Query: 218 WIM----------------------------CQESDAPSPMF-TPNNPNSPKIWTENWTG 248
+E+ A F T + P + E W G
Sbjct: 185 LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYWDG 244
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF WG +R DLA V G N YM+HGGTNFG R +
Sbjct: 245 WFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAKDLPQV 302
Query: 302 TSYDYDAPIDEYG 314
TSYDYDA + E G
Sbjct: 303 TSYDYDALLTEAG 315
>gi|261406481|ref|YP_003242722.1| beta-galactosidase [Paenibacillus sp. Y412MC10]
gi|261282944|gb|ACX64915.1| Beta-galactosidase [Paenibacillus sp. Y412MC10]
Length = 619
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 97/309 (31%), Positives = 155/309 (50%), Gaps = 44/309 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ ++SG+IHY R P W D + K K G + +ETY+ WN HEP +++F+G
Sbjct: 13 LDGQPYRIISGAIHYFRVVPEYWEDRLLKLKACGFNTVETYIAWNVHEPQEGEFNFSGMA 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+ FI+ GL+VI+R P++CAEW +GG P WL I LR ++ ++++++ ++
Sbjct: 73 DVASFIELAGKLGLHVIVRPSPFICAEWEFGGLPGWLLGYGEI-RLRCSDPLYLSKVDHY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + L ++ GGPI+ Q+ENEYG+ +D+ +Y+ + + + V
Sbjct: 132 YDELI--PQLVPLLSTHGGPILAVQVENEYGSYGNDH-----AYLEYLREGLVRRGVDV- 183
Query: 218 WIMCQESDAPSP--------------------------MFTPNNPNSPKIWTENWTGWFK 251
+ SD P+ + P + E W GWF
Sbjct: 184 --LLFTSDGPTDEMLLGGTLSDVHATVNFGSRVEESFRKYREYRAEEPLMVMEFWNGWFD 241
Query: 252 SWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYD 305
W R A D+A + + G + N YM+HGGTNFG SG ++ TTSYD
Sbjct: 242 HWMEDHHVRDAADVAGVLDEMLEMGSSM-NMYMFHGGTNFGFYSGANHIQAYEPTTTSYD 300
Query: 306 YDAPIDEYG 314
YDAP+ E+G
Sbjct: 301 YDAPLTEWG 309
Score = 40.4 bits (93), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 37/97 (38%), Gaps = 31/97 (31%)
Query: 593 WYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYG 652
+Y+ FE D L G KG AW+NG+NLGRYW G
Sbjct: 519 FYRGCFEVEEIGD-TFLRFDGWTKGVAWINGFNLGRYWKA-------------------G 558
Query: 653 SDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
K Y +P ++ G N LVLFE G
Sbjct: 559 PQKALY-----------IPGPLLRKGENELVLFELHG 584
>gi|255973889|ref|ZP_05424475.1| beta-galactosidase [Enterococcus faecalis T2]
gi|307284354|ref|ZP_07564519.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|255966761|gb|EET97383.1| beta-galactosidase [Enterococcus faecalis T2]
gi|306503294|gb|EFM72546.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
Length = 593
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 107/313 (34%), Positives = 152/313 (48%), Gaps = 46/313 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 13 LNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGMK 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
++ F++ + L VILR Y+CAEW +GG P WL G+ LR+T+ +FM +++N+
Sbjct: 73 NIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGV-RLRSTDPIFMTKVRNY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG K+Y+ ++ L I VP
Sbjct: 132 FQVL--LPKLAPLQITQGGPVIMMQVENEYGS----YG-MEKAYLRQTRQIMEELGIEVP 184
Query: 218 WIM----------------------------CQESDAPSPMF-TPNNPNSPKIWTENWTG 248
+E+ A F T + P + E W G
Sbjct: 185 LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYWDG 244
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF WG +R DLA V G N YM+HGGTNFG R +
Sbjct: 245 WFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAKDLPQV 302
Query: 302 TSYDYDAPIDEYG 314
TSYDYDA + E G
Sbjct: 303 TSYDYDALLTEAG 315
>gi|357391354|ref|YP_004906195.1| putative beta-galactosidase [Kitasatospora setae KM-6054]
gi|311897831|dbj|BAJ30239.1| putative beta-galactosidase [Kitasatospora setae KM-6054]
Length = 588
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 107/326 (32%), Positives = 151/326 (46%), Gaps = 50/326 (15%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
+++D +DG +LSG++HY RS P W D + + GL+ +ETYV WN HEP
Sbjct: 2 LTYDSTGFRLDGRPLRVLSGAVHYFRSRPEQWADRLAAVRAMGLNTVETYVPWNLHEPAP 61
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
++ G L F+ + QGL+ I+R GPY+CAEW+ GG P WL G +RT +
Sbjct: 62 GRFARVGELGA--FLDEARRQGLWTIVRPGPYICAEWDNGGLPGWLTARLG-RRVRTGDP 118
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
F+ + F +++ E+ + G +++ Q+ENEYG SD G Y+ A+
Sbjct: 119 EFLAAVGAFFDVLLPQV-VERQWGRPDGSVLMVQVENEYGAFGSDAG-----YLAALARG 172
Query: 209 ATSLDIGVPWI--------MCQESDAPSPMFTPN---------------NPNSPKIWTEN 245
+ VP M P + T N P P E
Sbjct: 173 LRERGVSVPLFTSDGPEDHMLAAGTVPGVLATVNFGSDPERGFAALRRHRPEDPPFCMEF 232
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-------- 297
W GWF WG R A+D A ++ R GG+ N YM HGGT+FG ++G
Sbjct: 233 WNGWFDQWGRPHHTRGADDAADSLRRILAAGGSV-NLYMAHGGTSFGTSAGANHADPPFN 291
Query: 298 -------PY--LTTSYDYDAPIDEYG 314
PY TSYDYDAP+DE G
Sbjct: 292 STDWTHSPYQPTVTSYDYDAPLDERG 317
>gi|325922356|ref|ZP_08184130.1| beta-galactosidase [Xanthomonas gardneri ATCC 19865]
gi|325547138|gb|EGD18218.1| beta-galactosidase [Xanthomonas gardneri ATCC 19865]
Length = 613
Score = 166 bits (420), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 123/361 (34%), Positives = 171/361 (47%), Gaps = 54/361 (14%)
Query: 9 RAILLCLILQTLFNLSL-AYRVSHD--------GRAITIDGERKILLSGSIHYPRSTPGM 59
R L L+L F L + A + D G DG+ LLSG+IH+ R
Sbjct: 3 RTTLAPLVLALAFALPVTAIAATTDTWPSFGTQGTQFVRDGKPYQLLSGAIHFQRIPREY 62
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W D ++KA+ GL+ +ETYVFWN EP + Q+DF GN D+ F++ QGL VILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFAGNNDVAAFVREAAAQGLNVILRPGP 122
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKE--KLFASQGGP 177
Y CAEW GG+P WL I +R+ + F+ Q + +D K+ L GGP
Sbjct: 123 YTCAEWEAGGYPAWLFGKDNIR-VRSRDPRFLAASQAY----LDAVSKQVHPLLNHNGGP 177
Query: 178 IILAQIENEYGNVMSDYG---DAGKSYIN--------WCAKMATSLDIG-VPWIMCQESD 225
II Q+ENEYG+ D+ D Y+ + + A L G +P + +
Sbjct: 178 IIAVQVENEYGSYDDDHAYMADNRAMYVKAGFDDALLFTSDGADMLANGTLPDTLAVVNF 237
Query: 226 APSPMFTPN------NPNSPKIWTENWTGWFKSWG----GKDPKRTAEDLAFAVARFFQF 275
AP T P P++ E W GWF WG D K+ E+ + + +
Sbjct: 238 APGEAKTAFEKLIKFRPEQPRMVGEYWAGWFDHWGKPHASTDAKQQTEEFEWILRQ---- 293
Query: 276 GGTFQNYYMYHGGTNFGRTSGGPY----------LTTSYDYDAPIDEYGHLNQPKWGHLR 325
G N YM+ GGT+FG +G + TTSYDYDA +DE G PK+ +R
Sbjct: 294 -GHSANLYMFIGGTSFGFMNGANFQGNPSDHYAPQTTSYDYDAILDEAGRPT-PKFALMR 351
Query: 326 E 326
+
Sbjct: 352 D 352
>gi|114641374|ref|XP_001157987.1| PREDICTED: galactosidase, beta 1-like 2 isoform 2 [Pan troglodytes]
Length = 636
Score = 166 bits (420), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 118/341 (34%), Positives = 165/341 (48%), Gaps = 42/341 (12%)
Query: 19 TLFNLSLAYR---VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAI 75
TL L L +R + G ++G + GSIHY R W D + K K GL+ +
Sbjct: 34 TLVPLRLRHRQLGLQAKGWNFVLEGSTFWIFGGSIHYFRVPREYWRDRLLKMKACGLNTL 93
Query: 76 ETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLH 135
TYV WN HEP R ++DF+GNLDL F+ + GL+VILR GPY+C+E + GG P WL
Sbjct: 94 TTYVPWNLHEPERSKFDFSGNLDLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLL 153
Query: 136 NMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYG 195
PG+ LRTT K F + + + M++ L +GGPII Q+ENEYG+ D
Sbjct: 154 QDPGM-RLRTTYKGFTEAVDLYFDHL--MSRVVPLQYKRGGPIIAVQVENEYGSYNKD-- 208
Query: 196 DAGKSYINWCAK-------------------MATSLDIGVPWIMCQESDAPSPMFTPNNP 236
+Y+ + K ++ + GV + +S + T
Sbjct: 209 ---PAYMPYVKKALEDRGIVELLLTSDNKDGLSKGIVQGVLATINLQSTHELQLLTTFLF 265
Query: 237 N----SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG 292
N PK+ E WTGWF SWGG + ++ V+ G + N YM+HGGTNFG
Sbjct: 266 NVQGTQPKMVMEYWTGWFDSWGGPHNILDSSEVLKTVSAIVDAGSSI-NLYMFHGGTNFG 324
Query: 293 RTSGGPYL------TTSYDYDAPIDEYGHLNQPKWGHLREL 327
+G + TSYDYDA + E G K+ LR+
Sbjct: 325 FMNGAMHFHDYKSDVTSYDYDAVLTEAGDYTA-KYMKLRDF 364
Score = 44.7 bits (104), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 58/242 (23%), Positives = 95/242 (39%), Gaps = 64/242 (26%)
Query: 453 ILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSA 512
ILSG R++ GQV V+ ++D + TK + L +G + +L
Sbjct: 442 ILSG------RVHDRGQVFVNTVSIGFLDYKTTKIA---------LPLIQGYTVLRILVE 486
Query: 513 TVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAK 572
G NYG D G+ G + L +++ +K+ +Y LD KK + +
Sbjct: 487 NRGRVNYGENIDDQRKGLIGNLYL-----NDSPLKNFR---------IYSLDMKKSFFQR 532
Query: 573 AANSERGWSSKNVPLNRRM-TWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWP 631
+ WSS +P + ++ + L L+G KG ++NG NLGRYW
Sbjct: 533 FGLDK--WSS--LPETPTLPAFFLGSLSISSTPCDTFLKLEGWEKGVVFINGQNLGRYW- 587
Query: 632 TYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGN 691
N G P + Y +P W+ G+N +++FEE
Sbjct: 588 ---------------------------NIG-PQKTLY-LPGPWLSSGINQVIIFEETMAG 618
Query: 692 PS 693
P+
Sbjct: 619 PA 620
>gi|239986962|ref|ZP_04707626.1| putative beta-galactosidase [Streptomyces roseosporus NRRL 11379]
Length = 606
Score = 166 bits (420), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 111/327 (33%), Positives = 152/327 (46%), Gaps = 49/327 (14%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ V DG DG+ LLSG++HY R W + GL+ +ETYV WN HEP
Sbjct: 4 FTVDDDG--FRFDGKPVRLLSGALHYFRVHEEQWGHRLAVLAAMGLNCVETYVPWNLHEP 61
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
+ G L RF+ ++ GL+ I+R GPY+CAEW GG PVW+ G +RT
Sbjct: 62 REGEVRDVGALG--RFLDAVERAGLWAIVRPGPYICAEWENGGLPVWVTGRFG-RRVRTR 118
Query: 147 NKVFMNEMQN-FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
+ + ++ F L+ + +++ + +GGP+IL Q ENEYG+ SD Y+ W
Sbjct: 119 DAEYRAVVERWFRELLPQVVERQVV---RGGPVILVQAENEYGSFGSD-----AVYLEWL 170
Query: 206 AKMATSLDIGVPWI--------MCQESDAPSPMFTPN---------------NPNSPKIW 242
A + + VP M P + T N P P +
Sbjct: 171 AGLLRECGVTVPLFTSDGPEDHMLTGGSVPGLLATANFGSGAREGFAVLRRHQPKGPLMC 230
Query: 243 TENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNF----GRTSGGP 298
E W GWF WG + R AE+ A A+ + G + N YM HGGTNF G GGP
Sbjct: 231 MEFWCGWFDHWGAEPVLRDAEEAAGALREILECGASV-NIYMAHGGTNFAGWAGANRGGP 289
Query: 299 Y-------LTTSYDYDAPIDEYGHLNQ 318
TSYDYDAP+DEYG +
Sbjct: 290 LQDGEFQPTVTSYDYDAPVDEYGRATE 316
>gi|392950288|ref|ZP_10315845.1| Beta-galactosidase 3 [Lactobacillus pentosus KCA1]
gi|392434570|gb|EIW12537.1| Beta-galactosidase 3 [Lactobacillus pentosus KCA1]
Length = 588
Score = 166 bits (420), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 107/357 (29%), Positives = 172/357 (48%), Gaps = 48/357 (13%)
Query: 34 RAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDF 93
+ ++G+ + SG++HY R P W D ++K K GL+ +ETY+ WN HEP Q+ F
Sbjct: 10 KEFLLNGQPFKIYSGAVHYFRIAPSEWRDTLEKLKAAGLNTVETYIPWNVHEPQEGQFVF 69
Query: 94 TGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNE 153
D+ +F+K Q GLYVILR PY+CAEW +GG P WL P + +R+ FM +
Sbjct: 70 EDRYDIGKFVKLAQSIGLYVILRPSPYICAEWEFGGLPAWLLRYPDM-VVRSNTPRFMEK 128
Query: 154 MQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLD 213
+ N+ + + ++ + GGP+++ Q+ENEYG+ +D K+Y+ + +
Sbjct: 129 VANYYEALFKVLVPLQI--THGGPVLMMQVENEYGSFGND-----KAYLRHVKSLMETNG 181
Query: 214 IGVPWIMC----------------------------QESDAPSPMFT-PNNPNSPKIWTE 244
+ VP +E+ A F ++ N P + E
Sbjct: 182 VDVPLFTADGSWQQALKAGSLIEDDVFVTANFGSKSRENLAELRQFMLMHHKNWPLMCME 241
Query: 245 NWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG------- 297
W GWF W + R+A+ +A + +F N YM+ GGTNFG +G
Sbjct: 242 FWDGWFNRWQEEIVTRSADSFQTDLAELVKEQASF-NLYMFRGGTNFGFFNGCSSRQNVD 300
Query: 298 -PYLTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVS 353
P + TSYDYDA + E G ++ K+ L+ + + S KT+ G + + N V+
Sbjct: 301 YPQI-TSYDYDAVLHEDGRPSE-KYDKLQTILNVKASTPKTIDIGTYSQPNLVNRVN 355
>gi|380512533|ref|ZP_09855940.1| beta-galactosidase [Xanthomonas sacchari NCPPB 4393]
Length = 616
Score = 166 bits (420), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 121/380 (31%), Positives = 174/380 (45%), Gaps = 53/380 (13%)
Query: 39 DGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLD 98
DG+ ++SG+IH+ R W D ++KA+ GL+ +ETYVFWN EP Q+DF+GN D
Sbjct: 44 DGKPYQVISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVEPRPGQFDFSGNND 103
Query: 99 LIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFT 158
+ F+ QGL VILR GPYVCAEW GG+P WL PG+ +R+ + F+ Q +
Sbjct: 104 IAAFVDEAAAQGLNVILRPGPYVCAEWEAGGYPAWLFAEPGMR-VRSQDPRFLAASQAYL 162
Query: 159 TLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPW 218
+ K GGPI+ Q+ENEYG+ YGD +Y+ M + G
Sbjct: 163 DALAAQVKPR--LNGNGGPIVAVQVENEYGS----YGD-DHAYMRLNRAM--FVQAGFDK 213
Query: 219 IMCQESDAPSPM-------------FTPNN------------PNSPKIWTENWTGWFKSW 253
+ +D P + F P + P P++ E W GWF W
Sbjct: 214 ALLFTADGPDVLANGTLPDTLAVVNFAPGDAKNAFETLAKFRPGQPQMVGEYWAGWFDQW 273
Query: 254 GGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY----------LTTS 303
G K A A + G + N YM+ GGT+FG +G + TTS
Sbjct: 274 GEKHAATDATKQASEFEWILRQGHS-ANIYMFVGGTSFGFMNGANFQKNPSDHYAPQTTS 332
Query: 304 YDYDAPIDEYGHLNQPKWGHLRELHKLL-----KSMEKTLTYGNVTNTDYGNSVSGSSYN 358
YDYDA +DE G PK+ R+ + + ++ K + + + T S S N
Sbjct: 333 YDYDAVLDEAGRPT-PKFTLFRDAIQRVTGIAPPALPKPIRFAELPATPLRESASLWD-N 390
Query: 359 LPAWSVSILPDCKTEEFNTA 378
LPA + + E + A
Sbjct: 391 LPAPAATTDTPQPMERYGQA 410
Score = 43.1 bits (100), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 58/133 (43%), Gaps = 24/133 (18%)
Query: 498 VKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYK 557
V + G + + +L G NYG+ G+ PVLL G K L+ ++
Sbjct: 460 VDIPAGTHTLDVLVENTGRINYGAHLPDGRAGLVDPVLLDG--------KQLTG----WQ 507
Query: 558 VGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKG 617
+DD + GW++ + +++ T + D L++Q GKG
Sbjct: 508 TFPLPMDDP--------SKLTGWTTAKI---DGPAFHRGTLKIGTPAD-TFLDMQAFGKG 555
Query: 618 FAWVNGYNLGRYW 630
FAW NG+NLGR+W
Sbjct: 556 FAWANGHNLGRHW 568
>gi|71896501|ref|NP_001026163.1| beta-galactosidase precursor [Gallus gallus]
gi|53129216|emb|CAG31369.1| hypothetical protein RCJMB04_5i4 [Gallus gallus]
Length = 385
Score = 166 bits (420), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 110/327 (33%), Positives = 162/327 (49%), Gaps = 31/327 (9%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + +D DG +SGSIHY R W D + K K GL+AI+TYV WN HEP
Sbjct: 25 FGIDYDCNCFVKDGHPFRYISGSIHYSRVPRYYWKDRLLKMKMAGLNAIQTYVPWNYHEP 84
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
YDF+G+ DL F++ + GL VILR GPY+CAEW+ GG P WL I LR++
Sbjct: 85 QMGVYDFSGDRDLEYFLQLASETGLLVILRAGPYICAEWDMGGLPAWLLEKESI-VLRSS 143
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS-DYG---------- 195
+ ++ ++ + ++ + K + GGPII+ Q+ENEYG+ + DY
Sbjct: 144 DSDYLTAVEKWMGVL--LPKMKPHLYHNGGPIIMVQVENEYGSYFACDYDYLRSLLKIFR 201
Query: 196 -DAGKSYINWCAKMATSLDI---GVPWIMCQESDAPSPMFTP-------NNPNSPKIWTE 244
G + + A+ + + + AP T + P P + +E
Sbjct: 202 QHLGDEVVLFTTDGASQFHLKCGALQGLYATVDFAPGGNVTAAFLAQRSSEPTGPLVNSE 261
Query: 245 NWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYLT- 301
+TGW WG + +E +A + G N YM+ GGTNF +G PY++
Sbjct: 262 FYTGWLDHWGHRHIVVPSETIAKTLNEILARGANV-NLYMFIGGTNFAYWNGANMPYMSQ 320
Query: 302 -TSYDYDAPIDEYGHLNQPKWGHLREL 327
TSYDYDAP+ E G L + K+ LRE+
Sbjct: 321 PTSYDYDAPLSEAGDLTE-KYFALREV 346
>gi|115361550|gb|ABI95864.1| beta-galactosidase [Planococcus sp. L4]
Length = 552
Score = 166 bits (420), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 108/332 (32%), Positives = 163/332 (49%), Gaps = 44/332 (13%)
Query: 50 IHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQ 109
+HY R+ P W D ++K K GL+ +ETY+ WN HEP + Q+ F+G D+ FI+
Sbjct: 1 MHYFRTVPEQWEDRLQKLKALGLNTVETYIPWNFHEPKKGQFHFSGMADIEGFIELAHRL 60
Query: 110 GLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEK 169
GLYVILR PY+CAEW GG P WL + LR+++ F+ ++++ + + K K
Sbjct: 61 GLYVILRPAPYICAEWEMGGLPSWLMKDKNL-VLRSSDPAFLGHVEDYFAEL--LPKFTK 117
Query: 170 LFASQGGPIILAQIENEYGNVMSD--------------------YGDAGKSYINWCA--K 207
GGP+I QIENEYG +D + G +I +
Sbjct: 118 HLYQNGGPVIAMQIENEYGAYGNDSAYLDFFKAQYEHHGLNTFLFTSDGPDFITQGSMPD 177
Query: 208 MATSLDIGVPWIMCQESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAF 267
+ T+L+ G D P+SPK+ E W GWF W G+ R+ +D+A
Sbjct: 178 VTTTLNFG------SRVDESFQALDAFKPDSPKMVAEFWIGWFDYWSGEHTVRSGDDVA- 230
Query: 268 AVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYDYDAPIDEYGHLNQPKW 321
+V + N+YM+HGGTNFG +G + TSYDYD+ + E G + +
Sbjct: 231 SVFKEIMEKNISVNFYMFHGGTNFGFMNGANHYDIYYPTITSYDYDSLLTEGGAITEKYK 290
Query: 322 G---HLRELHKLLKSMEKTLT---YGNVTNTD 347
LRE ++ E++++ YG VT T+
Sbjct: 291 AVKEVLREYREVPADFEESVSAKAYGTVTLTE 322
>gi|156375241|ref|XP_001629990.1| predicted protein [Nematostella vectensis]
gi|156217002|gb|EDO37927.1| predicted protein [Nematostella vectensis]
Length = 578
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 106/301 (35%), Positives = 150/301 (49%), Gaps = 38/301 (12%)
Query: 57 PGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILR 116
P W D +KK K GL+ +ETYV WN HE ++ + F +D+++F+ Q+ GL+VI+R
Sbjct: 2 PEYWADRLKKLKAMGLNTVETYVAWNLHEQVKENFKFKDEVDIVKFVNLAQELGLHVIIR 61
Query: 117 IGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGG 176
GPY+C+EW+ GG P WL N P + LR+T FM ++ + + + A L S+GG
Sbjct: 62 PGPYICSEWDLGGLPSWLLNDPNM-RLRSTYGPFMEAVEKYFSKL--FALLTPLQFSRGG 118
Query: 177 PIILAQIENEYGNVMSDYGDAGKSYINWCAKMA----------TSLDIGVPWIMCQESDA 226
PII Q+ENEY +V + Y+ K+ TS D+G + D
Sbjct: 119 PIIAWQVENEYASVQE---EVDNHYMELLHKLMLKNGATELLFTSDDVGYTKRYPIKLDG 175
Query: 227 PSPM--------FTPNNPNSPKIWTENWTGWFKSWGGKDPK-RTAEDLAFAVARFFQFGG 277
M F P+ P + TE W+GWF WG K T + V G
Sbjct: 176 GKYMSFNKWFCLFLHFQPDKPIMVTEYWSGWFDHWGEKHHVLNTERKMINEVKDILDMGA 235
Query: 278 TFQNYYMYHGGTNFGRTSGGPYL-----------TTSYDYDAPIDEYGHLNQPKWGHLRE 326
+ N+YM+HGGTNFG +G TSYDYDAP+ E G + PK+ LR+
Sbjct: 236 SI-NFYMFHGGTNFGFMNGANTAGNRIDDGYQPDVTSYDYDAPLSEAGDIT-PKYKALRK 293
Query: 327 L 327
L
Sbjct: 294 L 294
Score = 44.7 bits (104), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 47/189 (24%), Positives = 71/189 (37%), Gaps = 45/189 (23%)
Query: 507 ISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDK 566
+ ++ +G N+G GI G VL+ G + +KW +Y LD
Sbjct: 419 LEIMVENMGRVNFGKGLHSQRKGILGQVLIDGH----------TQNKWK----VYPLDFH 464
Query: 567 KFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNL 626
K + +A E WS + +Y+ + ++ +G GKG VNG NL
Sbjct: 465 KTFTERAF-LEVSWSKPTEGASFSPGFYRGILHIQGQPRDSFVHPKGWGKGVCLVNGKNL 523
Query: 627 GRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
GRYW GP Q ++P SW++ G NT++LFE
Sbjct: 524 GRYWKL----------------GP--------------QEALYLPASWLRSGDNTIILFE 553
Query: 687 EFGGNPSQI 695
G I
Sbjct: 554 VDGAKDDGI 562
>gi|251799202|ref|YP_003013933.1| beta-galactosidase [Paenibacillus sp. JDR-2]
gi|247546828|gb|ACT03847.1| Beta-galactosidase [Paenibacillus sp. JDR-2]
Length = 604
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 113/343 (32%), Positives = 160/343 (46%), Gaps = 61/343 (17%)
Query: 28 RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPL 87
R++ + +DGE +LSG+IHY R P W D + K K G + +ETY+ WN HEP
Sbjct: 3 RLTWKDQKYRLDGEEFRILSGAIHYFRVVPEYWEDRLLKLKACGFNTVETYIPWNLHEPR 62
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
+ F G D+ RFI+T GL+VI+R PY+CAEW +GG P WL + LR +
Sbjct: 63 EGSFRFDGFADVARFIETAGRLGLHVIVRPSPYICAEWEFGGLPAWL--LKSSMGLRCMD 120
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
++ ++ + ++ + L S+GGPII Q+ENEYG SY N A
Sbjct: 121 NEYLEKVDRYYDELI--PRLLPLLDSRGGPIIAVQVENEYG-----------SYGNDTAY 167
Query: 208 MATSLD----IGVPWIMCQESDAPS--------------------------PMFTPNNPN 237
+A D GV ++ SD P+ + +
Sbjct: 168 LAYLRDGLIRRGVDCLLFT-SDGPTDEMLLGGTVEGLHATVNFGSRVAESLAKYREYRQD 226
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
P + E W GWF W R A D+A + + G + N YM+HGGTNFG SG
Sbjct: 227 EPLMVMEYWLGWFDHWRKPHHVREAGDVANVLDEMLEQGASV-NLYMFHGGTNFGFYSGA 285
Query: 298 PY------LTTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSM 334
Y TSYDYDAP+ E WG + E +K ++S+
Sbjct: 286 NYGEHYEPTITSYDYDAPLTE--------WGDITEKYKAIRSV 320
>gi|24418925|ref|NP_722498.1| beta-galactosidase-1-like protein 2 [Mus musculus]
gi|23512349|gb|AAH38479.1| Galactosidase, beta 1-like 2 [Mus musculus]
gi|148693361|gb|EDL25308.1| cDNA sequence BC038479, isoform CRA_b [Mus musculus]
Length = 652
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 113/314 (35%), Positives = 155/314 (49%), Gaps = 43/314 (13%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+L GSIHY R W D + K K GL+ + TYV WN HEP R ++DF+GNLDL FI+
Sbjct: 79 ILGGSIHYFRVPREYWRDRLLKLKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFIQ 138
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
GL+VILR GPY+C+E + GG P WL P + +LRTT F + + + M
Sbjct: 139 LAAKIGLWVILRPGPYICSEIDLGGLPSWLLQDPDM-KLRTTYHGFTKAVDLYFDHL--M 195
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMC--- 221
++ L GGPII Q+ENEYG+ D ++Y+ + K D G+ ++
Sbjct: 196 SRVVPLQYKHGGPIIAVQVENEYGSYNKD-----RAYMPYIKKALE--DRGIIEMLLTSD 248
Query: 222 ----------------------QESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGKDPK 259
QE A + + PK+ E WTGWF SWGG
Sbjct: 249 NKDGLEKGVVDGVLATINLQSQQELMALNTVLLSIQGIQPKMVMEYWTGWFDSWGGSHNI 308
Query: 260 RTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYDYDAPIDEY 313
+ ++ V+ + G + N YM+HGGTNFG +G + TSYDYDA + E
Sbjct: 309 LDSSEVLQTVSAIIKDGSSI-NLYMFHGGTNFGFINGAMHFNDYKADVTSYDYDAILTEA 367
Query: 314 GHLNQPKWGHLREL 327
G K+ LREL
Sbjct: 368 GDYT-AKYTKLREL 380
>gi|76636681|ref|XP_597358.2| PREDICTED: galactosidase, beta 1-like 2 [Bos taurus]
gi|297483828|ref|XP_002693892.1| PREDICTED: galactosidase, beta 1-like 2 [Bos taurus]
gi|296479483|tpg|DAA21598.1| TPA: galactosidase, beta 1-like [Bos taurus]
Length = 758
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 113/325 (34%), Positives = 155/325 (47%), Gaps = 39/325 (12%)
Query: 32 DGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQY 91
DG+ ++ + GS+HY R W D + K + GL+ + TYV WN HEP R +
Sbjct: 172 DGQNFKLENSAFWIFGGSVHYFRVPRAYWRDRLLKLRACGLNTLTTYVPWNLHEPERGTF 231
Query: 92 DFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFM 151
DF+GNLDL FI + GL+VILR GPY+C+E + GG P WL P + LRTT K F
Sbjct: 232 DFSGNLDLEAFILLAAEVGLWVILRPGPYICSEVDLGGLPSWLLRDPDMR-LRTTYKGFT 290
Query: 152 NEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATS 211
+ + + M + L GGPII Q+ENEYG+ D +Y+ + K
Sbjct: 291 EAVDLYFDHL--MLRVVPLQYKHGGPIIAVQVENEYGSYNKD-----PAYMPYIKKALQD 343
Query: 212 LDI-------------------GVPWIMCQESDAPSPMFTP----NNPNSPKIWTENWTG 248
I GV + +S + +FT + PK+ E WTG
Sbjct: 344 RGIAELLLTSDNQGGLKSGVLDGVLATINLQSQSELQLFTTILLGAQGSQPKMVMEYWTG 403
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TT 302
WF SWGG + ++ V+ + G + N YM+HGGTNFG G + T
Sbjct: 404 WFDSWGGPHYILDSSEVLNTVSAIVKAGSSI-NLYMFHGGTNFGFIGGAMHFQDYKPDVT 462
Query: 303 SYDYDAPIDEYGHLNQPKWGHLREL 327
SYDYDA + E G K+ LRE
Sbjct: 463 SYDYDAVLTEAGDYTA-KYTKLREF 486
>gi|194857009|ref|XP_001968877.1| GG24263 [Drosophila erecta]
gi|190660744|gb|EDV57936.1| GG24263 [Drosophila erecta]
Length = 672
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 110/366 (30%), Positives = 172/366 (46%), Gaps = 57/366 (15%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + H +DG+ +SGS HY R+ P W ++ + GL+A++TYV W+ H P
Sbjct: 46 FTIDHAANTFMLDGQPFRYVSGSFHYFRAVPESWRSRLRTMRASGLNALDTYVEWSLHNP 105
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLH-NMPGIEELRT 145
+Y++ G D+++F++ Q + Y+ILR GPY+CAE + GG P WL P I ++RT
Sbjct: 106 HDGEYNWEGIADVVKFLEIAQQEDFYIILRPGPYICAERDNGGLPHWLFAKYPSI-KMRT 164
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW- 204
+ ++ E+ + + M + + LF GG II+ Q+ENEYG+ D+ Y+NW
Sbjct: 165 NDPNYIAEVGKWYAEL--MPRLQHLFVGNGGKIIMVQVENEYGDYACDH-----DYLNWL 217
Query: 205 -------------------------CAKMA---TSLDIGVPWIMCQESDAPSPMFTPNNP 236
C K+ + D G+ I E D M P
Sbjct: 218 RDETEKYVTGKALLFTVDIPNEKMSCGKIENVFATTDFGIDRI--NEIDQIWAMLRTLQP 275
Query: 237 NSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG 296
P + +E + GW W ++ +R +++A A+ + + N YM+ GGTNFG T+G
Sbjct: 276 TGPLVNSEFYPGWLTHWQEQNQRRDGQEVANALRTILSYNASV-NLYMFFGGTNFGFTAG 334
Query: 297 GPY----------LTTSYDYDAPIDEYG------HLNQPKWGHLRELHKLLKSMEKTLTY 340
Y TSYDYDA +DE G +L + G L + + K L Y
Sbjct: 335 ANYNLDGGIGYAADITSYDYDAVMDEAGGVTTKYNLVKAVIGEFLPLPDITLNPAKRLAY 394
Query: 341 GNVTNT 346
G V T
Sbjct: 395 GRVELT 400
Score = 43.9 bits (102), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 36/78 (46%), Gaps = 29/78 (37%)
Query: 609 LNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWY 668
LN+ G GKG A+VNG+NLGRYWP GP Q+
Sbjct: 590 LNMAGWGKGVAYVNGFNLGRYWPV---------------AGP--------------QVTL 620
Query: 669 HVPRSWIKDGVNTLVLFE 686
+VP ++ G N+LV+ E
Sbjct: 621 YVPNEILQVGENSLVILE 638
>gi|222152241|ref|YP_002561416.1| beta-galactosidase [Streptococcus uberis 0140J]
gi|222113052|emb|CAR40398.1| putative beta-galactosidase precursor [Streptococcus uberis 0140J]
Length = 594
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 177/701 (25%), Positives = 275/701 (39%), Gaps = 173/701 (24%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +LSGSIHY R P W + K G + +ETYV WN HEP + + F G
Sbjct: 12 LDGKPFKILSGSIHYFRVAPEAWYRSLYNLKALGFNTVETYVPWNLHEPQKGNFHFDGLA 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL F+ Q+ GLY I+R PY+CAEW +GG P WL N P +R+ + ++ ++++
Sbjct: 72 DLEGFLDLAQELGLYAIVRPSPYICAEWEFGGLPGWLLNEP--IRVRSRDPKYLKHVKDY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+++ K +L GG I++ Q+ENEYG+ D K Y+ M L + P
Sbjct: 130 YDVLMPKLVKRQL--ENGGNILMFQVENEYGSYGED-----KDYLRELMTMMRQLGVTAP 182
Query: 218 WIMCQESDAP--------------------------------SPMFTPNNPNSPKIWTEN 245
SD P F NN P + E
Sbjct: 183 LFT---SDGPWHATLRSGSLIEDDVLVTGNFGSKAKINFESMKAFFKENNKKWPLMCMEF 239
Query: 246 WTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL----- 300
W GWF W +R ++ A+ + G N YM+HGGTNFG +G
Sbjct: 240 WIGWFNRWKEPIIRRDPKETIDAIMEVLEEGSI--NLYMFHGGTNFGFMNGASARLQQDL 297
Query: 301 --TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYN 358
TSYDYDA +DE G+ PK+ L+E +L K N N + +
Sbjct: 298 PQVTSYDYDAILDEAGN-PTPKYFLLQE--RLQK---------NFPNLHFDKPLENK--- 342
Query: 359 LPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGK 418
+++I TE+ N E ++ +
Sbjct: 343 ----TIAIKGIALTEKVNLV-----------------------------ETLDSISTLTE 369
Query: 419 GHFALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGN 478
+ +N +S N + Y+ Y T + ++ LR+ + Y+N
Sbjct: 370 AFYPVNM----ESLNQTTGYILYRTY--------LPKDNARERLRLIDARDRAKVYLNNR 417
Query: 479 YVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVG 538
+++Q+ ++ ND+ ++ NQ+ +L +G +YG K P G +G
Sbjct: 418 LIETQY-QFEIGNDII---IEQETENNQLDILIENMGRVSYGHKL-TAPTQSKG----IG 468
Query: 539 RAGDETIIKDLS-SHKW-TYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKT 596
R ++ DL W Y + L ++ F G + +P ++Y
Sbjct: 469 RG----LMADLHFVGNWQQYPLPLESIEKVDF---------SGSWQEGLP-----SFYAY 510
Query: 597 TFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKC 656
F D L+L GKG A++N +LGR+W GP+ S
Sbjct: 511 DFVCDQMGD-TYLDLSQFGKGVAYINNNHLGRFWNV----------------GPHLS--- 550
Query: 657 AYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINF 697
+VP S++K G N LV+FE G I F
Sbjct: 551 -----------LYVPESFLKLGKNRLVIFETEGQMTPSIQF 580
>gi|62471477|gb|AAH93575.1| LOC443705 protein, partial [Xenopus laevis]
Length = 439
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 106/323 (32%), Positives = 157/323 (48%), Gaps = 40/323 (12%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
S ++ + ++ DG+ +SGSIHY R W D + K GL+A++ Y+ WN
Sbjct: 68 SKSFSIDYNKNCFRKDGQCFRYISGSIHYFRIPADYWRDRLLKMYMTGLNAVQVYIPWNF 127
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
HEPL YDF G+ DL RF+ + GL VI+R GPY+CAEW+ GG P WL N I L
Sbjct: 128 HEPLPGLYDFNGDRDLSRFLDLTDELGLLVIIRPGPYICAEWDMGGLPAWLLNNKDI-AL 186
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS-DYG------- 195
RT++ ++N + ++ +++ + K S GG II Q+ENEYG+ M+ DY
Sbjct: 187 RTSDPDYLNAVDSWFSVL--LPKLRSRLYSNGGNIISVQVENEYGSFMACDYSYLRHLLH 244
Query: 196 ---------------DAGKSYINWCAKMA---TSLDIGVPWIMCQESDAPSPMFTPNNPN 237
D C + T++D G + + P
Sbjct: 245 LFRLYLGDEVVLFTTDGNTERELQCGSLQDLYTTVDFGPG----DNATKAFKLLRKYQPK 300
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
P + +E +TGW WG K + E ++ + + G + N YM+ GGTNFG +G
Sbjct: 301 GPLVNSEYYTGWLDYWGEKHSTTSKELVSQGLKNILEMGASV-NMYMFEGGTNFGYWNGA 359
Query: 298 PY------LTTSYDYDAPIDEYG 314
+ +TTSYDYDAP+ E G
Sbjct: 360 DFKKIYKPITTSYDYDAPLSEAG 382
>gi|423217397|ref|ZP_17203893.1| hypothetical protein HMPREF1061_00666 [Bacteroides caccae
CL03T12C61]
gi|392628556|gb|EIY22582.1| hypothetical protein HMPREF1061_00666 [Bacteroides caccae
CL03T12C61]
Length = 775
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 115/350 (32%), Positives = 168/350 (48%), Gaps = 48/350 (13%)
Query: 11 ILLCLILQTLFNLSLAYR--VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAK 68
I+L LI+ + + R V + I+G+ L+ G +HYPR W D + +A
Sbjct: 10 IMLNLIVSFFISACSSPREQVKIENGTFNINGKDVQLICGEMHYPRIPHEYWRDRLHRAH 69
Query: 69 EGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYG 128
GL+ + YVFWN HE +DF+G D+ F++ Q++GLYVILR GPYVCAEW++G
Sbjct: 70 AMGLNTVSAYVFWNFHERQPGVFDFSGQADIAEFVRIAQEEGLYVILRPGPYVCAEWDFG 129
Query: 129 GFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKK-EKLFASQGGPIILAQIENEY 187
G+P WL + R+ + FM+ + + I ++ K+ L + GG II+ Q+ENEY
Sbjct: 130 GYPSWLLKEKDL-TYRSKDPRFMSYCERY---IKELGKQLAPLTINNGGNIIMVQVENEY 185
Query: 188 GNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMC------QESDAPSPMFTPN------- 234
G+ +D K Y+ M VP C + + T N
Sbjct: 186 GSYAAD-----KEYLAAIRDMLQEAGFNVPLFTCDGGGQVEAGHIAGALPTLNGVFGEDI 240
Query: 235 -------NPNSPKIWTENWTGWFKSWGGKDP----KRTAEDLAFAVARFFQFGGTFQNYY 283
+P P E + WF WG + +R AE L + + G + Y
Sbjct: 241 FKIVDKYHPGGPYFVAEFYPAWFDEWGKRHSSVAYERPAEQLDWMLGH-----GVSVSMY 295
Query: 284 MYHGGTNF-----GRTSGGPY-LTTSYDYDAPIDEYGHLNQPKWGHLREL 327
M+HGGTNF TSGG TSYDYDAP+ E+G+ PK+ RE+
Sbjct: 296 MFHGGTNFWYMNGANTSGGFRPQPTSYDYDAPLGEWGNC-YPKYHAFREI 344
>gi|257865837|ref|ZP_05645490.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC30]
gi|257872172|ref|ZP_05651825.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC10]
gi|257799771|gb|EEV28823.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC30]
gi|257806336|gb|EEV35158.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC10]
Length = 585
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 107/312 (34%), Positives = 153/312 (49%), Gaps = 48/312 (15%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+D + ++SG+IHY R P W D ++K + G + +ETYV WN HE Y F G L
Sbjct: 12 LDNKPLKVISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFDGIL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL RFI+T Q+ GLYVILR PY+CAEW +GG P WL P + +LR FM ++ +
Sbjct: 72 DLRRFIQTAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDP-MMKLRFDYPPFMEKITRY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSL----- 212
+ + ++ +QGGPII+ Q+ENEYG+ +D K Y+ KM ++
Sbjct: 131 FAHLFPQVRDLQI--TQGGPIIMMQVENEYGSYAND-----KEYLR---KMVAAMRQHGV 180
Query: 213 ---------------------DIGVPWIMCQES--DAPSPMFTPNNPNSPKIWTENWTGW 249
D+ +P I C + + + + P + E W GW
Sbjct: 181 ETPLVTSDGPWHDMLENGSIKDLALPTINCGSNIKENFEKLRRFHGEKRPLMVMEFWIGW 240
Query: 250 FKSWGGKDPKRTA-EDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TT 302
F +WG T+ +D + G N YM+HGGTNFG +G Y T
Sbjct: 241 FDAWGDDQHHTTSTQDAVKELQDCLALGSV--NIYMFHGGTNFGFMNGSNYYERLAPDVT 298
Query: 303 SYDYDAPIDEYG 314
SYDYDA + E+G
Sbjct: 299 SYDYDALLTEWG 310
>gi|198475912|ref|XP_002132214.1| GA25341 [Drosophila pseudoobscura pseudoobscura]
gi|198137462|gb|EDY69616.1| GA25341 [Drosophila pseudoobscura pseudoobscura]
Length = 672
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 108/365 (29%), Positives = 171/365 (46%), Gaps = 55/365 (15%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + H+ + ++GE ++GS HY R+ P W ++ + GL+A++TYV W+ H P
Sbjct: 47 FTIDHESNSFVLNGEPFRYVAGSFHYFRAVPEAWRSRLRTMRASGLNAVDTYVEWSLHNP 106
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
Y++ G D+++F++ Q++ Y+ILR GPY+CAE + GG P WL ++RT+
Sbjct: 107 HDGVYNWEGIADVVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFKKYPSIKMRTS 166
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW-- 204
+ +M E+ + + M + + L GG II+ Q+ENEYG+ D K Y+NW
Sbjct: 167 DSNYMAEVGKWYAEL--MPRLQHLLIGNGGKIIMVQVENEYGDYECD-----KDYLNWLR 219
Query: 205 ------------------------CAKMA---TSLDIGVPWIMCQESDAPSPMFTPNNPN 237
C K+ + D G+ I E D M P
Sbjct: 220 DETEKYVNGNALLFTTDIPNERMSCGKIDNVFATTDFGIDRI--HEIDDIWAMLRKLQPT 277
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
P + +E + GW W + +R + +A A+ + + N YM+ GGTNFG T+G
Sbjct: 278 GPLVNSEFYPGWLTHWQEMNQRRDGQVVADALKTILSYNASV-NLYMFFGGTNFGFTAGA 336
Query: 298 PY----------LTTSYDYDAPIDEYG------HLNQPKWGHLRELHKLLKSMEKTLTYG 341
Y TSYDYDA +DE G +L + G L ++ + K L YG
Sbjct: 337 NYNLDGGVGYAADITSYDYDAVMDEAGGVTSKYNLVKQVIGEFLPLPEITLNPAKRLAYG 396
Query: 342 NVTNT 346
V T
Sbjct: 397 KVEVT 401
Score = 45.1 bits (105), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 36/78 (46%), Gaps = 29/78 (37%)
Query: 609 LNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWY 668
LN+ G GKG A+VNG+NLGRYWP GP Q+
Sbjct: 590 LNMAGWGKGVAYVNGFNLGRYWPV---------------AGP--------------QVTL 620
Query: 669 HVPRSWIKDGVNTLVLFE 686
+VP +K G N+LV+ E
Sbjct: 621 YVPNEILKVGDNSLVILE 638
>gi|301617189|ref|XP_002938028.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
2-like [Xenopus (Silurana) tropicalis]
Length = 620
Score = 166 bits (419), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 112/311 (36%), Positives = 148/311 (47%), Gaps = 39/311 (12%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+L GS+HY R W D +KK K G++ + TYV WN HEP + YDF LD+ F+
Sbjct: 46 ILGGSMHYFRVPTAYWRDRMKKMKACGINTLTTYVPWNLHEPGKGTYDFNNGLDISEFLA 105
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQN-FTTLIVD 163
+ GL+VILR GPY+CAEW+ GG P WL + +LRTT F + + F LI
Sbjct: 106 VAGEMGLWVILRPGPYICAEWDLGGLPSWLLRDKDM-KLRTTYPGFTEAVDDYFNELIPR 164
Query: 164 MAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQE 223
+AK + S GGPII Q+ENEYG+ D +Y+ + I +
Sbjct: 165 VAKYQ---YSNGGPIIAVQVENEYGSYAKD-----ANYMEFIKNALIERGIVELLLTSDN 216
Query: 224 SDAPS------------------PMFTPNN---PNSPKIWTENWTGWFKSWGGKDPKRTA 262
D S +F+ N P P + E WTGWF WGG
Sbjct: 217 KDGISYGSLEGVLATVNFQKIEPVLFSYLNSIQPKKPIMVMEFWTGWFDYWGGDHHLFDV 276
Query: 263 EDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYDYDAPIDEYGHL 316
E + ++ G N YM+HGGTNFG SG + TSYDYDAP+ E G
Sbjct: 277 ESMMSTISEVLNRGANI-NLYMFHGGTNFGFMSGALHFHEYRPDITSYDYDAPLTEAGDY 335
Query: 317 NQPKWGHLREL 327
K+ +REL
Sbjct: 336 TS-KFFKIREL 345
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 50/202 (24%), Positives = 73/202 (36%), Gaps = 55/202 (27%)
Query: 506 QISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDD 565
++S+L G NYG D GI G V L + +K+ +Y LD
Sbjct: 464 KLSILVENCGRVNYGPMIDNQRKGIVGDVYL-----RDNPLKNFK---------IYSLDM 509
Query: 566 KKFYNAKAANSERGWSS----KNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWV 621
+ + +E WS K+ P T+Y+ L LQG KG ++
Sbjct: 510 NSTFMNRI--NEVHWSDLSECKSGP-----TFYQGALHVGPTPMDTFLRLQGWKKGVVFI 562
Query: 622 NGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNT 681
NG NLGRYW GP Q +P W+ GVN
Sbjct: 563 NGKNLGRYWDI----------------GP--------------QETLFIPAPWLWPGVNE 592
Query: 682 LVLFEEFGGNPSQINFQTVVVG 703
+ +FEE+ + T ++G
Sbjct: 593 ITIFEEYAAGLTLFTLDTPILG 614
>gi|296399387|gb|ADH10509.1| galactosidase, beta 1, 5 prime [Zonotrichia albicollis]
Length = 571
Score = 166 bits (419), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 110/328 (33%), Positives = 161/328 (49%), Gaps = 31/328 (9%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
++ + ++ + DG+ +SGSIHY R W D + K K GLDAI+TYV WN HE
Sbjct: 6 SFGIDYESNSFVKDGKPFRYISGSIHYSRVPSYYWKDRLLKMKMAGLDAIQTYVPWNYHE 65
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P YDF G DL F++ D GL VILR GPY+CAEW+ GG P WL I LR+
Sbjct: 66 PRMGTYDFFGGKDLEYFLQLANDTGLLVILRAGPYICAEWDMGGLPAWLLEKKSI-VLRS 124
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS---DYGDA----- 197
++ ++ ++ + ++ + K GGPII+ Q+ENEYG+ + DY
Sbjct: 125 SDSDYLEAVERWMGVL--LPKMRPYLYQNGGPIIMVQVENEYGSYFACDYDYLRFLLKLF 182
Query: 198 ----GKSYINWCAKMATSLDI---GVPWIMCQESDAPSPMFTP-------NNPNSPKIWT 243
G + + A+ + + + AP T + P P + +
Sbjct: 183 RLHLGDEVVLFTTDGASQFHLKCGALQGLYATVDFAPGGNVTAAFLAQRSSEPMGPLVNS 242
Query: 244 ENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYL- 300
E +TGW WG + AE +A + G N YM+ GGTNF +G PY+
Sbjct: 243 EFYTGWLDHWGHRHSVVPAETVAKTLNEILARGANV-NLYMFIGGTNFAYWNGANMPYMP 301
Query: 301 -TTSYDYDAPIDEYGHLNQPKWGHLREL 327
TSYDYDAP+ E G L + K+ +R++
Sbjct: 302 QPTSYDYDAPLSEAGDLTE-KYFTIRKV 328
>gi|168019162|ref|XP_001762114.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686831|gb|EDQ73218.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 652
Score = 166 bits (419), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 126/389 (32%), Positives = 181/389 (46%), Gaps = 57/389 (14%)
Query: 10 AILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKE 69
+LLCL + S + + DG ++ G +HY R P +W D + +AK
Sbjct: 4 VLLLCLWISVGAQSSSKHSFVIENNLFLKDGVPFRIIGGDLHYFRVHPLLWEDRLLRAKA 63
Query: 70 GGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGG 129
GL+AI+TYV WN HEP +F G+ DL+ F+K Q+ VILRIGPY+CAEW+ GG
Sbjct: 64 LGLNAIQTYVPWNLHEPRPGLLNFNGSADLLSFLKLAQELDFLVILRIGPYICAEWDLGG 123
Query: 130 FPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGN 189
P WL + LR+++ +++++ N+ ++ E LF S GG +I+ QIENEYG+
Sbjct: 124 LPAWLLELKPSVRLRSSDASYLSQVDNWWKELLPKIAPE-LF-SAGGSVIMVQIENEYGS 181
Query: 190 VMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDA----------------------- 226
D K Y+ + K S +G I+ A
Sbjct: 182 FGID-----KLYLQFLQKQVRS-HLGNDIIIYTTDGAVEENLSYGSLSDDGVFAAIDFPT 235
Query: 227 ---PSPMFTP----NNPN-SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGT 278
P+ F N+P SP E +TGW WG + + A+ A A+ + +
Sbjct: 236 GWDPAAAFALQKRFNSPGMSPPFSAEFYTGWLTHWGERLAQTDAKSTAVALDDILRLNAS 295
Query: 279 FQNYYMYHGGTNFGRTSG-----GPY----LTTSYDYDAPIDEYGHLNQPKWGHLRE-LH 328
YM HGGTNFG SG GP TSYDYDAPI E G + K+ +R+ L
Sbjct: 296 VV-LYMVHGGTNFGFWSGANTGAGPSDFQPDITSYDYDAPIGEAGDVTGNKYQEIRKVLS 354
Query: 329 KLL-------KSMEKTLTYGNVTNTDYGN 350
K + + + YG VT T G+
Sbjct: 355 KYVGRELPDPPPLPQRTAYGEVTMTKMGS 383
>gi|395846590|ref|XP_003795986.1| PREDICTED: beta-galactosidase-1-like protein 3 [Otolemur garnettii]
Length = 681
Score = 166 bits (419), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 109/322 (33%), Positives = 160/322 (49%), Gaps = 43/322 (13%)
Query: 36 ITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTG 95
T++G + ++ GSIHY R W D + K K G + + TYV WN HEP R ++DF+
Sbjct: 108 FTLEGHKFLIFGGSIHYFRVPREYWQDRLLKLKACGFNTVTTYVPWNLHEPQRGKFDFSE 167
Query: 96 NLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQ 155
NLDL F+ + GL+VILR GPY+C+E + GG P WL P + +LRTT+ F+ +
Sbjct: 168 NLDLEAFVLLAAEIGLWVILRPGPYICSEIDLGGLPSWLLQDPEL-KLRTTSPGFLEAVD 226
Query: 156 NFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIG 215
+ ++ + L SQGGP+I Q+ENEYG D Y+ + K T L G
Sbjct: 227 KYFDHLI--PRVIPLQYSQGGPVIALQVENEYGAYAQDV-----KYMPYLHK--TLLQRG 277
Query: 216 VPWIMCQ------------------------ESDAPSPMFTPNNPNSPKIWTENWTGWFK 251
+ ++ +A S ++ P + E W GWF
Sbjct: 278 IVELLLTSDGEKEVLKGHIKGVLATVNLKKLRKNAFSQLYEVQR-GKPLLIMEFWVGWFD 336
Query: 252 SWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYD 305
WG A++L + V++ + +F N YM+HGGTNFG +G Y+ TSYD
Sbjct: 337 RWGESHHITNADNLEYNVSKLIKHEISF-NLYMFHGGTNFGFMNGASYMGRHVSVVTSYD 395
Query: 306 YDAPIDEYGHLNQPKWGHLREL 327
YDA + E G + K+ LR+L
Sbjct: 396 YDAVLTEAGDYTE-KYFKLRKL 416
>gi|319945941|ref|ZP_08020191.1| beta-galactosidase [Streptococcus australis ATCC 700641]
gi|417919516|ref|ZP_12563047.1| glycosyl hydrolase family 35 [Streptococcus australis ATCC 700641]
gi|319748006|gb|EFW00250.1| beta-galactosidase [Streptococcus australis ATCC 700641]
gi|342832897|gb|EGU67186.1| glycosyl hydrolase family 35 [Streptococcus australis ATCC 700641]
Length = 595
Score = 166 bits (419), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 109/324 (33%), Positives = 153/324 (47%), Gaps = 54/324 (16%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+LSG+IHY R W + K G + +ETYV WNAHEP R + F GNLDL FI+
Sbjct: 19 ILSGAIHYFRIDREDWYHSLYNLKALGFNTVETYVPWNAHEPQRGHFHFEGNLDLEHFIQ 78
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
Q+ LYVILR P++C+EW +GG P WL + +R+++ F+ E+ + ++
Sbjct: 79 VAQELDLYVILRPSPFICSEWEFGGLPAWL--IEKDLRIRSSDPAFLEEVARYYDELLPR 136
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQES 224
K +L +GG I++ Q+ENEYG+ D K+Y+ + DI P S
Sbjct: 137 VAKYQL--DRGGNILMMQVENEYGSYGED-----KAYLRAIRDLMIERDITCPLFT---S 186
Query: 225 DAP--------------------------------SPMFTPNNPNSPKIWTENWTGWFKS 252
D P F ++ P + E W GWF
Sbjct: 187 DGPWRATLRAGTLIEDGLFVTGNFGSRANYNFSQMKEFFAEHDRKWPLMCMEFWDGWFNR 246
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-------TTSYD 305
W KR E+LA AV Q G N YM+HGGTNFG +G TSYD
Sbjct: 247 WKEPIIKRDPEELAEAVHEVLQEGSI--NLYMFHGGTNFGFMNGCSARGTVDLPQVTSYD 304
Query: 306 YDAPIDEYGHLNQPKWGHLRELHK 329
YDA +DE G+ PK+ ++++ K
Sbjct: 305 YDALLDEQGN-PTPKYDAVKKMMK 327
>gi|422700666|ref|ZP_16758509.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
gi|315170851|gb|EFU14868.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
Length = 593
Score = 166 bits (419), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 106/313 (33%), Positives = 152/313 (48%), Gaps = 46/313 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 13 LNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGMK 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
++ F++ + L VILR Y+CAEW +GG P WL G+ LR+T+ +FM +++N+
Sbjct: 73 NIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGV-RLRSTDPIFMTKVRNY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K + +QGGP+I+ Q+ENEYG+ YG K+Y+ ++ L I VP
Sbjct: 132 FQVL--LPKLAPMQITQGGPVIMMQVENEYGS----YG-MEKAYLQQTKQIMEELGIEVP 184
Query: 218 WIM----------------------------CQESDAPSPMF-TPNNPNSPKIWTENWTG 248
+E+ A F T + P + E W G
Sbjct: 185 LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYWDG 244
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF WG +R DLA V G N YM+HGGTNFG R +
Sbjct: 245 WFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLPQV 302
Query: 302 TSYDYDAPIDEYG 314
TSYDYDA + E G
Sbjct: 303 TSYDYDALLTEAG 315
>gi|296399420|gb|ADH10537.1| galactosidase, beta 1, 5 prime [Zonotrichia albicollis]
Length = 571
Score = 166 bits (419), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 110/328 (33%), Positives = 161/328 (49%), Gaps = 31/328 (9%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
++ + ++ + DG+ +SGSIHY R W D + K K GLDAI+TYV WN HE
Sbjct: 6 SFGIDYESNSFVKDGKPFRYISGSIHYSRVPSYYWKDRLLKMKMAGLDAIQTYVPWNYHE 65
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
P YDF G DL F++ D GL VILR GPY+CAEW+ GG P WL I LR+
Sbjct: 66 PRMGTYDFFGGKDLEYFLQLANDTGLLVILRAGPYICAEWDMGGLPAWLLEKKSI-VLRS 124
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS---DYGDA----- 197
++ ++ ++ + ++ + K GGPII+ Q+ENEYG+ + DY
Sbjct: 125 SDSDYLEAVERWMGVL--LPKMRPYLYQNGGPIIMVQVENEYGSYFACDYDYLRFLLKLF 182
Query: 198 ----GKSYINWCAKMATSLDI---GVPWIMCQESDAPSPMFTP-------NNPNSPKIWT 243
G + + A+ + + + AP T + P P + +
Sbjct: 183 RLHLGHEVVLFTTDGASQFHLKCGALQGLYATVDFAPGGNVTAAFLAQRSSEPMGPLVNS 242
Query: 244 ENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYL- 300
E +TGW WG + AE +A + G N YM+ GGTNF +G PY+
Sbjct: 243 EFYTGWLDHWGHRHSVVPAETVAKTLNEILARGANV-NLYMFIGGTNFAYWNGANMPYMP 301
Query: 301 -TTSYDYDAPIDEYGHLNQPKWGHLREL 327
TSYDYDAP+ E G L + K+ +R++
Sbjct: 302 QPTSYDYDAPLSEAGDLTE-KYFTIRKV 328
>gi|329927841|ref|ZP_08281902.1| beta-galactosidase [Paenibacillus sp. HGF5]
gi|328938242|gb|EGG34637.1| beta-galactosidase [Paenibacillus sp. HGF5]
Length = 619
Score = 166 bits (419), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 101/309 (32%), Positives = 158/309 (51%), Gaps = 44/309 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ ++SG+IHY R P W D + K K G + +ETY+ WN HEP ++ F+G
Sbjct: 13 LDGQPYRIISGAIHYFRVVPEYWEDRLLKLKACGFNTVETYIAWNVHEPQEGKFSFSGMA 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+ FI+ GL+VI+R P++CAEW +GG P WL I LR ++ ++++++ ++
Sbjct: 73 DVASFIELAGKLGLHVIVRPSPFICAEWEFGGLPGWLLGYGEI-RLRCSDPLYLSKVDHY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + L +S GGPI+ Q+ENEYG+ +D+ +Y+++ I V
Sbjct: 132 YDELI--PRLVPLLSSNGGPILAVQVENEYGSYGNDH-----AYLDYLRAGLVRRGIDV- 183
Query: 218 WIMCQESDAPSPMF----TPNNPNS----------------------PKIWTENWTGWFK 251
+ SD P+ T N+ ++ P + E W GWF
Sbjct: 184 --LLFTSDGPTDEMLLGGTLNDVHATVNFGSRVEESFRKYREYRTEEPLMVMEFWNGWFD 241
Query: 252 SWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYD 305
W R A D+A + + G + N YM+HGGTNFG SG ++ TTSYD
Sbjct: 242 HWMEDHHVRDAADVAGVLDEMLEKGSSM-NMYMFHGGTNFGFYSGANHIQTYEPTTTSYD 300
Query: 306 YDAPIDEYG 314
YDAP+ E+G
Sbjct: 301 YDAPLTEWG 309
Score = 40.0 bits (92), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 39/97 (40%), Gaps = 31/97 (31%)
Query: 593 WYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYG 652
+Y+ FE D L G KG AW+NG+NLGRYW
Sbjct: 519 FYRGCFEVEEIGD-TFLRFDGWTKGVAWINGFNLGRYW---------------------- 555
Query: 653 SDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
N G P + Y +P ++ G N LVLFE G
Sbjct: 556 ------NAG-PQKALY-IPGPLLRKGENELVLFELHG 584
>gi|357050580|ref|ZP_09111778.1| hypothetical protein HMPREF9478_01761 [Enterococcus saccharolyticus
30_1]
gi|355381233|gb|EHG28360.1| hypothetical protein HMPREF9478_01761 [Enterococcus saccharolyticus
30_1]
Length = 593
Score = 166 bits (419), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 111/325 (34%), Positives = 152/325 (46%), Gaps = 51/325 (15%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
LLSG+IHY R P W + K G + +ETYV WN HEP + + F G LDL F+
Sbjct: 19 LLSGAIHYFRVHPDDWRHSLYNLKALGFNTVETYVPWNLHEPHKGLFQFEGILDLEHFLS 78
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
Q+ GLYVILR PY+CAEW +GG P WL G LR + ++ + + +++
Sbjct: 79 LAQELGLYVILRPSPYICAEWEFGGLPAWLLKESG--RLRACDPSYLAHVAEYYDVLLPK 136
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQ-- 222
+L S GG I++ Q+ENEYG+ YG+ K+Y+ +M + I +P
Sbjct: 137 IIPYQL--SHGGNILMIQVENEYGS----YGEE-KAYLRAIKEMLINRGIDMPLFTSDGP 189
Query: 223 -----------ESD----------------APSPMFTPNNPNSPKIWTENWTGWFKSWGG 255
E D A F +N P + E W GWF W
Sbjct: 190 WQAALRAGSLIEDDVLVTGNFGSRAKENFAAMQDFFDQHNKKWPLMCMEFWDGWFNRWNE 249
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-------TTSYDYDA 308
+R +DLA +V + G N YM+HGGTNFG +G TSYDYDA
Sbjct: 250 PIIRRDPDDLAESVKEALEIGSV--NLYMFHGGTNFGFMNGCSARGAVDLPQVTSYDYDA 307
Query: 309 PIDEYGHLNQPKWGHLRELHKLLKS 333
P+DE G+ + L K+LK
Sbjct: 308 PLDEQGNPTAKYYA----LQKMLKE 328
>gi|422694237|ref|ZP_16752232.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
gi|315148319|gb|EFT92335.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
Length = 593
Score = 166 bits (419), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 107/313 (34%), Positives = 151/313 (48%), Gaps = 46/313 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 13 LNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGMK 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
++ F++ + L VILR Y+CAEW +GG P WL G+ LR+T+ +FM +++N+
Sbjct: 73 NIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGV-RLRSTDPIFMTKVRNY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG K+Y+ ++ L I VP
Sbjct: 132 FQVL--LPKLAPLQITQGGPVIMMQVENEYGS----YG-MEKAYLRQTKQIMEELGIEVP 184
Query: 218 WIM----------------------------CQESDAPSPMF-TPNNPNSPKIWTENWTG 248
+E+ A F T + P + E W G
Sbjct: 185 LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYWDG 244
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-------T 301
WF WG R DLA V G N YM+HGGTNFG +G
Sbjct: 245 WFNRWGEPVIHREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGEKDLPQV 302
Query: 302 TSYDYDAPIDEYG 314
TSYDYDA + E G
Sbjct: 303 TSYDYDALLTEAG 315
Score = 48.5 bits (114), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 54/243 (22%), Positives = 97/243 (39%), Gaps = 56/243 (23%)
Query: 462 LRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGS 521
L++ + LH YV+G+ +Q+ + L + + + +L +G NYG
Sbjct: 403 LKVVEASDRLHIYVDGDLAATQYQETVGEELLILGQTE--KDTLALDILVENLGRVNYGF 460
Query: 522 KFD--MVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERG 579
K + GI G V+ +D+ H+ G ++ ++
Sbjct: 461 KLNNPTQSKGIRGGVM-----------QDIHFHQ--------GYQHYPLTFSQEQLAKID 501
Query: 580 WSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDG 639
+++ PL + ++Y+ TFE D + + +G GKGF VNG++LGRYW
Sbjct: 502 YTAGKNPL--QPSFYQVTFELEQLADTYI-DCRGYGKGFVVVNGHHLGRYWEI------- 551
Query: 640 CSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQT 699
GP S C P+ +++ G N +V+FE G + + F
Sbjct: 552 ---------GPIHSLYC--------------PKEFLQQGQNEVVIFETEGIDIEYLKFTN 588
Query: 700 VVV 702
V+
Sbjct: 589 QVI 591
>gi|195146534|ref|XP_002014239.1| GL19091 [Drosophila persimilis]
gi|194106192|gb|EDW28235.1| GL19091 [Drosophila persimilis]
Length = 672
Score = 166 bits (419), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 108/365 (29%), Positives = 171/365 (46%), Gaps = 55/365 (15%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + H+ + ++GE ++GS HY R+ P W ++ + GL+A++TYV W+ H P
Sbjct: 47 FTIDHESNSFVLNGEPFRYVAGSFHYFRAVPEAWRSRLRTMRASGLNAVDTYVEWSLHNP 106
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
Y++ G D+++F++ Q++ Y+ILR GPY+CAE + GG P WL ++RT+
Sbjct: 107 HDGVYNWEGIADVVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFKKYPSIKMRTS 166
Query: 147 NKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINW-- 204
+ +M E+ + + M + + L GG II+ Q+ENEYG+ D K Y+NW
Sbjct: 167 DSNYMAEVGKWYAEL--MPRLQHLLIGNGGKIIMVQVENEYGDYECD-----KDYLNWLR 219
Query: 205 ------------------------CAKMA---TSLDIGVPWIMCQESDAPSPMFTPNNPN 237
C K+ + D G+ I E D M P
Sbjct: 220 DETEKYVNRNALLFTTDIPNERMSCGKIDNVFATTDFGIDRI--HEIDDIWTMLRKLQPT 277
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
P + +E + GW W + +R + +A A+ + + N YM+ GGTNFG T+G
Sbjct: 278 GPLVNSEFYPGWLTHWQEMNQRRDGQVVADALKTILSYNASV-NLYMFFGGTNFGFTAGA 336
Query: 298 PY----------LTTSYDYDAPIDEYG------HLNQPKWGHLRELHKLLKSMEKTLTYG 341
Y TSYDYDA +DE G +L + G L ++ + K L YG
Sbjct: 337 NYNLDGGIGYAADITSYDYDAVMDEAGGVTSKYNLVKQVIGEFLPLPEITLNPAKRLAYG 396
Query: 342 NVTNT 346
V T
Sbjct: 397 KVEVT 401
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 36/78 (46%), Gaps = 29/78 (37%)
Query: 609 LNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWY 668
LN+ G GKG A+VNG+NLGRYWP GP Q+
Sbjct: 590 LNMAGWGKGVAYVNGFNLGRYWPV---------------AGP--------------QVTL 620
Query: 669 HVPRSWIKDGVNTLVLFE 686
+VP +K G N+LV+ E
Sbjct: 621 YVPNEILKVGNNSLVILE 638
>gi|289664883|ref|ZP_06486464.1| beta-galactosidase [Xanthomonas campestris pv. vasculorum NCPPB
702]
Length = 582
Score = 166 bits (419), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 109/329 (33%), Positives = 156/329 (47%), Gaps = 47/329 (14%)
Query: 33 GRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYD 92
G DG+ LLSG++H+ R W D ++KA+ GL+ +ETYVFWN EP + Q+D
Sbjct: 5 GTQFVRDGKPYQLLSGAVHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFD 64
Query: 93 FTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMN 152
F+GN D+ F++ GL VILR GPY CAEW GG+P WL I +R+ + F+
Sbjct: 65 FSGNNDVAAFVREAAALGLNVILRPGPYACAEWEAGGYPAWLFGKGNIR-VRSRDPRFLA 123
Query: 153 EMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSL 212
Q + + + L GGPII Q+ENEYG+ D+ + N +
Sbjct: 124 ASQAYLDALAKQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMAE---NRAMYVKAGF 178
Query: 213 DIGVPWI-----MCQESDAPSPM----FTPNNPNS------------PKIWTENWTGWFK 251
D + + M P + F P S P++ E W GWF
Sbjct: 179 DKALLFTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRSDQPRMVGEYWAGWFD 238
Query: 252 SWG----GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY-------- 299
WG D ++ A++ + + + G N YM+ GGT+FG +G Y
Sbjct: 239 HWGKPHAATDARQQADEFEWILRQ-----GHSANLYMFIGGTSFGFMNGANYQNNPSDHY 293
Query: 300 --LTTSYDYDAPIDEYGHLNQPKWGHLRE 326
TTSYDYDA +DE GH PK+ +R+
Sbjct: 294 APQTTSYDYDAILDEAGHPT-PKFALMRD 321
Score = 46.6 bits (109), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 60/136 (44%), Gaps = 30/136 (22%)
Query: 498 VKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAG--DETIIKDLSSHKW- 554
V + G++ + +L G NYG P + GRAG D ++ + W
Sbjct: 426 VDIPAGQHTLDVLVENSGRINYG------------PRMADGRAGLIDPVLLDNQQLTSWQ 473
Query: 555 TYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGM 614
+ + + +A +S RGW+ K V + +++ T D L+++
Sbjct: 474 AFPLPM-----------RAPDSIRGWTRKTV---QGPAFHRGTLRIGTPTD-TYLDMRAF 518
Query: 615 GKGFAWVNGYNLGRYW 630
GKGFAW NG NLGR+W
Sbjct: 519 GKGFAWANGVNLGRHW 534
>gi|66767541|ref|YP_242303.1| beta-galactosidase [Xanthomonas campestris pv. campestris str.
8004]
gi|66572873|gb|AAY48283.1| beta-galactosidase [Xanthomonas campestris pv. campestris str.
8004]
Length = 613
Score = 166 bits (419), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 116/365 (31%), Positives = 168/365 (46%), Gaps = 60/365 (16%)
Query: 9 RAILLCLILQTLFNLSLAYRVSHD---------GRAITIDGERKILLSGSIHYPRSTPGM 59
R L L+L L + + D G DG+ +LSG+IH+ R
Sbjct: 3 RTTLAPLVLALSIALPITATAASDDQWPTFATQGTQFVRDGKPYQVLSGAIHFQRIPRTY 62
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W D ++KA+ GL+ +ETYVFWN EP + Q+DF N D+ F++ QGL VILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFNANNDVAAFVREAAAQGLNVILRPGP 122
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
Y CAEW GG+P WL I +R+ + F+ Q++ + + L GGPII
Sbjct: 123 YACAEWEAGGYPAWLFGKDNI-RIRSRDPRFLAASQSYLDAVAQQVR--PLLNHNGGPII 179
Query: 180 LAQIENEYGNVMSDYGDAGKSYI--NWCAKMATSLDIGVPWI-----MCQESDAPSPM-- 230
Q+ENEYG+ D+ +YI N + D + + M P +
Sbjct: 180 AVQVENEYGSYDDDH-----AYIADNRAMFVKAGFDKALLFTSDGADMLANGTLPGTLAV 234
Query: 231 --FTPN------------NPNSPKIWTENWTGWFKSWG----GKDPKRTAEDLAFAVARF 272
F P P+ P++ E W GWF WG + K+ E+L + + +
Sbjct: 235 VNFAPGEAKSAFDKLIKFQPDQPRMVGEYWAGWFDHWGTPHASTNAKQQTEELEWILRQ- 293
Query: 273 FQFGGTFQNYYMYHGGTNFGRTSGGPY----------LTTSYDYDAPIDEYGHLNQPKWG 322
G N YM+ GGT+FG +G + TTSYDYDA +DE G PK+
Sbjct: 294 ----GHSANLYMFIGGTSFGFMNGANFQGNPSDHYAPQTTSYDYDAILDEAGRPT-PKFA 348
Query: 323 HLREL 327
+R++
Sbjct: 349 LMRDV 353
>gi|384428898|ref|YP_005638258.1| beta-galactosidase [Xanthomonas campestris pv. raphani 756C]
gi|341938001|gb|AEL08140.1| beta-galactosidase [Xanthomonas campestris pv. raphani 756C]
Length = 613
Score = 165 bits (418), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 115/360 (31%), Positives = 169/360 (46%), Gaps = 50/360 (13%)
Query: 9 RAILLCLILQTLFNLSLAYRVSHD---------GRAITIDGERKILLSGSIHYPRSTPGM 59
R L L+L L + + D G DG+ +LSG+IH+ R
Sbjct: 3 RTTLAPLVLALAIALPITATAASDDQWPTFATQGTQFVRDGKPYQVLSGAIHFQRIPRAY 62
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W D ++KA+ GL+ +ETYVFWN EP + Q+DF N D+ F++ QGL VILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFNANNDVAAFVREAAAQGLNVILRPGP 122
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
Y CAEW GG+P WL I +R+ + F+ Q++ + + L GGPII
Sbjct: 123 YACAEWEAGGYPAWLFGKDNIR-VRSRDPRFLAASQSYLDAVAQQVRP--LLNHNGGPII 179
Query: 180 LAQIENEYGNVMSDYG---DAGKSYIN--------WCAKMATSLDIG-VPWIMCQESDAP 227
Q+ENEYG+ D+ D ++ + + A L G +P + + AP
Sbjct: 180 AVQVENEYGSYDDDHAYMADNRAMFVKAGFDKALLFTSDGADMLANGTLPGTLAVVNFAP 239
Query: 228 SPM------FTPNNPNSPKIWTENWTGWFKSWG----GKDPKRTAEDLAFAVARFFQFGG 277
P+ P++ E W GWF WG + K+ E+L + + + G
Sbjct: 240 GEAKSAFDKLIKFQPDQPRMVGEYWAGWFDHWGTPHASTNAKQQTEELEWILRQ-----G 294
Query: 278 TFQNYYMYHGGTNFGRTSGGPY----------LTTSYDYDAPIDEYGHLNQPKWGHLREL 327
N YM+ GGT+FG +G + TTSYDYDA +DE G PK+ +R++
Sbjct: 295 HSANLYMFIGGTSFGFMNGANFQGNPSDHYAPQTTSYDYDAILDEAGRPT-PKFALMRDV 353
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 43/180 (23%), Positives = 72/180 (40%), Gaps = 34/180 (18%)
Query: 454 LSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSAT 513
++G +L + V YVN V S + V + G++ + +L
Sbjct: 417 VTGPRKGSLYLGEVRDVARVYVNQQPVGSVERRL----QQVATEVDIPAGQHTLDVLVEN 472
Query: 514 VGLQNYGSKFDMVPNGIPGPVLLVGRAG--DETIIKDLSSHKW-TYKVGLYGLDDKKFYN 570
G NYG P + GRAG D ++ + W + + +
Sbjct: 473 SGRINYG------------PRMADGRAGLVDPVLLDNQPLTNWQAFPLPM---------- 510
Query: 571 AKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYW 630
++ +S RGW+ K V + +++ T D L+++ GKG AW NG NLGR+W
Sbjct: 511 -RSPDSIRGWTGKPV---QGPAFHRGTLRIGTPAD-TYLDMRAFGKGIAWANGVNLGRHW 565
>gi|289670687|ref|ZP_06491762.1| beta-galactosidase [Xanthomonas campestris pv. musacearum NCPPB
4381]
Length = 612
Score = 165 bits (418), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 109/329 (33%), Positives = 157/329 (47%), Gaps = 47/329 (14%)
Query: 33 GRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYD 92
G DG+ LLSG++H+ R W D ++KA+ GL+ +ETYVFWN EP + Q+D
Sbjct: 35 GTQFVRDGKPYQLLSGAVHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFD 94
Query: 93 FTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMN 152
F+GN D+ F++ GL VILR GPY CAEW GG+P WL I +R+ + F+
Sbjct: 95 FSGNNDVAAFVREAAALGLNVILRPGPYACAEWEAGGYPAWLFGKGNIR-VRSRDPRFLA 153
Query: 153 EMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSL 212
Q + + + + L GGPII Q+ENEYG+ D+ + N +
Sbjct: 154 ASQAYLDALAK--QVQPLLNHNGGPIIAVQVENEYGSYADDHAYMAE---NRAMYVKAGF 208
Query: 213 DIGVPWI-----MCQESDAPSPM----FTPNNPNS------------PKIWTENWTGWFK 251
D + + M P + F P S P++ E W GWF
Sbjct: 209 DKALLFTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRSDQPRMVGEYWAGWFD 268
Query: 252 SWG----GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY-------- 299
WG D ++ A++ + + + G N YM+ GGT+FG +G Y
Sbjct: 269 HWGKPHAATDARQQADEFEWILRQ-----GHSANLYMFIGGTSFGFMNGANYQNNPSDHY 323
Query: 300 --LTTSYDYDAPIDEYGHLNQPKWGHLRE 326
TTSYDYDA +DE GH PK+ +R+
Sbjct: 324 APQTTSYDYDAILDEAGHPT-PKFALMRD 351
Score = 46.6 bits (109), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 60/136 (44%), Gaps = 30/136 (22%)
Query: 498 VKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAG--DETIIKDLSSHKW- 554
V + G++ + +L G NYG P + GRAG D ++ + W
Sbjct: 456 VDIPAGQHTLDVLVENSGRINYG------------PRMADGRAGLIDPVLLDNQQLTSWQ 503
Query: 555 TYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGM 614
+ + + +A +S RGW+ K V + +++ T D L+++
Sbjct: 504 AFPLPM-----------RAPDSIRGWTRKTV---QGPAFHRGTLRIGTPTD-TYLDMRAF 548
Query: 615 GKGFAWVNGYNLGRYW 630
GKGFAW NG NLGR+W
Sbjct: 549 GKGFAWANGVNLGRHW 564
>gi|156376589|ref|XP_001630442.1| predicted protein [Nematostella vectensis]
gi|156217463|gb|EDO38379.1| predicted protein [Nematostella vectensis]
Length = 570
Score = 165 bits (418), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 106/308 (34%), Positives = 145/308 (47%), Gaps = 47/308 (15%)
Query: 57 PGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILR 116
P W D + K K GL+ +ETYV WN HE ++ + F LD+++F+K Q GLYVI+R
Sbjct: 2 PEYWKDRLVKLKAMGLNTVETYVAWNLHEQVQDNFKFKDELDIVKFVKLAQRLGLYVIIR 61
Query: 117 IGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGG 176
GPY+CAEW+ GG P WL + P + +LRT+ FM + + + + L QGG
Sbjct: 62 PGPYICAEWDLGGLPSWLLSDPEM-KLRTSYGPFMEAVDRYFQKLFPLL--TPLQYCQGG 118
Query: 177 PIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFTPNN- 235
PII QIENEY + +Y+ KM GV ++ + S P N
Sbjct: 119 PIIAWQIENEYSSFDK---KVDMTYMELLQKMMVK--NGVTEMLLMSDNLFSMKTHPINL 173
Query: 236 ----------------------PNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFF 273
P+ P + TE W GWF WG K E L + F
Sbjct: 174 VLKTINLQKNVKDALLQLKEIQPDKPLMVTEFWPGWFDVWGAKHHILPTEKLIKEIKDLF 233
Query: 274 QFGGTFQNYYMYHGGTNFGRTSGGPYL--------------TTSYDYDAPIDEYGHLNQP 319
G + N+YM+HGGTNFG +G + TSYDYDAP+ E G + P
Sbjct: 234 SLGASI-NFYMFHGGTNFGFMNGASFTPSGVSVLEGDYQPDITSYDYDAPLSESGDIT-P 291
Query: 320 KWGHLREL 327
K+ LR+
Sbjct: 292 KYKALRKF 299
Score = 42.4 bits (98), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 58/141 (41%), Gaps = 21/141 (14%)
Query: 494 FERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHK 553
FE+P K + ++ +G N+G D GI G VL+ G+ + K
Sbjct: 425 FEKPND-EDDKVLLEIMVENMGRANFGKAMDAQRKGILGKVLIDGK----------TPRK 473
Query: 554 WTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMT----WYKTTFEAPLENDPVVL 609
W +Y LD K + + S WS +N + +Y+ + +
Sbjct: 474 WK----IYPLDFHKTFTERFPRS--SWSQAGTKINGSVGHSPGFYRGILHIQGQPRDTFV 527
Query: 610 NLQGMGKGFAWVNGYNLGRYW 630
+ +G GKG VNG NLGRYW
Sbjct: 528 HPKGWGKGVCLVNGKNLGRYW 548
>gi|21232326|ref|NP_638243.1| beta-galactosidase [Xanthomonas campestris pv. campestris str. ATCC
33913]
gi|21114096|gb|AAM42167.1| beta-galactosidase [Xanthomonas campestris pv. campestris str. ATCC
33913]
Length = 613
Score = 165 bits (418), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 115/360 (31%), Positives = 169/360 (46%), Gaps = 50/360 (13%)
Query: 9 RAILLCLILQTLFNLSLAYRVSHD---------GRAITIDGERKILLSGSIHYPRSTPGM 59
R L L+L L + + D G DG+ +LSG+IH+ R
Sbjct: 3 RTTLAPLVLALSIALPITATAASDDQWPTFATQGTQFVRDGKPYQVLSGAIHFQRIPRTY 62
Query: 60 WPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGP 119
W D ++KA+ GL+ +ETYVFWN EP + Q+DF N D+ F++ QGL VILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFNANNDVAAFVREAAAQGLNVILRPGP 122
Query: 120 YVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPII 179
Y CAEW GG+P WL I +R+ + F+ Q++ + + L GGPII
Sbjct: 123 YACAEWEAGGYPAWLFGKDNIR-IRSRDPRFLAASQSYLDAVAQQVRP--LLNHNGGPII 179
Query: 180 LAQIENEYGNVMSDYG---DAGKSYIN--------WCAKMATSLDIG-VPWIMCQESDAP 227
Q+ENEYG+ D+ D ++ + + A L G +P + + AP
Sbjct: 180 AVQVENEYGSYDDDHAYMADNRAMFVKAGFDKALLFTSDGADMLANGTLPGTLAVVNFAP 239
Query: 228 SPM------FTPNNPNSPKIWTENWTGWFKSWG----GKDPKRTAEDLAFAVARFFQFGG 277
P+ P++ E W GWF WG + K+ E+L + + + G
Sbjct: 240 GEAKSAFDKLIKFQPDQPRMVGEYWAGWFDHWGTPHASTNAKQQTEELEWILRQ-----G 294
Query: 278 TFQNYYMYHGGTNFGRTSGGPY----------LTTSYDYDAPIDEYGHLNQPKWGHLREL 327
N YM+ GGT+FG +G + TTSYDYDA +DE G PK+ +R++
Sbjct: 295 HSANLYMFIGGTSFGFMNGANFQGNPSDHYAPQTTSYDYDAILDEAGRPT-PKFALMRDV 353
>gi|242004937|ref|XP_002423332.1| beta-galactosidase precursor, putative [Pediculus humanus corporis]
gi|212506351|gb|EEB10594.1| beta-galactosidase precursor, putative [Pediculus humanus corporis]
Length = 596
Score = 165 bits (418), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 113/341 (33%), Positives = 171/341 (50%), Gaps = 32/341 (9%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +SGS HY R W D ++K K GL+A+ TYV W+ HE + YDF G+L
Sbjct: 1 MDGKPFQYVSGSAHYFRMPNQYWRDRLRKIKAAGLNAVSTYVEWSQHERVPGVYDFEGDL 60
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+ RF++ Q++GL+VILR GPY+CAE + GG P WL +LR+++ + +Q +
Sbjct: 61 DVKRFVEMAQEEGLFVILRPGPYICAERDMGGLPYWLMTKHPDIQLRSSDFFYTYYVQRW 120
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS-DYG------DAGKSYINWCAKMAT 210
+ + K L+ +GGPIIL Q+ENEYG+ S DY + + ++++ A + T
Sbjct: 121 MDKL--LGKFTDLWYGKGGPIILVQVENEYGSYHSCDYNHTYWLRNLFEKHVDYNAVLFT 178
Query: 211 S-------LDIG-VPWIMCQESDAP----SPMFTPN---NPNSPKIWTENWTGWFKSWGG 255
+ L G +P + P S MF P+ P + +E + GW WG
Sbjct: 179 TDGASRNFLKCGKIPGVYATVDFGPNSNVSKMFEAQREFEPSGPLVNSEYYPGWLTHWGE 238
Query: 256 KDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-------TTSYDYDA 308
K R R N+YM++GG+NFG T+G TSYDYDA
Sbjct: 239 KKHARQDTKDVVKTLREMLNEKANVNFYMFYGGSNFGFTAGANQFGSIYQSDITSYDYDA 298
Query: 309 PIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYG 349
PI E G L K+ ++ + + ++ +T DYG
Sbjct: 299 PISEAGDLTD-KYYAIKNVLEEYFNLTSNITVETHDKGDYG 338
>gi|49256283|gb|AAH74351.1| LOC443705 protein, partial [Xenopus laevis]
Length = 672
Score = 165 bits (418), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 106/323 (32%), Positives = 157/323 (48%), Gaps = 40/323 (12%)
Query: 24 SLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNA 83
S ++ + ++ DG+ +SGSIHY R W D + K GL+A++ Y+ WN
Sbjct: 73 SKSFSIDYNKNCFRKDGQCFRYISGSIHYFRIPADYWRDRLLKMYMTGLNAVQVYIPWNF 132
Query: 84 HEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEEL 143
HEPL YDF G+ DL RF+ + GL VI+R GPY+CAEW+ GG P WL N I L
Sbjct: 133 HEPLPGLYDFNGDRDLSRFLDLTDELGLLVIIRPGPYICAEWDMGGLPAWLLNNKDI-AL 191
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS-DYG------- 195
RT++ ++N + ++ +++ + K S GG II Q+ENEYG+ M+ DY
Sbjct: 192 RTSDPDYLNAVDSWFSVL--LPKLRSRLYSNGGNIISVQVENEYGSFMACDYSYLRHLLH 249
Query: 196 ---------------DAGKSYINWCAKMA---TSLDIGVPWIMCQESDAPSPMFTPNNPN 237
D C + T++D G + + P
Sbjct: 250 LFRLYLGDEVVLFTTDGNTERELQCGSLQDLYTTVDFGP----GDNATKAFKLLRKYQPK 305
Query: 238 SPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG 297
P + +E +TGW WG K + E ++ + + G + N YM+ GGTNFG +G
Sbjct: 306 GPLVNSEYYTGWLDYWGEKHSTTSKELVSQGLKNILEMGASV-NMYMFEGGTNFGYWNGA 364
Query: 298 PY------LTTSYDYDAPIDEYG 314
+ +TTSYDYDAP+ E G
Sbjct: 365 DFKKIYKPITTSYDYDAPLSEAG 387
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 63/159 (39%), Gaps = 40/159 (25%)
Query: 537 VGRAGDETIIKDLSSHKWTYKVGLYGLDDKKFY--NAKAANSERGWSS-------KNVPL 587
+GR + + DL +G+ L D Y N + SE GW N
Sbjct: 526 MGRINFGSCVNDLKGLVSNLTLGVDILTDWLVYPLNLEGPISE-GWPQMGNNFIFSNTEA 584
Query: 588 NRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDY 647
N ++Y TF+ + D L+L KG W+NG+N+GRYWP
Sbjct: 585 NTGPSFYSGTFQITTQGD-TFLSLPQWTKGQVWINGFNVGRYWPA--------------- 628
Query: 648 RGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFE 686
RGP QI +VP + ++ G NT+ L E
Sbjct: 629 RGP--------------QITLYVPGNILRLGENTVTLLE 653
>gi|392427936|ref|YP_006468947.1| beta-galactosidase [Streptococcus intermedius JTH08]
gi|419777127|ref|ZP_14303045.1| glycosyl hydrolase family 35 [Streptococcus intermedius SK54]
gi|383845338|gb|EID82742.1| glycosyl hydrolase family 35 [Streptococcus intermedius SK54]
gi|391757082|dbj|BAM22699.1| beta-galactosidase [Streptococcus intermedius JTH08]
Length = 601
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 112/326 (34%), Positives = 154/326 (47%), Gaps = 58/326 (17%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+LSG+IHY R P W + K G + +ETY+ WNAHEP++ Q+DF G LD+ +F++
Sbjct: 25 ILSGAIHYFRIQPDDWYHSLYNLKALGFNTVETYIPWNAHEPMKGQFDFEGILDVEKFLQ 84
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWL--HNMPGIEELRTTNKVFMNEMQNFTTLIV 162
T QD GLYV+LR PY+CAEW +GG P WL NM +R+++ ++ + N+ ++
Sbjct: 85 TAQDLGLYVLLRSSPYICAEWEFGGLPAWLLEENM----RIRSSDPAYLAAVANYYDELL 140
Query: 163 DMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQ 222
L GG I++ Q+ENEYG+ D K Y+ M + P
Sbjct: 141 PRLVSHLL--ENGGSILMMQVENEYGSYGED-----KEYLRAVRDMMQERGVTCPLFT-- 191
Query: 223 ESDAP------------SPMFTPNNPNS--------------------PKIWTENWTGWF 250
SD P MF N S P + E W GWF
Sbjct: 192 -SDGPWRATLRAGTLIEDDMFVTGNFGSKAKENFAQMQEFFDEHDKKWPLMCMEFWDGWF 250
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLTTS 303
W R E+LA AV Q G N YM+HGGTNFG R S TS
Sbjct: 251 NRWKEPTVTRDPEELAEAVHEVLQQGSI--NLYMFHGGTNFGFMNGCSARGSIDLPQVTS 308
Query: 304 YDYDAPIDEYGHLNQPKWGHLRELHK 329
YDY+A +DE G+ PK+ ++ + K
Sbjct: 309 YDYEALLDEQGN-PTPKYFAVQRMLK 333
Score = 39.7 bits (91), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 51/196 (26%), Positives = 81/196 (41%), Gaps = 39/196 (19%)
Query: 438 YLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQW-TKYGASNDLFER 496
YL Y T A+ D +R+ + +V+GN++ +Q+ T+ G D+F
Sbjct: 391 YLLYRTQAEWDADKE--------RVRVIDGRDRMQLFVDGNFITTQYQTEIG--EDIFIS 440
Query: 497 PVKLTRGKNQISLLSATVGLQNYGSKF--DMVPNGIPGPVLLVGRAGDETIIKDLSSHKW 554
K R +++ +L +G NYG K + GI R G + KDL +
Sbjct: 441 QQK--RSIHRLDILMENMGRVNYGHKLLAESQHKGI--------RTG---VCKDLH---F 484
Query: 555 TYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGM 614
Y L+ F N +A + + W + +Y E D L+L
Sbjct: 485 MLHWNQYPLE---FENPEAIDFTKEWHED------QPAFYAFAVELKALKD-TYLDLTHF 534
Query: 615 GKGFAWVNGYNLGRYW 630
GKG +VNG N+GR+W
Sbjct: 535 GKGVVFVNGVNIGRFW 550
>gi|374312360|ref|YP_005058790.1| glycoside hydrolase family protein [Granulicella mallensis
MP5ACTX8]
gi|358754370|gb|AEU37760.1| glycoside hydrolase family 35 [Granulicella mallensis MP5ACTX8]
Length = 627
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 108/318 (33%), Positives = 155/318 (48%), Gaps = 50/318 (15%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
++SG + Y R W D ++KA GL+AI YVFWN HEP YDF+G D+ F++
Sbjct: 55 IVSGELEYARIPRPYWRDRLRKAHAMGLNAITIYVFWNIHEPTPEVYDFSGQNDVAEFVR 114
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWL---HNMPGIEELRTTNKVFMNEMQNFTTLI 161
Q +GLYVILR GPYVCAEW+ GG+P WL H M +LR+ F T +
Sbjct: 115 EAQQEGLYVILRPGPYVCAEWDLGGYPAWLLKDHEM----KLRSLQPEFKAAA---TRWM 167
Query: 162 VDMAKK-EKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWI- 219
+ + ++ L AS+GGPI+ Q+ENEYG+ D+ Y+ W ++ G +
Sbjct: 168 LRLGQELTPLQASRGGPILAVQVENEYGSFGDDH-----EYMKWVHELVLQAGFGGSLLY 222
Query: 220 ------MCQESDAPS----------------PMFTPNNPNSPKIWTENWTGWFKSWGGKD 257
+ ++ PS ++ P +P E W GWF WG K
Sbjct: 223 TGDGADVLKQGTLPSVFAGIDFGTGDAARSIKLYKAFRPQTPVYVAEYWDGWFDHWGEKH 282
Query: 258 PKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--------PYLTTSYDYDAP 309
A + + G + + YM HGGT+FG +G P + +SYDYDAP
Sbjct: 283 QLTDAAKQETEIRSMLEQGDSI-SLYMVHGGTSFGWMNGANNDHDGYQPDV-SSYDYDAP 340
Query: 310 IDEYGHLNQPKWGHLREL 327
+DE G +PK+ LR +
Sbjct: 341 LDESGR-PRPKYFRLRNI 357
Score = 40.0 bits (92), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 41/106 (38%), Gaps = 31/106 (29%)
Query: 593 WYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYG 652
+Y F P D LN + + KG WVNG+ LGR+W GP G
Sbjct: 531 FYHAEFSTPNPGD-TFLNTEQLVKGVVWVNGHLLGRFWDI----------------GPAG 573
Query: 653 SDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQ 698
+ +VP W+ G N L +F+ GG QI Q
Sbjct: 574 A--------------LYVPGVWLHQGKNELTVFDLNGGRNLQIEGQ 605
>gi|345003968|ref|YP_004806822.1| glycoside hydrolase family protein [Streptomyces sp. SirexAA-E]
gi|344319594|gb|AEN14282.1| glycoside hydrolase family 35 [Streptomyces sp. SirexAA-E]
Length = 602
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 105/313 (33%), Positives = 160/313 (51%), Gaps = 32/313 (10%)
Query: 40 GERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDL 99
G +L+G++HY R P W D +++ GL+ ++TY+ WN HE ++ F G D+
Sbjct: 20 GRPHQVLAGTLHYFRVHPDQWHDRLERLAAMGLNTVDTYIAWNFHERRTGEHRFDGWRDI 79
Query: 100 IRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQN-FT 158
RF++T Q GL VI+R GPY+CAEW+ GG P WL + PG+ R++ +++E+ F
Sbjct: 80 ERFVRTAQRTGLDVIVRPGPYICAEWDNGGLPAWLTDRPGMRP-RSSYAPYLDEVARWFD 138
Query: 159 TLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYG------DA----GKSYINWCAKM 208
LI +A L A++GGP++ Q+ENEYG+ D+ DA G + + + A
Sbjct: 139 VLIPRIA---DLQAARGGPVVAVQVENEYGSYGDDHAYMRWVHDALAGRGVTELLYTADG 195
Query: 209 ATSLDI---GVPWIMC-----QESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGKDPKR 260
T L + +P ++ +D + + P + E W GWF WG K R
Sbjct: 196 PTELMLDGGSLPGVLATATLGSRADQAAQLLRTRRSGEPFLCAEFWNGWFDHWGEKHHTR 255
Query: 261 TAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-------TTSYDYDAPIDEY 313
+ A A+ GG+ + Y HGGTNFG +G + TSYD DAPI E+
Sbjct: 256 SVGSAAAALDEILAKGGSV-SLYPAHGGTNFGLWAGANHADGALQPTVTSYDSDAPIAEH 314
Query: 314 GHLNQPKWGHLRE 326
G PK+ R+
Sbjct: 315 G-APTPKFHAFRD 326
>gi|354585216|ref|ZP_09004105.1| glycoside hydrolase family 35 [Paenibacillus lactis 154]
gi|353188942|gb|EHB54457.1| glycoside hydrolase family 35 [Paenibacillus lactis 154]
Length = 619
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 97/309 (31%), Positives = 155/309 (50%), Gaps = 44/309 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ ++SG++HY R P W D + K K G + +ETY+ WN HEP +++F+G
Sbjct: 13 LDGQPYRIISGAVHYFRVVPEYWEDRLLKLKACGFNTVETYIAWNVHEPTEGEFNFSGMA 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+ FI+ GL+VI+R P++CAEW +GG P WL I LR ++ ++++++ ++
Sbjct: 73 DVGSFIELAGKLGLHVIVRPSPFICAEWEFGGLPGWLLGYGEI-RLRCSDPLYLSKVDHY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + L +S GGPI+ Q+ENEYG+ +D+ +Y+ + + V
Sbjct: 132 YDELI--PRMVPLLSSNGGPILAVQVENEYGSYGNDH-----AYLEYLRAGLVRRGVDV- 183
Query: 218 WIMCQESDAPSP--------------------------MFTPNNPNSPKIWTENWTGWFK 251
+ SD P+ + + P + E W GWF
Sbjct: 184 --LLFTSDGPTDEMLLGGSIDHVHATVNFGSRVEESFGKYREYRTDEPLMVMEFWNGWFD 241
Query: 252 SWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYD 305
W R A D+A + + G + N YM+HGGTNFG SG ++ TTSYD
Sbjct: 242 HWMEDHHVRDAADVAGVLDEMLEKGSSI-NMYMFHGGTNFGFYSGANHIKTYEPTTTSYD 300
Query: 306 YDAPIDEYG 314
YDAP+ E+G
Sbjct: 301 YDAPLTEWG 309
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 42/101 (41%), Gaps = 31/101 (30%)
Query: 593 WYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYG 652
+Y+ +F+ D L G KG AW+NG+NLGRYW
Sbjct: 519 FYRGSFQVEDIGD-TFLRFDGWTKGVAWINGFNLGRYW---------------------- 555
Query: 653 SDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPS 693
N G P + Y +P ++ G N LVLFE GG S
Sbjct: 556 ------NAG-PQKALY-IPGPLLRKGENELVLFELHGGPES 588
>gi|340722578|ref|XP_003399681.1| PREDICTED: beta-galactosidase-like [Bombus terrestris]
Length = 646
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 105/320 (32%), Positives = 161/320 (50%), Gaps = 35/320 (10%)
Query: 25 LAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAH 84
++ V ++ +DG+ +SGS HY R+ W D ++K + GL+A+ TYV W+ H
Sbjct: 30 FSFEVDYENNQFLLDGKPFRYISGSFHYFRTPRQYWRDRLRKMRAAGLNAVSTYVEWSLH 89
Query: 85 EPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVW-LHNMPGIEEL 143
+P ++ +TG+ D+I FI Q++GL+V+LR GPY+CAE ++GG P W L +P I +L
Sbjct: 90 QPTENEWHWTGDADVIEFINIAQEEGLFVLLRPGPYICAERDFGGLPYWLLARVPDI-KL 148
Query: 144 RTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGN------VMSDYGDA 197
RT + +M ++ + I+D K + GGPII+ Q+ENEYG+ +S D
Sbjct: 149 RTNDSRYMKYVEIYLNEILD--KVQPYLRGNGGPIIMVQVENEYGSYACDREYLSRLRDI 206
Query: 198 GKSYINWCAKM-------ATSLDIG-VPWIMCQESDAPSPMFTPN-------NPNSPKIW 242
+ I A + A L G +P + P+ T N P P +
Sbjct: 207 MRQKIGTKALLYSTDGANANMLRCGFIPEVYATVDFGPNTNVTKNFEIMRMYQPRGPLVN 266
Query: 243 TENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG----- 297
+E + GW W + + + G + N YM++GGTNFG T+G
Sbjct: 267 SEFYPGWLTHWREPFQRVQTATVTKTLDEMLSLGASV-NIYMFYGGTNFGYTAGANGGHN 325
Query: 298 ---PYLTTSYDYDAPIDEYG 314
P L TSYDYDAP+ E G
Sbjct: 326 AYNPQL-TSYDYDAPLTEAG 344
>gi|313237463|emb|CBY12650.1| unnamed protein product [Oikopleura dioica]
Length = 583
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 104/337 (30%), Positives = 157/337 (46%), Gaps = 41/337 (12%)
Query: 29 VSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLR 88
++ DG +DG+ +LSG+IHY R W ++ + GL+ I+ Y+ WN HE R
Sbjct: 8 LTADGDTFKLDGKDFRILSGAIHYFRIPKQSWKHRLQSVVDCGLNTIDVYIPWNLHEKER 67
Query: 89 RQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNK 148
+DF G LDL+ F + GL V+ R GPY+C+EW++GG P WL P + +R+
Sbjct: 68 GNFDFAGELDLVEFFTIAAEMGLKVLCRPGPYICSEWDWGGLPSWLLKDPKM-HIRSNYC 126
Query: 149 VFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKM 208
+ + ++ + ++ + L S GGPII Q+ENEYG DY D ++ W A +
Sbjct: 127 GYQAAVSSYFSKLLPLLA--PLQHSNGGPIIAFQVENEYG----DYVDKDNEHLPWLADL 180
Query: 209 ATSLDIGVPWIMCQ------------------------ESDAPSPMFTPNNPNSPKIWTE 244
S + + + + A + PN P + TE
Sbjct: 181 MKSHGLFELFFISDGGHTIRKANMLKVRSTAQLNSGSFQLLAKAFSLKSLQPNKPMLVTE 240
Query: 245 NWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL---- 300
W GWF WG E + + G + N+YM+HGGTNFG +G L
Sbjct: 241 FWAGWFDYWGHGRNLLNNEVFEKTLKEILKRGASV-NFYMFHGGTNFGFMNGAIELEKGY 299
Query: 301 ----TTSYDYDAPIDEYGHLNQPKWGHLRELHKLLKS 333
TSYDYD P+DE G+ + KW +R + K+
Sbjct: 300 YTADVTSYDYDCPVDESGNRTE-KWEIIRRCLNVQKT 335
>gi|256957323|ref|ZP_05561494.1| beta-galactosidase [Enterococcus faecalis DS5]
gi|257077681|ref|ZP_05572042.1| beta-galactosidase [Enterococcus faecalis JH1]
gi|307270129|ref|ZP_07551446.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
gi|422710565|ref|ZP_16767610.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
gi|422721468|ref|ZP_16778057.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|422867159|ref|ZP_16913760.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
gi|256947819|gb|EEU64451.1| beta-galactosidase [Enterococcus faecalis DS5]
gi|256985711|gb|EEU73013.1| beta-galactosidase [Enterococcus faecalis JH1]
gi|306513498|gb|EFM82113.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
gi|315031294|gb|EFT43226.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|315035298|gb|EFT47230.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
gi|329577710|gb|EGG59137.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
Length = 593
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 106/313 (33%), Positives = 151/313 (48%), Gaps = 46/313 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 13 LNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGMK 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
++ F++ + L VILR Y+CAEW +GG P WL + LR+T+ +FM +++N+
Sbjct: 73 NIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKDV-RLRSTDPIFMTKVRNY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG K+Y+ ++ L I VP
Sbjct: 132 FQVL--LPKLAPLQITQGGPVIMMQVENEYGS----YG-MEKAYLRQTKQIMEELGIEVP 184
Query: 218 WIM----------------------------CQESDAPSPMF-TPNNPNSPKIWTENWTG 248
+E+ A F T + P + E W G
Sbjct: 185 LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYWDG 244
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF WG +R DLA V G N YM+HGGTNFG R +
Sbjct: 245 WFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLPQV 302
Query: 302 TSYDYDAPIDEYG 314
TSYDYDA + E G
Sbjct: 303 TSYDYDALLTEAG 315
>gi|256396208|ref|YP_003117772.1| beta-galactosidase [Catenulispora acidiphila DSM 44928]
gi|256362434|gb|ACU75931.1| Beta-galactosidase [Catenulispora acidiphila DSM 44928]
Length = 625
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 112/332 (33%), Positives = 162/332 (48%), Gaps = 44/332 (13%)
Query: 34 RAITIDGERKI-------LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
R +TIDG R + ++S +IHY R P +W D +++ + G + +E Y+ WN H+P
Sbjct: 5 RVLTIDGGRFLRGGREHRIVSAAIHYFRIHPDLWRDRLQRLRAMGCNTVECYIAWNFHQP 64
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTT 146
F G D+ F++ + G VI R GPY+CAEW++GG P WL + LRTT
Sbjct: 65 TPAAPRFDGWRDVAGFVRLAGELGFDVIARPGPYICAEWDFGGLPAWLLADENV-RLRTT 123
Query: 147 NKVFMNEMQN-FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS--DYGD-AGKSYI 202
+ V++ + F LI +A +L A++GGP++ QIENEYG+ + DY D K I
Sbjct: 124 DPVYLAAVDAWFDELIPVLA---ELQATRGGPVVAVQIENEYGSFGADPDYLDHLRKGLI 180
Query: 203 NWCAKMATSLDIGVPWIMCQESDAPSPMFTPN---------------NPNSPKIWTENWT 247
G +M P + T N P+ P + E W
Sbjct: 181 ERGVDTLLFTSDGPQELMLAGGTVPDVLATVNFGSRADEAFATLRRVRPDDPPVCMEFWN 240
Query: 248 GWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY-------- 299
GWF +G R+A+D A ++ GG+ N+YM HGGTNFG +G +
Sbjct: 241 GWFDHFGEPHHTRSAQDAARSLDEILAAGGSV-NFYMGHGGTNFGFWAGANHSGVGTGDP 299
Query: 300 ----LTTSYDYDAPIDEYGHLNQPKWGHLREL 327
TSYDYDAP+ E G L PK+ RE+
Sbjct: 300 GYQPTITSYDYDAPVGEAGELT-PKFHLFREV 330
>gi|295135993|ref|YP_003586669.1| beta-galactosidase [Zunongwangia profunda SM-A87]
gi|294984008|gb|ADF54473.1| putative exported beta-galactosidase [Zunongwangia profunda SM-A87]
Length = 616
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 193/729 (26%), Positives = 292/729 (40%), Gaps = 170/729 (23%)
Query: 4 LKHCSRAILL-CLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPD 62
++ ++ ILL CL + T +++ DG GE + +G +HY R W
Sbjct: 1 MRFLAKLILLFCLAVNTTQAQDEVFKI-EDGN-FKYKGEPIHIYAGEMHYARIPKAYWRH 58
Query: 63 LIKKAKEGGLDAIETYVFWNAHEPLRRQYDFT-GNLDLIRFIKTIQDQGLYVILRIGPYV 121
++ K GL+A+ TYVFWN H ++D+T GN +L FIK +++GL+VILR GPY
Sbjct: 59 RLQMIKALGLNAVNTYVFWNYHNTAPGKWDWTSGNKNLPEFIKMAKEEGLFVILRPGPYA 118
Query: 122 CAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLF-ASQGGPIIL 180
CAEW +GG+P WL P ++ +R N F++ + T I + A + K F AS GG +I+
Sbjct: 119 CAEWEFGGYPWWLQKNPDLK-IRQNNPAFLDSCR---TYINEFATQVKPFQASNGGNVIM 174
Query: 181 AQIENEYGNVMSDYGDAG----KSYINWCAKMATSLDIGVPWIMCQES--------DAPS 228
Q ENE+G+ ++ D K+Y M + P+ + D
Sbjct: 175 VQAENEFGSFVAQREDISTEDHKAYKQKIFDMLKDSGLDGPFFTSDGTWLFKGGAIDGVL 234
Query: 229 PMFTPNNPNS----------------PKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARF 272
P T N N+ P + E + GW W K +AE +A +
Sbjct: 235 P--TANGENNVANLKKAVNEYHNGEGPYMVAEFYPGWLDHWAEPFNKISAEAIAKQTEVY 292
Query: 273 FQFGGTFQNYYMYHGGTNFGRTSGGPY--------LTTSYDYDAPIDEYGHLNQPKWGHL 324
+ F N+YM HGGTNFG TSG Y TSYDYDAPI E G PK+
Sbjct: 293 LKNDVDF-NFYMVHGGTNFGFTSGANYNDDHDIQPDITSYDYDAPISEAGWAT-PKY--- 347
Query: 325 RELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQT 384
R + KL+K +Y++P I P E K +
Sbjct: 348 RAIRKLMKEY--------------------VAYDIPEIPKQI-PVISIPEIKLNKKQSAL 386
Query: 385 NV-KVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDVSDYLWYMT 443
+V K +P A AP+ + + + +G G+ L +K T ++
Sbjct: 387 DVIKASKPVIA---DAPMSF--------EALDQGYGY----VLYRKKFTQPITG------ 425
Query: 444 NADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRG 503
L+I YVNG YV Y +E +K+
Sbjct: 426 -----------------KLQIPGLRDYATVYVNGKYVGKLNRMYNE----YEMDIKIPFN 464
Query: 504 KNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTYKVGLYGL 563
+ +L +G NYG+K GI IK +S + + G + +
Sbjct: 465 -GTLEILVENMGRINYGAKMTKNKKGI---------------IKAVSINDYEISGG-WEM 507
Query: 564 DDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNG 623
F +A A ++E+ N Y F+ D L++Q GKG +VNG
Sbjct: 508 YKAPFDSAPALSNEQEIK------NGLPVLYSGNFDLEEIGD-TFLDMQKWGKGIVFVNG 560
Query: 624 YNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLV 683
++LGRYW P Q Y +P W+K+ NT+
Sbjct: 561 HHLGRYWKV-----------------------------GPQQTLY-LPGCWLKEKGNTIT 590
Query: 684 LFEEFGGNP 692
+ E+ +P
Sbjct: 591 ILEQLNEDP 599
>gi|432954511|ref|XP_004085513.1| PREDICTED: beta-galactosidase-like [Oryzias latipes]
Length = 653
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 109/337 (32%), Positives = 170/337 (50%), Gaps = 31/337 (9%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
++ + ++ DG+R +SGSIHY R W D + K GL+AI+TY+ WN HE
Sbjct: 27 SFSLDYNADCFRKDGQRFRFISGSIHYSRIPRVYWKDRLVKMYMAGLNAIQTYIPWNYHE 86
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
Y+F+G+ D+ F+K QD GL VILR GPY+CAEW GG P WL + I LR+
Sbjct: 87 ESPGMYNFSGDRDVEYFLKLAQDIGLLVILRPGPYICAEWEMGGLPAWLLSKKDI-VLRS 145
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYG-------NVMSDYGDAG 198
++ ++ + + ++ M K GGPII Q+ENEYG N M
Sbjct: 146 SDPDYVAAVDTWMGKLLPMMK--PYLYQNGGPIITVQVENEYGSYFACDYNYMRHLTKLF 203
Query: 199 KSYINWCAKMATSLDIGVPWIMCQESDA--PSPMFTPNN-------------PNSPKIWT 243
+S++ + T+ G+ ++ C + F P + P+ P + +
Sbjct: 204 RSHLGEDVVLFTTDGAGLNYLKCGAIQGLYATVDFGPGSNITAAFEAQRHAEPHGPLVNS 263
Query: 244 ENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYLT 301
E +TGW WG + + + +A ++ + G N YM+ GGTNFG +G PY
Sbjct: 264 EFYTGWLDHWGSRHSVVSPDLVAKSLNQQLAMGANV-NMYMFIGGTNFGYWNGANSPYSA 322
Query: 302 --TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
TSYDYDAP+ E G L + K+ +RE+ ++ + + +
Sbjct: 323 QPTSYDYDAPLTEAGDLTE-KYFAIREVIRMYRRIPE 358
>gi|386316666|ref|YP_006012830.1| beta-galactosidase [Streptococcus dysgalactiae subsp. equisimilis
ATCC 12394]
gi|410494431|ref|YP_006904277.1| beta-galactosidase [Streptococcus dysgalactiae subsp. equisimilis
AC-2713]
gi|417753610|ref|ZP_12401718.1| putative beta-galactosidase [Streptococcus dysgalactiae subsp.
equisimilis SK1249]
gi|417927388|ref|ZP_12570776.1| glycosyl hydrolase family 35 [Streptococcus dysgalactiae subsp.
equisimilis SK1250]
gi|323126953|gb|ADX24250.1| beta-galactosidase precursor [Streptococcus dysgalactiae subsp.
equisimilis ATCC 12394]
gi|333769390|gb|EGL46514.1| putative beta-galactosidase [Streptococcus dysgalactiae subsp.
equisimilis SK1249]
gi|340765262|gb|EGR87788.1| glycosyl hydrolase family 35 [Streptococcus dysgalactiae subsp.
equisimilis SK1250]
gi|410439591|emb|CCI62219.1| K12308 beta-galactosidase [Streptococcus dysgalactiae subsp.
equisimilis AC-2713]
Length = 594
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 164/688 (23%), Positives = 280/688 (40%), Gaps = 163/688 (23%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +LSG++HY R P W ++ K G + +ETYV WN HEP + + F G
Sbjct: 12 LDGKPFKILSGAVHYFRIIPDSWYRVLYNLKALGFNTVETYVPWNLHEPQKGHFCFEGLA 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL F+ Q+ GLY I+R PY+CAEW +GG P WL P +R+ +KV+++ + +
Sbjct: 72 DLEAFLDLAQNLGLYAIVRPSPYICAEWEFGGLPAWLLEEPC--RVRSRDKVYLDHVAAY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+++ K +L +GG I++ Q+ENEYG+ D K Y+ M + I P
Sbjct: 130 YDVLLPKLAKRQL--DRGGNILMFQVENEYGSYGED-----KEYLRALKDMMLARGIEAP 182
Query: 218 WI-----------------------------MCQESDAPSPMFTPNNPNSPKIWTENWTG 248
+ Q + + + P + E W G
Sbjct: 183 LFTSDGAWESALEAGSLIEDNLLVTGNFGSKVSQNVASLRAFMSKHGKEWPMMCMEFWLG 242
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-------T 301
WF WG +R ++ + + G N YM+ GGTNFG +G
Sbjct: 243 WFNRWGEAIIRRDPQETVATIMDMIEQGSI--NLYMFCGGTNFGFMNGSSARLQKDLPQV 300
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPA 361
TSYDYDA +DE G+ P + L K LK+ L + + P+
Sbjct: 301 TSYDYDALLDEAGN---PTLKY-SLLQKALKATYPDLAFAEPLVS-------------PS 343
Query: 362 WSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHF 421
++ +P T KV+ T +K N +G + + + P+ +
Sbjct: 344 MAIGPIP-------LTQKVSLLTTLK----NVSG-----ITFSFYPQSM----------- 376
Query: 422 ALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVD 481
++ N Y+ Y + D LRI + + +++ +
Sbjct: 377 --------EALNHSLGYMLYRSRLSKYGDQE--------RLRIIDARDRVQVFLDDRRIQ 420
Query: 482 SQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAG 541
+Q+ + + L + L++ ++Q+++L +G +YG K P+ G +GR
Sbjct: 421 TQYQEDIGKDIL----MTLSKQRSQLTILVENMGRVSYGHKL-TAPSQHKG----LGRG- 470
Query: 542 DETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAP 601
++ DL + +G + F + +GW + +P ++Y F+
Sbjct: 471 ---VMSDL------HFIGQWEQIPLDFQELSWLDFSQGW-IEGLP-----SFYAYDFDCQ 515
Query: 602 LENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCG 661
+ D + +L GKG A +NG+NLGR+W +GP S
Sbjct: 516 VPTDTYI-DLSQFGKGIALINGFNLGRFWQ----------------KGPILS-------- 550
Query: 662 NPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
++P+ ++ G N LV+FE G
Sbjct: 551 ------LYLPKGLLQKGKNRLVIFETEG 572
>gi|294779195|ref|ZP_06744602.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
gi|294453706|gb|EFG22101.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
Length = 592
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 106/313 (33%), Positives = 151/313 (48%), Gaps = 46/313 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 12 LNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGMK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
++ F++ + L VILR Y+CAEW +GG P WL + LR+T+ +FM +++N+
Sbjct: 72 NIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKDV-RLRSTDPIFMTKVRNY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG K+Y+ ++ L I VP
Sbjct: 131 FQVL--LPKLAPLQITQGGPVIMMQVENEYGS----YG-MEKAYLRQTKQIMEELGIEVP 183
Query: 218 WIM----------------------------CQESDAPSPMF-TPNNPNSPKIWTENWTG 248
+E+ A F T + P + E W G
Sbjct: 184 LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF WG +R DLA V G N YM+HGGTNFG R +
Sbjct: 244 WFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLPQV 301
Query: 302 TSYDYDAPIDEYG 314
TSYDYDA + E G
Sbjct: 302 TSYDYDALLTEAG 314
>gi|251782093|ref|YP_002996395.1| beta-galactosidase [Streptococcus dysgalactiae subsp. equisimilis
GGS_124]
gi|242390722|dbj|BAH81181.1| beta-galactosidase precursor [Streptococcus dysgalactiae subsp.
equisimilis GGS_124]
Length = 594
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 164/688 (23%), Positives = 280/688 (40%), Gaps = 163/688 (23%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG+ +LSG++HY R P W ++ K G + +ETYV WN HEP + + F G
Sbjct: 12 LDGKPFKILSGAVHYFRIIPDSWYRVLYNLKALGFNTVETYVPWNLHEPQKGHFCFEGLA 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
DL F+ Q+ GLY I+R PY+CAEW +GG P WL P +R+ +KV+++ + +
Sbjct: 72 DLEAFLDLAQNLGLYAIVRPSPYICAEWEFGGLPAWLLEEPC--RVRSRDKVYLDHVAAY 129
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+++ K +L +GG I++ Q+ENEYG+ D K Y+ M + I P
Sbjct: 130 YDVLLPKLAKRQL--DRGGNILMFQVENEYGSYGED-----KEYLRALKDMMLARGIEAP 182
Query: 218 WI-----------------------------MCQESDAPSPMFTPNNPNSPKIWTENWTG 248
+ Q + + + P + E W G
Sbjct: 183 LFTSDGAWESALEAGSLIEDNLLVTGNFGSKVSQNVASLRAFMSKHGKEWPMMCMEFWLG 242
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-------T 301
WF WG +R ++ + + G N YM+ GGTNFG +G
Sbjct: 243 WFNRWGEAIIRRDPQETVATIMDMIEQGSI--NLYMFCGGTNFGFMNGSSARLQKDLPQV 300
Query: 302 TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPA 361
TSYDYDA +DE G+ P + L K LK+ L + + P+
Sbjct: 301 TSYDYDALLDEAGN---PTLKY-SLLQKALKATYPDLAFAEPLVS-------------PS 343
Query: 362 WSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHF 421
++ +P T KV+ T +K N +G + + + P+ +
Sbjct: 344 MAIGPIP-------LTQKVSLLTTLK----NVSG-----ITFSFYPQSM----------- 376
Query: 422 ALNTLIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVD 481
++ N Y+ Y + D LRI + + +++ +
Sbjct: 377 --------EALNHSLGYMLYRSRLSKYGDQE--------RLRIIDARDRVQVFLDDRRIQ 420
Query: 482 SQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAG 541
+Q+ + + L + L++ ++Q+++L +G +YG K P+ G +GR
Sbjct: 421 TQYQEDIGKDIL----MTLSKQRSQLTILVENMGRVSYGHKL-TAPSQHKG----LGRG- 470
Query: 542 DETIIKDLSSHKWTYKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTFEAP 601
++ DL + +G + F + +GW + +P ++Y F+
Sbjct: 471 ---VMSDL------HFIGQWEQIPLDFQELSWLDFSQGW-IEGLP-----SFYAYDFDCQ 515
Query: 602 LENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCG 661
+ D + +L GKG A +NG+NLGR+W +GP S
Sbjct: 516 VPTDTYI-DLSQFGKGIALINGFNLGRFWQ----------------KGPILS-------- 550
Query: 662 NPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
++P+ ++ G N LV+FE G
Sbjct: 551 ------LYLPKGLLQKGKNRLVIFETEG 572
>gi|307289489|ref|ZP_07569436.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
gi|422703871|ref|ZP_16761687.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
gi|306499556|gb|EFM68926.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
gi|315164595|gb|EFU08612.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
Length = 593
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 106/313 (33%), Positives = 151/313 (48%), Gaps = 46/313 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 13 LNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGMK 72
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
++ F++ + L VILR Y+CAEW +GG P WL + LR+T+ +FM +++N+
Sbjct: 73 NIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKSV-RLRSTDPIFMTKVRNY 131
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG K+Y+ ++ L I VP
Sbjct: 132 FQVL--LPKLAPLQITQGGPVIMMQVENEYGS----YG-MEKAYLRQTKQIMEELGIEVP 184
Query: 218 WIM----------------------------CQESDAPSPMF-TPNNPNSPKIWTENWTG 248
+E+ A F T + P + E W G
Sbjct: 185 LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYWDG 244
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF WG +R DLA V G N YM+HGGTNFG R +
Sbjct: 245 WFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLPQV 302
Query: 302 TSYDYDAPIDEYG 314
TSYDYDA + E G
Sbjct: 303 TSYDYDALLTEAG 315
>gi|222616996|gb|EEE53128.1| hypothetical protein OsJ_35926 [Oryza sativa Japonica Group]
Length = 314
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 94/224 (41%), Positives = 118/224 (52%), Gaps = 28/224 (12%)
Query: 595 KTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSD 654
+T F P DPV ++L MGKG AWVNG+ +GRYW + +A E GCS+ SC Y G Y
Sbjct: 83 ETMFSTPKGTDPVAIDLGSMGKGQAWVNGHLIGRYW-SLVAPESGCSS-SCYYPGAYNER 140
Query: 655 KCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHEN-- 712
KC NCG P+Q WYH+PR W+K+ N LVLFEE GG+PS I+ + T C + EN
Sbjct: 141 KCQSNCGMPTQNWYHIPREWLKESDNLLVLFEETGGDPSLISLEAHYAKTVCSRISENYY 200
Query: 713 --------------------KTMELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEI 751
+ L C G ISEI +AS+G P G C F KG+C A
Sbjct: 201 PPLSAWSHLSSGRASVNAATPELRLQCDDGHVISEITFASYGTPSGGCLNFSKGNCHAS- 259
Query: 752 DVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
L L+ + CVG C+I S G G +K L VEA C
Sbjct: 260 STLDLVTEACVGNTKCAISVSNDVFGDP--CRGVLKDLAVEAKC 301
>gi|77554857|gb|ABA97653.1| Galactose binding lectin domain containing protein, expressed
[Oryza sativa Japonica Group]
Length = 317
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 94/224 (41%), Positives = 118/224 (52%), Gaps = 28/224 (12%)
Query: 595 KTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSD 654
+T F P DPV ++L MGKG AWVNG+ +GRYW + +A E GCS+ SC Y G Y
Sbjct: 83 ETMFSTPKGTDPVAIDLGSMGKGQAWVNGHLIGRYW-SLVAPESGCSS-SCYYPGAYNER 140
Query: 655 KCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFGGNPSQINFQTVVVGTACGQAHEN-- 712
KC NCG P+Q WYH+PR W+K+ N LVLFEE GG+PS I+ + T C + EN
Sbjct: 141 KCQSNCGMPTQNWYHIPREWLKESDNLLVLFEETGGDPSLISLEAHYAKTVCSRISENYY 200
Query: 713 --------------------KTMELTC-HGRRISEIKYASFGDPQGACGAFKKGSCEAEI 751
+ L C G ISEI +AS+G P G C F KG+C A
Sbjct: 201 PPLSAWSHLSSGRASVNAATPELRLQCDDGHVISEITFASYGTPSGGCLNFSKGNCHAS- 259
Query: 752 DVLPLIEKQCVGKKSCSIEASEANLGATSCAAGTVKRLVVEALC 795
L L+ + CVG C+I S G G +K L VEA C
Sbjct: 260 STLDLVTEACVGNTKCAISVSNDVFGDP--CRGVLKDLAVEAKC 301
>gi|395816938|ref|XP_003781939.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase [Otolemur
garnettii]
Length = 669
Score = 164 bits (416), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 120/350 (34%), Positives = 172/350 (49%), Gaps = 52/350 (14%)
Query: 20 LFNLSL-AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETY 78
LFN SL +++ + DG+ +SGSIHY R W D + K K GL+AI+TY
Sbjct: 24 LFNASLKTFKIDYSRDRFLKDGQPFRYISGSIHYSRLPRFYWKDRLLKMKMAGLNAIQTY 83
Query: 79 VFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMP 138
V WN HEP +Y F+ + D+ FI+ + GL VILR GPY+CAEW+ GG P WL
Sbjct: 84 VPWNFHEPQPGKYQFSEDHDVEYFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKE 143
Query: 139 GIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS---DY- 194
+ LR+++ ++ + + L V + K + L GGPII Q+ENEYG+ + DY
Sbjct: 144 SM-ILRSSDPDYLAAVDKW--LGVLLPKMKPLLYQNGGPIISVQVENEYGSYFTCDHDYM 200
Query: 195 -----------GD---------AGKSYINWCA--------KMATSLDIGVPWIMCQESDA 226
GD + Y+N A T ++I + + ++S+
Sbjct: 201 RFLLKRFRYYLGDDVVLFTTDGIFEKYLNCGALQGLYATVDFGTGVNITAAFKLQRKSE- 259
Query: 227 PSPMFTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYH 286
P P I +E +TGW WG ED+AF++ G + N YM+
Sbjct: 260 ---------PKGPLINSEFYTGWLDHWGQPHSTVKTEDVAFSLFDILARGASV-NLYMFT 309
Query: 287 GGTNFGRTSGG--PYLT--TSYDYDAPIDEYGHLNQPKWGHLRELHKLLK 332
GGTNF +G PY TSYDYDAP+ E G L + K+ LR + + K
Sbjct: 310 GGTNFAYWNGANIPYSAQPTSYDYDAPLSEAGDLTE-KYFALRSVIQKFK 358
>gi|413954159|gb|AFW86808.1| putative RAN GTPase activating family protein [Zea mays]
Length = 449
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 90/234 (38%), Positives = 132/234 (56%), Gaps = 36/234 (15%)
Query: 314 GHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGN----------------------- 350
G++ QPK+GHL++LH L++SMEK L +G +T YG
Sbjct: 200 GNIRQPKYGHLKDLHDLIRSMEKILVHGKYNDTSYGKNAIVTKYTYGGSSVCFINNQFVD 259
Query: 351 -----SVSGSSYNLPAWSVSILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKW 405
++ G ++ +PAWSVSILPDCKT +NTAK+ TQT+V VK+ N + L+W W
Sbjct: 260 RDVKVTLGGGTHLVPAWSVSILPDCKTVAYNTAKIKTQTSVMVKKANSVEKELEALRWSW 319
Query: 406 RPEMINDFVVRGKGHFALNTLIDQKSTN-DVSDYLWYMTNADLKDDDPILSGSSNMTLRI 464
PE + F+ + F + L++Q +T+ D SDYLWY T+ + K G + TL +
Sbjct: 320 MPENLKPFMTDHRDSFRQSQLLEQIATSTDQSDYLWYRTSLEHK-------GEGSYTLYV 372
Query: 465 NSSGQVLHAYVNGNYVDSQWTKYGASNDLFERPVKLTRGKNQISLLSATVGLQN 518
N+SG ++ +VNG V ++ GA + PVKL GKN +SLLS TVGL++
Sbjct: 373 NTSGHEMYVFVNGRLVGQNYSADGAFVFQLQSPVKLHSGKNYVSLLSGTVGLKS 426
>gi|344288159|ref|XP_003415818.1| PREDICTED: beta-galactosidase-like [Loxodonta africana]
Length = 570
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 112/342 (32%), Positives = 162/342 (47%), Gaps = 42/342 (12%)
Query: 26 AYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHE 85
+++ + DG+ +SGSIHY R W D + K K GL+AI+TY+ WN HE
Sbjct: 15 TFKIDDSRKCFLKDGQPFRYISGSIHYHRVPRFYWKDRLLKMKMAGLNAIQTYIPWNFHE 74
Query: 86 PLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRT 145
PL QY F+ + D+ FI+ + GL VILR GPY+CAEW+ GG P WL I LR+
Sbjct: 75 PLPGQYQFSDDHDVEHFIQLTHEIGLLVILRPGPYICAEWDMGGLPAWLLEKQSI-VLRS 133
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMS-DYG--------- 195
++ ++ + + L V + K + L GGPII Q+ENEYG+ + DY
Sbjct: 134 SDPYYLAAVDKW--LGVLLPKMKPLLYQNGGPIITVQVENEYGSYFTCDYDYLRFLQKCF 191
Query: 196 -------------DAGKSYINWCAKMA---TSLDIGVPWIMCQESDAPSPMFTPNNPNSP 239
D + + C + ++D G A P P
Sbjct: 192 HSHLGDDVLLFTTDGARESLLQCGTLQGLYATVDFGP----VSNITAAFQTQRRTEPRGP 247
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG-- 297
+ +E +TGW WG + + E + A+ G N YM+ GGTNF +G
Sbjct: 248 LVNSEFYTGWLDHWGQPHSRVSTEAVTSALYNMLALGANV-NLYMFTGGTNFAYWNGANT 306
Query: 298 PYLT--TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEKT 337
PY TSYDYDAP+ E G L + K+ +RE +++ EK
Sbjct: 307 PYAAQPTSYDYDAPLTEAGDLTE-KYFAVRE---IIRKFEKV 344
>gi|311264379|ref|XP_003130137.1| PREDICTED: galactosidase, beta 1-like 2 [Sus scrofa]
Length = 635
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 153/524 (29%), Positives = 225/524 (42%), Gaps = 63/524 (12%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+ GS+HY R W D + K K GL+ + TYV WN HEP R ++DF+GNLD+ FI
Sbjct: 62 IFGGSVHYFRVPRAYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDMEAFIL 121
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
+ GL+VILR GPY+C+E + GG P WL + +LRTT + F + + + M
Sbjct: 122 LAAEVGLWVILRPGPYICSEIDLGGLPSWLLQDSSM-KLRTTYEGFTKAVDLYFDHL--M 178
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQES 224
A+ L GGPII Q+ENEYG+ D +Y+ + K I +
Sbjct: 179 ARVVPLQYKNGGPIIAVQVENEYGSYNKD-----PAYMPYIKKALEDRGIVELLLTSDNE 233
Query: 225 DAPSP-----MFTPNNPNS------------------PKIWTENWTGWFKSWGGKDPKRT 261
D S + N S PK+ E WTGWF SWGG
Sbjct: 234 DGLSKGTVDGVLATINLQSQNELRLLHNFLQSVQGVRPKMVMEYWTGWFDSWGGPHHILD 293
Query: 262 AEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYDYDAPIDEYGH 315
++ V+ G + N YM+HGGTNFG +G + TSYDYDA + E G
Sbjct: 294 TSEVLRTVSAIIDAGASI-NLYMFHGGTNFGFINGAMHFQDYMSDVTSYDYDAVLTEAGD 352
Query: 316 LNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVSILPDCKTEEF 375
PK+ LREL + L + Y V ++ L W LP K
Sbjct: 353 YT-PKYIRLRELFGSISGASLPLPPDLLPKVRYEPVV--PAFYLSLWDA--LPYIKEPVT 407
Query: 376 NTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNTLIDQKSTNDV 435
+ VN + N+ + N + I +VR +G LNT +
Sbjct: 408 SEKPVNME-NLPINDGNGQSFGYTLYETTITSSGILSALVRDRGQVFLNT--------ET 458
Query: 436 SDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWTKYGASNDLFE 495
+L Y T K P++ G + + + + + G+V + G +D+Q + G D++
Sbjct: 459 IGFLDYKTK---KIPIPLIQGFTILRILVENCGRVNY----GENIDNQ--RKGLIGDIYL 509
Query: 496 RPVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPG-PVLLVG 538
L + K + + + K+D VP G+P P +G
Sbjct: 510 NDTPLKKFKIYSLDMKKSFFQRFTAEKWDPVP-GVPTLPAFFLG 552
Score = 46.6 bits (109), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 78/197 (39%), Gaps = 49/197 (24%)
Query: 497 PVKLTRGKNQISLLSATVGLQNYGSKFDMVPNGIPGPVLLVGRAGDETIIKDLSSHKWTY 556
P+ L +G + +L G NYG D G+ G + L ++T +K
Sbjct: 470 PIPLIQGFTILRILVENCGRVNYGENIDNQRKGLIGDIYL-----NDTPLK--------- 515
Query: 557 KVGLYGLDDKKFYNAKAANSERGWSSKNVP-LNRRMTWYKTTFEAPLENDPVVLNLQGMG 615
K +Y LD KK + + +E+ VP L + P + + L+G
Sbjct: 516 KFKIYSLDMKKSFFQRFT-AEKWDPVPGVPTLPAFFLGALSVTSFPYDT---FVKLEGWE 571
Query: 616 KGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWI 675
KG +VNGYNLGRYW N G P + Y +P W+
Sbjct: 572 KGVVFVNGYNLGRYW----------------------------NIG-PQETLY-LPGVWL 601
Query: 676 KDGVNTLVLFEEFGGNP 692
+G+N +++FEE P
Sbjct: 602 NEGINQVIVFEEMMQGP 618
>gi|344291571|ref|XP_003417508.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
3-like [Loxodonta africana]
Length = 770
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 112/329 (34%), Positives = 159/329 (48%), Gaps = 44/329 (13%)
Query: 28 RVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPL 87
R+ T++G + ++ GSIHY R W D + K K G + + TYV WN HEP
Sbjct: 192 RMGRGKPHFTLEGHKFLIFGGSIHYFRVPRAYWRDRLLKLKACGFNTLTTYVPWNLHEPE 251
Query: 88 RRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTN 147
R ++DF+GNLDL FI + GL+VILR GPY+C+E + GG P WL P +L +
Sbjct: 252 RGKFDFSGNLDLEAFIWMAAELGLWVILRPGPYICSEIDLGGLPSWLLQDP---DLNWRH 308
Query: 148 KVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAK 207
+ + F LI + L +GGPII Q+ENEYG+ D K Y+ + +
Sbjct: 309 TXLVTQXSLFDHLI---PRVVPLQYHRGGPIIAVQVENEYGSYNKD-----KDYMPYVQQ 360
Query: 208 MATSLDIGVPWIMCQES-----------------------DAPSPMFTPNNPNSPKIWTE 244
I V ++ ++ DA S + P + E
Sbjct: 361 ALLQRGI-VELLLTSDNERDVLKGYIKGVLATVNMKTLSRDAFS-LLNKAQSEKPIMIME 418
Query: 245 NWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL---- 300
W GWF +WG + R A+++ V F + +F N YM+HGGTNFG +G YL
Sbjct: 419 FWVGWFDTWGNQHFLRDAKEVEHTVLEFIKAEISF-NAYMFHGGTNFGFMNGATYLGKHR 477
Query: 301 --TTSYDYDAPIDEYGHLNQPKWGHLREL 327
TSYDYDA + E G + K+ LR+L
Sbjct: 478 GVVTSYDYDAVLTEAGDYTE-KYFKLRKL 505
>gi|295113973|emb|CBL32610.1| Beta-galactosidase [Enterococcus sp. 7L76]
Length = 592
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 106/313 (33%), Positives = 151/313 (48%), Gaps = 46/313 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
++G+ ++SG+IHY R TP W D + K G + +ETY+ WN HEP YDF G
Sbjct: 12 LNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGMK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
++ F++ + L VILR Y+CAEW +GG P WL + LR+T+ +FM +++N+
Sbjct: 72 NIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKDV-RLRSTDPIFMTKVRNY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
++ + K L +QGGP+I+ Q+ENEYG+ YG K+Y+ ++ L I VP
Sbjct: 131 FQVL--LPKLAPLQITQGGPVIMIQVENEYGS----YG-MEKAYLRQTKQIMEELGIEVP 183
Query: 218 WIM----------------------------CQESDAPSPMF-TPNNPNSPKIWTENWTG 248
+E+ A F T + P + E W G
Sbjct: 184 LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF WG +R DLA V G N YM+HGGTNFG R +
Sbjct: 244 WFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLPQV 301
Query: 302 TSYDYDAPIDEYG 314
TSYDYDA + E G
Sbjct: 302 TSYDYDALLTEAG 314
>gi|347967091|ref|XP_001689312.2| AGAP002056-PA [Anopheles gambiae str. PEST]
gi|333469762|gb|EDO63217.2| AGAP002056-PA [Anopheles gambiae str. PEST]
Length = 629
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 112/345 (32%), Positives = 162/345 (46%), Gaps = 45/345 (13%)
Query: 27 YRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEP 86
+ + +D +DG+ ++GS HY R+ P WP +++ + GL+AI TYV W+ H P
Sbjct: 26 FSIDYDNDTFVMDGKPFQYVAGSFHYFRALPESWPSILRSMRAAGLNAITTYVEWSLHNP 85
Query: 87 LRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWL-HNMPGIEELRT 145
Y++ G D+ F++ GLYVILR GPY+CAE + GGFP WL H P I LRT
Sbjct: 86 KEDVYNWQGMADIEHFLELADSAGLYVILRPGPYICAERDMGGFPSWLLHKYPDIL-LRT 144
Query: 146 TNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWC 205
+ ++ E++ + + +++ ++ QGGPII+ Q+ENEYG+ + Y+NW
Sbjct: 145 NDLRYLREVRTWYAQL--LSRVQRFLVGQGGPIIMVQVENEYGSFYA----CDHKYLNWL 198
Query: 206 -----------AKMATSLDIGVPWIMCQESDAPSPMFTP---------------NNPNSP 239
A + T+ G+ E S F P P P
Sbjct: 199 RDETERYVMGNAVLFTNNGPGLEGCGAIEHVLSSLDFGPGTEDEINGFWSTLRKTQPKGP 258
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSG--- 296
+ E + GW W RT F N YM+ GGTN+G T+G
Sbjct: 259 LVNAEYYPGWLTHWQEPHMARTDTKPVVDSLDFMLRNKVNVNIYMFFGGTNYGFTAGANN 318
Query: 297 ---GPYLT--TSYDYDAPIDEYGHLNQPKWGHLRELHKLLKSMEK 336
G Y TSYDYDAP+DE G PK+ LR+ +LK K
Sbjct: 319 MGAGGYAADLTSYDYDAPLDESGD-PTPKYFALRD--TILKYFPK 360
>gi|312866933|ref|ZP_07727144.1| putative beta-galactosidase [Streptococcus parasanguinis F0405]
gi|311097415|gb|EFQ55648.1| putative beta-galactosidase [Streptococcus parasanguinis F0405]
Length = 595
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 106/323 (32%), Positives = 154/323 (47%), Gaps = 59/323 (18%)
Query: 35 AITIDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFT 94
A + G+ +LSG+IHY R P W + K G + +ETY+ WNAHEP + Q+DF+
Sbjct: 9 AFYLKGQPFKILSGAIHYFRIDPADWYHSLYNLKALGFNTVETYIPWNAHEPRKGQFDFS 68
Query: 95 GNLDLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEE---LRTTNKVFM 151
G LDL RFI+T Q GLY+I+R P++CAEW +GG P WL +EE +R+++ F+
Sbjct: 69 GRLDLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWL-----LEEDLRIRSSDPAFI 123
Query: 152 NEMQNFTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATS 211
+ + ++ + ++ +GGPI++ Q+ENEYG+ D K Y+ +
Sbjct: 124 EAVDRYYDRLLGLLTPYQV--DRGGPILMMQVENEYGSYGED-----KDYLRAIRDLMKE 176
Query: 212 LDIGVPWIMCQESDAP------------SPMFTPNNPNS--------------------P 239
+ P SD P +F N S P
Sbjct: 177 KGVTCPLFT---SDGPWRATLRAGTLIEEDLFVTGNFGSKAAYNFGQMKEFFDEYGKRWP 233
Query: 240 KIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY 299
+ E W GWF W +R E+LA AV + G N YM+HGGTNFG +G
Sbjct: 234 LMCMEFWDGWFTRWKEPVIQRDPEELAEAVHEVLELGSI--NLYMFHGGTNFGFMNGCSA 291
Query: 300 L-------TTSYDYDAPIDEYGH 315
TSYDY A ++E G+
Sbjct: 292 RGTLDLPQVTSYDYGALLNEQGN 314
>gi|406657850|ref|ZP_11065990.1| family 35 glycosyl hydrolase [Streptococcus iniae 9117]
gi|405578065|gb|EKB52179.1| family 35 glycosyl hydrolase [Streptococcus iniae 9117]
Length = 594
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 166/691 (24%), Positives = 279/691 (40%), Gaps = 183/691 (26%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+LSG+IHY R PG W + K G + +ETYV WN HEP R +++F G DL +F+
Sbjct: 19 ILSGAIHYFRLAPGSWYKSLYNLKALGFNTVETYVPWNLHEPQRGKFNFEGLADLEKFLD 78
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
Q+ GLY I+R PY+CAEW +GG P WL + +R+ + ++ ++++ +++
Sbjct: 79 LAQEMGLYAIVRPTPYICAEWEFGGLPAWL--LKENVRVRSHDAKYLAFVKDYYQVLLPK 136
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQES 224
K ++ SQGG I++ Q+ENEYG+ YG+ K Y+ +M I VP S
Sbjct: 137 LVKRQI--SQGGNILMFQVENEYGS----YGE-DKQYLKQLMQMMREFGISVPLFT---S 186
Query: 225 DAP--------------------------------SPMFTPNNPNSPKIWTENWTGWFKS 252
D P ++ P + E W GWF
Sbjct: 187 DGPWQSALQAGSLIDEDVLVTGNFGSQSKANFSNLRAFLDAHDKKWPLMCMEFWVGWFNR 246
Query: 253 WGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL-------TTSYD 305
W +R +++ A+ + G N YM+HGGTNFG +G TSYD
Sbjct: 247 WKEPVIRRDPKEMVDAIMEVLEEGSI--NLYMFHGGTNFGFMNGSSARLQEDLPQVTSYD 304
Query: 306 YDAPIDEYGHLNQPKWGHLRELHKLLKSMEKTLTYGNVTNTDYGNSVSGSSYNLPAWSVS 365
YDA +DE G+ + K+ L+E S++K + DY + + + ++S
Sbjct: 305 YDAILDEAGNPTK-KYFLLQE------SLKKAF-----PDLDYSTPLYNETIEIKDIALS 352
Query: 366 ILPDCKTEEFNTAKVNTQTNVKVKRPNQAGNDQAPLQWKWRPEMINDFVVRGKGHFALNT 425
KVN + + + + ++ P+
Sbjct: 353 ------------EKVNLVSTLDAI---------SQMHEEYYPQ----------------- 374
Query: 426 LIDQKSTNDVSDYLWYMTNADLKDDDPILSGSSNMTLRINSSGQVLHAYVNGNYVDSQWT 485
+ + N + Y++Y T L D P LR+ + +++ ++V +Q+
Sbjct: 375 --NMEELNQQTGYIFYRTT--LPKDSP------KECLRLIDARDRAQVFLDHHFVTTQY- 423
Query: 486 KYGASNDLFERPVKLTRGKNQISLLSATVGLQNYGSKFDMVPN------GIPGPVLLVGR 539
++ D+F ++ K+Q+ +L +G YG K V G+ + VG
Sbjct: 424 QFEIGEDIF---IEQNSEKSQLDVLVENMGRVCYGHKLTSVTQRKGLGRGLMANLHFVG- 479
Query: 540 AGDETIIKDLSSHKWT-YKVGLYGLDDKKFYNAKAANSERGWSSKNVPLNRRMTWYKTTF 598
+W Y + L +++ F + + + G S +Y F
Sbjct: 480 -------------EWQHYALPLESVENVDF----SGDYQEGLPS----------FYAFDF 512
Query: 599 EAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYLAEEDGCSTESCDYRGPYGSDKCAY 658
+ D L+L GKG A++N NLGR+W GP+ S
Sbjct: 513 NCDIIGD-TYLDLTSFGKGVAFINNINLGRFWNV----------------GPHLS----- 550
Query: 659 NCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
++P ++ G N +V+FE G
Sbjct: 551 ---------LYIPADFLTQGKNRIVIFETEG 572
>gi|424786841|ref|ZP_18213614.1| beta-galactosidase jelly roll domain protein [Streptococcus
intermedius BA1]
gi|422114356|gb|EKU18061.1| beta-galactosidase jelly roll domain protein [Streptococcus
intermedius BA1]
Length = 595
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 109/326 (33%), Positives = 152/326 (46%), Gaps = 58/326 (17%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+LSG+IHY R P W + K G + +ETY+ WNAHEP++ Q+DF G LD+ +F++
Sbjct: 19 ILSGAIHYFRIQPDDWYHSLYNLKALGFNTVETYIPWNAHEPMKGQFDFEGILDVEKFLQ 78
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWL--HNMPGIEELRTTNKVFMNEMQNFTTLIV 162
T QD GLYV+LR PY+CAEW +GG P WL NM +R+++ ++ + N+ ++
Sbjct: 79 TAQDLGLYVLLRSSPYICAEWEFGGLPAWLLEENM----RIRSSDPAYLAAVANYYDALL 134
Query: 163 DMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQ 222
L GG I++ Q+ENEYG+ D K Y+ M + P
Sbjct: 135 PRLVPHLL--ENGGSILMMQVENEYGSYGED-----KEYLRAVRDMMLERGVTCPLFT-- 185
Query: 223 ESDAP--------------------------------SPMFTPNNPNSPKIWTENWTGWF 250
SD P F + P + E W GWF
Sbjct: 186 -SDGPWRGTLRAGTLIEDDVFVTGNFGSKANENFAQMQEFFDEHGKKWPLMCMEFWDGWF 244
Query: 251 KSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLTTS 303
W R E+LA AV Q G N YM+HGGTNFG R S TS
Sbjct: 245 NRWKEPTVTRDPEELAEAVHEVLQQGSI--NLYMFHGGTNFGFMNGCSARGSIDLPQVTS 302
Query: 304 YDYDAPIDEYGHLNQPKWGHLRELHK 329
YDY+A +DE G+ PK+ ++ + K
Sbjct: 303 YDYEALLDEQGN-PTPKYFAIQRMLK 327
>gi|325261840|ref|ZP_08128578.1| glycosyl hydrolase, family 35 [Clostridium sp. D5]
gi|324033294|gb|EGB94571.1| glycosyl hydrolase, family 35 [Clostridium sp. D5]
Length = 581
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 109/322 (33%), Positives = 156/322 (48%), Gaps = 45/322 (13%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
ID ++ ++SG +HY R W D + K K G + +ETY+ WN HE + ++ F GNL
Sbjct: 12 IDNQKVKIISGGVHYFRIMAEYWKDCLLKLKAFGCNTVETYIPWNLHEKEKGEFCFEGNL 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+ +F+ +D GLYVILR PY+CAEW +GG P WL G+ LR + K F+ ++ +
Sbjct: 72 DITKFVHIAKDLGLYVILRPSPYICAEWEFGGLPYWLLKEDGM-RLRCSYKPFLKHVEEY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYG-------------NVMSDYG--------D 196
+ ++ L ++GGP+I+ Q+ENEYG + M YG D
Sbjct: 131 YHRLFEVIA--PLQYTKGGPVIMMQVENEYGYYGNDTLYLKTLQDFMVSYGCEVPLVTSD 188
Query: 197 AGKSYINWCAKMATSLDIGVPWIMCQESDAPSPMFTPNNPNSPKIWTENWTGWFKSWG-- 254
C K+ L G +S + N P + E W GWF SWG
Sbjct: 189 GPWGDAFDCGKLEGVLQTGN---FGSKSRQQLQIMRDKIGNKPLMCMEFWVGWFDSWGQT 245
Query: 255 ---GKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------TTSYD 305
+DP + AE+L + G N YM+ GGTNFG +G Y TSYD
Sbjct: 246 EHKQEDPNKNAENL----DEILESGHV--NIYMFMGGTNFGFMNGSNYYDVLTPDVTSYD 299
Query: 306 YDAPIDEYGHLNQPKWGHLREL 327
YDA + E G L PK+ L+ +
Sbjct: 300 YDALLTEAGDLT-PKYELLKNV 320
>gi|326332570|ref|ZP_08198838.1| beta-galactosidase [Nocardioidaceae bacterium Broad-1]
gi|325949571|gb|EGD41643.1| beta-galactosidase [Nocardioidaceae bacterium Broad-1]
Length = 603
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 103/307 (33%), Positives = 149/307 (48%), Gaps = 46/307 (14%)
Query: 45 LLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIK 104
+LSGS+HY R P +W D +++ G + ++TYV WN HEP DFTG DL RF+
Sbjct: 21 ILSGSVHYFRIHPDLWEDRLRRVAATGFNTVDTYVAWNFHEPDEGSPDFTGPRDLARFVT 80
Query: 105 TIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDM 164
D GL VI+R GPY+CAEW GG P WL R+++ V+ + + + L V +
Sbjct: 81 IAGDLGLDVIVRPGPYICAEWTNGGLPSWLTAR--TRAPRSSDPVYQDAVTRW--LDVLL 136
Query: 165 AKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQES 224
+ L A GGP++ Q+ENEYG+ YGD +++ W + LD GV ++ +
Sbjct: 137 PRLVPLQAGHGGPVVAVQLENEYGS----YGD-DAAHLVWLRQAL--LDRGVTELLYT-A 188
Query: 225 DAPSPM--------------------------FTPNNPNSPKIWTENWTGWFKSWGGKDP 258
D P+ + + P P + E W GWF WG
Sbjct: 189 DGPTDVMLDAGMVEGTLAAATFGSRATEAATKLSARRPGEPFLCAEFWNGWFDHWGENHH 248
Query: 259 KRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPY-------LTTSYDYDAPID 311
R+ E A + GG+ + YM HGGTNFG +G + TSYD DAP+
Sbjct: 249 VRSPESAAATLREIVDLGGSV-SVYMAHGGTNFGLWAGSNHDGRRIQPTVTSYDSDAPVG 307
Query: 312 EYGHLNQ 318
E G +++
Sbjct: 308 EDGRVSE 314
>gi|420143773|ref|ZP_14651269.1| Beta-galactosidase 3 [Lactococcus garvieae IPLA 31405]
gi|391856250|gb|EIT66791.1| Beta-galactosidase 3 [Lactococcus garvieae IPLA 31405]
Length = 597
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 106/314 (33%), Positives = 147/314 (46%), Gaps = 46/314 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+D E ++SG+IHY R W D + K G + +ETY+ WN HEP +DF G
Sbjct: 12 LDNEPVKIISGAIHYFRIPQSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVFDFEGMK 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNF 157
D+ F+K + GL VILR Y+CAEW +GG P WL P + LR+T+ FM +++N+
Sbjct: 72 DIRAFVKLAESLGLMVILRPSVYICAEWEFGGLPAWLLKEPEM-RLRSTDSRFMTKVENY 130
Query: 158 TTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVP 217
+++ ++ A GGP+I+ Q+ENEYG+ YG K Y+ + I VP
Sbjct: 131 FKVLLPYISPLQITA--GGPVIMMQVENEYGS----YG-MEKEYLRQTMALMKKYGINVP 183
Query: 218 WIM----------------------------CQESDAPSPMFTPNNPNS-PKIWTENWTG 248
+E+ A F + P + E W G
Sbjct: 184 LFTSDGAWQAALDAGSLIEDDVLVTGNFGSRSKENAAVLAGFMKEHGKKWPLMCMEYWDG 243
Query: 249 WFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFG-------RTSGGPYLT 301
WF WG KR +DLA V + G N YM+HGGTNFG R +G
Sbjct: 244 WFNRWGEPIIKREPQDLADEVKTMLELGSL--NLYMFHGGTNFGFYNGCSARDTGNLPQI 301
Query: 302 TSYDYDAPIDEYGH 315
TSYDYDA + E G
Sbjct: 302 TSYDYDALLTEAGE 315
>gi|225407896|ref|ZP_03761085.1| hypothetical protein CLOSTASPAR_05117 [Clostridium asparagiforme
DSM 15981]
gi|225042575|gb|EEG52821.1| hypothetical protein CLOSTASPAR_05117 [Clostridium asparagiforme
DSM 15981]
Length = 590
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 106/331 (32%), Positives = 156/331 (47%), Gaps = 49/331 (14%)
Query: 38 IDGERKILLSGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNL 97
+DG LLSG++HY R P W D + K G + +ETY+ WN HEP ++DF+G+
Sbjct: 12 LDGRPVKLLSGAVHYFRLMPEYWEDCLYNLKAMGFNTVETYIPWNIHEPEEGEFDFSGSR 71
Query: 98 DLIRFIKTIQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQN- 156
D+ F++ GL+VILR P++CAEW GG P WL P + ++RT +F+ +++
Sbjct: 72 DVEAFVRLAGSMGLHVILRPSPFICAEWEMGGLPAWLLRYPDM-KVRTNTPLFLVKVEAY 130
Query: 157 FTTLIVDMAKKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGV 216
+ L +A L ++GGP+IL Q+ENEYG+ +D K Y+ + V
Sbjct: 131 YRELFRHIA---DLQITRGGPVILMQVENEYGSFGND-----KEYLRRIKSLMERFGAEV 182
Query: 217 PWIMCQES-----------------------------DAPSPMFTPNNPNSPKIWTENWT 247
P+ S D F + P + E W
Sbjct: 183 PFFTSDGSWDAALEAGSLIEDGVLATANFGSRSDENLDVLEAFFKRHGRKWPLMCMEFWD 242
Query: 248 GWFKSWGGKDPKRTAEDLAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGGPYL------- 300
GWF W K R AEDLA V + + N YM+ GGTNFG +G
Sbjct: 243 GWFNRWREKIITRDAEDLAMEVRQLLERASI--NLYMFQGGTNFGFYNGCSARGYTDLPQ 300
Query: 301 TTSYDYDAPIDEYGHLNQPKWGHLRELHKLL 331
TSY+YDA + E+G + K+ +RE+ + L
Sbjct: 301 ITSYNYDAILTEWGQPTE-KFYQVREVIREL 330
Score = 43.5 bits (101), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 56/235 (23%), Positives = 93/235 (39%), Gaps = 61/235 (25%)
Query: 457 SSNMTLRINSSGQVLHAYVNGNYVDSQW-TKYGASNDLFERPVKLTRGKNQISLLSATVG 515
+ M ++ + + Y+NG + +Q+ G +LF P +N++ LL +G
Sbjct: 397 NRKMKVKAVQASDRVQYYLNGMFEGTQYQNNSGEELELFFGP------ENRLDLLVENMG 450
Query: 516 LQNYGSKFDMVPNGIPGPVLLVGRAGDET-IIKDLSSHKWTYKVGLYGLDDKKFYNAKAA 574
NYG K + P R G T ++ D+ + L LD N
Sbjct: 451 RVNYGYK-------LQAPT---QRKGIRTGVMVDIHFESGWEQYAL-PLD-----NVNRV 494
Query: 575 NSERGWSSKNVPLNRRMTWYKTTFEAPLENDPVVLNLQGMGKGFAWVNGYNLGRYWPTYL 634
+ E+ W ++ P +Y+ F+ D LN + +GKG A++NG+NLGRYW
Sbjct: 495 DFEKEWI-QDTP-----AFYRYEFQVDQPKD-TFLNCRELGKGVAFINGFNLGRYWSE-- 545
Query: 635 AEEDGCSTESCDYRGPYGSDKCAYNCGNPSQIWYHVPRSWIKDGVNTLVLFEEFG 689
P Q Y +P +++G N L++FE G
Sbjct: 546 ---------------------------GPVQYLY-IPAPLLREGKNELIVFETEG 572
>gi|167524869|ref|XP_001746770.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775040|gb|EDQ88666.1| predicted protein [Monosiga brevicollis MX1]
Length = 600
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 108/301 (35%), Positives = 154/301 (51%), Gaps = 36/301 (11%)
Query: 47 SGSIHYPRSTPGMWPDLIKKAKEGGLDAIETYVFWNAHEPLRRQYDF-TGNLDLIRFIKT 105
SGS+HY R W D ++ AK GL+ I TYV WN HE +DF T DL RF+
Sbjct: 70 SGSLHYFRIPAEYWLDRLEMAKHMGLNTISTYVPWNFHEVGPGSFDFETHAHDLARFLNL 129
Query: 106 IQDQGLYVILRIGPYVCAEWNYGGFPVWLHNMPGIEELRTTNKVFMNEMQNFTTLIVDMA 165
+ GL V++R PY+CAEW++GG P L P + ELR++N F++E++ + ++ +
Sbjct: 130 AHEVGLRVLIRPSPYICAEWDFGGLPARLMANPDL-ELRSSNDAFLDEVERYYDALMPIL 188
Query: 166 KKEKLFASQGGPIILAQIENEYGNVMSDYGDAGKSYINWCAKMATSLDIGVPWIMCQESD 225
+ L AS GGPII +ENEYG+ YG A + Y+ M I C +
Sbjct: 189 R--PLQASNGGPIIAFYVENEYGS----YG-ADRDYLQALVAMMRDRGIVEQMFTCDNAQ 241
Query: 226 A------PSPMFTPN---------------NPNSPKIWTENWTGWFKSWGGKDPKRTAED 264
P + T N P+ P + +E WTGWF G + +ED
Sbjct: 242 GLSRGALPGALQTINFQDNVERHLDQLAHFQPDQPLMVSEYWTGWFDHDGEEHHTFDSED 301
Query: 265 LAFAVARFFQFGGTFQNYYMYHGGTNFGRTSGG--PYL--TTSYDYDAPIDEYGHLNQPK 320
L + + G +F N Y++HGGT+FG +G PY TSYDYDAP+ E+G + PK
Sbjct: 302 LVEGLQKILDRGASF-NLYVFHGGTSFGWNAGANSPYAPDITSYDYDAPLSEHGQVT-PK 359
Query: 321 W 321
+
Sbjct: 360 Y 360
>gi|312378199|gb|EFR24839.1| hypothetical protein AND_10320 [Anopheles darlingi]
Length = 639
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 110/348 (31%), Positives = 172/348 (49%), Gaps = 46/348 (13%)
Query: 4 LKHCSRAILLCLILQTLFNLSLAYRVSHDGRAITIDGERKILLSGSIHYPRSTPGMWPDL 63
L+H + +CL + L ++ + ++ +DG+ ++GS HY R+ P W
Sbjct: 2 LRH-PIVLAVCLAIAGLAEAQRSFTIDYERDTFVMDGKDFRYVAGSFHYFRALPQTWRTK 60
Query: 64 IKKAKEGGLDAIETYVFWNAHEPLRRQYDFTGNLDLIRFIKTIQDQGLYVILRIGPYVCA 123
++ + GGL+A++ YV W+ H P Y + G ++ I+ ++ LYVILR GPY+CA
Sbjct: 61 LRTLRAGGLNAVDLYVQWSLHNPRDGVYSWEGIANVTDIIEAAIEEDLYVILRPGPYICA 120
Query: 124 EWNYGGFPVWLHN-MPGIEELRTTNKVFMNEMQNFTTLIVDMAKKEKLFASQGGPIILAQ 182
E + GG P WL N PGI ++RT++ ++ E++ + + M++ E GGPII+ Q
Sbjct: 121 EIDNGGLPYWLFNKYPGI-QVRTSDANYLAEVKKWYGEL--MSRMEPYMYGNGGPIIMVQ 177
Query: 183 IENEYGNVMSDYGDAGKSYINWCAK--------MATSLDIGVPW---IMC---------- 221
IENEYG +G K Y+N+ + A + P+ I C
Sbjct: 178 IENEYGA----FGKCDKPYLNFLKEETNRYVQDKAVLFTVDRPYDDEIGCGQIDGVFITT 233
Query: 222 -------QESDAPSPMFTPNNPNSPKIWTENWTGWFKSWGGKDPKRTAEDLAFAVARFFQ 274
+E D + P P + TE +TGW W + +R A LA + + +
Sbjct: 234 DFGLMTDEEVDTHAAKVRSYQPKGPLVNTEFYTGWLTHWQESNQRRPAGPLAATLRKMLK 293
Query: 275 FGGTFQNYYMYHGGTNFGRTSG------GPYLT--TSYDYDAPIDEYG 314
G ++YMY GGTNFG +G G Y+ TSYDYDAP+DE G
Sbjct: 294 DGWNV-DFYMYFGGTNFGFWAGANDWGLGKYMADITSYDYDAPMDEAG 340
Score = 43.1 bits (100), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 17/24 (70%), Positives = 20/24 (83%)
Query: 609 LNLQGMGKGFAWVNGYNLGRYWPT 632
LN+ G GKGF +VNG+NLGRYWP
Sbjct: 570 LNMAGWGKGFIFVNGFNLGRYWPV 593
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.135 0.429
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 13,983,348,384
Number of Sequences: 23463169
Number of extensions: 652549420
Number of successful extensions: 1286849
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2154
Number of HSP's successfully gapped in prelim test: 204
Number of HSP's that attempted gapping in prelim test: 1271907
Number of HSP's gapped (non-prelim): 5956
length of query: 795
length of database: 8,064,228,071
effective HSP length: 151
effective length of query: 644
effective length of database: 8,816,256,848
effective search space: 5677669410112
effective search space used: 5677669410112
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 81 (35.8 bits)