BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 004525
(747 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|359480881|ref|XP_003632537.1| PREDICTED: beta-galactosidase 3-like [Vitis vinifera]
gi|296082595|emb|CBI21600.3| unnamed protein product [Vitis vinifera]
Length = 847
Score = 1213 bits (3139), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 567/724 (78%), Positives = 638/724 (88%), Gaps = 3/724 (0%)
Query: 21 TYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFW 80
T A NVTYD RSLII+G+R+L+ISA+IHYPRSVPGMWPGLV+ AKEGG++ IE+YVFW
Sbjct: 16 TSSLAANVTYDRRSLIIDGQRKLLISASIHYPRSVPGMWPGLVKTAKEGGIDVIETYVFW 75
Query: 81 NGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTV 140
NGHELSP YYFGGR++L+KF+KI+QQARMY+ILR+GPFVAAE+N+GG+PVWLHY+PGTV
Sbjct: 76 NGHELSPDNYYFGGRYDLLKFVKIVQQARMYLILRVGPFVAAEWNFGGVPVWLHYVPGTV 135
Query: 141 FRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYA 200
FR ++EPFKYHMQKFMTLIV++MK+EKLFASQGGPIILAQVENEYG E YG+GGK YA
Sbjct: 136 FRTNSEPFKYHMQKFMTLIVNIMKKEKLFASQGGPIILAQVENEYGDTERIYGDGGKPYA 195
Query: 201 LWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFK 260
+WAA MA++QNIGVPWIMCQQ+D PDPVINTCNSFYCDQFTP+SP+ PK+WTENWPGWFK
Sbjct: 196 MWAANMALSQNIGVPWIMCQQYDAPDPVINTCNSFYCDQFTPNSPNKPKMWTENWPGWFK 255
Query: 261 TFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPID 320
TFG DPHRP EDIAFSVARFFQKGGS+ NYYMYHGGTNFGRT+GGPFITTSYDY APID
Sbjct: 256 TFGAPDPHRPHEDIAFSVARFFQKGGSLQNYYMYHGGTNFGRTSGGPFITTSYDYNAPID 315
Query: 321 EYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANM 380
EYGL R PKWGHLKELH AIK CEH LL GE NLSLG SQE DVY DSSG CAAF++N+
Sbjct: 316 EYGLARLPKWGHLKELHRAIKSCEHVLLYGEPINLSLGPSQEVDVYTDSSGGCAAFISNV 375
Query: 381 DDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPD 440
D+K DK +VF+NVSYH+PAWSVSILPDCK VVFNTA V +Q+S VEMVPE LQPS +
Sbjct: 376 DEKEDKIIVFQNVSYHVPAWSVSILPDCKNVVFNTAKVGSQTSQVEMVPEELQPSLVPSN 435
Query: 441 NGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNG 500
KGL+W+ F E AGIWGEADFVK+GFVDHINTTKDTTDYLWYT S+ V E+E FLK
Sbjct: 436 KDLKGLQWETFVEKAGIWGEADFVKNGFVDHINTTKDTTDYLWYTVSLTVGESENFLKEI 495
Query: 501 SRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQ 560
S+PVLL+ESKGHALHAF NQ+LQGSASGNG+H PFK++ PISLKAGKN+IALLSMTVGLQ
Sbjct: 496 SQPVLLVESKGHALHAFVNQKLQGSASGNGSHSPFKFECPISLKAGKNDIALLSMTVGLQ 555
Query: 561 NAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVST 620
NAGPFYEWVGAG+TSVKI G N+G +DLSTY+WTYKIGLQGEHL IY P N++ W+ST
Sbjct: 556 NAGPFYEWVGAGLTSVKIKGLNNGIMDLSTYTWTYKIGLQGEHLLIYKPEGLNSVKWLST 615
Query: 621 MEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDE 680
EPPK QPLTWYKAVV P G+EPIGLDM+ MGKGLAWLNGEEIGRYWP RKSS HD+
Sbjct: 616 PEPPKQQPLTWYKAVVDPPSGNEPIGLDMVHMGKGLAWLNGEEIGRYWP---RKSSIHDK 672
Query: 681 CVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
CVQECDYRGKF P+KC TGCGEP+QRWYH+PRSWFKPS NILVIFEEKGGDPTKI FS R
Sbjct: 673 CVQECDYRGKFMPNKCSTGCGEPTQRWYHVPRSWFKPSGNILVIFEEKGGDPTKIRFSRR 732
Query: 741 KISG 744
K +G
Sbjct: 733 KTTG 736
>gi|356508931|ref|XP_003523206.1| PREDICTED: beta-galactosidase 10-like [Glycine max]
Length = 843
Score = 1206 bits (3121), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 560/726 (77%), Positives = 635/726 (87%), Gaps = 6/726 (0%)
Query: 19 SITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYV 78
+ T +GNV+YD RSL+I+G+R+L+ISA+IHYPRSVP MWPGLVQ AKEGGV+ IE+YV
Sbjct: 13 TFTVALSGNVSYDGRSLLIDGQRKLLISASIHYPRSVPAMWPGLVQTAKEGGVDVIETYV 72
Query: 79 FWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPG 138
FWNGHELSPG YYFGGRF+LVKF K +QQA MY+ILRIGPFVAAE+N+GG+PVWLHY+PG
Sbjct: 73 FWNGHELSPGNYYFGGRFDLVKFAKTVQQAGMYLILRIGPFVAAEWNFGGVPVWLHYVPG 132
Query: 139 TVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKR 198
TVFR +PF YHMQKF T IV++MK+EKLFASQGGPIIL+Q+ENEYGYYE+FY E GK+
Sbjct: 133 TVFRTYNQPFMYHMQKFTTYIVNLMKQEKLFASQGGPIILSQIENEYGYYENFYKEDGKK 192
Query: 199 YALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGW 258
YALWAAKMAV+QN GVPWIMCQQ+D PDPVI+TCNSFYCDQFTP SP+ PKIWTENWPGW
Sbjct: 193 YALWAAKMAVSQNTGVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPNRPKIWTENWPGW 252
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAP 318
FKTFGGRDPHRP+ED+AFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDY+AP
Sbjct: 253 FKTFGGRDPHRPAEDVAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYDAP 312
Query: 319 IDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLA 378
+DEYGLPR PKWGHLKELH AIKLCEH LLNG+ N+SLG S EADVY DSSGACAAF++
Sbjct: 313 VDEYGLPRLPKWGHLKELHRAIKLCEHVLLNGKSVNISLGPSVEADVYTDSSGACAAFIS 372
Query: 379 NMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEAS 438
N+DDKNDKTV FRN SYHLPAWSVSILPDCK VVFNTA V +Q++ V M+PE+LQ S
Sbjct: 373 NVDDKNDKTVEFRNASYHLPAWSVSILPDCKNVVFNTAKVTSQTNVVAMIPESLQQS--- 429
Query: 439 PDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLK 498
D G LKW + KE GIWG+ADFVKSGFVD INTTKDTTDYLW+TTSI V+ENEEFLK
Sbjct: 430 -DKGVNSLKWDIVKEKPGIWGKADFVKSGFVDLINTTKDTTDYLWHTTSIFVSENEEFLK 488
Query: 499 NGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVG 558
GS+PVLLIES GHALHAF NQE QG+ +GNGTH PF +KNPISL+AGKNEIALL +TVG
Sbjct: 489 KGSKPVLLIESTGHALHAFVNQEYQGTGTGNGTHSPFSFKNPISLRAGKNEIALLCLTVG 548
Query: 559 LQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV 618
LQ AGPFY+++GAG+TSVKI G +GT+DLS+Y+WTYKIG+QGE+L +Y N +NW
Sbjct: 549 LQTAGPFYDFIGAGLTSVKIKGLKNGTIDLSSYAWTYKIGVQGEYLRLYQGNGLNKVNWT 608
Query: 619 STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPH 678
ST EP K QPLTWYKA+V PPGDEP+GLDML MGKGLAWLNGEEIGRYWPRKS S
Sbjct: 609 STSEPQKMQPLTWYKAIVDAPPGDEPVGLDMLHMGKGLAWLNGEEIGRYWPRKSEFKS-- 666
Query: 679 DECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFS 738
++CV+ECDYRGKFNPDKC TGCGEP+QRWYH+PRSWFKPS NILV+FEEKGGDP KI F
Sbjct: 667 EDCVKECDYRGKFNPDKCDTGCGEPTQRWYHVPRSWFKPSGNILVLFEEKGGDPEKIKFV 726
Query: 739 IRKISG 744
RK+SG
Sbjct: 727 RRKVSG 732
>gi|356518796|ref|XP_003528063.1| PREDICTED: beta-galactosidase 10-like [Glycine max]
Length = 898
Score = 1206 bits (3119), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 560/726 (77%), Positives = 634/726 (87%), Gaps = 6/726 (0%)
Query: 19 SITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYV 78
+ T + NV+YD RSLII+ +R+L+ISA+IHYPRSVP MWPGLVQ AKEGGV+ IE+YV
Sbjct: 68 TFTVASSANVSYDGRSLIIDAQRKLLISASIHYPRSVPAMWPGLVQTAKEGGVDVIETYV 127
Query: 79 FWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPG 138
FWNGHELSPG YYFGGRF+LVKF + +QQA MY+ILRIGPFVAAE+N+GG+PVWLHY+PG
Sbjct: 128 FWNGHELSPGNYYFGGRFDLVKFAQTVQQAGMYLILRIGPFVAAEWNFGGVPVWLHYVPG 187
Query: 139 TVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKR 198
TVFR +PF YHMQKF T IV++MK+EKLFASQGGPIILAQ+ENEYGYYE+FY E GK+
Sbjct: 188 TVFRTYNQPFMYHMQKFTTYIVNLMKQEKLFASQGGPIILAQIENEYGYYENFYKEDGKK 247
Query: 199 YALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGW 258
YALWAAKMAV+QN GVPWIMCQQ+D PDPVI+TCNSFYCDQFTP SP+ PKIWTENWPGW
Sbjct: 248 YALWAAKMAVSQNTGVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPNRPKIWTENWPGW 307
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAP 318
FKTFGGRDPHRP+ED+AFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDY+AP
Sbjct: 308 FKTFGGRDPHRPAEDVAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYDAP 367
Query: 319 IDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLA 378
+DEYGLPR PKWGHLKELH AIKLCEH LLNG+ N+SLG S EADVY DSSGACAAF++
Sbjct: 368 VDEYGLPRLPKWGHLKELHRAIKLCEHVLLNGKSVNISLGPSVEADVYTDSSGACAAFIS 427
Query: 379 NMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEAS 438
N+DDKNDKTV FRN S+HLPAWSVSILPDCK VVFNTA V +Q+S V MVPE+LQ S
Sbjct: 428 NVDDKNDKTVEFRNASFHLPAWSVSILPDCKNVVFNTAKVTSQTSVVAMVPESLQQS--- 484
Query: 439 PDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLK 498
D KW + KE GIWG+ADFVK+GFVD INTTKDTTDYLW+TTSI V+ENEEFLK
Sbjct: 485 -DKVVNSFKWDIVKEKPGIWGKADFVKNGFVDLINTTKDTTDYLWHTTSIFVSENEEFLK 543
Query: 499 NGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVG 558
G++PVLLIES GHALHAF NQE +G+ SGNGTH PF +KNPISL+AGKNEIALL +TVG
Sbjct: 544 KGNKPVLLIESTGHALHAFVNQEYEGTGSGNGTHAPFTFKNPISLRAGKNEIALLCLTVG 603
Query: 559 LQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV 618
LQ AGPFY++VGAG+TSVKI G N+GT+DLS+Y+WTYKIG+QGE+L +Y NN+NW
Sbjct: 604 LQTAGPFYDFVGAGLTSVKIKGLNNGTIDLSSYAWTYKIGVQGEYLRLYQGNGLNNVNWT 663
Query: 619 STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPH 678
ST EPPK QPLTWYKA+V PPGDEP+GLDML MGKGLAWLNGEEIGRYWPRKS S
Sbjct: 664 STSEPPKMQPLTWYKAIVDAPPGDEPVGLDMLHMGKGLAWLNGEEIGRYWPRKSEFKS-- 721
Query: 679 DECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFS 738
++CV+ECDYRGKFNPDKC TGCGEP+QRWYH+PRSWFKPS NILV+FEEKGGDP KI F
Sbjct: 722 EDCVKECDYRGKFNPDKCDTGCGEPTQRWYHVPRSWFKPSGNILVLFEEKGGDPEKIKFV 781
Query: 739 IRKISG 744
RK+SG
Sbjct: 782 RRKVSG 787
>gi|61162196|dbj|BAD91080.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 851
Score = 1192 bits (3085), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 564/718 (78%), Positives = 629/718 (87%), Gaps = 4/718 (0%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NV+YDSRSLII+G+R+L+ISAAIHYPRSVP MWP LVQ AKEGGV+ IE+YVFWNGHE S
Sbjct: 28 NVSYDSRSLIIDGQRKLLISAAIHYPRSVPEMWPKLVQTAKEGGVDVIETYVFWNGHEPS 87
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PG YYFGGR++LVKF+KI++QA M++ILRIGPFVAAE+ +GGIPVWLHY+PGTVFR + +
Sbjct: 88 PGNYYFGGRYDLVKFVKIVEQAGMHLILRIGPFVAAEWYFGGIPVWLHYVPGTVFRTENK 147
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFKYHMQKF T IVD+MK+EK FASQGGPIILAQVENEYGYYE YGEGGK+YA+WAA M
Sbjct: 148 PFKYHMQKFTTFIVDLMKQEKFFASQGGPIILAQVENEYGYYEKDYGEGGKQYAMWAASM 207
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
AV+QNIGVPWIMCQQFD P+ VINTCNSFYCDQFTP + PKIWTENWPGWFKTFGG +
Sbjct: 208 AVSQNIGVPWIMCQQFDAPESVINTCNSFYCDQFTPIYQNKPKIWTENWPGWFKTFGGWN 267
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
PHRP+EDIAFSVARFFQKGGSVHNYYMYHGGTNFGRT+GGPFITTSYDYEAPIDEYGLPR
Sbjct: 268 PHRPAEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGLPR 327
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDK 386
PKWGHLK+LH AIKLCEH +LN + +N+SLG S EADV+ +SSGACAAF+ANMDDKNDK
Sbjct: 328 LPKWGHLKQLHRAIKLCEHIMLNSQPTNVSLGPSLEADVFTNSSGACAAFIANMDDKNDK 387
Query: 387 TVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL 446
TV FRN+SYHLPAWSVSILPDCK VVFNTA V +QSS VEM+PE+LQ S S D K L
Sbjct: 388 TVEFRNMSYHLPAWSVSILPDCKNVVFNTAKVGSQSSVVEMLPESLQLSVGSADKSLKDL 447
Query: 447 KWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
KW VF E AGIWGEADFVKSG VDHINTTK TTDYLWYTTSI+V ENEEFLK GS PVLL
Sbjct: 448 KWDVFVEKAGIWGEADFVKSGLVDHINTTKFTTDYLWYTTSILVGENEEFLKKGSSPVLL 507
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
IESKGHA+HAF NQELQ SA+GNGTH PFK K PISLK GKN+IALLSMTVGLQNAG FY
Sbjct: 508 IESKGHAVHAFVNQELQASAAGNGTHFPFKLKAPISLKEGKNDIALLSMTVGLQNAGSFY 567
Query: 567 EWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKN 626
EWVGAG+TSVKI GFN+GT+DLS Y+WTYKIGL+GEH G+ N+NW+S EPPK
Sbjct: 568 EWVGAGLTSVKIQGFNNGTIDLSAYNWTYKIGLEGEHQGLDKEEGFGNVNWISASEPPKE 627
Query: 627 QPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECD 686
QPLTWYK +V PPGD+P+GLDM+ MGKGLAWLNGEEIGRYWPRK P CV+EC+
Sbjct: 628 QPLTWYKVIVDPPPGDDPVGLDMIHMGKGLAWLNGEEIGRYWPRK----GPLHGCVKECN 683
Query: 687 YRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKISG 744
YRGKF+PDKC TGCGEP+QRWYH+PRSWFK S N+LVIFEEKGGDP+KI FS RKI+G
Sbjct: 684 YRGKFDPDKCNTGCGEPTQRWYHVPRSWFKQSGNVLVIFEEKGGDPSKIEFSRRKITG 741
>gi|308550956|gb|ADO34792.1| beta-galactosidase STBG7 [Solanum lycopersicum]
Length = 870
Score = 1177 bits (3046), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 542/734 (73%), Positives = 624/734 (85%), Gaps = 3/734 (0%)
Query: 11 ALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGG 70
+L +S++T +VTYD RSLIING+R+L+ISA+IHYPRSVP MWPGLV+ AKEGG
Sbjct: 29 SLAAVDASNVTTIGTDSVTYDRRSLIINGQRKLLISASIHYPRSVPAMWPGLVRLAKEGG 88
Query: 71 VNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIP 130
V+ IE+YVFWNGHE SPG YYFGGRF+LVKF KIIQQA MYMILRIGPFVAAE+N+GG+P
Sbjct: 89 VDVIETYVFWNGHEPSPGNYYFGGRFDLVKFCKIIQQAGMYMILRIGPFVAAEWNFGGLP 148
Query: 131 VWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYES 190
VWLHY+PGT FR D+EPFKYHMQKFMT V++MKRE+LFASQGGPIIL+QVENEYGYYE+
Sbjct: 149 VWLHYVPGTTFRTDSEPFKYHMQKFMTYTVNLMKRERLFASQGGPIILSQVENEYGYYEN 208
Query: 191 FYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKI 250
YGEGGKRYALWAAKMA++QN GVPWIMCQQ+D PDPVI+TCNSFYCDQF P SP+ PKI
Sbjct: 209 AYGEGGKRYALWAAKMALSQNTGVPWIMCQQYDAPDPVIDTCNSFYCDQFKPISPNKPKI 268
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT 310
WTENWPGWFKTFG RDPHRP+ED+A+SVARFFQKGGSV NYYMYHGGTNFGRTAGGPFIT
Sbjct: 269 WTENWPGWFKTFGARDPHRPAEDVAYSVARFFQKGGSVQNYYMYHGGTNFGRTAGGPFIT 328
Query: 311 TSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSS 370
TSYDY+APIDEYGLPR PKWGHLKELH IK CEHALLN + + LSLG QEADVY D+S
Sbjct: 329 TSYDYDAPIDEYGLPRFPKWGHLKELHKVIKSCEHALLNNDPTLLSLGPLQEADVYEDAS 388
Query: 371 GACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPE 430
GACAAFLANMDDKNDK V FR+VSYHLPAWSVSILPDCK V FNTA V Q+S V M P
Sbjct: 389 GACAAFLANMDDKNDKVVQFRHVSYHLPAWSVSILPDCKNVAFNTAKVGCQTSIVNMAPI 448
Query: 431 NLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIV 490
+L P+ +SP K L+W+VFKE AG+WG ADF K+GFVDHINTTKD TDYLWYTTSI V
Sbjct: 449 DLHPTASSPKRDIKSLQWEVFKETAGVWGVADFTKNGFVDHINTTKDATDYLWYTTSIFV 508
Query: 491 NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
+ E+FL+N +L +ESKGHA+H F N++LQ SASGNGT P FK+ PI+LKAGKNEI
Sbjct: 509 HAEEDFLRNRGTAMLFVESKGHAMHVFINKKLQASASGNGTVPQFKFGTPIALKAGKNEI 568
Query: 551 ALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPG 610
ALLSMTVGLQ AG FYEW+GAG TSVK+ GF +GT+DL+ +WTYKIGLQGEHL I
Sbjct: 569 ALLSMTVGLQTAGAFYEWIGAGPTSVKVAGFKTGTMDLTASAWTYKIGLQGEHLRIQKSY 628
Query: 611 YRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPR 670
+ W T +PPK QPLTWYKAVV PPG+EP+ LDM+ MGKG+AWLNG+EIGRYWPR
Sbjct: 629 NLKSKIWAPTSQPPKQQPLTWYKAVVDAPPGNEPVALDMIHMGKGMAWLNGQEIGRYWPR 688
Query: 671 KSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGG 730
++ K ++ CV +CDYRGKFNPDKC+TGCG+P+QRWYH+PRSWFKPS N+L+IFEE GG
Sbjct: 689 RTSK---YENCVTQCDYRGKFNPDKCVTGCGQPTQRWYHVPRSWFKPSGNVLIIFEEIGG 745
Query: 731 DPTKITFSIRKISG 744
DP++I FS+RK+SG
Sbjct: 746 DPSQIRFSMRKVSG 759
>gi|350537729|ref|NP_001234307.1| beta-galactosidase, chloroplastic precursor [Solanum lycopersicum]
gi|7939621|gb|AAF70823.1|AF154422_1 beta-galactosidase [Solanum lycopersicum]
Length = 870
Score = 1176 bits (3042), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 541/734 (73%), Positives = 624/734 (85%), Gaps = 3/734 (0%)
Query: 11 ALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGG 70
+L +S++T +VTYD RSLIING+R+L+ISA+IHYPRSVP MWPGLV+ AKEGG
Sbjct: 29 SLAAVDASNVTTIGTDSVTYDRRSLIINGQRKLLISASIHYPRSVPAMWPGLVRLAKEGG 88
Query: 71 VNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIP 130
V+ IE+YVFWNGHE SPG YYFGGRF+LVKF KIIQQA MYMILRIGPFVAAE+N+GG+P
Sbjct: 89 VDVIETYVFWNGHEPSPGNYYFGGRFDLVKFCKIIQQAGMYMILRIGPFVAAEWNFGGLP 148
Query: 131 VWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYES 190
VWLHY+PGT FR D+EPFKYHMQKFMT V++MKRE+LFASQGGPIIL+QVENEYGYYE+
Sbjct: 149 VWLHYVPGTTFRTDSEPFKYHMQKFMTYTVNLMKRERLFASQGGPIILSQVENEYGYYEN 208
Query: 191 FYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKI 250
YGEGGKRYALWAAKMA++QN GVPWIMCQQ+D PDPVI+TCNSFYCDQF P SP+ PKI
Sbjct: 209 AYGEGGKRYALWAAKMALSQNTGVPWIMCQQYDAPDPVIDTCNSFYCDQFKPISPNKPKI 268
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT 310
WTENWPGWFKTFG RDPHRP+ED+A+SVARFFQKGGSV NYYMYHGGTNFGRTAGGPFIT
Sbjct: 269 WTENWPGWFKTFGARDPHRPAEDVAYSVARFFQKGGSVQNYYMYHGGTNFGRTAGGPFIT 328
Query: 311 TSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSS 370
TSYDY+APIDEYGLPR PKWGHLKELH IK CEHALLN + + LSLG QEADVY D+S
Sbjct: 329 TSYDYDAPIDEYGLPRFPKWGHLKELHKVIKSCEHALLNNDPTLLSLGPLQEADVYEDAS 388
Query: 371 GACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPE 430
GACAAFLANMDDKNDK V FR+VSYHLPAWSVSILPDCK V FNTA V Q+S V M P
Sbjct: 389 GACAAFLANMDDKNDKVVQFRHVSYHLPAWSVSILPDCKNVAFNTAKVGCQTSIVNMAPI 448
Query: 431 NLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIV 490
+L P+ +SP K L+W+VFKE AG+WG ADF K+GFVDHINTTKD TDYLWYTTSI V
Sbjct: 449 DLHPTASSPKRDIKSLQWEVFKETAGVWGVADFTKNGFVDHINTTKDATDYLWYTTSIFV 508
Query: 491 NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
+ E+FL+N +L +ESKGHA+H F N++LQ SASGNGT P FK+ PI+LKAGKNEI
Sbjct: 509 HAEEDFLRNRGTAMLFVESKGHAMHVFINKKLQASASGNGTVPQFKFGTPIALKAGKNEI 568
Query: 551 ALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPG 610
+LLSMTVGLQ AG FYEW+GAG TSVK+ GF +GT+DL+ +WTYKIGLQGEHL I
Sbjct: 569 SLLSMTVGLQTAGAFYEWIGAGPTSVKVAGFKTGTMDLTASAWTYKIGLQGEHLRIQKSY 628
Query: 611 YRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPR 670
+ W T +PPK QPLTWYKAVV PPG+EP+ LDM+ MGKG+AWLNG+EIGRYWPR
Sbjct: 629 NLKSKIWAPTSQPPKQQPLTWYKAVVDAPPGNEPVALDMIHMGKGMAWLNGQEIGRYWPR 688
Query: 671 KSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGG 730
++ K ++ CV +CDYRGKFNPDKC+TGCG+P+QRWYH+PRSWFKPS N+L+IFEE GG
Sbjct: 689 RTSK---YENCVTQCDYRGKFNPDKCVTGCGQPTQRWYHVPRSWFKPSGNVLIIFEEIGG 745
Query: 731 DPTKITFSIRKISG 744
DP++I FS+RK+SG
Sbjct: 746 DPSQIRFSMRKVSG 759
>gi|449459196|ref|XP_004147332.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
gi|449497145|ref|XP_004160325.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 844
Score = 1172 bits (3031), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 550/736 (74%), Positives = 628/736 (85%), Gaps = 7/736 (0%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
F +L F + C A NVTYD RSLII+G R+L+ISA+IHYPRSVP MWP L+Q AKEG
Sbjct: 7 FLVLCLF---LPLCLAANVTYDRRSLIIDGHRKLLISASIHYPRSVPAMWPSLIQNAKEG 63
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
GV+ IE+YVFWNGHELSP Y+F GRF+LVKFI I+ A +Y+ILRIGPFVAAE+N+GG+
Sbjct: 64 GVDVIETYVFWNGHELSPDNYHFDGRFDLVKFINIVHNAGLYLILRIGPFVAAEWNFGGV 123
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWLHYIP TVFR D FK++MQKF T IV +MK+EKLFASQGGPIIL+QVENEYG E
Sbjct: 124 PVWLHYIPNTVFRTDNASFKFYMQKFTTYIVSLMKKEKLFASQGGPIILSQVENEYGDIE 183
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
YGEGGK YA+WAA+MAV+QNIGVPWIMCQQ+D PDPVINTCNSFYCDQFTP+SP+ PK
Sbjct: 184 RVYGEGGKPYAMWAAQMAVSQNIGVPWIMCQQYDAPDPVINTCNSFYCDQFTPNSPNKPK 243
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTENWPGWFKTFG RDPHRP EDIAFSVARFFQKGGS+ NYYMYHGGTNFGRTAGGPFI
Sbjct: 244 MWTENWPGWFKTFGARDPHRPPEDIAFSVARFFQKGGSLQNYYMYHGGTNFGRTAGGPFI 303
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TTSYDY+APIDEYGLPR PKWGHLKELH AIKL E LLN E + +SLG S EADVY DS
Sbjct: 304 TTSYDYDAPIDEYGLPRLPKWGHLKELHRAIKLTERVLLNSEPTYVSLGPSLEADVYTDS 363
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
SGACAAF+AN+D+K+DKTV FRN+SYHLPAWSVSILPDCK VVFNTA +R+Q++ VEMVP
Sbjct: 364 SGACAAFIANIDEKDDKTVQFRNISYHLPAWSVSILPDCKNVVFNTAMIRSQTAMVEMVP 423
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSII 489
E LQPS + + K LKW+VF E GIWG+ADFVK+ VDH+NTTKDTTDYLWYTTSI
Sbjct: 424 EELQPSADATNKDLKALKWEVFVEQPGIWGKADFVKNVLVDHLNTTKDTTDYLWYTTSIF 483
Query: 490 VNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNE 549
VNENE+FLK GS+PVL++ESKGHALHAF N++LQ SA+GNG+ FK+K ISLKAGKNE
Sbjct: 484 VNENEKFLK-GSQPVLVVESKGHALHAFINKKLQVSATGNGSDITFKFKQAISLKAGKNE 542
Query: 550 IALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNP 609
IALLSMTVGLQNAGPFYEWVGAG++ V I GFN+G +DLS+Y+W+YKIGLQGEHLGIY P
Sbjct: 543 IALLSMTVGLQNAGPFYEWVGAGLSKVVIEGFNNGPVDLSSYAWSYKIGLQGEHLGIYKP 602
Query: 610 GYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWP 669
N+ W+S+ EPPK QPLTWYK ++ P G+EP+GLDM+ MGKGLAWLNGEEIGRYWP
Sbjct: 603 DGIKNVKWLSSREPPKQQPLTWYKVILDPPSGNEPVGLDMVHMGKGLAWLNGEEIGRYWP 662
Query: 670 RKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKG 729
KSS HD CVQ+CDYRGKF PDKC+TGCGEP+QRWYH+PRSWFKPS NILVIFEEKG
Sbjct: 663 ---TKSSIHDVCVQKCDYRGKFRPDKCLTGCGEPTQRWYHVPRSWFKPSGNILVIFEEKG 719
Query: 730 GDPTKITFSIRKISGF 745
GDPT+I S RK+ G
Sbjct: 720 GDPTQIRLSKRKVLGI 735
>gi|224096113|ref|XP_002310540.1| predicted protein [Populus trichocarpa]
gi|222853443|gb|EEE90990.1| predicted protein [Populus trichocarpa]
Length = 827
Score = 1162 bits (3005), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 557/737 (75%), Positives = 619/737 (83%), Gaps = 27/737 (3%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
L+ FFS T CFAGNV+YDSRSLIING R+L+ISAAIHYPRSVP MWP LV+ AKEG
Sbjct: 3 LGLIFFFSLCFTLCFAGNVSYDSRSLIINGERKLLISAAIHYPRSVPAMWPELVKTAKEG 62
Query: 70 GVNTIESYVFWNGHE-LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGG 128
GV+ IE+YVFWN H+ SP +Y+F GRF+LVKFI I+Q+A MY+ILRIGPFVAAE+N+GG
Sbjct: 63 GVDVIETYVFWNVHQPTSPSEYHFDGRFDLVKFINIVQEAGMYLILRIGPFVAAEWNFGG 122
Query: 129 IPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQ--VENEYG 186
IPVWLHY+ GTVFR D FKY+M++F T IV +MK+EKLFASQGGPIIL+Q VENEYG
Sbjct: 123 IPVWLHYVNGTVFRTDNYNFKYYMEEFTTYIVKLMKKEKLFASQGGPIILSQAKVENEYG 182
Query: 187 YYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPS 246
YYE YGEGGKRYA WAA+MAV+QN GVPWIMCQQFD P VINTCNSFYCDQF P P
Sbjct: 183 YYEGAYGEGGKRYAAWAAQMAVSQNTGVPWIMCQQFDAPPSVINTCNSFYCDQFKPIFPD 242
Query: 247 MPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
PKIWTENWPGWF+TFG +PHRP+ED+AFSVARFFQKGGSV NYYMYHGGTNFGRTAGG
Sbjct: 243 KPKIWTENWPGWFQTFGAPNPHRPAEDVAFSVARFFQKGGSVQNYYMYHGGTNFGRTAGG 302
Query: 307 PFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVY 366
PFITTSYDYEAPIDEYGLPR PKWGHLKELH AIKLCEH LLN + NLSLG SQEADVY
Sbjct: 303 PFITTSYDYEAPIDEYGLPRLPKWGHLKELHKAIKLCEHVLLNSKPVNLSLGPSQEADVY 362
Query: 367 ADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVE 426
AD+SG C AFLAN+DDKNDKTV F+NVSY LPAWSVSILPDCK VV+NTA +
Sbjct: 363 ADASGGCVAFLANIDDKNDKTVDFQNVSYKLPAWSVSILPDCKNVVYNTAKQK------- 415
Query: 427 MVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTT 486
+GSK LKW+VF E AGIWGE DF+K+GFVDHINTTKDTTDYLWYTT
Sbjct: 416 --------------DGSKALKWEVFVEKAGIWGEPDFMKNGFVDHINTTKDTTDYLWYTT 461
Query: 487 SIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAG 546
SI+V ENEEFLK G PVLLIES GHALHAF NQELQGSASGNG+H PFK+KNPISLKAG
Sbjct: 462 SIVVGENEEFLKEGRHPVLLIESMGHALHAFVNQELQGSASGNGSHSPFKFKNPISLKAG 521
Query: 547 KNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGI 606
NEIALLSMTVGL NAG FYEWVGAG+TSV+I GFN+GT+DLS ++W YKIGLQGE LGI
Sbjct: 522 NNEIALLSMTVGLPNAGSFYEWVGAGLTSVRIEGFNNGTVDLSHFNWIYKIGLQGEKLGI 581
Query: 607 YNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGR 666
Y P N+++WV+T EPPK QPLTWYK V+ P G+EP+GLDML MGKGLAWLNGEEIGR
Sbjct: 582 YKPEGVNSVSWVATSEPPKKQPLTWYKVVLDPPAGNEPVGLDMLHMGKGLAWLNGEEIGR 641
Query: 667 YWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFE 726
YWP RKSS H++CV ECDYRGKF PDKC TGCG+P+QRWYH+PRSWFKPS N+LVIFE
Sbjct: 642 YWP---RKSSVHEKCVTECDYRGKFMPDKCFTGCGQPTQRWYHVPRSWFKPSGNLLVIFE 698
Query: 727 EKGGDPTKITFSIRKIS 743
EKGGDP KITFS RK+S
Sbjct: 699 EKGGDPEKITFSRRKMS 715
>gi|15242897|ref|NP_201186.1| beta-galactosidase 10 [Arabidopsis thaliana]
gi|75171772|sp|Q9FN08.1|BGL10_ARATH RecName: Full=Beta-galactosidase 10; Short=Lactase 10; Flags: Precursor
gi|10177669|dbj|BAB11029.1| beta-galactosidase [Arabidopsis thaliana]
gi|20260438|gb|AAM13117.1| unknown protein [Arabidopsis thaliana]
gi|34098797|gb|AAQ56781.1| At5g63810 [Arabidopsis thaliana]
gi|332010417|gb|AED97800.1| beta-galactosidase 10 [Arabidopsis thaliana]
Length = 741
Score = 1148 bits (2970), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 538/739 (72%), Positives = 615/739 (83%), Gaps = 10/739 (1%)
Query: 7 IAPFALLIF--FSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQ 64
IA A+L+ F S A NV+YD RSL I RR+LIISAAIHYPRSVP MWP LVQ
Sbjct: 9 IASTAILVVMVFLFSWRSIEAANVSYDHRSLTIGNRRQLIISAAIHYPRSVPAMWPSLVQ 68
Query: 65 QAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEY 124
AKEGG N IESYVFWNGHE SPGKYYFGGR+N+VKFIKI+QQA M+MILRIGPFVAAE+
Sbjct: 69 TAKEGGCNAIESYVFWNGHEPSPGKYYFGGRYNIVKFIKIVQQAGMHMILRIGPFVAAEW 128
Query: 125 NYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENE 184
NYGG+PVWLHY+PGTVFR D EP+K++M+ F T IV+++K+EKLFA QGGPIIL+QVENE
Sbjct: 129 NYGGVPVWLHYVPGTVFRADNEPWKHYMESFTTYIVNLLKQEKLFAPQGGPIILSQVENE 188
Query: 185 YGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHS 244
YGYYE YGEGGKRYA W+A MAV+QNIGVPW+MCQQ+D P VI+TCN FYCDQFTP++
Sbjct: 189 YGYYEKDYGEGGKRYAQWSASMAVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQFTPNT 248
Query: 245 PSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTA 304
P PKIWTENWPGWFKTFGGRDPHRP+ED+A+SVARFF KGGSVHNYYMYHGGTNFGRT+
Sbjct: 249 PDKPKIWTENWPGWFKTFGGRDPHRPAEDVAYSVARFFGKGGSVHNYYMYHGGTNFGRTS 308
Query: 305 GGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEAD 364
GGPFITTSYDYEAPIDEYGLPR PKWGHLK+LH AI L E+ L++GE N +LG S EAD
Sbjct: 309 GGPFITTSYDYEAPIDEYGLPRLPKWGHLKDLHKAIMLSENLLISGEHQNFTLGHSLEAD 368
Query: 365 VYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSST 424
VY DSSG CAAFL+N+DDKNDK V+FRN SYHLPAWSVSILPDCK VFNTA V ++SS
Sbjct: 369 VYTDSSGTCAAFLSNLDDKNDKAVMFRNTSYHLPAWSVSILPDCKTEVFNTAKVTSKSSK 428
Query: 425 VEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWY 484
VEM+PE+L+ S GLKW+VF E GIWG ADFVK+ VDHINTTKDTTDYLWY
Sbjct: 429 VEMLPEDLK--------SSSGLKWEVFSEKPGIWGAADFVKNELVDHINTTKDTTDYLWY 480
Query: 485 TTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLK 544
TTSI V+ENE FLK GS PVL IESKGH LH F N+E G+A+GNGTH PFK K P++LK
Sbjct: 481 TTSITVSENEAFLKKGSSPVLFIESKGHTLHVFINKEYLGTATGNGTHVPFKLKKPVALK 540
Query: 545 AGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHL 604
AG+N I LLSMTVGL NAG FYEWVGAG+TSV I GFN GTL+L+ W+YK+G++GEHL
Sbjct: 541 AGENNIDLLSMTVGLANAGSFYEWVGAGLTSVSIKGFNKGTLNLTNSKWSYKLGVEGEHL 600
Query: 605 GIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEI 664
++ PG + W T +PPK QPLTWYK V++ P G EP+GLDM+ MGKG+AWLNGEEI
Sbjct: 601 ELFKPGNSGAVKWTVTTKPPKKQPLTWYKVVIEPPSGSEPVGLDMISMGKGMAWLNGEEI 660
Query: 665 GRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVI 724
GRYWPR +RK+SP+DECV+ECDYRGKF PDKC+TGCGEPSQRWYH+PRSWFK S N LVI
Sbjct: 661 GRYWPRIARKNSPNDECVKECDYRGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGNELVI 720
Query: 725 FEEKGGDPTKITFSIRKIS 743
FEEKGG+P KI S RK+S
Sbjct: 721 FEEKGGNPMKIKLSKRKVS 739
>gi|6686892|emb|CAB64746.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 741
Score = 1146 bits (2964), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 537/739 (72%), Positives = 614/739 (83%), Gaps = 10/739 (1%)
Query: 7 IAPFALLIF--FSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQ 64
IA A+L+ F S A NV+YD RSL I RR+LIISAAIHYPRSVP MWP LVQ
Sbjct: 9 IASTAILVVMVFLFSWRSIEAANVSYDHRSLTIGNRRQLIISAAIHYPRSVPAMWPSLVQ 68
Query: 65 QAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEY 124
AKEGG N IESYVFWNGHE SPGKYYFGGR+N+VKFIKI+QQA M+MILRIGPFVAAE+
Sbjct: 69 TAKEGGCNAIESYVFWNGHEPSPGKYYFGGRYNIVKFIKIVQQAGMHMILRIGPFVAAEW 128
Query: 125 NYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENE 184
NYGG+PVWLHY+PGTVFR D EP+K++M+ F T IV+++K+EKLFA QGGPIIL+QVENE
Sbjct: 129 NYGGVPVWLHYVPGTVFRADNEPWKHYMESFTTYIVNLLKQEKLFAPQGGPIILSQVENE 188
Query: 185 YGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHS 244
YGYYE YGEGGKRYA W+A MAV+QNIGVPW+MCQQ+D P VI+TCN FYCDQFTP++
Sbjct: 189 YGYYEKDYGEGGKRYAQWSASMAVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQFTPNT 248
Query: 245 PSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTA 304
P PKIWTENWPGWFKTFGGRDPHRP+ED+A+SVARFF KGGSVHNYYMYHGGTNFGRT+
Sbjct: 249 PDKPKIWTENWPGWFKTFGGRDPHRPAEDVAYSVARFFGKGGSVHNYYMYHGGTNFGRTS 308
Query: 305 GGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEAD 364
GGPFITTSYDYEAPIDEYGLPR PKWGHLK+LH AI L E+ L++GE N +LG S EAD
Sbjct: 309 GGPFITTSYDYEAPIDEYGLPRLPKWGHLKDLHKAIMLSENLLISGEHQNFTLGHSLEAD 368
Query: 365 VYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSST 424
VY DSSG CAAFL+N+DDKNDK V+FRN SYHLPAWSVSILPDCK VFNTA V ++SS
Sbjct: 369 VYTDSSGTCAAFLSNLDDKNDKAVMFRNTSYHLPAWSVSILPDCKTEVFNTAKVTSKSSK 428
Query: 425 VEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWY 484
VEM+PE+L+ S GLKW+VF E GIWG ADFVK+ VDHINTTKDTTDYLWY
Sbjct: 429 VEMLPEDLK--------SSSGLKWEVFSEKPGIWGAADFVKNELVDHINTTKDTTDYLWY 480
Query: 485 TTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLK 544
TTSI V+ENE FLK GS PVL IESKGH LH F N+E G+A+GNGTH PFK K P++LK
Sbjct: 481 TTSITVSENEAFLKKGSSPVLFIESKGHTLHVFINKEYLGTATGNGTHVPFKLKKPVALK 540
Query: 545 AGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHL 604
AG+ I LLSMTVGL NAG FYEWVGAG+TSV I GFN GTL+L+ W+YK+G++GEHL
Sbjct: 541 AGETNIDLLSMTVGLANAGSFYEWVGAGLTSVSIKGFNKGTLNLTNSKWSYKLGVEGEHL 600
Query: 605 GIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEI 664
++ PG + W T +PPK QPLTWYK V++ P G EP+GLDM+ MGKG+AWLNGEEI
Sbjct: 601 ELFKPGNSGAVKWTVTTKPPKKQPLTWYKVVIEPPSGSEPVGLDMISMGKGMAWLNGEEI 660
Query: 665 GRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVI 724
GRYWPR +RK+SP+DECV+ECDYRGKF PDKC+TGCGEPSQRWYH+PRSWFK S N LVI
Sbjct: 661 GRYWPRIARKNSPNDECVKECDYRGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGNELVI 720
Query: 725 FEEKGGDPTKITFSIRKIS 743
FEEKGG+P KI S RK+S
Sbjct: 721 FEEKGGNPMKIKLSKRKVS 739
>gi|297793967|ref|XP_002864868.1| beta-galactosidase 10 [Arabidopsis lyrata subsp. lyrata]
gi|297310703|gb|EFH41127.1| beta-galactosidase 10 [Arabidopsis lyrata subsp. lyrata]
Length = 740
Score = 1145 bits (2961), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 540/739 (73%), Positives = 613/739 (82%), Gaps = 10/739 (1%)
Query: 7 IAPFALLI--FFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQ 64
IA A+L+ F S A NV+YD RSL I RR+LIISAAIHYPRSVP MWP LVQ
Sbjct: 8 IASTAILVGLVFLFSWRSIDAANVSYDHRSLSIGNRRQLIISAAIHYPRSVPAMWPSLVQ 67
Query: 65 QAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEY 124
AKEGG N IESYVFWNGHE SP KYYFGGR+N+VKFIKI+QQA M+MILRIGPFVAAE+
Sbjct: 68 TAKEGGCNAIESYVFWNGHEPSPRKYYFGGRYNIVKFIKIVQQAGMHMILRIGPFVAAEW 127
Query: 125 NYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENE 184
NYGG+PVWLHY+PGTVFR D EP+K++M+ F T IV+++K+EKLFA QGGPIIL+QVENE
Sbjct: 128 NYGGVPVWLHYVPGTVFRADNEPWKHYMESFTTYIVNLLKKEKLFAPQGGPIILSQVENE 187
Query: 185 YGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHS 244
YGYYE YGEGGKRYA W+A MAV+QNIGVPW+MCQQ+D P VI+TCN FYCDQFTP++
Sbjct: 188 YGYYEKDYGEGGKRYAQWSASMAVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQFTPNT 247
Query: 245 PSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTA 304
P PKIWTENWPGWFKTFGGRDPHRP+ED+A+SVARFF KGGSVHNYYMYHGGTNFGRT+
Sbjct: 248 PDKPKIWTENWPGWFKTFGGRDPHRPAEDVAYSVARFFGKGGSVHNYYMYHGGTNFGRTS 307
Query: 305 GGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEAD 364
GGPFITTSYDYEAPIDEYGLPR PKWGHLK+LH AI L E+ L+NGE N +LG S EAD
Sbjct: 308 GGPFITTSYDYEAPIDEYGLPRLPKWGHLKDLHKAIMLSENLLINGEHQNFTLGHSLEAD 367
Query: 365 VYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSST 424
VY DSSG CAAFL+N+DDKNDKTV+FRN SYHLPAWSVSILPDCK VFNTA V ++ S
Sbjct: 368 VYTDSSGTCAAFLSNLDDKNDKTVMFRNTSYHLPAWSVSILPDCKNEVFNTAKVTSKFSK 427
Query: 425 VEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWY 484
VEM+PE+L+ S GLKW+VF E GIWGEADFVK+ VDHINTTKDTTDYLWY
Sbjct: 428 VEMLPEDLR--------SSSGLKWEVFSEKPGIWGEADFVKNELVDHINTTKDTTDYLWY 479
Query: 485 TTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLK 544
TTSI V+ NEEFLK GS PVL IESKGH LH F N+E G+A+GNGTH PFK K ++LK
Sbjct: 480 TTSITVSTNEEFLKKGSPPVLFIESKGHTLHVFINKEYLGTATGNGTHVPFKLKKSVALK 539
Query: 545 AGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHL 604
AG+N I LLSMTVGL NAG FYEWVGAG+TSV I GFN GTL+L+ W+YK+G+QG HL
Sbjct: 540 AGENNIDLLSMTVGLSNAGSFYEWVGAGLTSVSIKGFNKGTLNLTNSKWSYKLGVQGVHL 599
Query: 605 GIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEI 664
++ PG + W T +PPK QPLTWYK V+ P G EP+GLDM+ MGKG+AWLNGEEI
Sbjct: 600 ELFKPGDSGAVKWTVTTKPPKKQPLTWYKVVIDPPSGSEPVGLDMMSMGKGMAWLNGEEI 659
Query: 665 GRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVI 724
GRYWPR +RKS+P+DECV+ECDYRGKF PDKC+TGCGEPSQRWYH+PRSWFK S N LVI
Sbjct: 660 GRYWPRIARKSTPNDECVKECDYRGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGNELVI 719
Query: 725 FEEKGGDPTKITFSIRKIS 743
FEEKGGDP KIT S RK+S
Sbjct: 720 FEEKGGDPMKITLSKRKVS 738
>gi|357464797|ref|XP_003602680.1| Beta-galactosidase [Medicago truncatula]
gi|355491728|gb|AES72931.1| Beta-galactosidase [Medicago truncatula]
Length = 781
Score = 1141 bits (2951), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 541/731 (74%), Positives = 626/731 (85%), Gaps = 11/731 (1%)
Query: 12 LLIFFSSSITYCFA-----GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQA 66
+L S+S+T+ NV+YD RSLII+G+R+L+ISA+IHYPRSVP MWP L+Q A
Sbjct: 6 ILCLVSTSLTFTLVYGGVGSNVSYDGRSLIIDGQRKLLISASIHYPRSVPAMWPALIQTA 65
Query: 67 KEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNY 126
KEGG++ IE+YVFWNGHELSPG YYFGGRF+LV+F K++Q A MY+ILRIGPFVAAE+N+
Sbjct: 66 KEGGIDVIETYVFWNGHELSPGNYYFGGRFDLVQFAKVVQDAGMYLILRIGPFVAAEWNF 125
Query: 127 GGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG 186
GG+PVWLHYIPGTVFR +PF +HM+KF T IV++MK+EKLFASQGGPIIL+Q+ENEYG
Sbjct: 126 GGVPVWLHYIPGTVFRTYNQPFMHHMEKFTTYIVNLMKKEKLFASQGGPIILSQIENEYG 185
Query: 187 YYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPS 246
YYE++Y E GK+YALWAAKMAV+QN VPWIMCQQ+D PDPVI+TCNSFYCDQFTP SP
Sbjct: 186 YYENYYKEDGKKYALWAAKMAVSQNTSVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPK 245
Query: 247 MPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
PK+WTENWPGWFKTFGGRDPHRP ED+AFSVARFFQKGGS++NYYMYHGGTNFGRTAGG
Sbjct: 246 RPKMWTENWPGWFKTFGGRDPHRPVEDVAFSVARFFQKGGSLNNYYMYHGGTNFGRTAGG 305
Query: 307 PFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVY 366
PFITTSYDY+APIDEYGLPR PKWGHLKELH AIKLCEH LL G+ N+SLG S EAD+Y
Sbjct: 306 PFITTSYDYDAPIDEYGLPRLPKWGHLKELHKAIKLCEHVLLYGKSVNISLGPSVEADIY 365
Query: 367 ADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVE 426
DSSGACAAF++N+DDKNDK VVFRN SYHLPAWSVSILPDCK VVFNTA V + ++ V
Sbjct: 366 TDSSGACAAFISNVDDKNDKKVVFRNASYHLPAWSVSILPDCKNVVFNTAKVSSPTNIVA 425
Query: 427 MVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTT 486
M+PE+LQ S D G K LKW VFKE GIWG+ADFVK+GFVDHINTTKDTTDYLW+TT
Sbjct: 426 MIPEHLQQS----DKGQKTLKWDVFKENPGIWGKADFVKNGFVDHINTTKDTTDYLWHTT 481
Query: 487 SIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAG 546
SI+++ NEEFLK GS+P LLIESKGH LHAF NQ+ QG+ +GNG+H F +KNPISL+AG
Sbjct: 482 SILIDANEEFLKKGSKPALLIESKGHTLHAFVNQKYQGTGTGNGSHSAFTFKNPISLRAG 541
Query: 547 KNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGI 606
KNEIA+LS+TVGLQ AGPFY+++GAG+TSVKI G N+ T+DLS+ +W YKIG+ GEHL I
Sbjct: 542 KNEIAILSLTVGLQTAGPFYDFIGAGVTSVKIIGLNNRTIDLSSNAWAYKIGVLGEHLSI 601
Query: 607 YNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGR 666
Y N++ W ST EPPK Q LTWYKA+V P GDEP+GLDML MGKGLAWLNGEEIGR
Sbjct: 602 YQGEGMNSVKWTSTSEPPKGQALTWYKAIVDAPSGDEPVGLDMLYMGKGLAWLNGEEIGR 661
Query: 667 YWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFE 726
YWPR S ++CVQECDYRGKFNPDKC TGCGEPSQ+WYH+PRSWFKPS N+LVIFE
Sbjct: 662 YWPRISEFKK--EDCVQECDYRGKFNPDKCDTGCGEPSQKWYHVPRSWFKPSGNVLVIFE 719
Query: 727 EKGGDPTKITF 737
EKGGDPTKITF
Sbjct: 720 EKGGDPTKITF 730
>gi|358348424|ref|XP_003638247.1| hypothetical protein MTR_122s1070, partial [Medicago truncatula]
gi|355504182|gb|AES85385.1| hypothetical protein MTR_122s1070, partial [Medicago truncatula]
Length = 771
Score = 1088 bits (2814), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 525/727 (72%), Positives = 602/727 (82%), Gaps = 40/727 (5%)
Query: 44 IISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIK 103
+ISA+IHYPRSVP MWP L+Q AKEGG++ IE+YVFWNGHELSPG YYFGGRF+LV+F K
Sbjct: 1 LISASIHYPRSVP-MWPALIQTAKEGGIDVIETYVFWNGHELSPGNYYFGGRFDLVQFAK 59
Query: 104 IIQQARMYMILRIGPFVAAEYNYGG---------------------------------IP 130
++Q A MY+ILRIGPFVAAE+N+GG +P
Sbjct: 60 VVQDAGMYLILRIGPFVAAEWNFGGEKNGVLICEDGEERGYRERADKNNQGNSRVLCGVP 119
Query: 131 VWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYES 190
VWLHYIPGTVFR +PF +HM+KF T IV++MK+EKLFASQGGPIIL+Q+ENEYGYYE+
Sbjct: 120 VWLHYIPGTVFRTYNQPFMHHMEKFTTYIVNLMKKEKLFASQGGPIILSQIENEYGYYEN 179
Query: 191 FYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKI 250
+Y E GK+YALWAAKMAV+QN VPWIMCQQ+D PDPVI+TCNSFYCDQFTP SP PK+
Sbjct: 180 YYKEDGKKYALWAAKMAVSQNTSVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPKRPKM 239
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT 310
WTENWPGWFKTFGGRDPHRP ED+AFSVARFFQKGGS++NYYMYHGGTNFGRTAGGPFIT
Sbjct: 240 WTENWPGWFKTFGGRDPHRPVEDVAFSVARFFQKGGSLNNYYMYHGGTNFGRTAGGPFIT 299
Query: 311 TSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSS 370
TSYDY+APIDEYGLPR PKWGHLKELH AIKLCEH LL G+ N+SLG S EAD+Y DSS
Sbjct: 300 TSYDYDAPIDEYGLPRLPKWGHLKELHKAIKLCEHVLLYGKSVNISLGPSVEADIYTDSS 359
Query: 371 GACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPE 430
GACAAF++N+DDKNDK VVFRN SYHLPAWSVSILPDCK VVFNTA V + ++ V M+PE
Sbjct: 360 GACAAFISNVDDKNDKKVVFRNASYHLPAWSVSILPDCKNVVFNTAKVSSPTNIVAMIPE 419
Query: 431 NLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIV 490
+LQ S D G K LKW VFKE GIWG+ADFVK+GFVDHINTTKDTTDYLW+TTSI++
Sbjct: 420 HLQQS----DKGQKTLKWDVFKENPGIWGKADFVKNGFVDHINTTKDTTDYLWHTTSILI 475
Query: 491 NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
+ NEEFLK GS+P LLIESKGH LHAF NQ+ QG+ +GNG+H F +KNPISL+AGKNEI
Sbjct: 476 DANEEFLKKGSKPALLIESKGHTLHAFVNQKYQGTGTGNGSHSAFTFKNPISLRAGKNEI 535
Query: 551 ALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPG 610
A+LS+TVGLQ AGPFY+++GAG+TSVKI G N+ T+DLS+ +W YKIG+ GEHL IY
Sbjct: 536 AILSLTVGLQTAGPFYDFIGAGVTSVKIIGLNNRTIDLSSNAWAYKIGVLGEHLSIYQGE 595
Query: 611 YRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPR 670
N++ W ST EPPK Q LTWYKA+V P GDEP+GLDML MGKGLAWLNGEEIGRYWPR
Sbjct: 596 GMNSVKWTSTSEPPKGQALTWYKAIVDAPSGDEPVGLDMLYMGKGLAWLNGEEIGRYWPR 655
Query: 671 KSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGG 730
S ++CVQECDYRGKFNPDKC TGCGEPSQ+WYH+PRSWFKPS N+LVIFEEKGG
Sbjct: 656 ISEFKK--EDCVQECDYRGKFNPDKCDTGCGEPSQKWYHVPRSWFKPSGNVLVIFEEKGG 713
Query: 731 DPTKITF 737
DPTKITF
Sbjct: 714 DPTKITF 720
>gi|255563853|ref|XP_002522927.1| beta-galactosidase, putative [Ricinus communis]
gi|223537854|gb|EEF39470.1| beta-galactosidase, putative [Ricinus communis]
Length = 803
Score = 1075 bits (2779), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 524/724 (72%), Positives = 582/724 (80%), Gaps = 53/724 (7%)
Query: 21 TYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFW 80
T C GN+TYDSRSLII+G+R+L+ISAAIHYPRSVPGMWP LVQ AKEGGV+ IE+YVFW
Sbjct: 22 TLCCGGNITYDSRSLIIDGQRKLLISAAIHYPRSVPGMWPELVQTAKEGGVDVIETYVFW 81
Query: 81 NGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTV 140
NGHE SP YYF R++LVKF+KI+QQA MY+ILRIGPFVAAE+N+GG+PVWLHY+PGTV
Sbjct: 82 NGHEPSPSNYYFEKRYDLVKFVKIVQQAGMYLILRIGPFVAAEWNFGGVPVWLHYVPGTV 141
Query: 141 FRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYA 200
FR D FKYHMQKFMT IV++MK+EKLFASQGGPIILAQVENEYG+YES YGEGGKRYA
Sbjct: 142 FRTDNYNFKYHMQKFMTYIVNLMKKEKLFASQGGPIILAQVENEYGFYESAYGEGGKRYA 201
Query: 201 LWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFK 260
+WAA+MAV+QNIGVPWIMCQQFD P+ VINTCNSFYCDQF P P PKIWTENWPGWF+
Sbjct: 202 MWAAQMAVSQNIGVPWIMCQQFDAPNSVINTCNSFYCDQFKPIFPDKPKIWTENWPGWFQ 261
Query: 261 TFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPID 320
TFG +PHRP+EDIAFSVARFFQKGGSV NYYMYHGGTNFGRT+GGPFITTSYDYEAPID
Sbjct: 262 TFGAPNPHRPAEDIAFSVARFFQKGGSVQNYYMYHGGTNFGRTSGGPFITTSYDYEAPID 321
Query: 321 EYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANM 380
EYGL R PKW HLKELH AIKLCE LLN NLSLG SQEADVYA+ SGACAAFLANM
Sbjct: 322 EYGLARLPKWAHLKELHKAIKLCELTLLNSVPVNLSLGPSQEADVYAEESGACAAFLANM 381
Query: 381 DDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPD 440
D+KNDKTVVFRN+SYHLPAWSVSILPDCK VVFNTA V +Q+S VEMVP++L+ S D
Sbjct: 382 DEKNDKTVVFRNMSYHLPAWSVSILPDCKNVVFNTAKVNSQTSIVEMVPDDLR----SSD 437
Query: 441 NGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNG 500
G+K LKW+ F E AGIWG +D VK+GFVDHINTTKDTTDYLWYTTSI V ENEEFLK G
Sbjct: 438 KGTKALKWETFVENAGIWGTSDLVKNGFVDHINTTKDTTDYLWYTTSIFVGENEEFLKKG 497
Query: 501 SRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQ 560
RPVLLIESKGHALHAF NQELQG+ASGNGTH PFK+K P+SL AGKN+IALLSMTVGLQ
Sbjct: 498 GRPVLLIESKGHALHAFVNQELQGTASGNGTHSPFKFKKPVSLVAGKNDIALLSMTVGLQ 557
Query: 561 NAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVST 620
NAG FYEWVGAG+TSVK+ GFN+GT+DLST++WTYKIGLQGE LG+YN +NWV+T
Sbjct: 558 NAGSFYEWVGAGLTSVKMKGFNNGTIDLSTFNWTYKIGLQGEKLGMYNGIAVETVNWVAT 617
Query: 621 MEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDE 680
+PPK+QPLTWYK + ML W E+ W R
Sbjct: 618 SKPPKDQPLTWYKRQIH--------ARQMLN----WMWRINSEMILVWTR---------- 655
Query: 681 CVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
YH+PRSWFKPS NILVIFEEKGGDPTKITFS R
Sbjct: 656 ---------------------------YHVPRSWFKPSGNILVIFEEKGGDPTKITFSRR 688
Query: 741 KISG 744
KISG
Sbjct: 689 KISG 692
>gi|242055159|ref|XP_002456725.1| hypothetical protein SORBIDRAFT_03g041450 [Sorghum bicolor]
gi|241928700|gb|EES01845.1| hypothetical protein SORBIDRAFT_03g041450 [Sorghum bicolor]
Length = 843
Score = 1038 bits (2683), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 484/720 (67%), Positives = 573/720 (79%), Gaps = 15/720 (2%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A NVTYD RSLII+GRR LIIS +IHYPRSVP MWP LV +AK+GG + IE+YVFWNGHE
Sbjct: 26 ASNVTYDHRSLIISGRRRLIISTSIHYPRSVPEMWPKLVAEAKDGGADCIETYVFWNGHE 85
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
++PG+YYF RF+LV+F+K+++ A + +ILRIGPFVAAE+N+GG+PVWLHY+PGTVFR D
Sbjct: 86 IAPGQYYFEDRFDLVRFVKVVKDAGLLLILRIGPFVAAEWNFGGVPVWLHYVPGTVFRTD 145
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG-YYESFYGEGGKRYALWA 203
EPFK HM+ F T IV+MMK+E+LFASQGG IILAQ+ENEYG YYE Y GGK YA+WA
Sbjct: 146 NEPFKSHMKSFTTYIVNMMKKEQLFASQGGNIILAQIENEYGDYYEQAYAPGGKPYAMWA 205
Query: 204 AKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFG 263
A MAVAQN GVPWIMCQ+ D PDPVIN+CN FYCD F P+SP+ PK+WTENWPGWF+TFG
Sbjct: 206 ASMAVAQNTGVPWIMCQESDAPDPVINSCNGFYCDGFQPNSPTKPKLWTENWPGWFQTFG 265
Query: 264 GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYG 323
+PHRP ED+AF+VARFF+KGGSV NYY+YHGGTNFGRT GGPFITTSYDY+APIDEYG
Sbjct: 266 ESNPHRPPEDVAFAVARFFEKGGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYG 325
Query: 324 LPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDK 383
L R PKW HL++LH +I+LCEH LL G + LSLG QEAD+Y+D SG C AFLAN+D
Sbjct: 326 LRRFPKWAHLRDLHKSIRLCEHTLLYGNTTFLSLGPKQEADIYSDQSGGCVAFLANIDSA 385
Query: 384 NDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGS 443
NDK V FRN Y LPAWSVSILPDC+ VVFNTA V++Q+S V MVPE+LQ S
Sbjct: 386 NDKVVTFRNRQYDLPAWSVSILPDCRNVVFNTAKVQSQTSMVAMVPESLQ--------AS 437
Query: 444 KGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
K +W +F+E GIWG+ DFV++GFVDHINTTKD+TDYLWYTTS V+E+ GS
Sbjct: 438 KPERWNIFRERTGIWGKNDFVRNGFVDHINTTKDSTDYLWYTTSFSVDES---YSKGSHV 494
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
VL I+SKGH +HAF N E GSA GNG+ F K PI+L+ GKNE+ALLSMTVGLQNAG
Sbjct: 495 VLNIDSKGHGVHAFLNNEFIGSAYGNGSQSSFSVKLPINLRTGKNELALLSMTVGLQNAG 554
Query: 564 PFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEP 623
YEW+GAG T+V I+G +GT++LS+ +W YKIGL+GE+ ++ P RNN W+ EP
Sbjct: 555 FSYEWIGAGFTNVNISGVRNGTINLSSNNWAYKIGLEGEYYSLFKPDQRNNQRWIPQSEP 614
Query: 624 PKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQ 683
PKNQPLTWYK V P GD+P+G+DM MGKGL WLNG IGRYWP R SS D C
Sbjct: 615 PKNQPLTWYKVNVDVPQGDDPVGIDMQSMGKGLVWLNGNAIGRYWP---RTSSIDDRCTP 671
Query: 684 ECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
CDYRG+FNP+KC TGCG+P+QRWYHIPRSWF PS NILVIFEEKGGDPTKITFS R ++
Sbjct: 672 SCDYRGEFNPNKCRTGCGQPTQRWYHIPRSWFHPSGNILVIFEEKGGDPTKITFSRRAVT 731
>gi|414879448|tpg|DAA56579.1| TPA: beta-galactosidase isoform 1 [Zea mays]
gi|414879449|tpg|DAA56580.1| TPA: beta-galactosidase isoform 2 [Zea mays]
Length = 844
Score = 1030 bits (2662), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 480/720 (66%), Positives = 572/720 (79%), Gaps = 14/720 (1%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A NVTYD RSLII+GRR L+IS +IHYPRSVP MWP LV +AK+GG + IE+YVFWNGHE
Sbjct: 26 ASNVTYDHRSLIISGRRRLVISTSIHYPRSVPEMWPKLVAEAKDGGADCIETYVFWNGHE 85
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
++PG+YYF RF+LV+F+K+++ A + +ILRIGP+VAAE+NYGG+PVWLHY+PGTVFR +
Sbjct: 86 IAPGQYYFEDRFDLVRFVKVVRDAGLLLILRIGPYVAAEWNYGGVPVWLHYVPGTVFRTN 145
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG-YYESFYGEGGKRYALWA 203
EPFK H++ F T IVDMMK+E+LFASQGG IILAQ+ENEYG YYE YG GGK YA+WA
Sbjct: 146 NEPFKNHVKSFTTYIVDMMKKEQLFASQGGNIILAQIENEYGDYYEQAYGAGGKPYAMWA 205
Query: 204 AKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFG 263
A MA+AQN GVPWIMCQ+ D PDPVIN+CN FYCD F P+SP+ PKIWTENWPGWF+TFG
Sbjct: 206 ASMALAQNTGVPWIMCQESDAPDPVINSCNGFYCDGFQPNSPTKPKIWTENWPGWFQTFG 265
Query: 264 GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYG 323
+PHRP ED+AF+VARFF+KGGSV NYY+YHGGTNFGRT GGPFITTSYDY+APIDEYG
Sbjct: 266 ESNPHRPPEDVAFAVARFFEKGGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYG 325
Query: 324 LPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDK 383
L R PKW HL++LH +I+LCEH LL G + LSLG QEAD+Y+D SG C AFLAN+D
Sbjct: 326 LRRFPKWAHLRDLHKSIRLCEHTLLYGNTTFLSLGPKQEADIYSDQSGGCVAFLANIDSA 385
Query: 384 NDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGS 443
NDK V FRN Y LPAWSVSILPDC+ VVFNTA V++Q+S V MVPE+LQ S
Sbjct: 386 NDKVVTFRNRQYDLPAWSVSILPDCRNVVFNTAKVQSQTSMVTMVPESLQ--------AS 437
Query: 444 KGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
K +W +F+E GIWG+ DFV++GFVDHINTTKD+TDYLWYTTS V+ + + GS
Sbjct: 438 KPERWSIFRERTGIWGKNDFVRNGFVDHINTTKDSTDYLWYTTSFSVDGS--YSSKGSHA 495
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
VL I+S GH +HAF N L GSA GNG+ F K PI+L+ GKNE+ALLSMTVGLQNAG
Sbjct: 496 VLNIDSNGHGVHAFLNNVLIGSAYGNGSQSRFSVKLPINLRTGKNELALLSMTVGLQNAG 555
Query: 564 PFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEP 623
YEW+GAG T+V I+G +GT+DLS+ +W YKIGL+GE+ ++ P NN W+ EP
Sbjct: 556 FAYEWIGAGFTNVNISGVRTGTIDLSSNNWAYKIGLEGEYYNLFKPDQTNNQRWIPQSEP 615
Query: 624 PKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQ 683
PKNQPLTWYK V P GD+P+G+DM MGKGLAWLNG IGRYWP R SS +D C
Sbjct: 616 PKNQPLTWYKVNVDVPQGDDPVGIDMQSMGKGLAWLNGNAIGRYWP---RTSSINDRCTP 672
Query: 684 ECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
C+YRG F PDKC TGCG+P+QRWYHIPRSWF PS NILV+FEEKGGDPTKITFS R ++
Sbjct: 673 SCNYRGTFIPDKCRTGCGQPTQRWYHIPRSWFHPSGNILVVFEEKGGDPTKITFSRRAVT 732
>gi|357131396|ref|XP_003567324.1| PREDICTED: beta-galactosidase 3-like [Brachypodium distachyon]
Length = 916
Score = 1029 bits (2661), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 479/713 (67%), Positives = 565/713 (79%), Gaps = 13/713 (1%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD RSLII+GRR L+IS +IHYPRSVP MWP LV +AK+GG + IE+YVFWNGHE +P
Sbjct: 102 VTYDGRSLIISGRRRLLISTSIHYPRSVPAMWPKLVAEAKDGGADCIETYVFWNGHETAP 161
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G+YYF RF+LV+F K+++ A +Y++LRIGPFVAAE+N+GG+PVWLHYIPG VFR + EP
Sbjct: 162 GEYYFEDRFDLVRFAKVVKDAGLYLMLRIGPFVAAEWNFGGVPVWLHYIPGAVFRTNNEP 221
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK HM+ F T IVDMMKRE+ FASQGG IILAQ+ENEYG E YG GK YA+WAA MA
Sbjct: 222 FKSHMKSFTTKIVDMMKRERFFASQGGHIILAQIENEYGDTEQAYGADGKAYAMWAASMA 281
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
+AQN GVPWIMCQQ+D P+ VINTCNSFYCDQF +SP+ PKIWTENWPGWF+TFG +P
Sbjct: 282 LAQNTGVPWIMCQQYDAPEHVINTCNSFYCDQFKTNSPTKPKIWTENWPGWFQTFGESNP 341
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
HRP ED+AFSVARFFQKGGSV NYY+YHGGTNFGRT GGPFITTSYDY+APIDEYGL R
Sbjct: 342 HRPPEDVAFSVARFFQKGGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGLTRL 401
Query: 328 PKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKT 387
PKW HL++LH +IKLCEH+LL G ++LSLG+ QEADVY D SG C AFLAN+D +ND
Sbjct: 402 PKWAHLRDLHKSIKLCEHSLLYGNLTSLSLGTKQEADVYTDHSGGCVAFLANIDPENDTV 461
Query: 388 VVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLK 447
V FR+ Y LPAWSVSILPDCK VFNTA V++Q+ V+MVPE LQ ++ PD +
Sbjct: 462 VTFRSRQYDLPAWSVSILPDCKNAVFNTAKVQSQTLMVDMVPETLQSTK--PD------R 513
Query: 448 WQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLI 507
W +F+E GIW + DF+++GFVDHINTTKD+TDYLW+TTS N + + NG+R +L I
Sbjct: 514 WSIFREKTGIWDKNDFIRNGFVDHINTTKDSTDYLWHTTSF--NVDRSYPTNGNRELLSI 571
Query: 508 ESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYE 567
+SKGHA+HAF N EL GSA GNG+ F PI LK GKNEIALLSMTVGLQNAGP YE
Sbjct: 572 DSKGHAVHAFLNNELIGSAYGNGSKSSFNVHMPIKLKPGKNEIALLSMTVGLQNAGPHYE 631
Query: 568 WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQ 627
WVGAG+TSV I+G +G++DLS+ +W YKIGL+GEH G++ P NN W EPPK Q
Sbjct: 632 WVGAGLTSVNISGMKNGSIDLSSNNWAYKIGLEGEHYGLFKPDQGNNQRWSPQSEPPKGQ 691
Query: 628 PLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDY 687
PLTWYK V P GD+P+G+DM MGKGLAWLNG IGRYWP R SS D C C+Y
Sbjct: 692 PLTWYKVNVDVPQGDDPVGIDMQSMGKGLAWLNGNAIGRYWP---RTSSSDDRCTPSCNY 748
Query: 688 RGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
RG FNP KC TGCG+P+QRWYH+PRSWF PS N LV+FEE+GGDPTKITFS R
Sbjct: 749 RGPFNPSKCRTGCGKPTQRWYHVPRSWFHPSGNTLVVFEEQGGDPTKITFSRR 801
>gi|226494417|ref|NP_001151478.1| LOC100285111 precursor [Zea mays]
gi|195647054|gb|ACG42995.1| beta-galactosidase precursor [Zea mays]
Length = 844
Score = 1029 bits (2660), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 480/720 (66%), Positives = 570/720 (79%), Gaps = 14/720 (1%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A NVTYD RSLII+GRR L+IS +IHYPRSVP MWP LV +AK+GG + IE+YVFWNGHE
Sbjct: 26 ASNVTYDHRSLIISGRRRLVISTSIHYPRSVPEMWPKLVAEAKDGGADCIETYVFWNGHE 85
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
++PG+YYF RF+LV+F+K+++ A + +ILRIGP+VAAE+NYGG+PVWLHY+PGTVFR +
Sbjct: 86 IAPGQYYFEDRFDLVRFVKVVRDAGLLLILRIGPYVAAEWNYGGVPVWLHYVPGTVFRTN 145
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG-YYESFYGEGGKRYALWA 203
EPFK HM+ F T IVDMMK+E+LFASQGG IILAQ+ENEYG YYE YG GGK YA+WA
Sbjct: 146 NEPFKNHMKSFTTYIVDMMKKEQLFASQGGNIILAQIENEYGDYYEQAYGAGGKPYAMWA 205
Query: 204 AKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFG 263
A MA+AQN GVPWIMCQ+ D PDPVIN+CN FYCD F P+SP+ PKIWTENWPGWF+TFG
Sbjct: 206 ASMALAQNTGVPWIMCQESDAPDPVINSCNGFYCDGFQPNSPTKPKIWTENWPGWFQTFG 265
Query: 264 GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYG 323
+PHRP ED+AF+VARFF+KGGSV NYY+YHGGTNFGRT GGPFITTSYDY+APIDEYG
Sbjct: 266 ESNPHRPPEDVAFAVARFFEKGGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYG 325
Query: 324 LPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDK 383
L R PKW HL+ELH +I+LCEH LL G + LSLG QEAD+Y+D SG C AFLAN+D
Sbjct: 326 LRRFPKWAHLRELHKSIRLCEHTLLYGNTTFLSLGPKQEADIYSDQSGGCVAFLANIDSA 385
Query: 384 NDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGS 443
NDK V FRN Y LPAWSVSILPDC+ VVFNTA V++Q+S V MVPE+LQ S
Sbjct: 386 NDKVVTFRNRQYDLPAWSVSILPDCRNVVFNTAKVQSQTSMVTMVPESLQ--------AS 437
Query: 444 KGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
K +W +F+E GIWG+ DFV++GFVDHINTTKD+TDYLWYTTS V+ + + GS
Sbjct: 438 KPERWSIFRERTGIWGKNDFVRNGFVDHINTTKDSTDYLWYTTSFSVDGS--YSSKGSHA 495
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
VL I+S GH +HAF N L GSA GNG+ F K I+L+ GKNE+ALLSMTVGLQNAG
Sbjct: 496 VLNIDSNGHGVHAFLNNVLIGSAYGNGSQSRFSVKLTINLRTGKNELALLSMTVGLQNAG 555
Query: 564 PFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEP 623
YEW+GAG T+V I+G +G +DLS+ +W YKIGL+GE+ ++ P NN W+ EP
Sbjct: 556 FAYEWIGAGFTNVNISGVRTGIIDLSSNNWAYKIGLEGEYYNLFKPDQTNNQRWIPQSEP 615
Query: 624 PKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQ 683
PKNQPLTWYK V P GD+P+G+DM MGKGLAWLNG IGRYWP R SS +D C
Sbjct: 616 PKNQPLTWYKVNVDVPQGDDPVGIDMQSMGKGLAWLNGNAIGRYWP---RTSSINDRCTP 672
Query: 684 ECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
C+YRG F PDKC TGCG+P+QRWYHIPRSWF PS NILV+FEEKGGDPTKITFS R ++
Sbjct: 673 SCNYRGTFIPDKCRTGCGQPTQRWYHIPRSWFHPSGNILVVFEEKGGDPTKITFSRRAVT 732
>gi|326503960|dbj|BAK02766.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 845
Score = 1023 bits (2646), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 477/713 (66%), Positives = 564/713 (79%), Gaps = 13/713 (1%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD RSL+I+GRR L+ISA+IHYPRSVP MWP LV +AKEGG + IE+YVFWNGHE +P
Sbjct: 31 VTYDHRSLVISGRRRLLISASIHYPRSVPAMWPKLVAEAKEGGADCIETYVFWNGHETAP 90
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GKYYF RF+LV+F ++++ A ++++LRIGPFVAAE+N+GG+P WLHYIPGTVFR + EP
Sbjct: 91 GKYYFEDRFDLVQFARVVKDAGLFLMLRIGPFVAAEWNFGGVPAWLHYIPGTVFRTNNEP 150
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK HM+ F T IVDMMK ++ FASQGG IILAQ+ENEYGYY+ YG GGK YA+WA MA
Sbjct: 151 FKSHMKSFTTKIVDMMKEQRFFASQGGHIILAQIENEYGYYQQAYGAGGKAYAMWAGSMA 210
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
AQN GVPWIMCQQ+D PD VINTCNSFYCDQF P+SP+ PKIWTENWPGWF+TFG +P
Sbjct: 211 QAQNTGVPWIMCQQYDVPDRVINTCNSFYCDQFKPNSPTQPKIWTENWPGWFQTFGESNP 270
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
HRP ED+AFSVARFF KGGSV NYY+YHGGTNF RTAGGPFITTSYDY+APIDEYGL R
Sbjct: 271 HRPPEDVAFSVARFFGKGGSVQNYYVYHGGTNFDRTAGGPFITTSYDYDAPIDEYGLRRL 330
Query: 328 PKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKT 387
PKW HLKELH +IKLCEH+LL G + LSLG QEADVY D SG C AFLAN+D + D+
Sbjct: 331 PKWAHLKELHQSIKLCEHSLLFGNSTLLSLGPQQEADVYTDHSGGCVAFLANIDSEKDRV 390
Query: 388 VVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLK 447
V FRN Y LPAWSVSILPDCK VVFNTA VR+Q+ V+MVP LQ S+ PD +
Sbjct: 391 VTFRNRQYDLPAWSVSILPDCKNVVFNTAKVRSQTLMVDMVPGTLQASK--PD------Q 442
Query: 448 WQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLI 507
W +F E G+W + DFV++ FVDHINTTKD+TDYLW+TTS V+ N + +G+ PVL I
Sbjct: 443 WSIFTERIGVWDKNDFVRNEFVDHINTTKDSTDYLWHTTSFDVDRN--YPSSGNHPVLNI 500
Query: 508 ESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYE 567
+SKGHA+HAF N L GSA GNG+ F PI+LKAGKNEIA+LSMTVGL++AGP+YE
Sbjct: 501 DSKGHAVHAFLNNMLIGSAYGNGSESSFSAHMPINLKAGKNEIAILSMTVGLKSAGPYYE 560
Query: 568 WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQ 627
WVGAG+TSV I+G +GT DLS+ +W YK+GL+GEH G++ NN W +PPK+Q
Sbjct: 561 WVGAGLTSVNISGMKNGTTDLSSNNWAYKVGLEGEHYGLFKHDQGNNQRWRPQSQPPKHQ 620
Query: 628 PLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDY 687
PLTWYK V P GD+P+GLDM MGKGL WLNG IGRYWP R S +D C CDY
Sbjct: 621 PLTWYKVNVDVPQGDDPVGLDMQSMGKGLVWLNGNAIGRYWP---RTSPTNDRCTTSCDY 677
Query: 688 RGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
RGKF+P+KC GCG+P+QRWYH+PRSWF PS N LV+FEE+GGDPTKITFS R
Sbjct: 678 RGKFSPNKCRVGCGKPTQRWYHVPRSWFHPSGNTLVVFEEQGGDPTKITFSRR 730
>gi|218189464|gb|EEC71891.1| hypothetical protein OsI_04635 [Oryza sativa Indica Group]
Length = 851
Score = 998 bits (2581), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 469/717 (65%), Positives = 555/717 (77%), Gaps = 14/717 (1%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+VTYD RSLII+GRR L+IS +IHYPRSVP MWP LV +AK+GG + +E+YVFWNGHE +
Sbjct: 37 SVTYDQRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHEPA 96
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+YYF RF+LV+F KI++ A +YMILRIGPFVAAE+ +GG+PVWLHY PGTVFR + E
Sbjct: 97 QGQYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWTFGGVPVWLHYAPGTVFRTNNE 156
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK HM++F T IVDMMK+E+ FASQGG IILAQVENEYG E YG G K YA+WAA M
Sbjct: 157 PFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEYGDMEQAYGAGAKPYAMWAASM 216
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
A+AQN GVPWIMCQQ+D PDPVINTCNSFYCDQF P+SP+ PK WTENWPGWF+TFG +
Sbjct: 217 ALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQFKPNSPTKPKFWTENWPGWFQTFGESN 276
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
PHRP ED+AFSVARFF KGGS+ NYY+YHGGTNFGRT GGPFITTSYDY+APIDEYGL R
Sbjct: 277 PHRPPEDVAFSVARFFGKGGSLQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGLRR 336
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDK 386
PKW HL++LH +IKL EH LL G S +SLG QEADVY D SG C AFL+N+D + DK
Sbjct: 337 LPKWAHLRDLHKSIKLGEHTLLYGNSSFVSLGPQQEADVYTDQSGGCVAFLSNVDSEKDK 396
Query: 387 TVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL 446
V F++ SY LPAWSVSILPDCK V FNTA VR+Q+ ++MVP NL+ S+
Sbjct: 397 VVTFQSRSYDLPAWSVSILPDCKNVAFNTAKVRSQTLMMDMVPANLESSKVD-------- 448
Query: 447 KWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
W +F+E GIWG D V++GFVDHINTTKD+TDYLWYTTS V+ + G VL
Sbjct: 449 GWSIFREKYGIWGNIDLVRNGFVDHINTTKDSTDYLWYTTSFDVDGSH---LAGGNHVLH 505
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
IESKGHA+ AF N EL GSA GNG+ F + P++L+AGKN+++LLSMTVGLQN GP Y
Sbjct: 506 IESKGHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAGKNKLSLLSMTVGLQNGGPMY 565
Query: 567 EWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKN 626
EW GAGITSVKI+G + +DLS+ W YKIGL+GE+ ++ +I W+ EPPKN
Sbjct: 566 EWAGAGITSVKISGMENRIIDLSSNKWEYKIGLEGEYYSLFKADKGKDIRWMPQSEPPKN 625
Query: 627 QPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECD 686
QP+TWYK V P GD+P+GLDM MGKGLAWLNG IGRYWPR S S D C CD
Sbjct: 626 QPMTWYKVNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWPRISPVS---DRCTSSCD 682
Query: 687 YRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
YRG F+P+KC GCG+P+QRWYH+PRSWF PS N LVIFEEKGGDPTKITFS R ++
Sbjct: 683 YRGTFSPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDPTKITFSRRTVA 739
>gi|115441369|ref|NP_001044964.1| Os01g0875500 [Oryza sativa Japonica Group]
gi|75103778|sp|Q5N8X6.1|BGAL3_ORYSJ RecName: Full=Beta-galactosidase 3; Short=Lactase 3; Flags: Precursor
gi|56784847|dbj|BAD82087.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113534495|dbj|BAF06878.1| Os01g0875500 [Oryza sativa Japonica Group]
gi|222619622|gb|EEE55754.1| hypothetical protein OsJ_04267 [Oryza sativa Japonica Group]
Length = 851
Score = 998 bits (2579), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 469/717 (65%), Positives = 555/717 (77%), Gaps = 14/717 (1%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+VTYD RSLII+GRR L+IS +IHYPRSVP MWP LV +AK+GG + +E+YVFWNGHE +
Sbjct: 37 SVTYDHRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHEPA 96
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+YYF RF+LV+F KI++ A +YMILRIGPFVAAE+ +GG+PVWLHY PGTVFR + E
Sbjct: 97 QGQYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWTFGGVPVWLHYAPGTVFRTNNE 156
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK HM++F T IVDMMK+E+ FASQGG IILAQVENEYG E YG G K YA+WAA M
Sbjct: 157 PFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEYGDMEQAYGAGAKPYAMWAASM 216
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
A+AQN GVPWIMCQQ+D PDPVINTCNSFYCDQF P+SP+ PK WTENWPGWF+TFG +
Sbjct: 217 ALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQFKPNSPTKPKFWTENWPGWFQTFGESN 276
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
PHRP ED+AFSVARFF KGGS+ NYY+YHGGTNFGRT GGPFITTSYDY+APIDEYGL R
Sbjct: 277 PHRPPEDVAFSVARFFGKGGSLQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGLRR 336
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDK 386
PKW HL++LH +IKL EH LL G S +SLG QEADVY D SG C AFL+N+D + DK
Sbjct: 337 LPKWAHLRDLHKSIKLGEHTLLYGNSSFVSLGPQQEADVYTDQSGGCVAFLSNVDSEKDK 396
Query: 387 TVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL 446
V F++ SY LPAWSVSILPDCK V FNTA VR+Q+ ++MVP NL+ S+
Sbjct: 397 VVTFQSRSYDLPAWSVSILPDCKNVAFNTAKVRSQTLMMDMVPANLESSKVD-------- 448
Query: 447 KWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
W +F+E GIWG D V++GFVDHINTTKD+TDYLWYTTS V+ + G VL
Sbjct: 449 GWSIFREKYGIWGNIDLVRNGFVDHINTTKDSTDYLWYTTSFDVDGSH---LAGGNHVLH 505
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
IESKGHA+ AF N EL GSA GNG+ F + P++L+AGKN+++LLSMTVGLQN GP Y
Sbjct: 506 IESKGHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAGKNKLSLLSMTVGLQNGGPMY 565
Query: 567 EWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKN 626
EW GAGITSVKI+G + +DLS+ W YKIGL+GE+ ++ +I W+ EPPKN
Sbjct: 566 EWAGAGITSVKISGMENRIIDLSSNKWEYKIGLEGEYYSLFKADKGKDIRWMPQSEPPKN 625
Query: 627 QPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECD 686
QP+TWYK V P GD+P+GLDM MGKGLAWLNG IGRYWPR S S D C CD
Sbjct: 626 QPMTWYKVNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWPRISPVS---DRCTSSCD 682
Query: 687 YRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
YRG F+P+KC GCG+P+QRWYH+PRSWF PS N LVIFEEKGGDPTKITFS R ++
Sbjct: 683 YRGTFSPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDPTKITFSRRTVA 739
>gi|215734965|dbj|BAG95687.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 919
Score = 998 bits (2579), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 469/717 (65%), Positives = 555/717 (77%), Gaps = 14/717 (1%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+VTYD RSLII+GRR L+IS +IHYPRSVP MWP LV +AK+GG + +E+YVFWNGHE +
Sbjct: 105 SVTYDHRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHEPA 164
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+YYF RF+LV+F KI++ A +YMILRIGPFVAAE+ +GG+PVWLHY PGTVFR + E
Sbjct: 165 QGQYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWTFGGVPVWLHYAPGTVFRTNNE 224
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK HM++F T IVDMMK+E+ FASQGG IILAQVENEYG E YG G K YA+WAA M
Sbjct: 225 PFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEYGDMEQAYGAGAKPYAMWAASM 284
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
A+AQN GVPWIMCQQ+D PDPVINTCNSFYCDQF P+SP+ PK WTENWPGWF+TFG +
Sbjct: 285 ALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQFKPNSPTKPKFWTENWPGWFQTFGESN 344
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
PHRP ED+AFSVARFF KGGS+ NYY+YHGGTNFGRT GGPFITTSYDY+APIDEYGL R
Sbjct: 345 PHRPPEDVAFSVARFFGKGGSLQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGLRR 404
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDK 386
PKW HL++LH +IKL EH LL G S +SLG QEADVY D SG C AFL+N+D + DK
Sbjct: 405 LPKWAHLRDLHKSIKLGEHTLLYGNSSFVSLGPQQEADVYTDQSGGCVAFLSNVDSEKDK 464
Query: 387 TVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL 446
V F++ SY LPAWSVSILPDCK V FNTA VR+Q+ ++MVP NL+ S+
Sbjct: 465 VVTFQSRSYDLPAWSVSILPDCKNVAFNTAKVRSQTLMMDMVPANLESSKVD-------- 516
Query: 447 KWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
W +F+E GIWG D V++GFVDHINTTKD+TDYLWYTTS V+ + G VL
Sbjct: 517 GWSIFREKYGIWGNIDLVRNGFVDHINTTKDSTDYLWYTTSFDVDGSH---LAGGNHVLH 573
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
IESKGHA+ AF N EL GSA GNG+ F + P++L+AGKN+++LLSMTVGLQN GP Y
Sbjct: 574 IESKGHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAGKNKLSLLSMTVGLQNGGPMY 633
Query: 567 EWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKN 626
EW GAGITSVKI+G + +DLS+ W YKIGL+GE+ ++ +I W+ EPPKN
Sbjct: 634 EWAGAGITSVKISGMENRIIDLSSNKWEYKIGLEGEYYSLFKADKGKDIRWMPQSEPPKN 693
Query: 627 QPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECD 686
QP+TWYK V P GD+P+GLDM MGKGLAWLNG IGRYWPR S S D C CD
Sbjct: 694 QPMTWYKVNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWPRISPVS---DRCTSSCD 750
Query: 687 YRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
YRG F+P+KC GCG+P+QRWYH+PRSWF PS N LVIFEEKGGDPTKITFS R ++
Sbjct: 751 YRGTFSPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDPTKITFSRRTVA 807
>gi|357464799|ref|XP_003602681.1| Beta-galactosidase [Medicago truncatula]
gi|355491729|gb|AES72932.1| Beta-galactosidase [Medicago truncatula]
Length = 628
Score = 955 bits (2469), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 455/625 (72%), Positives = 533/625 (85%), Gaps = 9/625 (1%)
Query: 12 LLIFFSSSITYCFA-----GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQA 66
+L S+S+T+ NV+YD RSLII+G+R+L+ISA+IHYPRSVP MWP L+Q A
Sbjct: 6 ILCLVSTSLTFTLVYGGVGSNVSYDGRSLIIDGQRKLLISASIHYPRSVPAMWPALIQTA 65
Query: 67 KEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNY 126
KEGG++ IE+YVFWNGHELSPG YYFGGRF+LV+F K++Q A MY+ILRIGPFVAAE+N+
Sbjct: 66 KEGGIDVIETYVFWNGHELSPGNYYFGGRFDLVQFAKVVQDAGMYLILRIGPFVAAEWNF 125
Query: 127 GGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG 186
GG+PVWLHYIPGTVFR +PF +HM+KF T IV++MK+EKLFASQGGPIIL+Q+ENEYG
Sbjct: 126 GGVPVWLHYIPGTVFRTYNQPFMHHMEKFTTYIVNLMKKEKLFASQGGPIILSQIENEYG 185
Query: 187 YYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPS 246
YYE++Y E GK+YALWAAKMAV+QN VPWIMCQQ+D PDPVI+TCNSFYCDQFTP SP
Sbjct: 186 YYENYYKEDGKKYALWAAKMAVSQNTSVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPK 245
Query: 247 MPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
PK+WTENWPGWFKTFGGRDPHRP ED+AFSVARFFQKGGS++NYYMYHGGTNFGRTAGG
Sbjct: 246 RPKMWTENWPGWFKTFGGRDPHRPVEDVAFSVARFFQKGGSLNNYYMYHGGTNFGRTAGG 305
Query: 307 PFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVY 366
PFITTSYDY+APIDEYGLPR PKWGHLKELH AIKLCEH LL G+ N+SLG S EAD+Y
Sbjct: 306 PFITTSYDYDAPIDEYGLPRLPKWGHLKELHKAIKLCEHVLLYGKSVNISLGPSVEADIY 365
Query: 367 ADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVE 426
DSSGACAAF++N+DDKNDK VVFRN SYHLPAWSVSILPDCK VVFNTA V + ++ V
Sbjct: 366 TDSSGACAAFISNVDDKNDKKVVFRNASYHLPAWSVSILPDCKNVVFNTAKVSSPTNIVA 425
Query: 427 MVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTT 486
M+PE+LQ S D G K LKW VFKE GIWG+ADFVK+GFVDHINTTKDTTDYLW+TT
Sbjct: 426 MIPEHLQQS----DKGQKTLKWDVFKENPGIWGKADFVKNGFVDHINTTKDTTDYLWHTT 481
Query: 487 SIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAG 546
SI+++ NEEFLK GS+P LLIESKGH LHAF NQ+ QG+ +GNG+H F +KNPISL+AG
Sbjct: 482 SILIDANEEFLKKGSKPALLIESKGHTLHAFVNQKYQGTGTGNGSHSAFTFKNPISLRAG 541
Query: 547 KNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGI 606
KNEIA+LS+TVGLQ AGPFY+++GAG+TSVKI G N+ T+DLS+ +W YKIG+ GEHL I
Sbjct: 542 KNEIAILSLTVGLQTAGPFYDFIGAGVTSVKIIGLNNRTIDLSSNAWAYKIGVLGEHLSI 601
Query: 607 YNPGYRNNINWVSTMEPPKNQPLTW 631
Y N++ W ST EPPK Q LTW
Sbjct: 602 YQGEGMNSVKWTSTSEPPKGQALTW 626
>gi|116787095|gb|ABK24373.1| unknown [Picea sitchensis]
Length = 861
Score = 906 bits (2341), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/722 (59%), Positives = 527/722 (72%), Gaps = 7/722 (0%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A NVTYD RSL+I+G+R ++IS +IHYPRS P MWP ++Q+AK+GG++ IESYVFWN HE
Sbjct: 28 AANVTYDHRSLLIDGQRRVLISGSIHYPRSTPEMWPDIIQKAKDGGLDVIESYVFWNMHE 87
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
+YYF RF+LVKF+KI+QQA + + LRIGP+ AE+NYGG PVWLH IPG FR D
Sbjct: 88 PKQNEYYFEDRFDLVKFVKIVQQAGLLVHLRIGPYACAEWNYGGFPVWLHLIPGIHFRTD 147
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK MQ+F IVDMMK+EKLFASQGGPIILAQ+ENEYG + YG GK Y WAA
Sbjct: 148 NEPFKNEMQRFTAKIVDMMKQEKLFASQGGPIILAQIENEYGNIDGPYGAAGKSYVKWAA 207
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MAV N GVPW+MCQQ D PDP+INTCN FYCD FTP+SP+ PK+WTENW GWF +FGG
Sbjct: 208 SMAVGLNTGVPWVMCQQADAPDPIINTCNGFYCDAFTPNSPNKPKMWTENWSGWFLSFGG 267
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
R P RP+ED+AFSVARFFQ+GG+ NYYMYHGGTNFGRT GGPFI TSYDY+APIDEYG+
Sbjct: 268 RLPFRPTEDLAFSVARFFQRGGTFQNYYMYHGGTNFGRTTGGPFIATSYDYDAPIDEYGI 327
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
R PKWGHLKELH AIKLCE AL+N E + SLGS EA VY+ SG CAAFLAN + ++
Sbjct: 328 VRQPKWGHLKELHKAIKLCEAALVNAESNYTSLGSGLEAHVYSPGSGTCAAFLANSNTQS 387
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGS- 443
D TV F SYHLPAWSVSILPDCK VVFNTA + +Q+++V+M P NL + ++ G+
Sbjct: 388 DATVKFNGNSYHLPAWSVSILPDCKNVVFNTAKIGSQTTSVQMNPANLILAGSNSMKGTD 447
Query: 444 --KGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGS 501
W E GI G F K G ++ INTT D++DYLWYTTSI V++NE FL NG+
Sbjct: 448 SANAASWSWLHEQIGIGGSNTFSKPGLLEQINTTVDSSDYLWYTTSIQVDDNEPFLHNGT 507
Query: 502 RPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQN 561
+PVL ++S GHALH F N E G +G+ + + PI+LK+GKN I LLS+TVGLQN
Sbjct: 508 QPVLHVQSLGHALHVFINGEFAGRGAGSSSSSKIALQTPITLKSGKNNIDLLSITVGLQN 567
Query: 562 AGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVST 620
G F++ GAGIT V + GF G DLST WTY+IGL GE LGIY+ + + WV+
Sbjct: 568 YGSFFDTWGAGITGPVILQGFKDGEHDLSTQQWTYQIGLTGEQLGIYSGDTKASAQWVAG 627
Query: 621 MEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDE 680
+ P QP+ WYK P G++P+ L++L MGKG+AW+NG+ IGRYWP S
Sbjct: 628 SDLPTKQPMIWYKTNFDAPSGNDPVALNLLGMGKGVAWVNGQSIGRYWPSYIASQS---G 684
Query: 681 CVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
C CDYRG ++ KC T CG+PSQ+ YH+PRSW +P+ N+LV+FEE GGDPT+I+F R
Sbjct: 685 CTDSCDYRGAYSSTKCQTNCGQPSQKLYHVPRSWIQPTGNVLVLFEELGGDPTQISFMTR 744
Query: 741 KI 742
+
Sbjct: 745 SV 746
>gi|255578884|ref|XP_002530296.1| beta-galactosidase, putative [Ricinus communis]
gi|223530194|gb|EEF32103.1| beta-galactosidase, putative [Ricinus communis]
Length = 842
Score = 848 bits (2191), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/736 (56%), Positives = 525/736 (71%), Gaps = 17/736 (2%)
Query: 12 LLIFFSSSIT--YCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
L++FF S + FA NVTYD R+L+I+G+R ++IS +IHYPRS P MWPGL+Q++K+G
Sbjct: 7 LVVFFFSVVLAETSFAANVTYDHRALLIDGKRRVLISGSIHYPRSTPEMWPGLIQKSKDG 66
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ IE+YVFWNGHE +Y F GR++LVKF+K++ +A +Y+ +RIGP+V AE+NYGG
Sbjct: 67 GLDVIETYVFWNGHEPVRNQYNFEGRYDLVKFVKLVAEAGLYVHIRIGPYVCAEWNYGGF 126
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
P+WLH+IPG FR D EPFK MQ+F IVDMMK+EKL+ASQGGPIIL+Q+ENEYG +
Sbjct: 127 PLWLHFIPGIKFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNID 186
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
S +G K Y WAA MA++ + GVPW+MCQQ D PDPVINTCN FYCDQFTP+S + PK
Sbjct: 187 SAFGPAAKTYINWAAGMAISLDTGVPWVMCQQADAPDPVINTCNGFYCDQFTPNSKNKPK 246
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTENW GWF++FGG P+RP ED+AF+VARF+Q G+ NYYMYHGGTNFGRT GGPFI
Sbjct: 247 MWTENWSGWFQSFGGAVPYRPVEDLAFAVARFYQLSGTFQNYYMYHGGTNFGRTTGGPFI 306
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
+TSYDY+AP+DEYGL R PKWGHLK++H AIKLCE AL+ + + SLGS+ EA VY
Sbjct: 307 STSYDYDAPLDEYGLLRQPKWGHLKDVHKAIKLCEEALIATDPTTTSLGSNLEATVYKTG 366
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
S CAAFLAN+ DKTV F SY+LPAWSVSILPDCK V NTA + ++V +VP
Sbjct: 367 S-LCAAFLANI-ATTDKTVTFNGNSYNLPAWSVSILPDCKNVALNTAKI----NSVTIVP 420
Query: 430 ENLQPSEASPDNGSK--GLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTS 487
+ S + SK G W E GI FVKSG ++ INTT D +DYLWY+ S
Sbjct: 421 SFARQSLVGDVDSSKAIGSGWSWINEPVGISKNDAFVKSGLLEQINTTADKSDYLWYSLS 480
Query: 488 IIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGK 547
+ +E FL++GS+ VL +ES GHALHAF N +L GS +G ++ PI+L GK
Sbjct: 481 TNIKGDEPFLEDGSQTVLHVESLGHALHAFINGKLAGSGTGKSSNAKVTVDIPITLTPGK 540
Query: 548 NEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGI 606
N I LLS+TVGLQN G FYE GAGIT VK+ N T+DLS+ WTY+IGL+GE GI
Sbjct: 541 NTIDLLSLTVGLQNYGAFYELTGAGITGPVKLKAQNGNTVDLSSQQWTYQIGLKGEDSGI 600
Query: 607 YNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGR 666
+ ++ WVS PKNQPL WYK P G++P+ +D MGKG AW+NG+ IGR
Sbjct: 601 SS---GSSSEWVSQPTLPKNQPLIWYKTSFDAPAGNDPVAIDFTGMGKGEAWVNGQSIGR 657
Query: 667 YWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFE 726
YWP SP C C+YRG ++ +KC+ CG+PSQ +YHIPRSW K S NILV+ E
Sbjct: 658 YWP---TNVSPSSGCADSCNYRGGYSSNKCLKNCGKPSQTFYHIPRSWIKSSGNILVLLE 714
Query: 727 EKGGDPTKITFSIRKI 742
E GGDPT+I F+ R++
Sbjct: 715 EIGGDPTQIAFATRQV 730
>gi|110737487|dbj|BAF00686.1| beta-galactosidase [Arabidopsis thaliana]
Length = 532
Score = 847 bits (2187), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/538 (72%), Positives = 448/538 (83%), Gaps = 8/538 (1%)
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGR 265
MAV+QNIGVPW+MCQQ+D P VI+TCN FYCDQFTP++P PKIWTENWPGWFKTFGGR
Sbjct: 1 MAVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQFTPNTPDKPKIWTENWPGWFKTFGGR 60
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
DPHRP+ED+A+SVARFF KGGSVHNYYMYHGGTNFGRT+GGPFITTSYDYEAPIDEYGLP
Sbjct: 61 DPHRPAEDVAYSVARFFGKGGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGLP 120
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKND 385
R PKWGHLK+LH AI L E+ L++GE N +LG S EADVY DSSG CAAFL+N+DDKND
Sbjct: 121 RLPKWGHLKDLHKAIMLSENLLISGEHQNFTLGHSLEADVYTDSSGTCAAFLSNLDDKND 180
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
K V+FRN SYHLPAWSVSILPDCK VFNTA V ++SS VEM+PE+L+ S G
Sbjct: 181 KAVMFRNTSYHLPAWSVSILPDCKTEVFNTAKVTSKSSKVEMLPEDLK--------SSSG 232
Query: 446 LKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVL 505
LKW+VF E GIWG ADFVK+ VDHINTTKDTTDYLWYTTSI V+ENE FLK GS PVL
Sbjct: 233 LKWEVFSEKPGIWGAADFVKNELVDHINTTKDTTDYLWYTTSITVSENEAFLKKGSSPVL 292
Query: 506 LIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPF 565
IESKGH LH F N+E G+A+GNGTH PFK K P++LKAG+N I LLSMTVGL NAG F
Sbjct: 293 FIESKGHTLHVFINKEYLGTATGNGTHVPFKLKKPVALKAGENNIDLLSMTVGLANAGSF 352
Query: 566 YEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPK 625
YEWVGAG+TSV I GFN GTL+L+ W+YK+G++GEHL ++ PG + W T +PPK
Sbjct: 353 YEWVGAGLTSVSIKGFNKGTLNLTNSKWSYKLGVEGEHLELFKPGNSGAVKWTVTTKPPK 412
Query: 626 NQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQEC 685
QPLTWYK V++ P G EP+GLDM+ MGKG+AWLNGEEIGRYWPR +RK+SP+DECV+EC
Sbjct: 413 KQPLTWYKVVIEPPSGSEPVGLDMISMGKGMAWLNGEEIGRYWPRIARKNSPNDECVKEC 472
Query: 686 DYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
DYRGKF PDKC+TGCGEPSQRWYH+PRSWFK S N LVIFEEKGG+P KI S RK+S
Sbjct: 473 DYRGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGNELVIFEEKGGNPMKIKLSKRKVS 530
>gi|359478691|ref|XP_002285084.2| PREDICTED: beta-galactosidase 8-like [Vitis vinifera]
gi|297746241|emb|CBI16297.3| unnamed protein product [Vitis vinifera]
Length = 846
Score = 843 bits (2177), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/735 (54%), Positives = 514/735 (69%), Gaps = 10/735 (1%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
F L+ + T FA VTYD R+L+I+G+R ++IS +IHYPRS P MWP L+Q++K+G
Sbjct: 8 FVLVSLLGAIATTSFASTVTYDHRALVIDGKRRVLISGSIHYPRSTPDMWPDLIQKSKDG 67
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ IE+YVFWN HE +Y F GR +LVKF+K + +A +Y+ LRIGP+V AE+NYGG
Sbjct: 68 GLDVIETYVFWNLHEPVRRQYDFKGRNDLVKFVKTVAEAGLYVHLRIGPYVCAEWNYGGF 127
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
P+WLH+IPG FR D PFK MQ F IVDMMK+E L+ASQGGPIIL+Q+ENEYG +
Sbjct: 128 PLWLHFIPGIQFRTDNGPFKEEMQIFTAKIVDMMKKENLYASQGGPIILSQIENEYGNID 187
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
S YG K Y WAA MA + + GVPW+MCQQ D PDP+INTCN FYCDQFTP+S PK
Sbjct: 188 SAYGSAAKSYIQWAASMATSLDTGVPWVMCQQADAPDPMINTCNGFYCDQFTPNSVKKPK 247
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTENW GWF +FGG P+RP EDIAF+VARFFQ GG+ NYYMYHGGTNFGRT GGPFI
Sbjct: 248 MWTENWTGWFLSFGGAVPYRPVEDIAFAVARFFQLGGTFQNYYMYHGGTNFGRTTGGPFI 307
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TSYDY+APIDEYGL R PKWGHLK+LH AIKLCE AL+ + + SLG++ EA VY
Sbjct: 308 ATSYDYDAPIDEYGLLRQPKWGHLKDLHKAIKLCEAALIATDPTITSLGTNLEASVYKTG 367
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
+G+CAAFLAN+ +D TV F SYHLPAWSVSILPDCK V NTA + + + +
Sbjct: 368 TGSCAAFLANVRTNSDATVNFSGNSYHLPAWSVSILPDCKNVALNTAQINSMAVMPRFMQ 427
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSII 489
++L+ S D G W E GI F K G ++ IN T D +DYLWY+ S
Sbjct: 428 QSLKNDIDSSDGFQSGWSW--VDEPVGISKNNAFTKLGLLEQINITADKSDYLWYSLSTE 485
Query: 490 VNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNE 549
+ +E FL++GS+ VL +ES GHALHAF N +L GS +GN + P++L GKN
Sbjct: 486 IQGDEPFLEDGSQTVLHVESLGHALHAFINGKLAGSGTGNSGNAKVTVDIPVTLIHGKNT 545
Query: 550 IALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSG-TLDLSTYSWTYKIGLQGEHLGIY 607
I LLS+TVGLQN G FY+ GAGIT +K+ G +G T+DLS+ WTY++GLQGE LG+
Sbjct: 546 IDLLSLTVGLQNYGAFYDKQGAGITGPIKLKGLANGTTVDLSSQQWTYQVGLQGEELGLP 605
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+ ++ WV+ PK QPL WYK P G++P+ LD + MGKG AW+NG+ IGRY
Sbjct: 606 S---GSSSKWVAGSTLPKKQPLIWYKTTFDAPAGNDPVALDFMGMGKGEAWVNGQSIGRY 662
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
WP S + C C+YRG ++ +KC+ CG+PSQ+ YH+PRSW +PS N LV+FEE
Sbjct: 663 WP---AYVSSNGGCTSSCNYRGPYSSNKCLKNCGKPSQQLYHVPRSWLQPSGNTLVLFEE 719
Query: 728 KGGDPTKITFSIRKI 742
GGDPT+I+F+ +++
Sbjct: 720 IGGDPTQISFATKQV 734
>gi|15231354|ref|NP_187988.1| beta galactosidase 1 [Arabidopsis thaliana]
gi|75274602|sp|Q9SCW1.1|BGAL1_ARATH RecName: Full=Beta-galactosidase 1; Short=Lactase 1; Flags:
Precursor
gi|6686874|emb|CAB64737.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|9294020|dbj|BAB01923.1| beta-galactosidase [Arabidopsis thaliana]
gi|332641886|gb|AEE75407.1| beta galactosidase 1 [Arabidopsis thaliana]
Length = 847
Score = 834 bits (2155), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/737 (54%), Positives = 511/737 (69%), Gaps = 20/737 (2%)
Query: 7 IAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQA 66
++ LL F S++ G+V+YDSR++ ING+R ++IS +IHYPRS P MWP L+++A
Sbjct: 17 VSALFLLGFLVCSVS----GSVSYDSRAITINGKRRILISGSIHYPRSTPEMWPDLIRKA 72
Query: 67 KEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNY 126
KEGG++ I++YVFWNGHE SPGKYYF G ++LVKF+K++QQ+ +Y+ LRIGP+V AE+N+
Sbjct: 73 KEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFVKLVQQSGLYLHLRIGPYVCAEWNF 132
Query: 127 GGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG 186
GG PVWL YIPG FR D PFK MQ+F T IV+MMK E+LF SQGGPIIL+Q+ENEYG
Sbjct: 133 GGFPVWLKYIPGISFRTDNGPFKAQMQRFTTKIVNMMKAERLFESQGGPIILSQIENEYG 192
Query: 187 YYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPS 246
E G G+ Y WAAKMAV GVPW+MC+Q D PDP+IN CN FYCD F+P+
Sbjct: 193 PMEYELGAPGRSYTNWAAKMAVGLGTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAY 252
Query: 247 MPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
PK+WTE W GWF FGG P+RP+ED+AFSVARF QKGGS NYYMYHGGTNFGRTAGG
Sbjct: 253 KPKMWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGG 312
Query: 307 PFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVY 366
PFI TSYDY+AP+DEYGL R PKWGHLK+LH AIKLCE AL++GE + + LG+ QEA VY
Sbjct: 313 PFIATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTRMPLGNYQEAHVY 372
Query: 367 ADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVE 426
SGAC+AFLAN + K+ V F N Y+LP WS+SILPDCK V+NTA V AQ+S ++
Sbjct: 373 KSKSGACSAFLANYNPKSYAKVSFGNNHYNLPPWSISILPDCKNTVYNTARVGAQTSRMK 432
Query: 427 MVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTT 486
MV P +G GL WQ + E + + F G V+ INTT+DT+DYLWY T
Sbjct: 433 MV--------RVPVHG--GLSWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWYMT 482
Query: 487 SIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAG 546
+ V+ NE FL+NG P L + S GHA+H F N +L GSA G+ P ++ ++L+AG
Sbjct: 483 DVKVDANEGFLRNGDLPTLTVLSAGHAMHVFINGQLSGSAYGSLDSPKLTFRKGVNLRAG 542
Query: 547 KNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLG 605
N+IA+LS+ VGL N GP +E AG+ V + G N G DLS WTYK+GL+GE L
Sbjct: 543 FNKIAILSIAVGLPNVGPHFETWNAGVLGPVSLNGLNGGRRDLSWQKWTYKVGLKGESLS 602
Query: 606 IYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIG 665
+++ +++ W + QPLTWYK P GD P+ +DM MGKG W+NG+ +G
Sbjct: 603 LHSLSGSSSVEWAEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQSLG 662
Query: 666 RYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIF 725
R+WP S EC Y G F DKC+ CGE SQRWYH+PRSW KPS N+LV+F
Sbjct: 663 RHWPAYKAVGS-----CSECSYTGTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVF 717
Query: 726 EEKGGDPTKITFSIRKI 742
EE GGDP IT R++
Sbjct: 718 EEWGGDPNGITLVRREV 734
>gi|20260596|gb|AAM13196.1| galactosidase, putative [Arabidopsis thaliana]
Length = 847
Score = 834 bits (2154), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/737 (54%), Positives = 511/737 (69%), Gaps = 20/737 (2%)
Query: 7 IAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQA 66
++ LL F S++ G+V+YDSR++ ING+R ++IS +IHYPRS P MWP L+++A
Sbjct: 17 VSALFLLGFLVCSVS----GSVSYDSRAITINGKRRILISGSIHYPRSTPEMWPDLIRKA 72
Query: 67 KEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNY 126
KEGG++ I++YVFWNGHE SPGKYYF G ++LVKF+K++QQ+ +Y+ LRIGP+V AE+N+
Sbjct: 73 KEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFVKLVQQSGLYLHLRIGPYVCAEWNF 132
Query: 127 GGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG 186
GG PVWL YIPG FR D PFK MQ+F T IV+MMK E+LF SQGGPIIL+Q+ENEYG
Sbjct: 133 GGFPVWLKYIPGISFRTDNGPFKAQMQRFTTKIVNMMKAERLFESQGGPIILSQIENEYG 192
Query: 187 YYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPS 246
E G G+ Y WAAKMAV GVPW+MC+Q D PDP+IN CN FYCD F+P+
Sbjct: 193 PMEYELGAPGRSYTNWAAKMAVGLGTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAY 252
Query: 247 MPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
PK+WTE W GWF FGG P+RP+ED+AFSVARF QKGGS NYYMYHGGTNFGRTAGG
Sbjct: 253 KPKMWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGG 312
Query: 307 PFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVY 366
PFI TSYDY+AP+DEYGL R PKWGHLK+LH AIKLCE AL++GE + + LG+ QEA VY
Sbjct: 313 PFIATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTRMPLGNYQEAHVY 372
Query: 367 ADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVE 426
SGAC+AFLAN + K+ V F N Y+LP WS+SILPDCK V+NTA V AQ+S ++
Sbjct: 373 KSKSGACSAFLANYNPKSYAKVSFGNNHYNLPPWSISILPDCKNTVYNTARVGAQTSRMK 432
Query: 427 MVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTT 486
MV P +G GL WQ + E + + F G V+ INTT+DT+DYLWY T
Sbjct: 433 MV--------RVPVHG--GLSWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWYMT 482
Query: 487 SIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAG 546
+ V+ NE FL+NG P L + S GHA+H F N +L GSA G+ P ++ ++L+AG
Sbjct: 483 DVKVDANEGFLRNGDLPTLTVLSAGHAMHLFINGQLSGSAYGSLDSPKLTFRKGVNLRAG 542
Query: 547 KNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLG 605
N+IA+LS+ VGL N GP +E AG+ V + G N G DLS WTYK+GL+GE L
Sbjct: 543 FNKIAILSIAVGLPNVGPHFETWNAGVLGPVSLNGLNGGRRDLSWQKWTYKVGLKGESLS 602
Query: 606 IYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIG 665
+++ +++ W + QPLTWYK P GD P+ +DM MGKG W+NG+ +G
Sbjct: 603 LHSLSGSSSVEWAEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQSLG 662
Query: 666 RYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIF 725
R+WP S EC Y G F DKC+ CGE SQRWYH+PRSW KPS N+LV+F
Sbjct: 663 RHWPAYKAVGS-----CSECSYTGTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVF 717
Query: 726 EEKGGDPTKITFSIRKI 742
EE GGDP IT R++
Sbjct: 718 EEWGGDPNGITLVRREV 734
>gi|56201401|dbj|BAD20774.2| beta-galactosidase [Raphanus sativus]
Length = 851
Score = 832 bits (2149), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/717 (54%), Positives = 508/717 (70%), Gaps = 13/717 (1%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+VTYD R+L+I+G+R+++IS +IHYPRS P MWP L+Q++K+GG++ IE+YVFWNGHE
Sbjct: 32 SVTYDHRALVIDGKRKILISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNGHEPE 91
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
KY F GR++LVKF+K+ +A +Y+ LRIGP+ AE+NYGG PVWLH++PG FR D E
Sbjct: 92 KNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYACAEWNYGGFPVWLHFVPGIKFRTDNE 151
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK MQ+F IVD+MK+EKL+ASQGGPIIL+Q+ENEYG +S YG GK Y W+A M
Sbjct: 152 PFKAEMQRFTAKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSSYGAAGKSYMKWSASM 211
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
A++ + GVPW MCQQ D PDP+INTCN FYCDQFTP+S + PK+WTENW GWF FG
Sbjct: 212 ALSLDTGVPWNMCQQGDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGEPS 271
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
P+RP ED+AF+VARFFQ+GG+ NYYMYHGGTNF RT+GGP I+TSYDY+APIDEYGL R
Sbjct: 272 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFERTSGGPLISTSYDYDAPIDEYGLLR 331
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDK 386
PKWGHL++LH AIKLCE AL+ + SLGS+ EA VY S+G+CAAFLAN+ K+D
Sbjct: 332 QPKWGHLRDLHKAIKLCEDALIATDPKITSLGSNLEAAVYKTSTGSCAAFLANIGTKSDA 391
Query: 387 TVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL 446
TV F SY LPAWSVSILPDCK V FNTA + + + + ++L+P+ S + G
Sbjct: 392 TVTFNGKSYRLPAWSVSILPDCKNVAFNTAKINSATESTAFARQSLKPNADS--SAELGS 449
Query: 447 KWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
+W KE GI FVK G ++ INTT D +DYLWY+ + + +E FL GS+ VL
Sbjct: 450 QWSYIKEPVGISKADAFVKPGLLEQINTTADKSDYLWYSLRMDIKGDETFLDEGSKAVLH 509
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
++S G ++AF N +L G SGNG PI+L GKN I LLS+TVGL N GPF+
Sbjct: 510 VQSIGQLVYAFINGKLAG--SGNGKQ-KISLDIPINLVTGKNTIDLLSVTVGLANYGPFF 566
Query: 567 EWVGAGITS-VKITGFNSG-TLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPP 624
+ GAGIT V + +G + DLS+ WTY++GL+GE G+ G ++ WVS P
Sbjct: 567 DLTGAGITGPVSLKSAKTGSSTDLSSQQWTYQVGLKGEDKGL---GSGDSSEWVSNSPLP 623
Query: 625 KNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
+QPL WYK P G +P+ +D GKG+AW+NG+ IGRYWP ++ D CV
Sbjct: 624 TSQPLIWYKTTFDAPSGSDPVAIDFTGTGKGIAWVNGQSIGRYWPTSIART---DGCVGS 680
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRK 741
CDYRG + +KC+ CG+PSQ YH+PRSW KPS N LV+ EE GGDPTKI+F+ ++
Sbjct: 681 CDYRGSYRSNKCLKNCGKPSQTLYHVPRSWIKPSGNTLVLLEEMGGDPTKISFATKQ 737
>gi|356526021|ref|XP_003531618.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 843
Score = 829 bits (2141), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/732 (53%), Positives = 509/732 (69%), Gaps = 16/732 (2%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
LL+ F+ S+ + +V+YD +++IING+R +++S +IHYPRS P MWP L+Q+AKEGG+
Sbjct: 14 LLVVFACSLLGQASASVSYDHKAIIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKEGGL 73
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ I++YVFWNGHE SPGKYYFGG ++LV+FIK++QQA +Y+ LRIGP+V AE+N+GG PV
Sbjct: 74 DVIQTYVFWNGHEPSPGKYYFGGNYDLVRFIKLVQQAGLYVNLRIGPYVCAEWNFGGFPV 133
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL YIPG FR D PFK+ M+KF IVDMMK E+LF SQGGPIIL+Q+ENEYG E
Sbjct: 134 WLKYIPGISFRTDNGPFKFQMEKFTKKIVDMMKAERLFESQGGPIILSQIENEYGPMEYE 193
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
G G+ Y WAA MAV GVPWIMC+Q D PDP+INTCN FYCD F+P+ PK+W
Sbjct: 194 IGAPGRSYTQWAAHMAVGLGTGVPWIMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMW 253
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TE W GWF FGG PHRP+ED+AFS+ARF QKGGS NYYMYHGGTNFGRTAGGPFI T
Sbjct: 254 TEAWTGWFTEFGGAVPHRPAEDLAFSIARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIAT 313
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
SYDY+AP+DEYGL R PKWGHLK+LH AIKLCE AL++G+ + LG+ +EA V+ SG
Sbjct: 314 SYDYDAPLDEYGLARQPKWGHLKDLHRAIKLCEPALVSGDSTVQRLGNYEEAHVFRSKSG 373
Query: 372 ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPEN 431
ACAAFLAN + ++ TV F N Y+LP WS+SILP+CK V+NTA V +QS+T++M
Sbjct: 374 ACAAFLANYNPQSYATVAFGNQHYNLPPWSISILPNCKHTVYNTARVGSQSTTMKMT--- 430
Query: 432 LQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVN 491
P +G GL W+ F E ++ F +G ++ IN T+D +DYLWY+T +++N
Sbjct: 431 -----RVPIHG--GLSWKAFNEETTTTDDSSFTVTGLLEQINATRDLSDYLWYSTDVVIN 483
Query: 492 ENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIA 551
NE FL+NG PVL + S GHALH F N +L G+A G+ P + + L+AG N+I+
Sbjct: 484 SNEGFLRNGKNPVLTVLSAGHALHVFINNQLSGTAYGSLEAPKLTFSESVRLRAGVNKIS 543
Query: 552 LLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPG 610
LLS+ VGL N GP +E AG+ + ++G N G DL+ W+YK+GL+GE L +++
Sbjct: 544 LLSVAVGLPNVGPHFERWNAGVLGPITLSGLNEGRRDLTWQKWSYKVGLKGEALNLHSLS 603
Query: 611 YRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPR 670
+++ W+ + QPLTWYK P G P+ LDM MGKG W+NG+ +GRYWP
Sbjct: 604 GSSSVEWLQGFLVSRRQPLTWYKTTFDAPAGVAPLALDMGSMGKGQVWINGQSLGRYWPA 663
Query: 671 KSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGG 730
S C+Y G +N KC + CGE SQRWYH+P SW KPS N+LV+FEE GG
Sbjct: 664 YKASGS-----CGYCNYAGTYNEKKCGSNCGEASQRWYHVPHSWLKPSGNLLVVFEELGG 718
Query: 731 DPTKITFSIRKI 742
DP I R I
Sbjct: 719 DPNGIFLVRRDI 730
>gi|334184536|ref|NP_001189624.1| beta-galactosidase 8 [Arabidopsis thaliana]
gi|330253034|gb|AEC08128.1| beta-galactosidase 8 [Arabidopsis thaliana]
Length = 846
Score = 828 bits (2140), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/719 (54%), Positives = 506/719 (70%), Gaps = 13/719 (1%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A NVTYD R+L+I+G+R+++IS +IHYPRS P MWP L+Q++K+GG++ IE+YVFW+GHE
Sbjct: 23 AANVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHE 82
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
KY F GR++LVKF+K+ +A +Y+ LRIGP+V AE+NYGG PVWLH++PG FR D
Sbjct: 83 PEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTD 142
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK MQ+F T IVD+MK+EKL+ASQGGPIIL+Q+ENEYG +S YG K Y W+A
Sbjct: 143 NEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSA 202
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MA++ + GVPW MCQQ D PDP+INTCN FYCDQFTP+S + PK+WTENW GWF FG
Sbjct: 203 SMALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGD 262
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
P+RP ED+AF+VARF+Q+GG+ NYYMYHGGTNF RT+GGP I+TSYDY+APIDEYGL
Sbjct: 263 PSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYGL 322
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
R PKWGHL++LH AIKLCE AL+ + + SLGS+ EA VY SG+CAAFLAN+D K+
Sbjct: 323 LRQPKWGHLRDLHKAIKLCEDALIATDPTITSLGSNLEAAVYKTESGSCAAFLANVDTKS 382
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
D TV F SY+LPAWSVSILPDCK V FNTA + + + + ++L+P S +
Sbjct: 383 DATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKINSATESTAFARQSLKPDGGS--SAEL 440
Query: 445 GLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
G +W KE GI F+K G ++ INTT D +DYLWY+ + +E FL GS+ V
Sbjct: 441 GSQWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLDEGSKAV 500
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
L IES G ++AF N +L GS G PI+L G N I LLS+TVGL N G
Sbjct: 501 LHIESLGQVVYAFINGKLAGSGHG---KQKISLDIPINLVTGTNTIDLLSVTVGLANYGA 557
Query: 565 FYEWVGAGITS-VKITGFNSG-TLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
F++ VGAGIT V + G ++DL++ WTY++GL+GE G+ ++ WVS
Sbjct: 558 FFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGL---ATVDSSEWVSKSP 614
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
P QPL WYK P G EP+ +D GKG+AW+NG+ IGRYWP + + C
Sbjct: 615 LPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWP---TSIAGNGGCT 671
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRK 741
+ CDYRG + +KC+ CG+PSQ YH+PRSW KPS NILV+FEE GGDPT+I+F+ ++
Sbjct: 672 ESCDYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQISFATKQ 730
>gi|30683905|ref|NP_850121.1| beta-galactosidase 8 [Arabidopsis thaliana]
gi|152013364|sp|Q9SCV4.2|BGAL8_ARATH RecName: Full=Beta-galactosidase 8; Short=Lactase 8; AltName:
Full=Protein AR782; Flags: Precursor
gi|330253033|gb|AEC08127.1| beta-galactosidase 8 [Arabidopsis thaliana]
Length = 852
Score = 828 bits (2140), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/719 (54%), Positives = 506/719 (70%), Gaps = 13/719 (1%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A NVTYD R+L+I+G+R+++IS +IHYPRS P MWP L+Q++K+GG++ IE+YVFW+GHE
Sbjct: 29 AANVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHE 88
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
KY F GR++LVKF+K+ +A +Y+ LRIGP+V AE+NYGG PVWLH++PG FR D
Sbjct: 89 PEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTD 148
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK MQ+F T IVD+MK+EKL+ASQGGPIIL+Q+ENEYG +S YG K Y W+A
Sbjct: 149 NEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSA 208
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MA++ + GVPW MCQQ D PDP+INTCN FYCDQFTP+S + PK+WTENW GWF FG
Sbjct: 209 SMALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGD 268
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
P+RP ED+AF+VARF+Q+GG+ NYYMYHGGTNF RT+GGP I+TSYDY+APIDEYGL
Sbjct: 269 PSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYGL 328
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
R PKWGHL++LH AIKLCE AL+ + + SLGS+ EA VY SG+CAAFLAN+D K+
Sbjct: 329 LRQPKWGHLRDLHKAIKLCEDALIATDPTITSLGSNLEAAVYKTESGSCAAFLANVDTKS 388
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
D TV F SY+LPAWSVSILPDCK V FNTA + + + + ++L+P S +
Sbjct: 389 DATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKINSATESTAFARQSLKPDGGS--SAEL 446
Query: 445 GLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
G +W KE GI F+K G ++ INTT D +DYLWY+ + +E FL GS+ V
Sbjct: 447 GSQWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLDEGSKAV 506
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
L IES G ++AF N +L GS G PI+L G N I LLS+TVGL N G
Sbjct: 507 LHIESLGQVVYAFINGKLAGSGHG---KQKISLDIPINLVTGTNTIDLLSVTVGLANYGA 563
Query: 565 FYEWVGAGITS-VKITGFNSG-TLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
F++ VGAGIT V + G ++DL++ WTY++GL+GE G+ ++ WVS
Sbjct: 564 FFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGL---ATVDSSEWVSKSP 620
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
P QPL WYK P G EP+ +D GKG+AW+NG+ IGRYWP + + C
Sbjct: 621 LPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWP---TSIAGNGGCT 677
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRK 741
+ CDYRG + +KC+ CG+PSQ YH+PRSW KPS NILV+FEE GGDPT+I+F+ ++
Sbjct: 678 ESCDYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQISFATKQ 736
>gi|357113057|ref|XP_003558321.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 6-like
[Brachypodium distachyon]
Length = 852
Score = 828 bits (2140), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/725 (54%), Positives = 513/725 (70%), Gaps = 15/725 (2%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A NVTYD R+L+I+G R +++S +IHYPRS P MWPGL+Q+AK+GG++ +E+YVFW+ HE
Sbjct: 26 ATNVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLMQKAKDGGLDVVETYVFWDIHE 85
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
+ +Y F GR +LV+F+K +Y+ LRIGP+V AE+NYGG P+WLH+IPG FR D
Sbjct: 86 TATXQYDFEGRKDLVRFVKAAADTGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTD 145
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK MQ+F +V MK L+ASQGGPIIL+Q+ENEYG +S YG GK Y WAA
Sbjct: 146 NEPFKTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIRWAA 205
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MAVA + GVPW+MCQQ D PDP+INTCN FYCDQFTP+S S PK+WTENW GWF +FGG
Sbjct: 206 GMAVALDTGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSNSKPKLWTENWSGWFLSFGG 265
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
P+RP+ED+AF+VARF+Q+GG++ NYYMYHGGTNFGR++GGPFI+TSYDY+APIDEYGL
Sbjct: 266 AVPYRPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDEYGL 325
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
R PKWGHLK++H AIK CE AL+ + S +S+G + EA VY S CAAFLANMD ++
Sbjct: 326 VRQPKWGHLKDVHKAIKQCEPALIATDPSYMSMGQNAEAHVYKAGS-VCAAFLANMDTQS 384
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
DKTV F +Y LPAWSVSILPDCK VV NTA + +Q++T EM +L S + D S
Sbjct: 385 DKTVTFNGNAYKLPAWSVSILPDCKNVVLNTAQINSQTTTSEM--RSLGSSTKASDGSSI 442
Query: 445 GLK-----WQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKN 499
+ W E GI E K G ++ INTT D +D+LWY+TS++V E +L N
Sbjct: 443 ETELALSGWSYAIEPVGITTENALTKPGLMEQINTTADASDFLWYSTSVVVKGGEPYL-N 501
Query: 500 GSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGL 559
GS+ LL+ S GH L A+ N + GSA G+ T + PI+L GKN+I LLS TVGL
Sbjct: 502 GSQSNLLVNSLGHVLQAYINGKFAGSAKGSATSSLISLQTPITLVPGKNKIDLLSGTVGL 561
Query: 560 QNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV 618
N G F++ VGAGIT VK++G G LDLS+ WTY++GL+GE L +YNP + WV
Sbjct: 562 SNYGAFFDLVGAGITGPVKLSG-PKGVLDLSSTDWTYQVGLRGEGLHLYNPS-EASPEWV 619
Query: 619 STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPH 678
S P NQPL WYK+ P GD+P+ +D MGKG AW+NG+ IGRYWP +P
Sbjct: 620 SDKAYPTNQPLIWYKSKFTTPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWP---TNLAPQ 676
Query: 679 DECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFS 738
CV C+YRG ++ KC+ CG+PSQ YH+PRS+ +P N +V+FE+ GGDP+KI+F+
Sbjct: 677 SGCVNSCNYRGPYSSSKCLKKCGQPSQTLYHVPRSFLQPGSNDIVLFEQFGGDPSKISFT 736
Query: 739 IRKIS 743
++ +
Sbjct: 737 TKQTA 741
>gi|297822423|ref|XP_002879094.1| beta-glactosidase 8 [Arabidopsis lyrata subsp. lyrata]
gi|297324933|gb|EFH55353.1| beta-glactosidase 8 [Arabidopsis lyrata subsp. lyrata]
Length = 846
Score = 828 bits (2139), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/732 (53%), Positives = 511/732 (69%), Gaps = 13/732 (1%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
+L+ + A NVTYD R+L+I+G+R+++IS +IHYPRS P MWP L++++K+GG+
Sbjct: 10 ILLLILQIMMAATAVNVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIKKSKDGGL 69
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ IE+YVFW+GHE KY F GR++LVKF+K++++A +Y+ LRIGP+V AE+NYGG PV
Sbjct: 70 DVIETYVFWSGHEPEKNKYNFEGRYDLVKFVKLVEEAGLYVHLRIGPYVCAEWNYGGFPV 129
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WLH++PG FR D EPFK MQ+F T IVD+MK+EKL+ASQGGPIIL+Q+ENEYG +S
Sbjct: 130 WLHFVPGIKFRTDNEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSA 189
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
YG K Y W+A MA++ + GVPW MCQQ D PDP+INTCN FYCDQFTP+S S PK+W
Sbjct: 190 YGAAAKIYIKWSASMALSLDTGVPWNMCQQADAPDPMINTCNGFYCDQFTPNSNSKPKMW 249
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TENW GWF FG P+RP ED+AF+VARF+Q+GG+ NYYMYHGGTNF RT+GGP I+T
Sbjct: 250 TENWSGWFLGFGDPSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLIST 309
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
SYDY+APIDEYGL R PKWGHL++LH AIKLCE AL+ + + SLGS+ EA VY +SG
Sbjct: 310 SYDYDAPIDEYGLLRQPKWGHLRDLHKAIKLCEDALIATDPTISSLGSNLEAAVYKTASG 369
Query: 372 ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPEN 431
+CAAFLAN+ K+D TV F SYHLPAWSVSILPDCK V FNTA + + + ++
Sbjct: 370 SCAAFLANVGTKSDATVSFNGESYHLPAWSVSILPDCKNVAFNTAKINSATEPTAFARQS 429
Query: 432 LQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVN 491
L+P S + G +W KE GI F+K G ++ INTT D +DYLWY+ + +
Sbjct: 430 LKPDGGS--SAELGSEWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRMDIK 487
Query: 492 ENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIA 551
+E FL GS+ VL IES G ++AF N +L GS G PI+L AGKN +
Sbjct: 488 GDETFLDEGSKAVLHIESLGQVVYAFINGKLAGSGHGK---QKISLDIPINLAAGKNTVD 544
Query: 552 LLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSG-TLDLSTYSWTYKIGLQGEHLGIYNP 609
LLS+TVGL N G F++ VGAGIT V + G ++DL++ WTY++GL+GE G+
Sbjct: 545 LLSVTVGLANYGAFFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGL--- 601
Query: 610 GYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWP 669
++ WVS P QPL WYK P G EP+ +D GKG+AW+NG+ IGRYWP
Sbjct: 602 ATVDSSEWVSKSPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWP 661
Query: 670 RKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKG 729
+ + C CDYRG + +KC+ CG+PSQ YH+PRSW KPS N LV+FEE G
Sbjct: 662 ---TSIAGNGGCTDSCDYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNTLVLFEEMG 718
Query: 730 GDPTKITFSIRK 741
GDPT+I+F ++
Sbjct: 719 GDPTQISFGTKQ 730
>gi|224106752|ref|XP_002314274.1| predicted protein [Populus trichocarpa]
gi|222850682|gb|EEE88229.1| predicted protein [Populus trichocarpa]
Length = 849
Score = 828 bits (2139), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/734 (54%), Positives = 510/734 (69%), Gaps = 13/734 (1%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
+ L + T + NVTYD R+L+I+G+R +++S +IHYPRS MW L+Q++K+G
Sbjct: 14 YVFLSVLLTLATTSYGVNVTYDHRALLIDGKRRVLVSGSIHYPRSTVEMWADLIQKSKDG 73
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ IE+YVFWN HE +Y F GR++LVKFIK++ +A +Y LRIGP+V AE+NYGG
Sbjct: 74 GLDVIETYVFWNAHEPVQNQYNFEGRYDLVKFIKLVGEAGLYAHLRIGPYVCAEWNYGGF 133
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
P+WLH++PG FR D EPFK MQ+F IVDMMK+EKL+ASQGGPIIL+Q+ENEYG +
Sbjct: 134 PLWLHFVPGIKFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNID 193
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
S YG K Y WAA MAV+ + GVPW+MCQQ D PDP+INTCN FYCDQFTP+S + PK
Sbjct: 194 SSYGPAAKSYINWAASMAVSLDTGVPWVMCQQADAPDPIINTCNGFYCDQFTPNSKNKPK 253
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTENW GWF +FGG P+RP ED+AF+VARF+Q GG+ NYYMYHGGTNFGR+ GGPFI
Sbjct: 254 MWTENWSGWFLSFGGAVPYRPVEDLAFAVARFYQLGGTFQNYYMYHGGTNFGRSTGGPFI 313
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
+TSYDY+AP+DEYGL R PKWGHLK+LH +IKLCE AL+ + SLG + EA VY
Sbjct: 314 STSYDYDAPLDEYGLTRQPKWGHLKDLHKSIKLCEEALVATDPVTSSLGQNLEATVYKTG 373
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
+G C+AFLAN +DKTV F SY+LP WSVSILPDCK V NTA + + + V
Sbjct: 374 TGLCSAFLANF-GTSDKTVNFNGNSYNLPGWSVSILPDCKNVALNTAKINSMTVIPNFVH 432
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSII 489
++L S D + G W E GI FVK G ++ INTT D +DYLWY+ S +
Sbjct: 433 QSLIGDADSAD--TLGSSWSWIYEPVGISKNDAFVKPGLLEQINTTADKSDYLWYSLSTV 490
Query: 490 VNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNE 549
+ +NE FL++GS+ VL +ES GHALHAF N +L GS +GN + + P++L GKN
Sbjct: 491 IKDNEPFLEDGSQTVLHVESLGHALHAFVNGKLAGSGTGNAGNAKVAVEIPVTLLPGKNT 550
Query: 550 IALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSG-TLDLSTYSWTYKIGLQGEHLGIY 607
I LLS+T GLQN G F+E GAGIT VK+ G +G T+DLS+ WTY+IGL+GE LG+
Sbjct: 551 IDLLSLTAGLQNYGAFFELEGAGITGPVKLEGLKNGTTVDLSSLQWTYQIGLKGEELGLS 610
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+ N WV+ P QPL WYK P G++PI +D MGKG AW+NG+ IGRY
Sbjct: 611 S----GNSQWVTQPALPTKQPLIWYKTSFNAPAGNDPIAIDFSGMGKGEAWVNGQSIGRY 666
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
WP K SP C C+YRG ++ KC+ C +PSQ YH+PRSW + S N LV+FEE
Sbjct: 667 WP---TKVSPTSGC-SNCNYRGSYSSSKCLKNCAKPSQTLYHVPRSWVESSGNTLVLFEE 722
Query: 728 KGGDPTKITFSIRK 741
GGDPT+I F+ ++
Sbjct: 723 IGGDPTQIAFATKQ 736
>gi|297829920|ref|XP_002882842.1| hypothetical protein ARALYDRAFT_897617 [Arabidopsis lyrata subsp.
lyrata]
gi|297328682|gb|EFH59101.1| hypothetical protein ARALYDRAFT_897617 [Arabidopsis lyrata subsp.
lyrata]
Length = 847
Score = 828 bits (2138), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 398/737 (54%), Positives = 511/737 (69%), Gaps = 20/737 (2%)
Query: 7 IAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQA 66
++ LL F S++ G+V+YDSR++ ING+R ++IS +IHYPRS P MWP L+++A
Sbjct: 17 VSALFLLGFLVCSVS----GSVSYDSRAITINGKRRILISGSIHYPRSTPEMWPDLIRKA 72
Query: 67 KEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNY 126
KEGG++ I++YVFWNGHE SPGKYYF G ++LV+F+K++QQ+ +Y+ LRIGP+V AE+N+
Sbjct: 73 KEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVRFVKLVQQSGLYLHLRIGPYVCAEWNF 132
Query: 127 GGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG 186
GG PVWL YIPG FR D PFK MQ+F T IV+MMK E+LF SQGGPIIL+Q+ENEYG
Sbjct: 133 GGFPVWLKYIPGISFRTDNGPFKAQMQRFTTKIVNMMKAERLFESQGGPIILSQIENEYG 192
Query: 187 YYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPS 246
E G G+ Y WAAKMAV GVPW+MC+Q D PDP+IN CN FYCD F+P+
Sbjct: 193 PMEYELGAPGRSYTNWAAKMAVGLGTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAY 252
Query: 247 MPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
PK+WTE W GWF FGG P+RP+ED+AFSVARF QKGGS NYYMYHGGTNFGRTAGG
Sbjct: 253 KPKMWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGG 312
Query: 307 PFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVY 366
PFI TSYDY+AP+DEYGL R PKWGHLK+LH AIKLCE AL++GE + + LG+ QEA VY
Sbjct: 313 PFIATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTRMPLGNYQEAHVY 372
Query: 367 ADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVE 426
SGAC+AFLAN + K+ V F + Y+LP WS+SILPDCK V+NTA V AQ+S ++
Sbjct: 373 KAKSGACSAFLANYNPKSYAKVSFGSNHYNLPPWSISILPDCKNTVYNTARVGAQTSRMK 432
Query: 427 MVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTT 486
MV P +G GL WQ + E + + F G V+ INTT+DT+DYLWY T
Sbjct: 433 MV--------RVPVHG--GLSWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWYMT 482
Query: 487 SIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAG 546
+ ++ NE FL+NG P L + S GHA+H F N +L GSA G+ P ++ ++L+AG
Sbjct: 483 DVKIDANEGFLRNGDLPTLTVLSAGHAMHVFINGQLSGSAYGSLDSPKLTFRKGVNLRAG 542
Query: 547 KNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLG 605
N+IA+LS+ VGL N GP +E AG+ V + G + G DLS WTYK+GL+GE L
Sbjct: 543 FNKIAILSIAVGLPNVGPHFETWNAGVLGPVSLNGLSGGRRDLSWQKWTYKVGLKGESLS 602
Query: 606 IYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIG 665
+++ +++ W + QPLTWYK P GD P+ +DM MGKG W+NG+ +G
Sbjct: 603 LHSLSGSSSVEWAEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQSLG 662
Query: 666 RYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIF 725
R+WP S EC Y G F DKC+ CGE SQRWYH+PRSW KPS N+LV+F
Sbjct: 663 RHWPAYKAVGS-----CSECSYTGTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVF 717
Query: 726 EEKGGDPTKITFSIRKI 742
EE GGDP I+ R++
Sbjct: 718 EEWGGDPNGISLVRREV 734
>gi|6686888|emb|CAB64744.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 852
Score = 827 bits (2137), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/719 (54%), Positives = 506/719 (70%), Gaps = 13/719 (1%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A NVTYD R+L+I+G+R+++IS +IHYPRS P MWP L+Q++K+GG++ IE+YVFW+GHE
Sbjct: 29 AANVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHE 88
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
KY F GR++LVKF+K+ +A +Y+ LRIGP+V AE+NYGG PVWLH++PG FR D
Sbjct: 89 PEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTD 148
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK MQ+F T IVD+MK+EKL+ASQGGPIIL+Q+ENEYG +S YG K Y W+A
Sbjct: 149 NEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSA 208
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MA++ + GVPW MCQQ D PDP+INTCN FYCDQFTP+S + PK+WTENW GWF FG
Sbjct: 209 SMALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGD 268
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
P+RP ED+AF+VARF+Q+GG+ NYYMYHGGTNF RT+GGP I+TSYDY+APIDEYGL
Sbjct: 269 PSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYGL 328
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
R PKWGHL++LH AIKLCE AL+ + + SLGS+ EA VY SG+CAAFLAN+D K+
Sbjct: 329 LRQPKWGHLRDLHKAIKLCEDALIATDPTITSLGSNLEAAVYKTESGSCAAFLANVDTKS 388
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
D TV F SY+LPAWSVSILPDCK V FNTA + + + + ++L+P S +
Sbjct: 389 DATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKINSATESTAFARQSLKPDGGS--SAEL 446
Query: 445 GLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
G +W KE GI F+K G ++ INTT D +DYLWY+ + +E FL GS+ V
Sbjct: 447 GSQWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLDEGSKAV 506
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
L IES G ++AF N +L GS G PI+L G N I LLS+TVGL N G
Sbjct: 507 LHIESLGQVVYAFINGKLAGSGHG---KQKISLDIPINLVTGTNTIDLLSVTVGLANYGA 563
Query: 565 FYEWVGAGITS-VKITGFNSG-TLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
F++ +GAGIT V + G ++DL++ WTY++GL+GE G+ ++ WVS
Sbjct: 564 FFDLMGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGL---ATVDSSEWVSKSP 620
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
P QPL WYK P G EP+ +D GKG+AW+NG+ IGRYWP + + C
Sbjct: 621 LPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWP---TSIAGNGGCT 677
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRK 741
+ CDYRG + +KC+ CG+PSQ YH+PRSW KPS NILV+FEE GGDPT+I+F+ ++
Sbjct: 678 ESCDYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQISFATKQ 736
>gi|385203117|gb|ADO34790.3| beta-galactosidase STBG5 [Solanum lycopersicum]
Length = 852
Score = 823 bits (2127), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/734 (53%), Positives = 511/734 (69%), Gaps = 15/734 (2%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
++F + FA NVTYD R+L+++GRR ++IS +IHYPRS P MWP L+Q++K+GG++
Sbjct: 18 VVFLHCLVMTSFAANVTYDHRALVVDGRRRVLISGSIHYPRSTPDMWPDLIQKSKDGGLD 77
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
IE+YVFWN HE +Y F GR +L+ F+K++++A +++ +RIGP+V AE+NYGG P+W
Sbjct: 78 VIETYVFWNLHEPVRNQYDFEGRKDLINFVKLVEKAGLFVHIRIGPYVCAEWNYGGFPLW 137
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGY--YES 190
LH+IPG FR D EPFK M++F IVDM+K+E L+ASQGGP+IL+Q+ENEYG ES
Sbjct: 138 LHFIPGIEFRTDNEPFKAEMKRFTAKIVDMIKQENLYASQGGPVILSQIENEYGNGDIES 197
Query: 191 FYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKI 250
YG K Y WAA MA + N GVPW+MCQQ D P VINTCN FYCDQF +S PK+
Sbjct: 198 RYGPRAKPYVNWAASMATSLNTGVPWVMCQQPDAPPSVINTCNGFYCDQFKQNSDKTPKM 257
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT 310
WTENW GWF +FGG P+RP EDIAF+VARFFQ+GG+ NYYMYHGGTNFGRT+GGPFI
Sbjct: 258 WTENWTGWFLSFGGPVPYRPVEDIAFAVARFFQRGGTFQNYYMYHGGTNFGRTSGGPFIA 317
Query: 311 TSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSS 370
TSYDY+AP+DEYGL PKWGHLK+LH AIKLCE A++ E + SLGS+ E VY S
Sbjct: 318 TSYDYDAPLDEYGLINQPKWGHLKDLHKAIKLCEAAMVATEPNITSLGSNIEVSVYKTDS 377
Query: 371 GACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPE 430
CAAFLAN ++D V F SYHLP WSVSILPDCK V F+TA + + S+ V
Sbjct: 378 -QCAAFLANTATQSDAAVSFNGNSYHLPPWSVSILPDCKNVAFSTAKINSASTISTFVTR 436
Query: 431 NLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIV 490
+ SEA GS W E GI E F + G ++ INTT D +DYLWY+ S+ +
Sbjct: 437 S---SEADASGGSLS-GWTSVNEPVGISNENAFTRMGLLEQINTTADKSDYLWYSLSVNI 492
Query: 491 NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
+E FL++GS VL +++ GH LHA+ N +L GS GN H F + P++L G+N+I
Sbjct: 493 KNDEPFLQDGSATVLHVKTLGHVLHAYINGKLSGSGKGNSRHSNFTIEVPVTLVPGENKI 552
Query: 551 ALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSG-TLDLSTYSWTYKIGLQGEHLGIYN 608
LLS TVGLQN G F++ GAGIT V++ GF +G T DLS+ WTY++GL+GE LG+ N
Sbjct: 553 DLLSATVGLQNYGAFFDLKGAGITGPVQLKGFKNGSTTDLSSKQWTYQVGLKGEDLGLSN 612
Query: 609 PGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYW 668
G + W S P NQPL WYKA P GD P+ +D MGKG AW+NG+ IGR+W
Sbjct: 613 GG---STLWKSQTALPTNQPLIWYKASFDAPAGDTPLSMDFTGMGKGEAWVNGQSIGRFW 669
Query: 669 PRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEK 728
P +P+D C C+YRG +N +KC+ CG+PSQ YH+PRSW K S N+LV+FEE
Sbjct: 670 P---AYIAPNDGCTDPCNYRGGYNAEKCLKNCGKPSQLLYHVPRSWLKSSGNVLVLFEEM 726
Query: 729 GGDPTKITFSIRKI 742
GGDPTK++F+ R+I
Sbjct: 727 GGDPTKLSFATREI 740
>gi|350537827|ref|NP_001234312.1| TBG5 protein precursor [Solanum lycopersicum]
gi|7939623|gb|AAF70824.1|AF154423_1 putative beta-galactosidase [Solanum lycopersicum]
Length = 852
Score = 823 bits (2127), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/734 (53%), Positives = 510/734 (69%), Gaps = 15/734 (2%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
++F + FA NVTYD R+L+++GRR ++IS +IHYPRS P MWP L+Q++K+GG++
Sbjct: 18 VVFLHCLVMTSFAANVTYDHRALVVDGRRRVLISGSIHYPRSTPDMWPDLIQKSKDGGLD 77
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
IE+YVFWN HE +Y F GR +L+ F+K++++A +++ +RIGP+V AE+NYGG P+W
Sbjct: 78 VIETYVFWNLHEPVRNQYDFEGRKDLINFVKLVERAGLFVHIRIGPYVCAEWNYGGFPLW 137
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGY--YES 190
LH+IPG FR D EPFK M++F IVDM+K+E L+ASQGGP+IL+Q+ENEYG ES
Sbjct: 138 LHFIPGIEFRTDNEPFKAEMKRFTAKIVDMIKQENLYASQGGPVILSQIENEYGNGDIES 197
Query: 191 FYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKI 250
YG K Y WAA MA + N GVPW+MCQQ D P VINTCN FYCDQF +S PK+
Sbjct: 198 RYGPRAKPYVNWAASMATSLNTGVPWVMCQQPDAPPSVINTCNGFYCDQFKQNSDKTPKM 257
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT 310
WTENW GWF +FGG P+RP EDIAF+VARFFQ+GG+ NYYMYHGGTNFGRT+GGPFI
Sbjct: 258 WTENWTGWFLSFGGPVPYRPVEDIAFAVARFFQRGGTFQNYYMYHGGTNFGRTSGGPFIA 317
Query: 311 TSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSS 370
TSYDY+AP+DEYGL PKWGHLK+LH AIKLCE A++ E + SLGS+ E VY S
Sbjct: 318 TSYDYDAPLDEYGLINQPKWGHLKDLHKAIKLCEAAMVATEPNVTSLGSNIEVSVYKTDS 377
Query: 371 GACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPE 430
CAAFLAN ++D V F SYHLP WSVSILPDCK V F+TA + + S+ V
Sbjct: 378 -QCAAFLANTATQSDAAVSFNGNSYHLPPWSVSILPDCKNVAFSTAKINSASTISTFVTR 436
Query: 431 NLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIV 490
+ SEA GS W E GI E F + G ++ INTT D +DYLWY+ S+ +
Sbjct: 437 S---SEADASGGSLS-GWTSVNEPVGISNENAFTRMGLLEQINTTADKSDYLWYSLSVNI 492
Query: 491 NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
+E FL++GS VL +++ GH LHA+ N L GS GN H F + P++L G+N+I
Sbjct: 493 KNDEPFLQDGSATVLHVKTLGHVLHAYINGRLSGSGKGNSRHSNFTIEVPVTLVPGENKI 552
Query: 551 ALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSG-TLDLSTYSWTYKIGLQGEHLGIYN 608
LLS TVGLQN G F++ GAGIT V++ GF +G T DLS+ WTY++GL+GE LG+ N
Sbjct: 553 DLLSATVGLQNYGAFFDLKGAGITGPVQLKGFKNGSTTDLSSKQWTYQVGLKGEDLGLSN 612
Query: 609 PGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYW 668
G + W S P NQPL WYKA P GD P+ +D MGKG AW+NG+ IGR+W
Sbjct: 613 GG---STLWKSQTALPTNQPLIWYKASFDAPAGDTPLSMDFTGMGKGEAWVNGQSIGRFW 669
Query: 669 PRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEK 728
P +P+D C C+YRG +N +KC+ CG+PSQ YH+PRSW K S N+LV+FEE
Sbjct: 670 P---AYIAPNDGCTDPCNYRGGYNAEKCLKNCGKPSQLLYHVPRSWLKSSGNVLVLFEEM 726
Query: 729 GGDPTKITFSIRKI 742
GGDPTK++F+ R+I
Sbjct: 727 GGDPTKLSFATREI 740
>gi|4510395|gb|AAD21482.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 839
Score = 823 bits (2125), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/722 (54%), Positives = 504/722 (69%), Gaps = 26/722 (3%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A NVTYD R+L+I+G+R+++IS +IHYPRS P MWP L+Q++K+GG++ IE+YVFW+GHE
Sbjct: 23 AANVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHE 82
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
KY F GR++LVKF+K+ +A +Y+ LRIGP+V AE+NYGG PVWLH++PG FR D
Sbjct: 83 PEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTD 142
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK MQ+F T IVD+MK+EKL+ASQGGPIIL+Q+ENEYG +S YG K Y W+A
Sbjct: 143 NEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSA 202
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MA++ + GVPW MCQQ D PDP+INTCN FYCDQFTP+S + PK+WTENW GWF FG
Sbjct: 203 SMALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGD 262
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
P+RP ED+AF+VARF+Q+GG+ NYYMYHGGTNF RT+GGP I+TSYDY+APIDEYGL
Sbjct: 263 PSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYGL 322
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
R PKWGHL++LH AIKLCE AL+ + + SLGS+ EA VY SG+CAAFLAN+D K+
Sbjct: 323 LRQPKWGHLRDLHKAIKLCEDALIATDPTITSLGSNLEAAVYKTESGSCAAFLANVDTKS 382
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
D TV F SY+LPAWSVSILPDCK V FNTA V+ S + +PD GS
Sbjct: 383 DATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKVKFNSIS------------KTPDGGSS 430
Query: 445 ---GLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGS 501
G +W KE GI F+K G ++ INTT D +DYLWY+ + +E FL GS
Sbjct: 431 AELGSQWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLDEGS 490
Query: 502 RPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQN 561
+ VL IES G ++AF N +L GS G PI+L G N I LLS+TVGL N
Sbjct: 491 KAVLHIESLGQVVYAFINGKLAGSGHG---KQKISLDIPINLVTGTNTIDLLSVTVGLAN 547
Query: 562 AGPFYEWVGAGITS-VKITGFNSG-TLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVS 619
G F++ VGAGIT V + G ++DL++ WTY++GL+GE G+ ++ WVS
Sbjct: 548 YGAFFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGL---ATVDSSEWVS 604
Query: 620 TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHD 679
P QPL WYK P G EP+ +D GKG+AW+NG+ IGRYWP + +
Sbjct: 605 KSPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWP---TSIAGNG 661
Query: 680 ECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSI 739
C + CDYRG + +KC+ CG+PSQ YH+PRSW KPS NILV+FEE GGDPT+I+F+
Sbjct: 662 GCTESCDYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQISFAT 721
Query: 740 RK 741
++
Sbjct: 722 KQ 723
>gi|356522482|ref|XP_003529875.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 845
Score = 823 bits (2125), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/728 (53%), Positives = 506/728 (69%), Gaps = 16/728 (2%)
Query: 16 FSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIE 75
F+ S+ + +V+YD +++ ING+R +++S +IHYPRS P MWP L+Q+AKEGG++ I+
Sbjct: 20 FACSLIGHASASVSYDHKAITINGQRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQ 79
Query: 76 SYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHY 135
+YVFWNGHE SPGKYYFGG ++LV+FIK++QQA +Y+ LRIGP+V AE+N+GG PVWL Y
Sbjct: 80 TYVFWNGHEPSPGKYYFGGNYDLVRFIKLVQQAGLYVNLRIGPYVCAEWNFGGFPVWLKY 139
Query: 136 IPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEG 195
IPG FR D PFK+ M+KF IVDMMK E+LF SQGGPIIL+Q+ENEYG E G
Sbjct: 140 IPGISFRTDNGPFKFQMEKFTKKIVDMMKAERLFESQGGPIILSQIENEYGPMEYEIGAP 199
Query: 196 GKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENW 255
G+ Y WAA MAV GVPWIMC+Q D PDP+INTCN FYCD F+P+ PK+WTE W
Sbjct: 200 GRAYTQWAAHMAVGLGTGVPWIMCKQEDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAW 259
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDY 315
GWF FGG PHRP+ED+AFS+ARF QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY
Sbjct: 260 TGWFTEFGGAVPHRPAEDLAFSIARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDY 319
Query: 316 EAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAA 375
+AP+DEYGLPR PKWGHLK+LH AIKLCE AL++G+ + LG+ +EA V+ SGACAA
Sbjct: 320 DAPLDEYGLPRQPKWGHLKDLHRAIKLCEPALVSGDPTVQQLGNYEEAHVFRSKSGACAA 379
Query: 376 FLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPS 435
FLAN + ++ TV F N Y+LP WS+SILP+CK V+NTA V +QS+T++M
Sbjct: 380 FLANYNPQSYATVAFGNQRYNLPPWSISILPNCKHTVYNTARVGSQSTTMKMT------- 432
Query: 436 EASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEE 495
P +G GL W+ F E ++ F +G ++ IN T+D +DYLWY+T +++N NE
Sbjct: 433 -RVPIHG--GLSWKAFNEETTTTDDSSFTVTGLLEQINATRDLSDYLWYSTDVVINSNEG 489
Query: 496 FLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSM 555
FL+NG PVL + S GHALH F N +L G+A G+ P + + L+AG N+I+LLS+
Sbjct: 490 FLRNGKNPVLTVLSAGHALHVFINNQLSGTAYGSLEAPKLTFSESVRLRAGVNKISLLSV 549
Query: 556 TVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNN 614
VGL N GP +E AG+ + ++G N G DL+ W+YK+GL+GE L +++ ++
Sbjct: 550 AVGLPNVGPHFERWNAGVLGPITLSGLNEGRRDLTWQKWSYKVGLKGEALNLHSLSGSSS 609
Query: 615 INWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRK 674
+ W+ + QPLTWYK P G P+ LDM MGKG W+NG+ +GRYWP
Sbjct: 610 VEWLQGFLVSRRQPLTWYKTTFDAPAGVAPLALDMGSMGKGQVWINGQSLGRYWPAYKAS 669
Query: 675 SSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTK 734
S C+Y G +N KC + CG+ SQRWYH+P SW KP+ N+LV+FEE GGDP
Sbjct: 670 GS-----CGYCNYAGTYNEKKCGSNCGQASQRWYHVPHSWLKPTGNLLVVFEELGGDPNG 724
Query: 735 ITFSIRKI 742
I R I
Sbjct: 725 IFLVRRDI 732
>gi|356550171|ref|XP_003543462.1| PREDICTED: beta-galactosidase 8-like isoform 1 [Glycine max]
Length = 840
Score = 822 bits (2122), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/734 (55%), Positives = 507/734 (69%), Gaps = 19/734 (2%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
F LL S ++ F NV YD R+L+I+G+R ++IS +IHYPRS P MWP L+Q++K+G
Sbjct: 11 FWLLCIHSPTL---FCANVEYDHRALVIDGKRRVLISGSIHYPRSTPEMWPDLIQKSKDG 67
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ IE+YVFWN +E G+Y F GR +LVKF+K + A +Y+ LRIGP+V AE+NYGG
Sbjct: 68 GLDVIETYVFWNLNEPVRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAEWNYGGF 127
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
P+WLH+IPG FR D EPFK M++F IVDM+K E L+ASQGGP+IL+Q+ENEYG +
Sbjct: 128 PLWLHFIPGIKFRTDNEPFKAEMKRFTAKIVDMIKEENLYASQGGPVILSQIENEYGNID 187
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
S YG GK Y WAA MA + + GVPW+MCQQ D PDP+INTCN FYCDQFTP+S + PK
Sbjct: 188 SAYGAAGKSYIKWAATMATSLDTGVPWVMCQQADAPDPIINTCNGFYCDQFTPNSNTKPK 247
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTENW GWF FGG P+RP ED+AF+VARFFQ+GG+ NYYMYHGGTNF RT+GGPFI
Sbjct: 248 MWTENWSGWFLPFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFI 307
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TSYDY+APIDEYG+ R PKWGHLKE+H AIKLCE AL+ + + SLG + EA VY
Sbjct: 308 ATSYDYDAPIDEYGIIRQPKWGHLKEVHKAIKLCEEALIATDPTITSLGPNLEAAVYKTG 367
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
S CAAFLAN+D K+D TV F SYHLPAWSVSILPDCK VV NTA + + S+
Sbjct: 368 S-VCAAFLANVDTKSDVTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKINSASAISSFTT 426
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSII 489
E+L+ S + S G W E GI F ++G ++ INTT D +DYLWY+ SI
Sbjct: 427 ESLKEDIGSSEASSTGWSW--ISEPVGISKADSFPQTGLLEQINTTADKSDYLWYSLSID 484
Query: 490 VNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNE 549
+ GS+ VL IES GHALHAF N +L GS +GN F P++L AGKN
Sbjct: 485 YKGDA-----GSQTVLHIESLGHALHAFINGKLAGSQTGNSGKYKFTVDIPVTLVAGKNT 539
Query: 550 IALLSMTVGLQNAGPFYEWVGAGITS-VKITGF-NSGTLDLSTYSWTYKIGLQGEHLGIY 607
I LLS+TVGLQN G F++ GAGIT V + G N TLDLS WTY++GL+GE LG+
Sbjct: 540 IDLLSLTVGLQNYGAFFDTWGAGITGPVILKGLANGNTLDLSYQKWTYQVGLKGEDLGLS 599
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+ ++ W S PKNQPL WYK P G +P+ +D MGKG AW+NG+ IGRY
Sbjct: 600 S---GSSGQWNSQSTFPKNQPLIWYKTTFAAPSGSDPVAIDFTGMGKGEAWVNGQSIGRY 656
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
WP + C C+YRG ++ KC CG+PSQ YH+PRSW KPS NILV+FEE
Sbjct: 657 WPTYVASDA---GCTDSCNYRGPYSASKCRRNCGKPSQTLYHVPRSWLKPSGNILVLFEE 713
Query: 728 KGGDPTKITFSIRK 741
KGGDPT+I+F ++
Sbjct: 714 KGGDPTQISFVTKQ 727
>gi|357453873|ref|XP_003597217.1| Beta-galactosidase [Medicago truncatula]
gi|355486265|gb|AES67468.1| Beta-galactosidase [Medicago truncatula]
Length = 833
Score = 820 bits (2117), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 400/722 (55%), Positives = 506/722 (70%), Gaps = 21/722 (2%)
Query: 24 FAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGH 83
F NV YD R+L+I+G+R ++IS +IHYPRS P MWP L+Q++K+GG++ IE+YVFWN H
Sbjct: 18 FCTNVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSKDGGLDVIETYVFWNLH 77
Query: 84 ELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRN 143
E G+Y F GR +LVKF+K + +A +Y+ LRIGP+V AE+NYGG P+WLH+IPG FR
Sbjct: 78 EPVKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRT 137
Query: 144 DTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
D EPFK M++F IVD+MK+EKL+ASQGGPIIL+Q+ENEYG +S YG GK Y WA
Sbjct: 138 DNEPFKAEMKRFTAKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSHYGSAGKSYINWA 197
Query: 204 AKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFG 263
AKMA + + GVPW+MCQQ D PDP+INTCN FYCDQFTP+S + PK+WTENW GWF +FG
Sbjct: 198 AKMATSLDTGVPWVMCQQGDAPDPIINTCNGFYCDQFTPNSNTKPKMWTENWSGWFLSFG 257
Query: 264 GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYG 323
G PHRP ED+AF+VARFFQ+GG+ NYYMYHGGTNF R+ GGPFI TSYDY+APIDEYG
Sbjct: 258 GAVPHRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRSTGGPFIATSYDYDAPIDEYG 317
Query: 324 LPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDK 383
+ R KWGHLK++H AIKLCE AL+ + SLG + EA VY S CAAFLAN+D K
Sbjct: 318 IIRQQKWGHLKDVHKAIKLCEEALIATDPKISSLGQNLEAAVYKTGS-VCAAFLANVDTK 376
Query: 384 NDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGS 443
NDKTV F SYHLPAWSVSILPDCK VV NTA + + S+ V E++ E S
Sbjct: 377 NDKTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKINSASAISNFVTEDISSLETSSS--- 433
Query: 444 KGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
KW E GI + K+G ++ INTT D +DYLWY+ S+ + ++ GS+
Sbjct: 434 ---KWSWINEPVGISKDDILSKTGLLEQINTTADRSDYLWYSLSLDLADDP-----GSQT 485
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
VL IES GHALHAF N +L G+ +GN PI+L +GKN+I LLS+TVGLQN G
Sbjct: 486 VLHIESLGHALHAFINGKLAGNQAGNSDKSKLNVDIPIALVSGKNKIDLLSLTVGLQNYG 545
Query: 564 PFYEWVGAGITS-VKITGFNSG--TLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVST 620
F++ VGAGIT V + G +G TLDLS+ WTY+IGL+GE LG+ ++ W S
Sbjct: 546 AFFDTVGAGITGPVILKGLKNGNNTLDLSSRKWTYQIGLKGEDLGLS---SGSSGGWNSQ 602
Query: 621 MEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDE 680
PKNQPL WYK P G P+ +D MGKG AW+NG+ IGRYWP ++
Sbjct: 603 STYPKNQPLVWYKTNFDAPSGSNPVAIDFTGMGKGEAWVNGQSIGRYWPTYVASNA---G 659
Query: 681 CVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
C C+YRG + KC CG+PSQ YH+PRS+ KP+ N LV+FEE GGDPT+I+F+ +
Sbjct: 660 CTDSCNYRGPYTSSKCRKNCGKPSQTLYHVPRSFLKPNGNTLVLFEENGGDPTQISFATK 719
Query: 741 KI 742
++
Sbjct: 720 QL 721
>gi|359482511|ref|XP_002279310.2| PREDICTED: beta-galactosidase-like [Vitis vinifera]
Length = 828
Score = 819 bits (2116), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/719 (54%), Positives = 502/719 (69%), Gaps = 18/719 (2%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A NV+YD R+++ING+R ++IS +IHYPRS P MWP L+Q+AKEGG++ I++YVFWNGHE
Sbjct: 14 AWNVSYDRRAIVINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHE 73
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
S GKYYF GR++LV+FIK+++QA +Y+ LRIGP+V AE+N+GG PVWL Y+ G FR +
Sbjct: 74 PSQGKYYFEGRYDLVRFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVQGINFRTN 133
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK+HMQ+F IVDMMK E LF SQGGPIIL+Q+ENEYG E G G+ Y WAA
Sbjct: 134 NEPFKWHMQRFTKKIVDMMKSEGLFESQGGPIILSQIENEYGPMEYEIGAPGRAYTEWAA 193
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
KMAV GVPW+MC+Q D PDP+INTCN FYCD F+P+ PK+WTE W GWF FGG
Sbjct: 194 KMAVGLGTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGG 253
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
PHRP+ED+AFSVARF QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+AP+DE+GL
Sbjct: 254 AVPHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEFGL 313
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
R PKWGHLK+LH AIKLCE AL++G+ + SLG+ +EA V+ SGACAAFLAN + ++
Sbjct: 314 LRQPKWGHLKDLHRAIKLCEPALISGDPTVTSLGNYEEAHVFHSKSGACAAFLANYNPRS 373
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
V FRN+ Y+LP WS+SILPDCK V+NTA + AQS+T++M P S
Sbjct: 374 YAKVSFRNMHYNLPPWSISILPDCKNTVYNTARLGAQSATMKMTPV------------SG 421
Query: 445 GLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
WQ + E + ++ F G ++ INTT+D +DYLWY+T + + NE FLK+G PV
Sbjct: 422 RFGWQSYNEETASYDDSSFAAVGLLEQINTTRDVSDYLWYSTDVKIGYNEGFLKSGRYPV 481
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
L + S GHALH F N L G+A G+ +P + + L+AG N IALLS+ VGL N GP
Sbjct: 482 LTVLSAGHALHVFINGRLSGTAYGSLENPKLTFSQGVKLRAGVNTIALLSIAVGLPNVGP 541
Query: 565 FYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEP 623
+E AG+ V + G N G DLS W+YK+GL+GE L +++ +++ WV
Sbjct: 542 HFETWNAGVLGPVSLNGLNEGRRDLSWQKWSYKVGLKGEALSLHSLSGSSSVEWVEGSLM 601
Query: 624 PKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQ 683
+ QPLTWYK P G+ P+ LDM MGKG W+NG+ +GRYWP D
Sbjct: 602 ARGQPLTWYKTTFNAPGGNTPLALDMGSMGKGQIWINGQNVGRYWPAYKATGGCGD---- 657
Query: 684 ECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
C+Y G ++ KC++ CGEPSQRWYH+P SW P+ N+LV+FEE GG+P I+ R+I
Sbjct: 658 -CNYAGTYSEKKCLSNCGEPSQRWYHVPHSWLSPTGNLLVVFEESGGNPAGISLVEREI 715
>gi|356556730|ref|XP_003546676.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 840
Score = 819 bits (2115), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/733 (53%), Positives = 512/733 (69%), Gaps = 18/733 (2%)
Query: 11 ALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGG 70
ALL+ FS + +V+YDS+++ ING+R ++IS +IHYPRS P MWP L+Q+AK+GG
Sbjct: 14 ALLLVFS--LIGSAKASVSYDSKAITINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGG 71
Query: 71 VNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIP 130
++ I++YVFWNGHE SPGKYYF G ++LVKFIK++QQA +Y+ LRIGP+V AE+N+GG P
Sbjct: 72 LDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFP 131
Query: 131 VWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYES 190
VWL YIPG FR D EPFK+ MQKF T IVD+MK E+L+ SQGGPII++Q+ENEYG E
Sbjct: 132 VWLKYIPGISFRTDNEPFKHQMQKFTTKIVDLMKAERLYESQGGPIIMSQIENEYGPMEY 191
Query: 191 FYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKI 250
G GK Y WAA+MA+ GVPW+MC+Q DTPDP+INTCN FYCD F+P+ PK+
Sbjct: 192 EIGAAGKAYTKWAAEMAMGLGTGVPWVMCKQDDTPDPLINTCNGFYCDYFSPNKAYKPKM 251
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT 310
WTE W GWF FGG PHRP+ED+AFSVARF QKGGS NYYMYHGGTNFGRTAGGPFI
Sbjct: 252 WTEAWTGWFTEFGGPVPHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIA 311
Query: 311 TSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSS 370
TSYDY+AP+DEYGL R PKWGHLK+LH AIKLCE AL++G+ + +G+ QEA V+ S
Sbjct: 312 TSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGDPTVTKIGNYQEAHVFKSKS 371
Query: 371 GACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPE 430
GACAAFLAN + K+ TV F N+ Y+LP WS+SILPDCK V+NTA V +QS+ ++M
Sbjct: 372 GACAAFLANYNPKSYATVAFGNMHYNLPPWSISILPDCKNTVYNTARVGSQSAQMKMT-- 429
Query: 431 NLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIV 490
P +G G W F E ++ F +G ++ +NTT+D +DYLWY+T +++
Sbjct: 430 ------RVPIHG--GFSWLSFNEETTTTDDSSFTMTGLLEQLNTTRDLSDYLWYSTDVVL 481
Query: 491 NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
+ NE FL+NG PVL + S GHALH F N +L G+A G+ P + + L+AG N+I
Sbjct: 482 DPNEGFLRNGKDPVLTVFSAGHALHVFINGQLSGTAYGSLEFPKLTFNEGVKLRAGVNKI 541
Query: 551 ALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNP 609
+LLS+ VGL N GP +E AG+ + ++G N G DLS W+YK+GL+GE L +++
Sbjct: 542 SLLSVAVGLPNVGPHFETWNAGVLGPISLSGLNEGRRDLSWQKWSYKVGLKGEILSLHSL 601
Query: 610 GYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWP 669
+++ W+ + QPLTWYK P G P+ LDM MGKG WLNG+ +GRYWP
Sbjct: 602 SGSSSVEWIQGSLVSQRQPLTWYKTTFDAPAGTAPLALDMDSMGKGQVWLNGQNLGRYWP 661
Query: 670 RKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKG 729
+ CDY G +N +KC + CGE SQRWYH+P+SW KP+ N+LV+FEE G
Sbjct: 662 AYKASGT-----CDYCDYAGTYNENKCRSNCGEASQRWYHVPQSWLKPTGNLLVVFEELG 716
Query: 730 GDPTKITFSIRKI 742
GDP I R I
Sbjct: 717 GDPNGIFLVRRDI 729
>gi|152013362|sp|Q10NX8.2|BGAL6_ORYSJ RecName: Full=Beta-galactosidase 6; Short=Lactase 6; Flags:
Precursor
Length = 858
Score = 818 bits (2113), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/727 (54%), Positives = 519/727 (71%), Gaps = 17/727 (2%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A NVTYD R+++I+G R +++S +IHYPRS P MWPGL+Q++K+GG++ IE+YVFW+ HE
Sbjct: 30 AANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFWDIHE 89
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
G+Y F GR +LV+F+K + A +Y+ LRIGP+V AE+NYGG PVWLH++PG FR D
Sbjct: 90 AVRGQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTD 149
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
E FK MQ+F +VD MK L+ASQGGPIIL+Q+ENEYG +S YG GK Y WAA
Sbjct: 150 NEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWAA 209
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MAV+ + GVPW+MCQQ D PDP+INTCN FYCDQFTP+S S PK+WTENW GWF +FGG
Sbjct: 210 GMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLSFGG 269
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
P+RP+ED+AF+VARF+Q+GG+ NYYMYHGGTNFGR+ GGPFI TSYDY+APIDEYG+
Sbjct: 270 AVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEYGM 329
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVY--ADSSGACAAFLANMDD 382
R PKWGHL+++H AIKLCE AL+ E S SLG + EA VY AD+S CAAFLAN+D
Sbjct: 330 VRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNS-ICAAFLANVDA 388
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEM--VPENLQPSEAS-- 438
++DKTV F +Y LPAWSVSILPDCK VV NTA + +Q +T EM + ++Q ++ S
Sbjct: 389 QSDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDTDDSLI 448
Query: 439 -PDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFL 497
P+ + G W E GI E K G ++ INTT D +D+LWY+TSI+V +E +L
Sbjct: 449 TPELATAG--WSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVKGDEPYL 506
Query: 498 KNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTV 557
NGS+ LL+ S GH L + N +L GSA G+ + + P++L GKN+I LLS TV
Sbjct: 507 -NGSQSNLLVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLSTTV 565
Query: 558 GLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNIN 616
GL N G F++ VGAG+T VK++G N G L+LS+ WTY+IGL+GE L +YNP +
Sbjct: 566 GLSNYGAFFDLVGAGVTGPVKLSGPN-GALNLSSTDWTYQIGLRGEDLHLYNPS-EASPE 623
Query: 617 WVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSS 676
WVS P NQPL WYK P GD+P+ +D MGKG AW+NG+ IGRYWP +
Sbjct: 624 WVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWP---TNLA 680
Query: 677 PHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKIT 736
P CV C+YRG ++ +KC+ CG+PSQ YH+PRS+ +P N LV+FE+ GGDP+ I+
Sbjct: 681 PQSGCVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMIS 740
Query: 737 FSIRKIS 743
F+ R+ S
Sbjct: 741 FTTRQTS 747
>gi|297743077|emb|CBI35944.3| unnamed protein product [Vitis vinifera]
Length = 841
Score = 818 bits (2112), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/717 (54%), Positives = 501/717 (69%), Gaps = 18/717 (2%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+V+YD R+++ING+R ++IS +IHYPRS P MWP L+Q+AKEGG++ I++YVFWNGHE S
Sbjct: 29 SVSYDRRAIVINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPS 88
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
GKYYF GR++LV+FIK+++QA +Y+ LRIGP+V AE+N+GG PVWL Y+ G FR + E
Sbjct: 89 QGKYYFEGRYDLVRFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVQGINFRTNNE 148
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK+HMQ+F IVDMMK E LF SQGGPIIL+Q+ENEYG E G G+ Y WAAKM
Sbjct: 149 PFKWHMQRFTKKIVDMMKSEGLFESQGGPIILSQIENEYGPMEYEIGAPGRAYTEWAAKM 208
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
AV GVPW+MC+Q D PDP+INTCN FYCD F+P+ PK+WTE W GWF FGG
Sbjct: 209 AVGLGTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGAV 268
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
PHRP+ED+AFSVARF QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+AP+DE+GL R
Sbjct: 269 PHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEFGLLR 328
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDK 386
PKWGHLK+LH AIKLCE AL++G+ + SLG+ +EA V+ SGACAAFLAN + ++
Sbjct: 329 QPKWGHLKDLHRAIKLCEPALISGDPTVTSLGNYEEAHVFHSKSGACAAFLANYNPRSYA 388
Query: 387 TVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL 446
V FRN+ Y+LP WS+SILPDCK V+NTA + AQS+T++M P S
Sbjct: 389 KVSFRNMHYNLPPWSISILPDCKNTVYNTARLGAQSATMKMTPV------------SGRF 436
Query: 447 KWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
WQ + E + ++ F G ++ INTT+D +DYLWY+T + + NE FLK+G PVL
Sbjct: 437 GWQSYNEETASYDDSSFAAVGLLEQINTTRDVSDYLWYSTDVKIGYNEGFLKSGRYPVLT 496
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
+ S GHALH F N L G+A G+ +P + + L+AG N IALLS+ VGL N GP +
Sbjct: 497 VLSAGHALHVFINGRLSGTAYGSLENPKLTFSQGVKLRAGVNTIALLSIAVGLPNVGPHF 556
Query: 567 EWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPK 625
E AG+ V + G N G DLS W+YK+GL+GE L +++ +++ WV +
Sbjct: 557 ETWNAGVLGPVSLNGLNEGRRDLSWQKWSYKVGLKGEALSLHSLSGSSSVEWVEGSLMAR 616
Query: 626 NQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQEC 685
QPLTWYK P G+ P+ LDM MGKG W+NG+ +GRYWP D C
Sbjct: 617 GQPLTWYKTTFNAPGGNTPLALDMGSMGKGQIWINGQNVGRYWPAYKATGGCGD-----C 671
Query: 686 DYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
+Y G ++ KC++ CGEPSQRWYH+P SW P+ N+LV+FEE GG+P I+ R+I
Sbjct: 672 NYAGTYSEKKCLSNCGEPSQRWYHVPHSWLSPTGNLLVVFEESGGNPAGISLVEREI 728
>gi|115451981|ref|NP_001049591.1| Os03g0255100 [Oryza sativa Japonica Group]
gi|108707232|gb|ABF95027.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113548062|dbj|BAF11505.1| Os03g0255100 [Oryza sativa Japonica Group]
gi|215695246|dbj|BAG90437.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 956
Score = 817 bits (2111), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/729 (54%), Positives = 519/729 (71%), Gaps = 17/729 (2%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A NVTYD R+++I+G R +++S +IHYPRS P MWPGL+Q++K+GG++ IE+YVFW+ HE
Sbjct: 128 AANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFWDIHE 187
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
G+Y F GR +LV+F+K + A +Y+ LRIGP+V AE+NYGG PVWLH++PG FR D
Sbjct: 188 AVRGQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTD 247
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
E FK MQ+F +VD MK L+ASQGGPIIL+Q+ENEYG +S YG GK Y WAA
Sbjct: 248 NEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWAA 307
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MAV+ + GVPW+MCQQ D PDP+INTCN FYCDQFTP+S S PK+WTENW GWF +FGG
Sbjct: 308 GMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLSFGG 367
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
P+RP+ED+AF+VARF+Q+GG+ NYYMYHGGTNFGR+ GGPFI TSYDY+APIDEYG+
Sbjct: 368 AVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEYGM 427
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVY--ADSSGACAAFLANMDD 382
R PKWGHL+++H AIKLCE AL+ E S SLG + EA VY AD+S CAAFLAN+D
Sbjct: 428 VRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNS-ICAAFLANVDA 486
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEM--VPENLQPSEAS-- 438
++DKTV F +Y LPAWSVSILPDCK VV NTA + +Q +T EM + ++Q ++ S
Sbjct: 487 QSDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDTDDSLI 546
Query: 439 -PDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFL 497
P+ + G W E GI E K G ++ INTT D +D+LWY+TSI+V +E +L
Sbjct: 547 TPELATAG--WSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVKGDEPYL 604
Query: 498 KNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTV 557
NGS+ LL+ S GH L + N +L GSA G+ + + P++L GKN+I LLS TV
Sbjct: 605 -NGSQSNLLVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLSTTV 663
Query: 558 GLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNIN 616
GL N G F++ VGAG+T VK++G N G L+LS+ WTY+IGL+GE L +YNP +
Sbjct: 664 GLSNYGAFFDLVGAGVTGPVKLSGPN-GALNLSSTDWTYQIGLRGEDLHLYNPS-EASPE 721
Query: 617 WVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSS 676
WVS P NQPL WYK P GD+P+ +D MGKG AW+NG+ IGRYWP +
Sbjct: 722 WVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWP---TNLA 778
Query: 677 PHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKIT 736
P CV C+YRG ++ +KC+ CG+PSQ YH+PRS+ +P N LV+FE+ GGDP+ I+
Sbjct: 779 PQSGCVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMIS 838
Query: 737 FSIRKISGF 745
F+ R+ S
Sbjct: 839 FTTRQTSSI 847
>gi|326506982|dbj|BAJ95568.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 853
Score = 817 bits (2110), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/723 (54%), Positives = 510/723 (70%), Gaps = 15/723 (2%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A NVTYD R+L+I+G R +++S +IHYPRS P MWPGL+Q+AK+GG++ +E+YVFW+ HE
Sbjct: 27 ATNVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLMQKAKDGGLDVVETYVFWDVHE 86
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
G+Y F GR +LV+F+K A +Y+ LRIGP+V AE+NYGG P+WLH+IPG R D
Sbjct: 87 PVRGQYDFEGRNDLVRFVKAAADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKLRTD 146
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK MQ+F +V MK L+ASQGGPIIL+Q+ENEYG + YG GK Y WAA
Sbjct: 147 NEPFKTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIAASYGAAGKSYIRWAA 206
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MAVA + GVPW+MCQQ D P+P+INTCN FYCDQFTP PS PK+WTENW GWF +FGG
Sbjct: 207 GMAVALDTGVPWVMCQQTDAPEPLINTCNGFYCDQFTPSLPSRPKLWTENWSGWFLSFGG 266
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
P+RP+ED+AF+VARF+Q+GG++ NYYMYHGGTNFGR++GGPFI+TSYDY+APIDEYGL
Sbjct: 267 AVPYRPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDEYGL 326
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
R PKWGHL+++H AIK+CE AL+ + S +SLG + EA VY S CAAFLAN+DD++
Sbjct: 327 VRQPKWGHLRDVHKAIKMCEPALIATDPSYMSLGQNAEAHVYKSGS-LCAAFLANIDDQS 385
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGS- 443
DKTV F +Y LPAWSVSILPDCK VV NTA + +Q ++ +M NL S + D S
Sbjct: 386 DKTVTFNGKAYKLPAWSVSILPDCKNVVLNTAQINSQVASTQM--RNLGFSTQASDGSSV 443
Query: 444 ----KGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKN 499
W E GI E K G ++ INTT D +D+LWY+TSI+V E +L N
Sbjct: 444 EAELAASSWSYAVEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVAGGEPYL-N 502
Query: 500 GSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGL 559
GS+ LL+ S GH L F N +L GS+ G+ + P++L GKN+I LLS TVGL
Sbjct: 503 GSQSNLLVNSLGHVLQVFINGKLAGSSKGSASSSLISLTTPVTLVTGKNKIDLLSATVGL 562
Query: 560 QNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV 618
N G F++ VGAGIT VK+TG GTLDLS+ WTY+IGL+GE L +YNP + WV
Sbjct: 563 TNYGAFFDLVGAGITGPVKLTG-PKGTLDLSSAEWTYQIGLRGEDLHLYNPS-EASPEWV 620
Query: 619 STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPH 678
S P N PLTWYK+ P GD+P+ +D MGKG AW+NG+ IGRYWP +P
Sbjct: 621 SDNSYPTNNPLTWYKSKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWP---TNIAPQ 677
Query: 679 DECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFS 738
CV C+YRG ++ KC+ CG+PSQ YH+PRS+ +P N +V+FE+ GG+P+KI+F+
Sbjct: 678 SGCVNSCNYRGSYSATKCLKKCGQPSQILYHVPRSFLQPGSNDIVLFEQFGGNPSKISFT 737
Query: 739 IRK 741
++
Sbjct: 738 TKQ 740
>gi|1168654|sp|P45582.1|BGAL_ASPOF RecName: Full=Beta-galactosidase; Short=Lactase; Flags: Precursor
gi|452712|emb|CAA54525.1| beta-galactosidase [Asparagus officinalis]
Length = 832
Score = 817 bits (2110), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/718 (54%), Positives = 496/718 (69%), Gaps = 23/718 (3%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+VTYD +S+IING+R ++IS +IHYPRS P MWP L+Q+AK+GG++ I++YVFWNGHE S
Sbjct: 26 SVTYDHKSVIINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPS 85
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PG+YYFGGR++LV+F+K+++QA +Y LRIGP+V AE+N+GG PVWL Y+PG FR D
Sbjct: 86 PGQYYFGGRYDLVRFLKLVKQAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGIHFRTDNG 145
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK M KF IV MMK E L+ +QGGPIIL+Q+ENEYG E + G GK Y WAAKM
Sbjct: 146 PFKAAMGKFTEKIVSMMKAEGLYETQGGPIILSQIENEYGPVEYYDGAAGKSYTNWAAKM 205
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
AV N GVPW+MC+Q D PDPVINTCN FYCD F+P+ + PK+WTE W GWF FGG
Sbjct: 206 AVGLNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKDNKPKMWTEAWTGWFTGFGGAV 265
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
P RP+ED+AF+VARF QKGGS NYYMYHGGTNFGRTAGGPFI+TSYDY+APIDEYGL R
Sbjct: 266 PQRPAEDMAFAVARFIQKGGSFINYYMYHGGTNFGRTAGGPFISTSYDYDAPIDEYGLLR 325
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDK 386
PKWGHL++LH AIKLCE AL++GE + SLG +QE+ VY S +CAAFLAN + +
Sbjct: 326 QPKWGHLRDLHKAIKLCEPALVSGEPTITSLGQNQESYVYRSKS-SCAAFLANFNSRYYA 384
Query: 387 TVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL 446
TV F + Y+LP WSVSILPDCK VFNTA V AQ++T++M G
Sbjct: 385 TVTFNGMHYNLPPWSVSILPDCKTTVFNTARVGAQTTTMKM-------------QYLGGF 431
Query: 447 KWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
W+ + E + F K G V+ ++TT D +DYLWYTT + + +NEEFLK G P L
Sbjct: 432 SWKAYTEDTDALNDNTFTKDGLVEQLSTTWDRSDYLWYTTYVDIAKNEEFLKTGKYPYLT 491
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
+ S GHA+H F N +L G+A G+ +P Y L AG N+I++LS++VGL N G +
Sbjct: 492 VMSAGHAVHVFINGQLSGTAYGSLDNPKLTYSGSAKLWAGSNKISILSVSVGLPNVGNHF 551
Query: 567 E-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPK 625
E W + V +TG N G DLS WTY+IGL GE L +++ +N+ W E +
Sbjct: 552 ETWNTGVLGPVTLTGLNEGKRDLSLQKWTYQIGLHGETLSLHSLTGSSNVEW---GEASQ 608
Query: 626 NQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQEC 685
QPLTWYK PPG+EP+ LDM MGKG W+NG+ IGRYWP S C
Sbjct: 609 KQPLTWYKTFFNAPPGNEPLALDMNTMGKGQIWINGQSIGRYWPAYKASGS-----CGSC 663
Query: 686 DYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
DYRG +N KC++ CGE SQRWYH+PRSW P+ N LV+ EE GGDPT I+ R ++
Sbjct: 664 DYRGTYNEKKCLSNCGEASQRWYHVPRSWLIPTGNFLVVLEEWGGDPTGISMVKRSVA 721
>gi|118488890|gb|ABK96254.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 846
Score = 816 bits (2109), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/717 (54%), Positives = 499/717 (69%), Gaps = 16/717 (2%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+V+YDS+++ ING+R ++IS +IHYPRS P MWP L+Q+AKEGG++ I++YVFWNGHE S
Sbjct: 32 SVSYDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPS 91
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PGKYYF G ++LVKF+K+ ++A +Y+ LRIGP++ AE+N+GG PVWL YIPG FR D
Sbjct: 92 PGKYYFEGNYDLVKFVKLAKEAGLYVHLRIGPYICAEWNFGGFPVWLKYIPGINFRTDNG 151
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK MQKF T IV+MMK E+LF +QGGPIIL+Q+ENEYG E G GK Y WAA+M
Sbjct: 152 PFKAQMQKFTTKIVNMMKAERLFETQGGPIILSQIENEYGPMEYEIGSPGKAYTKWAAEM 211
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
AV GVPW+MC+Q D PDP+INTCN FYCD F+P+ PK+WTE W GWF FGG
Sbjct: 212 AVGLRTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTQFGGPV 271
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
PHRP+ED+AFSVARF QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+AP+DEYGL R
Sbjct: 272 PHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLLR 331
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDK 386
PKWGHLK+LH AIKLCE AL++G+ + + LG+ QEA V+ +G CAAFLAN ++
Sbjct: 332 QPKWGHLKDLHRAIKLCEPALVSGDATVIPLGNYQEAHVFNYKAGGCAAFLANYHQRSFA 391
Query: 387 TVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL 446
V FRN+ Y+LP WS+SILPDCK V+NTA V AQS+ ++M P P +G G
Sbjct: 392 KVSFRNMHYNLPPWSISILPDCKNTVYNTARVGAQSARMKMTP--------VPMHG--GF 441
Query: 447 KWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
WQ + E G++ F G ++ INTT+D +DYLWY T + ++ +E FL++G PVL
Sbjct: 442 SWQAYNEEPSASGDSTFTMVGLLEQINTTRDVSDYLWYMTDVHIDPSEGFLRSGKYPVLG 501
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
+ S GHALH F N +L G+A G+ P + + L+AG N+I+LLS+ VGL N GP +
Sbjct: 502 VLSAGHALHVFINGQLSGTAYGSLDFPKLTFTQGVKLRAGVNKISLLSIAVGLPNVGPHF 561
Query: 567 EWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPK 625
E AGI V + G N G DLS W+YKIGL GE LG+++ +++ W +
Sbjct: 562 ETWNAGILGPVTLNGLNEGRRDLSWQKWSYKIGLHGEALGLHSISGSSSVEWAEGSLVAQ 621
Query: 626 NQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQEC 685
QPL+WYK P G+ P+ LDM MGKG W+NG+ +GR+WP + D C
Sbjct: 622 RQPLSWYKTTFNAPAGNSPLALDMGSMGKGQIWINGQHVGRHWPAYKASGTCGD-----C 676
Query: 686 DYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
Y G +N KC T CGE SQRWYH+P+SW KP+ N+LV+FEE GGDP I+ R +
Sbjct: 677 SYIGTYNEKKCSTNCGEASQRWYHVPQSWLKPTGNLLVVFEEWGGDPNGISLVRRDV 733
>gi|255572957|ref|XP_002527409.1| beta-galactosidase, putative [Ricinus communis]
gi|223533219|gb|EEF34975.1| beta-galactosidase, putative [Ricinus communis]
Length = 845
Score = 816 bits (2109), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/733 (53%), Positives = 507/733 (69%), Gaps = 17/733 (2%)
Query: 12 LLIFFSSSITYCFAGNVT-YDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGG 70
L++F + C + YDS+++ ING+R ++IS +IHYPRS P MWP L+Q+AKEGG
Sbjct: 15 LVVFLLLGLWVCSVSSSVSYDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGG 74
Query: 71 VNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIP 130
++ I++YVFWNGHE SPGKYYF G ++LVKFIK+++QA +Y+ LRIGP+V AE+N+GG P
Sbjct: 75 LDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFIKLVKQAGLYVHLRIGPYVCAEWNFGGFP 134
Query: 131 VWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYES 190
VWL Y+PG FR D PFK MQ+F T IV+MMK E+LF SQGGPIIL+Q+ENEYG E
Sbjct: 135 VWLKYVPGINFRTDNGPFKAQMQRFTTKIVNMMKAERLFESQGGPIILSQIENEYGPMEY 194
Query: 191 FYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKI 250
G G+ Y+ WAAKMAV GVPW+MC+Q D PDPVINTCN FYCD F+P+ P PK+
Sbjct: 195 ELGAPGQAYSKWAAKMAVGLGTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKPYKPKM 254
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT 310
WTE W GWF FGG P+RP+ED+AFSVARF QKGG+ NYYMYHGGTNFGRTAGGPFI
Sbjct: 255 WTEAWTGWFTEFGGAVPYRPAEDLAFSVARFIQKGGAFINYYMYHGGTNFGRTAGGPFIA 314
Query: 311 TSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSS 370
TSYDY+AP+DEYGL R PKWGHLK+LH AIKLCE AL++G S + LG+ QEA V+ S
Sbjct: 315 TSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGAPSVMPLGNYQEAHVFKSKS 374
Query: 371 GACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPE 430
GACAAFLAN + ++ V F N+ Y+LP WS+SILPDCK V+NTA + AQS+ ++M P
Sbjct: 375 GACAAFLANYNQRSFAKVSFGNMHYNLPPWSISILPDCKNTVYNTARIGAQSARMKMSPI 434
Query: 431 NLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIV 490
++ G WQ + E A G+ F+ G ++ INTT+D +DYLWY+T + +
Sbjct: 435 PMR----------GGFSWQAYSEEASTEGDNTFMMVGLLEQINTTRDVSDYLWYSTDVRI 484
Query: 491 NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
+ NE FL++G PVL + S GHALH F N +L G+A G+ P + + ++AG N I
Sbjct: 485 DSNEGFLRSGKYPVLTVLSAGHALHVFVNGQLSGTAYGSLESPKLTFSQGVKMRAGINRI 544
Query: 551 ALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNP 609
LLS+ VGL N GP +E AG+ V + G N G DLS WTYKIGL GE L +++
Sbjct: 545 YLLSIAVGLPNVGPHFETWNAGVLGPVTLNGLNEGRRDLSWQKWTYKIGLHGEALSLHSL 604
Query: 610 GYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWP 669
+++ W + QPL WYK P G+ P+ LDM MGKG W+NG+ +GRYWP
Sbjct: 605 SGSSSVEWAQGSFVSRKQPLMWYKTTFNAPAGNSPLALDMGSMGKGQVWINGQSVGRYWP 664
Query: 670 RKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKG 729
+ K+S + C+Y G FN KC+T CGE SQRWYH+PRSW + N+LV+FEE G
Sbjct: 665 --AYKASGN---CGVCNYAGTFNEKKCLTNCGEASQRWYHVPRSWLNTAGNLLVVFEEWG 719
Query: 730 GDPTKITFSIRKI 742
GDP I+ R++
Sbjct: 720 GDPNGISLVRREV 732
>gi|148906967|gb|ABR16628.1| unknown [Picea sitchensis]
Length = 836
Score = 815 bits (2106), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/734 (52%), Positives = 493/734 (67%), Gaps = 19/734 (2%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
FA L+ VTYD ++L+ING R ++IS +IHYPRS MWP L ++AK+G
Sbjct: 7 FAFLVLSVMLAVGGVECGVTYDHKALVINGERRILISGSIHYPRSTAEMWPDLFRKAKDG 66
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ I++YVFWN HE SPG Y F GRF+LVKF+K+ Q+A +Y+ LRIGP+V AE+N+GG
Sbjct: 67 GLDVIQTYVFWNMHEPSPGNYNFEGRFDLVKFVKLAQEAGLYVHLRIGPYVCAEWNFGGF 126
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL Y+PG FR D EPFK M+ F +VD+MK E LF SQGGPIILAQVENEY E
Sbjct: 127 PVWLKYVPGISFRTDNEPFKNAMEGFTKKVVDLMKSEGLFESQGGPIILAQVENEYKPEE 186
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
YG G +Y WAA+MAV + GVPW+MC+Q D PDPVINTCN FYCD F P+ P P
Sbjct: 187 MEYGLAGAQYMNWAAQMAVGMDTGVPWVMCKQDDAPDPVINTCNGFYCDNFVPNKPYKPT 246
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTE W GW+ FGG PHRP ED+AF+VARFF KGGS NYYMYHGGTNFGRTAGGPFI
Sbjct: 247 MWTEAWSGWYTEFGGASPHRPVEDLAFAVARFFVKGGSFVNYYMYHGGTNFGRTAGGPFI 306
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TSYDY+APIDEYGL R PKWGHLKELH AIKLCE AL++G+ SLG Q+A VY+
Sbjct: 307 ATSYDYDAPIDEYGLIRQPKWGHLKELHKAIKLCEPALVSGDPVVTSLGHFQQAYVYSAG 366
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
+G CAAF+ N D + V+F Y + WSVSILPDC+ VVFNTA V Q+S ++M P
Sbjct: 367 AGNCAAFIVNYDSNSVGRVIFNGQRYKIAPWSVSILPDCRNVVFNTAKVDVQTSQMKMTP 426
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSII 489
G W+ E + + G ++ IN T+D TDYLWY TS+
Sbjct: 427 VG-------------GFGWESIDENIASFEDNSISAVGLLEQINITRDNTDYLWYITSVE 473
Query: 490 VNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNE 549
V+E+E F+KNG PVL ++S G ALH F N +L GS G +P ++ + + L G N+
Sbjct: 474 VDEDEPFIKNGGLPVLTVQSAGDALHVFINDDLAGSQYGRKENPKVRFSSGVRLNVGTNK 533
Query: 550 IALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYN 608
I+LLSMTVGLQN GP +E AG+ + ++GF GT DLS+ W+Y+IGL+GE + ++
Sbjct: 534 ISLLSMTVGLQNIGPHFEMANAGVLGPITLSGFKDGTRDLSSQRWSYQIGLKGETMNLHT 593
Query: 609 PGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYW 668
G N + W+ + P++QPL WYKA P G++P+GLD+ MGKG AW+NG+ IGRYW
Sbjct: 594 SG-DNTVEWMKGVAVPQSQPLRWYKAEFDAPAGEDPLGLDLSSMGKGQAWVNGQSIGRYW 652
Query: 669 PRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEK 728
P + C C Y G + P KC T CG+ SQRWYH+PRSW +PS N LV+FEE
Sbjct: 653 PSYLAEGV----CSDGCSYEGTYRPHKCDTNCGQSSQRWYHVPRSWLQPSGNTLVLFEEI 708
Query: 729 GGDPTKITFSIRKI 742
GG+P+ ++ R +
Sbjct: 709 GGNPSGVSLVTRSV 722
>gi|224134551|ref|XP_002327432.1| predicted protein [Populus trichocarpa]
gi|222835986|gb|EEE74407.1| predicted protein [Populus trichocarpa]
Length = 839
Score = 815 bits (2105), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/717 (53%), Positives = 499/717 (69%), Gaps = 16/717 (2%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+V+YDS+++ ING+R ++IS +IHYPRS P MWP L+Q+AKEGG++ I++YVFWNGHE S
Sbjct: 25 SVSYDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPS 84
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PGKYYF G ++LVKF+K+ ++A +Y+ LRIGP++ AE+N+GG PVWL YIPG FR D
Sbjct: 85 PGKYYFEGNYDLVKFVKLAKEAGLYVHLRIGPYICAEWNFGGFPVWLKYIPGINFRTDNG 144
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK MQKF T +V+MMK E+LF +QGGPIIL+Q+ENEYG E G GK Y WAA+M
Sbjct: 145 PFKAQMQKFTTKVVNMMKAERLFETQGGPIILSQIENEYGPMEYEIGSPGKAYTKWAAEM 204
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
AV GVPW+MC+Q D PDP+INTCN FYCD F+P+ PK+WTE W GWF FGG
Sbjct: 205 AVGLRTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTQFGGPV 264
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
PHRP+ED+AFSVARF QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+AP+DEYGL R
Sbjct: 265 PHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLLR 324
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDK 386
PKWGHLK+LH AIKLCE AL++G+ + + LG+ QEA V+ +G CAAFLAN ++
Sbjct: 325 QPKWGHLKDLHRAIKLCEPALVSGDATVIPLGNYQEAHVFNYKAGGCAAFLANYHQRSFA 384
Query: 387 TVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL 446
V FRN+ Y+LP WS+SILPDCK V+NTA V AQS+ ++M P P +G G
Sbjct: 385 KVSFRNMHYNLPPWSISILPDCKNTVYNTARVGAQSARMKMTP--------VPMHG--GF 434
Query: 447 KWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
WQ + E G++ F G ++ INTT+D +DYLWY T + ++ +E FL++G PVL
Sbjct: 435 SWQAYNEEPSASGDSTFTMVGLLEQINTTRDVSDYLWYMTDVHIDPSEGFLRSGKYPVLG 494
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
+ S GHALH F N +L G+A G+ P + + L+AG N+I+LLS+ VGL N GP +
Sbjct: 495 VLSAGHALHVFINGQLSGTAYGSLDFPKLTFTQGVKLRAGVNKISLLSIAVGLPNVGPHF 554
Query: 567 EWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPK 625
E AGI V + G N G DLS W+YKIGL GE LG+++ +++ W +
Sbjct: 555 ETWNAGILGPVTLNGLNEGRRDLSWQKWSYKIGLHGEALGLHSISGSSSVEWAEGSLVAQ 614
Query: 626 NQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQEC 685
QPL+WYK P G+ P+ LDM MGKG W+NG+ +GR+WP + D C
Sbjct: 615 RQPLSWYKTTFNAPAGNSPLALDMGSMGKGQIWINGQHVGRHWPAYKASGTCGD-----C 669
Query: 686 DYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
Y G +N KC T CGE SQRWYH+P+SW KP+ N+LV+FEE GGDP I+ R +
Sbjct: 670 SYIGTYNEKKCSTNCGEASQRWYHVPQSWLKPTGNLLVVFEEWGGDPNGISLVRRDV 726
>gi|449458175|ref|XP_004146823.1| PREDICTED: beta-galactosidase 1-like [Cucumis sativus]
gi|449515710|ref|XP_004164891.1| PREDICTED: beta-galactosidase 1-like [Cucumis sativus]
Length = 841
Score = 814 bits (2102), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/727 (53%), Positives = 499/727 (68%), Gaps = 22/727 (3%)
Query: 23 CFAG------NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIES 76
CF G +V+YDS+++IING R ++IS +IHYPRS MWP L+Q+AKEGG++ IE+
Sbjct: 17 CFFGVLSVQASVSYDSKAIIINGHRRILISGSIHYPRSTSEMWPDLIQKAKEGGLDVIET 76
Query: 77 YVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYI 136
YVFWNGHE PGKYYF G ++LV+F+K++ QA +Y+ LRIGP+V AE+N+GG PVWL YI
Sbjct: 77 YVFWNGHEPEPGKYYFEGNYDLVRFVKLVHQAGLYVHLRIGPYVCAEWNFGGFPVWLKYI 136
Query: 137 PGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGG 196
PG FR D PFK+ M++F IV+MMK E+L+ SQGGPIIL+Q+ENEYG E G G
Sbjct: 137 PGISFRTDNAPFKFQMERFTRKIVNMMKAERLYESQGGPIILSQIENEYGPMEYELGAPG 196
Query: 197 KRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWP 256
K Y+ WAA+MA+ GVPW+MC+Q D PDP+INTCN FYCD F+P+ PK+WTE W
Sbjct: 197 KAYSKWAAQMALGLGTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWT 256
Query: 257 GWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYE 316
GWF FGG PHRP+ED+AF+VARF QKGG++ NYYMYHGGTNFGRTAGGPFI TSYDY+
Sbjct: 257 GWFTQFGGAVPHRPAEDMAFAVARFIQKGGALINYYMYHGGTNFGRTAGGPFIATSYDYD 316
Query: 317 APIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAF 376
APIDEYGL R PKWGHLK+L+ AIKLCE AL++G+ LG+ QEA V+ SGACAAF
Sbjct: 317 APIDEYGLLRQPKWGHLKDLNRAIKLCEPALVSGDPIVTRLGNYQEAHVFKSKSGACAAF 376
Query: 377 LANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSE 436
L+N + ++ TV F N+ Y++P WS+SILPDCK VFNTA V AQ++ ++M P + S
Sbjct: 377 LSNYNPRSYATVAFGNMHYNIPPWSISILPDCKNTVFNTARVGAQTAIMKMSPVPMHES- 435
Query: 437 ASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEF 496
WQ + E + E F G ++ INTT+D TDYLWYTT + ++ NE F
Sbjct: 436 ---------FSWQAYNEEPASYNEKAFTTVGLLEQINTTRDATDYLWYTTDVHIDANEGF 486
Query: 497 LKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMT 556
L++G PVL + S GHA+H F N +L G+A G+ P + ++L+AG N+IALLS+
Sbjct: 487 LRSGKYPVLTVLSAGHAMHVFVNGQLAGTAYGSLDFPKLTFSRGVNLRAGNNKIALLSIA 546
Query: 557 VGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNI 615
VGL N GP +E AGI V + G + G DL+ WTYKIGL GE + +++ +++
Sbjct: 547 VGLPNVGPHFEMWNAGILGPVNLNGLDEGRRDLTWQKWTYKIGLDGEAMSLHSLSGSSSV 606
Query: 616 NWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKS 675
W+ + QPLTW+K P G+ P+ LDM MGKG WLNG+ +GRYWP
Sbjct: 607 EWIQGSLVAQKQPLTWFKTTFNAPAGNSPLALDMGSMGKGQIWLNGQSLGRYWPAYKSTG 666
Query: 676 SPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
S CDY G +N KC + CGE SQRWYH+PRSW P+ N+LV+FEE GGDP I
Sbjct: 667 S-----CGSCDYTGTYNEKKCSSNCGEASQRWYHVPRSWLNPTGNLLVVFEEWGGDPNGI 721
Query: 736 TFSIRKI 742
R +
Sbjct: 722 HLVRRDV 728
>gi|356550446|ref|XP_003543598.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 841
Score = 814 bits (2102), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/733 (53%), Positives = 512/733 (69%), Gaps = 18/733 (2%)
Query: 11 ALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGG 70
ALL+ FS + +V+YDS+++ ING+R ++IS +IHYPRS P MWP L+Q+AK+GG
Sbjct: 15 ALLLAFS--LIGSAKASVSYDSKAITINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGG 72
Query: 71 VNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIP 130
++ I++YVFWNGHE SPGKYYF G ++LVKFIK++QQA +Y+ LRIGP+V AE+N+GG P
Sbjct: 73 LDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFP 132
Query: 131 VWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYES 190
VWL YIPG FR D EPFK MQKF T IVD+MK E+L+ SQGGPII++Q+ENEYG E
Sbjct: 133 VWLKYIPGISFRTDNEPFKVQMQKFTTKIVDLMKAERLYESQGGPIIMSQIENEYGPMEY 192
Query: 191 FYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKI 250
G GK Y WAA+MA+ GVPWIMC+Q DTPDP+INTCN FYCD F+P+ PK+
Sbjct: 193 EIGAAGKAYTKWAAEMAMELGTGVPWIMCKQDDTPDPLINTCNGFYCDYFSPNKAYKPKM 252
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT 310
WTE W GWF FGG PHRP+ED+AFSVARF QKGGS NYYMYHGGTNFGRTAGGPFI
Sbjct: 253 WTEAWTGWFTEFGGPVPHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIA 312
Query: 311 TSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSS 370
TSYDY+AP+DEYGL R PKWGHLK+LH AIKLCE AL++G+ + +G+ QEA V+ S
Sbjct: 313 TSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGDPTVTKIGNYQEAHVFKSMS 372
Query: 371 GACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPE 430
GACAAFLAN + K+ TV F N+ Y+LP WS+SILP+CK V+NTA V +QS+ ++M
Sbjct: 373 GACAAFLANYNPKSYATVAFGNMHYNLPPWSISILPNCKNTVYNTARVGSQSAQMKMT-- 430
Query: 431 NLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIV 490
P +G GL W F E ++ F +G ++ +NTT+D +DYLWY+T +++
Sbjct: 431 ------RVPIHG--GLSWLSFNEETTTTDDSSFTMTGLLEQLNTTRDLSDYLWYSTDVVL 482
Query: 491 NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
+ NE FL+NG PVL + S GHALH F N +L G+A G+ P + + L+ G N+I
Sbjct: 483 DPNEGFLRNGKDPVLTVFSAGHALHVFINGQLSGTAYGSLEFPKLTFNEGVKLRTGVNKI 542
Query: 551 ALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNP 609
+LLS+ VGL N GP +E AG+ + ++G N G DLS W+YK+GL+GE L +++
Sbjct: 543 SLLSVAVGLPNVGPHFETWNAGVLGPISLSGLNEGRRDLSWQKWSYKVGLKGETLSLHSL 602
Query: 610 GYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWP 669
G +++ W+ + QPLTWYK P G P+ LDM MGKG WLNG+ +GRYWP
Sbjct: 603 GGSSSVEWIQGSLVSQRQPLTWYKTTFDAPDGTAPLALDMNSMGKGQVWLNGQNLGRYWP 662
Query: 670 RKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKG 729
+ CDY G +N +KC + CGE SQRWYH+P+SW KP+ N+LV+FEE G
Sbjct: 663 AYKASGT-----CDYCDYAGTYNENKCRSNCGEASQRWYHVPQSWLKPTGNLLVVFEELG 717
Query: 730 GDPTKITFSIRKI 742
GD I+ R I
Sbjct: 718 GDLNGISLVRRDI 730
>gi|125583741|gb|EAZ24672.1| hypothetical protein OsJ_08441 [Oryza sativa Japonica Group]
Length = 861
Score = 813 bits (2099), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/730 (54%), Positives = 519/730 (71%), Gaps = 20/730 (2%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A NVTYD R+++I+G R +++S +IHYPRS P MWPGL+Q++K+GG++ IE+YVFW+ HE
Sbjct: 30 AANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFWDIHE 89
Query: 85 LSPGK---YYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVF 141
G+ Y F GR +LV+F+K + A +Y+ LRIGP+V AE+NYGG PVWLH++PG F
Sbjct: 90 AVRGQAQQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKF 149
Query: 142 RNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYAL 201
R D E FK MQ+F +VD MK L+ASQGGPIIL+Q+ENEYG +S YG GK Y
Sbjct: 150 RTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMR 209
Query: 202 WAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKT 261
WAA MAV+ + GVPW+MCQQ D PDP+INTCN FYCDQFTP+S S PK+WTENW GWF +
Sbjct: 210 WAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLS 269
Query: 262 FGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDE 321
FGG P+RP+ED+AF+VARF+Q+GG+ NYYMYHGGTNFGR+ GGPFI TSYDY+APIDE
Sbjct: 270 FGGAVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDE 329
Query: 322 YGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVY--ADSSGACAAFLAN 379
YG+ R PKWGHL+++H AIKLCE AL+ E S SLG + EA VY AD+S CAAFLAN
Sbjct: 330 YGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNS-ICAAFLAN 388
Query: 380 MDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEM--VPENLQPSEA 437
+D ++DKTV F +Y LPAWSVSILPDCK VV NTA + +Q +T EM + ++Q ++
Sbjct: 389 VDAQSDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDTDD 448
Query: 438 S---PDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENE 494
S P+ + G W E GI E K G ++ INTT D +D+LWY+TSI+V +E
Sbjct: 449 SLITPELATAG--WSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVKGDE 506
Query: 495 EFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLS 554
+L NGS+ LL+ S GH L + N +L GSA G+ + + P++L GKN+I LLS
Sbjct: 507 PYL-NGSQSNLLVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLS 565
Query: 555 MTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRN 613
TVGL N G F++ VGAG+T VK++G N G L+LS+ WTY+IGL+GE L +YNP
Sbjct: 566 TTVGLSNYGAFFDLVGAGVTGPVKLSGPN-GALNLSSTDWTYQIGLRGEDLHLYNPS-EA 623
Query: 614 NINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSR 673
+ WVS P NQPL WYK P GD+P+ +D MGKG AW+NG+ IGRYWP
Sbjct: 624 SPEWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWP---T 680
Query: 674 KSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPT 733
+P CV C+YRG ++ +KC+ CG+PSQ YH+PRS+ +P N LV+FE+ GGDP+
Sbjct: 681 NLAPQSGCVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPS 740
Query: 734 KITFSIRKIS 743
I+F+ R+ S
Sbjct: 741 MISFTTRQTS 750
>gi|61162203|dbj|BAD91083.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 842
Score = 811 bits (2094), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/733 (53%), Positives = 505/733 (68%), Gaps = 18/733 (2%)
Query: 18 SSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESY 77
++ +YC VTYD R+L+I+G+R +++S +IHYPRS P MWP L+Q++K+GG++ IE+Y
Sbjct: 14 ATASYC--AKVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSKDGGLDVIETY 71
Query: 78 VFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIP 137
VFWN HE G+Y FGGR +LVKF+K + +A +Y+ LRIGP+V AE+NYGG P+WLH+IP
Sbjct: 72 VFWNLHEAVRGQYDFGGRKDLVKFVKTVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIP 131
Query: 138 GTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGK 197
G R D EPFK MQ+F IVDMMK+EKL+ASQGGPIIL+Q+ENEYG + YG +
Sbjct: 132 GIQLRTDNEPFKAEMQRFTAKIVDMMKKEKLYASQGGPIILSQIENEYGNIDRAYGAAAQ 191
Query: 198 RYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPS-MPKIWTENWP 256
Y WAA MAV+ + GVPW+MCQQ D P VI+TCN FYCDQ+TP P PK+WTENW
Sbjct: 192 TYIKWAADMAVSLDTGVPWVMCQQDDAPPSVISTCNGFYCDQWTPRLPEKRPKMWTENWS 251
Query: 257 GWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYE 316
GWF +FGG P RP ED+AF+VARFFQ+GG+ NYYMYHGGTNFGR+ GGPFI TSYDY+
Sbjct: 252 GWFLSFGGAVPQRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYD 311
Query: 317 APIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAF 376
APIDEYGL R PKWGHLK++H AIKLCE A++ + S G + EA VY S ACAAF
Sbjct: 312 APIDEYGLLRQPKWGHLKDVHKAIKLCEEAMVATDPKYSSFGPNVEATVYKTGS-ACAAF 370
Query: 377 LANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSE 436
LAN D K+D TV F SYHLPAWSVSILPDCK VV NTA + + + M+P + S
Sbjct: 371 LANSDTKSDATVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSAA----MIPSFMHHSV 426
Query: 437 ASPDNGSKGL--KWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENE 494
+ S+ L W E GI + F + G ++ INTT D +DYLWY+ SI V ++
Sbjct: 427 LDDIDSSEALGSGWSWINEPVGISKKDAFTRVGLLEQINTTADKSDYLWYSLSIDVTSSD 486
Query: 495 EFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLS 554
FL++GS+ +L +ES GHALHAF N + G + P++ +GKN I LLS
Sbjct: 487 TFLQDGSQTILHVESLGHALHAFINGKPAGRGIITANNGKISVDIPVTFASGKNTIDLLS 546
Query: 555 MTVGLQNAGPFYEWVGAGITS-VKITGFNSG-TLDLSTYSWTYKIGLQGEHLGIYNPGYR 612
+T+GLQN G F++ GAGIT V++ G +G T DLS+ WTY+IGLQGE G +
Sbjct: 547 LTIGLQNYGAFFDKSGAGITGPVQLKGLKNGTTTDLSSQRWTYQIGLQGEDSGFSS---G 603
Query: 613 NNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKS 672
++ W+S PK QPLTWYKA P G P+ LD MGKG AW+NG+ IGRYWP
Sbjct: 604 SSSQWISQPTLPKKQPLTWYKATFNAPDGSNPVALDFTGMGKGEAWVNGQSIGRYWP--- 660
Query: 673 RKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDP 732
++P C C++RG ++ +KC CG+PSQ YH+PRSW KPS N LV+FEE GGDP
Sbjct: 661 TNNAPTSGCPDSCNFRGPYDSNKCRKNCGKPSQELYHVPRSWLKPSGNTLVLFEEIGGDP 720
Query: 733 TKITFSIRKISGF 745
T+I+F+ R+I
Sbjct: 721 TQISFATRQIESL 733
>gi|359474925|ref|XP_002263382.2| PREDICTED: beta-galactosidase 3-like [Vitis vinifera]
gi|297744764|emb|CBI38026.3| unnamed protein product [Vitis vinifera]
Length = 846
Score = 810 bits (2093), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/737 (51%), Positives = 505/737 (68%), Gaps = 23/737 (3%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
+ +F +T C +VTYD ++LIING+R ++ S +IHYPRS P MW GL+Q+AK+G
Sbjct: 12 LCMWVFLCIQLTQC---SVTYDRKALIINGQRRILFSGSIHYPRSTPQMWEGLIQKAKDG 68
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ I++YVFWN HE SPGKY F GR++LV+FIK+IQ+A +Y+ LRIGP++ AE+N+GG
Sbjct: 69 GLDAIDTYVFWNLHEPSPGKYNFEGRYDLVRFIKLIQKAGLYVHLRIGPYICAEWNFGGF 128
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL ++PG FR D EPFK MQ+F IV MMK EKLF SQGGPII++Q+ENEYG+
Sbjct: 129 PVWLKFVPGVSFRTDNEPFKMAMQRFTQKIVQMMKNEKLFESQGGPIIISQIENEYGHES 188
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
+G G Y WAAKMAVA + GVPW+MC++ D PDPVINTCN FYCD F+P+ P+ P
Sbjct: 189 RAFGAPGYAYLTWAAKMAVAMDTGVPWVMCKEDDAPDPVINTCNGFYCDYFSPNKPNKPT 248
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTE W GWF F G RP ED++F+V RF QKGGS NYYMYHGGTNFGRTAGGPFI
Sbjct: 249 LWTEAWSGWFTEFAGPIQQRPVEDLSFAVTRFIQKGGSFVNYYMYHGGTNFGRTAGGPFI 308
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TTSYDY+APIDEYGL R PK+GHLKELH AIKLCE ALL+ + + SLG+ +A V+
Sbjct: 309 TTSYDYDAPIDEYGLIRQPKYGHLKELHKAIKLCERALLSADPAETSLGTYAKAQVFYSE 368
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
SG CAAFL+N + + V F ++ Y+L WS+SILPDCK VVFNTA V Q+S ++M+P
Sbjct: 369 SGGCAAFLSNYNPTSAARVTFNSMHYNLAPWSISILPDCKNVVFNTATVGVQTSQMQMLP 428
Query: 430 ENLQPSEASPDNGSKGLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
N S+ L W+ F E I+ ++ G ++ +N T+DT+DYLWY+T I
Sbjct: 429 TN-----------SELLSWETFNEDISSADDDSTITVVGLLEQLNVTRDTSDYLWYSTRI 477
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
++ +E FL G P L+++S GHA+H F N L GSA G F + ++L+ G N
Sbjct: 478 DISSSESFLHGGQHPTLIVQSTGHAMHVFINGHLSGSAFGTREDRRFTFTGDVNLQTGSN 537
Query: 549 EIALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
I++LS+ VGL N GP +E W + V + G + G DLS W+Y++GL+GE + +
Sbjct: 538 IISVLSIAVGLPNNGPHFETWSTGVLGPVVLHGLDEGKKDLSWQKWSYQVGLKGEAMNLV 597
Query: 608 NPGYRNNINWVS-TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGR 666
+P +NI+W+ ++ K QPLTWYKA P GDEP+ LDM MGKG W+NG+ IGR
Sbjct: 598 SPNVISNIDWMKGSLFAQKQQPLTWYKAYFDAPDGDEPLALDMGSMGKGQVWINGQSIGR 657
Query: 667 YWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFE 726
YW ++ + C Y G F KC GCG+P+QRWYH+PRSW KP++N+LV+FE
Sbjct: 658 YWTAYAKGN------CSGCSYSGTFRTTKCQFGCGQPTQRWYHVPRSWLKPTQNLLVLFE 711
Query: 727 EKGGDPTKITFSIRKIS 743
E GGD +KI+F R ++
Sbjct: 712 ELGGDASKISFMKRSVT 728
>gi|157313306|gb|ABV32546.1| beta-galactosidase protein 1 [Prunus persica]
Length = 836
Score = 810 bits (2092), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/720 (54%), Positives = 496/720 (68%), Gaps = 20/720 (2%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
+V+YD +++IING++ ++IS +IHYPRS P MWP L+Q++K+GG++ I++YVFWNGHE
Sbjct: 26 ASVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKSKDGGLDVIQTYVFWNGHEP 85
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
SPGKYYF R++LVKFIK++ QA +Y+ LRIGP+V AE+N+GG PVWL Y+PG VFR D
Sbjct: 86 SPGKYYFEDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGIVFRTDN 145
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
EPFK MQKF IV MMK E+LF SQGGPIIL+Q+ENE+G E G GK Y WAA+
Sbjct: 146 EPFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAAQ 205
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGR 265
MAV N GVPWIMC+Q D PDPVI+TCN FYC+ FTP+ PK+WTE W GW+ FGG
Sbjct: 206 MAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCENFTPNKNYKPKMWTEVWTGWYTEFGGA 265
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
P RP+ED+AFS+ARF QKGGS NYYMYHGGTNFGRTAGGPF+ TSYDY+AP+DEYGLP
Sbjct: 266 VPTRPAEDLAFSIARFIQKGGSFVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGLP 325
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKND 385
R PKWGHL++LH AIK E AL++ E S SLG+ QEA V+ SG CAAFLAN D K+
Sbjct: 326 REPKWGHLRDLHKAIKSSESALVSAEPSVTSLGNGQEAHVFKSKSG-CAAFLANYDTKSS 384
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
V F N Y LP W +SILPDCK V+NTA + +QSS ++M P
Sbjct: 385 AKVSFGNGQYELPPWPISILPDCKTAVYNTARLGSQSSQMKMTPVK------------SA 432
Query: 446 LKWQVFKEIAGIWGEADFVK-SGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
L WQ F E + E+D G + IN T+DTTDYLWY T I ++ +E F+K G P+
Sbjct: 433 LPWQSFVEESASSDESDTTTLDGLWEQINVTRDTTDYLWYMTDITISPDEGFIKRGESPL 492
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
L I S GHALH F N +L G+ G +P + + ++G N++ALLS++VGL N G
Sbjct: 493 LTIYSAGHALHVFINGQLSGTVYGALENPKLTFSQNVKPRSGINKLALLSISVGLPNVGL 552
Query: 565 FYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEP 623
+E AG+ V + G NSGT D+S + WTYKIGL+GE LG++ +++ W
Sbjct: 553 HFETWNAGVLGPVTLKGLNSGTWDMSRWKWTYKIGLKGEALGLHTVSGSSSVEWAEGPSM 612
Query: 624 PKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQ 683
+ QPLTWYKA PPG+ P+ LDM MGKG W+NG+ IGR+WP + + +
Sbjct: 613 AQKQPLTWYKATFNAPPGNGPLALDMSSMGKGQIWINGQSIGRHWPAYTARGN-----CG 667
Query: 684 ECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
C Y G ++ KC T CGEPSQRWYH+PRSW PS N+LV+FEE GGDPTKI+ R+ S
Sbjct: 668 NCYYAGTYDDKKCRTHCGEPSQRWYHVPRSWLTPSGNLLVVFEEWGGDPTKISLVERRTS 727
>gi|125543160|gb|EAY89299.1| hypothetical protein OsI_10800 [Oryza sativa Indica Group]
Length = 861
Score = 810 bits (2092), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/730 (54%), Positives = 518/730 (70%), Gaps = 20/730 (2%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A NVTYD R+++I+G R +++S +IHYPRS P MWPGL+Q++K+GG++ IE+YVFW+ HE
Sbjct: 30 AANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFWDIHE 89
Query: 85 LSPGK---YYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVF 141
G+ Y F GR +LV+F+K + A +Y+ LRIGP+V AE+NYGG PVWLH++PG F
Sbjct: 90 PVRGQAQQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKF 149
Query: 142 RNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYAL 201
R D E FK MQ+F +VD MK L+ASQGGPIIL+Q+ENEYG +S YG GK Y
Sbjct: 150 RTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMR 209
Query: 202 WAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKT 261
WAA MAV+ + GVPW+MCQQ D PDP+INTCN FYCDQFTP+S S PK+WTENW GWF +
Sbjct: 210 WAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLS 269
Query: 262 FGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDE 321
FGG P+RP+ED+AF+VARF+Q+GG+ NYYMYHGGTNFGR+ GGPFI TSYDY+APIDE
Sbjct: 270 FGGAVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDE 329
Query: 322 YGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVY--ADSSGACAAFLAN 379
YG+ R PKWGHL+++H AIKLCE AL+ E S SLG + EA VY AD+S CAAFLAN
Sbjct: 330 YGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNS-ICAAFLAN 388
Query: 380 MDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEM--VPENLQPSEA 437
+D ++DK V F +Y LPAWSVSILPDCK VV NTA + +Q +T EM + ++Q ++
Sbjct: 389 VDAQSDKAVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDTDD 448
Query: 438 S---PDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENE 494
S P+ + G W E GI E K G ++ INTT D +D+LWY+TSI+V +E
Sbjct: 449 SLITPELATAG--WSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVKGDE 506
Query: 495 EFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLS 554
+L NGS+ LL+ S GH L + N +L GSA G+ + + P++L GKN+I LLS
Sbjct: 507 PYL-NGSQSNLLVNSLGHVLQVYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLS 565
Query: 555 MTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRN 613
TVGL N G F++ +GAG+T VK++G N G L+LS+ WTY+IGL+GE L +YNP
Sbjct: 566 TTVGLSNYGAFFDLIGAGVTGPVKLSGPN-GALNLSSTDWTYQIGLRGEDLHLYNPS-EA 623
Query: 614 NINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSR 673
+ WVS P NQPL WYK P GD+P+ +D MGKG AW+NG+ IGRYWP
Sbjct: 624 SPEWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWP---T 680
Query: 674 KSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPT 733
+P CV C+YRG ++ +KC+ CG+PSQ YH+PRS+ +P N LV+FE+ GGDP+
Sbjct: 681 NLAPQSGCVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPS 740
Query: 734 KITFSIRKIS 743
I+F+ R+ S
Sbjct: 741 MISFTTRQTS 750
>gi|356540789|ref|XP_003538867.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 853
Score = 810 bits (2091), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/739 (52%), Positives = 498/739 (67%), Gaps = 25/739 (3%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
AL + F +C +VTYD ++++ING+R ++ S +IHYPRS P MW L+ +AKEG
Sbjct: 17 LALWLGFQLEQVHC---SVTYDRKAILINGQRRILFSGSIHYPRSTPDMWEDLIYKAKEG 73
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ IE+Y+FWN HE S G Y F GR++LV+F+K IQ+A +Y LRIGP+V AE+N+GG
Sbjct: 74 GLDVIETYIFWNVHEPSRGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGF 133
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL Y+PG FR D EPFK MQ F IV MMK E+L+ SQGGPIIL+Q+ENEYG
Sbjct: 134 PVWLKYVPGISFRTDNEPFKKAMQGFTEKIVGMMKSERLYESQGGPIILSQIENEYGAQS 193
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G G+ Y WAAKMAV GVPW+MC++ D PDPVINTCN FYCD FTP+ P P
Sbjct: 194 KLLGPAGQNYVNWAAKMAVETGTGVPWVMCKEDDAPDPVINTCNGFYCDYFTPNKPYKPS 253
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
IWTE W GWF FGG + RP +D+AF VARF QKGGS NYYMYHGGTNFGRTAGGPFI
Sbjct: 254 IWTEAWSGWFSEFGGPNHERPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFI 313
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TTSYDY+AP+DEYGL R PK+GHLKELH AIK+CE AL++ + + S+G+ Q+A VY
Sbjct: 314 TTSYDYDAPLDEYGLIRQPKYGHLKELHKAIKMCERALVSADPAVTSMGNFQQAHVYTTK 373
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
SG CAAFL+N D K+ V+F N+ Y+LP WS+SILPDC+ VVFNTA V Q+S ++M+P
Sbjct: 374 SGDCAAFLSNFDTKSSVRVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSQMQMLP 433
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFV---KSGFVDHINTTKDTTDYLWYTT 486
N + W+ F E + + SG ++ IN T+DT+DYLWY T
Sbjct: 434 TN-----------THMFSWESFDEDISSLDDGSAITITTSGLLEQINVTRDTSDYLWYIT 482
Query: 487 SIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAG 546
S+ + +E FL+ G P L+++S GHA+H F N +L GSA G F+Y ++L+AG
Sbjct: 483 SVDIGSSESFLRGGKLPTLIVQSTGHAVHVFINGQLSGSAYGTREDRRFRYTGTVNLRAG 542
Query: 547 KNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLG 605
N IALLS+ VGL N G +E GI V + G N G LDLS WTY++GL+GE +
Sbjct: 543 TNRIALLSVAVGLPNVGGHFETWNTGILGPVVLRGLNQGKLDLSWQKWTYQVGLKGEAMN 602
Query: 606 IYNPGYRNNINWV-STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEI 664
+ +P +++ W+ S + KNQPLTW+K P GDEP+ LDM MGKG W+NG I
Sbjct: 603 LASPNGISSVEWMQSALVSEKNQPLTWHKTYFDAPDGDEPLALDMEGMGKGQIWINGLSI 662
Query: 665 GRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVI 724
GRYW ++P C Y G F P KC GCG+P+QRWYH+PRSW KP+ N+LV+
Sbjct: 663 GRYW------TAPAAGICNGCSYAGTFRPPKCQVGCGQPTQRWYHVPRSWLKPNHNLLVV 716
Query: 725 FEEKGGDPTKITFSIRKIS 743
FEE GGDP+KI+ R +S
Sbjct: 717 FEELGGDPSKISLVKRSVS 735
>gi|165906266|gb|ABY71826.1| beta-galactosidase [Prunus salicina]
Length = 836
Score = 809 bits (2090), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/720 (54%), Positives = 497/720 (69%), Gaps = 20/720 (2%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
+V+YD +++IING++ ++IS +IHYPRS P MWP L+Q++K+GG++ I++YVFWNGHE
Sbjct: 26 ASVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKSKDGGLDVIQTYVFWNGHEP 85
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
SPGKYYF R++LVKFIK++ QA +Y+ LRIGP+V AE+N+GG PVWL Y+PG VFR D
Sbjct: 86 SPGKYYFEDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGIVFRTDN 145
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
EPFK MQKF IV MMK E+LF SQGGPIIL+Q+ENE+G E G GK Y WAA+
Sbjct: 146 EPFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAAQ 205
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGR 265
MAV N GVPWIMC+Q D PDPVI+TCN FYC+ FTP+ PK+WTE W GW+ FGG
Sbjct: 206 MAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCENFTPNKNYKPKMWTEVWTGWYTEFGGA 265
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
P RP+ED+AFS+ARF QKGGS NYYMYHGGTNFGRTAGGPF+ TSYDY+AP+DEYGLP
Sbjct: 266 VPTRPAEDLAFSIARFIQKGGSFVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGLP 325
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKND 385
R PKWGHL++LH AIK E AL++ E S SLG+SQEA V+ SG CAAFLAN D K+
Sbjct: 326 REPKWGHLRDLHKAIKSSESALVSAEPSVTSLGNSQEAHVFKSKSG-CAAFLANYDTKSS 384
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
V F N Y LP WS+SILPDC+ V+NTA + +QSS ++M P
Sbjct: 385 AKVSFGNGQYELPPWSISILPDCRTAVYNTARLGSQSSQMKMTPVK------------SA 432
Query: 446 LKWQVFKEIAGIWGEADFVK-SGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
L WQ F E + E+D G + IN T+DTTDY WY T I ++ +E F+K G P+
Sbjct: 433 LPWQSFIEESASSDESDTTTLDGLWEQINVTRDTTDYSWYMTDITISPDEGFIKRGESPL 492
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
L I S GHALH F N +L G+ G +P + + L++G N++ALLS++VGL N G
Sbjct: 493 LTIYSAGHALHVFINGQLSGTVYGALENPKLTFSQNVKLRSGINKLALLSISVGLPNVGL 552
Query: 565 FYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEP 623
+E AG+ V + G NSGT D+S + WTYK+GL+GE LG++ +++ W
Sbjct: 553 HFETWNAGVLGPVTLKGLNSGTWDMSRWKWTYKVGLKGEALGLHTVSGSSSVEWAEGPSM 612
Query: 624 PKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQ 683
+ QPLTWY+A PPG+ P+ LDM MGKG W+NG+ IGR+WP + + +
Sbjct: 613 AQKQPLTWYRATFNAPPGNGPLALDMSSMGKGQIWINGQSIGRHWPAYTARGN-----CG 667
Query: 684 ECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
C Y G ++ KC T CGEPSQRWYH+PRSW S N+LV+FEE GGDPTKI+ R+ S
Sbjct: 668 NCYYAGTYDDKKCRTHCGEPSQRWYHVPRSWLTTSGNLLVVFEEWGGDPTKISLVERRTS 727
>gi|356496697|ref|XP_003517202.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 849
Score = 808 bits (2088), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/739 (52%), Positives = 498/739 (67%), Gaps = 25/739 (3%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
AL + F +C +VTYD ++++ING+R ++ S +IHYPRS P MW L+ +AKEG
Sbjct: 17 LALWLGFQLEQVHC---SVTYDRKAILINGQRRILFSGSIHYPRSTPDMWEDLIYKAKEG 73
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ IE+YVFWN HE S G Y F GR++LV+F+K IQ+A +Y LRIGP+V AE+N+GG
Sbjct: 74 GLDVIETYVFWNVHEPSRGNYNFEGRYDLVRFVKTIQKAGLYANLRIGPYVCAEWNFGGF 133
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL Y+PG FR D EPFK MQ F IV MMK E+L+ SQGGPIIL+Q+ENEYG
Sbjct: 134 PVWLKYVPGISFRTDNEPFKKAMQGFTEKIVGMMKSERLYESQGGPIILSQIENEYGAQS 193
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G G+ Y WAAKMAV GVPW+MC++ D PDPVINTCN FYCD FTP+ P P
Sbjct: 194 KLLGSAGQNYVNWAAKMAVETGTGVPWVMCKEDDAPDPVINTCNGFYCDYFTPNKPYKPS 253
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
IWTE W GWF FGG + RP +D+AF VARF QKGGS NYYMYHGGTNFGRTAGGPFI
Sbjct: 254 IWTEAWSGWFSEFGGPNHERPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFI 313
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TTSYDY+AP+DEYGL R PK+GHLKELH AIK+CE AL++ + + SLG+ Q+A VY+
Sbjct: 314 TTSYDYDAPLDEYGLIRQPKYGHLKELHKAIKMCERALVSTDPAVTSLGNFQQAHVYSAK 373
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
SG CAAFL+N D K+ V+F N+ Y+LP WS+SILPDC+ VVFNTA V Q+S ++M+P
Sbjct: 374 SGDCAAFLSNFDTKSSVRVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSQMQMLP 433
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFV---KSGFVDHINTTKDTTDYLWYTT 486
N ++ W+ F E + + SG ++ IN T+DT+DYLWY T
Sbjct: 434 TN-----------TRMFSWESFDEDISSLDDGSSITTTTSGLLEQINVTRDTSDYLWYIT 482
Query: 487 SIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAG 546
S+ + +E FL+ G P L+++S GHA+H F N +L GSA G F Y ++L+AG
Sbjct: 483 SVDIGSSESFLRGGKLPTLIVQSTGHAVHVFINGQLSGSAYGTREDRRFTYTGTVNLRAG 542
Query: 547 KNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLG 605
N IALLS+ VGL N G +E GI V + GF+ G LDLS WTY++GL+GE +
Sbjct: 543 TNRIALLSVAVGLPNVGGHFETWNTGILGPVVLRGFDQGKLDLSWQKWTYQVGLKGEAMN 602
Query: 606 IYNPGYRNNINWV-STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEI 664
+ +P +++ W+ S + KNQPLTW+K P GDEP+ LDM MGKG W+NG I
Sbjct: 603 LASPNGISSVEWMQSALVSDKNQPLTWHKTYFDAPDGDEPLALDMEGMGKGQIWINGLSI 662
Query: 665 GRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVI 724
GRYW + + C Y G F P KC GCG+P+QRWYH+PRSW KP N+LV+
Sbjct: 663 GRYWTALAAGN------CNGCSYAGTFRPPKCQVGCGQPTQRWYHVPRSWLKPDHNLLVV 716
Query: 725 FEEKGGDPTKITFSIRKIS 743
FEE GGDP+KI+ R +S
Sbjct: 717 FEELGGDPSKISLVKRSVS 735
>gi|356539454|ref|XP_003538213.1| PREDICTED: beta-galactosidase 8-like [Glycine max]
Length = 838
Score = 808 bits (2088), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/740 (53%), Positives = 503/740 (67%), Gaps = 17/740 (2%)
Query: 5 TPIAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQ 64
T I LL FF F NVTYD R+L+I+G+R +++S +IHYPRS P MWP L+Q
Sbjct: 4 TQILFVGLLWFFCVYAPSSFCANVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQ 63
Query: 65 QAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEY 124
++K+GG++ IE+YVFWN HE G+Y F GR +LVKF+K + A +Y+ LRIGP+ AE+
Sbjct: 64 KSKDGGLDVIETYVFWNLHEPVQGQYNFEGRADLVKFVKAVAAAGLYVHLRIGPYACAEW 123
Query: 125 NYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENE 184
NYGG P+WLH+IPG FR D +PF+ M++F IVDMMK+E L+ASQGGPIIL+QVENE
Sbjct: 124 NYGGFPLWLHFIPGIQFRTDNKPFEAEMKRFTVKIVDMMKQESLYASQGGPIILSQVENE 183
Query: 185 YGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHS 244
YG ++ YG K Y WAA MA + + GVPW+MCQQ D PDP+INTCN FYCDQFTP+S
Sbjct: 184 YGNIDAAYGPAAKSYIKWAASMATSLDTGVPWVMCQQADAPDPIINTCNGFYCDQFTPNS 243
Query: 245 PSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTA 304
+ PK+WTENW GWF +FGG P+RP ED+AF+VARF+Q+GG+ NYYMYHGGTNFGRT
Sbjct: 244 NAKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRTT 303
Query: 305 GGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEAD 364
GGPFI+TSYDY+APID+YG+ R PKWGHLK++H AIKLCE AL+ + + S G + EA
Sbjct: 304 GGPFISTSYDYDAPIDQYGIIRQPKWGHLKDVHKAIKLCEEALIATDPTITSPGPNIEAA 363
Query: 365 VYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSST 424
VY S CAAFLAN+ +D TV F SYHLPAWSVSILPDCK VV NTA + + S
Sbjct: 364 VYKTGS-ICAAFLANI-ATSDATVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSASMI 421
Query: 425 VEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWY 484
E+ + S D+ G W E GI F K G ++ INTT D +DYLWY
Sbjct: 422 SSFTTESFKEEVGSLDDSGSGWSW--ISEPIGISKSDSFSKFGLLEQINTTADKSDYLWY 479
Query: 485 TTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLK 544
+ SI V + +GS+ VL IES GHALHAF N ++ GS +GN P++L
Sbjct: 480 SISIDVEGD-----SGSQTVLHIESLGHALHAFINGKIAGSGTGNSGKAKVNVDIPVTLV 534
Query: 545 AGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSG-TLDLSTYSWTYKIGLQGE 602
AGKN I LLS+TVGLQN G F++ GAGIT V + G +G T+DLS+ WTY++GL+ E
Sbjct: 535 AGKNSIDLLSLTVGLQNYGAFFDTWGAGITGPVILKGLKNGSTVDLSSQQWTYQVGLKYE 594
Query: 603 HLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGE 662
LG P ++ W S P NQ L WYK P G P+ +D MGKG AW+NG+
Sbjct: 595 DLG---PSNGSSGQWNSQSTLPTNQSLIWYKTNFVAPSGSNPVAIDFTGMGKGEAWVNGQ 651
Query: 663 EIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENIL 722
IGRYWP SP+ C C+YRG ++ KC+ CG+PSQ YHIPRSW +P N L
Sbjct: 652 SIGRYWP---TYVSPNGGCTDSCNYRGAYSSSKCLKNCGKPSQTLYHIPRSWLQPDSNTL 708
Query: 723 VIFEEKGGDPTKITFSIRKI 742
V+FEE GGDPT+I+F+ ++I
Sbjct: 709 VLFEESGGDPTQISFATKQI 728
>gi|14970839|emb|CAC44500.1| beta-galactosidase [Fragaria x ananassa]
Length = 843
Score = 808 bits (2086), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/724 (53%), Positives = 501/724 (69%), Gaps = 19/724 (2%)
Query: 23 CFA---GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVF 79
CFA +V+YDS++++ING+R ++IS +IHYPRS P MWP L+Q+AK+GG++ I++YVF
Sbjct: 22 CFASVRASVSYDSKAIVINGQRRILISGSIHYPRSTPEMWPDLIQRAKDGGLDVIQTYVF 81
Query: 80 WNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGT 139
WNGHE SPGKYYF ++LVKFIK++QQA +Y+ LRIGP+V AE+N+GG PVWL Y+PG
Sbjct: 82 WNGHEPSPGKYYFEDNYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGI 141
Query: 140 VFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRY 199
FR D PFK MQ+F T IV+MMK E+LF S GGPIIL+Q+ENEYG E G GK Y
Sbjct: 142 QFRTDNGPFKDQMQRFTTKIVNMMKAERLFESHGGPIILSQIENEYGPMEYEIGAPGKAY 201
Query: 200 ALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWF 259
WAA+MAV GVPW+MC+Q D PDPVIN CN FYCD F+P+ PK+WTE W GWF
Sbjct: 202 TDWAAQMAVGLGTGVPWVMCKQDDAPDPVINACNGFYCDYFSPNKAYKPKMWTEAWTGWF 261
Query: 260 KTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPI 319
FGG P+RP+ED+AFSVA+F QKGG+ NYYMYHGGTNFGRTAGGPFI TSYDY+AP+
Sbjct: 262 TEFGGAVPYRPAEDLAFSVAKFLQKGGAFINYYMYHGGTNFGRTAGGPFIATSYDYDAPL 321
Query: 320 DEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLAN 379
DEYGL R PKWGHLK+LH AIKLCE AL++ + + LG+ QEA V+ +SGACAAFLAN
Sbjct: 322 DEYGLLRQPKWGHLKDLHRAIKLCEPALVSSDPTVTPLGTYQEAHVFKSNSGACAAFLAN 381
Query: 380 MDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASP 439
+ K+ V F N+ Y+LP WS+SILPDCK V+NTA + AQ++ ++M P
Sbjct: 382 YNRKSFAKVAFGNMHYNLPPWSISILPDCKNTVYNTARIGAQTARMKM--------PRVP 433
Query: 440 DNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKN 499
+G G WQ + + + + F +G ++ IN T+D TDYLWY T + ++ +E+FL++
Sbjct: 434 IHG--GFSWQAYNDETATYSDTSFTTAGLLEQINITRDATDYLWYMTDVKIDPSEDFLRS 491
Query: 500 GSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGL 559
G+ PVL + S GHAL F N +L G+A G+ P +K ++L+AG N+IALLS+ VGL
Sbjct: 492 GNYPVLTVLSAGHALRVFINGQLAGTAYGSLETPKLTFKQGVNLRAGINQIALLSIAVGL 551
Query: 560 QNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV 618
N GP +E AGI V + G N G DLS W+YKIGL+GE L +++ +++ W
Sbjct: 552 PNVGPHFETWNAGILGPVILNGLNEGRRDLSWQKWSYKIGLKGEALSLHSLTGSSSVEWT 611
Query: 619 STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPH 678
+ QPLTWYK +P G+ P+ LDM MGKG W+N IGRYWP +
Sbjct: 612 EGSFVAQRQPLTWYKTTFNRPAGNSPLALDMGSMGKGQVWINDRSIGRYWPAYKASGT-- 669
Query: 679 DECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFS 738
EC+Y G F+ KC++ CGE SQRWYH+PRSW P+ N+LV+ EE GGDP I
Sbjct: 670 ---CGECNYAGTFSEKKCLSNCGEASQRWYHVPRSWLNPTGNLLVVLEEWGGDPNGIFLV 726
Query: 739 IRKI 742
R++
Sbjct: 727 RREV 730
>gi|224077880|ref|XP_002305449.1| predicted protein [Populus trichocarpa]
gi|222848413|gb|EEE85960.1| predicted protein [Populus trichocarpa]
Length = 731
Score = 807 bits (2085), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/737 (51%), Positives = 503/737 (68%), Gaps = 23/737 (3%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
++++ S + C NVTYD ++LIING+R+++ S +IHYPRS P MW GL+Q+AK+G
Sbjct: 13 LSVVLLTSLQLIQC---NVTYDKKALIINGQRKVLFSGSIHYPRSTPEMWEGLIQKAKDG 69
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ I++YVFWN HE SPG Y F GR++LV+FIK++ +A +Y+ LRIGP++ AE+N+GG
Sbjct: 70 GLDVIDTYVFWNLHEPSPGNYNFDGRYDLVRFIKLVHEAGLYVHLRIGPYICAEWNFGGF 129
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL Y+PG FR D EPFK MQKF IV MMK E LF SQGGPIIL+Q+ENEY
Sbjct: 130 PVWLKYVPGISFRTDNEPFKSAMQKFTQKIVQMMKDENLFESQGGPIILSQIENEYEPES 189
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
+G G Y WAA MA++ + GVPW+MC++FD PDPVINTCN FYCD F+P+ P P
Sbjct: 190 KAFGSPGHAYMTWAAHMAISMDTGVPWVMCKEFDAPDPVINTCNGFYCDYFSPNKPYKPT 249
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTE W GWF FGG + RP+ED+AF+VARF QKGGS+ NYYMYHGGTNFGRT+GGPFI
Sbjct: 250 MWTEAWTGWFTDFGGPNHQRPAEDLAFAVARFIQKGGSLVNYYMYHGGTNFGRTSGGPFI 309
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TTSYDY+APIDEYGL R PK+GHLKELH AIKLCE ALL + + SLGS ++A V++
Sbjct: 310 TTSYDYDAPIDEYGLIRQPKYGHLKELHKAIKLCEKALLAADSTVTSLGSYEQAHVFSSD 369
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
SG CAAFL+N + K V F N+ Y LP WS+SILPDCK VVFNTA+V Q+S V M+P
Sbjct: 370 SGGCAAFLSNYNTKQAARVKFNNIQYSLPPWSISILPDCKNVVFNTAHVGVQTSQVHMLP 429
Query: 430 ENLQPSEASPDNGSKGLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
+ S+ L W+ F E I+ + + +G ++ +N T+DT+DYLWYTTS+
Sbjct: 430 TD-----------SELLSWETFNEDISSVDDDKMITVAGLLEQLNITRDTSDYLWYTTSV 478
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
++ +E FL+ G PVL ++S GHALH F N EL GSA G F + + AGKN
Sbjct: 479 HISSSESFLRGGRLPVLTVQSAGHALHVFINGELSGSAHGTREQRRFTFTEDMKFHAGKN 538
Query: 549 EIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
I+LLS+ VGL N GP +E GI V + G + G DL+ W+YK+GL+GE + +
Sbjct: 539 RISLLSVAVGLPNNGPRFETWNTGILGPVTLHGLDEGQRDLTWQKWSYKVGLKGEDMNLR 598
Query: 608 NPGYRNNINWVS-TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGR 666
+ + ++W+ ++ K QPLTWYKA P GD+P+ LDM MGKG W+NG IGR
Sbjct: 599 SRKSVSLVDWIQGSLMVGKQQPLTWYKAYFNSPKGDDPLALDMGSMGKGQVWINGHSIGR 658
Query: 667 YWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFE 726
YW + + C Y F P +C GCG+P+Q+WYH+PRSW K + N+LV+FE
Sbjct: 659 YWTLYAEGN------CSGCSYSATFRPARCQLGCGQPTQKWYHVPRSWLKSTRNLLVLFE 712
Query: 727 EKGGDPTKITFSIRKIS 743
E GGD ++I+ R ++
Sbjct: 713 EIGGDASRISLVKRLVT 729
>gi|356543464|ref|XP_003540180.1| PREDICTED: beta-galactosidase 8-like isoform 1 [Glycine max]
Length = 840
Score = 807 bits (2084), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/720 (55%), Positives = 497/720 (69%), Gaps = 16/720 (2%)
Query: 24 FAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGH 83
F NV YD R+L+I+G+R ++IS +IHYPRS P MWP L+Q++K+GG++ IE+YVFWN H
Sbjct: 22 FCANVEYDHRALVIDGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLH 81
Query: 84 ELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRN 143
E G+Y F GR +LVKF+K + A +Y+ LRIGP+V AE+NYGG PVWLH+IPG FR
Sbjct: 82 EPVRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAEWNYGGFPVWLHFIPGIKFRT 141
Query: 144 DTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
D EPFK M++F IVDM+K+EKL+ASQGGP+IL+Q+ENEYG ++ YG GK Y WA
Sbjct: 142 DNEPFKAEMKRFTAKIVDMIKQEKLYASQGGPVILSQIENEYGNIDTAYGAAGKSYIKWA 201
Query: 204 AKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFG 263
A MA + + GVPW+MC Q D PDP+INT N FY D+FTP+S + PK+WTENW GWF FG
Sbjct: 202 ATMATSLDTGVPWVMCLQADAPDPIINTWNGFYGDEFTPNSNTKPKMWTENWSGWFLVFG 261
Query: 264 GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYG 323
G P+RP ED+AF+VARFFQ+GG+ NYYMYHGGTNF R +GGPFI TSYDY+APIDEYG
Sbjct: 262 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRASGGPFIATSYDYDAPIDEYG 321
Query: 324 LPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDK 383
+ R PKWGHLKE+H AIKLCE AL+ + + SLG + EA VY S CAAFLAN+ K
Sbjct: 322 IIRQPKWGHLKEVHKAIKLCEEALIATDPTITSLGPNLEAAVYKTGS-VCAAFLANVGTK 380
Query: 384 NDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGS 443
+D TV F SYHLPAWSVSILPDCK VV NTA + + S+ E+ + S + S
Sbjct: 381 SDVTVNFSGNSYHLPAWSVSILPDCKSVVLNTAKINSASAISSFTTESSKEDIGSSEASS 440
Query: 444 KGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
G W E GI F ++G ++ INTT D +DYLWY+ SI + S+
Sbjct: 441 TGWSW--ISEPVGISKTDSFSQTGLLEQINTTADKSDYLWYSLSIDYKADAS-----SQT 493
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
VL IES GHALHAF N +L GS GN F P++L AGKN I LLS+TVGLQN G
Sbjct: 494 VLHIESLGHALHAFINGKLAGSQPGNSGKYKFTVDIPVTLVAGKNTIDLLSLTVGLQNYG 553
Query: 564 PFYEWVGAGITS-VKITGFNSG-TLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTM 621
F++ G GIT V + GF +G TLDLS+ WTY++GLQGE LG+ + G N ST
Sbjct: 554 AFFDTWGVGITGPVILKGFANGNTLDLSSQKWTYQVGLQGEDLGL-SSGSSGQWNLQSTF 612
Query: 622 EPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDEC 681
PKNQPLTWYK P G +P+ +D MGKG AW+NG+ IGRYWP + C
Sbjct: 613 --PKNQPLTWYKTTFSAPSGSDPVAIDFTGMGKGEAWVNGQRIGRYWPTYVASDA---SC 667
Query: 682 VQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRK 741
C+YRG ++ KC C +PSQ YH+PRSW KPS NILV+FEE+GGDPT+I+F ++
Sbjct: 668 TDSCNYRGPYSASKCRKNCEKPSQTLYHVPRSWLKPSGNILVLFEERGGDPTQISFVTKQ 727
>gi|357454655|ref|XP_003597608.1| Beta-galactosidase [Medicago truncatula]
gi|124360385|gb|ABN08398.1| D-galactoside/L-rhamnose binding SUEL lectin; Galactose-binding
like [Medicago truncatula]
gi|355486656|gb|AES67859.1| Beta-galactosidase [Medicago truncatula]
Length = 841
Score = 807 bits (2084), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/718 (54%), Positives = 494/718 (68%), Gaps = 16/718 (2%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
+V+YDS+++ ING+ ++IS +IHYPRS P MWP L+Q+AKEGG++ I++YVFWNGHE
Sbjct: 26 ASVSYDSKAITINGQSRILISGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEP 85
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
SPGKYYF G ++LVKFIK++QQA +Y+ LRIGP+V AE+N+GG PVWL YIPG FR D
Sbjct: 86 SPGKYYFEGNYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGISFRTDN 145
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
EPFK+ MQKF IVDMMK ++LF SQGGPII++Q+ENEYG E G GK Y WAA
Sbjct: 146 EPFKFQMQKFTEKIVDMMKADRLFESQGGPIIMSQIENEYGPMEYEIGAPGKSYTKWAAD 205
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGR 265
MAV GVPWIMC+Q D PDPVINTCN FYCD F+P+ PK+WTE W GWF FGG
Sbjct: 206 MAVGLGTGVPWIMCKQDDAPDPVINTCNGFYCDYFSPNKDYKPKMWTEAWTGWFTEFGGP 265
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
PHRP+ED+AFSVARF QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+AP+DEYGL
Sbjct: 266 VPHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLL 325
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKND 385
+ PKWGHLK+LH AIKL E AL++G+ + +G+ QEA V+ SGACAAFL N + K
Sbjct: 326 QQPKWGHLKDLHRAIKLSEPALISGDPTVTRIGNYQEAHVFKSKSGACAAFLGNYNPKAF 385
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
TV F N+ Y+LP WS+SILPDCK V+NTA V +QS+ ++M P +G G
Sbjct: 386 ATVAFGNMHYNLPPWSISILPDCKNTVYNTARVGSQSAQMKMT--------RVPIHG--G 435
Query: 446 LKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVL 505
L WQVF E ++ F +G ++ +NTT+D TDYLWY+T ++++ NE FL++G PVL
Sbjct: 436 LSWQVFTEQTASTDDSSFTMTGLLEQLNTTRDLTDYLWYSTDVVIDPNEGFLRSGKDPVL 495
Query: 506 LIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPF 565
+ S GHALH F N +L G+ G+ P + + L G N+I+LLS+ VGL N GP
Sbjct: 496 TVLSAGHALHVFINSQLSGTIYGSLEFPKLTFSQNVKLIPGVNKISLLSVAVGLPNVGPH 555
Query: 566 YEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPP 624
+E AG+ + + G + G DLS W+YK+GL GE L +++ G +++ WV
Sbjct: 556 FETWNAGVLGPITLNGLDEGRRDLSWQKWSYKVGLHGEALSLHSLGGSSSVEWVQGSLVS 615
Query: 625 KNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
+ QPLTWYK P G P LDM MGKG WLNG+ +GRYWP +
Sbjct: 616 RMQPLTWYKTTFDAPDGIAPFALDMGSMGKGQVWLNGQNLGRYWPAYKASGT-----CDN 670
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
CDY G +N +KC + CGE SQRWYH+P SW P+ N+LV+FEE GGDP I R I
Sbjct: 671 CDYAGTYNENKCRSNCGEASQRWYHVPHSWLIPTGNLLVVFEELGGDPNGIFLVRRDI 728
>gi|308550948|gb|ADO34788.1| beta-galactosidase STBG3 [Solanum lycopersicum]
Length = 838
Score = 807 bits (2084), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/726 (52%), Positives = 509/726 (70%), Gaps = 20/726 (2%)
Query: 21 TYCFAG--NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYV 78
++ F+G +V+YD R++I+NG+R ++IS ++HYPRS P MWPG++Q+AKEGGV+ I++YV
Sbjct: 18 SWVFSGTASVSYDHRAIIVNGQRRILISGSVHYPRSTPEMWPGIIQKAKEGGVDVIQTYV 77
Query: 79 FWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPG 138
FWNGHE GKYYF GR++LVKFIK++ QA +Y+ LR+GP+ AE+N+GG PVWL Y+PG
Sbjct: 78 FWNGHEPQQGKYYFEGRYDLVKFIKLVHQAGLYVHLRVGPYACAEWNFGGFPVWLKYVPG 137
Query: 139 TVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKR 198
FR D PFK MQKF IV+MMK E+L+ +QGGPIIL+Q+ENEYG E G GK
Sbjct: 138 ISFRTDNGPFKAAMQKFTAKIVNMMKAERLYETQGGPIILSQIENEYGPMEWELGAPGKS 197
Query: 199 YALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGW 258
YA WAAKMAV + GVPW+MC+Q D PDP+IN CN FYCD F+P+ PKIWTE W W
Sbjct: 198 YAQWAAKMAVGLDTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKIWTEAWTAW 257
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAP 318
F FG P+RP+ED+AFSVA+F QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+AP
Sbjct: 258 FTGFGNPVPYRPAEDLAFSVAKFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAP 317
Query: 319 IDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLA 378
+DEYGL R PKWGHLK+LH AIKLCE AL++G+ + +LG QEA V+ +G+CAAFLA
Sbjct: 318 LDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGDPAVTALGHQQEAHVFRSKAGSCAAFLA 377
Query: 379 NMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEAS 438
N D + TV F N Y+LP WS+SILPDCK VFNTA + AQS+ ++M P
Sbjct: 378 NYDQHSFATVSFANRHYNLPPWSISILPDCKNTVFNTARIGAQSAQMKMTPV-------- 429
Query: 439 PDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLK 498
S+GL WQ F E + ++ F G ++ INTT+D +DYLWY+T + ++ E+FL+
Sbjct: 430 ----SRGLPWQSFNEETSSYEDSSFTVVGLLEQINTTRDVSDYLWYSTDVKIDSREKFLR 485
Query: 499 NGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVG 558
G P L I S GHALH F N +L G+A G+ P + ++L+AG N+I+LLS+ VG
Sbjct: 486 GGKWPWLTIMSAGHALHVFVNGQLAGTAYGSLEKPKLTFSKAVNLRAGVNKISLLSIAVG 545
Query: 559 LQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINW 617
L N GP +E AG+ V +TG + G DL+ W+YK+GL+GE L +++ +++ W
Sbjct: 546 LPNIGPHFETWNAGVLGPVSLTGLDEGKRDLTWQKWSYKVGLKGEALSLHSLSGSSSVEW 605
Query: 618 VSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSP 677
V + QPLTWYK+ P G++P+ LD+ MGKG W+NG+ +GRYWP K+S
Sbjct: 606 VEGSLVAQRQPLTWYKSTFNAPAGNDPLALDLNTMGKGQVWINGQSLGRYWP--GYKASG 663
Query: 678 HDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITF 737
+ C+Y G FN KC++ CGE SQRWYH+PRSW P+ N+LV+FEE GG+P I+
Sbjct: 664 N---CGACNYAGWFNEKKCLSNCGEASQRWYHVPRSWLYPTGNLLVLFEEWGGEPHGISL 720
Query: 738 SIRKIS 743
R+++
Sbjct: 721 VKREVA 726
>gi|350537661|ref|NP_001234303.1| beta-galactosidase precursor [Solanum lycopersicum]
gi|7939619|gb|AAF70822.1|AF154421_1 beta-galactosidase [Solanum lycopersicum]
gi|4138137|emb|CAA10173.1| ss-galactosidase [Solanum lycopersicum]
Length = 838
Score = 806 bits (2083), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/726 (52%), Positives = 509/726 (70%), Gaps = 20/726 (2%)
Query: 21 TYCFAG--NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYV 78
++ F+G +V+YD R++I+NG+R ++IS ++HYPRS P MWPG++Q+AKEGGV+ I++YV
Sbjct: 18 SWVFSGTASVSYDHRAIIVNGQRRILISGSVHYPRSTPEMWPGIIQKAKEGGVDVIQTYV 77
Query: 79 FWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPG 138
FWNGHE GKYYF GR++LVKFIK++ QA +Y+ LR+GP+ AE+N+GG PVWL Y+PG
Sbjct: 78 FWNGHEPQQGKYYFEGRYDLVKFIKLVHQAGLYVHLRVGPYACAEWNFGGFPVWLKYVPG 137
Query: 139 TVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKR 198
FR D PFK MQKF IV+MMK E+L+ +QGGPIIL+Q+ENEYG E G GK
Sbjct: 138 ISFRTDNGPFKAAMQKFTAKIVNMMKAERLYETQGGPIILSQIENEYGPMEWELGAPGKS 197
Query: 199 YALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGW 258
YA WAAKMAV + GVPW+MC+Q D PDP+IN CN FYCD F+P+ PKIWTE W W
Sbjct: 198 YAQWAAKMAVGLDTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKIWTEAWTAW 257
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAP 318
F FG P+RP+ED+AFSVA+F QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+AP
Sbjct: 258 FTGFGNPVPYRPAEDLAFSVAKFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAP 317
Query: 319 IDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLA 378
+DEYGL R PKWGHLK+LH AIKLCE AL++G+ + +LG QEA V+ +G+CAAFLA
Sbjct: 318 LDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGDPAVTALGHQQEAHVFRSKAGSCAAFLA 377
Query: 379 NMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEAS 438
N D + TV F N Y+LP WS+SILPDCK VFNTA + AQS+ ++M P
Sbjct: 378 NYDQHSFATVSFANRHYNLPPWSISILPDCKNTVFNTARIGAQSAQMKMTPV-------- 429
Query: 439 PDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLK 498
S+GL WQ F E + ++ F G ++ INTT+D +DYLWY+T + ++ E+FL+
Sbjct: 430 ----SRGLPWQSFNEETSSYEDSSFTVVGLLEQINTTRDVSDYLWYSTDVKIDSREKFLR 485
Query: 499 NGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVG 558
G P L I S GHALH F N +L G+A G+ P + ++L+AG N+I+LLS+ VG
Sbjct: 486 GGKWPWLTIMSAGHALHVFVNGQLAGTAYGSLEKPKLTFSKAVNLRAGVNKISLLSIAVG 545
Query: 559 LQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINW 617
L N GP +E AG+ V +TG + G DL+ W+YK+GL+GE L +++ +++ W
Sbjct: 546 LPNIGPHFETWNAGVLGPVSLTGLDEGKRDLTWQKWSYKVGLKGEALSLHSLSGSSSVEW 605
Query: 618 VSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSP 677
V + QPLTWYK+ P G++P+ LD+ MGKG W+NG+ +GRYWP K+S
Sbjct: 606 VEGSLVAQRQPLTWYKSTFNAPAGNDPLALDLNTMGKGQVWINGQSLGRYWP--GYKASG 663
Query: 678 HDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITF 737
+ C+Y G FN KC++ CGE SQRWYH+PRSW P+ N+LV+FEE GG+P I+
Sbjct: 664 N---CGACNYAGWFNEKKCLSNCGEASQRWYHVPRSWLYPTGNLLVLFEEWGGEPHGISL 720
Query: 738 SIRKIS 743
R+++
Sbjct: 721 VKREVA 726
>gi|242036283|ref|XP_002465536.1| hypothetical protein SORBIDRAFT_01g040750 [Sorghum bicolor]
gi|241919390|gb|EER92534.1| hypothetical protein SORBIDRAFT_01g040750 [Sorghum bicolor]
Length = 860
Score = 806 bits (2082), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/723 (54%), Positives = 509/723 (70%), Gaps = 15/723 (2%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A NVTYD R+L+I+G R +++S +IHYPRS P MWPG++Q+AK+GG++ IE+YVFW+ HE
Sbjct: 34 ATNVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGIIQKAKDGGLDVIETYVFWDIHE 93
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
G+Y F GR +L F+K + A +Y+ LRIGP+V AE+NYGG P+WLH+IPG FR D
Sbjct: 94 PVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTD 153
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK MQ+F +VD MK L+ASQGGPIIL+Q+ENEYG +S YG GK Y WAA
Sbjct: 154 NEPFKTEMQRFTAKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWAA 213
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MA++ + GVPW+MCQQ D PDP+INTCN FYCDQFTP+S + PK+WTENW GWF +FGG
Sbjct: 214 GMAISLDTGVPWVMCQQTDAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGWFLSFGG 273
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
P+RP ED+AF+VARF+Q+GG+ NYYMYHGGTN R++GGPFI TSYDY+APIDEYGL
Sbjct: 274 AVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAPIDEYGL 333
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
R PKWGHL+++H AIKLCE AL+ + S SLG + EA VY S CAAFLAN+D ++
Sbjct: 334 VREPKWGHLRDVHKAIKLCEPALIATDPSYTSLGQNAEAAVYKTGS-VCAAFLANIDGQS 392
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEM--VPENLQPSEAS---P 439
DKTV F Y LPAWSVSILPDCK VV NTA + +Q ++ EM + + S+ S P
Sbjct: 393 DKTVTFNGRMYRLPAWSVSILPDCKNVVLNTAQINSQVTSSEMRYLESSNMASDGSFITP 452
Query: 440 DNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKN 499
+ G W E GI + K+G ++ INTT D +D+LWY+TSI V +E +L N
Sbjct: 453 ELAVSG--WSYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITVKGDEPYL-N 509
Query: 500 GSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGL 559
GS+ L++ S GH L + N ++ GSA G+ + ++ PI L GKN+I LLS TVGL
Sbjct: 510 GSQSNLVVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSATVGL 569
Query: 560 QNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV 618
N G F++ VGAGIT VK++G N G LDLS+ WTY+IGL+GE L +Y+P + WV
Sbjct: 570 SNYGAFFDLVGAGITGPVKLSGTN-GALDLSSAEWTYQIGLRGEDLHLYDPS-EASPEWV 627
Query: 619 STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPH 678
S P NQPL WYK P GD+P+ +D MGKG AW+NG+ IGRYWP +P
Sbjct: 628 SANAYPINQPLIWYKTKFTPPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWP---TNLAPQ 684
Query: 679 DECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFS 738
CV C+YRG +N +KC+ CG+PSQ YH+PRS+ +P N +V+FE+ GGDP+KI+F
Sbjct: 685 SGCVNSCNYRGSYNSNKCLKKCGQPSQTLYHVPRSFLQPGSNDIVLFEQFGGDPSKISFV 744
Query: 739 IRK 741
IR+
Sbjct: 745 IRQ 747
>gi|224082924|ref|XP_002306893.1| predicted protein [Populus trichocarpa]
gi|222856342|gb|EEE93889.1| predicted protein [Populus trichocarpa]
Length = 853
Score = 806 bits (2081), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/736 (51%), Positives = 497/736 (67%), Gaps = 22/736 (2%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
F +++ S + +C VTYD +++II+G+R ++IS +IHYPRS P MW LVQ+AK+G
Sbjct: 13 FLMVLIVGSKLIHC---TVTYDKKAIIIDGQRRILISGSIHYPRSTPDMWEDLVQKAKDG 69
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ I++YVFWN HE SPG Y F GRF+LV+FIK +Q+ +Y+ LRIGP+V AE+N+GG
Sbjct: 70 GLDVIDTYVFWNVHEPSPGNYNFEGRFDLVRFIKTVQKGGLYVHLRIGPYVCAEWNFGGF 129
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL Y+PG FR D PFK MQ F IV MMK E+LF SQGGPII +Q+ENEYG
Sbjct: 130 PVWLKYVPGISFRTDNGPFKAAMQGFTQKIVQMMKDERLFQSQGGPIIFSQIENEYGPES 189
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
+G G Y WAA+MAV GVPW+MC++ D PDPVINTCN FYCD F+P+ P P
Sbjct: 190 RAFGAAGHSYINWAAQMAVGLKTGVPWVMCKEDDAPDPVINTCNGFYCDAFSPNKPYKPT 249
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTE W GWF FGG HRP +D+AF+VARF QKGGS NYYMYHGGTNFGR+AGGPFI
Sbjct: 250 MWTEAWSGWFTEFGGAFHHRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRSAGGPFI 309
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TTSYDY+APIDEYGL R PK+GHLKELH AIKLCEH L++ + + LG+ Q+A V++
Sbjct: 310 TTSYDYDAPIDEYGLIREPKYGHLKELHRAIKLCEHELVSSDPTITLLGTYQQAHVFSSG 369
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
+C+AFLAN ++ V+F N+ Y LP WS+SILPDC+ VVFNTA V Q+S V+M+P
Sbjct: 370 KRSCSAFLANYHTQSAARVMFNNMHYVLPPWSISILPDCRNVVFNTAKVGVQTSHVQMLP 429
Query: 430 ENLQPSEASPDNGSKGLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
GS+ W+ + E I+ + + G ++ IN T+DTTDYLWY TS+
Sbjct: 430 -----------TGSRFFSWESYDEDISSLGASSRMTALGLMEQINVTRDTTDYLWYITSV 478
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
+N +E FL+ G P L +ES GHALH F N + GSA G + F + P++L+AG N
Sbjct: 479 NINPSESFLRGGQWPTLTVESAGHALHVFINGQFSGSAFGTRENREFTFTGPVNLRAGTN 538
Query: 549 EIALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
IALLS+ VGL N G YE W + V + G N G DL+ W+Y++GL+GE + +
Sbjct: 539 RIALLSIAVGLPNVGVHYETWKTGILGPVMLHGLNQGNKDLTWQQWSYQVGLKGEAMNLV 598
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+P ++++W+ + QPL WYKA P G+EP+ LDM MGKG W+NG+ IGRY
Sbjct: 599 SPNRASSVDWIQGSLATRQQPLKWYKAYFDAPGGNEPLALDMRSMGKGQVWINGQSIGRY 658
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
W ++ +C C Y G F P KC GCG+P+QRWYH+PRSW KP +N+LVIFEE
Sbjct: 659 WLSYAK-----GDC-SSCGYSGTFRPPKCQLGCGQPTQRWYHVPRSWLKPKQNLLVIFEE 712
Query: 728 KGGDPTKITFSIRKIS 743
GGD +KI+ R +
Sbjct: 713 LGGDASKISLVKRSTT 728
>gi|227053553|gb|ACP18875.1| beta-galactosidase pBG(a) [Carica papaya]
Length = 836
Score = 805 bits (2079), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/718 (53%), Positives = 500/718 (69%), Gaps = 17/718 (2%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+V+YD +++ ING+R +++S +IHYPRS P MWP L+Q+AKEGG++ I++YVFWNGHE S
Sbjct: 20 SVSYDHKAITINGKRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPS 79
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PGKYYFGG ++LV+FIK+++QA +Y+ LRIGP+V AE+N+GG PVWL YIPG FR +
Sbjct: 80 PGKYYFGGNYDLVRFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGIAFRTNNG 139
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK +MQ+F IVDMMK E LF SQGGPIIL+Q+ENEYG E G G+ Y+ WAA+M
Sbjct: 140 PFKAYMQRFTKKIVDMMKAEGLFESQGGPIILSQIENEYGPMEYELGAAGRAYSQWAAQM 199
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
AV GVPW+MC+Q D PDP+IN+CN FYCD F+P+ PK+WTE W GWF FGG
Sbjct: 200 AVGLGTGVPWVMCKQDDAPDPIINSCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGAV 259
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
P+RP ED+AFSVARF QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+AP+DEYGL R
Sbjct: 260 PYRPVEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLVR 319
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDK 386
PKWGHLK+LH AIKLCE AL++G+ S + LG QEA V+ G CAAFLAN + ++
Sbjct: 320 QPKWGHLKDLHRAIKLCEPALVSGDPSVMPLGRFQEAHVFKSKYGHCAAFLANYNPRSFA 379
Query: 387 TVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL 446
V F N+ Y+LP WS+SILPDCK V+NTA V AQS+ ++MVP P +G+
Sbjct: 380 KVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQSARMKMVP--------VPIHGA--F 429
Query: 447 KWQVFKEIA-GIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVL 505
WQ + E A GE F G V+ INTT+D +DYLWY+T + ++ +E FLK G P L
Sbjct: 430 SWQAYNEEAPSSNGERSFTTVGLVEQINTTRDVSDYLWYSTDVKIDPDEGFLKTGKYPTL 489
Query: 506 LIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPF 565
+ S GHALH F N +L G+A G+ P + ++L+AG N+I++LS+ VGL N GP
Sbjct: 490 TVLSAGHALHVFVNDQLSGTAYGSLEFPKITFSKGVNLRAGINKISILSIAVGLPNVGPH 549
Query: 566 YEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPP 624
+E AG+ V + G N G DLS W+YK+G++GE + +++ +++ W +
Sbjct: 550 FETWNAGVLGPVTLNGLNEGRRDLSWQKWSYKVGVEGEAMSLHSLSGSSSVEWTAGSFVA 609
Query: 625 KNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
+ QPLTW+K P G+ P+ LDM MGKG W+NG+ IGR+WP S
Sbjct: 610 RRQPLTWFKTTFNAPAGNSPLALDMNSMGKGQIWINGKSIGRHWPAYKASGS-----CGW 664
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
CDY G FN KC++ CGE SQRWYH+PRSW P+ N+LV+FEE GGDP I+ R++
Sbjct: 665 CDYAGTFNEKKCLSNCGEASQRWYHVPRSWPNPTGNLLVVFEEWGGDPNGISLVRREV 722
>gi|359476858|ref|XP_002274449.2| PREDICTED: beta-galactosidase 3 [Vitis vinifera]
Length = 898
Score = 805 bits (2079), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/737 (51%), Positives = 502/737 (68%), Gaps = 23/737 (3%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
+++ S + C +VTYD ++++ING+R ++IS +IHYPRS P MW ++Q+AK+G
Sbjct: 66 LCMVLQLGSQLIQC---SVTYDRKAIVINGQRRILISGSIHYPRSTPDMWEDIIQKAKDG 122
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ +E+YVFWN HE SPG Y F GR++LV+FI+ +Q+A +Y LRIGP+V AE+N+GG
Sbjct: 123 GLDVVETYVFWNVHEPSPGSYNFEGRYDLVRFIRTVQKAGLYAHLRIGPYVCAEWNFGGF 182
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL Y+PG FR D EPFK MQ F IV +MK E+LF SQGGPIIL+Q+ENEYG
Sbjct: 183 PVWLKYVPGISFRTDNEPFKRAMQGFTEKIVGLMKSERLFESQGGPIILSQIENEYGVQS 242
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G+ G Y WAA MAV GVPW+MC++ D PDPVINTCN FYCD F+P+ P P
Sbjct: 243 KLLGDAGHDYMTWAANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDAFSPNKPYKPT 302
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
IWTE W GWF FGG RP +D+AF+VARF QKGGS NYYMYHGGTNFGRTAGGPFI
Sbjct: 303 IWTEAWSGWFNEFGGPLHQRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFI 362
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TTSYDY+APIDEYGL R PK+GHLKELH +IKLCE AL++ + SLGS Q+A VY+
Sbjct: 363 TTSYDYDAPIDEYGLVRQPKYGHLKELHRSIKLCERALVSADPIVSSLGSFQQAHVYSSD 422
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
+G CAAFL+N D K+ V+F N+ Y+LP WS+SILPDC+ VFNTA V Q++ +EM+P
Sbjct: 423 AGDCAAFLSNYDTKSSARVMFNNMHYNLPPWSISILPDCRNAVFNTAKVGVQTAHMEMLP 482
Query: 430 ENLQPSEASPDNGSKGLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
N ++ L W+ + E I+ + + F G ++ IN T+D +DYLWY T I
Sbjct: 483 TN-----------AEMLSWESYDEDISSLDDSSTFTTLGLLEQINVTRDASDYLWYITRI 531
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
+ +E FL+ G P L++++ GHA+H F N +L GSA G + F + ++L AG N
Sbjct: 532 DIGSSESFLRGGELPTLILQTTGHAVHVFINGQLTGSAFGTREYRRFTFTEKVNLHAGTN 591
Query: 549 EIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
IALLS+ VGL N G +E GI V + G N G DLS WTYK+GL+GE + +
Sbjct: 592 TIALLSVAVGLPNVGGHFETWNTGILGPVALHGLNQGKWDLSWQRWTYKVGLKGEAMNLV 651
Query: 608 NPGYRNNINWVS-TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGR 666
+P ++++W+ ++ + QPLTW+KA P GDEP+ LDM MGKG W+NG+ IGR
Sbjct: 652 SPNGISSVDWMQGSLAAQRQQPLTWHKAFFNAPEGDEPLALDMEGMGKGQVWINGQSIGR 711
Query: 667 YWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFE 726
YW + + Q C Y G + P KC GCG+P+QRWYH+PRSW KP++N+LV+FE
Sbjct: 712 YWTAYANGN------CQGCSYSGTYRPPKCQLGCGQPTQRWYHVPRSWLKPTQNLLVVFE 765
Query: 727 EKGGDPTKITFSIRKIS 743
E GGDP++I+ R ++
Sbjct: 766 ELGGDPSRISLVRRSMT 782
>gi|297735069|emb|CBI17431.3| unnamed protein product [Vitis vinifera]
Length = 845
Score = 805 bits (2079), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/737 (51%), Positives = 502/737 (68%), Gaps = 23/737 (3%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
+++ S + C +VTYD ++++ING+R ++IS +IHYPRS P MW ++Q+AK+G
Sbjct: 13 LCMVLQLGSQLIQC---SVTYDRKAIVINGQRRILISGSIHYPRSTPDMWEDIIQKAKDG 69
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ +E+YVFWN HE SPG Y F GR++LV+FI+ +Q+A +Y LRIGP+V AE+N+GG
Sbjct: 70 GLDVVETYVFWNVHEPSPGSYNFEGRYDLVRFIRTVQKAGLYAHLRIGPYVCAEWNFGGF 129
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL Y+PG FR D EPFK MQ F IV +MK E+LF SQGGPIIL+Q+ENEYG
Sbjct: 130 PVWLKYVPGISFRTDNEPFKRAMQGFTEKIVGLMKSERLFESQGGPIILSQIENEYGVQS 189
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G+ G Y WAA MAV GVPW+MC++ D PDPVINTCN FYCD F+P+ P P
Sbjct: 190 KLLGDAGHDYMTWAANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDAFSPNKPYKPT 249
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
IWTE W GWF FGG RP +D+AF+VARF QKGGS NYYMYHGGTNFGRTAGGPFI
Sbjct: 250 IWTEAWSGWFNEFGGPLHQRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFI 309
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TTSYDY+APIDEYGL R PK+GHLKELH +IKLCE AL++ + SLGS Q+A VY+
Sbjct: 310 TTSYDYDAPIDEYGLVRQPKYGHLKELHRSIKLCERALVSADPIVSSLGSFQQAHVYSSD 369
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
+G CAAFL+N D K+ V+F N+ Y+LP WS+SILPDC+ VFNTA V Q++ +EM+P
Sbjct: 370 AGDCAAFLSNYDTKSSARVMFNNMHYNLPPWSISILPDCRNAVFNTAKVGVQTAHMEMLP 429
Query: 430 ENLQPSEASPDNGSKGLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
N ++ L W+ + E I+ + + F G ++ IN T+D +DYLWY T I
Sbjct: 430 TN-----------AEMLSWESYDEDISSLDDSSTFTTLGLLEQINVTRDASDYLWYITRI 478
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
+ +E FL+ G P L++++ GHA+H F N +L GSA G + F + ++L AG N
Sbjct: 479 DIGSSESFLRGGELPTLILQTTGHAVHVFINGQLTGSAFGTREYRRFTFTEKVNLHAGTN 538
Query: 549 EIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
IALLS+ VGL N G +E GI V + G N G DLS WTYK+GL+GE + +
Sbjct: 539 TIALLSVAVGLPNVGGHFETWNTGILGPVALHGLNQGKWDLSWQRWTYKVGLKGEAMNLV 598
Query: 608 NPGYRNNINWVS-TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGR 666
+P ++++W+ ++ + QPLTW+KA P GDEP+ LDM MGKG W+NG+ IGR
Sbjct: 599 SPNGISSVDWMQGSLAAQRQQPLTWHKAFFNAPEGDEPLALDMEGMGKGQVWINGQSIGR 658
Query: 667 YWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFE 726
YW + + Q C Y G + P KC GCG+P+QRWYH+PRSW KP++N+LV+FE
Sbjct: 659 YWTAYANGN------CQGCSYSGTYRPPKCQLGCGQPTQRWYHVPRSWLKPTQNLLVVFE 712
Query: 727 EKGGDPTKITFSIRKIS 743
E GGDP++I+ R ++
Sbjct: 713 ELGGDPSRISLVRRSMT 729
>gi|4006924|emb|CAB16852.1| beta-galactosidase like protein [Arabidopsis thaliana]
gi|7270584|emb|CAB80302.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 853
Score = 805 bits (2079), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/720 (53%), Positives = 495/720 (68%), Gaps = 20/720 (2%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD ++L+ING+R ++ S +IHYPRS P MW L+Q+AK+GG++ IE+YVFWN HE SP
Sbjct: 30 VTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGIDVIETYVFWNLHEPSP 89
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GKY F GR +LV+F+K I +A +Y LRIGP+V AE+N+GG PVWL Y+PG FR D EP
Sbjct: 90 GKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 149
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK M+ F IV++MK E LF SQGGPIIL+Q+ENEYG G G Y WAAKMA
Sbjct: 150 FKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQLLGAEGHNYMTWAAKMA 209
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
+A GVPW+MC++ D PDPVINTCN FYCD F P+ P P IWTE W GWF FGG
Sbjct: 210 IATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNKPYKPLIWTEAWSGWFTEFGGPMH 269
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
HRP +D+AF VARF QKGGS NYYMYHGGTNFGRTAGGPF+TTSYDY+APIDEYGL R
Sbjct: 270 HRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYGLIRQ 329
Query: 328 PKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKT 387
PK+GHLKELH AIK+CE AL++ + S+G+ Q+A VY+ SG C+AFLAN D ++
Sbjct: 330 PKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQQAHVYSAESGDCSAFLANYDTESAAR 389
Query: 388 VVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLK 447
V+F NV Y+LP WS+SILPDC+ VFNTA V Q+S +EM+P + +K +
Sbjct: 390 VLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQTSQMEMLPTD-----------TKNFQ 438
Query: 448 WQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
W+ + E ++ + + F G ++ IN T+DT+DYLWY TS+ + ++E FL G P L+
Sbjct: 439 WESYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSVDIGDSESFLHGGELPTLI 498
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
I+S GHA+H F N +L GSA G + F Y+ I+L +G N IALLS+ VGL N G +
Sbjct: 499 IQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSVAVGLPNVGGHF 558
Query: 567 EWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV-STMEPP 624
E GI V + G + G +DLS WTY++GL+GE + + P +I W+ +++
Sbjct: 559 ESWNTGILGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMNLAFPTNTPSIGWMDASLTVQ 618
Query: 625 KNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
K QPLTW+K P G+EP+ LDM MGKG W+NGE IGRYW + H
Sbjct: 619 KPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAFATGDCSH------ 672
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKISG 744
C Y G + P+KC TGCG+P+QRWYH+PR+W KPS+N+LVIFEE GG+P+ ++ R +SG
Sbjct: 673 CSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSG 732
>gi|18419821|ref|NP_568001.1| beta-galactosidase 3 [Arabidopsis thaliana]
gi|75202767|sp|Q9SCV9.1|BGAL3_ARATH RecName: Full=Beta-galactosidase 3; Short=Lactase 3; Flags:
Precursor
gi|6686878|emb|CAB64739.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|15810493|gb|AAL07134.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|20259271|gb|AAM14371.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332661246|gb|AEE86646.1| beta-galactosidase 3 [Arabidopsis thaliana]
Length = 856
Score = 805 bits (2079), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/720 (53%), Positives = 495/720 (68%), Gaps = 20/720 (2%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD ++L+ING+R ++ S +IHYPRS P MW L+Q+AK+GG++ IE+YVFWN HE SP
Sbjct: 33 VTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGIDVIETYVFWNLHEPSP 92
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GKY F GR +LV+F+K I +A +Y LRIGP+V AE+N+GG PVWL Y+PG FR D EP
Sbjct: 93 GKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 152
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK M+ F IV++MK E LF SQGGPIIL+Q+ENEYG G G Y WAAKMA
Sbjct: 153 FKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQLLGAEGHNYMTWAAKMA 212
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
+A GVPW+MC++ D PDPVINTCN FYCD F P+ P P IWTE W GWF FGG
Sbjct: 213 IATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNKPYKPLIWTEAWSGWFTEFGGPMH 272
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
HRP +D+AF VARF QKGGS NYYMYHGGTNFGRTAGGPF+TTSYDY+APIDEYGL R
Sbjct: 273 HRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYGLIRQ 332
Query: 328 PKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKT 387
PK+GHLKELH AIK+CE AL++ + S+G+ Q+A VY+ SG C+AFLAN D ++
Sbjct: 333 PKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQQAHVYSAESGDCSAFLANYDTESAAR 392
Query: 388 VVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLK 447
V+F NV Y+LP WS+SILPDC+ VFNTA V Q+S +EM+P + +K +
Sbjct: 393 VLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQTSQMEMLPTD-----------TKNFQ 441
Query: 448 WQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
W+ + E ++ + + F G ++ IN T+DT+DYLWY TS+ + ++E FL G P L+
Sbjct: 442 WESYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSVDIGDSESFLHGGELPTLI 501
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
I+S GHA+H F N +L GSA G + F Y+ I+L +G N IALLS+ VGL N G +
Sbjct: 502 IQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSVAVGLPNVGGHF 561
Query: 567 EWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV-STMEPP 624
E GI V + G + G +DLS WTY++GL+GE + + P +I W+ +++
Sbjct: 562 ESWNTGILGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMNLAFPTNTPSIGWMDASLTVQ 621
Query: 625 KNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
K QPLTW+K P G+EP+ LDM MGKG W+NGE IGRYW + H
Sbjct: 622 KPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAFATGDCSH------ 675
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKISG 744
C Y G + P+KC TGCG+P+QRWYH+PR+W KPS+N+LVIFEE GG+P+ ++ R +SG
Sbjct: 676 CSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSG 735
>gi|30690633|ref|NP_849506.1| beta-galactosidase 3 [Arabidopsis thaliana]
gi|332661247|gb|AEE86647.1| beta-galactosidase 3 [Arabidopsis thaliana]
Length = 855
Score = 805 bits (2079), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/720 (53%), Positives = 495/720 (68%), Gaps = 20/720 (2%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD ++L+ING+R ++ S +IHYPRS P MW L+Q+AK+GG++ IE+YVFWN HE SP
Sbjct: 33 VTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGIDVIETYVFWNLHEPSP 92
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GKY F GR +LV+F+K I +A +Y LRIGP+V AE+N+GG PVWL Y+PG FR D EP
Sbjct: 93 GKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 152
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK M+ F IV++MK E LF SQGGPIIL+Q+ENEYG G G Y WAAKMA
Sbjct: 153 FKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQLLGAEGHNYMTWAAKMA 212
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
+A GVPW+MC++ D PDPVINTCN FYCD F P+ P P IWTE W GWF FGG
Sbjct: 213 IATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNKPYKPLIWTEAWSGWFTEFGGPMH 272
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
HRP +D+AF VARF QKGGS NYYMYHGGTNFGRTAGGPF+TTSYDY+APIDEYGL R
Sbjct: 273 HRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYGLIRQ 332
Query: 328 PKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKT 387
PK+GHLKELH AIK+CE AL++ + S+G+ Q+A VY+ SG C+AFLAN D ++
Sbjct: 333 PKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQQAHVYSAESGDCSAFLANYDTESAAR 392
Query: 388 VVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLK 447
V+F NV Y+LP WS+SILPDC+ VFNTA V Q+S +EM+P + +K +
Sbjct: 393 VLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQTSQMEMLPTD-----------TKNFQ 441
Query: 448 WQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
W+ + E ++ + + F G ++ IN T+DT+DYLWY TS+ + ++E FL G P L+
Sbjct: 442 WESYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSVDIGDSESFLHGGELPTLI 501
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
I+S GHA+H F N +L GSA G + F Y+ I+L +G N IALLS+ VGL N G +
Sbjct: 502 IQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSVAVGLPNVGGHF 561
Query: 567 EWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV-STMEPP 624
E GI V + G + G +DLS WTY++GL+GE + + P +I W+ +++
Sbjct: 562 ESWNTGILGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMNLAFPTNTPSIGWMDASLTVQ 621
Query: 625 KNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
K QPLTW+K P G+EP+ LDM MGKG W+NGE IGRYW + H
Sbjct: 622 KPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAFATGDCSH------ 675
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKISG 744
C Y G + P+KC TGCG+P+QRWYH+PR+W KPS+N+LVIFEE GG+P+ ++ R +SG
Sbjct: 676 CSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSG 735
>gi|350539595|ref|NP_001234465.1| beta-galactosidase precursor [Solanum lycopersicum]
gi|1352077|sp|P48980.1|BGAL_SOLLC RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; AltName:
Full=Exo-(1-->4)-beta-D-galactanase; Flags: Precursor
gi|6649906|gb|AAF21626.1|AF023847_1 beta-galactosidase precursor [Solanum lycopersicum]
gi|971485|emb|CAA58734.1| putative beta-galactosidase/galactanase [Solanum lycopersicum]
gi|4138139|emb|CAA10174.1| ss-galactosidase [Solanum lycopersicum]
Length = 835
Score = 805 bits (2079), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/721 (53%), Positives = 498/721 (69%), Gaps = 18/721 (2%)
Query: 23 CFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNG 82
C +V+YD +++I+NG+R+++IS +IHYPRS P MWP L+Q+AKEGGV+ I++YVFWNG
Sbjct: 19 CGIASVSYDHKAIIVNGQRKILISGSIHYPRSTPEMWPDLIQKAKEGGVDVIQTYVFWNG 78
Query: 83 HELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFR 142
HE GKYYF R++LVKFIK++Q+A +Y+ LRIGP+ AE+N+GG PVWL Y+PG FR
Sbjct: 79 HEPEEGKYYFEERYDLVKFIKVVQEAGLYVHLRIGPYACAEWNFGGFPVWLKYVPGISFR 138
Query: 143 NDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALW 202
+ EPFK MQKF T IVDMMK EKL+ +QGGPIIL+Q+ENEYG E GE GK Y+ W
Sbjct: 139 TNNEPFKAAMQKFTTKIVDMMKAEKLYETQGGPIILSQIENEYGPMEWELGEPGKVYSEW 198
Query: 203 AAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTF 262
AAKMAV GVPWIMC+Q D PDP+INTCN FYCD FTP+ + PK+WTE W WF F
Sbjct: 199 AAKMAVDLGTGVPWIMCKQDDVPDPIINTCNGFYCDYFTPNKANKPKMWTEAWTAWFTEF 258
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
GG P+RP+ED+AF+VARF Q GGS NYYMYHGGTNFGRT+GGPFI TSYDY+AP+DE+
Sbjct: 259 GGPVPYRPAEDMAFAVARFIQTGGSFINYYMYHGGTNFGRTSGGPFIATSYDYDAPLDEF 318
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
G R PKWGHLK+LH AIKLCE AL++ + + SLG+ QEA V+ SGACAAFLAN +
Sbjct: 319 GSLRQPKWGHLKDLHRAIKLCEPALVSVDPTVTSLGNYQEARVFKSESGACAAFLANYNQ 378
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
+ V F N+ Y+LP WS+SILPDCK V+NTA V AQS+ ++M P
Sbjct: 379 HSFAKVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQSAQMKMTPV------------ 426
Query: 443 SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
S+G W+ F E A + F G ++ IN T+D +DYLWY T I ++ E FL +G+
Sbjct: 427 SRGFSWESFNEDAASHEDDTFTVVGLLEQINITRDVSDYLWYMTDIEIDPTEGFLNSGNW 486
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
P L + S GHALH F N +L G+ G+ +P + N I+L+AG N+I+LLS+ VGL N
Sbjct: 487 PWLTVFSAGHALHVFVNGQLAGTVYGSLENPKLTFSNGINLRAGVNKISLLSIAVGLPNV 546
Query: 563 GPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTM 621
GP +E AG+ V + G N GT DL+ W YK+GL+GE L +++ ++ WV
Sbjct: 547 GPHFETWNAGVLGPVSLNGLNEGTRDLTWQKWFYKVGLKGEALSLHSLSGSPSVEWVEGS 606
Query: 622 EPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDEC 681
+ QPL+WYK P G+EP+ LDM MGKG W+NG+ +GR+WP S
Sbjct: 607 LVAQKQPLSWYKTTFNAPDGNEPLALDMNTMGKGQVWINGQSLGRHWPAYKSSGS----- 661
Query: 682 VQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRK 741
C+Y G F+ KC+T CGE SQRWYH+PRSW P+ N+LV+FEE GGDP IT R+
Sbjct: 662 CSVCNYTGWFDEKKCLTNCGEGSQRWYHVPRSWLYPTGNLLVVFEEWGGDPYGITLVKRE 721
Query: 742 I 742
I
Sbjct: 722 I 722
>gi|449460229|ref|XP_004147848.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
gi|449476862|ref|XP_004154857.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 844
Score = 805 bits (2078), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/737 (51%), Positives = 499/737 (67%), Gaps = 25/737 (3%)
Query: 11 ALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGG 70
ALL F S+ T VTYD ++++ING+R ++IS +IHYPRS P MW L+Q+AK+GG
Sbjct: 17 ALLGFRSTQCT-----TVTYDKKAILINGQRRILISGSIHYPRSTPEMWDDLMQKAKDGG 71
Query: 71 VNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIP 130
++ +++YVFWN HE SPG Y F GR++LV+FIK Q+ +Y+ LRIGP+V AE+N+GG P
Sbjct: 72 LDVVDTYVFWNVHEPSPGNYDFEGRYDLVRFIKTAQRVGLYVHLRIGPYVCAEWNFGGFP 131
Query: 131 VWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYES 190
VWL Y+PG FR D PFK MQ F IV MMK EKLFASQGGPIIL+Q+ENEYG
Sbjct: 132 VWLKYVPGISFRTDNGPFKMAMQGFTQKIVQMMKSEKLFASQGGPIILSQIENEYGPQSK 191
Query: 191 FYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKI 250
G G Y WAAKMAV N GVPW+MC++ D PDPVIN+CN FYCD F+P+ P P +
Sbjct: 192 ALGAAGHAYMNWAAKMAVGLNTGVPWVMCKEDDAPDPVINSCNGFYCDYFSPNKPYKPTL 251
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT 310
WTE W GWF FGG RP +D+AF+VARF QKGGS+ NYYMYHGGTNFGRTAGGPFIT
Sbjct: 252 WTEAWSGWFTEFGGPVYGRPVQDLAFAVARFVQKGGSLFNYYMYHGGTNFGRTAGGPFIT 311
Query: 311 TSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSS 370
TSYDY+AP+DEYG+ R PK+GHLK LH AIKLCEHAL++ + + SLG+ ++A V++
Sbjct: 312 TSYDYDAPLDEYGMLRQPKYGHLKNLHRAIKLCEHALVSSDPTVTSLGAYEQAHVFSSGP 371
Query: 371 GACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPE 430
G CAAFLAN + TVVF N+ Y LPAWS+SILPDCK+VVFNTA V + +M+P
Sbjct: 372 GRCAAFLANYHTNSAATVVFNNMRYALPAWSISILPDCKRVVFNTAQVGVHIAQTQMLPT 431
Query: 431 NLQPSEASPDNGSKGLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSII 489
+ L W+ + E + G + +G ++ IN T+DT+DYLWY TS+
Sbjct: 432 ISK------------LSWETYNEDTYSLGGSSRMTVAGLLEQINVTRDTSDYLWYMTSVG 479
Query: 490 VNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNE 549
++ +E FL+ G +P L + S GHA+H F N + GSA G+ HP F Y PI+L+AG N+
Sbjct: 480 ISSSEAFLRGGQKPTLSVRSAGHAVHVFINGQFSGSAYGSREHPAFTYTGPINLRAGMNK 539
Query: 550 IALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYN 608
IALLS+ VGL N G +E W + + I+G N G DL+ W+Y++GL+GE + + +
Sbjct: 540 IALLSIAVGLPNVGLHFEKWQTGILGPISISGLNGGKKDLTWQKWSYQVGLKGEAMNLVS 599
Query: 609 PGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYW 668
P +++W+ +PLTWYKA P G+EP+ LD+ MGKG AW+NG+ IGRYW
Sbjct: 600 PTEATSVDWIKGSLLQGQRPLTWYKASFNAPRGNEPLALDLRSMGKGQAWINGQSIGRYW 659
Query: 669 PRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEK 728
++ C Y G + P C GCG+P+QRWYH+PRSW KP+ N+LV+FEE
Sbjct: 660 MAYAKGG------CSRCTYAGTYRPPTCENGCGQPTQRWYHVPRSWLKPTNNVLVLFEEL 713
Query: 729 GGDPTKITFSIRKISGF 745
GGD +KI+ R ++G
Sbjct: 714 GGDASKISLMRRSVTGL 730
>gi|297798272|ref|XP_002867020.1| hypothetical protein ARALYDRAFT_491000 [Arabidopsis lyrata subsp.
lyrata]
gi|297312856|gb|EFH43279.1| hypothetical protein ARALYDRAFT_491000 [Arabidopsis lyrata subsp.
lyrata]
Length = 853
Score = 805 bits (2078), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/720 (53%), Positives = 496/720 (68%), Gaps = 20/720 (2%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD ++L+ING+R ++ S +IHYPRS P MW GL+Q+AK+GG++ IE+YVFWN HE +P
Sbjct: 30 VTYDRKALLINGQRRILFSGSIHYPRSTPDMWEGLIQKAKDGGIDVIETYVFWNLHEPTP 89
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GKY F GR +LV+F+K I +A +Y LRIGP+V AE+N+GG PVWL Y+PG FR D EP
Sbjct: 90 GKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 149
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK M+ F IV++MK E LF SQGGPIIL+Q+ENEYG G G Y WAAKMA
Sbjct: 150 FKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQLLGAEGHNYMTWAAKMA 209
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
+A GVPW+MC++ D PDPVINTCN FYCD F P+ P P IWTE W GWF FGG
Sbjct: 210 IATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNKPYKPLIWTEAWSGWFTEFGGPMH 269
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
HRP +D+AF VARF QKGGS NYYMYHGGTNFGRTAGGPF+TTSYDY+APIDEYGL R
Sbjct: 270 HRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYGLIRE 329
Query: 328 PKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKT 387
PK+GHLKELH AIK+CE AL++ + S+G+ Q+A VY+ SG C+AFLAN D ++
Sbjct: 330 PKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQQAHVYSAESGDCSAFLANYDTESAAR 389
Query: 388 VVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLK 447
V+F NV Y+LP WS+SILPDC+ VFNTA V Q+S +EM+P + +K +
Sbjct: 390 VLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQTSQMEMLPTD-----------TKNFQ 438
Query: 448 WQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
WQ + E ++ + + F G ++ IN T+DT+DYLWY TS+ + + E FL G P L+
Sbjct: 439 WQSYLEDLSSLDDSSTFTTQGLLEQINVTRDTSDYLWYMTSVDIGDTESFLHGGELPTLI 498
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
I+S GHA+H F N +L GSA G + F Y+ I+L +G N IALLS+ VGL N G +
Sbjct: 499 IQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSVAVGLPNVGGHF 558
Query: 567 EWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV-STMEPP 624
E GI V + G + G DLS WTY++GL+GE + + P +I W+ +++
Sbjct: 559 ESWNTGILGPVALHGLSQGKRDLSWQKWTYQVGLKGEAMNLAFPTNTRSIGWMDASLTVQ 618
Query: 625 KNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
K QPLTW+K P G+EP+ LDM MGKG W+NGE IGRYW + +C Q
Sbjct: 619 KPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYW-----TAFATGDCSQ- 672
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKISG 744
C Y G + P+KC TGCG+P+QR+YH+PRSW KPS+N+LVIFEE GG+P+ ++ R +SG
Sbjct: 673 CSYTGTYKPNKCQTGCGQPTQRYYHVPRSWLKPSQNLLVIFEELGGNPSSVSLVKRSVSG 732
>gi|183238710|gb|ACC60981.1| beta-galactosidase 1 precursor [Petunia x hybrida]
Length = 842
Score = 805 bits (2078), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/733 (51%), Positives = 500/733 (68%), Gaps = 18/733 (2%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
+L+ SS + +V+YD +++I+NG+R ++IS +IHYPRS P MWP L+Q+AKEGGV
Sbjct: 15 VLLVLLSSCVFSGLASVSYDHKAIIVNGQRRILISGSIHYPRSTPEMWPDLIQKAKEGGV 74
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ I++YVFWNGHE GKYYF R++LVKFIK++ QA +Y+ LR+GP+ AE+N+GG PV
Sbjct: 75 DVIQTYVFWNGHEPEQGKYYFEERYDLVKFIKLVHQAGLYVNLRVGPYACAEWNFGGFPV 134
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL Y+PG FR D EPFK MQKF T IV+MMK E+L+ SQGGPIIL+Q+ENEYG E
Sbjct: 135 WLKYVPGISFRTDNEPFKAAMQKFTTKIVNMMKAERLYESQGGPIILSQIENEYGPLEVR 194
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
+GE GK YA WAAKMA+ GVPW+MC+Q D PDPVINTCN FYCD F P+ PKIW
Sbjct: 195 FGEQGKSYAEWAAKMALDLGTGVPWLMCKQDDAPDPVINTCNGFYCDYFYPNKAYKPKIW 254
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TE W WF FG P+RP ED+AF VA F Q GGS NYYMYHGGTNFGRTAGGPF+ T
Sbjct: 255 TEAWTAWFTEFGSPVPYRPVEDLAFGVANFIQTGGSFINYYMYHGGTNFGRTAGGPFVAT 314
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
SYDY+AP+DE+GL R PKWGHLK+LH AIKLCE AL++G+ + +LG+ Q+A V+ +SG
Sbjct: 315 SYDYDAPLDEFGLLRQPKWGHLKDLHRAIKLCEPALVSGDPTVTALGNYQKAHVFRSTSG 374
Query: 372 ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPEN 431
ACAAFLAN D + TV F N Y+LP WS+SILPDCK V+NTA V AQS+ ++M P N
Sbjct: 375 ACAAFLANNDPNSFATVAFGNKHYNLPPWSISILPDCKHTVYNTARVGAQSALMKMTPAN 434
Query: 432 LQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVN 491
+G WQ + + + + F G ++ +NTT+D +DYLWY T + ++
Sbjct: 435 ------------EGYSWQSYNDQTAFYDDNAFTVVGLLEQLNTTRDVSDYLWYMTDVKID 482
Query: 492 ENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIA 551
+E FL++G+ P L + S G ALH F N +L G+ G+ + ++L+AG N+I+
Sbjct: 483 PSEGFLRSGNWPWLTVSSAGDALHVFVNGQLAGTVYGSLKKQKITFSKAVNLRAGVNKIS 542
Query: 552 LLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPG 610
LLS+ VGL N GP +E W + V ++G + G DL+ W+YK+GL+GE L +++
Sbjct: 543 LLSIAVGLPNIGPHFETWNTGVLGPVSLSGLDEGKRDLTWQKWSYKVGLKGEALNLHSLS 602
Query: 611 YRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPR 670
+++ WV + QPLTWYK P G+EP+ LDM MGKG W+NG+ IGRYWP
Sbjct: 603 GSSSVEWVEGSLVAQRQPLTWYKTTFNAPAGNEPLALDMNSMGKGQVWINGQSIGRYWPG 662
Query: 671 KSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGG 730
+ C+Y G FN KC++ CG+ SQRWYH+PRSW P+ N+LV+FEE GG
Sbjct: 663 YKASGT-----CDACNYAGPFNEKKCLSNCGDASQRWYHVPRSWLHPTGNLLVVFEEWGG 717
Query: 731 DPTKITFSIRKIS 743
DP I+ R+++
Sbjct: 718 DPNGISLVKRELA 730
>gi|84579369|dbj|BAE72073.1| pear beta-galactosidase1 [Pyrus communis]
Length = 731
Score = 805 bits (2078), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/733 (53%), Positives = 500/733 (68%), Gaps = 21/733 (2%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
+++L+ FS I + +V+YD +++IING++ ++IS +IHYPRS P MWP L+Q+AK+G
Sbjct: 9 WSILLLFSC-IFSAASASVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKAKDG 67
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ I++YVFWNGHE SPGKYYF R++LVKFIK++QQA +++ LRIGP+V AE+N+GG
Sbjct: 68 GLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGF 127
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL Y+PG FR D EPFK MQKF IV MMK EKLF SQGGPIIL+Q+ENE+G E
Sbjct: 128 PVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQSQGGPIILSQIENEFGPVE 187
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G GK Y WAA+MAV + GVPWIMC+Q D PDPVI+TCN FYC+ F P+ PK
Sbjct: 188 WEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDYKPK 247
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTE W GW+ FGG P RP+ED+AFSVARF Q GGS NYYMYHGGTNFGRTAGGPF+
Sbjct: 248 MWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQSGGSFLNYYMYHGGTNFGRTAGGPFM 307
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TSYDY+AP+DEYGLPR PKWGHL++LH AIK CE AL++ + S LGS+QEA V+
Sbjct: 308 ATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKPCESALVSVDPSVTKLGSNQEAHVFKSE 367
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
S CAAFLAN D K V F Y LP WS+SILPDCK V+NTA V +QSS V+M P
Sbjct: 368 SD-CAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGSQSSQVQMTP 426
Query: 430 ENLQPSEASPDNGSKGLKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
+ G WQ F +E G + IN T+DTTDYLWY T I
Sbjct: 427 VH------------SGFPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWYMTDI 474
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
+ +E FLKNG P+L I S GHAL+ F N +L G+ G+ +P + ++L++G N
Sbjct: 475 TIGSDEAFLKNGKSPLLTISSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNLRSGIN 534
Query: 549 EIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
++ALLS++VGL N G +E AG+ + + G NSGT D+S + WTYK GL+GE LG++
Sbjct: 535 KLALLSISVGLPNVGTHFETWNAGVLGPITLKGLNSGTWDMSGWKWTYKTGLKGEALGLH 594
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+++ WV K QPLTWYKA PPGD P+ LDM MGKG W+NG+ +GR+
Sbjct: 595 TVTGSSSVEWVEGPSMAKKQPLTWYKATFNAPPGDAPLALDMGSMGKGQIWINGQSVGRH 654
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
WP + S D C Y G ++ KC T CGEPSQRWYHIPRSW P+ N+LV+FEE
Sbjct: 655 WPGYIARGSCGD-----CSYAGTYDDKKCRTHCGEPSQRWYHIPRSWLTPTGNLLVVFEE 709
Query: 728 KGGDPTKITFSIR 740
GGDP+ I+ R
Sbjct: 710 WGGDPSGISLVER 722
>gi|356561185|ref|XP_003548865.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 848
Score = 805 bits (2078), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/721 (52%), Positives = 495/721 (68%), Gaps = 20/721 (2%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
+VTYD ++++ING+R ++ S +IHYPRS P MW L+ +AKEGG++ +E+YVFWN HE
Sbjct: 25 ASVTYDRKAILINGQRRILFSGSIHYPRSTPDMWEDLILKAKEGGLDVVETYVFWNVHEP 84
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
SPG Y F GR++LV+F+K IQ+A +Y LRIGP+V AE+N+GG PVWL Y+PG FR D
Sbjct: 85 SPGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDN 144
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
EPFK MQ F IV MMK E+LF SQGGPIIL+Q+ENEYG G+ G+ Y WAAK
Sbjct: 145 EPFKTAMQGFTEKIVGMMKSERLFESQGGPIILSQIENEYGAQSKLQGDAGQNYVNWAAK 204
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGR 265
MAV GVPW+MC++ D PDPVINTCN FYCD+FTP+ P P IWTE W GWF FGG
Sbjct: 205 MAVEMGTGVPWVMCKEDDAPDPVINTCNGFYCDKFTPNRPYKPMIWTEAWSGWFTEFGGP 264
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
RP +D+AF+VARF +GGS NYYMYHGGTNFGRTAGGPFI TSYDY+AP+DEYGL
Sbjct: 265 IHKRPVQDLAFAVARFIIRGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLI 324
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKND 385
R PK+GHLKELH AIK+CE AL++ + SLG SQ+A VY SG CAAFL+N D K+
Sbjct: 325 RQPKYGHLKELHRAIKMCERALVSTDPIITSLGESQQAHVYTTESGDCAAFLSNYDSKSS 384
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
V+F N+ Y+LP WSVSILPDC+ VVFNTA V Q+S ++M+P N Q
Sbjct: 385 ARVMFNNMHYNLPPWSVSILPDCRNVVFNTAKVGVQTSQMQMLPTNTQL----------- 433
Query: 446 LKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
W+ F E + + + + G ++ IN TKD +DYLWY TS+ + +E FL+ G P
Sbjct: 434 FSWESFDEDVYSVDDSSAIMAPGLLEQINVTKDASDYLWYITSVDIGSSESFLRGGELPT 493
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
L+++S+GHA+H F N +L GSA G + F Y ++L+AG N IALLS+ +GL N G
Sbjct: 494 LIVQSRGHAVHVFINGQLSGSAYGTREYRRFMYTGKVNLRAGINRIALLSVAIGLPNVGE 553
Query: 565 FYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV-STME 622
+E W + V + G + G DLS WTY++GL+GE + + +P +++ W+ S +
Sbjct: 554 HFESWSTGILGPVALHGLDQGKWDLSGQKWTYQVGLKGEAMDLASPNGISSVAWMQSAIV 613
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
+NQPLTW+K P GDEP+ LDM MGKG W+NG+ IGRYW + +
Sbjct: 614 VQRNQPLTWHKTHFDAPEGDEPLALDMEGMGKGQIWINGQSIGRYWTTFATGN------C 667
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
+C+Y G F P KC GCG+P+QRWYH+PRSW KP++N+LVIFEE GG+P+KI+ R +
Sbjct: 668 NDCNYAGSFRPPKCQLGCGQPTQRWYHVPRSWLKPTQNLLVIFEELGGNPSKISLVKRSV 727
Query: 743 S 743
S
Sbjct: 728 S 728
>gi|449491392|ref|XP_004158882.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 854
Score = 804 bits (2077), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/721 (52%), Positives = 492/721 (68%), Gaps = 20/721 (2%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+VTYD ++++ING+R ++ S +IHYPRS P MW GL+Q+AKEGG++ +E+YVFWN HE S
Sbjct: 28 SVTYDRKAILINGQRRVLFSGSIHYPRSTPEMWEGLIQKAKEGGLDVVETYVFWNVHEPS 87
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PG Y F GR++L +FIK IQ+A +Y LRIGP+V AE+N+GG PVWL Y+PG FR D E
Sbjct: 88 PGNYNFEGRYDLARFIKTIQKAGLYANLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 147
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK MQ F IV +MK E LF SQGGPIIL+Q+ENEYG +G G+ Y WAAKM
Sbjct: 148 PFKRAMQGFTEKIVGLMKSENLFESQGGPIILSQIENEYGVQSKLFGAAGQNYMTWAAKM 207
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
AV GVPW+MC++ D PDPVINTCN FYCD F+P+ P P +WTE W GWF FGG
Sbjct: 208 AVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDAFSPNRPYKPTMWTEAWSGWFNEFGGPI 267
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
RP +D+AF+VARF QKGGS NYYMYHGGTNFGRTAGGPFITTSYDY+APIDEYGL R
Sbjct: 268 HQRPVQDLAFAVARFIQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIR 327
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDK 386
PK+GHLKELH A+K+CE AL++ + SLGSSQ+A VY SG CAAFL+N D +
Sbjct: 328 QPKYGHLKELHRAVKMCEKALVSADPIVTSLGSSQQAYVYTSESGNCAAFLSNYDTDSAA 387
Query: 387 TVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL 446
V+F N+ Y+LP WS+SILPDC+ VVFNTA V Q+S +EM+P N S L
Sbjct: 388 RVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSQLEMLPTN-----------SPML 436
Query: 447 KWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVL 505
W+ + E ++ SG ++ IN TKDT+DYLWY TS+ + E FL G P L
Sbjct: 437 LWESYNEDVSAEDDSTTMTASGLLEQINVTKDTSDYLWYITSVDIGSTESFLHGGELPTL 496
Query: 506 LIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPF 565
+++S GHA+H F N L GSA G+ + F Y ++ +AG+N IALLS+ VGL N G
Sbjct: 497 IVQSTGHAVHIFINGRLSGSAFGSRENRRFTYTGKVNFRAGRNTIALLSVAVGLPNVGGH 556
Query: 566 YEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVS-TMEP 623
+E GI V + G + G LDLS WTYK+GL+GE + + +P +++ W+ ++
Sbjct: 557 FETWNTGILGPVALHGLDQGKLDLSWAKWTYKVGLKGEAMNLVSPNGISSVEWMEGSLAA 616
Query: 624 PKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQ 683
QPLTW+K+ P GDEP+ +DM MGKG W+NG IGRYW + +
Sbjct: 617 QAPQPLTWHKSNFDAPEGDEPLAIDMRGMGKGQIWINGVSIGRYWTAYATGN------CD 670
Query: 684 ECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
+C+Y G F P KC GCG+P+QRWYH+PR+W KP +N+LV+FEE GG+PT I+ R ++
Sbjct: 671 KCNYAGTFRPPKCQQGCGQPTQRWYHVPRAWLKPKDNLLVVFEELGGNPTSISLVKRSVT 730
Query: 744 G 744
G
Sbjct: 731 G 731
>gi|356550173|ref|XP_003543463.1| PREDICTED: beta-galactosidase 8-like isoform 2 [Glycine max]
Length = 830
Score = 804 bits (2076), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/734 (54%), Positives = 502/734 (68%), Gaps = 29/734 (3%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
F LL S ++ F NV YD R+L+I+G+R ++IS +IHYPRS P MWP L+Q++K+G
Sbjct: 11 FWLLCIHSPTL---FCANVEYDHRALVIDGKRRVLISGSIHYPRSTPEMWPDLIQKSKDG 67
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ IE+YVFWN +E G+Y F GR +LVKF+K + A +Y+ LRIGP+V AE+NYGG
Sbjct: 68 GLDVIETYVFWNLNEPVRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAEWNYGGF 127
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
P+WLH+IPG FR D EPFK M++F IVDM+K E L+ASQGGP+IL+Q+ENEYG +
Sbjct: 128 PLWLHFIPGIKFRTDNEPFKAEMKRFTAKIVDMIKEENLYASQGGPVILSQIENEYGNID 187
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
S YG GK Y WAA MA + + GVPW+MCQQ D PDP+INTCN FYCDQFTP+S + PK
Sbjct: 188 SAYGAAGKSYIKWAATMATSLDTGVPWVMCQQADAPDPIINTCNGFYCDQFTPNSNTKPK 247
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTENW GWF FGG P+RP ED+AF+VARFFQ+GG+ NYYMYHGGTNF RT+GGPFI
Sbjct: 248 MWTENWSGWFLPFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFI 307
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TSYDY+APIDEYG+ R PKWGHLKE+H AIKLCE AL+ + + SLG + EA VY
Sbjct: 308 ATSYDYDAPIDEYGIIRQPKWGHLKEVHKAIKLCEEALIATDPTITSLGPNLEAAVYKTG 367
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
S CAAFLAN+D K+D TV F SYHLPAWSVSILPDCK VV NTA V ++ + M
Sbjct: 368 S-VCAAFLANVDTKSDVTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKV-CLTNFISMF- 424
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSII 489
PS W E GI F ++G ++ INTT D +DYLWY+ SI
Sbjct: 425 -MWLPSSTG---------WSWISEPVGISKADSFPQTGLLEQINTTADKSDYLWYSLSID 474
Query: 490 VNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNE 549
+ GS+ VL IES GHALHAF N +L GS +GN F P++L AGKN
Sbjct: 475 YKGDA-----GSQTVLHIESLGHALHAFINGKLAGSQTGNSGKYKFTVDIPVTLVAGKNT 529
Query: 550 IALLSMTVGLQNAGPFYEWVGAGITS-VKITGF-NSGTLDLSTYSWTYKIGLQGEHLGIY 607
I LLS+TVGLQN G F++ GAGIT V + G N TLDLS WTY++GL+GE LG+
Sbjct: 530 IDLLSLTVGLQNYGAFFDTWGAGITGPVILKGLANGNTLDLSYQKWTYQVGLKGEDLGLS 589
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+ ++ W S PKNQPL WYK P G +P+ +D MGKG AW+NG+ IGRY
Sbjct: 590 S---GSSGQWNSQSTFPKNQPLIWYKTTFAAPSGSDPVAIDFTGMGKGEAWVNGQSIGRY 646
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
WP + C C+YRG ++ KC CG+PSQ YH+PRSW KPS NILV+FEE
Sbjct: 647 WPTYVASDA---GCTDSCNYRGPYSASKCRRNCGKPSQTLYHVPRSWLKPSGNILVLFEE 703
Query: 728 KGGDPTKITFSIRK 741
KGGDPT+I+F ++
Sbjct: 704 KGGDPTQISFVTKQ 717
>gi|357472237|ref|XP_003606403.1| Beta-galactosidase [Medicago truncatula]
gi|355507458|gb|AES88600.1| Beta-galactosidase [Medicago truncatula]
Length = 839
Score = 804 bits (2076), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/735 (53%), Positives = 497/735 (67%), Gaps = 17/735 (2%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
F LL F + F NVTYD R+L+I+G+R +++S +IHYPRS P MWP L+Q++K+G
Sbjct: 8 FVLLWFLGVYVPASFCSNVTYDHRALVIDGKRRVLMSGSIHYPRSTPQMWPDLIQKSKDG 67
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ IE+YVFWN HE G+Y F GR +LV F+K + A +Y+ LRIGP+V AE+NYGG
Sbjct: 68 GIDVIETYVFWNLHEPVRGQYNFEGRGDLVGFVKAVAAAGLYVHLRIGPYVCAEWNYGGF 127
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
P+WLH+I G FR + EPFK M++F IVDMMK+E L+ASQGGPIIL+Q+ENEYG +
Sbjct: 128 PLWLHFIAGIKFRTNNEPFKAEMKRFTAKIVDMMKQENLYASQGGPIILSQIENEYGNID 187
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
+ K Y WAA MA + + GVPWIMCQQ + PDP+INTCNSFYCDQFTP+S + PK
Sbjct: 188 THDARAAKSYIDWAASMATSLDTGVPWIMCQQANAPDPIINTCNSFYCDQFTPNSDNKPK 247
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTENW GWF FGG P+RP ED+AF+VARFFQ+GG+ NYYMYHGGTNFGRT GGPFI
Sbjct: 248 MWTENWSGWFLAFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFGRTTGGPFI 307
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
+TSYDY+APIDEYG R PKWGHLK+LH AIKLCE AL+ + + S G + E VY +
Sbjct: 308 STSYDYDAPIDEYGDIRQPKWGHLKDLHKAIKLCEEALIASDPTITSPGPNLETAVY-KT 366
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
C+AFLAN+ +D TV F SYHLP WSVSILPDCK VV NTA V S
Sbjct: 367 GAVCSAFLANI-GMSDATVTFNGNSYHLPGWSVSILPDCKNVVLNTAKVNTASMISSFAT 425
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSII 489
E+L+ E S W E GI F KSG ++ INTT D +DYLWY+ SI+
Sbjct: 426 ESLK--EKVDSLDSSSSGWSWISEPVGISTPDAFTKSGLLEQINTTADRSDYLWYSLSIV 483
Query: 490 VNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNE 549
+N G +PVL IES GHALHAF N +L GS +G+ + PI+L GKN
Sbjct: 484 YEDNA-----GDQPVLHIESLGHALHAFVNGKLAGSKAGSSGNAKVNVDIPITLVTGKNT 538
Query: 550 IALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSG-TLDLSTYSWTYKIGLQGEHLGIY 607
I LLS+TVGLQN G FY+ VGAGIT V + G +G ++DL++ WTY++GLQGE +G+
Sbjct: 539 IDLLSLTVGLQNYGAFYDTVGAGITGPVILKGLKNGSSVDLTSQQWTYQVGLQGEFVGLS 598
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+ N W S P NQPLTWYK P G P+ +D MGKG AW+NG+ IGRY
Sbjct: 599 S---GNVGQWNSQSNLPANQPLTWYKTNFVAPSGSNPVAIDFTGMGKGEAWVNGQSIGRY 655
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
WP SP+ C C+YRG ++ KC+ CG+PSQ YH+PR+W KP N V+FEE
Sbjct: 656 WP---TYISPNSGCTDSCNYRGTYSASKCLKNCGKPSQTLYHVPRAWLKPDSNTFVLFEE 712
Query: 728 KGGDPTKITFSIRKI 742
GGDPTKI+F ++I
Sbjct: 713 SGGDPTKISFGTKQI 727
>gi|51507377|emb|CAH18936.1| beta-galactosidase [Pyrus communis]
Length = 724
Score = 803 bits (2074), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/733 (53%), Positives = 500/733 (68%), Gaps = 21/733 (2%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
+++L+ FS I + +V+YD +++IING++ ++IS +IHYPRS P MWP L+Q+AK+G
Sbjct: 2 WSILLLFSC-IFSAASASVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKAKDG 60
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ I++YVFWNGHE SPGKYYF R++LVKFIK++QQA +++ LRIGP+V AE+N+GG
Sbjct: 61 GLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGF 120
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL Y+PG FR D EPFK MQKF IV MMK EKLF SQGGPIIL+Q+ENE+G E
Sbjct: 121 PVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQSQGGPIILSQIENEFGPVE 180
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G GK Y WAA+MAV + GVPWIMC+Q D PDPVI+TCN FYC+ F P+ PK
Sbjct: 181 WEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDYKPK 240
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTE W GW+ FGG P RP+ED+AFSVARF Q GGS NYYMYHGGTNFGRTAGGPF+
Sbjct: 241 MWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQSGGSFLNYYMYHGGTNFGRTAGGPFM 300
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TSYDY+AP+DEYGLPR PKWGHL++LH AIK CE AL++ + S LGS+QEA V+
Sbjct: 301 ATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKPCESALVSVDPSVTKLGSNQEAHVFKSE 360
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
S CAAFLAN D K V F Y LP WS+SILPDCK V+NTA V +QSS V+M P
Sbjct: 361 SD-CAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGSQSSQVQMTP 419
Query: 430 ENLQPSEASPDNGSKGLKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
+ G WQ F +E G + IN T+DTTDYLWY T I
Sbjct: 420 VH------------SGFPWQSFIEETTSSDETDTTYMDGLYEQINITRDTTDYLWYMTDI 467
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
+ +E FLKNG P+L I S GHAL+ F N +L G+ G+ +P + ++L++G N
Sbjct: 468 TIGSDEAFLKNGKSPLLTISSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNLRSGIN 527
Query: 549 EIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
++ALLS++VGL N G +E AG+ + + G NSGT D+S + WTYK GL+GE LG++
Sbjct: 528 KLALLSISVGLPNVGTHFETWNAGVLGPITLKGLNSGTWDMSGWKWTYKTGLKGEALGLH 587
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+++ WV K QPLTW+KA PPGD P+ LDM MGKG W+NG+ +GR+
Sbjct: 588 TVTGSSSVEWVEGPSMAKKQPLTWHKATFNAPPGDAPLALDMGSMGKGQIWINGQSVGRH 647
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
WP + S D C Y G ++ KC T CGEPSQRWYHIPRSW P+ N+LV+FEE
Sbjct: 648 WPGYIARGSCGD-----CSYAGTYDDKKCRTHCGEPSQRWYHIPRSWLTPTGNLLVVFEE 702
Query: 728 KGGDPTKITFSIR 740
GGDP+ I+ R
Sbjct: 703 WGGDPSGISLVER 715
>gi|449464526|ref|XP_004149980.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 854
Score = 803 bits (2074), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/721 (52%), Positives = 492/721 (68%), Gaps = 20/721 (2%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+VTYD ++++ING+R ++ S +IHYPRS P MW GL+Q+AKEGG++ +E+YVFWN HE S
Sbjct: 28 SVTYDRKAILINGQRRVLFSGSIHYPRSTPEMWEGLIQKAKEGGLDVVETYVFWNVHEPS 87
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PG Y F GR++LV+FIK IQ+A +Y LRIGP+V AE+N+GG PVWL Y+PG FR D E
Sbjct: 88 PGNYNFEGRYDLVRFIKTIQKAGLYANLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 147
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK MQ F IV +MK E LF SQGGPIIL+Q+ENEYG +G G+ Y WAAKM
Sbjct: 148 PFKRAMQGFTEKIVGLMKSENLFESQGGPIILSQIENEYGVQSKLFGAAGQNYMTWAAKM 207
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
AV GVPW+MC++ D PDPVINTCN FYCD F+P+ P P +WTE W GWF FGG
Sbjct: 208 AVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDAFSPNRPYKPTMWTEAWSGWFNEFGGPI 267
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
RP +D+AF+VA F QKGGS NYYMYHGGTNFGRTAGGPFITTSYDY+APIDEYGL R
Sbjct: 268 HQRPVQDLAFAVALFIQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIR 327
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDK 386
PK+GHLKELH A+K+CE AL++ + SLGSSQ+A VY SG CAAFL+N D +
Sbjct: 328 QPKYGHLKELHRAVKMCEKALVSADPIVTSLGSSQQAYVYTSESGNCAAFLSNYDTDSAA 387
Query: 387 TVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL 446
V+F N+ Y+LP WS+SILPDC+ VVFNTA V Q+S +EM+P N S L
Sbjct: 388 RVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSQLEMLPTN-----------SPML 436
Query: 447 KWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVL 505
W+ + E ++ SG ++ IN TKDT+DYLWY TS+ + E FL G P L
Sbjct: 437 LWESYNEDVSAEDDSTTMTASGLLEQINVTKDTSDYLWYITSVDIGSTESFLHGGELPTL 496
Query: 506 LIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPF 565
+++S GHA+H F N L GSA G+ + F Y ++ +AG+N IALLS+ VGL N G
Sbjct: 497 IVQSTGHAVHIFINGRLSGSAFGSRENRRFTYTGKVNFRAGRNTIALLSVAVGLPNVGGH 556
Query: 566 YEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVS-TMEP 623
+E GI V + G + G LDLS WTYK+GL+GE + + +P +++ W+ ++
Sbjct: 557 FETWNTGILGPVALHGLDQGKLDLSWAKWTYKVGLKGEAMNLVSPNGISSVEWMEGSLAA 616
Query: 624 PKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQ 683
QPLTW+K+ P GDEP+ +DM MGKG W+NG IGRYW + +
Sbjct: 617 QAPQPLTWHKSNFDAPEGDEPLAIDMRGMGKGQIWINGVSIGRYWTAYATGN------CD 670
Query: 684 ECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
+C+Y G F P KC GCG+P+QRWYH+PR+W KP +N+LV+FEE GG+PT I+ R ++
Sbjct: 671 KCNYAGTFRPPKCQQGCGQPTQRWYHVPRAWLKPKDNLLVVFEELGGNPTSISLVKRSVT 730
Query: 744 G 744
G
Sbjct: 731 G 731
>gi|312283357|dbj|BAJ34544.1| unnamed protein product [Thellungiella halophila]
Length = 856
Score = 802 bits (2072), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/720 (53%), Positives = 493/720 (68%), Gaps = 20/720 (2%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD ++L+ING+R ++ S +IHYPRS P MW GL+Q+AK+GG++ IE+YVFWN HE SP
Sbjct: 33 VTYDRKALLINGQRRILFSGSIHYPRSTPDMWEGLIQKAKDGGIDVIETYVFWNLHEPSP 92
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GKY F GR +LV+F+K I +A +Y LRIGP+V AE+N+GG PVWL Y+PG FR D EP
Sbjct: 93 GKYDFEGRNDLVRFVKAIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 152
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK M+ F IV++MK E LF SQGGPIIL+Q+ENEYG G G Y WAAKMA
Sbjct: 153 FKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQILGAEGHNYMTWAAKMA 212
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
+A GVPW+MC++ D PDPVI+TCN FYCD F P+ P P IWTE W GWF FGG
Sbjct: 213 IATETGVPWVMCKEDDAPDPVISTCNGFYCDSFAPNKPYKPTIWTEAWSGWFTEFGGPMH 272
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
HRP +D+AF+VARF QKGGS NYYMYHGGTNFGRTAGGPF+TTSYDY+APIDEYGL R
Sbjct: 273 HRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYGLIRQ 332
Query: 328 PKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKT 387
PK+GHLKELH AIK+CE AL++ + SLG+ Q+A VY+ SG C+AFLAN D ++
Sbjct: 333 PKYGHLKELHRAIKMCEKALVSTDPVVTSLGNKQQAHVYSSESGDCSAFLANYDTESAAR 392
Query: 388 VVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLK 447
V+F NV Y+LP WS+SILPDC+ VFNTA V Q+S +EM+P + + +
Sbjct: 393 VLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQTSQMEMLPTS-----------TGSFQ 441
Query: 448 WQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
WQ + E ++ + + F G ++ IN T+DT+DYLWY TS+ + E E FL G P L+
Sbjct: 442 WQSYLEDLSSLDDSSTFTTQGLLEQINVTRDTSDYLWYMTSVDIGETESFLHGGELPTLI 501
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
I+S GHA+H F N +L GSA G + F YK I+L +G N IALLS+ VGL N G +
Sbjct: 502 IQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYKGKINLHSGTNRIALLSVAVGLPNVGGHF 561
Query: 567 EWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV-STMEPP 624
E GI V + G + G DLS WTY++GL+GE + + P + W+ +++
Sbjct: 562 ESWNTGILGPVALHGLSQGKRDLSWQKWTYQVGLKGEAMNLAYPTNTPSFGWMDASLTVQ 621
Query: 625 KNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
K QPLTW+K P G+EP+ LDM MGKG W+NGE IGRYW + H
Sbjct: 622 KPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAFATGDCGH------ 675
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKISG 744
C Y G + P+KC +GCG+P+Q+WYH+PRSW KPS+N+LVIFEE GG+P+ ++ R +SG
Sbjct: 676 CSYTGTYKPNKCNSGCGQPTQKWYHVPRSWLKPSQNLLVIFEELGGNPSTVSLVKRSVSG 735
>gi|449525184|ref|XP_004169598.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 8-like [Cucumis
sativus]
Length = 844
Score = 802 bits (2072), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/724 (53%), Positives = 508/724 (70%), Gaps = 12/724 (1%)
Query: 21 TYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFW 80
++ A NVTYD R+L+I+G+R++++S ++HYPRS P MWPG++Q++K+GG++ IE+YVFW
Sbjct: 20 SFSLAVNVTYDHRALVIDGKRKVLVSGSLHYPRSTPEMWPGIIQKSKDGGLDVIETYVFW 79
Query: 81 NGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTV 140
N HE +Y F GR +LVKFIK++ A +Y+ +RIGP+V AE+NYGG PVWLH++PG
Sbjct: 80 NLHEPVRNQYDFEGRKDLVKFIKLVGAAGLYVHVRIGPYVCAEWNYGGFPVWLHFVPGVQ 139
Query: 141 FRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYA 200
FR D EPFK M++F IVD++K+EKL+ASQGGPIIL+Q+ENEYG +S +G K Y
Sbjct: 140 FRTDNEPFKAEMKRFTAKIVDVLKQEKLYASQGGPIILSQIENEYGNVQSSFGSAAKSYV 199
Query: 201 LWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFK 260
WAA MA + N GVPW+MC Q D PDP+INTCN FYCDQFTP+S + PK+WTENW GWF
Sbjct: 200 QWAATMATSLNTGVPWVMCNQPDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFL 259
Query: 261 TFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPID 320
+FGG P+RP ED+AF+VARF+Q GGS+ NYYMYHGGTNFGRT+GGPFI TSYDY+APID
Sbjct: 260 SFGGALPYRPVEDLAFAVARFYQTGGSLQNYYMYHGGTNFGRTSGGPFIATSYDYDAPID 319
Query: 321 EYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANM 380
EYGL R PKWGHL+++H AIK+CE AL++ + + SLG + EA VY S C+AFLAN+
Sbjct: 320 EYGLVRQPKWGHLRDVHKAIKMCEEALVSTDPAVTSLGPNLEATVYKSGS-QCSAFLANV 378
Query: 381 DDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPD 440
D ++DKTV F SYHLPAWSVSILPDCK VV NTA + + ++ + L+ ++ +
Sbjct: 379 DTQSDKTVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSVTTRPSFSNQPLKVDVSASE 438
Query: 441 NGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNG 500
G W E GI F G + INTT D +DYLWY+ S + +E +L NG
Sbjct: 439 AFDSGWSW--IDEPIGISKNNSFANLGLSEQINTTADKSDYLWYSLSTDIKGDEPYLANG 496
Query: 501 SRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQ 560
S VL ++S GH LH F N++L GS G+G PI+L GKN I LLS+TVGLQ
Sbjct: 497 SNTVLHVDSLGHVLHVFINKKLAGSGKGSGGSSKVSLDIPITLVPGKNTIDLLSLTVGLQ 556
Query: 561 NAGPFYEWVGAGITS-VKITGF-NSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV 618
N G F+E GAG+T VK+ N+ T+DLS+ WTY+IGL+GE LG+ + + W+
Sbjct: 557 NYGAFFELRGAGVTGPVKLENXKNNITVDLSSGQWTYQIGLEGEDLGLPS---GSTSQWL 613
Query: 619 STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPH 678
S PKN+PLTWYK P G +P+ LD GKG AW+NG IGRYWP
Sbjct: 614 SQPNLPKNKPLTWYKTTFDAPAGSDPLALDFTGFGKGEAWINGHSIGRYWPSYIASG--- 670
Query: 679 DECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFS 738
+C CDY+G ++ +KC+ CG+PSQ YH+P+SW KP+ N LV+FEE G DPT++TF+
Sbjct: 671 -QCTSYCDYKGAYSANKCLRNCGKPSQTLYHVPQSWLKPTGNTLVLFEEIGSDPTRLTFA 729
Query: 739 IRKI 742
+++
Sbjct: 730 SKQL 733
>gi|114217395|dbj|BAF31233.1| beta-D-galactosidase [Persea americana]
Length = 849
Score = 802 bits (2072), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/719 (53%), Positives = 495/719 (68%), Gaps = 20/719 (2%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+V+YD +++IING+R ++IS +IHYPRS P MWP L+Q+AK+GG++ I++YVFWNGHE S
Sbjct: 38 SVSYDHKAIIINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPS 97
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PG+YYF GR++LVKFIK++++A +Y+ LRIGP+ AE+N+GG PVWL YIPG FR D E
Sbjct: 98 PGEYYFEGRYDLVKFIKLVKEAGLYVHLRIGPYACAEWNFGGFPVWLKYIPGISFRTDNE 157
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK M F IVDMMK E+LF +QGGPIIL+Q+ENEYG E G G+ Y WAA M
Sbjct: 158 PFKTAMAGFTKKIVDMMKEEELFETQGGPIILSQIENEYGPVEWEIGAPGQAYTKWAANM 217
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
AV GVPW+MC+Q D PDP+INTCN YCD F+P+ P +WTE W WF FGG
Sbjct: 218 AVGLGTGVPWVMCKQDDAPDPIINTCNDHYCDWFSPNKNYKPTMWTEAWTSWFTAFGGPV 277
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
P+RP+ED+AF++A+F Q+GGS NYYMYHGGTNFGRTAGGPF+ TSYDY+APIDEYGL R
Sbjct: 278 PYRPAEDMAFAIAKFIQRGGSFINYYMYHGGTNFGRTAGGPFVATSYDYDAPIDEYGLIR 337
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDK 386
PKWGHLK+LH AIK+CE AL++G+ SLGSSQE+ V+ SG CAAFLAN D+K+
Sbjct: 338 QPKWGHLKDLHKAIKMCEAALVSGDPIVTSLGSSQESHVFKSESGDCAAFLANYDEKSFA 397
Query: 387 TVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL 446
V F+ + Y+LP WS+SILPDC VFNTA V AQ+S++ M N PD G
Sbjct: 398 KVAFQGMHYNLPPWSISILPDCVNTVFNTARVGAQTSSMTMTSVN-------PD----GF 446
Query: 447 KWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
W+ + E + +A G ++ IN T+D TDYLWYTT I ++ NE FLKNG PVL
Sbjct: 447 SWETYNEETASYDDASITMEGLLEQINVTRDVTDYLWYTTDITIDPNEGFLKNGEYPVLT 506
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
+ S GHALH F N EL G+ G+ +P Y + L AG N+I++LS+ VGL N G +
Sbjct: 507 VMSAGHALHIFINGELSGTVYGSVDNPKLTYTGSVKLLAGNNKISVLSIAVGLPNIGAHF 566
Query: 567 E-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPK 625
E W + V + G N G DLS +W+YKIGL+GE L +++ +++ W S + +
Sbjct: 567 ETWNTGVLGPVVLNGLNEGRRDLSWQNWSYKIGLKGEALQLHSLTGSSSVEWSSLIA--Q 624
Query: 626 NQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQEC 685
QPLTWYK P G+ P LDM MGKG W+NG+ IGRYWP + C EC
Sbjct: 625 KQPLTWYKTTFNAPEGNGPFALDMSMMGKGQIWINGQSIGRYWP----AYKAYGNC-GEC 679
Query: 686 DYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKISG 744
Y G++N KC+ CGE SQRWYH+P SW P+ N+LV+FEE GGDPT I+ +R+ +G
Sbjct: 680 SYTGRYNEKKCLANCGEASQRWYHVPSSWLYPTANLLVVFEEWGGDPTGISL-VRRTTG 737
>gi|224087947|ref|XP_002308268.1| predicted protein [Populus trichocarpa]
gi|222854244|gb|EEE91791.1| predicted protein [Populus trichocarpa]
Length = 838
Score = 802 bits (2072), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/728 (53%), Positives = 490/728 (67%), Gaps = 20/728 (2%)
Query: 18 SSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESY 77
SS +V+YD +++IING+R ++IS +IHYPRS P MWP L+Q+AK+GGV+ I++Y
Sbjct: 18 SSRISTVTASVSYDHKAVIINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGVDVIQTY 77
Query: 78 VFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIP 137
VFWNGHE SPG YYF R++LVKFIK++QQA +Y+ LRIGP++ AE+N+GG PVWL Y+P
Sbjct: 78 VFWNGHEPSPGNYYFEDRYDLVKFIKLVQQAGLYLHLRIGPYICAEWNFGGFPVWLKYVP 137
Query: 138 GTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGK 197
G FR D PFK MQKF IV MMK EKLF +QGGPIIL+Q+ENEYG E G GK
Sbjct: 138 GIEFRTDNGPFKAAMQKFTEKIVGMMKSEKLFENQGGPIILSQIENEYGPVEWEIGAPGK 197
Query: 198 RYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPG 257
Y WAA MAV GVPWIMC+Q D PDP+I+TCN FYC+ F P+ PKIWTE W G
Sbjct: 198 AYTKWAADMAVKLGTGVPWIMCKQEDAPDPMIDTCNGFYCENFKPNKDYKPKIWTEAWTG 257
Query: 258 WFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEA 317
W+ FGG PHRP+ED+AFSVARF Q GGS NYYMYHGGTNFGRTAGGPFI TSYDY+A
Sbjct: 258 WYTEFGGAVPHRPAEDMAFSVARFIQNGGSYINYYMYHGGTNFGRTAGGPFIATSYDYDA 317
Query: 318 PIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFL 377
P+DE+GLPR PKWGHL++LH AIKLCE AL++ + + SLGS+QEA V+ S CAAFL
Sbjct: 318 PLDEFGLPREPKWGHLRDLHKAIKLCEPALVSVDPTVTSLGSNQEAHVFKSKS-VCAAFL 376
Query: 378 ANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEA 437
AN D K V F N Y LP WSVSILPDCK V+NTA + +QSS ++MVP
Sbjct: 377 ANYDTKYSVKVTFGNGQYELPPWSVSILPDCKTAVYNTARLGSQSSQMKMVP-------- 428
Query: 438 SPDNGSKGLKWQVFKEIAGIWGEADFVK-SGFVDHINTTKDTTDYLWYTTSIIVNENEEF 496
S WQ + E + D +G + IN T+D TDYLWY T + ++ +E F
Sbjct: 429 ----ASSSFSWQSYNEETASADDDDTTTMNGLWEQINVTRDATDYLWYLTDVKIDADEGF 484
Query: 497 LKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMT 556
LK+G P+L I S GHALH F N +L G+A G ++P + I L G N+I+LLS+
Sbjct: 485 LKSGQNPLLTIFSAGHALHVFINGQLAGTAYGGLSNPKLTFSQNIKLTEGINKISLLSVA 544
Query: 557 VGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNI 615
VGL N G +E AG+ + + G N GT DLS W+YKIGL+GE L ++ ++
Sbjct: 545 VGLPNVGLHFETWNAGVLGPITLKGLNEGTRDLSGQKWSYKIGLKGESLSLHTASGSESV 604
Query: 616 NWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKS 675
WV + Q LTWYK P G++P+ LDM MGKG W+NG+ IGR+WP
Sbjct: 605 EWVEGSLLAQKQALTWYKTAFDAPQGNDPLALDMSSMGKGQMWINGQNIGRHWP----GY 660
Query: 676 SPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
H C +C+Y G F+ KC T CGEPSQRWYH+PRSW KPS N+L +FEE GGDPT I
Sbjct: 661 IAHGSC-GDCNYAGTFDDKKCRTNCGEPSQRWYHVPRSWLKPSGNLLAVFEEWGGDPTGI 719
Query: 736 TFSIRKIS 743
+F R +
Sbjct: 720 SFVKRTTA 727
>gi|449462081|ref|XP_004148770.1| PREDICTED: beta-galactosidase 8-like [Cucumis sativus]
Length = 844
Score = 802 bits (2071), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/724 (53%), Positives = 508/724 (70%), Gaps = 12/724 (1%)
Query: 21 TYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFW 80
++ A NVTYD R+L+I+G+R++++S ++HYPRS P MWPG++Q++K+GG++ IE+YVFW
Sbjct: 20 SFSLAVNVTYDHRALVIDGKRKVLVSGSLHYPRSTPEMWPGIIQKSKDGGLDVIETYVFW 79
Query: 81 NGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTV 140
N HE +Y F GR +LVKFIK++ A +Y+ +RIGP+V AE+NYGG PVWLH++PG
Sbjct: 80 NLHEPVRNQYDFEGRKDLVKFIKLVGAAGLYVHVRIGPYVCAEWNYGGFPVWLHFVPGVQ 139
Query: 141 FRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYA 200
FR D EPFK M++F IVD++K+EKL+ASQGGPIIL+Q+ENEYG +S +G K Y
Sbjct: 140 FRTDNEPFKAEMKRFTAKIVDVLKQEKLYASQGGPIILSQIENEYGNVQSSFGSAAKSYV 199
Query: 201 LWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFK 260
WAA MA + N GVPW+MC Q D PDP+INTCN FYCDQFTP+S + PK+WTENW GWF
Sbjct: 200 QWAATMATSLNTGVPWVMCNQPDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFL 259
Query: 261 TFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPID 320
+FGG P+RP ED+AF+VARF+Q GGS+ NYYMYHGGTNFGRT+GGPFI TSYDY+APID
Sbjct: 260 SFGGALPYRPVEDLAFAVARFYQTGGSLQNYYMYHGGTNFGRTSGGPFIATSYDYDAPID 319
Query: 321 EYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANM 380
EYGL R PKWGHL+++H AIK+CE AL++ + + SLG + EA VY S C+AFLAN+
Sbjct: 320 EYGLVRQPKWGHLRDVHKAIKMCEEALVSTDPAVTSLGPNLEATVYKSGS-QCSAFLANV 378
Query: 381 DDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPD 440
D ++DKTV F SYHLPAWSVSILPDCK VV NTA + + ++ + L+ ++ +
Sbjct: 379 DTQSDKTVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSVTTRPSFSNQPLKVDVSASE 438
Query: 441 NGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNG 500
G W E GI F G + INTT D +DYLWY+ S + +E +L NG
Sbjct: 439 AFDSGWSW--IDEPIGISKNNSFANLGLSEQINTTADKSDYLWYSLSTDIKGDEPYLANG 496
Query: 501 SRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQ 560
S VL ++S GH LH F N++L GS G+G PI+L GKN I LLS+TVGLQ
Sbjct: 497 SNTVLHVDSLGHVLHVFINKKLAGSGKGSGGSSKVSLDIPITLVPGKNTIDLLSLTVGLQ 556
Query: 561 NAGPFYEWVGAGITS-VKITG-FNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV 618
N G F+E GAG+T VK+ N+ T+DLS+ WTY+IGL+GE LG+ + + W+
Sbjct: 557 NYGAFFELRGAGVTGPVKLENQKNNITVDLSSGQWTYQIGLEGEDLGLPS---GSTSQWL 613
Query: 619 STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPH 678
S PKN+PLTWYK P G +P+ LD GKG AW+NG IGRYWP
Sbjct: 614 SQPNLPKNKPLTWYKTTFDAPAGSDPLALDFTGFGKGEAWINGHSIGRYWPSYIASG--- 670
Query: 679 DECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFS 738
+C CDY+G ++ +KC+ CG+PSQ YH+P+SW KP+ N LV+FEE G DPT++TF+
Sbjct: 671 -QCTSYCDYKGAYSANKCLRNCGKPSQTLYHVPQSWLKPTGNTLVLFEEIGSDPTRLTFA 729
Query: 739 IRKI 742
+++
Sbjct: 730 SKQL 733
>gi|157313304|gb|ABV32545.1| beta-galactosidase protein 2 [Prunus persica]
Length = 841
Score = 801 bits (2070), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/718 (53%), Positives = 496/718 (69%), Gaps = 16/718 (2%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
+V+YDS++++ING+R ++IS +IHYPRS P MWP L+Q+AKEGG++ I++YVFWNGHE
Sbjct: 26 ASVSYDSKAIVINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEP 85
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
SPGKYYF ++LVKFIK+IQQA +Y+ LRIGP+V AE+N+GG PVWL YIPG FR D
Sbjct: 86 SPGKYYFEDNYDLVKFIKLIQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGIQFRTDN 145
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
PFK MQ+F T IV+MMK E+LF SQGGPIIL+Q+ENEYG E G GK Y WAA
Sbjct: 146 GPFKAQMQRFTTKIVNMMKAERLFQSQGGPIILSQIENEYGPMEYELGAPGKVYTDWAAH 205
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGR 265
MA+ GVPW+MC+Q D PDP+IN CN FYCD F+P+ PK+WTE W GW+ FGG
Sbjct: 206 MALGLGTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKMWTEAWTGWYTEFGGA 265
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
P RP+ED+AFSVARF QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+AP+DEYGL
Sbjct: 266 VPSRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLL 325
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKND 385
R PKWGHLK+LH AIKLCE AL++ + + LG+ QEA V+ SGACAAFLAN + ++
Sbjct: 326 RQPKWGHLKDLHRAIKLCEPALVSADPTVTPLGTYQEAHVFKSKSGACAAFLANYNPRSF 385
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
V F N+ Y+LP WS+SILPDCK V+NTA V AQS+ ++M P +G+
Sbjct: 386 AKVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQSAQMKM--------PRVPLHGA-- 435
Query: 446 LKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVL 505
WQ + + + + F +G ++ INTT+D++DYLWY T + ++ NEEFL++G PVL
Sbjct: 436 FSWQAYNDETATYADTSFTTAGLLEQINTTRDSSDYLWYLTDVKIDPNEEFLRSGKYPVL 495
Query: 506 LIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPF 565
I S GHAL F N +L G++ G+ P + ++L+AG N+IALLS+ VGL N GP
Sbjct: 496 TILSAGHALRVFINGQLAGTSYGSLEFPKLTFSQGVNLRAGINQIALLSIAVGLPNVGPH 555
Query: 566 YEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPP 624
+E AG+ V + G N G DLS W+YK+GL+GE L +++ +++ W+
Sbjct: 556 FETWNAGVLGPVILNGLNEGRRDLSWQKWSYKVGLKGEALSLHSLSGSSSVEWIQGSLVT 615
Query: 625 KNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
+ QPLTWYK P G+ P+ LDM MGKG W+NG IGRYWP S
Sbjct: 616 RRQPLTWYKTTFNAPAGNSPLALDMGSMGKGQVWINGRSIGRYWPAYKASGS-----CGA 670
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
C+Y G ++ KC++ CGE SQRWYH+PR+W P+ N+LV+ EE GGDP I R+I
Sbjct: 671 CNYAGSYHEKKCLSNCGEASQRWYHVPRTWLNPTGNLLVVLEEWGGDPNGIFLVRREI 728
>gi|57232107|gb|AAW47739.1| beta-galactosidase [Prunus persica]
Length = 853
Score = 801 bits (2069), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/737 (51%), Positives = 495/737 (67%), Gaps = 23/737 (3%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
L+ F + C VTYD R+++ING+R ++IS +IHYPRS P MW L+Q+AK+G
Sbjct: 13 LGLVCFLGFQLVQC---TVTYDRRAIVINGQRRILISGSIHYPRSTPEMWEDLIQKAKDG 69
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ +E+YVFWN HE SPG Y F GR++LV+F+K IQ+A +Y LRIGP+V AE+N+GG
Sbjct: 70 GLDVVETYVFWNVHEPSPGNYNFKGRYDLVRFLKTIQKAGLYAHLRIGPYVCAEWNFGGF 129
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL Y+PG FR D EPFK MQ F IV +MK EKLF SQGGPIIL+Q+ENEYG
Sbjct: 130 PVWLKYVPGISFRTDNEPFKRAMQGFTEKIVGLMKSEKLFESQGGPIILSQIENEYGAQS 189
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
+G G Y WAA MAV GVPW+MC++ D PDPVINTCN FYCD F P+ P P
Sbjct: 190 KLFGAAGHNYMTWAANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDSFAPNKPYKPT 249
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
IWTE W GWF FGG RP +D+A++VARF QKGGS NYYMYHGGTNFGRTAGGPFI
Sbjct: 250 IWTEAWSGWFSEFGGPIHQRPVQDLAYAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFI 309
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TTSYDY+AP+DEYGL R PK+GHLKELH AIK+CE AL++ + SLG+ Q+A VY
Sbjct: 310 TTSYDYDAPLDEYGLIRQPKYGHLKELHRAIKMCERALVSADPIITSLGNFQQAYVYTSE 369
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
SG C+AFL+N D K+ V+F N+ Y+LP WS+SILPDC+ VVFNTA V Q+S + M+P
Sbjct: 370 SGDCSAFLSNHDSKSAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSQMGMLP 429
Query: 430 ENLQPSEASPDNGSKGLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
N+Q L W+ + E I + + G ++ IN T+D+TDYLWY TS+
Sbjct: 430 TNIQM-----------LSWESYDEDITSLDDSSTITAPGLLEQINVTRDSTDYLWYKTSV 478
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
+ +E FL+ G P L+++S GHA+H F N +L GS+ G F Y ++L AG N
Sbjct: 479 DIGSSESFLRGGELPTLIVQSTGHAVHIFINGQLSGSSFGTRESRRFTYTGKVNLHAGTN 538
Query: 549 EIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
IALLS+ VGL N G +E GI V + G + G DLS WTY++GL+GE + +
Sbjct: 539 RIALLSVAVGLPNVGGHFEAWNTGILGPVALHGLDQGKWDLSWQKWTYQVGLKGEAMNLV 598
Query: 608 NPGYRNNINWV-STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGR 666
+P ++++W+ ++ K QPLTW+K + P GDEP+ LDM MGKG W+NG+ IGR
Sbjct: 599 SPNSISSVDWMRGSLAAQKQQPLTWHKTLFNAPEGDEPLALDMEGMGKGQIWINGQSIGR 658
Query: 667 YWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFE 726
YW + + C C Y G F P KC GCG+P+QR YH+PRSW KP +N+LVIFE
Sbjct: 659 YW-----TAFANGNC-NGCSYAGGFRPPKCQVGCGQPTQRVYHVPRSWLKPMQNLLVIFE 712
Query: 727 EKGGDPTKITFSIRKIS 743
E GGDP++I+ R +S
Sbjct: 713 EFGGDPSRISLVKRSVS 729
>gi|297738667|emb|CBI27912.3| unnamed protein product [Vitis vinifera]
Length = 833
Score = 801 bits (2069), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/720 (53%), Positives = 490/720 (68%), Gaps = 20/720 (2%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
+ +VTYD RS IING+R+++IS +IHYPRS P MWP L+Q+AK+GG++ I++YVFWNGHE
Sbjct: 20 SASVTYDKRSFIINGQRKILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHE 79
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
S GKYYF GR++LV+FIK++Q A +Y+ LRIGP++ AE+N+GG PVWL Y+PG FR D
Sbjct: 80 PSRGKYYFEGRYDLVRFIKVVQAAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIAFRTD 139
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
PFK MQ F IVDMMK EKLF QGGPII++Q+ENEYG E G GK Y WAA
Sbjct: 140 NGPFKVAMQGFTQKIVDMMKSEKLFQPQGGPIIMSQIENEYGPVEYEIGAPGKAYTKWAA 199
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
+MAV GVPW+MC+Q D PDPVI+ CN FYC+ F P+ PK++TE W GW+ FGG
Sbjct: 200 EMAVQLGTGVPWVMCKQEDAPDPVIDACNGFYCENFFPNKDYKPKMFTEAWTGWYTEFGG 259
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
P+RP+ED+A+SVARF Q GS NYYMYHGGTNFGRTAGGPFI+TSYDY+APIDEYGL
Sbjct: 260 AIPNRPAEDLAYSVARFIQNRGSFINYYMYHGGTNFGRTAGGPFISTSYDYDAPIDEYGL 319
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
P PKWGHL++LH AIKLCE AL++ + + LG++ EA VY SGACAAFLAN D K+
Sbjct: 320 PSEPKWGHLRDLHKAIKLCEPALVSADPTVTYLGTNLEAHVYKAKSGACAAFLANYDPKS 379
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
V F N Y LP WSVSILPDCK VVFNTA + AQSS ++M P +
Sbjct: 380 SAKVTFGNTQYDLPPWSVSILPDCKNVVFNTARIGAQSSQMKMNPVST------------ 427
Query: 445 GLKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
WQ + +E A + E G ++ IN T+DTTDYLWY T + + +E FLK G P
Sbjct: 428 -FSWQSYNEETASAYTEDTTTMDGLLEQINITRDTTDYLWYMTEVHIKPDEGFLKTGQYP 486
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
VL + S GHALH F N +L G+ G ++P + + + L G N+I+LLS+ +GL N G
Sbjct: 487 VLTVMSAGHALHVFINGQLSGTVYGELSNPKVTFSDNVKLTVGTNKISLLSVAMGLPNVG 546
Query: 564 PFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
+E AG+ V + G N GT+D+S++ W+YKIGL+GE L + ++ WV
Sbjct: 547 LHFETWNAGVLGPVTLKGLNEGTVDMSSWKWSYKIGLKGEALNLQAITGSSSDEWVEGSL 606
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
+ QPLTWYK P G++P+ LDM MGKG W+NGE IGR+WP + H C
Sbjct: 607 LAQKQPLTWYKTTFNAPGGNDPLALDMSSMGKGQIWINGESIGRHWP----AYTAHGNC- 661
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
C+Y G FN KC TGCG PSQRWYH+PRSW KPS N L++FEE GG+P IT R +
Sbjct: 662 NGCNYAGIFNDKKCQTGCGGPSQRWYHVPRSWLKPSGNQLIVFEELGGNPAGITLVKRTM 721
>gi|225444920|ref|XP_002282132.1| PREDICTED: beta-galactosidase [Vitis vinifera]
Length = 836
Score = 801 bits (2068), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/720 (53%), Positives = 490/720 (68%), Gaps = 20/720 (2%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
+ +VTYD RS IING+R+++IS +IHYPRS P MWP L+Q+AK+GG++ I++YVFWNGHE
Sbjct: 23 SASVTYDKRSFIINGQRKILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHE 82
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
S GKYYF GR++LV+FIK++Q A +Y+ LRIGP++ AE+N+GG PVWL Y+PG FR D
Sbjct: 83 PSRGKYYFEGRYDLVRFIKVVQAAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIAFRTD 142
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
PFK MQ F IVDMMK EKLF QGGPII++Q+ENEYG E G GK Y WAA
Sbjct: 143 NGPFKVAMQGFTQKIVDMMKSEKLFQPQGGPIIMSQIENEYGPVEYEIGAPGKAYTKWAA 202
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
+MAV GVPW+MC+Q D PDPVI+ CN FYC+ F P+ PK++TE W GW+ FGG
Sbjct: 203 EMAVQLGTGVPWVMCKQEDAPDPVIDACNGFYCENFFPNKDYKPKMFTEAWTGWYTEFGG 262
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
P+RP+ED+A+SVARF Q GS NYYMYHGGTNFGRTAGGPFI+TSYDY+APIDEYGL
Sbjct: 263 AIPNRPAEDLAYSVARFIQNRGSFINYYMYHGGTNFGRTAGGPFISTSYDYDAPIDEYGL 322
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
P PKWGHL++LH AIKLCE AL++ + + LG++ EA VY SGACAAFLAN D K+
Sbjct: 323 PSEPKWGHLRDLHKAIKLCEPALVSADPTVTYLGTNLEAHVYKAKSGACAAFLANYDPKS 382
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
V F N Y LP WSVSILPDCK VVFNTA + AQSS ++M P +
Sbjct: 383 SAKVTFGNTQYDLPPWSVSILPDCKNVVFNTARIGAQSSQMKMNPVST------------ 430
Query: 445 GLKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
WQ + +E A + E G ++ IN T+DTTDYLWY T + + +E FLK G P
Sbjct: 431 -FSWQSYNEETASAYTEDTTTMDGLLEQINITRDTTDYLWYMTEVHIKPDEGFLKTGQYP 489
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
VL + S GHALH F N +L G+ G ++P + + + L G N+I+LLS+ +GL N G
Sbjct: 490 VLTVMSAGHALHVFINGQLSGTVYGELSNPKVTFSDNVKLTVGTNKISLLSVAMGLPNVG 549
Query: 564 PFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
+E AG+ V + G N GT+D+S++ W+YKIGL+GE L + ++ WV
Sbjct: 550 LHFETWNAGVLGPVTLKGLNEGTVDMSSWKWSYKIGLKGEALNLQAITGSSSDEWVEGSL 609
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
+ QPLTWYK P G++P+ LDM MGKG W+NGE IGR+WP + H C
Sbjct: 610 LAQKQPLTWYKTTFNAPGGNDPLALDMSSMGKGQIWINGESIGRHWP----AYTAHGNC- 664
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
C+Y G FN KC TGCG PSQRWYH+PRSW KPS N L++FEE GG+P IT R +
Sbjct: 665 NGCNYAGIFNDKKCQTGCGGPSQRWYHVPRSWLKPSGNQLIVFEELGGNPAGITLVKRTM 724
>gi|255538780|ref|XP_002510455.1| beta-galactosidase, putative [Ricinus communis]
gi|223551156|gb|EEF52642.1| beta-galactosidase, putative [Ricinus communis]
Length = 846
Score = 801 bits (2068), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/738 (51%), Positives = 502/738 (68%), Gaps = 24/738 (3%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
F +++ S + C VTYD +++IING+R ++IS +IHYPRS P MW L+Q+AK+G
Sbjct: 13 FLMVLLMGSKLVQC---TVTYDKKAIIINGQRRILISGSIHYPRSTPEMWEDLIQKAKDG 69
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ I++YVFW+ HE SPG Y F GR++LV+FIK +Q+ +Y LRIGP+V AE+N+GG
Sbjct: 70 GLDVIDTYVFWDVHETSPGNYNFDGRYDLVRFIKTVQKVGLYAHLRIGPYVCAEWNFGGF 129
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL Y+PG FR D EPFK MQ F IV MMK E LFASQGGPIIL+Q+ENEYG
Sbjct: 130 PVWLKYVPGISFRTDNEPFKAAMQGFTQKIVQMMKNENLFASQGGPIILSQIENEYGPES 189
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G G+ Y WAAKMAV + GVPW+MC++ D PDP+INTCN FYCD F P+ P P
Sbjct: 190 RALGAAGRSYINWAAKMAVGLDTGVPWVMCKEDDAPDPMINTCNGFYCDAFAPNKPYKPT 249
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTE W GWF FGG RP ED+AF+VARF QKGGS NYYMYHGGTNFGR+AGGPFI
Sbjct: 250 LWTEAWSGWFTEFGGPIHQRPVEDLAFAVARFIQKGGSYFNYYMYHGGTNFGRSAGGPFI 309
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TTSYDY+APIDEYGL R PK+GHLK LH AIKLCEHAL++ + S SLG+ Q+A V++ S
Sbjct: 310 TTSYDYDAPIDEYGLIREPKYGHLKALHKAIKLCEHALVSSDPSITSLGTYQQAHVFS-S 368
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
+CAAFLAN + K+ V+F N+ Y LP WS+SILPDC+ VVFNTA V AQ+ ++M+P
Sbjct: 369 GRSCAAFLANYNAKSAARVMFNNMHYDLPPWSISILPDCRNVVFNTARVGAQTLRMQMLP 428
Query: 430 ENLQPSEASPDNGSKGLKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
GS+ W+ + +EI+ + + G ++ IN T+DT+DYLWY TS+
Sbjct: 429 -----------TGSELFSWETYDEEISSLTDSSRITALGLLEQINVTRDTSDYLWYLTSV 477
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
++ +E FL+NG +P L ++S GH LH F N + GSA G + + P++L+AG N
Sbjct: 478 DISPSEAFLRNGQKPSLTVQSAGHGLHVFINGQFSGSAFGTRENRQLTFTGPVNLRAGTN 537
Query: 549 EIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
IALLS+ VGL N G YE G+ V + G N G DL+ W+Y++GL+GE + +
Sbjct: 538 RIALLSIAVGLPNVGLHYETWKTGVQGPVLLNGLNQGKKDLTWQKWSYQVGLKGEAMNLV 597
Query: 608 NPGYRNNINWVS-TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGR 666
+P ++++W+ ++ + Q L W+KA P G+EP+ LDM MGKG W+NG+ IGR
Sbjct: 598 SPNGVSSVDWIEGSLASSQGQALKWHKAYFDAPRGNEPLALDMRSMGKGQVWINGQSIGR 657
Query: 667 YWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFE 726
YW ++ +C C Y F P KC GCGEP+QRWYH+PRSW KP++N+LV+FE
Sbjct: 658 YWMAYAK-----GDC-NSCSYIWTFRPSKCQLGCGEPTQRWYHVPRSWLKPTKNLLVVFE 711
Query: 727 EKGGDPTKITFSIRKISG 744
E GGD +KI+ R I G
Sbjct: 712 ELGGDASKISLVKRSIEG 729
>gi|54111247|dbj|BAC10578.2| beta-galactosidase [Capsicum annuum]
Length = 724
Score = 800 bits (2067), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/731 (52%), Positives = 495/731 (67%), Gaps = 19/731 (2%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
L++ S+ NV+YD R+++ING+R+++IS +IHYPRS P MWP L+Q+AK+GG+
Sbjct: 9 LVVLVICSLDLLVKANVSYDDRAIVINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGL 68
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ IE+YVFWNGHE SPGKY F GR++LVKFIK++Q A +Y+ LRIGP++ AE+N+GG+PV
Sbjct: 69 DVIETYVFWNGHEPSPGKYNFEGRYDLVKFIKLVQGAGLYVNLRIGPYICAEWNFGGLPV 128
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL Y+ G FR D +PFK MQ F+ IV MMK EKLF QGGPII+AQ+ENEYG E
Sbjct: 129 WLKYVSGMEFRTDNQPFKVAMQGFVQKIVSMMKSEKLFEPQGGPIIMAQIENEYGPVEWE 188
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
G GK Y WAA+MAV VPWIMC+Q D PDPVI+TCN FYC+ F P+ P PK+W
Sbjct: 189 IGAPGKAYTKWAAQMAVGLKTDVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKPYKPKMW 248
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TE W GWF FGG P RP+EDIAFSVARF Q GS NYYMYHGGTNFGRT+ G FI T
Sbjct: 249 TEVWTGWFTKFGGPIPQRPAEDIAFSVARFVQNNGSYFNYYMYHGGTNFGRTSSGLFIAT 308
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
SYDY+APIDEYGL PK+GHL+ELH AIK CE AL++ + SLGS+QEA VY SG
Sbjct: 309 SYDYDAPIDEYGLLNEPKYGHLRELHKAIKQCEPALVSSYPTVTSLGSNQEAHVYRSKSG 368
Query: 372 ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPEN 431
ACAAFL+N D K V F+N+ Y LP WS+SILPDCK VV+NTA V +Q S+++M P
Sbjct: 369 ACAAFLSNYDAKYSVRVSFQNLPYDLPPWSISILPDCKTVVYNTAKVSSQGSSIKMTP-- 426
Query: 432 LQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKS-GFVDHINTTKDTTDYLWYTTSIIV 490
GL WQ + E ++D +++ G + N T+D++DYLWY T I +
Sbjct: 427 ----------AGGGLSWQSYNEDTPTADDSDTLRANGLWEQRNVTRDSSDYLWYMTDINI 476
Query: 491 NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
NE FLK+G P L + S GH LH F N +L G+ G +P Y + L AG N+I
Sbjct: 477 ASNEGFLKSGKDPYLTVMSAGHVLHVFVNGKLAGTVYGALDNPKLTYSGNVKLNAGINKI 536
Query: 551 ALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNP 609
+LLS++VGL N G Y+ AG+ V ++G N G+ DL+ W+YK+GL+GE L ++
Sbjct: 537 SLLSVSVGLPNVGVHYDTWNAGVLGPVTLSGLNEGSRDLAKQKWSYKVGLKGESLSLHTL 596
Query: 610 GYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWP 669
+++ WV + QPLTWYKA P G+EP+ LDM MGKG W+NGE +GR+WP
Sbjct: 597 SGSSSVEWVQGSLVARTQPLTWYKATFSAPGGNEPLALDMASMGKGQIWINGEGVGRHWP 656
Query: 670 RKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKG 729
+ + +C +C Y G FN KC T CG+PSQRWYH+PRSW K S N+LV+FEE G
Sbjct: 657 GYAAQG----DC-SKCSYAGTFNEKKCQTNCGQPSQRWYHVPRSWLKTSGNLLVVFEEWG 711
Query: 730 GDPTKITFSIR 740
GDPT I+ R
Sbjct: 712 GDPTGISLVRR 722
>gi|226503159|ref|NP_001146370.1| uncharacterized protein LOC100279948 precursor [Zea mays]
gi|219886857|gb|ACL53803.1| unknown [Zea mays]
gi|414865885|tpg|DAA44442.1| TPA: beta-galactosidase [Zea mays]
Length = 852
Score = 800 bits (2066), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/723 (54%), Positives = 502/723 (69%), Gaps = 15/723 (2%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A NVTYD R+L+I+G R +++S +IHYPRS P MWPGL+Q+AK+GG++ IE+YVFW+ HE
Sbjct: 27 AANVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKAKDGGLDVIETYVFWDIHE 86
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
G+Y F GR +L F+K + A +Y+ LRIGP+V AE+NYGG P+WLH+IPG FR D
Sbjct: 87 PVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTD 146
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK MQ+F +VD MK L+ASQGGPIIL+Q+ENEYG +S YG GK Y WAA
Sbjct: 147 NEPFKAEMQRFTAKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAPGKAYMRWAA 206
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MAV+ + GVPW+MCQQ D PDP+INTCN FYCDQFTP+S + PK+WTENW GWF +FGG
Sbjct: 207 GMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGWFLSFGG 266
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
P+RP ED+AF+VARF+Q+GG+ NYYMYHGGTN R++GGPFI TSYDY+APIDEYGL
Sbjct: 267 AVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAPIDEYGL 326
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
R PKWGHL+++H AIKLCE AL+ + S SLG + EA VY S CAAFLAN+D ++
Sbjct: 327 VRQPKWGHLRDVHKAIKLCEPALIATDPSYTSLGPNVEAAVYKVGS-VCAAFLANIDGQS 385
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG-- 442
DKTV F Y LPAWSVSILPDCK VV NTA + +Q++ EM L+ S + D
Sbjct: 386 DKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQINSQTTGSEM--RYLESSNVASDGSFV 443
Query: 443 ---SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKN 499
W E GI + K+G ++ INTT D +D+LWY+TSI V +E +L N
Sbjct: 444 TPELAVSDWSYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITVKGDEPYL-N 502
Query: 500 GSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGL 559
GS+ L + S GH L + N ++ GSA G+ + ++ PI L GKN+I LLS TVGL
Sbjct: 503 GSQSNLAVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSATVGL 562
Query: 560 QNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV 618
N G F++ VGAGIT VK++G N G LDLS+ WTY+IGL+GE L +Y+P + WV
Sbjct: 563 SNYGAFFDLVGAGITGPVKLSGLN-GALDLSSAEWTYQIGLRGEDLHLYDPS-EASPEWV 620
Query: 619 STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPH 678
S P N PL WYK P GD+P+ +D MGKG AW+NG+ IGRYWP +P
Sbjct: 621 SANAYPINHPLIWYKTKFTPPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWP---TNLAPQ 677
Query: 679 DECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFS 738
CV C+YRG ++ KC+ CG+PSQ YH+PRS+ +P N LV+FE GGDP+KI+F
Sbjct: 678 SGCVNSCNYRGAYSSSKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEHFGGDPSKISFV 737
Query: 739 IRK 741
+R+
Sbjct: 738 MRQ 740
>gi|316995681|emb|CAA07236.2| beta-galactosidase precursor [Cicer arietinum]
Length = 839
Score = 800 bits (2066), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/720 (53%), Positives = 498/720 (69%), Gaps = 16/720 (2%)
Query: 24 FAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGH 83
F +V+YD +++ ING+R++++S +IHYPRS P MWP L+Q+AKEGG++ I++YVFWNGH
Sbjct: 22 FEASVSYDYKAITINGQRKILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGH 81
Query: 84 ELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRN 143
E SPGKYYF G ++LVKFI+++QQA +Y+ LRIGP+ AE+N+GG PVWL YIPG FR
Sbjct: 82 EPSPGKYYFEGNYDLVKFIRLVQQAGLYVHLRIGPYACAEWNFGGFPVWLKYIPGISFRT 141
Query: 144 DTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
D PFK+ MQKF T IV++MK E+L+ SQGGPIIL+Q+ENEYG E G GK YA WA
Sbjct: 142 DNGPFKFQMQKFTTKIVNIMKAERLYESQGGPIILSQIENEYGPMEYELGAPGKAYAQWA 201
Query: 204 AKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFG 263
A MA+ GVPW+MC+Q D PDPVINTCN FYCD F+P+ PK+WTE W GWF FG
Sbjct: 202 AHMAIGLGTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTGFG 261
Query: 264 GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYG 323
G PHRP+ED+AFSVARF QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+AP+DEYG
Sbjct: 262 GTVPHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 321
Query: 324 LPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDK 383
L R PKWGHLK+LH AIKLCE AL++ + + LG+ QEA V+ SGACAAFLAN +
Sbjct: 322 LLRQPKWGHLKDLHRAIKLCEPALVSADPTVTRLGNYQEAHVFKSKSGACAAFLANYNPH 381
Query: 384 NDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGS 443
+ TV F N Y+LP WS+SILP+CK V+NTA + +QS+ ++M P +G
Sbjct: 382 SYSTVAFGNQHYNLPPWSISILPNCKHTVYNTARLGSQSAQMKMT--------RVPIHG- 432
Query: 444 KGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
GL W+ F E ++ F +G ++ IN T+D +DYLWY+T +++N +E + +NG P
Sbjct: 433 -GLSWKAFNEETTTTDDSSFTVTGLLEQINATRDLSDYLWYSTDVVINPDEGYFRNGKNP 491
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
VL + S GHALH F N +L G+ G+ P + ++L+AG N+I+LLS+ VGL N G
Sbjct: 492 VLTVLSAGHALHVFINGQLSGTVYGSLDFPKLTFSESVNLRAGVNKISLLSVAVGLPNVG 551
Query: 564 PFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
P +E AG+ + + G N G DL+ W+YK+GL+GE L +++ ++++W+
Sbjct: 552 PHFETWNAGVLGPITLNGLNEGRRDLTWQKWSYKVGLKGEDLSLHSLSGSSSVDWLQGYL 611
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
+ QPLTWYK P G P+ LDM MGKG WLNG+ +GRYWP S
Sbjct: 612 VSRRQPLTWYKTTFDAPAGVAPLALDMNSMGKGQVWLNGQSLGRYWPAYKATGS-----C 666
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
C+Y G +N KC T CGE SQRWYH+P SW KP+ N+LV+FEE GGDP + R I
Sbjct: 667 DYCNYAGTYNEKKCGTNCGEASQRWYHVPHSWLKPTGNLLVMFEELGGDPNGVFLVRRDI 726
>gi|61614851|gb|AAQ21371.2| beta-galactosidase [Sandersonia aurantiaca]
Length = 818
Score = 799 bits (2063), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/715 (53%), Positives = 494/715 (69%), Gaps = 11/715 (1%)
Query: 36 IINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGR 95
+I+G R ++IS +IHYPRS P MWP L+ ++K GG++ IE+YVFW+ HE G+Y F GR
Sbjct: 1 VIDGTRRVLISGSIHYPRSTPEMWPDLIDKSKSGGLDIIETYVFWDLHEPLQGQYDFQGR 60
Query: 96 FNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKF 155
+LV+FIK + +A +Y+ LRIGP+ AE+NYGG P+WLH+IPG FR D +PFK MQ+F
Sbjct: 61 KDLVRFIKTVGEAGLYVHLRIGPYACAEWNYGGFPLWLHFIPGIKFRTDNKPFKDEMQRF 120
Query: 156 MTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVP 215
T IVD+MK+E L+ASQGGPIIL+Q+ENEYG + YG K Y WAA MA + + GVP
Sbjct: 121 TTKIVDLMKQENLYASQGGPIILSQIENEYGNIDFAYGAAAKSYINWAASMATSLDTGVP 180
Query: 216 WIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIA 275
W+MCQQ D PDP+INTCN FYCDQF+P+S + PKIWTENW GWF +FGG P RP ED+A
Sbjct: 181 WVMCQQTDAPDPIINTCNGFYCDQFSPNSNNKPKIWTENWSGWFLSFGGPVPQRPVEDLA 240
Query: 276 FSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKE 335
F+VARFFQ+GG+ NYYMY G NFG T+GGPFI TSYDY+APIDEYG+ R PKWGHLKE
Sbjct: 241 FAVARFFQRGGTFQNYYMYTWGNNFGHTSGGPFIATSYDYDAPIDEYGITRQPKWGHLKE 300
Query: 336 LHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSY 395
LH AIKLCE AL+ + L LG + EA VY +SG CAAFLAN+ ++D TV F SY
Sbjct: 301 LHKAIKLCEPALVATDHHTLRLGPNLEAHVYKTASGVCAAFLANIGTQSDATVTFNGKSY 360
Query: 396 HLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL---KWQVFK 452
LPAWSVSILPDC+ VVFNTA + +Q+ EM N + + GS + W
Sbjct: 361 SLPAWSVSILPDCRTVVFNTAQINSQAIHSEMKYLNSESLTSDQQIGSSEVFQSDWSFVI 420
Query: 453 EIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGH 512
E GI K+G ++ INTT D +DYLWY+ SI ++ +E FL NG++ L ES GH
Sbjct: 421 EPVGISKSNAIRKTGLLEQINTTADVSDYLWYSISIAIDGDEPFLSNGTQSNLHAESLGH 480
Query: 513 ALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAG 572
LHAF N +L GS GN + ++ I L G N I LLS TVGLQN G F++ +GAG
Sbjct: 481 VLHAFVNGKLAGSGIGNSGNAKIIFEKLIMLTPGNNSIDLLSATVGLQNYGAFFDLMGAG 540
Query: 573 ITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY-NPGYRNNINWVSTMEPPKNQPLT 630
IT VK+ G N GTLDLS+ +WTY+IGL+GE L ++ N G + W+S PKNQPL
Sbjct: 541 ITGPVKLKGQN-GTLDLSSNAWTYQIGLKGEDLSLHENSG--DVSQWISESTLPKNQPLI 597
Query: 631 WYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGK 690
WYK P G++P+ +D MGKG AW+NG+ IGRYWP SSP + C C+YRG
Sbjct: 598 WYKTTFNAPDGNDPVAIDFTGMGKGEAWVNGQSIGRYWP---TYSSPQNGCSTACNYRGP 654
Query: 691 FNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKISGF 745
++ KCI CG+PSQ YH+PRS+ + N LV+FEE GGDPT+I+ + ++++
Sbjct: 655 YSASKCIKNCGKPSQILYHVPRSFIQSESNTLVLFEEMGGDPTQISLATKQMTSL 709
>gi|14970841|emb|CAC44501.1| beta-galactosidase [Fragaria x ananassa]
Length = 840
Score = 799 bits (2063), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/734 (53%), Positives = 504/734 (68%), Gaps = 30/734 (4%)
Query: 18 SSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESY 77
++ +YC V+YD R+L+I+G+R +++S +IHYPRS P MWP L+Q++K+GG++ IE+Y
Sbjct: 22 ATASYCT--TVSYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSKDGGLDVIETY 79
Query: 78 VFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIP 137
VFWN HE G+Y F GR +LV F+K + +A +Y+ LRIGP+V AE+NYGG P+WLH+IP
Sbjct: 80 VFWNLHEPVRGQYNFEGRNDLVGFVKAVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIP 139
Query: 138 GTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGK 197
G R D EP+K M +F IV+MMK EKL+ASQGGPIIL+Q+ENEYG + YG K
Sbjct: 140 GIKLRTDNEPYKAEMHRFTAKIVEMMKNEKLYASQGGPIILSQIENEYGNIDKAYGPAAK 199
Query: 198 RYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPG 257
Y WAA MAV+ + GVPW+MCQQ D P VINTCN FYCDQF+P+S S PKIWTENW G
Sbjct: 200 TYINWAANMAVSLDTGVPWVMCQQADAPSSVINTCNGFYCDQFSPNSNSTPKIWTENWSG 259
Query: 258 WFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEA 317
WF +FGG P RP ED+AF+VARF+Q+GG+ NYYMYHGGTNFGR++GGPFI TSYDY+A
Sbjct: 260 WFLSFGGAVPQRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSSGGPFIATSYDYDA 319
Query: 318 PIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFL 377
P+DEYGL R PKWGHLK++H AIKLCE A++ + + SLG + EA VY S C+AFL
Sbjct: 320 PLDEYGLLRQPKWGHLKDVHKAIKLCEPAMVATDPTISSLGQNIEAAVYKTGS-VCSAFL 378
Query: 378 ANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQ----SSTVEMVPENLQ 433
AN+D K+D TV F SY LPAWSVSILPDCK VV NTA + S T + + +++
Sbjct: 379 ANVDTKSDATVTFNGNSYQLPAWSVSILPDCKNVVINTAKINTATMVPSFTRQSISADVE 438
Query: 434 PSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNEN 493
P+EA G W E GI F + G ++ INTT D +DYLWY+TSI V
Sbjct: 439 PTEAV------GSGWSWINEPVGISKGDAFTRVGLLEQINTTADKSDYLWYSTSIDV--- 489
Query: 494 EEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALL 553
K G + L ++S GHALHAF N +L GS +GN + + P+ +GKN I LL
Sbjct: 490 ----KGGYKADLHVQSLGHALHAFVNGKLAGSGTGNSGNAKVSVEIPVEFASGKNTIDLL 545
Query: 554 SMTVGLQNAGPFYEWVGAGITS-VKITGFNSG-TLDLSTYSWTYKIGLQGEHLGIYNPGY 611
S+TVGLQN G F++ VGAGIT V++ G +G T+DLS+ WTY+IGL+GE + +
Sbjct: 546 SLTVGLQNYGAFFDLVGAGITGPVQLKGSANGTTIDLSSQQWTYQIGLKGEDEDLPS--- 602
Query: 612 RNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRK 671
+ W+S PKNQPLTWYK P G P+ LD MGKG AW+NG+ IGRYWP
Sbjct: 603 -GSSQWISQPTLPKNQPLTWYKTQFDAPGGSNPVALDFTGMGKGEAWVNGQSIGRYWP-- 659
Query: 672 SRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGD 731
+P C +C+YRG ++ DKC CG PSQ+ YH+PRSW K S N LV+FEE GGD
Sbjct: 660 -TNVAPKTGCT-DCNYRGAYSADKCRKNCGMPSQKLYHVPRSWMKSSGNTLVLFEEVGGD 717
Query: 732 PTKITFSIRKISGF 745
PT+++F+ R++
Sbjct: 718 PTQLSFATRQVESL 731
>gi|12583687|dbj|BAB21492.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 731
Score = 798 bits (2062), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/733 (53%), Positives = 500/733 (68%), Gaps = 21/733 (2%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
+++L+ FS I + +V+YD +++IING++ ++IS +IHYPRS P MWP L+Q+AK+G
Sbjct: 9 WSILLLFSC-IFSAASASVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKAKDG 67
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ I++YVFWNGHE SPGKYYF R++LVKFIK++QQA +++ LRIGP+V AE+N+GG
Sbjct: 68 GLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGF 127
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL Y+PG FR D EPFK MQKF IV MMK EKLF +QGGPIIL+Q+ENE+G E
Sbjct: 128 PVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQTQGGPIILSQIENEFGPVE 187
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G GK Y WAA+MAV + GVPWIMC+Q D PDPVI+TCN FYC+ F P+ PK
Sbjct: 188 WEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDYKPK 247
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTE W GW+ FGG P RP+ED+AFSVARF Q GGS NYYMYHGGTNFGRTAGGPF+
Sbjct: 248 MWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQSGGSFLNYYMYHGGTNFGRTAGGPFM 307
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TSYDY+AP+DEYGL R PKWGHL++LH AIK CE AL++ + S LGS+QEA V+
Sbjct: 308 ATSYDYDAPLDEYGLLREPKWGHLRDLHKAIKSCESALVSVDPSVTKLGSNQEAHVFKSE 367
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
S CAAFLAN D K V F Y LP WS+SILPDCK V++TA V +QSS V+M P
Sbjct: 368 SD-CAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYSTAKVGSQSSQVQMTP 426
Query: 430 ENLQPSEASPDNGSKGLKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
+ G WQ F +E G + IN T+DTTDYLWY T I
Sbjct: 427 VH------------SGFPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWYMTDI 474
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
+ +E FLKNG P+L I S GHAL+ F N +L G+ G+ +P + ++L++G N
Sbjct: 475 TIGSDEAFLKNGKSPLLTIFSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNLRSGIN 534
Query: 549 EIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
++ALLS++VGL N G +E AG+ + + G NSGT D+S + WTYK GL+GE LG++
Sbjct: 535 KLALLSISVGLPNVGTHFETWNAGVLGPITLKGLNSGTWDMSGWKWTYKTGLKGEALGLH 594
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+++ WV K QPLTWYKA PPGD P+ LDM MGKG W+NG+ +GR+
Sbjct: 595 TVTGSSSVEWVEGPSMAKKQPLTWYKATFNAPPGDAPLALDMGSMGKGQIWINGQSVGRH 654
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
WP + S D C Y G ++ KC T CGEPSQRWYHIPRSW P+ N+LV+FEE
Sbjct: 655 WPGYIARGSCGD-----CSYAGTYDDKKCRTHCGEPSQRWYHIPRSWLTPNGNLLVVFEE 709
Query: 728 KGGDPTKITFSIR 740
GGDP++I+ R
Sbjct: 710 WGGDPSRISLVER 722
>gi|350537913|ref|NP_001234317.1| TBG6 protein precursor [Solanum lycopersicum]
gi|7939625|gb|AAF70825.1|AF154424_1 putative beta-galactosidase [Solanum lycopersicum]
Length = 845
Score = 798 bits (2062), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/737 (51%), Positives = 506/737 (68%), Gaps = 23/737 (3%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
+ +++F SS + +C +VTYD ++++ING+R L+ S +IHYPRS P MW L+ +AKEG
Sbjct: 13 WCIVLFISSGLVHC---DVTYDRKAIVINGQRRLLFSGSIHYPRSTPEMWEDLINKAKEG 69
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ +E+YVFWN HE SPG Y F GR++LV+F+K IQ+A +Y LRIGP+V AE+N+GG
Sbjct: 70 GLDVVETYVFWNVHEPSPGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGF 129
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL Y+PG FR D EPFK M+ + IV++MK LF SQGGPIIL+Q+ENEYG
Sbjct: 130 PVWLKYVPGISFRADNEPFKNAMKGYAEKIVNLMKSHNLFESQGGPIILSQIENEYGPQA 189
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G G +Y+ WAA MAV + GVPW+MC++ D PDPVINTCN FYCD F P+ P P
Sbjct: 190 KVLGAPGHQYSTWAANMAVGLDTGVPWVMCKEEDAPDPVINTCNGFYCDNFFPNKPYKPA 249
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
IWTE W GWF FGG RP +D+AF+VA+F Q+GGS NYYMYHGGTNFGRTAGGPFI
Sbjct: 250 IWTEAWSGWFSEFGGPLHQRPVQDLAFAVAQFIQRGGSFVNYYMYHGGTNFGRTAGGPFI 309
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TTSYDY+APIDEYGL R PK+GHLKELH A+K+CE ++++ + + SLG+ Q+A VY+
Sbjct: 310 TTSYDYDAPIDEYGLIRQPKYGHLKELHRAVKMCEKSIVSADPAITSLGNLQQAYVYSSE 369
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
+G CAAFL+N D K+ V+F N+ Y+LP WS+SILPDC+ VVFNTA V Q+S +EM+P
Sbjct: 370 TGGCAAFLSNNDWKSAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSKMEMLP 429
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKS-GFVDHINTTKDTTDYLWYTTSI 488
N S+ L W+ + E ++ ++S G ++ IN T+DT+DYLWY TS+
Sbjct: 430 TN-----------SEMLSWETYSEDISALDDSSSIRSFGLLEQINVTRDTSDYLWYITSV 478
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
+ E FL G P L++E+ GHA+H F N +L GSA G + F +K ++L+AG N
Sbjct: 479 DIGSTESFLHGGELPTLIVETTGHAMHVFINGQLSGSAFGTRKNRRFVFKGKVNLRAGSN 538
Query: 549 EIALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
IALLS+ VGL N G +E W + V I G + G DLS WTY++GL+GE + +
Sbjct: 539 RIALLSVAVGLPNIGGHFETWSTGVLGPVAIQGLDHGKWDLSWAKWTYQVGLKGEAMNLV 598
Query: 608 NPGYRNNINWVS-TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGR 666
+ + ++W+ ++ K QPLTW+KA P GDEP+ LDM MGKG W+NG+ IGR
Sbjct: 599 STNGISAVDWMQGSLIAQKQQPLTWHKAYFNTPEGDEPLALDMSSMGKGQVWINGQSIGR 658
Query: 667 YWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFE 726
YW + +C C Y G F P KC GCGEP+Q+WYH+PRSW KP++N+LV+FE
Sbjct: 659 YWTAYAT-----GDC-NGCQYSGVFRPPKCQLGCGEPTQKWYHVPRSWLKPTQNLLVLFE 712
Query: 727 EKGGDPTKITFSIRKIS 743
E GGDPT+I+ R ++
Sbjct: 713 ELGGDPTRISLVKRSVT 729
>gi|225458151|ref|XP_002280715.1| PREDICTED: beta-galactosidase 3 [Vitis vinifera]
gi|302142564|emb|CBI19767.3| unnamed protein product [Vitis vinifera]
Length = 854
Score = 798 bits (2061), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/736 (51%), Positives = 494/736 (67%), Gaps = 23/736 (3%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
F L+F S + C +VTYD ++++ING+R ++IS +IHYPRS P MW L+++AK+G
Sbjct: 14 FVPLMFLHSQLIQC---SVTYDKKAIVINGQRRILISGSIHYPRSTPDMWEDLIRKAKDG 70
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ I++Y+FWN HE SPG Y F GR++LV+FIK +Q+ +Y+ LRIGP+V AE+N+GG
Sbjct: 71 GLDVIDTYIFWNVHEPSPGNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGF 130
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL ++PG FR + EPFK MQ F IV MMK E LFASQGGPIIL+Q+ENEYG
Sbjct: 131 PVWLKFVPGISFRTNNEPFKMAMQGFTQKIVHMMKSENLFASQGGPIILSQIENEYGPES 190
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G G Y WAAKMAV + GVPW+MC++ D PDPVIN CN FYCD F+P+ P P+
Sbjct: 191 RELGAAGHAYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDAFSPNKPYKPR 250
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
IWTE W GWF FGG RP +D+AF VARF Q GGS NYYMYHGGTNFGR+AGGPFI
Sbjct: 251 IWTEAWSGWFTEFGGTIHRRPVQDLAFGVARFIQNGGSFVNYYMYHGGTNFGRSAGGPFI 310
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TTSYDY+APIDEYGL R PK+GHLKELH AIKLCEHA+++ + + +SLGS Q+A V++
Sbjct: 311 TTSYDYDAPIDEYGLIRQPKYGHLKELHKAIKLCEHAVVSADPTVISLGSYQQAHVFSSG 370
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
G CAAFL+N + K+ V+F NV Y LPAWS+SILPDC+ VVFNTA V Q+S + M P
Sbjct: 371 RGNCAAFLSNYNPKSSARVIFNNVHYDLPAWSISILPDCRTVVFNTARVGVQTSHMRMFP 430
Query: 430 ENLQPSEASPDNGSKGLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
N SK W+ + E I+ + G ++ IN T+D+TDYLWY TS+
Sbjct: 431 TN-----------SKLHSWETYGEDISSLGSSGTMTAGGLLEQINITRDSTDYLWYMTSV 479
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
++ +E FL+ G P L ++SKGHA+H F N + GSA G + F Y +L AG N
Sbjct: 480 NIDSSESFLRRGQTPTLTVQSKGHAVHVFINGQYSGSAYGTRENRKFTYTGAANLHAGTN 539
Query: 549 EIALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
IALLS+ VGL N G +E W + V + G + G DLS W+Y++GL+GE + +
Sbjct: 540 RIALLSIAVGLPNVGLHFETWKTGILGPVLLHGIDQGKRDLSWQKWSYQVGLKGEAMNLV 599
Query: 608 NPGYRNNINWV-STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGR 666
+P + + WV ++ QPL WYKA P GDEP+ LDM MGKG W+NG+ IGR
Sbjct: 600 SPNGVSAVEWVRGSLAAQGQQPLKWYKAYFNAPEGDEPLALDMRSMGKGQVWINGQSIGR 659
Query: 667 YWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFE 726
YW ++ +C C Y G + P KC GCG P+QRWYH+PRSW KP++N+L+IFE
Sbjct: 660 YWMAYAK-----GDC-NVCSYSGTYRPPKCQHGCGHPTQRWYHVPRSWLKPTQNLLIIFE 713
Query: 727 EKGGDPTKITFSIRKI 742
E GGD +KI R +
Sbjct: 714 ELGGDASKIALMKRAM 729
>gi|15081596|gb|AAK81874.1| putative beta-galactosidase BG1 [Vitis vinifera]
Length = 854
Score = 798 bits (2061), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/736 (51%), Positives = 494/736 (67%), Gaps = 23/736 (3%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
F L+F S + C +VTYD ++++ING+R ++IS +IHYPRS P MW L+++AK+G
Sbjct: 14 FVPLMFLHSQLIQC---SVTYDKKAIVINGQRRILISGSIHYPRSTPDMWEDLIRKAKDG 70
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ I++Y+FWN HE SPG Y F GR++LV+FIK +Q+ +Y+ LRIGP+V AE+N+GG
Sbjct: 71 GLDVIDTYIFWNVHEPSPGNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGF 130
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL ++PG FR + EPFK MQ F IV MMK E LFASQGGPIIL+Q+ENEYG
Sbjct: 131 PVWLKFVPGISFRTNNEPFKMAMQGFTQKIVHMMKSENLFASQGGPIILSQIENEYGPES 190
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G G Y WAAKMAV + GVPW+MC++ D PDPVIN CN FYCD F+P+ P P+
Sbjct: 191 RELGAAGHAYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDAFSPNKPYKPR 250
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
IWTE W GWF FGG RP +D+AF VARF Q GGS NYYMYHGGTNFGR+AGGPFI
Sbjct: 251 IWTEAWSGWFTEFGGTIHRRPVQDLAFGVARFIQNGGSFVNYYMYHGGTNFGRSAGGPFI 310
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TTSYDY+APIDEYGL R PK+GHLKELH AIKLCEHA+++ + + +SLGS Q+A V++
Sbjct: 311 TTSYDYDAPIDEYGLIRQPKYGHLKELHKAIKLCEHAVVSADPTVISLGSYQQAHVFSSG 370
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
G CAAFL+N + K+ V+F NV Y LPAWS+SILPDC+ VVFNTA V Q+S + M P
Sbjct: 371 RGNCAAFLSNYNPKSSARVIFNNVHYDLPAWSISILPDCRTVVFNTARVGVQTSHMRMFP 430
Query: 430 ENLQPSEASPDNGSKGLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
N SK W+ + E I+ + G ++ IN T+D+TDYLWY TS+
Sbjct: 431 TN-----------SKLHSWETYGEDISSLGSSGTMTAGGLLEQINITRDSTDYLWYMTSV 479
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
++ +E FL+ G P L ++SKGHA+H F N + GSA G + F Y +L AG N
Sbjct: 480 NIDSSESFLRRGQTPTLTVQSKGHAVHVFINGQYSGSAYGTRENRKFTYTGAANLHAGTN 539
Query: 549 EIALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
IALLS+ VGL N G +E W + V + G + G DLS W+Y++GL+GE + +
Sbjct: 540 RIALLSIAVGLPNVGLHFETWKTGILGPVLLHGIDQGKRDLSWQKWSYQVGLKGEAMNLV 599
Query: 608 NPGYRNNINWV-STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGR 666
+P + + WV ++ QPL WYKA P GDEP+ LDM MGKG W+NG+ IGR
Sbjct: 600 SPNGVSAVEWVRGSLAAQGQQPLKWYKAYFNAPEGDEPLALDMRSMGKGQVWINGQSIGR 659
Query: 667 YWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFE 726
YW ++ +C C Y G + P KC GCG P+QRWYH+PRSW KP++N+L+IFE
Sbjct: 660 YWMAYAK-----GDC-NVCSYSGTYRPPKCQHGCGHPTQRWYHVPRSWLKPTQNLLIIFE 713
Query: 727 EKGGDPTKITFSIRKI 742
E GGD +KI R +
Sbjct: 714 ELGGDASKIALMKRAM 729
>gi|13936236|gb|AAK40304.1| beta-galactosidase [Capsicum annuum]
Length = 724
Score = 798 bits (2061), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/731 (52%), Positives = 495/731 (67%), Gaps = 19/731 (2%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
L++ S+ NV+YD R+++ING+R+++IS +IHYPRS P MWP L+++AK+GG+
Sbjct: 9 LVVLVICSLDLLVKANVSYDDRAIVINGKRKILISGSIHYPRSTPQMWPDLIEKAKDGGL 68
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ IE+YVFWNGHE SPGKY F GR++LVKFIK++Q A +Y+ LRIGP++ AE+N+GG+PV
Sbjct: 69 DVIETYVFWNGHEPSPGKYNFEGRYDLVKFIKLVQGAGLYVNLRIGPYICAEWNFGGLPV 128
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL Y+ G FR D +PFK MQ F+ IV MMK EKLF QGGPII+AQ+ENEYG E
Sbjct: 129 WLKYVSGMEFRTDNQPFKVAMQGFVQKIVSMMKSEKLFEPQGGPIIMAQIENEYGPVEWE 188
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
G GK Y WAA+MAV VPWIMC+Q D PDPVI+TCN FYC+ F P+ P PK+W
Sbjct: 189 IGAPGKAYTKWAAQMAVGLKTDVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKPYKPKMW 248
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TE W GWF FGG P RP+EDIAFSVARF Q GS NYYMYHGGTNFGRT+ G FI T
Sbjct: 249 TEVWTGWFTKFGGPIPQRPAEDIAFSVARFVQNNGSYFNYYMYHGGTNFGRTSSGLFIAT 308
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
SYDY+APIDEYGL PK+GHL+ELH AIK CE AL++ + SLGS+QEA VY SG
Sbjct: 309 SYDYDAPIDEYGLLNEPKYGHLRELHKAIKQCEPALVSSYPTVTSLGSNQEAHVYRSKSG 368
Query: 372 ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPEN 431
ACAAFL+N D K V F+N+ Y LP WS+SILPDCK VV+NTA V +Q S+++M P
Sbjct: 369 ACAAFLSNYDAKYSVRVSFQNLPYDLPPWSISILPDCKTVVYNTAKVSSQGSSIKMTP-- 426
Query: 432 LQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKS-GFVDHINTTKDTTDYLWYTTSIIV 490
GL WQ + E ++D +++ G + N T+D++DYLWY T + +
Sbjct: 427 ----------AGGGLSWQSYNEDTPTADDSDTLRANGLWEQRNVTRDSSDYLWYMTDVNI 476
Query: 491 NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
NE FLK+G P L + S GH LH F N +L G+ G +P Y + L AG N+I
Sbjct: 477 ASNEGFLKSGKDPYLTVMSAGHVLHVFVNGKLAGTVYGALDNPKLTYSGNVKLNAGINKI 536
Query: 551 ALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNP 609
+LLS++VGL N G Y+ AG+ V ++G N G+ DL+ W+YK+GL+GE L ++
Sbjct: 537 SLLSVSVGLPNVGVHYDTWNAGVLGPVTLSGLNEGSRDLAKQKWSYKVGLKGESLSLHTL 596
Query: 610 GYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWP 669
+++ WV + QPLTWYKA P G+EP+ LDM MGKG W+NGE +GR+WP
Sbjct: 597 SGSSSVEWVQGSLVARTQPLTWYKATFSAPGGNEPLALDMASMGKGQIWINGEGVGRHWP 656
Query: 670 RKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKG 729
+ + +C +C Y G FN KC T CG+PSQRWYH+PRSW K S N+LV+FEE G
Sbjct: 657 GYAAQG----DC-SKCSYAGTFNEKKCQTNCGQPSQRWYHVPRSWLKTSGNLLVVFEEWG 711
Query: 730 GDPTKITFSIR 740
GDPT I+ R
Sbjct: 712 GDPTGISLVRR 722
>gi|1352078|sp|P48981.1|BGAL_MALDO RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; AltName:
Full=Exo-(1-->4)-beta-D-galactanase; Flags: Precursor
gi|507278|gb|AAA62324.1| b-galactosidase-related protein; putative [Malus x domestica]
Length = 731
Score = 798 bits (2061), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/733 (53%), Positives = 499/733 (68%), Gaps = 21/733 (2%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
+++L+ FS I + +V+YD +++IING++ ++IS +IHYPRS P MWP L+Q+AK+G
Sbjct: 9 WSILLLFSC-IFSAASASVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKAKDG 67
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ I++YVFWNGHE SPG YYF R++LVKFIK++QQ +++ LRIGP+V AE+N+GG
Sbjct: 68 GLDVIQTYVFWNGHEPSPGNYYFEERYDLVKFIKLVQQEGLFVNLRIGPYVCAEWNFGGF 127
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL Y+PG FR D EPFK MQKF IV MMK EKLF +QGGPIIL+Q+ENE+G E
Sbjct: 128 PVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQTQGGPIILSQIENEFGPVE 187
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G GK Y WAA+MAV + GVPWIMC+Q D PDPVI+TCN FYC+ F P+ PK
Sbjct: 188 WEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDYKPK 247
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTE W GW+ FGG P RP+ED+AFSVARF Q GGS NYYMYHGGTNFGRTAGGPF+
Sbjct: 248 MWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQSGGSFLNYYMYHGGTNFGRTAGGPFM 307
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TSYDY+AP+DEYGLPR PKWGHL++LH AIK CE AL++ + S LGS+QEA V+
Sbjct: 308 ATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKSCESALVSVDPSVTKLGSNQEAHVFKSE 367
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
S CAAFLAN D K V F Y LP WS+SILPDCK V+NTA V +QSS V+M P
Sbjct: 368 SD-CAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGSQSSQVQMTP 426
Query: 430 ENLQPSEASPDNGSKGLKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
+ G WQ F +E G + IN T+DTTDYLWY T I
Sbjct: 427 VH------------SGFPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWYMTDI 474
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
+ +E FLKNG P+L I S GHAL+ F N +L G+ G+ +P + ++L++G N
Sbjct: 475 TIGSDEAFLKNGKSPLLTIFSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNLRSGIN 534
Query: 549 EIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
++ALLS++VGL N G +E AG+ + + G NSGT D+S + WTYK GL+GE LG++
Sbjct: 535 KLALLSISVGLPNVGTHFETWNAGVLGPITLKGLNSGTWDMSGWKWTYKTGLKGEALGLH 594
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+++ WV + QPLTWYKA PPGD P+ LDM MGKG W+NG+ +GR+
Sbjct: 595 TVTGSSSVEWVEGPSMAEKQPLTWYKATFNAPPGDAPLALDMGSMGKGQIWINGQSVGRH 654
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
WP + S D C Y G ++ KC T CGEPSQRWYHIPRSW P+ N+LV+FEE
Sbjct: 655 WPGYIARGSCGD-----CSYAGTYDDKKCRTHCGEPSQRWYHIPRSWLTPTGNLLVVFEE 709
Query: 728 KGGDPTKITFSIR 740
GGDP++I+ R
Sbjct: 710 WGGDPSRISLVER 722
>gi|357483611|ref|XP_003612092.1| Beta-galactosidase [Medicago truncatula]
gi|355513427|gb|AES95050.1| Beta-galactosidase [Medicago truncatula]
Length = 843
Score = 798 bits (2061), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/741 (52%), Positives = 498/741 (67%), Gaps = 20/741 (2%)
Query: 5 TPIAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQ 64
T ++ F L +F S ++ +VTYD +++IING+R ++ S +IHYPRS P MW L+
Sbjct: 4 TSVSKF-LFLFVSLTLFLAVYSDVTYDRKAIIINGQRRILFSGSIHYPRSTPDMWEDLIY 62
Query: 65 QAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEY 124
+AKEGG++ IE+YVFWN HE SPG Y F GR +LV+FI+ + +A +Y LRIGP+V AE+
Sbjct: 63 KAKEGGLDVIETYVFWNVHEPSPGNYNFEGRNDLVRFIQTVHKAGLYAHLRIGPYVCAEW 122
Query: 125 NYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENE 184
N+GG PVWL Y+PG FR D EPFK MQ F IV MMK E+L+ SQGGPIIL+Q+ENE
Sbjct: 123 NFGGFPVWLKYVPGISFRQDNEPFKKAMQGFTEKIVGMMKSERLYESQGGPIILSQIENE 182
Query: 185 YGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHS 244
YG G G Y WAAKMAV GVPWIMC++ D PDPVINTCN FYCD+FTP+
Sbjct: 183 YGAQSKMLGPVGYNYMSWAAKMAVEMGTGVPWIMCKEDDAPDPVINTCNGFYCDKFTPNK 242
Query: 245 PSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTA 304
P P +WTE W GWF FGG RP +D+AF+VARF QKGGS NYYMYHGGTNFGRTA
Sbjct: 243 PYKPTMWTEAWSGWFSEFGGPIHKRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTA 302
Query: 305 GGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEAD 364
GGPFITTSYDY+AP+DEYGL R PK+GHLKELH AIK+CE AL++ + SLG+ Q+A
Sbjct: 303 GGPFITTSYDYDAPLDEYGLIRQPKYGHLKELHKAIKMCEKALISTDPVVTSLGNFQQAY 362
Query: 365 VYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSST 424
VY SG C+AFL+N D K+ V+F N+ Y+LP WSVSILPDC+ VFNTA V Q+S
Sbjct: 363 VYTTESGDCSAFLSNYDSKSSARVMFNNMHYNLPPWSVSILPDCRNAVFNTAKVGVQTSQ 422
Query: 425 VEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWY 484
++M+P N S+ W+ F+E SG ++ IN T+DT+DYLWY
Sbjct: 423 MQMLPTN-----------SERFSWESFEEDTSSSSATTITASGLLEQINVTRDTSDYLWY 471
Query: 485 TTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLK 544
TS+ V +E FL G P L+++S GHA+H F N L GSA G F+Y ++L+
Sbjct: 472 ITSVDVGSSESFLHGGKLPSLIVQSTGHAVHVFINGRLSGSAYGTREDRRFRYTGDVNLR 531
Query: 545 AGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEH 603
AG N IALLS+ VGL N G +E GI V I G + G LDLS WTY++GL+GE
Sbjct: 532 AGTNTIALLSVAVGLPNVGGHFETWNTGILGPVVIHGLDKGKLDLSWQKWTYQVGLKGEA 591
Query: 604 LGIYNPGYRNNINWV-STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGE 662
+ + +P +++ W+ S + +NQPLTW+K P G+EP+ LDM MGKG W+NG
Sbjct: 592 MNLASPDGISSVEWMQSAVVVQRNQPLTWHKTFFDAPEGEEPLALDMDGMGKGQIWINGI 651
Query: 663 EIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENIL 722
IGRYW + S +C+Y G F P KC GCG+P+QRWYH+PRSW K + N+L
Sbjct: 652 SIGRYWTAIATGS------CNDCNYAGSFRPPKCQLGCGQPTQRWYHVPRSWLKQNHNLL 705
Query: 723 VIFEEKGGDPTKITFSIRKIS 743
V+FEE GGDP+KI+ + R +S
Sbjct: 706 VVFEELGGDPSKISLAKRSVS 726
>gi|147818153|emb|CAN78072.1| hypothetical protein VITISV_013292 [Vitis vinifera]
Length = 854
Score = 798 bits (2060), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/736 (51%), Positives = 494/736 (67%), Gaps = 23/736 (3%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
F L+F S + C +VTYD ++++ING+R ++IS +IHYPRS P MW L+++AK+G
Sbjct: 14 FVPLMFLHSQLIQC---SVTYDKKAIVINGQRRILISGSIHYPRSTPDMWEDLIRKAKDG 70
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ I++Y+FWN HE SPG Y F GR++LV+FIK +Q+ +Y+ LRIGP+V AE+N+GG
Sbjct: 71 GLDVIDTYIFWNVHEPSPGNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGF 130
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL ++PG FR + EPFK MQ F IV MMK E LFASQGGPIIL+Q+ENEYG
Sbjct: 131 PVWLKFVPGISFRTNNEPFKMAMQGFTQKIVHMMKSENLFASQGGPIILSQIENEYGPES 190
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G G Y WAAKMAV + GVPW+MC++ D PDPVIN CN FYCD F+P+ P P+
Sbjct: 191 RELGAAGHAYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDAFSPNKPYKPR 250
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
IWTE W GWF FGG RP +D+AF VARF Q GGS NYYMYHGGTNFGR+AGGPFI
Sbjct: 251 IWTEAWSGWFTEFGGTIHRRPVQDLAFGVARFIQNGGSFVNYYMYHGGTNFGRSAGGPFI 310
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TTSYDY+APIDEYGL R PK+GHLKELH AIKLCEHA+++ + + +SLGS Q+A V++
Sbjct: 311 TTSYDYDAPIDEYGLIRQPKYGHLKELHKAIKLCEHAVVSADPTVISLGSYQQAHVFSSG 370
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
G CAAFL+N + K+ V+F NV Y LPAWS+SILPDC+ VVFNTA V Q+S + M P
Sbjct: 371 RGNCAAFLSNYNPKSSARVIFNNVHYDLPAWSISILPDCRTVVFNTARVGVQTSHMRMFP 430
Query: 430 ENLQPSEASPDNGSKGLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
N SK W+ + E I+ + G ++ IN T+D+TDYLWY TS+
Sbjct: 431 TN-----------SKLHSWETYGEDISSLGSSGTMTAGGLLEQINITRDSTDYLWYMTSV 479
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
++ +E FL+ G P L ++SKGHA+H F N + GSA G + F Y +L AG N
Sbjct: 480 NIDSSESFLRRGQTPTLTVQSKGHAVHVFINGQYSGSAYGTRENRKFTYTGAANLHAGTN 539
Query: 549 EIALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
IALLS+ VGL N G +E W + V + G + G DLS W+Y++GL+GE + +
Sbjct: 540 RIALLSIAVGLPNVGLHFETWKTGILGPVLLHGIDQGKRDLSWQKWSYQVGLKGEAMNLV 599
Query: 608 NPGYRNNINWV-STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGR 666
+P + + WV ++ QPL WYKA P GDEP+ LDM MGKG W+NG+ IGR
Sbjct: 600 SPNGVSAVEWVRGSLAAQGQQPLKWYKAYFNAPEGDEPLALDMRSMGKGQVWINGQSIGR 659
Query: 667 YWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFE 726
YW ++ +C C Y G + P KC GCG P+QRWYH+PRSW KP++N+L+IFE
Sbjct: 660 YWMAYAK-----GDC-NVCSYSGTYRPPKCQHGCGHPTQRWYHVPRSWLKPTQNLLIIFE 713
Query: 727 EKGGDPTKITFSIRKI 742
E GGD +KI R +
Sbjct: 714 ELGGDASKIALMKRAM 729
>gi|357453869|ref|XP_003597215.1| Beta-galactosidase [Medicago truncatula]
gi|355486263|gb|AES67466.1| Beta-galactosidase [Medicago truncatula]
Length = 866
Score = 796 bits (2057), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/748 (52%), Positives = 502/748 (67%), Gaps = 40/748 (5%)
Query: 24 FAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGH 83
F NV YD R+L+I+G+R ++IS +IHYPRS P MWP L+Q++K+GG++ IE+YVFWN H
Sbjct: 18 FCTNVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSKDGGLDVIETYVFWNLH 77
Query: 84 ELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRN 143
E G+Y F GR +LVKF+K + +A +Y+ LRIGP+V AE+NYGG P+WLH+IPG FR
Sbjct: 78 EPVKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRT 137
Query: 144 DTEPFKYH--MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYAL 201
D EPFK M++F IVD+MK+EKL+ASQGGPIIL+Q+ENEYG +S YG GK Y
Sbjct: 138 DNEPFKVEAEMKRFTAKIVDLMKQEKLYASQGGPIILSQIENEYGDIDSAYGSAGKSYIN 197
Query: 202 WAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKT 261
WAAKMA + + GVPW+MCQQ D PD +INTCN FYCDQFTP+S + PK+WTENW W+
Sbjct: 198 WAAKMATSLDTGVPWVMCQQEDAPDSIINTCNGFYCDQFTPNSNTKPKMWTENWSAWYLL 257
Query: 262 FGGRDPHRPSEDIAFSVARFFQKGGSVHNYYM---------------------YHGGTNF 300
FGG PHRP ED+AF+VARFFQ+GG+ NYYM YHGGTNF
Sbjct: 258 FGGGFPHRPVEDLAFAVARFFQRGGTFQNYYMVLQPEMFFTSSIYYMVLFLRPYHGGTNF 317
Query: 301 GRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSS 360
R+ GGPFI TSYD++APIDEYG+ R PKWGHLK+LH A+KLCE AL+ E SLG +
Sbjct: 318 DRSTGGPFIATSYDFDAPIDEYGIIRQPKWGHLKDLHKAVKLCEEALIATEPKITSLGPN 377
Query: 361 QEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRA 420
EA VY S CAAFLAN+D K+DKTV F SYHLPAWSVSILPDCK VV NTA + +
Sbjct: 378 LEAAVYKTGS-VCAAFLANVDTKSDKTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKINS 436
Query: 421 QSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTD 480
S+ V ++ + +S + S KW E GI + F K+G ++ IN T D +D
Sbjct: 437 ASAISNFVTKSSKEDISSLETSSS--KWSWINEPVGISKDDIFSKTGLLEQINITADRSD 494
Query: 481 YLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNP 540
YLWY+ S+ + ++ GS+ VL IES GHALHAF N +L GS +GN P P
Sbjct: 495 YLWYSLSVDLKDDL-----GSQTVLHIESLGHALHAFVNGKLAGSHTGNKDKPKLNVDIP 549
Query: 541 ISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSG--TLDLSTYSWTYKI 597
I + G N+I LLS+TVGLQN G F++ GAGIT V + G +G TLDLS+ WTY++
Sbjct: 550 IKVIYGNNQIDLLSLTVGLQNYGAFFDRWGAGITGPVTLKGLKNGNNTLDLSSQKWTYQV 609
Query: 598 GLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLA 657
GL+GE LG+ + W S PKNQPL WYK P G P+ +D MGKG A
Sbjct: 610 GLKGEDLGLSSGSSE---GWNSQSTFPKNQPLIWYKTNFDAPSGSNPVAIDFTGMGKGEA 666
Query: 658 WLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKP 717
W+NG+ IGRYWP ++ +C C+YRG F KC CG+PSQ YH+PRS+ KP
Sbjct: 667 WVNGQSIGRYWPTYVASNA---DCTDSCNYRGPFTQTKCHMNCGKPSQTLYHVPRSFLKP 723
Query: 718 SENILVIFEEKGGDPTKITFSIRKISGF 745
+ N LV+FEE GGDPT+I F+ +++
Sbjct: 724 NGNTLVLFEENGGDPTQIAFATKQLESL 751
>gi|224094887|ref|XP_002310279.1| predicted protein [Populus trichocarpa]
gi|222853182|gb|EEE90729.1| predicted protein [Populus trichocarpa]
Length = 847
Score = 796 bits (2057), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/737 (51%), Positives = 494/737 (67%), Gaps = 24/737 (3%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
F ++ S + C +VTYD ++++ING+R ++ S +IHYPRS P MW L+Q+AK+G
Sbjct: 14 FLVVFLGCSELIQC---SVTYDRKAIMINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDG 70
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ IE+YVFWN HE +PG Y+F GR+++V+F+K IQ+A +Y LRIGP+V AE+N+GG
Sbjct: 71 GIDVIETYVFWNVHEPTPGNYHFEGRYDIVRFMKTIQRAGLYAHLRIGPYVCAEWNFGGF 130
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL Y+PG FR D EPFK MQ F IV +MK E LF SQGGPIIL+Q+ENEYG
Sbjct: 131 PVWLKYVPGISFRTDNEPFKRAMQGFTEKIVGLMKAENLFESQGGPIILSQIENEYGVQS 190
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
+G G Y WAA MA+ GVPW+MC++ D PDPVINTCN FYCD F P+ P P
Sbjct: 191 KLFGAAGYNYMTWAANMAIQTGTGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNKPYKPT 250
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
IWTE W GWF FGG RP +D+AF+VA+F QKGGS NYYM+HGGTNFGR+AGGPFI
Sbjct: 251 IWTEAWSGWFSEFGGTIHQRPVQDLAFAVAKFIQKGGSFINYYMFHGGTNFGRSAGGPFI 310
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TTSYDY+APIDEYGL R PK+GHLKELH +IK+CE AL++ + LG+ Q+ VY+
Sbjct: 311 TTSYDYDAPIDEYGLIRQPKYGHLKELHRSIKMCERALVSVDPIVTQLGTYQQVHVYSTE 370
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
SG CAAFLAN D K+ V+F N+ Y+LP WS+SILPDC+ VVFNTA V Q+S +EM+P
Sbjct: 371 SGDCAAFLANYDTKSAARVLFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSQMEMLP 430
Query: 430 ENLQPSEASPDNGSKGLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
N W+ + E I+ + + F +G ++ IN T+D +DYLWY TS+
Sbjct: 431 TN------------GIFSWESYDEDISSLDDSSTFTTAGLLEQINVTRDASDYLWYMTSV 478
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
+ +E FL G P L+I+S GHA+H F N +L GSA G + F Y ++L+ G N
Sbjct: 479 DIGSSESFLHGGELPTLIIQSTGHAVHIFINGQLSGSAFGTRENRRFTYTGKVNLRPGTN 538
Query: 549 EIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
IALLS+ VGL N G YE GI V + G + G DLS WTY++GL+GE + +
Sbjct: 539 RIALLSVAVGLPNVGGHYESWNTGILGPVALHGLDQGKWDLSWQKWTYQVGLKGEAMNLL 598
Query: 608 NPGYRNNINWV-STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGR 666
+P ++ W+ S++ + QPLTW+KA P GDEP+ LDM MGKG W+NG+ IGR
Sbjct: 599 SPDSVTSVEWMQSSLAAQRPQPLTWHKAYFNAPEGDEPLALDMEGMGKGQIWINGQSIGR 658
Query: 667 YWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFE 726
YW + + C Y G F P KC GCG+P+QRWYH+PRSW KP+ N+LV+FE
Sbjct: 659 YWTAYASGN------CNGCSYAGTFRPTKCQLGCGQPTQRWYHVPRSWLKPTNNLLVVFE 712
Query: 727 EKGGDPTKITFSIRKIS 743
E GGDP++I+ R ++
Sbjct: 713 ELGGDPSRISLVKRSLA 729
>gi|308550954|gb|ADO34791.1| beta-galactosidase STBG6 [Solanum lycopersicum]
Length = 845
Score = 796 bits (2055), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/737 (51%), Positives = 504/737 (68%), Gaps = 23/737 (3%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
+ +++F SS + +C +VTYD +++ING+R L+ S +IHYPRS P MW L+ +AKEG
Sbjct: 13 WCIVLFISSGLVHC---DVTYDREAIVINGQRRLLFSGSIHYPRSTPEMWEDLINKAKEG 69
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ +E+YVFWN HE SPG Y F GR++LV+F+K IQ+A +Y LRIGP+V AE+N+GG
Sbjct: 70 GLDVVETYVFWNVHEPSPGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGF 129
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL Y+PG FR D EPFK M+ + IV++MK LF SQGGPIIL+Q+ENEYG
Sbjct: 130 PVWLKYVPGISFRADNEPFKNAMKGYAEKIVNLMKSHNLFESQGGPIILSQIENEYGPQA 189
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G G +Y+ WAA MAV + GVPW+MC++ D PDPVINTCN FYCD F P+ P P
Sbjct: 190 KVLGAPGHQYSTWAANMAVGLDTGVPWVMCKEEDAPDPVINTCNGFYCDNFFPNKPYKPA 249
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
WTE W GWF FGG RP +D+AF+VA+F Q+GGS NYYMYHGGTNFGRTAGGPFI
Sbjct: 250 TWTEAWSGWFSEFGGPLHQRPVQDLAFAVAQFIQRGGSFVNYYMYHGGTNFGRTAGGPFI 309
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TTSYDY+APIDEYGL R PK+GHLKELH A+K+CE ++++ + + SLG+ Q+A VY+
Sbjct: 310 TTSYDYDAPIDEYGLIRQPKYGHLKELHRAVKMCEKSIVSADPAITSLGNLQQAYVYSSE 369
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
+G CAAFL+N D K+ V+F N+ Y+LP WS+SILPDC+ VVFNTA V Q+S +EM+P
Sbjct: 370 TGGCAAFLSNNDWKSAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSKMEMLP 429
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKS-GFVDHINTTKDTTDYLWYTTSI 488
N S+ L W+ + E ++ ++S G ++ IN T+DT+DYLWY TS+
Sbjct: 430 TN-----------SEMLSWETYSEDISALDDSSSIRSFGLLEQINVTRDTSDYLWYITSV 478
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
+ E FL G P L++E+ GHA+H F N +L GSA G + F +K ++L+AG N
Sbjct: 479 DIGSTESFLHGGELPTLIVETTGHAMHVFINGQLSGSAFGTRKNRRFVFKGKVNLRAGSN 538
Query: 549 EIALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
IALLS+ VGL N G +E W + V I G + G DLS WTY++GL+GE + +
Sbjct: 539 RIALLSVAVGLPNIGGHFETWSTGVLGPVAIQGLDHGKWDLSWAKWTYQVGLKGEAMNLV 598
Query: 608 NPGYRNNINWVS-TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGR 666
+ + ++W+ ++ K QPLTW+KA P GDEP+ LDM MGKG W+NG+ IGR
Sbjct: 599 STNGISAVDWMQGSLIAQKQQPLTWHKAYFNTPEGDEPLALDMSSMGKGQVWINGQSIGR 658
Query: 667 YWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFE 726
YW + +C C Y G F P KC GCGEP+Q+WYH+PRSW KP++N+LV+FE
Sbjct: 659 YWTAYAT-----GDC-NGCQYSGVFRPPKCQLGCGEPTQKWYHVPRSWLKPTQNLLVLFE 712
Query: 727 EKGGDPTKITFSIRKIS 743
E GGDPT+I+ R ++
Sbjct: 713 ELGGDPTRISLVKRSVT 729
>gi|359477955|ref|XP_003632046.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 10-like [Vitis
vinifera]
Length = 563
Score = 795 bits (2054), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/539 (69%), Positives = 440/539 (81%), Gaps = 2/539 (0%)
Query: 58 MWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIG 117
MW GLV+ AKEGG++ IE+YVF NGHELSP YYFGG ++L+KF+KI+QQA MY+IL IG
Sbjct: 1 MWSGLVKTAKEGGIDVIETYVFQNGHELSPSNYYFGGWYDLLKFVKIVQQAGMYLILHIG 60
Query: 118 PFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPII 177
PFVA E+N+GG+P+WLHY+P T+F+ +++PFKYHMQKFMTLIV++MK++KLFASQGGPII
Sbjct: 61 PFVATEWNFGGVPIWLHYVPRTIFQTNSKPFKYHMQKFMTLIVNIMKKDKLFASQGGPII 120
Query: 178 LAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYC 237
L QVENEYG + Y +GGK Y +WAA M ++ NIGVPWIMCQ + + DP+INTCNSFYC
Sbjct: 121 LTQVENEYGDTKRIYEDGGKPYVMWAANMVLSHNIGVPWIMCQXYASSDPMINTCNSFYC 180
Query: 238 DQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGG 297
DQFTP+SPS ++WTENWP WFKTFG + HR EDIAFSVA FF NYYMYHGG
Sbjct: 181 DQFTPNSPSKAQMWTENWPRWFKTFGASNSHRLHEDIAFSVALFFFPKSX--NYYMYHGG 238
Query: 298 TNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSL 357
TNFG T+GGPFITT+Y+Y APIDEYGL R PK GHLKEL AIK CEH LL GE NL L
Sbjct: 239 TNFGCTSGGPFITTTYNYNAPIDEYGLARLPKCGHLKELRRAIKSCEHVLLYGEPINLXL 298
Query: 358 GSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTAN 417
G SQE DVYADS G AAF++N+D+K DK +VF+N SYH+PAWSVSILPDCK VVFNTA
Sbjct: 299 GPSQEVDVYADSLGGYAAFISNVDEKEDKMIVFQNXSYHVPAWSVSILPDCKNVVFNTAK 358
Query: 418 VRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKD 477
V +Q S VEMV E+LQPS + KGL W+ F E AGIWGEADFVK+GFVDHINTTKD
Sbjct: 359 VVSQISQVEMVLEDLQPSLVPSNKDLKGLXWKTFVEKAGIWGEADFVKNGFVDHINTTKD 418
Query: 478 TTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKY 537
TTD LWYT SI V E+E FLK S+P+LL+ESKGHALHAF NQ+LQGSASGNG+H PFK+
Sbjct: 419 TTDXLWYTVSITVGESENFLKEISQPILLVESKGHALHAFVNQKLQGSASGNGSHSPFKF 478
Query: 538 KNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYK 596
+ PISLKAGKNEI +LSMTVGLQN PFYEWVGA +TSVKI G N+G +DLSTY W YK
Sbjct: 479 ECPISLKAGKNEIVVLSMTVGLQNEIPFYEWVGARLTSVKIKGLNNGIMDLSTYPWIYK 537
>gi|356502950|ref|XP_003520277.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 848
Score = 795 bits (2054), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/721 (52%), Positives = 487/721 (67%), Gaps = 20/721 (2%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
+VTYD ++L+ING+R ++ S +IHYPRS P MW L+ +AKEGG++ +E+YVFWN HE
Sbjct: 25 ASVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLILKAKEGGIDVVETYVFWNVHEP 84
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
SPG Y F GR++LV+F+K IQ+A +Y LRIGP+V AE+N+GG PVWL Y+PG FR D
Sbjct: 85 SPGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDN 144
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
EPFK MQ F IV MMK E+LF SQGGPIIL+Q+ENEYG G G+ Y WAAK
Sbjct: 145 EPFKRAMQGFTEKIVGMMKSERLFESQGGPIILSQIENEYGAQSKLQGAAGQNYVNWAAK 204
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGR 265
MAV GVPW+MC++ D PDPVINTCN FYCD+FTP+ P P IWTE W GWF FGG
Sbjct: 205 MAVEMGTGVPWVMCKEDDAPDPVINTCNGFYCDKFTPNRPYKPMIWTEAWSGWFTEFGGP 264
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
RP +D+AF+ ARF +GGS NYYMYHGGTNFGRTAGGPFI TSYDY+AP+DEYGL
Sbjct: 265 IHKRPVQDLAFAAARFIIRGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLI 324
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKND 385
R PK+GHLKELH AIK+CE AL++ + SLG Q+A VY SG CAAFL+N D K+
Sbjct: 325 RQPKYGHLKELHRAIKMCERALVSTDPIVTSLGEFQQAHVYTTESGDCAAFLSNYDSKSS 384
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
V+F N+ Y LP WSVSILPDC+ VVFNTA V Q+S ++M+P N Q
Sbjct: 385 ARVMFNNMHYSLPPWSVSILPDCRNVVFNTAKVGVQTSQMQMLPTNTQL----------- 433
Query: 446 LKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
W+ F E I + + G ++ IN TKD +DYLWY TS+ + +E FL+ G P
Sbjct: 434 FSWESFDEDIYSVDESSAITAPGLLEQINVTKDASDYLWYITSVDIGSSESFLRGGELPT 493
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
L+++S GHA+H F N +L GSA G + F Y ++L AG N IALLS+ +GL N G
Sbjct: 494 LIVQSTGHAVHVFINGQLSGSAFGTREYRRFTYTGKVNLLAGINRIALLSVAIGLPNVGE 553
Query: 565 FYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV-STME 622
+E W + V + G + G DLS WTY++GL+GE + + +P +++ W+ S +
Sbjct: 554 HFESWSTGILGPVALHGLDKGKWDLSGQKWTYQVGLKGEAMDLASPNGISSVAWMQSAIV 613
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
+NQPLTW+K P GDEP+ LDM MGKG W+NG+ IGRYW + +
Sbjct: 614 VQRNQPLTWHKTYFDAPEGDEPLALDMEGMGKGQIWINGQSIGRYWTAFATGN------C 667
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
+C+Y G F P KC GCG+P+QRWYH+PRSW K ++N+LVIFEE GG+P+KI+ R +
Sbjct: 668 NDCNYAGSFRPPKCQLGCGQPTQRWYHVPRSWLKTTQNLLVIFEELGGNPSKISLVKRSV 727
Query: 743 S 743
S
Sbjct: 728 S 728
>gi|356543466|ref|XP_003540181.1| PREDICTED: beta-galactosidase 8-like isoform 2 [Glycine max]
Length = 848
Score = 795 bits (2053), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 398/751 (52%), Positives = 502/751 (66%), Gaps = 26/751 (3%)
Query: 1 MKPRTPIAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWP 60
M+P + L+ + +C NV YD R+L+I+G+R ++IS +IHYPRS P MWP
Sbjct: 1 MRPAQIVLVLFWLLCIHTPKLFC--ANVEYDHRALVIDGKRRVLISGSIHYPRSTPEMWP 58
Query: 61 GLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFV 120
L+Q++K+GG++ IE+YVFWN HE G+Y F GR +LVKF+K + A +Y+ LRIGP+V
Sbjct: 59 DLIQKSKDGGLDVIETYVFWNLHEPVRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYV 118
Query: 121 AAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQ 180
AE+NYGG PVWLH+IPG FR D EPFK M++F IVDM+K+EKL+ASQGGP+IL+Q
Sbjct: 119 CAEWNYGGFPVWLHFIPGIKFRTDNEPFKAEMKRFTAKIVDMIKQEKLYASQGGPVILSQ 178
Query: 181 VENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF 240
+ENEYG ++ YG GK Y WAA MA + + GVPW+MC Q D PDP+INT N FY D+F
Sbjct: 179 IENEYGNIDTAYGAAGKSYIKWAATMATSLDTGVPWVMCLQADAPDPIINTWNGFYGDEF 238
Query: 241 TPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNF 300
TP+S + PK+WTENW GWF FGG P+RP ED+AF+VARFFQ+GG+ NYYMYHGGTNF
Sbjct: 239 TPNSNTKPKMWTENWSGWFLVFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF 298
Query: 301 GRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSS 360
R +GGPFI TSYDY+APIDEYG+ R PKWGHLKE+H AIKLCE AL+ + + SLG +
Sbjct: 299 DRASGGPFIATSYDYDAPIDEYGIIRQPKWGHLKEVHKAIKLCEEALIATDPTITSLGPN 358
Query: 361 QEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRA 420
EA VY S CAAFLAN+ K+D TV F SYHLPAWSVSILPDCK VV NTA + +
Sbjct: 359 LEAAVYKTGS-VCAAFLANVGTKSDVTVNFSGNSYHLPAWSVSILPDCKSVVLNTAKINS 417
Query: 421 QSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTD 480
S+ E+ + S + S G W E GI F ++G ++ INTT D +D
Sbjct: 418 ASAISSFTTESSKEDIGSSEASSTGWSW--ISEPVGISKTDSFSQTGLLEQINTTADKSD 475
Query: 481 YLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSAS--------GNGTH 532
YLWY+ SI + S+ VL IES GHALHAF N +L G N
Sbjct: 476 YLWYSLSIDYKADAS-----SQTVLHIESLGHALHAFINGKLAGKYKLKHSQLIICNSGK 530
Query: 533 PPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGF-NSGTLDLST 590
F P++L AGKN I LLS+TVGLQN G F++ G GIT V + GF N TLDLS+
Sbjct: 531 YKFTVDIPVTLVAGKNTIDLLSLTVGLQNYGAFFDTWGVGITGPVILKGFANGNTLDLSS 590
Query: 591 YSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDML 650
WTY++GLQGE LG+ + G N ST PKNQPLTWYK P G +P+ +D
Sbjct: 591 QKWTYQVGLQGEDLGL-SSGSSGQWNLQSTF--PKNQPLTWYKTTFSAPSGSDPVAIDFT 647
Query: 651 KMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHI 710
MGKG AW+NG+ IGRYWP + C C+YRG ++ KC C +PSQ YH+
Sbjct: 648 GMGKGEAWVNGQRIGRYWPTYVASDA---SCTDSCNYRGPYSASKCRKNCEKPSQTLYHV 704
Query: 711 PRSWFKPSENILVIFEEKGGDPTKITFSIRK 741
PRSW KPS NILV+FEE+GGDPT+I+F ++
Sbjct: 705 PRSWLKPSGNILVLFEERGGDPTQISFVTKQ 735
>gi|449489867|ref|XP_004158444.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cucumis
sativus]
Length = 725
Score = 795 bits (2052), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/736 (52%), Positives = 499/736 (67%), Gaps = 24/736 (3%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
L ++ SS+ +VTYD +++IINGRR ++IS +IHYPRS+P MWP L+Q+AK+G
Sbjct: 12 LGLFLWVCSSVM----ASVTYDHKAIIINGRRRILISGSIHYPRSIPQMWPDLIQKAKDG 67
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ IE+YVFWNGHE SPG+Y F R++LV+F+K++ QA +Y+ LRIGP+V AE+N+GG
Sbjct: 68 GLDVIETYVFWNGHEPSPGQYNFEDRYDLVRFVKLVHQAGLYVHLRIGPYVCAEWNFGGF 127
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL Y+PG FR D PFK MQKF IV +MK EKL+ SQGGPIIL+Q+ENEYG E
Sbjct: 128 PVWLKYVPGIAFRTDNGPFKAAMQKFTEKIVGLMKGEKLYESQGGPIILSQIENEYGPVE 187
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G GK Y WAA+MA+ N GVPW+MC+Q D PDPVI+TCN FYC+ F P+ PK
Sbjct: 188 WEIGAPGKSYTKWAAQMALGLNTGVPWVMCKQDDAPDPVIDTCNGFYCENFKPNKVYKPK 247
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTE W GWF FGG P+RP ED+A+SVARF Q GGS NYYMYHGGTNFGRTAGGPFI
Sbjct: 248 MWTEAWTGWFTEFGGPAPYRPVEDMAYSVARFIQNGGSFINYYMYHGGTNFGRTAGGPFI 307
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TSYDY+APIDEYGL R PKW HL++LH AIKLCE AL++ + + LGS+QEA V+
Sbjct: 308 ATSYDYDAPIDEYGLLREPKWSHLRDLHKAIKLCEPALVSVDPTVSYLGSNQEAHVFKTR 367
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
SG+CAAFLAN D + TV F N Y LP WSVSILPDCK V+FNTA V A +S +M P
Sbjct: 368 SGSCAAFLANYDASSSATVTFGNNQYDLPPWSVSILPDCKSVIFNTAKVGAPTSQPKMTP 427
Query: 430 ENLQPSEASPDNGSKGLKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
+ W + +E A + E +G V+ I+ T+D+TDYLWY T I
Sbjct: 428 VS-------------SFSWLSYNEETASAYTEDTTTMAGLVEQISVTRDSTDYLWYMTDI 474
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
++ NE FLK+G P+L + S GHALH F N +L G+ G + + ++L+AG N
Sbjct: 475 RIDPNEGFLKSGQWPLLTVFSAGHALHVFINGQLSGTTYGGSENYKLTFSKYVNLRAGIN 534
Query: 549 EIALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
++++LS+ VGL N G YE W + V + G N T D+S Y W+YKIGL+GE L ++
Sbjct: 535 KLSILSVAVGLPNGGLHYETWNTGVLGPVTLKGLNEDTRDMSGYKWSYKIGLKGEALNLH 594
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+ +++ WV+ + QPLTWYK P G+EP+ LDM MGKG W+NG+ IGR+
Sbjct: 595 SVSGSSSVEWVTGSLVAQKQPLTWYKTTFDSPKGNEPLALDMSSMGKGQIWINGQSIGRH 654
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
WP + K S +C+Y G FN KC + CGEPSQRWYH+PR+W K S N+LVIFEE
Sbjct: 655 WPAYTAKGS-----CGKCNYGGIFNEKKCHSXCGEPSQRWYHVPRAWLKSSGNVLVIFEE 709
Query: 728 KGGDPTKITFSIRKIS 743
GG+P I+ R IS
Sbjct: 710 WGGNPEGISLVKRSIS 725
>gi|61162201|dbj|BAD91082.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 854
Score = 794 bits (2051), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/720 (51%), Positives = 494/720 (68%), Gaps = 21/720 (2%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD ++++ING+R ++IS +IHYPRS P MW L+Q+AK+GG++ +E+YVFWN HE +P
Sbjct: 28 VTYDRKAIVINGQRRILISGSIHYPRSTPEMWEDLIQKAKDGGLDVVETYVFWNVHEPTP 87
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G Y F GR++LV+F+K IQ+A +Y LRIGP+V AE+N+GG PVWL Y+PG FR D EP
Sbjct: 88 GNYNFEGRYDLVRFLKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 147
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK MQ F IV +MK E LF SQGGPIIL+Q+ENEYG +G G Y WAA+MA
Sbjct: 148 FKRAMQGFTQKIVGLMKSESLFESQGGPIILSQIENEYGAQSKLFGAAGHNYITWAAEMA 207
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
V + GVPW+MC++ D PDPVINTCN FYCD F+P+ P P IWTE W GWF FGG
Sbjct: 208 VGLDTGVPWVMCKEEDAPDPVINTCNGFYCDSFSPNRPYKPTIWTETWSGWFTEFGGPIH 267
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
RP +D+A++VA F QKGGS NYYMYHGGTNFGRTAGGPFITTSYDY+AP+DEYGL R
Sbjct: 268 QRPVQDLAYAVATFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLIRQ 327
Query: 328 PKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKT 387
PK+GHLKELH AIK+CE AL++ + SLG+ Q+A VY SG C+AFL+N D K+
Sbjct: 328 PKYGHLKELHKAIKMCERALVSADPIITSLGNFQQAYVYTSESGDCSAFLSNHDSKSAAR 387
Query: 388 VVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLK 447
V+F N+ Y+LP WS+SILPDC+ VVFNTA V Q+S ++M+P N+ L
Sbjct: 388 VMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSQMQMLPTNI-----------PMLS 436
Query: 448 WQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
W+ + E + + + G ++ IN T+D+TDYLWY TS+ ++ +E FL G P L+
Sbjct: 437 WESYDEDLTSMDDSSTMTAPGLLEQINVTRDSTDYLWYITSVDIDSSESFLHGGELPTLI 496
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
++S GHA+H F N +L GSA G F Y ++L+AG N+IALLS+ VGL N G +
Sbjct: 497 VQSTGHAVHIFINGQLTGSAFGTRESRRFTYTGKVNLRAGTNKIALLSVAVGLPNVGGHF 556
Query: 567 EWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV--STMEP 623
E GI V + G N G DLS WTY++GL+GE + + + +++ W+ S +
Sbjct: 557 EAWNTGILGPVALHGLNQGKWDLSWQKWTYQVGLKGEAMNLVSQNAFSSVEWISGSLIAQ 616
Query: 624 PKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQ 683
K QPLTW+K + +P G EP+ LDM MGKG W+NG+ IGRYW + + C
Sbjct: 617 KKQQPLTWHKTIFNEPEGSEPLALDMEGMGKGQIWINGQSIGRYW-----TAFANGNC-N 670
Query: 684 ECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
C Y G F P KC +GCG+P+QR+YH+PRSW KP++N+LV+FEE GGDP++I+ R +S
Sbjct: 671 GCSYAGGFRPTKCQSGCGKPTQRYYHVPRSWLKPTQNLLVLFEELGGDPSRISLVKRAVS 730
>gi|255546097|ref|XP_002514108.1| beta-galactosidase, putative [Ricinus communis]
gi|223546564|gb|EEF48062.1| beta-galactosidase, putative [Ricinus communis]
Length = 840
Score = 794 bits (2050), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/721 (53%), Positives = 492/721 (68%), Gaps = 21/721 (2%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
V+YD R++ ING+R ++IS +IHYPRS P MWP L+Q+AK+GG++ I++YVFWNGHE
Sbjct: 28 ATVSYDHRAITINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEP 87
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
SPG YYF R++LVKFIK++Q A +Y+ LRIGP++ AE+N+GG PVWL Y+PG FR D
Sbjct: 88 SPGNYYFEDRYDLVKFIKVVQAAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIEFRTDN 147
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
PFK MQKF IV MMK EKLF SQGGPIIL+Q+ENE+G E G GK Y WAA
Sbjct: 148 GPFKAAMQKFTEKIVSMMKSEKLFESQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAAD 207
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGR 265
MAV GVPW+MC+Q D PDPVINTCN FYC+ F P+ PK+WTENW GW+ FGG
Sbjct: 208 MAVKLGTGVPWVMCKQDDAPDPVINTCNGFYCENFKPNKDYKPKLWTENWTGWYTEFGGA 267
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
P+RP+ED+AFSVARF Q GGS NYYMYHGGTNFGRT+ G FI TSYDY+AP+DEYGL
Sbjct: 268 VPYRPAEDLAFSVARFIQNGGSFMNYYMYHGGTNFGRTSAGLFIATSYDYDAPLDEYGLT 327
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKND 385
R+PKWGHL++LH AIKLCE AL++ + + SLGS+QEA V+ S +CAAFLAN D K
Sbjct: 328 RDPKWGHLRDLHKAIKLCEPALVSVDPTVKSLGSNQEAHVF-QSKSSCAAFLANYDTKYS 386
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
V F N Y LP WS+SILPDCK VFNTA + AQSS ++M P
Sbjct: 387 VKVTFGNGQYDLPPWSISILPDCKTAVFNTARLGAQSSQMKMTPVG------------GA 434
Query: 446 LKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
L WQ + +E A + + G + IN T+D +DYLWY T++ ++ +E FLKNG PV
Sbjct: 435 LSWQSYIEEAATGYTDDTTTLEGLWEQINVTRDASDYLWYMTNVNIDSDEGFLKNGDSPV 494
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
L I S GH+LH F N +L G+ G+ +P + + L AG N+I+LLS+ VGL N G
Sbjct: 495 LTIFSAGHSLHVFINGQLAGTVYGSLENPKLTFSQNVKLTAGINKISLLSVAVGLPNVGV 554
Query: 565 FYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEP 623
+E AGI V + G N GT DLS + W+YKIGL+GE L ++ +++ WV
Sbjct: 555 HFEKWNAGILGPVTLKGLNEGTRDLSGWKWSYKIGLKGEALSLHTVTGSSSVEWVEGSLS 614
Query: 624 PKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQ 683
K QPLTWYKA P G++P+ LDM MGKG W+NG+ IGR+WP + + S
Sbjct: 615 AKKQPLTWYKATFDAPEGNDPVALDMSSMGKGQIWVNGQSIGRHWPAYTARGS-----CS 669
Query: 684 ECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
C+Y G ++ KC + CGEPSQRWYH+PRSW PS N+LV+FEE GG+P+ I+ +++ +
Sbjct: 670 ACNYAGTYDDKKCRSNCGEPSQRWYHVPRSWLNPSGNLLVVFEEWGGEPSGISL-VKRTT 728
Query: 744 G 744
G
Sbjct: 729 G 729
>gi|61162208|dbj|BAD91085.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 848
Score = 793 bits (2048), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/737 (50%), Positives = 496/737 (67%), Gaps = 20/737 (2%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
I +SS NV YD ++L+I+G+R L+ S +IHYPRS P MW GL+Q+AK+G
Sbjct: 13 LCCCIVWSSVYVEVTKCNVVYDRKALVIDGQRRLLFSGSIHYPRSTPEMWEGLIQKAKDG 72
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ I++YVFWN HE SPG Y F GR +LV+FIK + +A +Y+ LRIGP++ +E+N+GG
Sbjct: 73 GLDAIDTYVFWNLHEPSPGNYNFEGRNDLVRFIKTVHKAGLYVHLRIGPYICSEWNFGGF 132
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL ++PG FR D EPFK MQKF +V +MK EKLF SQGGPIIL+Q+ENEY
Sbjct: 133 PVWLKFVPGISFRTDNEPFKSAMQKFTQKVVQLMKNEKLFESQGGPIILSQIENEYEPES 192
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
+G G Y WAAKMAV GVPW+MC++ D PDPVINTCN FYCD F+P+ P P
Sbjct: 193 KAFGASGYAYMTWAAKMAVGMGTGVPWVMCKEDDAPDPVINTCNGFYCDYFSPNKPYKPT 252
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTE W GWF FGG RP ED+ F+VARF QKGGS NYYMYHGGTNFGRTAGGPFI
Sbjct: 253 MWTEAWSGWFTEFGGPIYQRPVEDLTFAVARFIQKGGSFINYYMYHGGTNFGRTAGGPFI 312
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TTSYDY+APIDEYGL R PK+GHLKELH A+KLCE ALLN + + +LGS ++A V++
Sbjct: 313 TTSYDYDAPIDEYGLIRRPKYGHLKELHKAVKLCELALLNADPTVTTLGSYEQAHVFSSK 372
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
SG+ A FL+N + K+ V F N+++HLP WS+SILPDCK V FNTA V Q+S +++
Sbjct: 373 SGSGAVFLSNFNTKSATKVTFNNMNFHLPPWSISILPDCKNVAFNTARVGVQTSQTQLLR 432
Query: 430 ENLQPSEASPDNGSKGLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
N S+ W +F E ++ + G+ +G +D +N T+D++DYLWYTTS+
Sbjct: 433 TN-----------SELHSWGIFNEDVSSVAGDTTITVTGLLDQLNITRDSSDYLWYTTSV 481
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
++ +E FL G P L ++S G A+H F N +L GSASG H F + ++L AG N
Sbjct: 482 DIDPSESFLGGGQHPSLTVQSAGDAMHVFINDQLSGSASGTREHRRFTFTGNVNLHAGLN 541
Query: 549 EIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
+I+LLS+ VGL N GP +E G+ V + G + GT DLS W+Y++GL+GE +
Sbjct: 542 KISLLSIAVGLANNGPHFETRNTGVLGPVALHGLDHGTRDLSWQKWSYQVGLKGEATNLD 601
Query: 608 NPGYRNNINWVS-TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGR 666
+P + ++W++ ++ K QPLTWYKA +P GDEP+ LDM MGKG W+NG+ IGR
Sbjct: 602 SPNSISAVDWMTGSLVAQKQQPLTWYKAYFDEPNGDEPLALDMGSMGKGQVWINGQSIGR 661
Query: 667 YWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFE 726
YW + D C Y G F P KC GC P+Q+WYH+PRSW KPS+N+LV+FE
Sbjct: 662 YWTIYA------DSDCSACTYSGTFRPKKCQFGCQHPTQQWYHVPRSWLKPSKNLLVVFE 715
Query: 727 EKGGDPTKITFSIRKIS 743
E GGD +K+ + ++
Sbjct: 716 EIGGDVSKVALVKKSVT 732
>gi|115437888|ref|NP_001043405.1| Os01g0580200 [Oryza sativa Japonica Group]
gi|75272679|sp|Q8W0A1.1|BGAL2_ORYSJ RecName: Full=Beta-galactosidase 2; Short=Lactase 2; Flags:
Precursor
gi|18461259|dbj|BAB84455.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113532936|dbj|BAF05319.1| Os01g0580200 [Oryza sativa Japonica Group]
gi|215736924|dbj|BAG95853.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 827
Score = 792 bits (2046), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/715 (53%), Positives = 488/715 (68%), Gaps = 22/715 (3%)
Query: 29 TYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPG 88
TYD +++++NG+R ++IS +IHYPRS P MWP L+++AK+GG++ +++YVFWNGHE SPG
Sbjct: 27 TYDRKAVVVNGQRRILISGSIHYPRSTPEMWPDLIEKAKDGGLDVVQTYVFWNGHEPSPG 86
Query: 89 KYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPF 148
+YYF GR++LV FIK+++QA +Y+ LRIGP+V AE+N+GG PVWL Y+PG FR D EPF
Sbjct: 87 QYYFEGRYDLVHFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPF 146
Query: 149 KYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAV 208
K MQKF T IV+MMK E LF QGGPIIL+Q+ENE+G E GE K YA WAA MAV
Sbjct: 147 KAEMQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 206
Query: 209 AQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPH 268
A N VPWIMC++ D PDP+INTCN FYCD F+P+ P P +WTE W W+ FG PH
Sbjct: 207 ALNTSVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTAWYTGFGIPVPH 266
Query: 269 RPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNP 328
RP ED+A+ VA+F QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+APIDEYGL R P
Sbjct: 267 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLREP 326
Query: 329 KWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTV 388
KWGHLK+LH AIKLCE AL+ G+ SLG++Q++ V+ S+GACAAFL N D + V
Sbjct: 327 KWGHLKQLHKAIKLCEPALVAGDPIVTSLGNAQKSSVFRSSTGACAAFLENKDKVSYARV 386
Query: 389 VFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKW 448
F + Y LP WS+SILPDCK VFNTA V +Q S ++M + G W
Sbjct: 387 AFNGMHYDLPPWSISILPDCKTTVFNTARVGSQISQMKM-------------EWAGGFAW 433
Query: 449 QVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIE 508
Q + E +GE G ++ IN T+D TDYLWYTT + V ++E+FL NG L +
Sbjct: 434 QSYNEEINSFGEDPLTTVGLLEQINVTRDNTDYLWYTTYVDVAQDEQFLSNGENLKLTVM 493
Query: 509 SKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEW 568
S GHALH F N +L+G+ G+ P Y + L AG N I+ LS+ VGL N G +E
Sbjct: 494 SAGHALHIFINGQLKGTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIAVGLPNVGEHFET 553
Query: 569 VGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQ 627
AGI V + G N G DL+ WTY++GL+GE + +++ + + W EP + Q
Sbjct: 554 WNAGILGPVTLDGLNEGRRDLTWQKWTYQVGLKGESMSLHSLSGSSTVEW---GEPVQKQ 610
Query: 628 PLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDY 687
PLTWYKA P GDEP+ LDM MGKG W+NG+ IGRYWP K+S + CDY
Sbjct: 611 PLTWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWP--GYKASGN---CGTCDY 665
Query: 688 RGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
RG+++ KC T CG+ SQRWYH+PRSW P+ N+LVIFEE GGDPT I+ R I
Sbjct: 666 RGEYDETKCQTNCGDSSQRWYHVPRSWLSPTGNLLVIFEEWGGDPTGISMVKRSI 720
>gi|61162206|dbj|BAD91084.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 852
Score = 791 bits (2044), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/738 (51%), Positives = 497/738 (67%), Gaps = 25/738 (3%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
+ +F +S + +C VTYD ++++ING+R L+IS +IHYPRS P MW GL+Q+AK+G
Sbjct: 14 LTMTLFMASELIHC--TTVTYDKKAILINGQRRLLISGSIHYPRSTPEMWEGLIQKAKDG 71
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ I++YVFWNGHE SPG YYF GR++LV+FIK +Q+A +++ LRIGP+V AE+N+GG
Sbjct: 72 GLDVIDTYVFWNGHEPSPGNYYFEGRYDLVRFIKTVQKAGLFLHLRIGPYVCAEWNFGGF 131
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL Y+PG FR D PFK MQ F IV MMK EKLFASQGGPIIL+Q+ENEYG
Sbjct: 132 PVWLKYVPGISFRTDNGPFKVAMQGFTQKIVQMMKNEKLFASQGGPIILSQIENEYGPER 191
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G G+ Y WAAKMAV + GVPW+MC++ D PDP+IN CN FYCD FTP+ P P
Sbjct: 192 KALGAPGQNYINWAAKMAVGLDTGVPWVMCKEDDAPDPMINACNGFYCDGFTPNKPYKPT 251
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTE W GWF FGG HRP +D+AF+VARF Q+GGS NYYMYHGGTNFGRTAGGPFI
Sbjct: 252 MWTEAWSGWFLEFGGTIHHRPVQDLAFAVARFIQRGGSYVNYYMYHGGTNFGRTAGGPFI 311
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TTSYDY+APIDEYGL R PK+GHLKELH AIKLCEH+LL+ E + SLG+ +A V+
Sbjct: 312 TTSYDYDAPIDEYGLIRQPKYGHLKELHKAIKLCEHSLLSSEPTVTSLGTYHQAYVFNSG 371
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
CAAFL+N + V F N Y LP WSVSILPDC+ V+NTA V Q+S V+M+P
Sbjct: 372 PRRCAAFLSNFHSVEAR-VTFNNKHYDLPPWSVSILPDCRNEVYNTAKVGVQTSHVQMIP 430
Query: 430 ENLQPSEASPDNGSKGLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
N S+ WQ + E I+ + + G ++ IN T+DT+DYLWY T++
Sbjct: 431 TN-----------SRLFSWQTYDEDISSVHERSSIPAIGLLEQINVTRDTSDYLWYMTNV 479
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
++ ++ L G +P L ++S GHALH F N + GSA G F + +P++L AG N
Sbjct: 480 DISSSD--LSGGKKPTLTVQSAGHALHVFVNGQFSGSAFGTREQRQFTFADPVNLHAGIN 537
Query: 549 EIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
IALLS+ VGL N G YE GI V + G +G DL+ + W K+GL+GE + +
Sbjct: 538 RIALLSIAVGLPNVGLHYESWKTGIQGPVFLDGLGNGKKDLTLHKWFNKVGLKGEAMNLV 597
Query: 608 NPGYRNNINWV-STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGR 666
+P +++ W+ ++ Q L WYKA P G+EP+ LDM +MGKG W+NG+ IGR
Sbjct: 598 SPNGASSVGWIRRSLATQTKQTLKWYKAYFNAPGGNEPLALDMRRMGKGQVWINGQSIGR 657
Query: 667 YWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFE 726
YW ++ +C C Y G F P KC CG P+QRWYH+PRSW KP++N++V+FE
Sbjct: 658 YWMAYAK-----GDC-SSCSYIGTFRPTKCQLHCGRPTQRWYHVPRSWLKPTQNLVVVFE 711
Query: 727 EKGGDPTKITFSIRKISG 744
E GGDP+KIT R ++G
Sbjct: 712 ELGGDPSKITLVRRSVAG 729
>gi|302814772|ref|XP_002989069.1| hypothetical protein SELMODRAFT_269483 [Selaginella moellendorffii]
gi|300143170|gb|EFJ09863.1| hypothetical protein SELMODRAFT_269483 [Selaginella moellendorffii]
Length = 722
Score = 789 bits (2038), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/721 (51%), Positives = 490/721 (67%), Gaps = 21/721 (2%)
Query: 24 FAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGH 83
+ V YD R LIING+ ++ISA+IHYPR+ P MW L+ AK GG++ IE+YVFW+GH
Sbjct: 20 LSDTVAYDHRGLIINGQHRMLISASIHYPRAAPQMWSQLISNAKAGGIDVIETYVFWDGH 79
Query: 84 ELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRN 143
+ + Y F GRF+LV F+K++ +A +Y LRIGP+V AE+N GG PVWL +PG FR
Sbjct: 80 QPTRDTYNFEGRFDLVSFVKLVHEAGLYANLRIGPYVCAEWNLGGFPVWLKDVPGIEFRT 139
Query: 144 DTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
+ +PFK MQ F+ IV MMK +KLFA QGGPIILAQ+ENEYG ++ YG GK Y WA
Sbjct: 140 NNQPFKAEMQAFVEKIVAMMKHDKLFAPQGGPIILAQIENEYGNIDAAYGAAGKEYMEWA 199
Query: 204 AKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFG 263
A MA GVPWIMCQQ D PD +++TCN FYCD + P++ PK+WTENW GWF+ +G
Sbjct: 200 ANMAQGLGTGVPWIMCQQSDAPDYILDTCNGFYCDAWAPNNKKKPKMWTENWSGWFQKWG 259
Query: 264 GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYG 323
PHRP ED+AF+VARFFQ+GGS NYYMY GGTNFGR++GGP++TTSYDY+APIDE+G
Sbjct: 260 EASPHRPVEDVAFAVARFFQRGGSFQNYYMYFGGTNFGRSSGGPYVTTSYDYDAPIDEFG 319
Query: 324 LPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVY-ADSSGACAAFLANMDD 382
+ R PKWGHLK+LH AIKLCE AL + + + +SLG QEA VY + SSGACAAFLAN+D
Sbjct: 320 VIRQPKWGHLKQLHAAIKLCEAALGSNDPTYISLGQLQEAHVYGSTSSGACAAFLANIDS 379
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
+D TV F + +Y LPAWSVSILPDCK V NTA V Q++ M P
Sbjct: 380 SSDATVKFNSRTYLLPAWSVSILPDCKTVSHNTAKVHVQTAMPTMKPS------------ 427
Query: 443 SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
GL W+ + E G+W ++ V S ++ INTTKDT+DYLWYTTS+ +++ + +
Sbjct: 428 ITGLAWESYPEPVGVWSDSGIVASALLEQINTTKDTSDYLWYTTSLDISQAD---AASGK 484
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
+L +ES +H F N +L GSAS GT + PI L +G N +A+L TVGLQN
Sbjct: 485 ALLSLESMRDVVHVFVNGKLAGSASTKGTQLYAAVEQPIELASGHNSLAILCATVGLQNY 544
Query: 563 GPFYEWVGAGIT-SVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTM 621
GPF E GAGI SV + G SG +DL+ W +++GL+GE L I+ + W S +
Sbjct: 545 GPFIETWGAGINGSVIVKGLPSGQIDLTAEEWIHQVGLKGESLAIFTESGSQRVRWSSAV 604
Query: 622 EPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDEC 681
P+ Q L WYKA P G++P+ LD+ MGKG AW+NG+ IGR+WP S ++ C
Sbjct: 605 --PQGQALVWYKAHFDSPSGNDPVALDLESMGKGQAWINGQSIGRFWP--SLRAPDTAGC 660
Query: 682 VQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRK 741
Q CDYRG ++ KC +GCG+PSQRWYH+PRSW + S N++V+FEE+GG P+ ++F R
Sbjct: 661 PQTCDYRGSYSSSKCRSGCGQPSQRWYHVPRSWLQDSGNLVVLFEEEGGKPSGVSFVTRT 720
Query: 742 I 742
+
Sbjct: 721 V 721
>gi|449435860|ref|XP_004135712.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 723
Score = 789 bits (2037), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/733 (52%), Positives = 493/733 (67%), Gaps = 24/733 (3%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
L+++ SS+ +VTYD ++L+I+G+R ++IS +IHYPRS P MWP L+Q+AK+G
Sbjct: 12 LGLVLWVCSSVM----ASVTYDHKALVIDGKRRILISGSIHYPRSTPQMWPDLIQKAKDG 67
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ IE+YVFWNGHE SPG+YYF R+ LV+F+K++QQA +Y+ LRIGP+V AE+N+GG
Sbjct: 68 GLDVIETYVFWNGHEPSPGQYYFEDRYELVRFVKLVQQAGLYVHLRIGPYVCAEWNFGGF 127
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL Y+PG FR D PFK MQKF IV MMK EKL+ SQGGPIIL+Q+ENEYG E
Sbjct: 128 PVWLKYVPGIAFRTDNGPFKAAMQKFTAKIVSMMKGEKLYHSQGGPIILSQIENEYGPVE 187
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G GK Y WAA+MA+ + GVPW+MC+Q D PDP+I+TCN FYC+ F P+ PK
Sbjct: 188 WEIGAPGKSYTKWAAQMALGLDTGVPWVMCKQEDAPDPMIDTCNGFYCENFEPNKAYKPK 247
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTE W GWF FGG P+RP ED+A++VARF Q GS+ NYYMYHGGTNFGRTAGGPFI
Sbjct: 248 MWTEAWTGWFTEFGGPVPYRPVEDLAYAVARFIQNRGSLINYYMYHGGTNFGRTAGGPFI 307
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TSYDY+APIDEYGL R PKWGHL++LH AIKLCE AL++ + + SLGS QEA VY
Sbjct: 308 ATSYDYDAPIDEYGLIRQPKWGHLRDLHKAIKLCEPALVSVDPTVSSLGSKQEAHVYNTR 367
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
SG CAAFLAN D V F N Y LP WSVSILPDCK VVFNTA V A S +M P
Sbjct: 368 SGECAAFLANYDPSTSVRVTFGNHPYDLPPWSVSILPDCKTVVFNTAKVNAPSYWPKMTP 427
Query: 430 ENLQPSEASPDNGSKGLKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
+ W + +E A + + +G V+ I+ T+D TDYLWY T I
Sbjct: 428 IS-------------SFSWHSYNEETASAYADDTTTMAGLVEQISITRDATDYLWYMTDI 474
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
++ NE FLK+G P+L I S GHALH F N +L G+ G +P + ++L+ G N
Sbjct: 475 RIDSNEGFLKSGQWPLLTIFSAGHALHVFINGQLSGTVYGGLDNPKLTFSKYVNLRPGVN 534
Query: 549 EIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
++++LS+ VGL N G +E AGI V + G N GT D+S Y W+YK+GL+GE L ++
Sbjct: 535 KLSMLSVAVGLPNVGVHFETWNAGILGPVTLKGLNEGTRDMSGYKWSYKVGLKGEALNLH 594
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+++ W++ + QPLTWYK P G+EP+ LDM MGKG W+NGE IGR+
Sbjct: 595 TVSGSSSVEWMTGSLVSQKQPLTWYKTTFNAPGGNEPLALDMGSMGKGQVWINGESIGRH 654
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
WP + + S +C Y G F KC CGEPSQRWYH+PR+W KPS NILVIFEE
Sbjct: 655 WPAYTARGS-----CGKCYYGGIFTEKKCHFSCGEPSQRWYHVPRAWLKPSGNILVIFEE 709
Query: 728 KGGDPTKITFSIR 740
GG+P I+ R
Sbjct: 710 WGGNPDGISLVKR 722
>gi|449489943|ref|XP_004158465.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 1225
Score = 789 bits (2037), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/733 (52%), Positives = 493/733 (67%), Gaps = 24/733 (3%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
L+++ SS+ +VTYD ++L+I+G+R ++IS +IHYPRS P MWP L+Q+AK+G
Sbjct: 12 LGLVLWVCSSVM----ASVTYDHKALVIDGKRRILISGSIHYPRSTPQMWPDLIQKAKDG 67
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ IE+YVFWNGHE SPG+YYF R+ LV+F+K++QQA +Y+ LRIGP+V AE+N+GG
Sbjct: 68 GLDVIETYVFWNGHEPSPGQYYFEDRYELVRFVKLVQQAGLYVHLRIGPYVCAEWNFGGF 127
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL Y+PG FR D PFK MQKF IV MMK EKL+ SQGGPIIL+Q+ENEYG E
Sbjct: 128 PVWLKYVPGIAFRTDNGPFKAAMQKFTAKIVSMMKGEKLYHSQGGPIILSQIENEYGPVE 187
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G GK Y WAA+MA+ + GVPW+MC+Q D PDP+I+TCN FYC+ F P+ PK
Sbjct: 188 WEIGAPGKSYTKWAAQMALGLDTGVPWVMCKQEDAPDPMIDTCNGFYCENFEPNKAYKPK 247
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTE W GWF FGG P+RP ED+A++VARF Q GS+ NYYMYHGGTNFGRTAGGPFI
Sbjct: 248 MWTEAWTGWFTEFGGPVPYRPVEDLAYAVARFIQNRGSLINYYMYHGGTNFGRTAGGPFI 307
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TSYDY+APIDEYGL R PKWGHL++LH AIKLCE AL++ + + SLGS QEA VY
Sbjct: 308 ATSYDYDAPIDEYGLIRQPKWGHLRDLHKAIKLCEPALVSVDPTVSSLGSKQEAHVYNTR 367
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
SG CAAFLAN D V F N Y LP WSVSILPDCK VVFNTA V A S +M P
Sbjct: 368 SGECAAFLANYDPSTSVRVTFGNHPYDLPPWSVSILPDCKTVVFNTAKVNAPSYWPKMTP 427
Query: 430 ENLQPSEASPDNGSKGLKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
+ W + +E A + + +G V+ I+ T+D TDYLWY T I
Sbjct: 428 IS-------------SFSWHSYNEETASAYADDTTTMAGLVEQISITRDATDYLWYMTDI 474
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
++ NE FLK+G P+L I S GHALH F N +L G+ G +P + ++L+ G N
Sbjct: 475 RIDSNEGFLKSGQWPLLTIFSAGHALHVFINGQLSGTVYGGLDNPKLTFSKYVNLRPGVN 534
Query: 549 EIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
++++LS+ VGL N G +E AGI V + G N GT D+S Y W+YK+GL+GE L ++
Sbjct: 535 KLSMLSVAVGLPNVGVHFETWNAGILGPVTLKGLNEGTRDMSGYKWSYKVGLKGEALNLH 594
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+++ W++ + QPLTWYK P G+EP+ LDM MGKG W+NGE IGR+
Sbjct: 595 TVSGSSSVEWMTGSLVSQKQPLTWYKTTFNAPGGNEPLALDMGSMGKGQVWINGESIGRH 654
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
WP + + S +C Y G F KC CGEPSQRWYH+PR+W KPS NILVIFEE
Sbjct: 655 WPAYTARGS-----CGKCYYGGIFTEKKCHFSCGEPSQRWYHVPRAWLKPSGNILVIFEE 709
Query: 728 KGGDPTKITFSIR 740
GG+P I+ R
Sbjct: 710 WGGNPDGISLVKR 722
Score = 491 bits (1264), Expect = e-136, Method: Compositional matrix adjust.
Identities = 251/513 (48%), Positives = 327/513 (63%), Gaps = 14/513 (2%)
Query: 229 INTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSV 288
I+TCN FYC+ F P+ PKIWTENW GW+ FGG P+RP ED+AFSVARF Q GGS+
Sbjct: 723 IDTCNGFYCENFKPNQIYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQNGGSL 782
Query: 289 HNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALL 348
NYYMYHGGTNFGRT+G F+TTSYD++APIDEYGL R PKWGHL++LH AIKLCE AL+
Sbjct: 783 VNYYMYHGGTNFGRTSG-LFVTTSYDFDAPIDEYGLLREPKWGHLRDLHKAIKLCEPALV 841
Query: 349 NGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDC 408
+ + ++ LG QEA V+ SSGACAAFLAN D V F N Y LP WS+SILPDC
Sbjct: 842 SADPTSTWLGKDQEARVFKSSSGACAAFLANYDTSAFVRVNFWNHPYDLPPWSISILPDC 901
Query: 409 KKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGF 468
K V FNTA VR ++ NL ++ +P + L ++ +E A + + K G
Sbjct: 902 KTVTFNTARVRRDP---KLFIPNLLMAKMTPISSFWWLSYK--EEPASAYAKDTTTKDGL 956
Query: 469 VDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASG 528
V+ ++ T DTTDYLWY T I ++ E FLK+G P+L + S GH LH F N +L GS G
Sbjct: 957 VEQVSVTWDTTDYLWYMTDIRIDSTEGFLKSGQWPLLTVNSAGHILHVFINGQLSGSVYG 1016
Query: 529 NGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLD 587
+ P + ++LK G N++++LS+TVGL N G ++ AG+ V + G N GT D
Sbjct: 1017 SLEDPRITFSKYVNLKQGVNKLSMLSVTVGLPNVGLHFDTWNAGVLGPVTLKGLNEGTRD 1076
Query: 588 LSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGL 647
+S Y W+YK+GL+GE L +Y+ N++ W+ + QPLTWYK P G+EP+ L
Sbjct: 1077 MSKYKWSYKVGLRGEILNLYSVKGSNSVQWMKGSF--QKQPLTWYKTTFNTPAGNEPLAL 1134
Query: 648 DMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRW 707
DM M KG W+NG IGRY+P +C +C Y G F KC+ CG PSQ+W
Sbjct: 1135 DMSSMSKGQIWVNGRSIGRYFPGYIASG----KC-NKCSYTGFFTEKKCLWNCGGPSQKW 1189
Query: 708 YHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
YHIPR W P+ N+L+I EE GG+P I+ R
Sbjct: 1190 YHIPRDWLSPNGNLLIILEEIGGNPQGISLVKR 1222
>gi|326512146|dbj|BAJ96054.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 847
Score = 789 bits (2037), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/721 (52%), Positives = 496/721 (68%), Gaps = 21/721 (2%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
G VTYD ++++ING+R ++ S +IHYPRS P MW GL+Q+AK+GG++ I++YVFWNGHE
Sbjct: 30 GAVTYDRKAVLINGQRRILFSGSIHYPRSTPEMWEGLIQKAKDGGLDVIQTYVFWNGHEP 89
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
+PG Y F GR++LVKFIK Q+A +++ LRIGP++ E+N+GG PVWL Y+PG FR D
Sbjct: 90 TPGSYNFEGRYDLVKFIKTAQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDN 149
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
EPFK MQ F IV MMK E+LFASQGGPIIL+Q+ENEYG E +G GK Y+ WAAK
Sbjct: 150 EPFKAAMQGFTEKIVGMMKSEELFASQGGPIILSQIENEYGPEEKEFGAAGKSYSDWAAK 209
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGR 265
MAV + GVPW+MC+Q D PDPVIN CN FYCD FTP++PS P +WTE W GWF FGG
Sbjct: 210 MAVGLDTGVPWVMCKQEDAPDPVINACNGFYCDAFTPNTPSKPTMWTEAWTGWFTEFGGT 269
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
RP ED++F+VARF QKGGS NYYMYHGGTNFGRTAGGPFITTSYDY+AP+DEYGL
Sbjct: 270 IRKRPVEDLSFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLA 329
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKND 385
R PK+GHLKELH AIKLCE AL++ + + SLGS QEA VY SG CAAFLAN + +
Sbjct: 330 REPKYGHLKELHKAIKLCEQALVSVDPTVTSLGSMQEAHVYRSPSG-CAAFLANYNSNSH 388
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
+VF N Y LP WS+SILPDCK VV+NTA V Q+S ++M +G+
Sbjct: 389 AKIVFDNEHYSLPPWSISILPDCKTVVYNTATVGVQTSQMQMW-----------SDGASS 437
Query: 446 LKWQVFKEIAGIWGEADFV-KSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
+ W+ + E G A + +G ++ +N T+DT+DYLWY TS+ V+ +E+ L+ G
Sbjct: 438 MMWERYDEEVGSLAAAPLLTTTGLLEQLNATRDTSDYLWYMTSVDVSPSEKSLQGGKPLS 497
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
L ++S GHALH F N +LQGSASG YK + L+AG N+I+LLS+ GL N G
Sbjct: 498 LTVQSAGHALHIFVNGQLQGSASGTREDKRISYKGDVKLRAGTNKISLLSVACGLPNIGV 557
Query: 565 FYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEP 623
YE G+ V + G + G+ DL+ +WTY++GL+GE + + + +++ W+
Sbjct: 558 HYETWNTGVNGPVVLHGLDEGSRDLTWQTWTYQVGLKGEQMNLNSLEGASSVEWMQGSLI 617
Query: 624 PKNQ-PLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
+NQ PL WY+A P GDEP+ LDM MGKG W+NG+ IGRY + +C
Sbjct: 618 AQNQMPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRY-----SLAYATGDC- 671
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
++C Y G F KC GCG+P+QRWYH+P+SW +P+ N+LV+FEE GGD +KI+ R +
Sbjct: 672 KDCSYTGSFRAIKCQAGCGQPTQRWYHVPKSWLQPTRNLLVVFEELGGDTSKISLVKRSV 731
Query: 743 S 743
S
Sbjct: 732 S 732
>gi|114217397|dbj|BAF31234.1| beta-D-galactosidase [Persea americana]
Length = 849
Score = 788 bits (2034), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/740 (51%), Positives = 498/740 (67%), Gaps = 25/740 (3%)
Query: 7 IAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQA 66
I+ F L++ F + C +VTYD +++IING+R+++IS +IHYPRS P MW GL+Q+A
Sbjct: 14 ISLFLLVLHFQ--LIQC---SVTYDRKAIIINGQRKILISGSIHYPRSTPDMWEGLMQKA 68
Query: 67 KEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNY 126
K+GG++ I++YVFWN HE SPG Y F GR++LV+F+K +Q+A +YM LRIGP+V AE+N+
Sbjct: 69 KDGGLDVIQTYVFWNVHEPSPGNYNFEGRYDLVRFVKTVQKAGLYMHLRIGPYVCAEWNF 128
Query: 127 GGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG 186
GG PVWL Y+PG FR D EPFK MQ F IV MMK E LF SQGGPIIL+Q+ENEYG
Sbjct: 129 GGFPVWLKYVPGISFRTDNEPFKMAMQGFTEKIVQMMKSESLFESQGGPIILSQIENEYG 188
Query: 187 YYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPS 246
G G Y WAAKMAV GVPW+MC++ D PDPVINTCN FYCD FTP+ P
Sbjct: 189 SESKALGAPGHAYMTWAAKMAVGLRTGVPWVMCKEDDAPDPVINTCNGFYCDAFTPNKPY 248
Query: 247 MPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
P +WTE W GWF FGG RP ED+AF+VARF QKGGS NYYMYHGGTNFGRTAGG
Sbjct: 249 KPTMWTEAWSGWFTEFGGTVHERPVEDLAFAVARFIQKGGSFINYYMYHGGTNFGRTAGG 308
Query: 307 PFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVY 366
PFITTSYDY+APIDEYGL R PK+GHLKELH AIKLCE AL++ + SLG Q++ V+
Sbjct: 309 PFITTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKLCEPALISADPIVTSLGPYQQSHVF 368
Query: 367 ADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVE 426
+ +G CAAFL+N + + V+F N+ Y LP WS+SILPDC+ VVFNTA V Q+S +
Sbjct: 369 SSGTGGCAAFLSNYNPNSVARVMFNNMHYSLPPWSISILPDCRNVVFNTAKVGVQTSQMH 428
Query: 427 MVPENLQPSEASPDNGSKGLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYT 485
M +K L W+++ E IA + + G ++ +N T+DT+DYLWY
Sbjct: 429 MSAGE-----------TKLLSWEMYDEDIASLGDNSMITAVGLLEQLNVTRDTSDYLWYM 477
Query: 486 TSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKA 545
TS+ ++ +E L+ G PVL ++S GHALH + N +L GSA G+ + F + ++++A
Sbjct: 478 TSVDISPSESSLRGGRPPVLTVQSAGHALHVYINGQLSGSAHGSRENRRFTFTGDVNMRA 537
Query: 546 GKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHL 604
G N IALLS+ V L N G YE G+ V + G + G DL+ W+Y++GL+GE +
Sbjct: 538 GINRIALLSIAVELPNVGLHYESTNTGVLGPVVLHGLDQGKRDLTWQKWSYQVGLKGEAM 597
Query: 605 GIYNPGYRNNINWV-STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEE 663
+ P + + W+ ++ K QPLTWYKA P GDEP+ LD+ MGKG W+NGE
Sbjct: 598 NLVAPSGISYVEWMQASFATQKLQPLTWYKAYFNAPGGDEPLALDLGSMGKGQVWINGES 657
Query: 664 IGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILV 723
IGRYW + H C Y G + KC TGCG+P+QRWYH+PRSW +P++N+LV
Sbjct: 658 IGRYWTAAANGDCNH------CSYAGTYRAPKCQTGCGQPTQRWYHVPRSWLQPTKNLLV 711
Query: 724 IFEEKGGDPTKITFSIRKIS 743
IFEE GGD + I+ R +S
Sbjct: 712 IFEEIGGDASGISLVKRSVS 731
>gi|3299896|gb|AAC25984.1| beta-galactosidase [Solanum lycopersicum]
Length = 724
Score = 787 bits (2033), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/728 (52%), Positives = 498/728 (68%), Gaps = 24/728 (3%)
Query: 15 FFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTI 74
FFSS +V+YD R++IING+R+++IS +IHYPRS P MWP L+Q+AK+GG++ I
Sbjct: 17 FFSS-----VKASVSYDDRAIIINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVI 71
Query: 75 ESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLH 134
E+YVFWNGHE SPGKY F GR++LV+FIK++Q+A +Y+ LRIGP+V AE+N+GG PVWL
Sbjct: 72 ETYVFWNGHEPSPGKYNFEGRYDLVRFIKMVQRAGLYVNLRIGPYVCAEWNFGGFPVWLK 131
Query: 135 YIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGE 194
Y+PG FR + +PFK MQ F+ IV+MMK E LF SQGGPII+AQ+ENEYG E G
Sbjct: 132 YVPGMEFRTNNQPFKVAMQGFVQKIVNMMKSENLFESQGGPIIMAQIENEYGPVEWEIGA 191
Query: 195 GGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTEN 254
GK Y WAA+MAV GVPWIMC+Q D PDPVI+TCN FYC+ F P+ P PK+WTE
Sbjct: 192 PGKAYTKWAAQMAVGLKTGVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKPYKPKMWTEV 251
Query: 255 WPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYD 314
W GW+ FGG P RP+EDIAFSVARF Q GS NYYMYHGGTNFGRT+ G FI TSYD
Sbjct: 252 WTGWYTKFGGPIPQRPAEDIAFSVARFVQNNGSFFNYYMYHGGTNFGRTSSGLFIATSYD 311
Query: 315 YEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACA 374
Y+AP+DEYGL PK+GHL++LH AIKL E AL++ + SLGS+QEA VY SGACA
Sbjct: 312 YDAPLDEYGLLNEPKYGHLRDLHKAIKLSEPALVSSYAAVTSLGSNQEAHVYRSKSGACA 371
Query: 375 AFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQP 434
AFL+N D + V F+N Y+LP WS+SILPDCK V+NTA V +QSS+++M P
Sbjct: 372 AFLSNYDSRYSVKVTFQNRPYNLPPWSISILPDCKTAVYNTAQVNSQSSSIKMTP----- 426
Query: 435 SEASPDNGSKGLKWQVFKEIAGIWGEAD-FVKSGFVDHINTTKDTTDYLWYTTSIIVNEN 493
GL WQ + E ++D +G + N T+D++DYLWY T++ + N
Sbjct: 427 -------AGGGLSWQSYNEETPTADDSDTLTANGLWEQKNVTRDSSDYLWYMTNVNIASN 479
Query: 494 EEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALL 553
E FLKNG P L + S GH LH F N +L G+ G +P Y + L+AG N+I+LL
Sbjct: 480 EGFLKNGKDPYLTVMSAGHVLHVFVNGKLSGTVYGTLDNPKLTYSGNVKLRAGINKISLL 539
Query: 554 SMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYR 612
S++VGL N G Y+ AG+ V ++G N G+ +L+ W+YK+GL+GE L +++
Sbjct: 540 SVSVGLPNVGVHYDTWNAGVLGPVTLSGLNEGSRNLAKQKWSYKVGLKGESLSLHSLSGS 599
Query: 613 NNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKS 672
+++ WV + QPLTWYKA P G++P+ LDM MGKG W+NGE +GR+WP
Sbjct: 600 SSVEWVRGSLMAQKQPLTWYKATFNAPGGNDPLALDMASMGKGQIWINGEGVGRHWPGYI 659
Query: 673 RKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDP 732
+ +C +C Y G FN KC T CG+PSQRWYH+PRSW KPS N+LV+FEE GG+P
Sbjct: 660 AQG----DC-SKCSYAGTFNEKKCQTNCGQPSQRWYHVPRSWLKPSGNLLVVFEEWGGNP 714
Query: 733 TKITFSIR 740
T I+ R
Sbjct: 715 TGISLVRR 722
>gi|326515822|dbj|BAK07157.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 847
Score = 786 bits (2031), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/721 (52%), Positives = 495/721 (68%), Gaps = 21/721 (2%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
G VTYD ++++ING+R ++ S +IHYPRS P MW GL+Q+AK+GG++ I++YVFWNGHE
Sbjct: 30 GAVTYDRKAVLINGQRRILFSGSIHYPRSTPEMWEGLIQKAKDGGLDVIQTYVFWNGHEP 89
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
+PG Y F GR++LVKFIK Q+A +++ LRIGP++ E+N+GG PVWL Y+PG FR D
Sbjct: 90 TPGSYNFEGRYDLVKFIKTAQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDN 149
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
EPFK MQ F IV MMK E+LFASQGGPIIL+Q+ENEYG E +G GK Y+ WAAK
Sbjct: 150 EPFKAAMQGFTEKIVGMMKSEELFASQGGPIILSQIENEYGPEEKEFGAAGKSYSDWAAK 209
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGR 265
MAV + GVPW+MC+Q D PDPVIN CN FYCD FTP++PS P +WTE W GWF FGG
Sbjct: 210 MAVGLDTGVPWVMCKQEDAPDPVINACNGFYCDAFTPNTPSKPTMWTEAWTGWFTEFGGT 269
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
RP ED++F+VARF QKGGS NYYMYHGGTNFGRTAGGPFITTSYDY+AP+DEYGL
Sbjct: 270 IRKRPVEDLSFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLA 329
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKND 385
R PK+GHLKELH AIKLCE AL++ + + SLGS QEA VY SG CAAFLAN + +
Sbjct: 330 REPKYGHLKELHKAIKLCEQALVSVDPTVTSLGSMQEAHVYRSPSG-CAAFLANYNSNSH 388
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
+VF N Y LP WS+SILPDCK VV+NTA V Q+S ++M +G+
Sbjct: 389 AKIVFDNEHYSLPPWSISILPDCKTVVYNTATVGVQTSQMQMW-----------SDGASS 437
Query: 446 LKWQVFKEIAGIWGEADFV-KSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
+ W+ + E G A + +G ++ +N T+DT+DYLWY TS+ V+ +E+ L+ G
Sbjct: 438 MMWERYDEEVGSLAAAPLLTTTGLLEQLNATRDTSDYLWYMTSVDVSPSEKSLQGGKPLS 497
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
L ++S GHALH F N +LQGSASG YK + L+AG N+I+LLS+ GL N G
Sbjct: 498 LTVQSAGHALHIFVNGQLQGSASGTREDKRISYKGDVKLRAGTNKISLLSVACGLPNIGV 557
Query: 565 FYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEP 623
YE G+ V + G + G+ DL+ +WTY++GL+GE + + + +++ W+
Sbjct: 558 HYETWNTGVNGPVVLHGLDEGSRDLTWQTWTYQVGLKGEQMNLNSLEGASSVEWMQGSLI 617
Query: 624 PKNQ-PLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
+NQ PL WY+A P GDEP+ LDM MGKG W+NG+ IGRY + +C
Sbjct: 618 AQNQMPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRY-----SLAYATGDC- 671
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
++C Y G F KC GCG+P+QRWYH+P+ W +P+ N+LV+FEE GGD +KI+ R +
Sbjct: 672 KDCSYTGSFRAIKCQAGCGQPTQRWYHVPKPWLQPTRNLLVVFEELGGDTSKISLVKRSV 731
Query: 743 S 743
S
Sbjct: 732 S 732
>gi|14970843|emb|CAC44502.1| beta-galactosidase [Fragaria x ananassa]
Length = 722
Score = 786 bits (2031), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/731 (52%), Positives = 492/731 (67%), Gaps = 23/731 (3%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
L+F S ++ A +V YD R++I+NG+R ++IS +IHYPRS P MWP L+Q+AK+GG+
Sbjct: 12 FLLFLVSWLSSALA-SVGYDHRAIIVNGKRRILISGSIHYPRSTPEMWPDLLQKAKDGGL 70
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ +++YVFWNGHE SPGKYYF R++LVKFIK+ QQ +Y+ LRIGP++ AE+N+GG PV
Sbjct: 71 DVLQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLAQQHGLYVHLRIGPYICAEWNFGGFPV 130
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL Y+PG FR D PF M+KF IV MMK E+LF +QGGPIIL+Q+ENEYG E
Sbjct: 131 WLKYVPGIAFRTDNRPFMAAMEKFTQKIVYMMKAERLFQTQGGPIILSQIENEYGPVEWE 190
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
G GK Y WAAKMAV N GVPW+MC+Q D PDP+I+TCN FYC+ FTP+ PK+W
Sbjct: 191 IGAPGKSYTQWAAKMAVGLNTGVPWVMCKQEDAPDPIIDTCNGFYCENFTPNKNYKPKMW 250
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TE W GW+ FGG P RP++D+AFSVARF Q GGS NYYMYHGGTNFGRTAGGPFI T
Sbjct: 251 TEIWTGWYTEFGGAVPTRPAQDLAFSVARFIQNGGSFANYYMYHGGTNFGRTAGGPFIAT 310
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
SYDY+AP+DEYGLPR PK+ HLK +H AIK+ E ALL + + LG++QEA VY SG
Sbjct: 311 SYDYDAPLDEYGLPREPKYSHLKYMHKAIKMAEPALLATDAAVSKLGNNQEAHVYQSRSG 370
Query: 372 ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPEN 431
CAAFLAN D K V F N Y+LP WS+SILPDCK VFNTA V QS +M P
Sbjct: 371 -CAAFLANYDTKYPVRVTFWNKQYNLPPWSISILPDCKTEVFNTARV-GQSPPTKMTP-- 426
Query: 432 LQPSEASPDNGSKGLKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIV 490
L WQ + +++A + F G + I+ T D TDYLWY T I +
Sbjct: 427 -----------VAHLSWQAYIEDVATSADDNAFTSVGLREQISLTWDNTDYLWYMTDITI 475
Query: 491 NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
NE+FL+ G P L ++S GHALH F N +L GSA G P ++ + L+AG N++
Sbjct: 476 GPNEQFLRTGKYPTLKVDSAGHALHVFINGQLSGSAYGTLAFPKLEFNQGVKLRAGINKL 535
Query: 551 ALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNP 609
ALLS++VGL N G +E W + V + G NSGT D++ + WTYKIG++GE + ++
Sbjct: 536 ALLSVSVGLANVGLHFETWNTGVLGPVTLAGVNSGTWDMTRWQWTYKIGMRGEDMSLHTV 595
Query: 610 GYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWP 669
+++ WV + +PLTWYKA++ PPG+ P+ LDM MGKG W+NG+ IGR+WP
Sbjct: 596 SGSSSVEWVQGSLLAQYRPLTWYKAILNAPPGNAPLALDMGSMGKGQMWINGQSIGRHWP 655
Query: 670 RKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKG 729
H C C Y G + +KC T CG+PSQRWYH+PRSW K S N+LV+FEE G
Sbjct: 656 ----AYKAHGSC-GACYYAGTYTENKCRTNCGQPSQRWYHVPRSWLKSSGNLLVVFEEWG 710
Query: 730 GDPTKITFSIR 740
GDPTKI+ R
Sbjct: 711 GDPTKISLVAR 721
>gi|449457508|ref|XP_004146490.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
gi|449500002|ref|XP_004160975.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 846
Score = 786 bits (2031), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/735 (51%), Positives = 491/735 (66%), Gaps = 20/735 (2%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
LI F + ++ VTYD ++++ING+R ++ S +IHYPRS P MW L+ +AK GG+
Sbjct: 11 FLIAFLLANSHLIHSTVTYDRKAILINGQRRILFSGSIHYPRSTPEMWEDLILKAKNGGL 70
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ +E+YVFWN HE PG Y F GRF+LV+FIK IQ+A +Y LRIGP+V AE+N+GG PV
Sbjct: 71 DVVETYVFWNVHEPYPGIYNFEGRFDLVRFIKTIQKAGLYANLRIGPYVCAEWNFGGFPV 130
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL Y+PG FR D E FK MQ F IV +MK E LF SQGGPIILAQ+ENEYG
Sbjct: 131 WLKYVPGISFRTDNEAFKNAMQGFTEKIVALMKSENLFESQGGPIILAQIENEYGTESKL 190
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
+GE G Y WAA MAV GVPW+MC++ D PDPVINTCN FYCD F+P+ P P +W
Sbjct: 191 FGEAGYNYMTWAANMAVGLQTGVPWVMCKEADAPDPVINTCNGFYCDTFSPNKPYKPTMW 250
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TE W GWF FGG RP +D+AF+VARF Q+GGS+ NYYMYHGGTNFGRTAGGPFITT
Sbjct: 251 TEAWTGWFSEFGGPLHQRPVQDLAFAVARFIQRGGSLVNYYMYHGGTNFGRTAGGPFITT 310
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
SYDY+APIDEYGL R PK+GHLKELH AIK+CE AL++ + SLG Q+A VY+ SG
Sbjct: 311 SYDYDAPIDEYGLLRQPKYGHLKELHRAIKMCEPALVSADPIVTSLGDYQQAHVYSSESG 370
Query: 372 ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPEN 431
CAAFL+N D K+ V+F N Y+LP WS+SILPDCK VFNTA V Q++ + M+P
Sbjct: 371 GCAAFLSNYDTKSFARVLFNNRHYNLPPWSISILPDCKNAVFNTAKVGVQTAQMGMLPAE 430
Query: 432 LQPSEASPDNGSKGLKWQ-VFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIV 490
S L W+ F++I+ + + G ++ IN T+DT+DYLWY TS+ +
Sbjct: 431 -----------STTLSWESYFEDISALDDRSMMTSPGLLEQINVTRDTSDYLWYITSVDI 479
Query: 491 NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
+ +E FL G P LL++S GHA+H F N +L GS SG+ F Y ++L AG N+I
Sbjct: 480 SSSEPFLHGGELPTLLVQSTGHAVHVFINGQLSGSVSGSRKSRRFTYSGKVNLHAGTNKI 539
Query: 551 ALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNP 609
LLS+ VGL N G +E GI V + G G DLS+ WTYK+GL+GE + + +P
Sbjct: 540 GLLSVAVGLPNVGGHFETWNTGILGPVVLYGLRQGKWDLSSQKWTYKVGLKGEAMNLISP 599
Query: 610 GYRNNINWV-STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYW 668
+ + W+ +++ QPLTW+KA P G+EP+ LDM MGKG W+NG+ IGRYW
Sbjct: 600 SGFSPVEWMQASLAAQTPQPLTWHKAYFDAPEGEEPLALDMEGMGKGQIWINGQSIGRYW 659
Query: 669 PRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEK 728
+R + C+Y F P KC GCG+P+QRWYH+PRSW +P +N+LV+FEE
Sbjct: 660 TAYARGN------CSRCNYATAFRPPKCQLGCGQPTQRWYHVPRSWLRPEQNLLVVFEEV 713
Query: 729 GGDPTKITFSIRKIS 743
GG+P++I+ R ++
Sbjct: 714 GGNPSRISIVKRLVT 728
>gi|356539132|ref|XP_003538054.1| PREDICTED: beta-galactosidase 8-like [Glycine max]
Length = 836
Score = 786 bits (2030), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/741 (52%), Positives = 497/741 (67%), Gaps = 20/741 (2%)
Query: 4 RTPIAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLV 63
RT LL FF F NVTYD R+L+I+G+R +++S +IHYPRS P MWP L+
Sbjct: 2 RTSQILLVLLWFFCIYAPSSFGANVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLI 61
Query: 64 QQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAE 123
Q++K+GG++ IE+YVFWN HE G+Y F GR +LVKF+K++ A +Y+ LRIGP+ AE
Sbjct: 62 QKSKDGGLDVIETYVFWNLHEPVRGQYNFEGRGDLVKFVKVVAAAGLYVHLRIGPYACAE 121
Query: 124 YNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVEN 183
+NYGG P+WLH+IPG FR D +PF+ M++F IVD+MK+E L+ASQGGPIIL+Q+EN
Sbjct: 122 WNYGGFPLWLHFIPGIQFRTDNKPFEAEMKQFTAKIVDLMKQENLYASQGGPIILSQIEN 181
Query: 184 EYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPH 243
EYG E+ YG K Y WAA MA + GVPW+MCQQ + PDP+IN CN FYCDQF P+
Sbjct: 182 EYGNIEADYGPAAKSYIKWAASMATSLGTGVPWVMCQQQNAPDPIINACNGFYCDQFKPN 241
Query: 244 SPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRT 303
S + PKIWTE + GWF FG PHRP ED+AF+VARF+Q+GG+ NYYMYHGGTNFGR
Sbjct: 242 SNTKPKIWTEGYTGWFLAFGDAVPHRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRA 301
Query: 304 AGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEA 363
+GGPF+ +SYDY+APIDEYG R PKWGHLK++H AIKLCE AL+ + + SLG + EA
Sbjct: 302 SGGPFVASSYDYDAPIDEYGFIRQPKWGHLKDVHKAIKLCEEALIATDPTITSLGPNIEA 361
Query: 364 DVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSS 423
VY + CAAFLAN+ +D TV F SYHLPAWSVSILPDCK VV NTA + + S
Sbjct: 362 AVY-KTGVVCAAFLANI-ATSDATVTFNGNSYHLPAWSVSILPDCKNVVLNTAKITSASM 419
Query: 424 TVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLW 483
E+L+ + D+GS +W E GI F G ++ INTT D +DYLW
Sbjct: 420 ISSFTTESLKDVGSLDDSGS---RWSWISEPIGISKADSFSTFGLLEQINTTADRSDYLW 476
Query: 484 YTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISL 543
Y+ SI L G++ L I+S GHALHAF N +L GS +GN + PI+L
Sbjct: 477 YSLSID-------LDAGAQTFLHIKSLGHALHAFINGKLAGSGTGNHEKANVEVDIPITL 529
Query: 544 KAGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGF--NSGTLDLSTYSWTYKIGLQG 601
+GKN I LLS+TVGLQN G F++ GAGIT I N +DLS+ WTY++GL+
Sbjct: 530 VSGKNTIDLLSLTVGLQNYGAFFDTWGAGITGPVILKCLKNGSNVDLSSKQWTYQVGLKN 589
Query: 602 EHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNG 661
E LG+ + G N ST+ P NQPLTWYK P G+ P+ +D MGKG AW+NG
Sbjct: 590 EDLGL-SSGCSGQWNSQSTL--PTNQPLTWYKTNFVAPSGNNPVAIDFTGMGKGEAWVNG 646
Query: 662 EEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENI 721
+ IGRYWP +SP C C+YRG ++ KC+ CG+PSQ YH+PRSW +P N
Sbjct: 647 QSIGRYWP---TYASPKGGCTDSCNYRGAYDASKCLKNCGKPSQTLYHVPRSWLRPDRNT 703
Query: 722 LVIFEEKGGDPTKITFSIRKI 742
LV+FEE GG+P +I+F+ ++I
Sbjct: 704 LVLFEESGGNPKQISFATKQI 724
>gi|357130338|ref|XP_003566806.1| PREDICTED: beta-galactosidase 2-like [Brachypodium distachyon]
Length = 831
Score = 786 bits (2030), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/748 (51%), Positives = 504/748 (67%), Gaps = 27/748 (3%)
Query: 1 MKPRTPIAPFALLIFFS-SSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMW 59
M P AP L + + + + VTYD +++++NG+R +++S +IHYPRSVP MW
Sbjct: 1 MASSAPPAPAVLAVALTVALLASSAWAAVTYDRKAVVVNGQRRILLSGSIHYPRSVPEMW 60
Query: 60 PGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPF 119
P L+Q+AK+GG++ +++YVFWNGHE SPG+Y+F GR++LV FIK+++QA +Y+ LRIGP+
Sbjct: 61 PDLIQKAKDGGLDVVQTYVFWNGHEPSPGQYHFEGRYDLVHFIKLVKQAGLYVHLRIGPY 120
Query: 120 VAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILA 179
V AE+N+GG P+WL Y+PG FR D EPFK MQKF T IV MMK E+LF QGGPIIL+
Sbjct: 121 VCAEWNFGGFPIWLKYVPGISFRTDNEPFKAEMQKFTTKIVQMMKSERLFEWQGGPIILS 180
Query: 180 QVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQ 239
Q+ENE+G E GE K YA WAA MA+A N GVPWIMC++ D PDP+INTCN FYCD
Sbjct: 181 QIENEFGPLEWDQGEPAKDYASWAANMAMALNTGVPWIMCKEDDAPDPIINTCNGFYCDW 240
Query: 240 FTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTN 299
F+P+ P P +WTE W W+ FG PHRP ED+A+ VA+F QKGGS NYYMYHGGTN
Sbjct: 241 FSPNKPHKPTMWTEAWTAWYTGFGIPVPHRPVEDLAYGVAKFIQKGGSFVNYYMYHGGTN 300
Query: 300 FGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGS 359
F RTAGGPFI TSYDY+AP+DEYGL R PKWGHLKELH AIKLCE AL+ + SLG+
Sbjct: 301 FERTAGGPFIATSYDYDAPLDEYGLLREPKWGHLKELHRAIKLCEPALVAADPILSSLGN 360
Query: 360 SQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVR 419
+Q+A V+ S+GACAAFL N + V F + Y LP WS+SILPDCK VFNTA V
Sbjct: 361 AQKASVFRSSTGACAAFLENKHKLSYARVSFNGMHYDLPPWSISILPDCKTTVFNTARVG 420
Query: 420 AQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEAD-FVKSGFVDHINTTKDT 478
+Q S ++M + GL WQ + E + E + F G ++ IN T+D
Sbjct: 421 SQISQMKM-------------EWAGGLTWQSYNEEINSFSELESFTTVGLLEQINMTRDN 467
Query: 479 TDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYK 538
TDYLWYTT + V ++E+FL +G P L + S GHALH F N +L G+ G+ +P Y
Sbjct: 468 TDYLWYTTYVDVAKDEQFLTSGKNPKLTVMSAGHALHVFINGQLSGTVYGSVENPKLTYT 527
Query: 539 NPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKI 597
+ L +G N I+ LS+ VGL N G +E AGI V + G N G DL+ WTY++
Sbjct: 528 GKVKLWSGSNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNEGKRDLTWQKWTYQV 587
Query: 598 GLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLA 657
GL+GE + +++ +++ W EP + QPLTWYKA P GDEP+ LDM MGKG
Sbjct: 588 GLKGEAMSLHSLSGSSSVEW---GEPVQKQPLTWYKAFFNAPDGDEPLALDMNSMGKGQI 644
Query: 658 WLNGEEIGRYWP-RKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFK 716
W+NG+ IGRYWP K+ + H CDYRG++N KC T CG+PSQRWYH+PR W
Sbjct: 645 WINGQGIGRYWPGYKASGTCGH------CDYRGEYNETKCQTNCGDPSQRWYHVPRPWLN 698
Query: 717 PSENILVIFEEKGGDPTKITFSIRKISG 744
P+ N+LVIFEE GGDPT I+ +++ +G
Sbjct: 699 PTGNLLVIFEEWGGDPTGISM-VKRTTG 725
>gi|224128630|ref|XP_002329051.1| predicted protein [Populus trichocarpa]
gi|222839722|gb|EEE78045.1| predicted protein [Populus trichocarpa]
Length = 830
Score = 786 bits (2030), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/735 (51%), Positives = 501/735 (68%), Gaps = 30/735 (4%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
+ + F +S+ +V+YDS+++ ING+R ++IS +IHYPRS P MWP L+Q+AKEGG+
Sbjct: 9 VFLVFLASLVCSVTASVSYDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGL 68
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ I++YVFWNGHE SPGKYYF G ++LVKF+K++++A +Y+ LRIGP++ AE+N+G
Sbjct: 69 DVIQTYVFWNGHEPSPGKYYFEGNYDLVKFVKLVKEAGLYVNLRIGPYICAEWNFG---- 124
Query: 132 WLHYIPGTVFRNDTEPFK---YHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY 188
F+N PF+ M+KF T IV+MMK E+LF SQGGPIIL+Q+ENEYG
Sbjct: 125 -------HQFQNGQWPFQGEAAQMRKFTTKIVNMMKAERLFESQGGPIILSQIENEYGPM 177
Query: 189 ESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMP 248
E G G+ Y WAA+MAV GVPW+MC+Q D PDP+INTCN FYCD F+P+ P
Sbjct: 178 EYELGSPGQAYTKWAAQMAVGLRTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKP 237
Query: 249 KIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF 308
K+WTE W GWF FGG PHRP+ED+AFSVARF QKGGS NYYMYHGGTNFGRTAGGPF
Sbjct: 238 KMWTEAWTGWFTQFGGPVPHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPF 297
Query: 309 ITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD 368
I TSYDY+AP+DEYGL R PKWGHLK+LH AIKLCE AL++G+ + + LG+ QEA V+
Sbjct: 298 IATSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGDATVIPLGNYQEAHVFNY 357
Query: 369 SSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMV 428
+G CAAFLAN ++ V FRN+ Y+LP WS+SILPDCK V+NTA V AQS+T++M
Sbjct: 358 KAGGCAAFLANYHQRSFAKVSFRNMHYNLPPWSISILPDCKNTVYNTARVGAQSATIKMT 417
Query: 429 PENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
P P +G GL WQ + E G+ F G ++ INTT+D +DYLWY T +
Sbjct: 418 P--------VPMHG--GLSWQTYNEEPSSSGDNTFTMVGLLEQINTTRDVSDYLWYMTDV 467
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
++ +E FLK+G PVL + S GHALH F N +L G+A G+ P + +SL+AG N
Sbjct: 468 HIDPSEGFLKSGKYPVLTVLSAGHALHVFINGQLSGTAYGSLDFPKLTFSQGVSLRAGVN 527
Query: 549 EIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
+I+LLS+ VGL N GP +E AGI V + G N G +DLS W+YKIGL GE L ++
Sbjct: 528 KISLLSIAVGLPNVGPHFETWNAGILGPVTLNGLNEGRMDLSWQKWSYKIGLHGEALSLH 587
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+ +++ W + QPL+WYK P G+ P+ LDM MGKG W+NG+ +GR+
Sbjct: 588 SISGSSSVEWAEGSLVAQKQPLSWYKTTFNAPAGNSPLALDMGSMGKGQIWINGQHVGRH 647
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
WP + EC Y G +N +KC T CGE SQRWYH+P+SW KP+ N+LV+FEE
Sbjct: 648 WPAYKASGT-----CGECTYIGTYNENKCSTNCGEASQRWYHVPQSWLKPTGNLLVVFEE 702
Query: 728 KGGDPTKITFSIRKI 742
GGDP ++ R++
Sbjct: 703 WGGDPNGVSLVRREV 717
>gi|356509960|ref|XP_003523710.1| PREDICTED: beta-galactosidase 3-like isoform 1 [Glycine max]
Length = 736
Score = 785 bits (2028), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/732 (51%), Positives = 502/732 (68%), Gaps = 26/732 (3%)
Query: 18 SSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESY 77
S +T+C NVTYD +SL+ING+R ++IS +IHYPRS P MW L+ +AK GG++ I++Y
Sbjct: 23 SELTHC---NVTYDRKSLLINGQRRILISGSIHYPRSTPEMWEDLIWKAKHGGLDVIDTY 79
Query: 78 VFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIP 137
VFW+ HE SPG Y F GR++LV+FIK +Q+ +Y LRIGP+V AE+N+GGIPVWL Y+P
Sbjct: 80 VFWDVHEPSPGNYDFEGRYDLVRFIKTVQKVGLYANLRIGPYVCAEWNFGGIPVWLKYVP 139
Query: 138 GTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGK 197
G FR D EPFK MQ F IV MMK EKLF SQGGPIIL+Q+ENEYG G G+
Sbjct: 140 GVSFRTDNEPFKAAMQGFTQKIVQMMKSEKLFQSQGGPIILSQIENEYG--PESRGAAGR 197
Query: 198 RYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPG 257
Y WAA MAV GVPW+MC++ D PDPVIN+CN FYCD F+P+ P P +WTE W G
Sbjct: 198 AYVNWAASMAVGLGTGVPWVMCKENDAPDPVINSCNGFYCDDFSPNKPYKPSMWTETWSG 257
Query: 258 WFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEA 317
WF FGG RP ED++F+VARF QKGGS NYYMYHGGTNFGR+AGGPFITTSYDY+A
Sbjct: 258 WFTEFGGPIHQRPVEDLSFAVARFIQKGGSYVNYYMYHGGTNFGRSAGGPFITTSYDYDA 317
Query: 318 PIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFL 377
PIDEYGL R PK+ HLKELH AIK CEHAL++ + + LSLG+ +A V++ +G CAAFL
Sbjct: 318 PIDEYGLIRQPKYSHLKELHKAIKRCEHALVSLDPTVLSLGTLLQAHVFSSGTGTCAAFL 377
Query: 378 ANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEA 437
AN + ++ TV F N Y LP WS+SILPDCK VFNTA VR Q S V+M+P ++P
Sbjct: 378 ANYNAQSAATVTFNNRHYDLPPWSISILPDCKIDVFNTAKVRVQPSQVKMLP--VKP--- 432
Query: 438 SPDNGSKGLKWQVFKEIAGIWGEADFVKS-GFVDHINTTKDTTDYLWYTTSIIVNENEEF 496
K W+ + E E+ + + G ++ +N T+DT+DYLWY TS+ ++ +E F
Sbjct: 433 ------KLFSWESYDEDLSSLAESSRITAPGLLEQLNVTRDTSDYLWYITSVDISSSESF 486
Query: 497 LKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMT 556
L+ G +P + ++S GHA+H F N + GSA G Y P+ L+AG N+IALLS+T
Sbjct: 487 LRGGQKPSINVQSAGHAVHVFVNGQFSGSAFGTREQRSCTYNGPVDLRAGANKIALLSVT 546
Query: 557 VGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNI 615
VGLQN G YE AGIT V + G + G DL+ W+YK+GL+GE + + +P +++
Sbjct: 547 VGLQNVGRHYETWEAGITGPVLLHGLDQGQKDLTWNKWSYKVGLRGEAMNLVSPNGVSSV 606
Query: 616 NWVSTMEPPKNQP-LTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRK 674
+WV + +++ L WYKA P G EP+ LD+ MGKG W+NG+ IGRYW ++
Sbjct: 607 DWVQESQATQSRSQLKWYKAYFDAPGGKEPLALDLESMGKGQVWINGQSIGRYWMAYAK- 665
Query: 675 SSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTK 734
+C C Y G F P KC GCG+P+QRWYH+PRSW KP++N++V+FEE GG+P K
Sbjct: 666 ----GDC-NSCTYSGTFRPVKCQLGCGQPTQRWYHVPRSWLKPTKNLIVVFEELGGNPWK 720
Query: 735 ITFSIRKISGFP 746
I+ +++++ P
Sbjct: 721 ISL-VKRVAHTP 731
>gi|222618730|gb|EEE54862.1| hypothetical protein OsJ_02342 [Oryza sativa Japonica Group]
Length = 839
Score = 784 bits (2025), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/727 (52%), Positives = 488/727 (67%), Gaps = 34/727 (4%)
Query: 29 TYDSRSLIINGRRELIISAAIHYPRSVP------------GMWPGLVQQAKEGGVNTIES 76
TYD +++++NG+R ++IS +IHYPRS P MWP L+++AK+GG++ +++
Sbjct: 27 TYDRKAVVVNGQRRILISGSIHYPRSTPEARRTRFPFLLLTMWPDLIEKAKDGGLDVVQT 86
Query: 77 YVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYI 136
YVFWNGHE SPG+YYF GR++LV FIK+++QA +Y+ LRIGP+V AE+N+GG PVWL Y+
Sbjct: 87 YVFWNGHEPSPGQYYFEGRYDLVHFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYV 146
Query: 137 PGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGG 196
PG FR D EPFK MQKF T IV+MMK E LF QGGPIIL+Q+ENE+G E GE
Sbjct: 147 PGISFRTDNEPFKAEMQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPA 206
Query: 197 KRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWP 256
K YA WAA MAVA N VPWIMC++ D PDP+INTCN FYCD F+P+ P P +WTE W
Sbjct: 207 KAYASWAANMAVALNTSVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWT 266
Query: 257 GWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYE 316
W+ FG PHRP ED+A+ VA+F QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+
Sbjct: 267 AWYTGFGIPVPHRPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYD 326
Query: 317 APIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAF 376
APIDEYGL R PKWGHLK+LH AIKLCE AL+ G+ SLG++Q++ V+ S+GACAAF
Sbjct: 327 APIDEYGLLREPKWGHLKQLHKAIKLCEPALVAGDPIVTSLGNAQKSSVFRSSTGACAAF 386
Query: 377 LANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSE 436
L N D + V F + Y LP WS+SILPDCK VFNTA V +Q S ++M
Sbjct: 387 LENKDKVSYARVAFNGMHYDLPPWSISILPDCKTTVFNTARVGSQISQMKM--------- 437
Query: 437 ASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEF 496
+ G WQ + E +GE G ++ IN T+D TDYLWYTT + V ++E+F
Sbjct: 438 ----EWAGGFAWQSYNEEINSFGEDPLTTVGLLEQINVTRDNTDYLWYTTYVDVAQDEQF 493
Query: 497 LKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMT 556
L NG L + S GHALH F N +L+G+ G+ P Y + L AG N I+ LS+
Sbjct: 494 LSNGENLKLTVMSAGHALHIFINGQLKGTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIA 553
Query: 557 VGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNI 615
VGL N G +E AGI V + G N G DL+ WTY++GL+GE + +++ + +
Sbjct: 554 VGLPNVGEHFETWNAGILGPVTLDGLNEGRRDLTWQKWTYQVGLKGESMSLHSLSGSSTV 613
Query: 616 NWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKS 675
W EP + QPLTWYKA P GDEP+ LDM MGKG W+NG+ IGRYWP K+
Sbjct: 614 EW---GEPVQKQPLTWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWP--GYKA 668
Query: 676 SPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
S + CDYRG+++ KC T CG+ SQRWYH+PRSW P+ N+LVIFEE GGDPT I
Sbjct: 669 SGN---CGTCDYRGEYDETKCQTNCGDSSQRWYHVPRSWLSPTGNLLVIFEEWGGDPTGI 725
Query: 736 TFSIRKI 742
+ R I
Sbjct: 726 SMVKRSI 732
>gi|356564794|ref|XP_003550633.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 839
Score = 783 bits (2021), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/734 (51%), Positives = 488/734 (66%), Gaps = 24/734 (3%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
LL+ + ++T +VTYD +++++NG+R ++IS +IHYPRS P MWP L+Q+AK+GG+
Sbjct: 19 LLVLWVCAVT----ASVTYDHKAIVVNGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGL 74
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ I++YVFWNGHE SPGKYYF R++LVKFIK++QQA +Y+ LRIGP++ AE+N+GG PV
Sbjct: 75 DVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLYVHLRIGPYICAEWNFGGFPV 134
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL Y+PG FR D EPFK MQKF IV +MK EKLF +QGGPII++Q+ENEYG E
Sbjct: 135 WLKYVPGIAFRTDNEPFKAAMQKFTEKIVSIMKEEKLFQTQGGPIIMSQIENEYGPVEWE 194
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
G GK Y W ++MAV + GVPWIMC+Q DTPDP+I+TCN +YC+ FTP+ PK+W
Sbjct: 195 IGAPGKAYTKWFSQMAVGLDTGVPWIMCKQQDTPDPLIDTCNGYYCENFTPNKKYKPKMW 254
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TENW GW+ FGG P RP+ED+AFSVARF Q GGS NYYMYHGGTNF RT+ G FI T
Sbjct: 255 TENWTGWYTEFGGAVPRRPAEDMAFSVARFVQNGGSFVNYYMYHGGTNFDRTSSGLFIAT 314
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
SYDY+ PIDEYGL PKWGHL++LH AIKLCE AL++ + + G++ E V+ +SG
Sbjct: 315 SYDYDGPIDEYGLLNEPKWGHLRDLHKAIKLCEPALVSVDPTVTWPGNNLEVHVF-KTSG 373
Query: 372 ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPEN 431
ACAAFLAN D K+ +V F N Y LP WS+SILPDCK VFNTA + AQSS ++M N
Sbjct: 374 ACAAFLANYDTKSSASVKFGNGQYDLPPWSISILPDCKTAVFNTARLGAQSSLMKMTAVN 433
Query: 432 LQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKS-GFVDHINTTKDTTDYLWYTTSIIV 490
WQ + E E D + + + IN T+D+TDYLWY T + +
Sbjct: 434 ------------SAFDWQSYNEEPASSNEDDSLTAYALWEQINVTRDSTDYLWYMTDVNI 481
Query: 491 NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
+ NE F+KNG PVL + S GH LH N +L G+ G + + + L+ G N+I
Sbjct: 482 DANEGFIKNGQSPVLTVMSAGHVLHVLINDQLSGTVYGGLDSHKLTFSDSVKLRVGNNKI 541
Query: 551 ALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNP 609
+LLS+ VGL N GP +E AG+ V + G N GT DLS W+YKIGL+GE L +
Sbjct: 542 SLLSIAVGLPNVGPHFETWNAGVLGPVTLKGLNEGTRDLSKQKWSYKIGLKGEALNLNTV 601
Query: 610 GYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWP 669
+++ WV K QPL WYK P G++P+ LDM+ MGKG AW+NG IGR+WP
Sbjct: 602 SGSSSVEWVQGSLLAKQQPLAWYKTTFSTPAGNDPLALDMISMGKGQAWINGRSIGRHWP 661
Query: 670 RKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKG 729
+ + D C Y G + KC T CGEPSQRWYHIPRSW PS N LV+FEE G
Sbjct: 662 GYIARGNCGD-----CYYAGTYTDKKCRTNCGEPSQRWYHIPRSWLNPSGNYLVVFEEWG 716
Query: 730 GDPTKITFSIRKIS 743
GDPT IT R +
Sbjct: 717 GDPTGITLVKRTTA 730
>gi|186461094|gb|ACC78255.1| beta-galactosidase [Carica papaya]
Length = 721
Score = 782 bits (2020), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/735 (51%), Positives = 491/735 (66%), Gaps = 21/735 (2%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
L + F S + + V+YD +++IINGRR ++IS +IHYPRS P MWP L+Q AKEG
Sbjct: 6 LVLFLLFCSWL-WSVEATVSYDHKAIIINGRRRILISGSIHYPRSTPQMWPDLIQNAKEG 64
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ I++YVFWNGHE SPG YYF R++LVKFIK++ QA +Y+ LRIGP++ E+N+GG
Sbjct: 65 GLDVIQTYVFWNGHEPSPGNYYFEDRYDLVKFIKLVHQAGLYVHLRIGPYICGEWNFGGF 124
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL Y+PG FR D PFK MQKF IV+MMK EKLF QGGPII++Q+ENEYG E
Sbjct: 125 PVWLKYVPGIQFRTDNGPFKAQMQKFTEKIVNMMKAEKLFEPQGGPIIMSQIENEYGPIE 184
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G GK Y WAA+MAV GVPWIMC+Q D PDP+I+TCN FYC+ F P++ PK
Sbjct: 185 WEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQEDAPDPIIDTCNGFYCENFMPNANYKPK 244
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
++TE W GW+ FGG P+RP+ED+A+SVARF Q GS NYYMYHGGTNFGRTAGGPFI
Sbjct: 245 MFTEAWTGWYTEFGGPVPYRPAEDMAYSVARFIQNRGSFINYYMYHGGTNFGRTAGGPFI 304
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TSYDY+AP+DEYGL R PKWGHL++LH IKLCE +L++ + SLGS+QEA V+
Sbjct: 305 ATSYDYDAPLDEYGLRREPKWGHLRDLHKTIKLCEPSLVSVDPKVTSLGSNQEAHVFWTK 364
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
+ +CAAFLAN D K V F+N+ Y LP WSVSILPDCK VVFNTA V +Q S +M+
Sbjct: 365 T-SCAAFLANYDLKYSVRVTFQNLPYDLPPWSVSILPDCKTVVFNTAKVVSQGSLAKMIA 423
Query: 430 ENLQPSEASPDNGSKGLKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
N WQ + +E +A F K G + I+ T+D TDYLWY T +
Sbjct: 424 VN------------SAFSWQSYNEETPSANYDAVFTKDGLWEQISVTRDATDYLWYMTDV 471
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
+ +E FLKNG P+L + S GHALH F N +L G+ G +P + + L+AG N
Sbjct: 472 TIGPDEAFLKNGQDPILTVMSAGHALHVFVNGQLSGTVYGQLENPKLAFSGKVKLRAGVN 531
Query: 549 EIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
+++LLS+ VGL N G +E AG+ V + G NSGT D+S + W+YKIGL+GE L ++
Sbjct: 532 KVSLLSIAVGLPNVGLHFETWNAGVLGPVTLKGVNSGTWDMSKWKWSYKIGLKGEALSLH 591
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+++ WV + QPL WYK P G++P+ LDM MGKG W+NG+ IGR+
Sbjct: 592 TVSGSSSVEWVEGSLLAQRQPLIWYKTTFNAPVGNDPLALDMNSMGKGQIWINGQSIGRH 651
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
WP + S C+Y G ++ KC + CG+ SQRWYH+PRSW P+ N+LV+FEE
Sbjct: 652 WPGYKARGS-----CGACNYAGIYDEKKCHSNCGKASQRWYHVPRSWLNPTANLLVVFEE 706
Query: 728 KGGDPTKITFSIRKI 742
GGDPTKI+ R +
Sbjct: 707 WGGDPTKISLVKRVV 721
>gi|350538173|ref|NP_001234842.1| ss-galactosidase precursor [Solanum lycopersicum]
gi|4138141|emb|CAA10175.1| ss-galactosidase [Solanum lycopersicum]
Length = 724
Score = 782 bits (2020), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/728 (52%), Positives = 497/728 (68%), Gaps = 24/728 (3%)
Query: 15 FFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTI 74
FFSS +V+YD R++IING+R+++IS +IHYPRS P MWP L+Q+AK+GG++ I
Sbjct: 17 FFSS-----VKASVSYDDRAIIINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVI 71
Query: 75 ESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLH 134
E+YVFWNGH SPGKY F GR++LV+FIK++Q+A +Y+ LRIGP+V AE+N+GG PVWL
Sbjct: 72 ETYVFWNGHGPSPGKYNFEGRYDLVRFIKMVQRAGLYVNLRIGPYVCAEWNFGGFPVWLK 131
Query: 135 YIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGE 194
Y+PG FR + +PFK M+ F+ IV+MMK E LF SQGGPII+AQ+ENEYG E G
Sbjct: 132 YVPGMEFRTNNQPFKVAMRGFVQKIVNMMKSENLFESQGGPIIMAQIENEYGPVEWEIGA 191
Query: 195 GGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTEN 254
GK Y WAA+MAV GVPWIMC+Q D PDPVI+TCN FYC+ F P+ P PK+WTE
Sbjct: 192 PGKAYTKWAAQMAVGLKTGVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKPYKPKMWTEV 251
Query: 255 WPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYD 314
W GW+ FGG P RP+EDIAFSVARF Q GS NYYMYHGGTNFGRT+ G FI TSYD
Sbjct: 252 WTGWYTKFGGPIPQRPAEDIAFSVARFVQNNGSFFNYYMYHGGTNFGRTSSGLFIATSYD 311
Query: 315 YEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACA 374
Y+AP+DEYGL PK+GHL++LH AIKL E AL++ + SLGS+QEA VY SGACA
Sbjct: 312 YDAPLDEYGLLNEPKYGHLRDLHKAIKLSEPALVSSYAAVTSLGSNQEAHVYRSKSGACA 371
Query: 375 AFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQP 434
AFL+N D + V F+N Y+LP WS+SILPDCK V+NTA V +QSS+++M P
Sbjct: 372 AFLSNYDSRYSVKVTFQNRPYNLPPWSISILPDCKTAVYNTAQVNSQSSSIKMTP----- 426
Query: 435 SEASPDNGSKGLKWQVFKEIAGIWGEAD-FVKSGFVDHINTTKDTTDYLWYTTSIIVNEN 493
GL WQ + E ++D +G + N T+D++DYLWY T++ + N
Sbjct: 427 -------AGGGLSWQSYNEETPTADDSDTLTANGLWEQKNVTRDSSDYLWYMTNVNIASN 479
Query: 494 EEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALL 553
E FLKNG P L + S GH LH F N +L G+ G +P Y + L+AG N+I+LL
Sbjct: 480 EGFLKNGKDPYLTVMSAGHVLHVFVNGKLSGTVYGTLDNPKLTYSGNVKLRAGINKISLL 539
Query: 554 SMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYR 612
S++VGL N G Y+ AG+ V ++G N G+ +L+ W+YK+GL+GE L +++
Sbjct: 540 SVSVGLPNVGVHYDTWNAGVLGPVTLSGLNEGSRNLAKQKWSYKVGLKGESLSLHSLSGS 599
Query: 613 NNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKS 672
+++ WV + QPLTWYKA P G++P+ LDM MGKG W+NGE +GR+WP
Sbjct: 600 SSVEWVRGSLVAQKQPLTWYKATFNAPGGNDPLALDMASMGKGQIWINGEGVGRHWPGYI 659
Query: 673 RKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDP 732
+ +C +C Y G FN KC T CG+PSQRWYH+PRSW KPS N+LV+FEE GG+P
Sbjct: 660 AQG----DC-SKCSYAGTFNEKKCQTNCGQPSQRWYHVPRSWLKPSGNLLVVFEEWGGNP 714
Query: 733 TKITFSIR 740
T I+ R
Sbjct: 715 TGISLVRR 722
>gi|61162199|dbj|BAD91081.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 725
Score = 782 bits (2019), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/733 (52%), Positives = 494/733 (67%), Gaps = 21/733 (2%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
+++L+ FS I + +V YD +++IING+R ++IS +IHYPRS PGMWP L+Q+AK G
Sbjct: 9 WSILLLFSC-IFSAASASVGYDHKAIIINGQRRILISGSIHYPRSTPGMWPDLIQKAKAG 67
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ I++YVFWNGHE SPGKYYF R++LVKFIK++QQA +++ LRIGP+V AE+N+GG
Sbjct: 68 GLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGF 127
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
P+WL Y+PG FR D EPFK MQKF IV+MMK EKLF +QGGPIIL+Q+ENE+G E
Sbjct: 128 PIWLKYVPGIAFRTDNEPFKAAMQKFTEKIVNMMKAEKLFQTQGGPIILSQIENEFGPVE 187
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G GK Y WAA+MAV + GVPWIMC+Q D PDPVI+TCN +YC+ F P+ PK
Sbjct: 188 WEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGYYCENFKPNKVYKPK 247
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTE W GW+ FGG P RP+ED+AFSVARF Q GGS NYYMYHGGTNFGRTAGGPF+
Sbjct: 248 MWTEVWTGWYTEFGGAIPTRPAEDLAFSVARFIQSGGSFFNYYMYHGGTNFGRTAGGPFM 307
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TSYDY+AP+DEYGL + PKWGHL++LH AIK CEHAL+ + S LG++QEA V+
Sbjct: 308 ATSYDYDAPLDEYGLLQQPKWGHLRDLHKAIKSCEHALVAVDPSVTKLGNNQEAHVFNSK 367
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
SG CAAFLAN D K V F + Y LP WS+SILPDCK VFNTA V ++S V+M P
Sbjct: 368 SG-CAAFLANHDTKYSVRVSFGHGQYDLPPWSISILPDCKTAVFNTAKVAWKASEVQMKP 426
Query: 430 ENLQPSEASPDNGSKGLKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
+ L WQ F +E G + I T+D TDYLWY T I
Sbjct: 427 VYSR------------LPWQSFIEETTTSDETGTTTLDGLYEQIYMTRDATDYLWYMTDI 474
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
+ +E FLKNG P+L I S GHALH F N +L G+ G+ +P + + L+ G N
Sbjct: 475 TIGSDEAFLKNGKFPLLTIFSAGHALHVFINGQLSGTVYGSLENPKLTFSQNVKLRPGIN 534
Query: 549 EIALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
++ALLS++VGL N G +E W + + + G N+GT D+S + WTYKIG++GE LG++
Sbjct: 535 KLALLSISVGLPNVGTHFETWNTGVLGPISLKGLNTGTWDMSRWKWTYKIGMKGESLGLH 594
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
++++W + QPLTWYKA PPG P+ LDM MGKG W+NG+ +GR+
Sbjct: 595 TVTGSSSVDWAEGPSMAQKQPLTWYKATFDAPPGHAPLALDMGSMGKGQIWINGQSVGRH 654
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
WP + S C Y G FN KC T CG+PSQRWYHIPRSW P+ N+LV+FEE
Sbjct: 655 WPGYIAQGS-----CGNCYYAGTFNDKKCRTYCGKPSQRWYHIPRSWLTPTGNLLVVFEE 709
Query: 728 KGGDPTKITFSIR 740
GGDP+ ++ R
Sbjct: 710 WGGDPSWMSLVER 722
>gi|414881557|tpg|DAA58688.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 830
Score = 782 bits (2019), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/717 (52%), Positives = 488/717 (68%), Gaps = 23/717 (3%)
Query: 29 TYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPG 88
TYD +++++NG+R +++S +IHYPRSVP MWP L+Q+AK+GG++ +++YVFWNGHE S
Sbjct: 30 TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89
Query: 89 KYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPF 148
+YYF GR++LV FIK+++QA +Y+ LRIGP+V AE+N+GG PVWL Y+PG FR D EPF
Sbjct: 90 QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPF 149
Query: 149 KYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAV 208
K MQ F T IVDMMK E LF QGGPIIL+Q+ENE+G E GE K YA WAA MAV
Sbjct: 150 KAEMQNFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 209
Query: 209 AQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPH 268
A N VPW+MC++ D PDP+INTCN FYCD F+P+ P P +WTE W W+ FG PH
Sbjct: 210 ALNTSVPWVMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPH 269
Query: 269 RPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNP 328
RP ED+A+ VA+F QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+APIDEYGL R P
Sbjct: 270 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLREP 329
Query: 329 KWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTV 388
KWGHLKELH AIKLCE AL+ G+ SLG++Q+A V+ S+ AC AFL N D + V
Sbjct: 330 KWGHLKELHKAIKLCEPALVAGDPIVTSLGNAQQASVFRSSTDACVAFLENKDKVSYARV 389
Query: 389 VFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKW 448
F + Y LP WS+SILPDCK V+NTA+V +Q S ++M + G W
Sbjct: 390 SFNGMHYDLPPWSISILPDCKTTVYNTASVGSQISQMKM-------------EWAGGFTW 436
Query: 449 QVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIE 508
Q + E G+ F G ++ IN T+D TDYLWYTT + + ++E+FL NG P+L +
Sbjct: 437 QSYNEDINSLGDESFATVGLLEQINVTRDNTDYLWYTTYVDIAQDEQFLSNGKNPMLTVM 496
Query: 509 SKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEW 568
S GHALH F N +L G+ G+ P Y + L +G N I+ LS+ VGL N G +E
Sbjct: 497 SAGHALHIFVNGQLTGTVYGSVEDPKLTYSGNVKLWSGSNTISCLSIAVGLPNVGEHFET 556
Query: 569 VGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQ 627
AGI V + G N G DL+ WTYK+GL+GE L +++ +++ W EP + Q
Sbjct: 557 WNAGILGPVTLDGLNEGRRDLTWQKWTYKVGLKGEALSLHSLSGSSSVEW---GEPVQKQ 613
Query: 628 PLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDY 687
PL+WYKA P GDEP+ LDM MGKG W+NG+ IGRYWP + CDY
Sbjct: 614 PLSWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKASGT-----CGICDY 668
Query: 688 RGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKISG 744
RG+++ KC T CG+ SQRWYH+PRSW P+ N+LVIFEE GGDPT I+ +++I+G
Sbjct: 669 RGEYDEKKCQTNCGDSSQRWYHVPRSWLNPTGNLLVIFEEWGGDPTGISM-VKRIAG 724
>gi|168001886|ref|XP_001753645.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695052|gb|EDQ81397.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 929
Score = 781 bits (2016), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/736 (52%), Positives = 493/736 (66%), Gaps = 31/736 (4%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NVTYD R+LIING+R ++ISA IHYPR+ P MWP LVQ++KEGG + ++SYVFWNGHE
Sbjct: 34 NVTYDQRALIINGQRRMLISAGIHYPRATPEMWPSLVQKSKEGGADVVQSYVFWNGHEPK 93
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+Y F GR++LVKFIK++QQA +Y LRIGP+V AE+N+GG P WL IPG VFR D E
Sbjct: 94 QGQYNFEGRYDLVKFIKVVQQAGLYFHLRIGPYVCAEWNFGGFPYWLKDIPGIVFRTDNE 153
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK M+ F++ IV++MK +LFA QGGPII+AQ+ENEYG E +G+GGKRYA+WAA++
Sbjct: 154 PFKVAMEGFVSKIVNLMKENQLFAWQGGPIIMAQIENEYGNIEWAFGDGGKRYAMWAAEL 213
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
A+ + GVPW+MCQQ D P +INTCN +YCD F ++ + P WTE+W GWF+ +G
Sbjct: 214 ALGLDAGVPWVMCQQDDAPGNIINTCNGYYCDGFKANTATKPAFWTEDWNGWFQYWGQSV 273
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
PHRP ED AF++ARFFQ+GGS NYYMY GGTNF RTAGGPF+TTSYDY+AP+DEYGL R
Sbjct: 274 PHRPVEDNAFAIARFFQRGGSFQNYYMYFGGTNFARTAGGPFMTTSYDYDAPLDEYGLIR 333
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLS--LGSSQEADVYADSSGACAAFLANMDDKN 384
PKWGHL++LH AIKLCE AL + LS LG + EA VY+ G CAAFLAN+D
Sbjct: 334 QPKWGHLRDLHAAIKLCEPALTAVDEVPLSTWLGPNVEAHVYS-GRGQCAAFLANIDSWK 392
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEM------------VPENL 432
TV F+ +Y LP WSVSILPDCK VVFNTA V AQ++ M +P N+
Sbjct: 393 IATVQFKGKAYVLPPWSVSILPDCKNVVFNTAQVGAQTTLTRMTIVRSKLEGEVVMPSNM 452
Query: 433 QPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNE 492
A GLKW+ E GI G A V + ++ +N TKD+TDYLWY+ SI V+
Sbjct: 453 LRKHAPESIVGSGLKWEASVEPVGIRGAATLVSNRLLEQLNITKDSTDYLWYSISIKVSV 512
Query: 493 NE--EFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
K S+ +L++ S A+H F N++L GSA G+ + P+ LK GKN+I
Sbjct: 513 EAVTALSKTKSQAILVLGSMRDAVHIFVNRQLVGSAMGSDV----QVVQPVPLKEGKNDI 568
Query: 551 ALLSMTVGLQNAGPFYEWVGAGIT-SVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNP 609
LLSMTVGLQN G + E GAGI S + G SG LDLST W+Y++G+QGE ++
Sbjct: 569 DLLSMTVGLQNYGAYLETWGAGIRGSALLRGLPSGVLDLSTERWSYQVGIQGEEKRLFET 628
Query: 610 GYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWP 669
G + I W S+ P LTWYK P G +P+ LD+ MGKG AW+NG +GRYWP
Sbjct: 629 GTADGIQWDSSSSFPNASALTWYKTTFDAPKGTDPVALDLGSMGKGQAWVNGHHMGRYWP 688
Query: 670 RKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRW-----YHIPRSWFKPSENILVI 724
S CDYRG ++ DKC T CG+PSQRW YHIPR+W + S N+LV+
Sbjct: 689 SVLASQSG----CSTCDYRGAYDADKCRTNCGKPSQRWQYVDMYHIPRAWLQLSNNLLVL 744
Query: 725 FEEKGGDPTKITFSIR 740
FEE GGD +K++ R
Sbjct: 745 FEEIGGDVSKVSLVTR 760
>gi|308550950|gb|ADO34789.1| beta-galactosidase STBG4 [Solanum lycopersicum]
Length = 724
Score = 780 bits (2014), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/728 (52%), Positives = 497/728 (68%), Gaps = 24/728 (3%)
Query: 15 FFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTI 74
FFSS +V+YD R++IING+R+++IS +IHYPRS P MWP L+Q+AK+GG++ I
Sbjct: 17 FFSS-----VKASVSYDDRAIIINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVI 71
Query: 75 ESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLH 134
E+YVFWNGHE SPGKY F GR++LV+FIK++Q+A +Y+ LRIGP+V AE+N+GG PVWL
Sbjct: 72 ETYVFWNGHEPSPGKYNFEGRYDLVRFIKMVQRAGLYVNLRIGPYVCAEWNFGGFPVWLK 131
Query: 135 YIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGE 194
Y+PG FR + +PFK MQ F+ IV+MMK E LF SQGGPII+AQ+ENEYG E G
Sbjct: 132 YVPGMEFRTNNQPFKVAMQGFVQKIVNMMKSENLFESQGGPIIMAQIENEYGPVEWEIGA 191
Query: 195 GGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTEN 254
GK Y WAA+MAV GVPWIMC++ D PDPVI+TCN FYC+ F P+ P PK+WTE
Sbjct: 192 PGKAYTKWAAQMAVGLKTGVPWIMCKREDAPDPVIDTCNGFYCEGFRPNKPYKPKMWTEV 251
Query: 255 WPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYD 314
W GW+ FGG P RP+EDIAFSVARF Q GS NYYMYHGGTNFGRT+ G FI TSYD
Sbjct: 252 WTGWYTKFGGPIPQRPAEDIAFSVARFVQNNGSFFNYYMYHGGTNFGRTSSGLFIATSYD 311
Query: 315 YEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACA 374
Y+AP+DEYGL PK+GHL++LH AIKL E AL++ + SLGS+QEA VY SGACA
Sbjct: 312 YDAPLDEYGLLNEPKYGHLRDLHKAIKLSEPALVSSYAAVTSLGSNQEAHVYRSKSGACA 371
Query: 375 AFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQP 434
AFL+N D + V F+N Y+LP WS+SILPDCK V+NTA V +QSS+++M P
Sbjct: 372 AFLSNYDSRYSVKVTFQNRPYNLPPWSISILPDCKTAVYNTAQVNSQSSSIKMTP----- 426
Query: 435 SEASPDNGSKGLKWQVFKEIAGIWGEAD-FVKSGFVDHINTTKDTTDYLWYTTSIIVNEN 493
GL WQ + E ++D +G + N T+D++DYLWY T++ + N
Sbjct: 427 -------AGGGLSWQSYNEETPTADDSDTLTANGLWEQKNVTRDSSDYLWYMTNVNIASN 479
Query: 494 EEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALL 553
E FL+NG P L + S GH LH F N +L G+ G +P Y + L+AG N+I+LL
Sbjct: 480 EGFLRNGKDPYLTVMSAGHVLHVFVNGKLSGTVYGTLDNPKLTYSGNVKLRAGINKISLL 539
Query: 554 SMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYR 612
S++VGL N G Y+ AG+ V ++G N G+ +L+ W+YK+GL+GE L +++
Sbjct: 540 SVSVGLPNVGVHYDTWNAGVLGPVTLSGLNEGSRNLAKQKWSYKVGLKGESLSLHSLSGS 599
Query: 613 NNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKS 672
+++ WV + QPLTWYKA P G++P+ L M MGKG W+NGE +GR+WP
Sbjct: 600 SSVEWVRGSLVAQKQPLTWYKATFNAPGGNDPLALGMASMGKGQIWINGEGVGRHWPGYI 659
Query: 673 RKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDP 732
+ +C +C Y G FN KC T CG+PSQRW+H+PRSW KPS N+LV+FEE GG+P
Sbjct: 660 AQG----DC-SKCSYAGTFNEKKCQTNCGQPSQRWHHVPRSWLKPSGNLLVVFEEWGGNP 714
Query: 733 TKITFSIR 740
T I+ R
Sbjct: 715 TGISLVRR 722
>gi|15241969|ref|NP_200498.1| beta-galactosidase 4 [Arabidopsis thaliana]
gi|75265636|sp|Q9SCV8.1|BGAL4_ARATH RecName: Full=Beta-galactosidase 4; Short=Lactase 4; Flags:
Precursor
gi|6686880|emb|CAB64740.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|8809655|dbj|BAA97206.1| beta-galactosidase [Arabidopsis thaliana]
gi|332009434|gb|AED96817.1| beta-galactosidase 4 [Arabidopsis thaliana]
Length = 724
Score = 780 bits (2013), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/731 (52%), Positives = 489/731 (66%), Gaps = 22/731 (3%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
L I S++ +V+YD +++IING+R +++S +IHYPRS P MWPGL+Q+AKEGG+
Sbjct: 13 LAILCCLSLSCIVKASVSYDRKAVIINGQRRILLSGSIHYPRSTPEMWPGLIQKAKEGGL 72
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ IE+YVFWNGHE SPG+YYFG R++LVKFIK++ QA +Y+ LRIGP+V AE+N+GG PV
Sbjct: 73 DVIETYVFWNGHEPSPGQYYFGDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPV 132
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL ++PG FR D EPFK M+KF IV MMK EKLF +QGGPIILAQ+ENEYG E
Sbjct: 133 WLKFVPGMAFRTDNEPFKAAMKKFTEKIVWMMKAEKLFQTQGGPIILAQIENEYGPVEWE 192
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
G GK Y W A+MA+ + GVPWIMC+Q D P P+I+TCN +YC+ F P+S + PK+W
Sbjct: 193 IGAPGKAYTKWVAQMALGLSTGVPWIMCKQEDAPGPIIDTCNGYYCEDFKPNSINKPKMW 252
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TENW GW+ FGG P+RP EDIA+SVARF QKGGS+ NYYMYHGGTNF RTA G F+ +
Sbjct: 253 TENWTGWYTDFGGAVPYRPVEDIAYSVARFIQKGGSLVNYYMYHGGTNFDRTA-GEFMAS 311
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
SYDY+AP+DEYGLPR PK+ HLK LH AIKL E ALL+ + + SLG+ QEA V+ S
Sbjct: 312 SYDYDAPLDEYGLPREPKYSHLKALHKAIKLSEPALLSADATVTSLGAKQEAYVFWSKS- 370
Query: 372 ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPEN 431
+CAAFL+N D+ + V+FR Y LP WSVSILPDCK V+NTA V A S MVP
Sbjct: 371 SCAAFLSNKDENSAARVLFRGFPYDLPPWSVSILPDCKTEVYNTAKVNAPSVHRNMVP-- 428
Query: 432 LQPSEASPDNGSKGLKWQVFKEIAGIWGEA-DFVKSGFVDHINTTKDTTDYLWYTTSIIV 490
G+K W F E EA F ++G V+ I+ T D +DY WY T I +
Sbjct: 429 ---------TGTK-FSWGSFNEATPTANEAGTFARNGLVEQISMTWDKSDYFWYITDITI 478
Query: 491 NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
E FLK G P+L + S GHALH F N +L G+A G HP + I L AG N+I
Sbjct: 479 GSGETFLKTGDSPLLTVMSAGHALHVFVNGQLSGTAYGGLDHPKLTFSQKIKLHAGVNKI 538
Query: 551 ALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNP 609
ALLS+ VGL N G +E W + V + G NSGT D+S + W+YKIG++GE L ++
Sbjct: 539 ALLSVAVGLPNVGTHFEQWNKGVLGPVTLKGVNSGTWDMSKWKWSYKIGVKGEALSLHTN 598
Query: 610 GYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWP 669
+ + W K QPLTWYK+ P G+EP+ LDM MGKG W+NG IGR+WP
Sbjct: 599 TESSGVRWTQGSFVAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQVWINGRNIGRHWP 658
Query: 670 RKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKG 729
+ S C+Y G F+ KC++ CGE SQRWYH+PRSW K S+N++V+FEE G
Sbjct: 659 AYKAQGS-----CGRCNYAGTFDAKKCLSNCGEASQRWYHVPRSWLK-SQNLIVVFEELG 712
Query: 730 GDPTKITFSIR 740
GDP I+ R
Sbjct: 713 GDPNGISLVKR 723
>gi|3869280|gb|AAC77377.1| beta-galactosidase precursor [Carica papaya]
Length = 721
Score = 780 bits (2013), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/735 (51%), Positives = 490/735 (66%), Gaps = 21/735 (2%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
L + F S + + V+YD +++IINGRR ++IS +IHYPRS P MWP L+Q AKEG
Sbjct: 6 LVLFLLFCSWL-WSVEATVSYDHKAIIINGRRRILISGSIHYPRSTPQMWPDLIQNAKEG 64
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ I++YVFWNGHE SPG YYF R++LVKFIK++ QA +Y+ LRI P++ E+N+GG
Sbjct: 65 GLDVIQTYVFWNGHEPSPGNYYFEDRYDLVKFIKLVHQAGLYVHLRISPYICGEWNFGGF 124
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL Y+PG FR D PFK MQKF IV+MMK EKLF QGGPII++Q+ENEYG E
Sbjct: 125 PVWLKYVPGIQFRTDNGPFKAQMQKFTEKIVNMMKAEKLFEPQGGPIIMSQIENEYGPIE 184
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G GK Y WAA+MAV GVPWIMC+Q D PDP+I+TCN FYC+ F P++ PK
Sbjct: 185 WEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQEDAPDPIIDTCNGFYCENFMPNANYKPK 244
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
++TE W GW+ FGG P+RP+ED+A+SVARF Q GS NYYMYHGGTNFGRTAGGPFI
Sbjct: 245 MFTEAWTGWYTEFGGPVPYRPAEDMAYSVARFIQNRGSFINYYMYHGGTNFGRTAGGPFI 304
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TSYDY+AP+DEYGL R PKWGHL++LH IKLCE +L++ + SLGS+QEA V+
Sbjct: 305 ATSYDYDAPLDEYGLRREPKWGHLRDLHKTIKLCEPSLVSVDPKVTSLGSNQEAHVFWTK 364
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
+ +CAAFLAN D K V F+N+ Y LP WSVSILPDCK VVFNTA V +Q S +M+
Sbjct: 365 T-SCAAFLANYDLKYSVRVTFQNLPYDLPPWSVSILPDCKTVVFNTAKVVSQGSLAKMIA 423
Query: 430 ENLQPSEASPDNGSKGLKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
N WQ + +E +A F K G + I+ T+D TDYLWY T +
Sbjct: 424 VN------------SAFSWQSYNEETPSANYDAVFTKDGLWEQISVTRDATDYLWYMTDV 471
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
+ +E FLKNG P+L + S GHALH F N +L G+ G +P + + L+AG N
Sbjct: 472 TIGPDEAFLKNGQDPILTVMSAGHALHVFVNGQLSGTVYGQLENPKLAFSGKVKLRAGVN 531
Query: 549 EIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
+++LLS+ VGL N G +E AG+ V + G NSGT D+S + W+YKIGL+GE L ++
Sbjct: 532 KVSLLSIAVGLPNVGLHFETWNAGVLGPVTLKGVNSGTWDMSKWKWSYKIGLKGEALSLH 591
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+++ WV + QPL WYK P G++P+ LDM MGKG W+NG+ IGR+
Sbjct: 592 TVSGSSSVEWVEGSLLAQRQPLIWYKTTFNAPVGNDPLALDMNSMGKGQIWINGQSIGRH 651
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
WP + S C+Y G ++ KC + CG+ SQRWYH+PRSW P+ N+LV+FEE
Sbjct: 652 WPGYKARGS-----CGACNYAGIYDEKKCHSNCGKASQRWYHVPRSWLNPTANLLVVFEE 706
Query: 728 KGGDPTKITFSIRKI 742
GGDPTKI+ R +
Sbjct: 707 WGGDPTKISLVKRVV 721
>gi|449464712|ref|XP_004150073.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 848
Score = 780 bits (2013), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/739 (49%), Positives = 494/739 (66%), Gaps = 22/739 (2%)
Query: 10 FALLIFFSSSITYCFAG--NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAK 67
F + F S+ + NVTYD ++LIING+R+++ S +IHYPRSVP MW L+++AK
Sbjct: 10 FVVFFFLCWSLHFQLTNCENVTYDGKALIINGQRKILFSGSIHYPRSVPDMWESLIEKAK 69
Query: 68 EGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYG 127
GG++ +++YVFWN HE SPG Y F GR +LVKFIK++++A +Y+ LRIGP++ E+N+G
Sbjct: 70 MGGLDVVDTYVFWNLHEPSPGIYDFEGRNDLVKFIKLVEKAGLYVHLRIGPYICGEWNFG 129
Query: 128 GIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGY 187
G P WL ++PG FR D EPFK M KF IV MMK E+LF SQGGPIIL+Q+ENEY
Sbjct: 130 GFPAWLKFVPGISFRTDNEPFKLAMAKFTKKIVQMMKDERLFQSQGGPIILSQIENEYET 189
Query: 188 YESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSM 247
+ +GE G Y WAAKMAV + GVPW+MC+Q D PDP+INTCN FYCD F+P+ P
Sbjct: 190 EDKVFGEAGFAYMNWAAKMAVQMDTGVPWVMCKQDDAPDPMINTCNGFYCDYFSPNKPYK 249
Query: 248 PKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGP 307
P WTE W WF FGG + RP ED+AF VARF QKGGS+ NYYMYHGGTNFGRTAGGP
Sbjct: 250 PNFWTEAWTAWFNNFGGPNHKRPVEDLAFGVARFIQKGGSLVNYYMYHGGTNFGRTAGGP 309
Query: 308 FITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA 367
FITTSYDY+APIDEYGL R PK+GHLK LH A+KLCE ALL GE + +L + Q+A V++
Sbjct: 310 FITTSYDYDAPIDEYGLIRQPKFGHLKRLHDAVKLCEKALLTGEPHDYTLATYQKAKVFS 369
Query: 368 DSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEM 427
SSG CAAFL+N N V F Y LP WS+SILPDCK V++NTA V+ Q++ +
Sbjct: 370 SSSGDCAAFLSNYHSNNTARVTFNGRHYTLPPWSISILPDCKSVIYNTAQVQVQTNQLSF 429
Query: 428 VPENLQPSEASPDNGSKGLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTT 486
+P ++ W+ + E I+ I ++ G ++ + TKD +DYLWYTT
Sbjct: 430 LPTKVE-----------SFSWETYNENISSIEEDSSMSYDGLLEQLTITKDNSDYLWYTT 478
Query: 487 SIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAG 546
S+ V+ NE +L+ G P L SKGH +H F N +L GS+ G + F + I+L+AG
Sbjct: 479 SVNVDPNESYLRGGKFPTLTATSKGHGMHVFINGKLAGSSFGTHDNSKFTFTGRINLQAG 538
Query: 547 KNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLG 605
N+++LLS+ GL N GP YE G+ V I G + G +DLS W+YK+GL+GE++
Sbjct: 539 VNKVSLLSIAGGLPNNGPHYEEREMGVLGPVAIHGLDKGKMDLSRQKWSYKVGLKGENMN 598
Query: 606 IYNPGYRNNINWV-STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEI 664
+ +P ++W +++ QPLTWYKA P GDEP+ LDM M KG W+NG+ +
Sbjct: 599 LGSPSSVQAVDWAKDSLKQENAQPLTWYKAYFDAPEGDEPLALDMGSMQKGQVWINGQNV 658
Query: 665 GRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVI 724
GRYW + + C +C Y G + P KC GCG+P+Q+WYH+PRSW P++N++V+
Sbjct: 659 GRYW-----TITANGNCT-DCSYSGTYRPRKCQFGCGQPTQQWYHVPRSWLMPTKNLIVV 712
Query: 725 FEEKGGDPTKITFSIRKIS 743
FEE GG+P++I+ R ++
Sbjct: 713 FEEVGGNPSRISLVKRSVT 731
>gi|15451018|gb|AAK96780.1| beta-galactosidase [Arabidopsis thaliana]
gi|17978799|gb|AAL47393.1| beta-galactosidase [Arabidopsis thaliana]
Length = 724
Score = 779 bits (2012), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/731 (52%), Positives = 489/731 (66%), Gaps = 22/731 (3%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
L I S++ +V+YD +++IING+R +++S +IHYPRS P MWPGL+Q+AKEGG+
Sbjct: 13 LAILCCLSLSCIVKASVSYDRKAVIINGQRRILLSGSIHYPRSTPEMWPGLIQKAKEGGL 72
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ IE+YVFWNGHE SPG+YYFG R++LVKFIK++ QA +Y+ LRIGP+V AE+N+GG PV
Sbjct: 73 DVIETYVFWNGHEPSPGQYYFGDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPV 132
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL ++PG FR D EPFK M+KF IV MMK EKLF +QGGPIILAQ+ENEYG E
Sbjct: 133 WLKFVPGMAFRTDNEPFKAAMKKFTEKIVWMMKAEKLFQTQGGPIILAQIENEYGPVEWE 192
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
G GK Y W A+MA+ + GVPWIMC+Q D P P+I+TCN +YC+ F P+S + PK+W
Sbjct: 193 IGAPGKAYTKWVAQMALGLSTGVPWIMCKQEDAPGPIIDTCNGYYCEDFKPNSINKPKMW 252
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TENW GW+ FGG P+RP EDIA+SVARF QKGGS+ NYYMYHGGTNF RTA G F+ +
Sbjct: 253 TENWTGWYTDFGGAVPYRPVEDIAYSVARFIQKGGSLINYYMYHGGTNFDRTA-GEFMAS 311
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
SYDY+AP+DEYGLPR PK+ HLK LH AIKL E ALL+ + + SLG+ QEA V+ S
Sbjct: 312 SYDYDAPLDEYGLPREPKYSHLKALHKAIKLSEPALLSADATVTSLGAKQEAYVFWSKS- 370
Query: 372 ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPEN 431
+CAAFL+N D+ + V+FR Y LP WSVSILPDCK V+NTA V A S MVP
Sbjct: 371 SCAAFLSNKDENSAARVLFRGFPYDLPPWSVSILPDCKTEVYNTAKVNAPSVHRNMVP-- 428
Query: 432 LQPSEASPDNGSKGLKWQVFKEIAGIWGEA-DFVKSGFVDHINTTKDTTDYLWYTTSIIV 490
G+K W F E EA F ++G V+ I+ T D +DY WY T I +
Sbjct: 429 ---------TGTK-FSWGSFNEATPTANEAGTFARNGLVEQISMTWDKSDYFWYITDITI 478
Query: 491 NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
E FLK G P+L + S GHALH F N +L G+A G HP + I L AG N+I
Sbjct: 479 GSGETFLKTGDSPLLTVMSAGHALHVFVNGQLSGTAYGGLDHPKLTFSQKIKLHAGVNKI 538
Query: 551 ALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNP 609
ALLS+ VGL N G +E W + V + G NSGT D+S + W+YKIG++GE L ++
Sbjct: 539 ALLSVAVGLPNVGTHFEQWNKGVLGPVTLKGVNSGTWDMSKWKWSYKIGVKGEALSLHTN 598
Query: 610 GYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWP 669
+ + W K QPLTWYK+ P G+EP+ LDM MGKG W+NG IGR+WP
Sbjct: 599 TESSGVRWTQGSFVAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQVWINGRNIGRHWP 658
Query: 670 RKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKG 729
+ S C+Y G F+ KC++ CGE SQRWYH+PRSW K S+N++V+FEE G
Sbjct: 659 AYKAQGS-----CGRCNYAGTFDAKKCLSNCGEASQRWYHVPRSWLK-SQNLIVVFEELG 712
Query: 730 GDPTKITFSIR 740
GDP I+ R
Sbjct: 713 GDPNGISLVKR 723
>gi|84579371|dbj|BAE72074.1| pear beta-galactosidase2 [Pyrus communis]
Length = 725
Score = 779 bits (2011), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/733 (52%), Positives = 493/733 (67%), Gaps = 21/733 (2%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
+++L+ FS I + +V YD +++IING+R ++IS +IHYPRS PGMWP L+Q+AK G
Sbjct: 9 WSILLLFSC-IFSAASASVGYDHKAIIINGQRRILISGSIHYPRSTPGMWPDLIQKAKAG 67
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ I++YVFWNGHE SPGKYYF R++LVKFIK++QQA +++ LRIGP+V AE+N+GG
Sbjct: 68 GLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGF 127
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
P+WL Y+PG FR D EPFK MQKF IV+MMK EKLF +QGGPIIL+Q+ENE+G E
Sbjct: 128 PIWLKYVPGIAFRTDNEPFKAAMQKFTEKIVNMMKAEKLFQTQGGPIILSQIENEFGPVE 187
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G GK Y WAA+MAV + GVPWIMC+Q D PDPVI+TCN +YC+ F P+ PK
Sbjct: 188 WEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGYYCENFKPNKVYKPK 247
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTE W GW+ FGG P RP+ED+AFSVARF Q GGS NYYMYHGGTNFGRTAGGPF+
Sbjct: 248 MWTEVWTGWYTEFGGAIPTRPAEDLAFSVARFIQSGGSFFNYYMYHGGTNFGRTAGGPFM 307
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TSYDY+AP+DEYGL + PKWGHL++LH AIK CEHAL+ + S LG++QEA V+
Sbjct: 308 ATSYDYDAPLDEYGLLQQPKWGHLRDLHKAIKSCEHALVAVDPSVTKLGNNQEAHVFNSK 367
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
SG CAAFLAN D K V F + Y LP WS+SILPDCK VFNTA V ++S V+M P
Sbjct: 368 SG-CAAFLANYDTKYSVRVSFGHGQYDLPPWSISILPDCKTAVFNTAKVAWKASEVQMKP 426
Query: 430 ENLQPSEASPDNGSKGLKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
+ L WQ F +E G + I T+D TDYLWY T I
Sbjct: 427 VYSR------------LPWQSFIEETTTSDETGTTTLDGLYEQIYMTRDATDYLWYMTDI 474
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
+ +E FLKNG P+L I S GHALH F N +L G+ G+ +P + + L+ G N
Sbjct: 475 TIGSDEAFLKNGKFPLLTIFSAGHALHVFINGQLSGTVYGSLENPKLTFSQNVKLRPGIN 534
Query: 549 EIALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
++ALLS++VGL N G +E W + + + G N+GT D+S + WTYKIG++GE LG++
Sbjct: 535 KLALLSISVGLPNVGTHFETWNTGVLGPISLKGLNTGTWDMSRWKWTYKIGMKGESLGLH 594
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
++++W + QPLTWYKA PPG P+ LDM MGKG W+NG+ +GR+
Sbjct: 595 TVTGSSSVDWAEGPSMAQKQPLTWYKATFDAPPGHAPLALDMGSMGKGQIWINGQSVGRH 654
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
WP + S C Y G FN KC T CG+PSQRW HIPRSW P+ N+LV+FEE
Sbjct: 655 WPGYIAQGS-----CGNCYYAGTFNDKKCRTYCGKPSQRWCHIPRSWLTPTGNLLVVFEE 709
Query: 728 KGGDPTKITFSIR 740
GGDP+ ++ R
Sbjct: 710 WGGDPSWMSLVER 722
>gi|7682680|gb|AAF67342.1| beta galactosidase [Vigna radiata]
Length = 739
Score = 778 bits (2009), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/732 (50%), Positives = 495/732 (67%), Gaps = 22/732 (3%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
++F S + +C +VTYD +++IING+R ++IS +IHYPRS P MW L+++AK GG++
Sbjct: 16 ILFLGSELIHC---SVTYDRKAIIINGQRRILISGSIHYPRSTPEMWEDLIRKAKGGGLD 72
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
I++YVFWN HE SPG Y F GR++LV+FIK +Q+ +Y+ LRIGP+V AE+N+GG PVW
Sbjct: 73 AIDTYVFWNVHEPSPGIYNFEGRYDLVRFIKTVQRVGLYVHLRIGPYVCAEWNFGGFPVW 132
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFY 192
L Y+PG FR D PFK MQ F IV MMK EKLF SQGGPIIL+Q+ENEYG
Sbjct: 133 LKYVPGISFRTDNGPFKAAMQGFTQKIVQMMKNEKLFQSQGGPIILSQIENEYGSESKQL 192
Query: 193 GEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWT 252
G G Y WAAKMAV N GVPW+MC+Q D PDPVIN CN FYCD F+P+ P P +WT
Sbjct: 193 GGAGYAYTNWAAKMAVGLNTGVPWVMCKQDDAPDPVINACNGFYCDYFSPNKPYKPTLWT 252
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTS 312
E+W GWF FGG RP +D+AF+VARF QKGGS NYYMYHGGTNFGR+AGGPFITTS
Sbjct: 253 ESWSGWFTEFGGPIYQRPVQDLAFAVARFIQKGGSYINYYMYHGGTNFGRSAGGPFITTS 312
Query: 313 YDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGA 372
YDY+APIDEYGL R PK+GHL +LH AIK CE AL++ + + SLG+ ++A V++ +GA
Sbjct: 313 YDYDAPIDEYGLIREPKYGHLMDLHKAIKQCERALVSSDPTVTSLGAYEQAHVFSSKNGA 372
Query: 373 CAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENL 432
CAAFLAN + V F N Y LP WS+SILPDCK VFNTA VR Q++ ++M+P N
Sbjct: 373 CAAFLANYHSNSAARVTFNNRKYDLPPWSISILPDCKTDVFNTARVRFQTTKIQMLPSN- 431
Query: 433 QPSEASPDNGSKGLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVN 491
SK W+ + E ++ + + SG ++ +N T+DT+DYLWY TS+ ++
Sbjct: 432 ----------SKLFSWETYDEDVSSLSESSKITASGLLEQLNATRDTSDYLWYITSVDIS 481
Query: 492 ENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIA 551
+E FL+ G++P + + S GHA+H F N + GSA G + P++L+AG N+IA
Sbjct: 482 SSESFLRGGNKPSISVHSAGHAVHVFINGQFLGSAFGTSEDRSCTFNGPVNLRAGTNKIA 541
Query: 552 LLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGY 611
LLS+ VGL N G +E AGIT V + G + G DL+ W+Y+IGL+GE + + +P
Sbjct: 542 LLSVAVGLPNVGFHFETWKAGITGVLLYGLDHGQKDLTWQKWSYQIGLKGEAMNLVSPNG 601
Query: 612 RNNINWV-STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPR 670
++++WV +++ L W+KA P G EP+ LD+ MGKG W+NG+ IGRYW
Sbjct: 602 VSSVDWVRDSLDVRSQSQLKWHKAYFNAPDGVEPLALDLSSMGKGQVWINGQSIGRYWMV 661
Query: 671 KSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGG 730
++ + C+Y G + P KC GCG+P+Q+WYH+PRSW KP+ N++V+ EE GG
Sbjct: 662 YAKGA------CNSCNYAGTYRPAKCQLGCGQPTQQWYHVPRSWLKPTNNLIVLLEELGG 715
Query: 731 DPTKITFSIRKI 742
+P KI+ R I
Sbjct: 716 NPWKISLQKRII 727
>gi|2961390|emb|CAA18137.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 853
Score = 778 bits (2009), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/728 (51%), Positives = 484/728 (66%), Gaps = 42/728 (5%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD ++L+ING+R ++ S +IHYPRS P MW L+Q+AK+GG++ IE+YVFWN HE SP
Sbjct: 33 VTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGIDVIETYVFWNLHEPSP 92
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GKY F GR +LV+F+K I +A +Y LRIGP+V AE+N+GG PVWL Y+PG FR D EP
Sbjct: 93 GKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 152
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK M+ F IV++MK E LF SQGGPIIL+Q+ENEYG G G Y WAAKMA
Sbjct: 153 FKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQLLGAEGHNYMTWAAKMA 212
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
+A GVPW+MC++ D PDPVINTCN FYCD F P+ P P IWTE W GWF FGG
Sbjct: 213 IATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNKPYKPLIWTEAWSGWFTEFGGPMH 272
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
HRP +D+AF VARF QKGGS NYYMYHGGTNFGRTAGGPF+TTSYDY+APIDEYGL R
Sbjct: 273 HRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYGLIRQ 332
Query: 328 PKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQE--------ADVYADSSGACAAFLAN 379
PK+GHLKELH AIK+CE AL++ + S+G+ Q+ A VY+ SG C+AFLAN
Sbjct: 333 PKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQQVWIYYERFAHVYSAESGDCSAFLAN 392
Query: 380 MDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASP 439
D ++ V+F NV Y+LP WS+SILPDC+ VFNTA V
Sbjct: 393 YDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKV--------------------- 431
Query: 440 DNGSKGLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLK 498
+W+ + E ++ + + F G ++ IN T+DT+DYLWY TS+ + ++E FL
Sbjct: 432 ----SNFQWESYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSVDIGDSESFLH 487
Query: 499 NGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVG 558
G P L+I+S GHA+H F N +L GSA G + F Y+ I+L +G N IALLS+ VG
Sbjct: 488 GGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSVAVG 547
Query: 559 LQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINW 617
L N G +E GI V + G + G +DLS WTY++GL+GE + + P +I W
Sbjct: 548 LPNVGGHFESWNTGILGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMNLAFPTNTPSIGW 607
Query: 618 V-STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSS 676
+ +++ K QPLTW+K P G+EP+ LDM MGKG W+NGE IGRYW +
Sbjct: 608 MDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAFATGDC 667
Query: 677 PHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKIT 736
H C Y G + P+KC TGCG+P+QRWYH+PR+W KPS+N+LVIFEE GG+P+ ++
Sbjct: 668 SH------CSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPSTVS 721
Query: 737 FSIRKISG 744
R +SG
Sbjct: 722 LVKRSVSG 729
>gi|448278449|gb|AGE44111.1| beta-galactosidase 101 [Malus x domestica]
Length = 725
Score = 778 bits (2009), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/730 (52%), Positives = 486/730 (66%), Gaps = 20/730 (2%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
++ S I + +V YD +++IING+R ++IS +IHYPRS P MWP L+Q+AK GG++
Sbjct: 11 ILLLLSCIFSAASASVGYDHKAIIINGQRRILISGSIHYPRSTPEMWPDLIQKAKAGGLD 70
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
I++YVFWNGHE SPGKYYF R++LVKFIK++QQA +++ LRIGP+V AE+N+GG P+W
Sbjct: 71 VIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGFPIW 130
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFY 192
L Y+PG FR D EPFK MQKF IV+MMK EKLF ++GGPIIL+Q+ENEYG E
Sbjct: 131 LKYVPGIAFRTDNEPFKAAMQKFTEKIVNMMKAEKLFQTEGGPIILSQIENEYGPVEWEI 190
Query: 193 GEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWT 252
G GK Y WAA+MAV N GVPWIMC+Q D PDPVI+TCN +YC+ F P+ PK+WT
Sbjct: 191 GAPGKAYTKWAAQMAVGLNTGVPWIMCKQEDAPDPVIDTCNGYYCENFKPNKVYKPKMWT 250
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTS 312
E W GW+ FGG P RP ED+AFSVARF Q GGS NYYMYHGGTNFGRTAGGPF+ TS
Sbjct: 251 EVWTGWYTEFGGAIPTRPVEDLAFSVARFIQSGGSFFNYYMYHGGTNFGRTAGGPFMATS 310
Query: 313 YDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGA 372
YDY+AP+DEYGL + PKWGHLK+LH AIK CE+AL+ + S LG++QEA V+ SG
Sbjct: 311 YDYDAPLDEYGLLQQPKWGHLKDLHKAIKSCEYALVAVDPSVTKLGNNQEAHVFNTKSG- 369
Query: 373 CAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENL 432
CAAFLAN D K V F Y LP WS+SILPDCK VFNTA V ++S V+M P
Sbjct: 370 CAAFLANYDTKYPVRVSFGQGQYDLPPWSISILPDCKTAVFNTAKVTWKTSQVQMKPVYS 429
Query: 433 QPSEASPDNGSKGLKWQVFKEIAGIWGEADFVK-SGFVDHINTTKDTTDYLWYTTSIIVN 491
+ L WQ F E E+ G + I T+D TDYLWY T I +
Sbjct: 430 R------------LPWQSFIEETTTSDESGTTTLDGLYEQIYMTRDATDYLWYMTDITIG 477
Query: 492 ENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIA 551
+E FL NG P+L I S HALH F N +L G+ G+ +P + + L+ G N++A
Sbjct: 478 SDEAFLNNGKFPLLTIFSACHALHVFINGQLSGTVYGSLENPKLTFSQNVKLRPGINKLA 537
Query: 552 LLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPG 610
LLS++VGL N G +E AG+ + + G N+GT D+S + WTYKIG++GE LG++
Sbjct: 538 LLSISVGLPNVGTHFETWNAGVLGPISLKGLNTGTWDMSRWKWTYKIGMKGEALGLHTVT 597
Query: 611 YRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPR 670
++++W K QPLTWYKA PPG P+ LDM MGKG W+NG+ +GR+WP
Sbjct: 598 GSSSVDWAEGPSMAKKQPLTWYKATFNAPPGHAPLALDMGSMGKGQIWINGQSVGRHWPG 657
Query: 671 KSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGG 730
+ S C+Y G F KC T CG+PSQRWYHIPRSW P+ N+LV+FEE GG
Sbjct: 658 YIAQGS-----CGTCNYAGTFYDKKCRTYCGKPSQRWYHIPRSWLTPTGNLLVVFEEWGG 712
Query: 731 DPTKITFSIR 740
DP ++ R
Sbjct: 713 DPQWMSLVER 722
>gi|242093394|ref|XP_002437187.1| hypothetical protein SORBIDRAFT_10g022620 [Sorghum bicolor]
gi|241915410|gb|EER88554.1| hypothetical protein SORBIDRAFT_10g022620 [Sorghum bicolor]
Length = 725
Score = 778 bits (2008), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/714 (52%), Positives = 475/714 (66%), Gaps = 22/714 (3%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD R+++ING+R ++IS +IHYPRS P MWP L+Q+AK+GG++ +++YVFWNGHE
Sbjct: 31 VSYDHRAVVINGQRRILISGSIHYPRSTPEMWPDLLQKAKDGGLDVVQTYVFWNGHEPQQ 90
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G+YYFG R++LV+F+K+ +QA +++ LRIGP+V AE+N+GG PVWL Y+PG FR D P
Sbjct: 91 GQYYFGDRYDLVRFVKLAKQAGLFVHLRIGPYVCAEWNFGGFPVWLKYVPGVSFRTDNAP 150
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK MQ F+ IV MMK E LF QGGPIILAQVENEYG ES G G K YA WAAKMA
Sbjct: 151 FKAAMQAFVEKIVSMMKAEGLFEWQGGPIILAQVENEYGPMESVMGGGAKPYANWAAKMA 210
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
VA GVPW+MC+Q D PDPVINTCN FYCD F+P+S S P +WTE W GWF FGG P
Sbjct: 211 VATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNSNSKPTMWTEAWTGWFTAFGGAVP 270
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
HRP ED+AF+VARF QKGGS NYYMYHGGTNF RT+GGPFI TSYDY+APIDEYGL R
Sbjct: 271 HRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGLLRQ 330
Query: 328 PKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKT 387
PKWGHL++LH AIK E AL++G+ + ++G+ ++A VY SSGACAAFL+N
Sbjct: 331 PKWGHLRDLHKAIKQAEPALVSGDPTIQTIGNYEKAYVYKSSSGACAAFLSNYHTNAAAR 390
Query: 388 VVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLK 447
VVF Y LPAWS+S+LPDC+ VFNTA V + S+ M P + G
Sbjct: 391 VVFNGRRYDLPAWSISVLPDCRTAVFNTATVSSPSAPARMTP-------------AGGFS 437
Query: 448 WQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLI 507
WQ + E + F K G V+ ++ T D +DYLWYTT + +N NE+FLK+G P L I
Sbjct: 438 WQSYSEATNSLDDRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQLTI 497
Query: 508 ESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYE 567
S GHAL F N + G+A G P Y + + G N+I++LS VGL N G YE
Sbjct: 498 YSAGHALQVFVNGQSYGAAYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGTHYE 557
Query: 568 WVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKN 626
G+ V ++G N G DLS WTY+IGL GE LG+++ +++ W S
Sbjct: 558 AWNVGVLGPVTLSGLNEGKRDLSNQKWTYQIGLHGESLGVHSVAGSSSVEWGSAA---GK 614
Query: 627 QPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECD 686
QPLTW+KA P G+ P+ LDM MGKG AW+NG IGRYW K+ S C
Sbjct: 615 QPLTWHKAYFNAPSGNAPVALDMSSMGKGQAWVNGHHIGRYWSYKATGGS-----CGGCS 669
Query: 687 YRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
Y G ++ KC TGCG+ SQR+YH+PRSW PS N+LV+ EE GGD + + R
Sbjct: 670 YAGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVVLEEFGGDLSGVKLVTR 723
>gi|115450935|ref|NP_001049068.1| Os03g0165400 [Oryza sativa Japonica Group]
gi|122247496|sp|Q10RB4.1|BGAL5_ORYSJ RecName: Full=Beta-galactosidase 5; Short=Lactase 5; Flags:
Precursor
gi|108706354|gb|ABF94149.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113547539|dbj|BAF10982.1| Os03g0165400 [Oryza sativa Japonica Group]
gi|215717073|dbj|BAG95436.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 841
Score = 776 bits (2005), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/720 (51%), Positives = 490/720 (68%), Gaps = 21/720 (2%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD ++++++G+R ++ S +IHYPRS P MW GL+++AK+GG++ I++YVFWNGHE +P
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G Y F GR++LV+FIK +Q+A M++ LRIGP++ E+N+GG PVWL Y+PG FR D EP
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK MQ F IV MMK E LFASQGGPIIL+Q+ENEYG +G GK Y WAAKMA
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGKAYINWAAKMA 206
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
V + GVPW+MC++ D PDPVIN CN FYCD F+P+ P P +WTE W GWF FGG
Sbjct: 207 VGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSGWFTEFGGTIR 266
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
RP ED+AF VARF QKGGS NYYMYHGGTNFGRTAGGPFITTSYDY+AP+DEYGL R
Sbjct: 267 QRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 326
Query: 328 PKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKT 387
PK+GHLKELH A+KLCE L++ + + +LGS QEA V+ SSG CAAFLAN + +
Sbjct: 327 PKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVFRSSSG-CAAFLANYNSNSYAK 385
Query: 388 VVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLK 447
V+F N +Y LP WS+SILPDCK VVFNTA V Q++ ++M + G+ +
Sbjct: 386 VIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTNQMQMWAD-----------GASSMM 434
Query: 448 WQVFKEIAGIWGEADFVKS-GFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
W+ + E A + S G ++ +N T+DT+DYLWY TS+ V+ +E+FL+ G+ L
Sbjct: 435 WEKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPSEKFLQGGTPLSLT 494
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
++S GHALH F N +LQGSA G Y +L+AG N++ALLS+ GL N G Y
Sbjct: 495 VQSAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVACGLPNVGVHY 554
Query: 567 E-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVS-TMEPP 624
E W + V I G + G+ DL+ +W+Y++GL+GE + + + ++ W+ ++
Sbjct: 555 ETWNTGVVGPVVIHGLDEGSRDLTWQTWSYQVGLKGEQMNLNSLEGSGSVEWMQGSLVAQ 614
Query: 625 KNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
QPL WY+A P GDEP+ LDM MGKG W+NG+ IGRYW + +C +
Sbjct: 615 NQQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYW-----TAYAEGDC-KG 668
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKISG 744
C Y G + KC GCG+P+QRWYH+PRSW +P+ N+LV+FEE GGD +KI + R +SG
Sbjct: 669 CHYTGSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALAKRTVSG 728
>gi|3860321|emb|CAA10128.1| beta-galactosidase [Cicer arietinum]
Length = 745
Score = 776 bits (2005), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/728 (50%), Positives = 494/728 (67%), Gaps = 23/728 (3%)
Query: 18 SSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESY 77
S + +C +VTYD +++IING+R ++IS +IHYPRS P MW L+Q+AK GG++ I++Y
Sbjct: 21 SQLIHC---SVTYDRKAIIINGQRRILISGSIHYPRSTPEMWEDLIQKAKVGGLDVIDTY 77
Query: 78 VFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIP 137
VFWN HE SP Y F GR++LV+FIK +Q+ +Y+ LRIGP+V AE+N+GG PVWL Y+P
Sbjct: 78 VFWNVHEPSPSNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGFPVWLKYVP 137
Query: 138 GTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGK 197
G FR D PFK MQ F IV MMK EKLF SQGGPIIL+Q+ENEYG G G
Sbjct: 138 GISFRTDNGPFKAAMQGFTQKIVQMMKNEKLFQSQGGPIILSQIENEYGPQGRALGAVGH 197
Query: 198 RYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPG 257
Y+ WAAKMAV GVPW+MC++ D PDPVIN+CN FYCD F+P+ P PK+WTE+W G
Sbjct: 198 AYSNWAAKMAVGLGTGVPWVMCKEDDAPDPVINSCNGFYCDDFSPNKPYKPKLWTESWSG 257
Query: 258 WFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEA 317
WF FGG P RP++D+AF+VARF QKGGS NYYMYHGGTNFGR+AGGPFITTSYDY+A
Sbjct: 258 WFSEFGGPVPQRPAQDLAFAVARFIQKGGSFFNYYMYHGGTNFGRSAGGPFITTSYDYDA 317
Query: 318 PIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFL 377
PIDEYGL R PK+GHLK+LH AIK CEHAL++ + + SLG+ ++A V++ + CAAFL
Sbjct: 318 PIDEYGLLREPKYGHLKDLHKAIKQCEHALVSSDPTVTSLGAYEQAHVFSSGTQTCAAFL 377
Query: 378 ANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEA 437
AN + V F N Y LP WS+SILPDCK VFNTA VR Q+S ++M+P N
Sbjct: 378 ANYHSNSAARVTFNNRHYDLPPWSISILPDCKTDVFNTARVRFQNSKIQMLPSN------ 431
Query: 438 SPDNGSKGLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEF 496
SK L W+ + E ++ + + SG ++ IN T+DT+DYLWY TS+ ++ +E F
Sbjct: 432 -----SKLLSWETYDEDVSSLAESSRITASGLLEQINATRDTSDYLWYITSVDISPSESF 486
Query: 497 LKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMT 556
L+ G++P + + S G A+H F N + GSA G + PI+L AG N+IALLS+
Sbjct: 487 LRGGNKPSISVHSSGDAVHVFINGKFSGSAFGTREQRSCTFNGPINLHAGTNKIALLSVA 546
Query: 557 VGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNI 615
VGL N G +E GIT + + G + G DL+ W+Y++GL+GE + + +P +++
Sbjct: 547 VGLPNGGIHFESWKTGITGPILLHGLDHGQKDLTWQKWSYQVGLKGEAMNLVSPNGVSSV 606
Query: 616 NWVSTMEPPKNQP-LTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRK 674
+WV +NQP L W+KA P G+E + LDM MGKG W+NG+ IGRYW ++
Sbjct: 607 DWVRESLASQNQPQLKWHKAYFNAPDGNEALALDMSGMGKGQVWINGQSIGRYWLVYAKG 666
Query: 675 SSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTK 734
+ C+Y G + KC GCG+P+QRWYH+PRSW KP+ N++V+FEE GG+P K
Sbjct: 667 N------CNSCNYAGTYRQAKCQLGCGQPTQRWYHVPRSWLKPTNNLMVVFEELGGNPWK 720
Query: 735 ITFSIRKI 742
I+ R I
Sbjct: 721 ISLVKRTI 728
>gi|302824860|ref|XP_002994069.1| hypothetical protein SELMODRAFT_187747 [Selaginella moellendorffii]
gi|300138075|gb|EFJ04856.1| hypothetical protein SELMODRAFT_187747 [Selaginella moellendorffii]
Length = 741
Score = 776 bits (2004), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/738 (50%), Positives = 490/738 (66%), Gaps = 38/738 (5%)
Query: 24 FAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGH 83
+ V YD R LIING+ ++ISA+IHYPR+ P MW L+ AK GG++ IE+YVFW+GH
Sbjct: 22 LSDTVAYDHRGLIINGQHRMLISASIHYPRAAPQMWSQLISNAKAGGIDVIETYVFWDGH 81
Query: 84 ELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRN 143
+ + Y F GRF+LV F+K++ +A +Y LRIGP+V AE+N GG PVWL + G FR
Sbjct: 82 QPTRDTYNFEGRFDLVSFVKLVHEAGLYANLRIGPYVCAEWNLGGFPVWLKDVAGIEFRT 141
Query: 144 DTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
+ +PFK MQ F+ IV MMK +KLFA QGGPIILAQ+ENEYG ++ YG GK Y +WA
Sbjct: 142 NNQPFKAEMQTFVEKIVAMMKHDKLFAPQGGPIILAQIENEYGNIDAAYGAAGKEYMVWA 201
Query: 204 AKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFG 263
A M+ GVPWIMCQQ D PD +++TCN FYCD + P++ PK+WTENW GWF+ +G
Sbjct: 202 ANMSQGLGTGVPWIMCQQSDAPDYILDTCNGFYCDAWAPNNKKKPKMWTENWSGWFQKWG 261
Query: 264 GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYG 323
PHRP ED+AF+VARFFQ+GGS NYYMY GGTNFGR++GGP++TTSYDY+APIDE+G
Sbjct: 262 EASPHRPVEDVAFAVARFFQRGGSFQNYYMYFGGTNFGRSSGGPYVTTSYDYDAPIDEFG 321
Query: 324 LPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVY-ADSSGACAAFLANMDD 382
+ R PKWGHLK+LH AIKLCE AL + + + +SLG QEA VY + SSGACAAFLAN+D
Sbjct: 322 VIRQPKWGHLKQLHAAIKLCEAALGSNDPTYISLGQLQEAHVYGSTSSGACAAFLANIDS 381
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
+D TV F + +Y LPAWSVSILPDCK V NTA V Q++ M P
Sbjct: 382 SSDATVKFNSRTYLLPAWSVSILPDCKTVSHNTAKVDVQTAMPTMKPS------------ 429
Query: 443 SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
GL W+ + E G+W ++ V S ++ INTTKDT+DYLWYTTS+ +++ + +
Sbjct: 430 ITGLAWESYPEPVGVWSDSGIVASALLEQINTTKDTSDYLWYTTSLDISQAD---AASGK 486
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
+L +ES +H F N +L GSAS GT + PI L +G N +A+L TVGLQN
Sbjct: 487 ALLYLESMRDVVHVFVNGKLAGSASTKGTQLYAAVEQPIELASGHNSLAILCATVGLQNY 546
Query: 563 GPFYEWVGAGIT-SVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTM 621
GPF E GAGI SV + G SG +DL+ W +++GL+GE L I+ + W S +
Sbjct: 547 GPFIETWGAGINGSVIVKGLPSGQIDLTAEEWIHQVGLKGESLAIFTESGSQRVRWSSAV 606
Query: 622 EPPKNQPLTWYKAVVKQ-----------------PPGDEPIGLDMLKMGKGLAWLNGEEI 664
P+ Q L WYK + + P G++P+ LD+ MGKG AW+NG+ I
Sbjct: 607 --PQGQALVWYKVIFQHHGITCIVWIAMQAHFDSPSGNDPVALDLESMGKGQAWINGQSI 664
Query: 665 GRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVI 724
GR+WP S ++ C Q CDYRG ++ KC +GCG+PSQRWYH+PRSW + N++V+
Sbjct: 665 GRFWP--SLRAPDTAGCPQTCDYRGSYSSSKCRSGCGQPSQRWYHVPRSWLQDGGNLVVL 722
Query: 725 FEEKGGDPTKITFSIRKI 742
FEE+GG P+ ++F R +
Sbjct: 723 FEEEGGKPSGVSFVTRTV 740
>gi|15219534|ref|NP_175127.1| beta-galactosidase 5 [Arabidopsis thaliana]
gi|75192251|sp|Q9MAJ7.1|BGAL5_ARATH RecName: Full=Beta-galactosidase 5; Short=Lactase 5; Flags:
Precursor
gi|7767665|gb|AAF69162.1|AC007915_14 F27F5.20 [Arabidopsis thaliana]
gi|17979002|gb|AAL47461.1| At1g45130/F27F5_20 [Arabidopsis thaliana]
gi|20334754|gb|AAM16238.1| At1g45130/F27F5_20 [Arabidopsis thaliana]
gi|332193961|gb|AEE32082.1| beta-galactosidase 5 [Arabidopsis thaliana]
Length = 732
Score = 776 bits (2003), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/733 (50%), Positives = 489/733 (66%), Gaps = 22/733 (3%)
Query: 14 IFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNT 73
+ SS+ C +VTYD ++++ING R +++S +IHYPRS P MW L+++AK+GG++
Sbjct: 19 MLIGSSVIQC--SSVTYDKKAIVINGHRRILLSGSIHYPRSTPEMWEDLIKKAKDGGLDV 76
Query: 74 IESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWL 133
I++YVFWNGHE SPG Y F GR++LV+FIK IQ+ +Y+ LRIGP+V AE+N+GG PVWL
Sbjct: 77 IDTYVFWNGHEPSPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIGPYVCAEWNFGGFPVWL 136
Query: 134 HYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYG 193
Y+ G FR D PFK MQ F IV MMK + FASQGGPIIL+Q+ENE+ G
Sbjct: 137 KYVDGISFRTDNGPFKSAMQGFTEKIVQMMKEHRFFASQGGPIILSQIENEFEPDLKGLG 196
Query: 194 EGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTE 253
G Y WAAKMAV N GVPW+MC++ D PDP+INTCN FYCD FTP+ P P +WTE
Sbjct: 197 PAGHSYVNWAAKMAVGLNTGVPWVMCKEDDAPDPIINTCNGFYCDYFTPNKPYKPTMWTE 256
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSY 313
W GWF FGG P RP ED+AF VARF QKGGS NYYMYHGGTNFGRTAGGPFITTSY
Sbjct: 257 AWSGWFTEFGGTVPKRPVEDLAFGVARFIQKGGSYINYYMYHGGTNFGRTAGGPFITTSY 316
Query: 314 DYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGAC 373
DY+APIDEYGL + PK+ HLK+LH AIK CE AL++ + LG+ +EA V+ G+C
Sbjct: 317 DYDAPIDEYGLVQEPKYSHLKQLHQAIKQCEAALVSSDPHVTKLGNYEEAHVFTAGKGSC 376
Query: 374 AAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPE-NL 432
AFL N VVF N Y LPAWS+SILPDC+ VVFNTA V A++S V+MVP ++
Sbjct: 377 VAFLTNYHMNAPAKVVFNNRHYTLPAWSISILPDCRNVVFNTATVAAKTSHVQMVPSGSI 436
Query: 433 QPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNE 492
S A D ++IA G ++ +N T+DTTDYLWYTTS+ +
Sbjct: 437 LYSVARYD-----------EDIATYGNRGTITARGLLEQVNVTRDTTDYLWYTTSVDIKA 485
Query: 493 NEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIAL 552
+E FL+ G P L ++S GHA+H F N GSA G + F + + ++L+ G N+IAL
Sbjct: 486 SESFLRGGKWPTLTVDSAGHAVHVFVNGHFYGSAFGTRENRKFSFSSQVNLRGGANKIAL 545
Query: 553 LSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGY 611
LS+ VGL N GP +E W + SV + G + G DLS WTY+ GL+GE + + +P
Sbjct: 546 LSVAVGLPNVGPHFETWATGIVGSVVLHGLDEGNKDLSWQKWTYQAGLRGESMNLVSPTE 605
Query: 612 RNNINWVS-TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPR 670
++++W+ ++ QPLTWYKA P G+EP+ LD+ MGKG AW+NG+ IGRYW
Sbjct: 606 DSSVDWIKGSLAKQNKQPLTWYKAYFDAPRGNEPLALDLKSMGKGQAWINGQSIGRYWMA 665
Query: 671 KSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGG 730
++ +C C+Y G + +KC +GCGEP+QRWYH+PRSW KP N+LV+FEE GG
Sbjct: 666 FAK-----GDC-GSCNYAGTYRQNKCQSGCGEPTQRWYHVPRSWLKPKGNLLVLFEELGG 719
Query: 731 DPTKITFSIRKIS 743
D +K++ R ++
Sbjct: 720 DISKVSVVKRSVN 732
>gi|186510990|ref|NP_190852.2| beta-galactosidase 2 [Arabidopsis thaliana]
gi|332278160|sp|Q9LFA6.2|BGAL2_ARATH RecName: Full=Beta-galactosidase 2; Short=Lactase 2; Flags:
Precursor
gi|13605857|gb|AAK32914.1|AF367327_1 AT3g52840/F8J2_10 [Arabidopsis thaliana]
gi|6686876|emb|CAB64738.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|23308221|gb|AAN18080.1| At3g52840/F8J2_10 [Arabidopsis thaliana]
gi|332645478|gb|AEE78999.1| beta-galactosidase 2 [Arabidopsis thaliana]
Length = 727
Score = 776 bits (2003), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/731 (52%), Positives = 487/731 (66%), Gaps = 21/731 (2%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
L I SS+ + VTYD ++LIING+R ++IS +IHYPRS P MWP L+++AKEGG+
Sbjct: 13 LAILCFSSLIHSTEAVVTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKAKEGGL 72
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ I++YVFWNGHE SPG YYF R++LVKF K++ QA +Y+ LRIGP+V AE+N+GG PV
Sbjct: 73 DVIQTYVFWNGHEPSPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNFGGFPV 132
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL Y+PG VFR D EPFK MQKF IVDMMK EKLF +QGGPIIL+Q+ENEYG +
Sbjct: 133 WLKYVPGMVFRTDNEPFKIAMQKFTKKIVDMMKEEKLFETQGGPIILSQIENEYGPMQWE 192
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
G GK Y+ W A+MA+ + GVPWIMC+Q D P P+I+TCN FYC+ F P+S + PK+W
Sbjct: 193 MGAAGKAYSKWTAEMALGLSTGVPWIMCKQEDAPYPIIDTCNGFYCEGFKPNSDNKPKLW 252
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TENW GWF FGG P+RP EDIAFSVARF Q GGS NYYMY+GGTNF RTA G FI T
Sbjct: 253 TENWTGWFTEFGGAIPNRPVEDIAFSVARFIQNGGSFMNYYMYYGGTNFDRTA-GVFIAT 311
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
SYDY+APIDEYGL R PK+ HLKELH IKLCE AL++ + + SLG QE V+ S
Sbjct: 312 SYDYDAPIDEYGLLREPKYSHLKELHKVIKLCEPALVSVDPTITSLGDKQEIHVF-KSKT 370
Query: 372 ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPEN 431
+CAAFL+N D + V+FR Y LP WSVSILPDCK +NTA +RA + ++M+P
Sbjct: 371 SCAAFLSNYDTSSAARVMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRAPTILMKMIPT- 429
Query: 432 LQPSEASPDNGSKGLKWQVFKEIAGIWGEA-DFVKSGFVDHINTTKDTTDYLWYTTSIIV 490
S W+ + E + EA FVK G V+ I+ T+D TDY WY T I +
Sbjct: 430 -----------STKFSWESYNEGSPSSNEAGTFVKDGLVEQISMTRDKTDYFWYFTDITI 478
Query: 491 NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
+E FLK G P+L I S GHALH F N L G++ G ++ + I L G N++
Sbjct: 479 GSDESFLKTGDNPLLTIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQNIKLSVGINKL 538
Query: 551 ALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNP 609
ALLS VGL NAG YE GI V + G NSGT D+S + W+YKIGL+GE + ++
Sbjct: 539 ALLSTAVGLPNAGVHYETWNTGILGPVTLKGVNSGTWDMSKWKWSYKIGLRGEAMSLHTL 598
Query: 610 GYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWP 669
+ + W K QPLTWYK+ P G+EP+ LDM MGKG W+NG IGR+WP
Sbjct: 599 AGSSAVKWWIKGFVVKKQPLTWYKSSFDTPRGNEPLALDMNTMGKGQVWVNGHNIGRHWP 658
Query: 670 RKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKG 729
+ + + C+Y G +N KC++ CGEPSQRWYH+PRSW KP N+LVIFEE G
Sbjct: 659 AYTARGN-----CGRCNYAGIYNEKKCLSHCGEPSQRWYHVPRSWLKPFGNLLVIFEEWG 713
Query: 730 GDPTKITFSIR 740
GDP+ I+ R
Sbjct: 714 GDPSGISLVKR 724
>gi|16604400|gb|AAL24206.1| At1g45130/F27F5_20 [Arabidopsis thaliana]
Length = 732
Score = 775 bits (2002), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/733 (50%), Positives = 489/733 (66%), Gaps = 22/733 (3%)
Query: 14 IFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNT 73
+ SS+ C +VTYD ++++ING R +++S +IHYPRS P MW L+++AK+GG++
Sbjct: 19 MLIGSSVIQC--SSVTYDKKAIVINGHRRILLSGSIHYPRSTPEMWEDLIKKAKDGGLDV 76
Query: 74 IESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWL 133
I++YVFWNGHE SPG Y F GR++LV+FIK IQ+ +Y+ LRIGP+V AE+N+GG PVWL
Sbjct: 77 IDTYVFWNGHEPSPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIGPYVCAEWNFGGFPVWL 136
Query: 134 HYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYG 193
Y+ G FR D PFK MQ F IV MMK + FASQGGPIIL+Q+ENE+ G
Sbjct: 137 KYVDGISFRTDNGPFKSAMQGFTEKIVQMMKEHRFFASQGGPIILSQIENEFEPDLKGLG 196
Query: 194 EGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTE 253
G Y WAAKMAV N GVPW+MC++ D PDP+INTCN FYCD FTP+ P P +WTE
Sbjct: 197 PAGHSYVNWAAKMAVGLNTGVPWVMCKEDDAPDPIINTCNGFYCDYFTPNKPYKPTMWTE 256
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSY 313
W GWF FGG P RP ED+AF VARF QKGGS NYYMYHGGTNFGRTAGGPFITTSY
Sbjct: 257 AWSGWFTEFGGTVPKRPVEDLAFGVARFIQKGGSYINYYMYHGGTNFGRTAGGPFITTSY 316
Query: 314 DYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGAC 373
DY+APIDEYGL + PK+ HLK+LH AIK CE AL++ + LG+ +EA V+ G+C
Sbjct: 317 DYDAPIDEYGLVQEPKYSHLKQLHQAIKQCEAALVSSDPHVTKLGNYEEAHVFTAGKGSC 376
Query: 374 AAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPE-NL 432
AFL N VVF N Y LPAWS+SILPDC+ VVFNTA V A++S V+MVP ++
Sbjct: 377 VAFLTNYHMNAPAKVVFNNRHYTLPAWSISILPDCRNVVFNTATVAAKTSHVQMVPSGSI 436
Query: 433 QPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNE 492
S A D ++IA G ++ +N T+DTTDYLWYTTS+ +
Sbjct: 437 LYSVARYD-----------EDIATYGNRGTITARGLLEQVNVTRDTTDYLWYTTSVDIKA 485
Query: 493 NEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIAL 552
+E FL+ G P L ++S GHA+H F N GSA G + F + + ++L+ G N+IAL
Sbjct: 486 SESFLRGGKWPTLTVDSAGHAVHVFVNGHFYGSAFGTRENRKFSFSSQVNLRGGANKIAL 545
Query: 553 LSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGY 611
LS+ VGL N GP +E W + SV + G + G DLS WTY+ GL+GE + + +P
Sbjct: 546 LSVAVGLPNVGPHFETWATGIVGSVVLHGLDEGNKDLSWQKWTYQAGLRGESMNLVSPTE 605
Query: 612 RNNINWVS-TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPR 670
++++W+ ++ QPLTWYKA P G+EP+ LD+ MGKG AW+NG+ IGRYW
Sbjct: 606 DSSVDWIKGSLAKQNKQPLTWYKAYFDVPRGNEPLALDLKSMGKGQAWINGQSIGRYWMA 665
Query: 671 KSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGG 730
++ +C C+Y G + +KC +GCGEP+QRWYH+PRSW KP N+LV+FEE GG
Sbjct: 666 FAK-----GDC-GSCNYAGTYRQNKCQSGCGEPTQRWYHVPRSWLKPKGNLLVLFEELGG 719
Query: 731 DPTKITFSIRKIS 743
D +K++ R ++
Sbjct: 720 DISKVSVVKRSVN 732
>gi|108707233|gb|ABF95028.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 796
Score = 775 bits (2002), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/694 (54%), Positives = 493/694 (71%), Gaps = 17/694 (2%)
Query: 58 MWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIG 117
MWPGL+Q++K+GG++ IE+YVFW+ HE G+Y F GR +LV+F+K + A +Y+ LRIG
Sbjct: 1 MWPGLIQKSKDGGLDVIETYVFWDIHEAVRGQYDFEGRKDLVRFVKAVADAGLYVHLRIG 60
Query: 118 PFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPII 177
P+V AE+NYGG PVWLH++PG FR D E FK MQ+F +VD MK L+ASQGGPII
Sbjct: 61 PYVCAEWNYGGFPVWLHFVPGIKFRTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPII 120
Query: 178 LAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYC 237
L+Q+ENEYG +S YG GK Y WAA MAV+ + GVPW+MCQQ D PDP+INTCN FYC
Sbjct: 121 LSQIENEYGNIDSAYGAAGKAYMRWAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYC 180
Query: 238 DQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGG 297
DQFTP+S S PK+WTENW GWF +FGG P+RP+ED+AF+VARF+Q+GG+ NYYMYHGG
Sbjct: 181 DQFTPNSKSKPKMWTENWSGWFLSFGGAVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGG 240
Query: 298 TNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSL 357
TNFGR+ GGPFI TSYDY+APIDEYG+ R PKWGHL+++H AIKLCE AL+ E S SL
Sbjct: 241 TNFGRSTGGPFIATSYDYDAPIDEYGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSL 300
Query: 358 GSSQEADVY--ADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNT 415
G + EA VY AD+S CAAFLAN+D ++DKTV F +Y LPAWSVSILPDCK VV NT
Sbjct: 301 GQNTEATVYQTADNS-ICAAFLANVDAQSDKTVKFNGNTYKLPAWSVSILPDCKNVVLNT 359
Query: 416 ANVRAQSSTVEM--VPENLQPSEAS---PDNGSKGLKWQVFKEIAGIWGEADFVKSGFVD 470
A + +Q +T EM + ++Q ++ S P+ + G W E GI E K G ++
Sbjct: 360 AQINSQVTTSEMRSLGSSIQDTDDSLITPELATAG--WSYAIEPVGITKENALTKPGLME 417
Query: 471 HINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNG 530
INTT D +D+LWY+TSI+V +E +L NGS+ LL+ S GH L + N +L GSA G+
Sbjct: 418 QINTTADASDFLWYSTSIVVKGDEPYL-NGSQSNLLVNSLGHVLQIYINGKLAGSAKGSA 476
Query: 531 THPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLS 589
+ + P++L GKN+I LLS TVGL N G F++ VGAG+T VK++G N G L+LS
Sbjct: 477 SSSLISLQTPVTLVPGKNKIDLLSTTVGLSNYGAFFDLVGAGVTGPVKLSGPN-GALNLS 535
Query: 590 TYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDM 649
+ WTY+IGL+GE L +YNP + WVS P NQPL WYK P GD+P+ +D
Sbjct: 536 STDWTYQIGLRGEDLHLYNPS-EASPEWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDF 594
Query: 650 LKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYH 709
MGKG AW+NG+ IGRYWP +P CV C+YRG ++ +KC+ CG+PSQ YH
Sbjct: 595 TGMGKGEAWVNGQSIGRYWP---TNLAPQSGCVNSCNYRGAYSSNKCLKKCGQPSQTLYH 651
Query: 710 IPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
+PRS+ +P N LV+FE+ GGDP+ I+F+ R+ S
Sbjct: 652 VPRSFLQPGSNDLVLFEQFGGDPSMISFTTRQTS 685
>gi|108706355|gb|ABF94150.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 819
Score = 775 bits (2001), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/720 (51%), Positives = 490/720 (68%), Gaps = 21/720 (2%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD ++++++G+R ++ S +IHYPRS P MW GL+++AK+GG++ I++YVFWNGHE +P
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G Y F GR++LV+FIK +Q+A M++ LRIGP++ E+N+GG PVWL Y+PG FR D EP
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK MQ F IV MMK E LFASQGGPIIL+Q+ENEYG +G GK Y WAAKMA
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGKAYINWAAKMA 206
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
V + GVPW+MC++ D PDPVIN CN FYCD F+P+ P P +WTE W GWF FGG
Sbjct: 207 VGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSGWFTEFGGTIR 266
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
RP ED+AF VARF QKGGS NYYMYHGGTNFGRTAGGPFITTSYDY+AP+DEYGL R
Sbjct: 267 QRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 326
Query: 328 PKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKT 387
PK+GHLKELH A+KLCE L++ + + +LGS QEA V+ SSG CAAFLAN + +
Sbjct: 327 PKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVFRSSSG-CAAFLANYNSNSYAK 385
Query: 388 VVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLK 447
V+F N +Y LP WS+SILPDCK VVFNTA V Q++ ++M + G+ +
Sbjct: 386 VIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTNQMQMWAD-----------GASSMM 434
Query: 448 WQVFKEIAGIWGEADFVKS-GFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
W+ + E A + S G ++ +N T+DT+DYLWY TS+ V+ +E+FL+ G+ L
Sbjct: 435 WEKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPSEKFLQGGTPLSLT 494
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
++S GHALH F N +LQGSA G Y +L+AG N++ALLS+ GL N G Y
Sbjct: 495 VQSAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVACGLPNVGVHY 554
Query: 567 E-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVS-TMEPP 624
E W + V I G + G+ DL+ +W+Y++GL+GE + + + ++ W+ ++
Sbjct: 555 ETWNTGVVGPVVIHGLDEGSRDLTWQTWSYQVGLKGEQMNLNSLEGSGSVEWMQGSLVAQ 614
Query: 625 KNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
QPL WY+A P GDEP+ LDM MGKG W+NG+ IGRYW + +C +
Sbjct: 615 NQQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYW-----TAYAEGDC-KG 668
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKISG 744
C Y G + KC GCG+P+QRWYH+PRSW +P+ N+LV+FEE GGD +KI + R +SG
Sbjct: 669 CHYTGSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALAKRTVSG 728
>gi|6686882|emb|CAB64741.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 732
Score = 775 bits (2001), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/733 (50%), Positives = 489/733 (66%), Gaps = 22/733 (3%)
Query: 14 IFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNT 73
+ SS+ C +VTYD ++++ING R +++S +IHYPRS P MW L+++AK+GG++
Sbjct: 19 MLIGSSVIQC--SSVTYDKKAIVINGHRRILLSGSIHYPRSTPEMWEDLIKKAKDGGLDV 76
Query: 74 IESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWL 133
I++YVFWNGHE SPG Y F GR++LV+FIK IQ+ +Y+ LRIGP+V AE+N+GG PVWL
Sbjct: 77 IDTYVFWNGHEPSPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIGPYVCAEWNFGGFPVWL 136
Query: 134 HYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYG 193
Y+ G FR D PFK MQ F IV MMK + FASQGGPIIL+Q+ENE+ G
Sbjct: 137 KYVDGISFRTDNGPFKSAMQGFTEKIVQMMKEHRFFASQGGPIILSQIENEFEPDLKGLG 196
Query: 194 EGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTE 253
G Y WAAKMAV N GVPW+MC++ D PDP+INTCN FYCD FTP+ P P +WTE
Sbjct: 197 PAGHSYVNWAAKMAVGLNTGVPWVMCKEDDAPDPIINTCNGFYCDYFTPNKPYKPTMWTE 256
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSY 313
W GWF FGG P RP ED+AF VARF QKGGS NYYMYHGGTNFGRTAGGPFITTSY
Sbjct: 257 AWSGWFTEFGGTVPKRPVEDLAFGVARFIQKGGSYINYYMYHGGTNFGRTAGGPFITTSY 316
Query: 314 DYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGAC 373
DY+APIDEYGL + PK+ HLK+LH AIK CE AL++ + LG+ +EA V+ G+C
Sbjct: 317 DYDAPIDEYGLVQEPKYSHLKQLHQAIKQCEAALVSSDPHVTKLGNYEEAHVFTAGKGSC 376
Query: 374 AAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPE-NL 432
AFL N VVF N Y LPAWS+SILPDC+ VVFNTA V A++S V+MVP ++
Sbjct: 377 VAFLTNYHMNAPAKVVFNNRHYTLPAWSISILPDCRNVVFNTATVAAKTSHVQMVPSGSI 436
Query: 433 QPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNE 492
S A D ++IA G ++ +N T+DTTDYLWYTTS+ +
Sbjct: 437 LYSVARYD-----------EDIATYGNPGTITARGLLEQVNVTRDTTDYLWYTTSVDIKA 485
Query: 493 NEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIAL 552
+E FL+ G P L ++S GHA+H F N GSA G + F + + ++L+ G N+IAL
Sbjct: 486 SESFLRGGKWPTLTVDSAGHAVHVFVNGHFYGSAFGTRENRKFSFSSQVNLRGGANKIAL 545
Query: 553 LSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGY 611
LS+ VGL N GP +E W + SV + G + G DLS WTY+ GL+GE + + +P
Sbjct: 546 LSVAVGLPNVGPHFETWATGIVGSVALHGLDEGNKDLSWQKWTYQAGLRGESMNLVSPTE 605
Query: 612 RNNINWVS-TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPR 670
++++W+ ++ QPLTWYKA P G+EP+ LD+ MGKG AW+NG+ IGRYW
Sbjct: 606 DSSVDWIKGSLAKQNKQPLTWYKAYFDAPRGNEPLALDLKSMGKGQAWINGQSIGRYWMA 665
Query: 671 KSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGG 730
++ +C C+Y G + +KC +GCGEP+QRWYH+PRSW KP N+LV+FEE GG
Sbjct: 666 FAK-----GDC-GSCNYAGTYRQNKCQSGCGEPTQRWYHVPRSWLKPKGNLLVLFEELGG 719
Query: 731 DPTKITFSIRKIS 743
D +K++ R ++
Sbjct: 720 DISKVSVVKRSVN 732
>gi|212274513|ref|NP_001130532.1| uncharacterized protein LOC100191631 precursor [Zea mays]
gi|194689400|gb|ACF78784.1| unknown [Zea mays]
gi|224030521|gb|ACN34336.1| unknown [Zea mays]
gi|413922054|gb|AFW61986.1| beta-galactosidase isoform 1 [Zea mays]
gi|413922055|gb|AFW61987.1| beta-galactosidase isoform 2 [Zea mays]
gi|413954366|gb|AFW87015.1| beta-galactosidase isoform 1 [Zea mays]
gi|413954367|gb|AFW87016.1| beta-galactosidase isoform 2 [Zea mays]
Length = 722
Score = 775 bits (2000), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/714 (52%), Positives = 471/714 (65%), Gaps = 22/714 (3%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD R+++ING+R ++IS +IHYPRS P MWPGL+Q+AK+GG++ +++YVFWNGHE
Sbjct: 28 VSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTYVFWNGHEPVR 87
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G+YYFG R++LV+F+K+ +QA +Y+ LRIGP+V AE+N+GG PVWL Y+PG FR D P
Sbjct: 88 GQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGP 147
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK MQ F+ IV MMK E LF QGGPIILAQVENEYG ES G G K YA WAAKMA
Sbjct: 148 FKAAMQAFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGAGAKPYANWAAKMA 207
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
VA GVPW+MC+Q D PDPVINTCN FYCD F+P+S S P +WTE W GWF FGG P
Sbjct: 208 VATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNSNSKPTMWTEAWTGWFTAFGGAVP 267
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
HRP ED+AF+VARF QKGGS NYYMYHGGTNF RT+GGPFI TSYDY+APIDEYGL R
Sbjct: 268 HRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGLLRQ 327
Query: 328 PKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKT 387
PKWGHL++LH AIK E AL++G+ + SLG+ ++A V+ S GACAAFL+N
Sbjct: 328 PKWGHLRDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSSGGACAAFLSNYHTSAAAR 387
Query: 388 VVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLK 447
VVF Y LPAWS+S+LPDCK VFNTA V S+ M P + G
Sbjct: 388 VVFNGRRYDLPAWSISVLPDCKAAVFNTATVSEPSAPARMSP-------------AGGFS 434
Query: 448 WQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLI 507
WQ + E F K G V+ ++ T D +DYLWYTT + +N NE+FLK+G P L I
Sbjct: 435 WQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQLTI 494
Query: 508 ESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYE 567
S GH+L F N + G+ G P Y + + G N+I++LS VGL N G YE
Sbjct: 495 YSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGTHYE 554
Query: 568 WVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKN 626
G+ V ++G N G DLS WTY+IGL GE LG+ + +++ W S
Sbjct: 555 TWNVGVLGPVTLSGLNEGKRDLSDQKWTYQIGLHGESLGVQSVAGSSSVEWGSAA---GK 611
Query: 627 QPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECD 686
QPLTW+KA P GD P+ LDM MGKG AW+NG IGRYW K+ S C
Sbjct: 612 QPLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYWSYKASSSG-----CGGCS 666
Query: 687 YRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
Y G ++ KC TGCG+ SQR+YH+PRSW PS N+LV+ EE GGD + + R
Sbjct: 667 YAGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVMLEEFGGDLSGVKLVTR 720
>gi|357438127|ref|XP_003589339.1| Beta-galactosidase [Medicago truncatula]
gi|355478387|gb|AES59590.1| Beta-galactosidase [Medicago truncatula]
Length = 745
Score = 774 bits (1999), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/728 (51%), Positives = 492/728 (67%), Gaps = 24/728 (3%)
Query: 18 SSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESY 77
S + +C VTYD +++IING+R ++IS +IHYPRS P MW L+Q+AK+GG++ I++Y
Sbjct: 22 SEVIHC---TVTYDRKAIIINGQRRILISGSIHYPRSTPEMWEDLIQKAKDGGLDVIDTY 78
Query: 78 VFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIP 137
VFWN HE SPG Y F GR++LV+FIK +Q+ +Y+ LRIGP+V AE+N+GG PVWL Y+P
Sbjct: 79 VFWNVHEPSPGNYNFEGRYDLVQFIKTVQKKGLYVHLRIGPYVCAEWNFGGFPVWLKYVP 138
Query: 138 GTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGK 197
G FR D PFK MQ F IV MMK EKLF SQGGPIIL+Q+ENEYG G G
Sbjct: 139 GISFRTDNGPFKAAMQGFTQKIVQMMKNEKLFQSQGGPIILSQIENEYGPQGRALGASGH 198
Query: 198 RYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPG 257
Y+ WAAKMAV GVPW+MC++ D PDPVIN CN FYCD F+P+ P PK+WTE+W G
Sbjct: 199 AYSNWAAKMAVGLGTGVPWVMCKEDDAPDPVINACNGFYCDDFSPNKPYKPKLWTESWSG 258
Query: 258 WFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEA 317
WF FGG +P RP ED+AF+VARF QKGGS NYYMYHGGTNFGR+AGGPFITTSYDY+A
Sbjct: 259 WFSEFGGSNPQRPVEDLAFAVARFIQKGGSFFNYYMYHGGTNFGRSAGGPFITTSYDYDA 318
Query: 318 PIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFL 377
PIDEYGL R PK+GHLK+LH AIK CEHAL++ + + SLG+ ++A V++ S CAAFL
Sbjct: 319 PIDEYGLLREPKYGHLKDLHKAIKQCEHALVSSDPTVTSLGAYEQAHVFS-SGTTCAAFL 377
Query: 378 ANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEA 437
AN + V F N Y LP WS+SILPDC+ VFNTA +R Q S ++M+P N
Sbjct: 378 ANYHSNSAARVTFNNRHYDLPPWSISILPDCRTDVFNTARMRFQPSQIQMLPSN------ 431
Query: 438 SPDNGSKGLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEF 496
SK L W+ + E ++ + + S ++ I+ T+DT+DYLWY TS+ ++ +E F
Sbjct: 432 -----SKLLSWETYDEDVSSLAESSRITASRLLEQIDATRDTSDYLWYITSVDISSSESF 486
Query: 497 LKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMT 556
L+ ++P + + S G A+H F N + GSA G F + PI L+AG N+IALLS+
Sbjct: 487 LRGRNKPSISVHSSGDAVHVFINGKFSGSAFGTREDRSFTFNGPIDLRAGTNKIALLSVA 546
Query: 557 VGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNI 615
VGL N G +E +GIT V + + G DL+ W+Y++GL+GE + + +P +++
Sbjct: 547 VGLPNGGIHFESWKSGITGPVLLHDLDHGQKDLTGQKWSYQVGLKGEAMNLVSPNGVSSV 606
Query: 616 NWVSTMEPPKNQP-LTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRK 674
+WVS +NQP L W+KA P G EP+ LDM MGKG W+NG+ IGRYW ++
Sbjct: 607 DWVSESLASQNQPQLKWHKAHFNAPNGVEPLALDMSSMGKGQVWINGQSIGRYWMVYAKG 666
Query: 675 SSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTK 734
+ C+Y G + KC GCG+P+QRWYH+PRSW KP N++V+FEE GG+P K
Sbjct: 667 N------CNSCNYAGTYRQAKCQVGCGQPTQRWYHVPRSWLKPKNNLMVVFEELGGNPWK 720
Query: 735 ITFSIRKI 742
I+ R I
Sbjct: 721 ISLVKRII 728
>gi|357449771|ref|XP_003595162.1| Beta-galactosidase [Medicago truncatula]
gi|124360798|gb|ABN08770.1| Galactose-binding like [Medicago truncatula]
gi|355484210|gb|AES65413.1| Beta-galactosidase [Medicago truncatula]
Length = 726
Score = 773 bits (1996), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/731 (51%), Positives = 478/731 (65%), Gaps = 23/731 (3%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
L FF +T +VTYD ++++ING+R ++IS +IHYPRS P MWP L+Q+AK+GGV
Sbjct: 16 FLCFFVCYVT----ASVTYDHKAIVINGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGV 71
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ IE+YVFWNGHE S GKYYF RF+LVKFIK++QQA +Y+ LRIGP+V AE+N+GG PV
Sbjct: 72 DVIETYVFWNGHEPSQGKYYFEDRFDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPV 131
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL Y+PG FR D EPFK MQKF T IV +MK E LF SQGGPIIL+Q+ENEYG E
Sbjct: 132 WLKYVPGVAFRTDNEPFKAAMQKFTTKIVSIMKSENLFQSQGGPIILSQIENEYGPVEWE 191
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
G GK Y W ++MAV N GVPW+MC+Q D PDP+I+TCN +YC+ F+P+ PK+W
Sbjct: 192 IGAPGKSYTKWFSQMAVGLNTGVPWVMCKQEDAPDPIIDTCNGYYCENFSPNKNYKPKMW 251
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TENW GW+ FG P+RP+ED+AFSVARF Q GS NYYMYHGGTNFGRT+ G FI T
Sbjct: 252 TENWTGWYTDFGTAVPYRPAEDLAFSVARFVQNRGSYVNYYMYHGGTNFGRTSSGLFIAT 311
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
SYDY+APIDEYGL PKWGHL++LH AIK CE AL++ + + G + E +Y S G
Sbjct: 312 SYDYDAPIDEYGLISEPKWGHLRDLHKAIKQCESALVSVDPTVSWPGKNLEVHLYKTSFG 371
Query: 372 ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPEN 431
ACAAFLAN D + V F N Y LP WS+SILPDCK VFNTA VRA M P N
Sbjct: 372 ACAAFLANYDTGSWAKVAFGNGHYDLPPWSISILPDCKTEVFNTAKVRAPRVHRSMTPAN 431
Query: 432 LQPSEASPDNGSKGLKWQVFKEIAGIWGEA-DFVKSGFVDHINTTKDTTDYLWYTTSIIV 490
WQ + E GE+ + +G ++ ++ T D +DYLWY T + +
Sbjct: 432 ------------SAFNWQSYNEQPAFSGESGSWTANGLLEQLSQTWDKSDYLWYMTDVNI 479
Query: 491 NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
+ NE F+KNG PVL S GH LH F N + G+A G+ +P + N + L+ G N+I
Sbjct: 480 SPNEGFIKNGQNPVLTAMSAGHVLHVFINGQFWGTAYGSLDNPKLTFSNSVKLRVGNNKI 539
Query: 551 ALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNP 609
+LLS+ VGL N G YE G+ V + G N GT DLS W+YKIGL+GE L ++
Sbjct: 540 SLLSVAVGLSNVGVHYEKWNVGVLGPVTLKGLNEGTRDLSKQKWSYKIGLKGESLNLHTT 599
Query: 610 GYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWP 669
+++ W K QPLTWYK P G++P+ LDM MGKG W+NG+ IGR+WP
Sbjct: 600 SGSSSVKWTQGSFLSKKQPLTWYKTTFNAPAGNDPLALDMSSMGKGEIWVNGQSIGRHWP 659
Query: 670 RKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKG 729
+ + C+Y G F KC T CG+P+Q+WYHIPRSW PS N+LV+ EE G
Sbjct: 660 AYIARGN-----CGSCNYAGTFTDKKCRTNCGQPTQKWYHIPRSWLNPSGNVLVVLEEWG 714
Query: 730 GDPTKITFSIR 740
GDPT I+ R
Sbjct: 715 GDPTGISLVKR 725
>gi|242036825|ref|XP_002465807.1| hypothetical protein SORBIDRAFT_01g046160 [Sorghum bicolor]
gi|241919661|gb|EER92805.1| hypothetical protein SORBIDRAFT_01g046160 [Sorghum bicolor]
Length = 842
Score = 773 bits (1995), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/719 (51%), Positives = 489/719 (68%), Gaps = 22/719 (3%)
Query: 29 TYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPG 88
TYD ++++I+G+R ++ S +IHYPRS P MW GL+Q+AK+GG++ I++YVFWNGHE +PG
Sbjct: 28 TYDKKAVLIDGQRRILFSGSIHYPRSTPDMWEGLIQKAKDGGLDVIQTYVFWNGHEPTPG 87
Query: 89 KYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPF 148
YYF R++LV+FIK +Q+A +++ LRIGP++ E+N+GG PVWL Y+PG FR D EPF
Sbjct: 88 NYYFEERYDLVRFIKTVQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEPF 147
Query: 149 KYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAV 208
K MQ F IV MMK EKLFASQGGPIIL+Q+ENEYG G G+ Y WAAKMA+
Sbjct: 148 KTAMQGFTEKIVGMMKSEKLFASQGGPIILSQIENEYGPEGKELGAAGQAYINWAAKMAI 207
Query: 209 AQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPH 268
GVPW+MC++ D PDPVIN CN FYCD F+P+ P P +WTE W GWF FGG
Sbjct: 208 GLGTGVPWVMCKEEDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQ 267
Query: 269 RPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNP 328
RP ED+AF+VARF QKGGS NYYMYHGGTNFGRTAGGPFITTSYDY+APIDEYGL R P
Sbjct: 268 RPVEDLAFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLVREP 327
Query: 329 KWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTV 388
K HLKELH A+KLCE AL++ + + +LG+ QEA V+ SG CAAFLAN + + V
Sbjct: 328 KHSHLKELHRAVKLCEQALVSVDPAITTLGTMQEAHVFRSPSG-CAAFLANYNSNSYAKV 386
Query: 389 VFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKW 448
VF N Y LP WS+SILPDCK VVFN+A V Q+S ++M +G+ + W
Sbjct: 387 VFNNEQYSLPPWSISILPDCKNVVFNSATVGVQTSQMQMW-----------GDGASSMMW 435
Query: 449 QVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV-LL 506
+ + +E+ + +G ++ +N T+D++DYLWY TS+ ++ +E FL+ G +P+ L
Sbjct: 436 ERYDEEVDSLAAAPLLTTTGLLEQLNVTRDSSDYLWYITSVDISPSENFLQGGGKPLSLS 495
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
+ S GHALH F N ELQGSA G KY +L+AG N+IALLS+ GL N G Y
Sbjct: 496 VLSAGHALHVFVNGELQGSAYGTREDRRIKYNGNANLRAGTNKIALLSVACGLPNVGVHY 555
Query: 567 EWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVS-TMEPP 624
E G+ V + G N G+ DL+ +W+Y++GL+GE + + + ++ W+ ++
Sbjct: 556 ETWNTGVGGPVGLHGLNEGSRDLTWQTWSYQVGLKGEQMNLNSLEGSTSVEWMQGSLIAQ 615
Query: 625 KNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
QPL+WY+A + P GDEP+ LDM MGKG W+NG+ IGRYW ++ D +E
Sbjct: 616 NQQPLSWYRAYFETPSGDEPLALDMGSMGKGQIWINGQSIGRYW------TAYADGDCKE 669
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
C Y G F KC GCG+P+QRWYH+PRSW +P+ N+LV+FEE GGD +KI R +S
Sbjct: 670 CSYTGTFRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALVKRSVS 728
>gi|318136780|gb|ADV41669.1| beta-D-galactosidase [Actinidia deliciosa var. deliciosa]
Length = 728
Score = 773 bits (1995), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/733 (51%), Positives = 486/733 (66%), Gaps = 19/733 (2%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
F + F +VTYD +++ ING+R ++ S +IHYPRS P MWPGL+Q+AKEG
Sbjct: 11 FVCVGLFFLLCCCSVTASVTYDGKAIKINGQRRILFSGSIHYPRSTPEMWPGLIQKAKEG 70
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ I++YVFWNGHE SPG+YYF GR++LV+FIK+ QQA +Y+ LRIG +V AE+N+GG
Sbjct: 71 GLDVIQTYVFWNGHEPSPGQYYFEGRYDLVRFIKLAQQAGLYVHLRIGLYVCAEWNFGGF 130
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL Y+PG FR D PFK MQKF IV++MK EKLF SQGGPII++Q+ENEYG E
Sbjct: 131 PVWLKYVPGIAFRTDNGPFKAAMQKFTEKIVNLMKSEKLFESQGGPIIMSQIENEYGPVE 190
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G GK Y WAA+MAV + GVPWIMC+Q D PDP+I+TCN FYC+ FTP+ PK
Sbjct: 191 WEIGAPGKAYTKWAAEMAVGLDTGVPWIMCKQEDAPDPIIDTCNGFYCEGFTPNKNYKPK 250
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTE W GW+ FGG +RP ED+A+SVARF Q GS NYYMYHGGTNFGRTA G F+
Sbjct: 251 MWTEAWTGWYTEFGGPIHNRPVEDLAYSVARFIQNNGSFVNYYMYHGGTNFGRTAAGLFV 310
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TSYDY+APIDEYGLPR PKWGHL++LH AIKLCE +L++ + G + E V+
Sbjct: 311 ATSYDYDAPIDEYGLPREPKWGHLRDLHKAIKLCEPSLVSAYPTVTWPGKNLEVHVFKSK 370
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
S +CAAFLAN D + V F+N+ Y LP WS+SILPDCK VFNTA V ++SS ++M P
Sbjct: 371 S-SCAAFLANYDPSSPAKVTFQNMQYDLPPWSISILPDCKNAVFNTARVSSKSSQMKMTP 429
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFV-KSGFVDHINTTKDTTDYLWYTTSI 488
+ WQ + E ++D + K+G + I+ T+D +DYLWY T +
Sbjct: 430 VS-----------GGAFSWQSYIEETVSADDSDTIAKNGLWEQISITRDGSDYLWYLTDV 478
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
++ NE FLKNG PVL + S GHALH F N +L G+ G+ +P + N + L+AG N
Sbjct: 479 NIHPNEGFLKNGQSPVLTVMSAGHALHVFINGQLAGTVYGSLENPKLTFSNNVKLRAGIN 538
Query: 549 EIALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
+I+LLS VGL N G +E W + V + G N GT DL+ W+YK+GL+GE L ++
Sbjct: 539 KISLLSAAVGLPNVGLHFETWNTGVLGPVTLKGLNEGTRDLTKQKWSYKVGLKGEDLSLH 598
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+++ WV + QPLTWYKA P G++P+ LDM MGKG W+NGE IGR+
Sbjct: 599 TLSGSSSVEWVQGSLLAQKQPLTWYKATFNAPEGNDPLALDMNTMGKGQIWINGESIGRH 658
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
WP + C Y G + KC++ CGE SQRWYH+PRSW KPS N LV+FEE
Sbjct: 659 WPEYKASGN-----CGGCSYAGIYTEKKCLSNCGEASQRWYHVPRSWLKPSGNFLVVFEE 713
Query: 728 KGGDPTKITFSIR 740
GGDPT I+F R
Sbjct: 714 LGGDPTGISFVRR 726
>gi|297816572|ref|XP_002876169.1| AT3g52840/F8J2_10 [Arabidopsis lyrata subsp. lyrata]
gi|297322007|gb|EFH52428.1| AT3g52840/F8J2_10 [Arabidopsis lyrata subsp. lyrata]
Length = 728
Score = 772 bits (1993), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/731 (52%), Positives = 487/731 (66%), Gaps = 20/731 (2%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
L I SS+ + VTYD ++LIING+R ++IS +IHYPRS P MWP L+++AKEGG+
Sbjct: 13 LAILCFSSLIWSTEAVVTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKAKEGGL 72
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ I++YVFWNGHE SPG YYF R++LVKF K++ QA +Y+ LRIGP+V AE+N+GG PV
Sbjct: 73 DVIQTYVFWNGHEPSPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNFGGFPV 132
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL Y+PG VFR D EPFK MQ+F IVDMMK EKLF +QGGPIIL+Q+ENEYG E
Sbjct: 133 WLKYVPGIVFRTDNEPFKIAMQRFTKKIVDMMKEEKLFETQGGPIILSQIENEYGPMEWE 192
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
G GK Y+ W A+MA+ + GVPWIMC+Q D P P+I+TCN FYC+ F P+S + PK+W
Sbjct: 193 MGAAGKAYSKWTAEMALGLSTGVPWIMCKQEDAPYPIIDTCNGFYCEGFKPNSDNKPKLW 252
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TENW GWF FGG P+RP EDIAFSVARF Q GGS NYYMY+GGTNF RTA G FI T
Sbjct: 253 TENWTGWFTEFGGAIPNRPVEDIAFSVARFIQNGGSFLNYYMYYGGTNFDRTA-GVFIAT 311
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
SYDY+AP+DEYGL R PK+ HLKELH IKLCE AL++ + + SLG QE V+ S
Sbjct: 312 SYDYDAPLDEYGLLREPKYSHLKELHKVIKLCEPALVSVDPTITSLGDKQEVHVF-KSKT 370
Query: 372 ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPEN 431
+CAAFL+N D + ++FR Y LP WSVSILPDCK +NTA +RA + ++MVP +
Sbjct: 371 SCAAFLSNYDTSSAARIMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRAPTILMKMVPTS 430
Query: 432 LQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVN 491
+ S S + GS + FVK G V+ I+ T+D TDY WY T I +
Sbjct: 431 TKFSWESYNEGSPSSN-----------DDGTFVKDGLVEQISMTRDKTDYFWYLTDITIG 479
Query: 492 ENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIA 551
+E FLK G P+L I S GHALH F N L G++ G ++ + I L G N++A
Sbjct: 480 SDESFLKTGDDPLLTIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQKIKLSVGINKLA 539
Query: 552 LLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPG 610
LLS VGL NAG YE W + V + G NSGT D+S + W+YKIG++GE + +
Sbjct: 540 LLSTAVGLPNAGVHYETWNTGVLGPVTLKGVNSGTWDMSKWKWSYKIGIRGEAMSFHTIA 599
Query: 611 YRNNIN-WVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWP 669
+ + W+ K +PLTWYK+ P G+EP+ LDM MGKG W+NG IGR+WP
Sbjct: 600 GSSAVKWWIKGSFVVKKEPLTWYKSSFDTPKGNEPLALDMNTMGKGQVWVNGHNIGRHWP 659
Query: 670 RKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKG 729
+ + + C+Y G +N KC++ CGEPSQRWYH+PRSW KP N+LVIFEE G
Sbjct: 660 AYTARGN-----CGRCNYAGIYNEKKCLSHCGEPSQRWYHVPRSWLKPFGNLLVIFEEWG 714
Query: 730 GDPTKITFSIR 740
GDP+ I+ R
Sbjct: 715 GDPSGISLVKR 725
>gi|7529708|emb|CAB86888.1| beta-galactosidase precursor-like protein [Arabidopsis thaliana]
Length = 727
Score = 772 bits (1993), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/731 (52%), Positives = 486/731 (66%), Gaps = 21/731 (2%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
L I SS+ + VTYD ++LIING+R ++IS +IHYPRS P MWP L+++AKEGG+
Sbjct: 13 LAILCFSSLIHSTEAVVTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKAKEGGL 72
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ I++YVFWNGHE SPG YYF R++LVKF K++ QA +Y+ LRIGP+V AE+N+GG PV
Sbjct: 73 DVIQTYVFWNGHEPSPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNFGGFPV 132
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL Y+PG VFR D EPFK MQKF IVDMMK EKLF +QGGPIIL+Q+ENEYG +
Sbjct: 133 WLKYVPGMVFRTDNEPFKIAMQKFTKKIVDMMKEEKLFETQGGPIILSQIENEYGPMQWE 192
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
G GK Y+ W A+MA+ + GVPWIM +Q D P P+I+TCN FYC+ F P+S + PK+W
Sbjct: 193 MGAAGKAYSKWTAEMALGLSTGVPWIMSKQEDAPYPIIDTCNGFYCEGFKPNSDNKPKLW 252
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TENW GWF FGG P+RP EDIAFSVARF Q GGS NYYMY+GGTNF RTA G FI T
Sbjct: 253 TENWTGWFTEFGGAIPNRPVEDIAFSVARFIQNGGSFMNYYMYYGGTNFDRTA-GVFIAT 311
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
SYDY+APIDEYGL R PK+ HLKELH IKLCE AL++ + + SLG QE V+ S
Sbjct: 312 SYDYDAPIDEYGLLREPKYSHLKELHKVIKLCEPALVSVDPTITSLGDKQEIHVF-KSKT 370
Query: 372 ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPEN 431
+CAAFL+N D + V+FR Y LP WSVSILPDCK +NTA +RA + ++M+P
Sbjct: 371 SCAAFLSNYDTSSAARVMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRAPTILMKMIPT- 429
Query: 432 LQPSEASPDNGSKGLKWQVFKEIAGIWGEA-DFVKSGFVDHINTTKDTTDYLWYTTSIIV 490
S W+ + E + EA FVK G V+ I+ T+D TDY WY T I +
Sbjct: 430 -----------STKFSWESYNEGSPSSNEAGTFVKDGLVEQISMTRDKTDYFWYFTDITI 478
Query: 491 NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
+E FLK G P+L I S GHALH F N L G++ G ++ + I L G N++
Sbjct: 479 GSDESFLKTGDNPLLTIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQNIKLSVGINKL 538
Query: 551 ALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNP 609
ALLS VGL NAG YE GI V + G NSGT D+S + W+YKIGL+GE + ++
Sbjct: 539 ALLSTAVGLPNAGVHYETWNTGILGPVTLKGVNSGTWDMSKWKWSYKIGLRGEAMSLHTL 598
Query: 610 GYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWP 669
+ + W K QPLTWYK+ P G+EP+ LDM MGKG W+NG IGR+WP
Sbjct: 599 AGSSAVKWWIKGFVVKKQPLTWYKSSFDTPRGNEPLALDMNTMGKGQVWVNGHNIGRHWP 658
Query: 670 RKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKG 729
+ + + C+Y G +N KC++ CGEPSQRWYH+PRSW KP N+LVIFEE G
Sbjct: 659 AYTARGN-----CGRCNYAGIYNEKKCLSHCGEPSQRWYHVPRSWLKPFGNLLVIFEEWG 713
Query: 730 GDPTKITFSIR 740
GDP+ I+ R
Sbjct: 714 GDPSGISLVKR 724
>gi|356509962|ref|XP_003523711.1| PREDICTED: beta-galactosidase 3-like isoform 2 [Glycine max]
Length = 729
Score = 771 bits (1992), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/732 (51%), Positives = 498/732 (68%), Gaps = 33/732 (4%)
Query: 18 SSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESY 77
S +T+C NVTYD +SL+ING+R ++IS +IHYPRS P MW L+ +AK GG++ I++Y
Sbjct: 23 SELTHC---NVTYDRKSLLINGQRRILISGSIHYPRSTPEMWEDLIWKAKHGGLDVIDTY 79
Query: 78 VFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIP 137
VFW+ HE SPG Y F GR++LV+FIK +Q+ +Y LRIGP+V AE+N+GGIPVWL Y+P
Sbjct: 80 VFWDVHEPSPGNYDFEGRYDLVRFIKTVQKVGLYANLRIGPYVCAEWNFGGIPVWLKYVP 139
Query: 138 GTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGK 197
G FR D EPFK MQ F IV MMK EKLF SQGGPIIL+Q+ENEYG G G+
Sbjct: 140 GVSFRTDNEPFKAAMQGFTQKIVQMMKSEKLFQSQGGPIILSQIENEYG--PESRGAAGR 197
Query: 198 RYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPG 257
Y WAA MAV GVPW+MC++ D PDPVIN+CN FYCD F+P+ P P +WTE W G
Sbjct: 198 AYVNWAASMAVGLGTGVPWVMCKENDAPDPVINSCNGFYCDDFSPNKPYKPSMWTETWSG 257
Query: 258 WFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEA 317
WF FGG RP ED++F+VARF QKGGS NYYMYHGGTNFGR+AGGPFITTSYDY+A
Sbjct: 258 WFTEFGGPIHQRPVEDLSFAVARFIQKGGSYVNYYMYHGGTNFGRSAGGPFITTSYDYDA 317
Query: 318 PIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFL 377
PIDEYGL R PK+ HLKELH AIK CEHAL++ + + LSLG+ +A V++ +G CAAFL
Sbjct: 318 PIDEYGLIRQPKYSHLKELHKAIKRCEHALVSLDPTVLSLGTLLQAHVFSSGTGTCAAFL 377
Query: 378 ANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEA 437
AN + ++ TV F N Y LP WS+SILPDCK VFNTA V+ M+P ++P
Sbjct: 378 ANYNAQSAATVTFNNRHYDLPPWSISILPDCKIDVFNTAKVK-------MLP--VKP--- 425
Query: 438 SPDNGSKGLKWQVFKEIAGIWGEADFVKS-GFVDHINTTKDTTDYLWYTTSIIVNENEEF 496
K W+ + E E+ + + G ++ +N T+DT+DYLWY TS+ ++ +E F
Sbjct: 426 ------KLFSWESYDEDLSSLAESSRITAPGLLEQLNVTRDTSDYLWYITSVDISSSESF 479
Query: 497 LKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMT 556
L+ G +P + ++S GHA+H F N + GSA G Y P+ L+AG N+IALLS+T
Sbjct: 480 LRGGQKPSINVQSAGHAVHVFVNGQFSGSAFGTREQRSCTYNGPVDLRAGANKIALLSVT 539
Query: 557 VGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNI 615
VGLQN G YE AGIT V + G + G DL+ W+YK+GL+GE + + +P +++
Sbjct: 540 VGLQNVGRHYETWEAGITGPVLLHGLDQGQKDLTWNKWSYKVGLRGEAMNLVSPNGVSSV 599
Query: 616 NWVSTMEPPKNQP-LTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRK 674
+WV + +++ L WYKA P G EP+ LD+ MGKG W+NG+ IGRYW ++
Sbjct: 600 DWVQESQATQSRSQLKWYKAYFDAPGGKEPLALDLESMGKGQVWINGQSIGRYWMAYAK- 658
Query: 675 SSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTK 734
+C C Y G F P KC GCG+P+QRWYH+PRSW KP++N++V+FEE GG+P K
Sbjct: 659 ----GDC-NSCTYSGTFRPVKCQLGCGQPTQRWYHVPRSWLKPTKNLIVVFEELGGNPWK 713
Query: 735 ITFSIRKISGFP 746
I+ +++++ P
Sbjct: 714 ISL-VKRVAHTP 724
>gi|302782774|ref|XP_002973160.1| hypothetical protein SELMODRAFT_413650 [Selaginella moellendorffii]
gi|300158913|gb|EFJ25534.1| hypothetical protein SELMODRAFT_413650 [Selaginella moellendorffii]
Length = 805
Score = 771 bits (1992), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/717 (53%), Positives = 485/717 (67%), Gaps = 30/717 (4%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NV+YD RSLI+NG+R +++S ++HYPR+ P MWPG++Q+AKEGG++ IE+YVFW+ HE S
Sbjct: 19 NVSYDHRSLILNGKRRILLSGSVHYPRATPEMWPGIIQKAKEGGLDVIETYVFWDRHEPS 78
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PG+YYF GR++LVKF+K++QQA + M LRIGP+V AE+N GG P+WL IP VFR D E
Sbjct: 79 PGQYYFEGRYDLVKFVKLVQQAGLLMNLRIGPYVCAEWNLGGFPIWLRDIPHIVFRTDNE 138
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK +MQ F+T IV+MMK E LFASQGGPIILAQVENEYG +S YGE G RY WAA+M
Sbjct: 139 PFKKYMQSFLTKIVNMMKEENLFASQGGPIILAQVENEYGNVDSHYGEAGVRYINWAAEM 198
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
A AQN GVPWIMC Q P+ +I+TCN YCD + P P +WTE++ GWF +G
Sbjct: 199 AQAQNTGVPWIMCAQSKVPEYIIDTCNGMYCDGWNPILYKKPTMWTESYTGWFTYYGWPI 258
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYM--YHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
PHRP EDIAF+VARFF++GGS HNYYM Y GGTNFGRT+GGP++ +SYDY+AP+DEYG+
Sbjct: 259 PHRPVEDIAFAVARFFERGGSFHNYYMVWYFGGTNFGRTSGGPYVASSYDYDAPLDEYGM 318
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
PKWGHLK+LH +KL E +L+ E + LG +QEA VY+ +G C AFLAN+D N
Sbjct: 319 QHLPKWGHLKDLHETLKLGEEVILSSEGQHSELGPNQEAHVYSYGNG-CVAFLANVDSMN 377
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
D V FRNVSY LPAWSVSIL DCK V FN+A V++QS+ V M P
Sbjct: 378 DTVVEFRNVSYSLPAWSVSILLDCKTVAFNSAKVKSQSAVVSMSPSK------------S 425
Query: 445 GLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
L W F E GI G + F ++ + TTKDT+DYLWYTTS+ E GS
Sbjct: 426 TLSWTSFDEPVGISG-SSFKAKQLLEQMETTKDTSDYLWYTTSV------EATGTGST-W 477
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
L IES +H F N + Q S + + + PI+L G N IALLS TVGLQN G
Sbjct: 478 LSIESMRDVVHIFVNGQFQSSWHTSKSVLYNSVEAPITLAPGSNTIALLSATVGLQNFGA 537
Query: 565 FYEWVGAGIT-SVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEP 623
F E AG++ S+ + G G +LS WTY++GL+GE L ++ ++NW +
Sbjct: 538 FIETWSAGLSGSLILKGLPGGDQNLSKQEWTYQVGLKGEDLKLFTVEGSRSVNWSAV--- 594
Query: 624 PKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQ 683
+PLTWY PPGD+P+ LD+ MGKG AW+NG+ IGRYWP S C +
Sbjct: 595 STEKPLTWYMTEFDAPPGDDPVALDLASMGKGQAWVNGQSIGRYWPAYKAADSV---CPE 651
Query: 684 ECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
CDYRG ++ +KC+TGCG+ SQRWYH+PRSW KP N+LV+FEE GGDP+ I F R
Sbjct: 652 SCDYRGSYDQNKCLTGCGQSSQRWYHVPRSWMKPRGNLLVLFEETGGDPSSIDFVTR 708
>gi|357113908|ref|XP_003558743.1| PREDICTED: beta-galactosidase 5-like [Brachypodium distachyon]
Length = 839
Score = 771 bits (1992), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/718 (51%), Positives = 489/718 (68%), Gaps = 21/718 (2%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD ++++I+G+R ++ S +IHYPRS P MW GL Q+AK+GG++ I++YVFWNGHE +P
Sbjct: 27 VTYDKKAVLIDGQRRILFSGSIHYPRSTPEMWEGLFQKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G Y F GR++LVKFIK Q+A +++ LRIGP++ E+N+GG PVWL Y+PG FR D EP
Sbjct: 87 GNYNFEGRYDLVKFIKTAQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK MQ F IV MMK E+LFASQGGPIIL+Q+ENEYG +G GK Y+ WAAKMA
Sbjct: 147 FKTAMQGFTEKIVGMMKSEELFASQGGPIILSQIENEYGPEGKSFGAAGKSYSNWAAKMA 206
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
V + GVPW+MC+Q D PDPVIN CN FYCD F+P+ P P +WTE W GWF FGG
Sbjct: 207 VGLDTGVPWVMCKQDDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWTGWFTEFGGTIR 266
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
RP ED++F+VARF QKGGS NYYMYHGGTNFGRTAGGPFITTSYDY+AP+DEYGL R
Sbjct: 267 KRPVEDLSFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 326
Query: 328 PKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKT 387
PK+GHLKELH A+KLCE AL++ + + +LGS QEA V+ S +CAAFLAN + +
Sbjct: 327 PKYGHLKELHRAVKLCEPALVSVDPAVTTLGSMQEAHVFRSPS-SCAAFLANYNSNSHAN 385
Query: 388 VVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLK 447
VVF N Y LP WS+SILPDCK VVFNTA V Q+S ++M + G +
Sbjct: 386 VVFNNEHYSLPPWSISILPDCKTVVFNTATVGVQTSQMQMWAD-----------GESSMM 434
Query: 448 WQVFKEIAGIWGEADFV-KSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
W+ + E G A + +G ++ +N T+D++DYLWY TS+ V+ +E+FL+ G L
Sbjct: 435 WERYDEEVGSLAAAPLLTTTGLLEQLNVTRDSSDYLWYITSVDVSPSEKFLQGGEPLSLT 494
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
++S GHALH F N +LQGSASG F YK +L+AG N+IALLS+ GL N G Y
Sbjct: 495 VQSAGHALHIFINGQLQGSASGTREAKKFSYKGNANLRAGTNKIALLSIACGLPNVGVHY 554
Query: 567 EWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPK 625
E GI V + G + G+ DL+ +W+Y++GL+GE + + + +++ W+
Sbjct: 555 ETWNTGIVGPVVLHGLDVGSRDLTWQTWSYQVGLKGEQMNLNSLEGASSVEWMQG-SLLA 613
Query: 626 NQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQEC 685
PL+WY+A P GDEP+ LDM MGKG W+NG+ IGRY S +C + C
Sbjct: 614 QAPLSWYRAYFDTPTGDEPLALDMGSMGKGQIWINGQSIGRY-----STSYASGDC-KAC 667
Query: 686 DYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
Y G + KC GCG+P+QRWYH+P+SW +PS N+LV+FEE GGD +KI+ R +S
Sbjct: 668 SYAGSYRAPKCQAGCGQPTQRWYHVPKSWLQPSRNLLVVFEELGGDSSKISLVKRSVS 725
>gi|18148449|dbj|BAB83260.1| beta-D-galactosidase [Persea americana]
Length = 766
Score = 771 bits (1992), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/717 (52%), Positives = 478/717 (66%), Gaps = 19/717 (2%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+VTYD ++++ING+R ++IS +IHYPRS P MWP L+Q+AKEGG++ I++YVFW+GHE S
Sbjct: 36 SVTYDRKAIVINGQRRILISGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWDGHEPS 95
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PGKYYF GR++LVKFIK+++QA +Y+ LRIGP++ AE+N GG PVWL YIPG FR D E
Sbjct: 96 PGKYYFEGRYDLVKFIKLVKQAGLYVNLRIGPYICAEWNLGGFPVWLKYIPGISFRTDNE 155
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK +M F IV+MMK E LF QGGPII++Q+ENEYG E G GK Y WAA M
Sbjct: 156 PFKRYMAGFTKKIVEMMKAESLFEPQGGPIIMSQIENEYGPVEWEIGAIGKVYTRWAASM 215
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
AV N GVPWIMC+Q + PDP+INTCN FYCD F P+ P +WTE W GWF FGG
Sbjct: 216 AVNLNTGVPWIMCKQDEVPDPIINTCNGFYCDWFKPNKDYKPIMWTELWTGWFTAFGGPV 275
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
P+RP ED+A++V +F QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+AP+DEYGL R
Sbjct: 276 PYRPVEDVAYAVVKFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLKR 335
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDK 386
PKWGHL++LH AIK+CE AL++ + + +G SQEA V+ SGAC+AFL N D+ N
Sbjct: 336 EPKWGHLRDLHRAIKMCEPALVSNDPTVTKIGDSQEAHVFKFESGACSAFLENKDETNFV 395
Query: 387 TVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL 446
V F+ + Y LP WS+SILPDC VV+NT V Q+S + M+ + +
Sbjct: 396 KVTFQGMQYELPPWSISILPDCVNVVYNTGRVGTQTSMMTMLSAS-----------NNEF 444
Query: 447 KWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
W + E + E G + I+ TKD+TDYL YTT + + +NE FLKNG PVL
Sbjct: 445 SWASYNEDTASYNEESMTIEGLSEQISITKDSTDYLRYTTDVTIGQNEGFLKNGEYPVLT 504
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
+ S GHAL F N +L G+A G+ P + + L AG N+I+LLS VGL N G +
Sbjct: 505 VNSAGHALQVFVNGQLSGTAYGSVNDPRLTFSGKVKLWAGNNKISLLSSAVGLPNVGTHF 564
Query: 567 E-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPK 625
E W + V + G N G DLS W+YK+G+ GE L +++P +++ W S+ K
Sbjct: 565 ETWNYGVLGPVTLNGLNEGKRDLSLQKWSYKVGVIGEALQLHSPTGSSSVEWGSSTS--K 622
Query: 626 NQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQEC 685
QP TWYK P G++P+ LDM MGKG W+NG+ IGRYWP + +C C
Sbjct: 623 IQPFTWYKTTFNAPGGNDPLALDMNTMGKGQIWINGQSIGRYWP----AYKANGKC-SAC 677
Query: 686 DYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
Y G ++ KC CGE SQRWYHIPRSW P+ N+LV+FEE GGDPT IT R I
Sbjct: 678 HYTGWYDEKKCGFNCGEASQRWYHIPRSWLNPTGNLLVVFEEWGGDPTGITLVRRTI 734
>gi|297846860|ref|XP_002891311.1| hypothetical protein ARALYDRAFT_473836 [Arabidopsis lyrata subsp.
lyrata]
gi|297337153|gb|EFH67570.1| hypothetical protein ARALYDRAFT_473836 [Arabidopsis lyrata subsp.
lyrata]
Length = 732
Score = 771 bits (1991), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/733 (50%), Positives = 487/733 (66%), Gaps = 22/733 (3%)
Query: 14 IFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNT 73
+ SS+ C +VTYD ++++ING R +++S +IHYPRS P MW L+++AK+GG++
Sbjct: 19 MLIGSSMIQC--SSVTYDKKAIVINGHRRILLSGSIHYPRSTPEMWEDLIKKAKDGGLDV 76
Query: 74 IESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWL 133
I++YVFWNGHE SPG Y F GR++LV+FIK IQ+ +Y+ LRIGP+V AE+N+GG PVWL
Sbjct: 77 IDTYVFWNGHEPSPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIGPYVCAEWNFGGFPVWL 136
Query: 134 HYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYG 193
Y+ G FR D PFK MQ F IV MMK + FASQGGPIIL+Q+ENE+ G
Sbjct: 137 KYVDGISFRTDNGPFKAAMQGFTEKIVQMMKEHRFFASQGGPIILSQIENEFEPELKGLG 196
Query: 194 EGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTE 253
G Y WAAKMAV N GVPW+MC++ D PDP+IN+CN FYCD FTP+ P P +WTE
Sbjct: 197 PAGHSYVNWAAKMAVGLNTGVPWVMCKEDDAPDPIINSCNGFYCDYFTPNKPYKPTMWTE 256
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSY 313
W GWF FGG P RP ED+AF VARF QKGGS NYYMYHGGTNFGRTAGGPFITTSY
Sbjct: 257 AWSGWFTEFGGTIPKRPVEDLAFGVARFIQKGGSYINYYMYHGGTNFGRTAGGPFITTSY 316
Query: 314 DYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGAC 373
DY+APIDEYGL + PK+ HLK+LH AIK CE AL++ + LG+ +EA V+ G+C
Sbjct: 317 DYDAPIDEYGLVQEPKYSHLKQLHQAIKQCEAALVSSDPHVTKLGNYEEAHVFTAGKGSC 376
Query: 374 AAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPE-NL 432
AFL N VVF N Y LPAWS+SILPDC+ VVFNTA V A++S V+M+P ++
Sbjct: 377 VAFLTNYHMNAPAKVVFNNRHYTLPAWSISILPDCRNVVFNTATVAAKTSHVQMMPSGSI 436
Query: 433 QPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNE 492
S A D ++IA G ++ +N T+DTTDYLWYTTS+ +
Sbjct: 437 LYSVARYD-----------EDIATYGDRGTITARGLLEQVNVTRDTTDYLWYTTSVDIKA 485
Query: 493 NEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIAL 552
+E FL+ G P L ++S GHA+H F N GSA G + F + + ++L+ G N IAL
Sbjct: 486 SESFLRGGKWPTLTVDSAGHAVHVFVNGHFYGSAFGTRENRKFSFSSQVNLRGGANRIAL 545
Query: 553 LSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGY 611
LS+ VGL N GP +E W + SV + G + G DLS WTY+ GL+GE + + +P
Sbjct: 546 LSVAVGLPNVGPHFETWATGIVGSVVLHGLDEGNKDLSWQKWTYQAGLRGEAMKLVSPTE 605
Query: 612 RNNINWVS-TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPR 670
++++W+ ++ QPLTWYKA P G+EP+ LD+ MGKG AW+NG+ IGRYW
Sbjct: 606 DSSVDWIKGSLAKQNKQPLTWYKAYFDAPRGNEPLALDLKSMGKGQAWINGQSIGRYWMA 665
Query: 671 KSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGG 730
++ + C+Y G + +KC +GCGEP+QRWYH+PRSW KP N+LV+FEE GG
Sbjct: 666 FAKGN------CGSCNYAGTYRQNKCQSGCGEPTQRWYHVPRSWLKPRGNLLVLFEELGG 719
Query: 731 DPTKITFSIRKIS 743
D +K++ R ++
Sbjct: 720 DISKVSVVKRSVN 732
>gi|3641865|emb|CAA09457.1| beta-galactosidase [Cicer arietinum]
Length = 723
Score = 771 bits (1991), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/718 (52%), Positives = 476/718 (66%), Gaps = 19/718 (2%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
+VTYD ++++I+G+R ++IS +IHYPRS P MWP L Q+AKEGG++ I++YVFWNGHE
Sbjct: 22 TASVTYDHKTIVIDGQRRILISGSIHYPRSTPEMWPALFQKAKEGGLDVIQTYVFWNGHE 81
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
SPGKYYF RF+LVKFIK+ QQA +Y+ LRIGP+V AE+N+GG PVWL Y+PG FR D
Sbjct: 82 PSPGKYYFEDRFDLVKFIKLAQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTD 141
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK MQKF T IV MMK E LF +QGGPII++Q+ENEYG E G GK Y WAA
Sbjct: 142 NEPFKAAMQKFTTKIVSMMKAENLFQNQGGPIIMSQIENEYGPVEWNIGAPGKAYTNWAA 201
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
+MAV + GVPW MC+Q D PDPVI+TCN +YC+ FTP+ PK+WTENW GW+ FG
Sbjct: 202 QMAVGLDTGVPWDMCKQEDAPDPVIDTCNGYYCENFTPNKNYKPKMWTENWSGWYTDFGN 261
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
+RP ED+A+SVARF Q GS NYYMYHGGTNFGRT+ G FI TSYDY+APIDEYGL
Sbjct: 262 AICYRPVEDLAYSVARFIQNRGSFVNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYGL 321
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
PKW HL++LH AIK CE AL++ + + SLG+ EA VY+ + CAAFLAN D K+
Sbjct: 322 TNEPKWSHLRDLHKAIKQCEPALVSVDPTITSLGNKLEAHVYSTGTSVCAAFLANYDTKS 381
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
TV F N Y LP WSVSILPDCK VFNTA V AQSS M+ N
Sbjct: 382 AATVTFGNGKYDLPPWSVSILPDCKTDVFNTAKVGAQSSQKTMISTN------------S 429
Query: 445 GLKWQVFKEIAGIWGEADFVKS-GFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
WQ + E E D + + + IN T+D++DYLWY T + ++ NE+F+KNG P
Sbjct: 430 TFDWQSYIEEPAFSSEDDSITAEALWEQINVTRDSSDYLWYLTDVNISPNEDFIKNGQYP 489
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
+L + S GH LH F N +L G+ G +P + N ++L G N+I+LLS+ VGL N G
Sbjct: 490 ILNVMSAGHVLHVFVNGQLSGTVYGVLDNPKLTFSNSVNLTVGNNKISLLSVAVGLPNVG 549
Query: 564 PFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
+E G+ V + G N GT DLS W+YK+GL+GE L ++ ++++W
Sbjct: 550 LHFETWNVGVLGPVTLKGLNEGTRDLSWQKWSYKVGLKGESLSLHTITGGSSVDWTQGSL 609
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
K QPLTWYKA P G++P+GLDM MGKG W+N + IGR+WP H C
Sbjct: 610 LAKKQPLTWYKATFNAPAGNDPLGLDMSSMGKGEIWVNDQSIGRHWP----GYIAHGSC- 664
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
+CDY G F KC T CG P+Q WYHIPRSW P+ N+LV+ EE GGDP+ I+ R
Sbjct: 665 GDCDYAGTFTNTKCRTNCGNPTQTWYHIPRSWLNPTGNVLVVLEEWGGDPSGISLLKR 722
>gi|255543793|ref|XP_002512959.1| beta-galactosidase, putative [Ricinus communis]
gi|223547970|gb|EEF49462.1| beta-galactosidase, putative [Ricinus communis]
Length = 732
Score = 771 bits (1991), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/721 (51%), Positives = 485/721 (67%), Gaps = 21/721 (2%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NVTYD ++LIING++ ++ S +IHYPRS P MW GL+Q+AK+GG++ I++YVFWN HE S
Sbjct: 27 NVTYDKKALIINGQKRILFSGSIHYPRSTPQMWEGLIQKAKDGGLDVIDTYVFWNLHEPS 86
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PG Y F GR +LV+FIK++ +A +Y+ LRIGP++ E+N+GG PVWL YIPG +FR D E
Sbjct: 87 PGNYNFEGRNDLVQFIKLVHKAGLYVHLRIGPYICGEWNFGGFPVWLKYIPGMIFRTDNE 146
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK MQKF IV MMK E+L+ SQGGPIIL+Q+ENEY + +G G Y WAA M
Sbjct: 147 PFKLQMQKFTQKIVQMMKDEQLYESQGGPIILSQIENEYEPEDKAFGAAGHAYMTWAAHM 206
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
AV+ N GVPW+MC++FD PDPV+NTCN FYCD F+P+ P +WTE W GWF FGG
Sbjct: 207 AVSLNTGVPWVMCKEFDAPDPVVNTCNGFYCDYFSPNKAYKPTMWTEAWTGWFTDFGGPI 266
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
RP ED+AF+VARF QKGGS NYYMYHGGTNFGRTAGGPFITTSYDY+APIDEYGL R
Sbjct: 267 HQRPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIR 326
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDK 386
PK+GHLK+LH AIKLCE ALL+ + +LGS ++A V++ +SG CAAFLAN + K
Sbjct: 327 QPKYGHLKDLHKAIKLCERALLSSDPVVTTLGSYEQAHVFSSNSGDCAAFLANYNPKATA 386
Query: 387 TVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL 446
V F N+ Y+LP WSVSILPDCK VVFNTA V Q S ++M+P ++ L
Sbjct: 387 KVTFNNMHYNLPPWSVSILPDCKNVVFNTAEVGVQPSKIQMLPTE-----------ARFL 435
Query: 447 KWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVL 505
W+ E I+ + + +G ++ IN T+D +DYLWYTT + ++ +E FL G P+L
Sbjct: 436 SWEALSEDISSVDDDKIGTVAGLLEQINVTRDASDYLWYTTGVHISSSETFLDGGQPPIL 495
Query: 506 LIESKGHALHAFANQELQGSASGNGTHPPFKYKNPI-SLKAGKNEIALLSMTVGLQNAGP 564
+ S GH +H F N +L GS G + + + L AG+N I+LLS+ VGL N GP
Sbjct: 496 KVISAGHGIHVFVNGQLSGSVYGTRGNRRISFSGELKQLHAGRNRISLLSVAVGLPNNGP 555
Query: 565 FYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVS-TME 622
+E W + V I G + G DL+ W+YK+GL+GE L + +P +INW+ +
Sbjct: 556 RFETWNTGVLGPVVIHGLDQGHRDLTWQKWSYKVGLKGEDLNLGSPNSIPSINWMQESAM 615
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
+ QPLTW++A P GD+P+ LDM M KG W+NG IGRYW + D
Sbjct: 616 VAERQPLTWHRAFFDAPRGDDPLALDMSSMVKGQVWINGNSIGRYWTVYA------DGNC 669
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
C Y G F P C GCG+P+Q+WYHIPRS KP+EN+LV+FEE GGD +KI R +
Sbjct: 670 TACSYSGTFRPSTCQFGCGQPTQKWYHIPRSLLKPTENLLVVFEEIGGDVSKIYLVKRLV 729
Query: 743 S 743
+
Sbjct: 730 T 730
>gi|380450408|gb|AFD54987.1| beta-galactosidase [Momordica charantia]
Length = 719
Score = 771 bits (1990), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/733 (50%), Positives = 490/733 (66%), Gaps = 22/733 (3%)
Query: 11 ALLIFFSSSITYCFA-GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
+L+F C+A VTYD +++IING+R +++S +IHYPRS P MWP L+Q AK+G
Sbjct: 4 CVLLFLGLLSWVCYAMATVTYDQKAIIINGKRRILVSGSIHYPRSTPQMWPSLIQNAKDG 63
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ IE+YVFWNGHE + GKYYF R++LV+FIK++QQA +Y+ LRIGP+V AE+NYGG
Sbjct: 64 GLDIIETYVFWNGHEPTQGKYYFEDRYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYGGF 123
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
P+WL ++PG VFR + EPFK MQKF IV MMK EKL+ SQGGPIIL+Q+ENEYG E
Sbjct: 124 PIWLKHVPGIVFRTENEPFKAAMQKFTEKIVGMMKSEKLYESQGGPIILSQIENEYGPVE 183
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G GK Y WAA+MA+ + GVPW+MC+Q D PDPVI+TCN FYC+ F P+ + PK
Sbjct: 184 WEIGAPGKSYTKWAAQMALGLDTGVPWVMCKQEDAPDPVIDTCNGFYCENFKPNRENKPK 243
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
IWTE W GW+ FGG P+RP+ED+AFSVARF Q GGS+ NYYMYHGGTNFGR++ G FI
Sbjct: 244 IWTEVWSGWYTAFGGAVPYRPAEDLAFSVARFVQNGGSLFNYYMYHGGTNFGRSS-GLFI 302
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
SYD++APIDEYGL R PKW HL++LH AIKLCE AL++ + + LG + EA V+ S
Sbjct: 303 ANSYDFDAPIDEYGLKREPKWEHLRDLHKAIKLCEPALVSADPNVTWLGKNLEARVFKSS 362
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
SGACAAFLAN D V F N Y LP WS+SIL DCK +FNTA + AQS+ ++M+
Sbjct: 363 SGACAAFLANYDISTSSKVSFWNTQYDLPPWSISILSDCKSAIFNTARIGAQSAPMKMML 422
Query: 430 ENLQPSEASPDNGSKGLKWQVFK-EIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
+ W +K E+A + K G V+ +N T D+TDYLWY T I
Sbjct: 423 VS-------------SFWWLSYKEEVASGYATDTTTKDGLVEQVNFTWDSTDYLWYMTDI 469
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
++ NE F+K+G P+L I S GH LH F N +L G+ G+ +P + ++LKAG N
Sbjct: 470 QIDPNEAFIKSGQWPLLNISSAGHVLHVFVNGQLSGTVYGSLENPKVAFSKYVNLKAGVN 529
Query: 549 EIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
++++LS+TVGL N G +E AG+ V + G N G D+S Y W++K+GL+GE++ ++
Sbjct: 530 KLSMLSVTVGLPNVGLHFESWNAGVLGPVTLKGLNEGIRDMSGYKWSHKVGLKGENMNLH 589
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
G N++ W + QPLTWYK P G+EP+ LDM MGKG W+NG IGRY
Sbjct: 590 TIGGSNSVQWAKGSGLVQKQPLTWYKTNFNTPAGNEPLALDMSSMGKGQIWINGRSIGRY 649
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
WP + S +C Y G F KC++ CG+PSQ+WYH+PR W + N LV+FEE
Sbjct: 650 WPAYAASGS-----CGKCSYAGIFTEKKCLSNCGQPSQKWYHVPREWLESKGNFLVVFEE 704
Query: 728 KGGDPTKITFSIR 740
GG+P I+ R
Sbjct: 705 LGGNPGGISLVKR 717
>gi|302789848|ref|XP_002976692.1| hypothetical protein SELMODRAFT_268001 [Selaginella moellendorffii]
gi|300155730|gb|EFJ22361.1| hypothetical protein SELMODRAFT_268001 [Selaginella moellendorffii]
Length = 802
Score = 771 bits (1990), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/715 (52%), Positives = 484/715 (67%), Gaps = 29/715 (4%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NV+YD RSLI+NG+R +++S ++HYPR+ P MWPG++Q+AKEGG++ IE+YVFW+ HE S
Sbjct: 19 NVSYDHRSLILNGKRRILLSGSVHYPRATPEMWPGIIQKAKEGGLDVIETYVFWDRHEPS 78
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PG+YYF GR++LVKF+K++QQA + + LRIGP+V AE+N GG P+WL IP VFR D E
Sbjct: 79 PGQYYFEGRYDLVKFVKLVQQAGLLVNLRIGPYVCAEWNLGGFPIWLRDIPHIVFRTDNE 138
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK +MQ F+T IV+MMK E LFASQGGPIILAQVENEYG +S YGE G RY WAA+M
Sbjct: 139 PFKKYMQSFLTKIVNMMKEENLFASQGGPIILAQVENEYGNVDSHYGEAGVRYINWAAEM 198
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
A AQN GVPWIMC Q P+ +I+TCN YCD + P P +WTE++ GWF +G
Sbjct: 199 AQAQNTGVPWIMCAQSKVPEYIIDTCNGMYCDGWNPTLYKKPTMWTESYTGWFTYYGWPL 258
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
PHRP EDIAF+VARFF++GGS HNYYMY GGTNFGRT+GGP++ +SYDY+AP+DEYG+
Sbjct: 259 PHRPVEDIAFAVARFFERGGSFHNYYMYFGGTNFGRTSGGPYVASSYDYDAPLDEYGMQH 318
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDK 386
PKWGHLK+LH +KL E +L+ E + LG +QEA VY+ +G C AFLAN+D ND
Sbjct: 319 LPKWGHLKDLHETLKLGEEVILSSEGQHSELGPNQEAHVYSYGNG-CVAFLANVDSMNDT 377
Query: 387 TVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL 446
V FRNVSY LPAWSVSI+ DCK V FN+A V++QS+ V M PS++S L
Sbjct: 378 VVEFRNVSYSLPAWSVSIVLDCKTVAFNSAKVKSQSAVVSM-----NPSKSS-------L 425
Query: 447 KWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
W F E GI G + F ++ + TTKDT+DYLWYTT +L
Sbjct: 426 SWTSFDEPVGISG-SSFKAKQLLEQMETTKDTSDYLWYTTRYATGTGSTWLS-------- 476
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
IES +H F N + Q S + + + PI L G N IALLS TVGLQN G F
Sbjct: 477 IESMRDVVHIFVNGQFQSSWHTSKSVLYNSVEAPIKLAPGSNTIALLSATVGLQNFGAFI 536
Query: 567 EWVGAGIT-SVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPK 625
E AG++ S+ + G G +LS WTY++GL+GE L ++ ++NW +
Sbjct: 537 ETWSAGLSGSLILKGLPGGDQNLSKQEWTYQVGLKGEDLKLFTVEGSRSVNWSAV---ST 593
Query: 626 NQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQEC 685
+PLTWY PPGD+P+ LD+ MGKG AW+NG+ IGRYWP S C + C
Sbjct: 594 KKPLTWYMTEFDAPPGDDPVALDLASMGKGQAWVNGQSIGRYWPAYKAADSV---CPESC 650
Query: 686 DYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
DYRG ++ +KC+TGCG+ SQRWYH+PRSW KP N+LV+FEE GGDP+ I F R
Sbjct: 651 DYRGSYDQNKCLTGCGQSSQRWYHVPRSWMKPRGNLLVLFEETGGDPSSIDFVTR 705
>gi|218188525|gb|EEC70952.1| hypothetical protein OsI_02561 [Oryza sativa Indica Group]
Length = 822
Score = 770 bits (1989), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/716 (52%), Positives = 483/716 (67%), Gaps = 24/716 (3%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+TYD +++++NG+R ++IS +IHYPRS P MWP L+++AK+GG++ +++YVFWNGHE SP
Sbjct: 23 LTYDRKAVVVNGQRRILISGSIHYPRSTPEMWPDLIEKAKDGGLDVVQTYVFWNGHEPSP 82
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G+YYF GR++LV FIK+++QA +Y+ LRIGP+V AE+N+GG PVWL Y+PG FR D EP
Sbjct: 83 GQYYFEGRYDLVHFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 142
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK MQKF T IV+MMK E LF QGGPIIL+Q+ENE+G E GE K YA WAA MA
Sbjct: 143 FKAEMQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMA 202
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
VA N GVPWIMC++ D PDP+INTCN FYCD F+P+ P P +WTE W W+ FG P
Sbjct: 203 VALNTGVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTAWYTGFGIPVP 262
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
HRP ED+A+ VA+F QKGGS NYYM+HGGTNFGRTAGGPFI TSYDY+APIDEYGL R
Sbjct: 263 HRPVEDLAYGVAKFIQKGGSFVNYYMFHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLRE 322
Query: 328 PKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKT 387
PKWGHLK+LH AIKLCE AL+ G+ SLG++Q++ V+ S+GACAAFL N D +
Sbjct: 323 PKWGHLKQLHKAIKLCEPALVAGDPIVTSLGNAQKSSVFRSSTGACAAFLDNKDKVSYAR 382
Query: 388 VVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLK 447
V F + Y LP WS+SILPDCK VFNTA V +Q S ++M + G
Sbjct: 383 VAFNGMHYDLPPWSISILPDCKTTVFNTARVGSQISQMKM-------------EWAGGFA 429
Query: 448 WQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLI 507
WQ + E +GE F G ++ IN T+D TDYLWYTT + V ++++FL NG P L +
Sbjct: 430 WQSYNEEINSFGEDPFTTVGLLEQINVTRDNTDYLWYTTYVDVAQDDQFLSNGENPKLTV 489
Query: 508 ESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYE 567
+ L G+ G+ P Y + L AG N I+ LS+ VGL N G +E
Sbjct: 490 MCF--LILNILFNLLAGTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIAVGLPNVGEHFE 547
Query: 568 WVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKN 626
AGI V + G N G DL+ WTY++GL+GE + +++ + + W EP +
Sbjct: 548 TWNAGILGPVTLDGLNEGRRDLTWQKWTYQVGLKGESMSLHSLSGSSTVEW---GEPVQK 604
Query: 627 QPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECD 686
QPLTWYKA P GDEP+ LDM MGKG W+NG+ IGRYWP K+S + CD
Sbjct: 605 QPLTWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWP--GYKASGN---CGTCD 659
Query: 687 YRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
YRG+++ KC T CG+ SQRWYH+PRSW P+ N+LVIFEE GGDPT I+ R I
Sbjct: 660 YRGEYDETKCQTNCGDSSQRWYHVPRSWLSPTGNLLVIFEEWGGDPTGISMVKRSI 715
>gi|255560830|ref|XP_002521428.1| beta-galactosidase, putative [Ricinus communis]
gi|223539327|gb|EEF40918.1| beta-galactosidase, putative [Ricinus communis]
Length = 841
Score = 770 bits (1988), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/736 (50%), Positives = 487/736 (66%), Gaps = 31/736 (4%)
Query: 14 IFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNT 73
IF S + ++G V+YD R+L+I+G+R ++ S +IHYPR+ P +WP +++++KEGG++
Sbjct: 16 IFACSYLERGWSGKVSYDHRALVIDGKRRVLQSGSIHYPRTTPEVWPDIIRKSKEGGLDV 75
Query: 74 IESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWL 133
IE+YVFWN HE G+YYF GRF+LV+F+K IQ+A + + LRIGP+ AE+NYGG P+WL
Sbjct: 76 IETYVFWNYHEPVKGQYYFEGRFDLVRFVKTIQEAGLLVHLRIGPYACAEWNYGGFPLWL 135
Query: 134 HYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYG 193
H+IPG FR E FK M+ F+T IV+MMK E LFASQGGPIILAQVENEYG E YG
Sbjct: 136 HFIPGIQFRTTNELFKEEMKLFLTKIVNMMKEENLFASQGGPIILAQVENEYGNVEWAYG 195
Query: 194 EGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTE 253
G+ Y WAA+ AV+ N VPW+MC Q D PDP+INTCN FYCD+F+P+SPS PK+WTE
Sbjct: 196 AAGELYVKWAAETAVSLNTSVPWVMCAQVDAPDPIINTCNGFYCDRFSPNSPSKPKMWTE 255
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSY 313
N+ GWF +FG P+RP ED+AF+VARFF+ GG+ NYYMY GGTNFGRTAGGP + TSY
Sbjct: 256 NYSGWFLSFGYAIPYRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSY 315
Query: 314 DYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGAC 373
DY+APIDEYG R PKWGHL++LH AIK CE L++ + + LG++ EA +Y SS C
Sbjct: 316 DYDAPIDEYGFIRQPKWGHLRDLHKAIKQCEEHLISSDPIHQQLGNNLEAHIYYKSSNDC 375
Query: 374 AAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVR---------AQSST 424
AAFLAN D +D V F Y LPAWSVSILPDCK V+FNTA V A S++
Sbjct: 376 AAFLANYDSSSDANVTFNGNIYFLPAWSVSILPDCKNVIFNTAKVLILNLGDDFFAHSTS 435
Query: 425 VEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWY 484
V +P + + W +KE GIWG F G ++ INTTKD +D+LWY
Sbjct: 436 VNEIP-------------LEQIVWSWYKEEVGIWGNNSFTAPGLLEQINTTKDISDFLWY 482
Query: 485 TTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLK 544
+TSI VN ++ +L IES GHA F N+ L G GN F ISL
Sbjct: 483 STSISVNADQV-----KDIILNIESLGHAALVFVNKVLVGKY-GNHDDASFSLTEKISLI 536
Query: 545 AGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHL 604
G N + LLSM +G+QN GP+++ GAGI +V + G + +DLS+ WTY++GL+GE+
Sbjct: 537 EGNNTLDLLSMMIGVQNYGPWFDVQGAGIYAVLLVGQSKVKIDLSSEKWTYQVGLEGEYF 596
Query: 605 GIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEI 664
G+ N+ W PP N+ L WYK P G P+ L++ MGKG AW+NG+ I
Sbjct: 597 GLDKVSLANSSLWTQGASPPINKSLIWYKGTFVAPEGKGPLALNLAGMGKGQAWVNGQSI 656
Query: 665 GRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVI 724
GRYWP SP C CDYRG ++ KC+ CG+P+Q YHIPR+W P EN+LV+
Sbjct: 657 GRYWP---AYLSPSTGCNDSCDYRGAYDSFKCLKKCGQPAQTLYHIPRTWVHPGENLLVL 713
Query: 725 FEEKGGDPTKITFSIR 740
EE GGDP+KI+ R
Sbjct: 714 HEELGGDPSKISVLTR 729
>gi|242053381|ref|XP_002455836.1| hypothetical protein SORBIDRAFT_03g025990 [Sorghum bicolor]
gi|241927811|gb|EES00956.1| hypothetical protein SORBIDRAFT_03g025990 [Sorghum bicolor]
Length = 785
Score = 770 bits (1987), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/701 (53%), Positives = 476/701 (67%), Gaps = 23/701 (3%)
Query: 45 ISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKI 104
+S ++HYPRSVP MWP L+Q+AK+GG++ +++YVFWNGHE S G+YYF GR++LV FIK+
Sbjct: 1 MSGSVHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRGQYYFEGRYDLVHFIKL 60
Query: 105 IQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMK 164
++QA +Y+ LRIGP+V AE+N+GG PVWL Y+PG FR D EPFK MQKF T IVDMMK
Sbjct: 61 VKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKAEMQKFTTKIVDMMK 120
Query: 165 REKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDT 224
E LF QGGPIIL+Q+ENE+G E GE K YA WAA MAVA N VPW+MC++ D
Sbjct: 121 SEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAVALNTSVPWVMCKEDDA 180
Query: 225 PDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQK 284
PDP+INTCN FYCD F+P+ P P +WTE W W+ FG PHRP ED+A+ VA+F QK
Sbjct: 181 PDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPHRPVEDLAYGVAKFIQK 240
Query: 285 GGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCE 344
GGS NYYMYHGGTNFGRTAGGPFI TSYDY+APIDEYGL R PKWGHLKELH AIKLCE
Sbjct: 241 GGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLREPKWGHLKELHKAIKLCE 300
Query: 345 HALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSI 404
AL+ G+ SLG++Q+A V+ S+ AC AFL N D + V F + Y+LP WS+SI
Sbjct: 301 PALVAGDPIVTSLGNAQQASVFRSSTDACVAFLENKDKVSYARVSFNGMHYNLPPWSISI 360
Query: 405 LPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFV 464
LPDCK V+NTA V +Q S ++M + G WQ + E G+ FV
Sbjct: 361 LPDCKTTVYNTARVGSQISQMKM-------------EWAGGFTWQSYNEDINSLGDESFV 407
Query: 465 KSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQG 524
G ++ IN T+D TDYLWYTT + V ++E+FL NG PVL + S GHALH F N +L G
Sbjct: 408 TVGLLEQINVTRDNTDYLWYTTYVDVAQDEQFLSNGKNPVLTVMSAGHALHIFVNGQLTG 467
Query: 525 SASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNS 583
+ G+ P Y+ + L G N I+ LS+ VGL N G +E AGI V + G N
Sbjct: 468 TVYGSVDDPKLTYRGNVKLWPGSNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNE 527
Query: 584 GTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDE 643
G DL+ WTYK+GL+GE L +++ +++ W EP + QPLTWYKA P GDE
Sbjct: 528 GRRDLTWQKWTYKVGLKGEDLSLHSLSGSSSVEW---GEPMQKQPLTWYKAFFNAPDGDE 584
Query: 644 PIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEP 703
P+ LDM MGKG W+NG+ IGRYWP + CDYRG+++ KC T CG+
Sbjct: 585 PLALDMSSMGKGQIWINGQGIGRYWPGYKASGT-----CGICDYRGEYDEKKCQTNCGDS 639
Query: 704 SQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKISG 744
SQRWYH+PRSW P+ N+LVIFEE GGDPT I+ +++ +G
Sbjct: 640 SQRWYHVPRSWLNPTGNLLVIFEEWGGDPTGISM-VKRTTG 679
>gi|20514290|gb|AAM22973.1|AF499737_1 beta-galactosidase [Oryza sativa Japonica Group]
gi|21070357|gb|AAM34271.1|AF508799_1 beta-galactosidase [Oryza sativa Japonica Group]
Length = 843
Score = 770 bits (1987), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/722 (50%), Positives = 489/722 (67%), Gaps = 23/722 (3%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD ++++++G+R ++ S +IHYPRS P MW GL+++AK+GG++ I++YVFWNGHE +P
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G Y F GR++LV+FIK +Q+A M++ LRIGP++ E+N+GG PVWL Y+PG FR D EP
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK MQ F IV MMK E LFASQGGPIIL+Q+ENEYG +G GK Y WAAKMA
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGKAYINWAAKMA 206
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
V + GVPW+MC++ D PDPVIN CN FYCD F+P+ P P +WTE W GWF FGG
Sbjct: 207 VGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSGWFTEFGGTIR 266
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
RP ED+AF VARF QKGGS NYYMYHGGTNFGRTAGGPFITTSYDY+AP+DEYGL R
Sbjct: 267 QRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 326
Query: 328 PKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKT 387
PK+GHLKELH A+KLCE L++ + + +LGS QEA V+ SSG CAAFLAN + +
Sbjct: 327 PKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVFRSSSG-CAAFLANYNSNSYAK 385
Query: 388 VVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLK 447
V+F N +Y LP WS+SILPDCK VVFNTA V Q++ ++M + G+ +
Sbjct: 386 VIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTNQMQMWAD-----------GASSMM 434
Query: 448 WQVFKEIAGIWGEADFVKS-GFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
W+ + E A + S G ++ +N T+DT+DYLWY T + V+ +E+FL+ G+ L
Sbjct: 435 WEKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITRVEVDPSEKFLQGGTPLSLT 494
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
++S GHALH F N +LQGSA G Y +L+AG N++ALLS+ GL N G Y
Sbjct: 495 VQSAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVACGLPNVGVHY 554
Query: 567 E-WVGAGITSVKITGFNSGTLDLSTYSWTY--KIGLQGEHLGIYNPGYRNNINWVS-TME 622
E W + V I G + G+ DL+ +W+Y ++GL+GE + + + ++ W+ ++
Sbjct: 555 ETWNTGVVGPVVIHGLDEGSRDLTWQTWSYQFQVGLKGEQMNLNSLEGSGSVEWMQGSLV 614
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
QPL WY+A P GDEP+ LDM MGKG W+NG+ IGRYW + +C
Sbjct: 615 AQNQQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYW-----TAYAEGDC- 668
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
+ C Y G + KC GCG+P+QRWYH+PRSW +P+ N+LV+FEE GGD +KI + R +
Sbjct: 669 KGCHYTGSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALAKRTV 728
Query: 743 SG 744
SG
Sbjct: 729 SG 730
>gi|20384648|gb|AAK31801.1| beta-galactosidase [Citrus sinensis]
Length = 737
Score = 769 bits (1985), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/736 (51%), Positives = 484/736 (65%), Gaps = 21/736 (2%)
Query: 7 IAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQA 66
++ LL F S I++ A +V+YD +++IING++ ++IS +IHYPRS P MWP L+Q+A
Sbjct: 19 VSMLVLLSFCSWEISFVKA-SVSYDHKAVIINGQKRILISGSIHYPRSTPEMWPDLIQKA 77
Query: 67 KEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNY 126
K+GG++ I++YVFWNGHE + G YYF R++LV+FIK++QQA +Y+ LRIGP+V AE+NY
Sbjct: 78 KDGGLDVIQTYVFWNGHEPTQGNYYFQDRYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNY 137
Query: 127 GGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG 186
GG PVWL Y+PG FR D PFK M KF IV MMK EKLF +QGGPIIL+Q+ENE+G
Sbjct: 138 GGFPVWLKYVPGIEFRTDNGPFKAAMHKFTEKIVSMMKAEKLFQTQGGPIILSQIENEFG 197
Query: 187 YYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPS 246
E G GK YA WAA+MAV N GVPW+MC+Q D PDPVINTCN FYC++F P+
Sbjct: 198 PVEWDIGAPGKAYAKWAAQMAVGLNTGVPWVMCKQDDAPDPVINTCNGFYCEKFVPNQNY 257
Query: 247 MPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
PK+WTE W GWF FG P RP+ED+ FSVARF Q GGS NYYMYHGGTNFGRT+GG
Sbjct: 258 KPKMWTEAWTGWFTEFGSAVPTRPAEDLVFSVARFIQSGGSFINYYMYHGGTNFGRTSGG 317
Query: 307 PFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVY 366
F+ TSYDY+APIDEYGL PKWGHL+ LH AIKLCE AL++ + + SLG +QEA V+
Sbjct: 318 -FVATSYDYDAPIDEYGLLNEPKWGHLRGLHKAIKLCEPALVSVDPTVKSLGENQEAHVF 376
Query: 367 ADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVE 426
SG CAAFLAN D V F N Y LP WS+S+LPDCK VFNTA V QSS +
Sbjct: 377 NSISGKCAAFLANYDTTFSAKVSFGNAQYDLPPWSISVLPDCKTAVFNTARVGVQSSQKK 436
Query: 427 MVPENLQPSEASPDNGSKGLKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYT 485
VP WQ + +E A + F K G + + T D +DYLWY
Sbjct: 437 FVPV------------INAFSWQSYIEETASSTDDNTFTKDGLWEQVYLTADASDYLWYM 484
Query: 486 TSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKA 545
T + + NE FLKNG P+L I S GHAL F N +L G+ G+ +P + + L+A
Sbjct: 485 TDVNIGSNEGFLKNGQDPLLTIWSAGHALQVFINGQLSGTVYGSLENPKLTFSKNVKLRA 544
Query: 546 GKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHL 604
G N+I+LLS +VGL N G +E AG+ V + G N GT D+S WTYKIGL+GE L
Sbjct: 545 GVNKISLLSTSVGLPNVGTHFEKWNAGVLGPVTLKGLNEGTRDISKQKWTYKIGLKGEAL 604
Query: 605 GIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEI 664
++ +++ W + QP+TWYK PPG++P+ LDM MGKG+ W+NG+ I
Sbjct: 605 SLHTVSGSSSVEWAQGASLAQKQPMTWYKTTFNVPPGNDPLALDMGAMGKGMVWINGQSI 664
Query: 665 GRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVI 724
GR+WP + C+Y G + KC T CG+PSQRWYH+PRS KPS N+LV+
Sbjct: 665 GRHWPGYIGNGN-----CGGCNYAGTYTEKKCRTYCGKPSQRWYHVPRSRLKPSGNLLVV 719
Query: 725 FEEKGGDPTKITFSIR 740
FEE GG+P I+ R
Sbjct: 720 FEEWGGEPHWISLLKR 735
>gi|297799386|ref|XP_002867577.1| beta-galactosidase 12 [Arabidopsis lyrata subsp. lyrata]
gi|297313413|gb|EFH43836.1| beta-galactosidase 12 [Arabidopsis lyrata subsp. lyrata]
Length = 728
Score = 769 bits (1985), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/733 (51%), Positives = 485/733 (66%), Gaps = 22/733 (3%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
L I + SS+ Y VTYD +++IING+R +++S +IHYPRS P MWP L+Q+AK+GG+
Sbjct: 13 LGILWCSSLIYSVKAMVTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKDGGL 72
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ I++YVFWNGHE SPG+YYF R++LVKFIK++QQA +Y+ LRIGP+V AE+N+GG PV
Sbjct: 73 DVIQTYVFWNGHEPSPGQYYFEDRYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPV 132
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL Y+P VFR D EPFK MQKF IV MMK EKLF +QGGPIIL+Q+ENEYG E
Sbjct: 133 WLKYVPDMVFRTDNEPFKAAMQKFTEKIVGMMKEEKLFETQGGPIILSQIENEYGPIEWE 192
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
G GK Y W AKMA + GVPWIMC+Q D P+ +INTCN FYC+ F P+S PK+W
Sbjct: 193 IGAPGKAYTKWVAKMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNSDKKPKMW 252
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TENW GWF FGG P+RP+EDIA SVARF Q GGS NYYMYHGGTNF RTA G FI T
Sbjct: 253 TENWTGWFTEFGGAVPYRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDRTA-GEFIAT 311
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
SYDY+AP+DEYGLPR PK+ HLK LH IKLCE AL++ + + SLG QEA V+ S
Sbjct: 312 SYDYDAPLDEYGLPREPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQEAQVFKSQS- 370
Query: 372 ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTV--EMVP 429
+CAAFL+N + + V F +Y LP WSVSILPDCK +NTA V+ ++S++ +MVP
Sbjct: 371 SCAAFLSNYNTSSAARVSFGGSTYDLPPWSVSILPDCKTEYYNTAKVQVRTSSIHMKMVP 430
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSII 489
N S S + +EI F + G V+ I+ T+D TDY WY T I
Sbjct: 431 TNTLFSWGSYN-----------EEIPSANDNGTFSQDGLVEQISITRDKTDYFWYLTDIT 479
Query: 490 VNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNE 549
++ +E+FL G P+L I S GHALH F N +L G+A G+ P + I L AG N+
Sbjct: 480 ISPDEKFL-TGEDPLLNIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVNK 538
Query: 550 IALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYN 608
+ALLS+ GL N G YE W + V + G NSGT D+S + W+YKIG +GE L I+
Sbjct: 539 LALLSIAAGLPNVGVHYETWNTGVLGPVTLKGVNSGTWDMSQWKWSYKIGTKGEALSIHT 598
Query: 609 PGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYW 668
+ + W QPLTWYK+ P G+EP+ LDM MGKG W+NG+ IGR+W
Sbjct: 599 VTGSSTVEWKQGSLVATKQPLTWYKSTFDTPAGNEPLALDMNTMGKGQTWINGQNIGRHW 658
Query: 669 PRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEK 728
P + + + C Y G F +KC++ CGE SQRWYH+PRSW KP+ N++V+ EE
Sbjct: 659 PAYTARGK-----CERCSYAGTFTENKCLSNCGEASQRWYHVPRSWLKPTNNLVVVLEEW 713
Query: 729 GGDPTKITFSIRK 741
GG+P I+ R+
Sbjct: 714 GGEPNGISLVKRR 726
>gi|218192153|gb|EEC74580.1| hypothetical protein OsI_10152 [Oryza sativa Indica Group]
Length = 851
Score = 768 bits (1984), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/730 (50%), Positives = 490/730 (67%), Gaps = 31/730 (4%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD ++++++G+R ++ S +IHYPRS P MW GL+++AK+GG++ I++YVFWNGHE +P
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G Y F GR++LV+FIK +Q+A M++ LRIGP++ E+N+GG PVWL Y+PG FR D EP
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQ----------VENEYGYYESFYGEGGK 197
FK MQ F IV MMK E LFASQGGPIIL+Q +ENEYG +G GK
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQASAKLCFPCHIENEYGPEGKEFGAAGK 206
Query: 198 RYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPG 257
Y WAAKMAV + GVPW+MC++ D PDPVIN CN FYCD F+P+ P P +WTE W G
Sbjct: 207 AYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSG 266
Query: 258 WFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEA 317
WF FGG RP ED+AF VARF QKGGS NYYMYHGGTNFGRTAGGPFITTSYDY+A
Sbjct: 267 WFTEFGGTIRQRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDA 326
Query: 318 PIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFL 377
P+DEYGL R PK+GHLKELH A+KLCE L++ + + +LGS QEA V+ SSG CAAFL
Sbjct: 327 PLDEYGLAREPKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVFRSSSG-CAAFL 385
Query: 378 ANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEA 437
AN + + V+F N +Y LP WS+SILPDCK VVFNTA V Q++ ++M +
Sbjct: 386 ANYNSNSYAKVIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTNQMQMWAD------- 438
Query: 438 SPDNGSKGLKWQVFKEIAGIWGEADFVKS-GFVDHINTTKDTTDYLWYTTSIIVNENEEF 496
G+ + W+ + E A + S G ++ +N T+DT+DYLWY TS+ V+ +E+F
Sbjct: 439 ----GASSMMWEKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPSEKF 494
Query: 497 LKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMT 556
L+ G+ L ++S GHALH F N +LQGSA G Y +L+AG N++ALLS+
Sbjct: 495 LQGGTPLSLTVQSAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVA 554
Query: 557 VGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNI 615
GL N G YE W + V I G + G+ DL+ +W+Y++GL+GE + + + ++
Sbjct: 555 CGLPNVGVHYETWNTGVVGPVVIHGLDEGSRDLTWQTWSYQVGLKGEQMNLNSLEGSGSV 614
Query: 616 NWVS-TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRK 674
W+ ++ QPL WY+A P GDEP+ LDM MGKG W+NG+ IGRYW
Sbjct: 615 EWMQGSLVAQNQQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYW-----T 669
Query: 675 SSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTK 734
+ +C + C Y G + KC GCG+P+QRWYH+PRSW +P+ N+LV+FEE GGD +K
Sbjct: 670 AYAEGDC-KGCHYTGSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSK 728
Query: 735 ITFSIRKISG 744
I + R +SG
Sbjct: 729 IALAKRTVSG 738
>gi|222624250|gb|EEE58382.1| hypothetical protein OsJ_09539 [Oryza sativa Japonica Group]
Length = 851
Score = 768 bits (1983), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/730 (50%), Positives = 490/730 (67%), Gaps = 31/730 (4%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD ++++++G+R ++ S +IHYPRS P MW GL+++AK+GG++ I++YVFWNGHE +P
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G Y F GR++LV+FIK +Q+A M++ LRIGP++ E+N+GG PVWL Y+PG FR D EP
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQ----------VENEYGYYESFYGEGGK 197
FK MQ F IV MMK E LFASQGGPIIL+Q +ENEYG +G GK
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQASAKLCFPCHIENEYGPEGKEFGAAGK 206
Query: 198 RYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPG 257
Y WAAKMAV + GVPW+MC++ D PDPVIN CN FYCD F+P+ P P +WTE W G
Sbjct: 207 AYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSG 266
Query: 258 WFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEA 317
WF FGG RP ED+AF VARF QKGGS NYYMYHGGTNFGRTAGGPFITTSYDY+A
Sbjct: 267 WFTEFGGTIRQRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDA 326
Query: 318 PIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFL 377
P+DEYGL R PK+GHLKELH A+KLCE L++ + + +LGS QEA V+ SSG CAAFL
Sbjct: 327 PLDEYGLAREPKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVFRSSSG-CAAFL 385
Query: 378 ANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEA 437
AN + + V+F N +Y LP WS+SILPDCK VVFNTA V Q++ ++M +
Sbjct: 386 ANYNSNSYAKVIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTNQMQMWAD------- 438
Query: 438 SPDNGSKGLKWQVFKEIAGIWGEADFVKS-GFVDHINTTKDTTDYLWYTTSIIVNENEEF 496
G+ + W+ + E A + S G ++ +N T+DT+DYLWY TS+ V+ +E+F
Sbjct: 439 ----GASSMMWEKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPSEKF 494
Query: 497 LKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMT 556
L+ G+ L ++S GHALH F N +LQGSA G Y +L+AG N++ALLS+
Sbjct: 495 LQGGTPLSLTVQSAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVA 554
Query: 557 VGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNI 615
GL N G YE W + V I G + G+ DL+ +W+Y++GL+GE + + + ++
Sbjct: 555 CGLPNVGVHYETWNTGVVGPVVIHGLDEGSRDLTWQTWSYQVGLKGEQMNLNSLEGSGSV 614
Query: 616 NWVS-TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRK 674
W+ ++ QPL WY+A P GDEP+ LDM MGKG W+NG+ IGRYW
Sbjct: 615 EWMQGSLVAQNQQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYW-----T 669
Query: 675 SSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTK 734
+ +C + C Y G + KC GCG+P+QRWYH+PRSW +P+ N+LV+FEE GGD +K
Sbjct: 670 AYAEGDC-KGCHYTGSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSK 728
Query: 735 ITFSIRKISG 744
I + R +SG
Sbjct: 729 IALAKRTVSG 738
>gi|414864995|tpg|DAA43552.1| TPA: hypothetical protein ZEAMMB73_935084 [Zea mays]
Length = 845
Score = 768 bits (1983), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/720 (50%), Positives = 490/720 (68%), Gaps = 23/720 (3%)
Query: 29 TYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPG 88
TYD ++++I+G+R ++ S +IHYPRS P MW GL+Q+AK+GG++ I++YVFWNGHE +PG
Sbjct: 30 TYDKKAVLIDGQRRILFSGSIHYPRSTPDMWEGLIQKAKDGGLDVIQTYVFWNGHEPTPG 89
Query: 89 KYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPF 148
YYF R++LV+F+K +Q+A +++ LRIGP++ E+N+GG PVWL Y+PG FR D EPF
Sbjct: 90 NYYFEERYDLVRFVKTVQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEPF 149
Query: 149 KYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAV 208
K MQ F IV MMK E LFASQGGPIIL+Q+ENEYG +G G+ Y WAAKMAV
Sbjct: 150 KTAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGQAYINWAAKMAV 209
Query: 209 AQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPH 268
+ GVPW+MC++ D PDPVIN CN FYCD F+P+ P P +WTE W GWF FGG
Sbjct: 210 GLDTGVPWVMCKEEDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQ 269
Query: 269 RPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNP 328
RP ED+AF+VARF QKGGS NYYMYHGGTNFGRTAGGPFITTSYDY+APIDEYGL R P
Sbjct: 270 RPVEDLAFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIREP 329
Query: 329 KWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTV 388
K HLKELH A+KLCE AL++ + + +LG+ QEA V+ SG CAAFLAN + + V
Sbjct: 330 KHSHLKELHRAVKLCEQALVSVDPTITTLGTMQEAHVFRSPSG-CAAFLANYNSNSHAKV 388
Query: 389 VFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKW 448
VF N Y LP WS+SILPDCK VVFN+A V Q+S ++M +G+ + W
Sbjct: 389 VFNNEQYSLPPWSISILPDCKNVVFNSATVGVQTSQMQMW-----------GDGATSMMW 437
Query: 449 QVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR-PVLL 506
+ + +E+ + +G ++ +N T+D++DYLWY TS+ ++ +E FL+ G + P L
Sbjct: 438 ERYDEEVDSLAAAPLLTTTGLLEQLNVTRDSSDYLWYITSVDISPSENFLQGGGKPPSLS 497
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
++S GHALH F N +LQGS+ G KY ++L+AG N+IALLS+ GL N G Y
Sbjct: 498 VQSAGHALHVFVNGQLQGSSYGTREDRRIKYNGNVNLRAGTNKIALLSVACGLPNVGVHY 557
Query: 567 EWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVS-TMEPP 624
E G+ V + G N G+ DL+ +W+Y++GL+GE + + + ++ W+ ++
Sbjct: 558 ETWNTGVGGPVVLHGLNEGSRDLTWQTWSYQVGLKGEQMNLNSVEGSGSVEWMQGSLIAQ 617
Query: 625 KNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
K QPL WYKA + P GDEP+ LDM MGKG W+NG+ IGRYW ++ D +
Sbjct: 618 KQQPLAWYKAYFETPSGDEPLALDMGSMGKGQVWINGQSIGRYW------TAYADGDCKG 671
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE-KGGDPTKITFSIRKIS 743
C Y G F KC GCG+P+QRWYH+PRSW +PS N+LV+ EE GGD +KI + R +S
Sbjct: 672 CSYTGTFRAPKCQAGCGQPTQRWYHVPRSWLQPSRNLLVVLEELGGGDSSKIALAKRSVS 731
>gi|297793199|ref|XP_002864484.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297310319|gb|EFH40743.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 726
Score = 768 bits (1983), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/733 (51%), Positives = 483/733 (65%), Gaps = 24/733 (3%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
L+I S+ +V+YD +++IING+R +++S +IHYPRS P MWPGL+Q+AKEGG+
Sbjct: 13 LVILCCLSLVCIVKASVSYDRKAVIINGQRRILLSGSIHYPRSTPEMWPGLIQKAKEGGL 72
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ IE+YVFWNGHE SPG+YYFG R++LVKFIK++ QA +Y+ LRIGP+V AE+N+GG PV
Sbjct: 73 DVIETYVFWNGHEPSPGQYYFGDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPV 132
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILA--QVENEYGYYE 189
WL ++PG FR D EPFK M+KF IV MMK EKLF +QGGPIILA Q+ENEYG E
Sbjct: 133 WLKFVPGMAFRTDNEPFKAAMKKFTEKIVWMMKAEKLFQTQGGPIILAQGQIENEYGPVE 192
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G GK Y W A+MA+ + GVPWIMC+Q D P P+I+TCN +YC+ F P+S + PK
Sbjct: 193 WEIGAPGKAYTKWVAQMALGLSTGVPWIMCKQEDAPSPIIDTCNGYYCEDFKPNSSNKPK 252
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTENW GW+ FGG P+RP EDIA+SVARF QKGGS NYYMYHGGTNF RTA G F+
Sbjct: 253 MWTENWTGWYTEFGGAVPYRPVEDIAYSVARFIQKGGSFVNYYMYHGGTNFDRTA-GEFM 311
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
+SYDY+AP+DEYGLPR PK+ HLK LH IKL E ALL+ + + SLG+ QEA V+
Sbjct: 312 ASSYDYDAPLDEYGLPREPKYSHLKALHKVIKLSEPALLSADATVTSLGAKQEAYVFWSK 371
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
S +CAAFL+N D+ + V+FR Y LP WSVSILPDCK +NTA V A S MVP
Sbjct: 372 S-SCAAFLSNKDESSAARVMFRGFPYVLPPWSVSILPDCKTEFYNTAKVNAPSVHRNMVP 430
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEIAGIWGEA-DFVKSGFVDHINTTKDTTDYLWYTTSI 488
+ W F E EA F ++G V+ I+ T D +DY WY T I
Sbjct: 431 TGAR------------FSWGSFNEATPTANEAGTFARNGLVEQISMTWDKSDYFWYLTDI 478
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
+ E FLK G P+ + S GHALH F N +L G+A G HP + I L AG N
Sbjct: 479 TIGSGETFLKTGDFPLFTVMSAGHALHVFVNGQLSGTAYGGLDHPKLTFTQKIKLHAGVN 538
Query: 549 EIALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
++ALLS+ VGL N G +E W + V + G NSGT D+S + W+YKIG++GE L ++
Sbjct: 539 KLALLSVAVGLPNVGTHFEQWNKGVLGPVTLKGVNSGTWDMSKWKWSYKIGVKGEALSLH 598
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+ + W K QPLTWYK+ P G+EP+ LDM MGKG W+NG IGR+
Sbjct: 599 TDTESSGVRWTQGSFVAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQVWINGRNIGRH 658
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
WP + S C+Y G FN KC++ CGE SQRWYH+PRSW K S+N++V+FEE
Sbjct: 659 WPAYKAQGS-----CGRCNYAGTFNAKKCLSNCGEASQRWYHVPRSWLK-SQNLIVVFEE 712
Query: 728 KGGDPTKITFSIR 740
GGDP I+ R
Sbjct: 713 WGGDPNGISLVKR 725
>gi|195617466|gb|ACG30563.1| beta-galactosidase precursor [Zea mays]
Length = 723
Score = 767 bits (1981), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/714 (52%), Positives = 470/714 (65%), Gaps = 21/714 (2%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD R+++ING+R ++IS +IHYPRS P MWPGL+Q+AK+GG++ +++YVFWNGHE
Sbjct: 28 VSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTYVFWNGHEPVR 87
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G+YYFG R++LV+F+K+ +QA +Y+ LRIGP+V AE+N+GG PVWL Y+PG FR D P
Sbjct: 88 GQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGP 147
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK MQ F+ IV MMK E LF QGGPIILAQVENEYG ES G G K YA WAAKMA
Sbjct: 148 FKAAMQAFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGAGAKPYANWAAKMA 207
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
VA GVPW+MC+Q D PDPVINTCN FYCD F+P+S S P +WTE W GWF FGG P
Sbjct: 208 VATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNSNSKPTMWTEAWTGWFTAFGGAVP 267
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
HRP ED+AF+VARF QKGGS NYYMYHGGTNF RT+GGPFI TSYDY+APIDEYGL R
Sbjct: 268 HRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGLLRQ 327
Query: 328 PKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKT 387
PKWGHL++LH AIK E AL++G+ + SLG+ ++A V+ S GACAAFL+N
Sbjct: 328 PKWGHLRDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSSGGACAAFLSNYHTSAAAR 387
Query: 388 VVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLK 447
VVF Y LPAWS+S+LPDCK VFNTA V S+ M P + G
Sbjct: 388 VVFNGRRYDLPAWSISVLPDCKAAVFNTATVSEPSAPARMSP-------------AGGFS 434
Query: 448 WQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLI 507
WQ + E F K G V+ ++ T D +DYLWYTT + +N NE+FLK+G P L +
Sbjct: 435 WQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQLTV 494
Query: 508 ESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYE 567
S GH+L F N + G+ G P Y + + G N+I++LS VGL N G YE
Sbjct: 495 YSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGTHYE 554
Query: 568 WVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKN 626
G+ V ++G N G DLS WTY+IGL GE LG+ + +++ W S
Sbjct: 555 TWNVGVLGPVTLSGLNEGKRDLSNQKWTYQIGLHGESLGVQSVAGSSSVEWGSAA---GK 611
Query: 627 QPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECD 686
QPLTW+KA P GD P+ LDM MGKG AW+NG IGRYW K+ S C
Sbjct: 612 QPLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYWSYKASSSG----GCGGCS 667
Query: 687 YRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
Y G ++ KC TGCG+ SQR+YH+PRSW PS N+LV+ EE GGD + R
Sbjct: 668 YAGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVLLEEFGGDLPGVKLVTR 721
>gi|33521214|gb|AAQ21369.1| beta-galactosidase [Sandersonia aurantiaca]
Length = 826
Score = 767 bits (1981), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/718 (53%), Positives = 489/718 (68%), Gaps = 24/718 (3%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NV YDSR++ ING+R +++S +IHYPRS P MWP L+Q+AK+GG++ I++YVFWNGHE S
Sbjct: 25 NVWYDSRAITINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPS 84
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PGKYYF G ++LV+FIK++QQ +Y+ LRIGP+V AE+N+GG PVWL Y+PG FR D E
Sbjct: 85 PGKYYFEGNYDLVRFIKLVQQGGLYLHLRIGPYVCAEWNFGGFPVWLKYVPGIHFRTDNE 144
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK M+KF + IV+MMK EKLF QGGPIIL+Q+ENE+G E G K YA WAAKM
Sbjct: 145 PFKAEMEKFTSHIVNMMKAEKLFHWQGGPIILSQIENEFGPLEYDQGAPAKAYAAWAAKM 204
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
AV GVPW+MC++ D PDPVINT N FY D F P+ P +WTENW GWF +G
Sbjct: 205 AVDLETGVPWVMCKEDDAPDPVINTWNGFYADGFYPNKRYKPMMWTENWTGWFTGYGVPV 264
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
PHRP ED+AFSVA+F QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+AP+DEYG+ R
Sbjct: 265 PHRPVEDLAFSVAKFVQKGGSYVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGMLR 324
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDK 386
PK+GHL +LH AIKLCE AL++G SLG++QE++V+ +SGACAAFLAN D K
Sbjct: 325 QPKYGHLTDLHKAIKLCEPALVSGYPVVTSLGNNQESNVFRSNSGACAAFLANYDTKYYA 384
Query: 387 TVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL 446
TV F + Y+LP WS+SILPDCK VFNTA V AQ++ ++M G
Sbjct: 385 TVTFNGMRYNLPPWSISILPDCKTTVFNTARVGAQTTQMQMTTVG-------------GF 431
Query: 447 KWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
W + E + F K G V+ I+ T+D+TDYLWYTT + +++NE+FLKNG PVL
Sbjct: 432 SWVSYNEDPNSIDDGSFTKLGLVEQISMTRDSTDYLWYTTYVNIDQNEQFLKNGQYPVLT 491
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
+S GH+LH F N +L G+A G+ P Y + L AG N+I+ LS+ VGL N G +
Sbjct: 492 AQSAGHSLHVFINGQLIGTAYGSVEDPRLTYTGNVKLFAGSNKISFLSIAVGLPNVGEHF 551
Query: 567 E-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPK 625
E W + V + G N G DL+ WTYKIGL+GE L ++ +N+ W + +
Sbjct: 552 ETWNTGLLGPVTLNGLNEGKRDLTWQKWTYKIGLKGEALSLHTLSGSSNVEW---GDASR 608
Query: 626 NQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPR-KSRKSSPHDECVQE 684
QPL WYK P G EP+ LDM MGKG W+NG+ IGRYWP K+R S P +
Sbjct: 609 KQPLAWYKGFFNAPGGSEPLALDMSTMGKGQVWINGQSIGRYWPAYKARGSCP------K 662
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
CDY G + KC + CG+ SQRWYH+PRSW P+ N++V+FEE GG+PT I+ R +
Sbjct: 663 CDYEGTYEETKCQSNCGDSSQRWYHVPRSWLNPTGNLIVVFEEWGGEPTGISLVKRSM 720
>gi|224116208|ref|XP_002317239.1| predicted protein [Populus trichocarpa]
gi|222860304|gb|EEE97851.1| predicted protein [Populus trichocarpa]
Length = 849
Score = 767 bits (1980), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/713 (51%), Positives = 476/713 (66%), Gaps = 11/713 (1%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD ++L+I+G+R ++ S +IHYPR+ P +WP +++++KEGG++ IE+YVFWN HE
Sbjct: 36 VTYDHKALVIDGKRRVLQSGSIHYPRTTPEVWPEIIRKSKEGGLDVIETYVFWNYHEPVR 95
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G+YYF GRF+LV+F+K +Q+A +++ LRIGP+ AE+NYGG P+WLH+IPG FR +
Sbjct: 96 GQYYFEGRFDLVRFVKTVQEAGLFVHLRIGPYACAEWNYGGFPLWLHFIPGVQFRTSNDI 155
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK M+ F+T IVD+MK + LFASQGGPIILAQVENEYG + YG GG+ Y WAA+ A
Sbjct: 156 FKNAMKSFLTKIVDLMKDDNLFASQGGPIILAQVENEYGNVQWAYGVGGELYVKWAAETA 215
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
++ N VPW+MC Q D PDPVINTCN FYCDQFTP+SPS PK+WTEN+ GWF FG P
Sbjct: 216 ISLNTTVPWVMCVQEDAPDPVINTCNGFYCDQFTPNSPSKPKMWTENYSGWFLAFGYAVP 275
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
+RP ED+AF+VARFF+ GGS NYYMY GGTNFGRTAGGP + TSYDY+APIDEYG R
Sbjct: 276 YRPVEDLAFAVARFFEYGGSFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 335
Query: 328 PKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKT 387
PKWGHL++LH AIK CE L++ + + LG+ EA VY S CAAFLAN D +D
Sbjct: 336 PKWGHLRDLHSAIKQCEEYLVSSDPVHQQLGNKLEAHVYYKHSNDCAAFLANYDSGSDAN 395
Query: 388 VVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLK 447
V F +Y LPAWSVSIL DCK V+FNTA V Q + + S N
Sbjct: 396 VTFNGNTYFLPAWSVSILADCKNVIFNTAKVVTQRHIGDAL---FSRSTTVDGNLVAASP 452
Query: 448 WQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLI 507
W +KE GIWG F K G ++ INTTKDT+D+LWY+TS+ V ++ +L I
Sbjct: 453 WSWYKEEVGIWGNNSFTKPGLLEQINTTKDTSDFLWYSTSLYVEAGQD-----KEHLLNI 507
Query: 508 ESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYE 567
ES GHA F N+ GN F ISL+ G N + +LSM +G+QN GP+++
Sbjct: 508 ESLGHAALVFVNKRFVAFGYGNHDDASFSLTREISLEEGNNTLDVLSMLIGVQNYGPWFD 567
Query: 568 WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQ 627
GAGI SV + + DLS+ WTY++GL+GE+LG+ N N+ W P N+
Sbjct: 568 VQGAGIHSVFLVDLHKSKKDLSSGKWTYQVGLEGEYLGLDNVSLANSSLWSQGTSLPVNK 627
Query: 628 PLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDY 687
L WYKA + P G+ P+ L++ MGKG AW+NG+ IGRYW S SP C CDY
Sbjct: 628 SLIWYKATIIAPEGNGPLALNLASMGKGQAWINGQSIGRYW---SAYLSPSAGCTDNCDY 684
Query: 688 RGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
RG +N KC CG+P+Q YHIPR+W P EN+LV+ EE GGDP++I+ R
Sbjct: 685 RGAYNSFKCQKKCGQPAQTLYHIPRTWVHPGENLLVLHEELGGDPSQISLLTR 737
>gi|255546099|ref|XP_002514109.1| beta-galactosidase, putative [Ricinus communis]
gi|223546565|gb|EEF48063.1| beta-galactosidase, putative [Ricinus communis]
Length = 827
Score = 766 bits (1977), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/735 (50%), Positives = 484/735 (65%), Gaps = 23/735 (3%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
F L+ F ++ V YD +++ IN +R ++IS +IHYPRS P MWPGL+Q+AKEG
Sbjct: 7 FISLLLFVTAWVCNVTATVWYDHKAITINNQRRILISGSIHYPRSTPEMWPGLIQKAKEG 66
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G+ I++YVFWNGHE SPG+YYF R++LVKFIK++QQA +Y+ LRIGP+V AE+N+GG
Sbjct: 67 GIEVIQTYVFWNGHEPSPGQYYFQDRYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGF 126
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
P+WL Y+PG FR D PFK MQKF+TLIV+MMK +KLF +QGGPIIL+Q+ENEYG E
Sbjct: 127 PMWLKYVPGIEFRTDNGPFKAAMQKFVTLIVNMMKEQKLFQTQGGPIILSQIENEYGPVE 186
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G GK Y WAA MA N GVPWIMC+Q D PDP I+TCN FYC+ + P++ + PK
Sbjct: 187 WTIGAPGKAYTKWAAAMATGLNTGVPWIMCKQEDAPDPTIDTCNGFYCEGYKPNNYNKPK 246
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTENW GW+ +G P+RP ED AFSVARF GS NYYMYHGGTNF RTA G F+
Sbjct: 247 VWTENWTGWYTEWGASVPYRPPEDTAFSVARFIAASGSFVNYYMYHGGTNFDRTA-GLFM 305
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TSYDY+AP+DEYGL +PKWGHL++LH AIK E AL++ + + +SLG +QEA V+
Sbjct: 306 ATSYDYDAPLDEYGLTHDPKWGHLRDLHRAIKQSERALVSADPTVISLGKNQEAHVFQSK 365
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
G CAAFLAN D + V F N Y LP WS+S+LPDCK VV+NTA + AQS+ M+P
Sbjct: 366 MG-CAAFLANYDTQYSARVNFWNKPYSLPRWSISVLPDCKTVVYNTAKISAQSTQKWMMP 424
Query: 430 ENLQPSEASPDNGSKGLKWQV-FKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
+ G WQ E+ + F K G + T D TDYLWY T +
Sbjct: 425 V------------ASGFSWQSHIDEVPVGYSAGTFTKVGLWEQKYLTGDKTDYLWYMTDV 472
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
+N NE FL++G P L + S GH LH F N L GSA G+ +P + + L G N
Sbjct: 473 TINSNEGFLRSGKNPFLTVASAGHVLHVFINGHLAGSAYGSLENPKLTFSQNVKLVGGVN 532
Query: 549 EIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
+IALLS TVGL N G Y+ G+ V + G N GTLD++ + W+YKIGL+GE L ++
Sbjct: 533 KIALLSATVGLANVGVHYDTWNVGVLGPVTLQGLNQGTLDMTKWKWSYKIGLKGEDLKLF 592
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+ G N+ W + K PLTWYK + PPG++P+ L M MGKG ++NG IGR+
Sbjct: 593 SGG--ANVGWAQGAQLAKKTPLTWYKTFINAPPGNDPVALYMGSMGKGQMYINGRSIGRH 650
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
WP + K + D CDY G ++ KC +GCG+P Q+WYH+PRSW KP+ N+LV+FEE
Sbjct: 651 WPAYTAKGNCKD-----CDYAGYYDDQKCRSGCGQPPQQWYHVPRSWLKPTGNLLVVFEE 705
Query: 728 KGGDPTKITFSIRKI 742
GGDPT I+ R +
Sbjct: 706 MGGDPTGISLVKRVV 720
>gi|3641863|emb|CAA06309.1| beta-galactosidase [Cicer arietinum]
Length = 730
Score = 766 bits (1977), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/730 (50%), Positives = 473/730 (64%), Gaps = 18/730 (2%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
L+ F + +VTYD ++++ING+R ++IS +IHYPRS P MWP L+Q+AK+GGV+
Sbjct: 16 LVLFLCLFVFSVTASVTYDHKAIVINGQRRILISGSIHYPRSTPQMWPDLIQKAKDGGVD 75
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
I++YVFWNGHE SPG YYF RF+LVKF+K++QQA +Y+ LRIGP+V AE+N+GG PVW
Sbjct: 76 VIQTYVFWNGHEPSPGNYYFEDRFDLVKFVKVVQQAGLYVNLRIGPYVCAEWNFGGFPVW 135
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFY 192
L Y+PG FR D EPFK MQKF IV MMK E LF SQGGPII++Q+ENEYG E
Sbjct: 136 LKYVPGVAFRTDNEPFKAAMQKFTAKIVSMMKAENLFESQGGPIIMSQIENEYGPVEWEI 195
Query: 193 GEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWT 252
G GK Y W ++MA+ + GVPWIMC+Q D PDP+I+TCN +YC+ FTP+ PK+WT
Sbjct: 196 GAPGKAYTKWFSQMAIGLDTGVPWIMCKQEDAPDPIIDTCNGYYCENFTPNKNYKPKMWT 255
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTS 312
ENW GW+ FG P+RP++D+AFSVARF Q GS NYYMYHGGTNFGRT+ G FI TS
Sbjct: 256 ENWSGWYTDFGSAVPYRPAQDVAFSVARFIQNRGSYVNYYMYHGGTNFGRTSAGLFIATS 315
Query: 313 YDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGA 372
YDY+APIDEYGL PKWGHL+ LH AIK CE L++ + + G + E VY S+GA
Sbjct: 316 YDYDAPIDEYGLLSEPKWGHLRNLHKAIKQCEPILVSVDPTVSWPGKNLEVHVYKTSTGA 375
Query: 373 CAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENL 432
CAAFLAN D + V F N Y LP WS+SILPDCK VFNTA V TV +
Sbjct: 376 CAAFLANYDTTSPAKVTFGNGQYDLPPWSISILPDCKTAVFNTAKV----GTVPSFHRKM 431
Query: 433 QPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKS-GFVDHINTTKDTTDYLWYTTSIIVN 491
P ++ D WQ + E G D + ++ I T+D++DYLWY T + ++
Sbjct: 432 TPVSSAFD-------WQSYNEAPASSGIDDSTTANALLEQIKVTRDSSDYLWYMTDVNIS 484
Query: 492 ENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIA 551
NE F+KNG PVL S GH LH F N + G+A G +P + N + L+ G N+I+
Sbjct: 485 PNEGFIKNGQYPVLTAMSAGHVLHVFVNGQFSGTAYGGLENPKLTFSNSVKLRVGNNKIS 544
Query: 552 LLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPG 610
LLS+ VGL N G YE G+ V + G N GT DLS W+YKIGL+GE L ++
Sbjct: 545 LLSVAVGLSNVGLHYETWNVGVLGPVTLKGLNEGTRDLSGQKWSYKIGLKGETLNLHTLI 604
Query: 611 YRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPR 670
+++ W K QPLTWYKA P G++P+ LDM MGKG W+NGE IGR+WP
Sbjct: 605 GSSSVQWTKGSSLVKKQPLTWYKATFDAPAGNDPLALDMSSMGKGEIWVNGESIGRHWPA 664
Query: 671 KSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGG 730
+ S C+Y G F KC T CG+P+Q+WYHIPRSW P N LV+ EE GG
Sbjct: 665 YIARGS-----CGGCNYAGTFTDKKCRTSCGQPTQKWYHIPRSWVNPRGNFLVVLEEWGG 719
Query: 731 DPTKITFSIR 740
DP+ I+ R
Sbjct: 720 DPSGISLVKR 729
>gi|7682677|gb|AAF67341.1| beta galactosidase [Vigna radiata]
Length = 721
Score = 765 bits (1976), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/733 (50%), Positives = 482/733 (65%), Gaps = 25/733 (3%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
LL F+ +T +VTYD ++++I+G+R ++IS +IHYPRS P MWP L+Q+AK+G
Sbjct: 11 LMLLFFWVCGVT----ASVTYDHKAIVIDGKRRILISGSIHYPRSTPQMWPDLIQKAKDG 66
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ I++YVFWNGHE SPGKYYF R++LV+F+K+ QQA +Y+ LRIGP++ AE+N+GG
Sbjct: 67 GLDVIQTYVFWNGHEPSPGKYYFEDRYDLVRFVKLAQQAGLYVHLRIGPYICAEWNFGGF 126
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWL Y+PG FR D EPFK MQKF IV +MK E+LF SQGGPIIL+Q+ENEYG E
Sbjct: 127 PVWLKYVPGIAFRTDNEPFKAAMQKFTAKIVSLMKEERLFQSQGGPIILSQIENEYGPVE 186
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
G GK Y WAA+MAV + GVPW+MC+Q D PDPVI+TCN FYC+ F P+ + PK
Sbjct: 187 WEIGAPGKSYTKWAAQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGFYCENFKPNKNTKPK 246
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTENW GW+ FGG P RP+ED+AFSVARF Q GGS NYYMYHGGTNFGRT+GG FI
Sbjct: 247 MWTENWTGWYTDFGGASPIRPAEDLAFSVARFIQNGGSFVNYYMYHGGTNFGRTSGGLFI 306
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TSYDY+AP+DEYGL PKWGHL+ LH AIK E AL++ + SLG + EA V++ +
Sbjct: 307 ATSYDYDAPLDEYGLQNEPKWGHLRALHKAIKQSEPALVSTDPKVTSLGYNLEAHVFS-T 365
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
GACAAF+AN D K+ F + Y LP WS+SILPDCK VV+NTA V +M P
Sbjct: 366 PGACAAFIANYDTKSSAKATFGSGQYDLPPWSISILPDCKTVVYNTARV-GNGWVKKMTP 424
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKS-GFVDHINTTKDTTDYLWYTTSI 488
N G WQ + E + D + + + +N T+D++DYLWY T +
Sbjct: 425 VN------------SGFAWQSYNEEPASSSQDDSIAAEALWEQVNVTRDSSDYLWYMTDV 472
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
+N NE FLKNG PVL + S GH LH F N +L G+ G +P + + ++L+ G N
Sbjct: 473 YINGNEGFLKNGRSPVLTVMSAGHLLHVFINGQLSGTVYGGLGNPKLTFSDNVNLRVGNN 532
Query: 549 EIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
+++LLS+ VGL N G +E AG+ V + G N GT DLS W+YK+GL+GE L ++
Sbjct: 533 KLSLLSVAVGLPNVGVHFETWNAGVLGPVTLKGLNEGTRDLSRQKWSYKVGLKGEALNLH 592
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+++ W+ K QPLTWYKA P G++P+ LD+ MGKG W+NG IGR+
Sbjct: 593 TESGSSSVEWIQGSLVAKKQPLTWYKATFSAPAGNDPLALDLGSMGKGEVWVNGRSIGRH 652
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
WP H C C+Y G + KC T CG+PSQRWYH+PRSW N LV+FEE
Sbjct: 653 WP----GYIAHGSC-NACNYAGYYTDQKCRTNCGKPSQRWYHVPRSWLNSGGNSLVVFEE 707
Query: 728 KGGDPTKITFSIR 740
GGDP I R
Sbjct: 708 WGGDPNGIALVKR 720
>gi|334305536|gb|AEG76892.1| putative beta-galactosidase [Linum usitatissimum]
gi|334305538|gb|AEG76893.1| putative beta-galactosidase [Linum usitatissimum]
Length = 731
Score = 765 bits (1976), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/720 (50%), Positives = 483/720 (67%), Gaps = 21/720 (2%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD +++I+NG+R ++I+ +IHYPRS P MWP L+Q+AK+GG++ I++YVFWNGHE SP
Sbjct: 31 VTYDGKAIIVNGQRRILIAGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPSP 90
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G YYF RF+LVKF+K++QQA +Y+ LRIGP+ AE+N+GG PVWL Y+PG FR D EP
Sbjct: 91 GNYYFEDRFDLVKFVKVVQQAGLYVNLRIGPYACAEWNFGGFPVWLKYVPGMSFRTDNEP 150
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK MQKF IV+MMK+E+LF QGGPIIL+Q+ENEYG E GK YA WAA+MA
Sbjct: 151 FKAAMQKFTEKIVNMMKQEQLFEPQGGPIILSQIENEYGPIEWELKAPGKAYAQWAAQMA 210
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
V N GVPWI C+Q D PDP+I+TCN++YC++FTP+ PK+WTE W WF ++G
Sbjct: 211 VGLNTGVPWIACKQEDAPDPLIDTCNAYYCEKFTPNKSYKPKMWTEAWTAWFTSWGNPVL 270
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
+RP+ED AFSV +F Q GGS NYYMYHGGTNFGRTAGGPF+ TSYDY+AP+DEYGL +
Sbjct: 271 YRPAEDQAFSVLKFIQSGGSYANYYMYHGGTNFGRTAGGPFVATSYDYDAPLDEYGLTND 330
Query: 328 PKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKT 387
PK+ HLK +H AIK E AL++ + + SLG++QEA VY+ SSG CAAFLAN D
Sbjct: 331 PKYTHLKHMHKAIKQSEKALVSADATVTSLGTNQEAHVYSSSSG-CAAFLANYDVSYSVK 389
Query: 388 VVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLK 447
V F + Y LPAWS+SILPDCK V+NTA V A +M P G
Sbjct: 390 VNFGSGQYDLPAWSISILPDCKTEVYNTAKVLAPRVHKKMTPLG-------------GFT 436
Query: 448 WQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
W + E+A + + G + + TKD++DYLWY + + +E FL NG P L
Sbjct: 437 WDSYIDEVASGFASDTTTEDGLWEQLYMTKDSSDYLWYMQDVKIGSDEAFLTNGKDPFLN 496
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
++S GH L+ F N +L GSA G+ +P + + L G N+IALLS +VGL N G +
Sbjct: 497 VQSAGHFLNVFVNGKLIGSAYGSNDNPKLTFSQSVKLNVGVNKIALLSASVGLANVGLHF 556
Query: 567 EWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPK 625
E G+ V +TG N GT+D++ + W+YK+G+QGE L + +++ WV K
Sbjct: 557 ENYNVGVLGPVTLTGLNQGTVDMTKWKWSYKVGVQGEKLQLNTVAGSSSVEWVKGSMLAK 616
Query: 626 NQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQEC 685
QPLTWYK+ P G++P+ LDM+ MGKG W+NG+ IGRYWP + + + C
Sbjct: 617 KQPLTWYKSTFNAPEGNDPVALDMISMGKGQIWINGQGIGRYWPAYTAQGN-----CGGC 671
Query: 686 DYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKISGF 745
Y G F KC+TGCG+P+QRWYH+PRSW KP+ N+LV+FEE GGDPT I+ R + G
Sbjct: 672 SYGGYFTEKKCLTGCGQPTQRWYHVPRSWLKPTGNLLVVFEEWGGDPTGISMVKRTLPGM 731
>gi|30687121|ref|NP_849553.1| beta-galactosidase 12 [Arabidopsis thaliana]
gi|75265630|sp|Q9SCV0.1|BGL12_ARATH RecName: Full=Beta-galactosidase 12; Short=Lactase 12; Flags:
Precursor
gi|6686896|emb|CAB64748.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332659762|gb|AEE85162.1| beta-galactosidase 12 [Arabidopsis thaliana]
Length = 728
Score = 765 bits (1976), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/732 (51%), Positives = 484/732 (66%), Gaps = 22/732 (3%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
L I SS+ VTYD +++IING+R +++S +IHYPRS P MWP L+Q+AK+GG+
Sbjct: 13 LGILCCSSLICSVKAIVTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKDGGL 72
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ I++YVFWNGHE SPG+YYF R++LVKFIK++QQA +Y+ LRIGP+V AE+N+GG PV
Sbjct: 73 DVIQTYVFWNGHEPSPGQYYFEDRYDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPV 132
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL Y+PG VFR D EPFK MQKF IV MMK EKLF +QGGPIIL+Q+ENEYG E
Sbjct: 133 WLKYVPGMVFRTDNEPFKAAMQKFTEKIVRMMKEEKLFETQGGPIILSQIENEYGPIEWE 192
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
G GK Y W A+MA + GVPWIMC+Q D P+ +INTCN FYC+ F P+S + PK+W
Sbjct: 193 IGAPGKAYTKWVAEMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNSDNKPKMW 252
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TENW GWF FGG P+RP+EDIA SVARF Q GGS NYYMYHGGTNF RTA G FI T
Sbjct: 253 TENWTGWFTEFGGAVPYRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDRTA-GEFIAT 311
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
SYDY+AP+DEYGLPR PK+ HLK LH IKLCE AL++ + + SLG QEA V+ S
Sbjct: 312 SYDYDAPLDEYGLPREPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQEAHVFKSKS- 370
Query: 372 ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTV--EMVP 429
+CAAFL+N + + V+F +Y LP WSVSILPDCK +NTA V+ ++S++ +MVP
Sbjct: 371 SCAAFLSNYNTSSAARVLFGGSTYDLPPWSVSILPDCKTEYYNTAKVQVRTSSIHMKMVP 430
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSII 489
N S S + +EI F + G V+ I+ T+D TDY WY T I
Sbjct: 431 TNTPFSWGSYN-----------EEIPSANDNGTFSQDGLVEQISITRDKTDYFWYLTDIT 479
Query: 490 VNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNE 549
++ +E+FL G P+L I S GHALH F N +L G+A G+ P + I L AG N+
Sbjct: 480 ISPDEKFL-TGEDPLLTIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVNK 538
Query: 550 IALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYN 608
+ALLS GL N G YE W + V + G NSGT D++ + W+YKIG +GE L ++
Sbjct: 539 LALLSTAAGLPNVGVHYETWNTGVLGPVTLNGVNSGTWDMTKWKWSYKIGTKGEALSVHT 598
Query: 609 PGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYW 668
+ + W K QPLTWYK+ P G+EP+ LDM MGKG W+NG+ IGR+W
Sbjct: 599 LAGSSTVEWKEGSLVAKKQPLTWYKSTFDSPTGNEPLALDMNTMGKGQMWINGQNIGRHW 658
Query: 669 PRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEK 728
P + + + C Y G F KC++ CGE SQRWYH+PRSW KP+ N++++ EE
Sbjct: 659 PAYTARGK-----CERCSYAGTFTEKKCLSNCGEASQRWYHVPRSWLKPTNNLVIVLEEW 713
Query: 729 GGDPTKITFSIR 740
GG+P I+ R
Sbjct: 714 GGEPNGISLVKR 725
>gi|449527779|ref|XP_004170887.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cucumis
sativus]
Length = 716
Score = 764 bits (1974), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/740 (51%), Positives = 490/740 (66%), Gaps = 28/740 (3%)
Query: 3 PRTPIAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGL 62
P+T + +LL + S+I G VTYD +++IIN +R ++IS +IHYPRS P MWP L
Sbjct: 2 PKTVLLFLSLLTWVGSTI-----GAVTYDEKAIIINDQRRILISGSIHYPRSTPQMWPDL 56
Query: 63 VQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAA 122
+Q+AK+GG++ IE+YVFWNGHE S GKYYF R++LV FIK++Q+A +Y+ LRIGP+V A
Sbjct: 57 IQKAKDGGLDIIETYVFWNGHEPSEGKYYFEERYDLVGFIKLVQKAGLYVHLRIGPYVCA 116
Query: 123 EYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVE 182
E+NYGG P+WL ++PG FR D EPFK MQKF+T IVDMMK EKL+ +QGGPIIL+Q+E
Sbjct: 117 EWNYGGFPIWLKFVPGIAFRTDNEPFKAAMQKFVTKIVDMMKLEKLYHTQGGPIILSQIE 176
Query: 183 NEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTP 242
NEYG E G GK Y W A+MAV GVPW+MC+Q D PDP+I+TCN FYC+ F P
Sbjct: 177 NEYGPVEWQIGAPGKSYTKWFAQMAVDLKTGVPWVMCKQEDAPDPLIDTCNGFYCENFKP 236
Query: 243 HSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGR 302
+ PKIWTENW GW+ FGG P+RP ED+AFSVARF Q GS+ NYY+YHGGTNFGR
Sbjct: 237 NQIYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQNNGSLVNYYVYHGGTNFGR 296
Query: 303 TAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQE 362
T+ G FI TSYD++APIDEYGL R PKWGHL++LH AIK CE AL++ + + LG +QE
Sbjct: 297 TS-GLFIATSYDFDAPIDEYGLIREPKWGHLRDLHKAIKSCEPALVSADPTITWLGKNQE 355
Query: 363 ADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQS 422
A V+ SS ACAAFLAN D V F N Y LP WS+SILPDC V FNTA V +S
Sbjct: 356 ARVFKSSS-ACAAFLANYDTSASVKVNFWNNPYDLPPWSISILPDCXTVTFNTAQVGVKS 414
Query: 423 STVEMVPENLQPSEASPDNGSKGLKWQVFKEI-AGIWGEADFVKSGFVDHINTTKDTTDY 481
+M+P + W +KE A + + K+G V+ ++ T DTTDY
Sbjct: 415 YQAKMMPIS-------------SFGWLSYKEEPASAYAKDTTTKAGLVEQVSITWDTTDY 461
Query: 482 LWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPI 541
LWY I ++ E FLK+G P+L + S GH LH F N +L GS G+ P + +
Sbjct: 462 LWYMQDISIDSTEGFLKSGKWPLLSVNSAGHLLHVFINGQLSGSVYGSLEDPAITFSKNV 521
Query: 542 SLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQ 600
LK G N++++LS+TVGL N G ++ AG+ V + G N GT D+S Y W+YK+GL
Sbjct: 522 DLKQGVNKLSMLSVTVGLPNVGLHFDTWNAGVLGPVTLEGLNEGTRDMSKYKWSYKVGLS 581
Query: 601 GEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLN 660
GE L +Y+ N++ W K QPLTWYK K P G+EP+GLDM M KG W+N
Sbjct: 582 GESLNLYSDKGSNSVQWTKGSLTQK-QPLTWYKTTFKTPAGNEPLGLDMSSMSKGQIWIN 640
Query: 661 GEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSEN 720
G+ IGRY+P + +C +C Y G F KC+ CGEPSQ+WYHIPR W PS+N
Sbjct: 641 GQSIGRYFP----GYIANGKC-DKCSYAGLFTEKKCLGNCGEPSQKWYHIPRDWLSPSDN 695
Query: 721 ILVIFEEKGGDPTKITFSIR 740
+LVIFEE GG P I+ R
Sbjct: 696 LLVIFEEIGGSPDGISLVKR 715
>gi|224129140|ref|XP_002328900.1| predicted protein [Populus trichocarpa]
gi|222839330|gb|EEE77667.1| predicted protein [Populus trichocarpa]
Length = 891
Score = 764 bits (1972), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/742 (50%), Positives = 485/742 (65%), Gaps = 41/742 (5%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NVTYD R+LII+GRR ++ SA IHYPR+ P MWP L+ ++KEGG + +++YVFW GHE
Sbjct: 35 NVTYDHRALIIDGRRRILNSAGIHYPRATPEMWPDLIAKSKEGGADVVQTYVFWGGHEPV 94
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+YYF GR++LVKF+K++ ++ +Y+ LRIGP+V AE+N+GG PVWL +PG VFR D
Sbjct: 95 KGQYYFEGRYDLVKFVKLVGESGLYLHLRIGPYVCAEWNFGGFPVWLRDVPGVVFRTDNA 154
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK MQKF+T IVD+M+ E L + QGGPII+ Q+ENEYG E +G+GGK Y WAA M
Sbjct: 155 PFKEEMQKFVTKIVDLMREEMLLSWQGGPIIMFQIENEYGNIEHSFGQGGKEYMKWAAGM 214
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
A+A + GVPW+MC+Q D P+ +I+ CN +YCD F P+SP P WTE+W GW+ T+GGR
Sbjct: 215 ALALDAGVPWVMCKQTDAPENIIDACNGYYCDGFKPNSPKKPIFWTEDWDGWYTTWGGRL 274
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
PHRP ED+AF+VARFFQ+GGS NYYMY GGTNFGRT+GGPF TSYDY+APIDEYGL
Sbjct: 275 PHRPVEDLAFAVARFFQRGGSFQNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 334
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSN-LSLGSSQEADVYA-------------DSSGA 372
PKWGHLK+LH AIKLCE AL+ + + + LG QEA VY S
Sbjct: 335 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGPKQEAHVYGGSLSIQGMNFSQYGSQSK 394
Query: 373 CAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQS--STVEMV-- 428
C+AFLAN+D++ TV F S+ LP WSVSILPDC+ VFNTA V AQ+ TVE V
Sbjct: 395 CSAFLANIDERQAATVRFLGQSFTLPPWSVSILPDCRNTVFNTAKVAAQTHIKTVEFVLP 454
Query: 429 -------PENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDY 481
P+ + +E SP + S W + KE +W E +F G ++H+N TKD +DY
Sbjct: 455 LSNSSLLPQFIVQNEDSPQSTS----WLIAKEPITLWSEENFTVKGILEHLNVTKDESDY 510
Query: 482 LWYTTSIIVNENEEFL--KNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKN 539
LWY T I V++++ KN P + I+S L F N +L GS G+ K
Sbjct: 511 LWYFTRIYVSDDDIAFWEKNKVSPAVSIDSMRDVLRVFINGQLTGSVVGHWV----KAVQ 566
Query: 540 PISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGIT-SVKITGFNSGTLDLSTYSWTYKIG 598
P+ + G NE+ LLS TVGLQN G F E GAG +K+TGF +G +DLS SWTY++G
Sbjct: 567 PVQFQKGYNELVLLSQTVGLQNYGAFLERDGAGFKGQIKLTGFKNGDIDLSNLSWTYQVG 626
Query: 599 LQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAW 658
L+GE L +Y+ G W TWYK P G +P+ LD+ MGKG AW
Sbjct: 627 LKGEFLKVYSTGDNEKFEWSELAVDATPSTFTWYKTFFDAPSGVDPVALDLGSMGKGQAW 686
Query: 659 LNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPS 718
+NG IGRYW SP D C CDYRG ++ KC T CG P+Q WYH+PR+W + S
Sbjct: 687 VNGHHIGRYW----TVVSPKDGC-GSCDYRGAYSSGKCRTNCGNPTQTWYHVPRAWLEAS 741
Query: 719 ENILVIFEEKGGDPTKITFSIR 740
N+LV+FEE GG+P +I+ +R
Sbjct: 742 NNLLVVFEETGGNPFEISVKLR 763
>gi|449452747|ref|XP_004144120.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 782
Score = 763 bits (1969), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/743 (51%), Positives = 494/743 (66%), Gaps = 31/743 (4%)
Query: 8 APFALLIFFSSSITYCFAG------NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPG 61
AP + + S + F+G +VTYD +++IING+R ++IS +IHYPRS P MWP
Sbjct: 58 APAFVFLDSVSGTHHSFSGLASASRSVTYDHKAIIINGQRRILISGSIHYPRSTPQMWPD 117
Query: 62 LVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVA 121
L+Q+AK+GG++ IE+YVFWNGHE SPGKYYF R++LV+FIK++QQA +Y+ LRIGP+V
Sbjct: 118 LIQKAKDGGLDIIETYVFWNGHEPSPGKYYFEERYDLVRFIKLVQQAGLYVHLRIGPYVC 177
Query: 122 AEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQV 181
AE+NYGG P+WL ++PG FR D PFK MQKF+ IVDMMK EKLF +QGGPIIL+Q+
Sbjct: 178 AEWNYGGFPLWLKFVPGIAFRTDNAPFKAAMQKFVYKIVDMMKWEKLFHTQGGPIILSQI 237
Query: 182 ENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFT 241
ENEYG E G GK Y WAA+MAV GVPW+MC+Q D PDP+I+TCN FYC+ F
Sbjct: 238 ENEYGPVEWEIGAPGKSYTKWAAQMAVGLKTGVPWVMCKQEDAPDPLIDTCNGFYCENFK 297
Query: 242 PHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG 301
P+ PKIWTENW GW+ FGG P+RP ED+AFSVARF Q GGS+ NYYMYHGGTNFG
Sbjct: 298 PNQIYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQNGGSLVNYYMYHGGTNFG 357
Query: 302 RTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQ 361
RT+ G F+TTSYD++APIDEYGL R PKWGHL++LH AIKLCE AL++ + ++ LG +Q
Sbjct: 358 RTS-GLFVTTSYDFDAPIDEYGLLREPKWGHLRDLHKAIKLCEPALVSADPTSTWLGKNQ 416
Query: 362 EADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVR-- 419
EA V+ SSGACAAFLAN D V F N Y LP WS+SILPDCK V FNT +++
Sbjct: 417 EARVFKSSSGACAAFLANYDTSAFVRVNFWNHPYDLPPWSISILPDCKTVTFNTGSLQIG 476
Query: 420 AQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEI-AGIWGEADFVKSGFVDHINTTKDT 478
+S +M P + W +KE A + + K G V+ ++ T DT
Sbjct: 477 VKSYEAKMTPIS-------------SFWWLSYKEEPASAYAQDTTTKDGLVEQVSVTWDT 523
Query: 479 TDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYK 538
TDYLWY SI ++ E FLK+G P+L + S GH LH F N +L GS G+ P +
Sbjct: 524 TDYLWYILSIRIDSTEGFLKSGQWPLLTVNSAGHILHVFINGQLSGSVYGSLEDPRITFS 583
Query: 539 NPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKI 597
++LK G N++++LS+TVGL N G ++ AG+ V + G N GT D+S Y W+YK+
Sbjct: 584 KYVNLKQGVNKLSMLSVTVGLPNVGLHFDTWNAGVLGPVTLKGLNEGTRDMSKYKWSYKV 643
Query: 598 GLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLA 657
GL+GE L +Y+ N++ W+ + QPLTWYK P G+EP+ LDM M KG
Sbjct: 644 GLRGEILNLYSVKGSNSVQWMKGSF--QKQPLTWYKTTFNTPAGNEPLALDMSSMSKGQI 701
Query: 658 WLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKP 717
W+NG IGRY+P + +C +C Y G F KC+ CG PSQ+WYHIPR W P
Sbjct: 702 WVNGRSIGRYFPGYIARG----KC-NKCSYTGFFTEKKCLWNCGGPSQKWYHIPRDWLSP 756
Query: 718 SENILVIFEEKGGDPTKITFSIR 740
+ N+L+I EE GG+P I+ R
Sbjct: 757 NGNLLIILEEIGGNPQGISLVKR 779
>gi|356556286|ref|XP_003546457.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 721
Score = 761 bits (1966), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/734 (50%), Positives = 480/734 (65%), Gaps = 22/734 (2%)
Query: 10 FALLIFFSSSITYC-FAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKE 68
F ++ S + C +VTYD ++++++G+R ++IS +IHYPRS P MWP L+Q+AK+
Sbjct: 6 FHGVVLMSLCLWVCGVTASVTYDHKAIVVDGKRRILISGSIHYPRSTPQMWPDLIQKAKD 65
Query: 69 GGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGG 128
GG++ I++YVFWNGHE SPG+YYF RF+LVKF+K++QQA +Y+ LRIGP++ AE+N+GG
Sbjct: 66 GGLDVIQTYVFWNGHEPSPGQYYFEDRFDLVKFVKLVQQAGLYVHLRIGPYICAEWNFGG 125
Query: 129 IPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY 188
PVWL Y+PG FR D EPFK MQKF IV +MK +LF SQGGPII++Q+ENEYG
Sbjct: 126 FPVWLKYVPGIAFRTDNEPFKAAMQKFTAKIVSLMKENRLFQSQGGPIIMSQIENEYGPV 185
Query: 189 ESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMP 248
E G GK Y WAA+MAV + GVPW+MC+Q D PDPVI+TCN +YC+ F P+ + P
Sbjct: 186 EWEIGAPGKAYTKWAAQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGYYCENFKPNKNTKP 245
Query: 249 KIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF 308
K+WTENW GW+ FGG P RP+ED+AFSVARF Q GGS NYYMYHGGTNFGRT+GG F
Sbjct: 246 KMWTENWTGWYTDFGGAVPRRPAEDLAFSVARFIQNGGSFVNYYMYHGGTNFGRTSGGLF 305
Query: 309 ITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD 368
I TSYDY+AP+DEYGL PK+ HL+ LH AIK CE AL+ + SLG + EA V++
Sbjct: 306 IATSYDYDAPLDEYGLQNEPKYEHLRNLHKAIKQCEPALVATDPKVQSLGYNLEAHVFS- 364
Query: 369 SSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMV 428
+ GACAAF+AN D K+ F N Y LP WS+SILPDCK VV+NTA V S +M
Sbjct: 365 TPGACAAFIANYDTKSYAKATFGNGQYDLPPWSISILPDCKTVVYNTAKV-GNSWLKKMT 423
Query: 429 PENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKS-GFVDHINTTKDTTDYLWYTTS 487
P N WQ + E +AD + + + +N T+D++DYLWY T
Sbjct: 424 PVN------------SAFAWQSYNEEPASSSQADSIAAYALWEQVNVTRDSSDYLWYMTD 471
Query: 488 IIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGK 547
+ +N NE FLKNG PVL S GH LH F N +L G+ G +P + + + L+ G
Sbjct: 472 VYINANEGFLKNGQSPVLTAMSAGHVLHVFINDQLAGTVWGGLANPKLTFSDNVKLRVGN 531
Query: 548 NEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGI 606
N+++LLS+ VGL N G +E AG+ V + G N GT DLS+ W+YK+GL+GE L +
Sbjct: 532 NKLSLLSVAVGLPNVGVHFETWNAGVLGPVTLKGLNEGTRDLSSQKWSYKVGLKGESLSL 591
Query: 607 YNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGR 666
+ +++ W+ K QPLTWYK P G++P+ LD+ MGKG W+NG IGR
Sbjct: 592 HTESGSSSVEWIRGSLVAKKQPLTWYKTTFSAPAGNDPLALDLGSMGKGEVWVNGRSIGR 651
Query: 667 YWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFE 726
+WP H C C+Y G + KC T CG+PSQRWYH+PRSW N LV+FE
Sbjct: 652 HWP----GYIAHGSC-NACNYAGFYTDTKCRTNCGQPSQRWYHVPRSWLSSGGNSLVVFE 706
Query: 727 EKGGDPTKITFSIR 740
E GGDP I R
Sbjct: 707 EWGGDPNGIALVKR 720
>gi|357124047|ref|XP_003563718.1| PREDICTED: beta-galactosidase 9-like isoform 1 [Brachypodium
distachyon]
Length = 719
Score = 761 bits (1966), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/714 (51%), Positives = 472/714 (66%), Gaps = 23/714 (3%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD ++++ING+R +++S +IHYPRS P MWP L+Q+AK+GG++ I++YVFWNGHE
Sbjct: 26 VSYDHKAIVINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQ 85
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G+YYFG R++LV+F+K+ +QA +Y+ LRIGP+V AE+N+GG PVWL Y+PG FR D P
Sbjct: 86 GQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGP 145
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK MQ F+ IV MMK E LF QGGPIILAQVENEYG ES G G K YA WAAKMA
Sbjct: 146 FKAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGGGAKPYANWAAKMA 205
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
VA GVPW+MC+Q D PDPVINTCN FYCD FTP+S P +WTE W GWF FGG P
Sbjct: 206 VATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNSNGKPNMWTEAWSGWFTAFGGAVP 265
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
HRP ED+AF+VARF QKGGS NYYMYHGGTNF RTAGGPFI TSYDY+APIDEYGL R
Sbjct: 266 HRPVEDLAFAVARFVQKGGSFVNYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQ 325
Query: 328 PKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKT 387
PKWGHL++LH AIK E A+++G+ + S+G+ ++A V+ S+GACAAFL+N +
Sbjct: 326 PKWGHLRDLHKAIKQAEPAMVSGDPTIQSIGNYEKAYVFKSSTGACAAFLSNYHTSSPAK 385
Query: 388 VVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLK 447
VV+ Y LPAWS+SILPDCK V+NTA V+ S+ +M P + G
Sbjct: 386 VVYNGRRYELPAWSISILPDCKTAVYNTATVKEPSAPAKMNP-------------AGGFS 432
Query: 448 WQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLI 507
WQ + E ++ F K G V+ ++ T D +D+LWYTT + ++ +E+FLK+G P L I
Sbjct: 433 WQSYSEDTNSLDDSAFTKDGLVEQLSMTWDKSDFLWYTTYVNIDSSEQFLKSGQWPQLTI 492
Query: 508 ESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYE 567
S GH L F N + G+ G P Y + + G N+I++LS VGL N G YE
Sbjct: 493 NSAGHTLQVFVNGQSYGAGYGGYDSPKLSYSKYVKMWQGSNKISILSSAVGLANQGTHYE 552
Query: 568 -WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKN 626
W + V ++G N G DLS WTY+IGL+GE LG+++ +++ W S
Sbjct: 553 NWNVGVLGPVTLSGLNQGKRDLSNQKWTYQIGLKGESLGVHSITGSSSVEWGSANGA--- 609
Query: 627 QPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECD 686
QPLTW+KA P G P+ LDM MGKG W+NG GRYW K+ S C
Sbjct: 610 QPLTWHKAYFSAPAGGAPVALDMGSMGKGQIWVNGRNAGRYWSYKASGS------CGSCS 663
Query: 687 YRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
Y G ++ KC T CG+ SQRWYH+PRSW PS N+LV+ EE GGD + + R
Sbjct: 664 YTGTYSETKCQTNCGDISQRWYHVPRSWLNPSGNLLVVLEEFGGDLSGVKLMTR 717
>gi|4538943|emb|CAB39679.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|7269465|emb|CAB79469.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 729
Score = 761 bits (1965), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/733 (51%), Positives = 484/733 (66%), Gaps = 23/733 (3%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
L I SS+ VTYD +++IING+R +++S +IHYPRS P MWP L+Q+AK+GG+
Sbjct: 13 LGILCCSSLICSVKAIVTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKDGGL 72
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ I++YVFWNGHE SPG+YYF R++LVKFIK++QQA +Y+ LRIGP+V AE+N+GG PV
Sbjct: 73 DVIQTYVFWNGHEPSPGQYYFEDRYDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPV 132
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL Y+PG VFR D EPFK MQKF IV MMK EKLF +QGGPIIL+Q+ENEYG E
Sbjct: 133 WLKYVPGMVFRTDNEPFKAAMQKFTEKIVRMMKEEKLFETQGGPIILSQIENEYGPIEWE 192
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
G GK Y W A+MA + GVPWIMC+Q D P+ +INTCN FYC+ F P+S + PK+W
Sbjct: 193 IGAPGKAYTKWVAEMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNSDNKPKMW 252
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TENW GWF FGG P+RP+EDIA SVARF Q GGS NYYMYHGGTNF RTA G FI T
Sbjct: 253 TENWTGWFTEFGGAVPYRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDRTA-GEFIAT 311
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
SYDY+AP+DEYGLPR PK+ HLK LH IKLCE AL++ + + SLG QEA V+ S
Sbjct: 312 SYDYDAPLDEYGLPREPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQEAHVFKSKS- 370
Query: 372 ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTV--EMVP 429
+CAAFL+N + + V+F +Y LP WSVSILPDCK +NTA V+ ++S++ +MVP
Sbjct: 371 SCAAFLSNYNTSSAARVLFGGSTYDLPPWSVSILPDCKTEYYNTAKVQVRTSSIHMKMVP 430
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSII 489
N S S + +EI F + G V+ I+ T+D TDY WY T I
Sbjct: 431 TNTPFSWGSYN-----------EEIPSANDNGTFSQDGLVEQISITRDKTDYFWYLTDIT 479
Query: 490 VNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNE 549
++ +E+FL G P+L I S GHALH F N +L G+A G+ P + I L AG N+
Sbjct: 480 ISPDEKFL-TGEDPLLTIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVNK 538
Query: 550 IALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYK-IGLQGEHLGIY 607
+ALLS GL N G YE W + V + G NSGT D++ + W+YK IG +GE L ++
Sbjct: 539 LALLSTAAGLPNVGVHYETWNTGVLGPVTLNGVNSGTWDMTKWKWSYKQIGTKGEALSVH 598
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+ + W K QPLTWYK+ P G+EP+ LDM MGKG W+NG+ IGR+
Sbjct: 599 TLAGSSTVEWKEGSLVAKKQPLTWYKSTFDSPTGNEPLALDMNTMGKGQMWINGQNIGRH 658
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
WP + + + C Y G F KC++ CGE SQRWYH+PRSW KP+ N++++ EE
Sbjct: 659 WPAYTARGK-----CERCSYAGTFTEKKCLSNCGEASQRWYHVPRSWLKPTNNLVIVLEE 713
Query: 728 KGGDPTKITFSIR 740
GG+P I+ R
Sbjct: 714 WGGEPNGISLVKR 726
>gi|414865886|tpg|DAA44443.1| TPA: hypothetical protein ZEAMMB73_968467 [Zea mays]
Length = 830
Score = 759 bits (1960), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/723 (52%), Positives = 486/723 (67%), Gaps = 37/723 (5%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A NVTYD R+L+I+G R +++S +IHYPRS P MWPGL+Q+AK+GG++ IE+YVFW+ HE
Sbjct: 27 AANVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKAKDGGLDVIETYVFWDIHE 86
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
G+Y F GR +L F+K + A +Y+ LRIGP+V AE+NYGG P+WLH+IPG FR D
Sbjct: 87 PVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTD 146
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK MQ+F A++ENEYG +S YG GK Y WAA
Sbjct: 147 NEPFKAEMQRFT----------------------AKIENEYGNIDSAYGAPGKAYMRWAA 184
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MAV+ + GVPW+MCQQ D PDP+INTCN FYCDQFTP+S + PK+WTENW GWF +FGG
Sbjct: 185 GMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGWFLSFGG 244
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
P+RP ED+AF+VARF+Q+GG+ NYYMYHGGTN R++GGPFI TSYDY+APIDEYGL
Sbjct: 245 AVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAPIDEYGL 304
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
R PKWGHL+++H AIKLCE AL+ + S SLG + EA VY S CAAFLAN+D ++
Sbjct: 305 VRQPKWGHLRDVHKAIKLCEPALIATDPSYTSLGPNVEAAVYKVGS-VCAAFLANIDGQS 363
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG-- 442
DKTV F Y LPAWSVSILPDCK VV NTA + +Q++ EM L+ S + D
Sbjct: 364 DKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQINSQTTGSEM--RYLESSNVASDGSFV 421
Query: 443 ---SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKN 499
W E GI + K+G ++ INTT D +D+LWY+TSI V +E +L N
Sbjct: 422 TPELAVSDWSYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITVKGDEPYL-N 480
Query: 500 GSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGL 559
GS+ L + S GH L + N ++ GSA G+ + ++ PI L GKN+I LLS TVGL
Sbjct: 481 GSQSNLAVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSATVGL 540
Query: 560 QNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV 618
N G F++ VGAGIT VK++G N G LDLS+ WTY+IGL+GE L +Y+P + WV
Sbjct: 541 SNYGAFFDLVGAGITGPVKLSGLN-GALDLSSAEWTYQIGLRGEDLHLYDPS-EASPEWV 598
Query: 619 STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPH 678
S P N PL WYK P GD+P+ +D MGKG AW+NG+ IGRYWP +P
Sbjct: 599 SANAYPINHPLIWYKTKFTPPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWP---TNLAPQ 655
Query: 679 DECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFS 738
CV C+YRG ++ KC+ CG+PSQ YH+PRS+ +P N LV+FE GGDP+KI+F
Sbjct: 656 SGCVNSCNYRGAYSSSKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEHFGGDPSKISFV 715
Query: 739 IRK 741
+R+
Sbjct: 716 MRQ 718
>gi|357124049|ref|XP_003563719.1| PREDICTED: beta-galactosidase 9-like isoform 2 [Brachypodium
distachyon]
Length = 721
Score = 756 bits (1952), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/714 (51%), Positives = 471/714 (65%), Gaps = 21/714 (2%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD ++++ING+R +++S +IHYPRS P MWP L+Q+AK+GG++ I++YVFWNGHE
Sbjct: 26 VSYDHKAIVINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQ 85
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G+YYFG R++LV+F+K+ +QA +Y+ LRIGP+V AE+N+GG PVWL Y+PG FR D P
Sbjct: 86 GQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGP 145
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK MQ F+ IV MMK E LF QGGPIILAQVENEYG ES G G K YA WAAKMA
Sbjct: 146 FKAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGGGAKPYANWAAKMA 205
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
VA GVPW+MC+Q D PDPVINTCN FYCD FTP+S P +WTE W GWF FGG P
Sbjct: 206 VATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNSNGKPNMWTEAWSGWFTAFGGAVP 265
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
HRP ED+AF+VARF QKGGS NYYMYHGGTNF RTAGGPFI TSYDY+APIDEYGL R
Sbjct: 266 HRPVEDLAFAVARFVQKGGSFVNYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQ 325
Query: 328 PKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKT 387
PKWGHL++LH AIK E A+++G+ + S+G+ ++A V+ S+GACAAFL+N +
Sbjct: 326 PKWGHLRDLHKAIKQAEPAMVSGDPTIQSIGNYEKAYVFKSSTGACAAFLSNYHTSSPAK 385
Query: 388 VVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLK 447
VV+ Y LPAWS+SILPDCK V+NTA VR + ++ N + G
Sbjct: 386 VVYNGRRYELPAWSISILPDCKTAVYNTATVRQKWKEKKLWM-----------NPAGGFS 434
Query: 448 WQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLI 507
WQ + E ++ F K G V+ ++ T D +D+LWYTT + ++ +E+FLK+G P L I
Sbjct: 435 WQSYSEDTNSLDDSAFTKDGLVEQLSMTWDKSDFLWYTTYVNIDSSEQFLKSGQWPQLTI 494
Query: 508 ESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYE 567
S GH L F N + G+ G P Y + + G N+I++LS VGL N G YE
Sbjct: 495 NSAGHTLQVFVNGQSYGAGYGGYDSPKLSYSKYVKMWQGSNKISILSSAVGLANQGTHYE 554
Query: 568 -WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKN 626
W + V ++G N G DLS WTY+IGL+GE LG+++ +++ W S
Sbjct: 555 NWNVGVLGPVTLSGLNQGKRDLSNQKWTYQIGLKGESLGVHSITGSSSVEWGSANGA--- 611
Query: 627 QPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECD 686
QPLTW+KA P G P+ LDM MGKG W+NG GRYW K+ S C
Sbjct: 612 QPLTWHKAYFSAPAGGAPVALDMGSMGKGQIWVNGRNAGRYWSYKASGS------CGSCS 665
Query: 687 YRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
Y G ++ KC T CG+ SQRWYH+PRSW PS N+LV+ EE GGD + + R
Sbjct: 666 YTGTYSETKCQTNCGDISQRWYHVPRSWLNPSGNLLVVLEEFGGDLSGVKLMTR 719
>gi|357139090|ref|XP_003571118.1| PREDICTED: beta-galactosidase 4-like [Brachypodium distachyon]
Length = 787
Score = 755 bits (1950), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/722 (51%), Positives = 476/722 (65%), Gaps = 26/722 (3%)
Query: 23 CFA---GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVF 79
CFA V+YD RSL+INGRR ++IS +IHYPRS P MWPGL+Q+AK+GG++ +++YVF
Sbjct: 86 CFAVANAAVSYDHRSLVINGRRRILISGSIHYPRSTPEMWPGLIQKAKDGGLDVVQTYVF 145
Query: 80 WNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGT 139
WNGHE G+YYF R++L++F+K+++QA +Y+ LRIGP+V AE+N+GG PVWL Y+PG
Sbjct: 146 WNGHEPVKGQYYFSDRYDLIRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGI 205
Query: 140 VFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRY 199
FR D PFK MQ+F+ IV MMK E+LF QGGPII++QVENE+G ES G G K Y
Sbjct: 206 SFRTDNGPFKAEMQRFVEKIVSMMKSERLFEWQGGPIIMSQVENEFGPMESAGGVGAKPY 265
Query: 200 ALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWF 259
A WAAKMAVA N GVPW+MC+Q D PDPVINTCN FYCD FTP+ + P +WTE W GWF
Sbjct: 266 ANWAAKMAVATNTGVPWVMCKQEDAPDPVINTCNGFYCDYFTPNKKNKPAMWTEAWTGWF 325
Query: 260 KTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPI 319
+FGG PHRP ED+AF+VARF QKGGS NYYMYHGGTNFGRTAGGPF+ TSYDY+API
Sbjct: 326 TSFGGAVPHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVATSYDYDAPI 385
Query: 320 DEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLAN 379
DE+GL R PKWGHL++LH AIK E L++G+ + SLG+ ++A V+ +GACAAFL+N
Sbjct: 386 DEFGLLRQPKWGHLRDLHKAIKQAEPTLVSGDPTIQSLGNYEKAYVFKSKNGACAAFLSN 445
Query: 380 MDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASP 439
+ V F Y LPAWS+SILPDCK VVFNTA V+ + +M P
Sbjct: 446 YHMNSAVKVRFNGRHYDLPAWSISILPDCKTVVFNTATVKEPTLLPKMHP---------- 495
Query: 440 DNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKN 499
WQ + E ++ F K G V+ ++ T D +DYLWYTT + + E KN
Sbjct: 496 ---VVRFTWQSYSEDTNSLDDSAFTKDGLVEQLSMTWDKSDYLWYTTFVNIGPG-ELSKN 551
Query: 500 GSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGL 559
G P L + S GH++ F N + GS G +P Y + + G N+I++LS VGL
Sbjct: 552 GQWPQLTVYSAGHSMQVFVNGKSYGSVYGGFENPKLTYDGHVKMWQGSNKISILSSAVGL 611
Query: 560 QNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV 618
N G +E G+ V ++G + G DLS WTY++GL+GE LGI+ + + W
Sbjct: 612 PNVGDHFERWNVGVLGPVTLSGLSEGKRDLSHQKWTYQVGLKGESLGIHTVSGSSAVEWG 671
Query: 619 STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPH 678
P QPLTW+KA+ P G +P+ LDM MGKG W+NG +GRYW K +P
Sbjct: 672 G---PGSKQPLTWHKALFNAPSGSDPVALDMGSMGKGQMWVNGHHVGRYWSYK----APS 724
Query: 679 DECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFS 738
C C Y G + DKC + CGE SQRWYH+PRSW KP N+LV+ EE GGD +T +
Sbjct: 725 RGC-GGCSYAGTYREDKCRSSCGELSQRWYHVPRSWLKPGGNLLVVLEEYGGDVAGVTLA 783
Query: 739 IR 740
R
Sbjct: 784 TR 785
>gi|326497687|dbj|BAK05933.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 716
Score = 755 bits (1949), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/716 (51%), Positives = 475/716 (66%), Gaps = 29/716 (4%)
Query: 29 TYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPG 88
+YD R+++ING+R +++S +IHYPRS P MWP L+Q+AK+GG++ I++YVFWNGHE + G
Sbjct: 24 SYDHRAVVINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPARG 83
Query: 89 KYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPF 148
+Y+F R++LV+F+K+ +QA +Y+ LRIGP+V AE+N+GG PVWL Y+PG FR D PF
Sbjct: 84 QYHFADRYDLVRFVKLARQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGPF 143
Query: 149 KYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAV 208
K MQ+F+ IV MMK E LF QGGPIILAQVENEYG ES G G K YA WAA MAV
Sbjct: 144 KAEMQRFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESAMGAGAKPYANWAANMAV 203
Query: 209 AQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPH 268
A + GVPW+MC+Q D PDPVINTCN FYCD FTP+S S P +WTE W GWF FGG PH
Sbjct: 204 ATDAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNSNSKPTMWTEAWTGWFTAFGGPVPH 263
Query: 269 RPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNP 328
RP ED+AF+VARF QKGGS NYYMYHGGTNF RTAGGPFI TSYDY+APIDEYGL R P
Sbjct: 264 RPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLIRQP 323
Query: 329 KWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTV 388
KWGHL++LH AIK E AL++G+ + +G+ ++A V+ S+GACAAFL+N + +
Sbjct: 324 KWGHLRDLHKAIKQAEPALVSGDPTIQRIGNYEKAYVFKSSTGACAAFLSNYHTSSAARI 383
Query: 389 VFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKW 448
V+ Y LPAWS+SILPDCK VFNTA V+ ++ +M P + G W
Sbjct: 384 VYNGRRYDLPAWSISILPDCKTAVFNTATVKEPTAPAKMNP-------------AGGFAW 430
Query: 449 QVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIE 508
Q + E + F K G V+ ++ T D +DYLWYTT + ++ +E+FLK G P L I
Sbjct: 431 QSYSEDTNALDSSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIDSSEQFLKTGQWPQLTIN 490
Query: 509 SKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEW 568
S GH++ F N + G A G P Y P+ + G N+I++LS +GL N G YE
Sbjct: 491 SAGHSVQVFVNGQSFGVAYGGYNSPKLTYSKPVKMWQGSNKISILSSAMGLPNQGTHYEA 550
Query: 569 VGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKN- 626
G+ V ++G N G DLS WTY+IGL+GE LG+ N+I+ S++E
Sbjct: 551 WNVGVLGPVTLSGLNQGKRDLSNQKWTYQIGLKGESLGV------NSISGSSSVEWSSAS 604
Query: 627 --QPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
QPLTW+KA P G P+ LDM MGKG W+NG GRYW ++ S
Sbjct: 605 GAQPLTWHKAYFAAPAGSAPVALDMGSMGKGQIWVNGNNAGRYWSYRASGS------CGG 658
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
C Y G F+ KC T CG+ SQRWYH+PRSW KPS N+LV+ EE GGD + +T R
Sbjct: 659 CSYAGTFSEAKCQTNCGDISQRWYHVPRSWLKPSGNLLVVLEEFGGDLSGVTLMTR 714
>gi|242064502|ref|XP_002453540.1| hypothetical protein SORBIDRAFT_04g007660 [Sorghum bicolor]
gi|241933371|gb|EES06516.1| hypothetical protein SORBIDRAFT_04g007660 [Sorghum bicolor]
Length = 740
Score = 755 bits (1949), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/713 (50%), Positives = 472/713 (66%), Gaps = 23/713 (3%)
Query: 30 YDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGK 89
YD RSL+INGRR ++IS +IHYPRS P MWPGL+Q+AK+GG++ I++YVFWNGHE G+
Sbjct: 47 YDHRSLVINGRRRILISGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHEPVQGQ 106
Query: 90 YYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFK 149
Y+F R++LV+F+K+++QA +Y+ LRIGP+V AE+N+GG PVWL Y+PG FR D PFK
Sbjct: 107 YHFADRYDLVRFVKLVRQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIRFRTDNGPFK 166
Query: 150 YHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVA 209
MQKF+ IV MMK E LF QGGPII+AQVENE+G ES G G K YA WAA+MAV
Sbjct: 167 AAMQKFVEKIVSMMKSEGLFEWQGGPIIMAQVENEFGPMESVVGSGAKPYAHWAAQMAVG 226
Query: 210 QNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHR 269
N GVPW+MC+Q D PDPVINTCN FYCD FTP+ P +WTE W GWF FGG PHR
Sbjct: 227 TNTGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNRKYKPTMWTEAWTGWFTKFGGALPHR 286
Query: 270 PSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPK 329
P ED+AF+VARF QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+APIDE+GL R PK
Sbjct: 287 PVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLLRQPK 346
Query: 330 WGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVV 389
WGHL++LH AIK E AL++G+ + S+G+ ++A ++ +GACAAFL+N K +
Sbjct: 347 WGHLRDLHRAIKQAEPALISGDPTIQSIGNYEKAYIFKSKNGACAAFLSNYHMKTAVKIR 406
Query: 390 FRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQ 449
F Y LPAWS+SILPDCK VFNTA V+ +P+ N WQ
Sbjct: 407 FDGRHYDLPAWSISILPDCKTAVFNTATVK-------------EPTLLPKMNPVLHFAWQ 453
Query: 450 VFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIES 509
+ E ++ F ++G V+ ++ T D +DYLWYTT + + NE+FLK+G P L + S
Sbjct: 454 SYSEDTNSLDDSAFTRNGLVEQLSLTWDKSDYLWYTTHVSIGGNEQFLKSGQWPQLTVYS 513
Query: 510 KGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWV 569
GH++ F N GS G +P + + + G N+I++LS VGL N G +E
Sbjct: 514 AGHSMQVFVNGRSYGSVYGGYDNPKLTFNGHVKMWQGSNKISILSSAVGLPNNGNHFELW 573
Query: 570 GAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQP 628
G+ V ++G N G DLS WTY++GL+GE LG++ + + W P QP
Sbjct: 574 NVGVLGPVTLSGLNEGKRDLSHQKWTYQVGLKGESLGLHTVTGSSAVEWAG---PGGKQP 630
Query: 629 LTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYR 688
LTW+KA+ P G +P+ LDM MGKG W+NG GRYW ++ S + C Y
Sbjct: 631 LTWHKALFNAPAGSDPVALDMGSMGKGQIWVNGHHAGRYWSYRAYSGS-----CRRCSYA 685
Query: 689 GKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE-KGGDPTKITFSIR 740
G + D+C++ CG+ SQRWYH+PRSW KPS N+LV+ EE GGD +T + R
Sbjct: 686 GTYREDQCLSNCGDISQRWYHVPRSWLKPSGNLLVVLEEYGGGDLAGVTLATR 738
>gi|168045621|ref|XP_001775275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673356|gb|EDQ59880.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 916
Score = 753 bits (1944), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/738 (51%), Positives = 495/738 (67%), Gaps = 35/738 (4%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NVTYD R+++I+G R ++ISA IHYPR+ P MWP ++Q AK+GG + +++YVFWNGHE
Sbjct: 31 NVTYDQRAVLIDGERRMLISAGIHYPRATPEMWPSIIQHAKDGGADVVQTYVFWNGHEPE 90
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+Y F GR++LVKFIK+++QA +Y LRIGP+V AE+N+GG P WL IPG VFR D E
Sbjct: 91 QGQYNFEGRYDLVKFIKLVKQAGLYFHLRIGPYVCAEWNFGGFPYWLKEIPGIVFRTDNE 150
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK MQ F + IV++MK +LF+ QGGPII+AQ+ENEYG ES +G+GGKRY WAA M
Sbjct: 151 PFKVAMQGFTSKIVNLMKENELFSWQGGPIIMAQIENEYGDIESQFGDGGKRYVQWAADM 210
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
A++ + VPWIMC+Q D P +INTCN FYCD + P++ P +WTE+W GWF+ +G
Sbjct: 211 ALSLDTRVPWIMCKQEDAPANIINTCNGFYCDGWKPNTALKPILWTEDWNGWFQNWGQAA 270
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
PHRP ED AF+VARFFQ+GGS NYYMY GGTNF RTAGGPF+TT+YDY+APIDEYGL R
Sbjct: 271 PHRPVEDNAFAVARFFQRGGSFQNYYMYFGGTNFARTAGGPFMTTTYDYDAPIDEYGLIR 330
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLS--LGSSQEADVYADSSGACAAFLANMDDKN 384
PKWGHLK+LH AIKLCE AL + S +GS+QEA Y+ ++G CAAFLAN+D +N
Sbjct: 331 QPKWGHLKDLHAAIKLCEPALTAVDTVPQSTWIGSNQEAHEYS-ANGHCAAFLANIDSEN 389
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEM------------VPENL 432
TV F+ SY LPAWSVSILPDCK V FNTA + AQ++ M +P N
Sbjct: 390 SVTVQFQGESYVLPAWSVSILPDCKNVAFNTAQIGAQTTVTRMRIAPSNSRGDIFLPSNT 449
Query: 433 QPSEASPDNGS-KGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI-IV 490
+ D G LKWQ E GI G V + ++ +N TKDT+DYLWY+TSI I
Sbjct: 450 LVHDHISDGGVFANLKWQASAEPFGIRGSGTTVSNSLLEQLNITKDTSDYLWYSTSITIT 509
Query: 491 NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
+E +G+ L++ + A+H F N +L GSA G + PI+LK GKN I
Sbjct: 510 SEGVTSDVSGTEANLVLGTMRDAVHIFVNGKLAGSAMGWN----IQVVQPITLKDGKNSI 565
Query: 551 ALLSMTVGLQNAGPFYEWVGAGIT-SVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNP 609
LLSMT+GLQN G + E GAGI SV +TG G L LST W+Y++GL+GE L +++
Sbjct: 566 DLLSMTLGLQNYGAYLETWGAGIRGSVSVTGLPYGNLSLSTAEWSYQVGLRGEELKLFHN 625
Query: 610 GYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWP 669
G + +W S+ + LTWYK P G +P+ LD+ MGKG AW+NG +GRY+
Sbjct: 626 GTADGFSWDSSSFTNASY-LTWYKTTFDAPGGTDPVALDLGSMGKGQAWINGHHLGRYF- 683
Query: 670 RKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRW-------YHIPRSWFKPSENIL 722
+P C + CDYRG +N +KC T CGEPSQRW YHIPR+W + + N+L
Sbjct: 684 ---LMVAPQSGC-ETCDYRGAYNTNKCRTNCGEPSQRWQVIHFQMYHIPRAWLQATGNLL 739
Query: 723 VIFEEKGGDPTKITFSIR 740
V+FEE GGD +K++ R
Sbjct: 740 VLFEEIGGDISKVSVVTR 757
>gi|193850557|gb|ACF22882.1| beta-galactosidase [Glycine max]
Length = 721
Score = 753 bits (1943), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/716 (50%), Positives = 471/716 (65%), Gaps = 21/716 (2%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+VTYD ++++++G+R ++IS +IHYPRS P MWP L+Q+AK+GG++ I++YVFWNGHE S
Sbjct: 24 SVTYDHKAIVVDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIQTYVFWNGHEPS 83
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PG+YYF RF+LVKF+K+ QQA +Y+ LRIGP++ AE+N GG PVWL Y+PG FR D E
Sbjct: 84 PGQYYFEDRFDLVKFVKLAQQAGLYVHLRIGPYICAEWNLGGFPVWLKYVPGIAFRTDNE 143
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK MQKF IV +MK +LF SQGGPIIL+Q+ENEYG E G GK Y WAA+M
Sbjct: 144 PFKAAMQKFTAKIVSLMKENRLFQSQGGPIILSQIENEYGPVEWEIGAPGKAYTKWAAQM 203
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
AV + GVPW+MC+Q D PDPVI+TCN FYC+ F P+ + PK+WTENW GW+ FGG
Sbjct: 204 AVGLDTGVPWVMCKQEDAPDPVIDTCNGFYCENFKPNKNTKPKMWTENWTGWYTDFGGAV 263
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
P RP+ED+AFSVARF Q GGS NYYMYHGGTNFGRT+GG FI TSYDY+AP+DEYGL
Sbjct: 264 PRRPAEDLAFSVARFIQNGGSFVNYYMYHGGTNFGRTSGGLFIATSYDYDAPLDEYGLEN 323
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDK 386
PK+ HL+ LH AIK E AL+ + SLG + EA V++ + GACAAF+AN D K+
Sbjct: 324 EPKYEHLRALHKAIKQSEPALVATDPKVQSLGYNLEAHVFS-APGACAAFIANYDTKSYA 382
Query: 387 TVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL 446
F N Y LP WS+SILPDCK VV+NTA V +M P N
Sbjct: 383 KAKFGNGQYDLPPWSISILPDCKTVVYNTAKV-GYGWLKKMTPVN------------SAF 429
Query: 447 KWQVFKEIAGIWGEADFVKS-GFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVL 505
WQ + E +AD + + + +N T+D++DYLWY T + VN NE FLKNG P+L
Sbjct: 430 AWQSYNEEPASSSQADSIAAYALWEQVNVTRDSSDYLWYMTDVNVNANEGFLKNGQSPLL 489
Query: 506 LIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPF 565
+ S GH LH F N +L G+ G +P + + + L+AG N+++LLS+ VGL N G
Sbjct: 490 TVMSAGHVLHVFINGQLAGTVWGGLGNPKLTFSDNVKLRAGNNKLSLLSVAVGLPNVGVH 549
Query: 566 YEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPP 624
+E AG+ V + G N GT DLS W+YK+GL+GE L ++ +++ W+
Sbjct: 550 FETWNAGVLGPVTLKGLNEGTRDLSRQKWSYKVGLKGESLSLHTESGSSSVEWIQGSLVA 609
Query: 625 KNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
K QPLTWYK P G++P+ LD+ MGKG W+NG IGR+WP H C
Sbjct: 610 KKQPLTWYKTTFSAPAGNDPLALDLGSMGKGEVWVNGRSIGRHWP----GYIAHGSC-NA 664
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
C+Y G + KC T CG+PSQRWYH+PRSW N LV+FEE GGDP I R
Sbjct: 665 CNYAGYYTDTKCRTNCGQPSQRWYHVPRSWLSSGGNSLVVFEEWGGDPNGIALVKR 720
>gi|18403090|ref|NP_565755.1| beta galactosidase 9 [Arabidopsis thaliana]
gi|75265632|sp|Q9SCV3.1|BGAL9_ARATH RecName: Full=Beta-galactosidase 9; Short=Lactase 9; Flags:
Precursor
gi|6686890|emb|CAB64745.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|20197062|gb|AAC04500.2| putative beta-galactosidase [Arabidopsis thaliana]
gi|330253650|gb|AEC08744.1| beta galactosidase 9 [Arabidopsis thaliana]
Length = 887
Score = 751 bits (1939), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/746 (50%), Positives = 489/746 (65%), Gaps = 29/746 (3%)
Query: 10 FALLIFFS-SSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKE 68
ALL++F S +Y NV+YD R+LII G+R +++SA IHYPR+ P MW L+ ++KE
Sbjct: 19 IALLVYFPILSGSYFKPFNVSYDHRALIIAGKRRMLVSAGIHYPRATPEMWSDLIAKSKE 78
Query: 69 GGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGG 128
GG + +++YVFWNGHE G+Y F GR++LVKF+K+I + +Y+ LRIGP+V AE+N+GG
Sbjct: 79 GGADVVQTYVFWNGHEPVKGQYNFEGRYDLVKFVKLIGSSGLYLHLRIGPYVCAEWNFGG 138
Query: 129 IPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY 188
PVWL IPG FR D EPFK MQKF+T IVD+M+ KLF QGGPII+ Q+ENEYG
Sbjct: 139 FPVWLRDIPGIEFRTDNEPFKKEMQKFVTKIVDLMREAKLFCWQGGPIIMLQIENEYGDV 198
Query: 189 ESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMP 248
E YG+ GK Y WAA MA+ GVPW+MC+Q D P+ +I+ CN +YCD F P+S + P
Sbjct: 199 EKSYGQKGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGFKPNSRTKP 258
Query: 249 KIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF 308
+WTE+W GW+ +GG PHRP+ED+AF+VARF+Q+GGS NYYMY GGTNFGRT+GGPF
Sbjct: 259 VLWTEDWDGWYTKWGGSLPHRPAEDLAFAVARFYQRGGSFQNYYMYFGGTNFGRTSGGPF 318
Query: 309 ITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNL-SLGSSQEADVY- 366
TSYDY+AP+DEYGL PKWGHLK+LH AIKLCE AL+ + LGS QEA +Y
Sbjct: 319 YITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAADAPQYRKLGSKQEAHIYH 378
Query: 367 --ADSSG-ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSS 423
++ G CAAFLAN+D+ V F SY LP WSVSILPDC+ V FNTA V AQ+S
Sbjct: 379 GDGETGGKVCAAFLANIDEHKSAHVKFNGQSYTLPPWSVSILPDCRHVAFNTAKVGAQTS 438
Query: 424 TVEMVPENLQPSEAS---------PDNGSKGLK-WQVFKEIAGIWGEADFVKSGFVDHIN 473
+ E+ +PS S DN S K W KE GIWGE +F G ++H+N
Sbjct: 439 VKTV--ESARPSLGSMSILQKVVRQDNVSYISKSWMALKEPIGIWGENNFTFQGLLEHLN 496
Query: 474 TTKDTTDYLWYTTSIIVNENEE--FLKNGSRPVLLIESKGHALHAFANQELQGSASGNGT 531
TKD +DYLW+ T I V+E++ + KNG + I+S L F N++L GS G+
Sbjct: 497 VTKDRSDYLWHKTRISVSEDDISFWKKNGPNSTVSIDSMRDVLRVFVNKQLAGSIVGHWV 556
Query: 532 HPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGIT-SVKITGFNSGTLDLST 590
K P+ G N++ LL+ TVGLQN G F E GAG K+TGF +G LDLS
Sbjct: 557 ----KAVQPVRFIQGNNDLLLLTQTVGLQNYGAFLEKDGAGFRGKAKLTGFKNGDLDLSK 612
Query: 591 YSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDML 650
SWTY++GL+GE IY + W + WYK P G +P+ L++
Sbjct: 613 SSWTYQVGLKGEADKIYTVEHNEKAEWSTLETDASPSIFMWYKTYFDPPAGTDPVVLNLE 672
Query: 651 KMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHI 710
MG+G AW+NG+ IGRYW S+K D C + CDYRG +N DKC T CG+P+Q YH+
Sbjct: 673 SMGRGQAWVNGQHIGRYWNIISQK----DGCDRTCDYRGAYNSDKCTTNCGKPTQTRYHV 728
Query: 711 PRSWFKPSENILVIFEEKGGDPTKIT 736
PRSW KPS N+LV+FEE GG+P KI+
Sbjct: 729 PRSWLKPSSNLLVLFEETGGNPFKIS 754
>gi|115468642|ref|NP_001057920.1| Os06g0573600 [Oryza sativa Japonica Group]
gi|75112285|sp|Q5Z7L0.1|BGAL9_ORYSJ RecName: Full=Beta-galactosidase 9; Short=Lactase 9; Flags:
Precursor
gi|54291174|dbj|BAD61846.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113595960|dbj|BAF19834.1| Os06g0573600 [Oryza sativa Japonica Group]
Length = 715
Score = 751 bits (1938), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/713 (51%), Positives = 465/713 (65%), Gaps = 23/713 (3%)
Query: 29 TYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPG 88
TYD RSL ING+R ++IS +IHYPRS P MWP L+Q+AK+GG++ I++YVFWNGHE G
Sbjct: 23 TYDHRSLTINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQG 82
Query: 89 KYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPF 148
+YYF R++LV+F+K+++QA +Y+ LRIGP+V AE+NYGG PVWL Y+PG FR D PF
Sbjct: 83 QYYFSDRYDLVRFVKLVKQAGLYVNLRIGPYVCAEWNYGGFPVWLKYVPGISFRTDNGPF 142
Query: 149 KYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAV 208
K MQ F+ IV MMK E LF QGGPIILAQVENEYG ES G G K Y WAAKMAV
Sbjct: 143 KAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGSGAKSYVDWAAKMAV 202
Query: 209 AQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPH 268
A N GVPWIMC+Q D PDPVINTCN FYCD FTP+S + P +WTE W GWF FGG P
Sbjct: 203 ATNAGVPWIMCKQDDAPDPVINTCNGFYCDDFTPNSKNKPSMWTEAWSGWFTAFGGTVPQ 262
Query: 269 RPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNP 328
RP ED+AF+VARF QKGGS NYYMYHGGTNF RTAGGPFI TSYDY+APIDEYGL R P
Sbjct: 263 RPVEDLAFAVARFIQKGGSFINYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQP 322
Query: 329 KWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTV 388
KWGHL LH AIK E AL+ G+ + ++G+ ++A V+ SSG CAAFL+N V
Sbjct: 323 KWGHLTNLHKAIKQAETALVAGDPTVQNIGNYEKAYVFRSSSGDCAAFLSNFHTSAAARV 382
Query: 389 VFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKW 448
F Y LPAWS+S+LPDC+ V+NTA V A SS +M P + G W
Sbjct: 383 AFNGRRYDLPAWSISVLPDCRTAVYNTATVTAASSPAKMNP-------------AGGFTW 429
Query: 449 QVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIE 508
Q + E E F K G V+ ++ T D +DYLWYTT + ++ E+FLK+G P L +
Sbjct: 430 QSYGEATNSLDETAFTKDGLVEQLSMTWDKSDYLWYTTYVNIDSGEQFLKSGQWPQLTVY 489
Query: 509 SKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEW 568
S GH++ F N + G+A G P Y + + G N+I++LS VGL N G YE
Sbjct: 490 SAGHSVQVFVNGQYFGNAYGGYDGPKLTYSGYVKMWQGSNKISILSSAVGLPNVGTHYET 549
Query: 569 VGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQ 627
G+ V ++G N G DLS WTY+IGL+GE LG+++ +++ W Q
Sbjct: 550 WNIGVLGPVTLSGLNEGKRDLSKQKWTYQIGLKGEKLGVHSVSGSSSVEWGGA---AGKQ 606
Query: 628 PLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDY 687
P+TW++A P G P+ LD+ MGKG AW+NG IGRYW K+ + C Y
Sbjct: 607 PVTWHRAYFNAPAGGAPVALDLGSMGKGQAWVNGHLIGRYWSYKASGN------CGGCSY 660
Query: 688 RGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
G ++ KC CG+ SQRWYH+PRSW PS N++V+ EE GGD + +T R
Sbjct: 661 AGTYSEKKCQANCGDASQRWYHVPRSWLNPSGNLVVLLEEFGGDLSGVTLMTR 713
>gi|225433463|ref|XP_002263385.1| PREDICTED: beta-galactosidase 9-like [Vitis vinifera]
Length = 882
Score = 751 bits (1938), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/758 (50%), Positives = 489/758 (64%), Gaps = 34/758 (4%)
Query: 8 APFALLIFFSSSITY--CFAG-NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQ 64
A FA L+ FS +I FA NV+YD R+L+I+G+R +++SA IHYPR+ P MWP L+
Sbjct: 6 ALFAALLCFSLTIQLGVSFAPFNVSYDHRALLIDGKRRMLVSAGIHYPRATPEMWPDLIA 65
Query: 65 QAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEY 124
++KEGG + I++YVFWNGHE +Y F GR+++VKF+K++ + +Y+ LRIGP+V AE+
Sbjct: 66 KSKEGGADVIQTYVFWNGHEPVRRQYNFEGRYDIVKFVKLVGSSGLYLHLRIGPYVCAEW 125
Query: 125 NYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENE 184
N+GG PVWL IPG FR D PFK MQ+F+ IVD+M++E LF+ QGGPII+ Q+ENE
Sbjct: 126 NFGGFPVWLRDIPGIEFRTDNAPFKDEMQRFVKKIVDLMQKEMLFSWQGGPIIMLQIENE 185
Query: 185 YGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHS 244
YG ES +G+ GK Y WAA+MA+ + GVPW+MCQQ D PD +IN CN FYCD F P+S
Sbjct: 186 YGNVESSFGQRGKDYVKWAARMALELDAGVPWVMCQQADAPDIIINACNGFYCDAFWPNS 245
Query: 245 PSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTA 304
+ PK+WTE+W GWF ++GGR P RP EDIAF+VARFFQ+GGS HNYYMY GGTNFGR++
Sbjct: 246 ANKPKLWTEDWNGWFASWGGRTPKRPVEDIAFAVARFFQRGGSFHNYYMYFGGTNFGRSS 305
Query: 305 GGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSN-LSLGSSQEA 363
GGPF TSYDY+APIDEYGL PKWGHLKELH AIKLCE AL+ + + LG QEA
Sbjct: 306 GGPFYVTSYDYDAPIDEYGLLSQPKWGHLKELHAAIKLCEPALVAVDSPQYIKLGPMQEA 365
Query: 364 DVY----------ADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVF 413
VY + + +C+AFLAN+D+ +V F Y LP WSVSILPDC+ VF
Sbjct: 366 HVYRVKESLYSTQSGNGSSCSAFLANIDEHKTASVTFLGQIYKLPPWSVSILPDCRTTVF 425
Query: 414 NTANVRAQSS--TVEM---VPENL---QPSEASPDNGSKGLKWQVFKEIAGIWGEADFVK 465
NTA V AQ+S TVE + N+ QP W KE +W E +F
Sbjct: 426 NTAKVGAQTSIKTVEFDLPLVRNISVTQPLMVQNKISYVPKTWMTLKEPISVWSENNFTI 485
Query: 466 SGFVDHINTTKDTTDYLWYTTSIIVN-ENEEFL-KNGSRPVLLIESKGHALHAFANQELQ 523
G ++H+N TKD +DYLW T I V+ E+ F +N P L I+S LH F N +L
Sbjct: 486 QGVLEHLNVTKDHSDYLWRITRINVSAEDISFWEENQVSPTLSIDSMRDILHIFVNGQLI 545
Query: 524 GSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGIT-SVKITGFN 582
GS G+ K PI L G N++ LLS TVGLQN G F E GAG VK+TGF
Sbjct: 546 GSVIGHWV----KVVQPIQLLQGYNDLVLLSQTVGLQNYGAFLEKDGAGFKGQVKLTGFK 601
Query: 583 SGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGD 642
+G +DLS YSWTY++GL+GE IY W TWYK P G+
Sbjct: 602 NGEIDLSEYSWTYQVGLRGEFQKIYMIDESEKAEWTDLTPDASPSTFTWYKTFFDAPNGE 661
Query: 643 EPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGE 702
P+ LD+ MGKG AW+NG IGRYW R +P D C +CDYRG ++ KC T CG
Sbjct: 662 NPVALDLGSMGKGQAWVNGHHIGRYWTR----VAPKDGC-GKCDYRGHYHTSKCATNCGN 716
Query: 703 PSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
P+Q WYHIPRSW + S N+LV+FEE GG P +I+ R
Sbjct: 717 PTQIWYHIPRSWLQASNNLLVLFEETGGKPFEISVKSR 754
>gi|334184642|ref|NP_001189660.1| beta galactosidase 9 [Arabidopsis thaliana]
gi|330253651|gb|AEC08745.1| beta galactosidase 9 [Arabidopsis thaliana]
Length = 859
Score = 750 bits (1937), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/746 (50%), Positives = 489/746 (65%), Gaps = 29/746 (3%)
Query: 10 FALLIFFS-SSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKE 68
ALL++F S +Y NV+YD R+LII G+R +++SA IHYPR+ P MW L+ ++KE
Sbjct: 19 IALLVYFPILSGSYFKPFNVSYDHRALIIAGKRRMLVSAGIHYPRATPEMWSDLIAKSKE 78
Query: 69 GGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGG 128
GG + +++YVFWNGHE G+Y F GR++LVKF+K+I + +Y+ LRIGP+V AE+N+GG
Sbjct: 79 GGADVVQTYVFWNGHEPVKGQYNFEGRYDLVKFVKLIGSSGLYLHLRIGPYVCAEWNFGG 138
Query: 129 IPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY 188
PVWL IPG FR D EPFK MQKF+T IVD+M+ KLF QGGPII+ Q+ENEYG
Sbjct: 139 FPVWLRDIPGIEFRTDNEPFKKEMQKFVTKIVDLMREAKLFCWQGGPIIMLQIENEYGDV 198
Query: 189 ESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMP 248
E YG+ GK Y WAA MA+ GVPW+MC+Q D P+ +I+ CN +YCD F P+S + P
Sbjct: 199 EKSYGQKGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGFKPNSRTKP 258
Query: 249 KIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF 308
+WTE+W GW+ +GG PHRP+ED+AF+VARF+Q+GGS NYYMY GGTNFGRT+GGPF
Sbjct: 259 VLWTEDWDGWYTKWGGSLPHRPAEDLAFAVARFYQRGGSFQNYYMYFGGTNFGRTSGGPF 318
Query: 309 ITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNL-SLGSSQEADVY- 366
TSYDY+AP+DEYGL PKWGHLK+LH AIKLCE AL+ + LGS QEA +Y
Sbjct: 319 YITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAADAPQYRKLGSKQEAHIYH 378
Query: 367 --ADSSG-ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSS 423
++ G CAAFLAN+D+ V F SY LP WSVSILPDC+ V FNTA V AQ+S
Sbjct: 379 GDGETGGKVCAAFLANIDEHKSAHVKFNGQSYTLPPWSVSILPDCRHVAFNTAKVGAQTS 438
Query: 424 TVEMVPENLQPSEAS---------PDNGSKGLK-WQVFKEIAGIWGEADFVKSGFVDHIN 473
+ E+ +PS S DN S K W KE GIWGE +F G ++H+N
Sbjct: 439 VKTV--ESARPSLGSMSILQKVVRQDNVSYISKSWMALKEPIGIWGENNFTFQGLLEHLN 496
Query: 474 TTKDTTDYLWYTTSIIVNENEE--FLKNGSRPVLLIESKGHALHAFANQELQGSASGNGT 531
TKD +DYLW+ T I V+E++ + KNG + I+S L F N++L GS G+
Sbjct: 497 VTKDRSDYLWHKTRISVSEDDISFWKKNGPNSTVSIDSMRDVLRVFVNKQLAGSIVGHWV 556
Query: 532 HPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGIT-SVKITGFNSGTLDLST 590
K P+ G N++ LL+ TVGLQN G F E GAG K+TGF +G LDLS
Sbjct: 557 ----KAVQPVRFIQGNNDLLLLTQTVGLQNYGAFLEKDGAGFRGKAKLTGFKNGDLDLSK 612
Query: 591 YSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDML 650
SWTY++GL+GE IY + W + WYK P G +P+ L++
Sbjct: 613 SSWTYQVGLKGEADKIYTVEHNEKAEWSTLETDASPSIFMWYKTYFDPPAGTDPVVLNLE 672
Query: 651 KMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHI 710
MG+G AW+NG+ IGRYW S+K D C + CDYRG +N DKC T CG+P+Q YH+
Sbjct: 673 SMGRGQAWVNGQHIGRYWNIISQK----DGCDRTCDYRGAYNSDKCTTNCGKPTQTRYHV 728
Query: 711 PRSWFKPSENILVIFEEKGGDPTKIT 736
PRSW KPS N+LV+FEE GG+P KI+
Sbjct: 729 PRSWLKPSSNLLVLFEETGGNPFKIS 754
>gi|125555810|gb|EAZ01416.1| hypothetical protein OsI_23450 [Oryza sativa Indica Group]
Length = 717
Score = 750 bits (1936), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/713 (51%), Positives = 465/713 (65%), Gaps = 23/713 (3%)
Query: 29 TYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPG 88
TYD RSL ING+R ++IS +IHYPRS P MWP L+Q+AK+GG++ I++YVFWNGHE G
Sbjct: 25 TYDHRSLTINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQG 84
Query: 89 KYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPF 148
+YYF R++LV+F+K+++QA +Y+ LRIGP+V AE+NYGG PVWL Y+PG FR D PF
Sbjct: 85 QYYFSDRYDLVRFVKLVKQAGLYVNLRIGPYVCAEWNYGGFPVWLKYVPGISFRTDNGPF 144
Query: 149 KYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAV 208
K MQ F+ IV MMK E LF QGGPIILAQVENEYG ES G G K Y WAAKMAV
Sbjct: 145 KAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGSGAKSYVDWAAKMAV 204
Query: 209 AQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPH 268
A N GVPWIMC+Q D PDPVINTCN FYCD FTP+S + P +WTE W GWF FGG P
Sbjct: 205 ATNAGVPWIMCKQDDAPDPVINTCNGFYCDDFTPNSKNKPSMWTEAWSGWFTAFGGTVPQ 264
Query: 269 RPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNP 328
RP ED+AF+VARF QKGGS NYYMYHGGTNF RTAGGPFI TSYDY+APIDEYGL R P
Sbjct: 265 RPVEDLAFAVARFIQKGGSFINYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQP 324
Query: 329 KWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTV 388
KWGHL LH AIK E AL+ G+ + ++G+ ++A V+ SSG CAAFL+N V
Sbjct: 325 KWGHLTNLHKAIKQAEPALVAGDPTVQNIGNYEKAYVFRSSSGDCAAFLSNFHTSAAARV 384
Query: 389 VFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKW 448
F Y LPAWS+S+LPDC+ V+NTA V A SS +M P + G W
Sbjct: 385 AFNGRRYDLPAWSISVLPDCRTAVYNTATVTAASSPAKMNP-------------AGGFTW 431
Query: 449 QVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIE 508
Q + E E F K G V+ ++ T D +DYLWYTT + ++ E+FLK+G P L +
Sbjct: 432 QSYGEATNSLDETAFTKDGLVEQLSMTWDKSDYLWYTTYVNIDSGEQFLKSGQWPQLTVY 491
Query: 509 SKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEW 568
S GH++ F N + G+A G P Y + + G N+I++LS VGL N G YE
Sbjct: 492 SAGHSVQVFVNGQYFGNAYGGYDGPKLTYSGYVKMWQGSNKISILSSAVGLPNVGTHYET 551
Query: 569 VGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQ 627
G+ V ++G N G DLS WTY+IGL+GE LG+++ +++ W Q
Sbjct: 552 WNIGVLGPVTLSGLNEGKRDLSKQKWTYQIGLKGEKLGVHSVSGSSSVEWGGAA---GKQ 608
Query: 628 PLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDY 687
P+TW++A P G P+ LD+ MGKG AW+NG IGRYW K+ + C Y
Sbjct: 609 PVTWHRAYFNAPAGGAPVALDLGSMGKGQAWVNGHLIGRYWSYKASGN------CGGCSY 662
Query: 688 RGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
G ++ KC CG+ SQRWYH+PRSW PS N++V+ EE GGD + +T R
Sbjct: 663 AGTYSEKKCQANCGDASQRWYHVPRSWLNPSGNLVVLLEEFGGDLSGVTLMTR 715
>gi|3860420|emb|CAA09467.1| exo galactanase [Lupinus angustifolius]
Length = 730
Score = 748 bits (1932), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/731 (50%), Positives = 476/731 (65%), Gaps = 25/731 (3%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
LL+FF + Y A +VTYD ++++ING+R ++IS +IHYPRS P MWP L+Q+AK+GG+
Sbjct: 22 LLLFFW--VCYVTA-SVTYDHKAIMINGQRRILISGSIHYPRSTPQMWPDLIQKAKDGGL 78
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ IE+YVFWNGHE SPGKYYF RF+LV FIK++QQA +++ LRIGPF+ AE+N+GG PV
Sbjct: 79 DVIETYVFWNGHEPSPGKYYFEDRFDLVGFIKLVQQAGLFVHLRIGPFICAEWNFGGFPV 138
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL Y+PG FR D EPFK MQKF IV++MK EKLF SQGGPIIL+Q+ENEYG E
Sbjct: 139 WLKYVPGIAFRTDNEPFKEAMQKFTEKIVNIMKAEKLFQSQGGPIILSQIENEYGPVEWE 198
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
G GK Y WAA+MAV + GVPW+MC+Q D PDP+I+TCN FYC+ FTP+ PK+W
Sbjct: 199 IGAPGKAYTKWAAQMAVGLDTGVPWVMCKQEDAPDPIIDTCNGFYCENFTPNKNYKPKLW 258
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TENW GW+ FGG P+RP+EDIAFSVARF Q GS+ NYYMYHGGTNFGRT+ G F+ T
Sbjct: 259 TENWTGWYTAFGGATPYRPAEDIAFSVARFIQNRGSLFNYYMYHGGTNFGRTSNGLFVAT 318
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
SYDY+APIDEYGL PKWGHL+ELH AIK CE AL++ + + G + E +Y S
Sbjct: 319 SYDYDAPIDEYGLLNEPKWGHLRELHRAIKQCESALVSVDPTVSWPGKNLEVHLYKTES- 377
Query: 372 ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPEN 431
ACAAFLAN + V F N Y LP WS+SILPDCK VFNTA V + +M P N
Sbjct: 378 ACAAFLANYNTDYSTQVKFGNGQYDLPPWSISILPDCKTEVFNTAKVNSPRLHRKMTPVN 437
Query: 432 LQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKS-GFVDHINTTKDTTDYLWYTTSIIV 490
WQ + E E D V + + T+D++DYLWY T + +
Sbjct: 438 ------------SAFAWQSYNEEPASSSENDPVTGYALWEQVGVTRDSSDYLWYLTDVNI 485
Query: 491 NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
N+ +K+G PVL S GH L+ F N + G+A G+ P + ++L+ G N+I
Sbjct: 486 GPND--IKDGKWPVLTAMSAGHVLNVFINGQYAGTAYGSLDDPRLTFSQSVNLRVGNNKI 543
Query: 551 ALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNP 609
+LLS++VGL N G +E W + V +TG +SGT DLS W+YKIGL+GE L ++
Sbjct: 544 SLLSVSVGLANVGTHFETWNTGVLGPVTLTGLSSGTWDLSKQKWSYKIGLKGESLSLHTE 603
Query: 610 GYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWP 669
N++ WV K QPL WYK P G++P+ LD+ MGKG W+NG+ IGR+WP
Sbjct: 604 AGSNSVEWVQGSLVAKKQPLAWYKTTFSAPAGNDPLALDLGSMGKGEVWVNGQSIGRHWP 663
Query: 670 RKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKG 729
+ + C+Y G + KC+ CG+PSQRWYH+PRSW + N LV+ EE G
Sbjct: 664 GNKARGN-----CGNCNYAGTYTDTKCLANCGQPSQRWYHVPRSWLRSGGNYLVVLEEWG 718
Query: 730 GDPTKITFSIR 740
GDP I R
Sbjct: 719 GDPNGIALVER 729
>gi|297826725|ref|XP_002881245.1| hypothetical protein ARALYDRAFT_902346 [Arabidopsis lyrata subsp.
lyrata]
gi|297327084|gb|EFH57504.1| hypothetical protein ARALYDRAFT_902346 [Arabidopsis lyrata subsp.
lyrata]
Length = 887
Score = 748 bits (1932), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/746 (49%), Positives = 487/746 (65%), Gaps = 29/746 (3%)
Query: 10 FALLIFFS-SSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKE 68
ALL++F S ++ NV+YD R+LII +R +++SA IHYPR+ P MW L++++KE
Sbjct: 19 IALLVYFPIVSGSFFKPFNVSYDHRALIIADKRRMLVSAGIHYPRATPEMWSDLIEKSKE 78
Query: 69 GGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGG 128
GG + I++YVFW+GHE G+Y F GR++LVKF+K+I + +Y+ LRIGP+V AE+N+GG
Sbjct: 79 GGADVIQTYVFWSGHEPVKGQYNFEGRYDLVKFVKLIGSSGLYLHLRIGPYVCAEWNFGG 138
Query: 129 IPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY 188
PVWL IPG FR D EPFK MQKF+T IVD+M+ KLF QGGPII+ Q+ENEYG
Sbjct: 139 FPVWLRDIPGIQFRTDNEPFKKEMQKFVTKIVDLMRDAKLFCWQGGPIIMLQIENEYGDV 198
Query: 189 ESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMP 248
E YG+ GK Y WAA MA+ GVPW+MC+Q D P+ +I+ CN +YCD F P+S P
Sbjct: 199 EKSYGQKGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGFKPNSQMKP 258
Query: 249 KIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF 308
+WTE+W GW+ +GG PHRP+ED+AF+VARF+Q+GGS NYYMY GGTNFGRT+GGPF
Sbjct: 259 ILWTEDWDGWYTKWGGSLPHRPAEDLAFAVARFYQRGGSFQNYYMYFGGTNFGRTSGGPF 318
Query: 309 ITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNL-SLGSSQEADVY- 366
TSYDY+AP+DEYGL PKWGHLK+LH AIKLCE AL+ + LGS+QEA +Y
Sbjct: 319 YITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAADAPQYRKLGSNQEAHIYR 378
Query: 367 --ADSSG-ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSS 423
++ G CAAFLAN+D+ V F SY LP WSVSILPDC+ V FNTA V AQ+S
Sbjct: 379 GDGETGGKVCAAFLANIDEHKSAHVKFNGQSYTLPPWSVSILPDCRHVAFNTAKVGAQTS 438
Query: 424 TVEMVPENLQPSEASPDNGSKGLK----------WQVFKEIAGIWGEADFVKSGFVDHIN 473
+ E+ +PS S K ++ W KE GIWGE +F G ++H+N
Sbjct: 439 VKTV--ESARPSLGSKSILQKVVRQDNVSYISKSWMALKEPIGIWGENNFTFQGLLEHLN 496
Query: 474 TTKDTTDYLWYTTSIIVNENEE--FLKNGSRPVLLIESKGHALHAFANQELQGSASGNGT 531
TKD +DYLW+ T I V+E++ + KNG+ P + I+S L F N++L GS G+
Sbjct: 497 VTKDRSDYLWHKTRITVSEDDISFWKKNGANPTVSIDSMRDVLRVFVNKQLSGSVVGHWV 556
Query: 532 HPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGIT-SVKITGFNSGTLDLST 590
K P+ G N++ LL+ TVGLQN G F E GAG K+TGF +G +DL+
Sbjct: 557 ----KAVQPVRFMQGNNDLLLLTQTVGLQNYGAFLEKDGAGFRGKAKLTGFKNGDMDLAK 612
Query: 591 YSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDML 650
SWTY++GL+GE IY + W + WYK P G +P+ LD+
Sbjct: 613 SSWTYQVGLKGEAEKIYTVEHNEKAEWSTLETDASPSIFMWYKTYFDTPAGTDPVVLDLE 672
Query: 651 KMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHI 710
MGKG AW+NG IGRYW S+K D C + CDYRG + DKC T CG+P+Q YH+
Sbjct: 673 SMGKGQAWVNGHHIGRYWNIISQK----DGCERTCDYRGAYYSDKCTTNCGKPTQTRYHV 728
Query: 711 PRSWFKPSENILVIFEEKGGDPTKIT 736
PRSW KPS N+LV+FEE GG+P I+
Sbjct: 729 PRSWLKPSSNLLVLFEETGGNPFNIS 754
>gi|75134155|sp|Q6Z6K4.1|BGAL4_ORYSJ RecName: Full=Beta-galactosidase 4; Short=Lactase 4; Flags:
Precursor
gi|46805855|dbj|BAD17189.1| putative beta-galactosidase precursor [Oryza sativa Japonica Group]
Length = 729
Score = 744 bits (1922), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/714 (50%), Positives = 469/714 (65%), Gaps = 25/714 (3%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD RSL+INGRR +++S +IHYPRS P MWPGL+Q+AK+GG++ I++YVFWNGHE
Sbjct: 38 VSYDRRSLVINGRRRILLSGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHEPVQ 97
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G+YYF R++LV+F+K+++QA +Y+ LRIGP+V AE+N+GG PVWL Y+PG FR D P
Sbjct: 98 GQYYFSDRYDLVRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVSFRTDNGP 157
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK MQKF+ IV MMK E LF QGGPII++QVENE+G ES G G K YA WAAKMA
Sbjct: 158 FKAEMQKFVEKIVSMMKSEGLFEWQGGPIIMSQVENEFGPMESVGGSGAKPYANWAAKMA 217
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
V N GVPW+MC+Q D PDPVINTCN FYCD F+P+ P +WTE W GWF +FGG P
Sbjct: 218 VGTNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKNYKPSMWTEAWTGWFTSFGGGVP 277
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
HRP ED+AF+VARF QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+APIDE+GL R
Sbjct: 278 HRPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLLRQ 337
Query: 328 PKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKT 387
PKWGHL++LH AIK E L++ + + S+GS ++A V+ +GACAAFL+N
Sbjct: 338 PKWGHLRDLHRAIKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNTAVK 397
Query: 388 VVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLK 447
V F Y+LPAWS+SILPDCK VFNTA V+ +P+ N
Sbjct: 398 VRFNGQQYNLPAWSISILPDCKTAVFNTATVK-------------EPTLMPKMNPVVRFA 444
Query: 448 WQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLI 507
WQ + E ++ F K G V+ ++ T D +DYLWYTT + + N+ L++G P L +
Sbjct: 445 WQSYSEDTNSLSDSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTND--LRSGQSPQLTV 502
Query: 508 ESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYE 567
S GH++ F N + GS G +P Y + + G N+I++LS VGL N G +E
Sbjct: 503 YSAGHSMQVFVNGKSYGSVYGGYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFE 562
Query: 568 -WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKN 626
W + V ++ N GT DLS WTY++GL+GE LG++ + + W P
Sbjct: 563 NWNVGVLGPVTLSSLNGGTKDLSHQKWTYQVGLKGETLGLHTVTGSSAVEWGG---PGGY 619
Query: 627 QPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECD 686
QPLTW+KA P G++P+ LDM MGKG W+NG +GRYW K+ C
Sbjct: 620 QPLTWHKAFFNAPAGNDPVALDMGSMGKGQLWVNGHHVGRYWSYKASGG------CGGCS 673
Query: 687 YRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
Y G ++ DKC + CG+ SQRWYH+PRSW KP N+LV+ EE GGD ++ + R
Sbjct: 674 YAGTYHEDKCRSNCGDLSQRWYHVPRSWLKPGGNLLVVLEEYGGDLAGVSLATR 727
>gi|414864994|tpg|DAA43551.1| TPA: beta-galactosidase [Zea mays]
Length = 897
Score = 744 bits (1922), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/772 (47%), Positives = 490/772 (63%), Gaps = 75/772 (9%)
Query: 29 TYDSRSLIINGRRELIISAAIHYPRSVPG------------------------------- 57
TYD ++++I+G+R ++ S +IHYPRS P
Sbjct: 30 TYDKKAVLIDGQRRILFSGSIHYPRSTPDVISCILQNLSFFFSPLLPRGGGEFMAVVSCV 89
Query: 58 ---------------------MWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
MW GL+Q+AK+GG++ I++YVFWNGHE +PG YYF R+
Sbjct: 90 LDAMLSKANCFPTLAVPLYSTMWEGLIQKAKDGGLDVIQTYVFWNGHEPTPGNYYFEERY 149
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
+LV+F+K +Q+A +++ LRIGP++ E+N+GG PVWL Y+PG FR D EPFK MQ F
Sbjct: 150 DLVRFVKTVQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEPFKTAMQGFT 209
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPW 216
IV MMK E LFASQGGPIIL+Q+ENEYG +G G+ Y WAAKMAV + GVPW
Sbjct: 210 EKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGQAYINWAAKMAVGLDTGVPW 269
Query: 217 IMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAF 276
+MC++ D PDPVIN CN FYCD F+P+ P P +WTE W GWF FGG RP ED+AF
Sbjct: 270 VMCKEEDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAF 329
Query: 277 SVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
+VARF QKGGS NYYMYHGGTNFGRTAGGPFITTSYDY+APIDEYGL R PK HLKEL
Sbjct: 330 AVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIREPKHSHLKEL 389
Query: 337 HGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYH 396
H A+KLCE AL++ + + +LG+ QEA V+ SG CAAFLAN + + VVF N Y
Sbjct: 390 HRAVKLCEQALVSVDPTITTLGTMQEAHVFRSPSG-CAAFLANYNSNSHAKVVFNNEQYS 448
Query: 397 LPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVF-KEIA 455
LP WS+SILPDCK VVFN+A V Q+S ++M +G+ + W+ + +E+
Sbjct: 449 LPPWSISILPDCKNVVFNSATVGVQTSQMQMW-----------GDGATSMMWERYDEEVD 497
Query: 456 GIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR-PVLLIESKGHAL 514
+ +G ++ +N T+D++DYLWY TS+ ++ +E FL+ G + P L ++S GHAL
Sbjct: 498 SLAAAPLLTTTGLLEQLNVTRDSSDYLWYITSVDISPSENFLQGGGKPPSLSVQSAGHAL 557
Query: 515 HAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGIT 574
H F N +LQGS+ G KY ++L+AG N+IALLS+ GL N G YE G+
Sbjct: 558 HVFVNGQLQGSSYGTREDRRIKYNGNVNLRAGTNKIALLSVACGLPNVGVHYETWNTGVG 617
Query: 575 S-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVS-TMEPPKNQPLTWY 632
V + G N G+ DL+ +W+Y++GL+GE + + + ++ W+ ++ K QPL WY
Sbjct: 618 GPVVLHGLNEGSRDLTWQTWSYQVGLKGEQMNLNSVEGSGSVEWMQGSLIAQKQQPLAWY 677
Query: 633 KAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFN 692
KA + P GDEP+ LDM MGKG W+NG+ IGRYW ++ D + C Y G F
Sbjct: 678 KAYFETPSGDEPLALDMGSMGKGQVWINGQSIGRYW------TAYADGDCKGCSYTGTFR 731
Query: 693 PDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE-KGGDPTKITFSIRKIS 743
KC GCG+P+QRWYH+PRSW +PS N+LV+ EE GGD +KI + R +S
Sbjct: 732 APKCQAGCGQPTQRWYHVPRSWLQPSRNLLVVLEELGGGDSSKIALAKRSVS 783
>gi|152013361|sp|A2X2H7.1|BGAL4_ORYSI RecName: Full=Beta-galactosidase 4; Short=Lactase 4; Flags:
Precursor
gi|125538642|gb|EAY85037.1| hypothetical protein OsI_06394 [Oryza sativa Indica Group]
Length = 729
Score = 744 bits (1920), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/714 (50%), Positives = 469/714 (65%), Gaps = 25/714 (3%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD RSL+INGRR +++S +IHYPRS P MWPGL+Q+AK+GG++ I++YVFWNGHE
Sbjct: 38 VSYDRRSLVINGRRRILLSGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHEPVQ 97
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G+YYF R++LV+F+K+++QA +Y+ LRIGP+V AE+N+GG PVWL Y+PG FR D P
Sbjct: 98 GQYYFSDRYDLVRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVSFRTDNGP 157
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK MQKF+ IV MMK E LF QGGPII++QVENE+G ES G G K YA WAAKMA
Sbjct: 158 FKAEMQKFVEKIVSMMKSEGLFEWQGGPIIMSQVENEFGPMESVGGSGAKPYANWAAKMA 217
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
V N GVPW+MC+Q D PDPVINTCN FYCD F+P+ P +WTE W GWF +FGG P
Sbjct: 218 VRTNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKNYKPSMWTEAWTGWFTSFGGGVP 277
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
HRP ED+AF+VARF QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+APIDE+GL R
Sbjct: 278 HRPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLLRQ 337
Query: 328 PKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKT 387
PKWGHL++LH AIK E L++ + + S+GS ++A V+ +GACAAFL+N
Sbjct: 338 PKWGHLRDLHRAIKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNTAVK 397
Query: 388 VVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLK 447
V F Y+LPAWS+SILPDCK VFNTA V+ +P+ N
Sbjct: 398 VRFNGQQYNLPAWSISILPDCKTAVFNTATVK-------------EPTLMPKMNPVVRFA 444
Query: 448 WQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLI 507
WQ + E ++ F K G V+ ++ T D +DYLWYTT + + N+ L++G P L +
Sbjct: 445 WQSYSEDTNSLSDSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTND--LRSGQSPQLTV 502
Query: 508 ESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYE 567
S GH++ F N + GS G +P Y + + G N+I++LS VGL N G +E
Sbjct: 503 YSAGHSMQVFVNGKSYGSVYGGYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFE 562
Query: 568 -WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKN 626
W + V ++ N GT DLS WTY++GL+GE LG++ + + W P
Sbjct: 563 NWNVGVLGPVTLSSLNGGTKDLSHQKWTYQVGLKGETLGLHTVTGSSAVEWGG---PGGY 619
Query: 627 QPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECD 686
QPLTW+KA P G++P+ LDM MGKG W+NG +GRYW K+ C
Sbjct: 620 QPLTWHKAFFNAPAGNDPVALDMGSMGKGQLWVNGHHVGRYWSYKASGG------CGGCS 673
Query: 687 YRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
Y G ++ DKC + CG+ SQRWYH+PRSW KP N+LV+ EE GGD ++ + R
Sbjct: 674 YAGTYHEDKCRSNCGDLSQRWYHVPRSWLKPGGNLLVVLEEYGGDLAGVSLATR 727
>gi|449433177|ref|XP_004134374.1| PREDICTED: beta-galactosidase 9-like [Cucumis sativus]
Length = 890
Score = 743 bits (1917), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/738 (49%), Positives = 477/738 (64%), Gaps = 33/738 (4%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NV+YD R+LII+G+R ++ISA +HYPR+ P MWP +++++KEGG + I+SYVFWNGHE +
Sbjct: 32 NVSYDHRALIIDGKRRMLISAGVHYPRASPEMWPDIIEKSKEGGADVIQSYVFWNGHEPT 91
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+Y F GR++LVKFI+++ + +Y+ LRIGP+V AE+N+GG P+WL +PG FR D
Sbjct: 92 KGQYNFDGRYDLVKFIRLVGSSGLYLHLRIGPYVCAEWNFGGFPLWLRDVPGIEFRTDNA 151
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK MQ+F+ IVD+++ EKLF QGGP+I+ QVENEYG ES YG+ G+ Y W M
Sbjct: 152 PFKEEMQRFVKKIVDLLRDEKLFCWQGGPVIMLQVENEYGNIESSYGKRGQEYIKWVGNM 211
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
A+ VPW+MCQQ D P +IN+CN +YCD F +SPS P WTENW GWF ++G R
Sbjct: 212 ALGLGAEVPWVMCQQKDAPSTIINSCNGYYCDGFKANSPSKPIFWTENWNGWFTSWGERS 271
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
PHRP ED+AFSVARFFQ+ GS NYYMY GGTNFGRTAGGPF TSYDY++PIDEYGL R
Sbjct: 272 PHRPVEDLAFSVARFFQREGSFQNYYMYFGGTNFGRTAGGPFYITSYDYDSPIDEYGLIR 331
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSN-LSLGSSQEADVYADSSGA------------- 372
PKWGHLK+LH A+KLCE AL++ + + LG QEA VY S
Sbjct: 332 EPKWGHLKDLHTALKLCEPALVSADSPQYIKLGPKQEAHVYHMKSQTDDLTLSKLGTLRN 391
Query: 373 CAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSS--TVEM--- 427
C+AFLAN+D++ V F +Y+LP WSVSILPDC+ VVFNTA V AQ+S +E+
Sbjct: 392 CSAFLANIDERKAVAVKFNGQTYNLPPWSVSILPDCQNVVFNTAKVAAQTSIKILELYAP 451
Query: 428 VPENLQPSEASPDNGSKGL---KWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWY 484
+ N+ + D + W KE GIW + +F G ++H+N TKD +DYLWY
Sbjct: 452 LSANVSLKLHATDQNELSIIANSWMTVKEPIGIWSDQNFTVKGILEHLNVTKDRSDYLWY 511
Query: 485 TTSI-IVNENEEFLKNGS-RPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPIS 542
T I + N++ F K + P + I+S F N +L GSA G K+ P+
Sbjct: 512 MTRIHVSNDDIRFWKERNITPTITIDSVRDVFRVFVNGKLTGSAIGQWV----KFVQPVQ 567
Query: 543 LKAGKNEIALLSMTVGLQNAGPFYEWVGAGIT-SVKITGFNSGTLDLSTYSWTYKIGLQG 601
G N++ LLS +GLQN+G F E GAGI +K+TGF +G +DLS WTY++GL+G
Sbjct: 568 FLEGYNDLLLLSQAMGLQNSGAFIEKDGAGIRGRIKLTGFKNGDIDLSKSLWTYQVGLKG 627
Query: 602 EHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNG 661
E L Y+ +W TWYKA P G +P+ +++ MGKG AW+NG
Sbjct: 628 EFLNFYSLEENEKADWTELSVDAIPSTFTWYKAYFSSPDGTDPVAINLGSMGKGQAWVNG 687
Query: 662 EEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENI 721
IGRYW SP D C ++CDYRG +N KC T CG P+Q WYHIPRSW K S N+
Sbjct: 688 HHIGRYW----SVVSPKDGCPRKCDYRGAYNSGKCATNCGRPTQSWYHIPRSWLKESSNL 743
Query: 722 LVIFEEKGGDPTKITFSI 739
LV+FEE GG+P +I +
Sbjct: 744 LVLFEETGGNPLEIVVKL 761
>gi|255554022|ref|XP_002518051.1| beta-galactosidase, putative [Ricinus communis]
gi|223542647|gb|EEF44184.1| beta-galactosidase, putative [Ricinus communis]
Length = 897
Score = 741 bits (1913), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/745 (49%), Positives = 469/745 (62%), Gaps = 40/745 (5%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NV+YD R+LII+G R ++IS IHYPR+ P MWP L+ ++KEGGV+ I++YVFWNGHE
Sbjct: 39 NVSYDHRALIIDGHRRMLISGGIHYPRATPQMWPDLIAKSKEGGVDVIQTYVFWNGHEPV 98
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+Y F G+++LVKF+K++ + +Y+ LRIGP+V AE+N+GG PVWL IPG VFR D
Sbjct: 99 KGQYIFEGQYDLVKFVKLVGVSGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIVFRTDNS 158
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PF MQ+F+ IVD+M+ E LF+ QGGPII+ Q+ENEYG E +G GGK Y WAA+M
Sbjct: 159 PFMEEMQQFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNIEHSFGPGGKEYVKWAARM 218
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
A+ GVPW+MC+Q D P +I+ CN +YCD + P+S P +WTE+W GW+ T+GG
Sbjct: 219 ALGLGAGVPWVMCRQTDAPGSIIDACNEYYCDGYKPNSNKKPILWTEDWDGWYTTWGGSL 278
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
PHRP ED+AF+VARFFQ+GGS NYYMY GGTNF RTAGGPF TSYDY+APIDEYGL
Sbjct: 279 PHRPVEDLAFAVARFFQRGGSFQNYYMYFGGTNFARTAGGPFYITSYDYDAPIDEYGLLS 338
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSN-LSLGSSQEADVYA-------------DSSGA 372
PKWGHLK+LH AIKLCE AL+ + + + LGS QEA VY S
Sbjct: 339 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGSKQEAHVYRANVHAEGQNLTQHGSQSK 398
Query: 373 CAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEM----- 427
C+AFLAN+D+ TV F SY LP WSVS+LPDC+ VFNTA V AQ+S M
Sbjct: 399 CSAFLANIDEHKAVTVRFLGQSYTLPPWSVSVLPDCRNAVFNTAKVAAQTSIKSMELALP 458
Query: 428 ------VPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDY 481
P+ L A + W KE +W +F G ++H+N TKD +DY
Sbjct: 459 QFSGISAPKQLM---AQNEGSYMSSSWMTVKEPISVWSGNNFTVEGILEHLNVTKDHSDY 515
Query: 482 LWYTTSIIVNENEEFL--KNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKN 539
LWY T I V++++ +N P + I+S L F N +L GS G K
Sbjct: 516 LWYFTRIYVSDDDIAFWEENNVHPAIKIDSMRDVLRVFINGQLTGSVIGRW----IKVVQ 571
Query: 540 PISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIG 598
P+ + G NE+ LLS TVGLQN G F E GAG K+TGF G +DLS WTY++G
Sbjct: 572 PVQFQKGYNELVLLSQTVGLQNYGAFLERDGAGFRGHTKLTGFRDGDIDLSNLEWTYQVG 631
Query: 599 LQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAW 658
LQGE+ IY W TWYK P G +P+ LD+ MGKG AW
Sbjct: 632 LQGENQKIYTTENNEKAEWTDLTLDDIPSTFTWYKTYFDAPSGADPVALDLGSMGKGQAW 691
Query: 659 LNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPS 718
+N IGRYW +P + C Q+CDYRG +N +KC T CG+P+Q WYHIPRSW +PS
Sbjct: 692 VNDHHIGRYWTL----VAPEEGC-QKCDYRGAYNSEKCRTNCGKPTQIWYHIPRSWLQPS 746
Query: 719 ENILVIFEEKGGDPTKITFSIRKIS 743
N+LVIFEE GG+P +I+ +R S
Sbjct: 747 NNLLVIFEETGGNPFEISIKLRSAS 771
>gi|267026|sp|Q00662.1|BGAL_DIACA RecName: Full=Putative beta-galactosidase; Short=Lactase; AltName:
Full=SR12 protein; Flags: Precursor
gi|18328|emb|CAA40459.1| CARSR12 [Dianthus caryophyllus]
Length = 731
Score = 739 bits (1908), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/735 (49%), Positives = 477/735 (64%), Gaps = 21/735 (2%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
+ +F ++ C GNV YD R++ IN +R +++S +IHYPRS P MWP ++++AK+ +
Sbjct: 15 VYVFVLITLISCVYGNVWYDYRAIKINDQRRILLSGSIHYPRSTPEMWPDIIEKAKDSQL 74
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ I++YVFWNGHE S GKYYF GR++LVKFIK+I QA +++ LRIGPF AE+N+GG PV
Sbjct: 75 DVIQTYVFWNGHEPSEGKYYFEGRYDLVKFIKLIHQAGLFVHLRIGPFACAEWNFGGFPV 134
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL Y+PG FR D PFK MQ F T IVDMMK EKLF QGGPIIL Q+ENEYG E
Sbjct: 135 WLKYVPGIEFRTDNGPFKEKMQVFTTKIVDMMKAEKLFHWQGGPIILNQIENEYGPVEWE 194
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQ-FDTPDPVINTCNSFYCDQFTPHSPSMPKI 250
G GK Y WAA+MA + N GVPWIMC+Q D PD VI+TCN FYC+ F P S PK+
Sbjct: 195 IGAPGKAYTHWAAQMAQSLNAGVPWIMCKQDSDVPDNVIDTCNGFYCEGFVPKDKSKPKM 254
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT 310
WTENW GW+ +G P+RP+ED+AFSVARF Q GGS NYYM+HGGTNF TA G F++
Sbjct: 255 WTENWTGWYTEYGKPVPYRPAEDVAFSVARFIQNGGSFMNYYMFHGGTNFETTA-GRFVS 313
Query: 311 TSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSS 370
TSYDY+AP+DEYGLPR PK+ HLK LH AIK+CE AL++ + +LGS+QEA VY+ +S
Sbjct: 314 TSYDYDAPLDEYGLPREPKYTHLKNLHKAIKMCEPALVSSDAKVTNLGSNQEAHVYSSNS 373
Query: 371 GACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPE 430
G+CAAFLAN D K V F + + LPAWS+SILPDCKK V+NTA V S +
Sbjct: 374 GSCAAFLANYDPKWSVKVTFSGMEFELPAWSISILPDCKKEVYNTARVNEPSPKLH---- 429
Query: 431 NLQPSEASPDNGSKGLKWQVFK-EIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSII 489
S+ +P L WQ + E+ F + + IN T D +DYLWY T ++
Sbjct: 430 ----SKMTPV--ISNLNWQSYSDEVPTADSPGTFREKKLYEQINMTWDKSDYLWYMTDVV 483
Query: 490 VNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNE 549
++ NE FLK G P L + S GH LH F N +LQG A G+ P + + + AG N
Sbjct: 484 LDGNEGFLKKGDEPWLTVNSAGHVLHVFVNGQLQGHAYGSLAKPQLTFSQKVKMTAGVNR 543
Query: 550 IALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYN 608
I+LLS VGL N G +E G+ V ++G N GT DL+ W+YKIG +GE +YN
Sbjct: 544 ISLLSAVVGLANVGWHFERYNQGVLGPVTLSGLNEGTRDLTWQYWSYKIGTKGEEQQVYN 603
Query: 609 PGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYW 668
G +++ W P QPL WYK P G++P+ LD+ MGKG AW+NG+ IGR+W
Sbjct: 604 SGGSSHVQW---GPPAWKQPLVWYKTTFDAPGGNDPLALDLGSMGKGQAWINGQSIGRHW 660
Query: 669 PRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEK 728
K S C C+Y G + KC++ CG+ SQ+WYH+PRSW +P N+LV+FEE
Sbjct: 661 SNNIAKGS----CNDNCNYAGTYTETKCLSDCGKSSQKWYHVPRSWLQPRGNLLVVFEEW 716
Query: 729 GGDPTKITFSIRKIS 743
GGD ++ R I+
Sbjct: 717 GGDTKWVSLVKRTIA 731
>gi|34148077|gb|AAQ62586.1| putative beta-galactosidase [Glycine max]
Length = 909
Score = 738 bits (1904), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/739 (49%), Positives = 478/739 (64%), Gaps = 36/739 (4%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NV+YD R+LI+NG+R +ISA IHYPR+ P MWP L+ ++KEGG + IE+YVFWNGHE
Sbjct: 46 NVSYDHRALILNGKRRFLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNGHEPV 105
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+Y F GR++LVKF+++ +Y LRIGP+ AE+N+GG PVWL IPG FR +
Sbjct: 106 RGQYNFEGRYDLVKFVRLAASHGLYFFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTNNA 165
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK M++F++ +V++M+ E+LF+ QGGPIIL Q+ENEYG E+ YG+GGK Y WAAKM
Sbjct: 166 PFKEEMKRFVSKVVNLMREERLFSWQGGPIILLQIENEYGNIENSYGKGGKEYMKWAAKM 225
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
A++ GVPW+MC+Q D P +I+TCN++YCD F P+S + P +WTENW GW+ +G R
Sbjct: 226 ALSLGAGVPWVMCRQQDAPYDIIDTCNAYYCDGFKPNSHNKPTMWTENWDGWYTQWGERL 285
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
PHRP ED+AF+VARFFQ+GGS NYYMY GGTNFGRTAGGP TSYDY+APIDEYGL R
Sbjct: 286 PHRPVEDLAFAVARFFQRGGSFQNYYMYFGGTNFGRTAGGPLQITSYDYDAPIDEYGLLR 345
Query: 327 NPKWGHLKELHGAIKLCEHALLNGER-SNLSLGSSQEADVYA-------------DSSGA 372
PKWGHLK+LH A+KLCE AL+ + + + LG QEA VY +SS
Sbjct: 346 EPKWGHLKDLHAALKLCEPALVATDSPTYIKLGPKQEAHVYQANVHLEGLNLSMFESSSI 405
Query: 373 CAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENL 432
C+AFLAN+D+ + TV FR Y +P WSVS+LPDC+ VFNTA VRAQ+S V++V L
Sbjct: 406 CSAFLANIDEWKEATVTFRGQRYTIPPWSVSVLPDCRNTVFNTAKVRAQTS-VKLVESYL 464
Query: 433 ---------QPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLW 483
Q D W KE IW ++ F G +H+N TKD +DYLW
Sbjct: 465 PTVSNIFPAQQLRHQNDFYYISKSWMTTKEPLNIWSKSSFTVEGIWEHLNVTKDQSDYLW 524
Query: 484 YTTSIIVNENEEFL--KNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPI 541
Y+T + V++++ +N P L I+ L F N +L G+ G+ K +
Sbjct: 525 YSTRVYVSDSDILFWEENDVHPKLTIDGVRDILRVFINGQLIGNVVGHW----IKVVQTL 580
Query: 542 SLKAGKNEIALLSMTVGLQNAGPFYEWVGAGIT-SVKITGFNSGTLDLSTYSWTYKIGLQ 600
G N++ LL+ TVGLQN G F E GAGI +KITGF +G +DLS WTY++GLQ
Sbjct: 581 QFLPGYNDLTLLTQTVGLQNYGAFLEKDGAGIRGKIKITGFENGDIDLSKSLWTYQVGLQ 640
Query: 601 GEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLN 660
GE L Y+ N+ WV TWYK P G +P+ LD MGKG AW+N
Sbjct: 641 GEFLKFYSEENENS-EWVELTPDAIPSTFTWYKTYFDVPGGIDPVALDFKSMGKGQAWVN 699
Query: 661 GEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSEN 720
G+ IGRYW R S KS C Q CDYRG +N DKC T CG+P+Q YH+PRSW K + N
Sbjct: 700 GQHIGRYWTRVSPKSG----CQQVCDYRGAYNSDKCSTNCGKPTQTLYHVPRSWLKATNN 755
Query: 721 ILVIFEEKGGDPTKITFSI 739
+LVI EE GG+P +I+ +
Sbjct: 756 LLVILEETGGNPFEISVKL 774
>gi|2209358|gb|AAB61470.1| beta-D-galactosidase [Mangifera indica]
Length = 663
Score = 736 bits (1901), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/661 (53%), Positives = 455/661 (68%), Gaps = 19/661 (2%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
LL+ FSS + + A V+YD +++II+G+R ++IS +IHYPRS P MWP L+Q+AK+ GV
Sbjct: 19 LLMLFSSWVCFVEA-TVSYDHKAIIIDGQRRILISGSIHYPRSTPQMWPDLIQKAKD-GV 76
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ I++YVFWNGHE SPGKYYF R++LV+FIK++QQA +Y+ LRIGP+V AE+N+GG PV
Sbjct: 77 DVIQTYVFWNGHEPSPGKYYFEDRYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPV 136
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL Y+PG FR D EPFK MQKF IV MMK EKLF +QGGPIIL+Q+ENE+G E
Sbjct: 137 WLKYVPGIEFRTDNEPFKAAMQKFTEKIVSMMKAEKLFETQGGPIILSQIENEFGPVEWE 196
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
G GK Y WAA+MAV + GVPW+MC+Q D PDPVINTCN FYC+ F P+ + PK+W
Sbjct: 197 IGAPGKAYTKWAAQMAVGLDTGVPWVMCKQDDAPDPVINTCNGFYCENFVPNQKNKPKMW 256
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TENW GWF FGG P RP+ED+AFSVARF Q GGS NYYMYHGGTNFGRTAGGPFI T
Sbjct: 257 TENWTGWFTAFGGPTPQRPAEDVAFSVARFIQNGGSFVNYYMYHGGTNFGRTAGGPFIAT 316
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
SYDY+AP+DEYGL R PKWGHL++LH AIKLCE AL++ + + SLG++QE V+ SG
Sbjct: 317 SYDYDAPLDEYGLLREPKWGHLRDLHKAIKLCESALVSTDPTVTSLGNNQEVHVFNPKSG 376
Query: 372 ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPEN 431
+CAAFLAN D + V F+ + Y LP WS+SILPDCK VFNTA + AQSS +M P +
Sbjct: 377 SCAAFLANYDTTSSAKVNFKIMQYELPPWSISILPDCKTAVFNTARLGAQSSLKQMTPVS 436
Query: 432 LQPSEASPDNGSKGLKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIV 490
WQ + +E A + F G + +N T+D +DYLWY T+I +
Sbjct: 437 T-------------FSWQSYIEESASSSDDKTFTTDGLWEQLNVTRDASDYLWYMTNINI 483
Query: 491 NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
+ NE FLKNG P+L I S GHALH F N +L G+ G +P + + ++ G N++
Sbjct: 484 DSNEGFLKNGQDPLLTIWSAGHALHVFINGQLSGTVYGGVDNPKLTFSQNVKMRVGVNQL 543
Query: 551 ALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNP 609
+LLS++VGLQN G +E W + V + G N GT DLS W+YKIGL+GE L ++
Sbjct: 544 SLLSISVGLQNVGTHFEQWNTGVLGPVTLRGLNEGTRDLSKQQWSYKIGLKGEDLSLHTV 603
Query: 610 GYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWP 669
+++ WV + QPLTWYK P G+EP+ LDM MGKGL W+N + IGR P
Sbjct: 604 SGSSSVEWVEGSSLAQKQPLTWYKTTFNAPAGNEPLALDMSTMGKGLIWINSQSIGR--P 661
Query: 670 R 670
R
Sbjct: 662 R 662
>gi|125581329|gb|EAZ22260.1| hypothetical protein OsJ_05915 [Oryza sativa Japonica Group]
Length = 754
Score = 736 bits (1901), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/705 (50%), Positives = 463/705 (65%), Gaps = 25/705 (3%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD RSL+INGRR +++S +IHYPRS P MWPGL+Q+AK+GG++ I++YVFWNGHE
Sbjct: 38 VSYDRRSLVINGRRRILLSGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHEPVQ 97
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G+YYF R++LV+F+K+++QA +Y+ LRIGP+V AE+N+GG PVWL Y+PG FR D P
Sbjct: 98 GQYYFSDRYDLVRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVSFRTDNGP 157
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK MQKF+ IV MMK E LF QGGPII++QVENE+G ES G G K YA WAAKMA
Sbjct: 158 FKAEMQKFVEKIVSMMKSEGLFEWQGGPIIMSQVENEFGPMESVGGSGAKPYANWAAKMA 217
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
V N GVPW+MC+Q D PDPVINTCN FYCD F+P+ P +WTE W GWF +FGG P
Sbjct: 218 VGTNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKNYKPSMWTEAWTGWFTSFGGGVP 277
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
HRP ED+AF+VARF QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+APIDE+GL R
Sbjct: 278 HRPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLLRQ 337
Query: 328 PKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKT 387
PKWGHL++LH AIK E L++ + + S+GS ++A V+ +GACAAFL+N
Sbjct: 338 PKWGHLRDLHRAIKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNTAVK 397
Query: 388 VVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLK 447
V F Y+LPAWS+SILPDCK VFNTA V+ + +M P
Sbjct: 398 VRFNGQQYNLPAWSISILPDCKTAVFNTATVKEPTLMPKMNP-------------VVRFA 444
Query: 448 WQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLI 507
WQ + E ++ F K G V+ ++ T D +DYLWYTT + + N+ L++G P L +
Sbjct: 445 WQSYSEDTNSLSDSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTND--LRSGQSPQLTV 502
Query: 508 ESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYE 567
S GH++ F N + GS G +P Y + + G N+I++LS VGL N G +E
Sbjct: 503 YSAGHSMQVFVNGKSYGSVYGGYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFE 562
Query: 568 -WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKN 626
W + V ++ N GT DLS WTY++GL+GE LG+ + + W P
Sbjct: 563 NWNVGVLGPVTLSSLNGGTKDLSHQKWTYQVGLKGETLGLQTVTGSSAVEWGG---PGGY 619
Query: 627 QPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECD 686
QPLTW+KA P G++P+ LDM MGKG W+NG +GRYW K+ C
Sbjct: 620 QPLTWHKAFFNAPAGNDPVALDMGSMGKGQLWVNGHHVGRYWSYKASGG------CGGCS 673
Query: 687 YRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGD 731
Y G ++ DKC + CG+ SQRWYH+PRSW KP N+LV+ EE G +
Sbjct: 674 YAGTYHEDKCRSNCGDLSQRWYHVPRSWLKPGGNLLVVLEEYGAN 718
>gi|357153898|ref|XP_003576603.1| PREDICTED: beta-galactosidase 15-like [Brachypodium distachyon]
Length = 908
Score = 736 bits (1900), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/739 (48%), Positives = 477/739 (64%), Gaps = 36/739 (4%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NV+YD R++ + G R +++SA +HYPR+ P MWP ++ + KEGG + IE+Y+FWNGHE +
Sbjct: 51 NVSYDHRAVRVGGERRMLVSAGVHYPRATPEMWPSIIAKCKEGGADVIETYIFWNGHEPA 110
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+YYF RF+LV+FIK++ +++ LRIGP+ AE+N+GG PVWL IPG FR D E
Sbjct: 111 KGQYYFEERFDLVRFIKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTDNE 170
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
P+K MQ F+T IVDMMK EKL++ QGGPIIL Q+ENEYG + YG+ GKRY WAA+M
Sbjct: 171 PYKAEMQTFVTKIVDMMKDEKLYSWQGGPIILQQIENEYGNIQGKYGQAGKRYMQWAAQM 230
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
A+ + G+PW+MC+Q D P+ +++TCN+FYCD F P+S + P IWTE+W GW+ +GG
Sbjct: 231 ALGLDTGIPWVMCRQTDAPEQILDTCNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGGPL 290
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
PHRP+ED AF+VARF+Q+GGS+ NYYMY GGTNF RTAGGP TSYDY+API+EYG+ R
Sbjct: 291 PHRPAEDSAFAVARFYQRGGSLQNYYMYFGGTNFARTAGGPLQITSYDYDAPINEYGMLR 350
Query: 327 NPKWGHLKELHGAIKLCEHALL--NGERSNLSLGSSQEADVY-----------ADSSGAC 373
PKWGHLK+LH AIKLCE AL+ +G + LGS QEA +Y A ++ C
Sbjct: 351 QPKWGHLKDLHTAIKLCEPALIAVDGSPQYVKLGSMQEAHIYSSAKVHTNGSTAGNAQIC 410
Query: 374 AAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQ 433
+AFLAN+D+ +V SY+LP WSVSILPDC+ V FNTA V AQ+S E+
Sbjct: 411 SAFLANIDEHKYVSVWIFGKSYNLPPWSVSILPDCENVAFNTARVGAQTSVFTF--ESGS 468
Query: 434 PSEASPDNGSKGL----------KWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLW 483
PS +S S L W KE G WG+ F G ++H+N TKD +DYLW
Sbjct: 469 PSHSSRREPSVLLPGVRGSYLSSTWWTSKETIGTWGDGSFATQGILEHLNVTKDISDYLW 528
Query: 484 YTTSI-IVNENEEFLKN-GSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPI 541
YTTS+ I +E+ F + G P L+I+ F N +L GS G+ K PI
Sbjct: 529 YTTSVNISDEDVAFWSSKGVLPSLIIDQIRDVARVFVNGKLAGSQVGHWV----SLKQPI 584
Query: 542 SLKAGKNEIALLSMTVGLQNAGPFYEWVGAGIT-SVKITGFNSGTLDLSTYSWTYKIGLQ 600
G NE+ LLS VGLQN G F E GAG VK+TG ++G DL+ +WTY++GL+
Sbjct: 585 QFVRGLNELTLLSEIVGLQNYGAFLEKDGAGFKGQVKLTGLSNGDTDLTNSAWTYQVGLK 644
Query: 601 GEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLN 660
GE IY P + W + P TWYK +V P G +P+ +D+ MGKG AW+N
Sbjct: 645 GEFSMIYTPEKQECAEWSAMQTDNIQSPFTWYKTMVDAPEGTDPVAIDLGSMGKGQAWVN 704
Query: 661 GEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSEN 720
G IGRYW +P C C+Y G ++ KC + CG P+Q WYHIPR W + S N
Sbjct: 705 GRLIGRYW----SLVAPESGCPSSCNYPGAYSETKCQSNCGMPTQSWYHIPREWLQESNN 760
Query: 721 ILVIFEEKGGDPTKITFSI 739
+LV+FEE GGDP+KI+ +
Sbjct: 761 LLVLFEETGGDPSKISLEV 779
>gi|357518749|ref|XP_003629663.1| Beta-galactosidase [Medicago truncatula]
gi|355523685|gb|AET04139.1| Beta-galactosidase [Medicago truncatula]
Length = 912
Score = 736 bits (1900), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/782 (48%), Positives = 493/782 (63%), Gaps = 50/782 (6%)
Query: 1 MKPRTPIAPFALLIFFSSSITYCFAG-------NVTYDSRSLIINGRRELIISAAIHYPR 53
++ RT + + + F +SI A NVTYD R+LII+G R ++ISA IHYPR
Sbjct: 16 IRGRTVVFTWFCVCVFVASIIVAGAEAAWFKPFNVTYDHRALIIDGHRRMLISAGIHYPR 75
Query: 54 SVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMI 113
+ P MWP L+ +AKEGGV+ IE+YVFWNGH+ G+Y F GR++LVKF K++ +Y
Sbjct: 76 ATPEMWPDLIAKAKEGGVDVIETYVFWNGHQPVKGQYNFEGRYDLVKFAKLVASNGLYFF 135
Query: 114 LRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQG 173
LRIGP+ AE+N+GG PVWL IPG FR + PFK M++F++ +V++M+ E LF+ QG
Sbjct: 136 LRIGPYACAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMKRFVSKVVNLMREEMLFSWQG 195
Query: 174 GPIILAQV------ENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDP 227
GPIIL QV ENEYG ES YG GK Y WAA MA++ GVPW+MC+Q D P
Sbjct: 196 GPIILLQVRREYGIENEYGNLESSYGNEGKEYVKWAASMALSLGAGVPWVMCKQPDAPYD 255
Query: 228 VINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGS 287
+I+TCN++YCD F P+S + P WTENW GW+ +G R PHRP ED+AF+VARFFQ+GGS
Sbjct: 256 IIDTCNAYYCDGFKPNSRNKPIFWTENWDGWYTQWGERLPHRPVEDLAFAVARFFQRGGS 315
Query: 288 VHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHAL 347
+ NYYMY GGTNFGRTAGGP TSYDY+APIDEYGL PKWGHLK+LH A+KLCE AL
Sbjct: 316 LQNYYMYFGGTNFGRTAGGPLQITSYDYDAPIDEYGLLNEPKWGHLKDLHAALKLCEPAL 375
Query: 348 LNGER-SNLSLGSSQEADVYADS-------------SGACAAFLANMDDKNDKTVVFRNV 393
+ + + + LGS QEA VY ++ S C+AFLAN+D++ TV FR
Sbjct: 376 VAADSPTYIKLGSKQEAHVYQENVHREGLNLSISQISNKCSAFLANIDERKAATVTFRGQ 435
Query: 394 SYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMV------PENLQPSEASPD-NGSKGL 446
+Y LP WSVSILPDC+ +FNTA V AQ+S V++V NL S+ S D NG +
Sbjct: 436 TYTLPPWSVSILPDCRSAIFNTAKVGAQTS-VKLVGSNLPLTSNLLLSQQSIDHNGISHI 494
Query: 447 --KWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFL--KNGSR 502
W KE IW + F G +H+N TKD +DYLWY+T I V++ + +N +
Sbjct: 495 SKSWMTTKEPINIWINSSFTAEGIWEHLNVTKDQSDYLWYSTRIYVSDGDILFWKENAAH 554
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
P L I+S L F N +L G+ G+ K + + G N++ LL+ TVGLQN
Sbjct: 555 PKLAIDSVRDILRVFVNGQLIGNVVGHWV----KAVQTLQFQPGYNDLTLLTQTVGLQNY 610
Query: 563 GPFYEWVGAGIT-SVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTM 621
G F E GAGI ++KITGF +G +DLS WTY++GLQGE L YN N WV
Sbjct: 611 GAFIEKDGAGIRGTIKITGFENGHIDLSKPLWTYQVGLQGEFLKFYNE-ESENAGWVELT 669
Query: 622 EPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDEC 681
TWYK P G++P+ LD+ MGKG AW+NG IGRYW R S K+
Sbjct: 670 PDAIPSTFTWYKTYFDVPGGNDPVALDLESMGKGQAWVNGHHIGRYWTRVSPKTG----- 724
Query: 682 VQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRK 741
Q CDYRG ++ DKC T CG+P+Q YH+PRSW K S N LVI EE GG+P I+ +
Sbjct: 725 CQVCDYRGAYDSDKCTTNCGKPTQTLYHVPRSWLKASNNFLVILEETGGNPLGISVKLHS 784
Query: 742 IS 743
S
Sbjct: 785 AS 786
>gi|413926109|gb|AFW66041.1| hypothetical protein ZEAMMB73_706783 [Zea mays]
Length = 785
Score = 735 bits (1897), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/763 (47%), Positives = 472/763 (61%), Gaps = 69/763 (9%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD RSL+INGRR ++IS +IHYPRS P MWPGL+Q+AK+GG++ +++YVFWNGHE +
Sbjct: 40 VSYDHRSLVINGRRRILISGSIHYPRSAPEMWPGLIQKAKDGGLDVVQTYVFWNGHEPAQ 99
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G+YYF R++LV+F+K+++QA +Y+ LR+GP+V AE+N+GG PVWL Y+PG FR D P
Sbjct: 100 GQYYFADRYDLVRFVKLVRQAGLYVHLRVGPYVCAEWNFGGFPVWLKYVPGIRFRTDNGP 159
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK MQKF+ IV MMK E LF QGGPII+AQVENE+G ES G GGK YA WAA+MA
Sbjct: 160 FKAAMQKFVEKIVSMMKSEGLFEWQGGPIIMAQVENEFGPMESVVGSGGKPYAHWAAQMA 219
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
V N GVPW+MC+Q D PDPVINTCN FYCD FTP++ P +WTE W GWF FGG P
Sbjct: 220 VGTNAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNNKHKPTMWTEAWTGWFTKFGGAAP 279
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY----- 322
HRP ED+AF+VARF QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+APIDE+
Sbjct: 280 HRPVEDLAFAVARFVQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGMQWL 339
Query: 323 --------------------------------------------GLPRNPKWGHLKELHG 338
GL R PKWGHL+ +H
Sbjct: 340 LPSLINLNSHRLPRDICRKSSQCGFYLSVVHTWNFWGGGWVYIAGLLRQPKWGHLRNMHR 399
Query: 339 AIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLP 398
AIK E AL++G+ + S+G+ ++A V+ +GACAAFL+N K+ + F Y LP
Sbjct: 400 AIKQAEPALVSGDPTIRSIGNYEKAYVFKSKNGACAAFLSNYHVKSAVRIRFDGRHYDLP 459
Query: 399 AWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIW 458
AWS+SILPDCK VFNTA V+ + +M P + WQ + E
Sbjct: 460 AWSISILPDCKTAVFNTATVKEPTLLPKMSPVMHR------------FAWQSYSEDTNSL 507
Query: 459 GEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFA 518
++ F + G ++ ++ T D +DYLWYTT + + NE FLK+G P L + S GH++ F
Sbjct: 508 DDSAFARDGLIEQLSLTWDKSDYLWYTTHVNIGSNERFLKSGQWPQLSVYSAGHSMQVFV 567
Query: 519 NQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VK 577
N GS G +P + + + G N+I++LS VGL N G +E G+ V
Sbjct: 568 NGRSYGSVYGGYDNPKLTFSGYVKMWQGSNKISILSSAVGLPNNGDHFELWNVGVLGPVT 627
Query: 578 ITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVK 637
++G N G DLS W Y++GL+GE LG++ + + W QPLTW+KA+
Sbjct: 628 LSGLNEGKRDLSHQRWIYQVGLKGESLGLHTVTGSSAVEWAG--PGGGTQPLTWHKALFN 685
Query: 638 QPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCI 697
P G +P+ LDM MGKG W+NG GRYW ++ H C Y G + D+C
Sbjct: 686 APAGSDPVALDMGSMGKGQVWVNGRHAGRYWSYRA-----HSRGCGRCSYAGTYREDQCT 740
Query: 698 TGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
+ CG+ SQRWYH+PRSW KPS N+LV+ EE GGD ++ + R
Sbjct: 741 SNCGDLSQRWYHVPRSWLKPSGNLLVVLEEYGGDLAGVSLATR 783
>gi|293332101|ref|NP_001168664.1| uncharacterized protein LOC100382452 [Zea mays]
gi|223950023|gb|ACN29095.1| unknown [Zea mays]
Length = 815
Score = 734 bits (1895), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/717 (49%), Positives = 480/717 (66%), Gaps = 27/717 (3%)
Query: 36 IINGRRELIISAAIHYPR-SVP---GMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
+++ + ++S A +P +VP MW GL+Q+AK+GG++ I++YVFWNGHE +PG YY
Sbjct: 3 VVSCVLDAMLSKANCFPTLAVPLYSTMWEGLIQKAKDGGLDVIQTYVFWNGHEPTPGNYY 62
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F R++LV+F+K +Q+A +++ LRIGP++ E+N+GG PVWL Y+PG FR D EPFK
Sbjct: 63 FEERYDLVRFVKTVQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEPFKTA 122
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
MQ F IV MMK E LFASQGGPIIL+Q+ENEYG +G G+ Y WAAKMAV +
Sbjct: 123 MQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGQAYINWAAKMAVGLD 182
Query: 212 IGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPS 271
GVPW+MC++ D PDPVIN CN FYCD F+P+ P P +WTE W GWF FGG RP
Sbjct: 183 TGVPWVMCKEEDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQRPV 242
Query: 272 EDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWG 331
ED+AF+VARF QKGGS NYYMYHGGTNFGRTAGGPFITTSYDY+APIDEYGL R PK
Sbjct: 243 EDLAFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIREPKHS 302
Query: 332 HLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFR 391
HLKELH A+KLCE AL++ + + +LG+ QEA V+ SG CAAFLAN + + VVF
Sbjct: 303 HLKELHRAVKLCEQALVSVDPTITTLGTMQEAHVFRSPSG-CAAFLANYNSNSHAKVVFN 361
Query: 392 NVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVF 451
N Y LP WS+SILPDCK VVFN+A V Q+S ++M +G+ + W+ +
Sbjct: 362 NEQYSLPPWSISILPDCKNVVFNSATVGVQTSQMQMW-----------GDGATSMMWERY 410
Query: 452 -KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR-PVLLIES 509
+E+ + +G ++ +N T+D++DYLWY TS+ ++ +E FL+ G + P L ++S
Sbjct: 411 DEEVDSLAAAPLLTTTGLLEQLNVTRDSSDYLWYITSVDISPSENFLQGGGKPPSLSVQS 470
Query: 510 KGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWV 569
GHALH F N +LQGS+ G KY ++L+AG N+IALLS+ GL N G YE
Sbjct: 471 AGHALHVFVNGQLQGSSYGTREDRRIKYNGNVNLRAGTNKIALLSVACGLPNVGVHYETW 530
Query: 570 GAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVS-TMEPPKNQ 627
G+ V + G N G+ DL+ +W+Y++GL+GE + + + ++ W+ ++ K Q
Sbjct: 531 NTGVGGPVVLHGLNEGSRDLTWQTWSYQVGLKGEQMNLNSVEGSGSVEWMQGSLIAQKQQ 590
Query: 628 PLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDY 687
PL WYKA + P GDEP+ LDM MGKG W+NG+ IGRYW ++ D + C Y
Sbjct: 591 PLAWYKAYFETPSGDEPLALDMGSMGKGQVWINGQSIGRYW------TAYADGDCKGCSY 644
Query: 688 RGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE-KGGDPTKITFSIRKIS 743
G F KC GCG+P+QRWYH+PRSW +PS N+LV+ EE GGD +KI + R +S
Sbjct: 645 TGTFRAPKCQAGCGQPTQRWYHVPRSWLQPSRNLLVVLEELGGGDSSKIALAKRSVS 701
>gi|115488372|ref|NP_001066673.1| Os12g0429200 [Oryza sativa Japonica Group]
gi|122234131|sp|Q0INM3.1|BGL15_ORYSJ RecName: Full=Beta-galactosidase 15; Short=Lactase 15; Flags:
Precursor
gi|113649180|dbj|BAF29692.1| Os12g0429200 [Oryza sativa Japonica Group]
Length = 919
Score = 734 bits (1894), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/735 (49%), Positives = 473/735 (64%), Gaps = 33/735 (4%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NVTYD R+++I G+R +++SA +HYPR+ P MWP L+ + KEGG + IE+YVFWNGHE +
Sbjct: 63 NVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGADVIETYVFWNGHEPA 122
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+YYF RF+LVKF K++ +++ LRIGP+ AE+N+GG PVWL IPG FR D E
Sbjct: 123 KGQYYFEERFDLVKFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTDNE 182
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK MQ F+T IV +MK EKL++ QGGPIIL Q+ENEYG + YG+ GKRY WAA+M
Sbjct: 183 PFKAEMQTFVTKIVTLMKEEKLYSWQGGPIILQQIENEYGNIQGNYGQAGKRYMQWAAQM 242
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
A+ + G+PW+MC+Q D P+ +I+TCN+FYCD F P+S + P IWTE+W GW+ +GG
Sbjct: 243 AIGLDTGIPWVMCRQTDAPEEIIDTCNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGGAL 302
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
PHRP+ED AF+VARF+Q+GGS+ NYYMY GGTNF RTAGGP TSYDY+APIDEYG+ R
Sbjct: 303 PHRPAEDSAFAVARFYQRGGSLQNYYMYFGGTNFARTAGGPLQITSYDYDAPIDEYGILR 362
Query: 327 NPKWGHLKELHGAIKLCEHALL--NGERSNLSLGSSQEADVY-----------ADSSGAC 373
PKWGHLK+LH AIKLCE AL+ +G + LGS QEA VY A ++ C
Sbjct: 363 QPKWGHLKDLHTAIKLCEPALIAVDGSPQYIKLGSMQEAHVYSTGEVHTNGSMAGNAQIC 422
Query: 374 AAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSS--TVE----M 427
+AFLAN+D+ +V SY LP WSVSILPDC+ V FNTA + AQ+S TVE
Sbjct: 423 SAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCENVAFNTARIGAQTSVFTVESGSPS 482
Query: 428 VPENLQPSEASPDNGSKGLK--WQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYT 485
+PS S +G L W KE G WG +F G ++H+N TKD +DYLWYT
Sbjct: 483 RSSRHKPSILSLTSGGPYLSSTWWTSKETIGTWGGNNFAVQGILEHLNVTKDISDYLWYT 542
Query: 486 TSIIVNENEEFL--KNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISL 543
T + +++ + G P L I+ F N +L GS G+ K PI L
Sbjct: 543 TRVNISDADVAFWSSKGVLPSLTIDKIRDVARVFVNGKLAGSQVGHWV----SLKQPIQL 598
Query: 544 KAGKNEIALLSMTVGLQNAGPFYEWVGAGIT-SVKITGFNSGTLDLSTYSWTYKIGLQGE 602
G NE+ LLS VGLQN G F E GAG V +TG + G +DL+ WTY++GL+GE
Sbjct: 599 VEGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVTLTGLSDGDVDLTNSLWTYQVGLKGE 658
Query: 603 HLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGE 662
IY P + W S M+ QP TWYK + P G +P+ +D+ MGKG AW+NG
Sbjct: 659 FSMIYAPEKQGCAGW-SRMQKDSVQPFTWYKTMFSTPKGTDPVAIDLGSMGKGQAWVNGH 717
Query: 663 EIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENIL 722
IGRYW +P C C Y G +N KC + CG P+Q WYHIPR W K S+N+L
Sbjct: 718 LIGRYWSL----VAPESGCSSSCYYPGAYNERKCQSNCGMPTQNWYHIPREWLKESDNLL 773
Query: 723 VIFEEKGGDPTKITF 737
V+FEE GGDP+ I+
Sbjct: 774 VLFEETGGDPSLISL 788
>gi|114217393|dbj|BAF31232.1| beta-D-galactosidase [Persea americana]
Length = 889
Score = 733 bits (1891), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/762 (48%), Positives = 482/762 (63%), Gaps = 39/762 (5%)
Query: 12 LLIFFSSSITYCFAG----NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAK 67
LL+ + I C NV+YD R+LII+G+R ++IS+ IHYPR+ P MWP L+ ++K
Sbjct: 11 LLVVMTLQIAACTEFFKPFNVSYDHRALIIDGKRRMLISSGIHYPRATPEMWPDLIAKSK 70
Query: 68 EGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYG 127
EGG + I++Y FWNGHE G+Y F GR+++VKFIK+ A +Y LRIGP+V AE+N+G
Sbjct: 71 EGGADLIQTYAFWNGHEPIRGQYNFEGRYDIVKFIKLAGSAGLYFHLRIGPYVCAEWNFG 130
Query: 128 GIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGY 187
G PVWL IPG FR D P+K MQ+F+ IVD+M++E LF+ QGGPIIL Q+ENEYG
Sbjct: 131 GFPVWLRDIPGIEFRTDNAPYKDEMQRFVKKIVDLMRQEMLFSWQGGPIILLQIENEYGN 190
Query: 188 YESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSM 247
E YG+ GK Y WAA MA+ GVPW+MC+Q D P+ +I+ CN+FYCD F P+S
Sbjct: 191 IERLYGQRGKDYVKWAADMAIGLGAGVPWVMCRQTDAPENIIDACNAFYCDGFKPNSYRK 250
Query: 248 PKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGP 307
P +WTE+W GW+ ++GGR PHRP ED AF+VARFFQ+GGS HNYYM+ GGTNFGRT+GGP
Sbjct: 251 PALWTEDWNGWYTSWGGRVPHRPVEDNAFAVARFFQRGGSYHNYYMFFGGTNFGRTSGGP 310
Query: 308 FITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERS--NLSLGSSQEADV 365
F TSYDY+APIDEYGL PKWGHLK+LH AIKLCE AL+ + + + LG QEA V
Sbjct: 311 FYVTSYDYDAPIDEYGLLSQPKWGHLKDLHSAIKLCEPALVAVDDAPQYIRLGPMQEAHV 370
Query: 366 YADSS-------------GACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVV 412
Y SS C+AFLAN+D+ N V F Y LP WSVSILPDCK V
Sbjct: 371 YRHSSYVEDQSSSTLGNGTLCSAFLANIDEHNSANVKFLGQVYSLPPWSVSILPDCKNVA 430
Query: 413 FNTANVRAQSS--TVE----MVPENLQPSEASPDNGSKGLK--WQVFKEIAGIWGEADFV 464
FNTA V +Q S TVE + +P +G + W + KE G WG +F
Sbjct: 431 FNTAKVASQISVKTVEFSSPFIENTTEPGYLLLHDGVHHISTNWMILKEPIGEWGGNNFT 490
Query: 465 KSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR--PVLLIESKGHALHAFANQEL 522
G ++H+N TKDT+DYLWY + +++ + S P L+I+S + F N +L
Sbjct: 491 AEGILEHLNVTKDTSDYLWYIMRLHISDEDISFWEASEVSPKLIIDSMRDVVRIFVNGQL 550
Query: 523 QGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGIT-SVKITGF 581
GS G + + P+ L G NE+A+LS TVGLQN G F E GAG +K+TG
Sbjct: 551 AGSHVGRWV----RVEQPVDLVQGYNELAILSETVGLQNYGAFLEKDGAGFKGQIKLTGL 606
Query: 582 NSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPG 641
SG DL+ W Y++GL+GE + I++ + +WV TWYK P G
Sbjct: 607 KSGEYDLTNSLWVYQVGLRGEFMKIFSLEEHESADWVDLPNDSVPSAFTWYKTFFDAPQG 666
Query: 642 DEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCG 701
+P+ L + MGKG AW+NG IGRYW +P D C Q CDYRG ++ KC T CG
Sbjct: 667 KDPVSLYLGSMGKGQAWVNGHSIGRYWSL----VAPVDGC-QSCDYRGAYHESKCATNCG 721
Query: 702 EPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
+P+Q WYHIPRSW +PS+N+LVIFEE GG+P +I+ + S
Sbjct: 722 KPTQSWYHIPRSWLQPSKNLLVIFEETGGNPLEISVKLHSTS 763
>gi|414878434|tpg|DAA55565.1| TPA: hypothetical protein ZEAMMB73_938277 [Zea mays]
Length = 918
Score = 732 bits (1889), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/739 (48%), Positives = 476/739 (64%), Gaps = 35/739 (4%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NVTYD R+LI+ G+R +++SA +HYPR+ P MWP L+ + KEGGV+ IE+YVFWNGHE +
Sbjct: 62 NVTYDHRALILGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGVDAIETYVFWNGHEPA 121
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+YYF GRF++V+F K++ +++ LRIGP+ AE+N+GG PVWL +PG FR D E
Sbjct: 122 KGQYYFEGRFDIVRFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDVPGIEFRTDNE 181
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
P+K MQ F+T IVD+MK EKL++ QGGPIIL Q+ENEYG + YG+ GKRY LWAA+M
Sbjct: 182 PYKAEMQIFVTKIVDIMKEEKLYSWQGGPIILQQIENEYGNIQGHYGQAGKRYMLWAAQM 241
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
A+A + GVPW+MC+Q D P+ ++NTCN+FYCD F P+S + P IWTE+W GW+ +G
Sbjct: 242 ALALDTGVPWVMCRQTDAPEQILNTCNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGESL 301
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
PHRP++D AF+VARF+Q+GGS+ NYYMY GGTNF RTAGGP TSYDY+APIDEYG+ R
Sbjct: 302 PHRPAQDSAFAVARFYQRGGSLQNYYMYFGGTNFERTAGGPLQITSYDYDAPIDEYGILR 361
Query: 327 NPKWGHLKELHGAIKLCEHAL--LNGERSNLSLGSSQEADVYAD-----------SSGAC 373
PKWGHLK+LH AIKLCE AL ++G + LG QEA VY+ +S C
Sbjct: 362 QPKWGHLKDLHAAIKLCESALTAVDGSPHYVKLGPMQEAHVYSSENVHTNGSISGNSQFC 421
Query: 374 AAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQ 433
+AFLAN+D+ +V SY LP WSVSILPDC+ V FNTA V Q+S + E+
Sbjct: 422 SAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCETVAFNTARVGTQTSFFNV--ESGS 479
Query: 434 PSEASPDNGS---------KGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWY 484
PS +S W FKE GIWGE F G ++H+N TKD +DYL Y
Sbjct: 480 PSYSSRHKPRILSLIGVPYLSTTWWTFKEPVGIWGEGIFTAQGILEHLNVTKDISDYLSY 539
Query: 485 TTSIIVNENEEFLKN--GSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPIS 542
TT + ++E + N G P L I+ F N +L GS G+ P+
Sbjct: 540 TTRVNISEEDVLYWNSKGFLPSLTIDQIRDVARVFVNGKLAGSKVGHWV----SLNQPLQ 595
Query: 543 LKAGKNEIALLSMTVGLQNAGPFYEWVGAGIT-SVKITGFNSGTLDLSTYSWTYKIGLQG 601
L G NE+ LLS VGLQN G F E GAG VK+TG ++G +DL+ WTY+IGL+G
Sbjct: 596 LVQGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVKLTGLSNGDIDLTNSLWTYQIGLKG 655
Query: 602 EHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNG 661
E IY+P Y+ + W S P TW+K + P G+ P+ +D+ MGKG AW+NG
Sbjct: 656 EFSRIYSPEYQGSAEWSSMQNDDTVSPFTWFKTMFDAPEGNGPVTIDLGSMGKGQAWVNG 715
Query: 662 EEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENI 721
IGRYW +P C C+Y G ++ KC + CG +Q WYHIPR W + S N+
Sbjct: 716 HLIGRYW----SLVAPESGCPSSCNYAGTYSDSKCRSNCGIATQSWYHIPREWLQESGNL 771
Query: 722 LVIFEEKGGDPTKITFSIR 740
LV+FEE GGDP++I+ +
Sbjct: 772 LVLFEETGGDPSQISLEVH 790
>gi|326534200|dbj|BAJ89450.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 763
Score = 730 bits (1884), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/659 (54%), Positives = 458/659 (69%), Gaps = 15/659 (2%)
Query: 89 KYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPF 148
+Y F GR +LV+F+K A +Y+ LRIGP+V AE+NYGG P+WLH+IPG R D EPF
Sbjct: 1 QYDFEGRNDLVRFVKAAADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKLRTDNEPF 60
Query: 149 KYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAV 208
K MQ+F +V MK L+ASQGGPIIL+Q+ENEYG + YG GK Y WAA MAV
Sbjct: 61 KTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIAASYGAAGKSYIRWAAGMAV 120
Query: 209 AQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPH 268
A + GVPW+MCQQ D P+P+INTCN FYCDQFTP PS PK+WTENW GWF +FGG P+
Sbjct: 121 ALDTGVPWVMCQQTDAPEPLINTCNGFYCDQFTPSLPSRPKLWTENWSGWFLSFGGAVPY 180
Query: 269 RPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNP 328
RP+ED+AF+VARF+Q+GG++ NYYMYHGGTNFGR++GGPFI+TSYDY+APIDEYGL R P
Sbjct: 181 RPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDEYGLVRQP 240
Query: 329 KWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTV 388
KWGHL+++H AIK+CE AL+ + S +SLG + EA VY S CAAFLAN+DD++DKTV
Sbjct: 241 KWGHLRDVHKAIKMCEPALIATDPSYMSLGQNAEAHVY-KSGSLCAAFLANIDDQSDKTV 299
Query: 389 VFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGS----- 443
F +Y LPAWSVSILPDCK VV NTA + +Q ++ +M NL S + D S
Sbjct: 300 TFNGKAYKLPAWSVSILPDCKNVVLNTAQINSQVASTQM--RNLGFSTQASDGSSVEAEL 357
Query: 444 KGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
W E GI E K G ++ INTT D +D+LWY+TSI+V E +L NGS+
Sbjct: 358 AASSWSYAVEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVAGGEPYL-NGSQS 416
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
L + S GH L F N +L GS+ G+ + P++L GKN+I LLS TVGL N G
Sbjct: 417 NLPVNSLGHVLQVFINGKLAGSSKGSASSSLISLTTPVTLVTGKNKIDLLSATVGLTNYG 476
Query: 564 PFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
F++ VGAGIT VK+TG GTLDLS+ WTY+IGL+GE L +YNP + WVS
Sbjct: 477 AFFDLVGAGITGPVKLTG-PKGTLDLSSAEWTYQIGLRGEDLHLYNPS-EASPEWVSDNS 534
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
P N PLTWYK+ P GD+P+ +D MGKG AW+NG+ IGRYWP +P +CV
Sbjct: 535 YPTNNPLTWYKSKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWP---TNIAPQSDCV 591
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRK 741
C+YRG ++ KC+ CG+PSQ YH+PRS+ +P N +V+FE+ GG+P+KI+F+ ++
Sbjct: 592 NSCNYRGSYSATKCLKKCGQPSQILYHVPRSFLQPGSNDIVLFEQFGGNPSKISFTTKQ 650
>gi|332105893|gb|AEE01408.1| beta-galactosidase STBG2 [Solanum lycopersicum]
Length = 892
Score = 729 bits (1882), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/737 (48%), Positives = 470/737 (63%), Gaps = 31/737 (4%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NVTYD+R+LII G+R ++ISA IHYPR+ P MWP L+ ++KEGG + IE+Y FWNGHE +
Sbjct: 36 NVTYDNRALIIGGKRRMLISAGIHYPRATPEMWPTLIARSKEGGADVIETYTFWNGHEPT 95
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+Y F GR+++VKF K++ +++ +RIGP+ AE+N+GG P+WL IPG FR D
Sbjct: 96 RGQYNFEGRYDIVKFAKLVGSHGLFLFIRIGPYACAEWNFGGFPIWLRDIPGIEFRTDNA 155
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK M++++ IVD+M E LF+ QGGPIIL Q+ENEYG ES +G GK Y WAA+M
Sbjct: 156 PFKEEMERYVKKIVDLMISESLFSWQGGPIILLQIENEYGNVESTFGPKGKLYMKWAAEM 215
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
AV GVPW+MC+Q D P+ +I+TCN++YCD FTP+S PKIWTENW GWF +G R
Sbjct: 216 AVGLGAGVPWVMCRQTDAPEYIIDTCNAYYCDGFTPNSEKKPKIWTENWNGWFADWGERL 275
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
P+RPSEDIAF++ARFFQ+GGS+ NYYMY GGTNFGRTAGGP TSYDY+AP+DEYGL R
Sbjct: 276 PYRPSEDIAFAIARFFQRGGSLQNYYMYFGGTNFGRTAGGPTQITSYDYDAPLDEYGLLR 335
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSN-LSLGSSQEADVYADSS-----------GACA 374
PKWGHLK+LH AIKLCE AL+ + + LG QEA VY +S G CA
Sbjct: 336 QPKWGHLKDLHAAIKLCEPALVAADSPQYIKLGPKQEAHVYRGTSNNIGQYMSLNEGICA 395
Query: 375 AFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQP 434
AF+AN+D+ TV F + LP WSVSILPDC+ FNTA V AQ+S + +++
Sbjct: 396 AFIANIDEHESATVKFYGQEFTLPPWSVSILPDCRNTAFNTAKVGAQTSIKTVGSDSVSV 455
Query: 435 SEAS--------PDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTT 486
S S W KE G+WG+ +F G ++H+N TKD +DYLWY T
Sbjct: 456 GNNSLFLQVITKSKLESFSQSWMTLKEPLGVWGDKNFTSKGILEHLNVTKDQSDYLWYLT 515
Query: 487 SIIVNENEEFL--KNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLK 544
I +++++ +N P + I+S + F N +L GS G K P+ L
Sbjct: 516 RIYISDDDISFWEENDVSPTIDIDSMRDFVRIFVNGQLAGSVKGKW----IKVVQPVKLV 571
Query: 545 AGKNEIALLSMTVGLQNAGPFYEWVGAGIT-SVKITGFNSGTLDLSTYSWTYKIGLQGEH 603
G N+I LLS TVGLQN G F E GAG +K+TG SG ++L+T WTY++GL+GE
Sbjct: 572 QGYNDILLLSETVGLQNYGAFLEKDGAGFKGQIKLTGCKSGDINLTTSLWTYQVGLRGEF 631
Query: 604 LGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEE 663
L +Y+ + W +WYK P G +P+ LD MGKG AW+NG
Sbjct: 632 LEVYDVNSTESAGWTEFPTGTTPSVFSWYKTKFDAPGGTDPVALDFSSMGKGQAWVNGHH 691
Query: 664 IGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILV 723
+GRYW +P++ C + CDYRG ++ DKC T CGE +Q WYHIPRSW K N+LV
Sbjct: 692 VGRYWTL----VAPNNGCGRTCDYRGAYHSDKCRTNCGEITQAWYHIPRSWLKTLNNVLV 747
Query: 724 IFEEKGGDPTKITFSIR 740
IFEE P I+ S R
Sbjct: 748 IFEEIDKTPFDISISTR 764
>gi|357437609|ref|XP_003589080.1| Beta-galactosidase [Medicago truncatula]
gi|355478128|gb|AES59331.1| Beta-galactosidase [Medicago truncatula]
Length = 718
Score = 723 bits (1865), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/719 (49%), Positives = 455/719 (63%), Gaps = 26/719 (3%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
+V+YD ++L+I+G+R ++IS +IHYPRS P MWP L Q+AK+GG++ I++YVFWNGHE
Sbjct: 22 TASVSYDHKALVIDGQRRILISGSIHYPRSTPEMWPDLFQKAKDGGLDVIQTYVFWNGHE 81
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
SPG Y R + VK K+ QQA + + LR+ P + G PVWL Y+PG FR D
Sbjct: 82 PSPGNYTLKDRLDWVKLSKLAQQAVLNVHLRMVP------TFVGFPVWLKYVPGMAFRTD 135
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK MQKF T IV MMK E LF +QGGPII++Q+ENEYG E G GK Y WAA
Sbjct: 136 NEPFKAAMQKFTTKIVTMMKAESLFQTQGGPIIMSQIENEYGPVEWEIGAPGKAYTKWAA 195
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
+MAV + GVPW MC+Q D PDPVI+TCN +YC+ FTP+ PK+WTENW GW+ FGG
Sbjct: 196 QMAVGLDTGVPWDMCKQEDAPDPVIDTCNGYYCENFTPNENFKPKMWTENWSGWYTDFGG 255
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
HRP+ED+A+SVA F Q GS NYYMYHGGTNFGRT+ G FI TSYDY+APIDEYGL
Sbjct: 256 AISHRPTEDLAYSVATFIQNRGSFVNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYGL 315
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQ-EADVYADSSGACAAFLANMDDK 383
P PKW HLK LH AIK CE AL++ + + LG+ EA VY ++ CAAFLAN D K
Sbjct: 316 PNEPKWSHLKNLHKAIKQCEPALISVDPTVTWLGNKNLEAHVYYVNTSICAAFLANYDTK 375
Query: 384 NDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGS 443
+ TV F N Y LP WSVSILPDCK VVFNTA V S M P
Sbjct: 376 SAATVTFGNGQYDLPPWSVSILPDCKTVVFNTATVNGHSFHKRMTPVETT---------- 425
Query: 444 KGLKWQVFKEIAGIWGEAD-FVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
WQ + E + D + + + IN T+D++DYLWY T + ++ +E F+KNG
Sbjct: 426 --FDWQSYSEEPAYSSDDDSIIANALWEQINVTRDSSDYLWYLTDVNISPSESFIKNGQF 483
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
P L I S GH LH F N +L G+ G +P + ++LK G N+I+LLS+ VGL N
Sbjct: 484 PTLTINSAGHVLHVFVNGQLSGTVYGGLDNPKVTFSESVNLKVGNNKISLLSVAVGLPNV 543
Query: 563 GPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTM 621
G +E G+ V++ G + GT DLS W+YK+GL+GE L ++ ++I+W
Sbjct: 544 GLHFETWNVGVLGPVRLKGLDEGTRDLSWQKWSYKVGLKGESLSLHTITGSSSIDWTQGS 603
Query: 622 EPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDEC 681
K QPLTWYK P G++P+ LDM MGKG W+N + IGR+WP H C
Sbjct: 604 SLAKKQPLTWYKTTFDAPSGNDPVALDMSSMGKGEIWINDQSIGRHWP----AYIAHGNC 659
Query: 682 VQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
EC+Y G F KC T CGEP+Q+WYHIPRSW S N+LV+ EE GGDPT I+ R
Sbjct: 660 -DECNYAGTFTNPKCRTNCGEPTQKWYHIPRSWLSSSGNVLVVLEEWGGDPTGISLVKR 717
>gi|242084926|ref|XP_002442888.1| hypothetical protein SORBIDRAFT_08g004410 [Sorghum bicolor]
gi|241943581|gb|EES16726.1| hypothetical protein SORBIDRAFT_08g004410 [Sorghum bicolor]
Length = 923
Score = 722 bits (1864), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/737 (48%), Positives = 475/737 (64%), Gaps = 34/737 (4%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NVTYD R+LI+ G+R +++SA +HYPR+ P MWP L+ +AKEGGV+ IE+Y+FWNGHE +
Sbjct: 68 NVTYDHRALILGGKRRMLVSAGLHYPRATPEMWPSLIAKAKEGGVDVIETYIFWNGHEPA 127
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+YYF GRF++V+F K++ +++ LRIGP+ AE+N+GG PVWL IPG FR D E
Sbjct: 128 KGQYYFEGRFDIVRFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTDNE 187
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
P+K MQ F+T IVD+MK EKL++ QGGPIIL Q+ENEYG + YG+ GKRY WAA+M
Sbjct: 188 PYKAEMQNFVTKIVDIMKEEKLYSWQGGPIILQQIENEYGNIQGKYGQAGKRYMQWAAQM 247
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
A+A + GVPW+MC+Q D P+ +++TCN+FYCD F P+S + P IWTE+W GW+ +G
Sbjct: 248 ALALDTGVPWVMCRQTDAPEQILDTCNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGEAL 307
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
PHRP++D AF+VARF+Q+GGS NYYMY GGTNF RTAGGP TSYDY+APIDEYG+ R
Sbjct: 308 PHRPAQDSAFAVARFYQRGGSFQNYYMYFGGTNFERTAGGPLQITSYDYDAPIDEYGILR 367
Query: 327 NPKWGHLKELHGAIKLCEHAL--LNGERSNLSLGSSQEADVYAD-----------SSGAC 373
PKWGHLK+LH AIKLCE AL ++G + LG QEA VY+ ++ C
Sbjct: 368 QPKWGHLKDLHAAIKLCEPALTAVDGSPRYIKLGPMQEAHVYSSENVHTNGSISGNAQFC 427
Query: 374 AAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQ 433
+AFLAN+D+ +V SY LP WSVSILPDC+ V FNTA V Q+S + E+
Sbjct: 428 SAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCETVAFNTARVGTQTSFFNV--ESGS 485
Query: 434 PSEAS---PDNGSKG-----LKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYT 485
PS +S P S G W KE GIW E F G ++H+N TKD +DYL YT
Sbjct: 486 PSYSSRHKPRILSLGGPYLSSTWWASKEPVGIWSEDIFAAQGILEHLNVTKDISDYLSYT 545
Query: 486 TSIIVNENEEFLKN--GSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISL 543
T + +++ + N G P L I+ + F N +L GS G+ P+ L
Sbjct: 546 TRVNISDEDVLYWNSEGLLPSLTIDQIRDVVRIFVNGKLAGSQVGHWV----SLNQPLQL 601
Query: 544 KAGKNEIALLSMTVGLQNAGPFYEWVGAGIT-SVKITGFNSGTLDLSTYSWTYKIGLQGE 602
G NE+ LLS VGLQN G F E GAG VK+TG ++G +DL+ WTY+IGL+GE
Sbjct: 602 VQGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVKLTGLSNGDIDLTNSLWTYQIGLKGE 661
Query: 603 HLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGE 662
IY+P + + W S P TW+K P G+ P+ +D+ MGKG AW+NG
Sbjct: 662 FSRIYSPEKQGSAGWSSMQNDDTLSPFTWFKTTFDAPEGNGPVAIDLGSMGKGQAWVNGH 721
Query: 663 EIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENIL 722
IGRYW +P C C+Y G + KC + CG +Q WYHIPR W + S+N+L
Sbjct: 722 LIGRYW----SLVAPESGCPSSCNYAGNYGDSKCRSNCGIATQSWYHIPREWLQESDNLL 777
Query: 723 VIFEEKGGDPTKITFSI 739
V+FEE GGDP++I+ +
Sbjct: 778 VLFEETGGDPSQISLEV 794
>gi|449529435|ref|XP_004171705.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 826
Score = 721 bits (1862), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/731 (48%), Positives = 469/731 (64%), Gaps = 22/731 (3%)
Query: 21 TYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFW 80
++C NV+YDS ++IING R +I S +IHYPRS MWP L+Q+AK+GG++ IE+Y+FW
Sbjct: 20 SFCIGNNVSYDSNAIIINGERRIIFSGSIHYPRSTEEMWPDLIQKAKDGGLDAIETYIFW 79
Query: 81 NGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTV 140
+ HE KY F G N +K+ ++IQ+A +Y+++RIGP+V AE+NYGG P+WLH +PG
Sbjct: 80 DRHEPHRRKYDFSGHLNFIKYFQLIQEAGLYVVMRIGPYVCAEWNYGGFPLWLHNMPGIQ 139
Query: 141 FRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYA 200
R + + +K MQ F T IV+M K+ LFASQGGPIILAQ+ENEYG + YGE GK Y
Sbjct: 140 LRTNNQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGEAGKTYI 199
Query: 201 LWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFK 260
W A+MA + NIG+PWIMCQQ D P P+INTCN FYCD FTP++P+ PK++TENW GWFK
Sbjct: 200 NWCAQMAESLNIGIPWIMCQQSDAPQPIINTCNGFYCDNFTPNNPNSPKMFTENWVGWFK 259
Query: 261 TFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPID 320
+G +DPHR +ED+AFSVARFFQ GG ++NYYMYHGGTNFGRT+GGPFITTSYDY+AP+D
Sbjct: 260 KWGDKDPHRTAEDVAFSVARFFQSGGILNNYYMYHGGTNFGRTSGGPFITTSYDYDAPLD 319
Query: 321 EYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD-SSGACAAFLAN 379
EYG PKWGHLK+LH +IKL E L N RS+ GSS +++ +G FL+N
Sbjct: 320 EYGNLNQPKWGHLKQLHASIKLGEKILTNSTRSDQDFGSSVTFTKFSNLETGEKFCFLSN 379
Query: 380 MDDKNDKTV-VFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEAS 438
D+ ND V + + Y LPAWSVSIL C K +FNTA V +Q+S +
Sbjct: 380 ADENNDAIVDMLGDRKYFLPAWSVSILDGCNKEIFNTAKVSSQTSL-------FFKKQNE 432
Query: 439 PDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLK 498
+N W + G F + ++ T D++DYLWY T++ N
Sbjct: 433 KENAKLSWNWASEPMRDTLQGYGTFKANLLLEQKGATIDSSDYLWYMTNVNSNTTSSL-- 490
Query: 499 NGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVG 558
L + +KGH LHAF N+ GS G+ F ++ PI LK G N I LLS TVG
Sbjct: 491 --QNLTLQVNTKGHVLHAFINRRYIGSQWGSNGQ-SFVFEKPIQLKLGTNTITLLSATVG 547
Query: 559 LQNAGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNIN 616
L+N FY+ V GI + + G + T DLS+ W+YK+GL GE +YNP + N
Sbjct: 548 LKNYDAFYDTVPTGIDGGPIYLIGDGNVTTDLSSNLWSYKVGLNGERKQLYNPMFSNRTK 607
Query: 617 WVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSS 676
W + + + +TW+KA K P G +P+ LDM MGKG AW+NG IGR+WP +
Sbjct: 608 WSTLNKKSIGRRMTWFKATFKTPSGTDPVVLDMQGMGKGQAWVNGRSIGRFWP---SFIA 664
Query: 677 PHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI- 735
+D C + CDY+G +NP+KC+ CG SQRWYHIPRS+ S N L++FEE GG+P +
Sbjct: 665 SNDSCSETCDYKGSYNPNKCVRNCGNSSQRWYHIPRSFMNDSINTLILFEEIGGNPQMVS 724
Query: 736 --TFSIRKISG 744
T +I I G
Sbjct: 725 VQTITIGTICG 735
>gi|168008096|ref|XP_001756743.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691981|gb|EDQ78340.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 836
Score = 721 bits (1861), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/720 (51%), Positives = 469/720 (65%), Gaps = 29/720 (4%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A V+YD R+L ++G+R +++S +IHYPRS P MWPGL+ +AKEGG++ I++YVFWNGHE
Sbjct: 25 AVTVSYDHRALKLDGQRRMLVSGSIHYPRSTPLMWPGLIAKAKEGGLDVIQTYVFWNGHE 84
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
+ G Y + GR+NL KFI+++ +A MY+ LRIGP+V AE+N GG P WL +IPG FR D
Sbjct: 85 PTRGVYNYAGRYNLPKFIRLVYEAGMYVNLRIGPYVCAEWNSGGFPAWLRFIPGIEFRTD 144
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK Q+F+ +V +KREKLFA QGGPII+AQ+ENEYG ++ YGE G+RY W A
Sbjct: 145 NEPFKNETQRFVNHLVRKLKREKLFAWQGGPIIMAQIENEYGNIDASYGEAGQRYLNWIA 204
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MAVA N VPWIMCQQ + P VINTCN FYCD + P+S P WTENW GWF+++GG
Sbjct: 205 NMAVATNTSVPWIMCQQPEAPQLVINTCNGFYCDGWRPNSEDKPAFWTENWTGWFQSWGG 264
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
P RP +DIAFSVARFF+KGGS NYYMYHGGTNF RT G +TTSYDY+APIDEY +
Sbjct: 265 GAPTRPVQDIAFSVARFFEKGGSFMNYYMYHGGTNFERT-GVESVTTSYDYDAPIDEYDV 323
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGER--SNLSLGSSQEADVYADSSGACAAFLANMDD 382
R PKWGHLK+LH A+KLCE AL+ + + +SLG +QEA VY SSG CAAFLA+ D
Sbjct: 324 -RQPKWGHLKDLHAALKLCEPALVEVDTVPTGISLGPNQEAHVYQSSSGTCAAFLASW-D 381
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
ND V F+ Y LPAWSVSILPDCK VVFNTA V AQS + M A P
Sbjct: 382 TNDSLVTFQGQPYDLPAWSVSILPDCKSVVFNTAKVGAQSVIMTM-------QGAVPVT- 433
Query: 443 SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKN-GS 501
W + E G WG F +G ++ I TTKDTTDYLWY T++ V E++ ++N +
Sbjct: 434 ----NWVSYHEPLGPWGSV-FSTNGLLEQIATTKDTTDYLWYMTNVQVAESD--VRNISA 486
Query: 502 RPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQN 561
+ L++ S A H F N G++ H + PISL+ G N I +LSMT+GLQ
Sbjct: 487 QATLVMSSLRDAAHTFVNGFYTGTSHQQFMHA----RQPISLRPGSNNITVLSMTMGLQG 542
Query: 562 AGPFYEWVGAGIT-SVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVST 620
GPF E AGI V+I SGT++L +WTY++GLQGE ++ W +
Sbjct: 543 YGPFLENEKAGIQYGVRIEDLPSGTIELGGSTWTYQVGLQGESKQLFEVNGSLTAEWNTI 602
Query: 621 MEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDE 680
E L W K P G+ I LD+ MGKG+ W+NG +GRYW S ++ D
Sbjct: 603 SEVSDQNFLFWIKTRFDMPAGNGSIALDLSSMGKGVVWVNGVNLGRYW---SSFTAQRDG 659
Query: 681 CVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
C CDYRG + KC+T C +PSQ WYHIPR W P N +V+FEEKGG+P I+ + R
Sbjct: 660 CDASCDYRGSYTQSKCLTKCNQPSQNWYHIPRQWLLPKNNFIVLFEEKGGNPKDISIATR 719
>gi|255550411|ref|XP_002516256.1| beta-galactosidase, putative [Ricinus communis]
gi|223544742|gb|EEF46258.1| beta-galactosidase, putative [Ricinus communis]
Length = 848
Score = 719 bits (1857), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/747 (48%), Positives = 479/747 (64%), Gaps = 34/747 (4%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
F L F S++I V++D R++ I+G+R ++IS +IHYPRS MWP L++++KEG
Sbjct: 36 FCLFTFVSATI-------VSHDGRAITIDGKRRVLISGSIHYPRSTAEMWPDLIKKSKEG 88
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ IE+YVFWN HE S +Y F G +LV+FIK IQ +Y +LRIGP+V AE+NYGG
Sbjct: 89 GLDAIETYVFWNSHEPSRRQYDFSGNLDLVRFIKTIQAEGLYAVLRIGPYVCAEWNYGGF 148
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
P+WLH +PG R F MQ F +LIVDMMK E LFASQGGPIILAQVENEYG
Sbjct: 149 PMWLHNLPGCELRTANSVFMNEMQNFTSLIVDMMKDENLFASQGGPIILAQVENEYGNVM 208
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
S YG GK Y W + MA + +IGVPWIMCQQ D P P+INTCN +YCDQFTP++ + PK
Sbjct: 209 SAYGAAGKTYIDWCSNMAESLDIGVPWIMCQQSDAPQPMINTCNGWYCDQFTPNNANSPK 268
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTENW GWFK++GG+DPHR +ED+AF+VARFFQ GG+ NYYMYHGGTNFGRTAGGP+I
Sbjct: 269 MWTENWTGWFKSWGGKDPHRTAEDVAFAVARFFQTGGTFQNYYMYHGGTNFGRTAGGPYI 328
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-D 368
TTSYDY+AP+DEYG PKWGHLK+LH + E+ L +G S + +S A +YA D
Sbjct: 329 TTSYDYDAPLDEYGNLNQPKWGHLKQLHDILHSMEYTLTHGNISTIDYDNSVTATIYATD 388
Query: 369 SSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMV 428
AC F N ++ +D T+VF+ Y++PAWSVSILPDC+ V +NTA V+ Q T MV
Sbjct: 389 KESAC--FFGNANETSDATIVFKGTEYNVPAWSVSILPDCENVGYNTAKVKTQ--TAIMV 444
Query: 429 PENLQPSEASPDNGSKGLKWQVFKE---IAGIWGEADFVKSGFVDHINTTKDTTDYLWYT 485
Q +EA S LKW E + G+ +D D +DYLWY
Sbjct: 445 K---QKNEAEDQPSS--LKWSWIPENTHTTSLLGKGHAHARQLIDQKAAANDASDYLWYM 499
Query: 486 TSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKA 545
TS+ + +++ S L + GH LHA+ N + GS + ++ + L+
Sbjct: 500 TSLHIKKDDPVWS--SDMSLRVNGSGHVLHAYVNGKHLGSQFAKYGVFSYVFEKSLKLRP 557
Query: 546 GKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSG---TLDLSTYSWTYKIGLQG 601
GKN I+LLS TVGLQN GP ++ V GI V+I G DLS++ W+Y +GL G
Sbjct: 558 GKNVISLLSATVGLQNYGPMFDLVQTGIPGPVEIIGHRGDEKVVKDLSSHKWSYSVGLNG 617
Query: 602 EHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNG 661
H +Y+ R+ WV + P N+ + WYK K P G +P+ LD+ MGKG AW+NG
Sbjct: 618 FHNELYSSNSRHASRWVE-QDLPTNKMMIWYKTTFKAPLGKDPVVLDLQGMGKGFAWVNG 676
Query: 662 EEIGRYWPRKSRKSSPHDECVQE-CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSEN 720
IGRYWP + D C E CDYRG ++ +KC+T CG+P+QRWYH+PRS+F EN
Sbjct: 677 NNIGRYWP---SFLAEEDGCSTEVCDYRGAYDNNKCVTNCGKPTQRWYHVPRSFFNDYEN 733
Query: 721 ILVIFEEKGGDPTKITF---SIRKISG 744
LV+FEE GG+P + F ++ K+SG
Sbjct: 734 TLVLFEEFGGNPAGVNFQTVTVGKVSG 760
>gi|68161828|emb|CAJ09953.1| beta-galactosidase [Mangifera indica]
Length = 827
Score = 719 bits (1855), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/742 (48%), Positives = 479/742 (64%), Gaps = 28/742 (3%)
Query: 8 APFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAK 67
A L+F + I+ A NV++D R++II+G+R +++S +IHYPRS P MWP L+++AK
Sbjct: 5 AHLLCLLFQAVFISLSCAYNVSHDGRAIIIDGQRRVLLSGSIHYPRSTPEMWPDLIRKAK 64
Query: 68 EGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYG 127
EGG++ IE+YVFWN HE + +Y F G +L++FIK IQ +Y +LRIGP+V AE+NYG
Sbjct: 65 EGGLDAIETYVFWNAHEPARRQYDFSGHLDLIRFIKTIQDEGLYAVLRIGPYVCAEWNYG 124
Query: 128 GIPVWLHYIPGTV-FRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG 186
G PVWLH +PG FR E F MQ F TLIVDM+K+EKLFASQGGPII+AQ+ENEYG
Sbjct: 125 GFPVWLHNMPGVQEFRTVNEVFMNEMQNFTTLIVDMVKQEKLFASQGGPIIIAQIENEYG 184
Query: 187 YYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPS 246
S YG+ GK Y W AKMA + +IGVPWIMCQ+ D P P+INTCN +YCD FTP+ P+
Sbjct: 185 NMISNYGDAGKVYIDWCAKMAESLDIGVPWIMCQESDAPQPMINTCNGWYCDSFTPNDPN 244
Query: 247 MPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
PK+WTENW GWFK++GG+DPHR +ED+AFSVARFFQ GG+ NYYMYHGGTNFGRT+GG
Sbjct: 245 SPKMWTENWTGWFKSWGGKDPHRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFGRTSGG 304
Query: 307 PFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVY 366
P++TTSYDY+AP+DE+G PKWGHLKELH +K E L +G S G+S A VY
Sbjct: 305 PYLTTSYDYDAPLDEFGNLNQPKWGHLKELHTVLKAMEKTLTHGNVSTTDFGNSVTATVY 364
Query: 367 ADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVE 426
A G+ + F N + D T+ F+ Y +PAWSVSILPDCK +NTA V Q+S +
Sbjct: 365 ATEEGS-SCFFGNANTTGDATITFQGSDYVVPAWSVSILPDCKTEAYNTAKVNTQTSVIV 423
Query: 427 MVPENLQPSEASPDNGSKGLKWQVFKEIAG---IWGEADFVKSGFVDHINTTKDTTDYLW 483
P +N LKW E + G+ F S +D D +DYLW
Sbjct: 424 KKPN-------QAENEPSSLKWVWRPEAIDEPVVQGKGSFSASFLIDQ-KVINDASDYLW 475
Query: 484 YTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFK--YKNPI 541
Y TS+ + ++ + L + + G LHAF N E GS + FK ++ +
Sbjct: 476 YMTSVDLKPDDIIWSDNM--TLRVNTTGIVLHAFVNGEHVGSQWTK--YGVFKDVFQQQV 531
Query: 542 SLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS----VKITGFNSGTLDLSTYSWTYKI 597
L GKN+I+LLS+TVGLQN GP ++ V AGIT + G + DLS + WTY++
Sbjct: 532 KLNPGKNQISLLSVTVGLQNYGPMFDMVQAGITGPVELIGQKGDETVIKDLSCHKWTYEV 591
Query: 598 GLQG-EHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGL 656
GL G E Y+ N S P N +TWYK K P G++P+ LD+ MGKG
Sbjct: 592 GLTGLEDNKFYSKASTNETCGWSAENVPSNSKMTWYKTTFKAPLGNDPVVLDLQGMGKGF 651
Query: 657 AWLNGEEIGRYWPRKSRKSSPHDECVQE-CDYRGKFNPDKCITGCGEPSQRWYHIPRSWF 715
AW+NG +GRYWP ++ D C + CDYRG+++ +KC+T CG+PSQRWYH+PRS+
Sbjct: 652 AWVNGYNLGRYWPSYLAEA---DGCSSDPCDYRGQYDNNKCVTNCGQPSQRWYHVPRSFL 708
Query: 716 KPSENILVIFEEKGGDPTKITF 737
+ EN LV+FEE GG+P ++ F
Sbjct: 709 QDGENTLVLFEEFGGNPWQVNF 730
>gi|84579373|dbj|BAE72075.1| pear beta-galactosidase3 [Pyrus communis]
Length = 894
Score = 716 bits (1849), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/764 (46%), Positives = 482/764 (63%), Gaps = 37/764 (4%)
Query: 4 RTPIAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLV 63
R A+ ++ Y NV+YD R+LII+G+R +++SA IHYPR+ P MWP L+
Sbjct: 12 RCLFLCLAVQFALEAAAEYFKPFNVSYDHRALIIDGKRRMLVSAGIHYPRATPEMWPDLI 71
Query: 64 QQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAE 123
++KEGGV+ I++Y FW+GHE G+Y F GR+++VKF ++ + +Y+ LRIGP+V AE
Sbjct: 72 AKSKEGGVDVIQTYAFWSGHEPVRGQYNFEGRYDIVKFANLVGASGLYLHLRIGPYVCAE 131
Query: 124 YNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVEN 183
+N+GG PVWL IPG FR + FK MQ+F+ +VD+M+ E+L + QGGPII+ Q+EN
Sbjct: 132 WNFGGFPVWLRDIPGIEFRTNNALFKEEMQRFVKKMVDLMQEEELLSWQGGPIIMLQIEN 191
Query: 184 EYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPH 243
EYG E +G+ GK Y WAA+MA+ GVPW+MC+Q D P +I+ CN +YCD + P+
Sbjct: 192 EYGNIEGQFGQKGKEYIKWAAEMALGLGAGVPWVMCKQVDAPGSIIDACNGYYCDGYKPN 251
Query: 244 SPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRT 303
S + P +WTE+W GW+ ++GGR PHRP ED+AF+VARF+Q+GGS NYYMY GGTNFGRT
Sbjct: 252 SYNKPTMWTEDWDGWYASWGGRLPHRPVEDLAFAVARFYQRGGSFQNYYMYFGGTNFGRT 311
Query: 304 AGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSN-LSLGSSQE 362
+GGPF TSYDY+APIDEYGL PKWGHLK+LH AIKLCE AL+ + N + LG QE
Sbjct: 312 SGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSPNYIKLGPKQE 371
Query: 363 ADVYADSSG-------------ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCK 409
A VY +S +C+AFLAN+D+ +V F Y+LP WSVSILPDC+
Sbjct: 372 AHVYRMNSHTEGLNITSYGSQISCSAFLANIDEHKAASVTFLGQKYNLPPWSVSILPDCR 431
Query: 410 KVVFNTANVRAQSS--TVEM-VP-----ENLQPSEASPDNGSKGLKWQVFKEIAGIWGEA 461
VV+NTA V AQ+S TVE +P + Q D+ W KE G+W E
Sbjct: 432 NVVYNTAKVGAQTSIKTVEFDLPLYSGISSQQQFITKNDDLFITKSWMTVKEPVGVWSEN 491
Query: 462 DFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFL--KNGSRPVLLIESKGHALHAFAN 519
+F G ++H+N TKD +DYLW+ T I V+E++ KN + I+S L F N
Sbjct: 492 NFTVQGILEHLNVTKDQSDYLWHITRIFVSEDDISFWEKNNISAAVSIDSMRDVLRVFVN 551
Query: 520 QELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGIT-SVKI 578
+L GS G+ K + P+ G N++ LL+ TVGLQN G F E GAG +K+
Sbjct: 552 GQLTGSVIGHWV----KVEQPVKFLKGYNDLVLLTQTVGLQNYGAFLEKDGAGFRGQIKL 607
Query: 579 TGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLT--WYKAVV 636
TGF +G +D S WTY++GL+GE L IY +W P + P T WYK
Sbjct: 608 TGFKNGDIDFSKLLWTYQVGLKGEFLKIYTIEENEKASWAEL--SPDDDPSTFIWYKTYF 665
Query: 637 KQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKC 696
P G +P+ LD+ MGKG AW+NG IGRYW +P D C + CDYRG ++ DKC
Sbjct: 666 DSPAGTDPVALDLGSMGKGQAWVNGHHIGRYWTL----VAPEDGCPEICDYRGAYDSDKC 721
Query: 697 ITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
CG+P+Q YH+PRSW + S N+LVI EE GG+P I+ +R
Sbjct: 722 SFNCGKPTQTLYHVPRSWLQSSSNLLVILEETGGNPFDISIKLR 765
>gi|61162194|dbj|BAD91079.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 903
Score = 715 bits (1845), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/765 (46%), Positives = 482/765 (63%), Gaps = 38/765 (4%)
Query: 4 RTPIAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLV 63
R A+ ++ Y NV+YD R+LII+G+R +++SA IHYPR+ P MWP L+
Sbjct: 12 RCLFLCLAVQFALEAAAEYFKPFNVSYDHRALIIDGKRRMLVSAGIHYPRATPEMWPDLI 71
Query: 64 QQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAE 123
++KEGGV+ I++Y FW+GHE G+Y F GR+++VKF ++ + +Y+ LRIGP+V AE
Sbjct: 72 AKSKEGGVDVIQTYAFWSGHEPVRGQYNFEGRYDIVKFANLVGASGLYLHLRIGPYVCAE 131
Query: 124 YNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVEN 183
+N+GG PVWL IPG FR + FK MQ+F+ +VD+M+ E+L + QGGPII+ Q+EN
Sbjct: 132 WNFGGFPVWLRDIPGIEFRTNNALFKEEMQRFVKKMVDLMQEEELLSWQGGPIIMMQIEN 191
Query: 184 EYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPH 243
EYG E +G+ GK Y WAA+MA+ GVPW+MC+Q D P +I+ CN +YCD + P+
Sbjct: 192 EYGNIEGQFGQKGKEYIKWAAEMALGLGAGVPWVMCKQVDAPGSIIDACNGYYCDGYKPN 251
Query: 244 SPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRT 303
S + P +WTE+W GW+ ++GGR PHRP ED+AF+VARF+Q+GGS NYYMY GGTNFGRT
Sbjct: 252 SYNKPTLWTEDWDGWYASWGGRLPHRPVEDLAFAVARFYQRGGSFQNYYMYFGGTNFGRT 311
Query: 304 AGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSN-LSLGSSQE 362
+GGPF TSYDY+APIDEYGL PKWGHLK+LH AIKLCE AL+ + N + LG QE
Sbjct: 312 SGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSPNYIKLGPKQE 371
Query: 363 ADVYADSSG-------------ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCK 409
A VY +S +C+AFLAN+D+ +V F Y+LP WSVSILPDC+
Sbjct: 372 AHVYRVNSHTEGLNITSYGSQISCSAFLANIDEHKAASVTFLGQKYNLPPWSVSILPDCR 431
Query: 410 KVVFNTANVRAQSS--TVEM-VP-----ENLQPSEASPDNGSKGLKWQVFKEIAGIWGEA 461
VV+NTA V AQ+S TVE +P + Q D+ W KE G+W E
Sbjct: 432 NVVYNTAKVGAQTSIKTVEFDLPLYSGISSQQQFITKNDDLFITKSWMTVKEPVGVWSEN 491
Query: 462 DFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFL--KNGSRPVLLIESKGHALHAFAN 519
+F G ++H+N TKD +DYLW+ T I V+E++ KN + I+S L F N
Sbjct: 492 NFTVQGILEHLNVTKDQSDYLWHITRIFVSEDDISFWEKNNISAAVSIDSMRDVLRVFVN 551
Query: 520 QEL-QGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGIT-SVK 577
+L +GS G+ K + P+ G N++ LL+ TVGLQN G F E GAG +K
Sbjct: 552 GQLTEGSVIGHWV----KVEQPVKFLKGYNDLVLLTQTVGLQNYGAFLEKDGAGFRGQIK 607
Query: 578 ITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLT--WYKAV 635
+TGF +G +DLS WTY++GL+GE IY W P + P T WYK
Sbjct: 608 LTGFKNGDIDLSKLLWTYQVGLKGEFFKIYTIEENEKAGWAEL--SPDDDPSTFIWYKTY 665
Query: 636 VKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDK 695
P G +P+ LD+ MGKG AW+NG IGRYW +P D C + CDYRG +N DK
Sbjct: 666 FDSPAGTDPVALDLGSMGKGQAWVNGHHIGRYWTL----VAPEDGCPEICDYRGAYNSDK 721
Query: 696 CITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
C CG+P+Q YH+PRSW + S N+LVI EE GG+P I+ +R
Sbjct: 722 CSFNCGKPTQTLYHVPRSWLQSSSNLLVILEETGGNPFDISIKLR 766
>gi|302759477|ref|XP_002963161.1| hypothetical protein SELMODRAFT_404798 [Selaginella moellendorffii]
gi|300168429|gb|EFJ35032.1| hypothetical protein SELMODRAFT_404798 [Selaginella moellendorffii]
Length = 874
Score = 714 bits (1844), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/756 (48%), Positives = 486/756 (64%), Gaps = 55/756 (7%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A N++YD R++II G+R ++IS +HYPR+ P MWP L++ AKEGG++ I++YVFW+GHE
Sbjct: 20 ATNISYDHRAIIIGGQRRILISGCLHYPRASPQMWPALIRNAKEGGLDMIDTYVFWDGHE 79
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
SPG Y F GR++L++F+K++ QA +Y+ LRIGP+V AE+N+GG P WL +PG FR
Sbjct: 80 PSPGIYNFQGRYDLIRFLKLVHQAGLYVNLRIGPYVCAEWNFGGFPAWLLKLPGIQFRTH 139
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
F+ M++F+ IVDM+K E+LFASQGGP++ +Q+ENEYG + YG GK Y LWAA
Sbjct: 140 NRAFEDKMEEFVRKIVDMVKSEQLFASQGGPVLFSQIENEYGNVQGSYGTNGKTYMLWAA 199
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
+MA GVPWIMC+Q D PD +INTCN +YCD + P+S P +WTENW GW++ +G
Sbjct: 200 RMAKDLETGVPWIMCKQPDAPDYIINTCNGYYCDGWKPNSRDKPAMWTENWSGWYQLWGE 259
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYM------------------YHGGTNFGRTAGG 306
P+R ED+AF+VARFFQ+GG NYYM Y GGTNFGRT+GG
Sbjct: 260 AAPYRTVEDVAFAVARFFQRGGVAQNYYMVRMLHDLEQHLLMPERCQYFGGTNFGRTSGG 319
Query: 307 PFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQE---A 363
PFITTSYDY+AP+DE+G+ R PKWGHLKELH A+KLCE AL + + +LG QE A
Sbjct: 320 PFITTSYDYDAPLDEFGMLRQPKWGHLKELHAALKLCETALTSNDPLYYTLGRMQEMVQA 379
Query: 364 DVYADSS---------GACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFN 414
VY+D S CAAFLAN+ D + +V F Y+LP WSVSILPDC+ VVFN
Sbjct: 380 HVYSDGSLEANFSNLATPCAAFLANI-DTSSASVKFGGNVYNLPPWSVSILPDCRNVVFN 438
Query: 415 TANVRAQSSTVEMVPENLQPSEASPDNGS------KGLKWQVFKEIAGIWGEADFVKSGF 468
TA V AQ+S +MV +PS +GS + L W+ F+E G G +
Sbjct: 439 TAQVSAQTSVTKMVAVQ-KPSLIEEVSGSYTPGLVEQLAWEWFQEPVGGSGINKILAHAL 497
Query: 469 VDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASG 528
++ I+TT D+TDYLWY+T +++ E LK G PVL+I S +H F N E GS S
Sbjct: 498 LEQISTTNDSTDYLWYSTRFEISDQE--LKGGD-PVLVITSMRDMVHIFVNGEFAGSTST 554
Query: 529 NGTHPPF-KYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGIT-SVKITGFNSGTL 586
+ + + + PI LKAG N +A+LS TVGLQN G E GAGIT SV I G ++GT
Sbjct: 555 LKSGGLYARVQQPIHLKAGVNHLAILSATVGLQNYGAHLETHGAGITGSVWIQGLSTGTR 614
Query: 587 DLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIG 646
+L++ W +++GL GEH + I W ST P QPL WYKA P GD+P+
Sbjct: 615 NLTSALWLHQVGLNGEH---------DAITWSSTTSLPFFQPLVWYKANFNIPDGDDPVA 665
Query: 647 LDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQR 706
+ + MGKG AW+NG +GR+WP ++P C CDYRG + KC++GCG PSQ
Sbjct: 666 IHLGSMGKGQAWVNGHSLGRFWP---AITAPSTGCSDRCDYRGTYYSSKCLSGCGLPSQE 722
Query: 707 WYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
WYH+PR W +N LV+ EE GG+ + ++F+ R +
Sbjct: 723 WYHVPREWLVNEKNTLVLLEEIGGNVSGVSFASRVV 758
>gi|356558952|ref|XP_003547766.1| PREDICTED: beta-galactosidase 7-like [Glycine max]
Length = 826
Score = 714 bits (1844), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/747 (48%), Positives = 482/747 (64%), Gaps = 32/747 (4%)
Query: 1 MKPRTPIAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWP 60
M P + + +FF + CFA VTYD+RSLIING R +I S A+HYPRS MWP
Sbjct: 1 MFPMGSSSWVGIALFFLAFTASCFATEVTYDARSLIINGERRVIFSGAVHYPRSTVQMWP 60
Query: 61 GLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFV 120
++Q+AK+GG++ IESYVFW+ HE +Y F G + +KF +IIQ+A +Y ILRIGP+V
Sbjct: 61 DIIQKAKDGGLDAIESYVFWDRHEPVRREYDFSGNLDFIKFFQIIQEAGLYAILRIGPYV 120
Query: 121 AAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQ 180
AE+N+GG P+WLH +PG R D +K MQ F T IV+M K KLFASQGGPIILAQ
Sbjct: 121 CAEWNFGGFPLWLHNMPGIELRTDNPIYKNEMQIFTTKIVNMAKEAKLFASQGGPIILAQ 180
Query: 181 VENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF 240
+ENEYG + YGE GK Y W A+MA+AQNIGVPWIMCQQ D P P+INTCN YCD F
Sbjct: 181 IENEYGNIMTDYGEAGKTYIKWCAQMALAQNIGVPWIMCQQHDAPQPMINTCNGHYCDSF 240
Query: 241 TPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNF 300
P++P PK++TENW GWF+ +G R PHR +ED AFSVARFFQ GG ++NYYMYHGGTNF
Sbjct: 241 QPNNPKSPKMFTENWIGWFQKWGERVPHRSAEDSAFSVARFFQNGGILNNYYMYHGGTNF 300
Query: 301 GRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSS 360
GRTAGGP++TTSY+Y+AP+DEYG PKWGHLK+LH AIKL E + NG R++ G+
Sbjct: 301 GRTAGGPYMTTSYEYDAPLDEYGNLNQPKWGHLKQLHAAIKLGEKIITNGTRTDKDFGNE 360
Query: 361 QEADVYADSSGACAAFLANMDDKNDKTV-VFRNVSYHLPAWSVSILPDCKKVVFNTANVR 419
Y ++G FL+N +D D V + ++ +Y LPAWSV+IL C K VFNTA V
Sbjct: 361 VTLTTYTHTNGERFCFLSNTNDSKDANVDLQQDGNYFLPAWSVTILDGCNKEVFNTAKVN 420
Query: 420 AQSSTVEMVPENLQPSEASPDNGSKGLKWQVF--KEIAGIWGEADFVKSGFVDHINTTKD 477
+Q+S MV ++ D+ S L W K+ + G+ +F + ++ T D
Sbjct: 421 SQTSI--MVKKS--------DDASNKLTWAWIPEKKKDTMHGKGNFKVNQLLEQKELTFD 470
Query: 478 TTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSA----SGNGTHP 533
+DYLWY TS+ +N+ + S L + ++GH L A+ N G GN
Sbjct: 471 VSDYLWYMTSVDINDTSIW----SNATLRVNTRGHTLRAYVNGRHVGYKFSQWGGN---- 522
Query: 534 PFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS--VKITGFNSGTLDLSTY 591
F Y+ +SLK G N I LLS TVGL N G ++ + GI V++ G N+ T+DLST
Sbjct: 523 -FTYEKYVSLKKGLNVITLLSATVGLPNYGAKFDKIKTGIAGGPVQLIGNNNETIDLSTN 581
Query: 592 SWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLK 651
W+YKIGL GE +Y+P R ++W + P + LTWYKA P G++P+ +D+L
Sbjct: 582 LWSYKIGLNGEKKRLYDPQPRIGVSWRTNSPYPIGRSLTWYKADFVAPSGNDPVVVDLLG 641
Query: 652 MGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNP-DKCITGCGEPSQRWYHI 710
+GKG AW+NG+ IGRYW + + + C CDYRGK+ P KC T CG PSQRWYH+
Sbjct: 642 LGKGEAWVNGQSIGRYW---TSWITATNGCSDTCDYRGKYVPAQKCNTNCGNPSQRWYHV 698
Query: 711 PRSWFKPSENILVIFEEKGGDPTKITF 737
PRS+ K +N LV+FEE GG+P ++F
Sbjct: 699 PRSFLKNDKNTLVLFEEIGGNPQNVSF 725
>gi|255550373|ref|XP_002516237.1| beta-galactosidase, putative [Ricinus communis]
gi|223544723|gb|EEF46239.1| beta-galactosidase, putative [Ricinus communis]
Length = 825
Score = 712 bits (1838), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/738 (48%), Positives = 480/738 (65%), Gaps = 25/738 (3%)
Query: 10 FALLIFFSS-SITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKE 68
F L I FS + A +++D R++ I+G+R +++S +IHYPRS P MWP L++++KE
Sbjct: 6 FLLAISFSLFTFHLVSAAVISHDGRAITIDGKRRVLLSGSIHYPRSTPQMWPDLIKKSKE 65
Query: 69 GGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGG 128
GG++ IE+YVFWN HE S +Y FGG +LV+FIK +Q +Y +LRIGP+V AE+NYGG
Sbjct: 66 GGLDAIETYVFWNVHEPSRRQYDFGGNLDLVRFIKAVQDEGLYAVLRIGPYVCAEWNYGG 125
Query: 129 IPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY 188
PVWLH +PG R F MQ F +LIVDMMK+E+LFASQGGPII+AQVENEYG
Sbjct: 126 FPVWLHNMPGIELRTANSIFMNEMQNFTSLIVDMMKQEQLFASQGGPIIIAQVENEYGNV 185
Query: 189 ESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMP 248
S YG GK Y W A MA + NIGVPWIMCQQ D PDP+INTCN +YCDQFTP +P+ P
Sbjct: 186 MSSYGAAGKAYIDWCANMAESLNIGVPWIMCQQSDAPDPMINTCNGWYCDQFTPSNPNSP 245
Query: 249 KIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF 308
K+WTENW GWFK++GG+DPHR +ED+AF+VARFFQ GG+ NYYMYHGGTNFGRTAGGP+
Sbjct: 246 KMWTENWTGWFKSWGGKDPHRTAEDVAFAVARFFQTGGTFQNYYMYHGGTNFGRTAGGPY 305
Query: 309 ITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA- 367
ITTSYDY+AP+DE+G PKWGHLK+LH + E L +G S++ +S A +YA
Sbjct: 306 ITTSYDYDAPLDEFGNLNQPKWGHLKQLHDVLHSMEEILTSGTVSSVDYDNSVTATIYAT 365
Query: 368 DSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEM 427
D +C FL+N ++ +D T+ F+ +Y +PAWSVSILPDC V +NTA V+ Q+S M
Sbjct: 366 DKESSC--FLSNANETSDATIEFKGTTYTIPAWSVSILPDCANVGYNTAKVKTQTSV--M 421
Query: 428 VPENLQPSEASPDNGSKGLKWQ---VFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWY 484
V + ++A + S W+ V K + + G+ VD D +DYLWY
Sbjct: 422 VKRD---NKAEDEPTSLNWSWRPENVDKTV--LLGQGHIHAKQIVDQKAVANDASDYLWY 476
Query: 485 TTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLK 544
TS+ + +++ L + I GH LHA+ N E GS + + ++ + LK
Sbjct: 477 MTSVDLKKDD--LIWSKDMSIRINGSGHILHAYVNGEYLGSQWSEYSVSNYVFEKSVKLK 534
Query: 545 AGKNEIALLSMTVGLQNAGPFYEWVGAGITS----VKITGFNSGTLDLSTYSWTYKIGLQ 600
G+N I LLS TVGL N G Y+ + AGI V G + DLS W+YK+GL
Sbjct: 535 HGRNLITLLSATVGLANYGANYDLIQAGILGPVELVGRKGDETIIKDLSNNRWSYKVGLL 594
Query: 601 GEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLN 660
G +Y ++ W E P N+ LTWYK K P G +P+ LD+ +GKG+AW+N
Sbjct: 595 GLEDKLYLSDSKHASKW-QEQELPTNKMLTWYKTTFKAPLGTDPVVLDLQGLGKGMAWIN 653
Query: 661 GEEIGRYWPRKSRKSSPHDECVQE-CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSE 719
G IGRYWP + D C + CDYRG ++ +KC++ CG+P+QRWYH+PRS+ + +E
Sbjct: 654 GNSIGRYWPSFLAED---DGCSTDLCDYRGPYDNNKCVSNCGKPTQRWYHVPRSFLQDNE 710
Query: 720 NILVIFEEKGGDPTKITF 737
N LV+FEE GG+P+++ F
Sbjct: 711 NTLVLFEEFGGNPSQVNF 728
>gi|449476344|ref|XP_004154711.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 803
Score = 711 bits (1836), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/729 (48%), Positives = 473/729 (64%), Gaps = 30/729 (4%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NV+YDS ++IING R +I S +IHYPRS MWP L+Q+AK+GG++ IE+Y+FW+ HE
Sbjct: 4 NVSYDSNAIIINGERRVIFSGSIHYPRSTDAMWPDLIQKAKDGGLDAIETYIFWDRHEPQ 63
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
KY F G N +KF +++Q A +Y+++RIGP+V AE+NYGG P+WLH +PG R D +
Sbjct: 64 RQKYDFSGHLNFIKFFQLVQDAGLYIVMRIGPYVCAEWNYGGFPLWLHNMPGIQLRTDNQ 123
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
+K M F T IV+M K+ LFASQGGPIILAQ+ENEYG + YG GK Y W A+M
Sbjct: 124 VYKNEMLTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGNAGKAYINWCAQM 183
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
A + NIGVPWIMCQQ D P P+INTCN FYCD F+P++P PK++TENW GWFK +G +D
Sbjct: 184 AESLNIGVPWIMCQQSDAPQPIINTCNGFYCDSFSPNNPKSPKMFTENWVGWFKKWGDKD 243
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
P+R +ED+AFSVARFFQ GG +NYYMYHGGTNFGRT+GGPFITTSYDY AP+DEYG
Sbjct: 244 PYRSAEDVAFSVARFFQSGGVFNNYYMYHGGTNFGRTSGGPFITTSYDYNAPLDEYGNLN 303
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD-SSGACAAFLANMDDKND 385
PKWGHLK+LH +IKL E L NG SN + GS +++ ++ FL+N DD ND
Sbjct: 304 QPKWGHLKQLHSSIKLGEKILTNGTHSNKTFGSFVTLTKFSNPTTKERFCFLSNTDDTND 363
Query: 386 KTVVFR-NVSYHLPAWSVSILPDCKKVVFNTANVRAQSST---VEMVPENLQPSEA-SPD 440
T+ + + Y +PAWSVSI+ CKK VFNTA + +Q+S V+ EN++ S +P+
Sbjct: 364 ATIDLQADGKYFVPAWSVSIIDGCKKEVFNTAKINSQTSMFVKVQNEKENVKLSWVWAPE 423
Query: 441 NGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNG 500
S L+ G+ F ++ ++ TT D++DYLWY T++ N
Sbjct: 424 AMSDTLQ-----------GKGTFKENLLLEQKGTTIDSSDYLWYMTNVETNGTSSI---- 468
Query: 501 SRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQ 560
L + +KGH LHAF N GS GN F ++ PI LKAG N I LLS TVGL+
Sbjct: 469 HNVTLQVNTKGHVLHAFVNTRYIGSQWGNNGQ-SFVFEKPILLKAGTNIITLLSATVGLK 527
Query: 561 NAGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV 618
N FY+ + GI + + G + T +LS+ W+YK+GL GE +YNP + +W
Sbjct: 528 NYDAFYDTLPTGIDGGPIYLIGDGNVTTNLSSNLWSYKVGLNGEIKQLYNPVFSQETSWN 587
Query: 619 STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPH 678
+ + + +TWYK K P G +P+ LDM MGKG AW+NG+ IGR+WP + +
Sbjct: 588 TLNKNSIGRRMTWYKTSFKTPSGIDPVTLDMQGMGKGEAWINGQSIGRFWP---SFIAGN 644
Query: 679 DECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI--- 735
D C + CDYRG ++P KC+ CG PSQRWYHIPRS+ + N LV+FEE GG P ++
Sbjct: 645 DNCSETCDYRGAYDPSKCVGNCGNPSQRWYHIPRSFLSNNTNTLVLFEEIGGSPQQVSVQ 704
Query: 736 TFSIRKISG 744
T +I I G
Sbjct: 705 TITIGTICG 713
>gi|302799737|ref|XP_002981627.1| hypothetical protein SELMODRAFT_421090 [Selaginella moellendorffii]
gi|300150793|gb|EFJ17442.1| hypothetical protein SELMODRAFT_421090 [Selaginella moellendorffii]
Length = 874
Score = 711 bits (1836), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/756 (47%), Positives = 485/756 (64%), Gaps = 55/756 (7%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A N++YD R++II G+R ++IS IHYPR+ P MWP L++ AKEGG++ I++YVFW+GHE
Sbjct: 20 ATNISYDHRAIIIGGQRRILISGCIHYPRASPQMWPALIRNAKEGGLDMIDTYVFWDGHE 79
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
SPG Y F GR++L++F+K++ QA +Y+ LRIGP+V AE+N+GG P WL +PG FR
Sbjct: 80 PSPGIYNFQGRYDLIRFLKLVHQAGLYVNLRIGPYVCAEWNFGGFPAWLLKLPGIQFRTH 139
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
F+ M++F+ IVDM+K E+LFASQGGP++ +Q+ENEYG + YG GK Y LWAA
Sbjct: 140 NRAFEDKMEEFVRKIVDMVKSEQLFASQGGPVLFSQIENEYGNVQGSYGINGKTYMLWAA 199
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
+MA GVPWIMC+Q D PD +INTCN +YCD + P+S P +WTENW GW++++G
Sbjct: 200 RMAKDLETGVPWIMCKQPDAPDYIINTCNGYYCDGWKPNSRDKPAMWTENWSGWYQSWGE 259
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYM------------------YHGGTNFGRTAGG 306
P+R ED+AF+VARFFQ+GG NYYM Y GGTNFGRT+GG
Sbjct: 260 AAPYRTVEDVAFAVARFFQRGGVAQNYYMVRTLHDLEQRLLMPERCQYFGGTNFGRTSGG 319
Query: 307 PFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQE---A 363
PFITTSYDY+AP+DE+G+ R PKWGHLKELH A+KLCE AL + + +LG QE A
Sbjct: 320 PFITTSYDYDAPLDEFGMLRQPKWGHLKELHAALKLCETALTSNDPVYYTLGRMQEMVQA 379
Query: 364 DVYADSS---------GACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFN 414
VY+D S CAAFLAN+ D + +V F Y+LP WSVSILPDC+ VVFN
Sbjct: 380 HVYSDGSLEANFSNLATPCAAFLANI-DTSSASVKFGGKVYNLPPWSVSILPDCRNVVFN 438
Query: 415 TANVRAQSSTVEMVPENLQPSEASPDNGS------KGLKWQVFKEIAGIWGEADFVKSGF 468
TA V AQ+S +MV +PS +GS + L W+ F+E G G +
Sbjct: 439 TAQVSAQTSVTKMVAVQ-KPSLIEEVSGSYTPGLVEQLAWEWFQEPVGGSGINKILAHAL 497
Query: 469 VDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASG 528
++ I+TT D+TDY+WY+T + + E LK G PVL+I S +H F N E GS S
Sbjct: 498 LEQISTTNDSTDYMWYSTRFEILDQE--LKGGD-PVLVITSMRDMVHIFVNGEFAGSTST 554
Query: 529 NGTHPPF-KYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGIT-SVKITGFNSGTL 586
+ + + + PI LKAG N +A+LS TVGLQN G E GAGIT S+ I G ++GT
Sbjct: 555 LKSGGLYARVQQPIHLKAGVNHLAILSATVGLQNYGAHLETHGAGITGSIWIQGLSTGTR 614
Query: 587 DLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIG 646
+L++ W +++GL GEH + I W ST P QPL WYKA P GD+P+
Sbjct: 615 NLTSALWLHQVGLNGEH---------DAITWSSTTSLPFFQPLVWYKANFNIPDGDDPVA 665
Query: 647 LDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQR 706
+ + MGKG AW+NG +GR+WP ++P C CDYRG + KC++ CG PSQ
Sbjct: 666 IHLGSMGKGQAWVNGHSLGRFWP---VITAPSTGCSDRCDYRGTYYSSKCLSSCGLPSQE 722
Query: 707 WYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
WYH+PR W +N LV+ EE GG+ + ++F+ R +
Sbjct: 723 WYHVPREWLVNEKNTLVLLEEIGGNVSGVSFASRVV 758
>gi|224053294|ref|XP_002297749.1| predicted protein [Populus trichocarpa]
gi|222845007|gb|EEE82554.1| predicted protein [Populus trichocarpa]
Length = 823
Score = 710 bits (1833), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/746 (46%), Positives = 480/746 (64%), Gaps = 27/746 (3%)
Query: 6 PIAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQ 65
P +FF + + A VTYD R++II+G+ L++S +IHYPRS MWP LV++
Sbjct: 3 PSKVLLATLFFFTLAPWATASKVTYDGRAIIIDGKHRLLVSGSIHYPRSTAQMWPDLVKK 62
Query: 66 AKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYN 125
++EGG++ IE+YVFW+ HE + +Y F G +L++F+K IQ +Y +LRIGP+V AE+N
Sbjct: 63 SREGGLDAIETYVFWDSHEPARREYDFSGNLDLIRFLKTIQDEGLYAVLRIGPYVCAEWN 122
Query: 126 YGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEY 185
YGG PVWLH +PG R + F M+ F TLIV+M+K+E LFASQGGP+ILAQ+ENEY
Sbjct: 123 YGGFPVWLHNMPGVQMRTANDVFMNEMRNFTTLIVNMVKQENLFASQGGPVILAQIENEY 182
Query: 186 GYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSP 245
G S YG+ GK Y W A MA + +IGVPW+MCQQ D P+P+INTCN +YCDQFTP+ P
Sbjct: 183 GNVMSSYGDEGKAYIEWCANMAQSLHIGVPWLMCQQSDAPEPMINTCNGWYCDQFTPNRP 242
Query: 246 SMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG 305
+ PK+WTENW GWFK++GG+DPHR +ED+AFSVARF+Q GG+ NYYMYHGGTNFGRTAG
Sbjct: 243 TSPKMWTENWTGWFKSWGGKDPHRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFGRTAG 302
Query: 306 GPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADV 365
GP+ITTSYDY+AP+DEYG PKWGHLKELH + E L G S++ G+S +
Sbjct: 303 GPYITTSYDYDAPLDEYGNLNQPKWGHLKELHDVLHSMEDTLTRGNISSVDFGNSVSGTI 362
Query: 366 YADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTV 425
Y+ G+ + FL N D +ND T+ F+ + Y +PAWSVSILPDC+ VV+NTA V AQ+S V
Sbjct: 363 YSTEKGS-SCFLTNTDSRNDTTINFQGLDYEVPAWSVSILPDCQDVVYNTAKVSAQTS-V 420
Query: 426 EMVPENLQPSEASPDNGSKGLKWQVFKEI---AGIWGEADFVKSGFVDHINTTKDTTDYL 482
+ +N+ E + L W E + ++G+ + + +D + D +DYL
Sbjct: 421 MVKKKNVAEDEPA------ALTWSWRPETNDKSILFGKGEVSVNQILDQKDAANDLSDYL 474
Query: 483 WYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPIS 542
+Y TS+ + E++ G L I G LH F N E GS + ++ I
Sbjct: 475 FYMTSVSLKEDDPIW--GDNMTLRITGSGQVLHVFVNGEFIGSQWAKYGVFDYVFEQQIK 532
Query: 543 LKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTL---DLSTYSWTYKIG 598
L GKN I LLS TVG N G ++ AG+ V++ G++ + DLS++ W+YK+G
Sbjct: 533 LNKGKNTITLLSATVGFANYGANFDLTQAGVRGPVELVGYHDDEIIIKDLSSHKWSYKVG 592
Query: 599 LQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAW 658
L+G +Y+ ++ W P N+ TWYKA K P G +P+ +D+L +GKGLAW
Sbjct: 593 LEGLRQNLYS---SDSSKWQQD-NYPTNKMFTWYKATFKAPLGTDPVVVDLLGLGKGLAW 648
Query: 659 LNGEEIGRYWPRKSRKSSPHDEC-VQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWF-K 716
+NG IGRYWP D C + CDYRG ++ +KC+T CG+P+QRWYH+PRS+
Sbjct: 649 VNGNSIGRYWP----SFIAEDGCSLDPCDYRGSYDNNKCVTNCGKPTQRWYHVPRSFLNN 704
Query: 717 PSENILVIFEEKGGDPTKITFSIRKI 742
+N LV+FEE GGDP+ + F I
Sbjct: 705 EGDNTLVLFEEFGGDPSSVNFQTTAI 730
>gi|356529081|ref|XP_003533125.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 832
Score = 710 bits (1832), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/738 (47%), Positives = 471/738 (63%), Gaps = 27/738 (3%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
LL+ F+ A V+YDSR++ I+G+R+++ S +IHYPRS MWP L+ +AKEGG+
Sbjct: 6 LLLSFTLVNLAINAFEVSYDSRAITIDGKRKVLFSGSIHYPRSTAEMWPSLINKAKEGGL 65
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ IE+YVFWN HE P +Y F G +LVKFIK IQ+ +Y +LRIGP+V AE+NYGG PV
Sbjct: 66 DVIETYVFWNAHEPQPRQYDFSGNLDLVKFIKTIQKEGLYAMLRIGPYVCAEWNYGGFPV 125
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WLH +P FR + + MQ F TLIVD M+ E LFASQGGPIILAQ+ENEYG S
Sbjct: 126 WLHNMPNMEFRTNNTAYMNEMQTFTTLIVDKMRHENLFASQGGPIILAQIENEYGNIMSE 185
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
YGE GK+Y W A++A + IGVPW+MCQQ D PDP+INTCN +YCDQF+P+S S PK+W
Sbjct: 186 YGENGKQYVQWCAQLAESYKIGVPWVMCQQSDAPDPIINTCNGWYCDQFSPNSKSKPKMW 245
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TENW GWFK +GG PHR + D+A++VARFFQ GG+ NYYMYHGGTNFGRT+GGP+ITT
Sbjct: 246 TENWTGWFKNWGGPIPHRTARDVAYAVARFFQYGGTFQNYYMYHGGTNFGRTSGGPYITT 305
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
SYDY+AP+DEYG PKWGHLK+LH +K E L G ++ G+ A VY + SG
Sbjct: 306 SYDYDAPLDEYGNKNQPKWGHLKQLHELLKSMEDVLTQGTTNHTDYGNLLTATVY-NYSG 364
Query: 372 ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPEN 431
A FL N + ND T++F++ Y +PAWSVSILP+C V+NTA + AQ+S + M +N
Sbjct: 365 KSACFLGNANSSNDATIMFQSTQYIVPAWSVSILPNCVNEVYNTAKINAQTSIMVM-KDN 423
Query: 432 LQPSEASPDNGSKGLKWQVFKE------IAGIWGEADFVKSGFVDHINTTKDTTDYLWYT 485
+E P + L WQ E + G + +D T DT+DYLWY
Sbjct: 424 KSDNEEEPHS---TLNWQWMHEPHVQMKDGQVLGSVSRKAAQLLDQKVVTNDTSDYLWYI 480
Query: 486 TSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKA 545
TS+ ++EN+ + + + GH LH F N G G F Y+ I LK
Sbjct: 481 TSVDISENDPIWSK-----IRVSTNGHVLHVFVNGAQAGYQYGQNGKYSFTYEAKIKLKK 535
Query: 546 GKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGT---LDLSTYSWTYKIGLQG 601
G NEI+LLS TVGL N G + V G+ V++ + T D++ +W YK+GL G
Sbjct: 536 GTNEISLLSGTVGLPNYGAHFSNVSVGVCGPVQLVALQNNTEVVKDITNNTWNYKVGLHG 595
Query: 602 EHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNG 661
E + +Y P NN W +T P N+ WYK + K P G +P+ +D+ + KG AW+NG
Sbjct: 596 EIVKLYCP--ENNKGW-NTNGLPTNRVFVWYKTLFKSPKGTDPVVVDLKGLKKGQAWVNG 652
Query: 662 EEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFK-PSEN 720
IGRYW +R + + C C+YRG ++ DKCIT CG P+QRWYH+PRS+ + ++N
Sbjct: 653 NNIGRYW---TRYLADDNGCTATCNYRGPYSSDKCITKCGRPTQRWYHVPRSFLRQDNQN 709
Query: 721 ILVIFEEKGGDPTKITFS 738
LV+FEE GG P ++ F+
Sbjct: 710 TLVLFEEFGGHPNEVKFA 727
>gi|449435864|ref|XP_004135714.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cucumis
sativus]
Length = 712
Score = 708 bits (1828), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/740 (49%), Positives = 470/740 (63%), Gaps = 32/740 (4%)
Query: 3 PRTPIAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGL 62
P+T + +LL + S+I G VTYD +++IIN +R ++IS +IHYPRS P MWP L
Sbjct: 2 PKTVLLFLSLLTWVGSTI-----GAVTYDEKAIIINDQRRILISGSIHYPRSTPQMWPDL 56
Query: 63 VQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAA 122
+Q+AK+GG++ IE+YVFWNGHE S GK + + +I+ ++ L P
Sbjct: 57 IQKAKDGGLDIIETYVFWNGHEPSEGKVTWEDFL----YEQILYINCFHVALFXFPPYFX 112
Query: 123 EYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVE 182
+ G P+WL ++PG FR D EPFK MQKF+T IVDMMK EKL+ +QGGPIIL+Q+E
Sbjct: 113 FQKFSGFPIWLKFVPGIAFRTDNEPFKAAMQKFVTKIVDMMKLEKLYHTQGGPIILSQIE 172
Query: 183 NEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTP 242
NEYG E G GK Y W A+MAV GVPW+MC+Q D PDP+I+TCN FYC+ F P
Sbjct: 173 NEYGPVEWQIGAPGKSYTKWFAQMAVDLKTGVPWVMCKQEDAPDPLIDTCNGFYCENFKP 232
Query: 243 HSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGR 302
+ PKIWTENW GW+ FGG P+RP ED+AFSVARF Q GS+ NYY+YHGGTNFGR
Sbjct: 233 NQIYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQNNGSLVNYYVYHGGTNFGR 292
Query: 303 TAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQE 362
T+ G FI TSYD++APIDEYGL R PKWGHL++LH AIKLCE AL++ + ++ LG +QE
Sbjct: 293 TS-GLFIATSYDFDAPIDEYGLIREPKWGHLRDLHKAIKLCEPALVSADPTSTWLGKNQE 351
Query: 363 ADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQS 422
A V+ SS ACAAFLAN D V F N Y LP WS+SILPDCK V FNTA + +S
Sbjct: 352 ARVFKSSS-ACAAFLANYDTSASVKVNFWNNPYDLPPWSISILPDCKTVTFNTAQIGVKS 410
Query: 423 STVEMVPENLQPSEASPDNGSKGLKWQVFKEI-AGIWGEADFVKSGFVDHINTTKDTTDY 481
+M+P + W +KE A + + K G V+ ++ T DTTDY
Sbjct: 411 YEAKMMPIS-------------SFGWLSYKEEPASAYAKDTTTKDGLVEQVSVTWDTTDY 457
Query: 482 LWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPI 541
LWY I ++ E FLK+G P+L + S GH LH F N +L GS G+ P + +
Sbjct: 458 LWYMQDISIDSTEGFLKSGKWPLLSVNSAGHLLHVFINGQLSGSVYGSLEDPRITFSKYV 517
Query: 542 SLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQ 600
+LK G N++++LS+TVGL N G ++ AG+ V + G N GT D+S Y W+YK+GL
Sbjct: 518 NLKQGVNKLSMLSVTVGLPNVGLHFDTWNAGVLGPVTLKGLNEGTRDMSKYKWSYKVGLS 577
Query: 601 GEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLN 660
GE L +Y+ N++ W K QPLTWYK K P G+EP+GLDM M KG W+N
Sbjct: 578 GESLNLYSDKGSNSVQWTKGSLTQK-QPLTWYKTTFKTPAGNEPLGLDMSSMSKGQIWVN 636
Query: 661 GEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSEN 720
G IGRY+P + +C +C Y G F KC+ CGEPSQ+WYHIPR W PS+N
Sbjct: 637 GRSIGRYFP----GYIANGKC-DKCSYAGLFTEKKCLGNCGEPSQKWYHIPRDWLSPSDN 691
Query: 721 ILVIFEEKGGDPTKITFSIR 740
+LVIFEE GG P I+ R
Sbjct: 692 LLVIFEEIGGSPDGISLVKR 711
>gi|359484258|ref|XP_002276918.2| PREDICTED: beta-galactosidase 7-like [Vitis vinifera]
gi|297738528|emb|CBI27773.3| unnamed protein product [Vitis vinifera]
Length = 835
Score = 707 bits (1826), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/740 (47%), Positives = 473/740 (63%), Gaps = 31/740 (4%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
F LL +S++ V+YD R+LII+G+R ++ S +IHYPRS P MWP L+++AK G
Sbjct: 28 FVLLNVLASAV------EVSYDGRALIIDGKRRVLQSGSIHYPRSTPEMWPDLIRKAKAG 81
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ IE+YVFWN HE +Y F G +L++FI+ IQ +Y +LRIGP+V AE+ YGG
Sbjct: 82 GLDAIETYVFWNVHEPLRREYDFSGNLDLIRFIQTIQAEGLYAVLRIGPYVCAEWTYGGF 141
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
P+WLH +PG FR + F MQ F TLIVDM K+EKLFASQGGPII+AQ+ENEYG
Sbjct: 142 PMWLHNMPGIEFRTANKVFMNEMQNFTTLIVDMAKQEKLFASQGGPIIIAQIENEYGNIM 201
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
+ YG+ GK Y W A MA + +IGVPWIMCQQ D P P+INTCN +YCD FTP++P+ PK
Sbjct: 202 APYGDAGKVYVDWCAAMANSLDIGVPWIMCQQSDAPQPMINTCNGWYCDSFTPNNPNSPK 261
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTENW GWFK +GG+DPHR +ED+++SVARFFQ GG+ NYYMYHGGTNFGR AGGP+I
Sbjct: 262 MWTENWTGWFKNWGGKDPHRTAEDLSYSVARFFQTGGTFQNYYMYHGGTNFGRVAGGPYI 321
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TTSYDY+AP+DE+G PKWGHLK+LH +K E L G + + +G+S E VYA +
Sbjct: 322 TTSYDYDAPLDEFGNLNQPKWGHLKDLHTVLKSMEETLTEGNITTIDMGNSVEVTVYA-T 380
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
+ F +N + ND T + Y +PAWSVSILPDCKK V+NTA V AQ+S ++
Sbjct: 381 QKVSSCFFSNSNTTNDATFTYGGTEYTVPAWSVSILPDCKKEVYNTAKVNAQTS---VMV 437
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEI---AGIWGEADFVKSGFVDHINTTKDTTDYLWYTT 486
+N +E P LKW E+ + G+ + +D TT D +DYLWY
Sbjct: 438 KNKNEAEDQP----ASLKWSWRPEMIDDTAVLGKGQVSANRLIDQ-KTTNDRSDYLWYMN 492
Query: 487 SIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAG 546
S+ ++E++ L L + + GH LHA+ N E GS + ++ + LK G
Sbjct: 493 SVDLSEDD--LVWTDNMTLRVNATGHILHAYVNGEYLGSQWATNGIFNYVFEEKVKLKPG 550
Query: 547 KNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTL---DLSTYSWTYKIGLQGE 602
KN IALLS T+G QN G FY+ V +GI+ V+I G DLS++ W+YK+G+ G
Sbjct: 551 KNLIALLSATIGFQNYGAFYDLVQSGISGPVEIVGRKGDETIIKDLSSHKWSYKVGMHGM 610
Query: 603 HLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGE 662
+ +Y+P + W P N+ LTWYK K P G + + +D+ +GKG AW+NG+
Sbjct: 611 AMKLYDP--ESPYKW-EEGNVPLNRNLTWYKTTFKAPLGTDAVVVDLQGLGKGEAWVNGQ 667
Query: 663 EIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENIL 722
+GRYWP S D C CDYRG + KC+ CG P+QRWYH+PRS+ EN L
Sbjct: 668 SLGRYWP----SSIAEDGCNATCDYRGPYTNTKCVRNCGNPTQRWYHVPRSFLTADENTL 723
Query: 723 VIFEEKGGDPTKITFSIRKI 742
V+FEE GG+P+ + F I
Sbjct: 724 VLFEEFGGNPSLVNFQTVTI 743
>gi|449485873|ref|XP_004157296.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 7-like [Cucumis
sativus]
Length = 813
Score = 707 bits (1825), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/734 (48%), Positives = 467/734 (63%), Gaps = 25/734 (3%)
Query: 20 ITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVF 79
+ +C NV+YDS ++IING R +I+S ++HYPRS MWP L+Q+AK+GG++ IE+Y+F
Sbjct: 4 VLFCKGDNVSYDSNAIIINGERRVILSGSMHYPRSTEAMWPDLIQKAKDGGLDAIETYIF 63
Query: 80 WNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGT 139
W+ HE KY F GR + +KF +++Q A +Y+++RIGP+V AE+NYGG P+WLH +PG
Sbjct: 64 WDRHEPQRRKYDFTGRLDFIKFFQLVQDAGLYVVMRIGPYVCAEWNYGGFPLWLHNLPGI 123
Query: 140 VFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRY 199
FR D + +K MQ F T IV+M K+ LFASQGGPIILAQ+ENEYG + YG GK Y
Sbjct: 124 QFRTDNQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGNAGKSY 183
Query: 200 ALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCD-QFTPHSPSMPKIWTENWPGW 258
W A+MA + NIG+PWIMCQQ D P P+INTCN FYCD F+P++P PK++TENW GW
Sbjct: 184 INWCAQMAESLNIGIPWIMCQQSDAPQPIINTCNGFYCDYDFSPNNPKSPKMFTENWVGW 243
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAP 318
FK +G +DP+R ED+AF+VARFFQ GG +NYYMYHGGTNFGRTAGGPFITTSYDY AP
Sbjct: 244 FKKWGDKDPYRSPEDVAFAVARFFQSGGVFNNYYMYHGGTNFGRTAGGPFITTSYDYNAP 303
Query: 319 IDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD-SSGACAAFL 377
+DEYG PKWGHLK+LH +IK+ E L N RS+ L S +++ +SG FL
Sbjct: 304 LDEYGNLNQPKWGHLKQLHASIKMGEKILTNSTRSDQKLXSFVTLTKFSNPTSGERFCFL 363
Query: 378 ANMDDKNDKTVVFR-NVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSE 436
+N D+KND T+ + + Y +PAWSVSIL C K VFNTA + +Q+S V +
Sbjct: 364 SNTDNKNDATIDLQADGKYFVPAWSVSILDGCNKEVFNTAKINSQTSMFVKV-------Q 416
Query: 437 ASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEF 496
+N W + G+ F + ++ TT D +DYLWY T+I N
Sbjct: 417 NKKENAQFSWVWAPEPMRDTLQGKGTFKANLLLEQKGTTVDFSDYLWYMTNIDSNATSSL 476
Query: 497 LKNGSRPVLLIESKGHALHAFANQELQGSA-SGNGTHPPFKYKNPISLKAGKNEIALLSM 555
L + +KGH LHAF N+ GS NG F + PI +K G N I LLS
Sbjct: 477 ----QNVTLQVNTKGHMLHAFVNRRYIGSQWRSNGQS--FVFXKPILIKPGTNTITLLSA 530
Query: 556 TVGLQNAGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRN 613
TVGL+N FY+ V GI + + G + +DLS+ W+YK+GL GE +YNP +
Sbjct: 531 TVGLKNYDAFYDTVPTGIDGGPIYLIGDGNVKIDLSSNLWSYKVGLNGEMKQLYNPVFSQ 590
Query: 614 NINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSR 673
NW + + + +T YK K P G +P+ LDM MGKG AW+NG+ IGR+WP
Sbjct: 591 RTNWSTINQKSIGRRMTLYKTNFKTPSGIDPVTLDMQGMGKGQAWVNGQSIGRFWP---S 647
Query: 674 KSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPT 733
+ +D C CDYRG +NP KC+ CG PSQRWYHIPRS+ N LV+FEE GG+P
Sbjct: 648 FIAGNDSCSTTCDYRGAYNPSKCVENCGNPSQRWYHIPRSFLSDDTNTLVLFEEIGGNPQ 707
Query: 734 KI---TFSIRKISG 744
++ T +I I G
Sbjct: 708 QVSVQTITIGTICG 721
>gi|449436000|ref|XP_004135782.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 838
Score = 706 bits (1822), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/746 (47%), Positives = 472/746 (63%), Gaps = 28/746 (3%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
F+L++ + +C NV+YDS ++IING R +I+S ++HYPRS MWP L+Q+AK+G
Sbjct: 20 FSLVVTLAC-FYFCKGDNVSYDSNAIIINGERRVILSGSMHYPRSTEAMWPDLIQKAKDG 78
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ IE+Y+FW+ HE KY F GR + +KF +++Q A +Y+++RIGP+V AE+NYGG
Sbjct: 79 GLDAIETYIFWDRHEPQRRKYDFTGRLDFIKFFQLVQDAGLYVVMRIGPYVCAEWNYGGF 138
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
P+WLH +PG FR D + +K MQ F T IV+M K+ LFASQGGPIILAQ+ENEYG
Sbjct: 139 PLWLHNLPGIQFRTDNQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVM 198
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCD-QFTPHSPSMP 248
+ YG GK Y W A+MA + NIG+PWIMCQQ D P P+INTCN FYCD F+P++P P
Sbjct: 199 TPYGNAGKSYINWCAQMAESLNIGIPWIMCQQNDAPQPIINTCNGFYCDYDFSPNNPKSP 258
Query: 249 KIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF 308
K++TENW GWFK +G +DP+R ED+AF+VARFFQ GG +NYYMYHGGTNFGRTAGGPF
Sbjct: 259 KMFTENWVGWFKKWGDKDPYRSPEDVAFAVARFFQSGGVFNNYYMYHGGTNFGRTAGGPF 318
Query: 309 ITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD 368
ITTSYDY AP+DEYG PKWGHLK+LH +IK+ E L N RS+ + S +++
Sbjct: 319 ITTSYDYNAPLDEYGNLNQPKWGHLKQLHASIKMGEKILTNSTRSDQKISSFVTLTKFSN 378
Query: 369 -SSGACAAFLANMDDKNDKTVVFRNVSYH---LPAWSVSILPDCKKVVFNTANVRAQSST 424
+SG FL+N D+KND T+ + + +PAWSVSIL C K VFNTA + +Q+S
Sbjct: 379 PTSGERFCFLSNTDNKNDATIDLQADGKYFVPVPAWSVSILDGCNKEVFNTAKINSQTSM 438
Query: 425 VEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWY 484
V + +N W + G+ F + ++ TT D +DYLWY
Sbjct: 439 FVKV-------QNKKENAQFSWVWAPEPMRDTLQGKGTFKANLLLEQKGTTVDFSDYLWY 491
Query: 485 TTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSA-SGNGTHPPFKYKNPISL 543
T+I N L + +KGH LHAF N+ GS NG F ++ PI +
Sbjct: 492 MTNIDSNATSSL----QNVTLQVNTKGHMLHAFVNRRYIGSQWRSNGQS--FVFEKPILI 545
Query: 544 KAGKNEIALLSMTVGLQNAGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQG 601
K G N I LLS TVGL+N FY+ V GI + + G + +DLS+ W+YK+GL G
Sbjct: 546 KPGTNTITLLSATVGLKNYDAFYDTVPTGIDGGPIYLIGDGNVKIDLSSNLWSYKVGLNG 605
Query: 602 EHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNG 661
E +YNP + NW + + + +TWYK K P G + + LDM MGKG AW+NG
Sbjct: 606 EMKQLYNPVFSQRTNWSTINQKSIGRRMTWYKTSFKTPSGIDRVTLDMQGMGKGQAWVNG 665
Query: 662 EEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENI 721
+ IGR+WP + +D C CDYRG +NP KC+ CG PSQRWYHIPRS+ N
Sbjct: 666 QSIGRFWP---SFIASNDSCSTTCDYRGAYNPSKCVENCGNPSQRWYHIPRSFLSDDTNT 722
Query: 722 LVIFEEKGGDPTKI---TFSIRKISG 744
LV+FEE GG+P ++ T +I I G
Sbjct: 723 LVLFEEIGGNPQQVSVQTITIGTICG 748
>gi|356564721|ref|XP_003550597.1| PREDICTED: beta-galactosidase 7-like [Glycine max]
Length = 831
Score = 703 bits (1814), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/719 (48%), Positives = 471/719 (65%), Gaps = 21/719 (2%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NV++D R++ I+G+R ++IS +IHYPRS P MWP L+Q+AKEGG++ IE+YVFWN HE S
Sbjct: 29 NVSHDGRAIKIDGKRRVLISGSIHYPRSTPEMWPELIQKAKEGGLDAIETYVFWNAHEPS 88
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
Y F G ++++F+K IQ++ +Y +LRIGP+V AE+NYGGIPVW+H +P R
Sbjct: 89 RRVYDFSGNNDIIRFLKTIQESGLYGVLRIGPYVCAEWNYGGIPVWVHNLPDVEIRTANS 148
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
F MQ F TLIVDM+K+EKLFASQGGPIIL Q+ENEYG S YG+ GK Y W A M
Sbjct: 149 VFMNEMQNFTTLIVDMLKKEKLFASQGGPIILTQIENEYGNVISQYGDAGKAYMNWCANM 208
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
A + +GVPWIMCQ+ D P P+INTCN +YCD F P+S + PK+WTENW GWFK +GGRD
Sbjct: 209 AESLKVGVPWIMCQESDAPQPMINTCNGWYCDNFEPNSFNSPKMWTENWIGWFKNWGGRD 268
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
PHR +ED+AF+VARFFQ GG+ NYYMYHGGTNFGRTAGGP+ITTSYDY+AP+DEYG
Sbjct: 269 PHRTAEDVAFAVARFFQTGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYGNIA 328
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDK 386
PKWGHLKELH A+K E AL +G S LG+S + +YA ++G+ + FL+N + D
Sbjct: 329 QPKWGHLKELHSALKAMEEALTSGNVSETDLGNSVKVTIYA-TNGSSSCFLSNTNTTADA 387
Query: 387 TVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL 446
T+ FR +Y +PAWSVSILPDC+ +NTA V+ Q+S M EN S+A +
Sbjct: 388 TLTFRGNNYTVPAWSVSILPDCQHEEYNTAKVKEQTSV--MTKEN---SKAEKEAAILKW 442
Query: 447 KWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
W+ + G+++ +D + D +DYLWY T + V ++ L
Sbjct: 443 VWRSENIDKALHGKSNVSAHRLLDQKDAANDASDYLWYMTKLHVKHDDPVWS--ENMTLR 500
Query: 507 IESKGHALHAFANQE-LQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPF 565
I GH +HAF N E + + G H K++ I LK G N I+LLS+TVGLQN G F
Sbjct: 501 INGSGHVIHAFVNGEYIDSHWATYGIHND-KFEPKIKLKHGTNTISLLSVTVGLQNYGAF 559
Query: 566 YEWVGAGITS----VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPG--YRNNINWVS 619
++ AG+ V + G + +LS++ W+YKIGL G +++ + W S
Sbjct: 560 FDTWHAGLVGPIELVSVKGEETIIKNLSSHKWSYKIGLHGWDHKLFSDDSPFAAQSKWES 619
Query: 620 TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHD 679
+ P N+ LTWYK K P G +P+ +D+ MGKG AW+NG+ IGR WP ++ D
Sbjct: 620 E-KLPTNRMLTWYKTTFKAPLGTDPVVVDLQGMGKGYAWVNGKNIGRIWP---SYNAEED 675
Query: 680 ECVQE-CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITF 737
C E CDYRG+++ KC+T CG+P+QRWYH+PRS+ K N LV+F E GG+P+ + F
Sbjct: 676 GCSDEPCDYRGEYSDSKCVTNCGKPTQRWYHVPRSYLKDGANTLVLFAELGGNPSLVNF 734
>gi|356545784|ref|XP_003541315.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 826
Score = 699 bits (1805), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/741 (48%), Positives = 479/741 (64%), Gaps = 32/741 (4%)
Query: 15 FFSSSITYCF---------AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQ 65
F S S+ +CF A V++D R++II+G+R +++S +IHYPRS P MWP L+Q+
Sbjct: 3 FLSLSVWFCFVILSFIGSNAVEVSHDGRAIIIDGKRRVLLSGSIHYPRSTPEMWPELIQK 62
Query: 66 AKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYN 125
AKEGG++ IE+YVFWN HE S Y F G ++++F+K IQ++ +Y +LRIGP+V AE+N
Sbjct: 63 AKEGGLDAIETYVFWNAHEPSRRVYDFSGNNDIIRFLKTIQESGLYGVLRIGPYVCAEWN 122
Query: 126 YGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEY 185
YGGIPVW+H +P R + MQ F TLIVDM+K+EKLFASQGGPIIL Q+ENEY
Sbjct: 123 YGGIPVWVHNLPDVEIRTANSVYMNEMQNFTTLIVDMVKKEKLFASQGGPIILTQIENEY 182
Query: 186 GYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSP 245
G S YG+ GK Y W A MA + N+GVPWIMCQ+ D P +INTCN FYCD F P++P
Sbjct: 183 GNVISHYGDAGKAYMNWCANMAESLNVGVPWIMCQESDAPQSMINTCNGFYCDNFEPNNP 242
Query: 246 SMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG 305
S PK+WTENW GWFK +GGRDPHR +ED+AF+VARFFQ GG+ NYYMYHGGTNF RTAG
Sbjct: 243 SSPKMWTENWVGWFKNWGGRDPHRTAEDVAFAVARFFQTGGTFQNYYMYHGGTNFDRTAG 302
Query: 306 GPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADV 365
GP+ITTSYDY+AP+DEYG PKWGHLKELH +K E L +G S G+S +A +
Sbjct: 303 GPYITTSYDYDAPLDEYGNIAQPKWGHLKELHNVLKSMEETLTSGNVSETDFGNSVKATI 362
Query: 366 YADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTV 425
YA ++G+ + FL++ + D T+ FR +Y +PAWSVSILPDC+ +NTA V Q+S
Sbjct: 363 YA-TNGSSSCFLSSTNTTTDATLTFRGKNYTVPAWSVSILPDCEHEEYNTAKVNVQTSV- 420
Query: 426 EMVPENLQPSEASPDNGSKGLKWQVFKE--IAGIWGEADFVKSGFVDHINTTKDTTDYLW 483
MV EN + E + LKW E + G+++ + +D + D +DYLW
Sbjct: 421 -MVKENSKAEEE-----ATALKWVWRSENIDNALHGKSNVSANRLLDQKDAANDASDYLW 474
Query: 484 YTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSA-SGNGTHPPFKYKNPIS 542
Y T + V ++ G L I S GH +HAF N E GS + G H K++ I
Sbjct: 475 YMTKLHVKHDDPVW--GENMTLRINSSGHVIHAFVNGEHIGSHWATYGIHND-KFEPKIK 531
Query: 543 LKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS----VKITGFNSGTLDLSTYSWTYKIG 598
LK G N I+LLS+TVGLQN G F++ AG+ V + G + +LS+ W+YK+G
Sbjct: 532 LKHGTNTISLLSVTVGLQNYGAFFDTWHAGLVEPIELVSVKGDETIIKNLSSNKWSYKVG 591
Query: 599 LQG-EHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLA 657
L G +H + N + + P ++ LTWYK P G +P+ +D+ MGKG A
Sbjct: 592 LHGWDHKLFSDDSPFAAPNKWESEKLPTDRMLTWYKTTFNAPLGTDPVVVDLQGMGKGYA 651
Query: 658 WLNGEEIGRYWPRKSRKSSPHDECVQE-CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFK 716
W+NG+ IGR WP ++ D C E CDYRG++ KC+T CG+P+QRWYH+PRS+ K
Sbjct: 652 WVNGQNIGRIWP---SYNAEEDGCSDEPCDYRGEYTDSKCVTNCGKPTQRWYHVPRSYLK 708
Query: 717 PSENILVIFEEKGGDPTKITF 737
N LV+F E GG+P+++ F
Sbjct: 709 DGANNLVLFAELGGNPSQVNF 729
>gi|449436074|ref|XP_004135819.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 643
Score = 697 bits (1799), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/661 (51%), Positives = 439/661 (66%), Gaps = 20/661 (3%)
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
+S Y F R++LV+F+K++ QA +Y+ LRIGP+V AE+N+GG PVWL Y+PG FR D
Sbjct: 1 MSKIMYNFEDRYDLVRFVKLVHQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIAFRTD 60
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
PFK MQKF IV +MK EKL+ SQGGPIIL+Q+ENEYG E G GK Y WAA
Sbjct: 61 NGPFKAAMQKFTEKIVGLMKGEKLYESQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAA 120
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
+MA+ + GVPW+MC+Q D PDPVI+TCN FYC+ F P+ PK+WTE W GWF FGG
Sbjct: 121 QMALGLDTGVPWVMCKQDDAPDPVIDTCNGFYCENFKPNKVYKPKMWTEAWTGWFTEFGG 180
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
P+RP ED+A+SVARF Q GGS NYYMYHGGTNFGRTAGGPFI TSYDY+APIDEYGL
Sbjct: 181 PAPYRPVEDMAYSVARFIQNGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGL 240
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
R PKW HL++LH AIKLCE AL++ + + LGS+QEA V+ SG+CAAFLAN D +
Sbjct: 241 LREPKWSHLRDLHKAIKLCEPALVSVDPTVSYLGSNQEAHVFKTRSGSCAAFLANYDASS 300
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
TV F N Y LP WSVSILPDCK V+FNTA V A +S +M P +
Sbjct: 301 SATVTFGNNQYDLPPWSVSILPDCKSVIFNTAKVGAPTSQPKMTPVS------------- 347
Query: 445 GLKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
W + +E A + E +G V+ I+ T+D+TDYLWY T I ++ NE FLK+G P
Sbjct: 348 SFSWLSYNEETASAYTEDTTTMAGLVEQISVTRDSTDYLWYMTDIRIDPNEGFLKSGQWP 407
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
+L + S GHALH F N +L G+ G + + ++L+AG N++++LS+ VGL N G
Sbjct: 408 LLTVFSAGHALHVFINGQLSGTTYGGSENYKLTFSKYVNLRAGINKLSILSVAVGLPNGG 467
Query: 564 PFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
YE W + V + G N T D+S Y W+YKIGL+GE L +++ +++ WV+
Sbjct: 468 LHYETWNTGVLGPVTLKGLNEDTRDMSGYKWSYKIGLKGEALNLHSVSGSSSVEWVTGSL 527
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
+ QPLTWYK P G+EP+ LDM MGKG W+NG+ IGR+WP + K S
Sbjct: 528 VAQKQPLTWYKTTFDSPKGNEPLALDMSSMGKGQIWINGQSIGRHWPAYTAKGS-----C 582
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
+C+Y G FN KC + CGEPSQRWYH+PR+W K S N+LVIFEE GG+P I+ R I
Sbjct: 583 GKCNYGGIFNEKKCHSNCGEPSQRWYHVPRAWLKSSGNVLVIFEEWGGNPEGISLVKRSI 642
Query: 743 S 743
S
Sbjct: 643 S 643
>gi|350537549|ref|NP_001234298.1| beta-galactosidase precursor [Solanum lycopersicum]
gi|7939617|gb|AAF70821.1|AF154420_1 beta-galactosidase [Solanum lycopersicum]
Length = 892
Score = 697 bits (1798), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/740 (47%), Positives = 465/740 (62%), Gaps = 37/740 (5%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NVTYD+R+LII G+R ++ISA IHYPR+ P MWP L+ ++KEGG + IE+Y FWNGHE +
Sbjct: 36 NVTYDNRALIIGGKRRMLISAGIHYPRATPEMWPTLIARSKEGGADVIETYTFWNGHEPT 95
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+Y F GR+++VKF K++ +++ +RIGP+ AE+N+GG P+WL IPG FR D
Sbjct: 96 RGQYNFEGRYDIVKFAKLVGSHGLFLFIRIGPYACAEWNFGGFPIWLRDIPGIEFRTDNA 155
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK M++++ IVD+M E LF+ QGGPIIL Q+ENEYG ES +G GK Y WAA+M
Sbjct: 156 PFKEEMERYVKKIVDLMISESLFSWQGGPIILLQIENEYGNVESSFGPKGKLYMKWAAEM 215
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
AV GVPW+MC+Q D P+ +I+TCN++YCD FTP+S PKIWTENW GWF +G R
Sbjct: 216 AVGLGAGVPWVMCRQTDAPEYIIDTCNAYYCDGFTPNSEKKPKIWTENWNGWFADWGERL 275
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
P+RPSEDIAF++ARFFQ+GGS+ NYYMY GGTNFGRTAGGP TSYDY+AP+DEYGL R
Sbjct: 276 PYRPSEDIAFAIARFFQRGGSLQNYYMYFGGTNFGRTAGGPTQITSYDYDAPLDEYGLLR 335
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSN-LSLGSSQEADVYADSS-----------GACA 374
PKWGHLK+LH AIKLCE AL+ + + LG QEA VY +S G CA
Sbjct: 336 QPKWGHLKDLHAAIKLCEPALVAADSPQYIKLGPKQEAHVYRGTSNNIGQYMSLNEGICA 395
Query: 375 AFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTA---NVRAQSSTVEMVPEN 431
AF+AN+D+ TV F + LP WSV + ++ +T + QS +
Sbjct: 396 AFIANIDEHESATVKFYGQEFTLPPWSV-VFCQIAEIQLSTQLRWGHKLQSKQWAQILFQ 454
Query: 432 L--------QPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLW 483
L +AS ++ S+ W KE G+WG+ +F G ++H+N TKD +DYLW
Sbjct: 455 LGIILCFYKLSLKASSESFSQ--SWMTLKEPLGVWGDKNFTSKGILEHLNVTKDQSDYLW 512
Query: 484 YTTSIIVNENEEFL--KNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPI 541
Y T I +++++ +N P + I+S + F N +L GS G K P+
Sbjct: 513 YLTRIYISDDDISFWEENDVSPTIDIDSMRDFVRIFVNGQLAGSVKGKW----IKVVQPV 568
Query: 542 SLKAGKNEIALLSMTVGLQNAGPFYEWVGAGIT-SVKITGFNSGTLDLSTYSWTYKIGLQ 600
L G N+I LLS TVGLQN G F E GAG +K+TG SG ++L+T WTY++GL+
Sbjct: 569 KLVQGYNDILLLSETVGLQNYGAFLEKDGAGFKGQIKLTGCKSGDINLTTSLWTYQVGLR 628
Query: 601 GEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLN 660
GE L +Y+ + W +WYK P G +P+ LD MGKG AW+N
Sbjct: 629 GEFLEVYDVNSTESAGWTEFPTGTTPSVFSWYKTKFDAPGGTDPVALDFSSMGKGQAWVN 688
Query: 661 GEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSEN 720
G +GRYW +P++ C + CDYRG ++ DKC T CGE +Q WYHIPRSW K N
Sbjct: 689 GHHVGRYWTL----VAPNNGCGRTCDYRGAYHSDKCRTNCGEITQAWYHIPRSWLKTLNN 744
Query: 721 ILVIFEEKGGDPTKITFSIR 740
+LVIFEE P I+ S R
Sbjct: 745 VLVIFEETDKTPFDISISTR 764
>gi|356502275|ref|XP_003519945.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 835
Score = 697 bits (1798), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/745 (46%), Positives = 469/745 (62%), Gaps = 32/745 (4%)
Query: 11 ALLIFFSSSITYCF-AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
LL+ S+ I+ A +V+YD R++ I+G+R+++ S +IHYPRS MWP L++++KEG
Sbjct: 9 TLLLLCSALISIAIEAIDVSYDGRAITIDGKRKILFSGSIHYPRSTAEMWPSLIEKSKEG 68
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ IE+YVFWN HE PG+Y F G +LV+FIK IQ ++ +LRIGP+V AE+NYGG
Sbjct: 69 GLDVIETYVFWNVHEPHPGQYDFSGNLDLVRFIKTIQNQGLHAVLRIGPYVCAEWNYGGF 128
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWLH IP FR + F+ M+KF TLIVDMM+ EKLFASQGGPIILAQ+ENEYG
Sbjct: 129 PVWLHNIPNIEFRTNNAIFEDEMKKFTTLIVDMMRHEKLFASQGGPIILAQIENEYGNIM 188
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
YG+ GK Y W A++A + IGVPWIMCQQ DTPDP+INTCN FYCDQ+ P+S + PK
Sbjct: 189 GSYGQNGKEYVQWCAQLAQSYQIGVPWIMCQQSDTPDPLINTCNGFYCDQWHPNSNNKPK 248
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTE+W GWF +GG PHR +ED+AF+V RFFQ GG+ NYYMYHGGTNFGRT+GGP+I
Sbjct: 249 MWTEDWTGWFMHWGGPTPHRTAEDVAFAVGRFFQYGGTFQNYYMYHGGTNFGRTSGGPYI 308
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TTSYDY+AP++EYG PKWGHLK LH +K E L G N+ G+ A +++
Sbjct: 309 TTSYDYDAPLNEYGDLNQPKWGHLKRLHEVLKSVETTLTMGSSRNIDYGNQMTATIFS-Y 367
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
+G FL N D + F+N Y +PAWSVSILPDC V+NTA V AQ+S + +
Sbjct: 368 AGQSVCFLGNAHPSMDANINFQNTQYTIPAWSVSILPDCYTEVYNTAKVNAQTSIMTINN 427
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEI-------AGIWGEADFVKSGFVDHINTTKDTTDYL 482
EN S L WQ E + G +D DT+DYL
Sbjct: 428 EN-----------SYALDWQWMPETHLEQMKDGKVLGSVAITAPRLLDQ-KVANDTSDYL 475
Query: 483 WYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPIS 542
WY TS+ V + + L + + + + +KGH LH F N GS PF ++ I
Sbjct: 476 WYITSVDVKQGDPILSHDLK--IRVNTKGHVLHVFVNGAHIGSQYATYGKYPFTFEADIK 533
Query: 543 LKAGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSG---TLDLSTYSWTYKIGL 599
LK GKNEI+L+S TVGL N G +++ + G+T V++ N G T D+ST W YK+G+
Sbjct: 534 LKLGKNEISLVSGTVGLPNYGAYFDNIHVGVTGVQLVSQNDGSEVTKDISTNVWHYKVGM 593
Query: 600 QGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWL 659
GE++ +Y+P R++ W T ++ WYK + P G + + LD+ +GKG AW+
Sbjct: 594 HGENVKLYSPS-RSSEEWF-TNGLQAHKIFMWYKTTFRTPVGTDSVVLDLKGLGKGQAWV 651
Query: 660 NGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPS- 718
NG IGRYW + D C CDYRG + +KC T CG P+QRWYH+P S+ +
Sbjct: 652 NGNNIGRYW---VSYLAGEDGCSSTCDYRGTYRSNKCTTNCGNPTQRWYHVPDSFLRDGL 708
Query: 719 ENILVIFEEKGGDPTKITFSIRKIS 743
+N LV+FEE+GG+P ++ + I+
Sbjct: 709 DNTLVVFEEQGGNPFQVKIATVTIA 733
>gi|357484129|ref|XP_003612351.1| Beta-galactosidase [Medicago truncatula]
gi|355513686|gb|AES95309.1| Beta-galactosidase [Medicago truncatula]
Length = 806
Score = 696 bits (1796), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/724 (49%), Positives = 460/724 (63%), Gaps = 27/724 (3%)
Query: 23 CFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNG 82
CFA VTYDS +LIING R LI S AIHYPRS MWP L+Q+AK+GG++ IE+Y+FW+
Sbjct: 5 CFATEVTYDSNALIINGERRLIFSGAIHYPRSTVEMWPDLIQKAKDGGLDAIETYIFWDR 64
Query: 83 HELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFR 142
HE +Y F G + VKF ++IQ+A +Y I+RIGP+ AE+N+GG P WLH +PG R
Sbjct: 65 HEPVRREYNFSGNLDFVKFFQLIQKAGLYAIMRIGPYACAEWNFGGFPSWLHNMPGIELR 124
Query: 143 NDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALW 202
+ +K MQ F T IV+++K KLFASQGGPIILAQ+ENEYG Y + GK Y W
Sbjct: 125 TNNSVYKNEMQNFTTEIVNVVKEAKLFASQGGPIILAQIENEYGDIMWNYKDAGKAYVQW 184
Query: 203 AAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTF 262
AA+MA+AQNIGVPWIMCQQ D P P+INTCN +YC F P++P PKI+TENW GWF+ +
Sbjct: 185 AAQMALAQNIGVPWIMCQQQDAPQPIINTCNGYYCHNFQPNNPKSPKIFTENWIGWFQKW 244
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
G R PHR +ED AFSVARFFQ GG ++NYYMYHGGTNFGRTAGGP+ITTSYDY+APIDEY
Sbjct: 245 GERVPHRSAEDSAFSVARFFQNGGVLNNYYMYHGGTNFGRTAGGPYITTSYDYDAPIDEY 304
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLN-GERSNLSLGSSQEADVYADSSGACAAFLANMD 381
G PKWGHLK LH AIKL E+ L N R + LG+ Y +SSGA FL+N +
Sbjct: 305 GNLNQPKWGHLKNLHAAIKLGENVLTNYSARKDEDLGNGLTLTTYTNSSGARFCFLSNNN 364
Query: 382 --DKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASP 439
D + + + Y +PAWSVSI+ C + VFNTA V +Q+S + +N+ + +
Sbjct: 365 NTDLGARVDLKNDGVYIVPAWSVSIINGCNQEVFNTAKVNSQTSMMVKKSDNVSSTNLT- 423
Query: 440 DNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKN 499
+W+V + I G ++ T D +DYLWY TS +N+ +
Sbjct: 424 ------WEWKVEPKRDTIHGNGSLKAQKLLEQKELTLDASDYLWYMTSADINDTSIW--- 474
Query: 500 GSRPVLLIESKGHALHAFANQELQG---SASGNGTHPPFKYKNPISLKAGKNEIALLSMT 556
S L + + GH+LH + NQ G S GN F Y+ +SLK G N I LLS T
Sbjct: 475 -SNATLRVNTSGHSLHGYVNQRYVGYQFSQYGN----QFTYEKQVSLKNGTNIITLLSAT 529
Query: 557 VGLQNAGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNN 614
VGL N G +++ GI+ V++ G N+ T+DLST W+YKIGL GE +Y+ +
Sbjct: 530 VGLANYGAWFDDKKTGISGGPVELIGKNNVTMDLSTNLWSYKIGLNGERRHLYDAQQNVS 589
Query: 615 INW-VSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSR 673
+ W ++ P +PL WY+A K P G PI +D+ +GKG AW+NG IGRYW S
Sbjct: 590 VAWHTNSSYIPIGKPLIWYRAKFKSPFGTNPIVVDLQGLGKGHAWVNGHSIGRYW---SS 646
Query: 674 KSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPT 733
SP D C CDYRG + P KC T CG PSQRWYH+PRS+ N LV+FEE GG+P
Sbjct: 647 WISPSDGCSDTCDYRGNYVPVKCNTNCGSPSQRWYHVPRSFLNHDMNTLVLFEEIGGNPQ 706
Query: 734 KITF 737
+ F
Sbjct: 707 SVQF 710
>gi|357450109|ref|XP_003595331.1| Beta-galactosidase [Medicago truncatula]
gi|355484379|gb|AES65582.1| Beta-galactosidase [Medicago truncatula]
Length = 830
Score = 695 bits (1793), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/748 (48%), Positives = 491/748 (65%), Gaps = 28/748 (3%)
Query: 1 MKPRTPIAPFALL-IFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMW 59
M + + PF L IF + TY A V++D R++ I+G+R ++IS +IHYPRS P MW
Sbjct: 1 MASKCFVFPFFLCYIFLALYGTY--AVEVSHDGRAIKIDGKRRVLISGSIHYPRSTPQMW 58
Query: 60 PGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPF 119
P L+++AKEGG++ IE+YVFWN HE +Y F G +L++F+K IQ ++ +LRIGP+
Sbjct: 59 PDLIKKAKEGGLDAIETYVFWNAHEPIRREYDFSGNNDLIRFLKTIQDEGLFAVLRIGPY 118
Query: 120 VAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILA 179
V AE+NYGGIPVW++ +PG R + F MQ F TLIVDM+++EKLFASQGGPIIL+
Sbjct: 119 VCAEWNYGGIPVWVYNLPGVEIRTANKVFMNEMQNFTTLIVDMVRKEKLFASQGGPIILS 178
Query: 180 QVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQ 239
Q+ENEYG S YG+ GK Y W A MA + NIGVPWIMCQQ D P P+INTCN +YC
Sbjct: 179 QIENEYGNVMSAYGDEGKAYINWCANMADSFNIGVPWIMCQQPDAPQPMINTCNGWYCHD 238
Query: 240 FTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTN 299
F P++P+ PK+WTENW GWFK +GG+DPHR +EDIA+SVARFF+ GG+ NYYMYHGGTN
Sbjct: 239 FEPNNPNSPKMWTENWVGWFKNWGGKDPHRTAEDIAYSVARFFETGGTFQNYYMYHGGTN 298
Query: 300 FGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGS 359
FGRTAGGP+ITTSYDY+AP+DEYG PKWGHLKELH +K E++L NG S + LGS
Sbjct: 299 FGRTAGGPYITTSYDYDAPLDEYGNIAQPKWGHLKELHLVLKSMENSLTNGNVSKIDLGS 358
Query: 360 SQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVR 419
+A VYA ++ + + FL N + D TV F+ +Y++PAWSVSILPDC+ +NTA V
Sbjct: 359 YVKATVYA-TNDSSSCFLTNTNTTTDATVTFKGNTYNVPAWSVSILPDCQTEEYNTAKVN 417
Query: 420 AQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIA--GIWGEADFVKSGFVDHINTTKD 477
Q+S + E ++ + LKW E + G++ K+ VD D
Sbjct: 418 VQTSI-------MVKRENKAEDEPEALKWVWRAENVHNSLIGKSSVSKNTIVDQKIAAND 470
Query: 478 TTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSA-SGNGTHPPFK 536
++DYLWY T + +N+ + N + +L I GH +HAF N E GS + G H +
Sbjct: 471 SSDYLWYMTRLDINQKDPVWTNNT--ILRINGTGHVIHAFVNGEHIGSHWATYGIHND-Q 527
Query: 537 YKNPISLKAGKNEIALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTL---DLSTYS 592
++ I LK G+N+I+LLS+TVGLQN G Y+ W ++ +++ G DLS++
Sbjct: 528 FETNIKLKHGRNDISLLSVTVGLQNYGKEYDKWQDGLVSPIELIGTKGDETIIKDLSSHK 587
Query: 593 WTYKIGLQGEHLGIYNPG--YRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDML 650
WTYK+GL G ++ + ++ W S E P N+ LTWYK K P +PI +D+
Sbjct: 588 WTYKVGLHGWENKFFSQDTFFASSSKWESN-ELPINKMLTWYKTTFKAPLESDPIVVDLQ 646
Query: 651 KMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE-CDYRGKFNPDKCITGCGEPSQRWYH 709
MGKG AW+NG +GRYWP ++ D C + CDYRG++N KC++ CG+PSQRWYH
Sbjct: 647 GMGKGYAWVNGHSLGRYWP---SYNADEDGCSDDPCDYRGEYNDTKCVSNCGKPSQRWYH 703
Query: 710 IPRSWFKPSENILVIFEEKGGDPTKITF 737
+PR + + N LV+FEE GG+P++I F
Sbjct: 704 VPRDFIEDGVNTLVLFEEIGGNPSQINF 731
>gi|449442765|ref|XP_004139151.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 7-like [Cucumis
sativus]
Length = 803
Score = 694 bits (1791), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/737 (47%), Positives = 465/737 (63%), Gaps = 46/737 (6%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NV+YDS ++IING R +I S +IHYPRS MWP L+Q+AK+GG++ IE+Y+FW+ HE
Sbjct: 4 NVSYDSNAIIINGERRVIFSGSIHYPRSTDAMWPDLIQKAKDGGLDAIETYIFWDRHEPQ 63
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
KY F G N +KF +++Q A +Y+++RIGP+V AE+NYGG P+WLH +PG R D +
Sbjct: 64 RQKYDFSGHLNFIKFFQLVQDAGLYIVMRIGPYVCAEWNYGGFPLWLHNMPGIQLRTDNQ 123
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
+K M F T IV+M K+ LFASQGGPIILAQ+ENEYG + YG GK Y W A+M
Sbjct: 124 VYKNEMLTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGNAGKAYINWCAQM 183
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
A + NIGVPWIMCQQ D P P+INTCN FYCD F+P++P PK++TENW GWFK +G +D
Sbjct: 184 AESFNIGVPWIMCQQSDAPQPIINTCNGFYCDSFSPNNPKSPKMFTENWVGWFKKWGDKD 243
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
P+R +ED+AFSVARFFQ GG +NYYMYHGGTNFGRT+GGPFITTSYDY AP+DEYG
Sbjct: 244 PYRSAEDVAFSVARFFQSGGVFNNYYMYHGGTNFGRTSGGPFITTSYDYNAPLDEYGNLN 303
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD----------SSGACAAF 376
PKWGHLK+LH +IKL E L NG SN + GS + ++ F
Sbjct: 304 QPKWGHLKQLHSSIKLGEKILTNGTHSNKTFGSFVTFKTFGSFVTLTKFSNPTTKERFCF 363
Query: 377 LANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSST---VEMVPENLQ 433
L+N + K Y +PAWSVSI+ CKK VFNTA + +Q+S V+ EN++
Sbjct: 364 LSNTXKADGK--------YFVPAWSVSIIDGCKKEVFNTAKINSQTSIFVKVQNEKENVK 415
Query: 434 PSEA-SPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNE 492
S +P+ S L+ G+ F ++ ++ TT D++DYLWY T++ N
Sbjct: 416 LSWVWAPEAMSDTLQ-----------GKGTFKENLLLEQKGTTIDSSDYLWYMTNVETNG 464
Query: 493 NEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIAL 552
L + +KGH LHAF N GS GN F ++ PI LKAG N I L
Sbjct: 465 TSSI----HNVTLQVNTKGHVLHAFVNTRYIGSQWGNNGQ-SFVFEKPILLKAGTNIITL 519
Query: 553 LSMTVGLQNAGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPG 610
LS TVGL+N FY+ + GI + + G + +DLS+ W+YK+GL GE +YNP
Sbjct: 520 LSATVGLKNYDAFYDTLPTGIDGGPIYLIGDGNVKIDLSSNLWSYKVGLNGEIKQLYNPV 579
Query: 611 YRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPR 670
+ +W + + + +TWYK K P G +P+ LDM MGKG AW+NG+ IGR+WP
Sbjct: 580 FSQETSWNTLNKNSIGRRMTWYKTSFKTPSGIDPVTLDMQGMGKGEAWINGQSIGRFWP- 638
Query: 671 KSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGG 730
+ +D C + CDYRG ++P KC+ CG PSQRWYHIPRS+ + N LV+FEE GG
Sbjct: 639 --SFIAGNDNCSETCDYRGAYDPSKCVGNCGNPSQRWYHIPRSFLSNNTNTLVLFEEIGG 696
Query: 731 DPTKI---TFSIRKISG 744
P ++ T +I I G
Sbjct: 697 SPQQVSVQTITIGTICG 713
>gi|413957070|gb|AFW89719.1| hypothetical protein ZEAMMB73_400203 [Zea mays]
Length = 809
Score = 692 bits (1787), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/755 (46%), Positives = 466/755 (61%), Gaps = 78/755 (10%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPG--------------------------MWPG 61
VTYD ++++I+G+R ++ S +IHYPRS P MW G
Sbjct: 27 VTYDKKAVLIDGQRRILFSGSIHYPRSTPDVTAFYKISSPPTIPWRGLWLRIYGSEMWEG 86
Query: 62 LVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVA 121
L+Q+AK+GG++ I++YVFWNGHE +PG G F ++
Sbjct: 87 LIQKAKDGGLDVIQTYVFWNGHEPTPGNDSDGIFFRFEQYY------------------- 127
Query: 122 AEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQ- 180
+ G PVWL Y+PG FR D EPFK MQ F IV MMK E LFASQGGPIIL+Q
Sbjct: 128 --FEESGFPVWLKYVPGISFRTDNEPFKTAMQGFTEKIVGMMKSENLFASQGGPIILSQA 185
Query: 181 --------VENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTC 232
+ENEYG +G G+ Y WAAKMAV GVPW+MC++ D PDPVIN C
Sbjct: 186 SIIFSLDLIENEYGPEGREFGAAGQAYINWAAKMAVGLGTGVPWVMCKEEDAPDPVINAC 245
Query: 233 NSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYY 292
N FYCD F+P+ P P +WTE W GWF FGG RP ED+AF+VARF QKGGS NYY
Sbjct: 246 NGFYCDAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAFAVARFVQKGGSFINYY 305
Query: 293 MYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGER 352
MYHGGTNFGRTAGGPFITTSYDY+APIDEYGL R PK HLKELH A+KLCE AL++ +
Sbjct: 306 MYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLVREPKHSHLKELHRAVKLCEQALVSVDP 365
Query: 353 SNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVV 412
+ +LG+ QEA V+ SG CAAFLAN + + VVF N Y LP WS+SILPDCK VV
Sbjct: 366 AITTLGTMQEARVFQSPSG-CAAFLANYNSNSYAKVVFNNEQYSLPPWSISILPDCKNVV 424
Query: 413 FNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVF-KEIAGIWGEADFVKSGFVDH 471
FN+A V Q+S ++M +G+ + W+ + +E+ + +G ++
Sbjct: 425 FNSATVGVQTSQMQMW-----------GDGASSMTWERYDEEVDSLAAAPLLTTTGLLEQ 473
Query: 472 INTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV-LLIESKGHALHAFANQELQGSASGNG 530
+N T+D++DYLWY TS+ ++ +E FL+ G +P+ L ++S GHALH F N +LQGSA G
Sbjct: 474 LNVTRDSSDYLWYITSVDISSSENFLQGGGKPLSLSVQSAGHALHVFVNGQLQGSAYGTR 533
Query: 531 THPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLS 589
KY SL+AG N+IALLS+ GL N G YE G+ V + G + G+ DL+
Sbjct: 534 EDRRIKYNGNASLRAGTNKIALLSVACGLPNVGVHYETWNTGVGGPVVLHGLDEGSRDLT 593
Query: 590 TYSWTYKIGLQGEHLGIYNPGYRNNINWVS-TMEPPKNQPLTWYKAVVKQPPGDEPIGLD 648
+W+Y++GL+GE + + + +++ W+ ++ QPL WY+A + P GDEP+ LD
Sbjct: 594 WQTWSYQVGLKGEQMNLNSIEGSSSVEWMQGSLIAQNQQPLAWYRAYFETPSGDEPLALD 653
Query: 649 MLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWY 708
M MGKG W+NG+ IGRYW ++ D +EC Y G F KC +GCG+P+QRWY
Sbjct: 654 MGSMGKGQIWINGQSIGRYW------TAYADGDCKECSYTGTFRAPKCQSGCGQPTQRWY 707
Query: 709 HIPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
H+P+SW +P+ N+LV+FEE GGD +KI R +S
Sbjct: 708 HVPKSWLQPTRNLLVVFEELGGDSSKIALVKRSVS 742
>gi|356502277|ref|XP_003519946.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 835
Score = 692 bits (1787), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/730 (46%), Positives = 459/730 (62%), Gaps = 31/730 (4%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A +V+YD R++ I+G+R+++ S +IHYPRS MWP L++++KEGG++ IE+YVFWN HE
Sbjct: 24 AIDVSYDGRAITIDGKRKILFSGSIHYPRSTAEMWPSLIEKSKEGGLDVIETYVFWNVHE 83
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
PG+Y F G +LV+FIK IQ +Y +LRIGP+V AE+NYGG PVWLH IP FR +
Sbjct: 84 PHPGQYDFSGNLDLVRFIKTIQNQGLYAVLRIGPYVCAEWNYGGFPVWLHNIPNIEFRTN 143
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
F+ M+KF TLIVDMM+ EKLFASQGGPIILAQ+ENEYG YG+ GK Y W A
Sbjct: 144 NAIFEDEMKKFTTLIVDMMRHEKLFASQGGPIILAQIENEYGNIMGSYGQNGKEYVQWCA 203
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
++A + IGVPWIMCQQ D PDP+INTCN FYCDQ+ P+S + PK+WTE+W GWF +GG
Sbjct: 204 QLAQSYQIGVPWIMCQQSDAPDPLINTCNGFYCDQWHPNSNNKPKMWTEDWTGWFMHWGG 263
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
PHR +ED+AF+V RFFQ GG+ NYYMYHGGTNFGRT+GGP+ITTSYDY+AP++EYG
Sbjct: 264 PTPHRTAEDVAFAVGRFFQYGGTFQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLNEYGD 323
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
PKWGHLK LH +K E L G N+ G+ A +++ +G FL N
Sbjct: 324 LNQPKWGHLKRLHEVLKSVETTLTMGSSRNIDYGNQMTATIFS-YAGQSVCFLGNAHPSM 382
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
D + F+N Y +PAWSVSILPDC V+NTA V AQ+S + + EN S
Sbjct: 383 DANINFQNTQYTIPAWSVSILPDCYTEVYNTAKVNAQTSIMTINNEN-----------SY 431
Query: 445 GLKWQVFKEI-------AGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFL 497
L WQ E + G +D DT+DYLWY TS+ V + + L
Sbjct: 432 ALDWQWMPETHLEQMKDGKVLGSVAITAPRLLDQ-KVANDTSDYLWYITSVDVKQGDPIL 490
Query: 498 KNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTV 557
+ + + + +KGH LH F N GS F ++ I LK GKNEI+L+S TV
Sbjct: 491 SHDLK--IRVNTKGHVLHVFVNGAHIGSQYATYGKYTFTFEADIKLKLGKNEISLVSGTV 548
Query: 558 GLQNAGPFYEWVGAGITSVKITGFNSG---TLDLSTYSWTYKIGLQGEHLGIYNPGYRNN 614
GL N G +++ + G+T V++ N G T D+ST W YK+G+ GE++ +Y+P R+
Sbjct: 549 GLPNYGAYFDNIHVGVTGVQLVSQNDGSEVTKDISTNVWHYKVGMHGENVKLYSPS-RST 607
Query: 615 INWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRK 674
W T ++ WYK + P G + + LD+ +GKG AW+NG IGRYW
Sbjct: 608 EEWF-TNGLQAHKIFMWYKTTFRTPVGTDSVVLDLKGLGKGQAWVNGNNIGRYW---VSY 663
Query: 675 SSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPS-ENILVIFEEKGGDPT 733
+ D C CDYRG + +KC T CG P+QRWYH+P S+ + +N LV+FEE+GG+P
Sbjct: 664 LAGEDGCSSTCDYRGTYRSNKCTTNCGNPTQRWYHVPDSFLRDGLDNTLVVFEEQGGNPF 723
Query: 734 KITFSIRKIS 743
++ + I+
Sbjct: 724 QVKIATVTIA 733
>gi|326517964|dbj|BAK07234.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 616
Score = 690 bits (1780), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/623 (54%), Positives = 430/623 (69%), Gaps = 15/623 (2%)
Query: 89 KYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPF 148
+Y F GR +LV+F+K A +Y+ LRIGP+V AE+NYGG P+WLH+IPG R D EPF
Sbjct: 1 QYDFEGRNDLVRFVKAAADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKLRTDNEPF 60
Query: 149 KYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAV 208
K MQ+F +V MK L+ASQGGPIIL+Q+ENEYG + YG GK Y WAA MAV
Sbjct: 61 KTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIAASYGAAGKSYIRWAAGMAV 120
Query: 209 AQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPH 268
A + GVPW+MCQQ D P+P+INTCN FYCDQFTP PS PK+WTENW GWF +FGG P+
Sbjct: 121 ALDTGVPWVMCQQTDAPEPLINTCNGFYCDQFTPSLPSRPKLWTENWSGWFLSFGGAVPY 180
Query: 269 RPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNP 328
RP+ED+AF+VARF+Q+GG++ NYYMYHGGTNFGR++GGPFI+TSYDY+APIDEYGL R P
Sbjct: 181 RPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDEYGLVRQP 240
Query: 329 KWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTV 388
KWGHL+++H AIK+CE AL+ + S +SLG + EA VY S CAAFLAN+DD++DKTV
Sbjct: 241 KWGHLRDVHKAIKMCEPALIATDPSYMSLGQNAEAHVYKSGS-LCAAFLANIDDQSDKTV 299
Query: 389 VFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGS----- 443
F +Y LPAWSVSILPDCK VV NTA + +Q ++ +M NL S + D S
Sbjct: 300 TFNGKAYKLPAWSVSILPDCKNVVLNTAQINSQVASTQM--RNLGFSTQASDGSSVEAEL 357
Query: 444 KGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
W E GI E K G ++ INTT D +D+LWY+TSI+V E +L NGS+
Sbjct: 358 AASSWSYAVEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVAGGEPYL-NGSQS 416
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
LL+ S GH L F N +L GS+ G+ + P++L GKN+I LLS TVGL N G
Sbjct: 417 NLLVNSLGHVLQVFINGKLAGSSKGSASSSLISLTTPVTLVTGKNKIDLLSATVGLTNYG 476
Query: 564 PFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
F++ VGAGIT VK+TG GTLDLS+ WTY+IGL+GE L +YNP + WVS
Sbjct: 477 AFFDLVGAGITGPVKLTG-PKGTLDLSSAEWTYQIGLRGEDLHLYNPS-EASPEWVSDNS 534
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
P N PLTWYK+ P GD+P+ +D MGKG AW+NG+ IGRYWP +P CV
Sbjct: 535 YPTNNPLTWYKSKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWP---TNIAPQSGCV 591
Query: 683 QECDYRGKFNPDKCITGCGEPSQ 705
C+YRG ++ KC+ CG+PSQ
Sbjct: 592 NSCNYRGSYSATKCLKKCGQPSQ 614
>gi|1352075|sp|P49676.1|BGAL_BRAOL RecName: Full=Beta-galactosidase; Short=Lactase; Flags: Precursor
gi|669059|emb|CAA59162.1| beta-galactosidase [Brassica oleracea]
Length = 828
Score = 689 bits (1778), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/745 (47%), Positives = 470/745 (63%), Gaps = 31/745 (4%)
Query: 7 IAPFALLIFFSSSITYCFAGN---VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLV 63
+ F LL F IT + N V++D R++ I+G+R +++S +IHYPRS MWP L+
Sbjct: 3 MKQFNLLSLFLILITSFGSANSTIVSHDERAITIDGQRRILLSGSIHYPRSTSDMWPDLI 62
Query: 64 QQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAE 123
+AK+GG++TIE+YVFWN HE S +Y F G +LV+FIK IQ A +Y +LRIGP+V AE
Sbjct: 63 SKAKDGGLDTIETYVFWNAHEPSRRQYDFSGNLDLVRFIKTIQSAGLYSVLRIGPYVCAE 122
Query: 124 YNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVEN 183
+NYGG PVWLH +P FR F MQ F T IV+MMK E LFASQGGPIILAQ+EN
Sbjct: 123 WNYGGFPVWLHNMPDMKFRTINPGFMNEMQNFTTKIVNMMKEESLFASQGGPIILAQIEN 182
Query: 184 EYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPH 243
EYG S YG GK Y W A MA + +IGVPWIMCQQ P P+I TCN FYCDQ+ P
Sbjct: 183 EYGNVISSYGAEGKAYIDWCANMANSLDIGVPWIMCQQPHAPQPMIETCNGFYCDQYKPS 242
Query: 244 SPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRT 303
+PS PK+WTENW GWFK +GG+ P+R +ED+AFSVARFFQ GG+ NYYMYHGGTNFGR
Sbjct: 243 NPSSPKMWTENWTGWFKNWGGKHPYRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFGRV 302
Query: 304 AGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEA 363
AGGP+ITTSYDY+AP+DEYG PKWGHLK+LH +K E L G S + LG+S A
Sbjct: 303 AGGPYITTSYDYDAPLDEYGNLNQPKWGHLKQLHTLLKSMEKPLTYGNISTIDLGNSVTA 362
Query: 364 DVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSS 423
VY+ + + + F+ N++ D V F+ Y++PAWSVS+LPDC K +NTA V Q+S
Sbjct: 363 TVYSTNEKS-SCFIGNVNATADALVNFKGKDYNVPAWSVSVLPDCDKEAYNTARVNTQTS 421
Query: 424 TVEMVPENLQPSEASPDNGSKGLKWQVFKEIAG----IWGEADFVKSGFVDHINTTKDTT 479
+ +E S D K LKW E + G D + G VD + T D +
Sbjct: 422 II---------TEDSCDEPEK-LKWTWRPEFTTQKTILKGSGDLIAKGLVDQKDVTNDAS 471
Query: 480 DYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKN 539
DYLWY T + +++ + L + S H LHA+ N + G+ ++++
Sbjct: 472 DYLWYMTRVHLDKKDPIWSRNMS--LRVHSNAHVLHAYVNGKYVGNQIVRDNKFDYRFEK 529
Query: 540 PISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTL---DLSTYSWTY 595
++L G N +ALLS++VGLQN GPF+E GI VK+ G+ DLS + W Y
Sbjct: 530 KVNLVHGTNHLALLSVSVGLQNYGPFFESGPTGINGPVKLVGYKGDETIEKDLSKHQWDY 589
Query: 596 KIGLQGEHLGIYN--PGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMG 653
KIGL G + +++ ++ W ST + P ++ L+WYKA K P G +P+ +D+ +G
Sbjct: 590 KIGLNGFNHKLFSMKSAGHHHRKW-STEKLPADRMLSWYKANFKAPLGKDPVIVDLNGLG 648
Query: 654 KGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRS 713
KG W+NG+ IGRYWP +S + C +ECDYRG++ DKC CG+P+QRWYH+PRS
Sbjct: 649 KGEVWINGQSIGRYWP---SFNSSDEGCTEECDYRGEYGSDKCAFMCGKPTQRWYHVPRS 705
Query: 714 WFKPS-ENILVIFEEKGGDPTKITF 737
+ N + +FEE GGDP+ + F
Sbjct: 706 FLNDKGHNTITLFEEMGGDPSMVKF 730
>gi|297808143|ref|XP_002871955.1| beta-galactosidase 7 [Arabidopsis lyrata subsp. lyrata]
gi|297317792|gb|EFH48214.1| beta-galactosidase 7 [Arabidopsis lyrata subsp. lyrata]
Length = 826
Score = 687 bits (1774), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/746 (46%), Positives = 468/746 (62%), Gaps = 27/746 (3%)
Query: 1 MKPRTPIAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWP 60
MK + +L +S + + V++D R++ ING+R +++S +IHYPRS MWP
Sbjct: 1 MKMKHFTRLLSLFFILITSFSLANSTIVSHDERAITINGKRRILLSGSIHYPRSTADMWP 60
Query: 61 GLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFV 120
L+ +AK+GG++ IE+YVFWN HE +Y F G ++V+FIK IQ A +Y +LRIGP+V
Sbjct: 61 DLINKAKDGGLDAIETYVFWNAHEPKRREYDFSGNLDVVRFIKTIQDAGLYSVLRIGPYV 120
Query: 121 AAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQ 180
AE+NYGG PVWLH +P FR F MQ F T IV+MMK EKLFASQGGPIILAQ
Sbjct: 121 CAEWNYGGFPVWLHNMPNMKFRTVNPSFMNEMQNFTTKIVEMMKEEKLFASQGGPIILAQ 180
Query: 181 VENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF 240
+ENEYG S YG GK Y W A MA + +IGVPW+MCQQ + P P++ TCN FYCDQ+
Sbjct: 181 IENEYGNVISSYGAAGKAYIDWCANMANSLDIGVPWLMCQQPNAPQPMLETCNGFYCDQY 240
Query: 241 TPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNF 300
P +PS PK+WTENW GWFK +GG+ P+R +ED+AFSVARFFQ GG+ NYYMYHGGTNF
Sbjct: 241 EPTNPSTPKMWTENWTGWFKNWGGKHPYRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNF 300
Query: 301 GRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSS 360
GR AGGP+ITTSYDY APIDE+G PKWGHLK+LH +K E +L G S + LG+S
Sbjct: 301 GRVAGGPYITTSYDYHAPIDEFGNLNQPKWGHLKQLHRVLKSMEKSLTYGNISRIDLGNS 360
Query: 361 QEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRA 420
+A +Y G+ + F+ N++ + V F+ YH+PAWSVS+LP+C K +NTA V
Sbjct: 361 IKATIYTTKEGS-SCFIGNVNATANALVNFKGKDYHVPAWSVSVLPECDKEAYNTAKVNT 419
Query: 421 QSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAG---IWGEADFVKSGFVDHINTTKD 477
Q+S M ++ +P + L+W E A + D + G VD + T D
Sbjct: 420 QTSI--MTEDSSKPEK---------LEWTWRPESAQKMILKSSGDLIAKGLVDQKDVTND 468
Query: 478 TTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKY 537
+DYLWY T + +++ + L + S H LHA+ N + G+ +++
Sbjct: 469 ASDYLWYMTRVHLDKKDPLWSRNM--TLRVHSNAHVLHAYVNGKYVGNQFVKDGKFDYRF 526
Query: 538 KNPIS-LKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTL---DLSTYS 592
+ ++ L G N I+LLS++VGLQN G F+E GI V + G+ DLS +
Sbjct: 527 EKKVNHLVHGTNHISLLSVSVGLQNYGAFFESGPTGINGPVSLVGYKGEETIEKDLSQHQ 586
Query: 593 WTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKM 652
W YKIGL G + +++ +I W + M P ++ LTWYKA K P G EP+ +D +
Sbjct: 587 WDYKIGLNGYNNKLFSTKSVGHIKWANEMFPT-SRMLTWYKAKFKAPLGKEPVIVDFNGL 645
Query: 653 GKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPR 712
GKG AW+NG+ IGRYWP +S D C ECDYRG++ DKC CGEP+QRWYH+PR
Sbjct: 646 GKGEAWINGQSIGRYWP---SFNSSDDGCKDECDYRGEYGSDKCAFMCGEPTQRWYHVPR 702
Query: 713 SWFKPS-ENILVIFEEKGGDPTKITF 737
S+ K S N + +FEE GG+P+ + F
Sbjct: 703 SFLKASGHNTITLFEEMGGNPSMVNF 728
>gi|79517234|ref|NP_568399.4| beta-galactosidase 7 [Arabidopsis thaliana]
gi|152013363|sp|Q9SCV5.2|BGAL7_ARATH RecName: Full=Beta-galactosidase 7; Short=Lactase 7; Flags:
Precursor
gi|332005497|gb|AED92880.1| beta-galactosidase 7 [Arabidopsis thaliana]
Length = 826
Score = 685 bits (1768), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/746 (46%), Positives = 466/746 (62%), Gaps = 27/746 (3%)
Query: 1 MKPRTPIAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWP 60
MK + +L +S++ + V++D R++ ING+R +++S +IHYPRS MWP
Sbjct: 1 MKMKHFTRLLSLFFILITSLSLAKSTIVSHDERAITINGKRRILLSGSIHYPRSTADMWP 60
Query: 61 GLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFV 120
L+ +AK+GG++ IE+YVFWN HE +Y F G ++V+FIK IQ A +Y +LRIGP+V
Sbjct: 61 DLINKAKDGGLDAIETYVFWNAHEPKRREYDFSGNLDVVRFIKTIQDAGLYSVLRIGPYV 120
Query: 121 AAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQ 180
AE+NYGG PVWLH +P FR F MQ F T IV MMK EKLFASQGGPIILAQ
Sbjct: 121 CAEWNYGGFPVWLHNMPNMKFRTVNPSFMNEMQNFTTKIVKMMKEEKLFASQGGPIILAQ 180
Query: 181 VENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF 240
+ENEYG S YG GK Y W A MA + +IGVPW+MCQQ + P P++ TCN FYCDQ+
Sbjct: 181 IENEYGNVISSYGAEGKAYIDWCANMANSLDIGVPWLMCQQPNAPQPMLETCNGFYCDQY 240
Query: 241 TPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNF 300
P +PS PK+WTENW GWFK +GG+ P+R +ED+AFSVARFFQ GG+ NYYMYHGGTNF
Sbjct: 241 EPTNPSTPKMWTENWTGWFKNWGGKHPYRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNF 300
Query: 301 GRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSS 360
GR AGGP+ITTSYDY AP+DE+G PKWGHLK+LH +K E +L G S + LG+S
Sbjct: 301 GRVAGGPYITTSYDYHAPLDEFGNLNQPKWGHLKQLHTVLKSMEKSLTYGNISRIDLGNS 360
Query: 361 QEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRA 420
+A +Y G+ + F+ N++ D V F+ YH+PAWSVS+LPDC K +NTA V
Sbjct: 361 IKATIYTTKEGS-SCFIGNVNATADALVNFKGKDYHVPAWSVSVLPDCDKEAYNTAKVNT 419
Query: 421 QSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAG---IWGEADFVKSGFVDHINTTKD 477
Q+S M ++ +P L+W E A + G D + G VD + T D
Sbjct: 420 QTSI--MTEDSSKPER---------LEWTWRPESAQKMILKGSGDLIAKGLVDQKDVTND 468
Query: 478 TTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKY 537
+DYLWY T + +++ + L + S H LHA+ N + G+ +++
Sbjct: 469 ASDYLWYMTRLHLDKKDPLWSRNM--TLRVHSNAHVLHAYVNGKYVGNQFVKDGKFDYRF 526
Query: 538 KNPIS-LKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTL---DLSTYS 592
+ ++ L G N I+LLS++VGLQN GPF+E GI V + G+ DLS +
Sbjct: 527 ERKVNHLVHGTNHISLLSVSVGLQNYGPFFESGPTGINGPVSLVGYKGEETIEKDLSQHQ 586
Query: 593 WTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKM 652
W YKIGL G + +++ + W + + P + LTWYKA K P G EP+ +D+ +
Sbjct: 587 WDYKIGLNGYNDKLFSIKSVGHQKWANE-KLPTGRMLTWYKAKFKAPLGKEPVIVDLNGL 645
Query: 653 GKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPR 712
GKG AW+NG+ IGRYWP +S D C ECDYRG + DKC CG+P+QRWYH+PR
Sbjct: 646 GKGEAWINGQSIGRYWP---SFNSSDDGCKDECDYRGAYGSDKCAFMCGKPTQRWYHVPR 702
Query: 713 SWFKPS-ENILVIFEEKGGDPTKITF 737
S+ S N + +FEE GG+P+ + F
Sbjct: 703 SFLNASGHNTITLFEEMGGNPSMVNF 728
>gi|357484445|ref|XP_003612510.1| Beta-galactosidase [Medicago truncatula]
gi|355513845|gb|AES95468.1| Beta-galactosidase [Medicago truncatula]
Length = 828
Score = 684 bits (1765), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/722 (48%), Positives = 455/722 (63%), Gaps = 26/722 (3%)
Query: 23 CFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNG 82
C A V YDS +LIING R LI S AIHYPRS MWP LVQ+AK+GG++ IE+Y+FW+
Sbjct: 20 CTALEVKYDSNALIINGERRLIFSGAIHYPRSTVDMWPDLVQKAKDGGLDAIETYIFWDR 79
Query: 83 HELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFR 142
HE G+Y F G + VKF K IQ+A +Y I+RIGP+ AE+NYGG PVWLH IPG R
Sbjct: 80 HEQVRGRYNFSGNLDFVKFFKTIQEAGLYGIIRIGPYSCAEWNYGGFPVWLHQIPGIEMR 139
Query: 143 NDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALW 202
D +K MQ F+T I+++ K LFASQGGPIILAQ+ENEYG + E GK Y W
Sbjct: 140 TDNAAYKNEMQIFVTKIINVAKEANLFASQGGPIILAQIENEYGDIMWNFKEPGKAYIKW 199
Query: 203 AAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTF 262
AA+MA+AQNIGVPW MCQQ D P P+INTCN +YC F P++P PK++TENW GWF+ +
Sbjct: 200 AAQMALAQNIGVPWFMCQQNDAPQPIINTCNGYYCHNFKPNNPKSPKMFTENWIGWFQKW 259
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
G R PHR +ED A++VARFFQ GG +NYYMYHGGTNFGRT+GGP+I TSYDY+API+EY
Sbjct: 260 GERAPHRTAEDSAYAVARFFQNGGVFNNYYMYHGGTNFGRTSGGPYIITSYDYDAPINEY 319
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLN-GERSNLSLGSSQEADVYADSSGACAAFLANMD 381
G PK+GHLK LH AIKL E L N R++ LG+ Y +S GA FL+N
Sbjct: 320 GNLNQPKYGHLKFLHEAIKLGEKVLTNYTSRNDKDLGNGITLTTYTNSVGARFCFLSNDK 379
Query: 382 DKNDKTVVFRNV-SYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPD 440
D D V +N Y +PAWSV+IL C K VFNTA V +Q+S +E +N ++ +
Sbjct: 380 DNTDGNVDLQNDGKYFVPAWSVTILDGCNKEVFNTAKVNSQTSIMEKKIDNSSTNKLT-- 437
Query: 441 NGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNG 500
W + + + G ++ T D +DYLWY TS+ +N+ N
Sbjct: 438 -----WAWIMEPKKDTMNGRGSIKAHQLLEQKELTLDASDYLWYMTSVDINDTS----NW 488
Query: 501 SRPVLLIESKGHALHAFANQELQG---SASGNGTHPPFKYKNPISLKAGKNEIALLSMTV 557
S L +E+ GH LH + N+ G S GN F Y+ +SLK G N I LLS TV
Sbjct: 489 SNANLHVETSGHTLHGYVNKRYIGYGHSQFGNN----FTYEKQVSLKNGTNIITLLSATV 544
Query: 558 GLQNAGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNI 615
GL N G ++ + GI+ VK+ G NS T+DLST +W++K+GL GE Y+ R+ +
Sbjct: 545 GLANYGARFDEIKTGISDGPVKLVGQNSVTIDLSTGNWSFKVGLNGEKRRFYDLQPRSGV 604
Query: 616 NWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKS 675
W +T P +PLTWYK K P G PI +D+ +GKG AW+NG+ IGRYW +
Sbjct: 605 AW-NTSSYPTGKPLTWYKTQFKSPLGPNPIVVDLQGLGKGHAWVNGKSIGRYWTSWITST 663
Query: 676 SPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
+ C CDYRG + +KC TGC PSQRWYH+PRS+ N L++FEE GG+P +
Sbjct: 664 AG---CSDTCDYRGNYKKEKCNTGCASPSQRWYHVPRSFLNDDMNTLILFEEIGGNPQNV 720
Query: 736 TF 737
+F
Sbjct: 721 SF 722
>gi|449452767|ref|XP_004144130.1| PREDICTED: beta-galactosidase 15-like [Cucumis sativus]
Length = 827
Score = 682 bits (1760), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/718 (47%), Positives = 456/718 (63%), Gaps = 22/718 (3%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+Y +R + I+G+ ++ +S +IHYPRS P MWP L++++KEGG++TIE+YVFWN HE
Sbjct: 26 VSYTNRGITIDGQPKIFLSGSIHYPRSTPQMWPDLIKKSKEGGLDTIETYVFWNAHEPVR 85
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
+Y F +LV+FIK IQ +Y +LRIGP+V AE+NYGG PVWLH +PG T P
Sbjct: 86 RQYDFSANLDLVRFIKTIQNEGLYAVLRIGPYVCAEWNYGGFPVWLHNLPGIEELRTTNP 145
Query: 148 -FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
F MQ F TLIVDMMK+E LFASQGGPIILAQ+ENEYG + YG+ GK Y W A M
Sbjct: 146 VFMNEMQNFTTLIVDMMKQENLFASQGGPIILAQIENEYGNVMTSYGDAGKAYVNWCANM 205
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
A +QN+GVPWIMCQQ D P+P INTCN +YCDQFTP++ PK+WTENW GWFK++GGRD
Sbjct: 206 ADSQNVGVPWIMCQQDDAPEPTINTCNGWYCDQFTPNNAKSPKMWTENWTGWFKSWGGRD 265
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
P R ED+AFSVARFFQ GG+ NYYMYHGGTNF R AGGP+ITT+YDY AP+DEYG
Sbjct: 266 PVRTPEDLAFSVARFFQLGGTFQNYYMYHGGTNFDRMAGGPYITTTYDYNAPLDEYGNLN 325
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDK 386
PK+GHLK+LH A+K E AL++G + L S YA G + F +N+++ D
Sbjct: 326 QPKFGHLKQLHAALKSIEKALVSGNVTTTDLTDSVSITEYATDKGK-SCFFSNINETTDA 384
Query: 387 TVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL 446
V + +++PAWSVSILPDC++ V+NTA V Q+S + E +N + L
Sbjct: 385 LVNYLGKDFNVPAWSVSILPDCQEEVYNTAKVNTQTSV-------MVKKENKAENEPEVL 437
Query: 447 KWQVFKE---IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
+W E G+ + +D + D +DYLWY TS+ + + + N
Sbjct: 438 EWMWRPENIDNTARLGKGQVTANKLIDQKDAANDASDYLWYMTSVNLKKKDPIWSN--EM 495
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
L I GH +HAF N E GS + + ++ + LK GKN I+LLS T+GL+N G
Sbjct: 496 TLRINVSGHIVHAFVNGEHIGSQWASYDVYNYIFEQEVKLKPGKNIISLLSATIGLKNYG 555
Query: 564 PFYEWVGAGITS-VKITGFNSGTL---DLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVS 619
Y+ + +GI V++ G + DLS + W+Y++GL G +++P R W S
Sbjct: 556 AQYDLIQSGIVGPVQLIGRHGDETIIKDLSNHKWSYEVGLHGFENRLFSPESRFATKWQS 615
Query: 620 TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHD 679
P N+ +TWYK K P G +P+ LD+ +GKG+AW+NG IGRYWP + D
Sbjct: 616 G-NLPVNRMMTWYKTTFKPPLGTDPVTLDLQGLGKGMAWVNGHSIGRYWPSFIAEDGCSD 674
Query: 680 ECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITF 737
E CDYRG + KC+ CG+P+Q+WYH+PRSW +N LV+FEE GG+P+ + F
Sbjct: 675 E---PCDYRGSYTNTKCVRDCGKPTQQWYHVPRSWLNEGDNTLVLFEEFGGNPSLVNF 729
>gi|449529387|ref|XP_004171681.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 15-like [Cucumis
sativus]
Length = 827
Score = 681 bits (1757), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/718 (47%), Positives = 455/718 (63%), Gaps = 22/718 (3%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+Y +R + I+G+ ++ +S +IHYPRS P MWP L++++KEGG++TIE+YVFWN HE
Sbjct: 26 VSYTNRGITIDGQPKIFLSGSIHYPRSTPQMWPDLIKKSKEGGLDTIETYVFWNAHEPVR 85
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
+Y F +LV+FIK IQ +Y +LRIGP+V AE+NYGG PVWLH +PG T P
Sbjct: 86 RQYDFSANLDLVRFIKTIQNEGLYAVLRIGPYVCAEWNYGGFPVWLHNLPGIEELRTTNP 145
Query: 148 -FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
F MQ F TLIVDMMK+E LFASQGGPIILAQ+ENEYG + YG+ GK Y W A M
Sbjct: 146 VFMNEMQNFTTLIVDMMKQENLFASQGGPIILAQIENEYGNVMTSYGDAGKAYVNWCANM 205
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
A +QN+GVPWIMCQQ D P+P INTCN +YCDQFTP++ PK+WTENW GWFK++GGRD
Sbjct: 206 ADSQNVGVPWIMCQQDDAPEPTINTCNGWYCDQFTPNNAKSPKMWTENWTGWFKSWGGRD 265
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
P R ED+AFSVARFFQ GG+ NYYMYHGGTNF R AGGP+ITT+YDY AP+DEYG
Sbjct: 266 PVRTPEDLAFSVARFFQLGGTFQNYYMYHGGTNFDRMAGGPYITTTYDYNAPLDEYGNLN 325
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDK 386
PK+GHLK+LH A+K E AL++G + L S YA G + F +N+++ D
Sbjct: 326 QPKFGHLKQLHAALKSIEKALVSGNVTTTDLTDSVSITEYATDKGK-SCFFSNINETTDA 384
Query: 387 TVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL 446
V + +++PAWSVSILPDC++ V+NTA V Q+S + E +N + L
Sbjct: 385 LVNYLGKDFNVPAWSVSILPDCQEEVYNTAKVNTQTSV-------MVKKENKAENEPEVL 437
Query: 447 KWQVFKE---IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
+W E G+ + +D + D +DYLWY TS+ + + + N
Sbjct: 438 EWMWRPENIDNTARLGKGQVTANKLIDQKDAANDASDYLWYMTSVNLKKKDPIWSN--EM 495
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
L I GH +HAF N E GS + + + + LK GKN I+LLS T+GL+N G
Sbjct: 496 TLRINVSGHIVHAFVNGEHIGSQWASYDVYNYIXEQEVKLKPGKNIISLLSATIGLKNYG 555
Query: 564 PFYEWVGAGITS-VKITGFNSGTL---DLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVS 619
Y+ + +GI V++ G + DLS + W+Y++GL G +++P R W S
Sbjct: 556 AQYDLIQSGIVGPVQLIGRHGDETIIKDLSNHKWSYEVGLHGFENRLFSPESRFATKWQS 615
Query: 620 TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHD 679
P N+ +TWYK K P G +P+ LD+ +GKG+AW+NG IGRYWP + D
Sbjct: 616 G-NLPVNRMMTWYKTTFKPPLGTDPVTLDLQGLGKGMAWVNGHSIGRYWPSFIAEDGCSD 674
Query: 680 ECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITF 737
E CDYRG + KC+ CG+P+Q+WYH+PRSW +N LV+FEE GG+P+ + F
Sbjct: 675 E---PCDYRGSYTNTKCVRDCGKPTQQWYHVPRSWLNEGDNTLVLFEEFGGNPSLVNF 729
>gi|414865884|tpg|DAA44441.1| TPA: hypothetical protein ZEAMMB73_968467 [Zea mays]
Length = 641
Score = 681 bits (1756), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/619 (54%), Positives = 433/619 (69%), Gaps = 12/619 (1%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A NVTYD R+L+I+G R +++S +IHYPRS P MWPGL+Q+AK+GG++ IE+YVFW+ HE
Sbjct: 27 AANVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKAKDGGLDVIETYVFWDIHE 86
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
G+Y F GR +L F+K + A +Y+ LRIGP+V AE+NYGG P+WLH+IPG FR D
Sbjct: 87 PVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTD 146
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK MQ+F +VD MK L+ASQGGPIIL+Q+ENEYG +S YG GK Y WAA
Sbjct: 147 NEPFKAEMQRFTAKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAPGKAYMRWAA 206
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MAV+ + GVPW+MCQQ D PDP+INTCN FYCDQFTP+S + PK+WTENW GWF +FGG
Sbjct: 207 GMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGWFLSFGG 266
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
P+RP ED+AF+VARF+Q+GG+ NYYMYHGGTN R++GGPFI TSYDY+APIDEYGL
Sbjct: 267 AVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAPIDEYGL 326
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
R PKWGHL+++H AIKLCE AL+ + S SLG + EA VY S CAAFLAN+D ++
Sbjct: 327 VRQPKWGHLRDVHKAIKLCEPALIATDPSYTSLGPNVEAAVYKVGS-VCAAFLANIDGQS 385
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG-- 442
DKTV F Y LPAWSVSILPDCK VV NTA + +Q++ EM L+ S + D
Sbjct: 386 DKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQINSQTTGSEM--RYLESSNVASDGSFV 443
Query: 443 ---SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKN 499
W E GI + K+G ++ INTT D +D+LWY+TSI V +E +L N
Sbjct: 444 TPELAVSDWSYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITVKGDEPYL-N 502
Query: 500 GSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGL 559
GS+ L + S GH L + N ++ GSA G+ + ++ PI L GKN+I LLS TVGL
Sbjct: 503 GSQSNLAVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSATVGL 562
Query: 560 QNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV 618
N G F++ VGAGIT VK++G N G LDLS+ WTY+IGL+GE L +Y+P + WV
Sbjct: 563 SNYGAFFDLVGAGITGPVKLSGLN-GALDLSSAEWTYQIGLRGEDLHLYDPS-EASPEWV 620
Query: 619 STMEPPKNQPLTWYKAVVK 637
S P N PL WYK ++
Sbjct: 621 SANAYPINHPLIWYKVSME 639
>gi|296082606|emb|CBI21611.3| unnamed protein product [Vitis vinifera]
Length = 729
Score = 680 bits (1755), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/723 (48%), Positives = 452/723 (62%), Gaps = 37/723 (5%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
VTYD RSLII+G R+++ S +IHYPRS P MW L+ +AKEGGV+ I++YVFWN HE
Sbjct: 24 AQVTYDGRSLIIDGHRKILFSGSIHYPRSTPQMWASLIAKAKEGGVDVIQTYVFWNRHEP 83
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
PG+Y F GR++L KFIK IQ +Y LRIGPF+ +E++YGG+P WLH + G V+R D
Sbjct: 84 QPGQYDFNGRYDLAKFIKEIQAQGLYACLRIGPFIESEWSYGGLPFWLHDVHGIVYRTDN 143
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
EPFK++MQ F T IV++MK E L+ASQGGPIIL+Q+ENEY E+ + E G Y WAAK
Sbjct: 144 EPFKFYMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYQNIEAAFNEKGPSYVRWAAK 203
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQ-FT-PHSPSMPKIWTENWPGWFKTFG 263
MAV GVPW+MC+Q D PDPVINTCN C Q FT P+SP+ P +WTENW +++ FG
Sbjct: 204 MAVELQTGVPWVMCKQSDAPDPVINTCNGMRCGQTFTGPNSPNKPSMWTENWTSFYEVFG 263
Query: 264 GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYG 323
G R +EDIAF VA F + GS NYYMYHGGTNFGR A +I TSY +AP+DEYG
Sbjct: 264 GETYLRSAEDIAFHVALFIARNGSYVNYYMYHGGTNFGR-ASSAYIKTSYYDQAPLDEYG 322
Query: 324 LPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDK 383
L R PKWGHLKELH AI LC LLNG +SN+SLG QEA V+ + G C AFL N D+
Sbjct: 323 LIRQPKWGHLKELHAAITLCSTPLLNGVQSNISLGQLQEAYVFQEEMGGCVAFLVNNDEG 382
Query: 384 NDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGS 443
N+ TV+F+NVS L S+SILPDCK V+FNTA V + S + L S +
Sbjct: 383 NNSTVLFQNVSIELLPKSISILPDCKNVIFNTAKVCSSSRQSAYKIQELSRSCIQSFDAV 442
Query: 444 KGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
+W+ +K+ + + + ++H+N TKD +DYLWYT N + + P
Sbjct: 443 D--RWEEYKDAIPNFLDTSLKSNMILEHMNMTKDESDYLWYTFRFQPN------SSCTEP 494
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
+L IES HA+HAF N G+ G+ F +K+PISL N I++LS+ VG ++G
Sbjct: 495 LLHIESLAHAVHAFVNNIYVGATHGSHDMKGFTFKSPISLNNEMNNISILSVMVGFPDSG 554
Query: 564 PFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEP 623
+ E AG+T V+I G D + Y+W Y++GL GE L IY +N+ W T E
Sbjct: 555 AYLESRFAGLTRVEIQCTEKGIYDFANYTWGYQVGLSGEKLHIYKEENLSNVEWRKT-EI 613
Query: 624 PKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQ 683
NQPLTWYK V P GD+P+ L++ MGKG AW+NG+ IGRYW
Sbjct: 614 STNQPLTWYKIVFNTPSGDDPVALNLSTMGKGEAWVNGQSIGRYWV-------------- 659
Query: 684 ECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
F+ K G+PSQ YH+PR++ K SEN+LV+ EE GDP I+ +
Sbjct: 660 ------SFHNSK-----GDPSQTLYHVPRAFLKTSENLLVLLEEANGDPLHISLETISRT 708
Query: 744 GFP 746
P
Sbjct: 709 DLP 711
>gi|225438369|ref|XP_002274012.1| PREDICTED: beta-galactosidase 6-like [Vitis vinifera]
Length = 758
Score = 677 bits (1747), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/723 (48%), Positives = 452/723 (62%), Gaps = 44/723 (6%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
VTYD RSLII+G R+++ S +IHYPRS P MW L+ +AKEGGV+ I++YVFWN HE
Sbjct: 60 AQVTYDGRSLIIDGHRKILFSGSIHYPRSTPQMWASLIAKAKEGGVDVIQTYVFWNRHEP 119
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
PG+Y F GR++L KFIK IQ +Y LRIGPF+ +E++YGG+P WLH + G V+R D
Sbjct: 120 QPGQYDFNGRYDLAKFIKEIQAQGLYACLRIGPFIESEWSYGGLPFWLHDVHGIVYRTDN 179
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
EPFK++MQ F T IV++MK E L+ASQGGPIIL+Q+ENEY E+ + E G Y WAAK
Sbjct: 180 EPFKFYMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYQNIEAAFNEKGPSYVRWAAK 239
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQ-FT-PHSPSMPKIWTENWPGWFKTFG 263
MAV GVPW+MC+Q D PDPVINTCN C Q FT P+SP+ P +WTENW +++ FG
Sbjct: 240 MAVELQTGVPWVMCKQSDAPDPVINTCNGMRCGQTFTGPNSPNKPSMWTENWTSFYEVFG 299
Query: 264 GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYG 323
G R +EDIAF VA F + GS NYYMYHGGTNFGR A +I TSY +AP+DEYG
Sbjct: 300 GETYLRSAEDIAFHVALFIARNGSYVNYYMYHGGTNFGR-ASSAYIKTSYYDQAPLDEYG 358
Query: 324 LPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDK 383
L R PKWGHLKELH AI LC LLNG +SN+SLG QEA V+ + G C AFL N D+
Sbjct: 359 LIRQPKWGHLKELHAAITLCSTPLLNGVQSNISLGQLQEAYVFQEEMGGCVAFLVNNDEG 418
Query: 384 NDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGS 443
N+ TV+F+NVS L S+SILPDCK V+FNTA + + E + S S D
Sbjct: 419 NNSTVLFQNVSIELLPKSISILPDCKNVIFNTAKINTGYN------ERIATSSQSFDAVD 472
Query: 444 KGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
+ W+ +K+ + + + ++H+N TKD +DYLWYT N + + P
Sbjct: 473 R---WEEYKDAIPNFLDTSLKSNMILEHMNMTKDESDYLWYTFRFQPN------SSCTEP 523
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
+L IES HA+HAF N G+ G+ F +K+PISL N I++LS+ VG ++G
Sbjct: 524 LLHIESLAHAVHAFVNNIYVGATHGSHDMKGFTFKSPISLNNEMNNISILSVMVGFPDSG 583
Query: 564 PFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEP 623
+ E AG+T V+I G D + Y+W Y++GL GE L IY +N+ W T E
Sbjct: 584 AYLESRFAGLTRVEIQCTEKGIYDFANYTWGYQVGLSGEKLHIYKEENLSNVEWRKT-EI 642
Query: 624 PKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQ 683
NQPLTWYK V P GD+P+ L++ MGKG AW+NG+ IGRYW
Sbjct: 643 STNQPLTWYKIVFNTPSGDDPVALNLSTMGKGEAWVNGQSIGRYWV-------------- 688
Query: 684 ECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
F+ K G+PSQ YH+PR++ K SEN+LV+ EE GDP I+ +
Sbjct: 689 ------SFHNSK-----GDPSQTLYHVPRAFLKTSENLLVLLEEANGDPLHISLETISRT 737
Query: 744 GFP 746
P
Sbjct: 738 DLP 740
>gi|224068510|ref|XP_002326135.1| predicted protein [Populus trichocarpa]
gi|222833328|gb|EEE71805.1| predicted protein [Populus trichocarpa]
Length = 824
Score = 676 bits (1744), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/717 (49%), Positives = 452/717 (63%), Gaps = 26/717 (3%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V YDS ++IING+R++I+S +IHYPRS MW L+Q+AKEGG++TIE+Y+FWN HE
Sbjct: 30 VEYDSSAVIINGQRKIILSGSIHYPRSTVEMWSDLIQKAKEGGLDTIETYIFWNAHERRR 89
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
+Y F G + VKF + +Q+A +Y ILRIGP+ AE+NYGG PVWLH IP FR D E
Sbjct: 90 REYNFTGNLDFVKFFQKVQEAGLYGILRIGPYACAEWNYGGFPVWLHNIPEIKFRTDNEI 149
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK MQ F T IV+M K KLFASQGGPIILAQ+ENEYG YGE GK Y W A+MA
Sbjct: 150 FKNEMQTFTTKIVNMAKEAKLFASQGGPIILAQIENEYGNVMGPYGEAGKSYVQWCAQMA 209
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
VAQNIGVPWIMCQQ D P VINTCN FYCD FTP+SP PK+WTENW GW+K +G +DP
Sbjct: 210 VAQNIGVPWIMCQQSDAPSSVINTCNGFYCDTFTPNSPKSPKMWTENWTGWYKKWGQKDP 269
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
HR +ED+AFSVARFFQ G + NYYMY+GGTNFGRT+GGPFI TSYDY+AP+DEYG
Sbjct: 270 HRTAEDLAFSVARFFQYNGVLQNYYMYYGGTNFGRTSGGPFIATSYDYDAPLDEYGNLNQ 329
Query: 328 PKWGHLKELHGAIKLCEHALLNG--ERSNLSLGSSQEADVYADSSGACAAFLANMDDKND 385
PKWGHLK LH A+KL E L N + + S G + ++ G FL+N
Sbjct: 330 PKWGHLKNLHAALKLGEKILTNSTVKTTKYSDGWVELTTYTSNIDGERLCFLSNTKMDGL 389
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSS-TVEMVPENLQPSEASPDNGSK 444
+ ++ Y +PAWSVSIL DC K +NTA V Q+S V+ + EN P + S
Sbjct: 390 DVDLQQDGKYFVPAWSVSILQDCNKETYNTAKVNVQTSLIVKKLHENDTPLKLS------ 443
Query: 445 GLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
+W A + G+ F + ++ T D +DYLWY TS V+ N KN
Sbjct: 444 -WEWAPEPTKAPLHGQGGFKATQLLEQKAATYDESDYLWYMTS--VDNNGTASKN---VT 497
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
L ++ G LHAF N + GS G F ++ P LK G N I+LLS TVGLQN G
Sbjct: 498 LRVKYSGQFLHAFVNGKEIGSQHGY----TFTFEKPALLKPGTNIISLLSATVGLQNYGE 553
Query: 565 FYEWVGAGITSVKITGFNSG--TLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
F++ GI + +SG T DLS+ W+YK+GL GE Y+P WVS
Sbjct: 554 FFDEGPEGIAGGPVELIDSGNTTTDLSSNEWSYKVGLNGEGGRFYDP-TSGRAKWVSG-N 611
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
+ +TWYK + P G EP+ +D+ MGKG AW+NG +GR+WP ++ + C
Sbjct: 612 LRVGRAMTWYKTTFQAPSGTEPVVVDLQGMGKGHAWVNGNSLGRFWP---ILTADPNGCD 668
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSI 739
+CDYRG++ KC++ CG P+QRWYH+PRS+ N L++FEE GG+P+ ++F I
Sbjct: 669 GKCDYRGQYKEGKCLSNCGNPTQRWYHVPRSFLNNGSNTLILFEEIGGNPSDVSFQI 725
>gi|168045683|ref|XP_001775306.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673387|gb|EDQ59911.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 831
Score = 675 bits (1741), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/721 (47%), Positives = 464/721 (64%), Gaps = 33/721 (4%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A V+YD R+L ++G R +++S +IHYPRS P MWPGL+ +AK+GG++ I++YVFW+GHE
Sbjct: 22 AVTVSYDQRALKLDGNRRMLVSGSIHYPRSTPTMWPGLIAKAKKGGLDVIQTYVFWSGHE 81
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
+ G Y F GR++L KF++++ +A MY+ LRIGP+V AE+N+GG P WL ++PG FR D
Sbjct: 82 PTQGVYNFAGRYDLPKFLRLVHEAGMYVNLRIGPYVCAEWNFGGFPGWLRFLPGIEFRTD 141
Query: 145 TEPFKYHM-QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
E FK H+ F + ++ + R F Q +I AQ+ENEYG ++ YGE G++Y W
Sbjct: 142 NESFKVHLSHSFTSSLISVYSRS--FNIQ--LVICAQIENEYGSIDAVYGEAGQKYLNWI 197
Query: 204 AKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFG 263
A MAVA NI VPWIMC Q D P VI+TCN FYCD F P+S P +WTENW GWF+++G
Sbjct: 198 ANMAVATNISVPWIMCNQPDAPPSVIDTCNGFYCDGFRPNSEGKPALWTENWTGWFQSWG 257
Query: 264 GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYG 323
P RP +DIAF+VARFFQKGGS +YYMYHGGTNF R+A +TT+YDY+APIDEYG
Sbjct: 258 EGAPTRPVQDIAFAVARFFQKGGSFMHYYMYHGGTNFERSA-MEGVTTNYDYDAPIDEYG 316
Query: 324 LPRNPKWGHLKELHGAIKLCEHALLNGER--SNLSLGSSQEADVYADSSGACAAFLANMD 381
R PKWGHLK+LH A+KLCE L+ + S +SLG QEA VY S+GACAAFLA+
Sbjct: 317 DVRQPKWGHLKDLHAALKLCELCLVGVDTVPSEISLGPYQEAHVYNSSTGACAAFLASW- 375
Query: 382 DKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDN 441
+D TV+F+ SY LPAWSVSILPDCK VVFNTA V QS T+ M A P
Sbjct: 376 GTDDSTVLFQGQSYDLPAWSVSILPDCKSVVFNTAKVGVQSMTMTM-------QSAIPVT 428
Query: 442 GSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNG- 500
W ++E WG + F + V+ I TTKDTTDYLWYTT++ V E++ NG
Sbjct: 429 -----NWVSYREPLEPWG-STFSTNELVEQIATTKDTTDYLWYTTNVEVAESDA--PNGL 480
Query: 501 SRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQ 560
++ L++ A H F N+ L G+ S +G+ + ISL+ G N + +LSMT GLQ
Sbjct: 481 AQATLVMSYLRDAAHIFVNKWLTGTKSAHGS----EASQSISLRPGINSVKVLSMTTGLQ 536
Query: 561 NAGPFYEWVGAGIT-SVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVS 619
GPF E AGI +++ G SG + + +WTY++GLQGE+ ++ + W +
Sbjct: 537 GTGPFLEKEKAGIQFGIRVEGLPSGAIIMQRNTWTYQVGLQGENNRLFESNGSLSAVWST 596
Query: 620 TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHD 679
+ + L+W+K P + + LD+ MGKG W+NG +GRYW S + D
Sbjct: 597 STDVSNQMSLSWFKTTFDMPERNGTVALDLSSMGKGQVWVNGINLGRYW---SSCIAHTD 653
Query: 680 ECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSI 739
CV CDYRG + KC+T CG+PSQ WYH+PR W +N+LV+FEE+ G+P IT +
Sbjct: 654 GCVDNCDYRGSHSESKCLTKCGQPSQSWYHVPREWLLSKQNLLVLFEEQEGNPEAITIAP 713
Query: 740 R 740
R
Sbjct: 714 R 714
>gi|357142911|ref|XP_003572734.1| PREDICTED: beta-galactosidase 1-like [Brachypodium distachyon]
Length = 831
Score = 673 bits (1736), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/727 (47%), Positives = 458/727 (62%), Gaps = 42/727 (5%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD R+L+I+G+R +I+S +IHYPRS P MWP L+Q+AK+GG+NTIE+YVFWNGHE P
Sbjct: 33 VSYDERALVIDGQRRIILSGSIHYPRSTPEMWPDLIQKAKDGGLNTIETYVFWNGHEPRP 92
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
+Y F G +++++F K +Q+A MY ILRIGP++ E+NYGG+P WL IP FR EP
Sbjct: 93 RQYNFEGNYDIMRFFKEVQKAGMYAILRIGPYICGEWNYGGLPAWLRDIPDMQFRLHNEP 152
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFY--GEGGKRYALWAAK 205
F+ M+ F TLIV+ MK +FA QGGPIIL Q+ENEYG +S E +Y W A
Sbjct: 153 FEREMETFTTLIVNKMKDANMFAGQGGPIILTQIENEYGNVQSNLPDQESATKYIHWCAD 212
Query: 206 MAVAQNIGVPWIMCQQF-DTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MA QN+GVPWIMCQQ D P VI TCN FYC F P +MPKIWTENW GWFK +
Sbjct: 213 MANKQNVGVPWIMCQQSNDVPPNVIETCNGFYCHDFKPKGSNMPKIWTENWTGWFKAWDK 272
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
D HRP+ED+A++VA FFQ GSV NYYMYHGGTNFGRT+GGP+ITT+YDY+AP+DEYG
Sbjct: 273 PDYHRPAEDVAYAVAMFFQNRGSVQNYYMYHGGTNFGRTSGGPYITTTYDYDAPLDEYGN 332
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
R PK+GHLK LH + E L+ G+++ +L +A Y G+ A F++N D
Sbjct: 333 IRQPKYGHLKALHTVLTSMEKHLVYGQQNETNLDDKVKATKYTLDDGSSACFISNSHDNK 392
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
D V F +Y +PAWSVS+LPDCK V +NTA V+ Q+S + ++ A+
Sbjct: 393 DVNVTFEGSAYQVPAWSVSVLPDCKTVAYNTAKVKTQTSVM------VKKESAA----KG 442
Query: 445 GLKWQVFKEI---AGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGS 501
GLKW E + F + ++ I T D +DYLWY TS+ E+F
Sbjct: 443 GLKWSWLPEFLRPSFTDSYGSFKSNELLEQIVTGADESDYLWYKTSLTRGPKEQF----- 497
Query: 502 RPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQN 561
L + + GH L+AF N EL G F+++ P++LK GKN I+LLS TVGL+N
Sbjct: 498 --TLYVNTTGHELYAFVNGELAGYKHAVNGPYLFQFEAPVTLKPGKNYISLLSATVGLKN 555
Query: 562 AGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY--NPGYRNNINW 617
G +E + AGI VK+ + T+DLS +WTYK GL GE I+ PG R W
Sbjct: 556 YGASFELMPAGIVGGPVKLVSAHGNTIDLSNNTWTYKTGLFGEQKQIHLDKPGLR----W 611
Query: 618 VSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSP 677
S P N+P TWYKA + P G E + +D++ + KG+ ++NG +GRYWP S +
Sbjct: 612 -SPFAVPTNRPFTWYKATFQAPAGTEAVVVDLVGLNKGVVYVNGHNLGRYWP--SYVAGD 668
Query: 678 HDECVQECDYRGKF----NPDKCITGCGEPSQRWYHIPRSWFKPSE---NILVIFEEKGG 730
D C CDYRG++ N +KC+TGCGE QR+YH+PRS+ + N +V+FEE GG
Sbjct: 669 MDGC-HRCDYRGEYVTWNNQEKCLTGCGEVGQRFYHVPRSFLNAAHGAPNTVVLFEEAGG 727
Query: 731 DPTKITF 737
DP K+ F
Sbjct: 728 DPAKVNF 734
>gi|225441062|ref|XP_002284027.1| PREDICTED: beta-galactosidase-like [Vitis vinifera]
Length = 833
Score = 672 bits (1735), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/726 (47%), Positives = 462/726 (63%), Gaps = 22/726 (3%)
Query: 23 CFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNG 82
C A +T D+R ++ING R+++IS ++HYPRS P MWP L+Q++K+GG+NTI++YVFW+
Sbjct: 25 CNADQITSDARGIMINGERKILISGSVHYPRSTPEMWPDLIQKSKDGGLNTIDTYVFWDL 84
Query: 83 HELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFR 142
HE +Y F G +LV+FIK IQ +Y +LRIGP+V AE+ YGG PVWLH P R
Sbjct: 85 HEPQRRQYDFTGNKDLVRFIKAIQAQGLYAVLRIGPYVCAEWTYGGFPVWLHNQPSIQLR 144
Query: 143 NDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALW 202
+ + MQ F T+IVDMMK+E+LFASQGGPII++Q+ENEYG Y + G +Y W
Sbjct: 145 TNNTVYMSEMQTFTTMIVDMMKKEQLFASQGGPIIISQIENEYGNVMRAYHDAGVQYINW 204
Query: 203 AAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTF 262
A+MA A + GVPWIMCQQ + P P+INTCN +YCDQFTP++P+ PK+WTENW GW+K +
Sbjct: 205 CAQMAAALDTGVPWIMCQQDNAPQPMINTCNGYYCDQFTPNNPNSPKMWTENWSGWYKNW 264
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
GG DPHR +ED+AFSVARF+Q GG+ NYYMYHGGTNFGRTAGGP+ITTSYDY+AP++EY
Sbjct: 265 GGSDPHRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLNEY 324
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
G PKWGHL++LH + E AL G+ N+ + A +Y+ G + F N +
Sbjct: 325 GNKNQPKWGHLRDLHLLLLSMEKALTYGDVKNVDYETLTSATIYS-YQGKSSCFFGNSNA 383
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
D T+ + V+Y +PAWSVSILPDC V+NTA V +Q ST + SEA +N
Sbjct: 384 DRDVTINYGGVNYTIPAWSVSILPDCSNEVYNTAKVNSQYSTFVK-----KGSEA--ENE 436
Query: 443 SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
L+W E F S +D +DT+DYL+Y T++ ++ ++ G
Sbjct: 437 PNSLQWTWRGETIQYITPGRFTASELLDQKTVAEDTSDYLYYMTTVDISNDDPIW--GKD 494
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
L + + GH LHAF N E G F+++ ++L+ GKNEI LLS TVGL N
Sbjct: 495 LTLSVNTSGHILHAFVNGEHIGYQYALLGQFEFQFRRSVTLQLGKNEITLLSATVGLTNY 554
Query: 563 GPFYEWVGAGITS-VKITGFNSGTLDL-----STYSWTYKIGLQGEHLGIYNPGYRNNIN 616
GP ++ V GI V+I N G+ D+ + W YK GL GE I+ R N
Sbjct: 555 GPDFDMVNQGIHGPVQIIASN-GSADIIKDLSNNNQWAYKAGLNGEDKKIFLGRARYN-Q 612
Query: 617 WVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSS 676
W S P N+ WYKA PPG++P+ +D++ +GKG AW+NG +GRYWP +
Sbjct: 613 WKSD-NLPVNRSFVWYKATFDAPPGEDPVVVDLMGLGKGEAWVNGHSLGRYWPSYIARG- 670
Query: 677 PHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKIT 736
+ C ECDYRG + +KC T CG PSQRWYH+PRS+ ++N LV+FEE GG+P+ +T
Sbjct: 671 --EGCSPECDYRGPYKAEKCNTNCGNPSQRWYHVPRSFLASTDNRLVLFEEFGGNPSSVT 728
Query: 737 FSIRKI 742
F +
Sbjct: 729 FQTVTV 734
>gi|6686886|emb|CAB64743.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 788
Score = 669 bits (1726), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/708 (47%), Positives = 447/708 (63%), Gaps = 27/708 (3%)
Query: 39 GRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNL 98
G+R +++S +IHYPRS MWP L+ +AK+GG++ IE+YVFWN HE +Y F G ++
Sbjct: 1 GKRRILLSGSIHYPRSTADMWPDLINKAKDGGLDAIETYVFWNAHEPKRREYDFSGNLDV 60
Query: 99 VKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTL 158
V+FIK IQ A +Y +LRIGP+V AE+NYGG PVWLH +P FR F MQ F T
Sbjct: 61 VRFIKTIQDAGLYSVLRIGPYVCAEWNYGGFPVWLHNMPNMKFRTVNPSFMNEMQNFTTK 120
Query: 159 IVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIM 218
IV MMK EKLFASQGGPIILAQ+ENEYG S YG GK Y W A MA + +IGVPW+M
Sbjct: 121 IVKMMKEEKLFASQGGPIILAQIENEYGNVISSYGAEGKAYIDWCANMANSLDIGVPWLM 180
Query: 219 CQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSV 278
CQQ + P P++ TCN FYCDQ+ P +PS PK+WTENW GWFK +GG+ P+R +ED+AFSV
Sbjct: 181 CQQPNAPQPMLETCNGFYCDQYEPTNPSTPKMWTENWTGWFKNWGGKHPYRTAEDLAFSV 240
Query: 279 ARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHG 338
ARFFQ GG+ NYYMYHGGTNFGR AGGP+ITTSYDY AP+DE+G PKWGHLK+LH
Sbjct: 241 ARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTSYDYHAPLDEFGNLNQPKWGHLKQLHT 300
Query: 339 AIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLP 398
+K E +L G S + LG+S +A +Y G+ + F+ N++ D V F+ YH+P
Sbjct: 301 VLKSMEKSLTYGNISRIDLGNSIKATIYTTKEGS-SCFIGNVNATADALVNFKGKDYHVP 359
Query: 399 AWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAG-- 456
AWSVS+LPDC K +NTA V Q+S M ++ +P L+W E A
Sbjct: 360 AWSVSVLPDCDKEAYNTAKVNTQTSI--MTEDSSKPER---------LEWTWRPESAQKM 408
Query: 457 -IWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALH 515
+ G D + G VD + T D +DYLWY T + +++ + L + S H LH
Sbjct: 409 ILKGSGDLIAKGLVDQKDVTNDASDYLWYMTRLHLDKKDPLWSRNM--TLRVHSNAHVLH 466
Query: 516 AFANQELQGSASGNGTHPPFKYKNPIS-LKAGKNEIALLSMTVGLQNAGPFYEWVGAGIT 574
A+ N + G+ ++++ ++ L G N I+LLS++VGLQN GPF+E GI
Sbjct: 467 AYVNGKYVGNQFVKDGKFDYRFERKVNHLVHGTNHISLLSVSVGLQNYGPFFESGPTGIN 526
Query: 575 S-VKITGFNSGTL---DLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLT 630
V + G+ DLS + W YKIGL G + +++ + W + + P + LT
Sbjct: 527 GPVSLVGYKGEETIEKDLSQHQWDYKIGLNGYNDKLFSIKSVGHQKWANE-KLPTGRMLT 585
Query: 631 WYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGK 690
WYKA K P G EP+ +D+ +GKG AW+NG+ IGRYWP +S D C ECDYRG
Sbjct: 586 WYKAKFKAPLGKEPVIVDLNGLGKGEAWINGQSIGRYWP---SFNSSDDGCKDECDYRGA 642
Query: 691 FNPDKCITGCGEPSQRWYHIPRSWFKPS-ENILVIFEEKGGDPTKITF 737
+ DKC CG+P+QRWYH+PRS+ S N + +FEE GG+P+ + F
Sbjct: 643 YGSDKCAFMCGKPTQRWYHVPRSFLNASGHNTITLFEEMGGNPSMVNF 690
>gi|297740029|emb|CBI30211.3| unnamed protein product [Vitis vinifera]
Length = 829
Score = 667 bits (1720), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/726 (47%), Positives = 461/726 (63%), Gaps = 26/726 (3%)
Query: 23 CFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNG 82
C A +T D+R ++ING R+++IS ++HYPRS P MWP L+Q++K+GG+NTI++YVFW+
Sbjct: 25 CNADQITSDARGIMINGERKILISGSVHYPRSTPEMWPDLIQKSKDGGLNTIDTYVFWDL 84
Query: 83 HELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFR 142
HE +Y F G +LV+FIK IQ +Y +LRIGP+V AE+ YGG PVWLH P R
Sbjct: 85 HEPQRRQYDFTGNKDLVRFIKAIQAQGLYAVLRIGPYVCAEWTYGGFPVWLHNQPSIQLR 144
Query: 143 NDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALW 202
+ + MQ F T+IVDMMK+E+LFASQGGPII++Q+ENEYG Y + G +Y W
Sbjct: 145 TNNTVYMSEMQTFTTMIVDMMKKEQLFASQGGPIIISQIENEYGNVMRAYHDAGVQYINW 204
Query: 203 AAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTF 262
A+MA A + GVPWIMCQQ + P P+INTCN +YCDQFTP++P+ PK+WTENW GW+K +
Sbjct: 205 CAQMAAALDTGVPWIMCQQDNAPQPMINTCNGYYCDQFTPNNPNSPKMWTENWSGWYKNW 264
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
GG DPHR +ED+AFSVARF+Q GG+ NYYMYHGGTNFGRTAGGP+ITTSYDY+AP++EY
Sbjct: 265 GGSDPHRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLNEY 324
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
G PKWGHL++LH + E AL G+ N+ + A +Y+ G + F N +
Sbjct: 325 GNKNQPKWGHLRDLHLLLLSMEKALTYGDVKNVDYETLTSATIYS-YQGKSSCFFGNSNA 383
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
D T+ + V+Y +PAWSVSILPDC V+NTA V +Q ST + SEA +N
Sbjct: 384 DRDVTINYGGVNYTIPAWSVSILPDCSNEVYNTAKVNSQYSTFVK-----KGSEA--ENE 436
Query: 443 SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
L+W E F S +D +DT+DYL+Y T+ N++ + G
Sbjct: 437 PNSLQWTWRGETIQYITPGRFTASELLDQKTVAEDTSDYLYYMTT---NDDPIW---GKD 490
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
L + + GH LHAF N E G F+++ ++L+ GKNEI LLS TVGL N
Sbjct: 491 LTLSVNTSGHILHAFVNGEHIGYQYALLGQFEFQFRRSVTLQLGKNEITLLSATVGLTNY 550
Query: 563 GPFYEWVGAGITS-VKITGFNSGTLDL-----STYSWTYKIGLQGEHLGIYNPGYRNNIN 616
GP ++ V GI V+I N G+ D+ + W YK GL GE I+ R N
Sbjct: 551 GPDFDMVNQGIHGPVQIIASN-GSADIIKDLSNNNQWAYKAGLNGEDKKIFLGRARYN-Q 608
Query: 617 WVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSS 676
W S P N+ WYKA PPG++P+ +D++ +GKG AW+NG +GRYWP +
Sbjct: 609 WKSD-NLPVNRSFVWYKATFDAPPGEDPVVVDLMGLGKGEAWVNGHSLGRYWPSYIARG- 666
Query: 677 PHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKIT 736
+ C ECDYRG + +KC T CG PSQRWYH+PRS+ ++N LV+FEE GG+P+ +T
Sbjct: 667 --EGCSPECDYRGPYKAEKCNTNCGNPSQRWYHVPRSFLASTDNRLVLFEEFGGNPSSVT 724
Query: 737 FSIRKI 742
F +
Sbjct: 725 FQTVTV 730
>gi|224142776|ref|XP_002324727.1| predicted protein [Populus trichocarpa]
gi|222866161|gb|EEF03292.1| predicted protein [Populus trichocarpa]
Length = 749
Score = 667 bits (1720), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 335/699 (47%), Positives = 448/699 (64%), Gaps = 26/699 (3%)
Query: 58 MWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIG 117
MWP L Q+AKEGG++ IE+Y+FW+ HE +YYF G ++VKF K+ Q+A +++ILRIG
Sbjct: 1 MWPELFQKAKEGGIDAIETYIFWDRHEPVRRQYYFSGNQDIVKFCKLAQEAGLHVILRIG 60
Query: 118 PFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPII 177
P+V AE++YGG P+WLH IPG R D E +K MQ F T IVD+ K KLFA QGGPII
Sbjct: 61 PYVCAEWSYGGFPMWLHNIPGIELRTDNEIYKNEMQIFTTKIVDVCKEAKLFAPQGGPII 120
Query: 178 LAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYC 237
LAQ+ENEYG YG+ G+RY W A+MAV QN+GVPWIMCQQ + P P+INTCN FYC
Sbjct: 121 LAQIENEYGNVMGPYGDAGRRYVNWCAQMAVGQNVGVPWIMCQQSNAPQPMINTCNGFYC 180
Query: 238 DQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGG 297
DQF P++P PK+WTENW GWFK +GGRDP+R +ED+AFSVARF Q GG +++YYMYHGG
Sbjct: 181 DQFKPNNPKSPKMWTENWSGWFKLWGGRDPYRTAEDLAFSVARFIQNGGVLNSYYMYHGG 240
Query: 298 TNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSL 357
TNFGRTAGGP+ITTSYDY AP+DEYG PKWGHLK+LH AIK E L NG ++ +
Sbjct: 241 TNFGRTAGGPYITTSYDYNAPLDEYGNLNQPKWGHLKQLHEAIKQGERILTNGTVTSKNF 300
Query: 358 GSSQEADVYADS-SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTA 416
+ Y + +G FL+N + + + ++ Y LPAWSV+IL DC K ++NTA
Sbjct: 301 WGGVDQTTYTNQGTGERFCFLSNTNMEEANVDLGQDGKYSLPAWSVTILQDCNKEIYNTA 360
Query: 417 NVRAQSS-TVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTT 475
V Q+S V+ + E +P + S W + G+ F + ++ TT
Sbjct: 361 KVNTQTSIMVKKLHEEDKPVQLS-------WTWAPEPMKGVLQGKGRFRATELLEQKETT 413
Query: 476 KDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGS---------A 526
DTTDYLWY TS VN NE LK + L + ++GH LHA+ N++ G+
Sbjct: 414 VDTTDYLWYMTS--VNLNETTLKKWTNVTLRVGTRGHTLHAYVNKKEIGTQFSKQANAQQ 471
Query: 527 SGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS--VKITGFNSG 584
S G F ++ P++L +G N I+LLS TVGL N G +Y+ GI V++
Sbjct: 472 SVKGDDYSFLFEKPVTLTSGTNTISLLSATVGLANYGQYYDKKPVGIAEGPVQLVANGKP 531
Query: 585 TLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEP 644
+DL++Y W+YKIGL GE +P + + ++ P + +TWYK P G EP
Sbjct: 532 FMDLTSYQWSYKIGLSGEAKRYNDPNSPHASKFTASDNLPTGRAMTWYKTTFASPSGTEP 591
Query: 645 IGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPS 704
+ +D+L MGKG AW+NG+ +GR+WP + + C CDYRG +N DKC+T CG PS
Sbjct: 592 VVVDLLGMGKGHAWVNGKSLGRFWPTQIADAKG---CPDTCDYRGSYNGDKCVTNCGNPS 648
Query: 705 QRWYHIPRSWF-KPSENILVIFEEKGGDPTKITFSIRKI 742
QRWYHIPRS+ K +N L++FEE GG+PT ++F I +
Sbjct: 649 QRWYHIPRSYLNKDGQNTLILFEEVGGNPTNVSFQIVAV 687
>gi|242057631|ref|XP_002457961.1| hypothetical protein SORBIDRAFT_03g023500 [Sorghum bicolor]
gi|241929936|gb|EES03081.1| hypothetical protein SORBIDRAFT_03g023500 [Sorghum bicolor]
Length = 830
Score = 665 bits (1717), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/726 (48%), Positives = 460/726 (63%), Gaps = 40/726 (5%)
Query: 30 YDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGK 89
Y+ R+++I+G+R +I+S +IHYPRS P MWP L+ +AKEGG+NTIE+YVFWNGHE +
Sbjct: 30 YNDRAVVIDGQRRIILSGSIHYPRSTPQMWPDLINKAKEGGLNTIETYVFWNGHEPRRRQ 89
Query: 90 YYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFK 149
Y F G +++V+F K IQ A M+ ILRIGP++ E+NYGG+P WL IPG FR +PF+
Sbjct: 90 YNFEGNYDIVRFFKEIQNAGMHAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNDPFE 149
Query: 150 YHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFY--GEGGKRYALWAAKMA 207
M+ F TLIV+ MK +FA QGGPIILAQ+ENEYG + +Y W A MA
Sbjct: 150 REMETFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGKLENNQSASQYIHWCADMA 209
Query: 208 VAQNIGVPWIMCQQ-FDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
Q IGVPWIMCQQ D P VINTCN FYC + P+ +PKIWTENW GWFK + D
Sbjct: 210 NKQKIGVPWIMCQQDNDVPHNVINTCNGFYCYDWFPNRTGIPKIWTENWTGWFKAWDKPD 269
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
HR +EDIAF+VA FFQK GSVHNYYMYHGGTNFGRT+GGP+ITTSYDY+AP+DEYG R
Sbjct: 270 FHRSAEDIAFAVAMFFQKRGSVHNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNIR 329
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDK 386
PK+GHLK+LH +K E L++GE + S G + Y G+ F++N D D
Sbjct: 330 QPKYGHLKDLHNLLKSMEKILVHGEYKDTSHGKNVTVTKYT-YGGSSVCFISNQFDDRDV 388
Query: 387 TVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL 446
V ++ +PAWSVSILPDCK V +NTA ++ Q+S ++ + E P+ L
Sbjct: 389 NVTLAG-THLVPAWSVSILPDCKTVAYNTAKIKTQTS---VMVKKANSVEKEPE----AL 440
Query: 447 KWQVFKEIAGIWGEAD---FVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
+W E + D F +S ++ I T+ D +DYLWY TS+ E GS
Sbjct: 441 RWSWMPENLKPFMTDDHGSFRQSRLLEQIATSTDQSDYLWYRTSL------EHKGEGSY- 493
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
L + + GH ++AF N +L G + F+ ++P+ L +GKN ++LLS TVGL+N G
Sbjct: 494 TLYVNTTGHKIYAFVNGKLVGQNQSSNGAFVFQLQSPVKLHSGKNYVSLLSGTVGLKNYG 553
Query: 564 PFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY--NPGYRNNINWVS 619
P +E V AGI VK+ G N +DL+ SW+YK GL GEH I+ PGY+ W S
Sbjct: 554 PLFELVPAGIAGGPVKLVGANDTAIDLTHSSWSYKSGLAGEHRQIHLDKPGYK----WRS 609
Query: 620 ---TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSS 676
+ P N+P TWYK P GDE + +D+L + KG AW+NG +GRYWP S ++
Sbjct: 610 HNGSGSIPVNRPFTWYKTTFAAPAGDEAVVVDLLGLNKGAAWVNGNSLGRYWP--SYTAA 667
Query: 677 PHDECVQECDYRGKFNPD----KCITGCGEPSQRWYHIPRSWFKPSE-NILVIFEEKGGD 731
C CDYRGKF + +C+TGCGEPSQR+YH+PRS+ + E N LV+FEE GGD
Sbjct: 668 EMGGCHGACDYRGKFKAEGDGIRCLTGCGEPSQRFYHVPRSFLRAGEPNTLVLFEEAGGD 727
Query: 732 PTKITF 737
P + F
Sbjct: 728 PARAAF 733
>gi|115437264|ref|NP_001043252.1| Os01g0533400 [Oryza sativa Japonica Group]
gi|75158475|sp|Q8RUV9.1|BGAL1_ORYSJ RecName: Full=Beta-galactosidase 1; Short=Lactase 1; Flags:
Precursor
gi|20146357|dbj|BAB89138.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|20161405|dbj|BAB90329.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113532783|dbj|BAF05166.1| Os01g0533400 [Oryza sativa Japonica Group]
gi|215767421|dbj|BAG99649.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 827
Score = 664 bits (1712), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/727 (48%), Positives = 461/727 (63%), Gaps = 38/727 (5%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+V+YD RSL+I+G+R +I+S +IHYPRS P MWP L+++AKEGG++ IE+Y+FWNGHE
Sbjct: 30 SVSYDDRSLVIDGQRRIILSGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYIFWNGHEPH 89
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
+Y F G +++V+F K IQ A MY ILRIGP++ E+NYGG+P WL IPG FR E
Sbjct: 90 RRQYNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNE 149
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFY--GEGGKRYALWAA 204
PF+ M+ F TLIV+ MK K+FA QGGPIILAQ+ENEYG + Y W A
Sbjct: 150 PFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIHWCA 209
Query: 205 KMAVAQNIGVPWIMCQQ-FDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFG 263
MA QN+GVPWIMCQQ D P V+NTCN FYC + P+ +PKIWTENW GWFK +
Sbjct: 210 DMANKQNVGVPWIMCQQDDDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWD 269
Query: 264 GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYG 323
D HR +EDIAF+VA FFQK GS+ NYYMYHGGTNFGRT+GGP+ITTSYDY+AP+DEYG
Sbjct: 270 KPDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYG 329
Query: 324 LPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDD 382
R PK+GHLKELH +K E L++GE + + G + Y DSS AC F+ N D
Sbjct: 330 NLRQPKYGHLKELHSVLKSMEKTLVHGEYFDTNYGDNITVTKYTLDSSSAC--FINNRFD 387
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
D V ++ LPAWSVSILPDCK V FN+A ++ Q+S + P + + S
Sbjct: 388 DKDVNVTLDGATHLLPAWSVSILPDCKTVAFNSAKIKTQTSVMVKKPNTAEQEQES---- 443
Query: 443 SKGLKWQVFKEIAGIW---GEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKN 499
LKW E + + +F K+ ++ I T+ D +DYLWY TS+ N E
Sbjct: 444 ---LKWSWMPENLSPFMTDEKGNFRKNELLEQIVTSTDQSDYLWYRTSL--NHKGE---- 494
Query: 500 GSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGL 559
GS L + + GH L+AF N +L G F+ ++P+ L GKN I+LLS TVGL
Sbjct: 495 GSYK-LYVNTTGHELYAFVNGKLIGKNHSADGDFVFQLESPVKLHDGKNYISLLSATVGL 553
Query: 560 QNAGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY--NPGYRNNI 615
+N GP +E + GI VK+ N +DLS SW+YK GL E+ I+ PGY+ N
Sbjct: 554 KNYGPSFEKMPTGIVGGPVKLIDSNGTAIDLSNSSWSYKAGLASEYRQIHLDKPGYKWNG 613
Query: 616 NWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKS 675
N P N+P TWYKA + P G++ + +D+L + KG+AW+NG +GRYWP S +
Sbjct: 614 N---NGTIPINRPFTWYKATFEAPSGEDAVVVDLLGLNKGVAWVNGNNLGRYWP--SYTA 668
Query: 676 SPHDECVQECDYRGKFNPD----KCITGCGEPSQRWYHIPRSWFKPSE-NILVIFEEKGG 730
+ C CDYRG F + +C+TGCGEPSQR+YH+PRS+ E N L++FEE GG
Sbjct: 669 AEMAGC-HRCDYRGAFQAEGDGTRCLTGCGEPSQRYYHVPRSFLAAGEPNTLLLFEEAGG 727
Query: 731 DPTKITF 737
DP+ +
Sbjct: 728 DPSGVAL 734
>gi|225428017|ref|XP_002278545.1| PREDICTED: beta-galactosidase 13 [Vitis vinifera]
gi|297744615|emb|CBI37877.3| unnamed protein product [Vitis vinifera]
Length = 833
Score = 663 bits (1711), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/718 (47%), Positives = 443/718 (61%), Gaps = 40/718 (5%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A VTYD RSLI+NGRREL+ S +IHYPRS P MWP ++Q+AK GG+N I++YVFWN HE
Sbjct: 29 AKTVTYDGRSLIVNGRRELLFSGSIHYPRSTPEMWPDILQKAKHGGLNLIQTYVFWNIHE 88
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
G++ F G ++LVKFIK+I +Y LRIGPF+ AE+N+GG P WL +P +FR+
Sbjct: 89 PVEGQFNFEGNYDLVKFIKLIGDYGLYATLRIGPFIEAEWNHGGFPYWLREVPDIIFRSY 148
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFKYHM+K+ +I++MMK KLFA QGGPIILAQ+ENEY + Y E G +Y WA
Sbjct: 149 NEPFKYHMEKYSRMIIEMMKEAKLFAPQGGPIILAQIENEYNSIQLAYRELGVQYVQWAG 208
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFT-PHSPSMPKIWTENWPGWFKTF 262
KMAV GVPWIMC+Q D PDPVINTCN +C D FT P+ P+ P +WTENW ++ F
Sbjct: 209 KMAVGLGAGVPWIMCKQKDAPDPVINTCNGRHCGDTFTGPNRPNKPSLWTENWTAQYRVF 268
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
G R +ED+AFSVARF K G++ NYYMYHGGTNFGRT G F+TT Y EAP+DEY
Sbjct: 269 GDPPSQRAAEDLAFSVARFISKNGTLANYYMYHGGTNFGRT-GSSFVTTRYYDEAPLDEY 327
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD-SSGACAAFLANMD 381
GL R PKWGHLK+LH A++LC+ AL G LG +E Y + CAAFL N
Sbjct: 328 GLQREPKWGHLKDLHSALRLCKKALFTGSPGVEKLGKDKEVRFYEKPGTHICAAFLTNNH 387
Query: 382 DKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDN 441
+ T+ FR Y LP S+SILPDCK VV+NT V AQ + V +
Sbjct: 388 SREAATLTFRGEEYFLPPHSISILPDCKTVVYNTQRVVAQHNARNFVKSKI--------- 438
Query: 442 GSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGS 501
+K LKW++ +E + + + ++ N KD +DY W+ TSI ++ + +K
Sbjct: 439 ANKNLKWEMSQEPIPVMTDMKILTKSPMELYNFLKDRSDYAWFVTSIELSNYDLPMKKDI 498
Query: 502 RPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQN 561
PVL I + GHA+ AF N GSA G+ F ++ P+ KAG N IALL MTVGL N
Sbjct: 499 IPVLQISNLGHAMLAFVNGNFIGSAHGSNVEKNFVFRKPVKFKAGTNYIALLCMTVGLPN 558
Query: 562 AGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTM 621
+G + E AGI SV+I G N+GTLD++ W ++G+ GEH+ Y G + + W T
Sbjct: 559 SGAYMEHRYAGIHSVQILGLNTGTLDITNNGWGQQVGVNGEHVKAYTQGGSHRVQW--TA 616
Query: 622 EPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDEC 681
K +TWYK P G++P+ L M M KG+AW+NG+ IGRYW
Sbjct: 617 AKGKGPAMTWYKTYFDMPEGNDPVILRMTSMAKGMAWVNGKNIGRYW------------- 663
Query: 682 VQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSI 739
Y ++ +PSQ YH+PR+W KPS+N+LVIFEE GG+P +I +
Sbjct: 664 ---LSY---------LSPLEKPSQSEYHVPRAWLKPSDNLLVIFEETGGNPEEIEVEL 709
>gi|357130214|ref|XP_003566745.1| PREDICTED: beta-galactosidase 13-like [Brachypodium distachyon]
Length = 829
Score = 663 bits (1711), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/751 (45%), Positives = 468/751 (62%), Gaps = 42/751 (5%)
Query: 4 RTPIAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLV 63
R +A LLI + C V Y+ R+L+I+G+R +++S +IHYPRS P MWP L+
Sbjct: 8 RASLALVLLLITAAVGAANCT--TVAYNDRALVIDGQRRIVLSGSIHYPRSTPEMWPDLI 65
Query: 64 QQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAE 123
++AKEGG++ IE+YVFWNGHE P +Y F G +++V+F K IQ A MY ILRIGP++ E
Sbjct: 66 KKAKEGGLDAIETYVFWNGHEPRPRQYNFAGNYDIVRFFKEIQNAGMYAILRIGPYICGE 125
Query: 124 YNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVEN 183
+NYGG+P WL IPG FR +PF++ M+ F TLIV+ +K +FA QGGPIIL+Q+EN
Sbjct: 126 WNYGGLPAWLRDIPGMQFRMHNQPFEHEMETFTTLIVNKLKDANMFAGQGGPIILSQIEN 185
Query: 184 EYGYYESFY--GEGGKRYALWAAKMAVAQNIGVPWIMCQQ-FDTPDPVINTCNSFYCDQF 240
EYG + + Y W A MA QN+GVPWIMCQQ D P VINTCN FYC +
Sbjct: 186 EYGNIMANLTDAQSASEYIHWCAAMANKQNVGVPWIMCQQDADVPPNVINTCNGFYCHDW 245
Query: 241 TPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNF 300
P +PKIWTENW GWFK + D HR ++DIAF+VA FFQK GS+ NYYMYHGGTNF
Sbjct: 246 FPKRTDIPKIWTENWTGWFKAWDKPDFHRSAQDIAFAVAMFFQKRGSLQNYYMYHGGTNF 305
Query: 301 GRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSS 360
GRTAGGP+ITTSYDY+AP+DEYG R PK+GHLK+LH +K E L++G+ S+++ G +
Sbjct: 306 GRTAGGPYITTSYDYDAPLDEYGNIREPKYGHLKDLHAVLKSMEKILVHGDFSDINYGRN 365
Query: 361 QEADVYA-DSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVR 419
Y D S C F++N D D ++ +PAWSVS+LPDCK V +NTA ++
Sbjct: 366 VTVTKYTLDGSSVC--FISNQFDDRDANATIDGTTHVVPAWSVSVLPDCKAVAYNTAKIK 423
Query: 420 AQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIW---GEADFVKSGFVDHINTTK 476
AQ+S + P + E P+N LKW E + + F K+ ++ I T+
Sbjct: 424 AQTSVMVKKPNTV---EQEPEN----LKWSWMPEHLKPFMTDEKGSFRKNELLEQITTST 476
Query: 477 DTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFK 536
D +DYLWY TS K ++ L + + GH ++AF N +L G F+
Sbjct: 477 DQSDYLWYRTSFE-------HKGEAKYKLSVNTTGHQIYAFVNGKLAGRQHSPNGAFIFQ 529
Query: 537 YKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWT 594
++P+ L GKN ++LLS T+GL+N G +E + AGI VK+ N T+DLS SW+
Sbjct: 530 LESPVKLHDGKNYLSLLSATMGLKNYGALFELMPAGIVGGPVKLVDNNGSTIDLSNSSWS 589
Query: 595 YKIGLQGEHLGIY--NPGYRNNINWV-STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLK 651
YK GL GEH I+ PGY+ W P N+ TWYKA + P G+E + D++
Sbjct: 590 YKAGLAGEHRQIHLDKPGYK----WHGDNGTIPINRAFTWYKATFQAPAGEEAVVADLMG 645
Query: 652 MGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPD----KCITGCGEPSQRW 707
+ KG+AW+NG +GRYWP S ++ C CDYRG F + KC+TGC EP+QR+
Sbjct: 646 LNKGVAWVNGNNLGRYWP--SYVAAEMGGC-HHCDYRGAFKAEGDGLKCLTGCNEPAQRF 702
Query: 708 YHIPRSWFKPSE-NILVIFEEKGGDPTKITF 737
YH+PR + + E N +V+FEE GGDP+++ F
Sbjct: 703 YHVPRVFLRAGEPNTVVLFEEAGGDPSRVGF 733
>gi|156106159|gb|ABU49386.1| beta-galactosidase 15 [Oryza sativa Indica Group]
Length = 828
Score = 662 bits (1709), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/727 (48%), Positives = 461/727 (63%), Gaps = 40/727 (5%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTY+ RSL+I+G R +IIS +IHYPRS P MWP L+++AKEGG++ IE+YVFWNGHE
Sbjct: 31 VTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
+Y F G +++V+F K IQ A +Y ILRIGP++ E+NYGG+P WL IPG FR P
Sbjct: 91 RQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNAP 150
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY--ESFYGEGGKRYALWAAK 205
F+ M+ F TLIV+ MK +FA QGGPIILAQ+ENEYG + + Y W A
Sbjct: 151 FENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEYIHWCAD 210
Query: 206 MAVAQNIGVPWIMCQQ-FDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MA QN+GVPWIMCQQ D P V+NTCN FYC + P+ +PKIWTENW GWFK +
Sbjct: 211 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 270
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
D HR +EDIAF+VA FFQK GS+ NYYMYHGGTNFGRT+GGP+ITTSYDY+AP+DEYG
Sbjct: 271 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 330
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
R PK+GHLK+LH IK E L++GE + + + Y S + A F+ N +D
Sbjct: 331 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDNVTVTKYTLGSTS-ACFINNRNDNK 389
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
D V ++ LPAWSVSILPDCK V FN+A ++AQ +T+ + N+ E P+N
Sbjct: 390 DLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQ-TTIMVKKANM--VEKEPEN--- 443
Query: 445 GLKWQVFKEIAGIW---GEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGS 501
LKW +E + + + K+ ++ I T+ D +DYLWY TS+ K +
Sbjct: 444 -LKWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLD-------HKGEA 495
Query: 502 RPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQN 561
L + + GH L+AF N L G H F+ ++ + L GKN I+LLS T+GL+N
Sbjct: 496 SYTLFVNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKN 555
Query: 562 AGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY--NPGYR--NNI 615
GP +E + AGI VK+ N +DLS SW+YK GL GE+ I+ PGYR NN
Sbjct: 556 YGPLFEKMPAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKPGYRWDNNN 615
Query: 616 NWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKS 675
V P N+P TWYK + P G + + +D+L + KG+AW+NG +GRYWP S +
Sbjct: 616 GTV-----PINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWP--SYTA 668
Query: 676 SPHDECVQECDYRGKFNPD----KCITGCGEPSQRWYHIPRSWFKPSE-NILVIFEEKGG 730
+ C CDYRG F + KC+TGCGEPSQR+YH+PRS+ K E N L++FEE GG
Sbjct: 669 AEMGGC-HHCDYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGG 727
Query: 731 DPTKITF 737
DP+++ F
Sbjct: 728 DPSQVIF 734
>gi|125556152|gb|EAZ01758.1| hypothetical protein OsI_23787 [Oryza sativa Indica Group]
Length = 828
Score = 662 bits (1708), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/728 (48%), Positives = 461/728 (63%), Gaps = 42/728 (5%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTY+ RSL+I+G R +IIS +IHYPRS P MWP L+++AKEGG++ IE+YVFWNGHE
Sbjct: 31 VTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
+Y F G +++V+F K IQ A +Y ILRIGP++ E+NYGG+P WL IPG FR P
Sbjct: 91 RQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNAP 150
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY--ESFYGEGGKRYALWAAK 205
F+ M+ F TLIV+ MK +FA QGGPIILAQ+ENEYG + + Y W A
Sbjct: 151 FENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEYIHWCAD 210
Query: 206 MAVAQNIGVPWIMCQQ-FDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MA QN+GVPWIMCQQ D P V+NTCN FYC + P+ +PKIWTENW GWFK +
Sbjct: 211 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 270
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
D HR +EDIAF+VA FFQK GS+ NYYMYHGGTNFGRT+GGP+ITTSYDY+AP+DEYG
Sbjct: 271 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 330
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDDK 383
R PK+GHLK+LH IK E L++GE + + Y DS+ AC F+ N +D
Sbjct: 331 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDKVTVTKYTLDSTSAC--FINNRNDN 388
Query: 384 NDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGS 443
D V ++ LPAWSVSILPDCK V FN+A ++AQ +TV + N+ E
Sbjct: 389 MDVNVTLDGTTHLLPAWSVSILPDCKTVAFNSAKIKAQ-TTVMVNKANMVEKEP------ 441
Query: 444 KGLKWQVFKEIAGIW---GEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNG 500
+ LKW +E + + + K+ ++ I T+ D +DYLWY TSI N E
Sbjct: 442 ESLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSI--NHKGE----- 494
Query: 501 SRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQ 560
+ L + + GH L+AF N L G H F+ ++P L GKN I+LLS T+GL+
Sbjct: 495 ASYTLFVNTTGHELYAFVNGMLVGQNHSPNGHFVFQLESPAKLHDGKNYISLLSATIGLK 554
Query: 561 NAGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY--NPG--YRNN 614
N GP +E + AGI VK+ N +DLS SW+YK GL GE+ I+ PG + NN
Sbjct: 555 NYGPLFEKMPAGIVGGPVKLIDNNGKGIDLSNSSWSYKAGLAGEYRQIHLDKPGCTWDNN 614
Query: 615 INWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRK 674
V P N+P TWYK + P G++ + +D+L + KG+AW+NG +GRYWP S
Sbjct: 615 NGTV-----PINKPFTWYKTTFQAPAGEDTVVVDLLGLNKGVAWVNGNNLGRYWP--SYT 667
Query: 675 SSPHDECVQECDYRGKFNPD----KCITGCGEPSQRWYHIPRSWFKPSE-NILVIFEEKG 729
++ C CDYRG F + KC+TGCGEPSQR+YH+PRS+ K E N L++FEE G
Sbjct: 668 AAEMGGC-HHCDYRGVFQAEGDGQKCLTGCGEPSQRFYHVPRSFLKNGEPNTLILFEEAG 726
Query: 730 GDPTKITF 737
GDP+ ++F
Sbjct: 727 GDPSHVSF 734
>gi|357455519|ref|XP_003598040.1| Beta-galactosidase [Medicago truncatula]
gi|355487088|gb|AES68291.1| Beta-galactosidase [Medicago truncatula]
Length = 812
Score = 662 bits (1708), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/733 (46%), Positives = 452/733 (61%), Gaps = 27/733 (3%)
Query: 7 IAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQA 66
IA ALL SS+ T V YDS ++I+NG R+LIIS AIHYPRS MWP L+ +A
Sbjct: 11 IACLALLYTCSSATT------VEYDSSAIILNGERKLIISGAIHYPRSTSQMWPDLIMKA 64
Query: 67 KEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNY 126
K+G ++ IE+Y+FW+ HE KY F G + +KF+KI Q+ +Y++LRIGP+V AE+NY
Sbjct: 65 KDGDLDAIETYIFWDLHEPVRRKYDFSGNLDFIKFLKIAQEQGLYVVLRIGPYVCAEWNY 124
Query: 127 GGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG 186
GG P+WLH +PG R D FK M+ F T IV M K LFA QGGPIILAQ+ENEYG
Sbjct: 125 GGFPMWLHNMPGIQLRTDNAVFKEEMKIFTTKIVTMCKEAGLFAPQGGPIILAQIENEYG 184
Query: 187 YYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPS 246
S YGE G Y W A+MA+AQNIGVPWIMC+Q + P +I+TCN +YCD F P++P
Sbjct: 185 DVISHYGEAGNSYIKWCAEMALAQNIGVPWIMCKQKNAPATIIDTCNGYYCDTFKPNNPK 244
Query: 247 MPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
PKI+TENW GWF+ +G R PHR +ED AFSVARFFQ GG++ NYY+YHGGTNFGRTAGG
Sbjct: 245 SPKIFTENWVGWFQKWGERRPHRTAEDSAFSVARFFQNGGALQNYYLYHGGTNFGRTAGG 304
Query: 307 PFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVY 366
PFI T+YDY+AP+DEYG PK+GHLK LH AIKL E L NG + S G S Y
Sbjct: 305 PFIITTYDYDAPLDEYGNLIEPKYGHLKRLHAAIKLGEKVLTNGTATWESHGDSLWMTTY 364
Query: 367 ADS-SGACAAFLANMDDKNDKTV-VFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSST 424
+ +G FL+N D V + ++ Y++PAWS+S+L DC K V+NTA AQ++
Sbjct: 365 TNKGTGQKFCFLSNSHTSKDAEVDLQQDGKYYVPAWSMSLLQDCNKEVYNTAKTEAQTNI 424
Query: 425 VEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWY 484
+ + Q SP+ W G+ F S +D + T +DYLWY
Sbjct: 425 --YMKQLDQKLGNSPE-----WSWTSDPMEDTFQGKGTFTASQLLDQKSVTVGASDYLWY 477
Query: 485 TTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLK 544
T ++VN+ + + + + + GH L+ F N L G+ G + P F ++ ISL
Sbjct: 478 MTEVVVNDTNTW----GKAKVQVNTTGHILYLFINGFLTGTQHGTVSQPGFIHEGNISLN 533
Query: 545 AGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFN----SGTLDLSTYSWTYKIGLQ 600
G N I+LLS+TVG N G F++ GI + F+ + LDLS +W+YK+G+
Sbjct: 534 QGTNIISLLSVTVGHANYGAFFDMQETGIVGGPVKLFSIENPNNVLDLSKSTWSYKVGIN 593
Query: 601 GEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLN 660
G Y+P + W T P+TWYK K P G P+ LD++ + KG AW+N
Sbjct: 594 GMTKKFYDPKTTIGVQW-KTNNVSIGVPMTWYKTTFKTPDGTNPVVLDLIGLQKGEAWVN 652
Query: 661 GEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSEN 720
G+ IGRYWP + + C CDYRG++N DKC++GCGEPSQR+YH+PRS+ N
Sbjct: 653 GQSIGRYWP---AMLAENKGCSDTCDYRGEYNADKCLSGCGEPSQRFYHVPRSFLNNDVN 709
Query: 721 ILVIFEEKGGDPT 733
LV+FEE G D T
Sbjct: 710 TLVLFEEMGFDAT 722
>gi|218188392|gb|EEC70819.1| hypothetical protein OsI_02284 [Oryza sativa Indica Group]
Length = 837
Score = 662 bits (1707), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/727 (48%), Positives = 461/727 (63%), Gaps = 38/727 (5%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+V+YD RSL+I+G+R +I+S +IHYPRS P MWP L+++AKEGG++ IE+Y+FWNGHE
Sbjct: 30 SVSYDDRSLVIDGQRRIILSGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYIFWNGHEPH 89
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
+Y F G +++V+F K IQ A MY ILRIGP++ E+NYGG+P WL IPG FR E
Sbjct: 90 RRQYNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNE 149
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFY--GEGGKRYALWAA 204
PF+ M+ F TLIV+ MK K+FA QGGPIILAQ+ENEYG + Y W A
Sbjct: 150 PFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIHWCA 209
Query: 205 KMAVAQNIGVPWIMCQQ-FDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFG 263
MA QN+GVPWIMCQQ D P V+NTCN FYC + P+ +PKIWTENW GWFK +
Sbjct: 210 DMANKQNVGVPWIMCQQDDDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWD 269
Query: 264 GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYG 323
D HR +EDIAF+VA FFQK GS+ NYYMYHGGTNFGRT+GGP+ITTSYDY+AP+DEYG
Sbjct: 270 KPDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYG 329
Query: 324 LPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDD 382
R PK+GHLKELH +K E L++GE + + G + Y DSS AC F+ N D
Sbjct: 330 NLRQPKYGHLKELHSVLKSMEKTLVHGEYFDTNYGDNITVTKYTLDSSSAC--FINNRFD 387
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
D V ++ LPAWSVSILPDCK V FN+A ++ Q+S + P + + S
Sbjct: 388 DKDVNVTLDGATHLLPAWSVSILPDCKTVAFNSAKIKTQTSVMVKKPNTAEQEQES---- 443
Query: 443 SKGLKWQVFKEIAGIW---GEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKN 499
LKW E + + +F K+ ++ I T+ D +DYLWY TS+ N E
Sbjct: 444 ---LKWSWMPENLSPFMTDEKGNFRKNELLEQIVTSTDQSDYLWYRTSL--NHKGE---- 494
Query: 500 GSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGL 559
GS L + + GH L+AF N +L G F+ ++P+ L GKN I+LLS TVGL
Sbjct: 495 GSYK-LYVNTTGHELYAFVNGKLIGKNHSADGDFVFQLESPVKLHDGKNYISLLSATVGL 553
Query: 560 QNAGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY--NPGYRNNI 615
+N GP +E + GI VK+ N +DLS SW+YK GL E+ I+ PGY+ N
Sbjct: 554 KNYGPSFEKMPTGIVGGPVKLIDSNGTAIDLSNSSWSYKAGLASEYRQIHLDKPGYKWNG 613
Query: 616 NWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKS 675
N P N+P TWYKA + P G++ + +D+L + KG+AW+NG +GRYWP S +
Sbjct: 614 N---NGTIPINRPFTWYKATFEAPSGEDAVVVDLLGLNKGVAWVNGNNLGRYWP--SYTA 668
Query: 676 SPHDECVQECDYRGKFNPD----KCITGCGEPSQRWYHIPRSWFKPSE-NILVIFEEKGG 730
+ C CDYRG F + +C+TGCGEPSQR+YH+PRS+ E N L++FEE GG
Sbjct: 669 AEMAGC-HRCDYRGAFQAEGDGTRCLTGCGEPSQRYYHVPRSFLAAGEPNTLLLFEEAGG 727
Query: 731 DPTKITF 737
DP+ +
Sbjct: 728 DPSGVAL 734
>gi|302141787|emb|CBI18990.3| unnamed protein product [Vitis vinifera]
Length = 817
Score = 661 bits (1706), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/742 (47%), Positives = 456/742 (61%), Gaps = 53/742 (7%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
FA+L SS++ G VTYD RSLIING+R+++ S +IHYPRS P MWP L+ QAK+G
Sbjct: 13 FAVL---SSAVASVCGGEVTYDGRSLIINGQRKILFSGSIHYPRSTPEMWPSLISQAKQG 69
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ IE+YVFWN HE PG+Y F GR ++V+FI+ +Q +Y LRIGPF+ AE+NYGG
Sbjct: 70 GIDVIETYVFWNQHEPKPGQYDFSGRRDIVRFIREVQAQGLYACLRIGPFIQAEWNYGGF 129
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
P WLH +PG V+R D EPFK++M+ F T IV++MK E L+ASQGGPIIL Q+ENEY E
Sbjct: 130 PFWLHDVPGIVYRTDNEPFKFYMRNFTTKIVEIMKSENLYASQGGPIILQQIENEYKTVE 189
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSM 247
+ +GE GKRY LWAA MAV GVPW+MC+Q D PDPVIN+CN C + P+SP+
Sbjct: 190 ANFGEAGKRYVLWAANMAVGLETGVPWVMCKQDDAPDPVINSCNGRLCGETFAGPNSPNK 249
Query: 248 PKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQK-GGSVHNYYMYHGGTNFGRTAGG 306
P IWTENW + FG RP EDIAF VA F K GS NYYMYHGGTNFGRTA
Sbjct: 250 PAIWTENWTSSYPLFGEDARPRPVEDIAFHVALFVAKMNGSFINYYMYHGGTNFGRTASA 309
Query: 307 PFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSS-QEADV 365
++ T+Y EAP+DEYGL + P WGHLKELH A+KLC LL G +SNLSLG+ QEA V
Sbjct: 310 -YVQTAYYDEAPLDEYGLIQQPTWGHLKELHAAVKLCSETLLQGAQSNLSLGTKLQEAYV 368
Query: 366 YADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTV 425
+ SG CAAFL N D + D TVVF+N SY LP S+SILPDCK FNTA + +
Sbjct: 369 FRGQSGKCAAFLVNNDSRTDVTVVFQNTSYELPRKSISILPDCKNEAFNTAKASFRPGLI 428
Query: 426 EMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYT 485
+ + N ++ +W+ +KE + + + ++H+NTTKD +DYLWYT
Sbjct: 429 SI-------QTVTKFNSTE--QWEEYKESILNFDDTSSRANTLLEHMNTTKDASDYLWYT 479
Query: 486 TSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKA 545
N + + + VL S+ HALHAF N GS G+ ++ F N +S +A
Sbjct: 480 ----FRYNND--PSNGQSVLSTNSRAHALHAFINGRHTGSQHGSSSNLSFSLDNTVSFRA 533
Query: 546 GKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLG 605
G N ++LLS+ VGL ++G + E AG+ V+I N D + W Y++GL GE L
Sbjct: 534 GINNVSLLSVMVGLPDSGAYLERRVAGLRRVRIQS-NGSLKDFTNNPWGYQVGLLGEKLQ 592
Query: 606 IYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIG 665
IY + W S + LTWYK V P G+EP+ L+++ M KG W+NG+ IG
Sbjct: 593 IYTDVGSQKVQW-SKFGSSTSGLLTWYKTVFDAPAGNEPVALNLVSMRKGEVWVNGQSIG 651
Query: 666 RYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIF 725
RYW +T G+PSQ WYHIPRS+ KP+ N+LV+
Sbjct: 652 RYWV-------------------------SFLTPSGKPSQIWYHIPRSFLKPTGNLLVLL 686
Query: 726 EEKGGDPTKITF---SIRKISG 744
EE+ G P I+ SI KI G
Sbjct: 687 EEETGHPVGISIGKVSIPKICG 708
>gi|224083510|ref|XP_002307056.1| predicted protein [Populus trichocarpa]
gi|222856505|gb|EEE94052.1| predicted protein [Populus trichocarpa]
Length = 715
Score = 660 bits (1702), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/724 (45%), Positives = 448/724 (61%), Gaps = 46/724 (6%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
G+VTYD RSLII+G+R+++ S +IHYPRS P MWP LV +A+EGGV+ I++YVFWN HE
Sbjct: 23 GDVTYDGRSLIIDGQRKILFSGSIHYPRSTPEMWPSLVAKAREGGVDVIQTYVFWNLHEP 82
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
PG+Y F GR +LV+FIK IQ +Y+ LRIGPF+ +E+ YGG P WLH +P V+R+D
Sbjct: 83 RPGEYDFSGRNDLVRFIKEIQAQGLYVCLRIGPFIESEWTYGGFPFWLHDVPDIVYRSDN 142
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
EPFK++MQ F T IV+MMK E L+ASQGGPIIL+Q+ENEY E+ + + G Y +WAAK
Sbjct: 143 EPFKFYMQNFTTKIVNMMKSEGLYASQGGPIILSQIENEYQNVEAAFRDKGPPYVIWAAK 202
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIWTENWPGWFKTFG 263
MAV GVPW+MC+Q D PDPVINTCN C + P+SP+ P +WTENW +++ +G
Sbjct: 203 MAVELQTGVPWVMCKQTDAPDPVINTCNGMRCGETFGGPNSPTKPSLWTENWTSFYQVYG 262
Query: 264 GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYG 323
G R +EDIAF V F K GS NYYM+HGGTNFGRTA IT+ YD +AP+DEYG
Sbjct: 263 GEPYIRSAEDIAFHVTLFIAKNGSYINYYMFHGGTNFGRTASAYVITSYYD-QAPLDEYG 321
Query: 324 LPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDK 383
L R PKWGHLKELH AIK C +L G +SN SLG Q+A ++ + CAAFL N D K
Sbjct: 322 LIRQPKWGHLKELHAAIKSCSSTILEGVQSNFSLGQLQQAYIFEEEGAGCAAFLVNNDQK 381
Query: 384 NDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGS 443
N+ TV FRN+++ L S+S+LPDC+ ++FNTA V A+ + + L
Sbjct: 382 NNATVEFRNITFELLPKSISVLPDCENIIFNTAKVNAKGNEITRTSSQL---------FD 432
Query: 444 KGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
+W+ + ++ + + + ++H+NTTKD +DYLWYT S + N + + P
Sbjct: 433 DADRWEAYTDVIPNFADTNLKSDTLLEHMNTTKDKSDYLWYTFSFLPN------SSCTEP 486
Query: 504 VLLIESKGHALHAFANQELQGSASGN-GTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
+L +ES H AF N + GSA G+ PF + PI L N I++LS VGLQ++
Sbjct: 487 ILHVESLAHVASAFVNNKYAGSAHGSKDAKGPFTMEAPIVLNDQMNTISILSTMVGLQDS 546
Query: 563 GPFYEWVGAGITSVKITGFNSGTLDLS-TYSWTYKIGLQGEHLGIYNPGYRNNINWVSTM 621
G F E AG+T V+I + + Y W Y+ GL GE L IY + +NI W S +
Sbjct: 547 GAFLERRYAGLTRVEIRCAQQEIYNFTNNYEWGYQAGLSGESLNIYMREHLDNIEW-SEV 605
Query: 622 EPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDEC 681
+QPL+W+K P G++P+ L++ MGKG AW+NG+ IGRYW
Sbjct: 606 VSATDQPLSWFKIEFDAPTGNDPVVLNLSTMGKGEAWVNGQSIGRYWL------------ 653
Query: 682 VQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRK 741
+T G+PSQ YHIPR++ S N+LV+ EE GGDP I+
Sbjct: 654 -------------SFLTSKGQPSQTLYHIPRAFLNSSGNLLVLLEESGGDPLHISLDTVS 700
Query: 742 ISGF 745
+G
Sbjct: 701 RTGL 704
>gi|222424809|dbj|BAH20357.1| AT5G56870 [Arabidopsis thaliana]
Length = 620
Score = 659 bits (1699), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 330/639 (51%), Positives = 415/639 (64%), Gaps = 22/639 (3%)
Query: 104 IIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMM 163
++ QA +Y+ LRIGP+V AE+N+GG PVWL ++PG FR D EPFK M+KF IV MM
Sbjct: 1 LVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKFVPGMAFRTDNEPFKAAMKKFTEKIVWMM 60
Query: 164 KREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFD 223
K EKLF +QGGPIILAQ+ENEYG E G GK Y W A+MA+ + GVPWIMC+Q D
Sbjct: 61 KAEKLFQTQGGPIILAQIENEYGPVEWEIGAPGKAYTKWVAQMALGLSTGVPWIMCKQED 120
Query: 224 TPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQ 283
P P+I+TCN +YC+ F P+S + PK+WTENW GW+ FGG P+RP EDIA+SVARF Q
Sbjct: 121 APGPIIDTCNGYYCEDFKPNSINKPKMWTENWTGWYTNFGGAVPYRPVEDIAYSVARFIQ 180
Query: 284 KGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLC 343
KGGS+ NYYMYHGGTNF RTA G F+ +SYDY+AP+DEYGLPR PK+ HLK LH AIKL
Sbjct: 181 KGGSLVNYYMYHGGTNFDRTA-GEFMASSYDYDAPLDEYGLPREPKYSHLKALHKAIKLS 239
Query: 344 EHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVS 403
E ALL+ + + SLG+ QEA V+ S +CAAFL+N D+ + V+FR Y LP WSVS
Sbjct: 240 EPALLSADATVTSLGAKQEAYVFWSKS-SCAAFLSNKDENSAARVLFRGFPYDLPPWSVS 298
Query: 404 ILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEA-D 462
ILPDCK V+NTA V A S MVP G+K W F E EA
Sbjct: 299 ILPDCKTEVYNTAKVNAPSVHRNMVP-----------TGTK-FSWGSFNEATPTANEAGT 346
Query: 463 FVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQEL 522
F ++G V+ I+ T D +DY WY T I + E FLK G P+L + S GHALH F N +L
Sbjct: 347 FARNGLVEQISMTWDKSDYFWYITDITIGSGETFLKTGDSPLLTVMSAGHALHVFVNGQL 406
Query: 523 QGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYE-WVGAGITSVKITGF 581
G+A G HP + I L AG N+IALLS+ VGL N G +E W + V + G
Sbjct: 407 SGTAYGGLDHPKLTFSQKIKLHAGVNKIALLSVAVGLPNVGTHFEQWNKGVLGPVTLKGV 466
Query: 582 NSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPG 641
NSGT D+S + W+YKIG++GE L ++ + + W K QPLTWYK+ P G
Sbjct: 467 NSGTWDMSKWKWSYKIGVKGEALSLHTNTESSGVRWTQGSFVAKKQPLTWYKSTFATPAG 526
Query: 642 DEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCG 701
+EP+ LDM MGKG W+NG IGR+WP + S C+Y G F+ KC++ CG
Sbjct: 527 NEPLALDMNTMGKGQVWINGRNIGRHWPAYKAQGS-----CGRCNYAGTFDAKKCLSNCG 581
Query: 702 EPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
E SQRWYH+PRSW K S+N++V+FEE GGDP I+ R
Sbjct: 582 EASQRWYHVPRSWLK-SQNLIVVFEELGGDPNGISLVKR 619
>gi|218184317|gb|EEC66744.1| hypothetical protein OsI_33101 [Oryza sativa Indica Group]
Length = 824
Score = 658 bits (1698), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/727 (47%), Positives = 459/727 (63%), Gaps = 40/727 (5%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V Y+ RSL+I+G R +IIS +IHYPRS P MWP L+++AKEGG++ IE+YVFWNGHE
Sbjct: 27 VAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 86
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
+Y F G +++++F K IQ A +Y ILRIGP++ E+NYGG+P WL IP FR P
Sbjct: 87 RQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQFRMHNAP 146
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY--ESFYGEGGKRYALWAAK 205
F+ M+ F TLI++ MK +FA QGGPIILAQ+ENEYG + + Y W A
Sbjct: 147 FENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEYIHWCAD 206
Query: 206 MAVAQNIGVPWIMCQQ-FDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MA QN+GVPWIMCQQ D P V+NTCN FYC + P+ +PKIWTENW GWFK +
Sbjct: 207 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 266
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
D HR +EDIAF+VA FFQK GS+ NYYMYHGGTNFGRT+GGP+ITTSYDY+AP+DEYG
Sbjct: 267 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 326
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
R PK+GHLK+LH IK E L++GE + + + Y S + A F+ N +D
Sbjct: 327 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDNVTVTKYTLGSTS-ACFINNRNDNK 385
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
D V ++ LPAWSVSILPDCK V FN+A ++AQ +T+ + N+ E P+N
Sbjct: 386 DLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQ-TTIMVKKANM--VEKEPEN--- 439
Query: 445 GLKWQVFKEIAGIW---GEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGS 501
LKW +E + + + K+ ++ I T+ D +DYLWY TS+ K +
Sbjct: 440 -LKWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLD-------HKGEA 491
Query: 502 RPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQN 561
L + + GH L+AF N L G H F+ ++ + L GKN I+LLS T+GL+N
Sbjct: 492 SYTLFVNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKN 551
Query: 562 AGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY--NPGYR--NNI 615
GP +E + AGI VK+ N +DLS SW+YK GL GE+ I+ PGYR NN
Sbjct: 552 YGPLFEKMPAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKPGYRWDNNN 611
Query: 616 NWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKS 675
V P N+P TWYK + P G + + +D+L + KG+AW+NG +GRYWP S +
Sbjct: 612 GTV-----PINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWP--SYTA 664
Query: 676 SPHDECVQECDYRGKFNPD----KCITGCGEPSQRWYHIPRSWFKPSE-NILVIFEEKGG 730
+ C CDYRG F + KC+TGCGEPSQR+YH+PRS+ K E N L++FEE GG
Sbjct: 665 AEMGGC-HHCDYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGG 723
Query: 731 DPTKITF 737
DP+++ F
Sbjct: 724 DPSQVIF 730
>gi|293332691|ref|NP_001168270.1| beta-galactosidase precursor [Zea mays]
gi|223947135|gb|ACN27651.1| unknown [Zea mays]
gi|414880417|tpg|DAA57548.1| TPA: beta-galactosidase [Zea mays]
Length = 822
Score = 657 bits (1696), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/742 (46%), Positives = 462/742 (62%), Gaps = 38/742 (5%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
LL+ + A VTY+ R+L+I+G+R +I+S +IHYPRS P MWP L+ +AKEGG+
Sbjct: 7 LLLALVAVTQVASATTVTYNDRALVIDGQRRIILSGSIHYPRSTPQMWPDLINKAKEGGL 66
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
NTIE+YVFWNGHE +Y F G +++++F K IQ A M+ ILRIGP++ E+NYGG+P
Sbjct: 67 NTIETYVFWNGHEPRRRQYNFEGSYDIIRFFKEIQNAGMHAILRIGPYICGEWNYGGLPA 126
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY--E 189
WL IPG FR PF+ M+ F TLIV+ MK +FA QGGPIILAQ+ENEYG +
Sbjct: 127 WLRDIPGMQFRLHNAPFEREMETFTTLIVNKMKDVNMFAGQGGPIILAQIENEYGNIMGQ 186
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQ-FDTPDPVINTCNSFYCDQFTPHSPSMP 248
+ +Y W A MA Q +GVPWIMCQQ D P VINTCN FYC + P+ +P
Sbjct: 187 LKNNQSASQYIHWCADMANKQEVGVPWIMCQQDNDVPHNVINTCNGFYCHDWFPNRTGIP 246
Query: 249 KIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF 308
KIWTENW GWFK + D HR +EDIAF+VA FFQK GSVHNYYMYHGGTNFGRT+GGP+
Sbjct: 247 KIWTENWTGWFKAWDKPDFHRSAEDIAFAVAMFFQKRGSVHNYYMYHGGTNFGRTSGGPY 306
Query: 309 ITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD 368
ITTSYDY+AP+DEYG R PK+GHLK+LH I+ E L++G+ ++ S G + Y
Sbjct: 307 ITTSYDYDAPLDEYGNIRQPKYGHLKDLHDLIRSMEKILVHGKYNDTSYGKNVTVTKYM- 365
Query: 369 SSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMV 428
G+ F+ N D V ++ +PAWSVSILP+CK V +NTA ++ Q+S ++
Sbjct: 366 YGGSSVCFINNQFVDRDMKVTLGGETHLVPAWSVSILPNCKTVAYNTAKIKTQTS---VM 422
Query: 429 PENLQPSEASPDNGSKGLKWQVFKEIAGIW---GEADFVKSGFVDHINTTKDTTDYLWYT 485
+ E P+ ++W E + F +S ++ I T+ D +DYLWY
Sbjct: 423 VKKANSVEKEPET----MRWSWMPENLKPFMTDHRGSFRQSQLLEQIATSTDQSDYLWYR 478
Query: 486 TSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKA 545
TS+ E GS L + + GH ++AF N L G F+ ++P+ L +
Sbjct: 479 TSL------EHKGEGSY-TLYVNTSGHEMYAFVNGRLVGQNHSADGAFVFQLQSPVKLHS 531
Query: 546 GKNEIALLSMTVGLQNAGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQGEH 603
GKN ++LLS TVGL+N GP +E V AGI VK+ G N +DL+ SW+YK GL GE
Sbjct: 532 GKNYVSLLSGTVGLKNYGPSFELVPAGIAGGPVKLVGTNGTAIDLTKSSWSYKSGLAGEL 591
Query: 604 LGIY--NPGYRNNINWVS-TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLN 660
I+ PGY+ W S P N+P TWYK + P G+E + +D+L + KG+AW+N
Sbjct: 592 RQIHLDKPGYK----WQSHNGTIPVNRPFTWYKTTFEAPAGEEAVVVDLLGLNKGVAWVN 647
Query: 661 GEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPD----KCITGCGEPSQRWYHIPRSWFK 716
G +GRYWP + P CDYRGKF + +C+TGCGEP+QR+YH+PRS+ +
Sbjct: 648 GNSLGRYWPSYTAAEMPG---CHVCDYRGKFIAEGDGIRCLTGCGEPAQRFYHVPRSFLR 704
Query: 717 PSE-NILVIFEEKGGDPTKITF 737
E N L++FEE GGDPT+ F
Sbjct: 705 AGEPNTLILFEEAGGDPTRAAF 726
>gi|22328945|ref|NP_194344.2| beta-galactosidase 12 [Arabidopsis thaliana]
gi|20466292|gb|AAM20463.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|23198118|gb|AAN15586.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332659763|gb|AEE85163.1| beta-galactosidase 12 [Arabidopsis thaliana]
Length = 636
Score = 657 bits (1694), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 328/624 (52%), Positives = 414/624 (66%), Gaps = 15/624 (2%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
L I SS+ VTYD +++IING+R +++S +IHYPRS P MWP L+Q+AK+GG+
Sbjct: 13 LGILCCSSLICSVKAIVTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKDGGL 72
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ I++YVFWNGHE SPG+YYF R++LVKFIK++QQA +Y+ LRIGP+V AE+N+GG PV
Sbjct: 73 DVIQTYVFWNGHEPSPGQYYFEDRYDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPV 132
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL Y+PG VFR D EPFK MQKF IV MMK EKLF +QGGPIIL+Q+ENEYG E
Sbjct: 133 WLKYVPGMVFRTDNEPFKAAMQKFTEKIVRMMKEEKLFETQGGPIILSQIENEYGPIEWE 192
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
G GK Y W A+MA + GVPWIMC+Q D P+ +INTCN FYC+ F P+S + PK+W
Sbjct: 193 IGAPGKAYTKWVAEMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNSDNKPKMW 252
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TENW GWF FGG P+RP+EDIA SVARF Q GGS NYYMYHGGTNF RTA G FI T
Sbjct: 253 TENWTGWFTEFGGAVPYRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDRTA-GEFIAT 311
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
SYDY+AP+DEYGLPR PK+ HLK LH IKLCE AL++ + + SLG QEA V+ S
Sbjct: 312 SYDYDAPLDEYGLPREPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQEAHVFKSKS- 370
Query: 372 ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPEN 431
+CAAFL+N + + V+F +Y LP WSVSILPDCK +NTA VR S ++MVP N
Sbjct: 371 SCAAFLSNYNTSSAARVLFGGSTYDLPPWSVSILPDCKTEYYNTAKVRTSSIHMKMVPTN 430
Query: 432 LQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVN 491
S S + +EI F + G V+ I+ T+D TDY WY T I ++
Sbjct: 431 TPFSWGSYN-----------EEIPSANDNGTFSQDGLVEQISITRDKTDYFWYLTDITIS 479
Query: 492 ENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIA 551
+E+FL G P+L I S GHALH F N +L G+A G+ P + I L AG N++A
Sbjct: 480 PDEKFL-TGEDPLLTIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVNKLA 538
Query: 552 LLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPG 610
LLS GL N G YE W + V + G NSGT D++ + W+YKIG +GE L ++
Sbjct: 539 LLSTAAGLPNVGVHYETWNTGVLGPVTLNGVNSGTWDMTKWKWSYKIGTKGEALSVHTLA 598
Query: 611 YRNNINWVSTMEPPKNQPLTWYKA 634
+ + W K QPLTWYK
Sbjct: 599 GSSTVEWKEGSLVAKKQPLTWYKV 622
>gi|16905220|gb|AAL31090.1|AC091749_19 putative beta-galactosidase [Oryza sativa Japonica Group]
gi|22655745|gb|AAN04162.1| Putative beta-galactosidase [Oryza sativa Japonica Group]
Length = 824
Score = 657 bits (1694), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/727 (47%), Positives = 457/727 (62%), Gaps = 40/727 (5%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V Y+ RSL+I+G R +IIS +IHYPRS P MWP L+++AKEGG++ IE+YVFWNGHE
Sbjct: 27 VAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 86
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
+Y F G +++++F K IQ A +Y ILRIGP++ E+NYGG+P WL IP FR P
Sbjct: 87 RQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQFRMHNAP 146
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY--ESFYGEGGKRYALWAAK 205
F+ M+ F TLI++ MK +FA QGGPIILAQ+ENEYG + + Y W A
Sbjct: 147 FENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEYIHWCAD 206
Query: 206 MAVAQNIGVPWIMCQQ-FDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MA QN+GVPWIMCQQ D P V+NTCN FYC + P+ +PKIWTENW GWFK +
Sbjct: 207 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 266
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
D HR +EDIAF+VA FFQK GS+ NYYMYHGGTNFGRT+GGP+ITTSYDY+AP+DEYG
Sbjct: 267 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 326
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
R PK+GHLK+LH IK E L++GE + + + Y S + A F+ N +D
Sbjct: 327 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDANYSDNVTVTKYTLGSTS-ACFINNRNDNK 385
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
D V ++ LPAWSVSILPDCK V FN+A ++AQ +T+ + N+ E +
Sbjct: 386 DLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQ-TTIMVKKANMVEKEP------E 438
Query: 445 GLKWQVFKEIAGIW---GEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGS 501
LKW +E + + + K+ ++ I T+ D +DYLWY TS+ K +
Sbjct: 439 SLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLD-------HKGEA 491
Query: 502 RPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQN 561
L + + GH L+AF N L G H F+ ++ + L GKN I+LLS T+GL+N
Sbjct: 492 SYTLFVNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKN 551
Query: 562 AGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY--NPGYR--NNI 615
GP +E + AGI VK+ N +DLS SW+YK GL GE+ I+ PGYR NN
Sbjct: 552 YGPLFEKMPAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKPGYRWDNNN 611
Query: 616 NWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKS 675
V P N+P TWYK + P G + + +D+L + KG+AW+NG +GRYWP S +
Sbjct: 612 GTV-----PINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWP--SYTA 664
Query: 676 SPHDECVQECDYRGKFNPD----KCITGCGEPSQRWYHIPRSWFKPSE-NILVIFEEKGG 730
+ C CDYRG F + KC+TGCGEPSQR+YH+PRS+ K E N L++FEE GG
Sbjct: 665 AEMGGC-HHCDYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGG 723
Query: 731 DPTKITF 737
DP+++ F
Sbjct: 724 DPSQVIF 730
>gi|115481546|ref|NP_001064366.1| Os10g0330600 [Oryza sativa Japonica Group]
gi|122249227|sp|Q7G3T8.1|BGL13_ORYSJ RecName: Full=Beta-galactosidase 13; Short=Lactase 13; Flags:
Precursor
gi|110288895|gb|AAP53027.2| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113638975|dbj|BAF26280.1| Os10g0330600 [Oryza sativa Japonica Group]
Length = 828
Score = 657 bits (1694), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/727 (47%), Positives = 457/727 (62%), Gaps = 40/727 (5%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V Y+ RSL+I+G R +IIS +IHYPRS P MWP L+++AKEGG++ IE+YVFWNGHE
Sbjct: 31 VAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
+Y F G +++++F K IQ A +Y ILRIGP++ E+NYGG+P WL IP FR P
Sbjct: 91 RQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQFRMHNAP 150
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY--ESFYGEGGKRYALWAAK 205
F+ M+ F TLI++ MK +FA QGGPIILAQ+ENEYG + + Y W A
Sbjct: 151 FENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEYIHWCAD 210
Query: 206 MAVAQNIGVPWIMCQQ-FDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MA QN+GVPWIMCQQ D P V+NTCN FYC + P+ +PKIWTENW GWFK +
Sbjct: 211 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 270
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
D HR +EDIAF+VA FFQK GS+ NYYMYHGGTNFGRT+GGP+ITTSYDY+AP+DEYG
Sbjct: 271 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 330
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
R PK+GHLK+LH IK E L++GE + + + Y S + A F+ N +D
Sbjct: 331 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDANYSDNVTVTKYTLGSTS-ACFINNRNDNK 389
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
D V ++ LPAWSVSILPDCK V FN+A ++AQ +T+ + N+ E +
Sbjct: 390 DLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQ-TTIMVKKANMVEKEP------E 442
Query: 445 GLKWQVFKEIAGIW---GEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGS 501
LKW +E + + + K+ ++ I T+ D +DYLWY TS+ K +
Sbjct: 443 SLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLD-------HKGEA 495
Query: 502 RPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQN 561
L + + GH L+AF N L G H F+ ++ + L GKN I+LLS T+GL+N
Sbjct: 496 SYTLFVNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKN 555
Query: 562 AGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY--NPGYR--NNI 615
GP +E + AGI VK+ N +DLS SW+YK GL GE+ I+ PGYR NN
Sbjct: 556 YGPLFEKMPAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKPGYRWDNNN 615
Query: 616 NWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKS 675
V P N+P TWYK + P G + + +D+L + KG+AW+NG +GRYWP S +
Sbjct: 616 GTV-----PINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWP--SYTA 668
Query: 676 SPHDECVQECDYRGKFNPD----KCITGCGEPSQRWYHIPRSWFKPSE-NILVIFEEKGG 730
+ C CDYRG F + KC+TGCGEPSQR+YH+PRS+ K E N L++FEE GG
Sbjct: 669 AEMGGC-HHCDYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGG 727
Query: 731 DPTKITF 737
DP+++ F
Sbjct: 728 DPSQVIF 734
>gi|45758292|gb|AAS76480.1| beta-galactosidase [Gossypium hirsutum]
Length = 843
Score = 656 bits (1693), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 328/716 (45%), Positives = 453/716 (63%), Gaps = 48/716 (6%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A VTYD+RSLIING+REL+ S AIHYPRS P MWP L+++AK+GG+N IE+YVFWNGHE
Sbjct: 46 ALGVTYDARSLIINGKRELLFSGAIHYPRSTPDMWPDLIKKAKQGGINAIETYVFWNGHE 105
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
G+Y F G F+LVKFIK+I + ++Y ++R+GPF+ AE+N+GG+P WL +PG +FR+D
Sbjct: 106 PVEGQYNFEGEFDLVKFIKLIHEHKLYAVVRVGPFIQAEWNHGGLPYWLREVPGIIFRSD 165
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK HM++F+TLIVD +K+EKLFA QGGPIILAQ+ENEY + + E G Y WA
Sbjct: 166 NEPFKKHMKRFVTLIVDKLKQEKLFAPQGGPIILAQIENEYNTIQRAFREKGDSYVQWAG 225
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQ--FTPHSPSMPKIWTENWPGWFKTF 262
K+A++ N VPWIMC+Q D PDP+INTCN +C + P+ + P +WTENW ++ F
Sbjct: 226 KLALSLNANVPWIMCKQRDAPDPIINTCNGRHCGDTFYGPNKRNKPALWTENWTAQYRVF 285
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
G R +ED+A+SVARFF K GS+ NYYM++GGTNFGRT+ F TT Y E P+DE+
Sbjct: 286 GDPPSQRSAEDLAYSVARFFSKNGSMVNYYMHYGGTNFGRTSAS-FTTTRYYDEGPLDEF 344
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD-SSGACAAFLANMD 381
GL R PKWGHLK++H A+ LC+ AL G + L LG Q+A V+ + ACAAFLAN +
Sbjct: 345 GLQREPKWGHLKDVHRALSLCKRALFWGFPTTLKLGPDQQAIVWQQPGTSACAAFLANNN 404
Query: 382 DKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDN 441
+ + V FR LPA S+S+LPDCK VVFNT V Q ++ V +
Sbjct: 405 TRLAQHVNFRGQDIRLPARSISVLPDCKTVVFNTQLVTTQHNSRNFVRSEI--------- 455
Query: 442 GSKGLKWQVFKEI--AGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKN 499
+K W++ +E+ G+ + D + F + TKDTTDY WYTTS+++ + +K
Sbjct: 456 ANKNFNWEMCREVPPVGLGFKFDVPRELF----HLTKDTTDYAWYTTSLLLGRRDLPMKK 511
Query: 500 GSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGL 559
RPVL + S GH +HA+ N E GSA G+ F + +SLK G+N IALL VGL
Sbjct: 512 NVRPVLRVASLGHGIHAYVNGEYAGSAHGSKVEKSFVLQRAVSLKEGENHIALLGYLVGL 571
Query: 560 QNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVS 619
++G + E AG S+ I G N+GTLD+S W +++G+ GE ++ ++ W
Sbjct: 572 PDSGAYMEKRFAGPRSITILGLNTGTLDISQNGWGHQVGIDGEKKKLFTEEGSKSVQWT- 630
Query: 620 TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHD 679
+P + PLTWYK P GD P+ + M MGKG+ W+NG IGRYW
Sbjct: 631 --KPDQGGPLTWYKGYFDAPEGDNPVAIVMTGMGKGMVWVNGRSIGRYW----------- 677
Query: 680 ECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
+ ++ +P+Q YHIPR++ KP +N++V+ EE+GG+P +
Sbjct: 678 --------------NNYLSPLKKPTQSEYHIPRAYLKP-KNLIVLLEEEGGNPKDV 718
>gi|125574401|gb|EAZ15685.1| hypothetical protein OsJ_31098 [Oryza sativa Japonica Group]
Length = 824
Score = 656 bits (1693), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/727 (47%), Positives = 457/727 (62%), Gaps = 40/727 (5%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V Y+ RSL+I+G R +IIS +IHYPRS P MWP L+++AKEGG++ IE+YVFWNGHE
Sbjct: 27 VAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 86
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
+Y F G +++++F K IQ A +Y ILRIGP++ E+NYGG+P WL IP FR P
Sbjct: 87 RQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQFRMHNAP 146
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY--ESFYGEGGKRYALWAAK 205
F+ M+ F TLI++ MK +FA QGGPIILAQ+ENEYG + + Y W A
Sbjct: 147 FENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEYIHWCAD 206
Query: 206 MAVAQNIGVPWIMCQQ-FDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MA QN+GVPWIMCQQ D P V+NTCN FYC + P+ +PKIWTENW GWFK +
Sbjct: 207 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 266
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
D HR +EDIAF+VA FFQK GS+ NYYMYHGGTNFGRT+GGP+ITTSYDY+AP+DEYG
Sbjct: 267 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 326
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
R PK+GHLK+LH IK E L++GE + + + Y S + A F+ N +D
Sbjct: 327 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDANYSDNVTVTKYTLGSTS-ACFINNRNDNK 385
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
D V ++ LPAWSVSILPDCK V FN+A ++AQ +T+ + N+ E +
Sbjct: 386 DLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQ-TTIMVKKANMVEKEP------E 438
Query: 445 GLKWQVFKEIAGIW---GEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGS 501
LKW +E + + + K+ ++ I T+ D +DYLWY TS+ K +
Sbjct: 439 SLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLD-------HKGEA 491
Query: 502 RPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQN 561
L + + GH L+AF N L G H F+ ++ + L GKN I+LLS T+GL+N
Sbjct: 492 SYTLFVNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKN 551
Query: 562 AGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY--NPGYR--NNI 615
GP +E + AGI VK+ N +DLS SW+YK GL GE+ I+ PGYR NN
Sbjct: 552 YGPLFEKMPAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKPGYRWDNNN 611
Query: 616 NWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKS 675
V P N+P TWYK + P G + + +D+L + KG+AW+NG +GRYWP S +
Sbjct: 612 GTV-----PINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWP--SYTA 664
Query: 676 SPHDECVQECDYRGKFNPD----KCITGCGEPSQRWYHIPRSWFKPSE-NILVIFEEKGG 730
+ C CDYRG F + KC+TGCGEPSQR+YH+PRS+ K E N L++FEE GG
Sbjct: 665 AEMGGC-HHCDYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGG 723
Query: 731 DPTKITF 737
DP+++ F
Sbjct: 724 DPSQVIF 730
>gi|108707234|gb|ABF95029.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|108707235|gb|ABF95030.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 702
Score = 655 bits (1689), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 328/600 (54%), Positives = 421/600 (70%), Gaps = 17/600 (2%)
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
MQ+F +VD MK L+ASQGGPIIL+Q+ENEYG +S YG GK Y WAA MAV+ +
Sbjct: 1 MQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWAAGMAVSLD 60
Query: 212 IGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPS 271
GVPW+MCQQ D PDP+INTCN FYCDQFTP+S S PK+WTENW GWF +FGG P+RP+
Sbjct: 61 TGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLSFGGAVPYRPA 120
Query: 272 EDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWG 331
ED+AF+VARF+Q+GG+ NYYMYHGGTNFGR+ GGPFI TSYDY+APIDEYG+ R PKWG
Sbjct: 121 EDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEYGMVRQPKWG 180
Query: 332 HLKELHGAIKLCEHALLNGERSNLSLGSSQEADVY--ADSSGACAAFLANMDDKNDKTVV 389
HL+++H AIKLCE AL+ E S SLG + EA VY AD+S CAAFLAN+D ++DKTV
Sbjct: 181 HLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNS-ICAAFLANVDAQSDKTVK 239
Query: 390 FRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEM--VPENLQPSEAS---PDNGSK 444
F +Y LPAWSVSILPDCK VV NTA + +Q +T EM + ++Q ++ S P+ +
Sbjct: 240 FNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDTDDSLITPELATA 299
Query: 445 GLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
G W E GI E K G ++ INTT D +D+LWY+TSI+V +E +L NGS+
Sbjct: 300 G--WSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVKGDEPYL-NGSQSN 356
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
LL+ S GH L + N +L GSA G+ + + P++L GKN+I LLS TVGL N G
Sbjct: 357 LLVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLSTTVGLSNYGA 416
Query: 565 FYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEP 623
F++ VGAG+T VK++G N G L+LS+ WTY+IGL+GE L +YNP + WVS
Sbjct: 417 FFDLVGAGVTGPVKLSGPN-GALNLSSTDWTYQIGLRGEDLHLYNPS-EASPEWVSDNAY 474
Query: 624 PKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQ 683
P NQPL WYK P GD+P+ +D MGKG AW+NG+ IGRYWP +P CV
Sbjct: 475 PTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWP---TNLAPQSGCVN 531
Query: 684 ECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
C+YRG ++ +KC+ CG+PSQ YH+PRS+ +P N LV+FE+ GGDP+ I+F+ R+ S
Sbjct: 532 SCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMISFTTRQTS 591
>gi|84468366|dbj|BAE71266.1| putative beta-galactosidase [Trifolium pratense]
Length = 425
Score = 653 bits (1685), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 310/429 (72%), Positives = 356/429 (82%), Gaps = 6/429 (1%)
Query: 315 YEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACA 374
Y+AP+DEYGLPR PKWGHLK+LH AIKLCEH LL G+ N+SLG S EADVY DSSGACA
Sbjct: 1 YDAPVDEYGLPRLPKWGHLKDLHKAIKLCEHVLLYGKSVNVSLGPSVEADVYTDSSGACA 60
Query: 375 AFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQP 434
AF+AN+DDKNDKTV FRN SYH+PAWSVSILPDCK VV+NTA V Q++ + M+PE LQ
Sbjct: 61 AFIANVDDKNDKTVEFRNASYHIPAWSVSILPDCKNVVYNTAKVTTQTNKIAMIPEKLQQ 120
Query: 435 SEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENE 494
S D G K KW V+KE GIWG+ DFV +GFVDHINTTKDTTDYLW+TTSI ++ENE
Sbjct: 121 S----DKGQKTFKWDVWKENPGIWGKPDFVINGFVDHINTTKDTTDYLWHTTSISIDENE 176
Query: 495 EFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLS 554
E LK GS+PVL+IESKGHALHAF NQ+ QG+A GNG+H F +KNPISLKAGKNEIALLS
Sbjct: 177 ELLKKGSKPVLVIESKGHALHAFVNQKYQGTAYGNGSHSAFTFKNPISLKAGKNEIALLS 236
Query: 555 MTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNN 614
+TVGLQ AGPFY++VGAG+TSVKI G N+ T+DLS+ +WTYKIG+QGEHL IY N+
Sbjct: 237 LTVGLQTAGPFYDFVGAGVTSVKIKGLNNKTIDLSSNAWTYKIGVQGEHLKIYQGNGLNS 296
Query: 615 INWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRK 674
++W ST EPPK Q LTWYKA+V PPGDEP+GLDML MGKG AWLNGE IGRYWPR S
Sbjct: 297 VSWTSTSEPPKGQTLTWYKAIVDAPPGDEPVGLDMLYMGKGFAWLNGEGIGRYWPRISEF 356
Query: 675 SSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTK 734
++CV+ECDYRGKFNPDKC TGCGEPSQ+WYH+PRSWFKPS N+LV FEEKGGDPTK
Sbjct: 357 KK--EDCVEECDYRGKFNPDKCDTGCGEPSQKWYHVPRSWFKPSGNVLVFFEEKGGDPTK 414
Query: 735 ITFSIRKIS 743
ITF RK+S
Sbjct: 415 ITFVRRKVS 423
>gi|255575455|ref|XP_002528629.1| beta-galactosidase, putative [Ricinus communis]
gi|223531918|gb|EEF33732.1| beta-galactosidase, putative [Ricinus communis]
Length = 822
Score = 653 bits (1684), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/732 (47%), Positives = 463/732 (63%), Gaps = 31/732 (4%)
Query: 24 FAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGH 83
F VTYD+R++ I+G R+LI+S +IHYPRS P MWP L+++AKEGG+NTIE+YVFWN H
Sbjct: 3 FGYEVTYDNRAIKIDGARKLILSGSIHYPRSTPEMWPQLIRKAKEGGLNTIETYVFWNAH 62
Query: 84 ELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRN 143
E +Y F G +L++FIK I+ +Y ILRIGP+V AE+NYGG PVWLH +PG R
Sbjct: 63 EPHQRQYDFSGNLDLIRFIKTIRDEGLYAILRIGPYVCAEWNYGGFPVWLHNLPGIQIRT 122
Query: 144 DTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
+ E +K M+ F TLIV+MMK KLFASQGGPIIL+Q+ENEYG +S YG+ GK Y W
Sbjct: 123 NNEVYKNEMEIFTTLIVNMMKDGKLFASQGGPIILSQIENEYGNVQSSYGDEGKEYVKWC 182
Query: 204 AKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFG 263
A +A + +GVPWIMCQQ D P P+I++CN FYCDQ+ ++ S+PKIWTENW GWF+ +G
Sbjct: 183 ANLAESFKVGVPWIMCQQSDAPSPMIDSCNGFYCDQYYSNNKSLPKIWTENWTGWFQDWG 242
Query: 264 GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYG 323
++PHR +ED+AF+VARFFQ GGSV NYYMYHGGTNFG T GGP+IT SYDY+AP+DEYG
Sbjct: 243 QKNPHRSAEDVAFAVARFFQLGGSVMNYYMYHGGTNFGTTGGGPYITASYDYDAPLDEYG 302
Query: 324 LPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS-SGACAAFLANMDD 382
R PKWGHL++LH + E L GE N + + + + G + F +++D
Sbjct: 303 NLRQPKWGHLRDLHSVLNSMEQTLTYGESKNSNYPDNNNIFITIFAYQGKRSCFFSSIDY 362
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
K D+T+ F Y LPAWSVSILPDC V+NTA V Q+S +E N S P++
Sbjct: 363 K-DQTISFEGTDYFLPAWSVSILPDCFTEVYNTATVNVQTSIMEN-KANAADSFREPNS- 419
Query: 443 SKGLKWQVFKE-IAGIWGEADFVKSGFV-----DHINTTKDTTDYLWYTTSIIVNENEEF 496
L+W+ E I G+ + DFV + V D T T+DYLW T+ N N+
Sbjct: 420 ---LQWKWRPEKIRGLSLQGDFVGNTLVANELMDQKAVTNGTSDYLWIMTNYDHNMNDSL 476
Query: 497 LKNGSRPVLLIESKGHALHAFANQELQG--SASGNGTHPPFKYKNPISLKAGKNEIALLS 554
G +L + + GH +HAF N + G SAS F +++ I LK G N I+L+S
Sbjct: 477 WGAGKDIILQVHTNGHVVHAFVNGKHVGSQSASIESGRFDFVFESKIKLKRGINRISLVS 536
Query: 555 MTVGLQNAGPFYEWVGAGITS-VKITGFN------SGTLDLSTYSWTYKIGLQGEHLGI- 606
++VGLQN G ++ GI + I G + T+D+S+ W YK GL GE G
Sbjct: 537 VSVGLQNYGANFDTAPTGINGPITIIGRSKLGNQPDVTVDISSNRWVYKTGLHGEDQGFQ 596
Query: 607 -YNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIG 665
P +R T NQP WYK P G +P+ +D+L +GKG AW+NG IG
Sbjct: 597 AVRPRHRRQF---YTKHVLINQPFVWYKTSFNAPLGQDPVVVDLLGLGKGTAWVNGRNIG 653
Query: 666 RYWPRKSRKSSPHD-ECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVI 724
R+WP + +P D C C Y G + P +C+TGCGEP+QR+YHIPR W KP +N LV+
Sbjct: 654 RFWP---KALAPDDGTCNAPCSYIGTYEPKQCVTGCGEPTQRYYHIPRDWLKPEDNKLVL 710
Query: 725 FEEKGGDPTKIT 736
FEE GG P ++
Sbjct: 711 FEELGGTPDFVS 722
>gi|255561536|ref|XP_002521778.1| beta-galactosidase, putative [Ricinus communis]
gi|223538991|gb|EEF40588.1| beta-galactosidase, putative [Ricinus communis]
Length = 828
Score = 652 bits (1683), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/723 (47%), Positives = 445/723 (61%), Gaps = 37/723 (5%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
G+VTYD RSLI++G+R+L+ S +IHYPRS P MW L+ +AKEGG++ I++YVFWN HE
Sbjct: 22 GDVTYDGRSLIVDGQRKLLFSGSIHYPRSTPEMWQSLIAKAKEGGLDVIDTYVFWNLHEP 81
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
PG+Y F GR ++V+FIK +Q +Y+ LRIGPF+ E++YGG+P WLH IPG VFR+D
Sbjct: 82 QPGQYDFSGRRDIVRFIKEVQAQGLYVCLRIGPFIQGEWSYGGLPFWLHDIPGIVFRSDN 141
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
EPFK MQ F T IV MM+ EKL+ SQGGPIIL+Q+ENEYG E Y E G Y WAA+
Sbjct: 142 EPFKVQMQGFTTKIVTMMQSEKLYVSQGGPIILSQIENEYGTVEEAYHEKGPAYVKWAAQ 201
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQ--FTPHSPSMPKIWTENWPGWFKTFG 263
MAV N GVPW+MC+Q D PDPVIN CN C + P+SP+ P IWTENW + G
Sbjct: 202 MAVGLNTGVPWVMCKQNDAPDPVINACNGLRCAETFVGPNSPNKPAIWTENWTTRYVITG 261
Query: 264 GRDPHRPSEDIAFSVARFF-QKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
R EDIAF V +F K GS NYYMYHGGTNFGRTA F+ TSY +APIDEY
Sbjct: 262 ENIRIRSVEDIAFQVTQFIVAKKGSFVNYYMYHGGTNFGRTASA-FVPTSYYDQAPIDEY 320
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
GL R PKWGHLKE+H AIKLC LL+G + +SLG Q+A V+ SG CAAFL N D
Sbjct: 321 GLIRQPKWGHLKEMHAAIKLCLTPLLSGGQVTISLGQQQQAFVFTGLSGECAAFLLNNDT 380
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
N +V FRN SY LP S+SILPDCK V FNTA V Q +T M L E
Sbjct: 381 ANTASVQFRNASYDLPPNSISILPDCKTVAFNTAKVSTQYTTRSMTRSKLLDGED----- 435
Query: 443 SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
KW ++E + E ++ ++TTKD +DYLWYT ++ ++
Sbjct: 436 ----KWVQYQEAIVNFDETSVKSEAILEQMSTTKDASDYLWYTFRFQQESSD------TQ 485
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
VL + S GH LHAF N + G A G+ +P F ++ +SL G N ++LLS+ VG+ ++
Sbjct: 486 AVLNVRSLGHVLHAFVNGQAVGYAQGSHKNPQFTLQSTVSLSEGVNNVSLLSVMVGMPDS 545
Query: 563 GPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
G + E AG+ VKI G + + YSW Y++GL GE L I+ + + W + +
Sbjct: 546 GAYMERRAAGLRKVKIQE-KEGNKEFTNYSWGYQVGLLGEKLQIFTDQGSSQVQWANFSK 604
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWP--RKSRKSSPHDE 680
N PLTWYK + P D P+ L++ MGKG AW+NG+ IGRYWP R S SS
Sbjct: 605 NALN-PLTWYKTLFDAPLEDAPVALNLGSMGKGEAWVNGQSIGRYWPSYRASDGSSQI-- 661
Query: 681 CVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
+ FN TG + R Y++PRS+ KP N+LV+ EE GG+P +I+
Sbjct: 662 ------WYAYFN-----TGAIFRAVR-YNVPRSFLKPKGNLLVVLEESGGNPLQISVDTA 709
Query: 741 KIS 743
IS
Sbjct: 710 SIS 712
>gi|326520505|dbj|BAK07511.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 830
Score = 650 bits (1678), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/728 (46%), Positives = 452/728 (62%), Gaps = 36/728 (4%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V YD R+L+I+G R L+IS +IHYPRS P MWP L+++AKEGG++ IE+YVFWNGHE
Sbjct: 26 VGYDDRALVIDGERRLLISGSIHYPRSTPEMWPDLIRKAKEGGLDAIETYVFWNGHEPRR 85
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
+Y F G +++V+F K +Q A MY ILRIGP++ E+NYGG+P WL I G FR P
Sbjct: 86 RQYNFEGSYDIVRFFKEVQDAGMYAILRIGPYICGEWNYGGLPAWLRDISGMQFRMHNHP 145
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFY--GEGGKRYALWAAK 205
F+ M+ F TLIVD +K K+FA QGGPIIL+Q+ENEYG E Y W A
Sbjct: 146 FEQEMETFTTLIVDKLKEAKMFAGQGGPIILSQIENEYGNIMGKLNNNESASEYIHWCAA 205
Query: 206 MAVAQNIGVPWIMCQQ-FDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MA QN+GVPWIMCQQ D P VINT N FYC + P +PKIWTENW GWFK +
Sbjct: 206 MANKQNVGVPWIMCQQDDDVPSNVINTWNGFYCHDWFPKRTDIPKIWTENWTGWFKAWDK 265
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
D HR +EDIAFSVA FFQ GS+ NYYMYHGGTNFGRT+GGP+ITTSYDY+AP+DEYG
Sbjct: 266 PDFHRSAEDIAFSVAMFFQTRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 325
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
R PK+GHLK+LH +K E LL+G+ + ++G++ + A F++N D
Sbjct: 326 IRQPKYGHLKDLHNVLKSMEKILLHGDYKDTTMGNTNVTVTKYTLDNSSACFISNKFDDK 385
Query: 385 DKTVVFRNVSYH-LPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGS 443
+ V N + H +PAWSVSILPDCK V +N+A ++ Q+S + P + +
Sbjct: 386 EVNVTLDNGATHTVPAWSVSILPDCKTVAYNSAKIKTQTSVMVKRP--------GAETVT 437
Query: 444 KGLKWQVFKEIAGIW---GEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNG 500
GL W E + + +F K+ ++ I T+ D +DYLWY TS K
Sbjct: 438 DGLAWSWMPENLQPFMTDEKGNFRKNELLEQIATSGDQSDYLWYRTSFE-------HKGE 490
Query: 501 SRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQ 560
S L + + GH L+AF N +L G F+ + P+ L +GKN I+LLS T+GL+
Sbjct: 491 SNYKLHVNTTGHELYAFVNGKLVGRHYSPNGGFAFQMETPVKLHSGKNYISLLSATIGLK 550
Query: 561 NAGPFYEWVGAGITS--VKI--TGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNIN 616
N G +E + AGI VK+ T N+ DLS SW+YK GL GE+ + +
Sbjct: 551 NYGALFEMMPAGIVGGPVKLVDTVTNTTAYDLSNSSWSYKAGLAGEYRETHLDKANDRSQ 610
Query: 617 WVSTMEP--PKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRK 674
W + P ++P TWYKA + P G+EP+ D+L +GKG+ W+NG +GRYWP S
Sbjct: 611 WSGGLNGTIPVHRPFTWYKATFEAPAGEEPVVADLLGLGKGVVWVNGNNLGRYWP--SYV 668
Query: 675 SSPHDECVQECDYRGKFNPD----KCITGCGEPSQRWYHIPRSWFKPSE-NILVIFEEKG 729
++ D C Q CDYRG F + KC+TGC EPSQR+YH+PRS+ K E N +V+FEE G
Sbjct: 669 AADMDGC-QRCDYRGTFKAEGDGQKCLTGCNEPSQRFYHVPRSFIKAGEPNTMVLFEEAG 727
Query: 730 GDPTKITF 737
GDPT+++F
Sbjct: 728 GDPTRVSF 735
>gi|224082320|ref|XP_002306647.1| predicted protein [Populus trichocarpa]
gi|222856096|gb|EEE93643.1| predicted protein [Populus trichocarpa]
Length = 764
Score = 650 bits (1677), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/724 (46%), Positives = 449/724 (62%), Gaps = 54/724 (7%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NVTYD RSLIING+ +++ S +IHYPRS P MW L+ +AK GG++ I++YVFWN HE
Sbjct: 1 NVTYDGRSLIINGQHKILFSGSIHYPRSTPDMWSSLISKAKAGGIDVIQTYVFWNLHEPQ 60
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G++YF GR +LV+F+K IQ +Y LRIGPF+ +E+ YGG+P WLH IPG V+R+D +
Sbjct: 61 QGQFYFNGRADLVRFVKEIQAQGLYACLRIGPFIESEWTYGGLPFWLHDIPGMVYRSDNQ 120
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFKYHM++F++ IV MMK EKL+ASQGGPIIL+QVENEY E+ + E G Y WAA M
Sbjct: 121 PFKYHMKRFVSRIVSMMKSEKLYASQGGPIILSQVENEYKNVEAAFHEKGPSYVRWAALM 180
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIWTENWPGWFKTFGG 264
AV GVPW+MC+Q D PDPVIN+CN C + P+SP+ P IWTE+W +++ +G
Sbjct: 181 AVNLQTGVPWVMCKQDDAPDPVINSCNGMRCGETFAGPNSPNKPSIWTEDWTSFYQVYGE 240
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
R ++DIAF VA F K GS NYYMYHGGTNFGRTA IT+ YD +AP+DEYGL
Sbjct: 241 ETYMRSAQDIAFHVALFIAKTGSYVNYYMYHGGTNFGRTASAFTITSYYD-QAPLDEYGL 299
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
R PKWGHLKELH AIK C LL+G SLG Q+A V+ +SG CAAFL N D K
Sbjct: 300 IRQPKWGHLKELHAAIKSCSKLLLHGAHKTFSLGPLQQAYVFQGNSGQCAAFLVNNDGKQ 359
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
+ V+F++ SY LP S+SILPDCK + FNTA V AQ +T M P S
Sbjct: 360 EVEVLFQSNSYKLPQKSISILPDCKTMTFNTAKVNAQYTTRSMKPNQKFNSVG------- 412
Query: 445 GLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
KW+ + E + + + ++H++TTKDT+DYLWYT ++ L N ++ V
Sbjct: 413 --KWEEYNEPIPEFDKTSLRANRLLEHMSTTKDTSDYLWYTFRF-----QQNLPN-AQSV 464
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
+S GH LHA+ N G G+ + F + + LK G N +ALLS TVGL ++G
Sbjct: 465 FNAQSHGHVLHAYVNGVHAGFGHGSHQNTSFSLQTTVRLKNGTNSVALLSATVGLPDSGA 524
Query: 565 FYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPP 624
+ E AG+ V+I D +TY+W Y++GL GE L IY N + W +
Sbjct: 525 YLERRVAGLRRVRIQ-----NKDFTTYTWGYQVGLLGERLQIYTENGSNKVKW---NKLG 576
Query: 625 KNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
N+PL WYK + P G++P+ L++ MGKG AW+NG+ IGRYW S H
Sbjct: 577 TNRPLMWYKTLFDAPAGNDPVALNLGSMGKGEAWVNGQSIGRYW------VSFH------ 624
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI---TFSIRK 741
T G PSQ WY+IPR++ KP+ N+LV+ EE+ G P I T S+ K
Sbjct: 625 -------------TSQGSPSQTWYNIPRAFLKPTGNLLVLLEEEKGYPPGITVDTVSVTK 671
Query: 742 ISGF 745
+ G+
Sbjct: 672 VCGY 675
>gi|356518798|ref|XP_003528064.1| PREDICTED: beta-galactosidase 6-like [Glycine max]
Length = 717
Score = 650 bits (1676), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/725 (45%), Positives = 437/725 (60%), Gaps = 44/725 (6%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A VTYD RSLII+G+R+++ S +IHYPRS P MWP L+ +AK+GG++ I++YVFWN HE
Sbjct: 24 AEEVTYDGRSLIIDGQRKILFSGSIHYPRSTPQMWPDLIAKAKQGGLDVIQTYVFWNLHE 83
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
PG Y F GR++LV FIK IQ +Y+ LRIGPF+ +E+ YGG P WLH +PG V+R D
Sbjct: 84 PQPGMYDFSGRYDLVGFIKEIQAQGLYVCLRIGPFIESEWTYGGFPFWLHDVPGIVYRTD 143
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK++MQ F T IV+MMK E L+ASQGGPIIL+Q+ENEY + +G G +Y WAA
Sbjct: 144 NEPFKFYMQNFTTKIVNMMKEEGLYASQGGPIILSQIENEYQNIQKAFGTAGSQYVQWAA 203
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFT-PHSPSMPKIWTENWPGWFKTF 262
KMAV + GVPWIMC+Q D PDPVINTCN C + FT P+SP+ P +WTENW +++ +
Sbjct: 204 KMAVGLDTGVPWIMCKQTDAPDPVINTCNGMRCGETFTGPNSPNKPALWTENWTSFYQVY 263
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
GG R +EDIAF V F + GS NYYMYHGGTNFGRT IT YD +AP+DEY
Sbjct: 264 GGLPYIRSAEDIAFHVTLFIARNGSYVNYYMYHGGTNFGRTGSAYVITGYYD-QAPLDEY 322
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
GL R PKWGHLK+LH IK C LL G + N +LG E V+ + G C AFL N D
Sbjct: 323 GLLRQPKWGHLKQLHEVIKSCSTTLLQGVQRNFTLGQLLEVYVFEEEKGECVAFLINNDR 382
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
N TV FRN SY L S+SILPDC+ V F+TANV S+ + P+ N
Sbjct: 383 DNKATVQFRNSSYELLPKSISILPDCQNVTFSTANVNTTSNRRIISPKQ---------NF 433
Query: 443 SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
S WQ F+++ + ++ +NTTKD +DYLWYT E+ + S+
Sbjct: 434 SSVDDWQQFQDVISNFDNTSLKSDSLLEQMNTTKDKSDYLWYTLRF------EYNLSCSK 487
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
P L ++S H HAF N G GN F + P+++ G N +++LS+ VGL ++
Sbjct: 488 PTLSVQSAAHVAHAFVNNTYIGGEHGNHDVKSFTLELPVTVNQGTNNLSILSVMVGLPDS 547
Query: 563 GPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
G F E AG+ SV++ +L+L+ +W Y++GL GE L +Y ++ W S +
Sbjct: 548 GAFLERRFAGLISVELQCSEQESLNLTNSTWGYQVGLMGEQLQVYKEQNNSDTGW-SQLG 606
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
Q L WYK P GD+P+ LD+ MGKG AW+NGE IGRYW
Sbjct: 607 NVMEQTLFWYKTTFDTPEGDDPVVLDLSSMGKGEAWVNGESIGRYWIL------------ 654
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
F+ K G PSQ YH+PRS+ K S N+LV+ EE GG+P I+ +
Sbjct: 655 --------FHDSK-----GNPSQSLYHVPRSFLKDSGNVLVLLEEGGGNPLGISLDTVSV 701
Query: 743 SGFPK 747
+ +
Sbjct: 702 TDLQQ 706
>gi|356522904|ref|XP_003530082.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 923
Score = 650 bits (1676), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/731 (44%), Positives = 452/731 (61%), Gaps = 36/731 (4%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A V+YD R+L I+G+R ++ SA+IHYPRS P MWP L+++AKEGG++ IE+YVFWN HE
Sbjct: 25 ALEVSYDERALTIDGKRRILFSASIHYPRSTPEMWPYLIRKAKEGGLDVIETYVFWNAHE 84
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
+Y F +LV+FI+ IQ+ +Y ++RIGP++++E+NYGG+PVWLH IP FR
Sbjct: 85 PQRRQYEFSENLDLVRFIRTIQKEGLYAMIRIGPYISSEWNYGGLPVWLHNIPNMEFRTH 144
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
F M+ F T IVDMM+ E LFA QGGPII+AQ+ENEYG YG G +Y W A
Sbjct: 145 NRAFMEEMKTFTTKIVDMMQDETLFAVQGGPIIIAQIENEYGNVMHAYGNNGTQYLKWCA 204
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
++A + GVPW+M QQ + P +I++C+ +YCDQF P+ PKIWTENW G +K +G
Sbjct: 205 QLADSFETGVPWVMSQQSNAPQFMIDSCDGYYCDQFQPNDNHKPKIWTENWTGGYKNWGT 264
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
++PHRP+ED+A++VARFFQ GG+ NYYMYHGGTNF RTAGGP++TTSYDY+AP+DEYG
Sbjct: 265 QNPHRPAEDVAYAVARFFQFGGTFQNYYMYHGGTNFKRTAGGPYVTTSYDYDAPLDEYGN 324
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
PKWGHL++LH +K E+ L G N G+ A VY G F+ N
Sbjct: 325 LNQPKWGHLRQLHNLLKSKENILTQGSSQNTDYGNMVTATVYT-YDGKSTCFIGNAHQSK 383
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
D T+ FRN Y +PAWSVSILP+C +NTA V Q++ MV ++ + E +
Sbjct: 384 DATINFRNNEYTIPAWSVSILPNCSSEAYNTAKVNTQTTI--MVKKDNEDLEYA------ 435
Query: 445 GLKWQ------VFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIV--NENEEF 496
L+WQ V + I G D +D T D +DYLWY TSI + +++ +
Sbjct: 436 -LRWQWRQEPFVQMKDGQITGIIDLTAPKLLDQKVVTNDFSDYLWYITSIDIKGDDDPSW 494
Query: 497 LKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMT 556
K L + + GH LH F N + G+ F +++ I L GKNEI+LLS T
Sbjct: 495 TKEFR---LRVHTSGHVLHVFVNGKHVGTQHAKNGQFKFVHESKIKLTTGKNEISLLSTT 551
Query: 557 VGLQNAGPFYEWVGAG-------ITSVKITGFNSGTL--DLSTYSWTYKIGLQGEHLGIY 607
VGL N GPF++ + G + +V ++ + DLS W+YK+GL GEH Y
Sbjct: 552 VGLPNYGPFFDNIEVGVLGPVQLVAAVGDYDYDDDEIVKDLSKNQWSYKVGLHGEHEMHY 611
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+ Y N++ T P ++ L WYK K P GD+P+ +D+ +GKG AW+NG IGRY
Sbjct: 612 S--YENSLKTWYTDAVPTDRILVWYKTTFKSPIGDDPVVVDLSGLGKGHAWVNGNSIGRY 669
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPS-ENILVIFE 726
W S + + C +CDYRG + +KC++ C +PSQRWYH+PRS+ + + +N LV+FE
Sbjct: 670 W---SSYLADENGCSPKCDYRGPYTSNKCLSMCAQPSQRWYHVPRSFLRDNDQNTLVLFE 726
Query: 727 EKGGDPTKITF 737
E GG P + F
Sbjct: 727 ELGGQPYYVNF 737
>gi|449464182|ref|XP_004149808.1| PREDICTED: beta-galactosidase 16-like [Cucumis sativus]
Length = 801
Score = 649 bits (1674), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/714 (47%), Positives = 442/714 (61%), Gaps = 53/714 (7%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+ TYD RSLI+NG +L+ S +IHYPRS P MWP L+ +AKEGG++ I++YVFWN HE
Sbjct: 15 SATYDGRSLIVNGEHKLLFSGSIHYPRSTPDMWPSLIAKAKEGGIDVIQTYVFWNLHEPQ 74
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G Y F GR ++V+F+K IQ +Y LRIGPF+ AE++YGG+P WLH + G V+R+D E
Sbjct: 75 QGTYEFSGRRDIVRFVKEIQAQGLYACLRIGPFIEAEWSYGGLPFWLHDVLGIVYRSDNE 134
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK HMQ F T IV+MMK E L+ASQGGPIIL+Q+ENEY E+ +GE G Y WAAKM
Sbjct: 135 PFKLHMQNFTTKIVNMMKSEGLYASQGGPIILSQIENEYTLVEAAFGEKGPPYVQWAAKM 194
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFT-PHSPSMPKIWTENWPGWFKTFGG 264
AV+ GVPW MC+Q D PDPVINTCN C + FT P+SP+ P IWTENW +++T+G
Sbjct: 195 AVSLQTGVPWSMCKQNDAPDPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGE 254
Query: 265 RDPHRPSEDIAFSVARFF-QKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYG 323
R +E+IAF VA F K G+ NYYMYHGGTNFGR+A IT YD ++P+DEYG
Sbjct: 255 EPYIRSAEEIAFHVALFIAAKNGTYVNYYMYHGGTNFGRSASAFMITGYYD-QSPLDEYG 313
Query: 324 LPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDK 383
L R PKWGHLKELH A+KLC LL G +SN SLG S EA V+ S CAAFL N
Sbjct: 314 LTREPKWGHLKELHAAVKLCSTPLLTGTKSNFSLGQSVEAIVFKTESNECAAFLVNR-GA 372
Query: 384 NDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGS 443
D V+F+NV+Y LP S+SILPDCK V FNT V Q +T M+ +Q +
Sbjct: 373 IDSNVLFQNVTYELPLGSISILPDCKNVAFNTRRVSVQHNTRSMMA--VQKFDL------ 424
Query: 444 KGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
L+W+ FKE + + + ++H+ TTKD +DYLWYT + + + S+
Sbjct: 425 --LEWEEFKEPIPNIDDTELRANELLEHMGTTKDRSDYLWYTFRVQQDSPD------SQQ 476
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
L ++S+ HALHAF N + GSA G F I+L+ G N I+LLS+ VGL ++G
Sbjct: 477 TLEVDSRAHALHAFVNGDYAGSAHGIYKEKGFSLAKNITLRNGINNISLLSVMVGLPDSG 536
Query: 564 PFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEP 623
F E AG+ V I G D S W YK+GL GE I+ +N+ W
Sbjct: 537 AFLETRVAGLRRVGIQG-----EDFSEQHWGYKVGLSGEQSQIFLDTGSSNVQWSRLGN- 590
Query: 624 PKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQ 683
+QPLTWYK PPGD+PI L++ MGKG W+NG IGRYW
Sbjct: 591 -SSQPLTWYKTQFDAPPGDDPIALNLGSMGKGAVWVNGRGIGRYWV-------------- 635
Query: 684 ECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITF 737
+T GEPSQ+WY++PRS+ KP++N LVI EE+ G+P +I+
Sbjct: 636 -----------SFLTPKGEPSQKWYNVPRSFLKPTDNQLVILEEETGNPVEISL 678
>gi|302141788|emb|CBI18991.3| unnamed protein product [Vitis vinifera]
Length = 821
Score = 649 bits (1673), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/726 (46%), Positives = 445/726 (61%), Gaps = 49/726 (6%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
G+VTYD RSLIING+R L+ S +IHYPRS P MWP L+ +AKEGG++ IE+Y FWN HE
Sbjct: 29 GGSVTYDGRSLIINGQRRLLFSGSIHYPRSTPEMWPSLISKAKEGGIDVIETYAFWNQHE 88
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
G+Y F GR ++VKF K +Q +Y LRIGPF+ +E+NYGG+P WLH +PG ++R+D
Sbjct: 89 PKQGQYDFSGRLDIVKFFKEVQAQGLYACLRIGPFIESEWNYGGLPFWLHDVPGIIYRSD 148
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK++MQ F T IV++MK E L+ASQGGPIIL+Q+ENEY E+ + E G Y WAA
Sbjct: 149 NEPFKFYMQNFTTKIVNLMKSENLYASQGGPIILSQIENEYKNVEAAFHEKGPPYVRWAA 208
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQ--FTPHSPSMPKIWTENWPGWFKTF 262
KMAV GVPW+MC+Q D PDPVIN CN C + P+ P+ P IWTENW ++ +
Sbjct: 209 KMAVDLQTGVPWVMCKQDDAPDPVINACNGMKCGETFAGPNKPNKPAIWTENWTSVYEVY 268
Query: 263 GGRDPHRPSEDIAFSVARFF-QKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDE 321
G R +ED+AF VA F +K GS NYYMYHGGTNFGRT+ +T YD +AP+DE
Sbjct: 269 GEDKRGRAAEDLAFQVALFIAKKNGSFINYYMYHGGTNFGRTSSSYVLTAYYD-QAPLDE 327
Query: 322 YGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMD 381
YGL R PKWGHLKELH IKLC LL+G + N SLG QEA ++ SG CAAFL N D
Sbjct: 328 YGLIRQPKWGHLKELHAVIKLCSDTLLHGVQYNYSLGQLQEAYLFKRPSGQCAAFLVNND 387
Query: 382 DKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDN 441
+ + TV+F+N +Y L A S+SILPDCKK+ FNTA V Q +T + +
Sbjct: 388 KRRNVTVLFQNTNYELAANSISILPDCKKIAFNTAKVSTQFNTRSV--------QTRATF 439
Query: 442 GSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGS 501
GS +W ++E +G S ++H+ TTKD +DYLWYT I N + +
Sbjct: 440 GSTK-QWSEYREGIPSFGGTPLKASMLLEHMGTTKDASDYLWYTLRFIQNSSN------A 492
Query: 502 RPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQN 561
+PVL ++S H LHAF N + SA G+ + F N + L +G N I+LLS+ VGL +
Sbjct: 493 QPVLRVDSLAHVLHAFVNGKYIASAHGSHQNGSFSLVNKVPLNSGLNRISLLSVMVGLPD 552
Query: 562 AGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTM 621
AGP+ E AGI V+I + D S + W Y++GL GE IY + W +
Sbjct: 553 AGPYLEHKVAGIRRVEIQD-GGDSKDFSKHPWGYQVGLMGEKSQIYTSPGSQKVQW-HGL 610
Query: 622 EPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDEC 681
PLTWYK + PPG++P+ L MGKG AW+NG+ IGRYW
Sbjct: 611 GSHGRGPLTWYKTLFDAPPGNDPVVLFFGSMGKGEAWVNGQSIGRYW------------- 657
Query: 682 VQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI---TFS 738
Y +T GEPSQ WY++PR++ P N+LV+ EE+ GDP KI T S
Sbjct: 658 ---VSY---------LTPSGEPSQTWYNVPRAFLNPKGNLLVVQEEESGDPLKISIGTVS 705
Query: 739 IRKISG 744
+ + G
Sbjct: 706 VTNVCG 711
>gi|225459613|ref|XP_002284529.1| PREDICTED: beta-galactosidase 16-like [Vitis vinifera]
Length = 813
Score = 649 bits (1673), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/726 (46%), Positives = 445/726 (61%), Gaps = 49/726 (6%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
G+VTYD RSLIING+R L+ S +IHYPRS P MWP L+ +AKEGG++ IE+Y FWN HE
Sbjct: 21 GGSVTYDGRSLIINGQRRLLFSGSIHYPRSTPEMWPSLISKAKEGGIDVIETYAFWNQHE 80
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
G+Y F GR ++VKF K +Q +Y LRIGPF+ +E+NYGG+P WLH +PG ++R+D
Sbjct: 81 PKQGQYDFSGRLDIVKFFKEVQAQGLYACLRIGPFIESEWNYGGLPFWLHDVPGIIYRSD 140
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK++MQ F T IV++MK E L+ASQGGPIIL+Q+ENEY E+ + E G Y WAA
Sbjct: 141 NEPFKFYMQNFTTKIVNLMKSENLYASQGGPIILSQIENEYKNVEAAFHEKGPPYVRWAA 200
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIWTENWPGWFKTF 262
KMAV GVPW+MC+Q D PDPVIN CN C + P+ P+ P IWTENW ++ +
Sbjct: 201 KMAVDLQTGVPWVMCKQDDAPDPVINACNGMKCGETFAGPNKPNKPAIWTENWTSVYEVY 260
Query: 263 GGRDPHRPSEDIAFSVARFF-QKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDE 321
G R +ED+AF VA F +K GS NYYMYHGGTNFGRT+ +T YD +AP+DE
Sbjct: 261 GEDKRGRAAEDLAFQVALFIAKKNGSFINYYMYHGGTNFGRTSSSYVLTAYYD-QAPLDE 319
Query: 322 YGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMD 381
YGL R PKWGHLKELH IKLC LL+G + N SLG QEA ++ SG CAAFL N D
Sbjct: 320 YGLIRQPKWGHLKELHAVIKLCSDTLLHGVQYNYSLGQLQEAYLFKRPSGQCAAFLVNND 379
Query: 382 DKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDN 441
+ + TV+F+N +Y L A S+SILPDCKK+ FNTA V Q +T + +
Sbjct: 380 KRRNVTVLFQNTNYELAANSISILPDCKKIAFNTAKVSTQFNTRSV--------QTRATF 431
Query: 442 GSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGS 501
GS +W ++E +G S ++H+ TTKD +DYLWYT I N + +
Sbjct: 432 GSTK-QWSEYREGIPSFGGTPLKASMLLEHMGTTKDASDYLWYTLRFIQNSSN------A 484
Query: 502 RPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQN 561
+PVL ++S H LHAF N + SA G+ + F N + L +G N I+LLS+ VGL +
Sbjct: 485 QPVLRVDSLAHVLHAFVNGKYIASAHGSHQNGSFSLVNKVPLNSGLNRISLLSVMVGLPD 544
Query: 562 AGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTM 621
AGP+ E AGI V+I + D S + W Y++GL GE IY + W +
Sbjct: 545 AGPYLEHKVAGIRRVEIQD-GGDSKDFSKHPWGYQVGLMGEKSQIYTSPGSQKVQW-HGL 602
Query: 622 EPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDEC 681
PLTWYK + PPG++P+ L MGKG AW+NG+ IGRYW
Sbjct: 603 GSHGRGPLTWYKTLFDAPPGNDPVVLFFGSMGKGEAWVNGQSIGRYW------------- 649
Query: 682 VQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI---TFS 738
Y +T GEPSQ WY++PR++ P N+LV+ EE+ GDP KI T S
Sbjct: 650 ---VSY---------LTPSGEPSQTWYNVPRAFLNPKGNLLVVQEEESGDPLKISIGTVS 697
Query: 739 IRKISG 744
+ + G
Sbjct: 698 VTNVCG 703
>gi|356507642|ref|XP_003522573.1| PREDICTED: beta-galactosidase 16-like [Glycine max]
Length = 696
Score = 648 bits (1671), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/733 (46%), Positives = 452/733 (61%), Gaps = 52/733 (7%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
I I + NVTYD RSLII+G+ +++ S +IHYPRS P MWP L+ +AKEGG++
Sbjct: 12 FILIRVFIGAVYGDNVTYDGRSLIIDGQHKILFSGSIHYPRSTPQMWPNLIAKAKEGGLD 71
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
I++YVFWN HE G+Y F G N+V+FIK IQ +Y+ LRIGP++ +E YGG+P+W
Sbjct: 72 VIQTYVFWNLHEPQQGQYDFRGMRNIVRFIKEIQAQGLYVTLRIGPYIESECTYGGLPLW 131
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFY 192
LH IPG VFR+D E FK+HMQ+F IV++MK LFASQGGPIIL+Q+ENEYG E +
Sbjct: 132 LHDIPGIVFRSDNEQFKFHMQRFTAKIVNLMKSANLFASQGGPIILSQIENEYGNVEGAF 191
Query: 193 GEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKI 250
E G Y WAA+MAV GVPW+MC+Q + PDPVINTCN C + P+SP+ P +
Sbjct: 192 HEKGLSYIRWAAQMAVGLQTGVPWVMCKQDNAPDPVINTCNGMQCGKTFKGPNSPNKPSL 251
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT 310
WTENW +++ FG R +EDIA++VA F K GS NYYMYHGGTNF R A F+
Sbjct: 252 WTENWTSFYQVFGEVPYIRSAEDIAYNVALFIAKRGSYVNYYMYHGGTNFDRIASA-FVV 310
Query: 311 TSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSS 370
T+Y EAP+DEYGL R PKWGHLKELH AIK C ++LL G +++ SLG+ Q A V+ SS
Sbjct: 311 TAYYDEAPLDEYGLVREPKWGHLKELHEAIKSCSNSLLYGTQTSFSLGTQQNAYVFRRSS 370
Query: 371 GACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPE 430
CAAFL N +D++ T+ F+N+ Y LP S+SILPDCK V FNTA VRAQ++ +
Sbjct: 371 IECAAFLENTEDRS-VTIQFQNIPYQLPPNSISILPDCKNVAFNTAKVRAQNA--RAMKS 427
Query: 431 NLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIV 490
LQ + A KW+V++E + + + +D I+T KDT+DYLWYT +
Sbjct: 428 QLQFNSAE--------KWKVYREAIPSFADTSLRANTLLDQISTAKDTSDYLWYTFRLYD 479
Query: 491 NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
N ++ +L S GH LHAF N L GS G+ + F +N ++L +G N I
Sbjct: 480 NSAN------AQSILSAYSHGHVLHAFVNGNLVGSKHGSHKNVSFVMENKLNLISGMNNI 533
Query: 551 ALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPG 610
+ LS TVGL N+G + E AG+ S+K+ G D + +W Y++GL GE L IY
Sbjct: 534 SFLSATVGLPNSGAYLEGRVAGLRSLKVQG-----RDFTNQAWGYQVGLLGEKLQIYTAS 588
Query: 611 YRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPR 670
+ + W S + K PLTWYK P G++P+ L++ MGKG W+NG+ IGRYW
Sbjct: 589 GSSKVKWESFLSSTK--PLTWYKTTFDAPVGNDPVVLNLGSMGKGYTWVNGQGIGRYW-- 644
Query: 671 KSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGG 730
S H T G PSQ+WYHIPRS K + N+LV+ EE+ G
Sbjct: 645 ----VSFH-------------------TPQGTPSQKWYHIPRSLLKSTGNLLVLLEEETG 681
Query: 731 DPTKITFSIRKIS 743
+P IT I+
Sbjct: 682 NPLGITLDTVYIT 694
>gi|356532710|ref|XP_003534914.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 650
Score = 647 bits (1669), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 320/640 (50%), Positives = 420/640 (65%), Gaps = 28/640 (4%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
+VTYD ++++++G+R ++IS +IHYPRS P MWP L+Q+AK+GG++ I++YVFWNGHE
Sbjct: 23 ASVTYDHKAIVVDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIQTYVFWNGHEP 82
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
SPG+YYF RF+LVKF+K+ QQA +Y+ LRIGP++ AE+N GG PVWL Y+PG FR D
Sbjct: 83 SPGQYYFEDRFDLVKFVKLAQQAGLYVHLRIGPYICAEWNLGGFPVWLKYVPGIAFRTDN 142
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
EPFK MQKF IV +MK +LF SQGGPIIL+Q+ENEYG E G GK Y WAA+
Sbjct: 143 EPFKAAMQKFTAKIVSLMKENRLFQSQGGPIILSQIENEYGPVEWEIGAPGKAYTKWAAQ 202
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGR 265
MAV + GVPW+MC+Q D PDPVI+TCN FYC+ F P+ + PK+WTENW GW+ FGG
Sbjct: 203 MAVGLDTGVPWVMCKQEDAPDPVIDTCNGFYCENFKPNKNTKPKMWTENWTGWYTDFGGA 262
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
P RP+ED+AFSVARF Q GGS NYYMYHGGTNFGRT+GG FI TSYDY+AP+DEYGL
Sbjct: 263 VPRRPAEDLAFSVARFIQNGGSFVNYYMYHGGTNFGRTSGGLFIATSYDYDAPLDEYGLE 322
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKND 385
PK+ HL+ LH AIK E AL+ + SLG + EA V++ + GACAAF+AN D K+
Sbjct: 323 NEPKYEHLRALHKAIKQSEPALVATDPKVQSLGYNLEAHVFS-APGACAAFIANYDTKSY 381
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
F N Y LP WS+SILPDCK VV+NTA V +M P N
Sbjct: 382 AKAKFGNGQYDLPPWSISILPDCKTVVYNTAKV-GYGWLKKMTPVN------------SA 428
Query: 446 LKWQVFKEIAGIWGEADFVKS-GFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
WQ + E +AD + + + +N T+D++DYLWY T + VN NE FLKNG P+
Sbjct: 429 FAWQSYNEEPASSSQADSIAAYALWEQVNVTRDSSDYLWYMTDVNVNANEGFLKNGQSPL 488
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
L + S GH LH F N +L G+ G +P + + + L+AG N+++LLS+ VGL N G
Sbjct: 489 LTVMSAGHVLHVFINGQLAGTVWGGLGNPKLTFSDNVKLRAGNNKLSLLSVAVGLPNVGV 548
Query: 565 FYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEP 623
+E AG+ V + G N GT DLS W+YK+GL+GE L ++ +++ W+
Sbjct: 549 HFETWNAGVLGPVTLKGLNEGTRDLSRQKWSYKVGLKGESLSLHTESGSSSVEWIQGSLV 608
Query: 624 PKNQPLTWYKA------------VVKQPPGDEPIGLDMLK 651
K QPLTWY VV + G +P G+ ++K
Sbjct: 609 AKKQPLTWYHVPRSWLSSGGNSLVVFEEWGGDPNGIALVK 648
Score = 46.2 bits (108), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 19/34 (55%), Positives = 21/34 (61%)
Query: 707 WYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
WYH+PRSW N LV+FEE GGDP I R
Sbjct: 616 WYHVPRSWLSSGGNSLVVFEEWGGDPNGIALVKR 649
>gi|356518551|ref|XP_003527942.1| PREDICTED: beta-galactosidase 16-like [Glycine max]
Length = 697
Score = 647 bits (1668), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/725 (46%), Positives = 446/725 (61%), Gaps = 52/725 (7%)
Query: 21 TYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFW 80
T + GNVTYD RSLII+G+ +++ S +IHYPRS P MWP L+ +AKEGG++ I++YVFW
Sbjct: 21 TTVYGGNVTYDGRSLIIDGQHKILFSGSIHYPRSTPQMWPNLIAKAKEGGLDVIQTYVFW 80
Query: 81 NGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTV 140
N HE G+Y F G N+V+FIK IQ +Y+ LRIGP++ +E YGG+P+WLH IPG V
Sbjct: 81 NLHEPQQGQYDFRGMRNIVRFIKEIQAQGLYVTLRIGPYIESECTYGGLPLWLHDIPGIV 140
Query: 141 FRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYA 200
FR+D E FK+HMQKF IV++MK LFASQGGPIIL+Q+ENEYG E + E G Y
Sbjct: 141 FRSDNEQFKFHMQKFSAKIVNLMKSANLFASQGGPIILSQIENEYGNVEGAFHEKGLSYI 200
Query: 201 LWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIWTENWPGW 258
WAA+MAV GVPW+MC+Q + PDPVINTCN C + P+SP+ P +WTENW +
Sbjct: 201 RWAAQMAVGLQTGVPWVMCKQDNAPDPVINTCNGMQCGKTFKGPNSPNKPSLWTENWTSF 260
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAP 318
++ FG R +EDIA++VA F K GS NYYMYHGGTNF R A IT YD EAP
Sbjct: 261 YQVFGEVPYIRSAEDIAYNVALFIAKRGSYVNYYMYHGGTNFDRIASAFVITAYYD-EAP 319
Query: 319 IDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLA 378
+DEYGL R PKWGHLKELH AIK C +++L+G +++ SLG+ Q A V+ SS CAAFL
Sbjct: 320 LDEYGLVREPKWGHLKELHAAIKSCSNSILHGTQTSFSLGTQQNAYVFKRSSIECAAFLE 379
Query: 379 NMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEAS 438
N +D++ T+ F+N+ Y LP S+SILPDCK V FNTA V Q++ +E
Sbjct: 380 NTEDQS-VTIQFQNIPYQLPPNSISILPDCKNVAFNTAKVSIQNARAMKSQLEFNSAET- 437
Query: 439 PDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLK 498
W+V+KE +G+ + +D I+TTKDT+DYLWYT + N
Sbjct: 438 ---------WKVYKEAIPSFGDTSLRANTLLDQISTTKDTSDYLWYTFRLYDNSPN---- 484
Query: 499 NGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVG 558
++ +L S GH LHAF N L GS G+ + F +N ++L G N I+ LS TVG
Sbjct: 485 --AQSILSAYSHGHVLHAFVNGNLVGSIHGSHKNLSFVMENKLNLINGMNNISFLSATVG 542
Query: 559 LQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV 618
L N+G + E AG+ S+K+ G D + +W Y+IGL GE L IY + + W
Sbjct: 543 LPNSGAYLERRVAGLRSLKVQG-----RDFTNQAWGYQIGLLGEKLQIYTASGSSKVQWE 597
Query: 619 STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPH 678
S K PLTWYK P G++P+ L++ MGKG W+NG+ IGRYW S H
Sbjct: 598 SFQSSTK--PLTWYKTTFDAPVGNDPVVLNLGSMGKGYTWINGQGIGRYW------VSFH 649
Query: 679 DECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFS 738
T G PSQ+WYHIPRS K + N+LV+ EE+ G+P IT
Sbjct: 650 -------------------TPQGTPSQKWYHIPRSLLKSTGNLLVLLEEETGNPLGITLD 690
Query: 739 IRKIS 743
I+
Sbjct: 691 TVYIT 695
>gi|357464801|ref|XP_003602682.1| Beta-galactosidase [Medicago truncatula]
gi|355491730|gb|AES72933.1| Beta-galactosidase [Medicago truncatula]
Length = 719
Score = 646 bits (1666), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/722 (46%), Positives = 439/722 (60%), Gaps = 44/722 (6%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A VTYD RSLIING+R ++ S +IHYPRS P MWPGL+ +AK+GG++ I++YVFWN HE
Sbjct: 24 AEEVTYDGRSLIINGQRNILFSGSIHYPRSTPQMWPGLIAKAKQGGLDVIQTYVFWNLHE 83
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
PGKY F GR +LV FIK I +Y+ LRIGPF+ +E+NYGG P WLH +PG V+R D
Sbjct: 84 PQPGKYDFSGRNDLVGFIKEIHAQGLYVSLRIGPFIESEWNYGGFPFWLHDVPGIVYRTD 143
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK++MQ F T IV+MMK E L+ASQGGPIIL+Q+ENEYG + +G G +Y WAA
Sbjct: 144 NEPFKFYMQNFTTKIVNMMKEEGLYASQGGPIILSQIENEYGNIQKAFGTAGSQYVEWAA 203
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFT-PHSPSMPKIWTENWPGWFKTF 262
KMAV N GVPW+MC+Q D PDPVINTCN C + FT P+SP+ P +WTENW +++ +
Sbjct: 204 KMAVGLNTGVPWVMCKQPDAPDPVINTCNGMRCGETFTGPNSPNKPAMWTENWTSFYQVY 263
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
GG R +EDIAF V F + GS NYYMYHGGTNFGRT+ IT YD +AP+DEY
Sbjct: 264 GGVPYIRSAEDIAFHVTLFVARNGSFVNYYMYHGGTNFGRTSSAYMITGYYD-QAPLDEY 322
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
GL R PKWGHLKELH AIK C LL G + N SLG QE V+ + +G CAAFL N D
Sbjct: 323 GLFRQPKWGHLKELHAAIKSCSTTLLQGVQRNFSLGELQEGYVFEEENGKCAAFLINNDK 382
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
N TV F N SY L S+SILPDC+ V FNTA++ S+ + S N
Sbjct: 383 GNTVTVQFNNSSYKLLPKSISILPDCQNVAFNTAHLNTTSNRRII---------TSRQNF 433
Query: 443 SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
S W+ F+++ + + ++ +NTTKD +DYLWYT + N + +
Sbjct: 434 SSVDDWKQFQDVIPNFDDTSLRSDSLLEQMNTTKDKSDYLWYTLRLENN------LSCND 487
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
P+L ++S H +AF N G GN F + PI+L N I++LS VGL ++
Sbjct: 488 PILHVQSSAHVAYAFVNNTYIGGEHGNHDVKSFTLELPITLNERTNNISILSGMVGLPDS 547
Query: 563 GPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
G F E AG+ +V++ +L+L+ +W Y++GL GE L +Y +I W
Sbjct: 548 GAFLEKRFAGLNNVELQCSEQESLNLNNSTWGYQVGLLGEQLKVYTEQNSTDIKWTQLGN 607
Query: 623 PPKNQ-PLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDEC 681
++ LTWYK P GD+PI LD+ M KG AW+NG+ IGRYW
Sbjct: 608 ITIDEVTLTWYKTTFDTPKGDDPIALDLSSMAKGEAWVNGQSIGRYW------------- 654
Query: 682 VQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRK 741
+ D +G PSQ YH+PRS+ K SEN LV+ +E GG+P I+ +
Sbjct: 655 ILFLDSKGN------------PSQSLYHVPRSFLKDSENSLVLLDEGGGNPLDISLNTVS 702
Query: 742 IS 743
++
Sbjct: 703 VT 704
>gi|356527530|ref|XP_003532362.1| PREDICTED: beta-galactosidase 6-like [Glycine max]
Length = 673
Score = 643 bits (1658), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 335/715 (46%), Positives = 439/715 (61%), Gaps = 55/715 (7%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
VTYD RSLII+G+R+++ S +IHYPRS P MWP L+ +AKEGG++ I++YVFWN HE
Sbjct: 2 AEVTYDGRSLIIDGQRKILFSGSIHYPRSTPQMWPALISKAKEGGLDVIQTYVFWNLHEP 61
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
G+Y F GR++LV+FIK IQ +Y+ LRIGP++ +E+ YGG P WLH +P V+R D
Sbjct: 62 QFGQYDFSGRYDLVRFIKEIQVQGLYVCLRIGPYIESEWTYGGFPFWLHDVPAIVYRTDN 121
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
+PFK +MQ F T IV MM+ E L+ASQGGPIIL+Q+ENEY E +GE G RY WAA+
Sbjct: 122 QPFKLYMQNFTTKIVSMMQSEGLYASQGGPIILSQIENEYQNVEKAFGEDGSRYVQWAAE 181
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFT-PHSPSMPKIWTENWPGWFKTFG 263
MAV GVPW+MC+Q D PDP+INTCN C + FT P+SP+ P WTENW +++ +G
Sbjct: 182 MAVGLKTGVPWLMCKQTDAPDPLINTCNGMRCGETFTGPNSPNKPAFWTENWTSFYQVYG 241
Query: 264 GRDPHRPSEDIAFSVARFF-QKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
G R +EDIAF V F +K GS NYYMYHGGTN GRT+ IT+ YD +AP+DEY
Sbjct: 242 GEPYIRSAEDIAFHVTLFIARKNGSYVNYYMYHGGTNLGRTSSSYVITSYYD-QAPLDEY 300
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
GL R PKWGHLKELH AIK C LL G++SN SLG QE V+ + G C AFL N D
Sbjct: 301 GLLRQPKWGHLKELHAAIKSCSTTLLEGKQSNFSLGQLQEGYVF-EEEGKCVAFLVNNDH 359
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
TV FRN SY LP+ S+SILPDC+ V FNTA V +S+ + ++
Sbjct: 360 VKMFTVQFRNRSYELPSKSISILPDCQNVTFNTATVNTKSN---------RRMTSTIQTF 410
Query: 443 SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
S KW+ F+++ + + + + ++ +N TKD +DYLWYT S
Sbjct: 411 SSADKWEQFQDVIPNFDQTTLISNSLLEQMNVTKDKSDYLWYTL--------------SE 456
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
L +S H HAFA+ G A G+ F + P+ L G N I++LS+ VGL +A
Sbjct: 457 SKLTAQSAAHVTHAFADGTYLGGAHGSHDVKSFTTQVPLKLNEGTNNISILSVMVGLPDA 516
Query: 563 GPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
G F E AG+T+V+I + + DL+ +W Y++GL GE L IY ++I W S +
Sbjct: 517 GAFLERRFAGLTAVEIQ-CSEESYDLTNSTWGYQVGLLGEQLEIYEEKSNSSIQW-SPLG 574
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
NQ LTWYK P GDEP+ L++ MGKG AW+NGE IGRYW S HD
Sbjct: 575 NTCNQTLTWYKTAFDSPKGDEPVALNLESMGKGQAWVNGESIGRYWI------SFHD--- 625
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITF 737
G+PSQ YH+PRS+ K N LV+FEE+GG+P I+
Sbjct: 626 ----------------SKGQPSQTLYHVPRSFLKDIGNSLVLFEEEGGNPLHISL 664
>gi|356507439|ref|XP_003522474.1| PREDICTED: beta-galactosidase 6-like [Glycine max]
Length = 717
Score = 643 bits (1658), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/739 (45%), Positives = 441/739 (59%), Gaps = 45/739 (6%)
Query: 7 IAPFALLIFFSSSITYCF-AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQ 65
+A LL+F+ + A VTYD RSLII+G+R+++ S IHYPRS P MWP L+ +
Sbjct: 5 VALVLLLVFWKIREGFGVKAEEVTYDGRSLIIDGQRKILFSGLIHYPRSTPQMWPDLIAK 64
Query: 66 AKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYN 125
AK+GG++ I++YVFWN HE PG Y F GR++LV FIK IQ +Y+ LRIGPF+ +E+
Sbjct: 65 AKQGGLDVIQTYVFWNLHEPQPGMYDFRGRYDLVGFIKEIQAQGLYVCLRIGPFIQSEWK 124
Query: 126 YGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEY 185
YGG P WLH +PG V+R D E FK++MQ F T IV+MMK E L+ASQGGPIIL+Q+ENEY
Sbjct: 125 YGGFPFWLHDVPGIVYRTDNESFKFYMQNFTTKIVNMMKEEGLYASQGGPIILSQIENEY 184
Query: 186 GYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFT-PH 243
+ +G G +Y WAAKMAV N GVPW+MC+Q D PDPVINTCN C + FT P+
Sbjct: 185 QNIQKAFGTAGSQYVQWAAKMAVGLNTGVPWVMCKQTDAPDPVINTCNGMRCGETFTGPN 244
Query: 244 SPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRT 303
SP+ P +WTENW +++ +GG R +EDIAF V F + GS NYYMYHGGTNFGRT
Sbjct: 245 SPNKPALWTENWTSFYQVYGGLPYIRSAEDIAFHVTLFIARNGSYVNYYMYHGGTNFGRT 304
Query: 304 AGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEA 363
A IT YD +AP+DEYGL R PKWGHLK+LH IK C LL G + N SLG QE
Sbjct: 305 ASAYVITGYYD-QAPLDEYGLLRQPKWGHLKQLHEVIKSCSTTLLQGVQRNFSLGQLQEG 363
Query: 364 DVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSS 423
V+ + G C AFL N D N TV FRN SY L S+SILPDC+ V FNTANV S+
Sbjct: 364 YVFEEEKGECVAFLKNNDRDNKVTVQFRNRSYELLPRSISILPDCQNVAFNTANVNTTSN 423
Query: 424 TVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLW 483
+ P+ N S W+ F+++ + ++ +NTTKD +DYLW
Sbjct: 424 RRIISPKQ---------NFSSLDDWKQFQDVIPYFDNTSLRSDSLLEQMNTTKDKSDYLW 474
Query: 484 YTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISL 543
YT E+ + +P L ++S H HAF N G GN F + P+++
Sbjct: 475 YTLRF------EYNLSCRKPTLSVQSAAHVAHAFINNTYIGGEHGNHDVKSFTLELPVTV 528
Query: 544 KAGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEH 603
G N +++LS VGL ++G F E AG+ SV++ +L+L+ +W Y++GL GE
Sbjct: 529 NQGTNNLSILSAMVGLPDSGAFLERRFAGLISVELQCSEQESLNLTNSTWGYQVGLLGEQ 588
Query: 604 LGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEE 663
L +Y ++I W S + Q L WYK P GD+P+ LD+ MGKG AW+N +
Sbjct: 589 LQVYKKQNNSDIGW-SQLGNIMEQLLIWYKTTFDTPEGDDPVVLDLSSMGKGEAWVNEQS 647
Query: 664 IGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILV 723
IGRYW F+ K G PSQ YH+PRS+ K + N+LV
Sbjct: 648 IGRYWIL--------------------FHDSK-----GNPSQSLYHVPRSFLKDTGNVLV 682
Query: 724 IFEEKGGDPTKITFSIRKI 742
+ EE GG+P I+ +
Sbjct: 683 LVEEGGGNPLGISLDTVSV 701
>gi|218184335|gb|EEC66762.1| hypothetical protein OsI_33138 [Oryza sativa Indica Group]
Length = 828
Score = 642 bits (1657), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/738 (46%), Positives = 454/738 (61%), Gaps = 66/738 (8%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD RSLI++G R ++IS +IHYPRS P MWP L+++AKEGG+N IE+YVFWNGHE
Sbjct: 31 VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
++ F G +++V+F K IQ A MY ILRIGP++ E+NYGG+PVWL IPG FR +P
Sbjct: 91 REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPVWLRDIPGIKFRLHNKP 150
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGY--YESFYGEGGKRYALWAAK 205
F+ M+ F TLIV MK +FA QGGPIILAQ+ENEYGY + + Y W A
Sbjct: 151 FENEMEAFTTLIVKKMKDANMFAGQGGPIILAQIENEYGYTMLQPENIQSAHEYIHWCAD 210
Query: 206 MAVAQNIGVPWIMCQQ-FDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MA QN+GVPWIMCQQ D P V+NTCN FYC ++ + S+PK+WTENW GW++ +
Sbjct: 211 MANKQNVGVPWIMCQQDNDVPPNVVNTCNGFYCHEWFSNRTSIPKMWTENWTGWYRDWDQ 270
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
+ RP+EDIAF+VA FFQ GS+ NYYMYHGGTNFGRTAGGP+ITTSYDY+AP+DEYG
Sbjct: 271 PEFRRPTEDIAFAVAMFFQMRGSLQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYGN 330
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDDK 383
R PK+GHLKELH + E LL+G+ + + G + Y +++ AC F+ N D
Sbjct: 331 LRQPKYGHLKELHSVLMSMEKILLHGDYIDTNYGDNVTVTKYTLNATSAC--FINNRFDD 388
Query: 384 NDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQ-------SSTVEM--------- 427
D V ++ LPAWSVSILPDCK V FN+A ++ Q +S VE
Sbjct: 389 RDVNVTLDGTTHFLPAWSVSILPDCKTVAFNSAKIKTQTTVMVNKTSMVEQQTEHFKWSW 448
Query: 428 VPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTS 487
+PENL+P + +F K+ ++ I TT D +DYLWY TS
Sbjct: 449 MPENLRPFMTDE--------------------KGNFRKNELLEQIVTTTDQSDYLWYRTS 488
Query: 488 IIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGK 547
+ E GS VL + + GH L+AF N +L G + F+ K+P+ L GK
Sbjct: 489 L------EHKGEGSY-VLYVNTTGHELYAFVNGKLVGQQYSPNENFTFQLKSPVKLHDGK 541
Query: 548 NEIALLSMTVGLQNAGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQGEHLG 605
N I+LLS TVGL+N G +E + AGI VK+ + +DLS SW+YK GL GE+
Sbjct: 542 NYISLLSGTVGLRNYGGSFELLPAGIVGGPVKLIDSSGSAIDLSNNSWSYKAGLAGEYRK 601
Query: 606 IY--NPGYRNNINWVS-TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGE 662
IY PG + W S P N+P TWYK + P G++ + +D+ + KG+AW+NG
Sbjct: 602 IYLDKPGNK----WRSHNSTIPINRPFTWYKTTFQAPAGEDSVVVDLHGLNKGVAWVNGN 657
Query: 663 EIGRYWPRKSRKSSPHDECVQECDYRGKFNPD----KCITGCGEPSQRWYHIPRSWFKPS 718
+GRYWP P CDYRG F + KC+TGCGEPSQ+ YH+PRS+
Sbjct: 658 SLGRYWPSYVAADMPG---CHHCDYRGVFKAEVEAQKCLTGCGEPSQQLYHVPRSFLHKG 714
Query: 719 E-NILVIFEEKGGDPTKI 735
E N L++FEE GGDP+++
Sbjct: 715 EPNTLILFEEAGGDPSEV 732
>gi|356522906|ref|XP_003530083.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 846
Score = 642 bits (1657), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/746 (43%), Positives = 455/746 (60%), Gaps = 37/746 (4%)
Query: 11 ALLIFFSSSITYCF-AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
A+ + S I+ A V+YD R+L I+G+R ++ S +IHYPRS P MWP L+++AKEG
Sbjct: 10 AMFLLCLSLISIAINALEVSYDERALTIDGKRRILFSGSIHYPRSTPEMWPYLIRKAKEG 69
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ IE+YVFWN HE +Y F +LV+FI+ IQ+ +Y ++RIGP++++E+NYGG+
Sbjct: 70 GLDVIETYVFWNAHEPQRRQYDFSENLDLVRFIRTIQKEGLYAMIRIGPYISSEWNYGGL 129
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWLH IP FR F M+ F IVDMM+ E LFA QGGPII+AQ+ENEYG
Sbjct: 130 PVWLHNIPNMEFRTHNRAFMEEMKTFTRKIVDMMQDETLFAVQGGPIIIAQIENEYGNVM 189
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
YG G +Y W A++A + GVPW+M QQ + P +I++C+ +YCDQF P+ PK
Sbjct: 190 HAYGNNGTQYLKWCAQLADSFETGVPWVMSQQSNAPQFMIDSCDGYYCDQFQPNDNHKPK 249
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
IWTENW G +K +G ++PHRP+ED+A++VARFFQ GG+ NYYMYHGGTNF RTAGGP++
Sbjct: 250 IWTENWTGGYKNWGTQNPHRPAEDVAYAVARFFQFGGTFQNYYMYHGGTNFKRTAGGPYV 309
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TTSYDY+AP+DEYG PKWGHL++LH +K E+ L G + G+ A VY
Sbjct: 310 TTSYDYDAPLDEYGNLNQPKWGHLRQLHNLLKSKENILTQGSSQHTDYGNMVTATVYT-Y 368
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
G F+ N D T+ FRN Y +PAWSVSILP+C +NTA V Q++ MV
Sbjct: 369 DGKSTCFIGNAHQSKDATINFRNNEYTIPAWSVSILPNCSSEAYNTAKVNTQTTI--MVK 426
Query: 430 ENLQPSEASPDNGSKGLKWQ------VFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLW 483
++ + E + L+WQ V + I G D +D T D +DYLW
Sbjct: 427 KDNEDLEYA-------LRWQWRQEPFVQMKDGQITGIIDLTAPKLLDQKVVTNDFSDYLW 479
Query: 484 YTTSIIV--NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPI 541
Y TSI + +++ + K L + + GH LH F N + G+ F +++ I
Sbjct: 480 YITSIDIKGDDDPSWTKEFR---LRVHTSGHVLHVFVNGKHVGTQHAKNGQFKFVHESKI 536
Query: 542 SLKAGKNEIALLSMTVGLQNAGPFYEWVGAG-------ITSVKITGFNSGTL--DLSTYS 592
L GKNEI+LLS TVGL N GPF++ + G + +V ++ + DLS
Sbjct: 537 KLTTGKNEISLLSTTVGLPNYGPFFDNIEVGVLGPVQLVAAVGDYDYDDDEIVKDLSKNQ 596
Query: 593 WTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKM 652
W+YK+GL GEH Y+ Y N++ T P ++ L WYK K P GD+P+ +D+ +
Sbjct: 597 WSYKVGLHGEHEMHYS--YENSLKTWYTDAVPTDRILVWYKTTFKSPIGDDPVVVDLSGL 654
Query: 653 GKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPR 712
GKG AW+NG IGRYW S + + C +CDYRG + +KC++ C +PSQRWYH+PR
Sbjct: 655 GKGHAWVNGNSIGRYW---SSYLADENGCSPKCDYRGPYTSNKCLSMCAQPSQRWYHVPR 711
Query: 713 SWFK-PSENILVIFEEKGGDPTKITF 737
S+ + +N LV+FEE GG P + F
Sbjct: 712 SFLRDDDQNTLVLFEELGGQPYYVNF 737
>gi|224066807|ref|XP_002302225.1| predicted protein [Populus trichocarpa]
gi|222843951|gb|EEE81498.1| predicted protein [Populus trichocarpa]
Length = 798
Score = 641 bits (1654), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/725 (46%), Positives = 449/725 (61%), Gaps = 53/725 (7%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NVTYDSRSL+ING+ ++I S +IHYPRS P MWP L+ +A+ GG++ I++YVFWN HE
Sbjct: 7 NVTYDSRSLVINGKHKIIFSGSIHYPRSTPQMWPYLISKARAGGLDAIDTYVFWNLHEPQ 66
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+Y F GR +LV+FIK + +Y+ LRIGPF+ +E+ YGG+P WLH +PG VFR+D +
Sbjct: 67 QGQYDFSGRKDLVRFIKEVHAQGLYVCLRIGPFIESEWTYGGLPFWLHDVPGIVFRSDNK 126
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFKYHM+++ +IV M+K EKL+ASQGGPIIL+Q+ENEYG E+ + E G Y WAAKM
Sbjct: 127 PFKYHMERYAKMIVKMLKAEKLYASQGGPIILSQIENEYGNVEAAFHEKGPPYVKWAAKM 186
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFT-PHSPSMPKIWTENWPGWFKTFGG 264
AV + GVPW+MC+Q D PDPVIN CN C + F+ P+SP P IWTENW ++T+G
Sbjct: 187 AVGLHTGVPWVMCKQDDAPDPVINACNGLRCGETFSGPNSPRKPAIWTENWTSVYQTYGK 246
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
R +EDIAF A F KGGS NYYMYHGGTNFGRTA ++ TSY +AP+DEYGL
Sbjct: 247 ETRSRSAEDIAFHAALFIAKGGSFVNYYMYHGGTNFGRTA-AEYVPTSYYDQAPLDEYGL 305
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
R PK GHLKELH AIKLC LL+ + N SLG QEA + +S CAAFL N D ++
Sbjct: 306 LRQPKHGHLKELHAAIKLCRKPLLSRKWINFSLGQLQEAFAFERNSDECAAFLVNHDGRS 365
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
+ TV F+ SY LP S+SILP CK V FNTA V Q T L D+
Sbjct: 366 NATVHFKGSSYKLPPKSISILPHCKTVAFNTAQVSTQYGT------RLATRRHKFDSIE- 418
Query: 445 GLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
+W+ +KE + ++ + ++H+NTTKD++DYLWYT N + + V
Sbjct: 419 --QWKEYKEYIPSFDKSSLRANTLLEHMNTTKDSSDYLWYTFRFHQNSSN------AHSV 470
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
L + S GH LHAF N E GSA G+ + F + + LK G N ++LLS+ GL +AG
Sbjct: 471 LTVNSLGHNLHAFVNGEFIGSAHGSHDNKSFTLQRSLPLKRGTNYVSLLSVMTGLPDAGA 530
Query: 565 FYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVS--TME 622
+ E AG+ V I + D +TY W YK+GL GE++ + +RNN + + +
Sbjct: 531 YLERRVAGLRRVTIQRQHE-LHDFTTYLWGYKVGLSGENIQL----HRNNASVKAYWSRY 585
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
++PLTWYK++ P G++P+ L++ MGKG AW+NG IGRYW
Sbjct: 586 ASSSRPLTWYKSIFDAPAGNDPVALNLASMGKGEAWVNGRSIGRYWV------------- 632
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI---TFSI 739
+ G P Q W HIPRS+ KPS N+LVI EE+ G+P I T SI
Sbjct: 633 ------------SFLDSDGNPYQTWNHIPRSFLKPSGNLLVILEEERGNPLGISLGTMSI 680
Query: 740 RKISG 744
K+ G
Sbjct: 681 TKVCG 685
>gi|297842521|ref|XP_002889142.1| hypothetical protein ARALYDRAFT_476906 [Arabidopsis lyrata subsp.
lyrata]
gi|297334983|gb|EFH65401.1| hypothetical protein ARALYDRAFT_476906 [Arabidopsis lyrata subsp.
lyrata]
Length = 818
Score = 641 bits (1653), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/722 (46%), Positives = 434/722 (60%), Gaps = 48/722 (6%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A NVTYD RSLII+G+ +++ S +IHY RS P MWP L+ +AK GG++ I++YVFWN HE
Sbjct: 22 AANVTYDGRSLIIDGQHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVIDTYVFWNIHE 81
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
G++ F GR ++VKFIK ++ +Y+ LRIGPF+ E++YGG+P WLH + G VFR D
Sbjct: 82 PQQGQFDFSGRRDIVKFIKEVKAHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGIVFRTD 141
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFKYHM+++ +IV +MK E L+ASQGGPIIL+Q+ENEYG + + GK Y WAA
Sbjct: 142 NEPFKYHMKRYAQMIVKLMKSENLYASQGGPIILSQIENEYGMVARAFRQDGKSYVKWAA 201
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIWTENWPGWFKTF 262
K+AV + GVPW+MC+Q D PDP++N CN C + P+SP+ P IWTENW +++T+
Sbjct: 202 KLAVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSFYQTY 261
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
G R +EDIAF VA F K GS NYYMYHGGTNFGR A F+ TSY +AP+DEY
Sbjct: 262 GEEPLIRSAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVITSYYDQAPLDEY 320
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
GL R PKWGHLKELH A+KLCE LL+G ++ +SLG Q A V+ + CAA L N D
Sbjct: 321 GLLRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFGKKANLCAALLVN-QD 379
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
K D TV FRN SY L S+S+LPDCK V FNTA V AQ +T P N
Sbjct: 380 KCDCTVQFRNSSYRLSPKSISVLPDCKNVAFNTAKVNAQYNTRTRKPR---------QNL 430
Query: 443 SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
S W+ F E + E ++H+NTT+DT+DYLW TT +E G+
Sbjct: 431 SSPHMWEKFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQTTRFEQSE-------GAP 483
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
VL + GH LHAF N+ GS G F + +SL G N +ALLS+ VGL N+
Sbjct: 484 SVLKVNHLGHVLHAFVNERFIGSMHGTFKAHSFLLEKNMSLNNGTNNMALLSVMVGLPNS 543
Query: 563 GPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
G E G SV I S L + YSW Y++GL+GE +Y + W
Sbjct: 544 GAHLERRVVGSRSVNIWN-GSYQLFFNNYSWGYQVGLKGEKYHVYTEDGAKKVQW-KQYR 601
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
K+QPLTWYKA P G++P+ L++ MGKG AW+NG+ IGRYW
Sbjct: 602 DSKSQPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIGRYWV------------- 648
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIF-EEKGGDPTKITFSIRK 741
T G PSQ WYHIPRS+ KP+ N+LVI EE+ G P IT
Sbjct: 649 ------------SFYTSKGNPSQIWYHIPRSFLKPNSNLLVILEEEREGYPLGITIDTVS 696
Query: 742 IS 743
++
Sbjct: 697 VT 698
>gi|110739416|dbj|BAF01618.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 718
Score = 640 bits (1651), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 335/723 (46%), Positives = 442/723 (61%), Gaps = 45/723 (6%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A VTYD RSLII+G+R+L+ S +IHYPRS P MWP L+++AKEGG++ I++YVFWN HE
Sbjct: 29 AKGVTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPSLIKKAKEGGIDVIQTYVFWNLHE 88
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
G+Y F GR +LVKFIK I+ +Y+ LRIGPF+ AE+NYGG+P WL +PG V+R D
Sbjct: 89 PKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIEAEWNYGGLPFWLRDVPGMVYRTD 148
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK+HMQKF IVD+MK E L+ASQGGPIIL+Q+ENEY E + E G Y WA
Sbjct: 149 NEPFKFHMQKFTAKIVDLMKSEGLYASQGGPIILSQIENEYANVEGAFHEKGASYIKWAG 208
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIWTENWPGWFKTF 262
+MAV GVPWIMC+ D PDPVINTCN C + P+SP+ PK+WTE+W +F+ +
Sbjct: 209 QMAVGLKTGVPWIMCKSPDAPDPVINTCNGMKCGETFPGPNSPNKPKMWTEDWTSFFQVY 268
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
G R +EDIAF A F K GS NYYMYHGGTNFGRT+ FIT YD +AP+DEY
Sbjct: 269 GKEPYIRSAEDIAFHAALFVAKNGSYINYYMYHGGTNFGRTSSSYFITGYYD-QAPLDEY 327
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
GL R PK+GHLKELH AIK + LL G+++ LSLG Q+A V+ D++ C AFL N D
Sbjct: 328 GLLRQPKYGHLKELHAAIKSSANPLLQGKQTILSLGPMQQAYVFEDANNGCVAFLVNNDA 387
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
K + + FRN +Y L S+ IL +CK +++ TA V + +T P + PDN
Sbjct: 388 KASQ-IQFRNNAYSLSPKSIGILQNCKNLIYETAKVNVKMNTRVTTPVQV---FNVPDN- 442
Query: 443 SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
W +F+E + + ++H N TKD TDYLWYT+S ++ +
Sbjct: 443 -----WNLFRETIPAFPGTSLKTNALLEHTNLTKDKTDYLWYTSSFKLDS------PCTN 491
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
P + ES GH +H F N L GS G+ K + P+SL G+N I++LS VGL ++
Sbjct: 492 PSIYTESSGHVVHVFVNNALAGSGHGSRDIRVVKLQAPVSLINGQNNISILSGMVGLPDS 551
Query: 563 GPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINW-VSTM 621
G + E G+T V+I+ + +DLS W Y +GL GE + +Y N + W ++
Sbjct: 552 GAYMERRSYGLTKVQISCGGTKPIDLSRSQWGYSVGLLGEKVRLYQWKNLNRVKWSMNKA 611
Query: 622 EPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDEC 681
KN+PL WYK P GD P+GL M MGKG W+NGE IGRYW
Sbjct: 612 GLIKNRPLAWYKTTFDGPNGDGPVGLHMSSMGKGEIWVNGESIGRYWV------------ 659
Query: 682 VQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRK 741
+T G+PSQ YHIPR++ KPS N+LV+FEE+GGDP I+ +
Sbjct: 660 -------------SFLTPAGQPSQSIYHIPRAFLKPSGNLLVVFEEEGGDPLGISLNTIS 706
Query: 742 ISG 744
+ G
Sbjct: 707 VVG 709
>gi|222612650|gb|EEE50782.1| hypothetical protein OsJ_31141 [Oryza sativa Japonica Group]
Length = 828
Score = 640 bits (1651), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/738 (46%), Positives = 454/738 (61%), Gaps = 66/738 (8%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD RSLI++G R ++IS +IHYPRS P MWP L+++AKEGG+N IE+YVFWNGHE
Sbjct: 31 VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
++ F G +++V+F K IQ A MY ILRIGP++ E+NYGG+PVWL IPG FR +P
Sbjct: 91 REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPVWLRDIPGIKFRLHNKP 150
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGY--YESFYGEGGKRYALWAAK 205
F+ M+ F TLIV MK +FA QGGPIILAQ+ENEYGY + + Y W A
Sbjct: 151 FENGMEAFTTLIVKKMKDANMFAGQGGPIILAQIENEYGYTMLQPENIQSAHEYIHWCAD 210
Query: 206 MAVAQNIGVPWIMCQQ-FDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MA QN+GVPWIMCQQ D P V+NTCN FYC ++ + S+PK+WTENW GW++ +
Sbjct: 211 MANKQNVGVPWIMCQQDNDVPPNVVNTCNGFYCHEWFSNRTSIPKMWTENWTGWYRDWDQ 270
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
+ RP+EDIAF+VA FFQ GS+ NYYMYHGGTNFGRTAGGP+ITTSYDY+AP+DEYG
Sbjct: 271 PEFRRPTEDIAFAVAMFFQMRGSLQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYGN 330
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDDK 383
R PK+GHLKELH + E LL+G+ + + G + Y +++ AC F+ N D
Sbjct: 331 LRQPKYGHLKELHSVLMSMEKILLHGDYIDTNYGDNVTVTKYTLNATSAC--FINNRFDD 388
Query: 384 NDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQ-------SSTVEM--------- 427
D V ++ LPAWSVSILP+CK V FN+A ++ Q +S VE
Sbjct: 389 RDVNVTLDGTTHFLPAWSVSILPNCKTVAFNSAKIKTQTTVMVNKTSMVEQQTEHFKWSW 448
Query: 428 VPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTS 487
+PENL+P + +F K+ ++ I TT D +DYLWY TS
Sbjct: 449 MPENLRPFMTDE--------------------KGNFRKNELLEQIVTTTDQSDYLWYRTS 488
Query: 488 IIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGK 547
+ E GS VL + + GH L+AF N +L G + F+ K+P+ L GK
Sbjct: 489 L------EHKGEGSY-VLYVNTTGHELYAFVNGKLVGQQYSPNENFTFQLKSPVKLHDGK 541
Query: 548 NEIALLSMTVGLQNAGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQGEHLG 605
N I+LLS TVGL+N G +E + AGI VK+ + +DLS SW+YK GL GE+
Sbjct: 542 NYISLLSGTVGLRNYGGSFELLPAGIVGGPVKLIDSSGSAIDLSNNSWSYKAGLAGEYRK 601
Query: 606 IY--NPGYRNNINWVS-TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGE 662
IY PG + W S P N+P TWYK + P G++ + +D+ + KG+AW+NG
Sbjct: 602 IYLDKPGNK----WRSHNSTIPINRPFTWYKTTFQAPAGEDSVVVDLHGLNKGVAWVNGN 657
Query: 663 EIGRYWPRKSRKSSPHDECVQECDYRGKFNPD----KCITGCGEPSQRWYHIPRSWFKPS 718
+GRYWP P CDYRG F + KC+TGCGEPSQ+ YH+PRS+
Sbjct: 658 SLGRYWPSYVAADMPG---CHHCDYRGVFKAEVEAQKCLTGCGEPSQQLYHVPRSFLNKG 714
Query: 719 E-NILVIFEEKGGDPTKI 735
E N L++FEE GGDP+++
Sbjct: 715 EPNTLILFEEAGGDPSEV 732
>gi|357463559|ref|XP_003602061.1| Beta-galactosidase [Medicago truncatula]
gi|355491109|gb|AES72312.1| Beta-galactosidase [Medicago truncatula]
Length = 694
Score = 639 bits (1648), Expect = e-180, Method: Compositional matrix adjust.
Identities = 341/730 (46%), Positives = 441/730 (60%), Gaps = 60/730 (8%)
Query: 14 IFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNT 73
+ + S+ NVTYD SL+ING +++ S +IHYPRS P MWP L+ +AKEGG++
Sbjct: 12 LILTVSLCTVHGANVTYDRTSLVINGHHKILFSGSIHYPRSTPQMWPDLISKAKEGGLDV 71
Query: 74 IESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWL 133
I++YVFWN HE G+Y F GRF+LV FIK IQ +Y+ LRIGP++ +E YGG+P+WL
Sbjct: 72 IQTYVFWNLHEPQQGQYEFNGRFDLVGFIKEIQAQGLYVTLRIGPYIESECTYGGLPLWL 131
Query: 134 HYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYG 193
H +PG VFR D + FK+HMQ+F T IV+MMK LFASQGGPIIL+Q+ENEYG +S +
Sbjct: 132 HDVPGIVFRTDNDQFKFHMQRFTTKIVNMMKSANLFASQGGPIILSQIENEYGSIQSKFR 191
Query: 194 EGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIW 251
G Y WAA+MAV GVPW+MC+Q D PDPVIN CN C + P+SP+ P +W
Sbjct: 192 ANGLPYIHWAAQMAVGLQTGVPWMMCKQDDAPDPVINACNGMQCGRNFKGPNSPNKPSLW 251
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TENW + + FGG R + DIA++VA F K GS NYYMYHGGTNF R A IT
Sbjct: 252 TENWTSFLQAFGGAPYMRSASDIAYNVALFIAKKGSYVNYYMYHGGTNFDRLASAFIITA 311
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
YD EAP+DEYGL R PKWGHLKELH +IK C LL+G ++ SLGS Q+A V+ SS
Sbjct: 312 YYD-EAPLDEYGLVRQPKWGHLKELHASIKSCSQPLLDGTQTTFSLGSEQQAYVF-RSST 369
Query: 372 ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPEN 431
CAAFL N + D T+ F+N+SY LP S+SILP CK VVFNT V Q++ M P
Sbjct: 370 ECAAFLENSGPR-DVTIQFQNISYELPGKSISILPGCKNVVFNTGKVSIQNNVRAMKPR- 427
Query: 432 LQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVN 491
LQ + A W+V+ E + +D I+T KDT+DY+WYT
Sbjct: 428 LQFNSAE--------NWKVYTEAIPNFAHTSKRADTLLDQISTAKDTSDYMWYT------ 473
Query: 492 ENEEFLKNGSRP----VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGK 547
F N P VL I S+G LH+F N L GSA G+ + K ++L G
Sbjct: 474 ----FRFNNKSPNAKSVLSIYSQGDVLHSFINGVLTGSAHGSRNNTQVTMKKNVNLINGM 529
Query: 548 NEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
N I++LS TVGL N+G F E AG+ V++ G D S+YSW Y++GL GE L I+
Sbjct: 530 NNISILSATVGLPNSGAFLESRVAGLRKVEVQG-----RDFSSYSWGYQVGLLGEKLQIF 584
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+ + W S K PLTWY+ P G++P+ +++ MGKGLAW+NG+ IGRY
Sbjct: 585 TVSGSSKVQWKSFQSSTK--PLTWYQTTFHAPAGNDPVVVNLGSMGKGLAWVNGQGIGRY 642
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
W + PD G PSQ+WYHIPRS+ K + N+LVI EE
Sbjct: 643 WVSFHK-------------------PD------GTPSQQWYHIPRSFLKSTGNLLVILEE 677
Query: 728 KGGDPTKITF 737
+ G+P IT
Sbjct: 678 ETGNPLGITL 687
>gi|30697899|ref|NP_568978.2| beta-galactosidase 6 [Arabidopsis thaliana]
gi|75170268|sp|Q9FFN4.1|BGAL6_ARATH RecName: Full=Beta-galactosidase 6; Short=Lactase 6; Flags:
Precursor
gi|10177061|dbj|BAB10473.1| beta-galactosidase [Arabidopsis thaliana]
gi|332010416|gb|AED97799.1| beta-galactosidase 6 [Arabidopsis thaliana]
Length = 718
Score = 639 bits (1647), Expect = e-180, Method: Compositional matrix adjust.
Identities = 334/723 (46%), Positives = 441/723 (60%), Gaps = 45/723 (6%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A VTYD RSLII+G+R+L+ S +IHYPRS P MWP L+++ KEGG++ I++YVFWN HE
Sbjct: 29 AKGVTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPSLIKKTKEGGIDVIQTYVFWNLHE 88
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
G+Y F GR +LVKFIK I+ +Y+ LRIGPF+ AE+NYGG+P WL +PG V+R D
Sbjct: 89 PKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIEAEWNYGGLPFWLRDVPGMVYRTD 148
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK+HMQKF IVD+MK E L+ASQGGPIIL+Q+ENEY E + E G Y WA
Sbjct: 149 NEPFKFHMQKFTAKIVDLMKSEGLYASQGGPIILSQIENEYANVEGAFHEKGASYIKWAG 208
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIWTENWPGWFKTF 262
+MAV GVPWIMC+ D PDPVINTCN C + P+SP+ PK+WTE+W +F+ +
Sbjct: 209 QMAVGLKTGVPWIMCKSPDAPDPVINTCNGMKCGETFPGPNSPNKPKMWTEDWTSFFQVY 268
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
G R +EDIAF A F K GS NYYMYHGGTNFGRT+ FIT YD +AP+DEY
Sbjct: 269 GKEPYIRSAEDIAFHAALFVAKNGSYINYYMYHGGTNFGRTSSSYFITGYYD-QAPLDEY 327
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
GL R PK+GHLKELH AIK + LL G+++ LSLG Q+A V+ D++ C AFL N D
Sbjct: 328 GLLRQPKYGHLKELHAAIKSSANPLLQGKQTILSLGPMQQAYVFEDANNGCVAFLVNNDA 387
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
K + + FRN +Y L S+ IL +CK +++ TA V + +T P + PDN
Sbjct: 388 KASQ-IQFRNNAYSLSPKSIGILQNCKNLIYETAKVNVKMNTRVTTPVQV---FNVPDN- 442
Query: 443 SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
W +F+E + + ++H N TKD TDYLWYT+S ++ +
Sbjct: 443 -----WNLFRETIPAFPGTSLKTNALLEHTNLTKDKTDYLWYTSSFKLDS------PCTN 491
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
P + ES GH +H F N L GS G+ K + P+SL G+N I++LS VGL ++
Sbjct: 492 PSIYTESSGHVVHVFVNNALAGSGHGSRDIRVVKLQAPVSLINGQNNISILSGMVGLPDS 551
Query: 563 GPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINW-VSTM 621
G + E G+T V+I+ + +DLS W Y +GL GE + +Y N + W ++
Sbjct: 552 GAYMERRSYGLTKVQISCGGTKPIDLSRSQWGYSVGLLGEKVRLYQWKNLNRVKWSMNKA 611
Query: 622 EPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDEC 681
KN+PL WYK P GD P+GL M MGKG W+NGE IGRYW
Sbjct: 612 GLIKNRPLAWYKTTFDGPNGDGPVGLHMSSMGKGEIWVNGESIGRYWV------------ 659
Query: 682 VQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRK 741
+T G+PSQ YHIPR++ KPS N+LV+FEE+GGDP I+ +
Sbjct: 660 -------------SFLTPAGQPSQSIYHIPRAFLKPSGNLLVVFEEEGGDPLGISLNTIS 706
Query: 742 ISG 744
+ G
Sbjct: 707 VVG 709
>gi|449433325|ref|XP_004134448.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 7-like [Cucumis
sativus]
Length = 803
Score = 638 bits (1645), Expect = e-180, Method: Compositional matrix adjust.
Identities = 336/725 (46%), Positives = 445/725 (61%), Gaps = 37/725 (5%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+VTYD RSL ING R++IIS AIHYPRS PGMWP L+++AK GG+N IE+YVFWN HE
Sbjct: 15 SVTYDGRSLKINGERKIIISGAIHYPRSSPGMWPMLMKKAKNGGLNAIETYVFWNAHEPQ 74
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+Y F G +LV+FIK +Q+ R+Y ILRIGP+V AE+NYGG PVWLH +PG FR + +
Sbjct: 75 RGQYDFSGNNDLVQFIKAVQKERLYAILRIGPYVCAEWNYGGFPVWLHNLPGIKFRTNNQ 134
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
+K F L ++ K +F + +ENE+G E YG+ GK Y W A++
Sbjct: 135 VYKVTFX-FFFLTKNLKKINNMF-------LKNXIENEFGNVEGSYGQEGKEYVKWCAEL 186
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
A + N+ PWIMCQQ D P P++ CN CDQF P++ + PK+WTE+W GWFK +G RD
Sbjct: 187 AQSYNLSEPWIMCQQGDAPQPIV--CN---CDQFKPNNKNSPKMWTESWAGWFKGWGERD 241
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
P+R +ED+AF+VARFFQ GGS+HNYYMYHGGTNFGR+AGGP+ITTSYDY AP+DEYG
Sbjct: 242 PYRTAEDLAFAVARFFQYGGSLHNYYMYHGGTNFGRSAGGPYITTSYDYNAPLDEYGNMN 301
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDK 386
PKWGHLK+LH I+ E L G+ ++ G S A Y G + F N ++ +D+
Sbjct: 302 QPKWGHLKQLHELIRSMEKVLTYGDVKHIDTGHSTTATSYT-YKGKSSCFFGNPEN-SDR 359
Query: 387 TVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL 446
+ F+ Y +P WSV++LPDCK V+NTA V Q++ EMVP + + K L
Sbjct: 360 EITFQERKYTVPGWSVTVLPDCKTEVYNTAKVNTQTTIREMVPSLVGKHK-------KPL 412
Query: 447 KWQVFKE-IAGIWGEADFVKSG-----FVDHINTTKDTTDYLWYTTSIIVNENEEFLKNG 500
KWQ E I + E D S +D T D++DYLWY T +N N+ G
Sbjct: 413 KWQWRNEKIEHLTHEGDISGSAITANSLIDQKMVTNDSSDYLWYLTGFHLNGNDPLF--G 470
Query: 501 SRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPI-SLKAGKNEIALLSMTVGL 559
R L ++++GH LHAF N + G+ G F + + +L+ G N+IALLS TVGL
Sbjct: 471 KRVTLRVKTRGHILHAFVNNKHIGTQFGPYGKYSFTLEKKVRNLRHGFNQIALLSATVGL 530
Query: 560 QNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV 618
N G +YE V GI V++ DLST W YK+GL GE ++P ++ W+
Sbjct: 531 PNYGAYYENVEVGIYGPVELIADGKTIRDLSTNEWIYKVGLDGEKYEFFDPDHKFRKPWL 590
Query: 619 STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPH 678
S P NQ TWYK P G E + +D++ MGKG AW+NG+ IGRYWP +
Sbjct: 591 SN-NLPLNQNFTWYKTSFSTPKGREGVVVDLMGMGKGQAWVNGKSIGRYWP---SYLATE 646
Query: 679 DECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKP-SENILVIFEEKGGDPTKITF 737
+ C CDYRG + KC T CG+P+QRWYHIPRS+ EN L++FEE GG P I
Sbjct: 647 NGCSSSCDYRGAYYGSKCATNCGKPTQRWYHIPRSYMNDGKENTLILFEEFGGMPLNIEI 706
Query: 738 SIRKI 742
++
Sbjct: 707 KTTRV 711
>gi|6686884|emb|CAB64742.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 718
Score = 637 bits (1644), Expect = e-180, Method: Compositional matrix adjust.
Identities = 336/724 (46%), Positives = 444/724 (61%), Gaps = 47/724 (6%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A VTYD RSLII+G+R+L+ S +IHYPRS P MWP L+++ KEGG++ I++YVFWN HE
Sbjct: 29 AKGVTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPSLIKKTKEGGIDVIQTYVFWNLHE 88
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
G+Y F GR +LVKFIK I+ +Y+ LRIGPF+ AE+NYGG+P WL +PG V+R D
Sbjct: 89 PKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIEAEWNYGGLPFWLRDVPGMVYRTD 148
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK+HMQKF IVD+MK E L+ASQGGPIIL+Q+ENEY E + E G Y WA
Sbjct: 149 NEPFKFHMQKFTAKIVDLMKSEGLYASQGGPIILSQIENEYANVEGAFHEKGASYIKWAG 208
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIWTENWPGWFKTF 262
+MAV GVPWIMC+ D PDPVINTCN C + P+SP+ PK+WTE+W +F+ +
Sbjct: 209 QMAVGLKTGVPWIMCKSPDAPDPVINTCNGMKCGETFPGPNSPNKPKMWTEDWTSFFQVY 268
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
G R +EDIAF A F K GS NYYMYHGGTNFGRT+ FIT YD +AP+DEY
Sbjct: 269 GKEPYIRSAEDIAFHAALFVAKNGSYINYYMYHGGTNFGRTSSSYFITGYYD-QAPLDEY 327
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
GL R PK+GHLKELH AIK + LL G+++ LSLG Q+A V+ D++ C AFL N D
Sbjct: 328 GLLRQPKYGHLKELHAAIKSSANPLLQGKQTILSLGPMQQAYVFEDANNGCVAFLVNNDA 387
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
K + + FRN +Y L S+ IL +CK +++ TA V + +T P + PDN
Sbjct: 388 KASQ-IQFRNNAYSLSPKSIGILQNCKNLIYETAKVNVKMNTRVTTPVQV---FNVPDN- 442
Query: 443 SKGLKWQVFKEIAGIWGEADFVKS-GFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGS 501
W +F+E +A +K+ ++H N TKD TDYLWYT+S ++ +
Sbjct: 443 -----WNLFRETIPA-SQAHLLKTNALLEHTNLTKDKTDYLWYTSSFKLDS------PCT 490
Query: 502 RPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQN 561
P + ES GH +H F N L GS G+ K + P+SL G+N I++LS VGL +
Sbjct: 491 NPSIYTESSGHVVHVFVNNALAGSGHGSRDIRVVKLQAPVSLINGQNNISILSGMVGLPD 550
Query: 562 AGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINW-VST 620
+G + E G+T V+I+ + +DLS W Y +GL GE + +Y N + W ++
Sbjct: 551 SGAYMERRSYGLTKVQISCGGTKPIDLSRSQWGYSVGLLGEKVRLYQWKNLNRVKWSMNK 610
Query: 621 MEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDE 680
KN+PL WYK P GD P+GL M MGKG W+NGE IGRYW
Sbjct: 611 AGLIKNRPLAWYKTTFDGPNGDGPVGLHMSSMGKGEIWVNGESIGRYWV----------- 659
Query: 681 CVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
+T G+PSQ YHIPR++ KPS N+LV+FEE+GGDP I+ +
Sbjct: 660 --------------SFLTPAGQPSQSIYHIPRAFLKPSGNLLVVFEEEGGDPLGISLNTI 705
Query: 741 KISG 744
+ G
Sbjct: 706 SVVG 709
>gi|222635782|gb|EEE65914.1| hypothetical protein OsJ_21762 [Oryza sativa Japonica Group]
Length = 579
Score = 636 bits (1641), Expect = e-179, Method: Compositional matrix adjust.
Identities = 309/570 (54%), Positives = 382/570 (67%), Gaps = 14/570 (2%)
Query: 29 TYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPG 88
TYD RSL ING+R ++IS +IHYPRS P MWP L+Q+AK+GG++ I++YVFWNGHE G
Sbjct: 23 TYDHRSLTINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQG 82
Query: 89 KYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPF 148
+YYF R++LV+F+K+++QA +Y+ LRIGP+V AE+NYGG PVWL Y+PG FR D PF
Sbjct: 83 QYYFSDRYDLVRFVKLVKQAGLYVNLRIGPYVCAEWNYGGFPVWLKYVPGISFRTDNGPF 142
Query: 149 KYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAV 208
K MQ F+ IV MMK E LF QGGPIILAQVENEYG ES G G K Y WAAKMAV
Sbjct: 143 KAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGSGAKSYVDWAAKMAV 202
Query: 209 AQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPH 268
A N GVPWIMC+Q D PDPVINTCN FYCD FTP+S + P +WTE W GWF FGG P
Sbjct: 203 ATNAGVPWIMCKQDDAPDPVINTCNGFYCDDFTPNSKNKPSMWTEAWSGWFTAFGGTVPQ 262
Query: 269 RPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNP 328
RP ED+AF+VARF QKGGS NYYMYHGGTNF RTAGGPFI TSYDY+APIDEYGL R P
Sbjct: 263 RPVEDLAFAVARFIQKGGSFINYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQP 322
Query: 329 KWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTV 388
KWGHL LH AIK E AL+ G+ + ++G+ ++A V+ SSG CAAFL+N V
Sbjct: 323 KWGHLTNLHKAIKQAETALVAGDPTVQNIGNYEKAYVFRSSSGDCAAFLSNFHTSAAARV 382
Query: 389 VFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKW 448
F Y LPAWS+S+LPDC+ V+NTA V A SS +M P + G W
Sbjct: 383 AFNGRRYDLPAWSISVLPDCRTAVYNTATVTAASSPAKMNP-------------AGGFTW 429
Query: 449 QVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIE 508
Q + E E F K G V+ ++ T D +DYLWYTT + ++ E+FLK+G P L +
Sbjct: 430 QSYGEATNSLDETAFTKDGLVEQLSMTWDKSDYLWYTTYVNIDSGEQFLKSGQWPQLTVY 489
Query: 509 SKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEW 568
S GH++ F N + G+A G P Y + + G N+I++LS VGL N G YE
Sbjct: 490 SAGHSVQVFVNGQYFGNAYGGYDGPKLTYSGYVKMWQGSNKISILSSAVGLPNVGTHYET 549
Query: 569 VGAGITS-VKITGFNSGTLDLSTYSWTYKI 597
G+ V ++G N G DLS WTY++
Sbjct: 550 WNIGVLGPVTLSGLNEGKRDLSKQKWTYQV 579
>gi|30699255|ref|NP_177866.2| beta-galactosidase 16 [Arabidopsis thaliana]
gi|152013367|sp|Q8GX69.2|BGL16_ARATH RecName: Full=Beta-galactosidase 16; Short=Lactase 16; Flags:
Precursor
gi|332197854|gb|AEE35975.1| beta-galactosidase 16 [Arabidopsis thaliana]
Length = 815
Score = 635 bits (1637), Expect = e-179, Method: Compositional matrix adjust.
Identities = 333/722 (46%), Positives = 434/722 (60%), Gaps = 48/722 (6%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
NVTYD RSLII+G +++ S +IHY RS P MWP L+ +AK GG++ +++YVFWN HE
Sbjct: 22 VANVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVVDTYVFWNVHE 81
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
G++ F G ++VKFIK ++ +Y+ LRIGPF+ E++YGG+P WLH + G VFR D
Sbjct: 82 PQQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGIVFRTD 141
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFKYHM+++ +IV +MK E L+ASQGGPIIL+Q+ENEYG + + GK Y W A
Sbjct: 142 NEPFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVGRAFRQEGKSYVKWTA 201
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIWTENWPGWFKTF 262
K+AV + GVPW+MC+Q D PDP++N CN C + P+SP+ P IWTENW +++T+
Sbjct: 202 KLAVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSFYQTY 261
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
G R +EDIAF VA F K GS NYYMYHGGTNFGR A F+ TSY +AP+DEY
Sbjct: 262 GEEPLIRSAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVITSYYDQAPLDEY 320
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
GL R PKWGHLKELH A+KLCE LL+G ++ +SLG Q A V+ + CAA L N D
Sbjct: 321 GLLRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFGKKANLCAAILVN-QD 379
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
K + TV FRN SY L SVS+LPDCK V FNTA V AQ +T + + N
Sbjct: 380 KCESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYNT---------RTRKARQNL 430
Query: 443 SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
S W+ F E + E ++H+NTT+DT+DYLW TT +E G+
Sbjct: 431 SSPQMWEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQTTRFQQSE-------GAP 483
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
VL + GHALHAF N GS G F + +SL G N +ALLS+ VGL N+
Sbjct: 484 SVLKVNHLGHALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLALLSVMVGLPNS 543
Query: 563 GPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
G E G SVKI L + YSW Y++GL+GE +Y + W
Sbjct: 544 GAHLERRVVGSRSVKIWN-GRYQLYFNNYSWGYQVGLKGEKFHVYTEDGSAKVQW-KQYR 601
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
K+QPLTWYKA P G++P+ L++ MGKG AW+NG+ IGRYW V
Sbjct: 602 DSKSQPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIGRYW-------------V 648
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIF-EEKGGDPTKITFSIRK 741
Y+ G PSQ WYHIPRS+ KP+ N+LVI EE+ G+P IT
Sbjct: 649 SFHTYK------------GNPSQIWYHIPRSFLKPNSNLLVILEEEREGNPLGITIDTVS 696
Query: 742 IS 743
++
Sbjct: 697 VT 698
>gi|26451843|dbj|BAC43014.1| unknown protein [Arabidopsis thaliana]
gi|29029060|gb|AAO64909.1| At1g77410 [Arabidopsis thaliana]
Length = 820
Score = 634 bits (1636), Expect = e-179, Method: Compositional matrix adjust.
Identities = 333/722 (46%), Positives = 434/722 (60%), Gaps = 48/722 (6%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
NVTYD RSLII+G +++ S +IHY RS P MWP L+ +AK GG++ +++YVFWN HE
Sbjct: 22 VANVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVVDTYVFWNVHE 81
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
G++ F G ++VKFIK ++ +Y+ LRIGPF+ E++YGG+P WLH + G VFR D
Sbjct: 82 PQQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGIVFRTD 141
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFKYHM+++ +IV +MK E L+ASQGGPIIL+Q+ENEYG + + GK Y W A
Sbjct: 142 NEPFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVGRAFRQEGKSYVKWTA 201
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIWTENWPGWFKTF 262
K+AV + GVPW+MC+Q D PDP++N CN C + P+SP+ P IWTENW +++T+
Sbjct: 202 KLAVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSFYQTY 261
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
G R +EDIAF VA F K GS NYYMYHGGTNFGR A F+ TSY +AP+DEY
Sbjct: 262 GEEPLIRSAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVITSYYDQAPLDEY 320
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
GL R PKWGHLKELH A+KLCE LL+G ++ +SLG Q A V+ + CAA L N D
Sbjct: 321 GLLRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFGKKANLCAAILVN-QD 379
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
K + TV FRN SY L SVS+LPDCK V FNTA V AQ +T + + N
Sbjct: 380 KCESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYNT---------RTRKARQNL 430
Query: 443 SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
S W+ F E + E ++H+NTT+DT+DYLW TT +E G+
Sbjct: 431 SSPQMWEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQTTRFQQSE-------GAP 483
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
VL + GHALHAF N GS G F + +SL G N +ALLS+ VGL N+
Sbjct: 484 SVLKVNHLGHALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLALLSVMVGLPNS 543
Query: 563 GPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
G E G SVKI L + YSW Y++GL+GE +Y + W
Sbjct: 544 GAHLERRVVGSRSVKIWN-GRYQLYFNNYSWGYQVGLKGEKFHVYTEDGSAKVQW-KQYR 601
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
K+QPLTWYKA P G++P+ L++ MGKG AW+NG+ IGRYW V
Sbjct: 602 DSKSQPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIGRYW-------------V 648
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIF-EEKGGDPTKITFSIRK 741
Y+ G PSQ WYHIPRS+ KP+ N+LVI EE+ G+P IT
Sbjct: 649 SFHTYK------------GNPSQIWYHIPRSFLKPNSNLLVILEEEREGNPLGITIDTVS 696
Query: 742 IS 743
++
Sbjct: 697 VT 698
>gi|224103199|ref|XP_002312963.1| predicted protein [Populus trichocarpa]
gi|222849371|gb|EEE86918.1| predicted protein [Populus trichocarpa]
Length = 835
Score = 634 bits (1636), Expect = e-179, Method: Compositional matrix adjust.
Identities = 331/711 (46%), Positives = 434/711 (61%), Gaps = 43/711 (6%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD RSLIING+REL+ S +IHYPRS P MWP L+ +AK GG+N I++YVFWN HE
Sbjct: 31 VTYDERSLIINGKRELLFSGSIHYPRSTPDMWPELILKAKRGGLNVIQTYVFWNIHEPEQ 90
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GK+ F G ++LVKFIK I + M+ LR+GPF+ AE+N+GG+P WL IP +FR+D P
Sbjct: 91 GKFNFEGPYDLVKFIKTIGENGMFATLRLGPFIQAEWNHGGLPYWLREIPDIIFRSDNAP 150
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK+HM+KF+T I+DMMK EKLFASQGGPIIL+Q+ENEY + Y G Y WA MA
Sbjct: 151 FKHHMEKFVTKIIDMMKEEKLFASQGGPIILSQIENEYNTVQLAYKNLGVSYIQWAGNMA 210
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFT-PHSPSMPKIWTENWPGWFKTFGGR 265
+ N GVPW+MC+Q D P PVINTCN +C D FT P+ P+ P +WTENW F+ FG
Sbjct: 211 LGLNTGVPWVMCKQKDAPGPVINTCNGRHCGDTFTGPNKPNKPSLWTENWTAQFRVFGDP 270
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
R +ED AFSVAR+F K GS+ NYYMYHGGTNF RTA F+TT Y EAP+DEYGL
Sbjct: 271 PSQRSAEDTAFSVARWFSKNGSLVNYYMYHGGTNFDRTAAS-FVTTRYYDEAPLDEYGLQ 329
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD-SSGACAAFLANMDDKN 384
R PKWGHLK+LH A+ LC+ ALL G + L + EA Y + CAAFLA+ + K
Sbjct: 330 REPKWGHLKDLHRALNLCKKALLWGNPNVQKLSADVEARFYEQPGTKVCAAFLASNNSKE 389
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
+TV FR Y+LPA S+SILPDCK VV+NT V +Q ++ V +
Sbjct: 390 AETVKFRGQEYYLPARSISILPDCKTVVYNTMTVVSQHNSRNFV----------KSRKTN 439
Query: 445 GLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
L+W ++ E + D S + N TKD TDY+W+TT+I V+ + + PV
Sbjct: 440 KLEWNMYSETIPAQLQVD--SSLPKELYNLTKDKTDYVWFTTTINVDRRDMNERKRINPV 497
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
L + S GHA+ AF N E GSA G+ F ++ + LK G N + LL VGL ++G
Sbjct: 498 LRVASLGHAMVAFVNGEFIGSAHGSQIEKSFVLQHSVDLKPGINFVTLLGTLVGLPDSGA 557
Query: 565 FYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPP 624
+ E AG V I G N+GTLDL++ W +++GL GE ++ + W +
Sbjct: 558 YMEHRYAGPRGVSILGLNTGTLDLTSNGWGHQVGLSGETAKLFTKEGGGKVTWTKVQK-- 615
Query: 625 KNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
P+TWYK P G P+ + M M KG+ W+NG+ IGRYW
Sbjct: 616 AGPPVTWYKTHFDAPEGKSPVAVRMTGMNKGMIWINGKSIGRYWM--------------- 660
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
++ GEP+Q YHIPRS+ KP++N++VIFEE+ +P KI
Sbjct: 661 ----------TYVSPLGEPTQSEYHIPRSYLKPTDNLMVIFEEEEANPEKI 701
>gi|356541034|ref|XP_003538988.1| PREDICTED: beta-galactosidase 13-like, partial [Glycine max]
Length = 806
Score = 632 bits (1631), Expect = e-178, Method: Compositional matrix adjust.
Identities = 316/711 (44%), Positives = 443/711 (62%), Gaps = 41/711 (5%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD RSLIINGRREL+ S +IHYPRS P W G++ +A++GG+N +++YVFWN HE
Sbjct: 9 VTYDGRSLIINGRRELLFSGSIHYPRSTPEEWAGILDKARQGGINVVQTYVFWNIHETEK 68
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GKY +++ +KFIK+IQ+ MY+ LR+GPF+ AE+N+GG+P WL +P +FR++ EP
Sbjct: 69 GKYSIEPQYDYIKFIKLIQKKGMYVTLRVGPFIQAEWNHGGLPYWLREVPEIIFRSNNEP 128
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK HM+K+++ ++ +K LFA QGGPIILAQ+ENEY + + + E G Y WAAKMA
Sbjct: 129 FKKHMKKYVSTVIKTVKDANLFAPQGGPIILAQIENEYNHIQRAFREEGDNYVQWAAKMA 188
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFT-PHSPSMPKIWTENWPGWFKTFGGR 265
V+ +IGVPWIMC+Q D PDPVIN CN +C D F+ P+ P P IWTENW ++ FG
Sbjct: 189 VSLDIGVPWIMCKQTDAPDPVINACNGRHCGDTFSGPNKPYKPAIWTENWTAQYRVFGDP 248
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
R +EDIAFSVARFF K GS+ NYYMYHGGTNFGRT+ F TT Y EAP+DEYG+
Sbjct: 249 PSQRSAEDIAFSVARFFSKNGSLVNYYMYHGGTNFGRTSSA-FTTTRYYDEAPLDEYGMQ 307
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD-SSGACAAFLANMDDKN 384
R PKW HL+++H A+ LC+ AL NG + + E V+ S CAAF+ N K
Sbjct: 308 REPKWSHLRDVHRALSLCKRALFNGASTVTKMSQHHEVIVFEKPGSNLCAAFITNNHTKV 367
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
T+ FR Y++P S+SILPDCK VVFNT + +Q S+ N + S A+ D+
Sbjct: 368 PTTISFRGTDYYMPPRSISILPDCKTVVFNTQCIASQHSS-----RNFKRSMAANDH--- 419
Query: 445 GLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
KW+V+ E + + ++ + KDT+DY WYTTS+ + + KN +
Sbjct: 420 --KWEVYSETIPTTKQIPTHEKNPIELYSLLKDTSDYAWYTTSVELRPEDLPKKNDIPTI 477
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
L I S GH+L AF N E GS G+ F+++ P++LK G N+IA+L+ TVGL ++G
Sbjct: 478 LRIMSLGHSLLAFVNGEFIGSNHGSHEEKGFEFQKPVTLKVGVNQIAILASTVGLPDSGA 537
Query: 565 FYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPP 624
+ E AG S+ I G NSG +DL++ W +++G++GE LGI+ + W P
Sbjct: 538 YMEHRFAGPKSIFILGLNSGKMDLTSNGWGHEVGIKGEKLGIFTEEGSKKVQWKEAKGP- 596
Query: 625 KNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
++WYK P G +P+ + M MGKG+ W+NG+ IGR+W
Sbjct: 597 -GPAVSWYKTNFATPEGTDPVAIRMTGMGKGMVWINGKSIGRHW---------------- 639
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
Y ++ G+P+Q YHIPR++F P +N+LV+FEE+ +P K+
Sbjct: 640 MSY---------LSPLGQPTQSEYHIPRTYFNPKDNLLVVFEEEIANPEKV 681
>gi|357449773|ref|XP_003595163.1| Beta-galactosidase [Medicago truncatula]
gi|355484211|gb|AES65414.1| Beta-galactosidase [Medicago truncatula]
Length = 607
Score = 632 bits (1631), Expect = e-178, Method: Compositional matrix adjust.
Identities = 315/603 (52%), Positives = 397/603 (65%), Gaps = 24/603 (3%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
L FF +T +VTYD ++++ING+R ++IS +IHYPRS P MWP L+Q+AK+GGV
Sbjct: 16 FLCFFVCYVT----ASVTYDHKAIVINGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGV 71
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ IE+YVFWNGHE S GKYYF RF+LVKFIK++QQA +Y+ LRIGP+V AE+N+GG PV
Sbjct: 72 DVIETYVFWNGHEPSQGKYYFEDRFDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPV 131
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL Y+PG FR D EPFK MQKF T IV +MK E LF SQGGPIIL+Q+ENEYG E
Sbjct: 132 WLKYVPGVAFRTDNEPFKAAMQKFTTKIVSIMKSENLFQSQGGPIILSQIENEYGPVEWE 191
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
G GK Y W ++MAV N GVPW+MC+Q D PDP+I+TCN +YC+ F+P+ PK+W
Sbjct: 192 IGAPGKSYTKWFSQMAVGLNTGVPWVMCKQEDAPDPIIDTCNGYYCENFSPNKNYKPKMW 251
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TENW GW+ FG P+RP+ED+AFSVARF Q GS NYYMYHGGTNFGRT+ G FI T
Sbjct: 252 TENWTGWYTDFGTAVPYRPAEDLAFSVARFVQNRGSYVNYYMYHGGTNFGRTSSGLFIAT 311
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
SYDY+APIDEYGL PKWGHL++LH AIK CE AL++ + + G + E +Y S G
Sbjct: 312 SYDYDAPIDEYGLISEPKWGHLRDLHKAIKQCESALVSVDPTVSWPGKNLEVHLYKTSFG 371
Query: 372 ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPEN 431
ACAAFLAN D + V F N Y LP WS+SILPDCK VFNTA VRA M P N
Sbjct: 372 ACAAFLANYDTGSWAKVAFGNGHYDLPPWSISILPDCKTEVFNTAKVRAPRVHRSMTPAN 431
Query: 432 LQPSEASPDNGSKGLKWQVFKEIAGIWGEA-DFVKSGFVDHINTTKDTTDYLWYTTSIIV 490
WQ + E GE+ + +G ++ ++ T D +DYLWY T + +
Sbjct: 432 ------------SAFNWQSYNEQPAFSGESGSWTANGLLEQLSQTWDKSDYLWYMTDVNI 479
Query: 491 NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
+ NE F+KNG PVL S GH LH F N + G+A G+ +P + N + L+ G N+I
Sbjct: 480 SPNEGFIKNGQNPVLTAMSAGHVLHVFINGQFWGTAYGSLDNPKLTFSNSVKLRVGNNKI 539
Query: 551 ALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYK------IGLQGEH 603
+LLS+ VGL N G YE G+ V + G N GT DLS W+YK IG+ +H
Sbjct: 540 SLLSVAVGLSNVGVHYEKWNVGVLGPVTLKGLNEGTRDLSKQKWSYKVCYHLYIGVLRKH 599
Query: 604 LGI 606
I
Sbjct: 600 FNI 602
>gi|297793965|ref|XP_002864867.1| beta-galactosidase 6 [Arabidopsis lyrata subsp. lyrata]
gi|297310702|gb|EFH41126.1| beta-galactosidase 6 [Arabidopsis lyrata subsp. lyrata]
Length = 716
Score = 631 bits (1628), Expect = e-178, Method: Compositional matrix adjust.
Identities = 332/723 (45%), Positives = 438/723 (60%), Gaps = 45/723 (6%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A VTYD RSLII+G+R+L+ S +IHYPRS P MWP L+++ KEGG++ I++YVFWN HE
Sbjct: 27 AKGVTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPSLIKKTKEGGIDVIQTYVFWNLHE 86
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
G+Y F GR +LVKFIK I+ +Y+ LRIGPF+ AE+NYGG+P WL +PG V+R D
Sbjct: 87 PKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIEAEWNYGGLPFWLRDVPGMVYRTD 146
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK+HMQKF T IV++MK E L+ASQGGPIIL+Q+ENEY E+ + E G Y WA
Sbjct: 147 NEPFKFHMQKFTTKIVNLMKSEGLYASQGGPIILSQIENEYANVEAAFHEKGASYIKWAG 206
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIWTENWPGWFKTF 262
+MAV GVPWIMC+ D PDPVINTCN C + P+SP+ PK+WTE+W +F+ +
Sbjct: 207 QMAVGLKTGVPWIMCKSPDAPDPVINTCNGMRCGETFPGPNSPNKPKMWTEDWTSFFQVY 266
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
G R +EDIAF F K GS NYYMYHGGTNFGRT+ FIT YD +AP+DEY
Sbjct: 267 GTEPYIRSAEDIAFHAVLFIAKNGSYINYYMYHGGTNFGRTSSSYFITGYYD-QAPLDEY 325
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
GL R PK+GHLKELH AIK + LL G+++ LSLG Q+A V+ D+S C AFL N D
Sbjct: 326 GLLRQPKYGHLKELHAAIKSSANPLLQGKQTILSLGPMQQAYVFEDASSGCVAFLVNNDA 385
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
K + + FR SY L S+ IL +CK +++ TA V + + P + P+
Sbjct: 386 KVSQ-IQFRKSSYSLSPKSIGILQNCKNLIYETAKVNVEKNKRVTTPVQV---FNVPE-- 439
Query: 443 SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
KW+ F+E + + ++H N TKD TDYLWYT+S + +
Sbjct: 440 ----KWEGFRETIPAFSGTSLKANALLEHTNLTKDKTDYLWYTSSFKPDS------PCTN 489
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
P + IES GH +H F N L GS G+ K + P SL G+N I++LS VGL ++
Sbjct: 490 PSIYIESSGHVVHVFVNNALAGSGHGSRDIKVVKLQVPASLTNGQNSISILSGMVGLPDS 549
Query: 563 GPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINW-VSTM 621
G + E G+T V+I+ + +DLS W Y +GL GE + + N + W ++
Sbjct: 550 GAYMERKSYGLTKVQISCGGTKPIDLSGSQWGYSVGLLGEKVRLQQWRNLNRVKWSMNNA 609
Query: 622 EPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDEC 681
KN+PL WYK + P GD P+GL+M MGKG W+NGE IGRYW
Sbjct: 610 GLIKNRPLIWYKTIFDGPNGDGPVGLNMSSMGKGEIWVNGESIGRYWV------------ 657
Query: 682 VQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRK 741
+T G PSQ YHIPR + KPS N+LV+FEE+GGDP I+ +
Sbjct: 658 -------------SFLTPSGHPSQSIYHIPREFLKPSGNLLVVFEEEGGDPLGISLNTIS 704
Query: 742 ISG 744
+ G
Sbjct: 705 VIG 707
>gi|224080622|ref|XP_002306183.1| predicted protein [Populus trichocarpa]
gi|222849147|gb|EEE86694.1| predicted protein [Populus trichocarpa]
Length = 838
Score = 631 bits (1628), Expect = e-178, Method: Compositional matrix adjust.
Identities = 331/713 (46%), Positives = 438/713 (61%), Gaps = 46/713 (6%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD RSLIING+REL+ S +IHYPRS P MWP L+Q+AK GG+N I++YVFWN HE
Sbjct: 31 VTYDGRSLIINGKRELLFSGSIHYPRSTPEMWPELIQKAKRGGLNVIQTYVFWNIHEPEQ 90
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GK+ F G ++LVKFIK I + M +R+GPF+ AE+N+GG+P WL IP +FR+D P
Sbjct: 91 GKFNFEGSYDLVKFIKTIGENGMSATIRLGPFIQAEWNHGGLPYWLREIPDIIFRSDNAP 150
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK HM++F+T+I++ +K EKLFASQGGPIILAQ+ENEY + Y G Y WA MA
Sbjct: 151 FKLHMERFVTMIINKLKEEKLFASQGGPIILAQIENEYNTVQLAYRNLGVSYVQWAGNMA 210
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFT-PHSPSMPKIWTENWPGWFKTFGGR 265
+ GVPW+MC+Q D P PVINTCN +C D FT P+SP P +WTENW F+ FG
Sbjct: 211 LGLKTGVPWVMCKQKDAPGPVINTCNGRHCGDTFTGPNSPDKPSLWTENWTAQFRVFGDP 270
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
R +ED AFSVAR+F K GS+ NYYMYHGGTNF RTA F+TT Y EAP+DEYGL
Sbjct: 271 PSQRSAEDTAFSVARWFSKNGSLVNYYMYHGGTNFDRTAAS-FVTTRYYDEAPLDEYGLQ 329
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD-SSGACAAFLANMDDKN 384
R PKWGHLK+LH A+ LC+ ALL G + L + EA + + CAAFLAN + K+
Sbjct: 330 REPKWGHLKDLHRALNLCKKALLWGTPNVQRLSADVEARFFEQPRTNDCAAFLANNNTKD 389
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
+TV FR Y+LPA S+SILPDCK VV+NT V +Q ++ V ++ +G
Sbjct: 390 PETVTFRGKKYYLPAKSISILPDCKTVVYNTMTVVSQHNSRNFV-------KSRKTDGK- 441
Query: 445 GLKWQVFKEI--AGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
L+W++F E + + ++ + + N TKD TDY W+TT+I V+ N+ +
Sbjct: 442 -LEWKMFSETIPSNLLVDSRIPRELY----NLTKDKTDYAWFTTTINVDRNDLSARKDIN 496
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
PVL + S GHA+ AF N E GSA G+ F ++ + LK G N + LL VGL ++
Sbjct: 497 PVLRVASLGHAMVAFINGEFIGSAHGSQIEKSFVLQHSVKLKPGINFVTLLGSLVGLPDS 556
Query: 563 GPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
G + E AG V I G N+GTLDLS+ W +++ L GE ++ + W +
Sbjct: 557 GAYMEHRYAGPRGVSILGLNTGTLDLSSNGWGHQVALSGETAKVFTKEGGRKVTWTKVNK 616
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
P+TWYK P G P+ + M M KG+ W+NG+ IGRYW
Sbjct: 617 --DGPPVTWYKTRFDAPEGKSPVAVRMTGMKKGMIWINGKSIGRYW-------------- 660
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
+Y I+ GEP+Q YHIPRS+ KP+ N++VI EE+G P KI
Sbjct: 661 --MNY---------ISPLGEPTQSEYHIPRSYLKPTNNLMVILEEEGASPEKI 702
>gi|125536446|gb|EAY82934.1| hypothetical protein OsI_38151 [Oryza sativa Indica Group]
Length = 705
Score = 629 bits (1623), Expect = e-177, Method: Compositional matrix adjust.
Identities = 321/640 (50%), Positives = 412/640 (64%), Gaps = 29/640 (4%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NVTYD R+++I G+R +++SA +HYPR+ P MWP L+ + KEGG + IE+YVFWNGHE +
Sbjct: 63 NVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKFKEGGADVIETYVFWNGHEPA 122
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+YYF RF+LVKF K++ +++ LRIGP+ AE+N+GG PVWL IPG FR D E
Sbjct: 123 KGQYYFEERFDLVKFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTDNE 182
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK MQ F+T IV +MK EKL++ QGGPIIL Q+ENEYG + YG+ GKRY WAA+M
Sbjct: 183 PFKAEMQTFVTKIVTLMKEEKLYSWQGGPIILQQIENEYGNIQGNYGQAGKRYMQWAAQM 242
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
A+ + G+PW+MC+Q D P+ +I+TCN+FYCD F P+S + P IWTE+W GW+ +GG
Sbjct: 243 AIGLDTGIPWVMCRQTDAPEEIIDTCNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGGAL 302
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
PHRP+ED AF+VARF+Q+GGS+ NYYMY GGTNF RTAGGP TSYDY+APIDEYG+ R
Sbjct: 303 PHRPAEDSAFAVARFYQRGGSLQNYYMYFGGTNFARTAGGPLQITSYDYDAPIDEYGILR 362
Query: 327 NPKWGHLKELHGAIKLCEHALLN--GERSNLSLGSSQEADVY-----------ADSSGAC 373
PKWGHLK+LH AIKLCE AL+ G + LGS QEA VY A ++ C
Sbjct: 363 QPKWGHLKDLHTAIKLCEPALIAVVGSPQYIKLGSMQEAHVYSTGEVHTNGSMAGNAQIC 422
Query: 374 AAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSS--TVE----M 427
+AFLAN+D+ +V SY LP WSVSILPDC+ V FNTA + AQ+S TVE
Sbjct: 423 SAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCENVAFNTARIGAQTSVFTVESGSPS 482
Query: 428 VPENLQPSEASPDNGSKGLK--WQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYT 485
+PS S +G L W KE G WG +F G ++H+N TKD +DYLWYT
Sbjct: 483 RSSRHKPSILSLTSGGPYLSSTWWTSKETIGTWGGNNFAVQGILEHLNVTKDISDYLWYT 542
Query: 486 TSIIVNENEEFL--KNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISL 543
T + +++ + G P L I+ F N +L GS G+ K PI L
Sbjct: 543 TRVNISDADVAFWSSKGVLPSLTIDKIRDVARVFVNGKLAGSQVGHWV----SLKQPIQL 598
Query: 544 KAGKNEIALLSMTVGLQNAGPFYEWVGAGIT-SVKITGFNSGTLDLSTYSWTYKIGLQGE 602
G NE+ LLS VGLQN G F E GAG V +TG + G +DL+ WTY++GL+GE
Sbjct: 599 VEGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVTLTGLSDGDVDLTNSLWTYQVGLKGE 658
Query: 603 HLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGD 642
IY P + W S M+ QP TWYK + Q GD
Sbjct: 659 FSMIYAPEKQGCAGW-SRMQKDSVQPFTWYKNICNQSVGD 697
>gi|356509519|ref|XP_003523495.1| PREDICTED: beta-galactosidase 13-like [Glycine max]
Length = 844
Score = 629 bits (1621), Expect = e-177, Method: Compositional matrix adjust.
Identities = 317/714 (44%), Positives = 434/714 (60%), Gaps = 41/714 (5%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A NVTYD +SL INGRRE++ S ++HY RS P MWP ++ +A+ GG+N I++YVFWN HE
Sbjct: 43 ARNVTYDGKSLFINGRREILFSGSVHYTRSTPDMWPDILDKARRGGLNVIQTYVFWNAHE 102
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
PGK+ F G ++LVKFI+++Q M++ LR+GPF+ AE+N+GG+P WL +PG +FR+D
Sbjct: 103 PEPGKFNFQGNYDLVKFIRLVQAKGMFVTLRVGPFIQAEWNHGGLPYWLREVPGIIFRSD 162
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EP+K+HM+ F++ I+ MMK EKLFA QGGPIILAQ+ENEY + + Y E G Y WAA
Sbjct: 163 NEPYKFHMKAFVSKIIQMMKDEKLFAPQGGPIILAQIENEYNHIQLAYEEKGDSYVQWAA 222
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFT-PHSPSMPKIWTENWPGWFKTF 262
MAVA +IGVPW+MC+Q D PDPVIN CN +C D F P+ P P IWTENW ++
Sbjct: 223 NMAVATDIGVPWLMCKQRDAPDPVINACNGRHCGDTFAGPNKPYKPAIWTENWTAQYRVH 282
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
G R +EDIAFSVARFF K G++ NYYMYHGGTNFGRT+ F TT Y EAP+DEY
Sbjct: 283 GDPPSQRSAEDIAFSVARFFSKNGNLVNYYMYHGGTNFGRTS-SVFSTTRYYDEAPLDEY 341
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD-SSGACAAFLANMD 381
GLPR PKW HL+++H A+ LC A+L G S L E + + CAAF+ N
Sbjct: 342 GLPREPKWSHLRDVHKALLLCRRAILGGVPSVQKLNHFHEVRTFERVGTNMCAAFITNNH 401
Query: 382 DKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDN 441
T+ FR +Y LP S+SILPDCK VVFNT + +Q N + E SP
Sbjct: 402 TMEPATINFRGTNYFLPPHSISILPDCKTVVFNTQQIVSQ--------HNSRNYERSP-- 451
Query: 442 GSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGS 501
+ W++F E + + + KDTTDY WYTTS +++ + +K G
Sbjct: 452 AANNFHWEMFNEAIPTAKKMPINLPVPAELYSLLKDTTDYAWYTTSFELSQEDMSMKPGV 511
Query: 502 RPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQN 561
PVL + S GH++ AF N ++ G+A G F+++ P+ L+ G N I+LLS TVGL +
Sbjct: 512 LPVLRVMSLGHSMVAFVNGDIVGTAHGTHEEKSFEFQTPVLLRVGTNYISLLSSTVGLPD 571
Query: 562 AGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTM 621
+G + E AG S+ I G N GTLDL+ W +++GL+GE +++ ++ W
Sbjct: 572 SGAYMEHRYAGPKSINILGLNRGTLDLTRNGWGHRVGLKGEGKKVFSEEGSTSVKWKPLG 631
Query: 622 EPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDEC 681
P+ L+WY+ P G P+ + M M KG+ W+NG IGRYW
Sbjct: 632 AVPR--ALSWYRTRFGTPEGTGPVAIRMSGMAKGMVWVNGNNIGRYWM------------ 677
Query: 682 VQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
++ G+P+Q YHIPRS+ P +N+LVIFEE+ P ++
Sbjct: 678 -------------SYLSPLGKPTQSEYHIPRSFLNPQDNLLVIFEEEARVPAQV 718
>gi|224135691|ref|XP_002327281.1| predicted protein [Populus trichocarpa]
gi|222835651|gb|EEE74086.1| predicted protein [Populus trichocarpa]
Length = 788
Score = 627 bits (1616), Expect = e-177, Method: Compositional matrix adjust.
Identities = 332/722 (45%), Positives = 430/722 (59%), Gaps = 79/722 (10%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
G+VTYD RSLII+G+R+++ S +IHYPRS P MWP L+ +AKEGG++ IE+YVFWN HE
Sbjct: 24 GDVTYDGRSLIIDGQRKIVFSGSIHYPRSTPEMWPSLIAKAKEGGLDAIETYVFWNVHEP 83
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
PG Y F G ++V+FIK +Q +Y LRIGPF+ +E++YGG+P WLH IPG VFR+D
Sbjct: 84 QPGHYDFSGGHDIVRFIKEVQAQGLYACLRIGPFIQSEWSYGGLPFWLHDIPGIVFRSDN 143
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
EPFK +MQ F +V MM+ E L+ASQGGPIIL+Q+ENEYG + YG+ G Y WAA+
Sbjct: 144 EPFKVYMQNFTAKVVSMMQSENLYASQGGPIILSQIENEYGTVQKAYGQEGLAYVQWAAQ 203
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQ--FTPHSPSMPKIWTENWPGWFKTFG 263
MA GVPW+MC+Q + P VIN+CN C Q P+SP+ P IWTENW
Sbjct: 204 MAEGLQTGVPWVMCKQNNAPGHVINSCNGMKCGQTFVGPNSPNKPSIWTENW-------- 255
Query: 264 GRDPHRPSEDIAFSVARFF-QKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
+ +EDIAF V F K GS NYYMYHGGTNFGRTA F+TTSY +AP+DEY
Sbjct: 256 ---TTQSAEDIAFHVTLFIAAKKGSFVNYYMYHGGTNFGRTASA-FVTTSYYDQAPLDEY 311
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
GL PKWGHLKELH AIKLC LL+G + NL LG Q+A ++ SG CAAFL N D
Sbjct: 312 GLTTQPKWGHLKELHAAIKLCSTPLLSGVQVNLYLGPQQQAYIFNAVSGECAAFLINNDS 371
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEM-VPENLQPSEASPDN 441
N +V FRN SY LP S+SILPDCK NV Q +T M E L ++
Sbjct: 372 SNAASVPFRNASYDLPPMSISILPDCK-------NVSTQYTTRTMGRGEVLDAADV---- 420
Query: 442 GSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGS 501
WQ F E + ++ +NTTKD++DYLWYT + + +
Sbjct: 421 ------WQEFTEAIPNFDSTSTRSETLLEQMNTTKDSSDYLWYTFRF------QHESSDT 468
Query: 502 RPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQN 561
+ +L + S GHALHAF N + GS G+ +P FK++ +SL G N ++LLS+ VG+ +
Sbjct: 469 QAILDVSSLGHALHAFVNGQAVGSVQGSRKNPRFKFETSVSLSKGINNVSLLSVMVGMPD 528
Query: 562 AGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTM 621
+G F E AG+ +V I D + YSW Y+IGLQGE L IY + + W
Sbjct: 529 SGAFLENRAAGLRTVMIRDKQDNN-DFTNYSWGYQIGLQGETLQIYTEQGSSQVQWKKFS 587
Query: 622 EPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDEC 681
PLTWYK V PPGD P+GL++ MGKG AW+NG+ IGRYWP
Sbjct: 588 N--AGNPLTWYKTQVDAPPGDVPVGLNLASMGKGEAWVNGQSIGRYWPS----------- 634
Query: 682 VQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRK 741
YH+PRS+ KP+ N+LV+ EE+GG+P +++
Sbjct: 635 --------------------------YHVPRSFLKPTGNLLVLQEEEGGNPLQVSLDTVT 668
Query: 742 IS 743
IS
Sbjct: 669 IS 670
>gi|413926110|gb|AFW66042.1| hypothetical protein ZEAMMB73_706783 [Zea mays]
Length = 700
Score = 625 bits (1613), Expect = e-176, Method: Compositional matrix adjust.
Identities = 312/658 (47%), Positives = 409/658 (62%), Gaps = 64/658 (9%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD RSL+INGRR ++IS +IHYPRS P MWPGL+Q+AK+GG++ +++YVFWNGHE +
Sbjct: 40 VSYDHRSLVINGRRRILISGSIHYPRSAPEMWPGLIQKAKDGGLDVVQTYVFWNGHEPAQ 99
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G+YYF R++LV+F+K+++QA +Y+ LR+GP+V AE+N+GG PVWL Y+PG FR D P
Sbjct: 100 GQYYFADRYDLVRFVKLVRQAGLYVHLRVGPYVCAEWNFGGFPVWLKYVPGIRFRTDNGP 159
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK MQKF+ IV MMK E LF QGGPII+AQVENE+G ES G GGK YA WAA+MA
Sbjct: 160 FKAAMQKFVEKIVSMMKSEGLFEWQGGPIIMAQVENEFGPMESVVGSGGKPYAHWAAQMA 219
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
V N GVPW+MC+Q D PDPVINTCN FYCD FTP++ P +WTE W GWF FGG P
Sbjct: 220 VGTNAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNNKHKPTMWTEAWTGWFTKFGGAAP 279
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY----- 322
HRP ED+AF+VARF QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+APIDE+
Sbjct: 280 HRPVEDLAFAVARFVQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGMQWL 339
Query: 323 --------------------------------------------GLPRNPKWGHLKELHG 338
GL R PKWGHL+ +H
Sbjct: 340 LPSLINLNSHRLPRDICRKSSQCGFYLSVVHTWNFWGGGWVYIAGLLRQPKWGHLRNMHR 399
Query: 339 AIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLP 398
AIK E AL++G+ + S+G+ ++A V+ +GACAAFL+N K+ + F Y LP
Sbjct: 400 AIKQAEPALVSGDPTIRSIGNYEKAYVFKSKNGACAAFLSNYHVKSAVRIRFDGRHYDLP 459
Query: 399 AWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIW 458
AWS+SILPDCK VFNTA V+ + +M P + WQ + E
Sbjct: 460 AWSISILPDCKTAVFNTATVKEPTLLPKMSPVMHR------------FAWQSYSEDTNSL 507
Query: 459 GEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFA 518
++ F + G ++ ++ T D +DYLWYTT + + NE FLK+G P L + S GH++ F
Sbjct: 508 DDSAFARDGLIEQLSLTWDKSDYLWYTTHVNIGSNERFLKSGQWPQLSVYSAGHSMQVFV 567
Query: 519 NQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VK 577
N GS G +P + + + G N+I++LS VGL N G +E G+ V
Sbjct: 568 NGRSYGSVYGGYDNPKLTFSGYVKMWQGSNKISILSSAVGLPNNGDHFELWNVGVLGPVT 627
Query: 578 ITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAV 635
++G N G DLS W Y++GL+GE LG++ + + W QPLTW+K +
Sbjct: 628 LSGLNEGKRDLSHQRWIYQVGLKGESLGLHTVTGSSAVEWAG--PGGGTQPLTWHKVL 683
>gi|330689960|gb|AEC33272.1| beta-galactosidase [Ziziphus jujuba]
Length = 730
Score = 625 bits (1613), Expect = e-176, Method: Compositional matrix adjust.
Identities = 306/620 (49%), Positives = 396/620 (63%), Gaps = 20/620 (3%)
Query: 127 GGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG 186
GG PVWL Y+PG FR D PFK MQ F IV M+K E LFASQGGPIIL+Q+ENEYG
Sbjct: 1 GGFPVWLKYVPGISFRTDNGPFKTAMQGFTQKIVQMLKSENLFASQGGPIILSQIENEYG 60
Query: 187 YYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPS 246
G G+ Y WAAKMAV N GVPW+MC++ D PDPVIN CN FYCD F+P+ P
Sbjct: 61 PESKALGAAGRSYINWAAKMAVGLNTGVPWVMCKEDDAPDPVINACNGFYCDGFSPNKPY 120
Query: 247 MPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
P +WTE W GWF FGG RP +D+AF+VARF QKGGS NYYMYHGGTNFGRTAGG
Sbjct: 121 KPILWTEAWSGWFTEFGGTVHQRPVQDLAFAVARFIQKGGSYFNYYMYHGGTNFGRTAGG 180
Query: 307 PFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVY 366
PF+TTSYDY+APIDEYGL R PK+ HLKELH AIKL E AL++ + SLG+ ++A +Y
Sbjct: 181 PFVTTSYDYDAPIDEYGLTREPKYSHLKELHKAIKLSEDALVSAGPTITSLGTYEQAYIY 240
Query: 367 ADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVE 426
CAAFLAN + K+ V+F N Y+LP WS+SILPDC+ V +NTA V Q+S V
Sbjct: 241 NSGPRKCAAFLANYNSKSAARVLFNNRHYNLPPWSISILPDCRNVAYNTALVGVQTSHVH 300
Query: 427 MVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGE-ADFVKSGFVDHINTTKDTTDYLWYT 485
M+P G+ L W+ + E+ E A G ++ IN T+DT+DYLWY
Sbjct: 301 MLP-----------TGTSLLSWETYDEVISSLDERARMTAVGLLEQINVTRDTSDYLWYM 349
Query: 486 TSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKA 545
TS+ ++ +E FL+ G +P L ++S GHA+ F N + GSA G H F + P++L+A
Sbjct: 350 TSVDISSSESFLRGGQKPTLNVQSAGHAVRVFINGQFSGSAFGTREHRQFTFTGPVNLRA 409
Query: 546 GKNEIALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHL 604
G N+I+LLS+ VGL N G YE W + V + G ++G DL+ W+Y++GL+GE +
Sbjct: 410 GSNKISLLSIAVGLPNVGFHYELWETGVLGPVFLNGLDNGKRDLTWQKWSYQVGLKGEAM 469
Query: 605 GIYNPGYRNNINWV-STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEE 663
+ P ++ +WV ++ QPLTWYKA P G+EP+ LD+ MGKG +NG+
Sbjct: 470 NLVTPEGASSADWVRGSLAARSVQPLTWYKAYFNAPNGNEPLALDLRSMGKGQVRINGQS 529
Query: 664 IGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILV 723
IGRYW ++ +C + C Y G P+QRWYH+PRSW KP +N+LV
Sbjct: 530 IGRYWTAYAK-----GDC-EACSYTGHSGRQNVNLVVASPTQRWYHVPRSWLKPKQNLLV 583
Query: 724 IFEEKGGDPTKITFSIRKIS 743
IFEE GGD +KI R ++
Sbjct: 584 IFEELGGDASKIALLRRSLT 603
>gi|147843186|emb|CAN82672.1| hypothetical protein VITISV_014349 [Vitis vinifera]
Length = 710
Score = 625 bits (1612), Expect = e-176, Method: Compositional matrix adjust.
Identities = 328/723 (45%), Positives = 429/723 (59%), Gaps = 71/723 (9%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
VTYD RSLII+G R+++ S +IHYPRS P MW L+ +AKEGGV+ I++YVFWN HE
Sbjct: 24 AQVTYDGRSLIIDGHRKILFSGSIHYPRSTPQMWASLIAKAKEGGVDVIQTYVFWNRHEP 83
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
PG+Y F GR++L KFIK IQ +Y LRIGPF+ +E++YGG+P WLH + G V+R D
Sbjct: 84 QPGQYDFNGRYDLXKFIKEIQAQGLYACLRIGPFIESEWSYGGLPFWLHDVHGIVYRTDN 143
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
EPFK++MQ F T IV++MK E L+ASQGGPIIL+Q+ENEY E+ + E G Y WAAK
Sbjct: 144 EPFKFYMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYQNIEAAFNEKGPSYVRWAAK 203
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQ-FT-PHSPSMPKIWTENWPGWFKTFG 263
MAV GVPW+MC+Q D PDPVINTCN C Q FT P+SP+ P +WTENW +++ FG
Sbjct: 204 MAVELQTGVPWVMCKQSDAPDPVINTCNGMRCGQTFTGPNSPNKPSMWTENWTSFYEVFG 263
Query: 264 GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYG 323
G R +EDIAF VA F + GS NYYM
Sbjct: 264 GETYLRSAEDIAFHVALFIARNGSYVNYYM----------------------------VS 295
Query: 324 LPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDK 383
L R PKWGHLKELH AI LC LLNG +SN+SLG QEA V+ + G C AFL N D+
Sbjct: 296 LIRQPKWGHLKELHAAITLCSTPLLNGVQSNISLGQLQEAYVFQEEMGGCVAFLVNNDEG 355
Query: 384 NDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGS 443
N+ TV+F+NVS L S+SILPDCK V+FNTA + + E + S S D
Sbjct: 356 NNSTVLFQNVSIELLPKSISILPDCKNVIFNTAKINTGYN------ERITTSSQSFDAVD 409
Query: 444 KGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
+W+ +K+ + + + ++H+N TKD +DYLWYT N + + P
Sbjct: 410 ---RWEEYKDAIPNFLDTSLKSNMILEHMNMTKDESDYLWYTFRFQPN------SSCTEP 460
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
+L IES HA+HAF N G+ G+ F +K+PISL N I++LS+ VG ++G
Sbjct: 461 LLHIESLAHAVHAFVNNIYVGATHGSHDMKGFTFKSPISLNNEMNNISILSVMVGFPDSG 520
Query: 564 PFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEP 623
+ E AG+T V+I G D + Y+W Y++GL GE L IY +N+ W T E
Sbjct: 521 AYLESRFAGLTRVEIQCTEKGIYDFANYTWGYQVGLSGEKLHIYKEENLSNVEWRKT-EI 579
Query: 624 PKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQ 683
NQPLTWYK V P GD+P+ L++ MGKG AW+NG+ IGRYW
Sbjct: 580 STNQPLTWYKIVFNTPSGDDPVALNLSTMGKGEAWVNGQSIGRYWV-------------- 625
Query: 684 ECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
F+ K G+PSQ YH+PR++ K SEN+LV+ EE GDP I+ +
Sbjct: 626 ------SFHNSK-----GDPSQTLYHVPRAFLKTSENLLVLLEEANGDPLHISLETISRT 674
Query: 744 GFP 746
P
Sbjct: 675 DLP 677
>gi|108862584|gb|ABA97655.2| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 713
Score = 624 bits (1610), Expect = e-176, Method: Compositional matrix adjust.
Identities = 322/648 (49%), Positives = 413/648 (63%), Gaps = 37/648 (5%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NVTYD R+++I G+R +++SA +HYPR+ P MWP L+ + KEGG + IE+YVFWNGHE +
Sbjct: 63 NVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGADVIETYVFWNGHEPA 122
Query: 87 PGKYYFGGRFNLVKFIKI--------IQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPG 138
G+YYF RF+LVKF KI + +++ LRIGP+ AE+N+GG PVWL IPG
Sbjct: 123 KGQYYFEERFDLVKFAKIDLVKFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPG 182
Query: 139 TVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKR 198
FR D EPFK MQ F+T IV +MK EKL++ QGGPIIL Q+ENEYG + YG+ GKR
Sbjct: 183 IEFRTDNEPFKAEMQTFVTKIVTLMKEEKLYSWQGGPIILQQIENEYGNIQGNYGQAGKR 242
Query: 199 YALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGW 258
Y WAA+MA+ + G+PW+MC+Q D P+ +I+TCN+FYCD F P+S + P IWTE+W GW
Sbjct: 243 YMQWAAQMAIGLDTGIPWVMCRQTDAPEEIIDTCNAFYCDGFKPNSYNKPTIWTEDWDGW 302
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAP 318
+ +GG PHRP+ED AF+VARF+Q+GGS+ NYYMY GGTNF RTAGGP TSYDY+AP
Sbjct: 303 YADWGGALPHRPAEDSAFAVARFYQRGGSLQNYYMYFGGTNFARTAGGPLQITSYDYDAP 362
Query: 319 IDEYGLPRNPKWGHLKELHGAIKLCEHALL--NGERSNLSLGSSQEADVY---------- 366
IDEYG+ R PKWGHLK+LH AIKLCE AL+ +G + LGS QEA VY
Sbjct: 363 IDEYGILRQPKWGHLKDLHTAIKLCEPALIAVDGSPQYIKLGSMQEAHVYSTGEVHTNGS 422
Query: 367 -ADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSS-- 423
A ++ C+AFLAN+D+ +V SY LP WSVSILPDC+ V FNTA + AQ+S
Sbjct: 423 MAGNAQICSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCENVAFNTARIGAQTSVF 482
Query: 424 TVE----MVPENLQPSEASPDNGSKGLK--WQVFKEIAGIWGEADFVKSGFVDHINTTKD 477
TVE +PS S +G L W KE G WG +F G ++H+N TKD
Sbjct: 483 TVESGSPSRSSRHKPSILSLTSGGPYLSSTWWTSKETIGTWGGNNFAVQGILEHLNVTKD 542
Query: 478 TTDYLWYTTSIIVNENEEFL--KNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPF 535
+DYLWYTT + +++ + G P L I+ F N +L GS G+
Sbjct: 543 ISDYLWYTTRVNISDADVAFWSSKGVLPSLTIDKIRDVARVFVNGKLAGSQVGHWV---- 598
Query: 536 KYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGIT-SVKITGFNSGTLDLSTYSWT 594
K PI L G NE+ LLS VGLQN G F E GAG V +TG + G +DL+ WT
Sbjct: 599 SLKQPIQLVEGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVTLTGLSDGDVDLTNSLWT 658
Query: 595 YKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGD 642
Y++GL+GE IY P + W S M+ QP TWYK + Q GD
Sbjct: 659 YQVGLKGEFSMIYAPEKQGCAGW-SRMQKDSVQPFTWYKNICNQSVGD 705
>gi|242081931|ref|XP_002445734.1| hypothetical protein SORBIDRAFT_07g024870 [Sorghum bicolor]
gi|241942084|gb|EES15229.1| hypothetical protein SORBIDRAFT_07g024870 [Sorghum bicolor]
Length = 844
Score = 621 bits (1602), Expect = e-175, Method: Compositional matrix adjust.
Identities = 315/712 (44%), Positives = 439/712 (61%), Gaps = 42/712 (5%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
++YD RSL+++GRRE+ S +IHYPRS P MWP L+ +AKEGG+NTIE+YVFWN HE
Sbjct: 38 ISYDRRSLMVDGRREIFFSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYVFWNIHEPEK 97
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G++ F GR+++VKF K+IQ+ M+ ++R+GPF+ AE+N+GG+P WL IP VFR + EP
Sbjct: 98 GQFNFEGRYDMVKFFKLIQEHDMFAMVRLGPFIQAEWNHGGLPYWLREIPDIVFRTNNEP 157
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
+K HM+ F+ +++ +K LFASQGGPIILAQ+ENEY + E+ + E G +Y WAA+MA
Sbjct: 158 YKMHMETFVKIVIKRLKDANLFASQGGPIILAQIENEYQHLEAAFKEEGTKYIHWAAQMA 217
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIWTENWPGWFKTFGGR 265
+ NIG+PWIMC+Q P VI TCN C P + +MP +WTENW ++ FG
Sbjct: 218 IGTNIGIPWIMCKQTKAPGDVIPTCNGRNCGDTWPGPMNKTMPLLWTENWTAQYRVFGDP 277
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
R +EDIAF+VARFF GG++ NYYMYHGGTNFGRTA F+ Y EAP+DE+GL
Sbjct: 278 PSQRSAEDIAFAVARFFSVGGTMTNYYMYHGGTNFGRTAAA-FVMPKYYDEAPLDEFGLY 336
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDDKN 384
+ PKWGHL++LH A+KLC+ ALL G+ S LG EA V+ C AFL+N + K+
Sbjct: 337 KEPKWGHLRDLHLALKLCKKALLWGKPSTEKLGKQLEARVFEIPEQKVCVAFLSNHNTKD 396
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
D T+ FR Y +P S+SIL DCK VVF T +V AQ + Q + D ++
Sbjct: 397 DVTLTFRGQPYFVPRHSISILADCKTVVFGTQHVNAQHN---------QRTFHFADQTNQ 447
Query: 445 GLKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
WQ+F +E + +A D N TKD TDY+WYT+S + ++ ++ +
Sbjct: 448 NNVWQMFDEEKVPKYKQAKIRTRKAADLYNLTKDKTDYVWYTSSFKLEPDDMPIRRDIKT 507
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
V+ + S GHA AF N + G G + F + P+ LK G N +A+L+ ++G+ ++G
Sbjct: 508 VVEVNSHGHASVAFVNNKFAGCGHGTKMNKAFTLEKPMELKKGVNHVAVLASSMGMMDSG 567
Query: 564 PFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEP 623
+ E AG+ V+ITG N+GTLDL+ W + +GL GE IY ++ W +
Sbjct: 568 AYLEHRLAGVDRVQITGLNAGTLDLTNNGWGHIVGLVGEQKEIYTEKGMASVTWKPAVN- 626
Query: 624 PKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQ 683
++PLTWYK P G++PI LDM MGKG+ ++NG+ IGRYW S H
Sbjct: 627 --DKPLTWYKRHFDMPSGEDPIVLDMSTMGKGMMYVNGQGIGRYW-----MSYKH----- 674
Query: 684 ECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
G PSQ+ YHIPRS+ +P +N+LV+FEE+ G P I
Sbjct: 675 ---------------ALGRPSQQLYHIPRSFLRPKDNVLVLFEEEFGRPDAI 711
>gi|357473809|ref|XP_003607189.1| Beta-galactosidase [Medicago truncatula]
gi|355508244|gb|AES89386.1| Beta-galactosidase [Medicago truncatula]
Length = 825
Score = 621 bits (1601), Expect = e-175, Method: Compositional matrix adjust.
Identities = 325/735 (44%), Positives = 444/735 (60%), Gaps = 47/735 (6%)
Query: 10 FALLIFFSSSITYCFAGN----VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQ 65
F++ +F S IT A N +TYD RSL+++G+ EL S +IHYPRS P MWP ++ +
Sbjct: 8 FSITLF--SIITIVCAQNAAQTITYDGRSLLLDGKGELFFSGSIHYPRSTPDMWPDILDK 65
Query: 66 AKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYN 125
A+ GG+N I++YVFWNGHE K F GR++LVKF+K++Q+ MY+ LRIGPF+ AE+N
Sbjct: 66 ARRGGLNLIQTYVFWNGHEPEKDKVNFEGRYDLVKFLKLVQEKGMYVTLRIGPFIQAEWN 125
Query: 126 YGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEY 185
+GG+P WL +P +FR++ EPFK +M+++++++++ MK EKLFA QGGPIILAQ+ENEY
Sbjct: 126 HGGLPYWLREVPDIIFRSNNEPFKKYMKEYVSIVINRMKEEKLFAPQGGPIILAQIENEY 185
Query: 186 GYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFT-PH 243
+ + Y G Y WAAKMAV+ GVPW+MC+Q D PDPVIN CN +C D FT P+
Sbjct: 186 NHIQLAYEADGDNYVQWAAKMAVSLYNGVPWVMCKQKDAPDPVINACNGRHCGDTFTGPN 245
Query: 244 SPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRT 303
P P IWTENW ++ FG R +EDIAFSVARFF K GS+ NYYMYHGGTNFGRT
Sbjct: 246 KPYKPFIWTENWTAQYRVFGDPPSQRSAEDIAFSVARFFSKHGSLVNYYMYHGGTNFGRT 305
Query: 304 AGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEA 363
F TT Y EAP+DE+GL R PKW HL++ H A+ LC+ +LLNG + + E
Sbjct: 306 TSA-FTTTRYYDEAPLDEFGLQREPKWSHLRDAHKAVNLCKKSLLNGVPTTQKISQYHEV 364
Query: 364 DVY-ADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQS 422
VY S CAAF+ N + KT+ FR Y LP S+SILPDCK VVFNT N+ +Q
Sbjct: 365 IVYEKKESNLCAAFITNNHTQTAKTLSFRGSDYFLPPRSISILPDCKTVVFNTQNIASQH 424
Query: 423 STVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYL 482
S+ + + S+ D KW+VF E E + + + KD TDY
Sbjct: 425 SS-----RHFEKSKTGND-----FKWEVFSEPIPSAKELPSKQKLPAELYSLLKDKTDYG 474
Query: 483 WYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPIS 542
WYTTS+ + + K+ PVL I S GH+L AF N E GS G+ F+++ P++
Sbjct: 475 WYTTSVELGPEDIPKKSDVAPVLRILSLGHSLQAFVNGEYIGSKHGSHEEKGFEFQKPVN 534
Query: 543 LKAGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGE 602
K G N+IA+L+ VGL ++G + E AG ++ I G SGT+DL++ W +++GLQGE
Sbjct: 535 FKVGVNQIAILANLVGLPDSGAYMEHRYAGPKTITILGLMSGTIDLTSNGWGHQVGLQGE 594
Query: 603 HLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGE 662
+ I+ + W K ++WYK P G P+ + M M KG+ W+NGE
Sbjct: 595 NDSIFTEKGSKKVEWKDGKG--KGSTISWYKTNFDTPEGTNPVAIGMEGMAKGMIWVNGE 652
Query: 663 EIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENIL 722
IGR+W ++ G+P+Q YHIPRS+ KP +N+L
Sbjct: 653 SIGRHWM-------------------------SYLSPLGKPTQSEYHIPRSFLKPKDNLL 687
Query: 723 VIFEEKGGDPTKITF 737
VIFEE+ P KI
Sbjct: 688 VIFEEEAISPDKIAI 702
>gi|413925747|gb|AFW65679.1| hypothetical protein ZEAMMB73_601729 [Zea mays]
Length = 846
Score = 619 bits (1597), Expect = e-174, Method: Compositional matrix adjust.
Identities = 317/712 (44%), Positives = 437/712 (61%), Gaps = 42/712 (5%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD RSLII+GRRE+ S +IHYPRS P MWP L+ +AKEGG+NTIE+Y+FWN HE
Sbjct: 41 VSYDRRSLIIDGRREIFFSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYIFWNIHEPEK 100
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G++ F GR+++V+F K+IQ+ MY ++R+GPF+ AE+N+GG+P WL IP VFR + EP
Sbjct: 101 GQFDFEGRYDIVRFFKLIQEHNMYAMVRLGPFIQAEWNHGGLPYWLREIPDIVFRTNNEP 160
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
+K HM+ F+ +I+ +K LFASQGGPIILAQ+ENEY + E+ + G +Y WAA MA
Sbjct: 161 YKMHMETFVKIIIKRLKDANLFASQGGPIILAQIENEYQHLEAAFKNDGTKYIKWAANMA 220
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIWTENWPGWFKTFGGR 265
++ N+G+PWIMC+Q P VI TCN C P + SMP +WTENW ++ FG
Sbjct: 221 ISTNVGIPWIMCKQTKAPSDVIPTCNGRNCGDTWPGPMNKSMPLLWTENWTAQYRVFGDP 280
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
R +EDIAF+VARFF GG++ NYYMYHGGTNFGRT+ F+ Y EAP+DE+GL
Sbjct: 281 PSQRSAEDIAFAVARFFSVGGTMTNYYMYHGGTNFGRTSAA-FVMPKYYDEAPLDEFGLY 339
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDDKN 384
+ PKWGHL++LH A+KLC+ ALL G+ S LG EA V+ C AFL+N + K+
Sbjct: 340 KEPKWGHLRDLHLALKLCKKALLWGKTSTEKLGKQFEARVFEIPEQKVCVAFLSNHNTKD 399
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
D T+ FR SY +P S+SIL DCK VVF T +V AQ + Q + D ++
Sbjct: 400 DVTLTFRGQSYFVPRHSISILADCKTVVFGTQHVNAQHN---------QRTFHFADQTTQ 450
Query: 445 GLKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
WQ+F +E + ++ D N TKD TDY+WYT+S + ++ ++ +
Sbjct: 451 NNVWQMFDEEKVPKYKQSKIRLRKAGDLYNLTKDKTDYVWYTSSFKLEADDMPIRRDIKT 510
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
VL + S GHA AF N + G G + F + P+ LK G N +A+L+ T+G+ ++G
Sbjct: 511 VLEVNSHGHASVAFVNTKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAVLASTMGMMDSG 570
Query: 564 PFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEP 623
+ E AG+ V+I G N+GTLDL+ W + +GL GE IY ++ W +
Sbjct: 571 AYLEHRLAGVDRVQIKGLNAGTLDLTNNGWGHIVGLVGEQKQIYTDKGMGSVTWKPAVN- 629
Query: 624 PKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQ 683
++PLTWYK P G++PI LDM MGKGL ++NG+ IGRYW S H
Sbjct: 630 --DRPLTWYKRHFDMPSGEDPIVLDMSTMGKGLMFVNGQGIGRYWI-----SYKH----- 677
Query: 684 ECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
G PSQ+ YHIPRS+ + +N+LV+FEE+ G P I
Sbjct: 678 ---------------ALGRPSQQLYHIPRSFLRQKDNVLVLFEEEFGRPDAI 714
>gi|227053532|gb|ACP18874.1| beta-galactosidase pBG(b) [Carica papaya]
Length = 514
Score = 619 bits (1597), Expect = e-174, Method: Compositional matrix adjust.
Identities = 292/504 (57%), Positives = 368/504 (73%), Gaps = 11/504 (2%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
+V+YD +++ ING+R +++S +IHYPRS P MWP L+Q+AKEGG++ I++YVFWNGHE
Sbjct: 19 ASVSYDHKAITINGKRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEP 78
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
SPGKYYFGG ++LV+FIK+++QA +Y+ LRIGP+V AE+N+GG PVWL YIPG FR +
Sbjct: 79 SPGKYYFGGNYDLVRFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGIAFRTNN 138
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
PFK +MQ+F IVDMMK E LF SQGGPIIL+Q+ENEYG E G G+ Y+ WAA+
Sbjct: 139 GPFKAYMQRFTKKIVDMMKAEGLFESQGGPIILSQIENEYGPMEYELGAAGRAYSQWAAQ 198
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGR 265
MAV GVPW+MC+Q D PDP+IN+CN FYCD F+P+ PK+WTE W GWF FGG
Sbjct: 199 MAVGLGTGVPWVMCKQDDAPDPIINSCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGA 258
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
P+RP ED+AFSVARF QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+AP+DEYGL
Sbjct: 259 VPYRPVEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLV 318
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKND 385
R PKWGHLK+LH AIKLCE AL++G+ S + LG QEA V+ G CAAFLAN + ++
Sbjct: 319 RQPKWGHLKDLHRAIKLCEPALVSGDPSVMPLGRFQEAHVFKSKYGHCAAFLANYNPRSF 378
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
V F N+ Y+LP WS+SILPDCK V+NTA V AQS+ ++MVP P +G+
Sbjct: 379 AKVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQSARMKMVP--------VPIHGA-- 428
Query: 446 LKWQVFKEIA-GIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
WQ + E A GE F G V+ INTT+D +DYLWY+T + ++ +E FLK G P
Sbjct: 429 FSWQAYNEEAPSSNGERSFTTVGLVEQINTTRDVSDYLWYSTDVKIDPDEGFLKTGKYPT 488
Query: 505 LLIESKGHALHAFANQELQGSASG 528
L + S GHALH F N +L + G
Sbjct: 489 LTVLSAGHALHVFVNDQLSVARDG 512
>gi|357520325|ref|XP_003630451.1| Beta-galactosidase [Medicago truncatula]
gi|355524473|gb|AET04927.1| Beta-galactosidase [Medicago truncatula]
Length = 706
Score = 617 bits (1592), Expect = e-174, Method: Compositional matrix adjust.
Identities = 334/741 (45%), Positives = 437/741 (58%), Gaps = 70/741 (9%)
Query: 14 IFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNT 73
+ + S+ NVTYD SL+ING +++ S +IHYPRS P MWP L+ +AKEGG++
Sbjct: 12 LILTVSLCTVHGANVTYDRTSLVINGHHKILFSGSIHYPRSTPQMWPDLISKAKEGGLDV 71
Query: 74 IESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWL 133
I++YVFWN HE G+Y F GRF+LV FIK IQ +Y+ LRIGP++ +E YGG+P+WL
Sbjct: 72 IQTYVFWNLHEPQQGQYEFNGRFDLVGFIKEIQAQGLYVTLRIGPYIESECTYGGLPLWL 131
Query: 134 HYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYG 193
H +PG VFR D + FK+HMQ+F T IV+MMK LFASQGGPIIL+Q+ENEYG +S +
Sbjct: 132 HDVPGIVFRTDNDQFKFHMQRFTTKIVNMMKSANLFASQGGPIILSQIENEYGSIQSKFR 191
Query: 194 EGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIW 251
G Y WAA+MAV GVPW+MC+Q D PDPVIN CN C + P+SP+ P +W
Sbjct: 192 ANGLPYIHWAAQMAVGLQTGVPWMMCKQDDAPDPVINACNGMQCGRNFKGPNSPNKPSLW 251
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TENW + + FGG R + DIA++VA F K GS NYYMYHGGTNF R A IT
Sbjct: 252 TENWTSFLQAFGGAPYMRSASDIAYNVALFIAKKGSYVNYYMYHGGTNFDRLASAFIITA 311
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
YD EAP+DEYGL R PKWGHLKELH +IK C LL+G ++ SLGS Q+ + +SS
Sbjct: 312 YYD-EAPLDEYGLVRQPKWGHLKELHASIKSCSQPLLDGTQTTFSLGSEQQV-IKNESSW 369
Query: 372 ACAAFLANMDDKN-----------DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRA 420
+ + +N D T+ F+N+SY LP S+SILP CK VVFNT V
Sbjct: 370 TYFPLMFSEVPQNVLLSWKISGPRDVTIQFQNISYELPGKSISILPGCKNVVFNTGKVSI 429
Query: 421 QSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTD 480
Q++ M P LQ + A W+V+ E + +D I+T KDT+D
Sbjct: 430 QNNVRAMKPR-LQFNSAE--------NWKVYTEAIPNFAHTSKRADTLLDQISTAKDTSD 480
Query: 481 YLWYTTSIIVNENEEFLKNGSRP----VLLIESKGHALHAFANQELQGSASGNGTHPPFK 536
Y+WYT F N P VL I S+G LH+F N L GSA G+ +
Sbjct: 481 YMWYT----------FRFNNKSPNAKSVLSIYSQGDVLHSFINGVLTGSAHGSRNNTQVT 530
Query: 537 YKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYK 596
K ++L G N I++LS TVGL N+G F E AG+ V++ G D S+YSW Y+
Sbjct: 531 MKKNVNLINGMNNISILSATVGLPNSGAFLESRVAGLRKVEVQG-----RDFSSYSWGYQ 585
Query: 597 IGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGL 656
+GL GE L I+ + + W S K PLTWY+ P G++P+ +++ MGKGL
Sbjct: 586 VGLLGEKLQIFTVSGSSKVQWKSFQSSTK--PLTWYQTTFHAPAGNDPVVVNLGSMGKGL 643
Query: 657 AWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFK 716
AW+NG+ IGRYW + PD G PSQ+WYHIPRS+ K
Sbjct: 644 AWVNGQGIGRYWVSFHK-------------------PD------GTPSQQWYHIPRSFLK 678
Query: 717 PSENILVIFEEKGGDPTKITF 737
+ N+LVI EE+ G+P IT
Sbjct: 679 STGNLLVILEEETGNPLGITL 699
>gi|357142200|ref|XP_003572492.1| PREDICTED: beta-galactosidase 11-like [Brachypodium distachyon]
Length = 823
Score = 617 bits (1592), Expect = e-174, Method: Compositional matrix adjust.
Identities = 316/708 (44%), Positives = 436/708 (61%), Gaps = 43/708 (6%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+T+D RSL+++GRR+L S +IHYPRS P MWP L+ +AKEGG+N IESYVFWNGHE
Sbjct: 15 ITFDRRSLMVDGRRDLFFSGSIHYPRSPPHMWPDLIARAKEGGLNVIESYVFWNGHEPEM 74
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G Y F GR++++KF K++Q+ M+ ++RIGPFV AE+N+GG+P WL +P +FR + EP
Sbjct: 75 GVYNFEGRYDMIKFFKLVQEHEMFAMVRIGPFVQAEWNHGGLPYWLREVPDIIFRTNNEP 134
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK HMQKF+T+IV+ +K KLFASQGGPIILAQ+ENEY + E+ + E G Y WAAKMA
Sbjct: 135 FKKHMQKFVTMIVNKLKDAKLFASQGGPIILAQIENEYQHLEAAFKENGTTYIHWAAKMA 194
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIWTENWPGWFKTFGGR 265
NIGVPWIMC+Q P VI TCN +C P + P +WTENW ++ FG
Sbjct: 195 SDLNIGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPTDKNKPLLWTENWTAQYRVFGDP 254
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
R +EDIAF+VARF+ GG++ NYYMYHGGTNFGRT G F+ Y EAP+DE+GL
Sbjct: 255 PSQRSAEDIAFAVARFYSVGGTMVNYYMYHGGTNFGRT-GASFVMPRYYDEAPLDEFGLY 313
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDDKN 384
+ PKWGHL++LH A++LC+ A+L G SN LG EA ++ C AFL+N + K
Sbjct: 314 KEPKWGHLRDLHHALRLCKKAILWGNPSNQPLGKLYEARLFEIPEQKICVAFLSNHNTKE 373
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
D TV FR Y +P SVSIL DCK VVF+T +V +Q + Q + D +
Sbjct: 374 DGTVTFRGQQYFVPRRSVSILADCKTVVFSTQHVNSQHN---------QRTFHFSDQTVQ 424
Query: 445 GLKWQVFKEIAGI--WGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
G W+++ E + + + ++ N TKD TDY+WYTTS + + +
Sbjct: 425 GNVWEMYTESDKVPTYKFTNIRTQKPLEAYNLTKDKTDYVWYTTSFKLEAEDLPFRKDIW 484
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
PVL + S GHA+ AF N + G+ G + F + PI ++ G N +++LS T+G+Q++
Sbjct: 485 PVLEVSSHGHAMVAFVNGKYVGAGHGTKINKAFTMEKPIEVRTGINHVSILSTTLGMQDS 544
Query: 563 GPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
G + E AGI V I G N+GTLDL++ W + +GL+GE + + + WV +
Sbjct: 545 GVYLEHRQAGIDGVTIQGLNTGTLDLTSNGWGHLVGLEGERRNAHTEKGGDGVQWVPAV- 603
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
++PLTWY+ P GD+P+ +DM MGKG+ ++NGE +GRYW S H
Sbjct: 604 --FDRPLTWYRRRFDIPTGDDPVVIDMSPMGKGVLYVNGEGLGRYW-----SSYKH---- 652
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGG 730
G PSQ YH+PR + KP+ N++ IFEE+GG
Sbjct: 653 ----------------ALGRPSQYLYHVPRCFLKPTGNVMTIFEEEGG 684
>gi|24417238|gb|AAN60229.1| unknown [Arabidopsis thaliana]
Length = 569
Score = 617 bits (1591), Expect = e-174, Method: Compositional matrix adjust.
Identities = 307/563 (54%), Positives = 381/563 (67%), Gaps = 15/563 (2%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
L I SS+ + VTYD ++LIING+R ++IS +IHYPRS P MWP L+++AKEGG+
Sbjct: 13 LAILCFSSLIHSTEAVVTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKAKEGGL 72
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ I++YVFWNGHE SPG YYF R++LVKF K++ QA +Y+ LRIGP+V AE+N+GG PV
Sbjct: 73 DVIQTYVFWNGHEPSPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNFGGFPV 132
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL Y+PG VFR D EPFK MQKF IVDMMK EKLF +QGGPIIL+Q+ENEYG +
Sbjct: 133 WLKYVPGMVFRTDNEPFKIAMQKFTKKIVDMMKEEKLFETQGGPIILSQIENEYGPMQWE 192
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
G GK Y+ W A+MA+ + GVPWIMC+Q D P P+I+TCN FYC+ F P+S + PK+W
Sbjct: 193 MGAAGKAYSKWTAEMALGLSTGVPWIMCKQEDAPYPIIDTCNGFYCEGFKPNSDNKPKLW 252
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TENW GWF FGG P+RP EDIAFSVARF Q GGS NYYMY GGTNF RTA G FI T
Sbjct: 253 TENWTGWFTEFGGAIPNRPVEDIAFSVARFIQNGGSFMNYYMYXGGTNFDRTA-GVFIAT 311
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
SYDY+APIDEYGL R PK+ HLKELH IKLCE AL++ + + SLG QE V+ S
Sbjct: 312 SYDYDAPIDEYGLLREPKYSHLKELHKVIKLCEPALVSVDPTITSLGDKQEIHVF-KSKT 370
Query: 372 ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPEN 431
+CAAFL+N D + V+FR Y LP WSVSILPDCK +NTA +RA + ++M+P
Sbjct: 371 SCAAFLSNYDTSSAARVMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRAPTILMKMIPT- 429
Query: 432 LQPSEASPDNGSKGLKWQVFKEIAGIWGEA-DFVKSGFVDHINTTKDTTDYLWYTTSIIV 490
S W+ + E + EA FVK G V+ I+ T+D TDY WY T I +
Sbjct: 430 -----------STKFSWESYNEGSPSSNEAGTFVKDGLVEQISMTRDKTDYFWYFTDITI 478
Query: 491 NENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
+E FLK G P+L I S GHALH F N L G++ G ++ + I L G N++
Sbjct: 479 GSDESFLKTGDNPLLTIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQNIKLSVGINKL 538
Query: 551 ALLSMTVGLQNAGPFYEWVGAGI 573
ALLS VGL NAG YE GI
Sbjct: 539 ALLSTAVGLPNAGVHYETWNTGI 561
>gi|297851602|ref|XP_002893682.1| Beta-galactosidase 15 precursor [Arabidopsis lyrata subsp. lyrata]
gi|297339524|gb|EFH69941.1| Beta-galactosidase 15 precursor [Arabidopsis lyrata subsp. lyrata]
Length = 780
Score = 616 bits (1589), Expect = e-173, Method: Compositional matrix adjust.
Identities = 329/736 (44%), Positives = 436/736 (59%), Gaps = 67/736 (9%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
F L SS Y A V++D R++ I+G R +++S +IHYPRS MWP L+++ KEG
Sbjct: 7 FLLCCLLVSSCAY--ATIVSHDGRAITIDGHRRVLLSGSIHYPRSTTEMWPDLIKKGKEG 64
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ IE+YVFWN HE + +Y F G +L++F+K IQ MY +LRIGP+V AE+NYGG
Sbjct: 65 GLDAIETYVFWNAHEPTRRQYDFSGNLDLIRFLKTIQDEGMYGVLRIGPYVCAEWNYGGF 124
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWLH +PG FR F MQ F T+IV+M+K+EKLFASQGGPIILAQ+ENEYG
Sbjct: 125 PVWLHNMPGMEFRTTNTAFMNEMQNFTTMIVEMVKKEKLFASQGGPIILAQIENEYGNVI 184
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
YGE GK Y W A MA + ++GVPWIMCQQ D P P++NTCN +YCD FTP++P+ PK
Sbjct: 185 GSYGEAGKAYIKWCANMANSLDVGVPWIMCQQDDAPQPMLNTCNGYYCDNFTPNNPNTPK 244
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTENW GW+K +GG+DPHR +ED+AF+VARFFQ+GG+ NYYMYHGGTNF RTAGGP+I
Sbjct: 245 MWTENWTGWYKNWGGKDPHRTTEDVAFAVARFFQRGGTFQNYYMYHGGTNFDRTAGGPYI 304
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TT+YDY+AP+DE+G PK+GHLK+LH + E L G S + G+ A VY
Sbjct: 305 TTTYDYDAPLDEFGNLNQPKYGHLKQLHDVLHAMEKTLTYGNISTVDFGNLVTATVYKTE 364
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
G+ + F+ N+++ +D + F+ Y +PAWSVSILPDCK +NTA + Q+S MV
Sbjct: 365 EGS-SCFIGNVNETSDAKINFQGTFYDVPAWSVSILPDCKTETYNTAKINTQTSV--MVK 421
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEIAG---IWGEADFVKSGFVDHINTTKDTTDYLWYTT 486
+ +EA +N LKW E + G+ + D + D +DYLWY T
Sbjct: 422 ---KANEA--ENEPSTLKWSWRPENIDNVLLKGKGESTMRQLFDQKVVSNDESDYLWYMT 476
Query: 487 SIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAG 546
++ + E + G L I S H LHAF N + G+ + ++ G
Sbjct: 477 TVNIKEQDPVW--GKNMSLRINSTAHVLHAFVNGQHIGNYRAENGKFHYVFEQDAKFNPG 534
Query: 547 KNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTL---DLSTYSWTYKIGLQGE 602
N I LLS+TVGL N G F+E V AGIT V I G N DLST+ W+YK GL G
Sbjct: 535 ANVITLLSITVGLPNYGAFFENVPAGITGPVFIIGRNGDETIVKDLSTHKWSYKTGLSGF 594
Query: 603 HLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGE 662
+++ P TW P G EP+ +D+L +GKG AW+NG
Sbjct: 595 ENQLFS----------------SESPSTW-----SAPLGSEPVVVDLLGLGKGTAWINGN 633
Query: 663 EIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPS-ENI 721
IGRYWP F D I GC YH+PRS+ +N
Sbjct: 634 NIGRYWP--------------------AFLAD--IDGCSAE----YHVPRSFLNSDGDNT 667
Query: 722 LVIFEEKGGDPTKITF 737
LV+FEE GG+P+ + F
Sbjct: 668 LVLFEEIGGNPSLVNF 683
>gi|357453875|ref|XP_003597218.1| Beta-galactosidase [Medicago truncatula]
gi|355486266|gb|AES67469.1| Beta-galactosidase [Medicago truncatula]
Length = 2260
Score = 615 bits (1586), Expect = e-173, Method: Compositional matrix adjust.
Identities = 291/500 (58%), Positives = 365/500 (73%), Gaps = 9/500 (1%)
Query: 24 FAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGH 83
F NV YD R+L+I+G+R ++IS +IHYPRS P MWP L+Q++K+GG++ IE+YVFWN H
Sbjct: 18 FCTNVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSKDGGLDVIETYVFWNLH 77
Query: 84 ELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRN 143
E G+Y F GR +LVKF+K + +A +Y+ LRIGP+V +E+NYGG P+WLH+IPG FR
Sbjct: 78 EPVKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCSEWNYGGFPLWLHFIPGIKFRT 137
Query: 144 DTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
D EPFK M++F T IVD+MK+EKL+ASQGGPIIL+Q+ENEYG +S YG GK Y WA
Sbjct: 138 DNEPFKVEMKRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGDIDSAYGSAGKSYINWA 197
Query: 204 AKMAVAQNIGVPWIMCQQFDTPDP-VINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTF 262
AKMA + + GVPW+MCQQ D PDP VINTCN FYCDQFTP+S + PK+WTENW W+ F
Sbjct: 198 AKMATSLDTGVPWVMCQQADAPDPIVINTCNGFYCDQFTPNSKTKPKLWTENWSAWYLLF 257
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
GG PHRP ED+AF+VARFFQ+GG+ NYYMYHGGTNF R+ GGPFI TSYD++APIDEY
Sbjct: 258 GGGFPHRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRSTGGPFIATSYDFDAPIDEY 317
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
G+ R PKWGHLK++H AIKLCE AL+ E LG + EA VY S CAAFLAN+D
Sbjct: 318 GVIRQPKWGHLKDVHKAIKLCEEALIAAEPKITYLGPNLEAAVYKTGS-VCAAFLANVDA 376
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
K+DKTV F SYHLPAWSVSILPDCK VV NTA + + S+ V E+L+ +S +
Sbjct: 377 KSDKTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKINSASTISNFVTESLKEDISSSETS 436
Query: 443 SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
KW E GI + K+G ++ IN T D +DYLWY+ S+ + ++ GS+
Sbjct: 437 RS--KWSWINEPVGISKDDILSKTGLLEQINITADRSDYLWYSLSVDLKDDP-----GSQ 489
Query: 503 PVLLIESKGHALHAFANQEL 522
VL IES GHALHAF N +L
Sbjct: 490 TVLHIESLGHALHAFINGKL 509
Score = 206 bits (523), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 105/222 (47%), Positives = 142/222 (63%), Gaps = 9/222 (4%)
Query: 524 GSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFN 582
GS +GN P PI++ +GKN+I LLS+TVGLQN G F++ GAGIT V + G
Sbjct: 1933 GSQTGNKEKPKLNEDIPITVLSGKNKIDLLSLTVGLQNYGAFFDTWGAGITGPVILKGLK 1992
Query: 583 SG--TLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPP 640
+G TLDLS+ WTY++GL+GE LG+ + ++ W S PK QPL WYK P
Sbjct: 1993 NGNKTLDLSSRKWTYQVGLKGEDLGLSSG---SSGAWNSKTTFPKKQPLIWYKTNFDAPS 2049
Query: 641 GDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGC 700
G P+ +D MGKG AW+NG+ IGRYWP + + +C C+YRG F KC C
Sbjct: 2050 GSNPVVIDFTGMGKGEAWVNGQSIGRYWPTYV---ASNVDCTDSCNYRGPFTQTKCHMNC 2106
Query: 701 GEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
G+PSQ YH+P+S+ KP+ N LV+FEE GGDPT+I+F+ ++I
Sbjct: 2107 GKPSQTLYHVPQSFLKPNGNTLVLFEESGGDPTQISFATKQI 2148
>gi|183238712|gb|ACC60982.1| beta-galactosidase 2 precursor [Petunia x hybrida]
Length = 830
Score = 615 bits (1586), Expect = e-173, Method: Compositional matrix adjust.
Identities = 318/711 (44%), Positives = 433/711 (60%), Gaps = 40/711 (5%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD RS+I+NG REL+ S +IHYPR P MWP ++++AKEGG+N I++YVFWN HE
Sbjct: 28 VTYDGRSMIVNGERELLFSGSIHYPRMPPEMWPEIIRKAKEGGLNVIQTYVFWNIHEPVQ 87
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G++ F G ++LVKFIK I + +Y+ LRIGP++ AE+N GG P WL +P FR+ EP
Sbjct: 88 GQFNFEGNYDLVKFIKAIGEQGLYVTLRIGPYIEAEWNQGGFPYWLREVPNITFRSYNEP 147
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
F +HM+K+ +++D++K+EKLFA QGGPII+AQ+ENEY + Y + GK+Y WAA MA
Sbjct: 148 FIHHMKKYSEMVIDLVKKEKLFAPQGGPIIMAQIENEYNNVQLAYRDNGKKYIEWAANMA 207
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFT-PHSPSMPKIWTENWPGWFKTFGGR 265
+ GVPWIMC+Q D P VINTCN +C D FT P+ P+ P +WTENW ++TFG
Sbjct: 208 TSLYNGVPWIMCKQKDAPPQVINTCNGRHCADTFTGPNGPNKPSLWTENWTAQYRTFGDP 267
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
R +EDIAFSVARFF K G++ NYYMY+GGTN+GRT+ F+TT Y EAP+DE+GL
Sbjct: 268 PSQRAAEDIAFSVARFFAKNGTLTNYYMYYGGTNYGRTSSS-FVTTRYYDEAPLDEFGLY 326
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD-SSGACAAFLANMDDKN 384
R PKW HL++LH A++L ALL G + + E V+ S CAAFL N
Sbjct: 327 REPKWSHLRDLHRALRLSRRALLWGTPTVQKINQDLEITVFEKPGSTDCAAFLTNNHTTQ 386
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
T+ FR Y+LP SVSILPDCK VV+NT + +Q ++ N SE SK
Sbjct: 387 PSTIKFRGKDYYLPEKSVSILPDCKTVVYNTQTIVSQHNS-----RNFITSEK-----SK 436
Query: 445 GLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
LKW++++E + ++ + TKDT+DY WY+TSI + ++ ++ PV
Sbjct: 437 NLKWEMYQEKVPTIADLPLKNREPLELYSLTKDTSDYAWYSTSITLERHDLPMRPDILPV 496
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
L I S GHAL AF N E G GN F ++ PI LK G N I +L+ TVG N+G
Sbjct: 497 LQIASMGHALAAFVNGEYVGFGHGNNIEKSFVFQKPIILKPGTNTITILAETVGFPNSGA 556
Query: 565 FYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPP 624
+ E AG V I G +GTLD++ +W +++G+ GE ++ + W PP
Sbjct: 557 YMEKRFAGPRGVTIQGLMAGTLDITQNNWGHEVGVFGEKQELFTEEGAKKVQWTPVTGPP 616
Query: 625 KNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
K +TWYK P G+ P+ L M KM KG+ W+NG+ +GRYW
Sbjct: 617 KGA-VTWYKTYFDAPEGNNPVALKMDKMEKGMMWVNGKSLGRYW---------------- 659
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
++ G+P+Q YHIPR++ KP+ N+LVIFEE GG PT I
Sbjct: 660 ---------TSFLSPLGQPTQAEYHIPRAYLKPTNNLLVIFEETGGHPTNI 701
>gi|75169194|sp|Q9C6W4.1|BGL15_ARATH RecName: Full=Beta-galactosidase 15; Short=Lactase 15; Flags:
Precursor
gi|12597826|gb|AAG60136.1|AC074360_1 hypothetical protein [Arabidopsis thaliana]
Length = 779
Score = 615 bits (1585), Expect = e-173, Method: Compositional matrix adjust.
Identities = 328/736 (44%), Positives = 435/736 (59%), Gaps = 67/736 (9%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
F L SS Y A V++D R++ I+G R +++S +IHYPRS MWP L+++ KEG
Sbjct: 6 FILCCVLVSSCAY--ATIVSHDGRAITIDGHRRVLLSGSIHYPRSTTEMWPDLIKKGKEG 63
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
++ IE+YVFWN HE + +Y F G +L++F+K IQ MY +LRIGP+V AE+NYGG
Sbjct: 64 SLDAIETYVFWNAHEPTRRQYDFSGNLDLIRFLKTIQNEGMYGVLRIGPYVCAEWNYGGF 123
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWLH +PG FR F MQ F T+IV+M+K+EKLFASQGGPIILAQ+ENEYG
Sbjct: 124 PVWLHNMPGMEFRTTNTAFMNEMQNFTTMIVEMVKKEKLFASQGGPIILAQIENEYGNVI 183
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
YGE GK Y W A MA + ++GVPWIMCQQ D P P++NTCN +YCD F+P++P+ PK
Sbjct: 184 GSYGEAGKAYIQWCANMANSLDVGVPWIMCQQDDAPQPMLNTCNGYYCDNFSPNNPNTPK 243
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTENW GW+K +GG+DPHR +ED+AF+VARFFQK G+ NYYMYHGGTNF RTAGGP+I
Sbjct: 244 MWTENWTGWYKNWGGKDPHRTTEDVAFAVARFFQKEGTFQNYYMYHGGTNFDRTAGGPYI 303
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TT+YDY+AP+DE+G PK+GHLK+LH + E L G S + G+ A VY
Sbjct: 304 TTTYDYDAPLDEFGNLNQPKYGHLKQLHDVLHAMEKTLTYGNISTVDFGNLVTATVYQTE 363
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
G+ + F+ N+++ +D + F+ SY +PAWSVSILPDCK +NTA + Q+S MV
Sbjct: 364 EGS-SCFIGNVNETSDAKINFQGTSYDVPAWSVSILPDCKTETYNTAKINTQTSV--MVK 420
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEIAG---IWGEADFVKSGFVDHINTTKDTTDYLWYTT 486
+ +EA +N LKW E + G+ + D + D +DYLWY T
Sbjct: 421 ---KANEA--ENEPSTLKWSWRPENIDSVLLKGKGESTMRQLFDQKVVSNDESDYLWYMT 475
Query: 487 SIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAG 546
++ + E + L G L I S H LHAF N + G+ + ++ G
Sbjct: 476 TVNLKEQDPVL--GKNMSLRINSTAHVLHAFVNGQHIGNYRVENGKFHYVFEQDAKFNPG 533
Query: 547 KNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTL---DLSTYSWTYKIGLQGE 602
N I LLS+TVGL N G F+E AGIT V I G N DLST+ W+YK GL G
Sbjct: 534 ANVITLLSITVGLPNYGAFFENFSAGITGPVFIIGRNGDETIVKDLSTHKWSYKTGLSGF 593
Query: 603 HLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGE 662
+++ P TW P G EP+ +D+L +GKG AW+NG
Sbjct: 594 ENQLFS----------------SESPSTW-----SAPLGSEPVVVDLLGLGKGTAWINGN 632
Query: 663 EIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPS-ENI 721
IGRYWP F D I GC YH+PRS+ +N
Sbjct: 633 NIGRYWP--------------------AFLSD--IDGCSAE----YHVPRSFLNSEGDNT 666
Query: 722 LVIFEEKGGDPTKITF 737
LV+FEE GG+P+ + F
Sbjct: 667 LVLFEEIGGNPSLVNF 682
>gi|75116245|sp|Q67VU7.1|BGL10_ORYSJ RecName: Full=Putative beta-galactosidase 10; Short=Lactase 10;
Flags: Precursor
gi|51535501|dbj|BAD37397.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|51535704|dbj|BAD37722.1| putative beta-galactosidase [Oryza sativa Japonica Group]
Length = 809
Score = 613 bits (1582), Expect = e-173, Method: Compositional matrix adjust.
Identities = 333/728 (45%), Positives = 444/728 (60%), Gaps = 61/728 (8%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTY+ RSL+I+G R +IIS +IHYPRS P MWP L+++AKEGG++ IE+YVFWNGHE
Sbjct: 31 VTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
+Y F G +++V+F K IQ A +Y ILRIGP++ E+NYGG+P WL IPG FR P
Sbjct: 91 RQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNAP 150
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY--ESFYGEGGKRYALWAAK 205
F+ M+ F TLIV+ MK +FA QGGPIILAQ+ENEYG + + Y W A
Sbjct: 151 FENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEYIHWCAD 210
Query: 206 MAVAQNIGVPWIMCQQ-FDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MA QN+GVPWIMCQQ D P V+NTCN FYC + P+ +PKIWTENW GWFK +
Sbjct: 211 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 270
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
D HR +EDIAF+VA FFQK GGP+ITTSYDY+AP+DEYG
Sbjct: 271 PDFHRSAEDIAFAVAMFFQK-------------------RGGPYITTSYDYDAPLDEYGN 311
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDDK 383
R PK+GHLK+LH IK E L++GE + + Y DS+ AC F+ N +D
Sbjct: 312 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDKVTVTKYTLDSTSAC--FINNRNDN 369
Query: 384 NDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGS 443
D V ++ LPAWSVSILPDCK V FN+A ++AQ++ ++ + E P++
Sbjct: 370 MDVNVTLDGTTHLLPAWSVSILPDCKTVAFNSAKIKAQTT---VMVNKAKMVEKEPES-- 424
Query: 444 KGLKWQVFKEIAGIW---GEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNG 500
LKW +E + + + K+ ++ I T+ D +DYLWY TSI N E
Sbjct: 425 --LKWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSI--NHKGE----- 475
Query: 501 SRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQ 560
+ L + + GH L+AF N L G H F+ ++P L GKN I+LLS T+GL+
Sbjct: 476 ASYTLFVNTTGHELYAFVNGMLVGQNHSPNGHFVFQLESPAKLHDGKNYISLLSATIGLK 535
Query: 561 NAGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY--NPG--YRNN 614
N GP +E + AGI VK+ N +DLS SW+YK GL GE+ I+ PG + NN
Sbjct: 536 NYGPLFEKMPAGIVGGPVKLIDNNGKGIDLSNSSWSYKAGLAGEYRQIHLDKPGCTWDNN 595
Query: 615 INWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRK 674
V P N+P TWYK + P G++ + +D+L + KG+AW+NG +GRYWP S
Sbjct: 596 NGTV-----PINKPFTWYKTTFQAPAGEDTVVVDLLGLNKGVAWVNGNNLGRYWP--SYT 648
Query: 675 SSPHDECVQECDYRGKFNPD----KCITGCGEPSQRWYHIPRSWFKPSE-NILVIFEEKG 729
++ C CDYRG F + KC+TGCGEPSQR+YH+PRS+ K E N +++FEE G
Sbjct: 649 AAEMGGC-HHCDYRGVFQAEGDGQKCLTGCGEPSQRFYHVPRSFLKNGEPNTVILFEEAG 707
Query: 730 GDPTKITF 737
GDP+ ++F
Sbjct: 708 GDPSHVSF 715
>gi|115477689|ref|NP_001062440.1| Os08g0549200 [Oryza sativa Japonica Group]
gi|75136208|sp|Q6ZJJ0.1|BGL11_ORYSJ RecName: Full=Beta-galactosidase 11; AltName: Full=Lactase 115;
Flags: Precursor
gi|42407808|dbj|BAD08952.1| putative glycosyl hydrolase family 35 (beta-galactosidase) [Oryza
sativa Japonica Group]
gi|113624409|dbj|BAF24354.1| Os08g0549200 [Oryza sativa Japonica Group]
Length = 848
Score = 613 bits (1582), Expect = e-173, Method: Compositional matrix adjust.
Identities = 325/715 (45%), Positives = 430/715 (60%), Gaps = 48/715 (6%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+TYD RSLII+G RE+ S +IHYPRS P WP L+ +AKEGG+N IESYVFWNGHE
Sbjct: 33 ITYDRRSLIIDGHREIFFSGSIHYPRSPPDTWPDLISKAKEGGLNVIESYVFWNGHEPEQ 92
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G Y F GR++L+KF K+IQ+ MY I+RIGPFV AE+N+GG+P WL IP +FR + EP
Sbjct: 93 GVYNFEGRYDLIKFFKLIQEKEMYAIVRIGPFVQAEWNHGGLPYWLREIPDIIFRTNNEP 152
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK +M++F+TLIV+ +K KLFASQGGPIILAQ+ENEY + E + E G +Y WAAKMA
Sbjct: 153 FKKYMKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWAAKMA 212
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIWTENWPGWFKTFGGR 265
+A N GVPWIMC+Q P VI TCN +C P P +WTENW ++ FG
Sbjct: 213 IATNTGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPADKKKPLLWTENWTAQYRVFGDP 272
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
R +EDIAFSVARFF GG++ NYYMYHGGTNFGR G F+ Y EAP+DE+GL
Sbjct: 273 PSQRSAEDIAFSVARFFSVGGTMANYYMYHGGTNFGRN-GAAFVMPRYYDEAPLDEFGLY 331
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVY-ADSSGACAAFLANMDDKN 384
+ PKWGHL++LH A++ C+ ALL G S LG EA V+ C AFL+N + K
Sbjct: 332 KEPKWGHLRDLHHALRHCKKALLWGNPSVQPLGKLYEARVFEMKEKNVCVAFLSNHNTKE 391
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSS--TVEMVPENLQPSEASPDNG 442
D TV FR Y + S+SIL DCK VVF+T +V +Q + T + +Q DN
Sbjct: 392 DGTVTFRGQKYFVARRSISILADCKTVVFSTQHVNSQHNQRTFHFADQTVQ------DN- 444
Query: 443 SKGLKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGS 501
W+++ +E + + ++ N TKD TDYLWYTTS + ++ +
Sbjct: 445 ----VWEMYSEEKIPRYSKTSIRTQRPLEQYNQTKDKTDYLWYTTSFRLETDDLPYRKEV 500
Query: 502 RPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQN 561
+PVL + S GHA+ AF N G G + F + + LK G N +A+LS T+GL +
Sbjct: 501 KPVLEVSSHGHAIVAFVNDAFVGCGHGTKINKAFTMEKAMDLKVGVNHVAILSSTLGLMD 560
Query: 562 AGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTM 621
+G + E AG+ +V I G N+GTLDL+T W + +GL GE +++ + W
Sbjct: 561 SGSYLEHRMAGVYTVTIRGLNTGTLDLTTNGWGHVVGLDGERRRVHSEQGMGAVAW---- 616
Query: 622 EPPK-NQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDE 680
+P K NQPLTWY+ P G +P+ +D+ MGKG ++NGE +GRYW S H
Sbjct: 617 KPGKDNQPLTWYRRRFDPPSGTDPVVIDLTPMGKGFLFVNGEGLGRYW------VSYHH- 669
Query: 681 CVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
G+PSQ YH+PRS +P N L+ FEE+GG P I
Sbjct: 670 ------------------ALGKPSQYLYHVPRSLLRPKGNTLMFFEEEGGKPDAI 706
>gi|222640983|gb|EEE69115.1| hypothetical protein OsJ_28192 [Oryza sativa Japonica Group]
Length = 848
Score = 613 bits (1580), Expect = e-172, Method: Compositional matrix adjust.
Identities = 325/715 (45%), Positives = 429/715 (60%), Gaps = 48/715 (6%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+TYD RSLII+G RE+ S +IHYPRS P WP L+ +AKEGG+N IESYVFWNGHE
Sbjct: 33 ITYDRRSLIIDGHREIFFSGSIHYPRSPPDTWPDLISKAKEGGLNVIESYVFWNGHEPEQ 92
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G Y F GR++L+KF K+IQ+ MY I+RIGPFV AE+N+GG+P WL IP +FR + EP
Sbjct: 93 GVYNFEGRYDLIKFFKLIQEKEMYAIVRIGPFVQAEWNHGGLPYWLREIPDIIFRTNNEP 152
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK +M++F+TLIV+ +K KLFASQGGPIILAQ+ENEY + E + E G +Y WAAKMA
Sbjct: 153 FKKYMKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWAAKMA 212
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIWTENWPGWFKTFGGR 265
+A N GVPWIMC+Q P VI TCN +C P P +WTENW ++ FG
Sbjct: 213 IATNTGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPADKKKPLLWTENWTAQYRVFGDP 272
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
R +EDIAFSVARFF GG++ NYYMYHGGTNFGR G F+ Y EAP DE+GL
Sbjct: 273 PSQRSAEDIAFSVARFFSVGGTMANYYMYHGGTNFGRN-GAAFVMPRYYDEAPFDEFGLY 331
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVY-ADSSGACAAFLANMDDKN 384
+ PKWGHL++LH A++ C+ ALL G S LG EA V+ C AFL+N + K
Sbjct: 332 KEPKWGHLRDLHHALRHCKKALLWGNPSVQPLGKLYEARVFEMKEKNVCVAFLSNHNTKE 391
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSS--TVEMVPENLQPSEASPDNG 442
D TV FR Y + S+SIL DCK VVF+T +V +Q + T + +Q DN
Sbjct: 392 DGTVTFRGQKYFVARRSISILADCKTVVFSTQHVNSQHNQRTFHFADQTVQ------DN- 444
Query: 443 SKGLKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGS 501
W+++ +E + + ++ N TKD TDYLWYTTS + ++ +
Sbjct: 445 ----VWEMYSEEKIPRYSKTSIRTQRPLEQYNQTKDKTDYLWYTTSFRLETDDLPYRKEV 500
Query: 502 RPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQN 561
+PVL + S GHA+ AF N G G + F + + LK G N +A+LS T+GL +
Sbjct: 501 KPVLEVSSHGHAIVAFVNDAFVGCGHGTKINKAFTMEKAMDLKVGVNHVAILSSTLGLMD 560
Query: 562 AGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTM 621
+G + E AG+ +V I G N+GTLDL+T W + +GL GE +++ + W
Sbjct: 561 SGSYLEHRMAGVYTVTIRGLNTGTLDLTTNGWGHVVGLDGERRRVHSEQGMGAVAW---- 616
Query: 622 EPPK-NQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDE 680
+P K NQPLTWY+ P G +P+ +D+ MGKG ++NGE +GRYW S H
Sbjct: 617 KPGKDNQPLTWYRRRFDPPSGTDPVVIDLTPMGKGFLFVNGEGLGRYW------VSYHH- 669
Query: 681 CVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
G+PSQ YH+PRS +P N L+ FEE+GG P I
Sbjct: 670 ------------------ALGKPSQYLYHVPRSLLRPKGNTLMFFEEEGGKPDAI 706
>gi|326520333|dbj|BAK07425.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 841
Score = 613 bits (1580), Expect = e-172, Method: Compositional matrix adjust.
Identities = 324/716 (45%), Positives = 437/716 (61%), Gaps = 50/716 (6%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+TYD RSL+I+GRRE+ S +IHYPRS WP L+ +AKEGG+N IESYVFWN HE
Sbjct: 36 ITYDRRSLMIDGRREIFFSGSIHYPRSPFHEWPDLIARAKEGGLNVIESYVFWNIHEPEM 95
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G Y F GR++++KF K+IQ+ M+ ++RIGPFV AE+N+GG+P WL +P VFR D EP
Sbjct: 96 GVYNFEGRYDMIKFFKLIQEHEMFAMVRIGPFVQAEWNHGGLPYWLREVPDIVFRTDNEP 155
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
+K MQKF+TL+V+ +K KLFASQGGPIILAQ+ENEY + E+ + E G RY WAAKMA
Sbjct: 156 YKKLMQKFVTLVVNKLKDAKLFASQGGPIILAQIENEYQHMEAAFKENGTRYIDWAAKMA 215
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIWTENWPGWFKTFGGR 265
++ + GVPWIMC+Q P VI TCN +C P + P +WTENW ++ FG
Sbjct: 216 ISTSTGVPWIMCKQTKAPAEVIPTCNGRHCGDTWPGPTDKNKPLLWTENWTAQYRVFGDP 275
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
R +EDIAF+VARFF GGS+ NYYMYHGGTNFGRT G F+ Y EAP+DE+G+
Sbjct: 276 PSQRSAEDIAFAVARFFSVGGSMVNYYMYHGGTNFGRT-GASFVMPRYYDEAPLDEFGMY 334
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDDKN 384
+ PKWGHL++LH A++LC+ ALL G S LG EA ++ C AFL+N + K
Sbjct: 335 KEPKWGHLRDLHHALRLCKKALLRGNPSTQPLGKLYEARLFEIPEQKVCVAFLSNHNTKE 394
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSS--TVEMVPENLQPSEASPDNG 442
D TV FR Y +P SVSIL DCK VVF+T +V AQ + T + + LQ +
Sbjct: 395 DGTVTFRGQQYFVPRRSVSILADCKTVVFSTQHVNAQHNQRTFHLTDQTLQNN------- 447
Query: 443 SKGLKWQVFKEIAGI--WGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNG 500
W+++ E + + ++ N TKD TDYLWYTTS + + +
Sbjct: 448 ----VWEMYTEGDKVPTYKFTTDRSEKPLEAYNMTKDKTDYLWYTTSFKLEAEDLPFRQD 503
Query: 501 SRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQ 560
+PVL S GHA+ AF N +L G+A G + F + PI ++AG N +++LS T+GLQ
Sbjct: 504 IKPVLEASSHGHAMVAFVNGKLVGAAHGTKMNKAFSLEKPIEVRAGINHVSILSSTLGLQ 563
Query: 561 NAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY-NPGYRNNINWVS 619
++G + E AG+ SV I G N+GTLDLS+ W + +GL GE + + G + W
Sbjct: 564 DSGAYLEHRQAGVHSVTIQGLNTGTLDLSSNGWGHIVGLDGERKQAHMDKG--GEVQWKP 621
Query: 620 TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHD 679
+ + PLTWY+ P G++P+ +D+ MGKG+ ++NGE +GRYW S H
Sbjct: 622 AV---FDLPLTWYRRRFDMPSGEDPVVIDLNPMGKGILFVNGEGLGRYW-----SSYKH- 672
Query: 680 ECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
G PSQ YH+PR + KP+ N+L IFEE+GG P I
Sbjct: 673 -------------------ALGRPSQYLYHVPRCFLKPTGNVLTIFEEEGGRPDAI 709
>gi|255558624|ref|XP_002520337.1| beta-galactosidase, putative [Ricinus communis]
gi|223540556|gb|EEF42123.1| beta-galactosidase, putative [Ricinus communis]
Length = 771
Score = 612 bits (1577), Expect = e-172, Method: Compositional matrix adjust.
Identities = 334/727 (45%), Positives = 420/727 (57%), Gaps = 89/727 (12%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
AGNVTYD RSLIING ++ S +IHYPRS P
Sbjct: 37 AGNVTYDGRSLIINGEHRILFSGSIHYPRSTP---------------------------- 68
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
+Y F GR +LVKF+ +Q +Y LRIGPF+ E+ YGG+P WLH + G VFR+D
Sbjct: 69 ----EYDFDGRKDLVKFLLEVQAQGLYAALRIGPFIEGEWTYGGLPFWLHDVSGIVFRSD 124
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK HMQ+F+T IV+MMK +L+ASQGGPII++Q+ENEY E+ + E G RY WAA
Sbjct: 125 NEPFKKHMQRFVTKIVNMMKYNQLYASQGGPIIISQIENEYQNVETAFHEKGSRYVHWAA 184
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIWTENWPGWFKTF 262
MAV N GVPW+MC+Q D PDPVINTCN C + P+SP+ P +WTENW +++ F
Sbjct: 185 NMAVRLNTGVPWVMCKQTDAPDPVINTCNGMRCGETFAGPNSPNKPSMWTENWTSFYQVF 244
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
GG R +EDIAF VA F + GS NYYMYHGGTNFGRT G F+TTSY +AP+DEY
Sbjct: 245 GGEPYIRTAEDIAFHVALFIARNGSYVNYYMYHGGTNFGRT-GSAFVTTSYYDQAPLDEY 303
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
GL R PKWGHLK+LH IK C L+ G LG QEA V+ + SG C AFL N D
Sbjct: 304 GLIRQPKWGHLKDLHAKIKSCSKTLIRGTHQTFPLGRLQEAYVFREKSGDCVAFLVNNDG 363
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
+ D TV F+N SY LP S+SILPDCK + FNTA V Q +T + + S +
Sbjct: 364 RRDVTVRFQNRSYELPHKSISILPDCKSITFNTAKVNTQYAT--------RSATLSQEFS 415
Query: 443 SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
S G KW+ +KE + +DH++TTKDT+DYLWYT F + SR
Sbjct: 416 SVG-KWEEYKETVATFDSTSLRAKTLLDHLSTTKDTSDYLWYTF--------RFQNHFSR 466
Query: 503 P--VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQ 560
P L S+GH LHA+ N GSA G+ F +N + LK G N +ALLS+TVGL
Sbjct: 467 PQSTLRAYSRGHVLHAYVNGVYAGSAHGSHESTSFTLENSVRLKNGTNNVALLSVTVGLP 526
Query: 561 NAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVST 620
++G + E AG+ V+I D +TYSW Y++GL GE L IY N ++W
Sbjct: 527 DSGAYLERRVAGLHRVRIQ-----NKDFTTYSWGYQVGLLGEKLQIYTDNGLNKVSWNEF 581
Query: 621 MEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDE 680
QPLTWYK P G +PI L++ MGKG AW+NG+ IGRYW S
Sbjct: 582 R--GTTQPLTWYKTQFDAPAGSDPIALNLHSMGKGEAWVNGQSIGRYWVSFS-------- 631
Query: 681 CVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKIT---F 737
T G PSQ YHIP+S+ KP+ N+LV+ EE+ G P IT
Sbjct: 632 -----------------TSKGNPSQTRYHIPQSFVKPTGNLLVLLEEEKGYPPGITVDSI 674
Query: 738 SIRKISG 744
SI K+ G
Sbjct: 675 SISKVCG 681
>gi|10862896|emb|CAC13966.1| putative beta-galactosidase [Nicotiana tabacum]
Length = 715
Score = 610 bits (1572), Expect = e-171, Method: Compositional matrix adjust.
Identities = 314/714 (43%), Positives = 429/714 (60%), Gaps = 39/714 (5%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD RS+I+NG REL+ S +IHYPR P MWP ++++AKEGG+N I++YVFWN HE
Sbjct: 28 VTYDGRSMIVNGERELLFSGSIHYPRMPPEMWPDIIRKAKEGGLNLIQTYVFWNIHEPVQ 87
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G++ F G +++VKFIK I + +Y+ LRIGP++ AE+N GG P WL +P FR+ EP
Sbjct: 88 GQFNFEGNYDVVKFIKTIGEQGLYVTLRIGPYIEAEWNQGGFPYWLREVPNITFRSYNEP 147
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
F +HM+K+ +++D+MK+EKLFA QGGPII+AQ+ENEY + Y + GK+Y WAA MA
Sbjct: 148 FIHHMKKYSEMVIDLMKKEKLFAPQGGPIIMAQIENEYNNVQLAYRDNGKKYVEWAANMA 207
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFT-PHSPSMPKIWTENWPGWFKTFGGR 265
GVPWIMC+Q D P VINTCN +C D FT P+ P+ P +WTENW ++TFG
Sbjct: 208 TGLYNGVPWIMCKQKDAPAQVINTCNGRHCADTFTGPNGPNKPSLWTENWTAQYRTFGDP 267
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
R +EDIAFSVARFF K G++ NYYMY+GGTN+GRT G F+TT Y EAP+DE+GL
Sbjct: 268 PSQRAAEDIAFSVARFFAKNGTLTNYYMYYGGTNYGRT-GSSFVTTRYYDEAPLDEFGLY 326
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKND 385
R PKW HL++LH A++L ALL G S + E VY CAAFL N
Sbjct: 327 REPKWSHLRDLHRALRLSRRALLWGTPSVQKINQHLEITVYEKPGTDCAAFLTNNHTTLP 386
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
T+ FR Y+LP SVSILPDCK + NT + +Q ++ N PSE +K
Sbjct: 387 ATIKFRGREYYLPEKSVSILPDCKLLSTNTQTIVSQHNS-----RNFLPSEK-----AKN 436
Query: 446 LKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVL 505
LKW++++E + ++ + TKDT+DY WY+TSI + ++ ++ PVL
Sbjct: 437 LKWEMYQEKVPTISDLSLKNREPLELYSLTKDTSDYAWYSTSINFDRHDLPMRPDILPVL 496
Query: 506 LIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPF 565
I S GHAL AF N E G GN F ++ P+ LK G N I++L+ TVG N+G +
Sbjct: 497 QIASMGHALSAFVNGEFVGFGHGNNIEKSFVFQKPVILKPGTNTISILAETVGFPNSGAY 556
Query: 566 YEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPK 625
E AG + + G +GTLD++ +W +++G+ GE ++ + W P K
Sbjct: 557 MEKRFAGPRGITVQGLMAGTLDITQNNWGHEVGVFGEKEQLFTEEGAKKVKWTPVNGPTK 616
Query: 626 NQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQEC 685
+TWYK P G+ P+ L M KM KG+ W+NG +GRYW
Sbjct: 617 GA-VTWYKTYFDAPEGNNPVALKMDKMQKGMMWVNGNSLGRYW----------------- 658
Query: 686 DYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSI 739
++ G+P+Q YHIPR++ KP+ N+LVIFEE GG P I I
Sbjct: 659 --------SSFLSPLGQPTQFEYHIPRAFLKPTNNLLVIFEETGGHPETIEVQI 704
>gi|290782382|gb|ADD62393.1| beta-galactosidase 3 [Prunus persica]
Length = 683
Score = 609 bits (1570), Expect = e-171, Method: Compositional matrix adjust.
Identities = 294/579 (50%), Positives = 382/579 (65%), Gaps = 23/579 (3%)
Query: 169 FASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPV 228
FASQGGPIIL+Q+ENEYG G G Y WAAKMAVA + GVPW+MC++ D PDP+
Sbjct: 2 FASQGGPIILSQIENEYGPESKALGAAGHAYINWAAKMAVALDTGVPWVMCKEDDAPDPM 61
Query: 229 INTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSV 288
IN CN FYCD F+P+ P P +WTE W GWF FGG HRP +D+AFSVARF QKGGS
Sbjct: 62 INACNGFYCDGFSPNKPYKPTMWTEAWSGWFTEFGGTIHHRPVQDLAFSVARFIQKGGSY 121
Query: 289 HNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALL 348
NYYMYHGGTNFGRTAGGPFITTSYDY+ PIDEYGL R PK+GHLKELH AIKLCEHAL+
Sbjct: 122 INYYMYHGGTNFGRTAGGPFITTSYDYDVPIDEYGLIRQPKYGHLKELHKAIKLCEHALV 181
Query: 349 NGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDC 408
+ + + SLG+ Q+A V+ CAAFL+N + + F N+ Y LPAWS+SILPDC
Sbjct: 182 SSDPTVTSLGAYQQAYVFNSGPRRCAAFLSNFHSTGAR-MTFNNMHYDLPAWSISILPDC 240
Query: 409 KKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKE-IAGIWGEADFVKSG 467
+ VVFNTA V Q+S V+M+P N S+ WQ + E ++ + + G
Sbjct: 241 RNVVFNTAKVGVQTSRVQMIPTN-----------SRLFSWQTYDEDVSSLHERSSIAAGG 289
Query: 468 FVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSAS 527
++ IN T+DT+DYLWY T++ ++ +E L+ G +P L ++S GHALH F N + GSA
Sbjct: 290 LLEQINVTRDTSDYLWYMTNVDISSSE--LRGGKKPTLTVQSAGHALHVFVNGQFSGSAF 347
Query: 528 GNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTL 586
G H F + P+ L+AG N+IALLS+ VGL N G YE W + V + G G
Sbjct: 348 GTREHRQFTFAKPVHLRAGINKIALLSIAVGLPNVGLHYESWKTGILGPVFLDGLGQGRK 407
Query: 587 DLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV-STMEPPKNQPLTWYKAVVKQPPGDEPI 645
DL+ W K+GL+GE + + +P ++++W+ ++ Q L WYKA P GDEP+
Sbjct: 408 DLTMQKWFNKVGLKGEAMDLVSPNGGSSVDWIRGSLATQTKQTLKWYKAYFNAPGGDEPL 467
Query: 646 GLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQ 705
LDM MGKG W+NG+ IG+YW + + +C C Y G F P KC GCG+P+Q
Sbjct: 468 ALDMRSMGKGQVWINGQSIGKYW-----MAYANGDC-SLCSYIGTFRPTKCQLGCGQPTQ 521
Query: 706 RWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKISG 744
RWYH+PRSW KP++N++V+FEE GGDP+KIT R ++G
Sbjct: 522 RWYHVPRSWLKPTQNLVVVFEELGGDPSKITLVKRSVAG 560
>gi|449529068|ref|XP_004171523.1| PREDICTED: beta-galactosidase 16-like [Cucumis sativus]
Length = 756
Score = 608 bits (1567), Expect = e-171, Method: Compositional matrix adjust.
Identities = 324/683 (47%), Positives = 419/683 (61%), Gaps = 53/683 (7%)
Query: 58 MWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIG 117
MWP L+ +AKEGG++ I++YVFWN HE G Y F GR ++V+F+K IQ +Y LRIG
Sbjct: 1 MWPSLIAKAKEGGIDVIQTYVFWNLHEPQQGTYEFSGRRDIVRFVKEIQAQGLYACLRIG 60
Query: 118 PFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPII 177
PF+ AE++YGG+P WLH + G V+R+D EPFK HMQ F T IV+MMK E L+ASQGGPII
Sbjct: 61 PFIEAEWSYGGLPFWLHDVLGIVYRSDNEPFKLHMQNFTTKIVNMMKSEGLYASQGGPII 120
Query: 178 LAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYC 237
L+Q+ENEY E+ +GE G Y WAAKMAV+ GVPW MC+Q D PDPVINTCN C
Sbjct: 121 LSQIENEYTLVEAAFGEKGPPYVQWAAKMAVSLQTGVPWSMCKQNDAPDPVINTCNGMRC 180
Query: 238 -DQFT-PHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFF-QKGGSVHNYYMY 294
+ FT P+SP+ P IWTENW +++T+G R +E+IAF VA F K G+ NYYMY
Sbjct: 181 GETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIAAKNGTYVNYYMY 240
Query: 295 HGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSN 354
HGGTNFGR+A IT YD ++P+DEYGL R PKWGHLKELH A+KLC LL G +SN
Sbjct: 241 HGGTNFGRSASAFMITGYYD-QSPLDEYGLTREPKWGHLKELHAAVKLCSTPLLTGTKSN 299
Query: 355 LSLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFN 414
SLG S EA V+ S CAAFL N D V+F+NV+Y LP S+SILPDCK V FN
Sbjct: 300 FSLGQSVEAIVFKTESNECAAFLVNR-GAIDSNVLFQNVTYELPLGSISILPDCKNVAFN 358
Query: 415 TANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINT 474
T V Q +T M+ +Q + L+W+ FKE + + + ++H+ T
Sbjct: 359 TRRVSVQHNTRSMMA--VQKFDL--------LEWEEFKEPIPNIDDTELRANELLEHMGT 408
Query: 475 TKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPP 534
TKD +DYLWYT + + + S+ L ++S+ HALHAF N + GSA G
Sbjct: 409 TKDRSDYLWYTFRVQQDSPD------SQQTLEVDSRAHALHAFVNGDYAGSAHGIYKEKG 462
Query: 535 FKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWT 594
F I+L+ G N I+LLS+ VGL ++G F E AG+ V I G D S W
Sbjct: 463 FSLAKNITLRNGINNISLLSVMVGLPDSGAFLETRVAGLRRVGIQG-----EDFSEQHWG 517
Query: 595 YKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGK 654
YK+GL GE I+ +N+ W +QPLTWYK PPGD+PI L++ MGK
Sbjct: 518 YKVGLSGEQSQIFLDTGSSNVQWSRLGN--SSQPLTWYKTQFDAPPGDDPIALNLGSMGK 575
Query: 655 GLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSW 714
G W+NG IGRYW +T GEPSQ+WY++PRS+
Sbjct: 576 GAVWVNGRGIGRYWV-------------------------SFLTPKGEPSQKWYNVPRSF 610
Query: 715 FKPSENILVIFEEKGGDPTKITF 737
KP++N LVI EE+ G+P +I+
Sbjct: 611 LKPTDNQLVILEEETGNPVEISL 633
>gi|219887949|gb|ACL54349.1| unknown [Zea mays]
gi|414870186|tpg|DAA48743.1| TPA: beta-galactosidase [Zea mays]
Length = 850
Score = 608 bits (1567), Expect = e-171, Method: Compositional matrix adjust.
Identities = 309/712 (43%), Positives = 436/712 (61%), Gaps = 42/712 (5%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD RSL+ +G RE+ +S +IHYPRS P MWP L+ +AKEGG+NTIE+YVFWN HE
Sbjct: 43 VSYDRRSLMFDGHREIFLSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYVFWNIHEPEK 102
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G++ F G+ ++V+F ++IQ+ MY ++R+GPF+ AE+N+GG+P WL IP VFR + EP
Sbjct: 103 GEFNFEGQNDVVRFFQLIQEHDMYAMVRLGPFIQAEWNHGGLPYWLREIPDIVFRTNNEP 162
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
+K HM+ F+ +I+ +K LFASQGGPIILAQ+ENEY + E+ + + G +Y WAAKMA
Sbjct: 163 YKMHMETFVKIIIKRLKDANLFASQGGPIILAQIENEYQHMEAAFKDEGTKYINWAAKMA 222
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIWTENWPGWFKTFGGR 265
++ NIG+PWIMC+Q P VI TCN C P + SMP +WTENW ++ FG
Sbjct: 223 ISTNIGIPWIMCKQTKAPSDVIPTCNGRNCGDTWPGPTNKSMPLLWTENWTAQYRVFGDP 282
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
R +EDIAF+VARFF GG++ NYYMYHGGTNFGRT+ F+ Y EAP+DE+GL
Sbjct: 283 PSQRSAEDIAFAVARFFSVGGTLANYYMYHGGTNFGRTSAA-FVMPKYYDEAPLDEFGLY 341
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVY-ADSSGACAAFLANMDDKN 384
+ PKWGHL++LH A+KLC+ ALL G S LG EA V+ C AFL+N + K+
Sbjct: 342 KEPKWGHLRDLHQALKLCKKALLWGTPSTEKLGKQLEARVFEMPEQKVCVAFLSNHNTKD 401
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
D T+ FR Y +P S+S+L DC+ VVF T +V AQ + Q + D ++
Sbjct: 402 DATMTFRGRPYFVPRHSISVLADCETVVFGTQHVNAQHN---------QRTFHFADQTAQ 452
Query: 445 GLKWQVFK-EIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
W++F E + +A D N TKD TDY+WYT+S + ++ +++ +
Sbjct: 453 NNVWEMFDGENVPKYKQAKIRLRKAGDLYNLTKDKTDYVWYTSSFKLEADDMPIRSDIKT 512
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
VL + S GHA AF N + G G + F + P+ LK G N +A+L+ ++G+ ++G
Sbjct: 513 VLEVNSHGHASVAFVNNKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAVLASSMGMTDSG 572
Query: 564 PFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEP 623
+ E AG+ V+ITG N+GTLDL+ W + +GL GE IY ++ W M
Sbjct: 573 AYMEHRLAGVDRVQITGLNAGTLDLTNNGWGHIVGLVGERKQIYTDKGMGSVTWKPAMN- 631
Query: 624 PKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQ 683
++PLTWYK P G++P+ LDM MGKG+ ++NG+ IGRYW
Sbjct: 632 --DRPLTWYKRHFDMPSGEDPVVLDMSTMGKGMMFVNGQGIGRYW--------------- 674
Query: 684 ECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
Y+ G PSQ+ YH+PRS+ + +N+LV+FEE+ G P I
Sbjct: 675 -ISYKHAL---------GRPSQQLYHVPRSFLRQKDNMLVLFEEEFGRPDAI 716
>gi|449517114|ref|XP_004165591.1| PREDICTED: beta-galactosidase 9-like, partial [Cucumis sativus]
Length = 763
Score = 607 bits (1565), Expect = e-171, Method: Compositional matrix adjust.
Identities = 307/637 (48%), Positives = 393/637 (61%), Gaps = 33/637 (5%)
Query: 128 GIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGY 187
G P+WL +PG FR D PFK MQ+F+ IVD+++ EKLF QGGP+I+ QVENEYG
Sbjct: 6 GFPLWLRDVPGIEFRTDNAPFKEEMQRFVKKIVDLLRDEKLFCWQGGPVIMLQVENEYGN 65
Query: 188 YESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSM 247
ES YG+ G+ Y W MA+ VPW+MCQQ D P +IN+CN +YCD F +SPS
Sbjct: 66 IESSYGKRGQEYIKWVGNMALGLGAEVPWVMCQQKDAPSTIINSCNGYYCDGFKANSPSK 125
Query: 248 PKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGP 307
P WTENW GWF ++G R PHRP ED+AFSVARFFQ+ GS NYYMY GGTNFGRTAGGP
Sbjct: 126 PIFWTENWNGWFTSWGERSPHRPVEDLAFSVARFFQREGSFQNYYMYFGGTNFGRTAGGP 185
Query: 308 FITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSN-LSLGSSQEADVY 366
F TSYDY++PIDEYGL R PKWGHLK+LH A+KLCE AL++ + + LG QEA VY
Sbjct: 186 FYITSYDYDSPIDEYGLIREPKWGHLKDLHTALKLCEPALVSADSPQYIKLGPKQEAHVY 245
Query: 367 ADSSGA-------------CAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVF 413
S C+AFLAN+D++ V F +Y+LP WSVSILPDC+ VVF
Sbjct: 246 HMKSQTDDLTLSKLGTLRNCSAFLANIDERKAVAVKFNGQTYNLPPWSVSILPDCQNVVF 305
Query: 414 NTANVRAQSS--TVEM---VPENLQPSEASPDNGSKGL---KWQVFKEIAGIWGEADFVK 465
NTA V AQ+S +E+ + N+ + D + W KE GIW + +F
Sbjct: 306 NTAKVAAQTSIKILELYAPLSANVSLKLHATDQNELSIIANSWMTVKEPIGIWSDQNFTV 365
Query: 466 SGFVDHINTTKDTTDYLWYTTSI-IVNENEEFLKNGS-RPVLLIESKGHALHAFANQELQ 523
G ++H+N TKD +DYLWY T I + N++ F K + P + I+S F N +L
Sbjct: 366 KGILEHLNVTKDRSDYLWYMTRIHVSNDDIRFWKERNITPTITIDSVRDVFRVFVNGKLT 425
Query: 524 GSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGIT-SVKITGFN 582
GSA G K+ P+ G N++ LLS +GLQN+G F E GAGI +K+TGF
Sbjct: 426 GSAIGQWV----KFVQPVQFLEGYNDLLLLSQAMGLQNSGAFIEKDGAGIRGRIKLTGFK 481
Query: 583 SGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGD 642
+G +DLS WTY++GL+GE L Y+ +W TWYKA P G
Sbjct: 482 NGDIDLSKSLWTYQVGLKGEFLNFYSLEENEKADWTELSVDAIPSTFTWYKAYFSSPDGT 541
Query: 643 EPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGE 702
+P+ +++ MGKG AW+NG IGRYW SP D C ++CDYRG +N KC T CG
Sbjct: 542 DPVAINLGSMGKGQAWVNGHHIGRYW----SVVSPKDGCPRKCDYRGAYNSGKCATNCGR 597
Query: 703 PSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSI 739
P+Q WYHIPRSW K S N+LV+FEE GG+P +I +
Sbjct: 598 PTQSWYHIPRSWLKESSNLLVLFEETGGNPLEIVVKL 634
>gi|75141878|sp|Q7XFK2.1|BGL14_ORYSJ RecName: Full=Beta-galactosidase 14; Short=Lactase 14; Flags:
Precursor
gi|15451595|gb|AAK98719.1|AC090483_9 Putative beta-galactosidase [Oryza sativa Japonica Group]
gi|31431327|gb|AAP53122.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 808
Score = 606 bits (1563), Expect = e-170, Method: Compositional matrix adjust.
Identities = 332/745 (44%), Positives = 443/745 (59%), Gaps = 88/745 (11%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD RSLI++G R ++IS +IHYPRS P MWP L+++AKEGG+N IE+YVFWNGHE
Sbjct: 31 VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
++ F G +++V+F K IQ A MY ILRIGP++ E+NYGG+PVWL IPG FR +P
Sbjct: 91 REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPVWLRDIPGIKFRLHNKP 150
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGY--YESFYGEGGKRYALWAAK 205
F+ M+ F TLIV MK +FA QGGPIILAQ+ENEYGY + + Y W A
Sbjct: 151 FENGMEAFTTLIVKKMKDANMFAGQGGPIILAQIENEYGYTMLQPENIQSAHEYIHWCAD 210
Query: 206 MAVAQNIGVPWIMCQQ-FDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MA QN+GVPWIMCQQ D P V+NTCN FYC ++ + S+PK+WTENW GW++ +
Sbjct: 211 MANKQNVGVPWIMCQQDNDVPPNVVNTCNGFYCHEWFSNRTSIPKMWTENWTGWYRDWDQ 270
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
+ RP+EDIAF+VA FFQ GS+ NYYMYHGGTNFGRTAGGP+ITTSYDY+AP+DEYG
Sbjct: 271 PEFRRPTEDIAFAVAMFFQMRGSLQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYGN 330
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDDK 383
R PK+GHLKELH + E LL+G+ + + G + Y +++ AC F+ N D
Sbjct: 331 LRQPKYGHLKELHSVLMSMEKILLHGDYIDTNYGDNVTVTKYTLNATSAC--FINNRFDD 388
Query: 384 NDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQ-------SSTVEM--------- 427
D V ++ LPAWSVSILP+CK V FN+A ++ Q +S VE
Sbjct: 389 RDVNVTLDGTTHFLPAWSVSILPNCKTVAFNSAKIKTQTTVMVNKTSMVEQQTEHFKWSW 448
Query: 428 VPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTS 487
+PENL+P + +F K+ ++ I TT D +DYLWY TS
Sbjct: 449 MPENLRPFMTDE--------------------KGNFRKNELLEQIVTTTDQSDYLWYRTS 488
Query: 488 IIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGK 547
+ E GS VL + + GH L+AF N +L G + F+ K+P
Sbjct: 489 L------EHKGEGSY-VLYVNTTGHELYAFVNGKLVGQQYSPNENFTFQLKSP------- 534
Query: 548 NEIALLSMTVGLQNAGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQGEHLG 605
N G +E + AGI VK+ + +DLS SW+YK GL GE+
Sbjct: 535 -------------NYGGSFELLPAGIVGGPVKLIDSSGSAIDLSNNSWSYKAGLAGEYRK 581
Query: 606 IY--NPGYRNNINWVS-TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGE 662
IY PG + W S P N+P TWYK + P G++ + +D+ + KG+AW+NG
Sbjct: 582 IYLDKPGNK----WRSHNSTIPINRPFTWYKTTFQAPAGEDSVVVDLHGLNKGVAWVNGN 637
Query: 663 EIGRYWPRKSRKSSPHDECVQECDYRGKFNPD----KCITGCGEPSQRWYHIPRSWFKPS 718
+GRYWP P CDYRG F + KC+TGCGEPSQ+ YH+PRS+
Sbjct: 638 SLGRYWPSYVAADMPG---CHHCDYRGVFKAEVEAQKCLTGCGEPSQQLYHVPRSFLNKG 694
Query: 719 E-NILVIFEEKGGDPTKITFSIRKI 742
E N L++FEE GGDP+++ ++R +
Sbjct: 695 EPNTLILFEEAGGDPSEV--AVRTV 717
>gi|125597922|gb|EAZ37702.1| hypothetical protein OsJ_22044 [Oryza sativa Japonica Group]
Length = 811
Score = 606 bits (1563), Expect = e-170, Method: Compositional matrix adjust.
Identities = 330/728 (45%), Positives = 440/728 (60%), Gaps = 59/728 (8%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTY+ RSL+I+G R +IIS +IHYPRS P MWP L+++AKEGG++ IE+YVFWNGHE
Sbjct: 31 VTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
+Y F G +++V+F K IQ A +Y ILRIGP++ E+NYGG+P WL IPG FR P
Sbjct: 91 RQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNAP 150
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY--ESFYGEGGKRYALWAAK 205
F+ M+ F TLIV+ MK +FA QGGPIILAQ+ENEYG + + Y W A
Sbjct: 151 FENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEYIHWCAD 210
Query: 206 MAVAQNIGVPWIMCQQ-FDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
MA QN+GVPWIMCQQ D P V+NTCN FYC + P+ +PKIWTENW GWFK +
Sbjct: 211 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 270
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
D HR +EDIAF+VA FFQK GGP+ITTSYDY+AP+DEYG
Sbjct: 271 PDFHRSAEDIAFAVAMFFQK-------------------RGGPYITTSYDYDAPLDEYGN 311
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDDK 383
R PK+GHLK+LH IK E L++GE + + Y DS+ AC F+ N +D
Sbjct: 312 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDKVTVTKYTLDSTSAC--FINNRNDN 369
Query: 384 NDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGS 443
D V ++ LPAWSVSILPDCK V FN+A ++AQ++ ++ + E P++
Sbjct: 370 MDVNVTLDGTTHLLPAWSVSILPDCKTVAFNSAKIKAQTT---VMVNKAKMVEKEPES-- 424
Query: 444 KGLKWQVFKEIAGIW---GEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNG 500
LKW +E + + + K+ ++ I T+ D +DYLWY TSI N E
Sbjct: 425 --LKWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSI--NHKGE----- 475
Query: 501 SRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQ 560
+ L + + GH L+AF N L G H F+ ++P L GKN I+LLS T+GL+
Sbjct: 476 ASYTLFVNTTGHELYAFVNGMLVGQNHSPNGHFVFQLESPAKLHDGKNYISLLSATIGLK 535
Query: 561 NAGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY--NPG--YRNN 614
N GP +E + AGI VK+ N +DLS SW+YK GL GE+ I+ PG + NN
Sbjct: 536 NYGPLFEKMPAGIVGGPVKLIDNNGKGIDLSNSSWSYKAGLAGEYRQIHLDKPGCTWDNN 595
Query: 615 INWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRK 674
V P N+P TWYK + P G++ + +D+L + KG+AW+NG +GRYWP +
Sbjct: 596 NGTV-----PINKPFTWYKTTFQAPAGEDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAA 650
Query: 675 SSPHDECVQECDYRGKFNPD----KCITGCGEPSQRWYHIPRSWFKPSE-NILVIFEEKG 729
S YRG F + KC+TGCGEPSQR+YH+PRS+ K E N +++FEE G
Sbjct: 651 RSMR-RLPTTAHYRGVFQAEGDGQKCLTGCGEPSQRFYHVPRSFLKNGEPNTVILFEEAG 709
Query: 730 GDPTKITF 737
GDP+ ++F
Sbjct: 710 GDPSHVSF 717
>gi|357437611|ref|XP_003589081.1| Beta-galactosidase [Medicago truncatula]
gi|355478129|gb|AES59332.1| Beta-galactosidase [Medicago truncatula]
Length = 589
Score = 603 bits (1554), Expect = e-169, Method: Compositional matrix adjust.
Identities = 303/604 (50%), Positives = 376/604 (62%), Gaps = 20/604 (3%)
Query: 140 VFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRY 199
FR D EPFK MQKF T IV MMK E LF +QGGPII++Q+ENEYG E G GK Y
Sbjct: 2 AFRTDNEPFKAAMQKFTTKIVTMMKAESLFQTQGGPIIMSQIENEYGPVEWEIGAPGKAY 61
Query: 200 ALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWF 259
WAA+MAV + GVPW MC+Q D PDPVI+TCN +YC+ FTP+ PK+WTENW GW+
Sbjct: 62 TKWAAQMAVGLDTGVPWDMCKQEDAPDPVIDTCNGYYCENFTPNENFKPKMWTENWSGWY 121
Query: 260 KTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPI 319
FGG HRP+ED+A+SVA F Q GS NYYMYHGGTNFGRT+ G FI TSYDY+API
Sbjct: 122 TDFGGAISHRPTEDLAYSVATFIQNRGSFVNYYMYHGGTNFGRTSSGLFIATSYDYDAPI 181
Query: 320 DEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQ-EADVYADSSGACAAFLA 378
DEYGLP PKW HLK LH AIK CE AL++ + + LG+ EA VY ++ CAAFLA
Sbjct: 182 DEYGLPNEPKWSHLKNLHKAIKQCEPALISVDPTVTWLGNKNLEAHVYYVNTSICAAFLA 241
Query: 379 NMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEAS 438
N D K+ TV F N Y LP WSVSILPDCK VVFNTA V S M P
Sbjct: 242 NYDTKSAATVTFGNGQYDLPPWSVSILPDCKTVVFNTATVNGHSFHKRMTPVETT----- 296
Query: 439 PDNGSKGLKWQVFKEIAGIWGEAD-FVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFL 497
WQ + E + D + + + IN T+D++DYLWY T + ++ +E F+
Sbjct: 297 -------FDWQSYSEEPAYSSDDDSIIANALWEQINVTRDSSDYLWYLTDVNISPSESFI 349
Query: 498 KNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTV 557
KNG P L I S GH LH F N +L G+ G +P + ++LK G N+I+LLS+ V
Sbjct: 350 KNGQFPTLTINSAGHVLHVFVNGQLSGTVYGGLDNPKVTFSESVNLKVGNNKISLLSVAV 409
Query: 558 GLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNIN 616
GL N G +E G+ V++ G + GT DLS W+YK+GL+GE L ++ ++I+
Sbjct: 410 GLPNVGLHFETWNVGVLGPVRLKGLDEGTRDLSWQKWSYKVGLKGESLSLHTITGSSSID 469
Query: 617 WVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSS 676
W K QPLTWYK P G++P+ LDM MGKG W+N + IGR+WP
Sbjct: 470 WTQGSSLAKKQPLTWYKTTFDAPSGNDPVALDMSSMGKGEIWINDQSIGRHWPAYIA--- 526
Query: 677 PHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKIT 736
H C EC+Y G F KC T CGEP+Q+WYHIPRSW S N+LV+ EE GGDPT I+
Sbjct: 527 -HGNC-DECNYAGTFTNPKCRTNCGEPTQKWYHIPRSWLSSSGNVLVVLEEWGGDPTGIS 584
Query: 737 FSIR 740
R
Sbjct: 585 LVKR 588
>gi|222642000|gb|EEE70132.1| hypothetical protein OsJ_30164 [Oryza sativa Japonica Group]
Length = 838
Score = 602 bits (1552), Expect = e-169, Method: Compositional matrix adjust.
Identities = 316/711 (44%), Positives = 429/711 (60%), Gaps = 42/711 (5%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD RSL+I+G+R+L S AIHYPRS P MW LV+ AK GG+NTIE+YVFWNGHE P
Sbjct: 36 VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWNGHEPEP 95
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GKYYF GRF+L++F+ +I+ MY I+RIGPF+ AE+N+GG+P WL I +FR + EP
Sbjct: 96 GKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHIIFRANNEP 155
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK M+KF+ IV +K ++FA QGGPIIL+Q+ENEYG + G +Y WAA+MA
Sbjct: 156 FKREMEKFVRFIVQKLKDAEMFAPQGGPIILSQIENEYGNIKKDRKVEGDKYLEWAAEMA 215
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
++ IGVPW+MC+Q P VI TCN +C D +T + P++WTENW F+TFG +
Sbjct: 216 ISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDKNKPRLWTENWTAQFRTFGDQL 275
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
R +EDIA++V RFF KGG++ NYYMYHGGTNFGRT G ++ T Y EAP+DEYG+ +
Sbjct: 276 AQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMCK 334
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDDKND 385
PK+GHL++LH IK A L G++S LG EA Y C +FL+N + D
Sbjct: 335 EPKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSNNNTGED 394
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
TVVFR +++P+ SVSIL DCK VV+NT V Q S + S + D SK
Sbjct: 395 GTVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHS---------ERSFHTTDETSKN 445
Query: 446 LKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVL 505
W+++ E + + ++ N TKDT+DYLWYTTS + ++ + RPV+
Sbjct: 446 NVWEMYSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDIRPVI 505
Query: 506 LIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPF 565
I+S HA+ FAN G+ G+ F ++ P+ L+ G N IA+LS ++G++++G
Sbjct: 506 QIKSTAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKDSGGE 565
Query: 566 YEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPK 625
V GI + G N+GTLDL W +K L+GE IY W +P +
Sbjct: 566 LVEVKGGIQDCVVQGLNTGTLDLQGNGWGHKARLEGEDKEIYTEKGMAQFQW----KPAE 621
Query: 626 NQ-PLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
N P+TWYK +P GD+PI +DM M KG+ ++NGE IGRYW
Sbjct: 622 NDLPITWYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWT--------------- 666
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
IT G PSQ YHIPR++ KP N+L+IFEE+ G P I
Sbjct: 667 ----------SFITLAGHPSQSVYHIPRAFLKPKGNLLIIFEEELGKPGGI 707
>gi|152013365|sp|Q0IZZ8.2|BGL12_ORYSJ RecName: Full=Beta-galactosidase 12; Short=Lactase 12; Flags:
Precursor
Length = 911
Score = 602 bits (1551), Expect = e-169, Method: Compositional matrix adjust.
Identities = 317/718 (44%), Positives = 433/718 (60%), Gaps = 43/718 (5%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD RSL+I+G+R+L S AIHYPRS P MW LV+ AK GG+NTIE+YVFWNGHE P
Sbjct: 36 VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWNGHEPEP 95
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GKYYF GRF+L++F+ +I+ MY I+RIGPF+ AE+N+GG+P WL I +FR + EP
Sbjct: 96 GKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHIIFRANNEP 155
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK M+KF+ IV +K ++FA QGGPIIL+Q+ENEYG + G +Y WAA+MA
Sbjct: 156 FKREMEKFVRFIVQKLKDAEMFAPQGGPIILSQIENEYGNIKKDRKVEGDKYLEWAAEMA 215
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
++ IGVPW+MC+Q P VI TCN +C D +T + P++WTENW F+TFG +
Sbjct: 216 ISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDKNKPRLWTENWTAQFRTFGDQL 275
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
R +EDIA++V RFF KGG++ NYYMYHGGTNFGRT G ++ T Y EAP+DEYG+ +
Sbjct: 276 AQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMCK 334
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDDKND 385
PK+GHL++LH IK A L G++S LG EA Y C +FL+N + D
Sbjct: 335 EPKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSNNNTGED 394
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
TVVFR +++P+ SVSIL DCK VV+NT V Q S + S + D SK
Sbjct: 395 GTVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHS---------ERSFHTTDETSKN 445
Query: 446 LKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVL 505
W+++ E + + ++ N TKDT+DYLWYTTS + ++ + RPV+
Sbjct: 446 NVWEMYSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDIRPVI 505
Query: 506 LIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPF 565
I+S HA+ FAN G+ G+ F ++ P+ L+ G N IA+LS ++G++++G
Sbjct: 506 QIKSTAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKDSGGE 565
Query: 566 YEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPK 625
V GI + G N+GTLDL W +K L+GE IY W +P +
Sbjct: 566 LVEVKGGIQDCVVQGLNTGTLDLQGNGWGHKARLEGEDKEIYTEKGMAQFQW----KPAE 621
Query: 626 NQ-PLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
N P+TWYK +P GD+PI +DM M KG+ ++NGE IGRYW
Sbjct: 622 NDLPITWYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWT--------------- 666
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITF-SIRK 741
IT G PSQ YHIPR++ KP N+L+IFEE+ G P I ++R+
Sbjct: 667 ----------SFITLAGHPSQSVYHIPRAFLKPKGNLLIIFEEELGKPGGILIQTVRR 714
>gi|414888321|tpg|DAA64335.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
Length = 837
Score = 600 bits (1548), Expect = e-169, Method: Compositional matrix adjust.
Identities = 309/711 (43%), Positives = 433/711 (60%), Gaps = 42/711 (5%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD RSL+I+G+R+L S AIHYPRS P +WP L+++AKEGG+NTIE+Y+FWN HE P
Sbjct: 36 VTYDGRSLMIDGKRDLFFSGAIHYPRSPPEVWPKLIERAKEGGLNTIETYIFWNAHEPEP 95
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GKY F GRF+L+K++K+IQ+ MY I+RIGPF+ AE+N+GG+P WL I +FR + +P
Sbjct: 96 GKYNFEGRFDLIKYLKMIQEHDMYAIVRIGPFIQAEWNHGGLPYWLREIDHIIFRANNDP 155
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
+K M+KF+ IV +K +LFASQGGPIIL Q+ENEYG + + G +Y WAA+MA
Sbjct: 156 YKKEMEKFVRFIVQKLKDAELFASQGGPIILTQIENEYGNIKKDHATDGDKYLEWAAQMA 215
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
++ GVPWIMC+Q P VI TCN +C D +T + P +WTENW F+ +G +
Sbjct: 216 LSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWTLRDKNKPMLWTENWTQQFRAYGDQV 275
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
R +EDIA++V RFF KGGS+ NYYMYHGGTNFGRT G ++ T Y EAP+DEYG+ +
Sbjct: 276 AMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMYK 334
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDDKND 385
PK+GHL++LH I+ + A L G+ S+ LG EA ++ C +FL+N + D
Sbjct: 335 EPKFGHLRDLHNVIRSYQKAFLLGKHSSEILGHGYEAHIFELPEENLCLSFLSNNNTGED 394
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
TV+FR +++P+ SVSIL CK VV+NT V Q + + S + + SK
Sbjct: 395 GTVIFRGEKHYVPSRSVSILAGCKNVVYNTKRVFVQHN---------ERSYHTSEVTSKN 445
Query: 446 LKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVL 505
+W+++ E + + ++ N TKD +DYLWYTTS + ++ +N RPVL
Sbjct: 446 NQWEMYSEKIPKYRDTKVRMKEPLEQFNQTKDASDYLWYTTSFRLESDDLPFRNDIRPVL 505
Query: 506 LIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPF 565
++S H++ FAN G A G+ F ++ P+ LK G N + LLS T+G++++G
Sbjct: 506 QVKSSAHSMMGFANDAFVGCARGSKQVKGFMFEKPVDLKVGVNHVVLLSSTMGMKDSGGE 565
Query: 566 YEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPK 625
V +GI I G N+GTLDL W +K L+GE IY+ + W +P +
Sbjct: 566 LAEVKSGIQECLIQGLNTGTLDLQVNGWGHKAALEGEDKEIYSEKGVGKVQW----KPAE 621
Query: 626 N-QPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
N + TWYK +P GD+P+ LDM M KG+ ++NGE +GRYW
Sbjct: 622 NGRAATWYKRYFDEPDGDDPVVLDMSSMDKGMIFVNGEGVGRYW---------------- 665
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
YR T G PSQ YHIPR + K +N+LV+FEE+ G P I
Sbjct: 666 VSYR---------TLAGTPSQALYHIPRPFLKSKDNLLVVFEEEMGKPDGI 707
>gi|414888322|tpg|DAA64336.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
Length = 822
Score = 600 bits (1548), Expect = e-169, Method: Compositional matrix adjust.
Identities = 309/711 (43%), Positives = 433/711 (60%), Gaps = 42/711 (5%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD RSL+I+G+R+L S AIHYPRS P +WP L+++AKEGG+NTIE+Y+FWN HE P
Sbjct: 36 VTYDGRSLMIDGKRDLFFSGAIHYPRSPPEVWPKLIERAKEGGLNTIETYIFWNAHEPEP 95
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GKY F GRF+L+K++K+IQ+ MY I+RIGPF+ AE+N+GG+P WL I +FR + +P
Sbjct: 96 GKYNFEGRFDLIKYLKMIQEHDMYAIVRIGPFIQAEWNHGGLPYWLREIDHIIFRANNDP 155
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
+K M+KF+ IV +K +LFASQGGPIIL Q+ENEYG + + G +Y WAA+MA
Sbjct: 156 YKKEMEKFVRFIVQKLKDAELFASQGGPIILTQIENEYGNIKKDHATDGDKYLEWAAQMA 215
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
++ GVPWIMC+Q P VI TCN +C D +T + P +WTENW F+ +G +
Sbjct: 216 LSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWTLRDKNKPMLWTENWTQQFRAYGDQV 275
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
R +EDIA++V RFF KGGS+ NYYMYHGGTNFGRT G ++ T Y EAP+DEYG+ +
Sbjct: 276 AMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMYK 334
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDDKND 385
PK+GHL++LH I+ + A L G+ S+ LG EA ++ C +FL+N + D
Sbjct: 335 EPKFGHLRDLHNVIRSYQKAFLLGKHSSEILGHGYEAHIFELPEENLCLSFLSNNNTGED 394
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
TV+FR +++P+ SVSIL CK VV+NT V Q + + S + + SK
Sbjct: 395 GTVIFRGEKHYVPSRSVSILAGCKNVVYNTKRVFVQHN---------ERSYHTSEVTSKN 445
Query: 446 LKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVL 505
+W+++ E + + ++ N TKD +DYLWYTTS + ++ +N RPVL
Sbjct: 446 NQWEMYSEKIPKYRDTKVRMKEPLEQFNQTKDASDYLWYTTSFRLESDDLPFRNDIRPVL 505
Query: 506 LIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPF 565
++S H++ FAN G A G+ F ++ P+ LK G N + LLS T+G++++G
Sbjct: 506 QVKSSAHSMMGFANDAFVGCARGSKQVKGFMFEKPVDLKVGVNHVVLLSSTMGMKDSGGE 565
Query: 566 YEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPK 625
V +GI I G N+GTLDL W +K L+GE IY+ + W +P +
Sbjct: 566 LAEVKSGIQECLIQGLNTGTLDLQVNGWGHKAALEGEDKEIYSEKGVGKVQW----KPAE 621
Query: 626 N-QPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
N + TWYK +P GD+P+ LDM M KG+ ++NGE +GRYW
Sbjct: 622 NGRAATWYKRYFDEPDGDDPVVLDMSSMDKGMIFVNGEGVGRYW---------------- 665
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
YR T G PSQ YHIPR + K +N+LV+FEE+ G P I
Sbjct: 666 VSYR---------TLAGTPSQALYHIPRPFLKSKDNLLVVFEEEMGKPDGI 707
>gi|449451942|ref|XP_004143719.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 613
Score = 600 bits (1547), Expect = e-168, Method: Compositional matrix adjust.
Identities = 304/617 (49%), Positives = 395/617 (64%), Gaps = 17/617 (2%)
Query: 58 MWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIG 117
MWP L+Q+AK+GG++ IE+Y+FW+ HE KY F GR + +KF ++IQ A +Y+++RIG
Sbjct: 1 MWPDLIQKAKDGGLDAIETYIFWDRHEPQRRKYDFSGRLDFIKFFQLIQDAGLYVVMRIG 60
Query: 118 PFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPII 177
P+V AE+NYGG PVWLH +PG R + + +K MQ F T IV+M K+ LFASQGGPII
Sbjct: 61 PYVCAEWNYGGFPVWLHNMPGIQLRTNNQVYKNEMQTFTTKIVNMCKQANLFASQGGPII 120
Query: 178 LAQVENEYGYYES-FYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFY 236
LAQ+ENEYG + YG+ GK Y W A+MA + NIGVPWIMCQQ D P P+INTCN FY
Sbjct: 121 LAQIENEYGNVMTPAYGDAGKAYINWCAQMAESLNIGVPWIMCQQSDAPQPMINTCNGFY 180
Query: 237 CDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHG 296
CD FTP++P PK++TENW GWFK +G +DP+R +ED+AFSVARFFQ GG +NYYMYHG
Sbjct: 181 CDNFTPNNPKSPKMFTENWVGWFKKWGDKDPYRTAEDVAFSVARFFQSGGVFNNYYMYHG 240
Query: 297 GTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLS 356
GTNFGRT+GGPFITTSYDY AP+DEYG PKWGHLK+LH +IKL E L N RSN +
Sbjct: 241 GTNFGRTSGGPFITTSYDYNAPLDEYGNLNQPKWGHLKQLHASIKLGEKILTNSTRSNQN 300
Query: 357 LGSSQEADVYAD-SSGACAAFLANMDDKNDKTVVFR-NVSYHLPAWSVSILPDCKKVVFN 414
GSS +++ ++G FL+N D KND T+ + + Y +PAWSVSIL C K V+N
Sbjct: 301 FGSSVTLTKFSNPTTGERFCFLSNTDGKNDATIDLQEDGKYFVPAWSVSILDGCNKEVYN 360
Query: 415 TANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINT 474
TA V +Q+S V E + +N W + G F + ++
Sbjct: 361 TAKVNSQTSM--FVKE-----QNEKENAQLSWAWAPEPMKDTLQGNGKFAANLLLEQKRV 413
Query: 475 TKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPP 534
T D +DY WY T + N L + +KGH LHAF N+ GS G+
Sbjct: 414 TVDFSDYFWYMTKVDTNGTSSL----QNVTLQVNTKGHVLHAFVNKRYIGSKWGSNGQ-S 468
Query: 535 FKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS--VKITGFNSGTLDLSTYS 592
F ++ PI LK+G N I LLS TVGL+N FY+ V GI + + G + T DLS+
Sbjct: 469 FVFEKPILLKSGINTITLLSATVGLKNYDAFYDMVPTGIDGGPIYLIGDGNVTTDLSSNL 528
Query: 593 WTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKM 652
W+YK+GL GE IYNP + NW+ + + +TWYK K P G +P+ LDM M
Sbjct: 529 WSYKVGLNGEMKQIYNPVFSQRTNWIPLNQKSIGRRMTWYKTSFKTPAGIDPVVLDMQGM 588
Query: 653 GKGLAWLNGEEIGRYWP 669
GKG AW+NG+ IGR+WP
Sbjct: 589 GKGQAWVNGQSIGRFWP 605
>gi|238009208|gb|ACR35639.1| unknown [Zea mays]
Length = 677
Score = 600 bits (1546), Expect = e-168, Method: Compositional matrix adjust.
Identities = 302/569 (53%), Positives = 385/569 (67%), Gaps = 15/569 (2%)
Query: 179 AQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCD 238
A++ENEYG +S YG GK Y WAA MAV+ + GVPW+MCQQ D PDP+INTCN FYCD
Sbjct: 6 AKIENEYGNIDSAYGAPGKAYMRWAAGMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCD 65
Query: 239 QFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGT 298
QFTP+S + PK+WTENW GWF +FGG P+RP ED+AF+VARF+Q+GG+ NYYMYHGGT
Sbjct: 66 QFTPNSAAKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGT 125
Query: 299 NFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLG 358
N R++GGPFI TSYDY+APIDEYGL R PKWGHL+++H AIKLCE AL+ + S SLG
Sbjct: 126 NLDRSSGGPFIATSYDYDAPIDEYGLVRQPKWGHLRDVHKAIKLCEPALIATDPSYTSLG 185
Query: 359 SSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANV 418
+ EA VY S CAAFLAN+D ++DKTV F Y LPAWSVSILPDCK VV NTA +
Sbjct: 186 PNVEAAVYKVGS-VCAAFLANIDGQSDKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQI 244
Query: 419 RAQSSTVEMVPENLQPSEASPDNG-----SKGLKWQVFKEIAGIWGEADFVKSGFVDHIN 473
+Q++ EM L+ S + D W E GI + K+G ++ IN
Sbjct: 245 NSQTTGSEM--RYLESSNVASDGSFVTPELAVSDWSYAIEPVGITKDNALTKAGLMEQIN 302
Query: 474 TTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHP 533
TT D +D+LWY+TSI V +E +L NGS+ L + S GH L + N ++ GSA G+ +
Sbjct: 303 TTADASDFLWYSTSITVKGDEPYL-NGSQSNLAVNSLGHVLQVYINGKIAGSAQGSASSS 361
Query: 534 PFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYS 592
++ PI L GKN+I LLS TVGL N G F++ VGAGIT VK++G N G LDLS+
Sbjct: 362 LISWQKPIELVPGKNKIDLLSATVGLSNYGAFFDLVGAGITGPVKLSGLN-GALDLSSAE 420
Query: 593 WTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKM 652
WTY+IGL+GE L +Y+P + WVS P N PL WYK P GD+P+ +D M
Sbjct: 421 WTYQIGLRGEDLHLYDPS-EASPEWVSANAYPINHPLIWYKTKFTPPAGDDPVAIDFTGM 479
Query: 653 GKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPR 712
GKG AW+NG+ IGRYWP +P CV C+YRG ++ KC+ CG+PSQ YH+PR
Sbjct: 480 GKGEAWVNGQSIGRYWP---TNLAPQSGCVNSCNYRGAYSSSKCLKKCGQPSQTLYHVPR 536
Query: 713 SWFKPSENILVIFEEKGGDPTKITFSIRK 741
S+ +P N LV+FE GGDP+KI+F +R+
Sbjct: 537 SFLQPGSNDLVLFEHFGGDPSKISFVMRQ 565
>gi|357467507|ref|XP_003604038.1| Beta-galactosidase [Medicago truncatula]
gi|355493086|gb|AES74289.1| Beta-galactosidase [Medicago truncatula]
Length = 847
Score = 599 bits (1545), Expect = e-168, Method: Compositional matrix adjust.
Identities = 317/758 (41%), Positives = 441/758 (58%), Gaps = 50/758 (6%)
Query: 1 MKPRTPIAPFALLIFFSSSITYC-----FAG--NVTYDSRSLIINGRRELIISAAIHYPR 53
M P +A ++L+ +I AG NVTYD +SL +NGRREL+ S +IHY R
Sbjct: 1 MTPTHNLAFLSILLVLLPAIVAAHDHGRVAGINNVTYDGKSLFVNGRRELLFSGSIHYTR 60
Query: 54 SVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMI 113
S P WP ++ +A+ GG+N I++YVFWN HE GK+ F G +LVKFI+++Q MY+
Sbjct: 61 STPDAWPDILDKARHGGLNVIQTYVFWNAHEPEQGKFNFEGNNDLVKFIRLVQSKGMYVT 120
Query: 114 LRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQG 173
LR+GPF+ AE+N+GG+P WL +PG +FR+D EP+K +M+ +++ I+ MMK EKLFA QG
Sbjct: 121 LRVGPFIQAEWNHGGLPYWLREVPGIIFRSDNEPYKKYMKAYVSKIIQMMKDEKLFAPQG 180
Query: 174 GPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCN 233
GPIILAQ+ENEY + + Y E G Y WAA MAVA +IGVPWIMC+Q D PDPVIN CN
Sbjct: 181 GPIILAQIENEYNHIQLAYEEKGDSYVQWAANMAVALDIGVPWIMCKQKDAPDPVINACN 240
Query: 234 SFYC-DQFT-PHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNY 291
+C D F+ P+ P P +WTENW ++ FG R +EDIAFSVARFF K G++ NY
Sbjct: 241 GRHCGDTFSGPNKPYKPSLWTENWTAQYRVFGDPVSQRSAEDIAFSVARFFSKNGNLVNY 300
Query: 292 YMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGE 351
YMYHGGTNFGRT F TT Y EAP+DEYG+ R PKW HL++ H A+ LC A+L G
Sbjct: 301 YMYHGGTNFGRTTSA-FTTTRYYDEAPLDEYGMERQPKWSHLRDAHKALLLCRKAILGGV 359
Query: 352 RSNLSLGSSQEADVYAD-SSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKK 410
+ L E ++ + C+AF+ N T+ FR +Y LPA S+S+LPDCK
Sbjct: 360 PTVQKLNDYHEVRIFEKPGTSTCSAFITNNHTNQAATISFRGSNYFLPAHSISVLPDCKT 419
Query: 411 VVFNTANVRAQSSTVEMVPEN----LQPSEASPDNGSKG-----LKWQVFKEIAGIWGEA 461
VV+NT NV Q +++ + L S+ + N K LKW++F E +
Sbjct: 420 VVYNTQNVMNQLVYYKLISSHLIIKLIVSQHNKRNFVKSAVANNLKWELFLEAIPSSKKL 479
Query: 462 DFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQE 521
+ + ++ KDTTDY WYTTS + E+ K + +L I S GH L AF N +
Sbjct: 480 ESNQKIPLELYTLLKDTTDYGWYTTSFELGP-EDLPKKSA--ILRIMSLGHTLSAFVNGQ 536
Query: 522 LQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGF 581
G+ G F+++ P + K G N I++L+ TVGL ++G + E AG S+ I G
Sbjct: 537 YIGTDHGTHEEKSFEFEQPANFKVGTNYISILATTVGLPDSGAYMEHRYAGPKSISILGL 596
Query: 582 NSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPG 641
N G L+L+ W +++GL+GE L ++ + W + + L+W K P G
Sbjct: 597 NKGKLELTKNGWGHRVGLRGEQLKVFTEEGSKKVQWDPVT--GETRALSWLKTRFATPEG 654
Query: 642 DEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCG 701
P+ + M MGKG+ W+NG+ IGR+W ++ G
Sbjct: 655 RGPVAIRMTGMGKGMIWVNGKSIGRHWM-------------------------SFLSPLG 689
Query: 702 EPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSI 739
+PSQ YHIPR + +N+LV+ EE+ G P KI I
Sbjct: 690 QPSQEEYHIPRDYLNAKDNLLVVLEEEKGSPEKIEIMI 727
>gi|414878435|tpg|DAA55566.1| TPA: hypothetical protein ZEAMMB73_938277 [Zea mays]
Length = 774
Score = 599 bits (1545), Expect = e-168, Method: Compositional matrix adjust.
Identities = 307/638 (48%), Positives = 396/638 (62%), Gaps = 35/638 (5%)
Query: 128 GIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGY 187
G PVWL +PG FR D EP+K MQ F+T IVD+MK EKL++ QGGPIIL Q+ENEYG
Sbjct: 19 GFPVWLRDVPGIEFRTDNEPYKAEMQIFVTKIVDIMKEEKLYSWQGGPIILQQIENEYGN 78
Query: 188 YESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSM 247
+ YG+ GKRY LWAA+MA+A + GVPW+MC+Q D P+ ++NTCN+FYCD F P+S +
Sbjct: 79 IQGHYGQAGKRYMLWAAQMALALDTGVPWVMCRQTDAPEQILNTCNAFYCDGFKPNSYNK 138
Query: 248 PKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGP 307
P IWTE+W GW+ +G PHRP++D AF+VARF+Q+GGS+ NYYMY GGTNF RTAGGP
Sbjct: 139 PTIWTEDWDGWYADWGESLPHRPAQDSAFAVARFYQRGGSLQNYYMYFGGTNFERTAGGP 198
Query: 308 FITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHAL--LNGERSNLSLGSSQEADV 365
TSYDY+APIDEYG+ R PKWGHLK+LH AIKLCE AL ++G + LG QEA V
Sbjct: 199 LQITSYDYDAPIDEYGILRQPKWGHLKDLHAAIKLCESALTAVDGSPHYVKLGPMQEAHV 258
Query: 366 YAD-----------SSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFN 414
Y+ +S C+AFLAN+D+ +V SY LP WSVSILPDC+ V FN
Sbjct: 259 YSSENVHTNGSISGNSQFCSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCETVAFN 318
Query: 415 TANVRAQSSTVEMVPENLQPSEASPDNGS---------KGLKWQVFKEIAGIWGEADFVK 465
TA V Q+S + E+ PS +S W FKE GIWGE F
Sbjct: 319 TARVGTQTSFFNV--ESGSPSYSSRHKPRILSLIGVPYLSTTWWTFKEPVGIWGEGIFTA 376
Query: 466 SGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKN--GSRPVLLIESKGHALHAFANQELQ 523
G ++H+N TKD +DYL YTT + ++E + N G P L I+ F N +L
Sbjct: 377 QGILEHLNVTKDISDYLSYTTRVNISEEDVLYWNSKGFLPSLTIDQIRDVARVFVNGKLA 436
Query: 524 GSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGIT-SVKITGFN 582
GS G+ P+ L G NE+ LLS VGLQN G F E GAG VK+TG +
Sbjct: 437 GSKVGHWV----SLNQPLQLVQGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVKLTGLS 492
Query: 583 SGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGD 642
+G +DL+ WTY+IGL+GE IY+P Y+ + W S P TW+K + P G+
Sbjct: 493 NGDIDLTNSLWTYQIGLKGEFSRIYSPEYQGSAEWSSMQNDDTVSPFTWFKTMFDAPEGN 552
Query: 643 EPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGE 702
P+ +D+ MGKG AW+NG IGRYW +P C C+Y G ++ KC + CG
Sbjct: 553 GPVTIDLGSMGKGQAWVNGHLIGRYW----SLVAPESGCPSSCNYAGTYSDSKCRSNCGI 608
Query: 703 PSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
+Q WYHIPR W + S N+LV+FEE GGDP++I+ +
Sbjct: 609 ATQSWYHIPREWLQESGNLLVLFEETGGDPSQISLEVH 646
>gi|320170852|gb|EFW47751.1| beta-galactosidase [Capsaspora owczarzaki ATCC 30864]
Length = 851
Score = 598 bits (1541), Expect = e-168, Method: Compositional matrix adjust.
Identities = 314/734 (42%), Positives = 431/734 (58%), Gaps = 44/734 (5%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A NVTYDSR+L+++G+R L+I+ IHYPRS P MWP L +AK G++ I++Y+FW+ ++
Sbjct: 47 AMNVTYDSRALLLDGQRRLLIAGCIHYPRSTPEMWPELFARAKANGLDVIQTYLFWDVNQ 106
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
+PG++ RF+ V+FIK+ QQA + + RIGP+V AE+NYGG P WL I G VFR++
Sbjct: 107 PTPGEFVMTDRFDYVRFIKLAQQAGLMVNFRIGPYVCAEWNYGGFPAWLRQISGIVFRDN 166
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
+P+ + ++T V ++K KL A+ GGP+IL Q+ENEYG E Y GG Y W
Sbjct: 167 DKPWLDVVGPYITKTVQVLKDNKLLAADGGPVILLQIENEYGNIEDSYA-GGPAYVQWCG 225
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
++A + N G WIMCQQ D P I TCN FYCD + PH P +WTENWPGWF+T+G
Sbjct: 226 QLAASLNAGAQWIMCQQDDAPANTIATCNGFYCDNYVPHK-GQPMMWTENWPGWFQTWGQ 284
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
PHRP++D+AF+ ARF+ KGG+ +YYMYHGGTNFGRTAGGP ITTSYDY+ +DEYG+
Sbjct: 285 PSPHRPAQDVAFAAARFYAKGGTYMSYYMYHGGTNFGRTAGGPGITTSYDYDVALDEYGM 344
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGER-SNLSLGSSQEADVYADSSGACAAFLANMDDK 383
P PK+ HL LH + EH +++ + +SLG + EA V+ SSG C AFL+N+D
Sbjct: 345 PSEPKYSHLGSLHAVLHANEHIIMSMNVPAPISLGKNLEAHVFNSSSG-CVAFLSNIDSS 403
Query: 384 NDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG- 442
D V F ++ LPAWSVSIL +C ++NTA V A + M P + S
Sbjct: 404 VDAEVQFNGRTFELPAWSVSILHNCAFAIYNTAAVSAPLNARRMTPLVVHEDAVSDAADH 463
Query: 443 ----SKG---------LKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSII 489
SKG + + E G E + + INTT DTTDYLWYTT+
Sbjct: 464 RRSLSKGEGQERVGAFSTFASYAETIGRRAEEAVYFTSPQEQINTTNDTTDYLWYTTTY- 522
Query: 490 VNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNE 549
N + S + + F GS + + L AG N
Sbjct: 523 -NSASATSQVLSISNVNDVVYVYVNRQFVTMSWSGSVN-----------KAVPLMAGTNV 570
Query: 550 IALLSMTVGLQNAGPFYEWVGAGIT-SVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYN 608
I +LS T GLQN G F E V GI +VK+ G+ DL+ W +++GL GE LGI+
Sbjct: 571 IDVLSTTFGLQNYGTFLEQVTRGIQGTVKL-----GSTDLTQNGWWHQVGLLGEELGIFL 625
Query: 609 PGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDE-PIGLDMLKMGKGLAWLNGEEIGRY 667
P +N+ W + N+ LTWY++ P + P+ LDM MGKG W+NG +GRY
Sbjct: 626 PQNASNVPWATPAT--TNRGLTWYRSSFDLPQSSQAPLALDMTGMGKGFVWVNGHNLGRY 683
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
WP + S D +CDYRG ++ +C GC PSQR+YH+PR W +P+ N++V+ EE
Sbjct: 684 WPSRIADSMACD----DCDYRGAYDDSRCRQGCNIPSQRYYHVPREWLQPTNNLIVMLEE 739
Query: 728 KGGDPTKITFSIRK 741
GG+P I+ R+
Sbjct: 740 IGGNPALISLVERE 753
>gi|357154419|ref|XP_003576777.1| PREDICTED: beta-galactosidase 12-like [Brachypodium distachyon]
Length = 835
Score = 598 bits (1541), Expect = e-168, Method: Compositional matrix adjust.
Identities = 315/718 (43%), Positives = 431/718 (60%), Gaps = 43/718 (5%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD RSL+I+G+R+L S AIHYPRS P MWP L+ +AK+GG+NTIE+YVFWN HE P
Sbjct: 33 VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWPKLLDRAKDGGLNTIETYVFWNAHEPEP 92
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GKY F GR +L+KF+K+IQ MY ++RIGPF+ AE+N+GG+P WL IP +FR + EP
Sbjct: 93 GKYNFEGRCDLIKFLKLIQDNDMYAVIRIGPFIQAEWNHGGLPYWLREIPHIIFRANNEP 152
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
+K M+KF+ IV +K +FASQGGPIILAQ+ENEYG + + G +Y WAA+MA
Sbjct: 153 YKKEMEKFVRFIVQKLKDADMFASQGGPIILAQIENEYGNIKKDHITDGDKYLEWAAEMA 212
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
++ NIG+PWIMC+Q P VI TCN +C D +T + P++WTENW F+ FG +
Sbjct: 213 LSTNIGIPWIMCKQTTAPGVVIPTCNGRHCGDTWTLRDKNKPRLWTENWTAQFRAFGDQA 272
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
R +EDIA+SV RFF KGG++ NYYMY+GGTNFGRT G ++ T Y EAPIDEYGL +
Sbjct: 273 AVRSAEDIAYSVLRFFAKGGTLVNYYMYYGGTNFGRT-GASYVLTGYYDEAPIDEYGLNK 331
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDDKND 385
PK+GHL++LH IK A L G++S LG EA Y C AF++N + D
Sbjct: 332 EPKFGHLRDLHKLIKSYHKAFLVGKQSFELLGHGYEAHNYELPEENLCLAFISNNNTGED 391
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
TV+FR Y++P+ SVSIL DC VV+NT V Q S + S + D +K
Sbjct: 392 GTVMFRGKKYYIPSRSVSILADCNHVVYNTKRVFVQHS---------ERSFHTADESTKN 442
Query: 446 LKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVL 505
W+++ E + ++ N TKD +DYLWYTTS + ++ + RPV+
Sbjct: 443 NVWEMYSEPIPRYKVTSVRTKEPLEQYNLTKDKSDYLWYTTSFRLEADDLPFRRDIRPVV 502
Query: 506 LIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPF 565
++S HA+ F N GS G+ F ++ PI L+ G N +ALLS ++G++++G
Sbjct: 503 QVKSSAHAMMGFVNDAFAGSGRGSKKDKGFLFEKPIDLRIGINHLALLSSSMGMKDSGGE 562
Query: 566 YEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPK 625
V GI I G N+GTLDL W +KI L GE IY + W +P +
Sbjct: 563 LVEVKGGIQDCMIQGLNTGTLDLQGNGWGHKINLDGEDKEIYTEKGMGTVKW----KPAE 618
Query: 626 N-QPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
N +TWY+ +P GD+P+ LDM M KG+ ++NGE +GRYW
Sbjct: 619 NGHAVTWYRRYFDEPDGDDPVVLDMSSMSKGMIFVNGEGVGRYW---------------- 662
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITF-SIRK 741
Y+ T G PSQ YHIPR + K +N+LV+FEE+ G P I ++R+
Sbjct: 663 TSYK---------TIAGLPSQSLYHIPRPFLKSKKNLLVVFEEEIGKPEGILIQTVRR 711
>gi|22329897|ref|NP_683341.1| beta-galactosidase 15 [Arabidopsis thaliana]
gi|332193266|gb|AEE31387.1| beta-galactosidase 15 [Arabidopsis thaliana]
Length = 786
Score = 597 bits (1538), Expect = e-168, Method: Compositional matrix adjust.
Identities = 322/735 (43%), Positives = 427/735 (58%), Gaps = 81/735 (11%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
F L SS Y A V++D R++ I+G R +++S +IHYPRS MWP L+++ KEG
Sbjct: 29 FILCCVLVSSCAY--ATIVSHDGRAITIDGHRRVLLSGSIHYPRSTTEMWPDLIKKGKEG 86
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
++ IE+YVFWN HE + +Y F G +L++F+K IQ MY +LRIGP+V AE+NYGG
Sbjct: 87 SLDAIETYVFWNAHEPTRRQYDFSGNLDLIRFLKTIQNEGMYGVLRIGPYVCAEWNYGGF 146
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
PVWLH +PG FR F MQ F T+IV+M+K+EKLFASQGGPIILAQ+ENEYG
Sbjct: 147 PVWLHNMPGMEFRTTNTAFMNEMQNFTTMIVEMVKKEKLFASQGGPIILAQIENEYGNVI 206
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
YGE GK Y W A MA + ++GVPWIMCQQ D P P++NTCN +YCD F+P++P+ PK
Sbjct: 207 GSYGEAGKAYIQWCANMANSLDVGVPWIMCQQDDAPQPMLNTCNGYYCDNFSPNNPNTPK 266
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTENW GW+K +GG+DPHR +ED+AF+VARFFQK G+ NYYMYHGGTNF RTAGGP+I
Sbjct: 267 MWTENWTGWYKNWGGKDPHRTTEDVAFAVARFFQKEGTFQNYYMYHGGTNFDRTAGGPYI 326
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TT+YDY+AP+DE+G PK+GHLK+LH + E L G S + G+ A VY
Sbjct: 327 TTTYDYDAPLDEFGNLNQPKYGHLKQLHDVLHAMEKTLTYGNISTVDFGNLVTATVYQTE 386
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
G+ + F+ N+++ +D + F+ SY +PAWSVSILPDCK +NTA + Q+S MV
Sbjct: 387 EGS-SCFIGNVNETSDAKINFQGTSYDVPAWSVSILPDCKTETYNTAKINTQTSV--MVK 443
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEIAG---IWGEADFVKSGFVDHINTTKDTTDYLWYTT 486
+ +EA +N LKW E + G+ + D + D +DYLWY T
Sbjct: 444 ---KANEA--ENEPSTLKWSWRPENIDSVLLKGKGESTMRQLFDQKVVSNDESDYLWYMT 498
Query: 487 SIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAG 546
++ + E + L G L I S H LHAF N + G+ + ++ G
Sbjct: 499 TVNLKEQDPVL--GKNMSLRINSTAHVLHAFVNGQHIGNYRVENGKFHYVFEQDAKFNPG 556
Query: 547 KNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTL---DLSTYSWTYKIGLQGE 602
N I LLS+TVGL N G F+E AGIT V I G N DLST+ W+YK GL G
Sbjct: 557 ANVITLLSITVGLPNYGAFFENFSAGITGPVFIIGRNGDETIVKDLSTHKWSYKTGLSGF 616
Query: 603 HLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGE 662
+++ P TW P G EP+ +D+L +GKG AW+NG
Sbjct: 617 ENQLFS----------------SESPSTW-----SAPLGSEPVVVDLLGLGKGTAWINGN 655
Query: 663 EIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENIL 722
IGRYWP F D I G +N L
Sbjct: 656 NIGRYWP--------------------AFLSD--IDG-------------------DNTL 674
Query: 723 VIFEEKGGDPTKITF 737
V+FEE GG+P+ + F
Sbjct: 675 VLFEEIGGNPSLVNF 689
>gi|11079481|gb|AAG29193.1|AC078898_3 beta-galactosidase, putative [Arabidopsis thaliana]
Length = 780
Score = 597 bits (1538), Expect = e-167, Method: Compositional matrix adjust.
Identities = 320/722 (44%), Positives = 417/722 (57%), Gaps = 70/722 (9%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
NVTYD RSLII+G +++ S +IHY RS P MWP L+ +AK GG++ +++YVFWN HE
Sbjct: 9 VANVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVVDTYVFWNVHE 68
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
G++ F G ++VKFIK ++ +Y+ LRIGPF+ E++YGG+P WLH + G VFR D
Sbjct: 69 PQQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGIVFRTD 128
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFKYHM+++ +IV +MK E L+ASQGGPIIL+Q+ENEYG + + GK Y W A
Sbjct: 129 NEPFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVGRAFRQEGKSYVKWTA 188
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIWTENWPGWFKTF 262
K+AV + GVPW+MC+Q D PDP++N CN C + P+SP+ P IWTENW
Sbjct: 189 KLAVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSL---- 244
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
+EDIAF VA F K GS NYYMYHGGTNFGR A F+ TSY +AP+DEY
Sbjct: 245 -------SAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVITSYYDQAPLDEY 296
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
GL R PKWGHLKELH A+KLCE LL+G ++ +SLG Q A V+ + CAA L N D
Sbjct: 297 GLLRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFGKKANLCAAILVN-QD 355
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
K + TV FRN SY L SVS+LPDCK V FNTA V AQ +T + + N
Sbjct: 356 KCESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYNT---------RTRKARQNL 406
Query: 443 SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
S W+ F E + E ++H+NTT+DT+DYLW TT +E G+
Sbjct: 407 SSPQMWEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQTTRFQQSE-------GAP 459
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
VL + GHALHAF N GS G F + +SL G N +ALLS+ VGL N+
Sbjct: 460 SVLKVNHLGHALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLALLSVMVGLPNS 519
Query: 563 GPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
G E G SVKI L + YSW Y++GL+GE +Y + W
Sbjct: 520 GAHLERRVVGSRSVKIWN-GRYQLYFNNYSWGYQVGLKGEKFHVYTEDGSAKVQW-KQYR 577
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
K+QPLTWYKA P G++P+ L++ MGKG AW+NG+ I +
Sbjct: 578 DSKSQPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIAMF--------------- 622
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIF-EEKGGDPTKITFSIRK 741
S YHIPRS+ KP+ N+LVI EE+ G+P IT
Sbjct: 623 ---------------------SYFRYHIPRSFLKPNSNLLVILEEEREGNPLGITIDTVS 661
Query: 742 IS 743
++
Sbjct: 662 VT 663
>gi|6686900|emb|CAB64750.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 887
Score = 593 bits (1530), Expect = e-167, Method: Compositional matrix adjust.
Identities = 310/709 (43%), Positives = 428/709 (60%), Gaps = 46/709 (6%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD SLIING+REL S ++HYPRS P MWP ++ +A+ GG+NTI++YVFWN HE
Sbjct: 41 VTYDGTSLIINGKRELFFSGSVHYPRSTPDMWPSIIDKARIGGLNTIQTYVFWNVHEPEQ 100
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GKY F GRF+LVKFIK+I + +Y+ LR+GPF+ AE+N+GG+P WL +P FR + EP
Sbjct: 101 GKYDFKGRFDLVKFIKLIHEKGLYVTLRLGPFIQAEWNHGGLPYWLREVPDVYFRTNNEP 160
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK H ++++ I+ MMK EKLFASQGGPIIL Q+ENEY + Y E G++Y WAA +
Sbjct: 161 FKEHTERYVRKILGMMKEEKLFASQGGPIILGQIENEYNAVQLAYKENGEKYIKWAANLV 220
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQF-TPHSPSMPKIWTENWPGWFKTFGGR 265
+ N+G+PW+MC+Q D P +IN CN +C D F P+ P +WTENW F+ FG
Sbjct: 221 ESMNLGIPWVMCKQNDAPGNLINACNGRHCGDTFPGPNRHDKPSLWTENWTTQFRVFGDP 280
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
R +EDIAFSVAR+F K GS NYYMYHGGTNFGRT+ F+TT Y +AP+DE+GL
Sbjct: 281 PTQRTAEDIAFSVARYFSKNGSHVNYYMYHGGTNFGRTS-AHFVTTRYYDDAPLDEFGLE 339
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD-SSGACAAFLANMDDKN 384
+ PK+GHLK +H A++LC+ AL G+ +LG E Y + CAAFL+N + ++
Sbjct: 340 KAPKYGHLKHVHRALRLCKKALFWGQLRAQTLGPDTEVRYYEQPGTKVCAAFLSNNNTRD 399
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
T+ F+ Y LP+ S+SILPDCK VV+NTA + AQ S + V + SK
Sbjct: 400 TNTIKFKGQDYVLPSRSISILPDCKTVVYNTAQIVAQHSWRDFV---------KSEKTSK 450
Query: 445 GLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
GLK+++F E + D + G + ++ TKD TDY WYTTS+ ++E++ + G + +
Sbjct: 451 GLKFEMFSENIPSLLDGDSLIPGELYYL--TKDKTDYAWYTTSVKIDEDDFPDQKGLKTI 508
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
L + S GHAL + N E G A G F++ P++ K G N I++L + GL ++G
Sbjct: 509 LRVASLGHALIVYVNGEYAGKAHGRHEMKSFEFAKPVNFKTGDNRISILGVLTGLPDSGS 568
Query: 565 FYEWVGAGITSVKITGFNSGTLDLS-TYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEP 623
+ E AG ++ I G SGT DL+ W + GL+GE +Y + W E
Sbjct: 569 YMEHRFAGPRAISIIGLKSGTRDLTENNEWGHLAGLEGEKKEVYTEEGSKKVKWEKDGE- 627
Query: 624 PKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQ 683
+PLTWYK + P G + + M MGKGL W+NG +GRYW
Sbjct: 628 --RKPLTWYKTYFETPEGVNAVAIRMKGMGKGLIWVNGIGVGRYWM-------------- 671
Query: 684 ECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFK--PSENILVIFEEKGG 730
++ GEP+Q YHIPRS+ K +N+LVI EE+ G
Sbjct: 672 -----------SFLSPLGEPTQTEYHIPRSFMKGEKKKNMLVILEEEPG 709
>gi|152013366|sp|Q9SCU8.2|BGL14_ARATH RecName: Full=Beta-galactosidase 14; Short=Lactase 14; Flags:
Precursor
Length = 887
Score = 593 bits (1530), Expect = e-167, Method: Compositional matrix adjust.
Identities = 310/709 (43%), Positives = 429/709 (60%), Gaps = 46/709 (6%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD SLIING+REL+ S ++HYPRS P MWP ++ +A+ GG+NTI++YVFWN HE
Sbjct: 41 VTYDGTSLIINGKRELLFSGSVHYPRSTPHMWPSIIDKARIGGLNTIQTYVFWNVHEPEQ 100
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GKY F GRF+LVKFIK+I + +Y+ LR+GPF+ AE+N+GG+P WL +P FR + EP
Sbjct: 101 GKYDFKGRFDLVKFIKLIHEKGLYVTLRLGPFIQAEWNHGGLPYWLREVPDVYFRTNNEP 160
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK H ++++ I+ MMK EKLFASQGGPIIL Q+ENEY + Y E G++Y WAA +
Sbjct: 161 FKEHTERYVRKILGMMKEEKLFASQGGPIILGQIENEYNAVQLAYKENGEKYIKWAANLV 220
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQF-TPHSPSMPKIWTENWPGWFKTFGGR 265
+ N+G+PW+MC+Q D P +IN CN +C D F P+ P +WTENW F+ FG
Sbjct: 221 ESMNLGIPWVMCKQNDAPGNLINACNGRHCGDTFPGPNRHDKPSLWTENWTTQFRVFGDP 280
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
R EDIAFSVAR+F K GS NYYMYHGGTNFGRT+ F+TT Y +AP+DE+GL
Sbjct: 281 PTQRTVEDIAFSVARYFSKNGSHVNYYMYHGGTNFGRTS-AHFVTTRYYDDAPLDEFGLE 339
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD-SSGACAAFLANMDDKN 384
+ PK+GHLK +H A++LC+ AL G+ +LG E Y + CAAFL+N + ++
Sbjct: 340 KAPKYGHLKHVHRALRLCKKALFWGQLRAQTLGPDTEVRYYEQPGTKVCAAFLSNNNTRD 399
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
T+ F+ Y LP+ S+SILPDCK VV+NTA + AQ S + V + SK
Sbjct: 400 TNTIKFKGQDYVLPSRSISILPDCKTVVYNTAQIVAQHSWRDFV---------KSEKTSK 450
Query: 445 GLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
GLK+++F E + D + G + ++ TKD TDY WYTTS+ ++E++ + G + +
Sbjct: 451 GLKFEMFSENIPSLLDGDSLIPGELYYL--TKDKTDYAWYTTSVKIDEDDFPDQKGLKTI 508
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
L + S GHAL + N E G A G F++ P++ K G N I++L + GL ++G
Sbjct: 509 LRVASLGHALIVYVNGEYAGKAHGRHEMKSFEFAKPVNFKTGDNRISILGVLTGLPDSGS 568
Query: 565 FYEWVGAGITSVKITGFNSGTLDLS-TYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEP 623
+ E AG ++ I G SGT DL+ W + GL+GE +Y + W +
Sbjct: 569 YMEHRFAGPRAISIIGLKSGTRDLTENNEWGHLAGLEGEKKEVYTEEGSKKVKW---EKD 625
Query: 624 PKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQ 683
K +PLTWYK + P G + + M MGKGL W+NG +GRYW
Sbjct: 626 GKRKPLTWYKTYFETPEGVNAVAIRMKAMGKGLIWVNGIGVGRYWM-------------- 671
Query: 684 ECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFK--PSENILVIFEEKGG 730
++ GEP+Q YHIPRS+ K +N+LVI EE+ G
Sbjct: 672 -----------SFLSPLGEPTQTEYHIPRSFMKGEKKKNMLVILEEEPG 709
>gi|57283683|emb|CAG30731.1| beta-galactosidase precursor [Triticum monococcum]
Length = 839
Score = 592 bits (1526), Expect = e-166, Method: Compositional matrix adjust.
Identities = 312/717 (43%), Positives = 427/717 (59%), Gaps = 41/717 (5%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD SL+I+GRREL S AIHYPRS MWP L++ AKEGG+NTIE+YVFWN HE P
Sbjct: 38 VTYDKYSLMIDGRRELFFSGAIHYPRSPTQMWPKLLKTAKEGGLNTIETYVFWNAHEPEP 97
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GK+ F GR +++KF+K+IQ MY I+RIGPF+ E+N+G +P WL IP +FR + EP
Sbjct: 98 GKFNFEGRNDMIKFLKLIQSFGMYAIVRIGPFIQGEWNHGALPYWLREIPHIIFRANNEP 157
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
+K M+KF+ IV M+K E LFASQGG +ILAQ+ENEYG + + G +Y WAA+MA
Sbjct: 158 YKREMEKFVRFIVQMLKDENLFASQGGNVILAQIENEYGNIKKDHITEGDKYLEWAAEMA 217
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
++ NIGVPWIMC+Q P VI TCN +C D + + P +WTENW F+ FG
Sbjct: 218 ISTNIGVPWIMCKQSTAPGVVIPTCNGRHCGDTWIMKDENKPHLWTENWTAQFRAFGNDL 277
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
R +EDIA+SV RFF KGG++ NYYMY+GGTNFGRT G ++ T Y E PIDEYG+P+
Sbjct: 278 AQRSAEDIAYSVLRFFAKGGTLVNYYMYYGGTNFGRT-GASYVLTGYYDEGPIDEYGMPK 336
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDDKND 385
PK+GHL++LH IK A L G++S LG EA + C AF++N + D
Sbjct: 337 APKYGHLRDLHNVIKSYSRAFLEGKQSFELLGQGYEARNFEIPEEKLCLAFISNNNTGED 396
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
TV+FR Y++P+ SVSIL DCK VV+NT V Q S + S + +K
Sbjct: 397 GTVIFRGDKYYIPSRSVSILADCKHVVYNTKRVFVQHS---------ERSFHKAEKATKN 447
Query: 446 LKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVL 505
W++F E+ + + ++ N TKD +DYLWYTTS + ++ ++ RPV+
Sbjct: 448 NVWEMFSELIPRYKQTTIRNKEPLEQYNQTKDQSDYLWYTTSFRLEADDLPIRGDIRPVI 507
Query: 506 LIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPF 565
++S HA+ F N G+ G+ F ++ PISL+ G N +ALLS ++G++++G
Sbjct: 508 AVKSTAHAMVGFVNDAFAGNGHGSKKEKFFTFETPISLRLGVNHLALLSSSMGMKDSGGE 567
Query: 566 YEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPK 625
+ GI I G N+GTLDL W +K L+GE IY + WV +
Sbjct: 568 LVELKGGIQDCTIQGLNTGTLDLQINGWGHKAKLEGEVKEIYTEKGMGAVKWVPAVS--- 624
Query: 626 NQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQEC 685
Q +TWYK +P GD+P+ LDM M KG+ ++NGE +GRYW
Sbjct: 625 GQAVTWYKRYFDEPDGDDPVVLDMTSMCKGMIFVNGEGMGRYW----------------T 668
Query: 686 DYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITF-SIRK 741
Y+ P K SQ YHIPR++ K N+LV+FEE+ G P I ++R+
Sbjct: 669 SYK---TPGKV------ASQAVYHIPRTFLKSKNNLLVVFEEELGKPEGILIQTVRR 716
>gi|297836382|ref|XP_002886073.1| beta-galactosidase 13 [Arabidopsis lyrata subsp. lyrata]
gi|297331913|gb|EFH62332.1| beta-galactosidase 13 [Arabidopsis lyrata subsp. lyrata]
Length = 848
Score = 592 bits (1525), Expect = e-166, Method: Compositional matrix adjust.
Identities = 318/721 (44%), Positives = 434/721 (60%), Gaps = 52/721 (7%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD SLIING REL+ S +IHYPRS P MWP ++++AK+GG+NTI++YVFWN HE
Sbjct: 44 VTYDGTSLIINGNRELLYSGSIHYPRSTPEMWPNIIKRAKQGGLNTIQTYVFWNVHEPEQ 103
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GK+ F GR +LVKFIK+I++ MY+ LR+GPF+ AE+ +GG+P WL +PG FR D P
Sbjct: 104 GKFNFSGRADLVKFIKLIEKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNTP 163
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK H ++++ +I+D MK EKLFASQGGPIIL Q+ENEY + Y E G Y WA+K+
Sbjct: 164 FKEHTERYVKVILDKMKEEKLFASQGGPIILGQIENEYSAVQRAYKEDGLNYIKWASKLV 223
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFT-PHSPSMPKIWTENWPGWFKTFGGR 265
+ ++G+PW+MC+Q D PDP+IN CN +C D F P+ + P +WTENW F+ +G
Sbjct: 224 HSMDLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKENKPSLWTENWTTQFRVYGDP 283
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
R EDIA+SVARFF K G+ NYYMYHGGTNFGRT+ ++TT Y +AP+DEYGL
Sbjct: 284 PAQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYDDAPLDEYGLE 342
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD-SSGACAAFLANMDDKN 384
R PK+GHLK LH A+ LC+ ALL G+ + E Y + CAAFLAN + ++
Sbjct: 343 REPKYGHLKHLHNALNLCKKALLWGQPRVEKPSNETEIRYYEQPGTKVCAAFLANNNTES 402
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
+ + F+ Y +P S+SILPDCK VV+NT + + ++ N S+ + +K
Sbjct: 403 AEKIKFKGKEYIIPHRSISILPDCKTVVYNTGEIISHHTS-----RNFMKSKKA----NK 453
Query: 445 GLKWQVFKEI--AGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
++VF E + I G++ V+ TKD TDY WYTTS +++N+ K GS+
Sbjct: 454 NFDFKVFTETVPSKIKGDSYIP----VELYGLTKDETDYGWYTTSFKIDDNDLSKKKGSK 509
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
P L I S GHALH + N E G+ G+ F ++ PISLK G+N + +L + G ++
Sbjct: 510 PTLRIASLGHALHVWLNGEYLGNGHGSHEEKSFVFQKPISLKEGENHLTMLGVLTGFPDS 569
Query: 563 GPFYEWVGAGITSVKITGFNSGTLDLSTYS-WTYKIGLQGEHLGIYNPGYRNNINW--VS 619
G + E G SV I G SGTLDL+ + W K+G++GE LGI+ + W S
Sbjct: 570 GSYMEHRYTGPRSVSILGLGSGTLDLTEENKWGNKVGMEGEKLGIHAEEGLKKVKWQKFS 629
Query: 620 TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHD 679
EP LTWY+ P + M MGKGL W+NGE +GRYW
Sbjct: 630 GKEP----GLTWYQTYFDAPESQSAAAIRMNGMGKGLIWVNGEGVGRYWM---------- 675
Query: 680 ECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGG-DPTKITFS 738
++ G+P+Q YHIPRS+ KP +N+LVIFEE+ P I F
Sbjct: 676 ---------------SFLSPLGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELIDFV 720
Query: 739 I 739
I
Sbjct: 721 I 721
>gi|222616997|gb|EEE53129.1| hypothetical protein OsJ_35927 [Oryza sativa Japonica Group]
Length = 740
Score = 590 bits (1522), Expect = e-166, Method: Compositional matrix adjust.
Identities = 317/675 (46%), Positives = 407/675 (60%), Gaps = 64/675 (9%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NVTYD R+++I G+R +++SA +HYPR+ P MWP L+ + KEGG + IE+YVFWNGHE +
Sbjct: 63 NVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGADVIETYVFWNGHEPA 122
Query: 87 PGKYYFGGRFNLVKFIKI--IQQARM---------------------------------Y 111
G+YYF RF+LVKF KI ++ A++ Y
Sbjct: 123 KGQYYFEERFDLVKFAKIDLVKFAKLMWPSLIAKCKEGGADVIETYVFWNGHEPAKGQYY 182
Query: 112 MILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFAS 171
R P ++ G PVWL IPG FR D EPFK MQ F+T IV +MK EKL++
Sbjct: 183 FEERFDPVKFEKHVIFGFPVWLRDIPGIEFRTDNEPFKAEMQTFVTKIVTLMKEEKLYSW 242
Query: 172 QGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINT 231
QGGPIIL Q+ENEYG + YG+ GKRY WAA+MA+ + G+PW+MC+Q D P+ +I+T
Sbjct: 243 QGGPIILQQIENEYGNIQGNYGQAGKRYMQWAAQMAIGLDTGIPWVMCRQTDAPEEIIDT 302
Query: 232 CNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNY 291
CN+FYCD F P+S + P IWTE+W GW+ +GG PHRP+ED AF+VARF+Q+GGS+ NY
Sbjct: 303 CNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGGALPHRPAEDSAFAVARFYQRGGSLQNY 362
Query: 292 YMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALL--N 349
YMY GGTNF RTAGGP TSYDY+APIDEYG+ R PKWGHLK+LH AIKLCE AL+ +
Sbjct: 363 YMYFGGTNFARTAGGPLQITSYDYDAPIDEYGILRQPKWGHLKDLHTAIKLCEPALIAVD 422
Query: 350 GERSNLSLGSSQEADVY-----------ADSSGACAAFLANMDDKNDKTVVFRNVSYHLP 398
G + LGS QEA VY A ++ C+AFLAN+D+ +V SY LP
Sbjct: 423 GSPQYIKLGSMQEAHVYSTGEVHTNGSMAGNAQICSAFLANIDEHKYASVWIFGKSYSLP 482
Query: 399 AWSVSILPDCKKVVFNTANVRAQSS--TVE----MVPENLQPSEASPDNGSKGLK--WQV 450
WSVSILPDC+ V FNTA + AQ+S TVE +PS S +G L W
Sbjct: 483 PWSVSILPDCENVAFNTARIGAQTSVFTVESGSPSRSSRHKPSILSLTSGGPYLSSTWWT 542
Query: 451 FKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFL--KNGSRPVLLIE 508
KE G WG +F G ++H+N TKD +DYLWYTT + +++ + G P L I+
Sbjct: 543 SKETIGTWGGNNFAVQGILEHLNVTKDISDYLWYTTRVNISDADVAFWSSKGVLPSLTID 602
Query: 509 SKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEW 568
F N +L GS G+ K PI L G NE+ LLS VGLQN G F E
Sbjct: 603 KIRDVARVFVNGKLAGSQVGHWV----SLKQPIQLVEGLNELTLLSEIVGLQNYGAFLEK 658
Query: 569 VGAGIT-SVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQ 627
GAG V +TG + G +DL+ WTY++GL+GE IY P + W S M+ Q
Sbjct: 659 DGAGFRGQVTLTGLSDGDVDLTNSLWTYQVGLKGEFSMIYAPEKQGCAGW-SRMQKDSVQ 717
Query: 628 PLTWYKAVVKQPPGD 642
P TWYK + Q GD
Sbjct: 718 PFTWYKNICNQSVGD 732
>gi|4581116|gb|AAD24606.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 832
Score = 590 bits (1520), Expect = e-165, Method: Compositional matrix adjust.
Identities = 315/728 (43%), Positives = 440/728 (60%), Gaps = 52/728 (7%)
Query: 21 TYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFW 80
++ A ++TYD SLIING REL+ S +IHYPRS P MWP ++++AK+GG+NTI++YVFW
Sbjct: 21 SFSGALSITYDGTSLIINGNRELLYSGSIHYPRSTPEMWPNIIKRAKQGGLNTIQTYVFW 80
Query: 81 NGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTV 140
N HE GK+ F GR +LVKFIK+I++ +Y+ LR+GPF+ AE+ +GG+P WL +PG
Sbjct: 81 NVHEPEQGKFNFSGRADLVKFIKLIEKNGLYVTLRLGPFIQAEWTHGGLPYWLREVPGIF 140
Query: 141 FRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYA 200
FR D EPFK H ++++ +++DMMK EKLFASQGGPIIL Q+ENEY + Y E G Y
Sbjct: 141 FRTDNEPFKEHTERYVKVVLDMMKEEKLFASQGGPIILGQIENEYSAVQRAYKEDGLNYI 200
Query: 201 LWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFT-PHSPSMPKIWTENWPGW 258
WA+K+ + ++G+PW+MC+Q D PDP+IN CN +C D F P+ + P +WTENW
Sbjct: 201 KWASKLVHSMDLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKDNKPSLWTENWTTQ 260
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAP 318
F+ FG R EDIA+SVARFF K G+ NYYMYHGGTNFGRT+ ++TT Y +AP
Sbjct: 261 FRVFGDPPAQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYDDAP 319
Query: 319 IDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD-SSGACAAFL 377
+DE+GL R PK+GHLK LH A+ LC+ ALL G+ + E Y + CAAFL
Sbjct: 320 LDEFGLEREPKYGHLKHLHNALNLCKKALLWGQPRVEKPSNETEIRYYEQPGTKVCAAFL 379
Query: 378 ANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEA 437
AN + + + + FR Y +P S+SILPDCK VV+NT + + ++ N S+
Sbjct: 380 ANNNTEAAEKIKFRGKEYLIPHRSISILPDCKTVVYNTGEIISHHTS-----RNFMKSKK 434
Query: 438 SPDNGSKGLKWQVFKEI--AGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEE 495
+ +K ++VF E + I G++ F+ V+ TKD +DY WYTTS +++N+
Sbjct: 435 A----NKNFDFKVFTESVPSKIKGDS-FIP---VELYGLTKDESDYGWYTTSFKIDDNDL 486
Query: 496 FLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSM 555
K G +P L I S GHALH + N E G+ G+ F ++ P++LK G+N + +L +
Sbjct: 487 SKKKGGKPNLRIASLGHALHVWLNGEYLGNGHGSHEEKSFVFQKPVTLKEGENHLTMLGV 546
Query: 556 TVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYS-WTYKIGLQGEHLGIYNPGYRNN 614
G ++G + E G SV I G SGTLDL+ + W K+G++GE LGI+
Sbjct: 547 LTGFPDSGSYMEHRYTGPRSVSILGLGSGTLDLTEENKWGNKVGMEGERLGIHAEEGLKK 606
Query: 615 INW--VSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKS 672
+ W S EP +TWY+ P + M MGKGL W+NGE +GRYW
Sbjct: 607 VKWEKASGKEP----GMTWYQTYFDAPESQSAAAIRMNGMGKGLIWVNGEGVGRYWM--- 659
Query: 673 RKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGG-D 731
++ G+P+Q YHIPRS+ KP +N+LVIFEE+
Sbjct: 660 ----------------------SFLSPLGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVK 697
Query: 732 PTKITFSI 739
P I F I
Sbjct: 698 PELIDFVI 705
>gi|30679742|ref|NP_179264.2| beta-galactosidase 13 [Arabidopsis thaliana]
gi|75265629|sp|Q9SCU9.1|BGL13_ARATH RecName: Full=Beta-galactosidase 13; Short=Lactase 13; Flags:
Precursor
gi|6686898|emb|CAB64749.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|330251438|gb|AEC06532.1| beta-galactosidase 13 [Arabidopsis thaliana]
Length = 848
Score = 589 bits (1519), Expect = e-165, Method: Compositional matrix adjust.
Identities = 315/721 (43%), Positives = 436/721 (60%), Gaps = 52/721 (7%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD SLIING REL+ S +IHYPRS P MWP ++++AK+GG+NTI++YVFWN HE
Sbjct: 44 VTYDGTSLIINGNRELLYSGSIHYPRSTPEMWPNIIKRAKQGGLNTIQTYVFWNVHEPEQ 103
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GK+ F GR +LVKFIK+I++ +Y+ LR+GPF+ AE+ +GG+P WL +PG FR D EP
Sbjct: 104 GKFNFSGRADLVKFIKLIEKNGLYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNEP 163
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK H ++++ +++DMMK EKLFASQGGPIIL Q+ENEY + Y E G Y WA+K+
Sbjct: 164 FKEHTERYVKVVLDMMKEEKLFASQGGPIILGQIENEYSAVQRAYKEDGLNYIKWASKLV 223
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFT-PHSPSMPKIWTENWPGWFKTFGGR 265
+ ++G+PW+MC+Q D PDP+IN CN +C D F P+ + P +WTENW F+ FG
Sbjct: 224 HSMDLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKDNKPSLWTENWTTQFRVFGDP 283
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
R EDIA+SVARFF K G+ NYYMYHGGTNFGRT+ ++TT Y +AP+DE+GL
Sbjct: 284 PAQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYDDAPLDEFGLE 342
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD-SSGACAAFLANMDDKN 384
R PK+GHLK LH A+ LC+ ALL G+ + E Y + CAAFLAN + +
Sbjct: 343 REPKYGHLKHLHNALNLCKKALLWGQPRVEKPSNETEIRYYEQPGTKVCAAFLANNNTEA 402
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
+ + FR Y +P S+SILPDCK VV+NT + + ++ N S+ + +K
Sbjct: 403 AEKIKFRGKEYLIPHRSISILPDCKTVVYNTGEIISHHTS-----RNFMKSKKA----NK 453
Query: 445 GLKWQVFKEI--AGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
++VF E + I G++ F+ V+ TKD +DY WYTTS +++N+ K G +
Sbjct: 454 NFDFKVFTESVPSKIKGDS-FIP---VELYGLTKDESDYGWYTTSFKIDDNDLSKKKGGK 509
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
P L I S GHALH + N E G+ G+ F ++ P++LK G+N + +L + G ++
Sbjct: 510 PNLRIASLGHALHVWLNGEYLGNGHGSHEEKSFVFQKPVTLKEGENHLTMLGVLTGFPDS 569
Query: 563 GPFYEWVGAGITSVKITGFNSGTLDLSTYS-WTYKIGLQGEHLGIYNPGYRNNINW--VS 619
G + E G SV I G SGTLDL+ + W K+G++GE LGI+ + W S
Sbjct: 570 GSYMEHRYTGPRSVSILGLGSGTLDLTEENKWGNKVGMEGERLGIHAEEGLKKVKWEKAS 629
Query: 620 TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHD 679
EP +TWY+ P + M MGKGL W+NGE +GRYW
Sbjct: 630 GKEP----GMTWYQTYFDAPESQSAAAIRMNGMGKGLIWVNGEGVGRYWM---------- 675
Query: 680 ECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGG-DPTKITFS 738
++ G+P+Q YHIPRS+ KP +N+LVIFEE+ P I F
Sbjct: 676 ---------------SFLSPLGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELIDFV 720
Query: 739 I 739
I
Sbjct: 721 I 721
>gi|357133576|ref|XP_003568400.1| PREDICTED: beta-galactosidase 7-like [Brachypodium distachyon]
Length = 821
Score = 585 bits (1508), Expect = e-164, Method: Compositional matrix adjust.
Identities = 321/726 (44%), Positives = 425/726 (58%), Gaps = 52/726 (7%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
G VTYD R+L++NG R ++ S +HY RS P MWP ++ +A++GG++ I++YVFWN HE
Sbjct: 37 GEVTYDGRALLLNGTRRMLFSGEMHYTRSTPEMWPKIIAKARKGGIDVIQTYVFWNVHEP 96
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
GKY F GR+N+VKFI+ IQ +Y+ LRIGPF+ AE+ YGG P WLH +P FR D
Sbjct: 97 VQGKYNFEGRYNIVKFIREIQAQGLYVSLRIGPFIEAEWKYGGFPFWLHEVPNITFRTDN 156
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
EPFK HMQ F+T +V+MMK E L+ QGGPII++Q+ENEY E +G GG RY WAA
Sbjct: 157 EPFKQHMQGFVTHMVNMMKNEGLYYPQGGPIIISQIENEYQMVEPAFGPGGPRYVQWAAS 216
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQ--FTPHSPSMPKIWTENWPGWFKTFG 263
+AV GVPW+MC+Q D PDP+INTCN C + P+SP+ P +WTENW + +G
Sbjct: 217 LAVGLQTGVPWMMCKQNDAPDPIINTCNGLICGETFVGPNSPNKPALWTENWTTRYPIYG 276
Query: 264 GRDPHRPSEDIAFSVARFF-QKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
R + DI F+VA F +KGGS +YYMYHGGTNFGR A ++TTSY AP+DEY
Sbjct: 277 NDTKLRSTGDITFAVALFIARKGGSFVSYYMYHGGTNFGRFASS-YVTTSYYDGAPLDEY 335
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
GL P WGHLKELH A+KL LL G SN SLG QEA V+ ++ C AFL N D
Sbjct: 336 GLIWQPTWGHLKELHAAVKLSSEPLLYGTYSNFSLGEDQEAHVF-ETKLKCVAFLVNFDK 394
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQ--SSTVEMVPENLQPSEASPD 440
TV+FRN+S L S+SIL DC+ VVF T V AQ S T E+V ++L +
Sbjct: 395 HQRPTVIFRNISLQLAPKSISILSDCRTVVFETGKVNAQHGSRTAEVV-QSLNDTHT--- 450
Query: 441 NGSKGLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKN 499
W+ FKE I +A + +H++TTKD TDYLWY S +++
Sbjct: 451 -------WKAFKESIPQDISKAAYTGKQLFEHLSTTKDETDYLWYIASYEYRPSDD---- 499
Query: 500 GSRPVLL-IESKGHALHAFANQELQGSASG-NGTHPPFKYKNPISLKAGKNEIALLSMTV 557
S VLL +ES+ H LHAF N E GS G +G ISLK G+N I+LL++ V
Sbjct: 500 -SHLVLLNVESQAHILHAFVNGEFVGSVHGSHGARGYIILNMTISLKEGQNTISLLNVMV 558
Query: 558 GLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINW 617
G ++G E GI V I L+ W Y++GL GE IY +++ W
Sbjct: 559 GSPDSGAHMERRSFGIHKVSIQQGQHALHLLNNELWGYQVGLFGEGNRIYTQEGSHSVEW 618
Query: 618 VSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSP 677
+ + PLTWY+ P G++ + L++ MGKG W+NGE IGRYW S
Sbjct: 619 -TDVNNLTYLPLTWYQTTFATPMGNDAVTLNLTSMGKGEVWINGESIGRYWVSFKTPS-- 675
Query: 678 HDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITF 737
G+PSQ YHIP+ + K ++N+LV+ EE GG+P +IT
Sbjct: 676 -----------------------GQPSQSLYHIPQHFLKNTDNLLVLVEEMGGNPLQITV 712
Query: 738 SIRKIS 743
+ I+
Sbjct: 713 NTVSIT 718
>gi|297798422|ref|XP_002867095.1| beta-galactosidase 11 [Arabidopsis lyrata subsp. lyrata]
gi|297312931|gb|EFH43354.1| beta-galactosidase 11 [Arabidopsis lyrata subsp. lyrata]
Length = 844
Score = 583 bits (1504), Expect = e-164, Method: Compositional matrix adjust.
Identities = 312/706 (44%), Positives = 425/706 (60%), Gaps = 45/706 (6%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD SLII+G+REL+ S +IHYPRS P MWP ++++AK+GG+NTI++YVFWN HE
Sbjct: 40 VTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHEPQQ 99
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GK+ F GR +LVKFIK+I++ MY+ LR+GPF+ AE+ +GG+P WL +PG FR D +P
Sbjct: 100 GKFNFSGRADLVKFIKLIEKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNKP 159
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK H ++++ +I+D MK E+LFASQGGPIIL Q+ENEY + Y + G Y WA+K+
Sbjct: 160 FKEHTERYVRMILDKMKEERLFASQGGPIILGQIENEYSAVQRAYKQDGLNYIKWASKLV 219
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFT-PHSPSMPKIWTENWPGWFKTFGGR 265
+ +G+PW+MC+Q D PDP+IN CN +C D F P+ + P +WTENW F+ FG
Sbjct: 220 DSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKENKPSLWTENWTTQFRVFGDP 279
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
R EDIA+SVARFF K GS NYYMYHGGTNFGRT+ ++TT Y +AP+DEYGL
Sbjct: 280 PTQRSVEDIAYSVARFFSKNGSHVNYYMYHGGTNFGRTSAH-YVTTRYYDDAPLDEYGLE 338
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD-SSGACAAFLANMDDKN 384
R PK+GHLK LH A+ LC+ LL G+ G E Y + CAAFLAN + +
Sbjct: 339 REPKYGHLKHLHSALNLCKKPLLWGQPKTEKPGKDTEIRYYEQPGTKTCAAFLANNNTEA 398
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
+T+ F+ Y + S+SILPDCK VV+NTA + +Q ++ N S+ + +K
Sbjct: 399 AETIKFKGREYVIAPRSISILPDCKTVVYNTAQIVSQHTS-----RNFMKSKKA----NK 449
Query: 445 GLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
++VF E E + V+ TKD TDY WYTTS V++N K G +
Sbjct: 450 KFDFKVFTETLPSKLEGNSYIP--VELYGLTKDKTDYGWYTTSFKVHKNHLPTKKGVKTF 507
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
+ I S GHALH + N E GS G+ F ++ ++LKAG+N + +L + G ++G
Sbjct: 508 VRIASLGHALHIWLNGEYLGSGHGSHEEKSFVFQKQVTLKAGENHLIMLGVLTGFPDSGS 567
Query: 565 FYEWVGAGITSVKITGFNSGTLDLSTYS-WTYKIGLQGEHLGIYNPGYRNNINWVS-TME 622
+ E G V I G SGTLDL+ S W KIG++GE LGI+ + W T +
Sbjct: 568 YMEHRYTGPRGVSILGLTSGTLDLTESSKWGNKIGMEGEKLGIHTEEGLKKVEWKKFTGK 627
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
P LTWY+A P + M MGKGL W+NGE +GRYW
Sbjct: 628 APG---LTWYQAYFDAPESLNAAAIRMNGMGKGLIWVNGEGVGRYW-------------- 670
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEK 728
++ G+P+Q YHIPRS+ KP +N+LVIFEE+
Sbjct: 671 -----------QSFLSPLGQPTQIEYHIPRSFLKPKKNLLVIFEEE 705
>gi|12323389|gb|AAG51670.1|AC010704_14 putative beta-galactosidase, 3' partial; 3669-1 [Arabidopsis
thaliana]
Length = 636
Score = 583 bits (1504), Expect = e-164, Method: Compositional matrix adjust.
Identities = 300/635 (47%), Positives = 391/635 (61%), Gaps = 22/635 (3%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
NVTYD RSLII+G +++ S +IHY RS P MWP L+ +AK GG++ +++YVFWN HE
Sbjct: 22 VANVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVVDTYVFWNVHE 81
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
G++ F G ++VKFIK ++ +Y+ LRIGPF+ E++YGG+P WLH + G VFR D
Sbjct: 82 PQQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGIVFRTD 141
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFKYHM+++ +IV +MK E L+ASQGGPIIL+Q+ENEYG + + GK Y W A
Sbjct: 142 NEPFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVGRAFRQEGKSYVKWTA 201
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIWTENWPGWFKTF 262
K+AV + GVPW+MC+Q D PDP++N CN C + P+SP+ P IWTENW +++T+
Sbjct: 202 KLAVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSFYQTY 261
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
G R +EDIAF VA F K GS NYYMYHGGTNFGR A F+ TSY +AP+DEY
Sbjct: 262 GEEPLIRSAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVITSYYDQAPLDEY 320
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
GL R PKWGHLKELH A+KLCE LL+G ++ +SLG Q A V+ + CAA L N D
Sbjct: 321 GLLRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFGKKANLCAAILVN-QD 379
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
K + TV FRN SY L SVS+LPDCK V FNTA V AQ +T + + N
Sbjct: 380 KCESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYNT---------RTRKARQNL 430
Query: 443 SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
S W+ F E + E ++H+NTT+DT+DYLW TT +E G+
Sbjct: 431 SSPQMWEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQTTRFQQSE-------GAP 483
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
VL + GHALHAF N GS G F + +SL G N +ALLS+ VGL N+
Sbjct: 484 SVLKVNHLGHALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLALLSVMVGLPNS 543
Query: 563 GPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
G E G SVKI L + YSW Y++GL+GE +Y + W
Sbjct: 544 GAHLERRVVGSRSVKIWN-GRYQLYFNNYSWGYQVGLKGEKFHVYTEDGSAKVQW-KQYR 601
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLA 657
K+QPLTWYKA P G++P+ L++ MGKG A
Sbjct: 602 DSKSQPLTWYKASFDTPEGEDPVALNLGSMGKGEA 636
>gi|147843477|emb|CAN82062.1| hypothetical protein VITISV_016430 [Vitis vinifera]
Length = 773
Score = 581 bits (1497), Expect = e-163, Method: Compositional matrix adjust.
Identities = 316/726 (43%), Positives = 424/726 (58%), Gaps = 78/726 (10%)
Query: 23 CFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNG 82
C A +T D+R ++ING R+++IS ++HYPRS P MWP L+Q++K+GG+NTI++YVFW+
Sbjct: 21 CNADQITSDARGIMINGERKILISGSVHYPRSTPEMWPDLIQKSKDGGLNTIDTYVFWDL 80
Query: 83 HELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFR 142
HE +Y F G +LV+FIK IQ +Y +LRIGP+V AE+ YGG PVWLH P R
Sbjct: 81 HEPQRRQYDFTGNKDLVRFIKAIQAQGLYAVLRIGPYVCAEWTYGGFPVWLHNQPSIQLR 140
Query: 143 NDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALW 202
+ + +ENEYG Y + G +Y W
Sbjct: 141 TNNTVY-------------------------------MIENEYGNVMRAYHDAGVQYINW 169
Query: 203 AAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTF 262
A+MA A + GVPWIMCQQ + P P+INTCN +YCDQFTP++P+ PK+WTENW GW+K +
Sbjct: 170 CAQMAAALDTGVPWIMCQQDNAPQPMINTCNGYYCDQFTPNNPNSPKMWTENWSGWYKNW 229
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
GG DPHR +ED+AFSVARF+Q GG+ NYYMYHGGTNFGRTAGGP+ITTSYDY+AP++EY
Sbjct: 230 GGSDPHRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLNEY 289
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
G PKWGHL++LH + E AL G+ N+ + A +Y+ G + F N +
Sbjct: 290 GNKNQPKWGHLRDLHLLLLSMEKALTYGDVKNVDYETLTSATIYS-YQGKSSCFFGNSNA 348
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
D T+ + V+Y +PAWSVSILPDC V+NTA V +Q ST + SEA +N
Sbjct: 349 DRDVTINYGGVNYTIPAWSVSILPDCSNEVYNTAKVNSQYSTFVK-----KGSEA--ENE 401
Query: 443 SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
L+W E ++ G VD N D +W G
Sbjct: 402 PNSLQWTWRGET------IQYITPGSVDISN-----DDPIW----------------GKD 434
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
L + + GH LHAF N E G F+++ I+L+ GKNEI LLS+TVGL N
Sbjct: 435 LTLSVNTSGHILHAFVNGEHIGYQYALLGQFEFQFRRSITLQLGKNEITLLSVTVGLTNY 494
Query: 563 GPFYEWVGAGITS-VKITGFNSGTLDL-----STYSWTYKIGLQGEHLGIYNPGYRNNIN 616
GP ++ V GI V+I N G+ D+ + W YK GL GE I+ R N
Sbjct: 495 GPDFDMVNQGIHGPVQIIASN-GSADIIKDLSNNNQWAYKAGLNGEDKKIFLGRARYN-Q 552
Query: 617 WVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSS 676
W S P N+ WYKA PPG++P+ +D++ +GKG AW+NG +GRYWP +
Sbjct: 553 WKSD-NLPVNRSFVWYKATFDAPPGEDPVVVDLMGLGKGEAWVNGHSLGRYWPSYIARG- 610
Query: 677 PHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKIT 736
+ C ECDYRG + +KC T CG PSQRWYH+PRS+ ++N LV+FEE G+P+ +T
Sbjct: 611 --EGCSPECDYRGPYKAEKCNTNCGNPSQRWYHVPRSFLASTDNRLVLFEEFXGNPSSVT 668
Query: 737 FSIRKI 742
F +
Sbjct: 669 FQTVTV 674
>gi|414870185|tpg|DAA48742.1| TPA: hypothetical protein ZEAMMB73_126543 [Zea mays]
Length = 706
Score = 580 bits (1495), Expect = e-163, Method: Compositional matrix adjust.
Identities = 290/645 (44%), Positives = 408/645 (63%), Gaps = 17/645 (2%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD RSL+ +G RE+ +S +IHYPRS P MWP L+ +AKEGG+NTIE+YVFWN HE
Sbjct: 43 VSYDRRSLMFDGHREIFLSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYVFWNIHEPEK 102
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G++ F G+ ++V+F ++IQ+ MY ++R+GPF+ AE+N+GG+P WL IP VFR + EP
Sbjct: 103 GEFNFEGQNDVVRFFQLIQEHDMYAMVRLGPFIQAEWNHGGLPYWLREIPDIVFRTNNEP 162
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
+K HM+ F+ +I+ +K LFASQGGPIILAQ+ENEY + E+ + + G +Y WAAKMA
Sbjct: 163 YKMHMETFVKIIIKRLKDANLFASQGGPIILAQIENEYQHMEAAFKDEGTKYINWAAKMA 222
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIWTENWPGWFKTFGGR 265
++ NIG+PWIMC+Q P VI TCN C P + SMP +WTENW ++ FG
Sbjct: 223 ISTNIGIPWIMCKQTKAPSDVIPTCNGRNCGDTWPGPTNKSMPLLWTENWTAQYRVFGDP 282
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
R +EDIAF+VARFF GG++ NYYMYHGGTNFGRT+ F+ Y EAP+DE+GL
Sbjct: 283 PSQRSAEDIAFAVARFFSVGGTLANYYMYHGGTNFGRTSAA-FVMPKYYDEAPLDEFGLY 341
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVY-ADSSGACAAFLANMDDKN 384
+ PKWGHL++LH A+KLC+ ALL G S LG EA V+ C AFL+N + K+
Sbjct: 342 KEPKWGHLRDLHQALKLCKKALLWGTPSTEKLGKQLEARVFEMPEQKVCVAFLSNHNTKD 401
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
D T+ FR Y +P S+S+L DC+ VVF T +V AQ + Q + D ++
Sbjct: 402 DATMTFRGRPYFVPRHSISVLADCETVVFGTQHVNAQHN---------QRTFHFADQTAQ 452
Query: 445 GLKWQVFK-EIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
W++F E + +A D N TKD TDY+WYT+S + ++ +++ +
Sbjct: 453 NNVWEMFDGENVPKYKQAKIRLRKAGDLYNLTKDKTDYVWYTSSFKLEADDMPIRSDIKT 512
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
VL + S GHA AF N + G G + F + P+ LK G N +A+L+ ++G+ ++G
Sbjct: 513 VLEVNSHGHASVAFVNNKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAVLASSMGMTDSG 572
Query: 564 PFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEP 623
+ E AG+ V+ITG N+GTLDL+ W + +GL GE IY ++ W M
Sbjct: 573 AYMEHRLAGVDRVQITGLNAGTLDLTNNGWGHIVGLVGERKQIYTDKGMGSVTWKPAMN- 631
Query: 624 PKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYW 668
++PLTWYK P G++P+ LDM MGKG+ ++NG+ IGRYW
Sbjct: 632 --DRPLTWYKRHFDMPSGEDPVVLDMSTMGKGMMFVNGQGIGRYW 674
>gi|18418558|ref|NP_567973.1| beta-galactosidase 11 [Arabidopsis thaliana]
gi|75202765|sp|Q9SCV1.1|BGL11_ARATH RecName: Full=Beta-galactosidase 11; Short=Lactase 11; Flags:
Precursor
gi|6686894|emb|CAB64747.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332661046|gb|AEE86446.1| beta-galactosidase 11 [Arabidopsis thaliana]
Length = 845
Score = 578 bits (1491), Expect = e-162, Method: Compositional matrix adjust.
Identities = 311/718 (43%), Positives = 428/718 (59%), Gaps = 46/718 (6%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD SLII+G+REL+ S +IHYPRS P MWP ++++AK+GG+NTI++YVFWN HE
Sbjct: 41 VTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHEPQQ 100
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GK+ F GR +LVKFIK+IQ+ MY+ LR+GPF+ AE+ +GG+P WL +PG FR D +
Sbjct: 101 GKFNFSGRADLVKFIKLIQKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNKQ 160
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK H ++++ +I+D MK E+LFASQGGPIIL Q+ENEY + Y + G Y WA+ +
Sbjct: 161 FKEHTERYVRMILDKMKEERLFASQGGPIILGQIENEYSAVQRAYKQDGLNYIKWASNLV 220
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQF-TPHSPSMPKIWTENWPGWFKTFGGR 265
+ +G+PW+MC+Q D PDP+IN CN +C D F P+ + P +WTENW F+ FG
Sbjct: 221 DSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNRENKPSLWTENWTTQFRVFGDP 280
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
R EDIA+SVARFF K G+ NYYMYHGGTNFGRT+ ++TT Y +AP+DEYGL
Sbjct: 281 PTQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYDDAPLDEYGLE 339
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD-SSGACAAFLANMDDKN 384
+ PK+GHLK LH A+ LC+ LL G+ G E Y + CAAFLAN + +
Sbjct: 340 KEPKYGHLKHLHNALNLCKKPLLWGQPKTEKPGKDTEIRYYEQPGTKTCAAFLANNNTEA 399
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
+T+ F+ Y + S+SILPDCK VV+NTA + +Q ++ N S+ + +K
Sbjct: 400 AETIKFKGREYVIAPRSISILPDCKTVVYNTAQIVSQHTS-----RNFMKSKKA----NK 450
Query: 445 GLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
++VF E E + V+ TKD TDY WYTTS V++N K G +
Sbjct: 451 KFDFKVFTETLPSKLEGNSYIP--VELYGLTKDKTDYGWYTTSFKVHKNHLPTKKGVKTF 508
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
+ I S GHALHA+ N E GS G+ F ++ ++LKAG+N + +L + G ++G
Sbjct: 509 VRIASLGHALHAWLNGEYLGSGHGSHEEKSFVFQKQVTLKAGENHLVMLGVLTGFPDSGS 568
Query: 565 FYEWVGAGITSVKITGFNSGTLDLSTYS-WTYKIGLQGEHLGIYNPGYRNNINWVS-TME 622
+ E G + I G SGTLDL+ S W KIG++GE LGI+ + W T +
Sbjct: 569 YMEHRYTGPRGISILGLTSGTLDLTESSKWGNKIGMEGEKLGIHTEEGLKKVEWKKFTGK 628
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
P LTWY+ P + M MGKGL W+NGE +GRYW
Sbjct: 629 APG---LTWYQTYFDAPESVSAATIRMHGMGKGLIWVNGEGVGRYW-------------- 671
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGG-DPTKITFSI 739
++ G+P+Q YHIPRS+ KP +N+LVIFEE+ P + F+I
Sbjct: 672 -----------QSFLSPLGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELMDFAI 718
>gi|222618606|gb|EEE54738.1| hypothetical protein OsJ_02090 [Oryza sativa Japonica Group]
Length = 713
Score = 577 bits (1488), Expect = e-162, Method: Compositional matrix adjust.
Identities = 311/677 (45%), Positives = 408/677 (60%), Gaps = 65/677 (9%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+V+YD RSL+I+G+R +I+S +IHYPRS P MWP L+++AKEGG++ IE+Y+FWNGHE
Sbjct: 30 SVSYDDRSLVIDGQRRIILSGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYIFWNGHEPH 89
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
+Y F G +++V+F K IQ A MY ILRIGP++ E+NYGG+P WL IPG FR E
Sbjct: 90 RRQYNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNE 149
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFY--GEGGKRYALWAA 204
PF+ M+ F TLIV+ MK K+FA QGGPIILAQ+ENEYG + Y W A
Sbjct: 150 PFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIHWCA 209
Query: 205 KMAVAQNIGVPWIMCQQ-FDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFG 263
MA QN+GVPWIMCQQ D P V+NTCN FYC + P+ +PKIWTENW GWFK +
Sbjct: 210 DMANKQNVGVPWIMCQQDDDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWD 269
Query: 264 GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYG 323
D HR +EDIAF+VA FFQK GS+ NYYMYHGGTNFGRT+GGP+ITTSYDY+AP+DEYG
Sbjct: 270 KPDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYG 329
Query: 324 LPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDD 382
R PK+GHLKELH +K E L++GE + + G + Y DSS AC F+ N D
Sbjct: 330 NLRQPKYGHLKELHSVLKSMEKTLVHGEYFDTNYGDNITVTKYTLDSSSAC--FINNRFD 387
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
D V ++ LPAWSVSILPDCK V FN+A ++ Q+S + P + + S
Sbjct: 388 DKDVNVTLDGATHLLPAWSVSILPDCKTVAFNSAKIKTQTSVMVKKPNTAEQEQES---- 443
Query: 443 SKGLKWQVFKEIAGIW---GEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKN 499
LKW E + + +F K+ ++ I T+ D +DYLWY TS+ N E
Sbjct: 444 ---LKWSWMPENLSPFMTDEKGNFRKNELLEQIVTSTDQSDYLWYRTSL--NHKGE---- 494
Query: 500 GSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGL 559
GS L + + GH L+AF N +L G F+ ++P+ L GKN I+LLS TVGL
Sbjct: 495 GSYK-LYVNTTGHELYAFVNGKLIGKNHSADGDFVFQLESPVKLHDGKNYISLLSATVGL 553
Query: 560 QNAGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINW 617
+N GP +E + GI VK+ N +DLS SW+
Sbjct: 554 KNYGPSFEKMPTGIVGGPVKLIDSNGTAIDLSNSSWS----------------------- 590
Query: 618 VSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSP 677
YKA + P G++P+ +D+L + KG+AW+NG +GRYWP S ++
Sbjct: 591 --------------YKATFEAPSGEDPVVVDLLGLNKGVAWVNGNNLGRYWP--SYTAAE 634
Query: 678 HDECVQECDYRGKFNPD 694
C CDYRG F +
Sbjct: 635 MAGC-HRCDYRGAFQAE 650
>gi|449454199|ref|XP_004144843.1| PREDICTED: beta-galactosidase 13-like [Cucumis sativus]
gi|449506996|ref|XP_004162905.1| PREDICTED: beta-galactosidase 13-like [Cucumis sativus]
Length = 766
Score = 577 bits (1486), Expect = e-162, Method: Compositional matrix adjust.
Identities = 306/681 (44%), Positives = 413/681 (60%), Gaps = 41/681 (6%)
Query: 58 MWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIG 117
MW ++ +A+ GG+N I++YVFWN HE G++ F G ++LVKFIK+I + +MY+ LR+G
Sbjct: 1 MWSDILDKARRGGLNVIQTYVFWNIHEPVEGQFNFEGNYDLVKFIKLIGEKQMYVTLRVG 60
Query: 118 PFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPII 177
PF+ AE+N+GG+P WL P +FR+ FK++M+K++ +IVDMMK KLFASQGGPI+
Sbjct: 61 PFIQAEWNHGGLPYWLREKPNIIFRSYNSQFKHYMKKYVAMIVDMMKENKLFASQGGPIV 120
Query: 178 LAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYC 237
LAQ+ENEY + + Y E G +Y WAA MAV +GVPWIMC+Q D PDPVINTCN +C
Sbjct: 121 LAQIENEYNHVQLAYDELGVQYVQWAANMAVGLGVGVPWIMCKQKDAPDPVINTCNGRHC 180
Query: 238 -DQFT-PHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYH 295
D FT P+ P P +WTENW ++ FG R +EDIAFSVARFF K GS+ NYYMYH
Sbjct: 181 GDTFTGPNKPYKPALWTENWTAQYRVFGDPPSQRAAEDIAFSVARFFSKNGSLVNYYMYH 240
Query: 296 GGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNL 355
GGTNFGRT+ F TT Y EAP+DE+GL R PKWGHL+++H A+ LC+ LL G
Sbjct: 241 GGTNFGRTSA-VFTTTRYYDEAPLDEFGLQREPKWGHLRDVHKALNLCKKPLLWGTPGIQ 299
Query: 356 SLGSSQEADVYAD-SSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFN 414
+G EA Y + CAAFLAN D K+ +T+ FR + LP S+SILPDCK VVFN
Sbjct: 300 VIGKGLEARFYEKPGTNICAAFLANNDTKSAQTINFRGREFLLPPRSISILPDCKTVVFN 359
Query: 415 TANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINT 474
T + +Q + +P N +K LKW++ E + ++ +
Sbjct: 360 TETIVSQHNARNFIPSK---------NANK-LKWKMSPESIPTVEQVPVNNKIPLELYSL 409
Query: 475 TKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPP 534
KDTTDY WYTTSI +++ + + PVL I S GHA+ F N E G+A G+
Sbjct: 410 LKDTTDYGWYTTSIELDKEDVSKRPDILPVLRIASLGHAMLVFVNGEYIGTAHGSHEEKN 469
Query: 535 FKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWT 594
F ++ + KAG N IALL + VGL ++G + E AG S+ I G N+GTLD+S W
Sbjct: 470 FVFQGSVPFKAGVNNIALLGILVGLPDSGAYMEHRFAGPRSITILGLNTGTLDISKNGWG 529
Query: 595 YKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGK 654
+++ LQGE + ++ G + ++W E + LTWYK P G++P+ + M MGK
Sbjct: 530 HQVALQGEKVKVFTQGGSHRVDWSEIKE--EKSALTWYKTYFDAPEGNDPVAIRMNGMGK 587
Query: 655 GLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSW 714
G W+NG+ IGRYW +P K T Q YHIPRS+
Sbjct: 588 GQIWVNGKSIGRYW-------------------MSYLSPLKLST------QSEYHIPRSF 622
Query: 715 FKPSENILVIFEEKGGDPTKI 735
KPSEN+LVI EE+ P K+
Sbjct: 623 IKPSENLLVILEEENVTPEKV 643
>gi|183604889|gb|ACC64531.1| beta-galactosidase 6 [Oryza sativa Indica Group]
Length = 811
Score = 572 bits (1474), Expect = e-160, Method: Compositional matrix adjust.
Identities = 316/721 (43%), Positives = 411/721 (57%), Gaps = 46/721 (6%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+TYD R+L+++G R + S +HY RS P MWP L+ +AK GG++ I++YVFWN HE
Sbjct: 29 ITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVHEPIQ 88
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G+Y F GR++LVKFI+ IQ +Y+ LRIGPFV AE+ YGG P WLH +P FR+D EP
Sbjct: 89 GQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRSDNEP 148
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK HMQ F+T IV MMK E L+ QGGPII++Q+ENEY E +G G RY WAA MA
Sbjct: 149 FKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWAAAMA 208
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQ--FTPHSPSMPKIWTENWPGWFKTFGGR 265
V GVPW+MC+Q D PDPVINTCN C + P+SP+ P +WTENW + +G
Sbjct: 209 VGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRYPIYGND 268
Query: 266 DPHRPSEDIAFSVARFF-QKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
R EDIAF+VA + +K GS +YYMYHGGTNFGR A ++TTSY AP+DEYGL
Sbjct: 269 TKLRDPEDIAFAVALYIARKKGSFVSYYMYHGGTNFGRFAAS-YVTTSYYDGAPLDEYGL 327
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
P WGHL+ELH A+K LL G SN SLG QEA V+ ++ C AFL N D N
Sbjct: 328 IWQPTWGHLRELHCAVKQSSEPLLFGSYSNFSLGQQQEAHVF-ETDFKCVAFLVNFDQHN 386
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
V FRN+S L S+S+L DC+ VVF TA V AQ + N S +N
Sbjct: 387 TPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNAQHGSRT---ANAVQSLNDINN--- 440
Query: 445 GLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
W+ F E + ++ + + + + TTKD TDYLWY IV+
Sbjct: 441 ---WKAFIEPVPQDLSKSTYTGNQLFEQLPTTKDETDYLWY----IVSYKNRASDGNQIA 493
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNP-ISLKAGKNEIALLSMTVGLQNA 562
L ++S H LHAF N E GS G+ P N +SLK G N I+LLS+ VG ++
Sbjct: 494 RLYVKSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEGDNTISLLSVMVGSPDS 553
Query: 563 GPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
G + E GI +V I L+ W Y++GL GE IY N++ W+ +
Sbjct: 554 GAYMERRTFGIQTVGIQQGQQPMHLLNNDLWGYQVGLFGEKDSIYTQEGPNSVRWMD-IN 612
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
PLTWYK PPG++ + L++ MGKG W+NGE IGRYW S
Sbjct: 613 NLIYHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEVWVNGESIGRYWVSFKAPS------- 665
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
G+PSQ YHIPR + P +N+LV+ EE GGDP +IT + +
Sbjct: 666 ------------------GQPSQSLYHIPRGFLTPKDNLLVLVEEMGGDPLQITVNTMSV 707
Query: 743 S 743
+
Sbjct: 708 T 708
>gi|242045426|ref|XP_002460584.1| hypothetical protein SORBIDRAFT_02g031260 [Sorghum bicolor]
gi|241923961|gb|EER97105.1| hypothetical protein SORBIDRAFT_02g031260 [Sorghum bicolor]
Length = 803
Score = 572 bits (1473), Expect = e-160, Method: Compositional matrix adjust.
Identities = 301/710 (42%), Positives = 417/710 (58%), Gaps = 74/710 (10%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD+RSL+I+G+R+L S AIHYPRS P +WP L+ +AKEGG+NTIE+Y+FWN HE P
Sbjct: 36 VTYDARSLLIDGKRDLFFSGAIHYPRSPPEVWPKLLDRAKEGGLNTIETYIFWNAHEPEP 95
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GKY F GR +LVKF+K+IQ+ MY I+RIGPF+ AE+N+GG+P WL I +FR + +P
Sbjct: 96 GKYNFEGRLDLVKFLKMIQEHGMYAIVRIGPFIQAEWNHGGLPYWLREIDHIIFRANNDP 155
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
+K M+K+ +V +K +LFASQGGP+IL Q+ENEYG + + G +Y WAA+MA
Sbjct: 156 YKKEMEKWTRFVVQKLKDAELFASQGGPVILTQIENEYGNIKKDHKIEGDKYLEWAAQMA 215
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
++ GVPWIMC+Q P VI TCN +C D +T + P +WTENW F+ +G +
Sbjct: 216 LSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWTLRDKNKPMLWTENWTQQFRAYGDQL 275
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
R +EDIA++V RFF KGGS+ NYYMYHGGTNFGRT+ +T YD EAP+DEYG+ +
Sbjct: 276 AMRSAEDIAYAVLRFFAKGGSMVNYYMYHGGTNFGRTSASYVLTGYYD-EAPLDEYGMYK 334
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDDKND 385
PK+GHL++LH I+ + A L+G+ S+ LG EA ++ C +FL+N + D
Sbjct: 335 EPKFGHLRDLHNVIRSYQKAFLSGKHSSEILGHGYEAQIFELPEENLCLSFLSNNNTGED 394
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
TV+FR V +++P+ SVSIL CK VV+NT V Q S + S + + SK
Sbjct: 395 GTVIFRGVKHYVPSRSVSILAGCKDVVYNTKRVFVQHS---------ERSYHTSEVTSKN 445
Query: 446 LKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVL 505
+W+++ E+ + + ++ N TKD +DYLWYTTS + ++ + RPVL
Sbjct: 446 NQWEMYSEMVPKYKDTKIRTKEPLEQYNQTKDASDYLWYTTSFRLESDDLPFRGDIRPVL 505
Query: 506 LIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPF 565
++S H++ FAN GSA GN F ++ P+ LKAG N + LLS T+G++++G
Sbjct: 506 QVKSSAHSMIGFANDAFVGSARGNKQVKGFMFEKPVDLKAGVNHVVLLSSTMGMKDSGGE 565
Query: 566 YEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPK 625
V GI I G N+GTLDL W
Sbjct: 566 LAEVKGGIQECLIQGLNTGTLDLQVNGWG------------------------------- 594
Query: 626 NQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQEC 685
+K +P GD+PI LDM M KG+ ++NGE IGRYW
Sbjct: 595 ------HKRYFDEPDGDDPIVLDMSSMSKGMIFVNGEGIGRYW----------------V 632
Query: 686 DYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
+R T G PSQ YHIPR + KP +N+LV+FEE+ G P I
Sbjct: 633 SFR---------TLAGTPSQAVYHIPRPFLKPKDNLLVVFEEEMGKPDGI 673
>gi|222424922|dbj|BAH20412.1| AT3G13750 [Arabidopsis thaliana]
Length = 625
Score = 570 bits (1469), Expect = e-160, Method: Compositional matrix adjust.
Identities = 279/527 (52%), Positives = 352/527 (66%), Gaps = 16/527 (3%)
Query: 217 IMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAF 276
++C+Q D PDP+IN CN FYCD F+P+ PK+WTE W GWF FGG P+RP+ED+AF
Sbjct: 1 VLCKQDDAPDPIINACNGFYCDYFSPNKAYKPKMWTEAWTGWFTKFGGPVPYRPAEDMAF 60
Query: 277 SVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
SVARF QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+AP+DEYGL R PKWGHLK+L
Sbjct: 61 SVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLERQPKWGHLKDL 120
Query: 337 HGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYH 396
H AIKLCE AL++GE + + LG+ QEA VY SGAC+AFLAN + K+ V F N Y+
Sbjct: 121 HRAIKLCEPALVSGEPTRMPLGNYQEAHVYKSKSGACSAFLANYNPKSYAKVSFGNNHYN 180
Query: 397 LPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAG 456
LP WS+SILPDCK V+NTA V AQ+S ++MV P +G GL WQ + E
Sbjct: 181 LPPWSISILPDCKNTVYNTARVGAQTSRMKMV--------RVPVHG--GLSWQAYNEDPS 230
Query: 457 IWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHA 516
+ + F G V+ INTT+DT+DYLWY T + V+ NE FL+NG P L + S GHA+H
Sbjct: 231 TYIDESFTMVGLVEQINTTRDTSDYLWYMTDVKVDANEGFLRNGDLPTLTVLSAGHAMHV 290
Query: 517 FANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS- 575
F N +L GSA G+ P ++ ++L+AG N+IA+LS+ VGL N GP +E AG+
Sbjct: 291 FINGQLSGSAYGSLDSPKLTFRKGVNLRAGFNKIAILSIAVGLPNVGPHFETWNAGVLGP 350
Query: 576 VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAV 635
V + G N G DLS WTYK+GL+GE L +++ +++ W + QPLTWYK
Sbjct: 351 VSLNGLNGGRRDLSWQKWTYKVGLKGESLSLHSLSGSSSVEWAEGAFVAQKQPLTWYKTT 410
Query: 636 VKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDK 695
P GD P+ +DM MGKG W+NG+ +GR+WP S EC Y G F DK
Sbjct: 411 FSAPAGDSPLAVDMGSMGKGQIWINGQSLGRHWPAYKAVGS-----CSECSYTGTFREDK 465
Query: 696 CITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
C+ CGE SQRWYH+PRSW KPS N+LV+FEE GGDP IT R++
Sbjct: 466 CLRNCGEASQRWYHVPRSWLKPSGNLLVVFEEWGGDPNGITLVRREV 512
>gi|147819335|emb|CAN64508.1| hypothetical protein VITISV_004610 [Vitis vinifera]
Length = 766
Score = 565 bits (1457), Expect = e-158, Method: Compositional matrix adjust.
Identities = 315/725 (43%), Positives = 413/725 (56%), Gaps = 94/725 (12%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
G+VTYD RSLIING+R L+ S +IHYPRS P MWP L+ +AKEGG++ IE+Y FWN HE
Sbjct: 21 GGSVTYDGRSLIINGQRRLLFSGSIHYPRSTPEMWPSLISKAKEGGIDVIETYAFWNQHE 80
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
G+Y F GR ++VKF K +Q +Y LRIGPF+ +E+NYGG+P WLH +PG ++R+D
Sbjct: 81 PKQGQYDFSGRLDIVKFFKEVQAQGLYACLRIGPFIESEWNYGGLPFWLHDVPGIIYRSD 140
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFK++MQ F T IV++MK E L+ASQGGPIIL+Q+ENEY E+ + E G Y WAA
Sbjct: 141 NEPFKFYMQNFTTKIVNLMKSENLYASQGGPIILSQIENEYKNVEAAFHEKGPPYVRWAA 200
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
KMAV + T +Y G
Sbjct: 201 KMAVD-------------------LQTAMRYY---------------------------G 214
Query: 265 RDPH-RPSEDIAFSVARFF-QKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
D R +ED+AF VA F +K GS NYYMYHGGTNFGRT+ +T YD +AP+DEY
Sbjct: 215 EDKRGRAAEDLAFQVALFIAKKNGSFINYYMYHGGTNFGRTSSSYVLTAYYD-QAPLDEY 273
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
GL R PKWGHLKELH IKLC LL G + N SLG QEA ++ SG CAAFL N D
Sbjct: 274 GLIRQPKWGHLKELHAVIKLCSDTLLXGVQYNYSLGQLQEAYLFKRPSGQCAAFLVNNDK 333
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
+ + TV+F+N +Y L A S+SILPDCKK+ FNTA V Q +T + + G
Sbjct: 334 RRNVTVLFQNTNYELAANSISILPDCKKIAFNTAKVSTQFNTRSV--------QTRATFG 385
Query: 443 SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
S +W ++E +G S ++H+ TTKD +DYLWYT I N + ++
Sbjct: 386 STK-QWSEYREGIPSFGGTPLKASMLLEHMGTTKDASDYLWYTLRFIHNSSN------AQ 438
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
PVL ++S H L AF N + SA G+ + F N + L +G N I+LLS+ VGL +A
Sbjct: 439 PVLRVDSLAHVLLAFVNGKYIASAHGSHQNGSFSLVNKVPLNSGLNRISLLSVMVGLPDA 498
Query: 563 GPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
GP+ E AGI V+I + D S + W Y++GL GE L IY + W
Sbjct: 499 GPYLEHKVAGIRRVEIQD-GGXSKDFSKHPWGYQVGLMGEKLQIYTSPGSQKVQWYGLGS 557
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
+ PLTWYK + P G++P+ L MGKG AW+NG+ IGRYW
Sbjct: 558 HGRG-PLTWYKTLFDAPRGNDPVVLFFGSMGKGEAWVNGQSIGRYWV------------- 603
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI---TFSI 739
+T GEPSQ WY++PR++ P N+LV+ EE+ GDP KI T S+
Sbjct: 604 ------------SYLTPSGEPSQTWYNVPRAFLNPKGNLLVVQEEESGDPLKISIGTVSV 651
Query: 740 RKISG 744
+ G
Sbjct: 652 TNVCG 656
>gi|57283676|emb|CAG30724.1| putative beta-galactosidase precursor [Hordeum vulgare]
Length = 833
Score = 564 bits (1454), Expect = e-158, Method: Compositional matrix adjust.
Identities = 305/718 (42%), Positives = 424/718 (59%), Gaps = 47/718 (6%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD RSL+I+G+R+L S AIHYPRS P MW L++ AK+GG+NTIE+YVFWN HE P
Sbjct: 35 VSYDERSLLIDGKRDLFFSGAIHYPRSPPDMWHKLLKTAKDGGLNTIETYVFWNAHEPEP 94
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GKY F GR +L+KF+K+IQ MY ++RIGPF+ AE+N+GG+P WL IP +FR + EP
Sbjct: 95 GKYNFEGRNDLIKFLKLIQSHDMYALVRIGPFIQAEWNHGGLPYWLREIPHIIFRANNEP 154
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
+K M+KF+ IV +K ++FASQGGP+ILAQ+ENEYG + + G +Y WAA+MA
Sbjct: 155 YKKEMEKFVRFIVQKLKDAEMFASQGGPVILAQIENEYGNIKKDHIVEGDKYLEWAAQMA 214
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
++ N GVPWIMC+Q P VI TCN +C D +T + P++WTENW F+ FG +
Sbjct: 215 ISTNTGVPWIMCKQSTAPGEVIPTCNGRHCGDTWTLKDKNKPRLWTENWTAQFRAFGDQL 274
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYM-YHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
R +EDIA+SV RFF KGG++ NYYM Y+GGTNFGRT G ++ T Y E P+DE +P
Sbjct: 275 ALRSAEDIAYSVLRFFAKGGTLVNYYMQYYGGTNFGRT-GASYVLTGYYDEGPVDEC-MP 332
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDDKN 384
+ PK+GHL++LH IK A L G++S L EA + C AF++N +
Sbjct: 333 KAPKYGHLRDLHNLIKSYSRAFLEGKQSFELLAHGYEAHNFEIPEEKLCLAFISNNNTGE 392
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
D TV FR Y++P+ SVSIL DCK VV+NT V Q S + S + +K
Sbjct: 393 DGTVNFRGDKYYIPSRSVSILADCKHVVYNTKRVFVQHS---------ERSFHTAQKLAK 443
Query: 445 GLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
W+++ E + ++ N TKD +DYL + + ++ + RPV
Sbjct: 444 SNAWEMYSEPIPRYKLTSIRNKEPMEQYNLTKDDSDYLCFR----LEADDLPFRGDIRPV 499
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
+ ++S HAL F N G+ G+ F ++ PI+L+ G N +ALLS ++G++++G
Sbjct: 500 VQVKSTSHALMGFVNDAFAGNGRGSKKEKGFMFETPINLRIGINHLALLSSSMGMKDSGG 559
Query: 565 FYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPP 624
V GI I G N+GTLDL W +K+ L+GE IY + WV
Sbjct: 560 ELVEVKGGIQDCTIQGLNTGTLDLQVNGWGHKVKLEGEVKEIYTEKGMGAVKWVPAT--- 616
Query: 625 KNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
+ +TWYK +P G++P+ LDM MGKG+ ++NGE +GRYWP
Sbjct: 617 TGRAVTWYKRYFDEPDGEDPVVLDMTSMGKGMIFVNGEGMGRYWP--------------- 661
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITF-SIRK 741
YR T G PSQ YHIPR + KP N+LVIFEE+ G P I ++R+
Sbjct: 662 -SYR---------TVGGVPSQAMYHIPRPFLKPKNNLLVIFEEELGKPEGILIQTVRR 709
>gi|320170654|gb|EFW47553.1| beta-D-galactosidase [Capsaspora owczarzaki ATCC 30864]
Length = 830
Score = 562 bits (1448), Expect = e-157, Method: Compositional matrix adjust.
Identities = 298/739 (40%), Positives = 419/739 (56%), Gaps = 45/739 (6%)
Query: 21 TYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFW 80
T +A NVTYDSR+L+I+GRR L++S +IHYPRS P MWP L +AK G++ I++Y+FW
Sbjct: 20 TSAYAMNVTYDSRALLIDGRRRLLVSGSIHYPRSTPDMWPELFARAKANGIDVIQTYLFW 79
Query: 81 NGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTV 140
N + +PG++ RF+ V+F+++ Q+A +Y+ RIGPFV AE+ YGG+P WL IP +
Sbjct: 80 NTNVPTPGEFVMSDRFDYVRFVQLAQEAGLYVNFRIGPFVCAEWTYGGLPAWLRQIPDIM 139
Query: 141 FRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYA 200
FR+ +P+ +++T V ++K +L A QGGPIIL Q+ENEYG ES Y GG +Y
Sbjct: 140 FRDYDQPWLQVAGEYITKTVQILKDNRLLAGQGGPIILLQIENEYGGTESRYA-GGPQYV 198
Query: 201 LWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFK 260
W ++A WIMC Q D P +I TCN+FYCD F PH P P +WTENWPGWF+
Sbjct: 199 EWCGQLAANLTDAAQWIMCSQPDAPANIIATCNAFYCDDFVPH-PGQPSMWTENWPGWFQ 257
Query: 261 TFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPID 320
+G PHRP++D+A++V R++ KGGS NYYMYHGGTNF RTAGGPFITT+YDY+A +D
Sbjct: 258 KWGDPTPHRPAQDVAYAVTRYYIKGGSYMNYYMYHGGTNFERTAGGPFITTNYDYDASLD 317
Query: 321 EYGLPRNPKWGHLKELHGAIKLCEHALLNGERSN-LSLGSSQEADVYADSSGACAAFLAN 379
EYG+P PK+ HL +H + E ++ +SLG++ EA +Y +SS C AFL+N
Sbjct: 318 EYGMPNEPKYSHLGSMHAVLHDNEAIMMAVPAPKPISLGTNLEAHIY-NSSVGCVAFLSN 376
Query: 380 MDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQS--------------STV 425
++K D V F +Y LPAWSVS+L C ++NTA RA
Sbjct: 377 NNNKTDVEVQFNGRTYELPAWSVSVLHGCVTAIYNTAVCRAHQRAPHDAACCARESRRVC 436
Query: 426 EMVPENLQPSEASPDNGS--KGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLW 483
+ +P L+P +P + L V I + ++ I+ T D TDYLW
Sbjct: 437 DRLPP-LRPKARAPCQSGRIRHLCLVVLTSIGPQAPATKYWNKTPLEQIDQTLDHTDYLW 495
Query: 484 YTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISL 543
Y+TS + + S P + + + F G+ S +SL
Sbjct: 496 YSTSYV--SSSATYAQLSLPQITDVAYVYVNGKFVTVSWSGNVSAT-----------VSL 542
Query: 544 KAGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEH 603
AG N I +LS+T+GL N G G+ + G G+++L+ W ++ G+ GE
Sbjct: 543 VAGPNTIDILSLTMGLDNGGDILSEYNCGL----LGGVYLGSVNLTENGWWHQTGVVGER 598
Query: 604 LGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDE-PIGLDMLKMGKGLAWLNGE 662
I+ P + W T N LTWYK+ P + P+ LD+ MGKG W+NG
Sbjct: 599 NAIFLPENLKKVAW--TTPAVLNTGLTWYKSSFDVPRDSQAPLALDLTGMGKGYVWVNGH 656
Query: 663 EIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENIL 722
+GRYWP + P D CDYRG ++ C GC PSQ YH+PR W + N+L
Sbjct: 657 NLGRYWPTILATNWPCD----VCDYRGTYDAPHCKQGCNMPSQTHYHVPREWLQAENNVL 712
Query: 723 VIFEEKGGDPTKITFSIRK 741
V+ EE GG+P+KI R+
Sbjct: 713 VLLEEMGGNPSKIALVERE 731
>gi|326496501|dbj|BAJ94712.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 672
Score = 561 bits (1447), Expect = e-157, Method: Compositional matrix adjust.
Identities = 301/652 (46%), Positives = 394/652 (60%), Gaps = 29/652 (4%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
G VTYD R+L++NG R ++ S +HY RS P MWP L+ AK+GG++ I++YVFWN HE
Sbjct: 38 GEVTYDGRALVVNGTRRMLFSGEMHYTRSTPEMWPKLIANAKKGGLDVIQTYVFWNVHEP 97
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
G+Y F GR++LVKFI+ IQ +Y+ LRIGPF+ AE+ YGG P WLH +P FR D
Sbjct: 98 VQGQYNFQGRYDLVKFIREIQTQGLYVSLRIGPFIEAEWKYGGFPFWLHDVPNITFRTDN 157
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
EPFK HMQ+F+T IV+MMK E L+ QGGPII++Q+ENEY E +G GG RY WAA+
Sbjct: 158 EPFKQHMQRFVTQIVNMMKHEGLYYPQGGPIIISQIENEYQMVEPAFGSGGPRYVRWAAE 217
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQ--FTPHSPSMPKIWTENWPGWFKTFG 263
MAV GVPW+MC+Q D PDP+INTCN C + P+SP+ P +WTENW + +G
Sbjct: 218 MAVGLQTGVPWMMCKQNDAPDPIINTCNGLICGETFVGPNSPTKPALWTENWTTRYPIYG 277
Query: 264 GRDPHRPSEDIAFSVARFF-QKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
R +EDIAF+VA F +K GS +YYMYHGGTNFGR A ++TTSY AP+DEY
Sbjct: 278 NDTKLRSTEDIAFAVALFIARKKGSFVSYYMYHGGTNFGRFASS-YVTTSYYDGAPLDEY 336
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
GL P WGHL+ELH A+KL ALL G SN SLG QEA ++ ++ C AFL N D
Sbjct: 337 GLIWRPTWGHLRELHAAVKLSSEALLFGRYSNFSLGPEQEAHIF-ETELKCVAFLVNFDK 395
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQ--SSTVEMVPENLQPSEASPD 440
TVVFRN+ + L S+S+L +C+ VVF TA V AQ S T E+V E+L
Sbjct: 396 HQTPTVVFRNIYFQLAPKSISVLSECRTVVFETARVNAQYGSRTAEVV-ESLNDIHT--- 451
Query: 441 NGSKGLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFL-- 497
W+ FKE I +A + + +H++ TKD TDYLWY S E++
Sbjct: 452 -------WKAFKEPIPEDISKAVYTGNQLFEHLSMTKDETDYLWYIVSY------EYIPS 498
Query: 498 KNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNP-ISLKAGKNEIALLSMT 556
+G +L +ES+ H LHAF N E GS G+ P N ISL G+N I+LLS+
Sbjct: 499 DDGQLVLLNVESRAHVLHAFVNTEYAGSVHGSHDGPGNIILNTNISLNEGQNTISLLSVM 558
Query: 557 VGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNIN 616
VG ++G E GI V I L+ W Y++GL GE IY ++
Sbjct: 559 VGSPDSGAHMERRSFGIHKVSIQQGQQPLHLLNNELWAYQVGLYGEANRIYTQEESSSAE 618
Query: 617 WVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYW 668
W + + P TWYK P G++ + L++ MGKG W+NGE +GRYW
Sbjct: 619 W-TEINNLTYHPFTWYKTTFATPVGNDVVALNLTSMGKGEVWVNGESLGRYW 669
>gi|22329242|ref|NP_195571.2| beta-galactosidase 14 [Arabidopsis thaliana]
gi|332661551|gb|AEE86951.1| beta-galactosidase 14 [Arabidopsis thaliana]
Length = 988
Score = 550 bits (1417), Expect = e-153, Method: Compositional matrix adjust.
Identities = 290/679 (42%), Positives = 405/679 (59%), Gaps = 46/679 (6%)
Query: 58 MWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIG 117
MWP ++ +A+ GG+NTI++YVFWN HE GKY F GRF+LVKFIK+I + +Y+ LR+G
Sbjct: 1 MWPSIIDKARIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHEKGLYVTLRLG 60
Query: 118 PFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPII 177
PF+ AE+N+GG+P WL +P FR + EPFK H ++++ I+ MMK EKLFASQGGPII
Sbjct: 61 PFIQAEWNHGGLPYWLREVPDVYFRTNNEPFKEHTERYVRKILGMMKEEKLFASQGGPII 120
Query: 178 LAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYC 237
L Q+ENEY + Y E G++Y WAA + + N+G+PW+MC+Q D P +IN CN +C
Sbjct: 121 LGQIENEYNAVQLAYKENGEKYIKWAANLVESMNLGIPWVMCKQNDAPGNLINACNGRHC 180
Query: 238 -DQFT-PHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYH 295
D F P+ P +WTENW F+ FG R EDIAFSVAR+F K GS NYYMYH
Sbjct: 181 GDTFPGPNRHDKPSLWTENWTTQFRVFGDPPTQRTVEDIAFSVARYFSKNGSHVNYYMYH 240
Query: 296 GGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNL 355
GGTNFGRT+ F+TT Y +AP+DE+GL + PK+GHLK +H A++LC+ AL G+
Sbjct: 241 GGTNFGRTSAH-FVTTRYYDDAPLDEFGLEKAPKYGHLKHVHRALRLCKKALFWGQLRAQ 299
Query: 356 SLGSSQEADVYAD-SSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFN 414
+LG E Y + CAAFL+N + ++ T+ F+ Y LP+ S+SILPDCK VV+N
Sbjct: 300 TLGPDTEVRYYEQPGTKVCAAFLSNNNTRDTNTIKFKGQDYVLPSRSISILPDCKTVVYN 359
Query: 415 TANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINT 474
TA + AQ S + V + SKGLK+++F E + D + G + ++
Sbjct: 360 TAQIVAQHSWRDFV---------KSEKTSKGLKFEMFSENIPSLLDGDSLIPGELYYL-- 408
Query: 475 TKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPP 534
TKD TDY WYTTS+ ++E++ + G + +L + S GHAL + N E G A G
Sbjct: 409 TKDKTDYAWYTTSVKIDEDDFPDQKGLKTILRVASLGHALIVYVNGEYAGKAHGRHEMKS 468
Query: 535 FKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLS-TYSW 593
F++ P++ K G N I++L + GL ++G + E AG ++ I G SGT DL+ W
Sbjct: 469 FEFAKPVNFKTGDNRISILGVLTGLPDSGSYMEHRFAGPRAISIIGLKSGTRDLTENNEW 528
Query: 594 TYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMG 653
+ GL+GE +Y + W + K +PLTWYK + P G + + M MG
Sbjct: 529 GHLAGLEGEKKEVYTEEGSKKVKW---EKDGKRKPLTWYKTYFETPEGVNAVAIRMKAMG 585
Query: 654 KGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRS 713
KGL W+NG +GRYW ++ GEP+Q YHIPRS
Sbjct: 586 KGLIWVNGIGVGRYWM-------------------------SFLSPLGEPTQTEYHIPRS 620
Query: 714 WFK--PSENILVIFEEKGG 730
+ K +N+LVI EE+ G
Sbjct: 621 FMKGEKKKNMLVILEEEPG 639
>gi|218202538|gb|EEC84965.1| hypothetical protein OsI_32205 [Oryza sativa Indica Group]
Length = 807
Score = 548 bits (1411), Expect = e-153, Method: Compositional matrix adjust.
Identities = 299/711 (42%), Positives = 406/711 (57%), Gaps = 73/711 (10%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD RSL+I+G+R+L S AIHYPRS P MW LV+ AK GG+NTIE+YVFWNGHE P
Sbjct: 36 VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWNGHEPEP 95
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GKYYF GRF+L++F+ +I+ MY I+RIGPF+ AE+N+GG+P WL I +FR + EP
Sbjct: 96 GKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHIIFRANNEP 155
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK +ENEYG + G +Y WAA+MA
Sbjct: 156 FK-------------------------------IENEYGNIKKDRKVEGDKYLEWAAEMA 184
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
++ IGVPW+MC+Q P VI TCN +C D +T + P++WTENW F+TFG +
Sbjct: 185 ISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDKNKPRLWTENWTAQFRTFGDQL 244
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
R +EDIA++V RFF KGG++ NYYMYHGGTNFGRT G ++ T Y EAP+DEYG+ +
Sbjct: 245 AQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMCK 303
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDDKND 385
PK+GHL++LH IK A L G++S LG EA Y C +FL+N + D
Sbjct: 304 EPKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSNNNTGED 363
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
TVVFR +++P+ SVSIL DCK VV+NT V Q S + S + D SK
Sbjct: 364 GTVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHS---------ERSFHTTDETSKN 414
Query: 446 LKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVL 505
W+++ E + + ++ N TKDT+DYLWYTTS + ++ + RPV+
Sbjct: 415 NVWEMYSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDIRPVI 474
Query: 506 LIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPF 565
I+S HA+ FAN G+ G+ F ++ P+ L+ G N IA+LS ++G++++G
Sbjct: 475 QIKSTAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKDSGGE 534
Query: 566 YEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPK 625
V GI + G N+GTLDL +K L+GE IY W +P +
Sbjct: 535 LVEVKGGIQDCVVQGLNTGTLDLQGNGRGHKARLEGEDKEIYTEKGMAQFQW----KPAE 590
Query: 626 NQ-PLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
N P+TWYK +P GD+PI +DM M KG+ ++NGE IGRYW
Sbjct: 591 NDLPITWYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWT--------------- 635
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
IT G PSQ YHIPR++ KP N+L+IFEE+ G P I
Sbjct: 636 ----------SFITLAGHPSQSVYHIPRAFLKPKGNLLIIFEEELGKPGGI 676
>gi|449519864|ref|XP_004166954.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 3-like, partial
[Cucumis sativus]
Length = 635
Score = 548 bits (1411), Expect = e-153, Method: Compositional matrix adjust.
Identities = 257/533 (48%), Positives = 347/533 (65%), Gaps = 20/533 (3%)
Query: 214 VPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSED 273
VPW+MC+Q D PDP+INTCN FYCD F+P+ P P WTE W WF FGG + RP ED
Sbjct: 3 VPWVMCKQDDAPDPMINTCNGFYCDYFSPNKPYKPNFWTEAWTAWFNNFGGPNHKRPVED 62
Query: 274 IAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHL 333
+AF VARF QKGGS+ NYYMYHGGTNFGRTAGGPFITTSYDY+APIDEYGL R PK+GHL
Sbjct: 63 LAFGVARFIQKGGSLVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIRQPKFGHL 122
Query: 334 KELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNV 393
K LH A+KLCE ALL GE + +L + Q+A V++ SSG CAAFL+N N V F
Sbjct: 123 KRLHDAVKLCEKALLTGEPHDYTLATYQKAKVFSSSSGDCAAFLSNYHSNNTARVTFNGR 182
Query: 394 SYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKE 453
Y LP WS+SILPDCK V++NTA V+ Q++ + +P ++ W+ + E
Sbjct: 183 HYTLPPWSISILPDCKSVIYNTAQVQVQTNQLSFLPTKVE-----------SFSWETYNE 231
Query: 454 -IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGH 512
I+ I ++ G ++ + TKD +DYLWYTTS+ V+ NE +L+ G P L SKGH
Sbjct: 232 NISSIEEDSSMSYDGLLEQLTITKDNSDYLWYTTSVNVDPNESYLRGGKFPTLTATSKGH 291
Query: 513 ALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAG 572
+H F N +L GS+ G + F + I+L+AG N+++LLS+ GL N GP YE G
Sbjct: 292 GMHVFINGKLAGSSFGTHDNSKFTFTGRINLQAGVNKVSLLSIAGGLPNNGPHYEEREMG 351
Query: 573 ITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV-STMEPPKNQPLT 630
+ V I G + G +DLS W+YK+GL+GE++ + +P ++W +++ QPLT
Sbjct: 352 VLGPVAIHGLDXGKMDLSRQKWSYKVGLKGENMNLGSPSSVQAVDWAKDSLKQENAQPLT 411
Query: 631 WYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGK 690
WYKA P GDEP+ LDM M KG W+NG+ +GRYW + + +C Y G
Sbjct: 412 WYKAYFDAPEGDEPLALDMGSMQKGQVWINGQNVGRYWTITANGN------CTDCSYSGT 465
Query: 691 FNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
+ P KC GCG+P+Q+WYH+PRSW P++N++V+FEE GG+P++I+ R ++
Sbjct: 466 YRPRKCQFGCGQPTQQWYHVPRSWLMPTKNLIVVFEEVGGNPSRISLVKRSVT 518
>gi|255550371|ref|XP_002516236.1| beta-galactosidase, putative [Ricinus communis]
gi|223544722|gb|EEF46238.1| beta-galactosidase, putative [Ricinus communis]
Length = 775
Score = 546 bits (1408), Expect = e-152, Method: Compositional matrix adjust.
Identities = 298/733 (40%), Positives = 417/733 (56%), Gaps = 82/733 (11%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
+LI + ++ C A V YDS +LIING R++I S AIHYPRS P MWP L+ +AK+GG+
Sbjct: 9 VLISTLALLSLCSATTVEYDSNALIINGERKIIFSGAIHYPRSTPEMWPELINKAKDGGL 68
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ IE+YVFW+ HE +Y F G ++VKF ++IQ+A +Y+ILRIGP+V AE+NYGG P+
Sbjct: 69 DAIETYVFWDRHEPVRRQYDFSGNLDIVKFFRVIQEAGLYVILRIGPYVCAEWNYGGFPM 128
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WLH PG R D E +K + F F S I+ +Q+ GYY
Sbjct: 129 WLHNTPGVELRTDNEIYKVPLLIF-------------FVSNNVRIV-SQINTCNGYY--- 171
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
CD F P++P PK++
Sbjct: 172 ---------------------------------------------CDTFKPNNPKSPKMF 186
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TENW GW+K +GG+ +R +ED+AFSVARF Q GG +NYYMY+GGTNFGRTAGGP+IT
Sbjct: 187 TENWSGWYKLWGGKTSYRTAEDMAFSVARFVQAGGVFNNYYMYYGGTNFGRTAGGPYITA 246
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSG 371
SYDY++P+DEYG PKWGHLK+LH +IKL E + NG + + + + Y +++
Sbjct: 247 SYDYDSPLDEYGNLNQPKWGHLKQLHASIKLGEKIITNGTVTIKNFQAGVDLTAYTNNAT 306
Query: 372 ACA-AFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSS-TVEMVP 429
FL+N++ + + ++ +Y +PAWSVSIL +C K +FNTA V Q+S V+ +
Sbjct: 307 RERFCFLSNINIADAHIDLQQDGNYTIPAWSVSILQNCSKEIFNTAKVNTQTSLMVKKLY 366
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSII 489
EN +P+ S W + G+ F S +D TT D +DYLWY TS
Sbjct: 367 ENDKPTNLS-------WVWAPEPMKDTLLGKGRFRTSQLLDQKETTVDASDYLWYMTSFD 419
Query: 490 VNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNE 549
+N+N N L + S+GH LHA+ N++L S F ++ P++LK G N
Sbjct: 420 MNKNTLQWTN---VTLRVTSRGHVLHAYVNKKLI-VGSQLVIQGEFTFEKPVTLKPGNNV 475
Query: 550 IALLSMTVGLQNAGPFYEWVGAGITS--VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
I+LLS TVGL N G F++ GI V++ +DLS+ W+YKIGL GE Y
Sbjct: 476 ISLLSATVGLANYGSFFDKTPVGIVDGPVQLMANGKPVMDLSSNLWSYKIGLNGEAKRFY 535
Query: 608 NPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+P R+N W + +P+TWYK P G +P+ +D+ MGKG AW NG+ +GRY
Sbjct: 536 DPTSRHN-KWSAANGVSTARPMTWYKTTFSSPSGTDPVVVDLQGMGKGHAWANGKSLGRY 594
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPS-ENILVIFE 726
WP + + + C CDYRG +N KC CG P+QRWYH+PRS+ + +N L++FE
Sbjct: 595 WPSQIANA---NGCSGTCDYRGPYNAGKCTRNCGIPTQRWYHVPRSFLNSNGKNTLILFE 651
Query: 727 EKGGDPTKITFSI 739
E GGDP+ I+F I
Sbjct: 652 EVGGDPSGISFQI 664
>gi|147768425|emb|CAN73625.1| hypothetical protein VITISV_026637 [Vitis vinifera]
Length = 767
Score = 536 bits (1382), Expect = e-149, Method: Compositional matrix adjust.
Identities = 296/714 (41%), Positives = 392/714 (54%), Gaps = 106/714 (14%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A VTYD RSLI+NGRREL+ S +IHYPRS P
Sbjct: 29 AKTVTYDGRSLIVNGRRELLFSGSIHYPRSTP---------------------------- 60
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
++ F G ++LVKFIK+I +Y LRIGPF+ AE+N+GG P WL +P +FR+
Sbjct: 61 ----EFNFEGNYDLVKFIKLIGDYGLYATLRIGPFIEAEWNHGGFPYWLREVPDIIFRSY 116
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPFKYHM+K+ +I++MMK KLFA QGGPIILAQ+ENEY + Y E G +Y WA
Sbjct: 117 NEPFKYHMEKYSRMIIEMMKEAKLFAPQGGPIILAQIENEYNSIQLAYKELGVQYVQWAG 176
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFT-PHSPSMPKIWTENWPGWFKTF 262
KMAV GVPWIMC+Q D PDPVINTCN +C D FT P+ P+ P +WTENW ++ F
Sbjct: 177 KMAVGLGAGVPWIMCKQKDAPDPVINTCNGRHCGDTFTGPNRPNKPSLWTENWTAQYRVF 236
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
G R +ED+AFSVARF K G++ NYYMYHGGTNFGRT G F+TT Y EAP+DEY
Sbjct: 237 GDPPSQRAAEDLAFSVARFISKNGTLANYYMYHGGTNFGRT-GSSFVTTRYYDEAPLDEY 295
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD-SSGACAAFLANMD 381
GL R PKWGHLK+LH A++LC+ AL G LG +E Y + CAAFL N
Sbjct: 296 GLQREPKWGHLKDLHSALRLCKKALFTGSPGVEKLGKDKEVRFYEKPGTHICAAFLTNNH 355
Query: 382 DKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDN 441
+ T+ FR Y LP S+SILPDCK VV+NT V AQ + V +
Sbjct: 356 SREAATLTFRGEEYFLPPHSISILPDCKTVVYNTQRVVAQHNARNFVKSKI--------- 406
Query: 442 GSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGS 501
+K LKW++ +E + + + ++ KD +DY W+ TSI ++ + +K
Sbjct: 407 ANKNLKWEMSQEPIPVMTDMKILTKSPMELYXFLKDRSDYAWFVTSIELSNYDLPMKKDI 466
Query: 502 RPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQN 561
PVL I + GHA+ AF N GSA G+ F ++ P+ + G+N++ ++
Sbjct: 467 IPVLQISNLGHAMLAFVNGNFIGSAHGSNVEKNFVFRKPVKFQ-GRNKLHCPAV------ 519
Query: 562 AGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTM 621
Y+ GI SV+I G N+GTLD++ W ++G+ GEH+ Y G + + W T
Sbjct: 520 ----YDSGTTGIHSVQILGLNTGTLDITNNGWGQQVGVNGEHVKAYTQGGSHRVQW--TA 573
Query: 622 EPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDEC 681
K +TWYK P G++P+ L M M KG NG E
Sbjct: 574 AKGKGPAMTWYKTYFDMPEGNDPVILRMTSMAKG----NGLE------------------ 611
Query: 682 VQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
YH+PR+W KPS+N+LVIFEE GG+P +I
Sbjct: 612 --------------------------YHVPRAWLKPSDNLLVIFEETGGNPEEI 639
>gi|238481152|ref|NP_001154292.1| beta-galactosidase 14 [Arabidopsis thaliana]
gi|332661552|gb|AEE86952.1| beta-galactosidase 14 [Arabidopsis thaliana]
Length = 1052
Score = 533 bits (1374), Expect = e-148, Method: Compositional matrix adjust.
Identities = 285/679 (41%), Positives = 400/679 (58%), Gaps = 50/679 (7%)
Query: 58 MWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIG 117
MWP ++ +A+ GG+NTI++YVFWN HE GKY F GRF+LVKFIK+I + +Y+ LR+G
Sbjct: 69 MWPSIIDKARIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHEKGLYVTLRLG 128
Query: 118 PFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPII 177
PF+ AE+N+GG+P WL +P FR + EPFK H ++++ I+ MMK EKLFASQGGPII
Sbjct: 129 PFIQAEWNHGGLPYWLREVPDVYFRTNNEPFKEHTERYVRKILGMMKEEKLFASQGGPII 188
Query: 178 LAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYC 237
L Q+ENEY + Y E G++Y WAA + + N+G+PW+MC+Q D P +IN CN +C
Sbjct: 189 LGQIENEYNAVQLAYKENGEKYIKWAANLVESMNLGIPWVMCKQNDAPGNLINACNGRHC 248
Query: 238 -DQF-TPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYH 295
D F P+ P +WTENW F+ FG R EDIAFSVAR+F K GS NYYMYH
Sbjct: 249 GDTFPGPNRHDKPSLWTENWTTQFRVFGDPPTQRTVEDIAFSVARYFSKNGSHVNYYMYH 308
Query: 296 GGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNL 355
GGTNFGRT+ F+TT Y +AP+DE+GL + PK+GHLK +H A++LC+ AL G+
Sbjct: 309 GGTNFGRTSAH-FVTTRYYDDAPLDEFGLEKAPKYGHLKHVHRALRLCKKALFWGQLRAQ 367
Query: 356 SLGSSQEADVYAD-SSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFN 414
+LG E Y + CAAFL+N + ++ T+ F+ Y LP+ S+SILPDCK VV+N
Sbjct: 368 TLGPDTEVRYYEQPGTKVCAAFLSNNNTRDTNTIKFKGQDYVLPSRSISILPDCKTVVYN 427
Query: 415 TANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINT 474
TA + AQ S + V + SKGLK+++F E + D + G + ++
Sbjct: 428 TAQIVAQHSWRDFV---------KSEKTSKGLKFEMFSENIPSLLDGDSLIPGELYYL-- 476
Query: 475 TKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPP 534
TKD TDY + ++E++ + G + +L + S GHAL + N E G A G
Sbjct: 477 TKDKTDY----ACVKIDEDDFPDQKGLKTILRVASLGHALIVYVNGEYAGKAHGRHEMKS 532
Query: 535 FKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLS-TYSW 593
F++ P++ K G N I++L + GL ++G + E AG ++ I G SGT DL+ W
Sbjct: 533 FEFAKPVNFKTGDNRISILGVLTGLPDSGSYMEHRFAGPRAISIIGLKSGTRDLTENNEW 592
Query: 594 TYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMG 653
+ GL+GE +Y + W + K +PLTWYK + P G + + M MG
Sbjct: 593 GHLAGLEGEKKEVYTEEGSKKVKW---EKDGKRKPLTWYKTYFETPEGVNAVAIRMKAMG 649
Query: 654 KGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRS 713
KGL W+NG +GRYW ++ GEP+Q YHIPRS
Sbjct: 650 KGLIWVNGIGVGRYWM-------------------------SFLSPLGEPTQTEYHIPRS 684
Query: 714 WFK--PSENILVIFEEKGG 730
+ K +N+LVI EE+ G
Sbjct: 685 FMKGEKKKNMLVILEEEPG 703
>gi|110741385|dbj|BAF02242.1| putative galactosidase [Arabidopsis thaliana]
Length = 592
Score = 525 bits (1351), Expect = e-146, Method: Compositional matrix adjust.
Identities = 261/494 (52%), Positives = 328/494 (66%), Gaps = 16/494 (3%)
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTE W GWF FGG P+RP+ED+AFSVARF QKGGS NYYMYHGGTNFGRTAGGPFI
Sbjct: 1 MWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFI 60
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TSYDY+AP+DEYGL R PKWGHLK+LH AIKLCE AL++GE + + LG+ QEA VY
Sbjct: 61 ATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTRMPLGNYQEAHVYKSK 120
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
SGAC+AFLAN + K+ V F N Y+LP WS+SILPDCK V+NTA V AQ+S ++MV
Sbjct: 121 SGACSAFLANYNPKSYAKVSFGNNHYNLPPWSISILPDCKNTVYNTARVGAQTSRMKMV- 179
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSII 489
P +G GL WQ + E + + F G V+ INTT+DT+DYLWY T +
Sbjct: 180 -------RVPVHG--GLSWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWYMTDVK 230
Query: 490 VNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNE 549
V+ NE FL+NG P L + S GHA+H F N +L GSA G+ P ++ ++L+AG N+
Sbjct: 231 VDANEGFLRNGDLPTLTVLSAGHAMHVFINGQLSGSAYGSLDSPKLTFRKGVNLRAGFNK 290
Query: 550 IALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYN 608
IA+LS+ VGL N GP +E AG+ V + G N G DLS WTYK+GL+GE L +++
Sbjct: 291 IAILSIAVGLPNVGPHFETWNAGVLGPVSLNGLNGGRRDLSWQKWTYKVGLKGESLSLHS 350
Query: 609 PGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYW 668
+++ W + QPLTWYK P GD P+ +DM MGKG W+NG+ +GR+W
Sbjct: 351 LSGSSSVEWAEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQSLGRHW 410
Query: 669 PRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEK 728
P S EC Y G F DKC+ CGE SQRWYH+PRSW KPS N+LV+FEE
Sbjct: 411 PAYKAVGS-----CSECSYTGTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVFEEW 465
Query: 729 GGDPTKITFSIRKI 742
GGDP IT R++
Sbjct: 466 GGDPNGITLVRREV 479
>gi|449445172|ref|XP_004140347.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 493
Score = 518 bits (1333), Expect = e-144, Method: Compositional matrix adjust.
Identities = 249/479 (51%), Positives = 325/479 (67%), Gaps = 10/479 (2%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
L+ + +T+C NV+YDS ++IING R +I S +IHYPRS MWP L+Q+AK+GG++
Sbjct: 7 LVATLACLTFCLGDNVSYDSNAIIINGERRIIFSGSIHYPRSTEAMWPDLIQKAKDGGLD 66
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
IE+Y+FW+ HE KY F GR + +KF ++IQ A +Y+++RIGP+V AE+NYGG PVW
Sbjct: 67 AIETYIFWDRHEPQRRKYDFSGRLDFIKFFQLIQDAGLYVVMRIGPYVCAEWNYGGFPVW 126
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYES-F 191
LH +PG R + + +K MQ F T IV+M K+ LFASQGGPIILAQ+ENEYG +
Sbjct: 127 LHNMPGIQLRTNNQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPA 186
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
YG+ GK Y W A+MA + NIGVPWIMCQQ D P P+INTCN FYCD FTP++P PK++
Sbjct: 187 YGDAGKAYINWCAQMAESLNIGVPWIMCQQSDAPQPIINTCNGFYCDNFTPNNPKSPKMF 246
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TENW GWFK +G +DP+R +ED+AFSVARFFQ GG +NYYMYHGGTNFGRT+GGPFITT
Sbjct: 247 TENWVGWFKKWGDKDPYRTAEDVAFSVARFFQSGGVFNNYYMYHGGTNFGRTSGGPFITT 306
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD-SS 370
SYDY AP+DEYG PKWGHLK+LH +IKL E L NG +N + GSS + + ++
Sbjct: 307 SYDYNAPLDEYGNLNQPKWGHLKQLHASIKLGEKILTNGTHTNQNFGSSVTLTKFFNPTT 366
Query: 371 GACAAFLANMDDKNDKTVVFR-NVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
G FL+N D KND T+ + + Y +PAWSVSIL C K V+NTA V +Q+S V
Sbjct: 367 GERFCFLSNTDGKNDATIDLQADGKYFVPAWSVSILDGCNKEVYNTAKVNSQTSM--FVK 424
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
E + +N W + G F + F++ T D +DY WY T++
Sbjct: 425 E-----QNEKENAQLSWAWAPEPMKDTLQGNGKFAANLFLEQKRVTADFSDYFWYMTNV 478
>gi|413949218|gb|AFW81867.1| hypothetical protein ZEAMMB73_495459 [Zea mays]
Length = 759
Score = 515 bits (1326), Expect = e-143, Method: Compositional matrix adjust.
Identities = 299/722 (41%), Positives = 395/722 (54%), Gaps = 85/722 (11%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
G VTY+ R+L+++G R ++ + +HYPRS P MWP L+ +AKEGG++ I++YVFWN HE
Sbjct: 16 GEVTYEQRALVLDGARRMLFAGEMHYPRSTPEMWPKLIAKAKEGGLDVIQTYVFWNVHEP 75
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
G+Y F GR++LV+FIK IQ +Y+ LRIGPF+ +E+ YGG P WLH +P FR+D
Sbjct: 76 IQGQYNFEGRYDLVRFIKEIQAQGLYVSLRIGPFIESEWKYGGFPFWLHDVPNITFRSDN 135
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
EPFK HMQ+F+T IV+MMK E L+ QGGPII +Q+ENEY E +G G+RY WAA
Sbjct: 136 EPFKQHMQRFVTDIVNMMKHEGLYYPQGGPIITSQIENEYQMVEPAFGSSGQRYVSWAAA 195
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGR 265
MAV GVPW MC+Q D PDPV+ HS ++P + +N + +G
Sbjct: 196 MAVDLQTGVPWTMCKQNDAPDPVVGI-----------HSYTIP-VNFQNDSRNYLIYGND 243
Query: 266 DPHRPSEDIAFSVARFF-QKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
R +DI F+VA F +K GS +YYMYHGGTNFGR A ++TTSY AP+DEYGL
Sbjct: 244 TKLRSPQDITFAVALFIARKNGSYVSYYMYHGGTNFGRFASS-YVTTSYYDGAPLDEYGL 302
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
P WGHL+ELH A+K LL G SNLS+G QEA ++ ++ C AFL N D +
Sbjct: 303 IWQPTWGHLRELHAAVKQSSEPLLFGTYSNLSIGQEQEAHIF-ETETQCVAFLVNFDQHH 361
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQ--SSTVEMVPENLQPSEASPDNG 442
VVFRN+S L S+SIL DCK+VVF TA V AQ S T E V +
Sbjct: 362 ISEVVFRNISLELAPKSISILLDCKQVVFETAKVNAQHGSRTAEEV-----------QSF 410
Query: 443 SKGLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGS 501
S W+ FKE I ++ + + +H++TTKD TDYLWY + +N
Sbjct: 411 SDISTWKAFKEPIPQDVSKSAYSGNRLFEHLSTTKDATDYLWYIVGLFLN---------- 460
Query: 502 RPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQN 561
I + H H G + ISL+ G N I+LLS VG +
Sbjct: 461 -----ILGRIHGSH--------------GGPANIIFSTNISLQEGPNTISLLSAMVGSPD 501
Query: 562 AGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTM 621
+G E GI V I L+ W Y++GL GE IY + I +T+
Sbjct: 502 SGAHMERRVFGIRKVSIQQGQEPENLLNNELWGYQVGLFGERNNIYTQDSK--ITEWTTI 559
Query: 622 EPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDEC 681
+ PLTWYK P G++ + L++ MGKG W+NGE IGRYW S
Sbjct: 560 DNLTYSPLTWYKTTFSTPVGNDAVTLNLTGMGKGEVWVNGESIGRYWVSFKAPS------ 613
Query: 682 VQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRK 741
G PSQ YHIPR + P +N LV+FEE GG+P IT +
Sbjct: 614 -------------------GNPSQSLYHIPREFLNPQDNTLVLFEEMGGNPQLITVNTMS 654
Query: 742 IS 743
+S
Sbjct: 655 VS 656
>gi|242090613|ref|XP_002441139.1| hypothetical protein SORBIDRAFT_09g021140 [Sorghum bicolor]
gi|241946424|gb|EES19569.1| hypothetical protein SORBIDRAFT_09g021140 [Sorghum bicolor]
Length = 784
Score = 513 bits (1321), Expect = e-142, Method: Compositional matrix adjust.
Identities = 299/721 (41%), Positives = 391/721 (54%), Gaps = 84/721 (11%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
V+ D+R+L+++G R L+ + +HY RS P MWP L+ +AKEGG++ I++YVFWN HE
Sbjct: 41 QVSLDARALVVDGTRRLLFAGEMHYTRSTPEMWPKLIAKAKEGGLDMIQTYVFWNVHEPV 100
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+Y F GR++LV+FIK IQ +Y+ LRIGPF+ +E+ YGG P WLH +P FR+D E
Sbjct: 101 QGQYNFEGRYDLVRFIKEIQAQGLYVSLRIGPFIESEWKYGGFPFWLHDVPNITFRSDNE 160
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK HMQ+F+T IV+MMK E L+ QGGPII +Q+ENEY E +G G+RY WAA M
Sbjct: 161 PFKQHMQRFVTDIVNMMKHEGLYYPQGGPIITSQIENEYQMVEHAFGSSGQRYVSWAAAM 220
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
AV + GVPW MC+Q D PDPV+ HS ++P + N + +G
Sbjct: 221 AVDRQTGVPWTMCKQNDAPDPVVGI-----------HSHTIPLDFP-NASRNYLIYGNDT 268
Query: 267 PHRPSEDIAFSVARFF-QKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
R EDIAF+V F +K GS +YYMYHGGTNFGR A ++TTSY AP+DEYGL
Sbjct: 269 KLRSPEDIAFAVVYFIARKNGSYVSYYMYHGGTNFGRFASS-YVTTSYYDAAPLDEYGLI 327
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKND 385
P WGHL+ELH A+K LL G S LSLG QEA ++ ++ C AFL N D +
Sbjct: 328 WQPTWGHLRELHAAVKQSSEPLLFGTYSYLSLGQEQEAHIF-ETESQCVAFLVNFDRHHI 386
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQ--SSTVEMVPENLQPSEASPDNGS 443
VVFRN+S L S+SIL DCK+VVF TA V AQ S T E V + S
Sbjct: 387 SEVVFRNISLELAPKSISILSDCKRVVFETAKVTAQHGSRTAEEV-----------QSFS 435
Query: 444 KGLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
W FKE I +A + + +H++TTKD TDYLWY + N
Sbjct: 436 DINTWTAFKEPIPQDVSKAMYSGNRLFEHLSTTKDDTDYLWYIVGLFHN----------- 484
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
I + H H G ISLK G N I+LLS VG ++
Sbjct: 485 ----ILGRIHGSH--------------GGPANIILNTNISLKEGPNTISLLSAMVGSPDS 526
Query: 563 GPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
G E G+ V I L+ W Y++GL GE IY ++ W +T+
Sbjct: 527 GAHMERRVFGLQKVSIQQGQEPENLLNNELWGYQVGLFGERNSIYTQEGSKSVEW-TTIY 585
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
PLTWYK P G++ + L++ MGKG W+NGE IGRYW S
Sbjct: 586 NLAYSPLTWYKTTFSTPAGNDAVTLNLTGMGKGEVWVNGESIGRYWVSFKAPS------- 638
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
G PSQ YHIPR + P +NILV+FEE GG+P +IT + +
Sbjct: 639 ------------------GNPSQSLYHIPRQFLNPQDNILVLFEEMGGNPQQITVNTVSV 680
Query: 743 S 743
+
Sbjct: 681 T 681
>gi|222631666|gb|EEE63798.1| hypothetical protein OsJ_18622 [Oryza sativa Japonica Group]
Length = 765
Score = 513 bits (1320), Expect = e-142, Method: Compositional matrix adjust.
Identities = 293/721 (40%), Positives = 382/721 (52%), Gaps = 92/721 (12%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+TYD R+L+++G R + S +HY RS P MWP L+ +AK GG++ I++YVFWN HE
Sbjct: 29 ITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVHEPIQ 88
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G+Y F GR++LVKFI+ IQ +Y+ LRIGPFV AE+ YGG P WLH +P FR+D EP
Sbjct: 89 GQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRSDNEP 148
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK HMQ F+T IV MMK E L+ QGGPII++Q+ENEY E +G G RY WAA MA
Sbjct: 149 FKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWAAAMA 208
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQ--FTPHSPSMPKIWTENWPGWFKTFGGR 265
V GVPW+MC+Q D PDPVINTCN C + P+SP+ P +WTENW + +G
Sbjct: 209 VGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRYPIYGND 268
Query: 266 DPHRPSEDIAFSVARFF-QKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
R EDIAF+VA F +K GS +YYMYHGGTNFGR A ++TTSY AP+DEY
Sbjct: 269 TKLRAPEDIAFAVALFIARKKGSFVSYYMYHGGTNFGRFAAS-YVTTSYYDGAPLDEYDF 327
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
C AFL N D N
Sbjct: 328 -----------------------------------------------KCVAFLVNFDQHN 340
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
V FRN+S L S+S+L DC+ VVF TA V AQ + N S +N
Sbjct: 341 TPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNAQHGSRT---ANAVQSLNDINN--- 394
Query: 445 GLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
W+ F E + ++ + + + + TTKD TDYLWY IV+
Sbjct: 395 ---WKAFIEPVPQDLSKSTYTGNQLFEQLTTTKDETDYLWY----IVSYKNRASDGNQIA 447
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNP-ISLKAGKNEIALLSMTVGLQNA 562
L ++S H LHAF N E GS G+ P N +SLK G N I+LLS+ VG ++
Sbjct: 448 HLYVKSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEGDNTISLLSVMVGSPDS 507
Query: 563 GPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
G + E GI +V I L+ W Y++GL GE IY N++ W+ +
Sbjct: 508 GAYMERRTFGIQTVGIQQGQQPMHLLNNDLWGYQVGLFGEKDSIYTQEGTNSVRWMD-IN 566
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
PLTWYK PPG++ + L++ MGKG W+NGE IGRYW S
Sbjct: 567 NLIYHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEVWVNGESIGRYWVSFKAPS------- 619
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
G+PSQ YHIPR + P +N+LV+ EE GGDP +IT + +
Sbjct: 620 ------------------GQPSQSLYHIPRGFLTPKDNLLVLVEEMGGDPLQITVNTMSV 661
Query: 743 S 743
+
Sbjct: 662 T 662
>gi|218196839|gb|EEC79266.1| hypothetical protein OsI_20049 [Oryza sativa Indica Group]
Length = 761
Score = 510 bits (1314), Expect = e-141, Method: Compositional matrix adjust.
Identities = 292/721 (40%), Positives = 382/721 (52%), Gaps = 92/721 (12%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+TYD R+L+++G R + S +HY RS P MWP L+ +AK GG++ I++YVFWN HE
Sbjct: 25 ITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVHEPIQ 84
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G+Y F GR++LVKFI+ IQ +Y+ LRIGPFV AE+ YGG P WLH +P FR+D EP
Sbjct: 85 GQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRSDNEP 144
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK HMQ F+T IV MMK E L+ QGGPII++Q+ENEY E +G G RY WAA MA
Sbjct: 145 FKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWAAAMA 204
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQ--FTPHSPSMPKIWTENWPGWFKTFGGR 265
V GVPW+MC+Q D PDPVINTCN C + P+SP+ P +WTENW + +G
Sbjct: 205 VGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRYPIYGND 264
Query: 266 DPHRPSEDIAFSVARFF-QKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGL 324
R EDIAF+VA + +K GS +YYMYHGGTNFGR A ++TTSY AP+DEY
Sbjct: 265 TKLRDPEDIAFAVALYIARKKGSFVSYYMYHGGTNFGRFAAS-YVTTSYYDGAPLDEYDF 323
Query: 325 PRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
C AFL N D N
Sbjct: 324 -----------------------------------------------KCVAFLVNFDQHN 336
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
V FRN+S L S+S+L DC+ VVF TA V AQ + N S +N
Sbjct: 337 TPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNAQHGSRT---ANAVQSLNDINN--- 390
Query: 445 GLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
W+ F E + ++ + + + + TTKD TDYLWY IV+
Sbjct: 391 ---WKAFIEPVPQDLSKSTYTGNQLFEQLTTTKDETDYLWY----IVSYKNRASDGNQIA 443
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNP-ISLKAGKNEIALLSMTVGLQNA 562
L ++S H LHAF N E GS G+ P N +SLK G N I+LLS+ VG ++
Sbjct: 444 RLYVKSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEGDNTISLLSVMVGSPDS 503
Query: 563 GPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTME 622
G + E GI +V I L+ W Y++GL GE IY N++ W+ +
Sbjct: 504 GAYMERRTFGIQTVGIQQGQQPMHLLNNDLWGYQVGLFGEKDSIYTQEGPNSVRWMD-IN 562
Query: 623 PPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECV 682
PLTWYK PPG++ + L++ MGKG W+NGE IGRYW S
Sbjct: 563 NLIYHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEVWVNGESIGRYWVSFKAPS------- 615
Query: 683 QECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
G+PSQ YHIPR + P +N+LV+ EE GGDP +IT + +
Sbjct: 616 ------------------GQPSQSLYHIPRGFLTPKDNLLVLVEEMGGDPLQITVNTMSV 657
Query: 743 S 743
+
Sbjct: 658 T 658
>gi|4467146|emb|CAB37515.1| galactosidase like protein [Arabidopsis thaliana]
gi|7270842|emb|CAB80523.1| galactosidase like protein [Arabidopsis thaliana]
Length = 1036
Score = 509 bits (1311), Expect = e-141, Method: Compositional matrix adjust.
Identities = 272/648 (41%), Positives = 381/648 (58%), Gaps = 46/648 (7%)
Query: 89 KYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPF 148
+Y F GRF+LVKFIK+I + +Y+ LR+GPF+ AE+N+GG+P WL +P FR + EPF
Sbjct: 80 QYDFKGRFDLVKFIKLIHEKGLYVTLRLGPFIQAEWNHGGLPYWLREVPDVYFRTNNEPF 139
Query: 149 KYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAV 208
K H ++++ I+ MMK EKLFASQGGPIIL Q+ENEY + Y E G++Y WAA +
Sbjct: 140 KEHTERYVRKILGMMKEEKLFASQGGPIILGQIENEYNAVQLAYKENGEKYIKWAANLVE 199
Query: 209 AQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFT-PHSPSMPKIWTENWPGWFKTFGGRD 266
+ N+G+PW+MC+Q D P +IN CN +C D F P+ P +WTENW F+ FG
Sbjct: 200 SMNLGIPWVMCKQNDAPGNLINACNGRHCGDTFPGPNRHDKPSLWTENWTTQFRVFGDPP 259
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
R EDIAFSVAR+F K GS NYYMYHGGTNFGRT+ F+TT Y +AP+DE+GL +
Sbjct: 260 TQRTVEDIAFSVARYFSKNGSHVNYYMYHGGTNFGRTSAH-FVTTRYYDDAPLDEFGLEK 318
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD-SSGACAAFLANMDDKND 385
PK+GHLK +H A++LC+ AL G+ +LG E Y + CAAFL+N + ++
Sbjct: 319 APKYGHLKHVHRALRLCKKALFWGQLRAQTLGPDTEVRYYEQPGTKVCAAFLSNNNTRDT 378
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
T+ F+ Y LP+ S+SILPDCK VV+NTA + AQ S + V + SKG
Sbjct: 379 NTIKFKGQDYVLPSRSISILPDCKTVVYNTAQIVAQHSWRDFV---------KSEKTSKG 429
Query: 446 LKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVL 505
LK+++F E + D + G + ++ TKD TDY WYTTS+ ++E++ + G + +L
Sbjct: 430 LKFEMFSENIPSLLDGDSLIPGELYYL--TKDKTDYAWYTTSVKIDEDDFPDQKGLKTIL 487
Query: 506 LIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPF 565
+ S GHAL + N E G A G F++ P++ K G N I++L + GL ++G +
Sbjct: 488 RVASLGHALIVYVNGEYAGKAHGRHEMKSFEFAKPVNFKTGDNRISILGVLTGLPDSGSY 547
Query: 566 YEWVGAGITSVKITGFNSGTLDLS-TYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPP 624
E AG ++ I G SGT DL+ W + GL+GE +Y + W +
Sbjct: 548 MEHRFAGPRAISIIGLKSGTRDLTENNEWGHLAGLEGEKKEVYTEEGSKKVKW---EKDG 604
Query: 625 KNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
K +PLTWYK + P G + + M MGKGL W+NG +GRYW
Sbjct: 605 KRKPLTWYKTYFETPEGVNAVAIRMKAMGKGLIWVNGIGVGRYWM--------------- 649
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFK--PSENILVIFEEKGG 730
++ GEP+Q YHIPRS+ K +N+LVI EE+ G
Sbjct: 650 ----------SFLSPLGEPTQTEYHIPRSFMKGEKKKNMLVILEEEPG 687
>gi|326500386|dbj|BAK06282.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 846
Score = 506 bits (1302), Expect = e-140, Method: Compositional matrix adjust.
Identities = 271/653 (41%), Positives = 379/653 (58%), Gaps = 41/653 (6%)
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F GR +L+KF+K+IQ MY ++RIGPF+ AE+N+GG+P WL IP +FR + EP+K
Sbjct: 108 FEGRNDLIKFLKLIQSHDMYALVRIGPFIQAEWNHGGLPYWLREIPHIIFRANNEPYKKE 167
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
M+KF+ IV +K ++FASQGGP+ILAQ+ENEYG + + G +Y WAA+MA++ N
Sbjct: 168 MEKFVRFIVQKLKDAEMFASQGGPVILAQIENEYGNIKKDHIVEGDKYLEWAAQMAISTN 227
Query: 212 IGVPWIMCQQFDTPDPVINTCNSFYC-DQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRP 270
GVPWIMC+Q P VI TCN +C D +T + P++WTENW F+ FG + R
Sbjct: 228 TGVPWIMCKQSTAPGEVIPTCNGRHCGDTWTLKDKNKPRLWTENWTAQFRAFGDQLALRS 287
Query: 271 SEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKW 330
+EDIA+SV RFF KGG++ NYYMY+GGTNFGRT G ++ T Y E P+DEYG+P+ PK+
Sbjct: 288 AEDIAYSVLRFFAKGGTLVNYYMYYGGTNFGRT-GASYVLTGYYDEGPVDEYGMPKAPKY 346
Query: 331 GHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDDKNDKTVV 389
GHL++LH IK A L G++S L EA + C AF++N + D TV
Sbjct: 347 GHLRDLHNLIKSYSRAFLEGKQSFELLAHGYEAHNFEIPEEKLCLAFISNNNTGEDGTVN 406
Query: 390 FRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQ 449
FR Y++P+ SVSIL DCK VV+NT V Q S + S + +K W+
Sbjct: 407 FRGDKYYIPSRSVSILADCKHVVYNTKRVFVQHS---------ERSFHTAQKLAKSNAWE 457
Query: 450 VFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIES 509
++ E + ++ N TKD +DYLWYTTS + ++ + RPV+ ++S
Sbjct: 458 MYSEPIPRYKLTSIRNKEPMEQYNLTKDDSDYLWYTTSFRLEADDLPFRGDIRPVVQVKS 517
Query: 510 KGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWV 569
HAL F N G+ G+ F ++ PI+L+ G N +ALLS ++G++++G V
Sbjct: 518 TSHALMGFVNDAFAGNGRGSKKEKGFMFETPINLRIGINHLALLSSSMGMKDSGGELVEV 577
Query: 570 GAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPL 629
GI I G N+GTLDL W +K+ L+GE IY + WV + +
Sbjct: 578 KGGIQDCTIQGLNTGTLDLQVNGWGHKVKLEGEVKEIYTEKGMGAVKWVPAT---TGRAV 634
Query: 630 TWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRG 689
TWYK +P G++P+ LDM MGKG+ ++NGE +GRYWP YR
Sbjct: 635 TWYKRYFDEPDGEDPVVLDMTSMGKGMIFVNGEGMGRYWP----------------SYR- 677
Query: 690 KFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITF-SIRK 741
T G PSQ YHIPR + KP N+LVIFEE+ G P I ++R+
Sbjct: 678 --------TVGGVPSQAMYHIPRPFLKPKNNLLVIFEEELGKPEGILIQTVRR 722
>gi|281205901|gb|EFA80090.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
PN500]
Length = 727
Score = 505 bits (1300), Expect = e-140, Method: Compositional matrix adjust.
Identities = 294/728 (40%), Positives = 411/728 (56%), Gaps = 61/728 (8%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
NV+YD RSLIING R+L++SA+IHYPR+ P MW +++ K G++ IE+Y FWN HE +
Sbjct: 42 NVSYDHRSLIINGERKLLLSASIHYPRATPSMWRPVLEATKAAGIDLIETYTFWNLHEPT 101
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PG Y F G N+ F+ I + +Y+ +R GP+V AE+NYGG P WL I G VFR+ +
Sbjct: 102 PGTYNFEGNANVTAFLDICAELGLYVTVRFGPYVCAEWNYGGFPFWLKEIDGIVFRDYNQ 161
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PF M +MT IV+ ++ +AS GGPIILAQVENEYG+ E+ YG G +YALWAA+
Sbjct: 162 PFMDQMSNWMTYIVNYLR--PYYASNGGPIILAQVENEYGWLEAAYGASGTKYALWAAQF 219
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFTPH---SPSMPKIWTENWPGWFKTF 262
A + +IG+PWIMC Q D VINTCN FYC D H P+ P WTENWPGWF+ +
Sbjct: 220 ANSLDIGIPWIMCSQDDIAT-VINTCNGFYCHDWIDVHWTAYPNQPAFWTENWPGWFQNW 278
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
G PHRP +D+ +SVAR+ GGS+ NYYM+ GGT FGR GGPFITTSYDY+ IDEY
Sbjct: 279 EGGVPHRPVQDVLYSVARWIAYGGSMMNYYMWFGGTTFGRWTGGPFITTSYDYDGAIDEY 338
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSN-LSLGSSQE-ADVYADSSGACAAFLANM 380
G P PK+ E H I EH +L+ + LG + E + Y+ +G +FLAN
Sbjct: 339 GYPYEPKYSQSLEFHTIIHAYEHIILSMNPPKPILLGENVEISHFYSVETGESFSFLANF 398
Query: 381 DDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPD 440
+TV + +++ + WSV +L + +F+T+ S VP+ P ++
Sbjct: 399 GATGVQTVQWNGITFKVQPWSVQLLYN-NVSIFDTSATPIGSP----VPKQFTPIKS--- 450
Query: 441 NGSKGLKWQVFKEIAGIWGEA-DFVKSGF----VDHINTTKDTTDYLWYTTSIIVNENEE 495
F+ I G W E+ D + + ++ ++ T+D TDYLWY T I VN
Sbjct: 451 ----------FENI-GQWSESFDLTFTNYSETPMEQLSLTRDQTDYLWYVTKIEVN---- 495
Query: 496 FLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSM 555
+ G++ L + + +H F + Q A+G G P ++ G + + +L
Sbjct: 496 --RVGAQ--LSLPNISDMVHVFVDN--QYIATGRG---PTNITLNSTIGVGGHTLQVLHT 546
Query: 556 TVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNI 615
VGL N E AGI ++D+S+ W+ K +QGE L +YNP + ++
Sbjct: 547 KVGLVNYAEHMEATVAGI----FEPVTLDSVDISSNGWSMKPFVQGETLQLYNPNHSGSV 602
Query: 616 NWVSTMEPPKNQPLTWYKAVVK-QPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRK 674
W + N PLTWYK + + + LDML M KG+ ++NG IGRYW +
Sbjct: 603 QWTNVT---GNPPLTWYKFNFNLELSSNMSLALDMLGMTKGMIFVNGYNIGRYWLALAYG 659
Query: 675 SSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTK 734
+P C Y+G ++P C GCGEPSQ++YH+P W EN +VIFEE G+P
Sbjct: 660 CNP-------CTYQGGYSPSMCQLGCGEPSQQYYHVPTDWLMNGENEIVIFEEVYGNPEA 712
Query: 735 ITFSIRKI 742
IT R I
Sbjct: 713 ITLVQRVI 720
>gi|297724143|ref|NP_001174435.1| Os05g0428100 [Oryza sativa Japonica Group]
gi|75137607|sp|Q75HQ3.1|BGAL7_ORYSJ RecName: Full=Beta-galactosidase 7; Short=Lactase 7; Flags:
Precursor
gi|46391137|gb|AAS90664.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|53981746|gb|AAV25023.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|255676388|dbj|BAH93163.1| Os05g0428100 [Oryza sativa Japonica Group]
Length = 775
Score = 504 bits (1299), Expect = e-140, Method: Compositional matrix adjust.
Identities = 293/731 (40%), Positives = 382/731 (52%), Gaps = 102/731 (13%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+TYD R+L+++G R + S +HY RS P MWP L+ +AK GG++ I++YVFWN HE
Sbjct: 29 ITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVHEPIQ 88
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G+Y F GR++LVKFI+ IQ +Y+ LRIGPFV AE+ YGG P WLH +P FR+D EP
Sbjct: 89 GQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRSDNEP 148
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK HMQ F+T IV MMK E L+ QGGPII++Q+ENEY E +G G RY WAA MA
Sbjct: 149 FKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWAAAMA 208
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQ--FTPHSPSMPKIWTENWPGW------- 258
V GVPW+MC+Q D PDPVINTCN C + P+SP+ P +WTENW
Sbjct: 209 VGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRSNGQNNS 268
Query: 259 ---FKTFGGRDPHRPSEDIAFSVARFF-QKGGSVHNYYMYHGGTNFGRTAGGPFITTSYD 314
+ +G R EDIAF+VA F +K GS +YYMYHGGTNFGR A ++TTSY
Sbjct: 269 AFSYPIYGNDTKLRAPEDIAFAVALFIARKKGSFVSYYMYHGGTNFGRFAAS-YVTTSYY 327
Query: 315 YEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACA 374
AP+DEY C
Sbjct: 328 DGAPLDEYDF-----------------------------------------------KCV 340
Query: 375 AFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQP 434
AFL N D N V FRN+S L S+S+L DC+ VVF TA V AQ + N
Sbjct: 341 AFLVNFDQHNTPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNAQHGSRT---ANAVQ 397
Query: 435 SEASPDNGSKGLKWQVFKE-IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNEN 493
S +N W+ F E + ++ + + + + TTKD TDYLWY IV+
Sbjct: 398 SLNDINN------WKAFIEPVPQDLSKSTYTGNQLFEQLTTTKDETDYLWY----IVSYK 447
Query: 494 EEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNP-ISLKAGKNEIAL 552
L ++S H LHAF N E GS G+ P N +SLK G N I+L
Sbjct: 448 NRASDGNQIAHLYVKSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEGDNTISL 507
Query: 553 LSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYR 612
LS+ VG ++G + E GI +V I L+ W Y++GL GE IY
Sbjct: 508 LSVMVGSPDSGAYMERRTFGIQTVGIQQGQQPMHLLNNDLWGYQVGLFGEKDSIYTQEGT 567
Query: 613 NNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKS 672
N++ W+ + PLTWYK PPG++ + L++ MGKG W+NGE IGRYW
Sbjct: 568 NSVRWMD-INNLIYHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEVWVNGESIGRYWVSFK 626
Query: 673 RKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDP 732
S G+PSQ YHIPR + P +N+LV+ EE GGDP
Sbjct: 627 APS-------------------------GQPSQSLYHIPRGFLTPKDNLLVLVEEMGGDP 661
Query: 733 TKITFSIRKIS 743
+IT + ++
Sbjct: 662 LQITVNTMSVT 672
>gi|449526237|ref|XP_004170120.1| PREDICTED: beta-galactosidase 7-like, partial [Cucumis sativus]
Length = 706
Score = 498 bits (1282), Expect = e-138, Method: Compositional matrix adjust.
Identities = 261/571 (45%), Positives = 345/571 (60%), Gaps = 24/571 (4%)
Query: 181 VENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF 240
+ENE+G E YG+ GK Y W A++A + N+ PWIMCQQ D P P+INTCN FYCDQF
Sbjct: 1 IENEFGNVEGSYGQEGKEYVKWCAELAQSYNLSEPWIMCQQGDAPQPIINTCNGFYCDQF 60
Query: 241 TPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNF 300
P++ + PK+WTE+W GWFK +G RDP+R +ED+AF+VARFFQ GGS+HNYYMYHGGTNF
Sbjct: 61 KPNNKNSPKMWTESWAGWFKGWGERDPYRTAEDLAFAVARFFQYGGSLHNYYMYHGGTNF 120
Query: 301 GRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSS 360
GR+AGGP+ITTSYDY AP+DEYG PKWGHLK+LH I+ E L G+ ++ G S
Sbjct: 121 GRSAGGPYITTSYDYNAPLDEYGNMNQPKWGHLKQLHELIRSMEKVLTYGDVKHIDTGHS 180
Query: 361 QEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRA 420
A Y G + F N ++ +D+ + F+ Y +P WSV++LPDCK V+NTA V
Sbjct: 181 TTATSYT-YKGKSSCFFGNPEN-SDREITFQERKYTVPGWSVTVLPDCKTEVYNTAKVNT 238
Query: 421 QSSTVEMVPENLQPSEASPDNGSKGLKWQVFKE-IAGIWGEADFVKSG-----FVDHINT 474
Q++ EMVP + + K LKWQ E I + E D S +D
Sbjct: 239 QTTIREMVPSLVGKHK-------KPLKWQWRNEKIEHLTHEGDISGSAITANSLIDQKMV 291
Query: 475 TKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPP 534
T D++DYLWY T +N N+ G R L ++++GH LHAF N + G+ G
Sbjct: 292 TNDSSDYLWYLTGFHLNGNDPLF--GKRVTLRVKTRGHILHAFVNNKHIGTQFGPYGKYS 349
Query: 535 FKYKNPI-SLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYS 592
F + + +L+ G N+IALLS TVGL N G +YE V GI V++ DLST
Sbjct: 350 FTLEKKVRNLRHGFNQIALLSATVGLPNYGAYYENVEVGIYGPVELIADGKTIRDLSTNE 409
Query: 593 WTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKM 652
W YK+GL GE ++P ++ W+S P NQ TWYK P G E + +D++ M
Sbjct: 410 WIYKVGLDGEKYEFFDPDHKFRKPWLSN-NLPLNQNFTWYKTSFSTPKGREGVVVDLMGM 468
Query: 653 GKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPR 712
GKG AW+NG+ IGRYWP + + C CDYRG + KC T CG+P+QRWYHIPR
Sbjct: 469 GKGQAWVNGKSIGRYWP---SYLATENGCSSSCDYRGAYYGSKCATNCGKPTQRWYHIPR 525
Query: 713 SWFKP-SENILVIFEEKGGDPTKITFSIRKI 742
S+ EN L++FEE GG P I ++
Sbjct: 526 SYMNDGKENTLILFEEFGGMPLNIEIKTTRV 556
>gi|414888319|tpg|DAA64333.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
gi|414888320|tpg|DAA64334.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
Length = 592
Score = 497 bits (1280), Expect = e-138, Method: Compositional matrix adjust.
Identities = 244/535 (45%), Positives = 345/535 (64%), Gaps = 12/535 (2%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD RSL+I+G+R+L S AIHYPRS P +WP L+++AKEGG+NTIE+Y+FWN HE P
Sbjct: 36 VTYDGRSLMIDGKRDLFFSGAIHYPRSPPEVWPKLIERAKEGGLNTIETYIFWNAHEPEP 95
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GKY F GRF+L+K++K+IQ+ MY I+RIGPF+ AE+N+GG+P WL I +FR + +P
Sbjct: 96 GKYNFEGRFDLIKYLKMIQEHDMYAIVRIGPFIQAEWNHGGLPYWLREIDHIIFRANNDP 155
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
+K M+KF+ IV +K +LFASQGGPIIL Q+ENEYG + + G +Y WAA+MA
Sbjct: 156 YKKEMEKFVRFIVQKLKDAELFASQGGPIILTQIENEYGNIKKDHATDGDKYLEWAAQMA 215
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
++ GVPWIMC+Q P VI TCN +C D +T + P +WTENW F+ +G +
Sbjct: 216 LSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWTLRDKNKPMLWTENWTQQFRAYGDQV 275
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
R +EDIA++V RFF KGGS+ NYYMYHGGTNFGRT G ++ T Y EAP+DEYG+ +
Sbjct: 276 AMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMYK 334
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDDKND 385
PK+GHL++LH I+ + A L G+ S+ LG EA ++ C +FL+N + D
Sbjct: 335 EPKFGHLRDLHNVIRSYQKAFLLGKHSSEILGHGYEAHIFELPEENLCLSFLSNNNTGED 394
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
TV+FR +++P+ SVSIL CK VV+NT V Q + + S + + SK
Sbjct: 395 GTVIFRGEKHYVPSRSVSILAGCKNVVYNTKRVFVQHN---------ERSYHTSEVTSKN 445
Query: 446 LKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVL 505
+W+++ E + + ++ N TKD +DYLWYTTS + ++ +N RPVL
Sbjct: 446 NQWEMYSEKIPKYRDTKVRMKEPLEQFNQTKDASDYLWYTTSFRLESDDLPFRNDIRPVL 505
Query: 506 LIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQ 560
++S H++ FAN G A G+ F ++ P+ LK G N + LLS T+G++
Sbjct: 506 QVKSSAHSMMGFANDAFVGCARGSKQVKGFMFEKPVDLKVGVNHVVLLSSTMGMK 560
>gi|2924512|emb|CAA17766.1| beta-galactosidase-like protein [Arabidopsis thaliana]
gi|7270452|emb|CAB80218.1| beta-galactosidase-like protein [Arabidopsis thaliana]
Length = 831
Score = 491 bits (1264), Expect = e-136, Method: Compositional matrix adjust.
Identities = 286/728 (39%), Positives = 397/728 (54%), Gaps = 93/728 (12%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD SLII+G+REL+ S +IHYPRS P MWP ++++AK+GG+NTI++YVFWN HE
Sbjct: 54 VTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHEPQQ 113
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GK+ F GR +LVKFIK+IQ+ MY+ LR+GPF+ AE+ +G I + H +R
Sbjct: 114 GKFNFSGRADLVKFIKLIQKNGMYVTLRLGPFIQAEWTHGYITRYDHKNIAGAYR----- 168
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
++ENEY + Y + G Y WA+ +
Sbjct: 169 --------------------------------KIENEYSAVQRAYKQDGLNYIKWASNLV 196
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFT-PHSPSMPKIWTENWPGWFKTFGGR 265
+ +G+PW+MC+Q D PDP+IN CN +C D F P+ + P +WTENW F+ FG
Sbjct: 197 DSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNRENKPSLWTENWTTQFRVFGDP 256
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
R EDIA+SVARFF K G+ NYYMYHGGTNFGRT+ ++TT Y +AP+DEYGL
Sbjct: 257 PTQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYDDAPLDEYGLE 315
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD-SSGACAAFLANMDDKN 384
+ PK+GHLK LH A+ LC+ LL G+ G E Y + CAAFLAN + +
Sbjct: 316 KEPKYGHLKHLHNALNLCKKPLLWGQPKTEKPGKDTEIRYYEQPGTKTCAAFLANNNTEA 375
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
+T+ F+ Y + S+SILPDCK VV+NTA + +Q ++ N S+ + +K
Sbjct: 376 AETIKFKGREYVIAPRSISILPDCKTVVYNTAQIVSQHTS-----RNFMKSKKA----NK 426
Query: 445 GLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
++VF E E + V+ TKD TDY WYTTS V++N K G +
Sbjct: 427 KFDFKVFTETLPSKLEGNSYIP--VELYGLTKDKTDYGWYTTSFKVHKNHLPTKKGVKTF 484
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
+ I S GHALHA+ N E GS G+ F ++ ++LKAG+N + +L + G ++G
Sbjct: 485 VRIASLGHALHAWLNGEYLGSGHGSHEEKSFVFQKQVTLKAGENHLVMLGVLTGFPDSGS 544
Query: 565 FYEWVGAGITSVKITGFNSGTLDLSTYS-WTYKIGLQGEHLGIYNPGYRNNINWVS-TME 622
+ E G + I G SGTLDL+ S W KIG++GE LGI+ + W T +
Sbjct: 545 YMEHRYTGPRGISILGLTSGTLDLTESSKWGNKIGMEGEKLGIHTEEGLKKVEWKKFTGK 604
Query: 623 PPKNQPLTWYKAVVKQ----------PPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKS 672
P LTWY+ K+ P + M MGKGL W+NGE +GRYW
Sbjct: 605 APG---LTWYQKFSKECETLQTYFDAPESVSAATIRMHGMGKGLIWVNGEGVGRYW---- 657
Query: 673 RKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGG-D 731
++ G+P+Q YHIPRS+ KP +N+LVIFEE+
Sbjct: 658 ---------------------QSFLSPLGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVK 696
Query: 732 PTKITFSI 739
P + F+I
Sbjct: 697 PELMDFAI 704
>gi|110739914|dbj|BAF01862.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 578
Score = 484 bits (1246), Expect = e-134, Method: Compositional matrix adjust.
Identities = 236/474 (49%), Positives = 316/474 (66%), Gaps = 20/474 (4%)
Query: 274 IAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHL 333
+AF VARF QKGGS NYYMYHGGTNFGRTAGGPF+TTSYDY+APIDEYGL R PK+GHL
Sbjct: 1 LAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYGLIRQPKYGHL 60
Query: 334 KELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNV 393
KELH AIK+CE AL++ + S+G+ Q+A VY+ SG C+AFLAN D ++ V+F NV
Sbjct: 61 KELHRAIKMCEKALVSADPVVTSIGNKQQAHVYSAESGDCSAFLANYDTESAARVLFNNV 120
Query: 394 SYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKE 453
Y+LP WS+SILPDC+ VFNTA V Q+S +EM+P + +K +W+ + E
Sbjct: 121 HYNLPPWSISILPDCRNAVFNTAKVGVQTSQMEMLPTD-----------TKNFQWESYLE 169
Query: 454 -IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGH 512
++ + + F G ++ IN T+DT+DYLWY TS+ + ++E FL G P L+I+S GH
Sbjct: 170 DLSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSVDIGDSESFLHGGELPTLIIQSTGH 229
Query: 513 ALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAG 572
A+H F N +L GSA G + F Y+ I+L +G N IALLS+ VGL N G +E G
Sbjct: 230 AVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSVAVGLPNVGGHFESWNTG 289
Query: 573 ITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV-STMEPPKNQPLT 630
I V + G + G +DLS WTY++GL+GE + + P +I W+ +++ K QPLT
Sbjct: 290 ILGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMNLAFPTNTPSIGWMDASLTVQKPQPLT 349
Query: 631 WYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGK 690
W+K P G+EP+ LDM MGKG W+NGE IGRYW + H C Y G
Sbjct: 350 WHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAFATGDCSH------CSYTGT 403
Query: 691 FNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKISG 744
+ P+KC TGCG+P+QRWYH+PR+W KPS+N+LVIFEE GG+P+ ++ R +SG
Sbjct: 404 YKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSG 457
>gi|115445061|ref|NP_001046310.1| Os02g0219200 [Oryza sativa Japonica Group]
gi|113535841|dbj|BAF08224.1| Os02g0219200, partial [Oryza sativa Japonica Group]
Length = 500
Score = 482 bits (1241), Expect = e-133, Method: Compositional matrix adjust.
Identities = 242/522 (46%), Positives = 318/522 (60%), Gaps = 25/522 (4%)
Query: 220 QQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVA 279
+Q D PDPVINTCN FYCD F+P+ P +WTE W GWF +FGG PHRP ED+AF+VA
Sbjct: 1 KQDDAPDPVINTCNGFYCDYFSPNKNYKPSMWTEAWTGWFTSFGGGVPHRPVEDLAFAVA 60
Query: 280 RFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGA 339
RF QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+APIDE+GL R PKWGHL++LH A
Sbjct: 61 RFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLLRQPKWGHLRDLHRA 120
Query: 340 IKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPA 399
IK E L++ + + S+GS ++A V+ +GACAAFL+N V F Y+LPA
Sbjct: 121 IKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNTAVKVRFNGQQYNLPA 180
Query: 400 WSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWG 459
WS+SILPDCK VFNTA V+ + +M P WQ + E
Sbjct: 181 WSISILPDCKTAVFNTATVKEPTLMPKMNP-------------VVRFAWQSYSEDTNSLS 227
Query: 460 EADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFAN 519
++ F K G V+ ++ T D +DYLWYTT + + N+ L++G P L + S GH++ F N
Sbjct: 228 DSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTND--LRSGQSPQLTVYSAGHSMQVFVN 285
Query: 520 QELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYE-WVGAGITSVKI 578
+ GS G +P Y + + G N+I++LS VGL N G +E W + V +
Sbjct: 286 GKSYGSVYGGYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFENWNVGVLGPVTL 345
Query: 579 TGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQ 638
+ N GT DLS WTY++GL+GE LG++ + + W P QPLTW+KA
Sbjct: 346 SSLNGGTKDLSHQKWTYQVGLKGETLGLHTVTGSSAVEWGG---PGGYQPLTWHKAFFNA 402
Query: 639 PPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCIT 698
P G++P+ LDM MGKG W+NG +GRYW K+ C Y G ++ DKC +
Sbjct: 403 PAGNDPVALDMGSMGKGQLWVNGHHVGRYWSYKASGG------CGGCSYAGTYHEDKCRS 456
Query: 699 GCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
CG+ SQRWYH+PRSW KP N+LV+ EE GGD ++ + R
Sbjct: 457 NCGDLSQRWYHVPRSWLKPGGNLLVVLEEYGGDLAGVSLATR 498
>gi|33521216|gb|AAQ21370.1| beta-galactosidase [Sandersonia aurantiaca]
Length = 568
Score = 477 bits (1227), Expect = e-131, Method: Compositional matrix adjust.
Identities = 233/477 (48%), Positives = 306/477 (64%), Gaps = 22/477 (4%)
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
PHRP+EDIAF+VARF QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+APIDEYGL R
Sbjct: 1 PHRPAEDIAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLR 60
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDK 386
PKWGHL++LH AIKLCE AL++G+ + S+G Q++ V+ +GACAAFL+N D +
Sbjct: 61 EPKWGHLRDLHRAIKLCEPALVSGDPTVTSIGHYQQSHVFRSKAGACAAFLSNYDSGSYA 120
Query: 387 TVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGL 446
VVF + Y +P WS+SILPDCK VFNTA + AQ+S ++M +
Sbjct: 121 RVVFNGIHYDIPPWSISILPDCKTTVFNTARIGAQTSQLKM-------------EWAGKF 167
Query: 447 KWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLL 506
W+ + E + + F K G V+ I+ T+D TDYLWYTT + + ENE FLKNG PVL
Sbjct: 168 SWESYNEDTNSFDDRSFTKVGLVEQISMTRDNTDYLWYTTYVNIGENEGFLKNGHYPVLT 227
Query: 507 IESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFY 566
+ S GH++H + N +L G+ G +P Y + L AG N+I++LS+ VGL N G +
Sbjct: 228 VNSAGHSMHIYINGQLTGTIYGALENPKLTYTGSVKLWAGSNKISILSVAVGLPNIGGHF 287
Query: 567 E-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPK 625
E W + V ++G N G DLS W Y+IGL+GE L ++ +++ W P +
Sbjct: 288 ETWNTGVLGPVTLSGLNEGKRDLSWQKWIYQIGLKGEALNLHTLSGSSSVEWGG---PSQ 344
Query: 626 NQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQEC 685
Q LTWYK P G++P+ LDM MGKG W+NG+ +GRYWP S C
Sbjct: 345 KQSLTWYKTSFNAPAGNDPLALDMGSMGKGQVWINGQSVGRYWPAYKASGS-----CGGC 399
Query: 686 DYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
DYRG +N KC + CGE +QRWYH+PRSW P+ N+LV+FEE GGDP+ I+ RK+
Sbjct: 400 DYRGTYNEKKCQSNCGESTQRWYHVPRSWLNPTGNLLVVFEEWGGDPSGISMVRRKV 456
>gi|413954365|gb|AFW87014.1| beta-galactosidase [Zea mays]
Length = 473
Score = 471 bits (1213), Expect = e-130, Method: Compositional matrix adjust.
Identities = 239/492 (48%), Positives = 298/492 (60%), Gaps = 22/492 (4%)
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTE W GWF FGG PHRP ED+AF+VARF QKGGS NYYMYHGGTNF RT+GGPFI
Sbjct: 1 MWTEAWTGWFTAFGGAVPHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFI 60
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADS 369
TSYDY+APIDEYGL R PKWGHL++LH AIK E AL++G+ + SLG+ ++A V+ S
Sbjct: 61 ATSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSS 120
Query: 370 SGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVP 429
GACAAFL+N VVF Y LPAWS+S+LPDCK VFNTA V S+ M P
Sbjct: 121 GGACAAFLSNYHTSAAARVVFNGRRYDLPAWSISVLPDCKAAVFNTATVSEPSAPARMSP 180
Query: 430 ENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSII 489
+ G WQ + E F K G V+ ++ T D +DYLWYTT +
Sbjct: 181 -------------AGGFSWQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVN 227
Query: 490 VNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNE 549
+N NE+FLK+G P L I S GH+L F N + G+ G P Y + + G N+
Sbjct: 228 INSNEQFLKSGQWPQLTIYSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNK 287
Query: 550 IALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYN 608
I++LS VGL N G YE G+ V ++G N G DLS WTY+IGL GE LG+ +
Sbjct: 288 ISILSAAVGLPNQGTHYETWNVGVLGPVTLSGLNEGKRDLSDQKWTYQIGLHGESLGVQS 347
Query: 609 PGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYW 668
+++ W S QPLTW+KA P GD P+ LDM MGKG AW+NG IGRYW
Sbjct: 348 VAGSSSVEWGSA---AGKQPLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYW 404
Query: 669 PRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEK 728
K+ S C Y G ++ KC TGCG+ SQR+YH+PRSW PS N+LV+ EE
Sbjct: 405 SYKASSSG-----CGGCSYAGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVMLEEF 459
Query: 729 GGDPTKITFSIR 740
GGD + + R
Sbjct: 460 GGDLSGVKLVTR 471
>gi|15027869|gb|AAK76465.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 621
Score = 467 bits (1202), Expect = e-128, Method: Compositional matrix adjust.
Identities = 240/541 (44%), Positives = 331/541 (61%), Gaps = 27/541 (4%)
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGR 265
MA + +IGVPW+MCQQ + P P++ TCN FYCDQ+ P +PS PK+WTENW GWFK +GG+
Sbjct: 1 MANSLDIGVPWLMCQQPNAPQPMLETCNGFYCDQYEPTNPSTPKMWTENWTGWFKNWGGK 60
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
P+R +ED+AFSVARFFQ GG+ NYYMYHGGTNFGR AGGP+ITTSYDY AP+DE+G
Sbjct: 61 HPYRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTSYDYHAPLDEFGNL 120
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKND 385
PKWGHLK+LH +K E +L G S + LG+S +A +Y G+ + F+ N++ D
Sbjct: 121 NQPKWGHLKQLHTVLKSMEKSLTYGNISRIDLGNSIKATIYTTKEGS-SCFIGNVNATAD 179
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
V F+ YH+PAWSVS+LPDC K +NTA V Q+S M ++ +P
Sbjct: 180 ALVNFKGKDYHVPAWSVSVLPDCDKEAYNTAKVNTQTSI--MTEDSSKPER--------- 228
Query: 446 LKWQVFKEIAG---IWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
L+W E A + G D + G VD + T D +DYLWY T + +++ +
Sbjct: 229 LEWTWRPESAQKMILKGSGDLIAKGLVDQKDVTNDASDYLWYMTRLHLDKKDPLWSRNM- 287
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPIS-LKAGKNEIALLSMTVGLQN 561
L + S H LHA+ N + G+ ++++ ++ L G N I+LLS++VGLQN
Sbjct: 288 -TLRVHSNAHVLHAYVNGKYVGNQFVKDGKFDYRFERKVNHLVHGTNHISLLSVSVGLQN 346
Query: 562 AGPFYEWVGAGITS-VKITGFNSGTL---DLSTYSWTYKIGLQGEHLGIYNPGYRNNINW 617
GPF+E GI V + G+ DLS + W YKIGL G + +++ + W
Sbjct: 347 YGPFFESGPTGINGPVSLVGYKGEETIEKDLSQHQWDYKIGLNGYNDKLFSIKSVGHQKW 406
Query: 618 VSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSP 677
+ + P + LTWYKA K P G EP+ +D+ +GKG AW+NG+ IGRYWP +S
Sbjct: 407 ANE-KLPTGRMLTWYKAKFKAPLGKEPVIVDLNGLGKGEAWINGQSIGRYWP---SFNSS 462
Query: 678 HDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPS-ENILVIFEEKGGDPTKIT 736
D C +CDYRG + DKC CG+P+QRWYH+PRS+ S N + +FEE GG+P+ +
Sbjct: 463 DDGCKDKCDYRGAYGSDKCAFMCGKPTQRWYHVPRSFLNASGHNTITLFEEMGGNPSMVN 522
Query: 737 F 737
F
Sbjct: 523 F 523
>gi|330804272|ref|XP_003290121.1| hypothetical protein DICPUDRAFT_48969 [Dictyostelium purpureum]
gi|325079786|gb|EGC33370.1| hypothetical protein DICPUDRAFT_48969 [Dictyostelium purpureum]
Length = 735
Score = 462 bits (1189), Expect = e-127, Method: Compositional matrix adjust.
Identities = 275/731 (37%), Positives = 392/731 (53%), Gaps = 63/731 (8%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE-L 85
N+TYD RSLIING R+L++S ++HYPR+ W +++ +K GV+ IE+Y+FWN H+
Sbjct: 41 NITYDHRSLIINGERKLLVSGSVHYPRASVSKWNEILKSSKLAGVDIIETYIFWNVHQPN 100
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
+P ++Y N+ F+ + ++ +++ LRIGP+V AE+NYGG P+WL I G VFR+
Sbjct: 101 TPNEFYLEDNANITLFLDLCKENELFVNLRIGPYVCAEWNYGGFPIWLKNIEGIVFRDYN 160
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
+PF M ++T++VD K + FA GGPII+AQ+ENEYG+ E+ YG G+ YALWA
Sbjct: 161 QPFMDAMSTWVTMVVD--KLQDYFAPNGGPIIIAQIENEYGWLENEYGASGREYALWAIN 218
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYC----DQFTPHSPSMPKIWTENWPGWFKT 261
A + NIG+PWIMC Q D D INTCN FYC D+ P P WTENW GWF+
Sbjct: 219 FAKSLNIGIPWIMCAQEDI-DSAINTCNGFYCHDWIDRHWNAFPDQPAFWTENWVGWFEN 277
Query: 262 FGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDE 321
+G P RP +D+ FS ARF GGS+ NYYM+ GGTNFGR+ GGP+I TSY+Y+AP+DE
Sbjct: 278 WGQAVPKRPVQDMLFSSARFIAYGGSLFNYYMWFGGTNFGRSVGGPWIITSYEYDAPLDE 337
Query: 322 YGLPRNPKWGHLKELHGAIKLCEHALLNGE-RSNLSLGSSQEADVYADSSGACAAFLANM 380
+G P PK+ + H I E ++ + + + L + EA Y G FL N
Sbjct: 338 FGFPNEPKYSMSTQFHFVIHKYESIIMGMDPPTPVPLSNISEAHPY----GEDLVFLTNF 393
Query: 381 DDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPD 440
D + ++ +Y L WSV I+ VVF+T+ V E + + +
Sbjct: 394 GLVIDY-IQWQGTNYTLQPWSVVIVY-SGSVVFDTSYVPD-----EYIKPSTRDQFKDVP 446
Query: 441 NGSKGLKWQVFKEIAGIWGEADFVKSGFV------DHINTTKDTTDYLWYTTSIIVNENE 494
N F E WG++D + + + IN T DTTDYLWYTT+I +NE
Sbjct: 447 NAINYDSILSFSE----WGQSDIINDCIINNESPLEQINLTNDTTDYLWYTTNITLNE-- 500
Query: 495 EFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN-EIALL 553
L IE+ H F N G+ GNG P Y N ++ +L
Sbjct: 501 -------TTTLTIENMYDFCHVFLN----GAYQGNG-WSPVAYITLEPTNGNINYQLQIL 548
Query: 554 SMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRN 613
+MT+GL+N E G+ + + G +++ W+ K G+ GE L IYN +
Sbjct: 549 TMTMGLENYAAHMESYSRGL----LGSISLGQTNITNNQWSMKPGILGEKLQIYNEYSSS 604
Query: 614 NINWVSTMEPPKNQPLTWYKAVV-----KQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYW 668
+NW P Q +TWY+ + P L+M M KG ++NG IGRY+
Sbjct: 605 KVNW-QPYNPSATQSMTWYQFNISLDGLSSDPSSNAYVLNMTSMNKGFVYVNGFNIGRYF 663
Query: 669 PRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSEN----ILVI 724
++ +S+ C + DY G + P C EPSQ YHIP W ++ +++
Sbjct: 664 LMEATQSN----CTLKQDYIGIYTPSNNRIDCNEPSQSLYHIPLDWLFLQQDKQYATVIL 719
Query: 725 FEEKGGDPTKI 735
FEE GDPTKI
Sbjct: 720 FEEVNGDPTKI 730
>gi|298205211|emb|CBI17270.3| unnamed protein product [Vitis vinifera]
Length = 1064
Score = 460 bits (1183), Expect = e-126, Method: Compositional matrix adjust.
Identities = 213/362 (58%), Positives = 272/362 (75%), Gaps = 4/362 (1%)
Query: 8 APFALLIFFSSSITY--CFAG-NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQ 64
A FA L+ FS +I FA NV+YD R+L+I+G+R +++SA IHYPR+ P MWP L+
Sbjct: 6 ALFAALLCFSLTIQLGVSFAPFNVSYDHRALLIDGKRRMLVSAGIHYPRATPEMWPDLIA 65
Query: 65 QAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEY 124
++KEGG + I++YVFWNGHE +Y F GR+++VKF+K++ + +Y+ LRIGP+V AE+
Sbjct: 66 KSKEGGADVIQTYVFWNGHEPVRRQYNFEGRYDIVKFVKLVGSSGLYLHLRIGPYVCAEW 125
Query: 125 NYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENE 184
N+GG PVWL IPG FR D PFK MQ+F+ IVD+M++E LF+ QGGPII+ Q+ENE
Sbjct: 126 NFGGFPVWLRDIPGIEFRTDNAPFKDEMQRFVKKIVDLMQKEMLFSWQGGPIIMLQIENE 185
Query: 185 YGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHS 244
YG ES +G+ GK Y WAA+MA+ + GVPW+MCQQ D PD +IN CN FYCD F P+S
Sbjct: 186 YGNVESSFGQRGKDYVKWAARMALELDAGVPWVMCQQADAPDIIINACNGFYCDAFWPNS 245
Query: 245 PSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTA 304
+ PK+WTE+W GWF ++GGR P RP EDIAF+VARFFQ+GGS HNYYMY GGTNFGR++
Sbjct: 246 ANKPKLWTEDWNGWFASWGGRTPKRPVEDIAFAVARFFQRGGSFHNYYMYFGGTNFGRSS 305
Query: 305 GGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSN-LSLGSSQEA 363
GGPF TSYDY+APIDEYGL PKWGHLKELH AIKLCE AL+ + + LG QE
Sbjct: 306 GGPFYVTSYDYDAPIDEYGLLSQPKWGHLKELHAAIKLCEPALVAVDSPQYIKLGPMQEV 365
Query: 364 DV 365
V
Sbjct: 366 GV 367
Score = 301 bits (770), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 169/386 (43%), Positives = 214/386 (55%), Gaps = 35/386 (9%)
Query: 365 VYADSSG---ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQ 421
+Y+ SG +C+AFLAN+D+ +V F Y LP WSVSILPDC+ VFNTA V AQ
Sbjct: 576 LYSTQSGNGSSCSAFLANIDEHKTASVTFLGQIYKLPPWSVSILPDCRTTVFNTAKVGAQ 635
Query: 422 SST----VEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKD 477
+S + VP+ W KE +W E +F G ++H+N TKD
Sbjct: 636 TSIKTNKISYVPKT----------------WMTLKEPISVWSENNFTIQGVLEHLNVTKD 679
Query: 478 TTDYLWYTTSIIVN-ENEEFLK-NGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPF 535
+DYLW T I V+ E+ F + N P L I+S LH F N +L GS G+
Sbjct: 680 HSDYLWRITRINVSAEDISFWEENQVSPTLSIDSMRDILHIFVNGQLIGSVIGHWV---- 735
Query: 536 KYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGIT-SVKITGFNSGTLDLSTYSWT 594
K PI L G N++ LLS TVGLQN G F E GAG VK+TGF +G +DLS YSWT
Sbjct: 736 KVVQPIQLLQGYNDLVLLSQTVGLQNYGAFLEKDGAGFKGQVKLTGFKNGEIDLSEYSWT 795
Query: 595 YKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGK 654
Y++GL+GE IY W TWYK P G+ P+ LD+ MGK
Sbjct: 796 YQVGLRGEFQKIYMIDESEKAEWTDLTPDASPSTFTWYKTFFDAPNGENPVALDLGSMGK 855
Query: 655 GLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSW 714
G AW+NG IGRYW R +P D C +CDYRG ++ KC T CG P+Q WYHIPRSW
Sbjct: 856 GQAWVNGHHIGRYWTR----VAPKDGC-GKCDYRGHYHTSKCATNCGNPTQIWYHIPRSW 910
Query: 715 FKPSENILVIFEEKGGDPTKITFSIR 740
+ S N+LV+FEE GG P +I+ R
Sbjct: 911 LQASNNLLVLFEETGGKPFEISVKSR 936
>gi|66808929|ref|XP_638187.1| glycoside hydrolase family 35 protein [Dictyostelium discoideum
AX4]
gi|74853739|sp|Q54MV6.1|BGAL2_DICDI RecName: Full=Probable beta-galactosidase 2; Short=Lactase 2;
Flags: Precursor
gi|60466604|gb|EAL64656.1| glycoside hydrolase family 35 protein [Dictyostelium discoideum
AX4]
Length = 761
Score = 446 bits (1147), Expect = e-122, Method: Compositional matrix adjust.
Identities = 274/751 (36%), Positives = 408/751 (54%), Gaps = 67/751 (8%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE-LS 86
VTYD RSLIING R+L+ S +IHYPR+ MWP +++Q+K+ G++ I++Y+FWN H+ S
Sbjct: 40 VTYDGRSLIINGERKLLFSGSIHYPRTSEEMWPIILKQSKDAGIDIIDTYIFWNIHQPNS 99
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
P +YYF G N+ KF+ + ++ +Y+ LRIGP+V AE+ YGG P+WL IP V+R+ +
Sbjct: 100 PSEYYFDGNANITKFLDLCKEFDLYVNLRIGPYVCAEWTYGGFPIWLKEIPNIVYRDYNQ 159
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
+ M +M +V + + FA GGPIILAQVENEYG+ E YG G YA W+
Sbjct: 160 QWMNEMSIWMEFVVKYL--DNYFAPNGGPIILAQVENEYGWLEQEYGINGTEYAKWSIDF 217
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFTPH---SPSMPKIWTENWPGWFKTF 262
A + NIG+PWIMCQQ D + INTCN +YC D + H P+ P WTENW GWF+ +
Sbjct: 218 AKSLNIGIPWIMCQQNDI-ESAINTCNGYYCHDWISSHWEQFPNQPSFWTENWIGWFENW 276
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
G P RP +DI +S ARF GGS+ NYYM+ GGTNFGRT+GGP+I TSYDY+AP+DE+
Sbjct: 277 GQAKPKRPVQDILYSNARFIAYGGSLINYYMWFGGTNFGRTSGGPWIITSYDYDAPLDEF 336
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
G P PK+ + H + E LLN + SQ +V+ G +F+ N
Sbjct: 337 GQPNEPKFSLSSKFHQVLHAIESDLLNNQPPKSPTFLSQFIEVH--QYGINLSFITNYGT 394
Query: 383 KND-KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDN 441
K + + N +Y + WSV I+ + +++F+T+ +P N + + +N
Sbjct: 395 STTPKIIQWMNQTYTIQPWSVLIIYN-NEILFDTS----------FIPPNTLFNNNTINN 443
Query: 442 GSKGLKWQVFKEIAGIWGEADFVKSGF----------------VDHINTTKDTTDYLWYT 485
K + + + I I +DF + ++ + TKDT+DY WY+
Sbjct: 444 -FKPINQNIIQSIFQI---SDFNLNSGGGGGDGDGNSVNSVSPIEQLLITKDTSDYCWYS 499
Query: 486 TSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKA 545
T+ + + + + G+ + + E + +H F + E QGSA NPI+ +
Sbjct: 500 TN-VTTTSLSYNEKGNIFLTITEFYDY-VHIFIDNEYQGSAFSPSLCQ--LQLNPIN-NS 554
Query: 546 GKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLG 605
++ +LSMT+GL+N E GI + G+ +L+ W K GL GE++
Sbjct: 555 TTFQLQILSMTIGLENYASHMENYTRGILGSILI----GSQNLTNNQWLMKSGLIGENIK 610
Query: 606 IYNPGYRNNINWVSTMEPPK----NQPLTWYK---AVVKQP--PGDEPIGLDMLKMGKGL 656
I+N N INW ++ +PLTWYK ++V P LDM M KG+
Sbjct: 611 IFNND--NTINWQTSPSSSSSSLIQKPLTWYKLNISLVGLPIDISSTVYALDMSSMNKGM 668
Query: 657 AWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSW-F 715
W+NG IGRYW ++ +S + ++ Y G+++P C +PSQ Y +P W F
Sbjct: 669 IWVNGYSIGRYWLIEATQSICNQSAIENYSYIGEYDPSNYRIDCNKPSQSIYSVPIDWLF 728
Query: 716 KPSEN----ILVIFEEKGGDPTKITFSIRKI 742
+ N ++I EE G+P +I KI
Sbjct: 729 NNNYNNQYATIIIIEELNGNPNEIQLLSNKI 759
>gi|449468694|ref|XP_004152056.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 338
Score = 444 bits (1142), Expect = e-122, Method: Compositional matrix adjust.
Identities = 200/332 (60%), Positives = 253/332 (76%), Gaps = 1/332 (0%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
L+ + +T+C NV+YDS +LIING R +I S +IHYPRS MWP L+Q+AK+GG++
Sbjct: 7 LVATLACLTFCIGDNVSYDSNALIINGERRIIFSGSIHYPRSTEAMWPDLIQKAKDGGLD 66
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
IE+Y+FW+ HE KY F GR + +KF ++IQ A +Y+++RIGP+V AE+NYGG PVW
Sbjct: 67 AIETYIFWDRHEPQRRKYDFSGRLDFIKFFQLIQDAGLYVVMRIGPYVCAEWNYGGFPVW 126
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYES-F 191
LH +PG R + + +K MQ F T IV+M K+ LFASQGGPIILAQ+ENEYG +
Sbjct: 127 LHNMPGIQLRTNNQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPA 186
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
YG+ GK Y W A+MA + NIGVPWIMCQQ D P P+INTCN FYCD FTP++P PK++
Sbjct: 187 YGDAGKAYINWCAQMAESLNIGVPWIMCQQSDAPQPMINTCNGFYCDNFTPNNPKSPKMF 246
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
TENW GWFK +G +DP+R +ED+AFSVARFFQ GG +NYYMYHGGTNFGRT+GGPFITT
Sbjct: 247 TENWVGWFKKWGDKDPYRTAEDVAFSVARFFQSGGVFNNYYMYHGGTNFGRTSGGPFITT 306
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLC 343
SYDY AP+DEYG PKWGHLK+LH +I +C
Sbjct: 307 SYDYNAPLDEYGNLNQPKWGHLKQLHASIXIC 338
>gi|449436076|ref|XP_004135820.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 486
Score = 433 bits (1114), Expect = e-118, Method: Compositional matrix adjust.
Identities = 204/329 (62%), Positives = 252/329 (76%), Gaps = 6/329 (1%)
Query: 3 PRTPIAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGL 62
P+T + LL + S+I G+VTYD +++IINGRR ++IS +IHYPRS P MWP L
Sbjct: 2 PKTVLLFLCLLTWVCSTI-----GSVTYDHKAIIINGRRRILISGSIHYPRSTPQMWPDL 56
Query: 63 VQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAA 122
+Q+AK+GG++ IE+YVFWNGHE SPGKYYF R++LV+FIK++QQA +Y+ LRIGP+V A
Sbjct: 57 IQKAKDGGLDIIETYVFWNGHEPSPGKYYFEERYDLVRFIKLVQQAGLYVHLRIGPYVCA 116
Query: 123 EYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVE 182
E+NYGG P+WL ++PG FR D PFK MQKF+ IVDMMK EKLF +QGGPIIL+Q+E
Sbjct: 117 EWNYGGFPIWLKFVPGIAFRTDNAPFKAAMQKFVYKIVDMMKWEKLFHTQGGPIILSQIE 176
Query: 183 NEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTP 242
NEYG E G GK Y WAA+MAV GVPW+MC+Q D PDP+I+TCN FYC+ F P
Sbjct: 177 NEYGPVEWEIGAPGKSYTKWAAQMAVGLKTGVPWVMCKQEDAPDPLIDTCNGFYCENFKP 236
Query: 243 HSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGR 302
+ PKIWTENW GW+ FGG P+RP ED+AFSVARF Q GGS+ NYYMYHGGTNFGR
Sbjct: 237 NQIYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQNGGSLVNYYMYHGGTNFGR 296
Query: 303 TAGGPFITTSYDYEAPIDEYGLPRNPKWG 331
T+ G F+TTSYD++APIDEYGL R P G
Sbjct: 297 TS-GLFVTTSYDFDAPIDEYGLLREPILG 324
Score = 152 bits (384), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 73/165 (44%), Positives = 97/165 (58%), Gaps = 7/165 (4%)
Query: 576 VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAV 635
V + G N GT D+S Y W+YK+GL+GE L +Y+ N++ W+ + QPLTWYK
Sbjct: 326 VTLKGLNEGTRDMSKYKWSYKVGLRGEILNLYSVKGSNSVQWMKGSF--QKQPLTWYKTT 383
Query: 636 VKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDK 695
P G+EP+ LDM M KG W+NG IGRY+P + +C Y G F K
Sbjct: 384 FNTPAGNEPLALDMSSMSKGQIWVNGRSIGRYFPGYIARGK-----CNKCSYTGFFTEKK 438
Query: 696 CITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
C+ CG PSQ+WYHIPR W P+ N+L+I EE GG+P I+ R
Sbjct: 439 CLWNCGGPSQKWYHIPRDWLSPNGNLLIILEEIGGNPQGISLVKR 483
>gi|328872959|gb|EGG21326.1| glycoside hydrolase family 35 protein [Dictyostelium fasciculatum]
Length = 759
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 275/741 (37%), Positives = 392/741 (52%), Gaps = 82/741 (11%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V YD RSL ING R+L+IS +IHYPRS P MWP L++++K+ G+N IE+YVFWN H+ +
Sbjct: 46 VEYDQRSLKINGERKLMISGSIHYPRSTPSMWPSLIKKSKDAGINMIETYVFWNLHQPNN 105
Query: 88 GKYY-FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
+ Y F G N+ F+ + QQ +Y+ LRIGP+V AE+NYGGIP WL IPG VFR+ +
Sbjct: 106 SQEYNFEGNANITHFLDLCQQEGLYVHLRIGPYVCAEWNYGGIPSWLRNIPGIVFRDYNQ 165
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
P+ M +MT IV+ +K FAS GGPIILAQVENEYG+ E+ YG+ GK YA WA
Sbjct: 166 PWMTEMASWMTFIVNYLK--PYFASNGGPIILAQVENEYGWLENEYGDSGKLYAEWAISF 223
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHS----PSMPKIWTENWPGWFKTF 262
A + NIG+PW MCQQ D D INTCN FYC + + P+ P +TENW GW + +
Sbjct: 224 AKSLNIGIPWTMCQQNDI-DDAINTCNGFYCHDWIQYHFQVYPNQPAFFTENWAGWIQYY 282
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
PHRP+ED+ +SVAR+F +GGS+ NYYM+HGGT F R + F+T SYDY+A +DEY
Sbjct: 283 SEGVPHRPTEDLLYSVARWFSRGGSLMNYYMWHGGTTFARYS-STFLTNSYDYDAALDEY 341
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALL-NGER------SNLSLGSSQEADVY---ADSSGA 372
G PK+ L +LH + + LL +GE SN++ ++ E Y + +
Sbjct: 342 GYEAEPKYSALAQLHSVLSQYSYILLSSGEVARPVNISNITTCNTIEIIQYNTTINGTLE 401
Query: 373 CAAFLANMDDKNDKTVVF--RNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPE 430
F+ N + V + + WSV IL + + V+ +T+ V+ Q S
Sbjct: 402 TITFVTNFGVSSSAPVQLNWNGQTITVNPWSVLILYNNQTVI-DTSYVKQQYSA------ 454
Query: 431 NLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGF-VDHINTTKDTTDYLWYTTSII 489
E K + + E G+ ++ V + + ++ T D TDYL +I
Sbjct: 455 ---QKEFYQSKRVKNVLVSSWTEPIGVGNYSNVVTANLPSEQLDLTLDQTDYLCNADDMI 511
Query: 490 VNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNE 549
+ + + E Q + G+ H K I G ++
Sbjct: 512 -------------------------YIYIDGEYQSWSRGSPAHFVLDTKFGI----GTHK 542
Query: 550 IALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY-N 608
+++LS+T+GL + G +E G+ GT D++ W+ + L GE GI N
Sbjct: 543 LSILSLTMGLISYGSHFESYKRGLNGTVTL----GTQDITNNGWSMRPYLVGEMQGIQSN 598
Query: 609 PGYRNNINWVSTMEPPKNQPLTWYK--AVVKQPPGD-EPIGLDMLKMGKGLAWLNGEEIG 665
P +W E NQPLTWYK +++ D LDM+ M KG +NG IG
Sbjct: 599 PHLT---SWSINNELSINQPLTWYKLNLIIQSEIQDTSSFALDMIGMNKGFIIVNGNSIG 655
Query: 666 RYWPRKSRKSSPHDECVQECDYRGK-FNPDKCITGCGEPSQRWYHIPRS--WFKPSE-NI 721
RYW C C+Y G + C TGCGEPS+R+YH+P + +P++ N
Sbjct: 656 RYWLTLGWG------CGSGCNYTGDGYQGYLCRTGCGEPSERYYHVPNDYLYLEPNQLNE 709
Query: 722 LVIFEEKGGDPTKITFSIRKI 742
+++FEE GDP I R +
Sbjct: 710 IIVFEELSGDPNSIQLVQRYV 730
>gi|413922056|gb|AFW61988.1| hypothetical protein ZEAMMB73_453254 [Zea mays]
Length = 326
Score = 429 bits (1104), Expect = e-117, Method: Compositional matrix adjust.
Identities = 197/298 (66%), Positives = 236/298 (79%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD R+++ING+R ++IS +IHYPRS P MWPGL+Q+AK+GG++ +++YVFWNGHE
Sbjct: 28 VSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTYVFWNGHEPVR 87
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G+YYFG R++LV+F+K+ +QA +Y+ LRIGP+V AE+N+GG PVWL Y+PG FR D P
Sbjct: 88 GQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGP 147
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK MQ F+ IV MMK E LF QGGPIILAQVENEYG ES G G K YA WAAKMA
Sbjct: 148 FKAAMQAFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGAGAKPYANWAAKMA 207
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
VA GVPW+MC+Q D PDPVINTCN FYCD F+P+S S P +WTE W GWF FGG P
Sbjct: 208 VATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNSNSKPTMWTEAWTGWFTAFGGAVP 267
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
HRP ED+AF+VARF QKGGS NYYMYHGGTNF RT+GGPFI TSYDY+APIDEYG P
Sbjct: 268 HRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGRP 325
>gi|195615772|gb|ACG29716.1| beta-galactosidase precursor [Zea mays]
Length = 450
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 222/468 (47%), Positives = 280/468 (59%), Gaps = 21/468 (4%)
Query: 274 IAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHL 333
+AF+VARF QKGGS NYYMYHGGTNF RT+GGPFI TSYDY+APIDEYGL R PKWGHL
Sbjct: 1 MAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGLLRQPKWGHL 60
Query: 334 KELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNV 393
++LH AIK E AL++G+ + SLG+ ++A V+ S GACAAFL+N VVF
Sbjct: 61 RDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSSGGACAAFLSNYHTSAAARVVFNGR 120
Query: 394 SYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKE 453
Y LPAWS+S+LPDCK VFNTA V S+ M P + G WQ + E
Sbjct: 121 RYDLPAWSISVLPDCKAAVFNTATVSEPSAPARMSP-------------AGGFSWQSYSE 167
Query: 454 IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHA 513
F K G V+ ++ T D +DYLWYTT + +N NE+FLK+G P L + S GH+
Sbjct: 168 ATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQLTVYSAGHS 227
Query: 514 LHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGI 573
L F N + G+ G P Y + + G N+I++LS VGL N G YE G+
Sbjct: 228 LQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGTHYETWNVGV 287
Query: 574 TS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWY 632
V ++G N G DLS WTY+IGL GE LG+ + +++ W S QPLTW+
Sbjct: 288 LGPVTLSGLNEGKRDLSNQKWTYQIGLHGESLGVQSVAGSSSVEWGSA---AGKQPLTWH 344
Query: 633 KAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFN 692
KA P GD P+ LDM MGKG AW+NG IGRYW K+ S C Y G ++
Sbjct: 345 KAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYWSYKASSSG----GCGGCSYAGTYS 400
Query: 693 PDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
KC TGCG+ SQR+YH+PRSW PS N+LV+ EE GGD + R
Sbjct: 401 ETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVLLEEFGGDLPGVKLVTR 448
>gi|115480419|ref|NP_001063803.1| Os09g0539200 [Oryza sativa Japonica Group]
gi|113632036|dbj|BAF25717.1| Os09g0539200 [Oryza sativa Japonica Group]
Length = 446
Score = 423 bits (1088), Expect = e-115, Method: Compositional matrix adjust.
Identities = 207/393 (52%), Positives = 272/393 (69%), Gaps = 3/393 (0%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD RSL+I+G+R+L S AIHYPRS P MW LV+ AK GG+NTIE+YVFWNGHE P
Sbjct: 36 VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWNGHEPEP 95
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GKYYF GRF+L++F+ +I+ MY I+RIGPF+ AE+N+GG+P WL I +FR + EP
Sbjct: 96 GKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHIIFRANNEP 155
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK M+KF+ IV +K ++FA QGGPIIL+Q+ENEYG + G +Y WAA+MA
Sbjct: 156 FKREMEKFVRFIVQKLKDAEMFAPQGGPIILSQIENEYGNIKKDRKVEGDKYLEWAAEMA 215
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
++ IGVPW+MC+Q P VI TCN +C D +T + P++WTENW F+TFG +
Sbjct: 216 ISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDKNKPRLWTENWTAQFRTFGDQL 275
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPR 326
R +EDIA++V RFF KGG++ NYYMYHGGTNFGRT G ++ T Y EAP+DEYG+ +
Sbjct: 276 AQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMCK 334
Query: 327 NPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-DSSGACAAFLANMDDKND 385
PK+GHL++LH IK A L G++S LG EA Y C +FL+N + D
Sbjct: 335 EPKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSNNNTGED 394
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANV 418
TVVFR +++P+ SVSIL DCK VV+NT V
Sbjct: 395 GTVVFRGEKFYVPSRSVSILADCKTVVYNTKRV 427
>gi|328873276|gb|EGG21643.1| hypothetical protein DFA_01529 [Dictyostelium fasciculatum]
Length = 827
Score = 416 bits (1069), Expect = e-113, Method: Compositional matrix adjust.
Identities = 259/751 (34%), Positives = 391/751 (52%), Gaps = 51/751 (6%)
Query: 7 IAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQA 66
I+ F +L+ F + + V+YD+R++IING R+L+ SA+IHYPRS MWP ++++
Sbjct: 12 ISIFLILLIFPNYVL-SDKLTVSYDNRAIIINGERKLLYSASIHYPRSTRTMWPDILKRT 70
Query: 67 KEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNY 126
K G+NTIE+Y+FWN H+ +P Y F G ++ F+ + ++ ++I+R GP+V AE+N
Sbjct: 71 KAAGINTIETYIFWNLHQPTPDTYDFEGSSDVKHFLDLCKEEGFHVIVRFGPYVCAEWNN 130
Query: 127 GGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG 186
GG+P WL +PG V+R EPF M+K+M IV + +A GGPII+AQ+ENEYG
Sbjct: 131 GGLPSWLKAVPGIVYRTHNEPFMREMKKWMDYIVHYLS--DYYAPNGGPIIMAQIENEYG 188
Query: 187 YYESFYGE-GGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHS- 244
+ E Y E GG Y WA K+A + N G+PWIMCQQ +T VINTCN FYC + +
Sbjct: 189 WLEYEYREQGGPEYVDWAVKLAKSYNTGIPWIMCQQ-NTRSDVINTCNGFYCHDWLQYHQ 247
Query: 245 ---PSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG 301
P P +TE W GW + F P RP+ D+ +S ARF+ +GG + NYYM+HGGT FG
Sbjct: 248 RTFPDQPAFFTELWTGWPQYFEEGFPTRPTVDVLYSAARFYSRGGGMVNYYMWHGGTTFG 307
Query: 302 RTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALL---NGERSNLSLG 358
R PF+TTSYDY+AP+DEYG P+ PK+ L +LH ++ +L N +
Sbjct: 308 RFT-SPFLTTSYDYDAPLDEYGFPQEPKYSMLTKLHVTLEKYSSVILHDPNVPPPYVFPD 366
Query: 359 SSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANV 418
++ E Y + + FL N DD K V + + WSV I + ++VF+T +
Sbjct: 367 NTVEMIEYKKDAES-VVFLVNWDDTFAKQVDMNGKNVKINQWSVQIYYN-NELVFDTFEI 424
Query: 419 RAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEA-DFVKSGF-----VDHI 472
A + P ++ S D + + W E F+ +
Sbjct: 425 PANLTRPN--PPFKPIAKTSLDATAAATSRTGLVNLVSSWNEPFSFLTYNASSQTPTAQL 482
Query: 473 NTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTH 532
T D +DY+WY T I + + +E +L + + F + + G+
Sbjct: 483 KLTGDNSDYIWYETEIDLTKTDE--------ILYLYKSYDFSYVFVDGQFLYWHRGSPIQ 534
Query: 533 PPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYS 592
F K P+ GK+ + +L +G+ + G E G+T G+ +++
Sbjct: 535 AYFNGKFPV----GKHTLQILCAAMGVPSYGAHIEQHERGLTGDIFL----GSKNITDNG 586
Query: 593 WTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDE--PIGLDML 650
W + L GE LG++ + + W + +TWYK VK P ++ LD+
Sbjct: 587 WKMRPFLSGELLGLH--ASPSTVKWSPVSKGTAGSGVTWYKFNVKTPSFEDGPAFALDLK 644
Query: 651 KMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHI 710
M KGL ++NG IGRYW K C ++C+ G ++ C CGE SQR+YH+
Sbjct: 645 SMWKGLVFVNGNSIGRYWVAKGW-------CEEKCNQTGLYDNYGCRENCGESSQRYYHV 697
Query: 711 PRSWFK-PSENILVIFEEKGGDPTKITFSIR 740
P+ + K S+N ++IFEE GDP I R
Sbjct: 698 PKDFLKESSDNEVIIFEELQGDPYSIELVQR 728
>gi|19386854|dbj|BAB86232.1| putative beta-D-galactosidase [Oryza sativa Japonica Group]
Length = 774
Score = 416 bits (1068), Expect = e-113, Method: Compositional matrix adjust.
Identities = 191/287 (66%), Positives = 225/287 (78%), Gaps = 20/287 (6%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+VTYD RSLII+GRR L+IS +IHYPRSVP MWP LV +AK+GG + +E+YVFWNGHE +
Sbjct: 37 SVTYDHRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHEPA 96
Query: 87 PGK--------------------YYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNY 126
G+ YYF RF+LV+F KI++ A +YMILRIGPFVAAE+ +
Sbjct: 97 QGQVRAASPKFVMDLACSIRDKPYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWTF 156
Query: 127 GGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG 186
GG+PVWLHY PGTVFR + EPFK HM++F T IVDMMK+E+ FASQGG IILAQVENEYG
Sbjct: 157 GGVPVWLHYAPGTVFRTNNEPFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEYG 216
Query: 187 YYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPS 246
E YG G K YA+WAA MA+AQN GVPWIMCQQ+D PDPVINTCNSFYCDQF P+SP+
Sbjct: 217 DMEQAYGAGAKPYAMWAASMALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQFKPNSPT 276
Query: 247 MPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYM 293
PK WTENWPGWF+TFG +PHRP ED+AFSVARFF KGGS+ NYY+
Sbjct: 277 KPKFWTENWPGWFQTFGESNPHRPPEDVAFSVARFFGKGGSLQNYYV 323
Score = 413 bits (1061), Expect = e-112, Method: Compositional matrix adjust.
Identities = 211/388 (54%), Positives = 253/388 (65%), Gaps = 51/388 (13%)
Query: 356 SLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNT 415
SL + ADVY D SG C AFL+N+D + DK V F++ SY LPAWSVSILPDCK V FNT
Sbjct: 317 SLQNYYVADVYTDQSGGCVAFLSNVDSEKDKVVTFQSRSYDLPAWSVSILPDCKNVAFNT 376
Query: 416 ANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTT 475
A VR+Q+ ++MVP NL+ S+ W +F+E GIWG D V++GFVDHINTT
Sbjct: 377 AKVRSQTLMMDMVPANLESSKVD--------GWSIFREKYGIWGNIDLVRNGFVDHINTT 428
Query: 476 KDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPF 535
KD+TDYLWYTTS V+ + G VL IESKGHA+ AF N EL GSA GNG+ F
Sbjct: 429 KDSTDYLWYTTSFDVDGSH---LAGGNHVLHIESKGHAVQAFLNNELIGSAYGNGSKSNF 485
Query: 536 KYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTY 595
+ P++L+AGKN+++LLSMTVGLQN GP YEW GAGITSVKI+G + +DLS+ W Y
Sbjct: 486 SVEMPVNLRAGKNKLSLLSMTVGLQNGGPMYEWAGAGITSVKISGMENRIIDLSSNKWEY 545
Query: 596 KIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKG 655
K+ V P GD+P+GLDM MGKG
Sbjct: 546 KVN-------------------------------------VDVPQGDDPVGLDMQSMGKG 568
Query: 656 LAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWF 715
LAWLNG IGRYWPR S S D C CDYRG F+P+KC GCG+P+QRWYH+PRSWF
Sbjct: 569 LAWLNGNAIGRYWPRISPVS---DRCTSSCDYRGTFSPNKCRRGCGQPTQRWYHVPRSWF 625
Query: 716 KPSENILVIFEEKGGDPTKITFSIRKIS 743
PS N LVIFEEKGGDPTKITFS R ++
Sbjct: 626 HPSGNTLVIFEEKGGDPTKITFSRRTVA 653
>gi|3850659|emb|CAA10064.1| beta galactosidase [Carica papaya]
Length = 347
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 209/359 (58%), Positives = 242/359 (67%), Gaps = 14/359 (3%)
Query: 127 GGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG 186
GG PVWL Y+PG FR D EPFK MQKF IV MMK EKLF +QGGPIIL+Q+ENE+G
Sbjct: 1 GGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQTQGGPIILSQIENEFG 60
Query: 187 YYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPS 246
E G GK Y WAA+MAV + GVPWIMC+Q D PDPVI+TCN FYC+ F P+
Sbjct: 61 PVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDY 120
Query: 247 MPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
PK+WTE W GW+ FGG P RP+ED+AFSVARF Q GGS NYYMYHGGTNFGRTAGG
Sbjct: 121 KPKMWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQGGGSFLNYYMYHGGTNFGRTAGG 180
Query: 307 PFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVY 366
PF+ TSYDY+AP+DEYGLPR PKWGHL++LH AIK CE AL++ + S LGS+QEA V+
Sbjct: 181 PFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKSCESALVSVDPSVTKLGSNQEAHVF 240
Query: 367 ADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVE 426
S CAAFLAN D K V F Y LP WS+SILPDCK V+NTA V +QSS V+
Sbjct: 241 KSESD-CAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGSQSSQVQ 299
Query: 427 MVPENLQPSEASPDNGSKGLKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWY 484
M P + G WQ F +E G + IN T+DTTDYLWY
Sbjct: 300 MTPVH------------SGFPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWY 346
>gi|226532830|ref|NP_001140495.1| uncharacterized protein LOC100272556 precursor [Zea mays]
gi|194699714|gb|ACF83941.1| unknown [Zea mays]
gi|195659509|gb|ACG49222.1| hypothetical protein [Zea mays]
gi|414881558|tpg|DAA58689.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 346
Score = 411 bits (1056), Expect = e-112, Method: Compositional matrix adjust.
Identities = 186/295 (63%), Positives = 231/295 (78%)
Query: 29 TYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPG 88
TYD +++++NG+R +++S +IHYPRSVP MWP L+Q+AK+GG++ +++YVFWNGHE S
Sbjct: 30 TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89
Query: 89 KYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPF 148
+YYF GR++LV FIK+++QA +Y+ LRIGP+V AE+N+GG PVWL Y+PG FR D EPF
Sbjct: 90 QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPF 149
Query: 149 KYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAV 208
K MQ F T IVDMMK E LF QGGPIIL+Q+ENE+G E GE K YA WAA MAV
Sbjct: 150 KAEMQNFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 209
Query: 209 AQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPH 268
A N VPW+MC++ D PDP+INTCN FYCD F+P+ P P +WTE W W+ FG PH
Sbjct: 210 ALNTSVPWVMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPH 269
Query: 269 RPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYG 323
RP ED+A+ VA+F QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+APIDEYG
Sbjct: 270 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYG 324
>gi|218201568|gb|EEC83995.1| hypothetical protein OsI_30162 [Oryza sativa Indica Group]
Length = 1078
Score = 410 bits (1055), Expect = e-111, Method: Compositional matrix adjust.
Identities = 240/598 (40%), Positives = 322/598 (53%), Gaps = 86/598 (14%)
Query: 144 DTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
D + KY M++F+TLIV+ +K KLFASQGGPIILAQ+ENEY + E + E G +Y WA
Sbjct: 419 DCKTVKY-MKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWA 477
Query: 204 AKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIWTENWPGWFKT 261
AKMA+A N GVPWIMC+Q P VI TCN +C P P +WTENW ++
Sbjct: 478 AKMAIATNTGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPADKKKPLLWTENWTAQYRV 537
Query: 262 FGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDE 321
FG R +EDIAFSVARFF GG++ NYYMYHGGTNFGR G F+ Y EAP+DE
Sbjct: 538 FGDPPSQRSAEDIAFSVARFFSVGGTMANYYMYHGGTNFGRN-GAAFVMPRYYDEAPLDE 596
Query: 322 YGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVY-ADSSGACAAFLANM 380
+GL + PKWGHL++LH A++ C+ ALL G S LG EA V+ C AFL+N
Sbjct: 597 FGLYKEPKWGHLRDLHHALRHCKKALLWGNPSVQPLGKLYEARVFEMKEKNVCVAFLSNH 656
Query: 381 DDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSS--TVEMVPENLQPSEAS 438
+ K D TV FR Y + S+SIL DCK VVF+T +V +Q + T + +Q
Sbjct: 657 NTKEDGTVTFRGQKYFVARRSISILADCKTVVFSTQHVNSQHNQRTFHFADQTVQ----- 711
Query: 439 PDNGSKGLKWQVF-KEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFL 497
DN W+++ +E + + ++ N TKD TDYLWYTTS + ++
Sbjct: 712 -DN-----VWEMYSEEKIPRYSKTSIRTQRPLEQYNQTKDKTDYLWYTTSFRLETDDLPY 765
Query: 498 KNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTV 557
+ +PV L+G+ +G + F + + LK G N +A+LS T+
Sbjct: 766 RKEVKPV-----------------LEGAGTGRRSTRSFTMEKAMDLKVGVNHVAILSSTL 808
Query: 558 GLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINW 617
GL ++G + E AG+ +V I G N+GTLDL+T W + G
Sbjct: 809 GLMDSGSYLEHRMAGVYTVTIRGLNTGTLDLTTNGWGHVPG------------------- 849
Query: 618 VSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSP 677
NQPLTWY+ P G +P+ +D+ MGKG ++NGE +GRYW S
Sbjct: 850 ------KDNQPLTWYRRRFDPPSGTDPVVIDLTPMGKGFLFVNGEGLGRYW------VSY 897
Query: 678 HDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
H G+PSQ YH+PRS +P N L+ FEE+GG P I
Sbjct: 898 HH-------------------ALGKPSQYLYHVPRSLLRPKGNTLMFFEEEGGKPDAI 936
Score = 368 bits (944), Expect = 8e-99, Method: Compositional matrix adjust.
Identities = 196/423 (46%), Positives = 247/423 (58%), Gaps = 67/423 (15%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+TYD RSLII+G RE+ S +IHYPRS P WP L+ +AKEGG+N IESYVFWNGHE
Sbjct: 33 ITYDRRSLIIDGHREIFFSGSIHYPRSPPDTWPDLISKAKEGGLNVIESYVFWNGHEPEQ 92
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI-PVWLHYIPGTVFRNDTE 146
G Y F GR++L+KF K+IQ+ MY I+RIGPFV AE+N+G + + IP +FR + E
Sbjct: 93 GVYNFEGRYDLIKFFKLIQEKEMYAIVRIGPFVQAEWNHGFVCHIGSGEIPDIIFRTNNE 152
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK +M++F+TLIV+ +K KLFASQGGPIILAQ+ENEY + E + E G +Y WAAKM
Sbjct: 153 PFKKYMKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWAAKM 212
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF--TPHSPSMPKIWTENWPGWFKTFGG 264
A+A N GVPWIMC+Q P VI TCN +C P P +WTENW ++ FG
Sbjct: 213 AIATNTGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPADKKKPLLWTENWTAQYRVFGD 272
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYM------------------------------- 293
R +EDIAFSVARFF GG++ NYYM
Sbjct: 273 PPSQRSAEDIAFSVARFFSVGGTMANYYMVVLNSNSNLFLTKKRDEISDRTDTGGFTCVN 332
Query: 294 ---YHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNG 350
YHGGTNFGR G F+ Y EAP+DE+GL + PKWGHL++LH A++ C+ ALL G
Sbjct: 333 NQQYHGGTNFGRN-GAAFVMPRYYDEAPLDEFGLYKEPKWGHLRDLHHALRHCKKALLWG 391
Query: 351 ERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKK 410
S LG + R Y + S+SIL DCK
Sbjct: 392 NPSVQPLGK-----------------------------LTRGQKYFVARRSISILADCKT 422
Query: 411 VVF 413
V +
Sbjct: 423 VKY 425
>gi|238009746|gb|ACR35908.1| unknown [Zea mays]
Length = 346
Score = 409 bits (1052), Expect = e-111, Method: Compositional matrix adjust.
Identities = 185/295 (62%), Positives = 230/295 (77%)
Query: 29 TYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPG 88
TYD +++++NG+R +++S +IHYPRSVP MWP L+Q+AK+GG++ +++YVFWNGHE S
Sbjct: 30 TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89
Query: 89 KYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPF 148
+YYF GR++LV FIK+++QA +Y+ LRIGP+V AE+N+GG PVWL Y+PG R D EPF
Sbjct: 90 QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISLRTDNEPF 149
Query: 149 KYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAV 208
K MQ F T IVDMMK E LF QGGPIIL+Q+ENE+G E GE K YA WAA MAV
Sbjct: 150 KAEMQNFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 209
Query: 209 AQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPH 268
A N VPW+MC++ D PDP+INTCN FYCD F+P+ P P +WTE W W+ FG PH
Sbjct: 210 ALNTSVPWVMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPH 269
Query: 269 RPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYG 323
RP ED+A+ VA+F QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+APIDEYG
Sbjct: 270 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYG 324
>gi|14517399|gb|AAK62590.1| At2g32810/F24L7.5 [Arabidopsis thaliana]
gi|25090389|gb|AAN72290.1| At2g32810/F24L7.5 [Arabidopsis thaliana]
Length = 585
Score = 402 bits (1034), Expect = e-109, Method: Compositional matrix adjust.
Identities = 224/463 (48%), Positives = 284/463 (61%), Gaps = 30/463 (6%)
Query: 293 MYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGER 352
MY GGTNFGRT+GGPF TSYDY+AP+DEYGL PKWGHLK+LH AIKLCE AL+ +
Sbjct: 1 MYFGGTNFGRTSGGPFYITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAADA 60
Query: 353 SNL-SLGSSQEADVY---ADSSG-ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPD 407
LGS QEA +Y ++ G CAAFLAN+D+ V F SY LP WSVSILPD
Sbjct: 61 PQYRKLGSKQEAHIYHGDGETGGKVCAAFLANIDEHKSAHVKFNGQSYTLPPWSVSILPD 120
Query: 408 CKKVVFNTANVRAQSSTVEMVPENLQPSEAS---------PDNGSKGLK-WQVFKEIAGI 457
C+ V FNTA V AQ+S + E+ +PS S DN S K W KE GI
Sbjct: 121 CRHVAFNTAKVGAQTSVKTV--ESARPSLGSMSILQKVVRQDNVSYISKSWMALKEPIGI 178
Query: 458 WGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFL--KNGSRPVLLIESKGHALH 515
WGE +F G ++H+N TKD +DYLW+ T I V+E++ KNG + I+S L
Sbjct: 179 WGENNFTFQGLLEHLNVTKDRSDYLWHKTRISVSEDDISFWKKNGPNSTVSIDSMRDVLR 238
Query: 516 AFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGIT- 574
F N++L GS G+ K P+ G N++ LL+ TVGLQN G F E GAG
Sbjct: 239 VFVNKQLAGSIVGHWV----KAVQPVRFIQGNNDLLLLTQTVGLQNYGAFLEKDGAGFRG 294
Query: 575 SVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPL-TWYK 633
K+TGF +G LDLS SWTY++GL+GE IY + W ST+E + + WYK
Sbjct: 295 KAKLTGFKNGDLDLSKSSWTYQVGLKGEADKIYTVEHNEKAEW-STLETDASPSIFMWYK 353
Query: 634 AVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNP 693
P G +P+ L++ MG+G AW+NG+ IGRYW S+K D C + CDYRG +N
Sbjct: 354 TYFDPPAGTDPVVLNLESMGRGQAWVNGQHIGRYWNIISQK----DGCDRTCDYRGAYNS 409
Query: 694 DKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKIT 736
DKC T CG+P+Q YH+PRSW KPS N+LV+FEE GG+P KI+
Sbjct: 410 DKCTTNCGKPTQTRYHVPRSWLKPSSNLLVLFEETGGNPFKIS 452
>gi|414881559|tpg|DAA58690.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 342
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 184/295 (62%), Positives = 229/295 (77%), Gaps = 4/295 (1%)
Query: 29 TYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPG 88
TYD +++++NG+R +++S +IHYPRSVP MWP L+Q+AK+GG++ +++YVFWNGHE S
Sbjct: 30 TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89
Query: 89 KYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPF 148
+YYF GR++LV FIK+++QA +Y+ LRIGP+V AE+N+GG PVWL Y+PG FR D EPF
Sbjct: 90 QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPF 149
Query: 149 KYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAV 208
K F T IVDMMK E LF QGGPIIL+Q+ENE+G E GE K YA WAA MAV
Sbjct: 150 K----NFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 205
Query: 209 AQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPH 268
A N VPW+MC++ D PDP+INTCN FYCD F+P+ P P +WTE W W+ FG PH
Sbjct: 206 ALNTSVPWVMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPH 265
Query: 269 RPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYG 323
RP ED+A+ VA+F QKGGS NYYMYHGGTNFGRTAGGPFI TSYDY+APIDEYG
Sbjct: 266 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYG 320
>gi|297789001|ref|XP_002862517.1| hypothetical protein ARALYDRAFT_333310 [Arabidopsis lyrata subsp.
lyrata]
gi|297308086|gb|EFH38775.1| hypothetical protein ARALYDRAFT_333310 [Arabidopsis lyrata subsp.
lyrata]
Length = 534
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 200/421 (47%), Positives = 263/421 (62%), Gaps = 13/421 (3%)
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
GL R PKWGHL++LH AIKLCE AL+ + + SLGS+ EA VY +SG+CAAFLAN+
Sbjct: 9 GLLRQPKWGHLRDLHKAIKLCEDALIATDPTISSLGSNLEAAVYKTASGSCAAFLANVGT 68
Query: 383 KNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNG 442
K+D TV F SYHLPAWSVSILPDCK V FNTA + + + ++L+P S +
Sbjct: 69 KSDATVSFNGESYHLPAWSVSILPDCKNVAFNTAKINSATEPTAFARQSLKPDGGS--SA 126
Query: 443 SKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSR 502
G +W KE GI F+K G ++ INTT D +DYLWY+ + + +E FL GS+
Sbjct: 127 ELGSEWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRMDIKGDETFLDEGSK 186
Query: 503 PVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNA 562
VL IES G ++AF N +L GS G PI+L AGKN + LLS+TVGL N
Sbjct: 187 AVLHIESLGQVVYAFINGKLAGSGHG---KQKISLDIPINLVAGKNTVDLLSVTVGLANY 243
Query: 563 GPFYEWVGAGITS-VKITGFNSG-TLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVST 620
G F++ VGAGIT V + G ++DL++ WTY++GL+GE G+ G ++ WVS
Sbjct: 244 GAFFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGL---GAVDSSEWVSK 300
Query: 621 MEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDE 680
P QPL WYK P G EP+ +D KG+AW+NG+ IGRYWP + +
Sbjct: 301 SPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTVKGIAWVNGQSIGRYWP---TSIAGNGG 357
Query: 681 CVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
C CDYRG + +KC+ CG+PSQ YH+PRSW KPS N LV+FEE GGDPT+I+F +
Sbjct: 358 CTDSCDYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNTLVLFEEMGGDPTQISFGTK 417
Query: 741 K 741
+
Sbjct: 418 Q 418
>gi|34481809|emb|CAD44190.1| putative beta-galactosidase [Mangifera indica]
gi|34481811|emb|CAD44191.1| putative beta-galactosidase [Mangifera indica]
Length = 286
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 177/286 (61%), Positives = 213/286 (74%)
Query: 123 EYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVE 182
E+N+GG PVWL ++PG FR D EPFK MQ F IV MMK EKLF SQGGPIIL+Q+E
Sbjct: 1 EWNFGGFPVWLKFVPGISFRTDNEPFKRAMQNFTQKIVQMMKDEKLFESQGGPIILSQIE 60
Query: 183 NEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTP 242
NEY +G G+ Y WAA+MA N GVPW+MC+++D PDPVINTCN FYCD+F+P
Sbjct: 61 NEYEPERMKFGSAGEAYMNWAAQMATGLNTGVPWVMCKEYDAPDPVINTCNGFYCDKFSP 120
Query: 243 HSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGR 302
+ P PK+WTE W GWF FGG RP ED+AF+VARF Q GGS NYYMYHGGTNFGR
Sbjct: 121 NKPFKPKLWTEAWTGWFTEFGGPIYQRPVEDLAFAVARFIQAGGSFVNYYMYHGGTNFGR 180
Query: 303 TAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQE 362
TAGGPFITTSYDY+APIDEYGL R PK+ HLKELH A+KLCE ALL + +SLG+ ++
Sbjct: 181 TAGGPFITTSYDYDAPIDEYGLIRRPKYDHLKELHQAVKLCETALLYADPYVMSLGNYEQ 240
Query: 363 ADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDC 408
A V++ +SG CAAFL+N + K+ V F ++LP WS+SILPDC
Sbjct: 241 AHVFSSTSGGCAAFLSNFNSKSSARVTFNRKHFYLPPWSISILPDC 286
>gi|414881560|tpg|DAA58691.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 655
Score = 374 bits (959), Expect = e-100, Method: Compositional matrix adjust.
Identities = 196/440 (44%), Positives = 263/440 (59%), Gaps = 23/440 (5%)
Query: 306 GPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADV 365
G + Y + + GL R PKWGHLKELH AIKLCE AL+ G+ SLG++Q+A V
Sbjct: 132 GADVQMPYRLDHILVADGLLREPKWGHLKELHKAIKLCEPALVAGDPIVTSLGNAQQASV 191
Query: 366 YADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTV 425
+ S+ AC AFL N D + V F + Y LP WS+SILPDCK V+NTA+V +Q S +
Sbjct: 192 FRSSTDACVAFLENKDKVSYARVSFNGMHYDLPPWSISILPDCKTTVYNTASVGSQISQM 251
Query: 426 EMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYT 485
+M + G WQ + E G+ F G ++ IN T+D TDYLWYT
Sbjct: 252 KM-------------EWAGGFTWQSYNEDINSLGDESFATVGLLEQINVTRDNTDYLWYT 298
Query: 486 TSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKA 545
T + + ++E+FL NG P+L + S GHALH F N +L G+ G+ P Y + L +
Sbjct: 299 TYVDIAQDEQFLSNGKNPMLTVMSAGHALHIFVNGQLTGTVYGSVEDPKLTYSGNVKLWS 358
Query: 546 GKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHL 604
G N I+ LS+ VGL N G +E AGI V + G N G DL+ WTYK+GL+GE L
Sbjct: 359 GSNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNEGRRDLTWQKWTYKVGLKGEAL 418
Query: 605 GIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEI 664
+++ +++ W EP + QPL+WYKA P GDEP+ LDM MGKG W+NG+ I
Sbjct: 419 SLHSLSGSSSVEW---GEPVQKQPLSWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGI 475
Query: 665 GRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVI 724
GRYWP + CDYRG+++ KC T CG+ SQRWYH+PRSW P+ N+LVI
Sbjct: 476 GRYWPGYKASGT-----CGICDYRGEYDEKKCQTNCGDSSQRWYHVPRSWLNPTGNLLVI 530
Query: 725 FEEKGGDPTKITFSIRKISG 744
FEE GGDPT I+ +++I+G
Sbjct: 531 FEEWGGDPTGISM-VKRIAG 549
>gi|323371174|gb|ADX59436.1| beta-galactosidase [Coffea arabica]
Length = 338
Score = 369 bits (947), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 186/353 (52%), Positives = 228/353 (64%), Gaps = 28/353 (7%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
L++ ++++ G V+YD RSLII G+R+L+ S +IHYPRS P MWP L+ +AK GG+
Sbjct: 12 LMVMWTTTRGGVEGGQVSYDGRSLIIEGQRKLLFSGSIHYPRSTPDMWPSLISKAKHGGL 71
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
+ IE+YVFWN HE G+Y F GR N+V+FI+ IQ +Y +RIGPF+ AE+ YGG+P
Sbjct: 72 DVIETYVFWNLHEPRHGQYDFKGRHNIVRFIREIQAHGLYAFIRIGPFIEAEWTYGGLPF 131
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WLH +PG V+R+D EPFKYHMQ F T IV++ K E L+A QGGPIIL Q+ENEY E
Sbjct: 132 WLHDVPGIVYRSDNEPFKYHMQNFTTKIVNLFKSEGLYAPQGGPIILQQIENEYKNAERA 191
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQ--FTPHSPSMPK 249
+ E G Y WAA MAV GVPW+MC+Q D PDPVINTCN C + P+SP+ P
Sbjct: 192 FHEKGPPYVQWAAAMAVGLQTGVPWVMCKQDDAPDPVINTCNGRTCGETFVGPNSPNKPA 251
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
IWT+NW K GS NYYMYHGGTNFGRT G F+
Sbjct: 252 IWTDNWTS-------------------------LKNGSFVNYYMYHGGTNFGRT-GSAFV 285
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQE 362
TSY EAPIDEYGL R PKWGHLK+LH IK C LL+G S LG QE
Sbjct: 286 LTSYYDEAPIDEYGLIRQPKWGHLKQLHSVIKSCSQTLLHGVISVSPLGQQQE 338
>gi|359476803|ref|XP_003631891.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 11-like [Vitis
vinifera]
Length = 722
Score = 367 bits (943), Expect = 9e-99, Method: Compositional matrix adjust.
Identities = 232/626 (37%), Positives = 314/626 (50%), Gaps = 101/626 (16%)
Query: 118 PFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPII 177
P + + +GG+ V Y F N EP + HM++F +I+DMM +EK ASQGGPII
Sbjct: 88 PDIIXKARHGGLNVIHTY----AFWNLHEPVQDHMKRFTRMIIDMMSKEKXIASQGGPII 143
Query: 178 LAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYC 237
LA V++ + E G R WA MAV G+P +MC+Q D PDPVINTC C
Sbjct: 144 LALVDSAIAFKEM-----GTRCVHWAGTMAVGLKTGIPXVMCKQKDAPDPVINTCKGRNC 198
Query: 238 -DQFT-PHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYH 295
D FT P+ P+ + + + G ++ FG R +ED+AFS F K G++ NYYMY+
Sbjct: 199 GDTFTGPNRPNKRSV-SNHXLGMYRVFGDPPSQRAAEDLAFSX--FISKNGTLANYYMYY 255
Query: 296 GGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNL 355
TNFGRT F TT Y EAP+DEYGLPR KWGHL++LH A++L + ALL G S
Sbjct: 256 SVTNFGRTTSS-FATTCYYDEAPLDEYGLPRETKWGHLRDLHAALRLSKKALLWGVTSAQ 314
Query: 356 SLGSSQEADVYAD-SSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFN 414
LG EA +Y S CA FL N + T R Y+LP S+S LPDCK VVFN
Sbjct: 315 KLGEDLEARIYEKPGSNICATFLLNNITRTPTTTTLRGSKYYLPQHSISNLPDCKTVVFN 374
Query: 415 TANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINT 474
T V +Q S +K L+W + ++ + E V+ +
Sbjct: 375 TQTVVSQYSV------------------NKNLQWXMSQDALPTYEECPTKTKSPVELMTM 416
Query: 475 TKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQE-----LQGSASGN 529
TKDTTDYLWYTT+I + + V + + GH +HAF N E L G+ G+
Sbjct: 417 TKDTTDYLWYTTNIELARTGLPFRKDVLRVPQVSNLGHVMHAFLNGEYMEFYLTGTRHGS 476
Query: 530 GTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLS 589
F + PI+LKAG N+IA L TVGL ++G + E AG+ +V I G N+ T+DL
Sbjct: 477 NVEKSFVFNKPITLKAGLNQIAPLGATVGLPDSGSYMEHRLAGVHNVAIQGLNTRTIDLP 536
Query: 590 TYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDM 649
W +KA P GD P+ L++
Sbjct: 537 KNGWG-------------------------------------HKAYFDAPEGDVPVALEL 559
Query: 650 LKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYH 709
M KG+AW+NG+ I YW Y ++ G+PSQ YH
Sbjct: 560 STMAKGMAWINGKSIDXYW----------------VSY---------LSPLGKPSQSVYH 594
Query: 710 IPRSWFKPSENILVIFEEKGGDPTKI 735
+PR++ K S+N+LV+FEE G +P I
Sbjct: 595 VPRAFLKTSDNLLVLFEETGRNPDGI 620
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 33/57 (57%), Positives = 45/57 (78%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
V+YD R LI+NG+REL+ S +IHYPRS+P MWP ++ +A+ GG+N I +Y FWN HE
Sbjct: 56 VSYDGRPLIVNGKRELLFSGSIHYPRSIPEMWPDIIXKARHGGLNVIHTYAFWNLHE 112
>gi|281209972|gb|EFA84140.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
PN500]
Length = 707
Score = 367 bits (943), Expect = 9e-99, Method: Compositional matrix adjust.
Identities = 214/616 (34%), Positives = 329/616 (53%), Gaps = 43/616 (6%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD RSL+ING R+L +S ++HYPRS P +W ++ +K G+N I++YVFW+ HE
Sbjct: 108 VTYDGRSLLINGERKLFVSGSVHYPRSTPTIWKKVLALSKNSGINMIDTYVFWDLHEPQR 167
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G Y F G NL F+ + QQ +++ LRIGP++ AE+NYGG+P+WL IPG R+
Sbjct: 168 GVYNFEGNANLKHFLDLCQQNGLFVNLRIGPYICAEWNYGGLPIWLKDIPGIKMRDFNTQ 227
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
+ ++++M IVD + FA QGGPI+LAQ+ENEY + + Y E G+++A W A +A
Sbjct: 228 YMEEVERWMKFIVDYL--HGYFAPQGGPIVLAQIENEYNWVQWRYQESGRKFAHWCADLA 285
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFT----PHSPSMPKIWTENWPGWFKTFG 263
+IG+PWIMCQQ D P VINTCN +YC ++ + P ++TENW GWF +
Sbjct: 286 NRLDIGIPWIMCQQDDIP-TVINTCNGYYCHEWINFHWNNFKDQPPLFTENWSGWFNNWV 344
Query: 264 GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYG 323
HRP D+ +S AR+F GG++ NYYM+HGGTNFGR + GP I SYDY+AP++EYG
Sbjct: 345 NAVRHRPVADLLYSAARWFASGGALMNYYMWHGGTNFGRKS-GPMIALSYDYDAPLNEYG 403
Query: 324 LPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDK 383
PRNPK+ ++ + I E LL+ ++ + ++ + A+F+ N ++
Sbjct: 404 NPRNPKYSQTRDFNKLILSLEDILLSQYPPTPIFLANNISVIHYRNGNNSASFIINSNEN 463
Query: 384 NDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGS 443
+ V+F SY A+SV IL + V ++ N R + TV N+ + +
Sbjct: 464 GNSKVMFEGRSYFSYAYSVQILKNYVSVFDSSQNPRNYTDTVVESEPNIPFANSI----- 518
Query: 444 KGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
+ K + E + ++ +N TKD TDY+WYTT I +++ E LK
Sbjct: 519 ------ISKHVERFDFEESLYDNRLMEQLNLTKDETDYIWYTTMINHDQDGEILK----- 567
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
+ +K +H F + G+ + + G + + LL +G+Q+
Sbjct: 568 ---VINKTDIVHVFVDSYYVGTIMSDSLA-------ITGVPLGPSTLQLLHTKMGIQHYE 617
Query: 564 PFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEP 623
E AGI + G ++++ W K + E + I +P + W
Sbjct: 618 LHMENTKAGI----LGPVYYGDIEITNQMWGSKPFVSSEKV-ITDPIQSKFVRWSPLDRK 672
Query: 624 PK----NQPLTWYKAV 635
P + PLTWYK +
Sbjct: 673 PNEVFYSVPLTWYKFI 688
>gi|34481839|emb|CAD44519.1| putative beta-galactosidase [Carica papaya]
Length = 285
Score = 367 bits (943), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 176/286 (61%), Positives = 211/286 (73%), Gaps = 1/286 (0%)
Query: 123 EYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVE 182
E+N+GG PVWL Y+PG FR D PFK MQKF IV+MMK EKLF Q GPII++Q+E
Sbjct: 1 EWNFGGFPVWLKYVPGIQFRTDNGPFKAQMQKFTEKIVNMMKAEKLFEPQEGPIIMSQIE 60
Query: 183 NEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTP 242
NEYG E G GK Y WAA+MAV GVPWIMC+Q D PDP+I+TCN FYC+ F P
Sbjct: 61 NEYGPIEWEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQEDAPDPIIDTCNGFYCENFMP 120
Query: 243 HSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGR 302
++ PK++TE W GW+ FGG P+RP+ED+A+SVARF Q GS NYYMYHGGTNFGR
Sbjct: 121 NANYKPKMFTEAWTGWYTEFGGPVPYRPAEDMAYSVARFIQNRGSFINYYMYHGGTNFGR 180
Query: 303 TAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQE 362
TAGGPFI TSYDY+AP+DEYGL R PKWGHL++LH IKLCE +L++ + SLGS+QE
Sbjct: 181 TAGGPFIATSYDYDAPLDEYGLGREPKWGHLRDLHKTIKLCEPSLVSVDPKVTSLGSNQE 240
Query: 363 ADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDC 408
A V+ + +CAAFLAN D K V F+N+ Y LP WSVSILPDC
Sbjct: 241 AHVFWTKT-SCAAFLANYDLKYSVRVTFQNLPYDLPPWSVSILPDC 285
>gi|373853838|ref|ZP_09596637.1| glycoside hydrolase family 35 [Opitutaceae bacterium TAV5]
gi|372473365|gb|EHP33376.1| glycoside hydrolase family 35 [Opitutaceae bacterium TAV5]
Length = 744
Score = 365 bits (937), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 239/798 (29%), Positives = 370/798 (46%), Gaps = 138/798 (17%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
V++D R+L+++GRR L++S A+HYPRS P MWP +++ ++ G+NT+E+Y+FWN HE
Sbjct: 2 TVSFDHRALLLDGRRTLVLSGAVHYPRSTPAMWPRILRHMRQSGLNTVETYIFWNLHERR 61
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G F GR +LV+F ++ Q + +ILRIGP++ AE NYGG+P WL +P R D E
Sbjct: 62 RGVLDFSGRLDLVRFCRLAQAEGLNVILRIGPYICAETNYGGLPGWLRDVPDIRMRTDNE 121
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
FK +++ L+ ++++ L A GGP+ILAQ+ENEY + YGE G+RY W+ ++
Sbjct: 122 AFKREKARWVRLVAEVIR--PLCAPNGGPVILAQIENEYDNIAATYGEDGRRYLRWSVEL 179
Query: 207 AVAQNIGVPWIMC-----------QQFDTPDPVINTCNSFYCDQ-----FTPHSPSMPKI 250
A + +G+PW+ C + + T N+F + F H P P +
Sbjct: 180 AQSLGLGIPWVTCAAGRAAEAGEKDAVASAGDSLETLNAFRAHEIIGQHFREH-PEQPAL 238
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT 310
WTENW GW++T+GG P R E++A++ ARFF GGS NY+++HGGTNFGR G +T
Sbjct: 239 WTENWAGWYQTWGGVLPKREPEELAYATARFFAAGGSGVNYFLWHGGTNFGRD-GMYLLT 297
Query: 311 TSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSS 370
T+Y++ P+DEYGLP K HL L+ A+ C +L ER G + SS
Sbjct: 298 TAYEFGGPLDEYGLP-TTKARHLARLNKALAACADKILASERPRAITGERNGLLKFQYSS 356
Query: 371 GACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPE 430
G L W + + V N + S+ V V
Sbjct: 357 G-------------------------LTFWCDDVARTVRIVGKNGEVLYDSSARVAPVRR 391
Query: 431 NLQPSEASPDNGSKGLKWQVFKE-IAGIW---GEADFVKSGFVDHINTTKDTTDYLWYTT 486
+ S G + W E + W ++ ++ + TKD TDY WY T
Sbjct: 392 TWKAS------GVRFAPWGWRAEPLPAAWPAEAQSAVTARKPLEQLLLTKDETDYCWYET 445
Query: 487 SIIVNENEEFL--------------------KNGSRP---------------VLLIESKG 511
+I+V + + L + G RP L +
Sbjct: 446 AIVVEGSGDVLVAGRDGSPAGLERGALARVGRRGRRPSIAGLASEVPANTVNTLRLTRVA 505
Query: 512 HALHAFAN-----------QELQGSASGNGTHPPFKYK-NPISLKAGKNEIALLSMTVGL 559
+H F + +E +G F+ + + GK+ ++LL +GL
Sbjct: 506 DIVHVFIDGTFVATTPTPLRERRGKMDAGLFTQTFELDLKALRITPGKHRLSLLCCALGL 565
Query: 560 QNAGPFYEW-VGAGITSVKITGF------NSGTLDLSTYSWTYKIGLQGEHLGIYNPGYR 612
+W +G +++ G N L+ W ++ GL GE G +P
Sbjct: 566 IKG----DWMIGYENMALEKKGLWAPVFWNGKKLE---GEWRHQPGLLGERCGFADPAAG 618
Query: 613 NNINWVSTMEPP---KNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWP 669
+ + W + +PL W++ +P G P LD+ MGKG+AW+NG IGRYW
Sbjct: 619 SLLAWKTAKAATGRGARRPLRWWRTTFTRPKGHGPWALDLGGMGKGMAWINGHCIGRYW- 677
Query: 670 RKSRKSSPHDECVQECDYRGKFNP--DKCITGC--GEPSQRWYHIPRSWFKPS--ENILV 723
+ + D G + +T P+QR+YH+P W + + LV
Sbjct: 678 -----------LLADTDPMGPWMAWMKGSLTAAPSSGPTQRYYHVPDDWLRTDGGPDTLV 726
Query: 724 IFEEKGGDPTKITFSIRK 741
+FEE GGDP + R+
Sbjct: 727 LFEELGGDPATVRLVRRE 744
>gi|255563859|ref|XP_002522930.1| beta-galactosidase, putative [Ricinus communis]
gi|223537857|gb|EEF39473.1| beta-galactosidase, putative [Ricinus communis]
Length = 450
Score = 362 bits (930), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 207/493 (41%), Positives = 272/493 (55%), Gaps = 62/493 (12%)
Query: 181 VENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF 240
+ENEYG E+ + E G Y WAAKMAV GVPWIMC+Q D PDPVINTCN C +
Sbjct: 1 IENEYGNIEAAFHEKGSSYVHWAAKMAVDLQTGVPWIMCKQIDAPDPVINTCNGMKCGET 60
Query: 241 --TPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGT 298
P+SP+ P +WTENW +++ +GG R ++DIAF VA F K GS NYYMYHGGT
Sbjct: 61 FGGPNSPNKPSLWTENWTSFYQVYGGEPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGT 120
Query: 299 NFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLG 358
NFGRTA IT YD +AP+DEYGL R PKWGHLKELH IK C LL G ++NLS+G
Sbjct: 121 NFGRTAAAYVITGYYD-QAPLDEYGLIRQPKWGHLKELHAVIKSCSTTLLEGVQTNLSVG 179
Query: 359 SSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANV 418
Q+A ++ G C AFL N D N TV FRN S+ L S+SILPDC ++FNTA V
Sbjct: 180 QLQQAYMFEAQGGGCVAFLVNNDSVN-ATVGFRNKSFELLPKSISILPDCDNIIFNTAKV 238
Query: 419 RAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDT 478
A S N + + +S K W+ + ++ + ++ ++H+NTTKD
Sbjct: 239 NAGS--------NRRITTSS----KKLNTWEKYIDVIPNYSDSTIKSDTLLEHMNTTKDK 286
Query: 479 TDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGT-HPPFKY 537
+DYLWYT S N + ++P+L +ES H +AF N + GSA G+ PF
Sbjct: 287 SDYLWYTFSFQPN------LSCTKPLLHVESLAHVAYAFVNNKYSGSAHGSKNGKVPFIM 340
Query: 538 KNPISLKAG--KNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTY 595
+ PI L N I++LS+ VGL
Sbjct: 341 EVPIVLDDDGLSNNISILSVLVGLS----------------------------------- 365
Query: 596 KIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKG 655
+GL GE L +Y + + W S + QPLTW+K P G++P+ L++ M KG
Sbjct: 366 -VGLLGETLQLYGKEHLEMVKW-SKADISIAQPLTWFKLEFDTPKGNDPVVLNLATMSKG 423
Query: 656 LAWLNGEEIGRYW 668
AW+NG+ IGRYW
Sbjct: 424 EAWVNGQSIGRYW 436
>gi|357483613|ref|XP_003612093.1| Beta-galactosidase [Medicago truncatula]
gi|355513428|gb|AES95051.1| Beta-galactosidase [Medicago truncatula]
Length = 504
Score = 358 bits (920), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 182/404 (45%), Positives = 248/404 (61%), Gaps = 19/404 (4%)
Query: 342 LCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWS 401
+CE AL++ + SLG+ Q+A VY SG C+AFL+N D K+ V+F N+ Y+LP WS
Sbjct: 1 MCEKALISTDPVVTSLGNFQQAYVYTTESGDCSAFLSNYDSKSSARVMFNNMHYNLPPWS 60
Query: 402 VSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEA 461
VSILPDC+ VFNTA V Q+S ++M+P N S+ W+ F+E
Sbjct: 61 VSILPDCRNAVFNTAKVGVQTSQMQMLPTN-----------SERFSWESFEEDTSSSSAT 109
Query: 462 DFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQE 521
SG ++ IN T+DT+DYLWY TS+ V +E FL G P L+++S GHA+H F N
Sbjct: 110 TITASGLLEQINVTRDTSDYLWYITSVDVGSSESFLHGGKLPSLIVQSTGHAVHVFINGR 169
Query: 522 LQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITG 580
L GSA G F+Y ++L+AG N IALLS+ VGL N G +E GI V I G
Sbjct: 170 LSGSAYGTREDRRFRYTGDVNLRAGTNTIALLSVAVGLPNVGGHFETWNTGILGPVVIHG 229
Query: 581 FNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV-STMEPPKNQPLTWYKAVVKQP 639
+ G LDLS WTY++GL+GE + + +P +++ W+ S + +NQPLTW+K P
Sbjct: 230 LDKGKLDLSWQKWTYQVGLKGEAMNLASPDGISSVEWMQSAVVVQRNQPLTWHKTFFDAP 289
Query: 640 PGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITG 699
G+EP+ LDM MGKG W+NG IGRYW + S +C+Y G F P KC G
Sbjct: 290 EGEEPLALDMDGMGKGQIWINGISIGRYWTAIATGS------CNDCNYAGSFRPPKCQLG 343
Query: 700 CGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKIS 743
CG+P+QRWYH+PRSW K + N+LV+FEE GGDP+KI+ + R +S
Sbjct: 344 CGQPTQRWYHVPRSWLKQNHNLLVVFEELGGDPSKISLAKRSVS 387
>gi|391229102|ref|ZP_10265308.1| beta-galactosidase [Opitutaceae bacterium TAV1]
gi|391218763|gb|EIP97183.1| beta-galactosidase [Opitutaceae bacterium TAV1]
Length = 743
Score = 356 bits (914), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 242/798 (30%), Positives = 378/798 (47%), Gaps = 139/798 (17%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
V++D R+L+++GRR L++S A+HYPRS P MWP +++ ++ G+NT+E+Y+FWN HE
Sbjct: 2 TVSFDHRALLLDGRRTLVLSGAVHYPRSTPAMWPRILRHMRQSGLNTVETYIFWNLHERR 61
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G F GR +LV+F ++ Q + +ILRIGP++ AE NYGG+P WL +P R D E
Sbjct: 62 RGVLDFSGRLDLVRFCRLAQAEGLNVILRIGPYICAETNYGGLPGWLRDVPDIRMRTDNE 121
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
FK +++ L+ ++++ L A GGP+ILAQ+ENEY + YGE G+RY W+ ++
Sbjct: 122 AFKREKARWVRLVAEVIR--PLCAPNGGPVILAQIENEYDNIAATYGEDGRRYLRWSVEL 179
Query: 207 AVAQNIGVPWIMC-----------QQFDTPDPVINTCNSFYCDQ-----FTPHSPSMPKI 250
A + +G+PW+ C + + T N+F + F H P P +
Sbjct: 180 AQSLGLGIPWVTCAAGRAAEAGEKDAVASAGDSLETLNAFRAHEIIGQHFREH-PEQPAL 238
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT 310
WTENW GW++T+GG P R E++A++ ARFF GGS NY+++HGGTNFGR G +T
Sbjct: 239 WTENWAGWYQTWGGVLPKREPEELAYATARFFAAGGSGVNYFLWHGGTNFGRD-GMYLLT 297
Query: 311 TSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSS 370
T+Y++ P+DEYGLP K HL L+ A+ C LL ER + SS + + DS
Sbjct: 298 TAYEFGGPLDEYGLP-TTKARHLARLNAALAACAGELLASERPGVVEKSSGVVEYHYDS- 355
Query: 371 GACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPE 430
+ DD A +V I+ +V+++ S+V + P
Sbjct: 356 ----GLVFVCDDT---------------ARAVRIVKKSGEVLYD--------SSVRVAPV 388
Query: 431 NLQPSEASPDNGSKGLKWQVFKE-IAGIW---GEADFVKSGFVDHINTTKDTTDYLWYTT 486
A +G + W E + W ++ ++ + TKD TDY WY T
Sbjct: 389 R----RAWKSSGVRFAPWGWRAEPLPAAWPAEAQSAVTARKPLEQLLPTKDETDYCWYET 444
Query: 487 SIIVNENEEFL--------------------KNGSRP---------------VLLIESKG 511
+I+V + + L + G RP L +
Sbjct: 445 AIVVEGSGDVLVAGRDGSPAGLERGALARVGRRGRRPSIAGLASEVPANTVNTLRLTRVA 504
Query: 512 HALHAFAN-----------QELQGSASGNGTHPPFKYK-NPISLKAGKNEIALLSMTVGL 559
+H F + +E +G F+ + + GK+ ++LL +GL
Sbjct: 505 DIVHVFIDGTFVATTPTPLRERRGKMDAGLFTQTFELDLKALRITPGKHRLSLLCCALGL 564
Query: 560 QNAGPFYEW-VGAGITSVKITGF------NSGTLDLSTYSWTYKIGLQGEHLGIYNPGYR 612
+W +G +++ G N L+ W ++ GL GE G +P
Sbjct: 565 IKG----DWMIGYENMALEKKGLWAPVFWNGKKLE---GEWRHQPGLLGERCGFADPAAG 617
Query: 613 NNINWVSTMEPP---KNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWP 669
+ + W + +PL W++ +P G P LD+ MGKG W+NG IGRYW
Sbjct: 618 SLLAWKTAKAATGRGARRPLNWWRTTFTRPKGHGPWALDLGGMGKGFCWINGHCIGRYW- 676
Query: 670 RKSRKSSPHDECVQECDYRGKFNP--DKCITGC--GEPSQRWYHIPRSWFKPS--ENILV 723
+ + D G + +T G P+QR+YH+P W + + LV
Sbjct: 677 -----------LLPDTDPMGPWMAWMKGSLTAAPSGGPTQRYYHVPDDWLRTDGGPDTLV 725
Query: 724 IFEEKGGDPTKITFSIRK 741
+FEE GGDP + R+
Sbjct: 726 LFEELGGDPATVRLVRRE 743
>gi|62869849|gb|AAY18075.1| beta-galactosidase, partial [Carica papaya]
Length = 263
Score = 356 bits (913), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 171/264 (64%), Positives = 196/264 (74%), Gaps = 1/264 (0%)
Query: 143 NDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALW 202
D EPFK MQKF IV MMK E+LF SQGGPIIL+Q+ENE+G E G GK Y W
Sbjct: 1 TDNEPFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKW 60
Query: 203 AAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTF 262
AA+MAV N GVPWIMC+Q D PDPVI+TCN FYC+ FTP+ PK+WTE W GW+ F
Sbjct: 61 AARMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCENFTPNKNYKPKMWTEVWTGWYTEF 120
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
GG P RP+ED+AFS+AR QKGGS NYYMYHGGTNFGRTAGGPF+ TSYDY+AP+DEY
Sbjct: 121 GGAVPTRPAEDLAFSIARLIQKGGSFVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEY 180
Query: 323 GLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDD 382
GLPR PKWGHL++LH AIK E AL++ E S SLG+SQEA V+ SG CAAFLAN D
Sbjct: 181 GLPREPKWGHLRDLHKAIKSSESALVSAEPSVTSLGNSQEAHVFKSKSG-CAAFLANYDT 239
Query: 383 KNDKTVVFRNVSYHLPAWSVSILP 406
K+ V F N Y LP WS+SILP
Sbjct: 240 KSSAKVSFGNGQYELPPWSISILP 263
>gi|62869847|gb|AAY18074.1| beta-galactosidase [Carica papaya]
Length = 263
Score = 356 bits (913), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 171/263 (65%), Positives = 195/263 (74%), Gaps = 1/263 (0%)
Query: 144 DTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
D EPFK MQKF IV MMK E+LF SQGGPIIL+Q+ENE+G E G GK Y WA
Sbjct: 2 DNEPFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKWA 61
Query: 204 AKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFG 263
A+MAV N GVPWIMC+Q D PDPVI+TCN FYC+ FTP+ PK+WTE W GW+ FG
Sbjct: 62 ARMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCENFTPNKNYKPKMWTEVWTGWYTEFG 121
Query: 264 GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYG 323
G P RP+ED+AFS+ARF QKGGS NYYMYHGGTNFGRTAGGPF+ TSYDY+AP+DEYG
Sbjct: 122 GAVPTRPAEDLAFSIARFIQKGGSSVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYG 181
Query: 324 LPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDK 383
LPR PKWGHL+ LH AIK E AL++ E S SLG+SQEA + SG CAAFLAN D K
Sbjct: 182 LPREPKWGHLRNLHKAIKSSESALVSAEPSVTSLGNSQEAHAFKSKSG-CAAFLANYDTK 240
Query: 384 NDKTVVFRNVSYHLPAWSVSILP 406
+ V F N Y LP WS+SILP
Sbjct: 241 SSAKVSFGNGQYELPPWSISILP 263
>gi|217075793|gb|ACJ86256.1| unknown [Medicago truncatula]
Length = 268
Score = 350 bits (899), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 156/263 (59%), Positives = 201/263 (76%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
F +++ + F NV YD R+L+I+G+R ++IS +IHYPRS P MWP L+Q++K+G
Sbjct: 4 FEIVLVLLWFLPKMFCTNVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSKDG 63
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G++ IE+YVFWN HE G+Y F GR +LVKF+K + +A +Y+ LRIGP+V AE+NYGG
Sbjct: 64 GLDVIETYVFWNLHEPVKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCAEWNYGGF 123
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
P+WLH+IPG FR D EPFK M++F IVD+MK+EKL+ASQGGPIIL+Q+ENEYG +
Sbjct: 124 PLWLHFIPGIKFRTDNEPFKAEMKRFTAKIVDLMKQEKLYASQGGPIILSQIENEYGNID 183
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPK 249
S YG GK Y WAAKMA + + GVPW+MCQQ D PDP+INTCN FYCDQFTP+S + PK
Sbjct: 184 SHYGSAGKSYINWAAKMATSLDTGVPWVMCQQGDAPDPIINTCNGFYCDQFTPNSNTKPK 243
Query: 250 IWTENWPGWFKTFGGRDPHRPSE 272
+WTENW GWF +FGG PHRP E
Sbjct: 244 MWTENWSGWFLSFGGAVPHRPVE 266
>gi|414590082|tpg|DAA40653.1| TPA: hypothetical protein ZEAMMB73_851266 [Zea mays]
Length = 580
Score = 350 bits (898), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 196/488 (40%), Positives = 272/488 (55%), Gaps = 41/488 (8%)
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTENW F+ +G + R +EDIA++V RFF KGGS+ NYYMYHGGTNFGRT G ++
Sbjct: 2 LWTENWTQQFRAYGDQVAMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYV 60
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-D 368
T Y EAP+DEYG+ + PK+GHL++LH I+ + A L G+ S+ LG EA ++
Sbjct: 61 LTGYYDEAPMDEYGMYKEPKFGHLRDLHNVIRSYQKAFLWGQHSSEILGHGYEAHIFELP 120
Query: 369 SSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMV 428
C +FL+N + D TV+FR +++P+ SVSIL CK VV+NT V Q S
Sbjct: 121 EEKLCLSFLSNNNTGEDGTVIFRGDKHYVPSRSVSILAGCKNVVYNTKRVFVQHS----- 175
Query: 429 PENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
+ S + D SK +W++F E + + ++ N TKD TDYLWYTTS
Sbjct: 176 ----ERSFHTSDVTSKNNQWEMFSETIPKYRDTKVRTKEPLEQYNQTKDDTDYLWYTTSF 231
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
+ ++ +N RPVL ++S HA+ FAN G A GN F ++ P+ LK G N
Sbjct: 232 RLESDDLPFRNDIRPVLQVKSSAHAMMGFANDAFVGCARGNKQVKGFMFEKPVDLKVGVN 291
Query: 549 EIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYN 608
+ LLS T+G++++G V GI I G N+GTLDL W +K L+GE+ IY+
Sbjct: 292 HVVLLSSTMGMKDSGGELAEVKGGIQECLIQGLNTGTLDLQVNGWGHKAALEGEYKEIYS 351
Query: 609 PGYRNNINWVSTMEPPKN-QPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+ W +P +N + TWYK +P GD+P+ LDM M KG+ ++NGE +GRY
Sbjct: 352 EKGLGKVQW----KPAENDRAATWYKRYFDEPDGDDPVVLDMSSMSKGMIFVNGEGVGRY 407
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
W YR T G PSQ YHIPR + K +N+LVIFEE
Sbjct: 408 W----------------VSYR---------TLAGTPSQAVYHIPRPFLKSKDNLLVIFEE 442
Query: 728 KGGDPTKI 735
+ G P I
Sbjct: 443 EMGKPDGI 450
>gi|293331757|ref|NP_001169479.1| uncharacterized protein LOC100383352 [Zea mays]
gi|224029591|gb|ACN33871.1| unknown [Zea mays]
Length = 580
Score = 347 bits (890), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 195/488 (39%), Positives = 271/488 (55%), Gaps = 41/488 (8%)
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+WTENW F+ +G + R +EDIA++V RFF KGGS+ NYYMYHGGTNFGRT G ++
Sbjct: 2 LWTENWTQQFRAYGDQVAMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYV 60
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA-D 368
T Y EAP+DEYG+ + PK+GHL++LH I+ + A L G+ S+ LG EA ++
Sbjct: 61 LTGYYDEAPMDEYGMYKEPKFGHLRDLHNVIRSYQKAFLWGQHSSEILGHGYEAHIFELP 120
Query: 369 SSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMV 428
C +FL+N + D TV+FR +++P+ SVSIL CK VV+NT V Q S
Sbjct: 121 EEKLCLSFLSNNNTGEDGTVIFRGDKHYVPSRSVSILAGCKNVVYNTKRVFVQHS----- 175
Query: 429 PENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSI 488
+ S + D SK +W++ E + + ++ N TKD TDYLWYTTS
Sbjct: 176 ----ERSFHTSDVTSKNNQWEMSSETIPKYRDTKVRTKEPLEQYNQTKDDTDYLWYTTSF 231
Query: 489 IVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKN 548
+ ++ +N RPVL ++S HA+ FAN G A GN F ++ P+ LK G N
Sbjct: 232 RLESDDLPFRNDIRPVLQVKSSAHAMMGFANDAFVGCARGNKQVKGFMFEKPVDLKVGVN 291
Query: 549 EIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYN 608
+ LLS T+G++++G V GI I G N+GTLDL W +K L+GE+ IY+
Sbjct: 292 HVVLLSSTMGMKDSGGELAEVKGGIQECLIQGLNTGTLDLQVNGWGHKAALEGEYKEIYS 351
Query: 609 PGYRNNINWVSTMEPPKN-QPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRY 667
+ W +P +N + TWYK +P GD+P+ LDM M KG+ ++NGE +GRY
Sbjct: 352 EKGLGKVQW----KPAENDRAATWYKRYFDEPDGDDPVVLDMSSMSKGMIFVNGEGVGRY 407
Query: 668 WPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEE 727
W YR T G PSQ YHIPR + K +N+LVIFEE
Sbjct: 408 W----------------VSYR---------TLAGTPSQAVYHIPRPFLKSKDNLLVIFEE 442
Query: 728 KGGDPTKI 735
+ G P I
Sbjct: 443 EMGKPDGI 450
>gi|188501572|gb|ACD54699.1| beta-D-galactosidase [Adineta vaga]
Length = 735
Score = 340 bits (872), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 244/778 (31%), Positives = 382/778 (49%), Gaps = 93/778 (11%)
Query: 7 IAPFALLIF---FSSSITY----CFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMW 59
I F +LIF F+ +TY +V+YD R++ ING R L+ S IHYPRS P MW
Sbjct: 6 IVFFTVLIFINTFAYPVTYDQVRGIPYHVSYDHRAITINGNRTLLFSGVIHYPRSTPAMW 65
Query: 60 PGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPF 119
P L+ +AKE G+NTI++YVFWN HE G Y F GR NL F++ A +++ LR+GP+
Sbjct: 66 PYLMSKAKEQGLNTIQTYVFWNMHEQKRGTYDFSGRANLSLFLQEAANAGLFVNLRLGPY 125
Query: 120 VAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILA 179
V AE++YG +PVWL+ IP FR+ + +K M++F++ I+ + + A GGPIILA
Sbjct: 126 VCAEWDYGALPVWLNNIPNIAFRSSNDAWKSEMKRFLSDII--VYVDGFLAKNGGPIILA 183
Query: 180 QVENEYGYYESFYGEGGKR-YALWAAKMAVAQ--NIGVPWIMCQQFDTPDPVINTCNSFY 236
Q+ENEYG G R Y W + + +PWIMC + I TCN
Sbjct: 184 QIENEYG--------GNDRAYVDWCGSLVSNDFASTQIPWIMCNGL-AANSTIETCNGCN 234
Query: 237 C------DQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHN 290
C D+ P+ P ++TENW GWF+ +G R ED+A+SVA +F GG+ H
Sbjct: 235 CFDDGWMDRHRRTYPNQPLLFTENW-GWFQGWGEGLGIRTPEDLAYSVAEWFANGGAYHA 293
Query: 291 YYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNG 350
YYM+HGG ++GRT GG +TT+Y + + G P PK+ HL L + LL+
Sbjct: 294 YYMWHGGNHYGRT-GGSGLTTAYSDDVILRADGTPNEPKFTHLNRLQRLLASQAQVLLSQ 352
Query: 351 ERSNL----------SLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAW 400
+ + L S+G+ Q Y S F+ N V+F + +
Sbjct: 353 DSARLPIPYWDGKQWSVGTQQMVYSYPPS----IQFVIN-QAAFSLFVLFNKQNISIAGQ 407
Query: 401 SVSILPDCKKVVFNTANVRAQ-SSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWG 459
SV I + + +++N+A+V + +VP + P L WQV+ E +
Sbjct: 408 SVQIYDNNEHLLWNSADVSGIFRNNTFLVPIVVGP-----------LDWQVYSE-PFLSD 455
Query: 460 EADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFAN 519
V S ++ +N T D T YLWY ++ +++ + V + + ++L F +
Sbjct: 456 LPVIVASTPLEQLNLTNDETIYLWYRRNVSLSQ-----PSAQTIVQVQTRRANSLIFFMD 510
Query: 520 QELQGSASGNGTHPPFKYKNPISLKAGK---NE---IALLSMTVGLQ--NAGP-FYEWVG 570
++ G + +H I+L + N+ +LS+++G+ N GP +E+ G
Sbjct: 511 RQFVGYFDDH-SHAQGTINVNITLNLSQFLPNQQYLFEILSVSLGIDNFNIGPGSFEYKG 569
Query: 571 AGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLT 630
+ +V + G + W ++ GL GE IY + W N+ +T
Sbjct: 570 I-VGNVSLGG--QSLVGDEASIWEHQKGLFGEAYQIYTEQGSKTVEWNPRWTTAINKSVT 626
Query: 631 WYKA------VVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
W++ +V++ P+ LD + +G A++NG +IG YW + + C+Q
Sbjct: 627 WFQTRFDLNHLVREDLNANPVLLDAFGLNRGHAFVNGNDIGLYWLIEGTCQNKLCCCLQN 686
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
T C +PSQR+YHIP W KP+ N+L +FEE G K +++I
Sbjct: 687 Q------------TNCQQPSQRYYHIPSDWLKPTNNLLTVFEEIGASSPKSVGLVQRI 732
>gi|348687417|gb|EGZ27231.1| hypothetical protein PHYSODRAFT_553859 [Phytophthora sojae]
Length = 825
Score = 339 bits (869), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 231/759 (30%), Positives = 370/759 (48%), Gaps = 102/759 (13%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+V+Y +R I+GRR L++ +IHYPRS G W L++ AK G+N IE YVFWN HE
Sbjct: 86 SVSYSARGFEIDGRRTLLLGGSIHYPRSSEGEWETLLRAAKRDGLNHIEMYVFWNLHEQE 145
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G + F G N +F ++ + +++ +R GP+V AE++ GG+P+WL++IPG R+
Sbjct: 146 RGVFNFAGNANATRFYELAAEVGLFLHVRFGPYVCAEWSNGGLPLWLNWIPGMKVRSSNA 205
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
P+++ M++F+T +V++ + A GGPII+AQ+ENE+ ++ Y E W +
Sbjct: 206 PWQWEMERFVTYMVELSR--PFLAKNGGPIIMAQIENEFAMHDPEYVE-------WCGDL 256
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF----TPHSPSMPKIWTENWPGWFKTF 262
+ +PW+MC + + I +CN C F PS P +WTE+ GWF+T+
Sbjct: 257 VKRLDTSIPWVMCYA-NAAENTILSCNGNDCVDFAVKHVKERPSDPLVWTED-EGWFQTW 314
Query: 263 G--GRDP----HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYE 316
++P R +ED+A++VAR+F GG+ HNYYMYHGG NFGR A +TT Y
Sbjct: 315 AKDKKNPLPNDQRTAEDMAYAVARWFAVGGAAHNYYMYHGGNNFGRAASAG-VTTKYADG 373
Query: 317 APIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNL-------SLGSSQEAD----- 364
+ GL PK HL++LH A+ C L+ +R L + G + EA
Sbjct: 374 VNLHSDGLSNEPKRSHLRKLHEALIDCNDILMRNDRQLLHPHELAPTHGETAEASSLQQR 433
Query: 365 --VYADSSGA-CAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQ 421
+Y G AFL N DK TVVFR+ Y L S+ I+ D ++FNTA+VR
Sbjct: 434 AFIYGAEDGPNQVAFLENQADKK-VTVVFRDNKYELAPTSMMIIKD-GALLFNTADVR-- 489
Query: 422 SSTVEMVPENLQPSEASPDNGSKGLKWQVFKE--IAGIWGEADFVKSGFVDHINTTKDTT 479
+ P + + +P + L+W+ + E ++ + V V+ + T D +
Sbjct: 490 ----KSFPGTVHRA-YTPIVQAATLQWETWSELNVSSLTPRRRVVAERPVEQLRLTADRS 544
Query: 480 DYLWYTTSIIVNENEE--FLKNGSRPVLLIESKGHALHAFANQELQGSAS----GNGTHP 533
DYL Y T+ V+ + + + + V + + ++ AF + L G + G
Sbjct: 545 DYLTYETTFTVDPADTPIDIDSDASTVKVTSCEASSIIAFVDGWLIGERNLAYPGGNCSK 604
Query: 534 PFKYKNPISLKAGK-NEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYS 592
F++ P ++ + + + L+S+++G+ + G + G V G +
Sbjct: 605 EFRFSLPTNIDVTRQHSLKLVSVSLGIYSLGSNHTKGLTGKVRVGRKNLAKG------HQ 658
Query: 593 WTYKIGLQGEHLGIYNPGYRNNINW--VSTMEPPKNQPLTWYKAVVKQP---------PG 641
W L GE L IY P + +++ W V + Q ++WY P P
Sbjct: 659 WEMYPTLVGEQLEIYRPEWLSSVPWTPVPRVVASGRQLMSWYWTSFSYPAFELPAEADPV 718
Query: 642 DEP--IGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITG 699
EP I LD + + +G A++NG ++GRYW +
Sbjct: 719 SEPFSILLDCIGLTRGRAYINGHDLGRYW---------------------------LVND 751
Query: 700 CGEPSQRWYHIPRSWF-KPSENILVIFEEKGGDPTKITF 737
GE QR+YH+PR W K N+LV+F+E GG +
Sbjct: 752 EGEFVQRYYHVPRDWLVKDQANVLVVFDELGGSVADVRL 790
>gi|356503083|ref|XP_003520341.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 15-like [Glycine
max]
Length = 482
Score = 338 bits (868), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 158/320 (49%), Positives = 207/320 (64%), Gaps = 5/320 (1%)
Query: 23 CFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNG 82
CFA V+YD+ S IIN + +I S +HYP S +WP + ++ K GG++ IESY+FW+
Sbjct: 4 CFATEVSYDAHSHIINEEKHIIFSGVVHYPXSTVDLWPAIFKRXKYGGLDAIESYIFWDR 63
Query: 83 HELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFR 142
HE +Y G + + F+K+IQ+A +Y ILRIGP+V +N+GG +WLH +P R
Sbjct: 64 HEPVRREYDCSGNLDFIDFLKLIQEAELYFILRIGPYVCEXWNFGGFSLWLHNMPEIELR 123
Query: 143 NDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALW 202
D K MQ F T IV+M K KLFA GGPIIL +ENEYG + Y E K Y W
Sbjct: 124 IDNPIXKNEMQIFTTKIVNMAKEAKLFAPXGGPIILTPIENEYGNIMTDYREARKPYIKW 183
Query: 203 AAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTF 262
A+MA+ QNIGVPWIMC D P P+INTCN YCD F P++P K++ F+ +
Sbjct: 184 CAQMALTQNIGVPWIMCXXRDAPQPMINTCNGHYCDSFXPNNPKSSKMFRX-----FQKW 238
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEY 322
G R PH+ +E+ FSVARFFQ GG ++NYYMYHGGTNFG GGP++T SY+Y+AP+DEY
Sbjct: 239 GERVPHKSAEESTFSVARFFQSGGILNNYYMYHGGTNFGHMVGGPYMTASYEYDAPLDEY 298
Query: 323 GLPRNPKWGHLKELHGAIKL 342
G PKW H K+LH +
Sbjct: 299 GNLNKPKWEHFKQLHKELTF 318
>gi|227204157|dbj|BAH56930.1| AT4G35010 [Arabidopsis thaliana]
Length = 377
Score = 337 bits (865), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 156/292 (53%), Positives = 209/292 (71%), Gaps = 3/292 (1%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD SLII+G+REL+ S +IHYPRS P MWP ++++AK+GG+NTI++YVFWN HE
Sbjct: 41 VTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHEPQQ 100
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GK+ F GR +LVKFIK+IQ+ MY+ LR+GPF+ AE+ +GG+P WL +PG FR D +
Sbjct: 101 GKFNFSGRADLVKFIKLIQKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNKQ 160
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK H ++++ +I+D MK E+LFASQGGPIIL Q+ENEY + Y + G Y WA+ +
Sbjct: 161 FKEHTERYVRMILDKMKEERLFASQGGPIILGQIENEYSAVQRAYKQDGLNYIKWASNLV 220
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFT-PHSPSMPKIWTENWPGWFKTFGGR 265
+ +G+PW+MC+Q D PDP+IN CN +C D F P+ + P +WTENW F+ FG
Sbjct: 221 DSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNRENKPSLWTENWTTQFRVFGDP 280
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEA 317
R EDIA+SVARFF K G+ NYYMYHGGTNFGRT+ ++TT Y +A
Sbjct: 281 PTQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYEDA 331
>gi|281202334|gb|EFA76539.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
PN500]
Length = 611
Score = 336 bits (862), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 216/605 (35%), Positives = 311/605 (51%), Gaps = 63/605 (10%)
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
M+ +M I ++R FA+ GGPII++QVENEYG+ + YGE G +YA W+A++A + N
Sbjct: 1 MESWMRFITKYLERH--FAANGGPIIMSQVENEYGWVQERYGESGTKYAQWSARLAQSLN 58
Query: 212 IGVPWIMCQQFDTPDPVINTCNSFYCDQFT----PHSPSMPKIWTENWPGWFKTFGGRDP 267
+GVPWIMCQQ D D VINTCN FYC + P+ P +TENWPGWF+ + P
Sbjct: 59 VGVPWIMCQQ-DDIDSVINTCNGFYCHDWIEGHWARYPNQPAFFTENWPGWFQQWKQSTP 117
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRN 327
HRP ED+ ++V +F +GGS+ NYYM+HGGTNFGRT+ P + SYDY+A +DEYG P
Sbjct: 118 HRPVEDVLYAVGNWFARGGSLMNYYMWHGGTNFGRTS-SPMVVNSYDYDAALDEYGNPSE 176
Query: 328 PKWGHLKELHGAIKLCEHALLNG---ERSNLSLGSSQEADVYADSSGACAAFLANMDDKN 384
PK+ H + + ++ H LN RS GSS + + G +FL N +
Sbjct: 177 PKYSHAAKFNNLLQKYSHIFLNAPEIPRSEYLGGSS--SIYHYTFGGESLSFLINNHESA 234
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
+V+ ++ + WSV +L +N V ++T E+ + SP N
Sbjct: 235 LNDIVWNGQNHIIKPWSVHLL-------YNNHTVFDSAATPEVSKLAMTSKRFSPVNSFN 287
Query: 445 GLKWQVFKEIAGIWGEADFVKSGF----VDHINTTKDTTDYLWYTTSI--IVNENEEFLK 498
+ E E D S + ++ ++ T D TDYLWY T I V E F
Sbjct: 288 NAYISQWVE------EIDMTDSTWSSKPLEQLSLTHDKTDYLWYVTEINLQVRGAEVFTT 341
Query: 499 NGSRPVLLIESKGHALHAFANQELQGSA-SGNGTHPPFKYKNPISLKAGKNEIALLSMTV 557
N S LHA+ + + Q + S N PF K+ I L G +++ +L+ +
Sbjct: 342 NVSD----------VLHAYIDGKYQSTIWSAN----PFNIKSDIPL--GWHKLQILNSKL 385
Query: 558 GLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINW 617
G+Q+ E V G+ + G D++ W+ K + GE L IYNP ++W
Sbjct: 386 GVQHYTVDMEKVTGGL----LGNIWVGGTDITNNGWSMKPYVNGERLAIYNPNNIFKVDW 441
Query: 618 VSTMEPPKNQPLTWYKA-VVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSS 676
S QPLTWYK + + ++ L+M M KG+ WLNG+ + RYW K +
Sbjct: 442 SSF--SGVQQPLTWYKINFLHELSPNKHYSLNMSGMNKGMIWLNGKHVARYWITKGWGCN 499
Query: 677 PHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKIT 736
C Y+G + C T CGEPSQ YH+P+ W N+LVIFEE GG+P I
Sbjct: 500 G-------CSYQGGYTDQLCSTNCGEPSQINYHLPQDWLIEGANLLVIFEEVGGNPKSIK 552
Query: 737 FSIRK 741
++
Sbjct: 553 LEEKE 557
>gi|217070894|gb|ACJ83807.1| unknown [Medicago truncatula]
Length = 283
Score = 336 bits (861), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 167/287 (58%), Positives = 198/287 (68%), Gaps = 4/287 (1%)
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGR 265
MA + + GVPWIMCQQ + PDP+INTCNSFYCDQFTP+S + PK+WTENW GWF FGG
Sbjct: 1 MATSLDTGVPWIMCQQANAPDPIINTCNSFYCDQFTPNSDNKPKMWTENWSGWFLAFGGA 60
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
P+RP ED+AF+VARFFQ+GG+ NYYMYHGGTNFGRT GGPFI+TSYDY+APIDEYG
Sbjct: 61 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFGRTTGGPFISTSYDYDAPIDEYGDI 120
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKND 385
R PKWGHLK+LH AIKLCE AL+ + + S G + E VY + C+AFLAN+ +D
Sbjct: 121 RQPKWGHLKDLHKAIKLCEEALIASDPTITSPGPNLETAVYK-TGAVCSAFLANI-GMSD 178
Query: 386 KTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKG 445
TV F SYHLP WSVSILPDCK VV NTA V S E+L+ E S
Sbjct: 179 ATVTFNGNSYHLPGWSVSILPDCKNVVLNTAKVNTASMISSFATESLK--EKVDSLDSSS 236
Query: 446 LKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNE 492
W E GI F KSG ++ INTT D +DYLWY+ SI+ +
Sbjct: 237 SGWSWISEPVGISTPDAFTKSGLLEQINTTADRSDYLWYSLSIVYED 283
>gi|188501582|gb|ACD54708.1| beta-D-galactosidase-like protein [Adineta vaga]
Length = 735
Score = 335 bits (858), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 246/781 (31%), Positives = 380/781 (48%), Gaps = 99/781 (12%)
Query: 7 IAPFALLIF---FSSSITY----CFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMW 59
I F +LIF F+ +TY V+YD R++ ING R L+ S IHYPRS P MW
Sbjct: 6 IVFFTVLIFINTFAYPVTYDQVRGIPYRVSYDHRAITINGNRTLLFSGVIHYPRSTPAMW 65
Query: 60 PGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPF 119
P L+ +AKE G+NTI++YVFWN HE G Y F GR NL F++ A +++ LR+GP+
Sbjct: 66 PYLMSKAKEQGLNTIQTYVFWNIHEQKRGTYDFSGRANLSLFLQEAANAGLFVNLRLGPY 125
Query: 120 VAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILA 179
V AE++YG +PVWL+ IP FR+ + +K M++F++ I+ + + A GGPIILA
Sbjct: 126 VCAEWDYGALPVWLNNIPNIAFRSSNDAWKSEMKRFLSDII--VYVDGFLAKNGGPIILA 183
Query: 180 QVENEYGYYESFYGEGGKR-YALWAAKMAVAQ--NIGVPWIMCQQFDTPDPVINTCNSFY 236
Q+ENEYG G R Y W + + +PWIMC + I TCN
Sbjct: 184 QIENEYG--------GNDRAYVDWCGSLVSNDFASTQIPWIMCNGL-AANSTIETCNGCN 234
Query: 237 C------DQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHN 290
C D+ P+ P ++TENW GWF+ +G R ED+A+SVA +F GG+ H
Sbjct: 235 CFDDGWMDRHRRTYPNQPLLFTENW-GWFQGWGEGLGIRTPEDLAYSVAEWFANGGAYHA 293
Query: 291 YYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNG 350
YYM+HGG ++GRT GG +TT+Y + + G P PK+ HL L + LL+
Sbjct: 294 YYMWHGGNHYGRT-GGSGLTTAYSDDVILRADGTPNEPKFTHLNRLQRLLASQAQVLLSQ 352
Query: 351 ERSNLSL----------GSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAW 400
+ + LS+ G+ Q Y S F+ N V+F + +
Sbjct: 353 DSNRLSIPYWNGKQWTVGTQQMVYSYPPS----VQFVIN-QAAFSLFVLFNKQNISIAGQ 407
Query: 401 SVSILPDCKKVVFNTANVRAQS-STVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWG 459
SV I + +++N+A+V S + +VP + P L WQV+ E
Sbjct: 408 SVQIYDYNEHLLWNSADVSGISRNNTFLVPIVVGP-----------LDWQVYSEPF---- 452
Query: 460 EADF---VKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHA 516
+D V S ++ +N T D T YLWY ++ +++ + V + + ++L
Sbjct: 453 TSDLPVIVASTPLEQLNLTNDETIYLWYRRNVSLSQ-----PSVQTIVQVQTRRANSLLF 507
Query: 517 FANQELQGSASGNGTHPPFKYKNPISLKAGK---NE---IALLSMTVGLQN--AGP-FYE 567
F +++ G + +H I+L + N+ +LS+++G+ N GP +E
Sbjct: 508 FMDRQFVGYFDDH-SHTQGTINVNITLNLSQFLPNQQYIFEILSVSLGIDNFNIGPGSFE 566
Query: 568 WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQ 627
+ G + +V + G + W ++ GL GE IY + W N+
Sbjct: 567 YKGI-VGNVSLGG--QSLVGDEASIWEHQKGLFGEAHQIYTEQGSKTVEWNPKWTTVINK 623
Query: 628 PLTWYKA------VVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDEC 681
P+TW++ + ++ PI LD +G A++NG +IG YW + + C
Sbjct: 624 PVTWFQTRFDLNHLAREDLNANPILLDAFGFNRGHAFVNGNDIGLYWLIEGTCQNNLCCC 683
Query: 682 VQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRK 741
+Q T C +PSQR+YHI W KP+ N+L +FEE G K +++
Sbjct: 684 LQNQ------------TNCQQPSQRYYHISSDWLKPTNNLLTVFEEIGASSPKSVGLVQR 731
Query: 742 I 742
I
Sbjct: 732 I 732
>gi|68161830|emb|CAJ09952.1| beta-galactosidase [Mangifera indica]
Length = 362
Score = 331 bits (849), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 165/376 (43%), Positives = 223/376 (59%), Gaps = 20/376 (5%)
Query: 356 SLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNT 415
SLG++QE V+ SG+CAAFLAN D + V F+N+ Y LP WS+SILPDCK VFNT
Sbjct: 4 SLGNNQEVHVFNPKSGSCAAFLANYDTTSSAKVNFQNMQYELPPWSISILPDCKTAVFNT 63
Query: 416 ANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVF-KEIAGIWGEADFVKSGFVDHINT 474
A + AQSS +M P + WQ + +E A + F G + +N
Sbjct: 64 ARLGAQSSLKQMTPVST-------------FSWQSYIEESASSSDDKTFTTDGLWEQLNV 110
Query: 475 TKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPP 534
T+D +DYLWY T+I ++ NE FLKNG P+L I S GHALH F N +L G+ G +P
Sbjct: 111 TRDASDYLWYMTNINIDSNEGFLKNGQDPLLTIWSAGHALHVFINGQLSGTVYGGVDNPK 170
Query: 535 FKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSW 593
+ + ++ G N+++LLS++VGLQN G +E G+ V + G N GT DLS W
Sbjct: 171 LTFSQNVKMRVGVNQLSLLSISVGLQNVGTHFEQWNTGVLGPVTLRGLNEGTRDLSKQQW 230
Query: 594 TYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMG 653
+YKIGL+GE L ++ +++ WV + QPLTWYK P G+EP+ LDM MG
Sbjct: 231 SYKIGLKGEDLSLHTVSGSSSVEWVEGSSLAQKQPLTWYKTTFNAPAGNEPLALDMSTMG 290
Query: 654 KGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRS 713
KGL W+N + IGR+WP H C EC+Y G + KC T CG+PSQRWYH+PRS
Sbjct: 291 KGLIWINSQSIGRHWP----GYIAHGSC-GECNYAGTYTDKKCHTNCGQPSQRWYHVPRS 345
Query: 714 WFKPSENILVIFEEKG 729
W P+ N+LV+ + G
Sbjct: 346 WLNPTGNLLVVLKRVG 361
>gi|56550179|emb|CAE51355.1| putative beta-galactosidase [Musa acuminata]
Length = 281
Score = 325 bits (834), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 158/286 (55%), Positives = 195/286 (68%), Gaps = 5/286 (1%)
Query: 123 EYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVE 182
E+N+GG PVWL Y+PG FR D PFK M KF IV MMK E LF SQGGPIIL+Q+E
Sbjct: 1 EWNFGGFPVWLKYVPGINFRTDNGPFKAAMAKFTEKIVAMMKSEGLFESQGGPIILSQIE 60
Query: 183 NEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTP 242
NEYG E + G K Y WAA+MAV N VPW+MC+Q D PDPVIN CN FYCD F+P
Sbjct: 61 NEYGPVEYYGGTAAKNYLSWAAQMAVGLNTRVPWVMCKQDDAPDPVINACNGFYCDYFSP 120
Query: 243 HSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGR 302
+ P P +WTE W GWF F G + A V R + ++ + GTNFGR
Sbjct: 121 NKPYKPTMWTEAWTGWFTGFRGPVLTDCEDCFAVQVIRRWILVTTIVPW-----GTNFGR 175
Query: 303 TAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQE 362
TAGGPFI+TSYDY+APIDEYGL R PKWGHL++LH AIK+CE AL++G+ + LG+ QE
Sbjct: 176 TAGGPFISTSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKMCEPALVSGDPTVTKLGNYQE 235
Query: 363 ADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDC 408
A VY SG+CAAFL+N + + +V F + Y++P+WS+SILPDC
Sbjct: 236 AHVYRSKSGSCAAFLSNFNPHSYASVTFNGMKYNIPSWSISILPDC 281
>gi|16649045|gb|AAL24374.1| beta-galactosidase [Arabidopsis thaliana]
gi|20260008|gb|AAM13351.1| beta-galactosidase [Arabidopsis thaliana]
Length = 420
Score = 320 bits (820), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 182/453 (40%), Positives = 250/453 (55%), Gaps = 43/453 (9%)
Query: 293 MYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGER 352
MYHGGTNFGRT+ FIT YD +AP+DEYGL R PK+GHLKELH AIK + LL G++
Sbjct: 1 MYHGGTNFGRTSSSYFITGYYD-QAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQ 59
Query: 353 SNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVV 412
+ LSLG Q+A V+ D++ C AFL N D K + + FRN +Y L S+ IL +CK ++
Sbjct: 60 TILSLGPMQQAYVFEDANNGCVAFLVNNDAKASQ-IQFRNNAYSLSPKSIGILQNCKNLI 118
Query: 413 FNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHI 472
+ TA V + +T P + PDN W +F+E + + ++H
Sbjct: 119 YETAKVNVKMNTRVTTPVQV---FNVPDN------WNLFRETIPAFPGTSLKTNALLEHT 169
Query: 473 NTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTH 532
N TKD TDYLWYT+S ++ + P + ES GH +H F N L GS G+
Sbjct: 170 NLTKDKTDYLWYTSSFKLDS------PCTNPSIYTESSGHVVHVFVNNALAGSGHGSRDI 223
Query: 533 PPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYS 592
K + P+SL G+N I++LS VGL ++G + E G+T V+I+ + +DLS
Sbjct: 224 RVVKLQAPVSLINGQNNISILSGMVGLPDSGAYMERRSYGLTKVQISCGGTKPIDLSRSQ 283
Query: 593 WTYKIGLQGEHLGIYNPGYRNNINW-VSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLK 651
W Y +GL GE + +Y N + W ++ KN+PL WYK P GD P+GL M
Sbjct: 284 WGYSVGLLGEKVRLYQWKNLNRVKWSMNKAGLIKNRPLAWYKTTFDGPNGDGPVGLHMSS 343
Query: 652 MGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIP 711
MGKG W+NGE IGRYW +T G+PSQ YHIP
Sbjct: 344 MGKGEIWVNGESIGRYWV-------------------------SFLTPAGQPSQSIYHIP 378
Query: 712 RSWFKPSENILVIFEEKGGDPTKITFSIRKISG 744
R++ KPS N+LV+FEE+GGDP I+ + + G
Sbjct: 379 RAFLKPSGNLLVVFEEEGGDPLGISLNTISVVG 411
>gi|325183103|emb|CCA17560.1| betagalactosidase putative [Albugo laibachii Nc14]
Length = 811
Score = 312 bits (800), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 231/767 (30%), Positives = 366/767 (47%), Gaps = 114/767 (14%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+V Y R +I+G+ +++ +IHY RS P W L+ +AKE G+N ++ Y+FWN HE
Sbjct: 98 DVKYTKRGFVIDGKASILLGGSIHYARSTPDTWDSLLAKAKEDGLNLVQLYIFWNFHEPR 157
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G +YF R NL F + + +++ LR GP+V AE+N GG+P+WL IPG R+++E
Sbjct: 158 RGSFYFADRGNLTHFFERVVAHGLFVHLRFGPYVCAEWNRGGLPLWLDRIPGMKVRSNSE 217
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
++ M + + +++++ + F+ GGPII+AQ+ENEY ++ Y W +++
Sbjct: 218 SWRQEMNRIILIMINLAR--PYFSVNGGPIIMAQIENEYNGHDP-------TYVAWLSQL 268
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHS----PSMPKIWTENWPGWFKTF 262
IG+PW MC + I+TCN C QF + PS P +WTEN W++ +
Sbjct: 269 VRKLGIGIPWTMCNGASAVN-TISTCNDNDCFQFAEKNAKVFPSQPLVWTEN-EAWYEKW 326
Query: 263 G-------GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDY 315
G++ R E +A+ VAR+F GG++HNYYMYHGG NFGRTA +TT Y
Sbjct: 327 ATKNIAQDGQNDQRSPEQVAYVVARWFAVGGAMHNYYMYHGGNNFGRTASAG-VTTMYAD 385
Query: 316 EAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERS---NLSLGS------SQEADVY 366
A + GL PK HL++LH + C ALL+ ER LG +Q A +Y
Sbjct: 386 GAILHHDGLDNEPKRSHLRKLHHTLIRCNKALLSNERQLNHAKPLGPEGKNAYTQRAYIY 445
Query: 367 ADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVE 426
+ S FL N + ++ Y LP ++ IL D V++NT++V +
Sbjct: 446 GNCS-----FLENTHAIHRACFRYQLKEYCLPPQTIVIL-DHNNVLYNTSDVSGTLGS-- 497
Query: 427 MVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEAD---------FVKSGFVDHINTTKD 477
SP + W+ IW E D V ++ + T+D
Sbjct: 498 -----RSTRSFSPLIRFRKSDWK-------IWSEWDVNPHNVRDQIVNDSPLEQLLVTQD 545
Query: 478 TTDYLWYTTSIIVNENEEFLKNGSRPVLL--IESKGHALHAFANQELQGSAS----GNGT 531
TTDYL Y + N KN + +L I ++ F N E G G+
Sbjct: 546 TTDYLMYQNEVRWGSNGP-TKNKMKSSILKFISCDANSFLVFINGEFIGEQHLAYPGDDC 604
Query: 532 HPPFKYK-NPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLS 589
F++ P+ +++LS+++G+ + G ++ GI S V+I + +L
Sbjct: 605 SNIFRFDLGPLGKYGANLTLSILSISLGIHSLGEKHQ---KGIVSDVQI---DERSLVYG 658
Query: 590 TYS-WTYKIGLQGEHLGIYNPGYRNNINWVS-TMEPPKNQPLTWY--KAVVKQPPGD--E 643
+ W GL GE L +Y+P + N++ W + ++ + + WY K V+KQ D
Sbjct: 659 PHERWVMFSGLIGELLKLYDPMWSNSVPWRNLNVQTDRKRTSKWYMTKFVLKQLDWDTET 718
Query: 644 PIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEP 703
+ LD M +G +LNG ++GRYW ++ D G
Sbjct: 719 SVLLDCKGMNRGRIYLNGHDLGRYW------------LIRRSD--------------GAY 752
Query: 704 SQRWYHIPRSWFKPS--ENILVIFEEKGGDPTK----ITFSIRKISG 744
QR+Y IP +W + N LVIFEE + + +T ++R+I
Sbjct: 753 VQRYYTIPVAWLHAANKSNYLVIFEELRNETIESMRIVTSTMRRIDA 799
>gi|56550181|emb|CAE51356.1| putative beta-galactosidase [Musa AAB Group]
Length = 282
Score = 310 bits (794), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 155/286 (54%), Positives = 193/286 (67%), Gaps = 4/286 (1%)
Query: 123 EYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVE 182
E+N+GG PVWL Y+PG FR D PFK M KF IV MMK E LF SQGGPIIL+Q+E
Sbjct: 1 EWNFGGFPVWLKYVPGINFRTDNGPFKAAMAKFTEKIVAMMKSEGLFESQGGPIILSQIE 60
Query: 183 NEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTP 242
NEYG E + G K Y WAA+MAV N GVPW+MC+Q D PDPVIN N FYCD F+P
Sbjct: 61 NEYGPVEYYGGAAAKNYLSWAAQMAVGLNTGVPWVMCKQDDAPDPVINAGNGFYCDYFSP 120
Query: 243 HSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGR 302
+S + + W G + + F V + + +G NYYMYHGGTNFGR
Sbjct: 121 NS--LKTFFGGLKLDWLVPVSGSSSSQ-TVRTGFCV-QVYTEGWIFRNYYMYHGGTNFGR 176
Query: 303 TAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQE 362
TAGG FI+TSYDY+APIDEY L R PKWGHL++LH AIK+CE AL++G+ + LG+ QE
Sbjct: 177 TAGGLFISTSYDYDAPIDEYVLLRQPKWGHLRDLHKAIKMCEPALVSGDPTVTKLGNYQE 236
Query: 363 ADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDC 408
A VY SG+CAAFL+N + + +V F + Y++P+WS+SILPDC
Sbjct: 237 AHVYRSKSGSCAAFLSNFNPHSYASVTFNGMKYNIPSWSISILPDC 282
>gi|300121971|emb|CBK22545.2| unnamed protein product [Blastocystis hominis]
Length = 721
Score = 308 bits (789), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 224/748 (29%), Positives = 357/748 (47%), Gaps = 107/748 (14%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD RS ++G+R + ++ ++HYPR+ P MW ++ QA E G+N I+ Y FWN HE
Sbjct: 35 VTYDERSFFLDGKRSIFLAGSVHYPRATPEMWDTILDQAVEDGLNLIQIYTFWNLHEPVK 94
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G+Y + G ++ F++ +++ +RIGP+V AE++ GGIPVW++Y+ G R + +
Sbjct: 95 GQYNWEGIADIRLFLQKCADRGLFVNMRIGPYVCAEWDNGGIPVWVNYLDGVRLRANNDV 154
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
+K M +M ++ D + FA +GGPII +Q+ENE +G G + Y W + A
Sbjct: 155 WKKEMGDWMKVLTDYTR--DFFADRGGPIIFSQIENE------LWG-GAREYIDWCGEFA 205
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF-TPHSPS------MPKIWTENWPGWFK 260
+ + VPW+MC DT + IN CN C + H S P WTEN GWF+
Sbjct: 206 ESLELNVPWMMCNG-DTSEKTINACNGNDCSSYLESHGQSGRILVDQPGCWTEN-EGWFQ 263
Query: 261 TFGG----RDPH-----RPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITT 311
G RD + R +ED F+V +F +GGS HNYYM+ GG ++G+ AG +T
Sbjct: 264 IHGAASAERDDYEGWDARSAEDYTFNVLKFMDRGGSYHNYYMWFGGNHYGKWAGNG-MTN 322
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLN--GERSNLSLGSSQEADVYADS 369
Y I LP PK H ++H + LLN + +N + + +
Sbjct: 323 WYTNGVMIHSDTLPNEPKHSHTAKMHRMLANIAEVLLNDKAQVNNQKHLNCDNCNAFEYR 382
Query: 370 SG-ACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVR-AQSSTVEM 427
G +F+ N DK V++R++ Y LPAWS+ +L + V+F T NV+ V
Sbjct: 383 YGDRLVSFVENNKGSADK-VIYRDIVYELPAWSMIVLDEYDNVLFETNNVKPVNKHRVYH 441
Query: 428 VPENLQPSEASPDNGSKGLKWQVFKE-IAGIWGEAD--FVKSGFVDHINTTKDTTDYLWY 484
E L+ ++ + E ++ + EA V + +N T+D T++L+Y
Sbjct: 442 CEEKLE--------------FEYWNEPVSTLSQEAPRVVVSPKANEQLNMTRDLTEFLYY 487
Query: 485 TTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLK 544
T + ++E L G + +A A+ + GS + H + N I++K
Sbjct: 488 ETEVEFPQDECTLSIGG-------TDANAFVAYVDDHFVGSDDEHTHHDGWHTMN-INMK 539
Query: 545 A--GKNEIALLSMTVGLQNAGPFY---EWVGAGITS----VKITGFNSGTLDLSTYSWTY 595
+ GK+++ LLS ++G+ N W + + +K+ G D+ W +
Sbjct: 540 SGKGKHKLVLLSESLGVSNGMDSNLDPSWASSRLKGICGWIKLCGN-----DIFNQEWKH 594
Query: 596 KIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDML----K 651
GL GE ++ + W S +E N L WY++ K P G + G+++L
Sbjct: 595 YPGLVGEAKQVFTDEGMKTVTWKSDVENADN--LAWYRSTFKTPQGLKR-GIEVLLRPEG 651
Query: 652 MGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIP 711
M +G A++NG IGRYW K G GE +Q +YHIP
Sbjct: 652 MNRGQAYVNGHNIGRYWMIKD--------------------------GNGEYTQGYYHIP 685
Query: 712 RSWFK--PSENILVIFEEKGGDPTKITF 737
+ W K EN+LV+ E G +T
Sbjct: 686 KDWLKGEGEENVLVLGETLGASDPSVTI 713
>gi|298205259|emb|CBI17318.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 301 bits (771), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 169/357 (47%), Positives = 211/357 (59%), Gaps = 65/357 (18%)
Query: 58 MWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIG 117
MW GLV+ AKEGG++ IE+YVF NGHELSP YYFGG ++L+KF+KI+QQA MY+IL IG
Sbjct: 1 MWSGLVKTAKEGGIDVIETYVFQNGHELSPSNYYFGGWYDLLKFVKIVQQAGMYLILHIG 60
Query: 118 PFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPII 177
PFVA E+N+G T+F+ +++PFKYHMQKFMTLIV++MK++KLFASQGGPII
Sbjct: 61 PFVATEWNFG-----------TIFQTNSKPFKYHMQKFMTLIVNIMKKDKLFASQGGPII 109
Query: 178 LAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCN---- 233
L Q +NEYG + Y +GGK Y +WAA M ++ NIGVPWIMC Q+ D I
Sbjct: 110 LTQAKNEYGDTKRIYEDGGKPYVMWAANMVLSHNIGVPWIMC-QYSYVDIYIYIVKKEGL 168
Query: 234 -------SFYCDQFTPHS---------PSMPKIWTENWPGWFKTFGGRDPHRPSED-IAF 276
+ HS + PK + K G HR D +
Sbjct: 169 YSLSYQYALILSTLVTHSIVTNSHQILQAKPKCGLKIGLDGLKHLG----HRILTDYMKI 224
Query: 277 SVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
+ NYYMYHGGTNFG T+GGPFITT+Y+Y APIDEYGL R PK
Sbjct: 225 LLFLLLFFFFQKVNYYMYHGGTNFGCTSGGPFITTTYNYNAPIDEYGLARLPK------- 277
Query: 337 HGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNV 393
C SQE DVYADS G AAF++N+D+K DK +VF+NV
Sbjct: 278 ------C---------------PSQEVDVYADSLGGYAAFISNVDEKEDKMIVFQNV 313
>gi|320129049|gb|ADW19770.1| beta-galactosidase [Fragaria chiloensis]
Length = 219
Score = 297 bits (761), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 136/218 (62%), Positives = 166/218 (76%)
Query: 58 MWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIG 117
MWP L+Q+AK+GG++ I++YVFWNGHE SPGKYYF ++LVKFIK++QQA +Y+ LRIG
Sbjct: 2 MWPDLIQRAKDGGLDVIQTYVFWNGHEPSPGKYYFEDNYDLVKFIKLVQQAGLYVHLRIG 61
Query: 118 PFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPII 177
P+V AE+N+GG PVWL YIPG FR D PFK MQ+F T IV+MMK E+LF S GGPII
Sbjct: 62 PYVCAEWNFGGFPVWLKYIPGIQFRTDNGPFKDQMQRFTTKIVNMMKAERLFESHGGPII 121
Query: 178 LAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYC 237
L+Q+ENEYG E G GK Y WAA+MAV GVPW+MC+Q D PDPVIN CN FYC
Sbjct: 122 LSQIENEYGPMEYEIGAPGKAYTDWAAQMAVGLGTGVPWVMCKQDDAPDPVINACNGFYC 181
Query: 238 DQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIA 275
D F+P+ PK+WTE W GWF FGG P+RP+ED+A
Sbjct: 182 DYFSPNKAYKPKMWTEAWTGWFTEFGGAVPYRPAEDLA 219
>gi|413925746|gb|AFW65678.1| hypothetical protein ZEAMMB73_601729 [Zea mays]
Length = 402
Score = 286 bits (733), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 159/382 (41%), Positives = 222/382 (58%), Gaps = 17/382 (4%)
Query: 290 NYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLN 349
NYYMYHGGTNFGRT+ F+ Y EAP+DE+GL + PKWGHL++LH A+KLC+ ALL
Sbjct: 3 NYYMYHGGTNFGRTSAA-FVMPKYYDEAPLDEFGLYKEPKWGHLRDLHLALKLCKKALLW 61
Query: 350 GERSNLSLGSSQEADVYA-DSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDC 408
G+ S LG EA V+ C AFL+N + K+D T+ FR SY +P S+SIL DC
Sbjct: 62 GKTSTEKLGKQFEARVFEIPEQKVCVAFLSNHNTKDDVTLTFRGQSYFVPRHSISILADC 121
Query: 409 KKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVF-KEIAGIWGEADFVKSG 467
K VVF T +V AQ + Q + D ++ WQ+F +E + ++
Sbjct: 122 KTVVFGTQHVNAQHN---------QRTFHFADQTTQNNVWQMFDEEKVPKYKQSKIRLRK 172
Query: 468 FVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSAS 527
D N TKD TDY+WYT+S + ++ ++ + VL + S GHA AF N + G
Sbjct: 173 AGDLYNLTKDKTDYVWYTSSFKLEADDMPIRRDIKTVLEVNSHGHASVAFVNTKFVGCGH 232
Query: 528 GNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLD 587
G + F + P+ LK G N +A+L+ T+G+ ++G + E AG+ V+I G N+GTLD
Sbjct: 233 GTKMNKAFTLEKPMDLKKGVNHVAVLASTMGMMDSGAYLEHRLAGVDRVQIKGLNAGTLD 292
Query: 588 LSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKN-QPLTWYKAVVKQPPGDEPIG 646
L+ W + +GL GE IY ++ W +P N +PLTWYK P G++PI
Sbjct: 293 LTNNGWGHIVGLVGEQKQIYTDKGMGSVTW----KPAVNDRPLTWYKRHFDMPSGEDPIV 348
Query: 647 LDMLKMGKGLAWLNGEEIGRYW 668
LDM MGKGL ++NG+ IGRYW
Sbjct: 349 LDMSTMGKGLMFVNGQGIGRYW 370
>gi|16973314|emb|CAC84109.1| putative galactosidae, partial [Gossypium hirsutum]
Length = 383
Score = 284 bits (726), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 159/422 (37%), Positives = 231/422 (54%), Gaps = 45/422 (10%)
Query: 317 APIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD-SSGACAA 375
P+DE+GL R PKWGHLK++H A+ LC+ AL G + L LG Q+A V+ + ACAA
Sbjct: 4 GPLDEFGLQREPKWGHLKDVHRALSLCKRALFWGFPTTLKLGPDQQAIVWQQPGTSACAA 63
Query: 376 FLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPS 435
LAN + + + V FR LPA S+S+LPDCK VVFNT V Q ++ V +
Sbjct: 64 LLANNNTRLAQHVNFRGQDIRLPARSISVLPDCKTVVFNTQLVTTQHNSRNFVRSEI--- 120
Query: 436 EASPDNGSKGLKWQVFKEI--AGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNEN 493
+K W++++E+ G+ + D + F + TKDTTDY WYTTS+++
Sbjct: 121 ------ANKNFNWEMYREVPPVGLGFKFDVPRELF----HLTKDTTDYAWYTTSLLLGRR 170
Query: 494 EEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALL 553
+ +K RPVL + S GH +HA+ N E GSA G+ F + SLK G+N IALL
Sbjct: 171 DLPMKKNVRPVLRVASLGHGIHAYVNGEYAGSAHGSKVEKSFVCRELSSLKEGENHIALL 230
Query: 554 SMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRN 613
VGL ++G + E AG S+ I G N+GTLD+S W +++G GE ++
Sbjct: 231 GYLVGLPDSGAYMEKRFAGPRSITILGLNTGTLDISQNGWGHQVGTDGEKKKLFTEEGSK 290
Query: 614 NINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSR 673
++ W +P + PLTWYK P GD P+ + M MGKG+ W+NG IGRYW
Sbjct: 291 SVQWT---KPDQGGPLTWYKGYFDAPEGDNPVAIVMTGMGKGMVWVNGRSIGRYW----- 342
Query: 674 KSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPT 733
+ ++ +P+Q YHIPR++ KP +N++V+ EE+GG+P
Sbjct: 343 --------------------NNYLSPLKKPTQSEYHIPRAYLKP-KNLIVLLEEEGGNPK 381
Query: 734 KI 735
+
Sbjct: 382 DV 383
>gi|217075791|gb|ACJ86255.1| unknown [Medicago truncatula]
Length = 267
Score = 278 bits (712), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 146/279 (52%), Positives = 182/279 (65%), Gaps = 12/279 (4%)
Query: 293 MYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGER 352
MYHGGTNF R+ GGPFI TSYDY+APIDEYG+ R KWGHLK+++ AIKLCE AL+ +
Sbjct: 1 MYHGGTNFDRSTGGPFIATSYDYDAPIDEYGIIRQQKWGHLKDVYKAIKLCEEALITTDP 60
Query: 353 SNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVV 412
SLG + EA VY S CAAFLAN+D KNDKTV F SYHLPAWSVS+LPDCK VV
Sbjct: 61 KISSLGQNLEAAVYKTGS-VCAAFLANVDTKNDKTVNFSGNSYHLPAWSVSMLPDCKNVV 119
Query: 413 FNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHI 472
NTA + + S+ V E++ E S KW E GI + K+G ++ I
Sbjct: 120 LNTAKINSASAISNFVTEDISSLETSSS------KWSWINEPVGISKDDILSKTGLLEQI 173
Query: 473 NTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTH 532
NTT D +DYLWY+ S+ + ++ GS+ VL IES GH LHAF N +L G+ +GN
Sbjct: 174 NTTADRSDYLWYSLSLDLADDP-----GSQTVLHIESLGHTLHAFINGKLAGNQAGNSDK 228
Query: 533 PPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGA 571
PI+L +GKN+I LLS+TVGLQN G F++ VGA
Sbjct: 229 SKLNVDIPIALVSGKNKIDLLSLTVGLQNYGAFFDTVGA 267
>gi|62529271|gb|AAX84941.1| beta-galactosidase [Prunus persica]
Length = 287
Score = 276 bits (707), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 146/299 (48%), Positives = 188/299 (62%), Gaps = 15/299 (5%)
Query: 306 GPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADV 365
GPF+ TSYDY+AP+DEYGLPR PKWGHL++LH AIK E AL++ E S SLG+ QEA V
Sbjct: 1 GPFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKSSESALVSAEPSVTSLGNGQEAHV 60
Query: 366 YADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTV 425
+ SG CAAFLAN D K+ V F N Y LP WS+SILPDCK V+NTA + +QSS +
Sbjct: 61 FKSKSG-CAAFLANYDTKSSAKVSFGNGQYELPPWSISILPDCKTAVYNTARLGSQSSQM 119
Query: 426 EMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVK-SGFVDHINTTKDTTDYLWY 484
+M P L WQ F E + E+D G + IN T+DTTDYLWY
Sbjct: 120 KMTPVK------------SALPWQSFVEESASSDESDTTTLDGLWEQINVTRDTTDYLWY 167
Query: 485 TTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLK 544
T I ++ +E F+K G P+L I S GHALH F N +L G+ G +P + + L+
Sbjct: 168 MTDITISPDEGFIKRGESPLLTIYSAGHALHVFINGQLSGTVYGALENPKLTFSQNVKLR 227
Query: 545 AGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGE 602
+G N++ALLS++VGL N G +E AG+ V + G NSGT D+S + WTYK GL+GE
Sbjct: 228 SGINKLALLSISVGLPNVGLHFETWNAGVLGPVTLKGLNSGTWDMSRWKWTYKTGLKGE 286
>gi|356544613|ref|XP_003540743.1| PREDICTED: beta-galactosidase 8-like [Glycine max]
Length = 288
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 149/292 (51%), Positives = 180/292 (61%), Gaps = 11/292 (3%)
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAP 318
F +FG PHRP ED+AF+VARF+Q+GG+ NYYM+HGGTNFGRT GGPFI+TSYD++ P
Sbjct: 6 FVSFGDVVPHRPVEDLAFAVARFYQRGGTFQNYYMFHGGTNFGRTTGGPFISTSYDFDTP 65
Query: 319 IDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACAAFLA 378
IDEYG+ R PKW HLK +H AIKLCE ALL + LG + EA VY + AAFLA
Sbjct: 66 IDEYGIIRQPKWDHLKNVHKAIKLCEKALLATGPTITYLGPNIEAAVY-NIGAVSAAFLA 124
Query: 379 NMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEAS 438
N+ K D V F SYHLPAW VS LPDCK VV NTA + + S E+L+ S
Sbjct: 125 NI-AKTDAKVSFNGNSYHLPAWYVSTLPDCKSVVLNTAKINSASMISSFTTESLKEEVGS 183
Query: 439 PDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLK 498
D+ G W E GI F K ++ INTT D +DYLWY++SI ++ E
Sbjct: 184 LDDSGSGWSW--ISEPIGISKAHSFSKFWLLEQINTTADRSDYLWYSSSIDLDAATE--- 238
Query: 499 NGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEI 550
VL IES GHALHAF N +L GS +GN K PI+L GKN I
Sbjct: 239 ----TVLHIESLGHALHAFVNGKLAGSGTGNHEKVSVKVDIPITLVYGKNTI 286
>gi|38699452|gb|AAR27062.1| beta-galactosidase 2 [Ficus carica]
Length = 177
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 129/177 (72%), Positives = 148/177 (83%)
Query: 482 LWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPI 541
LWY TSI V+ENE FLKNGS+P+LL+ESKGHALHAF NQELQGSASGNGTH P+K+K PI
Sbjct: 1 LWYMTSIYVDENEGFLKNGSQPILLVESKGHALHAFVNQELQGSASGNGTHSPYKFKKPI 60
Query: 542 SLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQG 601
SLKAGKNEIALLSMTVGLQNAG FYEWVGAG+T+V+I+GF +G ++LS +WTYKIGLQG
Sbjct: 61 SLKAGKNEIALLSMTVGLQNAGSFYEWVGAGLTNVEISGFKNGPVNLSNSTWTYKIGLQG 120
Query: 602 EHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAW 658
E LGIY +NW++T PPK QPL WYKAV+ P GDEP+GLDML MGKG W
Sbjct: 121 EQLGIYKEDGVAKVNWIATSNPPKKQPLIWYKAVIDPPLGDEPVGLDMLHMGKGQIW 177
>gi|301123859|ref|XP_002909656.1| beta-galactosidase, putative [Phytophthora infestans T30-4]
gi|262100418|gb|EEY58470.1| beta-galactosidase, putative [Phytophthora infestans T30-4]
Length = 706
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 181/614 (29%), Positives = 297/614 (48%), Gaps = 73/614 (11%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+VTY R I+G++ L++ +IHYPRS PG W L+++AK G+N IE YVFWN HE
Sbjct: 84 SVTYSPRGFEIDGKQTLLLGGSIHYPRSSPGEWEQLLREAKRDGLNHIEMYVFWNLHEQE 143
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G + F G N+ +F ++ + +++ +R GP+V AE+N GG+P+WL++IPG R+
Sbjct: 144 RGVFNFAGNANITRFYELAAEVGLFLHVRFGPYVCAEWNNGGLPLWLNWIPGMEVRSSNA 203
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
P++ M++F+ +V++ + A GGPII+AQ+ENE+ +++ Y W +
Sbjct: 204 PWQREMERFIRYMVELSR--PFLAKNGGPIIMAQIENEFAWHD-------PEYIAWCGNL 254
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQF----TPHSPSMPKIWTENWPGWFKTF 262
+ +PW+MC + + I +CN C F PS P +WTE+ GWF+T+
Sbjct: 255 VKQLDTSIPWVMCYA-NAAENTILSCNDDDCVDFAVKHVKERPSDPLVWTED-EGWFQTW 312
Query: 263 --GGRDP----HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYE 316
++P R ED+A++VAR+F GG+ HNYYMYHGG N+GR A +TT Y
Sbjct: 313 QKDKKNPLPNDQRSPEDVAYAVARWFAVGGAAHNYYMYHGGNNYGRAASAG-VTTMYADG 371
Query: 317 APIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLS---LGSSQEADVYADSSGAC 373
+ GL PK HL++LH A+ C LL +R L+ L E V A S
Sbjct: 372 VNLHSDGLSNEPKRTHLRKLHEALIECNDVLLRNDRQVLNPRELPLVDEQTVKASSQQRA 431
Query: 374 AAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQ 433
+ + D ++F+TA+VR Q
Sbjct: 432 FVYGPEAEPNQDGA-----------------------ILFDTADVRKSFP-------GRQ 461
Query: 434 PSEASPDNGSKGLKWQVFKE--IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVN 491
+P + L W+ + E ++ V ++ + T D +DYL Y T+
Sbjct: 462 HRTYTPLVKASALAWKAWSELNVSSTTPRRRVVADQPIEQLRLTADQSDYLTYETTFTPK 521
Query: 492 ENEEFLKNGSRPVLLIESKGHALHAFANQELQGSAS----GNGTHPPFKYKNPISLKAGK 547
+ + + + V + + ++ A + L G + G F + P S++ G+
Sbjct: 522 QLSD-VDDDMWTVKVTSCEASSIIALVDGWLIGERNLAYPGGNCSKEFSFHLPASIEVGR 580
Query: 548 -NEIALLSMTVGLQNAGPFYEWVGAGIT-SVKITGFNSGTLDLST-YSWTYKIGLQGEHL 604
+++ L+S+++G+ + G + G+T SV+I G DL+ W L GE L
Sbjct: 581 QHDLKLVSVSLGIYSLGSNH---SKGVTGSVRI-----GHKDLARGQRWEMYPSLIGEQL 632
Query: 605 GIYNPGYRNNINWV 618
IY + + + W
Sbjct: 633 EIYRSQWIDAVPWT 646
>gi|452821358|gb|EME28389.1| beta-galactosidase [Galdieria sulphuraria]
Length = 1171
Score = 267 bits (682), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 150/420 (35%), Positives = 225/420 (53%), Gaps = 23/420 (5%)
Query: 43 LIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFI 102
++ A+IHYPR P W L++ AKE G+N IE+YVFWN HE G Y F GR +L FI
Sbjct: 477 ILFPASIHYPRCQPSDWQQLIEFAKEAGINCIETYVFWNQHEKEKGVYDFSGRLDLFGFI 536
Query: 103 KIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDM 162
+ I +A +Y +LRIGP++ AE ++GG P WL I G FR EPF+ +++ +V+
Sbjct: 537 RTIAKAGLYALLRIGPYICAETHFGGFPHWLRDIDGIEFRTQNEPFQRESSRWVRFLVEK 596
Query: 163 MKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQF 222
+ F SQGGPI++ Q ENEY YGE G Y W +++A + VP MC+
Sbjct: 597 LNSNNCFYSQGGPIVMVQFENEYKLIGQNYGEAGLNYLKWCSELAKDLQLPVPLFMCK-- 654
Query: 223 DTPDPVINTCNSFYCDQFTPHS----PSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSV 278
+ + V+ T N FY Q + P+ P IWTE W GW+ +G RP +D+ ++V
Sbjct: 655 GSIENVLETINDFYGHQEMENHHREYPNQPAIWTECWTGWYDVWGSAHHIRPCKDLFYAV 714
Query: 279 ARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHG 338
RFF +GG NYYM+HGGTN+ + A TTSYDY+APIDEYG + K+ L+ +H
Sbjct: 715 LRFFAQGGKGINYYMFHGGTNYDQLAMY-LQTTSYDYDAPIDEYG-RKTKKYFGLQYIHR 772
Query: 339 AIKLCEHALLNGERSNLSLGSSQEADVYA-----DSSGACAAFLANMDDKNDKTVVFRNV 393
++ +H + + S E D Y + G+ F N + K V ++
Sbjct: 773 QLE--QHFASLALKLEAPIAHSYE-DNYVWIFIWEEQGSNCIFFCNDHPTSTKQVQWKEQ 829
Query: 394 SYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKE 453
Y L SV ++ D +++ + + E++ + L+P + + + WQ +KE
Sbjct: 830 EYCLAPLSVQMVVDHHRLILKSDQLFVDE---ELIQKELKPISVTTEEWT----WQYYKE 882
>gi|452825532|gb|EME32528.1| beta-galactosidase [Galdieria sulphuraria]
Length = 752
Score = 265 bits (678), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 216/777 (27%), Positives = 357/777 (45%), Gaps = 106/777 (13%)
Query: 24 FAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGH 83
+ G ++DSR++ +NG+R L++ ++ YP+ W ++ AKE G+N ++ YVFWN H
Sbjct: 3 YQGVASFDSRAITLNGKRTLLLGGSLQYPKIHHTQWNNTLKLAKECGLNFLDIYVFWNVH 62
Query: 84 ELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRN 143
E G + F ++ +F+++ Q + ++LR+GP++ AE +YGG P WL IPG FR
Sbjct: 63 EKKRGIFTFTEEADIFRFLQMAHQHGLLVMLRLGPYICAETSYGGFPCWLREIPGIQFRT 122
Query: 144 DTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
+PF +++++ I ++K ++LF QGGPI+L Q+ENEY G++Y W
Sbjct: 123 YNDPFMREVKRWLFYITTLLKEKRLFFPQGGPIVLVQLENEYDLVSKIQLSKGEQYLNWY 182
Query: 204 AKMAVAQNIGVPWIMCQQFDTPDPV---------------------INTCNSFY----CD 238
++ VP IMC+ +P+ V I T NSFY
Sbjct: 183 NELYRELAFDVPLIMCR--SSPEEVGEFCSCSKEPELSTIASVETCIETFNSFYGHKKIA 240
Query: 239 QFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGT 298
P P +WTE W GW+ + R +ED+ ++ RF +GG+ +YYM+HGGT
Sbjct: 241 DLRRRKPHQPILWTEFWIGWYDIWTSAPRKRSTEDVIYAALRFIAQGGAGFSYYMFHGGT 300
Query: 299 NFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLG 358
+F A TTSY +++PIDEYG P + + H + H L L L
Sbjct: 301 HFNNLAMYS-QTTSYYFDSPIDEYGRPSFLFYMLKRINHILHQFSSHLLSQDHPQVLHLL 359
Query: 359 SSQEADVYAD-SSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTAN 417
A ++ + SS +FL N D + ++F+ + SV++ + +++F+++
Sbjct: 360 PQVVAFIWQEHSSQQSLSFLCN-DSEQIAYIMFQQSMMKMNPLSVAVFLE-NELLFDSS- 416
Query: 418 VRAQSSTVEMVP-ENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTK 476
S +P + +P E + K + + I + DF S D ++ T+
Sbjct: 417 ----SGYDWQIPFRDFKPLERAYFRELKTFQLDI--PIPPLSSSCDF--SQLPDMLSVTQ 468
Query: 477 DTTDYLWYTTSIIV-NENEEFLKNGSRPVLLIESKGHALHAFANQELQGS---------- 525
D TDY+WY +S + ++EF VLL +H F NQ+ GS
Sbjct: 469 DETDYMWYISSATLPVSSKEF---TCEKVLLQIEMADLIHLFINQQYMGSSWIKIDDERF 525
Query: 526 ASG-NGTHPPFKYKN-----PISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS---- 575
A+G NG +++N P+ K +++L ++GL G F W GA +
Sbjct: 526 ANGKNGFRFSIEFENSVYPQPVFSSNSKLYVSILVCSLGLIK-GEFQLWKGATMEKEKKG 584
Query: 576 ----------VKITGFNSGTLDLS-TYSWTYK-IGLQGEHLGIYNPGYRNNINWVSTMEP 623
VK + + T+ LS T SW + + +H + Y + ++
Sbjct: 585 LFKQPIIHFVVKHSELETETIPLSFTSSWAMMPLSIMKDHQSAFVKEYN-----IKNVDK 639
Query: 624 PKNQPLTWYK--AVVKQPPGDEP---IGLDMLKMGKGLAWLNGEEIGRYWPR----KSRK 674
P + T+YK ++ + D + +D M KG+ N GRY+ K R
Sbjct: 640 PLSLGPTYYKQTVIINKAMIDALKWGLVIDFSSMTKGIFRWNSFCCGRYYSIQVLGKERD 699
Query: 675 SSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGD 731
S + VQE D+ K +QR+YHIP+ + N L +FEE GG+
Sbjct: 700 PSLRNSPVQE-DHLFK------------STQRYYHIPKGVLQ-ERNELEVFEEIGGN 742
>gi|452819191|gb|EME26260.1| beta-galactosidase [Galdieria sulphuraria]
Length = 652
Score = 265 bits (677), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 179/580 (30%), Positives = 289/580 (49%), Gaps = 55/580 (9%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
VT+D R+++I+G+R ++ + HYP+ WP ++ AK+ G+N +E Y+FWN HE
Sbjct: 3 TAQVTFDKRAVVIDGKRTILYCGSYHYPKIHYEHWPQALELAKDCGLNCLEVYIFWNVHE 62
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
G Y+F N+ +F+++ Q+ + +ILR+GP++ AE +YGG P WL IPG FR
Sbjct: 63 KKKGVYHFEREGNIFRFLQLAQERGLKVILRMGPYICAETSYGGFPYWLREIPGIEFRTY 122
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
EPF M++++T I M+K KL+ +GGPIIL Q+ENEY S YG G++Y W
Sbjct: 123 NEPFMKEMKRWLTDINRMLKENKLYHQKGGPIILVQIENEYDIVSSIYGAAGQKYLHWCY 182
Query: 205 KMAVAQNIGVPWIMCQQFD-----TPDPVINTCNSFY----CDQFTPHSPSMPKIWTENW 255
++ + W+ + + + D I T N FY D P P +WTE W
Sbjct: 183 EL--YKEGASEWLTSKDSEYFRVASIDKSIETINDFYGHRRIDSLKALKPHQPLLWTEFW 240
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDY 315
GW+ + G RP +D+ ++ ARF +GGS NYYM+HGGT+FG A TT YD+
Sbjct: 241 IGWYNIWRGAQRQRPVDDVIYAAARFIAQGGSGMNYYMFHGGTHFGNLAMYG-QTTGYDF 299
Query: 316 EAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA---DSSGA 372
+AP+D YG P K+ LK+L+ + E+ LL+ + + + +VY SG
Sbjct: 300 DAPVDSYGRP-TEKFERLKQLNHCLSNLEYILLSQDEPEVQ-KLTPNVNVYRWKDIESGD 357
Query: 373 CAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVV---FNTANVRAQS-STVEMV 428
+F+ N D ++ V+ + L SV I + ++V N+ NV +S ++ V
Sbjct: 358 ECSFVCN-DQRSQSYVIVAERAVCLKPLSVKIYLNHEEVFDSSQNSYNVSQKSYHRLDYV 416
Query: 429 PENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYT--T 486
+ + + K K +F D ++ T+D TDY+WYT
Sbjct: 417 CNEWKTMQIPIPSKEKKDKEHF-----------EFSFPHIPDMLHITQDETDYMWYTGVG 465
Query: 487 SIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASG-----------NGTHPPF 535
+I E + + + +E+ + +H F N++ GS +G F
Sbjct: 466 TIYCPFKGENTPHCLKIHMELEAADY-VHVFLNRKYVGSCRSPCYDERFTGRRSGFSKSF 524
Query: 536 KYKN--PISLKAGKN-----EIALLSMTVGLQNAGPFYEW 568
++ P+ + A K+ E+A+L ++GL G F W
Sbjct: 525 DLEDFAPMQIAADKDGTYKFELAILVCSLGLIK-GEFQLW 563
>gi|449018329|dbj|BAM81731.1| probable beta-galactosidase [Cyanidioschyzon merolae strain 10D]
Length = 777
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 215/777 (27%), Positives = 354/777 (45%), Gaps = 125/777 (16%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+TYDSRSL ING+ +S A+HY RS P WP + + + G+NT+E+YVFW HE
Sbjct: 9 EITYDSRSLRINGKPFFCLSGAVHYVRSHPSAWPQIFRCMRRDGLNTVETYVFWGDHEFE 68
Query: 87 PG-------KYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYI--- 136
P + F G +LV+F++ + + ILR+GP+V AE NYGG P WL +
Sbjct: 69 PPEMPDAEPRADFSGPRDLVRFLRCAKLHGLNAILRLGPYVCAEVNYGGFPWWLRQVCEK 128
Query: 137 ---PGTVFRNDTEPFKYHMQKFMTLIVD-MMKREKLFASQGGPIILAQVENEYGYYESFY 192
FR + +++++ +VD ++K ++FA QGGP+ILAQ+ENEY Y
Sbjct: 129 GSSKPVRFRTWDPAYCAQVERWLKYLVDHVLKPARVFAPQGGPVILAQIENEYAMIAESY 188
Query: 193 GEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDP--VINTCNSFYCDQFTPH------S 244
G G++Y W A +A +GVP +MC + VI T N+FY + +
Sbjct: 189 GPDGQQYLDWIASLANQLALGVPLVMCYGASQRESGRVIETINAFYAHEHVESLRRAQGA 248
Query: 245 PSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTA 304
P +WTE W GW+ +G R + D+A++V RF GG+ NYYMY GGTN+ R
Sbjct: 249 NPQPLLWTECWTGWYDVWGAPHHRRDAADLAYAVLRFLAAGGAGINYYMYFGGTNWRREN 308
Query: 305 GGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIK--LCEH-ALLNGERSNLSLGSSQ 361
TSYDY+AP++EY + K HL+ LH +I+ L + +L+ R L + +
Sbjct: 309 TMYLQATSYDYDAPLNEYVM-ETTKSRHLRRLHESIQPFLSDRDGVLDMSRLELKVFEGE 367
Query: 362 EADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQ 421
+ + S + D +++++V + VF++A++R
Sbjct: 368 RRAILYERSTVS----GDADHRSEESV---------------------RCVFDSADIRVH 402
Query: 422 SSTVEMVPENLQPSEASPDNGSKGLKWQVFKE---IAGIWGEADFVKSGFVDHINTTKDT 478
+ + + + AS D G + L+W++ E + + + D ++ T T
Sbjct: 403 ---LALELREIIVNAASRDTG-QDLRWRMLPEPPPLRAALSDTSATLATIPDLVDATAGT 458
Query: 479 TDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFA--------NQELQGSASGNG 530
+DY WY + L+ L + G A Q L+ +A+ G
Sbjct: 459 SDYAWYILRCPTAQGSGLLQ------LEVADFGRVWRRKAVDQGDDAERQPLEWAAA--G 510
Query: 531 THPPFKYKNPISLKAGK--------------NEIALLSMTVGLQNAGPFYEWVGAGITSV 576
PP + + P + + + E +L ++G+ G + G G+
Sbjct: 511 PEPPVEDRFPNAWNSTEYGYGIVEVGAIDCHEEYVVLVSSLGMVK-GDWQLPPGYGMARE 569
Query: 577 KITGFNSGTLDLSTYS---WT------YKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQ 627
+ + T++ W + GL+GE + G + ++ T P+
Sbjct: 570 RKGLLRASYRSDVTFADDEWRDALVVGFAAGLRGERIRSVIEGDADAYPYLWT---PQKA 626
Query: 628 PLT--------WYKAVVKQPP--GDEPIG--LDMLKMG--KGLAWLNGEEIGRYWPRKSR 673
L+ WY+A + PP DE G LD+ + G KG ++NGE GR+W +
Sbjct: 627 ALSGRRFSWPRWYRASLAIPPPNADETEGIILDLYESGVEKGWIYMNGEPCGRHW--RVH 684
Query: 674 KSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWF---KPSENILVIFEE 727
+ P + +++ D G G+P+QR+++IP W K + LVIF+E
Sbjct: 685 GTMPKNGFLRQGDQEAPIEQ----VGHGQPTQRYFYIP-PWHLHAKGRPSTLVIFDE 736
>gi|297797852|ref|XP_002866810.1| hypothetical protein ARALYDRAFT_912308 [Arabidopsis lyrata subsp.
lyrata]
gi|297312646|gb|EFH43069.1| hypothetical protein ARALYDRAFT_912308 [Arabidopsis lyrata subsp.
lyrata]
Length = 448
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 126/266 (47%), Positives = 166/266 (62%), Gaps = 22/266 (8%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
VTYD SLIING+REL+ S ++HYPRS P MWP ++ +A+ GG+NTI++YVFWN HE
Sbjct: 42 VTYDGTSLIINGKRELLFSVSVHYPRSTPDMWPSIIDKARIGGLNTIQTYVFWNVHEPEH 101
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
KY F GRF+LV FIK+IQ+ +Y+ LR+GPF+ AE+N+GG+P WL +P FR D EP
Sbjct: 102 RKYDFKGRFDLVTFIKLIQEKGLYVTLRLGPFIQAEWNHGGLPYWLREVPEVYFRTDNEP 161
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
FK H ++++ I+ MMK EKL ASQ L ENE + Y E G+RY WAA +
Sbjct: 162 FKEHTERYVRKILGMMKEEKLLASQRRSHHLG-TENECNAVQLAYKENGERYIKWAANLV 220
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
+ +G+PW+MC+Q + D +IN CN +C F+ G
Sbjct: 221 ESMKLGIPWVMCKQNNASDNLINACNGRHC---------------------FEFLGILQL 259
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYM 293
SEDIAFSVAR+F K GS NYYM
Sbjct: 260 IEQSEDIAFSVARYFSKNGSHVNYYM 285
>gi|3021342|emb|CAA06310.1| beta-galactosidase [Cicer arietinum]
Length = 307
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 128/300 (42%), Positives = 171/300 (57%), Gaps = 7/300 (2%)
Query: 443 SKGLKWQVFKEIAGIWGEADFVKS-GFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGS 501
S WQ + E G D + ++ I T+D++DYLWY T + ++ NE F+KNG
Sbjct: 12 SSAFDWQSYNEAPASSGIDDSTTANALLEQIKVTRDSSDYLWYMTDVNISPNEGFIKNGQ 71
Query: 502 RPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQN 561
PVL S GH LH F N + G+A G +P + N + L+ G N+I+LLS+ VGL N
Sbjct: 72 YPVLTAMSAGHVLHVFVNGQFSGTAYGGLENPKLTFSNSVKLRVGNNKISLLSVAVGLSN 131
Query: 562 AGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVST 620
G YE G+ V + G N GT DLS W+YKIGL+GE L ++ +++ W
Sbjct: 132 VGLHYETWNVGVLGPVTLKGLNEGTRDLSGQKWSYKIGLKGETLNLHTLIGSSSVQWTKG 191
Query: 621 MEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDE 680
+ QPLTWYKA P G++P+ LDM MGKG W+NGE IGR+WP + S
Sbjct: 192 SSLVEKQPLTWYKATFDAPAGNDPLALDMSSMGKGEIWVNGESIGRHWPAYIARGS---- 247
Query: 681 CVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
C+Y G F KC T CG+P+Q+WYHIPRSW P N LV+ EE GGDP+ I+ R
Sbjct: 248 -CGGCNYAGTFTDKKCRTSCGQPTQKWYHIPRSWVNPRGNFLVVLEEWGGDPSGISLVKR 306
>gi|3388167|gb|AAC28739.1| beta-galactosidase [Carica papaya]
Length = 203
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 120/204 (58%), Positives = 148/204 (72%), Gaps = 1/204 (0%)
Query: 52 PRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMY 111
PRS P MWP L+Q AKEGG++ I++YVFWNGHE SPG YYF R++ VKFIK++ QA +Y
Sbjct: 1 PRSTPEMWPDLIQNAKEGGLDVIQTYVFWNGHEPSPGNYYFEDRYDPVKFIKLVHQAGLY 60
Query: 112 MILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFAS 171
+ LRIGP++ E+N+GG PVWL Y+PG FR D PFK MQKF IV+MMK EKLF
Sbjct: 61 VHLRIGPYICGEWNFGGFPVWLKYVPGIQFRTDNGPFKAQMQKFTEKIVNMMKAEKLFEP 120
Query: 172 QGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINT 231
QGGP I++Q+E EYG G GK Y WAA+MAV GVPWIMC+Q D PDP+I+T
Sbjct: 121 QGGP-IMSQIEIEYGPIGWEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQEDAPDPIIDT 179
Query: 232 CNSFYCDQFTPHSPSMPKIWTENW 255
CN FYC+ F P++ PK+WTE W
Sbjct: 180 CNGFYCENFMPNANYKPKMWTEAW 203
>gi|414879451|tpg|DAA56582.1| TPA: hypothetical protein ZEAMMB73_811947 [Zea mays]
Length = 249
Score = 249 bits (636), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 111/205 (54%), Positives = 149/205 (72%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
G VTYD R+LI++G R ++ S +HYPRS P MWP L+ +AK+GG++ I++YVFWN HE
Sbjct: 36 GEVTYDGRALILDGARRMLFSGDMHYPRSTPEMWPDLIAKAKKGGLDVIQTYVFWNAHEP 95
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
G++ F GR++LVKFI+ I +Y+ LRIGPFV +E+ YGG+P WL IP FR+D
Sbjct: 96 VQGQFNFEGRYDLVKFIREIHAQGLYVSLRIGPFVESEWKYGGLPFWLRGIPNITFRSDN 155
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
EPFK HMQKF+T IV++MK E+LF QGGPII++Q+ENEY E+ + G Y WAA
Sbjct: 156 EPFKRHMQKFVTKIVNLMKDERLFYPQGGPIIISQIENEYKLVEAAFHSKGSSYVHWAAA 215
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVIN 230
MAV GVPW+MC+Q D PDP+++
Sbjct: 216 MAVNLQTGVPWMMCKQDDAPDPIVS 240
>gi|223945899|gb|ACN27033.1| unknown [Zea mays]
Length = 296
Score = 249 bits (635), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 127/297 (42%), Positives = 166/297 (55%), Gaps = 9/297 (3%)
Query: 445 GLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
G WQ + E F K G V+ ++ T D +DYLWYTT + +N NE+FLK+G P
Sbjct: 6 GFSWQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQ 65
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
L I S GH+L F N + G+ G P Y + + G N+I++LS VGL N G
Sbjct: 66 LTIYSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGT 125
Query: 565 FYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEP 623
YE G+ V ++G N G DLS WTY+IGL GE LG+ + +++ W S
Sbjct: 126 HYETWNVGVLGPVTLSGLNEGKRDLSDQKWTYQIGLHGESLGVQSVAGSSSVEWGS---A 182
Query: 624 PKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQ 683
QPLTW+KA P GD P+ LDM MGKG AW+NG IGRYW K+ S
Sbjct: 183 AGKQPLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYWSYKASSSG-----CG 237
Query: 684 ECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
C Y G ++ KC TGCG+ SQR+YH+PRSW PS N+LV+ EE GGD + + R
Sbjct: 238 GCSYAGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVMLEEFGGDLSGVKLVTR 294
>gi|183604891|gb|ACC64532.1| beta-galactosidase 6 inactive isoform [Oryza sativa Indica Group]
Length = 244
Score = 244 bits (623), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 112/202 (55%), Positives = 142/202 (70%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+TYD R+L+++G R + S +HY RS P MWP L+ +AK GG++ I++YVFWN HE
Sbjct: 28 EITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVHEPI 87
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+Y F GR++LVKFI+ IQ +Y+ LRIGPFV AE+ YGG P WLH +P FR+D E
Sbjct: 88 QGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRSDNE 147
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
PFK HMQ F+T IV MMK E L+ QGGPII++Q+ENEY E +G G RY WAA M
Sbjct: 148 PFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWAAAM 207
Query: 207 AVAQNIGVPWIMCQQFDTPDPV 228
AV GVPW+MC+Q D PDPV
Sbjct: 208 AVGLQTGVPWMMCKQNDAPDPV 229
>gi|357483853|ref|XP_003612213.1| Beta-galactosidase [Medicago truncatula]
gi|355513548|gb|AES95171.1| Beta-galactosidase [Medicago truncatula]
Length = 418
Score = 243 bits (619), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 182/319 (57%), Gaps = 39/319 (12%)
Query: 47 AAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQ 106
++HYPR P MWP + ++AK+ + F G ++L+KFIK+I
Sbjct: 11 GSVHYPRCPPEMWPDIFKKAKQ---------------------FNFEGNYDLIKFIKMIG 49
Query: 107 QARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKRE 166
I+ + ++ +P+WL IP +FR+D +PF YHM++F +I+ M+ E
Sbjct: 50 ------IMICMQHLELVHSLKELPIWLREIPNIIFRSDNQPFMYHMEQFTKMIIKKMRDE 103
Query: 167 KLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPD 226
K F + Q+ENE+ + Y E G RY W MAV + GVPWIMC+Q +
Sbjct: 104 KFFPRK-------QIENEHTAVQQAYKEHGMRYVQWEGNMAVGLDTGVPWIMCKQVNALG 156
Query: 227 PVINTCNSFYC-DQFT-PHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQK 284
PV+NTCN YC D F+ P+ S I ++ ++ FG R +EDIA +VARFF K
Sbjct: 157 PVMNTCNGRYCGDTFSGPNKNSHLNIHLRHYR--YRAFGDPPSERTAEDIAIAVARFFSK 214
Query: 285 GGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCE 344
G++ NYYMY+GGTNFGRT+ F+TT Y EAPI EYGLPR PKWGH ++LH A+KLC+
Sbjct: 215 KGTMANYYMYYGGTNFGRTSSS-FVTTQYYDEAPIVEYGLPREPKWGHFRDLHDALKLCQ 273
Query: 345 HALLNGERSNLSLGSSQEA 363
ALL G + LG E
Sbjct: 274 KALLWGTQPVQMLGKDLEV 292
>gi|297734971|emb|CBI17333.3| unnamed protein product [Vitis vinifera]
Length = 447
Score = 239 bits (609), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 159/400 (39%), Positives = 210/400 (52%), Gaps = 45/400 (11%)
Query: 218 MCQQFDTPDPVINTCNSFYC-DQFT-PHSPSMPKIWTENWPGWFKTFGGRDPH-RPSEDI 274
MC+Q D PDPVINTC C D FT P+ P+ + TE + +T PH + + I
Sbjct: 1 MCKQKDAPDPVINTCKGRNCGDTFTGPNRPNKRSVSTE----YLET-----PHLKGQQKI 51
Query: 275 AFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLK 334
S+ F K G++ NYYMY+ TNFGRT F TT Y EAP+DEYGLPR KWGHL+
Sbjct: 52 LHSL--FISKNGTLANYYMYYSVTNFGRTTSS-FATTCYYDEAPLDEYGLPRETKWGHLR 108
Query: 335 ELHGAIKLCEHALLNGERSNLSLGSSQEADVYAD-SSGACAAFLANMDDKNDKTVVFRNV 393
+LH A++L + ALL G S LG EA +Y S CA FL N + T R
Sbjct: 109 DLHAALRLSKKALLWGVTSAQKLGEDLEARIYEKPGSNICATFLLNNITRTPTTTTLRGS 168
Query: 394 SYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKE 453
Y+LP S+S LPDCK VVFNT V +S + P ++ S P+ + L
Sbjct: 169 KYYLPQHSISNLPDCKTVVFNTQTV---ASNYLIFPFSMFDSLNEPNMKTDALP------ 219
Query: 454 IAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHA 513
+ E V+ + TKDTTDYLWYTT K V + + GH
Sbjct: 220 ---TYEECPTKTKSPVELMTMTKDTTDYLWYTT-----------KKDVLRVPQVSNLGHV 265
Query: 514 LHAFANQE------LQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYE 567
+HAF N E L G+ G+ F + PI+LKAG N+IA L TVGL ++G + E
Sbjct: 266 MHAFLNGEYVMEFYLTGTRHGSNVEKSFVFNKPITLKAGLNQIAPLGATVGLPDSGSYME 325
Query: 568 WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIY 607
AG+ +V I G N+ T+DL W +K+GL G+ L ++
Sbjct: 326 HRLAGVHNVAIQGLNTRTIDLPKNGWGHKVGLNGDKLHLF 365
Score = 45.8 bits (107), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 28/45 (62%)
Query: 691 FNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
N DK PSQ YH+PR++ K S+N+LV+FEE G +P I
Sbjct: 357 LNGDKLHLFTQPPSQSVYHVPRAFLKTSDNLLVLFEETGRNPDGI 401
>gi|300122832|emb|CBK23839.2| unnamed protein product [Blastocystis hominis]
Length = 601
Score = 236 bits (603), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 193/662 (29%), Positives = 303/662 (45%), Gaps = 107/662 (16%)
Query: 114 LRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQG 173
+RIGP+V AE++ GGIPVW++Y+ G R + + +K M +M ++ D + FA +G
Sbjct: 1 MRIGPYVCAEWDNGGIPVWVNYLDGVRLRANNDVWKKEMGDWMKVLTDYTR--DFFADRG 58
Query: 174 GPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCN 233
GPII +Q+ENE +G G + Y W + A + + VPW+MC DT + IN CN
Sbjct: 59 GPIIFSQIENE------LWG-GAREYIDWCGEFAESLELNVPWMMCNG-DTSEKTINACN 110
Query: 234 SFYCDQF-TPHSPS------MPKIWTENWPGWFKTFGG----RDPH-----RPSEDIAFS 277
C + H S P WTEN GWF+ G RD + R +ED F+
Sbjct: 111 GNDCSSYLESHGQSGRILVDQPGCWTEN-EGWFQIHGAASAERDDYEGWDARSAEDYTFN 169
Query: 278 VARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
V +F +GGS HNYYM+ GG ++G+ AG +T Y I LP PK H ++H
Sbjct: 170 VLKFMDRGGSYHNYYMWFGGNHYGKWAGNG-MTNWYTNGVMIHSDTLPNEPKHSHTAKMH 228
Query: 338 GAIKLCEHALLN--GERSNLSLGSSQEADVYADSSG-ACAAFLANMDDKNDKTVVFRNVS 394
+ LLN + +N + + + G +F+ N DK V++R++
Sbjct: 229 RMLANIAEVLLNDKAQVNNQKHLNCDNCNAFEYRYGDRLVSFVENSKGSADK-VIYRDIV 287
Query: 395 YHLPAWSVSILPDCKKVVFNTANVR-AQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKE 453
Y LPAWS+ +L + V+F T NV+ V E L+ ++ + E
Sbjct: 288 YELPAWSMIVLDEYDNVLFETNNVKPVNKHRVYHCEEKLE--------------FEYWNE 333
Query: 454 -IAGIWGEAD--FVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESK 510
++ + EA V + +N T+D T++L+Y T + ++E L G +
Sbjct: 334 PVSTLSQEAPRVVVSPKANEQLNMTRDLTEFLYYETEVEFPQDECTLSIGG-------TD 386
Query: 511 GHALHAFANQELQGSASGNGTHPPFKYKNPISLKA--GKNEIALLSMTVGLQNAGPFY-- 566
+A A+ + GS + H + N I++K+ GK+++ LLS ++G+ N
Sbjct: 387 ANAFVAYVDDHFVGSDDEHTHHDGWHTMN-INMKSGKGKHKLVLLSESLGVSNGMDSNLD 445
Query: 567 -EWVGAGITS----VKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTM 621
W + + +K+ G D+ W + GL GE ++ + W S +
Sbjct: 446 PSWASSRLKGICGWIKLCGN-----DIFNQEWKHYPGLVGEAKQVFTDEGMKTVTWKSDV 500
Query: 622 EPPKNQPLTWYKAVVKQPPGDEPIGLDML----KMGKGLAWLNGEEIGRYWPRKSRKSSP 677
E N L WY++ K P G + G+++L M +G A+ NG IGRYW K
Sbjct: 501 ENADN--LAWYRSTFKTPQGLKR-GIEVLLRPEGMNRGQAYANGHNIGRYWMIKD----- 552
Query: 678 HDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFK--PSENILVIFEEKGGDPTKI 735
G GE +Q +YHIP+ W K EN+LV+ E G +
Sbjct: 553 ---------------------GNGEYTQGFYHIPKDWLKGEGEENVLVLGETLGASDPSV 591
Query: 736 TF 737
T
Sbjct: 592 TI 593
>gi|294948459|ref|XP_002785761.1| beta-galactosidase, putative [Perkinsus marinus ATCC 50983]
gi|239899809|gb|EER17557.1| beta-galactosidase, putative [Perkinsus marinus ATCC 50983]
Length = 770
Score = 235 bits (599), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 211/778 (27%), Positives = 335/778 (43%), Gaps = 127/778 (16%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+VTYDSR+ I+G R L++ +IHYPR W ++++ G+N ++ YVFWN HE
Sbjct: 50 SVTYDSRAFKIDGVRTLLLGGSIHYPRVAVDEWEPMLEEMGRDGLNHVQLYVFWNYHEPR 109
Query: 87 P-----------GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHY 135
P KY F GR +L+ FI+ + +++ LRIGP+V AE+ +GG+P+WL
Sbjct: 110 PPRYDQLKDRLEHKYDFSGRGDLLGFIRAAAKKDLFVSLRIGPYVCAEWAFGGLPLWLRD 169
Query: 136 IPGTVFRN--------------------DTEPFKYHMQKFMTLIVDMMKREKLFASQGGP 175
+ G FR+ +P++ +M F+ I M+K L A+QGGP
Sbjct: 170 VEGMCFRSICGYNGSPGKCKPWEGGKFRSCDPWRKYMADFVMEIGRMVKEANLMAAQGGP 229
Query: 176 IILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSF 235
+IL Q+ENEYG++ + G+ Y W +++ + VPW+MC + + +N CN
Sbjct: 230 VILGQLENEYGHHS----DAGRAYIDWVGELSFGLGLDVPWVMCNGI-SANGTLNVCNGD 284
Query: 236 YC-DQF-TPHS---PSMPKIWTENWPGWFKTFGGR--DPHRPSEDIAFSVARFFQKGGSV 288
C D++ T H P P WTEN GWF T+GG + R +E++A+ +A++ GGS
Sbjct: 285 DCADEYKTDHDKRWPDEPLGWTEN-EGWFDTWGGAVGNSKRSAEEMAYVLAKWVAVGGSH 343
Query: 289 HNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALL 348
HNYYM++GG + + G +T +Y GLP PK HL+ LH + L+
Sbjct: 344 HNYYMWYGGNHLAQW-GAASLTNAYADGVNFHSNGLPNEPKRSHLQRLHEVLGKLNGELM 402
Query: 349 NGERSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVV-FRNVSYHLPAWSVSIL-P 406
E + + E V A AFL V + +Y + V ++ P
Sbjct: 403 QVEDRHSVMPVQLENGVEVYEWTAGLAFLHRPACSGSPVEVHYAKATYSIACREVLVVDP 462
Query: 407 DCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKS 466
V+F TA+V V V L +W + KE + G A
Sbjct: 463 SSSTVLFATASVEPPPELVRRVVATLTAD-----------RWSMRKEEL-LHGMATVEGR 510
Query: 467 GFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESK-GHALHAFANQELQGS 525
V+H+ + TDY+ Y T++ E + N S L I+S+ H + +
Sbjct: 511 EPVEHLRVSGLDTDYVTYKTTVTATEG---VTNVS---LEIDSRISQVFHVSVDNASSLA 564
Query: 526 AS----GNGTHPPFKYKNPISLKAGKN-EIALLSMTVGLQNAGPFYEWVGAGITSVKITG 580
A+ G +L AG+ ++ +LS ++G++N G Y A S++
Sbjct: 565 ATVMDVNKGNTEWTAVAQLHNLTAGRTYDLWILSESLGVEN-GMLYGAPAATEPSLQKGI 623
Query: 581 FNSGTLD---LSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKA--- 634
F L+ + W+ GL GE G + + ++ P W+ A
Sbjct: 624 FGDIRLNEKSIRKGRWSMVKGLDGEVDGGQG---KAELPCCDSLGP------AWFVAGFT 674
Query: 635 --VVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFN 692
V+ + L + + G WLNG +IGR+ R++S
Sbjct: 675 LHSVRSKSISLTLPLGLPQQAGGHIWLNGVDIGRWRAVGGRQAS---------------- 718
Query: 693 PDKCITGCGEPSQRWYHIPRSWFKPSENILVIF-------EEKGGDPTKITFSIRKIS 743
Y +P K N L +F E+GG PT + +K S
Sbjct: 719 ---------------YRLPSDVLKRGSNRLAVFSATGHWVSEQGGPPTVVEEFYKKRS 761
>gi|343963202|gb|AEM72517.1| beta-galactosidase [Diospyros kaki]
Length = 172
Score = 234 bits (596), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 107/172 (62%), Positives = 122/172 (70%)
Query: 127 GGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG 186
GG PVWL Y+PG FR D EPFK MQ F IV++MK E LF SQGGPIIL+Q+ENEYG
Sbjct: 1 GGFPVWLKYVPGISFRTDNEPFKNAMQGFTEKIVNLMKSENLFESQGGPIILSQIENEYG 60
Query: 187 YYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPS 246
G+ G +Y WAA MAV GVPW+MC++ D PDPVINTCN FYCD F+P+ P
Sbjct: 61 PQGKILGDAGHKYVTWAANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDSFSPNRPY 120
Query: 247 MPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGT 298
P IWTE W GWF FGG RP +D+AF+VARF QKGGS NYYMYHGGT
Sbjct: 121 KPTIWTEAWSGWFTEFGGPIHERPVQDLAFAVARFIQKGGSFFNYYMYHGGT 172
>gi|217075721|gb|ACJ86220.1| unknown [Medicago truncatula]
Length = 208
Score = 233 bits (595), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 109/205 (53%), Positives = 147/205 (71%), Gaps = 1/205 (0%)
Query: 5 TPIAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQ 64
T IA F LL F + F NVTYD ++L+I+G+R +++S +IHYPRS P MWP L+Q
Sbjct: 4 TQIA-FVLLWFLGVYVPASFCSNVTYDHKALVIDGKRRVLMSGSIHYPRSTPQMWPDLIQ 62
Query: 65 QAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEY 124
++K+GG++ IE+YVFWN HE G+Y F GR +LV F+K++ A +Y+ LRIGP+V AE+
Sbjct: 63 KSKDGGIDVIETYVFWNLHEPVRGQYNFEGRGDLVGFVKVVAAAGLYVHLRIGPYVCAEW 122
Query: 125 NYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENE 184
NYGG P+WLH+I G FR + EPFK M++F IVDMMK+E L+ASQGGPIIL+Q+ENE
Sbjct: 123 NYGGFPLWLHFIAGIKFRTNNEPFKAEMKRFTAKIVDMMKQENLYASQGGPIILSQIENE 182
Query: 185 YGYYESFYGEGGKRYALWAAKMAVA 209
YG ++ K Y WAA MA +
Sbjct: 183 YGNIDTHDARAAKSYIDWAASMATS 207
>gi|356554933|ref|XP_003545795.1| PREDICTED: beta-galactosidase 15-like [Glycine max]
Length = 288
Score = 233 bits (594), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 108/188 (57%), Positives = 136/188 (72%), Gaps = 1/188 (0%)
Query: 176 IILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSF 235
++L V G E+ YG+GGK Y WAAK A++ +GVPW+MC+Q D P +I+TCN++
Sbjct: 32 LVLGTVSLGVGAIENEYGKGGKEYRKWAAKKALSLGVGVPWVMCRQQDAPYDIIDTCNAY 91
Query: 236 YCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYH 295
YCD F P+S + P +WTENW GW+ +G R PHRP ED+AF+VA FFQ+GGS NYYMY
Sbjct: 92 YCDGFKPNSHNKPTMWTENWDGWYTQWGERLPHRPVEDLAFAVACFFQRGGSFQNYYMYF 151
Query: 296 GGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGER-SN 354
G TNFGRTAGGP TSYDY A IDEYG R PKWGHLK+LH A+KLCE AL+ + +
Sbjct: 152 GRTNFGRTAGGPLQITSYDYVASIDEYGQLREPKWGHLKDLHAALKLCEPALVATDSPTY 211
Query: 355 LSLGSSQE 362
+ LG +QE
Sbjct: 212 IKLGPNQE 219
>gi|10047451|gb|AAG12249.1|AF184080_1 beta-galactosidase [Prunus armeniaca]
Length = 376
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 113/261 (43%), Positives = 159/261 (60%), Gaps = 10/261 (3%)
Query: 486 TSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKA 545
T++ ++ +E L G +P L ++S GHALH F N + GSA G F + P+ L+A
Sbjct: 1 TNVDISSSE--LHGGKKPTLTVQSAGHALHVFVNGQFSGSAFGTREQRQFTFAKPVHLRA 58
Query: 546 GKNEIALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHL 604
G N+IALLS+ VGL N G YE W + V + G G DL+ W K+GL+GE +
Sbjct: 59 GINKIALLSIAVGLPNVGLHYESWKTGILGPVFLDGLGQGRKDLTMQKWFNKVGLKGEAM 118
Query: 605 GIYNPGYRNNINWV-STMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEE 663
+ +P ++++W+ ++ Q L WYKA P GDEP+ LDM MGKG W+NG+
Sbjct: 119 DLVSPNGGSSVDWIRGSLATQTKQTLKWYKAYFNAPGGDEPLALDMRSMGKGQVWINGQS 178
Query: 664 IGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILV 723
IGRYW + + +C C Y G F P KC GCG+P+QRWYH+PRSW KP++N++V
Sbjct: 179 IGRYW-----MAYANGDC-SLCSYIGTFRPTKCQLGCGQPTQRWYHVPRSWLKPTKNLMV 232
Query: 724 IFEEKGGDPTKITFSIRKISG 744
+FEE GGDP+KIT R ++G
Sbjct: 233 MFEELGGDPSKITLVKRSVAG 253
>gi|212723424|ref|NP_001132807.1| uncharacterized protein LOC100194296 [Zea mays]
gi|194695440|gb|ACF81804.1| unknown [Zea mays]
Length = 467
Score = 226 bits (576), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 127/364 (34%), Positives = 193/364 (53%), Gaps = 38/364 (10%)
Query: 373 CAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENL 432
C AFL+N + K+D T+ FR Y +P S+S+L DC+ VVF T +V AQ +
Sbjct: 7 CVAFLSNHNTKDDATMTFRGRPYFVPRHSISVLADCETVVFGTQHVNAQHN--------- 57
Query: 433 QPSEASPDNGSKGLKWQVFK-EIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVN 491
Q + D ++ W++F E + +A D N TKD TDY+WYT+S +
Sbjct: 58 QRTFHFADQTAQNNVWEMFDGENVPKYKQAKIRLRKAGDLYNLTKDKTDYVWYTSSFKLE 117
Query: 492 ENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIA 551
++ +++ + VL + S GHA AF N + G G + F + P+ LK G N +A
Sbjct: 118 ADDMPIRSDIKTVLEVNSHGHASVAFVNNKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVA 177
Query: 552 LLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGY 611
+L+ ++G+ ++G + E AG+ V+ITG N+GTLDL+ W + +GL GE IY
Sbjct: 178 VLASSMGMTDSGAYMEHRLAGVDRVQITGLNAGTLDLTNNGWGHIVGLVGERKQIYTDKG 237
Query: 612 RNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRK 671
++ W M ++PLTWYK P G++P+ LDM MGKG+ ++NG+ IGRYW
Sbjct: 238 MGSVTWKPAMN---DRPLTWYKRHFDMPSGEDPVVLDMSTMGKGMMFVNGQGIGRYW--- 291
Query: 672 SRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGD 731
Y+ G PSQ+ YH+PRS+ + +N+LV+FEE+ G
Sbjct: 292 -------------ISYKHAL---------GRPSQQLYHVPRSFLRQKDNMLVLFEEEFGR 329
Query: 732 PTKI 735
P I
Sbjct: 330 PDAI 333
>gi|351722837|ref|NP_001235722.1| lectin [Glycine max]
gi|217314871|gb|ACK36970.1| lectin [Glycine max]
Length = 447
Score = 224 bits (570), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 123/296 (41%), Positives = 163/296 (55%), Gaps = 20/296 (6%)
Query: 448 WQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFL--KNGSRPVL 505
W KE IW ++ F G +H+N TKD +DYLWY+T + V++++ +N P L
Sbjct: 35 WMTTKEPLNIWSKSSFTVEGIWEHLNVTKDQSDYLWYSTRVYVSDSDILFWEENDVHPKL 94
Query: 506 LIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPF 565
I+ L F N +L ++K IS+ GKN+ S + N G F
Sbjct: 95 TIDGVRDILRVFINGQL--------IVKDEQFKAVISVSIGKNDCTAGS----INNYGAF 142
Query: 566 YEWVGAGIT-SVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPP 624
E GAGI +KITGF +G +DLS WTY++GLQGE L Y+ N+ WV
Sbjct: 143 LEKDGAGIRGKIKITGFENGDIDLSKSLWTYQVGLQGEFLKFYSEENENS-EWVELTPDA 201
Query: 625 KNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQE 684
TWYK P G +P+ LD MGKG AW+NG+ IGRYW R S KS C Q
Sbjct: 202 IPSTFTWYKTYFDVPGGIDPVALDFKSMGKGQAWVNGQHIGRYWTRVSPKSG----CQQV 257
Query: 685 CDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIR 740
CDYRG +N DKC T CG+P+Q YH+PRSW K + N+LVI EE GG+P +I+ +
Sbjct: 258 CDYRGAYNSDKCSTNCGKPTQTLYHVPRSWLKATNNLLVILEETGGNPFEISVKLH 313
>gi|62319263|dbj|BAD94489.1| beta-galactosidase [Arabidopsis thaliana]
Length = 172
Score = 222 bits (565), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 101/162 (62%), Positives = 125/162 (77%), Gaps = 1/162 (0%)
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGR 265
MA+ + GVPWIMC+Q D P P+I+TCN +YC+ F P+S + PK+WTENW GW+ FGG
Sbjct: 1 MALGLSTGVPWIMCKQEDAPGPIIDTCNGYYCEDFKPNSINKPKMWTENWTGWYTDFGGA 60
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLP 325
P+RP EDIA+SVARF QKGGS+ NYYMYHGGTNF RTA G F+ +SYDY+AP+DEYGLP
Sbjct: 61 VPYRPVEDIAYSVARFIQKGGSLVNYYMYHGGTNFDRTA-GEFMASSYDYDAPLDEYGLP 119
Query: 326 RNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYA 367
R PK+ HLK LH AIKL E ALL+ + + SLG+ QE + A
Sbjct: 120 REPKYSHLKALHKAIKLSEPALLSADATVTSLGAKQEVTIKA 161
>gi|183604893|gb|ACC64533.1| beta-galactosidase 11 [Oryza sativa Indica Group]
Length = 446
Score = 221 bits (564), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 129/352 (36%), Positives = 182/352 (51%), Gaps = 39/352 (11%)
Query: 385 DKTVVFRNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSK 444
D TVVFR +++P+ SVSIL DCK VV+NT V Q S + S + D SK
Sbjct: 2 DGTVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHS---------ERSFHTTDETSK 52
Query: 445 GLKWQVFKEIAGIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPV 504
W+++ E + + ++ N TKDT+DYLWYTTS + ++ + RPV
Sbjct: 53 NNVWEMYSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDIRPV 112
Query: 505 LLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGP 564
+ I+S HA+ FAN G+ G+ F ++ P+ L+ G N IA+LS ++G++++G
Sbjct: 113 IQIKSTAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKDSGG 172
Query: 565 FYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPP 624
V GI + G N+GTLDL W +K L+GE IY W +P
Sbjct: 173 ELVEVKGGIQDCVVQGLNTGTLDLQGNGWGHKARLEGEDKEIYTEKGMAQFQW----KPA 228
Query: 625 KNQ-PLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQ 683
+N P+TWYK +P GD+PI +DM M KG+ ++NGE IGRYW
Sbjct: 229 ENDLPITWYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWT-------------- 274
Query: 684 ECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
IT G PSQ YHIPR++ KP N+L+IFEE+ G P I
Sbjct: 275 -----------SFITLAGHPSQSVYHIPRAFLKPKGNLLIIFEEELGKPGGI 315
>gi|449534351|ref|XP_004174126.1| PREDICTED: beta-galactosidase-like, partial [Cucumis sativus]
Length = 154
Score = 218 bits (555), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 98/154 (63%), Positives = 127/154 (82%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+VTYD +++IING+R ++IS +IHYPRS P MWP L+Q+AK+GG++ IE+YVFWNGHE S
Sbjct: 1 SVTYDHKAIIINGQRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDIIETYVFWNGHEPS 60
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
P KYYF R++LV+FIK++QQA +Y+ LRIGP+V AE+NYGG P+WL ++PG FR D
Sbjct: 61 PDKYYFEERYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYGGFPLWLKFVPGIAFRTDNA 120
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQ 180
PFK MQKF+ IVDMMK EKLF +QGGPIIL+Q
Sbjct: 121 PFKAAMQKFVYKIVDMMKWEKLFHTQGGPIILSQ 154
>gi|343963204|gb|AEM72518.1| beta-galactosidase [Diospyros kaki]
Length = 173
Score = 218 bits (554), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 103/170 (60%), Positives = 117/170 (68%)
Query: 128 GIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGY 187
G Y+PG FR D PFK MQKF IV+MMK EKLF QGGPII++Q+ENEYG
Sbjct: 3 GFSCLAQYVPGIAFRTDNGPFKAAMQKFTEKIVNMMKSEKLFEPQGGPIIMSQIENEYGP 62
Query: 188 YESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSM 247
E G GK Y WAA+MAV N GVPWIMC+Q D PDPVI+TCN FYC+ F P+
Sbjct: 63 VEWEIGAPGKSYTKWAAQMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKNYK 122
Query: 248 PKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGG 297
PK+WTENW GW+ FGG P+RP ED+AFSVARF Q GS NYYMYHG
Sbjct: 123 PKMWTENWTGWYTKFGGPAPYRPVEDLAFSVARFIQNNGSFVNYYMYHGA 172
>gi|116782829|gb|ABK22678.1| unknown [Picea sitchensis]
Length = 317
Score = 213 bits (543), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 107/206 (51%), Positives = 139/206 (67%), Gaps = 6/206 (2%)
Query: 537 YKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYK 596
++ PISL G N+IALLS+ VGL N+G +E AGI++V + GF GT DLS WTY+
Sbjct: 2 FELPISLIPGTNDIALLSVMVGLPNSGGHFERKIAGISTVTLRGFKDGTRDLSQELWTYQ 61
Query: 597 IGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGL 656
IGL GE IY+ ++NW S+ P N PLTWYKAV+ P GDEP+ LD+ MGKG
Sbjct: 62 IGLLGEMSTIYSDVGFISVNWTSSSTP--NPPLTWYKAVIDVPDGDEPVILDLSSMGKGQ 119
Query: 657 AWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFK 716
AW+NGE IGRYW +P +C +CDYRG ++ KC T CG+PSQ YH+PRSW +
Sbjct: 120 AWINGEHIGRYW---ISFLAPLGDC-SKCDYRGNYSLHKCATNCGQPSQTLYHVPRSWLR 175
Query: 717 PSENILVIFEEKGGDPTKITFSIRKI 742
P+ N+LV+FEE GGDP+K++ R I
Sbjct: 176 PTGNLLVLFEETGGDPSKVSLLTRSI 201
>gi|224152391|ref|XP_002337230.1| predicted protein [Populus trichocarpa]
gi|222838524|gb|EEE76889.1| predicted protein [Populus trichocarpa]
Length = 144
Score = 211 bits (537), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 100/139 (71%), Positives = 117/139 (84%), Gaps = 1/139 (0%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
L+ FFS T CFAGNV+YDSRSLIING R+L+ISAAIHYPRSVP MWP LV+ AKEGGV
Sbjct: 5 LIFFFSLCFTLCFAGNVSYDSRSLIINGERKLLISAAIHYPRSVPAMWPELVKTAKEGGV 64
Query: 72 NTIESYVFWNGHE-LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIP 130
+ IE+YVFWN H+ SP +Y+F GRF+LVKFI I+Q+A MY+ILRIGPFVAAE+N+GGIP
Sbjct: 65 DVIETYVFWNVHQPTSPSEYHFDGRFDLVKFINIVQEAGMYLILRIGPFVAAEWNFGGIP 124
Query: 131 VWLHYIPGTVFRNDTEPFK 149
VWLHY+ GTVFR D FK
Sbjct: 125 VWLHYVNGTVFRTDNYNFK 143
>gi|302144233|emb|CBI23471.3| unnamed protein product [Vitis vinifera]
Length = 315
Score = 202 bits (513), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 87/158 (55%), Positives = 121/158 (76%)
Query: 23 CFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNG 82
C+ VTYD R+L+I+G+R ++ S +IHYPRS+P +WP +++++KEGG++ IE+YVFWN
Sbjct: 155 CYCKTVTYDHRALVIDGKRRVLQSGSIHYPRSMPEVWPEIIRKSKEGGLDVIETYVFWNN 214
Query: 83 HELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFR 142
HE G+YYF GRF+LV+F+K +Q+A + + LRIGP+ AE+NYGG PVWLH+IPG FR
Sbjct: 215 HEPVRGEYYFEGRFDLVRFVKTVQEAGLLVHLRIGPYACAEWNYGGFPVWLHFIPGIQFR 274
Query: 143 NDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQ 180
+ FK M++F+ IV +MK LFA QGGPIILAQ
Sbjct: 275 TTNDLFKNEMKRFLAKIVSLMKEANLFAPQGGPIILAQ 312
>gi|296081427|emb|CBI16778.3| unnamed protein product [Vitis vinifera]
Length = 242
Score = 201 bits (511), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 97/127 (76%), Positives = 102/127 (80%), Gaps = 4/127 (3%)
Query: 229 INTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSV 288
INTCNSFYCDQFTP+SP+ PK+WTENWPGW KTFG DPH P EDI FSVARFF K
Sbjct: 120 INTCNSFYCDQFTPNSPNKPKMWTENWPGWSKTFGALDPHGPREDIVFSVARFFWKV--- 176
Query: 289 HNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALL 348
NYYM HGGTNFGRT+GGPFITT+YDY APIDEYGL R PK GHLKEL AIK CEH LL
Sbjct: 177 -NYYMDHGGTNFGRTSGGPFITTTYDYNAPIDEYGLARLPKCGHLKELRRAIKSCEHVLL 235
Query: 349 NGERSNL 355
GE NL
Sbjct: 236 YGEPINL 242
>gi|359496728|ref|XP_002268994.2| PREDICTED: beta-galactosidase 6-like, partial [Vitis vinifera]
Length = 177
Score = 200 bits (509), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 89/172 (51%), Positives = 126/172 (73%), Gaps = 1/172 (0%)
Query: 10 FALLIFFSSSITY-CFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKE 68
LL+ + + C+ VTYD R+L+I+G+R ++ S +IHYPRS+P +WP +++++KE
Sbjct: 6 LVLLVLIAVCVFEGCYCKTVTYDHRALVIDGKRRVLQSGSIHYPRSMPEVWPEIIRKSKE 65
Query: 69 GGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGG 128
GG++ IE+YVFWN HE G+YYF GRF+LV+F+K +Q+A + + LRIGP+ AE+NYGG
Sbjct: 66 GGLDVIETYVFWNNHEPVRGEYYFEGRFDLVRFVKTVQEAGLLVHLRIGPYACAEWNYGG 125
Query: 129 IPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQ 180
PVWLH+IPG FR + FK M++F+ IV +MK LFA QGGPIILAQ
Sbjct: 126 FPVWLHFIPGIQFRTTNDLFKNEMKRFLAKIVSLMKEANLFAPQGGPIILAQ 177
>gi|62321383|dbj|BAD94714.1| beta-galactosidase [Arabidopsis thaliana]
Length = 199
Score = 193 bits (490), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 94/201 (46%), Positives = 124/201 (61%), Gaps = 7/201 (3%)
Query: 541 ISLKAGKNEIALLSMTVGLQNAGPFYE-WVGAGITSVKITGFNSGTLDLSTYSWTYKIGL 599
I L AG N+IALLS+ VGL N G +E W + V + G NSGT D+S + W+YKIG+
Sbjct: 4 IKLHAGVNKIALLSVAVGLPNVGTHFEQWNKGALGPVTLKGVNSGTWDMSKWKWSYKIGV 63
Query: 600 QGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWL 659
+GE L ++ + + W K QPLTWYK+ P G+EP+ LDM MGKG W+
Sbjct: 64 KGEALSLHTNTESSGVRWTQGSFVAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQVWI 123
Query: 660 NGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSE 719
NG IGR+WP + S C+Y G F+ KC++ CGE SQRWYH+PRSW K S+
Sbjct: 124 NGRNIGRHWPAYKAQGS-----CGRCNYAGTFDAKKCLSNCGEASQRWYHVPRSWLK-SQ 177
Query: 720 NILVIFEEKGGDPTKITFSIR 740
N++V+FEE GGDP I+ R
Sbjct: 178 NLIVVFEELGGDPNGISLVKR 198
>gi|218188529|gb|EEC70956.1| hypothetical protein OsI_02569 [Oryza sativa Indica Group]
Length = 480
Score = 189 bits (479), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 88/143 (61%), Positives = 103/143 (72%)
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
MQKF T IV+MMK E LF QGGPIIL+Q+ENE+G E GE K YA WAA MAVA N
Sbjct: 1 MQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAVALN 60
Query: 212 IGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPS 271
VPWIMC++ D PDP+INTCN FYCD F+P+ P P +WTE W W+ FG PHRP
Sbjct: 61 TSVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTAWYTGFGIPVPHRPV 120
Query: 272 EDIAFSVARFFQKGGSVHNYYMY 294
ED+A+ VA+F QKGGS NYYM+
Sbjct: 121 EDLAYGVAKFIQKGGSFVNYYMF 143
Score = 177 bits (450), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 95/221 (42%), Positives = 127/221 (57%), Gaps = 12/221 (5%)
Query: 523 QGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGF 581
+G+ G+ P Y + L AG N I+ LS+ VGL N G +E AGI V + G
Sbjct: 164 EGTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGL 223
Query: 582 NSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPG 641
N G DL+ WTY++GL+GE +++ + + W ++ N A P G
Sbjct: 224 NEGRRDLTWQKWTYQVGLKGESTTLHSLSGSSTVEWGEPVQNASNM------AFFNAPDG 277
Query: 642 DEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCG 701
DEP+ LDM MGKG W+NG+ IGRYWP K+S + CDYRG+++ KC T CG
Sbjct: 278 DEPLALDMSSMGKGQIWINGQGIGRYWP--GYKASGN---CGTCDYRGEYDETKCQTNCG 332
Query: 702 EPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
+ SQRWYH+PRSW P+ N+LVIFEE GGDPT I+ R I
Sbjct: 333 DSSQRWYHVPRSWLSPTGNLLVIFEEWGGDPTGISMVKRSI 373
>gi|166092020|gb|ABY82047.1| beta-galactosidase [Hymenaea courbaril var. stilbocarpa]
Length = 138
Score = 188 bits (477), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 88/138 (63%), Positives = 100/138 (72%)
Query: 180 QVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQ 239
Q+ENEYG E GK Y WAAKMAV N GVPW+MC+Q D PDPVI+TCN +YC+
Sbjct: 1 QIENEYGPVEWEIRAPGKAYTAWAAKMAVGLNTGVPWVMCKQDDAPDPVIDTCNGYYCEN 60
Query: 240 FTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTN 299
FTP+ PK+WTENW GW+ +GG P RP EDIA+SV RF Q GGS NYYMYHGGTN
Sbjct: 61 FTPNKNYKPKMWTENWSGWYTEYGGAVPKRPVEDIAYSVTRFIQNGGSFVNYYMYHGGTN 120
Query: 300 FGRTAGGPFITTSYDYEA 317
FGRT G FI TSYDY+A
Sbjct: 121 FGRTYSGLFIATSYDYDA 138
>gi|255550369|ref|XP_002516235.1| beta-galactosidase, putative [Ricinus communis]
gi|223544721|gb|EEF46237.1| beta-galactosidase, putative [Ricinus communis]
Length = 451
Score = 184 bits (467), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 142/458 (31%), Positives = 205/458 (44%), Gaps = 100/458 (21%)
Query: 293 MYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGER 352
MYHGGTNF R +GGP I TSYDY+AP+DEYG PKWGHL++LH I LL+ +
Sbjct: 38 MYHGGTNFRRMSGGPMIVTSYDYDAPLDEYGNLNQPKWGHLRDLHVRI------LLHLSQ 91
Query: 353 SNLSLGSSQEADVYA--------DSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSI 404
S LG A VYA +++G FL+N D
Sbjct: 92 SR-GLGF---ATVYALNLTTYINNATGERFCFLSNTKTNED------------------- 128
Query: 405 LPDCKKVVFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFV 464
AN+ Q + VP A I+ + V
Sbjct: 129 -----------ANIDLQQDGIFFVP-------------------------AWIYYYSSRV 152
Query: 465 KSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQG 524
+ G T D TDYL Y T +F + V + S+ + +L
Sbjct: 153 QQGNFQQCKATSDETDYLRYITRYF-----DFF---TVSVKDVHSRCQQCNNTEEHDL-- 202
Query: 525 SASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSG 584
+ GT P ++ L+ + I ++T G QN G F++ GI +G
Sbjct: 203 ACDFFGTSPACSCQSAARLQQVFHSI--YNLTSGKQNYGEFFDEGPEGI---------AG 251
Query: 585 TLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEP 644
DLS+ W YKIGL GE +Y+P + + ++ P + +TWYK P G +P
Sbjct: 252 AADLSSNQWAYKIGLGGEAKRLYDPNSGHRDVFRTSAILPVGRAMTWYKTTFHVPSGTDP 311
Query: 645 IGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPS 704
+ L++ MGKG AW+NG +GR+WP +S + + CDYRGK++ DKC+T CG P+
Sbjct: 312 LVLNLQGMGKGHAWVNGHSLGRFWPMQSADPTGYS---GSCDYRGKYDKDKCLTNCGNPT 368
Query: 705 QRWYHIPRSWFKPSENILVIFE-EKGGDPTKITFSIRK 741
QRW HI + F P+ I+ + + G+P S++K
Sbjct: 369 QRWKHI--ATFMPNGRIISVIQFASFGNPEGTCGSLQK 404
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 17/27 (62%), Positives = 21/27 (77%)
Query: 162 MMKREKLFASQGGPIILAQVENEYGYY 188
M K KLFAS GGPI+ AQ+EN+YG +
Sbjct: 1 MAKEAKLFASSGGPIVFAQIENDYGNF 27
>gi|229084352|ref|ZP_04216632.1| Beta-galactosidase [Bacillus cereus Rock3-44]
gi|228698892|gb|EEL51597.1| Beta-galactosidase [Bacillus cereus Rock3-44]
Length = 867
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 110/346 (31%), Positives = 181/346 (52%), Gaps = 20/346 (5%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+TYD +S I+ +R I+SAAIHY R W ++++AK GG NTIE+Y+ WN HE+
Sbjct: 2 ITYDKKSWKIHNKRIFILSAAIHYFRLPKAEWDDVLEKAKAGGCNTIETYIPWNFHEMKE 61
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G++ F G +L F+++ +Y+I R GP++ AE+++GG P WL +R+
Sbjct: 62 GEWDFSGDKDLAHFLQLCANKGLYVIARPGPYICAEWDFGGFPWWLSTKKDIQYRSAQPS 121
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
F +++ ++ ++ ++ +L ++ G +I+ Q+ENE+ YG+ K+Y +
Sbjct: 122 FLHYVDQYFDQVISIIDEYQL--TKNGSVIMVQIENEF----QAYGKPDKKYMEYLRDGM 175
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCN-----SFYCDQFTPHSPSMPKIWTENWPGWFKTF 262
+A+ I VP++ C + D + N + + PK E W GWF+ +
Sbjct: 176 IARGIEVPFVTC--YGAVDGAVEFRNFWSGANRAAEILDERFADQPKGVMEFWIGWFEHW 233
Query: 263 GGRDPHRPS-EDIAFSVARFFQKGGSVHNYYMYHGGTNF----GRTAGGP-FITTSYDYE 316
GG ++ + E + + + G + NYYMY GGTNF GRT F TT+YDY+
Sbjct: 234 GGNKANQKTPEQLERECYQLLRNGFTTINYYMYFGGTNFDHWGGRTVSEQVFCTTTYDYD 293
Query: 317 APIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQE 362
IDEY P K+ LK H +K E N E++N + S +
Sbjct: 294 VAIDEYLQPTR-KYEVLKRYHLFVKWLEPLFTNAEQANSDVKLSSD 338
>gi|326331074|ref|ZP_08197372.1| beta-galactosidase [Nocardioidaceae bacterium Broad-1]
gi|325951115|gb|EGD43157.1| beta-galactosidase [Nocardioidaceae bacterium Broad-1]
Length = 586
Score = 182 bits (461), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 113/317 (35%), Positives = 163/317 (51%), Gaps = 30/317 (9%)
Query: 29 TYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPG 88
T +++G I+S A+HY R P W +++A+ G+NTIE+YV WN H PG
Sbjct: 5 TIGETDFLLDGEPFRILSGALHYFRVHPDQWADRIEKARLMGLNTIETYVPWNAHSPRPG 64
Query: 89 KYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPF 148
+ G +L +F+++++ A MY I+R GPF+ AE++ GG+P WL PG R F
Sbjct: 65 VFDTDGILDLPRFLRLVKDAGMYAIVRPGPFICAEWDNGGLPPWLFREPGVGIRRHEPRF 124
Query: 149 KYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAV 208
++K++ ++ +++ ++ GGP++L QVENEYG Y + Y A M
Sbjct: 125 LDEVEKYLHQVLALVRPHQV--DLGGPVLLVQVENEYGAYGD-----DRDYLQAVADMIR 177
Query: 209 AQNIGVPWIMCQQ-FDTP------DPVINTCNSFYCDQ------FTPHSPSMPKIWTENW 255
I VP + Q D D V+ T +SF D H P+ P + E W
Sbjct: 178 GAGIDVPLVTVDQPVDAMLAAGGLDGVLRT-SSFGSDSANRLRTLRDHQPTGPLMCMEFW 236
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-------PF 308
GWF +GGR P E A + G SV N YM+HGGTNFG T+G P
Sbjct: 237 DGWFDHWGGRHHTTPVEQAAEELDALLAAGASV-NVYMFHGGTNFGLTSGANDKGIYRPT 295
Query: 309 ITTSYDYEAPIDEYGLP 325
+ TSYDY+AP+DE G P
Sbjct: 296 V-TSYDYDAPLDEAGNP 311
>gi|296086917|emb|CBI33129.3| unnamed protein product [Vitis vinifera]
Length = 186
Score = 178 bits (451), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 81/110 (73%), Positives = 98/110 (89%)
Query: 71 VNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIP 130
+N IE+YVFW GHELSPG YYFGG ++L+KF+KI+QQ M++IL IGPFVAAE+N+ GIP
Sbjct: 69 INVIETYVFWIGHELSPGNYYFGGWYDLLKFVKIVQQDGMWLILHIGPFVAAEWNFDGIP 128
Query: 131 VWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQ 180
VWLHY+ GTVFR ++EPFKYHMQKFMTLIV++MK+EKLFASQGGPI LA
Sbjct: 129 VWLHYVLGTVFRTNSEPFKYHMQKFMTLIVNIMKKEKLFASQGGPINLAH 178
>gi|449532986|ref|XP_004173458.1| PREDICTED: beta-galactosidase-like, partial [Cucumis sativus]
Length = 213
Score = 177 bits (450), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 91/219 (41%), Positives = 128/219 (58%), Gaps = 8/219 (3%)
Query: 525 SASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNS 583
S G+ P + ++LK G N++++LS+TVGL N G ++ AG+ V + G N
Sbjct: 1 SVYGSLEDPRITFSKYVNLKQGVNKLSMLSVTVGLPNVGLHFDTWNAGVLGPVTLKGLNE 60
Query: 584 GTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDE 643
GT D+S Y W+YK+GL+GE L +Y+ N++ W+ + QPLTWYK P G+E
Sbjct: 61 GTRDMSKYKWSYKVGLKGEILNLYSVKGSNSVQWMKG--SFQKQPLTWYKTTFNTPAGNE 118
Query: 644 PIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEP 703
P+ LDM M KG W+NG IGRY+P +C +C Y G F KC+ CG P
Sbjct: 119 PLALDMSSMSKGQIWVNGRSIGRYFP----GYIASGKC-NKCSYTGFFTEKKCLWNCGGP 173
Query: 704 SQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
SQ+WYHIPR W P+ N+L+I EE GG+P I+ R +
Sbjct: 174 SQKWYHIPRDWLSPNGNLLIILEEIGGNPQGISLVKRTV 212
>gi|255550379|ref|XP_002516240.1| beta-galactosidase, putative [Ricinus communis]
gi|223544726|gb|EEF46242.1| beta-galactosidase, putative [Ricinus communis]
Length = 216
Score = 177 bits (450), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 80/129 (62%), Positives = 100/129 (77%), Gaps = 5/129 (3%)
Query: 195 GGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTEN 254
GK Y W + MA + +IGVPWI+CQQ D P P+INTC +YCDQFTP++ + PK WTEN
Sbjct: 56 AGKAYLDWCSDMAESLDIGVPWIICQQRDAPQPMINTCYGWYCDQFTPNTANSPKKWTEN 115
Query: 255 WPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF-ITTSY 313
W GWFK++G +DPHR +E +AF+VARFFQ N YMYHGGTNFGRTAGGP+ TTS+
Sbjct: 116 WTGWFKSWGDKDPHRTAEGVAFAVARFFQ----FQNCYMYHGGTNFGRTAGGPYSTTTSH 171
Query: 314 DYEAPIDEY 322
DY+AP+DE+
Sbjct: 172 DYDAPLDEH 180
>gi|298204831|emb|CBI25664.3| unnamed protein product [Vitis vinifera]
Length = 118
Score = 177 bits (449), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 77/112 (68%), Positives = 97/112 (86%)
Query: 58 MWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIG 117
MW GLV+ AKEGG++ IE+YVFWNGHELSPG YYFGG ++L+KF+KI+QQ MY+ILR G
Sbjct: 1 MWSGLVKTAKEGGIDVIETYVFWNGHELSPGNYYFGGWYDLLKFVKIVQQDGMYLILRFG 60
Query: 118 PFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLF 169
PFV AE+N+ G+ VWLHY+PGTVF ++EPF YHMQKFMTL+V++MK+EKL
Sbjct: 61 PFVVAEWNFSGVLVWLHYMPGTVFWTNSEPFNYHMQKFMTLVVNIMKKEKLL 112
>gi|410456453|ref|ZP_11310314.1| beta-galactosidase [Bacillus bataviensis LMG 21833]
gi|409928122|gb|EKN65245.1| beta-galactosidase [Bacillus bataviensis LMG 21833]
Length = 867
Score = 177 bits (449), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 117/375 (31%), Positives = 185/375 (49%), Gaps = 34/375 (9%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+TYD +S I+ R I+SAAIHY R W ++ +AK GG NTIE+Y+ WN HE++
Sbjct: 2 ITYDKKSWKIHNERVFILSAAIHYFRLPRAEWNEVLDKAKAGGCNTIETYIPWNFHEMNE 61
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G++ F G +L F ++ +Y+I R GP++ AE+++GG P WL +R+
Sbjct: 62 GEWDFSGDKDLAHFFQLCADKELYVIARPGPYICAEWDFGGFPWWLSTKKDIQYRSAQPA 121
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
F +++ ++ ++ ++ +L ++ G +I+ QVENE+ YG+ K Y +
Sbjct: 122 FLHYVDQYFDRVIPIIDEYQL--TKNGTVIMVQVENEF----QAYGKPDKPYMEYIRDGM 175
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHS-----------PSMPKIWTENWP 256
A+ I VP + C + + + N F HS P PK E W
Sbjct: 176 KARGIDVPLVTC--YGAVEGAVEFRN------FWSHSKHAAAILDERFPDQPKGVMEFWI 227
Query: 257 GWFKTFGG-RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNF----GRTAG-GPFIT 310
GWF+ +GG + + E + + G + NYYMY GGTNF GRT G T
Sbjct: 228 GWFEQWGGNKADQKTPEQLERECYQLLSNGFTAINYYMYFGGTNFDHWGGRTVGEQTLCT 287
Query: 311 TSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGER--SNLSLGSSQEADVYAD 368
T+YDY+ IDEY P K+ LK H +K E + E+ S++ L S +++ A
Sbjct: 288 TTYDYDVAIDEYLQPTR-KYEVLKRYHSFVKWLEPLFTDAEKVASDMKLPSDLKSERIAS 346
Query: 369 SSGACAAFLANMDDK 383
G N +++
Sbjct: 347 PYGEVIFIENNRNER 361
>gi|320536152|ref|ZP_08036203.1| glycosyl hydrolase family 35 [Treponema phagedenis F0421]
gi|320147005|gb|EFW38570.1| glycosyl hydrolase family 35 [Treponema phagedenis F0421]
Length = 857
Score = 175 bits (444), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 107/335 (31%), Positives = 172/335 (51%), Gaps = 21/335 (6%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+ +DS S II+G+R+ IISAA+HY R W ++++A+ GG N IE+Y+ WN HE +
Sbjct: 2 IQFDSNSWIIDGKRKFIISAAVHYFRLPRAEWAAVIRKARLGGCNAIETYIAWNYHETAE 61
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
++ F G +L F I MY+I+R GP++ AE+++GG+P +L+ G +R
Sbjct: 62 EQWDFSGDKDLAAFFAICHDEGMYVIVRPGPYICAEWDFGGLPYYLNNTDGIEYRCSNAA 121
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
++ ++++ I+ +++R +L GG II+ Q+ENEY +G+ + + ++
Sbjct: 122 YEQAVRRYFERIMPIIRRYQL--GSGGSIIMVQIENEY----HAFGKKDLAHIRFLEELT 175
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQ-----FTPHSPSMPKIWTENWPGWFKTF 262
I VP + C + + N + + P E W GW + +
Sbjct: 176 RGFGITVPLVSC--YGAGRNTVEMRNFWSGAERAAAVLRERQSGQPLGIMEFWIGWVEHW 233
Query: 263 GGR-DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNF----GRTAGGP--FITTSYDY 315
GG H+P+E + + G NYYMY GG+NF GRT G F+T SYDY
Sbjct: 234 GGEPQKHKPAEAVLSHCFEALKSGFVFFNYYMYFGGSNFGSWGGRTIGAHKIFMTQSYDY 293
Query: 316 EAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNG 350
+AP+DE+G K+ L LH I E+ L G
Sbjct: 294 DAPLDEFGF-ETEKYRLLAVLHTFIAWLENDLTAG 327
>gi|15228075|ref|NP_178493.1| glycosyl hydrolase family 35 protein [Arabidopsis thaliana]
gi|20198172|gb|AAM15443.1| predicted protein [Arabidopsis thaliana]
gi|330250699|gb|AEC05793.1| glycosyl hydrolase family 35 protein [Arabidopsis thaliana]
Length = 469
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 128/378 (33%), Positives = 167/378 (44%), Gaps = 85/378 (22%)
Query: 293 MYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGER 352
MYHG TNF RTAGGPFITT+YDY+AP+DE+G PK+GHLK+LH E L G
Sbjct: 23 MYHGHTNFDRTAGGPFITTTYDYDAPLDEFGNLNQPKYGHLKQLHDVFHAMEKTLTYGNI 82
Query: 353 SNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKVV 412
S G+ VY G+ + F+ N++ K + F+ SY +PAW VSILPDCK
Sbjct: 83 STADFGNLVMTTVYQTEEGS-SCFIGNVNAK----INFQGTSYDVPAWYVSILPDCKTES 137
Query: 413 FNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIWGEADFVKSGFVDHI 472
+NTA +++ FK
Sbjct: 138 YNTAKRMKLRTSLR------------------------FK-------------------- 153
Query: 473 NTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTH 532
N + D +D+LWY T+ VN E+ G L I S H LH F N + G+
Sbjct: 154 NVSNDESDFLWYMTT--VNLKEQDPAWGKNMSLRINSTAHVLHGFVNGQHTGNYRVENGK 211
Query: 533 PPFKYKNPISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTY 591
+ ++ G N I LLS+TV L N G F+E V AGIT V I G N
Sbjct: 212 FHYVFEQDAKFNPGVNVITLLSVTVDLPNYGAFFENVPAGITGPVFIIGRN--------- 262
Query: 592 SWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLK 651
G + ++ST LT +KA P G EP+ +D+L
Sbjct: 263 ------------------GDETVVKYLSTHNGATK--LTIFKA----PLGSEPVVVDLLG 298
Query: 652 MGKGLAWLNGEEIGRYWP 669
GKG A +N GRYWP
Sbjct: 299 FGKGKASINENYTGRYWP 316
>gi|334134215|ref|ZP_08507725.1| putative beta-galactosidase [Paenibacillus sp. HGF7]
gi|333608023|gb|EGL19327.1| putative beta-galactosidase [Paenibacillus sp. HGF7]
Length = 940
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 118/371 (31%), Positives = 181/371 (48%), Gaps = 23/371 (6%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
V YD S II+GRR I+SAA+HY R W ++ ++KE G N IE+YV WN HE
Sbjct: 5 RVQYDRNSWIIDGRRVFILSAAVHYFRLPRAEWAEVLDKSKEAGCNCIETYVPWNWHEEE 64
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G++ F G +L F+ + + +Y+I+R GP++ AE++ GG+P WL P +R
Sbjct: 65 EGQWDFSGDKDLGAFLDLCAERGLYVIVRPGPYICAEWDMGGLPYWLERKPDMQYRKFHR 124
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
F +++ + +V ++ L S G +I+ QVENE+ G+ K Y +
Sbjct: 125 EFLHYVDLYWDRLVPVVLPRLL--SNSGTVIMVQVENEF----QALGKPDKAYMEYLRDG 178
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSF-----YCDQFTPHSPSMPKIWTENWPGWFKT 261
+ + I VP + C + D + N + + PK E W GWF+
Sbjct: 179 LIERGIDVPLVTC--YGAVDGAVEFRNFWSHAEEHARTLEERFADQPKGVLEFWIGWFEQ 236
Query: 262 FGG-RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNF----GRTAGG-PFITTSYDY 315
+GG R + + + ++G + NYYM+ GGTNF GRT G F+TTSYDY
Sbjct: 237 WGGPRANQKTASQVERKTYELIREGFTAINYYMFFGGTNFGHWGGRTIGEHTFMTTSYDY 296
Query: 316 EAPIDEYGLPRNPKWGHLKELHGAIKLCEHAL--LNGERSNLSLGSSQEADVYADSSGAC 373
+A +DEY P K+ LK +H ++ E L G + + LG A + G
Sbjct: 297 DAALDEYLRP-TAKYKALKLVHDFVRWMEPLLTETTGSTAFIPLGKHSSAKKKSGPQGTI 355
Query: 374 AAFLANMDDKN 384
F+ N D +
Sbjct: 356 -LFIHNDDTER 365
>gi|284030079|ref|YP_003380010.1| beta-galactosidase [Kribbella flavida DSM 17836]
gi|283809372|gb|ADB31211.1| Beta-galactosidase [Kribbella flavida DSM 17836]
Length = 582
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 108/312 (34%), Positives = 159/312 (50%), Gaps = 30/312 (9%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+++G I+S A+HY R P +W + +A+ G+NTIE+YV WN H G +
Sbjct: 10 DFLLDGEPFRILSGALHYFRVHPDLWADRIDKARRMGLNTIETYVPWNAHSPRRGVFDTD 69
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G +L +F++ + A +Y I+R GP++ AE++ GG+P WL PG R F ++
Sbjct: 70 GMLDLGRFLEQVAAAGLYAIVRPGPYICAEWDNGGLPAWLFQEPGVGVRRYEPRFLAAVE 129
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIG 213
+++ ++D+++ L QGGP++L QVENEYG + + Y A M I
Sbjct: 130 QYLEQVLDLVR--PLQVDQGGPVLLLQVENEYGAFGN-----DPEYLEAVAGMIRKAGIT 182
Query: 214 VPWIMCQQ-------FDTPDPVINTCN-----SFYCDQFTPHSPSMPKIWTENWPGWFKT 261
VP + Q D V+ T + + H P+ P + E W GWF
Sbjct: 183 VPLVTVDQPTGEMLAAGGLDGVLRTGSFGSRSAERLATLREHQPTGPLMCMEFWDGWFDH 242
Query: 262 FGGRDPHRPS--EDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG----GPF--ITTSY 313
+GG PH + ED A + G SV N YM+HGGTNFG T+G G F TSY
Sbjct: 243 WGG--PHHTTSVEDAARELDALLAAGASV-NIYMFHGGTNFGLTSGADDKGVFRPTVTSY 299
Query: 314 DYEAPIDEYGLP 325
DY+AP+DE G P
Sbjct: 300 DYDAPLDEAGRP 311
>gi|340370414|ref|XP_003383741.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Amphimedon
queenslandica]
Length = 689
Score = 169 bits (429), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 113/332 (34%), Positives = 176/332 (53%), Gaps = 29/332 (8%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
++ D S I G++ I+S +IHY R VP W +++ K G+NT+++YV WN HE P
Sbjct: 71 LSLDEDSFYIRGKKTHILSGSIHYFRVVPDYWTDRLKKLKAMGLNTVDTYVSWNLHEPMP 130
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G++ F G N+ +FIKI + +I+R GP++ +E++ GG+P WL + P R++ +P
Sbjct: 131 GEFDFSGLLNIHEFIKIAHSLELNVIVRPGPYICSEWDNGGLPAWLLHDPNMKIRSNYKP 190
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY--ESFYGEGGKRY------ 199
++ +++F T + +++ L +S GGPII QVENEY Y + G +Y
Sbjct: 191 YQDAVKRFFTKLFEILT--PLQSSYGGPIIAFQVENEYAAYGPRNATGRHHMQYLANLMR 248
Query: 200 ALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCN-----SFYCDQFTPHSPSMPKIWTEN 254
+L A ++ + + G I P+ + T N S ++ P+ P + E
Sbjct: 249 SLGAVELFITSD-GQNDIKASSDMAPNNALLTVNFQNDPSEALNKLLLVQPNKPPLVMEY 307
Query: 255 WPGWFKTFGGRDPHR---PSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-----RTAGG 306
W GWF +G R R PS+ I ++ Q GGS N YM+HGGTNFG GG
Sbjct: 308 WTGWFDHWGRRHLERTLSPSQLIV-NIGTILQMGGSF-NLYMFHGGTNFGFMNGANIEGG 365
Query: 307 PFI--TTSYDYEAPIDEYGLPRNPKWGHLKEL 336
+ TSYDY+AP+ E G K+ L+EL
Sbjct: 366 EYRPDVTSYDYDAPLSEAG-DITKKYTLLREL 396
>gi|298205257|emb|CBI17316.3| unnamed protein product [Vitis vinifera]
Length = 141
Score = 169 bits (429), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 81/113 (71%), Positives = 95/113 (84%)
Query: 487 SIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAG 546
+I V E+E FLK S+P+LL+ESKGHALHAF NQ+LQGSASGNG+H PFK++ PISLKAG
Sbjct: 9 NITVGESENFLKEISQPILLVESKGHALHAFVNQKLQGSASGNGSHSPFKFECPISLKAG 68
Query: 547 KNEIALLSMTVGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGL 599
KNEI +LSMTVGLQN PFYEWVGA +TSVKI G N+G +DLSTY W YK+ L
Sbjct: 69 KNEIVVLSMTVGLQNEIPFYEWVGARLTSVKIKGLNNGIMDLSTYPWIYKVFL 121
>gi|2289790|dbj|BAA21669.1| beta-galactosidase [Bacillus circulans]
Length = 586
Score = 169 bits (428), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 127/462 (27%), Positives = 206/462 (44%), Gaps = 75/462 (16%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+TYD S +++G+ ++S A+HY R+VP W + + K G NT+E+YV WN HE
Sbjct: 3 QLTYDD-SFLLDGKEIRLLSGAMHYFRTVPEYWEDRLLKLKACGFNTVETYVAWNLHEPE 61
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G++ F G ++V+FIK ++ +++I+R GPF+ AE+ +GG P WL +P R +
Sbjct: 62 EGQFVFEGIADIVRFIKTAEKVGLHVIVRPGPFICAEWEFGGFPYWLLTVPNIKLRCFNQ 121
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
P+ + + ++ + ++ L +S GGPII Q+ENEYG + G + L +
Sbjct: 122 PYLEKVDAYFDVLFERLR--PLLSSNGGPIIALQIENEYGSF------GNDQKYLQYLRD 173
Query: 207 AVAQNIGVPWIMCQQFDTPDP----------VINTCN-----SFYCDQFTPHSPSMPKIW 251
+ + +G + D P+P + T N Q + P+ P +
Sbjct: 174 GIKKRVGNELLFTS--DGPEPSMLSGGMIEGIFETVNFGSRAESAFAQLKQYQPNAPLMC 231
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG----- 306
E W GWF +G R +E + ++ ++ GSV N+YM HGGTNFG G
Sbjct: 232 MEFWHGWFDHWGEEHHTRSAESVVETLEEILKQNGSV-NFYMAHGGTNFGFYNGANHNET 290
Query: 307 ---PFITTSYDYEAPIDEYG------------------LPR-NPKWGHLKELHGAIKLCE 344
P I TSYDY+ + E G LP N K L G +K E
Sbjct: 291 DYQPTI-TSYDYDGLLTESGDVTEKFYAVRKVFEKYVDLPELNLPAPIPKRLFGKVKFTE 349
Query: 345 HALLNGERSNLSLGSSQEADVYADSSGACAAFLA--------------NMDDKNDKTVVF 390
HA L +S EA + + G F+ + D +D+ V+
Sbjct: 350 HAGLLDSLHRISTPQKSEAPLPMEKYGQAYGFIVYETTIKGAYGKQALTVQDIHDRGQVY 409
Query: 391 RNVSYHLPAWSVSILPDCKKVVFNTANVRAQSSTVEMVPENL 432
N Y V I+ + + + S ++++ EN+
Sbjct: 410 VNGEY------VGIVERNRGCSRLVVELTEEESKLQIIVENM 445
>gi|334338180|ref|YP_004543332.1| glycoside hydrolase family protein [Isoptericola variabilis 225]
gi|334108548|gb|AEG45438.1| glycoside hydrolase family 35 [Isoptericola variabilis 225]
Length = 603
Score = 169 bits (427), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 105/321 (32%), Positives = 161/321 (50%), Gaps = 22/321 (6%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+++GR I+S A+HY R P W +++A+ G+NT+E+YV WN H G +
Sbjct: 10 DFLLDGRSLQIVSGALHYFRVHPDQWADRIRKARLLGLNTVETYVAWNVHSPERGVFDTS 69
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
GR +L +F+ ++ ++ I+R GP++ AE+ GG+P WL P R F +
Sbjct: 70 GRRDLARFLDLVAAEGLHAIVRPGPYICAEWTGGGLPAWLFADPEVGVRRAEPRFLEAIG 129
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIG 213
++ ++ ++ ++ ++GGP+++ QVENEYG Y +RY A M AQ I
Sbjct: 130 EYYAALLPIVAERQV--TRGGPVLMVQVENEYGAYGDDPPVERERYLRALADMIRAQGID 187
Query: 214 VPWIMCQQFDTPD------PVINTCNSFYCDQ------FTPHSPSMPKIWTENWPGWFKT 261
VP Q + P + T +F H P+ P + E W GWF +
Sbjct: 188 VPLFTSDQANDHHLSRGSLPELLTTANFGSRATERLAILRKHQPTGPLMCMEFWDGWFDS 247
Query: 262 FGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG----GPF--ITTSYDY 315
G P E A + G SV N YM HGGTNFG T+G G + ITTSYDY
Sbjct: 248 AGLHHHTTPPEANARDLDDLLAAGASV-NLYMLHGGTNFGLTSGANDKGVYRPITTSYDY 306
Query: 316 EAPIDEYGLPRNPKWGHLKEL 336
+AP+ E+G P K+ ++E+
Sbjct: 307 DAPLSEHGAP-TAKYVAMREV 326
>gi|325297293|ref|YP_004257210.1| glycoside hydrolase family protein [Bacteroides salanitronis DSM
18170]
gi|324316846|gb|ADY34737.1| glycoside hydrolase family 35 [Bacteroides salanitronis DSM 18170]
Length = 784
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 116/348 (33%), Positives = 168/348 (48%), Gaps = 30/348 (8%)
Query: 7 IAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQA 66
+ F + F S +G + ++NG ++ +A +HYPR W ++Q
Sbjct: 12 LLSFGAMAGFQSCSPKTESGTFEAGKGTFLLNGEPFVVKAAELHYPRIPRAYWEHRIKQC 71
Query: 67 KEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNY 126
K G+NTI YVFWN HE PG++ F G+ +L +F ++ Q+ MY+ILR GP+V AE+
Sbjct: 72 KALGMNTICLYVFWNFHEEKPGEFDFTGQKDLAEFCRLCQKNDMYVILRPGPYVCAEWEM 131
Query: 127 GGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG 186
GG+P WL R D F + F + + + L +GGPII+ QVENEYG
Sbjct: 132 GGLPWWLLKKKDIRLREDDPYFLERVAIFEKEVANQVA--GLTIQKGGPIIMVQVENEYG 189
Query: 187 YYESFYGEGGKRYALWAAKMAVAQNIG-VPWIMCQ-----QFDTPDPVINTCN----SFY 236
Y G + + + V N G V C Q + D ++ T N +
Sbjct: 190 SY------GESKEYVAKIRDIVRGNFGDVTLFQCDWASNFQLNALDDLVWTMNFGTGANI 243
Query: 237 CDQFTPHS---PSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYM 293
+QF P P P + +E W GWF +G R ++D+ + KG S + YM
Sbjct: 244 DEQFAPLKKVRPDSPLMCSEFWSGWFDKWGANHETRAADDMIAGIDEMLSKGISF-SLYM 302
Query: 294 YHGGTNFGRTAG------GPFITTSYDYEAPIDEYGLPRNPKWGHLKE 335
HGGTN+G AG P + TSYDY+API E G PK+ L+E
Sbjct: 303 THGGTNWGHWAGANSPGFAPDV-TSYDYDAPISESG-KITPKYEKLRE 348
>gi|62321607|dbj|BAD95183.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 275
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 76/161 (47%), Positives = 105/161 (65%), Gaps = 7/161 (4%)
Query: 586 LDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWV-STMEPPKNQPLTWYKAVVKQPPGDEP 644
+DLS WTY++GL+GE + + P +I W+ +++ K QPLTW+K P G+EP
Sbjct: 1 MDLSWQKWTYQVGLKGEAMNLAFPTNTPSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEP 60
Query: 645 IGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGCGEPS 704
+ LDM MGKG W+NGE IGRYW + H C Y G + P+KC TGCG+P+
Sbjct: 61 LALDMEGMGKGQIWVNGESIGRYWTAFATGDCSH------CSYTGTYKPNKCQTGCGQPT 114
Query: 705 QRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKISGF 745
QRWYH+PR+W KPS+N+LVIFEE GG+P+ ++ R +SG
Sbjct: 115 QRWYHVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSGV 155
>gi|373460889|ref|ZP_09552639.1| hypothetical protein HMPREF9944_00903 [Prevotella maculosa OT 289]
gi|371954714|gb|EHO72523.1| hypothetical protein HMPREF9944_00903 [Prevotella maculosa OT 289]
Length = 780
Score = 166 bits (420), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 112/344 (32%), Positives = 170/344 (49%), Gaps = 23/344 (6%)
Query: 7 IAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQA 66
I A ++ S ++ G+ T + ++NGR +I +A +HYPR W ++
Sbjct: 7 IRTIAAVLLLSLAVPSARGGDFTVGKNTFLLNGRPFVIKAAELHYPRIPRPYWEQRIKMC 66
Query: 67 KEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNY 126
K G+NT+ YVFWN HE G++ F G ++ F ++ + MY+I+R GP+V AE+
Sbjct: 67 KALGMNTLCLYVFWNIHEQREGQFDFTGNNDVAAFCRLAHKNGMYVIVRPGPYVCAEWEM 126
Query: 127 GGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG 186
GG+P WL R D F ++ F + + L GGPII+ QVENEYG
Sbjct: 127 GGLPWWLLKKKDVRLREDDPYFMARVKAFEAEVGRQLA--PLTIQNGGPIIMVQVENEYG 184
Query: 187 YY---ESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCN----SFYCDQ 239
Y + + E R + A+ W + + D ++ T N + +Q
Sbjct: 185 SYGINKKYVSE--IRDIVKASGFDKVTLFQCDWASNFEHNGLDDLVWTMNFGTGANIDEQ 242
Query: 240 F---TPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHG 296
F P P + +E W GWF +G R RP++D+ + +KG S + YM HG
Sbjct: 243 FRRLKQLRPEAPLMCSEFWSGWFDKWGARHETRPAKDMVEGIDEMLRKGISF-SLYMTHG 301
Query: 297 GTNFGRTAG------GPFITTSYDYEAPIDEYGLPRNPKWGHLK 334
GT+FG AG P + TSYDY+API+EYG+P PK+ L+
Sbjct: 302 GTSFGHWAGANSPGFAPDV-TSYDYDAPINEYGMP-TPKFFALR 343
>gi|300789308|ref|YP_003769599.1| beta-galactosidase [Amycolatopsis mediterranei U32]
gi|384152800|ref|YP_005535616.1| beta-galactosidase [Amycolatopsis mediterranei S699]
gi|399541188|ref|YP_006553850.1| beta-galactosidase [Amycolatopsis mediterranei S699]
gi|299798822|gb|ADJ49197.1| beta-galactosidase [Amycolatopsis mediterranei U32]
gi|340530954|gb|AEK46159.1| beta-galactosidase [Amycolatopsis mediterranei S699]
gi|398321958|gb|AFO80905.1| beta-galactosidase [Amycolatopsis mediterranei S699]
Length = 584
Score = 166 bits (419), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 106/325 (32%), Positives = 162/325 (49%), Gaps = 35/325 (10%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+++GR I+S A+HY R P +W + +A+ G+NTIE+YV WN H PG +
Sbjct: 10 DFLLDGRPFRILSGALHYFRVHPDLWADRIDKARRMGLNTIETYVAWNAHAPEPGTFDLS 69
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G +L +F++++ A MY I+R GP++ AE++ GG+P WL P R + ++
Sbjct: 70 GGLDLDRFLRLVADAGMYAIVRPGPYICAEWDNGGLPAWLFRDPSVGVRRYEPKYLDAVR 129
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIG 213
+++T + +++ ++ +GGP++L QVENEYG + KRY A+ +
Sbjct: 130 EYLTKVYEVVVPHQI--DRGGPVLLVQVENEYGAFGD-----DKRYLKALAEHTREAGVT 182
Query: 214 VPWIMCQQFDTPDPVINTCNSFYCDQFT---------------PHSPSMPKIWTENWPGW 258
VP D P P + S T H P+ P + +E W GW
Sbjct: 183 VP---LTTVDQPTPEMLEAGSLDGLHRTASFGSGAEARLAILRAHQPTGPLMCSEFWNGW 239
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-------PFITT 311
F +G + D A + G SV N YM+HGGTNFG T G P I T
Sbjct: 240 FDHWGAHHHTTSAADSAAELDALLAAGASV-NLYMFHGGTNFGLTNGANDKGVYQPLI-T 297
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKEL 336
SYDY+AP+DE G P PK+ +++
Sbjct: 298 SYDYDAPLDEAGDP-TPKYHAFRDV 321
>gi|256376699|ref|YP_003100359.1| beta-galactosidase [Actinosynnema mirum DSM 43827]
gi|255921002|gb|ACU36513.1| Beta-galactosidase [Actinosynnema mirum DSM 43827]
Length = 579
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 105/314 (33%), Positives = 157/314 (50%), Gaps = 34/314 (10%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+++GR +++ A+HY R P +W +++A+ G+NTIE+Y WN HE G Y F
Sbjct: 10 DFLLDGRPHRVLAGALHYFRVHPDLWADRIEKARLMGLNTIETYTPWNLHEPVEGAYDFT 69
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G +L +F++++ A M+ I+R GP++ AE++ GG+P WL+ P R + +
Sbjct: 70 GMLDLERFLRLVADAGMHAIVRPGPYICAEWDNGGLPAWLYRDPEVGVRRSEPRYLGAVS 129
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIG 213
++ + D++ L +GGP++L Q+ENEYG Y S K Y + I
Sbjct: 130 AYLRRVYDVVT--PLQIDRGGPVVLVQIENEYGAYGS-----DKFYLRHLVDLTRECGIT 182
Query: 214 VPWIMCQQFDTPDPVINTCNSFYCDQFT---------------PHSPSMPKIWTENWPGW 258
VP D P + + S C T H P+ P + +E W GW
Sbjct: 183 VP---LTTVDQPTDEMLSQGSLDCLHRTGSFGSRATERLATLRRHQPTGPLMCSEFWNGW 239
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-------PFITT 311
F +G R +ED A + G SV N YM+HGGTNFG T+G P I T
Sbjct: 240 FDHWGDRHHTTSAEDSAAELDALLAAGASV-NIYMFHGGTNFGLTSGANDKGVYQPTI-T 297
Query: 312 SYDYEAPIDEYGLP 325
SYDY+AP+DE G P
Sbjct: 298 SYDYDAPLDEAGNP 311
>gi|399022099|ref|ZP_10724178.1| beta-galactosidase [Chryseobacterium sp. CF314]
gi|398085466|gb|EJL76124.1| beta-galactosidase [Chryseobacterium sp. CF314]
Length = 618
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 112/354 (31%), Positives = 168/354 (47%), Gaps = 27/354 (7%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
+ +L FFS ++ Y GN +++G+ I S +HYPR W +Q K
Sbjct: 9 YIILSFFSINLLYSQKGNFEIKDGHFLLSGKPFTIYSGEMHYPRVPSEYWKHRLQMMKSM 68
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G+NT+ +YVFWN HE PGK+ F G +L KFIK Q+A +Y+I+R GP+V AE+ +GG
Sbjct: 69 GLNTVTTYVFWNYHEEEPGKWNFSGEKDLKKFIKTAQEAGLYVIIRPGPYVCAEWEFGGY 128
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY- 188
P WL R D + F + ++ + + L + GGP+I+ Q ENE+G Y
Sbjct: 129 PWWLQKDKNLEIRTDNKAFLKQCENYINELAKQII--PLQINNGGPVIMVQAENEFGSYV 186
Query: 189 ---ESFYGEGGKRYALWAAKMAVAQNIGVP-------WIMCQ-QFDTPDPVIN---TCNS 234
+ E K+Y+ V I VP W+ + + P N ++
Sbjct: 187 AQRKDISLEQHKKYSHKIKDFLVKSGITVPFFTSDGSWLFKEGSIEGALPTANGEGDVDN 246
Query: 235 FYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMY 294
++ P + E +PGW + +ED+ + K G NYYM
Sbjct: 247 LRKKINEFNNGKGPYMVAEYYPGWLDHWAEPFVKVSTEDVV-KQTELYIKNGISFNYYMI 305
Query: 295 HGGTNFGRTAGGPFIT--------TSYDYEAPIDEYGLPRNPKWGHLKELHGAI 340
HGGTNFG T+G + TSYDY+API+E G PK+ L+++ I
Sbjct: 306 HGGTNFGFTSGANYDKNHDIQPDLTSYDYDAPINEAGWV-TPKFNALRDIFQKI 358
>gi|257869131|ref|ZP_05648784.1| 35 glycosylhydrolase [Enterococcus gallinarum EG2]
gi|257803295|gb|EEV32117.1| 35 glycosylhydrolase [Enterococcus gallinarum EG2]
Length = 584
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 112/325 (34%), Positives = 162/325 (49%), Gaps = 31/325 (9%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
+N + IIS +IHY R VP W +++ + G NT+E+YV WN HE GK+ F
Sbjct: 12 LNDQPMKIISGSIHYFRVVPAYWRDRLEKLRLMGCNTVETYVPWNMHEPQEGKFDFSDNL 71
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
+L +FI++ Q+ +Y+ILR P++ AE+ +GG+P WL P R D PF + ++
Sbjct: 72 DLRRFIQLAQEVGLYVILRPAPYICAEWEFGGLPYWLLKDPFMKIRFDYPPFMEKIARYF 131
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGV-- 214
T + + L +Q GPI++ QVENEYG Y + K Y +A++ I V
Sbjct: 132 TQLFSQVS--DLQITQEGPILMMQVENEYGSYGN-----DKSYLRKSAELMRHNGIDVSL 184
Query: 215 -----PWI-MCQQFDTPD---PVINTCNSFYCDQFTP----HSPSMPKIWTENWPGWFKT 261
PW+ M + D P IN C S + F H P + E W GWF
Sbjct: 185 FTSDGPWLDMLENGSIKDIALPTIN-CGSDIQENFRKLQEFHGKKQPLMVMEFWIGWFDA 243
Query: 262 FGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI------TTSYDY 315
+G H S A + R + GSV N YM+HGGTNFG G + TSYDY
Sbjct: 244 WGDDKHHTTSVTDAANELRDCLEAGSV-NIYMFHGGTNFGFMNGANYYEKLSPDVTSYDY 302
Query: 316 EAPIDEYGLPRNPKWGHLKELHGAI 340
+A + E+G PK+ +++ G I
Sbjct: 303 DALLSEWG-DVTPKYEAFQQVIGEI 326
>gi|357050010|ref|ZP_09111224.1| hypothetical protein HMPREF9478_01207 [Enterococcus saccharolyticus
30_1]
gi|355382493|gb|EHG29591.1| hypothetical protein HMPREF9478_01207 [Enterococcus saccharolyticus
30_1]
Length = 584
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 112/325 (34%), Positives = 162/325 (49%), Gaps = 31/325 (9%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
+N + IIS +IHY R VP W +++ + G NT+E+YV WN HE GK+ F
Sbjct: 12 LNDQPMKIISGSIHYFRVVPAYWRDRLEKLRLMGCNTVETYVPWNMHEPQEGKFDFSDNL 71
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
+L +FI++ Q+ +Y+ILR P++ AE+ +GG+P WL P R D PF + ++
Sbjct: 72 DLRRFIQLAQEVGLYVILRPAPYICAEWEFGGLPYWLLKDPFMKIRFDYPPFMEKIARYF 131
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGV-- 214
T + + L +Q GPI++ QVENEYG Y + K Y +A++ I V
Sbjct: 132 TQLFSQVS--DLQITQEGPILMMQVENEYGSYGN-----DKSYLRKSAELMRHNGIDVPL 184
Query: 215 -----PWI-MCQQFDTPD---PVINTCNSFYCDQFTP----HSPSMPKIWTENWPGWFKT 261
PW+ M + D P IN C S + F H P + E W GWF
Sbjct: 185 FTSDGPWLDMLENGSIKDIALPTIN-CGSDIQENFRKLQEFHGKKQPLMVMEFWIGWFDA 243
Query: 262 FGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI------TTSYDY 315
+G H S A + R + GSV N YM+HGGTNFG G + TSYDY
Sbjct: 244 WGDDKHHTTSVTDAANELRDCLEAGSV-NIYMFHGGTNFGFMNGANYYEKLLPDVTSYDY 302
Query: 316 EAPIDEYGLPRNPKWGHLKELHGAI 340
+A + E+G PK+ +++ G I
Sbjct: 303 DALLSEWG-DVTPKYEAFQQVIGEI 326
>gi|256831356|ref|YP_003160083.1| beta-galactosidase [Jonesia denitrificans DSM 20603]
gi|256684887|gb|ACV07780.1| Beta-galactosidase [Jonesia denitrificans DSM 20603]
Length = 584
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 108/321 (33%), Positives = 158/321 (49%), Gaps = 32/321 (9%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
R ++G IIS AIHY R P W +++A+ G+NTIE+YV WN H S +++
Sbjct: 8 ERDFTLDGEPFQIISGAIHYFRVHPDSWRDRIRKARLMGLNTIETYVAWNFHAPSRDEFH 67
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
G +L +F+ IIQ+ + I+R GP++ AE++ GG+P WL P V R+ +
Sbjct: 68 TDGARDLGRFLDIIQEEGLRAIVRPGPYICAEWDNGGLPTWLTATPDIVVRSSDPTYLTE 127
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+++++ + +++ ++ + GGPIIL QVENEYG Y G A V +N
Sbjct: 128 VERYLEHLAPIVEPRQI--NHGGPIILMQVENEYGAY-------GNDRAYLTHLTNVYRN 178
Query: 212 IG--VPWIMCQQ------FDTPDPVINTCNSF------YCDQFTPHSPSMPKIWTENWPG 257
+G VP Q P ++T SF H + P + +E W G
Sbjct: 179 LGFVVPLTTVDQPMDDMLAHGTLPDLHTTGSFGSRIDERLATLREHQTTGPLMCSEFWIG 238
Query: 258 WFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-------PFIT 310
WF +G D A ++ R G SV N YM+HGGTNFG T G P +
Sbjct: 239 WFDHWGAHHHTTDVADAANALDRLLGAGASV-NIYMFHGGTNFGFTNGANDKGVYQPLV- 296
Query: 311 TSYDYEAPIDEYGLPRNPKWG 331
TSYDY+AP+ E G P W
Sbjct: 297 TSYDYDAPLAEDGYPTEKYWA 317
>gi|269794634|ref|YP_003314089.1| beta-galactosidase [Sanguibacter keddieii DSM 10542]
gi|269096819|gb|ACZ21255.1| beta-galactosidase [Sanguibacter keddieii DSM 10542]
Length = 586
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 98/313 (31%), Positives = 158/313 (50%), Gaps = 32/313 (10%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+++G+ I+S A+HY R P +W + +A+ G+NTIE+YV WN H G++
Sbjct: 7 DFLLDGKPFRILSGALHYFRVHPDLWADRIHKARLMGLNTIETYVPWNAHAPQRGEFRTD 66
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G +L +F+++++ M I+R GP++ AE++ GG+P WL P R D + +
Sbjct: 67 GALDLERFLRLVEAEGMLAIVRPGPYICAEWDNGGLPGWLFRDPAVGVRRDEPLYMEAVS 126
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIG 213
+++ ++D++ ++ +GGP++L QVENEYG Y G + MA+ ++ G
Sbjct: 127 EYLGTVLDLVAPFQV--DRGGPVVLVQVENEYGAY-------GSDHVYLEKLMALTRSHG 177
Query: 214 VPWIMCQQFDTPDPV---------INTCNSF------YCDQFTPHSPSMPKIWTENWPGW 258
+ + D P ++ SF H P+ P + E W GW
Sbjct: 178 IT-VPLTSIDQPSGTMLADGSIDGLHRTGSFGSRSAERLATLREHQPTGPLMCAEFWDGW 236
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG----GPF--ITTS 312
F +G ++D A + G SV N YM+HGGTNFG T+G G + TTS
Sbjct: 237 FDHWGAHHHTTSAQDAARELDELLAAGASV-NIYMFHGGTNFGFTSGANDKGVYQPTTTS 295
Query: 313 YDYEAPIDEYGLP 325
YDY+AP+ E G P
Sbjct: 296 YDYDAPLAEDGYP 308
>gi|62321782|dbj|BAD95407.1| galactosidase [Arabidopsis thaliana]
Length = 270
Score = 163 bits (412), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 77/162 (47%), Positives = 98/162 (60%), Gaps = 5/162 (3%)
Query: 581 FNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPP 640
N G DLS WTYK+GL+GE L +++ +++ W + QPLTWYK P
Sbjct: 1 LNGGRRDLSWQKWTYKVGLKGESLSLHSLSGSSSVEWAEGAFVAQKQPLTWYKTTFSAPA 60
Query: 641 GDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYRGKFNPDKCITGC 700
GD P+ +DM MGKG W+NG+ +GR+WP S EC Y G F DKC+ C
Sbjct: 61 GDSPLAVDMGSMGKGQIWINGQSLGRHWPAYKAVGS-----CSECSYTGTFREDKCLRNC 115
Query: 701 GEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKITFSIRKI 742
GE SQRWYH+PRSW KPS N+LV+FEE GGDP IT R++
Sbjct: 116 GEASQRWYHVPRSWLKPSGNLLVVFEEWGGDPNGITLVRREV 157
>gi|125526285|gb|EAY74399.1| hypothetical protein OsI_02287 [Oryza sativa Indica Group]
Length = 255
Score = 162 bits (410), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 87/199 (43%), Positives = 107/199 (53%), Gaps = 48/199 (24%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+V+YD RSL+I+G+R +I+S +IHYPRS P
Sbjct: 29 SVSYDDRSLVIDGQRRIILSGSIHYPRSTP------------------------------ 58
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
+ IQ A MY ILRIGP++ E+NYGG+P WL IPG FR E
Sbjct: 59 ----------------EEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNE 102
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFY--GEGGKRYALWAA 204
PF+ M+ F TLIV+ MK K+FA QGGPIILAQ+ENEYG + Y W A
Sbjct: 103 PFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIHWCA 162
Query: 205 KMAVAQNIGVPWIMCQQFD 223
MA QN+GVPWIMCQQ D
Sbjct: 163 DMANKQNVGVPWIMCQQDD 181
>gi|429739263|ref|ZP_19273023.1| glycosyl hydrolase family 35 [Prevotella saccharolytica F0055]
gi|429157228|gb|EKX99829.1| glycosyl hydrolase family 35 [Prevotella saccharolytica F0055]
Length = 786
Score = 162 bits (410), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 116/348 (33%), Positives = 168/348 (48%), Gaps = 33/348 (9%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
FALL F+S G ++ ++NG+ +I +A +HYPR W ++ K
Sbjct: 12 FALLTVFTSFGAPKRGGIFVAGDKTFLLNGKPFVIKAAELHYPRIPRPYWEHRIRMCKAL 71
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G+NTI YVFWN HE GK+ F G ++ F ++ Q+ +Y+I+R GP+V AE+ GG+
Sbjct: 72 GMNTICLYVFWNIHEQQEGKFNFTGNNDVAAFCRLAQKHGLYVIVRPGPYVCAEWEMGGL 131
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY- 188
P WL R F ++ F + + + L +GGPII+ QVENEYG Y
Sbjct: 132 PWWLLKKKDIRLRERDPYFMERVKVFEQQVGNQLA--PLTIDKGGPIIMVQVENEYGSYG 189
Query: 189 ----------ESFYGEGGKRYAL----WAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNS 234
+ G + AL WA+ + W M F T N
Sbjct: 190 VDKEYVSQIRDIVRSSGFDKVALFQCDWASNFEKNGLDDLIWTM--NFGTG---ANIDEQ 244
Query: 235 FYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMY 294
F + P PK+ +E W GWF +G R RP++++ + KG S + YM
Sbjct: 245 F--KRLGELRPQSPKMCSEFWSGWFDKWGARHETRPAKNMVAGIDEMLTKGISF-SLYMT 301
Query: 295 HGGTNFGRTAG------GPFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
HGGT+FG AG P + TSYDY+API+EYGL PK+ L+ +
Sbjct: 302 HGGTSFGHWAGANSPGFAPDV-TSYDYDAPINEYGLA-TPKYYELRAM 347
>gi|300775043|ref|ZP_07084906.1| beta-galactosidase [Chryseobacterium gleum ATCC 35910]
gi|300506858|gb|EFK37993.1| beta-galactosidase [Chryseobacterium gleum ATCC 35910]
Length = 621
Score = 162 bits (409), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 108/352 (30%), Positives = 168/352 (47%), Gaps = 27/352 (7%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
+L+FFS + + G ++NG+ I S IHYPR W ++ K G+
Sbjct: 15 ILLFFSLNTVFSQKGKFEIRDGHFLLNGKPFTIYSGEIHYPRVPSAYWKHRLEMMKAMGL 74
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
NT+ +YVFWN HE +PGK+ F G +L KFIK Q+ +Y+I+R GP+V AE+ +GG P
Sbjct: 75 NTVTTYVFWNYHEEAPGKWNFSGEKDLQKFIKTAQETGLYVIIRPGPYVCAEWEFGGYPW 134
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY--- 188
WL R D + F K+++ + + ++ + GGP+I+ Q ENE+G Y
Sbjct: 135 WLQKNKELEIRRDNKAFSEECWKYISQLAKQITPMQI--TNGGPVIMVQAENEFGSYVAQ 192
Query: 189 -ESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQ-----QFDTPDPVINTCNSFYCDQFTP 242
+ E ++Y+ +M + I VP + + + + T N
Sbjct: 193 RKDIPLEEHRKYSHKIKEMLLKSGISVPLFTSDGSSLFKGGSVEGALPTANGESDIDVLK 252
Query: 243 HSPSM------PKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHG 296
S + P + E +PGW + +E++ + + G S NYYM HG
Sbjct: 253 KSINEYNGGKGPYMIAEYYPGWLDHWAEPFVKVSTEEVVKQTNLYIENGVSF-NYYMIHG 311
Query: 297 GTNFGRTAGGPFIT--------TSYDYEAPIDEYGLPRNPKWGHLKELHGAI 340
GTNFG T+G + TSYDY+API E G PK+ L+++ I
Sbjct: 312 GTNFGFTSGANYDKDHDIQPDLTSYDYDAPISEAGWA-TPKYNALRKIFQKI 362
>gi|380694789|ref|ZP_09859648.1| beta-galactosidase [Bacteroides faecis MAJ27]
Length = 781
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 110/341 (32%), Positives = 170/341 (49%), Gaps = 31/341 (9%)
Query: 16 FSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIE 75
FS+S T G ++ ++NG ++ +A IHYPR W ++ +K G+NTI
Sbjct: 16 FSTSCTQSSKGTFEVGDKTFLLNGEPFVVKAAEIHYPRIPKEYWEHRIKMSKALGMNTIC 75
Query: 76 SYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHY 135
YVFWN HE GKY F G+ ++ F ++ Q+ MY+I+R GP+V AE+ GG+P WL
Sbjct: 76 LYVFWNFHEPEEGKYDFTGQKDIAAFCRMAQENGMYVIVRPGPYVCAEWEMGGLPWWLLK 135
Query: 136 IPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESFYGE 194
R E Y+M++ + ++ K+ L S+GG II+ QVENEYG +
Sbjct: 136 KEDIKLR---EQDPYYMERVKLFMNEVGKQLADLQISKGGNIIMVQVENEYGSF------ 186
Query: 195 GGKRYALWAAKMAVAQN--IGVPWIMCQ-----QFDTPDPVINTCN-------SFYCDQF 240
G + + A + V Q GVP C + + D ++ T N ++
Sbjct: 187 GIDKPYIAAIRDMVKQAGFTGVPLFQCDWNSNFENNALDDLLWTVNFGTGANIDQQFERL 246
Query: 241 TPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNF 300
P+ P + +E W GWF +G + R +E++ + + S + YM HGGT+F
Sbjct: 247 KELRPNTPLMCSEFWSGWFDHWGAKHETRSAEELVKGMKEMLDRNISF-SLYMTHGGTSF 305
Query: 301 GRTAGGPF-----ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
G G F TSYDY+API+E G PK+ +++L
Sbjct: 306 GHWGGANFPNFSPTCTSYDYDAPINESG-KVTPKFLEVRDL 345
>gi|281422858|ref|ZP_06253857.1| beta-galactosidase [Prevotella copri DSM 18205]
gi|281403124|gb|EFB33804.1| beta-galactosidase [Prevotella copri DSM 18205]
Length = 788
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 113/334 (33%), Positives = 165/334 (49%), Gaps = 35/334 (10%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
G T ++ ++NG+ ++ +A +HYPR W ++ K G+NT+ YVFWN HE
Sbjct: 29 GGTFTTGDKTFLLNGKPFVVKAAELHYPRIPRAYWEHRIKMCKALGMNTVCLYVFWNIHE 88
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
GK+ F G ++ F ++ Q+ MY+I+R GP+V AE+ GG+P WL R
Sbjct: 89 QEEGKFDFTGNNDVAAFCRLAQKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLR-- 146
Query: 145 TEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
E Y MQ+ ++ K+ L GGPII+ QVENEYG Y GK +
Sbjct: 147 -EQDPYFMQRVEIFEKEVGKQLAPLTIQNGGPIIMVQVENEYGSY-------GKDKPYVS 198
Query: 204 AKMAVAQNIGVPWIMCQQFDTPDPVIN--------TCN---SFYCDQ----FTPHSPSMP 248
A + + G + Q D +N T N DQ P+ P
Sbjct: 199 AIRDIVRKSGFDKVSLFQCDWSSNFLNNGLDDLTWTMNFGTGANIDQQFKRLGEVRPNAP 258
Query: 249 KIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-- 306
K+ +E W GWF +G R RP++D+ + KG S + YM HGGT+FG AG
Sbjct: 259 KMCSEFWSGWFDKWGARHETRPAKDMVEGMDEMLSKGISF-SLYMTHGGTSFGHWAGANS 317
Query: 307 ----PFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
P + TSYDY+API+E+GL PK+ L+++
Sbjct: 318 PGFQPDV-TSYDYDAPINEWGLA-TPKFYELQKM 349
>gi|293334807|ref|NP_001170541.1| uncharacterized protein LOC100384558 [Zea mays]
gi|238005922|gb|ACR33996.1| unknown [Zea mays]
Length = 345
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 87/241 (36%), Positives = 127/241 (52%), Gaps = 28/241 (11%)
Query: 497 LKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMT 556
++ + VL + S GHA AF N + G G + F + P+ LK G N +A+L+ T
Sbjct: 3 IRRDIKTVLEVNSHGHASVAFVNTKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAVLAST 62
Query: 557 VGLQNAGPFYEWVGAGITSVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNIN 616
+G+ ++G + E AG+ V+I G N+GTLDL+ W + +GL GE IY ++
Sbjct: 63 MGMMDSGAYLEHRLAGVDRVQIKGLNAGTLDLTNNGWGHIVGLVGEQKQIYTDKGMGSVT 122
Query: 617 WVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSS 676
W + ++PLTWYK P G++PI LDM MGKGL ++NG+ IGRYW
Sbjct: 123 WKPAVN---DRPLTWYKRHFDMPSGEDPIVLDMSTMGKGLMFVNGQGIGRYW-------- 171
Query: 677 PHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKIT 736
Y+ G PSQ+ YHIPRS+ + +N+LV+FEE+ G P I
Sbjct: 172 --------ISYKHAL---------GRPSQQLYHIPRSFLRQKDNVLVLFEEEFGRPDAIM 214
Query: 737 F 737
Sbjct: 215 I 215
>gi|251795198|ref|YP_003009929.1| beta-galactosidase [Paenibacillus sp. JDR-2]
gi|247542824|gb|ACS99842.1| Beta-galactosidase [Paenibacillus sp. JDR-2]
Length = 584
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 103/314 (32%), Positives = 154/314 (49%), Gaps = 26/314 (8%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+T + L++N R II+ AIHY R VP W + + K G NT+E+YV WN HE
Sbjct: 4 LTIQGKQLMLNDRPFRIIAGAIHYFRVVPEYWRDRLLKLKACGFNTVETYVPWNFHEPEE 63
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G++ F G +L KFI + + +Y I+R P++ AE+ +GG+P WL PG R +P
Sbjct: 64 GRFVFEGMADLEKFIALAGELGLYAIVRPSPYICAEWEFGGLPAWLLKDPGMRLRCSYKP 123
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
F + ++ + +++GGP+I Q+ENEYG Y + K Y + +
Sbjct: 124 FLDKADAYYDELIPRLT--PFLSTKGGPLIAMQIENEYGSYGN-----DKTYLNYLKEAL 176
Query: 208 VAQNIGV-------PWIMCQQFDTPDPVINTCN--SFYCDQFT---PHSPSMPKIWTENW 255
V + + V P Q + V T N S + F + P P + E W
Sbjct: 177 VKRGVDVLLFTSDGPEDFMLQGGMVEGVWETVNFGSRSAEAFAKLQEYQPDQPLMCMEFW 236
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF------I 309
GWF +G R + D+A + G SV N+YM+HGGTNFG +G +
Sbjct: 237 NGWFDHWGETHHTRGAADVALVLDEMLAAGASV-NFYMFHGGTNFGFFSGANYTDRLLPT 295
Query: 310 TTSYDYEAPIDEYG 323
TSYDY++P+ E G
Sbjct: 296 VTSYDYDSPLSESG 309
>gi|427385726|ref|ZP_18882033.1| hypothetical protein HMPREF9447_03066 [Bacteroides oleiciplenus YIT
12058]
gi|425726765|gb|EKU89628.1| hypothetical protein HMPREF9447_03066 [Bacteroides oleiciplenus YIT
12058]
Length = 1106
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 113/323 (34%), Positives = 156/323 (48%), Gaps = 33/323 (10%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
S ++NG+ ++ +A +HYPR W ++ K G+NT+ YVFWN HE PG Y F
Sbjct: 356 SFLLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGTYDFT 415
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
+ +L +F ++ QQ MY+ILR GP+V AE+ GG+P WL R F +
Sbjct: 416 EQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFIERVN 475
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYG-----------EGGKRYAL- 201
F + +K L + GGPII+ QVENEYG Y + G G AL
Sbjct: 476 LFEEAVAKQVK--DLTIANGGPIIMVQVENEYGSYGADKGYVSQIRDIVRTHFGNDIALF 533
Query: 202 ---WAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGW 258
WA+ + + W M F T V + P+SP M +E W GW
Sbjct: 534 QCDWASNFTLNGLDDLIWTM--NFGTGANVDQQFAKL--KKLRPNSPLMC---SEFWSGW 586
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG------GPFITTS 312
F +G RP+ED+ + +G S + YM HGGTN+G AG P + TS
Sbjct: 587 FDKWGANHETRPAEDMIKGIDDMLSRGISF-SLYMTHGGTNWGHWAGANSPGFAPDV-TS 644
Query: 313 YDYEAPIDEYGLPRNPKWGHLKE 335
YDY+API E G PK+ L+E
Sbjct: 645 YDYDAPISESG-QTTPKYWKLRE 666
>gi|261880887|ref|ZP_06007314.1| family 35 glycosyl hydrolase [Prevotella bergensis DSM 17361]
gi|270332394|gb|EFA43180.1| family 35 glycosyl hydrolase [Prevotella bergensis DSM 17361]
Length = 789
Score = 159 bits (403), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 111/340 (32%), Positives = 162/340 (47%), Gaps = 36/340 (10%)
Query: 21 TYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFW 80
T G+ T + ++N R ++ +A +HYPR W ++ K G+NTI YVFW
Sbjct: 25 TTAAPGDFTVGKGTFLLNNRPFVVKAAELHYPRIPRAYWDHRIKMCKALGMNTICLYVFW 84
Query: 81 NGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTV 140
N HE G++ F G ++ F ++ Q+ MY+I+R GP+V AE+ GG+P WL
Sbjct: 85 NIHEQREGEFDFSGNSDVAAFCRLTQKNGMYIIVRPGPYVCAEWEMGGLPWWLLKKKDIR 144
Query: 141 FRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY------------ 188
R F ++ F + + + L GGPII+ QVENEYG Y
Sbjct: 145 LRESDPYFMERVEIFEQKVAEQLA--PLTIQNGGPIIMVQVENEYGSYGEDKKYVGQIRD 202
Query: 189 --ESFYGEGGKRYAL----WAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTP 242
++ G+ AL WA+ + W M F T N F +
Sbjct: 203 VLRKYWYTNGRGPALFQCDWASNFEKNGLEDLIWTM--NFGTG---ANIDAQFM--RLGE 255
Query: 243 HSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGR 302
P PK+ +E W GWF +G R RP++D+ + KG S + YM HGGT+FG
Sbjct: 256 LRPDAPKMCSEFWSGWFDKWGARHETRPAKDMVAGIDEMLSKGISF-SLYMTHGGTSFGH 314
Query: 303 TAG------GPFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
AG P + TSYDY+API+EYG PK+ L+++
Sbjct: 315 WAGANSPGFAPDV-TSYDYDAPINEYG-QVTPKFWELRKM 352
>gi|410865123|ref|YP_006979734.1| Beta-galactosidase [Propionibacterium acidipropionici ATCC 4875]
gi|410821764|gb|AFV88379.1| Beta-galactosidase [Propionibacterium acidipropionici ATCC 4875]
Length = 591
Score = 159 bits (403), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 110/336 (32%), Positives = 154/336 (45%), Gaps = 32/336 (9%)
Query: 29 TYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPG 88
T +++GR I+S AIHY R P W + +A+ G+NTIE+YV WN HE G
Sbjct: 5 TIGEHDFLLDGRPHRILSGAIHYFRIHPDQWADRIHKARLMGLNTIETYVAWNAHEPVEG 64
Query: 89 KYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPF 148
++ + G +L F+K + M+ I+R P++ AE++ GG+P WL R D F
Sbjct: 65 QWSWEGGLDLAAFLKAVADEGMHAIVRPAPYICAEWDNGGLPAWLFGEKAAGVRRDEPVF 124
Query: 149 KYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAV 208
+Q ++ + +++ E L GGP+IL Q+ENEYG Y S Y +
Sbjct: 125 MAAVQAYLRRVYEVI--EPLQIHHGGPVILVQIENEYGAYGS-----DPEYLRKLVDITS 177
Query: 209 AQNIGVPWIMCQQFDT------PDPVINTCNSF------YCDQFTPHSPSMPKIWTENWP 256
+ I VP Q + P + SF H P+ P + E W
Sbjct: 178 SAGITVPLTTVDQPEDGMLAAGSLPGLLRTGSFGSRSPERLATLRRHQPTGPLMCMEYWN 237
Query: 257 GWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG----GPF--IT 310
GWF +G +E A + G SV N YM GGTNFG T G G + I
Sbjct: 238 GWFDDWGTPHHTTDAEASAADLDALLGSGASV-NLYMLCGGTNFGLTNGANDKGTYEPIV 296
Query: 311 TSYDYEAPIDEYGLPRNPKW------GHLKELHGAI 340
TSYDY+AP+DE G P W G EL G +
Sbjct: 297 TSYDYDAPLDEAGHPTAKYWAFREVIGRYTELPGEV 332
>gi|160890905|ref|ZP_02071908.1| hypothetical protein BACUNI_03350 [Bacteroides uniformis ATCC 8492]
gi|156859904|gb|EDO53335.1| glycosyl hydrolase family 35 [Bacteroides uniformis ATCC 8492]
Length = 1106
Score = 159 bits (402), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 116/345 (33%), Positives = 162/345 (46%), Gaps = 35/345 (10%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
G+ + + ++NG+ +I +A +HYPR W ++ K G+NTI YVFWN HE
Sbjct: 349 GDFSAGKGTFLLNGKPFVIKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHES 408
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
PG + F G+ +L +F ++ QQ MY+ILR GP+V AE+ GG+P WL R
Sbjct: 409 QPGVFDFTGQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESD 468
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
F + F + + + + GGPII+ QVENEYG Y GE K Y
Sbjct: 469 PYFMERVGIFEKAVAEQVA--GMTIQNGGPIIMVQVENEYGSY----GED-KGYVSQIRD 521
Query: 206 MAVAQNIGVPWIMCQ-----QFDTPDPVINTCN----SFYCDQFTPHS---PSMPKIWTE 253
+ A GV C + ++ T N + QF P P P + +E
Sbjct: 522 IVRANYPGVALFQCDWASNFTKNGLHDLVWTMNFGTGANIDQQFAPLKKLRPDSPLMCSE 581
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG------P 307
W GWF +G RP+ D+ + KG S + YM HGGTN+G AG P
Sbjct: 582 FWSGWFDKWGANHETRPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAP 640
Query: 308 FITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGER 352
+T SYDY+API E G W EL A+ +NGE+
Sbjct: 641 DVT-SYDYDAPISESGQTTPKYW----ELRKALS----KYMNGEK 676
>gi|423303842|ref|ZP_17281841.1| hypothetical protein HMPREF1072_00781 [Bacteroides uniformis
CL03T00C23]
gi|423307438|ref|ZP_17285428.1| hypothetical protein HMPREF1073_00178 [Bacteroides uniformis
CL03T12C37]
gi|392687173|gb|EIY80470.1| hypothetical protein HMPREF1072_00781 [Bacteroides uniformis
CL03T00C23]
gi|392690047|gb|EIY83318.1| hypothetical protein HMPREF1073_00178 [Bacteroides uniformis
CL03T12C37]
Length = 1106
Score = 159 bits (402), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 116/345 (33%), Positives = 162/345 (46%), Gaps = 35/345 (10%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
G+ + + ++NG+ +I +A +HYPR W ++ K G+NTI YVFWN HE
Sbjct: 349 GDFSAGKGTFLLNGKPFVIKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHES 408
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
PG + F G+ +L +F ++ QQ MY+ILR GP+V AE+ GG+P WL R
Sbjct: 409 QPGVFDFTGQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESD 468
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
F + F + + + + GGPII+ QVENEYG Y GE K Y
Sbjct: 469 PYFMERVGIFEKAVAEQVA--GMTIQNGGPIIMVQVENEYGSY----GED-KGYVSQIRD 521
Query: 206 MAVAQNIGVPWIMCQ-----QFDTPDPVINTCN----SFYCDQFTPHS---PSMPKIWTE 253
+ A GV C + ++ T N + QF P P P + +E
Sbjct: 522 IVRANYPGVALFQCDWASNFTKNGLHDLVWTMNFGTGANIDQQFAPLKKLRPDSPLMCSE 581
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG------P 307
W GWF +G RP+ D+ + KG S + YM HGGTN+G AG P
Sbjct: 582 FWSGWFDKWGANHETRPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAP 640
Query: 308 FITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGER 352
+T SYDY+API E G W EL A+ +NGE+
Sbjct: 641 DVT-SYDYDAPISESGQTTPKYW----ELRKALS----KYMNGEK 676
>gi|317479674|ref|ZP_07938798.1| glycosyl hydrolase family 35 [Bacteroides sp. 4_1_36]
gi|316904175|gb|EFV26005.1| glycosyl hydrolase family 35 [Bacteroides sp. 4_1_36]
Length = 1106
Score = 159 bits (402), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 116/345 (33%), Positives = 162/345 (46%), Gaps = 35/345 (10%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
G+ + + ++NG+ +I +A +HYPR W ++ K G+NTI YVFWN HE
Sbjct: 349 GDFSAGKGTFLLNGKPFVIKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHES 408
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
PG + F G+ +L +F ++ QQ MY+ILR GP+V AE+ GG+P WL R
Sbjct: 409 QPGVFDFTGQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESD 468
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
F + F + + + + GGPII+ QVENEYG Y GE K Y
Sbjct: 469 PYFMERVGIFEKAVAEQVA--GMTIQNGGPIIMVQVENEYGSY----GED-KGYVSQIRD 521
Query: 206 MAVAQNIGVPWIMCQ-----QFDTPDPVINTCN----SFYCDQFTPHS---PSMPKIWTE 253
+ A GV C + ++ T N + QF P P P + +E
Sbjct: 522 IVRANYPGVALFQCDWASNFTKNGLHDLVWTMNFGTGANIDQQFAPLKKLRPDSPLMCSE 581
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG------P 307
W GWF +G RP+ D+ + KG S + YM HGGTN+G AG P
Sbjct: 582 FWSGWFDKWGANHETRPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAP 640
Query: 308 FITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGER 352
+T SYDY+API E G W EL A+ +NGE+
Sbjct: 641 DVT-SYDYDAPISESGQTTPKYW----ELRKALS----KYMNGEK 676
>gi|297194972|ref|ZP_06912370.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
gi|297152570|gb|EFH31854.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
Length = 599
Score = 159 bits (401), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 110/330 (33%), Positives = 166/330 (50%), Gaps = 39/330 (11%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+ T +++GR ++S A+HY R G W + + G+N +E+YV WN HE
Sbjct: 10 DFTVGDTDFLLDGRPVRLLSGALHYFRVHEGQWGHRLAMLRAMGLNCVETYVPWNLHEPE 69
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PG+Y G L +F+ + A M+ I+R GP++ AE+ GG+P WL G R +
Sbjct: 70 PGRYADDG--ALGRFLDAVHAAGMWAIVRPGPYICAEWENGGLPFWLTGRVGRRVRTEDP 127
Query: 147 PFKYHMQKFMT-LIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
+ H++++ T L+ +++RE ++GGP+++ QVENEYG Y S +GG Y +
Sbjct: 128 EYLGHVERWFTRLLPQVVERE---ITRGGPVVMVQVENEYGSYGS---DGG--YLRQLVE 179
Query: 206 MAVAQNIGVPWI--------MCQQFDTPDPVINTCN--SFYCDQFTP---HSPSMPKIWT 252
+ + +GVP M P V+ T N S + F H P+ P +
Sbjct: 180 LLRSCGVGVPLFTSDGPEDHMLSGGSVPG-VLATVNFGSGAGEAFAALRRHRPTGPLMCM 238
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG------ 306
E W GWF+ +G R +ED A ++ + G SV N YM HGGT+FG AG
Sbjct: 239 EFWCGWFEHWGAEPARRDAEDAARALREILEAGASV-NVYMAHGGTSFGGWAGANRSGEL 297
Query: 307 ------PFITTSYDYEAPIDEYGLPRNPKW 330
P + TSYDY+AP+DE G P W
Sbjct: 298 HDGVLEPTV-TSYDYDAPVDEAGRPTEKFW 326
>gi|323358527|ref|YP_004224923.1| beta-galactosidase [Microbacterium testaceum StLB037]
gi|323274898|dbj|BAJ75043.1| beta-galactosidase [Microbacterium testaceum StLB037]
Length = 574
Score = 159 bits (401), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 107/319 (33%), Positives = 156/319 (48%), Gaps = 34/319 (10%)
Query: 29 TYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPG 88
T +++GR +IS +HY R P W ++ AK G+NTIE+YV WN HE G
Sbjct: 5 TIGETDFLLDGRPHQVISGTLHYFRIHPEHWADRIRTAKAMGLNTIETYVAWNAHEPVRG 64
Query: 89 KYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPF 148
++ G +L +F+ +I ++ I+R GP++ AE++ GG+PVWL PG R F
Sbjct: 65 EWDATGWNDLGRFLDLIAAEGLHAIVRPGPYICAEWHNGGLPVWLTSTPGIGIRRSEPQF 124
Query: 149 KYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAV 208
+ +++ + +++ ++ +GG ++L Q+ENEYG Y S K Y ++
Sbjct: 125 VEAVSEYLRRVYEIVAPRQI--DRGGNVVLVQIENEYGAYGS-----DKEYLRELVRVTK 177
Query: 209 AQNIGVPWIMCQQ------FDTPDPVINTCNSF------YCDQFTPHSPSMPKIWTENWP 256
I VP Q P ++ SF H P+ P + +E W
Sbjct: 178 DAGITVPLTTVDQPMPWMLEAGSLPELHLTGSFGSRSAERLATLREHQPTGPLMCSEFWD 237
Query: 257 GWFKTFGG----RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG----GPF 308
GWF +G DP + D+ +A G SV N YM HGGTNFG T G G F
Sbjct: 238 GWFDWWGSIHHTTDPAASAHDLDVLLA----AGASV-NIYMVHGGTNFGTTNGANDKGRF 292
Query: 309 --ITTSYDYEAPIDEYGLP 325
I TSYDY+APIDE G P
Sbjct: 293 DPIVTSYDYDAPIDESGHP 311
>gi|270295887|ref|ZP_06202087.1| beta-galactosidase [Bacteroides sp. D20]
gi|270273291|gb|EFA19153.1| beta-galactosidase [Bacteroides sp. D20]
Length = 1106
Score = 159 bits (401), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 116/345 (33%), Positives = 162/345 (46%), Gaps = 35/345 (10%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
G+ + + ++NG+ +I +A +HYPR W ++ K G+NTI YVFWN HE
Sbjct: 349 GDFSAGKGTFLLNGKPFVIKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHES 408
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
PG + F G+ +L +F ++ QQ MY+ILR GP+V AE+ GG+P WL R
Sbjct: 409 QPGVFDFTGQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESD 468
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
F + F + + + + GGPII+ QVENEYG Y GE K Y
Sbjct: 469 PYFMERVGIFEKAVAEQVA--GMTIQNGGPIIMVQVENEYGSY----GED-KGYVSQIRD 521
Query: 206 MAVAQNIGVPWIMCQ-----QFDTPDPVINTCN----SFYCDQFTPHS---PSMPKIWTE 253
+ A GV C + ++ T N + QF P P P + +E
Sbjct: 522 IVRANYPGVALFQCDWASNFTKNGLHDLVWTMNFGTGANIDQQFAPLKKLRPDSPLMCSE 581
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG------P 307
W GWF +G RP+ D+ + KG S + YM HGGTN+G AG P
Sbjct: 582 FWSGWFDKWGANHETRPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAP 640
Query: 308 FITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGER 352
+T SYDY+API E G W EL A+ +NGE+
Sbjct: 641 DVT-SYDYDAPISESGQTTPKYW----ELRKALS----KYMNGEK 676
>gi|423295816|ref|ZP_17273943.1| hypothetical protein HMPREF1070_02608 [Bacteroides ovatus
CL03T12C18]
gi|392671544|gb|EIY65016.1| hypothetical protein HMPREF1070_02608 [Bacteroides ovatus
CL03T12C18]
Length = 782
Score = 159 bits (401), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 111/344 (32%), Positives = 170/344 (49%), Gaps = 31/344 (9%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
+ FS+S + ++ ++NG+ ++ +A IHYPR W ++ K G+N
Sbjct: 13 VTVFSTSCSQSSKETFEIGDKTFLLNGKPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMN 72
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
TI YVFWN HE GKY F G+ ++ F ++ Q+ MY+I+R GP+V AE+ GG+P W
Sbjct: 73 TICLYVFWNFHEPEEGKYDFTGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWW 132
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESF 191
L R E Y+M++ + ++ K+ L S+GG II+ QVENEYG +
Sbjct: 133 LLKKKDIKLR---EQDPYYMERVKLFMNEVGKQLADLQISKGGNIIMVQVENEYGSF--- 186
Query: 192 YGEGGKRYALWAAKMAVAQN--IGVPWIMCQ-----QFDTPDPVINTCN----SFYCDQF 240
G + + + V Q GVP C + + D ++ T N + DQF
Sbjct: 187 ---GIDKPYIAEIRDIVKQAGFTGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDDQF 243
Query: 241 ---TPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGG 297
P +P + +E W GWF +G + R +ED+ + + S + YM HGG
Sbjct: 244 KRLQELRPDIPLMCSEFWSGWFDHWGAKHETRSAEDLVKGMKEMLDRNISF-SLYMTHGG 302
Query: 298 TNFGRTAGGPF-----ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
T+FG G F TSYDY+API+E G PK+ ++ L
Sbjct: 303 TSFGHWGGANFPNFSPTCTSYDYDAPINESG-KVTPKYFEVRNL 345
>gi|365118603|ref|ZP_09337115.1| hypothetical protein HMPREF1033_00461 [Tannerella sp.
6_1_58FAA_CT1]
gi|363649320|gb|EHL88436.1| hypothetical protein HMPREF1033_00461 [Tannerella sp.
6_1_58FAA_CT1]
Length = 823
Score = 159 bits (401), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 125/378 (33%), Positives = 174/378 (46%), Gaps = 41/378 (10%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+ ++NG+ +I +A +HYPR W ++ K G+NTI YVFWN HE PG++ F
Sbjct: 74 TFLLNGKPFIIRAAELHYPRIPKPYWEQRIKLCKALGMNTICLYVFWNLHEPRPGEFDFT 133
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G+ +L F ++ QQ MY+ILR GP+V AE+ GG+P WL R F +
Sbjct: 134 GQNDLAAFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLREADPYFIERVN 193
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIG 213
F + + L GGPII+ QVENEYG YGE + +L + V N G
Sbjct: 194 IFEQEVARQVG--GLTIQNGGPIIMVQVENEYGS----YGESKEYVSL--IRDIVRTNFG 245
Query: 214 -VPWIMCQ------QFDTPDPV--INTCNSFYCDQ----FTPHSPSMPKIWTENWPGWFK 260
V C + PD + IN DQ P P + +E W GWF
Sbjct: 246 DVTLFQCDWASNFTKNALPDLLWTINFGTGANIDQQFAGLKKLRPDSPLMCSEFWSGWFD 305
Query: 261 TFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG------GPFITTSYD 314
+G RP+ D+ + KG S + YM HGGTN+G AG P + TSYD
Sbjct: 306 KWGANHETRPASDMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDV-TSYD 363
Query: 315 YEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADVYADSSGACA 374
Y+API E G PK+ L++ G +NGE+ + + A A
Sbjct: 364 YDAPISESG-QTTPKYWALRKTLG-------KYMNGEKQTKVPDMIKSVSIPAFQFTEVA 415
Query: 375 AFLANM----DDKNDKTV 388
AN+ DKN +T+
Sbjct: 416 PLFANLPISKKDKNIRTM 433
>gi|256393561|ref|YP_003115125.1| beta-galactosidase [Catenulispora acidiphila DSM 44928]
gi|256359787|gb|ACU73284.1| Beta-galactosidase [Catenulispora acidiphila DSM 44928]
Length = 584
Score = 158 bits (400), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 115/375 (30%), Positives = 177/375 (47%), Gaps = 47/375 (12%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
++T D SL +G+ I+S +HY R P W +++A+ G+NTI++Y+ WN HE
Sbjct: 5 DITGDGFSL--DGQPFRIVSGGLHYFRVHPAQWSDRLRKARLMGLNTIDTYIPWNLHERR 62
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PG + FGG +L F+ ++++LR GP++ E+ GG+P WL P R+
Sbjct: 63 PGTFDFGGILDLAAFLDAAAAEGLHVLLRPGPYICGEWEGGGLPSWLLADPDLALRSTDP 122
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
F ++ ++ I+ ++ ++GGP+I QVENEYG Y S + Y +
Sbjct: 123 AFLQAVEAYLDAIMPIVLPR--LGTRGGPVIAVQVENEYGAYGSDTAYMERLY-----EA 175
Query: 207 AVAQNIGVPWIMCQQ----FDTPDP-VINTCN-----SFYCDQFTPHSPSMPKIWTENWP 256
++ I VP+ Q D P V+ T N + P+ P + E W
Sbjct: 176 LTSRGIDVPFFTSDQPNDLADGALPGVLATANFGGKVTASLAALRAQQPTGPLMCAEFWN 235
Query: 257 GWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG------PFIT 310
GWF +GG R +ED ++ Q G SV N+YM+HGGTNFG T G
Sbjct: 236 GWFDYWGGTHAQRSAEDAGAALEEMLQAGASV-NFYMFHGGTNFGFTNGANDKGTYRATV 294
Query: 311 TSYDYEAPIDEYGLPRNPKWGHLKELHG--------------------AIKLCEHALLNG 350
TSYDY++P+DE G P K+ + + G ++ L A L
Sbjct: 295 TSYDYDSPLDEAGDPTE-KYRRFRSIIGKYETVPDEEVPEPGEKLAPVSVALTGRAALFS 353
Query: 351 ERSNLSLGSSQEADV 365
E S SLG +Q ++
Sbjct: 354 EASLASLGVAQNSET 368
>gi|402304595|ref|ZP_10823662.1| glycosyl hydrolase family 35 [Prevotella sp. MSX73]
gi|400380871|gb|EJP33679.1| glycosyl hydrolase family 35 [Prevotella sp. MSX73]
Length = 778
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 112/337 (33%), Positives = 166/337 (49%), Gaps = 45/337 (13%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
G T ++ ++NG+ ++ +A +HYPR W ++ K G+NT+ YVFWN HE
Sbjct: 18 GGTFTVGDKTFLLNGKPFVVKAAELHYPRIPRPYWEHRIKMCKALGMNTVCLYVFWNIHE 77
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
GK+ F G ++ +F ++ Q+ +Y+I+R GP+V AE+ GG+P WL R
Sbjct: 78 QQEGKFDFTGNNDVAEFCRLAQRNGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLR-- 135
Query: 145 TEPFKYHMQKFMTLIVDMMKRE------KLFASQGGPIILAQVENEYGYYESFYGEGGKR 198
EP Y M++ V + +R+ L GGPII+ QVENEYG Y GK
Sbjct: 136 -EPDPYFMER-----VKLFERKVGEQLASLTIQNGGPIIMVQVENEYGSY-------GKN 182
Query: 199 YALWAAKMAVAQNIGVPWIMCQQFDTP--------DPVINTCN---SFYCDQ----FTPH 243
A +A + + G + Q D D ++ T N DQ
Sbjct: 183 KAYVSAIRDIVRRSGFDKVTLFQCDWASNFEKNGLDDLVWTMNFGTGADIDQQFRRLGEL 242
Query: 244 SPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRT 303
P+ P++ +E W GWF +G R RP++ + + KG S + YM HGGT+FG
Sbjct: 243 RPNAPQMCSEFWSGWFDKWGARHETRPAKAMVEGIDEMLSKGISF-SLYMTHGGTSFGHW 301
Query: 304 AG------GPFITTSYDYEAPIDEYGLPRNPKWGHLK 334
AG P + TSYDY+API+EYG PK+ L+
Sbjct: 302 AGANSPGFAPDV-TSYDYDAPINEYG-QATPKYWELR 336
>gi|340346435|ref|ZP_08669560.1| family 35 glycosyl hydrolase [Prevotella dentalis DSM 3688]
gi|339611892|gb|EGQ16709.1| family 35 glycosyl hydrolase [Prevotella dentalis DSM 3688]
Length = 859
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 111/337 (32%), Positives = 165/337 (48%), Gaps = 34/337 (10%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
G+ T + ++NG+ ++ +A +HYPR W ++ K G+NT+ YVFWN HE
Sbjct: 92 GDFTTGKGTFLLNGKPFVVKAAEVHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQ 151
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
G++ F G+ ++ F ++ QQ MY+I+R GP+V AE+ GG+P WL R
Sbjct: 152 REGQFDFTGQNDVAAFCRLAQQNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREQD 211
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG------YYESFYGEGGKRY 199
F ++ F + + + L +GGPII+ QVENEYG Y S + +RY
Sbjct: 212 PYFMERVELFEQKVAEQLA--PLTIRRGGPIIMVQVENEYGSYGEDKAYVSQIRDVLRRY 269
Query: 200 ALWA--------AKMAVAQNIGVPWIMCQQFDTPDPVINTCN----SFYCDQFT---PHS 244
W+ + A W + D ++ T N + DQF
Sbjct: 270 --WSLSPTGEGRGEAASPLMFQCDWSSNFTRNGLDDLVWTMNFGTGANINDQFRRLGELR 327
Query: 245 PSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTA 304
P PK+ +E W GWF +G R RP+ D+ + KG S + YM HGGT+FG A
Sbjct: 328 PDAPKMCSEFWSGWFDKWGARHETRPARDMVAGIDEMLSKGISF-SLYMTHGGTSFGHWA 386
Query: 305 G------GPFITTSYDYEAPIDEYGLPRNPKWGHLKE 335
G P + TSYDY+API+EYG PK+ L++
Sbjct: 387 GANSPGFAPDV-TSYDYDAPINEYGQA-TPKFWELRK 421
>gi|433651261|ref|YP_007277640.1| beta-galactosidase [Prevotella dentalis DSM 3688]
gi|433301794|gb|AGB27610.1| beta-galactosidase [Prevotella dentalis DSM 3688]
Length = 797
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 111/337 (32%), Positives = 165/337 (48%), Gaps = 34/337 (10%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
G+ T + ++NG+ ++ +A +HYPR W ++ K G+NT+ YVFWN HE
Sbjct: 30 GDFTTGKGTFLLNGKPFVVKAAEVHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQ 89
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
G++ F G+ ++ F ++ QQ MY+I+R GP+V AE+ GG+P WL R
Sbjct: 90 REGQFDFTGQNDVAAFCRLAQQNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREQD 149
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG------YYESFYGEGGKRY 199
F ++ F + + + L +GGPII+ QVENEYG Y S + +RY
Sbjct: 150 PYFMERVELFEQKVAEQLA--PLTIRRGGPIIMVQVENEYGSYGEDKAYVSQIRDVLRRY 207
Query: 200 ALWA--------AKMAVAQNIGVPWIMCQQFDTPDPVINTCN----SFYCDQFT---PHS 244
W+ + A W + D ++ T N + DQF
Sbjct: 208 --WSLSPTGEGRGEAASPLMFQCDWSSNFTRNGLDDLVWTMNFGTGANINDQFRRLGELR 265
Query: 245 PSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTA 304
P PK+ +E W GWF +G R RP+ D+ + KG S + YM HGGT+FG A
Sbjct: 266 PDAPKMCSEFWSGWFDKWGARHETRPARDMVAGIDEMLSKGISF-SLYMTHGGTSFGHWA 324
Query: 305 G------GPFITTSYDYEAPIDEYGLPRNPKWGHLKE 335
G P + TSYDY+API+EYG PK+ L++
Sbjct: 325 GANSPGFAPDV-TSYDYDAPINEYGQA-TPKFWELRK 359
>gi|383112460|ref|ZP_09933253.1| hypothetical protein BSGG_0667 [Bacteroides sp. D2]
gi|313693132|gb|EFS29967.1| hypothetical protein BSGG_0667 [Bacteroides sp. D2]
Length = 782
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 111/344 (32%), Positives = 170/344 (49%), Gaps = 31/344 (9%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
+ FS+S + ++ ++NG+ ++ +A IHYPR W ++ K G+N
Sbjct: 13 VTVFSTSCSQSSKEIFEIGDKTFLLNGKPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMN 72
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
TI YVFWN HE GKY F G+ ++ F ++ Q+ MY+I+R GP+V AE+ GG+P W
Sbjct: 73 TICLYVFWNFHEPEEGKYDFTGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWW 132
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESF 191
L R E Y+M++ + ++ K+ L S+GG II+ QVENEYG +
Sbjct: 133 LLKKKDIKLR---EQDPYYMERVKLFMNEVGKQLTDLQISKGGNIIMVQVENEYGSF--- 186
Query: 192 YGEGGKRYALWAAKMAVAQN--IGVPWIMCQ-----QFDTPDPVINTCN----SFYCDQF 240
G + + + V Q GVP C + + D ++ T N + DQF
Sbjct: 187 ---GIDKPYIAEIRDIVKQAGFTGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDDQF 243
Query: 241 ---TPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGG 297
P +P + +E W GWF +G + R +ED+ + + S + YM HGG
Sbjct: 244 KRLQELRPDIPLMCSEFWSGWFDHWGAKHETRSAEDLVKGMKEMLDRNISF-SLYMTHGG 302
Query: 298 TNFGRTAGGPF-----ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
T+FG G F TSYDY+API+E G PK+ ++ L
Sbjct: 303 TSFGHWGGANFPNFSPTCTSYDYDAPINESG-KVTPKYFEVRNL 345
>gi|320106923|ref|YP_004182513.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
gi|319925444|gb|ADV82519.1| glycoside hydrolase family 35 [Terriglobus saanensis SP1PR4]
Length = 633
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 116/361 (32%), Positives = 174/361 (48%), Gaps = 50/361 (13%)
Query: 11 ALLIFFSSSITYCFA----GNVTYDSR----SLIINGRRELIISAAIHYPRSVPGMWPGL 62
A L+F + +I+ A G+VT+ R +NG ++S +HY R W
Sbjct: 17 AALLFMACTISAQTAKMPAGSVTHTFRVAGDHFELNGEPVQLLSGEMHYARIPREYWRAR 76
Query: 63 VQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAA 122
+Q AK G+NT+ +Y+FWN HE PG Y F G ++ F+K+ Q+ + +ILR GP+ A
Sbjct: 77 LQMAKAMGLNTVATYIFWNVHEPKPGVYDFSGNHDVAAFVKMAQEEGLNVILRAGPYACA 136
Query: 123 EYNYGGIPVWLHYIP--GTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQ 180
E+ +GG P WL P G+ R++ E + +++++ + M L S GGPI+ Q
Sbjct: 137 EWEFGGYPSWLMKDPKMGSALRSNDEVYMAPVERWIKRLGQEMV--PLLISNGGPIVAVQ 194
Query: 181 VENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVIN---------- 230
VENEYG + G K+Y A + + QN G D ++N
Sbjct: 195 VENEYGDF-----GGDKKYL--AHMLEIFQNAGFKDSFLYTVDPSKALVNGSLEGLPSGV 247
Query: 231 ---TCNSFYCDQFTPH-SPSMPKIWTENWPGWFKTFGGRDPHRP----SEDIAFSVARFF 282
N+ H P P +E WPGWF +G RP +DIA+++
Sbjct: 248 NFGVGNAERGLTALAHLRPGQPLFASEYWPGWFDHWGHPHETRPIPPQLKDIAYTLDH-- 305
Query: 283 QKGGSVHNYYMYHGGTNFGRTAGGPFI-------TTSYDYEAPIDEYGLPRNPKWGHLKE 335
S N YM+HGGT+FG +G + TSYDY+AP+DE G P PK+ ++
Sbjct: 306 ---KSSINIYMFHGGTSFGFMSGASWTGGEYLPDVTSYDYDAPLDEAGHP-TPKFYAYRD 361
Query: 336 L 336
L
Sbjct: 362 L 362
>gi|423226297|ref|ZP_17212763.1| hypothetical protein HMPREF1062_04949 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392629725|gb|EIY23731.1| hypothetical protein HMPREF1062_04949 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 1106
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 110/323 (34%), Positives = 155/323 (47%), Gaps = 33/323 (10%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+ ++NG+ ++ +A +HYPR W ++ K G+NT+ YVFWN HE PG Y F
Sbjct: 356 TFLLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGVYDFT 415
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
+ +L +F ++ QQ MY+ILR GP+V AE+ GG+P WL R F +
Sbjct: 416 EQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDVRLRESDPYFIERVA 475
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY--------------ESFYGEGGKRY 199
F + +K L + GGPII+ QVENEYG Y + +G G +
Sbjct: 476 LFEEAVAKQVK--DLTIANGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANFGNGIALF 533
Query: 200 AL-WAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGW 258
WA+ + + W M F T V Q P+SP M +E W GW
Sbjct: 534 QCDWASNFTLNGLDDLIWTM--NFGTGANVDQQFAKL--KQLRPNSPLMC---SEFWSGW 586
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG------GPFITTS 312
F +G RP+ D+ + +G S + YM HGGTN+G AG P + TS
Sbjct: 587 FDKWGANHETRPAADMIKGIDDMLSRGISF-SLYMTHGGTNWGHWAGANSPGFAPDV-TS 644
Query: 313 YDYEAPIDEYGLPRNPKWGHLKE 335
YDY+API E G PK+ L+E
Sbjct: 645 YDYDAPISESG-QTTPKYWALRE 666
>gi|255691973|ref|ZP_05415648.1| glycosyl hydrolase [Bacteroides finegoldii DSM 17565]
gi|260622382|gb|EEX45253.1| glycosyl hydrolase family 35 [Bacteroides finegoldii DSM 17565]
Length = 782
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 111/344 (32%), Positives = 169/344 (49%), Gaps = 31/344 (9%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
+ FS+S + ++ ++NG ++ +A IHYPR W ++ K G+N
Sbjct: 13 VTVFSTSCSQSSKETFEIGDKTFLLNGNPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMN 72
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
TI YVFWN HE GKY F G+ ++ F ++ Q+ MY+I+R GP+V AE+ GG+P W
Sbjct: 73 TICLYVFWNFHEPEEGKYDFTGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWW 132
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESF 191
L R E Y+M++ + ++ K+ L S+GG II+ QVENEYG +
Sbjct: 133 LLKKKDIKLR---EQDPYYMERVKLFMNEVGKQLTDLQISKGGNIIMVQVENEYGSF--- 186
Query: 192 YGEGGKRYALWAAKMAVAQN--IGVPWIMCQ-----QFDTPDPVINTCN----SFYCDQF 240
G + + + V Q GVP C + + D ++ T N + DQF
Sbjct: 187 ---GIDKPYIAEIRDIVKQAGFTGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDDQF 243
Query: 241 ---TPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGG 297
P +P + +E W GWF +G + R +ED+ + + S + YM HGG
Sbjct: 244 KRLQELRPDIPLMCSEFWSGWFDHWGAKHETRSAEDLVKGMKEMLDRNISF-SLYMTHGG 302
Query: 298 TNFGRTAGGPF-----ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
T+FG G F TSYDY+API+E G PK+ ++ L
Sbjct: 303 TSFGHWGGANFPNFSPTCTSYDYDAPINESG-KVTPKYFEVRNL 345
>gi|443684013|gb|ELT88070.1| hypothetical protein CAPTEDRAFT_181391 [Capitella teleta]
Length = 655
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 105/339 (30%), Positives = 164/339 (48%), Gaps = 43/339 (12%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+ +NG++ L++S A+HY R VP W + + K G+N +E+YV WN HE G + F
Sbjct: 10 AFFLNGKKTLLLSGAVHYFRVVPEYWRDRLLKVKAAGLNCVETYVAWNAHEAVRGTFDFS 69
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G +L +FI+I Q +Y++LR GP++ +E+++GG+P WL + P R P+ +
Sbjct: 70 GILDLRRFIQIAQDVGLYVLLRPGPYICSEWDFGGLPSWLLHDPEMKVRTSYPPYLEAVD 129
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGY------YESFYGEGGKRYALWAAKMA 207
++ I+ ++ L S+GGPII Q+ENEYG Y+ F +Y +
Sbjct: 130 AYLAKILPLVN--DLQMSKGGPIIAVQLENEYGSYGDDLDYKLFLKNQFIKYGIEELLFT 187
Query: 208 VAQNIGVPWIMCQQFDTPDP-VINTCNSFYCDQ--------FTPHSPSMPKIWTENWPGW 258
G+ + P P V+ T N +Q P +P + E W GW
Sbjct: 188 SDNGTGIQ-------NGPIPGVLATTNFQEQEQGYLMFEYLRNIKQPGLPMMVMEFWSGW 240
Query: 259 FKTFGGRDPHRPSEDIAF-SVARFFQKGGSVHNYYMYHGGTNFGRTAGG----------- 306
F +G + H F V ++ GS N+YM+HGGTNFG AG
Sbjct: 241 FDHWG--EQHNLCHHAEFIDVFKWILLEGSSVNFYMFHGGTNFGFMAGANEDFGATNEGG 298
Query: 307 --PFI--TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIK 341
P+ TTSYDY+ P+ E G N K+ ++ + +K
Sbjct: 299 GEPYAADTTSYDYDCPVSESG-QLNEKFYEIRNILSEMK 336
>gi|336417631|ref|ZP_08597952.1| hypothetical protein HMPREF1017_05060 [Bacteroides ovatus
3_8_47FAA]
gi|335935372|gb|EGM97326.1| hypothetical protein HMPREF1017_05060 [Bacteroides ovatus
3_8_47FAA]
Length = 782
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 110/344 (31%), Positives = 170/344 (49%), Gaps = 31/344 (9%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
+ FS+S + ++ ++NG+ ++ +A IHYPR W ++ K G+N
Sbjct: 13 VTVFSTSCSQSSKETFEIGDKTFLLNGKPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMN 72
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
TI YVFWN HE GKY F G+ ++ F ++ Q+ MY+I+R GP+V AE+ GG+P W
Sbjct: 73 TICLYVFWNFHEPEEGKYDFTGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWW 132
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESF 191
L R E Y+M++ + ++ K+ L ++GG II+ QVENEYG +
Sbjct: 133 LLKKKDIKLR---EQDPYYMERVKLFMNEVGKQLTDLQINKGGNIIMVQVENEYGSF--- 186
Query: 192 YGEGGKRYALWAAKMAVAQN--IGVPWIMCQ-----QFDTPDPVINTCN----SFYCDQF 240
G + + + V Q GVP C + + D ++ T N + DQF
Sbjct: 187 ---GIDKPYIAEIRDIVKQAGFTGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDDQF 243
Query: 241 ---TPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGG 297
P +P + +E W GWF +G + R +ED+ + + S + YM HGG
Sbjct: 244 KRLQELRPDIPLMCSEFWSGWFDHWGAKHETRSAEDLVKGMKEMLDRNISF-SLYMTHGG 302
Query: 298 TNFGRTAGGPF-----ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
T+FG G F TSYDY+API+E G PK+ ++ L
Sbjct: 303 TSFGHWGGANFPNFSPTCTSYDYDAPINESG-KVTPKYFEVRNL 345
>gi|189463987|ref|ZP_03012772.1| hypothetical protein BACINT_00322 [Bacteroides intestinalis DSM
17393]
gi|189438560|gb|EDV07545.1| glycosyl hydrolase family 35 [Bacteroides intestinalis DSM 17393]
Length = 1106
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 112/323 (34%), Positives = 154/323 (47%), Gaps = 33/323 (10%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+ ++NG+ ++ +A +HYPR W ++ K G+NT+ YVFWN HE PG Y F
Sbjct: 356 TFLLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGVYDFT 415
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
+ +L +F ++ QQ MY+ILR GP+V AE+ GG+P WL R F +
Sbjct: 416 EQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDVRLRESDPYFIERVA 475
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYG-----------EGGKRYAL- 201
F + +K L + GGPII+ QVENEYG Y G G AL
Sbjct: 476 LFEEAVAKQVK--DLTIANGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANFGNDIALF 533
Query: 202 ---WAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGW 258
WA+ + + W M F T V Q P+SP M +E W GW
Sbjct: 534 QCDWASNFTLNGLDDLIWTM--NFGTGANVDQQFAKL--KQLRPNSPLMC---SEFWSGW 586
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG------GPFITTS 312
F +G RP+ D+ + +G S + YM HGGTN+G AG P + TS
Sbjct: 587 FDKWGANHETRPAADMIKGIDDMLSRGISF-SLYMTHGGTNWGHWAGANSPGFAPDV-TS 644
Query: 313 YDYEAPIDEYGLPRNPKWGHLKE 335
YDY+API E G PK+ L+E
Sbjct: 645 YDYDAPISESG-QTTPKYWALRE 666
>gi|257865837|ref|ZP_05645490.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC30]
gi|257872172|ref|ZP_05651825.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC10]
gi|257799771|gb|EEV28823.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC30]
gi|257806336|gb|EEV35158.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC10]
Length = 585
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 105/303 (34%), Positives = 144/303 (47%), Gaps = 30/303 (9%)
Query: 44 IISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIK 103
+IS AIHY R VP W +++ + G NT+E+YV WN HE G Y F G +L +FI+
Sbjct: 19 VISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFDGILDLRRFIQ 78
Query: 104 IIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMM 163
Q+ +Y+ILR P++ AE+ +GG+P WL P R D PF + ++ + +
Sbjct: 79 TAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYFAHLFPQV 138
Query: 164 KREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWI------ 217
+ L +QGGPII+ QVENEYG Y + K Y + P +
Sbjct: 139 R--DLQITQGGPIIMMQVENEYGSYAN-----DKEYLRKMVAAMRQHGVETPLVTSDGPW 191
Query: 218 --MCQQFDTPD---PVINTCNSFYCDQFTP----HSPSMPKIWTENWPGWFKTFGGRDPH 268
M + D P IN C S + F H P + E W GWF +G H
Sbjct: 192 HDMLENGSIKDLALPTIN-CGSNIKENFEKLRRFHGEKRPLMVMEFWIGWFDAWGDDQHH 250
Query: 269 RPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI------TTSYDYEAPIDEY 322
S A + GSV N YM+HGGTNFG G + TSYDY+A + E+
Sbjct: 251 TTSTQDAVKELQDCLALGSV-NIYMFHGGTNFGFMNGSNYYERLAPDVTSYDYDALLTEW 309
Query: 323 GLP 325
G P
Sbjct: 310 GEP 312
>gi|346320352|gb|EGX89953.1| beta-calactosidase, putative [Cordyceps militaris CM01]
Length = 633
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 109/352 (30%), Positives = 176/352 (50%), Gaps = 40/352 (11%)
Query: 5 TPIAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQ 64
T +A AL ++ T+ G+ +Y+ ++NG+ II + R +P W ++
Sbjct: 7 TLVALSALSATLAAETTHA-PGSFSYNRTDFLLNGQPFQIIGGQMDPQRILPEYWTHRLK 65
Query: 65 QAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEY 124
A+ G+NTI SY++WN HE PG + F GR ++ +F ++ QQ + ++LR GP++ E
Sbjct: 66 MARAMGLNTIFSYLYWNLHEPRPGAWDFSGRNDVARFFRLAQQEGLRVVLRPGPYICGER 125
Query: 125 NYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKRE--KLFASQGGPIILAQVE 182
++GG P WL +PG R + PF + + +D + +E +L +QGGPI++AQ+E
Sbjct: 126 DWGGFPAWLSQVPGMAVRQNNRPFLDAAKSY----IDRLGKELGQLQITQGGPILMAQLE 181
Query: 183 NEYGYYESFYGEGGKRYALWAAKMAVAQNI----------GVPWIMCQQFDTPDPVI--N 230
NEYG + G + L A + +N G ++ Q VI +
Sbjct: 182 NEYGSF------GTDKTYLAALAAMLRENFDVFLYTNDGGGQSYLEGGQLHGVLAVIDGD 235
Query: 231 TCNSFYC-DQFTPHSPSM-PKIWTENWPGWFKTFGGRDPHR----PSEDIAFSVARF--F 282
+ + F D++ S+ P++ E + W +G PH+ D+A +VA
Sbjct: 236 SQSGFAARDKYVTDPTSLGPQLNGEYYISWIDQWGSDYPHQQIAGSQADVAKAVADLDWT 295
Query: 283 QKGGSVHNYYMYHGGTNFGRTAGG-------PFITTSYDYEAPIDEYGLPRN 327
GG + YM+HGGTNFG GG +TTSYDY AP+DE G P +
Sbjct: 296 LAGGYSFSIYMFHGGTNFGFENGGIRDDGPLAAMTTSYDYGAPLDESGRPTD 347
>gi|224536014|ref|ZP_03676553.1| hypothetical protein BACCELL_00878 [Bacteroides cellulosilyticus
DSM 14838]
gi|224522370|gb|EEF91475.1| hypothetical protein BACCELL_00878 [Bacteroides cellulosilyticus
DSM 14838]
Length = 1106
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 112/323 (34%), Positives = 154/323 (47%), Gaps = 33/323 (10%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+ ++NG+ ++ +A +HYPR W ++ K G+NT+ YVFWN HE PG Y F
Sbjct: 356 TFLLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGVYDFT 415
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
+ +L +F ++ QQ MY+ILR GP+V AE+ GG+P WL R F +
Sbjct: 416 EQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDVRLRESDPYFIERVA 475
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYG-----------EGGKRYAL- 201
F + +K L + GGPII+ QVENEYG Y G G AL
Sbjct: 476 LFEEAVAKQVK--NLTIANGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANFGNDIALF 533
Query: 202 ---WAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGW 258
WA+ + + W M F T V Q P+SP M +E W GW
Sbjct: 534 QCDWASNFTLNGLDDLIWTM--NFGTGANVDQQFAKL--KQLRPNSPLMC---SEFWSGW 586
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG------GPFITTS 312
F +G RP+ D+ + +G S + YM HGGTN+G AG P + TS
Sbjct: 587 FDKWGANHETRPAADMIKGIDDMLSRGISF-SLYMTHGGTNWGHWAGANSPGFAPDV-TS 644
Query: 313 YDYEAPIDEYGLPRNPKWGHLKE 335
YDY+API E G PK+ L+E
Sbjct: 645 YDYDAPISESG-QTTPKYWALRE 666
>gi|254384398|ref|ZP_04999740.1| beta-galactosidase [Streptomyces sp. Mg1]
gi|194343285|gb|EDX24251.1| beta-galactosidase [Streptomyces sp. Mg1]
Length = 588
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 101/307 (32%), Positives = 149/307 (48%), Gaps = 28/307 (9%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
++G I+S +HY R PG+W + +A+ G+NT+E+YV WN H+ P ++ G
Sbjct: 18 LDGEPFRILSGGLHYFRVHPGLWRDRLHKARLMGLNTVETYVPWNLHQPRPDEFRMDGGL 77
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQK-F 155
+L +F+ + ++++LR GP++ AE+ GG+P WL P R+ F + F
Sbjct: 78 DLPRFLDLAAAEGLHVLLRPGPYICAEWEGGGLPSWLLADPAMRLRSRDPNFLAAVDDYF 137
Query: 156 MTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVP 215
L+ + R AS+GGP++ QVENEYG Y Y A + VP
Sbjct: 138 RRLLPPLHDR---LASRGGPVLAVQVENEYGAYGD-----DTAYLEHLADSLRRHGVDVP 189
Query: 216 WIMCQQFDTPDP-----VINTCN-----SFYCDQFTPHSPSMPKIWTENWPGWFKTFGGR 265
C Q + V+ T N + + PS P + TE W GWF +GG
Sbjct: 190 LFTCDQPADLERGALAGVLATANFGSRPAAHLATLRTARPSAPLLCTEFWIGWFDRWGGN 249
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-------PFITTSYDYEAP 318
R +E + + G SV N+YM+HGGTNFG G P + TSYDY+AP
Sbjct: 250 HVVRDAEQASQELDELLATGASV-NFYMFHGGTNFGFMNGANDKHTYRPTV-TSYDYDAP 307
Query: 319 IDEYGLP 325
+DE G P
Sbjct: 308 LDEAGDP 314
>gi|420262409|ref|ZP_14765050.1| beta-galactosidase [Enterococcus sp. C1]
gi|394770166|gb|EJF49970.1| beta-galactosidase [Enterococcus sp. C1]
Length = 585
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 105/303 (34%), Positives = 145/303 (47%), Gaps = 30/303 (9%)
Query: 44 IISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIK 103
+IS AIHY R VP W +++ + G NT+E+YV WN HE G Y F G +L +FI+
Sbjct: 19 VISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFEGILDLRRFIQ 78
Query: 104 IIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMM 163
Q+ +Y+ILR P++ AE+ +GG+P WL P R D PF + ++ + +
Sbjct: 79 TAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYFAHLFPQV 138
Query: 164 KREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWI------ 217
+ L +QGGPI++ QVENEYG Y + K Y Q + P +
Sbjct: 139 R--DLQITQGGPILMMQVENEYGSYAN-----DKEYLRKMVAAMRQQGVETPLVTSDGPW 191
Query: 218 --MCQQFDTPD---PVINTCNSFYCDQFTP----HSPSMPKIWTENWPGWFKTFGGRDPH 268
M + D P IN C S + F H P + E W GWF +G H
Sbjct: 192 HDMLENGSIKDLALPTIN-CGSNIKENFEKLRRFHGEKRPLMVMEFWIGWFDAWGDDHHH 250
Query: 269 RPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI------TTSYDYEAPIDEY 322
S A + GSV N YM+HGGTNFG G + TSYDY+A + E+
Sbjct: 251 TTSTADAVKELQDCLAEGSV-NIYMFHGGTNFGFMNGSNYYERLAPDVTSYDYDALLTEW 309
Query: 323 GLP 325
G P
Sbjct: 310 GEP 312
>gi|288926246|ref|ZP_06420171.1| beta-galactosidase (Lactase) [Prevotella buccae D17]
gi|288336937|gb|EFC75298.1| beta-galactosidase (Lactase) [Prevotella buccae D17]
Length = 791
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 116/357 (32%), Positives = 172/357 (48%), Gaps = 47/357 (13%)
Query: 7 IAPFALLI--FFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQ 64
IA ALL+ S G T ++ ++NG+ ++ +A +HYPR W ++
Sbjct: 11 IATVALLVTAMLSPVSAARKGGTFTVGDKTFLLNGKPFVVKAAELHYPRIPRPYWEHRIK 70
Query: 65 QAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEY 124
K G+NT+ YVFWN HE GK+ F ++ +F ++ Q+ +Y+I+R GP+V AE+
Sbjct: 71 MCKALGMNTVCLYVFWNIHEQQEGKFDFTDNNDVAEFCRLAQRNGLYVIVRPGPYVCAEW 130
Query: 125 NYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKRE------KLFASQGGPIIL 178
GG+P WL R EP Y M++ V + +R+ L GGPII+
Sbjct: 131 EMGGLPWWLLKKKDIRLR---EPDPYFMER-----VKLFERKVGEQLASLTIQNGGPIIM 182
Query: 179 AQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTP--------DPVIN 230
QVENEYG Y G+ A +A + + G + Q D D ++
Sbjct: 183 VQVENEYGSY-------GENKAYVSAIRDIVRQSGFDKVTLFQCDWASNFEKNGLDDLVW 235
Query: 231 TCN---SFYCDQ----FTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQ 283
T N DQ P+ P++ +E W GWF +G R RP++ + +
Sbjct: 236 TMNFGTGADIDQQFRRLGELRPNAPQMCSEFWSGWFDKWGARHETRPAKTMVEGIDEMLS 295
Query: 284 KGGSVHNYYMYHGGTNFGRTAG------GPFITTSYDYEAPIDEYGLPRNPKWGHLK 334
KG S + YM HGGT+FG AG P + TSYDY+API+EYG PK+ L+
Sbjct: 296 KGISF-SLYMTHGGTSFGHWAGANSPGFAPDV-TSYDYDAPINEYGQA-TPKYWELR 349
>gi|294633111|ref|ZP_06711670.1| beta-galactosidase [Streptomyces sp. e14]
gi|292830892|gb|EFF89242.1| beta-galactosidase [Streptomyces sp. e14]
Length = 606
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 100/318 (31%), Positives = 152/318 (47%), Gaps = 27/318 (8%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A +T+ +L+ GR I+S ++HY R PG W + + G+NT+++YV WN HE
Sbjct: 14 AATLTHAGGTLLRAGRPHRILSGSLHYFRVHPGQWADRLARLAALGLNTVDTYVPWNFHE 73
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
+PG F G +L +F+++ Q+ + +I+R GP++ AE++ GG+P WL PG R
Sbjct: 74 RTPGDVRFDGWRDLDRFVRLAQETGLDVIVRPGPYICAEWDNGGLPAWLTGTPGMRPRTS 133
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
PF + ++ ++ + L A +GGP++ Q+ENEYG YG+ G Y W
Sbjct: 134 HPPFLAAVARWFDQLIPRIA--ALQAGRGGPVVAVQIENEYGS----YGDDGD-YVRWVR 186
Query: 205 KMAVAQNI--------GVPWIMCQQFDTPDPVINTCNSFYCDQ----FTPHSPSMPKIWT 252
A+ + G +M + +Q P P
Sbjct: 187 DALTARGVTELLYTADGPTELMLDAGAVEGELAAATFGSRPEQAARLLRSRRPEEPFFCA 246
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF---- 308
E W GWF +G + RP+ A V R GGS+ + YM HGGTNFG AG
Sbjct: 247 EFWNGWFDHWGEQHHVRPARSAADDVGRILGAGGSL-SLYMAHGGTNFGLWAGANHDGDR 305
Query: 309 ---ITTSYDYEAPIDEYG 323
TSYD +AP+ E+G
Sbjct: 306 LQPTVTSYDSDAPVAEHG 323
>gi|365876141|ref|ZP_09415664.1| beta-galactosidase [Elizabethkingia anophelis Ag1]
gi|442588464|ref|ZP_21007275.1| putative exported beta-galactosidase [Elizabethkingia anophelis
R26]
gi|365756153|gb|EHM98069.1| beta-galactosidase [Elizabethkingia anophelis Ag1]
gi|442561698|gb|ELR78922.1| putative exported beta-galactosidase [Elizabethkingia anophelis
R26]
Length = 628
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 113/371 (30%), Positives = 173/371 (46%), Gaps = 47/371 (12%)
Query: 10 FALLIFFSSSI-TYCFAGNVTYDSRS--LIINGRRELIISAAIHYPRSVPGMWPGLVQQA 66
L I F+ ++ + + T++ ++ ++NG+ I S +HYPR W +Q
Sbjct: 8 LVLFILFACNVLIFSQSRKSTFEIKNGHFLLNGKLFSIHSGEMHYPRIPQEYWKHRLQMM 67
Query: 67 KEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNY 126
K G+N + +YVFWN HE +PGK+ + G +L KFIK Q+ +Y+I+R GP+V AE+ +
Sbjct: 68 KAMGLNAVTTYVFWNYHEENPGKWNWSGEKDLKKFIKTAQEVGLYVIIRPGPYVCAEWEF 127
Query: 127 GGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG 186
GG P WL I G R D F QK++T + + +K L + GGP+I+ Q ENE+G
Sbjct: 128 GGYPWWLQNIKGLKIREDNNLFLAETQKYITQLYNQVK--DLQITNGGPVIMVQAENEFG 185
Query: 187 YYESFYGE----GGKRYALWAAKMAVAQNIGVP-------WIM-----------CQQFDT 224
+ + + + Y K VP W+ D
Sbjct: 186 SFVAQRKDIPLASHRTYNAKIVKQLKDAGFSVPMFTSDGSWLFEGGSVVGALPTANGEDN 245
Query: 225 PDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQK 284
+ + N + +Q P + E +PGW + + P + +A ++ +
Sbjct: 246 IENLKKIVNQYNNNQ-------GPYMVAEFYPGWLAHWAEKFPRVDAGTVARQTDKYLKN 298
Query: 285 GGSVHNYYMYHGGTNFGRTAGGPFIT--------TSYDYEAPIDEYGLPRNPKWGHLKEL 336
S NYYM HGGTNFG T G + TSYDY+API E G R PK+ L+ +
Sbjct: 299 DVSF-NYYMVHGGTNFGFTNGANYDKNHDIQPDLTSYDYDAPITEAGW-RTPKYDSLRAV 356
Query: 337 ---HGAIKLCE 344
H KL E
Sbjct: 357 ISKHTKAKLPE 367
>gi|5566254|gb|AAD45349.1| beta-galactosidase [Vitis vinifera]
Length = 181
Score = 156 bits (395), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 79/181 (43%), Positives = 110/181 (60%), Gaps = 2/181 (1%)
Query: 480 DYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKN 539
DYLWY T I + +E FL+ G P L++++ GHA+H F N +L GSA G + F +
Sbjct: 1 DYLWYMTRIDIGSSESFLRGGELPTLILQTTGHAVHVFINGQLTGSAFGTREYRRFTFTE 60
Query: 540 PISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIG 598
++L AG N IALLS+ VGL N G +E GI V + G N G DLS WTYK+G
Sbjct: 61 KVNLHAGTNTIALLSVAVGLPNVGGHFETWNTGILGPVALHGLNQGKWDLSWQRWTYKVG 120
Query: 599 LQGEHLGIYNPGYRNNINWVS-TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLA 657
L+GE + + +P ++++W+ ++ + QPLTW+KA P GDEP+ LDM MGKG
Sbjct: 121 LKGEAMNLVSPNGISSVDWMQGSLAAQRQQPLTWHKAFFNAPEGDEPLALDMEGMGKGQI 180
Query: 658 W 658
W
Sbjct: 181 W 181
>gi|325569852|ref|ZP_08145846.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
gi|325156975|gb|EGC69143.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
Length = 585
Score = 156 bits (395), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 105/303 (34%), Positives = 145/303 (47%), Gaps = 30/303 (9%)
Query: 44 IISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIK 103
+IS AIHY R VP W +++ + G NT+E+YV WN HE G Y F G +L +FI+
Sbjct: 19 VISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFEGILDLRRFIQ 78
Query: 104 IIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMM 163
Q+ +Y+ILR P++ AE+ +GG+P WL P R D PF + ++ + +
Sbjct: 79 TAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYFAHLFPQV 138
Query: 164 KREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWI------ 217
+ L +QGGPI++ QVENEYG Y + K Y Q + P +
Sbjct: 139 R--DLQITQGGPILMMQVENEYGSYAN-----DKEYLRKMVAAMRQQGVETPLVTSDGPW 191
Query: 218 --MCQQFDTPD---PVINTCNSFYCDQFTP----HSPSMPKIWTENWPGWFKTFGGRDPH 268
M + D P IN C S + F H P + E W GWF +G H
Sbjct: 192 HDMLENGTIKDLALPTIN-CGSNIKENFEKLRRFHGEKRPLMVMEFWIGWFDAWGDDHHH 250
Query: 269 RPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI------TTSYDYEAPIDEY 322
S A + GSV N YM+HGGTNFG G + TSYDY+A + E+
Sbjct: 251 TTSTADAVKELQDCLAEGSV-NIYMFHGGTNFGFMNGSNYYERLAPDVTSYDYDALLTEW 309
Query: 323 GLP 325
G P
Sbjct: 310 GEP 312
>gi|329960218|ref|ZP_08298660.1| beta-galactosidase domain protein [Bacteroides fluxus YIT 12057]
gi|328532891|gb|EGF59668.1| beta-galactosidase domain protein [Bacteroides fluxus YIT 12057]
Length = 1104
Score = 156 bits (394), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 110/328 (33%), Positives = 158/328 (48%), Gaps = 28/328 (8%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
G+ + + ++NG+ ++ +A +HYPR W ++ K G+NTI YVFWN HE
Sbjct: 347 GDFSAGKGTFLLNGKPFVVKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHEP 406
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
PG + F G+ +L +F ++ +Q MY+ILR GP+V AE+ GG+P WL R
Sbjct: 407 QPGVFDFTGQNDLAEFCRLCRQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESD 466
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
F + F + + + + GGPII+ QVENEYG Y GE K Y
Sbjct: 467 PYFIERVGIFEKAVAEQVA--DMTIQNGGPIIMVQVENEYGSY----GED-KGYVSQIRD 519
Query: 206 MAVAQNIGVPWIMCQ-----QFDTPDPVINTCN----SFYCDQFTPHS---PSMPKIWTE 253
+ A GV C + ++ T N + QF P P P + +E
Sbjct: 520 IVRANYPGVTLFQCDWASNFTKNGLHDLVWTMNFGTGANIDQQFAPLKKLRPDSPLMCSE 579
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG------P 307
W GWF +G RP+ D+ + KG S + YM HGGTN+G AG P
Sbjct: 580 FWSGWFDKWGANHETRPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAP 638
Query: 308 FITTSYDYEAPIDEYGLPRNPKWGHLKE 335
+T SYDY+API E G PK+ L++
Sbjct: 639 DVT-SYDYDAPISESG-QTTPKYWELRK 664
>gi|257875465|ref|ZP_05655118.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC20]
gi|257809631|gb|EEV38451.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC20]
Length = 585
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 105/303 (34%), Positives = 144/303 (47%), Gaps = 30/303 (9%)
Query: 44 IISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIK 103
+IS AIHY R VP W +++ + G NT+E+YV WN HE G Y F G +L +FI+
Sbjct: 19 VISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFDGILDLRRFIQ 78
Query: 104 IIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMM 163
Q+ +Y+ILR P++ AE+ +GG+P WL P R D PF + ++ + +
Sbjct: 79 TAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYFAHLFPQV 138
Query: 164 KREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWI------ 217
+ L +QGGPII+ QVENEYG Y + K Y + P +
Sbjct: 139 R--DLQITQGGPIIMMQVENEYGSYAN-----DKEYLRKMVAAMRQHGVETPLVTSDGPW 191
Query: 218 --MCQQFDTPD---PVINTCNSFYCDQFTP----HSPSMPKIWTENWPGWFKTFGGRDPH 268
M + D P IN C S + F H P + E W GWF +G H
Sbjct: 192 HDMLENGSIKDLALPTIN-CGSNIKENFEKLRKFHGEKRPLMVMEFWIGWFDAWGDDQHH 250
Query: 269 RPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI------TTSYDYEAPIDEY 322
S A + GSV N YM+HGGTNFG G + TSYDY+A + E+
Sbjct: 251 TTSIQDAVKELQDCLALGSV-NIYMFHGGTNFGFMNGSNYYERLAPDVTSYDYDALLTEW 309
Query: 323 GLP 325
G P
Sbjct: 310 GEP 312
>gi|315606512|ref|ZP_07881527.1| family 35 glycosyl hydrolase [Prevotella buccae ATCC 33574]
gi|315251918|gb|EFU31892.1| family 35 glycosyl hydrolase [Prevotella buccae ATCC 33574]
Length = 787
Score = 155 bits (393), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 110/337 (32%), Positives = 166/337 (49%), Gaps = 45/337 (13%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
G T ++ ++NG+ ++ +A +HYPR W ++ K G+NT+ YVFWN HE
Sbjct: 27 GGTFTVGDKTFLLNGKPFVVKAAELHYPRIPRPYWEHRIKMCKALGMNTVCLYVFWNIHE 86
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
G++ F G ++ +F ++ Q+ +Y+I+R GP+V AE+ GG+P WL R
Sbjct: 87 QQEGRFDFTGNNDVAEFCRLAQRNGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLR-- 144
Query: 145 TEPFKYHMQKFMTLIVDMMKRE------KLFASQGGPIILAQVENEYGYYESFYGEGGKR 198
EP Y M++ V + +R+ L GGPII+ QVENEYG Y G+
Sbjct: 145 -EPDPYFMER-----VKLFERKVGEQLASLTIQNGGPIIMVQVENEYGSY-------GEN 191
Query: 199 YALWAAKMAVAQNIGVPWIMCQQFDTP--------DPVINTCN---SFYCDQ----FTPH 243
A +A + + G + Q D D ++ T N DQ
Sbjct: 192 KAYVSAIRDIVRQSGFDKVTLFQCDWASNFEKNGLDDLVWTMNFGTGADIDQQFRRLGEL 251
Query: 244 SPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRT 303
P+ P++ +E W GWF +G R RP++ + + KG S + YM HGGT+FG
Sbjct: 252 RPNAPQMCSEFWSGWFDKWGARHETRPAKAMVEGIDEMLSKGISF-SLYMTHGGTSFGHW 310
Query: 304 AG------GPFITTSYDYEAPIDEYGLPRNPKWGHLK 334
AG P + TSYDY+API+EYG PK+ L+
Sbjct: 311 AGANSPGFAPDV-TSYDYDAPINEYGQA-TPKYWELR 345
>gi|300726558|ref|ZP_07060002.1| beta-galactosidase [Prevotella bryantii B14]
gi|299776172|gb|EFI72738.1| beta-galactosidase [Prevotella bryantii B14]
Length = 781
Score = 155 bits (392), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 111/341 (32%), Positives = 167/341 (48%), Gaps = 36/341 (10%)
Query: 10 FALLIFFSSSITY--------CFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPG 61
FA + F S ++T G+ ++ ++NG+ + +A +HYPR W
Sbjct: 5 FAKIAFLSLALTLGAPTISYGADKGSFDIGHKTFLLNGKPFTVKAAELHYPRIPRPYWEH 64
Query: 62 LVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVA 121
++ K G+N I YVFWN HE G++ F G ++ +F ++ Q+ MY+I+R GP+V
Sbjct: 65 RIKMCKALGMNAICIYVFWNIHEQKEGEFNFTGNNDVAEFCRLAQKNGMYVIVRPGPYVC 124
Query: 122 AEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQV 181
AE+ GG+P WL R F ++ F + + + L +GGPII+ QV
Sbjct: 125 AEWEMGGLPWWLLKKKDIKLRERDPYFMERVKIFEDKVAEQLA--PLTIQRGGPIIMVQV 182
Query: 182 ENEYGYY---ESFYGEGGKRYAL---WAAKMAVAQNIGVPWIMCQQFDTPDPVINTCN-- 233
ENEYG Y + + GE R L W + + Q W + D +I T N
Sbjct: 183 ENEYGSYGIDKQYVGE--IRDMLRQGWGNDVKMFQ---CDWSSNFTHNGLDDLIWTMNFG 237
Query: 234 --SFYCDQFTPHS---PSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSV 288
+ +QF P P + +E W GWF +G R RP++D+ ++ KG S
Sbjct: 238 TGANIDNQFKKLKSLRPDAPLMCSEFWSGWFDKWGARHETRPAQDMVNNIDEMLSKGISF 297
Query: 289 HNYYMYHGGTNFGRTAGG------PFITTSYDYEAPIDEYG 323
+ YM HGGT+FG AG P + TSYDY+API+EYG
Sbjct: 298 -SLYMTHGGTSFGHWAGANSPGFQPDV-TSYDYDAPINEYG 336
>gi|29345700|ref|NP_809203.1| beta-galactosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|383123143|ref|ZP_09943828.1| hypothetical protein BSIG_0114 [Bacteroides sp. 1_1_6]
gi|29337593|gb|AAO75397.1| beta-galactosidase precursor [Bacteroides thetaiotaomicron
VPI-5482]
gi|251841761|gb|EES69841.1| hypothetical protein BSIG_0114 [Bacteroides sp. 1_1_6]
Length = 779
Score = 155 bits (391), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 111/357 (31%), Positives = 172/357 (48%), Gaps = 33/357 (9%)
Query: 4 RTPIAPFALLIF--FSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPG 61
+ P+ +L+ SS + G + ++NG ++ +A IHYPR W
Sbjct: 2 KKPLLYLLILVVAVLGSSCSQSSEGTFEVGKNTFLLNGEPFVVKAAEIHYPRIPKEYWEH 61
Query: 62 LVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVA 121
++ K G+NTI YVFWN HE G+Y F G+ ++ F ++ Q+ MY+I+R GP+V
Sbjct: 62 RIKMCKALGMNTICLYVFWNFHEPEEGRYDFAGQKDIAAFCRLAQENGMYVIVRPGPYVC 121
Query: 122 AEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQ 180
AE+ GG+P WL R E Y+M++ + ++ K+ L S+GG II+ Q
Sbjct: 122 AEWEMGGLPWWLLKKKDIKLR---EQDPYYMERVKLFLNEVGKQLADLQISKGGNIIMVQ 178
Query: 181 VENEYGYYESFYGEGGKRYALWAAKMAVAQN--IGVPWIMCQ-----QFDTPDPVINTCN 233
VENEYG + G + + + V Q GVP C + + D ++ T N
Sbjct: 179 VENEYGAF------GIDKPYISEIRDMVKQAGFTGVPLFQCDWNSNFENNALDDLLWTIN 232
Query: 234 ----SFYCDQF---TPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGG 286
+ +QF P P + +E W GWF +G + R +E++ + +
Sbjct: 233 FGTGANIDEQFKRLKELRPDTPLMCSEFWSGWFDHWGAKHETRSAEELVKGMKEMLDRNI 292
Query: 287 SVHNYYMYHGGTNFGRTAGGPF-----ITTSYDYEAPIDEYGLPRNPKWGHLKELHG 338
S + YM HGGT+FG G F TSYDY+API+E G PK+ ++ L G
Sbjct: 293 SF-SLYMTHGGTSFGHWGGANFPNFSPTCTSYDYDAPINESG-KVTPKYLEVRNLLG 347
>gi|332030018|gb|EGI69843.1| Beta-galactosidase [Acromyrmex echinatior]
Length = 594
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 103/335 (30%), Positives = 169/335 (50%), Gaps = 31/335 (9%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+V Y++ +++G+ +S + HY R+ W +++ + G+N I +YV W+ HE
Sbjct: 1 DVDYENNQFLLDGKPFQYVSGSFHYFRTPRQYWRDRLRKMRAAGLNAISTYVEWSLHEPE 60
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW-LHYIPGTVFRNDT 145
PG++ + G +LV F+ I Q+ ++++LR GP++ AE + GG+P W L +P R
Sbjct: 61 PGQFNWTGDADLVNFLNIAQEEDLFVLLRPGPYICAERDMGGLPYWLLREVPNINLRTKD 120
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY------------ESFYG 193
F + ++ I+ ++ L GGPII+ Q+ENEYG Y E F
Sbjct: 121 ADFVRYATLYLNEILSKIR--PLLRGNGGPIIMVQIENEYGSYYACDIEYMDMLKEVFVK 178
Query: 194 EGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPD--PVINTCNSFYCDQFTPHSPSMPKIW 251
+ G + L+ A A + +I + T D N NSF + + P P +
Sbjct: 179 KVGNKALLYTTDGAAASLLRCGFI-SGAYATVDFGTASNVTNSFLSMRL--YQPRGPLVN 235
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG----- 306
+E +PGW +G +E I S+ G SV N+YM++GGTNFG T+G
Sbjct: 236 SEFYPGWLTHWGEPFQRTKTEAIVKSLEEMLALGASV-NFYMFYGGTNFGFTSGANGGAG 294
Query: 307 ---PFITTSYDYEAPIDEYGLPRNPKWGHLKELHG 338
P + TSYDY+AP+ E G P PK+ ++++ G
Sbjct: 295 VYNPQL-TSYDYDAPLTEAGDP-TPKYFAIRDVIG 327
>gi|139439964|ref|ZP_01773301.1| Hypothetical protein COLAER_02339 [Collinsella aerofaciens ATCC
25986]
gi|133774730|gb|EBA38550.1| glycosyl hydrolase family 35 [Collinsella aerofaciens ATCC 25986]
Length = 598
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 106/319 (33%), Positives = 149/319 (46%), Gaps = 33/319 (10%)
Query: 44 IISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIK 103
I+S AIHY R P W + K G NT+E+YV WN HE PG + F G +L F+
Sbjct: 19 ILSGAIHYMRVHPSDWHHSLYNLKALGFNTVETYVPWNLHEPKPGVFDFSGSIDLAAFLD 78
Query: 104 IIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMM 163
+Y I+R PF+ AE+ +GG+P WL R+ F H+ ++ ++ ++
Sbjct: 79 EAASLGLYAIVRPSPFICAEWEFGGMPAWLLREHDMRPRSSDPKFLAHVAQYYDHLMPIL 138
Query: 164 KREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGV-------PW 216
++ +GG II+ QVENEYG Y K Y ++ V + + V PW
Sbjct: 139 VSRQI--DKGGNIIMMQVENEYGSYCE-----DKDYLRAIRRLMVERGVSVPLCTSDGPW 191
Query: 217 IMCQQFDT--PDPVINTCN--SFYCDQFTP-------HSPSMPKIWTENWPGWFKTFGGR 265
C + T D V+ T N S + F H P + E W GWF +G
Sbjct: 192 RGCLRAGTLIDDDVLCTGNFGSHAKENFEALSAFHKEHGKQWPLMCMELWDGWFNRYGEN 251
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAGGPFITTSYDYEAP 318
R ED+A V + GGS+ N YM+HGGTNFG R TSYDY+AP
Sbjct: 252 VIRRDPEDLASCVREVLELGGSL-NLYMFHGGTNFGFMNGCSARHTHDLHQVTSYDYDAP 310
Query: 319 IDEYGLPRNPKWGHLKELH 337
+DE G P + + +H
Sbjct: 311 LDEQGNPTEKYFAIQRTVH 329
>gi|357391354|ref|YP_004906195.1| putative beta-galactosidase [Kitasatospora setae KM-6054]
gi|311897831|dbj|BAJ30239.1| putative beta-galactosidase [Kitasatospora setae KM-6054]
Length = 588
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 107/330 (32%), Positives = 157/330 (47%), Gaps = 44/330 (13%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+TYDS ++GR ++S A+HY RS P W + + G+NT+E+YV WN HE +P
Sbjct: 2 LTYDSTGFRLDGRPLRVLSGAVHYFRSRPEQWADRLAAVRAMGLNTVETYVPWNLHEPAP 61
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G++ G L F+ ++ ++ I+R GP++ AE++ GG+P WL G R
Sbjct: 62 GRFARVG--ELGAFLDEARRQGLWTIVRPGPYICAEWDNGGLPGWLTARLGRRVRTGDPE 119
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
F + F +++ + E+ + G +++ QVENEYG + S G Y A+
Sbjct: 120 FLAAVGAFFDVLLPQVV-ERQWGRPDGSVLMVQVENEYGAFGSDAG-----YLAALARGL 173
Query: 208 VAQNIGVPWIMCQQFDTPD---------PVINTCNSFYCD------QFTPHSPSMPKIWT 252
+ + VP D P+ P + +F D H P P
Sbjct: 174 RERGVSVPLFTS---DGPEDHMLAAGTVPGVLATVNFGSDPERGFAALRRHRPEDPPFCM 230
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-----P 307
E W GWF +G R ++D A S+ R GGSV N YM HGGT+FG +AG P
Sbjct: 231 EFWNGWFDQWGRPHHTRGADDAADSLRRILAAGGSV-NLYMAHGGTSFGTSAGANHADPP 289
Query: 308 F------------ITTSYDYEAPIDEYGLP 325
F TSYDY+AP+DE GLP
Sbjct: 290 FNSTDWTHSPYQPTVTSYDYDAPLDERGLP 319
>gi|395775444|ref|ZP_10455959.1| glycosyl hydrolase family 42 [Streptomyces acidiscabies 84-104]
Length = 587
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 103/322 (31%), Positives = 156/322 (48%), Gaps = 29/322 (9%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
S +NG IIS A+HY R P W +++A+ G+NT+E+YV WN H+ PG
Sbjct: 10 SFELNGEPFRIISGALHYFRVHPDQWADRLRKARLMGLNTVETYVPWNLHQPEPGTLVLD 69
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G +L +F+++ + ++LR GP++ AE++ GG+P WL R+ F +
Sbjct: 70 GLLDLPRFLRLAHAEGLKVLLRPGPYICAEWDGGGLPHWLMSESDVQLRSSDPKFTAIID 129
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIG 213
+++ L++ + A GGP+I QVENEYG Y + Y + + ++ I
Sbjct: 130 RYLDLLLPPLLPH--MAESGGPVIAVQVENEYGAYGN-----DAEYLKYLVEAFRSRGIE 182
Query: 214 VPWIMCQQFDTPD------PVINTCNSF------YCDQFTPHSPSMPKIWTENWPGWFKT 261
C Q + P + + +F H P P + E W GWF
Sbjct: 183 ELLFTCDQVNPEHQQAGSIPGVLSTGTFGGKIETALATLRAHQPEGPLMCAEFWIGWFDH 242
Query: 262 FGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG-------GPFITTSYD 314
+GG R + D+A + + G SV N YM+HGGTNFG T G P I TSYD
Sbjct: 243 WGGPHHTRDTADVAADLDKLLAAGASV-NIYMFHGGTNFGLTNGANHHHTYAPTI-TSYD 300
Query: 315 YEAPIDEYGLPRNPKWGHLKEL 336
Y+AP+ E G P PK+ +E+
Sbjct: 301 YDAPLTENGDP-GPKYHAFREV 321
>gi|414880685|tpg|DAA57816.1| TPA: putative RAN GTPase activating family protein [Zea mays]
Length = 598
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 107/302 (35%), Positives = 153/302 (50%), Gaps = 64/302 (21%)
Query: 292 YMYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGE 351
+ YHGGTNFGRT+GGP+ITTSYDY+AP+DEYG R PK+GHLK+LH I+ E L++G+
Sbjct: 308 FKYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNIRQPKYGHLKDLHDLIRSMEKILVHGK 367
Query: 352 RSNLSLGSSQEADVYADSSGACAAFLANMDDKNDKTVVFRNVSYHLPAWSVSILPDCKKV 411
Y D+S A + D K V ++ +PAWSVSILPDCK V
Sbjct: 368 --------------YNDTSYGKNAIFVDRDVK----VTLSGGTHLVPAWSVSILPDCKTV 409
Query: 412 VFNTANVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQVFKEIAGIW---GEADFVKSGF 468
+NTA ++ Q+S ++ + E P+ L+W E + F S
Sbjct: 410 AYNTAKIKTQTS---VMVKKANSVEKEPE----ALRWSWMPENLKPFMTDHRDSFRHSQL 462
Query: 469 VDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHAL-------------- 514
++ I T+ D +DYLWY TS+ E GS L + + GH +
Sbjct: 463 LEQITTSTDQSDYLWYRTSL------EHKGEGSY-TLYVNTSGHEMAKLLGRWSVRLPAP 515
Query: 515 ---HAFANQELQGSA-----------SGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQ 560
A +EL+ S S +G F+ ++P+ L +GKN ++LLS TVGL+
Sbjct: 516 VSGEAPLRKELRFSPQRHSRTQGQNYSADGAF-VFQLQSPVKLHSGKNYVSLLSGTVGLK 574
Query: 561 NA 562
+A
Sbjct: 575 SA 576
>gi|317504905|ref|ZP_07962857.1| family 35 glycosyl hydrolase [Prevotella salivae DSM 15606]
gi|315663982|gb|EFV03697.1| family 35 glycosyl hydrolase [Prevotella salivae DSM 15606]
Length = 784
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 111/346 (32%), Positives = 159/346 (45%), Gaps = 33/346 (9%)
Query: 11 ALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGG 70
LL S+ G+ T + ++NG+ ++ +A +HYPR W ++ K G
Sbjct: 13 TLLFSLSTLTALARGGDFTAGKNTFLLNGQPFVVKAAELHYPRIPRPYWDQRIKMCKALG 72
Query: 71 VNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIP 130
+NTI YVFWN HE KY F G ++ F ++ Q+ MY+I+R GP+V AE+ GG+P
Sbjct: 73 MNTICLYVFWNIHEQQESKYDFTGNNDVAAFCRLAQKNGMYVIVRPGPYVCAEWEMGGLP 132
Query: 131 VWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY-- 188
WL R D F ++ F + + L GGPII+ QVENEYG Y
Sbjct: 133 WWLLKKKDIRLREDDPYFLARVKAFEAEVGRQLA--PLTIQNGGPIIMVQVENEYGSYGV 190
Query: 189 ---------ESFYGEGGKRYAL----WAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSF 235
+ G + L WA+ + W M F T +
Sbjct: 191 NKQYVSQIRDIVKASGFDKVTLFQCDWASNFEKNGLDDLLWTM--NFGTGSNIDAQFKRL 248
Query: 236 YCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYH 295
Q P +P M +E W GWF +G R RP++ + + K S + YM H
Sbjct: 249 --KQLRPETPLM---CSEFWSGWFDKWGARHETRPAKAMVEGINEMLSKNISF-SLYMTH 302
Query: 296 GGTNFGRTAG------GPFITTSYDYEAPIDEYGLPRNPKWGHLKE 335
GGT+FG AG P + TSYDY+API+EYG PK+ L++
Sbjct: 303 GGTSFGHWAGANSPGFAPDV-TSYDYDAPINEYGHA-TPKFWELRK 346
>gi|323449959|gb|EGB05843.1| hypothetical protein AURANDRAFT_66064 [Aureococcus anophagefferens]
Length = 1630
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 132/435 (30%), Positives = 194/435 (44%), Gaps = 64/435 (14%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
++ D RSL++NG R L++S +IHYPRS P MWP L +A+ G+N IESY FWN H +
Sbjct: 1037 SIARDGRSLLVNGSRVLLLSGSIHYPRSTPAMWPKLFAEARANGLNAIESYAFWNKHSAT 1096
Query: 87 P-GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIP------------VWL 133
G Y +G ++ F+ + + ++++ R GP+V AE+ GGIP W+
Sbjct: 1097 RYGAYDYGFNGDVDLFLSLAAEHDLFVLWRFGPYVCAEWPAGGIPARAPRRAVFASNAWI 1156
Query: 134 HYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYG 193
H +PG R + + ++M D + S+ G ++ENEYG +S
Sbjct: 1157 HDVPGMKTRTNNTAWLNETGRWMR---DHFAVIEPHLSRNG--ASNRIENEYGGSKSDAA 1211
Query: 194 EGGKRYALWAAKMAVAQNIGVPWIMCQ--QFDTPDPVINTCNSFYCDQ-------FTPHS 244
AL A AVA + W+MC PD ++T N DQ P +
Sbjct: 1212 AVAYVDALDALADAVAPEL--VWMMCGFVSLVAPD-ALHTGNGCPHDQGPASAHVVVPPA 1268
Query: 245 PSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGR-- 302
P W W+ +G RP D+A+ VA + GG++HN+YM+HGG ++G
Sbjct: 1269 PGADPAWYTEDELWYDAWGLPSLARPPADVAYGVASYVATGGAMHNFYMWHGGNHYGNWS 1328
Query: 303 TA----GG------PFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGER 352
TA GG P Y AP+ G P + HL +HG + L
Sbjct: 1329 TATPDLGGASSPEPPASQVRYANAAPLRSDGSRHEPLFSHLAAVHGTLDAYAEVL----- 1383
Query: 353 SNLSLGSSQEADVYADSSGAC--AAFLANMDDKNDKTVVFRNVSYHLPA-WSVSILPDCK 409
LG++ EA AC A FL +D +VVF H A W+ C
Sbjct: 1384 ----LGATPEALATPSCVAACPHAYFLKFANDT--ASVVF---GVHACAQWNA-----CD 1429
Query: 410 KVVFNTANVRAQSST 424
+VRA ++T
Sbjct: 1430 ANATAAVDVRASNAT 1444
>gi|334138027|ref|ZP_08511451.1| beta-galactosidase [Paenibacillus sp. HGF7]
gi|333604560|gb|EGL15950.1| beta-galactosidase [Paenibacillus sp. HGF7]
Length = 601
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 105/311 (33%), Positives = 151/311 (48%), Gaps = 32/311 (10%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
++N + IIS A+HY R VP W + + K G NT+E+YV WN HE GK+ FG
Sbjct: 10 QFLLNDKPLRIISGALHYFRVVPEYWRDRLLKMKACGCNTVETYVAWNVHEPEEGKFDFG 69
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G +++ F+++ + +++I+R P++ AE+ +GG+P WL R F +
Sbjct: 70 GIADVIAFVELAGELGLHVIVRPSPYICAEWEFGGLPAWLLKDSEMQLRCSDPKFLAKVD 129
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIG 213
+ ++ + K L + GGPII QVENEYG Y + K Y + +A+ I
Sbjct: 130 AYYDVL--LPKFVPLLCTNGGPIIAMQVENEYGSYGN-----DKAYLGYLRDGMIARGID 182
Query: 214 VPWI--------MCQQFDTPDPVINTCN-------SFYCDQFTPHSPSMPKIWTENWPGW 258
V M Q PD V+ T N SF +F + P P + E W GW
Sbjct: 183 VLLFTSDGPTDEMLQGGTLPD-VLATVNFGSRPEESFA--KFREYRPDEPLMCMEFWNGW 239
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI------TTS 312
F + R ED A + G SV N+YM+HGGTNFG +G I TS
Sbjct: 240 FDHWMEEHHTRDGEDAARVLDDMLGAGASV-NFYMFHGGTNFGFYSGANHIKTYEPTVTS 298
Query: 313 YDYEAPIDEYG 323
YDY+AP+ E G
Sbjct: 299 YDYDAPLTERG 309
>gi|423342145|ref|ZP_17319860.1| hypothetical protein HMPREF1077_01290 [Parabacteroides johnsonii
CL02T12C29]
gi|409219016|gb|EKN11981.1| hypothetical protein HMPREF1077_01290 [Parabacteroides johnsonii
CL02T12C29]
Length = 779
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 115/354 (32%), Positives = 171/354 (48%), Gaps = 33/354 (9%)
Query: 5 TPIAPFALLIFFSSSITYCFAGNVTY--DSRSLIINGRRELIISAAIHYPRSVPGMWPGL 62
T I ALL+F S AG T+ +++ +++G+ +I +A IHY R W
Sbjct: 7 TAIWLTALLLFAFSGCNQKPAGEHTFAIGNKTFLLDGKPFVIKAAEIHYTRIPAEYWEHR 66
Query: 63 VQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAA 122
+Q K G+NTI Y FWN HE PG++ F G+ ++ F ++ Q+ MY++LR GP+V +
Sbjct: 67 IQLCKALGMNTICIYAFWNIHEQKPGEFDFSGQNDIAAFCRLAQKYDMYIMLRPGPYVCS 126
Query: 123 EYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVE 182
E+ GG+P WL R + F + FM I + L ++GG II+ QVE
Sbjct: 127 EWEMGGLPWWLLKKDDIKLRTNDPYFLERTKLFMNEIGKQLA--DLQITKGGNIIMVQVE 184
Query: 183 NEYGYYESFYGEGGKRYALWAAKMAVAQNIG---VPWIMCQ-----QFDTPDPVINTCN- 233
NEYG Y + K Y A + + G VP C Q + D ++ T N
Sbjct: 185 NEYGSYAT-----DKEYI--ANIRDIVKGAGFTDVPLFQCDWSSNFQNNALDDLVWTINF 237
Query: 234 ---SFYCDQFTPHS---PSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGS 287
+ +QF P+ P + +E W GWF +G + R +E + + +G S
Sbjct: 238 GTGANIDEQFKKLKEVRPNTPLMCSEFWSGWFDHWGRKHETRDAETMVSGLKDMLDRGIS 297
Query: 288 VHNYYMYHGGTNFGRTAGG-----PFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
+ YM HGGT FG G + +SYDY+API E G PK+ L+EL
Sbjct: 298 F-SLYMTHGGTTFGHWGGANSPAYSAMCSSYDYDAPISEAGW-TTPKYFKLREL 349
>gi|336319932|ref|YP_004599900.1| Beta-galactosidase [[Cellvibrio] gilvus ATCC 13127]
gi|336103513|gb|AEI11332.1| Beta-galactosidase [[Cellvibrio] gilvus ATCC 13127]
Length = 586
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 95/311 (30%), Positives = 152/311 (48%), Gaps = 26/311 (8%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
+ +++G I+S A+HY R P +W +++A+ G+NTIE+YV WN H G +
Sbjct: 9 QDFLLDGEPLQILSGALHYFRVHPDLWADRIRKARLMGLNTIETYVAWNAHAPERGVFDL 68
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+ ++ ++ I+R GP++ AE++ GG+P WL PG R + +
Sbjct: 69 TGNLDLGRFLDLVAAEGLHAIVRPGPYICAEWDNGGLPAWLMATPGVGVRTAEPQYLEAI 128
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ I+ ++ ++ ++GGP+++ QVENEYG Y Y M + I
Sbjct: 129 AGYYDEILAVVAPRQV--TRGGPVLMVQVENEYGAYGD-----DADYLRALVTMMRERGI 181
Query: 213 GVPWIMCQQFDTPD------PVINTCNSF------YCDQFTPHSPSMPKIWTENWPGWFK 260
VP C Q + P ++ +F + H P+ P + E W GWF
Sbjct: 182 EVPLTTCDQANDEMLGRGGLPELHKTATFGSRSPERLETLRRHQPTGPLMCMEYWDGWFD 241
Query: 261 TFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG----GPF--ITTSYD 314
++G + H A + G+ N YM+HGGTN G T G G + ITTSYD
Sbjct: 242 SWGEQH-HTTDAAEAAADLDLLLSQGASANLYMFHGGTNLGFTNGANDKGTYLPITTSYD 300
Query: 315 YEAPIDEYGLP 325
Y+AP+ E G P
Sbjct: 301 YDAPLAEDGSP 311
>gi|16611713|gb|AAL27306.1|AF376481_1 BgaC [Carnobacterium maltaromaticum]
Length = 586
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 107/322 (33%), Positives = 153/322 (47%), Gaps = 47/322 (14%)
Query: 44 IISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIK 103
IIS AIHY R VP W ++ K G NT+E+YV WN HE G+Y F +L +FI+
Sbjct: 19 IISGAIHYFRVVPEYWEHRLKLLKNMGCNTVETYVAWNQHEPKKGQYVFSDALDLRRFIQ 78
Query: 104 IIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPF----KYHMQKFMTLI 159
+ + +ILR P++ AE+ +GG+P WL R+ PF + + ++ +
Sbjct: 79 LADSLGLKVILRPSPYICAEFEFGGLPAWLLKDRHMRVRSTYPPFMERVRLYYRELFKEV 138
Query: 160 VDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWI-- 217
+D+ + GGPIIL QVENEYG Y S K+Y M + VP +
Sbjct: 139 IDLQ------ITSGGPIILMQVENEYGGYGS-----EKKYLQELVTMMKENGVTVPLVTS 187
Query: 218 ------MCQQFDTPDPVINTCNSFYCDQFTPH---------SPSMPKIWTENWPGWFKTF 262
M + + + T N C P P + E W GWF +
Sbjct: 188 DGPWGDMLENGSLQESALPTVN---CGSAIPEHFDRLAAFKQKKGPLMVMEYWIGWFDAW 244
Query: 263 GGRDPHRPSEDIAFSVARFFQ--KGGSVHNYYMYHGGTNFGRTAGGPFI------TTSYD 314
+ H + D+ SV + K GSV N+YM+HGGTNFG G + TTSYD
Sbjct: 245 QDKKHH--TTDVKSSVESLEEILKRGSV-NFYMFHGGTNFGFMNGANYYGKLLPDTTSYD 301
Query: 315 YEAPIDEYGLPRNPKWGHLKEL 336
Y+AP++EYG + K+ KE+
Sbjct: 302 YDAPLNEYG-EQTEKYKAFKEV 322
>gi|126347898|emb|CAJ89618.1| putative beta-galactosidase [Streptomyces ambofaciens ATCC 23877]
Length = 615
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 99/336 (29%), Positives = 158/336 (47%), Gaps = 34/336 (10%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+T+ + + GR ++S ++HY R P W + + G+NT+++YV WN HE
Sbjct: 24 TLTHTHGAFLRRGRPHRVLSGSLHYFRVHPEQWADRLDRLAALGLNTVDTYVPWNFHERR 83
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PG+ F G +L +F+++ Q+A + +++R GP++ AE++ GG+P WL PG R +
Sbjct: 84 PGEARFDGWRDLARFVRLAQRAGLDVMVRPGPYICAEWDNGGLPAWLTGTPGMRLRAGHQ 143
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
P+ + ++ +V + +L A GGP++ Q+ENEYG Y + Y W
Sbjct: 144 PYLDAVARWFDALVPRVA--ELQAVHGGPVVAVQIENEYGSYGDDHA-----YVRWVRDA 196
Query: 207 AVAQNIGVPWIMCQQFDTPDPVI---------------NTCNSFYCDQFTPHSPSMPKIW 251
V + I + D P P++ + + P P +
Sbjct: 197 LVDRGITE---LLYTADGPTPLMLDGGTVPGELAAATFGSRAAEAAALLRSRRPGEPFLC 253
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--- 308
E W GWF +G + R + A V GGSV + YM HGGTNFG AG
Sbjct: 254 AEFWNGWFDHWGEKHHVRSRDGAAQEVEEILDAGGSV-SLYMAHGGTNFGLWAGANHDGG 312
Query: 309 ----ITTSYDYEAPIDEYGLPRNPKWGHLKELHGAI 340
TSYD +AP+ E+G PK+ L+E A+
Sbjct: 313 VLRPTVTSYDSDAPVSEHG-ALTPKFHALRERFAAL 347
>gi|220914306|ref|YP_002489615.1| beta-galactosidase [Arthrobacter chlorophenolicus A6]
gi|219861184|gb|ACL41526.1| Beta-galactosidase [Arthrobacter chlorophenolicus A6]
Length = 586
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 104/313 (33%), Positives = 153/313 (48%), Gaps = 28/313 (8%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
SR +++G I+S AIHY R P +W +++A+ G+NTIE+YV WN H +PG +
Sbjct: 8 SRDFLLDGEPFRILSGAIHYFRVHPDLWADRIRKARLMGLNTIETYVPWNEHSSTPGAFR 67
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
G +L +F+ ++ M I+R GP++ AE++ GG+P WL P R+ +
Sbjct: 68 TDGGLDLGRFLDLVAAEGMQGIVRPGPYICAEWDNGGLPAWLFTDPSIGVRSSEPGYLAA 127
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+ FM ++ ++ ++ ++GGP+IL Q+ENEYG Y S K Y A
Sbjct: 128 VDGFMDRLLPIVVERQI--TRGGPVILFQIENEYGAYGS-----DKAYLQHLVDTATRAG 180
Query: 212 IGVPWIMCQQ------FDTPDPVINTCNSF--YCDQ----FTPHSPSMPKIWTENWPGWF 259
+ VP C Q D P ++ +F D+ P P + E W GWF
Sbjct: 181 VEVPLFTCDQPFETMIEDGSLPGLHKTGTFGSRADERLAFLRERQPDGPLMCAEFWNGWF 240
Query: 260 KTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-------PFITTS 312
+G + A + G SV N YM+HGGTNFG T G P I TS
Sbjct: 241 DNWGTHHHTTDAAASAAELDALLAAGASV-NIYMFHGGTNFGFTNGANDKGIYEPTI-TS 298
Query: 313 YDYEAPIDEYGLP 325
YDY+AP+ E G P
Sbjct: 299 YDYDAPLSEDGHP 311
>gi|289768016|ref|ZP_06527394.1| beta-galactosidase [Streptomyces lividans TK24]
gi|289698215|gb|EFD65644.1| beta-galactosidase [Streptomyces lividans TK24]
Length = 595
Score = 153 bits (386), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 98/331 (29%), Positives = 157/331 (47%), Gaps = 34/331 (10%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
++Y +L+ NGR +++ ++HY R PG W +++ G+N +++YV WN HE +
Sbjct: 5 TLSYTDGTLLRNGRPHRLLAGSLHYFRVHPGHWADRLRRLAALGLNAVDTYVPWNFHERT 64
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G F G +L +FI++ Q+ + +++R GP++ AE++ GG+P WL PG R
Sbjct: 65 AGDIRFDGPRDLARFIRLAQEEGLDVVVRPGPYICAEWDNGGLPAWLTGTPGMRLRTSHG 124
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
P+ + ++ +V + +L A +GGP++ Q+ENEYG Y + Y
Sbjct: 125 PYLEAVDRWFDALVPRIA--ELQAGRGGPVVAVQIENEYGSYGD-----DRAYVRHIRDA 177
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSF---------------YCDQFTPHSPSMPKIW 251
VA+ I + D P P++ + P+ P
Sbjct: 178 LVARGITE---LLYTADGPTPLMQDGGALPGELAAATFGSRPDRAAALLRSRRPAEPFFC 234
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--- 308
E W GWF +G + RP+ A + +GGSV + YM HGGTNFG AG
Sbjct: 235 AEFWNGWFDHWGDKHHVRPAPSAAEDLGGILDEGGSV-SLYMAHGGTNFGLWAGANHEGG 293
Query: 309 ----ITTSYDYEAPIDEYGLPRNPKWGHLKE 335
TSYD +API E G PK+ L++
Sbjct: 294 TIRPTVTSYDSDAPIAENGA-LTPKFFALRD 323
>gi|332879232|ref|ZP_08446929.1| putative beta-galactosidase [Capnocytophaga sp. oral taxon 329 str.
F0087]
gi|357048073|ref|ZP_09109651.1| putative beta-galactosidase [Paraprevotella clara YIT 11840]
gi|332682652|gb|EGJ55552.1| putative beta-galactosidase [Capnocytophaga sp. oral taxon 329 str.
F0087]
gi|355529138|gb|EHG98592.1| putative beta-galactosidase [Paraprevotella clara YIT 11840]
Length = 786
Score = 153 bits (386), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 109/330 (33%), Positives = 162/330 (49%), Gaps = 41/330 (12%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
+++ ++NG+ +I +A +HYPR W ++ K G+NT+ YVFWN HE GK+
Sbjct: 40 NKTFLLNGKPFIIKAAEVHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQEEGKFD 99
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND----TEP 147
F G ++ +FI++ Q+ +Y+I+R GP+V AE+ GG+P WL R E
Sbjct: 100 FTGNNDVAEFIRLAQENGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREQDPYFMER 159
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY-----------ESFYGEGG 196
++ QK I D L +GGPII+ QVENEYG Y + G
Sbjct: 160 YRIFAQKLGEQIGD------LTIEKGGPIIMVQVENEYGSYGEDKPYVSAIRDIIRDSGF 213
Query: 197 KRYAL----WAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWT 252
+ L W++ + W M F T N N F + P P++ +
Sbjct: 214 DKVTLFQCDWSSNFTKNGLDDLVWTM--NFGTG---ANIENEF--KKLGELRPESPQMCS 266
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG------ 306
E W GWF +GGR R S+++ + KG S + YM HGGT++G AG
Sbjct: 267 EFWSGWFDKWGGRHETRGSKEMVGGLKEMLDKGISF-SLYMTHGGTSWGHWAGANSPGFS 325
Query: 307 PFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
P + TSYDY+API+E G PK+ L+E+
Sbjct: 326 PDV-TSYDYDAPINEAG-QVTPKYMELREM 353
>gi|299142590|ref|ZP_07035721.1| beta-galactosidase (Lactase) [Prevotella oris C735]
gi|298576025|gb|EFI47900.1| beta-galactosidase (Lactase) [Prevotella oris C735]
Length = 823
Score = 153 bits (386), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 109/351 (31%), Positives = 165/351 (47%), Gaps = 31/351 (8%)
Query: 4 RTPIAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLV 63
+T IA L++ ++ G+ T + ++NG+ ++ +A +HYPR W +
Sbjct: 47 KTVIA--TLVLSLATLTAPARGGDFTVGKNTFLLNGQPFVVKAAELHYPRIPRPYWEQRI 104
Query: 64 QQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAE 123
+ K G+NT+ YVFWN HE GK+ F G ++ F ++ Q+ MY+I+R GP+V AE
Sbjct: 105 KMCKSLGMNTVCLYVFWNIHEQQEGKFDFTGNNDVAAFCRLAQKNGMYVIVRPGPYVCAE 164
Query: 124 YNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVEN 183
+ GG+P WL R D F ++ F + + L GGPII+ QVEN
Sbjct: 165 WEMGGLPWWLLKKKDIRLREDDPYFMARVKAFEAEVGRQLA--PLTIQNGGPIIMVQVEN 222
Query: 184 EYGYYESFYGEGGKRYALWAAKMAVAQNIG-VPWIMCQ-----QFDTPDPVINTCN---- 233
EYG Y K+Y + A V C + + D ++ T N
Sbjct: 223 EYGSYGV-----NKKYVSQIRDIVKASGFDKVTLFQCDWASNFENNGLDDLVWTMNFGTG 277
Query: 234 ---SFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHN 290
+ P P + +E W GWF +G R RP++ + + K S +
Sbjct: 278 SNIDAQFKRLKQLRPDAPLMCSEFWSGWFDKWGARHETRPAKAMVEGIDEMLSKNISF-S 336
Query: 291 YYMYHGGTNFGRTAG------GPFITTSYDYEAPIDEYGLPRNPKWGHLKE 335
YM HGGT+FG AG P + TSYDY+API+EYG PK+ L++
Sbjct: 337 LYMTHGGTSFGHWAGANSPGFAPDV-TSYDYDAPINEYGHA-TPKFWELRK 385
>gi|218260271|ref|ZP_03475643.1| hypothetical protein PRABACTJOHN_01305, partial [Parabacteroides
johnsonii DSM 18315]
gi|218224641|gb|EEC97291.1| hypothetical protein PRABACTJOHN_01305 [Parabacteroides johnsonii
DSM 18315]
Length = 539
Score = 153 bits (386), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 115/354 (32%), Positives = 171/354 (48%), Gaps = 33/354 (9%)
Query: 5 TPIAPFALLIFFSSSITYCFAGNVTY--DSRSLIINGRRELIISAAIHYPRSVPGMWPGL 62
T I ALL+F S AG T+ +++ +++G+ +I +A IHY R W
Sbjct: 7 TAIWLTALLLFAFSGCNQKPAGEHTFAIGNKTFLLDGKPFVIKAAEIHYTRIPAEYWEHR 66
Query: 63 VQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAA 122
+Q K G+NTI Y FWN HE PG++ F G+ ++ F ++ Q+ MY++LR GP+V +
Sbjct: 67 IQLCKALGMNTICIYAFWNIHEQKPGEFDFSGQNDIAAFCRLAQKYDMYIMLRPGPYVCS 126
Query: 123 EYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVE 182
E+ GG+P WL R + F + FM I + L ++GG II+ QVE
Sbjct: 127 EWEMGGLPWWLLKKDDIKLRTNDPYFLERTKLFMNEIGKQLA--DLQITKGGNIIMVQVE 184
Query: 183 NEYGYYESFYGEGGKRYALWAAKMAVAQNIG---VPWIMCQ-----QFDTPDPVINTCN- 233
NEYG Y + K Y A + + G VP C Q + D ++ T N
Sbjct: 185 NEYGSYAT-----DKEYI--ANIRDIVKGAGFTDVPLFQCDWSSNFQNNALDDLVWTINF 237
Query: 234 ---SFYCDQFTPHS---PSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGS 287
+ +QF P+ P + +E W GWF +G + R +E + + +G S
Sbjct: 238 GTGANIDEQFKKLKEVRPNTPLMCSEFWSGWFDHWGRKHETRDAETMVSGLKDMLDRGIS 297
Query: 288 VHNYYMYHGGTNFGRTAGG-----PFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
+ YM HGGT FG G + +SYDY+API E G PK+ L+EL
Sbjct: 298 F-SLYMTHGGTTFGHWGGANSPAYSAMCSSYDYDAPISEAGW-TTPKYFKLREL 349
>gi|156382804|ref|XP_001632742.1| predicted protein [Nematostella vectensis]
gi|156219802|gb|EDO40679.1| predicted protein [Nematostella vectensis]
Length = 612
Score = 153 bits (386), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 112/340 (32%), Positives = 170/340 (50%), Gaps = 28/340 (8%)
Query: 31 DSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKY 90
+ R ++G+ I+S A+HY R P W + + K G+NT+E+YV WN HE G +
Sbjct: 45 NGRHFTMDGKPFTILSGAMHYFRIPPQYWEDRIVKLKAMGLNTVETYVSWNLHEEIQGDF 104
Query: 91 YFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRN-DTEPFK 149
F ++V+FIK Q+ +Y+I+R GP++ AE++ GG+P WL + P R+ D K
Sbjct: 105 NFKDGLDIVEFIKTAQKHDLYVIMRPGPYICAEWDLGGLPSWLLHNPNIYLRSLDPIFMK 164
Query: 150 YHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE---SFYGEGGKRYALWAAKM 206
++ F LI ++ + S GGPII Q+ENEY Y+ ++ + + + K
Sbjct: 165 ATLRFFDELIPRLIDYQ---YSNGGPIIAWQIENEYLSYDNSSAYMRKLQQEMVIRGVKE 221
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCN-----SFYCDQFTPHSPSMPKIWTENWPGWFKT 261
+ + G+ + ++ + V+ T N + P+MP + TE W GWF
Sbjct: 222 LLFTSDGIWQMQIEKKYSLPGVLKTVNFQRNETNILKGLRKLQPNMPLMVTEFWSGWFDH 281
Query: 262 FGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--------PFITTSY 313
+ G D H + + A + K S NYYM HGGTNFG G P I TSY
Sbjct: 282 W-GEDKHVLTVEKAAERTKNILKMESSINYYMLHGGTNFGFMNGANAENGKYKPTI-TSY 339
Query: 314 DYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERS 353
DY+API E G PK+ L+E KL ++A N S
Sbjct: 340 DYDAPISESG-DITPKYRELRE-----KLLKYAPKNSRMS 373
>gi|21224660|ref|NP_630439.1| beta-galactosidase [Streptomyces coelicolor A3(2)]
gi|3367753|emb|CAA20078.1| beta-galactosidase [Streptomyces coelicolor A3(2)]
Length = 595
Score = 153 bits (386), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 98/331 (29%), Positives = 157/331 (47%), Gaps = 34/331 (10%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
++Y +L+ NGR +++ ++HY R PG W +++ G+N +++YV WN HE +
Sbjct: 5 TLSYTDGTLLRNGRPHRLLAGSLHYFRVHPGHWADRLRRLAALGLNAVDTYVPWNFHERT 64
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G F G +L +FI++ Q+ + +++R GP++ AE++ GG+P WL PG R
Sbjct: 65 AGDIRFDGPRDLARFIRLAQEEGLDVVVRPGPYICAEWDNGGLPAWLTGTPGMRLRTSHG 124
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
P+ + ++ +V + +L A +GGP++ Q+ENEYG Y + Y
Sbjct: 125 PYLEAVDRWFDALVPRIA--ELQAGRGGPVVAVQIENEYGSYGD-----DRAYVRHIRDA 177
Query: 207 AVAQNIGVPWIMCQQFDTPDPVINTCNSF---------------YCDQFTPHSPSMPKIW 251
VA+ I + D P P++ + P+ P
Sbjct: 178 LVARGITE---LLYTADGPTPLMQDGGALPGELAAATFGSRPDRAAALLRSRRPAEPFFC 234
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--- 308
E W GWF +G + RP+ A + +GGSV + YM HGGTNFG AG
Sbjct: 235 AEFWNGWFDHWGDKHHVRPAPSAAEDLGGILDEGGSV-SLYMAHGGTNFGLWAGANHEGG 293
Query: 309 ----ITTSYDYEAPIDEYGLPRNPKWGHLKE 335
TSYD +API E G PK+ L++
Sbjct: 294 TIRPTVTSYDSDAPIAENGA-LTPKFFALRD 323
>gi|333377694|ref|ZP_08469427.1| hypothetical protein HMPREF9456_01022 [Dysgonomonas mossii DSM
22836]
gi|332883714|gb|EGK03994.1| hypothetical protein HMPREF9456_01022 [Dysgonomonas mossii DSM
22836]
Length = 630
Score = 152 bits (385), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 115/354 (32%), Positives = 165/354 (46%), Gaps = 35/354 (9%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
F LL FS S + + +G+ IIS +HYPR W +Q K
Sbjct: 10 FILLFVFSISSFSQKKHTFEIKNGDFVYDGKPVRIISGEMHYPRIPHQYWRHRMQMLKAM 69
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G+N + +YVFWN HE PGK+ F G NL ++IKI + + +ILR GP+V AE+ +GG
Sbjct: 70 GLNAVATYVFWNIHEPEPGKWDFTGDKNLAEYIKIAGEEGLMVILRPGPYVCAEWEFGGY 129
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKRE--KLFASQGGPIILAQVENEYGY 187
P WL + G R D E F K+ L ++ + +E L ++GGPI++ Q ENE+G
Sbjct: 130 PWWLQNVEGLELRRDNEQF----LKYTQLYINRLYKEVGNLQITKGGPIVMVQAENEFGS 185
Query: 188 YESFYG----EGGKRYALWAAKMAVAQNIGVP-------WI-----MCQQFDTPDPVINT 231
Y S E +RY + VP W+ + T + N
Sbjct: 186 YVSQRKDIPLEEHRRYNAKIVQQLKDAGFDVPSFTSDGSWLFEGGAVPGALPTANGESNI 245
Query: 232 CN-SFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHN 290
N D++ + P + E +PGW + P + IA ++ Q S+ N
Sbjct: 246 ENLKKAVDKY--NGGQGPYMVAEFYPGWLAHWLEPHPQISATSIARQTEKYLQNNVSI-N 302
Query: 291 YYMYHGGTNFGRTAGGPFIT--------TSYDYEAPIDEYGLPRNPKWGHLKEL 336
YYM HGGTNFG T+G + TSYDY+API E G PK+ L+ +
Sbjct: 303 YYMVHGGTNFGFTSGANYDKKHDIQPDLTSYDYDAPISEAGW-VTPKYDSLRNV 355
>gi|330997880|ref|ZP_08321714.1| putative beta-galactosidase [Paraprevotella xylaniphila YIT 11841]
gi|329569484|gb|EGG51254.1| putative beta-galactosidase [Paraprevotella xylaniphila YIT 11841]
Length = 786
Score = 152 bits (385), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 107/327 (32%), Positives = 164/327 (50%), Gaps = 35/327 (10%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
+++ ++NG+ +I +A +HYPR W ++ K G+NT+ YVFWN HE GK+
Sbjct: 40 NKTFLLNGKPFIIKAAEVHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQEEGKFD 99
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G ++ +FI++ Q+ +Y+I+R GP+V AE+ GG+P WL R E Y
Sbjct: 100 FTGNNDVAEFIRLAQENGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLR---EQDPYF 156
Query: 152 MQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYY-----------ESFYGEGGKRY 199
M+++ + ++ L +GGPII+ QVENEYG Y + G +
Sbjct: 157 MERYRIFAKKLGEQIGDLTIEKGGPIIMVQVENEYGSYGEDKPYVSGIRDIIRDSGFDKV 216
Query: 200 AL----WAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENW 255
L W++ + W M F T N N F + P P++ +E W
Sbjct: 217 TLFQCDWSSNFTKNGLDDLVWTM--NFGTG---ANIENEF--KKLGELRPESPQMCSEFW 269
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG------PFI 309
GWF +GGR R S+++ + KG S + YM HGGT++G AG P +
Sbjct: 270 SGWFDKWGGRHETRGSKEMVGGLKEMLDKGISF-SLYMTHGGTSWGHWAGANSPGFSPDV 328
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+API+E G PK+ L+E+
Sbjct: 329 -TSYDYDAPINEAG-QVTPKYMELREM 353
>gi|357014284|ref|ZP_09079283.1| beta-galactosidase [Paenibacillus elgii B69]
Length = 591
Score = 152 bits (385), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 106/324 (32%), Positives = 161/324 (49%), Gaps = 36/324 (11%)
Query: 29 TYDSRS--LIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
T+D ++ ++G ++S AIHY R VP W + + K G NT+E+Y+ WN HE
Sbjct: 3 TFDVQNGQFCLDGESIRLVSGAIHYFRVVPEYWRDRLLKLKACGFNTVETYIPWNLHEPK 62
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PG++ F G ++V+F++I + +++I+R P++ AE+ +GG+P WL PG R
Sbjct: 63 PGQFRFDGLADVVRFVEIAGEVGLHVIVRPSPYICAEWEFGGLPAWLLADPGMRVRCMHR 122
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
P+ + + V + + L + GGPII Q+ENEYG Y G R L K
Sbjct: 123 PYLDRVDAYYD--VLLPLLKPLLCTNGGPIIAMQIENEYGSY------GNDRAYLVYLKD 174
Query: 207 AVAQ---------NIGVPWIMCQQFDTPDPVINTCN-----SFYCDQFTPHSPSMPKIWT 252
A+ Q + G M Q P V+ T N + + P P +
Sbjct: 175 AMLQRGMDVLLFTSDGPEHFMLQGGMIPG-VLETVNFGSRAEEAFEMLRKYQPDGPIMCM 233
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG------ 306
E W GWF +G + R ++D+A + G SV N+YM+HGGTNFG +G
Sbjct: 234 EYWNGWFDHWGEQHHTRDAKDVADVFDDMLRLGASV-NFYMFHGGTNFGYMSGANCPQRD 292
Query: 307 ---PFITTSYDYEAPIDEYGLPRN 327
P I TSYDY+ P++E G P +
Sbjct: 293 HYEPTI-TSYDYDVPLNESGEPTD 315
>gi|374606374|ref|ZP_09679251.1| beta-galactosidase [Paenibacillus dendritiformis C454]
gi|374388019|gb|EHQ59464.1| beta-galactosidase [Paenibacillus dendritiformis C454]
Length = 583
Score = 152 bits (384), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 105/332 (31%), Positives = 158/332 (47%), Gaps = 32/332 (9%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
++YD + R +IS AIHY R VP W +++ K G N IE+YV WN HE
Sbjct: 3 TLSYDQGQFTMGDRPIQLISGAIHYFRVVPAYWEDRLRKIKAMGCNCIETYVAWNLHEPR 62
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+++F G ++ +F+++ + +Y+I+R P++ AE+ +GG+P WL + ND
Sbjct: 63 EGEFHFEGMSDVAEFVRLAGELGLYVIVRPSPYICAEWEFGGLPAWLLKDDMRLRCNDPR 122
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
+ + L+ + L A++GGPII Q+ENEYG Y G A A+
Sbjct: 123 FLEKVAAYYDALLPQLT---PLLATKGGPIIAVQIENEYGSY-------GNDQAYLQAQR 172
Query: 207 AVAQNIGVPWIM---------CQQFDTPDPVINTCN-----SFYCDQFTPHSPSMPKIWT 252
A+ GV ++ Q + V+ T N D+ + P P +
Sbjct: 173 AMLIERGVDVLLFTSDGPQDDMLQGGMAEGVLATVNFGSRPKEAFDKLKEYQPDGPLMCM 232
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF---- 308
E W GWF + + R +ED A + G SV N+YM HGGTNFG +G
Sbjct: 233 EYWNGWFDHWFEQHHTRDAEDAARVLDDMLGMGASV-NFYMVHGGTNFGFGSGANHSDKY 291
Query: 309 --ITTSYDYEAPIDEYGLPRNPKWGHLKELHG 338
TSYDY+A I E G PK+ +E+ G
Sbjct: 292 EPTVTSYDYDAAISEAG-DLTPKYHAFREVIG 322
>gi|156552637|ref|XP_001603160.1| PREDICTED: beta-galactosidase-like [Nasonia vitripennis]
Length = 629
Score = 152 bits (384), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 102/334 (30%), Positives = 168/334 (50%), Gaps = 26/334 (7%)
Query: 22 YCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWN 81
Y FA + Y++ +++G+ +S + HY R+ W G++++ + GG+N + +YV W+
Sbjct: 29 YSFA--IDYENDQFLLDGKPFRYVSGSFHYFRTPRQHWRGILRKMRAGGLNAVSTYVEWS 86
Query: 82 GHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWL-HYIPGTV 140
HE ++ + G ++V+FIKI Q+ +++ILR GP++ AE ++GG P WL +P
Sbjct: 87 MHEPEFDQWVWDGDADIVEFIKIAQEEDLFVILRPGPYICAERDFGGFPYWLLSRVPDIK 146
Query: 141 FRNDTEPFKYHMQKFMTLIVDMMKREK-LFASQGGPIILAQVENEYG-------YYESFY 192
R E + ++ ++F+ ++++R K L GGPII+ QVENEYG Y+S
Sbjct: 147 LRTKDERYVFYAERFLN---EILRRTKPLLRGNGGPIIMVQVENEYGSFYACDDQYKSKM 203
Query: 193 GEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNS----FYCDQFTPHSPSMP 248
E R+ A + + C I+ N F SP P
Sbjct: 204 YEIFHRHVKNDAVLFTTDGSARSMLKCGSIPGVYATIDFGNGANVPFNYKIMREFSPKGP 263
Query: 249 KIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF 308
+ +E +PGW +G S ++A ++ SV N YMY+GGTNF T+G
Sbjct: 264 LVNSEYYPGWLTHWGESFQRVNSHNVAKTLDEMLAYNVSV-NIYMYYGGTNFAFTSGANI 322
Query: 309 ------ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+AP+ E G P PK+ L+++
Sbjct: 323 NEHYWPQLTSYDYDAPLTEAGDP-TPKYFELRDV 355
>gi|294672870|ref|YP_003573486.1| beta-galactosidase [Prevotella ruminicola 23]
gi|294473700|gb|ADE83089.1| putative beta-galactosidase [Prevotella ruminicola 23]
Length = 787
Score = 152 bits (384), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 108/335 (32%), Positives = 164/335 (48%), Gaps = 37/335 (11%)
Query: 11 ALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGG 70
ALL+ F+ + AG+ T +++ ++NG ++ +A +HYPR W ++ K G
Sbjct: 10 ALLLTFAQ---FASAGDFTVGNKTFLLNGEPFVVKAAEVHYPRIPRPYWEHRIKMCKALG 66
Query: 71 VNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIP 130
+NT+ YVFWN HE G++ F ++ +F ++ Q+ MY+I+R GP+V AE+ GG+P
Sbjct: 67 MNTLCIYVFWNIHEQREGQFDFTDNNDVAEFCRLAQKNGMYVIVRPGPYVCAEWEMGGLP 126
Query: 131 VWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY-- 188
WL R F ++ F + + + L GGPII+ QVENEYG Y
Sbjct: 127 WWLLKKKDIRLRERDPYFLERVKIFEQKVGEQLA--PLTIQNGGPIIMVQVENEYGSYGE 184
Query: 189 ---------ESFYGEGGKRYAL----WAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSF 235
+ G G++ L W++ + W M F T N + F
Sbjct: 185 DKPYVSEIRDCLRGIYGEKLTLFQCDWSSNFERNGLDDLVWTM--NFGTG---ANIDHEF 239
Query: 236 -YCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMY 294
Q P++P M +E W GWF +G RP++D+ + K S + YM
Sbjct: 240 ARLKQLRPNAPLM---CSEFWSGWFDKWGANHETRPAKDMVDGMDEMLSKNISF-SLYMT 295
Query: 295 HGGTNFGRTAG------GPFITTSYDYEAPIDEYG 323
HGGT+FG AG P + TSYDY+API+EYG
Sbjct: 296 HGGTSFGHWAGANSPGFAPDV-TSYDYDAPINEYG 329
>gi|297204198|ref|ZP_06921595.1| beta-galactosidase [Streptomyces sviceus ATCC 29083]
gi|197714112|gb|EDY58146.1| beta-galactosidase [Streptomyces sviceus ATCC 29083]
Length = 588
Score = 152 bits (384), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 105/329 (31%), Positives = 155/329 (47%), Gaps = 31/329 (9%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+T S +++G IIS A+HY R P W +++A+ G+NTIE+Y+ WN HE P
Sbjct: 7 LTTSSDGFLLHGEPFRIISGAMHYFRIHPDQWTDRLRKARLMGLNTIETYLPWNLHEPEP 66
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G G +L +++++ Q ++++LR GPF+ AE++ GG+P WL P R+
Sbjct: 67 GTLVLDGFLDLPRWLRLAQDEGLHVLLRPGPFICAEWDDGGLPAWLLADPDIRLRSSDPR 126
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
F ++ ++ ++ A+ GGP+I QVENEYG Y G A
Sbjct: 127 FTGAFDGYLDQLLPALR--PFMAAHGGPVIAVQVENEYGAY-------GDDTAYLKHVHQ 177
Query: 208 VAQNIGVPWIM--CQQFDTPDPVINTCNSFYCD------------QFTPHSPSMPKIWTE 253
++ GV ++ C Q T H P P + +E
Sbjct: 178 ALRDRGVEELLYTCDQASAEHLAAGTLPGTLATATFGSRVEENLAALRTHQPEGPLMCSE 237
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF----- 308
W GWF +GG R + D A + R G SV N YM+HGGTNFG T G
Sbjct: 238 FWVGWFDHWGGPHHVRSAADAAADLDRLLSAGASV-NIYMFHGGTNFGFTNGANHKHAYE 296
Query: 309 -ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+AP+ E G P PK+ +E+
Sbjct: 297 PTVTSYDYDAPLTESGDP-GPKYHAFREV 324
>gi|386725149|ref|YP_006191475.1| beta-galactosidase [Paenibacillus mucilaginosus K02]
gi|384092274|gb|AFH63710.1| beta-galactosidase [Paenibacillus mucilaginosus K02]
Length = 591
Score = 152 bits (384), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 112/357 (31%), Positives = 164/357 (45%), Gaps = 47/357 (13%)
Query: 38 NGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFN 97
+G + S AIHY R VP W +++ K G NT+E+YV WN HE G++ F G +
Sbjct: 14 DGEELRLYSGAIHYFRIVPEYWEDRLRKLKACGFNTVETYVPWNLHEPQEGRFVFEGMAD 73
Query: 98 LVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMT 157
L +FI++ + +++I+R P++ AE+ +GG+P WL PG R + + +
Sbjct: 74 LERFIRLAGRLGLHVIVRPSPYICAEWEFGGLPAWLLAEPGMKLRCADPLYLSKVDAYYD 133
Query: 158 LIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWI 217
++ + L + GGP+IL QVENEYG Y S K Y V + I VP
Sbjct: 134 ELIPRLV--PLLCTSGGPVILVQVENEYGSYGS-----DKAYLEHLRDGLVRRGIDVPLF 186
Query: 218 --------MCQQFDTPDPVINTCN--SFYCDQFT---PHSPSMPKIWTENWPGWFKTFGG 264
M Q P V+ T N S + F + P P + E W GWF +
Sbjct: 187 TSDGPTDAMLQGGSLPG-VLATVNFGSRTAESFAKLREYQPQGPLMCMEYWNGWFDHWME 245
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI------TTSYDYEAP 318
R + D A + G SV N+YM+HGGTNFG G I TSYDY++P
Sbjct: 246 EHHQRDAADAARVFGEMLEAGASV-NFYMFHGGTNFGFYNGANHIKTYEPTITSYDYDSP 304
Query: 319 IDEYGLP-------RNPKWGHL------------KELHGAIKLCEHALLNGERSNLS 356
+ E+G P R+ HL + +G +++ E A L + LS
Sbjct: 305 LTEWGEPTAKYDAVRDVLAKHLPLGAPELPEPIPRRTYGTVRVTERADLFAQLDRLS 361
>gi|337749468|ref|YP_004643630.1| beta-galactosidase [Paenibacillus mucilaginosus KNP414]
gi|336300657|gb|AEI43760.1| Beta-galactosidase [Paenibacillus mucilaginosus KNP414]
Length = 591
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 112/357 (31%), Positives = 164/357 (45%), Gaps = 47/357 (13%)
Query: 38 NGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFN 97
+G + S AIHY R VP W +++ K G NT+E+YV WN HE G++ F G +
Sbjct: 14 DGEEIRLYSGAIHYFRIVPEYWEDRLRKLKACGFNTVETYVPWNLHEPQEGRFVFEGMAD 73
Query: 98 LVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMT 157
L +FI++ + +++I+R P++ AE+ +GG+P WL PG R + + +
Sbjct: 74 LERFIRLAGRLGLHVIVRPSPYICAEWEFGGLPAWLLAEPGMKLRCADPLYLSKVDAYYD 133
Query: 158 LIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWI 217
++ + L + GGP+IL QVENEYG Y S K Y V + I VP
Sbjct: 134 ELIPRLV--PLLCTSGGPVILVQVENEYGSYGS-----DKAYLEHLRDGLVRRGIDVPLF 186
Query: 218 --------MCQQFDTPDPVINTCN--SFYCDQFT---PHSPSMPKIWTENWPGWFKTFGG 264
M Q P V+ T N S + F + P P + E W GWF +
Sbjct: 187 TSDGPTDSMLQGGSLPG-VLATVNFGSRTAESFAKLREYQPQGPLMCMEYWNGWFDHWME 245
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI------TTSYDYEAP 318
R + D A + G SV N+YM+HGGTNFG G I TSYDY++P
Sbjct: 246 EHHQRDAADAARVFGEMLEAGASV-NFYMFHGGTNFGFYNGANHIKTYEPTITSYDYDSP 304
Query: 319 IDEYGLP-------RNPKWGHL------------KELHGAIKLCEHALLNGERSNLS 356
+ E+G P R+ HL + +G +++ E A L + LS
Sbjct: 305 LTEWGEPTAKYYAVRDVLAEHLPLGAPELPEPIPRRTYGTVRVTERADLFAQLDRLS 361
>gi|335430223|ref|ZP_08557118.1| beta-galactosidase Bga35A [Haloplasma contractile SSD-17B]
gi|334888639|gb|EGM26936.1| beta-galactosidase Bga35A [Haloplasma contractile SSD-17B]
Length = 587
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 94/302 (31%), Positives = 141/302 (46%), Gaps = 32/302 (10%)
Query: 44 IISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIK 103
II+ +HY R++ W + + K G NT+E+YV WN HE G Y F G ++ FI+
Sbjct: 20 IIAGGMHYFRTMKDSWKDRLIKLKAMGCNTVETYVPWNMHEAKKGVYAFNGNLDIKAFIE 79
Query: 104 IIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMM 163
+ Q +++I+R P++ AE+ +GG+P WL PG R +PF H++++ ++ ++
Sbjct: 80 LAQSLELFVIVRPSPYICAEWEFGGLPAWLLKDPGMKVRTVYKPFMKHVKEYFEVLFKIL 139
Query: 164 KREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGV--------- 214
L Q GPIIL Q+ENEYGYY G + + + ++ G
Sbjct: 140 A--PLQIDQDGPIILMQIENEYGYY-------GNDKEYLSTLLKIMRDFGTTVPVVTSDG 190
Query: 215 PW-------IMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDP 267
PW + P T + + F + P + E W GWF +G
Sbjct: 191 PWGEALDAGSLLADVSLPTMNFGTGAKEHIENFKEKYVNKPVMCMEFWVGWFDAWGDDRH 250
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI------TTSYDYEAPIDE 321
H A + R GSV N YM+HGGTNFG G + TSYDY+A + E
Sbjct: 251 HTRDASDAANELRDILNEGSV-NIYMFHGGTNFGFMNGANDLEELKPDVTSYDYDAILTE 309
Query: 322 YG 323
G
Sbjct: 310 CG 311
>gi|379722393|ref|YP_005314524.1| beta-galactosidase [Paenibacillus mucilaginosus 3016]
gi|378571065|gb|AFC31375.1| beta-galactosidase [Paenibacillus mucilaginosus 3016]
Length = 591
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 112/357 (31%), Positives = 164/357 (45%), Gaps = 47/357 (13%)
Query: 38 NGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFN 97
+G + S AIHY R VP W +++ K G NT+E+YV WN HE G++ F G +
Sbjct: 14 DGEEIRLYSGAIHYFRIVPEYWEDRLRKLKACGFNTVETYVPWNLHEPQEGRFVFEGMAD 73
Query: 98 LVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMT 157
L +FI++ + +++I+R P++ AE+ +GG+P WL PG R + + +
Sbjct: 74 LERFIRLAGRLGLHVIVRPSPYICAEWEFGGLPAWLLAEPGMKLRCADPLYLSKVDAYYD 133
Query: 158 LIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWI 217
++ + L + GGP+IL QVENEYG Y S K Y V + I VP
Sbjct: 134 ELIPRLV--PLLCTSGGPVILVQVENEYGSYGS-----DKAYLEHLRDGLVRRGIDVPLF 186
Query: 218 --------MCQQFDTPDPVINTCN--SFYCDQFT---PHSPSMPKIWTENWPGWFKTFGG 264
M Q P V+ T N S + F + P P + E W GWF +
Sbjct: 187 TSDGPTDSMLQGGSLPG-VLATVNFGSRTAESFAKLREYQPQGPLMCMEYWNGWFDHWME 245
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI------TTSYDYEAP 318
R + D A + G SV N+YM+HGGTNFG G I TSYDY++P
Sbjct: 246 EHHQRDAADAARVFGEMLEAGASV-NFYMFHGGTNFGFHNGANHIKTYEPTITSYDYDSP 304
Query: 319 IDEYGLP-------RNPKWGHL------------KELHGAIKLCEHALLNGERSNLS 356
+ E+G P R+ HL + +G +++ E A L + LS
Sbjct: 305 LTEWGEPTAKYYAVRDVLAEHLPLGAPELPEPIPRRTYGTVRVTERADLFAQLDRLS 361
>gi|297735919|emb|CBI18695.3| unnamed protein product [Vitis vinifera]
Length = 113
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 66/98 (67%), Positives = 83/98 (84%)
Query: 71 VNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIP 130
+N IE+YVFW GHELSPG YYFGG ++L+KF+KI+QQ M++IL IGPFVA E+N+ GIP
Sbjct: 9 INVIETYVFWIGHELSPGNYYFGGWYDLLKFVKIVQQDGMWLILHIGPFVATEWNFSGIP 68
Query: 131 VWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKL 168
VWLHY+ GTVF ++EPFKYHMQKFMTLIV++MK+
Sbjct: 69 VWLHYVLGTVFWTNSEPFKYHMQKFMTLIVNIMKKRSF 106
>gi|156376589|ref|XP_001630442.1| predicted protein [Nematostella vectensis]
gi|156217463|gb|EDO38379.1| predicted protein [Nematostella vectensis]
Length = 570
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 100/316 (31%), Positives = 148/316 (46%), Gaps = 36/316 (11%)
Query: 55 VPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMIL 114
+P W + + K G+NT+E+YV WN HE + F ++VKF+K+ Q+ +Y+I+
Sbjct: 1 MPEYWKDRLVKLKAMGLNTVETYVAWNLHEQVQDNFKFKDELDIVKFVKLAQRLGLYVII 60
Query: 115 RIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGG 174
R GP++ AE++ GG+P WL P R PF + ++ + ++ L QGG
Sbjct: 61 RPGPYICAEWDLGGLPSWLLSDPEMKLRTSYGPFMEAVDRYFQKLFPLL--TPLQYCQGG 118
Query: 175 PIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTP-----DPVI 229
PII Q+ENEY SF + Y KM V + +M + + V+
Sbjct: 119 PIIAWQIENEYS---SFDKKVDMTYMELLQKMMVKNGVTEMLLMSDNLFSMKTHPINLVL 175
Query: 230 NTCN-----SFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQK 284
T N Q P P + TE WPGWF +G + P+E + + F
Sbjct: 176 KTINLQKNVKDALLQLKEIQPDKPLMVTEFWPGWFDVWGAKHHILPTEKLIKEIKDLFSL 235
Query: 285 GGSVHNYYMYHGGTNFGRTAGGPFI--------------TTSYDYEAPIDEYGLPRNPKW 330
G S+ N+YM+HGGTNFG G F TSYDY+AP+ E G PK+
Sbjct: 236 GASI-NFYMFHGGTNFGFMNGASFTPSGVSVLEGDYQPDITSYDYDAPLSESG-DITPKY 293
Query: 331 GHLKELHGAIKLCEHA 346
L++ + EHA
Sbjct: 294 KALRKF-----IREHA 304
>gi|390336578|ref|XP_792349.2| PREDICTED: beta-galactosidase-like [Strongylocentrotus purpuratus]
Length = 671
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 100/329 (30%), Positives = 167/329 (50%), Gaps = 24/329 (7%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+ YDS + + +G+ +S + HY R W + + K G+N +++YV WN HEL P
Sbjct: 31 IDYDSNTFLKDGQPFRYVSGSFHYSRVPAFYWQDRLDKMKMAGLNAVQTYVIWNFHELKP 90
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G++ F G +++ F+K + +ILR GP++ E++ GG+P WL IPG V R+ +
Sbjct: 91 GEFNFDGDHDILSFLKKANDTGLAVILRPGPYICGEWDLGGLPAWLLNIPGIVLRSSNDL 150
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKR-YALWAAKM 206
+ H+ ++M + + R L+ + GGPII+ QVENEYG Y++ + ++ Y L+ A +
Sbjct: 151 YMAHVTEWMNFFLPKL-RPYLYVN-GGPIIMVQVENEYGSYQTCDHQYQRQLYHLFRANL 208
Query: 207 A------VAQNIGVPWIMC----QQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWP 256
G + C + T D + ++ + P P + +E +
Sbjct: 209 GPDVVLFTTDGPGDHLLQCGTLQDMYATIDFGAGSNSTGMFQEMRKFEPKGPLVNSEYYT 268
Query: 257 GWFKTFGGRDPHRPSEDIAF--SVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT---- 310
GW + PH+ + A S+ + G +V N YM+ GGTNFG G + T
Sbjct: 269 GWLDHW--EHPHQTVKTAAVCTSLDQMLALGANV-NMYMFEGGTNFGFWNGANYPTFNPQ 325
Query: 311 -TSYDYEAPIDEYGLPRNPKWGHLKELHG 338
TSYDY+AP+ E G P PK+ ++ + G
Sbjct: 326 PTSYDYDAPLTEAGDP-TPKYMAIRNVIG 353
>gi|313241555|emb|CBY33800.1| unnamed protein product [Oikopleura dioica]
Length = 571
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 158/320 (49%), Gaps = 25/320 (7%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+T D + ++G+ I+S AIHY R W +Q + G+NTI+ Y+ WN HE
Sbjct: 8 LTADGDTFKLDGKDFRILSGAIHYFRIPKQSWKHRLQSVVDCGLNTIDVYIPWNLHEKER 67
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G + FGG +LV+F I + + ++ R GP++ +E+++GG+P WL P R++
Sbjct: 68 GNFDFGGELDLVEFFTIAAEMGLKVLCRPGPYICSEWDWGGLPSWLLKDPKMHIRSNYCG 127
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
++ + + + ++ ++ L S GGPII QVENEYG Y + + W A +
Sbjct: 128 YQAAVSSYFSKLLPLLA--PLQHSNGGPIIAFQVENEYGDYV----DKDNEHLPWLADLM 181
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHS-----PSMPKIWTENWPGWFKTF 262
+ + + + T I N + TP S P+ P + TE W GWF +
Sbjct: 182 KSHGLFELFFISDGGHT----IRKANMLKLTKSTPISLKSLQPNKPMLVTEFWAGWFDYW 237
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI--------TTSYD 314
G ++ ++ ++G SV N+YM+HGGTNFG G + TSYD
Sbjct: 238 GHGRNLLNNDVFEKTLKEILKRGASV-NFYMFHGGTNFGFMNGAIELEKGYYTADVTSYD 296
Query: 315 YEAPIDEYGLPRNPKWGHLK 334
Y+ P+DE G R KW +K
Sbjct: 297 YDCPVDESG-NRTEKWEIIK 315
>gi|307188518|gb|EFN73255.1| Beta-galactosidase [Camponotus floridanus]
Length = 624
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 100/346 (28%), Positives = 174/346 (50%), Gaps = 35/346 (10%)
Query: 18 SSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESY 77
S+ T+ ++ V Y++ +++G+ +S + HY R+ W +++ + G+N + +Y
Sbjct: 24 SNDTWQYSFGVDYENNQFLLDGKPFRYVSGSFHYFRAPRQYWRDRLRKMRAAGLNAVSTY 83
Query: 78 VFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW-LHYI 136
V W+ HE PG++ + G +L++F+ I Q+ ++++LR GP++ AE + GG+P W L
Sbjct: 84 VEWSLHEPEPGQFNWAGDADLIEFLNIAQEEDLFVLLRPGPYICAERDLGGLPYWLLREA 143
Query: 137 PGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY-------- 188
P R F + ++ +++ +K L GGPII+ Q+ENEYG Y
Sbjct: 144 PDIKLRTKDAAFMKYATAYLNQVLEKVK--PLLRGNGGPIIMVQIENEYGSYNACDTEYT 201
Query: 189 ----ESFYGEGGKRYALWAAKMAVAQNIGVPWI----MCQQFDTPDPVINTCNSFYCDQF 240
E G+ G + L+ A A + ++ F T +N NSF +
Sbjct: 202 DMLKEIIVGKVGSKALLYTTDGASASLLRCGFVPGAYATIDFGTS---VNVTNSFQSMRL 258
Query: 241 TPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNF 300
+ P P + +E +PGW +G +E + ++ G SV N YM++GGTNF
Sbjct: 259 --YQPRGPLVNSEFYPGWLTHWGETFQRVKTEAVTKTLREMLALGASV-NIYMFYGGTNF 315
Query: 301 GRTAGG--------PFITTSYDYEAPIDEYGLPRNPKWGHLKELHG 338
G T+G P I TSYDY+AP+ E G P + K+ ++++ G
Sbjct: 316 GFTSGANGGVGAYSPQI-TSYDYDAPLTEAGDPTD-KYFAIRDVIG 359
>gi|260813304|ref|XP_002601358.1| hypothetical protein BRAFLDRAFT_114709 [Branchiostoma floridae]
gi|229286653|gb|EEN57370.1| hypothetical protein BRAFLDRAFT_114709 [Branchiostoma floridae]
Length = 638
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 106/341 (31%), Positives = 167/341 (48%), Gaps = 43/341 (12%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+ + + ++G+ I+S AIHY R W + + K G+NT+E+YV WN HE
Sbjct: 11 LVAEGENFTLDGKPVQILSGAIHYFRVPREYWRDRMLKLKACGLNTLETYVCWNLHEPEK 70
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GK+ F G ++ +++ +++I R GP++ AE++YGG+P WL P R +P
Sbjct: 71 GKFDFTGMLDIAAYLREAANLGLWVIFRPGPYICAEWDYGGLPSWLLRDPNMQVRTTYQP 130
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
+ +++F ++ ++K +GGPII QVENEYG Y +Y L A K A
Sbjct: 131 YMEAVERFFDALLPIVK--PFQYKEGGPIIAMQVENEYGSYAR-----DDKY-LTAVKQA 182
Query: 208 VAQNIGVPWIMCQ----QFDTPDP-----VINTCNSFY-----CDQFTPHSPSMPKIWTE 253
+ Q G+ ++ Q + + V+ T N + P+ P++ E
Sbjct: 183 I-QKRGIEELLLTSDGGQIERLERGCIPGVLMTANFNFNPKKQLGALKKLQPNRPQMVME 241
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVH------NYYMYHGGTNFGRTAGGP 307
W GWF + GRD H+ V +F Q G + N+YM+HGGTNFG G
Sbjct: 242 FWSGWFDHW-GRDHHK------LHVEKFEQLLGDILRFPSSVNFYMFHGGTNFGFMNGAN 294
Query: 308 FI------TTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKL 342
+I TSYDY+AP+ E G P PK+ +EL + +
Sbjct: 295 YINGYKPDVTSYDYDAPLSEAGDP-TPKYYKTRELLKTLAM 334
>gi|402813167|ref|ZP_10862762.1| beta-galactosidase Bga [Paenibacillus alvei DSM 29]
gi|402509110|gb|EJW19630.1| beta-galactosidase Bga [Paenibacillus alvei DSM 29]
Length = 580
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 102/330 (30%), Positives = 161/330 (48%), Gaps = 30/330 (9%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
++Y+ + ++ G+ +IS A+HY R VP W +++ K G N +E+Y+ WN HE
Sbjct: 4 LSYEDQHFMLEGKPIQLISGAVHYFRIVPEYWEDRLRKVKAMGCNCVETYIAWNVHEPRD 63
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G++ F G ++V+FI+I Q+ + +I+R P++ AE+ +GG+P WL R
Sbjct: 64 GQFNFDGIADVVEFIRIAQRVDLLVIVRPSPYICAEWEFGGMPAWL-LKEDIRLRCSDPR 122
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK-M 206
F + + ++ +K L ++ GGPII Q+ENEYG Y G + L A + M
Sbjct: 123 FLEKVSAYYDALIPQLK--PLLSTSGGPIIAVQIENEYGSY------GNDQAYLQALRNM 174
Query: 207 AVAQNIGV-------PWIMCQQFDTPDPVINTCN-----SFYCDQFTPHSPSMPKIWTEN 254
V + I V P Q + V+ T N + + P+ P + E
Sbjct: 175 LVERGIDVLLFTSDGPADDMLQGGMTEGVLATVNFGSRPKEAFGKLEEYQPNAPLMCMEY 234
Query: 255 WPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF------ 308
W GWF + R +ED A + G SV N+YM HGGTNFG ++G
Sbjct: 235 WNGWFDHWFEEHHTRSAEDAAQVLDEMLSMGASV-NFYMLHGGTNFGFSSGANHGGRYKP 293
Query: 309 ITTSYDYEAPIDEYGLPRNPKWGHLKELHG 338
TSYDY++ I E G PK+ +++ G
Sbjct: 294 TVTSYDYDSAISEAG-DITPKYQLFRKVIG 322
>gi|301309736|ref|ZP_07215675.1| beta-galactosidase (Lactase) [Bacteroides sp. 20_3]
gi|423340209|ref|ZP_17317948.1| hypothetical protein HMPREF1059_03873 [Parabacteroides distasonis
CL09T03C24]
gi|300831310|gb|EFK61941.1| beta-galactosidase (Lactase) [Bacteroides sp. 20_3]
gi|409227644|gb|EKN20540.1| hypothetical protein HMPREF1059_03873 [Parabacteroides distasonis
CL09T03C24]
Length = 765
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 100/310 (32%), Positives = 144/310 (46%), Gaps = 26/310 (8%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
+NG+ I+S +HYPR W ++ + G+NT+ +YVFWN HE PGK+ F G
Sbjct: 36 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 95
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
NL ++I+I + + +ILR GP+V AE+ +GG P WL IPG R D F + ++
Sbjct: 96 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 155
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYY----ESFYGEGGKRYALWAAKMAVAQNI 212
+ + + L S+GGPII+ Q ENE+G Y + E +RY +
Sbjct: 156 DKLYEQVG--DLQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLADAGF 213
Query: 213 GVPWI------MCQQFDTPDPVINTCNSFYCDQFTP-----HSPSMPKIWTENWPGWFKT 261
VP + + TP + + H P + E +PGW
Sbjct: 214 NVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPGWLMH 273
Query: 262 FGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT--------TSY 313
+ P IA + Q S N+YM HGGTNFG T+G + TSY
Sbjct: 274 WAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHDIQPDLTSY 332
Query: 314 DYEAPIDEYG 323
DY+API E G
Sbjct: 333 DYDAPISEAG 342
>gi|298376422|ref|ZP_06986377.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_19]
gi|298266300|gb|EFI07958.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_19]
Length = 768
Score = 150 bits (378), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 100/310 (32%), Positives = 144/310 (46%), Gaps = 26/310 (8%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
+NG+ I+S +HYPR W ++ + G+NT+ +YVFWN HE PGK+ F G
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
NL ++I+I + + +ILR GP+V AE+ +GG P WL IPG R D F + ++
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYY----ESFYGEGGKRYALWAAKMAVAQNI 212
+ + + L S+GGPII+ Q ENE+G Y + E +RY +
Sbjct: 159 DKLYEQVG--DLQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLADAGF 216
Query: 213 GVPWI------MCQQFDTPDPVINTCNSFYCDQFTP-----HSPSMPKIWTENWPGWFKT 261
VP + + TP + + H P + E +PGW
Sbjct: 217 NVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPGWLMH 276
Query: 262 FGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT--------TSY 313
+ P IA + Q S N+YM HGGTNFG T+G + TSY
Sbjct: 277 WAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHDIQPDLTSY 335
Query: 314 DYEAPIDEYG 323
DY+API E G
Sbjct: 336 DYDAPISEAG 345
>gi|256840666|ref|ZP_05546174.1| glycoside hydrolase, family 35 [Parabacteroides sp. D13]
gi|256737938|gb|EEU51264.1| glycoside hydrolase, family 35 [Parabacteroides sp. D13]
Length = 768
Score = 150 bits (378), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 100/310 (32%), Positives = 144/310 (46%), Gaps = 26/310 (8%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
+NG+ I+S +HYPR W ++ + G+NT+ +YVFWN HE PGK+ F G
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
NL ++I+I + + +ILR GP+V AE+ +GG P WL IPG R D F + ++
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYY----ESFYGEGGKRYALWAAKMAVAQNI 212
+ + + L S+GGPII+ Q ENE+G Y + E +RY +
Sbjct: 159 DKLYEQVG--DLQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLADAGF 216
Query: 213 GVPWI------MCQQFDTPDPVINTCNSFYCDQFTP-----HSPSMPKIWTENWPGWFKT 261
VP + + TP + + H P + E +PGW
Sbjct: 217 NVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPGWLMH 276
Query: 262 FGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT--------TSY 313
+ P IA + Q S N+YM HGGTNFG T+G + TSY
Sbjct: 277 WAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHDIQPDLTSY 335
Query: 314 DYEAPIDEYG 323
DY+API E G
Sbjct: 336 DYDAPISEAG 345
>gi|423331257|ref|ZP_17309041.1| hypothetical protein HMPREF1075_01054 [Parabacteroides distasonis
CL03T12C09]
gi|409230553|gb|EKN23415.1| hypothetical protein HMPREF1075_01054 [Parabacteroides distasonis
CL03T12C09]
Length = 768
Score = 150 bits (378), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 100/310 (32%), Positives = 144/310 (46%), Gaps = 26/310 (8%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
+NG+ I+S +HYPR W ++ + G+NT+ +YVFWN HE PGK+ F G
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
NL ++I+I + + +ILR GP+V AE+ +GG P WL IPG R D F + ++
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYY----ESFYGEGGKRYALWAAKMAVAQNI 212
+ + + L S+GGPII+ Q ENE+G Y + E +RY +
Sbjct: 159 DKLYEQVG--DLQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLADAGF 216
Query: 213 GVPWI------MCQQFDTPDPVINTCNSFYCDQFTP-----HSPSMPKIWTENWPGWFKT 261
VP + + TP + + H P + E +PGW
Sbjct: 217 NVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPGWLMH 276
Query: 262 FGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT--------TSY 313
+ P IA + Q S N+YM HGGTNFG T+G + TSY
Sbjct: 277 WAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHDIQPDLTSY 335
Query: 314 DYEAPIDEYG 323
DY+API E G
Sbjct: 336 DYDAPISEAG 345
>gi|423215069|ref|ZP_17201597.1| hypothetical protein HMPREF1074_03129 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692332|gb|EIY85570.1| hypothetical protein HMPREF1074_03129 [Bacteroides xylanisolvens
CL03T12C04]
Length = 778
Score = 150 bits (378), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 104/331 (31%), Positives = 159/331 (48%), Gaps = 30/331 (9%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
+I FSS+ A + +++G+ ++ +A +HY R W ++ K G+N
Sbjct: 14 VILFSSAQAQTTAHKFEAGKNTFLLDGKPFVVKAAELHYTRIPQAYWSHRIEMCKALGMN 73
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
TI Y+FWN HE GK+ F G+ ++ F K+ QQ MY+I+R GP+V AE+ GG+P W
Sbjct: 74 TICIYIFWNIHEQEEGKFDFAGQNDIAAFCKLAQQHGMYVIVRPGPYVCAEWEMGGLPWW 133
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESF 191
L R +P Y+M++ + ++ K+ L +GG II+ QVENEYG Y
Sbjct: 134 LLKKKDVALRT-LDP--YYMERVGIFMKEVGKQLAPLQVDKGGNIIMVQVENEYGSY--- 187
Query: 192 YGEGGKRYALWAAKMAVAQN--IGVPWIMCQ-----QFDTPDPVINTCN---SFYCDQ-- 239
G + + A + V ++ VP C + D +I T N DQ
Sbjct: 188 ---GTDKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQF 244
Query: 240 --FTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGG 297
P P + +E W GWF +G + RP++D+ + + S + YM HGG
Sbjct: 245 KKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGG 303
Query: 298 TNFGRTAGG-----PFITTSYDYEAPIDEYG 323
T FG G + +SYDY+API E G
Sbjct: 304 TTFGHWGGANNPAYSAMCSSYDYDAPISEAG 334
>gi|260912222|ref|ZP_05918774.1| beta-galactosidase [Prevotella sp. oral taxon 472 str. F0295]
gi|260633656|gb|EEX51794.1| beta-galactosidase [Prevotella sp. oral taxon 472 str. F0295]
Length = 627
Score = 150 bits (378), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 103/329 (31%), Positives = 154/329 (46%), Gaps = 32/329 (9%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKY-YF 92
+ NG+ + S +HY R W ++ K G+N + +YVFWN HE PGK+ +
Sbjct: 41 QFVYNGKPMQLHSGEMHYARVPAPYWRHRMKMMKAMGLNAVATYVFWNYHETEPGKWDWK 100
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G NL +F+K + M +ILR GP+ AE+++GG P WL G V R D +PF
Sbjct: 101 TGNRNLRQFVKTAAEEGMLVILRPGPYCCAEWDFGGYPWWLSKAKGLVIRADNQPFLDSC 160
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY----ESFYGEGGKRYALWAAKMAV 208
+ ++ + M+ L ++GGPII+ Q ENE+G Y + E + Y+ + +
Sbjct: 161 RVYINQLASQMR--DLQITKGGPIIMVQAENEFGSYVAQRKDVPLESHRAYSAKIKQQLI 218
Query: 209 AQNIGVPWIMCQ--------QFDTPDPVINTCNSF-----YCDQFTPHSPSMPKIWTENW 255
VP + P N N +++ + P + E +
Sbjct: 219 DAGFDVPLFTSDGSWLFKGGTIEGALPTANGENDIEKLKKVVNEY--NGGKGPYMVAEFY 276
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT----- 310
PGW + P +E I A++ + G S NYYM HGGTNFG T+G + T
Sbjct: 277 PGWLSHWAEPFPQVSTESIVKQTAKYLENGVSF-NYYMVHGGTNFGFTSGANYTTATNLQ 335
Query: 311 ---TSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+API E G PK+ L+ L
Sbjct: 336 SDLTSYDYDAPISEAGW-NTPKYDALRAL 363
>gi|237734327|ref|ZP_04564808.1| beta-galactosidase [Mollicutes bacterium D7]
gi|365831197|ref|ZP_09372750.1| hypothetical protein HMPREF1021_01514 [Coprobacillus sp. 3_3_56FAA]
gi|374624872|ref|ZP_09697289.1| hypothetical protein HMPREF0978_00609 [Coprobacillus sp.
8_2_54BFAA]
gi|229382557|gb|EEO32648.1| beta-galactosidase [Coprobacillus sp. D7]
gi|365262188|gb|EHM92085.1| hypothetical protein HMPREF1021_01514 [Coprobacillus sp. 3_3_56FAA]
gi|373916155|gb|EHQ47903.1| hypothetical protein HMPREF0978_00609 [Coprobacillus sp.
8_2_54BFAA]
Length = 584
Score = 150 bits (378), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 104/333 (31%), Positives = 156/333 (46%), Gaps = 41/333 (12%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
++ ING + IIS A+HY R VP W + K G NT+E+YV WN HE GKY
Sbjct: 7 NKEFFINGNKVKIISGAVHYFRIVPEYWRDTLLDLKAMGCNTVETYVPWNLHEPYQGKYD 66
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G ++ F+K+ ++ +++ILR P++ AE+ GG+P WL P R + + +
Sbjct: 67 FSGIKDIETFLKLAEELELFVILRASPYICAEWEMGGLPAWLLKYPRIRLRTNDKQYLKC 126
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+ ++ ++++ + + ++ +Q GPIILAQ+ENEYG YGE K Y L +M
Sbjct: 127 LDQYFSILLPKLSKYQI--TQNGPIILAQLENEYGS----YGE-DKEYLLAVYQMMRKYG 179
Query: 212 IGVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIW 251
I VP T +N + F H + P +
Sbjct: 180 IEVPLFTAD--GTWHEALNAGSLLEKKVFPTGNFGSQAKENITVLKKFMESHQITAPLMC 237
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG----- 306
E W GWF + R ++ S G N+YM+ GGTNFG G
Sbjct: 238 MEFWDGWFNRWNQEIIKRDPQEFVNSAQEMLSLGSV--NFYMFQGGTNFGWMNGCSARKE 295
Query: 307 ---PFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
P I TSYDY+A + EYG + K+ L+E+
Sbjct: 296 HDLPQI-TSYDYDAILTEYG-AKTEKYHLLREV 326
>gi|298481696|ref|ZP_06999887.1| beta-galactosidase (Lactase) [Bacteroides sp. D22]
gi|298272237|gb|EFI13807.1| beta-galactosidase (Lactase) [Bacteroides sp. D22]
Length = 778
Score = 150 bits (378), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 104/331 (31%), Positives = 160/331 (48%), Gaps = 30/331 (9%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
+I FSS+ A + +++G+ ++ +A +HY R W ++ K G+N
Sbjct: 14 VILFSSAQAQTTAHKFEAGKNTFLLDGKPFVVKAAELHYTRIPQAYWSHRIEMCKALGMN 73
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
TI Y+FWN HE GK+ F G+ ++ F K+ QQ MY+I+R GP+V AE+ GG+P W
Sbjct: 74 TICIYIFWNIHEQEEGKFDFSGQNDIAAFCKLAQQHGMYVIVRPGPYVCAEWEMGGLPWW 133
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESF 191
L R +P Y+M++ + ++ K+ L ++GG II+ QVENEYG Y
Sbjct: 134 LLKKKDVALRT-LDP--YYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQVENEYGSY--- 187
Query: 192 YGEGGKRYALWAAKMAVAQN--IGVPWIMCQ-----QFDTPDPVINTCN---SFYCDQ-- 239
G + + A + V ++ VP C + D +I T N DQ
Sbjct: 188 ---GTDKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQF 244
Query: 240 --FTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGG 297
P P + +E W GWF +G + RP++D+ + + S + YM HGG
Sbjct: 245 KKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGG 303
Query: 298 TNFGRTAGG-----PFITTSYDYEAPIDEYG 323
T FG G + +SYDY+API E G
Sbjct: 304 TTFGHWGGANNPAYSAMCSSYDYDAPISEAG 334
>gi|423219555|ref|ZP_17206051.1| hypothetical protein HMPREF1061_02824 [Bacteroides caccae
CL03T12C61]
gi|392624760|gb|EIY18838.1| hypothetical protein HMPREF1061_02824 [Bacteroides caccae
CL03T12C61]
Length = 774
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 156/315 (49%), Gaps = 31/315 (9%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+ D + ++G+ +I +HY R W +++A+ G+NTI YVFWN HE P
Sbjct: 29 IKIDGGTFNVDGKDVQLICGEMHYARIPHEYWRDRLKRARAMGLNTISVYVFWNFHERQP 88
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G++ F G+ ++ +F+++ Q+ +Y+ILR GP+ AE+++GG P WL V+R+
Sbjct: 89 GEFDFSGQADVAEFVRLAQEEGLYVILRPGPYACAEWDFGGYPSWLLKEKDMVYRSKDPR 148
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
F + ++++ + + L + GG I++ QVENEYG Y + K Y M
Sbjct: 149 FLEYCERYIKALGKQLA--PLTVNNGGNILMVQVENEYGSYAA-----DKEYLAALRDMI 201
Query: 208 VAQNIGVPWIMCQ-----QFDTPDPVINTCNSFYCDQ----FTPHSPSMPKIWTENWPGW 258
VP C + D + T N + + + P P E +P W
Sbjct: 202 KDAGFNVPLFTCDGGGQVEAGHIDGALPTLNGVFSEDIFKIIDKYHPGGPYFVAEFYPAW 261
Query: 259 FKTFGGR----DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNF-----GRTAGGPFI 309
F +G R D RP+E + + + +G SV + YM+HGGTNF TAGG
Sbjct: 262 FDVWGQRHSTVDYKRPAEQLDWMLG----QGVSV-SMYMFHGGTNFWYMNGANTAGGYRP 316
Query: 310 T-TSYDYEAPIDEYG 323
TSYDY+AP+ E+G
Sbjct: 317 QPTSYDYDAPLGEWG 331
>gi|153806012|ref|ZP_01958680.1| hypothetical protein BACCAC_00257 [Bacteroides caccae ATCC 43185]
gi|149130689|gb|EDM21895.1| glycosyl hydrolase family 35 [Bacteroides caccae ATCC 43185]
Length = 774
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 156/315 (49%), Gaps = 31/315 (9%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+ D + ++G+ +I +HY R W +++A+ G+NTI YVFWN HE P
Sbjct: 29 IKIDGGTFNVDGKDVQLICGEMHYARIPHEYWRDRLKRARAMGLNTISVYVFWNFHERQP 88
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G++ F G+ ++ +F+++ Q+ +Y+ILR GP+ AE+++GG P WL V+R+
Sbjct: 89 GEFDFSGQADVAEFVRLAQEEGLYVILRPGPYACAEWDFGGYPSWLLKEKDMVYRSKDPR 148
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
F + ++++ + + L + GG I++ QVENEYG Y + K Y M
Sbjct: 149 FLEYCERYIKALGKQLA--PLTVNNGGNILMVQVENEYGSYAA-----DKEYLAALRDMI 201
Query: 208 VAQNIGVPWIMCQ-----QFDTPDPVINTCNSFYCDQ----FTPHSPSMPKIWTENWPGW 258
VP C + D + T N + + + P P E +P W
Sbjct: 202 KDAGFNVPLFTCDGGGQVEAGHIDGALPTLNGVFSEDIFKIIDKYHPGGPYFVAEFYPAW 261
Query: 259 FKTFGGR----DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNF-----GRTAGGPFI 309
F +G R D RP+E + + + +G SV + YM+HGGTNF TAGG
Sbjct: 262 FDVWGQRHSTVDYKRPAEQLDWMLG----QGVSV-SMYMFHGGTNFWYMNGANTAGGYRP 316
Query: 310 T-TSYDYEAPIDEYG 323
TSYDY+AP+ E+G
Sbjct: 317 QPTSYDYDAPLGEWG 331
>gi|255015104|ref|ZP_05287230.1| beta-glycosidase [Bacteroides sp. 2_1_7]
gi|410104527|ref|ZP_11299440.1| hypothetical protein HMPREF0999_03212 [Parabacteroides sp. D25]
gi|409234336|gb|EKN27166.1| hypothetical protein HMPREF0999_03212 [Parabacteroides sp. D25]
Length = 768
Score = 149 bits (377), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 100/310 (32%), Positives = 144/310 (46%), Gaps = 26/310 (8%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
+NG+ I+S +HYPR W ++ + G+NT+ +YVFWN HE PGK+ F G
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
NL ++I+I + + +ILR GP+V AE+ +GG P WL IPG R D F + ++
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYY----ESFYGEGGKRYALWAAKMAVAQNI 212
+ + + L S+GGPII+ Q ENE+G Y + E +RY +
Sbjct: 159 DKLYEQVG--DLQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLADAGF 216
Query: 213 GVPWI------MCQQFDTPDPVINTCNSFYCDQFTP-----HSPSMPKIWTENWPGWFKT 261
VP + + TP + + H P + E +PGW
Sbjct: 217 NVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPGWLMH 276
Query: 262 FGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT--------TSY 313
+ P IA + Q S N+YM HGGTNFG T+G + TSY
Sbjct: 277 WAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHDIQPDLTSY 335
Query: 314 DYEAPIDEYG 323
DY+API E G
Sbjct: 336 DYDAPISEAG 345
>gi|150008152|ref|YP_001302895.1| beta-glycosidase [Parabacteroides distasonis ATCC 8503]
gi|149936576|gb|ABR43273.1| glycoside hydrolase family 35, candidate beta-glycosidase
[Parabacteroides distasonis ATCC 8503]
Length = 768
Score = 149 bits (377), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 100/310 (32%), Positives = 144/310 (46%), Gaps = 26/310 (8%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
+NG+ I+S +HYPR W ++ + G+NT+ +YVFWN HE PGK+ F G
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
NL ++I+I + + +ILR GP+V AE+ +GG P WL IPG R D F + ++
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYY----ESFYGEGGKRYALWAAKMAVAQNI 212
+ + + L S+GGPII+ Q ENE+G Y + E +RY +
Sbjct: 159 DKLYEQVG--DLQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLADAGF 216
Query: 213 GVPWI------MCQQFDTPDPVINTCNSFYCDQFTP-----HSPSMPKIWTENWPGWFKT 261
VP + + TP + + H P + E +PGW
Sbjct: 217 NVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPGWLMH 276
Query: 262 FGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT--------TSY 313
+ P IA + Q S N+YM HGGTNFG T+G + TSY
Sbjct: 277 WAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHDIQPDLTSY 335
Query: 314 DYEAPIDEYG 323
DY+API E G
Sbjct: 336 DYDAPISEAG 345
>gi|336404675|ref|ZP_08585368.1| hypothetical protein HMPREF0127_02681 [Bacteroides sp. 1_1_30]
gi|335941579|gb|EGN03432.1| hypothetical protein HMPREF0127_02681 [Bacteroides sp. 1_1_30]
Length = 778
Score = 149 bits (377), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 104/331 (31%), Positives = 159/331 (48%), Gaps = 30/331 (9%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
+I FSS+ A + +++G+ ++ +A +HY R W ++ K G+N
Sbjct: 14 VILFSSAQAQTTAHKFEAGKNTFLLDGKPFVVKAAELHYTRIPQAYWSHRIEMCKALGMN 73
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
TI Y+FWN HE GK+ F G+ ++ F K+ QQ MY+I+R GP+V AE+ GG+P W
Sbjct: 74 TICIYIFWNIHEQEEGKFDFSGQNDIAAFCKLAQQHGMYVIVRPGPYVCAEWEMGGLPWW 133
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESF 191
L R +P Y+M++ + ++ K+ L +GG II+ QVENEYG Y
Sbjct: 134 LLKKKDVALRT-LDP--YYMERVGIFMKEVGKQLAPLQVDKGGNIIMVQVENEYGSY--- 187
Query: 192 YGEGGKRYALWAAKMAVAQN--IGVPWIMCQ-----QFDTPDPVINTCN---SFYCDQ-- 239
G + + A + V ++ VP C + D +I T N DQ
Sbjct: 188 ---GTDKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQF 244
Query: 240 --FTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGG 297
P P + +E W GWF +G + RP++D+ + + S + YM HGG
Sbjct: 245 KKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGG 303
Query: 298 TNFGRTAGG-----PFITTSYDYEAPIDEYG 323
T FG G + +SYDY+API E G
Sbjct: 304 TTFGHWGGANNPAYSAMCSSYDYDAPISEAG 334
>gi|153807689|ref|ZP_01960357.1| hypothetical protein BACCAC_01971 [Bacteroides caccae ATCC 43185]
gi|149130051|gb|EDM21263.1| glycosyl hydrolase family 35 [Bacteroides caccae ATCC 43185]
Length = 775
Score = 149 bits (376), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 104/340 (30%), Positives = 160/340 (47%), Gaps = 24/340 (7%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
L++ F S V ++ + ING+ +I +HYPR W + +A+ G+
Sbjct: 14 LIVSFFISACSSPREQVKIENGTFNINGKDVQLICGEMHYPRIPHEYWRDRLHRARAMGL 73
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
NT+ +YVFWN HE PG + F G+ ++ +F++I Q+ +Y+ILR GP+V AE+++GG P
Sbjct: 74 NTVSAYVFWNFHERQPGVFDFSGQADIAEFVRIAQEEGLYVILRPGPYVCAEWDFGGYPS 133
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL +R+ F + ++++ + + L + GG II+ QVENEYG Y +
Sbjct: 134 WLLKEKDLTYRSKDPRFMSYCERYIKELGKQLA--PLTINNGGNIIMVQVENEYGSYAA- 190
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDP-----VINTCNSFYCDQF----TP 242
K Y M VP C + + T N + +
Sbjct: 191 ----DKEYLAAIRDMLQEAGFNVPLFTCDGGGQVEAGHIAGALPTLNGVFGEDIFKIVDK 246
Query: 243 HSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNF-- 300
+ P P E +P WF +G R E A + G SV + YM+HGGTNF
Sbjct: 247 YHPGGPYFVAEFYPAWFDEWGKRHSSVAYERPAEQLDWMLGHGVSV-SMYMFHGGTNFWY 305
Query: 301 --GRTAGGPF--ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
G G F TSYDY+AP+ E+G PK+ +E+
Sbjct: 306 MNGANTSGGFRPQPTSYDYDAPLGEWG-NCYPKYHAFREI 344
>gi|295086466|emb|CBK67989.1| Beta-galactosidase [Bacteroides xylanisolvens XB1A]
Length = 778
Score = 149 bits (376), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 104/331 (31%), Positives = 159/331 (48%), Gaps = 30/331 (9%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
+I FSS+ A + +++G+ ++ +A +HY R W ++ K G+N
Sbjct: 14 VILFSSAQAQTTAHKFEAGKNTFLLDGKPFVVKAAELHYTRIPQAYWSHRIEMCKALGMN 73
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
TI Y+FWN HE GK+ F G+ ++ F K+ QQ MY+I+R GP+V AE+ GG+P W
Sbjct: 74 TICIYIFWNIHEQEEGKFDFSGQNDIAAFCKLAQQHGMYVIVRPGPYVCAEWEMGGLPWW 133
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESF 191
L R +P Y+M++ + ++ K+ L +GG II+ QVENEYG Y
Sbjct: 134 LLKKKDVALRT-LDP--YYMERVGIFMKEVGKQLAPLQVDKGGNIIMVQVENEYGSY--- 187
Query: 192 YGEGGKRYALWAAKMAVAQN--IGVPWIMCQ-----QFDTPDPVINTCN---SFYCDQ-- 239
G + + A + V ++ VP C + D +I T N DQ
Sbjct: 188 ---GTDKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQF 244
Query: 240 --FTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGG 297
P P + +E W GWF +G + RP++D+ + + S + YM HGG
Sbjct: 245 KKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGG 303
Query: 298 TNFGRTAGG-----PFITTSYDYEAPIDEYG 323
T FG G + +SYDY+API E G
Sbjct: 304 TTFGHWGGANNPAYSAMCSSYDYDAPISEAG 334
>gi|163848976|ref|YP_001637020.1| beta-galactosidase [Chloroflexus aurantiacus J-10-fl]
gi|163670265|gb|ABY36631.1| Beta-galactosidase [Chloroflexus aurantiacus J-10-fl]
Length = 897
Score = 149 bits (376), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 97/298 (32%), Positives = 154/298 (51%), Gaps = 17/298 (5%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
++G+ ++S +HY R W L++QA+ G+NTI++ + WN HE PG++ F
Sbjct: 14 LDGKPFYLLSGCVHYFRWPRAEWRPLLEQARWAGLNTIDTVIPWNRHEPQPGEFDFSEEA 73
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQK-F 155
+L F+ + + + I+R GP++ AE+ GG+P WL R+D F+ + + F
Sbjct: 74 DLGAFLDLCHELGLKAIVRPGPYICAEWENGGLPAWLTASGDMRLRSDDPAFRDAVLRWF 133
Query: 156 MTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVP 215
TL+ ++ R+ GGPIIL Q+ENE+ + YG + L A+ A+ + I VP
Sbjct: 134 DTLMPILVPRQY---PHGGPIILCQIENEH-WASGVYGADTHQQTL--AQAALERGIVVP 187
Query: 216 WIMCQQFDTPDPVINTCNSFYCDQFTPHS---PSMPKIWTENWPGWFKTFGG-RDPHRPS 271
C P S ++ P P I +E W GWF +GG R + +
Sbjct: 188 QYTCVGAMPGYPEFRNGWSGIAEKLVQTRQLWPDNPLIVSELWSGWFDNWGGHRQTRKTA 247
Query: 272 EDIAFSVARFFQKGGSVHNYYMYHGGTNF----GRTAGGPFI--TTSYDYEAPIDEYG 323
+ ++ + G + +++M+ GGTNF GRT GG I TTSYDY+AP+DEYG
Sbjct: 248 AKLDMTLHQLTAVGCAGFSHWMWAGGTNFGFWGGRTVGGDLIHMTTSYDYDAPVDEYG 305
>gi|251798103|ref|YP_003012834.1| beta-galactosidase [Paenibacillus sp. JDR-2]
gi|247545729|gb|ACT02748.1| Beta-galactosidase [Paenibacillus sp. JDR-2]
Length = 919
Score = 149 bits (376), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 104/337 (30%), Positives = 166/337 (49%), Gaps = 23/337 (6%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V Y++ S ING + + SAAIHY R W ++ +AK G+N +++Y WN HE
Sbjct: 18 VQYNAFSYNINGEQVFLNSAAIHYFRMPKEEWREVLVKAKLAGMNCVDTYFAWNVHEPEE 77
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G++ F G + F+ + + +++I R GPF+ AE+++GG P WL+ FR
Sbjct: 78 GEWNFEGDNDCGAFLDLCHELGLWVIARPGPFICAEWDFGGFPYWLNTKKDMKFRAFDMQ 137
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
+ ++ ++M I+ +++ ++ A GG +IL QVENEYGY S E + Y L +
Sbjct: 138 YLTYVDRYMDRIIPIIRDREINA--GGSVILVQVENEYGYLAS--DEVARDYMLHLRDVM 193
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCN-----SFYCDQFTPHSPSMPKIWTENWPGWFKTF 262
+ + + VP I C + + N + + P PKI TE W GWF+ +
Sbjct: 194 LDRGVMVPLITC--VGGAEGTVEGANFWSGADHHYNNLVQKQPDTPKIVTEFWTGWFEHW 251
Query: 263 GGRDPHRPSEDIAFSVARFFQK---GGSVHNYYM----YHGGTNFGRTAGGP--FITTSY 313
G P + A R + G + ++YM + G GRT G F+ TSY
Sbjct: 252 GA--PAATQKTAALYEKRMLESLRAGFTGVSHYMFFGGTNFGGYGGRTVGASDIFMVTSY 309
Query: 314 DYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNG 350
DY+AP+ EYG + K+ K + ++ E LLN
Sbjct: 310 DYDAPLSEYGRVTD-KYNTAKRMSYFVQATESVLLNA 345
>gi|222526932|ref|YP_002571403.1| beta-galactosidase [Chloroflexus sp. Y-400-fl]
gi|222450811|gb|ACM55077.1| Beta-galactosidase [Chloroflexus sp. Y-400-fl]
Length = 917
Score = 149 bits (375), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 97/298 (32%), Positives = 154/298 (51%), Gaps = 17/298 (5%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
++G+ ++S +HY R W L++QA+ G+NTI++ + WN HE PG++ F
Sbjct: 34 LDGKPFYLLSGCVHYFRWPRAEWRPLLEQARWAGLNTIDTVIPWNRHEPQPGEFDFSEEA 93
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQK-F 155
+L F+ + + + I+R GP++ AE+ GG+P WL R+D F+ + + F
Sbjct: 94 DLGAFLDLCHELGLKAIVRPGPYICAEWENGGLPAWLTASGDMRLRSDDPAFRDAVLRWF 153
Query: 156 MTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVP 215
TL+ ++ R+ GGPIIL Q+ENE+ + YG + L A+ A+ + I VP
Sbjct: 154 DTLMPILVPRQY---PHGGPIILCQIENEH-WASGVYGADTHQQTL--AQAALERGIVVP 207
Query: 216 WIMCQQFDTPDPVINTCNSFYCDQFTPHS---PSMPKIWTENWPGWFKTFGG-RDPHRPS 271
C P S ++ P P I +E W GWF +GG R + +
Sbjct: 208 QYTCVGAMPGYPEFRNGWSGIAEKLVQTRQLWPDNPLIVSELWSGWFDNWGGHRQTRKTA 267
Query: 272 EDIAFSVARFFQKGGSVHNYYMYHGGTNF----GRTAGGPFI--TTSYDYEAPIDEYG 323
+ ++ + G + +++M+ GGTNF GRT GG I TTSYDY+AP+DEYG
Sbjct: 268 AKLDMTLHQLTAVGCAGFSHWMWAGGTNFGFWGGRTVGGDLIHMTTSYDYDAPVDEYG 325
>gi|424665378|ref|ZP_18102414.1| hypothetical protein HMPREF1205_01253 [Bacteroides fragilis HMW
616]
gi|404574622|gb|EKA79370.1| hypothetical protein HMPREF1205_01253 [Bacteroides fragilis HMW
616]
Length = 624
Score = 149 bits (375), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 106/312 (33%), Positives = 149/312 (47%), Gaps = 34/312 (10%)
Query: 39 GRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNL 98
G I+S +HY R W +Q K G+NT+ +YVFWN HE+ PGK+ F G NL
Sbjct: 35 GETTPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94
Query: 99 VKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTL 158
++I+I + M +ILR GP+V AE+ +GG P WL IPG R D F + +K+
Sbjct: 95 AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNTEFLKYTKKY--- 151
Query: 159 IVDMMKRE--KLFASQGGPIILAQVENEYGYYES----FYGEGGKRYALWAAKMAVAQNI 212
+D + +E L ++GGPII+ Q ENE+G Y S E + Y
Sbjct: 152 -IDRLYQEVGPLQCTKGGPIIMVQCENEFGSYVSQRKDISFEEHRSYNAKIKGQLADAGF 210
Query: 213 GVP-------WI-----MCQQFDTPDPVINTCN-SFYCDQFTPHSPSMPKIWTENWPGWF 259
VP W+ + T + + N +Q+ H P + E +PGW
Sbjct: 211 TVPLFTSDGSWLFEGGCVAGALPTANGESDIANLKKVVNQY--HGGKGPYMVAEFYPGWL 268
Query: 260 KTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT--------T 311
+G P + +IA + Q S N+YM HGGTNFG T+G + T
Sbjct: 269 SHWGEPFPQVSASEIARQTEAYLQNNVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDLT 327
Query: 312 SYDYEAPIDEYG 323
SYDY+API E G
Sbjct: 328 SYDYDAPISEAG 339
>gi|386839582|ref|YP_006244640.1| beta-galactosidase [Streptomyces hygroscopicus subsp. jinggangensis
5008]
gi|374099883|gb|AEY88767.1| putative beta-galactosidase [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|451792876|gb|AGF62925.1| putative beta-galactosidase [Streptomyces hygroscopicus subsp.
jinggangensis TL01]
Length = 585
Score = 149 bits (375), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 105/325 (32%), Positives = 154/325 (47%), Gaps = 42/325 (12%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+++GR ++S A+HY R W + + G+N +E+YV WN HE PG +
Sbjct: 10 GFLLDGRPVRLLSGALHYFRVHEDQWGHRLAMLRAMGLNCVETYVPWNLHEPRPGVFRDV 69
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G +F+ ++ A ++ I+R GP++ AE+ GG+PVWL PGT R E + H++
Sbjct: 70 GAVG--RFLDAVRGAGLWAIVRPGPYICAEWENGGLPVWLTGEPGTRARTRDERYLRHVR 127
Query: 154 K-FMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
F L+ +++ R+ +GGP+++ QVENEYG Y S G V +
Sbjct: 128 NWFQRLLPEIVPRQ---IDRGGPVVMVQVENEYGSYGSDTGH-------LEELAGVLRAE 177
Query: 213 GVPWIMCQQFDTPDP----------VINTCN-----SFYCDQFTPHSPSMPKIWTENWPG 257
GV +C D P+ V+ T N + H P P + E W G
Sbjct: 178 GVTAALCTS-DGPEDHMLTGGSLPGVLATVNFGSHARVAFETLRRHRPGGPLMCMEFWCG 236
Query: 258 WFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG----------GP 307
WF + G R + A ++ + G SV N YM HGGT+FG AG GP
Sbjct: 237 WFDHWSGEHAVRDPAEAAEALREILECGASV-NLYMAHGGTSFGGWAGANRGGGELHEGP 295
Query: 308 F--ITTSYDYEAPIDEYGLPRNPKW 330
TSYDY+AP+DEYG P W
Sbjct: 296 LEPDVTSYDYDAPVDEYGRPTEKFW 320
>gi|290956543|ref|YP_003487725.1| glycosyl hydrolase family 42 [Streptomyces scabiei 87.22]
gi|260646069|emb|CBG69162.1| putative glycosyl hydrolase (family 42) [Streptomyces scabiei
87.22]
Length = 591
Score = 149 bits (375), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 100/329 (30%), Positives = 155/329 (47%), Gaps = 29/329 (8%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+T S ++NG I+S A+HY R P +W +++A+ G+NT+E+YV WN H+ P
Sbjct: 6 LTTSSDGFLLNGEPFRIVSGAMHYFRIHPDLWADRLRKARLMGLNTVETYVPWNLHQPDP 65
Query: 88 GK-YYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G +L +++ + + ++++LR GP++ AE++ GG+P WL PG R+
Sbjct: 66 DSPLVLDGLLDLPRYLSLARAEGLHVLLRPGPYICAEWDGGGLPSWLTSDPGIRLRSSDP 125
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
F + + L + + A+ GGP+I QVENEYG Y Y +
Sbjct: 126 RFTDALDGY--LDILLPPLLPYMAANGGPVIAVQVENEYGAYGD-----DTAYLKHVHQA 178
Query: 207 AVAQNIGVPWIMCQQFDTPD-------PVINTCNSF------YCDQFTPHSPSMPKIWTE 253
A+ + C Q + P + + +F H P P + +E
Sbjct: 179 LRARGVEELLFTCDQAGSGHHLAAGSLPGVLSTATFGGKIEESLAALRAHMPEGPLMCSE 238
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF----- 308
W GWF +G R +E A + + G SV N YM+HGGTNFG T G
Sbjct: 239 FWIGWFDHWGEEHHVRDAESAAADLDKLLAAGASV-NIYMFHGGTNFGFTNGANHDQCYA 297
Query: 309 -ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
I TSYDY+A + E G P PK+ +E+
Sbjct: 298 PIVTSYDYDAALTESGDP-GPKYHAFREV 325
>gi|449664450|ref|XP_002165261.2| PREDICTED: beta-galactosidase-like [Hydra magnipapillata]
Length = 589
Score = 149 bits (375), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 102/338 (30%), Positives = 163/338 (48%), Gaps = 29/338 (8%)
Query: 11 ALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGG 70
LLI F+ + + Y++ + +G IS +IHY R W + + ++ G
Sbjct: 8 CLLIVFAKISSSERTFKIDYENNKFLKDGTEFRYISGSIHYMRVPEDYWEDRLSKIRKAG 67
Query: 71 VNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIP 130
+N I++Y+ WN HE + G + FGG+ N+ KF+K+ Q+ + +ILR GP++ AE+ +GG P
Sbjct: 68 LNAIQTYIPWNFHEPTEGNFQFGGQQNVFKFLKLAQKYDLLVILRPGPYICAEWEFGGFP 127
Query: 131 VWLHYIPGT---VFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGY 187
WL G R + ++ +M++++ + R L+ + GGPII QVENEYG
Sbjct: 128 YWLLKKVGNKTMQLRTSDNLYLQKVENYMSVLLSGL-RPYLYEN-GGPIITVQVENEYGS 185
Query: 188 Y----ESFYGEGG--KRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCN-------S 234
Y E Y ++Y + G ++ C T P+ T +
Sbjct: 186 YGCDHEYMYKLESIFRKYLGENVILFTTDGAGDSYLKC---GTIKPLFATVDFGPTAEPK 242
Query: 235 FYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMY 294
Y D + P P + +E + GW +GG+ H ED+ ++ + SV N YM+
Sbjct: 243 LYFDIQRKYQPLGPLVNSEFYTGWLDHWGGQHAHTSLEDVTDTLDKMLSLNASV-NMYMF 301
Query: 295 HGGTNFGRTAGGPFIT-------TSYDYEAPIDEYGLP 325
GGTNFG G + TSYDY+AP+ E G P
Sbjct: 302 EGGTNFGFMNGANQDSNSLQPQPTSYDYDAPLSEAGDP 339
>gi|408677368|ref|YP_006877195.1| Beta-galactosidase, partial [Streptomyces venezuelae ATCC 10712]
gi|328881697|emb|CCA54936.1| Beta-galactosidase, partial [Streptomyces venezuelae ATCC 10712]
Length = 611
Score = 149 bits (375), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 108/322 (33%), Positives = 155/322 (48%), Gaps = 37/322 (11%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+++GR ++S A+HY R W + + G+N +E+YV WN HE PG+Y
Sbjct: 10 DFLLDGRPVRLLSGALHYFRVREEQWEHRLGMLRAMGLNCVETYVPWNLHEPEPGRY--A 67
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
L +F+ + +A M+ I+R GP++ AE+ GG+P WL G R+ F ++
Sbjct: 68 DVAALGRFLDAVARAGMWAIVRPGPYICAEWENGGLPHWLTGPLGRRVRSFDPEFLAPVE 127
Query: 154 K-FMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
F L+ +++R+ +GGP++L QVENEYG Y S + Y W A++ +
Sbjct: 128 AWFRRLLPQVVERQ---IDRGGPVVLVQVENEYGSYGS-----DRAYLEWLAELLRGCGV 179
Query: 213 GVPWI--------MCQQFDTPDPVINTCN--SFYCDQFTP---HSPSMPKIWTENWPGWF 259
VP M P V+ T N S + F H PS P + E W GWF
Sbjct: 180 AVPLFTSDGPEDHMLTGGSVPG-VLATANFGSGAREGFATLRRHQPSGPLMCMEFWCGWF 238
Query: 260 KTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG---------GPF-- 308
+G R + D A ++ + G SV N YM HGGTNFG AG GP
Sbjct: 239 DHWGTEHAVRDAADAAEALREILECGASV-NVYMAHGGTNFGGFAGANRAGELHDGPLRA 297
Query: 309 ITTSYDYEAPIDEYGLPRNPKW 330
TSYDY+AP+DE G P W
Sbjct: 298 TVTSYDYDAPVDEAGRPTEKFW 319
>gi|319934802|ref|ZP_08009247.1| beta-galactosidase [Coprobacillus sp. 29_1]
gi|319810179|gb|EFW06541.1| beta-galactosidase [Coprobacillus sp. 29_1]
Length = 589
Score = 149 bits (375), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 100/315 (31%), Positives = 152/315 (48%), Gaps = 34/315 (10%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
I++G+ I+S AIHY R VP W + K G NT+E+Y+ WN HE G++ F
Sbjct: 9 EFIVDGKPIKILSGAIHYFRIVPKHWEDSLYNLKALGFNTVETYIPWNLHEPKEGEFDFQ 68
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G ++V FIK Q+ + +I+R P++ AE+ +GG+P WL R+D + ++
Sbjct: 69 GIKDVVSFIKKAQEMELMVIVRPSPYICAEWEFGGLPAWLLTYDNLHLRSDCPRYLEKVK 128
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIG 213
+ +++ M+ L ++QGGPII+ QVENE+G + + K Y K+ + +
Sbjct: 129 NYYEVLLPMLT--SLQSTQGGPIIMMQVENEFGSFSN-----NKTYLKKLKKIMLDLGVE 181
Query: 214 VPWIMC-----QQFDT----PDPVINTC--------NSFYCDQFTP-HSPSMPKIWTENW 255
VP Q ++ D V+ T N +QF H P + E W
Sbjct: 182 VPLFTSDGSWQQALESGSLIDDDVLVTANFGSHSHENLDVLEQFMANHQKKWPLMSMEFW 241
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF------- 308
GWF +G R ++D+A V +G N YM+HGGTNFG G
Sbjct: 242 DGWFNRWGEEIITRDAQDLANCVKELLTRGSI--NLYMFHGGTNFGFMNGCSARGQKDLP 299
Query: 309 ITTSYDYEAPIDEYG 323
TSYDY+A + E G
Sbjct: 300 QVTSYDYDALLTEAG 314
>gi|53715536|ref|YP_101528.1| beta-galactosidase [Bacteroides fragilis YCH46]
gi|60683489|ref|YP_213633.1| beta-galactosidase [Bacteroides fragilis NCTC 9343]
gi|375360299|ref|YP_005113071.1| putative beta-galactosidase [Bacteroides fragilis 638R]
gi|423280737|ref|ZP_17259649.1| hypothetical protein HMPREF1203_03866 [Bacteroides fragilis HMW
610]
gi|52218401|dbj|BAD50994.1| beta-galactosidase precursor [Bacteroides fragilis YCH46]
gi|60494923|emb|CAH09735.1| putative beta-galactosidase [Bacteroides fragilis NCTC 9343]
gi|301164980|emb|CBW24544.1| putative beta-galactosidase [Bacteroides fragilis 638R]
gi|404583944|gb|EKA88617.1| hypothetical protein HMPREF1203_03866 [Bacteroides fragilis HMW
610]
Length = 624
Score = 148 bits (374), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 106/312 (33%), Positives = 149/312 (47%), Gaps = 34/312 (10%)
Query: 39 GRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNL 98
G I+S +HY R W +Q K G+NT+ +YVFWN HE+ PGK+ F G NL
Sbjct: 35 GETTPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94
Query: 99 VKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTL 158
++I+I + M +ILR GP+V AE+ +GG P WL IPG R D F + +K+
Sbjct: 95 AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNTEFLKYTKKY--- 151
Query: 159 IVDMMKRE--KLFASQGGPIILAQVENEYGYYES----FYGEGGKRYALWAAKMAVAQNI 212
+D + +E L ++GGPII+ Q ENE+G Y S E + Y
Sbjct: 152 -IDRLYQEVGPLQCTKGGPIIMVQCENEFGSYVSQRKDISFEEHRSYNAKIKGQLADAGF 210
Query: 213 GVP-------WI-----MCQQFDTPDPVINTCN-SFYCDQFTPHSPSMPKIWTENWPGWF 259
VP W+ + T + + N +Q+ H P + E +PGW
Sbjct: 211 TVPLFTSDGSWLFEGGCVAGALPTANGESDIANLKKVVNQY--HGGKGPYMVAEFYPGWL 268
Query: 260 KTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT--------T 311
+G P + +IA + Q S N+YM HGGTNFG T+G + T
Sbjct: 269 SHWGEPFPQVSASEIARQTEAYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDLT 327
Query: 312 SYDYEAPIDEYG 323
SYDY+API E G
Sbjct: 328 SYDYDAPISEAG 339
>gi|423260402|ref|ZP_17241324.1| hypothetical protein HMPREF1055_03601 [Bacteroides fragilis
CL07T00C01]
gi|423266536|ref|ZP_17245538.1| hypothetical protein HMPREF1056_03225 [Bacteroides fragilis
CL07T12C05]
gi|387774956|gb|EIK37065.1| hypothetical protein HMPREF1055_03601 [Bacteroides fragilis
CL07T00C01]
gi|392699768|gb|EIY92937.1| hypothetical protein HMPREF1056_03225 [Bacteroides fragilis
CL07T12C05]
Length = 624
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 106/312 (33%), Positives = 149/312 (47%), Gaps = 34/312 (10%)
Query: 39 GRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNL 98
G I+S +HY R W +Q K G+NT+ +YVFWN HE+ PGK+ F G NL
Sbjct: 35 GETTPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94
Query: 99 VKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTL 158
++I+I + M +ILR GP+V AE+ +GG P WL IPG R D F + +K+
Sbjct: 95 AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNTEFLKYTKKY--- 151
Query: 159 IVDMMKRE--KLFASQGGPIILAQVENEYGYYES----FYGEGGKRYALWAAKMAVAQNI 212
+D + +E L ++GGPII+ Q ENE+G Y S E + Y
Sbjct: 152 -IDRLYQEVGPLQCTKGGPIIMVQCENEFGSYVSQRKDISFEEHRSYNAKIKGQLADAGF 210
Query: 213 GVP-------WI-----MCQQFDTPDPVINTCN-SFYCDQFTPHSPSMPKIWTENWPGWF 259
VP W+ + T + + N +Q+ H P + E +PGW
Sbjct: 211 TVPLFTSDGSWLFEGGCVAGALPTANGESDIANLKKVVNQY--HGGKGPYMVAEFYPGWL 268
Query: 260 KTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT--------T 311
+G P + +IA + Q S N+YM HGGTNFG T+G + T
Sbjct: 269 SHWGEPFPQVSASEIARQTEAYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDLT 327
Query: 312 SYDYEAPIDEYG 323
SYDY+API E G
Sbjct: 328 SYDYDAPISEAG 339
>gi|227518994|ref|ZP_03949043.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|227553614|ref|ZP_03983663.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|293383402|ref|ZP_06629315.1| beta-galactosidase [Enterococcus faecalis R712]
gi|293388945|ref|ZP_06633430.1| beta-galactosidase [Enterococcus faecalis S613]
gi|312907770|ref|ZP_07766761.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|312910388|ref|ZP_07769235.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|422714384|ref|ZP_16771110.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|422715641|ref|ZP_16772357.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|424676529|ref|ZP_18113400.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|424681657|ref|ZP_18118444.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|424683847|ref|ZP_18120597.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|424686250|ref|ZP_18122918.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|424690479|ref|ZP_18127014.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|424695572|ref|ZP_18131955.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|424696689|ref|ZP_18133030.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|424699924|ref|ZP_18136135.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|424703062|ref|ZP_18139196.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|424707441|ref|ZP_18143425.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|424716899|ref|ZP_18146197.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|424720477|ref|ZP_18149578.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|424724025|ref|ZP_18152974.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|424733616|ref|ZP_18162171.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|424744084|ref|ZP_18172389.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|424750408|ref|ZP_18178472.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
gi|227073566|gb|EEI11529.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|227177262|gb|EEI58234.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|291079193|gb|EFE16557.1| beta-galactosidase [Enterococcus faecalis R712]
gi|291081726|gb|EFE18689.1| beta-galactosidase [Enterococcus faecalis S613]
gi|310626798|gb|EFQ10081.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|311289661|gb|EFQ68217.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|315575986|gb|EFU88177.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|315580706|gb|EFU92897.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|402350756|gb|EJU85654.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|402356541|gb|EJU91272.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|402364212|gb|EJU98655.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|402364322|gb|EJU98764.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|402367784|gb|EJV02121.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|402368267|gb|EJV02587.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|402375423|gb|EJV09410.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|402377018|gb|EJV10929.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|402385039|gb|EJV18580.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|402385067|gb|EJV18607.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|402386247|gb|EJV19753.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|402391229|gb|EJV24540.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|402392948|gb|EJV26178.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|402396006|gb|EJV29081.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|402399507|gb|EJV32379.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|402406707|gb|EJV39253.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
Length = 604
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 109/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 18 EEFLLNGQSFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 77
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 78 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 136
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG + GE K Y + +A+ +
Sbjct: 137 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYGSF----GEE-KAYLRAIRDLMIARGV 189
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 190 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 246
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 247 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 300
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 301 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|256424388|ref|YP_003125041.1| beta-galactosidase [Chitinophaga pinensis DSM 2588]
gi|256039296|gb|ACU62840.1| Beta-galactosidase [Chitinophaga pinensis DSM 2588]
Length = 586
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 104/327 (31%), Positives = 159/327 (48%), Gaps = 30/327 (9%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
+ +++ + IIS +H R W +Q AK G NTI +YVFWN HE GK+ F
Sbjct: 17 KDFLLDSKPYQIISGEMHPARIPKEYWRHRIQMAKAMGCNTIAAYVFWNYHEQEEGKFDF 76
Query: 93 GGR-FNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
++V FIK++Q+ M+++LR GP+V AE+ +GG+P +L IP R +
Sbjct: 77 TSENRDIVAFIKMVQEEGMWVMLRPGPYVCAEWEFGGLPPYLLRIPDIKVRCMDPRYIAA 136
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
++++ + + +K L + GGPI++ QVENEYG + + + Y L M V
Sbjct: 137 TERYIKALSEEVK--PLQITNGGPIVMVQVENEYGSFGN-----DREYMLKVKDMWVQNG 189
Query: 212 IGVPW--------IMCQQFDTPDPVINTCNSFYCDQFTP---HSPSMPKIWTENWPGWFK 260
I VP+ + + P I + F +P +P +E++PGW
Sbjct: 190 INVPFYTADGPVSALLEAGSVPGAAIGLDSGSSEGDFAAAEKQNPDVPSFSSESYPGWL- 248
Query: 261 TFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--------PFITTS 312
T G RP + +F N Y+ HGGTNFG TAG P + TS
Sbjct: 249 THWGEKWARPDKAGIVKEVKFLMDTKRSFNLYVIHGGTNFGFTAGANSGGKGYEPDL-TS 307
Query: 313 YDYEAPIDEYGLPRNPKWGHLKELHGA 339
YDY+API+E G K+ L++L G+
Sbjct: 308 YDYDAPINEQG-DTTAKYNALRDLIGS 333
>gi|29376349|ref|NP_815503.1| glycosyl hydrolase [Enterococcus faecalis V583]
gi|256961697|ref|ZP_05565868.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|257419527|ref|ZP_05596521.1| beta-galactosidase [Enterococcus faecalis T11]
gi|29343812|gb|AAO81573.1| glycosyl hydrolase, family 35 [Enterococcus faecalis V583]
gi|256952193|gb|EEU68825.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|257161355|gb|EEU91315.1| beta-galactosidase [Enterococcus faecalis T11]
Length = 594
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 109/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 8 EEFLLNGQSFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG + GE K Y + +A+ +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYGSF----GEE-KAYLRAIRDLMIARGV 179
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 290
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 291 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|225872227|ref|YP_002753682.1| glycosyl hydrolase [Acidobacterium capsulatum ATCC 51196]
gi|225791474|gb|ACO31564.1| glycosyl hydrolase, family 35 [Acidobacterium capsulatum ATCC
51196]
Length = 664
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 113/340 (33%), Positives = 163/340 (47%), Gaps = 27/340 (7%)
Query: 6 PIAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQ 65
P++ A +SS G+ ++ +++G+ IIS +HY R W +Q
Sbjct: 8 PVSVMAAARRGNSSALSDQRGSFRVENGKFVLDGQPFQIISGEMHYERIPRAYWKARLQM 67
Query: 66 AKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYN 125
AK G+NTI +YVFWN HE PGK+ F G +L +FI+ QQ + ++LR GP+ AE+
Sbjct: 68 AKAMGLNTIATYVFWNLHEPEPGKFDFSGNADLAQFIRDAQQTGLKVLLRAGPYSCAEWE 127
Query: 126 YGGIPVWLHYIPG--TVFR-NDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVE 182
+GG P WL P T R ND E K Q + L ++ L GGPII Q+E
Sbjct: 128 FGGFPAWLMKNPKMQTALRSNDPEFMKPAEQWILRLGREV---APLQVGYGGPIIGVQIE 184
Query: 183 NEYGYY--ESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCN------S 234
NEYG + ++ Y E K+ L A P + P V + N +
Sbjct: 185 NEYGDFGGDAAYLEHLKKIFLKAGFTQSLLYTANPSRALVRGSIPG-VYSAVNFAPGHAA 243
Query: 235 FYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARF--FQKGGSVHNYY 292
D P + +E W GWF +G +PH+ S+ ++ V F + G+ N Y
Sbjct: 244 QALDSLAQLRAGQPLLSSEYWTGWFDHWG--EPHQ-SKPLSLQVKDFNYILRHGAGVNLY 300
Query: 293 MYHGGTNFGRTAGGPFI-------TTSYDYEAPIDEYGLP 325
M+HGGT+FG +G + TSYDY AP+DE G P
Sbjct: 301 MFHGGTSFGMMSGSSWTKHQFLPDVTSYDYGAPLDEAGHP 340
>gi|423217397|ref|ZP_17203893.1| hypothetical protein HMPREF1061_00666 [Bacteroides caccae
CL03T12C61]
gi|392628556|gb|EIY22582.1| hypothetical protein HMPREF1061_00666 [Bacteroides caccae
CL03T12C61]
Length = 775
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 104/340 (30%), Positives = 159/340 (46%), Gaps = 24/340 (7%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
L++ F S V ++ + ING+ +I +HYPR W + +A G+
Sbjct: 14 LIVSFFISACSSPREQVKIENGTFNINGKDVQLICGEMHYPRIPHEYWRDRLHRAHAMGL 73
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
NT+ +YVFWN HE PG + F G+ ++ +F++I Q+ +Y+ILR GP+V AE+++GG P
Sbjct: 74 NTVSAYVFWNFHERQPGVFDFSGQADIAEFVRIAQEEGLYVILRPGPYVCAEWDFGGYPS 133
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL +R+ F + ++++ + + L + GG II+ QVENEYG Y +
Sbjct: 134 WLLKEKDLTYRSKDPRFMSYCERYIKELGKQLA--PLTINNGGNIIMVQVENEYGSYAA- 190
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDP-----VINTCNSFYCDQF----TP 242
K Y M VP C + + T N + +
Sbjct: 191 ----DKEYLAAIRDMLQEAGFNVPLFTCDGGGQVEAGHIAGALPTLNGVFGEDIFKIVDK 246
Query: 243 HSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNF-- 300
+ P P E +P WF +G R E A + G SV + YM+HGGTNF
Sbjct: 247 YHPGGPYFVAEFYPAWFDEWGKRHSSVAYERPAEQLDWMLGHGVSV-SMYMFHGGTNFWY 305
Query: 301 --GRTAGGPF--ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
G G F TSYDY+AP+ E+G PK+ +E+
Sbjct: 306 MNGANTSGGFRPQPTSYDYDAPLGEWG-NCYPKYHAFREI 344
>gi|380693434|ref|ZP_09858293.1| beta-galactosidase [Bacteroides faecis MAJ27]
Length = 778
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 102/331 (30%), Positives = 157/331 (47%), Gaps = 30/331 (9%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
+IFFSS+ A + +++G+ ++ +A +HY R W ++ K G+N
Sbjct: 14 VIFFSSAEAQTTARKFEAGKNTFLLDGKPFVVKAAELHYTRIPQAYWDHRIEMCKALGMN 73
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
TI Y+FWN HE GK+ F G+ ++ F + Q+ MY+I+R GP+V AE+ GG+P W
Sbjct: 74 TICIYIFWNIHEQEEGKFDFTGQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWW 133
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESF 191
L R +P Y+M++ + ++ K+ L ++GG II+ QVENEYG Y
Sbjct: 134 LLKKKDVALRT-LDP--YYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQVENEYGSY--- 187
Query: 192 YGEGGKRYALWAAKMAVAQN--IGVPWIMCQ--------QFDTPDPVINTCNSFYCDQ-- 239
G + + A + V ++ VP C D IN DQ
Sbjct: 188 ---GTDKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTRNALDDLIWTINFGTGANIDQQF 244
Query: 240 --FTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGG 297
P P + +E W GWF +G + RP++D+ + + S + YM HGG
Sbjct: 245 KKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKEMLDRNISF-SLYMTHGG 303
Query: 298 TNFGRTAGG-----PFITTSYDYEAPIDEYG 323
T FG G + +SYDY+API E G
Sbjct: 304 TTFGHWGGANNPAYSAMCSSYDYDAPISEAG 334
>gi|320162379|ref|YP_004175604.1| beta-galactosidase [Anaerolinea thermophila UNI-1]
gi|319996233|dbj|BAJ65004.1| beta-galactosidase [Anaerolinea thermophila UNI-1]
Length = 583
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 102/330 (30%), Positives = 165/330 (50%), Gaps = 25/330 (7%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+T + ++G I++ A+HY R P W + + K G+NT+E+YV WN HE
Sbjct: 3 TLTIEGDHFELDGEPFRILAGAMHYFRVHPAYWKDRLLKLKAMGLNTVETYVAWNLHEPH 62
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+++FG N+ ++I++ + +Y+I+R GP++ AE+ GG+P WL P R +
Sbjct: 63 EGEFHFGDWLNIERYIELAGELGLYVIVRPGPYICAEWEMGGLPAWLLKDPQMKLRCMYQ 122
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG----------YYESFYGEGG 196
P+ + ++ + + M + L +++GGPII QVENEYG Y E + G
Sbjct: 123 PYLDAVGEYFSQL--MHRLVPLQSTRGGPIIAMQVENEYGSYGNDTRYLKYLEELLRQCG 180
Query: 197 KRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWP 256
L+ A VA + + F + ++F ++ + P + E W
Sbjct: 181 VDVLLFTAD-GVADEMMQYGSLPHLFKAVNFGNRPGDAF--EKLREYQTGGPLLVAEFWD 237
Query: 257 GWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-----PFIT- 310
GWF +G R R + ++A + +G SV N YM+HGGTNFG G P T
Sbjct: 238 GWFDHWGERHHTRSAGEVARVLDDLLSEGASV-NLYMFHGGTNFGFMNGANAFPSPHYTP 296
Query: 311 --TSYDYEAPIDEYGLPRNPKWGHLKELHG 338
TSYDY+AP+ E G PK+ ++E+ G
Sbjct: 297 TVTSYDYDAPLSECG-NITPKYEAMREVIG 325
>gi|443697452|gb|ELT97928.1| hypothetical protein CAPTEDRAFT_112460 [Capitella teleta]
Length = 651
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 97/306 (31%), Positives = 151/306 (49%), Gaps = 25/306 (8%)
Query: 35 LIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGG 94
++ + I+S A+HY R VP W + + K G+NT+E+YV WN HE G++ F G
Sbjct: 63 FFLDNKELRILSGAMHYFRIVPEYWLDRLTRMKAAGLNTVETYVPWNLHEEIHGEFVFTG 122
Query: 95 RFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQK 154
++ +F+ I ++ + +ILR GPF+ +E+ +GG+P WL P R+ PF +
Sbjct: 123 MLDIRRFVAIAEKVGLLVILRPGPFICSEWEFGGLPSWLLRDPQMDVRSTYRPFMDAARS 182
Query: 155 FMTLIVDMMKREKLFASQGGPIILAQVENEYGYY----------ESFYGEGGKRYALWAA 204
+M ++ + E + GGPII Q+ENEYG Y ++ + G L+ +
Sbjct: 183 YMRSLISEL--EDMQYQYGGPIIAMQIENEYGSYSDDVNYMQELKNIMTDSGVIEILFTS 240
Query: 205 KMAVAQNIG-VPWI-MCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTF 262
G VP + M F N + D+ P P + E W GWF +
Sbjct: 241 DNKHGLQPGRVPGVFMTTNFKN----TNEGGRMF-DKLHELQPGKPLMVMEFWSGWFDHW 295
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG---PFI--TTSYDYEA 317
+ E+ A +V Q+G S+ N YM+HGGTNFG G P++ TSYDY++
Sbjct: 296 EEKHHTMSLEEYASAVEYILQQGSSI-NLYMFHGGTNFGFLNGANTEPYLPTVTSYDYDS 354
Query: 318 PIDEYG 323
P+ E G
Sbjct: 355 PLSEAG 360
>gi|29349062|ref|NP_812565.1| beta-galactosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|383124327|ref|ZP_09944991.1| hypothetical protein BSIG_3645 [Bacteroides sp. 1_1_6]
gi|29340969|gb|AAO78759.1| beta-galactosidase precursor [Bacteroides thetaiotaomicron
VPI-5482]
gi|251839176|gb|EES67260.1| hypothetical protein BSIG_3645 [Bacteroides sp. 1_1_6]
Length = 778
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 102/331 (30%), Positives = 157/331 (47%), Gaps = 30/331 (9%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
+IFFSS+ A + +++G+ ++ +A +HY R W ++ K G+N
Sbjct: 14 VIFFSSAEAQTTARKFEAGKNTFLLDGKPFVVKAAELHYTRIPQAYWDHRIEMCKALGMN 73
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
TI Y+FWN HE GK+ F G+ ++ F + Q+ MY+I+R GP+V AE+ GG+P W
Sbjct: 74 TICIYIFWNIHEQEEGKFDFTGQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWW 133
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESF 191
L R +P Y+M++ + ++ K+ L ++GG II+ QVENEYG Y
Sbjct: 134 LLKKKDVALRT-LDP--YYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQVENEYGSY--- 187
Query: 192 YGEGGKRYALWAAKMAVAQN--IGVPWIMCQ--------QFDTPDPVINTCNSFYCDQ-- 239
G + + A + V ++ VP C D IN DQ
Sbjct: 188 ---GTDKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTRNALDDLIWTINFGTGANIDQQF 244
Query: 240 --FTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGG 297
P P + +E W GWF +G + RP++D+ + + S + YM HGG
Sbjct: 245 KKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKEMLDRNISF-SLYMTHGG 303
Query: 298 TNFGRTAGG-----PFITTSYDYEAPIDEYG 323
T FG G + +SYDY+API E G
Sbjct: 304 TTFGHWGGANNPAYSAMCSSYDYDAPISEAG 334
>gi|298386767|ref|ZP_06996322.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
gi|298260441|gb|EFI03310.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
Length = 778
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 102/331 (30%), Positives = 157/331 (47%), Gaps = 30/331 (9%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
+IFFSS+ A + +++G+ ++ +A +HY R W ++ K G+N
Sbjct: 14 VIFFSSAEAQTTARKFEAGKNTFLLDGKPFVVKAAELHYTRIPQAYWDHRIEMCKALGMN 73
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
TI Y+FWN HE GK+ F G+ ++ F + Q+ MY+I+R GP+V AE+ GG+P W
Sbjct: 74 TICIYIFWNIHEQEEGKFDFTGQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWW 133
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESF 191
L R +P Y+M++ + ++ K+ L ++GG II+ QVENEYG Y
Sbjct: 134 LLKKKDVALRT-LDP--YYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQVENEYGSY--- 187
Query: 192 YGEGGKRYALWAAKMAVAQN--IGVPWIMCQ--------QFDTPDPVINTCNSFYCDQ-- 239
G + + A + V ++ VP C D IN DQ
Sbjct: 188 ---GTDKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTRNALDDLIWTINFGTGANIDQQF 244
Query: 240 --FTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGG 297
P P + +E W GWF +G + RP++D+ + + S + YM HGG
Sbjct: 245 KKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKEMLDRNISF-SLYMTHGG 303
Query: 298 TNFGRTAGG-----PFITTSYDYEAPIDEYG 323
T FG G + +SYDY+API E G
Sbjct: 304 TTFGHWGGANNPAYSAMCSSYDYDAPISEAG 334
>gi|422698394|ref|ZP_16756303.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1346]
gi|315173078|gb|EFU17095.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1346]
Length = 604
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 109/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 18 EEFLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 77
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 78 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 136
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG +GE K Y + +A+ +
Sbjct: 137 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYGS----FGE-EKAYLRAIRDLMIARGV 189
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 190 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFDMMQAFFEEHGKKWPLMC 246
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 247 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 300
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P I TSYDY+AP+DE G P + K LH
Sbjct: 301 ARGTIDLPQI-TSYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|400603388|gb|EJP70986.1| glycoside hydrolase family 35 [Beauveria bassiana ARSEF 2860]
Length = 631
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 101/327 (30%), Positives = 161/327 (49%), Gaps = 31/327 (9%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
GN +Y+ ++NG+ II + R P W ++ A+ G+NTI SY++WN HE
Sbjct: 27 GNFSYNRHQFLLNGQPYQIIGGQMDPQRIPPEYWTHRLKMARAMGLNTIFSYLYWNLHEP 86
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
SPG++ F GR N+ +F ++ Q+ + ++LR GP++ E ++GG P WL +PG R +
Sbjct: 87 SPGEWDFQGRNNVAEFFRLAQEEGLKVVLRPGPYICGERDWGGFPAWLSQVPGMAVRQNN 146
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
PF + ++ + + L +QGGPI++ Q+ENEYG + G + A AA
Sbjct: 147 GPFLDAAKSYINRVGKELG--SLQITQGGPILMTQLENEYGSF----GTDKEYLAALAAM 200
Query: 206 MAVAQNI--------GVPWIMCQQFDTPDPVIN--TCNSFYC-DQFTPHSPSM-PKIWTE 253
+ ++ G ++ QF VI+ + F D++ S+ P++ E
Sbjct: 201 LHDNFDVFLYTNDGGGKSYLEGGQFHGVLAVIDGDSKTGFEARDKYVTDPTSLGPQLNGE 260
Query: 254 NWPGWFKTFGGRDPHRPSE------DIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
+ W +G H+ S D A + G + YM+HGGTNFG GG
Sbjct: 261 YYITWIDQWGSDYSHQQSSGSQTKIDKAVGDLDWTLAGNYSFSIYMFHGGTNFGFENGGI 320
Query: 307 ------PFITTSYDYEAPIDEYGLPRN 327
+TTSYDY AP+DE G P +
Sbjct: 321 RDDGPLAAVTTSYDYGAPLDESGRPTD 347
>gi|224027078|ref|ZP_03645444.1| hypothetical protein BACCOPRO_03839 [Bacteroides coprophilus DSM
18228]
gi|224020314|gb|EEF78312.1| hypothetical protein BACCOPRO_03839 [Bacteroides coprophilus DSM
18228]
Length = 783
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 104/325 (32%), Positives = 155/325 (47%), Gaps = 31/325 (9%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
++ ++NG+ LI +A IHY R W ++ K G+NTI Y FWN HE PG++
Sbjct: 37 NKEFLLNGKPFLIKAAEIHYTRIPAEYWEHRIEMCKALGMNTICIYAFWNIHEQRPGEFD 96
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G+ ++ +F ++ Q+ MY++LR GP+V +E+ GG+P WL R F
Sbjct: 97 FEGQNDVARFCRLAQKHGMYIMLRPGPYVCSEWEMGGLPWWLLKKKDIALRTSDPYFLER 156
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+ FM + + L A +GG II+ QVENEYG Y K Y A+ + +
Sbjct: 157 TKIFMNELGKQLA--DLQAPRGGNIIMVQVENEYGAYAE-----DKEYI--ASIRDIVRG 207
Query: 212 IG---VPWIMCQ-----QFDTPDPVINTCN---SFYCDQ----FTPHSPSMPKIWTENWP 256
G VP C Q + D ++ T N DQ P P + +E W
Sbjct: 208 AGFTDVPLFQCDWASTFQRNGLDDLLWTINFGTGADIDQQFKALREARPETPLMCSEYWS 267
Query: 257 GWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-----PFITT 311
GWF +G + RP++ + + + S + YM HGGT FG G + +
Sbjct: 268 GWFDHWGRKHETRPADVMVKGIKDMMDRNISF-SLYMTHGGTTFGHWGGANSPSYSAMCS 326
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKEL 336
SYDY+API E G PK+ L++L
Sbjct: 327 SYDYDAPISEAGWA-TPKYYQLRDL 350
>gi|300861196|ref|ZP_07107283.1| putative beta-galactosidase [Enterococcus faecalis TUSoD Ef11]
gi|428767294|ref|YP_007153405.1| beta-galactosidase [Enterococcus faecalis str. Symbioflor 1]
gi|300850235|gb|EFK77985.1| putative beta-galactosidase [Enterococcus faecalis TUSoD Ef11]
gi|427185467|emb|CCO72691.1| beta-galactosidase [Enterococcus faecalis str. Symbioflor 1]
Length = 594
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 110/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG SF E K Y + +A+ +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG---SFGEE--KAYLRAIRDLMIARGV 179
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 290
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 291 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|147778844|emb|CAN67049.1| hypothetical protein VITISV_001154 [Vitis vinifera]
Length = 317
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 79/185 (42%), Positives = 99/185 (53%), Gaps = 18/185 (9%)
Query: 557 VGLQNAGPFYEWVGAGIT-SVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNI 615
+ N G F E GAG VK+TGF +G +DLS YSWTY++GL+GE IY
Sbjct: 22 IAAGNYGAFLEKDGAGFKGQVKLTGFKNGEIDLSEYSWTYQVGLRGEFQKIYMIDESEKA 81
Query: 616 NWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKS 675
W TWYK P G+ P+ LD+ MGKG AW+NG IGRYW R
Sbjct: 82 EWTDLTPDASPSTFTWYKTFFDAPNGENPVALDLGSMGKGQAWVNGHHIGRYWTR----V 137
Query: 676 SPHDECVQECDYRGKFNPDKCITGCGEPSQRWYHIPRSWFKPSENILVIFEEKGGDPTKI 735
+P D C +CDYRG ++ K YHIPRSW + S N+LV+FEE GG P +I
Sbjct: 138 APKDGC-GKCDYRGHYHTSK------------YHIPRSWLQASNNLLVLFEETGGKPFEI 184
Query: 736 TFSIR 740
+ R
Sbjct: 185 SVKSR 189
>gi|422708708|ref|ZP_16766236.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
gi|315036693|gb|EFT48625.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
Length = 604
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 110/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 18 EEFLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 77
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 78 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 136
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG SF E K Y + +A+ +
Sbjct: 137 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG---SFGEE--KAYLRAIRDLMIARGV 189
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 190 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 246
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 247 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 300
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 301 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|219117911|ref|XP_002179741.1| beta-galactosidase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408794|gb|EEC48727.1| beta-galactosidase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 951
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 198/836 (23%), Positives = 322/836 (38%), Gaps = 148/836 (17%)
Query: 15 FFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTI 74
+F S Y +V+YD R++ IN +R L++S ++H R+ G W + +A G+N I
Sbjct: 137 YFPSFWNYNGNLSVSYDERAIRINDKRVLLLSGSMHPVRATRGTWEHALDEAVYNGLNMI 196
Query: 75 ESYVFWNGHEL---SPGKYYFGG--------RFNLVKFIKIIQQARMYMILRIGPFVAAE 123
Y+FW H+ P + G ++ L ++ +++ +RIGP+ E
Sbjct: 197 TVYIFWGAHQSFRDEPLNWSLDGSSIGPKESQWELADALRSAANRGLFIHVRIGPYACGE 256
Query: 124 YNYGGIPVWLHYIPGTV-FRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVE 182
Y YGGIP WL T+ R P+ M+ F+ + + L+A QGGPI++AQ+E
Sbjct: 257 YTYGGIPEWLPLQSSTMRMRRLNRPWLDAMEGFVAATITYLSSFNLWAHQGGPILIAQIE 316
Query: 183 NEYG---------------------------------YYESFYGEGGKR----------- 198
NE G Y R
Sbjct: 317 NELGSGVDGSAAANYVVLERDEFNDDKHEDSHLLQLDRYGHILENASSRGMDSELRNATV 376
Query: 199 --YALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPS------MPKI 250
YA W + V W MC + + + D + S P I
Sbjct: 377 QDYADWCGNLVARLAPNVIWTMCNGLSAENTISTFNGNNGIDWLEKYGDSGRIQVDQPAI 436
Query: 251 WTENWPGWFKTFGGRDPHRPSE--------DIAFSVARFFQKGGSVHNYYMYHGGTNFGR 302
WTE+ G F+ +G + P +PS+ +A ++F +GG+ NYYM+ GG N GR
Sbjct: 437 WTED-EGGFQLWGDQ-PSKPSDYFWGRTSRAMATDALQWFARGGTHLNYYMWWGGYNRGR 494
Query: 303 TAGGPFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQE 362
++ I +Y +A + G R+PK+ H LH I LL+ S L S +
Sbjct: 495 SSAAG-IMNAYATDAFLCSSGQRRHPKYDHFLALHLVIADIAAILLHAPTSLLKNASVEI 553
Query: 363 AD----VYADSSGACAAFLANMDDKND-KTVVF-RNVSYHLPAWSVSILPDCKKVVFNTA 416
D + D+ FL + D +D K V+F N + ++ +VF
Sbjct: 554 MDGDDWIVGDNQ---RQFLYQVLDTHDSKQVIFLENDANTTEMARLTGAKADDSLVFVMK 610
Query: 417 NVRAQSSTVEMVPENLQPSEASPDNGSKGLKWQ--VFKEIAGIWGE----ADFVKSGFVD 470
+Q +V + + + L ++ V + W E AD ++ V
Sbjct: 611 PYSSQIVIDGIVAFDSSTISTKAMSFRRTLHYEPAVLLHLTS-WSEPIAGADTDQNAHVS 669
Query: 471 -------HINTTKD-TTDYLWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQEL 522
++N+ ++DY WY T + ++ +K + + K AL F +
Sbjct: 670 TEPLEQTNLNSKASISSDYAWYGTDVKIDVVLSQVK-----LYIGTEKATALAVFIDGAF 724
Query: 523 QGSASGNGTH---PPFKYKNPISLKAGKNEIALLSMTVGLQNA----GPFYEWVGAGITS 575
G A+ N H P SL AG + +A+L ++G N G GIT
Sbjct: 725 IGEAN-NHQHAEGPTVLSIEIESLAAGTHRLAILCESLGYHNLIGRWGAITTAKPKGITG 783
Query: 576 VKITGF-----NSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVSTMEPPKNQPLT 630
+ G N +D W+ GL E + R + + E + PL
Sbjct: 784 NVLIGSPLLSENISLVDGRQMWWSLP-GLSVERKAARHGLRRESFEDAAQAEAGLH-PL- 840
Query: 631 WYKAVVKQPPGDEPIGLDMLKM--GKGLAWLNGEEIGRYWPRKSRKSSPHDECVQECDYR 688
W + P D + L + G+G WLNG+++GRYW +R +S +D
Sbjct: 841 WSSVLFTSPQFDSTVHSLFLDLTSGRGHLWLNGKDLGRYW-NITRGNSWNDY-------- 891
Query: 689 GKFNPDKCITGCGEPSQRWYHIPRSW--FKPSENILVIFEEKGGDPTKITFSIRKI 742
SQR+Y +P + N L++F+ GGD + + I
Sbjct: 892 ---------------SQRYYFLPADFLHLDGQLNELILFDMLGGDHSAARLLLSSI 932
>gi|227538632|ref|ZP_03968681.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33300]
gi|227241551|gb|EEI91566.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33300]
Length = 638
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 110/338 (32%), Positives = 153/338 (45%), Gaps = 42/338 (12%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
GN YD G+ I+S +HY R W +Q K G+NT+ +YVFWN HE
Sbjct: 39 GNFVYD-------GKTTRILSGEMHYARIPHQYWKHRLQMVKSMGLNTVATYVFWNFHEE 91
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
SPG + F G +L FIK + +++ILR GP+ AE+++GG P WL I G R D
Sbjct: 92 SPGNWNFEGDHDLAAFIKTAGEVGLHVILRPGPYACAEWDFGGYPWWLQKIDGLEIRRDN 151
Query: 146 EPFKYHMQKFMTLIVDMMKRE--KLFASQGGPIILAQVENEYGYYESFYG----EGGKRY 199
F + +K+ +D + +E L + GGPII+ Q ENE+G Y S E K Y
Sbjct: 152 AKFLEYTKKY----IDRLAKEVGSLQITNGGPIIMVQAENEFGSYVSQRKDIPLEEHKAY 207
Query: 200 ALWAAKMAVAQNIGVPWIMCQ--------QFDTPDPVINTCNSF-----YCDQFTPHSPS 246
K VP P N N+ DQ+ ++
Sbjct: 208 NAKIKKQLEEAGFNVPLFTSDGSWLFEGGAIPGALPTANGENNISNLKKVVDQY--NNNQ 265
Query: 247 MPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
P + E +PGW + + IA ++ Q S NYYM HGGTNFG T+G
Sbjct: 266 GPYMVAEFYPGWLDHWAEPFAKVDAGRIARQTEKYLQNDISF-NYYMVHGGTNFGFTSGA 324
Query: 307 PF--------ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
+ TSYDY+API E G PK+ ++ +
Sbjct: 325 NYNNKSDIQPDITSYDYDAPISEAGWA-TPKYDSIRTV 361
>gi|307275736|ref|ZP_07556876.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
gi|307277830|ref|ZP_07558914.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|307291757|ref|ZP_07571629.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
gi|422685752|ref|ZP_16743965.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
gi|422720681|ref|ZP_16777290.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|422739238|ref|ZP_16794421.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
gi|306497209|gb|EFM66754.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
gi|306505227|gb|EFM74413.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|306507612|gb|EFM76742.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
gi|315029464|gb|EFT41396.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
gi|315032072|gb|EFT44004.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|315144900|gb|EFT88916.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
Length = 604
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 109/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 18 EEFLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 77
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 78 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 136
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG + GE K Y + +A+ +
Sbjct: 137 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYGSF----GEE-KAYLRAIRDLMIARGV 189
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 190 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 246
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 247 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 300
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 301 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|229549776|ref|ZP_04438501.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|312950913|ref|ZP_07769823.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|422692785|ref|ZP_16750800.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|422706430|ref|ZP_16764128.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
gi|422727290|ref|ZP_16783733.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0312]
gi|229305045|gb|EEN71041.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|310631062|gb|EFQ14345.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|315152244|gb|EFT96260.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|315156045|gb|EFU00062.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
gi|315157806|gb|EFU01823.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0312]
Length = 604
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 109/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 18 EEFLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 77
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 78 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 136
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG + GE K Y + +A+ +
Sbjct: 137 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYGSF----GEE-KAYLRAIRDLMIARGV 189
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 190 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 246
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 247 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 300
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 301 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|256396208|ref|YP_003117772.1| beta-galactosidase [Catenulispora acidiphila DSM 44928]
gi|256362434|gb|ACU75931.1| Beta-galactosidase [Catenulispora acidiphila DSM 44928]
Length = 625
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 106/334 (31%), Positives = 162/334 (48%), Gaps = 31/334 (9%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+T D + GR I+SAAIHY R P +W +Q+ + G NT+E Y+ WN H+ +P
Sbjct: 7 LTIDGGRFLRGGREHRIVSAAIHYFRIHPDLWRDRLQRLRAMGCNTVECYIAWNFHQPTP 66
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
F G ++ F+++ + +I R GP++ AE+++GG+P WL R
Sbjct: 67 AAPRFDGWRDVAGFVRLAGELGFDVIARPGPYICAEWDFGGLPAWLLADENVRLRTTDPV 126
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYES---FYGEGGKRYALWAA 204
+ + + ++ ++ +L A++GGP++ Q+ENEYG + + + K
Sbjct: 127 YLAAVDAWFDELIPVLA--ELQATRGGPVVAVQIENEYGSFGADPDYLDHLRKGLIERGV 184
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCN--SFYCDQFTPH---SPSMPKIWTENWPGWF 259
+ + G +M PD V+ T N S + F P P + E W GWF
Sbjct: 185 DTLLFTSDGPQELMLAGGTVPD-VLATVNFGSRADEAFATLRRVRPDDPPVCMEFWNGWF 243
Query: 260 KTFGGRDPH--RPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG----------- 306
FG +PH R ++D A S+ GGSV N+YM HGGTNFG AG
Sbjct: 244 DHFG--EPHHTRSAQDAARSLDEILAAGGSV-NFYMGHGGTNFGFWAGANHSGVGTGDPG 300
Query: 307 --PFITTSYDYEAPIDEYGLPRNPKWGHLKELHG 338
P I TSYDY+AP+ E G PK+ +E+ G
Sbjct: 301 YQPTI-TSYDYDAPVGEAG-ELTPKFHLFREVVG 332
>gi|256959208|ref|ZP_05563379.1| beta-galactosidase [Enterococcus faecalis DS5]
gi|256949704|gb|EEU66336.1| beta-galactosidase [Enterococcus faecalis DS5]
Length = 594
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 109/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG + GE K Y + +A+ +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYGSF----GEE-KAYLRAIRDLMIARGV 179
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 290
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 291 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|255972505|ref|ZP_05423091.1| beta-galactosidase [Enterococcus faecalis T1]
gi|257422333|ref|ZP_05599323.1| glycosyl hydrolase [Enterococcus faecalis X98]
gi|255963523|gb|EET95999.1| beta-galactosidase [Enterococcus faecalis T1]
gi|257164157|gb|EEU94117.1| glycosyl hydrolase [Enterococcus faecalis X98]
Length = 594
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 109/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG + GE K Y + +A+ +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYGSF----GEE-KAYLRAIRDLMIARGV 179
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 290
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 291 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|167755577|ref|ZP_02427704.1| hypothetical protein CLORAM_01091 [Clostridium ramosum DSM 1402]
gi|167704516|gb|EDS19095.1| glycosyl hydrolase family 35 [Clostridium ramosum DSM 1402]
Length = 584
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 103/333 (30%), Positives = 156/333 (46%), Gaps = 41/333 (12%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
++ ING + IIS A+HY R VP W + K G NT+E+YV WN HE GKY
Sbjct: 7 NKEFFINGNKVKIISGAVHYFRIVPEYWRDTLLDLKAMGCNTVETYVPWNLHEPYQGKYD 66
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G ++ F+K+ ++ +++ILR P++ AE+ GG+P WL P R + + +
Sbjct: 67 FSGIKDIETFLKLAEELELFVILRASPYICAEWEMGGLPAWLLKYPRIRLRTNDKQYLKC 126
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+ ++ ++++ + + ++ +Q GPIILAQ+ENEYG YGE K Y L +M
Sbjct: 127 LDQYFSILLPKLSKYQI--TQNGPIILAQLENEYGS----YGE-DKEYLLAVYQMMRKYG 179
Query: 212 IGVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIW 251
I VP T +N + F + + P +
Sbjct: 180 IEVPLFTAD--GTWHEALNAGSLLEKKVFPTGNFGSQAKENITVLKKFMESYQITAPLMC 237
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG----- 306
E W GWF + R ++ S G N+YM+ GGTNFG G
Sbjct: 238 MEFWDGWFNRWNQEIIKRDPQEFVNSAQEMLSLGSV--NFYMFQGGTNFGWMNGCSARKE 295
Query: 307 ---PFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
P I TSYDY+A + EYG + K+ L+E+
Sbjct: 296 HDLPQI-TSYDYDAILTEYG-AKTEKYHLLREV 326
>gi|313231869|emb|CBY08981.1| unnamed protein product [Oikopleura dioica]
Length = 664
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 99/336 (29%), Positives = 160/336 (47%), Gaps = 34/336 (10%)
Query: 12 LLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGV 71
LL+F ++S+++ +++YDS++ + ++S ++HY R W + + K G+
Sbjct: 38 LLLFSNTSLSFRRRPSLSYDSKNFYLGEEPTQLLSGSVHYFRIPKKYWYDRLAKLKSAGL 97
Query: 72 NTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPV 131
N + +YV WN HE PG++ F G ++V FI I + +++ILR GP++ +E+ +GG+P
Sbjct: 98 NGVTTYVPWNLHEPEPGEFSFSGELDIVHFINIARTLDLFVILRPGPYICSEWEWGGLPP 157
Query: 132 WLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESF 191
WL R + + +++F ++ ++K ++ + GGPI+ QVENEYG Y
Sbjct: 158 WLLRDSFMKVRTNYSGYITAVKRFFGQLIPLIKYQQ--SKYGGPIVAVQVENEYGMYA-- 213
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCD------------- 238
G+ G A++ + I P D N N+ Y D
Sbjct: 214 -GQDGAHLNT-LAELLKNEGIVEPLFTSDGSSVWD---NEKNTIYEDGLKSVNFKSNPEK 268
Query: 239 ---QFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYH 295
H P P E W GWF +G + D ++ S+ N+YM+H
Sbjct: 269 HLKSLRGHFPEQPLWVMEFWAGWFDWWGEGRNLFDNSDFQKNLDVILDHKASL-NFYMFH 327
Query: 296 GGTNFGRTAGGPFI--------TTSYDYEAPIDEYG 323
GGTNFG T GG I TSYDY+ PI E G
Sbjct: 328 GGTNFGFTNGGLTIARGYYTADVTSYDYDCPISEAG 363
>gi|300770171|ref|ZP_07080050.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33861]
gi|300762647|gb|EFK59464.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33861]
Length = 638
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 110/338 (32%), Positives = 153/338 (45%), Gaps = 42/338 (12%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
GN YD G+ I+S +HY R W +Q K G+NT+ +YVFWN HE
Sbjct: 39 GNFVYD-------GKATRILSGEMHYARIPHQYWKHRLQMVKSMGLNTVATYVFWNFHEE 91
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
SPG + F G +L FIK + +++ILR GP+ AE+++GG P WL I G R D
Sbjct: 92 SPGNWNFEGDHDLAAFIKTAGEVGLHVILRPGPYACAEWDFGGYPWWLQKIDGLEIRRDN 151
Query: 146 EPFKYHMQKFMTLIVDMMKRE--KLFASQGGPIILAQVENEYGYYESFYG----EGGKRY 199
F + +K+ +D + +E L + GGPII+ Q ENE+G Y S E K Y
Sbjct: 152 AKFLEYTKKY----IDRLAKEVGSLQITNGGPIIMVQAENEFGSYVSQRKDIPLEEHKAY 207
Query: 200 ALWAAKMAVAQNIGVPWIMCQ--------QFDTPDPVINTCNSF-----YCDQFTPHSPS 246
K VP P N N+ DQ+ ++
Sbjct: 208 NAKIKKQLEEAGFNVPLFTSDGSWLFEGGAIPGALPTANGENNISNLKKVVDQY--NNNQ 265
Query: 247 MPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
P + E +PGW + + IA ++ Q S NYYM HGGTNFG T+G
Sbjct: 266 GPYMVAEFYPGWLDHWAEPFAKVDAGRIARQTEKYLQNDISF-NYYMVHGGTNFGFTSGA 324
Query: 307 PF--------ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
+ TSYDY+API E G PK+ ++ +
Sbjct: 325 NYNNKSDIQPDITSYDYDAPISEAGW-TTPKYDSIRTV 361
>gi|307269354|ref|ZP_07550702.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
gi|306514322|gb|EFM82889.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
Length = 604
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 110/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 18 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 77
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 78 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 136
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG SF E K Y + +A+ +
Sbjct: 137 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG---SFGEE--KAYLRAIRDLMIARGV 189
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 190 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 246
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 247 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 300
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 301 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|422695218|ref|ZP_16753206.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
gi|315147501|gb|EFT91517.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
Length = 604
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 110/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 18 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 77
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 78 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 136
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG SF E K Y + +A+ +
Sbjct: 137 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG---SFGEE--KAYLRAIRDLMIARGV 189
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 190 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 246
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 247 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 300
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 301 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|422722062|ref|ZP_16778639.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2137]
gi|424672983|ref|ZP_18109926.1| putative beta-galactosidase [Enterococcus faecalis 599]
gi|315027959|gb|EFT39891.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2137]
gi|402352793|gb|EJU87629.1| putative beta-galactosidase [Enterococcus faecalis 599]
Length = 604
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 110/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 18 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 77
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 78 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 136
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG SF E K Y + +A+ +
Sbjct: 137 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG---SFGEE--KAYLRAIRDLMIARGV 189
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 190 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 246
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 247 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 300
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 301 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|196002910|ref|XP_002111322.1| hypothetical protein TRIADDRAFT_1215 [Trichoplax adhaerens]
gi|190585221|gb|EDV25289.1| hypothetical protein TRIADDRAFT_1215, partial [Trichoplax
adhaerens]
Length = 543
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 103/306 (33%), Positives = 149/306 (48%), Gaps = 41/306 (13%)
Query: 44 IISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIK 103
I S AIHY R VP W + + K G+NT+E+YV WN HE PG++ + G N+ KFI
Sbjct: 13 IRSGAIHYFRVVPEYWRDRLLKMKAFGLNTVETYVPWNLHEPVPGQFDYTGILNVRKFIL 72
Query: 104 IIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMM 163
+ Q+ Y+ILR GP++ AE+ +GG+P WL R+ +PFK + +F + +
Sbjct: 73 LAQELGFYVILRPGPYICAEWEFGGMPSWLLSDKNMQVRSTYKPFKDAVNRFFDGFIPEI 132
Query: 164 KREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFD 223
K L AS+GGPII QVENEYG Y S + Y + + N G+ ++ +
Sbjct: 133 K--SLQASKGGPIIAVQVENEYGSYGS-----DEEYMQFIRDALI--NRGIVELLVTSDN 183
Query: 224 TPDP-------VINTCNSFYCDQFTPHSPS----------MPKIWTENWPGWFKTFGGRD 266
+ V+ T N F H+ S P I E W GWF +G ++
Sbjct: 184 SEGIKHGGAPGVLKTYN------FQGHAKSHLSILERLQDAPSIVMEFWSGWFDHWGEKN 237
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI---------TTSYDYEA 317
+ + + + N+Y++HGGTNFG G FI TSYDY+A
Sbjct: 238 HQVHTIAHVTNTFKDILDCDASFNFYVFHGGTNFGFMNGANFIDFFSYYLPTVTSYDYDA 297
Query: 318 PIDEYG 323
P+ E G
Sbjct: 298 PLSEAG 303
>gi|312901788|ref|ZP_07761056.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
gi|311291123|gb|EFQ69679.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
Length = 604
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 109/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 18 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 77
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 78 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 136
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG + GE K Y + +A+ +
Sbjct: 137 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYGSF----GEE-KAYLRAIRDLMIARGV 189
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 190 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 246
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 247 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 300
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 301 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|255975619|ref|ZP_05426205.1| beta-galactosidase [Enterococcus faecalis T2]
gi|256619294|ref|ZP_05476140.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|256853354|ref|ZP_05558724.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
gi|421514060|ref|ZP_15960775.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
gi|255968491|gb|EET99113.1| beta-galactosidase [Enterococcus faecalis T2]
gi|256598821|gb|EEU17997.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|256711813|gb|EEU26851.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
gi|401672857|gb|EJS79300.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
Length = 594
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 109/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG + GE K Y + +A+ +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYGSF----GEE-KAYLRAIRDLMIARGV 179
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 290
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 291 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|422866702|ref|ZP_16913314.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
gi|329578150|gb|EGG59560.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
Length = 604
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 109/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 18 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 77
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 78 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 136
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG + GE K Y + +A+ +
Sbjct: 137 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYGSF----GEE-KAYLRAIRDLMIARGV 189
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 190 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 246
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 247 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 300
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 301 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|307289344|ref|ZP_07569299.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
gi|422704713|ref|ZP_16762523.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
gi|306499711|gb|EFM69073.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
gi|315163744|gb|EFU07761.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
Length = 604
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 110/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 18 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 77
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 78 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 136
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG SF E K Y + +A+ +
Sbjct: 137 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG---SFGEE--KAYLRAIRDLMIARGV 189
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 190 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 246
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 247 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 300
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 301 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|257084951|ref|ZP_05579312.1| beta-galactosidase [Enterococcus faecalis Fly1]
gi|256992981|gb|EEU80283.1| beta-galactosidase [Enterococcus faecalis Fly1]
Length = 594
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 110/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG SF E K Y + +A+ +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG---SFGEE--KAYLRAIRDLMIARGV 179
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 290
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 291 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|307272985|ref|ZP_07554232.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
gi|306510599|gb|EFM79622.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
Length = 604
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 109/338 (32%), Positives = 165/338 (48%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 18 EEFLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 77
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 78 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 136
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L GG I++ Q+ENEYG + GE K Y + +A+ +
Sbjct: 137 AEYYDVLMEKIVPHQLV--NGGNILMIQIENEYGSF----GEE-KAYLRAIRDLMIARGV 189
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 190 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 246
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 247 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 300
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 301 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|257079244|ref|ZP_05573605.1| beta-galactosidase [Enterococcus faecalis JH1]
gi|294780244|ref|ZP_06745615.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
gi|397700110|ref|YP_006537898.1| beta-galactosidase [Enterococcus faecalis D32]
gi|256987274|gb|EEU74576.1| beta-galactosidase [Enterococcus faecalis JH1]
gi|294452672|gb|EFG21103.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
gi|397336749|gb|AFO44421.1| beta-galactosidase [Enterococcus faecalis D32]
Length = 594
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 110/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG SF E K Y + +A+ +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG---SFGEE--KAYLRAIRDLMIARGV 179
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 290
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 291 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|257416321|ref|ZP_05593315.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
gi|257158149|gb|EEU88109.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
Length = 594
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 109/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG + GE K Y + +A+ +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYGSF----GEE-KAYLRAIRDLMIARGV 179
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 290
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 291 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|257087085|ref|ZP_05581446.1| beta-galactosidase [Enterococcus faecalis D6]
gi|256995115|gb|EEU82417.1| beta-galactosidase [Enterococcus faecalis D6]
Length = 594
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 110/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG SF E K Y + +A+ +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG---SFGEE--KAYLRAIRDLMIARGV 179
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 290
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 291 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|256762786|ref|ZP_05503366.1| beta-galactosidase [Enterococcus faecalis T3]
gi|256684037|gb|EEU23732.1| beta-galactosidase [Enterococcus faecalis T3]
Length = 594
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 110/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG SF E K Y + +A+ +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG---SFGEE--KAYLRAIRDLMIARGV 179
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 290
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 291 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|422701998|ref|ZP_16759838.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
gi|315169479|gb|EFU13496.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
Length = 604
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 109/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 18 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 77
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 78 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 136
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG + GE K Y + +A+ +
Sbjct: 137 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYGSF----GEE-KAYLRAIRDLMIARGV 189
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 190 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQVFFEEHGKKWPLMC 246
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 247 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 300
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 301 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|322703307|gb|EFY94918.1| beta-calactosidase, putative [Metarhizium anisopliae ARSEF 23]
Length = 645
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 98/343 (28%), Positives = 158/343 (46%), Gaps = 45/343 (13%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
L+ + + GN TYD + +++G +I + R P W +Q AK G+N
Sbjct: 19 LLSLAKPLVAAHRGNFTYDRHNFLLDGVPIQLIGGQMDPQRIPPAYWTQRLQMAKAMGLN 78
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
TI SYVFWN E + G + F GR ++ +F+++ QQ +Y++LR GP++ E+ +GG P W
Sbjct: 79 TIFSYVFWNNIEPTEGSWDFDGRNDIARFLRLAQQEGLYVVLRPGPYICGEHEWGGFPSW 138
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFY 192
L IPG R + +PF + ++ + + + SQGGP+++ Q+ENEYG +
Sbjct: 139 LAQIPGMAVRQNNKPFLDASRNYLEQLGKHLAATHI--SQGGPVLMTQLENEYGSFGK-- 194
Query: 193 GEGGKRYALWAAKMAVAQNIGVPW-----------------IMCQQFDTPDPVINTCNSF 235
K Y A M A G + I+ + P + +
Sbjct: 195 ---DKAYLRAMADMLKANFDGFLYTNDGGGKSYLDGGSLHGILAETDGDPKTGFAARDQY 251
Query: 236 YCDQFTPHSPSM--PKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQK------GGS 287
D P+M P++ E + W + P++ + + R G +
Sbjct: 252 VTD------PTMLGPQLDGEYYVTWIDDWSSNSPYQYTSGRPDATKRVLDDLDWILAGNN 305
Query: 288 VHNYYMYHGGTNFGRTAGGPF-------ITTSYDYEAPIDEYG 323
+ YM+HGGTN+G GG + +TTSYDY AP+DE G
Sbjct: 306 SFSIYMFHGGTNWGFENGGIWVDNRLNAVTTSYDYGAPLDESG 348
>gi|384518826|ref|YP_005706131.1| beta-galactosidase [Enterococcus faecalis 62]
gi|323480959|gb|ADX80398.1| beta-galactosidase [Enterococcus faecalis 62]
Length = 594
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 110/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG SF E K Y + +A+ +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG---SFGEE--KAYLRAIRDLMIARGV 179
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 290
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 291 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|345003968|ref|YP_004806822.1| glycoside hydrolase family protein [Streptomyces sp. SirexAA-E]
gi|344319594|gb|AEN14282.1| glycoside hydrolase family 35 [Streptomyces sp. SirexAA-E]
Length = 602
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 94/327 (28%), Positives = 154/327 (47%), Gaps = 28/327 (8%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+TY +L+ GR +++ +HY R P W +++ G+NT+++Y+ WN HE
Sbjct: 9 LTYSEGTLLRAGRPHQVLAGTLHYFRVHPDQWHDRLERLAAMGLNTVDTYIAWNFHERRT 68
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G++ F G ++ +F++ Q+ + +I+R GP++ AE++ GG+P WL PG R+ P
Sbjct: 69 GEHRFDGWRDIERFVRTAQRTGLDVIVRPGPYICAEWDNGGLPAWLTDRPGMRPRSSYAP 128
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
+ + ++ +++ + L A++GGP++ QVENEYG Y + Y W
Sbjct: 129 YLDEVARWFDVLIPRIA--DLQAARGGPVVAVQVENEYGSYGDDHA-----YMRWVHDAL 181
Query: 208 VAQNI--------GVPWIMCQQFDTPDPVINTCNSFYCDQ----FTPHSPSMPKIWTENW 255
+ + G +M P + DQ P + E W
Sbjct: 182 AGRGVTELLYTADGPTELMLDGGSLPGVLATATLGSRADQAAQLLRTRRSGEPFLCAEFW 241
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF------- 308
GWF +G + R A ++ KGGSV + Y HGGTNFG AG
Sbjct: 242 NGWFDHWGEKHHTRSVGSAAAALDEILAKGGSV-SLYPAHGGTNFGLWAGANHADGALQP 300
Query: 309 ITTSYDYEAPIDEYGLPRNPKWGHLKE 335
TSYD +API E+G P PK+ ++
Sbjct: 301 TVTSYDSDAPIAEHGAP-TPKFHAFRD 326
>gi|257082326|ref|ZP_05576687.1| beta-galactosidase [Enterococcus faecalis E1Sol]
gi|256990356|gb|EEU77658.1| beta-galactosidase [Enterococcus faecalis E1Sol]
Length = 594
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 109/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG + GE K Y + +A+ +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYGSF----GEE-KAYLRAIRDLMIARGV 179
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 290
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 291 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|383114571|ref|ZP_09935333.1| hypothetical protein BSGG_1258 [Bacteroides sp. D2]
gi|382948460|gb|EFS30558.2| hypothetical protein BSGG_1258 [Bacteroides sp. D2]
Length = 775
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 99/335 (29%), Positives = 157/335 (46%), Gaps = 23/335 (6%)
Query: 4 RTPIAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLV 63
+T + +++ F S V + + I G+ +I +HYPR W +
Sbjct: 6 KTVLVILNIIVSFLISSCSSPKEQVRIGNGTFTIEGKDIQLICGEMHYPRIPHEYWRDRL 65
Query: 64 QQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAE 123
++A+ G+NT+ +YVFWN HE PG++ F G+ ++ +FI+ Q+ +Y+ILR GP+V AE
Sbjct: 66 KRARAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGPYVCAE 125
Query: 124 YNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVEN 183
+++GG P WL +R+ F + ++++ + + L + GG II+ QVEN
Sbjct: 126 WDFGGYPSWLLKEKDMTYRSKDPRFLSYCERYIKELGKQLS--PLTINNGGNIIMVQVEN 183
Query: 184 EYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQ-----QFDTPDPVINTCNSFYCD 238
EYG Y + K Y M VP C + + + T N + +
Sbjct: 184 EYGSYAA-----DKEYLAAIRDMIKEAGFNVPLFTCDGGGQVEAGHVEGALPTLNGVFGE 238
Query: 239 QF----TPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMY 294
+ P E +P WF +G R E A + G SV + YM+
Sbjct: 239 DIFKVVDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLSHGVSV-SMYMF 297
Query: 295 HGGTNF----GRTAGGPF--ITTSYDYEAPIDEYG 323
HGGTNF G GG + TSYDY+AP+ E+G
Sbjct: 298 HGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWG 332
>gi|256964894|ref|ZP_05569065.1| beta-galactosidase [Enterococcus faecalis HIP11704]
gi|256955390|gb|EEU72022.1| beta-galactosidase [Enterococcus faecalis HIP11704]
Length = 594
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 110/338 (32%), Positives = 165/338 (48%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L GG I++ Q+ENEYG SF E K Y + +A+ +
Sbjct: 127 AEYYDVLMEKIVPHQLV--NGGNILMIQIENEYG---SFGEE--KAYLRAIRDLMIARGV 179
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 290
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 291 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|84494646|ref|ZP_00993765.1| beta-galactosidase [Janibacter sp. HTCC2649]
gi|84384139|gb|EAQ00019.1| beta-galactosidase [Janibacter sp. HTCC2649]
Length = 592
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 99/311 (31%), Positives = 154/311 (49%), Gaps = 26/311 (8%)
Query: 44 IISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIK 103
++S AIHY R P +W +++ G+NT+E+YV WN HE G+ F G +L +FI
Sbjct: 26 VLSGAIHYFRIHPDLWEDRLRRLAAMGLNTVETYVAWNFHERVRGEIDFTGPRDLARFIS 85
Query: 104 IIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMM 163
+ + +I+R GP++ AE+++GG+P WL PG R F + + +V ++
Sbjct: 86 LAGDLGLDVIVRPGPYICAEWDFGGLPAWLMTEPGIALRTSDPAFLAAVDDWFDAVVPVI 145
Query: 164 KREKLFASQGGPIILAQVENEYGYY--ESFYGEGGKRYAL---WAAKMAVAQNIGVPWIM 218
+ L + GGP++ QVENEYG Y ++ Y E ++ L + + G W+
Sbjct: 146 R--PLLTTAGGPVVAVQVENEYGSYGDDAAYLEHCRKGLLDRGIDVLLFTSDGPGPDWLD 203
Query: 219 CQQFDTPDPVIN----TCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPH--RPSE 272
+N T +F + P+ P + E W GWF +G +PH R +
Sbjct: 204 NGTIPGVLATVNFGSRTDEAFA--ELRKVQPAGPDMVMEYWNGWFDHWG--EPHHVRDVD 259
Query: 273 DIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF-------ITTSYDYEAPIDEYGLP 325
D A + + GGSV N+YM HGGTNFG +G TSYDY+A + E G
Sbjct: 260 DAAGVLDDVLRAGGSV-NFYMAHGGTNFGLWSGANVEDGKLQPTVTSYDYDAAVGEAG-E 317
Query: 326 RNPKWGHLKEL 336
PK+ +E+
Sbjct: 318 LTPKFHAFREV 328
>gi|443689405|gb|ELT91801.1| hypothetical protein CAPTEDRAFT_23316, partial [Capitella teleta]
Length = 596
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 95/329 (28%), Positives = 158/329 (48%), Gaps = 32/329 (9%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
S ++GRR I S + HY R+ P +W + + K G+NT+ +YV WN HE G++ G
Sbjct: 8 SFYLDGRRFKIFSGSFHYFRTHPLLWGDRLLRMKAAGLNTVMTYVPWNFHEPRKGQFTLG 67
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT-EPFKYHM 152
G ++LV F++ +Q+ +Y+I+R GP++ AE+ +GG P WL P R + P+ +
Sbjct: 68 GLYDLVSFMEQVQKVGLYLIVRPGPYICAEWEFGGFPSWLLRDPKMNLRTSSYTPYLNEV 127
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYG----YYESFYGEGGKRYALWAAKMAV 208
+++++ + ++ K GGPII QVENE+G + + +Y+ W +
Sbjct: 128 KQYLSQLFAVLT--KFTYKHGGPIIAFQVENEFGSKGVHDPEYLQFLVTQYSSWNLNELL 185
Query: 209 AQNIGVPWIMCQQFDTPDPVINTCNSFYCD--QFTPHSPSMPKIWTENWPGWFKTFGGRD 266
+ G ++ IN + D + P P + TE W GWF +G
Sbjct: 186 FTSDGKKYLSNGTLPDVLATINLNDHAKEDLEELKEFQPERPLMVTEFWAGWFDHWGEEH 245
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT--------------TS 312
H + ++ + SV N+YM+ GGTNFG G +++ TS
Sbjct: 246 HHYGTTELERELEAILSLNASV-NFYMFIGGTNFGFWNGANYLSYNKDKEASLLGPTVTS 304
Query: 313 YDYEAPIDEYGLPRNPKWGHLKELHGAIK 341
YDY+A + E WGH+K + I+
Sbjct: 305 YDYDAAVSE--------WGHVKPKYNVIR 325
>gi|299148656|ref|ZP_07041718.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
gi|298513417|gb|EFI37304.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
Length = 778
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 99/335 (29%), Positives = 157/335 (46%), Gaps = 23/335 (6%)
Query: 4 RTPIAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLV 63
+T + +++ F S V + + I G+ +I +HYPR W +
Sbjct: 8 KTVLVILNIIVSFLISSCSSPKEQVRIGNGTFTIEGKDIQLICGEMHYPRIPHEYWRDRL 67
Query: 64 QQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAE 123
++A+ G+NT+ +YVFWN HE PG++ F G+ ++ +FI+ Q+ +Y+ILR GP+V AE
Sbjct: 68 KRARAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGPYVCAE 127
Query: 124 YNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVEN 183
+++GG P WL +R+ F + ++++ + + L + GG II+ QVEN
Sbjct: 128 WDFGGYPSWLLKEKDMTYRSKDPRFLSYCERYIKELGKQLS--PLTINNGGNIIMVQVEN 185
Query: 184 EYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQ-----QFDTPDPVINTCNSFYCD 238
EYG Y + K Y M VP C + + + T N + +
Sbjct: 186 EYGSYAA-----DKGYLAAIRDMIKEAGFNVPLFTCDGGGQVEAGHTEGALPTLNGVFGE 240
Query: 239 Q----FTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMY 294
+ P E +P WF +G R E A + G SV + YM+
Sbjct: 241 DIFKVIDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLSHGVSV-SMYMF 299
Query: 295 HGGTNF----GRTAGGPF--ITTSYDYEAPIDEYG 323
HGGTNF G GG + TSYDY+AP+ E+G
Sbjct: 300 HGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWG 334
>gi|160885481|ref|ZP_02066484.1| hypothetical protein BACOVA_03481 [Bacteroides ovatus ATCC 8483]
gi|423290348|ref|ZP_17269197.1| hypothetical protein HMPREF1069_04240 [Bacteroides ovatus
CL02T12C04]
gi|156109103|gb|EDO10848.1| glycosyl hydrolase family 35 [Bacteroides ovatus ATCC 8483]
gi|392665735|gb|EIY59258.1| hypothetical protein HMPREF1069_04240 [Bacteroides ovatus
CL02T12C04]
Length = 778
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 102/331 (30%), Positives = 160/331 (48%), Gaps = 30/331 (9%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
+IFFS++ A + +++G+ ++ +A +HY R W ++ K G+N
Sbjct: 14 VIFFSTAQAQTTARKFEAGKNTFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMN 73
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
TI Y+FWN HE GK+ F G+ ++ F + Q+ MY+I+R GP+V AE+ GG+P W
Sbjct: 74 TICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWW 133
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESF 191
L R +P Y+M++ + ++ K+ L ++GG II+ QVENEYG Y
Sbjct: 134 LLKKKDVALRT-LDP--YYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQVENEYGSY--- 187
Query: 192 YGEGGKRYALWAAKMAVAQN--IGVPWIMCQ-----QFDTPDPVINTCN---SFYCDQ-- 239
G + + A + V ++ VP C + D +I T N DQ
Sbjct: 188 ---GTDKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQF 244
Query: 240 --FTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGG 297
P P + +E W GWF +G + RP++D+ + + S + YM HGG
Sbjct: 245 KKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGG 303
Query: 298 TNFGRTAGG-----PFITTSYDYEAPIDEYG 323
T FG G + +SYDY+API E G
Sbjct: 304 TTFGHWGGANNPAYSAMCSSYDYDAPISEAG 334
>gi|237721434|ref|ZP_04551915.1| beta-galactosidase [Bacteroides sp. 2_2_4]
gi|293370839|ref|ZP_06617384.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
gi|229449230|gb|EEO55021.1| beta-galactosidase [Bacteroides sp. 2_2_4]
gi|292634055|gb|EFF52599.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
Length = 777
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 99/335 (29%), Positives = 157/335 (46%), Gaps = 23/335 (6%)
Query: 4 RTPIAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLV 63
+T + +++ F S V + + I G+ +I +HYPR W +
Sbjct: 8 KTVLVILNIIVSFLISSCSSPKEQVRIGNGTFTIEGKDIQLICGEMHYPRIPHEYWRDRL 67
Query: 64 QQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAE 123
++A+ G+NT+ +YVFWN HE PG++ F G+ ++ +FI+ Q+ +Y+ILR GP+V AE
Sbjct: 68 KRARAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGPYVCAE 127
Query: 124 YNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVEN 183
+++GG P WL +R+ F + ++++ + + L + GG II+ QVEN
Sbjct: 128 WDFGGYPSWLLKEKDMTYRSKDPRFLSYCERYIKELGKQLS--PLTINNGGNIIMVQVEN 185
Query: 184 EYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQ-----QFDTPDPVINTCNSFYCD 238
EYG Y + K Y M VP C + + + T N + +
Sbjct: 186 EYGSYAA-----DKGYLAAIRDMIKEAGFNVPLFTCDGGGQVEAGHTEGALPTLNGVFGE 240
Query: 239 Q----FTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMY 294
+ P E +P WF +G R E A + G SV + YM+
Sbjct: 241 DIFKVIDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLSHGVSV-SMYMF 299
Query: 295 HGGTNF----GRTAGGPF--ITTSYDYEAPIDEYG 323
HGGTNF G GG + TSYDY+AP+ E+G
Sbjct: 300 HGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWG 334
>gi|423252157|ref|ZP_17233159.1| hypothetical protein HMPREF1066_04169 [Bacteroides fragilis
CL03T00C08]
gi|423252477|ref|ZP_17233408.1| hypothetical protein HMPREF1067_00052 [Bacteroides fragilis
CL03T12C07]
gi|392647903|gb|EIY41596.1| hypothetical protein HMPREF1066_04169 [Bacteroides fragilis
CL03T00C08]
gi|392660553|gb|EIY54162.1| hypothetical protein HMPREF1067_00052 [Bacteroides fragilis
CL03T12C07]
Length = 628
Score = 146 bits (369), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 105/313 (33%), Positives = 147/313 (46%), Gaps = 34/313 (10%)
Query: 38 NGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFN 97
NG+ ++S +HY R W +Q K G+NT+ +YVFWN HE PGK+ F G N
Sbjct: 37 NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 98 LVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMT 157
L +FIKI + M +ILR GP+V AE+ +GG P WL + G R D F K+
Sbjct: 97 LAEFIKIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEF----LKYTK 152
Query: 158 LIVDMMKRE--KLFASQGGPIILAQVENEYGYY----ESFYGEGGKRYALWAAKMAVAQN 211
+D + +E L ++GGPI++ Q ENE+G Y + E + Y +
Sbjct: 153 AYIDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAG 212
Query: 212 IGVPWI------MCQQFDTPD--PVINTCNSF-----YCDQFTPHSPSMPKIWTENWPGW 258
VP + + TP P N + DQ+ H P + E +PGW
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQY--HDGKGPYMVAEFYPGW 270
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--------IT 310
+ P + IA ++ Q S N+YM HGGTNFG T+G +
Sbjct: 271 LSHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDM 329
Query: 311 TSYDYEAPIDEYG 323
TSYDY+API E G
Sbjct: 330 TSYDYDAPISEAG 342
>gi|332672111|ref|YP_004455119.1| beta-galactosidase [Cellulomonas fimi ATCC 484]
gi|332341149|gb|AEE47732.1| Beta-galactosidase [Cellulomonas fimi ATCC 484]
Length = 583
Score = 146 bits (369), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 99/321 (30%), Positives = 148/321 (46%), Gaps = 33/321 (10%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
+ + +G I+S A+HY R P W + +A+E G+NTIE+Y+ WN H + G++
Sbjct: 8 EQDFLHDGTPVRILSGALHYFRHHPDQWRDRLTRARELGLNTIETYIPWNAHSPARGEFR 67
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
G +L +F+ + M+ I+R GP++ AE+ GG+P WL + G R +
Sbjct: 68 TDGILDLGRFLDEVAAQGMWAIVRPGPYICAEWTGGGLPGWL-FTAGAAVRRHEPTYLAA 126
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+Q + + ++ ++ +GGP++L QVENEYG Y K Y K+
Sbjct: 127 IQDYYEAVAGIVAPRQV--DRGGPVVLVQVENEYGAYGD-----DKDYLRALVKLLRESG 179
Query: 212 IGVPWIMCQQFDTPD---------PVINTCNSF------YCDQFTPHSPSMPKIWTENWP 256
I P D P+ P ++ SF H P+ P + E W
Sbjct: 180 ITTP---LTTIDQPEPWMLENGSLPELHKTGSFGSRAAERLATLREHQPTGPLMCAEFWD 236
Query: 257 GWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG----GPF--IT 310
GWF ++G + A + G SV N YM GGTNFG T G G + I
Sbjct: 237 GWFDSWGLHHHTTDAAASAHELDTLLAAGASV-NLYMVCGGTNFGFTNGANDKGTYVPIV 295
Query: 311 TSYDYEAPIDEYGLPRNPKWG 331
TSYDY+AP+DE G P W
Sbjct: 296 TSYDYDAPLDEAGRPTAKYWA 316
>gi|288928311|ref|ZP_06422158.1| beta-galactosidase (Lactase) [Prevotella sp. oral taxon 317 str.
F0108]
gi|288331145|gb|EFC69729.1| beta-galactosidase (Lactase) [Prevotella sp. oral taxon 317 str.
F0108]
Length = 674
Score = 146 bits (368), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 103/329 (31%), Positives = 155/329 (47%), Gaps = 32/329 (9%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKY-YF 92
+ NG+ + S +HY R W ++ K G+N + +YVFWN HE PGK+ +
Sbjct: 88 QFVYNGKPMQLHSGEMHYARVPAPYWRHRMKMMKAMGLNAVATYVFWNYHETEPGKWDWK 147
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G NL +F+K + M +ILR GP+ AE+ +GG P WL G V R D +PF
Sbjct: 148 TGNRNLRQFVKTAAEEGMLVILRPGPYCCAEWEFGGYPWWLSKAKGLVIRADNQPFLDSC 207
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY----ESFYGEGGKRYALWAAKMAV 208
+ ++ + M+ L ++GGPII+ Q ENE+G Y + E + Y+ + +
Sbjct: 208 RVYINQLASQMR--DLQITKGGPIIMVQAENEFGSYVAQRKDIPLETHRAYSAKIKQQLL 265
Query: 209 AQNIGVPWIMCQ-----QFDTPDPVINTCN--------SFYCDQFTPHSPSMPKIWTENW 255
VP + T + + T N +++ + P + E +
Sbjct: 266 DAGFDVPLFTSDGSWLFKGGTIEGALPTANGESDIEKLKKVVNEY--NGGKGPYMVAEFY 323
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT----- 310
PGW + P +E I A++ + G S NYYM HGGTNFG T+G + T
Sbjct: 324 PGWLSHWAEPFPQVSTESIVKQTAKYLENGISF-NYYMVHGGTNFGFTSGANYTTATNLQ 382
Query: 311 ---TSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+API E G PK+ L+ L
Sbjct: 383 PDLTSYDYDAPISEAGW-NTPKYDALRAL 410
>gi|431593417|ref|ZP_19521746.1| beta-galactosidase [Enterococcus faecium E1861]
gi|430591294|gb|ELB29332.1| beta-galactosidase [Enterococcus faecium E1861]
Length = 595
Score = 146 bits (368), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 103/318 (32%), Positives = 151/318 (47%), Gaps = 36/318 (11%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+++G IIS AIHY R P W + K G NT+E+Y+ WN HE G + F
Sbjct: 9 EFLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFS 68
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G N+V+F+KI Q+ + +ILR ++ AE+ +GG+P WL P R+ F ++
Sbjct: 69 GFKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDIRVRSTDPRFMEKLK 128
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIG 213
+ ++ + K L +QGGP+I+ Q+ENEYG Y K Y ++ +A +I
Sbjct: 129 NYYQVL--LPKLAPLQITQGGPVIMMQLENEYGSYGM-----EKSYLRQTKELMLAHSID 181
Query: 214 VPWIMCQQ--FDTPDP-VINTCNSFYCDQFTPHSPSMPKIWTE-------NWP------- 256
VP + D ++ + F F HS ++ E NWP
Sbjct: 182 VPLFTSDGAWLEVLDAGILIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYW 241
Query: 257 -GWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--------P 307
GWF +G R E++A V + G N YM+HGGTNFG G P
Sbjct: 242 DGWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLP 299
Query: 308 FITTSYDYEAPIDEYGLP 325
I TSYDY+A ++E G P
Sbjct: 300 QI-TSYDYDALLNEAGQP 316
>gi|333384209|ref|ZP_08475850.1| hypothetical protein HMPREF9455_04016 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826788|gb|EGJ99602.1| hypothetical protein HMPREF9455_04016 [Dysgonomonas gadei ATCC
BAA-286]
Length = 632
Score = 146 bits (368), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 105/338 (31%), Positives = 152/338 (44%), Gaps = 51/338 (15%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+ +G+ IIS +HYPR W +Q K G+N + +YVFWN HE PGK+ F
Sbjct: 36 DFVYDGKPVRIISGEMHYPRIPHQYWRHRMQMLKAMGLNAVATYVFWNAHEPEPGKWDFT 95
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
NL ++IKI + + +ILR GP+V AE+ +GG P WL + R D E F
Sbjct: 96 EDKNLAEYIKIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNVEEMELRRDNEQF----L 151
Query: 154 KFMTLIVDMMKRE--KLFASQGGPIILAQVENEYGYYESFYG----EGGKRYALWAAKMA 207
K+ L ++ + +E L ++GGPII+ Q ENE+G Y S E +RY +
Sbjct: 152 KYTQLYINRLYQEVGNLQITKGGPIIMVQAENEFGSYVSQRKDIPLEEHRRYNAKIVQQL 211
Query: 208 VAQNIGVP-------WIM--------------CQQFDTPDPVINTCNSFYCDQFTPHSPS 246
+P W+ D V+N N
Sbjct: 212 KTAGFDIPSFTSDGSWLFEGGAVPGALPTANGESNIDNLKKVVNRYN----------GGQ 261
Query: 247 MPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
P + E +PGW + P + +A ++ Q S+ NYYM HGGTNFG T+G
Sbjct: 262 GPYMVAEFYPGWLAHWVEPHPQVSATSVARQTEKYLQNDVSI-NYYMVHGGTNFGFTSGA 320
Query: 307 PFIT--------TSYDYEAPIDEYGLPRNPKWGHLKEL 336
+ TSYDY+AP+ E G PK+ L+ +
Sbjct: 321 NYDKKHDIQPDLTSYDYDAPVSEAGW-VTPKFDSLRNV 357
>gi|257143787|emb|CAZ44333.1| beta-D-galactosidase [Paenibacillus thiaminolyticus]
Length = 583
Score = 145 bits (367), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 105/334 (31%), Positives = 159/334 (47%), Gaps = 36/334 (10%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
++YD + R +IS AIHY R VP W +++ K G N IE+YV WN HE
Sbjct: 3 TLSYDEGQFKMGDRPIQLISGAIHYFRIVPAYWEDRLRKIKAMGCNCIETYVAWNVHEPR 62
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+++F ++ +F+++ + +Y+I+R P++ AE+ +GG+P WL + ND
Sbjct: 63 EGEFHFERMADVAEFVRLAGELGLYVIVRPSPYICAEWEFGGLPAWLLKDDMRLRCNDPR 122
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
+ + L+ + L A++GGPII Q+ENEYG Y G A A+
Sbjct: 123 FLEKVSAYYDALLPQLT---PLLATKGGPIIAVQIENEYGSY-------GNDQAYLQAQR 172
Query: 207 AVAQNIGVPWIM---------CQQFDTPDPVINTCN-----SFYCDQFTPHSPSMPKIWT 252
A+ GV ++ Q + V+ T N D+ + P P +
Sbjct: 173 AMLIERGVDVLLFTSDGPQDDMLQGGMAEGVLATVNFGSRPKEAFDKLKEYQPDGPLMCM 232
Query: 253 ENWPGWFKTFGGRDPH--RPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF-- 308
E W GWF + +PH R ++D A + G SV N+YM HGGTNFG +G
Sbjct: 233 EYWNGWFDHW--FEPHHTRDAKDAARVLDDMLGMGASV-NFYMVHGGTNFGFGSGANHSD 289
Query: 309 ----ITTSYDYEAPIDEYGLPRNPKWGHLKELHG 338
TSYDY+A I E G PK+ +E+ G
Sbjct: 290 KYEPTVTSYDYDAAISEAG-DLTPKYHAFREVIG 322
>gi|423295092|ref|ZP_17273219.1| hypothetical protein HMPREF1070_01884 [Bacteroides ovatus
CL03T12C18]
gi|392673998|gb|EIY67449.1| hypothetical protein HMPREF1070_01884 [Bacteroides ovatus
CL03T12C18]
Length = 775
Score = 145 bits (367), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 99/335 (29%), Positives = 156/335 (46%), Gaps = 23/335 (6%)
Query: 4 RTPIAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLV 63
+T + +++ F S V + + I G+ +I +HYPR W +
Sbjct: 6 KTVLVILNIIVSFLISSCSSPKEQVRIGNGTFTIEGKDIQLICGEMHYPRIPHEYWRDRL 65
Query: 64 QQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAE 123
++A G+NT+ +YVFWN HE PG++ F G+ ++ +FI+ Q+ +Y+ILR GP+V AE
Sbjct: 66 KRASAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGPYVCAE 125
Query: 124 YNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVEN 183
+++GG P WL +R+ F + ++++ + + L + GG II+ QVEN
Sbjct: 126 WDFGGYPSWLLKEKDMTYRSKDPRFLSYCERYIKELGKQLS--PLTINNGGNIIMVQVEN 183
Query: 184 EYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQ-----QFDTPDPVINTCNSFYCD 238
EYG Y + K Y M VP C + + + T N + +
Sbjct: 184 EYGSYAA-----DKEYLAAIRDMIKEAGFNVPLFTCDGGGQVEAGHVEGALPTLNGVFGE 238
Query: 239 QF----TPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMY 294
+ P E +P WF +G R E A + G SV + YM+
Sbjct: 239 DIFKVVDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLSHGVSV-SMYMF 297
Query: 295 HGGTNF----GRTAGGPF--ITTSYDYEAPIDEYG 323
HGGTNF G GG + TSYDY+AP+ E+G
Sbjct: 298 HGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWG 332
>gi|160887166|ref|ZP_02068169.1| hypothetical protein BACOVA_05182 [Bacteroides ovatus ATCC 8483]
gi|156107577|gb|EDO09322.1| glycosyl hydrolase family 35 [Bacteroides ovatus ATCC 8483]
Length = 777
Score = 145 bits (367), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 99/335 (29%), Positives = 156/335 (46%), Gaps = 23/335 (6%)
Query: 4 RTPIAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLV 63
+T + +++ F S V + + I G+ +I +HYPR W +
Sbjct: 8 KTVLVILNIIVSFLISSCSSPKEQVRIGNGTFTIEGKDIQLICGEMHYPRIPHEYWRDRL 67
Query: 64 QQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAE 123
++A G+NT+ +YVFWN HE PG++ F G+ ++ +FI+ Q+ +Y+ILR GP+V AE
Sbjct: 68 KRASAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGPYVCAE 127
Query: 124 YNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVEN 183
+++GG P WL +R+ F + ++++ + + L + GG II+ QVEN
Sbjct: 128 WDFGGYPSWLLKEKDMTYRSKDPRFLSYCERYIKELGKQLS--PLTINNGGNIIMVQVEN 185
Query: 184 EYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQ-----QFDTPDPVINTCNSFYCD 238
EYG Y + K Y M VP C + + + T N + +
Sbjct: 186 EYGSYAA-----DKEYLAAIRDMIKEAGFNVPLFTCDGGGQVEAGHVEGALPTLNGVFGE 240
Query: 239 QF----TPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMY 294
+ P E +P WF +G R E A + G SV + YM+
Sbjct: 241 DIFKVVDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLSHGVSV-SMYMF 299
Query: 295 HGGTNF----GRTAGGPF--ITTSYDYEAPIDEYG 323
HGGTNF G GG + TSYDY+AP+ E+G
Sbjct: 300 HGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWG 334
>gi|329960238|ref|ZP_08298680.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
gi|328532911|gb|EGF59688.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
Length = 778
Score = 145 bits (367), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 105/351 (29%), Positives = 163/351 (46%), Gaps = 35/351 (9%)
Query: 10 FALLIFFSSSITYCFAGNVTYD----SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQ 65
F + + ++ C N + +++ +++G+ +I +A +HY R W +Q
Sbjct: 9 FGVAVLITAIFMGCSTSNKSQTFEVGNQTFLLDGKPFIIKAAEMHYTRIPAEYWEHRIQM 68
Query: 66 AKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYN 125
K G+NTI Y FWN HE PG++ F G+ ++ +F ++ Q+ MY++LR GP+V +E+
Sbjct: 69 CKALGMNTICIYAFWNIHEQRPGEFDFKGQNDIAEFCRLAQKNGMYIMLRPGPYVCSEWE 128
Query: 126 YGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEY 185
GG+P WL R + F + FM I + L A +GG II+ QVENEY
Sbjct: 129 MGGLPWWLLKKKDIQLRTNDPYFLERTKLFMNEIGKQLA--DLQAPRGGNIIMVQVENEY 186
Query: 186 GYYESFYGEGGKRYALWAAKMAVAQNIG---VPWIMCQ-----QFDTPDPVINTCN---- 233
G Y K Y A + + G VP C Q + D ++ T N
Sbjct: 187 GGYAV-----NKEYI--ANVRDIVRGAGFTDVPLFQCDWSSTFQLNGLDDLLWTINFGTG 239
Query: 234 ---SFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHN 290
P P + +E W GWF +G + R +E + + + S +
Sbjct: 240 ANIDAQFKSLKEARPDAPLMCSEFWSGWFDHWGRKHETRDAETMVSGLKDMLDRNISF-S 298
Query: 291 YYMYHGGTNFGRTAGG---PF--ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
YM HGGT FG G P+ + +SYDY+API E G PK+ L+E+
Sbjct: 299 LYMAHGGTTFGHWGGANCPPYSAMCSSYDYDAPISEAGWA-TPKYYKLREM 348
>gi|193695178|ref|XP_001948549.1| PREDICTED: beta-galactosidase-like [Acyrthosiphon pisum]
Length = 640
Score = 145 bits (367), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 108/329 (32%), Positives = 154/329 (46%), Gaps = 46/329 (13%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V Y+ + +G +S +HY R W +Q+ K G+N I +YV W+ HE P
Sbjct: 31 VDYEKNEFLKDGEVFRYVSGDLHYFRVPKSYWKDRIQKIKAAGLNAITTYVEWSLHEPFP 90
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIP--GTVFRNDT 145
G Y F G +L FIK+IQ MY++LR GP++ AE ++GG P WL + G++ ND+
Sbjct: 91 GTYNFEGMADLEYFIKLIQDEGMYLLLRPGPYICAERDFGGFPYWLLNVTPKGSLRTNDS 150
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
KY Q F L M K + GG II+ QVENEYG Y + Y LW
Sbjct: 151 SYKKYVSQWFSVL---MKKMQPHLYGNGGNIIMVQVENEYGSYYA----CDSDYKLWLRD 203
Query: 206 M--------AVAQNIGVPWIMCQQFDT---PDPVIN-------TCNSFYC-DQFTPHSPS 246
+ A+ I + C+Q D P P + + N+ C D +
Sbjct: 204 LLKGYVEDKALLYTIDI----CRQRDFDCGPIPEVYATVDFGISVNAATCFDFLKNYQKG 259
Query: 247 MPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
P + +E +PGW + P S+D+ + S ++YM+HGGTNFG T+G
Sbjct: 260 GPSVNSEFYPGWLAHWQEPHPKVNSDDVVNHMKSMLSLNASF-SFYMFHGGTNFGFTSGA 318
Query: 307 ------------PFITTSYDYEAPIDEYG 323
P + TSYDY+API E G
Sbjct: 319 NTNESDANIGYLPQL-TSYDYDAPITEAG 346
>gi|445497922|ref|ZP_21464777.1| beta-galactosidase BgaC [Janthinobacterium sp. HH01]
gi|444787917|gb|ELX09465.1| beta-galactosidase BgaC [Janthinobacterium sp. HH01]
Length = 624
Score = 145 bits (367), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 104/332 (31%), Positives = 157/332 (47%), Gaps = 38/332 (11%)
Query: 31 DSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKY 90
D ++G+ +I S +HYPR W ++ A+ G+NT+ +Y FW+ HE PG++
Sbjct: 36 DGAHFKLDGQPFVIRSGEMHYPRIPRAAWRERLRMARAMGLNTVTTYAFWSQHEPEPGQW 95
Query: 91 YFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRN-DTEPFK 149
F G+ +L FIK + + ++LR GP+V AE ++GG P WL G R+ D
Sbjct: 96 SFSGQNDLRTFIKTAAEEGLNVVLRPGPYVCAEVDFGGFPAWLMRTQGLRVRSMDARYLA 155
Query: 150 YHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVA 209
+ F L ++ L +S+GGPI++ Q+ENEYG Y G L A + +
Sbjct: 156 ASARYFKRLAQEV---ADLQSSRGGPILMLQLENEYGSY------GRDHDYLRAVRTQMR 206
Query: 210 Q-NIGVPWIMCQ-----------QFDTPDPVIN-----TCNSFYCDQFTPHSPSMPKIWT 252
Q P D P V+N + P P++
Sbjct: 207 QAGFDAPLFTSDGGAGRLFEGGTLADVP-AVVNFGGGADDAQASVQELAAWRPHGPRMAG 265
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI--- 309
E W GWF +G + + E+ A +V R +G S N YM+HGGT+FG AG +
Sbjct: 266 EYWAGWFDHWGEQHHTQSPEEAARTVERMLSQGVSF-NLYMFHGGTSFGWLAGANYSGSE 324
Query: 310 -----TTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TTSYDY+A +DE G P PK+ L+++
Sbjct: 325 PYQPDTTSYDYDAALDEAGRP-TPKYFALRDV 355
>gi|440698010|ref|ZP_20880386.1| glycosyl hydrolase family 35 [Streptomyces turgidiscabies Car8]
gi|440279645|gb|ELP67504.1| glycosyl hydrolase family 35 [Streptomyces turgidiscabies Car8]
Length = 586
Score = 145 bits (367), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 101/327 (30%), Positives = 156/327 (47%), Gaps = 27/327 (8%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+T S +++G IIS A+HY R P W +++A+ G+NT+E+YV WN H+ P
Sbjct: 4 LTTTSDGFLLHGEPFRIISGAMHYFRVHPDQWADRLRKARLMGLNTVETYVPWNLHQPEP 63
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G G +L +++++ Q ++++LR GPF+ AE++ GG+P WL P R+
Sbjct: 64 GTLALDGILDLPRYLRLAQAEGLHVLLRPGPFICAEWDGGGLPSWLTTDPDIRLRSSDPR 123
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
F + +++ L++ + A GGP+I QVENEYG Y Y A+
Sbjct: 124 FTGAIDRYLDLLLPPLLPYL--AESGGPVIAVQVENEYGAYGD-----DAAYLEHLAEAL 176
Query: 208 VAQNIGVPWIMCQQFDTPD------PVINTCNSF------YCDQFTPHSPSMPKIWTENW 255
++ IG C Q + P + T +F +Q H P P + E W
Sbjct: 177 RSRGIGELLFTCDQANPEHLAAGSLPGVLTTGTFGSKVAASLEQLRAHQPEGPLMCAEFW 236
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF------I 309
GWF + G + H A + G+ N YM+HGGTNF T G +
Sbjct: 237 IGWFDHW-GEEHHTRDAADAAADLDRLLSAGASVNIYMFHGGTNFAFTNGANHDHAYQPM 295
Query: 310 TTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+A + E G P PK+ +E+
Sbjct: 296 VTSYDYDAALSENGDP-GPKYHAFREV 321
>gi|312903555|ref|ZP_07762735.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
gi|422689128|ref|ZP_16747240.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
gi|422731840|ref|ZP_16788189.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
gi|310633431|gb|EFQ16714.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
gi|315162138|gb|EFU06155.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
gi|315577890|gb|EFU90081.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
Length = 604
Score = 145 bits (367), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 109/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV W+ HE G ++F
Sbjct: 18 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWDLHEPQKGTFHF 77
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 78 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 136
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG SF E K Y + +A+ +
Sbjct: 137 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG---SFGEE--KAYLRAIRDLMIARGV 189
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 190 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 246
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 247 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 300
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 301 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|293376766|ref|ZP_06622988.1| glycosyl hydrolase family 35 [Turicibacter sanguinis PC909]
gi|292644632|gb|EFF62720.1| glycosyl hydrolase family 35 [Turicibacter sanguinis PC909]
Length = 589
Score = 145 bits (367), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 101/319 (31%), Positives = 153/319 (47%), Gaps = 36/319 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
+++G+ I+S AIHY R +P W + K G NT+E+YV WN HE+ G++ F
Sbjct: 8 EEFLVDGKPTRIMSGAIHYFRIMPDHWEHSLYNLKALGFNTVETYVPWNLHEMREGQFDF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +LV F+K ++ + +ILR GP++ AE+ GG+P WL R D E F +
Sbjct: 68 TGGKDLVSFVKKAEEIGLMVILRPGPYICAEWENGGLPAWLLNYHDMKIRCDDELFLEKV 127
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ ++ L ++GGP+I+ QVENEYG + + K Y KM I
Sbjct: 128 ENYFKVLLPLIV--PLQVTKGGPVIMVQVENEYGSFSN-----DKLYLRALKKMIEDAGI 180
Query: 213 GVP-------WIMCQQFDT--PDPVINTCN-------SFYCDQ--FTPHSPSMPKIWTEN 254
VP W T + V+ T N +F Q H P + E
Sbjct: 181 DVPLFTSDGAWEQALMSGTLIEEEVLVTANFGSRGNENFDVLQSFMEKHDKKWPLMCMEF 240
Query: 255 WPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-------- 306
W GWF + R ++++ + Q+G N YM+HGGTNFG G
Sbjct: 241 WCGWFNRWNEDIILRDADEVMTCMKELLQRGSL--NLYMFHGGTNFGFMNGSCAGKIGNL 298
Query: 307 PFITTSYDYEAPIDEYGLP 325
P + TSYDY+A + E+G P
Sbjct: 299 PQV-TSYDYDAFLTEWGDP 316
>gi|329962091|ref|ZP_08300102.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
gi|328530739|gb|EGF57597.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
Length = 632
Score = 145 bits (367), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 107/318 (33%), Positives = 151/318 (47%), Gaps = 36/318 (11%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+ +G+ IIS +HY R W ++ K G+N + +YVFWN HE PGK+ F
Sbjct: 33 QFVYDGKAIRIISGEMHYARIPHQYWRHRMKMLKAMGLNAVATYVFWNLHEPEPGKWDFS 92
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G NL ++I+I + + +ILR GP+V AE+ +GG P WL + G R D E F
Sbjct: 93 GDRNLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNVEGMELRRDNEQF----L 148
Query: 154 KFMTLIVDMMKRE--KLFASQGGPIILAQVENEYGYYES----FYGEGGKRYALWAAKMA 207
K+ L ++ + +E KL +QGGPII+ Q ENE+G Y S E + Y K
Sbjct: 149 KYTKLYLERLYKEVGKLQITQGGPIIMVQGENEFGSYVSQRKDITLEEHRAYNAKIIKQL 208
Query: 208 VAQNIGVP-------WIMCQQFDTPD--PVINTCNSF-----YCDQFTPHSPSMPKIWTE 253
VP W+ + P P N N+ +Q+ + P + E
Sbjct: 209 KEVGFDVPMFTSDGSWLFEGGY-VPGALPTANGENNIENLKKVVNQY--NGGQGPYMVAE 265
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT--- 310
+PGW + P + IA ++ G S NYYM HGGTNFG T+G +
Sbjct: 266 FYPGWLAHWCEPHPQVKASTIARQTEKYLANGVSF-NYYMVHGGTNFGFTSGANYDKKHD 324
Query: 311 -----TSYDYEAPIDEYG 323
TSYDY+API E G
Sbjct: 325 IQPDLTSYDYDAPISEAG 342
>gi|384108880|ref|ZP_10009768.1| Beta-galactosidase [Treponema sp. JC4]
gi|383869584|gb|EID85195.1| Beta-galactosidase [Treponema sp. JC4]
Length = 592
Score = 145 bits (366), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 95/317 (29%), Positives = 151/317 (47%), Gaps = 40/317 (12%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+ +++G+ IIS +IHY R VP W +++ K G NT+E+Y+ WN E G++ F
Sbjct: 9 TFLLDGKPFQIISGSIHYFRVVPEYWQDRLEKLKNMGCNTVETYIPWNITEPRKGEFCFD 68
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G + KF+ + Q+ +Y I+R P++ AE+ GG+P W+ +PG R EP+ +++
Sbjct: 69 GLCDFEKFLDLAQKLGLYAIVRPSPYICAEWELGGLPSWIFTVPGLEPRCKNEPYYQNVR 128
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIG 213
+ +++ + ++ +GG IIL Q+ENEYGYY Y + + I
Sbjct: 129 DYYKVLLPRLVNHQI--DKGGNIILMQIENEYGYYGK-----DMSYMHFLEGLMREGGIT 181
Query: 214 VPWIMCQ----------QFDTPDPVINTCNSFYCDQFTPHSPSM-----------PKIWT 252
VP++ Q D P N + P +M P +
Sbjct: 182 VPFVTSDGPWGKMFIHGQCDGALPTGN-----FGSHARPLFANMKRMMKKTGNRGPLMCM 236
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI--- 309
E W GWF +G ++ + K G+V N+YM+HGGTNFG G +
Sbjct: 237 EFWIGWFDAWGNKEHKTSKLKRNIKDLNYMLKKGNV-NFYMFHGGTNFGFMNGSNYFTKL 295
Query: 310 ---TTSYDYEAPIDEYG 323
TTSYDY+AP+ E G
Sbjct: 296 TPDTTSYDYDAPLSEDG 312
>gi|219847209|ref|YP_002461642.1| beta-galactosidase [Chloroflexus aggregans DSM 9485]
gi|219541468|gb|ACL23206.1| Beta-galactosidase [Chloroflexus aggregans DSM 9485]
Length = 898
Score = 145 bits (366), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 126/444 (28%), Positives = 201/444 (45%), Gaps = 48/444 (10%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
V + + ++ R ++S IHY R W L++QA+ G+NTI++ + WN HE
Sbjct: 4 TVRVGRQGIELDSRPFYLLSGCIHYFRWPRAEWRPLLEQARWAGLNTIDTVIPWNRHEPQ 63
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFR-NDT 145
PG + F +L F+ + + +I+R GP++ AE+ GG+P WL R ND
Sbjct: 64 PGVFDFADEADLGAFLDLCHDLGLKVIVRPGPYICAEWENGGLPAWLTANGDLRLRTNDP 123
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
++ F TL+ ++ R+ ++GGPIIL Q+ENE+ + YG + L A+
Sbjct: 124 VFLSAVLRWFDTLMPILVPRQH---TRGGPIILCQIENEH-WASGVYGADEHQQTL--AR 177
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHS---PSMPKIWTENWPGWFKTF 262
A + I VP C P S ++ P P I +E W GWF +
Sbjct: 178 AAFERGIEVPQYTCMGATPGYPEFRNGWSGIAEKLVQTRQLWPDNPLIVSELWSGWFDNW 237
Query: 263 GG-RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNF----GRTAGGPFI--TTSYDY 315
GG R + + + + + G + +++M+ GGTNF GRT GG I TT YDY
Sbjct: 238 GGHRQTRKSAAKLDMILHQLTAVGCAGFSHWMWAGGTNFGYWGGRTVGGDLIHMTTGYDY 297
Query: 316 EAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGS--SQEADVYADS--SG 371
+APIDEYG +L E AL+ R +L L ++ + V AD+ G
Sbjct: 298 DAPIDEYG-----------------RLTEKALV-ARRHHLFLSCFGAELSSVLADAVPGG 339
Query: 372 ACAAFLANMDDKNDKTV----VFRNVSYHLPAW---SVSIL--PDCKKVVFNTANVRAQS 422
A + +++ V R PAW V+ L P + V +
Sbjct: 340 ITVIPPAAIAGRSEGGVQPYRTVRAGPTAPPAWRDFCVTFLANPGLEAVTYEVFGPGGDH 399
Query: 423 STVEMVPENLQPSEASPDNGSKGL 446
++E+ P +++P A+ G G+
Sbjct: 400 LSIEVEPTSIRPIFANLPLGESGI 423
>gi|445495533|ref|ZP_21462577.1| beta-galactosidase Bga [Janthinobacterium sp. HH01]
gi|444791694|gb|ELX13241.1| beta-galactosidase Bga [Janthinobacterium sp. HH01]
Length = 586
Score = 145 bits (366), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 100/314 (31%), Positives = 161/314 (51%), Gaps = 37/314 (11%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
+NG+ ++S A+HY R +P +W + + K G+NT+E+YV WN HE + G++ + G
Sbjct: 17 LNGQPFRVLSGALHYFRVLPELWEDRLLKLKAMGLNTVETYVAWNLHEPAAGQFRYEGGL 76
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
+L FI++ + +Y+I+R GPF+ AE+ +GG+P WL P R +P+ +++F
Sbjct: 77 DLAAFIRLAESLGLYVIVRPGPFICAEWEFGGLPAWLLADPYMEVRCCYQPYLEAVRRFY 136
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPW 216
++ + ++ +GGPI+ QVENEYG Y S + Y W ++ + + GV
Sbjct: 137 DDLLPRLLPLQI--QRGGPILAMQVENEYGSYGS-----DQLYLTWLRRLML--DGGVET 187
Query: 217 IMCQQFDTPDPVIN-----------TCNSFYCDQFT---PHSPSMPKIWTENWPGWFKTF 262
++ D ++ S ++F + P P + E W GWF +
Sbjct: 188 LLFTSDGATDHMLKHGTLAQVWKSANFGSRAEEEFAKLREYQPDGPLMCMEFWNGWFDHW 247
Query: 263 GGRDPH--RPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--PFIT-------T 311
G +PH R + D A ++ R G V N YM+HGGTNFG G +T
Sbjct: 248 G--EPHHTRDAADAADALERIMACGAHV-NVYMFHGGTNFGFMNGANTDLLTRDYQPTVN 304
Query: 312 SYDYEAPIDEYGLP 325
SYDY+AP+DE G P
Sbjct: 305 SYDYDAPLDETGQP 318
>gi|224542300|ref|ZP_03682839.1| hypothetical protein CATMIT_01478 [Catenibacterium mitsuokai DSM
15897]
gi|224524842|gb|EEF93947.1| glycosyl hydrolase family 35 [Catenibacterium mitsuokai DSM 15897]
Length = 577
Score = 145 bits (366), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 98/332 (29%), Positives = 156/332 (46%), Gaps = 41/332 (12%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
II+G++ IIS A+HY R VP W + K+ G N +E+Y+ WN HE GK+ F
Sbjct: 8 EDFIIDGQKTKIISGAVHYFRIVPEYWEDTLLDLKDMGCNAVETYIPWNLHEPYKGKFDF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G+ ++ F+++ ++ +Y+I+R P++ +E+ GG+P WL R + + H+
Sbjct: 68 DGQKDVCAFLELAKKLGLYVIIRPSPYICSEWELGGLPAWLLKDSDIRLRTNDSVYMKHL 127
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+++ +++ M+ + ++ ++ G IILAQ+ENEYG Y K Y KM I
Sbjct: 128 EEYYAVLLPMIAKYQI--NREGTIILAQLENEYGSYNQ-----DKDYLKALLKMMREYGI 180
Query: 213 GVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIWT 252
VP T + + + F D F H P +
Sbjct: 181 EVPIFTAD--GTWEEALEAGSLFEEDVFPTGNFGSNAKENIAVLKEFMKKHQIVAPIMCM 238
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG------ 306
E W GWF + R E++ S G N+YM+HGGTNFG G
Sbjct: 239 EFWDGWFNRWNMEIVKRDPEELVQSAKEMIDLGSI--NFYMFHGGTNFGWMNGCSARKEH 296
Query: 307 --PFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
P I TSYDY+A + EYG + K+ L+++
Sbjct: 297 DLPQI-TSYDYDAILTEYG-AKTEKYHLLRKM 326
>gi|325845662|ref|ZP_08168945.1| putative beta-galactosidase [Turicibacter sp. HGF1]
gi|325488263|gb|EGC90689.1| putative beta-galactosidase [Turicibacter sp. HGF1]
Length = 589
Score = 145 bits (366), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 101/319 (31%), Positives = 153/319 (47%), Gaps = 36/319 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
+++G+ I+S AIHY R +P W + K G NT+E+YV WN HE+ G++ F
Sbjct: 8 EEFLVDGKPTRIMSGAIHYFRIMPDHWEHSLYNLKALGFNTVETYVPWNLHEMREGQFDF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +LV F+K ++ + +ILR GP++ AE+ GG+P WL R D E F +
Sbjct: 68 TGGKDLVSFVKKAEEIGLMVILRPGPYICAEWENGGLPAWLLNYHDMKIRCDDELFLEKV 127
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ ++ L ++GGP+I+ QVENEYG + + K Y KM I
Sbjct: 128 ENYFKVLLPLIV--PLQVTKGGPVIMVQVENEYGSFSN-----DKLYLRALKKMIEDAGI 180
Query: 213 GVP-------WIMCQQFDT--PDPVINTCN-------SFYCDQ--FTPHSPSMPKIWTEN 254
VP W T + V+ T N +F Q H P + E
Sbjct: 181 DVPLFTSDGAWEQALMSGTLIEEEVLVTANFGSRGNENFDVLQSFMEKHDKKWPLMCMEF 240
Query: 255 WPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-------- 306
W GWF + R ++++ + Q+G N YM+HGGTNFG G
Sbjct: 241 WCGWFNRWNEDIILRDADEVMTCMKELLQRGSL--NLYMFHGGTNFGFMNGSCAGKIGNL 298
Query: 307 PFITTSYDYEAPIDEYGLP 325
P + TSYDY+A + E+G P
Sbjct: 299 PQV-TSYDYDAFLTEWGDP 316
>gi|257090118|ref|ZP_05584479.1| beta-galactosidase [Enterococcus faecalis CH188]
gi|256998930|gb|EEU85450.1| beta-galactosidase [Enterococcus faecalis CH188]
Length = 594
Score = 145 bits (366), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 109/338 (32%), Positives = 166/338 (49%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV W+ HE G ++F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWDLHEPQKGTFHF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG SF E K Y + +A+ +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG---SFGEE--KAYLRAIRDLMIARGV 179
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 290
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 291 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|298384202|ref|ZP_06993762.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
gi|383123627|ref|ZP_09944306.1| hypothetical protein BSIG_3219 [Bacteroides sp. 1_1_6]
gi|251839745|gb|EES67828.1| hypothetical protein BSIG_3219 [Bacteroides sp. 1_1_6]
gi|298262481|gb|EFI05345.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
Length = 624
Score = 145 bits (366), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 103/318 (32%), Positives = 152/318 (47%), Gaps = 31/318 (9%)
Query: 44 IISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIK 103
I+S +HY R W +Q K G+NT+ +YVFWN HE+ PGK+ F G NL ++I+
Sbjct: 40 ILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNLAEYIR 99
Query: 104 IIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMM 163
I + M +ILR GP+V AE+ +GG P WL IPG R D F + +K++ + + +
Sbjct: 100 IAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNTEFLKYTKKYIDRLYEEV 159
Query: 164 KREKLFASQGGPIILAQVENEYGYYESFYG----EGGKRYALWAAKMAVAQNIGVP---- 215
L ++GGPII+ Q ENE+G Y S E + Y +P
Sbjct: 160 G--DLQCTKGGPIIMVQCENEFGSYVSQRKDIPLEEHRSYNAKIKGQLADAGFTIPLFTS 217
Query: 216 ---WI-----MCQQFDTPDPVINTCN-SFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRD 266
W+ + T + + N +Q+ H P + E + GW +G
Sbjct: 218 DGSWLFEGGCVAGALPTANGESDIANLKKVVNQY--HGDKGPYMVAEFYSGWLSHWGEPF 275
Query: 267 PHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT--------TSYDYEAP 318
P + +IA + Q S N+YM HGGTNFG T+G + TSYDY+AP
Sbjct: 276 PQVSASEIARQTEAYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDLTSYDYDAP 334
Query: 319 IDEYGLPRNPKWGHLKEL 336
I E G PK+ ++ +
Sbjct: 335 ISEAGW-LTPKYDSIRSV 351
>gi|431741495|ref|ZP_19530400.1| beta-galactosidase [Enterococcus faecium E2039]
gi|430601673|gb|ELB39267.1| beta-galactosidase [Enterococcus faecium E2039]
Length = 595
Score = 145 bits (366), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 104/318 (32%), Positives = 151/318 (47%), Gaps = 36/318 (11%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+++G IIS AIHY R P W + K G NT+E+Y+ WN HE G + F
Sbjct: 9 EFLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFS 68
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G N+V+F+KI Q+ + +ILR ++ AE+ +GG+P WL P R+ F ++
Sbjct: 69 GFKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDIRVRSTDPRFMEKLK 128
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIG 213
+ ++ + K L +QGGP+I+ Q+ENEYG Y K Y ++ +A +I
Sbjct: 129 NYYQVL--LPKLAPLQITQGGPVIMMQLENEYGSYGM-----EKSYLRQTKELMLAHSID 181
Query: 214 VP-------WIMCQQFDTP-DPVINTCNSF---------YCDQFTP-HSPSMPKIWTENW 255
VP W+ T D I +F +F H + P + E W
Sbjct: 182 VPLFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYW 241
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--------P 307
GWF +G R E++A V + G N YM+HGGTNFG G P
Sbjct: 242 DGWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLP 299
Query: 308 FITTSYDYEAPIDEYGLP 325
I TSYDY+A ++E G P
Sbjct: 300 QI-TSYDYDALLNEAGQP 316
>gi|329927841|ref|ZP_08281902.1| beta-galactosidase [Paenibacillus sp. HGF5]
gi|328938242|gb|EGG34637.1| beta-galactosidase [Paenibacillus sp. HGF5]
Length = 619
Score = 145 bits (366), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 100/319 (31%), Positives = 158/319 (49%), Gaps = 32/319 (10%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
G +T+ + +++G+ IIS AIHY R VP W + + K G NT+E+Y+ WN HE
Sbjct: 2 GMLTWGNGQYLLDGQPYRIISGAIHYFRVVPEYWEDRLLKLKACGFNTVETYIAWNVHEP 61
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
GK+ F G ++ FI++ + +++I+R PF+ AE+ +GG+P WL G + +
Sbjct: 62 QEGKFSFSGMADVASFIELAGKLGLHVIVRPSPFICAEWEFGGLPGWLLGY-GEIRLRCS 120
Query: 146 EPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAA 204
+P ++ K +++ R L +S GGPI+ QVENEYG Y G +A
Sbjct: 121 DPL--YLSKVDHYYDELIPRLVPLLSSNGGPILAVQVENEYGSY-------GNDHAYLDY 171
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVI--NTCNSFYCD------------QFTPHSPSMPKI 250
A G+ ++ D ++ T N + ++ + P +
Sbjct: 172 LRAGLVRRGIDVLLFTSDGPTDEMLLGGTLNDVHATVNFGSRVEESFRKYREYRTEEPLM 231
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI- 309
E W GWF + R + D+A + +KG S+ N YM+HGGTNFG +G I
Sbjct: 232 VMEFWNGWFDHWMEDHHVRDAADVAGVLDEMLEKGSSM-NMYMFHGGTNFGFYSGANHIQ 290
Query: 310 -----TTSYDYEAPIDEYG 323
TTSYDY+AP+ E+G
Sbjct: 291 TYEPTTTSYDYDAPLTEWG 309
>gi|293570811|ref|ZP_06681858.1| beta-galactosidase [Enterococcus faecium E980]
gi|430840422|ref|ZP_19458347.1| beta-galactosidase [Enterococcus faecium E1007]
gi|431064256|ref|ZP_19493603.1| beta-galactosidase [Enterococcus faecium E1604]
gi|431124630|ref|ZP_19498626.1| beta-galactosidase [Enterococcus faecium E1613]
gi|431738579|ref|ZP_19527522.1| beta-galactosidase [Enterococcus faecium E1972]
gi|291609079|gb|EFF38354.1| beta-galactosidase [Enterococcus faecium E980]
gi|430495187|gb|ELA71394.1| beta-galactosidase [Enterococcus faecium E1007]
gi|430566915|gb|ELB06003.1| beta-galactosidase [Enterococcus faecium E1613]
gi|430568897|gb|ELB07927.1| beta-galactosidase [Enterococcus faecium E1604]
gi|430597307|gb|ELB35110.1| beta-galactosidase [Enterococcus faecium E1972]
Length = 595
Score = 145 bits (366), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 104/318 (32%), Positives = 151/318 (47%), Gaps = 36/318 (11%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+++G IIS AIHY R P W + K G NT+E+Y+ WN HE G + F
Sbjct: 9 EFLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFS 68
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G N+V+F+KI Q+ + +ILR ++ AE+ +GG+P WL P R+ F ++
Sbjct: 69 GFKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDIRVRSTDPRFMEKLK 128
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIG 213
+ ++ + K L +QGGP+I+ Q+ENEYG Y K Y ++ +A +I
Sbjct: 129 NYYQVL--LPKLAPLQITQGGPVIMMQLENEYGSYGM-----EKSYLRQTKELMLAHSID 181
Query: 214 VP-------WIMCQQFDTP-DPVINTCNSF---------YCDQFTP-HSPSMPKIWTENW 255
VP W+ T D I +F +F H + P + E W
Sbjct: 182 VPLFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYW 241
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--------P 307
GWF +G R E++A V + G N YM+HGGTNFG G P
Sbjct: 242 DGWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLP 299
Query: 308 FITTSYDYEAPIDEYGLP 325
I TSYDY+A ++E G P
Sbjct: 300 QI-TSYDYDALLNEAGQP 316
>gi|424759896|ref|ZP_18187551.1| putative beta-galactosidase [Enterococcus faecalis R508]
gi|402403967|gb|EJV36601.1| putative beta-galactosidase [Enterococcus faecalis R508]
Length = 604
Score = 145 bits (366), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 108/338 (31%), Positives = 165/338 (48%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++N + I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 18 EEFLLNDQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 77
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 78 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 136
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG + GE K Y + +A+ +
Sbjct: 137 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYGSF----GEE-KAYLRAIRDLMIARGV 189
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 190 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 246
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGGTNFG G
Sbjct: 247 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCS 300
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 301 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|384513478|ref|YP_005708571.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|430361754|ref|ZP_19426831.1| putative beta-galactosidase [Enterococcus faecalis OG1X]
gi|327535367|gb|AEA94201.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|429512307|gb|ELA01915.1| putative beta-galactosidase [Enterococcus faecalis OG1X]
Length = 604
Score = 145 bits (366), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 106/330 (32%), Positives = 162/330 (49%), Gaps = 35/330 (10%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 18 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 77
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 78 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 136
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY--ESFYGEGGKRYALWAAKMAVAQ 210
++ ++++ + +L GG I++ Q+ENEYG + E Y + + A+
Sbjct: 137 AEYYDVLMEKIVPHQLV--NGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTALFF 194
Query: 211 NIGVPWIMCQQFDT--PDPVINTCN-------SFYCDQ--FTPHSPSMPKIWTENWPGWF 259
PW + + D ++ T N +F Q F H P + E W GWF
Sbjct: 195 TSDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWF 254
Query: 260 KTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--------P 307
+ RDP +E + ++A GS+ N YM+HGGTNFG G P
Sbjct: 255 NRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCSARGTIDLP 308
Query: 308 FITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
IT SYDY+AP+DE G P + K LH
Sbjct: 309 QIT-SYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|430368510|ref|ZP_19428251.1| beta-galactosidase [Enterococcus faecalis M7]
gi|429516266|gb|ELA05760.1| beta-galactosidase [Enterococcus faecalis M7]
Length = 594
Score = 145 bits (365), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 106/330 (32%), Positives = 162/330 (49%), Gaps = 35/330 (10%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY--ESFYGEGGKRYALWAAKMAVAQ 210
++ ++++ + +L GG I++ Q+ENEYG + E Y + + A+
Sbjct: 127 AEYYDVLMEKIVPHQLV--NGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTALFF 184
Query: 211 NIGVPWIMCQQFDT--PDPVINTCN-------SFYCDQ--FTPHSPSMPKIWTENWPGWF 259
PW + + D ++ T N +F Q F H P + E W GWF
Sbjct: 185 TSDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWF 244
Query: 260 KTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--------P 307
+ RDP +E + ++A GS+ N YM+HGGTNFG G P
Sbjct: 245 NRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFGFMNGCSARGTIDLP 298
Query: 308 FITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
IT SYDY+AP+DE G P + K LH
Sbjct: 299 QIT-SYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|227552575|ref|ZP_03982624.1| possible beta-galactosidase [Enterococcus faecium TX1330]
gi|257896912|ref|ZP_05676565.1| glycosyl hydrolase [Enterococcus faecium Com12]
gi|293379016|ref|ZP_06625170.1| glycosyl hydrolase family 35 [Enterococcus faecium PC4.1]
gi|431750982|ref|ZP_19539676.1| beta-galactosidase [Enterococcus faecium E2620]
gi|227178324|gb|EEI59296.1| possible beta-galactosidase [Enterococcus faecium TX1330]
gi|257833477|gb|EEV59898.1| glycosyl hydrolase [Enterococcus faecium Com12]
gi|292642358|gb|EFF60514.1| glycosyl hydrolase family 35 [Enterococcus faecium PC4.1]
gi|430616240|gb|ELB53164.1| beta-galactosidase [Enterococcus faecium E2620]
Length = 595
Score = 145 bits (365), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 104/318 (32%), Positives = 151/318 (47%), Gaps = 36/318 (11%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+++G IIS AIHY R P W + K G NT+E+Y+ WN HE G + F
Sbjct: 9 EFLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFS 68
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G N+V+F+KI Q+ + +ILR ++ AE+ +GG+P WL P R+ F ++
Sbjct: 69 GFKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLK 128
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIG 213
+ ++ + K L +QGGP+I+ Q+ENEYG Y K Y ++ +A +I
Sbjct: 129 NYYQVL--LPKLAPLQITQGGPVIMMQLENEYGSYGM-----EKSYLRQTKELMLAHSID 181
Query: 214 VP-------WIMCQQFDTP-DPVINTCNSF---------YCDQFTP-HSPSMPKIWTENW 255
VP W+ T D I +F +F H + P + E W
Sbjct: 182 VPLFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYW 241
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--------P 307
GWF +G R E++A V + G N YM+HGGTNFG G P
Sbjct: 242 DGWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLP 299
Query: 308 FITTSYDYEAPIDEYGLP 325
I TSYDY+A ++E G P
Sbjct: 300 QI-TSYDYDALLNEAGQP 316
>gi|424764212|ref|ZP_18191655.1| putative beta-galactosidase [Enterococcus faecium TX1337RF]
gi|402420907|gb|EJV53177.1| putative beta-galactosidase [Enterococcus faecium TX1337RF]
Length = 595
Score = 145 bits (365), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 104/318 (32%), Positives = 151/318 (47%), Gaps = 36/318 (11%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+++G IIS AIHY R P W + K G NT+E+Y+ WN HE G + F
Sbjct: 9 EFLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFS 68
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G N+V+F+KI Q+ + +ILR ++ AE+ +GG+P WL P R+ F ++
Sbjct: 69 GFKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLK 128
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIG 213
+ ++ + K L +QGGP+I+ Q+ENEYG Y K Y ++ +A +I
Sbjct: 129 NYYQVL--LPKLAPLQITQGGPVIMMQLENEYGSYGM-----EKSYLRQTKELMLAHSID 181
Query: 214 VP-------WIMCQQFDTP-DPVINTCNSF---------YCDQFTP-HSPSMPKIWTENW 255
VP W+ T D I +F +F H + P + E W
Sbjct: 182 VPLFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYW 241
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--------P 307
GWF +G R E++A V + G N YM+HGGTNFG G P
Sbjct: 242 DGWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLP 299
Query: 308 FITTSYDYEAPIDEYGLP 325
I TSYDY+A ++E G P
Sbjct: 300 QI-TSYDYDALLNEAGQP 316
>gi|422735885|ref|ZP_16792151.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
gi|315167420|gb|EFU11437.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
Length = 604
Score = 145 bits (365), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 109/338 (32%), Positives = 165/338 (48%), Gaps = 51/338 (15%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 18 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 77
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 78 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 136
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG SF E K Y + +A+ +
Sbjct: 137 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG---SFGEE--KAYLRAIRDLMIARGV 189
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 190 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 246
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + RDP +E + ++A GS+ N YM+HGG NFG G
Sbjct: 247 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGINFGFMNGCS 300
Query: 307 -------PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P IT SYDY+AP+DE G P + K LH
Sbjct: 301 ARGTIDLPQIT-SYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|257888197|ref|ZP_05667850.1| glycosyl hydrolase [Enterococcus faecium 1,141,733]
gi|431040248|ref|ZP_19492755.1| beta-galactosidase [Enterococcus faecium E1590]
gi|431763679|ref|ZP_19552228.1| beta-galactosidase [Enterococcus faecium E3548]
gi|257824251|gb|EEV51183.1| glycosyl hydrolase [Enterococcus faecium 1,141,733]
gi|430562100|gb|ELB01353.1| beta-galactosidase [Enterococcus faecium E1590]
gi|430622052|gb|ELB58793.1| beta-galactosidase [Enterococcus faecium E3548]
Length = 595
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 104/318 (32%), Positives = 151/318 (47%), Gaps = 36/318 (11%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+++G IIS AIHY R P W + K G NT+E+Y+ WN HE G + F
Sbjct: 9 EFLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFS 68
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G N+V+F+KI Q+ + +ILR ++ AE+ +GG+P WL P R+ F ++
Sbjct: 69 GFKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLK 128
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIG 213
+ ++ + K L +QGGP+I+ Q+ENEYG Y K Y ++ +A +I
Sbjct: 129 NYYQVL--LPKLAPLQITQGGPVIMMQLENEYGSYGM-----EKSYLRQTKELMLAHSID 181
Query: 214 VP-------WIMCQQFDTP-DPVINTCNSF---------YCDQFTP-HSPSMPKIWTENW 255
VP W+ T D I +F +F H + P + E W
Sbjct: 182 VPLFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYW 241
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--------P 307
GWF +G R E++A V + G N YM+HGGTNFG G P
Sbjct: 242 DGWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLP 299
Query: 308 FITTSYDYEAPIDEYGLP 325
I TSYDY+A ++E G P
Sbjct: 300 QI-TSYDYDALLNEAGQP 316
>gi|328721397|ref|XP_003247292.1| PREDICTED: beta-galactosidase-like [Acyrthosiphon pisum]
Length = 628
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 98/337 (29%), Positives = 162/337 (48%), Gaps = 29/337 (8%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V Y+ + +G+ +S ++HY R W +Q+ K G+N I +YV W+ HE P
Sbjct: 17 VDYERNEFLKDGQVFRYVSGSLHYFRVPKPYWKDRIQKMKAAGLNAISTYVEWSLHEPYP 76
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWL-HYIPGTVFRNDTE 146
G+Y F +L F+++++ MY++LR GP++ AE ++GG P WL + +P R +
Sbjct: 77 GEYNFDDIADLEYFLQLVKDEGMYLLLRPGPYICAERDFGGFPFWLLNVVPKKRLRTNDP 136
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGG-------KRY 199
+K+++ K+ ++ M K ++ GG II+ QVENEYG Y + E KRY
Sbjct: 137 SYKHYVTKWFNVL--MPKIDRFLYGNGGNIIMVQVENEYGSYNACDQEYMLWLRDLYKRY 194
Query: 200 ALWAAKMAVAQNIGVPWIMC----QQFDTPDPVINTCNSFYCDQFTPHSPSM-PKIWTEN 254
+ A + G + C + T D + + C ++ + P + +E
Sbjct: 195 VGYKALLYTTDGCGYSYFTCGAIPDVYATVDFGASVKDVSQCFKYMRTTQKRGPLVNSEY 254
Query: 255 WPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-------- 306
+ GW + P S ++ ++ S+ N+YM+HGGTNFG T+G
Sbjct: 255 YAGWLSHWREPSPVISSYEVVETMKDMLALNASI-NFYMFHGGTNFGFTSGANKYESLKN 313
Query: 307 ----PFITTSYDYEAPIDEYGLPRNPKWGHLKELHGA 339
P +T SYDY +P+DE G P + K L G
Sbjct: 314 PDYLPQLT-SYDYNSPLDEAGDPTEKYFKIKKLLEGT 349
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 27/61 (44%), Positives = 37/61 (60%), Gaps = 4/61 (6%)
Query: 613 NNINWVSTMEPPKNQPL-TWYKAVVKQPPG-DEPIG--LDMLKMGKGLAWLNGEEIGRYW 668
N +W ST+EP K+ L +YK K P G +P+ LD+ KG+A++NG IGRYW
Sbjct: 511 NETSWFSTIEPQKDAVLPAFYKTQFKLPDGLTKPLDTYLDVTGWKKGVAFVNGINIGRYW 570
Query: 669 P 669
P
Sbjct: 571 P 571
>gi|167856235|ref|ZP_02478970.1| beta-galactosidase [Haemophilus parasuis 29755]
gi|167852655|gb|EDS23934.1| beta-galactosidase [Haemophilus parasuis 29755]
Length = 596
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 95/322 (29%), Positives = 152/322 (47%), Gaps = 40/322 (12%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
+ ++NG+ I+S A+HY R VP W + K G NT+E+YV WN H+ P ++
Sbjct: 7 EKDFLLNGKPFKILSGAVHYFRIVPEYWYKTLYNLKAMGCNTVETYVPWNLHQPQPDQFN 66
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F R +LVKF++ + +Y+ILR P++ AE+ +GG+P WL IP R + F
Sbjct: 67 FSKRADLVKFLQTAKDLGLYVILRPTPYICAEWEFGGLPAWLLNIPNIRLRQNDPLFIAE 126
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+ ++ ++ + ++ +QGG I++ Q+ENEYG + + K Y + +
Sbjct: 127 IDRYFQELLPRIAPYQI--TQGGNILMMQIENEYGSFGN-----DKNYLRAILALMLIHG 179
Query: 212 IGVP-------WIMCQQFDT--PDPVINTCN------------SFYCDQFTPHSPSMPKI 250
+ VP W + D ++ T N Y D+ H S P +
Sbjct: 180 VNVPLFTSDGAWQNALEAGALIEDDILPTGNFGSRSNENLDELQRYIDK---HGKSYPLM 236
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RT 303
E W GWF + R ++D+A ++ N+YM+ GGTNFG R
Sbjct: 237 CMEFWDGWFNRWKEPVIRRDAQDLADCTKELLERASI--NFYMFQGGTNFGFWNGCSARL 294
Query: 304 AGGPFITTSYDYEAPIDEYGLP 325
TSYDY+AP+ E+G P
Sbjct: 295 DTDLPQVTSYDYDAPVHEWGEP 316
>gi|383110805|ref|ZP_09931623.1| hypothetical protein BSGG_1915 [Bacteroides sp. D2]
gi|313694380|gb|EFS31215.1| hypothetical protein BSGG_1915 [Bacteroides sp. D2]
Length = 778
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 102/331 (30%), Positives = 160/331 (48%), Gaps = 30/331 (9%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
+IFFS++ A + +++G+ ++ +A +HY R W ++ K G+N
Sbjct: 14 VIFFSTAQAQTTARKFEAGKNTFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMN 73
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
TI Y+FWN HE GK+ F G+ ++ F + Q+ MY+I+R GP+V AE+ GG+P W
Sbjct: 74 TICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWW 133
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESF 191
L R +P Y+M++ + ++ K+ L ++GG II+ QVENEYG Y
Sbjct: 134 LLKKKDIALRT-LDP--YYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQVENEYGSY--- 187
Query: 192 YGEGGKRYALWAAKMAVAQN--IGVPWIMCQ-----QFDTPDPVINTCN---SFYCDQ-- 239
G + + A + V ++ VP C + D +I T N DQ
Sbjct: 188 ---GIDKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQF 244
Query: 240 --FTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGG 297
P P + +E W GWF +G + RP++D+ + + S + YM HGG
Sbjct: 245 KKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGG 303
Query: 298 TNFGRTAGG-----PFITTSYDYEAPIDEYG 323
T FG G + +SYDY+API E G
Sbjct: 304 TTFGHWGGANNPAYSAMCSSYDYDAPISEPG 334
>gi|237719727|ref|ZP_04550208.1| beta-galactosidase [Bacteroides sp. 2_2_4]
gi|229450996|gb|EEO56787.1| beta-galactosidase [Bacteroides sp. 2_2_4]
Length = 778
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 102/331 (30%), Positives = 160/331 (48%), Gaps = 30/331 (9%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
+IFFS++ A + +++G+ ++ +A +HY R W ++ K G+N
Sbjct: 14 VIFFSTAQAQTTARKFEAGKNTFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMN 73
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
TI Y+FWN HE GK+ F G+ ++ F + Q+ MY+I+R GP+V AE+ GG+P W
Sbjct: 74 TICIYIFWNIHEQEEGKFDFSGQNDIATFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWW 133
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESF 191
L R +P Y+M++ + ++ K+ L ++GG II+ QVENEYG Y
Sbjct: 134 LLKKKDIALRT-LDP--YYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQVENEYGSY--- 187
Query: 192 YGEGGKRYALWAAKMAVAQN--IGVPWIMCQ-----QFDTPDPVINTCN---SFYCDQ-- 239
G + + A + V ++ VP C + D +I T N DQ
Sbjct: 188 ---GIDKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQF 244
Query: 240 --FTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGG 297
P P + +E W GWF +G + RP++D+ + + S + YM HGG
Sbjct: 245 KKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGG 303
Query: 298 TNFGRTAGG-----PFITTSYDYEAPIDEYG 323
T FG G + +SYDY+API E G
Sbjct: 304 TTFGHWGGANNPAYSAMCSSYDYDAPISEPG 334
>gi|154490061|ref|ZP_02030322.1| hypothetical protein PARMER_00290 [Parabacteroides merdae ATCC
43184]
gi|423723056|ref|ZP_17697209.1| hypothetical protein HMPREF1078_01269 [Parabacteroides merdae
CL09T00C40]
gi|154089210|gb|EDN88254.1| glycosyl hydrolase family 35 [Parabacteroides merdae ATCC 43184]
gi|409241481|gb|EKN34249.1| hypothetical protein HMPREF1078_01269 [Parabacteroides merdae
CL09T00C40]
Length = 780
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 99/316 (31%), Positives = 148/316 (46%), Gaps = 17/316 (5%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+ +++G+ +I +A IHY R W +Q K G+NTI Y FWN HE PG++ F
Sbjct: 39 TFLLDGKPFVIKAAEIHYTRIPAEYWQHRIQMCKALGMNTICIYAFWNIHEQKPGEFDFK 98
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G+ ++ F ++ Q+ MY++LR GP+V +E+ GG+P WL R + F +
Sbjct: 99 GQNDIAAFCRLAQKEGMYIMLRPGPYVCSEWEMGGLPWWLLKKEDIKLRTNDPYFLERTK 158
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGE-GGKRYALWAAKMAVAQNI 212
FM I + L ++GG II+ QVENEYG Y + R A+ AA
Sbjct: 159 LFMNEIGKQLA--DLQVTRGGNIIMVQVENEYGAYATDKAYIANIRDAVKAAGFTDVPLF 216
Query: 213 GVPWIMCQQFDTPDPVINTCN-------SFYCDQFTPHSPSMPKIWTENWPGWFKTFGGR 265
W Q + D ++ T N + P P + +E W GWF +G +
Sbjct: 217 QCDWSSTFQLNGLDDLVWTINFGTGANIDAQFKKLKEARPDAPLMCSEFWSGWFDHWGRK 276
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-----PFITTSYDYEAPID 320
R + + + + S + YM HGGT FG G + +SYDY+API
Sbjct: 277 HETRDAGVMVSGIKDMLDRHISF-SLYMAHGGTTFGHWGGANSPAYSAMCSSYDYDAPIS 335
Query: 321 EYGLPRNPKWGHLKEL 336
E G PK+ L+EL
Sbjct: 336 EAGWA-TPKYYKLREL 350
>gi|410100792|ref|ZP_11295748.1| hypothetical protein HMPREF1076_04926 [Parabacteroides goldsteinii
CL02T12C30]
gi|409214073|gb|EKN07084.1| hypothetical protein HMPREF1076_04926 [Parabacteroides goldsteinii
CL02T12C30]
Length = 779
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 106/349 (30%), Positives = 165/349 (47%), Gaps = 33/349 (9%)
Query: 10 FALLIFFSSSITYCFAGNVTYD--SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAK 67
+LI S N T++ ++ ++NG+ +I +A IHY R W +Q K
Sbjct: 12 MVMLICVLSGCKNQSGSNGTFEIGDKTFLLNGKPFIIKAAEIHYTRIPVEYWEHRIQMCK 71
Query: 68 EGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYG 127
G+NTI Y FWN HE PG++ F G+ ++ F ++ Q+ MY++LR GP+V +E+ G
Sbjct: 72 ALGMNTICIYAFWNIHEQKPGEFDFSGQNDIAAFCRLAQKNGMYIMLRPGPYVCSEWEMG 131
Query: 128 GIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGY 187
G+P WL R + F + +M I + ++ ++GG II+ QVENEYG
Sbjct: 132 GLPWWLLKKEDIQLRTNDPYFIERTRIYMNEIGKQLADRQI--TRGGNIIMVQVENEYGS 189
Query: 188 YESFYGEGGKRYALWAAKMAVAQNIG---VPWIMCQ-----QFDTPDPVINTCN----SF 235
Y + K Y A + ++ G VP C + D ++ T N +
Sbjct: 190 YAT-----DKSYI--AKNRDILRDAGFTDVPLFQCDWSSNFLNNALDDLVWTVNFGTGAN 242
Query: 236 YCDQFTPHS---PSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYY 292
+QF P+ P + +E W GWF +G + R +E + + + S + Y
Sbjct: 243 IDEQFKKLKEVRPNTPLMCSEFWSGWFDHWGRKHETRDAETMIAGLRDMLDRNISF-SLY 301
Query: 293 MYHGGTNFGRTAGG-----PFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
M HGGT FG G + +SYDY+API E G PK+ L+E
Sbjct: 302 MTHGGTTFGHWGGANSPAYSAMCSSYDYDAPISEAGWA-TPKYHKLREF 349
>gi|423346501|ref|ZP_17324189.1| hypothetical protein HMPREF1060_01861 [Parabacteroides merdae
CL03T12C32]
gi|409219652|gb|EKN12612.1| hypothetical protein HMPREF1060_01861 [Parabacteroides merdae
CL03T12C32]
Length = 780
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 103/343 (30%), Positives = 156/343 (45%), Gaps = 17/343 (4%)
Query: 7 IAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQA 66
I +L+F S + + + +++G+ +I +A IHY R W +Q
Sbjct: 12 ITCCVILLFSGCSPRQGEKHDFSIGKGTFLLDGKPFVIKAAEIHYTRIPAEYWQHRIQMC 71
Query: 67 KEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNY 126
K G+NTI Y FWN HE PG++ F G+ ++ F ++ Q+ MY++LR GP+V +E+
Sbjct: 72 KALGMNTICIYAFWNIHEQKPGEFDFKGQNDIAAFCRLAQKEGMYIMLRPGPYVCSEWEM 131
Query: 127 GGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG 186
GG+P WL R + F + FM I + L ++GG II+ QVENEYG
Sbjct: 132 GGLPWWLLKKEDIKLRTNDPYFLERTKLFMNEIGKQLA--DLQVTRGGNIIMVQVENEYG 189
Query: 187 YYESFYGE-GGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCN-------SFYCD 238
Y + R A+ AA W Q + D ++ T N
Sbjct: 190 AYATDKAYIANIRDAVKAAGFTDVPLFQCDWSSTFQLNGLDDLVWTINFGTGANIDAQFK 249
Query: 239 QFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGT 298
+ P P + +E W GWF +G + R + + + + S + YM HGGT
Sbjct: 250 KLKEARPDAPLMCSEFWSGWFDHWGRKHETRDAGVMVSGIKDMLDRHISF-SLYMAHGGT 308
Query: 299 NFGRTAGG-----PFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
FG G + +SYDY+API E G PK+ L+EL
Sbjct: 309 TFGHWGGANSPAYSAMCSSYDYDAPISEAGWA-TPKYYKLREL 350
>gi|295689222|ref|YP_003592915.1| beta-galactosidase [Caulobacter segnis ATCC 21756]
gi|295431125|gb|ADG10297.1| Beta-galactosidase [Caulobacter segnis ATCC 21756]
Length = 617
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 109/340 (32%), Positives = 157/340 (46%), Gaps = 36/340 (10%)
Query: 11 ALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGG 70
AL I S + + A + +G +ISA +HY R W +Q+AK G
Sbjct: 17 ALAILPSDARSAAPAHRFEVSGAGFLKDGAPHQVISAEMHYVRIPRAYWRDRLQKAKTMG 76
Query: 71 VNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIP 130
+NTI +Y FWN HE PG Y F G+ +L FI+ Q + +ILR GP+V +E+ GG P
Sbjct: 77 LNTITTYAFWNVHEPRPGVYDFTGQNDLAAFIRAAQAEGLDVILRPGPYVCSEWELGGYP 136
Query: 131 VWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY-- 188
WL + R+ + ++++M + +K L GGPI+ Q+ENEYG +
Sbjct: 137 SWLLKDRNVLLRSTEPQYAAAVERWMARLGREVK--PLLLKNGGPIVAIQLENEYGAFGD 194
Query: 189 ESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPD---PVINTCNSF-------YCD 238
+ Y EG L A GV + Q D P + + +F
Sbjct: 195 DKAYLEG-----LEATYRRAGLADGVLFTSNQASDLAKGSLPHLPSMVNFGSGGAEKSVA 249
Query: 239 QFTPHSPSMPKIWTENWPGWFKTFGGR----DPHRPSEDIAFSVARFFQKGGSVHNYYMY 294
Q P ++ E W GWF +G D + +E++ F Q+G SV + YM+
Sbjct: 250 QLETFRPDGLRMVGEYWAGWFDKWGEEHHETDGRKEAEELRF----MLQRGYSV-SLYMF 304
Query: 295 HGGTNFGRTAGGPF--------ITTSYDYEAPIDEYGLPR 326
HGGT+FG G TTSYDY+AP+DE G PR
Sbjct: 305 HGGTSFGWMNGADSHTGKDYHPDTTSYDYDAPLDEAGAPR 344
>gi|219870459|ref|YP_002474834.1| beta-galactosidase [Haemophilus parasuis SH0165]
gi|219690663|gb|ACL31886.1| beta-galactosidase, glucosyl hydrolase family protein [Haemophilus
parasuis SH0165]
Length = 596
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 95/322 (29%), Positives = 152/322 (47%), Gaps = 40/322 (12%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
+ ++NG+ I+S A+HY R VP W + K G NT+E+YV WN H+ P ++
Sbjct: 7 EKDFLLNGKPFKILSGAVHYFRIVPEYWYKTLYNLKAMGCNTVETYVPWNLHQPQPDQFN 66
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F R +LVKF++ + +Y+ILR P++ AE+ +GG+P WL IP R + F
Sbjct: 67 FSKRADLVKFLQTAKDLGLYVILRPTPYICAEWEFGGLPAWLLNIPNIRLRQNDPLFIAE 126
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+ ++ ++ + ++ +QGG I++ Q+ENEYG + + K Y + +
Sbjct: 127 IDRYFQELLPRIAPYQI--TQGGNILMMQIENEYGSFGN-----DKNYLRAIRALMLIHG 179
Query: 212 IGVP-------WIMCQQFDT--PDPVINTCN------------SFYCDQFTPHSPSMPKI 250
+ VP W + D ++ T N Y D+ H S P +
Sbjct: 180 VNVPLFTSDGAWQNALEAGALIEDDILPTGNFGSRSNENLDELQRYIDK---HGKSYPLM 236
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RT 303
E W GWF + R ++D+A ++ N+YM+ GGTNFG R
Sbjct: 237 CMEFWDGWFNRWKEPVIRRDAQDLANCTKELLERASI--NFYMFQGGTNFGFWNGCSARL 294
Query: 304 AGGPFITTSYDYEAPIDEYGLP 325
TSYDY+AP+ E+G P
Sbjct: 295 DTDLPQVTSYDYDAPVHEWGEP 316
>gi|423294349|ref|ZP_17272476.1| hypothetical protein HMPREF1070_01141 [Bacteroides ovatus
CL03T12C18]
gi|392675540|gb|EIY68981.1| hypothetical protein HMPREF1070_01141 [Bacteroides ovatus
CL03T12C18]
Length = 778
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 102/331 (30%), Positives = 160/331 (48%), Gaps = 30/331 (9%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
+IFFS++ A + +++G+ ++ +A +HY R W ++ K G+N
Sbjct: 14 VIFFSTAQAQTTARKFEAGKNTFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMN 73
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
TI Y+FWN HE GK+ F G+ ++ F + Q+ MY+I+R GP+V AE+ GG+P W
Sbjct: 74 TICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWW 133
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESF 191
L R +P Y+M++ + ++ K+ L ++GG II+ QVENEYG Y
Sbjct: 134 LLKKKDIALRT-LDP--YYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQVENEYGSY--- 187
Query: 192 YGEGGKRYALWAAKMAVAQN--IGVPWIMCQ-----QFDTPDPVINTCN---SFYCDQ-- 239
G + + A + V ++ VP C + D +I T N DQ
Sbjct: 188 ---GIDKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQF 244
Query: 240 --FTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGG 297
P P + +E W GWF +G + RP++D+ + + S + YM HGG
Sbjct: 245 KKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGG 303
Query: 298 TNFGRTAGG-----PFITTSYDYEAPIDEYG 323
T FG G + +SYDY+API E G
Sbjct: 304 TTFGHWGGANNPAYSAMCSSYDYDAPISEPG 334
>gi|265767790|ref|ZP_06095322.1| beta-galactosidase [Bacteroides sp. 2_1_16]
gi|263252462|gb|EEZ23990.1| beta-galactosidase [Bacteroides sp. 2_1_16]
Length = 628
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 104/313 (33%), Positives = 146/313 (46%), Gaps = 34/313 (10%)
Query: 38 NGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFN 97
NG+ ++S +HY R W +Q K G+NT+ +YVFWN HE PGK+ F G N
Sbjct: 37 NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 98 LVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMT 157
L +FIK + M +ILR GP+V AE+ +GG P WL + G R D F K+
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEF----LKYTK 152
Query: 158 LIVDMMKRE--KLFASQGGPIILAQVENEYGYY----ESFYGEGGKRYALWAAKMAVAQN 211
+D + +E L ++GGPI++ Q ENE+G Y + E + Y +
Sbjct: 153 AYIDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAG 212
Query: 212 IGVPWI------MCQQFDTPD--PVINTCNSF-----YCDQFTPHSPSMPKIWTENWPGW 258
VP + + TP P N + DQ+ H P + E +PGW
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQY--HDGKGPYMVAEFYPGW 270
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--------IT 310
+ P + IA ++ Q S N+YM HGGTNFG T+G +
Sbjct: 271 LSHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDM 329
Query: 311 TSYDYEAPIDEYG 323
TSYDY+API E G
Sbjct: 330 TSYDYDAPISEAG 342
>gi|431758215|ref|ZP_19546843.1| beta-galactosidase [Enterococcus faecium E3083]
gi|430617878|gb|ELB54742.1| beta-galactosidase [Enterococcus faecium E3083]
Length = 595
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 103/318 (32%), Positives = 151/318 (47%), Gaps = 36/318 (11%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+++G IIS AIHY R P W + K G NT+E+Y+ WN HE G + F
Sbjct: 9 EFLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFS 68
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G N+V+F+KI Q+ + +ILR ++ AE+ +GG+P WL P R+ F ++
Sbjct: 69 GFKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLK 128
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIG 213
+ ++ + K L +QGGP+I+ Q+ENEYG Y K Y ++ +A +I
Sbjct: 129 NYYQVL--LPKLAPLQITQGGPVIMMQLENEYGSYGM-----EKSYLRQTKELMLAHSID 181
Query: 214 VP-------WIMCQQFDTP-DPVINTCNSF---------YCDQFTP-HSPSMPKIWTENW 255
+P W+ T D I +F +F H + P + E W
Sbjct: 182 IPLFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYW 241
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--------P 307
GWF +G R E++A V + G N YM+HGGTNFG G P
Sbjct: 242 DGWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLP 299
Query: 308 FITTSYDYEAPIDEYGLP 325
I TSYDY+A ++E G P
Sbjct: 300 QI-TSYDYDALLNEAGQP 316
>gi|324507659|gb|ADY43243.1| Beta-galactosidase [Ascaris suum]
Length = 655
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 96/313 (30%), Positives = 155/313 (49%), Gaps = 37/313 (11%)
Query: 35 LIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGG 94
+++GR IS +IHY R P W + + + G+N I+ Y+ WN HE+ GK+ F G
Sbjct: 41 FLLDGRSFRYISGSIHYFRVHPDQWNDRLSRMRAAGLNAIQFYIPWNFHEIYEGKHRFDG 100
Query: 95 RFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQK 154
N+ F+++ Q +Y ++RIGP++ AE+ GG P WL R + F +++
Sbjct: 101 SRNITHFLQLAMQNELYALVRIGPYICAEWENGGAPWWLLKYKDIKMRTSDKRFLDAVKR 160
Query: 155 FMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGV 214
+ +++ ++K GGPI++ Q+ENEYG SF G + Y ++ +A ++ G
Sbjct: 161 WFDVLLPILKPN--LRKNGGPILMLQLENEYG---SFDGGCDRNYTIFLRDLA-RRHFGD 214
Query: 215 PWIMCQQFDTPDPVINTCNSF------------------YC----DQFTPHSPSMPKIWT 252
++ D D C + +C Q+ PH P + +
Sbjct: 215 DVVLYTT-DGGDDFYLKCGTIPGVYATVDFGPASSEAIDHCFASQRQYEPHGPLVN---S 270
Query: 253 ENWPGWFKTFGGRDP-HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--- 308
E +PGWF T+ ++ +P ++ F+KG + NYYM+HGGTNF GG
Sbjct: 271 EFYPGWFLTWSQKERGDQPVHNVINGSKYMFEKGANF-NYYMFHGGTNFAFWNGGATKTA 329
Query: 309 ITTSYDYEAPIDE 321
ITTSYDY AP+ E
Sbjct: 330 ITTSYDYFAPLSE 342
>gi|375360076|ref|YP_005112848.1| putative exported beta-galactosidase [Bacteroides fragilis 638R]
gi|383119863|ref|ZP_09940600.1| hypothetical protein BSHG_4164 [Bacteroides sp. 3_2_5]
gi|251944025|gb|EES84544.1| hypothetical protein BSHG_4164 [Bacteroides sp. 3_2_5]
gi|301164757|emb|CBW24316.1| putative exported beta-galactosidase [Bacteroides fragilis 638R]
Length = 628
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 104/313 (33%), Positives = 146/313 (46%), Gaps = 34/313 (10%)
Query: 38 NGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFN 97
NG+ ++S +HY R W +Q K G+NT+ +YVFWN HE PGK+ F G N
Sbjct: 37 NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 98 LVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMT 157
L +FIK + M +ILR GP+V AE+ +GG P WL + G R D F K+
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEF----LKYTK 152
Query: 158 LIVDMMKRE--KLFASQGGPIILAQVENEYGYY----ESFYGEGGKRYALWAAKMAVAQN 211
+D + +E L ++GGPI++ Q ENE+G Y + E + Y +
Sbjct: 153 AYIDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAG 212
Query: 212 IGVPWI------MCQQFDTPD--PVINTCNSF-----YCDQFTPHSPSMPKIWTENWPGW 258
VP + + TP P N + DQ+ H P + E +PGW
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQY--HDGKGPYMVAEFYPGW 270
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--------IT 310
+ P + IA ++ Q S N+YM HGGTNFG T+G +
Sbjct: 271 LSHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDM 329
Query: 311 TSYDYEAPIDEYG 323
TSYDY+API E G
Sbjct: 330 TSYDYDAPISEAG 342
>gi|336412039|ref|ZP_08592497.1| hypothetical protein HMPREF1018_04515 [Bacteroides sp. 2_1_56FAA]
gi|423261296|ref|ZP_17242197.1| hypothetical protein HMPREF1055_04474 [Bacteroides fragilis
CL07T00C01]
gi|423267821|ref|ZP_17246801.1| hypothetical protein HMPREF1056_04488 [Bacteroides fragilis
CL07T12C05]
gi|423272270|ref|ZP_17251238.1| hypothetical protein HMPREF1079_04320 [Bacteroides fragilis
CL05T00C42]
gi|423276726|ref|ZP_17255658.1| hypothetical protein HMPREF1080_04311 [Bacteroides fragilis
CL05T12C13]
gi|423283105|ref|ZP_17261990.1| hypothetical protein HMPREF1204_01528 [Bacteroides fragilis HMW
615]
gi|335939211|gb|EGN01088.1| hypothetical protein HMPREF1018_04515 [Bacteroides sp. 2_1_56FAA]
gi|387774329|gb|EIK36442.1| hypothetical protein HMPREF1055_04474 [Bacteroides fragilis
CL07T00C01]
gi|392695462|gb|EIY88674.1| hypothetical protein HMPREF1079_04320 [Bacteroides fragilis
CL05T00C42]
gi|392695591|gb|EIY88799.1| hypothetical protein HMPREF1056_04488 [Bacteroides fragilis
CL07T12C05]
gi|392696055|gb|EIY89256.1| hypothetical protein HMPREF1080_04311 [Bacteroides fragilis
CL05T12C13]
gi|404581379|gb|EKA86078.1| hypothetical protein HMPREF1204_01528 [Bacteroides fragilis HMW
615]
Length = 628
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 104/313 (33%), Positives = 146/313 (46%), Gaps = 34/313 (10%)
Query: 38 NGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFN 97
NG+ ++S +HY R W +Q K G+NT+ +YVFWN HE PGK+ F G N
Sbjct: 37 NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 98 LVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMT 157
L +FIK + M +ILR GP+V AE+ +GG P WL + G R D F K+
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEF----LKYTK 152
Query: 158 LIVDMMKRE--KLFASQGGPIILAQVENEYGYY----ESFYGEGGKRYALWAAKMAVAQN 211
+D + +E L ++GGPI++ Q ENE+G Y + E + Y +
Sbjct: 153 AYIDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAG 212
Query: 212 IGVPWI------MCQQFDTPD--PVINTCNSF-----YCDQFTPHSPSMPKIWTENWPGW 258
VP + + TP P N + DQ+ H P + E +PGW
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQY--HDGKGPYMVAEFYPGW 270
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--------IT 310
+ P + IA ++ Q S N+YM HGGTNFG T+G +
Sbjct: 271 LSHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDM 329
Query: 311 TSYDYEAPIDEYG 323
TSYDY+API E G
Sbjct: 330 TSYDYDAPISEAG 342
>gi|60683238|ref|YP_213382.1| beta-galactosidase [Bacteroides fragilis NCTC 9343]
gi|60494672|emb|CAH09473.1| putative exported beta-galactosidase [Bacteroides fragilis NCTC
9343]
Length = 628
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 104/313 (33%), Positives = 146/313 (46%), Gaps = 34/313 (10%)
Query: 38 NGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFN 97
NG+ ++S +HY R W +Q K G+NT+ +YVFWN HE PGK+ F G N
Sbjct: 37 NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 98 LVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMT 157
L +FIK + M +ILR GP+V AE+ +GG P WL + G R D F K+
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEF----LKYTK 152
Query: 158 LIVDMMKRE--KLFASQGGPIILAQVENEYGYY----ESFYGEGGKRYALWAAKMAVAQN 211
+D + +E L ++GGPI++ Q ENE+G Y + E + Y +
Sbjct: 153 AYIDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAG 212
Query: 212 IGVPWI------MCQQFDTPD--PVINTCNSF-----YCDQFTPHSPSMPKIWTENWPGW 258
VP + + TP P N + DQ+ H P + E +PGW
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQY--HDGKGPYMVAEFYPGW 270
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--------IT 310
+ P + IA ++ Q S N+YM HGGTNFG T+G +
Sbjct: 271 LSHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDM 329
Query: 311 TSYDYEAPIDEYG 323
TSYDY+API E G
Sbjct: 330 TSYDYDAPISEAG 342
>gi|414160019|ref|ZP_11416290.1| hypothetical protein HMPREF9310_00664 [Staphylococcus simulans
ACS-120-V-Sch1]
gi|410878669|gb|EKS26539.1| hypothetical protein HMPREF9310_00664 [Staphylococcus simulans
ACS-120-V-Sch1]
Length = 597
Score = 144 bits (364), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 98/319 (30%), Positives = 151/319 (47%), Gaps = 34/319 (10%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
+++G+ I+S AIHY R +P W + K G N +E+YV WN HE G++
Sbjct: 7 EEEFMLDGKPLKILSGAIHYFRVLPEDWEHSLYNLKALGFNAVETYVPWNFHETVEGEFD 66
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G ++ +FI + +Y+I+R P++ AE+ +GG+P WL P R+ F +
Sbjct: 67 FSGTKDIKRFIHTAEAIGLYVIIRPSPYICAEWEFGGLPAWLLTKPNLRVRSRDPQFLEY 126
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
++++ + +++ L GPI++ QVENEYG YGE K Y A+M +
Sbjct: 127 VERYYDRLFEILT--PLQIDHHGPILMMQVENEYGS----YGE-DKTYLSALARMMRDRG 179
Query: 212 IGVP-------WIMCQQFDT-------PDPVINTCNSFYCDQFTPHSPSMPKIW----TE 253
+ VP W C + + P + + D K W E
Sbjct: 180 VTVPLFTSDGSWQQCLEAGSLAEADIIPTGNFGSKSQKRLDNLHKFHQQFGKTWPLMSME 239
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGR----TAGGPF- 308
W GWF +G R R S+++ + ++G N YM+HGGTNFG +A G
Sbjct: 240 FWDGWFNRWGDRIITRQSDELIDEIGEVLKRGSI--NLYMFHGGTNFGFWNGCSARGRID 297
Query: 309 --ITTSYDYEAPIDEYGLP 325
TSYDY+AP+DE G P
Sbjct: 298 LPQVTSYDYDAPLDEAGNP 316
>gi|115361550|gb|ABI95864.1| beta-galactosidase [Planococcus sp. L4]
Length = 552
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 103/293 (35%), Positives = 148/293 (50%), Gaps = 28/293 (9%)
Query: 49 IHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQA 108
+HY R+VP W +Q+ K G+NT+E+Y+ WN HE G+++F G ++ FI++ +
Sbjct: 1 MHYFRTVPEQWEDRLQKLKALGLNTVETYIPWNFHEPKKGQFHFSGMADIEGFIELAHRL 60
Query: 109 RMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKL 168
+Y+ILR P++ AE+ GG+P WL V R+ F H++ + + + K K
Sbjct: 61 GLYVILRPAPYICAEWEMGGLPSWLMKDKNLVLRSSDPAFLGHVEDYFAEL--LPKFTKH 118
Query: 169 FASQGGPIILAQVENEYGYY--ESFYGEGGK-RYALWAAKMAVAQNIGVPWIMCQQFDTP 225
GGP+I Q+ENEYG Y +S Y + K +Y + + G +I Q P
Sbjct: 119 LYQNGGPVIAMQIENEYGAYGNDSAYLDFFKAQYEHHGLNTFLFTSDGPDFIT--QGSMP 176
Query: 226 DPVINTCN-------SFYC-DQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFS 277
D V T N SF D F P SP M E W GWF + G R +D+A
Sbjct: 177 D-VTTTLNFGSRVDESFQALDAFKPDSPKMV---AEFWIGWFDYWSGEHTVRSGDDVASV 232
Query: 278 VARFFQKGGSVHNYYMYHGGTNFGRTAGG-------PFITTSYDYEAPIDEYG 323
+K SV N+YM+HGGTNFG G P I TSYDY++ + E G
Sbjct: 233 FKEIMEKNISV-NFYMFHGGTNFGFMNGANHYDIYYPTI-TSYDYDSLLTEGG 283
>gi|422729668|ref|ZP_16786066.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
gi|315149788|gb|EFT93804.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
Length = 604
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 107/337 (31%), Positives = 165/337 (48%), Gaps = 49/337 (14%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++N + I+S AIHY R P W + K G NT+E+YV WN HE G ++F
Sbjct: 18 EEFLLNDQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 77
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+K+ Q+ +Y I+R P++ AE+ +GG P WL PG + R++ + H+
Sbjct: 78 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 136
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++++ + +L + GG I++ Q+ENEYG + GE K Y + +A+ +
Sbjct: 137 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYGSF----GEE-KAYLRAIRDLMIARGV 189
Query: 213 GVPWIMCQQFDTP------------DPVINTCN-------SFYCDQ--FTPHSPSMPKIW 251
P+ D P D ++ T N +F Q F H P +
Sbjct: 190 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 246
Query: 252 TENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNF----GRT 303
E W GWF + RDP +E + ++A GS+ N YM+HGGTNF G +
Sbjct: 247 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL-----GSI-NLYMFHGGTNFEFMNGCS 300
Query: 304 AGGPF---ITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
A G TSYDY+AP+DE G P + K LH
Sbjct: 301 ARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|299147339|ref|ZP_07040404.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
gi|298514617|gb|EFI38501.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
Length = 778
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 102/331 (30%), Positives = 160/331 (48%), Gaps = 30/331 (9%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
+IFFS++ A + +++G+ ++ +A +HY R W ++ K G+N
Sbjct: 14 VIFFSTAQAQTTARKFEAGKNTFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMN 73
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
TI Y+FWN HE GK+ F G+ ++ F + Q+ MY+I+R GP+V AE+ GG+P W
Sbjct: 74 TICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWW 133
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESF 191
L R +P Y+M++ + ++ K+ L ++GG II+ QVENEYG Y
Sbjct: 134 LLKKKDIALRT-LDP--YYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQVENEYGSY--- 187
Query: 192 YGEGGKRYALWAAKMAVAQN--IGVPWIMCQ-----QFDTPDPVINTCN---SFYCDQ-- 239
G + + A + V ++ VP C + D +I T N DQ
Sbjct: 188 ---GIDKPYVSAVRDLVRESGFSDVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQF 244
Query: 240 --FTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGG 297
P P + +E W GWF +G + RP++D+ + + S + YM HGG
Sbjct: 245 KKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGG 303
Query: 298 TNFGRTAGG-----PFITTSYDYEAPIDEYG 323
T FG G + +SYDY+API E G
Sbjct: 304 TTFGHWGGANNPAYSAMCSSYDYDAPISEPG 334
>gi|302549318|ref|ZP_07301660.1| beta-galactosidase [Streptomyces viridochromogenes DSM 40736]
gi|302466936|gb|EFL30029.1| beta-galactosidase [Streptomyces viridochromogenes DSM 40736]
Length = 589
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 100/330 (30%), Positives = 156/330 (47%), Gaps = 32/330 (9%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+T S +++G I+S A+HY R P +W +++A+ G+NT+E+Y+ WN H+ P
Sbjct: 4 LTTTSDGFLLHGEPFRILSGALHYFRVHPDLWSDRLRKARLMGLNTVETYLPWNHHQPDP 63
Query: 88 -GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G G +L +F+++ Q ++++LR GPF+ AE++ GG+P WL P R
Sbjct: 64 EGPLVLDGLLDLPRFLRLAQDEGLHVLLRPGPFICAEWDGGGLPDWLTSDPDVRLRTSDP 123
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
F + +++ L++ ++ A+ GGP+I QVENEYG Y G A
Sbjct: 124 RFTGAVDRYLDLLLPALRPH--LAAAGGPVIAVQVENEYGAY-------GDDCAYLKHLA 174
Query: 207 AVAQNIGVPWIM--CQQFDTPD------PVINTCNSFYC------DQFTPHSPSMPKIWT 252
++ GV ++ C Q D P + T ++F + H P
Sbjct: 175 DAFRSRGVEELLFTCDQADPEHLAAGSLPGVLTASTFGSRVEQSFGRLREHRSEGPLFCA 234
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF---- 308
E W GWF +GG H A + G+ N YM+HGGTNFG G
Sbjct: 235 EFWIGWFDHWGGPH-HVRDAADAAADLDRLLSAGASVNIYMFHGGTNFGFANGANHKHAY 293
Query: 309 --ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+A + E G P PK+ +E+
Sbjct: 294 TPTVTSYDYDAALTECGDP-GPKYHAFREV 322
>gi|440800373|gb|ELR21412.1| lysosomal betagalactosidase, partial [Acanthamoeba castellanii str.
Neff]
Length = 604
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 100/310 (32%), Positives = 148/310 (47%), Gaps = 32/310 (10%)
Query: 38 NGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFN 97
+G+ I+S +IHY RS+P WP ++ + G+NT+ +YV WN HE +PG+Y F GR +
Sbjct: 36 DGQEFRIVSGSIHYFRSLPEQWPARLRTLRSCGLNTVTTYVPWNLHEPTPGQYDFSGRLD 95
Query: 98 LVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMT 157
+V+FI+ QQ +I+R P++ AE +GG+P WL G R + + F+
Sbjct: 96 IVRFIEAAQQEGFLVIVRPPPYICAELEFGGLPAWLLNEEGLQLRCSDPKYLKRVDSFLD 155
Query: 158 LIVDMMKREKLFASQGGPIILAQVENEYGYY----------ESFYGEGGKRYALWAAKMA 207
+ M+ + S+GGPII QVENEYG Y E + + L+++ A
Sbjct: 156 HFLPMLATYQY--SRGGPIIAMQVENEYGSYGNDHLYLRHLELKFRQHQIDAILFSSNGA 213
Query: 208 VAQNI---GVPWIM-CQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFG 263
Q +P ++ F T V + PS P TE W GWF +
Sbjct: 214 GDQMFVGGALPSLLRTVNFGTGADVEGNLKV-----LRKYQPSGPLFVTEFWDGWFDHW- 267
Query: 264 GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--------PF--ITTSY 313
G + H + + + N YM GGTNFG T G P+ TTSY
Sbjct: 268 GEEHHTTTPTQSMKTLEAILSNNASVNLYMAFGGTNFGFTNGANKGYGETDPYQPTTTSY 327
Query: 314 DYEAPIDEYG 323
DY+AP++E G
Sbjct: 328 DYDAPVNESG 337
>gi|53715303|ref|YP_101295.1| beta-galactosidase [Bacteroides fragilis YCH46]
gi|52218168|dbj|BAD50761.1| beta-galactosidase precursor [Bacteroides fragilis YCH46]
Length = 628
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 104/313 (33%), Positives = 146/313 (46%), Gaps = 34/313 (10%)
Query: 38 NGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFN 97
NG+ ++S +HY R W +Q K G+NT+ +YVFWN HE PGK+ F G N
Sbjct: 37 NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 98 LVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMT 157
L +FIK + M +ILR GP+V AE+ +GG P WL + G R D F K+
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEF----LKYTK 152
Query: 158 LIVDMMKRE--KLFASQGGPIILAQVENEYGYY----ESFYGEGGKRYALWAAKMAVAQN 211
+D + +E L ++GGPI++ Q ENE+G Y + E + Y +
Sbjct: 153 AYIDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADVG 212
Query: 212 IGVPWI------MCQQFDTPD--PVINTCNSF-----YCDQFTPHSPSMPKIWTENWPGW 258
VP + + TP P N + DQ+ H P + E +PGW
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQY--HDGKGPYMVAEFYPGW 270
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--------IT 310
+ P + IA ++ Q S N+YM HGGTNFG T+G +
Sbjct: 271 LSHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDM 329
Query: 311 TSYDYEAPIDEYG 323
TSYDY+API E G
Sbjct: 330 TSYDYDAPISEAG 342
>gi|423259078|ref|ZP_17240001.1| hypothetical protein HMPREF1055_02278 [Bacteroides fragilis
CL07T00C01]
gi|423263951|ref|ZP_17242954.1| hypothetical protein HMPREF1056_00641 [Bacteroides fragilis
CL07T12C05]
gi|387776658|gb|EIK38758.1| hypothetical protein HMPREF1055_02278 [Bacteroides fragilis
CL07T00C01]
gi|392706217|gb|EIY99340.1| hypothetical protein HMPREF1056_00641 [Bacteroides fragilis
CL07T12C05]
Length = 773
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 102/328 (31%), Positives = 153/328 (46%), Gaps = 29/328 (8%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
R+ ++NG ++ +A +HY R W + K G+NTI Y+FWN HE GK+ F
Sbjct: 31 RTFLLNGNPFVVKAAELHYARIPEPYWEHRILMCKALGMNTICLYMFWNYHEQQEGKFDF 90
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ KF K+ Q+ MY+ILR GP+V AE+ GG+P WL R+ F
Sbjct: 91 SGEKNVAKFCKLAQKHGMYIILRPGPYVCAEWEMGGLPWWLLKEKDMKVRSLNPYFMERT 150
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN- 211
+ FM + + +L + GG II+ QVENE+G G G + + A + V +
Sbjct: 151 EIFMKELGKQLAPLQL--ANGGNIIMVQVENEFG------GYGVDKPYMTAIRDIVCRAG 202
Query: 212 ------IGVPWIMCQQFDTPDPVINTCN-------SFYCDQFTPHSPSMPKIWTENWPGW 258
W + + D ++ T N + + P P + +E W GW
Sbjct: 203 FDKSVLFQCDWDSTFELNALDDLLWTLNFGTGANIDKEFKKLSTVRPDTPLMCSEFWSGW 262
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-----PFITTSY 313
F +G + RP+E + + + S + YM HGGT FG G + +SY
Sbjct: 263 FDHWGRKHETRPAEKMVEGIKDMLDRNISF-SLYMTHGGTTFGHWGGANSPTYSAMCSSY 321
Query: 314 DYEAPIDEYGLPRNPKWGHLKELHGAIK 341
DY+API E G PK+ L+EL G +
Sbjct: 322 DYDAPISEAGW-TTPKYYLLQELLGKYR 348
>gi|293370654|ref|ZP_06617206.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
gi|292634388|gb|EFF52925.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
Length = 778
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 102/331 (30%), Positives = 160/331 (48%), Gaps = 30/331 (9%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
+IFFS++ A + +++G+ ++ +A +HY R W ++ K G+N
Sbjct: 14 VIFFSTAQAQTTARKFEAGKNTFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMN 73
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
TI Y+FWN HE GK+ F G+ ++ F + Q+ MY+I+R GP+V AE+ GG+P W
Sbjct: 74 TICIYIFWNIHEQEEGKFDFSGQNDIATFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWW 133
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESF 191
L R +P Y+M++ + ++ K+ L ++GG II+ QVENEYG Y
Sbjct: 134 LLKKKDIALRT-LDP--YYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQVENEYGSY--- 187
Query: 192 YGEGGKRYALWAAKMAVAQN--IGVPWIMCQ-----QFDTPDPVINTCN---SFYCDQ-- 239
G + + A + V ++ VP C + D +I T N DQ
Sbjct: 188 ---GIDKPYVSAVRDLVRESGFSDVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQF 244
Query: 240 --FTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGG 297
P P + +E W GWF +G + RP++D+ + + S + YM HGG
Sbjct: 245 KRLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGG 303
Query: 298 TNFGRTAGG-----PFITTSYDYEAPIDEYG 323
T FG G + +SYDY+API E G
Sbjct: 304 TTFGHWGGANNPAYSAMCSSYDYDAPISEPG 334
>gi|326922161|ref|XP_003207320.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Meleagris
gallopavo]
Length = 643
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 106/356 (29%), Positives = 169/356 (47%), Gaps = 25/356 (7%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+ YD + +GR IS +IHY R W + + K G++ I++YV WN HE
Sbjct: 18 IDYDCNCFVKDGRPFRYISGSIHYSRVPRYYWKDRLLKMKMAGLDAIQTYVPWNYHETQM 77
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G Y F G +L F+++ + + +ILR GP++ AE++ GG+P WL V R+
Sbjct: 78 GVYDFSGDRDLEYFLQLASETGLLVILRAGPYICAEWDMGGLPAWLLEKESIVLRSSDSD 137
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY------------ESFYGEG 195
+ ++K+M +++ MK GGPII+ QVENEYG Y + F
Sbjct: 138 YLTAVEKWMGVLLPKMKPH--LYQNGGPIIMVQVENEYGSYFACDYDYLRSLLKIFRQHL 195
Query: 196 GKRYALWAAKMAVAQNIGVPWIMCQQFDTPD--PVINTCNSFYCDQFTPHSPSMPKIWTE 253
G L+ A ++ + + T D P N +F + + P+ P + +E
Sbjct: 196 GDEVVLFTTDGASQFHLKCGALQ-GLYATVDFAPGGNVTAAFLAQRSS--EPTGPLVNSE 252
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG--RTAGGPFIT- 310
+ GW +G R PS+ IA ++ +G +V N YM+ GGTNF A P+++
Sbjct: 253 FYTGWLDHWGHRHAVVPSQTIAKTLNEILARGANV-NLYMFIGGTNFAYWNGANMPYMSQ 311
Query: 311 -TSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLGSSQEADV 365
TSYDY+AP+ E G K+ L+E+ G L+ S + G+ + V
Sbjct: 312 PTSYDYDAPLSEAG-DLTEKYFALREVIGMYNQLPEGLIPPTTSKFAYGNVRLQKV 366
>gi|336428330|ref|ZP_08608312.1| hypothetical protein HMPREF0994_04318 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336005980|gb|EGN36021.1| hypothetical protein HMPREF0994_04318 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 583
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 96/306 (31%), Positives = 149/306 (48%), Gaps = 28/306 (9%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
++G+ IIS A+HY R VP W +++ K G NT+E+YV WN HE GK+ F G
Sbjct: 14 LDGKPFKIISGAVHYFRIVPEYWRDRLEKLKAMGANTVETYVPWNMHEPQKGKFVFEGML 73
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
++ +FI + Q+ +Y+I+R P++ AE+ +GG+P WL G R EPF ++++
Sbjct: 74 DISRFILLAQELGLYVIVRPSPYICAEWEFGGLPAWLLKEDGMRLRGCYEPFLEAVREYY 133
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPW 216
+++ ++ L GGP+IL QVENEYGYY RY ++ + VP
Sbjct: 134 SVLFPILV--PLQIHHGGPVILMQVENEYGYYGD-----DTRYMETMKQLMLDNGAEVPL 186
Query: 217 IM----------CQQFDTPDPVIN--TCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
+ C + P N + + ++ P + TE W GWF +G
Sbjct: 187 VTSDGPMDESLSCGRLPGVLPTGNFGSKTEERFEVLKKYTEGGPLMCTEFWVGWFDHWGN 246
Query: 265 RDPHRPS-EDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI------TTSYDYEA 317
R + E+ + + + G N YM+ GGTNFG G + TSYDY+A
Sbjct: 247 GGHMRGNLEESTKDLDKMLEMGHV--NIYMFEGGTNFGFMNGSNYYDELTPDVTSYDYDA 304
Query: 318 PIDEYG 323
+ E G
Sbjct: 305 VLTEAG 310
>gi|423301385|ref|ZP_17279409.1| hypothetical protein HMPREF1057_02550 [Bacteroides finegoldii
CL09T03C10]
gi|408471986|gb|EKJ90515.1| hypothetical protein HMPREF1057_02550 [Bacteroides finegoldii
CL09T03C10]
Length = 779
Score = 144 bits (362), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 102/334 (30%), Positives = 159/334 (47%), Gaps = 30/334 (8%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
F + F SS+ A + +++G+ ++ +A +HY R W ++ K
Sbjct: 12 FTVTFFVSSAQAQTTARKFEAGKNTFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKAL 71
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G+NTI Y+FWN HE GK+ F G+ ++ F + Q+ MY+I+R GP+V AE+ GG+
Sbjct: 72 GMNTICIYIFWNIHEQEEGKFDFTGQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGL 131
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYY 188
P WL R +P Y+M++ + ++ K+ L ++GG II+ QVENEYG Y
Sbjct: 132 PWWLLKKKDIALRT-LDP--YYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQVENEYGSY 188
Query: 189 ESFYGEGGKRYALWAAKMAVAQN--IGVPWIMCQ-----QFDTPDPVINTCN---SFYCD 238
G + + A + V ++ VP C + D +I T N D
Sbjct: 189 ------GINKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNFGTGANID 242
Query: 239 Q----FTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMY 294
Q P P + +E W GWF +G + RP++D+ + + S + YM
Sbjct: 243 QQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMT 301
Query: 295 HGGTNFGRTAGG-----PFITTSYDYEAPIDEYG 323
HGGT FG G + +SYDY+API E G
Sbjct: 302 HGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAG 335
>gi|319900291|ref|YP_004160019.1| Beta-galactosidase [Bacteroides helcogenes P 36-108]
gi|319415322|gb|ADV42433.1| Beta-galactosidase [Bacteroides helcogenes P 36-108]
Length = 629
Score = 144 bits (362), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 96/312 (30%), Positives = 145/312 (46%), Gaps = 30/312 (9%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
+NG++ I+S +HY R W +Q K G+N + +YVFWN HE PGK+ F G
Sbjct: 38 LNGKQTPILSGEMHYARIPHQYWRHRLQMMKGMGLNAVATYVFWNHHETEPGKWDFTGDK 97
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
NL ++IK + M +ILR GP+V AE+ +GG P WL +PG R D F H + ++
Sbjct: 98 NLAEYIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVPGMEIRRDNPQFLKHTEAYI 157
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYY----ESFYGEGGKRYALWAAKMAVAQNI 212
+ + L ++GGPI++ Q ENE+G Y + + + Y +
Sbjct: 158 QRLYKEVGH--LQCTKGGPIVMVQCENEFGSYVAQRKDITLQEHRAYNAKIKQQLADAGF 215
Query: 213 GVPWIMCQ-----QFDTPDPVINTCN--------SFYCDQFTPHSPSMPKIWTENWPGWF 259
VP + + + + T N +Q+ H P + E +PGW
Sbjct: 216 DVPLFTSDGSWLFEGGSTEGALPTANGETDIANLKKVVNQY--HGGQGPYMVAEFYPGWL 273
Query: 260 KTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT--------T 311
+ P + +A + + + S N YM HGGTNFG T+G + T
Sbjct: 274 SHWAEPFPQVSASSVARTTESYLKNDVSF-NVYMVHGGTNFGFTSGANYDKKRDIQPDLT 332
Query: 312 SYDYEAPIDEYG 323
SYDY+API E G
Sbjct: 333 SYDYDAPISEAG 344
>gi|423278914|ref|ZP_17257828.1| hypothetical protein HMPREF1203_02045 [Bacteroides fragilis HMW
610]
gi|404585906|gb|EKA90510.1| hypothetical protein HMPREF1203_02045 [Bacteroides fragilis HMW
610]
Length = 769
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 105/333 (31%), Positives = 160/333 (48%), Gaps = 33/333 (9%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A N T + ++NG+ + +A +HY R W ++ K G+NTI YVFWN HE
Sbjct: 18 AQNFTIGKNTFLLNGKSFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
+ GK+ F G+ ++ F ++ Q+ MY+I+R GP+V AE+ GG+P WL V R
Sbjct: 78 QTEGKFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRT- 136
Query: 145 TEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
+P Y M++ + ++ K+ L ++GG II+ QVENEYG Y K Y +
Sbjct: 137 LDP--YFMERTAIFMKEVGKQLAPLQITRGGNIIMVQVENEYGAYAV-----DKPYV--S 187
Query: 204 AKMAVAQNIG---VPWIMCQQFDTPDP--------VINTCNSFYCDQ----FTPHSPSMP 248
A + ++ G VP C T D IN +Q P P
Sbjct: 188 AIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGANIEQQFKRLKEARPDTP 247
Query: 249 KIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-- 306
+ +E W GWF +G + RP++ + + + S + YM HGGT FG G
Sbjct: 248 LMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANN 306
Query: 307 ---PFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
+ +SYDY+API E G + K+ L++L
Sbjct: 307 PAYSAMCSSYDYDAPISEPGWATD-KYFQLRDL 338
>gi|257899628|ref|ZP_05679281.1| glycosyl hydrolase [Enterococcus faecium Com15]
gi|257837540|gb|EEV62614.1| glycosyl hydrolase [Enterococcus faecium Com15]
Length = 595
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 103/318 (32%), Positives = 151/318 (47%), Gaps = 36/318 (11%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+++G IIS AIHY R P W + K G NT+E+Y+ WN HE G + F
Sbjct: 9 EFLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFS 68
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G ++V+F+KI Q+ + +ILR ++ AE+ +GG+P WL P R+ F ++
Sbjct: 69 GFKDIVQFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDIRVRSTDPRFMEKLK 128
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIG 213
+ ++ + K L +QGGP+I+ Q+ENEYG Y K Y ++ +A +I
Sbjct: 129 NYYQVL--LPKLAPLQITQGGPVIMMQLENEYGSYGM-----EKSYLRQTKELMLAHSID 181
Query: 214 VP-------WIMCQQFDTP-DPVINTCNSF---------YCDQFTP-HSPSMPKIWTENW 255
VP W+ T D I +F +F H + P + E W
Sbjct: 182 VPLFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYW 241
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--------P 307
GWF +G R E++A V + G N YM+HGGTNFG G P
Sbjct: 242 DGWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLP 299
Query: 308 FITTSYDYEAPIDEYGLP 325
I TSYDY+A ++E G P
Sbjct: 300 QI-TSYDYDALLNEAGQP 316
>gi|329927236|ref|ZP_08281534.1| beta-galactosidase [Paenibacillus sp. HGF5]
gi|328938636|gb|EGG35019.1| beta-galactosidase [Paenibacillus sp. HGF5]
Length = 587
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 126/441 (28%), Positives = 192/441 (43%), Gaps = 65/441 (14%)
Query: 44 IISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIK 103
I+S AIHY R VP W + + + G+NT+E+Y+ WN HE G++ F G +L +F++
Sbjct: 21 ILSGAIHYFRVVPEYWEDRLMKLRSCGLNTVETYIPWNLHEPKEGQFVFDGIADLERFVR 80
Query: 104 IIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMM 163
I +++ILR P++ AE+ +GG+P WL P R + + ++ ++ +
Sbjct: 81 IAGDLGLHVILRPSPYICAEWEFGGLPSWLLQNPDIQLRCMDPVYLEKVDQYYDELIPRL 140
Query: 164 KREKLFASQGGPIILAQVENEYGYY--ESFYGEGGKRYAL-WAAKMAVAQNIGVPWIMCQ 220
L S+GGP+I Q+ENEYG Y ++ Y E K + + + + G M Q
Sbjct: 141 V--PLLTSKGGPVIAMQIENEYGSYGNDTAYLEYLKDGLIKRGVDVLLFTSDGPTDGMLQ 198
Query: 221 QFDTPDPVINTCN-----SFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIA 275
P V+ T N D+ + P P + E W GWF + R +ED A
Sbjct: 199 GGAVPG-VLATVNFGSRTKEAFDKLREYRPEDPLMCMEYWNGWFDHWLKPHHTRDAEDAA 257
Query: 276 FSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF------ITTSYDYEAPIDEYG------ 323
SV N+YM+HGGTNFG G F TSYDY+AP+ E G
Sbjct: 258 AVFKEMLDLNASV-NFYMFHGGTNFGFYNGANFHEKYEPTLTSYDYDAPLSECGDVTAKF 316
Query: 324 -----------------LPRNPK------WGHLKELHGAIKLCEHALLNGERSNLS---- 356
LP P+ +G + H A L L+ E+ +
Sbjct: 317 EAIRSAIAQHQGKELSDLPSLPQPVKKISYGSVSMTHYADLLEHLPALSEEQKRTAPVPM 376
Query: 357 --LGSSQEADVYADS-SGACAAFLANMDDKNDKTVVFRNVSYH--LPAWSVSILPDCKKV 411
LG S VYA SG ++ + +D+ VF + Y + W LP
Sbjct: 377 ERLGQSYGFTVYATHISGPRQGESLHLQEVHDRAQVFLDGKYQGTVERWDAKALP----- 431
Query: 412 VFNTANVRAQSSTVEMVPENL 432
+V A + +E+V EN+
Sbjct: 432 ----IDVPAAGAKLEIVVENM 448
>gi|241642284|ref|XP_002409405.1| beta-galactosidase precursor, putative [Ixodes scapularis]
gi|215501365|gb|EEC10859.1| beta-galactosidase precursor, putative [Ixodes scapularis]
Length = 812
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 114/369 (30%), Positives = 183/369 (49%), Gaps = 44/369 (11%)
Query: 18 SSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESY 77
S+ CF V Y++ + + +S + HY R + W + + K GG+N +++Y
Sbjct: 325 SASERCF--RVDYENNVFLKDDEPFQFVSGSFHYFRVLKDSWKDRLIKMKNGGLNVVQTY 382
Query: 78 VFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWL-HYI 136
V W+GHE P +Y F G +++ F+K+ Q+ ++++LR GP+++AE + GG+P WL
Sbjct: 383 VEWSGHEPEPQQYNFEGNYDIETFLKLAQEVGLFVVLRPGPYISAERDNGGLPYWLLREN 442
Query: 137 PGTVFRNDTEPFKYHMQK----FMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFY 192
P V+R+ F + + F+ +I D M GGPII+ QVENEYG Y+
Sbjct: 443 PRMVYRSFDPTFMLPVDRWFHYFLPMIQDYMYH------NGGPIIMVQVENEYGEYK--- 493
Query: 193 GEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTC--------------NSFYCD 238
E RY + + Q++G ++ +Q D P C N D
Sbjct: 494 -ECDCRYMEHLVYIFL-QHLGTDTVLYRQ-DYPLEENYICDEARQTFVSGSFKYNETIAD 550
Query: 239 QFTPHSPSM----PKIWTENWPG-WFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYM 293
F + S P + +E +PG W +G + P + + + K SV N+YM
Sbjct: 551 VFDIMNKSQGNEGPMLVSEYYPGGWQSHWGWEEVTFPEDKVIAKLEEMLSKKASV-NFYM 609
Query: 294 YHGGTNFGRTAGG--PFITTSYDYEAPIDEYGLPRNPKWGHLKE-LHGAIKLCEHALLNG 350
Y GGTNFG T G P + TSYDY +PI E G R P + L++ ++ + L E+ +++
Sbjct: 610 YVGGTNFGFTNGNRPPPLVTSYDYGSPISECGDTR-PIYHTLRQSINKFLPLPEYIVIDP 668
Query: 351 ERSNLSLGS 359
E L+LGS
Sbjct: 669 E-PRLNLGS 676
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 57/184 (30%), Positives = 94/184 (51%), Gaps = 18/184 (9%)
Query: 67 KEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNY 126
K G+N ++ YV W+GHE PG+Y F ++L F++ +Q + ++ R GP++ AE +
Sbjct: 2 KMAGLNAVDVYVEWSGHEPEPGRYLFHNEYDLELFLEFVQDLDLLVLFRPGPYICAERDN 61
Query: 127 GGIPVW-LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEY 185
GG+P W L V+R F + ++ ++ +MK GGPIIL QVENEY
Sbjct: 62 GGLPYWLLRKNASMVYRTSDPSFMAEVTRWFDRLLPLMK--PYLYEYGGPIILVQVENEY 119
Query: 186 GYYESFYGEGGKRYALWAAKMAVAQNIG--VPWIMCQQFDTPDPVINTCNSFYCDQFTPH 243
G Y + K+Y A + + +++G VP + Q D + F CD+ +
Sbjct: 120 GAYFA----CDKKYMRDLASL-LRRHLGHSVPLFLSNQADE--------SHFRCDRVSGI 166
Query: 244 SPSM 247
P++
Sbjct: 167 LPTV 170
>gi|255692586|ref|ZP_05416261.1| beta-galactosidase [Bacteroides finegoldii DSM 17565]
gi|260621643|gb|EEX44514.1| glycosyl hydrolase family 35 [Bacteroides finegoldii DSM 17565]
Length = 779
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 102/334 (30%), Positives = 159/334 (47%), Gaps = 30/334 (8%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
F + F SS+ A + +++G+ ++ +A +HY R W ++ K
Sbjct: 12 FTVTFFVSSAQAQTTARKFEAGKNTFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKAL 71
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G+NTI Y+FWN HE GK+ F G+ ++ F + Q+ MY+I+R GP+V AE+ GG+
Sbjct: 72 GMNTICIYIFWNIHEQEEGKFDFTGQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGL 131
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYY 188
P WL R +P Y+M++ + ++ K+ L ++GG II+ QVENEYG Y
Sbjct: 132 PWWLLKKRDIALRT-LDP--YYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQVENEYGSY 188
Query: 189 ESFYGEGGKRYALWAAKMAVAQN--IGVPWIMCQ-----QFDTPDPVINTCN---SFYCD 238
G + + A + V ++ VP C + D +I T N D
Sbjct: 189 ------GINKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNFGTGANID 242
Query: 239 Q----FTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMY 294
Q P P + +E W GWF +G + RP++D+ + + S + YM
Sbjct: 243 QQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMT 301
Query: 295 HGGTNFGRTAGG-----PFITTSYDYEAPIDEYG 323
HGGT FG G + +SYDY+API E G
Sbjct: 302 HGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAG 335
>gi|425056292|ref|ZP_18459750.1| putative beta-galactosidase [Enterococcus faecium 505]
gi|403032128|gb|EJY43702.1| putative beta-galactosidase [Enterococcus faecium 505]
Length = 595
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 103/318 (32%), Positives = 151/318 (47%), Gaps = 36/318 (11%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+++G IIS AIHY R P W + K G NT+E+Y+ WN HE G + F
Sbjct: 9 EFLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFS 68
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G ++V+F+KI Q+ + +ILR ++ AE+ +GG+P WL P R+ F ++
Sbjct: 69 GFKDVVQFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLK 128
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIG 213
+ ++ + K L +QGGP+I+ Q+ENEYG Y K Y ++ +A +I
Sbjct: 129 NYYQVL--LPKLAPLQITQGGPVIMMQLENEYGSYGM-----EKSYLRQTKELMLAHSID 181
Query: 214 VP-------WIMCQQFDTP-DPVINTCNSF---------YCDQFTP-HSPSMPKIWTENW 255
VP W+ T D I +F +F H + P + E W
Sbjct: 182 VPLFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYW 241
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--------P 307
GWF +G R E++A V + G N YM+HGGTNFG G P
Sbjct: 242 DGWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLP 299
Query: 308 FITTSYDYEAPIDEYGLP 325
I TSYDY+A ++E G P
Sbjct: 300 QI-TSYDYDALLNEAGQP 316
>gi|193690496|ref|XP_001952133.1| PREDICTED: beta-galactosidase-like [Acyrthosiphon pisum]
Length = 635
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 103/326 (31%), Positives = 161/326 (49%), Gaps = 36/326 (11%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+ Y++ + +G+ +S ++HY R W +Q+ K G+NTI +YV W+ HE P
Sbjct: 27 IDYENNEFLKDGKVFRYVSGSLHYFRIPQLYWKDRIQKMKAAGLNTITTYVEWSLHEPFP 86
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW-LHYIPGTVFRNDTE 146
G Y F G +L FI++I+ MY+ILR GP++ AE ++GG P W L+ P R +
Sbjct: 87 GVYDFEGIADLEYFIELIKNENMYLILRPGPYICAERDFGGFPYWLLNVTPKRSLRTNNS 146
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
+K ++ K+ ++++ +++ GG IIL QVENEYG Y + E Y LW +
Sbjct: 147 SYKKYVSKWFSVLMPIIQPH--LYGNGGNIILVQVENEYGSYYACDSE----YKLWIRDL 200
Query: 207 --AVAQNIGVPWIM--CQQ--FD---------TPDPVINTCNSFYCDQFTPHSPSMPKIW 251
+ +N V + + C Q FD T D I++ S D P +
Sbjct: 201 FRSYVENKAVLFTIDGCGQSYFDCGVIPEVYATVDFGISSNASQCFDFMRKVQKGGPLVN 260
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG----- 306
+E +PGW + + + D+ + S ++YM+HGGTNFG T+G
Sbjct: 261 SEFYPGWLTHWQESESIVNTTDVVKQMKVMLAMNAS-FSFYMFHGGTNFGFTSGANTNDT 319
Query: 307 -------PFITTSYDYEAPIDEYGLP 325
P + TSYDY AP+DE G P
Sbjct: 320 KESIGYLPQL-TSYDYNAPLDEAGDP 344
>gi|313149603|ref|ZP_07811796.1| glycoside hydrolase family 35 [Bacteroides fragilis 3_1_12]
gi|313138370|gb|EFR55730.1| glycoside hydrolase family 35 [Bacteroides fragilis 3_1_12]
Length = 628
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 103/313 (32%), Positives = 146/313 (46%), Gaps = 34/313 (10%)
Query: 38 NGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFN 97
NG+ ++S +HY R W +Q K G+NT+ +YVFWN HE PGK+ F G N
Sbjct: 37 NGKVTPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 98 LVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMT 157
L +FIK + M +ILR GP+V AE+ +GG P WL + G R D F K+
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEF----LKYTK 152
Query: 158 LIVDMMKRE--KLFASQGGPIILAQVENEYGYY----ESFYGEGGKRYALWAAKMAVAQN 211
+D + +E L ++GGPI++ Q ENE+G Y + E + Y +
Sbjct: 153 AYIDRLYKEVGNLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAG 212
Query: 212 IGVPWI------MCQQFDTPD--PVINTCNSF-----YCDQFTPHSPSMPKIWTENWPGW 258
VP + + TP P N + +Q+ H P + E +PGW
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVNQY--HDGKGPYMVAEFYPGW 270
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT-------- 310
+ P + IA ++ Q S N+YM HGGTNFG T+G +
Sbjct: 271 LSHWAEPFPQVGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDL 329
Query: 311 TSYDYEAPIDEYG 323
TSYDY+API E G
Sbjct: 330 TSYDYDAPISEAG 342
>gi|71275091|ref|ZP_00651378.1| Beta-galactosidase [Xylella fastidiosa Dixon]
gi|170731075|ref|YP_001776508.1| beta-galactosidase [Xylella fastidiosa M12]
gi|71163900|gb|EAO13615.1| Beta-galactosidase [Xylella fastidiosa Dixon]
gi|71730559|gb|EAO32637.1| Beta-galactosidase [Xylella fastidiosa Ann-1]
gi|167965868|gb|ACA12878.1| Beta-galactosidase [Xylella fastidiosa M12]
Length = 612
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 102/326 (31%), Positives = 152/326 (46%), Gaps = 33/326 (10%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
I +GR +IS AIH+ R W +Q+A+ G+NT+E+YVFWN EL G++ F
Sbjct: 34 QFIRDGRPYQLISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVELREGQFDFT 93
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G ++ F++ + +ILR GP+V AE+ GG P WL P R+ F Q
Sbjct: 94 GNNDIGAFVREAASQGLNVILRPGPYVCAEWEAGGFPAWLFADPTLRVRSQDPRFLDASQ 153
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIG 213
+++ + ++ L S GGPII QVENEYG Y +G Y + + +G
Sbjct: 154 RYLEALGTQVR--PLLNSNGGPIIAMQVENEYGSYGDDHG-----YLQAVRALFIKAGLG 206
Query: 214 VPWI-------MCQQFDTPDPVINTCN------SFYCDQFTPHSPSMPKIWTENWPGWFK 260
+ M PD V+ N D+ P P++ E W GWF
Sbjct: 207 GALLFTSDGAQMLGNGTLPD-VLAAVNVAPGEAKQALDKLATFHPGQPQLVGEYWAGWFD 265
Query: 261 TFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF----------IT 310
+G ++ A + ++G S+ N YM+ GGT+FG G F T
Sbjct: 266 QWGKPHAQTDAKQQADEIEWMLRQGHSI-NLYMFVGGTSFGFMNGANFQGGPGDHYSPQT 324
Query: 311 TSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+A +DE G P PK+ +++
Sbjct: 325 TSYDYDAALDEAGRPM-PKFALFRDV 349
>gi|423280524|ref|ZP_17259436.1| hypothetical protein HMPREF1203_03653 [Bacteroides fragilis HMW
610]
gi|404583731|gb|EKA88404.1| hypothetical protein HMPREF1203_03653 [Bacteroides fragilis HMW
610]
Length = 628
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 103/313 (32%), Positives = 146/313 (46%), Gaps = 34/313 (10%)
Query: 38 NGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFN 97
NG+ ++S +HY R W +Q K G+NT+ +YVFWN HE PGK+ F G N
Sbjct: 37 NGKVTPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 98 LVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMT 157
L +FIK + M +ILR GP+V AE+ +GG P WL + G R D F K+
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEF----LKYTK 152
Query: 158 LIVDMMKRE--KLFASQGGPIILAQVENEYGYY----ESFYGEGGKRYALWAAKMAVAQN 211
+D + +E L ++GGPI++ Q ENE+G Y + E + Y +
Sbjct: 153 AYIDRLYKEVGDLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAG 212
Query: 212 IGVPWI------MCQQFDTPD--PVINTCNSF-----YCDQFTPHSPSMPKIWTENWPGW 258
VP + + TP P N + +Q+ H P + E +PGW
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVNQY--HDGKGPYMVAEFYPGW 270
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT-------- 310
+ P + IA ++ Q S N+YM HGGTNFG T+G +
Sbjct: 271 LSHWAEPFPQVGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDL 329
Query: 311 TSYDYEAPIDEYG 323
TSYDY+API E G
Sbjct: 330 TSYDYDAPISEAG 342
>gi|313241117|emb|CBY33414.1| unnamed protein product [Oikopleura dioica]
Length = 608
Score = 143 bits (360), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 99/341 (29%), Positives = 161/341 (47%), Gaps = 45/341 (13%)
Query: 21 TYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFW 80
TY F + S++ I++G ++HY R W +++ K G+NT+++Y+ W
Sbjct: 4 TYLFKIRRLFKSKTRILSG--------SLHYFRVPKEYWRDRLEKLKGAGLNTVQTYIGW 55
Query: 81 NGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTV 140
N HE G + F ++ +F+KI + +Y+I+R GP++ AE+ +GG P WL +
Sbjct: 56 NLHEPREGDFIFEDELDVSEFLKIAKDVGLYVIMRPGPYICAEWEWGGFPAWLLTKENMI 115
Query: 141 FRN-DTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRY 199
R +E + +Q + T++ ++ + S+GGPII QVENEY Y Y
Sbjct: 116 VRQTKSEAYLAAVQNWFTVLFSQLRDHQW--SRGGPIISIQVENEYASYNK-----DSEY 168
Query: 200 ALWAAKMAVAQNIGVPWIMCQQFDT----------PDPVI-----NTCNSFYCDQFTPHS 244
W + ++G +++ +T PD + + N+F +
Sbjct: 169 LPWVKNLLT--DVGKCFLLKIINETNFFLKGAHLLPDTFLTANFQSVGNAF--EVLDKLQ 224
Query: 245 PSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTA 304
P+ PK+ TE W GWF +G + S R GS N YM+HGGT+FG A
Sbjct: 225 PNRPKMVTEFWAGWFDHWGQQGHSTLSPTTFNKTMREILNAGSSVNQYMFHGGTSFGWMA 284
Query: 305 GGPFI---------TTSYDYEAPIDEYGLPRNPKWGHLKEL 336
G ++ TTSYDY+AP+ E G KW +E+
Sbjct: 285 GSNWLSKKQRGTSDTTSYDYDAPLSESG-DLTEKWNVTREI 324
>gi|424665121|ref|ZP_18102157.1| hypothetical protein HMPREF1205_00996 [Bacteroides fragilis HMW
616]
gi|404574985|gb|EKA79730.1| hypothetical protein HMPREF1205_00996 [Bacteroides fragilis HMW
616]
Length = 628
Score = 143 bits (360), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 105/326 (32%), Positives = 152/326 (46%), Gaps = 35/326 (10%)
Query: 38 NGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFN 97
NG+ ++S +HY R W +Q K G+NT+ +YVFWN HE PGK+ F G N
Sbjct: 37 NGKVTPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 98 LVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMT 157
L +FIK + M +ILR GP+V AE+ +GG P WL + G R D F K+
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEF----LKYTK 152
Query: 158 LIVDMMKRE--KLFASQGGPIILAQVENEYGYY----ESFYGEGGKRYALWAAKMAVAQN 211
+D + +E L ++GGPI++ Q ENE+G Y + E + Y +
Sbjct: 153 AYIDRLYKEVGDLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAG 212
Query: 212 IGVPWI------MCQQFDTPD--PVINTCNSF-----YCDQFTPHSPSMPKIWTENWPGW 258
VP + + TP P N + +Q+ H P + E +PGW
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVNQY--HDGKGPYMVAEFYPGW 270
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT-------- 310
+ P + IA ++ Q S N+YM HGGTNFG T+G +
Sbjct: 271 LSHWAEPFPQVGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDL 329
Query: 311 TSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+API E G PK+ ++ +
Sbjct: 330 TSYDYDAPISEAGW-VTPKYDSIRNV 354
>gi|392950288|ref|ZP_10315845.1| Beta-galactosidase 3 [Lactobacillus pentosus KCA1]
gi|392434570|gb|EIW12537.1| Beta-galactosidase 3 [Lactobacillus pentosus KCA1]
Length = 588
Score = 143 bits (360), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 96/319 (30%), Positives = 151/319 (47%), Gaps = 35/319 (10%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
+ ++NG+ I S A+HY R P W +++ K G+NT+E+Y+ WN HE G++ F
Sbjct: 10 KEFLLNGQPFKIYSGAVHYFRIAPSEWRDTLEKLKAAGLNTVETYIPWNVHEPQEGQFVF 69
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
R+++ KF+K+ Q +Y+ILR P++ AE+ +GG+P WL P V R++T F +
Sbjct: 70 EDRYDIGKFVKLAQSIGLYVILRPSPYICAEWEFGGLPAWLLRYPDMVVRSNTPRFMEKV 129
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + ++ L + GGP+++ QVENEYG + + K Y + +
Sbjct: 130 ANYYEALFKVLV--PLQITHGGPVLMMQVENEYGSFGN-----DKAYLRHVKSLMETNGV 182
Query: 213 GVP-------WIMCQQFDT--PDPVINTC--------NSFYCDQFT-PHSPSMPKIWTEN 254
VP W + + D V T N QF H + P + E
Sbjct: 183 DVPLFTADGSWQQALKAGSLIEDDVFVTANFGSKSRENLAELRQFMLMHHKNWPLMCMEF 242
Query: 255 WPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG--------RTAGG 306
W GWF + R ++ +A ++ S N YM+ GGTNFG +
Sbjct: 243 WDGWFNRWQEEIVTRSADSFQTDLAELVKEQASF-NLYMFRGGTNFGFFNGCSSRQNVDY 301
Query: 307 PFITTSYDYEAPIDEYGLP 325
P I TSYDY+A + E G P
Sbjct: 302 PQI-TSYDYDAVLHEDGRP 319
>gi|313202559|ref|YP_004041216.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312441875|gb|ADQ78231.1| glycoside hydrolase family 35 [Paludibacter propionicigenes WB4]
Length = 786
Score = 143 bits (360), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 106/310 (34%), Positives = 155/310 (50%), Gaps = 23/310 (7%)
Query: 35 LIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGG 94
++NG+ +I + +HY R W ++ K G+NTI Y+FWN HE +PG + F G
Sbjct: 40 FMLNGKPYIIRAGELHYTRIPKAYWDHRIKMCKAMGMNTICIYLFWNIHEQTPGVFDFKG 99
Query: 95 RFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQK 154
+ ++ +F+++IQQ MY I+R GP+V AE++ GG+P WL R+ ++ Y M++
Sbjct: 100 QNDVAEFVRLIQQNGMYCIVRPGPYVCAEWDMGGLPWWLLKKKDLQVRSLSD--SYFMEQ 157
Query: 155 FMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYY--ESFYGEGGKRYALWAAKMAVAQN 211
+ + K+ L GG II+ QVENEYG + +S Y E R + A Q
Sbjct: 158 TKKYLNEAGKQLAPLQIQNGGNIIMVQVENEYGTWGSDSKYME-TMRNNVRQAGFGKVQL 216
Query: 212 IGVPWIMCQQFDTPDPVINTCN----SFYCDQFTP---HSPSMPKIWTENWPGWFKTFGG 264
+ W D +N N S DQF +P P + E W GWF +G
Sbjct: 217 LRCDWSSNFFHYKLDGAVNALNFGAGSNIDDQFKKFKEMNPDSPLMCGEYWTGWFDQWG- 275
Query: 265 RDPHRPSEDIAF--SVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF-----ITTSYDYEA 317
PH E +F S+ K S + YM HGGT++G+ AG T+SYDY A
Sbjct: 276 -RPHETREINSFIGSLKDMMDKRISF-SLYMAHGGTSYGQWAGANAPAYAPTTSSYDYNA 333
Query: 318 PIDEYGLPRN 327
PIDE G P +
Sbjct: 334 PIDEAGNPTD 343
>gi|354585216|ref|ZP_09004105.1| glycoside hydrolase family 35 [Paenibacillus lactis 154]
gi|353188942|gb|EHB54457.1| glycoside hydrolase family 35 [Paenibacillus lactis 154]
Length = 619
Score = 143 bits (360), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 102/321 (31%), Positives = 156/321 (48%), Gaps = 36/321 (11%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
G +T+ + +++G+ IIS A+HY R VP W + + K G NT+E+Y+ WN HE
Sbjct: 2 GVLTWKNGQYLLDGQPYRIISGAVHYFRVVPEYWEDRLLKLKACGFNTVETYIAWNVHEP 61
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
+ G++ F G ++ FI++ + +++I+R PF+ AE+ +GG+P WL R
Sbjct: 62 TEGEFNFSGMADVGSFIELAGKLGLHVIVRPSPFICAEWEFGGLPGWLLGYGEIRLRCSD 121
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
+ + + ++ M L +S GGPI+ QVENEYG Y G +A
Sbjct: 122 PLYLSKVDHYYDELIPRMV--PLLSSNGGPILAVQVENEYGSY-------GNDHAYLEYL 172
Query: 206 MAVAQNIGVPWIMCQQFDTP----------DPVINTCN-------SFYCDQFTPHSPSMP 248
A GV ++ D P D V T N SF ++ + P
Sbjct: 173 RAGLVRRGVDVLLFTS-DGPTDEMLLGGSIDHVHATVNFGSRVEESF--GKYREYRTDEP 229
Query: 249 KIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF 308
+ E W GWF + R + D+A + +KG S+ N YM+HGGTNFG +G
Sbjct: 230 LMVMEFWNGWFDHWMEDHHVRDAADVAGVLDEMLEKGSSI-NMYMFHGGTNFGFYSGANH 288
Query: 309 I------TTSYDYEAPIDEYG 323
I TTSYDY+AP+ E+G
Sbjct: 289 IKTYEPTTTSYDYDAPLTEWG 309
>gi|257870316|ref|ZP_05649969.1| glycosyl hydrolase [Enterococcus gallinarum EG2]
gi|257804480|gb|EEV33302.1| glycosyl hydrolase [Enterococcus gallinarum EG2]
Length = 593
Score = 143 bits (360), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 105/321 (32%), Positives = 150/321 (46%), Gaps = 41/321 (12%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG ++S AIHY R P W + K G NT+E+YV WN HE G + F
Sbjct: 8 EEFLMNGSPFKLLSGAIHYFRVHPDDWEHSLYNLKALGFNTVETYVPWNLHEPHKGLFQF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L +F+ + Q+ +Y+ILR P++ AE+ +GG+P WL G + D +
Sbjct: 68 EGILDLERFLSLAQELGLYVILRPSPYICAEWEFGGLPAWLLKESGRLRACDPSYLAHVA 127
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + L+ ++ + S GG I++ QVENEYG YGE K Y +M + + I
Sbjct: 128 EYYDVLLPKIIPYQ---LSHGGNILMIQVENEYGS----YGE-EKAYLRAIKEMLINRGI 179
Query: 213 GVPWIMCQQFDTP------------DPVINTCN---------SFYCDQFTPHSPSMPKIW 251
+P D P D V+ T N + D F H+ P +
Sbjct: 180 DMPLFTS---DGPWQAALRAGSLIEDDVLVTGNFGSRAKENFAAMQDFFDQHNKKWPLMC 236
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTA 304
E W GWF + R +D+A SV + G N YM+HGGTNFG R A
Sbjct: 237 MEFWDGWFNRWNEPIIRRDPDDLAESVKEALEIGSV--NLYMFHGGTNFGFMNGCSARGA 294
Query: 305 GGPFITTSYDYEAPIDEYGLP 325
TSYDY+AP+DE G P
Sbjct: 295 VDLPQVTSYDYDAPLDEQGNP 315
>gi|393785841|ref|ZP_10373985.1| hypothetical protein HMPREF1068_00265 [Bacteroides nordii
CL02T12C05]
gi|392660955|gb|EIY54552.1| hypothetical protein HMPREF1068_00265 [Bacteroides nordii
CL02T12C05]
Length = 605
Score = 143 bits (360), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 103/313 (32%), Positives = 148/313 (47%), Gaps = 29/313 (9%)
Query: 44 IISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF-GGRFNLVKFI 102
IIS IH R W +Q K G NT+ Y+ WN HE PG + F G +L KFI
Sbjct: 48 IISGEIHPSRIPAEYWKQRIQMIKAMGCNTVACYIMWNYHESEPGVFDFQTGNKDLEKFI 107
Query: 103 KIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDM 162
+ +Q+ M+++ R GP+V E+++GG+P +L P R + ++++ T I +
Sbjct: 108 RTVQEEDMFLLFRPGPYVCGEWDFGGLPAYLLSTPDIKIRCMDPRYTTAVERYATAIAPI 167
Query: 163 MKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPW------ 216
+K+ ++ + GGPII+ QVENEYG Y + + Y W + + I VP+
Sbjct: 168 IKKYEV--TNGGPIIMVQVENEYGSYGN-----DRTYMKWIHDLWRDKGIEVPFYTADGA 220
Query: 217 --IMCQQFDTPDPVIN---TCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPS 271
M + P I + D+ P +E +PGW + H
Sbjct: 221 TPYMLEAGTLPGVAIGLDPAASKAEFDEALKVHPDASVFCSELYPGWLTHWRENWQHPSI 280
Query: 272 EDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG----PFI----TTSYDYEAPIDEYG 323
E I V G S NYY+ HGGTNFG AG P I TSYDY+API+E G
Sbjct: 281 EKITTDVKWLLDNGKSF-NYYVIHGGTNFGFWAGANSPQPGIYQPDVTSYDYDAPINEMG 339
Query: 324 LPRNPKWGHLKEL 336
PK+ L+EL
Sbjct: 340 -QATPKYMALREL 351
>gi|170034400|ref|XP_001845062.1| beta-galactosidase [Culex quinquefasciatus]
gi|167875695|gb|EDS39078.1| beta-galactosidase [Culex quinquefasciatus]
Length = 611
Score = 142 bits (359), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 98/338 (28%), Positives = 164/338 (48%), Gaps = 26/338 (7%)
Query: 19 SITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYV 78
S Y ++ Y+ + +++G IS + HY R++PG W +++ + G+N + +Y+
Sbjct: 2 SFRYQHDHSIDYERDTFLLDGEPFRFISGSFHYFRALPGSWRHILRAMRAAGLNAVMTYI 61
Query: 79 FWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWL-HYIP 137
W+ HE + G Y + +L +FI+I ++ +Y+ILR GP++ AE + GG P WL P
Sbjct: 62 EWSTHEPTEGDYRWNEIADLEQFIRIAEEENLYVILRPGPYICAERDMGGFPYWLLTKFP 121
Query: 138 GTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG-------YYES 190
R + +QK+ +++ M + +K +GGP+I+ +ENEYG Y
Sbjct: 122 NIKLRTQDSDYMREVQKWYSVL--MPRIQKYLYGRGGPVIMVSIENEYGSFSACDKTYLK 179
Query: 191 FYGEGGKRYALWAAKMAVAQNIGVPWIMCQQ----FDTPDPVINTCNSFYCDQFTPHSPS 246
F + Y + A + N G + C + T D Y + P
Sbjct: 180 FLKNMTESYIQYDA--VLFTNDGPEQLNCGRIPGILATLDFGSTGSPERYWQKLRKVQPK 237
Query: 247 MPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG- 305
P + E +PGW + + + ++ +G +V N+YM+ GGTNF TAG
Sbjct: 238 GPLVNAEFYPGWLTHWMEPMARTATGPVVDTLRLMLNQGANV-NFYMFFGGTNFAFTAGA 296
Query: 306 -----GPFIT--TSYDYEAPIDEYGLPRNPKWGHLKEL 336
G F T TSYDY+AP+DE G P PK+ L+++
Sbjct: 297 NDGGPGKFNTDITSYDYDAPLDEAGDP-TPKYFALRDV 333
>gi|313149116|ref|ZP_07811309.1| beta-galactosidase [Bacteroides fragilis 3_1_12]
gi|313137883|gb|EFR55243.1| beta-galactosidase [Bacteroides fragilis 3_1_12]
Length = 769
Score = 142 bits (359), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 105/333 (31%), Positives = 160/333 (48%), Gaps = 33/333 (9%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A N T + ++NG+ + +A +HY R W ++ K G+NTI YVFWN HE
Sbjct: 18 AQNFTIGKNTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
+ GK+ F G+ ++ F ++ Q+ MY+I+R GP+V AE+ GG+P WL V R
Sbjct: 78 QTEGKFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRT- 136
Query: 145 TEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
+P Y M++ + ++ K+ L ++GG II+ QVENEYG Y K Y +
Sbjct: 137 LDP--YFMERTAIFMKEVGKQLAPLQITRGGNIIMVQVENEYGAYAV-----DKPYV--S 187
Query: 204 AKMAVAQNIG---VPWIMCQQFDTPDP--------VINTCNSFYCDQ----FTPHSPSMP 248
A + ++ G VP C T D IN +Q P P
Sbjct: 188 AIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGANIEQQFKRLKEARPDTP 247
Query: 249 KIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-- 306
+ +E W GWF +G + RP++ + + + S + YM HGGT FG G
Sbjct: 248 LMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANN 306
Query: 307 ---PFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
+ +SYDY+API E G + K+ L++L
Sbjct: 307 PAYSAMCSSYDYDAPISEPGWATD-KYFQLRDL 338
>gi|424664993|ref|ZP_18102029.1| hypothetical protein HMPREF1205_00868 [Bacteroides fragilis HMW
616]
gi|404575526|gb|EKA80269.1| hypothetical protein HMPREF1205_00868 [Bacteroides fragilis HMW
616]
Length = 769
Score = 142 bits (359), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 105/333 (31%), Positives = 160/333 (48%), Gaps = 33/333 (9%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A N T + ++NG+ + +A +HY R W ++ K G+NTI YVFWN HE
Sbjct: 18 AQNFTIGKNTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
+ GK+ F G+ ++ F ++ Q+ MY+I+R GP+V AE+ GG+P WL V R
Sbjct: 78 QTEGKFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRT- 136
Query: 145 TEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
+P Y M++ + ++ K+ L ++GG II+ QVENEYG Y K Y +
Sbjct: 137 LDP--YFMERTAIFMKEVGKQLAPLQITRGGNIIMVQVENEYGAYAV-----DKPYV--S 187
Query: 204 AKMAVAQNIG---VPWIMCQQFDTPDP--------VINTCNSFYCDQ----FTPHSPSMP 248
A + ++ G VP C T D IN +Q P P
Sbjct: 188 AIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGANIEQQFKRLKEARPDTP 247
Query: 249 KIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-- 306
+ +E W GWF +G + RP++ + + + S + YM HGGT FG G
Sbjct: 248 LMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANN 306
Query: 307 ---PFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
+ +SYDY+API E G + K+ L++L
Sbjct: 307 PAYSAMCSSYDYDAPISEPGWATD-KYFQLRDL 338
>gi|71896501|ref|NP_001026163.1| beta-galactosidase precursor [Gallus gallus]
gi|53129216|emb|CAG31369.1| hypothetical protein RCJMB04_5i4 [Gallus gallus]
Length = 385
Score = 142 bits (359), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 103/333 (30%), Positives = 162/333 (48%), Gaps = 25/333 (7%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+ YD + +G IS +IHY R W + + K G+N I++YV WN HE
Sbjct: 27 IDYDCNCFVKDGHPFRYISGSIHYSRVPRYYWKDRLLKMKMAGLNAIQTYVPWNYHEPQM 86
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G Y F G +L F+++ + + +ILR GP++ AE++ GG+P WL V R+
Sbjct: 87 GVYDFSGDRDLEYFLQLASETGLLVILRAGPYICAEWDMGGLPAWLLEKESIVLRSSDSD 146
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY------------ESFYGEG 195
+ ++K+M +++ MK GGPII+ QVENEYG Y + F
Sbjct: 147 YLTAVEKWMGVLLPKMKPH--LYHNGGPIIMVQVENEYGSYFACDYDYLRSLLKIFRQHL 204
Query: 196 GKRYALWAAKMAVAQNIGVPWIMCQQFDTPD--PVINTCNSFYCDQFTPHSPSMPKIWTE 253
G L+ A ++ + + T D P N +F + + P+ P + +E
Sbjct: 205 GDEVVLFTTDGASQFHLKCGALQ-GLYATVDFAPGGNVTAAFLAQRSS--EPTGPLVNSE 261
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--PFIT- 310
+ GW +G R PSE IA ++ +G +V N YM+ GGTNF G P+++
Sbjct: 262 FYTGWLDHWGHRHIVVPSETIAKTLNEILARGANV-NLYMFIGGTNFAYWNGANMPYMSQ 320
Query: 311 -TSYDYEAPIDEYGLPRNPKWGHLKELHGAIKL 342
TSYDY+AP+ E G K+ L+E+ G + +
Sbjct: 321 PTSYDYDAPLSEAG-DLTEKYFALREVIGMVSI 352
>gi|395803570|ref|ZP_10482814.1| beta-galactosidase [Flavobacterium sp. F52]
gi|395434124|gb|EJG00074.1| beta-galactosidase [Flavobacterium sp. F52]
Length = 617
Score = 142 bits (359), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 115/363 (31%), Positives = 169/363 (46%), Gaps = 50/363 (13%)
Query: 7 IAPFALLIFFSSSITYCFA-GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQ 65
+ L+ FF+ + T F+ N + II I S +HY R W +Q
Sbjct: 10 VVLICLMPFFTKAQTKGFSISNGEFQKDGKIIK-----IHSGEMHYERIPKEYWRHRLQM 64
Query: 66 AKEGGVNTIESYVFWNGHELSPGKYYFG-GRFNLVKFIKIIQQARMYMILRIGPFVAAEY 124
K G+NT+ +YVFWN HE+ PG + F G +L +F++I + +Y+ILR GP+ E+
Sbjct: 65 LKAMGLNTVATYVFWNYHEIEPGVWDFKTGNRDLAEFLRIAKSEGLYVILRPGPYACGEW 124
Query: 125 NYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENE 184
+GG P WL P V R + + F + ++ + ++K FA+QGGPII+ Q ENE
Sbjct: 125 EFGGYPWWLQNNPDLVIRTNNKAFLDACKTYLEHLYAVVKGN--FANQGGPIIMVQAENE 182
Query: 185 YGYYES----FYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPD-----------PVI 229
+G Y S E K Y A + + G P + F T D V+
Sbjct: 183 FGSYVSQRTDISAEDHKAYK--TAIYNILKETGFP----EPFFTSDGSWLFEGGMVEGVL 236
Query: 230 NTCN--------SFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARF 281
T N D++ H P + E +PGW + SE+IA ++
Sbjct: 237 PTANGESNIENLKKQVDKY--HKGQGPYMVAEFYPGWLDHWAEPFVKIGSEEIASQTKKY 294
Query: 282 FQKGGSVHNYYMYHGGTNFGRTAGGPF--------ITTSYDYEAPIDEYGLPRNPKWGHL 333
G S NYYM HGGTNFG T+G + TSYDY+API E G PK+ +
Sbjct: 295 LDAGVSF-NYYMAHGGTNFGFTSGANYNEESDIQPDITSYDYDAPISEAGWA-TPKFMAI 352
Query: 334 KEL 336
+++
Sbjct: 353 RDV 355
>gi|257413247|ref|ZP_04742461.2| beta-galactosidase [Roseburia intestinalis L1-82]
gi|257204151|gb|EEV02436.1| beta-galactosidase [Roseburia intestinalis L1-82]
Length = 588
Score = 142 bits (359), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 96/316 (30%), Positives = 151/316 (47%), Gaps = 38/316 (12%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
+ + ++G+ IIS AIHY R VP W +++ K G NT+E+Y+ WN HE G+++
Sbjct: 14 TDNFYLDGKPFQIISGAIHYFRIVPEYWQDRLEKLKAMGCNTVETYIPWNMHEPKKGEFH 73
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G ++ +F+K Q+ +Y+ILR P++ AE+ +GG+P WL G R PF H
Sbjct: 74 FEGMLDIERFVKTAQELGLYVILRPSPYICAEWEFGGLPAWLLAEDGMKLRVSYPPFLKH 133
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+Q + +++ + ++ + GGP+IL QVENEYGYY + + Y L
Sbjct: 134 VQDYYDVLLKKIVPYQI--NYGGPVILMQVENEYGYYAN-----DREYLLAMRDKMQKGG 186
Query: 212 IGVPWIMCQQFDTPDPVINTCNSFYCDQFTP-----------------HSPSMPKIWTEN 254
+ VP + + P N + + P ++ P + TE
Sbjct: 187 VVVPLVT-----SDGPFEENLNGGHLEGALPTGNFGSKTEERFEVLKKYTDGGPLMCTEF 241
Query: 255 WPGWFKTFG-GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI---- 309
W GWF +G G E+ + + + G N YM+ GGTNFG G +
Sbjct: 242 WVGWFDHWGNGGHMTGNLEESVKDLDKMLELGHV--NIYMFEGGTNFGFMNGSNYYDELT 299
Query: 310 --TTSYDYEAPIDEYG 323
TSYDY+A + E G
Sbjct: 300 PDVTSYDYDALLTEDG 315
>gi|313238883|emb|CBY13879.1| unnamed protein product [Oikopleura dioica]
Length = 601
Score = 142 bits (359), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 99/341 (29%), Positives = 161/341 (47%), Gaps = 45/341 (13%)
Query: 21 TYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFW 80
TY F + S++ I++G ++HY R W +++ K G+NT+++Y+ W
Sbjct: 4 TYLFKIRRLFKSKTRILSG--------SLHYFRVPKEYWRDRLEKLKGAGLNTVQTYIGW 55
Query: 81 NGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTV 140
N HE G + F ++ +F+KI + +Y+I+R GP++ AE+ +GG P WL +
Sbjct: 56 NLHEPREGDFIFEDELDVSEFLKIAKDVGLYVIMRPGPYICAEWEWGGFPAWLLTKENMI 115
Query: 141 FRN-DTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRY 199
R +E + +Q + T++ ++ + S+GGPII QVENEY Y Y
Sbjct: 116 VRQTKSEAYLAAVQNWFTVLFSQLRDHQW--SRGGPIISIQVENEYASYNK-----DSEY 168
Query: 200 ALWAAKMAVAQNIGVPWIMCQQFDT----------PDPVI-----NTCNSFYCDQFTPHS 244
W + ++G +++ +T PD + + N+F +
Sbjct: 169 LPWVKNLLT--DVGKCFLLKIINETNFFLKGAHLLPDTFLTANFQSVGNAF--EVLDKLQ 224
Query: 245 PSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTA 304
P+ PK+ TE W GWF +G + S R GS N YM+HGGT+FG A
Sbjct: 225 PNRPKMVTEFWAGWFDHWGQQGHSLLSPTTFNKTMREILNAGSSVNQYMFHGGTSFGWMA 284
Query: 305 GGPFI---------TTSYDYEAPIDEYGLPRNPKWGHLKEL 336
G ++ TTSYDY+AP+ E G KW +E+
Sbjct: 285 GSNWLSKKQRGTSDTTSYDYDAPLSESG-DLTEKWNVTREI 324
>gi|198433885|ref|XP_002127100.1| PREDICTED: similar to galactosidase, beta 1-like 2 [Ciona
intestinalis]
Length = 658
Score = 142 bits (359), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 105/317 (33%), Positives = 155/317 (48%), Gaps = 34/317 (10%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+T ++ ++G+ IIS A+HY R W + + K G+NTIE+YV WN HE P
Sbjct: 58 LTAQGKTFKLDGKPMTIISGAVHYFRMPREYWRDRLMKMKACGLNTIETYVPWNLHEPIP 117
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GKY F G +LV FI + + Y++LR GP++ +E+ +GG+P WL P R P
Sbjct: 118 GKYNFTGDLDLVHFILLAHKLEFYVLLRPGPYICSEWEFGGLPSWLLRDPKMKVRTMYPP 177
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY--ESFYGEGGKRYALWAAK 205
+ + K+ ++ +K L GGPII Q++NEYG Y ++ Y K +
Sbjct: 178 YIAAVTKYFNYLLPFVK--PLQYQYGGPIIAFQLDNEYGSYFKDADYLPYLKEF------ 229
Query: 206 MAVAQNIGVPWIM--------CQQFDTPDPVINTCN-SFYCDQFTPHS---PSMPKIWTE 253
QN G+ ++ +Q P V+ T N + FT S P P + E
Sbjct: 230 ---LQNKGIIELLFISDSIEGLRQQTIPG-VLKTVNFKRMENHFTDLSNMQPDAPLMVME 285
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG------- 306
W GWF +G + ++ ++ F +GGSV N+YM+ GGTNFG G
Sbjct: 286 FWTGWFDWWGEKHHILTVQEFGETLNEIFSQGGSV-NFYMFFGGTNFGFMNGAYKDGTGF 344
Query: 307 PFITTSYDYEAPIDEYG 323
TSYDY+A I E G
Sbjct: 345 HADITSYDYDALIAENG 361
>gi|291535092|emb|CBL08204.1| Beta-galactosidase [Roseburia intestinalis M50/1]
gi|291539606|emb|CBL12717.1| Beta-galactosidase [Roseburia intestinalis XB6B4]
Length = 581
Score = 142 bits (359), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 96/316 (30%), Positives = 151/316 (47%), Gaps = 38/316 (12%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
+ + ++G+ IIS AIHY R VP W +++ K G NT+E+Y+ WN HE G+++
Sbjct: 7 TDNFYLDGKPFQIISGAIHYFRIVPEYWQDRLEKLKAMGCNTVETYIPWNMHEPKKGEFH 66
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G ++ +F+K Q+ +Y+ILR P++ AE+ +GG+P WL G R PF H
Sbjct: 67 FEGMLDIERFVKTAQELGLYVILRPSPYICAEWEFGGLPAWLLAEDGMKLRVSYPPFLKH 126
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+Q + +++ + ++ + GGP+IL QVENEYGYY + + Y L
Sbjct: 127 VQDYYDVLLKKIVPYQI--NYGGPVILMQVENEYGYYAN-----DREYLLAMRDKMQKGG 179
Query: 212 IGVPWIMCQQFDTPDPVINTCNSFYCDQFTP-----------------HSPSMPKIWTEN 254
+ VP + + P N + + P ++ P + TE
Sbjct: 180 VVVPLVT-----SDGPFEENLNGGHLEGALPTGNFGSKTEERFEVLKKYTDGGPLMCTEF 234
Query: 255 WPGWFKTFG-GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI---- 309
W GWF +G G E+ + + + G N YM+ GGTNFG G +
Sbjct: 235 WVGWFDHWGNGGHMTGNLEESVKDLDKMLELGHV--NIYMFEGGTNFGFMNGSNYYDELT 292
Query: 310 --TTSYDYEAPIDEYG 323
TSYDY+A + E G
Sbjct: 293 PDVTSYDYDALLTEDG 308
>gi|297194215|ref|ZP_06911613.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
gi|197722531|gb|EDY66439.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
Length = 590
Score = 142 bits (358), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 98/321 (30%), Positives = 146/321 (45%), Gaps = 38/321 (11%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
++GR ++S A+HY R +P WP ++ + G++T+E+YV WN HE PG+Y F G
Sbjct: 11 LDGRPLRLLSGALHYFRVLPEQWPHRLRMLRAMGLDTVETYVPWNLHEPRPGEYDFDGIA 70
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGT-VFRNDTEPFKYHMQKF 155
+L +F+ ++A ++ I+R P++ AE+ GG+P WL P R + H+ ++
Sbjct: 71 DLDRFLHATREAGLHAIVRPSPYICAEWENGGLPWWLLADPEVGALRCQDPAYLAHVDRW 130
Query: 156 MTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVP 215
++ ++ ++ S+GG +++ QVENEYG Y + G Y A A+ I VP
Sbjct: 131 FDRLIPVVAAHQV--SRGGNVLMVQVENEYGSYGTDTG-----YLEHLAAGLRARGIDVP 183
Query: 216 WIMCQQFDTPDPVINTCNSF---------------YCDQFTPHSPSMPKIWTENWPGWFK 260
D PD T + P P + E W GWF
Sbjct: 184 LFTS---DGPDDFFLTGGALPGHLATVNFGSRPKEALADLARLRPDDPAMCMEFWCGWFD 240
Query: 261 TFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF-----------I 309
+G R D A + G SV N YM HGGTNF AG
Sbjct: 241 HWGTDHVVRDPADAAGVLEELLAAGASV-NVYMAHGGTNFSTWAGANTEDPAAGTGYRPT 299
Query: 310 TTSYDYEAPIDEYGLPRNPKW 330
TSYDY+AP+DE G W
Sbjct: 300 VTSYDYDAPVDERGAATEKFW 320
>gi|423248537|ref|ZP_17229553.1| hypothetical protein HMPREF1066_00563 [Bacteroides fragilis
CL03T00C08]
gi|423253485|ref|ZP_17234416.1| hypothetical protein HMPREF1067_01060 [Bacteroides fragilis
CL03T12C07]
gi|392657385|gb|EIY51022.1| hypothetical protein HMPREF1067_01060 [Bacteroides fragilis
CL03T12C07]
gi|392659750|gb|EIY53368.1| hypothetical protein HMPREF1066_00563 [Bacteroides fragilis
CL03T00C08]
Length = 773
Score = 142 bits (358), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 101/328 (30%), Positives = 152/328 (46%), Gaps = 29/328 (8%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
R+ ++NG ++ +A +HY R W + K G+NTI Y+FWN HE GK+ F
Sbjct: 31 RTFLLNGNPFVVKAAELHYARIPEPYWEHRILMCKALGMNTICLYMFWNYHEQQEGKFDF 90
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ KF K+ Q+ MY+ILR GP+ AE+ GG+P WL R+ F
Sbjct: 91 SGEKNVAKFCKLAQKHGMYIILRPGPYACAEWEMGGLPWWLLKEKDMKVRSLNPYFMERT 150
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN- 211
+ FM + + +L + GG II+ QVENE+G G G + + A + V +
Sbjct: 151 EIFMKELGKQLAPLQL--ANGGNIIMVQVENEFG------GYGVDKPYMTAIRDIVCRAG 202
Query: 212 ------IGVPWIMCQQFDTPDPVINTCN-------SFYCDQFTPHSPSMPKIWTENWPGW 258
W + + D ++ T N + + P P + +E W GW
Sbjct: 203 FDKSVLFQCDWDSTFELNALDDLLWTLNFGTGANIDKEFKKLSTVRPDTPLMCSEFWSGW 262
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-----PFITTSY 313
F +G + RP+E + + + S + YM HGGT FG G + +SY
Sbjct: 263 FDHWGRKHETRPAEKMVEGIKDMLDRNISF-SLYMTHGGTTFGHWGGANSPTYSAMCSSY 321
Query: 314 DYEAPIDEYGLPRNPKWGHLKELHGAIK 341
DY+API E G PK+ L+EL G +
Sbjct: 322 DYDAPISEAGW-TTPKYYLLQELLGKYR 348
>gi|380512533|ref|ZP_09855940.1| beta-galactosidase [Xanthomonas sacchari NCPPB 4393]
Length = 616
Score = 142 bits (358), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 101/325 (31%), Positives = 146/325 (44%), Gaps = 35/325 (10%)
Query: 35 LIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGG 94
I +G+ +IS AIH+ R W +Q+A+ G+NT+E+YVFWN E PG++ F G
Sbjct: 41 FIRDGKPYQVISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVEPRPGQFDFSG 100
Query: 95 RFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQK 154
++ F+ + +ILR GP+V AE+ GG P WL PG R+ F Q
Sbjct: 101 NNDIAAFVDEAAAQGLNVILRPGPYVCAEWEAGGYPAWLFAEPGMRVRSQDPRFLAASQA 160
Query: 155 FMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGV 214
++ + +K GGPI+ QVENEYG Y G +A A+ G
Sbjct: 161 YLDALAAQVKPR--LNGNGGPIVAVQVENEYGSY-------GDDHAYMRLNRAMFVQAGF 211
Query: 215 PWIMCQQFDTPDPVINTC--NSFYCDQFTP------------HSPSMPKIWTENWPGWFK 260
+ D PD + N ++ F P P P++ E W GWF
Sbjct: 212 DKALLFTADGPDVLANGTLPDTLAVVNFAPGDAKNAFETLAKFRPGQPQMVGEYWAGWFD 271
Query: 261 TFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF----------IT 310
+G + + A ++G S N YM+ GGT+FG G F T
Sbjct: 272 QWGEKHAATDATKQASEFEWILRQGHSA-NIYMFVGGTSFGFMNGANFQKNPSDHYAPQT 330
Query: 311 TSYDYEAPIDEYGLPRNPKWGHLKE 335
TSYDY+A +DE G P PK+ ++
Sbjct: 331 TSYDYDAVLDEAGRP-TPKFTLFRD 354
>gi|261407762|ref|YP_003244003.1| beta-galactosidase [Paenibacillus sp. Y412MC10]
gi|261284225|gb|ACX66196.1| Beta-galactosidase [Paenibacillus sp. Y412MC10]
Length = 587
Score = 142 bits (358), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 126/441 (28%), Positives = 192/441 (43%), Gaps = 65/441 (14%)
Query: 44 IISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIK 103
I+S AIHY R VP W + + + G+NT+E+Y+ WN HE G++ F G +L +F++
Sbjct: 21 ILSGAIHYFRVVPEYWEDRLMKLRSCGLNTVETYIPWNLHEPKEGQFVFDGIADLERFVR 80
Query: 104 IIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMM 163
I +++ILR P++ AE+ +GG+P WL P R + + ++ ++ +
Sbjct: 81 IAGDLGLHVILRPSPYICAEWEFGGLPSWLLQNPDIQLRCMDPVYLEKVDQYYDELIPRL 140
Query: 164 KREKLFASQGGPIILAQVENEYGYY--ESFYGEGGKRYAL-WAAKMAVAQNIGVPWIMCQ 220
L S+GGP+I Q+ENEYG Y ++ Y E K + + + + G M Q
Sbjct: 141 V--PLLTSKGGPVIAMQIENEYGSYGNDTAYLEYLKDGLIKRGVDVLLFTSDGPTDGMLQ 198
Query: 221 QFDTPDPVINTCN-----SFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIA 275
P V+ T N D+ + P P + E W GWF + R +ED A
Sbjct: 199 GGAVPG-VLATVNFGSRTKEAFDKLREYRPEDPLMCMEYWNGWFDHWLKPHHTRDAEDAA 257
Query: 276 FSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF------ITTSYDYEAPIDEYG------ 323
SV N+YM+HGGTNFG G F TSYDY+AP+ E G
Sbjct: 258 AVFKEMLDLNASV-NFYMFHGGTNFGFYNGANFHEKYEPTLTSYDYDAPLSECGDVTAKF 316
Query: 324 -----------------LPRNPK------WGHLKELHGAIKLCEHALLNGERSNLS---- 356
LP P+ +G + H A L L+ E+ +
Sbjct: 317 EAIRSAIAQHQGKELSDLPSLPQPVKKISYGSVSMTHYADLLEHLPALSEEQKRTAPVPM 376
Query: 357 --LGSSQEADVYADS-SGACAAFLANMDDKNDKTVVFRNVSYH--LPAWSVSILPDCKKV 411
LG S VYA SG ++ + +D+ VF + Y + W LP
Sbjct: 377 ERLGQSYGFTVYATHISGPRQGESLHLQEVHDRAQVFLDGKYQGTVERWDPKALP----- 431
Query: 412 VFNTANVRAQSSTVEMVPENL 432
+V A + +E+V EN+
Sbjct: 432 ----IDVPAAGAKLEIVVENM 448
>gi|156408171|ref|XP_001641730.1| predicted protein [Nematostella vectensis]
gi|156228870|gb|EDO49667.1| predicted protein [Nematostella vectensis]
Length = 647
Score = 142 bits (357), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 104/351 (29%), Positives = 169/351 (48%), Gaps = 23/351 (6%)
Query: 7 IAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQA 66
+A F I ++ + ++ YD+ + +G+ IS +HY R W + +
Sbjct: 1 MAFFLFFICCLPTLAISLSFSIDYDNNCFMKDGKPFRYISGGMHYFRVPQYYWKDRLLKL 60
Query: 67 KEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNY 126
K G+NT+++YV WN HE P +Y F G NL F++I Q + +ILR GP++ AE+++
Sbjct: 61 KASGMNTVQTYVPWNLHEPIPKQYNFAGNANLTSFLEIAQSLDLLVILRPGPYICAEWDF 120
Query: 127 GGIPVWLHYIPGTVFRNDT-EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEY 185
GG+P WL P V R+ + + + +M++++ ++K GGP+I+ QVENEY
Sbjct: 121 GGLPGWLLKDPSIVIRSSQGKAYMEAVDAWMSVLLPLVK--PFLYENGGPVIMVQVENEY 178
Query: 186 GYY------ESFYGEGGKRYALWAAKMAVAQNIG--VPWIMC----QQFDTPDPVINTCN 233
G Y + + RY L + + G + I C + T D NT
Sbjct: 179 GDYIHCDHQYMLHLQQLFRYHLTDDIILFTTDDGSNLTAIECGTLPSLYTTVDFGANTDP 238
Query: 234 SFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYM 293
S P + +E + GW +G R S+ +A ++ + SV N YM
Sbjct: 239 SIPFANQRKLQQKGPLVNSEFYTGWLDYWGTPHQTRTSKVVADALDKILALNASV-NLYM 297
Query: 294 YHGGTNFGRTAGGPF------ITTSYDYEAPIDEYGLPRNPKWGHLKELHG 338
+ GGTNFG +G F + TSYDY+AP+ E G K+ ++E+ G
Sbjct: 298 FEGGTNFGFWSGADFHGQYQPVPTSYDYDAPLTEAG-DLTEKYHAIREVIG 347
>gi|332375542|gb|AEE62912.1| unknown [Dendroctonus ponderosae]
Length = 454
Score = 142 bits (357), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 106/340 (31%), Positives = 156/340 (45%), Gaps = 46/340 (13%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG--- 93
+N + I S A+HY R W +++ + G+NT+E+YV WN HE GK+ FG
Sbjct: 36 LNDKLIKIYSGAMHYFRVPRPYWRDRLRKIRAAGLNTVETYVPWNLHEPENGKFDFGEGG 95
Query: 94 ----GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFK 149
+L +F+ ++ +++ILR GP++ +EYN GG P WL FR E +
Sbjct: 96 SEFEDFLHLEEFLNAAKEEDLFVILRTGPYICSEYNSGGFPSWLLREKPMGFRTSEENYM 155
Query: 150 YHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYES--------FYGEGGKRYAL 201
+ +F +++ ++ + GGP+I QVENEYG E+ Y E ++ L
Sbjct: 156 KFVTRFFNVVLTLLAAFQF--QLGGPVIAFQVENEYGNLENGAAFQPDKVYMEELRQLFL 213
Query: 202 WAAKMAVAQNIG----------VPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
+ + + +P + Q + D +N N ++F P P M
Sbjct: 214 KNGIVELLTSADSPLWKGTSGTLPGELFQTANFGDNAVNQLNK--LEEFQPGRPLMV--- 268
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--- 308
E W GWF GG + ED + F K S N YM+HGGTNF G
Sbjct: 269 MEYWIGWFDNVGGEHSVKSDEDSRRVLEDIFSKNASF-NAYMFHGGTNFWFNNGANLDND 327
Query: 309 ---------ITTSYDYEAPIDEYGLPRNPKWGHLKELHGA 339
ITTSYDY+API E G RN K+ +KEL A
Sbjct: 328 LMDNSGYTAITTSYDYDAPISESGGYRN-KYFIVKELVAA 366
>gi|347735403|ref|ZP_08868282.1| beta-galactosidase [Azospirillum amazonense Y2]
gi|346921388|gb|EGY02126.1| beta-galactosidase [Azospirillum amazonense Y2]
Length = 613
Score = 142 bits (357), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 101/334 (30%), Positives = 163/334 (48%), Gaps = 40/334 (11%)
Query: 29 TYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPG 88
T + +++G+ I++ +HYPR W +++ K G+NT+ +YVFWN HE +PG
Sbjct: 32 TTNGDHFLLDGQPLQIMAGELHYPRIARADWRDRLRKLKSLGLNTLSAYVFWNAHEKAPG 91
Query: 89 KYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPF 148
+Y F G +L ++ + Q+ ++++LR+GP+ AE++ G +P W VF +++
Sbjct: 92 RYDFTGNLDLSAWLALAQEEGLHVLLRVGPYACAEWDGGALPAW-------VFPDESVKA 144
Query: 149 KYHMQKFMTLIVDMMKR-----EKLFASQGGPIILAQVENEYG-------YYESFYGE-- 194
+ +M L +KR L +GGP+++ QVENEYG Y E+ +
Sbjct: 145 RSLDPTYMKLSGRWLKRLGQEVAHLEIDKGGPVLMTQVENEYGSFGQDHSYMEAVRDQIR 204
Query: 195 -GGKRYALWAAKMA-VAQNIGVPWIMCQ-QFDTPDPVINTCNSFYCDQFTPHSPSMPKIW 251
G AL+ A V + +P ++ F T D ++ S P+I
Sbjct: 205 SAGFDGALYTVDGASVIEKGALPSLINGINFGTTDKAEEEFK-----RYAAFKTSGPRIC 259
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--- 308
TE W GWF FG P+ + S+ + SV ++YM HGGT+FG AG F
Sbjct: 260 TELWGGWFDHFGEVHSAMPAPPLLDSLKWMLDRQISV-SFYMAHGGTSFGFDAGANFDRK 318
Query: 309 ------ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
+SYDY+A DE G P PK+ + E+
Sbjct: 319 TETYQPDISSYDYDALFDEAGRP-TPKFSAVLEV 351
>gi|313245457|emb|CBY40184.1| unnamed protein product [Oikopleura dioica]
Length = 620
Score = 142 bits (357), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 96/329 (29%), Positives = 154/329 (46%), Gaps = 34/329 (10%)
Query: 19 SITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYV 78
S+++ +++YDS++ + ++S ++HY R W + + K G+N + +YV
Sbjct: 1 SLSFRRRPSLSYDSKNFYLGEEPTQLLSGSVHYFRIPKKYWYDRLAKLKSAGLNGVTTYV 60
Query: 79 FWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPG 138
WN HE PG++ F G ++V FI I + +++ILR GP++ +E+ +GG+P WL
Sbjct: 61 PWNLHEPEPGEFSFSGELDIVHFINIARTLDLFVILRPGPYICSEWEWGGLPAWLLRDSF 120
Query: 139 TVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKR 198
R + + +++F ++ ++K ++ + GGPI+ QVENEYG Y G+ G
Sbjct: 121 MKVRTNYSGYITAVKRFFGQLIPLIKYQQ--SKYGGPIVAVQVENEYGMYA---GQDGAH 175
Query: 199 YALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCD----------------QFTP 242
A++ + I P D N N+ Y D
Sbjct: 176 LNT-LAELLKNEGIVEPLFTSDGSSVWD---NEKNTIYEDGLKSVNFKSNPEKHLKSLRG 231
Query: 243 HSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGR 302
H P P E W GWF +G + D ++ S+ N+YM+HGGTNFG
Sbjct: 232 HFPEQPLWVMEFWAGWFDWWGEGRNLFDNSDFQKNLDVILDHKASL-NFYMFHGGTNFGF 290
Query: 303 TAGGPFI--------TTSYDYEAPIDEYG 323
T GG I TSYDY+ PI E G
Sbjct: 291 TNGGLTIARGYYTADVTSYDYDCPISEAG 319
>gi|347967091|ref|XP_001689312.2| AGAP002056-PA [Anopheles gambiae str. PEST]
gi|333469762|gb|EDO63217.2| AGAP002056-PA [Anopheles gambiae str. PEST]
Length = 629
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 102/354 (28%), Positives = 169/354 (47%), Gaps = 38/354 (10%)
Query: 10 FALLIFFSSSITYCF-AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKE 68
FAL+ F++ + ++ YD+ + +++G+ ++ + HY R++P WP +++ +
Sbjct: 9 FALVFLFAAPRSVDMRLFSIDYDNDTFVMDGKPFQYVAGSFHYFRALPESWPSILRSMRA 68
Query: 69 GGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGG 128
G+N I +YV W+ H Y + G ++ F+++ A +Y+ILR GP++ AE + GG
Sbjct: 69 AGLNAITTYVEWSLHNPKEDVYNWQGMADIEHFLELADSAGLYVILRPGPYICAERDMGG 128
Query: 129 IPVW-LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYG 186
P W LH P + R T +Y +++ T ++ R ++ QGGPII+ QVENEYG
Sbjct: 129 FPSWLLHKYPDILLR--TNDLRY-LREVRTWYAQLLSRVQRFLVGQGGPIIMVQVENEYG 185
Query: 187 YYESFYG----------EGGKRYALWAAKMAV-----AQNIGVPWIMCQQFDTPDPVINT 231
SFY + +RY + A + + G + D +
Sbjct: 186 ---SFYACDHKYLNWLRDETERYVMGNAVLFTNNGPGLEGCGAIEHVLSSLDFGPGTEDE 242
Query: 232 CNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDI--AFSVARFFQKGGSVH 289
N F+ P P + E +PGW + ++PH D F +
Sbjct: 243 INGFWS-TLRKTQPKGPLVNAEYYPGWLTHW--QEPHMARTDTKPVVDSLDFMLRNKVNV 299
Query: 290 NYYMYHGGTNFGRTAGGPFI--------TTSYDYEAPIDEYGLPRNPKWGHLKE 335
N YM+ GGTN+G TAG + TSYDY+AP+DE G P PK+ L++
Sbjct: 300 NIYMFFGGTNYGFTAGANNMGAGGYAADLTSYDYDAPLDESGDP-TPKYFALRD 352
>gi|384512509|ref|YP_005707602.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|430358961|ref|ZP_19425649.1| beta-galactosidase [Enterococcus faecalis OG1X]
gi|327534398|gb|AEA93232.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|429513519|gb|ELA03099.1| beta-galactosidase [Enterococcus faecalis OG1X]
Length = 592
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 141/320 (44%), Gaps = 38/320 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ IIS AIHY R P W + K G NT+E+Y+ WN HE G Y F
Sbjct: 8 EDFLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ F+++ ++ + +ILR ++ AE+ +GG+P WL G R+ F +
Sbjct: 68 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKKKGVRLRSTDPIFMTKV 127
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ K L +QGGP+I+ QVENEYG Y G A + + +
Sbjct: 128 RNYFQVLLP--KLAPLQITQGGPVIMMQVENEYGSY-------GMEKAYLRQTKQIMEEL 178
Query: 213 GVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIWT 252
G+ + + V++ D F T H P +
Sbjct: 179 GIEVPLFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 238
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAG 305
E W GWF +G R D+A V G N YM+HGGTNFG R A
Sbjct: 239 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAK 296
Query: 306 GPFITTSYDYEAPIDEYGLP 325
TSYDY+A + E G P
Sbjct: 297 DLPQVTSYDYDALLTEAGEP 316
>gi|15837442|ref|NP_298130.1| beta-galactosidase [Xylella fastidiosa 9a5c]
gi|9105744|gb|AAF83650.1|AE003923_8 beta-galactosidase [Xylella fastidiosa 9a5c]
Length = 612
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 99/315 (31%), Positives = 145/315 (46%), Gaps = 32/315 (10%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
I +GR +IS AIH+ R W +Q+A+ G+NT+E+YVFWN EL G++ F
Sbjct: 34 QFIRDGRPYQLISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVELREGQFDFT 93
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G ++ F++ + +ILR GP+V AE+ GG P WL P R+ F Q
Sbjct: 94 GNNDISAFVREAASQGLNVILRPGPYVCAEWEAGGFPAWLFADPTLRVRSQDPRFLDASQ 153
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIG 213
+++ + ++ L GGPII QVENEYG Y +G Y + + +G
Sbjct: 154 RYLEALGTQVR--PLLNGNGGPIIAVQVENEYGSYGDDHG-----YLQAVRALFIKAGLG 206
Query: 214 VPWI-------MCQQFDTPDPVINTCN------SFYCDQFTPHSPSMPKIWTENWPGWFK 260
+ M PD V+ N D+ P P++ E W GWF
Sbjct: 207 GALLFTADGAQMLGNGTLPD-VLAAVNVAPGEAKQALDKLATFHPGQPQLVGEYWAGWFD 265
Query: 261 TFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF----------IT 310
+G ++ A + ++G S+ N YM+ GGT+FG G F T
Sbjct: 266 QWGKPHAQTDAKQQADEIEWMLRQGHSI-NLYMFVGGTSFGFMNGANFQGGPSDHYSPQT 324
Query: 311 TSYDYEAPIDEYGLP 325
TSYDY+A +DE G P
Sbjct: 325 TSYDYDAALDEAGRP 339
>gi|71731106|gb|EAO33173.1| Beta-galactosidase [Xylella fastidiosa subsp. sandyi Ann-1]
Length = 612
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 103/325 (31%), Positives = 152/325 (46%), Gaps = 31/325 (9%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
I +GR +IS AIH+ R W +Q+A+ G+NT+E+YVFWN EL G++ F
Sbjct: 34 QFIRDGRPYQLISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVELREGQFDFT 93
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G ++ F++ + +ILR GP+V AE+ GG P WL P R+ F Q
Sbjct: 94 GNNDIGAFVREAASQGLNVILRPGPYVCAEWEAGGFPAWLFADPTLRVRSQDPRFLDASQ 153
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALW------AAKMA 207
+++ + ++ L GGPII QVENEYG Y +G +AL+ A +
Sbjct: 154 RYLEALGTQVR--PLLNGNGGPIIAVQVENEYGSYGDDHGYLQAVHALFIKAGLGGALLF 211
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCN------SFYCDQFTPHSPSMPKIWTENWPGWFKT 261
A M PD V+ N D+ P P++ E W GWF
Sbjct: 212 TADGAQ----MLGNGTLPD-VLAAVNFAPGEAKQALDKLATFHPGQPQLVGEYWAGWFDQ 266
Query: 262 FGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF----------ITT 311
+G ++ A + ++G S+ N YM+ GGT+FG G F TT
Sbjct: 267 WGKPHAQTDAKQQADEIEWMLRQGHSI-NLYMFVGGTSFGFMNGANFQGGPGDHYSPQTT 325
Query: 312 SYDYEAPIDEYGLPRNPKWGHLKEL 336
SYDY+A +DE G P PK+ +++
Sbjct: 326 SYDYDAVLDEAGRPM-PKFALFRDV 349
>gi|29375402|ref|NP_814556.1| glycosyl hydrolase [Enterococcus faecalis V583]
gi|29342862|gb|AAO80626.1| glycosyl hydrolase, family 35 [Enterococcus faecalis V583]
Length = 592
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 141/320 (44%), Gaps = 38/320 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ IIS AIHY R P W + K G NT+E+Y+ WN HE G Y F
Sbjct: 8 EDFLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ F+++ ++ + +ILR ++ AE+ +GG+P WL G R+ F +
Sbjct: 68 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 127
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ K L +QGGP+I+ QVENEYG Y G A + + +
Sbjct: 128 RNYFQVLLP--KLAPLQITQGGPVIMMQVENEYGSY-------GMEKAYLRQTKQIMEEL 178
Query: 213 GVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIWT 252
G+ + + V++ D F T H P +
Sbjct: 179 GIEVPLFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 238
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAG 305
E W GWF +G R D+A V G N YM+HGGTNFG R A
Sbjct: 239 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAK 296
Query: 306 GPFITTSYDYEAPIDEYGLP 325
TSYDY+A + E G P
Sbjct: 297 DLPQVTSYDYDALLTEAGEP 316
>gi|422694237|ref|ZP_16752232.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
gi|315148319|gb|EFT92335.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
Length = 593
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 141/320 (44%), Gaps = 38/320 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ IIS AIHY R P W + K G NT+E+Y+ WN HE G Y F
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ F+++ ++ + +ILR ++ AE+ +GG+P WL G R+ F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ K L +QGGP+I+ QVENEYG Y G A + + +
Sbjct: 129 RNYFQVLLP--KLAPLQITQGGPVIMMQVENEYGSY-------GMEKAYLRQTKQIMEEL 179
Query: 213 GVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIWT 252
G+ + + V++ D F T H P +
Sbjct: 180 GIEVPLFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF---- 308
E W GWF +G HR D+A V G N YM+HGGTNFG G
Sbjct: 240 EYWDGWFNRWGEPVIHREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGEK 297
Query: 309 ---ITTSYDYEAPIDEYGLP 325
TSYDY+A + E G P
Sbjct: 298 DLPQVTSYDYDALLTEAGEP 317
>gi|260592848|ref|ZP_05858306.1| beta-galactosidase [Prevotella veroralis F0319]
gi|260535218|gb|EEX17835.1| beta-galactosidase [Prevotella veroralis F0319]
Length = 621
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 106/347 (30%), Positives = 156/347 (44%), Gaps = 48/347 (13%)
Query: 21 TYCFA-GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVF 79
T+ A GN YD + + I+ S +HY R W ++ K G+N + +Y+F
Sbjct: 28 TFAIANGNFIYDGKPIQIH-------SGEMHYARVPAPYWRHRMKMMKAMGLNAVATYIF 80
Query: 80 WNGHELSPGKY-YFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPG 138
WN HE SPG + + G NL +FIK + + +ILR GP+ AE+ +GG P WL
Sbjct: 81 WNHHETSPGVWDWTTGTHNLRQFIKTAGEEGLMVILRPGPYCCAEWEFGGYPWWLPKAKD 140
Query: 139 TVFRNDTEPF----KYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY----ES 190
V R D +PF + ++ + ++D+ +QGGP+I+ Q ENE+G Y +
Sbjct: 141 LVIRTDNKPFLDSCRVYINQLAKQVLDLQ------VTQGGPVIMVQAENEFGSYVAQRKD 194
Query: 191 FYGEGGKRYALWAAKMAVAQNIGVPWIMCQ--------QFDTPDPVINTCNSFYCDQFTP 242
E KRYA + + VP + P N D+
Sbjct: 195 IPLETHKRYAAQIRQQLLDAGFTVPMFTSDGSWLFKGGAIEGALPTANGEGDI--DKLKK 252
Query: 243 -----HSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGG 297
H P + E +PGW + P +E + ++ G S NYYM HGG
Sbjct: 253 VVNEYHGGVGPYMVAEFYPGWLSHWAEPFPRVSTESVVKQTKKYLDNGISF-NYYMVHGG 311
Query: 298 TNFGRTAGGPFIT--------TSYDYEAPIDEYGLPRNPKWGHLKEL 336
TNFG +AG + TSYDY+API E G PK+ L++L
Sbjct: 312 TNFGFSAGANYSNATNIQPDMTSYDYDAPISEAGWA-TPKYNALRDL 357
>gi|357050580|ref|ZP_09111778.1| hypothetical protein HMPREF9478_01761 [Enterococcus saccharolyticus
30_1]
gi|355381233|gb|EHG28360.1| hypothetical protein HMPREF9478_01761 [Enterococcus saccharolyticus
30_1]
Length = 593
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 105/321 (32%), Positives = 149/321 (46%), Gaps = 41/321 (12%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG ++S AIHY R P W + K G NT+E+YV WN HE G + F
Sbjct: 8 EEFLMNGSPFKLLSGAIHYFRVHPDDWRHSLYNLKALGFNTVETYVPWNLHEPHKGLFQF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L F+ + Q+ +Y+ILR P++ AE+ +GG+P WL G + D +
Sbjct: 68 EGILDLEHFLSLAQELGLYVILRPSPYICAEWEFGGLPAWLLKESGRLRACDPSYLAHVA 127
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + L+ ++ + S GG I++ QVENEYG YGE K Y +M + + I
Sbjct: 128 EYYDVLLPKIIPYQ---LSHGGNILMIQVENEYGS----YGE-EKAYLRAIKEMLINRGI 179
Query: 213 GVPWIMCQQFDTP------------DPVINTCN---------SFYCDQFTPHSPSMPKIW 251
+P D P D V+ T N + D F H+ P +
Sbjct: 180 DMPLFTS---DGPWQAALRAGSLIEDDVLVTGNFGSRAKENFAAMQDFFDQHNKKWPLMC 236
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTA 304
E W GWF + R +D+A SV + G N YM+HGGTNFG R A
Sbjct: 237 MEFWDGWFNRWNEPIIRRDPDDLAESVKEALEIGSV--NLYMFHGGTNFGFMNGCSARGA 294
Query: 305 GGPFITTSYDYEAPIDEYGLP 325
TSYDY+AP+DE G P
Sbjct: 295 VDLPQVTSYDYDAPLDEQGNP 315
>gi|256761574|ref|ZP_05502154.1| beta-galactosidase [Enterococcus faecalis T3]
gi|422736227|ref|ZP_16792491.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
gi|256682825|gb|EEU22520.1| beta-galactosidase [Enterococcus faecalis T3]
gi|315166978|gb|EFU10995.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
Length = 593
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 141/320 (44%), Gaps = 38/320 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ IIS AIHY R P W + K G NT+E+Y+ WN HE G Y F
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ F+++ ++ + +ILR ++ AE+ +GG+P WL G R+ F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ K L +QGGP+I+ QVENEYG Y G A + + +
Sbjct: 129 RNYFQVLLP--KLAPLQITQGGPVIMMQVENEYGSY-------GMEKAYLRQTKQIMEEL 179
Query: 213 GVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIWT 252
G+ + + V++ D F T H P +
Sbjct: 180 GIEVPLFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAG 305
E W GWF +G R D+A V G N YM+HGGTNFG R A
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAK 297
Query: 306 GPFITTSYDYEAPIDEYGLP 325
TSYDY+A + E G P
Sbjct: 298 DLPQVTSYDYDALLTEAGEP 317
>gi|158301280|ref|XP_550752.3| AGAP002055-PA [Anopheles gambiae str. PEST]
gi|157012394|gb|EAL38488.3| AGAP002055-PA [Anopheles gambiae str. PEST]
Length = 657
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 95/320 (29%), Positives = 159/320 (49%), Gaps = 25/320 (7%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+ Y+ + +++G+ ++ + HY R++P W ++ + GG+N ++ YV W+ H
Sbjct: 45 IDYERDTFVMDGKDFRYVAGSFHYFRALPETWRTKLRTLRAGGLNAVDLYVQWSLHNPRD 104
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWL-HYIPGTVFRNDTE 146
G Y + G N+ I+ + +Y+ILR GP++ AE + GG+P WL + PG R
Sbjct: 105 GVYNWEGIANVTDIIEAAIEEDLYVILRPGPYICAEIDNGGLPYWLFNKYPGIAVRTSDA 164
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGY-------YESFYGEGGKRY 199
+ ++K+ + M + E GGPII+ Q+ENEYG Y +F + +RY
Sbjct: 165 NYLEEVRKWYGEL--MSRMEPYMYGNGGPIIMVQIENEYGAFGKCDKPYLNFLKQQTERY 222
Query: 200 ALWAAKMAVAQNIGVPWIMCQQFD----TPDPVINTCNSF--YCDQFTPHSPSMPKIWTE 253
A + I C Q D T D + T + + + P P + TE
Sbjct: 223 VQDKAVLFTVDRPYDDEIGCGQIDGVFITTDFGLMTEEEVDTHAAKVRSYQPKGPLVNTE 282
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG------GP 307
+ GW + + RP++ +A ++ + + G +V ++YMY GGTNFG AG G
Sbjct: 283 FYTGWLTHWQESNQRRPAQPLAATLRKMLRDGWNV-DFYMYFGGTNFGFWAGANDWGLGK 341
Query: 308 FIT--TSYDYEAPIDEYGLP 325
++ TSYDY+AP+DE G P
Sbjct: 342 YMADITSYDYDAPMDEAGDP 361
>gi|257418414|ref|ZP_05595408.1| beta-galactosidase [Enterococcus faecalis T11]
gi|257160242|gb|EEU90202.1| beta-galactosidase [Enterococcus faecalis T11]
Length = 592
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 141/320 (44%), Gaps = 38/320 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ IIS AIHY R P W + K G NT+E+Y+ WN HE G Y F
Sbjct: 8 EDFLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ F+++ ++ + +ILR ++ AE+ +GG+P WL G R+ F +
Sbjct: 68 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 127
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ K L +QGGP+I+ QVENEYG Y G A + + +
Sbjct: 128 RNYFQVLLP--KLAPLQITQGGPVIMMQVENEYGSY-------GMEKAYLRQTKQIMEEL 178
Query: 213 GVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIWT 252
G+ + + V++ D F T H P +
Sbjct: 179 GIEVPLFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 238
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAG 305
E W GWF +G R D+A V G N YM+HGGTNFG R A
Sbjct: 239 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAK 296
Query: 306 GPFITTSYDYEAPIDEYGLP 325
TSYDY+A + E G P
Sbjct: 297 DLPQVTSYDYDALLTEAGEP 316
>gi|312901648|ref|ZP_07760918.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
gi|311291259|gb|EFQ69815.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
Length = 593
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 141/320 (44%), Gaps = 38/320 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ IIS AIHY R P W + K G NT+E+Y+ WN HE G Y F
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ F+++ ++ + +ILR ++ AE+ +GG+P WL G R+ F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ K L +QGGP+I+ QVENEYG Y G A + + +
Sbjct: 129 RNYFQVLLP--KLAPLQITQGGPVIMMQVENEYGSY-------GMEKAYLRQTKQIMEEL 179
Query: 213 GVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIWT 252
G+ + + V++ D F T H P +
Sbjct: 180 GIEVPLFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAG 305
E W GWF +G R D+A V G N YM+HGGTNFG R A
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAK 297
Query: 306 GPFITTSYDYEAPIDEYGLP 325
TSYDY+A + E G P
Sbjct: 298 DLPQVTSYDYDALLTEAGEP 317
>gi|397699203|ref|YP_006536991.1| beta-galactosidase [Enterococcus faecalis D32]
gi|397335842|gb|AFO43514.1| beta-galactosidase [Enterococcus faecalis D32]
Length = 593
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 141/320 (44%), Gaps = 38/320 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ IIS AIHY R P W + K G NT+E+Y+ WN HE G Y F
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ F+++ ++ + +ILR ++ AE+ +GG+P WL G R+ F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ K L +QGGP+I+ QVENEYG Y G A + + +
Sbjct: 129 RNYFQVLLP--KLAPLQITQGGPVIMMQVENEYGSY-------GMEKAYLRQTKQIMEEL 179
Query: 213 GVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIWT 252
G+ + + V++ D F T H P +
Sbjct: 180 GIEVPLFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLRKFMTRHGKKWPLMCM 239
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAG 305
E W GWF +G R D+A V G N YM+HGGTNFG R A
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAK 297
Query: 306 GPFITTSYDYEAPIDEYGLP 325
TSYDY+A + E G P
Sbjct: 298 DLPQVTSYDYDALLTEAGEP 317
>gi|348573621|ref|XP_003472589.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
3-like [Cavia porcellus]
Length = 679
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 101/321 (31%), Positives = 154/321 (47%), Gaps = 29/321 (9%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
+ G + LI +IHY R W + + K G NT+ +Y+ WN HE GK+ F G
Sbjct: 104 LEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYIPWNLHEPQRGKFVFSGNL 163
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
+L F+ + + +++ILR GP++ AE + GG+P WL P T R F + +
Sbjct: 164 DLEAFVLLAAEIGLWVILRPGPYICAEIDLGGLPSWLLQNPKTQLRTTERTFVDAVDAYF 223
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPW 216
+ M + L GGP+I QVENEYG SF +G +Y + + + + I
Sbjct: 224 DHL--MRRMVPLQYHHGGPVIAVQVENEYG---SFNRDG--QYMAYLKEALLKRGIVELL 276
Query: 217 IMCQQFD-----TPDPVINTC-------NSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
C + + V+ T NSFY Q P + E W GW+ ++G
Sbjct: 277 FTCDYYKDVVNGSLKGVLATVNLGSLGKNSFY--QLLQVQSHKPILIMEYWVGWYDSWGL 334
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNF------GRTAGGPFITTSYDYEAP 318
++ + ++A +V+ F + G S N YM+HGGTNF G G +TTSYDY+A
Sbjct: 335 PHANKSAAEVAHTVSTFIKNGISF-NVYMFHGGTNFGFINAAGIVEGRRSVTTSYDYDAV 393
Query: 319 IDEYGLPRNPKWGHLKELHGA 339
+ E G K+ L+EL G+
Sbjct: 394 LSEAG-DYTEKYFKLRELLGS 413
>gi|28199702|ref|NP_780016.1| beta-galactosidase [Xylella fastidiosa Temecula1]
gi|182682446|ref|YP_001830606.1| beta-galactosidase [Xylella fastidiosa M23]
gi|386083781|ref|YP_006000063.1| Beta-galactosidase [Xylella fastidiosa subsp. fastidiosa GB514]
gi|417557800|ref|ZP_12208811.1| Beta-galactosidase [Xylella fastidiosa EB92.1]
gi|28057823|gb|AAO29665.1| beta-galactosidase [Xylella fastidiosa Temecula1]
gi|182632556|gb|ACB93332.1| Beta-galactosidase [Xylella fastidiosa M23]
gi|307578728|gb|ADN62697.1| Beta-galactosidase [Xylella fastidiosa subsp. fastidiosa GB514]
gi|338179583|gb|EGO82518.1| Beta-galactosidase [Xylella fastidiosa EB92.1]
Length = 612
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 101/326 (30%), Positives = 151/326 (46%), Gaps = 33/326 (10%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
I +GR +IS AIH+ R W +Q+A+ G+NT+E+YVFWN EL G++ F
Sbjct: 34 QFIRDGRPYQLISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVELREGQFDFT 93
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G ++ F++ + +ILR GP+V AE+ GG P WL P R+ F Q
Sbjct: 94 GNNDIGAFVREAASQGLNVILRPGPYVCAEWEAGGFPAWLFADPTLRVRSQDPRFLDASQ 153
Query: 154 KFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIG 213
+++ + ++ L GGPII QVENEYG Y +G Y + + +G
Sbjct: 154 RYLEALGTQVR--PLLNGNGGPIIAVQVENEYGSYGDDHG-----YLQAVRALFIKAGLG 206
Query: 214 VPWI-------MCQQFDTPDPVINTCN------SFYCDQFTPHSPSMPKIWTENWPGWFK 260
+ M PD V+ N D+ P P++ E W GWF
Sbjct: 207 GALLFTADGAQMLGNGTLPD-VLAAVNVAPGEAKQALDKLATFHPGQPQLVGEYWAGWFD 265
Query: 261 TFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF----------IT 310
+G ++ A + ++G S+ N YM+ GGT+FG G F T
Sbjct: 266 QWGKPHAQTDAKQQADEIEWMLRQGHSI-NLYMFVGGTSFGFMNGANFQGGPSDHYSPQT 324
Query: 311 TSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+A +DE G P PK+ +++
Sbjct: 325 TSYDYDAVLDEAGRPM-PKFALFRDV 349
>gi|313237463|emb|CBY12650.1| unnamed protein product [Oikopleura dioica]
Length = 583
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 96/336 (28%), Positives = 153/336 (45%), Gaps = 53/336 (15%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+T D + ++G+ I+S AIHY R W +Q + G+NTI+ Y+ WN HE
Sbjct: 8 LTADGDTFKLDGKDFRILSGAIHYFRIPKQSWKHRLQSVVDCGLNTIDVYIPWNLHEKER 67
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G + F G +LV+F I + + ++ R GP++ +E+++GG+P WL P R++
Sbjct: 68 GNFDFAGELDLVEFFTIAAEMGLKVLCRPGPYICSEWDWGGLPSWLLKDPKMHIRSNYCG 127
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYES----------------- 190
++ + + + ++ ++ L S GGPII QVENEYG Y
Sbjct: 128 YQAAVSSYFSKLLPLLA--PLQHSNGGPIIAFQVENEYGDYVDKDNEHLPWLADLMKSHG 185
Query: 191 -----FYGEGG---KRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTP 242
F +GG ++ + + N G ++ + F
Sbjct: 186 LFELFFISDGGHTIRKANMLKVRSTAQLNSGSFQLLAKAF----------------SLKS 229
Query: 243 HSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGR 302
P+ P + TE W GWF +G +E ++ ++G SV N+YM+HGGTNFG
Sbjct: 230 LQPNKPMLVTEFWAGWFDYWGHGRNLLNNEVFEKTLKEILKRGASV-NFYMFHGGTNFGF 288
Query: 303 TAGGPFI--------TTSYDYEAPIDEYGLPRNPKW 330
G + TSYDY+ P+DE G R KW
Sbjct: 289 MNGAIELEKGYYTADVTSYDYDCPVDESG-NRTEKW 323
>gi|311281324|ref|YP_003943555.1| glycoside hydrolase [Enterobacter cloacae SCF1]
gi|308750519|gb|ADO50271.1| glycoside hydrolase family 35 [Enterobacter cloacae SCF1]
Length = 591
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 91/315 (28%), Positives = 148/315 (46%), Gaps = 30/315 (9%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
++L+ +G+ +IS AIHY R VP W + K G N +E+Y+ WN H+ P ++
Sbjct: 7 EKNLLQDGKPVQLISGAIHYFRLVPQYWEHSLNNLKALGANCVETYLPWNIHQPDPERFC 66
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G ++ +FI + Q+ +++ILR P++ AE+ +GG+P WL P R+ F
Sbjct: 67 FTGMADVERFIALAQRKGLFVILRPSPYICAEWEFGGLPAWLLRDPSMRVRSSQPAFLQA 126
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
++++ ++ + + +GGP+++ Q+ENEYG + + K Y A M
Sbjct: 127 VERYYAELLPRLAPWQY--DRGGPVVMMQLENEYGSFGN-----DKAYLRTLAAMMRRYG 179
Query: 212 IGVP-------WIMCQQFDT--PDPVINTCN-----SFYCDQFTPHSPSMPKIWTENWPG 257
+ VP W Q + D V+ T N + D P P + E W G
Sbjct: 180 VSVPLFTSDGAWQEALQAGSLCEDNVLATANFGSRSAESLDNLAAFQPERPLMCLEFWNG 239
Query: 258 WFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF-------IT 310
WF +G R ++D+ + + N YM+ GGTNFG G
Sbjct: 240 WFNRYGDAIIRRDADDVGQEIRTLLTRASI--NIYMFQGGTNFGFMNGCSVRGDKDLPQV 297
Query: 311 TSYDYEAPIDEYGLP 325
TSYDY+A + E+G P
Sbjct: 298 TSYDYDALLSEWGEP 312
>gi|227554928|ref|ZP_03984975.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|422713751|ref|ZP_16770500.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|422716430|ref|ZP_16773136.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|227175936|gb|EEI56908.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|315575268|gb|EFU87459.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|315581351|gb|EFU93542.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
Length = 593
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 141/320 (44%), Gaps = 38/320 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ IIS AIHY R P W + K G NT+E+Y+ WN HE G Y F
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ F+++ ++ + +ILR ++ AE+ +GG+P WL G R+ F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ K L +QGGP+I+ QVENEYG Y G A + + +
Sbjct: 129 RNYFQVLLP--KLAPLQITQGGPVIMMQVENEYGSY-------GMEKAYLRQTKQIMEEL 179
Query: 213 GVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIWT 252
G+ + + V++ D F T H P +
Sbjct: 180 GIEVPLFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAG 305
E W GWF +G R D+A V G N YM+HGGTNFG R A
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAK 297
Query: 306 GPFITTSYDYEAPIDEYGLP 325
TSYDY+A + E G P
Sbjct: 298 DLPQVTSYDYDALLTEAGEP 317
>gi|375359947|ref|YP_005112719.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
gi|301164628|emb|CBW24187.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
Length = 769
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 104/333 (31%), Positives = 160/333 (48%), Gaps = 33/333 (9%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A N T + ++NG+ + +A +HY R W ++ K G+NTI YVFWN HE
Sbjct: 18 AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
+ G++ F G+ ++ F ++ Q+ MY+I+R GP+V AE+ GG+P WL V R
Sbjct: 78 QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRT- 136
Query: 145 TEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
+P Y M++ + ++ K+ L ++GG II+ QVENEYG Y K Y +
Sbjct: 137 LDP--YFMERTAIFMKEVGKQLAPLQITRGGNIIMVQVENEYGAYAV-----DKPYV--S 187
Query: 204 AKMAVAQNIG---VPWIMCQQFDTPDP--------VINTCNSFYCDQ----FTPHSPSMP 248
A + ++ G VP C T D IN +Q P P
Sbjct: 188 AIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGANIEQQFKRLKEARPETP 247
Query: 249 KIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-- 306
+ +E W GWF +G + RP++ + + + S + YM HGGT FG G
Sbjct: 248 LMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANN 306
Query: 307 ---PFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
+ +SYDY+API E G + K+ L++L
Sbjct: 307 PSYSAMCSSYDYDAPISEPGWTTD-KYFQLRDL 338
>gi|398787680|ref|ZP_10550020.1| beta-galactosidase [Streptomyces auratus AGR0001]
gi|396992782|gb|EJJ03876.1| beta-galactosidase [Streptomyces auratus AGR0001]
Length = 603
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 95/319 (29%), Positives = 151/319 (47%), Gaps = 31/319 (9%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
G +T + +++G+ I+S A HY R+ P W + + + G+NT+E+YV WN H+
Sbjct: 25 GGLTIRGKEFLLDGKPFRILSGAFHYFRTHPQDWRDRLMRMRAMGLNTVETYVAWNFHQP 84
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
+ F G ++V F++ + + +I+R GP++ AE+++GG+P WL R
Sbjct: 85 DEKEADFTGWRDVVAFVRTADEVGLKVIVRPGPYICAEWDFGGLPAWLLKDKDAPLRRSD 144
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
F+ + + + + + L A++GGPII QVENEYG Y G +A
Sbjct: 145 PAFERAVDAWFAEL--LPRFVDLQATRGGPIIAMQVENEYGSY-------GDDHAYLEHL 195
Query: 206 MAVAQNIGVPWIM-CQQFDTPD-------PVINTCNSFYCDQFTPHS------PSMPKIW 251
+ G+ ++ C T + P + + +F D P + P P
Sbjct: 196 RDTMRAQGIDGLLFCSNGATQEALKAGSLPDLLSTVNFGGDPTGPFAELRAFQPDKPLFC 255
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--- 308
TE W GWF +G R A V + + G S+ N+YM GGTNFG +AG
Sbjct: 256 TEFWDGWFDHWGERHRTTDPAQTAADVEKMLEAGASI-NFYMAVGGTNFGWSAGANLSGS 314
Query: 309 ----ITTSYDYEAPIDEYG 323
TSYDY++PI E G
Sbjct: 315 GYQPTVTSYDYDSPISESG 333
>gi|60683116|ref|YP_213260.1| glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
gi|60494550|emb|CAH09349.1| putative glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
Length = 769
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 104/333 (31%), Positives = 160/333 (48%), Gaps = 33/333 (9%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A N T + ++NG+ + +A +HY R W ++ K G+NTI YVFWN HE
Sbjct: 18 AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
+ G++ F G+ ++ F ++ Q+ MY+I+R GP+V AE+ GG+P WL V R
Sbjct: 78 QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRT- 136
Query: 145 TEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
+P Y M++ + ++ K+ L ++GG II+ QVENEYG Y K Y +
Sbjct: 137 LDP--YFMERTAIFMKEVGKQLAPLQITRGGNIIMVQVENEYGAYAV-----DKPYV--S 187
Query: 204 AKMAVAQNIG---VPWIMCQQFDTPDP--------VINTCNSFYCDQ----FTPHSPSMP 248
A + ++ G VP C T D IN +Q P P
Sbjct: 188 AIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGANIEQQFKRLKEARPETP 247
Query: 249 KIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-- 306
+ +E W GWF +G + RP++ + + + S + YM HGGT FG G
Sbjct: 248 LMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANN 306
Query: 307 ---PFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
+ +SYDY+API E G + K+ L++L
Sbjct: 307 PSYSAMCSSYDYDAPISEPGWTTD-KYFQLRDL 338
>gi|423260608|ref|ZP_17241530.1| hypothetical protein HMPREF1055_03807 [Bacteroides fragilis
CL07T00C01]
gi|423266742|ref|ZP_17245744.1| hypothetical protein HMPREF1056_03431 [Bacteroides fragilis
CL07T12C05]
gi|387775162|gb|EIK37271.1| hypothetical protein HMPREF1055_03807 [Bacteroides fragilis
CL07T00C01]
gi|392699974|gb|EIY93143.1| hypothetical protein HMPREF1056_03431 [Bacteroides fragilis
CL07T12C05]
Length = 769
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 104/333 (31%), Positives = 160/333 (48%), Gaps = 33/333 (9%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A N T + ++NG+ + +A +HY R W ++ K G+NTI YVFWN HE
Sbjct: 18 AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
+ G++ F G+ ++ F ++ Q+ MY+I+R GP+V AE+ GG+P WL V R
Sbjct: 78 QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRT- 136
Query: 145 TEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
+P Y M++ + ++ K+ L ++GG II+ QVENEYG Y K Y +
Sbjct: 137 LDP--YFMERTAIFMKEVGKQLAPLQITRGGNIIMVQVENEYGAYAV-----DKPYV--S 187
Query: 204 AKMAVAQNIG---VPWIMCQQFDTPDP--------VINTCNSFYCDQ----FTPHSPSMP 248
A + ++ G VP C T D IN +Q P P
Sbjct: 188 AIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGANIEQQFKRLREARPETP 247
Query: 249 KIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-- 306
+ +E W GWF +G + RP++ + + + S + YM HGGT FG G
Sbjct: 248 LMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANN 306
Query: 307 ---PFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
+ +SYDY+API E G + K+ L++L
Sbjct: 307 PSYSAMCSSYDYDAPISEPGWTTD-KYFQLRDL 338
>gi|324509196|gb|ADY43870.1| Beta-galactosidase [Ascaris suum]
Length = 639
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 97/334 (29%), Positives = 163/334 (48%), Gaps = 31/334 (9%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
FA LI F S F+ + Y ++ +++G+ IS +IHY R P W + + +
Sbjct: 13 FAFLIIFPSLAENSFS--IDYVNKRFLLDGQPFRYISGSIHYFRVHPDQWNDRLSRMRAA 70
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G+N I+ Y+ WN HE+ G F G N+ +F+ + Q +Y ++RIGP++ E+ GG+
Sbjct: 71 GLNAIQFYIPWNFHEIYEGVIGFDGGRNITRFLSLAAQNELYALVRIGPYICGEWENGGL 130
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
P WL R + F ++++ +++ ++K GGPI++ QVENEYG
Sbjct: 131 PWWLLKYDDIKMRTSDKRFIRAVERWFGVLLPILKPS--LRKNGGPILMIQVENEYG--- 185
Query: 190 SFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNS----FYCDQFTPHS- 244
SF ++Y + + + +++G ++ D + C S F F P+S
Sbjct: 186 SFTEGCDRKYTTFLRDLTI-KHLGDDVVLYTT-DGANNQSLKCGSIPGVFATVDFGPNSE 243
Query: 245 --------------PSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHN 290
P+ P + +E +PGW T+ + PS D + +++ K G+ N
Sbjct: 244 EQIDKNFATQRSYEPNGPLVNSEFYPGWIVTWSQKGRIDPSVDEIINGSKYMFKLGASFN 303
Query: 291 YYMYHGGTNFGRTAGG---PFITTSYDYEAPIDE 321
YYM++GGTNF G + TSYDY AP+ E
Sbjct: 304 YYMFYGGTNFAFWNGAETTSAVITSYDYFAPLTE 337
>gi|257083732|ref|ZP_05578093.1| beta-galactosidase [Enterococcus faecalis Fly1]
gi|256991762|gb|EEU79064.1| beta-galactosidase [Enterococcus faecalis Fly1]
Length = 593
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 141/320 (44%), Gaps = 38/320 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ IIS AIHY R P W + K G NT+E+Y+ WN HE G Y F
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ F+++ ++ + +ILR ++ AE+ +GG+P WL G R+ F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ K L +QGGP+I+ QVENEYG Y G A + + +
Sbjct: 129 RNYFQVLLP--KLSPLQITQGGPVIMMQVENEYGSY-------GMEKAYLQQTKQIMEEL 179
Query: 213 GVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIWT 252
G+ + + V++ D F T H P +
Sbjct: 180 GIEVPLFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLRKFMTRHGKKWPLMCM 239
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAG 305
E W GWF +G R D+A V G N YM+HGGTNFG R A
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAK 297
Query: 306 GPFITTSYDYEAPIDEYGLP 325
TSYDY+A + E G P
Sbjct: 298 DLPQVTSYDYDALLTEAGEP 317
>gi|423285593|ref|ZP_17264475.1| hypothetical protein HMPREF1204_04013 [Bacteroides fragilis HMW
615]
gi|404579108|gb|EKA83826.1| hypothetical protein HMPREF1204_04013 [Bacteroides fragilis HMW
615]
Length = 769
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 104/333 (31%), Positives = 160/333 (48%), Gaps = 33/333 (9%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A N T + ++NG+ + +A +HY R W ++ K G+NTI YVFWN HE
Sbjct: 18 AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
+ G++ F G+ ++ F ++ Q+ MY+I+R GP+V AE+ GG+P WL V R
Sbjct: 78 QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRT- 136
Query: 145 TEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
+P Y M++ + ++ K+ L ++GG II+ QVENEYG Y K Y +
Sbjct: 137 LDP--YFMERTAIFMKEVGKQLAPLQITRGGNIIMVQVENEYGAYAV-----DKPYV--S 187
Query: 204 AKMAVAQNIG---VPWIMCQQFDTPDP--------VINTCNSFYCDQ----FTPHSPSMP 248
A + ++ G VP C T D IN +Q P P
Sbjct: 188 AIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGANIEQQFKRLREARPETP 247
Query: 249 KIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-- 306
+ +E W GWF +G + RP++ + + + S + YM HGGT FG G
Sbjct: 248 LMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANN 306
Query: 307 ---PFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
+ +SYDY+API E G + K+ L++L
Sbjct: 307 PSYSAMCSSYDYDAPISEPGWTTD-KYFQLRDL 338
>gi|265767009|ref|ZP_06094838.1| beta-galactosidase [Bacteroides sp. 2_1_16]
gi|263253386|gb|EEZ24862.1| beta-galactosidase [Bacteroides sp. 2_1_16]
Length = 769
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 104/333 (31%), Positives = 160/333 (48%), Gaps = 33/333 (9%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A N T + ++NG+ + +A +HY R W ++ K G+NTI YVFWN HE
Sbjct: 18 AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
+ G++ F G+ ++ F ++ Q+ MY+I+R GP+V AE+ GG+P WL V R
Sbjct: 78 QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRT- 136
Query: 145 TEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
+P Y M++ + ++ K+ L ++GG II+ QVENEYG Y K Y +
Sbjct: 137 LDP--YFMERTAIFMKEVGKQLAPLQITRGGNIIMVQVENEYGAYAV-----DKPYV--S 187
Query: 204 AKMAVAQNIG---VPWIMCQQFDTPDP--------VINTCNSFYCDQ----FTPHSPSMP 248
A + ++ G VP C T D IN +Q P P
Sbjct: 188 AIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGANIEQQFKRLKEARPETP 247
Query: 249 KIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-- 306
+ +E W GWF +G + RP++ + + + S + YM HGGT FG G
Sbjct: 248 LMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANN 306
Query: 307 ---PFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
+ +SYDY+API E G + K+ L++L
Sbjct: 307 PSYSAMCSSYDYDAPISEPGWTTD-KYFQLRDL 338
>gi|228918502|ref|ZP_04081945.1| Beta-galactosidase [Bacillus thuringiensis serovar pulsiensis BGSC
4CC1]
gi|228841118|gb|EEM86317.1| Beta-galactosidase [Bacillus thuringiensis serovar pulsiensis BGSC
4CC1]
Length = 591
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 99/323 (30%), Positives = 153/323 (47%), Gaps = 35/323 (10%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
+ +++G IIS A+HY R VP W + K G NT+E+YV WN HE G + F
Sbjct: 8 KDFMLDGEPIKIISGALHYFRIVPEYWDHSLYNLKALGCNTVETYVPWNMHEPKEGVFNF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +LVK++++ Q+ + +ILR P++ AE+ +GG+P WL R++T F +
Sbjct: 68 EGIADLVKYVQLAQKYGLMVILRPTPYICAEWEFGGLPAWLLKYRDIRVRSNTNLFLNKV 127
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY--ESFYGEGGKRY----------- 199
+ F +++ ++ L GGPII+ QVENEYG + + Y K+
Sbjct: 128 ENFYKVLLPLVT--SLQVENGGPIIMMQVENEYGSFGNDKEYVRSIKKLMRDLGVTVPLF 185
Query: 200 ---ALWAAKMAVAQNIGVPWIMCQQFDT-PDPVINTCNSFYCDQFTPHSPSMPKIWTENW 255
W + I ++ F + + +N SF + P + E W
Sbjct: 186 TSDGAWQEALESGSLIDDDVLVTGNFGSRSNENLNALESF----IKENKKEWPLMCMEFW 241
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--------P 307
GWF +G R S ++A V ++ N+YM+ GGTNFG G P
Sbjct: 242 DGWFNRWGMEIIRRDSSELAEEVKELLKRASI--NFYMFQGGTNFGFMNGCSSRENVDLP 299
Query: 308 FITTSYDYEAPIDEYGLPRNPKW 330
I TSYDY+A + E+G P PK+
Sbjct: 300 QI-TSYDYDALLTEWGEP-TPKY 320
>gi|53715181|ref|YP_101173.1| beta-galactosidase [Bacteroides fragilis YCH46]
gi|52218046|dbj|BAD50639.1| beta-galactosidase precursor [Bacteroides fragilis YCH46]
Length = 769
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 104/333 (31%), Positives = 160/333 (48%), Gaps = 33/333 (9%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A N T + ++NG+ + +A +HY R W ++ K G+NTI YVFWN HE
Sbjct: 18 AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
+ G++ F G+ ++ F ++ Q+ MY+I+R GP+V AE+ GG+P WL V R
Sbjct: 78 QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRT- 136
Query: 145 TEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
+P Y M++ + ++ K+ L ++GG II+ QVENEYG Y K Y +
Sbjct: 137 LDP--YFMERTAIFMKEVGKQLAPLQITRGGNIIMVQVENEYGAYAV-----DKPYV--S 187
Query: 204 AKMAVAQNIG---VPWIMCQQFDTPDP--------VINTCNSFYCDQ----FTPHSPSMP 248
A + ++ G VP C T D IN +Q P P
Sbjct: 188 AIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGANIEQQFKRLREARPETP 247
Query: 249 KIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-- 306
+ +E W GWF +G + RP++ + + + S + YM HGGT FG G
Sbjct: 248 LMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANN 306
Query: 307 ---PFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
+ +SYDY+API E G + K+ L++L
Sbjct: 307 PSYSAMCSSYDYDAPISEPGWTTD-KYFQLRDL 338
>gi|423251759|ref|ZP_17232772.1| hypothetical protein HMPREF1066_03782 [Bacteroides fragilis
CL03T00C08]
gi|423255080|ref|ZP_17236010.1| hypothetical protein HMPREF1067_02654 [Bacteroides fragilis
CL03T12C07]
gi|392649184|gb|EIY42863.1| hypothetical protein HMPREF1066_03782 [Bacteroides fragilis
CL03T00C08]
gi|392652521|gb|EIY46180.1| hypothetical protein HMPREF1067_02654 [Bacteroides fragilis
CL03T12C07]
Length = 769
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 104/333 (31%), Positives = 160/333 (48%), Gaps = 33/333 (9%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A N T + ++NG+ + +A +HY R W ++ K G+NTI YVFWN HE
Sbjct: 18 AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
+ G++ F G+ ++ F ++ Q+ MY+I+R GP+V AE+ GG+P WL V R
Sbjct: 78 QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRT- 136
Query: 145 TEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
+P Y M++ + ++ K+ L ++GG II+ QVENEYG Y K Y +
Sbjct: 137 LDP--YFMERTAIFMKEVGKQLAPLQITRGGNIIMVQVENEYGAYAV-----DKPYV--S 187
Query: 204 AKMAVAQNIG---VPWIMCQQFDTPDP--------VINTCNSFYCDQ----FTPHSPSMP 248
A + ++ G VP C T D IN +Q P P
Sbjct: 188 AIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGANIEQQFKRLKEARPETP 247
Query: 249 KIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-- 306
+ +E W GWF +G + RP++ + + + S + YM HGGT FG G
Sbjct: 248 LMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANN 306
Query: 307 ---PFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
+ +SYDY+API E G + K+ L++L
Sbjct: 307 PSYSAMCSSYDYDAPISEPGWTTD-KYFQLRDL 338
>gi|383812458|ref|ZP_09967896.1| glycosyl hydrolase family 35 [Prevotella sp. oral taxon 306 str.
F0472]
gi|383355018|gb|EID32564.1| glycosyl hydrolase family 35 [Prevotella sp. oral taxon 306 str.
F0472]
Length = 608
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 102/333 (30%), Positives = 152/333 (45%), Gaps = 40/333 (12%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKY-YF 92
+ I +G+ I S +HY R W ++ K G+N + +Y+FWN HE SPG + +
Sbjct: 27 NFIYDGKPTQIHSGEMHYARVPAPYWRHRMKMMKAMGLNAVATYIFWNHHETSPGVWDWS 86
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPF---- 148
G NL +FIK + + +ILR GP+ AE+ +GG P WL V R D +PF
Sbjct: 87 TGTHNLRQFIKTAGEEGLMVILRPGPYCCAEWEFGGYPWWLPKNKDLVIRTDNKPFLDSC 146
Query: 149 KYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY----ESFYGEGGKRYALWAA 204
+ ++ + ++D+ +QGGP+I+ Q ENE+G Y + E KRYA
Sbjct: 147 RVYINQLAKQVLDLQ------VTQGGPVIMVQAENEFGSYVAQRKDIPLETHKRYAAQIR 200
Query: 205 KMAVAQNIGVPWIMCQ--------QFDTPDPVINTCNSFYCDQFTP-----HSPSMPKIW 251
++ + VP + P N D+ H P +
Sbjct: 201 QLLLDAGFTVPMFTSDGSWLFKGGAIEGALPTANGEGDI--DKLKKVVNEYHGGVGPYMV 258
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT- 310
E +PGW + P +E + ++ G S NYYM HGGTNFG +AG +
Sbjct: 259 AEFYPGWLSHWAEPFPRVSTESVVKQTKKYLDNGISF-NYYMVHGGTNFGFSAGANYSNA 317
Query: 311 -------TSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+API E G PK+ L++L
Sbjct: 318 TNIQPDMTSYDYDAPISEAGWA-TPKYNALRDL 349
>gi|423270210|ref|ZP_17249181.1| hypothetical protein HMPREF1079_02263 [Bacteroides fragilis
CL05T00C42]
gi|423276168|ref|ZP_17255110.1| hypothetical protein HMPREF1080_03763 [Bacteroides fragilis
CL05T12C13]
gi|392698134|gb|EIY91316.1| hypothetical protein HMPREF1079_02263 [Bacteroides fragilis
CL05T00C42]
gi|392699308|gb|EIY92489.1| hypothetical protein HMPREF1080_03763 [Bacteroides fragilis
CL05T12C13]
Length = 769
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 104/333 (31%), Positives = 160/333 (48%), Gaps = 33/333 (9%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A N T + ++NG+ + +A +HY R W ++ K G+NTI YVFWN HE
Sbjct: 18 AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
+ G++ F G+ ++ F ++ Q+ MY+I+R GP+V AE+ GG+P WL V R
Sbjct: 78 QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRT- 136
Query: 145 TEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
+P Y M++ + ++ K+ L ++GG II+ QVENEYG Y K Y +
Sbjct: 137 LDP--YFMERTAIFMKEVGKQLAPLQITRGGNIIMVQVENEYGAYAV-----DKPYV--S 187
Query: 204 AKMAVAQNIG---VPWIMCQQFDTPDP--------VINTCNSFYCDQ----FTPHSPSMP 248
A + ++ G VP C T D IN +Q P P
Sbjct: 188 AIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGANIEQQFKRLREARPETP 247
Query: 249 KIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-- 306
+ +E W GWF +G + RP++ + + + S + YM HGGT FG G
Sbjct: 248 LMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANN 306
Query: 307 ---PFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
+ +SYDY+API E G + K+ L++L
Sbjct: 307 PSYSAMCSSYDYDAPISEPGWTTD-KYFQLRDL 338
>gi|327283884|ref|XP_003226670.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Anolis
carolinensis]
Length = 584
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 98/302 (32%), Positives = 144/302 (47%), Gaps = 35/302 (11%)
Query: 44 IISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIK 103
I+ ++HY R W + + K G+NT+ +YV WN HE GK+ F G +L FIK
Sbjct: 29 ILGGSLHYFRIPREYWKDRLMKMKACGLNTVTTYVPWNLHEAIRGKFDFSGNLDLQVFIK 88
Query: 104 IIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIP----GTVFRNDTEPFKYHMQKFMTLI 159
+ ++ +++ILR GP++ +E++ GG+P WL P T +R TE + + + +
Sbjct: 89 MAEEVGLWVILRPGPYICSEWDLGGLPSWLLQDPEMQLRTTYRGFTEAVDNYFDRLIPQV 148
Query: 160 VDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMC 219
V + + GGPII QVENEYG Y Y + KMA+ V +M
Sbjct: 149 VPLQYK------YGGPIIAVQVENEYGSYAQ-----DPSYMTY-IKMALTSRKIVEMLMT 196
Query: 220 QQ------FDTPDPVINTCNSFYCDQF------TPHSPSMPKIWTENWPGWFKTFGGRDP 267
T D + T N D T MPK+ E W GWF ++GG
Sbjct: 197 SDNHDGLVSGTVDGALATINFQKLDTAIMVFLSTDQRNKMPKMVMEYWTGWFDSWGGLHH 256
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF------ITTSYDYEAPIDE 321
++D+ +V + + G S+ N YM+HGGTNFG G TSYDY+A + E
Sbjct: 257 VFDADDMVQTVGKVIKLGASI-NLYMFHGGTNFGFLNGAQHSNEYKSTITSYDYDAVLTE 315
Query: 322 YG 323
G
Sbjct: 316 SG 317
>gi|393782614|ref|ZP_10370797.1| hypothetical protein HMPREF1071_01665 [Bacteroides salyersiae
CL02T12C01]
gi|392672841|gb|EIY66307.1| hypothetical protein HMPREF1071_01665 [Bacteroides salyersiae
CL02T12C01]
Length = 605
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 105/315 (33%), Positives = 148/315 (46%), Gaps = 33/315 (10%)
Query: 44 IISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF-GGRFNLVKFI 102
IIS IH R W +Q K G NT+ Y+ WN HE PG + F G NL KFI
Sbjct: 48 IISGEIHPSRIPAEYWKQRIQMIKAMGCNTVACYIMWNYHESEPGVFDFQTGNKNLEKFI 107
Query: 103 KIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDM 162
+ +Q M+++ R GP+V E+++GG+P +L IP R + +++++ I +
Sbjct: 108 QTVQDEGMFLLFRPGPYVCGEWDFGGLPPYLLSIPDIKIRCMDTRYTAAVERYVDKIAPI 167
Query: 163 MKREKLFASQGGPIILAQVENEYGYYESFYGEGGKR-YALWAAKMAVAQNIGVPW----- 216
+K+ ++ + GGPII+ QVENEYG Y G R Y W + + I VP+
Sbjct: 168 IKKYEI--TNGGPIIMVQVENEYGSY------GNDRIYMKWMHDLWRDKGIEVPFYTADG 219
Query: 217 ---IMCQQFDTPDPVIN---TCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRP 270
M + P I + D+ P +E +PGW + H
Sbjct: 220 ATPYMLEAGTLPGVAIGLDPAASKAEFDEALKVHPDASVFCSELYPGWLTHWREEWQHPS 279
Query: 271 SEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG---------PFITTSYDYEAPIDE 321
E I V G S NYY+ HGGTNFG AG P + TSYDY+API+E
Sbjct: 280 IEKITTDVKWLLDNGKSF-NYYVIHGGTNFGFWAGANSPQPGTYQPDV-TSYDYDAPINE 337
Query: 322 YGLPRNPKWGHLKEL 336
G PK+ L+EL
Sbjct: 338 MG-QATPKYMALREL 351
>gi|354581347|ref|ZP_09000251.1| Beta-galactosidase [Paenibacillus lactis 154]
gi|353201675|gb|EHB67128.1| Beta-galactosidase [Paenibacillus lactis 154]
Length = 587
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 96/298 (32%), Positives = 141/298 (47%), Gaps = 26/298 (8%)
Query: 44 IISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIK 103
I+S AIHY R VP W + + K G+NT+E+Y+ WN HE G++ F G ++ FI
Sbjct: 20 ILSGAIHYFRVVPEYWEDRLLKLKACGLNTVETYIPWNWHEPDEGRFNFSGMADIEAFIT 79
Query: 104 IIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMM 163
+ + +++I+R P++ AE+ +GG+P WL P R F + + ++ +
Sbjct: 80 LAGKLGLHVIVRPSPYICAEWEFGGLPAWLLQDPHMQLRCLDPKFLKKVDAYYDELIPRL 139
Query: 164 KREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGV-------PW 216
L ++ GGPII Q+ENEYG Y + Y + + +A+ + V P
Sbjct: 140 V--PLLSTNGGPIIAVQIENEYGSYGN-----DTAYLQYLQEALIARGVDVLLFTSDGPT 192
Query: 217 IMCQQFDTPDPVINTCN-----SFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPS 271
Q T V T N S + + P + E W GWF + R S
Sbjct: 193 DGMLQGGTVPGVTATVNFGSRPSEAFAKLREYRSEDPLMCMEYWNGWFDHWMKPHHTRDS 252
Query: 272 EDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF------ITTSYDYEAPIDEYG 323
ED A A G SV N+YM+HGGTNFG G + TSYDY+AP+ E G
Sbjct: 253 EDAASVFAEMLALGASV-NFYMFHGGTNFGFYNGANYHDKYEPTITSYDYDAPLSECG 309
>gi|251799202|ref|YP_003013933.1| beta-galactosidase [Paenibacillus sp. JDR-2]
gi|247546828|gb|ACT03847.1| Beta-galactosidase [Paenibacillus sp. JDR-2]
Length = 604
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 105/348 (30%), Positives = 163/348 (46%), Gaps = 45/348 (12%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+T+ + ++G I+S AIHY R VP W + + K G NT+E+Y+ WN HE
Sbjct: 3 RLTWKDQKYRLDGEEFRILSGAIHYFRVVPEYWEDRLLKLKACGFNTVETYIPWNLHEPR 62
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G + F G ++ +FI+ + +++I+R P++ AE+ +GG+P WL + D E
Sbjct: 63 EGSFRFDGFADVARFIETAGRLGLHVIVRPSPYICAEWEFGGLPAWLLKSSMGLRCMDNE 122
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
+ + + LI ++ L S+GGPII QVENEYG Y G A A
Sbjct: 123 YLEKVDRYYDELIPRLLP---LLDSRGGPIIAVQVENEYGSY-------GNDTAYLAYLR 172
Query: 207 AVAQNIGVPWIMCQQFDTPDPVI--NTCNSFYCD------------QFTPHSPSMPKIWT 252
GV ++ D ++ T + ++ + P +
Sbjct: 173 DGLIRRGVDCLLFTSDGPTDEMLLGGTVEGLHATVNFGSRVAESLAKYREYRQDEPLMVM 232
Query: 253 ENWPGWFKTFGGRDPH--RPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF-- 308
E W GWF + R PH R + D+A + ++G SV N YM+HGGTNFG +G +
Sbjct: 233 EYWLGWFDHW--RKPHHVREAGDVANVLDEMLEQGASV-NLYMFHGGTNFGFYSGANYGE 289
Query: 309 ----ITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIK--LCEHALLNG 350
TSYDY+AP+ E WG + E + AI+ L +H + G
Sbjct: 290 HYEPTITSYDYDAPLTE--------WGDITEKYKAIRSVLEKHGIPEG 329
>gi|256959941|ref|ZP_05564112.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|293384307|ref|ZP_06630193.1| beta-galactosidase [Enterococcus faecalis R712]
gi|293388457|ref|ZP_06632963.1| beta-galactosidase [Enterococcus faecalis S613]
gi|312907112|ref|ZP_07766105.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|312979309|ref|ZP_07791007.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|256950437|gb|EEU67069.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|291078380|gb|EFE15744.1| beta-galactosidase [Enterococcus faecalis R712]
gi|291082147|gb|EFE19110.1| beta-galactosidase [Enterococcus faecalis S613]
gi|310626889|gb|EFQ10172.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|311287903|gb|EFQ66459.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
Length = 593
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 141/320 (44%), Gaps = 38/320 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ IIS AIHY R P W + K G NT+E+Y+ WN HE G Y F
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ F+++ ++ + +ILR ++ AE+ +GG+P WL G R+ F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ K L +QGGP+I+ QVENEYG Y G A + + +
Sbjct: 129 RNYFQVLLP--KLAPLQITQGGPVIMMQVENEYGSY-------GMEKAYLQQTKQIMEEL 179
Query: 213 GVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIWT 252
G+ + + V++ D F T H P +
Sbjct: 180 GIEVPLFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLRKFMTRHGKKWPLMCM 239
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAG 305
E W GWF +G R D+A V G N YM+HGGTNFG R A
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAK 297
Query: 306 GPFITTSYDYEAPIDEYGLP 325
TSYDY+A + E G P
Sbjct: 298 DLPQVTSYDYDALLTEAGEP 317
>gi|255971270|ref|ZP_05421856.1| beta-galactosidase [Enterococcus faecalis T1]
gi|255962288|gb|EET94764.1| beta-galactosidase [Enterococcus faecalis T1]
Length = 593
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 141/320 (44%), Gaps = 38/320 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ IIS AIHY R P W + K G NT+E+Y+ WN HE G Y F
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ F+++ ++ + +ILR ++ AE+ +GG+P WL G R+ F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ K L +QGGP+I+ QVENEYG Y G A + + +
Sbjct: 129 RNYFQVLLP--KLAPLQITQGGPVIMMQVENEYGSY-------GMEKAYLRQTRQIMEEL 179
Query: 213 GVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIWT 252
G+ + + V++ D F T H P +
Sbjct: 180 GIEVPLFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAG 305
E W GWF +G R D+A V G N YM+HGGTNFG R A
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAK 297
Query: 306 GPFITTSYDYEAPIDEYGLP 325
TSYDY+A + E G P
Sbjct: 298 DLPQVTSYDYDALLTEAGEP 317
>gi|423212381|ref|ZP_17198910.1| hypothetical protein HMPREF1074_00442 [Bacteroides xylanisolvens
CL03T12C04]
gi|392694827|gb|EIY88053.1| hypothetical protein HMPREF1074_00442 [Bacteroides xylanisolvens
CL03T12C04]
Length = 725
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 91/290 (31%), Positives = 142/290 (48%), Gaps = 23/290 (7%)
Query: 49 IHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQA 108
+HYPR W +++A+ G+NT+ +YVFWN HE PG++ F G+ ++ +F++ Q+
Sbjct: 1 MHYPRIPHEYWRDRLKRARAMGLNTVSAYVFWNFHERQPGEFDFTGQADIAEFVRTAQEE 60
Query: 109 RMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKL 168
+Y+ILR GP+V AE+++GG P WL ++R+ F + ++++ + + L
Sbjct: 61 GLYVILRPGPYVCAEWDFGGYPSWLLKEKDMIYRSKDPRFLSYCERYIKELGKQLS--SL 118
Query: 169 FASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQ-----QFD 223
+ GG II+ QVENEYG Y + K Y M VP C +
Sbjct: 119 TINNGGNIIMVQVENEYGSYAA-----DKEYLAAIRDMIKEAGFNVPLFTCDGGGQVEAG 173
Query: 224 TPDPVINTCNSFYCDQF----TPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVA 279
+ + T N + + + P E +P WF +G R E A +
Sbjct: 174 HIEGALPTLNGVFGEDIFKVVDNYHKGGPYFVAEFYPAWFDEWGKRHSSVAYERPAEQLD 233
Query: 280 RFFQKGGSVHNYYMYHGGTNF----GRTAGGPF--ITTSYDYEAPIDEYG 323
G SV + YM+HGGTNF G GG + TSYDY+AP+ E+G
Sbjct: 234 WMLSHGVSV-SMYMFHGGTNFWYTNGANTGGGYQPQPTSYDYDAPLGEWG 282
>gi|424760912|ref|ZP_18188500.1| putative beta-galactosidase [Enterococcus faecalis R508]
gi|402402633|gb|EJV35336.1| putative beta-galactosidase [Enterococcus faecalis R508]
Length = 593
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 141/320 (44%), Gaps = 38/320 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ IIS AIHY R P W + K G NT+E+Y+ WN HE G Y F
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ F+++ ++ + +ILR ++ AE+ +GG+P WL G R+ F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ K L +QGGP+I+ QVENEYG Y G A + + +
Sbjct: 129 RNYFQVLLP--KLAPLQITQGGPVIMMQVENEYGSY-------GMEKAYLRQTKQIMEEL 179
Query: 213 GVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIWT 252
G+ + + V++ D F T H P +
Sbjct: 180 GIEVPLFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAG 305
E W GWF +G R D+A V G N YM+HGGTNFG R A
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAK 297
Query: 306 GPFITTSYDYEAPIDEYGLP 325
TSYDY+A + E G P
Sbjct: 298 DLPQVTSYDYDALLTEAGEP 317
>gi|413922057|gb|AFW61989.1| hypothetical protein ZEAMMB73_453254 [Zea mays]
Length = 139
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 58/100 (58%), Positives = 86/100 (86%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V+YD R+++ING+R ++IS +IHYPRS P MWPGL+Q+AK+GG++ +++YVFWNGHE
Sbjct: 28 VSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTYVFWNGHEPVR 87
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYG 127
G+YYFG R++LV+F+K+ +QA +Y+ LRIGP+V AE+N+G
Sbjct: 88 GQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFG 127
>gi|424687003|ref|ZP_18123658.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|402366194|gb|EJV00591.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
Length = 593
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 141/320 (44%), Gaps = 38/320 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ IIS AIHY R P W + K G NT+E+Y+ WN HE G Y F
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ F+++ ++ + +ILR ++ AE+ +GG+P WL G R+ F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ K L +QGGP+I+ QVENEYG Y G A + + +
Sbjct: 129 RNYFQVLLP--KLAPLQITQGGPVIMMQVENEYGSY-------GMEKAYLRQTKQIMEEL 179
Query: 213 GVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIWT 252
G+ + + V++ D F T H P +
Sbjct: 180 GIEVPLFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAG 305
E W GWF +G R D+A V G N YM+HGGTNFG R A
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAK 297
Query: 306 GPFITTSYDYEAPIDEYGLP 325
TSYDY+A + E G P
Sbjct: 298 DLPQVTSYDYDALLTEAGEP 317
>gi|296475022|tpg|DAA17137.1| TPA: galactosidase, beta 1 precursor [Bos taurus]
Length = 653
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 153/315 (48%), Gaps = 24/315 (7%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+ Y + +G+ IS +IHY R W + + K G+N I++YV WN HEL
Sbjct: 32 QIDYRRNRFLKDGQPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQTYVAWNFHELQ 91
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PG+Y F G ++ FI++ + + +ILR GP++ AE++ GG+P WL V R+
Sbjct: 92 PGRYNFSGDHDVEHFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKKSIVLRSSDP 151
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYES------------FYGE 194
+ + K++ +++ M+ L GGPII QVENEYG Y S F+
Sbjct: 152 DYLAAVDKWLGVLLPKMR--PLLYKNGGPIITVQVENEYGSYLSCDYDYLRFLQKRFHDH 209
Query: 195 GGKRYALWAAKMAVAQNIGVPWIMCQQFDTPD--PVINTCNSFYCDQFTPHSPSMPKIWT 252
G+ L+ V + + + + T D P N +F + P+ P + +
Sbjct: 210 LGEDVLLFTTD-GVNERLLQCGALQGLYATVDFSPGTNLTAAFMLQR--KFEPTGPLVNS 266
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--PF-- 308
E + GW +G R S+ +AF++ G +V N YM+ GGTNF G P+
Sbjct: 267 EFYTGWLDHWGQRHSTVSSKAVAFTLHDMLALGANV-NMYMFIGGTNFAYWNGANIPYQP 325
Query: 309 ITTSYDYEAPIDEYG 323
TSYDY+AP+ E G
Sbjct: 326 QPTSYDYDAPLSEAG 340
>gi|229548754|ref|ZP_04437479.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|257421063|ref|ZP_05598053.1| glycosyl hydrolase [Enterococcus faecalis X98]
gi|312951816|ref|ZP_07770707.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|422691033|ref|ZP_16749073.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|422707894|ref|ZP_16765431.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
gi|229306094|gb|EEN72090.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|257162887|gb|EEU92847.1| glycosyl hydrolase [Enterococcus faecalis X98]
gi|310630219|gb|EFQ13502.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|315154243|gb|EFT98259.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|315154885|gb|EFT98901.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
Length = 593
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 141/320 (44%), Gaps = 38/320 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ IIS AIHY R P W + K G NT+E+Y+ WN HE G Y F
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ F+++ ++ + +ILR ++ AE+ +GG+P WL G R+ F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ K L +QGGP+I+ QVENEYG Y G A + + +
Sbjct: 129 RNYFQVLLP--KLAPLQITQGGPVIMMQVENEYGSY-------GMEKAYLRQTKQIMEEL 179
Query: 213 GVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIWT 252
G+ + + V++ D F T H P +
Sbjct: 180 GIEVPLFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAG 305
E W GWF +G R D+A V G N YM+HGGTNFG R A
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAK 297
Query: 306 GPFITTSYDYEAPIDEYGLP 325
TSYDY+A + E G P
Sbjct: 298 DLPQVTSYDYDALLTEAGEP 317
>gi|227517783|ref|ZP_03947832.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|424678087|ref|ZP_18114931.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|424681129|ref|ZP_18117923.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|424685648|ref|ZP_18122340.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|424689662|ref|ZP_18126226.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|424693525|ref|ZP_18129955.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|424698239|ref|ZP_18134537.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|424701365|ref|ZP_18137539.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|424702750|ref|ZP_18138894.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|424711867|ref|ZP_18144074.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|424717978|ref|ZP_18147248.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|424722429|ref|ZP_18151489.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|424723619|ref|ZP_18152577.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|424733091|ref|ZP_18161660.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|424746203|ref|ZP_18174452.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|424755204|ref|ZP_18183090.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
gi|227074744|gb|EEI12707.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|402351976|gb|EJU86842.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|402352513|gb|EJU87362.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|402358223|gb|EJU92905.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|402367111|gb|EJV01460.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|402371797|gb|EJV05943.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|402373001|gb|EJV07093.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|402373959|gb|EJV08006.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|402382684|gb|EJV16335.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|402383232|gb|EJV16843.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|402386182|gb|EJV19689.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|402388743|gb|EJV22170.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|402392403|gb|EJV25665.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|402397550|gb|EJV30559.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|402397571|gb|EJV30579.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|402401167|gb|EJV33955.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
Length = 593
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 141/320 (44%), Gaps = 38/320 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ IIS AIHY R P W + K G NT+E+Y+ WN HE G Y F
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ F+++ ++ + +ILR ++ AE+ +GG+P WL G R+ F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ K L +QGGP+I+ QVENEYG Y G A + + +
Sbjct: 129 RNYFQVLLP--KLAPLQITQGGPVIMMQVENEYGSY-------GMEKAYLRQTKQIMEEL 179
Query: 213 GVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIWT 252
G+ + + V++ D F T H P +
Sbjct: 180 GIEVPLFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAG 305
E W GWF +G R D+A V G N YM+HGGTNFG R A
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAK 297
Query: 306 GPFITTSYDYEAPIDEYGLP 325
TSYDY+A + E G P
Sbjct: 298 DLPQVTSYDYDALLTEAGEP 317
>gi|381169756|ref|ZP_09878919.1| beta-galactosidase [Xanthomonas citri pv. mangiferaeindicae LMG
941]
gi|380689774|emb|CCG35406.1| beta-galactosidase [Xanthomonas citri pv. mangiferaeindicae LMG
941]
Length = 613
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 106/357 (29%), Positives = 160/357 (44%), Gaps = 32/357 (8%)
Query: 4 RTPIAPFALLIFFSSSITYCFAG-----NVTYDSRSLIINGRRELIISAAIHYPRSVPGM 58
RTP+AP L + F+ IT A N +G+ ++S AIH+ R
Sbjct: 3 RTPLAPLVLALAFALPITGTAAETERWPNFGTQGTQFARDGKPYQLLSGAIHFQRIPRAY 62
Query: 59 WPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGP 118
W +Q+A+ G+NT+E+YVFWN E G++ F G ++ F++ + +ILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGHNDVAAFVREAAAQGLNVILRPGP 122
Query: 119 FVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIIL 178
+ AE+ GG P WL R+ F Q ++ + + + + L GGPII
Sbjct: 123 YACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALANQV--QPLLNHNGGPIIA 180
Query: 179 AQVENEYGYYESFYGEGGKRYALWAA----KMAVAQNIGVPWIMCQQFDTPDPVINTC-- 232
QVENEYG Y + A++ K + + G + V+N
Sbjct: 181 VQVENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPG 240
Query: 233 -NSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQ---KGGSV 288
D+ P P++ E W GWF +G PH ++ A A F+ + G
Sbjct: 241 EAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWG--KPHAATD--ARQQAEEFEWILRQGHS 296
Query: 289 HNYYMYHGGTNFGRTAGGPF----------ITTSYDYEAPIDEYGLPRNPKWGHLKE 335
N YM+ GGT+FG G F TTSYDY+A +DE G P PK+ +++
Sbjct: 297 ANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHP-TPKFALMRD 352
>gi|383116237|ref|ZP_09936989.1| hypothetical protein BSHG_3290 [Bacteroides sp. 3_2_5]
gi|251945420|gb|EES85858.1| hypothetical protein BSHG_3290 [Bacteroides sp. 3_2_5]
Length = 769
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 104/333 (31%), Positives = 160/333 (48%), Gaps = 33/333 (9%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A N T + ++NG+ + +A +HY R W ++ K G+NTI YVFWN HE
Sbjct: 18 AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
+ G++ F G+ ++ F ++ Q+ MY+I+R GP+V AE+ GG+P WL V R
Sbjct: 78 QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRT- 136
Query: 145 TEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
+P Y M++ + ++ K+ L ++GG II+ QVENEYG Y K Y +
Sbjct: 137 LDP--YFMERTAIFMKEVGKQLAPLQITRGGNIIMVQVENEYGAYAV-----DKPYI--S 187
Query: 204 AKMAVAQNIG---VPWIMCQQFDTPDP--------VINTCNSFYCDQ----FTPHSPSMP 248
A + ++ G VP C T D IN +Q P P
Sbjct: 188 AIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGANIEQQFKRLKEARPETP 247
Query: 249 KIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-- 306
+ +E W GWF +G + RP++ + + + S + YM HGGT FG G
Sbjct: 248 LMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANN 306
Query: 307 ---PFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
+ +SYDY+API E G + K+ L++L
Sbjct: 307 PSYSAMCSSYDYDAPISEPGWTTD-KYFQLRDL 338
>gi|158455090|gb|AAI40686.2| Galactosidase, beta 1 [Bos taurus]
Length = 653
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 153/315 (48%), Gaps = 24/315 (7%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+ Y + +G+ IS +IHY R W + + K G+N I++YV WN HEL
Sbjct: 32 QIDYRRNRFLKDGQPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQTYVAWNFHELQ 91
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PG+Y F G ++ FI++ + + +ILR GP++ AE++ GG+P WL V R+
Sbjct: 92 PGRYNFSGDHDVEHFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKKSIVLRSSDP 151
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYES------------FYGE 194
+ + K++ +++ M+ L GGPII QVENEYG Y S F+
Sbjct: 152 DYLAAVDKWLGVLLPKMR--PLLYKNGGPIITVQVENEYGSYLSCDYDYLRFLQKRFHDH 209
Query: 195 GGKRYALWAAKMAVAQNIGVPWIMCQQFDTPD--PVINTCNSFYCDQFTPHSPSMPKIWT 252
G+ L+ V + + + + T D P N +F + P+ P + +
Sbjct: 210 LGEDVLLFTTD-GVNERLLQCGALQGLYATLDFSPGTNLTAAFMLQR--KFEPTGPLVNS 266
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--PF-- 308
E + GW +G R S+ +AF++ G +V N YM+ GGTNF G P+
Sbjct: 267 EFYTGWLDHWGQRHSTVSSKAVAFTLHDMLALGANV-NMYMFIGGTNFAYWNGANIPYQP 325
Query: 309 ITTSYDYEAPIDEYG 323
TSYDY+AP+ E G
Sbjct: 326 QPTSYDYDAPLSEAG 340
>gi|332187631|ref|ZP_08389367.1| glycosyl hydrolases 35 family protein [Sphingomonas sp. S17]
gi|332012379|gb|EGI54448.1| glycosyl hydrolases 35 family protein [Sphingomonas sp. S17]
Length = 613
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 106/344 (30%), Positives = 160/344 (46%), Gaps = 36/344 (10%)
Query: 7 IAPFALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQA 66
+A AL+ +S+ A + T + +G+ +ISA +HY R W +++A
Sbjct: 9 VAASALVPTIASAQGTTPAHSFTVQGNGFLKDGKPYQVISAEMHYTRIPRAYWRDRLRKA 68
Query: 67 KEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNY 126
K G+NTI +Y FWN HE PG Y F G+ ++ FI+ Q + +ILR GP+V AE+
Sbjct: 69 KAMGLNTITTYSFWNAHEPRPGTYDFTGQNDIAAFIRDAQAEGLDVILRPGPYVCAEWEL 128
Query: 127 GGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG 186
GG P WL + R+ + + +++ + +K L GGPI+ Q+ENEYG
Sbjct: 129 GGYPSWLLKDRNLLLRSTDPKYTAAVDRWLARLGQEVK--PLLLRNGGPIVAIQLENEYG 186
Query: 187 YY--ESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPD---PVINTCNSF------ 235
+ + Y EG L A+ GV + Q D P + + +F
Sbjct: 187 AFGSDKAYLEG-----LKASYQRAGLADGVLFTSNQAGDLAKGSLPEVPSVVNFGSGGAQ 241
Query: 236 -YCDQFTPHSPSMPKIWTENWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHN 290
+ P ++ E W GWF +G D + +E++ F ++G SV +
Sbjct: 242 NAVAKLEAFRPDGLRMVGEYWAGWFDKWGEDHHETDGKKEAEELGF----MLKRGYSV-S 296
Query: 291 YYMYHGGTNFGRTAGGPF--------ITTSYDYEAPIDEYGLPR 326
YM+HGGT FG G TTSYDY AP+DE G PR
Sbjct: 297 LYMFHGGTTFGWMNGADSHTGTDYHPDTTSYDYNAPLDEAGNPR 340
>gi|78042544|ref|NP_001030215.1| beta-galactosidase precursor [Bos taurus]
gi|75057630|sp|Q58D55.1|BGAL_BOVIN RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; Flags: Precursor
gi|61554628|gb|AAX46589.1| galactosidase, beta 1 [Bos taurus]
gi|148839051|dbj|BAF64285.1| galactosidase, beta 1 [Bos taurus]
Length = 653
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 153/315 (48%), Gaps = 24/315 (7%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+ Y + +G+ IS +IHY R W + + K G+N I++YV WN HEL
Sbjct: 32 QIDYRRNRFLKDGQPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQTYVAWNFHELQ 91
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PG+Y F G ++ FI++ + + +ILR GP++ AE++ GG+P WL V R+
Sbjct: 92 PGRYNFSGDHDVEHFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKKSIVLRSSDP 151
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYES------------FYGE 194
+ + K++ +++ M+ L GGPII QVENEYG Y S F+
Sbjct: 152 DYLAAVDKWLGVLLPKMR--PLLYKNGGPIITVQVENEYGSYLSCDYDYLRFLQKRFHDH 209
Query: 195 GGKRYALWAAKMAVAQNIGVPWIMCQQFDTPD--PVINTCNSFYCDQFTPHSPSMPKIWT 252
G+ L+ V + + + + T D P N +F + P+ P + +
Sbjct: 210 LGEDVLLFTTD-GVNERLLQCGALQGLYATVDFSPGTNLTAAFMLQR--KFEPTGPLVNS 266
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--PF-- 308
E + GW +G R S+ +AF++ G +V N YM+ GGTNF G P+
Sbjct: 267 EFYTGWLDHWGQRHSTVSSKAVAFTLHDMLALGANV-NMYMFIGGTNFAYWNGANIPYQP 325
Query: 309 ITTSYDYEAPIDEYG 323
TSYDY+AP+ E G
Sbjct: 326 QPTSYDYDAPLSEAG 340
>gi|422727867|ref|ZP_16784288.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
gi|315151617|gb|EFT95633.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
Length = 593
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 141/320 (44%), Gaps = 38/320 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ IIS AIHY R P W + K G NT+E+Y+ WN HE G Y F
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ F+++ ++ + +ILR ++ AE+ +GG+P WL G R+ F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ K L +QGGP+I+ QVENEYG Y G A + + +
Sbjct: 129 RNYFQVLLP--KLAPLQITQGGPVIMMQVENEYGSY-------GMEKAYLRQTKQIMEEL 179
Query: 213 GVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIWT 252
G+ + + V++ D F T H P +
Sbjct: 180 GIEVPLFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAG 305
E W GWF +G R D+A V G N YM+HGGTNFG R A
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAK 297
Query: 306 GPFITTSYDYEAPIDEYGLP 325
TSYDY+A + E G P
Sbjct: 298 DLPQVTSYDYDALLTEAGEP 317
>gi|257415380|ref|ZP_05592374.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
gi|257157208|gb|EEU87168.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
Length = 593
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 141/320 (44%), Gaps = 38/320 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ IIS AIHY R P W + K G NT+E+Y+ WN HE G Y F
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ F+++ ++ + +ILR ++ AE+ +GG+P WL G R+ F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ K L +QGGP+I+ QVENEYG Y G A + + +
Sbjct: 129 RNYFQVLLP--KLAPLQITQGGPVIMMQVENEYGSY-------GMEKAYLRQTKQIMEEL 179
Query: 213 GVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIWT 252
G+ + + V++ D F T H P +
Sbjct: 180 GIEVPLFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAG 305
E W GWF +G R D+A V G N YM+HGGTNFG R A
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAK 297
Query: 306 GPFITTSYDYEAPIDEYGLP 325
TSYDY+A + E G P
Sbjct: 298 DLPQVTSYDYDALLTEAGEP 317
>gi|255973889|ref|ZP_05424475.1| beta-galactosidase [Enterococcus faecalis T2]
gi|307284354|ref|ZP_07564519.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|255966761|gb|EET97383.1| beta-galactosidase [Enterococcus faecalis T2]
gi|306503294|gb|EFM72546.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
Length = 593
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 141/320 (44%), Gaps = 38/320 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ IIS AIHY R P W + K G NT+E+Y+ WN HE G Y F
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ F+++ ++ + +ILR ++ AE+ +GG+P WL G R+ F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ K L +QGGP+I+ QVENEYG Y G A + + +
Sbjct: 129 RNYFQVLLP--KLAPLQITQGGPVIMMQVENEYGSY-------GMEKAYLRQTRQIMEEL 179
Query: 213 GVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIWT 252
G+ + + V++ D F T H P +
Sbjct: 180 GIEVPLFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAG 305
E W GWF +G R D+A V G N YM+HGGTNFG R A
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAK 297
Query: 306 GPFITTSYDYEAPIDEYGLP 325
TSYDY+A + E G P
Sbjct: 298 DLPQVTSYDYDALLTEAGEP 317
>gi|410972395|ref|XP_003992645.1| PREDICTED: beta-galactosidase-1-like protein 3 [Felis catus]
Length = 664
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 100/326 (30%), Positives = 161/326 (49%), Gaps = 37/326 (11%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
+ G + LI +IHY R W + + K G NT+ +YV WN HE GK+ F G
Sbjct: 93 LGGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTLTTYVPWNLHEPQRGKFDFSGNL 152
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKF- 155
+L F+ + + +++ILR GP++ +E + GG+P WL P + R + F + K+
Sbjct: 153 DLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPKMILRTTYKGFVEAVNKYF 212
Query: 156 ---MTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ +V + R++ GPII QVENEYG + K Y + K + +
Sbjct: 213 DHLISRVVPLQYRKR------GPIIAVQVENEYGSFAE-----DKDYMPYIQKALLER-- 259
Query: 213 GVPWIMCQQFDTP-------DPVINTC--NSFYCDQFTPHSP---SMPKIWTENWPGWFK 260
G+ ++ D + V+ T N+F + F S + P + E W GWF
Sbjct: 260 GIVELLMTSDDAKHMLKGYIEGVLATINMNTFQINDFKQLSQVQRNKPIMVMEFWVGWFD 319
Query: 261 TFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF------ITTSYD 314
T+GG+ + +ED+ +V++F S N YM+HGGTNFG G + + TSYD
Sbjct: 320 TWGGKHMIKNAEDVEDTVSKFITSEISF-NVYMFHGGTNFGFMNGATYFGKHRGVVTSYD 378
Query: 315 YEAPIDEYGLPRNPKWGHLKELHGAI 340
Y+A + E G K+ L++L G++
Sbjct: 379 YDAVLTEAG-DYTEKYFKLRKLFGSV 403
>gi|440904150|gb|ELR54700.1| Beta-galactosidase, partial [Bos grunniens mutus]
Length = 659
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 153/315 (48%), Gaps = 24/315 (7%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+ Y + +G+ IS +IHY R W + + K G+N I++YV WN HEL
Sbjct: 38 QIDYRRNRFLKDGQPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQTYVAWNFHELQ 97
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PG+Y F G ++ FI++ + + +ILR GP++ AE++ GG+P WL V R+
Sbjct: 98 PGRYNFSGDHDVEHFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKKSIVLRSSDP 157
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYES------------FYGE 194
+ + K++ +++ M+ L GGPII QVENEYG Y S F+
Sbjct: 158 DYLAAVDKWLGVLLPKMR--PLLYKNGGPIITVQVENEYGSYLSCDYDYLRFLQKRFHDH 215
Query: 195 GGKRYALWAAKMAVAQNIGVPWIMCQQFDTPD--PVINTCNSFYCDQFTPHSPSMPKIWT 252
G+ L+ V + + + + T D P N +F + P+ P + +
Sbjct: 216 LGEDVLLFTTD-GVNERLLQCGALQGLYATVDFSPGTNLTAAFMLQR--KFEPTGPLVNS 272
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--PF-- 308
E + GW +G R S+ +AF++ G +V N YM+ GGTNF G P+
Sbjct: 273 EFYTGWLDHWGQRHSTVSSKAVAFTLHDMLALGANV-NMYMFIGGTNFAYWNGANIPYQP 331
Query: 309 ITTSYDYEAPIDEYG 323
TSYDY+AP+ E G
Sbjct: 332 QPTSYDYDAPLSEAG 346
>gi|408532648|emb|CCK30822.1| beta-galactosidase [Streptomyces davawensis JCM 4913]
Length = 577
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 106/328 (32%), Positives = 153/328 (46%), Gaps = 43/328 (13%)
Query: 29 TYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPG 88
T +++GR ++S A+HY R W + + G+N +E+YV WN HE PG
Sbjct: 5 TVGDTDFLLDGRPVRLLSGALHYFRVHEAQWGHRLAMLRAMGLNCVETYVPWNLHEPRPG 64
Query: 89 KYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPF 148
+ F L +F+ ++A ++ I+R GP++ AE+ GG+P H++PG D
Sbjct: 65 E--FRDVEALGRFLDAAREAGLWAIVRPGPYICAEWENGGLP---HWVPGHARTRDERFL 119
Query: 149 KYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAV 208
+ F L+ +++ R+ +GGP+IL QVENEYG Y S Y A +
Sbjct: 120 RPVRAWFRRLLPEVVSRQ---IDRGGPVILVQVENEYGSYGS-----DAAYPDRLAGLLR 171
Query: 209 AQNIGVPWIMCQQFDTPDP----------VINTCN--SFYCDQFTP---HSPSMPKIWTE 253
A+ + VP D P+ V+ T N S + F H P P + E
Sbjct: 172 AEGVTVPLFTS---DGPEDHMLTGGSVPGVLATVNFGSHAREAFRTLRRHRPEGPLMCME 228
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG-------- 305
W GWF +G R ED A ++ + G SV N YM HGGT+F AG
Sbjct: 229 FWCGWFDHWGAEHVVRDPEDAAAALREILECGASV-NLYMAHGGTSFAGWAGANRGGDLH 287
Query: 306 -GPF--ITTSYDYEAPIDEYGLPRNPKW 330
GP TSYDY+AP+DE G P W
Sbjct: 288 DGPLEPDVTSYDYDAPLDEAGRPTRKFW 315
>gi|325261840|ref|ZP_08128578.1| glycosyl hydrolase, family 35 [Clostridium sp. D5]
gi|324033294|gb|EGB94571.1| glycosyl hydrolase, family 35 [Clostridium sp. D5]
Length = 581
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 98/319 (30%), Positives = 164/319 (51%), Gaps = 29/319 (9%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
I+ ++ IIS +HY R + W + + K G NT+E+Y+ WN HE G++ F G
Sbjct: 12 IDNQKVKIISGGVHYFRIMAEYWKDCLLKLKAFGCNTVETYIPWNLHEKEKGEFCFEGNL 71
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
++ KF+ I + +Y+ILR P++ AE+ +GG+P WL G R +PF H++++
Sbjct: 72 DITKFVHIAKDLGLYVILRPSPYICAEWEFGGLPYWLLKEDGMRLRCSYKPFLKHVEEYY 131
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYY--ESFYGEGGKRYAL-WAAKMAVAQNIG 213
+ +++ L ++GGP+I+ QVENEYGYY ++ Y + + + + + ++ + + G
Sbjct: 132 HRLFEVIA--PLQYTKGGPVIMMQVENEYGYYGNDTLYLKTLQDFMVSYGCEVPLVTSDG 189
Query: 214 VPWIMCQQFDTPDPVINTCN--SFYCDQFTPHSPSM---PKIWTENWPGWFKTFG----- 263
PW + V+ T N S Q + P + E W GWF ++G
Sbjct: 190 -PWGDAFDCGKLEGVLQTGNFGSKSRQQLQIMRDKIGNKPLMCMEFWVGWFDSWGQTEHK 248
Query: 264 GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI------TTSYDYEA 317
DP++ +E++ + + G V N YM+ GGTNFG G + TSYDY+A
Sbjct: 249 QEDPNKNAENLDEIL-----ESGHV-NIYMFMGGTNFGFMNGSNYYDVLTPDVTSYDYDA 302
Query: 318 PIDEYGLPRNPKWGHLKEL 336
+ E G PK+ LK +
Sbjct: 303 LLTEAG-DLTPKYELLKNV 320
>gi|189096261|pdb|3D3A|A Chain A, Crystal Structure Of A Beta-Galactosidase From Bacteroides
Thetaiotaomicron
Length = 612
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 102/332 (30%), Positives = 153/332 (46%), Gaps = 29/332 (8%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
G + ++NG ++ +A IHYPR W ++ K G NTI YVFWN HE
Sbjct: 6 GTFEVGKNTFLLNGEPFVVKAAEIHYPRIPKEYWEHRIKXCKALGXNTICLYVFWNFHEP 65
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
G+Y F G+ ++ F ++ Q+ Y+I+R GP+V AE+ GG+P WL R
Sbjct: 66 EEGRYDFAGQKDIAAFCRLAQENGXYVIVRPGPYVCAEWEXGGLPWWLLKKKDIKLREQD 125
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
+ ++ F+ + + L S+GG II QVENEYG + G + + +
Sbjct: 126 PYYXERVKLFLNEVGKQLA--DLQISKGGNIIXVQVENEYGAF------GIDKPYISEIR 177
Query: 206 MAVAQN--IGVPWIMCQ-----QFDTPDPVINTCN----SFYCDQFT---PHSPSMPKIW 251
V Q GVP C + + D ++ T N + +QF P P
Sbjct: 178 DXVKQAGFTGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDEQFKRLKELRPDTPLXC 237
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--- 308
+E W GWF +G + R +E++ + S + Y HGGT+FG G F
Sbjct: 238 SEFWSGWFDHWGAKHETRSAEELVKGXKEXLDRNISF-SLYXTHGGTSFGHWGGANFPNF 296
Query: 309 --ITTSYDYEAPIDEYGLPRNPKWGHLKELHG 338
TSYDY+API+E G PK+ ++ L G
Sbjct: 297 SPTCTSYDYDAPINESG-KVTPKYLEVRNLLG 327
>gi|260804659|ref|XP_002597205.1| hypothetical protein BRAFLDRAFT_203307 [Branchiostoma floridae]
gi|229282468|gb|EEN53217.1| hypothetical protein BRAFLDRAFT_203307 [Branchiostoma floridae]
Length = 608
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 109/340 (32%), Positives = 160/340 (47%), Gaps = 56/340 (16%)
Query: 31 DSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKY 90
D + I+G+ ++S A+HY R VP W + + K G+NT+E+YV WN HE Y
Sbjct: 26 DGANFTIDGKPVRLLSGAMHYFRVVPEYWRDRMLKMKAAGLNTLETYVPWNLHEPEKYTY 85
Query: 91 YFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKY 150
F G +L +++ I + +++ILR GP++ AE+ +GGIP WL Y+ V T P
Sbjct: 86 NFEGILDLGRYLDIAHEVGLWVILRPGPYICAEWEFGGIPGWLAYVKEHV--RTTRPMFI 143
Query: 151 HMQK--FMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAV 208
+ F L+ +++ R+ + GGPII Q+ENEYG + + Y K+
Sbjct: 144 DPVEVWFGRLLAEVVPRQ---YTNGGPIIAVQIENEYGGFSN-----STEYMERLKKILE 195
Query: 209 AQNI----------------GVPWIMCQQFDTPDPVINTCN--SFYCDQFTPHSPSMPKI 250
++ I G+P ++ +N N S + P P +
Sbjct: 196 SRGIVELLFTSDGKGALISGGIPGVL--------KTVNFQNNASDKLQKLKEIQPDRPMM 247
Query: 251 WTENWPGWFKTFGGRDPH---RPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
E W GWF + G D H SE SV G SV N+YM+HGGTNFG G
Sbjct: 248 VMEYWTGWFDHW-GEDHHLYRLESESFVHSVFYILDAGASV-NFYMFHGGTNFGFMNGAN 305
Query: 307 ----------PFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
P I TSYDY+API E G PK+ ++E+
Sbjct: 306 TRYKSGGRTLPTI-TSYDYDAPISETG-DLTPKYFKIREI 343
>gi|373955175|ref|ZP_09615135.1| glycoside hydrolase family 35 [Mucilaginibacter paludis DSM 18603]
gi|373891775|gb|EHQ27672.1| glycoside hydrolase family 35 [Mucilaginibacter paludis DSM 18603]
Length = 600
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 102/324 (31%), Positives = 156/324 (48%), Gaps = 31/324 (9%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGM-WPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
+ +++G+ IIS +H P +P M W +Q AK G NTI +Y+FWN HE G + F
Sbjct: 31 AFLLDGKPFQIISGELH-PARIPKMYWRHRIQMAKAMGCNTIAAYIFWNYHEQQKGVFDF 89
Query: 93 GGR-FNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
N+V FI++ Q+ M+++LR GP+V AE+++GG+P +L IP R +
Sbjct: 90 TTENRNIVDFIRMCQEEGMWVLLRPGPYVCAEWDFGGLPPYLLSIPDIKLRCMDPRYIAE 149
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+ +++ ++ +K L + GGPII+ QVENEYG Y + + Y + V
Sbjct: 150 VTRYVDVLSQQVK--NLQCTSGGPIIMVQVENEYGSYAN-----DREYIKTLRGLWVKNG 202
Query: 212 IGVPW--------IMCQQFDTPDPVINTCNSFYCDQF---TPHSPSMPKIWTENWPGWFK 260
I VP+ M + I + F +P +P +E++PGW
Sbjct: 203 INVPFYTADGPAAFMLEAGGVDGAAIGLDSGSGDADFELAAKQNPDVPSFSSESYPGWL- 261
Query: 261 TFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT--------TS 312
T +P D + + N Y+ +GGTNFG AG T TS
Sbjct: 262 THWKEKWQKPGTDGILKDVTYLLEHQKSFNLYVINGGTNFGYNAGANAFTPTQFQPDVTS 321
Query: 313 YDYEAPIDEYGLPRNPKWGHLKEL 336
YDY+API+E G P PK+ L+ L
Sbjct: 322 YDYDAPINERGEP-TPKYYALRNL 344
>gi|282859441|ref|ZP_06268546.1| glycosyl hydrolase family 35 [Prevotella bivia JCVIHMP010]
gi|424900868|ref|ZP_18324410.1| beta-galactosidase [Prevotella bivia DSM 20514]
gi|282587669|gb|EFB92869.1| glycosyl hydrolase family 35 [Prevotella bivia JCVIHMP010]
gi|388593068|gb|EIM33307.1| beta-galactosidase [Prevotella bivia DSM 20514]
Length = 622
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 106/350 (30%), Positives = 153/350 (43%), Gaps = 46/350 (13%)
Query: 10 FALLIFFSSSITYCFAGNVTYDS-----RSLIINGRRELIISAAIHYPRSVPGMWPGLVQ 64
FALLI T FAG S + + +G+ I S +HY R W +Q
Sbjct: 7 FALLIGLFLVSTASFAGKPVRHSFVIANGNFLYDGKPLQIYSGELHYARVPAPYWRHRLQ 66
Query: 65 QAKEGGVNTIESYVFWNGHELSPGKY-YFGGRFNLVKFIKIIQQARMYMILRIGPFVAAE 123
K G+N + SYVFWN HE++PG + + G NL +F+K + M +ILR GP+ AE
Sbjct: 67 MMKAMGLNVVTSYVFWNHHEVAPGVWDWSTGNHNLREFVKTAAEEGMKVILRPGPYCCAE 126
Query: 124 YNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVEN 183
+ +GG P WL G V R D +PF + ++ + ++ L ++GGPII+ Q EN
Sbjct: 127 WEFGGYPWWLPKTKGLVVRTDNQPFLDSCRVYINQLASQVR--DLQVTKGGPIIMVQAEN 184
Query: 184 EYGYYES----FYGEGGKRYALWAAKMAVAQNIGVP-------WIM-----------CQQ 221
E+G Y + E K Y+ + + +P W+
Sbjct: 185 EFGSYVAQRPDIPLETHKAYSAKIRQQLLDAGFNIPMFTSDGSWLFKGGVIEGVLPTANG 244
Query: 222 FDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARF 281
D D + N + H P + E +PGW + + P + + ++
Sbjct: 245 EDNIDNLKKVVNEY-------HGGQGPYMVAEFYPGWLSHWAEKFPQVSTTSVVTQTKKY 297
Query: 282 FQKGGSVHNYYMYHGGTNFGRTAGGPFIT--------TSYDYEAPIDEYG 323
S NYYM HGGTNFG AG TSYDY+API E G
Sbjct: 298 LDNKVSF-NYYMVHGGTNFGFMAGANCDNIHKLQPDMTSYDYDAPISEAG 346
>gi|336415312|ref|ZP_08595652.1| hypothetical protein HMPREF1017_02760 [Bacteroides ovatus
3_8_47FAA]
gi|335940908|gb|EGN02770.1| hypothetical protein HMPREF1017_02760 [Bacteroides ovatus
3_8_47FAA]
Length = 778
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 101/331 (30%), Positives = 159/331 (48%), Gaps = 30/331 (9%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
+IFFS++ A + +++G+ ++ +A +HY R W ++ K G+N
Sbjct: 14 VIFFSTAQAQTTARKFEAGKNTFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMN 73
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
TI Y+FWN HE GK+ F G+ ++ F + Q+ MY+I+R GP+V AE+ GG+P W
Sbjct: 74 TICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWW 133
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESF 191
L R +P Y+M++ + ++ K+ L ++GG II+ QVENEYG Y
Sbjct: 134 LLKKKDIALRT-LDP--YYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQVENEYGSY--- 187
Query: 192 YGEGGKRYALWAAKMAVAQN--IGVPWIMCQ-----QFDTPDPVINTCN---SFYCDQ-- 239
G + + A + V ++ VP C + D +I T N DQ
Sbjct: 188 ---GIDKPYVSAVRDLVRESGFSDVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQF 244
Query: 240 --FTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGG 297
P P + +E W GWF +G + R ++D+ + + S + YM HGG
Sbjct: 245 KKLKELRPETPLMCSEFWSGWFDHWGRKHETRLAKDMVQGIKDMLDRNISF-SLYMTHGG 303
Query: 298 TNFGRTAGG-----PFITTSYDYEAPIDEYG 323
T FG G + +SYDY+API E G
Sbjct: 304 TTFGHWGGANNPAYSAMCSSYDYDAPISEPG 334
>gi|336410484|ref|ZP_08590961.1| hypothetical protein HMPREF1018_02978 [Bacteroides sp. 2_1_56FAA]
gi|335944314|gb|EGN06136.1| hypothetical protein HMPREF1018_02978 [Bacteroides sp. 2_1_56FAA]
Length = 769
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 101/320 (31%), Positives = 153/320 (47%), Gaps = 32/320 (10%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A N T + ++NG+ + +A +HY R W ++ K G+NTI YVFWN HE
Sbjct: 18 AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
+ G++ F G+ ++ F ++ Q+ MY+I+R GP+V AE+ GG+P WL V R
Sbjct: 78 QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRT- 136
Query: 145 TEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWA 203
+P Y M++ + ++ K+ L ++GG II+ QVENEYG Y K Y +
Sbjct: 137 LDP--YFMERTAIFMKEVGKQLAPLQITRGGNIIMVQVENEYGAYAV-----DKPYV--S 187
Query: 204 AKMAVAQNIG---VPWIMCQQFDTPDP--------VINTCNSFYCDQ----FTPHSPSMP 248
A + ++ G VP C T D IN +Q P P
Sbjct: 188 AIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGANIEQQFKRLREARPETP 247
Query: 249 KIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-- 306
+ +E W GWF +G + RP++ + + + S + YM HGGT FG G
Sbjct: 248 LMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANN 306
Query: 307 ---PFITTSYDYEAPIDEYG 323
+ +SYDY+API E G
Sbjct: 307 PSYSAMCSSYDYDAPISEPG 326
>gi|387791561|ref|YP_006256626.1| beta-galactosidase [Solitalea canadensis DSM 3403]
gi|379654394|gb|AFD07450.1| beta-galactosidase [Solitalea canadensis DSM 3403]
Length = 619
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 99/332 (29%), Positives = 160/332 (48%), Gaps = 32/332 (9%)
Query: 31 DSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKY 90
++ + + +G+ I S +H+ R W ++ K G+N++ +YVFWN HE +PG +
Sbjct: 29 ENGAFVYDGKPVQIHSGEMHFARVPQEYWRHRLKMMKAMGLNSVATYVFWNYHETAPGVW 88
Query: 91 YFG-GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFK 149
F G N+ +FIKI + + +ILR GP+ AE+ YGG P +L + G R + F
Sbjct: 89 DFKTGNKNISEFIKIAGEEGLMVILRPGPYACAEWEYGGYPWFLQNVEGLEVRRNNPKFL 148
Query: 150 YHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGE----GGKRYALWAAK 205
++++ + +K +++ ++GGPII+ Q ENE+G Y + + K Y+
Sbjct: 149 AACKEYIDHLAKEVKNQQI--TKGGPIIMVQAENEFGSYVAQRKDIPLAEHKAYSSAIKA 206
Query: 206 MAVAQNIGVPWIMCQ--------QFDTPDPVINTCNSF-----YCDQFTPHSPSMPKIWT 252
+A VP + P N ++ DQ+ + P +
Sbjct: 207 QLLAAGFDVPLFTSDGSWLFEGGSIENCLPTANGEDNIENLKKVVDQY--NGGKGPYMVA 264
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF---- 308
E +PGW + P P+ED+ ++ Q S NYYM HGGTNFG T+G +
Sbjct: 265 EFYPGWLDHWAEPFPKVPTEDVVKQTEKYLQNNVSF-NYYMVHGGTNFGYTSGANYDKNH 323
Query: 309 ----ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+API E G PK+ ++EL
Sbjct: 324 DIQPDMTSYDYDAPISEAGWA-TPKYIAIREL 354
>gi|321478650|gb|EFX89607.1| hypothetical protein DAPPUDRAFT_303198 [Daphnia pulex]
Length = 651
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 93/323 (28%), Positives = 154/323 (47%), Gaps = 33/323 (10%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
++ Y + + +G +S A+HY R WP +++ + G+N +E+YV W HE
Sbjct: 30 SIDYVNNQFVKDGEPFRYVSGAMHYFRVPVHYWPDRMRKMRAAGLNVLETYVEWASHEPQ 89
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYI-PGTVFRNDT 145
PG Y F G ++ + ++ Q + +ILR GPF+ AE + GG+P WL + P R
Sbjct: 90 PGVYAFEGNLDIEYYFELAQHFNLSVILRPGPFIDAERDMGGLPFWLLSVDPSIKLRTSD 149
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
+ + H++K+ ++++ +K + GGPI+ QVENEYG Y + Y W
Sbjct: 150 KSYVTHVEKWFSVLLSKIK--PYLYNNGGPIVTVQVENEYGSYSP----CDRDYTSWLRD 203
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINT-------------CNSFYCDQFTPH---SPSMPK 249
+ Q++G ++ D + S + F P + P+
Sbjct: 204 F-IRQHLGKDVVLFSTDGDGDGYLQCGKIPGVYATVDFGAGSNAVESFKPQRHFELAGPR 262
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--- 306
+ +E +PGW +G ED+ ++ SV + YM+HGGT+FG T+G
Sbjct: 263 VNSEFYPGWLDMWGEPHSTVDKEDVVKTLDDMLAINASV-SMYMFHGGTSFGFTSGALPS 321
Query: 307 ----PFITTSYDYEAPIDEYGLP 325
P I TSYDY+AP++E G P
Sbjct: 322 NTYTPCI-TSYDYDAPLNEAGDP 343
>gi|334330512|ref|XP_001374407.2| PREDICTED: beta-galactosidase-1-like protein 2 [Monodelphis
domestica]
Length = 673
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 151/320 (47%), Gaps = 31/320 (9%)
Query: 35 LIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGG 94
++ G R I +IHY R W + + K G+NT+ +Y+ WN HE GK+ F G
Sbjct: 90 FLLEGSRFRIFGGSIHYFRVPREYWKDRLLKLKACGLNTLTTYIPWNLHEPERGKFNFSG 149
Query: 95 RFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQK 154
++ F+++ +++ILR GP++ +E++ GG+P WL R F +
Sbjct: 150 NLDVEAFVQMAADIGLWVILRPGPYICSEWDLGGLPSWLLQDSSMELRTTYVGFIKAVDL 209
Query: 155 FMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGV 214
+ ++ + L +QGGPII QVENEYG Y+ Y + KMA+ + V
Sbjct: 210 YFNQLIP--RVVPLQYTQGGPIIAVQVENEYGSYDK-----DPNYMPY-IKMALLKRGIV 261
Query: 215 PWIMCQQFDTPDPV-------------INTCNSFYCDQFTPHSPSMPKIWTENWPGWFKT 261
+M D D + + +S + + P + TE W GWF T
Sbjct: 262 ELLMTS--DNKDGLSGGYVEGVLATINLKNVDSIIFNYLQSFQDNKPTMVTEFWTGWFDT 319
Query: 262 FGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT------TSYDY 315
+GG ++D+ SV+ Q G S+ N YM+HGGTNFG G T TSYDY
Sbjct: 320 WGGPHHIVDADDVMVSVSSIIQMGASL-NLYMFHGGTNFGFMNGAQHFTDYQADVTSYDY 378
Query: 316 EAPIDEYGLPRNPKWGHLKE 335
+A + E G PK+ L+E
Sbjct: 379 DAILTEAG-DYTPKFFKLRE 397
>gi|242004937|ref|XP_002423332.1| beta-galactosidase precursor, putative [Pediculus humanus corporis]
gi|212506351|gb|EEB10594.1| beta-galactosidase precursor, putative [Pediculus humanus corporis]
Length = 596
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 93/307 (30%), Positives = 158/307 (51%), Gaps = 23/307 (7%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
++G+ +S + HY R W +++ K G+N + +YV W+ HE PG Y F G
Sbjct: 1 MDGKPFQYVSGSAHYFRMPNQYWRDRLRKIKAAGLNAVSTYVEWSQHERVPGVYDFEGDL 60
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYI-PGTVFRNDTEPFKYHMQKF 155
++ +F+++ Q+ +++ILR GP++ AE + GG+P WL P R+ + Y++Q++
Sbjct: 61 DVKRFVEMAQEEGLFVILRPGPYICAERDMGGLPYWLMTKHPDIQLRSSDFFYTYYVQRW 120
Query: 156 MTLIVDMMKREKLFASQGGPIILAQVENEYGYYES-------FYGEGGKRYALWAAKMAV 208
M + + K L+ +GGPIIL QVENEYG Y S + +++ + A +
Sbjct: 121 MDKL--LGKFTDLWYGKGGPIILVQVENEYGSYHSCDYNHTYWLRNLFEKHVDYNAVLFT 178
Query: 209 AQNIGVPWIMCQQ----FDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
++ C + + T D N+ S + PS P + +E +PGW +G
Sbjct: 179 TDGASRNFLKCGKIPGVYATVDFGPNSNVSKMFEAQREFEPSGPLVNSEYYPGWLTHWGE 238
Query: 265 RDPHR-PSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI-------TTSYDYE 316
+ R ++D+ ++ + +V N+YM++GG+NFG TAG TSYDY+
Sbjct: 239 KKHARQDTKDVVKTLREMLNEKANV-NFYMFYGGSNFGFTAGANQFGSIYQSDITSYDYD 297
Query: 317 APIDEYG 323
API E G
Sbjct: 298 APISEAG 304
>gi|312378199|gb|EFR24839.1| hypothetical protein AND_10320 [Anopheles darlingi]
Length = 639
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 98/324 (30%), Positives = 160/324 (49%), Gaps = 31/324 (9%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+ Y+ + +++G+ ++ + HY R++P W ++ + GG+N ++ YV W+ H
Sbjct: 25 TIDYERDTFVMDGKDFRYVAGSFHYFRALPQTWRTKLRTLRAGGLNAVDLYVQWSLHNPR 84
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWL-HYIPGTVFRNDT 145
G Y + G N+ I+ + +Y+ILR GP++ AE + GG+P WL + PG R
Sbjct: 85 DGVYSWEGIANVTDIIEAAIEEDLYVILRPGPYICAEIDNGGLPYWLFNKYPGIQVRTSD 144
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGY-------YESFYGEGGKR 198
+ ++K+ + M + E GGPII+ Q+ENEYG Y +F E R
Sbjct: 145 ANYLAEVKKWYGEL--MSRMEPYMYGNGGPIIMVQIENEYGAFGKCDKPYLNFLKEETNR 202
Query: 199 YALWAAKMAVAQNIGVPW---IMCQQFD----TPDPVINTCNSF--YCDQFTPHSPSMPK 249
Y AV + P+ I C Q D T D + T + + + P P
Sbjct: 203 Y---VQDKAVLFTVDRPYDDEIGCGQIDGVFITTDFGLMTDEEVDTHAAKVRSYQPKGPL 259
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG---- 305
+ TE + GW + + RP+ +A ++ + + G +V ++YMY GGTNFG AG
Sbjct: 260 VNTEFYTGWLTHWQESNQRRPAGPLAATLRKMLKDGWNV-DFYMYFGGTNFGFWAGANDW 318
Query: 306 --GPFIT--TSYDYEAPIDEYGLP 325
G ++ TSYDY+AP+DE G P
Sbjct: 319 GLGKYMADITSYDYDAPMDEAGDP 342
>gi|345880280|ref|ZP_08831835.1| hypothetical protein HMPREF9431_00499 [Prevotella oulorum F0390]
gi|343923634|gb|EGV34320.1| hypothetical protein HMPREF9431_00499 [Prevotella oulorum F0390]
Length = 621
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 104/328 (31%), Positives = 148/328 (45%), Gaps = 35/328 (10%)
Query: 21 TYCFA-GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVF 79
T+ A GN YD G+ I S +HY R W +Q K G+N + SYVF
Sbjct: 28 TFTIANGNFLYD-------GKPTQIHSGELHYARVPAPYWRHRLQMMKAMGLNAVTSYVF 80
Query: 80 WNGHELSPGKY-YFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPG 138
WN HE SPG + + G N+ FIKI + + +ILR GP+ AE+ +GG P WL G
Sbjct: 81 WNHHETSPGVWDWQTGNHNIRNFIKIAGEEGLMVILRPGPYCCAEWEFGGYPWWLPKAKG 140
Query: 139 TVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY----ESFYGE 194
V R D +PF + ++ + + ++ L ++GGP+++ Q ENE+G Y + E
Sbjct: 141 LVIRTDNKPFLDSCRVYINQLANQVR--DLQITKGGPVVMVQAENEFGSYVAQRKDIPLE 198
Query: 195 GGKRYALWAAKMAVAQNIGVPWIMCQ--------QFDTPDPVIN-TCNSFYCDQFTP--H 243
K+YA + + +P + P N N Q H
Sbjct: 199 VHKKYAAQIRQQLLDAGFDIPMFTSDGSWLFKGGSIEGALPTANGEGNIEKLKQVVNEYH 258
Query: 244 SPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRT 303
P + E +PGW + P +E + ++ G S NYYM HGGTNFG T
Sbjct: 259 GGVGPYMVAEFYPGWLSHWAEPFPRVSTESVVKQTKKYLDNGVS-FNYYMVHGGTNFGFT 317
Query: 304 AGGPFIT--------TSYDYEAPIDEYG 323
G + TSYDY+API E G
Sbjct: 318 TGANYSNATNLQPDMTSYDYDAPISEAG 345
>gi|339640120|ref|ZP_08661564.1| glycosyl hydrolase family 35 [Streptococcus sp. oral taxon 056 str.
F0418]
gi|339453389|gb|EGP66004.1| glycosyl hydrolase family 35 [Streptococcus sp. oral taxon 056 str.
F0418]
Length = 595
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 97/317 (30%), Positives = 151/317 (47%), Gaps = 40/317 (12%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
++G+ I+S AI Y R P W + K G NT+E+Y+ W+ HE G++ G
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRETLHNLKALGYNTVETYIPWSLHEPQEGQFVTDGLL 71
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
+ + ++Q+ +++I+R P++ AE+++GG+P WL PG FR + F + +F
Sbjct: 72 DFEAYFDLVQEMGLHLIVRPTPYICAEFDFGGMPPWLLNYPGMRFRVNDALFLEKVSRFY 131
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVP- 215
+ + + ++GGPI++ QVENEYG Y K Y AKM + + VP
Sbjct: 132 DWLFPKLLPYQF--TEGGPILMMQVENEYGSYAE-----DKEYMRNIAKMMRDRGVSVPL 184
Query: 216 ------WIMCQQFDT--PDPVINTCNSFYCDQ-----------FTPHSPSMPKIWTENWP 256
WI + T D + T N + Q H P + TE W
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGN--FGSQAKENTDNLRAFMERHGKKWPLMCTEFWD 242
Query: 257 GWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG--------RTAGGPF 308
GWF +G R +ED+A V + G N ++ GGTNFG +T P
Sbjct: 243 GWFSRWGEEIVRRDAEDLAQDVKEMMRIGSM--NLFLLRGGTNFGFISGCSARKTRDLPQ 300
Query: 309 ITTSYDYEAPIDEYGLP 325
I TSYD++AP+ E+G+P
Sbjct: 301 I-TSYDFDAPVTEWGVP 316
>gi|320109257|ref|YP_004184847.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
gi|319927778|gb|ADV84853.1| glycoside hydrolase family 35 [Terriglobus saanensis SP1PR4]
Length = 640
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 108/339 (31%), Positives = 161/339 (47%), Gaps = 48/339 (14%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
++G+ I++ +HY R W +Q+AK G+N I +YVFWN HE PG Y F G+
Sbjct: 35 LDGKPFRILTGEMHYARIPRARWDDAMQKAKALGLNAITTYVFWNVHEPRPGVYDFTGQN 94
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
+L +++ Q+A + +ILR GP+ AE+ +GG P WL P V R+ ++P KFM
Sbjct: 95 DLGEYLAAAQRAGLKVILRPGPYACAEWEFGGYPAWLIKDPTVVVRS-SDP------KFM 147
Query: 157 TLIVDMMKR-----EKLFASQGGPIILAQVENEYGYY--ESFYGEGGKRYALWAA----- 204
+ R + A+ GGPII QVENEYG + + Y E K + +
Sbjct: 148 KPVAKWFHRLGQEVQPYLAANGGPIIAVQVENEYGSFGNDHAYMEQMKDLVISSGIGGKN 207
Query: 205 -KMAVAQN-IGVPWIMCQQFDTPDPVINTCNSFYCD-----------------QFTPHSP 245
K AV ++ VP T D + N + ++ P
Sbjct: 208 PKKAVDEDGKNVPQDTGTMLYTADGGVQLPNGTLPELPAVVNFGGGQAKSELARYEAFRP 267
Query: 246 SMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG 305
+ P++ E W GWF +G + + ++G SV + YM +GGT+FG AG
Sbjct: 268 NGPRMVGEYWAGWFDHWGNNHQKTNAAEQVAEYEYMLKRGYSV-SLYMLYGGTSFGWMAG 326
Query: 306 ------GPF--ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
P+ TSYDY+APIDE G P PK+ L+E+
Sbjct: 327 ANSGDKAPYEPDVTSYDYDAPIDERGNP-TPKYFALREV 364
>gi|422700666|ref|ZP_16758509.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
gi|315170851|gb|EFU14868.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
Length = 593
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 95/320 (29%), Positives = 142/320 (44%), Gaps = 38/320 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ IIS AIHY R P W + K G NT+E+Y+ WN HE G Y F
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ F+++ ++ + +ILR ++ AE+ +GG+P WL G R+ F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ + ++ +QGGP+I+ QVENEYG Y G A + + +
Sbjct: 129 RNYFQVLLPKLAPMQI--TQGGPVIMMQVENEYGSY-------GMEKAYLQQTKQIMEEL 179
Query: 213 GVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIWT 252
G+ + + V++ D F T H P +
Sbjct: 180 GIEVPLFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAG 305
E W GWF +G R D+A V G N YM+HGGTNFG R A
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAK 297
Query: 306 GPFITTSYDYEAPIDEYGLP 325
TSYDY+A + E G P
Sbjct: 298 DLPQVTSYDYDALLTEAGEP 317
>gi|334347175|ref|XP_003341899.1| PREDICTED: beta-galactosidase-1-like protein [Monodelphis
domestica]
Length = 646
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 97/308 (31%), Positives = 145/308 (47%), Gaps = 22/308 (7%)
Query: 35 LIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGG 94
+++G +S +IHY R +W + + + G+N ++ YV WN HE PG Y F G
Sbjct: 56 FLLDGVPFRYVSGSIHYSRVPSPLWSDRLHKMRMSGLNAVQVYVPWNYHEPQPGVYNFQG 115
Query: 95 RFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQK 154
+LV F+K + +ILR GP++ AE+ GG+P WL P V R F +
Sbjct: 116 NRDLVAFLKAAANEDLLVILRPGPYICAEWEMGGLPAWLLQNPEIVLRTSDPDFLAAVDS 175
Query: 155 FMTLIVDMMKREKLFASQGGPIILAQVENEYGYY-----ESFYGEGGKRYALWAAKMAVA 209
+ +++ M+ + GG II QVENEYG Y G AL ++ +
Sbjct: 176 WFHVLMPMV--QPWLYHNGGNIISVQVENEYGSYFACDFRYMRHLAGLFRALLGDQIFLF 233
Query: 210 QNIGVPWIMCQQ----FDTPD--PVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFG 263
G C + T D P N F Q + P+ P + +E + GW +G
Sbjct: 234 TTDGPRGFSCGTLQGLYSTVDFGPDDNMTEIFAMQQ--KYEPNGPLVNSEYYTGWLDYWG 291
Query: 264 GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF------ITTSYDYEA 317
G ++ +A + + G +V N YM+HGGTNFG +G F +TTSYDY+A
Sbjct: 292 GNHSKWDTKTLANGLQNMLELGANV-NMYMFHGGTNFGYWSGADFKKIYQPVTTSYDYDA 350
Query: 318 PIDEYGLP 325
P+ E G P
Sbjct: 351 PLSEAGDP 358
>gi|153808925|ref|ZP_01961593.1| hypothetical protein BACCAC_03226 [Bacteroides caccae ATCC 43185]
gi|149128258|gb|EDM19477.1| glycosyl hydrolase family 35 [Bacteroides caccae ATCC 43185]
Length = 778
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 99/332 (29%), Positives = 154/332 (46%), Gaps = 32/332 (9%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
++ FSS+ A + +++G ++ +A +HY R W ++ K G+N
Sbjct: 14 VVIFSSAQAQTTARKFEAGKNTFLLDGEPFVVKAAELHYTRIPQAYWEHRIEMCKTLGMN 73
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
TI Y+FWN HE GK+ F G+ ++ F + Q+ MY+I+R GP+V AE+ GG+P W
Sbjct: 74 TICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWW 133
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESF 191
L R +P Y+M++ + ++ K+ L ++GG II+ QVENEY Y +
Sbjct: 134 LLKKKDVALRT-LDP--YYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQVENEYSSYAT- 189
Query: 192 YGEGGKRYALWAAKMAVAQNIG---VPWIMCQ--------QFDTPDPVINTCNSFYCDQ- 239
K Y AA + + G VP C + +N DQ
Sbjct: 190 ----DKPYV--AAVRDLVRESGFTDVPLFQCDWSSNFTNNALEDLLWTVNFGTGANIDQQ 243
Query: 240 ---FTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHG 296
P P + +E W GWF +G + RP++D+ + + S + YM HG
Sbjct: 244 FKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHG 302
Query: 297 GTNFGRTAGG-----PFITTSYDYEAPIDEYG 323
GT FG G + +SYDY+API E G
Sbjct: 303 GTTFGHWGGANNPAYSAMCSSYDYDAPISEAG 334
>gi|294627330|ref|ZP_06705916.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 11122]
gi|292598412|gb|EFF42563.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 11122]
Length = 613
Score = 139 bits (351), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 105/357 (29%), Positives = 160/357 (44%), Gaps = 32/357 (8%)
Query: 4 RTPIAPFALLIFFSSSITYCFAG-----NVTYDSRSLIINGRRELIISAAIHYPRSVPGM 58
RT +AP L + F+ IT A N + +G+ ++S AIH+ R
Sbjct: 3 RTTLAPLVLALAFALPITGAAADTERWPNFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAY 62
Query: 59 WPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGP 118
W +Q+A+ G+NT+E+YVFWN E G++ F G ++ F++ + +ILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVREAAAQGLNVILRPGP 122
Query: 119 FVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIIL 178
+ AE+ GG P WL R+ F Q ++ + + + + L GGPII
Sbjct: 123 YACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALANQV--QPLLNHNGGPIIA 180
Query: 179 AQVENEYGYYESFYGEGGKRYALWAA----KMAVAQNIGVPWIMCQQFDTPDPVINTC-- 232
QVENEYG Y + A++ K + + G + V+N
Sbjct: 181 VQVENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPG 240
Query: 233 -NSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQ---KGGSV 288
D+ P P++ E W GWF +G PH ++ A A F+ + G
Sbjct: 241 EAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWG--KPHAATD--ARQQAEEFEWILRQGHS 296
Query: 289 HNYYMYHGGTNFGRTAGGPF----------ITTSYDYEAPIDEYGLPRNPKWGHLKE 335
N YM+ GGT+FG G F TTSYDY+A +DE G P PK+ +++
Sbjct: 297 ANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHP-TPKFALMRD 352
>gi|373953405|ref|ZP_09613365.1| glycoside hydrolase family 35 [Mucilaginibacter paludis DSM 18603]
gi|373890005|gb|EHQ25902.1| glycoside hydrolase family 35 [Mucilaginibacter paludis DSM 18603]
Length = 608
Score = 139 bits (351), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 102/343 (29%), Positives = 163/343 (47%), Gaps = 39/343 (11%)
Query: 5 TPIAPFALLIFFSS--SITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGL 62
+ IA LL F + + + FA + +++G+ +IS +HYPR W
Sbjct: 6 SAIALLMLLFVFPAVGQVNHTFA----LGDEAFLLDGKPFQMISGEMHYPRVPRESWRAR 61
Query: 63 VQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAA 122
++ AK G+NTI +YVFWN HE GK+ F G ++ +F++I +Q +++ILR P+V A
Sbjct: 62 MKMAKAMGLNTIGTYVFWNLHEPQKGKFDFTGNNDVAEFVRIAKQEGLWVILRPSPYVCA 121
Query: 123 EYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQV 181
E+ +GG P WL G V R+ + ++++ + I ++ K+ L + GG I++ Q+
Sbjct: 122 EWEFGGYPYWLQNEKGLVVRSKEAQY---LKEYESYIKEVGKQLAPLQINHGGNILMVQI 178
Query: 182 ENEYGYY----------ESFYGEGGKRYALWAAKMAV-AQNIGVPWIM--CQQFDTPDPV 228
ENEYG Y + + E G L+ A N +P ++ D PD V
Sbjct: 179 ENEYGSYGSDKDYLAINQKLFKEAGFDGLLYTCDPAADLVNGHLPGLLPAVNGIDNPDKV 238
Query: 229 INTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSV 288
+ H+ P E +P WF +G + P+ + + G S+
Sbjct: 239 KQIISQ-------NHNGKGPYYIAEWYPAWFDWWGTKHHTVPAAEYTGRLDSVLAAGISI 291
Query: 289 HNYYMYHGGTNFGRTAGGPFITT--------SYDYEAPIDEYG 323
N YM+HGGT G G + T SYDY+AP+DE G
Sbjct: 292 -NMYMFHGGTTRGFMNGANYKDTSPYEPQVSSYDYDAPLDEAG 333
>gi|423220237|ref|ZP_17206732.1| hypothetical protein HMPREF1061_03505 [Bacteroides caccae
CL03T12C61]
gi|392623314|gb|EIY17417.1| hypothetical protein HMPREF1061_03505 [Bacteroides caccae
CL03T12C61]
Length = 778
Score = 139 bits (351), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 99/332 (29%), Positives = 154/332 (46%), Gaps = 32/332 (9%)
Query: 13 LIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVN 72
++ FSS+ A + +++G ++ +A +HY R W ++ K G+N
Sbjct: 14 VVIFSSAQAQTTARKFEAGKNTFLLDGEPFVVKAAELHYTRIPQAYWEHRIEMCKALGMN 73
Query: 73 TIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW 132
TI Y+FWN HE GK+ F G+ ++ F + Q+ MY+I+R GP+V AE+ GG+P W
Sbjct: 74 TICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWW 133
Query: 133 LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESF 191
L R +P Y+M++ + ++ K+ L ++GG II+ QVENEY Y +
Sbjct: 134 LLKKKDVALRT-LDP--YYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQVENEYSSYAT- 189
Query: 192 YGEGGKRYALWAAKMAVAQNIG---VPWIMCQ--------QFDTPDPVINTCNSFYCDQ- 239
K Y AA + + G VP C + +N DQ
Sbjct: 190 ----DKPYV--AAVRDLVRESGFTDVPLFQCDWSSNFTNNALEDLLWTVNFGTGANIDQQ 243
Query: 240 ---FTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHG 296
P P + +E W GWF +G + RP++D+ + + S + YM HG
Sbjct: 244 FKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHG 302
Query: 297 GTNFGRTAGG-----PFITTSYDYEAPIDEYG 323
GT FG G + +SYDY+API E G
Sbjct: 303 GTTFGHWGGANNPAYSAMCSSYDYDAPISEAG 334
>gi|148273884|ref|YP_001223445.1| putative beta-galactosidase [Clavibacter michiganensis subsp.
michiganensis NCPPB 382]
gi|147831814|emb|CAN02784.1| putative beta-galactosidase [Clavibacter michiganensis subsp.
michiganensis NCPPB 382]
Length = 599
Score = 139 bits (351), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 94/312 (30%), Positives = 146/312 (46%), Gaps = 36/312 (11%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
++GR +I+ A+HY R P W +++A+ G++TIE+YV WN H G +
Sbjct: 20 LDGRPHRVIAGALHYFRVHPDQWADRIRKARLMGLDTIETYVAWNAHSPERGAFDTSAGL 79
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
+L +F+ ++ M+ I+R GP++ AE++ GG+P WL P R + + +F+
Sbjct: 80 DLGRFLDLVHAEGMHAIVRPGPYICAEWDGGGLPGWLFEDPAVGVRRSEPLYLAAVDEFL 139
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPW 216
+ +++ ++ GGP+IL Q+ENEYG YG+ Y + I VP
Sbjct: 140 RRVYEIVAPRQI--DMGGPVILVQIENEYGA----YGDDAD-YLRHLVDLTRESGIIVPL 192
Query: 217 IMCQQFDTPDPVINTCNSFYCDQ-----------------FTPHSPSMPKIWTENWPGWF 259
Q P + D+ H P+ P + +E W GWF
Sbjct: 193 TTVDQ-----PTDEMLSRGSLDELHRTGSFGSRATERLATLRRHQPTGPLMCSEFWDGWF 247
Query: 260 KTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG----GPFIT--TSY 313
+ G H S A + G+ N YM+HGGTNFG T G G + + TSY
Sbjct: 248 DHW-GEHHHTTSAADAAAELDALLAAGASVNIYMFHGGTNFGFTNGANHKGTYQSHVTSY 306
Query: 314 DYEAPIDEYGLP 325
DY+AP+DE G P
Sbjct: 307 DYDAPLDETGSP 318
>gi|228950355|ref|ZP_04112522.1| Beta-galactosidase [Bacillus thuringiensis serovar monterrey BGSC
4AJ1]
gi|228809313|gb|EEM55767.1| Beta-galactosidase [Bacillus thuringiensis serovar monterrey BGSC
4AJ1]
Length = 591
Score = 139 bits (351), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 101/328 (30%), Positives = 155/328 (47%), Gaps = 45/328 (13%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
+ +++G IIS A+HY R VP W + K G NT+E+YV WN HE G + F
Sbjct: 8 KDFMLDGEPIKIISGALHYFRIVPEYWDHSLYNLKALGCNTVETYVPWNIHEPKEGVFNF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +LVK++++ Q+ + +ILR P++ AE+ +GG+P WL R++T F +
Sbjct: 68 EGIADLVKYVQLAQKYGLMVILRPTPYICAEWEFGGLPAWLLKYKDIRVRSNTNLFLDKV 127
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ F +++ M+ L GGPII+ QVENEYG + + K Y K+ ++
Sbjct: 128 ENFYKVLLPMVT--PLQVENGGPIIMMQVENEYGSFGN-----DKEYVRSIKKIMRDLDV 180
Query: 213 GVP-------W--------------IMCQQFDT-PDPVINTCNSFYCDQFTPHSPSMPKI 250
VP W ++ F + + +N SF + P +
Sbjct: 181 TVPLFTSDGAWQEALESGSLIDDDVLVTGNFGSRSNENLNELESF----IKENKKEWPLM 236
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG---- 306
E W GWF +G R ++A V ++ N+YM+ GGTNFG G
Sbjct: 237 CMEFWDGWFNRWGMEIIRRDGSELAEEVKELLKRASI--NFYMFQGGTNFGFMNGCSSRE 294
Query: 307 ----PFITTSYDYEAPIDEYGLPRNPKW 330
P I TSYDY+A + E+G P PK+
Sbjct: 295 NVDLPQI-TSYDYDALLTEWGEP-TPKY 320
>gi|38699441|gb|AAR27061.1| beta-galactosidase 1 [Ficus carica]
Length = 176
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/178 (42%), Positives = 101/178 (56%), Gaps = 3/178 (1%)
Query: 482 LWYTTSIIVNENEEFLKNGSRPVLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPI 541
LWY T I + +E FLK G+ P+L + S GHAL F N +L G A G+ P + I
Sbjct: 1 LWYMTDITIGSDEGFLKTGNYPLLTVYSAGHALLVFVNGQLTGKAYGSLDSPKLTFTQNI 60
Query: 542 SLKAGKNEIALLSMTVGLQNAGPFYEWVGAGITS-VKITGFNSGTLDLSTYSWTYKIGLQ 600
L+ G N++ALLS+ VGL N G +E AG+ V + G NSGT D+S + W+YK GL+
Sbjct: 61 KLRVGVNKLALLSVAVGLPNVGLHFETWNAGVLGPVTLKGLNSGTWDMSKWKWSYKTGLE 120
Query: 601 GEHLGIYNPGYRNNINWVSTMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAW 658
GE L + + +++ W K QPLTWY P G+ P+ LDM MGKG W
Sbjct: 121 GEDLSLQSG--SSSVQWAQGSFFTKQQPLTWYTTTFNAPGGNGPLALDMNSMGKGQIW 176
>gi|21243811|ref|NP_643393.1| beta-galactosidase [Xanthomonas axonopodis pv. citri str. 306]
gi|390989312|ref|ZP_10259611.1| beta-galactosidase [Xanthomonas axonopodis pv. punicae str. LMG
859]
gi|21109406|gb|AAM37929.1| beta-galactosidase [Xanthomonas axonopodis pv. citri str. 306]
gi|372556070|emb|CCF66586.1| beta-galactosidase [Xanthomonas axonopodis pv. punicae str. LMG
859]
Length = 613
Score = 139 bits (350), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 105/357 (29%), Positives = 160/357 (44%), Gaps = 32/357 (8%)
Query: 4 RTPIAPFALLIFFSSSITYCFAG-----NVTYDSRSLIINGRRELIISAAIHYPRSVPGM 58
RT +AP L + F+ IT A N + +G+ ++S AIH+ R
Sbjct: 3 RTTLAPLVLALAFALPITGTAAETERWPNFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAY 62
Query: 59 WPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGP 118
W +Q+A+ G+NT+E+YVFWN E G++ F G ++ F++ + +ILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGHNDVAAFVREAAAQGLNVILRPGP 122
Query: 119 FVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIIL 178
+ AE+ GG P WL R+ F Q ++ + + + + L GGPII
Sbjct: 123 YACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALANQV--QPLLNHNGGPIIA 180
Query: 179 AQVENEYGYYESFYGEGGKRYALWAA----KMAVAQNIGVPWIMCQQFDTPDPVINTC-- 232
QVENEYG Y + A++ K + + G + V+N
Sbjct: 181 VQVENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPG 240
Query: 233 -NSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQ---KGGSV 288
D+ P P++ E W GWF +G PH ++ A A F+ + G
Sbjct: 241 EAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWG--KPHAATD--ARQQAEEFEWILRQGHS 296
Query: 289 HNYYMYHGGTNFGRTAGGPF----------ITTSYDYEAPIDEYGLPRNPKWGHLKE 335
N YM+ GGT+FG G F TTSYDY+A +DE G P PK+ +++
Sbjct: 297 ANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHP-TPKFALMRD 352
>gi|418518035|ref|ZP_13084189.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB1386]
gi|410705285|gb|EKQ63761.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB1386]
Length = 613
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 105/357 (29%), Positives = 160/357 (44%), Gaps = 32/357 (8%)
Query: 4 RTPIAPFALLIFFSSSITYCFAG-----NVTYDSRSLIINGRRELIISAAIHYPRSVPGM 58
RT +AP L + F+ IT A N + +G+ ++S AIH+ R
Sbjct: 3 RTTLAPLVLALAFALPITGTAAETERWPNFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAY 62
Query: 59 WPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGP 118
W +Q+A+ G+NT+E+YVFWN E G++ F G ++ F++ + +ILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGHNDVAAFVREAAAQGLNVILRPGP 122
Query: 119 FVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIIL 178
+ AE+ GG P WL R+ F Q ++ + + + + L GGPII
Sbjct: 123 YACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALANQV--QPLLNHNGGPIIA 180
Query: 179 AQVENEYGYYESFYGEGGKRYALWAA----KMAVAQNIGVPWIMCQQFDTPDPVINTC-- 232
QVENEYG Y + A++ K + + G + V+N
Sbjct: 181 VQVENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPG 240
Query: 233 -NSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQ---KGGSV 288
D+ P P++ E W GWF +G PH ++ A A F+ + G
Sbjct: 241 EAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWG--KPHAATD--ARQQAEEFEWILRQGHS 296
Query: 289 HNYYMYHGGTNFGRTAGGPF----------ITTSYDYEAPIDEYGLPRNPKWGHLKE 335
N YM+ GGT+FG G F TTSYDY+A +DE G P PK+ +++
Sbjct: 297 ANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHP-TPKFALMRD 352
>gi|418519416|ref|ZP_13085468.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB2388]
gi|410704860|gb|EKQ63339.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB2388]
Length = 613
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 105/357 (29%), Positives = 160/357 (44%), Gaps = 32/357 (8%)
Query: 4 RTPIAPFALLIFFSSSITYCFAG-----NVTYDSRSLIINGRRELIISAAIHYPRSVPGM 58
RT +AP L + F+ IT A N + +G+ ++S AIH+ R
Sbjct: 3 RTTLAPLVLALAFALPITGTAAETERWPNFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAY 62
Query: 59 WPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGP 118
W +Q+A+ G+NT+E+YVFWN E G++ F G ++ F++ + +ILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGHNDVAAFVREAAAQGLNVILRPGP 122
Query: 119 FVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIIL 178
+ AE+ GG P WL R+ F Q ++ + + + + L GGPII
Sbjct: 123 YACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALANQV--QPLLNHNGGPIIA 180
Query: 179 AQVENEYGYYESFYGEGGKRYALWAA----KMAVAQNIGVPWIMCQQFDTPDPVINTC-- 232
QVENEYG Y + A++ K + + G + V+N
Sbjct: 181 VQVENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPG 240
Query: 233 -NSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQ---KGGSV 288
D+ P P++ E W GWF +G PH ++ A A F+ + G
Sbjct: 241 EAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWG--KPHAATD--ARQQAEEFEWILRQGHS 296
Query: 289 HNYYMYHGGTNFGRTAGGPF----------ITTSYDYEAPIDEYGLPRNPKWGHLKE 335
N YM+ GGT+FG G F TTSYDY+A +DE G P PK+ +++
Sbjct: 297 ANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHP-TPKFALMRD 352
>gi|406657850|ref|ZP_11065990.1| family 35 glycosyl hydrolase [Streptococcus iniae 9117]
gi|405578065|gb|EKB52179.1| family 35 glycosyl hydrolase [Streptococcus iniae 9117]
Length = 594
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 100/314 (31%), Positives = 144/314 (45%), Gaps = 35/314 (11%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
+N + I+S AIHY R PG W + K G NT+E+YV WN HE GK+ F G
Sbjct: 12 LNNKPFKILSGAIHYFRLAPGSWYKSLYNLKALGFNTVETYVPWNLHEPQRGKFNFEGLA 71
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
+L KF+ + Q+ +Y I+R P++ AE+ +GG+P WL V +D + + +
Sbjct: 72 DLEKFLDLAQEMGLYAIVRPTPYICAEWEFGGLPAWLLKENVRVRSHDAKYLAFVKDYYQ 131
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPW 216
L+ ++KR+ SQGG I++ QVENEYG Y GE K+Y +M I VP
Sbjct: 132 VLLPKLVKRQ---ISQGGNILMFQVENEYGSY----GED-KQYLKQLMQMMREFGISVPL 183
Query: 217 IM--------CQQFDTPDPVINTCNSFYCDQ----------FTPHSPSMPKIWTENWPGW 258
Q D + +F H P + E W GW
Sbjct: 184 FTSDGPWQSALQAGSLIDEDVLVTGNFGSQSKANFSNLRAFLDAHDKKWPLMCMEFWVGW 243
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF-------ITT 311
F + R +++ ++ ++G N YM+HGGTNFG G T
Sbjct: 244 FNRWKEPVIRRDPKEMVDAIMEVLEEGSI--NLYMFHGGTNFGFMNGSSARLQEDLPQVT 301
Query: 312 SYDYEAPIDEYGLP 325
SYDY+A +DE G P
Sbjct: 302 SYDYDAILDEAGNP 315
>gi|67078211|ref|YP_245831.1| beta-galactosidase [Bacillus cereus E33L]
gi|66970517|gb|AAY60493.1| beta-galactosidase [Bacillus cereus E33L]
Length = 598
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 97/318 (30%), Positives = 149/318 (46%), Gaps = 34/318 (10%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
+ +++G IIS A+HY R VP W + K G NT+E+YV WN HE G + F
Sbjct: 8 KDFMLDGEPIKIISGALHYFRIVPEYWDHSLYNLKALGCNTVETYVPWNMHEPKEGIFNF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +LVK++++ Q+ + +ILR P++ AE+ +GG+P WL R++T F +
Sbjct: 68 EGIADLVKYVQLAQKYGLMVILRPTPYICAEWEFGGLPAWLLKYKDIRVRSNTNLFLNKV 127
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY--ESFYGEGGKRY----------- 199
+ F +++ M+ L GGPII+ QVENEYG + + Y K+
Sbjct: 128 ENFYKVLLPMVT--PLQVENGGPIIMMQVENEYGSFGNDKEYVRNIKKLMRDLGVTVPLF 185
Query: 200 ---ALWAAKMAVAQNIGVPWIMCQQFDT-PDPVINTCNSFYCDQFTPHSPSMPKIWTENW 255
W + I ++ F + + +N SF + P + E W
Sbjct: 186 TSDGAWQEALESGSLIDDDVLVTGNFGSRSNENLNELESF----IKENKKEWPLMCMEFW 241
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--------P 307
GWF +G R ++A V ++ N+YM+ GGTNFG G P
Sbjct: 242 DGWFNRWGMEIIRRDGSELAEEVKELLKRASI--NFYMFQGGTNFGFMNGCSSRENVDLP 299
Query: 308 FITTSYDYEAPIDEYGLP 325
I TSYDY+A + E+G P
Sbjct: 300 QI-TSYDYDALLTEWGEP 316
>gi|383648920|ref|ZP_09959326.1| glycosyl hydrolase family 42 [Streptomyces chartreusis NRRL 12338]
Length = 588
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 106/330 (32%), Positives = 162/330 (49%), Gaps = 32/330 (9%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+T S +++G IIS A+HY R PG+W +++A+ G+NT+E+Y+ WN H+ P
Sbjct: 4 LTTTSDGFLLHGEPFRIISGALHYFRVHPGLWSDRLRKARLMGLNTVETYLPWNHHQPDP 63
Query: 88 -GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G G +L +F+++ Q ++++LR GPF+ AE++ GG+P WL P R+
Sbjct: 64 EGPLVLDGFLDLPRFLRLAQDEGLHVLLRPGPFICAEWDGGGLPDWLTSDPDIRLRSSDP 123
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
F + +++ L++ ++ A+ GGP+I QVENEYG Y G A
Sbjct: 124 RFTGAVDRYLDLLLPPLRPHL--AAAGGPVIAVQVENEYGAY-------GDDSAYLKHLA 174
Query: 207 AVAQNIGVPWIM--CQQFDTPD------PVINTCNSF------YCDQFTPHSPSMPKIWT 252
++ GV ++ C Q D P + T +F + + P
Sbjct: 175 DAFRSRGVEELLFTCDQADPEHLAAGSLPGVLTAGTFGSRVEQCLGRLREYRREGPLFCA 234
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF---- 308
E W GWF +GG R + D A + R G SV N YM+HGGTNFG T G
Sbjct: 235 EFWIGWFDHWGGPHHVRNAADAAADLDRLLSAGASV-NIYMFHGGTNFGFTNGANHKHAY 293
Query: 309 --ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+A + E G P PK+ +E+
Sbjct: 294 EPTVTSYDYDAALTECGDP-GPKYHAFREV 322
>gi|167524869|ref|XP_001746770.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775040|gb|EDQ88666.1| predicted protein [Monosiga brevicollis MX1]
Length = 600
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 97/312 (31%), Positives = 154/312 (49%), Gaps = 33/312 (10%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
S ++ G I S ++HY R W ++ AK G+NTI +YV WN HE+ PG +
Sbjct: 56 SNGFLLYGHPFDIWSGSLHYFRIPAEYWLDRLEMAKHMGLNTISTYVPWNFHEVGPGSFD 115
Query: 92 FGGR-FNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKY 150
F +L +F+ + + + +++R P++ AE+++GG+P L P R+ + F
Sbjct: 116 FETHAHDLARFLNLAHEVGLRVLIRPSPYICAEWDFGGLPARLMANPDLELRSSNDAFLD 175
Query: 151 HMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQ 210
++++ ++ +++ L AS GGPII VENEYG Y G R L A +A+ +
Sbjct: 176 EVERYYDALMPILR--PLQASNGGPIIAFYVENEYGSY------GADRDYL-QALVAMMR 226
Query: 211 NIGVPWIMCQQFDTPDP----------VINTCN-----SFYCDQFTPHSPSMPKIWTENW 255
+ G I+ Q F + + T N + DQ P P + +E W
Sbjct: 227 DRG---IVEQMFTCDNAQGLSRGALPGALQTINFQDNVERHLDQLAHFQPDQPLMVSEYW 283
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--PFI--TT 311
GWF G SED+ + + +G S N Y++HGGT+FG AG P+ T
Sbjct: 284 TGWFDHDGEEHHTFDSEDLVEGLQKILDRGASF-NLYVFHGGTSFGWNAGANSPYAPDIT 342
Query: 312 SYDYEAPIDEYG 323
SYDY+AP+ E+G
Sbjct: 343 SYDYDAPLSEHG 354
>gi|325922356|ref|ZP_08184130.1| beta-galactosidase [Xanthomonas gardneri ATCC 19865]
gi|325547138|gb|EGD18218.1| beta-galactosidase [Xanthomonas gardneri ATCC 19865]
Length = 613
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 108/363 (29%), Positives = 159/363 (43%), Gaps = 44/363 (12%)
Query: 4 RTPIAPFALLIFFSSSITYCFAGNVTYDS-----RSLIINGRRELIISAAIHYPRSVPGM 58
RT +AP L + F+ +T A T+ S + +G+ ++S AIH+ R
Sbjct: 3 RTTLAPLVLALAFALPVTAIAATTDTWPSFGTQGTQFVRDGKPYQLLSGAIHFQRIPREY 62
Query: 59 WPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGP 118
W +Q+A+ G+NT+E+YVFWN E G++ F G ++ F++ + +ILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFAGNNDVAAFVREAAAQGLNVILRPGP 122
Query: 119 FVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIIL 178
+ AE+ GG P WL R+ F Q ++ + + L GGPII
Sbjct: 123 YTCAEWEAGGYPAWLFGKDNIRVRSRDPRFLAASQAYLDAVSKQV--HPLLNHNGGPIIA 180
Query: 179 AQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTC--NSFY 236
QVENEYG Y+ +A A A+ G + D D + N ++
Sbjct: 181 VQVENEYGSYDD-------DHAYMADNRAMYVKAGFDDALLFTSDGADMLANGTLPDTLA 233
Query: 237 CDQFTP------------HSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARF--F 282
F P P P++ E W GWF +G PH S D F
Sbjct: 234 VVNFAPGEAKTAFEKLIKFRPEQPRMVGEYWAGWFDHWG--KPH-ASTDAKQQTEEFEWI 290
Query: 283 QKGGSVHNYYMYHGGTNFGRTAGGPF----------ITTSYDYEAPIDEYGLPRNPKWGH 332
+ G N YM+ GGT+FG G F TTSYDY+A +DE G P PK+
Sbjct: 291 LRQGHSANLYMFIGGTSFGFMNGANFQGNPSDHYAPQTTSYDYDAILDEAGRP-TPKFAL 349
Query: 333 LKE 335
+++
Sbjct: 350 MRD 352
>gi|256957323|ref|ZP_05561494.1| beta-galactosidase [Enterococcus faecalis DS5]
gi|257077681|ref|ZP_05572042.1| beta-galactosidase [Enterococcus faecalis JH1]
gi|307270129|ref|ZP_07551446.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
gi|422710565|ref|ZP_16767610.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
gi|422721468|ref|ZP_16778057.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|422867159|ref|ZP_16913760.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
gi|256947819|gb|EEU64451.1| beta-galactosidase [Enterococcus faecalis DS5]
gi|256985711|gb|EEU73013.1| beta-galactosidase [Enterococcus faecalis JH1]
gi|306513498|gb|EFM82113.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
gi|315031294|gb|EFT43226.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|315035298|gb|EFT47230.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
gi|329577710|gb|EGG59137.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
Length = 593
Score = 139 bits (349), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 96/320 (30%), Positives = 140/320 (43%), Gaps = 38/320 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ IIS AIHY R P W + K G NT+E+Y+ WN HE G Y F
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ F+++ ++ + +ILR ++ AE+ +GG+P WL R+ F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKDVRLRSTDPIFMTKV 128
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ K L +QGGP+I+ QVENEYG Y G A + + +
Sbjct: 129 RNYFQVLLP--KLAPLQITQGGPVIMMQVENEYGSY-------GMEKAYLRQTKQIMEEL 179
Query: 213 GVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIWT 252
G+ + + V++ D F T H P +
Sbjct: 180 GIEVPLFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAG 305
E W GWF +G R D+A V G N YM+HGGTNFG R A
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAK 297
Query: 306 GPFITTSYDYEAPIDEYGLP 325
TSYDY+A + E G P
Sbjct: 298 DLPQVTSYDYDALLTEAGEP 317
>gi|297198988|ref|ZP_06916385.1| beta-galactosidase [Streptomyces sviceus ATCC 29083]
gi|297147253|gb|EDY55124.2| beta-galactosidase [Streptomyces sviceus ATCC 29083]
Length = 601
Score = 139 bits (349), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 106/328 (32%), Positives = 152/328 (46%), Gaps = 42/328 (12%)
Query: 29 TYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPG 88
T +++GR ++S A+HY R W + G+N +E+YV WN HE PG
Sbjct: 11 TVGETDFLLDGRPVRLLSGALHYFRVHEAQWGHRLAMLGAMGLNCVETYVPWNLHEPHPG 70
Query: 89 KYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPF 148
L +F+ ++A ++ I+R GP++ AE+ GG+P WL R E +
Sbjct: 71 DVR--DVEALGRFLDAAREAGLWAIVRPGPYICAEWENGGLPHWLK----GHARTSDEVY 124
Query: 149 KYHMQK-FMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
+++ F L+ +++R+ +GGP+I+ Q ENEYG Y S Y L ++
Sbjct: 125 LGQVERWFGRLLPQVVERQ---IDRGGPVIMVQAENEYGSYGS-----DAAYLLRLTELL 176
Query: 208 VAQNIGVPWI--------MCQQFDTPDPVINTCN-----SFYCDQFTPHSPSMPKIWTEN 254
AQ I VP M P V+ T N + + P P + E
Sbjct: 177 RAQGITVPLFTSDGPEDHMLTGGSVPG-VLATVNFGSGARTAFEALRRYRPDGPLMCMEF 235
Query: 255 WPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG--------- 305
W GWF+ +GG R +ED A ++ + G SV N YM HGGTNF AG
Sbjct: 236 WCGWFEHWGGEPVVRDAEDAAEALREILECGASV-NLYMAHGGTNFAGWAGANRGGGALH 294
Query: 306 -GPF--ITTSYDYEAPIDEYGLPRNPKW 330
GP TSYDY+APIDEYG P W
Sbjct: 295 DGPLEPDVTSYDYDAPIDEYGRPTEKFW 322
>gi|294779195|ref|ZP_06744602.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
gi|294453706|gb|EFG22101.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
Length = 592
Score = 139 bits (349), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 96/320 (30%), Positives = 140/320 (43%), Gaps = 38/320 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ IIS AIHY R P W + K G NT+E+Y+ WN HE G Y F
Sbjct: 8 EDFLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ F+++ ++ + +ILR ++ AE+ +GG+P WL R+ F +
Sbjct: 68 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKDVRLRSTDPIFMTKV 127
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ K L +QGGP+I+ QVENEYG Y G A + + +
Sbjct: 128 RNYFQVLLP--KLAPLQITQGGPVIMMQVENEYGSY-------GMEKAYLRQTKQIMEEL 178
Query: 213 GVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIWT 252
G+ + + V++ D F T H P +
Sbjct: 179 GIEVPLFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 238
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAG 305
E W GWF +G R D+A V G N YM+HGGTNFG R A
Sbjct: 239 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAK 296
Query: 306 GPFITTSYDYEAPIDEYGLP 325
TSYDY+A + E G P
Sbjct: 297 DLPQVTSYDYDALLTEAGEP 316
>gi|357518197|ref|XP_003629387.1| Beta-galactosidase [Medicago truncatula]
gi|355523409|gb|AET03863.1| Beta-galactosidase [Medicago truncatula]
Length = 394
Score = 139 bits (349), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 73/148 (49%), Positives = 87/148 (58%), Gaps = 19/148 (12%)
Query: 149 KYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAV 208
K+ MQKF+ IVDMMK E+LF SQGGPII++Q+ENE G E Y G RY
Sbjct: 59 KFQMQKFIEKIVDMMKAERLFESQGGPIIMSQIENECGPTE--YEIGVSRYGYRT----- 111
Query: 209 AQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPH 268
++ + INTCN FYCD F P+ PK+WTE W GWF FGG PH
Sbjct: 112 ------------RYRSSVDHINTCNGFYCDYFYPNKDYKPKMWTEAWTGWFTEFGGPVPH 159
Query: 269 RPSEDIAFSVARFFQKGGSVHNYYMYHG 296
RP+ED+AFSVARF QKGGS+ Y
Sbjct: 160 RPAEDMAFSVARFIQKGGSLFTLRKYSA 187
>gi|294633777|ref|ZP_06712335.1| beta-galactosidase [Streptomyces sp. e14]
gi|292830419|gb|EFF88770.1| beta-galactosidase [Streptomyces sp. e14]
Length = 591
Score = 139 bits (349), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 98/318 (30%), Positives = 155/318 (48%), Gaps = 31/318 (9%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+T + + +G I+SAAIHY R P +W + + + GVNT+E+Y+ WN HE
Sbjct: 5 TLTIKGNAFLRDGEPHQIVSAAIHYFRVHPDLWADRLIRLRAMGVNTVETYIAWNFHEPR 64
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PG++ F G ++VKFI+ + +I+R GP++ AE++ GG+P WL G R
Sbjct: 65 PGEFLFDGDRDIVKFIRTAGDLGLDVIVRPGPYICAEWDLGGLPSWLLADRGARLRRREP 124
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY--ESFYGEGGKRYALWAA 204
+ + + ++ + L AS+GGP++ +ENEYG + ++ Y E ++ +
Sbjct: 125 AYLAAVDAWFDVLFPRLI--PLLASRGGPVVAMSIENEYGSFGTDTDYLEHLRKGMIERG 182
Query: 205 K---MAVAQNIGVPWIMCQQFDTPDPVINTCNSF------YCDQFTPHSPSMPKIWTENW 255
+ + G +++ P + +F H P+ P E W
Sbjct: 183 ADCLLFTSDGAGDGFLLGGSI----PGVLAAGTFGSRPEQSLATLRAHQPTGPLFCVEYW 238
Query: 256 PGWFKTFGGRDPH--RPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG------- 306
GWF +G +PH R + D A ++ R G SV N YM HGGTNFG +G
Sbjct: 239 HGWFDHWG--EPHHVRDAADAADTLDRLLAAGASV-NIYMGHGGTNFGWWSGANHDGLHH 295
Query: 307 -PFITTSYDYEAPIDEYG 323
P + TSYDY AP+ E G
Sbjct: 296 QPDV-TSYDYGAPVGEAG 312
>gi|164519028|ref|NP_001106794.1| beta-galactosidase-1-like protein 3 precursor [Mus musculus]
Length = 662
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 92/304 (30%), Positives = 148/304 (48%), Gaps = 26/304 (8%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
+ G + +I+ +IHY R W + + + G NT+ +Y+ WN HE GK+ F
Sbjct: 71 LEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEIL 130
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
+L ++ + + +++ILR GP++ AE + GG+P WL P T R + F + K+
Sbjct: 131 DLEAYVLLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDKYF 190
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPW 216
++ K L GGP+I QVENEYG ++ + Y + K + + I V
Sbjct: 191 DHLIP--KILPLQYRHGGPVIAVQVENEYGSFQK-----DRNYMNYLKKALLKRGI-VEL 242
Query: 217 IMCQ------QFDTPDPVINTC--NSFYCDQFT---PHSPSMPKIWTENWPGWFKTFGGR 265
++ Q + + + T NSF D F P + E W GW+ ++G +
Sbjct: 243 LLTSDDKDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYDSWGSK 302
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF------ITTSYDYEAPI 319
+ +E+I +V +F G S N YM+HGGTNFG GG + + TSYDY+A +
Sbjct: 303 HIEKSAEEIRHTVYKFISYGLSF-NMYMFHGGTNFGFINGGRYENHHISVVTSYDYDAVL 361
Query: 320 DEYG 323
E G
Sbjct: 362 SEAG 365
>gi|261406481|ref|YP_003242722.1| beta-galactosidase [Paenibacillus sp. Y412MC10]
gi|261282944|gb|ACX64915.1| Beta-galactosidase [Paenibacillus sp. Y412MC10]
Length = 619
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 93/318 (29%), Positives = 152/318 (47%), Gaps = 30/318 (9%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
G +T+++ +++G+ IIS AIHY R VP W + + K G NT+E+Y+ WN HE
Sbjct: 2 GMLTWENGQYLLDGQPYRIISGAIHYFRVVPEYWEDRLLKLKACGFNTVETYIAWNVHEP 61
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
G++ F G ++ FI++ + +++I+R PF+ AE+ +GG+P WL R
Sbjct: 62 QEGEFNFSGMADVASFIELAGKLGLHVIVRPSPFICAEWEFGGLPGWLLGYGEIRLRCSD 121
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
+ + + ++ + L ++ GGPI+ QVENEYG Y G +A
Sbjct: 122 PLYLSKVDHYYDELIPQLV--PLLSTHGGPILAVQVENEYGSY-------GNDHAYLEYL 172
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVI--NTCNSFYCD------------QFTPHSPSMPKIW 251
GV ++ D ++ T + + ++ + P +
Sbjct: 173 REGLVRRGVDVLLFTSDGPTDEMLLGGTLSDVHATVNFGSRVEESFRKYREYRAEEPLMV 232
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI-- 309
E W GWF + R + D+A + + G S+ N YM+HGGTNFG +G I
Sbjct: 233 MEFWNGWFDHWMEDHHVRDAADVAGVLDEMLEMGSSM-NMYMFHGGTNFGFYSGANHIQA 291
Query: 310 ----TTSYDYEAPIDEYG 323
TTSYDY+AP+ E+G
Sbjct: 292 YEPTTTSYDYDAPLTEWG 309
>gi|199599299|ref|ZP_03212698.1| glycosyl hydrolase, family 35 [Lactobacillus rhamnosus HN001]
gi|199589801|gb|EDY97908.1| glycosyl hydrolase, family 35 [Lactobacillus rhamnosus HN001]
Length = 593
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 104/347 (29%), Positives = 155/347 (44%), Gaps = 49/347 (14%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
+++G+ I+S AIHY R P W + K G NT+E+YV WN HE G++
Sbjct: 7 DHEFMLDGKPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYREGEFD 66
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G ++ +F+K ++ +Y I+R P++ AE+ +GG P WL R D +
Sbjct: 67 FSGILDIERFLKTAEELGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDPTYLAA 125
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+ ++ T ++ + ++ + GG +I+ QVENEYG YGE + Y AK+
Sbjct: 126 IDRYYTALMPHLVDHQV--THGGNVIMMQVENEYGS----YGE-DQDYLAVVAKLMQQHG 178
Query: 212 IGVPWIMCQQFDTPDPVINTCNSFY-----------------CDQFTP----HSPSMPKI 250
+ VP D P P S D+ H P +
Sbjct: 179 VDVPLFTS---DGPWPATLNAGSMIDAGILATGNFGSAADKNFDRLAAFHQEHGRDWPLM 235
Query: 251 WTENWPGWFKTFGG----RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
E W GWF +G RDP +ED+ + R GSV N YM+HGGTNFG G
Sbjct: 236 CMEFWDGWFNRWGEPIIRRDPDETAEDLRAVIKR-----GSV-NLYMFHGGTNFGFMNGT 289
Query: 307 PF-------ITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHA 346
TSYDY+AP++E G P + K +H + + A
Sbjct: 290 SARKDHDLPQVTSYDYDAPLNEQGNPTPKYFAIQKMIHEELPEVQQA 336
>gi|69247392|ref|ZP_00604336.1| Beta-galactosidase [Enterococcus faecium DO]
gi|256619331|ref|ZP_05476177.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|384518861|ref|YP_005706166.1| beta-galactosidase [Enterococcus faecalis 62]
gi|389870025|ref|YP_006377575.1| beta-galactosidase [Enterococcus faecium DO]
gi|68194864|gb|EAN09337.1| Beta-galactosidase [Enterococcus faecium DO]
gi|256598858|gb|EEU18034.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|309385841|gb|ADO66768.1| beta-galactosidase [Enterococcus faecium]
gi|323480994|gb|ADX80433.1| beta-galactosidase [Enterococcus faecalis 62]
gi|388535404|gb|AFK60593.1| beta-galactosidase [Enterococcus faecium DO]
Length = 592
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 103/335 (30%), Positives = 158/335 (47%), Gaps = 43/335 (12%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
++ G+ I+S AIHY R P W + K G NT+E+YV WN HE G+++
Sbjct: 7 KEEFLLKGKTFKILSGAIHYFRIPPCDWEHSLYNLKALGFNTVETYVPWNLHEPQKGEFH 66
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G +L +F+ I Q +Y I+R P++ AE+ +GG P WL P + RN+ + H
Sbjct: 67 FEGILDLERFLTIAQDLGLYAIVRPSPYICAEWEFGGFPSWLLREPIHIRRNEI-AYLEH 125
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+ + +++ + +L + GG I++ Q+ENEYG +GE K Y + + +
Sbjct: 126 VADYYDVLMKRIVPHQL--NNGGNILMIQIENEYGS----FGE-EKEYLRAIRDLMIKRG 178
Query: 212 IGVPWIMCQQFDTP------------DPVINTCN--SFYCDQFT-------PHSPSMPKI 250
+ VP+ D P D ++ T N S D F + + P +
Sbjct: 179 VTVPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKDNFNSMKQFFKEYDKNWPLM 235
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG---- 306
E W GWF + R +++A +V ++G N YM+HGGTNFG G
Sbjct: 236 CMEFWDGWFNRWKEPIIQRDPQELAEAVKEVLEQGSI--NLYMFHGGTNFGFMNGCSARG 293
Query: 307 ----PFITTSYDYEAPIDEYGLPRNPKWGHLKELH 337
P I TSYDY AP+DE G P + K +H
Sbjct: 294 VIDLPQI-TSYDYGAPLDEQGNPTEKYYALRKMIH 327
>gi|421767985|ref|ZP_16204697.1| Beta-galactosidase 3 [Lactobacillus rhamnosus LRHMDP2]
gi|421773235|ref|ZP_16209883.1| Beta-galactosidase 3 [Lactobacillus rhamnosus LRHMDP3]
gi|411182327|gb|EKS49478.1| Beta-galactosidase 3 [Lactobacillus rhamnosus LRHMDP3]
gi|411186672|gb|EKS53794.1| Beta-galactosidase 3 [Lactobacillus rhamnosus LRHMDP2]
Length = 656
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 104/344 (30%), Positives = 154/344 (44%), Gaps = 49/344 (14%)
Query: 35 LIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGG 94
+++G+ I+S AIHY R P W + K G NT+E+YV WN HE G++ F G
Sbjct: 73 FMLDGKPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYREGEFDFSG 132
Query: 95 RFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQK 154
++ +F+K + +Y I+R P++ AE+ +GG P WL R D + + +
Sbjct: 133 ILDIERFLKTAEDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDPAYLVAIDR 191
Query: 155 FMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGV 214
+ T ++ + ++ + GG +I+ QVENEYG YGE + Y AK+ + V
Sbjct: 192 YYTALMPHLVDHQV--THGGNVIMMQVENEYGS----YGE-DQDYLAAVAKLMQQHGVDV 244
Query: 215 PWIMCQQFDTPDPVINTCNSFY-----------------CDQFTP----HSPSMPKIWTE 253
P D P P S D+ H P + E
Sbjct: 245 PLFTS---DGPWPATLNAGSMIDAGILATGNFGSAADKNFDRLAAFHQEHGRDWPLMCME 301
Query: 254 NWPGWFKTFG----GRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF- 308
W GWF +G RDP +ED+ + R GSV N YM+HGGTNFG G
Sbjct: 302 FWDGWFNRWGEPIIRRDPDETAEDLRAVIKR-----GSV-NLYMFHGGTNFGFMNGTSAR 355
Query: 309 ------ITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHA 346
TSYDY+AP++E G P + K +H + + A
Sbjct: 356 KDHDLPQVTSYDYDAPLNEQGNPTPKYFAIQKMIHEELPEVQQA 399
>gi|453049630|gb|EME97211.1| beta-galactosidase [Streptomyces mobaraensis NBRC 13819 = DSM
40847]
Length = 584
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 101/315 (32%), Positives = 144/315 (45%), Gaps = 39/315 (12%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
I+GR ++S A+HY R G WP + + G+N +E+YV WN HE G+ + G
Sbjct: 13 IDGREVRLLSGALHYFRVHEGHWPHRLAMLRAMGLNCVETYVPWNRHEPVEGRLHDVG-- 70
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
L +F+ A +Y I+R GP+V AE+ GG+P WL G R F + ++
Sbjct: 71 ELGRFLDAAGAAGLYAIVRPGPYVCAEWENGGLPHWLTGRLGRRVRTSDPEFLRAVDGWL 130
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPW 216
+ + + +GGP++L QVENEYG Y S + Y + VP
Sbjct: 131 EAVGAELTGRQF--GRGGPVVLVQVENEYGSYGS-----DQPYLEHLVGRLRDSGVVVPL 183
Query: 217 IMCQQFDTPDPVINTCNSFYCDQFT---------------PHSPSMPKIWTENWPGWFKT 261
+ D P+ + T + T H P+ P + E W GWF
Sbjct: 184 VTS---DGPEDHMLTGGTVPGATATVNFGSGAREAFRVLRRHRPAGPLMCMEFWCGWFAH 240
Query: 262 FGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF-----------IT 310
+GG R + + A ++ + G SV N YM HGGTNFG AG T
Sbjct: 241 WGGAPAARDAGEAAEALREVLECGASV-NVYMAHGGTNFGGWAGANRAGAEHRGALRPTT 299
Query: 311 TSYDYEAPIDEYGLP 325
TSYDY+AP+DEYG P
Sbjct: 300 TSYDYDAPVDEYGRP 314
>gi|340722578|ref|XP_003399681.1| PREDICTED: beta-galactosidase-like [Bombus terrestris]
Length = 646
Score = 138 bits (348), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 94/326 (28%), Positives = 169/326 (51%), Gaps = 33/326 (10%)
Query: 24 FAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGH 83
F+ V Y++ +++G+ IS + HY R+ W +++ + G+N + +YV W+ H
Sbjct: 30 FSFEVDYENNQFLLDGKPFRYISGSFHYFRTPRQYWRDRLRKMRAAGLNAVSTYVEWSLH 89
Query: 84 ELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW-LHYIPGTVFR 142
+ + ++++ G ++++FI I Q+ ++++LR GP++ AE ++GG+P W L +P R
Sbjct: 90 QPTENEWHWTGDADVIEFINIAQEEGLFVLLRPGPYICAERDFGGLPYWLLARVPDIKLR 149
Query: 143 NDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY-----------ESF 191
+ + +++ ++ I+D K + GGPII+ QVENEYG Y +
Sbjct: 150 TNDSRYMKYVEIYLNEILD--KVQPYLRGNGGPIIMVQVENEYGSYACDREYLSRLRDIM 207
Query: 192 YGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPD--PVINTCNSFYCDQFTPHSPSMPK 249
+ G + L++ A A + +I + + T D P N +F + + P P
Sbjct: 208 RQKIGTKALLYSTDGANANMLRCGFI-PEVYATVDFGPNTNVTKNFEIMRM--YQPRGPL 264
Query: 250 IWTENWPGWFKTFGGRDPHR--PSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG- 306
+ +E +PGW + R+P + + + ++ G SV N YM++GGTNFG TAG
Sbjct: 265 VNSEFYPGWLTHW--REPFQRVQTATVTKTLDEMLSLGASV-NIYMFYGGTNFGYTAGAN 321
Query: 307 -------PFITTSYDYEAPIDEYGLP 325
P + TSYDY+AP+ E G P
Sbjct: 322 GGHNAYNPQL-TSYDYDAPLTEAGDP 346
>gi|322437493|ref|YP_004219583.1| glycoside hydrolase family protein [Granulicella tundricola
MP5ACTX9]
gi|321165386|gb|ADW71089.1| glycoside hydrolase family 35 [Granulicella tundricola MP5ACTX9]
Length = 607
Score = 138 bits (348), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 98/320 (30%), Positives = 151/320 (47%), Gaps = 26/320 (8%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+T D + +++G+ +IS +HYPR W +++A+ G+N + Y FWN HE
Sbjct: 26 LTTDPQHFLLDGQPFQLISGEMHYPRIPRAAWRDRLRKARAMGLNAVTVYAFWNFHEEEE 85
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G + F G+ ++ +F++I QQ +++ILR GP+V AE++ GG P WL P R+
Sbjct: 86 GHFDFTGQRDIAEFVRIAQQEGLFVILRPGPYVCAEWDLGGYPSWLLKSPAVNLRSLDSR 145
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
+ K+M + + L A++GGPI+ QVENEYG + + Y +M
Sbjct: 146 YIAAADKWMKALGQQLA--PLQAAKGGPILAVQVENEYGSFPDSAQPNAQAYLDRVHQMV 203
Query: 208 VAQNIGVPWIMCQQFDTPDPVIN------TCNSFYCDQFTPHSPSMPK-------IWT-E 253
+ + G + D D + T Y + S ++ K I+T E
Sbjct: 204 L--DAGFKDSLLYTGDGADVLARGTFADLTAGIDYGTGDSARSIALYKKFRPNTNIYTAE 261
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF----- 308
W GWF +G + + V GGS+ + YM HGGT+FG G
Sbjct: 262 YWDGWFDHWGAKHEVVDASIHLKEVHDVLTSGGSI-SLYMLHGGTSFGWMNGANIDHNHY 320
Query: 309 --ITTSYDYEAPIDEYGLPR 326
TSYDY+APIDE G R
Sbjct: 321 EPDVTSYDYDAPIDEAGQLR 340
>gi|258507331|ref|YP_003170082.1| beta-galactosidase (GH35) [Lactobacillus rhamnosus GG]
gi|385827042|ref|YP_005864814.1| beta-galactosidase [Lactobacillus rhamnosus GG]
gi|257147258|emb|CAR86231.1| Beta-galactosidase (GH35) [Lactobacillus rhamnosus GG]
gi|259648687|dbj|BAI40849.1| beta-galactosidase [Lactobacillus rhamnosus GG]
Length = 593
Score = 138 bits (348), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 104/347 (29%), Positives = 154/347 (44%), Gaps = 49/347 (14%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
+++G+ I+S AIHY R P W + K G NT+E+YV WN HE G++
Sbjct: 7 DHEFMLDGKPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYREGEFD 66
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G ++ +F+K + +Y I+R P++ AE+ +GG P WL R D +
Sbjct: 67 FSGILDIERFLKTAEDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDPAYLAA 125
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+ ++ T ++ + ++ + GG +I+ QVENEYG YGE + Y AK+
Sbjct: 126 IDRYYTALMPHLVDHQV--THGGNVIMMQVENEYGS----YGE-DQDYLAAVAKLMQQHG 178
Query: 212 IGVPWIMCQQFDTPDPVINTCNSFY-----------------CDQFTP----HSPSMPKI 250
+ VP D P P S D+ H P +
Sbjct: 179 VDVPLFTS---DGPWPATLNAGSMIDAGILATGNFGSAADKNFDRLAAFHQEHGRDWPLM 235
Query: 251 WTENWPGWFKTFGG----RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
E W GWF +G RDP +ED+ + R GSV N YM+HGGTNFG G
Sbjct: 236 CVEFWDGWFNRWGEPIIRRDPDETAEDLRAVIKR-----GSV-NLYMFHGGTNFGFMNGT 289
Query: 307 PF-------ITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHA 346
TSYDY+AP++E G P + K +H + + A
Sbjct: 290 SARKDHDLPQVTSYDYDAPLNEQGNPTPKYFAIQKMIHEELPEVQQA 336
>gi|348529664|ref|XP_003452333.1| PREDICTED: beta-galactosidase-like [Oreochromis niloticus]
Length = 651
Score = 138 bits (348), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 104/332 (31%), Positives = 155/332 (46%), Gaps = 24/332 (7%)
Query: 10 FALLIFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEG 69
LL+ F S+ + V Y + +G + IS +IHY R W + +
Sbjct: 10 LLLLMLFGRSLGESPSFTVDYQNDCFRKDGEKFQYISGSIHYNRIPRVYWKDRLLKMYMA 69
Query: 70 GVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGI 129
G+N I++YV WN HE PG Y F G +L F+K+ Q + +ILR GP++ AE++ GG+
Sbjct: 70 GLNAIQTYVPWNYHEEVPGLYNFSGDRDLEHFLKLAQDVGLLVILRPGPYICAEWDMGGL 129
Query: 130 PVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG-YY 188
P WL V R+ + + K+M ++ M+K GGPII QVENEYG Y+
Sbjct: 130 PAWLLKKKDIVLRSTDPDYIAAVDKWMGKLLPMIK--PYLYQNGGPIITVQVENEYGSYF 187
Query: 189 ESFYGEGGKRYALWAAKMA------VAQNIGVPWIMC----QQFDTPD--PVINTCNSFY 236
Y L+ + + G+ ++ C + T D P N +F
Sbjct: 188 ACDYNYMRHLSKLFRSYLGDEVVLFTTDGAGLGYLKCGSIQDLYATVDFGPGANVTAAFE 247
Query: 237 CD-QFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYH 295
Q PH P + +E + GW +G R +A +++ G +V N YM+
Sbjct: 248 PQRQVQPHG---PLVNSEFYTGWLDHWGSRHSVVSPTQVAKALSEMLLMGANV-NLYMFI 303
Query: 296 GGTNFG--RTAGGPFIT--TSYDYEAPIDEYG 323
GGTNFG A P+ TSYDY+AP+ E G
Sbjct: 304 GGTNFGYWNGANTPYAAQPTSYDYDAPLTEAG 335
>gi|143955283|sp|A2RSQ1.1|GLBL3_MOUSE RecName: Full=Beta-galactosidase-1-like protein 3
gi|124297651|gb|AAI32201.1| Glb1l3 protein [Mus musculus]
gi|124297899|gb|AAI32203.1| Glb1l3 protein [Mus musculus]
Length = 649
Score = 138 bits (348), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 92/304 (30%), Positives = 148/304 (48%), Gaps = 26/304 (8%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
+ G + +I+ +IHY R W + + + G NT+ +Y+ WN HE GK+ F
Sbjct: 58 LEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEIL 117
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
+L ++ + + +++ILR GP++ AE + GG+P WL P T R + F + K+
Sbjct: 118 DLEAYVLLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDKYF 177
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPW 216
++ K L GGP+I QVENEYG ++ + Y + K + + I V
Sbjct: 178 DHLIP--KILPLQYRHGGPVIAVQVENEYGSFQK-----DRNYMNYLKKALLKRGI-VEL 229
Query: 217 IMCQ------QFDTPDPVINTC--NSFYCDQFT---PHSPSMPKIWTENWPGWFKTFGGR 265
++ Q + + + T NSF D F P + E W GW+ ++G +
Sbjct: 230 LLTSDDKDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYDSWGSK 289
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF------ITTSYDYEAPI 319
+ +E+I +V +F G S N YM+HGGTNFG GG + + TSYDY+A +
Sbjct: 290 HIEKSAEEIRHTVYKFISYGLSF-NMYMFHGGTNFGFINGGRYENHHISVVTSYDYDAVL 348
Query: 320 DEYG 323
E G
Sbjct: 349 SEAG 352
>gi|148693363|gb|EDL25310.1| mCG125130, isoform CRA_b [Mus musculus]
Length = 688
Score = 138 bits (348), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 92/304 (30%), Positives = 148/304 (48%), Gaps = 26/304 (8%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
+ G + +I+ +IHY R W + + + G NT+ +Y+ WN HE GK+ F
Sbjct: 97 LEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEIL 156
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
+L ++ + + +++ILR GP++ AE + GG+P WL P T R + F + K+
Sbjct: 157 DLEAYVLLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDKYF 216
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPW 216
++ K L GGP+I QVENEYG ++ + Y + K + + I V
Sbjct: 217 DHLIP--KILPLQYRHGGPVIAVQVENEYGSFQK-----DRNYMNYLKKALLKRGI-VEL 268
Query: 217 IMCQ------QFDTPDPVINTC--NSFYCDQFT---PHSPSMPKIWTENWPGWFKTFGGR 265
++ Q + + + T NSF D F P + E W GW+ ++G +
Sbjct: 269 LLTSDDKDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYDSWGSK 328
Query: 266 DPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF------ITTSYDYEAPI 319
+ +E+I +V +F G S N YM+HGGTNFG GG + + TSYDY+A +
Sbjct: 329 HIEKSAEEIRHTVYKFISYGLSF-NMYMFHGGTNFGFINGGRYENHHISVVTSYDYDAVL 387
Query: 320 DEYG 323
E G
Sbjct: 388 SEAG 391
>gi|404372285|ref|ZP_10977584.1| hypothetical protein CSBG_00400 [Clostridium sp. 7_2_43FAA]
gi|226911573|gb|EEH96774.1| hypothetical protein CSBG_00400 [Clostridium sp. 7_2_43FAA]
Length = 593
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 105/330 (31%), Positives = 157/330 (47%), Gaps = 37/330 (11%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
I+ + I+S A+HY R P W + K G NT+E+Y+ WN HE GK+ F G
Sbjct: 12 IDDNKFKILSGAVHYFRIHPSQWGDTLFNLKALGFNTVETYIPWNIHEPYEGKFDFEGIK 71
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
++ KFIKI ++ +Y+ILR P++ AE+ +GG+P WL R+ + F ++K
Sbjct: 72 DIEKFIKISEKLGLYVILRPTPYICAEWEFGGLPAWLLKDKEIKLRSSDDNF---IEKLR 128
Query: 157 TLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVP 215
D++ R K ++GGP+++ QVENEYG Y + K Y A + + VP
Sbjct: 129 NYYNDLLPRLVKYQVTKGGPVLMMQVENEYGSYGN-----EKEYLRIVASIMKENGVDVP 183
Query: 216 -------WI---MCQQFDTPDPVIN----TCNSFYCDQFTPHSPSMPKIW----TENWPG 257
WI C D ++ + + CD K W E W G
Sbjct: 184 LFTSDGTWIEALECGSLIEDDIFVSGNFGSKSKENCDMLKDFILKNGKEWPIMCMEYWDG 243
Query: 258 WFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF-------IT 310
WF +G R S D+A V K GS+ N YM+ GGTNFG G
Sbjct: 244 WFNRWGEDIIRRDSIDLAEDVKEML-KIGSI-NLYMFRGGTNFGFMNGCSARGNNDLPQV 301
Query: 311 TSYDYEAPIDEYGLPRNPKWGHLKELHGAI 340
TSYDY+A + E+G P + K+ L+++ ++
Sbjct: 302 TSYDYDAILTEWGNPSD-KYYELQKVMKSL 330
>gi|387790696|ref|YP_006255761.1| beta-galactosidase [Solitalea canadensis DSM 3403]
gi|379653529|gb|AFD06585.1| beta-galactosidase [Solitalea canadensis DSM 3403]
Length = 790
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 98/313 (31%), Positives = 144/313 (46%), Gaps = 18/313 (5%)
Query: 26 GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHEL 85
G+ + ++NG+ LI + IH+PR W ++ K G+NTI Y+FWN HE
Sbjct: 36 GSFVLGTNEFLLNGKPFLIRAGEIHFPRIPREYWDHRIKLCKAMGMNTICIYLFWNFHEQ 95
Query: 86 SPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDT 145
P ++ F G+ ++ F+K++Q MY I+R GP+ AE++ GG+P WL P R T
Sbjct: 96 KPDQFDFTGQKDVAAFVKLVQANGMYCIVRPGPYACAEWDMGGLPWWLLKKPDLKVR--T 153
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQ-GGPIILAQVENEYGYY-ESFYGEGGKRYALWA 203
+Y M++ + ++ K+ L Q GG II+ QVENEY + S R L
Sbjct: 154 LEDRYFMERSAKYLKEVGKQLALLQIQNGGNIIMVQVENEYAAFGNSAEYMDANRKNLKD 213
Query: 204 AKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQ--------FTPHSPSMPKIWTENW 255
A Q + W DP + +F F P+ P + +E W
Sbjct: 214 AGFNKVQLMRCDWSSTFNSYITDPEVAITLNFGAGSDVDKQFKGFQEKHPTAPLMCSEYW 273
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG---PF--IT 310
GWF +G R S+ + S + YM HGGT FG+ G P+ +
Sbjct: 274 TGWFDHWGRPHETRSINSFIGSLKDMMDRKIS-FSLYMAHGGTTFGQWGGANSPPYSAMV 332
Query: 311 TSYDYEAPIDEYG 323
SYDY API E G
Sbjct: 333 ASYDYNAPIGEQG 345
>gi|81889875|sp|Q5XIL5.1|GLBL3_RAT RecName: Full=Beta-galactosidase-1-like protein 3
gi|53734228|gb|AAH83665.1| Galactosidase, beta 1-like 3 [Rattus norvegicus]
Length = 631
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 92/305 (30%), Positives = 147/305 (48%), Gaps = 28/305 (9%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
+ G + +I+ +IHY R W + + + G NT+ +Y+ WN HE GK+ F
Sbjct: 58 LEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEIL 117
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
+L ++ + + +++ILR GP++ AE + GG+P WL PG+ R + F + K+
Sbjct: 118 DLEAYVLLAKTLGLWVILRPGPYICAEVDLGGLPSWLLRNPGSNLRTTNKDFIEAVDKYF 177
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPW 216
++ K L +GGP+I QVENEYG + + K Y + K + N G+
Sbjct: 178 DHLIP--KILPLQYRRGGPVIAVQVENEYGSFRN-----DKNYMEYIKKALL--NRGIVE 228
Query: 217 IMCQQFDTPDPVINT---------CNSFYCDQFTP---HSPSMPKIWTENWPGWFKTFGG 264
++ + I + NSF D F P + E W GW+ ++G
Sbjct: 229 LLLTSDNESGIRIGSVKGALATINVNSFIKDSFVKLHRMQNDKPIMIMEYWTGWYDSWGS 288
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG------PFITTSYDYEAP 318
+ + + +I ++ RFF G S N YM+HGGTNFG GG + TSYDY+A
Sbjct: 289 KHTEKSANEIRRTIYRFFSYGLSF-NVYMFHGGTNFGFINGGYHENGHTNVVTSYDYDAV 347
Query: 319 IDEYG 323
+ E G
Sbjct: 348 LSEAG 352
>gi|164519029|ref|NP_001019529.2| beta-galactosidase-1-like protein 3 precursor [Rattus norvegicus]
Length = 644
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 92/305 (30%), Positives = 147/305 (48%), Gaps = 28/305 (9%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
+ G + +I+ +IHY R W + + + G NT+ +Y+ WN HE GK+ F
Sbjct: 71 LEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEIL 130
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
+L ++ + + +++ILR GP++ AE + GG+P WL PG+ R + F + K+
Sbjct: 131 DLEAYVLLAKTLGLWVILRPGPYICAEVDLGGLPSWLLRNPGSNLRTTNKDFIEAVDKYF 190
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPW 216
++ K L +GGP+I QVENEYG + + K Y + K + N G+
Sbjct: 191 DHLIP--KILPLQYRRGGPVIAVQVENEYGSFRN-----DKNYMEYIKKALL--NRGIVE 241
Query: 217 IMCQQFDTPDPVINT---------CNSFYCDQFTP---HSPSMPKIWTENWPGWFKTFGG 264
++ + I + NSF D F P + E W GW+ ++G
Sbjct: 242 LLLTSDNESGIRIGSVKGALATINVNSFIKDSFVKLHRMQNDKPIMIMEYWTGWYDSWGS 301
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG------PFITTSYDYEAP 318
+ + + +I ++ RFF G S N YM+HGGTNFG GG + TSYDY+A
Sbjct: 302 KHTEKSANEIRRTIYRFFSYGLSF-NVYMFHGGTNFGFINGGYHENGHTNVVTSYDYDAV 360
Query: 319 IDEYG 323
+ E G
Sbjct: 361 LSEAG 365
>gi|326779952|ref|ZP_08239217.1| glycoside hydrolase family 35 [Streptomyces griseus XylebKG-1]
gi|326660285|gb|EGE45131.1| glycoside hydrolase family 35 [Streptomyces griseus XylebKG-1]
Length = 648
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 101/324 (31%), Positives = 150/324 (46%), Gaps = 41/324 (12%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+ T ++G+ ++S A+HY R W + G+N +E+YV WN HE
Sbjct: 3 DFTVGDDCFRLDGKPVRLLSGALHYFRVHEAQWEHRLAMLAAMGLNCVETYVPWNLHEPR 62
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+ G L +F+ +++A ++ I+R GP++ AE+ GG+PVW+ G R
Sbjct: 63 EGEVRDVG--ALGRFLDAVERAGLWAIVRPGPYICAEWENGGLPVWVTGRFGRRVRTRDA 120
Query: 147 PFKYHMQK-FMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
++ +++ F L+ +++R+ S+GGP++L Q ENEYG Y S Y W A
Sbjct: 121 AYRAVVERWFRELLPQVVRRQ---VSRGGPVVLVQAENEYGSYGS-----DAVYLEWLAG 172
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTP---------------HSPSMPKI 250
+ + VP D P+ + T S T H P P +
Sbjct: 173 LLRQCGVTVPLFTS---DGPEDHMLTGGSVPGLLATANFGSGAREGFKVLRRHQPGGPLM 229
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG----G 306
E W GWF +G R E A ++ + G SV N YM HGGTNFG AG G
Sbjct: 230 CMEFWCGWFDHWGAEPVRRDPEQAAGALREILECGASV-NVYMAHGGTNFGGWAGANRSG 288
Query: 307 PF-------ITTSYDYEAPIDEYG 323
P TSYDY+AP+DEYG
Sbjct: 289 PHQDESFQPTVTSYDYDAPVDEYG 312
>gi|258538519|ref|YP_003173018.1| beta-galactosidase [Lactobacillus rhamnosus Lc 705]
gi|385834266|ref|YP_005872040.1| beta-galactosidase family protein [Lactobacillus rhamnosus ATCC
8530]
gi|257150195|emb|CAR89167.1| Beta-galactosidase (GH35) [Lactobacillus rhamnosus Lc 705]
gi|355393757|gb|AER63187.1| beta-galactosidase family protein [Lactobacillus rhamnosus ATCC
8530]
Length = 593
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 104/347 (29%), Positives = 154/347 (44%), Gaps = 49/347 (14%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
+++G+ I+S AIHY R P W + K G NT+E+YV WN HE G++
Sbjct: 7 DHEFMLDGKPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYREGEFD 66
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G ++ +F+K + +Y I+R P++ AE+ +GG P WL R D +
Sbjct: 67 FSGILDIERFLKTAEDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDPAYLAA 125
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+ ++ T ++ + ++ + GG +I+ QVENEYG YGE + Y AK+
Sbjct: 126 IDRYYTALMPHLVDHQV--THGGNVIMMQVENEYGS----YGE-DQDYLAAVAKLMQQHG 178
Query: 212 IGVPWIMCQQFDTPDPVINTCNSFY-----------------CDQFTP----HSPSMPKI 250
+ VP D P P S D+ H P +
Sbjct: 179 VDVPLFTS---DGPWPATLNAGSMIDAGILATGNFGSAADKNFDRLAAFHQEHGRDWPLM 235
Query: 251 WTENWPGWFKTFGG----RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
E W GWF +G RDP +ED+ + R GSV N YM+HGGTNFG G
Sbjct: 236 CMEFWDGWFNRWGEPIIRRDPDETAEDLRAVIKR-----GSV-NLYMFHGGTNFGFMNGT 289
Query: 307 PF-------ITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHA 346
TSYDY+AP++E G P + K +H + + A
Sbjct: 290 SARKDHDLPQVTSYDYDAPLNEQGNPTPKYFAIQKMIHEELPEVQQA 336
>gi|307289489|ref|ZP_07569436.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
gi|422703871|ref|ZP_16761687.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
gi|306499556|gb|EFM68926.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
gi|315164595|gb|EFU08612.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
Length = 593
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 96/320 (30%), Positives = 140/320 (43%), Gaps = 38/320 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ IIS AIHY R P W + K G NT+E+Y+ WN HE G Y F
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ F+++ ++ + +ILR ++ AE+ +GG+P WL R+ F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKSVRLRSTDPIFMTKV 128
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ K L +QGGP+I+ QVENEYG Y G A + + +
Sbjct: 129 RNYFQVLLP--KLAPLQITQGGPVIMMQVENEYGSY-------GMEKAYLRQTKQIMEEL 179
Query: 213 GVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIWT 252
G+ + + V++ D F T H P +
Sbjct: 180 GIEVPLFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAG 305
E W GWF +G R D+A V G N YM+HGGTNFG R A
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAK 297
Query: 306 GPFITTSYDYEAPIDEYGLP 325
TSYDY+A + E G P
Sbjct: 298 DLPQVTSYDYDALLTEAGEP 317
>gi|182439300|ref|YP_001827019.1| beta-galactosidase [Streptomyces griseus subsp. griseus NBRC 13350]
gi|178467816|dbj|BAG22336.1| putative beta-galactosidase [Streptomyces griseus subsp. griseus
NBRC 13350]
Length = 630
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 101/324 (31%), Positives = 150/324 (46%), Gaps = 41/324 (12%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+ T ++G+ ++S A+HY R W + G+N +E+YV WN HE
Sbjct: 3 DFTVGDDCFRLDGKPVRLLSGALHYFRVHEAQWEHRLAMLAAMGLNCVETYVPWNLHEPR 62
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+ G L +F+ +++A ++ I+R GP++ AE+ GG+PVW+ G R
Sbjct: 63 EGEVRDVG--ALGRFLDAVERAGLWAIVRPGPYICAEWENGGLPVWVTGRFGRRVRTRDA 120
Query: 147 PFKYHMQK-FMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
++ +++ F L+ +++R+ S+GGP++L Q ENEYG Y S Y W A
Sbjct: 121 AYRAVVERWFRELLPQVVRRQ---VSRGGPVVLVQAENEYGSYGS-----DAVYLEWLAG 172
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTP---------------HSPSMPKI 250
+ + VP D P+ + T S T H P P +
Sbjct: 173 LLRQCGVTVPLFTS---DGPEDHMLTGGSVPGLLATANFGSGAREGFAVLRRHQPGGPLM 229
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG----G 306
E W GWF +G R E A ++ + G SV N YM HGGTNFG AG G
Sbjct: 230 CMEFWCGWFDHWGAEPVRRDPEQAAGALREILECGASV-NVYMAHGGTNFGGWAGANRSG 288
Query: 307 PF-------ITTSYDYEAPIDEYG 323
P TSYDY+AP+DEYG
Sbjct: 289 PHQDESFQPTVTSYDYDAPVDEYG 312
>gi|328956117|ref|YP_004373450.1| beta-galactosidase [Coriobacterium glomerans PW2]
gi|328456441|gb|AEB07635.1| Beta-galactosidase [Coriobacterium glomerans PW2]
Length = 597
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 96/314 (30%), Positives = 146/314 (46%), Gaps = 34/314 (10%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
++GR I S AIHY R P W + K G NT+E+Y+ WN HE ++
Sbjct: 12 MDGRPFQIRSGAIHYFRLHPDDWEHSLYNLKAMGFNTVETYIPWNMHEPHKDEFRITAET 71
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
+ +F+ + ++ I+R PF+ AE+ +GG+P WL G R++ F + +
Sbjct: 72 DFERFLGLASDLGLWAIVRPSPFICAEWEFGGLPAWLLAERGMRIRSNDPRFLERLALYY 131
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGV-- 214
+++ + + ++ ++G II+ Q+ENEYG Y Y + V + I V
Sbjct: 132 DMLMPHLAKHQI--TRGANIIMMQIENEYGSYCE-----DSDYMRSVRDLMVERGIDVKL 184
Query: 215 -----PWIMCQQFDT--PDPVINTCN--SFYCDQFTP-------HSPSMPKIWTENWPGW 258
PW CQ+ + D V+ T N S + F H + P + E W GW
Sbjct: 185 CTSDGPWRACQRAGSLIEDNVLATGNFGSHATENFAALKGFHKEHGKTWPLMCMEFWAGW 244
Query: 259 FKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAGGPFITT 311
F +G R E++A SV ++G N YM+HGGTNFG R T
Sbjct: 245 FNRWGESVVRRDPEELARSVREALREGSI--NLYMFHGGTNFGFMNGCSARHDHDLHQIT 302
Query: 312 SYDYEAPIDEYGLP 325
SYDY+AP+DE G P
Sbjct: 303 SYDYDAPLDEAGNP 316
>gi|241156773|ref|XP_002407847.1| beta-galactosidase precursor, putative [Ixodes scapularis]
gi|215494239|gb|EEC03880.1| beta-galactosidase precursor, putative [Ixodes scapularis]
Length = 388
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 95/319 (29%), Positives = 161/319 (50%), Gaps = 26/319 (8%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+ Y++ + +G IIS ++HY R++P W + K G+NT+++Y+ W+ HE
Sbjct: 35 IDYENNCFLKDGEPFQIISGSMHYFRTLPEQWEDRLTTMKTAGLNTLQTYIEWSSHEPEN 94
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTV-FRNDTE 146
G+Y F G+ ++VKFIKI ++ +ILR GPF+ AE + GG P WL TV R+ +
Sbjct: 95 GQYDFEGQEDIVKFIKIAERLGFLVILRPGPFIDAERDMGGFPYWLLSEDNTVRLRSSDQ 154
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE-------SFYGEGGKRY 199
+ ++ ++ + ++ ++K S GGP+++ QVENEYG Y + + +R+
Sbjct: 155 RYLKYVDRYFSKLLPLLKPLLY--SNGGPVLMLQVENEYGSYHECDFVYTAHLKDLMRRH 212
Query: 200 ALWAAKMAVAQNIGVPWIMCQQFD----TPD--PVINTCNSFYCDQFTPHSPSMPKIWTE 253
+ G ++ C + D T D P + SF + H P + +E
Sbjct: 213 LGPDVLLYTTDGNGDRYLKCGKNDGAYTTVDFGPGSDVVASFAAQR--RHQDRGPLMNSE 270
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT--- 310
+ GW +G + + +A ++ SV N Y++HGG++FG TAG
Sbjct: 271 FYSGWLDNWGDKHWEGNASAVAETLREMLTMNASV-NIYVFHGGSSFGCTAGANLDKGVY 329
Query: 311 ----TSYDYEAPIDEYGLP 325
TSYDY+AP++E G P
Sbjct: 330 SPNPTSYDYDAPMNEAGDP 348
>gi|443621995|ref|ZP_21106540.1| putative Beta-galactosidase [Streptomyces viridochromogenes Tue57]
gi|443344625|gb|ELS58722.1| putative Beta-galactosidase [Streptomyces viridochromogenes Tue57]
Length = 587
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 100/329 (30%), Positives = 150/329 (45%), Gaps = 30/329 (9%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+T S +++G IIS A+HY R P W +++A+ G+NT+E+YV WN H+ P
Sbjct: 4 LTTTSDGFLLHGEPFRIISGALHYFRIHPDQWADRLRKARLMGLNTVETYVPWNFHQPDP 63
Query: 88 -GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G G +L +++ + Q + ++LR GPF+ AE++ GG+P WL P R+
Sbjct: 64 DGPLVLDGLLDLPRYLSLAQAEGLRVLLRPGPFICAEWHDGGLPAWLVADPDVRLRSSDP 123
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY----------ESFYGEGG 196
F + ++ L V + A+ GGP+I QVENEYG Y E + G
Sbjct: 124 RFTRAVDRY--LDVLLPPLLPHMAAAGGPVIAVQVENEYGAYGDDTAYLKHLEQAFRSRG 181
Query: 197 KRYALWAAKMAVAQNI---GVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTE 253
L+ A ++ G+P ++ + H P P + E
Sbjct: 182 VEELLFTCDQADPGHLAAGGLPGVLATA------TFGSRVGQNLAVLRTHRPEGPLMCAE 235
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF----- 308
W GWF +GG H A + G+ N YM+HGGTNFG T G
Sbjct: 236 FWIGWFDHWGGPH-HVRDAADAAADLDRLLSAGASVNIYMFHGGTNFGFTNGANHKHAYE 294
Query: 309 -ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+A + E G P PK+ +E+
Sbjct: 295 PTVTSYDYDAALTECGDP-GPKYHAFREV 322
>gi|355690250|gb|AER99094.1| galactosidase, beta 1 [Mustela putorius furo]
Length = 648
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 99/318 (31%), Positives = 153/318 (48%), Gaps = 32/318 (10%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+ Y + +G+ IS +IHY R W + + K G+N I++YV WN HE P
Sbjct: 23 IDYHHNRFLKDGQPFRYISGSIHYSRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQP 82
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G+Y F G ++ FIK+ + + +ILR GP++ AE++ GG+P WL + R+
Sbjct: 83 GQYKFSGEQDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDPD 142
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY------------ESFYGEG 195
+ + K++ +++ MK L GGPII QVENEYG Y + F+
Sbjct: 143 YLAAVDKWLGVLLPRMK--PLLYQNGGPIITVQVENEYGSYFTCDYDYLRFLQKLFHYHL 200
Query: 196 GKRYALWAAKMAVAQNIGVPWIMCQQ----FDTPD--PVINTCNSFYCDQFTPHSPSMPK 249
GK L+ A+ P++ C + T D P N +F + + P P
Sbjct: 201 GKDVLLFTTDGALE-----PFLQCGALQGLYATVDFGPGANITAAFEVQRKS--EPKGPL 253
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--P 307
+ +E + GW +G +E +A S+ +G +V N YM+ GGTNF G P
Sbjct: 254 VNSEFYTGWLDHWGQPHSTVKTEVVASSLHDILARGANV-NLYMFIGGTNFAYWNGANMP 312
Query: 308 FIT--TSYDYEAPIDEYG 323
+ TSYDY+AP+ E G
Sbjct: 313 YKAQPTSYDYDAPLSEAG 330
>gi|229553373|ref|ZP_04442098.1| beta-galactosidase [Lactobacillus rhamnosus LMS2-1]
gi|229313254|gb|EEN79227.1| beta-galactosidase [Lactobacillus rhamnosus LMS2-1]
Length = 583
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 104/343 (30%), Positives = 154/343 (44%), Gaps = 49/343 (14%)
Query: 36 IINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGR 95
+++G+ I+S AIHY R P W + K G NT+E+YV WN HE G++ F G
Sbjct: 1 MLDGKPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYREGEFDFSGI 60
Query: 96 FNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKF 155
++ +F+K + +Y I+R P++ AE+ +GG P WL R D + + ++
Sbjct: 61 LDIERFLKTAEDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDPAYLAAIDRY 119
Query: 156 MTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVP 215
T ++ + ++ + GG +I+ QVENEYG YGE + Y AK+ + VP
Sbjct: 120 YTALMPHLVDHQV--THGGNVIMMQVENEYGS----YGE-DQDYLAAVAKLMQQHGVDVP 172
Query: 216 WIMCQQFDTPDPVINTCNSFY-----------------CDQFTP----HSPSMPKIWTEN 254
D P P S D+ H P + E
Sbjct: 173 LFTS---DGPWPATLNAGSMIDAGILATGNFGSAADKNFDRLAAFHQEHGRDWPLMCMEF 229
Query: 255 WPGWFKTFGG----RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF-- 308
W GWF +G RDP +ED+ + R GSV N YM+HGGTNFG G
Sbjct: 230 WDGWFNRWGEPIIRRDPDETAEDLRAVIKR-----GSV-NLYMFHGGTNFGFMNGTSARK 283
Query: 309 -----ITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHA 346
TSYDY+AP++E G P + K +H + + A
Sbjct: 284 DHDLPQVTSYDYDAPLNEQGNPTPKYFAIQKMIHEELPEVQQA 326
>gi|429198615|ref|ZP_19190430.1| glycosyl hydrolase family 35 [Streptomyces ipomoeae 91-03]
gi|428665679|gb|EKX64887.1| glycosyl hydrolase family 35 [Streptomyces ipomoeae 91-03]
Length = 593
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 97/331 (29%), Positives = 160/331 (48%), Gaps = 33/331 (9%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+T S +++G IIS A+HY R P +W +++A+ G+NT+E+YV WN H+ P
Sbjct: 6 LTTSSDGFLLHGEPFRIISGAMHYFRIHPDLWADRLRKARLMGLNTVETYVPWNLHQPDP 65
Query: 88 GK-YYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G +L +++ + + ++++LR GP++ AE++ GG+P WL P R+
Sbjct: 66 DSPLVLDGLLDLPRYLCLARDEGLHVLLRPGPYICAEWDGGGLPSWLTTDPDIRLRSSDP 125
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
F + +++ +++ + A+ GG +I QVENEYG Y G A
Sbjct: 126 RFTDALDRYLDILLPPLLPH--MAANGGSVIAVQVENEYGAY-------GDDTAYLKHVH 176
Query: 207 AVAQNIGVPWIM--CQQFDTPD-------PVINTCNSF------YCDQFTPHSPSMPKIW 251
++ G+ ++ C Q + P + + +F + H P P +
Sbjct: 177 QALRSRGIEELLFTCDQAGSAHHLAAGSLPGVLSTATFGGRIEESLEALRAHQPEGPLMC 236
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--- 308
+E W GWF +G R + + A + + G SV N YM+HGGTNFG T G
Sbjct: 237 SEFWIGWFDHWGEEHHVRDAANAAADLDKLLAAGASV-NIYMFHGGTNFGFTNGANHDQC 295
Query: 309 ---ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
I TSYDY+A + E G P PK+ +E+
Sbjct: 296 YAPIVTSYDYDAALTESGDP-GPKYHAFREV 325
>gi|156380756|ref|XP_001631933.1| predicted protein [Nematostella vectensis]
gi|156218982|gb|EDO39870.1| predicted protein [Nematostella vectensis]
Length = 652
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 101/340 (29%), Positives = 161/340 (47%), Gaps = 41/340 (12%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+ +D+ + +G+ IS IHY R W + + K G+N I++YV WN HE +P
Sbjct: 27 IDFDNNRFLKDGQPFRYISGGIHYFRVPQFFWKDRLLKMKAAGMNAIQTYVPWNLHEPTP 86
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GKY F G +L+ F+++ + I+R GP++ AE+++GG+P WL R+ +
Sbjct: 87 GKYNFDGGADLLSFLELAHSLDLVAIVRAGPYICAEWDFGGLPAWLLKNSSITLRSSKD- 145
Query: 148 FKYHMQKFMTLI-----VDMMKREKLFASQGGPIILAQVENEYGYYE------------S 190
Q +M+ + V + K + GGP+I+ QVENEYG Y +
Sbjct: 146 -----QAYMSAVDSWMGVLLPKLKAYLYEHGGPVIMVQVENEYGNYYTCDHEYMNHLEIT 200
Query: 191 FYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPD--PVINTCNSFYCD-QFTPHSPSM 247
F G L+ + N+ ++ F T D P I+ +F QF P P
Sbjct: 201 FRQHLGSNVILFTTDPPIPYNLKCGTLLS-LFTTIDFGPGIDPAAAFNIQRQFQPKGPF- 258
Query: 248 PKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG------ 301
+ +E + GW +G + + SE ++ + + SV N YM+ GGTNFG
Sbjct: 259 --VNSEYYTGWLDHWGEQHQTKTSESVSQYLDKILALNASV-NLYMFEGGTNFGFWNGAN 315
Query: 302 RTAGGPF---ITTSYDYEAPIDEYGLPRNPKWGHLKELHG 338
AG + TSYDY+AP+ E G P K+ ++E+ G
Sbjct: 316 ANAGASSFQPVPTSYDYDAPLTEAGDPTE-KYFAIREVVG 354
>gi|295113973|emb|CBL32610.1| Beta-galactosidase [Enterococcus sp. 7L76]
Length = 592
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 96/320 (30%), Positives = 140/320 (43%), Gaps = 38/320 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ IIS AIHY R P W + K G NT+E+Y+ WN HE G Y F
Sbjct: 8 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G N+ F+++ ++ + +ILR ++ AE+ +GG+P WL R+ F +
Sbjct: 68 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKDVRLRSTDPIFMTKV 127
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + +++ K L +QGGP+I+ QVENEYG Y G A + + +
Sbjct: 128 RNYFQVLLP--KLAPLQITQGGPVIMIQVENEYGSY-------GMEKAYLRQTKQIMEEL 178
Query: 213 GVPWIMCQQFDTPDPVINTCNSFYCDQF--------------------TPHSPSMPKIWT 252
G+ + + V++ D F T H P +
Sbjct: 179 GIEVPLFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 238
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAG 305
E W GWF +G R D+A V G N YM+HGGTNFG R A
Sbjct: 239 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAK 296
Query: 306 GPFITTSYDYEAPIDEYGLP 325
TSYDY+A + E G P
Sbjct: 297 DLPQVTSYDYDALLTEAGEP 316
>gi|225868140|ref|YP_002744088.1| beta-galactosidase precursor [Streptococcus equi subsp.
zooepidemicus]
gi|225701416|emb|CAW98512.1| putative beta-galactosidase precursor [Streptococcus equi subsp.
zooepidemicus]
Length = 601
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 104/341 (30%), Positives = 151/341 (44%), Gaps = 42/341 (12%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
S ++GR I+S AIHY R P W + K G NT+E+Y+ WN HE G Y
Sbjct: 9 SDQFYLDGRPLQILSGAIHYFRIHPDDWYQSLYNLKALGFNTVETYIPWNLHEAKEGSYD 68
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G+ ++ F+ + QQ +Y I+R P++ AE+ +GG+P WL + +D Y
Sbjct: 69 FSGQLDVEAFLTLAQQLGLYAIVRPSPYICAEWEFGGLPAWLLTKNCHIRSSDPAYLAYV 128
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+ + L+ + + E QGG I++ Q+ENEYG Y G + L A K + ++
Sbjct: 129 RRYYEELLPRLARHE---WQQGGNILMFQLENEYGSY------GEDKAYLTAVKGFMEEH 179
Query: 212 IGVPWIMCQQFDTP------------DPVINTCN------SFYCDQ---FTPHSPSMPKI 250
+ P D P D V T N + D F+ H P +
Sbjct: 180 LSAPLFTA---DGPWRATLRAGSLIEDDVFVTGNFGSRARDNFADMQAFFSEHGKHWPLM 236
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF-- 308
E W GWF + R E++A +V +G N YM+HGGTNFG G
Sbjct: 237 CMEFWDGWFNRWNEPIIKRDPEELADAVMEVLAQGSI--NLYMFHGGTNFGFMNGCSARK 294
Query: 309 -----ITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCE 344
TSYDY+A +DE G P + K L + E
Sbjct: 295 QLDLPQVTSYDYDAILDEAGNPTAKFYAIQKRLTAELSEIE 335
>gi|294665218|ref|ZP_06730516.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 10535]
gi|292605006|gb|EFF48359.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 10535]
Length = 613
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 104/357 (29%), Positives = 160/357 (44%), Gaps = 32/357 (8%)
Query: 4 RTPIAPFALLIFFSSSITYCFAG-----NVTYDSRSLIINGRRELIISAAIHYPRSVPGM 58
RT +AP L + F+ IT A N + +G+ ++S AIH+ R
Sbjct: 3 RTTLAPLVLALAFALPITGAAADTERWPNFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAY 62
Query: 59 WPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGP 118
W +Q+A+ G+NT+E+YVFWN E G++ F G ++ F++ + +ILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVREAAAQGLNIILRPGP 122
Query: 119 FVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIIL 178
+ AE+ GG P WL R+ F Q ++ + + + + L GGPII
Sbjct: 123 YACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALANQV--QPLLNHNGGPIIA 180
Query: 179 AQVENEYGYYESFYGEGGKRYALWAA----KMAVAQNIGVPWIMCQQFDTPDPVINTC-- 232
QVENEYG Y + A++ K + + G + V+N
Sbjct: 181 VQVENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPG 240
Query: 233 -NSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQ---KGGSV 288
D+ P P++ E W GWF +G PH ++ A A F+ + G
Sbjct: 241 EAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWG--KPHAATD--ARQQAEEFEWILRQGHS 296
Query: 289 HNYYMYHGGTNFGRTAGGPF----------ITTSYDYEAPIDEYGLPRNPKWGHLKE 335
+ YM+ GGT+FG G F TTSYDY+A +DE G P PK+ +++
Sbjct: 297 ASLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHP-TPKFALMRD 352
>gi|62955063|ref|NP_001017547.1| beta-galactosidase precursor [Danio rerio]
gi|62089564|gb|AAH92166.1| Galactosidase, beta 1 [Danio rerio]
gi|182890870|gb|AAI65636.1| Glb1 protein [Danio rerio]
Length = 651
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 96/315 (30%), Positives = 154/315 (48%), Gaps = 24/315 (7%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+V Y + +G IS +IHY R W + + G+N I++YV WN HE
Sbjct: 27 SVDYHRNCFLKDGEPFRYISGSIHYSRIPRVYWKDRLLKMYMAGLNAIQTYVPWNFHEAV 86
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PG+Y F G +L +F+++ Q + +I+R GP++ AE++ GG+P WL V R+
Sbjct: 87 PGQYDFSGDRDLEQFLQLCQDIGLLVIMRPGPYICAEWDMGGLPAWLLKKKDIVLRSSDP 146
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYES----FYGEGGKRYALW 202
+ + K+M ++ ++KR GGPII QVENEYG Y + + + + +
Sbjct: 147 DYLAAVDKWMGKLLPIIKR--YLYQNGGPIITVQVENEYGSYFACDFNYMRHLSQLFRFY 204
Query: 203 AAKMAV---AQNIGVPWIMCQQ----FDTPD--PVINTCNSFYCDQFTPHSPSMPKIWTE 253
+ AV G+ ++ C + T D P N +F + P P + +E
Sbjct: 205 LGEEAVLFTTDGAGLGYLKCGSLQGLYATVDFGPGANVTAAFEAQRHV--EPRGPLVNSE 262
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-----RTAGGPF 308
+PGW +G + P+ + ++ + G +V N YM+ GGTNFG T GP
Sbjct: 263 FYPGWLDHWGEKHSVVPTSAVVKTLNEILEIGANV-NLYMFIGGTNFGYWNGANTPYGP- 320
Query: 309 ITTSYDYEAPIDEYG 323
TSYDY++P+ E G
Sbjct: 321 QPTSYDYDSPLTEAG 335
>gi|365860016|ref|ZP_09399844.1| putative beta-galactosidase [Streptomyces sp. W007]
gi|364010544|gb|EHM31456.1| putative beta-galactosidase [Streptomyces sp. W007]
Length = 645
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 101/324 (31%), Positives = 150/324 (46%), Gaps = 41/324 (12%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+ T ++G+ ++S A+HY R W + G+N +E+YV WN HE
Sbjct: 3 DFTVGDDCFRLDGKPVRLLSGALHYFRVHEAQWEHRLAMLAAMGLNCVETYVPWNLHEPR 62
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G+ G L +F+ +++A ++ I+R GP++ AE+ GG+PVW+ G R
Sbjct: 63 EGEVRDVG--ALGRFLDAVERAGLWAIVRPGPYICAEWENGGLPVWVTGRFGRRVRTRDA 120
Query: 147 PFKYHMQK-FMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAK 205
++ +++ F L+ +++R+ S+GGP+IL Q ENEYG Y S Y W A
Sbjct: 121 AYRAVVERWFRELLPQVVQRQ---VSRGGPVILVQAENEYGSYGS-----DAVYLEWLAG 172
Query: 206 MAVAQNIGVPWIMCQQFDTPDPVINTCNSFYC---------------DQFTPHSPSMPKI 250
+ + VP D P+ + T S + H P P +
Sbjct: 173 LLRQCGVTVPLFTS---DGPEDHMLTGGSVPGLLATANFGSGAREGFEVLLRHQPRGPLM 229
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG----G 306
E W GWF +G R E A ++ + G SV N YM HGGTNFG AG G
Sbjct: 230 CMEFWCGWFDHWGAEPVRRDPEQAAGALREVLECGASV-NIYMAHGGTNFGGWAGANRSG 288
Query: 307 PF-------ITTSYDYEAPIDEYG 323
P TSYDY+AP+DEYG
Sbjct: 289 PHQDESFQPTVTSYDYDAPVDEYG 312
>gi|384209874|ref|YP_005595594.1| beta-galactosidase [Brachyspira intermedia PWS/A]
gi|343387524|gb|AEM23014.1| beta-galactosidase [Brachyspira intermedia PWS/A]
Length = 592
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 98/330 (29%), Positives = 149/330 (45%), Gaps = 35/330 (10%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
I+NG+ ++S AIHY R V W + K G NT+E+Y+ WN HE+ G +
Sbjct: 7 KEDFILNGKPIKLLSGAIHYFRFVEEYWEDCLYNLKAAGFNTVETYIPWNIHEIDEGVFD 66
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G ++ FIK+ Q+ + +ILR P++ AE+ +GG+P WL R +TE F
Sbjct: 67 FSGNKDIASFIKLAQKMDLLVILRPTPYICAEWEFGGLPAWLLRYDNMKVRTNTELFLSK 126
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+ + + + L ++ GP+I+ Q+ENEYG + + K Y + V
Sbjct: 127 VDAYYKELFKQIA--DLQITRNGPVIMMQIENEYGSFGN-----DKEYLKALKNLMVKHG 179
Query: 212 IGVP-------WIMCQQFDT--PDPVINTCN-------SFYCDQ--FTPHSPSMPKIWTE 253
VP W + T D ++ T N SF + F P + E
Sbjct: 180 AEVPLFTSDGAWDAVLEAGTLVDDGILATVNFGSQAKESFDATEKFFERKGIKNPLMCME 239
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI---- 309
W GWF + R ++D V ++G N YM+ GGTNFG G
Sbjct: 240 FWDGWFNLWKEPIIKRDADDFIMEVKEIIKRGSI--NLYMFIGGTNFGFYNGTSVTGYTD 297
Query: 310 ---TTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+A + E+G P K+ L++L
Sbjct: 298 FPQITSYDYDAVLTEWGEP-TEKFYKLQKL 326
>gi|319893645|ref|YP_004150520.1| beta-galactosidase 3 [Staphylococcus pseudintermedius HKU10-03]
gi|386318129|ref|YP_006014292.1| glycosyl hydrolase [Staphylococcus pseudintermedius ED99]
gi|317163341|gb|ADV06884.1| Beta-galactosidase 3 [Staphylococcus pseudintermedius HKU10-03]
gi|323463300|gb|ADX75453.1| glycosyl hydrolase, family 35 [Staphylococcus pseudintermedius
ED99]
Length = 590
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 108/330 (32%), Positives = 156/330 (47%), Gaps = 35/330 (10%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
S + +++ + I+S AIHY R W + K G NT+E+YV WN HE +Y
Sbjct: 7 SDTFLLDDKPIKILSGAIHYFRIPKDDWEDSLYNLKALGFNTVETYVPWNFHETIENEYD 66
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G +L FI++ + +Y+I+R P++ AE+ +GG P WL R+ E +
Sbjct: 67 FKGHKDLKHFIELAAKLGLYVIVRPSPYICAEWEFGGFPAWLLNDRTMRIRSRDEKYLEK 126
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
++K+ + ++ L QGGPII+ QVENEYG + + Y A M +
Sbjct: 127 VKKYYHELFKILT--PLQIDQGGPIIMMQVENEYGSFGQDHD-----YLRSLAHMMREEG 179
Query: 212 IGVP-------WIMCQQFDT--PDPVINTCN--SFYCDQF-------TPHSPSMPKIWTE 253
+ VP W C + + D ++ T N S F S P + E
Sbjct: 180 VTVPFFTSDGAWDQCLRAGSLIEDDILPTGNFGSRTVQNFENLKTFQQEFSKKWPLMCME 239
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG-------RTAGG 306
W GWF +G R S+D+A V R K GS+ N YM+HGGTNFG R
Sbjct: 240 FWDGWFNRWGEPVIKRDSDDLAEEV-RDAVKLGSL-NLYMFHGGTNFGFWNGCSARGTKD 297
Query: 307 PFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY AP+DE G P K+ L+E+
Sbjct: 298 LPQVTSYDYHAPLDEAGNP-TEKYFALQEM 326
>gi|195342884|ref|XP_002038028.1| GM17976 [Drosophila sechellia]
gi|194132878|gb|EDW54446.1| GM17976 [Drosophila sechellia]
Length = 672
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 93/318 (29%), Positives = 157/318 (49%), Gaps = 25/318 (7%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+ +++ + +++G+ +S + HY R+VP W ++ + G+N +++YV W+ H
Sbjct: 48 IDHEANTFLLDGQPFRYVSGSFHYFRAVPESWRSRLRTMRASGLNALDTYVEWSLHNPHD 107
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHY-IPGTVFRNDTE 146
G+Y + G +LVKF++I Q+ Y+ILR GP++ AE + GG+P WL P R +
Sbjct: 108 GEYNWEGIADLVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFTKYPSIKMRTNDP 167
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY---ESFYGEGGKRYALWA 203
+ + K+ + M + + LF GG II+ QVENEYG Y + +
Sbjct: 168 NYISEVGKWYAEL--MPRLQHLFVGNGGKIIMVQVENEYGDYACDHDYLNWLRDETEKYV 225
Query: 204 AKMAVAQNIGVP--WIMCQQ----FDTPDPVINTCNSF--YCDQFTPHSPSMPKIWTENW 255
+ A+ + +P + C + F T D I+ N P+ P + +E +
Sbjct: 226 SGKALLFTVDIPNEKMSCGKIENVFATTDFGIDRINEIDKIWAMLRALQPTGPLVNSEFY 285
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF------- 308
PGW + ++ R +++A ++ SV N YM+ GGTNFG TAG +
Sbjct: 286 PGWLTHWQEQNQRRDGQEVANALRTILSYNASV-NLYMFFGGTNFGFTAGANYNLDGGIG 344
Query: 309 ---ITTSYDYEAPIDEYG 323
TSYDY+A +DE G
Sbjct: 345 YAADITSYDYDAVMDEAG 362
>gi|147775416|emb|CAN71703.1| hypothetical protein VITISV_023997 [Vitis vinifera]
Length = 297
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 61/84 (72%), Positives = 76/84 (90%)
Query: 90 YYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFK 149
YYFGG ++L+KF+KI+QQ M++IL IGPFVAA +N+ GIPVWLHY+ GTVFR ++EPFK
Sbjct: 18 YYFGGWYDLLKFVKIVQQDGMWLILHIGPFVAAXWNFXGIPVWLHYVLGTVFRTNSEPFK 77
Query: 150 YHMQKFMTLIVDMMKREKLFASQG 173
YHMQKFMTLIV++MK+EKLFASQG
Sbjct: 78 YHMQKFMTLIVNIMKKEKLFASQG 101
>gi|170034404|ref|XP_001845064.1| beta-galactosidase [Culex quinquefasciatus]
gi|167875697|gb|EDS39080.1| beta-galactosidase [Culex quinquefasciatus]
Length = 650
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 97/326 (29%), Positives = 158/326 (48%), Gaps = 37/326 (11%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+ YD + +++G+ +S + HY R++P W ++ + GG+N ++ YV W+ H
Sbjct: 37 IDYDRDTFVMDGKDFRYVSGSFHYFRALPQTWRSKLRTMRAGGLNAVDLYVQWSLHNPKD 96
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWL-HYIPGTVFR---- 142
+Y + G N+ I+ + +Y+ILR GP++ AE + GG+P WL + PG R
Sbjct: 97 NQYVWDGIANITDVIEAAIEEDLYVILRPGPYICAEIDNGGLPYWLFNKYPGIQVRISDA 156
Query: 143 NDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG-------YYESFYGEG 195
N + K +K M+ + M GGPII+ Q+ENEYG Y + E
Sbjct: 157 NYIKEVKIWYEKLMSQLTPYM------YGNGGPIIMVQLENEYGAFGKCDKQYLNVLKEE 210
Query: 196 GKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYC-DQFTPHS-------PSM 247
++Y A + ++C Q P I T D+ H+ P
Sbjct: 211 TEKYTQGKAVLFTVDRPYDDELVCGQI--PGVFITTDFGLMTDDEVDTHAAKVRSIQPKG 268
Query: 248 PKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG-- 305
P + TE + GW + ++ RP+ +A ++ + + G +V ++YMY GGTNFG AG
Sbjct: 269 PLVNTEFYTGWLTHWQEKNQRRPAGPLAATLRKMLKDGWNV-DFYMYFGGTNFGFWAGAN 327
Query: 306 ----GPFIT--TSYDYEAPIDEYGLP 325
G ++ TSYDY+AP+DE G P
Sbjct: 328 DWGLGKYMADITSYDYDAPMDEAGDP 353
>gi|395816938|ref|XP_003781939.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase [Otolemur
garnettii]
Length = 669
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/313 (31%), Positives = 151/313 (48%), Gaps = 22/313 (7%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+ Y + +G+ IS +IHY R W + + K G+N I++YV WN HE P
Sbjct: 34 IDYSRDRFLKDGQPFRYISGSIHYSRLPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQP 93
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
GKY F ++ FI++ + + +ILR GP++ AE++ GG+P WL + R+
Sbjct: 94 GKYQFSEDHDVEYFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKESMILRSSDPD 153
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY----ESFYGEGGKRYALWA 203
+ + K++ +++ MK L GGPII QVENEYG Y + KR+ +
Sbjct: 154 YLAAVDKWLGVLLPKMK--PLLYQNGGPIISVQVENEYGSYFTCDHDYMRFLLKRFRYYL 211
Query: 204 AKMAV---AQNIGVPWIMCQQ----FDTPD--PVINTCNSFYCDQFTPHSPSMPKIWTEN 254
V I ++ C + T D +N +F + + P P I +E
Sbjct: 212 GDDVVLFTTDGIFEKYLNCGALQGLYATVDFGTGVNITAAFKLQRKS--EPKGPLINSEF 269
Query: 255 WPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--PFIT-- 310
+ GW +G +ED+AFS+ +G SV N YM+ GGTNF G P+
Sbjct: 270 YTGWLDHWGQPHSTVKTEDVAFSLFDILARGASV-NLYMFTGGTNFAYWNGANIPYSAQP 328
Query: 311 TSYDYEAPIDEYG 323
TSYDY+AP+ E G
Sbjct: 329 TSYDYDAPLSEAG 341
>gi|256423546|ref|YP_003124199.1| beta-galactosidase [Chitinophaga pinensis DSM 2588]
gi|256038454|gb|ACU61998.1| Beta-galactosidase [Chitinophaga pinensis DSM 2588]
Length = 610
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 103/337 (30%), Positives = 157/337 (46%), Gaps = 34/337 (10%)
Query: 10 FALLIFFSSSITYCFAGNV-TYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKE 68
LLI FS + + T + +++G+ +IS IHYPR W ++ AK
Sbjct: 7 ITLLIVFSYLFSIAQQQHTFTLGDTAFLLDGKPLQMISGEIHYPRVPRECWRDRMKMAKA 66
Query: 69 GGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGG 128
G+NTI +YVFWN HE G+Y F G ++ F+K+ ++ ++++LR P+V AE+ +GG
Sbjct: 67 MGLNTIGTYVFWNVHEPEKGQYDFSGNNDIAAFVKMAKEEDLWVVLRPSPYVCAEWEFGG 126
Query: 129 IPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGY 187
P WL I G R+ EP +++ + I+ + K+ L + GG I++ Q+ENEYG
Sbjct: 127 YPYWLQEIKGLKVRS-KEP--QYLEAYRNYIMAVGKQLSPLLVTHGGNILMVQIENEYGS 183
Query: 188 Y----------ESFYGEGGKRYALWAAK-MAVAQNIGVPWIM--CQQFDTPDPVINTCNS 234
Y + E G L+ A +N +P ++ D P V N
Sbjct: 184 YSDDKDYLDINRKMFVEAGFDGLLYTCDPKAAIKNGHLPGLLPAINGVDDPLQVKQLINE 243
Query: 235 FYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMY 294
HS P E +P WF +G + P + G S+ N YM+
Sbjct: 244 -------NHSGKGPYYIAEWYPAWFDWWGTKHHTVPYRQYLGKLDSVLAAGISI-NMYMF 295
Query: 295 HGGTNFGRTAGG------PF--ITTSYDYEAPIDEYG 323
HGGT G G P+ +SYDY+AP+DE G
Sbjct: 296 HGGTTRGFMNGANANDADPYEPQISSYDYDAPLDEAG 332
>gi|123788298|sp|Q3UPY5.1|GLBL2_MOUSE RecName: Full=Beta-galactosidase-1-like protein 2; Flags: Precursor
gi|74224567|dbj|BAE25259.1| unnamed protein product [Mus musculus]
Length = 636
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 145/315 (46%), Gaps = 27/315 (8%)
Query: 44 IISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIK 103
I+ +IHY R W + + K G+NT+ +YV WN HE GK+ F G +L FI+
Sbjct: 63 ILGGSIHYFRVPREYWRDRLLKLKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFIQ 122
Query: 104 IIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMM 163
+ + +++ILR GP++ +E + GG+P WL P R F ++ + + M
Sbjct: 123 LAAKIGLWVILRPGPYICSEIDLGGLPSWLLQDPDMKLRTTYHGFTKAVELYFDHL--MS 180
Query: 164 KREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFD 223
+ L GGPII QVENEYG Y + Y + K + I + D
Sbjct: 181 RVVPLQYKHGGPIIAVQVENEYGSYNK-----DRAYMPYIKKALEDRGIIEMLLTSDNKD 235
Query: 224 -----TPDPVINTCNSFYCDQFTPHSPSM-------PKIWTENWPGWFKTFGGRDPHRPS 271
D V+ T N + + + PK+ E W GWF ++GG S
Sbjct: 236 GLEKGVVDGVLATINLQSQQELMALNTVLLSIQGIQPKMVMEYWTGWFDSWGGSHNILDS 295
Query: 272 EDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI------TTSYDYEAPIDEYGLP 325
++ +V+ + G S+ N YM+HGGTNFG G TSYDY+A + E G
Sbjct: 296 SEVLQTVSAIIKDGSSI-NLYMFHGGTNFGFINGAMHFNDYKADVTSYDYDAILTEAG-D 353
Query: 326 RNPKWGHLKELHGAI 340
K+ L+EL G +
Sbjct: 354 YTAKYTKLRELFGTV 368
>gi|350418578|ref|XP_003491903.1| PREDICTED: beta-galactosidase-like [Bombus impatiens]
Length = 646
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 96/340 (28%), Positives = 171/340 (50%), Gaps = 40/340 (11%)
Query: 24 FAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGH 83
F+ V Y++ +++G+ IS + HY R+ W +++ + G+N + +YV WN H
Sbjct: 30 FSFEVDYENNQFLLDGKPFRYISGSFHYFRTPRQYWRDRLKKMRAAGLNAVSTYVEWNLH 89
Query: 84 ELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLH-YIPGTVFR 142
+ + ++++ G ++V+FI I Q+ ++++LR GP++ AE ++GG+P WL +P R
Sbjct: 90 QPTENEWHWTGDADVVEFINIAQEEGLFVLLRPGPYICAERDFGGLPYWLLGRVPDINLR 149
Query: 143 NDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALW 202
+ + +++ ++ ++D K + GGPII+ QVENEYG Y L
Sbjct: 150 TNDPRYMKYVEIYINEVLD--KVQPYLRGNGGPIIMVQVENEYGSYAC------DTEYLI 201
Query: 203 AAKMAVAQNIGVPWIM------------C----QQFDTPDPVINTCNSFYCDQFTPHSPS 246
+ + Q IG ++ C + + T D NT + + + P
Sbjct: 202 RLRDIMRQKIGTKALLYSTDGSNPNMLRCGFVPEVYATVDFGTNTNVTKNFEIMRMYQPR 261
Query: 247 MPKIWTENWPGWFKTFGGRDPHR--PSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTA 304
P + +E +PGW + R+P + + + ++ G SV N YM++GGTNFG TA
Sbjct: 262 GPLVNSEFYPGWLSHW--REPFQRVQTATVTKTLDEMLSLGASV-NIYMFYGGTNFGYTA 318
Query: 305 GG--------PFITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
G P + TSYDY+AP+ E G P PK+ ++ +
Sbjct: 319 GANGGHNAYNPQL-TSYDYDAPLTEAGDP-TPKYFAIRNV 356
>gi|195977873|ref|YP_002123117.1| beta-galactosidase precursor Bga [Streptococcus equi subsp.
zooepidemicus MGCS10565]
gi|195974578|gb|ACG62104.1| beta-galactosidase precursor Bga [Streptococcus equi subsp.
zooepidemicus MGCS10565]
Length = 594
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 102/319 (31%), Positives = 154/319 (48%), Gaps = 45/319 (14%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
++G+ I+S AIHY R P WP ++ Q K G NT+E+Y+ WN HE G++ F G
Sbjct: 12 LDGKPFKILSGAIHYFRIAPDSWPRVLYQLKALGFNTVETYIPWNMHEPRKGQFTFEGIA 71
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
++ F+ + Q+ +Y I+R P++ AE+ +GG+P WL R+ E F H+ +
Sbjct: 72 DVEAFLDLAQEYGLYAIVRPSPYICAEWEFGGLPAWL-LTENCRVRSSDEVFLKHVSDYY 130
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGV-- 214
+++ + + +L GG I++ Q+ENEYG YGE K Y ++ +A+ I
Sbjct: 131 DVLLPKLVKRQL--DNGGNILMFQLENEYGS----YGE-EKDYLRKLKELMLAKGISAPL 183
Query: 215 -----PWI--MCQQFDTPDPVINTCN---------SFYCDQFTPHSPSMPKIWTENWPGW 258
PW+ + D V T N + D F H P + E W GW
Sbjct: 184 FTSDGPWLATLASGSLIDDDVFVTGNFGSNASKQFASMQDFFQAHQKQWPLMCMEFWLGW 243
Query: 259 FKTFGG----RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-------- 306
F + RDP + I ++ + GS+ N YM+ GGTNFG G
Sbjct: 244 FNRWNEPIIRRDPKEAVDAIMEAI-----ELGSI-NLYMFCGGTNFGFMNGSSARLQKDL 297
Query: 307 PFITTSYDYEAPIDEYGLP 325
P I TSYDY+A +DE G P
Sbjct: 298 PQI-TSYDYDALLDEAGNP 315
>gi|374375671|ref|ZP_09633329.1| glycoside hydrolase family 35 [Niabella soli DSM 19437]
gi|373232511|gb|EHP52306.1| glycoside hydrolase family 35 [Niabella soli DSM 19437]
Length = 568
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 100/323 (30%), Positives = 151/323 (46%), Gaps = 35/323 (10%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF-GGR 95
++G+ IIS +H R W +Q K G NTI YV WN E +PGK+ F G
Sbjct: 1 MDGKPFQIISGELHPARIPKEYWKHRIQMTKAMGCNTIAVYVMWNDLETAPGKFDFKTGN 60
Query: 96 FNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKF 155
++ FI++ ++ M+++LR GP+V AE+++GG+P L IP R + + +
Sbjct: 61 HDIAAFIRLCKEEGMWVLLRPGPYVCAEWDFGGLPASLLKIPDLKIRCRDPRYMAAVTGY 120
Query: 156 MTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVP 215
+ + + L + GGPI++ QVENEYG Y + K Y + + I VP
Sbjct: 121 VQHLS--AEVASLQCTNGGPIVMVQVENEYGSYGN-----DKEYLETLRNLWIKNGIRVP 173
Query: 216 WIMCQQFDTPDPVI--------------NTCNSFYCDQFTPHSPSMPKIWTENWPGWFKT 261
+ D P P + + + D+ +P +P +E +PGW
Sbjct: 174 FYTA---DGPTPYMLEAGNIKGAAIGMDSGGDQHAFDEAKKWNPDVPAFSSETYPGWLTH 230
Query: 262 FGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFIT--------TSY 313
+G + S I + F N Y+ HGGTNFG TAG + TSY
Sbjct: 231 WGEKWAQPDSAGIKKEL-EFLLSHKKSFNLYVIHGGTNFGFTAGANAFSPTQYQPDVTSY 289
Query: 314 DYEAPIDEYGLPRNPKWGHLKEL 336
DY+API+E GLP PK+ L+ L
Sbjct: 290 DYDAPINEQGLP-TPKYFMLRNL 311
>gi|294812047|ref|ZP_06770690.1| Beta-galactosidase [Streptomyces clavuligerus ATCC 27064]
gi|326440560|ref|ZP_08215294.1| putative beta-galactosidase [Streptomyces clavuligerus ATCC 27064]
gi|294324646|gb|EFG06289.1| Beta-galactosidase [Streptomyces clavuligerus ATCC 27064]
Length = 582
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 99/321 (30%), Positives = 149/321 (46%), Gaps = 41/321 (12%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
R +++GR ++S A+HY R W + + G+N +E+YV WN HE PG+Y
Sbjct: 8 ERDFLLDGRPVRLLSGALHYFRVHEAQWGHRLAMLRAMGLNCVETYVPWNLHEPEPGRYE 67
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
L +F+ + A ++ I+R GP++ AE+ GG+P WL G R E F
Sbjct: 68 --DPEALGRFLDAARAAGLWAIVRPGPYICAEWENGGLPHWLTGPLGRRTRTADEEFLVP 125
Query: 152 MQK-FMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQ 210
+++ F L+ +++R+ +GGP+++ Q+ENEYG + S RY + A
Sbjct: 126 VERWFARLLPQVVERQ---IDRGGPVLMVQIENEYGSWGS-----DARYLRRIERALRAS 177
Query: 211 NIGVPWIMCQQFDTPDPVINTCNSF---------------YCDQFTPHSPSMPKIWTENW 255
+ VP D P+ + T S H PS P + E W
Sbjct: 178 GLVVPLFTS---DGPEDHMLTGGSVPGALATVNFGSGARAAFGTLRGHRPSGPLMCMEFW 234
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF------- 308
GWF +G R +++ A ++ + G SV N YM HGG+NFG AG
Sbjct: 235 CGWFDHWGDEHAVRDADEAADALREILECGASV-NVYMAHGGSNFGGWAGANRSGEVQDG 293
Query: 309 ----ITTSYDYEAPIDEYGLP 325
TSYDY+APIDE G P
Sbjct: 294 ALEPTATSYDYDAPIDEAGRP 314
>gi|384420175|ref|YP_005629535.1| beta-galactosidase [Xanthomonas oryzae pv. oryzicola BLS256]
gi|353463088|gb|AEQ97367.1| beta-galactosidase [Xanthomonas oryzae pv. oryzicola BLS256]
Length = 613
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 101/347 (29%), Positives = 153/347 (44%), Gaps = 31/347 (8%)
Query: 4 RTPIAPFALLIFFSSSITYCFAGNVTY-----DSRSLIINGRRELIISAAIHYPRSVPGM 58
RT +AP L + F+ +T A + + +G+ ++S AIH+ R
Sbjct: 3 RTTLAPLVLALTFALPVTAAAADTERWPDFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAY 62
Query: 59 WPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGP 118
W +Q+A+ G+NT+E+YVFWN E G++ F G ++ F++ + +ILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVQEAAAQGLNVILRPGP 122
Query: 119 FVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIIL 178
+ AE+ GG P WL R+ F Q ++ + + + L GGPII
Sbjct: 123 YACAEWEAGGYPAWLFGQGNIRVRSRDPRFLAASQAYLDAVAKQV--QPLLNHNGGPIIA 180
Query: 179 AQVENEYGYYESFYGEGGKRYALWAA----KMAVAQNIGVPWIMCQQFDTPDPVINTC-- 232
QVENEYG Y + A++ K + + G + V+N
Sbjct: 181 VQVENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGAEMLANGTLPDTLAVVNFAPG 240
Query: 233 -NSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQ---KGGSV 288
D+ P P++ E W GWF +G PH ++ A A F+ + G
Sbjct: 241 EAKSAFDKLIAFRPDQPRMVGEYWAGWFDHWG--KPHAATD--ATQQAEEFEWILRQGHS 296
Query: 289 HNYYMYHGGTNFGRTAGGPF----------ITTSYDYEAPIDEYGLP 325
N YM+ GGT+FG G F TTSYDY+A +DE G P
Sbjct: 297 ANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAIVDEAGRP 343
>gi|395541292|ref|XP_003772579.1| PREDICTED: beta-galactosidase [Sarcophilus harrisii]
Length = 673
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 94/315 (29%), Positives = 152/315 (48%), Gaps = 26/315 (8%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+ Y+ + +G+ IS +IHY R W + + K G+N IE+YV WN HE P
Sbjct: 63 IDYEGDQFLKDGKPFRYISGSIHYSRIPRFYWKDRLFKMKMAGLNAIETYVPWNFHEPFP 122
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G+Y F G +L F++++ + + +ILR GP++ AE++ GG+PVWL R+
Sbjct: 123 GQYQFSGEQDLEYFLQLVHEVGLLVILRPGPYICAEWDMGGLPVWLLEKKSIFLRSSDPD 182
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYG-YYESFYGEGGKRYALWAAKM 206
+ + K++ +++ MK GGPII QVENEYG Y+ Y R+ L +
Sbjct: 183 YLKAVDKWLEVLLPKMK--PYLYQNGGPIITVQVENEYGSYFACDYNY--LRFLLKVFRQ 238
Query: 207 AVAQNI--------GVPWIMCQQFDTPDPVI------NTCNSFYCDQFTPHSPSMPKIWT 252
+ + + G ++ C + N +F + P P + +
Sbjct: 239 HLGEEVVLFTTDGAGENYLKCGTLQDLYATVDFGTSSNITQAFMIQRKV--EPKGPLVNS 296
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG--RTAGGPFI- 309
E + GW +G +++I S+ +G +V N YM+ GGTNFG A P++
Sbjct: 297 EFYTGWLDHWGESHQTVSTKNIVASLTDMLSRGANV-NLYMFIGGTNFGFWNGANMPYLP 355
Query: 310 -TTSYDYEAPIDEYG 323
TSYDY+AP+ E G
Sbjct: 356 QPTSYDYDAPLSEAG 370
>gi|58581392|ref|YP_200408.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae KACC 10331]
gi|58425986|gb|AAW75023.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae KACC 10331]
Length = 651
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 101/347 (29%), Positives = 153/347 (44%), Gaps = 31/347 (8%)
Query: 4 RTPIAPFALLIFFSSSITYCFAGNVTY-----DSRSLIINGRRELIISAAIHYPRSVPGM 58
RT +AP L + F+ +T A + + +G+ ++S AIH+ R
Sbjct: 41 RTTLAPLVLALAFALPVTAAAADTERWPDFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAY 100
Query: 59 WPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGP 118
W +Q+A+ G+NT+E+YVFWN E G++ F G ++ F++ + +ILR GP
Sbjct: 101 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVQEAAAQGLNVILRPGP 160
Query: 119 FVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIIL 178
+ AE+ GG P WL R+ F Q ++ + + + L GGPII
Sbjct: 161 YACAEWEAGGYPAWLFGQGNIRVRSRDPRFLAASQAYLDAVAKQV--QPLLNHNGGPIIA 218
Query: 179 AQVENEYGYYESFYGEGGKRYALWAA----KMAVAQNIGVPWIMCQQFDTPDPVINTC-- 232
QVENEYG Y + A++ K + + G + V+N
Sbjct: 219 VQVENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPG 278
Query: 233 -NSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQ---KGGSV 288
D+ P P++ E W GWF +G PH ++ A A F+ + G
Sbjct: 279 EAKSAFDKLIAFRPDQPRMVGEYWAGWFDHWG--KPHAATD--ATQQAEEFEWILRQGHS 334
Query: 289 HNYYMYHGGTNFGRTAGGPF----------ITTSYDYEAPIDEYGLP 325
N YM+ GGT+FG G F TTSYDY+A +DE G P
Sbjct: 335 ANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAIVDEAGRP 381
>gi|313237466|emb|CBY12653.1| unnamed protein product [Oikopleura dioica]
Length = 948
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 90/307 (29%), Positives = 146/307 (47%), Gaps = 40/307 (13%)
Query: 59 WPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGP 118
W +Q + G+NTI+ Y+ WN HE G + FGG +LV+F I + + ++ R GP
Sbjct: 25 WKHRLQSVVDCGLNTIDVYIPWNLHEKERGNFDFGGELDLVEFFTIAAEMGLKVLCRPGP 84
Query: 119 FVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIIL 178
++ +E+++GG+P WL P R++ ++ + + + ++ ++ L S GGPII
Sbjct: 85 YICSEWDWGGLPSWLLKDPKMHIRSNYCGYQAAVSSYFSKLLPLLA--PLQHSNGGPIIA 142
Query: 179 AQVENEYGYY---------------------ESFYGEGGKRYALWAAKM--AVAQNIGVP 215
QVENEYG Y E F+ G+ L KM + + I
Sbjct: 143 FQVENEYGDYVDKDNEHLPWLADLMKSHGLFELFFISDGEGVILGGYKMPQNLLKTINFK 202
Query: 216 WIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIA 275
++ ++ P+ + + Q P+ P + TE W GWF +G ++
Sbjct: 203 YLNVEKLTKSTPICDNLQALKSLQ-----PNKPMLVTEFWAGWFDYWGHGRNLLNNDVFE 257
Query: 276 FSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI--------TTSYDYEAPIDEYGLPRN 327
++ ++G SV N+YM+HGGTNFG G + TSYDY+ P+DE G R
Sbjct: 258 KTLKEILKRGASV-NFYMFHGGTNFGFMNGAIELEKGYYTADVTSYDYDCPVDESG-NRT 315
Query: 328 PKWGHLK 334
KW +K
Sbjct: 316 EKWEIIK 322
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 49/94 (52%), Gaps = 10/94 (10%)
Query: 245 PSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTA 304
P+ P + TE W GWF +G +E ++ ++G SV N+YM+HGGTNFG
Sbjct: 556 PNKPMLVTEFWAGWFDYWGHGRNLLNNEVFEKTLKEILKRGASV-NFYMFHGGTNFGFMN 614
Query: 305 GGPFI--------TTSYDYEAPIDEYGLPRNPKW 330
G + TSYDY+ P+DE G R KW
Sbjct: 615 GAIELEKGYYTADVTSYDYDCPVDESG-NRTEKW 647
>gi|336424850|ref|ZP_08604882.1| hypothetical protein HMPREF0994_00888 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336013315|gb|EGN43197.1| hypothetical protein HMPREF0994_00888 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 596
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 99/316 (31%), Positives = 148/316 (46%), Gaps = 43/316 (13%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF---- 92
+NG IIS IHY R +P W +Q+ KE G NT+E+Y+ WN HE GK+ F
Sbjct: 16 LNGEPFQIISGGIHYFRILPEYWEDRLQKLKELGCNTVETYIPWNMHEPVKGKFDFYGEH 75
Query: 93 -GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
G ++V F++ Q+ +++ILR P++ AE+++GG+P WL R E + H
Sbjct: 76 VHGMLDVVSFVRTAQRLGLWVILRPSPYICAEWDFGGLPFWLMAGEEMDLRTSDERYLRH 135
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
++ + ++ ++ L QGGP+++ QVENEYG + + K+Y M +
Sbjct: 136 VRDYYDRLMPLLA--PLQIDQGGPVLMLQVENEYGSFGN-----DKKYLESLRDMMRERG 188
Query: 212 IGVPWIMCQQFDTPD-------------PVIN----TCNSF-YCDQFTPHSPSMPKIWTE 253
I VP D PD P N +F +++T P M TE
Sbjct: 189 ITVPLFAS---DGPDHNMLANTKTEGIFPTANFGSGASKAFSILEEYTDGGPCM---CTE 242
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI---- 309
W GWF + H + A + G+V N YM+ GGTNFG G +
Sbjct: 243 FWIGWFDAWHDEVHHEGDTETAVKELENILELGNV-NIYMFEGGTNFGFMNGSNYSDHLT 301
Query: 310 --TTSYDYEAPIDEYG 323
TSYDY+A + E G
Sbjct: 302 ADVTSYDYDALLTEDG 317
>gi|157106609|ref|XP_001649402.1| beta-galactosidase [Aedes aegypti]
gi|108879821|gb|EAT44046.1| AAEL004575-PA [Aedes aegypti]
Length = 648
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 101/363 (27%), Positives = 168/363 (46%), Gaps = 48/363 (13%)
Query: 13 LIFFSSSITYCFAGN-------------VTYDSRSLIINGRRELIISAAIHYPRSVPGMW 59
L+F + ++ C+ N + Y++ + +++G I+ + HY R++P W
Sbjct: 8 LLFTAIAVVLCYHVNGQRLLDNRQRTFTIDYENNTFLLDGAPFQYIAGSFHYFRALPQAW 67
Query: 60 PGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPF 119
+++ + G+N + +YV W+ H G Y + G ++ +F+++ Q + +ILR GP+
Sbjct: 68 GPILKSMRAAGLNAVTTYVEWSLHNPKKGVYNWDGMADIERFVQLAQNEDLLVILRPGPY 127
Query: 120 VAAEYNYGGIPVW-LHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKR-EKLFASQGGPII 177
+ AE + GG P W L+ PG R + +++ T ++ R E F GGPII
Sbjct: 128 ICAERDMGGFPYWLLNKYPGIQLRTADVAY---LREVRTWYAELFSRLEPYFYGNGGPII 184
Query: 178 LAQVENEYGY-------YESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVIN 230
+ QVENEYG Y + + +RY K + N G C D V++
Sbjct: 185 MVQVENEYGSFFACDYKYMKWLRDETERYV--RGKAVLFTNNGPGLTQCGGIDG---VLS 239
Query: 231 TCN---------SFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARF 281
T + Y P P + E +PGW + + R + + R+
Sbjct: 240 TLDFGPGTALEIDGYWKDLRKLQPKGPLVNAEYYPGWLTHWQEQQMARSPIEPVVTSLRY 299
Query: 282 FQKGGSVHNYYMYHGGTNFGRTAG------GPFI--TTSYDYEAPIDEYGLPRNPKWGHL 333
N YM++GGTNFG TAG G FI TSYDY+AP+DE G P PK+ +
Sbjct: 300 MLSSKVNVNIYMFYGGTNFGFTAGANEQGPGRFIPDITSYDYDAPLDESGDP-TPKYEAI 358
Query: 334 KEL 336
+++
Sbjct: 359 RKV 361
>gi|253755017|ref|YP_003028157.1| beta-galactosidase [Streptococcus suis BM407]
gi|251817481|emb|CAZ55222.1| putative beta-galactosidase precursor [Streptococcus suis BM407]
Length = 590
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 107/343 (31%), Positives = 159/343 (46%), Gaps = 38/343 (11%)
Query: 30 YDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGK 89
Y ++G I+S AIHY R P W + K G NT+E+YV WN HE G+
Sbjct: 5 YIGDQFYLDGEPFKILSGAIHYFRVHPDDWYHSLYNLKALGFNTVETYVPWNMHEPRKGE 64
Query: 90 YYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFK 149
+ + G ++ +F+K+ Q+ +Y I+R P++ AE+ +GG+P WL V +D+ +
Sbjct: 65 FCYEGILDIERFLKLAQELGLYAIVRPSPYICAEWEWGGLPAWLMKEELRVRSSDSVYLQ 124
Query: 150 YHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVA 209
+ + +++LI K KL +QGG +++ QVENEYG YGE K Y A +
Sbjct: 125 HLDEYYVSLIP---KLAKLQLAQGGNVLMFQVENEYGS----YGE-EKAYLRAVAGLMRK 176
Query: 210 QNIGVP-------WIMCQQFDT--PDPVINTCN---------SFYCDQFTPHSPSMPKIW 251
+ P W + T D V T N + F H + P +
Sbjct: 177 HGLTAPLFTSDGSWRATLRAGTLIEDDVFVTGNFGSKARENFANMTAFFNEHQKNWPLMC 236
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--- 308
E W GWF +G R E++ SV + G N YM+HGGTNFG G
Sbjct: 237 MEFWDGWFNRWGDEIIRREPEEMVDSVMECIELGSL--NLYMFHGGTNFGFMNGCSARGQ 294
Query: 309 ----ITTSYDYEAPIDEYGLPRNPKW---GHLKELHGAIKLCE 344
TSYDY+A +DE G P + LKE++ ++ E
Sbjct: 295 IDLPQVTSYDYDAILDEAGNPTKKFYILQQRLKEVYPELEYAE 337
>gi|163790001|ref|ZP_02184436.1| glycosyl hydrolase, family 35 [Carnobacterium sp. AT7]
gi|159874701|gb|EDP68770.1| glycosyl hydrolase, family 35 [Carnobacterium sp. AT7]
Length = 595
Score = 136 bits (343), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 99/329 (30%), Positives = 155/329 (47%), Gaps = 35/329 (10%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG IIS AIHY R +P W + K G NT+E+Y+ WN HE +Y F
Sbjct: 8 EEFLLNGEPFKIISGAIHYFRILPEDWYHSLYNLKALGFNTVETYIPWNVHETKEREYDF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G+ ++ +F++ ++ +++ILR P++ AE+ +GG+P WL R+ F +
Sbjct: 68 SGQLDIQRFVQTAKELGLFVILRPSPYICAEWEFGGLPAWLLTYKNMRIRSSDPQFIEKV 127
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
+ + + + L + GGP+I+ Q+ENEYG YGE K Y ++ + +
Sbjct: 128 SSYYKKLFEQIV--PLQVTSGGPVIMMQLENEYGS----YGE-DKEYLKTLYELMLELGV 180
Query: 213 GVP-------WIMCQQFDT-PDPVINTCNSFYCDQ----------FTPHSPSMPKIWTEN 254
VP W Q+ T D I T +F + P + E
Sbjct: 181 TVPIFTSDGAWKATQEAGTMTDLDILTTGNFGSQSKENFKNLKEFHESKGKNWPLMCMEY 240
Query: 255 WPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG-----GPFI 309
W GWF + R ++D+ V + G N YM+HGGTNFG G G +
Sbjct: 241 WGGWFNRWNDPIIKRDAQDLTNDVKEALKIGSL--NLYMFHGGTNFGFMNGCSARLGKDL 298
Query: 310 --TTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+AP++E G P N K+ L+++
Sbjct: 299 PQLTSYDYDAPLNEQGNPTN-KYDSLQKM 326
>gi|417092513|ref|ZP_11957129.1| Beta-galactosidase [Streptococcus suis R61]
gi|353532192|gb|EHC01864.1| Beta-galactosidase [Streptococcus suis R61]
Length = 590
Score = 136 bits (343), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 107/343 (31%), Positives = 158/343 (46%), Gaps = 38/343 (11%)
Query: 30 YDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGK 89
Y ++G I+S AIHY R P W + K G NT+E+YV WN HE G+
Sbjct: 5 YIGDQFYLDGEPFKILSGAIHYFRVHPDDWHHSLYNLKALGFNTVETYVPWNMHEPRKGE 64
Query: 90 YYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFK 149
+ + G ++ +F+K+ Q+ +Y I+R P++ AE+ +GG+P WL V +D+ +
Sbjct: 65 FCYEGILDIERFLKLAQELGLYAIVRPSPYICAEWEWGGLPAWLMKEELRVRSSDSVYLQ 124
Query: 150 YHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVA 209
+ + + +LI K KL +QGG +++ QVENEYG YGE K Y A +
Sbjct: 125 HLDEYYASLIP---KLAKLQLAQGGNVLMFQVENEYGS----YGE-EKEYLRSVAGLMRK 176
Query: 210 QNIGVP-------WIMCQQFDT--PDPVINTCN---------SFYCDQFTPHSPSMPKIW 251
+ P W + T D V T N + F H + P +
Sbjct: 177 HGLTAPLFTSDGSWRATLRAGTLIEDDVFVTGNFGSKARENFANMTAFFNEHQKNWPLMC 236
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--- 308
E W GWF +G R E++ SV + G N YM+HGGTNFG G
Sbjct: 237 MEFWDGWFNRWGDEIIRREPEEMVDSVMECIELGSL--NLYMFHGGTNFGFMNGCSARGQ 294
Query: 309 ----ITTSYDYEAPIDEYGLPRNPKW---GHLKELHGAIKLCE 344
TSYDY+A +DE G P + LKE++ ++ E
Sbjct: 295 IDLPQVTSYDYDAILDEAGNPTKKFYILQQRLKEVYPELEYAE 337
>gi|301763006|ref|XP_002916929.1| PREDICTED: beta-galactosidase-1-like protein 3-like [Ailuropoda
melanoleuca]
Length = 1209
Score = 136 bits (343), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 92/305 (30%), Positives = 145/305 (47%), Gaps = 28/305 (9%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
+ G + LI +IHY R W + + K G NT+ +YV WN HE GK+ F
Sbjct: 499 LGGHKFLIFGGSIHYFRVPREYWRDRLMKLKACGFNTLTTYVPWNLHEPERGKFDFSENL 558
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
+L F+ + + +++ILR GP++ +E + GG+P WL P + R + F + K+
Sbjct: 559 DLEAFVLMAAEIGLWVILRPGPYICSEIDLGGLPSWLLQDPEMILRTTYKGFVEAVDKYF 618
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPW 216
++ + L +GGPII QVENEYG + K Y + K + + G+
Sbjct: 619 DHLIS--RVVPLQYHKGGPIIAVQVENEYGSFAV-----DKDYMPYVRKALLER--GIVE 669
Query: 217 IMCQQFDTPDPV------------INTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
++ D + +NT +Q + + P + E W GWF T+GG
Sbjct: 670 LLVTSDDAENLQKGYLEGVLATINMNTFEKSAFEQLSQLQRNKPIMVMEYWVGWFDTWGG 729
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF------ITTSYDYEAP 318
+ +ED+ +V++F S N YM+HGGTNFG G + + TSYDY+A
Sbjct: 730 KHMVNNAEDVEETVSKFITSEISF-NVYMFHGGTNFGFMNGATYFGIHRAVVTSYDYDAL 788
Query: 319 IDEYG 323
+ E G
Sbjct: 789 LTEAG 793
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 52/198 (26%), Positives = 82/198 (41%), Gaps = 38/198 (19%)
Query: 14 IFFSSSITYCFAGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNT 73
+F + S + + S ++G LII+ IHY R W + + K G NT
Sbjct: 35 VFLTPSHMMNRKEGLNVEGSSFTLDGSPFLIIAGTIHYFRVPREYWRDRLMKLKACGFNT 94
Query: 74 IESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWL 133
+ + F+ + +++IL GP++ ++ + GG+P WL
Sbjct: 95 VTT-----------------------AFVAMASDVGLWVILCPGPYIGSDLDLGGLPSWL 131
Query: 134 HYIPG----TVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE 189
P T +R T+ + K + IV +L +GGPII QVENEYG Y
Sbjct: 132 LRDPKMKLRTTYRGFTKAVNLYFDKIIPKIV------QLQYGKGGPIIALQVENEYGSYH 185
Query: 190 SFYGEGGKRYALWAAKMA 207
KRY + K+A
Sbjct: 186 Q-----DKRYMPYIKKLA 198
>gi|170782982|ref|YP_001711316.1| beta-galactosidase [Clavibacter michiganensis subsp. sepedonicus]
gi|169157552|emb|CAQ02748.1| beta-galactosidase [Clavibacter michiganensis subsp. sepedonicus]
Length = 615
Score = 136 bits (343), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 91/307 (29%), Positives = 146/307 (47%), Gaps = 26/307 (8%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
++GR +I+ A+HY R P W +++A+ G++TIE+YV WN H G +
Sbjct: 37 LDGRPHRVIAGALHYFRVHPDQWADRIRKARLMGLDTIETYVAWNAHSPERGTFDTSAGL 96
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
+L +F+ ++ M+ I+R GP++ AE++ GG+P WL P R + + +F+
Sbjct: 97 DLGRFLDLVHAEGMHAIVRPGPYICAEWDGGGLPGWLFGDPAVGVRRSEPLYLAAVDEFL 156
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPW 216
+ +++ ++ GGP+IL Q+ENEYG YG+ + Y + I VP
Sbjct: 157 RRVYEIVAPRQI--DMGGPVILVQIENEYGA----YGDDAE-YLRHLVDLTRESGIIVPL 209
Query: 217 IMCQQFDTPDPVINTCNSFY------------CDQFTPHSPSMPKIWTENWPGWFKTFGG 264
Q + + + + H + P + +E W GWF + G
Sbjct: 210 TTVDQPTDEMLSRGSLDELHRTGSFGSRAAERLETLRRHQRTGPLMCSEFWDGWFDHW-G 268
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG----GPFIT--TSYDYEAP 318
H S A + G+ N YM+HGGTNFG T G G + + TSYDY+AP
Sbjct: 269 EHHHTTSAADAAAELDALLAAGASVNIYMFHGGTNFGFTNGANHKGTYQSHVTSYDYDAP 328
Query: 319 IDEYGLP 325
+DE G P
Sbjct: 329 LDETGSP 335
>gi|84623327|ref|YP_450699.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae MAFF 311018]
gi|188577369|ref|YP_001914298.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae PXO99A]
gi|84367267|dbj|BAE68425.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae MAFF 311018]
gi|188521821|gb|ACD59766.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae PXO99A]
Length = 613
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 101/347 (29%), Positives = 153/347 (44%), Gaps = 31/347 (8%)
Query: 4 RTPIAPFALLIFFSSSITYCFAGNVTY-----DSRSLIINGRRELIISAAIHYPRSVPGM 58
RT +AP L + F+ +T A + + +G+ ++S AIH+ R
Sbjct: 3 RTTLAPLVLALAFALPVTAAAADTERWPDFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAY 62
Query: 59 WPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGP 118
W +Q+A+ G+NT+E+YVFWN E G++ F G ++ F++ + +ILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVQEAAAQGLNVILRPGP 122
Query: 119 FVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIIL 178
+ AE+ GG P WL R+ F Q ++ + + + L GGPII
Sbjct: 123 YACAEWEAGGYPAWLFGQGNIRVRSRDPRFLAASQAYLDAVAKQV--QPLLNHNGGPIIA 180
Query: 179 AQVENEYGYYESFYGEGGKRYALWAA----KMAVAQNIGVPWIMCQQFDTPDPVINTC-- 232
QVENEYG Y + A++ K + + G + V+N
Sbjct: 181 VQVENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPG 240
Query: 233 -NSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQ---KGGSV 288
D+ P P++ E W GWF +G PH ++ A A F+ + G
Sbjct: 241 EAKSAFDKLIAFRPDQPRMVGEYWAGWFDHWG--KPHAATD--ATQQAEEFEWILRQGHS 296
Query: 289 HNYYMYHGGTNFGRTAGGPF----------ITTSYDYEAPIDEYGLP 325
N YM+ GGT+FG G F TTSYDY+A +DE G P
Sbjct: 297 ANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAIVDEAGRP 343
>gi|146318103|ref|YP_001197815.1| beta-galactosidase [Streptococcus suis 05ZYH33]
gi|146320284|ref|YP_001199995.1| Beta-galactosidase [Streptococcus suis 98HAH33]
gi|253751293|ref|YP_003024434.1| beta-galactosidase precursor [Streptococcus suis SC84]
gi|253753194|ref|YP_003026334.1| beta-galactosidase precursor [Streptococcus suis P1/7]
gi|386577401|ref|YP_006073806.1| beta-galactosidase [Streptococcus suis GZ1]
gi|386579383|ref|YP_006075788.1| beta-galactosidase [Streptococcus suis JS14]
gi|386581447|ref|YP_006077851.1| beta-galactosidase [Streptococcus suis SS12]
gi|386587678|ref|YP_006084079.1| beta-galactosidase [Streptococcus suis A7]
gi|403061087|ref|YP_006649303.1| beta-galactosidase [Streptococcus suis S735]
gi|145688909|gb|ABP89415.1| Beta-galactosidase [Streptococcus suis 05ZYH33]
gi|145691090|gb|ABP91595.1| Beta-galactosidase [Streptococcus suis 98HAH33]
gi|251815582|emb|CAZ51165.1| putative beta-galactosidase precursor [Streptococcus suis SC84]
gi|251819439|emb|CAR44926.1| putative beta-galactosidase precursor [Streptococcus suis P1/7]
gi|292557863|gb|ADE30864.1| Beta-galactosidase [Streptococcus suis GZ1]
gi|319757575|gb|ADV69517.1| Beta-galactosidase [Streptococcus suis JS14]
gi|353733593|gb|AER14603.1| Beta-galactosidase [Streptococcus suis SS12]
gi|354984839|gb|AER43737.1| Beta-galactosidase [Streptococcus suis A7]
gi|402808413|gb|AFQ99904.1| beta-galactosidase [Streptococcus suis S735]
Length = 590
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 107/343 (31%), Positives = 159/343 (46%), Gaps = 38/343 (11%)
Query: 30 YDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGK 89
Y ++G I+S AIHY R P W + K G NT+E+YV WN HE G+
Sbjct: 5 YIGDQFYLDGEPFKILSGAIHYFRVHPDDWYHSLYNLKALGFNTVETYVPWNMHEPRKGE 64
Query: 90 YYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFK 149
+ + G ++ +F+K+ Q+ +Y I+R P++ AE+ +GG+P WL V +D+ +
Sbjct: 65 FCYEGILDIERFLKLAQELGLYAIVRPSPYICAEWEWGGLPAWLMKEELRVRSSDSVYLQ 124
Query: 150 YHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVA 209
+ + +++LI K KL +QGG +++ QVENEYG YGE K Y A +
Sbjct: 125 HLDEYYVSLIP---KLAKLQLAQGGNVLMFQVENEYGS----YGE-EKAYLRAVAGLMRK 176
Query: 210 QNIGVP-------WIMCQQFDT--PDPVINTCN---------SFYCDQFTPHSPSMPKIW 251
+ P W + T D V T N + F H + P +
Sbjct: 177 HGLTAPLFTSDGSWRATLRAGTLIEDDVFVTGNFGSKARENFANMTAFFNEHQKNWPLMC 236
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--- 308
E W GWF +G R E++ SV + G N YM+HGGTNFG G
Sbjct: 237 MEFWDGWFNRWGDEIIRREPEEMVDSVMECIELGSL--NLYMFHGGTNFGFMNGCSARGQ 294
Query: 309 ----ITTSYDYEAPIDEYGLPRNPKW---GHLKELHGAIKLCE 344
TSYDY+A +DE G P + LKE++ ++ E
Sbjct: 295 IDLPQVTSYDYDAILDEAGNPTKKFYILQQRLKEVYPELEYAE 337
>gi|254675347|ref|NP_083286.1| beta-galactosidase-1-like protein precursor [Mus musculus]
gi|81879201|sp|Q8VC60.1|GLB1L_MOUSE RecName: Full=Beta-galactosidase-1-like protein; Flags: Precursor
gi|18256820|gb|AAH21773.1| Glb1l protein [Mus musculus]
gi|148667965|gb|EDL00382.1| mCG133890 [Mus musculus]
Length = 646
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 101/316 (31%), Positives = 148/316 (46%), Gaps = 24/316 (7%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V + +++G +S ++HY R P +W + + + G+N ++ YV WN HE P
Sbjct: 28 VDREHDRFLLDGVPFRYVSGSLHYFRVPPVLWADRLLKMQLSGLNAVQFYVPWNYHEPEP 87
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G Y F G +L+ F+ + + +ILR GP++ AE+ GG+P WL P R
Sbjct: 88 GIYNFNGSRDLIAFLNEAAKVNLLVILRPGPYICAEWEMGGLPSWLLRNPNIHLRTSDPA 147
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYES-----FYGEGGKRYALW 202
F + + +++ K GG II QVENEYG Y++ G AL
Sbjct: 148 FLEAVDSWFKVLLP--KIYPFLYHNGGNIISIQVENEYGSYKACDFKYMRHLAGLFRALL 205
Query: 203 AAKMAVAQNIGVPWIMCQQ----FDTPD--PVINTCNSF-YCDQFTPHSPSMPKIWTENW 255
K+ + G + C + T D P N F ++ PH P + +E +
Sbjct: 206 GDKILLFTTDGPHGLRCGSLQGLYTTIDFGPADNVTRIFSLLREYEPHG---PLVNSEYY 262
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG----GPF--I 309
GW +G R S +A + + + G SV N YM+HGGTNFG G G F I
Sbjct: 263 TGWLDYWGQNHSTRSSPAVAQGLEKMLKLGASV-NMYMFHGGTNFGYWNGADEKGRFLPI 321
Query: 310 TTSYDYEAPIDEYGLP 325
TTSYDY+API E G P
Sbjct: 322 TTSYDYDAPISEAGDP 337
>gi|24418925|ref|NP_722498.1| beta-galactosidase-1-like protein 2 [Mus musculus]
gi|23512349|gb|AAH38479.1| Galactosidase, beta 1-like 2 [Mus musculus]
gi|148693361|gb|EDL25308.1| cDNA sequence BC038479, isoform CRA_b [Mus musculus]
Length = 652
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 97/319 (30%), Positives = 146/319 (45%), Gaps = 35/319 (10%)
Query: 44 IISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIK 103
I+ +IHY R W + + K G+NT+ +YV WN HE GK+ F G +L FI+
Sbjct: 79 ILGGSIHYFRVPREYWRDRLLKLKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFIQ 138
Query: 104 IIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND----TEPFKYHMQKFMTLI 159
+ + +++ILR GP++ +E + GG+P WL P R T+ + M+ +
Sbjct: 139 LAAKIGLWVILRPGPYICSEIDLGGLPSWLLQDPDMKLRTTYHGFTKAVDLYFDHLMSRV 198
Query: 160 VDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMC 219
V + + GGPII QVENEYG Y + Y + K + I +
Sbjct: 199 VPLQYK------HGGPIIAVQVENEYGSYNK-----DRAYMPYIKKALEDRGIIEMLLTS 247
Query: 220 QQFD-----TPDPVINTCNSFYCDQFTPHSPSM-------PKIWTENWPGWFKTFGGRDP 267
D D V+ T N + + + PK+ E W GWF ++GG
Sbjct: 248 DNKDGLEKGVVDGVLATINLQSQQELMALNTVLLSIQGIQPKMVMEYWTGWFDSWGGSHN 307
Query: 268 HRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI------TTSYDYEAPIDE 321
S ++ +V+ + G S+ N YM+HGGTNFG G TSYDY+A + E
Sbjct: 308 ILDSSEVLQTVSAIIKDGSSI-NLYMFHGGTNFGFINGAMHFNDYKADVTSYDYDAILTE 366
Query: 322 YGLPRNPKWGHLKELHGAI 340
G K+ L+EL G +
Sbjct: 367 AG-DYTAKYTKLRELFGTV 384
>gi|26325854|dbj|BAC26681.1| unnamed protein product [Mus musculus]
Length = 646
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 101/316 (31%), Positives = 148/316 (46%), Gaps = 24/316 (7%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V + +++G +S ++HY R P +W + + + G+N ++ YV WN HE P
Sbjct: 28 VDREHDRFLLDGVPFRYVSGSLHYFRVPPVLWADRLLKMQLSGLNAVQFYVPWNYHEPEP 87
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G Y F G +L+ F+ + + +ILR GP++ AE+ GG+P WL P R
Sbjct: 88 GIYNFNGSRDLIAFLNEAAKVNLLVILRPGPYICAEWEMGGLPSWLLRNPNIHLRTSDPA 147
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYES-----FYGEGGKRYALW 202
F + + +++ K GG II QVENEYG Y++ G AL
Sbjct: 148 FLEAVDSWFKVLLP--KIYPFLYHNGGNIISIQVENEYGSYKACDFKYMRHLAGLFRALL 205
Query: 203 AAKMAVAQNIGVPWIMCQQ----FDTPD--PVINTCNSF-YCDQFTPHSPSMPKIWTENW 255
K+ + G + C + T D P N F ++ PH P + +E +
Sbjct: 206 GDKILLFTTDGPHGLRCGSLQGLYTTIDFGPADNVTRIFSLLREYEPHG---PLVNSEYY 262
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG----GPF--I 309
GW +G R S +A + + + G SV N YM+HGGTNFG G G F I
Sbjct: 263 TGWLDYWGQNHSTRSSPAVAQGLEKMLKLGASV-NMYMFHGGTNFGYWNGADEKGRFLPI 321
Query: 310 TTSYDYEAPIDEYGLP 325
TTSYDY+API E G P
Sbjct: 322 TTSYDYDAPISEAGDP 337
>gi|327260596|ref|XP_003215120.1| PREDICTED: beta-galactosidase-like [Anolis carolinensis]
Length = 679
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 95/322 (29%), Positives = 151/322 (46%), Gaps = 27/322 (8%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
++ Y + + +G + IS +IHY R W + + G+N ++ Y+ WN HE
Sbjct: 72 SIDYTDKCFLKDGVKFRYISGSIHYFRIPRAYWKDRLLKMYMSGLNAVQIYIPWNYHEPL 131
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G Y F G +L F+ + + +ILR GP++ AE+ GGIP WL P + R
Sbjct: 132 SGVYNFDGDRDLEGFLDLAANFDLLVILRPGPYICAEWEMGGIPSWLLAKPNIILRTSDP 191
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYES------------FYGE 194
F + K+ ++++ +K GG II QVENEYG Y + F
Sbjct: 192 DFLQAVDKWFSVLLPKIKPH--LYINGGNIISVQVENEYGSYYACDYDYLRHLEAVFRSY 249
Query: 195 GGKRYALWAAKMAVAQNIGVPWIMCQQFDTPD--PVINTCNSFYCDQFTPHSPSMPKIWT 252
GK+ L+ + + + + T D P N +F + H P+ P + +
Sbjct: 250 LGKKVVLFTTDGTKESEL-LCGTLHGLYTTVDFGPEENVTEAFEKQRI--HEPNGPLVNS 306
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF---- 308
E + GW +G + +ED+A + + + G +V N YM+ GGTNFG +G +
Sbjct: 307 EYYTGWLDYWGEPHSTKSAEDVARGLEKMLELGANV-NMYMFQGGTNFGYWSGADYNNGI 365
Query: 309 ---ITTSYDYEAPIDEYGLPRN 327
ITTSYDY+AP+ E G P +
Sbjct: 366 YNPITTSYDYDAPLSEAGDPTD 387
>gi|301065438|ref|YP_003787461.1| glycosyl hydrolase, family 35 [Lactobacillus casei str. Zhang]
gi|300437845|gb|ADK17611.1| glycosyl hydrolase, family 35 [Lactobacillus casei str. Zhang]
Length = 598
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 103/337 (30%), Positives = 156/337 (46%), Gaps = 50/337 (14%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
+++G+ I+S AIHY R P W + K G NT+E+YV WN HE + G +
Sbjct: 7 DHEFMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYNEGDFD 66
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G ++ +F+ + +Y I+R P++ AE+ +GG P WL R D +
Sbjct: 67 FSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDSAYLQA 125
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+ ++ T ++ + ++ + GG +I+ QVENEYG Y GE K Y A++
Sbjct: 126 IDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSY----GED-KDYLAAVAELMKKHG 178
Query: 212 IGVPWIMCQQFDTPDP------------VINTCN-----SFYCDQFTP----HSPSMPKI 250
+ VP D P P ++ T N D+ H P +
Sbjct: 179 VDVPLFTS---DGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLM 235
Query: 251 WTENWPGWFKTFGG----RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
E W GWF +G RDP +ED+ + R GSV N YM+HGGTNFG G
Sbjct: 236 CMEFWDGWFNRWGEPIIRRDPEETAEDLRAVIQR-----GSV-NLYMFHGGTNFGFMNGT 289
Query: 307 PF-------ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+AP++E G P PK+ ++++
Sbjct: 290 SARKDHDLPQVTSYDYDAPLNEQGNP-TPKYFAIQKM 325
>gi|194857009|ref|XP_001968877.1| GG24263 [Drosophila erecta]
gi|190660744|gb|EDV57936.1| GG24263 [Drosophila erecta]
Length = 672
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 96/321 (29%), Positives = 160/321 (49%), Gaps = 31/321 (9%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+ + + + +++G+ +S + HY R+VP W ++ + G+N +++YV W+ H
Sbjct: 48 IDHAANTFMLDGQPFRYVSGSFHYFRAVPESWRSRLRTMRASGLNALDTYVEWSLHNPHD 107
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHY-IPGTVFRNDTE 146
G+Y + G ++VKF++I QQ Y+ILR GP++ AE + GG+P WL P R +
Sbjct: 108 GEYNWEGIADVVKFLEIAQQEDFYIILRPGPYICAERDNGGLPHWLFAKYPSIKMRTNDP 167
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYE------SFYGEGGKRYA 200
+ + K+ + M + + LF GG II+ QVENEYG Y ++ + ++Y
Sbjct: 168 NYIAEVGKWYAEL--MPRLQHLFVGNGGKIIMVQVENEYGDYACDHDYLNWLRDETEKYV 225
Query: 201 LWAAKMAVAQNIGVPWIMCQQ----FDTPDPVINTCNSFYCDQ----FTPHSPSMPKIWT 252
A + +I + C + F T D I+ N DQ P+ P + +
Sbjct: 226 TGKA-LLFTVDIPNEKMSCGKIENVFATTDFGIDRINEI--DQIWAMLRTLQPTGPLVNS 282
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF---- 308
E +PGW + ++ R +++A ++ SV N YM+ GGTNFG TAG +
Sbjct: 283 EFYPGWLTHWQEQNQRRDGQEVANALRTILSYNASV-NLYMFFGGTNFGFTAGANYNLDG 341
Query: 309 ------ITTSYDYEAPIDEYG 323
TSYDY+A +DE G
Sbjct: 342 GIGYAADITSYDYDAVMDEAG 362
>gi|418004004|ref|ZP_12644053.1| beta-galactosidase 3 [Lactobacillus casei UW1]
gi|410551057|gb|EKQ25134.1| beta-galactosidase 3 [Lactobacillus casei UW1]
Length = 598
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 103/337 (30%), Positives = 156/337 (46%), Gaps = 50/337 (14%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
+++G+ I+S AIHY R P W + K G NT+E+YV WN HE + G +
Sbjct: 7 DHEFMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYNEGDFD 66
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G ++ +F+ + +Y I+R P++ AE+ +GG P WL R D +
Sbjct: 67 FSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDSAYLQA 125
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+ ++ T ++ + ++ + GG +I+ QVENEYG Y GE K Y A++
Sbjct: 126 IDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSY----GED-KDYLAAVAELMKKHG 178
Query: 212 IGVPWIMCQQFDTPDP------------VINTCN-----SFYCDQFTP----HSPSMPKI 250
+ VP D P P ++ T N D+ H P +
Sbjct: 179 VDVPLFTS---DGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLM 235
Query: 251 WTENWPGWFKTFGG----RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
E W GWF +G RDP +ED+ + R GSV N YM+HGGTNFG G
Sbjct: 236 CMEFWDGWFNRWGEPIIRRDPEETAEDLRAVIQR-----GSV-NLYMFHGGTNFGFMNGT 289
Query: 307 PF-------ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+AP++E G P PK+ ++++
Sbjct: 290 SARKDHDLPQVTSYDYDAPLNEQGNP-TPKYFAIQKM 325
>gi|414564444|ref|YP_006043405.1| beta-galactosidase precursor [Streptococcus equi subsp.
zooepidemicus ATCC 35246]
gi|338847509|gb|AEJ25721.1| beta-galactosidase precursor [Streptococcus equi subsp.
zooepidemicus ATCC 35246]
Length = 599
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 103/341 (30%), Positives = 152/341 (44%), Gaps = 42/341 (12%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
S ++GR I+S AIHY R P W + K G NT+E+Y+ WN HE G Y
Sbjct: 9 SDQFYLDGRPLQILSGAIHYFRIHPDDWYHSLYNLKALGFNTVETYIPWNLHEAKEGSYD 68
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G+ ++ F+ + Q+ +Y I+R P++ AE+ +GG+P WL + +D Y
Sbjct: 69 FSGQLDVEAFLTLAQRLGLYAIVRPSPYICAEWEFGGLPAWLLTKNCYIRSSDPVYLAYV 128
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+ + L+ + + E QGG I++ Q+ENEYG Y G + L A K + ++
Sbjct: 129 RRYYEELLPRLARHE---WQQGGNILMFQLENEYGSY------GEDKAYLKAIKALMEEH 179
Query: 212 IGVPWIMCQQFDTP------------DPVINTCN------SFYCDQ---FTPHSPSMPKI 250
+ P D P D V T N + D F+ H + P +
Sbjct: 180 LSAPLFTA---DGPWRATLRAGSLIEDDVFVTGNFGSRAQENFADMQAFFSEHGKAWPLM 236
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF-- 308
E W GWF + R E++A +V +G N YM+HGGTNFG G
Sbjct: 237 CMEFWDGWFNRWHEPIIKRDPEELADAVMEVLAQGSI--NLYMFHGGTNFGFMNGCSARK 294
Query: 309 -----ITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCE 344
TSYDY+A +DE G P + K L + E
Sbjct: 295 QLDLPQVTSYDYDAILDEAGNPTAKFYAIQKRLTAELSEIE 335
>gi|319945941|ref|ZP_08020191.1| beta-galactosidase [Streptococcus australis ATCC 700641]
gi|417919516|ref|ZP_12563047.1| glycosyl hydrolase family 35 [Streptococcus australis ATCC 700641]
gi|319748006|gb|EFW00250.1| beta-galactosidase [Streptococcus australis ATCC 700641]
gi|342832897|gb|EGU67186.1| glycosyl hydrolase family 35 [Streptococcus australis ATCC 700641]
Length = 595
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 103/319 (32%), Positives = 154/319 (48%), Gaps = 38/319 (11%)
Query: 44 IISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIK 103
I+S AIHY R W + K G NT+E+YV WN HE G ++F G +L FI+
Sbjct: 19 ILSGAIHYFRIDREDWYHSLYNLKALGFNTVETYVPWNAHEPQRGHFHFEGNLDLEHFIQ 78
Query: 104 IIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP-FKYHMQKFMTLIVDM 162
+ Q+ +Y+ILR PF+ +E+ +GG+P WL I + ++P F + ++ ++
Sbjct: 79 VAQELDLYVILRPSPFICSEWEFGGLPAWL--IEKDLRIRSSDPAFLEEVARYYDELLPR 136
Query: 163 MKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVP------- 215
+ + +L +GG I++ QVENEYG Y GE K Y + + ++I P
Sbjct: 137 VAKYQL--DRGGNILMMQVENEYGSY----GED-KAYLRAIRDLMIERDITCPLFTSDGP 189
Query: 216 WIMCQQFDT--PDPVINTCN---------SFYCDQFTPHSPSMPKIWTENWPGWFKTFGG 264
W + T D + T N S + F H P + E W GWF +
Sbjct: 190 WRATLRAGTLIEDGLFVTGNFGSRANYNFSQMKEFFAEHDRKWPLMCMEFWDGWFNRWKE 249
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF-------ITTSYDYEA 317
R E++A +V Q+G N YM+HGGTNFG G TSYDY+A
Sbjct: 250 PIIKRDPEELAEAVHEVLQEGSI--NLYMFHGGTNFGFMNGCSARGTVDLPQVTSYDYDA 307
Query: 318 PIDEYGLPRNPKWGHLKEL 336
+DE G P PK+ +K++
Sbjct: 308 LLDEQGNP-TPKYDAVKKM 325
>gi|169604026|ref|XP_001795434.1| hypothetical protein SNOG_05023 [Phaeosphaeria nodorum SN15]
gi|111066294|gb|EAT87414.1| hypothetical protein SNOG_05023 [Phaeosphaeria nodorum SN15]
Length = 638
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 111/349 (31%), Positives = 158/349 (45%), Gaps = 41/349 (11%)
Query: 5 TPIAPFALLIFFSSSITYCFAGNVTY--DSRSLIINGRRELIISAAIHYPRSVPGMWPGL 62
TPI F S ++ N T+ D + I G+ II I R WP
Sbjct: 19 TPITDF-------SDLSDAAQANSTFSWDKNTFYIEGKPYSIIGGQIDPQRVPRAYWPQR 71
Query: 63 VQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAA 122
+Q AK G+NTI SYV+W E PG++ F + ++ + + IQ+A M +LR GP+V A
Sbjct: 72 LQMAKSMGLNTILSYVYWQDIEQHPGQFDFTDKNDIAAWFQEIQKAGMKAVLRPGPYVCA 131
Query: 123 EYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVE 182
E ++GG+P WL I G R++ PF K++T + + + L + GGPI++ QVE
Sbjct: 132 ERDWGGMPGWLPQISGMKHRSNNGPFLDATNKYLTKVGAQL--QPLLIANGGPILMVQVE 189
Query: 183 NEYGYYESFYG-------------EGGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVI 229
NEYG+ S + K Y A +N VP + FD D +
Sbjct: 190 NEYGWAGSDHTYTNKLADILKANFPNTKLYTNDANNAGALKNGQVPGALA-VFDGTD-MK 247
Query: 230 NTCNSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRP-SEDIAFSVAR-----FFQ 283
N + T S P + E W WF +G ++ H D R +
Sbjct: 248 NGVTTLRS-AITDASSIGPAMNGEYWIRWFDNWGPKNGHSSYDRDTNGMQGRANDLDWML 306
Query: 284 KGGSVHNYYMYHGGTNFGRTAGG-------PFITTSYDYEAPIDEYGLP 325
G + +M+HGGT+F AG PF TTSYDY AP+DE G P
Sbjct: 307 TNGHHFSIFMFHGGTSFAFGAGSGDTTPRTPF-TTSYDYGAPLDETGRP 354
>gi|225872977|ref|YP_002754436.1| beta-galactosidase [Acidobacterium capsulatum ATCC 51196]
gi|225792973|gb|ACO33063.1| beta-galactosidase [Acidobacterium capsulatum ATCC 51196]
Length = 619
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 102/322 (31%), Positives = 149/322 (46%), Gaps = 28/322 (8%)
Query: 25 AGNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHE 84
A T I++G+ IIS +IH+ R W +++A+ G+N I YVFWN E
Sbjct: 35 AHTATVGDGHFILDGKPVQIISGSIHFARVPRAEWGDRLRKARAMGLNAISVYVFWNVQE 94
Query: 85 LSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRND 144
G++ F G++++ +FI++ QQA +Y+ILR GP+ AE++ GG P WL R+
Sbjct: 95 PHRGQWDFSGQYDVARFIRMAQQAGLYVILRPGPYACAEWSMGGYPAWLWKDGRVKIRSS 154
Query: 145 TEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY--ESFYGEGGKRYA-- 200
+ + Q +M + +K L + GGPII QVENEYG + Y E +R
Sbjct: 155 DPAYLHAAQDYMDHLGQQLK--PLLWTHGGPIIAVQVENEYGSFGKSRAYLEEVRRMVAG 212
Query: 201 --LWAAKMAVAQNIGVPWI-----MCQQFDT-PDPVINTCNSFYCDQFTPHSPSMPKIWT 252
L + A G+ W + + D P V N + PHS +
Sbjct: 213 AGLGGVVLYTADGPGL-WSGSLPELPEAIDVGPGGVENGVKQLLA--YRPHSKLV--YVA 267
Query: 253 ENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG------ 306
E +PGWF +G H R+ G N YM+HGGT++G G
Sbjct: 268 EYYPGWFDQWGQPHHHGAPLKEQLKDLRWILSRGYSVNLYMFHGGTDWGFMNGANDNAAD 327
Query: 307 ---PFITTSYDYEAPIDEYGLP 325
TTSYDY AP++E G P
Sbjct: 328 TDYAPQTTSYDYAAPLNEAGDP 349
>gi|328958462|ref|YP_004375848.1| beta-galactosidase [Carnobacterium sp. 17-4]
gi|328674786|gb|AEB30832.1| beta-galactosidase [Carnobacterium sp. 17-4]
Length = 589
Score = 136 bits (342), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 95/318 (29%), Positives = 149/318 (46%), Gaps = 34/318 (10%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG I S A+HY R +P W + K G NT+E+Y+ WN HE G+Y F
Sbjct: 8 EDFLLNGEPFKITSGAVHYFRVLPEDWYHSLYNLKALGFNTVETYIPWNVHEPKEGEYQF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G++++ KF+++ ++ +++ILR P++ AE+ +GG+P WL + R+ F +
Sbjct: 68 SGQWDIKKFVQLAEELGLFVILRPSPYICAEWEFGGLPAWLLTYKDMLIRSSDPVFIEKV 127
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNI 212
++ ++ + L GGP+I+ Q+ENEYG YGE K Y ++ + +
Sbjct: 128 SRYYKELLKQIT--PLQVDHGGPVIMMQLENEYGS----YGE-DKEYLRTLYELMLKLGV 180
Query: 213 GVP-------WIMCQQFDT-PDPVINTCNSFYC------DQFTPHSPSMPKIW----TEN 254
+P W Q+ T D I T +F + S K W E
Sbjct: 181 TIPIFTSDGAWRATQEAGTMTDLDILTTGNFGSRSKENFKELKEFHESKGKKWPLMCMEY 240
Query: 255 WPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF------ 308
W GWF + R + ++ V + G N YM+HGGTNFG G
Sbjct: 241 WDGWFNRWNDPIIKRDALELTQDVKEALEIGSL--NLYMFHGGTNFGFMNGCSARLRKDL 298
Query: 309 -ITTSYDYEAPIDEYGLP 325
TSYDY+AP++E G P
Sbjct: 299 PQVTSYDYDAPLNEQGNP 316
>gi|426249767|ref|XP_004018620.1| PREDICTED: beta-galactosidase [Ovis aries]
Length = 634
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 93/314 (29%), Positives = 148/314 (47%), Gaps = 22/314 (7%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
+ Y + +G+ IS +IHY R W + + K G+N I++YV WN HEL
Sbjct: 20 QIDYRRNRFLKDGQPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQTYVAWNFHELQ 79
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
PG+Y F G ++ FI++ + + +ILR GP++ AE++ GG+P WL V R+
Sbjct: 80 PGRYNFSGDHDVEHFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKKSIVLRSSDP 139
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYES-------FYGEGGKRY 199
+ + K++ +++ M+ L GGPII QVENEYG Y S F + + +
Sbjct: 140 DYLAAVDKWLGVLLPKMR--PLLYKNGGPIITVQVENEYGSYYSCDYDYLRFLQKRFQDH 197
Query: 200 ALWAAKMAVAQNIGVPWIMCQQFDTPDPVI------NTCNSFYCDQFTPHSPSMPKIWTE 253
+ + ++ C + N +F + P P I +E
Sbjct: 198 LGEDVLLFTTDGVNEEFLQCGALQGLYATVDFSTGSNLTAAFMLQR--KFEPRGPLINSE 255
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG--RTAGGPF--I 309
+ GW +G R S+ +AF++ G +V N YM+ GG+NF A P+
Sbjct: 256 FYTGWLDHWGQRHSTVSSKAVAFTLHDMLALGANV-NMYMFIGGSNFAYWNGANTPYQPQ 314
Query: 310 TTSYDYEAPIDEYG 323
TSYDY+AP+ E G
Sbjct: 315 PTSYDYDAPLSEAG 328
>gi|393780989|ref|ZP_10369190.1| hypothetical protein HMPREF1071_00058 [Bacteroides salyersiae
CL02T12C01]
gi|392677324|gb|EIY70741.1| hypothetical protein HMPREF1071_00058 [Bacteroides salyersiae
CL02T12C01]
Length = 776
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 97/313 (30%), Positives = 146/313 (46%), Gaps = 32/313 (10%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
++ ++NG ++ +A +HY R W ++ K G+NTI YVFWN HE G++
Sbjct: 31 KKTFLLNGEPFIVKAAELHYTRIPQPYWEHRIKMCKALGMNTICLYVFWNIHEQEEGQFD 90
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G+ ++ F ++ Q+ MY+I+R GP+V AE+ GG+P WL R +P Y+
Sbjct: 91 FTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIALRT-LDP--YY 147
Query: 152 MQK---FMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAV 208
M++ FM + + + L ++GG II+ QVENEYG Y + K Y M
Sbjct: 148 MERVGIFMKKVGEQLV--PLQITRGGNIIMVQVENEYGSYGT-----DKPYVSAIRDMVR 200
Query: 209 AQNIG-VPWIMCQ--------QFDTPDPVINTCNSFYCDQ----FTPHSPSMPKIWTENW 255
VP C D +N DQ P P + +E W
Sbjct: 201 GAGFTEVPLFQCDWSSNFTNNALDDLLWTVNFGTGANIDQQFKKLKELRPETPLMCSEFW 260
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-----PFIT 310
GWF +G + RP++D+ + + S + YM HGGT FG G +
Sbjct: 261 SGWFDHWGRKHETRPAKDMVQGLKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMC 319
Query: 311 TSYDYEAPIDEYG 323
+SYDY+API E G
Sbjct: 320 SSYDYDAPISEAG 332
>gi|223932593|ref|ZP_03624593.1| Beta-galactosidase [Streptococcus suis 89/1591]
gi|302023447|ref|ZP_07248658.1| beta-galactosidase precursor [Streptococcus suis 05HAS68]
gi|386583558|ref|YP_006079961.1| beta-galactosidase [Streptococcus suis D9]
gi|223898703|gb|EEF65064.1| Beta-galactosidase [Streptococcus suis 89/1591]
gi|353735704|gb|AER16713.1| Beta-galactosidase [Streptococcus suis D9]
Length = 590
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 107/343 (31%), Positives = 158/343 (46%), Gaps = 38/343 (11%)
Query: 30 YDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGK 89
Y ++G I+S AIHY R P W + K G NT+E+YV WN HE G+
Sbjct: 5 YIGDQFYLDGEPFKILSGAIHYFRVHPDDWYHSLYNLKALGFNTVETYVPWNMHEPRKGE 64
Query: 90 YYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFK 149
+ + G ++ +F+K+ Q+ +Y I+R P++ AE+ +GG+P WL V +D+ +
Sbjct: 65 FCYEGILDIERFLKLAQELGLYAIVRPSPYICAEWEWGGLPAWLMKEELRVRSSDSVYLQ 124
Query: 150 YHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVA 209
+ + + +LI K KL +QGG +++ QVENEYG YGE K Y A +
Sbjct: 125 HLDEYYASLIP---KLAKLQLAQGGNVLMFQVENEYGS----YGE-EKEYLRSVAGLMRK 176
Query: 210 QNIGVP-------WIMCQQFDT--PDPVINTCN---------SFYCDQFTPHSPSMPKIW 251
+ P W + T D V T N + F H + P +
Sbjct: 177 HGLTAPLFTSDGSWRATLRAGTLIEDDVFVTGNFGSKARENFANMTAFFNEHQKNWPLMC 236
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--- 308
E W GWF +G R E++ SV + G N YM+HGGTNFG G
Sbjct: 237 MEFWDGWFNRWGDEIIRREPEEMVDSVMECIELGSL--NLYMFHGGTNFGFMNGCSARGQ 294
Query: 309 ----ITTSYDYEAPIDEYGLPRNPKW---GHLKELHGAIKLCE 344
TSYDY+A +DE G P + LKE++ ++ E
Sbjct: 295 IDLPQVTSYDYDAILDEAGNPTKKFYILQQRLKEVYPELEYAE 337
>gi|386585602|ref|YP_006082004.1| beta-galactosidase [Streptococcus suis D12]
gi|353737748|gb|AER18756.1| Beta-galactosidase [Streptococcus suis D12]
Length = 590
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 107/343 (31%), Positives = 158/343 (46%), Gaps = 38/343 (11%)
Query: 30 YDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGK 89
Y ++G I+S AIHY R P W + K G NT+E+YV WN HE G+
Sbjct: 5 YIGDQFYLDGEPFKILSGAIHYFRVHPDDWYHSLYNLKALGFNTVETYVPWNMHEPRKGE 64
Query: 90 YYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFK 149
+ + G ++ +F+K+ Q+ +Y I+R P++ AE+ +GG+P WL V +D+ +
Sbjct: 65 FCYEGILDIERFLKLAQELGLYAIVRPSPYICAEWEWGGLPAWLMKEELRVRSSDSVYLQ 124
Query: 150 YHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVA 209
+ + + +LI K KL +QGG +++ QVENEYG YGE K Y A +
Sbjct: 125 HLDEYYASLIP---KLAKLQLAQGGNVLMFQVENEYGS----YGE-EKEYLRSVAGLMRK 176
Query: 210 QNIGVP-------WIMCQQFDT--PDPVINTCN---------SFYCDQFTPHSPSMPKIW 251
+ P W + T D V T N + F H + P +
Sbjct: 177 HGLTAPLFTSDGSWRATLRAGTLIEDDVFVTGNFGSKARENFANMTAFFNEHQKNWPLMC 236
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--- 308
E W GWF +G R E++ SV + G N YM+HGGTNFG G
Sbjct: 237 MEFWDGWFNRWGDEIIRREPEEMVDSVMECIELGSL--NLYMFHGGTNFGFMNGCSARGQ 294
Query: 309 ----ITTSYDYEAPIDEYGLPRNPKW---GHLKELHGAIKLCE 344
TSYDY+A +DE G P + LKE++ ++ E
Sbjct: 295 IDLPQVTSYDYDAILDEAGNPTKKFYLLQQRLKEVYPELEYAE 337
>gi|330832298|ref|YP_004401123.1| beta-galactosidase [Streptococcus suis ST3]
gi|329306521|gb|AEB80937.1| Beta-galactosidase [Streptococcus suis ST3]
Length = 590
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 107/343 (31%), Positives = 158/343 (46%), Gaps = 38/343 (11%)
Query: 30 YDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGK 89
Y ++G I+S AIHY R P W + K G NT+E+YV WN HE G+
Sbjct: 5 YIGDQFYLDGEPFKILSGAIHYFRVHPDDWYHSLYNLKALGFNTVETYVPWNMHEPRKGE 64
Query: 90 YYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFK 149
+ + G ++ +F+K+ Q+ +Y I+R P++ AE+ +GG+P WL V +D+ +
Sbjct: 65 FCYEGILDIERFLKLAQELGLYAIVRPSPYICAEWEWGGLPAWLMKEELRVRSSDSVYLQ 124
Query: 150 YHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVA 209
+ + + +LI K KL +QGG +++ QVENEYG YGE K Y A +
Sbjct: 125 HLDEYYASLIP---KLAKLQLAQGGNVLMFQVENEYGS----YGE-EKEYLRSVAGLMRK 176
Query: 210 QNIGVP-------WIMCQQFDT--PDPVINTCN---------SFYCDQFTPHSPSMPKIW 251
+ P W + T D V T N + F H + P +
Sbjct: 177 HGLTAPLFTSDGSWRATLRAGTLIEDDVFVTGNFGSKARENFANMTAFFNEHQKNWPLMC 236
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--- 308
E W GWF +G R E++ SV + G N YM+HGGTNFG G
Sbjct: 237 MEFWDGWFNRWGDEIIRREPEEMVDSVMECIELGSL--NLYMFHGGTNFGFMNGCSARGQ 294
Query: 309 ----ITTSYDYEAPIDEYGLPRNPKW---GHLKELHGAIKLCE 344
TSYDY+A +DE G P + LKE++ ++ E
Sbjct: 295 IDLPQVTSYDYDAILDEAGNPTKKFYILQQRLKEVYPELEYAE 337
>gi|418000981|ref|ZP_12641151.1| beta-galactosidase 3 [Lactobacillus casei UCD174]
gi|418009807|ref|ZP_12649594.1| beta-galactosidase 3 [Lactobacillus casei Lc-10]
gi|410548851|gb|EKQ23035.1| beta-galactosidase 3 [Lactobacillus casei UCD174]
gi|410554934|gb|EKQ28899.1| beta-galactosidase 3 [Lactobacillus casei Lc-10]
Length = 598
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 103/337 (30%), Positives = 156/337 (46%), Gaps = 50/337 (14%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
+++G+ I+S AIHY R P W + K G NT+E+YV WN HE + G +
Sbjct: 7 DHEFMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYNEGDFD 66
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G ++ +F+ + +Y I+R P++ AE+ +GG P WL R D +
Sbjct: 67 FSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDPAYLQA 125
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+ ++ T ++ + ++ + GG +I+ QVENEYG Y GE K Y A++
Sbjct: 126 IDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSY----GED-KDYLAAVAELMKKHG 178
Query: 212 IGVPWIMCQQFDTPDP------------VINTCN-----SFYCDQFTP----HSPSMPKI 250
+ VP D P P ++ T N D+ H P +
Sbjct: 179 VDVPLFTS---DGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLM 235
Query: 251 WTENWPGWFKTFGG----RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
E W GWF +G RDP +ED+ + R GSV N YM+HGGTNFG G
Sbjct: 236 CMEFWDGWFNRWGEPIIRRDPEETAEDLRAVIQR-----GSV-NLYMFHGGTNFGFMNGT 289
Query: 307 PF-------ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+AP++E G P PK+ ++++
Sbjct: 290 SARKDHDLPQVTSYDYDAPLNEQGNP-TPKYFAIQKM 325
>gi|191637109|ref|YP_001986275.1| beta-galactosidase 3 [Lactobacillus casei BL23]
gi|385818812|ref|YP_005855199.1| galactosidase, beta 1-like protein [Lactobacillus casei LC2W]
gi|385821988|ref|YP_005858330.1| galactosidase, beta 1-like protein [Lactobacillus casei BD-II]
gi|409995961|ref|YP_006750362.1| beta-galactosidase 17 [Lactobacillus casei W56]
gi|190711411|emb|CAQ65417.1| Beta-galactosidase 3 [Lactobacillus casei BL23]
gi|327381139|gb|AEA52615.1| galactosidase, beta 1-like protein [Lactobacillus casei LC2W]
gi|327384315|gb|AEA55789.1| galactosidase, beta 1-like protein [Lactobacillus casei BD-II]
gi|406356973|emb|CCK21243.1| Beta-galactosidase 17 [Lactobacillus casei W56]
Length = 598
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 103/337 (30%), Positives = 156/337 (46%), Gaps = 50/337 (14%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
+++G+ I+S AIHY R P W + K G NT+E+YV WN HE + G +
Sbjct: 7 DHEFMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYNEGDFD 66
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G ++ +F+ + +Y I+R P++ AE+ +GG P WL R D +
Sbjct: 67 FSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDPAYLQA 125
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+ ++ T ++ + ++ + GG +I+ QVENEYG Y GE K Y A++
Sbjct: 126 IDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSY----GED-KDYLAAVAELMKKHG 178
Query: 212 IGVPWIMCQQFDTPDP------------VINTCN-----SFYCDQFTP----HSPSMPKI 250
+ VP D P P ++ T N D+ H P +
Sbjct: 179 VDVPLFTS---DGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLM 235
Query: 251 WTENWPGWFKTFGG----RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
E W GWF +G RDP +ED+ + R GSV N YM+HGGTNFG G
Sbjct: 236 CMEFWDGWFNRWGEPIIRRDPEETAEDLRAVIQR-----GSV-NLYMFHGGTNFGFMNGT 289
Query: 307 PF-------ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+AP++E G P PK+ ++++
Sbjct: 290 SARKDHDLPQVTSYDYDAPLNEQGNP-TPKYFTIQKM 325
>gi|24582088|ref|NP_608978.2| beta galactosidase, isoform A [Drosophila melanogaster]
gi|21430516|gb|AAM50936.1| LP09580p [Drosophila melanogaster]
gi|22945722|gb|AAF52321.2| beta galactosidase, isoform A [Drosophila melanogaster]
Length = 672
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 92/318 (28%), Positives = 157/318 (49%), Gaps = 25/318 (7%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+ +++ + +++G+ +S + HY R+VP W ++ + G+N +++YV W+ H
Sbjct: 48 IDHEANTFMLDGQPFRYVSGSFHYFRAVPESWRSRLRTMRASGLNALDTYVEWSLHNPHD 107
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHY-IPGTVFRNDTE 146
G+Y + G ++VKF++I Q+ Y+ILR GP++ AE + GG+P WL P R +
Sbjct: 108 GEYNWEGIADVVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFTKYPSIKMRTNDP 167
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY---ESFYGEGGKRYALWA 203
+ + K+ + M + + LF GG II+ QVENEYG Y + +
Sbjct: 168 NYISEVGKWYAEL--MPRLQHLFVGNGGKIIMVQVENEYGDYACDHDYLNWLRDETEKYV 225
Query: 204 AKMAVAQNIGVP--WIMCQQ----FDTPDPVINTCNSF--YCDQFTPHSPSMPKIWTENW 255
+ A+ + +P + C + F T D I+ N P+ P + +E +
Sbjct: 226 SGKALLFTVDIPNEKMSCGKIENVFATTDFGIDRINEIDKIWAMLRALQPTGPLVNSEFY 285
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF------- 308
PGW + ++ R +++A ++ SV N YM+ GGTNFG TAG +
Sbjct: 286 PGWLTHWQEQNQRRDGQEVANALRTILSYNASV-NLYMFFGGTNFGFTAGANYNLDGGIG 344
Query: 309 ---ITTSYDYEAPIDEYG 323
TSYDY+A +DE G
Sbjct: 345 YAADITSYDYDAVMDEAG 362
>gi|195473731|ref|XP_002089146.1| GE18961 [Drosophila yakuba]
gi|194175247|gb|EDW88858.1| GE18961 [Drosophila yakuba]
Length = 672
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 92/318 (28%), Positives = 157/318 (49%), Gaps = 25/318 (7%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+ +++ + +++G+ +S + HY R+VP W ++ + G+N +++YV W+ H
Sbjct: 48 IDHEANTFMLDGQPFRYVSGSFHYFRAVPESWRSRLRTMRASGLNALDTYVEWSLHNPHD 107
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHY-IPGTVFRNDTE 146
G+Y + G ++VKF++I Q+ Y+ILR GP++ AE + GG+P WL P R +
Sbjct: 108 GEYNWEGIADVVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFAKYPSIKMRTNDP 167
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY---ESFYGEGGKRYALWA 203
+ + K+ + M + + LF GG II+ QVENEYG Y + +
Sbjct: 168 NYISEVGKWYAEL--MPRLQHLFVGNGGKIIMVQVENEYGDYACDHDYLNWLRDETEKYV 225
Query: 204 AKMAVAQNIGVP--WIMCQQ----FDTPDPVINTCNSF--YCDQFTPHSPSMPKIWTENW 255
+ A+ + +P + C + F T D I+ N P+ P + +E +
Sbjct: 226 SGKALLFTVDIPNEKMSCGKIENVFATTDFGIDRINEIDKIWAMLRALQPTGPLVNSEFY 285
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF------- 308
PGW + ++ R +++A ++ SV N YM+ GGTNFG TAG +
Sbjct: 286 PGWLTHWQEQNQRRDGQEVANALRTILSYNASV-NLYMFFGGTNFGFTAGANYNLDGGIG 344
Query: 309 ---ITTSYDYEAPIDEYG 323
TSYDY+A +DE G
Sbjct: 345 YAADITSYDYDAVMDEAG 362
>gi|357455525|ref|XP_003598043.1| Beta-galactosidase [Medicago truncatula]
gi|355487091|gb|AES68294.1| Beta-galactosidase [Medicago truncatula]
Length = 309
Score = 135 bits (341), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 90/291 (30%), Positives = 138/291 (47%), Gaps = 43/291 (14%)
Query: 446 LKWQVFKEIA--GIWGEADFVKSGFVDHINTTKDTTDYLWYTTSIIVNENEEFLKNGSRP 503
LKW+ E + G+ F S ++ N T +DYLWY T ++VN+ + + +
Sbjct: 26 LKWEWASEPMQDTLLGKGTFTASKLLNQKNVTAGASDYLWYMTEVVVNDTKIW----GKA 81
Query: 504 VLLIESKGHALHAFANQELQGSASGNGTHPPFKYKNPISLKAGKNEIALLSMTVGLQNAG 563
L +++KG L+++ N G G+ + P F Y+ +SLK G N I+LLS+T+G N
Sbjct: 82 RLHVDTKGPILYSYINGFWWGVEGGSPSKPGFVYEEDVSLKQGANIISLLSVTLGKSNCS 141
Query: 564 PFYEWVGAGIT----SVKITGFNSGTLDLSTYSWTYKIGLQGEHLGIYNPGYRNNINWVS 619
+ + GI + T + + LDLS +W+YK+G+ G Y+P N + W
Sbjct: 142 GYIDMKETGIVGGPAKLISTEYPNNVLDLSKSTWSYKVGMNGVARKFYDPKSTNVVPW-Q 200
Query: 620 TMEPPKNQPLTWYKAVVKQPPGDEPIGLDMLKMGKGLAWLNGEEIGRYWPRKSRKSSPHD 679
T P+TWYK K P G + LD++ + +G AW+NG+ IGRYW
Sbjct: 201 TRNVSIEGPMTWYKTTFKTPEGSNLVVLDLIGLQRGKAWVNGQSIGRYW----------- 249
Query: 680 ECVQECDYRGKFNPDKCITGCGEPSQ-RWYHIPRSWFKPSENILVIFEEKG 729
GE S R+Y +PR + N LV+FEE G
Sbjct: 250 --------------------IGENSSFRFYAVPRPFLNKDVNTLVLFEELG 280
>gi|442626280|ref|NP_001260120.1| beta galactosidase, isoform B [Drosophila melanogaster]
gi|440213416|gb|AGB92656.1| beta galactosidase, isoform B [Drosophila melanogaster]
Length = 670
Score = 135 bits (341), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 92/318 (28%), Positives = 157/318 (49%), Gaps = 25/318 (7%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+ +++ + +++G+ +S + HY R+VP W ++ + G+N +++YV W+ H
Sbjct: 46 IDHEANTFMLDGQPFRYVSGSFHYFRAVPESWRSRLRTMRASGLNALDTYVEWSLHNPHD 105
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHY-IPGTVFRNDTE 146
G+Y + G ++VKF++I Q+ Y+ILR GP++ AE + GG+P WL P R +
Sbjct: 106 GEYNWEGIADVVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFTKYPSIKMRTNDP 165
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY---ESFYGEGGKRYALWA 203
+ + K+ + M + + LF GG II+ QVENEYG Y + +
Sbjct: 166 NYISEVGKWYAEL--MPRLQHLFVGNGGKIIMVQVENEYGDYACDHDYLNWLRDETEKYV 223
Query: 204 AKMAVAQNIGVP--WIMCQQ----FDTPDPVINTCNSF--YCDQFTPHSPSMPKIWTENW 255
+ A+ + +P + C + F T D I+ N P+ P + +E +
Sbjct: 224 SGKALLFTVDIPNEKMSCGKIENVFATTDFGIDRINEIDKIWAMLRALQPTGPLVNSEFY 283
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF------- 308
PGW + ++ R +++A ++ SV N YM+ GGTNFG TAG +
Sbjct: 284 PGWLTHWQEQNQRRDGQEVANALRTILSYNASV-NLYMFFGGTNFGFTAGANYNLDGGIG 342
Query: 309 ---ITTSYDYEAPIDEYG 323
TSYDY+A +DE G
Sbjct: 343 YAADITSYDYDAVMDEAG 360
>gi|417985674|ref|ZP_12626256.1| beta-galactosidase 3 [Lactobacillus casei 32G]
gi|410527574|gb|EKQ02437.1| beta-galactosidase 3 [Lactobacillus casei 32G]
Length = 598
Score = 135 bits (341), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 103/337 (30%), Positives = 156/337 (46%), Gaps = 50/337 (14%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
+++G+ I+S AIHY R P W + K G NT+E+YV WN HE + G +
Sbjct: 7 DHEFMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYNEGDFD 66
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G ++ +F+ + +Y I+R P++ AE+ +GG P WL R D +
Sbjct: 67 FSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDPAYLQA 125
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+ ++ T ++ + ++ + GG +I+ QVENEYG Y GE K Y A++
Sbjct: 126 IDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSY----GED-KDYLAAVAELMKKHG 178
Query: 212 IGVPWIMCQQFDTPDP------------VINTCN-----SFYCDQFTP----HSPSMPKI 250
+ VP D P P ++ T N D+ H P +
Sbjct: 179 VDVPLFTS---DGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLM 235
Query: 251 WTENWPGWFKTFGG----RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
E W GWF +G RDP +ED+ + R GSV N YM+HGGTNFG G
Sbjct: 236 CMEFWDGWFNRWGEPIIRRDPEETAEDLRAVIQR-----GSV-NLYMFHGGTNFGFMNGT 289
Query: 307 PF-------ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+AP++E G P PK+ ++++
Sbjct: 290 SARKDHDLPQVTSYDYDAPLNEQGNP-TPKYFAIQKM 325
>gi|86142033|ref|ZP_01060557.1| putative exported beta-galactosidase [Leeuwenhoekiella blandensis
MED217]
gi|85831596|gb|EAQ50052.1| putative exported beta-galactosidase [Leeuwenhoekiella blandensis
MED217]
Length = 620
Score = 135 bits (341), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 109/358 (30%), Positives = 166/358 (46%), Gaps = 40/358 (11%)
Query: 10 FALLIFFSSSITYCFA-----GNVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQ 64
FAL++ +++ FA + ++ S + NG+ I S +HY R W +Q
Sbjct: 9 FALVLIV---LSFGFAQAQDDASFKIENGSFVYNGKPTPIYSGEMHYERIPKEYWRHRIQ 65
Query: 65 QAKEGGVNTIESYVFWNGHELSPGKYYF-GGRFNLVKFIKIIQQARMYMILRIGPFVAAE 123
K G+NTI +YVFWN H +PG + F G N+ +FIKI ++ M++ILR GP+ E
Sbjct: 66 MMKAMGLNTIATYVFWNYHNPAPGVWDFESGNRNVAEFIKIAKEEEMFVILRPGPYACGE 125
Query: 124 YNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVEN 183
+ +GG P +L IPG R + F ++++ + + L + GG II+ QVEN
Sbjct: 126 WEFGGYPWFLQNIPGLKVRENNAQFLAACKEYINELAKQVA--PLQVNNGGNIIMTQVEN 183
Query: 184 EYGYY----ESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQ-----QFDTPDPVINTCN- 233
E+G Y E E K Y KM P+ + + + V+ T N
Sbjct: 184 EFGSYVAQREDIAPEDHKAYKEAIFKMLKDAGFQAPFFTSDGAWLFEGGSLEGVLPTANG 243
Query: 234 -------SFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGG 286
++F ++ P + E +PGW + + DIA + K G
Sbjct: 244 EGNIDNLKKVVNKF--NNNEGPYMVAEFYPGWLDHWAEPFVKISASDIA-KQTEVYLKNG 300
Query: 287 SVHNYYMYHGGTNFGRTAGGPF--------ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
N+YM HGGTNFG T+G + TSYDY+API E G PK+ ++ L
Sbjct: 301 VNFNFYMAHGGTNFGFTSGANYNDEHDIQPDITSYDYDAPISEAGW-VTPKYDSIRAL 357
>gi|384428898|ref|YP_005638258.1| beta-galactosidase [Xanthomonas campestris pv. raphani 756C]
gi|341938001|gb|AEL08140.1| beta-galactosidase [Xanthomonas campestris pv. raphani 756C]
Length = 613
Score = 135 bits (341), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 98/355 (27%), Positives = 158/355 (44%), Gaps = 26/355 (7%)
Query: 4 RTPIAPFALLIFFSSSITYCFAGNVTYDS-----RSLIINGRRELIISAAIHYPRSVPGM 58
RT +AP L + + IT A + + + + +G+ ++S AIH+ R
Sbjct: 3 RTTLAPLVLALAIALPITATAASDDQWPTFATQGTQFVRDGKPYQVLSGAIHFQRIPRAY 62
Query: 59 WPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGP 118
W +Q+A+ G+NT+E+YVFWN E G++ F ++ F++ + +ILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFNANNDVAAFVREAAAQGLNVILRPGP 122
Query: 119 FVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIIL 178
+ AE+ GG P WL R+ F Q ++ + ++ L GGPII
Sbjct: 123 YACAEWEAGGYPAWLFGKDNIRVRSRDPRFLAASQSYLDAVAQQVR--PLLNHNGGPIIA 180
Query: 179 AQVENEYGYYESFYGEGGKRYALWAA----KMAVAQNIGVPWIMCQQFDTPDPVINTC-- 232
QVENEYG Y+ + A++ K + + G + V+N
Sbjct: 181 VQVENEYGSYDDDHAYMADNRAMFVKAGFDKALLFTSDGADMLANGTLPGTLAVVNFAPG 240
Query: 233 -NSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNY 291
D+ P P++ E W GWF +G ++ + ++G S N
Sbjct: 241 EAKSAFDKLIKFQPDQPRMVGEYWAGWFDHWGTPHASTNAKQQTEELEWILRQGHSA-NL 299
Query: 292 YMYHGGTNFGRTAGGPF----------ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
YM+ GGT+FG G F TTSYDY+A +DE G P PK+ ++++
Sbjct: 300 YMFIGGTSFGFMNGANFQGNPSDHYAPQTTSYDYDAILDEAGRP-TPKFALMRDV 353
>gi|389856131|ref|YP_006358374.1| beta-galactosidase [Streptococcus suis ST1]
gi|353739849|gb|AER20856.1| Beta-galactosidase [Streptococcus suis ST1]
Length = 590
Score = 135 bits (341), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 107/343 (31%), Positives = 158/343 (46%), Gaps = 38/343 (11%)
Query: 30 YDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGK 89
Y ++G I+S AIHY R P W + K G NT+E+YV WN HE G+
Sbjct: 5 YIGDQFYLDGEPFKILSGAIHYFRVHPDDWYHSLYNLKALGFNTVETYVPWNMHEPRKGE 64
Query: 90 YYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFK 149
+ + G ++ +F+K+ Q+ +Y I+R P++ AE+ +GG+P WL V +D+ +
Sbjct: 65 FCYEGILDIERFLKLAQELGLYAIVRPSPYICAEWEWGGLPAWLMKEELRVRSSDSVYLQ 124
Query: 150 YHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVA 209
+ + + +LI K KL +QGG +++ QVENEYG YGE K Y A +
Sbjct: 125 HLDEYYASLIP---KLAKLQLAQGGNVLMFQVENEYGS----YGE-EKAYLRAVAGLMRK 176
Query: 210 QNIGVP-------WIMCQQFDT--PDPVINTCN---------SFYCDQFTPHSPSMPKIW 251
+ P W + T D V T N + F H + P +
Sbjct: 177 HGLTAPLFTSDGSWRATLRAGTLIEDDVFVTGNFGSKARENFANMTAFFNEHQKNWPLMC 236
Query: 252 TENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--- 308
E W GWF +G R E++ SV + G N YM+HGGTNFG G
Sbjct: 237 MEFWDGWFNRWGDEIIRREPEEMVDSVMECIELGSL--NLYMFHGGTNFGFMNGCSARGQ 294
Query: 309 ----ITTSYDYEAPIDEYGLPRNPKW---GHLKELHGAIKLCE 344
TSYDY+A +DE G P + LKE++ ++ E
Sbjct: 295 IDLPQVTSYDYDAILDEAGNPTKKFYILQQRLKEVYPELEYAE 337
>gi|157106611|ref|XP_001649403.1| beta-galactosidase [Aedes aegypti]
gi|108879822|gb|EAT44047.1| AAEL004580-PA [Aedes aegypti]
Length = 656
Score = 135 bits (341), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 95/326 (29%), Positives = 155/326 (47%), Gaps = 37/326 (11%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+ YD + +++G+ ++ + HY R++P W ++ + GG+N ++ YV W+ H
Sbjct: 45 IDYDRDTFVMDGKDFRYVAGSFHYFRALPQTWRTKLKTLRAGGLNAVDLYVQWSLHNPKE 104
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHY-IPGTVFRNDTE 146
+Y + G N+ I+ +A +Y+ILR GP++ AE + GG+P WL PG R
Sbjct: 105 NQYVWDGIANIKDVIEAAIEADLYVILRPGPYICAEIDNGGLPYWLFTKYPGIQVRTSDA 164
Query: 147 PFKYHM----QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGY-------YESFYGEG 195
+ + +K M+ + M GGPII+ Q+ENEYG Y +F E
Sbjct: 165 NYLKEVATWYEKLMSQLTPYM------YGNGGPIIMVQLENEYGAFGKCDKPYLNFLKEE 218
Query: 196 GKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQ--------FTPHSPSM 247
++Y A + + C Q P + T D+ P+
Sbjct: 219 TEKYTQGKAVLFTVDRPYGNEMECGQ--VPGVFVTTDFGLMTDEEVDTHKAKLRSVQPNG 276
Query: 248 PKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG-- 305
P + TE + GW + + RP+E +A ++ + G +V ++YMY GGTNFG AG
Sbjct: 277 PLVNTEFYTGWLTHWQESNQRRPAEPLANTLRKMLHDGWNV-DFYMYFGGTNFGFWAGAN 335
Query: 306 ----GPFIT--TSYDYEAPIDEYGLP 325
G ++ TSYDY+AP+DE G P
Sbjct: 336 DWGLGKYMADITSYDYDAPMDEAGDP 361
>gi|12852936|dbj|BAB29584.1| unnamed protein product [Mus musculus]
Length = 586
Score = 135 bits (341), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 91/298 (30%), Positives = 145/298 (48%), Gaps = 26/298 (8%)
Query: 43 LIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFI 102
+I+ +IHY R W + + + G NT+ +Y+ WN HE GK+ F +L ++
Sbjct: 1 MIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEILDLEAYV 60
Query: 103 KIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDM 162
+ + +++ILR GP++ AE + GG+P WL P T R + F + K+ ++
Sbjct: 61 LLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDKYFDHLIP- 119
Query: 163 MKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVPWIMCQ-- 220
K L GGP+I QVENEYG ++ + Y + K + + I V ++
Sbjct: 120 -KILPLQYRHGGPVIAVQVENEYGSFQK-----DRNYMNYLKKALLKRGI-VELLLTSDD 172
Query: 221 ----QFDTPDPVINTC--NSFYCDQFT---PHSPSMPKIWTENWPGWFKTFGGRDPHRPS 271
Q + + + T NSF D F P + E W GW+ ++G + + +
Sbjct: 173 KDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYDSWGSKHIEKSA 232
Query: 272 EDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF------ITTSYDYEAPIDEYG 323
E+I +V +F G S N YM+HGGTNFG GG + + TSYDY+A + E G
Sbjct: 233 EEIRHTVYKFISYGLSF-NMYMFHGGTNFGFINGGRYENHHISVVTSYDYDAVLSEAG 289
>gi|239629323|ref|ZP_04672354.1| glycosyl hydrolase [Lactobacillus paracasei subsp. paracasei
8700:2]
gi|417979668|ref|ZP_12620358.1| beta-galactosidase 3 [Lactobacillus casei 12A]
gi|417982493|ref|ZP_12623148.1| beta-galactosidase 3 [Lactobacillus casei 21/1]
gi|239528009|gb|EEQ67010.1| glycosyl hydrolase [Lactobacillus paracasei subsp. paracasei
8700:2]
gi|410526941|gb|EKQ01818.1| beta-galactosidase 3 [Lactobacillus casei 12A]
gi|410529717|gb|EKQ04508.1| beta-galactosidase 3 [Lactobacillus casei 21/1]
Length = 598
Score = 135 bits (341), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 103/337 (30%), Positives = 156/337 (46%), Gaps = 50/337 (14%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
+++G+ I+S AIHY R P W + K G NT+E+YV WN HE + G +
Sbjct: 7 DHEFMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYNEGDFD 66
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G ++ +F+ + +Y I+R P++ AE+ +GG P WL R D +
Sbjct: 67 FSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDPAYLQA 125
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+ ++ T ++ + ++ + GG +I+ QVENEYG Y GE K Y A++
Sbjct: 126 IDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSY----GED-KDYLAAVAELMKKHG 178
Query: 212 IGVPWIMCQQFDTPDP------------VINTCN-----SFYCDQFTP----HSPSMPKI 250
+ VP D P P ++ T N D+ H P +
Sbjct: 179 VDVPLFTS---DGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLM 235
Query: 251 WTENWPGWFKTFGG----RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
E W GWF +G RDP +ED+ + R GSV N YM+HGGTNFG G
Sbjct: 236 CMEFWDGWFNRWGEPIIRRDPEETAEDLRAVIQR-----GSV-NLYMFHGGTNFGFMNGT 289
Query: 307 PF-------ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+AP++E G P PK+ ++++
Sbjct: 290 SARKDHDLPQVTSYDYDAPLNEQGNP-TPKYFAIQKM 325
>gi|433461907|ref|ZP_20419504.1| beta-galactosidase [Halobacillus sp. BAB-2008]
gi|432189486|gb|ELK46587.1| beta-galactosidase [Halobacillus sp. BAB-2008]
Length = 579
Score = 135 bits (341), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 97/311 (31%), Positives = 151/311 (48%), Gaps = 22/311 (7%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+T ++ ++N + I+S AIHY R+VP W +++ K G+NT+E+YV WN HE
Sbjct: 2 LTAENGQFLLNDKPFQILSGAIHYFRTVPEHWEDRLEKLKALGLNTVETYVPWNLHEPRR 61
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G++ F G ++ FI+ +Y+I+R P++ AE+ GG+P WL V R+
Sbjct: 62 GEFEFSGLADIEGFIQTAADLGLYVIVRPAPYICAEWEMGGLPSWLLKDKDVVMRSSDPV 121
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY---ESFYGEGGKRYALWAA 204
+ +++ + + + K GGPII Q+ENEYG Y + + K+Y
Sbjct: 122 YLSYVESYYKEL--LPKFVPHLYQNGGPIIAMQIENEYGAYGNDQKYLTFLKKQYEQHGL 179
Query: 205 KMAVAQNIGVPWIMCQQFDTPDPVINTCN-----SFYCDQFTPHSPSMPKIWTENWPGWF 259
+ + G +I +Q PD V T N ++ PK+ E W GWF
Sbjct: 180 DTFLFTSDGPDFI--EQGSLPD-VTTTLNFGSKVEQAFERLDAFKTGSPKMVAEFWIGWF 236
Query: 260 KTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-------PFITTS 312
+ G R + D A ++ SV N+YM+HGGTNFG G P I TS
Sbjct: 237 DYWTGEHHTRDAGDAAAVFRELMERKASV-NFYMFHGGTNFGFMNGANHYDVYYPTI-TS 294
Query: 313 YDYEAPIDEYG 323
YDY++ + E G
Sbjct: 295 YDYDSLLTESG 305
>gi|445062232|ref|ZP_21374649.1| beta-galactosidase [Brachyspira hampsonii 30599]
gi|444506390|gb|ELV06735.1| beta-galactosidase [Brachyspira hampsonii 30599]
Length = 592
Score = 135 bits (341), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 99/330 (30%), Positives = 150/330 (45%), Gaps = 37/330 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
I+NG+ I+S AIHY R V W + K G NT+E+Y+ WN HE+ G + F
Sbjct: 8 EEFILNGKPIKILSGAIHYFRFVREYWEDCLYNLKAAGFNTVETYIPWNIHEIDEGFFDF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G ++ FIK Q+ + +ILR P++ AE+ +GG+P WL R +T+ F +
Sbjct: 68 SGNKDIASFIKTAQKLDLLVILRPTPYICAEWEFGGLPAWLLRYDNIKVRTNTQLF---L 124
Query: 153 QKFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
K ++ K + L ++ GP+I+ Q+ENEYG + + K Y + +
Sbjct: 125 SKVDAYYKELFKHIDDLQITRNGPVIMMQIENEYGSFGN-----DKEYLRALKNLMIKHG 179
Query: 212 IGVP-------WIMCQQFDT--PDPVINTCN-------SFYCDQ--FTPHSPSMPKIWTE 253
VP W + T D ++ T N SF + F P + E
Sbjct: 180 AEVPLFTSDGAWDAVLEAGTLIDDGILATVNFGSKAKESFDDTEKFFARKGIKKPLMCME 239
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI---- 309
W GWF + R ++D V ++G N YM+ GGTNFG G
Sbjct: 240 FWDGWFNLWKDPIIKRDADDFIMEVKEILKRGSI--NLYMFIGGTNFGFYNGTSVTGYTD 297
Query: 310 ---TTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+A + E+G P K+ L++L
Sbjct: 298 FPQITSYDYDAVLTEWGEP-TEKFYKLQKL 326
>gi|348508360|ref|XP_003441722.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Oreochromis
niloticus]
Length = 648
Score = 135 bits (341), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 97/316 (30%), Positives = 145/316 (45%), Gaps = 36/316 (11%)
Query: 31 DSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKY 90
+S + + LI+ +IHY R W + + K G+NT+ +YV WN HE G +
Sbjct: 60 NSSQFTLERKPFLILGGSIHYFRVPRAYWEDRLLKMKACGLNTLTTYVPWNLHEPERGVF 119
Query: 91 YFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKY 150
F + +L ++++ +++ILR GP++ AE++ GG+P WL P R F Y
Sbjct: 120 KFDDQLDLEAYLRLAASLGLWVILRPGPYICAEWDLGGLPSWLLRDPQMKLRTTYSGFTY 179
Query: 151 HMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY-----------ESFYGEGGKRY 199
+ F ++ + S+GGPII QVENEYG Y E+ G
Sbjct: 180 AVNSFFDEVIKKAVPHQY--SKGGPIIAVQVENEYGSYATDENYMPFIKEALLSRGITEL 237
Query: 200 ALWAAKMAVAQNIGVP----WIMCQQFDTPDPVINTCNSFYCDQFTPHSPSMPKIWTENW 255
L + + GV I Q+ D PD + Y +Q P PK+ E W
Sbjct: 238 LLTSDNKDGLKLGGVKGALETINFQKLD-PDEIK------YLEQIQPQQ---PKMVMEYW 287
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI------ 309
GWF +GG +E++ V + S+ N YM+HGGTNFG +G +
Sbjct: 288 SGWFDLWGGLHHVYTAEEMIPVVTEILKLDMSI-NLYMFHGGTNFGFMSGAFAVGLPAPK 346
Query: 310 --TTSYDYEAPIDEYG 323
TSYDY+AP+ E G
Sbjct: 347 PMVTSYDYDAPLSEAG 362
>gi|195146534|ref|XP_002014239.1| GL19091 [Drosophila persimilis]
gi|194106192|gb|EDW28235.1| GL19091 [Drosophila persimilis]
Length = 672
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 96/323 (29%), Positives = 152/323 (47%), Gaps = 35/323 (10%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+ ++S S ++NG ++ + HY R+VP W ++ + G+N +++YV W+ H
Sbjct: 49 IDHESNSFVLNGEPFRYVAGSFHYFRAVPEAWRSRLRTMRASGLNAVDTYVEWSLHNPHD 108
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWL-HYIPGTVFRNDTE 146
G Y + G ++VKF++I Q+ Y+ILR GP++ AE + GG+P WL P R
Sbjct: 109 GVYNWEGIADVVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFKKYPSIKMRTSDS 168
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALW---- 202
+ + K+ + M + + L GG II+ QVENEYG YE K Y W
Sbjct: 169 NYMAEVGKWYAEL--MPRLQHLLIGNGGKIIMVQVENEYGDYEC-----DKDYLNWLRDE 221
Query: 203 ----AAKMAVAQNIGVP--WIMCQQFD----TPDPVINTCNSF--YCDQFTPHSPSMPKI 250
+ A+ +P + C + D T D I+ + P+ P +
Sbjct: 222 TEKYVNRNALLFTTDIPNERMSCGKIDNVFATTDFGIDRIHEIDDIWTMLRKLQPTGPLV 281
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF-- 308
+E +PGW + + R + +A ++ SV N YM+ GGTNFG TAG +
Sbjct: 282 NSEFYPGWLTHWQEMNQRRDGQVVADALKTILSYNASV-NLYMFFGGTNFGFTAGANYNL 340
Query: 309 --------ITTSYDYEAPIDEYG 323
TSYDY+A +DE G
Sbjct: 341 DGGIGYAADITSYDYDAVMDEAG 363
>gi|21232326|ref|NP_638243.1| beta-galactosidase [Xanthomonas campestris pv. campestris str. ATCC
33913]
gi|21114096|gb|AAM42167.1| beta-galactosidase [Xanthomonas campestris pv. campestris str. ATCC
33913]
Length = 613
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 98/355 (27%), Positives = 158/355 (44%), Gaps = 26/355 (7%)
Query: 4 RTPIAPFALLIFFSSSITYCFAGNVTYDS-----RSLIINGRRELIISAAIHYPRSVPGM 58
RT +AP L + + IT A + + + + +G+ ++S AIH+ R
Sbjct: 3 RTTLAPLVLALSIALPITATAASDDQWPTFATQGTQFVRDGKPYQVLSGAIHFQRIPRTY 62
Query: 59 WPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGP 118
W +Q+A+ G+NT+E+YVFWN E G++ F ++ F++ + +ILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFNANNDVAAFVREAAAQGLNVILRPGP 122
Query: 119 FVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIIL 178
+ AE+ GG P WL R+ F Q ++ + ++ L GGPII
Sbjct: 123 YACAEWEAGGYPAWLFGKDNIRIRSRDPRFLAASQSYLDAVAQQVR--PLLNHNGGPIIA 180
Query: 179 AQVENEYGYYESFYGEGGKRYALWAA----KMAVAQNIGVPWIMCQQFDTPDPVINTC-- 232
QVENEYG Y+ + A++ K + + G + V+N
Sbjct: 181 VQVENEYGSYDDDHAYMADNRAMFVKAGFDKALLFTSDGADMLANGTLPGTLAVVNFAPG 240
Query: 233 -NSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNY 291
D+ P P++ E W GWF +G ++ + ++G S N
Sbjct: 241 EAKSAFDKLIKFQPDQPRMVGEYWAGWFDHWGTPHASTNAKQQTEELEWILRQGHSA-NL 299
Query: 292 YMYHGGTNFGRTAGGPF----------ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
YM+ GGT+FG G F TTSYDY+A +DE G P PK+ ++++
Sbjct: 300 YMFIGGTSFGFMNGANFQGNPSDHYAPQTTSYDYDAILDEAGRP-TPKFALMRDV 353
>gi|347967093|ref|XP_320991.5| AGAP002058-PA [Anopheles gambiae str. PEST]
gi|333469761|gb|EAA01064.5| AGAP002058-PA [Anopheles gambiae str. PEST]
Length = 630
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 93/329 (28%), Positives = 157/329 (47%), Gaps = 24/329 (7%)
Query: 27 NVTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELS 86
++ + + + +G+ IS + HY R++P W +++ + G+NT+ +Y+ W+ HE
Sbjct: 33 DIDFQNDTFTKDGQPFQFISGSFHYFRALPESWRHILRSMRAAGLNTVMTYIEWSLHEPM 92
Query: 87 PGKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW-LHYIPGTVFRNDT 145
PG+Y + G NL +FI+I Q +++ILR GP++ AE + GG P W L P R
Sbjct: 93 PGQYQWEGIANLEEFIEIAQSENLFVILRPGPYICAERDMGGFPHWLLTKYPSIKLRTYD 152
Query: 146 EPFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGE-----GGKRYA 200
+ +Q + ++ + R GGP+I+ +ENEYG +++ G+
Sbjct: 153 TDYLREVQNWYNQLMPRLVR--YLYGNGGPVIMVSIENEYGSFKACDGQYMQFLKNLTVH 210
Query: 201 LWAAKMAVAQNIGVPWIMCQQFDTPDP-----VINTCNSFYCDQFTPHSPSMPKIWTENW 255
K + N G + C P + N N+F+ Q + P P + E +
Sbjct: 211 FVQDKAVLFTNDGPELLKCGSIPGILPTLDFGITNNPNAFW-QQLRKYLPKGPLVNAEYY 269
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI------ 309
PGW T R + + + + N+YM+ GGTNFG TAG +
Sbjct: 270 PGWL-THWMEPTARVDAGMVVNTLKLMLNQKANVNFYMFFGGTNFGFTAGANDVGPGKYS 328
Query: 310 --TTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+AP+DE G P PK+ ++++
Sbjct: 329 ADITSYDYDAPLDEAGDP-TPKYFAIRKV 356
>gi|187736173|ref|YP_001878285.1| beta-galactosidase [Akkermansia muciniphila ATCC BAA-835]
gi|187426225|gb|ACD05504.1| Beta-galactosidase [Akkermansia muciniphila ATCC BAA-835]
Length = 780
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 101/316 (31%), Positives = 148/316 (46%), Gaps = 32/316 (10%)
Query: 29 TYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPG 88
+ + + +++G+ IIS +HYPR W Q+ K G+NT+ +Y+FWN HE PG
Sbjct: 35 STNQENFLMDGKPVKIISGEMHYPRVPRQHWKDRFQRIKAMGMNTVCTYLFWNVHEPEPG 94
Query: 89 KYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPF 148
K+ F G + V+FIK Q+A +++I+R GP+V AE+ +GG P WL R+ F
Sbjct: 95 KWDFSGNLDFVEFIKEAQKAGLWVIVRPGPYVCAEWEFGGFPGWLLKDEDLKVRSQDPRF 154
Query: 149 KYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAV 208
++ + M+ E L ++GGPII+AQVENEYG Y S K Y K
Sbjct: 155 LEPAMAYLKKVCSML--EPLQITKGGPIIMAQVENEYGSYGS-----DKDY---VKKHLD 204
Query: 209 AQNIGVPWIMCQQFDTPD---------PVINTCNSF------YCDQFTPHSPSMPKIWTE 253
+P ++ D P+ P + +F H P+I E
Sbjct: 205 VIRKELPGVVPFTSDGPNDWMIKNGTLPGVVPAMNFGGGAKGAFANLEKHKGKTPRINGE 264
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG----GPFI 309
W GWF +G +E + + S N +M HGGT+FG G G +
Sbjct: 265 FWVGWFDHWGKPKNGGSTEGFNRDLKWMLENNVS-PNLFMAHGGTSFGFMNGANWEGAYT 323
Query: 310 --TTSYDYEAPIDEYG 323
T+YDY API E G
Sbjct: 324 PDVTNYDYGAPISENG 339
>gi|318077940|ref|ZP_07985272.1| beta-galactosidase [Streptomyces sp. SA3_actF]
Length = 588
Score = 135 bits (340), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 99/317 (31%), Positives = 145/317 (45%), Gaps = 44/317 (13%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
++GR ++S A+HY R +P WP ++ + G+NT+E+YV WN HE PG + F G+
Sbjct: 11 LDGRPLRLLSGALHYFRVLPEQWPHRLRMLRALGLNTVETYVPWNFHEPRPGHHDFTGQA 70
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGT-VFRNDTEPFKYHMQKF 155
+L F+ + A ++ I+R P++ AE+ GG+P WL P R + H+ ++
Sbjct: 71 DLDAFLHATRDAGLHAIVRPSPYICAEWENGGLPWWLLADPEVRALRCQDPAYLAHVDRW 130
Query: 156 MTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVP 215
++ + ++ +QGG +++ QVENEYG Y + G Y A + I VP
Sbjct: 131 YDALIPRLAAHQV--TQGGNVVMMQVENEYGSYGTDTG-----YLEHLADGMRRRGIDVP 183
Query: 216 WIMCQQFDTPDPVINTCNSF------------------YCDQFTPHSPSMPKIWTENWPG 257
D PD T + + PH P M E W G
Sbjct: 184 LFTS---DGPDDFFLTGGTLPGHLATVNFGSRPAQAFAGLKRLRPHDPPM---CAEFWCG 237
Query: 258 WFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--------- 308
WF +G R + + +A GGSV N YM HGGTNF AG
Sbjct: 238 WFDHWGAPRTVRDAAEATEELAATLGAGGSV-NVYMAHGGTNFSTWAGANTEDPATGAGY 296
Query: 309 --ITTSYDYEAPIDEYG 323
TSYDY+APIDE G
Sbjct: 297 LPTVTSYDYDAPIDERG 313
>gi|318059605|ref|ZP_07978328.1| beta-galactosidase [Streptomyces sp. SA3_actG]
Length = 588
Score = 135 bits (340), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 99/317 (31%), Positives = 145/317 (45%), Gaps = 44/317 (13%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
++GR ++S A+HY R +P WP ++ + G+NT+E+YV WN HE PG + F G+
Sbjct: 11 LDGRPLRLLSGALHYFRVLPEQWPHRLRMLRALGLNTVETYVPWNFHEPRPGHHDFTGQA 70
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGT-VFRNDTEPFKYHMQKF 155
+L F+ + A ++ I+R P++ AE+ GG+P WL P R + H+ ++
Sbjct: 71 DLDAFLHATRDAGLHAIVRPSPYICAEWENGGLPWWLLADPEVRALRCQDPAYLAHVDRW 130
Query: 156 MTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVP 215
++ + ++ +QGG +++ QVENEYG Y + G Y A + I VP
Sbjct: 131 YDALIPRLAAHQV--TQGGNVVMMQVENEYGSYGTDTG-----YLEHLADGMRRRGIDVP 183
Query: 216 WIMCQQFDTPDPVINTCNSF------------------YCDQFTPHSPSMPKIWTENWPG 257
D PD T + + PH P M E W G
Sbjct: 184 LFTS---DGPDDFFLTGGTLPGHLATVNFGSRPAQAFAGLKRLRPHDPPM---CAEFWCG 237
Query: 258 WFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--------- 308
WF +G R + + +A GGSV N YM HGGTNF AG
Sbjct: 238 WFDHWGAPRTVRDAAEATEELAATLGAGGSV-NVYMAHGGTNFSTWAGANTEDPATGAGY 296
Query: 309 --ITTSYDYEAPIDEYG 323
TSYDY+APIDE G
Sbjct: 297 LPTVTSYDYDAPIDERG 313
>gi|414563760|ref|YP_006042721.1| beta-galactosidase precursor [Streptococcus equi subsp.
zooepidemicus ATCC 35246]
gi|338846825|gb|AEJ25037.1| beta-galactosidase precursor [Streptococcus equi subsp.
zooepidemicus ATCC 35246]
Length = 594
Score = 135 bits (340), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 100/319 (31%), Positives = 149/319 (46%), Gaps = 45/319 (14%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
++G+ I+S AIHY R P W ++ Q K G NT+E+Y+ WN HE G++ F G
Sbjct: 12 LDGKPFKILSGAIHYFRIAPDSWSRVLYQLKALGFNTVETYIPWNMHEPRKGQFTFEGIA 71
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
++ F+ + Q+ +Y I+R P++ AE+ +GG+P WL R+ E F H+ +
Sbjct: 72 DVEAFLDLAQECGLYAIVRPSPYICAEWEFGGLPAWL-LTENCRLRSSDEVFLKHVSDYY 130
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYY--ESFYGEGGKRYAL------------- 201
+++ + + +L GG I++ Q+ENEYG Y E Y K L
Sbjct: 131 DVLLPKLVKRQL--DNGGNILMFQLENEYGSYGEEKAYLRKLKELMLAKGISAPLFTSDG 188
Query: 202 -WAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYC--DQFTPHSPSMPKIWTENWPGW 258
W+A +A I + F + N F D F H P + E W GW
Sbjct: 189 PWSATLASGSLIDDDVFVTGNFGS-----NASKQFASMQDFFQAHQKQWPLMCMEFWLGW 243
Query: 259 FKTFGG----RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-------- 306
F + RDP + I ++ + GS+ N YM+ GGTNFG G
Sbjct: 244 FNRWNEPIIRRDPKETVDAIMEAI-----ELGSI-NLYMFCGGTNFGFMNGSSARLQKDL 297
Query: 307 PFITTSYDYEAPIDEYGLP 325
P I TSYDY+A +DE G P
Sbjct: 298 PQI-TSYDYDALLDEAGNP 315
>gi|417991864|ref|ZP_12632235.1| beta-galactosidase 3 [Lactobacillus casei CRF28]
gi|410534805|gb|EKQ09440.1| beta-galactosidase 3 [Lactobacillus casei CRF28]
Length = 598
Score = 135 bits (340), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 103/337 (30%), Positives = 155/337 (45%), Gaps = 50/337 (14%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
+++G+ I+S AIHY R P W + K G NT+E+YV WN HE + G +
Sbjct: 7 DHEFMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYNEGDFD 66
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G ++ F+ + +Y I+R P++ AE+ +GG P WL R D +
Sbjct: 67 FSGILDIEHFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDSAYLQA 125
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+ ++ T ++ + ++ + GG +I+ QVENEYG Y GE K Y A++
Sbjct: 126 IDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSY----GED-KDYLAAVAELMKKHG 178
Query: 212 IGVPWIMCQQFDTPDP------------VINTCN-----SFYCDQFTP----HSPSMPKI 250
+ VP D P P ++ T N D+ H P +
Sbjct: 179 VDVPLFTS---DGPWPATLNAGSMADAGILTTGNFGSHADMNFDRLAAFNQAHGHDWPLM 235
Query: 251 WTENWPGWFKTFGG----RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
E W GWF +G RDP +ED+ + R GSV N YM+HGGTNFG G
Sbjct: 236 CMEFWDGWFNRWGEPIIRRDPEETAEDLRAVIQR-----GSV-NLYMFHGGTNFGFMNGT 289
Query: 307 PF-------ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+AP++E G P PK+ ++++
Sbjct: 290 SARKDHDLPQVTSYDYDAPLNEQGNP-TPKYFAIQKM 325
>gi|313240094|emb|CBY32448.1| unnamed protein product [Oikopleura dioica]
Length = 677
Score = 135 bits (340), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 89/297 (29%), Positives = 146/297 (49%), Gaps = 17/297 (5%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+T D + ++G+ I+S AIHY R W +Q + G+NTI+ Y+ WN HE
Sbjct: 8 LTADGDTFKLDGKDFRILSGAIHYFRIPKQSWKHRLQSVVDCGLNTIDVYIPWNLHEKER 67
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G + FGG +LV+F I + + ++ R GP++ +E+++GG+P WL P R++
Sbjct: 68 GNFDFGGELDLVEFFTIAAEMGLKVLCRPGPYICSEWDWGGLPSWLLKDPKMHIRSNYCG 127
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMA 207
++ + + + ++ ++ L S GGPII QVENEYG Y + + W A +
Sbjct: 128 YQAAVSSYFSKLLPLLA--PLQHSNGGPIIAFQVENEYGDYV----DKDNEHLPWLADLM 181
Query: 208 VAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHS-----PSMPKIWTENWPGWFKTF 262
+ + + + T I N + TP S P+ P + TE W GWF +
Sbjct: 182 KSHGLFELFFISDGGHT----IRKANMLKLTKSTPISLKSLQPNKPMLVTEFWAGWFDYW 237
Query: 263 GGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYEAPI 319
G ++ ++ ++G SV N+YM+HGGTNFG G + Y Y A +
Sbjct: 238 GHGRNLLNNDVFEKTLKEILKRGASV-NFYMFHGGTNFGFMNGAIELEKGY-YTADV 292
>gi|328713057|ref|XP_001947370.2| PREDICTED: beta-galactosidase-like [Acyrthosiphon pisum]
Length = 630
Score = 135 bits (340), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 97/339 (28%), Positives = 160/339 (47%), Gaps = 38/339 (11%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
V Y+ I +G +S ++HY R W +++ K G+N I YV W+ HE
Sbjct: 30 VDYEKNEFIKDGNIFRYVSGSLHYFRVPRPYWRDRIRKMKSAGLNAISFYVEWSFHEPYS 89
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVW-LHYIPGTVFRNDTE 146
G Y F G+ ++ F+ I +Q M +++R GPF++AE + GG P W L P R+
Sbjct: 90 GVYDFEGQADIEHFLTISKQENMNVLIRPGPFISAERDLGGHPYWLLKEKPSLHLRSSDP 149
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALW---- 202
+K +++++ +++ M K GG II+ Q+ENEYG+ + G K Y LW
Sbjct: 150 NYKKYIKRWFSVL--MPKIVPFLYGNGGNIIMVQIENEYGHND--LGNCDKEYMLWLRDL 205
Query: 203 -------AAKMAVAQNIGVPWIMCQQ----FDTPD--PVINTCNSFYCDQFTPHSPSMPK 249
A++ + ++ C Q + T D V+N F P
Sbjct: 206 FHHYVGEQAQLYTTDECNLSFLECGQIPNVYSTVDFAAVVNVTECF--QHLRQVQKKGPL 263
Query: 250 IWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFI 309
+ +E + GW + P R + DI V+++F + N++M+HGGTNFG ++G +
Sbjct: 264 VNSEFYDGWVAFWDSPRPVRNTSDI-IRVSKYFLEANVSFNFFMFHGGTNFGFSSGANTM 322
Query: 310 ------------TTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYD+ AP+DE G P K+ +K++
Sbjct: 323 GTTLDKSGYRPQLTSYDFTAPLDEAGDPTE-KYHAIKQI 360
>gi|227533108|ref|ZP_03963157.1| beta-galactosidase 3, partial [Lactobacillus paracasei subsp.
paracasei ATCC 25302]
gi|227189289|gb|EEI69356.1| beta-galactosidase 3 [Lactobacillus paracasei subsp. paracasei ATCC
25302]
Length = 578
Score = 135 bits (340), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 103/337 (30%), Positives = 156/337 (46%), Gaps = 50/337 (14%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
+++G+ I+S AIHY R P W + K G NT+E+YV WN HE + G +
Sbjct: 14 DHEFMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYNEGDFD 73
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G ++ +F+ + +Y I+R P++ AE+ +GG P WL R D +
Sbjct: 74 FSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDPAYLQA 132
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+ ++ T ++ + ++ + GG +I+ QVENEYG Y GE K Y A++
Sbjct: 133 IDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSY----GED-KDYLAAVAELMKKHG 185
Query: 212 IGVPWIMCQQFDTPDP------------VINTCN-----SFYCDQFTP----HSPSMPKI 250
+ VP D P P ++ T N D+ H P +
Sbjct: 186 VDVPLFTS---DGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLM 242
Query: 251 WTENWPGWFKTFGG----RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
E W GWF +G RDP +ED+ + R GSV N YM+HGGTNFG G
Sbjct: 243 CMEFWDGWFNRWGEPIIRRDPEETAEDLRAVIQR-----GSV-NLYMFHGGTNFGFMNGT 296
Query: 307 PF-------ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+AP++E G P PK+ ++++
Sbjct: 297 SARKDHDLPQVTSYDYDAPLNEQGNP-TPKYFAIQKM 332
>gi|225407896|ref|ZP_03761085.1| hypothetical protein CLOSTASPAR_05117 [Clostridium asparagiforme
DSM 15981]
gi|225042575|gb|EEG52821.1| hypothetical protein CLOSTASPAR_05117 [Clostridium asparagiforme
DSM 15981]
Length = 590
Score = 135 bits (340), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 93/319 (29%), Positives = 147/319 (46%), Gaps = 34/319 (10%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
+ ++GR ++S A+HY R +P W + K G NT+E+Y+ WN HE G++
Sbjct: 7 NEEFCLDGRPVKLLSGAVHYFRLMPEYWEDCLYNLKAMGFNTVETYIPWNIHEPEEGEFD 66
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G ++ F+++ +++ILR PF+ AE+ GG+P WL P R +T F
Sbjct: 67 FSGSRDVEAFVRLAGSMGLHVILRPSPFICAEWEMGGLPAWLLRYPDMKVRTNTPLFLVK 126
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY----------ESFYGEGGKRYAL 201
++ + + + L ++GGP+IL QVENEYG + +S G
Sbjct: 127 VEAYYRELFRHIA--DLQITRGGPVILMQVENEYGSFGNDKEYLRRIKSLMERFGAEVPF 184
Query: 202 ------WAAKMAVAQNIGVPWIMCQQFDT-PDPVINTCNSFYCDQFTPHSPSMPKIWTEN 254
W A + I + F + D ++ +F F H P + E
Sbjct: 185 FTSDGSWDAALEAGSLIEDGVLATANFGSRSDENLDVLEAF----FKRHGRKWPLMCMEF 240
Query: 255 WPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG-------- 306
W GWF + + R +ED+A V + ++ N YM+ GGTNFG G
Sbjct: 241 WDGWFNRWREKIITRDAEDLAMEVRQLLERASI--NLYMFQGGTNFGFYNGCSARGYTDL 298
Query: 307 PFITTSYDYEAPIDEYGLP 325
P I TSY+Y+A + E+G P
Sbjct: 299 PQI-TSYNYDAILTEWGQP 316
>gi|417994975|ref|ZP_12635282.1| beta-galactosidase 3 [Lactobacillus casei M36]
gi|410539221|gb|EKQ13758.1| beta-galactosidase 3 [Lactobacillus casei M36]
Length = 598
Score = 135 bits (340), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 103/337 (30%), Positives = 156/337 (46%), Gaps = 50/337 (14%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
+++G+ I+S AIHY R P W + K G NT+E+YV WN HE S G +
Sbjct: 7 DHEFMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYSEGDFD 66
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G ++ +F+ + +Y I+R P++ AE+ +GG P WL R D +
Sbjct: 67 FSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDPAYLQA 125
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+ ++ T ++ + ++ + GG +I+ QVENEYG Y GE K Y A++
Sbjct: 126 IDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSY----GED-KDYLAAVAELMKKHG 178
Query: 212 IGVPWIMCQQFDTPDP------------VINTCN-----SFYCDQFTP----HSPSMPKI 250
+ VP D P P ++ T N D+ H P +
Sbjct: 179 VDVPLFTS---DGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLM 235
Query: 251 WTENWPGWFKTFGG----RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
E W GWF +G RDP +E++ + R GSV N YM+HGGTNFG G
Sbjct: 236 CMEFWDGWFNRWGEPIIRRDPEETAENLRAVIQR-----GSV-NLYMFHGGTNFGFMNGT 289
Query: 307 PF-------ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+AP++E G P PK+ ++++
Sbjct: 290 SARKDHDLPQVTSYDYDAPLNEQGNP-TPKYFAIQKM 325
>gi|392987629|ref|YP_006486222.1| glucosyl hydrolase family protein [Enterococcus hirae ATCC 9790]
gi|392335049|gb|AFM69331.1| glucosyl hydrolase family protein [Enterococcus hirae ATCC 9790]
Length = 592
Score = 135 bits (340), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 96/318 (30%), Positives = 148/318 (46%), Gaps = 35/318 (11%)
Query: 33 RSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYF 92
++NG+ I+S AIHY R W + K G NT+E+YV WN HE G ++F
Sbjct: 8 EDFLLNGKPFKILSGAIHYFRVDSADWYHSLYNLKALGFNTVETYVPWNLHEPKKGDFHF 67
Query: 93 GGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHM 152
G +L F+ I ++ +Y I+R P++ AE+ +GG P WL GT R + + H+
Sbjct: 68 EGILDLEHFLSIAEELGLYAIVRPSPYICAEWEFGGFPAWL-LNEGTRIRTNETVYLNHV 126
Query: 153 QKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY--ESFYGEGGKRYAL--------- 201
+ +++ + +L + GG I++ Q+ENEYG Y E Y + L
Sbjct: 127 ADYYDVLIKKIVPHQL--TNGGNILMIQIENEYGSYGEEKDYLRSIRDLMLDRGITVPFF 184
Query: 202 -----WAAKMAVAQNIGVPWIMCQQFDT-PDPVINTCNSFYCDQFTPHSPSMPKIWTENW 255
W A + I ++ F + + ++ +F F H P + E W
Sbjct: 185 TSDGPWRATLRAGSMIDEDILVTGNFGSKAEENFSSMEAF----FNEHGKKWPLMCMEFW 240
Query: 256 PGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG--------P 307
GWF + R ++++A ++ +G N YM+HGGTNFG G P
Sbjct: 241 DGWFNRWKEPIVQRDAKELAEAIKEVVLRGSI--NLYMFHGGTNFGFMNGCSARGVIDLP 298
Query: 308 FITTSYDYEAPIDEYGLP 325
I TSYDY AP+DE G P
Sbjct: 299 QI-TSYDYGAPLDEQGNP 315
>gi|422861007|ref|ZP_16907651.1| beta-galactosidase [Streptococcus sanguinis SK330]
gi|327468658|gb|EGF14137.1| beta-galactosidase [Streptococcus sanguinis SK330]
Length = 592
Score = 135 bits (340), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 96/317 (30%), Positives = 149/317 (47%), Gaps = 40/317 (12%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
++G+ I+S AI Y R P W + K G NT+E+Y+ W HE G++ G
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
+ + K++++ +Y+I+R P++ AE+++GG+P WL P R + F + F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVP- 215
+ + + + QGGPI++ QVENEYG Y K Y A+M + + VP
Sbjct: 132 DWLFPKLLPYQ--SDQGGPILMMQVENEYGSYAE-----DKAYMRSIAQMMKVRGVSVPL 184
Query: 216 ------WIMCQQFDT--PDPVINTCNSFYCDQFTPHSPSM-----------PKIWTENWP 256
WI + T D + T N + Q ++ ++ P + TE W
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGN--FGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 257 GWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG--------RTAGGPF 308
GWF + R +ED+A V Q G N ++ GGTNFG +T P
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 309 ITTSYDYEAPIDEYGLP 325
I TSYD++API E+G P
Sbjct: 301 I-TSYDFDAPITEWGQP 316
>gi|198475912|ref|XP_002132214.1| GA25341 [Drosophila pseudoobscura pseudoobscura]
gi|198137462|gb|EDY69616.1| GA25341 [Drosophila pseudoobscura pseudoobscura]
Length = 672
Score = 135 bits (340), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 95/323 (29%), Positives = 150/323 (46%), Gaps = 35/323 (10%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+ ++S S ++NG ++ + HY R+VP W ++ + G+N +++YV W+ H
Sbjct: 49 IDHESNSFVLNGEPFRYVAGSFHYFRAVPEAWRSRLRTMRASGLNAVDTYVEWSLHNPHD 108
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWL-HYIPGTVFRNDTE 146
G Y + G ++VKF++I Q+ Y+ILR GP++ AE + GG+P WL P R
Sbjct: 109 GVYNWEGIADVVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFKKYPSIKMRTSDS 168
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALW---- 202
+ + K+ + M + + L GG II+ QVENEYG YE K Y W
Sbjct: 169 NYMAEVGKWYAEL--MPRLQHLLIGNGGKIIMVQVENEYGDYEC-----DKDYLNWLRDE 221
Query: 203 ------AAKMAVAQNIGVPWIMCQQFD----TPDPVINTCNSF--YCDQFTPHSPSMPKI 250
+ +I + C + D T D I+ + P+ P +
Sbjct: 222 TEKYVNGNALLFTTDIPNERMSCGKIDNVFATTDFGIDRIHEIDDIWAMLRKLQPTGPLV 281
Query: 251 WTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF-- 308
+E +PGW + + R + +A ++ SV N YM+ GGTNFG TAG +
Sbjct: 282 NSEFYPGWLTHWQEMNQRRDGQVVADALKTILSYNASV-NLYMFFGGTNFGFTAGANYNL 340
Query: 309 --------ITTSYDYEAPIDEYG 323
TSYDY+A +DE G
Sbjct: 341 DGGVGYAADITSYDYDAVMDEAG 363
>gi|417988603|ref|ZP_12629136.1| beta-galactosidase 3 [Lactobacillus casei A2-362]
gi|417997907|ref|ZP_12638140.1| beta-galactosidase 3 [Lactobacillus casei T71499]
gi|418015108|ref|ZP_12654689.1| beta-galactosidase 3 [Lactobacillus casei Lpc-37]
gi|410541233|gb|EKQ15720.1| beta-galactosidase 3 [Lactobacillus casei A2-362]
gi|410542248|gb|EKQ16704.1| beta-galactosidase 3 [Lactobacillus casei T71499]
gi|410552187|gb|EKQ26219.1| beta-galactosidase 3 [Lactobacillus casei Lpc-37]
Length = 598
Score = 135 bits (340), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 103/337 (30%), Positives = 156/337 (46%), Gaps = 50/337 (14%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
+++G+ I+S AIHY R P W + K G NT+E+YV WN HE S G +
Sbjct: 7 DHEFMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYSEGDFD 66
Query: 92 FGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYH 151
F G ++ +F+ + +Y I+R P++ AE+ +GG P WL R D +
Sbjct: 67 FSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDPAYLQA 125
Query: 152 MQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQN 211
+ ++ T ++ + ++ + GG +I+ QVENEYG Y GE K Y A++
Sbjct: 126 IDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSY----GED-KDYLAAVAELMKKHG 178
Query: 212 IGVPWIMCQQFDTPDP------------VINTCN-----SFYCDQFTP----HSPSMPKI 250
+ VP D P P ++ T N D+ H P +
Sbjct: 179 VDVPLFTS---DGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLM 235
Query: 251 WTENWPGWFKTFGG----RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG 306
E W GWF +G RDP +E++ + R GSV N YM+HGGTNFG G
Sbjct: 236 CMEFWDGWFNRWGEPIIRRDPEETAENLRAVIQR-----GSV-NLYMFHGGTNFGFMNGT 289
Query: 307 PF-------ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
TSYDY+AP++E G P PK+ ++++
Sbjct: 290 SARKDHDLPQVTSYDYDAPLNEQGNP-TPKYFAIQKM 325
>gi|456387967|gb|EMF53457.1| glycosyl hydrolase family 42 [Streptomyces bottropensis ATCC 25435]
Length = 591
Score = 135 bits (340), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 94/318 (29%), Positives = 147/318 (46%), Gaps = 28/318 (8%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+T S +++G IIS A+HY R P +W +++A+ G+NT+E+YV WN H+ P
Sbjct: 6 LTTTSDGFLLHGEPFRIISGAMHYFRIHPDLWADRLRKARLMGLNTVETYVPWNLHQPDP 65
Query: 88 GK-YYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTE 146
G +L +++++ + ++++LR GP++ AE++ GG+P WL P R+
Sbjct: 66 DSPLVLDGLLDLPRYLRLARAEGLHVLLRPGPYICAEWDGGGLPSWLTSDPDIRLRSSDP 125
Query: 147 PFKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKM 206
F + + L + + A+ GP+I QVENEYG Y Y +
Sbjct: 126 RFTAALDGY--LDILLPPLLPYMAANDGPVIAVQVENEYGAYGD-----DTAYLKHVHQA 178
Query: 207 AVAQNIGVPWIMCQQFDTPD-------PVINTCNSF------YCDQFTPHSPSMPKIWTE 253
A+ + C Q + P + + +F H P P + +E
Sbjct: 179 LRARGVEELLFTCDQAGSGHHLAAGSLPGVLSTATFGGKIEESLAALRAHMPEGPLMCSE 238
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF----- 308
W GWF +G R + A + + G SV N YM+HGGTNFG T G
Sbjct: 239 FWIGWFDHWGEEHHVRDAAGAAADLDKLLAAGASV-NIYMFHGGTNFGFTNGANHDQCYA 297
Query: 309 -ITTSYDYEAPIDEYGLP 325
I TSYDY+A + E G P
Sbjct: 298 PIVTSYDYDAALTESGDP 315
>gi|66767541|ref|YP_242303.1| beta-galactosidase [Xanthomonas campestris pv. campestris str.
8004]
gi|66572873|gb|AAY48283.1| beta-galactosidase [Xanthomonas campestris pv. campestris str.
8004]
Length = 613
Score = 135 bits (340), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 98/355 (27%), Positives = 158/355 (44%), Gaps = 26/355 (7%)
Query: 4 RTPIAPFALLIFFSSSITYCFAGNVTYDS-----RSLIINGRRELIISAAIHYPRSVPGM 58
RT +AP L + + IT A + + + + +G+ ++S AIH+ R
Sbjct: 3 RTTLAPLVLALSIALPITATAASDDQWPTFATQGTQFVRDGKPYQVLSGAIHFQRIPRTY 62
Query: 59 WPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFNLVKFIKIIQQARMYMILRIGP 118
W +Q+A+ G+NT+E+YVFWN E G++ F ++ F++ + +ILR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFNANNDVAAFVREAAAQGLNVILRPGP 122
Query: 119 FVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFMTLIVDMMKREKLFASQGGPIIL 178
+ AE+ GG P WL R+ F Q ++ + ++ L GGPII
Sbjct: 123 YACAEWEAGGYPAWLFGKDNIRIRSRDPRFLAASQSYLDAVAQQVR--PLLNHNGGPIIA 180
Query: 179 AQVENEYGYYESFYGEGGKRYALWAA----KMAVAQNIGVPWIMCQQFDTPDPVINTC-- 232
QVENEYG Y+ + A++ K + + G + V+N
Sbjct: 181 VQVENEYGSYDDDHAYIADNRAMFVKAGFDKALLFTSDGADMLANGTLPGTLAVVNFAPG 240
Query: 233 -NSFYCDQFTPHSPSMPKIWTENWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNY 291
D+ P P++ E W GWF +G ++ + ++G S N
Sbjct: 241 EAKSAFDKLIKFQPDQPRMVGEYWAGWFDHWGTPHASTNAKQQTEELEWILRQGHSA-NL 299
Query: 292 YMYHGGTNFGRTAGGPF----------ITTSYDYEAPIDEYGLPRNPKWGHLKEL 336
YM+ GGT+FG G F TTSYDY+A +DE G P PK+ ++++
Sbjct: 300 YMFIGGTSFGFMNGANFQGNPSDHYAPQTTSYDYDAILDEAGRP-TPKFALMRDV 353
>gi|221129758|ref|XP_002162955.1| PREDICTED: beta-galactosidase-like [Hydra magnipapillata]
Length = 620
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 95/316 (30%), Positives = 151/316 (47%), Gaps = 25/316 (7%)
Query: 28 VTYDSRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSP 87
+ +++ + +G IS ++HY R W +++AK G+NTI+SYV WN HE++
Sbjct: 27 IDFENNCFLKDGSPFRYISGSMHYFRIPKLYWNDSMKKAKSMGLNTIQSYVAWNIHEINE 86
Query: 88 GKYYFGGRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEP 147
G Y F +++ FI + QQ + +ILR GP++ AE+ +GG P W+ T+ + +
Sbjct: 87 GHYDFNDDKDIINFINLAQQNDLLVILRPGPYIDAEWEFGGFPWWMAKSNMTMRTSGDKS 146
Query: 148 FKYHMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYY----ESFYGEGGKRYALWA 203
+ ++ + ++++ M+ + GGPII QVENEYG Y + E + L
Sbjct: 147 YMKYVSNWFSILLPMINQ--YLYKNGGPIIAVQVENEYGNYYACDHEYMKELKNLFQLHL 204
Query: 204 AKMAV---AQNIGVPWIMC----QQFDTPD--PVINTCNSFYCDQFTPHSPSMPKIWTEN 254
V ++ C F T D I+ +F H P + +E
Sbjct: 205 GNDVVLFTTDGYTDDYLKCGTIPSLFTTIDFGTEISAVEAFKL--LRNHQKKGPLVNSEF 262
Query: 255 WPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAG-----GPFI 309
+ GW +G R + +IA + + SV N YM+ GGTNFG G G F+
Sbjct: 263 YTGWLDYWGKNHQKRNARNIALHLDEILKLNASV-NLYMFQGGTNFGYMNGADMSDGQFL 321
Query: 310 T--TSYDYEAPIDEYG 323
TSYDY+API E G
Sbjct: 322 ISPTSYDYDAPISEAG 337
>gi|297842039|ref|XP_002888901.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297334742|gb|EFH65160.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 686
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 102/355 (28%), Positives = 160/355 (45%), Gaps = 47/355 (13%)
Query: 38 NGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRFN 97
+G II +HY R +P W + +AK G+NTI+ YV WN HE PGK F G +
Sbjct: 72 DGNHFQIIGGDLHYFRVLPEYWEDRLLRAKALGLNTIQVYVPWNLHEPKPGKMVFEGIGD 131
Query: 98 LVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYI-PGTVFRNDTEPFKYHMQKFM 156
LV F+K+ + ++LR GP++ E++ GG P WL + P R + ++++
Sbjct: 132 LVSFLKLCDKLDFMVMLRAGPYICGEWDLGGFPAWLLSVKPRLQLRTSDPAYLKLVERWW 191
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYYES----------------------FYGE 194
++ + K L S GGP+I+ Q+ENEYG Y + + +
Sbjct: 192 GVL--LPKIFPLIYSNGGPVIMVQIENEYGSYGNDKAYLRKLVSMARGHLGDDIIVYTTD 249
Query: 195 GGKRYALWAAKMAVAQNIGVPWIMCQQFDTPDPVINTCNSFYCDQFTPHSP-SMPKIWTE 253
GG + L + V + D P P+ F ++P S P + +E
Sbjct: 250 GGTKETLEKGTVPVDDVYSA--VDFTTGDDPWPIFELQKKF-------NAPGSSPPLSSE 300
Query: 254 NWPGWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGG------- 306
+ GW +G + +E A S+ + + GS YM HGGTNFG G
Sbjct: 301 FYTGWLTHWGEKIAKTDAEFTATSLEKILSRNGSAV-LYMVHGGTNFGFYNGANTGSEES 359
Query: 307 ---PFITTSYDYEAPIDEYGLPRNPKWGHLKELHGAIKLCEHALLNGERSNLSLG 358
P + TSYDY+API E G NPK+ L+ + + H+++ + + G
Sbjct: 360 DYKPDL-TSYDYDAPIKESGDIDNPKFRALQRVIKKYNVASHSIIPSNKQRKAYG 413
>gi|308457238|ref|XP_003091008.1| hypothetical protein CRE_12379 [Caenorhabditis remanei]
gi|308258733|gb|EFP02686.1| hypothetical protein CRE_12379 [Caenorhabditis remanei]
Length = 628
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 96/310 (30%), Positives = 148/310 (47%), Gaps = 20/310 (6%)
Query: 32 SRSLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYY 91
+R +++G IS +IHY R W +Q+ + G N I+ Y+ WN HEL G +
Sbjct: 17 NRQFLLDGLPFRYISGSIHYFRIPRERWDERLQKVRALGFNAIQYYIPWNTHELEEGIHD 76
Query: 92 FGGRFNLVKFIKI-IQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKY 150
F G + +F + + ++ ILRIGP++ E+ GG+P WL T R+ F
Sbjct: 77 FSGILDFAEFSSLAFHKYNLWTILRIGPYICGEWENGGLPAWLLTKNVTKQRSSDPVFTR 136
Query: 151 HMQKFMTLIVDMMKREKLFASQGGPIILAQVENEYGYYES-------FYGEGGKRY---- 199
++K+ ++ +K L GGPI++ Q+ENEYG Y++ F + + +
Sbjct: 137 EVEKWFETLLPRVK--PLLRKNGGPILMLQIENEYGSYDACDKQYLRFLRDLTRAHVGDD 194
Query: 200 ALWAAKMAVAQNIGVPWIMCQQFDTPD--PVINTCN-SFYCDQFTPHSPSMPKIWTENWP 256
L A+N+ + F T D P NT + D +P+ P + +E +P
Sbjct: 195 VLLFTTDGSAENLLKCGTVEGVFPTIDFGPTDNTKDIQSNFDLQRKFAPNAPLVNSEYYP 254
Query: 257 GWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF---ITTSY 313
GW +G + PS + A+F G+ N+YM HGGTNFG G TSY
Sbjct: 255 GWLVLWGQKRQDLPSPQTIINGAQFIYSLGASINFYMIHGGTNFGFWNGAEVEAPCITSY 314
Query: 314 DYEAPIDEYG 323
DY+API E G
Sbjct: 315 DYDAPISEAG 324
>gi|422845798|ref|ZP_16892481.1| beta-galactosidase [Streptococcus sanguinis SK72]
gi|325688586|gb|EGD30603.1| beta-galactosidase [Streptococcus sanguinis SK72]
Length = 592
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 96/317 (30%), Positives = 149/317 (47%), Gaps = 40/317 (12%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
++G+ I+S AI Y R P W + K G NT+E+Y+ W HE G++ G
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
+ + K++++ +Y+I+R P++ AE+++GG+P WL P R + F + F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVP- 215
+ + + + QGGPI++ QVENEYG Y K Y A+M + + VP
Sbjct: 132 DWLFPKLLPYQ--SDQGGPILMMQVENEYGSYAE-----DKAYMRSIAQMMKVRGVTVPL 184
Query: 216 ------WIMCQQFDT--PDPVINTCNSFYCDQFTPHSPSM-----------PKIWTENWP 256
WI + T D + T N + Q ++ ++ P + TE W
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGN--FGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 257 GWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG--------RTAGGPF 308
GWF + R +ED+A V Q G N ++ GGTNFG +T P
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 309 ITTSYDYEAPIDEYGLP 325
I TSYD++API E+G P
Sbjct: 301 I-TSYDFDAPITEWGQP 316
>gi|375146511|ref|YP_005008952.1| glycoside hydrolase family protein [Niastella koreensis GR20-10]
gi|361060557|gb|AEV99548.1| glycoside hydrolase family 35 [Niastella koreensis GR20-10]
Length = 920
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 99/309 (32%), Positives = 149/309 (48%), Gaps = 22/309 (7%)
Query: 34 SLIINGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFG 93
+ +++G+ IIS +HYPR W +++AK G+NTI +YVFWN HE GKY F
Sbjct: 346 AFLLDGQPFQIISGEMHYPRVPREAWRDRMRKAKAMGLNTIGTYVFWNLHEPQKGKYDFS 405
Query: 94 GRFNLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQ 153
G ++ F+K Q+ +++ILR P+V AE+ +GG P WL I G R+ EP ++Q
Sbjct: 406 GNNDIAAFVKTAQEEGLWVILRPSPYVCAEWEFGGYPYWLQNIKGLEVRS-KEP--QYLQ 462
Query: 154 KFMTLIVDMMKR-EKLFASQGGPIILAQVENEYGYYES--FYGEGGKRYALWAAKMAVAQ 210
+ I+ + K+ L + GG I++ QVENEYG Y S Y + +R + A +
Sbjct: 463 AYKNYIMQVGKQLAPLQVNHGGNILMVQVENEYGAYGSDREYLDINRRLFIEAGFDGLLY 522
Query: 211 NIGVPWIMCQQFDTPDPVINTCNSF----YCDQFTPHSPS--MPKIWTENWPGWFKTFGG 264
P + + P + + N Q + P E +P WF +G
Sbjct: 523 TCD-PEPFLAKGNLPGKLFTSINGLDKPARIKQLIKQNNEGKGPYFVAEWYPAWFDWWGT 581
Query: 265 RDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPF--------ITTSYDYE 316
+ P+E + G SV N YM+HGGT G + +SYDY+
Sbjct: 582 QHHKVPAEKYTPGLDSVLSAGMSV-NMYMFHGGTTRDFMNGANYNDQNPYEPQISSYDYD 640
Query: 317 APIDEYGLP 325
AP+DE G P
Sbjct: 641 APLDEAGNP 649
>gi|422871792|ref|ZP_16918285.1| beta-galactosidase [Streptococcus sanguinis SK1087]
gi|328945306|gb|EGG39459.1| beta-galactosidase [Streptococcus sanguinis SK1087]
Length = 592
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 96/317 (30%), Positives = 149/317 (47%), Gaps = 40/317 (12%)
Query: 37 INGRRELIISAAIHYPRSVPGMWPGLVQQAKEGGVNTIESYVFWNGHELSPGKYYFGGRF 96
++G+ I+S AI Y R P W + K G NT+E+Y+ W HE G++ G
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 97 NLVKFIKIIQQARMYMILRIGPFVAAEYNYGGIPVWLHYIPGTVFRNDTEPFKYHMQKFM 156
+ + K++++ +Y+I+R P++ AE+++GG+P WL P R + F + F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131
Query: 157 TLIVDMMKREKLFASQGGPIILAQVENEYGYYESFYGEGGKRYALWAAKMAVAQNIGVP- 215
+ + + + QGGPI++ QVENEYG Y K Y A+M + + VP
Sbjct: 132 DWLFPKLLPYQ--SDQGGPILMMQVENEYGSYAE-----DKAYMRSIAQMMKVRGVTVPL 184
Query: 216 ------WIMCQQFDT--PDPVINTCNSFYCDQFTPHSPSM-----------PKIWTENWP 256
WI + T D + T N + Q ++ ++ P + TE W
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGN--FGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 257 GWFKTFGGRDPHRPSEDIAFSVARFFQKGGSVHNYYMYHGGTNFG--------RTAGGPF 308
GWF + R +ED+A V Q G N ++ GGTNFG +T P
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 309 ITTSYDYEAPIDEYGLP 325
I TSYD++API E+G P
Sbjct: 301 I-TSYDFDAPITEWGQP 316
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.137 0.440
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 13,553,572,546
Number of Sequences: 23463169
Number of extensions: 632971111
Number of successful extensions: 1119200
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2080
Number of HSP's successfully gapped in prelim test: 175
Number of HSP's that attempted gapping in prelim test: 1106642
Number of HSP's gapped (non-prelim): 5461
length of query: 747
length of database: 8,064,228,071
effective HSP length: 150
effective length of query: 597
effective length of database: 8,839,720,017
effective search space: 5277312850149
effective search space used: 5277312850149
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 81 (35.8 bits)