BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 037925
(821 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224116208|ref|XP_002317239.1| predicted protein [Populus trichocarpa]
gi|222860304|gb|EEE97851.1| predicted protein [Populus trichocarpa]
Length = 849
Score = 1388 bits (3593), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 644/816 (78%), Positives = 724/816 (88%), Gaps = 3/816 (0%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYDH+ALVIDGKRRVLQSGSIHYPR+TPEVWPE+IRKSKEGGL+VIETYVFWNYHEP+R
Sbjct: 36 VTYDHKALVIDGKRRVLQSGSIHYPRTTPEVWPEIIRKSKEGGLDVIETYVFWNYHEPVR 95
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
GQYYFEGRFDLVRFVKTVQEAGLF+HLRIGPYACAEWNYGGFP+WLHFIPG+QFRT+N+
Sbjct: 96 GQYYFEGRFDLVRFVKTVQEAGLFVHLRIGPYACAEWNYGGFPLWLHFIPGVQFRTSNDI 155
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK MK FL KI+DLMK +NLFASQGGPIILAQVENEYGNV+WAYGVGGELYVKWAA+TA
Sbjct: 156 FKNAMKSFLTKIVDLMKDDNLFASQGGPIILAQVENEYGNVQWAYGVGGELYVKWAAETA 215
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
++LNT+VPWVMC QEDAPDP+INTCNGFYCD FTPNSPSKP MWTENYSGWFL+FGYAVP
Sbjct: 216 ISLNTTVPWVMCVQEDAPDPVINTCNGFYCDQFTPNSPSKPKMWTENYSGWFLAFGYAVP 275
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
+RPVEDLAFAVARFFE GG+FQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ
Sbjct: 276 YRPVEDLAFAVARFFEYGGSFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 335
Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
PKWGHLR+LH AIK CEEYL+SSDP HQ+LG KLEAH+Y+K SNDCAAFLANYDS SDAN
Sbjct: 336 PKWGHLRDLHSAIKQCEEYLVSSDPVHQQLGNKLEAHVYYKHSNDCAAFLANYDSGSDAN 395
Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
VTFNGN YFLPAWSVSIL DCKNV+FNTAKV++QR+ GD F++ V+ L+A+S +SW
Sbjct: 396 VTFNGNTYFLPAWSVSILADCKNVIFNTAKVVTQRHIGDALFSRSTTVDGNLVAASPWSW 455
Query: 425 YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAAL 484
Y+E+VGI GN SF +P L EQINTTKDTSD+LWY+ S++V GQ KE LNIESLGHAAL
Sbjct: 456 YKEEVGIWGNNSFTKPGLLEQINTTKDTSDFLWYSTSLYVEAGQDKEHLLNIESLGHAAL 515
Query: 485 VFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS 544
VFVNK+ VAFGYGNHD A+F + ++I L EG NTLD+LSM++G+QNYG WFDV GAG+ S
Sbjct: 516 VFVNKRFVAFGYGNHDDASFSLTREISLEEGNNTLDVLSMLIGVQNYGPWFDVQGAGIHS 575
Query: 545 VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTT 604
V L+DL K+DLSSG+W YQVG+EGEY+GLD +SLANSS W QG++LPVNKSLIWYK T
Sbjct: 576 VFLVDLHKSKKDLSSGKWTYQVGLEGEYLGLDNVSLANSSLWSQGTSLPVNKSLIWYKAT 635
Query: 605 FLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQK 664
+APEG GPLALNLASMGKGQAW+NGQSIGRYWSAYL+PS GCT CDYRG+Y++ KCQK
Sbjct: 636 IIAPEGNGPLALNLASMGKGQAWINGQSIGRYWSAYLSPSAGCTDNCDYRGAYNSFKCQK 695
Query: 665 HCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVD 724
CGQPAQTLYHIPRTWVHPGENLLV+HEELGGDPS+ISLLT+TGQ ICS VSE DPPP D
Sbjct: 696 KCGQPAQTLYHIPRTWVHPGENLLVLHEELGGDPSQISLLTRTGQDICSIVSEDDPPPAD 755
Query: 725 SWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHMDVLPIVQKAC 784
SWKPNL +S SP+VRL CE GWHIAAINFAS+G PEG CG+F PG CH D+L IVQKAC
Sbjct: 756 SWKPNLEFMSQSPEVRLTCEHGWHIAAINFASFGTPEGKCGTFTPGNCHADMLTIVQKAC 815
Query: 785 VGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+G CSIP+S+A LG CPG++K VEA CS
Sbjct: 816 IGHERCSIPISAAKLG---DPCPGVVKRFVVEALCS 848
>gi|255560830|ref|XP_002521428.1| beta-galactosidase, putative [Ricinus communis]
gi|223539327|gb|EEF40918.1| beta-galactosidase, putative [Ricinus communis]
Length = 841
Score = 1344 bits (3479), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 638/819 (77%), Positives = 710/819 (86%), Gaps = 5/819 (0%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
S V+YDHRALVIDGKRRVLQSGSIHYPR+TPEVWP++IRKSKEGGL+VIETYVFWNYHE
Sbjct: 27 SGKVSYDHRALVIDGKRRVLQSGSIHYPRTTPEVWPDIIRKSKEGGLDVIETYVFWNYHE 86
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P++GQYYFEGRFDLVRFVKT+QEAGL +HLRIGPYACAEWNYGGFP+WLHFIPGIQFRTT
Sbjct: 87 PVKGQYYFEGRFDLVRFVKTIQEAGLLVHLRIGPYACAEWNYGGFPLWLHFIPGIQFRTT 146
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N FKEEMK FL KI+++MK+ENLFASQGGPIILAQVENEYGNVEWAYG GELYVKWAA
Sbjct: 147 NELFKEEMKLFLTKIVNMMKEENLFASQGGPIILAQVENEYGNVEWAYGAAGELYVKWAA 206
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
+TAV+LNTSVPWVMC Q DAPDPIINTCNGFYCD F+PNSPSKP MWTENYSGWFLSFGY
Sbjct: 207 ETAVSLNTSVPWVMCAQVDAPDPIINTCNGFYCDRFSPNSPSKPKMWTENYSGWFLSFGY 266
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
A+P+RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF
Sbjct: 267 AIPYRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 326
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
IRQPKWGHLR+LHKAIK CEE+LISSDP HQ+LG LEAHIY+KSSNDCAAFLANYDSSS
Sbjct: 327 IRQPKWGHLRDLHKAIKQCEEHLISSDPIHQQLGNNLEAHIYYKSSNDCAAFLANYDSSS 386
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
DANVTFNGN+YFLPAWSVSILPDCKNV+FNTAKV+ N GD FA +VNE+ L
Sbjct: 387 DANVTFNGNIYFLPAWSVSILPDCKNVIFNTAKVLI-LNLGDDFFAHSTSVNEIPLEQIV 445
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGH 481
+SWY+E+VGI GN SF P L EQINTTKD SD+LWY+ SI V Q K++ LNIESLGH
Sbjct: 446 WSWYKEEVGIWGNNSFTAPGLLEQINTTKDISDFLWYSTSISVNADQVKDIILNIESLGH 505
Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
AALVFVNK LV YGNHD A+F + +KI L EG NTLD+LSMM+G+QNYG WFDV GAG
Sbjct: 506 AALVFVNKVLVG-KYGNHDDASFSLTEKISLIEGNNTLDLLSMMIGVQNYGPWFDVQGAG 564
Query: 542 LFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWY 601
+++V+L+ K DLSS +W YQVG+EGEY GLDK+SLANSS W QG++ P+NKSLIWY
Sbjct: 565 IYAVLLVGQSKVKIDLSSEKWTYQVGLEGEYFGLDKVSLANSSLWTQGASPPINKSLIWY 624
Query: 602 KTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASK 661
K TF+APEGKGPLALNLA MGKGQAWVNGQSIGRYW AYL+PSTGC CDYRG+YD+ K
Sbjct: 625 KGTFVAPEGKGPLALNLAGMGKGQAWVNGQSIGRYWPAYLSPSTGCNDSCDYRGAYDSFK 684
Query: 662 CQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPP 721
C K CGQPAQTLYHIPRTWVHPGENLLV+HEELGGDPSKIS+LT+TG ICS VSE DPP
Sbjct: 685 CLKKCGQPAQTLYHIPRTWVHPGENLLVLHEELGGDPSKISVLTRTGHEICSIVSEDDPP 744
Query: 722 PVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHMDVLPIVQ 781
P DSWK + S +P+VRL CE+GWHI +INFAS+G P G CG+F PG+CH D+L IVQ
Sbjct: 745 PADSWKSSSEFKSQNPEVRLTCEQGWHIKSINFASFGTPAGICGTFNPGSCHADMLDIVQ 804
Query: 782 KACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
KAC+GQ CSI +S+A LG CPG+LK AVEA CS
Sbjct: 805 KACIGQEGCSISISAANLG---DPCPGVLKRFAVEARCS 840
>gi|116787095|gb|ABK24373.1| unknown [Picea sitchensis]
Length = 861
Score = 1045 bits (2703), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 504/843 (59%), Positives = 621/843 (73%), Gaps = 33/843 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ANVTYDHR+L+IDG+RRVL SGSIHYPRSTPE+WP++I+K+K+GGL+VIE+YVFWN HE
Sbjct: 28 AANVTYDHRSLLIDGQRRVLISGSIHYPRSTPEMWPDIIQKAKDGGLDVIESYVFWNMHE 87
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P + +YYFE RFDLV+FVK VQ+AGL +HLRIGPYACAEWNYGGFPVWLH IPGI FRT
Sbjct: 88 PKQNEYYFEDRFDLVKFVKIVQQAGLLVHLRIGPYACAEWNYGGFPVWLHLIPGIHFRTD 147
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK EM+RF AKI+D+MKQE LFASQGGPIILAQ+ENEYGN++ YG G+ YVKWAA
Sbjct: 148 NEPFKNEMQRFTAKIVDMMKQEKLFASQGGPIILAQIENEYGNIDGPYGAAGKSYVKWAA 207
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV LNT VPWVMCQQ DAPDPIINTCNGFYCD FTPNSP+KP MWTEN+SGWFLSFG
Sbjct: 208 SMAVGLNTGVPWVMCQQADAPDPIINTCNGFYCDAFTPNSPNKPKMWTENWSGWFLSFGG 267
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
+PFRP EDLAF+VARFF+ GGTFQNYYMY GGTNFGRT GGP +ATSYDYDAPIDEYG
Sbjct: 268 RLPFRPTEDLAFSVARFFQRGGTFQNYYMYHGGTNFGRTTGGPFIATSYDYDAPIDEYGI 327
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPKWGHL+ELHKAIKLCE L++++ + LG+ LEAH+Y S CAAFLAN ++ S
Sbjct: 328 VRQPKWGHLKELHKAIKLCEAALVNAESNYTSLGSGLEAHVYSPGSGTCAAFLANSNTQS 387
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS- 420
DA V FNGN Y LPAWSVSILPDCKNVVFNTAK+ SQ + Q N L+LA S
Sbjct: 388 DATVKFNGNSYHLPAWSVSILPDCKNVVFNTAKIGSQTT------SVQMNPANLILAGSN 441
Query: 421 -----------AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ- 468
++SW E++GI G+ +F +P L EQINTT D+SDYLWYT SI V +
Sbjct: 442 SMKGTDSANAASWSWLHEQIGIGGSNTFSKPGLLEQINTTVDSSDYLWYTTSIQVDDNEP 501
Query: 469 ----GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSM 524
G + L+++SLGHA VF+N + G G+ + + I L G N +D+LS+
Sbjct: 502 FLHNGTQPVLHVQSLGHALHVFINGEFAGRGAGSSSSSKIALQTPITLKSGKNNIDLLSI 561
Query: 525 MVGLQNYGAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANS 583
VGLQNYG++FD GAG+ VIL K+G+ DLS+ +W YQ+G+ GE +G+ S
Sbjct: 562 TVGLQNYGSFFDTWGAGITGPVILQGFKDGEHDLSTQQWTYQIGLTGEQLGIYSGDTKAS 621
Query: 584 SFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAP 643
+ W GS LP + +IWYKT F AP G P+ALNL MGKG AWVNGQSIGRYW +Y+A
Sbjct: 622 AQWVAGSDLPTKQPMIWYKTNFDAPSGNDPVALNLLGMGKGVAWVNGQSIGRYWPSYIAS 681
Query: 644 STGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISL 703
+GCT CDYRG+Y ++KCQ +CGQP+Q LYH+PR+W+ P N+LV+ EELGGDP++IS
Sbjct: 682 QSGCTDSCDYRGAYSSTKCQTNCGQPSQKLYHVPRSWIQPTGNVLVLFEELGGDPTQISF 741
Query: 704 LTKTGQHICSFVSEADPPPVDSWKPN----LGVVSSSPQVRLACERGWH-IAAINFASYG 758
+T++ +C+ VSE PPVDSWK + L V +++L C H I +I FAS+G
Sbjct: 742 MTRSVGSLCAQVSETHLPPVDSWKSSATSGLEVNKPKAELQLHCPSSRHLIKSIKFASFG 801
Query: 759 IPEGNCGSFRPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEA 817
+G+CGSF G C+ + + IV++AC+G+ CS+ VS G C G +K LAVEA
Sbjct: 802 TSKGSCGSFTYGHCNTNSTMSIVEEACIGRESCSVEVSIEKFG---DPCKGTVKNLAVEA 858
Query: 818 HCS 820
CS
Sbjct: 859 SCS 861
>gi|356539132|ref|XP_003538054.1| PREDICTED: beta-galactosidase 8-like [Glycine max]
Length = 836
Score = 1033 bits (2672), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 514/823 (62%), Positives = 609/823 (73%), Gaps = 15/823 (1%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
ANVTYDHRALVIDGKRRVL SGSIHYPRSTPE+WP+LI+KSK+GGL+VIETYVFWN HEP
Sbjct: 24 ANVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 83
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
+RGQY FEGR DLV+FVK V AGL++HLRIGPYACAEWNYGGFP+WLHFIPGIQFRT N
Sbjct: 84 VRGQYNFEGRGDLVKFVKVVAAAGLYVHLRIGPYACAEWNYGGFPLWLHFIPGIQFRTDN 143
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PF+ EMK+F AKI+DLMKQENL+ASQGGPIIL+Q+ENEYGN+E YG + Y+KWAA
Sbjct: 144 KPFEAEMKQFTAKIVDLMKQENLYASQGGPIILSQIENEYGNIEADYGPAAKSYIKWAAS 203
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
A +L T VPWVMCQQ++APDPIIN CNGFYCD F PNS +KP +WTE Y+GWFL+FG A
Sbjct: 204 MATSLGTGVPWVMCQQQNAPDPIINACNGFYCDQFKPNSNTKPKIWTEGYTGWFLAFGDA 263
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
VP RPVEDLAFAVARF++ GGTFQNYYMY GGTNFGR +GGP VA+SYDYDAPIDEYGFI
Sbjct: 264 VPHRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRASGGPFVASSYDYDAPIDEYGFI 323
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPKWGHL+++HKAIKLCEE LI++DPT LG +EA +Y K+ CAAFLAN ++SD
Sbjct: 324 RQPKWGHLKDVHKAIKLCEEALIATDPTITSLGPNIEAAVY-KTGVVCAAFLANI-ATSD 381
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A VTFNGN Y LPAWSVSILPDCKNVV NTAK+ S K+V L + S +
Sbjct: 382 ATVTFNGNSYHLPAWSVSILPDCKNVVLNTAKITSASMISSFTTESLKDVGSLDDSGSRW 441
Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHA 482
SW E +GIS SF L EQINTT D SDYLWY+ SI + G + FL+I+SLGHA
Sbjct: 442 SWISEPIGISKADSFSTFGLLEQINTTADRSDYLWYSLSIDL--DAGAQTFLHIKSLGHA 499
Query: 483 ALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGL 542
F+N KL G GNH+ AN ++ I L G NT+D+LS+ VGLQNYGA+FD GAG+
Sbjct: 500 LHAFINGKLAGSGTGNHEKANVEVDIPITLVSGKNTIDLLSLTVGLQNYGAFFDTWGAGI 559
Query: 543 FS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIW 600
VIL LKNG DLSS +W YQVG++ E +GL S S W STLP N+ L W
Sbjct: 560 TGPVILKCLKNGSNVDLSSKQWTYQVGLKNEDLGL---SSGCSGQWNSQSTLPTNQPLTW 616
Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
YKT F+AP G P+A++ MGKG+AWVNGQSIGRYW Y +P GCT C+YRG+YDAS
Sbjct: 617 YKTNFVAPSGNNPVAIDFTGMGKGEAWVNGQSIGRYWPTYASPKGGCTDSCNYRGAYDAS 676
Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADP 720
KC K+CG+P+QTLYH+PR+W+ P N LV+ EE GG+P +IS TK +CS VSE+ P
Sbjct: 677 KCLKNCGKPSQTLYHVPRSWLRPDRNTLVLFEESGGNPKQISFATKQIGSVCSHVSESHP 736
Query: 721 PPVDSWKPNL-GVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCGSFRPGACHMD-VL 777
PPVDSW N P V L C +++I FAS+G P G CG+F+ G C + L
Sbjct: 737 PPVDSWNSNTESGRKVVPVVSLECPYPNQVVSSIKFASFGTPLGTCGNFKHGLCSSNKAL 796
Query: 778 PIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
IVQKAC+G C I +S G C G+ K+LAVEA C+
Sbjct: 797 SIVQKACIGSSSCRIELSVNTFG---DPCKGVAKSLAVEASCA 836
>gi|356539454|ref|XP_003538213.1| PREDICTED: beta-galactosidase 8-like [Glycine max]
Length = 838
Score = 1030 bits (2663), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 510/823 (61%), Positives = 614/823 (74%), Gaps = 14/823 (1%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
ANVTYDHRALVIDGKRRVL SGSIHYPRSTPE+WP+LI+KSK+GGL+VIETYVFWN HEP
Sbjct: 25 ANVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
++GQY FEGR DLV+FVK V AGL++HLRIGPYACAEWNYGGFP+WLHFIPGIQFRT N
Sbjct: 85 VQGQYNFEGRADLVKFVKAVAAAGLYVHLRIGPYACAEWNYGGFPLWLHFIPGIQFRTDN 144
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PF+ EMKRF KI+D+MKQE+L+ASQGGPIIL+QVENEYGN++ AYG + Y+KWAA
Sbjct: 145 KPFEAEMKRFTVKIVDMMKQESLYASQGGPIILSQVENEYGNIDAAYGPAAKSYIKWAAS 204
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
A +L+T VPWVMCQQ DAPDPIINTCNGFYCD FTPNS +KP MWTEN+SGWFLSFG A
Sbjct: 205 MATSLDTGVPWVMCQQADAPDPIINTCNGFYCDQFTPNSNAKPKMWTENWSGWFLSFGGA 264
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
VP+RPVEDLAFAVARF++ GGTFQNYYMY GGTNFGRT GGP ++TSYDYDAPID+YG I
Sbjct: 265 VPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRTTGGPFISTSYDYDAPIDQYGII 324
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPKWGHL+++HKAIKLCEE LI++DPT G +EA +Y K+ + CAAFLAN ++SD
Sbjct: 325 RQPKWGHLKDVHKAIKLCEEALIATDPTITSPGPNIEAAVY-KTGSICAAFLANI-ATSD 382
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQ-QKNVNELLLASSA 421
A VTFNGN Y LPAWSVSILPDCKNVV NTAK+ S ++ V L + S
Sbjct: 383 ATVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSASMISSFTTESFKEEVGSLDDSGSG 442
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGH 481
+SW E +GIS + SF + L EQINTT D SDYLWY+ SI V G + L+IESLGH
Sbjct: 443 WSWISEPIGISKSDSFSKFGLLEQINTTADKSDYLWYSISIDVEGDSGSQTVLHIESLGH 502
Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
A F+N K+ G GN A ++ + L G N++D+LS+ VGLQNYGA+FD GAG
Sbjct: 503 ALHAFINGKIAGSGTGNSGKAKVNVDIPVTLVAGKNSIDLLSLTVGLQNYGAFFDTWGAG 562
Query: 542 LFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLI 599
+ VIL LKNG DLSS +W YQVG++ E +G S +S W STLP N+SLI
Sbjct: 563 ITGPVILKGLKNGSTVDLSSQQWTYQVGLKYEDLGP---SNGSSGQWNSQSTLPTNQSLI 619
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
WYKT F+AP G P+A++ MGKG+AWVNGQSIGRYW Y++P+ GCT C+YRG+Y +
Sbjct: 620 WYKTNFVAPSGSNPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSPNGGCTDSCNYRGAYSS 679
Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEAD 719
SKC K+CG+P+QTLYHIPR+W+ P N LV+ EE GGDP++IS TK +CS VSE+
Sbjct: 680 SKCLKNCGKPSQTLYHIPRSWLQPDSNTLVLFEESGGDPTQISFATKQIGSMCSHVSESH 739
Query: 720 PPPVDSWKPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCGSFRPGACHMD-VL 777
PPPVD W + G P + L C I++I FAS+G P G CG+F+ G C + L
Sbjct: 740 PPPVDLWNSDKG-RKVGPVLSLECPYPNQLISSIKFASFGTPYGTCGNFKHGRCRSNKAL 798
Query: 778 PIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
IVQKAC+G C I +S G C G+ K+LAVEA C+
Sbjct: 799 SIVQKACIGSSSCRIGISINTFG---DPCKGVTKSLAVEASCA 838
>gi|359478691|ref|XP_002285084.2| PREDICTED: beta-galactosidase 8-like [Vitis vinifera]
gi|297746241|emb|CBI16297.3| unnamed protein product [Vitis vinifera]
Length = 846
Score = 1021 bits (2641), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 501/834 (60%), Positives = 609/834 (73%), Gaps = 23/834 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
++ VTYDHRALVIDGKRRVL SGSIHYPRSTP++WP+LI+KSK+GGL+VIETYVFWN H
Sbjct: 22 FASTVTYDHRALVIDGKRRVLISGSIHYPRSTPDMWPDLIQKSKDGGLDVIETYVFWNLH 81
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP+R QY F+GR DLV+FVKTV EAGL++HLRIGPY CAEWNYGGFP+WLHFIPGIQFRT
Sbjct: 82 EPVRRQYDFKGRNDLVKFVKTVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIQFRT 141
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFKEEM+ F AKI+D+MK+ENL+ASQGGPIIL+Q+ENEYGN++ AYG + Y++WA
Sbjct: 142 DNGPFKEEMQIFTAKIVDMMKKENLYASQGGPIILSQIENEYGNIDSAYGSAAKSYIQWA 201
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A +L+T VPWVMCQQ DAPDP+INTCNGFYCD FTPNS KP MWTEN++GWFLSFG
Sbjct: 202 ASMATSLDTGVPWVMCQQADAPDPMINTCNGFYCDQFTPNSVKKPKMWTENWTGWFLSFG 261
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
AVP+RPVED+AFAVARFF+ GGTFQNYYMY GGTNFGRT GGP +ATSYDYDAPIDEYG
Sbjct: 262 GAVPYRPVEDIAFAVARFFQLGGTFQNYYMYHGGTNFGRTTGGPFIATSYDYDAPIDEYG 321
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+RQPKWGHL++LHKAIKLCE LI++DPT LG LEA +Y + CAAFLAN ++
Sbjct: 322 LLRQPKWGHLKDLHKAIKLCEAALIATDPTITSLGTNLEASVYKTGTGSCAAFLANVRTN 381
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
SDA V F+GN Y LPAWSVSILPDCKNV NTA++ S P Q+++ + +S
Sbjct: 382 SDATVNFSGNSYHLPAWSVSILPDCKNVALNTAQINSM---AVMPRFMQQSLKNDIDSSD 438
Query: 421 AF----SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKE 471
F SW +E VGIS N +F + L EQIN T D SDYLWY+ S + + G +
Sbjct: 439 GFQSGWSWVDEPVGISKNNAFTKLGLLEQINITADKSDYLWYSLSTEIQGDEPFLEDGSQ 498
Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
L++ESLGHA F+N KL G GN A ++ + L G NT+D+LS+ VGLQNY
Sbjct: 499 TVLHVESLGHALHAFINGKLAGSGTGNSGNAKVTVDIPVTLIHGKNTIDLLSLTVGLQNY 558
Query: 532 GAWFDVAGAGLFSVI-LIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG 589
GA++D GAG+ I L L NG DLSS +W YQVG++GE +GL +SS W G
Sbjct: 559 GAFYDKQGAGITGPIKLKGLANGTTVDLSSQQWTYQVGLQGEELGLPS---GSSSKWVAG 615
Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
STLP + LIWYKTTF AP G P+AL+ MGKG+AWVNGQSIGRYW AY++ + GCT
Sbjct: 616 STLPKKQPLIWYKTTFDAPAGNDPVALDFMGMGKGEAWVNGQSIGRYWPAYVSSNGGCTS 675
Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
C+YRG Y ++KC K+CG+P+Q LYH+PR+W+ P N LV+ EE+GGDP++IS TK +
Sbjct: 676 SCNYRGPYSSNKCLKNCGKPSQQLYHVPRSWLQPSGNTLVLFEEIGGDPTQISFATKQVE 735
Query: 710 HICSFVSEADPPPVDSWKPNLGV-VSSSPQVRLACE-RGWHIAAINFASYGIPEGNCGSF 767
+CS VSE P PVD W +L SSP + L C I++I FAS+G P G CGSF
Sbjct: 736 SLCSRVSEYHPLPVDMWGSDLTTGRKSSPMLSLECPFPNQVISSIKFASFGTPRGTCGSF 795
Query: 768 RPGAC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
C L IVQ+AC+G CSI VS G C G+ K+LAVEA C+
Sbjct: 796 SHSKCSSRTALSIVQEACIGSKSCSIGVSIDTFG---DPCSGIAKSLAVEASCT 846
>gi|356550171|ref|XP_003543462.1| PREDICTED: beta-galactosidase 8-like isoform 1 [Glycine max]
Length = 840
Score = 1016 bits (2626), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 507/824 (61%), Positives = 609/824 (73%), Gaps = 13/824 (1%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
ANV YDHRALVIDGKRRVL SGSIHYPRSTPE+WP+LI+KSK+GGL+VIETYVFWN +EP
Sbjct: 24 ANVEYDHRALVIDGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLNEP 83
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
+RGQY F+GR DLV+FVKTV AGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+FRT N
Sbjct: 84 VRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTDN 143
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK EMKRF AKI+D++K+ENL+ASQGGP+IL+Q+ENEYGN++ AYG G+ Y+KWAA
Sbjct: 144 EPFKAEMKRFTAKIVDMIKEENLYASQGGPVILSQIENEYGNIDSAYGAAGKSYIKWAAT 203
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
A +L+T VPWVMCQQ DAPDPIINTCNGFYCD FTPNS +KP MWTEN+SGWFL FG A
Sbjct: 204 MATSLDTGVPWVMCQQADAPDPIINTCNGFYCDQFTPNSNTKPKMWTENWSGWFLPFGGA 263
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
VP+RPVEDLAFAVARFF+ GGTFQNYYMY GGTNF RT+GGP +ATSYDYDAPIDEYG I
Sbjct: 264 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGII 323
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPKWGHL+E+HKAIKLCEE LI++DPT LG LEA +Y K+ + CAAFLAN D+ SD
Sbjct: 324 RQPKWGHLKEVHKAIKLCEEALIATDPTITSLGPNLEAAVY-KTGSVCAAFLANVDTKSD 382
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK-NVNELLLASSA 421
V F+GN Y LPAWSVSILPDCKNVV NTAK+ S K ++ +S+
Sbjct: 383 VTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKINSASAISSFTTESLKEDIGSSEASSTG 442
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGH 481
+SW E VGIS SF + L EQINTT D SDYLWY+ SI G + L+IESLGH
Sbjct: 443 WSWISEPVGISKADSFPQTGLLEQINTTADKSDYLWYSLSIDYKGDAGSQTVLHIESLGH 502
Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
A F+N KL GN F ++ + L G NT+D+LS+ VGLQNYGA+FD GAG
Sbjct: 503 ALHAFINGKLAGSQTGNSGKYKFTVDIPVTLVAGKNTIDLLSLTVGLQNYGAFFDTWGAG 562
Query: 542 LFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLI 599
+ VIL L NG DLS +W YQVG++GE +GL S +S W ST P N+ LI
Sbjct: 563 ITGPVILKGLANGNTLDLSYQKWTYQVGLKGEDLGL---SSGSSGQWNSQSTFPKNQPLI 619
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
WYKTTF AP G P+A++ MGKG+AWVNGQSIGRYW Y+A GCT C+YRG Y A
Sbjct: 620 WYKTTFAAPSGSDPVAIDFTGMGKGEAWVNGQSIGRYWPTYVASDAGCTDSCNYRGPYSA 679
Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEAD 719
SKC+++CG+P+QTLYH+PR+W+ P N+LV+ EE GGDP++IS +TK + +C+ VS++
Sbjct: 680 SKCRRNCGKPSQTLYHVPRSWLKPSGNILVLFEEKGGDPTQISFVTKQTESLCAHVSDSH 739
Query: 720 PPPVDSWKPNL-GVVSSSPQVRLACERGWH-IAAINFASYGIPEGNCGSFRPGACHMD-V 776
PPPVD W + P + L C I++I FASYG P G CG+F G C +
Sbjct: 740 PPPVDLWNSDTESGRKVGPVLSLTCPHDNQVISSIKFASYGTPLGTCGNFYHGRCSSNKA 799
Query: 777 LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
L IVQKAC+G CS+ VSS G C G+ K+LAVEA C+
Sbjct: 800 LSIVQKACIGSSSCSVGVSSETFG---NPCRGVAKSLAVEATCA 840
>gi|356550173|ref|XP_003543463.1| PREDICTED: beta-galactosidase 8-like isoform 2 [Glycine max]
Length = 830
Score = 1011 bits (2615), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 507/823 (61%), Positives = 608/823 (73%), Gaps = 21/823 (2%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
ANV YDHRALVIDGKRRVL SGSIHYPRSTPE+WP+LI+KSK+GGL+VIETYVFWN +EP
Sbjct: 24 ANVEYDHRALVIDGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLNEP 83
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
+RGQY F+GR DLV+FVKTV AGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+FRT N
Sbjct: 84 VRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTDN 143
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK EMKRF AKI+D++K+ENL+ASQGGP+IL+Q+ENEYGN++ AYG G+ Y+KWAA
Sbjct: 144 EPFKAEMKRFTAKIVDMIKEENLYASQGGPVILSQIENEYGNIDSAYGAAGKSYIKWAAT 203
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
A +L+T VPWVMCQQ DAPDPIINTCNGFYCD FTPNS +KP MWTEN+SGWFL FG A
Sbjct: 204 MATSLDTGVPWVMCQQADAPDPIINTCNGFYCDQFTPNSNTKPKMWTENWSGWFLPFGGA 263
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
VP+RPVEDLAFAVARFF+ GGTFQNYYMY GGTNF RT+GGP +ATSYDYDAPIDEYG I
Sbjct: 264 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGII 323
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPKWGHL+E+HKAIKLCEE LI++DPT LG LEA +Y K+ + CAAFLAN D+ SD
Sbjct: 324 RQPKWGHLKEVHKAIKLCEEALIATDPTITSLGPNLEAAVY-KTGSVCAAFLANVDTKSD 382
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
V F+GN Y LPAWSVSILPDCKNVV NTAKV ++ L +S+ +
Sbjct: 383 VTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKVC---------LTNFISMFMWLPSSTGW 433
Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHA 482
SW E VGIS SF + L EQINTT D SDYLWY+ SI G + L+IESLGHA
Sbjct: 434 SWISEPVGISKADSFPQTGLLEQINTTADKSDYLWYSLSIDYKGDAGSQTVLHIESLGHA 493
Query: 483 ALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGL 542
F+N KL GN F ++ + L G NT+D+LS+ VGLQNYGA+FD GAG+
Sbjct: 494 LHAFINGKLAGSQTGNSGKYKFTVDIPVTLVAGKNTIDLLSLTVGLQNYGAFFDTWGAGI 553
Query: 543 FS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIW 600
VIL L NG DLS +W YQVG++GE +GL S +S W ST P N+ LIW
Sbjct: 554 TGPVILKGLANGNTLDLSYQKWTYQVGLKGEDLGL---SSGSSGQWNSQSTFPKNQPLIW 610
Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
YKTTF AP G P+A++ MGKG+AWVNGQSIGRYW Y+A GCT C+YRG Y AS
Sbjct: 611 YKTTFAAPSGSDPVAIDFTGMGKGEAWVNGQSIGRYWPTYVASDAGCTDSCNYRGPYSAS 670
Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADP 720
KC+++CG+P+QTLYH+PR+W+ P N+LV+ EE GGDP++IS +TK + +C+ VS++ P
Sbjct: 671 KCRRNCGKPSQTLYHVPRSWLKPSGNILVLFEEKGGDPTQISFVTKQTESLCAHVSDSHP 730
Query: 721 PPVDSWKPNL-GVVSSSPQVRLACERGWH-IAAINFASYGIPEGNCGSFRPGACHMD-VL 777
PPVD W + P + L C I++I FASYG P G CG+F G C + L
Sbjct: 731 PPVDLWNSDTESGRKVGPVLSLTCPHDNQVISSIKFASYGTPLGTCGNFYHGRCSSNKAL 790
Query: 778 PIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
IVQKAC+G CS+ VSS G C G+ K+LAVEA C+
Sbjct: 791 SIVQKACIGSSSCSVGVSSETFG---NPCRGVAKSLAVEATCA 830
>gi|297822423|ref|XP_002879094.1| beta-glactosidase 8 [Arabidopsis lyrata subsp. lyrata]
gi|297324933|gb|EFH55353.1| beta-glactosidase 8 [Arabidopsis lyrata subsp. lyrata]
Length = 846
Score = 1009 bits (2608), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 493/837 (58%), Positives = 615/837 (73%), Gaps = 31/837 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ NVTYDHRALVIDGKR+VL SGSIHYPRSTPE+WPELI+KSK+GGL+VIETYVFW+ HE
Sbjct: 23 AVNVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIKKSKDGGLDVIETYVFWSGHE 82
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P + +Y FEGR+DLV+FVK V+EAGL++HLRIGPY CAEWNYGGFPVWLHF+PGI+FRT
Sbjct: 83 PEKNKYNFEGRYDLVKFVKLVEEAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTD 142
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFKEEM+RF KI+DLMKQE L+ASQGGPIIL+Q+ENEYGN++ AYG ++Y+KW+A
Sbjct: 143 NEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKIYIKWSA 202
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A++L+T VPW MCQQ DAPDP+INTCNGFYCD FTPNS SKP MWTEN+SGWFL FG
Sbjct: 203 SMALSLDTGVPWNMCQQADAPDPMINTCNGFYCDQFTPNSNSKPKMWTENWSGWFLGFGD 262
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
P+RPVEDLAFAVARF++ GGTFQNYYMY GGTNF RT+GGPL++TSYDYDAPIDEYG
Sbjct: 263 PSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYGL 322
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPKWGHLR+LHKAIKLCE+ LI++DPT LG+ LEA +Y +S CAAFLAN + S
Sbjct: 323 LRQPKWGHLRDLHKAIKLCEDALIATDPTISSLGSNLEAAVYKTASGSCAAFLANVGTKS 382
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
DA V+FNG Y LPAWSVSILPDCKNV FNTAK+ N+ P A + + SSA
Sbjct: 383 DATVSFNGESYHLPAWSVSILPDCKNVAFNTAKI----NSATEPTAFARQSLKPDGGSSA 438
Query: 422 -----FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQGKE 471
+S+ +E +GIS +F++P L EQINTT D SDYLWY+ + + +G +
Sbjct: 439 ELGSEWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRMDIKGDETFLDEGSK 498
Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
L+IESLG F+N KL G+G + ++ I L G NT+D+LS+ VGL NY
Sbjct: 499 AVLHIESLGQVVYAFINGKLAGSGHGKQKIS---LDIPINLAAGKNTVDLLSVTVGLANY 555
Query: 532 GAWFDVAGAGLFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG 589
GA+FD+ GAG+ V L K G DL+S +W YQVG++GE GL + +SS W
Sbjct: 556 GAFFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLATV---DSSEWVSK 612
Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
S LP + LIWYKTTF AP G P+A++ GKG AWVNGQSIGRYW +A + GCT
Sbjct: 613 SPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNGGCTD 672
Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK-TG 708
CDYRGSY A+KC K+CG+P+QTLYH+PR+W+ P N LV+ EE+GGDP++IS TK TG
Sbjct: 673 SCDYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNTLVLFEEMGGDPTQISFGTKQTG 732
Query: 709 QHICSFVSEADPPPVDSWKPNLGVVS---SSPQVRLACERGWH-IAAINFASYGIPEGNC 764
++C VS++ PPPVD+W + + + + P + L C I++I FAS+G P+G C
Sbjct: 733 SNLCLMVSQSHPPPVDTWTSDSKISNRNRTRPVLSLKCPVSTQVISSIKFASFGTPQGTC 792
Query: 765 GSFRPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
GSF G C+ L +VQKAC+G C++ VS+ G C G++K+LAVEA CS
Sbjct: 793 GSFTHGHCNSSRSLSVVQKACIGSRSCNVEVSTRVFGE---PCRGVIKSLAVEASCS 846
>gi|30683905|ref|NP_850121.1| beta-galactosidase 8 [Arabidopsis thaliana]
gi|152013364|sp|Q9SCV4.2|BGAL8_ARATH RecName: Full=Beta-galactosidase 8; Short=Lactase 8; AltName:
Full=Protein AR782; Flags: Precursor
gi|330253033|gb|AEC08127.1| beta-galactosidase 8 [Arabidopsis thaliana]
Length = 852
Score = 1006 bits (2600), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 492/835 (58%), Positives = 612/835 (73%), Gaps = 27/835 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ANVTYDHRALVIDGKR+VL SGSIHYPRSTPE+WPELI+KSK+GGL+VIETYVFW+ HE
Sbjct: 29 AANVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHE 88
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P + +Y FEGR+DLV+FVK +AGL++HLRIGPY CAEWNYGGFPVWLHF+PGI+FRT
Sbjct: 89 PEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTD 148
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFKEEM+RF KI+DLMKQE L+ASQGGPIIL+Q+ENEYGN++ AYG + Y+KW+A
Sbjct: 149 NEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSA 208
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A++L+T VPW MCQQ DAPDP+INTCNGFYCD FTPNS +KP MWTEN+SGWFL FG
Sbjct: 209 SMALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGD 268
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
P+RPVEDLAFAVARF++ GGTFQNYYMY GGTNF RT+GGPL++TSYDYDAPIDEYG
Sbjct: 269 PSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYGL 328
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPKWGHLR+LHKAIKLCE+ LI++DPT LG+ LEA +Y S CAAFLAN D+ S
Sbjct: 329 LRQPKWGHLRDLHKAIKLCEDALIATDPTITSLGSNLEAAVYKTESGSCAAFLANVDTKS 388
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK---NVNELLLA 418
DA VTFNG Y LPAWSVSILPDCKNV FNTAK+ S + FA+Q +
Sbjct: 389 DATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKINSATES--TAFARQSLKPDGGSSAEL 446
Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQGKEVF 473
S +S+ +E +GIS +F++P L EQINTT D SDYLWY+ + +G +
Sbjct: 447 GSQWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLDEGSKAV 506
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L+IESLG F+N KL G+G + ++ I L G NT+D+LS+ VGL NYGA
Sbjct: 507 LHIESLGQVVYAFINGKLAGSGHGKQKIS---LDIPINLVTGTNTIDLLSVTVGLANYGA 563
Query: 534 WFDVAGAGLFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
+FD+ GAG+ V L K G DL+S +W YQVG++GE GL + +SS W S
Sbjct: 564 FFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLATV---DSSEWVSKSP 620
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
LP + LIWYKTTF AP G P+A++ GKG AWVNGQSIGRYW +A + GCT+ C
Sbjct: 621 LPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNGGCTESC 680
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK-TGQH 710
DYRGSY A+KC K+CG+P+QTLYH+PR+W+ P N+LV+ EE+GGDP++IS TK TG +
Sbjct: 681 DYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQISFATKQTGSN 740
Query: 711 ICSFVSEADPPPVDSWKPNLGVVS---SSPQVRLACERGWH-IAAINFASYGIPEGNCGS 766
+C VS++ PPPVD+W + + + + P + L C I +I FAS+G P+G CGS
Sbjct: 741 LCLTVSQSHPPPVDTWTSDSKISNRNRTRPVLSLKCPISTQVIFSIKFASFGTPKGTCGS 800
Query: 767 FRPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
F G C+ L +VQKAC+G C++ VS+ G C G++K+LAVEA CS
Sbjct: 801 FTQGHCNSSRSLSLVQKACIGLRSCNVEVSTRVFGE---PCRGVVKSLAVEASCS 852
>gi|6686888|emb|CAB64744.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 852
Score = 1006 bits (2600), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 492/835 (58%), Positives = 612/835 (73%), Gaps = 27/835 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ANVTYDHRALVIDGKR+VL SGSIHYPRSTPE+WPELI+KSK+GGL+VIETYVFW+ HE
Sbjct: 29 AANVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHE 88
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P + +Y FEGR+DLV+FVK +AGL++HLRIGPY CAEWNYGGFPVWLHF+PGI+FRT
Sbjct: 89 PEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTD 148
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFKEEM+RF KI+DLMKQE L+ASQGGPIIL+Q+ENEYGN++ AYG + Y+KW+A
Sbjct: 149 NEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSA 208
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A++L+T VPW MCQQ DAPDP+INTCNGFYCD FTPNS +KP MWTEN+SGWFL FG
Sbjct: 209 SMALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGD 268
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
P+RPVEDLAFAVARF++ GGTFQNYYMY GGTNF RT+GGPL++TSYDYDAPIDEYG
Sbjct: 269 PSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYGL 328
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPKWGHLR+LHKAIKLCE+ LI++DPT LG+ LEA +Y S CAAFLAN D+ S
Sbjct: 329 LRQPKWGHLRDLHKAIKLCEDALIATDPTITSLGSNLEAAVYKTESGSCAAFLANVDTKS 388
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK---NVNELLLA 418
DA VTFNG Y LPAWSVSILPDCKNV FNTAK+ S + FA+Q +
Sbjct: 389 DATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKINSATES--TAFARQSLKPDGGSSAEL 446
Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQGKEVF 473
S +S+ +E +GIS +F++P L EQINTT D SDYLWY+ + +G +
Sbjct: 447 GSQWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLDEGSKAV 506
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L+IESLG F+N KL G+G + ++ I L G NT+D+LS+ VGL NYGA
Sbjct: 507 LHIESLGQVVYAFINGKLAGSGHGKQKIS---LDIPINLVTGTNTIDLLSVTVGLANYGA 563
Query: 534 WFDVAGAGLFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
+FD+ GAG+ V L K G DL+S +W YQVG++GE GL + +SS W S
Sbjct: 564 FFDLMGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLATV---DSSEWVSKSP 620
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
LP + LIWYKTTF AP G P+A++ GKG AWVNGQSIGRYW +A + GCT+ C
Sbjct: 621 LPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNGGCTESC 680
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK-TGQH 710
DYRGSY A+KC K+CG+P+QTLYH+PR+W+ P N+LV+ EE+GGDP++IS TK TG +
Sbjct: 681 DYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQISFATKQTGSN 740
Query: 711 ICSFVSEADPPPVDSWKPNLGVVS---SSPQVRLACERGWH-IAAINFASYGIPEGNCGS 766
+C VS++ PPPVD+W + + + + P + L C I +I FAS+G P+G CGS
Sbjct: 741 LCLTVSQSHPPPVDTWTSDSKISNRNRTRPVLSLKCPISTQVIFSIKFASFGTPKGTCGS 800
Query: 767 FRPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
F G C+ L +VQKAC+G C++ VS+ G C G++K+LAVEA CS
Sbjct: 801 FTQGHCNSSRSLSLVQKACIGLRSCNVEVSTRVFGE---PCRGVVKSLAVEASCS 852
>gi|334184536|ref|NP_001189624.1| beta-galactosidase 8 [Arabidopsis thaliana]
gi|330253034|gb|AEC08128.1| beta-galactosidase 8 [Arabidopsis thaliana]
Length = 846
Score = 1006 bits (2600), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 492/835 (58%), Positives = 612/835 (73%), Gaps = 27/835 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ANVTYDHRALVIDGKR+VL SGSIHYPRSTPE+WPELI+KSK+GGL+VIETYVFW+ HE
Sbjct: 23 AANVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHE 82
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P + +Y FEGR+DLV+FVK +AGL++HLRIGPY CAEWNYGGFPVWLHF+PGI+FRT
Sbjct: 83 PEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTD 142
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFKEEM+RF KI+DLMKQE L+ASQGGPIIL+Q+ENEYGN++ AYG + Y+KW+A
Sbjct: 143 NEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSA 202
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A++L+T VPW MCQQ DAPDP+INTCNGFYCD FTPNS +KP MWTEN+SGWFL FG
Sbjct: 203 SMALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGD 262
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
P+RPVEDLAFAVARF++ GGTFQNYYMY GGTNF RT+GGPL++TSYDYDAPIDEYG
Sbjct: 263 PSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYGL 322
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPKWGHLR+LHKAIKLCE+ LI++DPT LG+ LEA +Y S CAAFLAN D+ S
Sbjct: 323 LRQPKWGHLRDLHKAIKLCEDALIATDPTITSLGSNLEAAVYKTESGSCAAFLANVDTKS 382
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK---NVNELLLA 418
DA VTFNG Y LPAWSVSILPDCKNV FNTAK+ S + FA+Q +
Sbjct: 383 DATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKINSATES--TAFARQSLKPDGGSSAEL 440
Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQGKEVF 473
S +S+ +E +GIS +F++P L EQINTT D SDYLWY+ + +G +
Sbjct: 441 GSQWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLDEGSKAV 500
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L+IESLG F+N KL G+G + ++ I L G NT+D+LS+ VGL NYGA
Sbjct: 501 LHIESLGQVVYAFINGKLAGSGHGKQKIS---LDIPINLVTGTNTIDLLSVTVGLANYGA 557
Query: 534 WFDVAGAGLFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
+FD+ GAG+ V L K G DL+S +W YQVG++GE GL + +SS W S
Sbjct: 558 FFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLATV---DSSEWVSKSP 614
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
LP + LIWYKTTF AP G P+A++ GKG AWVNGQSIGRYW +A + GCT+ C
Sbjct: 615 LPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNGGCTESC 674
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK-TGQH 710
DYRGSY A+KC K+CG+P+QTLYH+PR+W+ P N+LV+ EE+GGDP++IS TK TG +
Sbjct: 675 DYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQISFATKQTGSN 734
Query: 711 ICSFVSEADPPPVDSWKPNLGVVS---SSPQVRLACERGWH-IAAINFASYGIPEGNCGS 766
+C VS++ PPPVD+W + + + + P + L C I +I FAS+G P+G CGS
Sbjct: 735 LCLTVSQSHPPPVDTWTSDSKISNRNRTRPVLSLKCPISTQVIFSIKFASFGTPKGTCGS 794
Query: 767 FRPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
F G C+ L +VQKAC+G C++ VS+ G C G++K+LAVEA CS
Sbjct: 795 FTQGHCNSSRSLSLVQKACIGLRSCNVEVSTRVFGE---PCRGVVKSLAVEASCS 846
>gi|4510395|gb|AAD21482.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 839
Score = 1002 bits (2591), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 492/832 (59%), Positives = 610/832 (73%), Gaps = 28/832 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ANVTYDHRALVIDGKR+VL SGSIHYPRSTPE+WPELI+KSK+GGL+VIETYVFW+ HE
Sbjct: 23 AANVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHE 82
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P + +Y FEGR+DLV+FVK +AGL++HLRIGPY CAEWNYGGFPVWLHF+PGI+FRT
Sbjct: 83 PEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTD 142
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFKEEM+RF KI+DLMKQE L+ASQGGPIIL+Q+ENEYGN++ AYG + Y+KW+A
Sbjct: 143 NEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSA 202
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A++L+T VPW MCQQ DAPDP+INTCNGFYCD FTPNS +KP MWTEN+SGWFL FG
Sbjct: 203 SMALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGD 262
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
P+RPVEDLAFAVARF++ GGTFQNYYMY GGTNF RT+GGPL++TSYDYDAPIDEYG
Sbjct: 263 PSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYGL 322
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPKWGHLR+LHKAIKLCE+ LI++DPT LG+ LEA +Y S CAAFLAN D+ S
Sbjct: 323 LRQPKWGHLRDLHKAIKLCEDALIATDPTITSLGSNLEAAVYKTESGSCAAFLANVDTKS 382
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
DA VTFNG Y LPAWSVSILPDCKNV FNTAKV + N + EL S
Sbjct: 383 DATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKV---KFNSISKTPDGGSSAEL---GSQ 436
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQGKEVFLNI 476
+S+ +E +GIS +F++P L EQINTT D SDYLWY+ + +G + L+I
Sbjct: 437 WSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLDEGSKAVLHI 496
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
ESLG F+N KL G+G + ++ I L G NT+D+LS+ VGL NYGA+FD
Sbjct: 497 ESLGQVVYAFINGKLAGSGHGKQKIS---LDIPINLVTGTNTIDLLSVTVGLANYGAFFD 553
Query: 537 VAGAGLFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+ GAG+ V L K G DL+S +W YQVG++GE GL + +SS W S LP
Sbjct: 554 LVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLATV---DSSEWVSKSPLPT 610
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
+ LIWYKTTF AP G P+A++ GKG AWVNGQSIGRYW +A + GCT+ CDYR
Sbjct: 611 KQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNGGCTESCDYR 670
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK-TGQHICS 713
GSY A+KC K+CG+P+QTLYH+PR+W+ P N+LV+ EE+GGDP++IS TK TG ++C
Sbjct: 671 GSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQISFATKQTGSNLCL 730
Query: 714 FVSEADPPPVDSWKPNLGVVS---SSPQVRLACERGWH-IAAINFASYGIPEGNCGSFRP 769
VS++ PPPVD+W + + + + P + L C I +I FAS+G P+G CGSF
Sbjct: 731 TVSQSHPPPVDTWTSDSKISNRNRTRPVLSLKCPISTQVIFSIKFASFGTPKGTCGSFTQ 790
Query: 770 GACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G C+ L +VQKAC+G C++ VS+ G C G++K+LAVEA CS
Sbjct: 791 GHCNSSRSLSLVQKACIGLRSCNVEVSTRVFGE---PCRGVVKSLAVEASCS 839
>gi|357453873|ref|XP_003597217.1| Beta-galactosidase [Medicago truncatula]
gi|355486265|gb|AES67468.1| Beta-galactosidase [Medicago truncatula]
Length = 833
Score = 999 bits (2583), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 495/826 (59%), Positives = 606/826 (73%), Gaps = 20/826 (2%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
NV YDHRALVIDGKRRVL SGSIHYPRSTP++WP+LI+KSK+GGL+VIETYVFWN HEP
Sbjct: 20 TNVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSKDGGLDVIETYVFWNLHEP 79
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
++GQY F+GR DLV+FVK V EAGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+FRT N
Sbjct: 80 VKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTDN 139
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK EMKRF AKI+DLMKQE L+ASQGGPIIL+Q+ENEYGN++ YG G+ Y+ WAA
Sbjct: 140 EPFKAEMKRFTAKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSHYGSAGKSYINWAAK 199
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
A +L+T VPWVMCQQ DAPDPIINTCNGFYCD FTPNS +KP MWTEN+SGWFLSFG A
Sbjct: 200 MATSLDTGVPWVMCQQGDAPDPIINTCNGFYCDQFTPNSNTKPKMWTENWSGWFLSFGGA 259
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
VP RPVEDLAFAVARFF+ GGTFQNYYMY GGTNF R+ GGP +ATSYDYDAPIDEYG I
Sbjct: 260 VPHRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRSTGGPFIATSYDYDAPIDEYGII 319
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQ KWGHL+++HKAIKLCEE LI++DP LG LEA +Y K+ + CAAFLAN D+ +D
Sbjct: 320 RQQKWGHLKDVHKAIKLCEEALIATDPKISSLGQNLEAAVY-KTGSVCAAFLANVDTKND 378
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
V F+GN Y LPAWSVSILPDCKNVV NTAK+ S + ++++ L +SS +
Sbjct: 379 KTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKINSASAISNF---VTEDISSLETSSSKW 435
Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHA 482
SW E VGIS + + L EQINTT D SDYLWY+ S+ + G + L+IESLGHA
Sbjct: 436 SWINEPVGISKDDILSKTGLLEQINTTADRSDYLWYSLSLDLADDPGSQTVLHIESLGHA 495
Query: 483 ALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGL 542
F+N KL GN D + ++ I L G N +D+LS+ VGLQNYGA+FD GAG+
Sbjct: 496 LHAFINGKLAGNQAGNSDKSKLNVDIPIALVSGKNKIDLLSLTVGLQNYGAFFDTVGAGI 555
Query: 543 FS-VILIDLKNGKR--DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLI 599
VIL LKNG DLSS +W YQ+G++GE + +S +S W ST P N+ L+
Sbjct: 556 TGPVILKGLKNGNNTLDLSSRKWTYQIGLKGEDL---GLSSGSSGGWNSQSTYPKNQPLV 612
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
WYKT F AP G P+A++ MGKG+AWVNGQSIGRYW Y+A + GCT C+YRG Y +
Sbjct: 613 WYKTNFDAPSGSNPVAIDFTGMGKGEAWVNGQSIGRYWPTYVASNAGCTDSCNYRGPYTS 672
Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEAD 719
SKC+K+CG+P+QTLYH+PR+++ P N LV+ EE GGDP++IS TK + +CS VS++
Sbjct: 673 SKCRKNCGKPSQTLYHVPRSFLKPNGNTLVLFEENGGDPTQISFATKQLESVCSHVSDSH 732
Query: 720 PPPVDSWKPNL---GVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGNCGSFRPGACHMD 775
PP +D W + G V P + L+C I++I FASYG P G CG+F G C +
Sbjct: 733 PPQIDLWNQDTESGGKV--GPALLLSCPNHNQVISSIKFASYGTPLGTCGNFYRGRCSSN 790
Query: 776 -VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
L IV+KAC+G CS+ VS+ G C G+ K+LAVEA C+
Sbjct: 791 KALSIVKKACIGSRSCSVGVSTDTFG---DPCRGVPKSLAVEATCA 833
>gi|357472237|ref|XP_003606403.1| Beta-galactosidase [Medicago truncatula]
gi|355507458|gb|AES88600.1| Beta-galactosidase [Medicago truncatula]
Length = 839
Score = 998 bits (2581), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 500/824 (60%), Positives = 600/824 (72%), Gaps = 14/824 (1%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
+NVTYDHRALVIDGKRRVL SGSIHYPRSTP++WP+LI+KSK+GG++VIETYVFWN HEP
Sbjct: 24 SNVTYDHRALVIDGKRRVLMSGSIHYPRSTPQMWPDLIQKSKDGGIDVIETYVFWNLHEP 83
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
+RGQY FEGR DLV FVK V AGL++HLRIGPY CAEWNYGGFP+WLHFI GI+FRT N
Sbjct: 84 VRGQYNFEGRGDLVGFVKAVAAAGLYVHLRIGPYVCAEWNYGGFPLWLHFIAGIKFRTNN 143
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK EMKRF AKI+D+MKQENL+ASQGGPIIL+Q+ENEYGN++ + Y+ WAA
Sbjct: 144 EPFKAEMKRFTAKIVDMMKQENLYASQGGPIILSQIENEYGNIDTHDARAAKSYIDWAAS 203
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
A +L+T VPW+MCQQ +APDPIINTCN FYCD FTPNS +KP MWTEN+SGWFL+FG A
Sbjct: 204 MATSLDTGVPWIMCQQANAPDPIINTCNSFYCDQFTPNSDNKPKMWTENWSGWFLAFGGA 263
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
VP+RPVEDLAFAVARFF+ GGTFQNYYMY GGTNFGRT GGP ++TSYDYDAPIDEYG I
Sbjct: 264 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFGRTTGGPFISTSYDYDAPIDEYGDI 323
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPKWGHL++LHKAIKLCEE LI+SDPT G LE +Y K+ C+AFLAN SD
Sbjct: 324 RQPKWGHLKDLHKAIKLCEEALIASDPTITSPGPNLETAVY-KTGAVCSAFLANI-GMSD 381
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK-NVNELLLASSA 421
A VTFNGN Y LP WSVSILPDCKNVV NTAKV + K V+ L +SS
Sbjct: 382 ATVTFNGNSYHLPGWSVSILPDCKNVVLNTAKVNTASMISSFATESLKEKVDSLDSSSSG 441
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGH 481
+SW E VGIS +F + L EQINTT D SDYLWY+ SI G + L+IESLGH
Sbjct: 442 WSWISEPVGISTPDAFTKSGLLEQINTTADRSDYLWYSLSIVYEDNAGDQPVLHIESLGH 501
Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
A FVN KL G+ A ++ I L G NT+D+LS+ VGLQNYGA++D GAG
Sbjct: 502 ALHAFVNGKLAGSKAGSSGNAKVNVDIPITLVTGKNTIDLLSLTVGLQNYGAFYDTVGAG 561
Query: 542 LFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLI 599
+ VIL LKNG DL+S +W YQVG++GE++GL S N W S LP N+ L
Sbjct: 562 ITGPVILKGLKNGSSVDLTSQQWTYQVGLQGEFVGL---SSGNVGQWNSQSNLPANQPLT 618
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
WYKT F+AP G P+A++ MGKG+AWVNGQSIGRYW Y++P++GCT C+YRG+Y A
Sbjct: 619 WYKTNFVAPSGSNPVAIDFTGMGKGEAWVNGQSIGRYWPTYISPNSGCTDSCNYRGTYSA 678
Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEAD 719
SKC K+CG+P+QTLYH+PR W+ P N V+ EE GGDP+KIS TK + +CS V+E+
Sbjct: 679 SKCLKNCGKPSQTLYHVPRAWLKPDSNTFVLFEESGGDPTKISFGTKQIESVCSHVTESH 738
Query: 720 PPPVDSWKPNL-GVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCGSFRPGACHMD-V 776
PPPVD+W N P + L C I++I FAS+G P G CG++ G+C +
Sbjct: 739 PPPVDTWNSNAESERKVGPVLSLECPYPNQAISSIKFASFGTPRGTCGNYNHGSCSSNRA 798
Query: 777 LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
L IVQKAC+G C+I VS G C G+ K+LAVEA C+
Sbjct: 799 LSIVQKACIGSSSCNIGVSINTFG---NPCRGVTKSLAVEAACT 839
>gi|449462081|ref|XP_004148770.1| PREDICTED: beta-galactosidase 8-like [Cucumis sativus]
Length = 844
Score = 998 bits (2579), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 487/831 (58%), Positives = 608/831 (73%), Gaps = 22/831 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
L+ NVTYDHRALVIDGKR+VL SGS+HYPRSTPE+WP +I+KSK+GGL+VIETYVFWN H
Sbjct: 23 LAVNVTYDHRALVIDGKRKVLVSGSLHYPRSTPEMWPGIIQKSKDGGLDVIETYVFWNLH 82
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP+R QY FEGR DLV+F+K V AGL++H+RIGPY CAEWNYGGFPVWLHF+PG+QFRT
Sbjct: 83 EPVRNQYDFEGRKDLVKFIKLVGAAGLYVHVRIGPYVCAEWNYGGFPVWLHFVPGVQFRT 142
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK EMKRF AKI+D++KQE L+ASQGGPIIL+Q+ENEYGNV+ ++G + YV+WA
Sbjct: 143 DNEPFKAEMKRFTAKIVDVLKQEKLYASQGGPIILSQIENEYGNVQSSFGSAAKSYVQWA 202
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A +LNT VPWVMC Q DAPDPIINTCNGFYCD FTPNS +KP MWTEN+SGWFLSFG
Sbjct: 203 ATMATSLNTGVPWVMCNQPDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFG 262
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
A+P+RPVEDLAFAVARF++TGG+ QNYYMY GGTNFGRT+GGP +ATSYDYDAPIDEYG
Sbjct: 263 GALPYRPVEDLAFAVARFYQTGGSLQNYYMYHGGTNFGRTSGGPFIATSYDYDAPIDEYG 322
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+RQPKWGHLR++HKAIK+CEE L+S+DP LG LEA +Y KS + C+AFLAN D+
Sbjct: 323 LVRQPKWGHLRDVHKAIKMCEEALVSTDPAVTSLGPNLEATVY-KSGSQCSAFLANVDTQ 381
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAK---VISQRNNGDHPFAQQKNVNELLL 417
SD VTFNGN Y LPAWSVSILPDCKNVV NTAK V ++ + + P + +E
Sbjct: 382 SDKTVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSVTTRPSFSNQPLKVDVSASEAF- 440
Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
S +SW +E +GIS N SF L+EQINTT D SDYLWY+ S + + G
Sbjct: 441 -DSGWSWIDEPIGISKNNSFANLGLSEQINTTADKSDYLWYSLSTDIKGDEPYLANGSNT 499
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L+++SLGH VF+NKKL G G+ + ++ I L G NT+D+LS+ VGLQNYG
Sbjct: 500 VLHVDSLGHVLHVFINKKLAGSGKGSGGSSKVSLDIPITLVPGKNTIDLLSLTVGLQNYG 559
Query: 533 AWFDVAGAGLFSVILIDLK--NGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
A+F++ GAG+ + ++ + N DLSSG+W YQ+G+EGE +GL ++S W
Sbjct: 560 AFFELRGAGVTGPVKLENQKNNITVDLSSGQWTYQIGLEGEDLGLPS---GSTSQWLSQP 616
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
LP NK L WYKTTF AP G PLAL+ GKG+AW+NG SIGRYW +Y+A S CT
Sbjct: 617 NLPKNKPLTWYKTTFDAPAGSDPLALDFTGFGKGEAWINGHSIGRYWPSYIA-SGQCTSY 675
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
CDY+G+Y A+KC ++CG+P+QTLYH+P++W+ P N LV+ EE+G DP++++ +K
Sbjct: 676 CDYKGAYSANKCLRNCGKPSQTLYHVPQSWLKPTGNTLVLFEEIGSDPTRLTFASKQLGS 735
Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWH-IAAINFASYGIPEGNCGSFRP 769
+CS VSE+ PPPV+ W + + P + L C I++I FAS+G P G CGSF
Sbjct: 736 LCSHVSESHPPPVEMWSSDSKQQKTGPVLSLECPSPSQVISSIKFASFGTPRGTCGSFSH 795
Query: 770 GACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G C + L IVQKAC+G CSI VS G C G K+LAVEA+C
Sbjct: 796 GQCSTRNALSIVQKACIGSKSCSIDVSIKAFG---DPCRGKTKSLAVEAYC 843
>gi|449525184|ref|XP_004169598.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 8-like [Cucumis
sativus]
Length = 844
Score = 997 bits (2578), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 490/831 (58%), Positives = 608/831 (73%), Gaps = 22/831 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
L+ NVTYDHRALVIDGKR+VL SGS+HYPRSTPE+WP +I+KSK+GGL+VIETYVFWN H
Sbjct: 23 LAVNVTYDHRALVIDGKRKVLVSGSLHYPRSTPEMWPGIIQKSKDGGLDVIETYVFWNLH 82
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP+R QY FEGR DLV+F+K V AGL++H+RIGPY CAEWNYGGFPVWLHF+PG+QFRT
Sbjct: 83 EPVRNQYDFEGRKDLVKFIKLVGAAGLYVHVRIGPYVCAEWNYGGFPVWLHFVPGVQFRT 142
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK EMKRF AKI+D++KQE L+ASQGGPIIL+Q+ENEYGNV+ ++G + YV+WA
Sbjct: 143 DNEPFKAEMKRFTAKIVDVLKQEKLYASQGGPIILSQIENEYGNVQSSFGSAAKSYVQWA 202
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A +LNT VPWVMC Q DAPDPIINTCNGFYCD FTPNS +KP MWTEN+SGWFLSFG
Sbjct: 203 ATMATSLNTGVPWVMCNQPDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFG 262
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
A+P+RPVEDLAFAVARF++TGG+ QNYYMY GGTNFGRT+GGP +ATSYDYDAPIDEYG
Sbjct: 263 GALPYRPVEDLAFAVARFYQTGGSLQNYYMYHGGTNFGRTSGGPFIATSYDYDAPIDEYG 322
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+RQPKWGHLR++HKAIK+CEE L+S+DP LG LEA +Y KS + C+AFLAN D+
Sbjct: 323 LVRQPKWGHLRDVHKAIKMCEEALVSTDPAVTSLGPNLEATVY-KSGSQCSAFLANVDTQ 381
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAK---VISQRNNGDHPFAQQKNVNELLL 417
SD VTFNGN Y LPAWSVSILPDCKNVV NTAK V ++ + + P + +E
Sbjct: 382 SDKTVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSVTTRPSFSNQPLKVDVSASEAF- 440
Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
S +SW +E +GIS N SF L+EQINTT D SDYLWY+ S + + G
Sbjct: 441 -DSGWSWIDEPIGISKNNSFANLGLSEQINTTADKSDYLWYSLSTDIKGDEPYLANGSNT 499
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L+++SLGH VF+NKKL G G+ + ++ I L G NT+D+LS+ VGLQNYG
Sbjct: 500 VLHVDSLGHVLHVFINKKLAGSGKGSGGSSKVSLDIPITLVPGKNTIDLLSLTVGLQNYG 559
Query: 533 AWFDVAGAGLFS-VILIDLKNG-KRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
A+F++ GAG+ V L + KN DLSSG+W YQ+G+EGE +GL ++S W
Sbjct: 560 AFFELRGAGVTGPVKLENXKNNITVDLSSGQWTYQIGLEGEDLGLPS---GSTSQWLSQP 616
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
LP NK L WYKTTF AP G PLAL+ GKG+AW+NG SIGRYW +Y+A S CT
Sbjct: 617 NLPKNKPLTWYKTTFDAPAGSDPLALDFTGFGKGEAWINGHSIGRYWPSYIA-SGQCTSY 675
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
CDY+G+Y A+KC ++CG+P+QTLYH+P++W+ P N LV+ EE+G DP++++ +K
Sbjct: 676 CDYKGAYSANKCLRNCGKPSQTLYHVPQSWLKPTGNTLVLFEEIGSDPTRLTFASKQLGS 735
Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWH-IAAINFASYGIPEGNCGSFRP 769
+CS VSE+ PPPV+ W + + P + L C I++I FAS+G P G CGSF
Sbjct: 736 LCSHVSESHPPPVEMWSSDSKQQKTGPVLSLECPSPSQVISSIKFASFGTPRGTCGSFSH 795
Query: 770 GACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G C + L IVQKAC+G CSI VS G C G K+LAVEA+C
Sbjct: 796 GQCSTRNALSIVQKACIGSKSCSIDVSIKAFG---DPCRGKTKSLAVEAYC 843
>gi|255578884|ref|XP_002530296.1| beta-galactosidase, putative [Ricinus communis]
gi|223530194|gb|EEF32103.1| beta-galactosidase, putative [Ricinus communis]
Length = 842
Score = 995 bits (2573), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 488/832 (58%), Positives = 610/832 (73%), Gaps = 22/832 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ANVTYDHRAL+IDGKRRVL SGSIHYPRSTPE+WP LI+KSK+GGL+VIETYVFWN H
Sbjct: 21 FAANVTYDHRALLIDGKRRVLISGSIHYPRSTPEMWPGLIQKSKDGGLDVIETYVFWNGH 80
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP+R QY FEGR+DLV+FVK V EAGL++H+RIGPY CAEWNYGGFP+WLHFIPGI+FRT
Sbjct: 81 EPVRNQYNFEGRYDLVKFVKLVAEAGLYVHIRIGPYVCAEWNYGGFPLWLHFIPGIKFRT 140
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK EM+RF AKI+D+MKQE L+ASQGGPIIL+Q+ENEYGN++ A+G + Y+ WA
Sbjct: 141 DNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAFGPAAKTYINWA 200
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A++L+T VPWVMCQQ DAPDP+INTCNGFYCD FTPNS +KP MWTEN+SGWF SFG
Sbjct: 201 AGMAISLDTGVPWVMCQQADAPDPVINTCNGFYCDQFTPNSKNKPKMWTENWSGWFQSFG 260
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
AVP+RPVEDLAFAVARF++ GTFQNYYMY GGTNFGRT GGP ++TSYDYDAP+DEYG
Sbjct: 261 GAVPYRPVEDLAFAVARFYQLSGTFQNYYMYHGGTNFGRTTGGPFISTSYDYDAPLDEYG 320
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+RQPKWGHL+++HKAIKLCEE LI++DPT LG+ LEA +Y K+ + CAAFLAN ++
Sbjct: 321 LLRQPKWGHLKDVHKAIKLCEEALIATDPTTTSLGSNLEATVY-KTGSLCAAFLANI-AT 378
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK---NVNELLL 417
+D VTFNGN Y LPAWSVSILPDCKNV NTAK+ S FA+Q +V+
Sbjct: 379 TDKTVTFNGNSYNLPAWSVSILPDCKNVALNTAKINSVTIVPS--FARQSLVGDVDSSKA 436
Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
S +SW E VGIS N +FV+ L EQINTT D SDYLWY+ S ++ + G +
Sbjct: 437 IGSGWSWINEPVGISKNDAFVKSGLLEQINTTADKSDYLWYSLSTNIKGDEPFLEDGSQT 496
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L++ESLGHA F+N KL G G A ++ I L G NT+D+LS+ VGLQNYG
Sbjct: 497 VLHVESLGHALHAFINGKLAGSGTGKSSNAKVTVDIPITLTPGKNTIDLLSLTVGLQNYG 556
Query: 533 AWFDVAGAGLFSVILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
A++++ GAG+ + + +NG DLSS +W YQ+G++GE G+ S + W T
Sbjct: 557 AFYELTGAGITGPVKLKAQNGNTVDLSSQQWTYQIGLKGEDSGISSGSSSE---WVSQPT 613
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
LP N+ LIWYKT+F AP G P+A++ MGKG+AWVNGQSIGRYW ++PS+GC C
Sbjct: 614 LPKNQPLIWYKTSFDAPAGNDPVAIDFTGMGKGEAWVNGQSIGRYWPTNVSPSSGCADSC 673
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
+YRG Y ++KC K+CG+P+QT YHIPR+W+ N+LV+ EE+GGDP++I+ T+ +
Sbjct: 674 NYRGGYSSNKCLKNCGKPSQTFYHIPRSWIKSSGNILVLLEEIGGDPTQIAFATRQVGSL 733
Query: 712 CSFVSEADPPPVDSWKPNL-GVVSSSPQVRLACERGWH-IAAINFASYGIPEGNCGSFRP 769
CS VSE+ P PVD W + G S P + L C I++I FAS+G P G+CGS+
Sbjct: 734 CSHVSESHPQPVDMWNTDSEGGKRSGPVLSLQCPHPDKVISSIKFASFGTPHGSCGSYSH 793
Query: 770 GAC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G C L IVQKACVG C++ VS G C G+ K+LAVEA C+
Sbjct: 794 GKCSSTSALSIVQKACVGSKSCNVGVSINTFG---DPCRGVKKSLAVEASCT 842
>gi|56201401|dbj|BAD20774.2| beta-galactosidase [Raphanus sativus]
Length = 851
Score = 995 bits (2572), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 489/831 (58%), Positives = 605/831 (72%), Gaps = 25/831 (3%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+VTYDHRALVIDGKR++L SGSIHYPRSTPE+WP+LI+KSK+GGL+VIETYVFWN HEP
Sbjct: 32 SVTYDHRALVIDGKRKILISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNGHEPE 91
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+ +Y FEGR+DLV+FVK +AGL++HLRIGPYACAEWNYGGFPVWLHF+PGI+FRT N
Sbjct: 92 KNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYACAEWNYGGFPVWLHFVPGIKFRTDNE 151
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK EM+RF AKI+DLMKQE L+ASQGGPIIL+Q+ENEYGN++ +YG G+ Y+KW+A
Sbjct: 152 PFKAEMQRFTAKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSSYGAAGKSYMKWSASM 211
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A++L+T VPW MCQQ DAPDPIINTCNGFYCD FTPNS +KP MWTEN+SGWFL FG
Sbjct: 212 ALSLDTGVPWNMCQQGDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGEPS 271
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P+RPVEDLAFAVARFF+ GGTFQNYYMY GGTNF RT+GGPL++TSYDYDAPIDEYG +R
Sbjct: 272 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFERTSGGPLISTSYDYDAPIDEYGLLR 331
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
QPKWGHLR+LHKAIKLCE+ LI++DP LG+ LEA +Y S+ CAAFLAN + SDA
Sbjct: 332 QPKWGHLRDLHKAIKLCEDALIATDPKITSLGSNLEAAVYKTSTGSCAAFLANIGTKSDA 391
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK---NVNELLLASS 420
VTFNG Y LPAWSVSILPDCKNV FNTAK+ S + FA+Q N + S
Sbjct: 392 TVTFNGKSYRLPAWSVSILPDCKNVAFNTAKINSATES--TAFARQSLKPNADSSAELGS 449
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQGKEVFLN 475
+S+ +E VGIS +FV+P L EQINTT D SDYLWY+ + + +G + L+
Sbjct: 450 QWSYIKEPVGISKADAFVKPGLLEQINTTADKSDYLWYSLRMDIKGDETFLDEGSKAVLH 509
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
++S+G F+N KL G G + ++ I L G NT+D+LS+ VGL NYG +F
Sbjct: 510 VQSIGQLVYAFINGKLAGSGNGKQKIS---LDIPINLVTGKNTIDLLSVTVGLANYGPFF 566
Query: 536 DVAGAGLFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
D+ GAG+ V L K G DLSS +W YQVG++GE GL +SS W S LP
Sbjct: 567 DLTGAGITGPVSLKSAKTGSSTDLSSQQWTYQVGLKGEDKGLGS---GDSSEWVSNSPLP 623
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
++ LIWYKTTF AP G P+A++ GKG AWVNGQSIGRYW +A + GC CDY
Sbjct: 624 TSQPLIWYKTTFDAPSGSDPVAIDFTGTGKGIAWVNGQSIGRYWPTSIARTDGCVGSCDY 683
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK-TGQHIC 712
RGSY ++KC K+CG+P+QTLYH+PR+W+ P N LV+ EE+GGDP+KIS TK TG ++C
Sbjct: 684 RGSYRSNKCLKNCGKPSQTLYHVPRSWIKPSGNTLVLLEEMGGDPTKISFATKQTGSNLC 743
Query: 713 SFVSEADPPPVDSWKPNLGVVS-SSPQVRLACERGWH-IAAINFASYGIPEGNCGSFRPG 770
VS++ P PVD+W + + +SP + L C I++I FAS+G P G CGSF G
Sbjct: 744 LTVSQSHPAPVDTWISDSKFSNRTSPVLSLKCPVSTQVISSIRFASFGTPTGTCGSFSYG 803
Query: 771 AC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
C L +VQKACVG C + VS+ G C G++K+LAVEA C+
Sbjct: 804 HCSSARSLSVVQKACVGSRSCKVEVSTRVFGE---PCRGVVKSLAVEASCA 851
>gi|14970841|emb|CAC44501.1| beta-galactosidase [Fragaria x ananassa]
Length = 840
Score = 993 bits (2568), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 494/824 (59%), Positives = 598/824 (72%), Gaps = 21/824 (2%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
V+YDHRALVIDGKRRVL SGSIHYPRSTPE+WP+LI+KSK+GGL+VIETYVFWN HEP+
Sbjct: 29 TVSYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 88
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
RGQY FEGR DLV FVK V EAGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+ RT N
Sbjct: 89 RGQYNFEGRNDLVGFVKAVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKLRTDNE 148
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
P+K EM RF AKI+++MK E L+ASQGGPIIL+Q+ENEYGN++ AYG + Y+ WAA+
Sbjct: 149 PYKAEMHRFTAKIVEMMKNEKLYASQGGPIILSQIENEYGNIDKAYGPAAKTYINWAANM 208
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
AV+L+T VPWVMCQQ DAP +INTCNGFYCD F+PNS S P +WTEN+SGWFLSFG AV
Sbjct: 209 AVSLDTGVPWVMCQQADAPSSVINTCNGFYCDQFSPNSNSTPKIWTENWSGWFLSFGGAV 268
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RPVEDLAFAVARF++ GGTFQNYYMY GGTNFGR++GGP +ATSYDYDAP+DEYG +R
Sbjct: 269 PQRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSSGGPFIATSYDYDAPLDEYGLLR 328
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
QPKWGHL+++HKAIKLCE ++++DPT LG +EA +Y K+ + C+AFLAN D+ SDA
Sbjct: 329 QPKWGHLKDVHKAIKLCEPAMVATDPTISSLGQNIEAAVY-KTGSVCSAFLANVDTKSDA 387
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK---NVNELLLASS 420
VTFNGN Y LPAWSVSILPDCKNVV NTAK+ + F +Q +V S
Sbjct: 388 TVTFNGNSYQLPAWSVSILPDCKNVVINTAKINTATMVPS--FTRQSISADVEPTEAVGS 445
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
+SW E VGIS +F R L EQINTT D SDYLWY+ SI V G + L+++SLG
Sbjct: 446 GWSWINEPVGISKGDAFTRVGLLEQINTTADKSDYLWYSTSIDVKGGYKAD--LHVQSLG 503
Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
HA FVN KL G GN A + +E G NT+D+LS+ VGLQNYGA+FD+ GA
Sbjct: 504 HALHAFVNGKLAGSGTGNSGNAKVSVEIPVEFASGKNTIDLLSLTVGLQNYGAFFDLVGA 563
Query: 541 GLFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
G+ V L NG DLSS +W YQ+G++GE D+ + SS W TLP N+ L
Sbjct: 564 GITGPVQLKGSANGTTIDLSSQQWTYQIGLKGE----DEDLPSGSSQWISQPTLPKNQPL 619
Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
WYKT F AP G P+AL+ MGKG+AWVNGQSIGRYW +AP TGCT C+YRG+Y
Sbjct: 620 TWYKTQFDAPGGSNPVALDFTGMGKGEAWVNGQSIGRYWPTNVAPKTGCT-DCNYRGAYS 678
Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEA 718
A KC+K+CG P+Q LYH+PR+W+ N LV+ EE+GGDP+++S T+ + +CS VSE+
Sbjct: 679 ADKCRKNCGMPSQKLYHVPRSWMKSSGNTLVLFEEVGGDPTQLSFATRQVESLCSHVSES 738
Query: 719 DPPPVDSWKPNLGVVSSS-PQVRLACE-RGWHIAAINFASYGIPEGNCGSFRPGACHMD- 775
P PVD W + S S P++ L C I++I FASYG P G CGSF G+C
Sbjct: 739 HPSPVDMWSSDSKAGSKSRPRLSLECPFPNQVISSIKFASYGRPSGTCGSFSHGSCRSSR 798
Query: 776 VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
L IVQKACVG CSI VS+ G C GL K+LAVEA C
Sbjct: 799 ALSIVQKACVGSKSCSIEVSTHTFG---DPCKGLAKSLAVEASC 839
>gi|224106752|ref|XP_002314274.1| predicted protein [Populus trichocarpa]
gi|222850682|gb|EEE88229.1| predicted protein [Populus trichocarpa]
Length = 849
Score = 991 bits (2563), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 485/830 (58%), Positives = 603/830 (72%), Gaps = 24/830 (2%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NVTYDHRAL+IDGKRRVL SGSIHYPRST E+W +LI+KSK+GGL+VIETYVFWN HEP+
Sbjct: 31 NVTYDHRALLIDGKRRVLVSGSIHYPRSTVEMWADLIQKSKDGGLDVIETYVFWNAHEPV 90
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+ QY FEGR+DLV+F+K V EAGL+ HLRIGPY CAEWNYGGFP+WLHF+PGI+FRT N
Sbjct: 91 QNQYNFEGRYDLVKFIKLVGEAGLYAHLRIGPYVCAEWNYGGFPLWLHFVPGIKFRTDNE 150
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK EM+RF AKI+D+MKQE L+ASQGGPIIL+Q+ENEYGN++ +YG + Y+ WAA
Sbjct: 151 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSSYGPAAKSYINWAASM 210
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
AV+L+T VPWVMCQQ DAPDPIINTCNGFYCD FTPNS +KP MWTEN+SGWFLSFG AV
Sbjct: 211 AVSLDTGVPWVMCQQADAPDPIINTCNGFYCDQFTPNSKNKPKMWTENWSGWFLSFGGAV 270
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P+RPVEDLAFAVARF++ GGTFQNYYMY GGTNFGR+ GGP ++TSYDYDAP+DEYG R
Sbjct: 271 PYRPVEDLAFAVARFYQLGGTFQNYYMYHGGTNFGRSTGGPFISTSYDYDAPLDEYGLTR 330
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
QPKWGHL++LHK+IKLCEE L+++DP LG LEA +Y + C+AFLAN+ +SD
Sbjct: 331 QPKWGHLKDLHKSIKLCEEALVATDPVTSSLGQNLEATVYKTGTGLCSAFLANF-GTSDK 389
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS---S 420
V FNGN Y LP WSVSILPDCKNV NTAK+ S + F Q + + A S
Sbjct: 390 TVNFNGNSYNLPGWSVSILPDCKNVALNTAKINSMTVIPN--FVHQSLIGDADSADTLGS 447
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
++SW E VGIS N +FV+P L EQINTT D SDYLWY+ S + + G + L+
Sbjct: 448 SWSWIYEPVGISKNDAFVKPGLLEQINTTADKSDYLWYSLSTVIKDNEPFLEDGSQTVLH 507
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ESLGHA FVN KL G GN A + + L G NT+D+LS+ GLQNYGA+F
Sbjct: 508 VESLGHALHAFVNGKLAGSGTGNAGNAKVAVEIPVTLLPGKNTIDLLSLTAGLQNYGAFF 567
Query: 536 DVAGAGLFSVILID-LKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
++ GAG+ + ++ LKNG DLSS +W YQ+G++GE +GL + +S W LP
Sbjct: 568 ELEGAGITGPVKLEGLKNGTTVDLSSLQWTYQIGLKGEELGLS----SGNSQWVTQPALP 623
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ LIWYKT+F AP G P+A++ + MGKG+AWVNGQSIGRYW ++P++GC+ C+Y
Sbjct: 624 TKQPLIWYKTSFNAPAGNDPIAIDFSGMGKGEAWVNGQSIGRYWPTKVSPTSGCS-NCNY 682
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
RGSY +SKC K+C +P+QTLYH+PR+WV N LV+ EE+GGDP++I+ TK +CS
Sbjct: 683 RGSYSSSKCLKNCAKPSQTLYHVPRSWVESSGNTLVLFEEIGGDPTQIAFATKQSASLCS 742
Query: 714 FVSEADPPPVDSWKPNL-GVVSSSPQVRLACE-RGWHIAAINFASYGIPEGNCGSFRPGA 771
VSE+ P PVD W N + P + L C I++I FAS+G P G CGSF G
Sbjct: 743 HVSESHPLPVDMWSSNSEAERKAGPVLSLECPFPNQVISSIKFASFGTPRGTCGSFSHGQ 802
Query: 772 CH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
C L IVQKAC+G CSI S++ G C G+ K+LAVEA C+
Sbjct: 803 CKSTRALSIVQKACIGSKSCSIGASASTFG---DPCRGVAKSLAVEASCA 849
>gi|356543464|ref|XP_003540180.1| PREDICTED: beta-galactosidase 8-like isoform 1 [Glycine max]
Length = 840
Score = 990 bits (2560), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 498/824 (60%), Positives = 596/824 (72%), Gaps = 13/824 (1%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
ANV YDHRALVIDGKRRVL SGSIHYPRSTPE+WP+LI+KSK+GGL+VIETYVFWN HEP
Sbjct: 24 ANVEYDHRALVIDGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 83
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
+RGQY F+GR DLV+FVKTV AGL++HLRIGPY CAEWNYGGFPVWLHFIPGI+FRT N
Sbjct: 84 VRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAEWNYGGFPVWLHFIPGIKFRTDN 143
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK EMKRF AKI+D++KQE L+ASQGGP+IL+Q+ENEYGN++ AYG G+ Y+KWAA
Sbjct: 144 EPFKAEMKRFTAKIVDMIKQEKLYASQGGPVILSQIENEYGNIDTAYGAAGKSYIKWAAT 203
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
A +L+T VPWVMC Q DAPDPIINT NGFY D FTPNS +KP MWTEN+SGWFL FG A
Sbjct: 204 MATSLDTGVPWVMCLQADAPDPIINTWNGFYGDEFTPNSNTKPKMWTENWSGWFLVFGGA 263
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
VP+RPVEDLAFAVARFF+ GGTFQNYYMY GGTNF R +GGP +ATSYDYDAPIDEYG I
Sbjct: 264 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRASGGPFIATSYDYDAPIDEYGII 323
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPKWGHL+E+HKAIKLCEE LI++DPT LG LEA +Y K+ + CAAFLAN + SD
Sbjct: 324 RQPKWGHLKEVHKAIKLCEEALIATDPTITSLGPNLEAAVY-KTGSVCAAFLANVGTKSD 382
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK-NVNELLLASSA 421
V F+GN Y LPAWSVSILPDCK+VV NTAK+ S K ++ +S+
Sbjct: 383 VTVNFSGNSYHLPAWSVSILPDCKSVVLNTAKINSASAISSFTTESSKEDIGSSEASSTG 442
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGH 481
+SW E VGIS SF + L EQINTT D SDYLWY+ SI + L+IESLGH
Sbjct: 443 WSWISEPVGISKTDSFSQTGLLEQINTTADKSDYLWYSLSIDYKADASSQTVLHIESLGH 502
Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
A F+N KL GN F ++ + L G NT+D+LS+ VGLQNYGA+FD G G
Sbjct: 503 ALHAFINGKLAGSQPGNSGKYKFTVDIPVTLVAGKNTIDLLSLTVGLQNYGAFFDTWGVG 562
Query: 542 LFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLI 599
+ VIL NG DLSS +W YQVG++GE +GL S +S W ST P N+ L
Sbjct: 563 ITGPVILKGFANGNTLDLSSQKWTYQVGLQGEDLGL---SSGSSGQWNLQSTFPKNQPLT 619
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
WYKTTF AP G P+A++ MGKG+AWVNGQ IGRYW Y+A CT C+YRG Y A
Sbjct: 620 WYKTTFSAPSGSDPVAIDFTGMGKGEAWVNGQRIGRYWPTYVASDASCTDSCNYRGPYSA 679
Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEAD 719
SKC+K+C +P+QTLYH+PR+W+ P N+LV+ EE GGDP++IS +TK + +C+ VS++
Sbjct: 680 SKCRKNCEKPSQTLYHVPRSWLKPSGNILVLFEERGGDPTQISFVTKQTESLCAHVSDSH 739
Query: 720 PPPVDSWKPNL-GVVSSSPQVRLACERGWH-IAAINFASYGIPEGNCGSFRPGACHMD-V 776
PPPVD W P + L C I++I FASYG P G CG+F G C +
Sbjct: 740 PPPVDLWNSETESGRKVGPVLSLTCPHDNQVISSIKFASYGTPLGTCGNFYHGRCSSNKA 799
Query: 777 LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
L IVQKAC+G CS+ VSS G C G+ K+LAVEA C+
Sbjct: 800 LSIVQKACIGSSSCSVGVSSDTFG---DPCRGMAKSLAVEATCA 840
>gi|61162203|dbj|BAD91083.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 842
Score = 989 bits (2556), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 492/831 (59%), Positives = 597/831 (71%), Gaps = 23/831 (2%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A VTYDHRALVIDGKRRVL SGSIHYPRSTPE+WP+LI+KSK+GGL+VIETYVFWN HE
Sbjct: 20 AKVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEA 79
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
+RGQY F GR DLV+FVKTV EAGL++HLRIGPY CAEWNYGGFP+WLHFIPGIQ RT N
Sbjct: 80 VRGQYDFGGRKDLVKFVKTVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIQLRTDN 139
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK EM+RF AKI+D+MK+E L+ASQGGPIIL+Q+ENEYGN++ AYG + Y+KWAAD
Sbjct: 140 EPFKAEMQRFTAKIVDMMKKEKLYASQGGPIILSQIENEYGNIDRAYGAAAQTYIKWAAD 199
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSK-PIMWTENYSGWFLSFGY 241
AV+L+T VPWVMCQQ+DAP +I+TCNGFYCD +TP P K P MWTEN+SGWFLSFG
Sbjct: 200 MAVSLDTGVPWVMCQQDDAPPSVISTCNGFYCDQWTPRLPEKRPKMWTENWSGWFLSFGG 259
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
AVP RPVEDLAFAVARFF+ GGTFQNYYMY GGTNFGR+ GGP +ATSYDYDAPIDEYG
Sbjct: 260 AVPQRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEYGL 319
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPKWGHL+++HKAIKLCEE ++++DP + G +EA +Y K+ + CAAFLAN D+ S
Sbjct: 320 LRQPKWGHLKDVHKAIKLCEEAMVATDPKYSSFGPNVEATVY-KTGSACAAFLANSDTKS 378
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQR---NNGDHPFAQQKNVNELLLA 418
DA VTFNGN Y LPAWSVSILPDCKNVV NTAK+ S + H + +E L
Sbjct: 379 DATVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSAAMIPSFMHHSVLDDIDSSEAL-- 436
Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPG-----QGKEVF 473
S +SW E VGIS +F R L EQINTT D SDYLWY+ SI V G +
Sbjct: 437 GSGWSWINEPVGISKKDAFTRVGLLEQINTTADKSDYLWYSLSIDVTSSDTFLQDGSQTI 496
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L++ESLGHA F+N K G + ++ + G NT+D+LS+ +GLQNYGA
Sbjct: 497 LHVESLGHALHAFINGKPAGRGIITANNGKISVDIPVTFASGKNTIDLLSLTIGLQNYGA 556
Query: 534 WFDVAGAGLFS-VILIDLKNG-KRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
+FD +GAG+ V L LKNG DLSS W YQ+G++GE G S + W T
Sbjct: 557 FFDKSGAGITGPVQLKGLKNGTTTDLSSQRWTYQIGLQGEDSGFSSGSSSQ---WISQPT 613
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
LP + L WYK TF AP+G P+AL+ MGKG+AWVNGQSIGRYW AP++GC C
Sbjct: 614 LPKKQPLTWYKATFNAPDGSNPVALDFTGMGKGEAWVNGQSIGRYWPTNNAPTSGCPDSC 673
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
++RG YD++KC+K+CG+P+Q LYH+PR+W+ P N LV+ EE+GGDP++IS T+ + +
Sbjct: 674 NFRGPYDSNKCRKNCGKPSQELYHVPRSWLKPSGNTLVLFEEIGGDPTQISFATRQIESL 733
Query: 712 CSFVSEADPPPVDSWKPNLGVVSS-SPQVRLACE-RGWHIAAINFASYGIPEGNCGSFRP 769
CS VSE+ P PVD+W + P + L C I++I FASYG P+G CGSF
Sbjct: 734 CSHVSESHPSPVDTWSSDSKAGRKLGPVLSLECPFPNQVISSIKFASYGKPQGTCGSFSH 793
Query: 770 GACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G C L IVQKACVG CSI VS G C G+ K+LAVEA C
Sbjct: 794 GQCKSTSALSIVQKACVGSKSCSIEVSVKTFG---DPCKGVAKSLAVEASC 841
>gi|152013362|sp|Q10NX8.2|BGAL6_ORYSJ RecName: Full=Beta-galactosidase 6; Short=Lactase 6; Flags:
Precursor
Length = 858
Score = 988 bits (2555), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 482/836 (57%), Positives = 595/836 (71%), Gaps = 24/836 (2%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ANVTYDHRA+VIDG RRVL SGSIHYPRSTP++WP LI+KSK+GGL+VIETYVFW+ HE
Sbjct: 30 AANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFWDIHE 89
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
+RGQY FEGR DLVRFVK V +AGL++HLRIGPY CAEWNYGGFPVWLHF+PGI+FRT
Sbjct: 90 AVRGQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTD 149
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N FK EM+RF K++D MK L+ASQGGPIIL+Q+ENEYGN++ AYG G+ Y++WAA
Sbjct: 150 NEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWAA 209
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV+L+T VPWVMCQQ DAPDP+INTCNGFYCD FTPNS SKP MWTEN+SGWFLSFG
Sbjct: 210 GMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLSFGG 269
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
AVP+RP EDLAFAVARF++ GGTFQNYYMY GGTNFGR+ GGP +ATSYDYDAPIDEYG
Sbjct: 270 AVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEYGM 329
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSS 360
+RQPKWGHLR++HKAIKLCE LI+++P++ LG EA +Y + N CAAFLAN D+
Sbjct: 330 VRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSICAAFLANVDAQ 389
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQ------RNNGDHPFAQQKNVNE 414
SD V FNGN Y LPAWSVSILPDCKNVV NTA++ SQ R+ G ++
Sbjct: 390 SDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDTDDSLIT 449
Query: 415 LLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFL 474
LA++ +S+ E VGI+ + +P L EQINTT D SD+LWY+ SI V +G E +L
Sbjct: 450 PELATAGWSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVV---KGDEPYL 506
Query: 475 N-------IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVG 527
N + SLGH +++N KL G+ + + + L G N +D+LS VG
Sbjct: 507 NGSQSNLLVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLSTTVG 566
Query: 528 LQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK 587
L NYGA+FD+ GAG+ + + NG +LSS +W YQ+G+ GE + L S A S W
Sbjct: 567 LSNYGAFFDLVGAGVTGPVKLSGPNGALNLSSTDWTYQIGLRGEDLHLYNPSEA-SPEWV 625
Query: 588 QGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGC 647
+ P N+ LIWYKT F AP G P+A++ MGKG+AWVNGQSIGRYW LAP +GC
Sbjct: 626 SDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSGC 685
Query: 648 TKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
C+YRG+Y ++KC K CGQP+QTLYH+PR+++ PG N LV+ E+ GGDPS IS T+
Sbjct: 686 VNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMISFTTRQ 745
Query: 708 GQHICSFVSEADPPPVDSW-KPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCG 765
IC+ VSE P +DSW P + P +RL C R G I+ I FAS+G P G CG
Sbjct: 746 TSSICAHVSEMHPAQIDSWISPQQTSQTQGPALRLECPREGQVISNIKFASFGTPSGTCG 805
Query: 766 SFRPGAC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
++ G C L +VQ+ACVG CS+PVSS G C G+ K+L VEA CS
Sbjct: 806 NYNHGECSSSQALAVVQEACVGMTNCSVPVSSNNFG---DPCSGVTKSLVVEAACS 858
>gi|115451981|ref|NP_001049591.1| Os03g0255100 [Oryza sativa Japonica Group]
gi|108707232|gb|ABF95027.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113548062|dbj|BAF11505.1| Os03g0255100 [Oryza sativa Japonica Group]
gi|215695246|dbj|BAG90437.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 956
Score = 988 bits (2554), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 482/836 (57%), Positives = 595/836 (71%), Gaps = 24/836 (2%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ANVTYDHRA+VIDG RRVL SGSIHYPRSTP++WP LI+KSK+GGL+VIETYVFW+ HE
Sbjct: 128 AANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFWDIHE 187
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
+RGQY FEGR DLVRFVK V +AGL++HLRIGPY CAEWNYGGFPVWLHF+PGI+FRT
Sbjct: 188 AVRGQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTD 247
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N FK EM+RF K++D MK L+ASQGGPIIL+Q+ENEYGN++ AYG G+ Y++WAA
Sbjct: 248 NEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWAA 307
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV+L+T VPWVMCQQ DAPDP+INTCNGFYCD FTPNS SKP MWTEN+SGWFLSFG
Sbjct: 308 GMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLSFGG 367
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
AVP+RP EDLAFAVARF++ GGTFQNYYMY GGTNFGR+ GGP +ATSYDYDAPIDEYG
Sbjct: 368 AVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEYGM 427
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSS 360
+RQPKWGHLR++HKAIKLCE LI+++P++ LG EA +Y + N CAAFLAN D+
Sbjct: 428 VRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSICAAFLANVDAQ 487
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQ------RNNGDHPFAQQKNVNE 414
SD V FNGN Y LPAWSVSILPDCKNVV NTA++ SQ R+ G ++
Sbjct: 488 SDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDTDDSLIT 547
Query: 415 LLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFL 474
LA++ +S+ E VGI+ + +P L EQINTT D SD+LWY+ SI V +G E +L
Sbjct: 548 PELATAGWSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVV---KGDEPYL 604
Query: 475 N-------IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVG 527
N + SLGH +++N KL G+ + + + L G N +D+LS VG
Sbjct: 605 NGSQSNLLVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLSTTVG 664
Query: 528 LQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK 587
L NYGA+FD+ GAG+ + + NG +LSS +W YQ+G+ GE + L S A S W
Sbjct: 665 LSNYGAFFDLVGAGVTGPVKLSGPNGALNLSSTDWTYQIGLRGEDLHLYNPSEA-SPEWV 723
Query: 588 QGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGC 647
+ P N+ LIWYKT F AP G P+A++ MGKG+AWVNGQSIGRYW LAP +GC
Sbjct: 724 SDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSGC 783
Query: 648 TKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
C+YRG+Y ++KC K CGQP+QTLYH+PR+++ PG N LV+ E+ GGDPS IS T+
Sbjct: 784 VNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMISFTTRQ 843
Query: 708 GQHICSFVSEADPPPVDSW-KPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCG 765
IC+ VSE P +DSW P + P +RL C R G I+ I FAS+G P G CG
Sbjct: 844 TSSICAHVSEMHPAQIDSWISPQQTSQTQGPALRLECPREGQVISNIKFASFGTPSGTCG 903
Query: 766 SFRPGAC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
++ G C L +VQ+ACVG CS+PVSS G C G+ K+L VEA CS
Sbjct: 904 NYNHGECSSSQALAVVQEACVGMTNCSVPVSSNNFG---DPCSGVTKSLVVEAACS 956
>gi|61614851|gb|AAQ21371.2| beta-galactosidase [Sandersonia aurantiaca]
Length = 818
Score = 986 bits (2548), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 484/822 (58%), Positives = 597/822 (72%), Gaps = 18/822 (2%)
Query: 13 VIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGR 72
VIDG RRVL SGSIHYPRSTPE+WP+LI KSK GGL++IETYVFW+ HEP++GQY F+GR
Sbjct: 1 VIDGTRRVLISGSIHYPRSTPEMWPDLIDKSKSGGLDIIETYVFWDLHEPLQGQYDFQGR 60
Query: 73 FDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRF 132
DLVRF+KTV EAGL++HLRIGPYACAEWNYGGFP+WLHFIPGI+FRT N PFK+EM+RF
Sbjct: 61 KDLVRFIKTVGEAGLYVHLRIGPYACAEWNYGGFPLWLHFIPGIKFRTDNKPFKDEMQRF 120
Query: 133 LAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVP 192
KI+DLMKQENL+ASQGGPIIL+Q+ENEYGN+++AYG + Y+ WAA A +L+T VP
Sbjct: 121 TTKIVDLMKQENLYASQGGPIILSQIENEYGNIDFAYGAAAKSYINWAASMATSLDTGVP 180
Query: 193 WVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLA 252
WVMCQQ DAPDPIINTCNGFYCD F+PNS +KP +WTEN+SGWFLSFG VP RPVEDLA
Sbjct: 181 WVMCQQTDAPDPIINTCNGFYCDQFSPNSNNKPKIWTENWSGWFLSFGGPVPQRPVEDLA 240
Query: 253 FAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRE 312
FAVARFF+ GGTFQNYYMY G NFG T+GGP +ATSYDYDAPIDEYG RQPKWGHL+E
Sbjct: 241 FAVARFFQRGGTFQNYYMYTWGNNFGHTSGGPFIATSYDYDAPIDEYGITRQPKWGHLKE 300
Query: 313 LHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVY 372
LHKAIKLCE L+++D +LG LEAH+Y +S CAAFLAN + SDA VTFNG Y
Sbjct: 301 LHKAIKLCEPALVATDHHTLRLGPNLEAHVYKTASGVCAAFLANIGTQSDATVTFNGKSY 360
Query: 373 FLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNV--NELLLASSAF----SWYE 426
LPAWSVSILPDC+ VVFNTA++ SQ + + + +++ ++ + +S F S+
Sbjct: 361 SLPAWSVSILPDCRTVVFNTAQINSQAIHSEMKYLNSESLTSDQQIGSSEVFQSDWSFVI 420
Query: 427 EKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQGKEVFLNIESLGH 481
E VGIS + + + L EQINTT D SDYLWY+ SI + G + L+ ESLGH
Sbjct: 421 EPVGISKSNAIRKTGLLEQINTTADVSDYLWYSISIAIDGDEPFLSNGTQSNLHAESLGH 480
Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
FVN KL G GN A + K I L G N++D+LS VGLQNYGA+FD+ GAG
Sbjct: 481 VLHAFVNGKLAGSGIGNSGNAKIIFEKLIMLTPGNNSIDLLSATVGLQNYGAFFDLMGAG 540
Query: 542 LFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWY 601
+ + + +NG DLSS W YQ+G++GE + L + S + S W STLP N+ LIWY
Sbjct: 541 ITGPVKLKGQNGTLDLSSNAWTYQIGLKGEDLSLHENS-GDVSQWISESTLPKNQPLIWY 599
Query: 602 KTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASK 661
KTTF AP+G P+A++ MGKG+AWVNGQSIGRYW Y +P GC+ C+YRG Y ASK
Sbjct: 600 KTTFNAPDGNDPVAIDFTGMGKGEAWVNGQSIGRYWPTYSSPQNGCSTACNYRGPYSASK 659
Query: 662 CQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPP 721
C K+CG+P+Q LYH+PR+++ N LV+ EE+GGDP++ISL TK +C+ VSE+ P
Sbjct: 660 CIKNCGKPSQILYHVPRSFIQSESNTLVLFEEMGGDPTQISLATKQMTSLCAHVSESHPA 719
Query: 722 PVDSW-KPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCGSFRPGAC-HMDVLP 778
PVD+W S P ++L C I++I FAS+G P G CGSF C VL
Sbjct: 720 PVDTWLSLQQKGKKSGPTIQLECPYPNQVISSIKFASFGTPSGMCGSFNHSQCSSASVLA 779
Query: 779 IVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+VQKACVG CS+ +SS LG C G++K+LAVEA CS
Sbjct: 780 VVQKACVGSKRCSVGISSKTLG---DPCRGVIKSLAVEAACS 818
>gi|125543160|gb|EAY89299.1| hypothetical protein OsI_10800 [Oryza sativa Indica Group]
Length = 861
Score = 985 bits (2547), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 484/839 (57%), Positives = 596/839 (71%), Gaps = 27/839 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ANVTYDHRA+VIDG RRVL SGSIHYPRSTP++WP LI+KSK+GGL+VIETYVFW+ HE
Sbjct: 30 AANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFWDIHE 89
Query: 62 PIRGQ---YYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQF 118
P+RGQ Y FEGR DLVRFVK V +AGL++HLRIGPY CAEWNYGGFPVWLHF+PGI+F
Sbjct: 90 PVRGQAQQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKF 149
Query: 119 RTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVK 178
RT N FK EM+RF K++D MK L+ASQGGPIIL+Q+ENEYGN++ AYG G+ Y++
Sbjct: 150 RTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMR 209
Query: 179 WAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLS 238
WAA AV+L+T VPWVMCQQ DAPDP+INTCNGFYCD FTPNS SKP MWTEN+SGWFLS
Sbjct: 210 WAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLS 269
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDE 298
FG AVP+RP EDLAFAVARF++ GGTFQNYYMY GGTNFGR+ GGP +ATSYDYDAPIDE
Sbjct: 270 FGGAVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDE 329
Query: 299 YGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANY 357
YG +RQPKWGHLR++HKAIKLCE LI+++P++ LG EA +Y + N CAAFLAN
Sbjct: 330 YGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSICAAFLANV 389
Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQ------RNNGDHPFAQQKN 411
D+ SD V FNGN Y LPAWSVSILPDCKNVV NTA++ SQ R+ G +
Sbjct: 390 DAQSDKAVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDTDDS 449
Query: 412 VNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKE 471
+ LA++ +S+ E VGI+ + +P L EQINTT D SD+LWY+ SI V +G E
Sbjct: 450 LITPELATAGWSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVV---KGDE 506
Query: 472 VFLN-------IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSM 524
+LN + SLGH V++N KL G+ + + + L G N +D+LS
Sbjct: 507 PYLNGSQSNLLVNSLGHVLQVYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLST 566
Query: 525 MVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSS 584
VGL NYGA+FD+ GAG+ + + NG +LSS +W YQ+G+ GE + L S A S
Sbjct: 567 TVGLSNYGAFFDLIGAGVTGPVKLSGPNGALNLSSTDWTYQIGLRGEDLHLYNPSEA-SP 625
Query: 585 FWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPS 644
W + P N+ LIWYKT F AP G P+A++ MGKG+AWVNGQSIGRYW LAP
Sbjct: 626 EWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQ 685
Query: 645 TGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLL 704
+GC C+YRG+Y ++KC K CGQP+QTLYH+PR+++ PG N LV+ E+ GGDPS IS
Sbjct: 686 SGCVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMISFT 745
Query: 705 TKTGQHICSFVSEADPPPVDSW-KPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEG 762
T+ IC+ VSE P +DSW P + P +RL C R G I+ I FAS+G P G
Sbjct: 746 TRQTSSICAHVSEMHPAQIDSWISPQQTSQTPGPALRLECPREGQVISNIKFASFGTPSG 805
Query: 763 NCGSFRPGAC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CG++ G C L +VQ+ACVG CS+PVSS G C G+ K+L VEA CS
Sbjct: 806 TCGNYNHGECSSSQALAVVQEACVGMTNCSVPVSSNNFG---DPCSGVTKSLVVEAACS 861
>gi|125583741|gb|EAZ24672.1| hypothetical protein OsJ_08441 [Oryza sativa Japonica Group]
Length = 861
Score = 983 bits (2542), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 482/839 (57%), Positives = 595/839 (70%), Gaps = 27/839 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ANVTYDHRA+VIDG RRVL SGSIHYPRSTP++WP LI+KSK+GGL+VIETYVFW+ HE
Sbjct: 30 AANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFWDIHE 89
Query: 62 PIRGQ---YYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQF 118
+RGQ Y FEGR DLVRFVK V +AGL++HLRIGPY CAEWNYGGFPVWLHF+PGI+F
Sbjct: 90 AVRGQAQQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKF 149
Query: 119 RTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVK 178
RT N FK EM+RF K++D MK L+ASQGGPIIL+Q+ENEYGN++ AYG G+ Y++
Sbjct: 150 RTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMR 209
Query: 179 WAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLS 238
WAA AV+L+T VPWVMCQQ DAPDP+INTCNGFYCD FTPNS SKP MWTEN+SGWFLS
Sbjct: 210 WAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLS 269
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDE 298
FG AVP+RP EDLAFAVARF++ GGTFQNYYMY GGTNFGR+ GGP +ATSYDYDAPIDE
Sbjct: 270 FGGAVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDE 329
Query: 299 YGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANY 357
YG +RQPKWGHLR++HKAIKLCE LI+++P++ LG EA +Y + N CAAFLAN
Sbjct: 330 YGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSICAAFLANV 389
Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQ------RNNGDHPFAQQKN 411
D+ SD V FNGN Y LPAWSVSILPDCKNVV NTA++ SQ R+ G +
Sbjct: 390 DAQSDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDTDDS 449
Query: 412 VNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKE 471
+ LA++ +S+ E VGI+ + +P L EQINTT D SD+LWY+ SI V +G E
Sbjct: 450 LITPELATAGWSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVV---KGDE 506
Query: 472 VFLN-------IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSM 524
+LN + SLGH +++N KL G+ + + + L G N +D+LS
Sbjct: 507 PYLNGSQSNLLVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLST 566
Query: 525 MVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSS 584
VGL NYGA+FD+ GAG+ + + NG +LSS +W YQ+G+ GE + L S A S
Sbjct: 567 TVGLSNYGAFFDLVGAGVTGPVKLSGPNGALNLSSTDWTYQIGLRGEDLHLYNPSEA-SP 625
Query: 585 FWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPS 644
W + P N+ LIWYKT F AP G P+A++ MGKG+AWVNGQSIGRYW LAP
Sbjct: 626 EWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQ 685
Query: 645 TGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLL 704
+GC C+YRG+Y ++KC K CGQP+QTLYH+PR+++ PG N LV+ E+ GGDPS IS
Sbjct: 686 SGCVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMISFT 745
Query: 705 TKTGQHICSFVSEADPPPVDSW-KPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEG 762
T+ IC+ VSE P +DSW P + P +RL C R G I+ I FAS+G P G
Sbjct: 746 TRQTSSICAHVSEMHPAQIDSWISPQQTSQTQGPALRLECPREGQVISNIKFASFGTPSG 805
Query: 763 NCGSFRPGAC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CG++ G C L +VQ+ACVG CS+PVSS G C G+ K+L VEA CS
Sbjct: 806 TCGNYNHGECSSSQALAVVQEACVGMTNCSVPVSSNNFG---DPCSGVTKSLVVEAACS 861
>gi|226503159|ref|NP_001146370.1| uncharacterized protein LOC100279948 precursor [Zea mays]
gi|219886857|gb|ACL53803.1| unknown [Zea mays]
gi|414865885|tpg|DAA44442.1| TPA: beta-galactosidase [Zea mays]
Length = 852
Score = 982 bits (2538), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 488/834 (58%), Positives = 600/834 (71%), Gaps = 23/834 (2%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ANVTYDHRALVIDG RRVL SGSIHYPRSTP++WP LI+K+K+GGL+VIETYVFW+ HE
Sbjct: 27 AANVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKAKDGGLDVIETYVFWDIHE 86
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P+RGQY FEGR DL FVKTV +AGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+FRT
Sbjct: 87 PVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTD 146
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK EM+RF AK++D MK L+ASQGGPIIL+Q+ENEYGN++ AYG G+ Y++WAA
Sbjct: 147 NEPFKAEMQRFTAKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAPGKAYMRWAA 206
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV+L+T VPWVMCQQ DAPDP+INTCNGFYCD FTPNS +KP MWTEN+SGWFLSFG
Sbjct: 207 GMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGWFLSFGG 266
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
AVP+RPVEDLAFAVARF++ GGTFQNYYMY GGTN R++GGP +ATSYDYDAPIDEYG
Sbjct: 267 AVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAPIDEYGL 326
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPKWGHLR++HKAIKLCE LI++DP++ LG +EA +Y K + CAAFLAN D S
Sbjct: 327 VRQPKWGHLRDVHKAIKLCEPALIATDPSYTSLGPNVEAAVY-KVGSVCAAFLANIDGQS 385
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNE------L 415
D VTFNG +Y LPAWSVSILPDCKNVV NTA++ SQ + + + NV
Sbjct: 386 DKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQINSQTTGSEMRYLESSNVASDGSFVTP 445
Query: 416 LLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLN 475
LA S +S+ E VGI+ + + + L EQINTT D SD+LWY+ SI V +G E +LN
Sbjct: 446 ELAVSDWSYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITV---KGDEPYLN 502
Query: 476 -------IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGL 528
+ SLGH V++N K+ G+ + K IEL G N +D+LS VGL
Sbjct: 503 GSQSNLAVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSATVGL 562
Query: 529 QNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQ 588
NYGA+FD+ GAG+ + + NG DLSS EW YQ+G+ GE + L S A S W
Sbjct: 563 SNYGAFFDLVGAGITGPVKLSGLNGALDLSSAEWTYQIGLRGEDLHLYDPSEA-SPEWVS 621
Query: 589 GSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
+ P+N LIWYKT F P G P+A++ MGKG+AWVNGQSIGRYW LAP +GC
Sbjct: 622 ANAYPINHPLIWYKTKFTPPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSGCV 681
Query: 649 KKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTG 708
C+YRG+Y +SKC K CGQP+QTLYH+PR+++ PG N LV+ E GGDPSKIS + +
Sbjct: 682 NSCNYRGAYSSSKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEHFGGDPSKISFVMRQT 741
Query: 709 QHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCGSF 767
+C+ VSEA P +DSW + P +RL C + G I+++ FAS+G P G CGS+
Sbjct: 742 GSVCAQVSEAHPAQIDSWSSQQPMQRYGPALRLECPKEGQVISSVKFASFGTPSGTCGSY 801
Query: 768 RPGAC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G C L IVQ+AC+G CS+PVSS Y G C G+ K+LAVEA CS
Sbjct: 802 SHGECSSTQALSIVQEACIGVSSCSVPVSSNYFG---NPCTGVTKSLAVEAACS 852
>gi|356543466|ref|XP_003540181.1| PREDICTED: beta-galactosidase 8-like isoform 2 [Glycine max]
Length = 848
Score = 981 bits (2535), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 497/832 (59%), Positives = 595/832 (71%), Gaps = 21/832 (2%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
ANV YDHRALVIDGKRRVL SGSIHYPRSTPE+WP+LI+KSK+GGL+VIETYVFWN HEP
Sbjct: 24 ANVEYDHRALVIDGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 83
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
+RGQY F+GR DLV+FVKTV AGL++HLRIGPY CAEWNYGGFPVWLHFIPGI+FRT N
Sbjct: 84 VRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAEWNYGGFPVWLHFIPGIKFRTDN 143
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK EMKRF AKI+D++KQE L+ASQGGP+IL+Q+ENEYGN++ AYG G+ Y+KWAA
Sbjct: 144 EPFKAEMKRFTAKIVDMIKQEKLYASQGGPVILSQIENEYGNIDTAYGAAGKSYIKWAAT 203
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
A +L+T VPWVMC Q DAPDPIINT NGFY D FTPNS +KP MWTEN+SGWFL FG A
Sbjct: 204 MATSLDTGVPWVMCLQADAPDPIINTWNGFYGDEFTPNSNTKPKMWTENWSGWFLVFGGA 263
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
VP+RPVEDLAFAVARFF+ GGTFQNYYMY GGTNF R +GGP +ATSYDYDAPIDEYG I
Sbjct: 264 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRASGGPFIATSYDYDAPIDEYGII 323
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPKWGHL+E+HKAIKLCEE LI++DPT LG LEA +Y K+ + CAAFLAN + SD
Sbjct: 324 RQPKWGHLKEVHKAIKLCEEALIATDPTITSLGPNLEAAVY-KTGSVCAAFLANVGTKSD 382
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK-NVNELLLASSA 421
V F+GN Y LPAWSVSILPDCK+VV NTAK+ S K ++ +S+
Sbjct: 383 VTVNFSGNSYHLPAWSVSILPDCKSVVLNTAKINSASAISSFTTESSKEDIGSSEASSTG 442
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGH 481
+SW E VGIS SF + L EQINTT D SDYLWY+ SI + L+IESLGH
Sbjct: 443 WSWISEPVGISKTDSFSQTGLLEQINTTADKSDYLWYSLSIDYKADASSQTVLHIESLGH 502
Query: 482 AALVFVNKKLVA--------FGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
A F+N KL N F ++ + L G NT+D+LS+ VGLQNYGA
Sbjct: 503 ALHAFINGKLAGKYKLKHSQLIICNSGKYKFTVDIPVTLVAGKNTIDLLSLTVGLQNYGA 562
Query: 534 WFDVAGAGLFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
+FD G G+ VIL NG DLSS +W YQVG++GE +GL S +S W ST
Sbjct: 563 FFDTWGVGITGPVILKGFANGNTLDLSSQKWTYQVGLQGEDLGL---SSGSSGQWNLQST 619
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
P N+ L WYKTTF AP G P+A++ MGKG+AWVNGQ IGRYW Y+A CT C
Sbjct: 620 FPKNQPLTWYKTTFSAPSGSDPVAIDFTGMGKGEAWVNGQRIGRYWPTYVASDASCTDSC 679
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
+YRG Y ASKC+K+C +P+QTLYH+PR+W+ P N+LV+ EE GGDP++IS +TK + +
Sbjct: 680 NYRGPYSASKCRKNCEKPSQTLYHVPRSWLKPSGNILVLFEERGGDPTQISFVTKQTESL 739
Query: 712 CSFVSEADPPPVDSWKPNL-GVVSSSPQVRLACERGWH-IAAINFASYGIPEGNCGSFRP 769
C+ VS++ PPPVD W P + L C I++I FASYG P G CG+F
Sbjct: 740 CAHVSDSHPPPVDLWNSETESGRKVGPVLSLTCPHDNQVISSIKFASYGTPLGTCGNFYH 799
Query: 770 GACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G C + L IVQKAC+G CS+ VSS G C G+ K+LAVEA C+
Sbjct: 800 GRCSSNKALSIVQKACIGSSSCSVGVSSDTFG---DPCRGMAKSLAVEATCA 848
>gi|326506982|dbj|BAJ95568.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 853
Score = 980 bits (2533), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 474/832 (56%), Positives = 592/832 (71%), Gaps = 18/832 (2%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ NVTYDHRALVIDG RRVL SGSIHYPRSTP++WP L++K+K+GGL+V+ETYVFW+ HE
Sbjct: 27 ATNVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLMQKAKDGGLDVVETYVFWDVHE 86
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P+RGQY FEGR DLVRFVK +AGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+ RT
Sbjct: 87 PVRGQYDFEGRNDLVRFVKAAADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKLRTD 146
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK EM+RF K++ MK L+ASQGGPIIL+Q+ENEYGN+ +YG G+ Y++WAA
Sbjct: 147 NEPFKTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIAASYGAAGKSYIRWAA 206
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV L+T VPWVMCQQ DAP+P+INTCNGFYCD FTP+ PS+P +WTEN+SGWFLSFG
Sbjct: 207 GMAVALDTGVPWVMCQQTDAPEPLINTCNGFYCDQFTPSLPSRPKLWTENWSGWFLSFGG 266
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
AVP+RP EDLAFAVARF++ GGT QNYYMY GGTNFGR++GGP ++TSYDYDAPIDEYG
Sbjct: 267 AVPYRPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDEYGL 326
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPKWGHLR++HKAIK+CE LI++DP++ LG EAH+Y KS + CAAFLAN D S
Sbjct: 327 VRQPKWGHLRDVHKAIKMCEPALIATDPSYMSLGQNAEAHVY-KSGSLCAAFLANIDDQS 385
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQ------RNNGDHPFAQQKNVNEL 415
D VTFNG Y LPAWSVSILPDCKNVV NTA++ SQ RN G A + E
Sbjct: 386 DKTVTFNGKAYKLPAWSVSILPDCKNVVLNTAQINSQVASTQMRNLGFSTQASDGSSVEA 445
Query: 416 LLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ----GKE 471
LA+S++S+ E VGI+ + +P L EQINTT D SD+LWY+ SI V G+ G +
Sbjct: 446 ELAASSWSYAVEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVAGGEPYLNGSQ 505
Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
L + SLGH VF+N KL G+ + + + L G N +D+LS VGL NY
Sbjct: 506 SNLLVNSLGHVLQVFINGKLAGSSKGSASSSLISLTTPVTLVTGKNKIDLLSATVGLTNY 565
Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
GA+FD+ GAG+ + + G DLSS EW YQ+G+ GE + L S A S W ++
Sbjct: 566 GAFFDLVGAGITGPVKLTGPKGTLDLSSAEWTYQIGLRGEDLHLYNPSEA-SPEWVSDNS 624
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
P N L WYK+ F AP G P+A++ MGKG+AWVNGQSIGRYW +AP +GC C
Sbjct: 625 YPTNNPLTWYKSKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNIAPQSGCVNSC 684
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
+YRGSY A+KC K CGQP+Q LYH+PR+++ PG N +V+ E+ GG+PSKIS TK + +
Sbjct: 685 NYRGSYSATKCLKKCGQPSQILYHVPRSFLQPGSNDIVLFEQFGGNPSKISFTTKQTESV 744
Query: 712 CSFVSEADPPPVDSW-KPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCGSFRP 769
C+ VSE P +DSW + S P +RL C + G I++I FAS+G P G CGS+
Sbjct: 745 CAHVSEDHPDQIDSWVSSQQKLQRSGPALRLECPKEGQVISSIKFASFGTPSGTCGSYSH 804
Query: 770 GAC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G C L + Q+ACVG CS+PVS+ G C G+ K+L VEA CS
Sbjct: 805 GECSSSQALAVAQEACVGVSSCSVPVSAKNFG---DPCRGVTKSLVVEAACS 853
>gi|242036283|ref|XP_002465536.1| hypothetical protein SORBIDRAFT_01g040750 [Sorghum bicolor]
gi|241919390|gb|EER92534.1| hypothetical protein SORBIDRAFT_01g040750 [Sorghum bicolor]
Length = 860
Score = 978 bits (2527), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 481/835 (57%), Positives = 604/835 (72%), Gaps = 24/835 (2%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ NVTYDHRALVIDG RRVL SGSIHYPRSTP++WP +I+K+K+GGL+VIETYVFW+ HE
Sbjct: 34 ATNVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGIIQKAKDGGLDVIETYVFWDIHE 93
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P+RGQY FEGR DL FVKTV +AGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+FRT
Sbjct: 94 PVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTD 153
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK EM+RF AK++D MK L+ASQGGPIIL+Q+ENEYGN++ AYG G+ Y++WAA
Sbjct: 154 NEPFKTEMQRFTAKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWAA 213
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A++L+T VPWVMCQQ DAPDP+INTCNGFYCD FTPNS +KP MWTEN+SGWFLSFG
Sbjct: 214 GMAISLDTGVPWVMCQQTDAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGWFLSFGG 273
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
AVP+RPVEDLAFAVARF++ GGTFQNYYMY GGTN R++GGP +ATSYDYDAPIDEYG
Sbjct: 274 AVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAPIDEYGL 333
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+R+PKWGHLR++HKAIKLCE LI++DP++ LG EA +Y K+ + CAAFLAN D S
Sbjct: 334 VREPKWGHLRDVHKAIKLCEPALIATDPSYTSLGQNAEAAVY-KTGSVCAAFLANIDGQS 392
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNE------L 415
D VTFNG +Y LPAWSVSILPDCKNVV NTA++ SQ + + + + N+
Sbjct: 393 DKTVTFNGRMYRLPAWSVSILPDCKNVVLNTAQINSQVTSSEMRYLESSNMASDGSFITP 452
Query: 416 LLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLN 475
LA S +S+ E VGI+ + + + L EQINTT D SD+LWY+ SI V +G E +LN
Sbjct: 453 ELAVSGWSYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITV---KGDEPYLN 509
Query: 476 -------IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGL 528
+ SLGH V++N K+ G+ + K IEL G N +D+LS VGL
Sbjct: 510 GSQSNLVVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSATVGL 569
Query: 529 QNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQ 588
NYGA+FD+ GAG+ + + NG DLSS EW YQ+G+ GE + L S A S W
Sbjct: 570 SNYGAFFDLVGAGITGPVKLSGTNGALDLSSAEWTYQIGLRGEDLHLYDPSEA-SPEWVS 628
Query: 589 GSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
+ P+N+ LIWYKT F P G P+A++ MGKG+AWVNGQSIGRYW LAP +GC
Sbjct: 629 ANAYPINQPLIWYKTKFTPPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSGCV 688
Query: 649 KKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTG 708
C+YRGSY+++KC K CGQP+QTLYH+PR+++ PG N +V+ E+ GGDPSKIS + +
Sbjct: 689 NSCNYRGSYNSNKCLKKCGQPSQTLYHVPRSFLQPGSNDIVLFEQFGGDPSKISFVIRQT 748
Query: 709 QHICSFVSEADPPPVDSWKPNLGVVSS-SPQVRLACER-GWHIAAINFASYGIPEGNCGS 766
+C+ VSE P +DSW + + P++RL C + G I++I FAS+G P G CGS
Sbjct: 749 GSVCAQVSEEHPAQIDSWNSSQQTMQRYGPELRLECPKDGQVISSIKFASFGTPSGTCGS 808
Query: 767 FRPGAC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ G C L +VQ+AC+G CS+PVSS Y G C G+ K+LAVEA CS
Sbjct: 809 YSHGECSSTQALSVVQEACIGVSSCSVPVSSNYFG---NPCTGVTKSLAVEAACS 860
>gi|385203117|gb|ADO34790.3| beta-galactosidase STBG5 [Solanum lycopersicum]
Length = 852
Score = 977 bits (2526), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 478/833 (57%), Positives = 600/833 (72%), Gaps = 22/833 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ANVTYDHRALV+DG+RRVL SGSIHYPRSTP++WP+LI+KSK+GGL+VIETYVFWN H
Sbjct: 29 FAANVTYDHRALVVDGRRRVLISGSIHYPRSTPDMWPDLIQKSKDGGLDVIETYVFWNLH 88
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP+R QY FEGR DL+ FVK V++AGLF+H+RIGPY CAEWNYGGFP+WLHFIPGI+FRT
Sbjct: 89 EPVRNQYDFEGRKDLINFVKLVEKAGLFVHIRIGPYVCAEWNYGGFPLWLHFIPGIEFRT 148
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--VEWAYGVGGELYVK 178
N PFK EMKRF AKI+D++KQENL+ASQGGP+IL+Q+ENEYGN +E YG + YV
Sbjct: 149 DNEPFKAEMKRFTAKIVDMIKQENLYASQGGPVILSQIENEYGNGDIESRYGPRAKPYVN 208
Query: 179 WAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLS 238
WAA A +LNT VPWVMCQQ DAP +INTCNGFYCD F NS P MWTEN++GWFLS
Sbjct: 209 WAASMATSLNTGVPWVMCQQPDAPPSVINTCNGFYCDQFKQNSDKTPKMWTENWTGWFLS 268
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDE 298
FG VP+RPVED+AFAVARFF+ GGTFQNYYMY GGTNFGRT+GGP +ATSYDYDAP+DE
Sbjct: 269 FGGPVPYRPVEDIAFAVARFFQRGGTFQNYYMYHGGTNFGRTSGGPFIATSYDYDAPLDE 328
Query: 299 YGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYD 358
YG I QPKWGHL++LHKAIKLCE +++++P LG+ +E +Y K+ + CAAFLAN
Sbjct: 329 YGLINQPKWGHLKDLHKAIKLCEAAMVATEPNITSLGSNIEVSVY-KTDSQCAAFLANTA 387
Query: 359 SSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
+ SDA V+FNGN Y LP WSVSILPDCKNV F+TAK+ S F + + +
Sbjct: 388 TQSDAAVSFNGNSYHLPPWSVSILPDCKNVAFSTAKINSASTIST--FVTRSSEADASGG 445
Query: 419 S-SAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
S S ++ E VGIS +F R L EQINTT D SDYLWY+ S+++ + G
Sbjct: 446 SLSGWTSVNEPVGISNENAFTRMGLLEQINTTADKSDYLWYSLSVNIKNDEPFLQDGSAT 505
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L++++LGH ++N KL G GN +NF I + L G N +D+LS VGLQNYG
Sbjct: 506 VLHVKTLGHVLHAYINGKLSGSGKGNSRHSNFTIEVPVTLVPGENKIDLLSATVGLQNYG 565
Query: 533 AWFDVAGAGLFS-VILIDLKNGK-RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
A+FD+ GAG+ V L KNG DLSS +W YQVG++GE +GL S S+ WK +
Sbjct: 566 AFFDLKGAGITGPVQLKGFKNGSTTDLSSKQWTYQVGLKGEDLGL---SNGGSTLWKSQT 622
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
LP N+ LIWYK +F AP G PL+++ MGKG+AWVNGQSIGR+W AY+AP+ GCT
Sbjct: 623 ALPTNQPLIWYKASFDAPAGDTPLSMDFTGMGKGEAWVNGQSIGRFWPAYIAPNDGCTDP 682
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
C+YRG Y+A KC K+CG+P+Q LYH+PR+W+ N+LV+ EE+GGDP+K+S T+ Q
Sbjct: 683 CNYRGGYNAEKCLKNCGKPSQLLYHVPRSWLKSSGNVLVLFEEMGGDPTKLSFATREIQS 742
Query: 711 ICSFVSEADPPPVDSW-KPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCGSFR 768
+CS +S+A P P+D W + S P + L C I++I FAS+G P+G CGSF
Sbjct: 743 VCSRISDAHPLPIDMWASEDDARKKSGPTLSLECPHPNQVISSIKFASFGTPQGTCGSFI 802
Query: 769 PGAC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G C + L IV+KAC+G CS+ VS G C G+ K+LAVEA C+
Sbjct: 803 HGRCSSSNALSIVKKACIGSKSCSLGVSINAFG---DPCKGVAKSLAVEASCT 852
>gi|350537827|ref|NP_001234312.1| TBG5 protein precursor [Solanum lycopersicum]
gi|7939623|gb|AAF70824.1|AF154423_1 putative beta-galactosidase [Solanum lycopersicum]
Length = 852
Score = 975 bits (2520), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 477/833 (57%), Positives = 598/833 (71%), Gaps = 22/833 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ANVTYDHRALV+DG+RRVL SGSIHYPRSTP++WP+LI+KSK+GGL+VIETYVFWN H
Sbjct: 29 FAANVTYDHRALVVDGRRRVLISGSIHYPRSTPDMWPDLIQKSKDGGLDVIETYVFWNLH 88
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP+R QY FEGR DL+ FVK V+ AGLF+H+RIGPY CAEWNYGGFP+WLHFIPGI+FRT
Sbjct: 89 EPVRNQYDFEGRKDLINFVKLVERAGLFVHIRIGPYVCAEWNYGGFPLWLHFIPGIEFRT 148
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--VEWAYGVGGELYVK 178
N PFK EMKRF AKI+D++KQENL+ASQGGP+IL+Q+ENEYGN +E YG + YV
Sbjct: 149 DNEPFKAEMKRFTAKIVDMIKQENLYASQGGPVILSQIENEYGNGDIESRYGPRAKPYVN 208
Query: 179 WAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLS 238
WAA A +LNT VPWVMCQQ DAP +INTCNGFYCD F NS P MWTEN++GWFLS
Sbjct: 209 WAASMATSLNTGVPWVMCQQPDAPPSVINTCNGFYCDQFKQNSDKTPKMWTENWTGWFLS 268
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDE 298
FG VP+RPVED+AFAVARFF+ GGTFQNYYMY GGTNFGRT+GGP +ATSYDYDAP+DE
Sbjct: 269 FGGPVPYRPVEDIAFAVARFFQRGGTFQNYYMYHGGTNFGRTSGGPFIATSYDYDAPLDE 328
Query: 299 YGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYD 358
YG I QPKWGHL++LHKAIKLCE +++++P LG+ +E +Y K+ + CAAFLAN
Sbjct: 329 YGLINQPKWGHLKDLHKAIKLCEAAMVATEPNVTSLGSNIEVSVY-KTDSQCAAFLANTA 387
Query: 359 SSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
+ SDA V+FNGN Y LP WSVSILPDCKNV F+TAK+ S F + + +
Sbjct: 388 TQSDAAVSFNGNSYHLPPWSVSILPDCKNVAFSTAKINSASTIST--FVTRSSEADASGG 445
Query: 419 S-SAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
S S ++ E VGIS +F R L EQINTT D SDYLWY+ S+++ + G
Sbjct: 446 SLSGWTSVNEPVGISNENAFTRMGLLEQINTTADKSDYLWYSLSVNIKNDEPFLQDGSAT 505
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L++++LGH ++N +L G GN +NF I + L G N +D+LS VGLQNYG
Sbjct: 506 VLHVKTLGHVLHAYINGRLSGSGKGNSRHSNFTIEVPVTLVPGENKIDLLSATVGLQNYG 565
Query: 533 AWFDVAGAGLFS-VILIDLKNGK-RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
A+FD+ GAG+ V L KNG DLSS +W YQVG++GE +GL S S+ WK +
Sbjct: 566 AFFDLKGAGITGPVQLKGFKNGSTTDLSSKQWTYQVGLKGEDLGL---SNGGSTLWKSQT 622
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
LP N+ LIWYK +F AP G PL+++ MGKG+AWVNGQSIGR+W AY+AP+ GCT
Sbjct: 623 ALPTNQPLIWYKASFDAPAGDTPLSMDFTGMGKGEAWVNGQSIGRFWPAYIAPNDGCTDP 682
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
C+YRG Y+A KC K+CG+P+Q LYH+PR+W+ N+LV+ EE+GGDP+K+S T+ Q
Sbjct: 683 CNYRGGYNAEKCLKNCGKPSQLLYHVPRSWLKSSGNVLVLFEEMGGDPTKLSFATREIQS 742
Query: 711 ICSFVSEADPPPVDSW-KPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCGSFR 768
+CS S+A P P+D W + S P + L C I++I FAS+G P+G CGSF
Sbjct: 743 VCSRTSDAHPLPIDMWASEDDARKKSGPTLSLECPHPNQVISSIKFASFGTPQGTCGSFI 802
Query: 769 PGAC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G C + L IV+KAC+G CS+ VS G C G+ K+LAVEA C+
Sbjct: 803 HGRCSSSNALSIVKKACIGSKSCSLGVSINAFG---DPCKGVAKSLAVEASCT 852
>gi|357113057|ref|XP_003558321.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 6-like
[Brachypodium distachyon]
Length = 852
Score = 974 bits (2519), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 471/832 (56%), Positives = 585/832 (70%), Gaps = 18/832 (2%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ NVTYDHRALVIDG RRVL SGSIHYPRSTP++WP L++K+K+GGL+V+ETYVFW+ HE
Sbjct: 26 ATNVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLMQKAKDGGLDVVETYVFWDIHE 85
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
QY FEGR DLVRFVK + GL++HLRIGPY CAEWNYGGFP+WLHFIPGI+FRT
Sbjct: 86 TATXQYDFEGRKDLVRFVKAAADTGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTD 145
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK EM+RF K++ MK L+ASQGGPIIL+Q+ENEYGN++ AYG G+ Y++WAA
Sbjct: 146 NEPFKTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIRWAA 205
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV L+T VPWVMCQQ DAPDP+INTCNGFYCD FTPNS SKP +WTEN+SGWFLSFG
Sbjct: 206 GMAVALDTGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSNSKPKLWTENWSGWFLSFGG 265
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
AVP+RP EDLAFAVARF++ GGT QNYYMY GGTNFGR++GGP ++TSYDYDAPIDEYG
Sbjct: 266 AVPYRPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDEYGL 325
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPKWGHL+++HKAIK CE LI++DP++ +G EAH+Y K+ + CAAFLAN D+ S
Sbjct: 326 VRQPKWGHLKDVHKAIKQCEPALIATDPSYMSMGQNAEAHVY-KAGSVCAAFLANMDTQS 384
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQ------RNNGDHPFAQQKNVNEL 415
D VTFNGN Y LPAWSVSILPDCKNVV NTA++ SQ R+ G A + E
Sbjct: 385 DKTVTFNGNAYKLPAWSVSILPDCKNVVLNTAQINSQTTTSEMRSLGSSTKASDGSSIET 444
Query: 416 LLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ----GKE 471
LA S +S+ E VGI+ + +P L EQINTT D SD+LWY+ S+ V G+ G +
Sbjct: 445 ELALSGWSYAIEPVGITTENALTKPGLMEQINTTADASDFLWYSTSVVVKGGEPYLNGSQ 504
Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
L + SLGH ++N K G+ + + I L G N +D+LS VGL NY
Sbjct: 505 SNLLVNSLGHVLQAYINGKFAGSAKGSATSSLISLQTPITLVPGKNKIDLLSGTVGLSNY 564
Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
GA+FD+ GAG+ + + G DLSS +W YQVG+ GE + L S A S W
Sbjct: 565 GAFFDLVGAGITGPVKLSGPKGVLDLSSTDWTYQVGLRGEGLHLYNPSEA-SPEWVSDKA 623
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
P N+ LIWYK+ F P G P+A++ MGKG+AWVNGQSIGRYW LAP +GC C
Sbjct: 624 YPTNQPLIWYKSKFTTPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSGCVNSC 683
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
+YRG Y +SKC K CGQP+QTLYH+PR+++ PG N +V+ E+ GGDPSKIS TK +
Sbjct: 684 NYRGPYSSSKCLKKCGQPSQTLYHVPRSFLQPGSNDIVLFEQFGGDPSKISFTTKQTASV 743
Query: 712 CSFVSEADPPPVDSW-KPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCGSFRP 769
C+ VSE P +DSW P V S P +RL C + G I++I FAS+G P G CG++
Sbjct: 744 CAHVSEDHPDQIDSWISPQQKVQRSGPALRLECPKAGQVISSIKFASFGTPSGTCGNYNH 803
Query: 770 GACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G C L + Q+AC+G CS+PVS+ G C G+ K+L VEA CS
Sbjct: 804 GECSSPQALAVAQEACIGVSSCSVPVSTKNFG---DPCTGVTKSLVVEAACS 852
>gi|357453869|ref|XP_003597215.1| Beta-galactosidase [Medicago truncatula]
gi|355486263|gb|AES67466.1| Beta-galactosidase [Medicago truncatula]
Length = 866
Score = 966 bits (2498), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 489/850 (57%), Positives = 598/850 (70%), Gaps = 41/850 (4%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
NV YDHRALVIDGKRRVL SGSIHYPRSTP++WP+LI+KSK+GGL+VIETYVFWN HEP
Sbjct: 20 TNVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSKDGGLDVIETYVFWNLHEP 79
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
++GQY F+GR DLV+FVK V EAGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+FRT N
Sbjct: 80 VKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTDN 139
Query: 123 NPFK--EEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
PFK EMKRF AKI+DLMKQE L+ASQGGPIIL+Q+ENEYG+++ AYG G+ Y+ WA
Sbjct: 140 EPFKVEAEMKRFTAKIVDLMKQEKLYASQGGPIILSQIENEYGDIDSAYGSAGKSYINWA 199
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A +L+T VPWVMCQQEDAPD IINTCNGFYCD FTPNS +KP MWTEN+S W+L FG
Sbjct: 200 AKMATSLDTGVPWVMCQQEDAPDSIINTCNGFYCDQFTPNSNTKPKMWTENWSAWYLLFG 259
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYM---------------------YFGGTNFGR 279
P RPVEDLAFAVARFF+ GGTFQNYYM Y GGTNF R
Sbjct: 260 GGFPHRPVEDLAFAVARFFQRGGTFQNYYMVLQPEMFFTSSIYYMVLFLRPYHGGTNFDR 319
Query: 280 TAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLE 339
+ GGP +ATSYD+DAPIDEYG IRQPKWGHL++LHKA+KLCEE LI+++P LG LE
Sbjct: 320 STGGPFIATSYDFDAPIDEYGIIRQPKWGHLKDLHKAVKLCEEALIATEPKITSLGPNLE 379
Query: 340 AHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQR 399
A +Y K+ + CAAFLAN D+ SD V F+GN Y LPAWSVSILPDCKNVV NTAK+ S
Sbjct: 380 AAVY-KTGSVCAAFLANVDTKSDKTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKINSAS 438
Query: 400 NNGDHPFAQQK-NVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWY 458
+ K +++ L +SS +SW E VGIS + F + L EQIN T D SDYLWY
Sbjct: 439 AISNFVTKSSKEDISSLETSSSKWSWINEPVGISKDDIFSKTGLLEQINITADRSDYLWY 498
Query: 459 TASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINT 518
+ S+ + G + L+IESLGHA FVN KL GN D ++ I++ G N
Sbjct: 499 SLSVDLKDDLGSQTVLHIESLGHALHAFVNGKLAGSHTGNKDKPKLNVDIPIKVIYGNNQ 558
Query: 519 LDILSMMVGLQNYGAWFDVAGAGLFS-VILIDLKNGKR--DLSSGEWIYQVGVEGEYIGL 575
+D+LS+ VGLQNYGA+FD GAG+ V L LKNG DLSS +W YQVG++GE +GL
Sbjct: 559 IDLLSLTVGLQNYGAFFDRWGAGITGPVTLKGLKNGNNTLDLSSQKWTYQVGLKGEDLGL 618
Query: 576 DKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGR 635
S +S W ST P N+ LIWYKT F AP G P+A++ MGKG+AWVNGQSIGR
Sbjct: 619 ---SSGSSEGWNSQSTFPKNQPLIWYKTNFDAPSGSNPVAIDFTGMGKGEAWVNGQSIGR 675
Query: 636 YWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
YW Y+A + CT C+YRG + +KC +CG+P+QTLYH+PR+++ P N LV+ EE G
Sbjct: 676 YWPTYVASNADCTDSCNYRGPFTQTKCHMNCGKPSQTLYHVPRSFLKPNGNTLVLFEENG 735
Query: 696 GDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNL---GVVSSSPQVRLAC-ERGWHIAA 751
GDP++I+ TK + +C+ VS++ PP +D W + G V P + L C I +
Sbjct: 736 GDPTQIAFATKQLESLCAHVSDSHPPQIDLWNQDTTSWGKVG--PALLLNCPNHNQVIFS 793
Query: 752 INFASYGIPEGNCGSFRPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLL 810
I FASYG P G CG+F G C + L IV+KAC+G CSI VS+ G C G+
Sbjct: 794 IKFASYGTPLGTCGNFYRGRCSSNKALSIVKKACIGSRSCSIGVSTDTFG---DPCRGVP 850
Query: 811 KALAVEAHCS 820
K+LAVEA C+
Sbjct: 851 KSLAVEATCA 860
>gi|414865886|tpg|DAA44443.1| TPA: hypothetical protein ZEAMMB73_968467 [Zea mays]
Length = 830
Score = 939 bits (2427), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 475/834 (56%), Positives = 582/834 (69%), Gaps = 45/834 (5%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ANVTYDHRALVIDG RRVL SGSIHYPRSTP++WP LI+K+K+GGL+VIETYVFW+ HE
Sbjct: 27 AANVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKAKDGGLDVIETYVFWDIHE 86
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P+RGQY FEGR DL FVKTV +AGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+FRT
Sbjct: 87 PVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTD 146
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK EM+RF AKI ENEYGN++ AYG G+ Y++WAA
Sbjct: 147 NEPFKAEMQRFTAKI----------------------ENEYGNIDSAYGAPGKAYMRWAA 184
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV+L+T VPWVMCQQ DAPDP+INTCNGFYCD FTPNS +KP MWTEN+SGWFLSFG
Sbjct: 185 GMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGWFLSFGG 244
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
AVP+RPVEDLAFAVARF++ GGTFQNYYMY GGTN R++GGP +ATSYDYDAPIDEYG
Sbjct: 245 AVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAPIDEYGL 304
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPKWGHLR++HKAIKLCE LI++DP++ LG +EA +Y K + CAAFLAN D S
Sbjct: 305 VRQPKWGHLRDVHKAIKLCEPALIATDPSYTSLGPNVEAAVY-KVGSVCAAFLANIDGQS 363
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNE------L 415
D VTFNG +Y LPAWSVSILPDCKNVV NTA++ SQ + + + NV
Sbjct: 364 DKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQINSQTTGSEMRYLESSNVASDGSFVTP 423
Query: 416 LLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLN 475
LA S +S+ E VGI+ + + + L EQINTT D SD+LWY+ SI V +G E +LN
Sbjct: 424 ELAVSDWSYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITV---KGDEPYLN 480
Query: 476 -------IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGL 528
+ SLGH V++N K+ G+ + K IEL G N +D+LS VGL
Sbjct: 481 GSQSNLAVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSATVGL 540
Query: 529 QNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQ 588
NYGA+FD+ GAG+ + + NG DLSS EW YQ+G+ GE + L S A S W
Sbjct: 541 SNYGAFFDLVGAGITGPVKLSGLNGALDLSSAEWTYQIGLRGEDLHLYDPSEA-SPEWVS 599
Query: 589 GSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
+ P+N LIWYKT F P G P+A++ MGKG+AWVNGQSIGRYW LAP +GC
Sbjct: 600 ANAYPINHPLIWYKTKFTPPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSGCV 659
Query: 649 KKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTG 708
C+YRG+Y +SKC K CGQP+QTLYH+PR+++ PG N LV+ E GGDPSKIS + +
Sbjct: 660 NSCNYRGAYSSSKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEHFGGDPSKISFVMRQT 719
Query: 709 QHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCGSF 767
+C+ VSEA P +DSW + P +RL C + G I+++ FAS+G P G CGS+
Sbjct: 720 GSVCAQVSEAHPAQIDSWSSQQPMQRYGPALRLECPKEGQVISSVKFASFGTPSGTCGSY 779
Query: 768 RPGAC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G C L IVQ+AC+G CS+PVSS Y G C G+ K+LAVEA CS
Sbjct: 780 SHGECSSTQALSIVQEACIGVSSCSVPVSSNYFG---NPCTGVTKSLAVEAACS 830
>gi|108707233|gb|ABF95028.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 796
Score = 928 bits (2399), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 454/803 (56%), Positives = 564/803 (70%), Gaps = 24/803 (2%)
Query: 35 VWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIG 94
+WP LI+KSK+GGL+VIETYVFW+ HE +RGQY FEGR DLVRFVK V +AGL++HLRIG
Sbjct: 1 MWPGLIQKSKDGGLDVIETYVFWDIHEAVRGQYDFEGRKDLVRFVKAVADAGLYVHLRIG 60
Query: 95 PYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPII 154
PY CAEWNYGGFPVWLHF+PGI+FRT N FK EM+RF K++D MK L+ASQGGPII
Sbjct: 61 PYVCAEWNYGGFPVWLHFVPGIKFRTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPII 120
Query: 155 LAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC 214
L+Q+ENEYGN++ AYG G+ Y++WAA AV+L+T VPWVMCQQ DAPDP+INTCNGFYC
Sbjct: 121 LSQIENEYGNIDSAYGAAGKAYMRWAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYC 180
Query: 215 DGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGG 274
D FTPNS SKP MWTEN+SGWFLSFG AVP+RP EDLAFAVARF++ GGTFQNYYMY GG
Sbjct: 181 DQFTPNSKSKPKMWTENWSGWFLSFGGAVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGG 240
Query: 275 TNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKL 334
TNFGR+ GGP +ATSYDYDAPIDEYG +RQPKWGHLR++HKAIKLCE LI+++P++ L
Sbjct: 241 TNFGRSTGGPFIATSYDYDAPIDEYGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSL 300
Query: 335 GAKLEAHIYHKSSND-CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTA 393
G EA +Y + N CAAFLAN D+ SD V FNGN Y LPAWSVSILPDCKNVV NTA
Sbjct: 301 GQNTEATVYQTADNSICAAFLANVDAQSDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTA 360
Query: 394 KVISQ------RNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQIN 447
++ SQ R+ G ++ LA++ +S+ E VGI+ + +P L EQIN
Sbjct: 361 QINSQVTTSEMRSLGSSIQDTDDSLITPELATAGWSYAIEPVGITKENALTKPGLMEQIN 420
Query: 448 TTKDTSDYLWYTASIHVMPGQGKEVFLN-------IESLGHAALVFVNKKLVAFGYGNHD 500
TT D SD+LWY+ SI V +G E +LN + SLGH +++N KL G+
Sbjct: 421 TTADASDFLWYSTSIVV---KGDEPYLNGSQSNLLVNSLGHVLQIYINGKLAGSAKGSAS 477
Query: 501 FANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSG 560
+ + + L G N +D+LS VGL NYGA+FD+ GAG+ + + NG +LSS
Sbjct: 478 SSLISLQTPVTLVPGKNKIDLLSTTVGLSNYGAFFDLVGAGVTGPVKLSGPNGALNLSST 537
Query: 561 EWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLAS 620
+W YQ+G+ GE + L S A S W + P N+ LIWYKT F AP G P+A++
Sbjct: 538 DWTYQIGLRGEDLHLYNPSEA-SPEWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTG 596
Query: 621 MGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTW 680
MGKG+AWVNGQSIGRYW LAP +GC C+YRG+Y ++KC K CGQP+QTLYH+PR++
Sbjct: 597 MGKGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSF 656
Query: 681 VHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSW-KPNLGVVSSSPQV 739
+ PG N LV+ E+ GGDPS IS T+ IC+ VSE P +DSW P + P +
Sbjct: 657 LQPGSNDLVLFEQFGGDPSMISFTTRQTSSICAHVSEMHPAQIDSWISPQQTSQTQGPAL 716
Query: 740 RLACER-GWHIAAINFASYGIPEGNCGSFRPGAC-HMDVLPIVQKACVGQIECSIPVSSA 797
RL C R G I+ I FAS+G P G CG++ G C L +VQ+ACVG CS+PVSS
Sbjct: 717 RLECPREGQVISNIKFASFGTPSGTCGNYNHGECSSSQALAVVQEACVGMTNCSVPVSSN 776
Query: 798 YLGVSAGACPGLLKALAVEAHCS 820
G C G+ K+L VEA CS
Sbjct: 777 NFG---DPCSGVTKSLVVEAACS 796
>gi|168001886|ref|XP_001753645.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695052|gb|EDQ81397.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 929
Score = 906 bits (2341), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 453/851 (53%), Positives = 577/851 (67%), Gaps = 41/851 (4%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NVTYD RAL+I+G+RR+L S IHYPR+TPE+WP L++KSKEGG +V+++YVFWN HEP
Sbjct: 34 NVTYDQRALIINGQRRMLISAGIHYPRATPEMWPSLVQKSKEGGADVVQSYVFWNGHEPK 93
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+GQY FEGR+DLV+F+K VQ+AGL+ HLRIGPY CAEWN+GGFP WL IPGI FRT N
Sbjct: 94 QGQYNFEGRYDLVKFIKVVQQAGLYFHLRIGPYVCAEWNFGGFPYWLKDIPGIVFRTDNE 153
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK M+ F++KI++LMK+ LFA QGGPII+AQ+ENEYGN+EWA+G GG+ Y WAA+
Sbjct: 154 PFKVAMEGFVSKIVNLMKENQLFAWQGGPIIMAQIENEYGNIEWAFGDGGKRYAMWAAEL 213
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A+ L+ VPWVMCQQ+DAP IINTCNG+YCDGF N+ +KP WTE+++GWF +G +V
Sbjct: 214 ALGLDAGVPWVMCQQDDAPGNIINTCNGYYCDGFKANTATKPAFWTEDWNGWFQYWGQSV 273
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RPVED AFA+ARFF+ GG+FQNYYMYFGGTNF RTAGGP + TSYDYDAP+DEYG IR
Sbjct: 274 PHRPVEDNAFAIARFFQRGGSFQNYYMYFGGTNFARTAGGPFMTTSYDYDAPLDEYGLIR 333
Query: 304 QPKWGHLRELHKAIKLCEEYLISSD--PTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
QPKWGHLR+LH AIKLCE L + D P LG +EAH+Y CAAFLAN DS
Sbjct: 334 QPKWGHLRDLHAAIKLCEPALTAVDEVPLSTWLGPNVEAHVY-SGRGQCAAFLANIDSWK 392
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS-- 419
A V F G Y LP WSVSILPDCKNVVFNTA+V +Q + K E+++ S
Sbjct: 393 IATVQFKGKAYVLPPWSVSILPDCKNVVFNTAQVGAQTTLTRMTIVRSKLEGEVVMPSNM 452
Query: 420 -----------SAFSWYE--EKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-- 464
S W E VGI G + V L EQ+N TKD++DYLWY+ SI V
Sbjct: 453 LRKHAPESIVGSGLKWEASVEPVGIRGAATLVSNRLLEQLNITKDSTDYLWYSISIKVSV 512
Query: 465 -----MPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTL 519
+ + L + S+ A +FVN++LV G ++ + + + L EG N +
Sbjct: 513 EAVTALSKTKSQAILVLGSMRDAVHIFVNRQLVGSAMG----SDVQVVQPVPLKEGKNDI 568
Query: 520 DILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKI 578
D+LSM VGLQNYGA+ + GAG+ S +L L +G DLS+ W YQVG++GE L +
Sbjct: 569 DLLSMTVGLQNYGAYLETWGAGIRGSALLRGLPSGVLDLSTERWSYQVGIQGEEKRLFET 628
Query: 579 SLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
A+ W S+ P +L WYKTTF AP+G P+AL+L SMGKGQAWVNG +GRYW
Sbjct: 629 GTADGIQWDSSSSFPNASALTWYKTTFDAPKGTDPVALDLGSMGKGQAWVNGHHMGRYWP 688
Query: 639 AYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQT-----LYHIPRTWVHPGENLLVIHEE 693
+ LA +GC+ CDYRG+YDA KC+ +CG+P+Q +YHIPR W+ NLLV+ EE
Sbjct: 689 SVLASQSGCS-TCDYRGAYDADKCRTNCGKPSQRWQYVDMYHIPRAWLQLSNNLLVLFEE 747
Query: 694 LGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNL---GVVSSSPQVRLACERGWHIA 750
+GGD SK+SL+T++ +C+ V E+ PPPV W N + S S + L C G HI
Sbjct: 748 IGGDVSKVSLVTRSAPAVCTHVHESQPPPVLFWPANSSMDAMSSRSGEAVLECIAGQHIR 807
Query: 751 AINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGL 809
I FAS+G P+G+CG+F+ G CH M L + +KAC+G CSIPV G CP +
Sbjct: 808 HIKFASFGNPKGSCGNFQRGTCHAMKSLEVARKACMGMHRCSIPVQWQTFG-EFDPCPDV 866
Query: 810 LKALAVEAHCS 820
K+LAV+ CS
Sbjct: 867 SKSLAVQVFCS 877
>gi|148906967|gb|ABR16628.1| unknown [Picea sitchensis]
Length = 836
Score = 906 bits (2341), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 448/832 (53%), Positives = 560/832 (67%), Gaps = 28/832 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ VTYDH+ALVI+G+RR+L SGSIHYPRST E+WP+L RK+K+GGL+VI+TYVFWN H
Sbjct: 21 VECGVTYDHKALVINGERRILISGSIHYPRSTAEMWPDLFRKAKDGGLDVIQTYVFWNMH 80
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G Y FEGRFDLV+FVK QEAGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 81 EPSPGNYNFEGRFDLVKFVKLAQEAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 140
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M+ F K++DLMK E LF SQGGPIILAQVENEY E YG+ G Y+ WA
Sbjct: 141 DNEPFKNAMEGFTKKVVDLMKSEGLFESQGGPIILAQVENEYKPEEMEYGLAGAQYMNWA 200
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV ++T VPWVMC+Q+DAPDP+INTCNGFYCD F PN P KP MWTE +SGW+ FG
Sbjct: 201 AQMAVGMDTGVPWVMCKQDDAPDPVINTCNGFYCDNFVPNKPYKPTMWTEAWSGWYTEFG 260
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
A P RPVEDLAFAVARFF GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG
Sbjct: 261 GASPHRPVEDLAFAVARFFVKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYG 320
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IRQPKWGHL+ELHKAIKLCE L+S DP LG +A++Y + +CAAF+ NYDS+
Sbjct: 321 LIRQPKWGHLKELHKAIKLCEPALVSGDPVVTSLGHFQQAYVYSAGAGNCAAFIVNYDSN 380
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S V FNG Y + WSVSILPDC+NVVFNTAKV Q +Q K +
Sbjct: 381 SVGRVIFNGQRYKIAPWSVSILPDCRNVVFNTAKVDVQT-------SQMK-----MTPVG 428
Query: 421 AFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVF 473
F W +E + + S L EQIN T+D +DYLWY S+ V + G
Sbjct: 429 GFGWESIDENIASFEDNSISAVGLLEQINITRDNTDYLWYITSVEVDEDEPFIKNGGLPV 488
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L ++S G A VF+N L YG + + + LN G N + +LSM VGLQN G
Sbjct: 489 LTVQSAGDALHVFINDDLAGSQYGRKENPKVRFSSGVRLNVGTNKISLLSMTVGLQNIGP 548
Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
F++A AG+ + L K+G RDLSS W YQ+G++GE + L S N+ W +G +
Sbjct: 549 HFEMANAGVLGPITLSGFKDGTRDLSSQRWSYQIGLKGETMNL-HTSGDNTVEWMKGVAV 607
Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
P ++ L WYK F AP G+ PL L+L+SMGKGQAWVNGQSIGRYW +YLA C+ C
Sbjct: 608 PQSQPLRWYKAEFDAPAGEDPLGLDLSSMGKGQAWVNGQSIGRYWPSYLAEGV-CSDGCS 666
Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHIC 712
Y G+Y KC +CGQ +Q YH+PR+W+ P N LV+ EE+GG+PS +SL+T++ +C
Sbjct: 667 YEGTYRPHKCDTNCGQSSQRWYHVPRSWLQPSGNTLVLFEEIGGNPSGVSLVTRSVDSVC 726
Query: 713 SFVSEADPPPVDSWKPNLGVVSSS---PQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
+ VSE+ ++ W+ P+V L C +G I+AI FAS+G P+G CGSF+
Sbjct: 727 AHVSESHSQSINFWRLESTDQVQKLHIPKVHLQCSKGQRISAIKFASFGTPQGLCGSFQQ 786
Query: 770 GACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G CH + + +QK C+G +CS+ VS G CPG+ K +A+EA CS
Sbjct: 787 GDCHSPNSVATIQKKCMGLRKCSLSVSEKIFG--GDPCPGVRKGVAIEAVCS 836
>gi|157313304|gb|ABV32545.1| beta-galactosidase protein 2 [Prunus persica]
Length = 841
Score = 892 bits (2305), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 448/837 (53%), Positives = 573/837 (68%), Gaps = 40/837 (4%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A+V+YD +A+VI+G+RR+L SGSIHYPRS+PE+WP+LI+K+KEGGL+VI+TYVFWN HEP
Sbjct: 26 ASVSYDSKAIVINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEP 85
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G+YYFE +DLV+F+K +Q+AGL++HLRIGPY CAEWN+GGFPVWL +IPGIQFRT N
Sbjct: 86 SPGKYYFEDNYDLVKFIKLIQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGIQFRTDN 145
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK +M+RF KI+++MK E LF SQGGPIIL+Q+ENEYG +E+ G G++Y WAA
Sbjct: 146 GPFKAQMQRFTTKIVNMMKAERLFQSQGGPIILSQIENEYGPMEYELGAPGKVYTDWAAH 205
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
A+ L T VPWVMC+Q+DAPDPIIN CNGFYCD F+PN KP MWTE ++GW+ FG A
Sbjct: 206 MALGLGTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKMWTEAWTGWYTEFGGA 265
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
VP RP EDLAF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG +
Sbjct: 266 VPSRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLL 325
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPKWGHL++LH+AIKLCE L+S+DPT LG EAH++ S CAAFLANY+ S
Sbjct: 326 RQPKWGHLKDLHRAIKLCEPALVSADPTVTPLGTYQEAHVFKSKSGACAAFLANYNPRSF 385
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A V F Y LP WS+SILPDCKN V+NTA+V +Q AQ K L AF
Sbjct: 386 AKVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQS-------AQMKMPRVPL--HGAF 436
Query: 423 SW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
SW Y ++ + SF L EQINTT+D+SDYLWY + + P + GK L
Sbjct: 437 SWQAYNDETATYADTSFTTAGLLEQINTTRDSSDYLWYLTDVKIDPNEEFLRSGKYPVLT 496
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
I S GHA VF+N +L YG+ +F ++ + L GIN + +LS+ VGL N G F
Sbjct: 497 ILSAGHALRVFINGQLAGTSYGSLEFPKLTFSQGVNLRAGINQIALLSIAVGLPNVGPHF 556
Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+ AG+ VIL L G+RDLS +W Y+VG++GE + L +S ++S W QGS +
Sbjct: 557 ETWNAGVLGPVILNGLNEGRRDLSWQKWSYKVGLKGEALSLHSLSGSSSVEWIQGSLVTR 616
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
+ L WYKTTF AP G PLAL++ SMGKGQ W+NG+SIGRYW AY A +G C+Y
Sbjct: 617 RQPLTWYKTTFNAPAGNSPLALDMGSMGKGQVWINGRSIGRYWPAYKA--SGSCGACNYA 674
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
GSY KC +CG+ +Q YH+PRTW++P NLLV+ EE GGDP+ I L+ + IC+
Sbjct: 675 GSYHEKKCLSNCGEASQRWYHVPRTWLNPTGNLLVVLEEWGGDPNGIFLVRREIDSICAD 734
Query: 715 VSEADPPPVDSWKPNL--------GVVSS--SPQVRLACERGWHIAAINFASYGIPEGNC 764
+ E W+PNL G V P+ L+C G I++I FAS+G PEG C
Sbjct: 735 IYE--------WQPNLMSWQMQASGKVKKPVRPKAHLSCGPGQKISSIKFASFGTPEGGC 786
Query: 765 GSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
GSFR G+CH + Q++C+GQ CS+ V+ G CP ++K L+VEA CS
Sbjct: 787 GSFREGSCHAHNSYDAFQRSCIGQNSCSVTVAPENFG--GDPCPNVMKKLSVEAICS 841
>gi|224087947|ref|XP_002308268.1| predicted protein [Populus trichocarpa]
gi|222854244|gb|EEE91791.1| predicted protein [Populus trichocarpa]
Length = 838
Score = 890 bits (2301), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/832 (51%), Positives = 569/832 (68%), Gaps = 29/832 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
++A+V+YDH+A++I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GG++VI+TYVFWN H
Sbjct: 24 VTASVSYDHKAVIINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGVDVIQTYVFWNGH 83
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G YYFE R+DLV+F+K VQ+AGL+LHLRIGPY CAEWN+GGFPVWL ++PGI+FRT
Sbjct: 84 EPSPGNYYFEDRYDLVKFIKLVQQAGLYLHLRIGPYICAEWNFGGFPVWLKYVPGIEFRT 143
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M++F KI+ +MK E LF +QGGPIIL+Q+ENEYG VEW G G+ Y KWA
Sbjct: 144 DNGPFKAAMQKFTEKIVGMMKSEKLFENQGGPIILSQIENEYGPVEWEIGAPGKAYTKWA 203
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
AD AV L T VPW+MC+QEDAPDP+I+TCNGFYC+ F PN KP +WTE ++GW+ FG
Sbjct: 204 ADMAVKLGTGVPWIMCKQEDAPDPMIDTCNGFYCENFKPNKDYKPKIWTEAWTGWYTEFG 263
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
AVP RP ED+AF+VARF + GG++ NYYMY GGTNFGRTAGGP +ATSYDYDAP+DE+G
Sbjct: 264 GAVPHRPAEDMAFSVARFIQNGGSYINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEFG 323
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
R+PKWGHLR+LHKAIKLCE L+S DPT LG+ EAH++ KS + CAAFLANYD+
Sbjct: 324 LPREPKWGHLRDLHKAIKLCEPALVSVDPTVTSLGSNQEAHVF-KSKSVCAAFLANYDTK 382
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
VTF Y LP WSVSILPDCK V+NTA++ SQ +Q K ++ ASS
Sbjct: 383 YSVKVTFGNGQYELPPWSVSILPDCKTAVYNTARLGSQS-------SQMK----MVPASS 431
Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
+FSW EE + + L EQIN T+D +DYLWY + + + G+
Sbjct: 432 SFSWQSYNEETASADDDDTTTMNGLWEQINVTRDATDYLWYLTDVKIDADEGFLKSGQNP 491
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L I S GHA VF+N +L YG ++ I+L EGIN + +LS+ VGL N G
Sbjct: 492 LLTIFSAGHALHVFINGQLAGTAYGGLSNPKLTFSQNIKLTEGINKISLLSVAVGLPNVG 551
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
F+ AG+ + L L G RDLS +W Y++G++GE + L S + S W +GS
Sbjct: 552 LHFETWNAGVLGPITLKGLNEGTRDLSGQKWSYKIGLKGESLSLHTASGSESVEWVEGSL 611
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
L ++L WYKT F AP+G PLAL+++SMGKGQ W+NGQ+IGR+W Y+A G C
Sbjct: 612 LAQKQALTWYKTAFDAPQGNDPLALDMSSMGKGQMWINGQNIGRHWPGYIA--HGSCGDC 669
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
+Y G++D KC+ +CG+P+Q YH+PR+W+ P NLL + EE GGDP+ IS + +T +
Sbjct: 670 NYAGTFDDKKCRTNCGEPSQRWYHVPRSWLKPSGNLLAVFEEWGGDPTGISFVKRTTASV 729
Query: 712 CSFVSEADPPPVDSWK--PNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
C+ + E P + +W+ + V+S P+ L C G I+ I FAS+G+P+G CGSFR
Sbjct: 730 CADIFEGQ-PALKNWQAIASGKVISPQPKAHLWCPTGQKISQIKFASFGMPQGTCGSFRE 788
Query: 770 GACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G+CH ++ CVG+ CS+ V+ G CP K L+VEA CS
Sbjct: 789 GSCHAHKSYDAFERNCVGKQSCSVTVAPEVFG--GDPCPDSAKKLSVEAVCS 838
>gi|242055159|ref|XP_002456725.1| hypothetical protein SORBIDRAFT_03g041450 [Sorghum bicolor]
gi|241928700|gb|EES01845.1| hypothetical protein SORBIDRAFT_03g041450 [Sorghum bicolor]
Length = 843
Score = 888 bits (2295), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 438/827 (52%), Positives = 554/827 (66%), Gaps = 17/827 (2%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
++NVTYDHR+L+I G+RR++ S SIHYPRS PE+WP+L+ ++K+GG + IETYVFWN HE
Sbjct: 26 ASNVTYDHRSLIISGRRRLIISTSIHYPRSVPEMWPKLVAEAKDGGADCIETYVFWNGHE 85
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
GQYYFE RFDLVRFVK V++AGL L LRIGP+ AEWN+GG PVWLH++PG FRT
Sbjct: 86 IAPGQYYFEDRFDLVRFVKVVKDAGLLLILRIGPFVAAEWNFGGVPVWLHYVPGTVFRTD 145
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV-EWAYGVGGELYVKWA 180
N PFK MK F I+++MK+E LFASQGG IILAQ+ENEYG+ E AY GG+ Y WA
Sbjct: 146 NEPFKSHMKSFTTYIVNMMKKEQLFASQGGNIILAQIENEYGDYYEQAYAPGGKPYAMWA 205
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV NT VPW+MCQ+ DAPDP+IN+CNGFYCDGF PNSP+KP +WTEN+ GWF +FG
Sbjct: 206 ASMAVAQNTGVPWIMCQESDAPDPVINSCNGFYCDGFQPNSPTKPKLWTENWPGWFQTFG 265
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ P RP ED+AFAVARFFE GG+ QNYY+Y GGTNFGRT GGP + TSYDYDAPIDEYG
Sbjct: 266 ESNPHRPPEDVAFAVARFFEKGGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYG 325
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
R PKW HLR+LHK+I+LCE L+ + T LG K EA IY S C AFLAN DS+
Sbjct: 326 LRRFPKWAHLRDLHKSIRLCEHTLLYGNTTFLSLGPKQEADIYSDQSGGCVAFLANIDSA 385
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
+D VTF Y LPAWSVSILPDC+NVVFNTAKV SQ + V E L AS
Sbjct: 386 NDKVVTFRNRQYDLPAWSVSILPDCRNVVFNTAKVQSQT-------SMVAMVPESLQASK 438
Query: 421 AFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMP--GQGKEVFLNI 476
W + E+ GI G FVR + INTTKD++DYLWYT S V +G V LNI
Sbjct: 439 PERWNIFRERTGIWGKNDFVRNGFVDHINTTKDSTDYLWYTTSFSVDESYSKGSHVVLNI 498
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
+S GH F+N + + YGN ++F + I L G N L +LSM VGLQN G ++
Sbjct: 499 DSKGHGVHAFLNNEFIGSAYGNGSQSSFSVKLPINLRTGKNELALLSMTVGLQNAGFSYE 558
Query: 537 VAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
GAG +V + ++NG +LSS W Y++G+EGEY L K N+ W S P N+
Sbjct: 559 WIGAGFTNVNISGVRNGTINLSSNNWAYKIGLEGEYYSLFKPDQRNNQRWIPQSEPPKNQ 618
Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
L WYK P+G P+ +++ SMGKG W+NG +IGRYW + CT CDYRG
Sbjct: 619 PLTWYKVNVDVPQGDDPVGIDMQSMGKGLVWLNGNAIGRYWPRTSSIDDRCTPSCDYRGE 678
Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
++ +KC+ CGQP Q YHIPR+W HP N+LVI EE GGDP+KI+ + +CSFVS
Sbjct: 679 FNPNKCRTGCGQPTQRWYHIPRSWFHPSGNILVIFEEKGGDPTKITFSRRAVTSVCSFVS 738
Query: 717 EADPP-PVDSWKPNLGVVSSSP-QVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM 774
E P ++SW + +SP + +L+C G +I+++ FAS G P G C S++ G+CH
Sbjct: 739 EHFPSIDLESWDGSATNEGTSPAKAQLSCPIGKNISSLKFASLGTPSGTCRSYQKGSCHH 798
Query: 775 -DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ L +V+KAC+ C++ +S G CPG+ K LA+EA CS
Sbjct: 799 PNSLSVVEKACLNTNSCTVSLSDESFG--KDLCPGVTKTLAIEADCS 843
>gi|114217395|dbj|BAF31233.1| beta-D-galactosidase [Persea americana]
Length = 849
Score = 887 bits (2291), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 436/829 (52%), Positives = 559/829 (67%), Gaps = 27/829 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ +V+YDH+A++I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HE
Sbjct: 36 TCSVSYDHKAIIINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHE 95
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G+YYFEGR+DLV+F+K V+EAGL++HLRIGPYACAEWN+GGFPVWL +IPGI FRT
Sbjct: 96 PSPGEYYFEGRYDLVKFIKLVKEAGLYVHLRIGPYACAEWNFGGFPVWLKYIPGISFRTD 155
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK M F KI+D+MK+E LF +QGGPIIL+Q+ENEYG VEW G G+ Y KWAA
Sbjct: 156 NEPFKTAMAGFTKKIVDMMKEEELFETQGGPIILSQIENEYGPVEWEIGAPGQAYTKWAA 215
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
+ AV L T VPWVMC+Q+DAPDPIINTCN YCD F+PN KP MWTE ++ WF +FG
Sbjct: 216 NMAVGLGTGVPWVMCKQDDAPDPIINTCNDHYCDWFSPNKNYKPTMWTEAWTSWFTAFGG 275
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
VP+RP ED+AFA+A+F + GG+F NYYMY GGTNFGRTAGGP VATSYDYDAPIDEYG
Sbjct: 276 PVPYRPAEDMAFAIAKFIQRGGSFINYYMYHGGTNFGRTAGGPFVATSYDYDAPIDEYGL 335
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
IRQPKWGHL++LHKAIK+CE L+S DP LG+ E+H++ S DCAAFLANYD S
Sbjct: 336 IRQPKWGHLKDLHKAIKMCEAALVSGDPIVTSLGSSQESHVFKSESGDCAAFLANYDEKS 395
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
A V F G Y LP WS+SILPDC N VFNTA+V AQ ++ +
Sbjct: 396 FAKVAFQGMHYNLPPWSISILPDCVNTVFNTARV----------GAQTSSMTMTSVNPDG 445
Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
FSW Y E+ + S L EQIN T+D +DYLWYT I + P + G+ L
Sbjct: 446 FSWETYNEETASYDDASITMEGLLEQINVTRDVTDYLWYTTDITIDPNEGFLKNGEYPVL 505
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
+ S GHA +F+N +L YG+ D ++L G N + +LS+ VGL N GA
Sbjct: 506 TVMSAGHALHIFINGELSGTVYGSVDNPKLTYTGSVKLLAGNNKISVLSIAVGLPNIGAH 565
Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
F+ G+ V+L L G+RDLS W Y++G++GE + L ++ ++S W S +
Sbjct: 566 FETWNTGVLGPVVLNGLNEGRRDLSWQNWSYKIGLKGEALQLHSLTGSSSVEWS--SLIA 623
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ L WYKTTF APEG GP AL+++ MGKGQ W+NGQSIGRYW AY A G +C Y
Sbjct: 624 QKQPLTWYKTTFNAPEGNGPFALDMSMMGKGQIWINGQSIGRYWPAYKA--YGNCGECSY 681
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
G Y+ KC +CG+ +Q YH+P +W++P NLLV+ EE GGDP+ ISL+ +T C+
Sbjct: 682 TGRYNEKKCLANCGEASQRWYHVPSSWLYPTANLLVVFEEWGGDPTGISLVRRTTGSACA 741
Query: 714 FVSEADPPPVDSWKPNLGVVS--SSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
F+SE P + G P+ L+C G I++I FAS+G P+G CG+F G+
Sbjct: 742 FISEWHPTLRKWHIKDYGRAERPRRPKAHLSCADGQKISSIKFASFGTPQGVCGNFTEGS 801
Query: 772 CHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
CH I +K CVGQ CS+ +S G CP ++K LAVEA C
Sbjct: 802 CHAHKSYDIFEKNCVGQQWCSVTISPDVFG--GDPCPNVMKNLAVEAIC 848
>gi|356561185|ref|XP_003548865.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 848
Score = 885 bits (2288), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 442/832 (53%), Positives = 564/832 (67%), Gaps = 30/832 (3%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A+VTYD +A++I+G+RR+L SGSIHYPRSTP++W +LI K+KEGGL+V+ETYVFWN HEP
Sbjct: 25 ASVTYDRKAILINGQRRILFSGSIHYPRSTPDMWEDLILKAKEGGLDVVETYVFWNVHEP 84
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G Y FEGR+DLVRFVKT+Q+AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT N
Sbjct: 85 SPGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDN 144
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M+ F KI+ +MK E LF SQGGPIIL+Q+ENEYG G G+ YV WAA
Sbjct: 145 EPFKTAMQGFTEKIVGMMKSERLFESQGGPIILSQIENEYGAQSKLQGDAGQNYVNWAAK 204
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
AV + T VPWVMC+++DAPDP+INTCNGFYCD FTPN P KP++WTE +SGWF FG
Sbjct: 205 MAVEMGTGVPWVMCKEDDAPDPVINTCNGFYCDKFTPNRPYKPMIWTEAWSGWFTEFGGP 264
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
+ RPV+DLAFAVARF GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG I
Sbjct: 265 IHKRPVQDLAFAVARFIIRGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLI 324
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPK+GHL+ELH+AIK+CE L+S+DP LG +AH+Y S DCAAFL+NYDS S
Sbjct: 325 RQPKYGHLKELHRAIKMCERALVSTDPIITSLGESQQAHVYTTESGDCAAFLSNYDSKSS 384
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A V FN Y LP WSVSILPDC+NVVFNTAKV Q + L + F
Sbjct: 385 ARVMFNNMHYNLPPWSVSILPDCRNVVFNTAKV----------GVQTSQMQMLPTNTQLF 434
Query: 423 SW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
SW ++E V + + + + P L EQIN TKD SDYLWY S+ + + G+ L
Sbjct: 435 SWESFDEDVYSVDDSSAIMAPGLLEQINVTKDASDYLWYITSVDIGSSESFLRGGELPTL 494
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
++S GHA VF+N +L YG ++ F+ K+ L GIN + +LS+ +GL N G
Sbjct: 495 IVQSRGHAVHVFINGQLSGSAYGTREYRRFMYTGKVNLRAGINRIALLSVAIGLPNVGEH 554
Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS-TL 592
F+ G+ V L L GK DLS +W YQVG++GE + L + +S W Q + +
Sbjct: 555 FESWSTGILGPVALHGLDQGKWDLSGQKWTYQVGLKGEAMDLASPNGISSVAWMQSAIVV 614
Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
N+ L W+KT F APEG PLAL++ MGKGQ W+NGQSIGRYW+ + +TG C+
Sbjct: 615 QRNQPLTWHKTHFDAPEGDEPLALDMEGMGKGQIWINGQSIGRYWTTF---ATGNCNDCN 671
Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHIC 712
Y GS+ KCQ CGQP Q YH+PR+W+ P +NLLVI EELGG+PSKISL+ ++ +C
Sbjct: 672 YAGSFRPPKCQLGCGQPTQRWYHVPRSWLKPTQNLLVIFEELGGNPSKISLVKRSVSSVC 731
Query: 713 SFVSEADPPPVDSWKPNLGVVSSS---PQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
+ VSE P + +W S P+V L C G I++I FAS+G P G CG++
Sbjct: 732 ADVSEYH-PNIKNWHIESYGKSEEFHPPKVHLHCSPGQTISSIKFASFGTPLGTCGNYEQ 790
Query: 770 GACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
GACH I++K C+G+ C++ VS++ G CP +LK L+VEA C+
Sbjct: 791 GACHSPASYAILEKRCIGKPRCTVTVSNSNFG--QDPCPKVLKRLSVEAVCA 840
>gi|118488890|gb|ABK96254.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 846
Score = 885 bits (2287), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/829 (52%), Positives = 564/829 (68%), Gaps = 20/829 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
++A+V+YD +A+ I+G+RR+L SGSIHYPRS+PE+WP+LI+K+KEGGL+VI+TYVFWN H
Sbjct: 29 VTASVSYDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGH 88
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G+YYFEG +DLV+FVK +EAGL++HLRIGPY CAEWN+GGFPVWL +IPGI FRT
Sbjct: 89 EPSPGKYYFEGNYDLVKFVKLAKEAGLYVHLRIGPYICAEWNFGGFPVWLKYIPGINFRT 148
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK +M++F KI+++MK E LF +QGGPIIL+Q+ENEYG +E+ G G+ Y KWA
Sbjct: 149 DNGPFKAQMQKFTTKIVNMMKAERLFETQGGPIILSQIENEYGPMEYEIGSPGKAYTKWA 208
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A+ AV L T VPWVMC+Q+DAPDPIINTCNGFYCD F+PN KP MWTE ++GWF FG
Sbjct: 209 AEMAVGLRTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTQFG 268
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
VP RP ED+AF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 269 GPVPHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 328
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+RQPKWGHL++LH+AIKLCE L+S D T LG EAH+++ + CAAFLANY
Sbjct: 329 LLRQPKWGHLKDLHRAIKLCEPALVSGDATVIPLGNYQEAHVFNYKAGGCAAFLANYHQR 388
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V+F Y LP WS+SILPDCKN V+NTA+V +Q A+ K +
Sbjct: 389 SFAKVSFRNMHYNLPPWSISILPDCKNTVYNTARVGAQS-------ARMKMTPVPMHGGF 441
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
++ Y E+ SG+ +F L EQINTT+D SDYLWY +H+ P + GK L
Sbjct: 442 SWQAYNEEPSASGDSTFTMVGLLEQINTTRDVSDYLWYMTDVHIDPSEGFLRSGKYPVLG 501
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ S GHA VF+N +L YG+ DF + ++L G+N + +LS+ VGL N G F
Sbjct: 502 VLSAGHALHVFINGQLSGTAYGSLDFPKLTFTQGVKLRAGVNKISLLSIAVGLPNVGPHF 561
Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+ AG+ V L L G+RDLS +W Y++G+ GE +GL IS ++S W +GS +
Sbjct: 562 ETWNAGILGPVTLNGLNEGRRDLSWQKWSYKIGLHGEALGLHSISGSSSVEWAEGSLVAQ 621
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
+ L WYKTTF AP G PLAL++ SMGKGQ W+NGQ +GR+W AY A +G C Y
Sbjct: 622 RQPLSWYKTTFNAPAGNSPLALDMGSMGKGQIWINGQHVGRHWPAYKA--SGTCGDCSYI 679
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
G+Y+ KC +CG+ +Q YH+P++W+ P NLLV+ EE GGDP+ ISL+ + +C+
Sbjct: 680 GTYNEKKCSTNCGEASQRWYHVPQSWLKPTGNLLVVFEEWGGDPNGISLVRRDVDSVCAD 739
Query: 715 VSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
+ E P ++ G V+ P+ L+C G I +I FAS+G PEG CGS+R G+C
Sbjct: 740 IYEWQPTLMNYQMQASGKVNKPLRPKAHLSCGPGQKIRSIKFASFGTPEGVCGSYRQGSC 799
Query: 773 H-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
H CVGQ CS+ V+ G C ++K LAVEA CS
Sbjct: 800 HAFHSYDAFNNLCVGQNSCSVTVAPEMFG--GDPCLNVMKKLAVEAICS 846
>gi|224134551|ref|XP_002327432.1| predicted protein [Populus trichocarpa]
gi|222835986|gb|EEE74407.1| predicted protein [Populus trichocarpa]
Length = 839
Score = 884 bits (2284), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/829 (51%), Positives = 564/829 (68%), Gaps = 20/829 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
++A+V+YD +A+ I+G+RR+L SGSIHYPRS+PE+WP+LI+K+KEGGL+VI+TYVFWN H
Sbjct: 22 VTASVSYDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGH 81
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G+YYFEG +DLV+FVK +EAGL++HLRIGPY CAEWN+GGFPVWL +IPGI FRT
Sbjct: 82 EPSPGKYYFEGNYDLVKFVKLAKEAGLYVHLRIGPYICAEWNFGGFPVWLKYIPGINFRT 141
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK +M++F K++++MK E LF +QGGPIIL+Q+ENEYG +E+ G G+ Y KWA
Sbjct: 142 DNGPFKAQMQKFTTKVVNMMKAERLFETQGGPIILSQIENEYGPMEYEIGSPGKAYTKWA 201
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A+ AV L T VPWVMC+Q+DAPDPIINTCNGFYCD F+PN KP MWTE ++GWF FG
Sbjct: 202 AEMAVGLRTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTQFG 261
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
VP RP ED+AF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 262 GPVPHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 321
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+RQPKWGHL++LH+AIKLCE L+S D T LG EAH+++ + CAAFLANY
Sbjct: 322 LLRQPKWGHLKDLHRAIKLCEPALVSGDATVIPLGNYQEAHVFNYKAGGCAAFLANYHQR 381
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V+F Y LP WS+SILPDCKN V+NTA+V +Q A+ K +
Sbjct: 382 SFAKVSFRNMHYNLPPWSISILPDCKNTVYNTARVGAQS-------ARMKMTPVPMHGGF 434
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
++ Y E+ SG+ +F L EQINTT+D SDYLWY +H+ P + GK L
Sbjct: 435 SWQAYNEEPSASGDSTFTMVGLLEQINTTRDVSDYLWYMTDVHIDPSEGFLRSGKYPVLG 494
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ S GHA VF+N +L YG+ DF + ++L G+N + +LS+ VGL N G F
Sbjct: 495 VLSAGHALHVFINGQLSGTAYGSLDFPKLTFTQGVKLRAGVNKISLLSIAVGLPNVGPHF 554
Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+ AG+ V L L G+RDLS +W Y++G+ GE +GL IS ++S W +GS +
Sbjct: 555 ETWNAGILGPVTLNGLNEGRRDLSWQKWSYKIGLHGEALGLHSISGSSSVEWAEGSLVAQ 614
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
+ L WYKTTF AP G PLAL++ SMGKGQ W+NGQ +GR+W AY A +G C Y
Sbjct: 615 RQPLSWYKTTFNAPAGNSPLALDMGSMGKGQIWINGQHVGRHWPAYKA--SGTCGDCSYI 672
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
G+Y+ KC +CG+ +Q YH+P++W+ P NLLV+ EE GGDP+ ISL+ + +C+
Sbjct: 673 GTYNEKKCSTNCGEASQRWYHVPQSWLKPTGNLLVVFEEWGGDPNGISLVRRDVDSVCAD 732
Query: 715 VSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
+ E P ++ G V+ P+ L+C G I +I FAS+G PEG CGS+R G+C
Sbjct: 733 IYEWQPTLMNYQMQASGKVNKPLRPKAHLSCGPGQKIRSIKFASFGTPEGVCGSYRQGSC 792
Query: 773 H-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
H CVGQ CS+ V+ G C ++K LAVEA CS
Sbjct: 793 HAFHSYDAFNNLCVGQNSCSVTVAPEMFG--GDPCLNVMKKLAVEAICS 839
>gi|297738667|emb|CBI27912.3| unnamed protein product [Vitis vinifera]
Length = 833
Score = 883 bits (2282), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 434/830 (52%), Positives = 562/830 (67%), Gaps = 27/830 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
SA+VTYD R+ +I+G+R++L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HE
Sbjct: 20 SASVTYDKRSFIINGQRKILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHE 79
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P RG+YYFEGR+DLVRF+K VQ AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 80 PSRGKYYFEGRYDLVRFIKVVQAAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIAFRTD 139
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK M+ F KI+D+MK E LF QGGPII++Q+ENEYG VE+ G G+ Y KWAA
Sbjct: 140 NGPFKVAMQGFTQKIVDMMKSEKLFQPQGGPIIMSQIENEYGPVEYEIGAPGKAYTKWAA 199
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
+ AV L T VPWVMC+QEDAPDP+I+ CNGFYC+ F PN KP M+TE ++GW+ FG
Sbjct: 200 EMAVQLGTGVPWVMCKQEDAPDPVIDACNGFYCENFFPNKDYKPKMFTEAWTGWYTEFGG 259
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
A+P RP EDLA++VARF + G+F NYYMY GGTNFGRTAGGP ++TSYDYDAPIDEYG
Sbjct: 260 AIPNRPAEDLAYSVARFIQNRGSFINYYMYHGGTNFGRTAGGPFISTSYDYDAPIDEYGL 319
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+PKWGHLR+LHKAIKLCE L+S+DPT LG LEAH+Y S CAAFLANYD S
Sbjct: 320 PSEPKWGHLRDLHKAIKLCEPALVSADPTVTYLGTNLEAHVYKAKSGACAAFLANYDPKS 379
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
A VTF Y LP WSVSILPDCKNVVFNTA++ +Q + Q +N + S
Sbjct: 380 SAKVTFGNTQYDLPPWSVSILPDCKNVVFNTARIGAQ--------SSQMKMNPV----ST 427
Query: 422 FSW--YEEKVGISGNRSFVRPD-LAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVF 473
FSW Y E+ + D L EQIN T+DT+DYLWY +H+ P + G+
Sbjct: 428 FSWQSYNEETASAYTEDTTTMDGLLEQINITRDTTDYLWYMTEVHIKPDEGFLKTGQYPV 487
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L + S GHA VF+N +L YG + ++L G N + +LS+ +GL N G
Sbjct: 488 LTVMSAGHALHVFINGQLSGTVYGELSNPKVTFSDNVKLTVGTNKISLLSVAMGLPNVGL 547
Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
F+ AG+ V L L G D+SS +W Y++G++GE + L I+ ++S W +GS L
Sbjct: 548 HFETWNAGVLGPVTLKGLNEGTVDMSSWKWSYKIGLKGEALNLQAITGSSSDEWVEGSLL 607
Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
+ L WYKTTF AP G PLAL+++SMGKGQ W+NG+SIGR+W AY A G C+
Sbjct: 608 AQKQPLTWYKTTFNAPGGNDPLALDMSSMGKGQIWINGESIGRHWPAYTA--HGNCNGCN 665
Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHIC 712
Y G ++ KCQ CG P+Q YH+PR+W+ P N L++ EELGG+P+ I+L+ +T +C
Sbjct: 666 YAGIFNDKKCQTGCGGPSQRWYHVPRSWLKPSGNQLIVFEELGGNPAGITLVKRTMDRVC 725
Query: 713 SFVSEADPPPVDSWKPNLGVVSS-SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
+ + E P +S V+S + L C G I+ I FAS+G+P+G CGSFR G+
Sbjct: 726 ADIFEGQPSLKNSQIIGSSKVNSLQSKAHLWCAPGLKISKIQFASFGVPQGTCGSFREGS 785
Query: 772 CHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CH +Q+ C+G+ CS+ V+ G CPG +K L+VEA CS
Sbjct: 786 CHAHKSYDALQRNCIGKQSCSVSVAPEVFG--GDPCPGSMKKLSVEALCS 833
>gi|226494417|ref|NP_001151478.1| LOC100285111 precursor [Zea mays]
gi|195647054|gb|ACG42995.1| beta-galactosidase precursor [Zea mays]
Length = 844
Score = 883 bits (2282), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 438/828 (52%), Positives = 549/828 (66%), Gaps = 18/828 (2%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
++NVTYDHR+L+I G+RR++ S SIHYPRS PE+WP+L+ ++K+GG + IETYVFWN HE
Sbjct: 26 ASNVTYDHRSLIISGRRRLVISTSIHYPRSVPEMWPKLVAEAKDGGADCIETYVFWNGHE 85
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
GQYYFE RFDLVRFVK V++AGL L LRIGPY AEWNYGG PVWLH++PG FRT
Sbjct: 86 IAPGQYYFEDRFDLVRFVKVVRDAGLLLILRIGPYVAAEWNYGGVPVWLHYVPGTVFRTN 145
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV-EWAYGVGGELYVKWA 180
N PFK MK F I+D+MK+E LFASQGG IILAQ+ENEYG+ E AYG GG+ Y WA
Sbjct: 146 NEPFKNHMKSFTTYIVDMMKKEQLFASQGGNIILAQIENEYGDYYEQAYGAGGKPYAMWA 205
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A+ NT VPW+MCQ+ DAPDP+IN+CNGFYCDGF PNSP+KP +WTEN+ GWF +FG
Sbjct: 206 ASMALAQNTGVPWIMCQESDAPDPVINSCNGFYCDGFQPNSPTKPKIWTENWPGWFQTFG 265
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ P RP ED+AFAVARFFE GG+ QNYY+Y GGTNFGRT GGP + TSYDYDAPIDEYG
Sbjct: 266 ESNPHRPPEDVAFAVARFFEKGGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYG 325
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
R PKW HLRELHK+I+LCE L+ + T LG K EA IY S C AFLAN DS+
Sbjct: 326 LRRFPKWAHLRELHKSIRLCEHTLLYGNTTFLSLGPKQEADIYSDQSGGCVAFLANIDSA 385
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
+D VTF Y LPAWSVSILPDC+NVVFNTAKV SQ + V E L AS
Sbjct: 386 NDKVVTFRNRQYDLPAWSVSILPDCRNVVFNTAKVQSQT-------SMVTMVPESLQASK 438
Query: 421 AFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV---MPGQGKEVFLN 475
W + E+ GI G FVR + INTTKD++DYLWYT S V +G LN
Sbjct: 439 PERWSIFRERTGIWGKNDFVRNGFVDHINTTKDSTDYLWYTTSFSVDGSYSSKGSHAVLN 498
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
I+S GH F+N L+ YGN + F + I L G N L +LSM VGLQN G +
Sbjct: 499 IDSNGHGVHAFLNNVLIGSAYGNGSQSRFSVKLTINLRTGKNELALLSMTVGLQNAGFAY 558
Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
+ GAG +V + ++ G DLSS W Y++G+EGEY L K N+ W S P N
Sbjct: 559 EWIGAGFTNVNISGVRTGIIDLSSNNWAYKIGLEGEYYNLFKPDQTNNQRWIPQSEPPKN 618
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
+ L WYK P+G P+ +++ SMGKG AW+NG +IGRYW + + CT C+YRG
Sbjct: 619 QPLTWYKVNVDVPQGDDPVGIDMQSMGKGLAWLNGNAIGRYWPRTSSINDRCTPSCNYRG 678
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
++ KC+ CGQP Q YHIPR+W HP N+LV+ EE GGDP+KI+ + +CSFV
Sbjct: 679 TFIPDKCRTGCGQPTQRWYHIPRSWFHPSGNILVVFEEKGGDPTKITFSRRAVTSVCSFV 738
Query: 716 SEADPP-PVDSWKPNLGVVSSSP-QVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
SE P ++SW + + P + +L+C G I+++ FAS G P G C S++ G CH
Sbjct: 739 SEHFPSIDLESWDESAMNEGTPPAKAQLSCPEGKSISSVKFASLGNPSGTCRSYQMGRCH 798
Query: 774 M-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ L +V+KAC+ C++ ++ G C G+ K LA+EA CS
Sbjct: 799 HPNSLSVVEKACLNTNSCTVSLTDESFG--KDLCHGVTKTLAIEADCS 844
>gi|414879448|tpg|DAA56579.1| TPA: beta-galactosidase isoform 1 [Zea mays]
gi|414879449|tpg|DAA56580.1| TPA: beta-galactosidase isoform 2 [Zea mays]
Length = 844
Score = 883 bits (2281), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 437/828 (52%), Positives = 549/828 (66%), Gaps = 18/828 (2%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
++NVTYDHR+L+I G+RR++ S SIHYPRS PE+WP+L+ ++K+GG + IETYVFWN HE
Sbjct: 26 ASNVTYDHRSLIISGRRRLVISTSIHYPRSVPEMWPKLVAEAKDGGADCIETYVFWNGHE 85
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
GQYYFE RFDLVRFVK V++AGL L LRIGPY AEWNYGG PVWLH++PG FRT
Sbjct: 86 IAPGQYYFEDRFDLVRFVKVVRDAGLLLILRIGPYVAAEWNYGGVPVWLHYVPGTVFRTN 145
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV-EWAYGVGGELYVKWA 180
N PFK +K F I+D+MK+E LFASQGG IILAQ+ENEYG+ E AYG GG+ Y WA
Sbjct: 146 NEPFKNHVKSFTTYIVDMMKKEQLFASQGGNIILAQIENEYGDYYEQAYGAGGKPYAMWA 205
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A+ NT VPW+MCQ+ DAPDP+IN+CNGFYCDGF PNSP+KP +WTEN+ GWF +FG
Sbjct: 206 ASMALAQNTGVPWIMCQESDAPDPVINSCNGFYCDGFQPNSPTKPKIWTENWPGWFQTFG 265
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ P RP ED+AFAVARFFE GG+ QNYY+Y GGTNFGRT GGP + TSYDYDAPIDEYG
Sbjct: 266 ESNPHRPPEDVAFAVARFFEKGGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYG 325
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
R PKW HLR+LHK+I+LCE L+ + T LG K EA IY S C AFLAN DS+
Sbjct: 326 LRRFPKWAHLRDLHKSIRLCEHTLLYGNTTFLSLGPKQEADIYSDQSGGCVAFLANIDSA 385
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
+D VTF Y LPAWSVSILPDC+NVVFNTAKV SQ + V E L AS
Sbjct: 386 NDKVVTFRNRQYDLPAWSVSILPDCRNVVFNTAKVQSQT-------SMVTMVPESLQASK 438
Query: 421 AFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV---MPGQGKEVFLN 475
W + E+ GI G FVR + INTTKD++DYLWYT S V +G LN
Sbjct: 439 PERWSIFRERTGIWGKNDFVRNGFVDHINTTKDSTDYLWYTTSFSVDGSYSSKGSHAVLN 498
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
I+S GH F+N L+ YGN + F + I L G N L +LSM VGLQN G +
Sbjct: 499 IDSNGHGVHAFLNNVLIGSAYGNGSQSRFSVKLPINLRTGKNELALLSMTVGLQNAGFAY 558
Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
+ GAG +V + ++ G DLSS W Y++G+EGEY L K N+ W S P N
Sbjct: 559 EWIGAGFTNVNISGVRTGTIDLSSNNWAYKIGLEGEYYNLFKPDQTNNQRWIPQSEPPKN 618
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
+ L WYK P+G P+ +++ SMGKG AW+NG +IGRYW + + CT C+YRG
Sbjct: 619 QPLTWYKVNVDVPQGDDPVGIDMQSMGKGLAWLNGNAIGRYWPRTSSINDRCTPSCNYRG 678
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
++ KC+ CGQP Q YHIPR+W HP N+LV+ EE GGDP+KI+ + +CSFV
Sbjct: 679 TFIPDKCRTGCGQPTQRWYHIPRSWFHPSGNILVVFEEKGGDPTKITFSRRAVTSVCSFV 738
Query: 716 SEADPP-PVDSWKPNLGVVSSSP-QVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
SE P ++SW + + P + +L C G I+++ FAS G P G C S++ G CH
Sbjct: 739 SEHFPSIDLESWDESAMTEGTPPAKAQLFCPEGKSISSVKFASLGNPSGTCRSYQMGRCH 798
Query: 774 M-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ L +V+KAC+ C++ ++ G CPG+ K LA+EA CS
Sbjct: 799 HPNSLSVVEKACLNTNSCTVSLTDESFG--KDLCPGVTKTLAIEADCS 844
>gi|225444920|ref|XP_002282132.1| PREDICTED: beta-galactosidase [Vitis vinifera]
Length = 836
Score = 883 bits (2281), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 434/830 (52%), Positives = 562/830 (67%), Gaps = 27/830 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
SA+VTYD R+ +I+G+R++L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HE
Sbjct: 23 SASVTYDKRSFIINGQRKILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHE 82
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P RG+YYFEGR+DLVRF+K VQ AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 83 PSRGKYYFEGRYDLVRFIKVVQAAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIAFRTD 142
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK M+ F KI+D+MK E LF QGGPII++Q+ENEYG VE+ G G+ Y KWAA
Sbjct: 143 NGPFKVAMQGFTQKIVDMMKSEKLFQPQGGPIIMSQIENEYGPVEYEIGAPGKAYTKWAA 202
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
+ AV L T VPWVMC+QEDAPDP+I+ CNGFYC+ F PN KP M+TE ++GW+ FG
Sbjct: 203 EMAVQLGTGVPWVMCKQEDAPDPVIDACNGFYCENFFPNKDYKPKMFTEAWTGWYTEFGG 262
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
A+P RP EDLA++VARF + G+F NYYMY GGTNFGRTAGGP ++TSYDYDAPIDEYG
Sbjct: 263 AIPNRPAEDLAYSVARFIQNRGSFINYYMYHGGTNFGRTAGGPFISTSYDYDAPIDEYGL 322
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+PKWGHLR+LHKAIKLCE L+S+DPT LG LEAH+Y S CAAFLANYD S
Sbjct: 323 PSEPKWGHLRDLHKAIKLCEPALVSADPTVTYLGTNLEAHVYKAKSGACAAFLANYDPKS 382
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
A VTF Y LP WSVSILPDCKNVVFNTA++ +Q + Q +N + S
Sbjct: 383 SAKVTFGNTQYDLPPWSVSILPDCKNVVFNTARIGAQ--------SSQMKMNPV----ST 430
Query: 422 FSW--YEEKVGISGNRSFVRPD-LAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVF 473
FSW Y E+ + D L EQIN T+DT+DYLWY +H+ P + G+
Sbjct: 431 FSWQSYNEETASAYTEDTTTMDGLLEQINITRDTTDYLWYMTEVHIKPDEGFLKTGQYPV 490
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L + S GHA VF+N +L YG + ++L G N + +LS+ +GL N G
Sbjct: 491 LTVMSAGHALHVFINGQLSGTVYGELSNPKVTFSDNVKLTVGTNKISLLSVAMGLPNVGL 550
Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
F+ AG+ V L L G D+SS +W Y++G++GE + L I+ ++S W +GS L
Sbjct: 551 HFETWNAGVLGPVTLKGLNEGTVDMSSWKWSYKIGLKGEALNLQAITGSSSDEWVEGSLL 610
Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
+ L WYKTTF AP G PLAL+++SMGKGQ W+NG+SIGR+W AY A G C+
Sbjct: 611 AQKQPLTWYKTTFNAPGGNDPLALDMSSMGKGQIWINGESIGRHWPAYTA--HGNCNGCN 668
Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHIC 712
Y G ++ KCQ CG P+Q YH+PR+W+ P N L++ EELGG+P+ I+L+ +T +C
Sbjct: 669 YAGIFNDKKCQTGCGGPSQRWYHVPRSWLKPSGNQLIVFEELGGNPAGITLVKRTMDRVC 728
Query: 713 SFVSEADPPPVDSWKPNLGVVSS-SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
+ + E P +S V+S + L C G I+ I FAS+G+P+G CGSFR G+
Sbjct: 729 ADIFEGQPSLKNSQIIGSSKVNSLQSKAHLWCAPGLKISKIQFASFGVPQGTCGSFREGS 788
Query: 772 CHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CH +Q+ C+G+ CS+ V+ G CPG +K L+VEA CS
Sbjct: 789 CHAHKSYDALQRNCIGKQSCSVSVAPEVFG--GDPCPGSMKKLSVEALCS 836
>gi|227053553|gb|ACP18875.1| beta-galactosidase pBG(a) [Carica papaya]
Length = 836
Score = 882 bits (2280), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 446/830 (53%), Positives = 566/830 (68%), Gaps = 24/830 (2%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A+V+YDH+A+ I+GKRR+L SGSIHYPRSTPE+WP+LI+K+KEGGL+VI+TYVFWN HEP
Sbjct: 19 ASVSYDHKAITINGKRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEP 78
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G+YYF G +DLVRF+K V++AGL++HLRIGPY CAEWN+GGFPVWL +IPGI FRT N
Sbjct: 79 SPGKYYFGGNYDLVRFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGIAFRTNN 138
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M+RF KI+D+MK E LF SQGGPIIL+Q+ENEYG +E+ G G Y +WAA
Sbjct: 139 GPFKAYMQRFTKKIVDMMKAEGLFESQGGPIILSQIENEYGPMEYELGAAGRAYSQWAAQ 198
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
AV L T VPWVMC+Q+DAPDPIIN+CNGFYCD F+PN KP MWTE ++GWF FG A
Sbjct: 199 MAVGLGTGVPWVMCKQDDAPDPIINSCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGA 258
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
VP+RPVEDLAF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG +
Sbjct: 259 VPYRPVEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLV 318
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPKWGHL++LH+AIKLCE L+S DP+ LG EAH++ CAAFLANY+ S
Sbjct: 319 RQPKWGHLKDLHRAIKLCEPALVSGDPSVMPLGRFQEAHVFKSKYGHCAAFLANYNPRSF 378
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A V F Y LP WS+SILPDCKN V+NTA+V +Q A+ K V + AF
Sbjct: 379 AKVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQS-------ARMKMVP--VPIHGAF 429
Query: 423 SWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
SW EE +G RSF L EQINTT+D SDYLWY+ + + P + GK L
Sbjct: 430 SWQAYNEEAPSSNGERSFTTVGLVEQINTTRDVSDYLWYSTDVKIDPDEGFLKTGKYPTL 489
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
+ S GHA VFVN +L YG+ +F +K + L GIN + ILS+ VGL N G
Sbjct: 490 TVLSAGHALHVFVNDQLSGTAYGSLEFPKITFSKGVNLRAGINKISILSIAVGLPNVGPH 549
Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
F+ AG+ V L L G+RDLS +W Y+VGVEGE + L +S ++S W GS +
Sbjct: 550 FETWNAGVLGPVTLNGLNEGRRDLSWQKWSYKVGVEGEAMSLHSLSGSSSVEWTAGSFVA 609
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ L W+KTTF AP G PLAL++ SMGKGQ W+NG+SIGR+W AY A +G CDY
Sbjct: 610 RRQPLTWFKTTFNAPAGNSPLALDMNSMGKGQIWINGKSIGRHWPAYKA--SGSCGWCDY 667
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
G+++ KC +CG+ +Q YH+PR+W +P NLLV+ EE GGDP+ ISL+ + +C+
Sbjct: 668 AGTFNEKKCLSNCGEASQRWYHVPRSWPNPTGNLLVVFEEWGGDPNGISLVRREVDSVCA 727
Query: 714 FVSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
+ E P ++ G V+ P+ L C G I+++ FAS+G PEG CGS+R G+
Sbjct: 728 DIYEWQPTLMNYQMQASGKVNKPLRPKAHLQCGPGQKISSVKFASFGTPEGACGSYREGS 787
Query: 772 CHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CH ++ CVGQ CS+ V + A P ++K LAVE CS
Sbjct: 788 CHAHHSYDAFERLCVGQNWCSVTVVPRNVSGEIPA-PSVMKKLAVEVVCS 836
>gi|157313306|gb|ABV32546.1| beta-galactosidase protein 1 [Prunus persica]
Length = 836
Score = 882 bits (2278), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/826 (51%), Positives = 565/826 (68%), Gaps = 21/826 (2%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+A+V+YDH+A++I+G++R+L SGSIHYPRSTPE+WP+LI+KSK+GGL+VI+TYVFWN HE
Sbjct: 25 TASVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKSKDGGLDVIQTYVFWNGHE 84
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G+YYFE R+DLV+F+K V +AGL+++LRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 85 PSPGKYYFEDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGIVFRTD 144
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK M++F KI+ +MK E LF SQGGPIIL+Q+ENE+G VEW G G+ Y KWAA
Sbjct: 145 NEPFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAA 204
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV LNT VPW+MC+QEDAPDP+I+TCNGFYC+ FTPN KP MWTE ++GW+ FG
Sbjct: 205 QMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCENFTPNKNYKPKMWTEVWTGWYTEFGG 264
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
AVP RP EDLAF++ARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 265 AVPTRPAEDLAFSIARFIQKGGSFVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGL 324
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
R+PKWGHLR+LHKAIK E L+S++P+ LG EAH++ KS + CAAFLANYD+ S
Sbjct: 325 PREPKWGHLRDLHKAIKSSESALVSAEPSVTSLGNGQEAHVF-KSKSGCAAFLANYDTKS 383
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
A V+F Y LP W +SILPDCK V+NTA++ SQ + Q + + A
Sbjct: 384 SAKVSFGNGQYELPPWPISILPDCKTAVYNTARLGSQ--------SSQMKMTPVKSALPW 435
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
S+ EE + + L EQIN T+DT+DYLWY I + P + G+ L I
Sbjct: 436 QSFVEESASSDESDTTTLDGLWEQINVTRDTTDYLWYMTDITISPDEGFIKRGESPLLTI 495
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
S GHA VF+N +L YG + ++ ++ GIN L +LS+ VGL N G F+
Sbjct: 496 YSAGHALHVFINGQLSGTVYGALENPKLTFSQNVKPRSGINKLALLSISVGLPNVGLHFE 555
Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
AG+ V L L +G D+S +W Y++G++GE +GL +S ++S W +G ++
Sbjct: 556 TWNAGVLGPVTLKGLNSGTWDMSRWKWTYKIGLKGEALGLHTVSGSSSVEWAEGPSMAQK 615
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
+ L WYK TF AP G GPLAL+++SMGKGQ W+NGQSIGR+W AY A G C Y G
Sbjct: 616 QPLTWYKATFNAPPGNGPLALDMSSMGKGQIWINGQSIGRHWPAYTA--RGNCGNCYYAG 673
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
+YD KC+ HCG+P+Q YH+PR+W+ P NLLV+ EE GGDP+KISL+ + +C+ +
Sbjct: 674 TYDDKKCRTHCGEPSQRWYHVPRSWLTPSGNLLVVFEEWGGDPTKISLVERRTSSVCADI 733
Query: 716 SEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM- 774
E P +S K G + + P+ L C G I+ I FASYG+P+G CGSF+ G+CH
Sbjct: 734 FEGQPTLTNSQKLASGKL-NRPKAHLWCPPGQVISDIKFASYGLPQGTCGSFQEGSCHAH 792
Query: 775 DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
++ C+G+ CS+ V+ G CPG K L+VEA CS
Sbjct: 793 KSYDAPKRNCIGKQSCSVAVAPEVFG--GDPCPGSTKKLSVEAVCS 836
>gi|255546097|ref|XP_002514108.1| beta-galactosidase, putative [Ricinus communis]
gi|223546564|gb|EEF48062.1| beta-galactosidase, putative [Ricinus communis]
Length = 840
Score = 882 bits (2278), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/826 (51%), Positives = 561/826 (67%), Gaps = 21/826 (2%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A V+YDHRA+ I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HEP
Sbjct: 28 ATVSYDHRAITINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEP 87
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G YYFE R+DLV+F+K VQ AGL++HLRIGPY CAEWN+GGFPVWL ++PGI+FRT N
Sbjct: 88 SPGNYYFEDRYDLVKFIKVVQAAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIEFRTDN 147
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M++F KI+ +MK E LF SQGGPIIL+Q+ENE+G VEW G G+ Y KWAAD
Sbjct: 148 GPFKAAMQKFTEKIVSMMKSEKLFESQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAAD 207
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
AV L T VPWVMC+Q+DAPDP+INTCNGFYC+ F PN KP +WTEN++GW+ FG A
Sbjct: 208 MAVKLGTGVPWVMCKQDDAPDPVINTCNGFYCENFKPNKDYKPKLWTENWTGWYTEFGGA 267
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
VP+RP EDLAF+VARF + GG+F NYYMY GGTNFGRT+ G +ATSYDYDAP+DEYG
Sbjct: 268 VPYRPAEDLAFSVARFIQNGGSFMNYYMYHGGTNFGRTSAGLFIATSYDYDAPLDEYGLT 327
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
R PKWGHLR+LHKAIKLCE L+S DPT + LG+ EAH++ +S + CAAFLANYD+
Sbjct: 328 RDPKWGHLRDLHKAIKLCEPALVSVDPTVKSLGSNQEAHVF-QSKSSCAAFLANYDTKYS 386
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
VTF Y LP WS+SILPDCK VFNTA++ +Q + Q + + A S
Sbjct: 387 VKVTFGNGQYDLPPWSISILPDCKTAVFNTARLGAQ--------SSQMKMTPVGGALSWQ 438
Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIE 477
S+ EE + + L EQIN T+D SDYLWY ++++ + G L I
Sbjct: 439 SYIEEAATGYTDDTTTLEGLWEQINVTRDASDYLWYMTNVNIDSDEGFLKNGDSPVLTIF 498
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
S GH+ VF+N +L YG+ + ++ ++L GIN + +LS+ VGL N G F+
Sbjct: 499 SAGHSLHVFINGQLAGTVYGSLENPKLTFSQNVKLTAGINKISLLSVAVGLPNVGVHFEK 558
Query: 538 AGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
AG+ V L L G RDLS +W Y++G++GE + L ++ ++S W +GS +
Sbjct: 559 WNAGILGPVTLKGLNEGTRDLSGWKWSYKIGLKGEALSLHTVTGSSSVEWVEGSLSAKKQ 618
Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
L WYK TF APEG P+AL+++SMGKGQ WVNGQSIGR+W AY A G C+Y G+
Sbjct: 619 PLTWYKATFDAPEGNDPVALDMSSMGKGQIWVNGQSIGRHWPAYTA--RGSCSACNYAGT 676
Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
YD KC+ +CG+P+Q YH+PR+W++P NLLV+ EE GG+PS ISL+ +T +C+ +
Sbjct: 677 YDDKKCRSNCGEPSQRWYHVPRSWLNPSGNLLVVFEEWGGEPSGISLVKRTTGSVCADIF 736
Query: 717 EADPPPVDSWKPNLGVVSS-SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM- 774
E P + LG + P+ L C G I+ I FASYG P+G CGSF+ G+CH
Sbjct: 737 EGQPALKNWQMIALGRLDHLQPKAHLWCPHGQKISKIKFASYGSPQGTCGSFKAGSCHAH 796
Query: 775 DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+K C+G+ CS+ V++ G CP K L+VEA C+
Sbjct: 797 KSYDAFEKKCIGKQSCSVTVAAEVFG--GDPCPDSSKKLSVEAVCT 840
>gi|356556730|ref|XP_003546676.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 840
Score = 880 bits (2275), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 435/827 (52%), Positives = 570/827 (68%), Gaps = 22/827 (2%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A+V+YD +A+ I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HEP
Sbjct: 27 ASVSYDSKAITINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEP 86
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G+YYFEG +DLV+F+K VQ+AGL++HLRIGPY CAEWN+GGFPVWL +IPGI FRT N
Sbjct: 87 SPGKYYFEGNYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGISFRTDN 146
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK +M++F KI+DLMK E L+ SQGGPII++Q+ENEYG +E+ G G+ Y KWAA+
Sbjct: 147 EPFKHQMQKFTTKIVDLMKAERLYESQGGPIIMSQIENEYGPMEYEIGAAGKAYTKWAAE 206
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
A+ L T VPWVMC+Q+D PDP+INTCNGFYCD F+PN KP MWTE ++GWF FG
Sbjct: 207 MAMGLGTGVPWVMCKQDDTPDPLINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGP 266
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
VP RP EDLAF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG +
Sbjct: 267 VPHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLL 326
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPKWGHL++LH+AIKLCE L+S DPT K+G EAH++ S CAAFLANY+ S
Sbjct: 327 RQPKWGHLKDLHRAIKLCEPALVSGDPTVTKIGNYQEAHVFKSKSGACAAFLANYNPKSY 386
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A V F Y LP WS+SILPDCKN V+NTA+V SQ AQ K + F
Sbjct: 387 ATVAFGNMHYNLPPWSISILPDCKNTVYNTARVGSQS-------AQMKMTR--VPIHGGF 437
Query: 423 SW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
SW + E+ + + SF L EQ+NTT+D SDYLWY+ + + P + GK+ L
Sbjct: 438 SWLSFNEETTTTDDSSFTMTGLLEQLNTTRDLSDYLWYSTDVVLDPNEGFLRNGKDPVLT 497
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ S GHA VF+N +L YG+ +F N+ ++L G+N + +LS+ VGL N G F
Sbjct: 498 VFSAGHALHVFINGQLSGTAYGSLEFPKLTFNEGVKLRAGVNKISLLSVAVGLPNVGPHF 557
Query: 536 DVAGAGLFSVI-LIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+ AG+ I L L G+RDLS +W Y+VG++GE + L +S ++S W QGS +
Sbjct: 558 ETWNAGVLGPISLSGLNEGRRDLSWQKWSYKVGLKGEILSLHSLSGSSSVEWIQGSLVSQ 617
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
+ L WYKTTF AP G PLAL++ SMGKGQ W+NGQ++GRYW AY A +G CDY
Sbjct: 618 RQPLTWYKTTFDAPAGTAPLALDMDSMGKGQVWLNGQNLGRYWPAYKA--SGTCDYCDYA 675
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
G+Y+ +KC+ +CG+ +Q YH+P++W+ P NLLV+ EELGGDP+ I L+ + +C+
Sbjct: 676 GTYNENKCRSNCGEASQRWYHVPQSWLKPTGNLLVVFEELGGDPNGIFLVRRDIDSVCAD 735
Query: 715 VSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM 774
+ E P + G P+V L+C G I++I FAS+G P G+CG+F G+CH
Sbjct: 736 IYEWQPNLISYQMQTSGKAPVRPKVHLSCSPGQKISSIKFASFGTPAGSCGNFHEGSCHA 795
Query: 775 -DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
++ CVGQ C++ VS G CP +LK L+VEA CS
Sbjct: 796 HKSYDAFERNCVGQNWCTVTVSPENFG--GDPCPNVLKKLSVEAICS 840
>gi|168045621|ref|XP_001775275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673356|gb|EDQ59880.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 916
Score = 880 bits (2275), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 437/852 (51%), Positives = 579/852 (67%), Gaps = 44/852 (5%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NVTYD RA++IDG+RR+L S IHYPR+TPE+WP +I+ +K+GG +V++TYVFWN HEP
Sbjct: 31 NVTYDQRAVLIDGERRMLISAGIHYPRATPEMWPSIIQHAKDGGADVVQTYVFWNGHEPE 90
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+GQY FEGR+DLV+F+K V++AGL+ HLRIGPY CAEWN+GGFP WL IPGI FRT N
Sbjct: 91 QGQYNFEGRYDLVKFIKLVKQAGLYFHLRIGPYVCAEWNFGGFPYWLKEIPGIVFRTDNE 150
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK M+ F +KI++LMK+ LF+ QGGPII+AQ+ENEYG++E +G GG+ YV+WAAD
Sbjct: 151 PFKVAMQGFTSKIVNLMKENELFSWQGGPIIMAQIENEYGDIESQFGDGGKRYVQWAADM 210
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A++L+T VPW+MC+QEDAP IINTCNGFYCDG+ PN+ KPI+WTE+++GWF ++G A
Sbjct: 211 ALSLDTRVPWIMCKQEDAPANIINTCNGFYCDGWKPNTALKPILWTEDWNGWFQNWGQAA 270
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RPVED AFAVARFF+ GG+FQNYYMYFGGTNF RTAGGP + T+YDYDAPIDEYG IR
Sbjct: 271 PHRPVEDNAFAVARFFQRGGSFQNYYMYFGGTNFARTAGGPFMTTTYDYDAPIDEYGLIR 330
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQK--LGAKLEAHIYHKSSNDCAAFLANYDSSS 361
QPKWGHL++LH AIKLCE L + D Q +G+ EAH Y ++ CAAFLAN DS +
Sbjct: 331 QPKWGHLKDLHAAIKLCEPALTAVDTVPQSTWIGSNQEAHEY-SANGHCAAFLANIDSEN 389
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
V F G Y LPAWSVSILPDCKNV FNTA++ +Q A + ++ L S+
Sbjct: 390 SVTVQFQGESYVLPAWSVSILPDCKNVAFNTAQIGAQTTVTRMRIAPSNSRGDIFLPSNT 449
Query: 422 --------------FSWY--EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM 465
W E GI G+ + V L EQ+N TKDTSDYLWY+ SI +
Sbjct: 450 LVHDHISDGGVFANLKWQASAEPFGIRGSGTTVSNSLLEQLNITKDTSDYLWYSTSITIT 509
Query: 466 PG------QGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTL 519
G E L + ++ A +FVN KL G N + + I L +G N++
Sbjct: 510 SEGVTSDVSGTEANLVLGTMRDAVHIFVNGKLAGSAMG----WNIQVVQPITLKDGKNSI 565
Query: 520 DILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKI 578
D+LSM +GLQNYGA+ + GAG+ SV + L G LS+ EW YQVG+ GE + L
Sbjct: 566 DLLSMTLGLQNYGAYLETWGAGIRGSVSVTGLPYGNLSLSTAEWSYQVGLRGEELKLFHN 625
Query: 579 SLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
A+ W S+ L WYKTTF AP G P+AL+L SMGKGQAW+NG +GRY+
Sbjct: 626 GTADGFSWDS-SSFTNASYLTWYKTTFDAPGGTDPVALDLGSMGKGQAWINGHHLGRYF- 683
Query: 639 AYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQ-------TLYHIPRTWVHPGENLLVIH 691
+AP +GC + CDYRG+Y+ +KC+ +CG+P+Q +YHIPR W+ NLLV+
Sbjct: 684 LMVAPQSGC-ETCDYRGAYNTNKCRTNCGEPSQRWQVIHFQMYHIPRAWLQATGNLLVLF 742
Query: 692 EELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGV--VSSSPQVRLACERGWHI 749
EE+GGD SK+S++T++ +C+ ++E+ PPP+ +W+P+ + ++ ++ L C G HI
Sbjct: 743 EEIGGDISKVSVVTRSAHAVCAHINESQPPPIRTWRPHRSIDAFNNPAEMLLECAAGQHI 802
Query: 750 AAINFASYGIPEGNCGSFRPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPG 808
I FAS+G P G+CG F+ G CH + + V+K C+G+ +C IPV + G S CPG
Sbjct: 803 TKIKFASFGNPRGSCGHFQHGTCHANKSMEAVRKVCIGKQQCYIPVQRKFFG-SIDPCPG 861
Query: 809 LLKALAVEAHCS 820
+ K+LAV+ HCS
Sbjct: 862 VSKSLAVQVHCS 873
>gi|14970839|emb|CAC44500.1| beta-galactosidase [Fragaria x ananassa]
Length = 843
Score = 879 bits (2272), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/839 (51%), Positives = 564/839 (67%), Gaps = 40/839 (4%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ A+V+YD +A+VI+G+RR+L SGSIHYPRSTPE+WP+LI+++K+GGL+VI+TYVFWN H
Sbjct: 26 VRASVSYDSKAIVINGQRRILISGSIHYPRSTPEMWPDLIQRAKDGGLDVIQTYVFWNGH 85
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G+YYFE +DLV+F+K VQ+AGL++HLRIGPY CAEWN+GGFPVWL ++PGIQFRT
Sbjct: 86 EPSPGKYYFEDNYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIQFRT 145
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK++M+RF KI+++MK E LF S GGPIIL+Q+ENEYG +E+ G G+ Y WA
Sbjct: 146 DNGPFKDQMQRFTTKIVNMMKAERLFESHGGPIILSQIENEYGPMEYEIGAPGKAYTDWA 205
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L T VPWVMC+Q+DAPDP+IN CNGFYCD F+PN KP MWTE ++GWF FG
Sbjct: 206 AQMAVGLGTGVPWVMCKQDDAPDPVINACNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFG 265
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
AVP+RP EDLAF+VA+F + GG F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 266 GAVPYRPAEDLAFSVAKFLQKGGAFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 325
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+RQPKWGHL++LH+AIKLCE L+SSDPT LG EAH++ +S CAAFLANY+
Sbjct: 326 LLRQPKWGHLKDLHRAIKLCEPALVSSDPTVTPLGTYQEAHVFKSNSGACAAFLANYNRK 385
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V F Y LP WS+SILPDCKN V+NTA++ +Q P +
Sbjct: 386 SFAKVAFGNMHYNLPPWSISILPDCKNTVYNTARIGAQTARMKMP---------RVPIHG 436
Query: 421 AFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVF 473
FSW Y ++ + SF L EQIN T+D +DYLWY + + P + G
Sbjct: 437 GFSWQAYNDETATYSDTSFTTAGLLEQINITRDATDYLWYMTDVKIDPSEDFLRSGNYPV 496
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L + S GHA VF+N +L YG+ + + + L GIN + +LS+ VGL N G
Sbjct: 497 LTVLSAGHALRVFINGQLAGTAYGSLETPKLTFKQGVNLRAGINQIALLSIAVGLPNVGP 556
Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
F+ AG+ VIL L G+RDLS +W Y++G++GE + L ++ ++S W +GS +
Sbjct: 557 HFETWNAGILGPVILNGLNEGRRDLSWQKWSYKIGLKGEALSLHSLTGSSSVEWTEGSFV 616
Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
+ L WYKTTF P G PLAL++ SMGKGQ W+N +SIGRYW AY A +G +C+
Sbjct: 617 AQRQPLTWYKTTFNRPAGNSPLALDMGSMGKGQVWINDRSIGRYWPAYKA--SGTCGECN 674
Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHIC 712
Y G++ KC +CG+ +Q YH+PR+W++P NLLV+ EE GGDP+ I L+ + +C
Sbjct: 675 YAGTFSEKKCLSNCGEASQRWYHVPRSWLNPTGNLLVVLEEWGGDPNGIFLVRREVDSVC 734
Query: 713 SFVSEADPPPVDSWKPNL--------GVVSS--SPQVRLACERGWHIAAINFASYGIPEG 762
+ + E W+PNL G V+ P+ L+C G I++I FAS+G PEG
Sbjct: 735 ADIYE--------WQPNLMSWQMQVSGRVNKPLRPKAHLSCGPGQKISSIKFASFGTPEG 786
Query: 763 NCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CGSFR G CH +++C+GQ CS+ VS G CP ++K L+VEA CS
Sbjct: 787 VCGSFREGGCHAHKSYNAFERSCIGQNSCSVTVSPENFG--GDPCPNVMKKLSVEAICS 843
>gi|224082924|ref|XP_002306893.1| predicted protein [Populus trichocarpa]
gi|222856342|gb|EEE93889.1| predicted protein [Populus trichocarpa]
Length = 853
Score = 879 bits (2272), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 435/834 (52%), Positives = 554/834 (66%), Gaps = 30/834 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ VTYD +A++IDG+RR+L SGSIHYPRSTP++W +L++K+K+GGL+VI+TYVFWN H
Sbjct: 24 IHCTVTYDKKAIIIDGQRRILISGSIHYPRSTPDMWEDLVQKAKDGGLDVIDTYVFWNVH 83
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G Y FEGRFDLVRF+KTVQ+ GL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 84 EPSPGNYNFEGRFDLVRFIKTVQKGGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 143
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M+ F KI+ +MK E LF SQGGPII +Q+ENEYG A+G G Y+ WA
Sbjct: 144 DNGPFKAAMQGFTQKIVQMMKDERLFQSQGGPIIFSQIENEYGPESRAFGAAGHSYINWA 203
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L T VPWVMC+++DAPDP+INTCNGFYCD F+PN P KP MWTE +SGWF FG
Sbjct: 204 AQMAVGLKTGVPWVMCKEDDAPDPVINTCNGFYCDAFSPNKPYKPTMWTEAWSGWFTEFG 263
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
A RPV+DLAFAVARF + GG+F NYYMY GGTNFGR+AGGP + TSYDYDAPIDEYG
Sbjct: 264 GAFHHRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYG 323
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IR+PK+GHL+ELH+AIKLCE L+SSDPT LG +AH++ C+AFLANY +
Sbjct: 324 LIREPKYGHLKELHRAIKLCEHELVSSDPTITLLGTYQQAHVFSSGKRSCSAFLANYHTQ 383
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V FN Y LP WS+SILPDC+NVVFNTAKV Q +V L S
Sbjct: 384 SAARVMFNNMHYVLPPWSISILPDCRNVVFNTAKV----------GVQTSHVQMLPTGSR 433
Query: 421 AFSW--YEEKVGISGNRSFVRP-DLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
FSW Y+E + G S + L EQIN T+DT+DYLWY S+++ P + G+
Sbjct: 434 FFSWESYDEDISSLGASSRMTALGLMEQINVTRDTTDYLWYITSVNINPSESFLRGGQWP 493
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L +ES GHA VF+N + +G + F + L G N + +LS+ VGL N G
Sbjct: 494 TLTVESAGHALHVFINGQFSGSAFGTRENREFTFTGPVNLRAGTNRIALLSIAVGLPNVG 553
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
++ G+ V+L L G +DL+ +W YQVG++GE + L + A+S W QGS
Sbjct: 554 VHYETWKTGILGPVMLHGLNQGNKDLTWQQWSYQVGLKGEAMNLVSPNRASSVDWIQGSL 613
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
+ L WYK F AP G PLAL++ SMGKGQ W+NGQSIGRYW +Y + G C
Sbjct: 614 ATRQQPLKWYKAYFDAPGGNEPLALDMRSMGKGQVWINGQSIGRYWLSY---AKGDCSSC 670
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
Y G++ KCQ CGQP Q YH+PR+W+ P +NLLVI EELGGD SKISL+ ++ +
Sbjct: 671 GYSGTFRPPKCQLGCGQPTQRWYHVPRSWLKPKQNLLVIFEELGGDASKISLVKRSTTSV 730
Query: 712 CSFVSEADPPPVDSWKPNLGVVSS----SPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
C+ E P ++++ S +V L C G I+AINFAS+G P G CGSF
Sbjct: 731 CADAFEHH-PTIENYNTESNGESERNLHQAKVHLRCAPGQSISAINFASFGTPTGTCGSF 789
Query: 768 RPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ G CH + +V+K C+G+ C + +S++ G A CP LK L+VEA CS
Sbjct: 790 QEGTCHAPNSHSVVEKKCIGRESCMVAISNSNFG--ADPCPSKLKKLSVEAVCS 841
>gi|115441369|ref|NP_001044964.1| Os01g0875500 [Oryza sativa Japonica Group]
gi|75103778|sp|Q5N8X6.1|BGAL3_ORYSJ RecName: Full=Beta-galactosidase 3; Short=Lactase 3; Flags:
Precursor
gi|56784847|dbj|BAD82087.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113534495|dbj|BAF06878.1| Os01g0875500 [Oryza sativa Japonica Group]
gi|222619622|gb|EEE55754.1| hypothetical protein OsJ_04267 [Oryza sativa Japonica Group]
Length = 851
Score = 879 bits (2272), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/826 (52%), Positives = 547/826 (66%), Gaps = 16/826 (1%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+++VTYDHR+L+I G+RR+L S SIHYPRS PE+WP+L+ ++K+GG + +ETYVFWN HE
Sbjct: 35 NSSVTYDHRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHE 94
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P +GQYYFE RFDLVRF K V++AGL++ LRIGP+ AEW +GG PVWLH+ PG FRT
Sbjct: 95 PAQGQYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWTFGGVPVWLHYAPGTVFRTN 154
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK MKRF I+D+MK+E FASQGG IILAQVENEYG++E AYG G + Y WAA
Sbjct: 155 NEPFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEYGDMEQAYGAGAKPYAMWAA 214
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A+ NT VPW+MCQQ DAPDP+INTCN FYCD F PNSP+KP WTEN+ GWF +FG
Sbjct: 215 SMALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQFKPNSPTKPKFWTENWPGWFQTFGE 274
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
+ P RP ED+AF+VARFF GG+ QNYY+Y GGTNFGRT GGP + TSYDYDAPIDEYG
Sbjct: 275 SNPHRPPEDVAFSVARFFGKGGSLQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGL 334
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
R PKW HLR+LHK+IKL E L+ + + LG + EA +Y S C AFL+N DS
Sbjct: 335 RRLPKWAHLRDLHKSIKLGEHTLLYGNSSFVSLGPQQEADVYTDQSGGCVAFLSNVDSEK 394
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
D VTF Y LPAWSVSILPDCKNV FNTAKV SQ D V L +S
Sbjct: 395 DKVVTFQSRSYDLPAWSVSILPDCKNVAFNTAKVRSQTLMMDM-------VPANLESSKV 447
Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ--GKEVFLNIE 477
W + EK GI GN VR + INTTKD++DYLWYT S V G L+IE
Sbjct: 448 DGWSIFREKYGIWGNIDLVRNGFVDHINTTKDSTDYLWYTTSFDVDGSHLAGGNHVLHIE 507
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
S GHA F+N +L+ YGN +NF + + L G N L +LSM VGLQN G ++
Sbjct: 508 SKGHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAGKNKLSLLSMTVGLQNGGPMYEW 567
Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
AGAG+ SV + ++N DLSS +W Y++G+EGEY L K W S P N+
Sbjct: 568 AGAGITSVKISGMENRIIDLSSNKWEYKIGLEGEYYSLFKADKGKDIRWMPQSEPPKNQP 627
Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
+ WYK P+G P+ L++ SMGKG AW+NG +IGRYW S CT CDYRG++
Sbjct: 628 MTWYKVNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWPRISPVSDRCTSSCDYRGTF 687
Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
+KC++ CGQP Q YH+PR+W HP N LVI EE GGDP+KI+ +T +CSFVSE
Sbjct: 688 SPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDPTKITFSRRTVASVCSFVSE 747
Query: 718 ADPP-PVDSWKPNL-GVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM- 774
P ++SW N + +V+L+C +G I+++ F S+G P G C S++ G+CH
Sbjct: 748 HYPSIDLESWDRNTQNDGRDAAKVQLSCPKGKSISSVKFVSFGNPSGTCRSYQQGSCHHP 807
Query: 775 DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ + +V+KAC+ C++ +S G CPG+ K LA+EA CS
Sbjct: 808 NSISVVEKACLNMNGCTVSLSDE--GFGEDLCPGVTKTLAIEADCS 851
>gi|215734965|dbj|BAG95687.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 919
Score = 879 bits (2270), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/826 (52%), Positives = 547/826 (66%), Gaps = 16/826 (1%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+++VTYDHR+L+I G+RR+L S SIHYPRS PE+WP+L+ ++K+GG + +ETYVFWN HE
Sbjct: 103 NSSVTYDHRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHE 162
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P +GQYYFE RFDLVRF K V++AGL++ LRIGP+ AEW +GG PVWLH+ PG FRT
Sbjct: 163 PAQGQYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWTFGGVPVWLHYAPGTVFRTN 222
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK MKRF I+D+MK+E FASQGG IILAQVENEYG++E AYG G + Y WAA
Sbjct: 223 NEPFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEYGDMEQAYGAGAKPYAMWAA 282
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A+ NT VPW+MCQQ DAPDP+INTCN FYCD F PNSP+KP WTEN+ GWF +FG
Sbjct: 283 SMALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQFKPNSPTKPKFWTENWPGWFQTFGE 342
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
+ P RP ED+AF+VARFF GG+ QNYY+Y GGTNFGRT GGP + TSYDYDAPIDEYG
Sbjct: 343 SNPHRPPEDVAFSVARFFGKGGSLQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGL 402
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
R PKW HLR+LHK+IKL E L+ + + LG + EA +Y S C AFL+N DS
Sbjct: 403 RRLPKWAHLRDLHKSIKLGEHTLLYGNSSFVSLGPQQEADVYTDQSGGCVAFLSNVDSEK 462
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
D VTF Y LPAWSVSILPDCKNV FNTAKV SQ D V L +S
Sbjct: 463 DKVVTFQSRSYDLPAWSVSILPDCKNVAFNTAKVRSQTLMMDM-------VPANLESSKV 515
Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ--GKEVFLNIE 477
W + EK GI GN VR + INTTKD++DYLWYT S V G L+IE
Sbjct: 516 DGWSIFREKYGIWGNIDLVRNGFVDHINTTKDSTDYLWYTTSFDVDGSHLAGGNHVLHIE 575
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
S GHA F+N +L+ YGN +NF + + L G N L +LSM VGLQN G ++
Sbjct: 576 SKGHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAGKNKLSLLSMTVGLQNGGPMYEW 635
Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
AGAG+ SV + ++N DLSS +W Y++G+EGEY L K W S P N+
Sbjct: 636 AGAGITSVKISGMENRIIDLSSNKWEYKIGLEGEYYSLFKADKGKDIRWMPQSEPPKNQP 695
Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
+ WYK P+G P+ L++ SMGKG AW+NG +IGRYW S CT CDYRG++
Sbjct: 696 MTWYKVNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWPRISPVSDRCTSSCDYRGTF 755
Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
+KC++ CGQP Q YH+PR+W HP N LVI EE GGDP+KI+ +T +CSFVSE
Sbjct: 756 SPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDPTKITFSRRTVASVCSFVSE 815
Query: 718 ADPP-PVDSWKPNL-GVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM- 774
P ++SW N + +V+L+C +G I+++ F S+G P G C S++ G+CH
Sbjct: 816 HYPSIDLESWDRNTQNDGRDAAKVQLSCPKGKSISSVKFVSFGNPSGTCRSYQQGSCHHP 875
Query: 775 DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ + +V+KAC+ C++ +S G CPG+ K LA+EA CS
Sbjct: 876 NSISVVEKACLNMNGCTVSLSDE--GFGEDLCPGVTKTLAIEADCS 919
>gi|218189464|gb|EEC71891.1| hypothetical protein OsI_04635 [Oryza sativa Indica Group]
Length = 851
Score = 879 bits (2270), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/826 (52%), Positives = 547/826 (66%), Gaps = 16/826 (1%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+++VTYD R+L+I G+RR+L S SIHYPRS PE+WP+L+ ++K+GG + +ETYVFWN HE
Sbjct: 35 NSSVTYDQRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHE 94
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P +GQYYFE RFDLVRF K V++AGL++ LRIGP+ AEW +GG PVWLH+ PG FRT
Sbjct: 95 PAQGQYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWTFGGVPVWLHYAPGTVFRTN 154
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK MKRF I+D+MK+E FASQGG IILAQVENEYG++E AYG G + Y WAA
Sbjct: 155 NEPFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEYGDMEQAYGAGAKPYAMWAA 214
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A+ NT VPW+MCQQ DAPDP+INTCN FYCD F PNSP+KP WTEN+ GWF +FG
Sbjct: 215 SMALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQFKPNSPTKPKFWTENWPGWFQTFGE 274
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
+ P RP ED+AF+VARFF GG+ QNYY+Y GGTNFGRT GGP + TSYDYDAPIDEYG
Sbjct: 275 SNPHRPPEDVAFSVARFFGKGGSLQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGL 334
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
R PKW HLR+LHK+IKL E L+ + + LG + EA +Y S C AFL+N DS
Sbjct: 335 RRLPKWAHLRDLHKSIKLGEHTLLYGNSSFVSLGPQQEADVYTDQSGGCVAFLSNVDSEK 394
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
D VTF Y LPAWSVSILPDCKNV FNTAKV SQ D V L +S
Sbjct: 395 DKVVTFQSRSYDLPAWSVSILPDCKNVAFNTAKVRSQTLMMDM-------VPANLESSKV 447
Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ--GKEVFLNIE 477
W + EK GI GN VR + INTTKD++DYLWYT S V G L+IE
Sbjct: 448 DGWSIFREKYGIWGNIDLVRNGFVDHINTTKDSTDYLWYTTSFDVDGSHLAGGNHVLHIE 507
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
S GHA F+N +L+ YGN +NF + + L G N L +LSM VGLQN G ++
Sbjct: 508 SKGHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAGKNKLSLLSMTVGLQNGGPMYEW 567
Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
AGAG+ SV + ++N DLSS +W Y++G+EGEY L K W S P N+
Sbjct: 568 AGAGITSVKISGMENRIIDLSSNKWEYKIGLEGEYYSLFKADKGKDIRWMPQSEPPKNQP 627
Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
+ WYK P+G P+ L++ SMGKG AW+NG +IGRYW S CT CDYRG++
Sbjct: 628 MTWYKVNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWPRISPVSDRCTSSCDYRGTF 687
Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
+KC++ CGQP Q YH+PR+W HP N LVI EE GGDP+KI+ +T +CSFVSE
Sbjct: 688 SPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDPTKITFSRRTVASVCSFVSE 747
Query: 718 ADPP-PVDSWKPNL-GVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM- 774
P ++SW N + +V+L+C +G I+++ FAS+G P G C S++ G+CH
Sbjct: 748 HYPSIDLESWDRNTQNDGRDAAKVQLSCPKGKSISSVKFASFGNPSGTCRSYQQGSCHHP 807
Query: 775 DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ + +V+KAC+ C++ +S G CPG+ K LA+EA CS
Sbjct: 808 NSISVVEKACLNMNGCTLSLSDE--GFGEDLCPGVTKTLAIEADCS 851
>gi|356502950|ref|XP_003520277.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 848
Score = 878 bits (2269), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 439/829 (52%), Positives = 560/829 (67%), Gaps = 24/829 (2%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A+VTYD +AL+I+G+RR+L SGSIHYPRSTP++W +LI K+KEGG++V+ETYVFWN HEP
Sbjct: 25 ASVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLILKAKEGGIDVVETYVFWNVHEP 84
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G Y FEGR+DLVRFVKT+Q+AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT N
Sbjct: 85 SPGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDN 144
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M+ F KI+ +MK E LF SQGGPIIL+Q+ENEYG G G+ YV WAA
Sbjct: 145 EPFKRAMQGFTEKIVGMMKSERLFESQGGPIILSQIENEYGAQSKLQGAAGQNYVNWAAK 204
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
AV + T VPWVMC+++DAPDP+INTCNGFYCD FTPN P KP++WTE +SGWF FG
Sbjct: 205 MAVEMGTGVPWVMCKEDDAPDPVINTCNGFYCDKFTPNRPYKPMIWTEAWSGWFTEFGGP 264
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
+ RPV+DLAFA ARF GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG I
Sbjct: 265 IHKRPVQDLAFAAARFIIRGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLI 324
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPK+GHL+ELH+AIK+CE L+S+DP LG +AH+Y S DCAAFL+NYDS S
Sbjct: 325 RQPKYGHLKELHRAIKMCERALVSTDPIVTSLGEFQQAHVYTTESGDCAAFLSNYDSKSS 384
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A V FN Y LP WSVSILPDC+NVVFNTAKV Q + Q N L + +F
Sbjct: 385 ARVMFNNMHYSLPPWSVSILPDCRNVVFNTAKVGVQTSQ-----MQMLPTNTQLFSWESF 439
Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIE 477
E+ + + + P L EQIN TKD SDYLWY S+ + + G+ L ++
Sbjct: 440 D--EDIYSVDESSAITAPGLLEQINVTKDASDYLWYITSVDIGSSESFLRGGELPTLIVQ 497
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
S GHA VF+N +L +G ++ F K+ L GIN + +LS+ +GL N G F+
Sbjct: 498 STGHAVHVFINGQLSGSAFGTREYRRFTYTGKVNLLAGINRIALLSVAIGLPNVGEHFES 557
Query: 538 AGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS-TLPVN 595
G+ V L L GK DLS +W YQVG++GE + L + +S W Q + + N
Sbjct: 558 WSTGILGPVALHGLDKGKWDLSGQKWTYQVGLKGEAMDLASPNGISSVAWMQSAIVVQRN 617
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
+ L W+KT F APEG PLAL++ MGKGQ W+NGQSIGRYW+A+ +TG C+Y G
Sbjct: 618 QPLTWHKTYFDAPEGDEPLALDMEGMGKGQIWINGQSIGRYWTAF---ATGNCNDCNYAG 674
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
S+ KCQ CGQP Q YH+PR+W+ +NLLVI EELGG+PSKISL+ ++ +C+ V
Sbjct: 675 SFRPPKCQLGCGQPTQRWYHVPRSWLKTTQNLLVIFEELGGNPSKISLVKRSVSSVCADV 734
Query: 716 SEADPPPVDSWKPNLGVVSSS---PQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
SE P + +W S P+V L C G I++I FAS+G P G CG++ GAC
Sbjct: 735 SEYH-PNIKNWHIESYGKSEEFRPPKVHLHCSPGQTISSIKFASFGTPLGTCGNYEQGAC 793
Query: 773 HMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
H I++K C+G+ C++ VS++ G CP +LK L+VEA C+
Sbjct: 794 HSPASYVILEKRCIGKPRCTVTVSNSNFG--QDPCPKVLKRLSVEAVCA 840
>gi|297743077|emb|CBI35944.3| unnamed protein product [Vitis vinifera]
Length = 841
Score = 878 bits (2268), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/831 (52%), Positives = 567/831 (68%), Gaps = 26/831 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
++A+V+YD RA+VI+G+RR+L SGSIHYPRS+PE+WP+LI+K+KEGGL+VI+TYVFWN H
Sbjct: 26 VTASVSYDRRAIVINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGH 85
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP +G+YYFEGR+DLVRF+K V++AGL+++LRIGPY CAEWN+GGFPVWL ++ GI FRT
Sbjct: 86 EPSQGKYYFEGRYDLVRFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVQGINFRT 145
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M+RF KI+D+MK E LF SQGGPIIL+Q+ENEYG +E+ G G Y +WA
Sbjct: 146 NNEPFKWHMQRFTKKIVDMMKSEGLFESQGGPIILSQIENEYGPMEYEIGAPGRAYTEWA 205
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L T VPWVMC+Q+DAPDPIINTCNGFYCD F+PN KP MWTE ++GWF FG
Sbjct: 206 AKMAVGLGTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFG 265
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
AVP RP EDLAF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DE+G
Sbjct: 266 GAVPHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEFG 325
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+RQPKWGHL++LH+AIKLCE LIS DPT LG EAH++H S CAAFLANY+
Sbjct: 326 LLRQPKWGHLKDLHRAIKLCEPALISGDPTVTSLGNYEEAHVFHSKSGACAAFLANYNPR 385
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V+F Y LP WS+SILPDCKN V+NTA++ Q ++ S
Sbjct: 386 SYAKVSFRNMHYNLPPWSISILPDCKNTVYNTARL-----------GAQSATMKMTPVSG 434
Query: 421 AFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQGKEVF 473
F W Y E+ + SF L EQINTT+D SDYLWY+ + + G+
Sbjct: 435 RFGWQSYNEETASYDDSSFAAVGLLEQINTTRDVSDYLWYSTDVKIGYNEGFLKSGRYPV 494
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L + S GHA VF+N +L YG+ + ++ ++L G+NT+ +LS+ VGL N G
Sbjct: 495 LTVLSAGHALHVFINGRLSGTAYGSLENPKLTFSQGVKLRAGVNTIALLSIAVGLPNVGP 554
Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
F+ AG+ V L L G+RDLS +W Y+VG++GE + L +S ++S W +GS +
Sbjct: 555 HFETWNAGVLGPVSLNGLNEGRRDLSWQKWSYKVGLKGEALSLHSLSGSSSVEWVEGSLM 614
Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
+ L WYKTTF AP G PLAL++ SMGKGQ W+NGQ++GRYW AY A TG C+
Sbjct: 615 ARGQPLTWYKTTFNAPGGNTPLALDMGSMGKGQIWINGQNVGRYWPAYKA--TGGCGDCN 672
Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHIC 712
Y G+Y KC +CG+P+Q YH+P +W+ P NLLV+ EE GG+P+ ISL+ + + +C
Sbjct: 673 YAGTYSEKKCLSNCGEPSQRWYHVPHSWLSPTGNLLVVFEESGGNPAGISLVEREIESVC 732
Query: 713 SFVSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
+ + E P ++ G V+ P+ L C G I++I FAS+G PEG CGS+R G
Sbjct: 733 ADIYEWQPTLMNYEMQASGKVNKPLRPKAHLWCAPGQKISSIKFASFGTPEGVCGSYREG 792
Query: 771 ACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+CH +++C+G CS+ V+ G CP ++K L+VEA CS
Sbjct: 793 SCHAHKSYDAFERSCIGMNSCSVTVAPEIFG--GDPCPSVMKKLSVEAICS 841
>gi|356550446|ref|XP_003543598.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 841
Score = 878 bits (2268), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/825 (52%), Positives = 570/825 (69%), Gaps = 18/825 (2%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A+V+YD +A+ I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HEP
Sbjct: 28 ASVSYDSKAITINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEP 87
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G+YYFEG +DLV+F+K VQ+AGL++HLRIGPY CAEWN+GGFPVWL +IPGI FRT N
Sbjct: 88 SPGKYYFEGNYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGISFRTDN 147
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK +M++F KI+DLMK E L+ SQGGPII++Q+ENEYG +E+ G G+ Y KWAA+
Sbjct: 148 EPFKVQMQKFTTKIVDLMKAERLYESQGGPIIMSQIENEYGPMEYEIGAAGKAYTKWAAE 207
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
A+ L T VPW+MC+Q+D PDP+INTCNGFYCD F+PN KP MWTE ++GWF FG
Sbjct: 208 MAMELGTGVPWIMCKQDDTPDPLINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGP 267
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
VP RP EDLAF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG +
Sbjct: 268 VPHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLL 327
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPKWGHL++LH+AIKLCE L+S DPT K+G EAH++ S CAAFLANY+ S
Sbjct: 328 RQPKWGHLKDLHRAIKLCEPALVSGDPTVTKIGNYQEAHVFKSMSGACAAFLANYNPKSY 387
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A V F Y LP WS+SILP+CKN V+NTA+V SQ AQ K + ++
Sbjct: 388 ATVAFGNMHYNLPPWSISILPNCKNTVYNTARVGSQS-------AQMKMTRVPIHGGLSW 440
Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIE 477
+ E+ + + SF L EQ+NTT+D SDYLWY+ + + P + GK+ L +
Sbjct: 441 LSFNEETTTTDDSSFTMTGLLEQLNTTRDLSDYLWYSTDVVLDPNEGFLRNGKDPVLTVF 500
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
S GHA VF+N +L YG+ +F N+ ++L G+N + +LS+ VGL N G F+
Sbjct: 501 SAGHALHVFINGQLSGTAYGSLEFPKLTFNEGVKLRTGVNKISLLSVAVGLPNVGPHFET 560
Query: 538 AGAGLFSVI-LIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
AG+ I L L G+RDLS +W Y+VG++GE + L + ++S W QGS + +
Sbjct: 561 WNAGVLGPISLSGLNEGRRDLSWQKWSYKVGLKGETLSLHSLGGSSSVEWIQGSLVSQRQ 620
Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
L WYKTTF AP+G PLAL++ SMGKGQ W+NGQ++GRYW AY A +G CDY G+
Sbjct: 621 PLTWYKTTFDAPDGTAPLALDMNSMGKGQVWLNGQNLGRYWPAYKA--SGTCDYCDYAGT 678
Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
Y+ +KC+ +CG+ +Q YH+P++W+ P NLLV+ EELGGD + ISL+ + +C+ +
Sbjct: 679 YNENKCRSNCGEASQRWYHVPQSWLKPTGNLLVVFEELGGDLNGISLVRRDIDSVCADIY 738
Query: 717 EADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHMDV 776
E P + G P+V L+C G I++I FAS+G P G+CG+F G+CH +
Sbjct: 739 EWQPNLISYQMQTSGKAPVRPKVHLSCSPGQKISSIKFASFGTPVGSCGNFHEGSCHAHM 798
Query: 777 -LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
++ CVGQ C++ VS G CP +LK L+VEA CS
Sbjct: 799 SYDAFERNCVGQNLCTVAVSPENFG--GDPCPNVLKKLSVEAICS 841
>gi|297829920|ref|XP_002882842.1| hypothetical protein ARALYDRAFT_897617 [Arabidopsis lyrata subsp.
lyrata]
gi|297328682|gb|EFH59101.1| hypothetical protein ARALYDRAFT_897617 [Arabidopsis lyrata subsp.
lyrata]
Length = 847
Score = 877 bits (2267), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 437/829 (52%), Positives = 556/829 (67%), Gaps = 20/829 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+S +V+YD RA+ I+GKRR+L SGSIHYPRSTPE+WP+LIRK+KEGGL+VI+TYVFWN H
Sbjct: 30 VSGSVSYDSRAITINGKRRILISGSIHYPRSTPEMWPDLIRKAKEGGLDVIQTYVFWNGH 89
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G+YYFEG +DLVRFVK VQ++GL+LHLRIGPY CAEWN+GGFPVWL +IPGI FRT
Sbjct: 90 EPSPGKYYFEGNYDLVRFVKLVQQSGLYLHLRIGPYVCAEWNFGGFPVWLKYIPGISFRT 149
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK +M+RF KI+++MK E LF SQGGPIIL+Q+ENEYG +E+ G G Y WA
Sbjct: 150 DNGPFKAQMQRFTTKIVNMMKAERLFESQGGPIILSQIENEYGPMEYELGAPGRSYTNWA 209
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L T VPWVMC+Q+DAPDPIIN CNGFYCD F+PN KP MWTE ++GWF FG
Sbjct: 210 AKMAVGLGTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKMWTEAWTGWFTKFG 269
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
VP+RP ED+AF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 270 GPVPYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 329
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
RQPKWGHL++LH+AIKLCE L+S +PT LG EAH+Y S C+AFLANY+
Sbjct: 330 LERQPKWGHLKDLHRAIKLCEPALVSGEPTRMPLGNYQEAHVYKAKSGACSAFLANYNPK 389
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V+F N Y LP WS+SILPDCKN V+NTA+V +Q ++ K V +
Sbjct: 390 SYAKVSFGSNHYNLPPWSISILPDCKNTVYNTARVGAQT-------SRMKMVRVPVHGGL 442
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
++ Y E + SF L EQINTT+DTSDYLWY + + + G L
Sbjct: 443 SWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWYMTDVKIDANEGFLRNGDLPTLT 502
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ S GHA VF+N +L YG+ D K + L G N + ILS+ VGL N G F
Sbjct: 503 VLSAGHAMHVFINGQLSGSAYGSLDSPKLTFRKGVNLRAGFNKIAILSIAVGLPNVGPHF 562
Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+ AG+ V L L G+RDLS +W Y+VG++GE + L +S ++S W +G+ +
Sbjct: 563 ETWNAGVLGPVSLNGLSGGRRDLSWQKWTYKVGLKGESLSLHSLSGSSSVEWAEGAFVAQ 622
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
+ L WYKTTF AP G PLA+++ SMGKGQ W+NGQS+GR+W AY A G +C Y
Sbjct: 623 KQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQSLGRHWPAYKA--VGSCSECSYT 680
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
G++ KC ++CG+ +Q YH+PR+W+ P NLLV+ EE GGDP+ ISL+ + +C+
Sbjct: 681 GTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVFEEWGGDPNGISLVRREVDSVCAD 740
Query: 715 VSEADPPPVDSWKPNLGVVSSS--PQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
+ E V+ G V+ P+V L C G I + FAS+G PEG CGS+R G+C
Sbjct: 741 IYEWQSTLVNYQLHASGKVNKPLHPKVHLQCGPGQKITTVKFASFGTPEGTCGSYRQGSC 800
Query: 773 H-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
H K CVGQ CS+ V+ G CP ++K LAVEA C+
Sbjct: 801 HDHHSYDAFNKLCVGQNWCSVTVAPEMFG--GDPCPNVMKKLAVEAVCA 847
>gi|165906266|gb|ABY71826.1| beta-galactosidase [Prunus salicina]
Length = 836
Score = 877 bits (2266), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/826 (51%), Positives = 564/826 (68%), Gaps = 21/826 (2%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+A+V+YDH+A++I+G++R+L SGSIHYPRSTPE+WP+LI+KSK+GGL+VI+TYVFWN HE
Sbjct: 25 TASVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKSKDGGLDVIQTYVFWNGHE 84
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G+YYFE R+DLV+F+K V +AGL+++LRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 85 PSPGKYYFEDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGIVFRTD 144
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK M++F KI+ +MK E LF SQGGPIIL+Q+ENE+G VEW G G+ Y KWAA
Sbjct: 145 NEPFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAA 204
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV LNT VPW+MC+QEDAPDP+I+TCNGFYC+ FTPN KP MWTE ++GW+ FG
Sbjct: 205 QMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCENFTPNKNYKPKMWTEVWTGWYTEFGG 264
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
AVP RP EDLAF++ARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 265 AVPTRPAEDLAFSIARFIQKGGSFVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGL 324
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
R+PKWGHLR+LHKAIK E L+S++P+ LG EAH++ KS + CAAFLANYD+ S
Sbjct: 325 PREPKWGHLRDLHKAIKSSESALVSAEPSVTSLGNSQEAHVF-KSKSGCAAFLANYDTKS 383
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
A V+F Y LP WS+SILPDC+ V+NTA++ SQ + Q + + A
Sbjct: 384 SAKVSFGNGQYELPPWSISILPDCRTAVYNTARLGSQ--------SSQMKMTPVKSALPW 435
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
S+ EE + + L EQIN T+DT+DY WY I + P + G+ L I
Sbjct: 436 QSFIEESASSDESDTTTLDGLWEQINVTRDTTDYSWYMTDITISPDEGFIKRGESPLLTI 495
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
S GHA VF+N +L YG + ++ ++L GIN L +LS+ VGL N G F+
Sbjct: 496 YSAGHALHVFINGQLSGTVYGALENPKLTFSQNVKLRSGINKLALLSISVGLPNVGLHFE 555
Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
AG+ V L L +G D+S +W Y+VG++GE +GL +S ++S W +G ++
Sbjct: 556 TWNAGVLGPVTLKGLNSGTWDMSRWKWTYKVGLKGEALGLHTVSGSSSVEWAEGPSMAQK 615
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
+ L WY+ TF AP G GPLAL+++SMGKGQ W+NGQSIGR+W AY A G C Y G
Sbjct: 616 QPLTWYRATFNAPPGNGPLALDMSSMGKGQIWINGQSIGRHWPAYTA--RGNCGNCYYAG 673
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
+YD KC+ HCG+P+Q YH+PR+W+ NLLV+ EE GGDP+KISL+ + +C+ +
Sbjct: 674 TYDDKKCRTHCGEPSQRWYHVPRSWLTTSGNLLVVFEEWGGDPTKISLVERRTSSVCADI 733
Query: 716 SEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM- 774
E P +S K G + + P+ L C G I+ I FASYG+ +G CGSF+ G+CH
Sbjct: 734 FEGQPTLTNSQKLASGKL-NRPKAHLWCPPGQVISDIKFASYGLSQGTCGSFQEGSCHAH 792
Query: 775 DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
++ C+G+ CS+ V+ G CPG K L+VEA CS
Sbjct: 793 KSYDAPKRNCIGKQSCSVTVAPEVFG--GDPCPGSTKKLSVEAVCS 836
>gi|1168654|sp|P45582.1|BGAL_ASPOF RecName: Full=Beta-galactosidase; Short=Lactase; Flags: Precursor
gi|452712|emb|CAA54525.1| beta-galactosidase [Asparagus officinalis]
Length = 832
Score = 877 bits (2265), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 435/833 (52%), Positives = 562/833 (67%), Gaps = 38/833 (4%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
++A+VTYDH++++I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN H
Sbjct: 23 VTASVTYDHKSVIINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGH 82
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP GQYYF GR+DLVRF+K V++AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 83 EPSPGQYYFGGRYDLVRFLKLVKQAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGIHFRT 142
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M +F KI+ +MK E L+ +QGGPIIL+Q+ENEYG VE+ G G+ Y WA
Sbjct: 143 DNGPFKAAMGKFTEKIVSMMKAEGLYETQGGPIILSQIENEYGPVEYYDGAAGKSYTNWA 202
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV LNT VPWVMC+Q+DAPDP+INTCNGFYCD F+PN +KP MWTE ++GWF FG
Sbjct: 203 AKMAVGLNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKDNKPKMWTEAWTGWFTGFG 262
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
AVP RP ED+AFAVARF + GG+F NYYMY GGTNFGRTAGGP ++TSYDYDAPIDEYG
Sbjct: 263 GAVPQRPAEDMAFAVARFIQKGGSFINYYMYHGGTNFGRTAGGPFISTSYDYDAPIDEYG 322
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+RQPKWGHLR+LHKAIKLCE L+S +PT LG E+++Y +S + CAAFLAN++S
Sbjct: 323 LLRQPKWGHLRDLHKAIKLCEPALVSGEPTITSLGQNQESYVY-RSKSSCAAFLANFNSR 381
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
A VTFNG Y LP WSVSILPDCK VFNTA+V +Q +
Sbjct: 382 YYATVTFNGMHYNLPPWSVSILPDCKTTVFNTARVGAQTTTMKMQYL------------G 429
Query: 421 AFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVF 473
FSW Y E + +F + L EQ++TT D SDYLWYT + + + GK +
Sbjct: 430 GFSWKAYTEDTDALNDNTFTKDGLVEQLSTTWDRSDYLWYTTYVDIAKNEEFLKTGKYPY 489
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L + S GHA VF+N +L YG+ D + +L G N + ILS+ VGL N G
Sbjct: 490 LTVMSAGHAVHVFINGQLSGTAYGSLDNPKLTYSGSAKLWAGSNKISILSVSVGLPNVGN 549
Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
F+ G+ V L L GKRDLS +W YQ+G+ GE + L ++ +++ W + S
Sbjct: 550 HFETWNTGVLGPVTLTGLNEGKRDLSLQKWTYQIGLHGETLSLHSLTGSSNVEWGEASQ- 608
Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
+ L WYKT F AP G PLAL++ +MGKGQ W+NGQSIGRYW AY A +G CD
Sbjct: 609 --KQPLTWYKTFFNAPPGNEPLALDMNTMGKGQIWINGQSIGRYWPAYKA--SGSCGSCD 664
Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHIC 712
YRG+Y+ KC +CG+ +Q YH+PR+W+ P N LV+ EE GGDP+ IS++ ++ +C
Sbjct: 665 YRGTYNEKKCLSNCGEASQRWYHVPRSWLIPTGNFLVVLEEWGGDPTGISMVKRSVASVC 724
Query: 713 SFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
+ V E P +D+W+ P+V L+C+ G ++ I FAS+G P+G CGSF G+C
Sbjct: 725 AEVEELQ-PTMDNWRTK---AYGRPKVHLSCDPGQKMSKIKFASFGTPQGTCGSFSEGSC 780
Query: 773 HM----DVLPI--VQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
H D + + CVGQ CS+ V+ G CPG +K LAVEA C
Sbjct: 781 HAHKSYDAFEQEGLMQNCVGQEFCSVNVAPEVFG--GDPCPGTMKKLAVEAIC 831
>gi|449458175|ref|XP_004146823.1| PREDICTED: beta-galactosidase 1-like [Cucumis sativus]
gi|449515710|ref|XP_004164891.1| PREDICTED: beta-galactosidase 1-like [Cucumis sativus]
Length = 841
Score = 877 bits (2265), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 434/829 (52%), Positives = 564/829 (68%), Gaps = 20/829 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ A+V+YD +A++I+G RR+L SGSIHYPRST E+WP+LI+K+KEGGL+VIETYVFWN H
Sbjct: 24 VQASVSYDSKAIIINGHRRILISGSIHYPRSTSEMWPDLIQKAKEGGLDVIETYVFWNGH 83
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G+YYFEG +DLVRFVK V +AGL++HLRIGPY CAEWN+GGFPVWL +IPGI FRT
Sbjct: 84 EPEPGKYYFEGNYDLVRFVKLVHQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGISFRT 143
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK +M+RF KI+++MK E L+ SQGGPIIL+Q+ENEYG +E+ G G+ Y KWA
Sbjct: 144 DNAPFKFQMERFTRKIVNMMKAERLYESQGGPIILSQIENEYGPMEYELGAPGKAYSKWA 203
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A+ L T VPWVMC+Q+DAPDPIINTCNGFYCD F+PN KP MWTE ++GWF FG
Sbjct: 204 AQMALGLGTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTQFG 263
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
AVP RP ED+AFAVARF + GG NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG
Sbjct: 264 GAVPHRPAEDMAFAVARFIQKGGALINYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYG 323
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+RQPKWGHL++L++AIKLCE L+S DP +LG EAH++ S CAAFL+NY+
Sbjct: 324 LLRQPKWGHLKDLNRAIKLCEPALVSGDPIVTRLGNYQEAHVFKSKSGACAAFLSNYNPR 383
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V F Y +P WS+SILPDCKN VFNTA+V +Q A K + S
Sbjct: 384 SYATVAFGNMHYNIPPWSISILPDCKNTVFNTARVGAQT-------AIMKMSPVPMHESF 436
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
++ Y E+ ++F L EQINTT+D +DYLWYT +H+ + GK L
Sbjct: 437 SWQAYNEEPASYNEKAFTTVGLLEQINTTRDATDYLWYTTDVHIDANEGFLRSGKYPVLT 496
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ S GHA VFVN +L YG+ DF ++ + L G N + +LS+ VGL N G F
Sbjct: 497 VLSAGHAMHVFVNGQLAGTAYGSLDFPKLTFSRGVNLRAGNNKIALLSIAVGLPNVGPHF 556
Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
++ AG+ V L L G+RDL+ +W Y++G++GE + L +S ++S W QGS +
Sbjct: 557 EMWNAGILGPVNLNGLDEGRRDLTWQKWTYKIGLDGEAMSLHSLSGSSSVEWIQGSLVAQ 616
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
+ L W+KTTF AP G PLAL++ SMGKGQ W+NGQS+GRYW AY STG CDY
Sbjct: 617 KQPLTWFKTTFNAPAGNSPLALDMGSMGKGQIWLNGQSLGRYWPAY--KSTGSCGSCDYT 674
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
G+Y+ KC +CG+ +Q YH+PR+W++P NLLV+ EE GGDP+ I L+ + +C
Sbjct: 675 GTYNEKKCSSNCGEASQRWYHVPRSWLNPTGNLLVVFEEWGGDPNGIHLVRRDVDSVCVN 734
Query: 715 VSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
++E P ++ + G V+ P+ L+C G I+++ FAS+G PEG CGSFR G+C
Sbjct: 735 INEWQPTLMNWQMQSSGKVNKPLRPKAHLSCGPGQKISSVKFASFGTPEGECGSFREGSC 794
Query: 773 HM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
H Q+ CVGQ C++ V+ G CP ++K L+VE CS
Sbjct: 795 HAHHSYDAFQRTCVGQNFCTVTVAPEMFG--GDPCPNVMKKLSVEVICS 841
>gi|359482511|ref|XP_002279310.2| PREDICTED: beta-galactosidase-like [Vitis vinifera]
Length = 828
Score = 877 bits (2265), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/828 (52%), Positives = 565/828 (68%), Gaps = 26/828 (3%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NV+YD RA+VI+G+RR+L SGSIHYPRS+PE+WP+LI+K+KEGGL+VI+TYVFWN HEP
Sbjct: 16 NVSYDRRAIVINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPS 75
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+G+YYFEGR+DLVRF+K V++AGL+++LRIGPY CAEWN+GGFPVWL ++ GI FRT N
Sbjct: 76 QGKYYFEGRYDLVRFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVQGINFRTNNE 135
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK M+RF KI+D+MK E LF SQGGPIIL+Q+ENEYG +E+ G G Y +WAA
Sbjct: 136 PFKWHMQRFTKKIVDMMKSEGLFESQGGPIILSQIENEYGPMEYEIGAPGRAYTEWAAKM 195
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
AV L T VPWVMC+Q+DAPDPIINTCNGFYCD F+PN KP MWTE ++GWF FG AV
Sbjct: 196 AVGLGTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGAV 255
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RP EDLAF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DE+G +R
Sbjct: 256 PHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEFGLLR 315
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
QPKWGHL++LH+AIKLCE LIS DPT LG EAH++H S CAAFLANY+ S A
Sbjct: 316 QPKWGHLKDLHRAIKLCEPALISGDPTVTSLGNYEEAHVFHSKSGACAAFLANYNPRSYA 375
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
V+F Y LP WS+SILPDCKN V+NTA++ Q ++ S F
Sbjct: 376 KVSFRNMHYNLPPWSISILPDCKNTVYNTARL-----------GAQSATMKMTPVSGRFG 424
Query: 424 W--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
W Y E+ + SF L EQINTT+D SDYLWY+ + + + G+ L +
Sbjct: 425 WQSYNEETASYDDSSFAAVGLLEQINTTRDVSDYLWYSTDVKIGYNEGFLKSGRYPVLTV 484
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
S GHA VF+N +L YG+ + ++ ++L G+NT+ +LS+ VGL N G F+
Sbjct: 485 LSAGHALHVFINGRLSGTAYGSLENPKLTFSQGVKLRAGVNTIALLSIAVGLPNVGPHFE 544
Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
AG+ V L L G+RDLS +W Y+VG++GE + L +S ++S W +GS +
Sbjct: 545 TWNAGVLGPVSLNGLNEGRRDLSWQKWSYKVGLKGEALSLHSLSGSSSVEWVEGSLMARG 604
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
+ L WYKTTF AP G PLAL++ SMGKGQ W+NGQ++GRYW AY A TG C+Y G
Sbjct: 605 QPLTWYKTTFNAPGGNTPLALDMGSMGKGQIWINGQNVGRYWPAYKA--TGGCGDCNYAG 662
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
+Y KC +CG+P+Q YH+P +W+ P NLLV+ EE GG+P+ ISL+ + + +C+ +
Sbjct: 663 TYSEKKCLSNCGEPSQRWYHVPHSWLSPTGNLLVVFEESGGNPAGISLVEREIESVCADI 722
Query: 716 SEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
E P ++ G V+ P+ L C G I++I FAS+G PEG CGS+R G+CH
Sbjct: 723 YEWQPTLMNYEMQASGKVNKPLRPKAHLWCAPGQKISSIKFASFGTPEGVCGSYREGSCH 782
Query: 774 M-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+++C+G CS+ V+ G CP ++K L+VEA CS
Sbjct: 783 AHKSYDAFERSCIGMNSCSVTVAPEIFG--GDPCPSVMKKLSVEAICS 828
>gi|255572957|ref|XP_002527409.1| beta-galactosidase, putative [Ricinus communis]
gi|223533219|gb|EEF34975.1| beta-galactosidase, putative [Ricinus communis]
Length = 845
Score = 876 bits (2264), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/825 (52%), Positives = 565/825 (68%), Gaps = 24/825 (2%)
Query: 7 YDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQ 66
YD +A+ I+G+RR+L SGSIHYPRS+PE+WP+LI+K+KEGGL+VI+TYVFWN HEP G+
Sbjct: 34 YDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPSPGK 93
Query: 67 YYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFK 126
YYFEG +DLV+F+K V++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT N PFK
Sbjct: 94 YYFEGNYDLVKFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGINFRTDNGPFK 153
Query: 127 EEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVN 186
+M+RF KI+++MK E LF SQGGPIIL+Q+ENEYG +E+ G G+ Y KWAA AV
Sbjct: 154 AQMQRFTTKIVNMMKAERLFESQGGPIILSQIENEYGPMEYELGAPGQAYSKWAAKMAVG 213
Query: 187 LNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFR 246
L T VPWVMC+Q+DAPDP+INTCNGFYCD F+PN P KP MWTE ++GWF FG AVP+R
Sbjct: 214 LGTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKPYKPKMWTEAWTGWFTEFGGAVPYR 273
Query: 247 PVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPK 306
P EDLAF+VARF + GG F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG +RQPK
Sbjct: 274 PAEDLAFSVARFIQKGGAFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLLRQPK 333
Query: 307 WGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVT 366
WGHL++LH+AIKLCE L+S P+ LG EAH++ S CAAFLANY+ S A V+
Sbjct: 334 WGHLKDLHRAIKLCEPALVSGAPSVMPLGNYQEAHVFKSKSGACAAFLANYNQRSFAKVS 393
Query: 367 FNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW-- 424
F Y LP WS+SILPDCKN V+NTA++ +Q A+ K + FSW
Sbjct: 394 FGNMHYNLPPWSISILPDCKNTVYNTARIGAQS-------ARMK--MSPIPMRGGFSWQA 444
Query: 425 YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESL 479
Y E+ G+ +F+ L EQINTT+D SDYLWY+ + + + GK L + S
Sbjct: 445 YSEEASTEGDNTFMMVGLLEQINTTRDVSDYLWYSTDVRIDSNEGFLRSGKYPVLTVLSA 504
Query: 480 GHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAG 539
GHA VFVN +L YG+ + ++ +++ GIN + +LS+ VGL N G F+
Sbjct: 505 GHALHVFVNGQLSGTAYGSLESPKLTFSQGVKMRAGINRIYLLSIAVGLPNVGPHFETWN 564
Query: 540 AGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
AG+ V L L G+RDLS +W Y++G+ GE + L +S ++S W QGS + + L
Sbjct: 565 AGVLGPVTLNGLNEGRRDLSWQKWTYKIGLHGEALSLHSLSGSSSVEWAQGSFVSRKQPL 624
Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
+WYKTTF AP G PLAL++ SMGKGQ W+NGQS+GRYW AY A +G C+Y G+++
Sbjct: 625 MWYKTTFNAPAGNSPLALDMGSMGKGQVWINGQSVGRYWPAYKA--SGNCGVCNYAGTFN 682
Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEA 718
KC +CG+ +Q YH+PR+W++ NLLV+ EE GGDP+ ISL+ + +C+ + E
Sbjct: 683 EKKCLTNCGEASQRWYHVPRSWLNTAGNLLVVFEEWGGDPNGISLVRREVDSVCADIYEW 742
Query: 719 DPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH-MD 775
P ++ + G V+ P+V L C G I+ I FAS+G PEG CGS+R G+CH
Sbjct: 743 QPTLMNYMMQSSGKVNKPLRPKVHLQCGAGQKISLIKFASFGTPEGVCGSYRQGSCHAFH 802
Query: 776 VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ CVGQ CS+ V+ G CP ++K LAVEA CS
Sbjct: 803 SYDAFNRLCVGQNWCSVTVAPEMFG--GDPCPNVMKKLAVEAVCS 845
>gi|357131396|ref|XP_003567324.1| PREDICTED: beta-galactosidase 3-like [Brachypodium distachyon]
Length = 916
Score = 875 bits (2262), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 426/826 (51%), Positives = 547/826 (66%), Gaps = 17/826 (2%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
+ VTYD R+L+I G+RR+L S SIHYPRS P +WP+L+ ++K+GG + IETYVFWN HE
Sbjct: 100 SGVTYDGRSLIISGRRRLLISTSIHYPRSVPAMWPKLVAEAKDGGADCIETYVFWNGHET 159
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G+YYFE RFDLVRF K V++AGL+L LRIGP+ AEWN+GG PVWLH+IPG FRT N
Sbjct: 160 APGEYYFEDRFDLVRFAKVVKDAGLYLMLRIGPFVAAEWNFGGVPVWLHYIPGAVFRTNN 219
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK MK F KI+D+MK+E FASQGG IILAQ+ENEYG+ E AYG G+ Y WAA
Sbjct: 220 EPFKSHMKSFTTKIVDMMKRERFFASQGGHIILAQIENEYGDTEQAYGADGKAYAMWAAS 279
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
A+ NT VPW+MCQQ DAP+ +INTCN FYCD F NSP+KP +WTEN+ GWF +FG +
Sbjct: 280 MALAQNTGVPWIMCQQYDAPEHVINTCNSFYCDQFKTNSPTKPKIWTENWPGWFQTFGES 339
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
P RP ED+AF+VARFF+ GG+ QNYY+Y GGTNFGRT GGP + TSYDYDAPIDEYG
Sbjct: 340 NPHRPPEDVAFSVARFFQKGGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGLT 399
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
R PKW HLR+LHK+IKLCE L+ + T LG K EA +Y S C AFLAN D +D
Sbjct: 400 RLPKWAHLRDLHKSIKLCEHSLLYGNLTSLSLGTKQEADVYTDHSGGCVAFLANIDPEND 459
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
VTF Y LPAWSVSILPDCKN VFNTAKV SQ D V E L ++
Sbjct: 460 TVVTFRSRQYDLPAWSVSILPDCKNAVFNTAKVQSQTLMVDM-------VPETLQSTKPD 512
Query: 423 SW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV---MPGQGKEVFLNIE 477
W + EK GI F+R + INTTKD++DYLW+T S +V P G L+I+
Sbjct: 513 RWSIFREKTGIWDKNDFIRNGFVDHINTTKDSTDYLWHTTSFNVDRSYPTNGNRELLSID 572
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
S GHA F+N +L+ YGN ++F ++ I+L G N + +LSM VGLQN G ++
Sbjct: 573 SKGHAVHAFLNNELIGSAYGNGSKSSFNVHMPIKLKPGKNEIALLSMTVGLQNAGPHYEW 632
Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
GAGL SV + +KNG DLSS W Y++G+EGE+ GL K N+ W S P +
Sbjct: 633 VGAGLTSVNISGMKNGSIDLSSNNWAYKIGLEGEHYGLFKPDQGNNQRWSPQSEPPKGQP 692
Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
L WYK P+G P+ +++ SMGKG AW+NG +IGRYW + CT C+YRG +
Sbjct: 693 LTWYKVNVDVPQGDDPVGIDMQSMGKGLAWLNGNAIGRYWPRTSSSDDRCTPSCNYRGPF 752
Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
+ SKC+ CG+P Q YH+PR+W HP N LV+ EE GGDP+KI+ + +CSFVSE
Sbjct: 753 NPSKCRTGCGKPTQRWYHVPRSWFHPSGNTLVVFEEQGGDPTKITFSRRVATKVCSFVSE 812
Query: 718 ADPP-PVDSWKPNLGVV-SSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM- 774
P ++SW ++ + +V+L+C +G +I+++ FAS+G P G C S++ G CH
Sbjct: 813 NYPSIDLESWDKSISDDGKDTAKVQLSCPKGKNISSVKFASFGDPSGTCRSYQQGRCHHP 872
Query: 775 DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
L +V+KAC+ C++ +S G CPG+ K LA+EA CS
Sbjct: 873 SSLSVVEKACLNINSCTVSLSDE--GFGKDLCPGVAKTLAIEADCS 916
>gi|61162201|dbj|BAD91082.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 854
Score = 875 bits (2260), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/828 (51%), Positives = 560/828 (67%), Gaps = 26/828 (3%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD +A+VI+G+RR+L SGSIHYPRSTPE+W +LI+K+K+GGL+V+ETYVFWN HEP
Sbjct: 28 VTYDRKAIVINGQRRILISGSIHYPRSTPEMWEDLIQKAKDGGLDVVETYVFWNVHEPTP 87
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G Y FEGR+DLVRF+KT+Q+AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT N P
Sbjct: 88 GNYNFEGRYDLVRFLKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 147
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK M+ F KI+ LMK E+LF SQGGPIIL+Q+ENEYG +G G Y+ WAA+ A
Sbjct: 148 FKRAMQGFTQKIVGLMKSESLFESQGGPIILSQIENEYGAQSKLFGAAGHNYITWAAEMA 207
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
V L+T VPWVMC++EDAPDP+INTCNGFYCD F+PN P KP +WTE +SGWF FG +
Sbjct: 208 VGLDTGVPWVMCKEEDAPDPVINTCNGFYCDSFSPNRPYKPTIWTETWSGWFTEFGGPIH 267
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
RPV+DLA+AVA F + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG IRQ
Sbjct: 268 QRPVQDLAYAVATFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLIRQ 327
Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
PK+GHL+ELHKAIK+CE L+S+DP LG +A++Y S DC+AFL+N+DS S A
Sbjct: 328 PKYGHLKELHKAIKMCERALVSADPIITSLGNFQQAYVYTSESGDCSAFLSNHDSKSAAR 387
Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
V FN Y LP WS+SILPDC+NVVFNTAKV Q + Q N +L+ ++
Sbjct: 388 VMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSQ-----MQMLPTNIPMLSWESYD- 441
Query: 425 YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESL 479
E+ + + + P L EQIN T+D++DYLWY S+ + + G+ L ++S
Sbjct: 442 -EDLTSMDDSSTMTAPGLLEQINVTRDSTDYLWYITSVDIDSSESFLHGGELPTLIVQST 500
Query: 480 GHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAG 539
GHA +F+N +L +G + F K+ L G N + +LS+ VGL N G F+
Sbjct: 501 GHAVHIFINGQLTGSAFGTRESRRFTYTGKVNLRAGTNKIALLSVAVGLPNVGGHFEAWN 560
Query: 540 AGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS- 597
G+ V L L GK DLS +W YQVG++GE + L + +S W GS + K
Sbjct: 561 TGILGPVALHGLNQGKWDLSWQKWTYQVGLKGEAMNLVSQNAFSSVEWISGSLIAQKKQQ 620
Query: 598 -LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
L W+KT F PEG PLAL++ MGKGQ W+NGQSIGRYW+A+ + G C Y G
Sbjct: 621 PLTWHKTIFNEPEGSEPLALDMEGMGKGQIWINGQSIGRYWTAF---ANGNCNGCSYAGG 677
Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
+ +KCQ CG+P Q YH+PR+W+ P +NLLV+ EELGGDPS+ISL+ + +CS V+
Sbjct: 678 FRPTKCQSGCGKPTQRYYHVPRSWLKPTQNLLVLFEELGGDPSRISLVKRAVSSVCSEVA 737
Query: 717 EADPPPVDSWK-PNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
E P + +W + G V SP+V L C G I++I FAS+G P G CGS++ G CH
Sbjct: 738 EYH-PTIKNWHIESYGKVEDFHSPKVHLRCNPGQAISSIKFASFGTPLGTCGSYQEGTCH 796
Query: 774 MDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+VQK C+G+ C++ +S++ G CP +LK L+VEA C+
Sbjct: 797 ATTSYSVVQKKCIGKQRCAVTISNSNFG---DPCPKVLKRLSVEAVCA 841
>gi|297798272|ref|XP_002867020.1| hypothetical protein ARALYDRAFT_491000 [Arabidopsis lyrata subsp.
lyrata]
gi|297312856|gb|EFH43279.1| hypothetical protein ARALYDRAFT_491000 [Arabidopsis lyrata subsp.
lyrata]
Length = 853
Score = 874 bits (2257), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 434/831 (52%), Positives = 555/831 (66%), Gaps = 24/831 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ VTYD +AL+I+G+RR+L SGSIHYPRSTP++W LI+K+K+GG++VIETYVFWN H
Sbjct: 26 VQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEGLIQKAKDGGIDVIETYVFWNLH 85
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G+Y FEGR DLVRFVKT+ +AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 86 EPTPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 145
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK MK F +I++LMK ENLF SQGGPIIL+Q+ENEYG G G Y+ WA
Sbjct: 146 DNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQLLGAEGHNYMTWA 205
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A+ T VPWVMC+++DAPDP+INTCNGFYCD F PN P KP++WTE +SGWF FG
Sbjct: 206 AKMAIATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNKPYKPLIWTEAWSGWFTEFG 265
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ RPV+DLAF VARF + GG+F NYYMY GGTNFGRTAGGP V TSYDYDAPIDEYG
Sbjct: 266 GPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYG 325
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IR+PK+GHL+ELH+AIK+CE+ L+S+DP +G K +AH+Y S DC+AFLANYD+
Sbjct: 326 LIREPKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQQAHVYSAESGDCSAFLANYDTE 385
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V FN Y LP WS+SILPDC+N VFNTAKV Q + + KN
Sbjct: 386 SAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQTSQMEMLPTDTKNFQWQ----- 440
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
S+ E+ + + +F L EQIN T+DTSDYLWY S+ + + G+ L
Sbjct: 441 --SYLEDLSSLDDSSTFTTQGLLEQINVTRDTSDYLWYMTSVDIGDTESFLHGGELPTLI 498
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
I+S GHA +FVN +L +G F KI L+ G N + +LS+ VGL N G F
Sbjct: 499 IQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSVAVGLPNVGGHF 558
Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS-TLP 593
+ G+ V L L GKRDLS +W YQVG++GE + L + S W S T+
Sbjct: 559 ESWNTGILGPVALHGLSQGKRDLSWQKWTYQVGLKGEAMNLAFPTNTRSIGWMDASLTVQ 618
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ L W+KT F APEG PLAL++ MGKGQ WVNG+SIGRYW+A+ +TG +C Y
Sbjct: 619 KPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAF---ATGDCSQCSY 675
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
G+Y +KCQ CGQP Q YH+PR+W+ P +NLLVI EELGG+PS +SL+ ++ +C+
Sbjct: 676 TGTYKPNKCQTGCGQPTQRYYHVPRSWLKPSQNLLVIFEELGGNPSSVSLVKRSVSGVCA 735
Query: 714 FVSEADPPPVDSWKPN---LGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
VSE P + +W+ G P+V L C G IA+I FAS+G P G CGS++ G
Sbjct: 736 EVSEYH-PNIKNWQIESYGKGQTFHRPKVHLKCSPGQAIASIKFASFGTPLGTCGSYQQG 794
Query: 771 ACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CH I+++ CVG+ C++ +S+ G CP +LK L VEA C+
Sbjct: 795 ECHAATSYAILERKCVGKARCAVTISNTNFG--KDPCPNVLKRLTVEAVCA 843
>gi|15231354|ref|NP_187988.1| beta galactosidase 1 [Arabidopsis thaliana]
gi|75274602|sp|Q9SCW1.1|BGAL1_ARATH RecName: Full=Beta-galactosidase 1; Short=Lactase 1; Flags:
Precursor
gi|6686874|emb|CAB64737.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|9294020|dbj|BAB01923.1| beta-galactosidase [Arabidopsis thaliana]
gi|332641886|gb|AEE75407.1| beta galactosidase 1 [Arabidopsis thaliana]
Length = 847
Score = 874 bits (2257), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 435/829 (52%), Positives = 555/829 (66%), Gaps = 20/829 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+S +V+YD RA+ I+GKRR+L SGSIHYPRSTPE+WP+LIRK+KEGGL+VI+TYVFWN H
Sbjct: 30 VSGSVSYDSRAITINGKRRILISGSIHYPRSTPEMWPDLIRKAKEGGLDVIQTYVFWNGH 89
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G+YYFEG +DLV+FVK VQ++GL+LHLRIGPY CAEWN+GGFPVWL +IPGI FRT
Sbjct: 90 EPSPGKYYFEGNYDLVKFVKLVQQSGLYLHLRIGPYVCAEWNFGGFPVWLKYIPGISFRT 149
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK +M+RF KI+++MK E LF SQGGPIIL+Q+ENEYG +E+ G G Y WA
Sbjct: 150 DNGPFKAQMQRFTTKIVNMMKAERLFESQGGPIILSQIENEYGPMEYELGAPGRSYTNWA 209
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L T VPWVMC+Q+DAPDPIIN CNGFYCD F+PN KP MWTE ++GWF FG
Sbjct: 210 AKMAVGLGTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKMWTEAWTGWFTKFG 269
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
VP+RP ED+AF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 270 GPVPYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 329
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
RQPKWGHL++LH+AIKLCE L+S +PT LG EAH+Y S C+AFLANY+
Sbjct: 330 LERQPKWGHLKDLHRAIKLCEPALVSGEPTRMPLGNYQEAHVYKSKSGACSAFLANYNPK 389
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V+F N Y LP WS+SILPDCKN V+NTA+V +Q ++ K V +
Sbjct: 390 SYAKVSFGNNHYNLPPWSISILPDCKNTVYNTARVGAQT-------SRMKMVRVPVHGGL 442
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
++ Y E + SF L EQINTT+DTSDYLWY + V + G L
Sbjct: 443 SWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWYMTDVKVDANEGFLRNGDLPTLT 502
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ S GHA VF+N +L YG+ D K + L G N + ILS+ VGL N G F
Sbjct: 503 VLSAGHAMHVFINGQLSGSAYGSLDSPKLTFRKGVNLRAGFNKIAILSIAVGLPNVGPHF 562
Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+ AG+ V L L G+RDLS +W Y+VG++GE + L +S ++S W +G+ +
Sbjct: 563 ETWNAGVLGPVSLNGLNGGRRDLSWQKWTYKVGLKGESLSLHSLSGSSSVEWAEGAFVAQ 622
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
+ L WYKTTF AP G PLA+++ SMGKGQ W+NGQS+GR+W AY A G +C Y
Sbjct: 623 KQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQSLGRHWPAYKA--VGSCSECSYT 680
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
G++ KC ++CG+ +Q YH+PR+W+ P NLLV+ EE GGDP+ I+L+ + +C+
Sbjct: 681 GTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVFEEWGGDPNGITLVRREVDSVCAD 740
Query: 715 VSEADPPPVDSWKPNLGVVSSS--PQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
+ E V+ G V+ P+ L C G I + FAS+G PEG CGS+R G+C
Sbjct: 741 IYEWQSTLVNYQLHASGKVNKPLHPKAHLQCGPGQKITTVKFASFGTPEGTCGSYRQGSC 800
Query: 773 HM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
H K CVGQ CS+ V+ G CP ++K LAVEA C+
Sbjct: 801 HAHHSYDAFNKLCVGQNWCSVTVAPEMFG--GDPCPNVMKKLAVEAVCA 847
>gi|114217393|dbj|BAF31232.1| beta-D-galactosidase [Persea americana]
Length = 889
Score = 873 bits (2256), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 448/857 (52%), Positives = 574/857 (66%), Gaps = 49/857 (5%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NV+YDHRAL+IDGKRR+L S IHYPR+TPE+WP+LI KSKEGG ++I+TY FWN HEPI
Sbjct: 30 NVSYDHRALIIDGKRRMLISSGIHYPRATPEMWPDLIAKSKEGGADLIQTYAFWNGHEPI 89
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
RGQY FEGR+D+V+F+K AGL+ HLRIGPY CAEWN+GGFPVWL IPGI+FRT N
Sbjct: 90 RGQYNFEGRYDIVKFIKLAGSAGLYFHLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTDNA 149
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
P+K+EM+RF+ KI+DLM+QE LF+ QGGPIIL Q+ENEYGN+E YG G+ YVKWAAD
Sbjct: 150 PYKDEMQRFVKKIVDLMRQEMLFSWQGGPIILLQIENEYGNIERLYGQRGKDYVKWAADM 209
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A+ L VPWVMC+Q DAP+ II+ CN FYCDGF PNS KP +WTE+++GW+ S+G V
Sbjct: 210 AIGLGAGVPWVMCRQTDAPENIIDACNAFYCDGFKPNSYRKPALWTEDWNGWYTSWGGRV 269
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RPVED AFAVARFF+ GG++ NYYM+FGGTNFGRT+GGP TSYDYDAPIDEYG +
Sbjct: 270 PHRPVEDNAFAVARFFQRGGSYHNYYMFFGGTNFGRTSGGPFYVTSYDYDAPIDEYGLLS 329
Query: 304 QPKWGHLRELHKAIKLCEEYLISSD--PTHQKLGAKLEAHIYHKSS-------------N 348
QPKWGHL++LH AIKLCE L++ D P + +LG EAH+Y SS
Sbjct: 330 QPKWGHLKDLHSAIKLCEPALVAVDDAPQYIRLGPMQEAHVYRHSSYVEDQSSSTLGNGT 389
Query: 349 DCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQ 408
C+AFLAN D + ANV F G VY LP WSVSILPDCKNV FNTAKV SQ + F+
Sbjct: 390 LCSAFLANIDEHNSANVKFLGQVYSLPPWSVSILPDCKNVAFNTAKVASQISVKTVEFSS 449
Query: 409 Q--KNVNE---LLL------ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLW 457
+N E LLL S+ + +E +G G +F + E +N TKDTSDYLW
Sbjct: 450 PFIENTTEPGYLLLHDGVHHISTNWMILKEPIGEWGGNNFTAEGILEHLNVTKDTSDYLW 509
Query: 458 YTASIHVMPG-----QGKEVF--LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKI 510
Y +H+ + EV L I+S+ +FVN +L G+H + + +
Sbjct: 510 YIMRLHISDEDISFWEASEVSPKLIIDSMRDVVRIFVNGQLA----GSHVGRWVRVEQPV 565
Query: 511 ELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVI-LIDLKNGKRDLSSGEWIYQVGVE 569
+L +G N L ILS VGLQNYGA+ + GAG I L LK+G+ DL++ W+YQVG+
Sbjct: 566 DLVQGYNELAILSETVGLQNYGAFLEKDGAGFKGQIKLTGLKSGEYDLTNSLWVYQVGLR 625
Query: 570 GEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVN 629
GE++ + + S+ W V + WYKT F AP+GK P++L L SMGKGQAWVN
Sbjct: 626 GEFMKIFSLEEHESADWVDLPNDSVPSAFTWYKTFFDAPQGKDPVSLYLGSMGKGQAWVN 685
Query: 630 GQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLV 689
G SIGRYWS +AP GC + CDYRG+Y SKC +CG+P Q+ YHIPR+W+ P +NLLV
Sbjct: 686 GHSIGRYWS-LVAPVDGC-QSCDYRGAYHESKCATNCGKPTQSWYHIPRSWLQPSKNLLV 743
Query: 690 IHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKP------NLGVVSSSPQVRLAC 743
I EE GG+P +IS+ + IC+ VSE+ PP+ W + + ++ P++ L C
Sbjct: 744 IFEETGGNPLEISVKLHSTSSICTKVSESHYPPLHLWSHKDIVNGKVSISNAVPEIHLQC 803
Query: 744 ERGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVS 802
+ G I++I FAS+G P+G+C F G CH + +V +AC G+ CSI VS+ G
Sbjct: 804 DNGQRISSIMFASFGTPQGSCQRFSQGDCHAPNSFSVVSEACQGRNNCSIGVSNKVFG-- 861
Query: 803 AGACPGLLKALAVEAHC 819
C G++K LAVEA C
Sbjct: 862 GDPCRGVVKTLAVEAKC 878
>gi|350537661|ref|NP_001234303.1| beta-galactosidase precursor [Solanum lycopersicum]
gi|7939619|gb|AAF70822.1|AF154421_1 beta-galactosidase [Solanum lycopersicum]
gi|4138137|emb|CAA10173.1| ss-galactosidase [Solanum lycopersicum]
Length = 838
Score = 873 bits (2256), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 426/830 (51%), Positives = 567/830 (68%), Gaps = 26/830 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+A+V+YDHRA++++G+RR+L SGS+HYPRSTPE+WP +I+K+KEGG++VI+TYVFWN HE
Sbjct: 24 TASVSYDHRAIIVNGQRRILISGSVHYPRSTPEMWPGIIQKAKEGGVDVIQTYVFWNGHE 83
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P +G+YYFEGR+DLV+F+K V +AGL++HLR+GPYACAEWN+GGFPVWL ++PGI FRT
Sbjct: 84 PQQGKYYFEGRYDLVKFIKLVHQAGLYVHLRVGPYACAEWNFGGFPVWLKYVPGISFRTD 143
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK M++F AKI+++MK E L+ +QGGPIIL+Q+ENEYG +EW G G+ Y +WAA
Sbjct: 144 NGPFKAAMQKFTAKIVNMMKAERLYETQGGPIILSQIENEYGPMEWELGAPGKSYAQWAA 203
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV L+T VPWVMC+Q+DAPDPIIN CNGFYCD F+PN KP +WTE ++ WF FG
Sbjct: 204 KMAVGLDTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKIWTEAWTAWFTGFGN 263
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
VP+RP EDLAF+VA+F + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 264 PVPYRPAEDLAFSVAKFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGL 323
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPKWGHL++LH+AIKLCE L+S DP LG + EAH++ + CAAFLANYD S
Sbjct: 324 LRQPKWGHLKDLHRAIKLCEPALVSGDPAVTALGHQQEAHVFRSKAGSCAAFLANYDQHS 383
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
A V+F Y LP WS+SILPDCKN VFNTA++ +Q AQ K + S
Sbjct: 384 FATVSFANRHYNLPPWSISILPDCKNTVFNTARIGAQS-------AQMK----MTPVSRG 432
Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
W + E+ + SF L EQINTT+D SDYLWY+ + + + GK +L
Sbjct: 433 LPWQSFNEETSSYEDSSFTVVGLLEQINTTRDVSDYLWYSTDVKIDSREKFLRGGKWPWL 492
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
I S GHA VFVN +L YG+ + +K + L G+N + +LS+ VGL N G
Sbjct: 493 TIMSAGHALHVFVNGQLAGTAYGSLEKPKLTFSKAVNLRAGVNKISLLSIAVGLPNIGPH 552
Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
F+ AG+ V L L GKRDL+ +W Y+VG++GE + L +S ++S W +GS +
Sbjct: 553 FETWNAGVLGPVSLTGLDEGKRDLTWQKWSYKVGLKGEALSLHSLSGSSSVEWVEGSLVA 612
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ L WYK+TF AP G PLAL+L +MGKGQ W+NGQS+GRYW Y A +G C+Y
Sbjct: 613 QRQPLTWYKSTFNAPAGNDPLALDLNTMGKGQVWINGQSLGRYWPGYKA--SGNCGACNY 670
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
G ++ KC +CG+ +Q YH+PR+W++P NLLV+ EE GG+P ISL+ + +C+
Sbjct: 671 AGWFNEKKCLSNCGEASQRWYHVPRSWLYPTGNLLVLFEEWGGEPHGISLVKREVASVCA 730
Query: 714 FVSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
++E P V+ G V P+ L+C G I +I FAS+G P+G CGSFR G+
Sbjct: 731 DINEWQPQLVNWQMQASGKVDKPLRPKAHLSCASGQKITSIKFASFGTPQGVCGSFREGS 790
Query: 772 CH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CH ++ C+GQ CS+PV+ G CP ++K L+VE CS
Sbjct: 791 CHAFHSYDAFERYCIGQNSCSVPVTPEIFG--GDPCPHVMKKLSVEVICS 838
>gi|326503960|dbj|BAK02766.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 845
Score = 873 bits (2256), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/828 (51%), Positives = 548/828 (66%), Gaps = 21/828 (2%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
+ VTYDHR+LVI G+RR+L S SIHYPRS P +WP+L+ ++KEGG + IETYVFWN HE
Sbjct: 29 SGVTYDHRSLVISGRRRLLISASIHYPRSVPAMWPKLVAEAKEGGADCIETYVFWNGHET 88
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G+YYFE RFDLV+F + V++AGLFL LRIGP+ AEWN+GG P WLH+IPG FRT N
Sbjct: 89 APGKYYFEDRFDLVQFARVVKDAGLFLMLRIGPFVAAEWNFGGVPAWLHYIPGTVFRTNN 148
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK MK F KI+D+MK++ FASQGG IILAQ+ENEYG + AYG GG+ Y WA
Sbjct: 149 EPFKSHMKSFTTKIVDMMKEQRFFASQGGHIILAQIENEYGYYQQAYGAGGKAYAMWAGS 208
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
A NT VPW+MCQQ D PD +INTCN FYCD F PNSP++P +WTEN+ GWF +FG +
Sbjct: 209 MAQAQNTGVPWIMCQQYDVPDRVINTCNSFYCDQFKPNSPTQPKIWTENWPGWFQTFGES 268
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
P RP ED+AF+VARFF GG+ QNYY+Y GGTNF RTAGGP + TSYDYDAPIDEYG
Sbjct: 269 NPHRPPEDVAFSVARFFGKGGSVQNYYVYHGGTNFDRTAGGPFITTSYDYDAPIDEYGLR 328
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
R PKW HL+ELH++IKLCE L+ + T LG + EA +Y S C AFLAN DS D
Sbjct: 329 RLPKWAHLKELHQSIKLCEHSLLFGNSTLLSLGPQQEADVYTDHSGGCVAFLANIDSEKD 388
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
VTF Y LPAWSVSILPDCKNVVFNTAKV SQ D V L AS
Sbjct: 389 RVVTFRNRQYDLPAWSVSILPDCKNVVFNTAKVRSQTLMVDM-------VPGTLQASKPD 441
Query: 423 SW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV---MPGQGKEVFLNIE 477
W + E++G+ FVR + + INTTKD++DYLW+T S V P G LNI+
Sbjct: 442 QWSIFTERIGVWDKNDFVRNEFVDHINTTKDSTDYLWHTTSFDVDRNYPSSGNHPVLNID 501
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
S GHA F+N L+ YGN ++F + I L G N + ILSM VGL++ G +++
Sbjct: 502 SKGHAVHAFLNNMLIGSAYGNGSESSFSAHMPINLKAGKNEIAILSMTVGLKSAGPYYEW 561
Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
GAGL SV + +KNG DLSS W Y+VG+EGE+ GL K N+ W+ S P ++
Sbjct: 562 VGAGLTSVNISGMKNGTTDLSSNNWAYKVGLEGEHYGLFKHDQGNNQRWRPQSQPPKHQP 621
Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
L WYK P+G P+ L++ SMGKG W+NG +IGRYW + CT CDYRG +
Sbjct: 622 LTWYKVNVDVPQGDDPVGLDMQSMGKGLVWLNGNAIGRYWPRTSPTNDRCTTSCDYRGKF 681
Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
+KC+ CG+P Q YH+PR+W HP N LV+ EE GGDP+KI+ + +CSFVSE
Sbjct: 682 SPNKCRVGCGKPTQRWYHVPRSWFHPSGNTLVVFEEQGGDPTKITFSRRVATSVCSFVSE 741
Query: 718 ADPP-PVDSWKPNL---GVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
P ++SW ++ G V++ +V+L+C +G +I+++ FAS+G P G C S++ G+CH
Sbjct: 742 NYPSIDLESWDKSISDDGRVAA--KVQLSCPKGKNISSVKFASFGDPSGTCRSYQQGSCH 799
Query: 774 M-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
D + +V+KAC+ C++ +S G CPG+ K LA+EA CS
Sbjct: 800 HPDSVSVVEKACMNMNSCTVSLSDE--GFGEDPCPGVTKTLAIEADCS 845
>gi|308550948|gb|ADO34788.1| beta-galactosidase STBG3 [Solanum lycopersicum]
Length = 838
Score = 873 bits (2255), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 426/830 (51%), Positives = 567/830 (68%), Gaps = 26/830 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+A+V+YDHRA++++G+RR+L SGS+HYPRSTPE+WP +I+K+KEGG++VI+TYVFWN HE
Sbjct: 24 TASVSYDHRAIIVNGQRRILISGSVHYPRSTPEMWPGIIQKAKEGGVDVIQTYVFWNGHE 83
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P +G+YYFEGR+DLV+F+K V +AGL++HLR+GPYACAEWN+GGFPVWL ++PGI FRT
Sbjct: 84 PQQGKYYFEGRYDLVKFIKLVHQAGLYVHLRVGPYACAEWNFGGFPVWLKYVPGISFRTD 143
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK M++F AKI+++MK E L+ +QGGPIIL+Q+ENEYG +EW G G+ Y +WAA
Sbjct: 144 NGPFKAAMQKFTAKIVNMMKAERLYETQGGPIILSQIENEYGPMEWELGAPGKSYAQWAA 203
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV L+T VPWVMC+Q+DAPDPIIN CNGFYCD F+PN KP +WTE ++ WF FG
Sbjct: 204 KMAVGLDTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKIWTEAWTAWFTGFGN 263
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
VP+RP EDLAF+VA+F + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 264 PVPYRPAEDLAFSVAKFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGL 323
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPKWGHL++LH+AIKLCE L+S DP LG + EAH++ + CAAFLANYD S
Sbjct: 324 LRQPKWGHLKDLHRAIKLCEPALVSGDPAVTALGHQQEAHVFRSKAGSCAAFLANYDQHS 383
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
A V+F Y LP WS+SILPDCKN VFNTA++ +Q AQ K + S
Sbjct: 384 FATVSFANRHYNLPPWSISILPDCKNTVFNTARIGAQS-------AQMK----MTPVSRG 432
Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
W + E+ + SF L EQINTT+D SDYLWY+ + + + GK +L
Sbjct: 433 LPWQSFNEETSSYEDSSFTVVGLLEQINTTRDVSDYLWYSTDVKIDSREKFLRGGKWPWL 492
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
I S GHA VFVN +L YG+ + +K + L G+N + +LS+ VGL N G
Sbjct: 493 TIMSAGHALHVFVNGQLAGTAYGSLEKPKLTFSKAVNLRAGVNKISLLSIAVGLPNIGPH 552
Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
F+ AG+ V L L GKRDL+ +W Y+VG++GE + L +S ++S W +GS +
Sbjct: 553 FETWNAGVLGPVSLTGLDEGKRDLTWQKWSYKVGLKGEALSLHSLSGSSSVEWVEGSLVA 612
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ L WYK+TF AP G PLAL+L +MGKGQ W+NGQS+GRYW Y A +G C+Y
Sbjct: 613 QRQPLTWYKSTFNAPAGNDPLALDLNTMGKGQVWINGQSLGRYWPGYKA--SGNCGACNY 670
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
G ++ KC +CG+ +Q YH+PR+W++P NLLV+ EE GG+P ISL+ + +C+
Sbjct: 671 AGWFNEKKCLSNCGEASQRWYHVPRSWLYPTGNLLVLFEEWGGEPHGISLVKREVASVCA 730
Query: 714 FVSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
++E P V+ G V P+ L+C G I +I FAS+G P+G CGSFR G+
Sbjct: 731 DINEWQPQLVNWQMQASGKVDKPLRPKAHLSCAPGQKITSIKFASFGTPQGVCGSFREGS 790
Query: 772 CH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CH ++ C+GQ CS+PV+ G CP ++K L+VE CS
Sbjct: 791 CHAFHSYDAFERYCIGQNSCSVPVTPEIFG--GDPCPHVMKKLSVEVICS 838
>gi|20260596|gb|AAM13196.1| galactosidase, putative [Arabidopsis thaliana]
Length = 847
Score = 873 bits (2255), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 434/829 (52%), Positives = 555/829 (66%), Gaps = 20/829 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+S +V+YD RA+ I+GKRR+L SGSIHYPRSTPE+WP+LIRK+KEGGL+VI+TYVFWN H
Sbjct: 30 VSGSVSYDSRAITINGKRRILISGSIHYPRSTPEMWPDLIRKAKEGGLDVIQTYVFWNGH 89
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G+YYFEG +DLV+FVK VQ++GL+LHLRIGPY CAEWN+GGFPVWL +IPGI FRT
Sbjct: 90 EPSPGKYYFEGNYDLVKFVKLVQQSGLYLHLRIGPYVCAEWNFGGFPVWLKYIPGISFRT 149
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK +M+RF KI+++MK E LF SQGGPIIL+Q+ENEYG +E+ G G Y WA
Sbjct: 150 DNGPFKAQMQRFTTKIVNMMKAERLFESQGGPIILSQIENEYGPMEYELGAPGRSYTNWA 209
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L T VPWVMC+Q+DAPDPIIN CNGFYCD F+PN KP MWTE ++GWF FG
Sbjct: 210 AKMAVGLGTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKMWTEAWTGWFTKFG 269
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
VP+RP ED+AF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 270 GPVPYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 329
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
RQPKWGHL++LH+AIKLCE L+S +PT LG EAH+Y S C+AFLANY+
Sbjct: 330 LERQPKWGHLKDLHRAIKLCEPALVSGEPTRMPLGNYQEAHVYKSKSGACSAFLANYNPK 389
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V+F N Y LP WS+SILPDCKN V+NTA+V +Q ++ K V +
Sbjct: 390 SYAKVSFGNNHYNLPPWSISILPDCKNTVYNTARVGAQT-------SRMKMVRVPVHGGL 442
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
++ Y E + SF L EQINTT+DTSDYLWY + V + G L
Sbjct: 443 SWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWYMTDVKVDANEGFLRNGDLPTLT 502
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ S GHA +F+N +L YG+ D K + L G N + ILS+ VGL N G F
Sbjct: 503 VLSAGHAMHLFINGQLSGSAYGSLDSPKLTFRKGVNLRAGFNKIAILSIAVGLPNVGPHF 562
Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+ AG+ V L L G+RDLS +W Y+VG++GE + L +S ++S W +G+ +
Sbjct: 563 ETWNAGVLGPVSLNGLNGGRRDLSWQKWTYKVGLKGESLSLHSLSGSSSVEWAEGAFVAQ 622
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
+ L WYKTTF AP G PLA+++ SMGKGQ W+NGQS+GR+W AY A G +C Y
Sbjct: 623 KQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQSLGRHWPAYKA--VGSCSECSYT 680
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
G++ KC ++CG+ +Q YH+PR+W+ P NLLV+ EE GGDP+ I+L+ + +C+
Sbjct: 681 GTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVFEEWGGDPNGITLVRREVDSVCAD 740
Query: 715 VSEADPPPVDSWKPNLGVVSSS--PQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
+ E V+ G V+ P+ L C G I + FAS+G PEG CGS+R G+C
Sbjct: 741 IYEWQSTLVNYQLHASGKVNKPLHPKAHLQCGPGQKITTVKFASFGTPEGTCGSYRQGSC 800
Query: 773 HM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
H K CVGQ CS+ V+ G CP ++K LAVEA C+
Sbjct: 801 HAHHSYDAFNKLCVGQNWCSVTVAPEMFG--GDPCPNVMKKLAVEAVCA 847
>gi|359476858|ref|XP_002274449.2| PREDICTED: beta-galactosidase 3 [Vitis vinifera]
Length = 898
Score = 872 bits (2253), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/834 (51%), Positives = 559/834 (67%), Gaps = 30/834 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ +VTYD +A+VI+G+RR+L SGSIHYPRSTP++W ++I+K+K+GGL+V+ETYVFWN H
Sbjct: 77 IQCSVTYDRKAIVINGQRRILISGSIHYPRSTPDMWEDIIQKAKDGGLDVVETYVFWNVH 136
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G Y FEGR+DLVRF++TVQ+AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 137 EPSPGSYNFEGRYDLVRFIRTVQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 196
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M+ F KI+ LMK E LF SQGGPIIL+Q+ENEYG G G Y+ WA
Sbjct: 197 DNEPFKRAMQGFTEKIVGLMKSERLFESQGGPIILSQIENEYGVQSKLLGDAGHDYMTWA 256
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A+ AV L T VPWVMC++EDAPDP+INTCNGFYCD F+PN P KP +WTE +SGWF FG
Sbjct: 257 ANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDAFSPNKPYKPTIWTEAWSGWFNEFG 316
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ RPV+DLAFAVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 317 GPLHQRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 376
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+RQPK+GHL+ELH++IKLCE L+S+DP LG+ +AH+Y + DCAAFL+NYD+
Sbjct: 377 LVRQPKYGHLKELHRSIKLCERALVSADPIVSSLGSFQQAHVYSSDAGDCAAFLSNYDTK 436
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V FN Y LP WS+SILPDC+N VFNTAKV Q ++ L +
Sbjct: 437 SSARVMFNNMHYNLPPWSISILPDCRNAVFNTAKV----------GVQTAHMEMLPTNAE 486
Query: 421 AFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
SW Y+E + + + +F L EQIN T+D SDYLWY I + + G+
Sbjct: 487 MLSWESYDEDISSLDDSSTFTTLGLLEQINVTRDASDYLWYITRIDIGSSESFLRGGELP 546
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L +++ GHA VF+N +L +G ++ F +K+ L+ G NT+ +LS+ VGL N G
Sbjct: 547 TLILQTTGHAVHVFINGQLTGSAFGTREYRRFTFTEKVNLHAGTNTIALLSVAVGLPNVG 606
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
F+ G+ V L L GK DLS W Y+VG++GE + L + +S W QGS
Sbjct: 607 GHFETWNTGILGPVALHGLNQGKWDLSWQRWTYKVGLKGEAMNLVSPNGISSVDWMQGSL 666
Query: 592 LPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
+ L W+K F APEG PLAL++ MGKGQ W+NGQSIGRYW+AY + G +
Sbjct: 667 AAQRQQPLTWHKAFFNAPEGDEPLALDMEGMGKGQVWINGQSIGRYWTAY---ANGNCQG 723
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
C Y G+Y KCQ CGQP Q YH+PR+W+ P +NLLV+ EELGGDPS+ISL+ ++
Sbjct: 724 CSYSGTYRPPKCQLGCGQPTQRWYHVPRSWLKPTQNLLVVFEELGGDPSRISLVRRSMTS 783
Query: 711 ICSFVSEADPPPVDSWK-PNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
+C+ V E P + +W + G P+V L C G I++I FASYG P G CGSF
Sbjct: 784 VCADVFEYH-PNIKNWHIESYGKTEELHKPKVHLRCGPGQSISSIKFASYGTPLGTCGSF 842
Query: 768 RPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G CH D IV+K C+G+ C++ +S+ + CP +LK L+VEA C+
Sbjct: 843 EQGPCHAPDSYAIVEKRCIGRQRCAVTISNT--NFAQDPCPNVLKRLSVEAVCA 894
>gi|326512146|dbj|BAJ96054.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 847
Score = 872 bits (2253), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 436/834 (52%), Positives = 556/834 (66%), Gaps = 39/834 (4%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD +A++I+G+RR+L SGSIHYPRSTPE+W LI+K+K+GGL+VI+TYVFWN HEP
Sbjct: 32 VTYDRKAVLINGQRRILFSGSIHYPRSTPEMWEGLIQKAKDGGLDVIQTYVFWNGHEPTP 91
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G Y FEGR+DLV+F+KT Q+AGLF+HLRIGPY C EWN+GGFPVWL ++PGI FRT N P
Sbjct: 92 GSYNFEGRYDLVKFIKTAQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 151
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK M+ F KI+ +MK E LFASQGGPIIL+Q+ENEYG E +G G+ Y WAA A
Sbjct: 152 FKAAMQGFTEKIVGMMKSEELFASQGGPIILSQIENEYGPEEKEFGAAGKSYSDWAAKMA 211
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
V L+T VPWVMC+QEDAPDP+IN CNGFYCD FTPN+PSKP MWTE ++GWF FG +
Sbjct: 212 VGLDTGVPWVMCKQEDAPDPVINACNGFYCDAFTPNTPSKPTMWTEAWTGWFTEFGGTIR 271
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
RPVEDL+FAVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG R+
Sbjct: 272 KRPVEDLSFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 331
Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
PK+GHL+ELHKAIKLCE+ L+S DPT LG+ EAH+Y +S + CAAFLANY+S+S A
Sbjct: 332 PKYGHLKELHKAIKLCEQALVSVDPTVTSLGSMQEAHVY-RSPSGCAAFLANYNSNSHAK 390
Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
+ F+ Y LP WS+SILPDCK VV+NTA V Q + +S+ W
Sbjct: 391 IVFDNEHYSLPPWSISILPDCKTVVYNTATV----------GVQTSQMQMWSDGASSMMW 440
Query: 425 --YEEKVG-ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
Y+E+VG ++ L EQ+N T+DTSDYLWY S+ V P + GK + L +
Sbjct: 441 ERYDEEVGSLAAAPLLTTTGLLEQLNATRDTSDYLWYMTSVDVSPSEKSLQGGKPLSLTV 500
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
+S GHA +FVN +L G + ++L G N + +LS+ GL N G ++
Sbjct: 501 QSAGHALHIFVNGQLQGSASGTREDKRISYKGDVKLRAGTNKISLLSVACGLPNIGVHYE 560
Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
G+ V+L L G RDL+ W YQVG++GE + L+ + A+S W QGS + N
Sbjct: 561 TWNTGVNGPVVLHGLDEGSRDLTWQTWTYQVGLKGEQMNLNSLEGASSVEWMQGSLIAQN 620
Query: 596 K-SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
+ L WY+ F P G PLAL++ SMGKGQ W+NGQSIGRY AY +TG K C Y
Sbjct: 621 QMPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYSLAY---ATGDCKDCSYT 677
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
GS+ A KCQ CGQP Q YH+P++W+ P NLLV+ EELGGD SKISL+ ++ ++C+
Sbjct: 678 GSFRAIKCQAGCGQPTQRWYHVPKSWLQPTRNLLVVFEELGGDTSKISLVKRSVSNVCAD 737
Query: 715 VSEADPPPVDSW--------KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGS 766
VSE P + +W KP L +V L C G I+AI FAS+G P G CGS
Sbjct: 738 VSEFH-PSIKNWQTENSGEAKPEL----RRSKVHLRCAPGQSISAIKFASFGTPLGTCGS 792
Query: 767 FRPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
F G CH V + C+G+ C++ +S G CP ++K +AVEA CS
Sbjct: 793 FEQGQCHSTKSQTVLENCIGKQRCAVTISPDNFG--GDPCPNVMKRVAVEAVCS 844
>gi|356522482|ref|XP_003529875.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 845
Score = 872 bits (2252), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/836 (51%), Positives = 564/836 (67%), Gaps = 36/836 (4%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
SA+V+YDH+A+ I+G+RR+L SGSIHYPRSTPE+WP+LI+K+KEGGL+VI+TYVFWN HE
Sbjct: 29 SASVSYDHKAITINGQRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHE 88
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G+YYF G +DLVRF+K VQ+AGL+++LRIGPY CAEWN+GGFPVWL +IPGI FRT
Sbjct: 89 PSPGKYYFGGNYDLVRFIKLVQQAGLYVNLRIGPYVCAEWNFGGFPVWLKYIPGISFRTD 148
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK +M++F KI+D+MK E LF SQGGPIIL+Q+ENEYG +E+ G G Y +WAA
Sbjct: 149 NGPFKFQMEKFTKKIVDMMKAERLFESQGGPIILSQIENEYGPMEYEIGAPGRAYTQWAA 208
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV L T VPW+MC+QEDAPDPIINTCNGFYCD F+PN KP MWTE ++GWF FG
Sbjct: 209 HMAVGLGTGVPWIMCKQEDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGG 268
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
AVP RP EDLAF++ARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 269 AVPHRPAEDLAFSIARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGL 328
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
RQPKWGHL++LH+AIKLCE L+S DPT Q+LG EAH++ S CAAFLANY+ S
Sbjct: 329 PRQPKWGHLKDLHRAIKLCEPALVSGDPTVQQLGNYEEAHVFRSKSGACAAFLANYNPQS 388
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
A V F Y LP WS+SILP+CK+ V+NTA+V SQ K + +
Sbjct: 389 YATVAFGNQRYNLPPWSISILPNCKHTVYNTARVGSQSTT-------MKMTRVPIHGGLS 441
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
+ + E+ + + SF L EQIN T+D SDYLWY+ + + + GK L +
Sbjct: 442 WKAFNEETTTTDDSSFTVTGLLEQINATRDLSDYLWYSTDVVINSNEGFLRNGKNPVLTV 501
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
S GHA VF+N +L YG+ + ++ + L G+N + +LS+ VGL N G F+
Sbjct: 502 LSAGHALHVFINNQLSGTAYGSLEAPKLTFSESVRLRAGVNKISLLSVAVGLPNVGPHFE 561
Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
AG+ + L L G+RDL+ +W Y+VG++GE + L +S ++S W QG +
Sbjct: 562 RWNAGVLGPITLSGLNEGRRDLTWQKWSYKVGLKGEALNLHSLSGSSSVEWLQGFLVSRR 621
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
+ L WYKTTF AP G PLAL++ SMGKGQ W+NGQS+GRYW AY A +G C+Y G
Sbjct: 622 QPLTWYKTTFDAPAGVAPLALDMGSMGKGQVWINGQSLGRYWPAYKA--SGSCGYCNYAG 679
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
+Y+ KC +CGQ +Q YH+P +W+ P NLLV+ EELGGDP+ I L+ + +C+ +
Sbjct: 680 TYNEKKCGSNCGQASQRWYHVPHSWLKPTGNLLVVFEELGGDPNGIFLVRRDIDSVCADI 739
Query: 716 SEADPPPVDSWKPNL--------GVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCG 765
E W+PNL G V S P+ L+C G I++I FAS+G P G+CG
Sbjct: 740 YE--------WQPNLVSYDMQASGKVRSPVRPKAHLSCGPGQKISSIKFASFGTPVGSCG 791
Query: 766 SFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
++R G+CH QK CVGQ C++ VS G CP ++K L+VEA C+
Sbjct: 792 NYREGSCHAHKSYDAFQKNCVGQSWCTVTVSPEIFG--GDPCPSVMKKLSVEAICT 845
>gi|297735069|emb|CBI17431.3| unnamed protein product [Vitis vinifera]
Length = 845
Score = 871 bits (2251), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/834 (51%), Positives = 559/834 (67%), Gaps = 30/834 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ +VTYD +A+VI+G+RR+L SGSIHYPRSTP++W ++I+K+K+GGL+V+ETYVFWN H
Sbjct: 24 IQCSVTYDRKAIVINGQRRILISGSIHYPRSTPDMWEDIIQKAKDGGLDVVETYVFWNVH 83
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G Y FEGR+DLVRF++TVQ+AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 84 EPSPGSYNFEGRYDLVRFIRTVQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 143
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M+ F KI+ LMK E LF SQGGPIIL+Q+ENEYG G G Y+ WA
Sbjct: 144 DNEPFKRAMQGFTEKIVGLMKSERLFESQGGPIILSQIENEYGVQSKLLGDAGHDYMTWA 203
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A+ AV L T VPWVMC++EDAPDP+INTCNGFYCD F+PN P KP +WTE +SGWF FG
Sbjct: 204 ANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDAFSPNKPYKPTIWTEAWSGWFNEFG 263
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ RPV+DLAFAVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 264 GPLHQRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 323
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+RQPK+GHL+ELH++IKLCE L+S+DP LG+ +AH+Y + DCAAFL+NYD+
Sbjct: 324 LVRQPKYGHLKELHRSIKLCERALVSADPIVSSLGSFQQAHVYSSDAGDCAAFLSNYDTK 383
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V FN Y LP WS+SILPDC+N VFNTAKV Q ++ L +
Sbjct: 384 SSARVMFNNMHYNLPPWSISILPDCRNAVFNTAKV----------GVQTAHMEMLPTNAE 433
Query: 421 AFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
SW Y+E + + + +F L EQIN T+D SDYLWY I + + G+
Sbjct: 434 MLSWESYDEDISSLDDSSTFTTLGLLEQINVTRDASDYLWYITRIDIGSSESFLRGGELP 493
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L +++ GHA VF+N +L +G ++ F +K+ L+ G NT+ +LS+ VGL N G
Sbjct: 494 TLILQTTGHAVHVFINGQLTGSAFGTREYRRFTFTEKVNLHAGTNTIALLSVAVGLPNVG 553
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
F+ G+ V L L GK DLS W Y+VG++GE + L + +S W QGS
Sbjct: 554 GHFETWNTGILGPVALHGLNQGKWDLSWQRWTYKVGLKGEAMNLVSPNGISSVDWMQGSL 613
Query: 592 LPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
+ L W+K F APEG PLAL++ MGKGQ W+NGQSIGRYW+AY + G +
Sbjct: 614 AAQRQQPLTWHKAFFNAPEGDEPLALDMEGMGKGQVWINGQSIGRYWTAY---ANGNCQG 670
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
C Y G+Y KCQ CGQP Q YH+PR+W+ P +NLLV+ EELGGDPS+ISL+ ++
Sbjct: 671 CSYSGTYRPPKCQLGCGQPTQRWYHVPRSWLKPTQNLLVVFEELGGDPSRISLVRRSMTS 730
Query: 711 ICSFVSEADPPPVDSWK-PNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
+C+ V E P + +W + G P+V L C G I++I FASYG P G CGSF
Sbjct: 731 VCADVFEYH-PNIKNWHIESYGKTEELHKPKVHLRCGPGQSISSIKFASYGTPLGTCGSF 789
Query: 768 RPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G CH D IV+K C+G+ C++ +S+ + CP +LK L+VEA C+
Sbjct: 790 EQGPCHAPDSYAIVEKRCIGRQRCAVTISNT--NFAQDPCPNVLKRLSVEAVCA 841
>gi|326515822|dbj|BAK07157.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 847
Score = 871 bits (2250), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 436/834 (52%), Positives = 555/834 (66%), Gaps = 39/834 (4%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD +A++I+G+RR+L SGSIHYPRSTPE+W LI+K+K+GGL+VI+TYVFWN HEP
Sbjct: 32 VTYDRKAVLINGQRRILFSGSIHYPRSTPEMWEGLIQKAKDGGLDVIQTYVFWNGHEPTP 91
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G Y FEGR+DLV+F+KT Q+AGLF+HLRIGPY C EWN+GGFPVWL ++PGI FRT N P
Sbjct: 92 GSYNFEGRYDLVKFIKTAQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 151
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK M+ F KI+ +MK E LFASQGGPIIL+Q+ENEYG E +G G+ Y WAA A
Sbjct: 152 FKAAMQGFTEKIVGMMKSEELFASQGGPIILSQIENEYGPEEKEFGAAGKSYSDWAAKMA 211
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
V L+T VPWVMC+QEDAPDP+IN CNGFYCD FTPN+PSKP MWTE ++GWF FG +
Sbjct: 212 VGLDTGVPWVMCKQEDAPDPVINACNGFYCDAFTPNTPSKPTMWTEAWTGWFTEFGGTIR 271
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
RPVEDL+FAVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG R+
Sbjct: 272 KRPVEDLSFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 331
Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
PK+GHL+ELHKAIKLCE+ L+S DPT LG+ EAH+Y +S + CAAFLANY+S+S A
Sbjct: 332 PKYGHLKELHKAIKLCEQALVSVDPTVTSLGSMQEAHVY-RSPSGCAAFLANYNSNSHAK 390
Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
+ F+ Y LP WS+SILPDCK VV+NTA V Q + +S+ W
Sbjct: 391 IVFDNEHYSLPPWSISILPDCKTVVYNTATV----------GVQTSQMQMWSDGASSMMW 440
Query: 425 --YEEKVG-ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
Y+E+VG ++ L EQ+N T+DTSDYLWY S+ V P + GK + L +
Sbjct: 441 ERYDEEVGSLAAAPLLTTTGLLEQLNATRDTSDYLWYMTSVDVSPSEKSLQGGKPLSLTV 500
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
+S GHA +FVN +L G + ++L G N + +LS+ GL N G ++
Sbjct: 501 QSAGHALHIFVNGQLQGSASGTREDKRISYKGDVKLRAGTNKISLLSVACGLPNIGVHYE 560
Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
G+ V+L L G RDL+ W YQVG++GE + L+ + A+S W QGS + N
Sbjct: 561 TWNTGVNGPVVLHGLDEGSRDLTWQTWTYQVGLKGEQMNLNSLEGASSVEWMQGSLIAQN 620
Query: 596 K-SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
+ L WY+ F P G PLAL++ SMGKGQ W+NGQSIGRY AY +TG K C Y
Sbjct: 621 QMPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYSLAY---ATGDCKDCSYT 677
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
GS+ A KCQ CGQP Q YH+P+ W+ P NLLV+ EELGGD SKISL+ ++ ++C+
Sbjct: 678 GSFRAIKCQAGCGQPTQRWYHVPKPWLQPTRNLLVVFEELGGDTSKISLVKRSVSNVCAD 737
Query: 715 VSEADPPPVDSW--------KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGS 766
VSE P + +W KP L +V L C G I+AI FAS+G P G CGS
Sbjct: 738 VSEFH-PSIKNWQTENSGEAKPEL----RRSKVHLRCAPGQSISAIKFASFGTPLGTCGS 792
Query: 767 FRPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
F G CH V + C+G+ C++ +S G CP ++K +AVEA CS
Sbjct: 793 FEQGQCHSTKSQTVLENCIGKQRCAVTISPDNFG--GDPCPNVMKRVAVEAVCS 844
>gi|15081596|gb|AAK81874.1| putative beta-galactosidase BG1 [Vitis vinifera]
Length = 854
Score = 870 bits (2249), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/834 (51%), Positives = 557/834 (66%), Gaps = 30/834 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ +VTYD +A+VI+G+RR+L SGSIHYPRSTP++W +LIRK+K+GGL+VI+TY+FWN H
Sbjct: 25 IQCSVTYDKKAIVINGQRRILISGSIHYPRSTPDMWEDLIRKAKDGGLDVIDTYIFWNVH 84
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G Y FEGR+DLVRF+KTVQ+ GL++HLRIGPY CAEWN+GGFPVWL F+PGI FRT
Sbjct: 85 EPSPGNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGFPVWLKFVPGISFRT 144
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M+ F KI+ +MK ENLFASQGGPIIL+Q+ENEYG G G Y+ WA
Sbjct: 145 NNEPFKMAMQGFTQKIVHMMKSENLFASQGGPIILSQIENEYGPESRELGAAGHAYINWA 204
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L+T VPWVMC+++DAPDP+IN CNGFYCD F+PN P KP +WTE +SGWF FG
Sbjct: 205 AKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDAFSPNKPYKPRIWTEAWSGWFTEFG 264
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ RPV+DLAF VARF + GG+F NYYMY GGTNFGR+AGGP + TSYDYDAPIDEYG
Sbjct: 265 GTIHRRPVQDLAFGVARFIQNGGSFVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYG 324
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IRQPK+GHL+ELHKAIKLCE ++S+DPT LG+ +AH++ +CAAFL+NY+
Sbjct: 325 LIRQPKYGHLKELHKAIKLCEHAVVSADPTVISLGSYQQAHVFSSGRGNCAAFLSNYNPK 384
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V FN Y LPAWS+SILPDC+ VVFNTA+V Q ++ S
Sbjct: 385 SSARVIFNNVHYDLPAWSISILPDCRTVVFNTARV----------GVQTSHMRMFPTNSK 434
Query: 421 AFSW--YEEKVGISGNR-SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
SW Y E + G+ + L EQIN T+D++DYLWY S+++ + G+
Sbjct: 435 LHSWETYGEDISSLGSSGTMTAGGLLEQINITRDSTDYLWYMTSVNIDSSESFLRRGQTP 494
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L ++S GHA VF+N + YG + F L+ G N + +LS+ VGL N G
Sbjct: 495 TLTVQSKGHAVHVFINGQYSGSAYGTRENRKFTYTGAANLHAGTNRIALLSIAVGLPNVG 554
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
F+ G+ V+L + GKRDLS +W YQVG++GE + L + ++ W +GS
Sbjct: 555 LHFETWKTGILGPVLLHGIDQGKRDLSWQKWSYQVGLKGEAMNLVSPNGVSAVEWVRGSL 614
Query: 592 LPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
+ L WYK F APEG PLAL++ SMGKGQ W+NGQSIGRYW AY + G
Sbjct: 615 AAQGQQPLKWYKAYFNAPEGDEPLALDMRSMGKGQVWINGQSIGRYWMAY---AKGDCNV 671
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
C Y G+Y KCQ CG P Q YH+PR+W+ P +NLL+I EELGGD SKI+L+ + +
Sbjct: 672 CSYSGTYRPPKCQHGCGHPTQRWYHVPRSWLKPTQNLLIIFEELGGDASKIALMKRAMKS 731
Query: 711 ICSFVSEADPPPVDSW---KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
+C+ +E P +++W P+ V L C G I+ I FAS+G P G CGSF
Sbjct: 732 VCADANEHH-PTLENWHTESPSESEELHQASVHLQCAPGQSISTIMFASFGTPSGTCGSF 790
Query: 768 RPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ G CH + I++K C+GQ +CS+P+S++Y G A CP +LK L+VEA CS
Sbjct: 791 QKGTCHAPNSQAILEKNCIGQEKCSVPISNSYFG--ADPCPNVLKRLSVEAACS 842
>gi|147818153|emb|CAN78072.1| hypothetical protein VITISV_013292 [Vitis vinifera]
Length = 854
Score = 870 bits (2249), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/834 (51%), Positives = 557/834 (66%), Gaps = 30/834 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ +VTYD +A+VI+G+RR+L SGSIHYPRSTP++W +LIRK+K+GGL+VI+TY+FWN H
Sbjct: 25 IQCSVTYDKKAIVINGQRRILISGSIHYPRSTPDMWEDLIRKAKDGGLDVIDTYIFWNVH 84
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G Y FEGR+DLVRF+KTVQ+ GL++HLRIGPY CAEWN+GGFPVWL F+PGI FRT
Sbjct: 85 EPSPGNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGFPVWLKFVPGISFRT 144
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M+ F KI+ +MK ENLFASQGGPIIL+Q+ENEYG G G Y+ WA
Sbjct: 145 NNEPFKMAMQGFTQKIVHMMKSENLFASQGGPIILSQIENEYGPESRELGAAGHAYINWA 204
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L+T VPWVMC+++DAPDP+IN CNGFYCD F+PN P KP +WTE +SGWF FG
Sbjct: 205 AKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDAFSPNKPYKPRIWTEAWSGWFTEFG 264
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ RPV+DLAF VARF + GG+F NYYMY GGTNFGR+AGGP + TSYDYDAPIDEYG
Sbjct: 265 GTIHRRPVQDLAFGVARFIQNGGSFVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYG 324
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IRQPK+GHL+ELHKAIKLCE ++S+DPT LG+ +AH++ +CAAFL+NY+
Sbjct: 325 LIRQPKYGHLKELHKAIKLCEHAVVSADPTVISLGSYQQAHVFSSGRGNCAAFLSNYNPK 384
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V FN Y LPAWS+SILPDC+ VVFNTA+V Q ++ S
Sbjct: 385 SSARVIFNNVHYDLPAWSISILPDCRTVVFNTARV----------GVQTSHMRMFPTNSK 434
Query: 421 AFSW--YEEKVGISGNR-SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
SW Y E + G+ + L EQIN T+D++DYLWY S+++ + G+
Sbjct: 435 LHSWETYGEDISSLGSSGTMTAGGLLEQINITRDSTDYLWYMTSVNIDSSESFLRRGQTP 494
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L ++S GHA VF+N + YG + F L+ G N + +LS+ VGL N G
Sbjct: 495 TLTVQSKGHAVHVFINGQYSGSAYGTRENRKFTYTGAANLHAGTNRIALLSIAVGLPNVG 554
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
F+ G+ V+L + GKRDLS +W YQVG++GE + L + ++ W +GS
Sbjct: 555 LHFETWKTGILGPVLLHGIDQGKRDLSWQKWSYQVGLKGEAMNLVSPNGVSAVEWVRGSL 614
Query: 592 LPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
+ L WYK F APEG PLAL++ SMGKGQ W+NGQSIGRYW AY + G
Sbjct: 615 AAQGQQPLKWYKAYFNAPEGDEPLALDMRSMGKGQVWINGQSIGRYWMAY---AKGDCNV 671
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
C Y G+Y KCQ CG P Q YH+PR+W+ P +NLL+I EELGGD SKI+L+ + +
Sbjct: 672 CSYSGTYRPPKCQHGCGHPTQRWYHVPRSWLKPTQNLLIIFEELGGDASKIALMKRAMKS 731
Query: 711 ICSFVSEADPPPVDSW---KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
+C+ +E P +++W P+ V L C G I+ I FAS+G P G CGSF
Sbjct: 732 VCADANEHH-PTLENWHTESPSESEELHZASVHLQCAPGQSISTIMFASFGTPSGTCGSF 790
Query: 768 RPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ G CH + I++K C+GQ +CS+P+S++Y G A CP +LK L+VEA CS
Sbjct: 791 QKGTCHAPNSQAILEKNCIGQEKCSVPISNSYFG--ADPCPNVLKRLSVEAACS 842
>gi|225458151|ref|XP_002280715.1| PREDICTED: beta-galactosidase 3 [Vitis vinifera]
gi|302142564|emb|CBI19767.3| unnamed protein product [Vitis vinifera]
Length = 854
Score = 870 bits (2249), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/834 (51%), Positives = 557/834 (66%), Gaps = 30/834 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ +VTYD +A+VI+G+RR+L SGSIHYPRSTP++W +LIRK+K+GGL+VI+TY+FWN H
Sbjct: 25 IQCSVTYDKKAIVINGQRRILISGSIHYPRSTPDMWEDLIRKAKDGGLDVIDTYIFWNVH 84
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G Y FEGR+DLVRF+KTVQ+ GL++HLRIGPY CAEWN+GGFPVWL F+PGI FRT
Sbjct: 85 EPSPGNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGFPVWLKFVPGISFRT 144
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M+ F KI+ +MK ENLFASQGGPIIL+Q+ENEYG G G Y+ WA
Sbjct: 145 NNEPFKMAMQGFTQKIVHMMKSENLFASQGGPIILSQIENEYGPESRELGAAGHAYINWA 204
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L+T VPWVMC+++DAPDP+IN CNGFYCD F+PN P KP +WTE +SGWF FG
Sbjct: 205 AKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDAFSPNKPYKPRIWTEAWSGWFTEFG 264
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ RPV+DLAF VARF + GG+F NYYMY GGTNFGR+AGGP + TSYDYDAPIDEYG
Sbjct: 265 GTIHRRPVQDLAFGVARFIQNGGSFVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYG 324
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IRQPK+GHL+ELHKAIKLCE ++S+DPT LG+ +AH++ +CAAFL+NY+
Sbjct: 325 LIRQPKYGHLKELHKAIKLCEHAVVSADPTVISLGSYQQAHVFSSGRGNCAAFLSNYNPK 384
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V FN Y LPAWS+SILPDC+ VVFNTA+V Q ++ S
Sbjct: 385 SSARVIFNNVHYDLPAWSISILPDCRTVVFNTARV----------GVQTSHMRMFPTNSK 434
Query: 421 AFSW--YEEKVGISGNR-SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
SW Y E + G+ + L EQIN T+D++DYLWY S+++ + G+
Sbjct: 435 LHSWETYGEDISSLGSSGTMTAGGLLEQINITRDSTDYLWYMTSVNIDSSESFLRRGQTP 494
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L ++S GHA VF+N + YG + F L+ G N + +LS+ VGL N G
Sbjct: 495 TLTVQSKGHAVHVFINGQYSGSAYGTRENRKFTYTGAANLHAGTNRIALLSIAVGLPNVG 554
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
F+ G+ V+L + GKRDLS +W YQVG++GE + L + ++ W +GS
Sbjct: 555 LHFETWKTGILGPVLLHGIDQGKRDLSWQKWSYQVGLKGEAMNLVSPNGVSAVEWVRGSL 614
Query: 592 LPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
+ L WYK F APEG PLAL++ SMGKGQ W+NGQSIGRYW AY + G
Sbjct: 615 AAQGQQPLKWYKAYFNAPEGDEPLALDMRSMGKGQVWINGQSIGRYWMAY---AKGDCNV 671
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
C Y G+Y KCQ CG P Q YH+PR+W+ P +NLL+I EELGGD SKI+L+ + +
Sbjct: 672 CSYSGTYRPPKCQHGCGHPTQRWYHVPRSWLKPTQNLLIIFEELGGDASKIALMKRAMKS 731
Query: 711 ICSFVSEADPPPVDSW---KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
+C+ +E P +++W P+ V L C G I+ I FAS+G P G CGSF
Sbjct: 732 VCADANEHH-PTLENWHTESPSESEELHEASVHLQCAPGQSISTIMFASFGTPSGTCGSF 790
Query: 768 RPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ G CH + I++K C+GQ +CS+P+S++Y G A CP +LK L+VEA CS
Sbjct: 791 QKGTCHAPNSQAILEKNCIGQEKCSVPISNSYFG--ADPCPNVLKRLSVEAACS 842
>gi|18419821|ref|NP_568001.1| beta-galactosidase 3 [Arabidopsis thaliana]
gi|75202767|sp|Q9SCV9.1|BGAL3_ARATH RecName: Full=Beta-galactosidase 3; Short=Lactase 3; Flags:
Precursor
gi|6686878|emb|CAB64739.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|15810493|gb|AAL07134.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|20259271|gb|AAM14371.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332661246|gb|AEE86646.1| beta-galactosidase 3 [Arabidopsis thaliana]
Length = 856
Score = 870 bits (2248), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 434/831 (52%), Positives = 554/831 (66%), Gaps = 24/831 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ VTYD +AL+I+G+RR+L SGSIHYPRSTP++W +LI+K+K+GG++VIETYVFWN H
Sbjct: 29 VQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGIDVIETYVFWNLH 88
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G+Y FEGR DLVRFVKT+ +AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 89 EPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 148
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK MK F +I++LMK ENLF SQGGPIIL+Q+ENEYG G G Y+ WA
Sbjct: 149 DNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQLLGAEGHNYMTWA 208
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A+ T VPWVMC+++DAPDP+INTCNGFYCD F PN P KP++WTE +SGWF FG
Sbjct: 209 AKMAIATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNKPYKPLIWTEAWSGWFTEFG 268
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ RPV+DLAF VARF + GG+F NYYMY GGTNFGRTAGGP V TSYDYDAPIDEYG
Sbjct: 269 GPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYG 328
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IRQPK+GHL+ELH+AIK+CE+ L+S+DP +G K +AH+Y S DC+AFLANYD+
Sbjct: 329 LIRQPKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQQAHVYSAESGDCSAFLANYDTE 388
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V FN Y LP WS+SILPDC+N VFNTAKV Q + + KN
Sbjct: 389 SAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQTSQMEMLPTDTKNFQWE----- 443
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
S+ E+ + + +F L EQIN T+DTSDYLWY S+ + + G+ L
Sbjct: 444 --SYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSVDIGDSESFLHGGELPTLI 501
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
I+S GHA +FVN +L +G F KI L+ G N + +LS+ VGL N G F
Sbjct: 502 IQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSVAVGLPNVGGHF 561
Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS-TLP 593
+ G+ V L L GK DLS +W YQVG++GE + L + S W S T+
Sbjct: 562 ESWNTGILGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMNLAFPTNTPSIGWMDASLTVQ 621
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ L W+KT F APEG PLAL++ MGKGQ WVNG+SIGRYW+A+ +TG C Y
Sbjct: 622 KPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAF---ATGDCSHCSY 678
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
G+Y +KCQ CGQP Q YH+PR W+ P +NLLVI EELGG+PS +SL+ ++ +C+
Sbjct: 679 TGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSGVCA 738
Query: 714 FVSEADPPPVDSWKPN---LGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
VSE P + +W+ G P+V L C G IA+I FAS+G P G CGS++ G
Sbjct: 739 EVSEYH-PNIKNWQIESYGKGQTFHRPKVHLKCSPGQAIASIKFASFGTPLGTCGSYQQG 797
Query: 771 ACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CH I+++ CVG+ C++ +S++ G CP +LK L VEA C+
Sbjct: 798 ECHAATSYAILERKCVGKARCAVTISNSNFG--KDPCPNVLKRLTVEAVCA 846
>gi|4006924|emb|CAB16852.1| beta-galactosidase like protein [Arabidopsis thaliana]
gi|7270584|emb|CAB80302.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 853
Score = 870 bits (2248), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 435/834 (52%), Positives = 554/834 (66%), Gaps = 30/834 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ VTYD +AL+I+G+RR+L SGSIHYPRSTP++W +LI+K+K+GG++VIETYVFWN H
Sbjct: 26 VQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGIDVIETYVFWNLH 85
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G+Y FEGR DLVRFVKT+ +AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 86 EPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 145
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK MK F +I++LMK ENLF SQGGPIIL+Q+ENEYG G G Y+ WA
Sbjct: 146 DNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQLLGAEGHNYMTWA 205
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A+ T VPWVMC+++DAPDP+INTCNGFYCD F PN P KP++WTE +SGWF FG
Sbjct: 206 AKMAIATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNKPYKPLIWTEAWSGWFTEFG 265
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ RPV+DLAF VARF + GG+F NYYMY GGTNFGRTAGGP V TSYDYDAPIDEYG
Sbjct: 266 GPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYG 325
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IRQPK+GHL+ELH+AIK+CE+ L+S+DP +G K +AH+Y S DC+AFLANYD+
Sbjct: 326 LIRQPKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQQAHVYSAESGDCSAFLANYDTE 385
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V FN Y LP WS+SILPDC+N VFNTAKV Q + + KN
Sbjct: 386 SAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQTSQMEMLPTDTKN--------- 436
Query: 421 AFSW---YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
F W E+ + + +F L EQIN T+DTSDYLWY S+ + + G+
Sbjct: 437 -FQWESYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSVDIGDSESFLHGGELP 495
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L I+S GHA +FVN +L +G F KI L+ G N + +LS+ VGL N G
Sbjct: 496 TLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSVAVGLPNVG 555
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS- 590
F+ G+ V L L GK DLS +W YQVG++GE + L + S W S
Sbjct: 556 GHFESWNTGILGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMNLAFPTNTPSIGWMDASL 615
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
T+ + L W+KT F APEG PLAL++ MGKGQ WVNG+SIGRYW+A+ +TG
Sbjct: 616 TVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAF---ATGDCSH 672
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
C Y G+Y +KCQ CGQP Q YH+PR W+ P +NLLVI EELGG+PS +SL+ ++
Sbjct: 673 CSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSG 732
Query: 711 ICSFVSEADPPPVDSWKPN---LGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
+C+ VSE P + +W+ G P+V L C G IA+I FAS+G P G CGS+
Sbjct: 733 VCAEVSEYH-PNIKNWQIESYGKGQTFHRPKVHLKCSPGQAIASIKFASFGTPLGTCGSY 791
Query: 768 RPGACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ G CH I+++ CVG+ C++ +S++ G CP +LK L VEA C+
Sbjct: 792 QQGECHAATSYAILERKCVGKARCAVTISNSNFG--KDPCPNVLKRLTVEAVCA 843
>gi|30690633|ref|NP_849506.1| beta-galactosidase 3 [Arabidopsis thaliana]
gi|332661247|gb|AEE86647.1| beta-galactosidase 3 [Arabidopsis thaliana]
Length = 855
Score = 870 bits (2247), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/830 (52%), Positives = 552/830 (66%), Gaps = 23/830 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ VTYD +AL+I+G+RR+L SGSIHYPRSTP++W +LI+K+K+GG++VIETYVFWN H
Sbjct: 29 VQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGIDVIETYVFWNLH 88
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G+Y FEGR DLVRFVKT+ +AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 89 EPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 148
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK MK F +I++LMK ENLF SQGGPIIL+Q+ENEYG G G Y+ WA
Sbjct: 149 DNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQLLGAEGHNYMTWA 208
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A+ T VPWVMC+++DAPDP+INTCNGFYCD F PN P KP++WTE +SGWF FG
Sbjct: 209 AKMAIATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNKPYKPLIWTEAWSGWFTEFG 268
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ RPV+DLAF VARF + GG+F NYYMY GGTNFGRTAGGP V TSYDYDAPIDEYG
Sbjct: 269 GPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYG 328
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IRQPK+GHL+ELH+AIK+CE+ L+S+DP +G K +AH+Y S DC+AFLANYD+
Sbjct: 329 LIRQPKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQQAHVYSAESGDCSAFLANYDTE 388
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V FN Y LP WS+SILPDC+N VFNTAKV Q + + KN
Sbjct: 389 SAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQTSQMEMLPTDTKNFQWE----- 443
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
S+ E+ + + +F L EQIN T+DTSDYLWY S+ + + G+ L
Sbjct: 444 --SYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSVDIGDSESFLHGGELPTLI 501
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
I+S GHA +FVN +L +G F KI L+ G N + +LS+ VGL N G F
Sbjct: 502 IQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSVAVGLPNVGGHF 561
Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS-TLP 593
+ G+ V L L GK DLS +W YQVG++GE + L + S W S T+
Sbjct: 562 ESWNTGILGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMNLAFPTNTPSIGWMDASLTVQ 621
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ L W+KT F APEG PLAL++ MGKGQ WVNG+SIGRYW+A+ +TG C Y
Sbjct: 622 KPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAF---ATGDCSHCSY 678
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
G+Y +KCQ CGQP Q YH+PR W+ P +NLLVI EELGG+PS +SL+ ++ +C+
Sbjct: 679 TGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSGVCA 738
Query: 714 FVSEADPPPVDSWKPN---LGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
VSE P + +W+ G P+V L C G IA+I FAS+G P G CGS++ G
Sbjct: 739 EVSEYH-PNIKNWQIESYGKGQTFHRPKVHLKCSPGQAIASIKFASFGTPLGTCGSYQQG 797
Query: 771 ACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CH + + CVG+ C++ +S++ G CP +LK L VEA C+
Sbjct: 798 ECHAATSYAILERCVGKARCAVTISNSNFG--KDPCPNVLKRLTVEAVCA 845
>gi|326534200|dbj|BAJ89450.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 763
Score = 869 bits (2246), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/768 (55%), Positives = 532/768 (69%), Gaps = 18/768 (2%)
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
QY FEGR DLVRFVK +AGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+ RT N PF
Sbjct: 1 QYDFEGRNDLVRFVKAAADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKLRTDNEPF 60
Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
K EM+RF K++ MK L+ASQGGPIIL+Q+ENEYGN+ +YG G+ Y++WAA AV
Sbjct: 61 KTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIAASYGAAGKSYIRWAAGMAV 120
Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
L+T VPWVMCQQ DAP+P+INTCNGFYCD FTP+ PS+P +WTEN+SGWFLSFG AVP+
Sbjct: 121 ALDTGVPWVMCQQTDAPEPLINTCNGFYCDQFTPSLPSRPKLWTENWSGWFLSFGGAVPY 180
Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQP 305
RP EDLAFAVARF++ GGT QNYYMY GGTNFGR++GGP ++TSYDYDAPIDEYG +RQP
Sbjct: 181 RPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDEYGLVRQP 240
Query: 306 KWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANV 365
KWGHLR++HKAIK+CE LI++DP++ LG EAH+Y KS + CAAFLAN D SD V
Sbjct: 241 KWGHLRDVHKAIKMCEPALIATDPSYMSLGQNAEAHVY-KSGSLCAAFLANIDDQSDKTV 299
Query: 366 TFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQ------RNNGDHPFAQQKNVNELLLAS 419
TFNG Y LPAWSVSILPDCKNVV NTA++ SQ RN G A + E LA+
Sbjct: 300 TFNGKAYKLPAWSVSILPDCKNVVLNTAQINSQVASTQMRNLGFSTQASDGSSVEAELAA 359
Query: 420 SAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ----GKEVFLN 475
S++S+ E VGI+ + +P L EQINTT D SD+LWY+ SI V G+ G + L
Sbjct: 360 SSWSYAVEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVAGGEPYLNGSQSNLP 419
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ SLGH VF+N KL G+ + + + L G N +D+LS VGL NYGA+F
Sbjct: 420 VNSLGHVLQVFINGKLAGSSKGSASSSLISLTTPVTLVTGKNKIDLLSATVGLTNYGAFF 479
Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
D+ GAG+ + + G DLSS EW YQ+G+ GE + L S A S W ++ P N
Sbjct: 480 DLVGAGITGPVKLTGPKGTLDLSSAEWTYQIGLRGEDLHLYNPSEA-SPEWVSDNSYPTN 538
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
L WYK+ F AP G P+A++ MGKG+AWVNGQSIGRYW +AP + C C+YRG
Sbjct: 539 NPLTWYKSKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNIAPQSDCVNSCNYRG 598
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
SY A+KC K CGQP+Q LYH+PR+++ PG N +V+ E+ GG+PSKIS TK + +C+ V
Sbjct: 599 SYSATKCLKKCGQPSQILYHVPRSFLQPGSNDIVLFEQFGGNPSKISFTTKQTESVCAHV 658
Query: 716 SEADPPPVDSW-KPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCGSFRPGAC- 772
SE P +DSW + S P +RL C + G I++I FAS+G P G CGS+ G C
Sbjct: 659 SEDHPDQIDSWVSSQQKLQRSGPALRLECPKEGQVISSIKFASFGTPSGTCGSYSHGECS 718
Query: 773 HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
L + Q+ACVG CS+PVS+ G C G+ K+L VEA CS
Sbjct: 719 SSQALAVAQEACVGVSSCSVPVSAKNFG---DPCRGVTKSLVVEAACS 763
>gi|34148077|gb|AAQ62586.1| putative beta-galactosidase [Glycine max]
Length = 909
Score = 869 bits (2246), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 436/857 (50%), Positives = 579/857 (67%), Gaps = 48/857 (5%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NV+YDHRAL+++GKRR L S IHYPR+TPE+WP+LI KSKEGG +VIETYVFWN HEP+
Sbjct: 46 NVSYDHRALILNGKRRFLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNGHEPV 105
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
RGQY FEGR+DLV+FV+ GL+ LRIGPYACAEWN+GGFPVWL IPGI+FRT N
Sbjct: 106 RGQYNFEGRYDLVKFVRLAASHGLYFFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTNNA 165
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFKEEMKRF++K+++LM++E LF+ QGGPIIL Q+ENEYGN+E +YG GG+ Y+KWAA
Sbjct: 166 PFKEEMKRFVSKVVNLMREERLFSWQGGPIILLQIENEYGNIENSYGKGGKEYMKWAAKM 225
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A++L VPWVMC+Q+DAP II+TCN +YCDGF PNS +KP MWTEN+ GW+ +G +
Sbjct: 226 ALSLGAGVPWVMCRQQDAPYDIIDTCNAYYCDGFKPNSHNKPTMWTENWDGWYTQWGERL 285
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RPVEDLAFAVARFF+ GG+FQNYYMYFGGTNFGRTAGGPL TSYDYDAPIDEYG +R
Sbjct: 286 PHRPVEDLAFAVARFFQRGGSFQNYYMYFGGTNFGRTAGGPLQITSYDYDAPIDEYGLLR 345
Query: 304 QPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYH-------------KSSND 349
+PKWGHL++LH A+KLCE L+++D PT+ KLG K EAH+Y +SS+
Sbjct: 346 EPKWGHLKDLHAALKLCEPALVATDSPTYIKLGPKQEAHVYQANVHLEGLNLSMFESSSI 405
Query: 350 CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN--------- 400
C+AFLAN D +A VTF G Y +P WSVS+LPDC+N VFNTAKV +Q +
Sbjct: 406 CSAFLANIDEWKEATVTFRGQRYTIPPWSVSVLPDCRNTVFNTAKVRAQTSVKLVESYLP 465
Query: 401 --NGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWY 458
+ P Q ++ N+ S ++ +E + I SF + E +N TKD SDYLWY
Sbjct: 466 TVSNIFPAQQLRHQNDFYYISKSWMTTKEPLNIWSKSSFTVEGIWEHLNVTKDQSDYLWY 525
Query: 459 TASIHVMPG-----QGKEVF--LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIE 511
+ ++V + +V L I+ + VF+N +L+ GN + + ++
Sbjct: 526 STRVYVSDSDILFWEENDVHPKLTIDGVRDILRVFINGQLI----GNVVGHWIKVVQTLQ 581
Query: 512 LNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEG 570
G N L +L+ VGLQNYGA+ + GAG+ I I +NG DLS W YQVG++G
Sbjct: 582 FLPGYNDLTLLTQTVGLQNYGAFLEKDGAGIRGKIKITGFENGDIDLSKSLWTYQVGLQG 641
Query: 571 EYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNG 630
E++ NS W + + + + WYKT F P G P+AL+ SMGKGQAWVNG
Sbjct: 642 EFLKFYSEENENSE-WVELTPDAIPSTFTWYKTYFDVPGGIDPVALDFKSMGKGQAWVNG 700
Query: 631 QSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI 690
Q IGRYW+ ++P +GC + CDYRG+Y++ KC +CG+P QTLYH+PR+W+ NLLVI
Sbjct: 701 QHIGRYWTR-VSPKSGCQQVCDYRGAYNSDKCSTNCGKPTQTLYHVPRSWLKATNNLLVI 759
Query: 691 HEELGGDPSKISLLTKTGQHICSFVSEADPPPV------DSWKPNLGVVSSSPQVRLACE 744
EE GG+P +IS+ + + IC+ VSE++ PP+ D + + P++ L C+
Sbjct: 760 LEETGGNPFEISVKLHSSRIICAQVSESNYPPLQKLVNADLIGEEVSANNMIPELHLHCQ 819
Query: 745 RGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSA 803
+G I+++ FAS+G P G+C +F G CH + IV +AC G+ CSI +S + GV
Sbjct: 820 QGHTISSVAFASFGTPGGSCQNFSRGNCHAPSSMSIVSEACQGKRSCSIKISDSAFGVD- 878
Query: 804 GACPGLLKALAVEAHCS 820
CPG++K L+VEA C+
Sbjct: 879 -PCPGVVKTLSVEARCT 894
>gi|224094887|ref|XP_002310279.1| predicted protein [Populus trichocarpa]
gi|222853182|gb|EEE90729.1| predicted protein [Populus trichocarpa]
Length = 847
Score = 869 bits (2246), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/834 (51%), Positives = 559/834 (67%), Gaps = 31/834 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ +VTYD +A++I+G+RR+L SGSIHYPRSTP++W +LI+K+K+GG++VIETYVFWN H
Sbjct: 25 IQCSVTYDRKAIMINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGIDVIETYVFWNVH 84
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G Y+FEGR+D+VRF+KT+Q AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 85 EPTPGNYHFEGRYDIVRFMKTIQRAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 144
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M+ F KI+ LMK ENLF SQGGPIIL+Q+ENEYG +G G Y+ WA
Sbjct: 145 DNEPFKRAMQGFTEKIVGLMKAENLFESQGGPIILSQIENEYGVQSKLFGAAGYNYMTWA 204
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A+ A+ T VPWVMC+++DAPDP+INTCNGFYCD F PN P KP +WTE +SGWF FG
Sbjct: 205 ANMAIQTGTGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNKPYKPTIWTEAWSGWFSEFG 264
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ RPV+DLAFAVA+F + GG+F NYYM+ GGTNFGR+AGGP + TSYDYDAPIDEYG
Sbjct: 265 GTIHQRPVQDLAFAVAKFIQKGGSFINYYMFHGGTNFGRSAGGPFITTSYDYDAPIDEYG 324
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IRQPK+GHL+ELH++IK+CE L+S DP +LG + H+Y S DCAAFLANYD+
Sbjct: 325 LIRQPKYGHLKELHRSIKMCERALVSVDPIVTQLGTYQQVHVYSTESGDCAAFLANYDTK 384
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V FN Y LP WS+SILPDC+NVVFNTAKV Q + E+L +
Sbjct: 385 SAARVLFNNMHYNLPPWSISILPDCRNVVFNTAKV-----------GVQTSQMEMLPTNG 433
Query: 421 AFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
FSW Y+E + + + +F L EQIN T+D SDYLWY S+ + + G+
Sbjct: 434 IFSWESYDEDISSLDDSSTFTTAGLLEQINVTRDASDYLWYMTSVDIGSSESFLHGGELP 493
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L I+S GHA +F+N +L +G + F K+ L G N + +LS+ VGL N G
Sbjct: 494 TLIIQSTGHAVHIFINGQLSGSAFGTRENRRFTYTGKVNLRPGTNRIALLSVAVGLPNVG 553
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
++ G+ V L L GK DLS +W YQVG++GE + L S W Q S
Sbjct: 554 GHYESWNTGILGPVALHGLDQGKWDLSWQKWTYQVGLKGEAMNLLSPDSVTSVEWMQSSL 613
Query: 592 LPVN-KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
+ L W+K F APEG PLAL++ MGKGQ W+NGQSIGRYW+AY ++G
Sbjct: 614 AAQRPQPLTWHKAYFNAPEGDEPLALDMEGMGKGQIWINGQSIGRYWTAY---ASGNCNG 670
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
C Y G++ +KCQ CGQP Q YH+PR+W+ P NLLV+ EELGGDPS+ISL+ ++
Sbjct: 671 CSYAGTFRPTKCQLGCGQPTQRWYHVPRSWLKPTNNLLVVFEELGGDPSRISLVKRSLAS 730
Query: 711 ICSFVSEADPPPVDSWK-PNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
+C+ VSE P + +W+ + G SP+V L C G I +I FAS+G P G CGS+
Sbjct: 731 VCAEVSEFH-PTIKNWQIESYGRAEEFHSPKVHLRCSGGQSITSIKFASFGTPLGTCGSY 789
Query: 768 RPGACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ GACH I++K C+G+ C++ +S++ G CP ++K L+VEA C+
Sbjct: 790 QQGACHASTSYAILEKKCIGKQRCAVTISNSNFG--QDPCPNVMKKLSVEAVCA 841
>gi|183238710|gb|ACC60981.1| beta-galactosidase 1 precursor [Petunia x hybrida]
Length = 842
Score = 869 bits (2245), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/829 (50%), Positives = 562/829 (67%), Gaps = 26/829 (3%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A+V+YDH+A++++G+RR+L SGSIHYPRSTPE+WP+LI+K+KEGG++VI+TYVFWN HEP
Sbjct: 29 ASVSYDHKAIIVNGQRRILISGSIHYPRSTPEMWPDLIQKAKEGGVDVIQTYVFWNGHEP 88
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
+G+YYFE R+DLV+F+K V +AGL+++LR+GPYACAEWN+GGFPVWL ++PGI FRT N
Sbjct: 89 EQGKYYFEERYDLVKFIKLVHQAGLYVNLRVGPYACAEWNFGGFPVWLKYVPGISFRTDN 148
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M++F KI+++MK E L+ SQGGPIIL+Q+ENEYG +E +G G+ Y +WAA
Sbjct: 149 EPFKAAMQKFTTKIVNMMKAERLYESQGGPIILSQIENEYGPLEVRFGEQGKSYAEWAAK 208
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
A++L T VPW+MC+Q+DAPDP+INTCNGFYCD F PN KP +WTE ++ WF FG
Sbjct: 209 MALDLGTGVPWLMCKQDDAPDPVINTCNGFYCDYFYPNKAYKPKIWTEAWTAWFTEFGSP 268
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
VP+RPVEDLAF VA F +TGG+F NYYMY GGTNFGRTAGGP VATSYDYDAP+DE+G +
Sbjct: 269 VPYRPVEDLAFGVANFIQTGGSFINYYMYHGGTNFGRTAGGPFVATSYDYDAPLDEFGLL 328
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPKWGHL++LH+AIKLCE L+S DPT LG +AH++ +S CAAFLAN D +S
Sbjct: 329 RQPKWGHLKDLHRAIKLCEPALVSGDPTVTALGNYQKAHVFRSTSGACAAFLANNDPNSF 388
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A V F Y LP WS+SILPDCK+ V+NTA+V Q + ++ A+ +
Sbjct: 389 ATVAFGNKHYNLPPWSISILPDCKHTVYNTARV-----------GAQSALMKMTPANEGY 437
Query: 423 SW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
SW Y ++ + +F L EQ+NTT+D SDYLWY + + P + G +L
Sbjct: 438 SWQSYNDQTAFYDDNAFTVVGLLEQLNTTRDVSDYLWYMTDVKIDPSEGFLRSGNWPWLT 497
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ S G A VFVN +L YG+ +K + L G+N + +LS+ VGL N G F
Sbjct: 498 VSSAGDALHVFVNGQLAGTVYGSLKKQKITFSKAVNLRAGVNKISLLSIAVGLPNIGPHF 557
Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+ G+ V L L GKRDL+ +W Y+VG++GE + L +S ++S W +GS +
Sbjct: 558 ETWNTGVLGPVSLSGLDEGKRDLTWQKWSYKVGLKGEALNLHSLSGSSSVEWVEGSLVAQ 617
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
+ L WYKTTF AP G PLAL++ SMGKGQ W+NGQSIGRYW Y A +G C+Y
Sbjct: 618 RQPLTWYKTTFNAPAGNEPLALDMNSMGKGQVWINGQSIGRYWPGYKA--SGTCDACNYA 675
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
G ++ KC +CG +Q YH+PR+W+HP NLLV+ EE GGDP+ ISL+ + +C+
Sbjct: 676 GPFNEKKCLSNCGDASQRWYHVPRSWLHPTGNLLVVFEEWGGDPNGISLVKRELASVCAD 735
Query: 715 VSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
++E P V+ G V P+ L+C G I +I FAS+G P+G CGSF G+C
Sbjct: 736 INEWQPQLVNWQLQASGKVDKPLRPKAHLSCTSGQKITSIKFASFGTPQGVCGSFSEGSC 795
Query: 773 HM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
H +K C+GQ C++PV+ G CP ++K L+VEA CS
Sbjct: 796 HAHHSYDAFEKYCIGQESCTVPVTPEIFG--GDPCPSVMKKLSVEAVCS 842
>gi|359474925|ref|XP_002263382.2| PREDICTED: beta-galactosidase 3-like [Vitis vinifera]
gi|297744764|emb|CBI38026.3| unnamed protein product [Vitis vinifera]
Length = 846
Score = 869 bits (2245), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/831 (51%), Positives = 558/831 (67%), Gaps = 31/831 (3%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+VTYD +AL+I+G+RR+L SGSIHYPRSTP++W LI+K+K+GGL+ I+TYVFWN HEP
Sbjct: 26 SVTYDRKALIINGQRRILFSGSIHYPRSTPQMWEGLIQKAKDGGLDAIDTYVFWNLHEPS 85
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
G+Y FEGR+DLVRF+K +Q+AGL++HLRIGPY CAEWN+GGFPVWL F+PG+ FRT N
Sbjct: 86 PGKYNFEGRYDLVRFIKLIQKAGLYVHLRIGPYICAEWNFGGFPVWLKFVPGVSFRTDNE 145
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK M+RF KI+ +MK E LF SQGGPII++Q+ENEYG+ A+G G Y+ WAA
Sbjct: 146 PFKMAMQRFTQKIVQMMKNEKLFESQGGPIIISQIENEYGHESRAFGAPGYAYLTWAAKM 205
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
AV ++T VPWVMC+++DAPDP+INTCNGFYCD F+PN P+KP +WTE +SGWF F +
Sbjct: 206 AVAMDTGVPWVMCKEDDAPDPVINTCNGFYCDYFSPNKPNKPTLWTEAWSGWFTEFAGPI 265
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
RPVEDL+FAV RF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG IR
Sbjct: 266 QQRPVEDLSFAVTRFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIR 325
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
QPK+GHL+ELHKAIKLCE L+S+DP LG +A +++ S CAAFL+NY+ +S A
Sbjct: 326 QPKYGHLKELHKAIKLCERALLSADPAETSLGTYAKAQVFYSESGGCAAFLSNYNPTSAA 385
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
VTFN Y L WS+SILPDCKNVVFNTA V Q + Q N LL+ F+
Sbjct: 386 RVTFNSMHYNLAPWSISILPDCKNVVFNTATVGVQTSQ-----MQMLPTNSELLSWETFN 440
Query: 424 WYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIES 478
E+ + + L EQ+N T+DTSDYLWY+ I + + G+ L ++S
Sbjct: 441 --EDISSADDDSTITVVGLLEQLNVTRDTSDYLWYSTRIDISSSESFLHGGQHPTLIVQS 498
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
GHA VF+N L +G + F + L G N + +LS+ VGL N G F+
Sbjct: 499 TGHAMHVFINGHLSGSAFGTREDRRFTFTGDVNLQTGSNIISVLSIAVGLPNNGPHFETW 558
Query: 539 GAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
G+ V+L L GK+DLS +W YQVG++GE + L ++ ++ W +GS +
Sbjct: 559 STGVLGPVVLHGLDEGKKDLSWQKWSYQVGLKGEAMNLVSPNVISNIDWMKGSLFAQKQQ 618
Query: 598 -LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
L WYK F AP+G PLAL++ SMGKGQ W+NGQSIGRYW+AY + G C Y G+
Sbjct: 619 PLTWYKAYFDAPDGDEPLALDMGSMGKGQVWINGQSIGRYWTAY---AKGNCSGCSYSGT 675
Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
+ +KCQ CGQP Q YH+PR+W+ P +NLLV+ EELGGD SKIS + ++ +C+ VS
Sbjct: 676 FRTTKCQFGCGQPTQRWYHVPRSWLKPTQNLLVLFEELGGDASKISFMKRSVTTVCAEVS 735
Query: 717 EADPPPVDSW------KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
E P + +W +P S P+V L C G I+AI FAS+G P G CG+F+ G
Sbjct: 736 EHH-PNIKNWHIESQERPE---EMSKPKVHLHCASGQSISAIKFASFGTPSGTCGNFQKG 791
Query: 771 ACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CH +++K C+GQ +CS+ VSS+ A CP + K L+VEA C+
Sbjct: 792 TCHAPTSQAVLEKKCIGQQKCSVAVSSSNF---ANPCPNMFKKLSVEAVCA 839
>gi|312283357|dbj|BAJ34544.1| unnamed protein product [Thellungiella halophila]
Length = 856
Score = 868 bits (2244), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 434/834 (52%), Positives = 554/834 (66%), Gaps = 30/834 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ VTYD +AL+I+G+RR+L SGSIHYPRSTP++W LI+K+K+GG++VIETYVFWN H
Sbjct: 29 VQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEGLIQKAKDGGIDVIETYVFWNLH 88
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G+Y FEGR DLVRFVK + +AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 89 EPSPGKYDFEGRNDLVRFVKAIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 148
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK MK F +I++LMK ENLF SQGGPIIL+Q+ENEYG G G Y+ WA
Sbjct: 149 DNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQILGAEGHNYMTWA 208
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A+ T VPWVMC+++DAPDP+I+TCNGFYCD F PN P KP +WTE +SGWF FG
Sbjct: 209 AKMAIATETGVPWVMCKEDDAPDPVISTCNGFYCDSFAPNKPYKPTIWTEAWSGWFTEFG 268
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ RPV+DLAFAVARF + GG+F NYYMY GGTNFGRTAGGP V TSYDYDAPIDEYG
Sbjct: 269 GPMHHRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYG 328
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IRQPK+GHL+ELH+AIK+CE+ L+S+DP LG K +AH+Y S DC+AFLANYD+
Sbjct: 329 LIRQPKYGHLKELHRAIKMCEKALVSTDPVVTSLGNKQQAHVYSSESGDCSAFLANYDTE 388
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V FN Y LP WS+SILPDC+N VFNTAKV Q + L ++
Sbjct: 389 SAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKV----------GVQTSQMEMLPTSTG 438
Query: 421 AFSW---YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
+F W E+ + + +F L EQIN T+DTSDYLWY S+ + + G+
Sbjct: 439 SFQWQSYLEDLSSLDDSSTFTTQGLLEQINVTRDTSDYLWYMTSVDIGETESFLHGGELP 498
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L I+S GHA +FVN +L +G F KI L+ G N + +LS+ VGL N G
Sbjct: 499 TLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYKGKINLHSGTNRIALLSVAVGLPNVG 558
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS- 590
F+ G+ V L L GKRDLS +W YQVG++GE + L + S W S
Sbjct: 559 GHFESWNTGILGPVALHGLSQGKRDLSWQKWTYQVGLKGEAMNLAYPTNTPSFGWMDASL 618
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
T+ + L W+KT F APEG PLAL++ MGKGQ WVNG+SIGRYW+A+ +TG
Sbjct: 619 TVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAF---ATGDCGH 675
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
C Y G+Y +KC CGQP Q YH+PR+W+ P +NLLVI EELGG+PS +SL+ ++
Sbjct: 676 CSYTGTYKPNKCNSGCGQPTQKWYHVPRSWLKPSQNLLVIFEELGGNPSTVSLVKRSVSG 735
Query: 711 ICSFVSEADPPPVDSWKPN---LGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
+C+ VSE P + +W+ G P+V L C G I+AI FAS+G P G CGS+
Sbjct: 736 VCAEVSEYH-PNIKNWQIESYGKGQTFRRPKVHLKCSPGQAISAIKFASFGTPLGTCGSY 794
Query: 768 RPGACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ G CH I+++ CVG+ C++ +S++ G CP +LK L VEA C+
Sbjct: 795 QQGDCHAATSYAILERKCVGKARCAVTISNSNFG--KDPCPNVLKRLTVEAVCA 846
>gi|115450935|ref|NP_001049068.1| Os03g0165400 [Oryza sativa Japonica Group]
gi|122247496|sp|Q10RB4.1|BGAL5_ORYSJ RecName: Full=Beta-galactosidase 5; Short=Lactase 5; Flags:
Precursor
gi|108706354|gb|ABF94149.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113547539|dbj|BAF10982.1| Os03g0165400 [Oryza sativa Japonica Group]
gi|215717073|dbj|BAG95436.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 841
Score = 867 bits (2241), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/833 (51%), Positives = 558/833 (66%), Gaps = 38/833 (4%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD +A+++DG+RR+L SGSIHYPRSTPE+W LI K+K+GGL+VI+TYVFWN HEP
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G Y FEGR+DLVRF+KTVQ+AG+F+HLRIGPY C EWN+GGFPVWL ++PGI FRT N P
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK M+ F KI+ +MK ENLFASQGGPIIL+Q+ENEYG +G G+ Y+ WAA A
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGKAYINWAAKMA 206
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
V L+T VPWVMC+++DAPDP+IN CNGFYCD F+PN P KP MWTE +SGWF FG +
Sbjct: 207 VGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSGWFTEFGGTIR 266
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
RPVEDLAF VARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG R+
Sbjct: 267 QRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 326
Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
PK+GHL+ELH+A+KLCE+ L+S+DPT LG+ EAH++ +SS+ CAAFLANY+S+S A
Sbjct: 327 PKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVF-RSSSGCAAFLANYNSNSYAK 385
Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
V FN Y LP WS+SILPDCKNVVFNTA V Q N + +S+ W
Sbjct: 386 VIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTN----------QMQMWADGASSMMW 435
Query: 425 --YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
Y+E+V ++ L EQ+N T+DTSDYLWY S+ V P + G + L +
Sbjct: 436 EKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPSEKFLQGGTPLSLTV 495
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
+S GHA VF+N +L YG + + L G N + +LS+ GL N G ++
Sbjct: 496 QSAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVACGLPNVGVHYE 555
Query: 537 VAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
G+ ++I L G RDL+ W YQVG++GE + L+ + + S W QGS + N
Sbjct: 556 TWNTGVVGPVVIHGLDEGSRDLTWQTWSYQVGLKGEQMNLNSLEGSGSVEWMQGSLVAQN 615
Query: 596 KS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
+ L WY+ F P G PLAL++ SMGKGQ W+NGQSIGRYW+AY + G K C Y
Sbjct: 616 QQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAY---AEGDCKGCHYT 672
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
GSY A KCQ CGQP Q YH+PR+W+ P NLLV+ EELGGD SKI+L +T +C+
Sbjct: 673 GSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALAKRTVSGVCAD 732
Query: 715 VSEADPPPVDSWK------PNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFR 768
VSE P + +W+ P + +V L C G I+AI FAS+G P G CG+F+
Sbjct: 733 VSEYH-PNIKNWQIESYGEPEF----HTAKVHLKCAPGQTISAIKFASFGTPLGTCGTFQ 787
Query: 769 PGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G CH ++ +++K C+G C + +S + G CP ++K +AVEA CS
Sbjct: 788 QGECHSINSNSVLEKKCIGLQRCVVAISPSNFG--GDPCPEVMKRVAVEAVCS 838
>gi|350537913|ref|NP_001234317.1| TBG6 protein precursor [Solanum lycopersicum]
gi|7939625|gb|AAF70825.1|AF154424_1 putative beta-galactosidase [Solanum lycopersicum]
Length = 845
Score = 867 bits (2241), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/834 (51%), Positives = 561/834 (67%), Gaps = 30/834 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ +VTYD +A+VI+G+RR+L SGSIHYPRSTPE+W +LI K+KEGGL+V+ETYVFWN H
Sbjct: 24 VHCDVTYDRKAIVINGQRRLLFSGSIHYPRSTPEMWEDLINKAKEGGLDVVETYVFWNVH 83
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G Y FEGR+DLVRFVKT+Q+AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FR
Sbjct: 84 EPSPGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRA 143
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK MK + KI++LMK NLF SQGGPIIL+Q+ENEYG G G Y WA
Sbjct: 144 DNEPFKNAMKGYAEKIVNLMKSHNLFESQGGPIILSQIENEYGPQAKVLGAPGHQYSTWA 203
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A+ AV L+T VPWVMC++EDAPDP+INTCNGFYCD F PN P KP +WTE +SGWF FG
Sbjct: 204 ANMAVGLDTGVPWVMCKEEDAPDPVINTCNGFYCDNFFPNKPYKPAIWTEAWSGWFSEFG 263
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ RPV+DLAFAVA+F + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 264 GPLHQRPVQDLAFAVAQFIQRGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 323
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IRQPK+GHL+ELH+A+K+CE+ ++S+DP LG +A++Y + CAAFL+N D
Sbjct: 324 LIRQPKYGHLKELHRAVKMCEKSIVSADPAITSLGNLQQAYVYSSETGGCAAFLSNNDWK 383
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V FN Y LP WS+SILPDC+NVVFNTAKV Q + L S
Sbjct: 384 SAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKV----------GVQTSKMEMLPTNSE 433
Query: 421 AFSW--YEEKVGISGNRSFVRP-DLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
SW Y E + + S +R L EQIN T+DTSDYLWY S+ + + G+
Sbjct: 434 MLSWETYSEDISALDDSSSIRSFGLLEQINVTRDTSDYLWYITSVDIGSTESFLHGGELP 493
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L +E+ GHA VF+N +L +G F+ K+ L G N + +LS+ VGL N G
Sbjct: 494 TLIVETTGHAMHVFINGQLSGSAFGTRKNRRFVFKGKVNLRAGSNRIALLSVAVGLPNIG 553
Query: 533 AWFDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
F+ G+ + I L +GK DLS +W YQVG++GE + L + ++ W QGS
Sbjct: 554 GHFETWSTGVLGPVAIQGLDHGKWDLSWAKWTYQVGLKGEAMNLVSTNGISAVDWMQGSL 613
Query: 592 LPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
+ + L W+K F PEG PLAL+++SMGKGQ W+NGQSIGRYW+AY +TG
Sbjct: 614 IAQKQQPLTWHKAYFNTPEGDEPLALDMSSMGKGQVWINGQSIGRYWTAY---ATGDCNG 670
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
C Y G + KCQ CG+P Q YH+PR+W+ P +NLLV+ EELGGDP++ISL+ ++ +
Sbjct: 671 CQYSGVFRPPKCQLGCGEPTQKWYHVPRSWLKPTQNLLVLFEELGGDPTRISLVKRSVTN 730
Query: 711 ICSFVSEADPPPVDSWK-PNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
+CS V+E P + +W+ N G P+VR+ C G I++I FAS+G P G CGSF
Sbjct: 731 VCSNVAEYH-PNIKNWQIENYGKTEEFHLPKVRIHCAPGQSISSIKFASFGTPLGTCGSF 789
Query: 768 RPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ G CH D +V+K C+G+ C++ +S++ G CP +LK L+VEAHC+
Sbjct: 790 KQGTCHAPDSHAVVEKKCLGRQTCAVTISNSNFG--EDPCPNVLKRLSVEAHCT 841
>gi|61162206|dbj|BAD91084.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 852
Score = 867 bits (2240), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 435/831 (52%), Positives = 562/831 (67%), Gaps = 31/831 (3%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
VTYD +A++I+G+RR+L SGSIHYPRSTPE+W LI+K+K+GGL+VI+TYVFWN HEP
Sbjct: 28 TTVTYDKKAILINGQRRLLISGSIHYPRSTPEMWEGLIQKAKDGGLDVIDTYVFWNGHEP 87
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G YYFEGR+DLVRF+KTVQ+AGLFLHLRIGPY CAEWN+GGFPVWL ++PGI FRT N
Sbjct: 88 SPGNYYFEGRYDLVRFIKTVQKAGLFLHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDN 147
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M+ F KI+ +MK E LFASQGGPIIL+Q+ENEYG A G G+ Y+ WAA
Sbjct: 148 GPFKVAMQGFTQKIVQMMKNEKLFASQGGPIILSQIENEYGPERKALGAPGQNYINWAAK 207
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
AV L+T VPWVMC+++DAPDP+IN CNGFYCDGFTPN P KP MWTE +SGWFL FG
Sbjct: 208 MAVGLDTGVPWVMCKEDDAPDPMINACNGFYCDGFTPNKPYKPTMWTEAWSGWFLEFGGT 267
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
+ RPV+DLAFAVARF + GG++ NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG I
Sbjct: 268 IHHRPVQDLAFAVARFIQRGGSYVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLI 327
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPK+GHL+ELHKAIKLCE L+SS+PT LG +A++++ CAAFL+N+ S +
Sbjct: 328 RQPKYGHLKELHKAIKLCEHSLLSSEPTVTSLGTYHQAYVFNSGPRRCAAFLSNFH-SVE 386
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A VTFN Y LP WSVSILPDC+N V+NTAKV Q +V + S F
Sbjct: 387 ARVTFNNKHYDLPPWSVSILPDCRNEVYNTAKV----------GVQTSHVQMIPTNSRLF 436
Query: 423 SW--YEEKVGISGNRSFVRP-DLAEQINTTKDTSDYLWYTASIHVMPGQ---GKEVFLNI 476
SW Y+E + RS + L EQIN T+DTSDYLWY ++ + GK+ L +
Sbjct: 437 SWQTYDEDISSVHERSSIPAIGLLEQINVTRDTSDYLWYMTNVDISSSDLSGGKKPTLTV 496
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
+S GHA VFVN + +G + F + L+ GIN + +LS+ VGL N G ++
Sbjct: 497 QSAGHALHVFVNGQFSGSAFGTREQRQFTFADPVNLHAGINRIALLSIAVGLPNVGLHYE 556
Query: 537 VAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFW-KQGSTLPV 594
G+ + +D L NGK+DL+ +W +VG++GE + L + A+S W ++
Sbjct: 557 SWKTGIQGPVFLDGLGNGKKDLTLHKWFNKVGLKGEAMNLVSPNGASSVGWIRRSLATQT 616
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
++L WYK F AP G PLAL++ MGKGQ W+NGQSIGRYW AY + G C Y
Sbjct: 617 KQTLKWYKAYFNAPGGNEPLALDMRRMGKGQVWINGQSIGRYWMAY---AKGDCSSCSYI 673
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
G++ +KCQ HCG+P Q YH+PR+W+ P +NL+V+ EELGGDPSKI+L+ ++ +C
Sbjct: 674 GTFRPTKCQLHCGRPTQRWYHVPRSWLKPTQNLVVVFEELGGDPSKITLVRRSVAGVCGD 733
Query: 715 VSEADPPP----VDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
+ E P VD + + + + QV L C G I++I FAS+G P G CGSF+ G
Sbjct: 734 LHENHPNAENFDVDGNEDSKTLHQA--QVHLHCAPGQSISSIKFASFGTPSGTCGSFQQG 791
Query: 771 ACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CH + +V+K C+G+ CS+ VS++ CP +LK L+VEA CS
Sbjct: 792 TCHATNSHAVVEKNCIGRESCSVAVSNSTF--ETDPCPNVLKRLSVEAVCS 840
>gi|357454655|ref|XP_003597608.1| Beta-galactosidase [Medicago truncatula]
gi|124360385|gb|ABN08398.1| D-galactoside/L-rhamnose binding SUEL lectin; Galactose-binding
like [Medicago truncatula]
gi|355486656|gb|AES67859.1| Beta-galactosidase [Medicago truncatula]
Length = 841
Score = 867 bits (2239), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 422/827 (51%), Positives = 564/827 (68%), Gaps = 20/827 (2%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A+V+YD +A+ I+G+ R+L SGSIHYPRSTPE+WP+LI+K+KEGGL+VI+TYVFWN HEP
Sbjct: 26 ASVSYDSKAITINGQSRILISGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEP 85
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G+YYFEG +DLV+F+K VQ+AGL++HLRIGPY CAEWN+GGFPVWL +IPGI FRT N
Sbjct: 86 SPGKYYFEGNYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGISFRTDN 145
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK +M++F KI+D+MK + LF SQGGPII++Q+ENEYG +E+ G G+ Y KWAAD
Sbjct: 146 EPFKFQMQKFTEKIVDMMKADRLFESQGGPIIMSQIENEYGPMEYEIGAPGKSYTKWAAD 205
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
AV L T VPW+MC+Q+DAPDP+INTCNGFYCD F+PN KP MWTE ++GWF FG
Sbjct: 206 MAVGLGTGVPWIMCKQDDAPDPVINTCNGFYCDYFSPNKDYKPKMWTEAWTGWFTEFGGP 265
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
VP RP ED+AF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG +
Sbjct: 266 VPHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLL 325
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
+QPKWGHL++LH+AIKL E LIS DPT ++G EAH++ S CAAFL NY+ +
Sbjct: 326 QQPKWGHLKDLHRAIKLSEPALISGDPTVTRIGNYQEAHVFKSKSGACAAFLGNYNPKAF 385
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A V F Y LP WS+SILPDCKN V+NTA+V SQ AQ K + ++
Sbjct: 386 ATVAFGNMHYNLPPWSISILPDCKNTVYNTARVGSQS-------AQMKMTRVPIHGGLSW 438
Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIE 477
+ E+ + + SF L EQ+NTT+D +DYLWY+ + + P + GK+ L +
Sbjct: 439 QVFTEQTASTDDSSFTMTGLLEQLNTTRDLTDYLWYSTDVVIDPNEGFLRSGKDPVLTVL 498
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
S GHA VF+N +L YG+ +F ++ ++L G+N + +LS+ VGL N G F+
Sbjct: 499 SAGHALHVFINSQLSGTIYGSLEFPKLTFSQNVKLIPGVNKISLLSVAVGLPNVGPHFET 558
Query: 538 AGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
AG+ I ++ L G+RDLS +W Y+VG+ GE + L + ++S W QGS + +
Sbjct: 559 WNAGVLGPITLNGLDEGRRDLSWQKWSYKVGLHGEALSLHSLGGSSSVEWVQGSLVSRMQ 618
Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
L WYKTTF AP+G P AL++ SMGKGQ W+NGQ++GRYW AY A +G CDY G+
Sbjct: 619 PLTWYKTTFDAPDGIAPFALDMGSMGKGQVWLNGQNLGRYWPAYKA--SGTCDNCDYAGT 676
Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
Y+ +KC+ +CG+ +Q YH+P +W+ P NLLV+ EELGGDP+ I L+ + +C+ +
Sbjct: 677 YNENKCRSNCGEASQRWYHVPHSWLIPTGNLLVVFEELGGDPNGIFLVRRDIDSVCADIY 736
Query: 717 EADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM 774
E P + G + P+ L+C G I++I FAS+G P G+CG+F G+CH
Sbjct: 737 EWQPNLISYQMQTSGKTNKPVRPKAHLSCGPGQKISSIKFASFGTPVGSCGNFHEGSCHA 796
Query: 775 -DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+K CVGQ C + VS G CP +LK L+VEA C+
Sbjct: 797 HKSYNTFEKNCVGQNSCKVTVSPENFG--GDPCPNVLKKLSVEAICT 841
>gi|114217397|dbj|BAF31234.1| beta-D-galactosidase [Persea americana]
Length = 849
Score = 867 bits (2239), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/834 (51%), Positives = 556/834 (66%), Gaps = 30/834 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ +VTYD +A++I+G+R++L SGSIHYPRSTP++W L++K+K+GGL+VI+TYVFWN H
Sbjct: 26 IQCSVTYDRKAIIINGQRKILISGSIHYPRSTPDMWEGLMQKAKDGGLDVIQTYVFWNVH 85
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G Y FEGR+DLVRFVKTVQ+AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 86 EPSPGNYNFEGRYDLVRFVKTVQKAGLYMHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 145
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M+ F KI+ +MK E+LF SQGGPIIL+Q+ENEYG+ A G G Y+ WA
Sbjct: 146 DNEPFKMAMQGFTEKIVQMMKSESLFESQGGPIILSQIENEYGSESKALGAPGHAYMTWA 205
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L T VPWVMC+++DAPDP+INTCNGFYCD FTPN P KP MWTE +SGWF FG
Sbjct: 206 AKMAVGLRTGVPWVMCKEDDAPDPVINTCNGFYCDAFTPNKPYKPTMWTEAWSGWFTEFG 265
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
V RPVEDLAFAVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 266 GTVHERPVEDLAFAVARFIQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 325
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IRQPK+GHL+ELH+AIKLCE LIS+DP LG ++H++ + CAAFL+NY+ +
Sbjct: 326 LIRQPKYGHLKELHRAIKLCEPALISADPIVTSLGPYQQSHVFSSGTGGCAAFLSNYNPN 385
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V FN Y LP WS+SILPDC+NVVFNTAKV Q + + K
Sbjct: 386 SVARVMFNNMHYSLPPWSISILPDCRNVVFNTAKVGVQTSQMHMSAGETK---------- 435
Query: 421 AFSW--YEEKVGISGNRSFVRP-DLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
SW Y+E + G+ S + L EQ+N T+DTSDYLWY S+ + P + G+
Sbjct: 436 LLSWEMYDEDIASLGDNSMITAVGLLEQLNVTRDTSDYLWYMTSVDISPSESSLRGGRPP 495
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L ++S GHA V++N +L +G+ + F + + GIN + +LS+ V L N G
Sbjct: 496 VLTVQSAGHALHVYINGQLSGSAHGSRENRRFTFTGDVNMRAGINRIALLSIAVELPNVG 555
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
++ G+ V+L L GKRDL+ +W YQVG++GE + L S + W Q S
Sbjct: 556 LHYESTNTGVLGPVVLHGLDQGKRDLTWQKWSYQVGLKGEAMNLVAPSGISYVEWMQASF 615
Query: 592 LPVN-KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
+ L WYK F AP G PLAL+L SMGKGQ W+NG+SIGRYW+ A + G
Sbjct: 616 ATQKLQPLTWYKAYFNAPGGDEPLALDLGSMGKGQVWINGESIGRYWT---AAANGDCNH 672
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
C Y G+Y A KCQ CGQP Q YH+PR+W+ P +NLLVI EE+GGD S ISL+ ++
Sbjct: 673 CSYAGTYRAPKCQTGCGQPTQRWYHVPRSWLQPTKNLLVIFEEIGGDASGISLVKRSVSS 732
Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSS---PQVRLACERGWHIAAINFASYGIPEGNCGSF 767
+C+ VSE P + +W S P+V L C G I+AI FAS+G P G CGSF
Sbjct: 733 VCADVSEWH-PTIKNWHIESYGRSEELHRPKVHLRCAMGQSISAIKFASFGTPLGTCGSF 791
Query: 768 RPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ G CH + I++K C+GQ C++ +S G CP ++K +AVEA C+
Sbjct: 792 QQGPCHSPNSHAILEKKCIGQQRCAVTISMNNFG--GDPCPNVMKRVAVEAICT 843
>gi|356526021|ref|XP_003531618.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 843
Score = 866 bits (2238), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/828 (51%), Positives = 560/828 (67%), Gaps = 20/828 (2%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
SA+V+YDH+A++I+G+RR+L SGSIHYPRSTPE+WP+LI+K+KEGGL+VI+TYVFWN HE
Sbjct: 27 SASVSYDHKAIIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHE 86
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G+YYF G +DLVRF+K VQ+AGL+++LRIGPY CAEWN+GGFPVWL +IPGI FRT
Sbjct: 87 PSPGKYYFGGNYDLVRFIKLVQQAGLYVNLRIGPYVCAEWNFGGFPVWLKYIPGISFRTD 146
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK +M++F KI+D+MK E LF SQGGPIIL+Q+ENEYG +E+ G G Y +WAA
Sbjct: 147 NGPFKFQMEKFTKKIVDMMKAERLFESQGGPIILSQIENEYGPMEYEIGAPGRSYTQWAA 206
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV L T VPW+MC+Q+DAPDPIINTCNGFYCD F+PN KP MWTE ++GWF FG
Sbjct: 207 HMAVGLGTGVPWIMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGG 266
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
AVP RP EDLAF++ARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 267 AVPHRPAEDLAFSIARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGL 326
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
RQPKWGHL++LH+AIKLCE L+S D T Q+LG EAH++ S CAAFLANY+ S
Sbjct: 327 ARQPKWGHLKDLHRAIKLCEPALVSGDSTVQRLGNYEEAHVFRSKSGACAAFLANYNPQS 386
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
A V F Y LP WS+SILP+CK+ V+NTA+V SQ K + +
Sbjct: 387 YATVAFGNQHYNLPPWSISILPNCKHTVYNTARVGSQSTT-------MKMTRVPIHGGLS 439
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
+ + E+ + + SF L EQIN T+D SDYLWY+ + + + GK L +
Sbjct: 440 WKAFNEETTTTDDSSFTVTGLLEQINATRDLSDYLWYSTDVVINSNEGFLRNGKNPVLTV 499
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
S GHA VF+N +L YG+ + ++ + L G+N + +LS+ VGL N G F+
Sbjct: 500 LSAGHALHVFINNQLSGTAYGSLEAPKLTFSESVRLRAGVNKISLLSVAVGLPNVGPHFE 559
Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
AG+ + L L G+RDL+ +W Y+VG++GE + L +S ++S W QG +
Sbjct: 560 RWNAGVLGPITLSGLNEGRRDLTWQKWSYKVGLKGEALNLHSLSGSSSVEWLQGFLVSRR 619
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
+ L WYKTTF AP G PLAL++ SMGKGQ W+NGQS+GRYW AY A +G C+Y G
Sbjct: 620 QPLTWYKTTFDAPAGVAPLALDMGSMGKGQVWINGQSLGRYWPAYKA--SGSCGYCNYAG 677
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
+Y+ KC +CG+ +Q YH+P +W+ P NLLV+ EELGGDP+ I L+ + +C+ +
Sbjct: 678 TYNEKKCGSNCGEASQRWYHVPHSWLKPSGNLLVVFEELGGDPNGIFLVRRDIDSVCADI 737
Query: 716 SEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
E P V G V S P+ L+C G I++I FAS+G P G+CGS+R G+CH
Sbjct: 738 YEWQPNLVSYEMQASGKVRSPVRPKAHLSCGPGQKISSIKFASFGTPVGSCGSYREGSCH 797
Query: 774 M-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
K CVGQ C++ VS G CP ++K L+VEA C+
Sbjct: 798 AHKSYDAFLKNCVGQSWCTVTVSPEIFG--GDPCPRVMKKLSVEAICT 843
>gi|57232107|gb|AAW47739.1| beta-galactosidase [Prunus persica]
Length = 853
Score = 866 bits (2238), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/831 (51%), Positives = 559/831 (67%), Gaps = 25/831 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ VTYD RA+VI+G+RR+L SGSIHYPRSTPE+W +LI+K+K+GGL+V+ETYVFWN H
Sbjct: 24 VQCTVTYDRRAIVINGQRRILISGSIHYPRSTPEMWEDLIQKAKDGGLDVVETYVFWNVH 83
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G Y F+GR+DLVRF+KT+Q+AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 84 EPSPGNYNFKGRYDLVRFLKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 143
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M+ F KI+ LMK E LF SQGGPIIL+Q+ENEYG +G G Y+ WA
Sbjct: 144 DNEPFKRAMQGFTEKIVGLMKSEKLFESQGGPIILSQIENEYGAQSKLFGAAGHNYMTWA 203
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A+ AV L T VPWVMC++EDAPDP+INTCNGFYCD F PN P KP +WTE +SGWF FG
Sbjct: 204 ANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDSFAPNKPYKPTIWTEAWSGWFSEFG 263
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ RPV+DLA+AVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG
Sbjct: 264 GPIHQRPVQDLAYAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYG 323
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IRQPK+GHL+ELH+AIK+CE L+S+DP LG +A++Y S DC+AFL+N+DS
Sbjct: 324 LIRQPKYGHLKELHRAIKMCERALVSADPIITSLGNFQQAYVYTSESGDCSAFLSNHDSK 383
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V FN Y LP WS+SILPDC+NVVFNTAKV Q +Q + + S
Sbjct: 384 SAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQT-------SQMGMLPTNIQMLS 436
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
S+ E+ + + + P L EQIN T+D++DYLWY S+ + + G+ L
Sbjct: 437 WESYDEDITSLDDSSTITAPGLLEQINVTRDSTDYLWYKTSVDIGSSESFLRGGELPTLI 496
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
++S GHA +F+N +L +G + F K+ L+ G N + +LS+ VGL N G F
Sbjct: 497 VQSTGHAVHIFINGQLSGSSFGTRESRRFTYTGKVNLHAGTNRIALLSVAVGLPNVGGHF 556
Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+ G+ V L L GK DLS +W YQVG++GE + L + +S W +GS
Sbjct: 557 EAWNTGILGPVALHGLDQGKWDLSWQKWTYQVGLKGEAMNLVSPNSISSVDWMRGSLAAQ 616
Query: 595 NKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ L W+KT F APEG PLAL++ MGKGQ W+NGQSIGRYW+A+ + G C Y
Sbjct: 617 KQQPLTWHKTLFNAPEGDEPLALDMEGMGKGQIWINGQSIGRYWTAF---ANGNCNGCSY 673
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
G + KCQ CGQP Q +YH+PR+W+ P +NLLVI EE GGDPS+ISL+ ++ +C+
Sbjct: 674 AGGFRPPKCQVGCGQPTQRVYHVPRSWLKPMQNLLVIFEEFGGDPSRISLVKRSVSSVCA 733
Query: 714 FVSEADPPPVDSWK-PNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
V+E P + +W + G SP+V L C G I++I FAS+G P G CGS++ G
Sbjct: 734 EVAEYH-PTIKNWHIESYGKAEDFHSPKVHLRCNPGQAISSIKFASFGTPLGTCGSYQEG 792
Query: 771 ACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CH ++QK C+G+ C++ +S++ G CP +LK L+VEA C+
Sbjct: 793 TCHAATSYSVLQKKCIGKQRCAVTISNSNFG---DPCPKVLKRLSVEAVCA 840
>gi|308550954|gb|ADO34791.1| beta-galactosidase STBG6 [Solanum lycopersicum]
Length = 845
Score = 866 bits (2238), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/834 (51%), Positives = 559/834 (67%), Gaps = 30/834 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ +VTYD A+VI+G+RR+L SGSIHYPRSTPE+W +LI K+KEGGL+V+ETYVFWN H
Sbjct: 24 VHCDVTYDREAIVINGQRRLLFSGSIHYPRSTPEMWEDLINKAKEGGLDVVETYVFWNVH 83
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G Y FEGR+DLVRFVKT+Q+AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FR
Sbjct: 84 EPSPGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRA 143
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK MK + KI++LMK NLF SQGGPIIL+Q+ENEYG G G Y WA
Sbjct: 144 DNEPFKNAMKGYAEKIVNLMKSHNLFESQGGPIILSQIENEYGPQAKVLGAPGHQYSTWA 203
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A+ AV L+T VPWVMC++EDAPDP+INTCNGFYCD F PN P KP WTE +SGWF FG
Sbjct: 204 ANMAVGLDTGVPWVMCKEEDAPDPVINTCNGFYCDNFFPNKPYKPATWTEAWSGWFSEFG 263
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ RPV+DLAFAVA+F + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 264 GPLHQRPVQDLAFAVAQFIQRGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 323
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IRQPK+GHL+ELH+A+K+CE+ ++S+DP LG +A++Y + CAAFL+N D
Sbjct: 324 LIRQPKYGHLKELHRAVKMCEKSIVSADPAITSLGNLQQAYVYSSETGGCAAFLSNNDWK 383
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V FN Y LP WS+SILPDC+NVVFNTAKV Q + L S
Sbjct: 384 SAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKV----------GVQTSKMEMLPTNSE 433
Query: 421 AFSW--YEEKVGISGNRSFVRP-DLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
SW Y E + + S +R L EQIN T+DTSDYLWY S+ + + G+
Sbjct: 434 MLSWETYSEDISALDDSSSIRSFGLLEQINVTRDTSDYLWYITSVDIGSTESFLHGGELP 493
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L +E+ GHA VF+N +L +G F+ K+ L G N + +LS+ VGL N G
Sbjct: 494 TLIVETTGHAMHVFINGQLSGSAFGTRKNRRFVFKGKVNLRAGSNRIALLSVAVGLPNIG 553
Query: 533 AWFDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
F+ G+ + I L +GK DLS +W YQVG++GE + L + ++ W QGS
Sbjct: 554 GHFETWSTGVLGPVAIQGLDHGKWDLSWAKWTYQVGLKGEAMNLVSTNGISAVDWMQGSL 613
Query: 592 LPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
+ + L W+K F PEG PLAL+++SMGKGQ W+NGQSIGRYW+AY +TG
Sbjct: 614 IAQKQQPLTWHKAYFNTPEGDEPLALDMSSMGKGQVWINGQSIGRYWTAY---ATGDCNG 670
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
C Y G + KCQ CG+P Q YH+PR+W+ P +NLLV+ EELGGDP++ISL+ ++ +
Sbjct: 671 CQYSGVFRPPKCQLGCGEPTQKWYHVPRSWLKPTQNLLVLFEELGGDPTRISLVKRSVTN 730
Query: 711 ICSFVSEADPPPVDSWK-PNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
+CS V+E P + +W+ N G P+VR+ C G I++I FAS+G P G CGSF
Sbjct: 731 VCSNVAEYH-PNIKNWQIENYGKTEEFHLPKVRIHCAPGQSISSIKFASFGTPLGTCGSF 789
Query: 768 RPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ G CH D +V+K C+G+ C++ +S++ G CP +LK L+VEAHC+
Sbjct: 790 KQGTCHAPDSHAVVEKKCLGRQTCAVTISNSNFG--EDPCPNVLKRLSVEAHCT 841
>gi|302789848|ref|XP_002976692.1| hypothetical protein SELMODRAFT_268001 [Selaginella moellendorffii]
gi|300155730|gb|EFJ22361.1| hypothetical protein SELMODRAFT_268001 [Selaginella moellendorffii]
Length = 802
Score = 866 bits (2237), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 446/822 (54%), Positives = 555/822 (67%), Gaps = 44/822 (5%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NV+YDHR+L+++GKRR+L SGS+HYPR+TPE+WP +I+K+KEGGL+VIETYVFW+ HEP
Sbjct: 19 NVSYDHRSLILNGKRRILLSGSVHYPRATPEMWPGIIQKAKEGGLDVIETYVFWDRHEPS 78
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
GQYYFEGR+DLV+FVK VQ+AGL ++LRIGPY CAEWN GGFP+WL IP I FRT N
Sbjct: 79 PGQYYFEGRYDLVKFVKLVQQAGLLVNLRIGPYVCAEWNLGGFPIWLRDIPHIVFRTDNE 138
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK+ M+ FL KI+++MK+ENLFASQGGPIILAQVENEYGNV+ YG G Y+ WAA+
Sbjct: 139 PFKKYMQSFLTKIVNMMKEENLFASQGGPIILAQVENEYGNVDSHYGEAGVRYINWAAEM 198
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A NT VPW+MC Q P+ II+TCNG YCDG+ P KP MWTE+Y+GWF +G+ +
Sbjct: 199 AQAQNTGVPWIMCAQSKVPEYIIDTCNGMYCDGWNPTLYKKPTMWTESYTGWFTYYGWPL 258
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RPVED+AFAVARFFE GG+F NYYMYFGGTNFGRT+GGP VA+SYDYDAP+DEYG
Sbjct: 259 PHRPVEDIAFAVARFFERGGSFHNYYMYFGGTNFGRTSGGPYVASSYDYDAPLDEYGMQH 318
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
PKWGHL++LH+ +KL EE ++SS+ H +LG EAH+Y N C AFLAN DS +D
Sbjct: 319 LPKWGHLKDLHETLKLGEEVILSSEGQHSELGPNQEAHVY-SYGNGCVAFLANVDSMNDT 377
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
V F Y LPAWSVSI+ DCK V FN+AKV S Q V + + S+ S
Sbjct: 378 VVEFRNVSYSLPAWSVSIVLDCKTVAFNSAKVKS-----------QSAVVSMNPSKSSLS 426
Query: 424 W--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGH 481
W ++E VGISG+ SF L EQ+ TTKDTSDYLWYT G +L+IES+
Sbjct: 427 WTSFDEPVGISGS-SFKAKQLLEQMETTKDTSDYLWYTTRYATGTGS---TWLSIESMRD 482
Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
+FVN + + + + + I+L G NT+ +LS VGLQN+GA+ + AG
Sbjct: 483 VVHIFVNGQFQSSWHTSKSVLYNSVEAPIKLAPGSNTIALLSATVGLQNFGAFIETWSAG 542
Query: 542 LF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIW 600
L S+IL L G ++LS EW YQVG++GE + L + + S W ST K L W
Sbjct: 543 LSGSLILKGLPGGDQNLSKQEWTYQVGLKGEDLKLFTVEGSRSVNWSAVST---KKPLTW 599
Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
Y T F AP G P+AL+LASMGKGQAWVNGQSIGRYW AY A + C + CDYRGSYD +
Sbjct: 600 YMTEFDAPPGDDPVALDLASMGKGQAWVNGQSIGRYWPAYKAADSVCPESCDYRGSYDQN 659
Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADP 720
KC CGQ +Q YH+PR+W+ P NLLV+ EE GGDPS I +T++ IC+ V E+ P
Sbjct: 660 KCLTGCGQSSQRWYHVPRSWMKPRGNLLVLFEETGGDPSSIDFVTRSTNVICARVYESHP 719
Query: 721 PPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPI 779
V W P V I+ I FAS G PEG+CGSF+ G+CH D+
Sbjct: 720 ASVKLWCPGEKQV---------------ISQIRFASLGNPEGSCGSFKEGSCHTNDLSNT 764
Query: 780 VQKACVGQIECSIPVSSAYLGVSAGACPGLL-KALAVEAHCS 820
V+KACVGQ CS+ + ACPG+ K LAVEA CS
Sbjct: 765 VEKACVGQRSCSLAPD-----FTTSACPGVREKFLAVEALCS 801
>gi|350539595|ref|NP_001234465.1| beta-galactosidase precursor [Solanum lycopersicum]
gi|1352077|sp|P48980.1|BGAL_SOLLC RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; AltName:
Full=Exo-(1-->4)-beta-D-galactanase; Flags: Precursor
gi|6649906|gb|AAF21626.1|AF023847_1 beta-galactosidase precursor [Solanum lycopersicum]
gi|971485|emb|CAA58734.1| putative beta-galactosidase/galactanase [Solanum lycopersicum]
gi|4138139|emb|CAA10174.1| ss-galactosidase [Solanum lycopersicum]
Length = 835
Score = 866 bits (2237), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/829 (51%), Positives = 562/829 (67%), Gaps = 26/829 (3%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A+V+YDH+A++++G+R++L SGSIHYPRSTPE+WP+LI+K+KEGG++VI+TYVFWN HEP
Sbjct: 22 ASVSYDHKAIIVNGQRKILISGSIHYPRSTPEMWPDLIQKAKEGGVDVIQTYVFWNGHEP 81
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G+YYFE R+DLV+F+K VQEAGL++HLRIGPYACAEWN+GGFPVWL ++PGI FRT N
Sbjct: 82 EEGKYYFEERYDLVKFIKVVQEAGLYVHLRIGPYACAEWNFGGFPVWLKYVPGISFRTNN 141
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M++F KI+D+MK E L+ +QGGPIIL+Q+ENEYG +EW G G++Y +WAA
Sbjct: 142 EPFKAAMQKFTTKIVDMMKAEKLYETQGGPIILSQIENEYGPMEWELGEPGKVYSEWAAK 201
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
AV+L T VPW+MC+Q+D PDPIINTCNGFYCD FTPN +KP MWTE ++ WF FG
Sbjct: 202 MAVDLGTGVPWIMCKQDDVPDPIINTCNGFYCDYFTPNKANKPKMWTEAWTAWFTEFGGP 261
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
VP+RP ED+AFAVARF +TGG+F NYYMY GGTNFGRT+GGP +ATSYDYDAP+DE+G +
Sbjct: 262 VPYRPAEDMAFAVARFIQTGGSFINYYMYHGGTNFGRTSGGPFIATSYDYDAPLDEFGSL 321
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPKWGHL++LH+AIKLCE L+S DPT LG EA ++ S CAAFLANY+ S
Sbjct: 322 RQPKWGHLKDLHRAIKLCEPALVSVDPTVTSLGNYQEARVFKSESGACAAFLANYNQHSF 381
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A V F Y LP WS+SILPDCKN V+NTA+V +Q AQ K + S F
Sbjct: 382 AKVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQS-------AQMK----MTPVSRGF 430
Query: 423 SW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
SW + E + +F L EQIN T+D SDYLWY I + P + G +L
Sbjct: 431 SWESFNEDAASHEDDTFTVVGLLEQINITRDVSDYLWYMTDIEIDPTEGFLNSGNWPWLT 490
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ S GHA VFVN +L YG+ + + I L G+N + +LS+ VGL N G F
Sbjct: 491 VFSAGHALHVFVNGQLAGTVYGSLENPKLTFSNGINLRAGVNKISLLSIAVGLPNVGPHF 550
Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+ AG+ V L L G RDL+ +W Y+VG++GE + L +S + S W +GS +
Sbjct: 551 ETWNAGVLGPVSLNGLNEGTRDLTWQKWFYKVGLKGEALSLHSLSGSPSVEWVEGSLVAQ 610
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
+ L WYKTTF AP+G PLAL++ +MGKGQ W+NGQS+GR+W AY S+G C+Y
Sbjct: 611 KQPLSWYKTTFNAPDGNEPLALDMNTMGKGQVWINGQSLGRHWPAY--KSSGSCSVCNYT 668
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
G +D KC +CG+ +Q YH+PR+W++P NLLV+ EE GGDP I+L+ + +C+
Sbjct: 669 GWFDEKKCLTNCGEGSQRWYHVPRSWLYPTGNLLVVFEEWGGDPYGITLVKREIGSVCAD 728
Query: 715 VSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
+ E P ++ + G P+ L C G I++I FAS+G PEG CG+F+ G+C
Sbjct: 729 IYEWQPQLLNWQRLVSGKFDRPLRPKAHLKCAPGQKISSIKFASFGTPEGVCGNFQQGSC 788
Query: 773 HM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
H +K CVG+ CS+ V+ G C +LK L+VEA CS
Sbjct: 789 HAPRSYDAFKKNCVGKESCSVQVTPENFG--GDPCRNVLKKLSVEAICS 835
>gi|316995681|emb|CAA07236.2| beta-galactosidase precursor [Cicer arietinum]
Length = 839
Score = 865 bits (2235), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 426/829 (51%), Positives = 569/829 (68%), Gaps = 20/829 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
A+V+YD++A+ I+G+R++L SGSIHYPRSTPE+WP+LI+K+KEGGL+VI+TYVFWN H
Sbjct: 22 FEASVSYDYKAITINGQRKILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGH 81
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G+YYFEG +DLV+F++ VQ+AGL++HLRIGPYACAEWN+GGFPVWL +IPGI FRT
Sbjct: 82 EPSPGKYYFEGNYDLVKFIRLVQQAGLYVHLRIGPYACAEWNFGGFPVWLKYIPGISFRT 141
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK +M++F KI+++MK E L+ SQGGPIIL+Q+ENEYG +E+ G G+ Y +WA
Sbjct: 142 DNGPFKFQMQKFTTKIVNIMKAERLYESQGGPIILSQIENEYGPMEYELGAPGKAYAQWA 201
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A+ L T VPWVMC+Q+DAPDP+INTCNGFYCD F+PN KP MWTE ++GWF FG
Sbjct: 202 AHMAIGLGTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTGFG 261
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
VP RP EDLAF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 262 GTVPHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 321
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+RQPKWGHL++LH+AIKLCE L+S+DPT +LG EAH++ S CAAFLANY+
Sbjct: 322 LLRQPKWGHLKDLHRAIKLCEPALVSADPTVTRLGNYQEAHVFKSKSGACAAFLANYNPH 381
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S + V F Y LP WS+SILP+CK+ V+NTA++ SQ AQ K +
Sbjct: 382 SYSTVAFGNQHYNLPPWSISILPNCKHTVYNTARLGSQS-------AQMKMTRVPIHGGL 434
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
++ + E+ + + SF L EQIN T+D SDYLWY+ + + P + GK L
Sbjct: 435 SWKAFNEETTTTDDSSFTVTGLLEQINATRDLSDYLWYSTDVVINPDEGYFRNGKNPVLT 494
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ S GHA VF+N +L YG+ DF ++ + L G+N + +LS+ VGL N G F
Sbjct: 495 VLSAGHALHVFINGQLSGTVYGSLDFPKLTFSESVNLRAGVNKISLLSVAVGLPNVGPHF 554
Query: 536 DVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+ AG+ I ++ L G+RDL+ +W Y+VG++GE + L +S ++S W QG +
Sbjct: 555 ETWNAGVLGPITLNGLNEGRRDLTWQKWSYKVGLKGEDLSLHSLSGSSSVDWLQGYLVSR 614
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
+ L WYKTTF AP G PLAL++ SMGKGQ W+NGQS+GRYW AY A TG C+Y
Sbjct: 615 RQPLTWYKTTFDAPAGVAPLALDMNSMGKGQVWLNGQSLGRYWPAYKA--TGSCDYCNYA 672
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
G+Y+ KC +CG+ +Q YH+P +W+ P NLLV+ EELGGDP+ + L+ + +C+
Sbjct: 673 GTYNEKKCGTNCGEASQRWYHVPHSWLKPTGNLLVMFEELGGDPNGVFLVRRDIDSVCAD 732
Query: 715 VSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
+ E P V G VS SP+ L+C G I++I FAS+G P G+CG++R G+C
Sbjct: 733 IYEWQPNLVSYQMQASGKVSRPVSPKAHLSCGPGQKISSIKFASFGTPVGSCGNYREGSC 792
Query: 773 HM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
H Q+ CVGQ C++ VS G CP ++K L+VEA C+
Sbjct: 793 HAHKSYDAFQRNCVGQSSCTVTVSPEIFG--GDPCPNVMKKLSVEAICT 839
>gi|302782774|ref|XP_002973160.1| hypothetical protein SELMODRAFT_413650 [Selaginella moellendorffii]
gi|300158913|gb|EFJ25534.1| hypothetical protein SELMODRAFT_413650 [Selaginella moellendorffii]
Length = 805
Score = 864 bits (2232), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 449/824 (54%), Positives = 556/824 (67%), Gaps = 45/824 (5%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NV+YDHR+L+++GKRR+L SGS+HYPR+TPE+WP +I+K+KEGGL+VIETYVFW+ HEP
Sbjct: 19 NVSYDHRSLILNGKRRILLSGSVHYPRATPEMWPGIIQKAKEGGLDVIETYVFWDRHEPS 78
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
GQYYFEGR+DLV+FVK VQ+AGL ++LRIGPY CAEWN GGFP+WL IP I FRT N
Sbjct: 79 PGQYYFEGRYDLVKFVKLVQQAGLLMNLRIGPYVCAEWNLGGFPIWLRDIPHIVFRTDNE 138
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK+ M+ FL KI+++MK+ENLFASQGGPIILAQVENEYGNV+ YG G Y+ WAA+
Sbjct: 139 PFKKYMQSFLTKIVNMMKEENLFASQGGPIILAQVENEYGNVDSHYGEAGVRYINWAAEM 198
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A NT VPW+MC Q P+ II+TCNG YCDG+ P KP MWTE+Y+GWF +G+ +
Sbjct: 199 AQAQNTGVPWIMCAQSKVPEYIIDTCNGMYCDGWNPILYKKPTMWTESYTGWFTYYGWPI 258
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYM--YFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
P RPVED+AFAVARFFE GG+F NYYM YFGGTNFGRT+GGP VA+SYDYDAP+DEYG
Sbjct: 259 PHRPVEDIAFAVARFFERGGSFHNYYMVWYFGGTNFGRTSGGPYVASSYDYDAPLDEYGM 318
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
PKWGHL++LH+ +KL EE ++SS+ H +LG EAH+Y N C AFLAN DS +
Sbjct: 319 QHLPKWGHLKDLHETLKLGEEVILSSEGQHSELGPNQEAHVY-SYGNGCVAFLANVDSMN 377
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
D V F Y LPAWSVSIL DCK V FN+AKV S Q V + + S
Sbjct: 378 DTVVEFRNVSYSLPAWSVSILLDCKTVAFNSAKVKS-----------QSAVVSMSPSKST 426
Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESL 479
SW ++E VGISG+ SF L EQ+ TTKDTSDYLWYT S+ G G +L+IES+
Sbjct: 427 LSWTSFDEPVGISGS-SFKAKQLLEQMETTKDTSDYLWYTTSVEAT-GTGS-TWLSIESM 483
Query: 480 GHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAG 539
+FVN + + + + + I L G NT+ +LS VGLQN+GA+ +
Sbjct: 484 RDVVHIFVNGQFQSSWHTSKSVLYNSVEAPITLAPGSNTIALLSATVGLQNFGAFIETWS 543
Query: 540 AGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
AGL S+IL L G ++LS EW YQVG++GE + L + + S W ST K L
Sbjct: 544 AGLSGSLILKGLPGGDQNLSKQEWTYQVGLKGEDLKLFTVEGSRSVNWSAVST---EKPL 600
Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
WY T F AP G P+AL+LASMGKGQAWVNGQSIGRYW AY A + C + CDYRGSYD
Sbjct: 601 TWYMTEFDAPPGDDPVALDLASMGKGQAWVNGQSIGRYWPAYKAADSVCPESCDYRGSYD 660
Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEA 718
+KC CGQ +Q YH+PR+W+ P NLLV+ EE GGDPS I +T++ IC+ V E+
Sbjct: 661 QNKCLTGCGQSSQRWYHVPRSWMKPRGNLLVLFEETGGDPSSIDFVTRSTNVICARVYES 720
Query: 719 DPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DVL 777
P V W P V I+ I FAS G PEG+CGSF+ G+CH D+
Sbjct: 721 HPASVKLWCPGEKQV---------------ISQIRFASLGNPEGSCGSFKEGSCHTNDLS 765
Query: 778 PIVQKACVGQIECSIPVSSAYLGVSAGACPGLL-KALAVEAHCS 820
V+KACVGQ CS+ + ACPG+ K LAVEA CS
Sbjct: 766 NTVEKACVGQRSCSLAPD-----FTISACPGVREKFLAVEALCS 804
>gi|449491392|ref|XP_004158882.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 854
Score = 863 bits (2231), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/833 (51%), Positives = 557/833 (66%), Gaps = 28/833 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ +VTYD +A++I+G+RRVL SGSIHYPRSTPE+W LI+K+KEGGL+V+ETYVFWN H
Sbjct: 25 VQCSVTYDRKAILINGQRRVLFSGSIHYPRSTPEMWEGLIQKAKEGGLDVVETYVFWNVH 84
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G Y FEGR+DL RF+KT+Q+AGL+ +LRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 85 EPSPGNYNFEGRYDLARFIKTIQKAGLYANLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 144
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M+ F KI+ LMK ENLF SQGGPIIL+Q+ENEYG +G G+ Y+ WA
Sbjct: 145 DNEPFKRAMQGFTEKIVGLMKSENLFESQGGPIILSQIENEYGVQSKLFGAAGQNYMTWA 204
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L T VPWVMC++EDAPDP+INTCNGFYCD F+PN P KP MWTE +SGWF FG
Sbjct: 205 AKMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDAFSPNRPYKPTMWTEAWSGWFNEFG 264
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ RPV+DLAFAVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 265 GPIHQRPVQDLAFAVARFIQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 324
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IRQPK+GHL+ELH+A+K+CE+ L+S+DP LG+ +A++Y S +CAAFL+NYD+
Sbjct: 325 LIRQPKYGHLKELHRAVKMCEKALVSADPIVTSLGSSQQAYVYTSESGNCAAFLSNYDTD 384
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V FN Y LP WS+SILPDC+NVVFNTAKV Q + L S
Sbjct: 385 SAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKV----------GVQTSQLEMLPTNSP 434
Query: 421 AFSW--YEEKVGISGNRSFVRPD-LAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
W Y E V + + + L EQIN TKDTSDYLWY S+ + + G+
Sbjct: 435 MLLWESYNEDVSAEDDSTTMTASGLLEQINVTKDTSDYLWYITSVDIGSTESFLHGGELP 494
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L ++S GHA +F+N +L +G+ + F K+ G NT+ +LS+ VGL N G
Sbjct: 495 TLIVQSTGHAVHIFINGRLSGSAFGSRENRRFTYTGKVNFRAGRNTIALLSVAVGLPNVG 554
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS- 590
F+ G+ V L L GK DLS +W Y+VG++GE + L + +S W +GS
Sbjct: 555 GHFETWNTGILGPVALHGLDQGKLDLSWAKWTYKVGLKGEAMNLVSPNGISSVEWMEGSL 614
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
+ L W+K+ F APEG PLA+++ MGKGQ W+NG SIGRYW+AY +TG K
Sbjct: 615 AAQAPQPLTWHKSNFDAPEGDEPLAIDMRGMGKGQIWINGVSIGRYWTAY---ATGNCDK 671
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
C+Y G++ KCQ+ CGQP Q YH+PR W+ P +NLLV+ EELGG+P+ ISL+ ++
Sbjct: 672 CNYAGTFRPPKCQQGCGQPTQRWYHVPRAWLKPKDNLLVVFEELGGNPTSISLVKRSVTG 731
Query: 711 ICSFVSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFR 768
+C+ VSE P + + G P+V L C G+ I +I FAS+G P G CGS++
Sbjct: 732 VCADVSEYHPTLKNWHIESYGKSEDLHRPKVHLKCSAGYSITSIKFASFGTPLGTCGSYQ 791
Query: 769 PGACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G CH + I++K C+G+ C++ +S+ G CP +LK L+VE C+
Sbjct: 792 QGTCHAPMSYDILEKRCIGKQRCAVTISNTNFG--QDPCPNVLKRLSVEVVCA 842
>gi|356564794|ref|XP_003550633.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 839
Score = 863 bits (2231), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/829 (50%), Positives = 559/829 (67%), Gaps = 27/829 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
++A+VTYDH+A+V++G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN H
Sbjct: 27 VTASVTYDHKAIVVNGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGH 86
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G+YYFE R+DLV+F+K VQ+AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 87 EPSPGKYYFEDRYDLVKFIKLVQQAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIAFRT 146
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M++F KI+ +MK+E LF +QGGPII++Q+ENEYG VEW G G+ Y KW
Sbjct: 147 DNEPFKAAMQKFTEKIVSIMKEEKLFQTQGGPIIMSQIENEYGPVEWEIGAPGKAYTKWF 206
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
+ AV L+T VPW+MC+Q+D PDP+I+TCNG+YC+ FTPN KP MWTEN++GW+ FG
Sbjct: 207 SQMAVGLDTGVPWIMCKQQDTPDPLIDTCNGYYCENFTPNKKYKPKMWTENWTGWYTEFG 266
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
AVP RP ED+AF+VARF + GG+F NYYMY GGTNF RT+ G +ATSYDYD PIDEYG
Sbjct: 267 GAVPRRPAEDMAFSVARFVQNGGSFVNYYMYHGGTNFDRTSSGLFIATSYDYDGPIDEYG 326
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+ +PKWGHLR+LHKAIKLCE L+S DPT G LE H++ K+S CAAFLANYD+
Sbjct: 327 LLNEPKWGHLRDLHKAIKLCEPALVSVDPTVTWPGNNLEVHVF-KTSGACAAFLANYDTK 385
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A+V F Y LP WS+SILPDCK VFNTA++ Q ++ ++ +S
Sbjct: 386 SSASVKFGNGQYDLPPWSISILPDCKTAVFNTARL-----------GAQSSLMKMTAVNS 434
Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
AF W EE + + S L EQIN T+D++DYLWY +++ + G+
Sbjct: 435 AFDWQSYNEEPASSNEDDSLTAYALWEQINVTRDSTDYLWYMTDVNIDANEGFIKNGQSP 494
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L + S GH V +N +L YG D + ++L G N + +LS+ VGL N G
Sbjct: 495 VLTVMSAGHVLHVLINDQLSGTVYGGLDSHKLTFSDSVKLRVGNNKISLLSIAVGLPNVG 554
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
F+ AG+ V L L G RDLS +W Y++G++GE + L+ +S ++S W QGS
Sbjct: 555 PHFETWNAGVLGPVTLKGLNEGTRDLSKQKWSYKIGLKGEALNLNTVSGSSSVEWVQGSL 614
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
L + L WYKTTF P G PLAL++ SMGKGQAW+NG+SIGR+W Y+A G C
Sbjct: 615 LAKQQPLAWYKTTFSTPAGNDPLALDMISMGKGQAWINGRSIGRHWPGYIA--RGNCGDC 672
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
Y G+Y KC+ +CG+P+Q YHIPR+W++P N LV+ EE GGDP+ I+L+ +T +
Sbjct: 673 YYAGTYTDKKCRTNCGEPSQRWYHIPRSWLNPSGNYLVVFEEWGGDPTGITLVKRTTASV 732
Query: 712 CSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
C+ + + P + + G V P+ L C G +I+ I FASYG+P+G CG+FR G+
Sbjct: 733 CADIYQGQPTLKNRQMLDSGKV-VRPKAHLWCPPGKNISQIKFASYGLPQGTCGNFREGS 791
Query: 772 CHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
CH QK C+G+ C + V+ G CPG+ K L++EA C
Sbjct: 792 CHAHKSYDAPQKNCIGKQSCLVTVAPEVFG--GDPCPGIAKKLSLEALC 838
>gi|449460229|ref|XP_004147848.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
gi|449476862|ref|XP_004154857.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 844
Score = 862 bits (2228), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/832 (51%), Positives = 548/832 (65%), Gaps = 35/832 (4%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
VTYD +A++I+G+RR+L SGSIHYPRSTPE+W +L++K+K+GGL+V++TYVFWN HEP
Sbjct: 27 TTVTYDKKAILINGQRRILISGSIHYPRSTPEMWDDLMQKAKDGGLDVVDTYVFWNVHEP 86
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G Y FEGR+DLVRF+KT Q GL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT N
Sbjct: 87 SPGNYDFEGRYDLVRFIKTAQRVGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDN 146
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M+ F KI+ +MK E LFASQGGPIIL+Q+ENEYG A G G Y+ WAA
Sbjct: 147 GPFKMAMQGFTQKIVQMMKSEKLFASQGGPIILSQIENEYGPQSKALGAAGHAYMNWAAK 206
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
AV LNT VPWVMC+++DAPDP+IN+CNGFYCD F+PN P KP +WTE +SGWF FG
Sbjct: 207 MAVGLNTGVPWVMCKEDDAPDPVINSCNGFYCDYFSPNKPYKPTLWTEAWSGWFTEFGGP 266
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
V RPV+DLAFAVARF + GG+ NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG +
Sbjct: 267 VYGRPVQDLAFAVARFVQKGGSLFNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGML 326
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPK+GHL+ LH+AIKLCE L+SSDPT LGA +AH++ CAAFLANY ++S
Sbjct: 327 RQPKYGHLKNLHRAIKLCEHALVSSDPTVTSLGAYEQAHVFSSGPGRCAAFLANYHTNSA 386
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A V FN Y LPAWS+SILPDCK VVFNTA+V G H AQ ++L S
Sbjct: 387 ATVVFNNMRYALPAWSISILPDCKRVVFNTAQV------GVH-IAQ----TQMLPTISKL 435
Query: 423 SWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
SW E+ + G+ L EQIN T+DTSDYLWY S+ + + G++ L
Sbjct: 436 SWETYNEDTYSLGGSSRMTVAGLLEQINVTRDTSDYLWYMTSVGISSSEAFLRGGQKPTL 495
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
++ S GHA VF+N + YG+ + F I L G+N + +LS+ VGL N G
Sbjct: 496 SVRSAGHAVHVFINGQFSGSAYGSREHPAFTYTGPINLRAGMNKIALLSIAVGLPNVGLH 555
Query: 535 FDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
F+ G+ I I L GK+DL+ +W YQVG++GE + L + A S W +GS L
Sbjct: 556 FEKWQTGILGPISISGLNGGKKDLTWQKWSYQVGLKGEAMNLVSPTEATSVDWIKGSLLQ 615
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ L WYK +F AP G PLAL+L SMGKGQAW+NGQSIGRYW AY + G +C Y
Sbjct: 616 GQRPLTWYKASFNAPRGNEPLALDLRSMGKGQAWINGQSIGRYWMAY---AKGGCSRCTY 672
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
G+Y C+ CGQP Q YH+PR+W+ P N+LV+ EELGGD SKISL+ ++ +C
Sbjct: 673 AGTYRPPTCENGCGQPTQRWYHVPRSWLKPTNNVLVLFEELGGDASKISLMRRSVTGLCG 732
Query: 714 FVSEADPPPVDSWKPNLGVVSSSPQ---VRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
E K + ++ S+ + + L C G I+AI FAS+G P G CGS++ G
Sbjct: 733 EAVEYHA------KNDSYIIESNEELDSLHLQCNPGQVISAIKFASFGTPSGTCGSYQKG 786
Query: 771 ACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCSI 821
CH D I++K C+G CS+ + GV CP LK L VE C I
Sbjct: 787 TCHAPDSHAIIEKKCIGLKSCSVSTTRDNFGVD--PCPNELKQLLVEVDCGI 836
>gi|18403090|ref|NP_565755.1| beta galactosidase 9 [Arabidopsis thaliana]
gi|75265632|sp|Q9SCV3.1|BGAL9_ARATH RecName: Full=Beta-galactosidase 9; Short=Lactase 9; Flags:
Precursor
gi|6686890|emb|CAB64745.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|20197062|gb|AAC04500.2| putative beta-galactosidase [Arabidopsis thaliana]
gi|330253650|gb|AEC08744.1| beta galactosidase 9 [Arabidopsis thaliana]
Length = 887
Score = 862 bits (2228), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 440/851 (51%), Positives = 565/851 (66%), Gaps = 44/851 (5%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NV+YDHRAL+I GKRR+L S IHYPR+TPE+W +LI KSKEGG +V++TYVFWN HEP+
Sbjct: 37 NVSYDHRALIIAGKRRMLVSAGIHYPRATPEMWSDLIAKSKEGGADVVQTYVFWNGHEPV 96
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+GQY FEGR+DLV+FVK + +GL+LHLRIGPY CAEWN+GGFPVWL IPGI+FRT N
Sbjct: 97 KGQYNFEGRYDLVKFVKLIGSSGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTDNE 156
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK+EM++F+ KI+DLM++ LF QGGPII+ Q+ENEYG+VE +YG G+ YVKWAA
Sbjct: 157 PFKKEMQKFVTKIVDLMREAKLFCWQGGPIIMLQIENEYGDVEKSYGQKGKDYVKWAASM 216
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A+ L VPWVMC+Q DAP+ II+ CNG+YCDGF PNS +KP++WTE++ GW+ +G ++
Sbjct: 217 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGFKPNSRTKPVLWTEDWDGWYTKWGGSL 276
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RP EDLAFAVARF++ GG+FQNYYMYFGGTNFGRT+GGP TSYDYDAP+DEYG
Sbjct: 277 PHRPAEDLAFAVARFYQRGGSFQNYYMYFGGTNFGRTSGGPFYITSYDYDAPLDEYGLRS 336
Query: 304 QPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYHKSSND----CAAFLANYD 358
+PKWGHL++LH AIKLCE L+++D P ++KLG+K EAHIYH CAAFLAN D
Sbjct: 337 EPKWGHLKDLHAAIKLCEPALVAADAPQYRKLGSKQEAHIYHGDGETGGKVCAAFLANID 396
Query: 359 SSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQ---------Q 409
A+V FNG Y LP WSVSILPDC++V FNTAKV +Q + A+ Q
Sbjct: 397 EHKSAHVKFNGQSYTLPPWSVSILPDCRHVAFNTAKVGAQTSVKTVESARPSLGSMSILQ 456
Query: 410 KNVNELLLASSAFSWY--EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPG 467
K V + ++ + SW +E +GI G +F L E +N TKD SDYLW+ I V
Sbjct: 457 KVVRQDNVSYISKSWMALKEPIGIWGENNFTFQGLLEHLNVTKDRSDYLWHKTRISVSED 516
Query: 468 Q-------GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLD 520
G ++I+S+ VFVNK+L G+ A + + +G N L
Sbjct: 517 DISFWKKNGPNSTVSIDSMRDVLRVFVNKQLAGSIVGHWVKA----VQPVRFIQGNNDLL 572
Query: 521 ILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKIS 579
+L+ VGLQNYGA+ + GAG L KNG DLS W YQVG++GE DKI
Sbjct: 573 LLTQTVGLQNYGAFLEKDGAGFRGKAKLTGFKNGDLDLSKSSWTYQVGLKGE---ADKIY 629
Query: 580 LANSSFWKQGSTLPVNKS---LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRY 636
+ + STL + S +WYKT F P G P+ LNL SMG+GQAWVNGQ IGRY
Sbjct: 630 TVEHNEKAEWSTLETDASPSIFMWYKTYFDPPAGTDPVVLNLESMGRGQAWVNGQHIGRY 689
Query: 637 WSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGG 696
W+ ++ GC + CDYRG+Y++ KC +CG+P QT YH+PR+W+ P NLLV+ EE GG
Sbjct: 690 WNI-ISQKDGCDRTCDYRGAYNSDKCTTNCGKPTQTRYHVPRSWLKPSSNLLVLFEETGG 748
Query: 697 DPSKISLLTKTGQHICSFVSEADPPPVDSWKP------NLGVVSSSPQVRLACERGWHIA 750
+P KIS+ T T +C VSE+ PP+ W + + S +P+V L CE G I+
Sbjct: 749 NPFKISVKTVTAGILCGQVSESHYPPLRKWSTPDYINGTMSINSVAPEVHLHCEDGHVIS 808
Query: 751 AINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGL 809
+I FASYG P G+C F G CH + L IV +AC G+ C I VS+ + C G
Sbjct: 809 SIEFASYGTPRGSCDGFSIGKCHASNSLSIVSEACKGRNSCFIEVSNT--AFISDPCSGT 866
Query: 810 LKALAVEAHCS 820
LK LAV + CS
Sbjct: 867 LKTLAVMSRCS 877
>gi|449464526|ref|XP_004149980.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 854
Score = 862 bits (2227), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/833 (51%), Positives = 557/833 (66%), Gaps = 28/833 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ +VTYD +A++I+G+RRVL SGSIHYPRSTPE+W LI+K+KEGGL+V+ETYVFWN H
Sbjct: 25 VQCSVTYDRKAILINGQRRVLFSGSIHYPRSTPEMWEGLIQKAKEGGLDVVETYVFWNVH 84
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G Y FEGR+DLVRF+KT+Q+AGL+ +LRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 85 EPSPGNYNFEGRYDLVRFIKTIQKAGLYANLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 144
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M+ F KI+ LMK ENLF SQGGPIIL+Q+ENEYG +G G+ Y+ WA
Sbjct: 145 DNEPFKRAMQGFTEKIVGLMKSENLFESQGGPIILSQIENEYGVQSKLFGAAGQNYMTWA 204
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L T VPWVMC++EDAPDP+INTCNGFYCD F+PN P KP MWTE +SGWF FG
Sbjct: 205 AKMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDAFSPNRPYKPTMWTEAWSGWFNEFG 264
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ RPV+DLAFAVA F + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 265 GPIHQRPVQDLAFAVALFIQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 324
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IRQPK+GHL+ELH+A+K+CE+ L+S+DP LG+ +A++Y S +CAAFL+NYD+
Sbjct: 325 LIRQPKYGHLKELHRAVKMCEKALVSADPIVTSLGSSQQAYVYTSESGNCAAFLSNYDTD 384
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V FN Y LP WS+SILPDC+NVVFNTAKV Q + L S
Sbjct: 385 SAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKV----------GVQTSQLEMLPTNSP 434
Query: 421 AFSW--YEEKVGISGNRSFVRPD-LAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
W Y E V + + + L EQIN TKDTSDYLWY S+ + + G+
Sbjct: 435 MLLWESYNEDVSAEDDSTTMTASGLLEQINVTKDTSDYLWYITSVDIGSTESFLHGGELP 494
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L ++S GHA +F+N +L +G+ + F K+ G NT+ +LS+ VGL N G
Sbjct: 495 TLIVQSTGHAVHIFINGRLSGSAFGSRENRRFTYTGKVNFRAGRNTIALLSVAVGLPNVG 554
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS- 590
F+ G+ V L L GK DLS +W Y+VG++GE + L + +S W +GS
Sbjct: 555 GHFETWNTGILGPVALHGLDQGKLDLSWAKWTYKVGLKGEAMNLVSPNGISSVEWMEGSL 614
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
+ L W+K+ F APEG PLA+++ MGKGQ W+NG SIGRYW+AY +TG K
Sbjct: 615 AAQAPQPLTWHKSNFDAPEGDEPLAIDMRGMGKGQIWINGVSIGRYWTAY---ATGNCDK 671
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
C+Y G++ KCQ+ CGQP Q YH+PR W+ P +NLLV+ EELGG+P+ ISL+ ++
Sbjct: 672 CNYAGTFRPPKCQQGCGQPTQRWYHVPRAWLKPKDNLLVVFEELGGNPTSISLVKRSVTG 731
Query: 711 ICSFVSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFR 768
+C+ VSE P + + G P+V L C G+ I +I FAS+G P G CGS++
Sbjct: 732 VCADVSEYHPTLKNWHIESYGKSEDLHRPKVHLKCSAGYSITSIKFASFGTPLGTCGSYQ 791
Query: 769 PGACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G CH + I++K C+G+ C++ +S+ G CP +LK L+VE C+
Sbjct: 792 QGTCHAPMSYDILEKRCIGKQRCAVTISNTNFG--QDPCPNVLKRLSVEVVCA 842
>gi|449433177|ref|XP_004134374.1| PREDICTED: beta-galactosidase 9-like [Cucumis sativus]
Length = 890
Score = 862 bits (2226), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/856 (49%), Positives = 572/856 (66%), Gaps = 47/856 (5%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NV+YDHRAL+IDGKRR+L S +HYPR++PE+WP++I KSKEGG +VI++YVFWN HEP
Sbjct: 32 NVSYDHRALIIDGKRRMLISAGVHYPRASPEMWPDIIEKSKEGGADVIQSYVFWNGHEPT 91
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+GQY F+GR+DLV+F++ V +GL+LHLRIGPY CAEWN+GGFP+WL +PGI+FRT N
Sbjct: 92 KGQYNFDGRYDLVKFIRLVGSSGLYLHLRIGPYVCAEWNFGGFPLWLRDVPGIEFRTDNA 151
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFKEEM+RF+ KI+DL++ E LF QGGP+I+ QVENEYGN+E +YG G+ Y+KW +
Sbjct: 152 PFKEEMQRFVKKIVDLLRDEKLFCWQGGPVIMLQVENEYGNIESSYGKRGQEYIKWVGNM 211
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A+ L VPWVMCQQ+DAP IIN+CNG+YCDGF NSPSKPI WTEN++GWF S+G
Sbjct: 212 ALGLGAEVPWVMCQQKDAPSTIINSCNGYYCDGFKANSPSKPIFWTENWNGWFTSWGERS 271
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RPVEDLAF+VARFF+ G+FQNYYMYFGGTNFGRTAGGP TSYDYD+PIDEYG IR
Sbjct: 272 PHRPVEDLAFSVARFFQREGSFQNYYMYFGGTNFGRTAGGPFYITSYDYDSPIDEYGLIR 331
Query: 304 QPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYHKSSN-------------D 349
+PKWGHL++LH A+KLCE L+S+D P + KLG K EAH+YH S +
Sbjct: 332 EPKWGHLKDLHTALKLCEPALVSADSPQYIKLGPKQEAHVYHMKSQTDDLTLSKLGTLRN 391
Query: 350 CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN----NGDHP 405
C+AFLAN D V FNG Y LP WSVSILPDC+NVVFNTAKV +Q + P
Sbjct: 392 CSAFLANIDERKAVAVKFNGQTYNLPPWSVSILPDCQNVVFNTAKVAAQTSIKILELYAP 451
Query: 406 FA-------QQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWY 458
+ + NEL + ++++ +E +GI +++F + E +N TKD SDYLWY
Sbjct: 452 LSANVSLKLHATDQNELSIIANSWMTVKEPIGIWSDQNFTVKGILEHLNVTKDRSDYLWY 511
Query: 459 TASIHVMPGQGK-------EVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIE 511
IHV + + I+S+ VFVN KL G + F+ + ++
Sbjct: 512 MTRIHVSNDDIRFWKERNITPTITIDSVRDVFRVFVNGKLTGSAIGQ--WVKFV--QPVQ 567
Query: 512 LNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVI-LIDLKNGKRDLSSGEWIYQVGVEG 570
EG N L +LS +GLQN GA+ + GAG+ I L KNG DLS W YQVG++G
Sbjct: 568 FLEGYNDLLLLSQAMGLQNSGAFIEKDGAGIRGRIKLTGFKNGDIDLSKSLWTYQVGLKG 627
Query: 571 EYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNG 630
E++ + + W + S + + WYK F +P+G P+A+NL SMGKGQAWVNG
Sbjct: 628 EFLNFYSLEENEKADWTELSVDAIPSTFTWYKAYFSSPDGTDPVAINLGSMGKGQAWVNG 687
Query: 631 QSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI 690
IGRYWS ++P GC +KCDYRG+Y++ KC +CG+P Q+ YHIPR+W+ NLLV+
Sbjct: 688 HHIGRYWSV-VSPKDGCPRKCDYRGAYNSGKCATNCGRPTQSWYHIPRSWLKESSNLLVL 746
Query: 691 HEELGGDPSKISLLTKTGQHICSFVSEADPPPV----DSWKPNLGVVS--SSPQVRLACE 744
EE GG+P +I + + IC VSE+ P + + + + +S ++P++ L C+
Sbjct: 747 FEETGGNPLEIVVKLYSTGVICGQVSESHYPSLRKLSNDYISDGETLSNRANPEMFLHCD 806
Query: 745 RGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSA 803
G I+++ FASYG P+G+C F G CH + L +V +AC+G+ C++ +S++ G
Sbjct: 807 DGHVISSVEFASYGTPQGSCNKFSRGPCHATNSLSVVSQACLGKNSCTVEISNSAFG--G 864
Query: 804 GACPGLLKALAVEAHC 819
C ++K LAVEA C
Sbjct: 865 DPCHSIVKTLAVEARC 880
>gi|414864995|tpg|DAA43552.1| TPA: hypothetical protein ZEAMMB73_935084 [Zea mays]
Length = 845
Score = 861 bits (2225), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 435/828 (52%), Positives = 550/828 (66%), Gaps = 28/828 (3%)
Query: 6 TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
TYD +A++IDG+RR+L SGSIHYPRSTP++W LI+K+K+GGL+VI+TYVFWN HEP G
Sbjct: 30 TYDKKAVLIDGQRRILFSGSIHYPRSTPDMWEGLIQKAKDGGLDVIQTYVFWNGHEPTPG 89
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
YYFE R+DLVRFVKTVQ+AGLF+HLRIGPY C EWN+GGFPVWL ++PGI FRT N PF
Sbjct: 90 NYYFEERYDLVRFVKTVQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEPF 149
Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
K M+ F KI+ +MK ENLFASQGGPIIL+Q+ENEYG +G G+ Y+ WAA AV
Sbjct: 150 KTAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGQAYINWAAKMAV 209
Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
L+T VPWVMC++EDAPDP+IN CNGFYCD F+PN P KP MWTE +SGWF FG +
Sbjct: 210 GLDTGVPWVMCKEEDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQ 269
Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQP 305
RPVEDLAFAVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG IR+P
Sbjct: 270 RPVEDLAFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIREP 329
Query: 306 KWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANV 365
K HL+ELH+A+KLCE+ L+S DPT LG EAH++ +S + CAAFLANY+S+S A V
Sbjct: 330 KHSHLKELHRAVKLCEQALVSVDPTITTLGTMQEAHVF-RSPSGCAAFLANYNSNSHAKV 388
Query: 366 TFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWY 425
FN Y LP WS+SILPDCKNVVFN+A V Q Q + S + Y
Sbjct: 389 VFNNEQYSLPPWSISILPDCKNVVFNSATVGVQ--------TSQMQMWGDGATSMMWERY 440
Query: 426 EEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMP------GQGKEVFLNIES 478
+E+V ++ L EQ+N T+D+SDYLWY S+ + P G GK L+++S
Sbjct: 441 DEEVDSLAAAPLLTTTGLLEQLNVTRDSSDYLWYITSVDISPSENFLQGGGKPPSLSVQS 500
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
GHA VFVN +L YG + N + L G N + +LS+ GL N G ++
Sbjct: 501 AGHALHVFVNGQLQGSSYGTREDRRIKYNGNVNLRAGTNKIALLSVACGLPNVGVHYETW 560
Query: 539 GAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
G+ V+L L G RDL+ W YQVG++GE + L+ + + S W QGS + +
Sbjct: 561 NTGVGGPVVLHGLNEGSRDLTWQTWSYQVGLKGEQMNLNSVEGSGSVEWMQGSLIAQKQQ 620
Query: 598 -LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
L WYK F P G PLAL++ SMGKGQ W+NGQSIGRYW+AY + G K C Y G+
Sbjct: 621 PLAWYKAYFETPSGDEPLALDMGSMGKGQVWINGQSIGRYWTAY---ADGDCKGCSYTGT 677
Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEEL-GGDPSKISLLTKTGQHICSFV 715
+ A KCQ CGQP Q YH+PR+W+ P NLLV+ EEL GGD SKI+L ++ +C+ V
Sbjct: 678 FRAPKCQAGCGQPTQRWYHVPRSWLQPSRNLLVVLEELGGGDSSKIALAKRSVSSVCADV 737
Query: 716 SEADPPPVDSWK-PNLGVVS-SSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
SE D P + W+ + G +V L C G I+AI FAS+G P G CG+F+ G CH
Sbjct: 738 SE-DHPNIKKWQIESYGEREHRRAKVHLRCAHGQSISAIRFASFGTPVGTCGNFQQGGCH 796
Query: 774 -MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+++K C+G C + +S G CP + K +AVEA CS
Sbjct: 797 SASSHAVLEKRCIGLQRCVVAISPDNFG--GDPCPSVTKRVAVEAVCS 842
>gi|20514290|gb|AAM22973.1|AF499737_1 beta-galactosidase [Oryza sativa Japonica Group]
gi|21070357|gb|AAM34271.1|AF508799_1 beta-galactosidase [Oryza sativa Japonica Group]
Length = 843
Score = 860 bits (2223), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/835 (51%), Positives = 557/835 (66%), Gaps = 40/835 (4%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD +A+++DG+RR+L SGSIHYPRSTPE+W LI K+K+GGL+VI+TYVFWN HEP
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G Y FEGR+DLVRF+KTVQ+AG+F+HLRIGPY C EWN+GGFPVWL ++PGI FRT N P
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK M+ F KI+ +MK ENLFASQGGPIIL+Q+ENEYG +G G+ Y+ WAA A
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGKAYINWAAKMA 206
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
V L+T VPWVMC+++DAPDP+IN CNGFYCD F+PN P KP MWTE +SGWF FG +
Sbjct: 207 VGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSGWFTEFGGTIR 266
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
RPVEDLAF VARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG R+
Sbjct: 267 QRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 326
Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
PK+GHL+ELH+A+KLCE+ L+S+DPT LG+ EAH++ +SS+ CAAFLANY+S+S A
Sbjct: 327 PKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVF-RSSSGCAAFLANYNSNSYAK 385
Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
V FN Y LP WS+SILPDCKNVVFNTA V Q N + +S+ W
Sbjct: 386 VIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTN----------QMQMWADGASSMMW 435
Query: 425 --YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
Y+E+V ++ L EQ+N T+DTSDYLWY + V P + G + L +
Sbjct: 436 EKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITRVEVDPSEKFLQGGTPLSLTV 495
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
+S GHA VF+N +L YG + + L G N + +LS+ GL N G ++
Sbjct: 496 QSAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVACGLPNVGVHYE 555
Query: 537 VAGAGLFSVILID-LKNGKRDLSSGEWIY--QVGVEGEYIGLDKISLANSSFWKQGSTLP 593
G+ ++I L G RDL+ W Y QVG++GE + L+ + + S W QGS +
Sbjct: 556 TWNTGVVGPVVIHGLDEGSRDLTWQTWSYQFQVGLKGEQMNLNSLEGSGSVEWMQGSLVA 615
Query: 594 VNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
N+ L WY+ F P G PLAL++ SMGKGQ W+NGQSIGRYW+AY + G K C
Sbjct: 616 QNQQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAY---AEGDCKGCH 672
Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHIC 712
Y GSY A KCQ CGQP Q YH+PR+W+ P NLLV+ EELGGD SKI+L +T +C
Sbjct: 673 YTGSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALAKRTVSGVC 732
Query: 713 SFVSEADPPPVDSWK------PNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGS 766
+ VSE P + +W+ P + +V L C G I+AI FAS+G P G CG+
Sbjct: 733 ADVSEYH-PNIKNWQIESYGEPEF----HTAKVHLKCAPGQTISAIKFASFGTPLGTCGT 787
Query: 767 FRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
F+ G CH ++ +++K C+G C + +S + G CP ++K +AVEA CS
Sbjct: 788 FQQGECHSINSNSVLEKKCIGLQRCVVAISPSNFG--GDPCPEVMKRVAVEAVCS 840
>gi|449457508|ref|XP_004146490.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
gi|449500002|ref|XP_004160975.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 846
Score = 860 bits (2223), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/831 (51%), Positives = 558/831 (67%), Gaps = 30/831 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ + VTYD +A++I+G+RR+L SGSIHYPRSTPE+W +LI K+K GGL+V+ETYVFWN H
Sbjct: 23 IHSTVTYDRKAILINGQRRILFSGSIHYPRSTPEMWEDLILKAKNGGLDVVETYVFWNVH 82
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G Y FEGRFDLVRF+KT+Q+AGL+ +LRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 83 EPYPGIYNFEGRFDLVRFIKTIQKAGLYANLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 142
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N FK M+ F KI+ LMK ENLF SQGGPIILAQ+ENEYG +G G Y+ WA
Sbjct: 143 DNEAFKNAMQGFTEKIVALMKSENLFESQGGPIILAQIENEYGTESKLFGEAGYNYMTWA 202
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A+ AV L T VPWVMC++ DAPDP+INTCNGFYCD F+PN P KP MWTE ++GWF FG
Sbjct: 203 ANMAVGLQTGVPWVMCKEADAPDPVINTCNGFYCDTFSPNKPYKPTMWTEAWTGWFSEFG 262
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ RPV+DLAFAVARF + GG+ NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 263 GPLHQRPVQDLAFAVARFIQRGGSLVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 322
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+RQPK+GHL+ELH+AIK+CE L+S+DP LG +AH+Y S CAAFL+NYD+
Sbjct: 323 LLRQPKYGHLKELHRAIKMCEPALVSADPIVTSLGDYQQAHVYSSESGGCAAFLSNYDTK 382
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V FN Y LP WS+SILPDCKN VFNTAKV Q + L S+
Sbjct: 383 SFARVLFNNRHYNLPPWSISILPDCKNAVFNTAKV----------GVQTAQMGMLPAEST 432
Query: 421 AFSW--YEEKVGISGNRSFV-RPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
SW Y E + +RS + P L EQIN T+DTSDYLWY S+ + + G+
Sbjct: 433 TLSWESYFEDISALDDRSMMTSPGLLEQINVTRDTSDYLWYITSVDISSSEPFLHGGELP 492
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L ++S GHA VF+N +L G+ F + K+ L+ G N + +LS+ VGL N G
Sbjct: 493 TLLVQSTGHAVHVFINGQLSGSVSGSRKSRRFTYSGKVNLHAGTNKIGLLSVAVGLPNVG 552
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS- 590
F+ G+ V+L L+ GK DLSS +W Y+VG++GE + L S + W Q S
Sbjct: 553 GHFETWNTGILGPVVLYGLRQGKWDLSSQKWTYKVGLKGEAMNLISPSGFSPVEWMQASL 612
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
+ L W+K F APEG+ PLAL++ MGKGQ W+NGQSIGRYW+AY + G +
Sbjct: 613 AAQTPQPLTWHKAYFDAPEGEEPLALDMEGMGKGQIWINGQSIGRYWTAY---ARGNCSR 669
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
C+Y ++ KCQ CGQP Q YH+PR+W+ P +NLLV+ EE+GG+PS+IS++ +
Sbjct: 670 CNYATAFRPPKCQLGCGQPTQRWYHVPRSWLRPEQNLLVVFEEVGGNPSRISIVKRLVTS 729
Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
+C+ VSE P +W + +P+V L+C+ G +I++I FAS+G P G CGS++ G
Sbjct: 730 VCADVSEFH-PTFKNWHITAKFI--TPKVHLSCDPGQYISSIKFASFGTPLGTCGSYQQG 786
Query: 771 ACHMDVLP-IVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CH I++K CVG+ C++ VS++ CP ++K L+VEA C+
Sbjct: 787 TCHAPSSSGILEKKCVGKQRCAVTVSNSNF---EDPCPNMMKRLSVEAVCN 834
>gi|225433463|ref|XP_002263385.1| PREDICTED: beta-galactosidase 9-like [Vitis vinifera]
Length = 882
Score = 860 bits (2223), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 442/853 (51%), Positives = 570/853 (66%), Gaps = 44/853 (5%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NV+YDHRAL+IDGKRR+L S IHYPR+TPE+WP+LI KSKEGG +VI+TYVFWN HEP+
Sbjct: 28 NVSYDHRALLIDGKRRMLVSAGIHYPRATPEMWPDLIAKSKEGGADVIQTYVFWNGHEPV 87
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
R QY FEGR+D+V+FVK V +GL+LHLRIGPY CAEWN+GGFPVWL IPGI+FRT N
Sbjct: 88 RRQYNFEGRYDIVKFVKLVGSSGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTDNA 147
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK+EM+RF+ KI+DLM++E LF+ QGGPII+ Q+ENEYGNVE ++G G+ YVKWAA
Sbjct: 148 PFKDEMQRFVKKIVDLMQKEMLFSWQGGPIIMLQIENEYGNVESSFGQRGKDYVKWAARM 207
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A+ L+ VPWVMCQQ DAPD IIN CNGFYCD F PNS +KP +WTE+++GWF S+G
Sbjct: 208 ALELDAGVPWVMCQQADAPDIIINACNGFYCDAFWPNSANKPKLWTEDWNGWFASWGGRT 267
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RPVED+AFAVARFF+ GG+F NYYMYFGGTNFGR++GGP TSYDYDAPIDEYG +
Sbjct: 268 PKRPVEDIAFAVARFFQRGGSFHNYYMYFGGTNFGRSSGGPFYVTSYDYDAPIDEYGLLS 327
Query: 304 QPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYH--------KSSN--DCAA 352
QPKWGHL+ELH AIKLCE L++ D P + KLG EAH+Y +S N C+A
Sbjct: 328 QPKWGHLKELHAAIKLCEPALVAVDSPQYIKLGPMQEAHVYRVKESLYSTQSGNGSSCSA 387
Query: 353 FLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN----NGDHPFAQ 408
FLAN D A+VTF G +Y LP WSVSILPDC+ VFNTAKV +Q + D P +
Sbjct: 388 FLANIDEHKTASVTFLGQIYKLPPWSVSILPDCRTTVFNTAKVGAQTSIKTVEFDLPLVR 447
Query: 409 QKNVNELLLASSAFSW-------YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTAS 461
+V + L+ + S+ +E + + +F + E +N TKD SDYLW
Sbjct: 448 NISVTQPLMVQNKISYVPKTWMTLKEPISVWSENNFTIQGVLEHLNVTKDHSDYLWRITR 507
Query: 462 IHVMPG-----QGKEV--FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNE 514
I+V + +V L+I+S+ +FVN +L+ G+ + + I+L +
Sbjct: 508 INVSAEDISFWEENQVSPTLSIDSMRDILHIFVNGQLIGSVIGHW----VKVVQPIQLLQ 563
Query: 515 GINTLDILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYI 573
G N L +LS VGLQNYGA+ + GAG V L KNG+ DLS W YQVG+ GE+
Sbjct: 564 GYNDLVLLSQTVGLQNYGAFLEKDGAGFKGQVKLTGFKNGEIDLSEYSWTYQVGLRGEFQ 623
Query: 574 GLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSI 633
+ I + + W + + WYKT F AP G+ P+AL+L SMGKGQAWVNG I
Sbjct: 624 KIYMIDESEKAEWTDLTPDASPSTFTWYKTFFDAPNGENPVALDLGSMGKGQAWVNGHHI 683
Query: 634 GRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEE 693
GRYW+ +AP GC KCDYRG Y SKC +CG P Q YHIPR+W+ NLLV+ EE
Sbjct: 684 GRYWTR-VAPKDGC-GKCDYRGHYHTSKCATNCGNPTQIWYHIPRSWLQASNNLLVLFEE 741
Query: 694 LGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSS-----PQVRLACERGWH 748
GG P +IS+ +++ Q IC+ VSE+ P + +W P+ + +S P++ L C+ G
Sbjct: 742 TGGKPFEISVKSRSTQTICAEVSESHYPSLQNWSPSDFIDQNSKNKMTPEMHLQCDDGHT 801
Query: 749 IAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACP 807
I++I FASYG P+G+C F G CH + L +V KAC G+ C I + ++ G C
Sbjct: 802 ISSIEFASYGTPQGSCQMFSQGQCHAPNSLALVSKACQGKGSCVIRILNSAFG--GDPCR 859
Query: 808 GLLKALAVEAHCS 820
G++K LAVEA C+
Sbjct: 860 GIVKTLAVEAKCA 872
>gi|449459196|ref|XP_004147332.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
gi|449497145|ref|XP_004160325.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 844
Score = 860 bits (2222), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/829 (51%), Positives = 538/829 (64%), Gaps = 12/829 (1%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
L+ANVTYD R+L+IDG R++L S SIHYPRS P +WP LI+ +KEGG++VIETYVFWN H
Sbjct: 18 LAANVTYDRRSLIIDGHRKLLISASIHYPRSVPAMWPSLIQNAKEGGVDVIETYVFWNGH 77
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
E Y+F+GRFDLV+F+ V AGL+L LRIGP+ AEWN+GG PVWLH+IP FRT
Sbjct: 78 ELSPDNYHFDGRFDLVKFINIVHNAGLYLILRIGPFVAAEWNFGGVPVWLHYIPNTVFRT 137
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N FK M++F I+ LMK+E LFASQGGPIIL+QVENEYG++E YG GG+ Y WA
Sbjct: 138 DNASFKFYMQKFTTYIVSLMKKEKLFASQGGPIILSQVENEYGDIERVYGEGGKPYAMWA 197
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV+ N VPW+MCQQ DAPDP+INTCN FYCD FTPNSP+KP MWTEN+ GWF +FG
Sbjct: 198 AQMAVSQNIGVPWIMCQQYDAPDPVINTCNSFYCDQFTPNSPNKPKMWTENWPGWFKTFG 257
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
P RP ED+AF+VARFF+ GG+ QNYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 258 ARDPHRPPEDIAFSVARFFQKGGSLQNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 317
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
R PKWGHL+ELH+AIKL E L++S+PT+ LG LEA +Y SS CAAF+AN D
Sbjct: 318 LPRLPKWGHLKELHRAIKLTERVLLNSEPTYVSLGPSLEADVYTDSSGACAAFIANIDEK 377
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDH-PFAQQKNVNELLLAS 419
D V F Y LPAWSVSILPDCKNVVFNTA + SQ + P Q + +
Sbjct: 378 DDKTVQFRNISYHLPAWSVSILPDCKNVVFNTAMIRSQTAMVEMVPEELQPSADATNKDL 437
Query: 420 SAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ----GKEVF 473
A W + E+ GI G FV+ L + +NTTKDT+DYLWYT SI V + G +
Sbjct: 438 KALKWEVFVEQPGIWGKADFVKNVLVDHLNTTKDTTDYLWYTTSIFVNENEKFLKGSQPV 497
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L +ES GHA F+NKKL GN F + I L G N + +LSM VGLQN G
Sbjct: 498 LVVESKGHALHAFINKKLQVSATGNGSDITFKFKQAISLKAGKNEIALLSMTVGLQNAGP 557
Query: 534 WFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
+++ GAGL V++ NG DLSS W Y++G++GE++G+ K + W P
Sbjct: 558 FYEWVGAGLSKVVIEGFNNGPVDLSSYAWSYKIGLQGEHLGIYKPDGIKNVKWLSSREPP 617
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ L WYK P G P+ L++ MGKG AW+NG+ IGRYW + C +KCDY
Sbjct: 618 KQQPLTWYKVILDPPSGNEPVGLDMVHMGKGLAWLNGEEIGRYWPTKSSIHDVCVQKCDY 677
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
RG + KC CG+P Q YH+PR+W P N+LVI EE GGDP++I L + IC+
Sbjct: 678 RGKFRPDKCLTGCGEPTQRWYHVPRSWFKPSGNILVIFEEKGGDPTQIRLSKRKVLGICA 737
Query: 714 FVSEADPPPVDSWKPNLGV-VSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
+ E P ++SW V S V L C IA I FAS+G P+G+CGS+ G C
Sbjct: 738 HLGEGH-PSIESWSEAENVERKSKATVDLKCPDNGRIAKIKFASFGTPQGSCGSYSIGDC 796
Query: 773 H-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
H + + +V+K C+ + EC I + G + G CP K LAVEA CS
Sbjct: 797 HDPNSISLVEKVCLNRNECRIELGEE--GFNKGLCPTASKKLAVEAMCS 843
>gi|357483611|ref|XP_003612092.1| Beta-galactosidase [Medicago truncatula]
gi|355513427|gb|AES95050.1| Beta-galactosidase [Medicago truncatula]
Length = 843
Score = 860 bits (2221), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 436/832 (52%), Positives = 551/832 (66%), Gaps = 31/832 (3%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
++VTYD +A++I+G+RR+L SGSIHYPRSTP++W +LI K+KEGGL+VIETYVFWN HEP
Sbjct: 24 SDVTYDRKAIIINGQRRILFSGSIHYPRSTPDMWEDLIYKAKEGGLDVIETYVFWNVHEP 83
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G Y FEGR DLVRF++TV +AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FR N
Sbjct: 84 SPGNYNFEGRNDLVRFIQTVHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRQDN 143
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK+ M+ F KI+ +MK E L+ SQGGPIIL+Q+ENEYG G G Y+ WAA
Sbjct: 144 EPFKKAMQGFTEKIVGMMKSERLYESQGGPIILSQIENEYGAQSKMLGPVGYNYMSWAAK 203
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
AV + T VPW+MC+++DAPDP+INTCNGFYCD FTPN P KP MWTE +SGWF FG
Sbjct: 204 MAVEMGTGVPWIMCKEDDAPDPVINTCNGFYCDKFTPNKPYKPTMWTEAWSGWFSEFGGP 263
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
+ RPV+DLAFAVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG I
Sbjct: 264 IHKRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLI 323
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPK+GHL+ELHKAIK+CE+ LIS+DP LG +A++Y S DC+AFL+NYDS S
Sbjct: 324 RQPKYGHLKELHKAIKMCEKALISTDPVVTSLGNFQQAYVYTTESGDCSAFLSNYDSKSS 383
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A V FN Y LP WSVSILPDC+N VFNTAKV Q + L S F
Sbjct: 384 ARVMFNNMHYNLPPWSVSILPDCRNAVFNTAKV----------GVQTSQMQMLPTNSERF 433
Query: 423 SW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
SW +EE S + L EQIN T+DTSDYLWY S+ V + GK L
Sbjct: 434 SWESFEEDTSSSSATTITASGLLEQINVTRDTSDYLWYITSVDVGSSESFLHGGKLPSLI 493
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
++S GHA VF+N +L YG + F + L G NT+ +LS+ VGL N G F
Sbjct: 494 VQSTGHAVHVFINGRLSGSAYGTREDRRFRYTGDVNLRAGTNTIALLSVAVGLPNVGGHF 553
Query: 536 DVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS-TLP 593
+ G+ ++I L GK DLS +W YQVG++GE + L +S W Q + +
Sbjct: 554 ETWNTGILGPVVIHGLDKGKLDLSWQKWTYQVGLKGEAMNLASPDGISSVEWMQSAVVVQ 613
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
N+ L W+KT F APEG+ PLAL++ MGKGQ W+NG SIGRYW+A +TG C+Y
Sbjct: 614 RNQPLTWHKTFFDAPEGEEPLALDMDGMGKGQIWINGISIGRYWTAI---ATGSCNDCNY 670
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
GS+ KCQ CGQP Q YH+PR+W+ NLLV+ EELGGDPSKISL ++ +C+
Sbjct: 671 AGSFRPPKCQLGCGQPTQRWYHVPRSWLKQNHNLLVVFEELGGDPSKISLAKRSVSSVCA 730
Query: 714 FVSEADPP----PVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
VSE P +DS+ + P+V L C G I++I FAS+G P G CGS+
Sbjct: 731 DVSEYHPNLKNWHIDSYGKSENF--RPPKVHLHCNPGQAISSIKFASFGTPLGTCGSYEQ 788
Query: 770 GACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
GACH I+++ C+G+ C + VS++ G CP +LK L+VEA C+
Sbjct: 789 GACHSSSSYDILEQKCIGKPRCIVTVSNSNFG--RDPCPNVLKRLSVEAVCA 838
>gi|222624250|gb|EEE58382.1| hypothetical protein OsJ_09539 [Oryza sativa Japonica Group]
Length = 851
Score = 858 bits (2218), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/843 (51%), Positives = 558/843 (66%), Gaps = 48/843 (5%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD +A+++DG+RR+L SGSIHYPRSTPE+W LI K+K+GGL+VI+TYVFWN HEP
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G Y FEGR+DLVRF+KTVQ+AG+F+HLRIGPY C EWN+GGFPVWL ++PGI FRT N P
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQ----------VENEYGNVEWAYGVGGE 174
FK M+ F KI+ +MK ENLFASQGGPIIL+Q +ENEYG +G G+
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQASAKLCFPCHIENEYGPEGKEFGAAGK 206
Query: 175 LYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSG 234
Y+ WAA AV L+T VPWVMC+++DAPDP+IN CNGFYCD F+PN P KP MWTE +SG
Sbjct: 207 AYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSG 266
Query: 235 WFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDA 294
WF FG + RPVEDLAF VARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDA
Sbjct: 267 WFTEFGGTIRQRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDA 326
Query: 295 PIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFL 354
P+DEYG R+PK+GHL+ELH+A+KLCE+ L+S+DPT LG+ EAH++ +SS+ CAAFL
Sbjct: 327 PLDEYGLAREPKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVF-RSSSGCAAFL 385
Query: 355 ANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNE 414
ANY+S+S A V FN Y LP WS+SILPDCKNVVFNTA V Q N +
Sbjct: 386 ANYNSNSYAKVIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTN----------QMQM 435
Query: 415 LLLASSAFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ--- 468
+S+ W Y+E+V ++ L EQ+N T+DTSDYLWY S+ V P +
Sbjct: 436 WADGASSMMWEKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPSEKFL 495
Query: 469 --GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMV 526
G + L ++S GHA VF+N +L YG + + L G N + +LS+
Sbjct: 496 QGGTPLSLTVQSAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVAC 555
Query: 527 GLQNYGAWFDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSF 585
GL N G ++ G+ ++I L G RDL+ W YQVG++GE + L+ + + S
Sbjct: 556 GLPNVGVHYETWNTGVVGPVVIHGLDEGSRDLTWQTWSYQVGLKGEQMNLNSLEGSGSVE 615
Query: 586 WKQGSTLPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPS 644
W QGS + N+ L WY+ F P G PLAL++ SMGKGQ W+NGQSIGRYW+AY +
Sbjct: 616 WMQGSLVAQNQQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAY---A 672
Query: 645 TGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLL 704
G K C Y GSY A KCQ CGQP Q YH+PR+W+ P NLLV+ EELGGD SKI+L
Sbjct: 673 EGDCKGCHYTGSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALA 732
Query: 705 TKTGQHICSFVSEADPPPVDSWK------PNLGVVSSSPQVRLACERGWHIAAINFASYG 758
+T +C+ VSE P + +W+ P + +V L C G I+AI FAS+G
Sbjct: 733 KRTVSGVCADVSEYH-PNIKNWQIESYGEPEF----HTAKVHLKCAPGQTISAIKFASFG 787
Query: 759 IPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEA 817
P G CG+F+ G CH ++ +++K C+G C + +S + G CP ++K +AVEA
Sbjct: 788 TPLGTCGTFQQGECHSINSNSVLEKKCIGLQRCVVAISPSNFG--GDPCPEVMKRVAVEA 845
Query: 818 HCS 820
CS
Sbjct: 846 VCS 848
>gi|356540789|ref|XP_003538867.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 853
Score = 858 bits (2217), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/836 (51%), Positives = 552/836 (66%), Gaps = 32/836 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ +VTYD +A++I+G+RR+L SGSIHYPRSTP++W +LI K+KEGGL+VIETY+FWN H
Sbjct: 28 VHCSVTYDRKAILINGQRRILFSGSIHYPRSTPDMWEDLIYKAKEGGLDVIETYIFWNVH 87
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP RG Y FEGR+DLVRFVKT+Q+AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 88 EPSRGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 147
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK+ M+ F KI+ +MK E L+ SQGGPIIL+Q+ENEYG G G+ YV WA
Sbjct: 148 DNEPFKKAMQGFTEKIVGMMKSERLYESQGGPIILSQIENEYGAQSKLLGPAGQNYVNWA 207
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV T VPWVMC+++DAPDP+INTCNGFYCD FTPN P KP +WTE +SGWF FG
Sbjct: 208 AKMAVETGTGVPWVMCKEDDAPDPVINTCNGFYCDYFTPNKPYKPSIWTEAWSGWFSEFG 267
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
RPV+DLAF VARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG
Sbjct: 268 GPNHERPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYG 327
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IRQPK+GHL+ELHKAIK+CE L+S+DP +G +AH+Y S DCAAFL+N+D+
Sbjct: 328 LIRQPKYGHLKELHKAIKMCERALVSADPAVTSMGNFQQAHVYTTKSGDCAAFLSNFDTK 387
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S V FN Y LP WS+SILPDC+NVVFNTAKV Q + L +
Sbjct: 388 SSVRVMFNNMHYNLPPWSISILPDCRNVVFNTAKV----------GVQTSQMQMLPTNTH 437
Query: 421 AFSW--YEEKVGISGNRSFV---RPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GK 470
FSW ++E + + S + L EQIN T+DTSDYLWY S+ + + GK
Sbjct: 438 MFSWESFDEDISSLDDGSAITITTSGLLEQINVTRDTSDYLWYITSVDIGSSESFLRGGK 497
Query: 471 EVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQN 530
L ++S GHA VF+N +L YG + F + L G N + +LS+ VGL N
Sbjct: 498 LPTLIVQSTGHAVHVFINGQLSGSAYGTREDRRFRYTGTVNLRAGTNRIALLSVAVGLPN 557
Query: 531 YGAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG 589
G F+ G+ V+L L GK DLS +W YQVG++GE + L + +S W Q
Sbjct: 558 VGGHFETWNTGILGPVVLRGLNQGKLDLSWQKWTYQVGLKGEAMNLASPNGISSVEWMQS 617
Query: 590 STLP-VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
+ + N+ L W+KT F AP+G PLAL++ MGKGQ W+NG SIGRYW+ AP+ G
Sbjct: 618 ALVSEKNQPLTWHKTYFDAPDGDEPLALDMEGMGKGQIWINGLSIGRYWT---APAAGIC 674
Query: 649 KKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTG 708
C Y G++ KCQ CGQP Q YH+PR+W+ P NLLV+ EELGGDPSKISL+ ++
Sbjct: 675 NGCSYAGTFRPPKCQVGCGQPTQRWYHVPRSWLKPNHNLLVVFEELGGDPSKISLVKRSV 734
Query: 709 QHICSFVSEADPPPVDSWKPNLGVVSSS---PQVRLACERGWHIAAINFASYGIPEGNCG 765
IC+ VSE P + +W + S P+V L C I++I FAS+G P G CG
Sbjct: 735 SSICADVSEYH-PNIRNWHIDSYGKSEEFHPPKVHLHCSPSQAISSIKFASFGTPLGTCG 793
Query: 766 SFRPGACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
++ G CH ++K C+G+ C++ VS++ G CP +LK L+VEA CS
Sbjct: 794 NYEKGVCHSPTSYATLEKKCIGKPRCTVTVSNSNFG--QDPCPNVLKRLSVEAVCS 847
>gi|115437888|ref|NP_001043405.1| Os01g0580200 [Oryza sativa Japonica Group]
gi|75272679|sp|Q8W0A1.1|BGAL2_ORYSJ RecName: Full=Beta-galactosidase 2; Short=Lactase 2; Flags:
Precursor
gi|18461259|dbj|BAB84455.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113532936|dbj|BAF05319.1| Os01g0580200 [Oryza sativa Japonica Group]
gi|215736924|dbj|BAG95853.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 827
Score = 858 bits (2217), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 424/824 (51%), Positives = 549/824 (66%), Gaps = 34/824 (4%)
Query: 6 TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
TYD +A+V++G+RR+L SGSIHYPRSTPE+WP+LI K+K+GGL+V++TYVFWN HEP G
Sbjct: 27 TYDRKAVVVNGQRRILISGSIHYPRSTPEMWPDLIEKAKDGGLDVVQTYVFWNGHEPSPG 86
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
QYYFEGR+DLV F+K V++AGL+++LRIGPY CAEWN+GGFPVWL ++PGI FRT N PF
Sbjct: 87 QYYFEGRYDLVHFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPF 146
Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
K EM++F KI+++MK E LF QGGPIIL+Q+ENE+G +EW G + Y WAA+ AV
Sbjct: 147 KAEMQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 206
Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
LNTSVPW+MC+++DAPDPIINTCNGFYCD F+PN P KP MWTE ++ W+ FG VP
Sbjct: 207 ALNTSVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTAWYTGFGIPVPH 266
Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQP 305
RPVEDLA+ VA+F + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG +R+P
Sbjct: 267 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLREP 326
Query: 306 KWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANV 365
KWGHL++LHKAIKLCE L++ DP LG ++ ++ S+ CAAFL N D S A V
Sbjct: 327 KWGHLKQLHKAIKLCEPALVAGDPIVTSLGNAQKSSVFRSSTGACAAFLENKDKVSYARV 386
Query: 366 TFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW- 424
FNG Y LP WS+SILPDCK VFNTA+V SQ +Q K + + F+W
Sbjct: 387 AFNGMHYDLPPWSISILPDCKTTVFNTARVGSQ-------ISQMK-----MEWAGGFAWQ 434
Query: 425 -YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIES 478
Y E++ G L EQIN T+D +DYLWYT + V + G+ + L + S
Sbjct: 435 SYNEEINSFGEDPLTTVGLLEQINVTRDNTDYLWYTTYVDVAQDEQFLSNGENLKLTVMS 494
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
GHA +F+N +L YG+ D ++L G NT+ LS+ VGL N G F+
Sbjct: 495 AGHALHIFINGQLKGTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIAVGLPNVGEHFETW 554
Query: 539 GAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
AG+ + +D L G+RDL+ +W YQVG++GE + L +S +++ W + PV K
Sbjct: 555 NAGILGPVTLDGLNEGRRDLTWQKWTYQVGLKGESMSLHSLSGSSTVEWGE----PVQKQ 610
Query: 598 -LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
L WYK F AP+G PLAL+++SMGKGQ W+NGQ IGRYW Y A +G CDYRG
Sbjct: 611 PLTWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKA--SGNCGTCDYRGE 668
Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
YD +KCQ +CG +Q YH+PR+W+ P NLLVI EE GGDP+ IS++ ++ +C+ VS
Sbjct: 669 YDETKCQTNCGDSSQRWYHVPRSWLSPTGNLLVIFEEWGGDPTGISMVKRSIGSVCADVS 728
Query: 717 EADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-D 775
E P + +W +V L C+ G I I FAS+G P+G+CGS+ G CH
Sbjct: 729 EWQ-PSMKNWHTK---DYEKAKVHLQCDNGQKITEIKFASFGTPQGSCGSYTEGGCHAHK 784
Query: 776 VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
I K CVGQ C + V G CPG +K VEA C
Sbjct: 785 SYDIFWKNCVGQERCGVSVVPEIFG--GDPCPGTMKRAVVEAIC 826
>gi|84579373|dbj|BAE72075.1| pear beta-galactosidase3 [Pyrus communis]
Length = 894
Score = 858 bits (2216), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/856 (50%), Positives = 570/856 (66%), Gaps = 47/856 (5%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NV+YDHRAL+IDGKRR+L S IHYPR+TPE+WP+LI KSKEGG++VI+TY FW+ HEP+
Sbjct: 35 NVSYDHRALIIDGKRRMLVSAGIHYPRATPEMWPDLIAKSKEGGVDVIQTYAFWSGHEPV 94
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
RGQY FEGR+D+V+F V +GL+LHLRIGPY CAEWN+GGFPVWL IPGI+FRT N
Sbjct: 95 RGQYNFEGRYDIVKFANLVGASGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 154
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
FKEEM+RF+ K++DLM++E L + QGGPII+ Q+ENEYGN+E +G G+ Y+KWAA+
Sbjct: 155 LFKEEMQRFVKKMVDLMQEEELLSWQGGPIIMLQIENEYGNIEGQFGQKGKEYIKWAAEM 214
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A+ L VPWVMC+Q DAP II+ CNG+YCDG+ PNS +KP MWTE++ GW+ S+G +
Sbjct: 215 ALGLGAGVPWVMCKQVDAPGSIIDACNGYYCDGYKPNSYNKPTMWTEDWDGWYASWGGRL 274
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RPVEDLAFAVARF++ GG+FQNYYMYFGGTNFGRT+GGP TSYDYDAPIDEYG +
Sbjct: 275 PHRPVEDLAFAVARFYQRGGSFQNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 334
Query: 304 QPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYHKSSN-------------D 349
+PKWGHL++LH AIKLCE L+++D P + KLG K EAH+Y +S+
Sbjct: 335 EPKWGHLKDLHAAIKLCEPALVAADSPNYIKLGPKQEAHVYRMNSHTEGLNITSYGSQIS 394
Query: 350 CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN----NGDHP 405
C+AFLAN D A+VTF G Y LP WSVSILPDC+NVV+NTAKV +Q + D P
Sbjct: 395 CSAFLANIDEHKAASVTFLGQKYNLPPWSVSILPDCRNVVYNTAKVGAQTSIKTVEFDLP 454
Query: 406 F-----AQQKNV--NELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWY 458
+QQ+ + N+ L + ++ +E VG+ +F + E +N TKD SDYLW+
Sbjct: 455 LYSGISSQQQFITKNDDLFITKSWMTVKEPVGVWSENNFTVQGILEHLNVTKDQSDYLWH 514
Query: 459 TASIHVMPGQ-------GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIE 511
I V ++I+S+ VFVN +L G+ + + ++
Sbjct: 515 ITRIFVSEDDISFWEKNNISAAVSIDSMRDVLRVFVNGQLTGSVIGHW----VKVEQPVK 570
Query: 512 LNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVI-LIDLKNGKRDLSSGEWIYQVGVEG 570
+G N L +L+ VGLQNYGA+ + GAG I L KNG D S W YQVG++G
Sbjct: 571 FLKGYNDLVLLTQTVGLQNYGAFLEKDGAGFRGQIKLTGFKNGDIDFSKLLWTYQVGLKG 630
Query: 571 EYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNG 630
E++ + I + W + S + IWYKT F +P G P+AL+L SMGKGQAWVNG
Sbjct: 631 EFLKIYTIEENEKASWAELSPDDDPSTFIWYKTYFDSPAGTDPVALDLGSMGKGQAWVNG 690
Query: 631 QSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI 690
IGRYW+ +AP GC + CDYRG+YD+ KC +CG+P QTLYH+PR+W+ NLLVI
Sbjct: 691 HHIGRYWT-LVAPEDGCPEICDYRGAYDSDKCSFNCGKPTQTLYHVPRSWLQSSSNLLVI 749
Query: 691 HEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSW------KPNLGVVSSSPQVRLACE 744
EE GG+P IS+ ++ +C+ VSE+ PPV W + V +P++ L C+
Sbjct: 750 LEETGGNPFDISIKLRSAGVLCAQVSESHYPPVQKWFNPDSVDEKITVNDLTPEMHLQCQ 809
Query: 745 RGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSA 803
G+ I++I FASYG P+G+C F G CH + IV K+C+G+ CS+ +S+ G
Sbjct: 810 DGFTISSIEFASYGTPQGSCQKFSMGNCHATNSSSIVSKSCLGKNSCSVEISNISFG--G 867
Query: 804 GACPGLLKALAVEAHC 819
C G++K LAVEA C
Sbjct: 868 DPCRGVVKTLAVEARC 883
>gi|414881557|tpg|DAA58688.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 830
Score = 857 bits (2215), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/828 (51%), Positives = 552/828 (66%), Gaps = 42/828 (5%)
Query: 6 TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
TYD +A+V++G+RR+L SGSIHYPRS PE+WP+LI+K+K+GGL+V++TYVFWN HEP R
Sbjct: 30 TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
QYYFEGR+DLV F+K V++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT N PF
Sbjct: 90 QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPF 149
Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
K EM+ F KI+D+MK E LF QGGPIIL+Q+ENE+G +EW G + Y WAA+ AV
Sbjct: 150 KAEMQNFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 209
Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
LNTSVPWVMC+++DAPDPIINTCNGFYCD F+PN P KP MWTE ++ W+ FG VP
Sbjct: 210 ALNTSVPWVMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPH 269
Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQP 305
RPVEDLA+ VA+F + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG +R+P
Sbjct: 270 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLREP 329
Query: 306 KWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANV 365
KWGHL+ELHKAIKLCE L++ DP LG +A ++ S++ C AFL N D S A V
Sbjct: 330 KWGHLKELHKAIKLCEPALVAGDPIVTSLGNAQQASVFRSSTDACVAFLENKDKVSYARV 389
Query: 366 TFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW- 424
+FNG Y LP WS+SILPDCK V+NTA V SQ +Q K + + F+W
Sbjct: 390 SFNGMHYDLPPWSISILPDCKTTVYNTASVGSQ-------ISQMK-----MEWAGGFTWQ 437
Query: 425 -YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIES 478
Y E + G+ SF L EQIN T+D +DYLWYT + + + GK L + S
Sbjct: 438 SYNEDINSLGDESFATVGLLEQINVTRDNTDYLWYTTYVDIAQDEQFLSNGKNPMLTVMS 497
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
GHA +FVN +L YG+ + + ++L G NT+ LS+ VGL N G F+
Sbjct: 498 AGHALHIFVNGQLTGTVYGSVEDPKLTYSGNVKLWSGSNTISCLSIAVGLPNVGEHFETW 557
Query: 539 GAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
AG+ + +D L G+RDL+ +W Y+VG++GE + L +S ++S W + PV K
Sbjct: 558 NAGILGPVTLDGLNEGRRDLTWQKWTYKVGLKGEALSLHSLSGSSSVEWGE----PVQKQ 613
Query: 598 -LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
L WYK F AP+G PLAL+++SMGKGQ W+NGQ IGRYW Y A +G CDYRG
Sbjct: 614 PLSWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKA--SGTCGICDYRGE 671
Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
YD KCQ +CG +Q YH+PR+W++P NLLVI EE GGDP+ IS++ + IC+ VS
Sbjct: 672 YDEKKCQTNCGDSSQRWYHVPRSWLNPTGNLLVIFEEWGGDPTGISMVKRIAGSICADVS 731
Query: 717 EADPPPVDSWKPNLGVVSS----SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
E W+P++ + +V L C+ G + I FAS+G P+G+CGS+ G C
Sbjct: 732 E--------WQPSMANWRTKGYEKAKVHLQCDHGRKMTHIKFASFGTPQGSCGSYSEGGC 783
Query: 773 HM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
H I K+C+GQ C + V G CPG +K VEA C
Sbjct: 784 HAHKSYDIFWKSCIGQERCGVSVVPDAFG--GDPCPGTMKRAVVEAIC 829
>gi|357113908|ref|XP_003558743.1| PREDICTED: beta-galactosidase 5-like [Brachypodium distachyon]
Length = 839
Score = 857 bits (2215), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/828 (51%), Positives = 551/828 (66%), Gaps = 30/828 (3%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD +A++IDG+RR+L SGSIHYPRSTPE+W L +K+K+GGL+VI+TYVFWN HEP
Sbjct: 27 VTYDKKAVLIDGQRRILFSGSIHYPRSTPEMWEGLFQKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G Y FEGR+DLV+F+KT Q+AGLF+HLRIGPY C EWN+GGFPVWL ++PGI FRT N P
Sbjct: 87 GNYNFEGRYDLVKFIKTAQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK M+ F KI+ +MK E LFASQGGPIIL+Q+ENEYG ++G G+ Y WAA A
Sbjct: 147 FKTAMQGFTEKIVGMMKSEELFASQGGPIILSQIENEYGPEGKSFGAAGKSYSNWAAKMA 206
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
V L+T VPWVMC+Q+DAPDP+IN CNGFYCD F+PN P KP MWTE ++GWF FG +
Sbjct: 207 VGLDTGVPWVMCKQDDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWTGWFTEFGGTIR 266
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
RPVEDL+FAVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG R+
Sbjct: 267 KRPVEDLSFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 326
Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
PK+GHL+ELH+A+KLCE L+S DP LG+ EAH++ +S + CAAFLANY+S+S AN
Sbjct: 327 PKYGHLKELHRAVKLCEPALVSVDPAVTTLGSMQEAHVF-RSPSSCAAFLANYNSNSHAN 385
Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
V FN Y LP WS+SILPDCK VVFNTA V Q + S+ W
Sbjct: 386 VVFNNEHYSLPPWSISILPDCKTVVFNTATV----------GVQTSQMQMWADGESSMMW 435
Query: 425 --YEEKVG-ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
Y+E+VG ++ L EQ+N T+D+SDYLWY S+ V P + G+ + L +
Sbjct: 436 ERYDEEVGSLAAAPLLTTTGLLEQLNVTRDSSDYLWYITSVDVSPSEKFLQGGEPLSLTV 495
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
+S GHA +F+N +L G + F L G N + +LS+ GL N G ++
Sbjct: 496 QSAGHALHIFINGQLQGSASGTREAKKFSYKGNANLRAGTNKIALLSIACGLPNVGVHYE 555
Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
G+ V+L L G RDL+ W YQVG++GE + L+ + A+S W QGS L
Sbjct: 556 TWNTGIVGPVVLHGLDVGSRDLTWQTWSYQVGLKGEQMNLNSLEGASSVEWMQGSLL-AQ 614
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
L WY+ F P G PLAL++ SMGKGQ W+NGQSIGRY ++Y ++G K C Y G
Sbjct: 615 APLSWYRAYFDTPTGDEPLALDMGSMGKGQIWINGQSIGRYSTSY---ASGDCKACSYAG 671
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
SY A KCQ CGQP Q YH+P++W+ P NLLV+ EELGGD SKISL+ ++ +C+ V
Sbjct: 672 SYRAPKCQAGCGQPTQRWYHVPKSWLQPSRNLLVVFEELGGDSSKISLVKRSVSSVCADV 731
Query: 716 SEADPPPVDSWK-PNLGVVS-SSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
SE + +W+ N G V P+V L C G I+AI FAS+G P G CG+F+ G CH
Sbjct: 732 SEYH-TNIKNWQIENAGEVEFHRPKVHLRCAPGQTISAIKFASFGTPLGTCGNFQQGDCH 790
Query: 774 -MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+++K C+GQ C++ +S G CP +K +AVEA CS
Sbjct: 791 STKSHAVLEKNCIGQQRCAVTISPDNFG--GDPCPKEMKKVAVEAVCS 836
>gi|218192153|gb|EEC74580.1| hypothetical protein OsI_10152 [Oryza sativa Indica Group]
Length = 851
Score = 857 bits (2215), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/843 (51%), Positives = 558/843 (66%), Gaps = 48/843 (5%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD +A+++DG+RR+L SGSIHYPRSTPE+W LI K+K+GGL+VI+TYVFWN HEP
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G Y FEGR+DLVRF+KTVQ+AG+F+HLRIGPY C EWN+GGFPVWL ++PGI FRT N P
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQ----------VENEYGNVEWAYGVGGE 174
FK M+ F KI+ +MK ENLFASQGGPIIL+Q +ENEYG +G G+
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQASAKLCFPCHIENEYGPEGKEFGAAGK 206
Query: 175 LYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSG 234
Y+ WAA AV L+T VPWVMC+++DAPDP+IN CNGFYCD F+PN P KP MWTE +SG
Sbjct: 207 AYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSG 266
Query: 235 WFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDA 294
WF FG + RPVEDLAF VARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDA
Sbjct: 267 WFTEFGGTIRQRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDA 326
Query: 295 PIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFL 354
P+DEYG R+PK+GHL+ELH+A+KLCE+ L+S+DPT LG+ EAH++ +SS+ CAAFL
Sbjct: 327 PLDEYGLAREPKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVF-RSSSGCAAFL 385
Query: 355 ANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNE 414
ANY+S+S A V FN Y LP WS+SILPDCKNVVFNTA V Q N +
Sbjct: 386 ANYNSNSYAKVIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTN----------QMQM 435
Query: 415 LLLASSAFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ--- 468
+S+ W Y+E+V ++ L EQ+N T+DTSDYLWY S+ V P +
Sbjct: 436 WADGASSMMWEKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPSEKFL 495
Query: 469 --GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMV 526
G + L ++S GHA VF+N +L YG + + L G N + +LS+
Sbjct: 496 QGGTPLSLTVQSAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVAC 555
Query: 527 GLQNYGAWFDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSF 585
GL N G ++ G+ ++I L G RDL+ W YQVG++GE + L+ + + S
Sbjct: 556 GLPNVGVHYETWNTGVVGPVVIHGLDEGSRDLTWQTWSYQVGLKGEQMNLNSLEGSGSVE 615
Query: 586 WKQGSTLPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPS 644
W QGS + N+ L WY+ F P G PLAL++ SMGKGQ W+NGQSIGRYW+AY +
Sbjct: 616 WMQGSLVAQNQQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAY---A 672
Query: 645 TGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLL 704
G K C Y GSY A KCQ CGQP Q YH+PR+W+ P NLLV+ EELGGD SKI+L
Sbjct: 673 EGDCKGCHYTGSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALA 732
Query: 705 TKTGQHICSFVSEADPPPVDSWK------PNLGVVSSSPQVRLACERGWHIAAINFASYG 758
+T +C+ VSE P + +W+ P + +V L C G I+AI FAS+G
Sbjct: 733 KRTVSGVCADVSEYH-PNIKNWQIESYGEPEF----HTAKVHLKCAPGQTISAIKFASFG 787
Query: 759 IPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEA 817
P G CG+F+ G CH ++ ++++ C+G C + +S + G CP ++K +AVEA
Sbjct: 788 TPLGTCGTFQQGECHSINSNSVLERKCIGLERCVVAISPSNFG--GDPCPEVMKRVAVEA 845
Query: 818 HCS 820
CS
Sbjct: 846 VCS 848
>gi|255554022|ref|XP_002518051.1| beta-galactosidase, putative [Ricinus communis]
gi|223542647|gb|EEF44184.1| beta-galactosidase, putative [Ricinus communis]
Length = 897
Score = 857 bits (2214), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/858 (50%), Positives = 564/858 (65%), Gaps = 48/858 (5%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NV+YDHRAL+IDG RR+L SG IHYPR+TP++WP+LI KSKEGG++VI+TYVFWN HEP+
Sbjct: 39 NVSYDHRALIIDGHRRMLISGGIHYPRATPQMWPDLIAKSKEGGVDVIQTYVFWNGHEPV 98
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+GQY FEG++DLV+FVK V +GL+LHLRIGPY CAEWN+GGFPVWL IPGI FRT N+
Sbjct: 99 KGQYIFEGQYDLVKFVKLVGVSGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIVFRTDNS 158
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PF EEM++F+ KI+DLM++E LF+ QGGPII+ Q+ENEYGN+E ++G GG+ YVKWAA
Sbjct: 159 PFMEEMQQFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNIEHSFGPGGKEYVKWAARM 218
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A+ L VPWVMC+Q DAP II+ CN +YCDG+ PNS KPI+WTE++ GW+ ++G ++
Sbjct: 219 ALGLGAGVPWVMCRQTDAPGSIIDACNEYYCDGYKPNSNKKPILWTEDWDGWYTTWGGSL 278
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RPVEDLAFAVARFF+ GG+FQNYYMYFGGTNF RTAGGP TSYDYDAPIDEYG +
Sbjct: 279 PHRPVEDLAFAVARFFQRGGSFQNYYMYFGGTNFARTAGGPFYITSYDYDAPIDEYGLLS 338
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQ-KLGAKLEAHIY-------------HKSSND 349
+PKWGHL++LH AIKLCE L+++D KLG+K EAH+Y H S +
Sbjct: 339 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGSKQEAHVYRANVHAEGQNLTQHGSQSK 398
Query: 350 CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFA-- 407
C+AFLAN D V F G Y LP WSVS+LPDC+N VFNTAKV +Q + A
Sbjct: 399 CSAFLANIDEHKAVTVRFLGQSYTLPPWSVSVLPDCRNAVFNTAKVAAQTSIKSMELALP 458
Query: 408 ---------QQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWY 458
Q NE SS++ +E + + +F + E +N TKD SDYLWY
Sbjct: 459 QFSGISAPKQLMAQNEGSYMSSSWMTVKEPISVWSGNNFTVEGILEHLNVTKDHSDYLWY 518
Query: 459 TASIHVMPGQ-------GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIE 511
I+V + I+S+ VF+N +L G + + ++
Sbjct: 519 FTRIYVSDDDIAFWEENNVHPAIKIDSMRDVLRVFINGQLTGSVIGRW----IKVVQPVQ 574
Query: 512 LNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEG 570
+G N L +LS VGLQNYGA+ + GAG L ++G DLS+ EW YQVG++G
Sbjct: 575 FQKGYNELVLLSQTVGLQNYGAFLERDGAGFRGHTKLTGFRDGDIDLSNLEWTYQVGLQG 634
Query: 571 EYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNG 630
E + + W + + + WYKT F AP G P+AL+L SMGKGQAWVN
Sbjct: 635 ENQKIYTTENNEKAEWTDLTLDDIPSTFTWYKTYFDAPSGADPVALDLGSMGKGQAWVND 694
Query: 631 QSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI 690
IGRYW+ +AP GC +KCDYRG+Y++ KC+ +CG+P Q YHIPR+W+ P NLLVI
Sbjct: 695 HHIGRYWT-LVAPEEGC-QKCDYRGAYNSEKCRTNCGKPTQIWYHIPRSWLQPSNNLLVI 752
Query: 691 HEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSW------KPNLGVVSSSPQVRLACE 744
EE GG+P +IS+ ++ +C+ VSE PP+ W N+ +P+++L C+
Sbjct: 753 FEETGGNPFEISIKLRSASVVCAQVSETHYPPLQRWIHTDFIYGNVSGKDMTPEIQLRCQ 812
Query: 745 RGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSA 803
G+ I++I FASYG P+G+C F G CH + L +V KAC G+ C+I +S+A G
Sbjct: 813 DGYVISSIEFASYGTPQGSCQKFSRGNCHAPNSLSVVSKACQGRDTCNIAISNAVFG--G 870
Query: 804 GACPGLLKALAVEAHCSI 821
C G++K LAVEA CS+
Sbjct: 871 DPCRGIVKTLAVEAKCSL 888
>gi|242036825|ref|XP_002465807.1| hypothetical protein SORBIDRAFT_01g046160 [Sorghum bicolor]
gi|241919661|gb|EER92805.1| hypothetical protein SORBIDRAFT_01g046160 [Sorghum bicolor]
Length = 842
Score = 856 bits (2211), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/829 (51%), Positives = 551/829 (66%), Gaps = 31/829 (3%)
Query: 6 TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
TYD +A++IDG+RR+L SGSIHYPRSTP++W LI+K+K+GGL+VI+TYVFWN HEP G
Sbjct: 28 TYDKKAVLIDGQRRILFSGSIHYPRSTPDMWEGLIQKAKDGGLDVIQTYVFWNGHEPTPG 87
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
YYFE R+DLVRF+KTVQ+AGLF+HLRIGPY C EWN+GGFPVWL ++PGI FRT N PF
Sbjct: 88 NYYFEERYDLVRFIKTVQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEPF 147
Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
K M+ F KI+ +MK E LFASQGGPIIL+Q+ENEYG G G+ Y+ WAA A+
Sbjct: 148 KTAMQGFTEKIVGMMKSEKLFASQGGPIILSQIENEYGPEGKELGAAGQAYINWAAKMAI 207
Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
L T VPWVMC++EDAPDP+IN CNGFYCD F+PN P KP MWTE +SGWF FG +
Sbjct: 208 GLGTGVPWVMCKEEDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQ 267
Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQP 305
RPVEDLAFAVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG +R+P
Sbjct: 268 RPVEDLAFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLVREP 327
Query: 306 KWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANV 365
K HL+ELH+A+KLCE+ L+S DP LG EAH++ +S + CAAFLANY+S+S A V
Sbjct: 328 KHSHLKELHRAVKLCEQALVSVDPAITTLGTMQEAHVF-RSPSGCAAFLANYNSNSYAKV 386
Query: 366 TFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW- 424
FN Y LP WS+SILPDCKNVVFN+A V Q + +S+ W
Sbjct: 387 VFNNEQYSLPPWSISILPDCKNVVFNSATV----------GVQTSQMQMWGDGASSMMWE 436
Query: 425 -YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMP------GQGKEVFLNI 476
Y+E+V ++ L EQ+N T+D+SDYLWY S+ + P G GK + L++
Sbjct: 437 RYDEEVDSLAAAPLLTTTGLLEQLNVTRDSSDYLWYITSVDISPSENFLQGGGKPLSLSV 496
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
S GHA VFVN +L YG + N L G N + +LS+ GL N G ++
Sbjct: 497 LSAGHALHVFVNGELQGSAYGTREDRRIKYNGNANLRAGTNKIALLSVACGLPNVGVHYE 556
Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
G+ V L L G RDL+ W YQVG++GE + L+ + + S W QGS + N
Sbjct: 557 TWNTGVGGPVGLHGLNEGSRDLTWQTWSYQVGLKGEQMNLNSLEGSTSVEWMQGSLIAQN 616
Query: 596 KS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
+ L WY+ F P G PLAL++ SMGKGQ W+NGQSIGRYW+AY + G K+C Y
Sbjct: 617 QQPLSWYRAYFETPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAY---ADGDCKECSYT 673
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
G++ A KCQ CGQP Q YH+PR+W+ P NLLV+ EELGGD SKI+L+ ++ +C+
Sbjct: 674 GTFRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALVKRSVSSVCAD 733
Query: 715 VSEADPPPVDSWK-PNLGVVS-SSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
VSE D P + +W+ + G +V L C G I+AI FAS+G P G CG+F+ G C
Sbjct: 734 VSE-DHPNIKNWQIESYGEREYHRAKVHLRCSPGQSISAIKFASFGTPMGTCGNFQQGDC 792
Query: 773 H-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
H + +++K C+G C++ +S G CP + K +AVEA CS
Sbjct: 793 HSANSHTVLEKKCIGLQRCAVAISPESFG--GDPCPRVTKRVAVEAVCS 839
>gi|255538780|ref|XP_002510455.1| beta-galactosidase, putative [Ricinus communis]
gi|223551156|gb|EEF52642.1| beta-galactosidase, putative [Ricinus communis]
Length = 846
Score = 855 bits (2210), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 426/836 (50%), Positives = 556/836 (66%), Gaps = 35/836 (4%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ VTYD +A++I+G+RR+L SGSIHYPRSTPE+W +LI+K+K+GGL+VI+TYVFW+ H
Sbjct: 24 VQCTVTYDKKAIIINGQRRILISGSIHYPRSTPEMWEDLIQKAKDGGLDVIDTYVFWDVH 83
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
E G Y F+GR+DLVRF+KTVQ+ GL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 84 ETSPGNYNFDGRYDLVRFIKTVQKVGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 143
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M+ F KI+ +MK ENLFASQGGPIIL+Q+ENEYG A G G Y+ WA
Sbjct: 144 DNEPFKAAMQGFTQKIVQMMKNENLFASQGGPIILSQIENEYGPESRALGAAGRSYINWA 203
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L+T VPWVMC+++DAPDP+INTCNGFYCD F PN P KP +WTE +SGWF FG
Sbjct: 204 AKMAVGLDTGVPWVMCKEDDAPDPMINTCNGFYCDAFAPNKPYKPTLWTEAWSGWFTEFG 263
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ RPVEDLAFAVARF + GG++ NYYMY GGTNFGR+AGGP + TSYDYDAPIDEYG
Sbjct: 264 GPIHQRPVEDLAFAVARFIQKGGSYFNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYG 323
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IR+PK+GHL+ LHKAIKLCE L+SSDP+ LG +AH++ S CAAFLANY++
Sbjct: 324 LIREPKYGHLKALHKAIKLCEHALVSSDPSITSLGTYQQAHVF-SSGRSCAAFLANYNAK 382
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V FN Y LP WS+SILPDC+NVVFNTA+V AQ + L S
Sbjct: 383 SAARVMFNNMHYDLPPWSISILPDCRNVVFNTARV----------GAQTLRMQMLPTGSE 432
Query: 421 AFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
FSW Y+E++ ++ + L EQIN T+DTSDYLWY S+ + P + G++
Sbjct: 433 LFSWETYDEEISSLTDSSRITALGLLEQINVTRDTSDYLWYLTSVDISPSEAFLRNGQKP 492
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L ++S GH VF+N + +G + + L G N + +LS+ VGL N G
Sbjct: 493 SLTVQSAGHGLHVFINGQFSGSAFGTRENRQLTFTGPVNLRAGTNRIALLSIAVGLPNVG 552
Query: 533 AWFDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
++ G+ +L++ L GK+DL+ +W YQVG++GE + L + +S W +GS
Sbjct: 553 LHYETWKTGVQGPVLLNGLNQGKKDLTWQKWSYQVGLKGEAMNLVSPNGVSSVDWIEGSL 612
Query: 592 LPVN-KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
++L W+K F AP G PLAL++ SMGKGQ W+NGQSIGRYW AY + G
Sbjct: 613 ASSQGQALKWHKAYFDAPRGNEPLALDMRSMGKGQVWINGQSIGRYWMAY---AKGDCNS 669
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
C Y ++ SKCQ CG+P Q YH+PR+W+ P +NLLV+ EELGGD SKISL+ ++ +
Sbjct: 670 CSYIWTFRPSKCQLGCGEPTQRWYHVPRSWLKPTKNLLVVFEELGGDASKISLVKRSIEG 729
Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSS-----PQVRLACERGWHIAAINFASYGIPEGNCG 765
+C+ E P + N G S ++ L C G IAAI FAS+G P G CG
Sbjct: 730 VCADAYEHHPATKNY---NTGGNDESSKLHQAKIHLRCAPGQFIAAIKFASFGTPSGTCG 786
Query: 766 SFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
SF+ G CH + +++K C+GQ C + +S++ G A CP +LK L+VEA CS
Sbjct: 787 SFQQGTCHAPNTHSVIEKKCIGQESCMVTISNSNFG--ADPCPNVLKKLSVEAVCS 840
>gi|332105893|gb|AEE01408.1| beta-galactosidase STBG2 [Solanum lycopersicum]
Length = 892
Score = 855 bits (2208), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/854 (50%), Positives = 570/854 (66%), Gaps = 45/854 (5%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NVTYD+RAL+I GKRR+L S IHYPR+TPE+WP LI +SKEGG +VIETY FWN HEP
Sbjct: 36 NVTYDNRALIIGGKRRMLISAGIHYPRATPEMWPTLIARSKEGGADVIETYTFWNGHEPT 95
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
RGQY FEGR+D+V+F K V GLFL +RIGPYACAEWN+GGFP+WL IPGI+FRT N
Sbjct: 96 RGQYNFEGRYDIVKFAKLVGSHGLFLFIRIGPYACAEWNFGGFPIWLRDIPGIEFRTDNA 155
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFKEEM+R++ KI+DLM E+LF+ QGGPIIL Q+ENEYGNVE +G G+LY+KWAA+
Sbjct: 156 PFKEEMERYVKKIVDLMISESLFSWQGGPIILLQIENEYGNVESTFGPKGKLYMKWAAEM 215
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
AV L VPWVMC+Q DAP+ II+TCN +YCDGFTPNS KP +WTEN++GWF +G +
Sbjct: 216 AVGLGAGVPWVMCRQTDAPEYIIDTCNAYYCDGFTPNSEKKPKIWTENWNGWFADWGERL 275
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P+RP ED+AFA+ARFF+ GG+ QNYYMYFGGTNFGRTAGGP TSYDYDAP+DEYG +R
Sbjct: 276 PYRPSEDIAFAIARFFQRGGSLQNYYMYFGGTNFGRTAGGPTQITSYDYDAPLDEYGLLR 335
Query: 304 QPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYHKSSND-----------CA 351
QPKWGHL++LH AIKLCE L+++D P + KLG K EAH+Y +SN+ CA
Sbjct: 336 QPKWGHLKDLHAAIKLCEPALVAADSPQYIKLGPKQEAHVYRGTSNNIGQYMSLNEGICA 395
Query: 352 AFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN----------- 400
AF+AN D A V F G + LP WSVSILPDC+N FNTAKV +Q +
Sbjct: 396 AFIANIDEHESATVKFYGQEFTLPPWSVSILPDCRNTAFNTAKVGAQTSIKTVGSDSVSV 455
Query: 401 NGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTA 460
+ F Q ++L S ++ +E +G+ G+++F + E +N TKD SDYLWY
Sbjct: 456 GNNSLFLQVITKSKLESFSQSWMTLKEPLGVWGDKNFTSKGILEHLNVTKDQSDYLWYLT 515
Query: 461 SIHVMPG-----QGKEV--FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELN 513
I++ + +V ++I+S+ +FVN +L G + + ++L
Sbjct: 516 RIYISDDDISFWEENDVSPTIDIDSMRDFVRIFVNGQLAGSVKGKW----IKVVQPVKLV 571
Query: 514 EGINTLDILSMMVGLQNYGAWFDVAGAGLFSVI-LIDLKNGKRDLSSGEWIYQVGVEGEY 572
+G N + +LS VGLQNYGA+ + GAG I L K+G +L++ W YQVG+ GE+
Sbjct: 572 QGYNDILLLSETVGLQNYGAFLEKDGAGFKGQIKLTGCKSGDINLTTSLWTYQVGLRGEF 631
Query: 573 IGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQS 632
+ + ++ S+ W + T WYKT F AP G P+AL+ +SMGKGQAWVNG
Sbjct: 632 LEVYDVNSTESAGWTEFPTGTTPSVFSWYKTKFDAPGGTDPVALDFSSMGKGQAWVNGHH 691
Query: 633 IGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHE 692
+GRYW+ +AP+ GC + CDYRG+Y + KC+ +CG+ Q YHIPR+W+ N+LVI E
Sbjct: 692 VGRYWT-LVAPNNGCGRTCDYRGAYHSDKCRTNCGEITQAWYHIPRSWLKTLNNVLVIFE 750
Query: 693 ELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPN-----LGVVSSSPQVRLACERGW 747
E+ P IS+ T++ + IC+ VSE PP+ W + L ++ +P++ L C+ G
Sbjct: 751 EIDKTPFDISISTRSTETICAQVSEKHYPPLHKWSHSEFDRKLSLMDKTPEMHLQCDEGH 810
Query: 748 HIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGAC 806
I++I FASYG P G+C F G CH + L +V +AC+G+ CSI +S+ GV C
Sbjct: 811 TISSIEFASYGSPNGSCQKFSQGKCHAANSLSVVSQACIGRTSCSIGISN---GVFGDPC 867
Query: 807 PGLLKALAVEAHCS 820
++K+LAV+A CS
Sbjct: 868 RHVVKSLAVQAKCS 881
>gi|61162194|dbj|BAD91079.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 903
Score = 854 bits (2206), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/856 (50%), Positives = 569/856 (66%), Gaps = 46/856 (5%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NV+YDHRAL+IDGKRR+L S IHYPR+TPE+WP+LI KSKEGG++VI+TY FW+ HEP+
Sbjct: 35 NVSYDHRALIIDGKRRMLVSAGIHYPRATPEMWPDLIAKSKEGGVDVIQTYAFWSGHEPV 94
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
RGQY FEGR+D+V+F V +GL+LHLRIGPY CAEWN+GGFPVWL IPGI+FRT N
Sbjct: 95 RGQYNFEGRYDIVKFANLVGASGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 154
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
FKEEM+RF+ K++DLM++E L + QGGPII+ Q+ENEYGN+E +G G+ Y+KWAA+
Sbjct: 155 LFKEEMQRFVKKMVDLMQEEELLSWQGGPIIMMQIENEYGNIEGQFGQKGKEYIKWAAEM 214
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A+ L VPWVMC+Q DAP II+ CNG+YCDG+ PNS +KP +WTE++ GW+ S+G +
Sbjct: 215 ALGLGAGVPWVMCKQVDAPGSIIDACNGYYCDGYKPNSYNKPTLWTEDWDGWYASWGGRL 274
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RPVEDLAFAVARF++ GG+FQNYYMYFGGTNFGRT+GGP TSYDYDAPIDEYG +
Sbjct: 275 PHRPVEDLAFAVARFYQRGGSFQNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 334
Query: 304 QPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYHKSSN-------------D 349
+PKWGHL++LH AIKLCE L+++D P + KLG K EAH+Y +S+
Sbjct: 335 EPKWGHLKDLHAAIKLCEPALVAADSPNYIKLGPKQEAHVYRVNSHTEGLNITSYGSQIS 394
Query: 350 CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN----NGDHP 405
C+AFLAN D A+VTF G Y LP WSVSILPDC+NVV+NTAKV +Q + D P
Sbjct: 395 CSAFLANIDEHKAASVTFLGQKYNLPPWSVSILPDCRNVVYNTAKVGAQTSIKTVEFDLP 454
Query: 406 F-----AQQKNV--NELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWY 458
+QQ+ + N+ L + ++ +E VG+ +F + E +N TKD SDYLW+
Sbjct: 455 LYSGISSQQQFITKNDDLFITKSWMTVKEPVGVWSENNFTVQGILEHLNVTKDQSDYLWH 514
Query: 459 TASIHVMPGQ-------GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIE 511
I V ++I+S+ VFVN +L H + + ++
Sbjct: 515 ITRIFVSEDDISFWEKNNISAAVSIDSMRDVLRVFVNGQLTEGSVIGHWVK---VEQPVK 571
Query: 512 LNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVI-LIDLKNGKRDLSSGEWIYQVGVEG 570
+G N L +L+ VGLQNYGA+ + GAG I L KNG DLS W YQVG++G
Sbjct: 572 FLKGYNDLVLLTQTVGLQNYGAFLEKDGAGFRGQIKLTGFKNGDIDLSKLLWTYQVGLKG 631
Query: 571 EYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNG 630
E+ + I + W + S + IWYKT F +P G P+AL+L SMGKGQAWVNG
Sbjct: 632 EFFKIYTIEENEKAGWAELSPDDDPSTFIWYKTYFDSPAGTDPVALDLGSMGKGQAWVNG 691
Query: 631 QSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI 690
IGRYW+ +AP GC + CDYRG+Y++ KC +CG+P QTLYH+PR+W+ NLLVI
Sbjct: 692 HHIGRYWT-LVAPEDGCPEICDYRGAYNSDKCSFNCGKPTQTLYHVPRSWLQSSSNLLVI 750
Query: 691 HEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSW------KPNLGVVSSSPQVRLACE 744
EE GG+P IS+ ++ +C+ VSE+ PPV W + V +P++ L C+
Sbjct: 751 LEETGGNPFDISIKLRSAGVLCAQVSESHYPPVQKWFNPDSVDEKITVNDLTPEMHLQCQ 810
Query: 745 RGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSA 803
G+ I++I FASYG P+G+C F G CH + IV K+C+G+ CS+ +S+ G
Sbjct: 811 DGFTISSIEFASYGTPQGSCQKFSMGNCHATNSSSIVSKSCLGKNSCSVEISNNSFG--G 868
Query: 804 GACPGLLKALAVEAHC 819
C G++K LAVEA C
Sbjct: 869 DPCRGIVKTLAVEARC 884
>gi|356496697|ref|XP_003517202.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 849
Score = 854 bits (2206), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/836 (51%), Positives = 551/836 (65%), Gaps = 32/836 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ +VTYD +A++I+G+RR+L SGSIHYPRSTP++W +LI K+KEGGL+VIETYVFWN H
Sbjct: 28 VHCSVTYDRKAILINGQRRILFSGSIHYPRSTPDMWEDLIYKAKEGGLDVIETYVFWNVH 87
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP RG Y FEGR+DLVRFVKT+Q+AGL+ +LRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 88 EPSRGNYNFEGRYDLVRFVKTIQKAGLYANLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 147
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK+ M+ F KI+ +MK E L+ SQGGPIIL+Q+ENEYG G G+ YV WA
Sbjct: 148 DNEPFKKAMQGFTEKIVGMMKSERLYESQGGPIILSQIENEYGAQSKLLGSAGQNYVNWA 207
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV T VPWVMC+++DAPDP+INTCNGFYCD FTPN P KP +WTE +SGWF FG
Sbjct: 208 AKMAVETGTGVPWVMCKEDDAPDPVINTCNGFYCDYFTPNKPYKPSIWTEAWSGWFSEFG 267
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
RPV+DLAF VARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG
Sbjct: 268 GPNHERPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYG 327
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IRQPK+GHL+ELHKAIK+CE L+S+DP LG +AH+Y S DCAAFL+N+D+
Sbjct: 328 LIRQPKYGHLKELHKAIKMCERALVSTDPAVTSLGNFQQAHVYSAKSGDCAAFLSNFDTK 387
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S V FN Y LP WS+SILPDC+NVVFNTAKV Q + L +
Sbjct: 388 SSVRVMFNNMHYNLPPWSISILPDCRNVVFNTAKV----------GVQTSQMQMLPTNTR 437
Query: 421 AFSW--YEEKVGISGNRSFVRPD---LAEQINTTKDTSDYLWYTASIHVMPGQ-----GK 470
FSW ++E + + S + L EQIN T+DTSDYLWY S+ + + GK
Sbjct: 438 MFSWESFDEDISSLDDGSSITTTTSGLLEQINVTRDTSDYLWYITSVDIGSSESFLRGGK 497
Query: 471 EVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQN 530
L ++S GHA VF+N +L YG + F + L G N + +LS+ VGL N
Sbjct: 498 LPTLIVQSTGHAVHVFINGQLSGSAYGTREDRRFTYTGTVNLRAGTNRIALLSVAVGLPN 557
Query: 531 YGAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG 589
G F+ G+ V+L GK DLS +W YQVG++GE + L + +S W Q
Sbjct: 558 VGGHFETWNTGILGPVVLRGFDQGKLDLSWQKWTYQVGLKGEAMNLASPNGISSVEWMQS 617
Query: 590 STLP-VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
+ + N+ L W+KT F AP+G PLAL++ MGKGQ W+NG SIGRYW+A A G
Sbjct: 618 ALVSDKNQPLTWHKTYFDAPDGDEPLALDMEGMGKGQIWINGLSIGRYWTALAA---GNC 674
Query: 649 KKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTG 708
C Y G++ KCQ CGQP Q YH+PR+W+ P NLLV+ EELGGDPSKISL+ ++
Sbjct: 675 NGCSYAGTFRPPKCQVGCGQPTQRWYHVPRSWLKPDHNLLVVFEELGGDPSKISLVKRSV 734
Query: 709 QHICSFVSEADPPPVDSWKPNLGVVSSS---PQVRLACERGWHIAAINFASYGIPEGNCG 765
+C+ VSE P + +W + S P+V L C G I++I FAS+G P G CG
Sbjct: 735 SSVCADVSEYH-PNIRNWHIDSYGKSEEFHPPKVHLHCSPGQTISSIKFASFGTPLGTCG 793
Query: 766 SFRPGACHMDVL-PIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
++ G CH ++K C+G+ C++ VS++ G CP +LK L+VEA C+
Sbjct: 794 NYEKGVCHSSTSHATLEKKCIGKPRCTVTVSNSNFG--QDPCPNVLKRLSVEAVCA 847
>gi|61162208|dbj|BAD91085.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 848
Score = 853 bits (2205), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 421/828 (50%), Positives = 551/828 (66%), Gaps = 24/828 (2%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NV YD +ALVIDG+RR+L SGSIHYPRSTPE+W LI+K+K+GGL+ I+TYVFWN HEP
Sbjct: 30 NVVYDRKALVIDGQRRLLFSGSIHYPRSTPEMWEGLIQKAKDGGLDAIDTYVFWNLHEPS 89
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
G Y FEGR DLVRF+KTV +AGL++HLRIGPY C+EWN+GGFPVWL F+PGI FRT N
Sbjct: 90 PGNYNFEGRNDLVRFIKTVHKAGLYVHLRIGPYICSEWNFGGFPVWLKFVPGISFRTDNE 149
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK M++F K++ LMK E LF SQGGPIIL+Q+ENEY A+G G Y+ WAA
Sbjct: 150 PFKSAMQKFTQKVVQLMKNEKLFESQGGPIILSQIENEYEPESKAFGASGYAYMTWAAKM 209
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
AV + T VPWVMC+++DAPDP+INTCNGFYCD F+PN P KP MWTE +SGWF FG +
Sbjct: 210 AVGMGTGVPWVMCKEDDAPDPVINTCNGFYCDYFSPNKPYKPTMWTEAWSGWFTEFGGPI 269
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
RPVEDL FAVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG IR
Sbjct: 270 YQRPVEDLTFAVARFIQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIR 329
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
+PK+GHL+ELHKA+KLCE L+++DPT LG+ +AH++ S A FL+N+++ S
Sbjct: 330 RPKYGHLKELHKAVKLCELALLNADPTVTTLGSYEQAHVFSSKSGSGAVFLSNFNTKSAT 389
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
VTFN + LP WS+SILPDCKNV FNTA+V Q + Q N L + F+
Sbjct: 390 KVTFNNMNFHLPPWSISILPDCKNVAFNTARVGVQTSQ-----TQLLRTNSELHSWGIFN 444
Query: 424 WYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMP-----GQGKEVFLNIES 478
E+ ++G+ + L +Q+N T+D+SDYLWYT S+ + P G G+ L ++S
Sbjct: 445 --EDVSSVAGDTTITVTGLLDQLNITRDSSDYLWYTTSVDIDPSESFLGGGQHPSLTVQS 502
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
G A VF+N +L G + F + L+ G+N + +LS+ VGL N G F+
Sbjct: 503 AGDAMHVFINDQLSGSASGTREHRRFTFTGNVNLHAGLNKISLLSIAVGLANNGPHFETR 562
Query: 539 GAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
G+ V L L +G RDLS +W YQVG++GE LD + ++ W GS + +
Sbjct: 563 NTGVLGPVALHGLDHGTRDLSWQKWSYQVGLKGEATNLDSPNSISAVDWMTGSLVAQKQQ 622
Query: 598 -LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
L WYK F P G PLAL++ SMGKGQ W+NGQSIGRYW+ Y + C+ C Y G+
Sbjct: 623 PLTWYKAYFDEPNGDEPLALDMGSMGKGQVWINGQSIGRYWTIY--ADSDCS-ACTYSGT 679
Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
+ KCQ C P Q YH+PR+W+ P +NLLV+ EE+GGD SK++L+ K+ +C+ VS
Sbjct: 680 FRPKKCQFGCQHPTQQWYHVPRSWLKPSKNLLVVFEEIGGDVSKVALVKKSVTSVCAEVS 739
Query: 717 EADPPPVDSWKPN---LGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
E + P + +W V P++ L C G I+AI F+S+G P G+CG F+ G CH
Sbjct: 740 E-NHPRITNWHTESHGQTEVQQKPEISLHCTDGHSISAIKFSSFGTPSGSCGKFQHGTCH 798
Query: 774 M-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ ++QK C+G+ +CS+ +S+ G A CP LK L+VEA CS
Sbjct: 799 APNSNAVLQKECLGKQKCSVTISNTNFG--ADPCPSKLKKLSVEAVCS 844
>gi|2961390|emb|CAA18137.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 853
Score = 853 bits (2205), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/845 (51%), Positives = 549/845 (64%), Gaps = 55/845 (6%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ VTYD +AL+I+G+RR+L SGSIHYPRSTP++W +LI+K+K+GG++VIETYVFWN H
Sbjct: 29 VQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGIDVIETYVFWNLH 88
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G+Y FEGR DLVRFVKT+ +AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 89 EPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 148
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK MK F +I++LMK ENLF SQGGPIIL+Q+ENEYG G G Y+ WA
Sbjct: 149 DNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQLLGAEGHNYMTWA 208
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A+ T VPWVMC+++DAPDP+INTCNGFYCD F PN P KP++WTE +SGWF FG
Sbjct: 209 AKMAIATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNKPYKPLIWTEAWSGWFTEFG 268
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ RPV+DLAF VARF + GG+F NYYMY GGTNFGRTAGGP V TSYDYDAPIDEYG
Sbjct: 269 GPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYG 328
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLE--------AHIYHKSSNDCAA 352
IRQPK+GHL+ELH+AIK+CE+ L+S+DP +G K + AH+Y S DC+A
Sbjct: 329 LIRQPKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQQVWIYYERFAHVYSAESGDCSA 388
Query: 353 FLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNV 412
FLANYD+ S A V FN Y LP WS+SILPDC+N VFNTAKV
Sbjct: 389 FLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKV----------------- 431
Query: 413 NELLLASSAFSW---YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ- 468
S F W E+ + + +F L EQIN T+DTSDYLWY S+ + +
Sbjct: 432 -------SNFQWESYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSVDIGDSES 484
Query: 469 ----GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSM 524
G+ L I+S GHA +FVN +L +G F KI L+ G N + +LS+
Sbjct: 485 FLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSV 544
Query: 525 MVGLQNYGAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANS 583
VGL N G F+ G+ V L L GK DLS +W YQVG++GE + L + S
Sbjct: 545 AVGLPNVGGHFESWNTGILGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMNLAFPTNTPS 604
Query: 584 SFWKQGS-TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLA 642
W S T+ + L W+KT F APEG PLAL++ MGKGQ WVNG+SIGRYW+A+
Sbjct: 605 IGWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAF-- 662
Query: 643 PSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKIS 702
+TG C Y G+Y +KCQ CGQP Q YH+PR W+ P +NLLVI EELGG+PS +S
Sbjct: 663 -ATGDCSHCSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPSTVS 721
Query: 703 LLTKTGQHICSFVSEADPPPVDSWKPN---LGVVSSSPQVRLACERGWHIAAINFASYGI 759
L+ ++ +C+ VSE P + +W+ G P+V L C G IA+I FAS+G
Sbjct: 722 LVKRSVSGVCAEVSEYH-PNIKNWQIESYGKGQTFHRPKVHLKCSPGQAIASIKFASFGT 780
Query: 760 PEGNCGSFRPGACHM----DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAV 815
P G CGS++ G CH +L + CVG+ C++ +S++ G CP +LK L V
Sbjct: 781 PLGTCGSYQQGECHAATSYAILERYMQKCVGKARCAVTISNSNFG--KDPCPNVLKRLTV 838
Query: 816 EAHCS 820
EA C+
Sbjct: 839 EAVCA 843
>gi|224129140|ref|XP_002328900.1| predicted protein [Populus trichocarpa]
gi|222839330|gb|EEE77667.1| predicted protein [Populus trichocarpa]
Length = 891
Score = 852 bits (2202), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/855 (50%), Positives = 566/855 (66%), Gaps = 48/855 (5%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NVTYDHRAL+IDG+RR+L S IHYPR+TPE+WP+LI KSKEGG +V++TYVFW HEP+
Sbjct: 35 NVTYDHRALIIDGRRRILNSAGIHYPRATPEMWPDLIAKSKEGGADVVQTYVFWGGHEPV 94
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+GQYYFEGR+DLV+FVK V E+GL+LHLRIGPY CAEWN+GGFPVWL +PG+ FRT N
Sbjct: 95 KGQYYFEGRYDLVKFVKLVGESGLYLHLRIGPYVCAEWNFGGFPVWLRDVPGVVFRTDNA 154
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFKEEM++F+ KI+DLM++E L + QGGPII+ Q+ENEYGN+E ++G GG+ Y+KWAA
Sbjct: 155 PFKEEMQKFVTKIVDLMREEMLLSWQGGPIIMFQIENEYGNIEHSFGQGGKEYMKWAAGM 214
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A+ L+ VPWVMC+Q DAP+ II+ CNG+YCDGF PNSP KPI WTE++ GW+ ++G +
Sbjct: 215 ALALDAGVPWVMCKQTDAPENIIDACNGYYCDGFKPNSPKKPIFWTEDWDGWYTTWGGRL 274
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RPVEDLAFAVARFF+ GG+FQNYYMYFGGTNFGRT+GGP TSYDYDAPIDEYG +
Sbjct: 275 PHRPVEDLAFAVARFFQRGGSFQNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 334
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPT-HQKLGAKLEAHIY-------------HKSSND 349
+PKWGHL++LH AIKLCE L+++D + KLG K EAH+Y + S +
Sbjct: 335 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGPKQEAHVYGGSLSIQGMNFSQYGSQSK 394
Query: 350 CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQ 409
C+AFLAN D A V F G + LP WSVSILPDC+N VFNTAKV +Q + F
Sbjct: 395 CSAFLANIDERQAATVRFLGQSFTLPPWSVSILPDCRNTVFNTAKVAAQTHIKTVEFVLP 454
Query: 410 KNVNELLL--------ASSAFSWY--EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYT 459
+ + LL + + SW +E + + +F + E +N TKD SDYLWY
Sbjct: 455 LSNSSLLPQFIVQNEDSPQSTSWLIAKEPITLWSEENFTVKGILEHLNVTKDESDYLWYF 514
Query: 460 ASIHVMPG-----QGKEV--FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIEL 512
I+V + +V ++I+S+ VF+N +L G+ A + ++
Sbjct: 515 TRIYVSDDDIAFWEKNKVSPAVSIDSMRDVLRVFINGQLTGSVVGHWVKA----VQPVQF 570
Query: 513 NEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVI-LIDLKNGKRDLSSGEWIYQVGVEGE 571
+G N L +LS VGLQNYGA+ + GAG I L KNG DLS+ W YQVG++GE
Sbjct: 571 QKGYNELVLLSQTVGLQNYGAFLERDGAGFKGQIKLTGFKNGDIDLSNLSWTYQVGLKGE 630
Query: 572 YIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQ 631
++ + W + + + WYKT F AP G P+AL+L SMGKGQAWVNG
Sbjct: 631 FLKVYSTGDNEKFEWSELAVDATPSTFTWYKTFFDAPSGVDPVALDLGSMGKGQAWVNGH 690
Query: 632 SIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIH 691
IGRYW+ ++P GC CDYRG+Y + KC+ +CG P QT YH+PR W+ NLLV+
Sbjct: 691 HIGRYWTV-VSPKDGC-GSCDYRGAYSSGKCRTNCGNPTQTWYHVPRAWLEASNNLLVVF 748
Query: 692 EELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKP------NLGVVSSSPQVRLACER 745
EE GG+P +IS+ ++ + IC+ VSE+ PP+ W N+ +P++ L C+
Sbjct: 749 EETGGNPFEISVKLRSAKVICAQVSESHYPPLRKWSRADLTGGNISRNDMTPEMHLKCQD 808
Query: 746 GWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAG 804
G +++I FASYG P G+C F G CH + +V +AC G+ +C I +S+A G
Sbjct: 809 GHIMSSIEFASYGTPNGSCQKFSRGNCHASNSSSVVTEACQGKNKCDIAISNAVFG---D 865
Query: 805 ACPGLLKALAVEAHC 819
C G++K LAVEA C
Sbjct: 866 PCRGVIKTLAVEARC 880
>gi|357153898|ref|XP_003576603.1| PREDICTED: beta-galactosidase 15-like [Brachypodium distachyon]
Length = 908
Score = 851 bits (2199), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/855 (50%), Positives = 559/855 (65%), Gaps = 45/855 (5%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NV+YDHRA+ + G+RR+L S +HYPR+TPE+WP +I K KEGG +VIETY+FWN HEP
Sbjct: 51 NVSYDHRAVRVGGERRMLVSAGVHYPRATPEMWPSIIAKCKEGGADVIETYIFWNGHEPA 110
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+GQYYFE RFDLVRF+K V GLFL LRIGPYACAEWN+GGFPVWL IPGI+FRT N
Sbjct: 111 KGQYYFEERFDLVRFIKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTDNE 170
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
P+K EM+ F+ KI+D+MK E L++ QGGPIIL Q+ENEYGN++ YG G+ Y++WAA
Sbjct: 171 PYKAEMQTFVTKIVDMMKDEKLYSWQGGPIILQQIENEYGNIQGKYGQAGKRYMQWAAQM 230
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A+ L+T +PWVMC+Q DAP+ I++TCN FYCDGF PNS +KP +WTE++ GW+ +G +
Sbjct: 231 ALGLDTGIPWVMCRQTDAPEQILDTCNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGGPL 290
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RP ED AFAVARF++ GG+ QNYYMYFGGTNF RTAGGPL TSYDYDAPI+EYG +R
Sbjct: 291 PHRPAEDSAFAVARFYQRGGSLQNYYMYFGGTNFARTAGGPLQITSYDYDAPINEYGMLR 350
Query: 304 QPKWGHLRELHKAIKLCEEYLISSD--PTHQKLGAKLEAHIYHKS-----------SNDC 350
QPKWGHL++LH AIKLCE LI+ D P + KLG+ EAHIY + + C
Sbjct: 351 QPKWGHLKDLHTAIKLCEPALIAVDGSPQYVKLGSMQEAHIYSSAKVHTNGSTAGNAQIC 410
Query: 351 AAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN-----NGDHP 405
+AFLAN D +V G Y LP WSVSILPDC+NV FNTA+V +Q + +G
Sbjct: 411 SAFLANIDEHKYVSVWIFGKSYNLPPWSVSILPDCENVAFNTARVGAQTSVFTFESGSPS 470
Query: 406 FAQQKNVNELL------LASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYT 459
+ ++ + LL SS + +E +G G+ SF + E +N TKD SDYLWYT
Sbjct: 471 HSSRREPSVLLPGVRGSYLSSTWWTSKETIGTWGDGSFATQGILEHLNVTKDISDYLWYT 530
Query: 460 ASIHVM-------PGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIEL 512
S+++ +G L I+ + A VFVN KL G+ + + I+
Sbjct: 531 TSVNISDEDVAFWSSKGVLPSLIIDQIRDVARVFVNGKLAGSQVGHW----VSLKQPIQF 586
Query: 513 NEGINTLDILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGE 571
G+N L +LS +VGLQNYGA+ + GAG V L L NG DL++ W YQVG++GE
Sbjct: 587 VRGLNELTLLSEIVGLQNYGAFLEKDGAGFKGQVKLTGLSNGDTDLTNSAWTYQVGLKGE 646
Query: 572 YIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQ 631
+ + + W T + WYKT APEG P+A++L SMGKGQAWVNG+
Sbjct: 647 FSMIYTPEKQECAEWSAMQTDNIQSPFTWYKTMVDAPEGTDPVAIDLGSMGKGQAWVNGR 706
Query: 632 SIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIH 691
IGRYWS +AP +GC C+Y G+Y +KCQ +CG P Q+ YHIPR W+ NLLV+
Sbjct: 707 LIGRYWS-LVAPESGCPSSCNYPGAYSETKCQSNCGMPTQSWYHIPREWLQESNNLLVLF 765
Query: 692 EELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWK----PNLGVVSSSPQVRLACERGW 747
EE GGDPSKISL + ICS +SE PP+ +W + V S +P++ L C+ G+
Sbjct: 766 EETGGDPSKISLEVHYTKTICSRISENYYPPLSAWSWLDTGRVSVDSVAPELLLRCDDGY 825
Query: 748 HIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGAC 806
I+ I FASYG P G C +F G CH L V +ACVG+ +C+I VS+ G C
Sbjct: 826 EISRITFASYGTPSGGCQNFSKGKCHAASTLDFVTEACVGKNKCAISVSNDVFG---DPC 882
Query: 807 PGLLKALAVEAHCSI 821
G+LK LAVEA CS+
Sbjct: 883 RGVLKDLAVEAECSL 897
>gi|115488372|ref|NP_001066673.1| Os12g0429200 [Oryza sativa Japonica Group]
gi|122234131|sp|Q0INM3.1|BGL15_ORYSJ RecName: Full=Beta-galactosidase 15; Short=Lactase 15; Flags:
Precursor
gi|113649180|dbj|BAF29692.1| Os12g0429200 [Oryza sativa Japonica Group]
Length = 919
Score = 848 bits (2192), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 438/857 (51%), Positives = 562/857 (65%), Gaps = 52/857 (6%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NVTYDHRA++I GKRR+L S +HYPR+TPE+WP LI K KEGG +VIETYVFWN HEP
Sbjct: 63 NVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGADVIETYVFWNGHEPA 122
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+GQYYFE RFDLV+F K V GLFL LRIGPYACAEWN+GGFPVWL IPGI+FRT N
Sbjct: 123 KGQYYFEERFDLVKFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTDNE 182
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK EM+ F+ KI+ LMK+E L++ QGGPIIL Q+ENEYGN++ YG G+ Y++WAA
Sbjct: 183 PFKAEMQTFVTKIVTLMKEEKLYSWQGGPIILQQIENEYGNIQGNYGQAGKRYMQWAAQM 242
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A+ L+T +PWVMC+Q DAP+ II+TCN FYCDGF PNS +KP +WTE++ GW+ +G A+
Sbjct: 243 AIGLDTGIPWVMCRQTDAPEEIIDTCNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGGAL 302
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RP ED AFAVARF++ GG+ QNYYMYFGGTNF RTAGGPL TSYDYDAPIDEYG +R
Sbjct: 303 PHRPAEDSAFAVARFYQRGGSLQNYYMYFGGTNFARTAGGPLQITSYDYDAPIDEYGILR 362
Query: 304 QPKWGHLRELHKAIKLCEEYLISSD--PTHQKLGAKLEAHIYHK-----------SSNDC 350
QPKWGHL++LH AIKLCE LI+ D P + KLG+ EAH+Y ++ C
Sbjct: 363 QPKWGHLKDLHTAIKLCEPALIAVDGSPQYIKLGSMQEAHVYSTGEVHTNGSMAGNAQIC 422
Query: 351 AAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN----NGDHPF 406
+AFLAN D A+V G Y LP WSVSILPDC+NV FNTA++ +Q + P
Sbjct: 423 SAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCENVAFNTARIGAQTSVFTVESGSPS 482
Query: 407 AQQKNVNELLLASS-----AFSWY--EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYT 459
++ +L +S + +W+ +E +G G +F + E +N TKD SDYLWYT
Sbjct: 483 RSSRHKPSILSLTSGGPYLSSTWWTSKETIGTWGGNNFAVQGILEHLNVTKDISDYLWYT 542
Query: 460 ASIHVMPG-------QGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIEL 512
+++ +G L I+ + A VFVN KL G+ + + I+L
Sbjct: 543 TRVNISDADVAFWSSKGVLPSLTIDKIRDVARVFVNGKLAGSQVGHW----VSLKQPIQL 598
Query: 513 NEGINTLDILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGE 571
EG+N L +LS +VGLQNYGA+ + GAG V L L +G DL++ W YQVG++GE
Sbjct: 599 VEGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVTLTGLSDGDVDLTNSLWTYQVGLKGE 658
Query: 572 YIGL---DKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWV 628
+ + +K A S ++ S P WYKT F P+G P+A++L SMGKGQAWV
Sbjct: 659 FSMIYAPEKQGCAGWSRMQKDSVQP----FTWYKTMFSTPKGTDPVAIDLGSMGKGQAWV 714
Query: 629 NGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLL 688
NG IGRYWS +AP +GC+ C Y G+Y+ KCQ +CG P Q YHIPR W+ +NLL
Sbjct: 715 NGHLIGRYWS-LVAPESGCSSSCYYPGAYNERKCQSNCGMPTQNWYHIPREWLKESDNLL 773
Query: 689 VIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSW----KPNLGVVSSSPQVRLACE 744
V+ EE GGDPS ISL + +CS +SE PP+ +W V +++P++RL C+
Sbjct: 774 VLFEETGGDPSLISLEAHYAKTVCSRISENYYPPLSAWSHLSSGRASVNAATPELRLQCD 833
Query: 745 RGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSA 803
G I+ I FASYG P G C +F G CH L +V +ACVG +C+I VS+ G
Sbjct: 834 DGHVISEITFASYGTPSGGCLNFSKGNCHASSTLDLVTEACVGNTKCAISVSNDVFG--- 890
Query: 804 GACPGLLKALAVEAHCS 820
C G+LK LAVEA CS
Sbjct: 891 DPCRGVLKDLAVEAKCS 907
>gi|222618730|gb|EEE54862.1| hypothetical protein OsJ_02342 [Oryza sativa Japonica Group]
Length = 839
Score = 848 bits (2192), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 424/836 (50%), Positives = 549/836 (65%), Gaps = 46/836 (5%)
Query: 6 TYDHRALVIDGKRRVLQSGSIHYPRSTPE------------VWPELIRKSKEGGLEVIET 53
TYD +A+V++G+RR+L SGSIHYPRSTPE +WP+LI K+K+GGL+V++T
Sbjct: 27 TYDRKAVVVNGQRRILISGSIHYPRSTPEARRTRFPFLLLTMWPDLIEKAKDGGLDVVQT 86
Query: 54 YVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFI 113
YVFWN HEP GQYYFEGR+DLV F+K V++AGL+++LRIGPY CAEWN+GGFPVWL ++
Sbjct: 87 YVFWNGHEPSPGQYYFEGRYDLVHFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYV 146
Query: 114 PGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGG 173
PGI FRT N PFK EM++F KI+++MK E LF QGGPIIL+Q+ENE+G +EW G
Sbjct: 147 PGISFRTDNEPFKAEMQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPA 206
Query: 174 ELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYS 233
+ Y WAA+ AV LNTSVPW+MC+++DAPDPIINTCNGFYCD F+PN P KP MWTE ++
Sbjct: 207 KAYASWAANMAVALNTSVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWT 266
Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYD 293
W+ FG VP RPVEDLA+ VA+F + GG+F NYYMY GGTNFGRTAGGP +ATSYDYD
Sbjct: 267 AWYTGFGIPVPHRPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYD 326
Query: 294 APIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAF 353
APIDEYG +R+PKWGHL++LHKAIKLCE L++ DP LG ++ ++ S+ CAAF
Sbjct: 327 APIDEYGLLREPKWGHLKQLHKAIKLCEPALVAGDPIVTSLGNAQKSSVFRSSTGACAAF 386
Query: 354 LANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVN 413
L N D S A V FNG Y LP WS+SILPDCK VFNTA+V SQ +Q K
Sbjct: 387 LENKDKVSYARVAFNGMHYDLPPWSISILPDCKTTVFNTARVGSQ-------ISQMK--- 436
Query: 414 ELLLASSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ--- 468
+ + F+W Y E++ G L EQIN T+D +DYLWYT + V +
Sbjct: 437 --MEWAGGFAWQSYNEEINSFGEDPLTTVGLLEQINVTRDNTDYLWYTTYVDVAQDEQFL 494
Query: 469 --GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMV 526
G+ + L + S GHA +F+N +L YG+ D ++L G NT+ LS+ V
Sbjct: 495 SNGENLKLTVMSAGHALHIFINGQLKGTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIAV 554
Query: 527 GLQNYGAWFDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSF 585
GL N G F+ AG+ + +D L G+RDL+ +W YQVG++GE + L +S +++
Sbjct: 555 GLPNVGEHFETWNAGILGPVTLDGLNEGRRDLTWQKWTYQVGLKGESMSLHSLSGSSTVE 614
Query: 586 WKQGSTLPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPS 644
W + PV K L WYK F AP+G PLAL+++SMGKGQ W+NGQ IGRYW Y A
Sbjct: 615 WGE----PVQKQPLTWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKA-- 668
Query: 645 TGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLL 704
+G CDYRG YD +KCQ +CG +Q YH+PR+W+ P NLLVI EE GGDP+ IS++
Sbjct: 669 SGNCGTCDYRGEYDETKCQTNCGDSSQRWYHVPRSWLSPTGNLLVIFEEWGGDPTGISMV 728
Query: 705 TKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNC 764
++ +C+ VSE P + +W +V L C+ G I I FAS+G P+G+C
Sbjct: 729 KRSIGSVCADVSEWQ-PSMKNWHTK---DYEKAKVHLQCDNGQKITEIKFASFGTPQGSC 784
Query: 765 GSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
GS+ G CH I K CVGQ C + V G CPG +K VEA C
Sbjct: 785 GSYTEGGCHAHKSYDIFWKNCVGQERCGVSVVPEIFG--GDPCPGTMKRAVVEAIC 838
>gi|297826725|ref|XP_002881245.1| hypothetical protein ARALYDRAFT_902346 [Arabidopsis lyrata subsp.
lyrata]
gi|297327084|gb|EFH57504.1| hypothetical protein ARALYDRAFT_902346 [Arabidopsis lyrata subsp.
lyrata]
Length = 887
Score = 847 bits (2187), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/848 (50%), Positives = 551/848 (64%), Gaps = 38/848 (4%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NV+YDHRAL+I KRR+L S IHYPR+TPE+W +LI KSKEGG +VI+TYVFW+ HEP+
Sbjct: 37 NVSYDHRALIIADKRRMLVSAGIHYPRATPEMWSDLIEKSKEGGADVIQTYVFWSGHEPV 96
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+GQY FEGR+DLV+FVK + +GL+LHLRIGPY CAEWN+GGFPVWL IPGIQFRT N
Sbjct: 97 KGQYNFEGRYDLVKFVKLIGSSGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIQFRTDNE 156
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK+EM++F+ KI+DLM+ LF QGGPII+ Q+ENEYG+VE +YG G+ YVKWAA
Sbjct: 157 PFKKEMQKFVTKIVDLMRDAKLFCWQGGPIIMLQIENEYGDVEKSYGQKGKDYVKWAASM 216
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A+ L VPWVMC+Q DAP+ II+ CNG+YCDGF PNS KPI+WTE++ GW+ +G ++
Sbjct: 217 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGFKPNSQMKPILWTEDWDGWYTKWGGSL 276
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RP EDLAFAVARF++ GG+FQNYYMYFGGTNFGRT+GGP TSYDYDAP+DEYG
Sbjct: 277 PHRPAEDLAFAVARFYQRGGSFQNYYMYFGGTNFGRTSGGPFYITSYDYDAPLDEYGLRS 336
Query: 304 QPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYHKSSND----CAAFLANYD 358
+PKWGHL++LH AIKLCE L+++D P ++KLG+ EAHIY CAAFLAN D
Sbjct: 337 EPKWGHLKDLHAAIKLCEPALVAADAPQYRKLGSNQEAHIYRGDGETGGKVCAAFLANID 396
Query: 359 SSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQ---------Q 409
A+V FNG Y LP WSVSILPDC++V FNTAKV +Q + A+ Q
Sbjct: 397 EHKSAHVKFNGQSYTLPPWSVSILPDCRHVAFNTAKVGAQTSVKTVESARPSLGSKSILQ 456
Query: 410 KNVNELLLASSAFSWY--EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPG 467
K V + ++ + SW +E +GI G +F L E +N TKD SDYLW+ I V
Sbjct: 457 KVVRQDNVSYISKSWMALKEPIGIWGENNFTFQGLLEHLNVTKDRSDYLWHKTRITVSED 516
Query: 468 Q-------GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLD 520
G ++I+S+ VFVNK+L G+ A + + +G N L
Sbjct: 517 DISFWKKNGANPTVSIDSMRDVLRVFVNKQLSGSVVGHWVKA----VQPVRFMQGNNDLL 572
Query: 521 ILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKIS 579
+L+ VGLQNYGA+ + GAG L KNG DL+ W YQVG++GE + +
Sbjct: 573 LLTQTVGLQNYGAFLEKDGAGFRGKAKLTGFKNGDMDLAKSSWTYQVGLKGEAEKIYTVE 632
Query: 580 LANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSA 639
+ W T +WYKT F P G P+ L+L SMGKGQAWVNG IGRYW+
Sbjct: 633 HNEKAEWSTLETDASPSIFMWYKTYFDTPAGTDPVVLDLESMGKGQAWVNGHHIGRYWNI 692
Query: 640 YLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPS 699
++ GC + CDYRG+Y + KC +CG+P QT YH+PR+W+ P NLLV+ EE GG+P
Sbjct: 693 -ISQKDGCERTCDYRGAYYSDKCTTNCGKPTQTRYHVPRSWLKPSSNLLVLFEETGGNPF 751
Query: 700 KISLLTKTGQHICSFVSEADPPPVDSWKP------NLGVVSSSPQVRLACERGWHIAAIN 753
IS+ T T +C V E+ PP+ W + + S +P+V L CE G I++I
Sbjct: 752 NISVKTVTAGILCGQVLESHYPPLRKWSTPDYINGTMSINSVAPEVYLHCEDGHVISSIE 811
Query: 754 FASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKA 812
FASYG P G+C F G CH + L IV +AC G+ C I VS+ + C G LK
Sbjct: 812 FASYGTPRGSCDRFSIGKCHASNSLSIVSEACKGRTSCFIEVSNT--AFRSDPCSGTLKT 869
Query: 813 LAVEAHCS 820
LAV A CS
Sbjct: 870 LAVMARCS 877
>gi|357518749|ref|XP_003629663.1| Beta-galactosidase [Medicago truncatula]
gi|355523685|gb|AET04139.1| Beta-galactosidase [Medicago truncatula]
Length = 912
Score = 845 bits (2184), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 442/863 (51%), Positives = 571/863 (66%), Gaps = 56/863 (6%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NVTYDHRAL+IDG RR+L S IHYPR+TPE+WP+LI K+KEGG++VIETYVFWN H+P+
Sbjct: 49 NVTYDHRALIIDGHRRMLISAGIHYPRATPEMWPDLIAKAKEGGVDVIETYVFWNGHQPV 108
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+GQY FEGR+DLV+F K V GL+ LRIGPYACAEWN+GGFPVWL IPGI+FRT N
Sbjct: 109 KGQYNFEGRYDLVKFAKLVASNGLYFFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTNNA 168
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQV------ENEYGNVEWAYGVGGELYV 177
PFKEEMKRF++K+++LM++E LF+ QGGPIIL QV ENEYGN+E +YG G+ YV
Sbjct: 169 PFKEEMKRFVSKVVNLMREEMLFSWQGGPIILLQVRREYGIENEYGNLESSYGNEGKEYV 228
Query: 178 KWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFL 237
KWAA A++L VPWVMC+Q DAP II+TCN +YCDGF PNS +KPI WTEN+ GW+
Sbjct: 229 KWAASMALSLGAGVPWVMCKQPDAPYDIIDTCNAYYCDGFKPNSRNKPIFWTENWDGWYT 288
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPID 297
+G +P RPVEDLAFAVARFF+ GG+ QNYYMYFGGTNFGRTAGGPL TSYDYDAPID
Sbjct: 289 QWGERLPHRPVEDLAFAVARFFQRGGSLQNYYMYFGGTNFGRTAGGPLQITSYDYDAPID 348
Query: 298 EYGFIRQPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYHKS---------- 346
EYG + +PKWGHL++LH A+KLCE L+++D PT+ KLG+K EAH+Y ++
Sbjct: 349 EYGLLNEPKWGHLKDLHAALKLCEPALVAADSPTYIKLGSKQEAHVYQENVHREGLNLSI 408
Query: 347 ---SNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN--- 400
SN C+AFLAN D A VTF G Y LP WSVSILPDC++ +FNTAKV +Q +
Sbjct: 409 SQISNKCSAFLANIDERKAATVTFRGQTYTLPPWSVSILPDCRSAIFNTAKVGAQTSVKL 468
Query: 401 -NGDHPFA-----QQKNVNELLLASSAFSWY--EEKVGISGNRSFVRPDLAEQINTTKDT 452
+ P Q++++ ++ + SW +E + I N SF + E +N TKD
Sbjct: 469 VGSNLPLTSNLLLSQQSIDHNGISHISKSWMTTKEPINIWINSSFTAEGIWEHLNVTKDQ 528
Query: 453 SDYLWYTASIHVMPGQ-------GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFL 505
SDYLWY+ I+V G L I+S+ VFVN +L+ G+ A
Sbjct: 529 SDYLWYSTRIYVSDGDILFWKENAAHPKLAIDSVRDILRVFVNGQLIGNVVGHWVKA--- 585
Query: 506 INKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILID-LKNGKRDLSSGEWIY 564
+ ++ G N L +L+ VGLQNYGA+ + GAG+ I I +NG DLS W Y
Sbjct: 586 -VQTLQFQPGYNDLTLLTQTVGLQNYGAFIEKDGAGIRGTIKITGFENGHIDLSKPLWTY 644
Query: 565 QVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKG 624
QVG++GE++ N+ W + + + + WYKT F P G P+AL+L SMGKG
Sbjct: 645 QVGLQGEFLKFYNEESENAG-WVELTPDAIPSTFTWYKTYFDVPGGNDPVALDLESMGKG 703
Query: 625 QAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPG 684
QAWVNG IGRYW+ ++P TGC + CDYRG+YD+ KC +CG+P QTLYH+PR+W+
Sbjct: 704 QAWVNGHHIGRYWTR-VSPKTGC-QVCDYRGAYDSDKCTTNCGKPTQTLYHVPRSWLKAS 761
Query: 685 ENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSW--KPNLGV--VSSS---P 737
N LVI EE GG+P IS+ + +C+ VS++ PP+ LG VSS+ P
Sbjct: 762 NNFLVILEETGGNPLGISVKLHSASIVCAQVSQSYYPPMQKLLNASLLGQQEVSSNDMIP 821
Query: 738 QVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSS 796
++ L C G I++I FAS+G P G+C SF G CH IV KAC+G+ CSI +SS
Sbjct: 822 EMNLRCRDGNIISSITFASFGTPGGSCQSFSRGNCHAPSSKSIVSKACLGKRSCSIKISS 881
Query: 797 AYLGVSAGACPGLLKALAVEAHC 819
G C ++K L+VEA C
Sbjct: 882 DVFG--GDPCQDVVKTLSVEARC 902
>gi|308550956|gb|ADO34792.1| beta-galactosidase STBG7 [Solanum lycopersicum]
Length = 870
Score = 845 bits (2184), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/835 (50%), Positives = 537/835 (64%), Gaps = 27/835 (3%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+VTYD R+L+I+G+R++L S SIHYPRS P +WP L+R +KEGG++VIETYVFWN HEP
Sbjct: 45 SVTYDRRSLIINGQRKLLISASIHYPRSVPAMWPGLVRLAKEGGVDVIETYVFWNGHEPS 104
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
G YYF GRFDLV+F K +Q+AG+++ LRIGP+ AEWN+GG PVWLH++PG FRT +
Sbjct: 105 PGNYYFGGRFDLVKFCKIIQQAGMYMILRIGPFVAAEWNFGGLPVWLHYVPGTTFRTDSE 164
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK M++F+ ++LMK+E LFASQGGPIIL+QVENEYG E AYG GG+ Y WAA
Sbjct: 165 PFKYHMQKFMTYTVNLMKRERLFASQGGPIILSQVENEYGYYENAYGEGGKRYALWAAKM 224
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A++ NT VPW+MCQQ DAPDP+I+TCN FYCD F P SP+KP +WTEN+ GWF +FG
Sbjct: 225 ALSQNTGVPWIMCQQYDAPDPVIDTCNSFYCDQFKPISPNKPKIWTENWPGWFKTFGARD 284
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RP ED+A++VARFF+ GG+ QNYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG R
Sbjct: 285 PHRPAEDVAYSVARFFQKGGSVQNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLPR 344
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
PKWGHL+ELHK IK CE L+++DPT LG EA +Y +S CAAFLAN D +D
Sbjct: 345 FPKWGHLKELHKVIKSCEHALLNNDPTLLSLGPLQEADVYEDASGACAAFLANMDDKNDK 404
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGD------HPFAQQKNVNELLL 417
V F Y LPAWSVSILPDCKNV FNTAKV Q + + HP A + +
Sbjct: 405 VVQFRHVSYHLPAWSVSILPDCKNVAFNTAKVGCQTSIVNMAPIDLHPTASSPKRD---I 461
Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLN-- 475
S + ++E G+ G F + + INTTKD +DYLWYT SI V +E FL
Sbjct: 462 KSLQWEVFKETAGVWGVADFTKNGFVDHINTTKDATDYLWYTTSIFV---HAEEDFLRNR 518
Query: 476 ------IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQ 529
+ES GHA VF+NKKL A GN F I L G N + +LSM VGLQ
Sbjct: 519 GTAMLFVESKGHAMHVFINKKLQASASGNGTVPQFKFGTPIALKAGKNEIALLSMTVGLQ 578
Query: 530 NYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG 589
GA+++ GAG SV + K G DL++ W Y++G++GE++ + K S W
Sbjct: 579 TAGAFYEWIGAGPTSVKVAGFKTGTMDLTASAWTYKIGLQGEHLRIQKSYNLKSKIWAPT 638
Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
S P + L WYK AP G P+AL++ MGKG AW+NGQ IGRYW + C
Sbjct: 639 SQPPKQQPLTWYKAVVDAPPGNEPVALDMIHMGKGMAWLNGQEIGRYWPRRTSKYENCVT 698
Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
+CDYRG ++ KC CGQP Q YH+PR+W P N+L+I EE+GGDPS+I +
Sbjct: 699 QCDYRGKFNPDKCVTGCGQPTQRWYHVPRSWFKPSGNVLIIFEEIGGDPSQIRFSMRKVS 758
Query: 710 HICSFVSEADPPPVDSWKPNLGVVSSS---PQVRLACERGWHIAAINFASYGIPEGNCGS 766
C +S D P D + S P + L C +I+++ FAS+G P G CGS
Sbjct: 759 GACGHLS-VDHPSFDVENLQGSEIESDKNRPTLSLKCPTNTNISSVKFASFGNPNGTCGS 817
Query: 767 FRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ G CH + +V+K C+ Q EC++ +SSA + CP +K LAVE +CS
Sbjct: 818 YMLGDCHDQNSAALVEKVCLNQNECALEMSSANFNMQ--LCPSTVKKLAVEVNCS 870
>gi|359480881|ref|XP_003632537.1| PREDICTED: beta-galactosidase 3-like [Vitis vinifera]
gi|296082595|emb|CBI21600.3| unnamed protein product [Vitis vinifera]
Length = 847
Score = 845 bits (2182), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/831 (50%), Positives = 542/831 (65%), Gaps = 13/831 (1%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
L+ANVTYD R+L+IDG+R++L S SIHYPRS P +WP L++ +KEGG++VIETYVFWN H
Sbjct: 19 LAANVTYDRRSLIIDGQRKLLISASIHYPRSVPGMWPGLVKTAKEGGIDVIETYVFWNGH 78
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
E YYF GR+DL++FVK VQ+A ++L LR+GP+ AEWN+GG PVWLH++PG FRT
Sbjct: 79 ELSPDNYYFGGRYDLLKFVKIVQQARMYLILRVGPFVAAEWNFGGVPVWLHYVPGTVFRT 138
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
+ PFK M++F+ I+++MK+E LFASQGGPIILAQVENEYG+ E YG GG+ Y WA
Sbjct: 139 NSEPFKYHMQKFMTLIVNIMKKEKLFASQGGPIILAQVENEYGDTERIYGDGGKPYAMWA 198
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A+ A++ N VPW+MCQQ DAPDP+INTCN FYCD FTPNSP+KP MWTEN+ GWF +FG
Sbjct: 199 ANMALSQNIGVPWIMCQQYDAPDPVINTCNSFYCDQFTPNSPNKPKMWTENWPGWFKTFG 258
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
P RP ED+AF+VARFF+ GG+ QNYYMY GGTNFGRT+GGP + TSYDY+APIDEYG
Sbjct: 259 APDPHRPHEDIAFSVARFFQKGGSLQNYYMYHGGTNFGRTSGGPFITTSYDYNAPIDEYG 318
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
R PKWGHL+ELH+AIK CE L+ +P + LG E +Y SS CAAF++N D
Sbjct: 319 LARLPKWGHLKELHRAIKSCEHVLLYGEPINLSLGPSQEVDVYTDSSGGCAAFISNVDEK 378
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDH-PFAQQKNVNELLLAS 419
D + F Y +PAWSVSILPDCKNVVFNTAKV SQ + + P Q ++
Sbjct: 379 EDKIIVFQNVSYHVPAWSVSILPDCKNVVFNTAKVGSQTSQVEMVPEELQPSLVPSNKDL 438
Query: 420 SAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQG--KEV--- 472
W + EK GI G FV+ + INTTKDT+DYLWYT S+ V + KE+
Sbjct: 439 KGLQWETFVEKAGIWGEADFVKNGFVDHINTTKDTTDYLWYTVSLTVGESENFLKEISQP 498
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L +ES GHA FVN+KL GN + F I L G N + +LSM VGLQN G
Sbjct: 499 VLLVESKGHALHAFVNQKLQGSASGNGSHSPFKFECPISLKAGKNDIALLSMTVGLQNAG 558
Query: 533 AWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
+++ GAGL SV + L NG DLS+ W Y++G++GE++ + K NS W
Sbjct: 559 PFYEWVGAGLTSVKIKGLNNGIMDLSTYTWTYKIGLQGEHLLIYKPEGLNSVKWLSTPEP 618
Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
P + L WYK P G P+ L++ MGKG AW+NG+ IGRYW + C ++CD
Sbjct: 619 PKQQPLTWYKAVVDPPSGNEPIGLDMVHMGKGLAWLNGEEIGRYWPRKSSIHDKCVQECD 678
Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHIC 712
YRG + +KC CG+P Q YH+PR+W P N+LVI EE GGDP+KI + +C
Sbjct: 679 YRGKFMPNKCSTGCGEPTQRWYHVPRSWFKPSGNILVIFEEKGGDPTKIRFSRRKTTGVC 738
Query: 713 SFVSEADPP-PVDSWKPNLGVVSSSP-QVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
+ VSE P ++SW + + + + L C HI+++ FASYG P G CGS+ G
Sbjct: 739 ALVSEDHPTYELESWHKDANENNKNKATIHLKCPENTHISSVKFASYGTPTGKCGSYSQG 798
Query: 771 ACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CH + +V+K C+ + +C+I ++ S CP K LAVEA CS
Sbjct: 799 DCHDPNSASVVEKLCIRKNDCAIELAEK--NFSKDLCPSTTKKLAVEAVCS 847
>gi|302799737|ref|XP_002981627.1| hypothetical protein SELMODRAFT_421090 [Selaginella moellendorffii]
gi|300150793|gb|EFJ17442.1| hypothetical protein SELMODRAFT_421090 [Selaginella moellendorffii]
Length = 874
Score = 845 bits (2182), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/862 (49%), Positives = 573/862 (66%), Gaps = 62/862 (7%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ N++YDHRA++I G+RR+L SG IHYPR++P++WP LIR +KEGGL++I+TYVFW+ HE
Sbjct: 20 ATNISYDHRAIIIGGQRRILISGCIHYPRASPQMWPALIRNAKEGGLDMIDTYVFWDGHE 79
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G Y F+GR+DL+RF+K V +AGL+++LRIGPY CAEWN+GGFP WL +PGIQFRT
Sbjct: 80 PSPGIYNFQGRYDLIRFLKLVHQAGLYVNLRIGPYVCAEWNFGGFPAWLLKLPGIQFRTH 139
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N F+++M+ F+ KI+D++K E LFASQGGP++ +Q+ENEYGNV+ +YG+ G+ Y+ WAA
Sbjct: 140 NRAFEDKMEEFVRKIVDMVKSEQLFASQGGPVLFSQIENEYGNVQGSYGINGKTYMLWAA 199
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A +L T VPW+MC+Q DAPD IINTCNG+YCDG+ PNS KP MWTEN+SGW+ S+G
Sbjct: 200 RMAKDLETGVPWIMCKQPDAPDYIINTCNGYYCDGWKPNSRDKPAMWTENWSGWYQSWGE 259
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYM------------------YFGGTNFGRTAGG 283
A P+R VED+AFAVARFF+ GG QNYYM YFGGTNFGRT+GG
Sbjct: 260 AAPYRTVEDVAFAVARFFQRGGVAQNYYMVRTLHDLEQRLLMPERCQYFGGTNFGRTSGG 319
Query: 284 PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLG---AKLEA 340
P + TSYDYDAP+DE+G +RQPKWGHL+ELH A+KLCE L S+DP + LG ++A
Sbjct: 320 PFITTSYDYDAPLDEFGMLRQPKWGHLKELHAALKLCETALTSNDPVYYTLGRMQEMVQA 379
Query: 341 HIYHKSSND---------CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFN 391
H+Y S + CAAFLAN D+SS A+V F G VY LP WSVSILPDC+NVVFN
Sbjct: 380 HVYSDGSLEANFSNLATPCAAFLANIDTSS-ASVKFGGKVYNLPPWSVSILPDCRNVVFN 438
Query: 392 TAKVISQRNNGDHPFAQQKNVNEL--------LLASSAFSWYEEKVGISGNRSFVRPDLA 443
TA+V +Q + Q+ ++ E L+ A+ W++E VG SG + L
Sbjct: 439 TAQVSAQTSVTKMVAVQKPSLIEEVSGSYTPGLVEQLAWEWFQEPVGGSGINKILAHALL 498
Query: 444 EQINTTKDTSDYLWYTASIHVMPGQ--GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDF 501
EQI+TT D++DY+WY+ ++ + G + L I S+ +FVN +
Sbjct: 499 EQISTTNDSTDYMWYSTRFEILDQELKGGDPVLVITSMRDMVHIFVNGEFAGSTSTLKSG 558
Query: 502 ANFL-INKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILID-LKNGKRDLSS 559
+ + + I L G+N L ILS VGLQNYGA + GAG+ I I L G R+L+S
Sbjct: 559 GLYARVQQPIHLKAGVNHLAILSATVGLQNYGAHLETHGAGITGSIWIQGLSTGTRNLTS 618
Query: 560 GEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLA 619
W++QVG+ GE+ D I+ W ++LP + L+WYK F P+G P+A++L
Sbjct: 619 ALWLHQVGLNGEH---DAIT------WSSTTSLPFFQPLVWYKANFNIPDGDDPVAIHLG 669
Query: 620 SMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRT 679
SMGKGQAWVNG S+GR+W APSTGC+ +CDYRG+Y +SKC CG P+Q YH+PR
Sbjct: 670 SMGKGQAWVNGHSLGRFWPVITAPSTGCSDRCDYRGTYYSSKCLSSCGLPSQEWYHVPRE 729
Query: 680 WVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQV 739
W+ +N LV+ EE+GG+ S +S ++ +C+ VSE PPV + SS P++
Sbjct: 730 WLVNEKNTLVLLEEIGGNVSGVSFASRVVDRVCAQVSEYSLPPVAQF-------SSLPEL 782
Query: 740 RLACERGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAY 798
L+C G I++I FAS+G P+G CG+F+ G+CH ++ IV+KAC+G+ CS +
Sbjct: 783 GLSCSPGQFISSIFFASFGNPKGRCGAFQKGSCHALESETIVEKACIGRQSCSFEIFWKN 842
Query: 799 LGVSAGACPGLLKALAVEAHCS 820
G CPG K LAVEA C+
Sbjct: 843 FGTD--PCPGKAKTLAVEAACT 862
>gi|350537729|ref|NP_001234307.1| beta-galactosidase, chloroplastic precursor [Solanum lycopersicum]
gi|7939621|gb|AAF70823.1|AF154422_1 beta-galactosidase [Solanum lycopersicum]
Length = 870
Score = 845 bits (2182), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 419/835 (50%), Positives = 537/835 (64%), Gaps = 27/835 (3%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+VTYD R+L+I+G+R++L S SIHYPRS P +WP L+R +KEGG++VIETYVFWN HEP
Sbjct: 45 SVTYDRRSLIINGQRKLLISASIHYPRSVPAMWPGLVRLAKEGGVDVIETYVFWNGHEPS 104
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
G YYF GRFDLV+F K +Q+AG+++ LRIGP+ AEWN+GG PVWLH++PG FRT +
Sbjct: 105 PGNYYFGGRFDLVKFCKIIQQAGMYMILRIGPFVAAEWNFGGLPVWLHYVPGTTFRTDSE 164
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK M++F+ ++LMK+E LFASQGGPIIL+QVENEYG E AYG GG+ Y WAA
Sbjct: 165 PFKYHMQKFMTYTVNLMKRERLFASQGGPIILSQVENEYGYYENAYGEGGKRYALWAAKM 224
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A++ NT VPW+MCQQ DAPDP+I+TCN FYCD F P SP+KP +WTEN+ GWF +FG
Sbjct: 225 ALSQNTGVPWIMCQQYDAPDPVIDTCNSFYCDQFKPISPNKPKIWTENWPGWFKTFGARD 284
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RP ED+A++VARFF+ GG+ QNYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG R
Sbjct: 285 PHRPAEDVAYSVARFFQKGGSVQNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLPR 344
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
PKWGHL+ELHK IK CE L+++DPT LG EA +Y +S CAAFLAN D +D
Sbjct: 345 FPKWGHLKELHKVIKSCEHALLNNDPTLLSLGPLQEADVYEDASGACAAFLANMDDKNDK 404
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGD------HPFAQQKNVNELLL 417
V F Y LPAWSVSILPDCKNV FNTAKV Q + + HP A + +
Sbjct: 405 VVQFRHVSYHLPAWSVSILPDCKNVAFNTAKVGCQTSIVNMAPIDLHPTASSPKRD---I 461
Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLN-- 475
S + ++E G+ G F + + INTTKD +DYLWYT SI V +E FL
Sbjct: 462 KSLQWEVFKETAGVWGVADFTKNGFVDHINTTKDATDYLWYTTSIFV---HAEEDFLRNR 518
Query: 476 ------IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQ 529
+ES GHA VF+NKKL A GN F I L G N + +LSM VGLQ
Sbjct: 519 GTAMLFVESKGHAMHVFINKKLQASASGNGTVPQFKFGTPIALKAGKNEISLLSMTVGLQ 578
Query: 530 NYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG 589
GA+++ GAG SV + K G DL++ W Y++G++GE++ + K S W
Sbjct: 579 TAGAFYEWIGAGPTSVKVAGFKTGTMDLTASAWTYKIGLQGEHLRIQKSYNLKSKIWAPT 638
Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
S P + L WYK AP G P+AL++ MGKG AW+NGQ IGRYW + C
Sbjct: 639 SQPPKQQPLTWYKAVVDAPPGNEPVALDMIHMGKGMAWLNGQEIGRYWPRRTSKYENCVT 698
Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
+CDYRG ++ KC CGQP Q YH+PR+W P N+L+I EE+GGDPS+I +
Sbjct: 699 QCDYRGKFNPDKCVTGCGQPTQRWYHVPRSWFKPSGNVLIIFEEIGGDPSQIRFSMRKVS 758
Query: 710 HICSFVSEADPPPVDSWKPNLGVVSSS---PQVRLACERGWHIAAINFASYGIPEGNCGS 766
C +S D P D + + P + L C +I+++ FAS+G P G CGS
Sbjct: 759 GACGHLS-VDHPSFDVENLQGSEIENDKNRPTLSLKCPTNTNISSVKFASFGNPNGTCGS 817
Query: 767 FRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ G CH + +V+K C+ Q EC++ +SSA + CP +K LAVE +CS
Sbjct: 818 YMLGDCHDQNSAALVEKVCLNQNECALEMSSANFNMQ--LCPSTVKKLAVEVNCS 870
>gi|357130338|ref|XP_003566806.1| PREDICTED: beta-galactosidase 2-like [Brachypodium distachyon]
Length = 831
Score = 844 bits (2181), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/824 (51%), Positives = 548/824 (66%), Gaps = 29/824 (3%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD +A+V++G+RR+L SGSIHYPRS PE+WP+LI+K+K+GGL+V++TYVFWN HEP
Sbjct: 29 VTYDRKAVVVNGQRRILLSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSP 88
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
GQY+FEGR+DLV F+K V++AGL++HLRIGPY CAEWN+GGFP+WL ++PGI FRT N P
Sbjct: 89 GQYHFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPIWLKYVPGISFRTDNEP 148
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK EM++F KI+ +MK E LF QGGPIIL+Q+ENE+G +EW G + Y WAA+ A
Sbjct: 149 FKAEMQKFTTKIVQMMKSERLFEWQGGPIILSQIENEFGPLEWDQGEPAKDYASWAANMA 208
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
+ LNT VPW+MC+++DAPDPIINTCNGFYCD F+PN P KP MWTE ++ W+ FG VP
Sbjct: 209 MALNTGVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTAWYTGFGIPVP 268
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
RPVEDLA+ VA+F + GG+F NYYMY GGTNF RTAGGP +ATSYDYDAP+DEYG +R+
Sbjct: 269 HRPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFERTAGGPFIATSYDYDAPLDEYGLLRE 328
Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
PKWGHL+ELH+AIKLCE L+++DP LG +A ++ S+ CAAFL N S A
Sbjct: 329 PKWGHLKELHRAIKLCEPALVAADPILSSLGNAQKASVFRSSTGACAAFLENKHKLSYAR 388
Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
V+FNG Y LP WS+SILPDCK VFNTA+V SQ +Q K E + S+
Sbjct: 389 VSFNGMHYDLPPWSISILPDCKTTVFNTARVGSQ-------ISQMK--MEWAGGLTWQSY 439
Query: 425 YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESL 479
EE S SF L EQIN T+D +DYLWYT + V + GK L + S
Sbjct: 440 NEEINSFSELESFTTVGLLEQINMTRDNTDYLWYTTYVDVAKDEQFLTSGKNPKLTVMSA 499
Query: 480 GHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAG 539
GHA VF+N +L YG+ + K++L G NT+ LS+ VGL N G F+
Sbjct: 500 GHALHVFINGQLSGTVYGSVENPKLTYTGKVKLWSGSNTISCLSIAVGLPNVGEHFETWN 559
Query: 540 AGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS- 597
AG+ + +D L GKRDL+ +W YQVG++GE + L +S ++S W + PV K
Sbjct: 560 AGILGPVTLDGLNEGKRDLTWQKWTYQVGLKGEAMSLHSLSGSSSVEWGE----PVQKQP 615
Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
L WYK F AP+G PLAL++ SMGKGQ W+NGQ IGRYW Y A +G CDYRG Y
Sbjct: 616 LTWYKAFFNAPDGDEPLALDMNSMGKGQIWINGQGIGRYWPGYKA--SGTCGHCDYRGEY 673
Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
+ +KCQ +CG P+Q YH+PR W++P NLLVI EE GGDP+ IS++ +T +C+ VSE
Sbjct: 674 NETKCQTNCGDPSQRWYHVPRPWLNPTGNLLVIFEEWGGDPTGISMVKRTTGSVCADVSE 733
Query: 718 ADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHMD-V 776
P + +W+ +V L C+ G I I FAS+G P+G+CG++ G CH
Sbjct: 734 WQ-PSIKNWRTK---DYEKAEVHLQCDHGRKITEIKFASFGTPQGSCGNYSEGGCHAHRS 789
Query: 777 LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
I +K C+ Q C + V G CPG +K VE CS
Sbjct: 790 YDIFKKNCINQEWCGVSVVPEAFG--GDPCPGTMKRAVVEVTCS 831
>gi|61162196|dbj|BAD91080.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 851
Score = 842 bits (2176), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/827 (51%), Positives = 539/827 (65%), Gaps = 13/827 (1%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NV+YD R+L+IDG+R++L S +IHYPRS PE+WP+L++ +KEGG++VIETYVFWN HEP
Sbjct: 28 NVSYDSRSLIIDGQRKLLISAAIHYPRSVPEMWPKLVQTAKEGGVDVIETYVFWNGHEPS 87
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
G YYF GR+DLV+FVK V++AG+ L LRIGP+ AEW +GG PVWLH++PG FRT N
Sbjct: 88 PGNYYFGGRYDLVKFVKIVEQAGMHLILRIGPFVAAEWYFGGIPVWLHYVPGTVFRTENK 147
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK M++F I+DLMKQE FASQGGPIILAQVENEYG E YG GG+ Y WAA
Sbjct: 148 PFKYHMQKFTTFIVDLMKQEKFFASQGGPIILAQVENEYGYYEKDYGEGGKQYAMWAASM 207
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
AV+ N VPW+MCQQ DAP+ +INTCN FYCD FTP +KP +WTEN+ GWF +FG
Sbjct: 208 AVSQNIGVPWIMCQQFDAPESVINTCNSFYCDQFTPIYQNKPKIWTENWPGWFKTFGGWN 267
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RP ED+AF+VARFF+ GG+ NYYMY GGTNFGRT+GGP + TSYDY+APIDEYG R
Sbjct: 268 PHRPAEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGLPR 327
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
PKWGHL++LH+AIKLCE +++S PT+ LG LEA ++ SS CAAF+AN D +D
Sbjct: 328 LPKWGHLKQLHRAIKLCEHIMLNSQPTNVSLGPSLEADVFTNSSGACAAFIANMDDKNDK 387
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDH-PFAQQKNVNELLLASSAF 422
V F Y LPAWSVSILPDCKNVVFNTAKV SQ + + P + Q +V +
Sbjct: 388 TVEFRNMSYHLPAWSVSILPDCKNVVFNTAKVGSQSSVVEMLPESLQLSVGSADKSLKDL 447
Query: 423 SW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
W + EK GI G FV+ L + INTTK T+DYLWYT SI V + G L
Sbjct: 448 KWDVFVEKAGIWGEADFVKSGLVDHINTTKFTTDYLWYTTSILVGENEEFLKKGSSPVLL 507
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
IES GHA FVN++L A GN F + I L EG N + +LSM VGLQN G+++
Sbjct: 508 IESKGHAVHAFVNQELQASAAGNGTHFPFKLKAPISLKEGKNDIALLSMTVGLQNAGSFY 567
Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
+ GAGL SV + NG DLS+ W Y++G+EGE+ GLDK + W S P
Sbjct: 568 EWVGAGLTSVKIQGFNNGTIDLSAYNWTYKIGLEGEHQGLDKEEGFGNVNWISASEPPKE 627
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
+ L WYK P G P+ L++ MGKG AW+NG+ IGRYW P GC K+C+YRG
Sbjct: 628 QPLTWYKVIVDPPPGDDPVGLDMIHMGKGLAWLNGEEIGRYWPRK-GPLHGCVKECNYRG 686
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
+D KC CG+P Q YH+PR+W N+LVI EE GGDPSKI + +C+ V
Sbjct: 687 KFDPDKCNTGCGEPTQRWYHVPRSWFKQSGNVLVIFEEKGGDPSKIEFSRRKITGVCALV 746
Query: 716 SEADPP-PVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH- 773
+E P ++SW G + + L C HI+++ FAS+G P G C S+ G CH
Sbjct: 747 AENYPSIDLESWNDGSGSNKTVATIHLGCPEDTHISSVKFASFGNPTGACRSYTQGDCHD 806
Query: 774 MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ + +V+K C+ + C I ++ + G+C K LAVE C+
Sbjct: 807 PNSISVVEKVCLNKNRCDIELTGE--NFNKGSCLSEPKKLAVEVQCN 851
>gi|302759477|ref|XP_002963161.1| hypothetical protein SELMODRAFT_404798 [Selaginella moellendorffii]
gi|300168429|gb|EFJ35032.1| hypothetical protein SELMODRAFT_404798 [Selaginella moellendorffii]
Length = 874
Score = 842 bits (2175), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/862 (49%), Positives = 572/862 (66%), Gaps = 62/862 (7%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ N++YDHRA++I G+RR+L SG +HYPR++P++WP LIR +KEGGL++I+TYVFW+ HE
Sbjct: 20 ATNISYDHRAIIIGGQRRILISGCLHYPRASPQMWPALIRNAKEGGLDMIDTYVFWDGHE 79
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G Y F+GR+DL+RF+K V +AGL+++LRIGPY CAEWN+GGFP WL +PGIQFRT
Sbjct: 80 PSPGIYNFQGRYDLIRFLKLVHQAGLYVNLRIGPYVCAEWNFGGFPAWLLKLPGIQFRTH 139
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N F+++M+ F+ KI+D++K E LFASQGGP++ +Q+ENEYGNV+ +YG G+ Y+ WAA
Sbjct: 140 NRAFEDKMEEFVRKIVDMVKSEQLFASQGGPVLFSQIENEYGNVQGSYGTNGKTYMLWAA 199
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A +L T VPW+MC+Q DAPD IINTCNG+YCDG+ PNS KP MWTEN+SGW+ +G
Sbjct: 200 RMAKDLETGVPWIMCKQPDAPDYIINTCNGYYCDGWKPNSRDKPAMWTENWSGWYQLWGE 259
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYM------------------YFGGTNFGRTAGG 283
A P+R VED+AFAVARFF+ GG QNYYM YFGGTNFGRT+GG
Sbjct: 260 AAPYRTVEDVAFAVARFFQRGGVAQNYYMVRMLHDLEQHLLMPERCQYFGGTNFGRTSGG 319
Query: 284 PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLG---AKLEA 340
P + TSYDYDAP+DE+G +RQPKWGHL+ELH A+KLCE L S+DP + LG ++A
Sbjct: 320 PFITTSYDYDAPLDEFGMLRQPKWGHLKELHAALKLCETALTSNDPLYYTLGRMQEMVQA 379
Query: 341 HIYHKSSND---------CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFN 391
H+Y S + CAAFLAN D+SS A+V F GNVY LP WSVSILPDC+NVVFN
Sbjct: 380 HVYSDGSLEANFSNLATPCAAFLANIDTSS-ASVKFGGNVYNLPPWSVSILPDCRNVVFN 438
Query: 392 TAKVISQRNNGDHPFAQQKNVNEL--------LLASSAFSWYEEKVGISGNRSFVRPDLA 443
TA+V +Q + Q+ ++ E L+ A+ W++E VG SG + L
Sbjct: 439 TAQVSAQTSVTKMVAVQKPSLIEEVSGSYTPGLVEQLAWEWFQEPVGGSGINKILAHALL 498
Query: 444 EQINTTKDTSDYLWYTASIHVMPGQ--GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDF 501
EQI+TT D++DYLWY+ + + G + L I S+ +FVN +
Sbjct: 499 EQISTTNDSTDYLWYSTRFEISDQELKGGDPVLVITSMRDMVHIFVNGEFAGSTSTLKSG 558
Query: 502 ANFL-INKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILID-LKNGKRDLSS 559
+ + + I L G+N L ILS VGLQNYGA + GAG+ + I L G R+L+S
Sbjct: 559 GLYARVQQPIHLKAGVNHLAILSATVGLQNYGAHLETHGAGITGSVWIQGLSTGTRNLTS 618
Query: 560 GEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLA 619
W++QVG+ GE+ D I+ W ++LP + L+WYK F P+G P+A++L
Sbjct: 619 ALWLHQVGLNGEH---DAIT------WSSTTSLPFFQPLVWYKANFNIPDGDDPVAIHLG 669
Query: 620 SMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRT 679
SMGKGQAWVNG S+GR+W A APSTGC+ +CDYRG+Y +SKC CG P+Q YH+PR
Sbjct: 670 SMGKGQAWVNGHSLGRFWPAITAPSTGCSDRCDYRGTYYSSKCLSGCGLPSQEWYHVPRE 729
Query: 680 WVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQV 739
W+ +N LV+ EE+GG+ S +S ++ +C+ VSE PPV + SS P++
Sbjct: 730 WLVNEKNTLVLLEEIGGNVSGVSFASRVVDRVCAQVSEYSLPPVAQF-------SSLPEL 782
Query: 740 RLACERGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAY 798
L+C G I++I FAS+G P+G CG+F+ G+CH ++ IV+KAC+G+ CS +
Sbjct: 783 GLSCSPGQFISSIFFASFGNPKGRCGAFQKGSCHALESETIVEKACIGRQSCSFEIFWKN 842
Query: 799 LGVSAGACPGLLKALAVEAHCS 820
G CPG K LAVEA C+
Sbjct: 843 FGTD--PCPGKAKTLAVEAACT 862
>gi|242084926|ref|XP_002442888.1| hypothetical protein SORBIDRAFT_08g004410 [Sorghum bicolor]
gi|241943581|gb|EES16726.1| hypothetical protein SORBIDRAFT_08g004410 [Sorghum bicolor]
Length = 923
Score = 842 bits (2174), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/854 (50%), Positives = 560/854 (65%), Gaps = 47/854 (5%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NVTYDHRAL++ GKRR+L S +HYPR+TPE+WP LI K+KEGG++VIETY+FWN HEP
Sbjct: 68 NVTYDHRALILGGKRRMLVSAGLHYPRATPEMWPSLIAKAKEGGVDVIETYIFWNGHEPA 127
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+GQYYFEGRFD+VRF K V GLFL LRIGPYACAEWN+GGFPVWL IPGI+FRT N
Sbjct: 128 KGQYYFEGRFDIVRFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTDNE 187
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
P+K EM+ F+ KI+D+MK+E L++ QGGPIIL Q+ENEYGN++ YG G+ Y++WAA
Sbjct: 188 PYKAEMQNFVTKIVDIMKEEKLYSWQGGPIILQQIENEYGNIQGKYGQAGKRYMQWAAQM 247
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A+ L+T VPWVMC+Q DAP+ I++TCN FYCDGF PNS +KP +WTE++ GW+ +G A+
Sbjct: 248 ALALDTGVPWVMCRQTDAPEQILDTCNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGEAL 307
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RP +D AFAVARF++ GG+FQNYYMYFGGTNF RTAGGPL TSYDYDAPIDEYG +R
Sbjct: 308 PHRPAQDSAFAVARFYQRGGSFQNYYMYFGGTNFERTAGGPLQITSYDYDAPIDEYGILR 367
Query: 304 QPKWGHLRELHKAIKLCEEYLISSD--PTHQKLGAKLEAHIYHK-----------SSNDC 350
QPKWGHL++LH AIKLCE L + D P + KLG EAH+Y ++ C
Sbjct: 368 QPKWGHLKDLHAAIKLCEPALTAVDGSPRYIKLGPMQEAHVYSSENVHTNGSISGNAQFC 427
Query: 351 AAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN-----NGDHP 405
+AFLAN D A+V G Y LP WSVSILPDC+ V FNTA+V +Q + +G
Sbjct: 428 SAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCETVAFNTARVGTQTSFFNVESGSPS 487
Query: 406 FAQQKNVNELLLASSAFS--WY--EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTAS 461
++ + L L S W+ +E VGI F + E +N TKD SDYL YT
Sbjct: 488 YSSRHKPRILSLGGPYLSSTWWASKEPVGIWSEDIFAAQGILEHLNVTKDISDYLSYTTR 547
Query: 462 IHVMP-------GQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNE 514
+++ +G L I+ + +FVN KL G+ +N+ ++L +
Sbjct: 548 VNISDEDVLYWNSEGLLPSLTIDQIRDVVRIFVNGKLAGSQVGHW----VSLNQPLQLVQ 603
Query: 515 GINTLDILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYI 573
G+N L +LS +VGLQNYGA+ + GAG V L L NG DL++ W YQ+G++GE+
Sbjct: 604 GLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVKLTGLSNGDIDLTNSLWTYQIGLKGEFS 663
Query: 574 GLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSI 633
+ S+ W W+KTTF APEG GP+A++L SMGKGQAWVNG I
Sbjct: 664 RIYSPEKQGSAGWSSMQNDDTLSPFTWFKTTFDAPEGNGPVAIDLGSMGKGQAWVNGHLI 723
Query: 634 GRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEE 693
GRYWS +AP +GC C+Y G+Y SKC+ +CG Q+ YHIPR W+ +NLLV+ EE
Sbjct: 724 GRYWS-LVAPESGCPSSCNYAGNYGDSKCRSNCGIATQSWYHIPREWLQESDNLLVLFEE 782
Query: 694 LGGDPSKISLLTKTGQHICSFVSEADPPPVDSW------KPNLGVVSSSPQVRLACERGW 747
GGDPS+ISL + ICS +SE PP+ +W +P++ V +P++RL C+ G
Sbjct: 783 TGGDPSQISLEVHYTKTICSKISETYYPPLSAWSRAANGRPSVNTV--APELRLQCDEGH 840
Query: 748 HIAAINFASYGIPEGNCGSFRPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGAC 806
I+ I FASYG P G+C +F G CH L +V +AC G+ C+I V++ G C
Sbjct: 841 VISKITFASYGTPTGDCQNFSVGNCHASTTLDLVAEACEGKNRCAISVTNDVFG---DPC 897
Query: 807 PGLLKALAVEAHCS 820
++K LAV A CS
Sbjct: 898 RKVVKDLAVVAECS 911
>gi|108706355|gb|ABF94150.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 819
Score = 842 bits (2174), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/785 (53%), Positives = 531/785 (67%), Gaps = 35/785 (4%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD +A+++DG+RR+L SGSIHYPRSTPE+W LI K+K+GGL+VI+TYVFWN HEP
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G Y FEGR+DLVRF+KTVQ+AG+F+HLRIGPY C EWN+GGFPVWL ++PGI FRT N P
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK M+ F KI+ +MK ENLFASQGGPIIL+Q+ENEYG +G G+ Y+ WAA A
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGKAYINWAAKMA 206
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
V L+T VPWVMC+++DAPDP+IN CNGFYCD F+PN P KP MWTE +SGWF FG +
Sbjct: 207 VGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSGWFTEFGGTIR 266
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
RPVEDLAF VARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG R+
Sbjct: 267 QRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 326
Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
PK+GHL+ELH+A+KLCE+ L+S+DPT LG+ EAH++ +SS+ CAAFLANY+S+S A
Sbjct: 327 PKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVF-RSSSGCAAFLANYNSNSYAK 385
Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
V FN Y LP WS+SILPDCKNVVFNTA V Q N + +S+ W
Sbjct: 386 VIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTN----------QMQMWADGASSMMW 435
Query: 425 --YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
Y+E+V ++ L EQ+N T+DTSDYLWY S+ V P + G + L +
Sbjct: 436 EKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPSEKFLQGGTPLSLTV 495
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
+S GHA VF+N +L YG + + L G N + +LS+ GL N G ++
Sbjct: 496 QSAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVACGLPNVGVHYE 555
Query: 537 VAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
G+ ++I L G RDL+ W YQVG++GE + L+ + + S W QGS + N
Sbjct: 556 TWNTGVVGPVVIHGLDEGSRDLTWQTWSYQVGLKGEQMNLNSLEGSGSVEWMQGSLVAQN 615
Query: 596 KS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
+ L WY+ F P G PLAL++ SMGKGQ W+NGQSIGRYW+AY + G K C Y
Sbjct: 616 QQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAY---AEGDCKGCHYT 672
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
GSY A KCQ CGQP Q YH+PR+W+ P NLLV+ EELGGD SKI+L +T +C+
Sbjct: 673 GSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALAKRTVSGVCAD 732
Query: 715 VSEADPPPVDSWK------PNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFR 768
VSE P + +W+ P + +V L C G I+AI FAS+G P G CG+F+
Sbjct: 733 VSEYH-PNIKNWQIESYGEPEF----HTAKVHLKCAPGQTISAIKFASFGTPLGTCGTFQ 787
Query: 769 PGACH 773
G CH
Sbjct: 788 QGECH 792
>gi|356518796|ref|XP_003528063.1| PREDICTED: beta-galactosidase 10-like [Glycine max]
Length = 898
Score = 841 bits (2172), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/830 (50%), Positives = 540/830 (65%), Gaps = 16/830 (1%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
SANV+YD R+L+ID +R++L S SIHYPRS P +WP L++ +KEGG++VIETYVFWN HE
Sbjct: 74 SANVSYDGRSLIIDAQRKLLISASIHYPRSVPAMWPGLVQTAKEGGVDVIETYVFWNGHE 133
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
G YYF GRFDLV+F +TVQ+AG++L LRIGP+ AEWN+GG PVWLH++PG FRT
Sbjct: 134 LSPGNYYFGGRFDLVKFAQTVQQAGMYLILRIGPFVAAEWNFGGVPVWLHYVPGTVFRTY 193
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PF M++F I++LMKQE LFASQGGPIILAQ+ENEYG E Y G+ Y WAA
Sbjct: 194 NQPFMYHMQKFTTYIVNLMKQEKLFASQGGPIILAQIENEYGYYENFYKEDGKKYALWAA 253
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV+ NT VPW+MCQQ DAPDP+I+TCN FYCD FTP SP++P +WTEN+ GWF +FG
Sbjct: 254 KMAVSQNTGVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPNRPKIWTENWPGWFKTFGG 313
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
P RP ED+AF+VARFF+ GG+ NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG
Sbjct: 314 RDPHRPAEDVAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYDAPVDEYGL 373
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
R PKWGHL+ELH+AIKLCE L++ + LG +EA +Y SS CAAF++N D +
Sbjct: 374 PRLPKWGHLKELHRAIKLCEHVLLNGKSVNISLGPSVEADVYTDSSGACAAFISNVDDKN 433
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
D V F + LPAWSVSILPDCKNVVFNTAKV SQ + + ++++ ++
Sbjct: 434 DKTVEFRNASFHLPAWSVSILPDCKNVVFNTAKVTSQTSVVAMVPESLQQSDKVV---NS 490
Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
F W +EK GI G FV+ + INTTKDT+DYLW+T SI V + G + L
Sbjct: 491 FKWDIVKEKPGIWGKADFVKNGFVDLINTTKDTTDYLWHTTSIFVSENEEFLKKGNKPVL 550
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
IES GHA FVN++ G GN A F I L G N + +L + VGLQ G +
Sbjct: 551 LIESTGHALHAFVNQEYEGTGSGNGTHAPFTFKNPISLRAGKNEIALLCLTVGLQTAGPF 610
Query: 535 FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+D GAGL SV + L NG DLSS W Y++GV+GEY+ L + + N+ W S P
Sbjct: 611 YDFVGAGLTSVKIKGLNNGTIDLSSYAWTYKIGVQGEYLRLYQGNGLNNVNWTSTSEPPK 670
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLA-PSTGCTKKCDY 653
+ L WYK AP G P+ L++ MGKG AW+NG+ IGRYW S C K+CDY
Sbjct: 671 MQPLTWYKAIVDAPPGDEPVGLDMLHMGKGLAWLNGEEIGRYWPRKSEFKSEDCVKECDY 730
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
RG ++ KC CG+P Q YH+PR+W P N+LV+ EE GGDP KI + + C+
Sbjct: 731 RGKFNPDKCDTGCGEPTQRWYHVPRSWFKPSGNILVLFEEKGGDPEKIKFVRRKVSGACA 790
Query: 714 FVSEADPPP--VDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
V+E P V + + + P RLAC I+A+ FAS+G P G CGS+ G
Sbjct: 791 LVAEDYPSVALVSQGEDKIQSNKNIPFARLACPGNTRISAVKFASFGSPSGTCGSYLKGD 850
Query: 772 CH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CH + IV+KAC+ + +C I ++ + CPGL + LAVEA CS
Sbjct: 851 CHDPNSSTIVEKACLNKNDCVIKLTEE--NFKSNLCPGLSRKLAVEAVCS 898
>gi|242053381|ref|XP_002455836.1| hypothetical protein SORBIDRAFT_03g025990 [Sorghum bicolor]
gi|241927811|gb|EES00956.1| hypothetical protein SORBIDRAFT_03g025990 [Sorghum bicolor]
Length = 785
Score = 840 bits (2171), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/807 (52%), Positives = 539/807 (66%), Gaps = 34/807 (4%)
Query: 23 SGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTV 82
SGS+HYPRS PE+WP+LI+K+K+GGL+V++TYVFWN HEP RGQYYFEGR+DLV F+K V
Sbjct: 2 SGSVHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRGQYYFEGRYDLVHFIKLV 61
Query: 83 QEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQ 142
++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT N PFK EM++F KI+D+MK
Sbjct: 62 KQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKAEMQKFTTKIVDMMKS 121
Query: 143 ENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAP 202
E LF QGGPIIL+Q+ENE+G +EW G + Y WAA+ AV LNTSVPWVMC+++DAP
Sbjct: 122 EGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAVALNTSVPWVMCKEDDAP 181
Query: 203 DPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETG 262
DPIINTCNGFYCD F+PN P KP MWTE ++ W+ FG VP RPVEDLA+ VA+F + G
Sbjct: 182 DPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPHRPVEDLAYGVAKFIQKG 241
Query: 263 GTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEE 322
G+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG +R+PKWGHL+ELHKAIKLCE
Sbjct: 242 GSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLREPKWGHLKELHKAIKLCEP 301
Query: 323 YLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSIL 382
L++ DP LG +A ++ S++ C AFL N D S A V+FNG Y LP WS+SIL
Sbjct: 302 ALVAGDPIVTSLGNAQQASVFRSSTDACVAFLENKDKVSYARVSFNGMHYNLPPWSISIL 361
Query: 383 PDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW--YEEKVGISGNRSFVRP 440
PDCK V+NTA+V SQ +Q K + + F+W Y E + G+ SFV
Sbjct: 362 PDCKTTVYNTARVGSQ-------ISQMK-----MEWAGGFTWQSYNEDINSLGDESFVTV 409
Query: 441 DLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESLGHAALVFVNKKLVAFG 495
L EQIN T+D +DYLWYT + V + GK L + S GHA +FVN +L
Sbjct: 410 GLLEQINVTRDNTDYLWYTTYVDVAQDEQFLSNGKNPVLTVMSAGHALHIFVNGQLTGTV 469
Query: 496 YGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILID-LKNGK 554
YG+ D ++L G NT+ LS+ VGL N G F+ AG+ + +D L G+
Sbjct: 470 YGSVDDPKLTYRGNVKLWPGSNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNEGR 529
Query: 555 RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS-LIWYKTTFLAPEGKGP 613
RDL+ +W Y+VG++GE + L +S ++S W + P+ K L WYK F AP+G P
Sbjct: 530 RDLTWQKWTYKVGLKGEDLSLHSLSGSSSVEWGE----PMQKQPLTWYKAFFNAPDGDEP 585
Query: 614 LALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTL 673
LAL+++SMGKGQ W+NGQ IGRYW Y A +G CDYRG YD KCQ +CG +Q
Sbjct: 586 LALDMSSMGKGQIWINGQGIGRYWPGYKA--SGTCGICDYRGEYDEKKCQTNCGDSSQRW 643
Query: 674 YHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVV 733
YH+PR+W++P NLLVI EE GGDP+ IS++ +T IC+ VSE P + +W+
Sbjct: 644 YHVPRSWLNPTGNLLVIFEEWGGDPTGISMVKRTTGSICADVSEWQ-PSMTNWRTK---D 699
Query: 734 SSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSI 792
++ L C+ G + I FAS+G P+G+CGS+ G CH I K C+GQ C +
Sbjct: 700 YEKAKIHLQCDHGRKMTDIKFASFGTPQGSCGSYSEGGCHAHKSYDIFWKNCIGQERCGV 759
Query: 793 PVSSAYLGVSAGACPGLLKALAVEAHC 819
V G CPG +K VEA C
Sbjct: 760 SVVPNVFG--GDPCPGTMKRAVVEAIC 784
>gi|334184642|ref|NP_001189660.1| beta galactosidase 9 [Arabidopsis thaliana]
gi|330253651|gb|AEC08745.1| beta galactosidase 9 [Arabidopsis thaliana]
Length = 859
Score = 837 bits (2162), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 421/803 (52%), Positives = 540/803 (67%), Gaps = 41/803 (5%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NV+YDHRAL+I GKRR+L S IHYPR+TPE+W +LI KSKEGG +V++TYVFWN HEP+
Sbjct: 37 NVSYDHRALIIAGKRRMLVSAGIHYPRATPEMWSDLIAKSKEGGADVVQTYVFWNGHEPV 96
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+GQY FEGR+DLV+FVK + +GL+LHLRIGPY CAEWN+GGFPVWL IPGI+FRT N
Sbjct: 97 KGQYNFEGRYDLVKFVKLIGSSGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTDNE 156
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK+EM++F+ KI+DLM++ LF QGGPII+ Q+ENEYG+VE +YG G+ YVKWAA
Sbjct: 157 PFKKEMQKFVTKIVDLMREAKLFCWQGGPIIMLQIENEYGDVEKSYGQKGKDYVKWAASM 216
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A+ L VPWVMC+Q DAP+ II+ CNG+YCDGF PNS +KP++WTE++ GW+ +G ++
Sbjct: 217 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGFKPNSRTKPVLWTEDWDGWYTKWGGSL 276
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RP EDLAFAVARF++ GG+FQNYYMYFGGTNFGRT+GGP TSYDYDAP+DEYG
Sbjct: 277 PHRPAEDLAFAVARFYQRGGSFQNYYMYFGGTNFGRTSGGPFYITSYDYDAPLDEYGLRS 336
Query: 304 QPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYHKSSND----CAAFLANYD 358
+PKWGHL++LH AIKLCE L+++D P ++KLG+K EAHIYH CAAFLAN D
Sbjct: 337 EPKWGHLKDLHAAIKLCEPALVAADAPQYRKLGSKQEAHIYHGDGETGGKVCAAFLANID 396
Query: 359 SSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQ---------Q 409
A+V FNG Y LP WSVSILPDC++V FNTAKV +Q + A+ Q
Sbjct: 397 EHKSAHVKFNGQSYTLPPWSVSILPDCRHVAFNTAKVGAQTSVKTVESARPSLGSMSILQ 456
Query: 410 KNVNELLLASSAFSWY--EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPG 467
K V + ++ + SW +E +GI G +F L E +N TKD SDYLW+ I V
Sbjct: 457 KVVRQDNVSYISKSWMALKEPIGIWGENNFTFQGLLEHLNVTKDRSDYLWHKTRISVSED 516
Query: 468 Q-------GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLD 520
G ++I+S+ VFVNK+L G+ A + + +G N L
Sbjct: 517 DISFWKKNGPNSTVSIDSMRDVLRVFVNKQLAGSIVGHWVKA----VQPVRFIQGNNDLL 572
Query: 521 ILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKIS 579
+L+ VGLQNYGA+ + GAG L KNG DLS W YQVG++GE DKI
Sbjct: 573 LLTQTVGLQNYGAFLEKDGAGFRGKAKLTGFKNGDLDLSKSSWTYQVGLKGE---ADKIY 629
Query: 580 LANSSFWKQGSTLPVNKS---LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRY 636
+ + STL + S +WYKT F P G P+ LNL SMG+GQAWVNGQ IGRY
Sbjct: 630 TVEHNEKAEWSTLETDASPSIFMWYKTYFDPPAGTDPVVLNLESMGRGQAWVNGQHIGRY 689
Query: 637 WSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGG 696
W+ ++ GC + CDYRG+Y++ KC +CG+P QT YH+PR+W+ P NLLV+ EE GG
Sbjct: 690 WNI-ISQKDGCDRTCDYRGAYNSDKCTTNCGKPTQTRYHVPRSWLKPSSNLLVLFEETGG 748
Query: 697 DPSKISLLTKTGQHICSFVSEADPPPVDSWKP------NLGVVSSSPQVRLACERGWHIA 750
+P KIS+ T T +C VSE+ PP+ W + + S +P+V L CE G I+
Sbjct: 749 NPFKISVKTVTAGILCGQVSESHYPPLRKWSTPDYINGTMSINSVAPEVHLHCEDGHVIS 808
Query: 751 AINFASYGIPEGNCGSFRPGACH 773
+I FASYG P G+C F G CH
Sbjct: 809 SIEFASYGTPRGSCDGFSIGKCH 831
>gi|414864994|tpg|DAA43551.1| TPA: beta-galactosidase [Zea mays]
Length = 897
Score = 837 bits (2162), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 436/880 (49%), Positives = 551/880 (62%), Gaps = 80/880 (9%)
Query: 6 TYDHRALVIDGKRRVLQSGSIHYPRSTPEV------------------------------ 35
TYD +A++IDG+RR+L SGSIHYPRSTP+V
Sbjct: 30 TYDKKAVLIDGQRRILFSGSIHYPRSTPDVISCILQNLSFFFSPLLPRGGGEFMAVVSCV 89
Query: 36 ----------------------WPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
W LI+K+K+GGL+VI+TYVFWN HEP G YYFE R+
Sbjct: 90 LDAMLSKANCFPTLAVPLYSTMWEGLIQKAKDGGLDVIQTYVFWNGHEPTPGNYYFEERY 149
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DLVRFVKTVQ+AGLF+HLRIGPY C EWN+GGFPVWL ++PGI FRT N PFK M+ F
Sbjct: 150 DLVRFVKTVQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEPFKTAMQGFT 209
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
KI+ +MK ENLFASQGGPIIL+Q+ENEYG +G G+ Y+ WAA AV L+T VPW
Sbjct: 210 EKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGQAYINWAAKMAVGLDTGVPW 269
Query: 194 VMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAF 253
VMC++EDAPDP+IN CNGFYCD F+PN P KP MWTE +SGWF FG + RPVEDLAF
Sbjct: 270 VMCKEEDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAF 329
Query: 254 AVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLREL 313
AVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG IR+PK HL+EL
Sbjct: 330 AVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIREPKHSHLKEL 389
Query: 314 HKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYF 373
H+A+KLCE+ L+S DPT LG EAH++ +S + CAAFLANY+S+S A V FN Y
Sbjct: 390 HRAVKLCEQALVSVDPTITTLGTMQEAHVF-RSPSGCAAFLANYNSNSHAKVVFNNEQYS 448
Query: 374 LPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKV-GIS 432
LP WS+SILPDCKNVVFN+A V Q + Q + S + Y+E+V ++
Sbjct: 449 LPPWSISILPDCKNVVFNSATVGVQTS--------QMQMWGDGATSMMWERYDEEVDSLA 500
Query: 433 GNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMP------GQGKEVFLNIESLGHAALVF 486
L EQ+N T+D+SDYLWY S+ + P G GK L+++S GHA VF
Sbjct: 501 AAPLLTTTGLLEQLNVTRDSSDYLWYITSVDISPSENFLQGGGKPPSLSVQSAGHALHVF 560
Query: 487 VNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-V 545
VN +L YG + N + L G N + +LS+ GL N G ++ G+ V
Sbjct: 561 VNGQLQGSSYGTREDRRIKYNGNVNLRAGTNKIALLSVACGLPNVGVHYETWNTGVGGPV 620
Query: 546 ILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS-LIWYKTT 604
+L L G RDL+ W YQVG++GE + L+ + + S W QGS + + L WYK
Sbjct: 621 VLHGLNEGSRDLTWQTWSYQVGLKGEQMNLNSVEGSGSVEWMQGSLIAQKQQPLAWYKAY 680
Query: 605 FLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQK 664
F P G PLAL++ SMGKGQ W+NGQSIGRYW+AY + G K C Y G++ A KCQ
Sbjct: 681 FETPSGDEPLALDMGSMGKGQVWINGQSIGRYWTAY---ADGDCKGCSYTGTFRAPKCQA 737
Query: 665 HCGQPAQTLYHIPRTWVHPGENLLVIHEEL-GGDPSKISLLTKTGQHICSFVSEADPPPV 723
CGQP Q YH+PR+W+ P NLLV+ EEL GGD SKI+L ++ +C+ VSE D P +
Sbjct: 738 GCGQPTQRWYHVPRSWLQPSRNLLVVLEELGGGDSSKIALAKRSVSSVCADVSE-DHPNI 796
Query: 724 DSWK-PNLGVVS-SSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIV 780
W+ + G +V L C G I+AI FAS+G P G CG+F+ G CH ++
Sbjct: 797 KKWQIESYGEREHRRAKVHLRCAHGQSISAIRFASFGTPVGTCGNFQQGGCHSASSHAVL 856
Query: 781 QKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+K C+G C + +S G CP + K +AVEA CS
Sbjct: 857 EKRCIGLQRCVVAISPDNFG--GDPCPSVTKRVAVEAVCS 894
>gi|33521214|gb|AAQ21369.1| beta-galactosidase [Sandersonia aurantiaca]
Length = 826
Score = 837 bits (2162), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 416/825 (50%), Positives = 554/825 (67%), Gaps = 33/825 (4%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NV YD RA+ I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HEP
Sbjct: 25 NVWYDSRAITINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPS 84
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
G+YYFEG +DLVRF+K VQ+ GL+LHLRIGPY CAEWN+GGFPVWL ++PGI FRT N
Sbjct: 85 PGKYYFEGNYDLVRFIKLVQQGGLYLHLRIGPYVCAEWNFGGFPVWLKYVPGIHFRTDNE 144
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK EM++F + I+++MK E LF QGGPIIL+Q+ENE+G +E+ G + Y WAA
Sbjct: 145 PFKAEMEKFTSHIVNMMKAEKLFHWQGGPIILSQIENEFGPLEYDQGAPAKAYAAWAAKM 204
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
AV+L T VPWVMC+++DAPDP+INT NGFY DGF PN KP+MWTEN++GWF +G V
Sbjct: 205 AVDLETGVPWVMCKEDDAPDPVINTWNGFYADGFYPNKRYKPMMWTENWTGWFTGYGVPV 264
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RPVEDLAF+VA+F + GG++ NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG +R
Sbjct: 265 PHRPVEDLAFSVAKFVQKGGSYVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGMLR 324
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
QPK+GHL +LHKAIKLCE L+S P LG E++++ +S CAAFLANYD+ A
Sbjct: 325 QPKYGHLTDLHKAIKLCEPALVSGYPVVTSLGNNQESNVFRSNSGACAAFLANYDTKYYA 384
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
VTFNG Y LP WS+SILPDCK VFNTA+V +Q + FS
Sbjct: 385 TVTFNGMRYNLPPWSISILPDCKTTVFNTARVGAQ------------TTQMQMTTVGGFS 432
Query: 424 W--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
W Y E + SF + L EQI+ T+D++DYLWYT +++ + G+ L
Sbjct: 433 WVSYNEDPNSIDDGSFTKLGLVEQISMTRDSTDYLWYTTYVNIDQNEQFLKNGQYPVLTA 492
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
+S GH+ VF+N +L+ YG+ + ++L G N + LS+ VGL N G F+
Sbjct: 493 QSAGHSLHVFINGQLIGTAYGSVEDPRLTYTGNVKLFAGSNKISFLSIAVGLPNVGEHFE 552
Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
GL V L L GKRDL+ +W Y++G++GE + L +S +++ W S
Sbjct: 553 TWNTGLLGPVTLNGLNEGKRDLTWQKWTYKIGLKGEALSLHTLSGSSNVEWGDASR---K 609
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
+ L WYK F AP G PLAL++++MGKGQ W+NGQSIGRYW AY A G KCDY G
Sbjct: 610 QPLAWYKGFFNAPGGSEPLALDMSTMGKGQVWINGQSIGRYWPAYKA--RGSCPKCDYEG 667
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
+Y+ +KCQ +CG +Q YH+PR+W++P NL+V+ EE GG+P+ ISL+ ++ + C++V
Sbjct: 668 TYEETKCQSNCGDSSQRWYHVPRSWLNPTGNLIVVFEEWGGEPTGISLVKRSMRSACAYV 727
Query: 716 SEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM- 774
S+ P +++W + +V L+C+ G + I FASYG P+G C S+ G CH
Sbjct: 728 SQGQ-PSMNNWHTKY----AESKVHLSCDPGLKMTQIKFASYGTPQGACESYSEGRCHAH 782
Query: 775 DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
I QK C+GQ CS+ V G CPG++K++AV+A C
Sbjct: 783 KSYDIFQKNCIGQQVCSVTVVPEVFG--GDPCPGIMKSVAVQASC 825
>gi|356508931|ref|XP_003523206.1| PREDICTED: beta-galactosidase 10-like [Glycine max]
Length = 843
Score = 837 bits (2161), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/838 (51%), Positives = 539/838 (64%), Gaps = 30/838 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
LS NV+YD R+L+IDG+R++L S SIHYPRS P +WP L++ +KEGG++VIETYVFWN H
Sbjct: 18 LSGNVSYDGRSLLIDGQRKLLISASIHYPRSVPAMWPGLVQTAKEGGVDVIETYVFWNGH 77
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
E G YYF GRFDLV+F KTVQ+AG++L LRIGP+ AEWN+GG PVWLH++PG FRT
Sbjct: 78 ELSPGNYYFGGRFDLVKFAKTVQQAGMYLILRIGPFVAAEWNFGGVPVWLHYVPGTVFRT 137
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PF M++F I++LMKQE LFASQGGPIIL+Q+ENEYG E Y G+ Y WA
Sbjct: 138 YNQPFMYHMQKFTTYIVNLMKQEKLFASQGGPIILSQIENEYGYYENFYKEDGKKYALWA 197
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV+ NT VPW+MCQQ DAPDP+I+TCN FYCD FTP SP++P +WTEN+ GWF +FG
Sbjct: 198 AKMAVSQNTGVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPNRPKIWTENWPGWFKTFG 257
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
P RP ED+AF+VARFF+ GG+ NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG
Sbjct: 258 GRDPHRPAEDVAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYDAPVDEYG 317
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
R PKWGHL+ELH+AIKLCE L++ + LG +EA +Y SS CAAF++N D
Sbjct: 318 LPRLPKWGHLKELHRAIKLCEHVLLNGKSVNISLGPSVEADVYTDSSGACAAFISNVDDK 377
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN-NGDHPFAQQ---KNVNELL 416
+D V F Y LPAWSVSILPDCKNVVFNTAKV SQ N P + Q K VN L
Sbjct: 378 NDKTVEFRNASYHLPAWSVSILPDCKNVVFNTAKVTSQTNVVAMIPESLQQSDKGVNSL- 436
Query: 417 LASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKE 471
+ +EK GI G FV+ + INTTKDT+DYLW+T SI V + G +
Sbjct: 437 ----KWDIVKEKPGIWGKADFVKSGFVDLINTTKDTTDYLWHTTSIFVSENEEFLKKGSK 492
Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
L IES GHA FVN++ G GN + F I L G N + +L + VGLQ
Sbjct: 493 PVLLIESTGHALHAFVNQEYQGTGTGNGTHSPFSFKNPISLRAGKNEIALLCLTVGLQTA 552
Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
G ++D GAGL SV + LKNG DLSS W Y++GV+GEY+ L + + N W S
Sbjct: 553 GPFYDFIGAGLTSVKIKGLKNGTIDLSSYAWTYKIGVQGEYLRLYQGNGLNKVNWTSTSE 612
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLA-PSTGCTKK 650
+ L WYK AP G P+ L++ MGKG AW+NG+ IGRYW S C K+
Sbjct: 613 PQKMQPLTWYKAIVDAPPGDEPVGLDMLHMGKGLAWLNGEEIGRYWPRKSEFKSEDCVKE 672
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
CDYRG ++ KC CG+P Q YH+PR+W P N+LV+ EE GGDP KI + +
Sbjct: 673 CDYRGKFNPDKCDTGCGEPTQRWYHVPRSWFKPSGNILVLFEEKGGDPEKIKFVRRKVSG 732
Query: 711 ICSFVSEADPPPV-------DSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGN 763
C+ V+E D P V D + N V P L C I+A+ FAS+G P G+
Sbjct: 733 ACALVAE-DYPSVGLLSQGEDKIQNNKNV----PFAHLTCPSNTRISAVKFASFGTPSGS 787
Query: 764 CGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CGS+ G CH + IV+KAC+ + +C I ++ CPGL + LAVEA CS
Sbjct: 788 CGSYLKGDCHDPNSSTIVEKACLNKNDCVIKLTEE--NFKTNLCPGLSRKLAVEAVCS 843
>gi|218188525|gb|EEC70952.1| hypothetical protein OsI_02561 [Oryza sativa Indica Group]
Length = 822
Score = 835 bits (2158), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/825 (50%), Positives = 544/825 (65%), Gaps = 36/825 (4%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+TYD +A+V++G+RR+L SGSIHYPRSTPE+WP+LI K+K+GGL+V++TYVFWN HEP
Sbjct: 23 LTYDRKAVVVNGQRRILISGSIHYPRSTPEMWPDLIEKAKDGGLDVVQTYVFWNGHEPSP 82
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
GQYYFEGR+DLV F+K V++AGL+++LRIGPY CAEWN+GGFPVWL ++PGI FRT N P
Sbjct: 83 GQYYFEGRYDLVHFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 142
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK EM++F KI+++MK E LF QGGPIIL+Q+ENE+G +EW G + Y WAA+ A
Sbjct: 143 FKAEMQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMA 202
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
V LNT VPW+MC+++DAPDPIINTCNGFYCD F+PN P KP MWTE ++ W+ FG VP
Sbjct: 203 VALNTGVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTAWYTGFGIPVP 262
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
RPVEDLA+ VA+F + GG+F NYYM+ GGTNFGRTAGGP +ATSYDYDAPIDEYG +R+
Sbjct: 263 HRPVEDLAYGVAKFIQKGGSFVNYYMFHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLRE 322
Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
PKWGHL++LHKAIKLCE L++ DP LG ++ ++ S+ CAAFL N D S A
Sbjct: 323 PKWGHLKQLHKAIKLCEPALVAGDPIVTSLGNAQKSSVFRSSTGACAAFLDNKDKVSYAR 382
Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
V FNG Y LP WS+SILPDCK VFNTA+V SQ +Q K + + F+W
Sbjct: 383 VAFNGMHYDLPPWSISILPDCKTTVFNTARVGSQ-------ISQMK-----MEWAGGFAW 430
Query: 425 --YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHA 482
Y E++ G F L EQIN T+D +DYLWYT + V Q + N E+
Sbjct: 431 QSYNEEINSFGEDPFTTVGLLEQINVTRDNTDYLWYTTYVDV--AQDDQFLSNGENPKLT 488
Query: 483 ALVFVNKKLVAFG-----YGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
+ F+ ++ YG+ D ++L G NT+ LS+ VGL N G F+
Sbjct: 489 VMCFLILNILFNLLAGTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIAVGLPNVGEHFET 548
Query: 538 AGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
AG+ + +D L G+RDL+ +W YQVG++GE + L +S +++ W + PV K
Sbjct: 549 WNAGILGPVTLDGLNEGRRDLTWQKWTYQVGLKGESMSLHSLSGSSTVEWGE----PVQK 604
Query: 597 S-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
L WYK F AP+G PLAL+++SMGKGQ W+NGQ IGRYW Y A +G CDYRG
Sbjct: 605 QPLTWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKA--SGNCGTCDYRG 662
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
YD +KCQ +CG +Q YH+PR+W+ P NLLVI EE GGDP+ IS++ ++ +C+ V
Sbjct: 663 EYDETKCQTNCGDSSQRWYHVPRSWLSPTGNLLVIFEEWGGDPTGISMVKRSIGSVCADV 722
Query: 716 SEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM- 774
SE P + +W +V L C+ G I I FAS+G P+G+CGS+ G CH
Sbjct: 723 SEWQ-PSMKNWHTK---DYEKAKVHLQCDNGQKITEIKFASFGTPQGSCGSYSEGGCHAH 778
Query: 775 DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
I K CVGQ C + V G CPG +K VEA C
Sbjct: 779 KSYDIFWKNCVGQERCGVSVVPEIFG--GDPCPGTMKRAVVEAIC 821
>gi|414878434|tpg|DAA55565.1| TPA: hypothetical protein ZEAMMB73_938277 [Zea mays]
Length = 918
Score = 835 bits (2158), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/855 (50%), Positives = 556/855 (65%), Gaps = 48/855 (5%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NVTYDHRAL++ GKRR+L S +HYPR+TPE+WP LI K KEGG++ IETYVFWN HEP
Sbjct: 62 NVTYDHRALILGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGVDAIETYVFWNGHEPA 121
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+GQYYFEGRFD+VRF K V GLFL LRIGPYACAEWN+GGFPVWL +PGI+FRT N
Sbjct: 122 KGQYYFEGRFDIVRFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDVPGIEFRTDNE 181
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
P+K EM+ F+ KI+D+MK+E L++ QGGPIIL Q+ENEYGN++ YG G+ Y+ WAA
Sbjct: 182 PYKAEMQIFVTKIVDIMKEEKLYSWQGGPIILQQIENEYGNIQGHYGQAGKRYMLWAAQM 241
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A+ L+T VPWVMC+Q DAP+ I+NTCN FYCDGF PNS +KP +WTE++ GW+ +G ++
Sbjct: 242 ALALDTGVPWVMCRQTDAPEQILNTCNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGESL 301
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RP +D AFAVARF++ GG+ QNYYMYFGGTNF RTAGGPL TSYDYDAPIDEYG +R
Sbjct: 302 PHRPAQDSAFAVARFYQRGGSLQNYYMYFGGTNFERTAGGPLQITSYDYDAPIDEYGILR 361
Query: 304 QPKWGHLRELHKAIKLCEEYLISSD--PTHQKLGAKLEAHIYHK-----------SSNDC 350
QPKWGHL++LH AIKLCE L + D P + KLG EAH+Y +S C
Sbjct: 362 QPKWGHLKDLHAAIKLCESALTAVDGSPHYVKLGPMQEAHVYSSENVHTNGSISGNSQFC 421
Query: 351 AAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN-----NGDHP 405
+AFLAN D A+V G Y LP WSVSILPDC+ V FNTA+V +Q + +G
Sbjct: 422 SAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCETVAFNTARVGTQTSFFNVESGSPS 481
Query: 406 FAQQKNVNELLLASSAF---SW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTA 460
++ + L L + +W ++E VGI G F + E +N TKD SDYL YT
Sbjct: 482 YSSRHKPRILSLIGVPYLSTTWWTFKEPVGIWGEGIFTAQGILEHLNVTKDISDYLSYTT 541
Query: 461 SIHVMP-------GQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELN 513
+++ +G L I+ + A VFVN KL G+ +N+ ++L
Sbjct: 542 RVNISEEDVLYWNSKGFLPSLTIDQIRDVARVFVNGKLAGSKVGHW----VSLNQPLQLV 597
Query: 514 EGINTLDILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEY 572
+G+N L +LS +VGLQNYGA+ + GAG V L L NG DL++ W YQ+G++GE+
Sbjct: 598 QGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVKLTGLSNGDIDLTNSLWTYQIGLKGEF 657
Query: 573 IGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQS 632
+ S+ W W+KT F APEG GP+ ++L SMGKGQAWVNG
Sbjct: 658 SRIYSPEYQGSAEWSSMQNDDTVSPFTWFKTMFDAPEGNGPVTIDLGSMGKGQAWVNGHL 717
Query: 633 IGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHE 692
IGRYWS +AP +GC C+Y G+Y SKC+ +CG Q+ YHIPR W+ NLLV+ E
Sbjct: 718 IGRYWS-LVAPESGCPSSCNYAGTYSDSKCRSNCGIATQSWYHIPREWLQESGNLLVLFE 776
Query: 693 ELGGDPSKISLLTKTGQHICSFVSEADPPPVDSW------KPNLGVVSSSPQVRLACERG 746
E GGDPS+ISL + ICS +SE PP+ +W +P++ V +P++RL C+ G
Sbjct: 777 ETGGDPSQISLEVHYTKTICSKISETYYPPLSAWSRAANGRPSVNTV--APELRLQCDDG 834
Query: 747 WHIAAINFASYGIPEGNCGSFRPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGA 805
I+ I FASYG P G C +F G CH L +V +AC G+ C+I V++ G
Sbjct: 835 HVISKITFASYGTPTGGCQNFSVGNCHASTTLDLVVEACEGKNRCAISVTNEVFG---DP 891
Query: 806 CPGLLKALAVEAHCS 820
C ++K LAVEA CS
Sbjct: 892 CRKVVKDLAVEAECS 906
>gi|449435860|ref|XP_004135712.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 723
Score = 827 bits (2137), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 406/714 (56%), Positives = 500/714 (70%), Gaps = 23/714 (3%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A+VTYDH+ALVIDGKRR+L SGSIHYPRSTP++WP+LI+K+K+GGL+VIETYVFWN HEP
Sbjct: 24 ASVTYDHKALVIDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGHEP 83
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
GQYYFE R++LVRFVK VQ+AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT N
Sbjct: 84 SPGQYYFEDRYELVRFVKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIAFRTDN 143
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M++F AKI+ +MK E L+ SQGGPIIL+Q+ENEYG VEW G G+ Y KWAA
Sbjct: 144 GPFKAAMQKFTAKIVSMMKGEKLYHSQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAAQ 203
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
A+ L+T VPWVMC+QEDAPDP+I+TCNGFYC+ F PN KP MWTE ++GWF FG
Sbjct: 204 MALGLDTGVPWVMCKQEDAPDPMIDTCNGFYCENFEPNKAYKPKMWTEAWTGWFTEFGGP 263
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
VP+RPVEDLA+AVARF + G+ NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG I
Sbjct: 264 VPYRPVEDLAYAVARFIQNRGSLINYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLI 323
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPKWGHLR+LHKAIKLCE L+S DPT LG+K EAH+Y+ S +CAAFLANYD S+
Sbjct: 324 RQPKWGHLRDLHKAIKLCEPALVSVDPTVSSLGSKQEAHVYNTRSGECAAFLANYDPSTS 383
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
VTF + Y LP WSVSILPDCK VVFNTAKV + P K + S+F
Sbjct: 384 VRVTFGNHPYDLPPWSVSILPDCKTVVFNTAKV-------NAPSYWPK-----MTPISSF 431
Query: 423 SWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
SW+ EE + + L EQI+ T+D +DYLWY I + + G+ L
Sbjct: 432 SWHSYNEETASAYADDTTTMAGLVEQISITRDATDYLWYMTDIRIDSNEGFLKSGQWPLL 491
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
I S GHA VF+N +L YG D +K + L G+N L +LS+ VGL N G
Sbjct: 492 TIFSAGHALHVFINGQLSGTVYGGLDNPKLTFSKYVNLRPGVNKLSMLSVAVGLPNVGVH 551
Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
F+ AG+ V L L G RD+S +W Y+VG++GE + L +S ++S W GS +
Sbjct: 552 FETWNAGILGPVTLKGLNEGTRDMSGYKWSYKVGLKGEALNLHTVSGSSSVEWMTGSLVS 611
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ L WYKTTF AP G PLAL++ SMGKGQ W+NG+SIGR+W AY A G KC Y
Sbjct: 612 QKQPLTWYKTTFNAPGGNEPLALDMGSMGKGQVWINGESIGRHWPAYTA--RGSCGKCYY 669
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
G + KC CG+P+Q YH+PR W+ P N+LVI EE GG+P ISL+ ++
Sbjct: 670 GGIFTEKKCHFSCGEPSQRWYHVPRAWLKPSGNILVIFEEWGGNPDGISLVKRS 723
>gi|224096113|ref|XP_002310540.1| predicted protein [Populus trichocarpa]
gi|222853443|gb|EEE90990.1| predicted protein [Populus trichocarpa]
Length = 827
Score = 826 bits (2134), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 419/833 (50%), Positives = 531/833 (63%), Gaps = 35/833 (4%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ NV+YD R+L+I+G+R++L S +IHYPRS P +WPEL++ +KEGG++VIETYVFWN H
Sbjct: 17 FAGNVSYDSRSLIINGERKLLISAAIHYPRSVPAMWPELVKTAKEGGVDVIETYVFWNVH 76
Query: 61 EPIR-GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFR 119
+P +Y+F+GRFDLV+F+ VQEAG++L LRIGP+ AEWN+GG PVWLH++ G FR
Sbjct: 77 QPTSPSEYHFDGRFDLVKFINIVQEAGMYLILRIGPFVAAEWNFGGIPVWLHYVNGTVFR 136
Query: 120 TTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQ--VENEYGNVEWAYGVGGELYV 177
T N FK M+ F I+ LMK+E LFASQGGPIIL+Q VENEYG E AYG GG+ Y
Sbjct: 137 TDNYNFKYYMEEFTTYIVKLMKKEKLFASQGGPIILSQAKVENEYGYYEGAYGEGGKRYA 196
Query: 178 KWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFL 237
WAA AV+ NT VPW+MCQQ DAP +INTCN FYCD F P P KP +WTEN+ GWF
Sbjct: 197 AWAAQMAVSQNTGVPWIMCQQFDAPPSVINTCNSFYCDQFKPIFPDKPKIWTENWPGWFQ 256
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPID 297
+FG P RP ED+AF+VARFF+ GG+ QNYYMY GGTNFGRTAGGP + TSYDY+APID
Sbjct: 257 TFGAPNPHRPAEDVAFSVARFFQKGGSVQNYYMYHGGTNFGRTAGGPFITTSYDYEAPID 316
Query: 298 EYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANY 357
EYG R PKWGHL+ELHKAIKLCE L++S P + LG EA +Y +S C AFLAN
Sbjct: 317 EYGLPRLPKWGHLKELHKAIKLCEHVLLNSKPVNLSLGPSQEADVYADASGGCVAFLANI 376
Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLL 417
D +D V F Y LPAWSVSILPDCKNVV+NTAK QK+
Sbjct: 377 DDKNDKTVDFQNVSYKLPAWSVSILPDCKNVVYNTAK--------------QKD------ 416
Query: 418 ASSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQGK 470
S A W + EK GI G F++ + INTTKDT+DYLWYT SI V +G+
Sbjct: 417 GSKALKWEVFVEKAGIWGEPDFMKNGFVDHINTTKDTTDYLWYTTSIVVGENEEFLKEGR 476
Query: 471 EVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQN 530
L IES+GHA FVN++L GN + F I L G N + +LSM VGL N
Sbjct: 477 HPVLLIESMGHALHAFVNQELQGSASGNGSHSPFKFKNPISLKAGNNEIALLSMTVGLPN 536
Query: 531 YGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
G++++ GAGL SV + NG DLS WIY++G++GE +G+ K NS W S
Sbjct: 537 AGSFYEWVGAGLTSVRIEGFNNGTVDLSHFNWIYKIGLQGEKLGIYKPEGVNSVSWVATS 596
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
P + L WYK P G P+ L++ MGKG AW+NG+ IGRYW + C +
Sbjct: 597 EPPKKQPLTWYKVVLDPPAGNEPVGLDMLHMGKGLAWLNGEEIGRYWPRKSSVHEKCVTE 656
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
CDYRG + KC CGQP Q YH+PR+W P NLLVI EE GGDP KI+ +
Sbjct: 657 CDYRGKFMPDKCFTGCGQPTQRWYHVPRSWFKPSGNLLVIFEEKGGDPEKITFSRRKMSS 716
Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSSPQ--VRLACERGWHIAAINFASYGIPEGNCGSFR 768
IC+ ++E P G +S+ + V L C + I+A+ FAS+G P G CGS+
Sbjct: 717 ICALIAEDYPSADRKSLQEAGSKNSNSKASVHLGCPQNAVISAVKFASFGTPTGKCGSYS 776
Query: 769 PGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G CH + + +V+KAC+ + EC+I ++ + G CP + LAVEA CS
Sbjct: 777 EGECHDPNSISVVEKACLNKTECTIELTEE--NFNKGLCPDFTRRLAVEAVCS 827
>gi|224128630|ref|XP_002329051.1| predicted protein [Populus trichocarpa]
gi|222839722|gb|EEE78045.1| predicted protein [Populus trichocarpa]
Length = 830
Score = 825 bits (2132), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/832 (50%), Positives = 550/832 (66%), Gaps = 34/832 (4%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
++A+V+YD +A+ I+G+RR+L SGSIHYPRS+PE+WP+LI+K+KEGGL+VI+TYVFWN H
Sbjct: 21 VTASVSYDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGH 80
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G+YYFEG +DLV+FVK V+EAGL+++LRIGPY CAEWN+G QF+
Sbjct: 81 EPSPGKYYFEGNYDLVKFVKLVKEAGLYVNLRIGPYICAEWNFGH-----------QFQN 129
Query: 121 TNNPFKEE---MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYV 177
PF+ E M++F KI+++MK E LF SQGGPIIL+Q+ENEYG +E+ G G+ Y
Sbjct: 130 GQWPFQGEAAQMRKFTTKIVNMMKAERLFESQGGPIILSQIENEYGPMEYELGSPGQAYT 189
Query: 178 KWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFL 237
KWAA AV L T VPWVMC+Q+DAPDPIINTCNGFYCD F+PN KP MWTE ++GWF
Sbjct: 190 KWAAQMAVGLRTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFT 249
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPID 297
FG VP RP ED+AF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+D
Sbjct: 250 QFGGPVPHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLD 309
Query: 298 EYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANY 357
EYG +RQPKWGHL++LH+AIKLCE L+S D T LG EAH+++ + CAAFLANY
Sbjct: 310 EYGLLRQPKWGHLKDLHRAIKLCEPALVSGDATVIPLGNYQEAHVFNYKAGGCAAFLANY 369
Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLL 417
S A V+F Y LP WS+SILPDCKN V+NTA+V +Q A K +
Sbjct: 370 HQRSFAKVSFRNMHYNLPPWSISILPDCKNTVYNTARVGAQS-------ATIKMTPVPMH 422
Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
++ Y E+ SG+ +F L EQINTT+D SDYLWY +H+ P + GK
Sbjct: 423 GGLSWQTYNEEPSSSGDNTFTMVGLLEQINTTRDVSDYLWYMTDVHIDPSEGFLKSGKYP 482
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L + S GHA VF+N +L YG+ DF ++ + L G+N + +LS+ VGL N G
Sbjct: 483 VLTVLSAGHALHVFINGQLSGTAYGSLDFPKLTFSQGVSLRAGVNKISLLSIAVGLPNVG 542
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
F+ AG+ V L L G+ DLS +W Y++G+ GE + L IS ++S W +GS
Sbjct: 543 PHFETWNAGILGPVTLNGLNEGRMDLSWQKWSYKIGLHGEALSLHSISGSSSVEWAEGSL 602
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
+ + L WYKTTF AP G PLAL++ SMGKGQ W+NGQ +GR+W AY A +G +C
Sbjct: 603 VAQKQPLSWYKTTFNAPAGNSPLALDMGSMGKGQIWINGQHVGRHWPAYKA--SGTCGEC 660
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
Y G+Y+ +KC +CG+ +Q YH+P++W+ P NLLV+ EE GGDP+ +SL+ + +
Sbjct: 661 TYIGTYNENKCSTNCGEASQRWYHVPQSWLKPTGNLLVVFEEWGGDPNGVSLVRREVDSV 720
Query: 712 CSFVSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
C+ + E P ++ G V+ P+ L+C G I +I FAS+G PEG CGS+
Sbjct: 721 CADIYEWQPTLMNYQMQASGKVNKPLRPKAHLSCGPGQKIRSIKFASFGTPEGVCGSYNQ 780
Query: 770 GACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G+CH CVGQ CS+ V+ G CP ++K LA EA CS
Sbjct: 781 GSCHAFHSYDAFNNLCVGQNSCSVTVAPEMFG--GDPCPSVMKKLAAEAICS 830
>gi|449489943|ref|XP_004158465.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 1225
Score = 825 bits (2132), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 406/713 (56%), Positives = 499/713 (69%), Gaps = 23/713 (3%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A+VTYDH+ALVIDGKRR+L SGSIHYPRSTP++WP+LI+K+K+GGL+VIETYVFWN HEP
Sbjct: 24 ASVTYDHKALVIDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGHEP 83
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
GQYYFE R++LVRFVK VQ+AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT N
Sbjct: 84 SPGQYYFEDRYELVRFVKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIAFRTDN 143
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M++F AKI+ +MK E L+ SQGGPIIL+Q+ENEYG VEW G G+ Y KWAA
Sbjct: 144 GPFKAAMQKFTAKIVSMMKGEKLYHSQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAAQ 203
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
A+ L+T VPWVMC+QEDAPDP+I+TCNGFYC+ F PN KP MWTE ++GWF FG
Sbjct: 204 MALGLDTGVPWVMCKQEDAPDPMIDTCNGFYCENFEPNKAYKPKMWTEAWTGWFTEFGGP 263
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
VP+RPVEDLA+AVARF + G+ NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG I
Sbjct: 264 VPYRPVEDLAYAVARFIQNRGSLINYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLI 323
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPKWGHLR+LHKAIKLCE L+S DPT LG+K EAH+Y+ S +CAAFLANYD S+
Sbjct: 324 RQPKWGHLRDLHKAIKLCEPALVSVDPTVSSLGSKQEAHVYNTRSGECAAFLANYDPSTS 383
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
VTF + Y LP WSVSILPDCK VVFNTAKV + P K + S+F
Sbjct: 384 VRVTFGNHPYDLPPWSVSILPDCKTVVFNTAKV-------NAPSYWPK-----MTPISSF 431
Query: 423 SWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
SW+ EE + + L EQI+ T+D +DYLWY I + + G+ L
Sbjct: 432 SWHSYNEETASAYADDTTTMAGLVEQISITRDATDYLWYMTDIRIDSNEGFLKSGQWPLL 491
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
I S GHA VF+N +L YG D +K + L G+N L +LS+ VGL N G
Sbjct: 492 TIFSAGHALHVFINGQLSGTVYGGLDNPKLTFSKYVNLRPGVNKLSMLSVAVGLPNVGVH 551
Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
F+ AG+ V L L G RD+S +W Y+VG++GE + L +S ++S W GS +
Sbjct: 552 FETWNAGILGPVTLKGLNEGTRDMSGYKWSYKVGLKGEALNLHTVSGSSSVEWMTGSLVS 611
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ L WYKTTF AP G PLAL++ SMGKGQ W+NG+SIGR+W AY A G KC Y
Sbjct: 612 QKQPLTWYKTTFNAPGGNEPLALDMGSMGKGQVWINGESIGRHWPAYTA--RGSCGKCYY 669
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
G + KC CG+P+Q YH+PR W+ P N+LVI EE GG+P ISL+ +
Sbjct: 670 GGIFTEKKCHFSCGEPSQRWYHVPRAWLKPSGNILVIFEEWGGNPDGISLVKR 722
Score = 465 bits (1197), Expect = e-128, Method: Compositional matrix adjust.
Identities = 247/515 (47%), Positives = 323/515 (62%), Gaps = 27/515 (5%)
Query: 206 INTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTF 265
I+TCNGFYC+ F PN KP +WTEN+SGW+ +FG P+RP ED+AF+VARF + GG+
Sbjct: 723 IDTCNGFYCENFKPNQIYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQNGGSL 782
Query: 266 QNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLI 325
NYYMY GGTNFGRT+G V TSYD+DAPIDEYG +R+PKWGHLR+LHKAIKLCE L+
Sbjct: 783 VNYYMYHGGTNFGRTSG-LFVTTSYDFDAPIDEYGLLREPKWGHLRDLHKAIKLCEPALV 841
Query: 326 SSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDC 385
S+DPT LG EA ++ SS CAAFLANYD+S+ V F + Y LP WS+SILPDC
Sbjct: 842 SADPTSTWLGKDQEARVFKSSSGACAAFLANYDTSAFVRVNFWNHPYDLPPWSISILPDC 901
Query: 386 KNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS----SAFSWY---EEKVGISGNRSFV 438
K V FNTA+V +R+ + + LL+A S+F W EE +
Sbjct: 902 KTVTFNTARV--RRD-------PKLFIPNLLMAKMTPISSFWWLSYKEEPASAYAKDTTT 952
Query: 439 RPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESLGHAALVFVNKKLVA 493
+ L EQ++ T DT+DYLWY I + + G+ L + S GH VF+N +L
Sbjct: 953 KDGLVEQVSVTWDTTDYLWYMTDIRIDSTEGFLKSGQWPLLTVNSAGHILHVFINGQLSG 1012
Query: 494 FGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VILIDLKN 552
YG+ + +K + L +G+N L +LS+ VGL N G FD AG+ V L L
Sbjct: 1013 SVYGSLEDPRITFSKYVNLKQGVNKLSMLSVTVGLPNVGLHFDTWNAGVLGPVTLKGLNE 1072
Query: 553 GKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKG 612
G RD+S +W Y+VG+ GE + L + +NS W +GS + L WYKTTF P G
Sbjct: 1073 GTRDMSKYKWSYKVGLRGEILNLYSVKGSNSVQWMKGSFQ--KQPLTWYKTTFNTPAGNE 1130
Query: 613 PLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQT 672
PLAL+++SM KGQ WVNG+SIGRY+ Y+A +G KC Y G + KC +CG P+Q
Sbjct: 1131 PLALDMSSMSKGQIWVNGRSIGRYFPGYIA--SGKCNKCSYTGFFTEKKCLWNCGGPSQK 1188
Query: 673 LYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
YHIPR W+ P NLL+I EE+GG+P ISL+ +T
Sbjct: 1189 WYHIPRDWLSPNGNLLIILEEIGGNPQGISLVKRT 1223
>gi|255546099|ref|XP_002514109.1| beta-galactosidase, putative [Ricinus communis]
gi|223546565|gb|EEF48063.1| beta-galactosidase, putative [Ricinus communis]
Length = 827
Score = 824 bits (2129), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 411/831 (49%), Positives = 543/831 (65%), Gaps = 35/831 (4%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
++A V YDH+A+ I+ +RR+L SGSIHYPRSTPE+WP LI+K+KEGG+EVI+TYVFWN H
Sbjct: 21 VTATVWYDHKAITINNQRRILISGSIHYPRSTPEMWPGLIQKAKEGGIEVIQTYVFWNGH 80
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP GQYYF+ R+DLV+F+K VQ+AGL++HLRIGPY CAEWN+GGFP+WL ++PGI+FRT
Sbjct: 81 EPSPGQYYFQDRYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPMWLKYVPGIEFRT 140
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M++F+ I+++MK++ LF +QGGPIIL+Q+ENEYG VEW G G+ Y KWA
Sbjct: 141 DNGPFKAAMQKFVTLIVNMMKEQKLFQTQGGPIILSQIENEYGPVEWTIGAPGKAYTKWA 200
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A LNT VPW+MC+QEDAPDP I+TCNGFYC+G+ PN+ +KP +WTEN++GW+ +G
Sbjct: 201 AAMATGLNTGVPWIMCKQEDAPDPTIDTCNGFYCEGYKPNNYNKPKVWTENWTGWYTEWG 260
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+VP+RP ED AF+VARF G+F NYYMY GGTNF RTA G +ATSYDYDAP+DEYG
Sbjct: 261 ASVPYRPPEDTAFSVARFIAASGSFVNYYMYHGGTNFDRTA-GLFMATSYDYDAPLDEYG 319
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
PKWGHLR+LH+AIK E L+S+DPT LG EAH++ +S CAAFLANYD+
Sbjct: 320 LTHDPKWGHLRDLHRAIKQSERALVSADPTVISLGKNQEAHVF-QSKMGCAAFLANYDTQ 378
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
A V F Y LP WS+S+LPDCK VV+NTAK+ +Q ++ +S
Sbjct: 379 YSARVNFWNKPYSLPRWSISVLPDCKTVVYNTAKISAQSTQ-----------KWMMPVAS 427
Query: 421 AFSWY----EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKE 471
FSW E VG S +F + L EQ T D +DYLWY + + + GK
Sbjct: 428 GFSWQSHIDEVPVGYSAG-TFTKVGLWEQKYLTGDKTDYLWYMTDVTINSNEGFLRSGKN 486
Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
FL + S GH VF+N L YG+ + ++ ++L G+N + +LS VGL N
Sbjct: 487 PFLTVASAGHVLHVFINGHLAGSAYGSLENPKLTFSQNVKLVGGVNKIALLSATVGLANV 546
Query: 532 GAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
G +D G+ V L L G D++ +W Y++G++GE + L S + W QG+
Sbjct: 547 GVHYDTWNVGVLGPVTLQGLNQGTLDMTKWKWSYKIGLKGEDLKL--FSGGANVGWAQGA 604
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
L L WYKT AP G P+AL + SMGKGQ ++NG+SIGR+W AY A G K
Sbjct: 605 QLAKKTPLTWYKTFINAPPGNDPVALYMGSMGKGQMYINGRSIGRHWPAYTA--KGNCKD 662
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
CDY G YD KC+ CGQP Q YH+PR+W+ P NLLV+ EE+GGDP+ ISL+ +
Sbjct: 663 CDYAGYYDDQKCRSGCGQPPQQWYHVPRSWLKPTGNLLVVFEEMGGDPTGISLVKRVVGS 722
Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
+C+ + + D P + SW N+ V +P+ L C G + I FASYG P+G CG++R G
Sbjct: 723 VCADIDD-DQPEMKSWTENIPV---TPKAHLWCPPGQKFSKIVFASYGWPQGRCGAYRQG 778
Query: 771 ACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CH + QK C+G+ C I V+ A G CPG K L+V+ CS
Sbjct: 779 KCHALKSWDPFQKYCIGKGACDIDVAPATFG--GDPCPGSAKRLSVQLQCS 827
>gi|168008096|ref|XP_001756743.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691981|gb|EDQ78340.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 836
Score = 817 bits (2111), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 422/834 (50%), Positives = 543/834 (65%), Gaps = 35/834 (4%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
++ V+YDHRAL +DG+RR+L SGSIHYPRSTP +WP LI K+KEGGL+VI+TYVFWN H
Sbjct: 24 VAVTVSYDHRALKLDGQRRMLVSGSIHYPRSTPLMWPGLIAKAKEGGLDVIQTYVFWNGH 83
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP RG Y + GR++L +F++ V EAG++++LRIGPY CAEWN GGFP WL FIPGI+FRT
Sbjct: 84 EPTRGVYNYAGRYNLPKFIRLVYEAGMYVNLRIGPYVCAEWNSGGFPAWLRFIPGIEFRT 143
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK E +RF+ ++ +K+E LFA QGGPII+AQ+ENEYGN++ +YG G+ Y+ W
Sbjct: 144 DNEPFKNETQRFVNHLVRKLKREKLFAWQGGPIIMAQIENEYGNIDASYGEAGQRYLNWI 203
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A+ AV NTSVPW+MCQQ +AP +INTCNGFYCDG+ PNS KP WTEN++GWF S+G
Sbjct: 204 ANMAVATNTSVPWIMCQQPEAPQLVINTCNGFYCDGWRPNSEDKPAFWTENWTGWFQSWG 263
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
P RPV+D+AF+VARFFE GG+F NYYMY GGTNF RT G V TSYDYDAPIDEY
Sbjct: 264 GGAPTRPVQDIAFSVARFFEKGGSFMNYYMYHGGTNFERT-GVESVTTSYDYDAPIDEYD 322
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSD--PTHQKLGAKLEAHIYHKSSNDCAAFLANYD 358
+RQPKWGHL++LH A+KLCE L+ D PT LG EAH+Y SS CAAFLA++D
Sbjct: 323 -VRQPKWGHLKDLHAALKLCEPALVEVDTVPTGISLGPNQEAHVYQSSSGTCAAFLASWD 381
Query: 359 SSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
++D+ VTF G Y LPAWSVSILPDCK+VVFNTAKV Q + + A
Sbjct: 382 -TNDSLVTFQGQPYDLPAWSVSILPDCKSVVFNTAKV-----------GAQSVIMTMQGA 429
Query: 419 SSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEV---- 472
+W Y E +G G+ F L EQI TTKDT+DYLWY ++ V + +
Sbjct: 430 VPVTNWVSYHEPLGPWGS-VFSTNGLLEQIATTKDTTDYLWYMTNVQVAESDVRNISAQA 488
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L + SL AA FVN F G + I L G N + +LSM +GLQ YG
Sbjct: 489 TLVMSSLRDAAHTFVN----GFYTGTSHQQFMHARQPISLRPGSNNITVLSMTMGLQGYG 544
Query: 533 AWFDVAGAGL-FSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
+ + AG+ + V + DL +G +L W YQVG++GE L +++ + ++ W S
Sbjct: 545 PFLENEKAGIQYGVRIEDLPSGTIELGGSTWTYQVGLQGESKQLFEVNGSLTAEWNTISE 604
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
+ L W KT F P G G +AL+L+SMGKG WVNG ++GRYWS++ A GC C
Sbjct: 605 VSDQNFLFWIKTRFDMPAGNGSIALDLSSMGKGVVWVNGVNLGRYWSSFTAQRDGCDASC 664
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
DYRGSY SKC C QP+Q YHIPR W+ P N +V+ EE GG+P IS+ T+ Q I
Sbjct: 665 DYRGSYTQSKCLTKCNQPSQNWYHIPRQWLLPKNNFIVLFEEKGGNPKDISIATRMPQQI 724
Query: 712 CSFVSEADPPP--VDSW--KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
CS +S++ P P + SW + NL + L C G I+ I FASYG P G+C F
Sbjct: 725 CSHISQSHPFPFSLTSWTKRDNLTSTLLRAPLTLECAEGQQISRICFASYGTPSGDCEGF 784
Query: 768 RPGACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+CH + ++ KACVG+ +CS+P+ S+ G CPGL K+LA A CS
Sbjct: 785 VLSSCHANTSYDVLTKACVGRQKCSVPIVSSIFG--DDPCPGLSKSLAATAECS 836
>gi|350537549|ref|NP_001234298.1| beta-galactosidase precursor [Solanum lycopersicum]
gi|7939617|gb|AAF70821.1|AF154420_1 beta-galactosidase [Solanum lycopersicum]
Length = 892
Score = 817 bits (2110), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/863 (48%), Positives = 563/863 (65%), Gaps = 63/863 (7%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NVTYD+RAL+I GKRR+L S IHYPR+TPE+WP LI +SKEGG +VIETY FWN HEP
Sbjct: 36 NVTYDNRALIIGGKRRMLISAGIHYPRATPEMWPTLIARSKEGGADVIETYTFWNGHEPT 95
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
RGQY FEGR+D+V+F K V GLFL +RIGPYACAEWN+GGFP+WL IPGI+FRT N
Sbjct: 96 RGQYNFEGRYDIVKFAKLVGSHGLFLFIRIGPYACAEWNFGGFPIWLRDIPGIEFRTDNA 155
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFKEEM+R++ KI+DLM E+LF+ QGGPIIL Q+ENEYGNVE ++G G+LY+KWAA+
Sbjct: 156 PFKEEMERYVKKIVDLMISESLFSWQGGPIILLQIENEYGNVESSFGPKGKLYMKWAAEM 215
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
AV L VPWVMC+Q DAP+ II+TCN +YCDGFTPNS KP +WTEN++GWF +G +
Sbjct: 216 AVGLGAGVPWVMCRQTDAPEYIIDTCNAYYCDGFTPNSEKKPKIWTENWNGWFADWGERL 275
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P+RP ED+AFA+ARFF+ GG+ QNYYMYFGGTNFGRTAGGP TSYDYDAP+DEYG +R
Sbjct: 276 PYRPSEDIAFAIARFFQRGGSLQNYYMYFGGTNFGRTAGGPTQITSYDYDAPLDEYGLLR 335
Query: 304 QPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYHKSSND-----------CA 351
QPKWGHL++LH AIKLCE L+++D P + KLG K EAH+Y +SN+ CA
Sbjct: 336 QPKWGHLKDLHAAIKLCEPALVAADSPQYIKLGPKQEAHVYRGTSNNIGQYMSLNEGICA 395
Query: 352 AFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKV-ISQRNNGDHPFAQQK 410
AF+AN D A V F G + LP WSV V A++ +S + H Q K
Sbjct: 396 AFIANIDEHESATVKFYGQEFTLPPWSV--------VFCQIAEIQLSTQLRWGHKL-QSK 446
Query: 411 NVNELLL---------------ASSAF--SWY--EEKVGISGNRSFVRPDLAEQINTTKD 451
++L +S +F SW +E +G+ G+++F + E +N TKD
Sbjct: 447 QWAQILFQLGIILCFYKLSLKASSESFSQSWMTLKEPLGVWGDKNFTSKGILEHLNVTKD 506
Query: 452 TSDYLWYTASIHVMPG-----QGKEV--FLNIESLGHAALVFVNKKLVAFGYGNHDFANF 504
SDYLWY I++ + +V ++I+S+ +FVN +L G
Sbjct: 507 QSDYLWYLTRIYISDDDISFWEENDVSPTIDIDSMRDFVRIFVNGQLAGSVKGKW----I 562
Query: 505 LINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVI-LIDLKNGKRDLSSGEWI 563
+ + ++L +G N + +LS VGLQNYGA+ + GAG I L K+G +L++ W
Sbjct: 563 KVVQPVKLVQGYNDILLLSETVGLQNYGAFLEKDGAGFKGQIKLTGCKSGDINLTTSLWT 622
Query: 564 YQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGK 623
YQVG+ GE++ + ++ S+ W + T WYKT F AP G P+AL+ +SMGK
Sbjct: 623 YQVGLRGEFLEVYDVNSTESAGWTEFPTGTTPSVFSWYKTKFDAPGGTDPVALDFSSMGK 682
Query: 624 GQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHP 683
GQAWVNG +GRYW+ +AP+ GC + CDYRG+Y + KC+ +CG+ Q YHIPR+W+
Sbjct: 683 GQAWVNGHHVGRYWT-LVAPNNGCGRTCDYRGAYHSDKCRTNCGEITQAWYHIPRSWLKT 741
Query: 684 GENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPN-----LGVVSSSPQ 738
N+LVI EE P IS+ T++ + IC+ VSE PP+ W + L ++ +P+
Sbjct: 742 LNNVLVIFEETDKTPFDISISTRSTETICAQVSEKHYPPLHKWSHSEFDRKLSLMDKTPE 801
Query: 739 VRLACERGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSA 797
+ L C+ G I++I FASYG P G+C F G CH + L +V +AC+G+ CSI +S+
Sbjct: 802 MHLQCDEGHTISSIEFASYGSPNGSCQKFSQGKCHAANSLSVVSQACIGRTSCSIGISN- 860
Query: 798 YLGVSAGACPGLLKALAVEAHCS 820
GV C ++K+LAV+A CS
Sbjct: 861 --GVFGDPCRHVVKSLAVQAKCS 881
>gi|302814772|ref|XP_002989069.1| hypothetical protein SELMODRAFT_269483 [Selaginella moellendorffii]
gi|300143170|gb|EFJ09863.1| hypothetical protein SELMODRAFT_269483 [Selaginella moellendorffii]
Length = 722
Score = 816 bits (2109), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/714 (56%), Positives = 500/714 (70%), Gaps = 20/714 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
LS V YDHR L+I+G+ R+L S SIHYPR+ P++W +LI +K GG++VIETYVFW+ H
Sbjct: 20 LSDTVAYDHRGLIINGQHRMLISASIHYPRAAPQMWSQLISNAKAGGIDVIETYVFWDGH 79
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
+P R Y FEGRFDLV FVK V EAGL+ +LRIGPY CAEWN GGFPVWL +PGI+FRT
Sbjct: 80 QPTRDTYNFEGRFDLVSFVKLVHEAGLYANLRIGPYVCAEWNLGGFPVWLKDVPGIEFRT 139
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK EM+ F+ KI+ +MK + LFA QGGPIILAQ+ENEYGN++ AYG G+ Y++WA
Sbjct: 140 NNQPFKAEMQAFVEKIVAMMKHDKLFAPQGGPIILAQIENEYGNIDAAYGAAGKEYMEWA 199
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A+ A L T VPW+MCQQ DAPD I++TCNGFYCD + PN+ KP MWTEN+SGWF +G
Sbjct: 200 ANMAQGLGTGVPWIMCQQSDAPDYILDTCNGFYCDAWAPNNKKKPKMWTENWSGWFQKWG 259
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
A P RPVED+AFAVARFF+ GG+FQNYYMYFGGTNFGR++GGP V TSYDYDAPIDE+G
Sbjct: 260 EASPHRPVEDVAFAVARFFQRGGSFQNYYMYFGGTNFGRSSGGPYVTTSYDYDAPIDEFG 319
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY-HKSSNDCAAFLANYDS 359
IRQPKWGHL++LH AIKLCE L S+DPT+ LG EAH+Y SS CAAFLAN DS
Sbjct: 320 VIRQPKWGHLKQLHAAIKLCEAALGSNDPTYISLGQLQEAHVYGSTSSGACAAFLANIDS 379
Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
SSDA V FN Y LPAWSVSILPDCK V NTAKV Q + + +
Sbjct: 380 SSDATVKFNSRTYLLPAWSVSILPDCKTVSHNTAKV-----------HVQTAMPTMKPSI 428
Query: 420 SAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGK--EVFLN 475
+ +W Y E VG+ + V L EQINTTKDTSDYLWYT S+ + + L+
Sbjct: 429 TGLAWESYPEPVGVWSDSGIVASALLEQINTTKDTSDYLWYTTSLDISQADAASGKALLS 488
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ES+ VFVN KL + + IEL G N+L IL VGLQNYG +
Sbjct: 489 LESMRDVVHVFVNGKLAGSASTKGTQLYAAVEQPIELASGHNSLAILCATVGLQNYGPFI 548
Query: 536 DVAGAGL-FSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+ GAG+ SVI+ L +G+ DL++ EWI+QVG++GE + + S + W S +P
Sbjct: 549 ETWGAGINGSVIVKGLPSGQIDLTAEEWIHQVGLKGESLAIFTESGSQRVRWS--SAVPQ 606
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPST-GCTKKCDY 653
++L+WYK F +P G P+AL+L SMGKGQAW+NGQSIGR+W + AP T GC + CDY
Sbjct: 607 GQALVWYKAHFDSPSGNDPVALDLESMGKGQAWINGQSIGRFWPSLRAPDTAGCPQTCDY 666
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
RGSY +SKC+ CGQP+Q YH+PR+W+ NL+V+ EE GG PS +S +T+T
Sbjct: 667 RGSYSSSKCRSGCGQPSQRWYHVPRSWLQDSGNLVVLFEEEGGKPSGVSFVTRT 720
>gi|293332101|ref|NP_001168664.1| uncharacterized protein LOC100382452 [Zea mays]
gi|223950023|gb|ACN29095.1| unknown [Zea mays]
Length = 815
Score = 815 bits (2106), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/799 (51%), Positives = 524/799 (65%), Gaps = 28/799 (3%)
Query: 35 VWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIG 94
+W LI+K+K+GGL+VI+TYVFWN HEP G YYFE R+DLVRFVKTVQ+AGLF+HLRIG
Sbjct: 29 MWEGLIQKAKDGGLDVIQTYVFWNGHEPTPGNYYFEERYDLVRFVKTVQKAGLFVHLRIG 88
Query: 95 PYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPII 154
PY C EWN+GGFPVWL ++PGI FRT N PFK M+ F KI+ +MK ENLFASQGGPII
Sbjct: 89 PYICGEWNFGGFPVWLKYVPGISFRTDNEPFKTAMQGFTEKIVGMMKSENLFASQGGPII 148
Query: 155 LAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC 214
L+Q+ENEYG +G G+ Y+ WAA AV L+T VPWVMC++EDAPDP+IN CNGFYC
Sbjct: 149 LSQIENEYGPEGKEFGAAGQAYINWAAKMAVGLDTGVPWVMCKEEDAPDPVINACNGFYC 208
Query: 215 DGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGG 274
D F+PN P KP MWTE +SGWF FG + RPVEDLAFAVARF + GG+F NYYMY GG
Sbjct: 209 DAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAFAVARFVQKGGSFINYYMYHGG 268
Query: 275 TNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKL 334
TNFGRTAGGP + TSYDYDAPIDEYG IR+PK HL+ELH+A+KLCE+ L+S DPT L
Sbjct: 269 TNFGRTAGGPFITTSYDYDAPIDEYGLIREPKHSHLKELHRAVKLCEQALVSVDPTITTL 328
Query: 335 GAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAK 394
G EAH++ +S + CAAFLANY+S+S A V FN Y LP WS+SILPDCKNVVFN+A
Sbjct: 329 GTMQEAHVF-RSPSGCAAFLANYNSNSHAKVVFNNEQYSLPPWSISILPDCKNVVFNSAT 387
Query: 395 VISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKV-GISGNRSFVRPDLAEQINTTKDTS 453
V Q + Q + S + Y+E+V ++ L EQ+N T+D+S
Sbjct: 388 VGVQTS--------QMQMWGDGATSMMWERYDEEVDSLAAAPLLTTTGLLEQLNVTRDSS 439
Query: 454 DYLWYTASIHVMP------GQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLIN 507
DYLWY S+ + P G GK L+++S GHA VFVN +L YG + N
Sbjct: 440 DYLWYITSVDISPSENFLQGGGKPPSLSVQSAGHALHVFVNGQLQGSSYGTREDRRIKYN 499
Query: 508 KKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQV 566
+ L G N + +LS+ GL N G ++ G+ V+L L G RDL+ W YQV
Sbjct: 500 GNVNLRAGTNKIALLSVACGLPNVGVHYETWNTGVGGPVVLHGLNEGSRDLTWQTWSYQV 559
Query: 567 GVEGEYIGLDKISLANSSFWKQGSTLPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQ 625
G++GE + L+ + + S W QGS + + L WYK F P G PLAL++ SMGKGQ
Sbjct: 560 GLKGEQMNLNSVEGSGSVEWMQGSLIAQKQQPLAWYKAYFETPSGDEPLALDMGSMGKGQ 619
Query: 626 AWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGE 685
W+NGQSIGRYW+AY + G K C Y G++ A KCQ CGQP Q YH+PR+W+ P
Sbjct: 620 VWINGQSIGRYWTAY---ADGDCKGCSYTGTFRAPKCQAGCGQPTQRWYHVPRSWLQPSR 676
Query: 686 NLLVIHEEL-GGDPSKISLLTKTGQHICSFVSEADPPPVDSWK-PNLGVVS-SSPQVRLA 742
NLLV+ EEL GGD SKI+L ++ +C+ VSE D P + W+ + G +V L
Sbjct: 677 NLLVVLEELGGGDSSKIALAKRSVSSVCADVSE-DHPNIKKWQIESYGEREHRRAKVHLR 735
Query: 743 CERGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGV 801
C G I+AI FAS+G P G CG+F+ G CH +++K C+G C + +S G
Sbjct: 736 CAHGQSISAIRFASFGTPVGTCGNFQQGGCHSASSHAVLEKRCIGLQRCVVAISPDNFG- 794
Query: 802 SAGACPGLLKALAVEAHCS 820
CP + K +AVEA CS
Sbjct: 795 -GDPCPSVTKRVAVEAVCS 812
>gi|449464712|ref|XP_004150073.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 848
Score = 815 bits (2104), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 411/830 (49%), Positives = 536/830 (64%), Gaps = 29/830 (3%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NVTYD +AL+I+G+R++L SGSIHYPRS P++W LI K+K GGL+V++TYVFWN HEP
Sbjct: 29 NVTYDGKALIINGQRKILFSGSIHYPRSVPDMWESLIEKAKMGGLDVVDTYVFWNLHEPS 88
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
G Y FEGR DLV+F+K V++AGL++HLRIGPY C EWN+GGFP WL F+PGI FRT N
Sbjct: 89 PGIYDFEGRNDLVKFIKLVEKAGLYVHLRIGPYICGEWNFGGFPAWLKFVPGISFRTDNE 148
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK M +F KI+ +MK E LF SQGGPIIL+Q+ENEY + +G G Y+ WAA
Sbjct: 149 PFKLAMAKFTKKIVQMMKDERLFQSQGGPIILSQIENEYETEDKVFGEAGFAYMNWAAKM 208
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
AV ++T VPWVMC+Q+DAPDP+INTCNGFYCD F+PN P KP WTE ++ WF +FG
Sbjct: 209 AVQMDTGVPWVMCKQDDAPDPMINTCNGFYCDYFSPNKPYKPNFWTEAWTAWFNNFGGPN 268
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
RPVEDLAF VARF + GG+ NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG IR
Sbjct: 269 HKRPVEDLAFGVARFIQKGGSLVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIR 328
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
QPK+GHL+ LH A+KLCE+ L++ +P L +A ++ SS DCAAFL+NY S++ A
Sbjct: 329 QPKFGHLKRLHDAVKLCEKALLTGEPHDYTLATYQKAKVFSSSSGDCAAFLSNYHSNNTA 388
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
VTFNG Y LP WS+SILPDCK+V++NTA+V Q N ++ L +FS
Sbjct: 389 RVTFNGRHYTLPPWSISILPDCKSVIYNTAQVQVQTN----------QLSFLPTKVESFS 438
Query: 424 W--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
W Y E + I + S L EQ+ TKD SDYLWYT S++V P + GK L
Sbjct: 439 WETYNENISSIEEDSSMSYDGLLEQLTITKDNSDYLWYTTSVNVDPNESYLRGGKFPTLT 498
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
S GH VF+N KL +G HD + F +I L G+N + +LS+ GL N G +
Sbjct: 499 ATSKGHGMHVFINGKLAGSSFGTHDNSKFTFTGRINLQAGVNKVSLLSIAGGLPNNGPHY 558
Query: 536 DVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+ G+ + I L GK DLS +W Y+VG++GE + L S + W + S
Sbjct: 559 EEREMGVLGPVAIHGLDKGKMDLSRQKWSYKVGLKGENMNLGSPSSVQAVDWAKDSLKQE 618
Query: 595 N-KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
N + L WYK F APEG PLAL++ SM KGQ W+NGQ++GRYW+ + + CT C Y
Sbjct: 619 NAQPLTWYKAYFDAPEGDEPLALDMGSMQKGQVWINGQNVGRYWT--ITANGNCT-DCSY 675
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
G+Y KCQ CGQP Q YH+PR+W+ P +NL+V+ EE+GG+PS+ISL+ ++ IC+
Sbjct: 676 SGTYRPRKCQFGCGQPTQQWYHVPRSWLMPTKNLIVVFEEVGGNPSRISLVKRSVTSICT 735
Query: 714 FVSEADPPPVD-SWKPNLGVVSSSP--QVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
S+ P + N G ++ ++ L C G I+AI FAS+G P G CGS + G
Sbjct: 736 EASQYRPVIKNVHMHQNNGELNEQNVLKINLHCAAGQFISAIKFASFGTPSGACGSHKQG 795
Query: 771 ACHMDVLP-IVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
CH ++QK CVG+ C + ++ G CP L K L+ E C
Sbjct: 796 TCHSPKSDYVLQKLCVGRQRCLATIPTSIFG--EDPCPNLRKKLSAEVVC 843
>gi|449489867|ref|XP_004158444.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cucumis
sativus]
Length = 725
Score = 813 bits (2101), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/714 (55%), Positives = 498/714 (69%), Gaps = 23/714 (3%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A+VTYDH+A++I+G+RR+L SGSIHYPRS P++WP+LI+K+K+GGL+VIETYVFWN HEP
Sbjct: 24 ASVTYDHKAIIINGRRRILISGSIHYPRSIPQMWPDLIQKAKDGGLDVIETYVFWNGHEP 83
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
GQY FE R+DLVRFVK V +AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT N
Sbjct: 84 SPGQYNFEDRYDLVRFVKLVHQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIAFRTDN 143
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M++F KI+ LMK E L+ SQGGPIIL+Q+ENEYG VEW G G+ Y KWAA
Sbjct: 144 GPFKAAMQKFTEKIVGLMKGEKLYESQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAAQ 203
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
A+ LNT VPWVMC+Q+DAPDP+I+TCNGFYC+ F PN KP MWTE ++GWF FG
Sbjct: 204 MALGLNTGVPWVMCKQDDAPDPVIDTCNGFYCENFKPNKVYKPKMWTEAWTGWFTEFGGP 263
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
P+RPVED+A++VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG +
Sbjct: 264 APYRPVEDMAYSVARFIQNGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLL 323
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
R+PKW HLR+LHKAIKLCE L+S DPT LG+ EAH++ S CAAFLANYD+SS
Sbjct: 324 REPKWSHLRDLHKAIKLCEPALVSVDPTVSYLGSNQEAHVFKTRSGSCAAFLANYDASSS 383
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A VTF N Y LP WSVSILPDCK+V+FNTAKV P +Q K + S+F
Sbjct: 384 ATVTFGNNQYDLPPWSVSILPDCKSVIFNTAKV-------GAPTSQPK-----MTPVSSF 431
Query: 423 SWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
SW EE + L EQI+ T+D++DYLWY I + P + G+ L
Sbjct: 432 SWLSYNEETASAYTEDTTTMAGLVEQISVTRDSTDYLWYMTDIRIDPNEGFLKSGQWPLL 491
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
+ S GHA VF+N +L YG + +K + L GIN L ILS+ VGL N G
Sbjct: 492 TVFSAGHALHVFINGQLSGTTYGGSENYKLTFSKYVNLRAGINKLSILSVAVGLPNGGLH 551
Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
++ G+ V L L RD+S +W Y++G++GE + L +S ++S W GS +
Sbjct: 552 YETWNTGVLGPVTLKGLNEDTRDMSGYKWSYKIGLKGEALNLHSVSGSSSVEWVTGSLVA 611
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ L WYKTTF +P+G PLAL+++SMGKGQ W+NGQSIGR+W AY A G KC+Y
Sbjct: 612 QKQPLTWYKTTFDSPKGNEPLALDMSSMGKGQIWINGQSIGRHWPAYTA--KGSCGKCNY 669
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
G ++ KC CG+P+Q YH+PR W+ N+LVI EE GG+P ISL+ ++
Sbjct: 670 GGIFNEKKCHSXCGEPSQRWYHVPRAWLKSSGNVLVIFEEWGGNPEGISLVKRS 723
>gi|3641865|emb|CAA09457.1| beta-galactosidase [Cicer arietinum]
Length = 723
Score = 813 bits (2099), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/715 (55%), Positives = 496/715 (69%), Gaps = 22/715 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
++A+VTYDH+ +VIDG+RR+L SGSIHYPRSTPE+WP L +K+KEGGL+VI+TYVFWN H
Sbjct: 21 VTASVTYDHKTIVIDGQRRILISGSIHYPRSTPEMWPALFQKAKEGGLDVIQTYVFWNGH 80
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G+YYFE RFDLV+F+K Q+AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 81 EPSPGKYYFEDRFDLVKFIKLAQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 140
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M++F KI+ +MK ENLF +QGGPII++Q+ENEYG VEW G G+ Y WA
Sbjct: 141 DNEPFKAAMQKFTTKIVSMMKAENLFQNQGGPIIMSQIENEYGPVEWNIGAPGKAYTNWA 200
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L+T VPW MC+QEDAPDP+I+TCNG+YC+ FTPN KP MWTEN+SGW+ FG
Sbjct: 201 AQMAVGLDTGVPWDMCKQEDAPDPVIDTCNGYYCENFTPNKNYKPKMWTENWSGWYTDFG 260
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
A+ +RPVEDLA++VARF + G+F NYYMY GGTNFGRT+ G +ATSYDYDAPIDEYG
Sbjct: 261 NAICYRPVEDLAYSVARFIQNRGSFVNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYG 320
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+PKW HLR+LHKAIK CE L+S DPT LG KLEAH+Y ++ CAAFLANYD+
Sbjct: 321 LTNEPKWSHLRDLHKAIKQCEPALVSVDPTITSLGNKLEAHVYSTGTSVCAAFLANYDTK 380
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A VTF Y LP WSVSILPDCK VFNTAKV +Q + QK ++ +S
Sbjct: 381 SAATVTFGNGKYDLPPWSVSILPDCKTDVFNTAKVGAQ--------SSQKT---MISTNS 429
Query: 421 AFSW---YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
F W EE S + S L EQIN T+D+SDYLWY +++ P + G+
Sbjct: 430 TFDWQSYIEEPAFSSEDDSITAEALWEQINVTRDSSDYLWYLTDVNISPNEDFIKNGQYP 489
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
LN+ S GH VFVN +L YG D + + L G N + +LS+ VGL N G
Sbjct: 490 ILNVMSAGHVLHVFVNGQLSGTVYGVLDNPKLTFSNSVNLTVGNNKISLLSVAVGLPNVG 549
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
F+ G+ V L L G RDLS +W Y+VG++GE + L I+ +S W QGS
Sbjct: 550 LHFETWNVGVLGPVTLKGLNEGTRDLSWQKWSYKVGLKGESLSLHTITGGSSVDWTQGSL 609
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
L + L WYK TF AP G PL L+++SMGKG+ WVN QSIGR+W Y+A G C
Sbjct: 610 LAKKQPLTWYKATFNAPAGNDPLGLDMSSMGKGEIWVNDQSIGRHWPGYIA--HGSCGDC 667
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
DY G++ +KC+ +CG P QT YHIPR+W++P N+LV+ EE GGDPS ISLL +
Sbjct: 668 DYAGTFTNTKCRTNCGNPTQTWYHIPRSWLNPTGNVLVVLEEWGGDPSGISLLKR 722
>gi|357449771|ref|XP_003595162.1| Beta-galactosidase [Medicago truncatula]
gi|124360798|gb|ABN08770.1| Galactose-binding like [Medicago truncatula]
gi|355484210|gb|AES65413.1| Beta-galactosidase [Medicago truncatula]
Length = 726
Score = 812 bits (2097), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/716 (54%), Positives = 502/716 (70%), Gaps = 22/716 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
++A+VTYDH+A+VI+GKRR+L SGSIHYPRSTP++WP+LI+K+K+GG++VIETYVFWN H
Sbjct: 24 VTASVTYDHKAIVINGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGVDVIETYVFWNGH 83
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP +G+YYFE RFDLV+F+K VQ+AGL++HLRIGPY CAEWN+GGFPVWL ++PG+ FRT
Sbjct: 84 EPSQGKYYFEDRFDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVAFRT 143
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M++F KI+ +MK ENLF SQGGPIIL+Q+ENEYG VEW G G+ Y KW
Sbjct: 144 DNEPFKAAMQKFTTKIVSIMKSENLFQSQGGPIILSQIENEYGPVEWEIGAPGKSYTKWF 203
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
+ AV LNT VPWVMC+QEDAPDPII+TCNG+YC+ F+PN KP MWTEN++GW+ FG
Sbjct: 204 SQMAVGLNTGVPWVMCKQEDAPDPIIDTCNGYYCENFSPNKNYKPKMWTENWTGWYTDFG 263
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
AVP+RP EDLAF+VARF + G++ NYYMY GGTNFGRT+ G +ATSYDYDAPIDEYG
Sbjct: 264 TAVPYRPAEDLAFSVARFVQNRGSYVNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYG 323
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
I +PKWGHLR+LHKAIK CE L+S DPT G LE H+Y S CAAFLANYD+
Sbjct: 324 LISEPKWGHLRDLHKAIKQCESALVSVDPTVSWPGKNLEVHLYKTSFGACAAFLANYDTG 383
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V F Y LP WS+SILPDCK VFNTAKV + R + + A+S
Sbjct: 384 SWAKVAFGNGHYDLPPWSISILPDCKTEVFNTAKVRAPRVH-----------RSMTPANS 432
Query: 421 AFSW--YEEKVGISGNR-SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
AF+W Y E+ SG S+ L EQ++ T D SDYLWY +++ P + G+
Sbjct: 433 AFNWQSYNEQPAFSGESGSWTANGLLEQLSQTWDKSDYLWYMTDVNISPNEGFIKNGQNP 492
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L S GH VF+N + YG+ D + ++L G N + +LS+ VGL N G
Sbjct: 493 VLTAMSAGHVLHVFINGQFWGTAYGSLDNPKLTFSNSVKLRVGNNKISLLSVAVGLSNVG 552
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
++ G+ V L L G RDLS +W Y++G++GE + L S ++S W QGS
Sbjct: 553 VHYEKWNVGVLGPVTLKGLNEGTRDLSKQKWSYKIGLKGESLNLHTTSGSSSVKWTQGSF 612
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
L + L WYKTTF AP G PLAL+++SMGKG+ WVNGQSIGR+W AY+A G C
Sbjct: 613 LSKKQPLTWYKTTFNAPAGNDPLALDMSSMGKGEIWVNGQSIGRHWPAYIA--RGNCGSC 670
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
+Y G++ KC+ +CGQP Q YHIPR+W++P N+LV+ EE GGDP+ ISL+ +T
Sbjct: 671 NYAGTFTDKKCRTNCGQPTQKWYHIPRSWLNPSGNVLVVLEEWGGDPTGISLVKRT 726
>gi|12583687|dbj|BAB21492.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 731
Score = 811 bits (2095), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/714 (54%), Positives = 503/714 (70%), Gaps = 23/714 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
SA+V+YDH+A++I+G++R+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HE
Sbjct: 23 SASVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHE 82
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G+YYFE R+DLV+F+K VQ+AGLF++LRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 83 PSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGFPVWLKYVPGIAFRTD 142
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK M++F KI+ +MK E LF +QGGPIIL+Q+ENE+G VEW G G+ Y KWAA
Sbjct: 143 NEPFKAAMQKFTEKIVSMMKAEKLFQTQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAA 202
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV L+T VPW+MC+QEDAPDP+I+TCNGFYC+ F PN KP MWTE ++GW+ FG
Sbjct: 203 QMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDYKPKMWTEVWTGWYTEFGG 262
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
AVP RP ED+AF+VARF ++GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 263 AVPTRPAEDVAFSVARFIQSGGSFLNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGL 322
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+R+PKWGHLR+LHKAIK CE L+S DP+ KLG+ EAH++ KS +DCAAFLANYD+
Sbjct: 323 LREPKWGHLRDLHKAIKSCESALVSVDPSVTKLGSNQEAHVF-KSESDCAAFLANYDAKY 381
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
V+F G Y LP WS+SILPDCK V++TAKV SQ + ++ S
Sbjct: 382 SVKVSFGGGQYDLPPWSISILPDCKTEVYSTAKVGSQSSQ-----------VQMTPVHSG 430
Query: 422 FSWYE---EKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVF 473
F W E + L EQIN T+DT+DYLWY I + + GK
Sbjct: 431 FPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWYMTDITIGSDEAFLKNGKSPL 490
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L I S GHA VF+N +L YG+ + ++ + L GIN L +LS+ VGL N G
Sbjct: 491 LTIFSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNLRSGINKLALLSISVGLPNVGT 550
Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
F+ AG+ + L L +G D+S +W Y+ G++GE +GL ++ ++S W +G ++
Sbjct: 551 HFETWNAGVLGPITLKGLNSGTWDMSGWKWTYKTGLKGEALGLHTVTGSSSVEWVEGPSM 610
Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
+ L WYK TF AP G PLAL++ SMGKGQ W+NGQS+GR+W Y+A G C
Sbjct: 611 AKKQPLTWYKATFNAPPGDAPLALDMGSMGKGQIWINGQSVGRHWPGYIA--RGSCGDCS 668
Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
Y G+YD KC+ HCG+P+Q YHIPR+W+ P NLLV+ EE GGDPS+ISL+ +
Sbjct: 669 YAGTYDDKKCRTHCGEPSQRWYHIPRSWLTPNGNLLVVFEEWGGDPSRISLVER 722
>gi|84579369|dbj|BAE72073.1| pear beta-galactosidase1 [Pyrus communis]
Length = 731
Score = 811 bits (2094), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/714 (54%), Positives = 501/714 (70%), Gaps = 23/714 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
SA+V+YDH+A++I+G++R+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HE
Sbjct: 23 SASVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHE 82
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G+YYFE R+DLV+F+K VQ+AGLF++LRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 83 PSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGFPVWLKYVPGIAFRTD 142
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK M++F KI+ +MK E LF SQGGPIIL+Q+ENE+G VEW G G+ Y KWAA
Sbjct: 143 NEPFKAAMQKFTEKIVSMMKAEKLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAA 202
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV L+T VPW+MC+QEDAPDP+I+TCNGFYC+ F PN KP MWTE ++GW+ FG
Sbjct: 203 QMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDYKPKMWTEVWTGWYTEFGG 262
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
AVP RP ED+AF+VARF ++GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 263 AVPTRPAEDVAFSVARFIQSGGSFLNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGL 322
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
R+PKWGHLR+LHKAIK CE L+S DP+ KLG+ EAH++ KS +DCAAFLANYD+
Sbjct: 323 PREPKWGHLRDLHKAIKPCESALVSVDPSVTKLGSNQEAHVF-KSESDCAAFLANYDAKY 381
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
V+F G Y LP WS+SILPDCK V+NTAKV SQ + ++ S
Sbjct: 382 SVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGSQSSQ-----------VQMTPVHSG 430
Query: 422 FSWYE---EKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVF 473
F W E + L EQIN T+DT+DYLWY I + + GK
Sbjct: 431 FPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWYMTDITIGSDEAFLKNGKSPL 490
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L I S GHA VF+N +L YG+ + ++ + L GIN L +LS+ VGL N G
Sbjct: 491 LTISSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNLRSGINKLALLSISVGLPNVGT 550
Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
F+ AG+ + L L +G D+S +W Y+ G++GE +GL ++ ++S W +G ++
Sbjct: 551 HFETWNAGVLGPITLKGLNSGTWDMSGWKWTYKTGLKGEALGLHTVTGSSSVEWVEGPSM 610
Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
+ L WYK TF AP G PLAL++ SMGKGQ W+NGQS+GR+W Y+A G C
Sbjct: 611 AKKQPLTWYKATFNAPPGDAPLALDMGSMGKGQIWINGQSVGRHWPGYIA--RGSCGDCS 668
Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
Y G+YD KC+ HCG+P+Q YHIPR+W+ P NLLV+ EE GGDPS ISL+ +
Sbjct: 669 YAGTYDDKKCRTHCGEPSQRWYHIPRSWLTPTGNLLVVFEEWGGDPSGISLVER 722
>gi|51507377|emb|CAH18936.1| beta-galactosidase [Pyrus communis]
Length = 724
Score = 810 bits (2092), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/714 (54%), Positives = 501/714 (70%), Gaps = 23/714 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
SA+V+YDH+A++I+G++R+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HE
Sbjct: 16 SASVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHE 75
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G+YYFE R+DLV+F+K VQ+AGLF++LRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 76 PSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGFPVWLKYVPGIAFRTD 135
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK M++F KI+ +MK E LF SQGGPIIL+Q+ENE+G VEW G G+ Y KWAA
Sbjct: 136 NEPFKAAMQKFTEKIVSMMKAEKLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAA 195
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV L+T VPW+MC+QEDAPDP+I+TCNGFYC+ F PN KP MWTE ++GW+ FG
Sbjct: 196 QMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDYKPKMWTEVWTGWYTEFGG 255
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
AVP RP ED+AF+VARF ++GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 256 AVPTRPAEDVAFSVARFIQSGGSFLNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGL 315
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
R+PKWGHLR+LHKAIK CE L+S DP+ KLG+ EAH++ KS +DCAAFLANYD+
Sbjct: 316 PREPKWGHLRDLHKAIKPCESALVSVDPSVTKLGSNQEAHVF-KSESDCAAFLANYDAKY 374
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
V+F G Y LP WS+SILPDCK V+NTAKV SQ + ++ S
Sbjct: 375 SVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGSQSSQ-----------VQMTPVHSG 423
Query: 422 FSWYE---EKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVF 473
F W E + L EQIN T+DT+DYLWY I + + GK
Sbjct: 424 FPWQSFIEETTSSDETDTTYMDGLYEQINITRDTTDYLWYMTDITIGSDEAFLKNGKSPL 483
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L I S GHA VF+N +L YG+ + ++ + L GIN L +LS+ VGL N G
Sbjct: 484 LTISSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNLRSGINKLALLSISVGLPNVGT 543
Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
F+ AG+ + L L +G D+S +W Y+ G++GE +GL ++ ++S W +G ++
Sbjct: 544 HFETWNAGVLGPITLKGLNSGTWDMSGWKWTYKTGLKGEALGLHTVTGSSSVEWVEGPSM 603
Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
+ L W+K TF AP G PLAL++ SMGKGQ W+NGQS+GR+W Y+A G C
Sbjct: 604 AKKQPLTWHKATFNAPPGDAPLALDMGSMGKGQIWINGQSVGRHWPGYIA--RGSCGDCS 661
Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
Y G+YD KC+ HCG+P+Q YHIPR+W+ P NLLV+ EE GGDPS ISL+ +
Sbjct: 662 YAGTYDDKKCRTHCGEPSQRWYHIPRSWLTPTGNLLVVFEEWGGDPSGISLVER 715
>gi|1352078|sp|P48981.1|BGAL_MALDO RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; AltName:
Full=Exo-(1-->4)-beta-D-galactanase; Flags: Precursor
gi|507278|gb|AAA62324.1| b-galactosidase-related protein; putative [Malus x domestica]
Length = 731
Score = 808 bits (2086), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/714 (54%), Positives = 500/714 (70%), Gaps = 23/714 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
SA+V+YDH+A++I+G++R+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HE
Sbjct: 23 SASVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHE 82
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G YYFE R+DLV+F+K VQ+ GLF++LRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 83 PSPGNYYFEERYDLVKFIKLVQQEGLFVNLRIGPYVCAEWNFGGFPVWLKYVPGIAFRTD 142
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK M++F KI+ +MK E LF +QGGPIIL+Q+ENE+G VEW G G+ Y KWAA
Sbjct: 143 NEPFKAAMQKFTEKIVSMMKAEKLFQTQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAA 202
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV L+T VPW+MC+QEDAPDP+I+TCNGFYC+ F PN KP MWTE ++GW+ FG
Sbjct: 203 QMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDYKPKMWTEVWTGWYTEFGG 262
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
AVP RP ED+AF+VARF ++GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 263 AVPTRPAEDVAFSVARFIQSGGSFLNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGL 322
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
R+PKWGHLR+LHKAIK CE L+S DP+ KLG+ EAH++ KS +DCAAFLANYD+
Sbjct: 323 PREPKWGHLRDLHKAIKSCESALVSVDPSVTKLGSNQEAHVF-KSESDCAAFLANYDAKY 381
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
V+F G Y LP WS+SILPDCK V+NTAKV SQ + ++ S
Sbjct: 382 SVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGSQSSQ-----------VQMTPVHSG 430
Query: 422 FSWYE---EKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVF 473
F W E + L EQIN T+DT+DYLWY I + + GK
Sbjct: 431 FPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWYMTDITIGSDEAFLKNGKSPL 490
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L I S GHA VF+N +L YG+ + ++ + L GIN L +LS+ VGL N G
Sbjct: 491 LTIFSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNLRSGINKLALLSISVGLPNVGT 550
Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
F+ AG+ + L L +G D+S +W Y+ G++GE +GL ++ ++S W +G ++
Sbjct: 551 HFETWNAGVLGPITLKGLNSGTWDMSGWKWTYKTGLKGEALGLHTVTGSSSVEWVEGPSM 610
Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
+ L WYK TF AP G PLAL++ SMGKGQ W+NGQS+GR+W Y+A G C
Sbjct: 611 AEKQPLTWYKATFNAPPGDAPLALDMGSMGKGQIWINGQSVGRHWPGYIA--RGSCGDCS 668
Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
Y G+YD KC+ HCG+P+Q YHIPR+W+ P NLLV+ EE GGDPS+ISL+ +
Sbjct: 669 YAGTYDDKKCRTHCGEPSQRWYHIPRSWLTPTGNLLVVFEEWGGDPSRISLVER 722
>gi|20384648|gb|AAK31801.1| beta-galactosidase [Citrus sinensis]
Length = 737
Score = 808 bits (2086), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/713 (53%), Positives = 501/713 (70%), Gaps = 17/713 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ A+V+YDH+A++I+G++R+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN H
Sbjct: 35 VKASVSYDHKAVIINGQKRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGH 94
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP +G YYF+ R+DLVRF+K VQ+AGL++HLRIGPY CAEWNYGGFPVWL ++PGI+FRT
Sbjct: 95 EPTQGNYYFQDRYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYGGFPVWLKYVPGIEFRT 154
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M +F KI+ +MK E LF +QGGPIIL+Q+ENE+G VEW G G+ Y KWA
Sbjct: 155 DNGPFKAAMHKFTEKIVSMMKAEKLFQTQGGPIILSQIENEFGPVEWDIGAPGKAYAKWA 214
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV LNT VPWVMC+Q+DAPDP+INTCNGFYC+ F PN KP MWTE ++GWF FG
Sbjct: 215 AQMAVGLNTGVPWVMCKQDDAPDPVINTCNGFYCEKFVPNQNYKPKMWTEAWTGWFTEFG 274
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
AVP RP EDL F+VARF ++GG+F NYYMY GGTNFGRT+GG VATSYDYDAPIDEYG
Sbjct: 275 SAVPTRPAEDLVFSVARFIQSGGSFINYYMYHGGTNFGRTSGG-FVATSYDYDAPIDEYG 333
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+ +PKWGHLR LHKAIKLCE L+S DPT + LG EAH+++ S CAAFLANYD++
Sbjct: 334 LLNEPKWGHLRGLHKAIKLCEPALVSVDPTVKSLGENQEAHVFNSISGKCAAFLANYDTT 393
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
A V+F Y LP WS+S+LPDCK VFNTA+V Q + QK ++ A S
Sbjct: 394 FSAKVSFGNAQYDLPPWSISVLPDCKTAVFNTARVGVQ--------SSQKKFVPVINAFS 445
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
S+ EE + + +F + L EQ+ T D SDYLWY +++ + G++ L
Sbjct: 446 WQSYIEETASSTDDNTFTKDGLWEQVYLTADASDYLWYMTDVNIGSNEGFLKNGQDPLLT 505
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
I S GHA VF+N +L YG+ + +K ++L G+N + +LS VGL N G F
Sbjct: 506 IWSAGHALQVFINGQLSGTVYGSLENPKLTFSKNVKLRAGVNKISLLSTSVGLPNVGTHF 565
Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+ AG+ V L L G RD+S +W Y++G++GE + L +S ++S W QG++L
Sbjct: 566 EKWNAGVLGPVTLKGLNEGTRDISKQKWTYKIGLKGEALSLHTVSGSSSVEWAQGASLAQ 625
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
+ + WYKTTF P G PLAL++ +MGKG W+NGQSIGR+W Y+ G C+Y
Sbjct: 626 KQPMTWYKTTFNVPPGNDPLALDMGAMGKGMVWINGQSIGRHWPGYIG--NGNCGGCNYA 683
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
G+Y KC+ +CG+P+Q YH+PR+ + P NLLV+ EE GG+P ISLL +T
Sbjct: 684 GTYTEKKCRTYCGKPSQRWYHVPRSRLKPSGNLLVVFEEWGGEPHWISLLKRT 736
>gi|3641863|emb|CAA06309.1| beta-galactosidase [Cicer arietinum]
Length = 730
Score = 805 bits (2079), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/716 (54%), Positives = 499/716 (69%), Gaps = 21/716 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
++A+VTYDH+A+VI+G+RR+L SGSIHYPRSTP++WP+LI+K+K+GG++VI+TYVFWN H
Sbjct: 27 VTASVTYDHKAIVINGQRRILISGSIHYPRSTPQMWPDLIQKAKDGGVDVIQTYVFWNGH 86
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G YYFE RFDLV+FVK VQ+AGL+++LRIGPY CAEWN+GGFPVWL ++PG+ FRT
Sbjct: 87 EPSPGNYYFEDRFDLVKFVKVVQQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGVAFRT 146
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M++F AKI+ +MK ENLF SQGGPII++Q+ENEYG VEW G G+ Y KW
Sbjct: 147 DNEPFKAAMQKFTAKIVSMMKAENLFESQGGPIIMSQIENEYGPVEWEIGAPGKAYTKWF 206
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
+ A+ L+T VPW+MC+QEDAPDPII+TCNG+YC+ FTPN KP MWTEN+SGW+ FG
Sbjct: 207 SQMAIGLDTGVPWIMCKQEDAPDPIIDTCNGYYCENFTPNKNYKPKMWTENWSGWYTDFG 266
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
AVP+RP +D+AF+VARF + G++ NYYMY GGTNFGRT+ G +ATSYDYDAPIDEYG
Sbjct: 267 SAVPYRPAQDVAFSVARFIQNRGSYVNYYMYHGGTNFGRTSAGLFIATSYDYDAPIDEYG 326
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+ +PKWGHLR LHKAIK CE L+S DPT G LE H+Y S+ CAAFLANYD++
Sbjct: 327 LLSEPKWGHLRNLHKAIKQCEPILVSVDPTVSWPGKNLEVHVYKTSTGACAAFLANYDTT 386
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A VTF Y LP WS+SILPDCK VFNTAKV G P +K + SS
Sbjct: 387 SPAKVTFGNGQYDLPPWSISILPDCKTAVFNTAKV------GTVPSFHRK----MTPVSS 436
Query: 421 AFSW--YEEKVGISG-NRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
AF W Y E SG + S L EQI T+D+SDYLWY +++ P + G+
Sbjct: 437 AFDWQSYNEAPASSGIDDSTTANALLEQIKVTRDSSDYLWYMTDVNISPNEGFIKNGQYP 496
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L S GH VFVN + YG + + ++L G N + +LS+ VGL N G
Sbjct: 497 VLTAMSAGHVLHVFVNGQFSGTAYGGLENPKLTFSNSVKLRVGNNKISLLSVAVGLSNVG 556
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
++ G+ V L L G RDLS +W Y++G++GE + L + ++S W +GS+
Sbjct: 557 LHYETWNVGVLGPVTLKGLNEGTRDLSGQKWSYKIGLKGETLNLHTLIGSSSVQWTKGSS 616
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
L + L WYK TF AP G PLAL+++SMGKG+ WVNG+SIGR+W AY+A G C
Sbjct: 617 LVKKQPLTWYKATFDAPAGNDPLALDMSSMGKGEIWVNGESIGRHWPAYIA--RGSCGGC 674
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
+Y G++ KC+ CGQP Q YHIPR+WV+P N LV+ EE GGDPS ISL+ +T
Sbjct: 675 NYAGTFTDKKCRTSCGQPTQKWYHIPRSWVNPRGNFLVVLEEWGGDPSGISLVKRT 730
>gi|186461094|gb|ACC78255.1| beta-galactosidase [Carica papaya]
Length = 721
Score = 804 bits (2077), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/716 (53%), Positives = 502/716 (70%), Gaps = 23/716 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ A V+YDH+A++I+G+RR+L SGSIHYPRSTP++WP+LI+ +KEGGL+VI+TYVFWN H
Sbjct: 19 VEATVSYDHKAIIINGRRRILISGSIHYPRSTPQMWPDLIQNAKEGGLDVIQTYVFWNGH 78
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G YYFE R+DLV+F+K V +AGL++HLRIGPY C EWN+GGFPVWL ++PGIQFRT
Sbjct: 79 EPSPGNYYFEDRYDLVKFIKLVHQAGLYVHLRIGPYICGEWNFGGFPVWLKYVPGIQFRT 138
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK +M++F KI+++MK E LF QGGPII++Q+ENEYG +EW G G+ Y KWA
Sbjct: 139 DNGPFKAQMQKFTEKIVNMMKAEKLFEPQGGPIIMSQIENEYGPIEWEIGAPGKAYTKWA 198
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L T VPW+MC+QEDAPDPII+TCNGFYC+ F PN+ KP M+TE ++GW+ FG
Sbjct: 199 AQMAVGLGTGVPWIMCKQEDAPDPIIDTCNGFYCENFMPNANYKPKMFTEAWTGWYTEFG 258
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
VP+RP ED+A++VARF + G+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 259 GPVPYRPAEDMAYSVARFIQNRGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 318
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
R+PKWGHLR+LHK IKLCE L+S DP LG+ EAH++ + CAAFLANYD
Sbjct: 319 LRREPKWGHLRDLHKTIKLCEPSLVSVDPKVTSLGSNQEAHVFW-TKTSCAAFLANYDLK 377
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
VTF Y LP WSVSILPDCK VVFNTAKV+S Q ++ +++ +S
Sbjct: 378 YSVRVTFQNLPYDLPPWSVSILPDCKTVVFNTAKVVS-----------QGSLAKMIAVNS 426
Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
AFSW EE + + F + L EQI+ T+D +DYLWY + + P + G++
Sbjct: 427 AFSWQSYNEETPSANYDAVFTKDGLWEQISVTRDATDYLWYMTDVTIGPDEAFLKNGQDP 486
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L + S GHA VFVN +L YG + + K++L G+N + +LS+ VGL N G
Sbjct: 487 ILTVMSAGHALHVFVNGQLSGTVYGQLENPKLAFSGKVKLRAGVNKVSLLSIAVGLPNVG 546
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
F+ AG+ V L + +G D+S +W Y++G++GE + L +S ++S W +GS
Sbjct: 547 LHFETWNAGVLGPVTLKGVNSGTWDMSKWKWSYKIGLKGEALSLHTVSGSSSVEWVEGSL 606
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
L + LIWYKTTF AP G PLAL++ SMGKGQ W+NGQSIGR+W Y A G C
Sbjct: 607 LAQRQPLIWYKTTFNAPVGNDPLALDMNSMGKGQIWINGQSIGRHWPGYKA--RGSCGAC 664
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
+Y G YD KC +CG+ +Q YH+PR+W++P NLLV+ EE GGDP+KISL+ +
Sbjct: 665 NYAGIYDEKKCHSNCGKASQRWYHVPRSWLNPTANLLVVFEEWGGDPTKISLVKRV 720
>gi|302824860|ref|XP_002994069.1| hypothetical protein SELMODRAFT_187747 [Selaginella moellendorffii]
gi|300138075|gb|EFJ04856.1| hypothetical protein SELMODRAFT_187747 [Selaginella moellendorffii]
Length = 741
Score = 803 bits (2073), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 400/731 (54%), Positives = 498/731 (68%), Gaps = 37/731 (5%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
LS V YDHR L+I+G+ R+L S SIHYPR+ P++W +LI +K GG++VIETYVFW+ H
Sbjct: 22 LSDTVAYDHRGLIINGQHRMLISASIHYPRAAPQMWSQLISNAKAGGIDVIETYVFWDGH 81
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
+P R Y FEGRFDLV FVK V EAGL+ +LRIGPY CAEWN GGFPVWL + GI+FRT
Sbjct: 82 QPTRDTYNFEGRFDLVSFVKLVHEAGLYANLRIGPYVCAEWNLGGFPVWLKDVAGIEFRT 141
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK EM+ F+ KI+ +MK + LFA QGGPIILAQ+ENEYGN++ AYG G+ Y+ WA
Sbjct: 142 NNQPFKAEMQTFVEKIVAMMKHDKLFAPQGGPIILAQIENEYGNIDAAYGAAGKEYMVWA 201
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A+ + L T VPW+MCQQ DAPD I++TCNGFYCD + PN+ KP MWTEN+SGWF +G
Sbjct: 202 ANMSQGLGTGVPWIMCQQSDAPDYILDTCNGFYCDAWAPNNKKKPKMWTENWSGWFQKWG 261
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
A P RPVED+AFAVARFF+ GG+FQNYYMYFGGTNFGR++GGP V TSYDYDAPIDE+G
Sbjct: 262 EASPHRPVEDVAFAVARFFQRGGSFQNYYMYFGGTNFGRSSGGPYVTTSYDYDAPIDEFG 321
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY-HKSSNDCAAFLANYDS 359
IRQPKWGHL++LH AIKLCE L S+DPT+ LG EAH+Y SS CAAFLAN DS
Sbjct: 322 VIRQPKWGHLKQLHAAIKLCEAALGSNDPTYISLGQLQEAHVYGSTSSGACAAFLANIDS 381
Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
SSDA V FN Y LPAWSVSILPDCK V NTAKV Q + + +
Sbjct: 382 SSDATVKFNSRTYLLPAWSVSILPDCKTVSHNTAKV-----------DVQTAMPTMKPSI 430
Query: 420 SAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGK--EVFLN 475
+ +W Y E VG+ + V L EQINTTKDTSDYLWYT S+ + + L
Sbjct: 431 TGLAWESYPEPVGVWSDSGIVASALLEQINTTKDTSDYLWYTTSLDISQADAASGKALLY 490
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ES+ VFVN KL + + IEL G N+L IL VGLQNYG +
Sbjct: 491 LESMRDVVHVFVNGKLAGSASTKGTQLYAAVEQPIELASGHNSLAILCATVGLQNYGPFI 550
Query: 536 DVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+ GAG+ SVI+ L +G+ DL++ EWI+QVG++GE + + S + W S +P
Sbjct: 551 ETWGAGINGSVIVKGLPSGQIDLTAEEWIHQVGLKGESLAIFTESGSQRVRWS--SAVPQ 608
Query: 595 NKSLIWYKTTFL-----------------APEGKGPLALNLASMGKGQAWVNGQSIGRYW 637
++L+WYK F +P G P+AL+L SMGKGQAW+NGQSIGR+W
Sbjct: 609 GQALVWYKVIFQHHGITCIVWIAMQAHFDSPSGNDPVALDLESMGKGQAWINGQSIGRFW 668
Query: 638 SAYLAPST-GCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGG 696
+ AP T GC + CDYRGSY +SKC+ CGQP+Q YH+PR+W+ G NL+V+ EE GG
Sbjct: 669 PSLRAPDTAGCPQTCDYRGSYSSSKCRSGCGQPSQRWYHVPRSWLQDGGNLVVLFEEEGG 728
Query: 697 DPSKISLLTKT 707
PS +S +T+T
Sbjct: 729 KPSGVSFVTRT 739
>gi|318136780|gb|ADV41669.1| beta-D-galactosidase [Actinidia deliciosa var. deliciosa]
Length = 728
Score = 802 bits (2071), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/716 (54%), Positives = 498/716 (69%), Gaps = 22/716 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
++A+VTYD +A+ I+G+RR+L SGSIHYPRSTPE+WP LI+K+KEGGL+VI+TYVFWN H
Sbjct: 25 VTASVTYDGKAIKINGQRRILFSGSIHYPRSTPEMWPGLIQKAKEGGLDVIQTYVFWNGH 84
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP GQYYFEGR+DLVRF+K Q+AGL++HLRIG Y CAEWN+GGFPVWL ++PGI FRT
Sbjct: 85 EPSPGQYYFEGRYDLVRFIKLAQQAGLYVHLRIGLYVCAEWNFGGFPVWLKYVPGIAFRT 144
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M++F KI++LMK E LF SQGGPII++Q+ENEYG VEW G G+ Y KWA
Sbjct: 145 DNGPFKAAMQKFTEKIVNLMKSEKLFESQGGPIIMSQIENEYGPVEWEIGAPGKAYTKWA 204
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A+ AV L+T VPW+MC+QEDAPDPII+TCNGFYC+GFTPN KP MWTE ++GW+ FG
Sbjct: 205 AEMAVGLDTGVPWIMCKQEDAPDPIIDTCNGFYCEGFTPNKNYKPKMWTEAWTGWYTEFG 264
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ RPVEDLA++VARF + G+F NYYMY GGTNFGRTA G VATSYDYDAPIDEYG
Sbjct: 265 GPIHNRPVEDLAYSVARFIQNNGSFVNYYMYHGGTNFGRTAAGLFVATSYDYDAPIDEYG 324
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
R+PKWGHLR+LHKAIKLCE L+S+ PT G LE H++ KS + CAAFLANYD S
Sbjct: 325 LPREPKWGHLRDLHKAIKLCEPSLVSAYPTVTWPGKNLEVHVF-KSKSSCAAFLANYDPS 383
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A VTF Y LP WS+SILPDCKN VFNTA+V S+ + + ++
Sbjct: 384 SPAKVTFQNMQYDLPPWSISILPDCKNAVFNTARVSSKSS----------QMKMTPVSGG 433
Query: 421 AFSW---YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
AFSW EE V + + + L EQI+ T+D SDYLWY +++ P + G+
Sbjct: 434 AFSWQSYIEETVSADDSDTIAKNGLWEQISITRDGSDYLWYLTDVNIHPNEGFLKNGQSP 493
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L + S GHA VF+N +L YG+ + + ++L GIN + +LS VGL N G
Sbjct: 494 VLTVMSAGHALHVFINGQLAGTVYGSLENPKLTFSNNVKLRAGINKISLLSAAVGLPNVG 553
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
F+ G+ V L L G RDL+ +W Y+VG++GE + L +S ++S W QGS
Sbjct: 554 LHFETWNTGVLGPVTLKGLNEGTRDLTKQKWSYKVGLKGEDLSLHTLSGSSSVEWVQGSL 613
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
L + L WYK TF APEG PLAL++ +MGKGQ W+NG+SIGR+W Y A +G C
Sbjct: 614 LAQKQPLTWYKATFNAPEGNDPLALDMNTMGKGQIWINGESIGRHWPEYKA--SGNCGGC 671
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
Y G Y KC +CG+ +Q YH+PR+W+ P N LV+ EELGGDP+ IS + +T
Sbjct: 672 SYAGIYTEKKCLSNCGEASQRWYHVPRSWLKPSGNFLVVFEELGGDPTGISFVRRT 727
>gi|3869280|gb|AAC77377.1| beta-galactosidase precursor [Carica papaya]
Length = 721
Score = 801 bits (2070), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/716 (53%), Positives = 501/716 (69%), Gaps = 23/716 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ A V+YDH+A++I+G+RR+L SGSIHYPRSTP++WP+LI+ +KEGGL+VI+TYVFWN H
Sbjct: 19 VEATVSYDHKAIIINGRRRILISGSIHYPRSTPQMWPDLIQNAKEGGLDVIQTYVFWNGH 78
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G YYFE R+DLV+F+K V +AGL++HLRI PY C EWN+GGFPVWL ++PGIQFRT
Sbjct: 79 EPSPGNYYFEDRYDLVKFIKLVHQAGLYVHLRISPYICGEWNFGGFPVWLKYVPGIQFRT 138
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK +M++F KI+++MK E LF QGGPII++Q+ENEYG +EW G G+ Y KWA
Sbjct: 139 DNGPFKAQMQKFTEKIVNMMKAEKLFEPQGGPIIMSQIENEYGPIEWEIGAPGKAYTKWA 198
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L T VPW+MC+QEDAPDPII+TCNGFYC+ F PN+ KP M+TE ++GW+ FG
Sbjct: 199 AQMAVGLGTGVPWIMCKQEDAPDPIIDTCNGFYCENFMPNANYKPKMFTEAWTGWYTEFG 258
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
VP+RP ED+A++VARF + G+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 259 GPVPYRPAEDMAYSVARFIQNRGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 318
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
R+PKWGHLR+LHK IKLCE L+S DP LG+ EAH++ + CAAFLANYD
Sbjct: 319 LRREPKWGHLRDLHKTIKLCEPSLVSVDPKVTSLGSNQEAHVFW-TKTSCAAFLANYDLK 377
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
VTF Y LP WSVSILPDCK VVFNTAKV+S Q ++ +++ +S
Sbjct: 378 YSVRVTFQNLPYDLPPWSVSILPDCKTVVFNTAKVVS-----------QGSLAKMIAVNS 426
Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
AFSW EE + + F + L EQI+ T+D +DYLWY + + P + G++
Sbjct: 427 AFSWQSYNEETPSANYDAVFTKDGLWEQISVTRDATDYLWYMTDVTIGPDEAFLKNGQDP 486
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L + S GHA VFVN +L YG + + K++L G+N + +LS+ VGL N G
Sbjct: 487 ILTVMSAGHALHVFVNGQLSGTVYGQLENPKLAFSGKVKLRAGVNKVSLLSIAVGLPNVG 546
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
F+ AG+ V L + +G D+S +W Y++G++GE + L +S ++S W +GS
Sbjct: 547 LHFETWNAGVLGPVTLKGVNSGTWDMSKWKWSYKIGLKGEALSLHTVSGSSSVEWVEGSL 606
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
L + LIWYKTTF AP G PLAL++ SMGKGQ W+NGQSIGR+W Y A G C
Sbjct: 607 LAQRQPLIWYKTTFNAPVGNDPLALDMNSMGKGQIWINGQSIGRHWPGYKA--RGSCGAC 664
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
+Y G YD KC +CG+ +Q YH+PR+W++P NLLV+ EE GGDP+KISL+ +
Sbjct: 665 NYAGIYDEKKCHSNCGKASQRWYHVPRSWLNPTANLLVVFEEWGGDPTKISLVKRV 720
>gi|356556286|ref|XP_003546457.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 721
Score = 801 bits (2069), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/716 (54%), Positives = 499/716 (69%), Gaps = 24/716 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
++A+VTYDH+A+V+DGKRR+L SGSIHYPRSTP++WP+LI+K+K+GGL+VI+TYVFWN H
Sbjct: 21 VTASVTYDHKAIVVDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIQTYVFWNGH 80
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP GQYYFE RFDLV+FVK VQ+AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 81 EPSPGQYYFEDRFDLVKFVKLVQQAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIAFRT 140
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M++F AKI+ LMK+ LF SQGGPII++Q+ENEYG VEW G G+ Y KWA
Sbjct: 141 DNEPFKAAMQKFTAKIVSLMKENRLFQSQGGPIIMSQIENEYGPVEWEIGAPGKAYTKWA 200
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L+T VPWVMC+QEDAPDP+I+TCNG+YC+ F PN +KP MWTEN++GW+ FG
Sbjct: 201 AQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGYYCENFKPNKNTKPKMWTENWTGWYTDFG 260
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
AVP RP EDLAF+VARF + GG+F NYYMY GGTNFGRT+GG +ATSYDYDAP+DEYG
Sbjct: 261 GAVPRRPAEDLAFSVARFIQNGGSFVNYYMYHGGTNFGRTSGGLFIATSYDYDAPLDEYG 320
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+PK+ HLR LHKAIK CE L+++DP Q LG LEAH++ + CAAF+ANYD+
Sbjct: 321 LQNEPKYEHLRNLHKAIKQCEPALVATDPKVQSLGYNLEAHVF-STPGACAAFIANYDTK 379
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A TF Y LP WS+SILPDCK VV+NTAKV G+ + VN S
Sbjct: 380 SYAKATFGNGQYDLPPWSISILPDCKTVVYNTAKV------GNSWLKKMTPVN------S 427
Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
AF+W EE S S L EQ+N T+D+SDYLWY +++ + G+
Sbjct: 428 AFAWQSYNEEPASSSQADSIAAYALWEQVNVTRDSSDYLWYMTDVYINANEGFLKNGQSP 487
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L S GH VF+N +L +G + ++L G N L +LS+ VGL N G
Sbjct: 488 VLTAMSAGHVLHVFINDQLAGTVWGGLANPKLTFSDNVKLRVGNNKLSLLSVAVGLPNVG 547
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
F+ AG+ V L L G RDLSS +W Y+VG++GE + L S ++S W +GS
Sbjct: 548 VHFETWNAGVLGPVTLKGLNEGTRDLSSQKWSYKVGLKGESLSLHTESGSSSVEWIRGSL 607
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
+ + L WYKTTF AP G PLAL+L SMGKG+ WVNG+SIGR+W Y+A G C
Sbjct: 608 VAKKQPLTWYKTTFSAPAGNDPLALDLGSMGKGEVWVNGRSIGRHWPGYIA--HGSCNAC 665
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
+Y G Y +KC+ +CGQP+Q YH+PR+W+ G N LV+ EE GGDP+ I+L+ +T
Sbjct: 666 NYAGFYTDTKCRTNCGQPSQRWYHVPRSWLSSGGNSLVVFEEWGGDPNGIALVKRT 721
>gi|357139090|ref|XP_003571118.1| PREDICTED: beta-galactosidase 4-like [Brachypodium distachyon]
Length = 787
Score = 801 bits (2069), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/714 (55%), Positives = 499/714 (69%), Gaps = 26/714 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+A V+YDHR+LVI+G+RR+L SGSIHYPRSTPE+WP LI+K+K+GGL+V++TYVFWN HE
Sbjct: 91 NAAVSYDHRSLVINGRRRILISGSIHYPRSTPEMWPGLIQKAKDGGLDVVQTYVFWNGHE 150
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P++GQYYF R+DL+RFVK V++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 151 PVKGQYYFSDRYDLIRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTD 210
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK EM+RF+ KI+ +MK E LF QGGPII++QVENE+G +E A GVG + Y WAA
Sbjct: 211 NGPFKAEMQRFVEKIVSMMKSERLFEWQGGPIIMSQVENEFGPMESAGGVGAKPYANWAA 270
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV NT VPWVMC+QEDAPDP+INTCNGFYCD FTPN +KP MWTE ++GWF SFG
Sbjct: 271 KMAVATNTGVPWVMCKQEDAPDPVINTCNGFYCDYFTPNKKNKPAMWTEAWTGWFTSFGG 330
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
AVP RPVED+AFAVARF + GG+F NYYMY GGTNFGRTAGGP VATSYDYDAPIDE+G
Sbjct: 331 AVPHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVATSYDYDAPIDEFGL 390
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPKWGHLR+LHKAIK E L+S DPT Q LG +A+++ + CAAFL+NY +S
Sbjct: 391 LRQPKWGHLRDLHKAIKQAEPTLVSGDPTIQSLGNYEKAYVFKSKNGACAAFLSNYHMNS 450
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
V FNG Y LPAWS+SILPDCK VVFNTA V ++ + +
Sbjct: 451 AVKVRFNGRHYDLPAWSISILPDCKTVVFNTATV------------KEPTLLPKMHPVVR 498
Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ----GKEVFLN 475
F+W Y E + +F + L EQ++ T D SDYLWYT +++ PG+ G+ L
Sbjct: 499 FTWQSYSEDTNSLDDSAFTKDGLVEQLSMTWDKSDYLWYTTFVNIGPGELSKNGQWPQLT 558
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ S GH+ VFVN K YG + + +++ +G N + ILS VGL N G F
Sbjct: 559 VYSAGHSMQVFVNGKSYGSVYGGFENPKLTYDGHVKMWQGSNKISILSSAVGLPNVGDHF 618
Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFW-KQGSTLP 593
+ G+ V L L GKRDLS +W YQVG++GE +G+ +S +++ W GS P
Sbjct: 619 ERWNVGVLGPVTLSGLSEGKRDLSHQKWTYQVGLKGESLGIHTVSGSSAVEWGGPGSKQP 678
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
L W+K F AP G P+AL++ SMGKGQ WVNG +GRYWS Y APS GC C Y
Sbjct: 679 ----LTWHKALFNAPSGSDPVALDMGSMGKGQMWVNGHHVGRYWS-YKAPSRGC-GGCSY 732
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
G+Y KC+ CG+ +Q YH+PR+W+ PG NLLV+ EE GGD + ++L T+T
Sbjct: 733 AGTYREDKCRSSCGELSQRWYHVPRSWLKPGGNLLVVLEEYGGDVAGVTLATRT 786
>gi|7682677|gb|AAF67341.1| beta galactosidase [Vigna radiata]
Length = 721
Score = 800 bits (2066), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/716 (54%), Positives = 496/716 (69%), Gaps = 24/716 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
++A+VTYDH+A+VIDGKRR+L SGSIHYPRSTP++WP+LI+K+K+GGL+VI+TYVFWN H
Sbjct: 21 VTASVTYDHKAIVIDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIQTYVFWNGH 80
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G+YYFE R+DLVRFVK Q+AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 81 EPSPGKYYFEDRYDLVRFVKLAQQAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIAFRT 140
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M++F AKI+ LMK+E LF SQGGPIIL+Q+ENEYG VEW G G+ Y KWA
Sbjct: 141 DNEPFKAAMQKFTAKIVSLMKEERLFQSQGGPIILSQIENEYGPVEWEIGAPGKSYTKWA 200
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L+T VPWVMC+QEDAPDP+I+TCNGFYC+ F PN +KP MWTEN++GW+ FG
Sbjct: 201 AQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGFYCENFKPNKNTKPKMWTENWTGWYTDFG 260
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
A P RP EDLAF+VARF + GG+F NYYMY GGTNFGRT+GG +ATSYDYDAP+DEYG
Sbjct: 261 GASPIRPAEDLAFSVARFIQNGGSFVNYYMYHGGTNFGRTSGGLFIATSYDYDAPLDEYG 320
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+PKWGHLR LHKAIK E L+S+DP LG LEAH++ + CAAF+ANYD+
Sbjct: 321 LQNEPKWGHLRALHKAIKQSEPALVSTDPKVTSLGYNLEAHVF-STPGACAAFIANYDTK 379
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A TF Y LP WS+SILPDCK VV+NTA+V NG V ++ +S
Sbjct: 380 SSAKATFGSGQYDLPPWSISILPDCKTVVYNTARV----GNG--------WVKKMTPVNS 427
Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
F+W EE S + S L EQ+N T+D+SDYLWY +++ + G+
Sbjct: 428 GFAWQSYNEEPASSSQDDSIAAEALWEQVNVTRDSSDYLWYMTDVYINGNEGFLKNGRSP 487
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L + S GH VF+N +L YG + + L G N L +LS+ VGL N G
Sbjct: 488 VLTVMSAGHLLHVFINGQLSGTVYGGLGNPKLTFSDNVNLRVGNNKLSLLSVAVGLPNVG 547
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
F+ AG+ V L L G RDLS +W Y+VG++GE + L S ++S W QGS
Sbjct: 548 VHFETWNAGVLGPVTLKGLNEGTRDLSRQKWSYKVGLKGEALNLHTESGSSSVEWIQGSL 607
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
+ + L WYK TF AP G PLAL+L SMGKG+ WVNG+SIGR+W Y+A G C
Sbjct: 608 VAKKQPLTWYKATFSAPAGNDPLALDLGSMGKGEVWVNGRSIGRHWPGYIA--HGSCNAC 665
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
+Y G Y KC+ +CG+P+Q YH+PR+W++ G N LV+ EE GGDP+ I+L+ +T
Sbjct: 666 NYAGYYTDQKCRTNCGKPSQRWYHVPRSWLNSGGNSLVVFEEWGGDPNGIALVKRT 721
>gi|255550373|ref|XP_002516237.1| beta-galactosidase, putative [Ricinus communis]
gi|223544723|gb|EEF46239.1| beta-galactosidase, putative [Ricinus communis]
Length = 825
Score = 799 bits (2063), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/835 (48%), Positives = 535/835 (64%), Gaps = 48/835 (5%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+A +++D RA+ IDGKRRVL SGSIHYPRSTP++WP+LI+KSKEGGL+ IETYVFWN HE
Sbjct: 22 AAVISHDGRAITIDGKRRVLLSGSIHYPRSTPQMWPDLIKKSKEGGLDAIETYVFWNVHE 81
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P R QY F G DLVRF+K VQ+ GL+ LRIGPY CAEWNYGGFPVWLH +PGI+ RT
Sbjct: 82 PSRRQYDFGGNLDLVRFIKAVQDEGLYAVLRIGPYVCAEWNYGGFPVWLHNMPGIELRTA 141
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N+ F EM+ F + I+D+MKQE LFASQGGPII+AQVENEYGNV +YG G+ Y+ W A
Sbjct: 142 NSIFMNEMQNFTSLIVDMMKQEQLFASQGGPIIIAQVENEYGNVMSSYGAAGKAYIDWCA 201
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
+ A +LN VPW+MCQQ DAPDP+INTCNG+YCD FTP++P+ P MWTEN++GWF S+G
Sbjct: 202 NMAESLNIGVPWIMCQQSDAPDPMINTCNGWYCDQFTPSNPNSPKMWTENWTGWFKSWGG 261
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
P R ED+AFAVARFF+TGGTFQNYYMY GGTNFGRTAGGP + TSYDYDAP+DE+G
Sbjct: 262 KDPHRTAEDVAFAVARFFQTGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEFGN 321
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+ QPKWGHL++LH + EE L S + + A IY + + + FL+N + +S
Sbjct: 322 LNQPKWGHLKQLHDVLHSMEEILTSGTVSSVDYDNSVTATIY-ATDKESSCFLSNANETS 380
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
DA + F G Y +PAWSVSILPDC NV +NTAKV +Q + ++ N E S
Sbjct: 381 DATIEFKGTTYTIPAWSVSILPDCANVGYNTAKVKTQTS----VMVKRDNKAEDEPTSLN 436
Query: 422 FSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ---GKEVFLN 475
+SW E V + G + +Q D SDYLWY S+ + K++ +
Sbjct: 437 WSWRPENVDKTVLLGQGHIHAKQIVDQKAVANDASDYLWYMTSVDLKKDDLIWSKDMSIR 496
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
I GH +VN + + + + +N++ K ++L G N + +LS VGL NYGA +
Sbjct: 497 INGSGHILHAYVNGEYLGSQWSEYSVSNYVFEKSVKLKHGRNLITLLSATVGLANYGANY 556
Query: 536 DVAGAGLFSVILIDLKNGK----RDLSSGEWIYQVGVEGEYIGL-DKISLANS---SFWK 587
D+ AG+ + + + G +DLS+ W Y+VG+ +GL DK+ L++S S W+
Sbjct: 557 DLIQAGILGPVELVGRKGDETIIKDLSNNRWSYKVGL----LGLEDKLYLSDSKHASKWQ 612
Query: 588 QGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGC 647
+ LP NK L WYKTTF AP G P+ L+L +GKG AW+NG SIGRYW ++LA GC
Sbjct: 613 E-QELPTNKMLTWYKTTFKAPLGTDPVVLDLQGLGKGMAWINGNSIGRYWPSFLAEDDGC 671
Query: 648 -TKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
T CDYRG YD +KC +CG+P Q YH+PR+++ EN LV+ EE GG+PS+++ T
Sbjct: 672 STDLCDYRGPYDNNKCVSNCGKPTQRWYHVPRSFLQDNENTLVLFEEFGGNPSQVNFQTV 731
Query: 707 TGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGS 766
C E + V ++C G I+A+ FAS+G P+G CGS
Sbjct: 732 VTGVACVSGDEGEV------------------VEISC-NGQSISAVQFASFGDPQGTCGS 772
Query: 767 FRPGACH--MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G+C D L IVQKACVG CS+ VS G + +C + LAVE C
Sbjct: 773 SVKGSCEGTEDALLIVQKACVGNESCSLEVSHKLFGST--SCDNGVNRLAVEVLC 825
>gi|212274513|ref|NP_001130532.1| uncharacterized protein LOC100191631 precursor [Zea mays]
gi|194689400|gb|ACF78784.1| unknown [Zea mays]
gi|224030521|gb|ACN34336.1| unknown [Zea mays]
gi|413922054|gb|AFW61986.1| beta-galactosidase isoform 1 [Zea mays]
gi|413922055|gb|AFW61987.1| beta-galactosidase isoform 2 [Zea mays]
gi|413954366|gb|AFW87015.1| beta-galactosidase isoform 1 [Zea mays]
gi|413954367|gb|AFW87016.1| beta-galactosidase isoform 2 [Zea mays]
Length = 722
Score = 796 bits (2057), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/714 (54%), Positives = 493/714 (69%), Gaps = 25/714 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+A V+YDHRA+VI+G+RR+L SGSIHYPRSTPE+WP L++K+K+GGL+V++TYVFWN HE
Sbjct: 25 NAAVSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTYVFWNGHE 84
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P+RGQYYF R+DLVRFVK ++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 85 PVRGQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTD 144
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK M+ F+ KI+ +MK E LF QGGPIILAQVENEYG +E G G + Y WAA
Sbjct: 145 NGPFKAAMQAFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGAGAKPYANWAA 204
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV VPWVMC+Q+DAPDP+INTCNGFYCD F+PNS SKP MWTE ++GWF +FG
Sbjct: 205 KMAVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNSNSKPTMWTEAWTGWFTAFGG 264
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
AVP RPVED+AFAVARF + GG+F NYYMY GGTNF RT+GGP +ATSYDYDAPIDEYG
Sbjct: 265 AVPHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGL 324
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPKWGHLR+LHKAIK E L+S DPT Q LG +A+++ S CAAFL+NY +S+
Sbjct: 325 LRQPKWGHLRDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSSGGACAAFLSNYHTSA 384
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
A V FNG Y LPAWS+S+LPDCK VFNTA V P A + + +
Sbjct: 385 AARVVFNGRRYDLPAWSISVLPDCKAAVFNTATV-------SEPSAPAR-----MSPAGG 432
Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
FSW Y E R+F + L EQ++ T D SDYLWYT +++ + G+ L
Sbjct: 433 FSWQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQL 492
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
I S GH+ VFVN + YG +D + +++ +G N + ILS VGL N G
Sbjct: 493 TIYSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGTH 552
Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
++ G+ V L L GKRDLS +W YQ+G+ GE +G+ ++ ++S W +
Sbjct: 553 YETWNVGVLGPVTLSGLNEGKRDLSDQKWTYQIGLHGESLGVQSVAGSSSVEWGSAAG-- 610
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ L W+K F AP G P+AL++ SMGKGQAWVNG+ IGRYWS Y A S+GC C Y
Sbjct: 611 -KQPLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYWS-YKASSSGC-GGCSY 667
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
G+Y +KCQ CG +Q YH+PR+W++P NLLV+ EE GGD S + L+T+T
Sbjct: 668 AGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVMLEEFGGDLSGVKLVTRT 721
>gi|54111247|dbj|BAC10578.2| beta-galactosidase [Capsicum annuum]
Length = 724
Score = 796 bits (2057), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/716 (53%), Positives = 496/716 (69%), Gaps = 22/716 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ ANV+YD RA+VI+GKR++L SGSIHYPRSTP++WP+LI+K+K+GGL+VIETYVFWN H
Sbjct: 21 VKANVSYDDRAIVINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGH 80
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G+Y FEGR+DLV+F+K VQ AGL+++LRIGPY CAEWN+GG PVWL ++ G++FRT
Sbjct: 81 EPSPGKYNFEGRYDLVKFIKLVQGAGLYVNLRIGPYICAEWNFGGLPVWLKYVSGMEFRT 140
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M+ F+ KI+ +MK E LF QGGPII+AQ+ENEYG VEW G G+ Y KWA
Sbjct: 141 DNQPFKVAMQGFVQKIVSMMKSEKLFEPQGGPIIMAQIENEYGPVEWEIGAPGKAYTKWA 200
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L T VPW+MC+QEDAPDP+I+TCNGFYC+GF PN P KP MWTE ++GWF FG
Sbjct: 201 AQMAVGLKTDVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKPYKPKMWTEVWTGWFTKFG 260
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+P RP ED+AF+VARF + G++ NYYMY GGTNFGRT+ G +ATSYDYDAPIDEYG
Sbjct: 261 GPIPQRPAEDIAFSVARFVQNNGSYFNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYG 320
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+ +PK+GHLRELHKAIK CE L+SS PT LG+ EAH+Y S CAAFL+NYD+
Sbjct: 321 LLNEPKYGHLRELHKAIKQCEPALVSSYPTVTSLGSNQEAHVYRSKSGACAAFLSNYDAK 380
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
V+F Y LP WS+SILPDCK VV+NTAKV SQ ++ ++ A
Sbjct: 381 YSVRVSFQNLPYDLPPWSISILPDCKTVVYNTAKVSSQGSS-----------IKMTPAGG 429
Query: 421 AFSW--YEEKVGISGNRSFVRPD-LAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
SW Y E + + +R + L EQ N T+D+SDYLWY I++ + GK+
Sbjct: 430 GLSWQSYNEDTPTADDSDTLRANGLWEQRNVTRDSSDYLWYMTDINIASNEGFLKSGKDP 489
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
+L + S GH VFVN KL YG D + ++LN GIN + +LS+ VGL N G
Sbjct: 490 YLTVMSAGHVLHVFVNGKLAGTVYGALDNPKLTYSGNVKLNAGINKISLLSVSVGLPNVG 549
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
+D AG+ V L L G RDL+ +W Y+VG++GE + L +S ++S W QGS
Sbjct: 550 VHYDTWNAGVLGPVTLSGLNEGSRDLAKQKWSYKVGLKGESLSLHTLSGSSSVEWVQGSL 609
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
+ + L WYK TF AP G PLAL++ASMGKGQ W+NG+ +GR+W Y A G KC
Sbjct: 610 VARTQPLTWYKATFSAPGGNEPLALDMASMGKGQIWINGEGVGRHWPGYAA--QGDCSKC 667
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
Y G+++ KCQ +CGQP+Q YH+PR+W+ NLLV+ EE GGDP+ ISL+ ++
Sbjct: 668 SYAGTFNEKKCQTNCGQPSQRWYHVPRSWLKTSGNLLVVFEEWGGDPTGISLVRRS 723
>gi|356564721|ref|XP_003550597.1| PREDICTED: beta-galactosidase 7-like [Glycine max]
Length = 831
Score = 795 bits (2053), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 411/833 (49%), Positives = 537/833 (64%), Gaps = 41/833 (4%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
L NV++D RA+ IDGKRRVL SGSIHYPRSTPE+WPELI+K+KEGGL+ IETYVFWN H
Sbjct: 26 LHTNVSHDGRAIKIDGKRRVLISGSIHYPRSTPEMWPELIQKAKEGGLDAIETYVFWNAH 85
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP R Y F G D++RF+KT+QE+GL+ LRIGPY CAEWNYGG PVW+H +P ++ RT
Sbjct: 86 EPSRRVYDFSGNNDIIRFLKTIQESGLYGVLRIGPYVCAEWNYGGIPVWVHNLPDVEIRT 145
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N+ F EM+ F I+D++K+E LFASQGGPIIL Q+ENEYGNV YG G+ Y+ W
Sbjct: 146 ANSVFMNEMQNFTTLIVDMLKKEKLFASQGGPIILTQIENEYGNVISQYGDAGKAYMNWC 205
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A+ A +L VPW+MCQ+ DAP P+INTCNG+YCD F PNS + P MWTEN+ GWF ++G
Sbjct: 206 ANMAESLKVGVPWIMCQESDAPQPMINTCNGWYCDNFEPNSFNSPKMWTENWIGWFKNWG 265
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
P R ED+AFAVARFF+TGGTFQNYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG
Sbjct: 266 GRDPHRTAEDVAFAVARFFQTGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYG 325
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
I QPKWGHL+ELH A+K EE L S + + LG ++ IY ++ + FL+N +++
Sbjct: 326 NIAQPKWGHLKELHSALKAMEEALTSGNVSETDLGNSVKVTIY-ATNGSSSCFLSNTNTT 384
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
+DA +TF GN Y +PAWSVSILPDC++ +NTAKV Q + ++ + E A
Sbjct: 385 ADATLTFRGNNYTVPAWSVSILPDCQHEEYNTAKVKEQTS----VMTKENSKAEKEAAIL 440
Query: 421 AFSWYEEKV--GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM---PGQGKEVFLN 475
+ W E + + G + L +Q + D SDYLWY +HV P + + L
Sbjct: 441 KWVWRSENIDKALHGKSNVSAHRLLDQKDAANDASDYLWYMTKLHVKHDDPVWSENMTLR 500
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
I GH FVN + + + + N KI+L G NT+ +LS+ VGLQNYGA+F
Sbjct: 501 INGSGHVIHAFVNGEYIDSHWATYGIHNDKFEPKIKLKHGTNTISLLSVTVGLQNYGAFF 560
Query: 536 DVAGAGLFSVI-LIDLKNGK---RDLSSGEWIYQVGVEG--EYIGLDKISLANSSFWKQG 589
D AGL I L+ +K + ++LSS +W Y++G+ G + D A S W +
Sbjct: 561 DTWHAGLVGPIELVSVKGEETIIKNLSSHKWSYKIGLHGWDHKLFSDDSPFAAQSKW-ES 619
Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
LP N+ L WYKTTF AP G P+ ++L MGKG AWVNG++IGR W +Y A GC+
Sbjct: 620 EKLPTNRMLTWYKTTFKAPLGTDPVVVDLQGMGKGYAWVNGKNIGRIWPSYNAEEDGCSD 679
Query: 650 K-CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTG 708
+ CDYRG Y SKC +CG+P Q YH+PR+++ G N LV+ ELGG+PS ++ T
Sbjct: 680 EPCDYRGEYSDSKCVTNCGKPTQRWYHVPRSYLKDGANTLVLFAELGGNPSLVNFQTVVV 739
Query: 709 QHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFR 768
++C+ E + + L+C+ G I+AI FAS+G P+G CG+F
Sbjct: 740 GNVCANAYE------------------NKTLELSCQ-GRKISAIKFASFGDPKGVCGAFT 780
Query: 769 PGACH--MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G+C + LPIVQKACVG+ CSI +S G A AC L K LAVEA C
Sbjct: 781 NGSCESKSNALPIVQKACVGKEACSIDLSEKTFG--ATACGNLAKRLAVEAVC 831
>gi|13936236|gb|AAK40304.1| beta-galactosidase [Capsicum annuum]
Length = 724
Score = 795 bits (2053), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/716 (53%), Positives = 495/716 (69%), Gaps = 22/716 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ ANV+YD RA+VI+GKR++L SGSIHYPRSTP++WP+LI K+K+GGL+VIETYVFWN H
Sbjct: 21 VKANVSYDDRAIVINGKRKILISGSIHYPRSTPQMWPDLIEKAKDGGLDVIETYVFWNGH 80
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G+Y FEGR+DLV+F+K VQ AGL+++LRIGPY CAEWN+GG PVWL ++ G++FRT
Sbjct: 81 EPSPGKYNFEGRYDLVKFIKLVQGAGLYVNLRIGPYICAEWNFGGLPVWLKYVSGMEFRT 140
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M+ F+ KI+ +MK E LF QGGPII+AQ+ENEYG VEW G G+ Y KWA
Sbjct: 141 DNQPFKVAMQGFVQKIVSMMKSEKLFEPQGGPIIMAQIENEYGPVEWEIGAPGKAYTKWA 200
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L T VPW+MC+QEDAPDP+I+TCNGFYC+GF PN P KP MWTE ++GWF FG
Sbjct: 201 AQMAVGLKTDVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKPYKPKMWTEVWTGWFTKFG 260
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+P RP ED+AF+VARF + G++ NYYMY GGTNFGRT+ G +ATSYDYDAPIDEYG
Sbjct: 261 GPIPQRPAEDIAFSVARFVQNNGSYFNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYG 320
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+ +PK+GHLRELHKAIK CE L+SS PT LG+ EAH+Y S CAAFL+NYD+
Sbjct: 321 LLNEPKYGHLRELHKAIKQCEPALVSSYPTVTSLGSNQEAHVYRSKSGACAAFLSNYDAK 380
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
V+F Y LP WS+SILPDCK VV+NTAKV SQ ++ ++ A
Sbjct: 381 YSVRVSFQNLPYDLPPWSISILPDCKTVVYNTAKVSSQGSS-----------IKMTPAGG 429
Query: 421 AFSW--YEEKVGISGNRSFVRPD-LAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
SW Y E + + +R + L EQ N T+D+SDYLWY +++ + GK+
Sbjct: 430 GLSWQSYNEDTPTADDSDTLRANGLWEQRNVTRDSSDYLWYMTDVNIASNEGFLKSGKDP 489
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
+L + S GH VFVN KL YG D + ++LN GIN + +LS+ VGL N G
Sbjct: 490 YLTVMSAGHVLHVFVNGKLAGTVYGALDNPKLTYSGNVKLNAGINKISLLSVSVGLPNVG 549
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
+D AG+ V L L G RDL+ +W Y+VG++GE + L +S ++S W QGS
Sbjct: 550 VHYDTWNAGVLGPVTLSGLNEGSRDLAKQKWSYKVGLKGESLSLHTLSGSSSVEWVQGSL 609
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
+ + L WYK TF AP G PLAL++ASMGKGQ W+NG+ +GR+W Y A G KC
Sbjct: 610 VARTQPLTWYKATFSAPGGNEPLALDMASMGKGQIWINGEGVGRHWPGYAA--QGDCSKC 667
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
Y G+++ KCQ +CGQP+Q YH+PR+W+ NLLV+ EE GGDP+ ISL+ ++
Sbjct: 668 SYAGTFNEKKCQTNCGQPSQRWYHVPRSWLKTSGNLLVVFEEWGGDPTGISLVRRS 723
>gi|193850557|gb|ACF22882.1| beta-galactosidase [Glycine max]
Length = 721
Score = 794 bits (2050), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/716 (54%), Positives = 494/716 (68%), Gaps = 24/716 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
++A+VTYDH+A+V+DGKRR+L SGSIHYPRSTP++WP+LI+K+K+GGL+VI+TYVFWN H
Sbjct: 21 VTASVTYDHKAIVVDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIQTYVFWNGH 80
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP GQYYFE RFDLV+FVK Q+AGL++HLRIGPY CAEWN GGFPVWL ++PGI FRT
Sbjct: 81 EPSPGQYYFEDRFDLVKFVKLAQQAGLYVHLRIGPYICAEWNLGGFPVWLKYVPGIAFRT 140
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M++F AKI+ LMK+ LF SQGGPIIL+Q+ENEYG VEW G G+ Y KWA
Sbjct: 141 DNEPFKAAMQKFTAKIVSLMKENRLFQSQGGPIILSQIENEYGPVEWEIGAPGKAYTKWA 200
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L+T VPWVMC+QEDAPDP+I+TCNGFYC+ F PN +KP MWTEN++GW+ FG
Sbjct: 201 AQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGFYCENFKPNKNTKPKMWTENWTGWYTDFG 260
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
AVP RP EDLAF+VARF + GG+F NYYMY GGTNFGRT+GG +ATSYDYDAP+DEYG
Sbjct: 261 GAVPRRPAEDLAFSVARFIQNGGSFVNYYMYHGGTNFGRTSGGLFIATSYDYDAPLDEYG 320
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+PK+ HLR LHKAIK E L+++DP Q LG LEAH++ + CAAF+ANYD+
Sbjct: 321 LENEPKYEHLRALHKAIKQSEPALVATDPKVQSLGYNLEAHVF-SAPGACAAFIANYDTK 379
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A F Y LP WS+SILPDCK VV+NTAKV G + VN S
Sbjct: 380 SYAKAKFGNGQYDLPPWSISILPDCKTVVYNTAKV------GYGWLKKMTPVN------S 427
Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
AF+W EE S S L EQ+N T+D+SDYLWY ++V + G+
Sbjct: 428 AFAWQSYNEEPASSSQADSIAAYALWEQVNVTRDSSDYLWYMTDVNVNANEGFLKNGQSP 487
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L + S GH VF+N +L +G + ++L G N L +LS+ VGL N G
Sbjct: 488 LLTVMSAGHVLHVFINGQLAGTVWGGLGNPKLTFSDNVKLRAGNNKLSLLSVAVGLPNVG 547
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
F+ AG+ V L L G RDLS +W Y+VG++GE + L S ++S W QGS
Sbjct: 548 VHFETWNAGVLGPVTLKGLNEGTRDLSRQKWSYKVGLKGESLSLHTESGSSSVEWIQGSL 607
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
+ + L WYKTTF AP G PLAL+L SMGKG+ WVNG+SIGR+W Y+A G C
Sbjct: 608 VAKKQPLTWYKTTFSAPAGNDPLALDLGSMGKGEVWVNGRSIGRHWPGYIA--HGSCNAC 665
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
+Y G Y +KC+ +CGQP+Q YH+PR+W+ G N LV+ EE GGDP+ I+L+ +T
Sbjct: 666 NYAGYYTDTKCRTNCGQPSQRWYHVPRSWLSSGGNSLVVFEEWGGDPNGIALVKRT 721
>gi|18148449|dbj|BAB83260.1| beta-D-galactosidase [Persea americana]
Length = 766
Score = 793 bits (2047), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/735 (53%), Positives = 508/735 (69%), Gaps = 25/735 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ +VTYD +A+VI+G+RR+L SGSIHYPRSTPE+WP+LI+K+KEGGL+VI+TYVFW+ HE
Sbjct: 34 TCSVTYDRKAIVINGQRRILISGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWDGHE 93
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G+YYFEGR+DLV+F+K V++AGL+++LRIGPY CAEWN GGFPVWL +IPGI FRT
Sbjct: 94 PSPGKYYFEGRYDLVKFIKLVKQAGLYVNLRIGPYICAEWNLGGFPVWLKYIPGISFRTD 153
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK M F KI+++MK E+LF QGGPII++Q+ENEYG VEW G G++Y +WAA
Sbjct: 154 NEPFKRYMAGFTKKIVEMMKAESLFEPQGGPIIMSQIENEYGPVEWEIGAIGKVYTRWAA 213
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AVNLNT VPW+MC+Q++ PDPIINTCNGFYCD F PN KPIMWTE ++GWF +FG
Sbjct: 214 SMAVNLNTGVPWIMCKQDEVPDPIINTCNGFYCDWFKPNKDYKPIMWTELWTGWFTAFGG 273
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
VP+RPVED+A+AV +F + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 274 PVPYRPVEDVAYAVVKFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGL 333
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
R+PKWGHLR+LH+AIK+CE L+S+DPT K+G EAH++ S C+AFL N D ++
Sbjct: 334 KREPKWGHLRDLHRAIKMCEPALVSNDPTVTKIGDSQEAHVFKFESGACSAFLENKDETN 393
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS-S 420
VTF G Y LP WS+SILPDC NVV+NT +V Q ++ +L AS +
Sbjct: 394 FVKVTFQGMQYELPPWSISILPDCVNVVYNTGRV-----------GTQTSMMTMLSASNN 442
Query: 421 AFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVF 473
FSW Y E S L+EQI+ TKD++DYL YT + + + G+
Sbjct: 443 EFSWASYNEDTASYNEESMTIEGLSEQISITKDSTDYLRYTTDVTIGQNEGFLKNGEYPV 502
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L + S GHA VFVN +L YG+ + + K++L G N + +LS VGL N G
Sbjct: 503 LTVNSAGHALQVFVNGQLSGTAYGSVNDPRLTFSGKVKLWAGNNKISLLSSAVGLPNVGT 562
Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
F+ G+ V L L GKRDLS +W Y+VGV GE + L + ++S W GS+
Sbjct: 563 HFETWNYGVLGPVTLNGLNEGKRDLSLQKWSYKVGVIGEALQLHSPTGSSSVEW--GSST 620
Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
+ WYKTTF AP G PLAL++ +MGKGQ W+NGQSIGRYW AY A G C
Sbjct: 621 SKIQPFTWYKTTFNAPGGNDPLALDMNTMGKGQIWINGQSIGRYWPAYKA--NGKCSACH 678
Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHIC 712
Y G YD KC +CG+ +Q YHIPR+W++P NLLV+ EE GGDP+ I+L+ +T C
Sbjct: 679 YTGWYDEKKCGFNCGEASQRWYHIPRSWLNPTGNLLVVFEEWGGDPTGITLVRRTIGSAC 738
Query: 713 SFVSEADPPPVDSWK 727
++++E P V +WK
Sbjct: 739 AYINEWH-PTVKNWK 752
>gi|242093394|ref|XP_002437187.1| hypothetical protein SORBIDRAFT_10g022620 [Sorghum bicolor]
gi|241915410|gb|EER88554.1| hypothetical protein SORBIDRAFT_10g022620 [Sorghum bicolor]
Length = 725
Score = 791 bits (2044), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/714 (54%), Positives = 494/714 (69%), Gaps = 25/714 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+A V+YDHRA+VI+G+RR+L SGSIHYPRSTPE+WP+L++K+K+GGL+V++TYVFWN HE
Sbjct: 28 NAAVSYDHRAVVINGQRRILISGSIHYPRSTPEMWPDLLQKAKDGGLDVVQTYVFWNGHE 87
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P +GQYYF R+DLVRFVK ++AGLF+HLRIGPY CAEWN+GGFPVWL ++PG+ FRT
Sbjct: 88 PQQGQYYFGDRYDLVRFVKLAKQAGLFVHLRIGPYVCAEWNFGGFPVWLKYVPGVSFRTD 147
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK M+ F+ KI+ +MK E LF QGGPIILAQVENEYG +E G G + Y WAA
Sbjct: 148 NAPFKAAMQAFVEKIVSMMKAEGLFEWQGGPIILAQVENEYGPMESVMGGGAKPYANWAA 207
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV VPWVMC+Q+DAPDP+INTCNGFYCD F+PNS SKP MWTE ++GWF +FG
Sbjct: 208 KMAVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNSNSKPTMWTEAWTGWFTAFGG 267
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
AVP RPVED+AFAVARF + GG+F NYYMY GGTNF RT+GGP +ATSYDYDAPIDEYG
Sbjct: 268 AVPHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGL 327
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPKWGHLR+LHKAIK E L+S DPT Q +G +A++Y SS CAAFL+NY +++
Sbjct: 328 LRQPKWGHLRDLHKAIKQAEPALVSGDPTIQTIGNYEKAYVYKSSSGACAAFLSNYHTNA 387
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
A V FNG Y LPAWS+S+LPDC+ VFNTA V S P A + + +
Sbjct: 388 AARVVFNGRRYDLPAWSISVLPDCRTAVFNTATVSS-------PSAPAR-----MTPAGG 435
Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
FSW Y E +R+F + L EQ++ T D SDYLWYT +++ + G+ L
Sbjct: 436 FSWQSYSEATNSLDDRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQL 495
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
I S GHA VFVN + YG +D + +++ +G N + ILS VGL N G
Sbjct: 496 TIYSAGHALQVFVNGQSYGAAYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGTH 555
Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
++ G+ V L L GKRDLS+ +W YQ+G+ GE +G+ ++ ++S W +
Sbjct: 556 YEAWNVGVLGPVTLSGLNEGKRDLSNQKWTYQIGLHGESLGVHSVAGSSSVEWGSAAG-- 613
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ L W+K F AP G P+AL+++SMGKGQAWVNG IGRYWS Y A C C Y
Sbjct: 614 -KQPLTWHKAYFNAPSGNAPVALDMSSMGKGQAWVNGHHIGRYWS-YKATGGSC-GGCSY 670
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
G+Y +KCQ CG +Q YH+PR+W++P NLLV+ EE GGD S + L+T+T
Sbjct: 671 AGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVVLEEFGGDLSGVKLVTRT 724
>gi|68161828|emb|CAJ09953.1| beta-galactosidase [Mangifera indica]
Length = 827
Score = 791 bits (2042), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 404/840 (48%), Positives = 536/840 (63%), Gaps = 60/840 (7%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NV++D RA++IDG+RRVL SGSIHYPRSTPE+WP+LIRK+KEGGL+ IETYVFWN HEP
Sbjct: 24 NVSHDGRAIIIDGQRRVLLSGSIHYPRSTPEMWPDLIRKAKEGGLDAIETYVFWNAHEPA 83
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQ-FRTTN 122
R QY F G DL+RF+KT+Q+ GL+ LRIGPY CAEWNYGGFPVWLH +PG+Q FRT N
Sbjct: 84 RRQYDFSGHLDLIRFIKTIQDEGLYAVLRIGPYVCAEWNYGGFPVWLHNMPGVQEFRTVN 143
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
F EM+ F I+D++KQE LFASQGGPII+AQ+ENEYGN+ YG G++Y+ W A
Sbjct: 144 EVFMNEMQNFTTLIVDMVKQEKLFASQGGPIIIAQIENEYGNMISNYGDAGKVYIDWCAK 203
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
A +L+ VPW+MCQ+ DAP P+INTCNG+YCD FTPN P+ P MWTEN++GWF S+G
Sbjct: 204 MAESLDIGVPWIMCQESDAPQPMINTCNGWYCDSFTPNDPNSPKMWTENWTGWFKSWGGK 263
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
P R EDLAF+VARFF+TGGTFQNYYMY GGTNFGRT+GGP + TSYDYDAP+DE+G +
Sbjct: 264 DPHRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPLDEFGNL 323
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
QPKWGHL+ELH +K E+ L + + G + A +Y + + F N +++ D
Sbjct: 324 NQPKWGHLKELHTVLKAMEKTLTHGNVSTTDFGNSVTATVY-ATEEGSSCFFGNANTTGD 382
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A +TF G+ Y +PAWSVSILPDCK +NTAKV +Q + ++ N E +S +
Sbjct: 383 ATITFQGSDYVVPAWSVSILPDCKTEAYNTAKVNTQTS----VIVKKPNQAENEPSSLKW 438
Query: 423 SWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ---GKEVFLNI 476
W E + + G SF L +Q D SDYLWY S+ + P + L +
Sbjct: 439 VWRPEAIDEPVVQGKGSFSASFLIDQ-KVINDASDYLWYMTSVDLKPDDIIWSDNMTLRV 497
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
+ G FVN + V + + + ++++LN G N + +LS+ VGLQNYG FD
Sbjct: 498 NTTGIVLHAFVNGEHVGSQWTKYGVFKDVFQQQVKLNPGKNQISLLSVTVGLQNYGPMFD 557
Query: 537 VAGAGLFS-VILIDLKNGK---RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST- 591
+ AG+ V LI K + +DLS +W Y+VG+ G L ++ F+ + ST
Sbjct: 558 MVQAGITGPVELIGQKGDETVIKDLSCHKWTYEVGLTG---------LEDNKFYSKASTN 608
Query: 592 ---------LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLA 642
+P N + WYKTTF AP G P+ L+L MGKG AWVNG ++GRYW +YLA
Sbjct: 609 ETCGWSAENVPSNSKMTWYKTTFKAPLGNDPVVLDLQGMGKGFAWVNGYNLGRYWPSYLA 668
Query: 643 PSTGCTKK-CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKI 701
+ GC+ CDYRG YD +KC +CGQP+Q YH+PR+++ GEN LV+ EE GG+P ++
Sbjct: 669 EADGCSSDPCDYRGQYDNNKCVTNCGQPSQRWYHVPRSFLQDGENTLVLFEEFGGNPWQV 728
Query: 702 SLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPE 761
+ T +C G + L+C G I+AI FAS+G P+
Sbjct: 729 NFQTLVVGSVC------------------GNAHEKKTLELSC-NGRPISAIKFASFGDPQ 769
Query: 762 GNCGSFRPGACH--MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G CGSF+ G C D+LP++Q+ CVG+ CSI +S LG + C ++K LAVEA C
Sbjct: 770 GTCGSFQAGTCQTEQDILPVLQQECVGKETCSIDISEDKLGKT--NCGSVVKKLAVEAVC 827
>gi|3860420|emb|CAA09467.1| exo galactanase [Lupinus angustifolius]
Length = 730
Score = 791 bits (2042), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/714 (54%), Positives = 493/714 (69%), Gaps = 21/714 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
++A+VTYDH+A++I+G+RR+L SGSIHYPRSTP++WP+LI+K+K+GGL+VIETYVFWN H
Sbjct: 31 VTASVTYDHKAIMINGQRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGH 90
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G+YYFE RFDLV F+K VQ+AGLF+HLRIGP+ CAEWN+GGFPVWL ++PGI FRT
Sbjct: 91 EPSPGKYYFEDRFDLVGFIKLVQQAGLFVHLRIGPFICAEWNFGGFPVWLKYVPGIAFRT 150
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFKE M++F KI+++MK E LF SQGGPIIL+Q+ENEYG VEW G G+ Y KWA
Sbjct: 151 DNEPFKEAMQKFTEKIVNIMKAEKLFQSQGGPIILSQIENEYGPVEWEIGAPGKAYTKWA 210
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L+T VPWVMC+QEDAPDPII+TCNGFYC+ FTPN KP +WTEN++GW+ +FG
Sbjct: 211 AQMAVGLDTGVPWVMCKQEDAPDPIIDTCNGFYCENFTPNKNYKPKLWTENWTGWYTAFG 270
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
A P+RP ED+AF+VARF + G+ NYYMY GGTNFGRT+ G VATSYDYDAPIDEYG
Sbjct: 271 GATPYRPAEDIAFSVARFIQNRGSLFNYYMYHGGTNFGRTSNGLFVATSYDYDAPIDEYG 330
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+ +PKWGHLRELH+AIK CE L+S DPT G LE H+Y K+ + CAAFLANY++
Sbjct: 331 LLNEPKWGHLRELHRAIKQCESALVSVDPTVSWPGKNLEVHLY-KTESACAAFLANYNTD 389
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
V F Y LP WS+SILPDCK VFNTAKV S R + ++ +S
Sbjct: 390 YSTQVKFGNGQYDLPPWSISILPDCKTEVFNTAKVNSPRLH-----------RKMTPVNS 438
Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPG---QGKEVFL 474
AF+W EE S N L EQ+ T+D+SDYLWY +++ P GK L
Sbjct: 439 AFAWQSYNEEPASSSENDPVTGYALWEQVGVTRDSSDYLWYLTDVNIGPNDIKDGKWPVL 498
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
S GH VF+N + YG+ D ++ + L G N + +LS+ VGL N G
Sbjct: 499 TAMSAGHVLNVFINGQYAGTAYGSLDDPRLTFSQSVNLRVGNNKISLLSVSVGLANVGTH 558
Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
F+ G+ V L L +G DLS +W Y++G++GE + L + +NS W QGS +
Sbjct: 559 FETWNTGVLGPVTLTGLSSGTWDLSKQKWSYKIGLKGESLSLHTEAGSNSVEWVQGSLVA 618
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ L WYKTTF AP G PLAL+L SMGKG+ WVNGQSIGR+W A G C+Y
Sbjct: 619 KKQPLAWYKTTFSAPAGNDPLALDLGSMGKGEVWVNGQSIGRHWPGNKA--RGNCGNCNY 676
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
G+Y +KC +CGQP+Q YH+PR+W+ G N LV+ EE GGDP+ I+L+ +T
Sbjct: 677 AGTYTDTKCLANCGQPSQRWYHVPRSWLRSGGNYLVVLEEWGGDPNGIALVERT 730
>gi|357124047|ref|XP_003563718.1| PREDICTED: beta-galactosidase 9-like isoform 1 [Brachypodium
distachyon]
Length = 719
Score = 790 bits (2040), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/714 (53%), Positives = 494/714 (69%), Gaps = 26/714 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+A V+YDH+A+VI+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HE
Sbjct: 23 NAAVSYDHKAIVINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHE 82
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P++GQYYF R+DLVRFVK ++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 83 PVQGQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTD 142
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK M+ F+ KI+ +MK E LF QGGPIILAQVENEYG +E G G + Y WAA
Sbjct: 143 NGPFKAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGGGAKPYANWAA 202
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV VPWVMC+Q+DAPDP+INTCNGFYCD FTPNS KP MWTE +SGWF +FG
Sbjct: 203 KMAVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNSNGKPNMWTEAWSGWFTAFGG 262
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
AVP RPVEDLAFAVARF + GG+F NYYMY GGTNF RTAGGP +ATSYDYDAPIDEYG
Sbjct: 263 AVPHRPVEDLAFAVARFVQKGGSFVNYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGL 322
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPKWGHLR+LHKAIK E ++S DPT Q +G +A+++ S+ CAAFL+NY +SS
Sbjct: 323 LRQPKWGHLRDLHKAIKQAEPAMVSGDPTIQSIGNYEKAYVFKSSTGACAAFLSNYHTSS 382
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
A V +NG Y LPAWS+SILPDCK V+NTA V P A K + +
Sbjct: 383 PAKVVYNGRRYELPAWSISILPDCKTAVYNTATV-------KEPSAPAK-----MNPAGG 430
Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
FSW Y E + +F + L EQ++ T D SD+LWYT +++ + G+ L
Sbjct: 431 FSWQSYSEDTNSLDDSAFTKDGLVEQLSMTWDKSDFLWYTTYVNIDSSEQFLKSGQWPQL 490
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
I S GH VFVN + GYG +D +K +++ +G N + ILS VGL N G
Sbjct: 491 TINSAGHTLQVFVNGQSYGAGYGGYDSPKLSYSKYVKMWQGSNKISILSSAVGLANQGTH 550
Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
++ G+ V L L GKRDLS+ +W YQ+G++GE +G+ I+ ++S W +
Sbjct: 551 YENWNVGVLGPVTLSGLNQGKRDLSNQKWTYQIGLKGESLGVHSITGSSSVEWGSANGA- 609
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ L W+K F AP G P+AL++ SMGKGQ WVNG++ GRYWS ++G C Y
Sbjct: 610 --QPLTWHKAYFSAPAGGAPVALDMGSMGKGQIWVNGRNAGRYWS---YKASGSCGSCSY 664
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
G+Y +KCQ +CG +Q YH+PR+W++P NLLV+ EE GGD S + L+T+T
Sbjct: 665 TGTYSETKCQTNCGDISQRWYHVPRSWLNPSGNLLVVLEEFGGDLSGVKLMTRT 718
>gi|326497687|dbj|BAK05933.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 716
Score = 789 bits (2038), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/710 (54%), Positives = 498/710 (70%), Gaps = 26/710 (3%)
Query: 6 TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
+YDHRA+VI+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HEP RG
Sbjct: 24 SYDHRAVVINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPARG 83
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
QY+F R+DLVRFVK ++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT N PF
Sbjct: 84 QYHFADRYDLVRFVKLARQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGPF 143
Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
K EM+RF+ KI+ +MK E LF QGGPIILAQVENEYG +E A G G + Y WAA+ AV
Sbjct: 144 KAEMQRFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESAMGAGAKPYANWAANMAV 203
Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
+ VPWVMC+Q+DAPDP+INTCNGFYCD FTPNS SKP MWTE ++GWF +FG VP
Sbjct: 204 ATDAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNSNSKPTMWTEAWTGWFTAFGGPVPH 263
Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQP 305
RPVED+AFAVARF + GG+F NYYMY GGTNF RTAGGP +ATSYDYDAPIDEYG IRQP
Sbjct: 264 RPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLIRQP 323
Query: 306 KWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANV 365
KWGHLR+LHKAIK E L+S DPT Q++G +A+++ S+ CAAFL+NY +SS A +
Sbjct: 324 KWGHLRDLHKAIKQAEPALVSGDPTIQRIGNYEKAYVFKSSTGACAAFLSNYHTSSAARI 383
Query: 366 TFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW- 424
+NG Y LPAWS+SILPDCK VFNTA V P A K + + F+W
Sbjct: 384 VYNGRRYDLPAWSISILPDCKTAVFNTATV-------KEPTAPAK-----MNPAGGFAWQ 431
Query: 425 -YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIES 478
Y E + +F + L EQ++ T D SDYLWYT +++ + G+ L I S
Sbjct: 432 SYSEDTNALDSSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIDSSEQFLKTGQWPQLTINS 491
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
GH+ VFVN + YG ++ +K +++ +G N + ILS +GL N G ++
Sbjct: 492 AGHSVQVFVNGQSFGVAYGGYNSPKLTYSKPVKMWQGSNKISILSSAMGLPNQGTHYEAW 551
Query: 539 GAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
G+ V L L GKRDLS+ +W YQ+G++GE +G++ IS ++S + S+ +
Sbjct: 552 NVGVLGPVTLSGLNQGKRDLSNQKWTYQIGLKGESLGVNSISGSSSV---EWSSASGAQP 608
Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
L W+K F AP G P+AL++ SMGKGQ WVNG + GRYWS Y A +G C Y G++
Sbjct: 609 LTWHKAYFAAPAGSAPVALDMGSMGKGQIWVNGNNAGRYWS-YRA--SGSCGGCSYAGTF 665
Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
+KCQ +CG +Q YH+PR+W+ P NLLV+ EE GGD S ++L+T+T
Sbjct: 666 SEAKCQTNCGDISQRWYHVPRSWLKPSGNLLVVLEEFGGDLSGVTLMTRT 715
>gi|357124049|ref|XP_003563719.1| PREDICTED: beta-galactosidase 9-like isoform 2 [Brachypodium
distachyon]
Length = 721
Score = 788 bits (2036), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/716 (53%), Positives = 496/716 (69%), Gaps = 28/716 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+A V+YDH+A+VI+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HE
Sbjct: 23 NAAVSYDHKAIVINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHE 82
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P++GQYYF R+DLVRFVK ++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 83 PVQGQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTD 142
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK M+ F+ KI+ +MK E LF QGGPIILAQVENEYG +E G G + Y WAA
Sbjct: 143 NGPFKAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGGGAKPYANWAA 202
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV VPWVMC+Q+DAPDP+INTCNGFYCD FTPNS KP MWTE +SGWF +FG
Sbjct: 203 KMAVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNSNGKPNMWTEAWSGWFTAFGG 262
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
AVP RPVEDLAFAVARF + GG+F NYYMY GGTNF RTAGGP +ATSYDYDAPIDEYG
Sbjct: 263 AVPHRPVEDLAFAVARFVQKGGSFVNYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGL 322
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPKWGHLR+LHKAIK E ++S DPT Q +G +A+++ S+ CAAFL+NY +SS
Sbjct: 323 LRQPKWGHLRDLHKAIKQAEPAMVSGDPTIQSIGNYEKAYVFKSSTGACAAFLSNYHTSS 382
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
A V +NG Y LPAWS+SILPDCK V+NTA V +QK + L + A
Sbjct: 383 PAKVVYNGRRYELPAWSISILPDCKTAVYNTATV------------RQKWKEKKLWMNPA 430
Query: 422 --FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
FSW Y E + +F + L EQ++ T D SD+LWYT +++ + G+
Sbjct: 431 GGFSWQSYSEDTNSLDDSAFTKDGLVEQLSMTWDKSDFLWYTTYVNIDSSEQFLKSGQWP 490
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L I S GH VFVN + GYG +D +K +++ +G N + ILS VGL N G
Sbjct: 491 QLTINSAGHTLQVFVNGQSYGAGYGGYDSPKLSYSKYVKMWQGSNKISILSSAVGLANQG 550
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
++ G+ V L L GKRDLS+ +W YQ+G++GE +G+ I+ ++S W +
Sbjct: 551 THYENWNVGVLGPVTLSGLNQGKRDLSNQKWTYQIGLKGESLGVHSITGSSSVEWGSANG 610
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
+ L W+K F AP G P+AL++ SMGKGQ WVNG++ GRYWS ++G C
Sbjct: 611 A---QPLTWHKAYFSAPAGGAPVALDMGSMGKGQIWVNGRNAGRYWS---YKASGSCGSC 664
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
Y G+Y +KCQ +CG +Q YH+PR+W++P NLLV+ EE GGD S + L+T+T
Sbjct: 665 SYTGTYSETKCQTNCGDISQRWYHVPRSWLNPSGNLLVVLEEFGGDLSGVKLMTRT 720
>gi|15219534|ref|NP_175127.1| beta-galactosidase 5 [Arabidopsis thaliana]
gi|75192251|sp|Q9MAJ7.1|BGAL5_ARATH RecName: Full=Beta-galactosidase 5; Short=Lactase 5; Flags:
Precursor
gi|7767665|gb|AAF69162.1|AC007915_14 F27F5.20 [Arabidopsis thaliana]
gi|17979002|gb|AAL47461.1| At1g45130/F27F5_20 [Arabidopsis thaliana]
gi|20334754|gb|AAM16238.1| At1g45130/F27F5_20 [Arabidopsis thaliana]
gi|332193961|gb|AEE32082.1| beta-galactosidase 5 [Arabidopsis thaliana]
Length = 732
Score = 788 bits (2035), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/713 (53%), Positives = 488/713 (68%), Gaps = 19/713 (2%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
++VTYD +A+VI+G RR+L SGSIHYPRSTPE+W +LI+K+K+GGL+VI+TYVFWN HEP
Sbjct: 29 SSVTYDKKAIVINGHRRILLSGSIHYPRSTPEMWEDLIKKAKDGGLDVIDTYVFWNGHEP 88
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G Y FEGR+DLVRF+KT+QE GL++HLRIGPY CAEWN+GGFPVWL ++ GI FRT N
Sbjct: 89 SPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIGPYVCAEWNFGGFPVWLKYVDGISFRTDN 148
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M+ F KI+ +MK+ FASQGGPIIL+Q+ENE+ G G YV WAA
Sbjct: 149 GPFKSAMQGFTEKIVQMMKEHRFFASQGGPIILSQIENEFEPDLKGLGPAGHSYVNWAAK 208
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
AV LNT VPWVMC+++DAPDPIINTCNGFYCD FTPN P KP MWTE +SGWF FG
Sbjct: 209 MAVGLNTGVPWVMCKEDDAPDPIINTCNGFYCDYFTPNKPYKPTMWTEAWSGWFTEFGGT 268
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
VP RPVEDLAF VARF + GG++ NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG +
Sbjct: 269 VPKRPVEDLAFGVARFIQKGGSYINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLV 328
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
++PK+ HL++LH+AIK CE L+SSDP KLG EAH++ C AFL NY ++
Sbjct: 329 QEPKYSHLKQLHQAIKQCEAALVSSDPHVTKLGNYEEAHVFTAGKGSCVAFLTNYHMNAP 388
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A V FN Y LPAWS+SILPDC+NVVFNTA V ++ ++ Q +L + +
Sbjct: 389 AKVVFNNRHYTLPAWSISILPDCRNVVFNTATVAAKTSH-----VQMVPSGSILYSVAR- 442
Query: 423 SWYEEKVGISGNRSFVRPD-LAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
Y+E + GNR + L EQ+N T+DT+DYLWYT S+ + + GK L +
Sbjct: 443 --YDEDIATYGNRGTITARGLLEQVNVTRDTTDYLWYTTSVDIKASESFLRGGKWPTLTV 500
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
+S GHA VFVN +G + F + ++ L G N + +LS+ VGL N G F+
Sbjct: 501 DSAGHAVHVFVNGHFYGSAFGTRENRKFSFSSQVNLRGGANKIALLSVAVGLPNVGPHFE 560
Query: 537 VAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
G+ SV+L L G +DLS +W YQ G+ GE + L + +S W +GS N
Sbjct: 561 TWATGIVGSVVLHGLDEGNKDLSWQKWTYQAGLRGESMNLVSPTEDSSVDWIKGSLAKQN 620
Query: 596 KS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
K L WYK F AP G PLAL+L SMGKGQAW+NGQSIGRYW A+ + G C+Y
Sbjct: 621 KQPLTWYKAYFDAPRGNEPLALDLKSMGKGQAWINGQSIGRYWMAF---AKGDCGSCNYA 677
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
G+Y +KCQ CG+P Q YH+PR+W+ P NLLV+ EELGGD SK+S++ ++
Sbjct: 678 GTYRQNKCQSGCGEPTQRWYHVPRSWLKPKGNLLVLFEELGGDISKVSVVKRS 730
>gi|224053294|ref|XP_002297749.1| predicted protein [Populus trichocarpa]
gi|222845007|gb|EEE82554.1| predicted protein [Populus trichocarpa]
Length = 823
Score = 787 bits (2032), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/831 (48%), Positives = 535/831 (64%), Gaps = 42/831 (5%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
++ VTYD RA++IDGK R+L SGSIHYPRST ++WP+L++KS+EGGL+ IETYVFW+ HE
Sbjct: 22 ASKVTYDGRAIIIDGKHRLLVSGSIHYPRSTAQMWPDLVKKSREGGLDAIETYVFWDSHE 81
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P R +Y F G DL+RF+KT+Q+ GL+ LRIGPY CAEWNYGGFPVWLH +PG+Q RT
Sbjct: 82 PARREYDFSGNLDLIRFLKTIQDEGLYAVLRIGPYVCAEWNYGGFPVWLHNMPGVQMRTA 141
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N+ F EM+ F I++++KQENLFASQGGP+ILAQ+ENEYGNV +YG G+ Y++W A
Sbjct: 142 NDVFMNEMRNFTTLIVNMVKQENLFASQGGPVILAQIENEYGNVMSSYGDEGKAYIEWCA 201
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
+ A +L+ VPW+MCQQ DAP+P+INTCNG+YCD FTPN P+ P MWTEN++GWF S+G
Sbjct: 202 NMAQSLHIGVPWLMCQQSDAPEPMINTCNGWYCDQFTPNRPTSPKMWTENWTGWFKSWGG 261
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
P R EDLAF+VARF++ GGTFQNYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG
Sbjct: 262 KDPHRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYGN 321
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+ QPKWGHL+ELH + E+ L + + G + IY + + FL N DS +
Sbjct: 322 LNQPKWGHLKELHDVLHSMEDTLTRGNISSVDFGNSVSGTIY-STEKGSSCFLTNTDSRN 380
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
D + F G Y +PAWSVSILPDC++VV+NTAKV +Q + ++KNV E A+
Sbjct: 381 DTTINFQGLDYEVPAWSVSILPDCQDVVYNTAKVSAQTS----VMVKKKNVAEDEPAALT 436
Query: 422 FSWYEE---KVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM---PGQGKEVFLN 475
+SW E K + G + +Q + D SDYL+Y S+ + P G + L
Sbjct: 437 WSWRPETNDKSILFGKGEVSVNQILDQKDAANDLSDYLFYMTSVSLKEDDPIWGDNMTLR 496
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
I G VFVN + + + + +++ ++I+LN+G NT+ +LS VG NYGA F
Sbjct: 497 ITGSGQVLHVFVNGEFIGSQWAKYGVFDYVFEQQIKLNKGKNTITLLSATVGFANYGANF 556
Query: 536 DVAGAGLFS-VILIDLKNGK---RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
D+ AG+ V L+ + + +DLSS +W Y+VG+EG L ++SS W+Q
Sbjct: 557 DLTQAGVRGPVELVGYHDDEIIIKDLSSHKWSYKVGLEGLRQNLYS---SDSSKWQQ-DN 612
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
P NK WYK TF AP G P+ ++L +GKG AWVNG SIGRYW +++A C
Sbjct: 613 YPTNKMFTWYKATFKAPLGTDPVVVDLLGLGKGLAWVNGNSIGRYWPSFIAEDGCSLDPC 672
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWV-HPGENLLVIHEELGGDPSKISLLTKTGQH 710
DYRGSYD +KC +CG+P Q YH+PR+++ + G+N LV+ EE GGDPS ++ T
Sbjct: 673 DYRGSYDNNKCVTNCGKPTQRWYHVPRSFLNNEGDNTLVLFEEFGGDPSSVNFQTTAIGS 732
Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
C E ++ L+C+ G I+AI FAS+G P G CGSF G
Sbjct: 733 ACVNAEE------------------KKKIELSCQ-GRPISAIKFASFGNPLGTCGSFSKG 773
Query: 771 ACHM--DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
C D L IVQKACVGQ C+I VS G S ++K L+VEA C
Sbjct: 774 TCEASNDALSIVQKACVGQESCTIDVSEDTFG-STTCGDDVIKTLSVEAIC 823
>gi|448278449|gb|AGE44111.1| beta-galactosidase 101 [Malus x domestica]
Length = 725
Score = 786 bits (2030), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/711 (53%), Positives = 497/711 (69%), Gaps = 17/711 (2%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
SA+V YDH+A++I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K GGL+VI+TYVFWN HE
Sbjct: 23 SASVGYDHKAIIINGQRRILISGSIHYPRSTPEMWPDLIQKAKAGGLDVIQTYVFWNGHE 82
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G+YYFE R+DLV+F+K VQ+AGLF++LRIGPY CAEWN+GGFP+WL ++PGI FRT
Sbjct: 83 PSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGFPIWLKYVPGIAFRTD 142
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK M++F KI+++MK E LF ++GGPIIL+Q+ENEYG VEW G G+ Y KWAA
Sbjct: 143 NEPFKAAMQKFTEKIVNMMKAEKLFQTEGGPIILSQIENEYGPVEWEIGAPGKAYTKWAA 202
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV LNT VPW+MC+QEDAPDP+I+TCNG+YC+ F PN KP MWTE ++GW+ FG
Sbjct: 203 QMAVGLNTGVPWIMCKQEDAPDPVIDTCNGYYCENFKPNKVYKPKMWTEVWTGWYTEFGG 262
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
A+P RPVEDLAF+VARF ++GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 263 AIPTRPVEDLAFSVARFIQSGGSFFNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGL 322
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
++QPKWGHL++LHKAIK CE L++ DP+ KLG EAH+++ S CAAFLANYD+
Sbjct: 323 LQQPKWGHLKDLHKAIKSCEYALVAVDPSVTKLGNNQEAHVFNTKSG-CAAFLANYDTKY 381
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
V+F Y LP WS+SILPDCK VFNTAKV + + Q K V L
Sbjct: 382 PVRVSFGQGQYDLPPWSISILPDCKTAVFNTAKVTWKTSQ-----VQMKPVYSRLPWQ-- 434
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
S+ EE + + L EQI T+D +DYLWY I + + GK L I
Sbjct: 435 -SFIEETTTSDESGTTTLDGLYEQIYMTRDATDYLWYMTDITIGSDEAFLNNGKFPLLTI 493
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
S HA VF+N +L YG+ + ++ ++L GIN L +LS+ VGL N G F+
Sbjct: 494 FSACHALHVFINGQLSGTVYGSLENPKLTFSQNVKLRPGINKLALLSISVGLPNVGTHFE 553
Query: 537 VAGAGLFSVI-LIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
AG+ I L L G D+S +W Y++G++GE +GL ++ ++S W +G ++
Sbjct: 554 TWNAGVLGPISLKGLNTGTWDMSRWKWTYKIGMKGEALGLHTVTGSSSVDWAEGPSMAKK 613
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
+ L WYK TF AP G PLAL++ SMGKGQ W+NGQS+GR+W Y+A G C+Y G
Sbjct: 614 QPLTWYKATFNAPPGHAPLALDMGSMGKGQIWINGQSVGRHWPGYIA--QGSCGTCNYAG 671
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
++ KC+ +CG+P+Q YHIPR+W+ P NLLV+ EE GGDP +SL+ +
Sbjct: 672 TFYDKKCRTYCGKPSQRWYHIPRSWLTPTGNLLVVFEEWGGDPQWMSLVER 722
>gi|16604400|gb|AAL24206.1| At1g45130/F27F5_20 [Arabidopsis thaliana]
Length = 732
Score = 786 bits (2029), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/713 (53%), Positives = 487/713 (68%), Gaps = 19/713 (2%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
++VTYD +A+VI+G RR+L SGSIHYPRSTPE+W +LI+K+K+GGL+VI+TYVFWN HEP
Sbjct: 29 SSVTYDKKAIVINGHRRILLSGSIHYPRSTPEMWEDLIKKAKDGGLDVIDTYVFWNGHEP 88
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G Y FEGR+DLVRF+KT+QE GL++HLRIGPY CAEWN+GGFPVWL ++ GI FRT N
Sbjct: 89 SPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIGPYVCAEWNFGGFPVWLKYVDGISFRTDN 148
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M+ F KI+ +MK+ FASQGGPIIL+Q+ENE+ G G YV WAA
Sbjct: 149 GPFKSAMQGFTEKIVQMMKEHRFFASQGGPIILSQIENEFEPDLKGLGPAGHSYVNWAAK 208
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
AV LNT VPWVMC+++DAPDPIINTCNGFYCD FTPN P KP MWTE +SGWF FG
Sbjct: 209 MAVGLNTGVPWVMCKEDDAPDPIINTCNGFYCDYFTPNKPYKPTMWTEAWSGWFTEFGGT 268
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
VP RPVEDLAF VARF + GG++ NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG +
Sbjct: 269 VPKRPVEDLAFGVARFIQKGGSYINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLV 328
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
++PK+ HL++LH+AIK CE L+SSDP KLG EAH++ C AFL NY ++
Sbjct: 329 QEPKYSHLKQLHQAIKQCEAALVSSDPHVTKLGNYEEAHVFTAGKGSCVAFLTNYHMNAP 388
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A V FN Y LPAWS+SILPDC+NVVFNTA V ++ ++ Q +L + +
Sbjct: 389 AKVVFNNRHYTLPAWSISILPDCRNVVFNTATVAAKTSH-----VQMVPSGSILYSVAR- 442
Query: 423 SWYEEKVGISGNRSFVRPD-LAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
Y+E + GNR + L EQ+N T+DT+DYLWYT S+ + + GK L +
Sbjct: 443 --YDEDIATYGNRGTITARGLLEQVNVTRDTTDYLWYTTSVDIKASESFLRGGKWPTLTV 500
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
+S GHA VFVN +G + F + ++ L G N + +LS+ VGL N G F+
Sbjct: 501 DSAGHAVHVFVNGHFYGSAFGTRENRKFSFSSQVNLRGGANKIALLSVAVGLPNVGPHFE 560
Query: 537 VAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
G+ SV+L L G +DLS +W YQ G+ GE + L + +S W +GS N
Sbjct: 561 TWATGIVGSVVLHGLDEGNKDLSWQKWTYQAGLRGESMNLVSPTEDSSVDWIKGSLAKQN 620
Query: 596 KS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
K L WYK F P G PLAL+L SMGKGQAW+NGQSIGRYW A+ + G C+Y
Sbjct: 621 KQPLTWYKAYFDVPRGNEPLALDLKSMGKGQAWINGQSIGRYWMAF---AKGDCGSCNYA 677
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
G+Y +KCQ CG+P Q YH+PR+W+ P NLLV+ EELGGD SK+S++ ++
Sbjct: 678 GTYRQNKCQSGCGEPTQRWYHVPRSWLKPKGNLLVLFEELGGDISKVSVVKRS 730
>gi|3299896|gb|AAC25984.1| beta-galactosidase [Solanum lycopersicum]
Length = 724
Score = 785 bits (2028), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/716 (52%), Positives = 497/716 (69%), Gaps = 22/716 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ A+V+YD RA++I+GKR++L SGSIHYPRSTP++WP+LI+K+K+GGL+VIETYVFWN H
Sbjct: 21 VKASVSYDDRAIIINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGH 80
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G+Y FEGR+DLVRF+K VQ AGL+++LRIGPY CAEWN+GGFPVWL ++PG++FRT
Sbjct: 81 EPSPGKYNFEGRYDLVRFIKMVQRAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGMEFRT 140
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M+ F+ KI+++MK ENLF SQGGPII+AQ+ENEYG VEW G G+ Y KWA
Sbjct: 141 NNQPFKVAMQGFVQKIVNMMKSENLFESQGGPIIMAQIENEYGPVEWEIGAPGKAYTKWA 200
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L T VPW+MC+QEDAPDP+I+TCNGFYC+GF PN P KP MWTE ++GW+ FG
Sbjct: 201 AQMAVGLKTGVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKPYKPKMWTEVWTGWYTKFG 260
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+P RP ED+AF+VARF + G+F NYYMY GGTNFGRT+ G +ATSYDYDAP+DEYG
Sbjct: 261 GPIPQRPAEDIAFSVARFVQNNGSFFNYYMYHGGTNFGRTSSGLFIATSYDYDAPLDEYG 320
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+ +PK+GHLR+LHKAIKL E L+SS LG+ EAH+Y S CAAFL+NYDS
Sbjct: 321 LLNEPKYGHLRDLHKAIKLSEPALVSSYAAVTSLGSNQEAHVYRSKSGACAAFLSNYDSR 380
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
VTF Y LP WS+SILPDCK V+NTA+V SQ ++ ++ A
Sbjct: 381 YSVKVTFQNRPYNLPPWSISILPDCKTAVYNTAQVNSQSSS-----------IKMTPAGG 429
Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
SW EE + + L EQ N T+D+SDYLWY ++++ + GK+
Sbjct: 430 GLSWQSYNEETPTADDSDTLTANGLWEQKNVTRDSSDYLWYMTNVNIASNEGFLKNGKDP 489
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
+L + S GH VFVN KL YG D + ++L GIN + +LS+ VGL N G
Sbjct: 490 YLTVMSAGHVLHVFVNGKLSGTVYGTLDNPKLTYSGNVKLRAGINKISLLSVSVGLPNVG 549
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
+D AG+ V L L G R+L+ +W Y+VG++GE + L +S ++S W +GS
Sbjct: 550 VHYDTWNAGVLGPVTLSGLNEGSRNLAKQKWSYKVGLKGESLSLHSLSGSSSVEWVRGSL 609
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
+ + L WYK TF AP G PLAL++ASMGKGQ W+NG+ +GR+W Y+A G KC
Sbjct: 610 MAQKQPLTWYKATFNAPGGNDPLALDMASMGKGQIWINGEGVGRHWPGYIA--QGDCSKC 667
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
Y G+++ KCQ +CGQP+Q YH+PR+W+ P NLLV+ EE GG+P+ ISL+ ++
Sbjct: 668 SYAGTFNEKKCQTNCGQPSQRWYHVPRSWLKPSGNLLVVFEEWGGNPTGISLVRRS 723
>gi|359484258|ref|XP_002276918.2| PREDICTED: beta-galactosidase 7-like [Vitis vinifera]
gi|297738528|emb|CBI27773.3| unnamed protein product [Vitis vinifera]
Length = 835
Score = 785 bits (2027), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/831 (47%), Positives = 542/831 (65%), Gaps = 45/831 (5%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ V+YD RAL+IDGKRRVLQSGSIHYPRSTPE+WP+LIRK+K GGL+ IETYVFWN HE
Sbjct: 37 AVEVSYDGRALIIDGKRRVLQSGSIHYPRSTPEMWPDLIRKAKAGGLDAIETYVFWNVHE 96
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P+R +Y F G DL+RF++T+Q GL+ LRIGPY CAEW YGGFP+WLH +PGI+FRT
Sbjct: 97 PLRREYDFSGNLDLIRFIQTIQAEGLYAVLRIGPYVCAEWTYGGFPMWLHNMPGIEFRTA 156
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N F EM+ F I+D+ KQE LFASQGGPII+AQ+ENEYGN+ YG G++YV W A
Sbjct: 157 NKVFMNEMQNFTTLIVDMAKQEKLFASQGGPIIIAQIENEYGNIMAPYGDAGKVYVDWCA 216
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A +L+ VPW+MCQQ DAP P+INTCNG+YCD FTPN+P+ P MWTEN++GWF ++G
Sbjct: 217 AMANSLDIGVPWIMCQQSDAPQPMINTCNGWYCDSFTPNNPNSPKMWTENWTGWFKNWGG 276
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
P R EDL+++VARFF+TGGTFQNYYMY GGTNFGR AGGP + TSYDYDAP+DE+G
Sbjct: 277 KDPHRTAEDLSYSVARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTSYDYDAPLDEFGN 336
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+ QPKWGHL++LH +K EE L + T +G +E +Y + + F +N ++++
Sbjct: 337 LNQPKWGHLKDLHTVLKSMEETLTEGNITTIDMGNSVEVTVY-ATQKVSSCFFSNSNTTN 395
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
DA T+ G Y +PAWSVSILPDCK V+NTAKV +Q + + KN E AS
Sbjct: 396 DATFTYGGTEYTVPAWSVSILPDCKKEVYNTAKVNAQTS----VMVKNKNEAEDQPASLK 451
Query: 422 FSWYEEKV---GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ---GKEVFLN 475
+SW E + + G L +Q TT D SDYLWY S+ + + L
Sbjct: 452 WSWRPEMIDDTAVLGKGQVSANRLIDQ-KTTNDRSDYLWYMNSVDLSEDDLVWTDNMTLR 510
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ + GH +VN + + + + N++ +K++L G N + +LS +G QNYGA++
Sbjct: 511 VNATGHILHAYVNGEYLGSQWATNGIFNYVFEEKVKLKPGKNLIALLSATIGFQNYGAFY 570
Query: 536 DVAGAGLFSVILIDLKNGK----RDLSSGEWIYQVGVEGEYIGLDKISLANSSF-WKQGS 590
D+ +G+ + I + G +DLSS +W Y+VG+ G + K+ S + W++G+
Sbjct: 571 DLVQSGISGPVEIVGRKGDETIIKDLSSHKWSYKVGMHGMAM---KLYDPESPYKWEEGN 627
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
+P+N++L WYKTTF AP G + ++L +GKG+AWVNGQS+GRYW + +A GC
Sbjct: 628 -VPLNRNLTWYKTTFKAPLGTDAVVVDLQGLGKGEAWVNGQSLGRYWPSSIAED-GCNAT 685
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
CDYRG Y +KC ++CG P Q YH+PR+++ EN LV+ EE GG+PS ++ T T
Sbjct: 686 CDYRGPYTNTKCVRNCGNPTQRWYHVPRSFLTADENTLVLFEEFGGNPSLVNFQTVTIGT 745
Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
C ++++ N+ + LAC+ I+ I FAS+G P+G+CGSF G
Sbjct: 746 ACG----------NAYENNV--------LELACQNR-PISDIKFASFGDPQGSCGSFSKG 786
Query: 771 AC--HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
+C + D L I++KACVG+ CS+ VS G + +C + K LAVEA C
Sbjct: 787 SCEGNKDALDIIKKACVGKESCSLDVSEKAFGST--SCGSIPKRLAVEAVC 835
>gi|195617466|gb|ACG30563.1| beta-galactosidase precursor [Zea mays]
Length = 723
Score = 785 bits (2027), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/714 (53%), Positives = 492/714 (68%), Gaps = 24/714 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+A V+YDHRA+VI+G+RR+L SGSIHYPRSTPE+WP L++K+K+GGL+V++TYVFWN HE
Sbjct: 25 NAAVSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTYVFWNGHE 84
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P+RGQYYF R+DLVRFVK ++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 85 PVRGQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTD 144
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK M+ F+ KI+ +MK E LF QGGPIILAQVENEYG +E G G + Y WAA
Sbjct: 145 NGPFKAAMQAFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGAGAKPYANWAA 204
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV VPWVMC+Q+DAPDP+INTCNGFYCD F+PNS SKP MWTE ++GWF +FG
Sbjct: 205 KMAVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNSNSKPTMWTEAWTGWFTAFGG 264
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
AVP RPVED+AFAVARF + GG+F NYYMY GGTNF RT+GGP +ATSYDYDAPIDEYG
Sbjct: 265 AVPHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGL 324
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPKWGHLR+LHKAIK E L+S DPT Q LG +A+++ S CAAFL+NY +S+
Sbjct: 325 LRQPKWGHLRDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSSGGACAAFLSNYHTSA 384
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
A V FNG Y LPAWS+S+LPDCK VFNTA V P A + + +
Sbjct: 385 AARVVFNGRRYDLPAWSISVLPDCKAAVFNTATV-------SEPSAPAR-----MSPAGG 432
Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
FSW Y E R+F + L EQ++ T D SDYLWYT +++ + G+ L
Sbjct: 433 FSWQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQL 492
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
+ S GH+ VFVN + YG +D + +++ +G N + ILS VGL N G
Sbjct: 493 TVYSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGTH 552
Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
++ G+ V L L GKRDLS+ +W YQ+G+ GE +G+ ++ ++S W +
Sbjct: 553 YETWNVGVLGPVTLSGLNEGKRDLSNQKWTYQIGLHGESLGVQSVAGSSSVEWGSAAG-- 610
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ L W+K F AP G P+AL++ SMGKGQAWVNG+ IGRYWS Y A S+G C Y
Sbjct: 611 -KQPLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYWS-YKASSSGGCGGCSY 668
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
G+Y +KCQ CG +Q YH+PR+W++P NLLV+ EE GGD + L+T+T
Sbjct: 669 AGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVLLEEFGGDLPGVKLVTRT 722
>gi|297846860|ref|XP_002891311.1| hypothetical protein ARALYDRAFT_473836 [Arabidopsis lyrata subsp.
lyrata]
gi|297337153|gb|EFH67570.1| hypothetical protein ARALYDRAFT_473836 [Arabidopsis lyrata subsp.
lyrata]
Length = 732
Score = 784 bits (2025), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/713 (53%), Positives = 488/713 (68%), Gaps = 19/713 (2%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
++VTYD +A+VI+G RR+L SGSIHYPRSTPE+W +LI+K+K+GGL+VI+TYVFWN HEP
Sbjct: 29 SSVTYDKKAIVINGHRRILLSGSIHYPRSTPEMWEDLIKKAKDGGLDVIDTYVFWNGHEP 88
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G Y FEGR+DLVRF+KT+QE GL++HLRIGPY CAEWN+GGFPVWL ++ GI FRT N
Sbjct: 89 SPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIGPYVCAEWNFGGFPVWLKYVDGISFRTDN 148
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M+ F KI+ +MK+ FASQGGPIIL+Q+ENE+ G G YV WAA
Sbjct: 149 GPFKAAMQGFTEKIVQMMKEHRFFASQGGPIILSQIENEFEPELKGLGPAGHSYVNWAAK 208
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
AV LNT VPWVMC+++DAPDPIIN+CNGFYCD FTPN P KP MWTE +SGWF FG
Sbjct: 209 MAVGLNTGVPWVMCKEDDAPDPIINSCNGFYCDYFTPNKPYKPTMWTEAWSGWFTEFGGT 268
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
+P RPVEDLAF VARF + GG++ NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG +
Sbjct: 269 IPKRPVEDLAFGVARFIQKGGSYINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLV 328
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
++PK+ HL++LH+AIK CE L+SSDP KLG EAH++ C AFL NY ++
Sbjct: 329 QEPKYSHLKQLHQAIKQCEAALVSSDPHVTKLGNYEEAHVFTAGKGSCVAFLTNYHMNAP 388
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A V FN Y LPAWS+SILPDC+NVVFNTA V ++ ++ Q +L + +
Sbjct: 389 AKVVFNNRHYTLPAWSISILPDCRNVVFNTATVAAKTSH-----VQMMPSGSILYSVAR- 442
Query: 423 SWYEEKVGISGNRSFVRPD-LAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
Y+E + G+R + L EQ+N T+DT+DYLWYT S+ + + GK L +
Sbjct: 443 --YDEDIATYGDRGTITARGLLEQVNVTRDTTDYLWYTTSVDIKASESFLRGGKWPTLTV 500
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
+S GHA VFVN +G + F + ++ L G N + +LS+ VGL N G F+
Sbjct: 501 DSAGHAVHVFVNGHFYGSAFGTRENRKFSFSSQVNLRGGANRIALLSVAVGLPNVGPHFE 560
Query: 537 VAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
G+ SV+L L G +DLS +W YQ G+ GE + L + +S W +GS N
Sbjct: 561 TWATGIVGSVVLHGLDEGNKDLSWQKWTYQAGLRGEAMKLVSPTEDSSVDWIKGSLAKQN 620
Query: 596 KS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
K L WYK F AP G PLAL+L SMGKGQAW+NGQSIGRYW A+ + G C+Y
Sbjct: 621 KQPLTWYKAYFDAPRGNEPLALDLKSMGKGQAWINGQSIGRYWMAF---AKGNCGSCNYA 677
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
G+Y +KCQ CG+P Q YH+PR+W+ P NLLV+ EELGGD SK+S++ ++
Sbjct: 678 GTYRQNKCQSGCGEPTQRWYHVPRSWLKPRGNLLVLFEELGGDISKVSVVKRS 730
>gi|7682680|gb|AAF67342.1| beta galactosidase [Vigna radiata]
Length = 739
Score = 784 bits (2025), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/715 (53%), Positives = 491/715 (68%), Gaps = 22/715 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ +VTYD +A++I+G+RR+L SGSIHYPRSTPE+W +LIRK+K GGL+ I+TYVFWN H
Sbjct: 24 IHCSVTYDRKAIIINGQRRILISGSIHYPRSTPEMWEDLIRKAKGGGLDAIDTYVFWNVH 83
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G Y FEGR+DLVRF+KTVQ GL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 84 EPSPGIYNFEGRYDLVRFIKTVQRVGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 143
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M+ F KI+ +MK E LF SQGGPIIL+Q+ENEYG+ G G Y WA
Sbjct: 144 DNGPFKAAMQGFTQKIVQMMKNEKLFQSQGGPIILSQIENEYGSESKQLGGAGYAYTNWA 203
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV LNT VPWVMC+Q+DAPDP+IN CNGFYCD F+PN P KP +WTE++SGWF FG
Sbjct: 204 AKMAVGLNTGVPWVMCKQDDAPDPVINACNGFYCDYFSPNKPYKPTLWTESWSGWFTEFG 263
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ RPV+DLAFAVARF + GG++ NYYMY GGTNFGR+AGGP + TSYDYDAPIDEYG
Sbjct: 264 GPIYQRPVQDLAFAVARFIQKGGSYINYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYG 323
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IR+PK+GHL +LHKAIK CE L+SSDPT LGA +AH++ + CAAFLANY S+
Sbjct: 324 LIREPKYGHLMDLHKAIKQCERALVSSDPTVTSLGAYEQAHVFSSKNGACAAFLANYHSN 383
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A VTFN Y LP WS+SILPDCK VFNTA+V Q + L S
Sbjct: 384 SAARVTFNNRKYDLPPWSISILPDCKTDVFNTARV----------RFQTTKIQMLPSNSK 433
Query: 421 AFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
FSW Y+E V +S + L EQ+N T+DTSDYLWY S+ + + G +
Sbjct: 434 LFSWETYDEDVSSLSESSKITASGLLEQLNATRDTSDYLWYITSVDISSSESFLRGGNKP 493
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
+++ S GHA VF+N + + +G + + N + L G N + +LS+ VGL N G
Sbjct: 494 SISVHSAGHAVHVFINGQFLGSAFGTSEDRSCTFNGPVNLRAGTNKIALLSVAVGLPNVG 553
Query: 533 AWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
F+ AG+ V+L L +G++DL+ +W YQ+G++GE + L + +S W + S
Sbjct: 554 FHFETWKAGITGVLLYGLDHGQKDLTWQKWSYQIGLKGEAMNLVSPNGVSSVDWVRDSLD 613
Query: 593 PVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
++S L W+K F AP+G PLAL+L+SMGKGQ W+NGQSIGRYW Y + G C
Sbjct: 614 VRSQSQLKWHKAYFNAPDGVEPLALDLSSMGKGQVWINGQSIGRYWMVY---AKGACNSC 670
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
+Y G+Y +KCQ CGQP Q YH+PR+W+ P NL+V+ EELGG+P KISL +
Sbjct: 671 NYAGTYRPAKCQLGCGQPTQQWYHVPRSWLKPTNNLIVLLEELGGNPWKISLQKR 725
>gi|14970843|emb|CAC44502.1| beta-galactosidase [Fragaria x ananassa]
Length = 722
Score = 783 bits (2023), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/714 (53%), Positives = 491/714 (68%), Gaps = 25/714 (3%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A+V YDHRA++++GKRR+L SGSIHYPRSTPE+WP+L++K+K+GGL+V++TYVFWN HEP
Sbjct: 25 ASVGYDHRAIIVNGKRRILISGSIHYPRSTPEMWPDLLQKAKDGGLDVLQTYVFWNGHEP 84
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G+YYFE R+DLV+F+K Q+ GL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT N
Sbjct: 85 SPGKYYFEDRYDLVKFIKLAQQHGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIAFRTDN 144
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PF M++F KI+ +MK E LF +QGGPIIL+Q+ENEYG VEW G G+ Y +WAA
Sbjct: 145 RPFMAAMEKFTQKIVYMMKAERLFQTQGGPIILSQIENEYGPVEWEIGAPGKSYTQWAAK 204
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
AV LNT VPWVMC+QEDAPDPII+TCNGFYC+ FTPN KP MWTE ++GW+ FG A
Sbjct: 205 MAVGLNTGVPWVMCKQEDAPDPIIDTCNGFYCENFTPNKNYKPKMWTEIWTGWYTEFGGA 264
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
VP RP +DLAF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 265 VPTRPAQDLAFSVARFIQNGGSFANYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLP 324
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
R+PK+ HL+ +HKAIK+ E L+++D KLG EAH+Y +S + CAAFLANYD+
Sbjct: 325 REPKYSHLKYMHKAIKMAEPALLATDAAVSKLGNNQEAHVY-QSRSGCAAFLANYDTKYP 383
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
VTF Y LP WS+SILPDCK VFNTA+V G P + V L
Sbjct: 384 VRVTFWNKQYNLPPWSISILPDCKTEVFNTARV------GQSPPTKMTPVAHL------- 430
Query: 423 SW--YEEKVGISG-NRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
SW Y E V S + +F L EQI+ T D +DYLWY I + P + GK L
Sbjct: 431 SWQAYIEDVATSADDNAFTSVGLREQISLTWDNTDYLWYMTDITIGPNEQFLRTGKYPTL 490
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
++S GHA VF+N +L YG F N+ ++L GIN L +LS+ VGL N G
Sbjct: 491 KVDSAGHALHVFINGQLSGSAYGTLAFPKLEFNQGVKLRAGINKLALLSVSVGLANVGLH 550
Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
F+ G+ V L + +G D++ +W Y++G+ GE + L +S ++S W QGS L
Sbjct: 551 FETWNTGVLGPVTLAGVNSGTWDMTRWQWTYKIGMRGEDMSLHTVSGSSSVEWVQGSLLA 610
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ L WYK AP G PLAL++ SMGKGQ W+NGQSIGR+W AY A G C Y
Sbjct: 611 QYRPLTWYKAILNAPPGNAPLALDMGSMGKGQMWINGQSIGRHWPAYKA--HGSCGACYY 668
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
G+Y +KC+ +CGQP+Q YH+PR+W+ NLLV+ EE GGDP+KISL+ ++
Sbjct: 669 AGTYTENKCRTNCGQPSQRWYHVPRSWLKSSGNLLVVFEEWGGDPTKISLVARS 722
>gi|297816572|ref|XP_002876169.1| AT3g52840/F8J2_10 [Arabidopsis lyrata subsp. lyrata]
gi|297322007|gb|EFH52428.1| AT3g52840/F8J2_10 [Arabidopsis lyrata subsp. lyrata]
Length = 728
Score = 783 bits (2021), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/717 (53%), Positives = 501/717 (69%), Gaps = 25/717 (3%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A VTYDH+AL+I+G+RR+L SGSIHYPRSTPE+WP+LI+K+KEGGL+VI+TYVFWN HEP
Sbjct: 27 AVVTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKAKEGGLDVIQTYVFWNGHEP 86
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G YYF+ R+DLV+F K V +AGL+L LRIGPY CAEWN+GGFPVWL ++PGI FRT N
Sbjct: 87 SPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNFGGFPVWLKYVPGIVFRTDN 146
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M+RF KI+D+MK+E LF +QGGPIIL+Q+ENEYG +EW G G+ Y KW A+
Sbjct: 147 EPFKIAMQRFTKKIVDMMKEEKLFETQGGPIILSQIENEYGPMEWEMGAAGKAYSKWTAE 206
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
A+ L+T VPW+MC+QEDAP PII+TCNGFYC+GF PNS +KP +WTEN++GWF FG A
Sbjct: 207 MALGLSTGVPWIMCKQEDAPYPIIDTCNGFYCEGFKPNSDNKPKLWTENWTGWFTEFGGA 266
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
+P RPVED+AF+VARF + GG+F NYYMY+GGTNF RTA G +ATSYDYDAP+DEYG +
Sbjct: 267 IPNRPVEDIAFSVARFIQNGGSFLNYYMYYGGTNFDRTA-GVFIATSYDYDAPLDEYGLL 325
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
R+PK+ HL+ELHK IKLCE L+S DPT LG K E H++ KS CAAFL+NYD+SS
Sbjct: 326 REPKYSHLKELHKVIKLCEPALVSVDPTITSLGDKQEVHVF-KSKTSCAAFLSNYDTSSA 384
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A + F G Y LP WSVSILPDCK +NTAK+ + P K ++ S+ F
Sbjct: 385 ARIMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRA-------PTILMK----MVPTSTKF 433
Query: 423 SW--YEEKVGISGNR-SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
SW Y E S + +FV+ L EQI+ T+D +DY WY I + + G + L
Sbjct: 434 SWESYNEGSPSSNDDGTFVKDGLVEQISMTRDKTDYFWYLTDITIGSDESFLKTGDDPLL 493
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
I S GHA VFVN L YG + ++KI+L+ GIN L +LS VGL N G
Sbjct: 494 TIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQKIKLSVGINKLALLSTAVGLPNAGVH 553
Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANS-SFWKQGSTL 592
++ G+ V L + +G D+S +W Y++G+ GE + I+ +++ +W +GS +
Sbjct: 554 YETWNTGVLGPVTLKGVNSGTWDMSKWKWSYKIGIRGEAMSFHTIAGSSAVKWWIKGSFV 613
Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
+ L WYK++F P+G PLAL++ +MGKGQ WVNG +IGR+W AY A G +C+
Sbjct: 614 VKKEPLTWYKSSFDTPKGNEPLALDMNTMGKGQVWVNGHNIGRHWPAYTA--RGNCGRCN 671
Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
Y G Y+ KC HCG+P+Q YH+PR+W+ P NLLVI EE GGDPS ISL+ +T +
Sbjct: 672 YAGIYNEKKCLSHCGEPSQRWYHVPRSWLKPFGNLLVIFEEWGGDPSGISLVKRTAK 728
>gi|350538173|ref|NP_001234842.1| ss-galactosidase precursor [Solanum lycopersicum]
gi|4138141|emb|CAA10175.1| ss-galactosidase [Solanum lycopersicum]
Length = 724
Score = 782 bits (2020), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/716 (52%), Positives = 496/716 (69%), Gaps = 22/716 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ A+V+YD RA++I+GKR++L SGSIHYPRSTP++WP+LI+K+K+GGL+VIETYVFWN H
Sbjct: 21 VKASVSYDDRAIIINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGH 80
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
P G+Y FEGR+DLVRF+K VQ AGL+++LRIGPY CAEWN+GGFPVWL ++PG++FRT
Sbjct: 81 GPSPGKYNFEGRYDLVRFIKMVQRAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGMEFRT 140
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M+ F+ KI+++MK ENLF SQGGPII+AQ+ENEYG VEW G G+ Y KWA
Sbjct: 141 NNQPFKVAMRGFVQKIVNMMKSENLFESQGGPIIMAQIENEYGPVEWEIGAPGKAYTKWA 200
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L T VPW+MC+QEDAPDP+I+TCNGFYC+GF PN P KP MWTE ++GW+ FG
Sbjct: 201 AQMAVGLKTGVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKPYKPKMWTEVWTGWYTKFG 260
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+P RP ED+AF+VARF + G+F NYYMY GGTNFGRT+ G +ATSYDYDAP+DEYG
Sbjct: 261 GPIPQRPAEDIAFSVARFVQNNGSFFNYYMYHGGTNFGRTSSGLFIATSYDYDAPLDEYG 320
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+ +PK+GHLR+LHKAIKL E L+SS LG+ EAH+Y S CAAFL+NYDS
Sbjct: 321 LLNEPKYGHLRDLHKAIKLSEPALVSSYAAVTSLGSNQEAHVYRSKSGACAAFLSNYDSR 380
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
VTF Y LP WS+SILPDCK V+NTA+V SQ ++ ++ A
Sbjct: 381 YSVKVTFQNRPYNLPPWSISILPDCKTAVYNTAQVNSQSSS-----------IKMTPAGG 429
Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
SW EE + + L EQ N T+D+SDYLWY ++++ + GK+
Sbjct: 430 GLSWQSYNEETPTADDSDTLTANGLWEQKNVTRDSSDYLWYMTNVNIASNEGFLKNGKDP 489
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
+L + S GH VFVN KL YG D + ++L GIN + +LS+ VGL N G
Sbjct: 490 YLTVMSAGHVLHVFVNGKLSGTVYGTLDNPKLTYSGNVKLRAGINKISLLSVSVGLPNVG 549
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
+D AG+ V L L G R+L+ +W Y+VG++GE + L +S ++S W +GS
Sbjct: 550 VHYDTWNAGVLGPVTLSGLNEGSRNLAKQKWSYKVGLKGESLSLHSLSGSSSVEWVRGSL 609
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
+ + L WYK TF AP G PLAL++ASMGKGQ W+NG+ +GR+W Y+A G KC
Sbjct: 610 VAQKQPLTWYKATFNAPGGNDPLALDMASMGKGQIWINGEGVGRHWPGYIA--QGDCSKC 667
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
Y G+++ KCQ +CGQP+Q YH+PR+W+ P NLLV+ EE GG+P+ ISL+ ++
Sbjct: 668 SYAGTFNEKKCQTNCGQPSQRWYHVPRSWLKPSGNLLVVFEEWGGNPTGISLVRRS 723
>gi|380450408|gb|AFD54987.1| beta-galactosidase [Momordica charantia]
Length = 719
Score = 782 bits (2020), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/712 (53%), Positives = 493/712 (69%), Gaps = 20/712 (2%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A VTYD +A++I+GKRR+L SGSIHYPRSTP++WP LI+ +K+GGL++IETYVFWN HEP
Sbjct: 20 ATVTYDQKAIIINGKRRILVSGSIHYPRSTPQMWPSLIQNAKDGGLDIIETYVFWNGHEP 79
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
+G+YYFE R+DLVRF+K VQ+AGL++HLRIGPY CAEWNYGGFP+WL +PGI FRT N
Sbjct: 80 TQGKYYFEDRYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYGGFPIWLKHVPGIVFRTEN 139
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M++F KI+ +MK E L+ SQGGPIIL+Q+ENEYG VEW G G+ Y KWAA
Sbjct: 140 EPFKAAMQKFTEKIVGMMKSEKLYESQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAAQ 199
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
A+ L+T VPWVMC+QEDAPDP+I+TCNGFYC+ F PN +KP +WTE +SGW+ +FG A
Sbjct: 200 MALGLDTGVPWVMCKQEDAPDPVIDTCNGFYCENFKPNRENKPKIWTEVWSGWYTAFGGA 259
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
VP+RP EDLAF+VARF + GG+ NYYMY GGTNFGR++ G +A SYD+DAPIDEYG
Sbjct: 260 VPYRPAEDLAFSVARFVQNGGSLFNYYMYHGGTNFGRSS-GLFIANSYDFDAPIDEYGLK 318
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
R+PKW HLR+LHKAIKLCE L+S+DP LG LEA ++ SS CAAFLANYD S+
Sbjct: 319 REPKWEHLRDLHKAIKLCEPALVSADPNVTWLGKNLEARVFKSSSGACAAFLANYDISTS 378
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
+ V+F Y LP WS+SIL DCK+ +FNTA++ AQ + +L++S +
Sbjct: 379 SKVSFWNTQYDLPPWSISILSDCKSAIFNTARI----------GAQSAPMKMMLVSSFWW 428
Query: 423 SWYEEKVGIS-GNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
Y+E+V + + L EQ+N T D++DYLWY I + P + G+ LNI
Sbjct: 429 LSYKEEVASGYATDTTTKDGLVEQVNFTWDSTDYLWYMTDIQIDPNEAFIKSGQWPLLNI 488
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
S GH VFVN +L YG+ + +K + L G+N L +LS+ VGL N G F+
Sbjct: 489 SSAGHVLHVFVNGQLSGTVYGSLENPKVAFSKYVNLKAGVNKLSMLSVTVGLPNVGLHFE 548
Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
AG+ V L L G RD+S +W ++VG++GE + L I +NS W +GS L
Sbjct: 549 SWNAGVLGPVTLKGLNEGIRDMSGYKWSHKVGLKGENMNLHTIGGSNSVQWAKGSGLVQK 608
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
+ L WYKT F P G PLAL+++SMGKGQ W+NG+SIGRYW AY A +G KC Y G
Sbjct: 609 QPLTWYKTNFNTPAGNEPLALDMSSMGKGQIWINGRSIGRYWPAYAA--SGSCGKCSYAG 666
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
+ KC +CGQP+Q YH+PR W+ N LV+ EELGG+P ISL+ ++
Sbjct: 667 IFTEKKCLSNCGQPSQKWYHVPREWLESKGNFLVVFEELGGNPGGISLVKRS 718
>gi|6686882|emb|CAB64741.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 732
Score = 782 bits (2020), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/713 (53%), Positives = 486/713 (68%), Gaps = 19/713 (2%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
++VTYD +A+VI+G RR+L SGSIHYPRSTPE+W +LI+K+K+GGL+VI+TYVFWN HEP
Sbjct: 29 SSVTYDKKAIVINGHRRILLSGSIHYPRSTPEMWEDLIKKAKDGGLDVIDTYVFWNGHEP 88
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G Y FEGR+DLVRF+KT+QE GL++HLRIGPY CAEWN+GGFPVWL ++ GI FRT N
Sbjct: 89 SPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIGPYVCAEWNFGGFPVWLKYVDGISFRTDN 148
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M+ F KI+ +MK+ FASQGGPIIL+Q+ENE+ G G YV WAA
Sbjct: 149 GPFKSAMQGFTEKIVQMMKEHRFFASQGGPIILSQIENEFEPDLKGLGPAGHSYVNWAAK 208
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
AV LNT VPWVMC+++DAPDPIINTCNGFYCD FTPN P KP MWTE +SGWF FG
Sbjct: 209 MAVGLNTGVPWVMCKEDDAPDPIINTCNGFYCDYFTPNKPYKPTMWTEAWSGWFTEFGGT 268
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
VP RPVEDLAF VARF + GG++ NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG +
Sbjct: 269 VPKRPVEDLAFGVARFIQKGGSYINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLV 328
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
++PK+ HL++LH+AIK CE L+SSDP KLG EAH++ C AFL NY ++
Sbjct: 329 QEPKYSHLKQLHQAIKQCEAALVSSDPHVTKLGNYEEAHVFTAGKGSCVAFLTNYHMNAP 388
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A V FN Y LPAWS+SILPDC+NVVFNTA V ++ ++ Q +L + +
Sbjct: 389 AKVVFNNRHYTLPAWSISILPDCRNVVFNTATVAAKTSH-----VQMVPSGSILYSVAR- 442
Query: 423 SWYEEKVGISGNRSFVRPD-LAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
Y+E + GN + L EQ+N T+DT+DYLWYT S+ + + GK L +
Sbjct: 443 --YDEDIATYGNPGTITARGLLEQVNVTRDTTDYLWYTTSVDIKASESFLRGGKWPTLTV 500
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
+S GHA VFVN +G + F + ++ L G N + +LS+ VGL N G F+
Sbjct: 501 DSAGHAVHVFVNGHFYGSAFGTRENRKFSFSSQVNLRGGANKIALLSVAVGLPNVGPHFE 560
Query: 537 VAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
G+ SV L L G +DLS +W YQ G+ GE + L + +S W +GS N
Sbjct: 561 TWATGIVGSVALHGLDEGNKDLSWQKWTYQAGLRGESMNLVSPTEDSSVDWIKGSLAKQN 620
Query: 596 KS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
K L WYK F AP G PLAL+L SMGKGQAW+NGQSIGRYW A+ + G C+Y
Sbjct: 621 KQPLTWYKAYFDAPRGNEPLALDLKSMGKGQAWINGQSIGRYWMAF---AKGDCGSCNYA 677
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
G+Y +KCQ CG+P Q YH+PR+W+ P NLLV+ EELGGD SK+S++ ++
Sbjct: 678 GTYRQNKCQSGCGEPTQRWYHVPRSWLKPKGNLLVLFEELGGDISKVSVVKRS 730
>gi|30687121|ref|NP_849553.1| beta-galactosidase 12 [Arabidopsis thaliana]
gi|75265630|sp|Q9SCV0.1|BGL12_ARATH RecName: Full=Beta-galactosidase 12; Short=Lactase 12; Flags:
Precursor
gi|6686896|emb|CAB64748.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332659762|gb|AEE85162.1| beta-galactosidase 12 [Arabidopsis thaliana]
Length = 728
Score = 782 bits (2019), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/717 (51%), Positives = 508/717 (70%), Gaps = 21/717 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ A VTYD +A++I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN H
Sbjct: 25 VKAIVTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGH 84
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP GQYYFE R+DLV+F+K VQ+AGL++HLRIGPY CAEWN+GGFPVWL ++PG+ FRT
Sbjct: 85 EPSPGQYYFEDRYDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGMVFRT 144
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M++F KI+ +MK+E LF +QGGPIIL+Q+ENEYG +EW G G+ Y KW
Sbjct: 145 DNEPFKAAMQKFTEKIVRMMKEEKLFETQGGPIILSQIENEYGPIEWEIGAPGKAYTKWV 204
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A+ A L+T VPW+MC+Q+DAP+ IINTCNGFYC+ F PNS +KP MWTEN++GWF FG
Sbjct: 205 AEMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNSDNKPKMWTENWTGWFTEFG 264
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
AVP+RP ED+A +VARF + GG+F NYYMY GGTNF RTA G +ATSYDYDAP+DEYG
Sbjct: 265 GAVPYRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDRTA-GEFIATSYDYDAPLDEYG 323
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
R+PK+ HL+ LHK IKLCE L+S+DPT LG K EAH++ KS + CAAFL+NY++S
Sbjct: 324 LPREPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQEAHVF-KSKSSCAAFLSNYNTS 382
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V F G+ Y LP WSVSILPDCK +NTAKV R + H +++ ++
Sbjct: 383 SAARVLFGGSTYDLPPWSVSILPDCKTEYYNTAKV-QVRTSSIH--------MKMVPTNT 433
Query: 421 AFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ----GKEVF 473
FSW Y E++ + N +F + L EQI+ T+D +DY WY I + P + G++
Sbjct: 434 PFSWGSYNEEIPSANDNGTFSQDGLVEQISITRDKTDYFWYLTDITISPDEKFLTGEDPL 493
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L I S GHA VFVN +L YG+ + ++KI+L+ G+N L +LS GL N G
Sbjct: 494 LTIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVNKLALLSTAAGLPNVGV 553
Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
++ G+ V L + +G D++ +W Y++G +GE + + ++ +++ WK+GS +
Sbjct: 554 HYETWNTGVLGPVTLNGVNSGTWDMTKWKWSYKIGTKGEALSVHTLAGSSTVEWKEGSLV 613
Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
+ L WYK+TF +P G PLAL++ +MGKGQ W+NGQ+IGR+W AY A G ++C
Sbjct: 614 AKKQPLTWYKSTFDSPTGNEPLALDMNTMGKGQMWINGQNIGRHWPAYTA--RGKCERCS 671
Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
Y G++ KC +CG+ +Q YH+PR+W+ P NL+++ EE GG+P+ ISL+ +T +
Sbjct: 672 YAGTFTEKKCLSNCGEASQRWYHVPRSWLKPTNNLVIVLEEWGGEPNGISLVKRTAK 728
>gi|297799386|ref|XP_002867577.1| beta-galactosidase 12 [Arabidopsis lyrata subsp. lyrata]
gi|297313413|gb|EFH43836.1| beta-galactosidase 12 [Arabidopsis lyrata subsp. lyrata]
Length = 728
Score = 781 bits (2018), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/717 (51%), Positives = 506/717 (70%), Gaps = 21/717 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ A VTYD +A++I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN H
Sbjct: 25 VKAMVTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGH 84
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP GQYYFE R+DLV+F+K VQ+AGL++HLRIGPY CAEWN+GGFPVWL ++P + FRT
Sbjct: 85 EPSPGQYYFEDRYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPDMVFRT 144
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M++F KI+ +MK+E LF +QGGPIIL+Q+ENEYG +EW G G+ Y KW
Sbjct: 145 DNEPFKAAMQKFTEKIVGMMKEEKLFETQGGPIILSQIENEYGPIEWEIGAPGKAYTKWV 204
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A L+T VPW+MC+Q+DAP+ IINTCNGFYC+ F PNS KP MWTEN++GWF FG
Sbjct: 205 AKMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNSDKKPKMWTENWTGWFTEFG 264
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
AVP+RP ED+A +VARF + GG+F NYYMY GGTNF RTA G +ATSYDYDAP+DEYG
Sbjct: 265 GAVPYRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDRTA-GEFIATSYDYDAPLDEYG 323
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
R+PK+ HL+ LHK IKLCE L+S+DPT LG K EA ++ KS + CAAFL+NY++S
Sbjct: 324 LPREPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQEAQVF-KSQSSCAAFLSNYNTS 382
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V+F G+ Y LP WSVSILPDCK +NTAKV R + H +++ ++
Sbjct: 383 SAARVSFGGSTYDLPPWSVSILPDCKTEYYNTAKV-QVRTSSIH--------MKMVPTNT 433
Query: 421 AFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ----GKEVF 473
FSW Y E++ + N +F + L EQI+ T+D +DY WY I + P + G++
Sbjct: 434 LFSWGSYNEEIPSANDNGTFSQDGLVEQISITRDKTDYFWYLTDITISPDEKFLTGEDPL 493
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
LNI S GHA VFVN +L YG+ + ++KI+L+ G+N L +LS+ GL N G
Sbjct: 494 LNIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVNKLALLSIAAGLPNVGV 553
Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
++ G+ V L + +G D+S +W Y++G +GE + + ++ +++ WKQGS +
Sbjct: 554 HYETWNTGVLGPVTLKGVNSGTWDMSQWKWSYKIGTKGEALSIHTVTGSSTVEWKQGSLV 613
Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
+ L WYK+TF P G PLAL++ +MGKGQ W+NGQ+IGR+W AY A G ++C
Sbjct: 614 ATKQPLTWYKSTFDTPAGNEPLALDMNTMGKGQTWINGQNIGRHWPAYTA--RGKCERCS 671
Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
Y G++ +KC +CG+ +Q YH+PR+W+ P NL+V+ EE GG+P+ ISL+ + +
Sbjct: 672 YAGTFTENKCLSNCGEASQRWYHVPRSWLKPTNNLVVVLEEWGGEPNGISLVKRRAK 728
>gi|356509960|ref|XP_003523710.1| PREDICTED: beta-galactosidase 3-like isoform 1 [Glycine max]
Length = 736
Score = 781 bits (2016), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/716 (53%), Positives = 489/716 (68%), Gaps = 25/716 (3%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NVTYD ++L+I+G+RR+L SGSIHYPRSTPE+W +LI K+K GGL+VI+TYVFW+ HEP
Sbjct: 29 NVTYDRKSLLINGQRRILISGSIHYPRSTPEMWEDLIWKAKHGGLDVIDTYVFWDVHEPS 88
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
G Y FEGR+DLVRF+KTVQ+ GL+ +LRIGPY CAEWN+GG PVWL ++PG+ FRT N
Sbjct: 89 PGNYDFEGRYDLVRFIKTVQKVGLYANLRIGPYVCAEWNFGGIPVWLKYVPGVSFRTDNE 148
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK M+ F KI+ +MK E LF SQGGPIIL+Q+ENEYG + G G YV WAA
Sbjct: 149 PFKAAMQGFTQKIVQMMKSEKLFQSQGGPIILSQIENEYGPE--SRGAAGRAYVNWAASM 206
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
AV L T VPWVMC++ DAPDP+IN+CNGFYCD F+PN P KP MWTE +SGWF FG +
Sbjct: 207 AVGLGTGVPWVMCKENDAPDPVINSCNGFYCDDFSPNKPYKPSMWTETWSGWFTEFGGPI 266
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
RPVEDL+FAVARF + GG++ NYYMY GGTNFGR+AGGP + TSYDYDAPIDEYG IR
Sbjct: 267 HQRPVEDLSFAVARFIQKGGSYVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGLIR 326
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
QPK+ HL+ELHKAIK CE L+S DPT LG L+AH++ + CAAFLANY++ S A
Sbjct: 327 QPKYSHLKELHKAIKRCEHALVSLDPTVLSLGTLLQAHVFSSGTGTCAAFLANYNAQSAA 386
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
VTFN Y LP WS+SILPDCK VFNTAKV Q V L + FS
Sbjct: 387 TVTFNNRHYDLPPWSISILPDCKIDVFNTAKV----------RVQPSQVKMLPVKPKLFS 436
Query: 424 W--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
W Y+E + ++ + P L EQ+N T+DTSDYLWY S+ + + G++ +N
Sbjct: 437 WESYDEDLSSLAESSRITAPGLLEQLNVTRDTSDYLWYITSVDISSSESFLRGGQKPSIN 496
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
++S GHA VFVN + +G + + N ++L G N + +LS+ VGLQN G +
Sbjct: 497 VQSAGHAVHVFVNGQFSGSAFGTREQRSCTYNGPVDLRAGANKIALLSVTVGLQNVGRHY 556
Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+ AG+ V+L L G++DL+ +W Y+VG+ GE + L + +S W Q S
Sbjct: 557 ETWEAGITGPVLLHGLDQGQKDLTWNKWSYKVGLRGEAMNLVSPNGVSSVDWVQESQATQ 616
Query: 595 NKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
++S L WYK F AP GK PLAL+L SMGKGQ W+NGQSIGRYW AY + G C Y
Sbjct: 617 SRSQLKWYKAYFDAPGGKEPLALDLESMGKGQVWINGQSIGRYWMAY---AKGDCNSCTY 673
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
G++ KCQ CGQP Q YH+PR+W+ P +NL+V+ EELGG+P KISL+ +
Sbjct: 674 SGTFRPVKCQLGCGQPTQRWYHVPRSWLKPTKNLIVVFEELGGNPWKISLVKRVAH 729
>gi|334305536|gb|AEG76892.1| putative beta-galactosidase [Linum usitatissimum]
gi|334305538|gb|AEG76893.1| putative beta-galactosidase [Linum usitatissimum]
Length = 731
Score = 780 bits (2015), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/716 (52%), Positives = 494/716 (68%), Gaps = 24/716 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
++A VTYD +A++++G+RR+L +GSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN H
Sbjct: 27 VTATVTYDGKAIIVNGQRRILIAGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGH 86
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G YYFE RFDLV+FVK VQ+AGL+++LRIGPYACAEWN+GGFPVWL ++PG+ FRT
Sbjct: 87 EPSPGNYYFEDRFDLVKFVKVVQQAGLYVNLRIGPYACAEWNFGGFPVWLKYVPGMSFRT 146
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M++F KI+++MKQE LF QGGPIIL+Q+ENEYG +EW G+ Y +WA
Sbjct: 147 DNEPFKAAMQKFTEKIVNMMKQEQLFEPQGGPIILSQIENEYGPIEWELKAPGKAYAQWA 206
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV LNT VPW+ C+QEDAPDP+I+TCN +YC+ FTPN KP MWTE ++ WF S+G
Sbjct: 207 AQMAVGLNTGVPWIACKQEDAPDPLIDTCNAYYCEKFTPNKSYKPKMWTEAWTAWFTSWG 266
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
V +RP ED AF+V +F ++GG++ NYYMY GGTNFGRTAGGP VATSYDYDAP+DEYG
Sbjct: 267 NPVLYRPAEDQAFSVLKFIQSGGSYANYYMYHGGTNFGRTAGGPFVATSYDYDAPLDEYG 326
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
PK+ HL+ +HKAIK E+ L+S+D T LG EAH+Y SS+ CAAFLANYD S
Sbjct: 327 LTNDPKYTHLKHMHKAIKQSEKALVSADATVTSLGTNQEAHVY-SSSSGCAAFLANYDVS 385
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
V F Y LPAWS+SILPDCK V+NTAKV++ R V++ +
Sbjct: 386 YSVKVNFGSGQYDLPAWSISILPDCKTEVYNTAKVLAPR------------VHKKMTPLG 433
Query: 421 AFSW--YEEKVGISGNRSFVRPD-LAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
F+W Y ++V D L EQ+ TKD+SDYLWY + + + GK+
Sbjct: 434 GFTWDSYIDEVASGFASDTTTEDGLWEQLYMTKDSSDYLWYMQDVKIGSDEAFLTNGKDP 493
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
FLN++S GH VFVN KL+ YG++D ++ ++LN G+N + +LS VGL N G
Sbjct: 494 FLNVQSAGHFLNVFVNGKLIGSAYGSNDNPKLTFSQSVKLNVGVNKIALLSASVGLANVG 553
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
F+ G+ V L L G D++ +W Y+VGV+GE + L+ ++ ++S W +GS
Sbjct: 554 LHFENYNVGVLGPVTLTGLNQGTVDMTKWKWSYKVGVQGEKLQLNTVAGSSSVEWVKGSM 613
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
L + L WYK+TF APEG P+AL++ SMGKGQ W+NGQ IGRYW AY A G C
Sbjct: 614 LAKKQPLTWYKSTFNAPEGNDPVALDMISMGKGQIWINGQGIGRYWPAYTA--QGNCGGC 671
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
Y G + KC CGQP Q YH+PR+W+ P NLLV+ EE GGDP+ IS++ +T
Sbjct: 672 SYGGYFTEKKCLTGCGQPTQRWYHVPRSWLKPTGNLLVVFEEWGGDPTGISMVKRT 727
>gi|308550950|gb|ADO34789.1| beta-galactosidase STBG4 [Solanum lycopersicum]
Length = 724
Score = 780 bits (2015), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/716 (52%), Positives = 496/716 (69%), Gaps = 22/716 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ A+V+YD RA++I+GKR++L SGSIHYPRSTP++WP+LI+K+K+GGL+VIETYVFWN H
Sbjct: 21 VKASVSYDDRAIIINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGH 80
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G+Y FEGR+DLVRF+K VQ AGL+++LRIGPY CAEWN+GGFPVWL ++PG++FRT
Sbjct: 81 EPSPGKYNFEGRYDLVRFIKMVQRAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGMEFRT 140
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M+ F+ KI+++MK ENLF SQGGPII+AQ+ENEYG VEW G G+ Y KWA
Sbjct: 141 NNQPFKVAMQGFVQKIVNMMKSENLFESQGGPIIMAQIENEYGPVEWEIGAPGKAYTKWA 200
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L T VPW+MC++EDAPDP+I+TCNGFYC+GF PN P KP MWTE ++GW+ FG
Sbjct: 201 AQMAVGLKTGVPWIMCKREDAPDPVIDTCNGFYCEGFRPNKPYKPKMWTEVWTGWYTKFG 260
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+P RP ED+AF+VARF + G+F NYYMY GGTNFGRT+ G +ATSYDYDAP+DEYG
Sbjct: 261 GPIPQRPAEDIAFSVARFVQNNGSFFNYYMYHGGTNFGRTSSGLFIATSYDYDAPLDEYG 320
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+ +PK+GHLR+LHKAIKL E L+SS LG+ EAH+Y S CAAFL+NYDS
Sbjct: 321 LLNEPKYGHLRDLHKAIKLSEPALVSSYAAVTSLGSNQEAHVYRSKSGACAAFLSNYDSR 380
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
VTF Y LP WS+SILPDCK V+NTA+V SQ ++ ++ A
Sbjct: 381 YSVKVTFQNRPYNLPPWSISILPDCKTAVYNTAQVNSQSSS-----------IKMTPAGG 429
Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
SW EE + + L EQ N T+D+SDYLWY ++++ + GK+
Sbjct: 430 GLSWQSYNEETPTADDSDTLTANGLWEQKNVTRDSSDYLWYMTNVNIASNEGFLRNGKDP 489
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
+L + S GH VFVN KL YG D + ++L GIN + +LS+ VGL N G
Sbjct: 490 YLTVMSAGHVLHVFVNGKLSGTVYGTLDNPKLTYSGNVKLRAGINKISLLSVSVGLPNVG 549
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
+D AG+ V L L G R+L+ +W Y+VG++GE + L +S ++S W +GS
Sbjct: 550 VHYDTWNAGVLGPVTLSGLNEGSRNLAKQKWSYKVGLKGESLSLHSLSGSSSVEWVRGSL 609
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
+ + L WYK TF AP G PLAL +ASMGKGQ W+NG+ +GR+W Y+A G KC
Sbjct: 610 VAQKQPLTWYKATFNAPGGNDPLALGMASMGKGQIWINGEGVGRHWPGYIA--QGDCSKC 667
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
Y G+++ KCQ +CGQP+Q +H+PR+W+ P NLLV+ EE GG+P+ ISL+ ++
Sbjct: 668 SYAGTFNEKKCQTNCGQPSQRWHHVPRSWLKPSGNLLVVFEEWGGNPTGISLVRRS 723
>gi|255550411|ref|XP_002516256.1| beta-galactosidase, putative [Ricinus communis]
gi|223544742|gb|EEF46258.1| beta-galactosidase, putative [Ricinus communis]
Length = 848
Score = 780 bits (2014), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 404/829 (48%), Positives = 523/829 (63%), Gaps = 41/829 (4%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V++D RA+ IDGKRRVL SGSIHYPRST E+WP+LI+KSKEGGL+ IETYVFWN HEP R
Sbjct: 47 VSHDGRAITIDGKRRVLISGSIHYPRSTAEMWPDLIKKSKEGGLDAIETYVFWNSHEPSR 106
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
QY F G DLVRF+KT+Q GL+ LRIGPY CAEWNYGGFP+WLH +PG + RT N+
Sbjct: 107 RQYDFSGNLDLVRFIKTIQAEGLYAVLRIGPYVCAEWNYGGFPMWLHNLPGCELRTANSV 166
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
F EM+ F + I+D+MK ENLFASQGGPIILAQVENEYGNV AYG G+ Y+ W ++ A
Sbjct: 167 FMNEMQNFTSLIVDMMKDENLFASQGGPIILAQVENEYGNVMSAYGAAGKTYIDWCSNMA 226
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
+L+ VPW+MCQQ DAP P+INTCNG+YCD FTPN+ + P MWTEN++GWF S+G P
Sbjct: 227 ESLDIGVPWIMCQQSDAPQPMINTCNGWYCDQFTPNNANSPKMWTENWTGWFKSWGGKDP 286
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
R ED+AFAVARFF+TGGTFQNYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG + Q
Sbjct: 287 HRTAEDVAFAVARFFQTGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYGNLNQ 346
Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
PKWGHL++LH + E L + + + A IY + + A F N + +SDA
Sbjct: 347 PKWGHLKQLHDILHSMEYTLTHGNISTIDYDNSVTATIY-ATDKESACFFGNANETSDAT 405
Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
+ F G Y +PAWSVSILPDC+NV +NTAKV +Q +QKN E +S +SW
Sbjct: 406 IVFKGTEYNVPAWSVSILPDCENVGYNTAKVKTQT----AIMVKQKNEAEDQPSSLKWSW 461
Query: 425 YEEK---VGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM---PGQGKEVFLNIES 478
E + G L +Q D SDYLWY S+H+ P ++ L +
Sbjct: 462 IPENTHTTSLLGKGHAHARQLIDQKAAANDASDYLWYMTSLHIKKDDPVWSSDMSLRVNG 521
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
GH +VN K + + + +++ K ++L G N + +LS VGLQNYG FD+
Sbjct: 522 SGHVLHAYVNGKHLGSQFAKYGVFSYVFEKSLKLRPGKNVISLLSATVGLQNYGPMFDLV 581
Query: 539 GAGLFSVILIDLKNGK----RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
G+ + I G +DLSS +W Y VG+ G + L + ++S W + LP
Sbjct: 582 QTGIPGPVEIIGHRGDEKVVKDLSSHKWSYSVGLNGFHNELYSSNSRHASRWVE-QDLPT 640
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGC-TKKCDY 653
NK +IWYKTTF AP GK P+ L+L MGKG AWVNG +IGRYW ++LA GC T+ CDY
Sbjct: 641 NKMMIWYKTTFKAPLGKDPVVLDLQGMGKGFAWVNGNNIGRYWPSFLAEEDGCSTEVCDY 700
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
RG+YD +KC +CG+P Q YH+PR++ + EN LV+ EE GG+P+ ++ T T +
Sbjct: 701 RGAYDNNKCVTNCGKPTQRWYHVPRSFFNDYENTLVLFEEFGGNPAGVNFQTVTVGKVSG 760
Query: 714 FVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
E + + L+C G I+AI FAS+G P+G G++ G C
Sbjct: 761 SAGEGE------------------TIELSC-NGKSISAIEFASFGDPQGTSGAYVKGTCE 801
Query: 774 --MDVLPIVQKACVGQIECSIPVSSAYLG-VSAGACPGLLKALAVEAHC 819
D IVQKACVG+ C + S G S G+ ++ LAV+A C
Sbjct: 802 GSNDAFSIVQKACVGKETCKLEASKDVFGPTSCGS--DVVNTLAVQATC 848
>gi|449452747|ref|XP_004144120.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 782
Score = 779 bits (2012), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/715 (53%), Positives = 489/715 (68%), Gaps = 24/715 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
S +VTYDH+A++I+G+RR+L SGSIHYPRSTP++WP+LI+K+K+GGL++IETYVFWN HE
Sbjct: 81 SRSVTYDHKAIIINGQRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDIIETYVFWNGHE 140
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G+YYFE R+DLVRF+K VQ+AGL++HLRIGPY CAEWNYGGFP+WL F+PGI FRT
Sbjct: 141 PSPGKYYFEERYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYGGFPLWLKFVPGIAFRTD 200
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK M++F+ KI+D+MK E LF +QGGPIIL+Q+ENEYG VEW G G+ Y KWAA
Sbjct: 201 NAPFKAAMQKFVYKIVDMMKWEKLFHTQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAA 260
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV L T VPWVMC+QEDAPDP+I+TCNGFYC+ F PN KP +WTEN+SGW+ +FG
Sbjct: 261 QMAVGLKTGVPWVMCKQEDAPDPLIDTCNGFYCENFKPNQIYKPKIWTENWSGWYTAFGG 320
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
P+RP ED+AF+VARF + GG+ NYYMY GGTNFGRT+ G V TSYD+DAPIDEYG
Sbjct: 321 PTPYRPPEDVAFSVARFIQNGGSLVNYYMYHGGTNFGRTS-GLFVTTSYDFDAPIDEYGL 379
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+R+PKWGHLR+LHKAIKLCE L+S+DPT LG EA ++ SS CAAFLANYD+S+
Sbjct: 380 LREPKWGHLRDLHKAIKLCEPALVSADPTSTWLGKNQEARVFKSSSGACAAFLANYDTSA 439
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
V F + Y LP WS+SILPDCK V FNT + K+ + S+
Sbjct: 440 FVRVNFWNHPYDLPPWSISILPDCKTVTFNTGSLQ----------IGVKSYEAKMTPISS 489
Query: 422 FSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVF 473
F W EE + + L EQ++ T DT+DYLWY SI + + G+
Sbjct: 490 FWWLSYKEEPASAYAQDTTTKDGLVEQVSVTWDTTDYLWYILSIRIDSTEGFLKSGQWPL 549
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L + S GH VF+N +L YG+ + +K + L +G+N L +LS+ VGL N G
Sbjct: 550 LTVNSAGHILHVFINGQLSGSVYGSLEDPRITFSKYVNLKQGVNKLSMLSVTVGLPNVGL 609
Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
FD AG+ V L L G RD+S +W Y+VG+ GE + L + +NS W +GS
Sbjct: 610 HFDTWNAGVLGPVTLKGLNEGTRDMSKYKWSYKVGLRGEILNLYSVKGSNSVQWMKGSFQ 669
Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
+ L WYKTTF P G PLAL+++SM KGQ WVNG+SIGRY+ Y+A G KC
Sbjct: 670 --KQPLTWYKTTFNTPAGNEPLALDMSSMSKGQIWVNGRSIGRYFPGYIA--RGKCNKCS 725
Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
Y G + KC +CG P+Q YHIPR W+ P NLL+I EE+GG+P ISL+ +T
Sbjct: 726 YTGFFTEKKCLWNCGGPSQKWYHIPRDWLSPNGNLLIILEEIGGNPQGISLVKRT 780
>gi|4538943|emb|CAB39679.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|7269465|emb|CAB79469.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 729
Score = 779 bits (2011), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/718 (51%), Positives = 508/718 (70%), Gaps = 22/718 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ A VTYD +A++I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN H
Sbjct: 25 VKAIVTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGH 84
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP GQYYFE R+DLV+F+K VQ+AGL++HLRIGPY CAEWN+GGFPVWL ++PG+ FRT
Sbjct: 85 EPSPGQYYFEDRYDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGMVFRT 144
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M++F KI+ +MK+E LF +QGGPIIL+Q+ENEYG +EW G G+ Y KW
Sbjct: 145 DNEPFKAAMQKFTEKIVRMMKEEKLFETQGGPIILSQIENEYGPIEWEIGAPGKAYTKWV 204
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A+ A L+T VPW+MC+Q+DAP+ IINTCNGFYC+ F PNS +KP MWTEN++GWF FG
Sbjct: 205 AEMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNSDNKPKMWTENWTGWFTEFG 264
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
AVP+RP ED+A +VARF + GG+F NYYMY GGTNF RTA G +ATSYDYDAP+DEYG
Sbjct: 265 GAVPYRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDRTA-GEFIATSYDYDAPLDEYG 323
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
R+PK+ HL+ LHK IKLCE L+S+DPT LG K EAH++ KS + CAAFL+NY++S
Sbjct: 324 LPREPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQEAHVF-KSKSSCAAFLSNYNTS 382
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V F G+ Y LP WSVSILPDCK +NTAKV R + H +++ ++
Sbjct: 383 SAARVLFGGSTYDLPPWSVSILPDCKTEYYNTAKV-QVRTSSIH--------MKMVPTNT 433
Query: 421 AFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ----GKEVF 473
FSW Y E++ + N +F + L EQI+ T+D +DY WY I + P + G++
Sbjct: 434 PFSWGSYNEEIPSANDNGTFSQDGLVEQISITRDKTDYFWYLTDITISPDEKFLTGEDPL 493
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L I S GHA VFVN +L YG+ + ++KI+L+ G+N L +LS GL N G
Sbjct: 494 LTIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVNKLALLSTAAGLPNVGV 553
Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIY-QVGVEGEYIGLDKISLANSSFWKQGST 591
++ G+ V L + +G D++ +W Y Q+G +GE + + ++ +++ WK+GS
Sbjct: 554 HYETWNTGVLGPVTLNGVNSGTWDMTKWKWSYKQIGTKGEALSVHTLAGSSTVEWKEGSL 613
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
+ + L WYK+TF +P G PLAL++ +MGKGQ W+NGQ+IGR+W AY A G ++C
Sbjct: 614 VAKKQPLTWYKSTFDSPTGNEPLALDMNTMGKGQMWINGQNIGRHWPAYTA--RGKCERC 671
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
Y G++ KC +CG+ +Q YH+PR+W+ P NL+++ EE GG+P+ ISL+ +T +
Sbjct: 672 SYAGTFTEKKCLSNCGEASQRWYHVPRSWLKPTNNLVIVLEEWGGEPNGISLVKRTAK 729
>gi|61162199|dbj|BAD91081.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 725
Score = 778 bits (2008), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/712 (52%), Positives = 494/712 (69%), Gaps = 17/712 (2%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
SA+V YDH+A++I+G+RR+L SGSIHYPRSTP +WP+LI+K+K GGL+VI+TYVFWN HE
Sbjct: 23 SASVGYDHKAIIINGQRRILISGSIHYPRSTPGMWPDLIQKAKAGGLDVIQTYVFWNGHE 82
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G+YYFE R+DLV+F+K VQ+AGLF++LRIGPY CAEWN+GGFP+WL ++PGI FRT
Sbjct: 83 PSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGFPIWLKYVPGIAFRTD 142
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK M++F KI+++MK E LF +QGGPIIL+Q+ENE+G VEW G G+ Y KWAA
Sbjct: 143 NEPFKAAMQKFTEKIVNMMKAEKLFQTQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAA 202
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV L+T VPW+MC+QEDAPDP+I+TCNG+YC+ F PN KP MWTE ++GW+ FG
Sbjct: 203 QMAVGLDTGVPWIMCKQEDAPDPVIDTCNGYYCENFKPNKVYKPKMWTEVWTGWYTEFGG 262
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
A+P RP EDLAF+VARF ++GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 263 AIPTRPAEDLAFSVARFIQSGGSFFNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGL 322
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
++QPKWGHLR+LHKAIK CE L++ DP+ KLG EAH+++ S + CAAFLAN+D+
Sbjct: 323 LQQPKWGHLRDLHKAIKSCEHALVAVDPSVTKLGNNQEAHVFN-SKSGCAAFLANHDTKY 381
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
V+F Y LP WS+SILPDCK VFNTAKV + + Q K V L S
Sbjct: 382 SVRVSFGHGQYDLPPWSISILPDCKTAVFNTAKVAWKASE-----VQMKPVYSRLPWQSF 436
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
E + L EQI T+D +DYLWY I + + GK L I
Sbjct: 437 IE---ETTTSDETGTTTLDGLYEQIYMTRDATDYLWYMTDITIGSDEAFLKNGKFPLLTI 493
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
S GHA VF+N +L YG+ + ++ ++L GIN L +LS+ VGL N G F+
Sbjct: 494 FSAGHALHVFINGQLSGTVYGSLENPKLTFSQNVKLRPGINKLALLSISVGLPNVGTHFE 553
Query: 537 VAGAGLFSVI-LIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
G+ I L L G D+S +W Y++G++GE +GL ++ ++S W +G ++
Sbjct: 554 TWNTGVLGPISLKGLNTGTWDMSRWKWTYKIGMKGESLGLHTVTGSSSVDWAEGPSMAQK 613
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
+ L WYK TF AP G PLAL++ SMGKGQ W+NGQS+GR+W Y+A G C Y G
Sbjct: 614 QPLTWYKATFDAPPGHAPLALDMGSMGKGQIWINGQSVGRHWPGYIA--QGSCGNCYYAG 671
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
+++ KC+ +CG+P+Q YHIPR+W+ P NLLV+ EE GGDPS +SL+ +
Sbjct: 672 TFNDKKCRTYCGKPSQRWYHIPRSWLTPTGNLLVVFEEWGGDPSWMSLVERV 723
>gi|357438127|ref|XP_003589339.1| Beta-galactosidase [Medicago truncatula]
gi|355478387|gb|AES59590.1| Beta-galactosidase [Medicago truncatula]
Length = 745
Score = 777 bits (2007), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/717 (53%), Positives = 493/717 (68%), Gaps = 26/717 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ VTYD +A++I+G+RR+L SGSIHYPRSTPE+W +LI+K+K+GGL+VI+TYVFWN H
Sbjct: 25 IHCTVTYDRKAIIINGQRRILISGSIHYPRSTPEMWEDLIQKAKDGGLDVIDTYVFWNVH 84
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G Y FEGR+DLV+F+KTVQ+ GL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 85 EPSPGNYNFEGRYDLVQFIKTVQKKGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 144
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M+ F KI+ +MK E LF SQGGPIIL+Q+ENEYG A G G Y WA
Sbjct: 145 DNGPFKAAMQGFTQKIVQMMKNEKLFQSQGGPIILSQIENEYGPQGRALGASGHAYSNWA 204
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L T VPWVMC+++DAPDP+IN CNGFYCD F+PN P KP +WTE++SGWF FG
Sbjct: 205 AKMAVGLGTGVPWVMCKEDDAPDPVINACNGFYCDDFSPNKPYKPKLWTESWSGWFSEFG 264
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ P RPVEDLAFAVARF + GG+F NYYMY GGTNFGR+AGGP + TSYDYDAPIDEYG
Sbjct: 265 GSNPQRPVEDLAFAVARFIQKGGSFFNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYG 324
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+R+PK+GHL++LHKAIK CE L+SSDPT LGA +AH++ S CAAFLANY S+
Sbjct: 325 LLREPKYGHLKDLHKAIKQCEHALVSSDPTVTSLGAYEQAHVF-SSGTTCAAFLANYHSN 383
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A VTFN Y LP WS+SILPDC+ VFNTA++ Q + L S
Sbjct: 384 SAARVTFNNRHYDLPPWSISILPDCRTDVFNTARM----------RFQPSQIQMLPSNSK 433
Query: 421 AFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV------MPGQGKE 471
SW Y+E V ++ + L EQI+ T+DTSDYLWY S+ + + G+ K
Sbjct: 434 LLSWETYDEDVSSLAESSRITASRLLEQIDATRDTSDYLWYITSVDISSSESFLRGRNKP 493
Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
+++ S G A VF+N K +G + +F N I+L G N + +LS+ VGL N
Sbjct: 494 S-ISVHSSGDAVHVFINGKFSGSAFGTREDRSFTFNGPIDLRAGTNKIALLSVAVGLPNG 552
Query: 532 GAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
G F+ +G+ V+L DL +G++DL+ +W YQVG++GE + L + +S W S
Sbjct: 553 GIHFESWKSGITGPVLLHDLDHGQKDLTGQKWSYQVGLKGEAMNLVSPNGVSSVDWVSES 612
Query: 591 TLPVNK-SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
N+ L W+K F AP G PLAL+++SMGKGQ W+NGQSIGRYW Y + G
Sbjct: 613 LASQNQPQLKWHKAHFNAPNGVEPLALDMSSMGKGQVWINGQSIGRYWMVY---AKGNCN 669
Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
C+Y G+Y +KCQ CGQP Q YH+PR+W+ P NL+V+ EELGG+P KISL+ +
Sbjct: 670 SCNYAGTYRQAKCQVGCGQPTQRWYHVPRSWLKPKNNLMVVFEELGGNPWKISLVKR 726
>gi|3860321|emb|CAA10128.1| beta-galactosidase [Cicer arietinum]
Length = 745
Score = 777 bits (2006), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/717 (53%), Positives = 490/717 (68%), Gaps = 23/717 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ +VTYD +A++I+G+RR+L SGSIHYPRSTPE+W +LI+K+K GGL+VI+TYVFWN H
Sbjct: 24 IHCSVTYDRKAIIINGQRRILISGSIHYPRSTPEMWEDLIQKAKVGGLDVIDTYVFWNVH 83
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP Y FEGR+DLVRF+KTVQ+ GL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 84 EPSPSNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 143
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M+ F KI+ +MK E LF SQGGPIIL+Q+ENEYG A G G Y WA
Sbjct: 144 DNGPFKAAMQGFTQKIVQMMKNEKLFQSQGGPIILSQIENEYGPQGRALGAVGHAYSNWA 203
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L T VPWVMC+++DAPDP+IN+CNGFYCD F+PN P KP +WTE++SGWF FG
Sbjct: 204 AKMAVGLGTGVPWVMCKEDDAPDPVINSCNGFYCDDFSPNKPYKPKLWTESWSGWFSEFG 263
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
VP RP +DLAFAVARF + GG+F NYYMY GGTNFGR+AGGP + TSYDYDAPIDEYG
Sbjct: 264 GPVPQRPAQDLAFAVARFIQKGGSFFNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYG 323
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+R+PK+GHL++LHKAIK CE L+SSDPT LGA +AH++ + CAAFLANY S+
Sbjct: 324 LLREPKYGHLKDLHKAIKQCEHALVSSDPTVTSLGAYEQAHVFSSGTQTCAAFLANYHSN 383
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A VTFN Y LP WS+SILPDCK VFNTA+V Q + L S
Sbjct: 384 SAARVTFNNRHYDLPPWSISILPDCKTDVFNTARV----------RFQNSKIQMLPSNSK 433
Query: 421 AFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
SW Y+E V ++ + L EQIN T+DTSDYLWY S+ + P + G +
Sbjct: 434 LLSWETYDEDVSSLAESSRITASGLLEQINATRDTSDYLWYITSVDISPSESFLRGGNKP 493
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
+++ S G A VF+N K +G + + N I L+ G N + +LS+ VGL N G
Sbjct: 494 SISVHSSGDAVHVFINGKFSGSAFGTREQRSCTFNGPINLHAGTNKIALLSVAVGLPNGG 553
Query: 533 AWFDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
F+ G+ IL+ L +G++DL+ +W YQVG++GE + L + +S W + S
Sbjct: 554 IHFESWKTGITGPILLHGLDHGQKDLTWQKWSYQVGLKGEAMNLVSPNGVSSVDWVRESL 613
Query: 592 LPVNK-SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
N+ L W+K F AP+G LAL+++ MGKGQ W+NGQSIGRYW Y + G
Sbjct: 614 ASQNQPQLKWHKAYFNAPDGNEALALDMSGMGKGQVWINGQSIGRYWLVY---AKGNCNS 670
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
C+Y G+Y +KCQ CGQP Q YH+PR+W+ P NL+V+ EELGG+P KISL+ +T
Sbjct: 671 CNYAGTYRQAKCQLGCGQPTQRWYHVPRSWLKPTNNLMVVFEELGGNPWKISLVKRT 727
>gi|224077880|ref|XP_002305449.1| predicted protein [Populus trichocarpa]
gi|222848413|gb|EEE85960.1| predicted protein [Populus trichocarpa]
Length = 731
Score = 776 bits (2004), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/716 (52%), Positives = 486/716 (67%), Gaps = 23/716 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ NVTYD +AL+I+G+R+VL SGSIHYPRSTPE+W LI+K+K+GGL+VI+TYVFWN H
Sbjct: 24 IQCNVTYDKKALIINGQRKVLFSGSIHYPRSTPEMWEGLIQKAKDGGLDVIDTYVFWNLH 83
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G Y F+GR+DLVRF+K V EAGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 84 EPSPGNYNFDGRYDLVRFIKLVHEAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGISFRT 143
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M++F KI+ +MK ENLF SQGGPIIL+Q+ENEY A+G G Y+ WA
Sbjct: 144 DNEPFKSAMQKFTQKIVQMMKDENLFESQGGPIILSQIENEYEPESKAFGSPGHAYMTWA 203
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A++++T VPWVMC++ DAPDP+INTCNGFYCD F+PN P KP MWTE ++GWF FG
Sbjct: 204 AHMAISMDTGVPWVMCKEFDAPDPVINTCNGFYCDYFSPNKPYKPTMWTEAWTGWFTDFG 263
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
RP EDLAFAVARF + GG+ NYYMY GGTNFGRT+GGP + TSYDYDAPIDEYG
Sbjct: 264 GPNHQRPAEDLAFAVARFIQKGGSLVNYYMYHGGTNFGRTSGGPFITTSYDYDAPIDEYG 323
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IRQPK+GHL+ELHKAIKLCE+ L+++D T LG+ +AH++ S CAAFL+NY++
Sbjct: 324 LIRQPKYGHLKELHKAIKLCEKALLAADSTVTSLGSYEQAHVFSSDSGGCAAFLSNYNTK 383
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
A V FN Y LP WS+SILPDCKNVVFNTA H Q V+ L S
Sbjct: 384 QAARVKFNNIQYSLPPWSISILPDCKNVVFNTA----------HVGVQTSQVHMLPTDSE 433
Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
SW E+ + ++ L EQ+N T+DTSDYLWYT S+H+ + G+
Sbjct: 434 LLSWETFNEDISSVDDDKMITVAGLLEQLNITRDTSDYLWYTTSVHISSSESFLRGGRLP 493
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L ++S GHA VF+N +L +G + F + ++ + G N + +LS+ VGL N G
Sbjct: 494 VLTVQSAGHALHVFINGELSGSAHGTREQRRFTFTEDMKFHAGKNRISLLSVAVGLPNNG 553
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
F+ G+ V L L G+RDL+ +W Y+VG++GE + L + W QGS
Sbjct: 554 PRFETWNTGILGPVTLHGLDEGQRDLTWQKWSYKVGLKGEDMNLRSRKSVSLVDWIQGSL 613
Query: 592 LP-VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
+ + L WYK F +P+G PLAL++ SMGKGQ W+NG SIGRYW+ Y + G
Sbjct: 614 MVGKQQPLTWYKAYFNSPKGDDPLALDMGSMGKGQVWINGHSIGRYWTLY---AEGNCSG 670
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
C Y ++ ++CQ CGQP Q YH+PR+W+ NLLV+ EE+GGD S+ISL+ +
Sbjct: 671 CSYSATFRPARCQLGCGQPTQKWYHVPRSWLKSTRNLLVLFEEIGGDASRISLVKR 726
>gi|84579371|dbj|BAE72074.1| pear beta-galactosidase2 [Pyrus communis]
Length = 725
Score = 776 bits (2003), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/712 (52%), Positives = 493/712 (69%), Gaps = 17/712 (2%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
SA+V YDH+A++I+G+RR+L SGSIHYPRSTP +WP+LI+K+K GGL+VI+TYVFWN HE
Sbjct: 23 SASVGYDHKAIIINGQRRILISGSIHYPRSTPGMWPDLIQKAKAGGLDVIQTYVFWNGHE 82
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G+YYFE R+DLV+F+K VQ+AGLF++LRIGPY CAEWN+GGFP+WL ++PGI FRT
Sbjct: 83 PSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGFPIWLKYVPGIAFRTD 142
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK M++F KI+++MK E LF +QGGPIIL+Q+ENE+G VEW G G+ Y KWAA
Sbjct: 143 NEPFKAAMQKFTEKIVNMMKAEKLFQTQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAA 202
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV L+T VPW+MC+QEDAPDP+I+TCNG+YC+ F PN KP MWTE ++GW+ FG
Sbjct: 203 QMAVGLDTGVPWIMCKQEDAPDPVIDTCNGYYCENFKPNKVYKPKMWTEVWTGWYTEFGG 262
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
A+P RP EDLAF+VARF ++GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 263 AIPTRPAEDLAFSVARFIQSGGSFFNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGL 322
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
++QPKWGHLR+LHKAIK CE L++ DP+ KLG EAH+++ S + CAAFLANYD+
Sbjct: 323 LQQPKWGHLRDLHKAIKSCEHALVAVDPSVTKLGNNQEAHVFN-SKSGCAAFLANYDTKY 381
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
V+F Y LP WS+SILPDCK VFNTAKV + + Q K V L S
Sbjct: 382 SVRVSFGHGQYDLPPWSISILPDCKTAVFNTAKVAWKASE-----VQMKPVYSRLPWQSF 436
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
E + L EQI T+D +DYLWY I + + GK L I
Sbjct: 437 IE---ETTTSDETGTTTLDGLYEQIYMTRDATDYLWYMTDITIGSDEAFLKNGKFPLLTI 493
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
S GHA VF+N +L YG+ + ++ ++L GIN L +LS+ VGL N G F+
Sbjct: 494 FSAGHALHVFINGQLSGTVYGSLENPKLTFSQNVKLRPGINKLALLSISVGLPNVGTHFE 553
Query: 537 VAGAGLFSVI-LIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
G+ I L L G D+S +W Y++G++GE +GL ++ ++S W +G ++
Sbjct: 554 TWNTGVLGPISLKGLNTGTWDMSRWKWTYKIGMKGESLGLHTVTGSSSVDWAEGPSMAQK 613
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
+ L WYK TF AP G PLAL++ SMGKGQ W+NGQS+GR+W Y+A G C Y G
Sbjct: 614 QPLTWYKATFDAPPGHAPLALDMGSMGKGQIWINGQSVGRHWPGYIA--QGSCGNCYYAG 671
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
+++ KC+ +CG+P+Q HIPR+W+ P NLLV+ EE GGDPS +SL+ +
Sbjct: 672 TFNDKKCRTYCGKPSQRWCHIPRSWLTPTGNLLVVFEEWGGDPSWMSLVERV 723
>gi|356529081|ref|XP_003533125.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 832
Score = 776 bits (2003), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/828 (47%), Positives = 518/828 (62%), Gaps = 37/828 (4%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V+YD RA+ IDGKR+VL SGSIHYPRST E+WP LI K+KEGGL+VIETYVFWN HEP
Sbjct: 22 VSYDSRAITIDGKRKVLFSGSIHYPRSTAEMWPSLINKAKEGGLDVIETYVFWNAHEPQP 81
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
QY F G DLV+F+KT+Q+ GL+ LRIGPY CAEWNYGGFPVWLH +P ++FRT N
Sbjct: 82 RQYDFSGNLDLVKFIKTIQKEGLYAMLRIGPYVCAEWNYGGFPVWLHNMPNMEFRTNNTA 141
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
+ EM+ F I+D M+ ENLFASQGGPIILAQ+ENEYGN+ YG G+ YV+W A A
Sbjct: 142 YMNEMQTFTTLIVDKMRHENLFASQGGPIILAQIENEYGNIMSEYGENGKQYVQWCAQLA 201
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
+ VPWVMCQQ DAPDPIINTCNG+YCD F+PNS SKP MWTEN++GWF ++G +P
Sbjct: 202 ESYKIGVPWVMCQQSDAPDPIINTCNGWYCDQFSPNSKSKPKMWTENWTGWFKNWGGPIP 261
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
R D+A+AVARFF+ GGTFQNYYMY GGTNFGRT+GGP + TSYDYDAP+DEYG Q
Sbjct: 262 HRTARDVAYAVARFFQYGGTFQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNKNQ 321
Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
PKWGHL++LH+ +K E+ L H G L A +Y+ S A FL N +SS+DA
Sbjct: 322 PKWGHLKQLHELLKSMEDVLTQGTTNHTDYGNLLTATVYNYSGKS-ACFLGNANSSNDAT 380
Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQR-------NNGDHPFAQQKNVNELLL 417
+ F Y +PAWSVSILP+C N V+NTAK+ +Q N D+ +N +
Sbjct: 381 IMFQSTQYIVPAWSVSILPNCVNEVYNTAKINAQTSIMVMKDNKSDNEEEPHSTLNWQWM 440
Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIE 477
+ +V S +R + L +Q T DTSDYLWY S+ + + +
Sbjct: 441 HEPHVQMKDGQVLGSVSRKAAQ--LLDQKVVTNDTSDYLWYITSVDISENDPIWSKIRVS 498
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
+ GH VFVN + YG + +F KI+L +G N + +LS VGL NYGA F
Sbjct: 499 TNGHVLHVFVNGAQAGYQYGQNGKYSFTYEAKIKLKKGTNEISLLSGTVGLPNYGAHFSN 558
Query: 538 AGAGLFS-VILIDLKNGK---RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
G+ V L+ L+N +D+++ W Y+VG+ GE + L N+ W LP
Sbjct: 559 VSVGVCGPVQLVALQNNTEVVKDITNNTWNYKVGLHGEIVKL--YCPENNKGWNTNG-LP 615
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
N+ +WYKT F +P+G P+ ++L + KGQAWVNG +IGRYW+ YLA GCT C+Y
Sbjct: 616 TNRVFVWYKTLFKSPKGTDPVVVDLKGLKKGQAWVNGNNIGRYWTRYLADDNGCTATCNY 675
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWV-HPGENLLVIHEELGGDPSKISLLTKTGQHIC 712
RG Y + KC CG+P Q YH+PR+++ +N LV+ EE GG P+++ T + IC
Sbjct: 676 RGPYSSDKCITKCGRPTQRWYHVPRSFLRQDNQNTLVLFEEFGGHPNEVKFATVMVEKIC 735
Query: 713 SFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
+ +S++ N+ + L+C I+ I FAS+G+PEG CGSF+ C
Sbjct: 736 A----------NSYEGNV--------LELSCREEQVISKIKFASFGVPEGECGSFKKSQC 777
Query: 773 HM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
+ L I+ K+C+G+ CS+ VS LG + P LA+EA C
Sbjct: 778 ESPNALSILSKSCLGKQSCSVQVSQRMLGPTGCRMPQNQNKLAIEAVC 825
>gi|115468642|ref|NP_001057920.1| Os06g0573600 [Oryza sativa Japonica Group]
gi|75112285|sp|Q5Z7L0.1|BGAL9_ORYSJ RecName: Full=Beta-galactosidase 9; Short=Lactase 9; Flags:
Precursor
gi|54291174|dbj|BAD61846.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113595960|dbj|BAF19834.1| Os06g0573600 [Oryza sativa Japonica Group]
Length = 715
Score = 775 bits (2002), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/710 (53%), Positives = 489/710 (68%), Gaps = 26/710 (3%)
Query: 6 TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
TYDHR+L I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HEP++G
Sbjct: 23 TYDHRSLTINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQG 82
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
QYYF R+DLVRFVK V++AGL+++LRIGPY CAEWNYGGFPVWL ++PGI FRT N PF
Sbjct: 83 QYYFSDRYDLVRFVKLVKQAGLYVNLRIGPYVCAEWNYGGFPVWLKYVPGISFRTDNGPF 142
Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
K M+ F+ KI+ +MK E LF QGGPIILAQVENEYG +E G G + YV WAA AV
Sbjct: 143 KAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGSGAKSYVDWAAKMAV 202
Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
N VPW+MC+Q+DAPDP+INTCNGFYCD FTPNS +KP MWTE +SGWF +FG VP
Sbjct: 203 ATNAGVPWIMCKQDDAPDPVINTCNGFYCDDFTPNSKNKPSMWTEAWSGWFTAFGGTVPQ 262
Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQP 305
RPVEDLAFAVARF + GG+F NYYMY GGTNF RTAGGP +ATSYDYDAPIDEYG +RQP
Sbjct: 263 RPVEDLAFAVARFIQKGGSFINYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQP 322
Query: 306 KWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANV 365
KWGHL LHKAIK E L++ DPT Q +G +A+++ SS DCAAFL+N+ +S+ A V
Sbjct: 323 KWGHLTNLHKAIKQAETALVAGDPTVQNIGNYEKAYVFRSSSGDCAAFLSNFHTSAAARV 382
Query: 366 TFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW- 424
FNG Y LPAWS+S+LPDC+ V+NTA V + + + + F+W
Sbjct: 383 AFNGRRYDLPAWSISVLPDCRTAVYNTATVTAASSPAK------------MNPAGGFTWQ 430
Query: 425 -YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIES 478
Y E +F + L EQ++ T D SDYLWYT +++ G+ G+ L + S
Sbjct: 431 SYGEATNSLDETAFTKDGLVEQLSMTWDKSDYLWYTTYVNIDSGEQFLKSGQWPQLTVYS 490
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
GH+ VFVN + YG +D + +++ +G N + ILS VGL N G ++
Sbjct: 491 AGHSVQVFVNGQYFGNAYGGYDGPKLTYSGYVKMWQGSNKISILSSAVGLPNVGTHYETW 550
Query: 539 GAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
G+ V L L GKRDLS +W YQ+G++GE +G+ +S ++S W + +
Sbjct: 551 NIGVLGPVTLSGLNEGKRDLSKQKWTYQIGLKGEKLGVHSVSGSSSVEWGGAAG---KQP 607
Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
+ W++ F AP G P+AL+L SMGKGQAWVNG IGRYWS ++G C Y G+Y
Sbjct: 608 VTWHRAYFNAPAGGAPVALDLGSMGKGQAWVNGHLIGRYWS---YKASGNCGGCSYAGTY 664
Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
KCQ +CG +Q YH+PR+W++P NL+V+ EE GGD S ++L+T+T
Sbjct: 665 SEKKCQANCGDASQRWYHVPRSWLNPSGNLVVLLEEFGGDLSGVTLMTRT 714
>gi|356509962|ref|XP_003523711.1| PREDICTED: beta-galactosidase 3-like isoform 2 [Glycine max]
Length = 729
Score = 775 bits (2001), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/716 (53%), Positives = 487/716 (68%), Gaps = 32/716 (4%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NVTYD ++L+I+G+RR+L SGSIHYPRSTPE+W +LI K+K GGL+VI+TYVFW+ HEP
Sbjct: 29 NVTYDRKSLLINGQRRILISGSIHYPRSTPEMWEDLIWKAKHGGLDVIDTYVFWDVHEPS 88
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
G Y FEGR+DLVRF+KTVQ+ GL+ +LRIGPY CAEWN+GG PVWL ++PG+ FRT N
Sbjct: 89 PGNYDFEGRYDLVRFIKTVQKVGLYANLRIGPYVCAEWNFGGIPVWLKYVPGVSFRTDNE 148
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK M+ F KI+ +MK E LF SQGGPIIL+Q+ENEYG + G G YV WAA
Sbjct: 149 PFKAAMQGFTQKIVQMMKSEKLFQSQGGPIILSQIENEYGPE--SRGAAGRAYVNWAASM 206
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
AV L T VPWVMC++ DAPDP+IN+CNGFYCD F+PN P KP MWTE +SGWF FG +
Sbjct: 207 AVGLGTGVPWVMCKENDAPDPVINSCNGFYCDDFSPNKPYKPSMWTETWSGWFTEFGGPI 266
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
RPVEDL+FAVARF + GG++ NYYMY GGTNFGR+AGGP + TSYDYDAPIDEYG IR
Sbjct: 267 HQRPVEDLSFAVARFIQKGGSYVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGLIR 326
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
QPK+ HL+ELHKAIK CE L+S DPT LG L+AH++ + CAAFLANY++ S A
Sbjct: 327 QPKYSHLKELHKAIKRCEHALVSLDPTVLSLGTLLQAHVFSSGTGTCAAFLANYNAQSAA 386
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
VTFN Y LP WS+SILPDCK VFNTAK V L + FS
Sbjct: 387 TVTFNNRHYDLPPWSISILPDCKIDVFNTAK-----------------VKMLPVKPKLFS 429
Query: 424 W--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
W Y+E + ++ + P L EQ+N T+DTSDYLWY S+ + + G++ +N
Sbjct: 430 WESYDEDLSSLAESSRITAPGLLEQLNVTRDTSDYLWYITSVDISSSESFLRGGQKPSIN 489
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
++S GHA VFVN + +G + + N ++L G N + +LS+ VGLQN G +
Sbjct: 490 VQSAGHAVHVFVNGQFSGSAFGTREQRSCTYNGPVDLRAGANKIALLSVTVGLQNVGRHY 549
Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+ AG+ V+L L G++DL+ +W Y+VG+ GE + L + +S W Q S
Sbjct: 550 ETWEAGITGPVLLHGLDQGQKDLTWNKWSYKVGLRGEAMNLVSPNGVSSVDWVQESQATQ 609
Query: 595 NKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
++S L WYK F AP GK PLAL+L SMGKGQ W+NGQSIGRYW AY + G C Y
Sbjct: 610 SRSQLKWYKAYFDAPGGKEPLALDLESMGKGQVWINGQSIGRYWMAY---AKGDCNSCTY 666
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
G++ KCQ CGQP Q YH+PR+W+ P +NL+V+ EELGG+P KISL+ +
Sbjct: 667 SGTFRPVKCQLGCGQPTQRWYHVPRSWLKPTKNLIVVFEELGGNPWKISLVKRVAH 722
>gi|125555810|gb|EAZ01416.1| hypothetical protein OsI_23450 [Oryza sativa Indica Group]
Length = 717
Score = 775 bits (2000), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/710 (53%), Positives = 489/710 (68%), Gaps = 26/710 (3%)
Query: 6 TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
TYDHR+L I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HEP++G
Sbjct: 25 TYDHRSLTINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQG 84
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
QYYF R+DLVRFVK V++AGL+++LRIGPY CAEWNYGGFPVWL ++PGI FRT N PF
Sbjct: 85 QYYFSDRYDLVRFVKLVKQAGLYVNLRIGPYVCAEWNYGGFPVWLKYVPGISFRTDNGPF 144
Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
K M+ F+ KI+ +MK E LF QGGPIILAQVENEYG +E G G + YV WAA AV
Sbjct: 145 KAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGSGAKSYVDWAAKMAV 204
Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
N VPW+MC+Q+DAPDP+INTCNGFYCD FTPNS +KP MWTE +SGWF +FG VP
Sbjct: 205 ATNAGVPWIMCKQDDAPDPVINTCNGFYCDDFTPNSKNKPSMWTEAWSGWFTAFGGTVPQ 264
Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQP 305
RPVEDLAFAVARF + GG+F NYYMY GGTNF RTAGGP +ATSYDYDAPIDEYG +RQP
Sbjct: 265 RPVEDLAFAVARFIQKGGSFINYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQP 324
Query: 306 KWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANV 365
KWGHL LHKAIK E L++ DPT Q +G +A+++ SS DCAAFL+N+ +S+ A V
Sbjct: 325 KWGHLTNLHKAIKQAEPALVAGDPTVQNIGNYEKAYVFRSSSGDCAAFLSNFHTSAAARV 384
Query: 366 TFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW- 424
FNG Y LPAWS+S+LPDC+ V+NTA V + + + + F+W
Sbjct: 385 AFNGRRYDLPAWSISVLPDCRTAVYNTATVTAASSPAK------------MNPAGGFTWQ 432
Query: 425 -YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIES 478
Y E +F + L EQ++ T D SDYLWYT +++ G+ G+ L + S
Sbjct: 433 SYGEATNSLDETAFTKDGLVEQLSMTWDKSDYLWYTTYVNIDSGEQFLKSGQWPQLTVYS 492
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
GH+ VFVN + YG +D + +++ +G N + ILS VGL N G ++
Sbjct: 493 AGHSVQVFVNGQYFGNAYGGYDGPKLTYSGYVKMWQGSNKISILSSAVGLPNVGTHYETW 552
Query: 539 GAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
G+ V L L GKRDLS +W YQ+G++GE +G+ +S ++S W + +
Sbjct: 553 NIGVLGPVTLSGLNEGKRDLSKQKWTYQIGLKGEKLGVHSVSGSSSVEWGGAAG---KQP 609
Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
+ W++ F AP G P+AL+L SMGKGQAWVNG IGRYWS ++G C Y G+Y
Sbjct: 610 VTWHRAYFNAPAGGAPVALDLGSMGKGQAWVNGHLIGRYWS---YKASGNCGGCSYAGTY 666
Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
KCQ +CG +Q YH+PR+W++P NL+V+ EE GGD S ++L+T+T
Sbjct: 667 SEKKCQANCGDASQRWYHVPRSWLNPSGNLVVLLEEFGGDLSGVTLMTRT 716
>gi|186510990|ref|NP_190852.2| beta-galactosidase 2 [Arabidopsis thaliana]
gi|332278160|sp|Q9LFA6.2|BGAL2_ARATH RecName: Full=Beta-galactosidase 2; Short=Lactase 2; Flags:
Precursor
gi|13605857|gb|AAK32914.1|AF367327_1 AT3g52840/F8J2_10 [Arabidopsis thaliana]
gi|6686876|emb|CAB64738.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|23308221|gb|AAN18080.1| At3g52840/F8J2_10 [Arabidopsis thaliana]
gi|332645478|gb|AEE78999.1| beta-galactosidase 2 [Arabidopsis thaliana]
Length = 727
Score = 774 bits (1998), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/716 (52%), Positives = 493/716 (68%), Gaps = 24/716 (3%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A VTYDH+AL+I+G+RR+L SGSIHYPRSTPE+WP+LI+K+KEGGL+VI+TYVFWN HEP
Sbjct: 27 AVVTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKAKEGGLDVIQTYVFWNGHEP 86
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G YYF+ R+DLV+F K V +AGL+L LRIGPY CAEWN+GGFPVWL ++PG+ FRT N
Sbjct: 87 SPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNFGGFPVWLKYVPGMVFRTDN 146
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M++F KI+D+MK+E LF +QGGPIIL+Q+ENEYG ++W G G+ Y KW A+
Sbjct: 147 EPFKIAMQKFTKKIVDMMKEEKLFETQGGPIILSQIENEYGPMQWEMGAAGKAYSKWTAE 206
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
A+ L+T VPW+MC+QEDAP PII+TCNGFYC+GF PNS +KP +WTEN++GWF FG A
Sbjct: 207 MALGLSTGVPWIMCKQEDAPYPIIDTCNGFYCEGFKPNSDNKPKLWTENWTGWFTEFGGA 266
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
+P RPVED+AF+VARF + GG+F NYYMY+GGTNF RTA G +ATSYDYDAPIDEYG +
Sbjct: 267 IPNRPVEDIAFSVARFIQNGGSFMNYYMYYGGTNFDRTA-GVFIATSYDYDAPIDEYGLL 325
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
R+PK+ HL+ELHK IKLCE L+S DPT LG K E H++ KS CAAFL+NYD+SS
Sbjct: 326 REPKYSHLKELHKVIKLCEPALVSVDPTITSLGDKQEIHVF-KSKTSCAAFLSNYDTSSA 384
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A V F G Y LP WSVSILPDCK +NTAK+ + P K ++ S+ F
Sbjct: 385 ARVMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRA-------PTILMK----MIPTSTKF 433
Query: 423 SWYEEKVGISGNR---SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
SW G + +FV+ L EQI+ T+D +DY WY I + + G L
Sbjct: 434 SWESYNEGSPSSNEAGTFVKDGLVEQISMTRDKTDYFWYFTDITIGSDESFLKTGDNPLL 493
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
I S GHA VFVN L YG + ++ I+L+ GIN L +LS VGL N G
Sbjct: 494 TIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQNIKLSVGINKLALLSTAVGLPNAGVH 553
Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
++ G+ V L + +G D+S +W Y++G+ GE + L ++ +++ W +
Sbjct: 554 YETWNTGILGPVTLKGVNSGTWDMSKWKWSYKIGLRGEAMSLHTLAGSSAVKWWIKGFVV 613
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ L WYK++F P G PLAL++ +MGKGQ WVNG +IGR+W AY A G +C+Y
Sbjct: 614 KKQPLTWYKSSFDTPRGNEPLALDMNTMGKGQVWVNGHNIGRHWPAYTA--RGNCGRCNY 671
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
G Y+ KC HCG+P+Q YH+PR+W+ P NLLVI EE GGDPS ISL+ +T +
Sbjct: 672 AGIYNEKKCLSHCGEPSQRWYHVPRSWLKPFGNLLVIFEEWGGDPSGISLVKRTAK 727
>gi|356545784|ref|XP_003541315.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 826
Score = 774 bits (1998), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/832 (48%), Positives = 535/832 (64%), Gaps = 41/832 (4%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ V++D RA++IDGKRRVL SGSIHYPRSTPE+WPELI+K+KEGGL+ IETYVFWN HE
Sbjct: 22 AVEVSHDGRAIIIDGKRRVLLSGSIHYPRSTPEMWPELIQKAKEGGLDAIETYVFWNAHE 81
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P R Y F G D++RF+KT+QE+GL+ LRIGPY CAEWNYGG PVW+H +P ++ RT
Sbjct: 82 PSRRVYDFSGNNDIIRFLKTIQESGLYGVLRIGPYVCAEWNYGGIPVWVHNLPDVEIRTA 141
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N+ + EM+ F I+D++K+E LFASQGGPIIL Q+ENEYGNV YG G+ Y+ W A
Sbjct: 142 NSVYMNEMQNFTTLIVDMVKKEKLFASQGGPIILTQIENEYGNVISHYGDAGKAYMNWCA 201
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
+ A +LN VPW+MCQ+ DAP +INTCNGFYCD F PN+PS P MWTEN+ GWF ++G
Sbjct: 202 NMAESLNVGVPWIMCQESDAPQSMINTCNGFYCDNFEPNNPSSPKMWTENWVGWFKNWGG 261
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
P R ED+AFAVARFF+TGGTFQNYYMY GGTNF RTAGGP + TSYDYDAP+DEYG
Sbjct: 262 RDPHRTAEDVAFAVARFFQTGGTFQNYYMYHGGTNFDRTAGGPYITTSYDYDAPLDEYGN 321
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
I QPKWGHL+ELH +K EE L S + + G ++A IY ++ + FL++ ++++
Sbjct: 322 IAQPKWGHLKELHNVLKSMEETLTSGNVSETDFGNSVKATIY-ATNGSSSCFLSSTNTTT 380
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
DA +TF G Y +PAWSVSILPDC++ +NTAKV Q + ++ + E +
Sbjct: 381 DATLTFRGKNYTVPAWSVSILPDCEHEEYNTAKVNVQTS----VMVKENSKAEEEATALK 436
Query: 422 FSWYEEKV--GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM---PGQGKEVFLNI 476
+ W E + + G + L +Q + D SDYLWY +HV P G+ + L I
Sbjct: 437 WVWRSENIDNALHGKSNVSANRLLDQKDAANDASDYLWYMTKLHVKHDDPVWGENMTLRI 496
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
S GH FVN + + + + N KI+L G NT+ +LS+ VGLQNYGA+FD
Sbjct: 497 NSSGHVIHAFVNGEHIGSHWATYGIHNDKFEPKIKLKHGTNTISLLSVTVGLQNYGAFFD 556
Query: 537 VAGAGLFSVI-LIDLKNGK---RDLSSGEWIYQVGVEG--EYIGLDKISLANSSFWKQGS 590
AGL I L+ +K + ++LSS +W Y+VG+ G + D A + W +
Sbjct: 557 TWHAGLVEPIELVSVKGDETIIKNLSSNKWSYKVGLHGWDHKLFSDDSPFAAPNKW-ESE 615
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
LP ++ L WYKTTF AP G P+ ++L MGKG AWVNGQ+IGR W +Y A GC+ +
Sbjct: 616 KLPTDRMLTWYKTTFNAPLGTDPVVVDLQGMGKGYAWVNGQNIGRIWPSYNAEEDGCSDE 675
Query: 651 -CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
CDYRG Y SKC +CG+P Q YH+PR+++ G N LV+ ELGG+PS+++ T
Sbjct: 676 PCDYRGEYTDSKCVTNCGKPTQRWYHVPRSYLKDGANNLVLFAELGGNPSQVNFQTVVVG 735
Query: 710 HICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
+C+ E + + L+C+ G I+AI FAS+G PEG CG+F
Sbjct: 736 TVCANAYE------------------NKTLELSCQ-GRKISAIKFASFGDPEGVCGAFTN 776
Query: 770 GACH--MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G+C + L IVQKACVG+ CS VS G + AC + K LAVEA C
Sbjct: 777 GSCESKSNALSIVQKACVGKQACSFDVSEKTFGPT--ACGNVAKRLAVEAVC 826
>gi|75134155|sp|Q6Z6K4.1|BGAL4_ORYSJ RecName: Full=Beta-galactosidase 4; Short=Lactase 4; Flags:
Precursor
gi|46805855|dbj|BAD17189.1| putative beta-galactosidase precursor [Oryza sativa Japonica Group]
Length = 729
Score = 773 bits (1997), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/713 (53%), Positives = 489/713 (68%), Gaps = 26/713 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+A V+YD R+LVI+G+RR+L SGSIHYPRSTPE+WP LI+K+K+GGL+VI+TYVFWN HE
Sbjct: 35 NAAVSYDRRSLVINGRRRILLSGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHE 94
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P++GQYYF R+DLVRFVK V++AGL++HLRIGPY CAEWN+GGFPVWL ++PG+ FRT
Sbjct: 95 PVQGQYYFSDRYDLVRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVSFRTD 154
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK EM++F+ KI+ +MK E LF QGGPII++QVENE+G +E G G + Y WAA
Sbjct: 155 NGPFKAEMQKFVEKIVSMMKSEGLFEWQGGPIIMSQVENEFGPMESVGGSGAKPYANWAA 214
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV NT VPWVMC+Q+DAPDP+INTCNGFYCD F+PN KP MWTE ++GWF SFG
Sbjct: 215 KMAVGTNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKNYKPSMWTEAWTGWFTSFGG 274
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
VP RPVEDLAFAVARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDE+G
Sbjct: 275 GVPHRPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGL 334
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPKWGHLR+LH+AIK E L+S+DPT + +G+ +A+++ + CAAFL+NY ++
Sbjct: 335 LRQPKWGHLRDLHRAIKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNT 394
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
V FNG Y LPAWS+SILPDCK VFNTA V ++ + +
Sbjct: 395 AVKVRFNGQQYNLPAWSISILPDCKTAVFNTATV------------KEPTLMPKMNPVVR 442
Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ---GKEVFLNI 476
F+W Y E + +F + L EQ++ T D SDYLWYT +++ G+ L +
Sbjct: 443 FAWQSYSEDTNSLSDSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTNDLRSGQSPQLTV 502
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
S GH+ VFVN K YG +D N ++++ +G N + ILS VGL N G F+
Sbjct: 503 YSAGHSMQVFVNGKSYGSVYGGYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFE 562
Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFW-KQGSTLPV 594
G+ V L L G +DLS +W YQVG++GE +GL ++ +++ W G P
Sbjct: 563 NWNVGVLGPVTLSSLNGGTKDLSHQKWTYQVGLKGETLGLHTVTGSSAVEWGGPGGYQP- 621
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
L W+K F AP G P+AL++ SMGKGQ WVNG +GRYWS S GC C Y
Sbjct: 622 ---LTWHKAFFNAPAGNDPVALDMGSMGKGQLWVNGHHVGRYWS--YKASGGC-GGCSYA 675
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
G+Y KC+ +CG +Q YH+PR+W+ PG NLLV+ EE GGD + +SL T+T
Sbjct: 676 GTYHEDKCRSNCGDLSQRWYHVPRSWLKPGGNLLVVLEEYGGDLAGVSLATRT 728
>gi|152013361|sp|A2X2H7.1|BGAL4_ORYSI RecName: Full=Beta-galactosidase 4; Short=Lactase 4; Flags:
Precursor
gi|125538642|gb|EAY85037.1| hypothetical protein OsI_06394 [Oryza sativa Indica Group]
Length = 729
Score = 773 bits (1996), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/713 (53%), Positives = 489/713 (68%), Gaps = 26/713 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+A V+YD R+LVI+G+RR+L SGSIHYPRSTPE+WP LI+K+K+GGL+VI+TYVFWN HE
Sbjct: 35 NAAVSYDRRSLVINGRRRILLSGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHE 94
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P++GQYYF R+DLVRFVK V++AGL++HLRIGPY CAEWN+GGFPVWL ++PG+ FRT
Sbjct: 95 PVQGQYYFSDRYDLVRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVSFRTD 154
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK EM++F+ KI+ +MK E LF QGGPII++QVENE+G +E G G + Y WAA
Sbjct: 155 NGPFKAEMQKFVEKIVSMMKSEGLFEWQGGPIIMSQVENEFGPMESVGGSGAKPYANWAA 214
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV NT VPWVMC+Q+DAPDP+INTCNGFYCD F+PN KP MWTE ++GWF SFG
Sbjct: 215 KMAVRTNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKNYKPSMWTEAWTGWFTSFGG 274
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
VP RPVEDLAFAVARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDE+G
Sbjct: 275 GVPHRPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGL 334
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPKWGHLR+LH+AIK E L+S+DPT + +G+ +A+++ + CAAFL+NY ++
Sbjct: 335 LRQPKWGHLRDLHRAIKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNT 394
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
V FNG Y LPAWS+SILPDCK VFNTA V ++ + +
Sbjct: 395 AVKVRFNGQQYNLPAWSISILPDCKTAVFNTATV------------KEPTLMPKMNPVVR 442
Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ---GKEVFLNI 476
F+W Y E + +F + L EQ++ T D SDYLWYT +++ G+ L +
Sbjct: 443 FAWQSYSEDTNSLSDSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTNDLRSGQSPQLTV 502
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
S GH+ VFVN K YG +D N ++++ +G N + ILS VGL N G F+
Sbjct: 503 YSAGHSMQVFVNGKSYGSVYGGYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFE 562
Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFW-KQGSTLPV 594
G+ V L L G +DLS +W YQVG++GE +GL ++ +++ W G P
Sbjct: 563 NWNVGVLGPVTLSSLNGGTKDLSHQKWTYQVGLKGETLGLHTVTGSSAVEWGGPGGYQP- 621
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
L W+K F AP G P+AL++ SMGKGQ WVNG +GRYWS S GC C Y
Sbjct: 622 ---LTWHKAFFNAPAGNDPVALDMGSMGKGQLWVNGHHVGRYWS--YKASGGC-GGCSYA 675
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
G+Y KC+ +CG +Q YH+PR+W+ PG NLLV+ EE GGD + +SL T+T
Sbjct: 676 GTYHEDKCRSNCGDLSQRWYHVPRSWLKPGGNLLVVLEEYGGDLAGVSLATRT 728
>gi|449527779|ref|XP_004170887.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cucumis
sativus]
Length = 716
Score = 773 bits (1996), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/712 (53%), Positives = 488/712 (68%), Gaps = 26/712 (3%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD +A++I+ +RR+L SGSIHYPRSTP++WP+LI+K+K+GGL++IETYVFWN HEP
Sbjct: 22 VTYDEKAIIINDQRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDIIETYVFWNGHEPSE 81
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G+YYFE R+DLV F+K VQ+AGL++HLRIGPY CAEWNYGGFP+WL F+PGI FRT N P
Sbjct: 82 GKYYFEERYDLVGFIKLVQKAGLYVHLRIGPYVCAEWNYGGFPIWLKFVPGIAFRTDNEP 141
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK M++F+ KI+D+MK E L+ +QGGPIIL+Q+ENEYG VEW G G+ Y KW A A
Sbjct: 142 FKAAMQKFVTKIVDMMKLEKLYHTQGGPIILSQIENEYGPVEWQIGAPGKSYTKWFAQMA 201
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
V+L T VPWVMC+QEDAPDP+I+TCNGFYC+ F PN KP +WTEN+SGW+ +FG P
Sbjct: 202 VDLKTGVPWVMCKQEDAPDPLIDTCNGFYCENFKPNQIYKPKIWTENWSGWYTAFGGPTP 261
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
+RP ED+AF+VARF + G+ NYY+Y GGTNFGRT+ G +ATSYD+DAPIDEYG IR+
Sbjct: 262 YRPPEDVAFSVARFIQNNGSLVNYYVYHGGTNFGRTS-GLFIATSYDFDAPIDEYGLIRE 320
Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
PKWGHLR+LHKAIK CE L+S+DPT LG EA ++ KSS+ CAAFLANYD+S+
Sbjct: 321 PKWGHLRDLHKAIKSCEPALVSADPTITWLGKNQEARVF-KSSSACAAFLANYDTSASVK 379
Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
V F N Y LP WS+SILPDC V FNTA+V K+ ++ S+F W
Sbjct: 380 VNFWNNPYDLPPWSISILPDCXTVTFNTAQV------------GVKSYQAKMMPISSFGW 427
Query: 425 Y---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
EE + + L EQ++ T DT+DYLWY I + + GK L++
Sbjct: 428 LSYKEEPASAYAKDTTTKAGLVEQVSITWDTTDYLWYMQDISIDSTEGFLKSGKWPLLSV 487
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
S GH VF+N +L YG+ + +K ++L +G+N L +LS+ VGL N G FD
Sbjct: 488 NSAGHLLHVFINGQLSGSVYGSLEDPAITFSKNVDLKQGVNKLSMLSVTVGLPNVGLHFD 547
Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
AG+ V L L G RD+S +W Y+VG+ GE + L +NS W +GS L
Sbjct: 548 TWNAGVLGPVTLEGLNEGTRDMSKYKWSYKVGLSGESLNLYSDKGSNSVQWTKGS-LTQK 606
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
+ L WYKTTF P G PL L+++SM KGQ W+NGQSIGRY+ Y+A G KC Y G
Sbjct: 607 QPLTWYKTTFKTPAGNEPLGLDMSSMSKGQIWINGQSIGRYFPGYIA--NGKCDKCSYAG 664
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
+ KC +CG+P+Q YHIPR W+ P +NLLVI EE+GG P ISL+ +T
Sbjct: 665 LFTEKKCLGNCGEPSQKWYHIPRDWLSPSDNLLVIFEEIGGSPDGISLVKRT 716
>gi|168045683|ref|XP_001775306.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673387|gb|EDQ59911.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 831
Score = 772 bits (1993), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/834 (48%), Positives = 539/834 (64%), Gaps = 39/834 (4%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
++ V+YD RAL +DG RR+L SGSIHYPRSTP +WP LI K+K+GGL+VI+TYVFW+ H
Sbjct: 21 VAVTVSYDQRALKLDGNRRMLVSGSIHYPRSTPTMWPGLIAKAKKGGLDVIQTYVFWSGH 80
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP +G Y F GR+DL +F++ V EAG++++LRIGPY CAEWN+GGFP WL F+PGI+FRT
Sbjct: 81 EPTQGVYNFAGRYDLPKFLRLVHEAGMYVNLRIGPYVCAEWNFGGFPGWLRFLPGIEFRT 140
Query: 121 TNNPFKEEMKR-FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKW 179
N FK + F + +I + + F Q +I AQ+ENEYG+++ YG G+ Y+ W
Sbjct: 141 DNESFKVHLSHSFTSSLISVYSRS--FNIQ--LVICAQIENEYGSIDAVYGEAGQKYLNW 196
Query: 180 AADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSF 239
A+ AV N SVPW+MC Q DAP +I+TCNGFYCDGF PNS KP +WTEN++GWF S+
Sbjct: 197 IANMAVATNISVPWIMCNQPDAPPSVIDTCNGFYCDGFRPNSEGKPALWTENWTGWFQSW 256
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
G P RPV+D+AFAVARFF+ GG+F +YYMY GGTNF R+A V T+YDYDAPIDEY
Sbjct: 257 GEGAPTRPVQDIAFAVARFFQKGGSFMHYYMYHGGTNFERSAMEG-VTTNYDYDAPIDEY 315
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSD--PTHQKLGAKLEAHIYHKSSNDCAAFLANY 357
G +RQPKWGHL++LH A+KLCE L+ D P+ LG EAH+Y+ S+ CAAFLA++
Sbjct: 316 GDVRQPKWGHLKDLHAALKLCELCLVGVDTVPSEISLGPYQEAHVYNSSTGACAAFLASW 375
Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLL 417
+ D+ V F G Y LPAWSVSILPDCK+VVFNTAKV Q +
Sbjct: 376 -GTDDSTVLFQGQSYDLPAWSVSILPDCKSVVFNTAKV-----------GVQSMTMTMQS 423
Query: 418 ASSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGKE 471
A +W Y E + G+ +F +L EQI TTKDT+DYLWYT ++ V P +
Sbjct: 424 AIPVTNWVSYREPLEPWGS-TFSTNELVEQIATTKDTTDYLWYTTNVEVAESDAPNGLAQ 482
Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
L + L AA +FVNK L G ++ I L GIN++ +LSM GLQ
Sbjct: 483 ATLVMSYLRDAAHIFVNKWLT----GTKSAHGSEASQSISLRPGINSVKVLSMTTGLQGT 538
Query: 532 GAWFDVAGAGL-FSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
G + + AG+ F + + L +G + W YQVG++GE L + + + S+ W +
Sbjct: 539 GPFLEKEKAGIQFGIRVEGLPSGAIIMQRNTWTYQVGLQGENNRLFESNGSLSAVWSTST 598
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
+ SL W+KTTF PE G +AL+L+SMGKGQ WVNG ++GRYWS+ +A + GC
Sbjct: 599 DVSNQMSLSWFKTTFDMPERNGTVALDLSSMGKGQVWVNGINLGRYWSSCIAHTDGCVDN 658
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
CDYRGS+ SKC CGQP+Q+ YH+PR W+ +NLLV+ EE G+P I++ + QH
Sbjct: 659 CDYRGSHSESKCLTKCGQPSQSWYHVPREWLLSKQNLLVLFEEQEGNPEAITIAPRIPQH 718
Query: 711 ICSFVSEADPPPVD-SWKPNLGVVSSSPQV---RLACERGWHIAAINFASYGIPEGNCGS 766
ICS +SE+ P P+ S G +S+P + L C G HI+ I+FASYG P G+CG
Sbjct: 719 ICSRMSESHPFPIPLSSSTKRGSQTSTPPIAPLALECADGQHISRISFASYGTPSGDCGD 778
Query: 767 FRPGACHMDVLP-IVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
F+ +CH + ++ KACVG+ +C +P+ S+ G CPG++K+LA A C
Sbjct: 779 FKLSSCHANSSKDVLSKACVGRQKCLVPIVSSICG--GDPCPGMIKSLAATAEC 830
>gi|7529708|emb|CAB86888.1| beta-galactosidase precursor-like protein [Arabidopsis thaliana]
Length = 727
Score = 771 bits (1990), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/716 (52%), Positives = 492/716 (68%), Gaps = 24/716 (3%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A VTYDH+AL+I+G+RR+L SGSIHYPRSTPE+WP+LI+K+KEGGL+VI+TYVFWN HEP
Sbjct: 27 AVVTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKAKEGGLDVIQTYVFWNGHEP 86
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G YYF+ R+DLV+F K V +AGL+L LRIGPY CAEWN+GGFPVWL ++PG+ FRT N
Sbjct: 87 SPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNFGGFPVWLKYVPGMVFRTDN 146
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M++F KI+D+MK+E LF +QGGPIIL+Q+ENEYG ++W G G+ Y KW A+
Sbjct: 147 EPFKIAMQKFTKKIVDMMKEEKLFETQGGPIILSQIENEYGPMQWEMGAAGKAYSKWTAE 206
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
A+ L+T VPW+M +QEDAP PII+TCNGFYC+GF PNS +KP +WTEN++GWF FG A
Sbjct: 207 MALGLSTGVPWIMSKQEDAPYPIIDTCNGFYCEGFKPNSDNKPKLWTENWTGWFTEFGGA 266
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
+P RPVED+AF+VARF + GG+F NYYMY+GGTNF RTA G +ATSYDYDAPIDEYG +
Sbjct: 267 IPNRPVEDIAFSVARFIQNGGSFMNYYMYYGGTNFDRTA-GVFIATSYDYDAPIDEYGLL 325
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
R+PK+ HL+ELHK IKLCE L+S DPT LG K E H++ KS CAAFL+NYD+SS
Sbjct: 326 REPKYSHLKELHKVIKLCEPALVSVDPTITSLGDKQEIHVF-KSKTSCAAFLSNYDTSSA 384
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A V F G Y LP WSVSILPDCK +NTAK+ + P K ++ S+ F
Sbjct: 385 ARVMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRA-------PTILMK----MIPTSTKF 433
Query: 423 SWYEEKVGISGNR---SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
SW G + +FV+ L EQI+ T+D +DY WY I + + G L
Sbjct: 434 SWESYNEGSPSSNEAGTFVKDGLVEQISMTRDKTDYFWYFTDITIGSDESFLKTGDNPLL 493
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
I S GHA VFVN L YG + ++ I+L+ GIN L +LS VGL N G
Sbjct: 494 TIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQNIKLSVGINKLALLSTAVGLPNAGVH 553
Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
++ G+ V L + +G D+S +W Y++G+ GE + L ++ +++ W +
Sbjct: 554 YETWNTGILGPVTLKGVNSGTWDMSKWKWSYKIGLRGEAMSLHTLAGSSAVKWWIKGFVV 613
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ L WYK++F P G PLAL++ +MGKGQ WVNG +IGR+W AY A G +C+Y
Sbjct: 614 KKQPLTWYKSSFDTPRGNEPLALDMNTMGKGQVWVNGHNIGRHWPAYTA--RGNCGRCNY 671
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
G Y+ KC HCG+P+Q YH+PR+W+ P NLLVI EE GGDPS ISL+ +T +
Sbjct: 672 AGIYNEKKCLSHCGEPSQRWYHVPRSWLKPFGNLLVIFEEWGGDPSGISLVKRTAK 727
>gi|108707234|gb|ABF95029.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|108707235|gb|ABF95030.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 702
Score = 771 bits (1990), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/709 (54%), Positives = 482/709 (67%), Gaps = 24/709 (3%)
Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN 188
M+RF K++D MK L+ASQGGPIIL+Q+ENEYGN++ AYG G+ Y++WAA AV+L+
Sbjct: 1 MQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWAAGMAVSLD 60
Query: 189 TSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPV 248
T VPWVMCQQ DAPDP+INTCNGFYCD FTPNS SKP MWTEN+SGWFLSFG AVP+RP
Sbjct: 61 TGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLSFGGAVPYRPA 120
Query: 249 EDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWG 308
EDLAFAVARF++ GGTFQNYYMY GGTNFGR+ GGP +ATSYDYDAPIDEYG +RQPKWG
Sbjct: 121 EDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEYGMVRQPKWG 180
Query: 309 HLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSSDANVTF 367
HLR++HKAIKLCE LI+++P++ LG EA +Y + N CAAFLAN D+ SD V F
Sbjct: 181 HLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSICAAFLANVDAQSDKTVKF 240
Query: 368 NGNVYFLPAWSVSILPDCKNVVFNTAKVISQ------RNNGDHPFAQQKNVNELLLASSA 421
NGN Y LPAWSVSILPDCKNVV NTA++ SQ R+ G ++ LA++
Sbjct: 241 NGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDTDDSLITPELATAG 300
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLN------ 475
+S+ E VGI+ + +P L EQINTT D SD+LWY+ SI V +G E +LN
Sbjct: 301 WSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVV---KGDEPYLNGSQSNL 357
Query: 476 -IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
+ SLGH +++N KL G+ + + + L G N +D+LS VGL NYGA+
Sbjct: 358 LVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLSTTVGLSNYGAF 417
Query: 535 FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
FD+ GAG+ + + NG +LSS +W YQ+G+ GE + L S A S W + P
Sbjct: 418 FDLVGAGVTGPVKLSGPNGALNLSSTDWTYQIGLRGEDLHLYNPSEA-SPEWVSDNAYPT 476
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
N+ LIWYKT F AP G P+A++ MGKG+AWVNGQSIGRYW LAP +GC C+YR
Sbjct: 477 NQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYR 536
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
G+Y ++KC K CGQP+QTLYH+PR+++ PG N LV+ E+ GGDPS IS T+ IC+
Sbjct: 537 GAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMISFTTRQTSSICAH 596
Query: 715 VSEADPPPVDSW-KPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCGSFRPGAC 772
VSE P +DSW P + P +RL C R G I+ I FAS+G P G CG++ G C
Sbjct: 597 VSEMHPAQIDSWISPQQTSQTQGPALRLECPREGQVISNIKFASFGTPSGTCGNYNHGEC 656
Query: 773 -HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
L +VQ+ACVG CS+PVSS G C G+ K+L VEA CS
Sbjct: 657 SSSQALAVVQEACVGMTNCSVPVSSNNFG---DPCSGVTKSLVVEAACS 702
>gi|356502277|ref|XP_003519946.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 835
Score = 770 bits (1989), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/831 (47%), Positives = 515/831 (61%), Gaps = 47/831 (5%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+V+YD RA+ IDGKR++L SGSIHYPRST E+WP LI KSKEGGL+VIETYVFWN HEP
Sbjct: 26 DVSYDGRAITIDGKRKILFSGSIHYPRSTAEMWPSLIEKSKEGGLDVIETYVFWNVHEPH 85
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
GQY F G DLVRF+KT+Q GL+ LRIGPY CAEWNYGGFPVWLH IP I+FRT N
Sbjct: 86 PGQYDFSGNLDLVRFIKTIQNQGLYAVLRIGPYVCAEWNYGGFPVWLHNIPNIEFRTNNA 145
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
F++EMK+F I+D+M+ E LFASQGGPIILAQ+ENEYGN+ +YG G+ YV+W A
Sbjct: 146 IFEDEMKKFTTLIVDMMRHEKLFASQGGPIILAQIENEYGNIMGSYGQNGKEYVQWCAQL 205
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A + VPW+MCQQ DAPDP+INTCNGFYCD + PNS +KP MWTE+++GWF+ +G
Sbjct: 206 AQSYQIGVPWIMCQQSDAPDPLINTCNGFYCDQWHPNSNNKPKMWTEDWTGWFMHWGGPT 265
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P R ED+AFAV RFF+ GGTFQNYYMY GGTNFGRT+GGP + TSYDYDAP++EYG +
Sbjct: 266 PHRTAEDVAFAVGRFFQYGGTFQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLNEYGDLN 325
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
QPKWGHL+ LH+ +K E L + G ++ A I+ + FL N S DA
Sbjct: 326 QPKWGHLKRLHEVLKSVETTLTMGSSRNIDYGNQMTATIF-SYAGQSVCFLGNAHPSMDA 384
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
N+ F Y +PAWSVSILPDC V+NTAKV N N N L +
Sbjct: 385 NINFQNTQYTIPAWSVSILPDCYTEVYNTAKV-----NAQTSIMTINNENSYAL---DWQ 436
Query: 424 WYEE------KVG-ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ---GKEVF 473
W E K G + G+ + P L +Q DTSDYLWY S+ V G ++
Sbjct: 437 WMPETHLEQMKDGKVLGSVAITAPRLLDQ-KVANDTSDYLWYITSVDVKQGDPILSHDLK 495
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
+ + + GH VFVN + Y + F I+L G N + ++S VGL NYGA
Sbjct: 496 IRVNTKGHVLHVFVNGAHIGSQYATYGKYTFTFEADIKLKLGKNEISLVSGTVGLPNYGA 555
Query: 534 WFDVAGAGLFSVILIDLKNGK---RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
+FD G+ V L+ +G +D+S+ W Y+VG+ GE + L S + ++ G
Sbjct: 556 YFDNIHVGVTGVQLVSQNDGSEVTKDISTNVWHYKVGMHGENVKLYSPSRSTEEWFTNG- 614
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
L +K +WYKTTF P G + L+L +GKGQAWVNG +IGRYW +YLA GC+
Sbjct: 615 -LQAHKIFMWYKTTFRTPVGTDSVVLDLKGLGKGQAWVNGNNIGRYWVSYLAGEDGCSST 673
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPG-ENLLVIHEELGGDPSKISLLTKTGQ 709
CDYRG+Y ++KC +CG P Q YH+P +++ G +N LV+ EE GG+P ++ + T T
Sbjct: 674 CDYRGTYRSNKCTTNCGNPTQRWYHVPDSFLRDGLDNTLVVFEEQGGNPFQVKIATVTIA 733
Query: 710 HICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
C+ E ++ LAC+ I+ I FAS+G+PEG CGSF+
Sbjct: 734 KACAKAYEGH------------------ELELACKENQVISEIKFASFGVPEGECGSFKK 775
Query: 770 GACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G C D L IV++ C+G+ +CSI V+ LG + P LA++A C
Sbjct: 776 GHCESSDTLSIVKRLCLGKQQCSIQVNEKMLGPTGCRVPE--NRLAIDALC 824
>gi|414865884|tpg|DAA44441.1| TPA: hypothetical protein ZEAMMB73_968467 [Zea mays]
Length = 641
Score = 767 bits (1980), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/617 (60%), Positives = 457/617 (74%), Gaps = 18/617 (2%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ANVTYDHRALVIDG RRVL SGSIHYPRSTP++WP LI+K+K+GGL+VIETYVFW+ HE
Sbjct: 27 AANVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKAKDGGLDVIETYVFWDIHE 86
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P+RGQY FEGR DL FVKTV +AGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+FRT
Sbjct: 87 PVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTD 146
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK EM+RF AK++D MK L+ASQGGPIIL+Q+ENEYGN++ AYG G+ Y++WAA
Sbjct: 147 NEPFKAEMQRFTAKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAPGKAYMRWAA 206
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV+L+T VPWVMCQQ DAPDP+INTCNGFYCD FTPNS +KP MWTEN+SGWFLSFG
Sbjct: 207 GMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGWFLSFGG 266
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
AVP+RPVEDLAFAVARF++ GGTFQNYYMY GGTN R++GGP +ATSYDYDAPIDEYG
Sbjct: 267 AVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAPIDEYGL 326
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPKWGHLR++HKAIKLCE LI++DP++ LG +EA +Y K + CAAFLAN D S
Sbjct: 327 VRQPKWGHLRDVHKAIKLCEPALIATDPSYTSLGPNVEAAVY-KVGSVCAAFLANIDGQS 385
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNE------L 415
D VTFNG +Y LPAWSVSILPDCKNVV NTA++ SQ + + + NV
Sbjct: 386 DKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQINSQTTGSEMRYLESSNVASDGSFVTP 445
Query: 416 LLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLN 475
LA S +S+ E VGI+ + + + L EQINTT D SD+LWY+ SI V +G E +LN
Sbjct: 446 ELAVSDWSYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITV---KGDEPYLN 502
Query: 476 -------IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGL 528
+ SLGH V++N K+ G+ + K IEL G N +D+LS VGL
Sbjct: 503 GSQSNLAVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSATVGL 562
Query: 529 QNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQ 588
NYGA+FD+ GAG+ + + NG DLSS EW YQ+G+ GE + L S A S W
Sbjct: 563 SNYGAFFDLVGAGITGPVKLSGLNGALDLSSAEWTYQIGLRGEDLHLYDPSEA-SPEWVS 621
Query: 589 GSTLPVNKSLIWYKTTF 605
+ P+N LIWYK +
Sbjct: 622 ANAYPINHPLIWYKVSM 638
>gi|356502275|ref|XP_003519945.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 835
Score = 766 bits (1977), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/831 (47%), Positives = 514/831 (61%), Gaps = 47/831 (5%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+V+YD RA+ IDGKR++L SGSIHYPRST E+WP LI KSKEGGL+VIETYVFWN HEP
Sbjct: 26 DVSYDGRAITIDGKRKILFSGSIHYPRSTAEMWPSLIEKSKEGGLDVIETYVFWNVHEPH 85
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
GQY F G DLVRF+KT+Q GL LRIGPY CAEWNYGGFPVWLH IP I+FRT N
Sbjct: 86 PGQYDFSGNLDLVRFIKTIQNQGLHAVLRIGPYVCAEWNYGGFPVWLHNIPNIEFRTNNA 145
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
F++EMK+F I+D+M+ E LFASQGGPIILAQ+ENEYGN+ +YG G+ YV+W A
Sbjct: 146 IFEDEMKKFTTLIVDMMRHEKLFASQGGPIILAQIENEYGNIMGSYGQNGKEYVQWCAQL 205
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A + VPW+MCQQ D PDP+INTCNGFYCD + PNS +KP MWTE+++GWF+ +G
Sbjct: 206 AQSYQIGVPWIMCQQSDTPDPLINTCNGFYCDQWHPNSNNKPKMWTEDWTGWFMHWGGPT 265
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P R ED+AFAV RFF+ GGTFQNYYMY GGTNFGRT+GGP + TSYDYDAP++EYG +
Sbjct: 266 PHRTAEDVAFAVGRFFQYGGTFQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLNEYGDLN 325
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
QPKWGHL+ LH+ +K E L + G ++ A I+ + FL N S DA
Sbjct: 326 QPKWGHLKRLHEVLKSVETTLTMGSSRNIDYGNQMTATIF-SYAGQSVCFLGNAHPSMDA 384
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
N+ F Y +PAWSVSILPDC V+NTAKV N N N L +
Sbjct: 385 NINFQNTQYTIPAWSVSILPDCYTEVYNTAKV-----NAQTSIMTINNENSYAL---DWQ 436
Query: 424 WYEE------KVG-ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ---GKEVF 473
W E K G + G+ + P L +Q DTSDYLWY S+ V G ++
Sbjct: 437 WMPETHLEQMKDGKVLGSVAITAPRLLDQ-KVANDTSDYLWYITSVDVKQGDPILSHDLK 495
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
+ + + GH VFVN + Y + F I+L G N + ++S VGL NYGA
Sbjct: 496 IRVNTKGHVLHVFVNGAHIGSQYATYGKYPFTFEADIKLKLGKNEISLVSGTVGLPNYGA 555
Query: 534 WFDVAGAGLFSVILIDLKNGK---RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
+FD G+ V L+ +G +D+S+ W Y+VG+ GE + L S ++ ++ G
Sbjct: 556 YFDNIHVGVTGVQLVSQNDGSEVTKDISTNVWHYKVGMHGENVKLYSPSRSSEEWFTNG- 614
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
L +K +WYKTTF P G + L+L +GKGQAWVNG +IGRYW +YLA GC+
Sbjct: 615 -LQAHKIFMWYKTTFRTPVGTDSVVLDLKGLGKGQAWVNGNNIGRYWVSYLAGEDGCSST 673
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPG-ENLLVIHEELGGDPSKISLLTKTGQ 709
CDYRG+Y ++KC +CG P Q YH+P +++ G +N LV+ EE GG+P ++ + T T
Sbjct: 674 CDYRGTYRSNKCTTNCGNPTQRWYHVPDSFLRDGLDNTLVVFEEQGGNPFQVKIATVTIA 733
Query: 710 HICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
C+ E ++ LAC+ I+ I FAS+G+PEG CGSF+
Sbjct: 734 KACAKAYEGH------------------ELELACKENQVISEIRFASFGVPEGECGSFKK 775
Query: 770 GACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G C D L IV++ C+G+ +CSI V+ LG + P LA++A C
Sbjct: 776 GHCESSDTLSIVKRLCLGKQQCSIHVNEKMLGPTGCRVPE--NRLAIDALC 824
>gi|357450109|ref|XP_003595331.1| Beta-galactosidase [Medicago truncatula]
gi|355484379|gb|AES65582.1| Beta-galactosidase [Medicago truncatula]
Length = 830
Score = 765 bits (1976), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/836 (48%), Positives = 538/836 (64%), Gaps = 45/836 (5%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ V++D RA+ IDGKRRVL SGSIHYPRSTP++WP+LI+K+KEGGL+ IETYVFWN HE
Sbjct: 24 AVEVSHDGRAIKIDGKRRVLISGSIHYPRSTPQMWPDLIKKAKEGGLDAIETYVFWNAHE 83
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
PIR +Y F G DL+RF+KT+Q+ GLF LRIGPY CAEWNYGG PVW++ +PG++ RT
Sbjct: 84 PIRREYDFSGNNDLIRFLKTIQDEGLFAVLRIGPYVCAEWNYGGIPVWVYNLPGVEIRTA 143
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N F EM+ F I+D++++E LFASQGGPIIL+Q+ENEYGNV AYG G+ Y+ W A
Sbjct: 144 NKVFMNEMQNFTTLIVDMVRKEKLFASQGGPIILSQIENEYGNVMSAYGDEGKAYINWCA 203
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
+ A + N VPW+MCQQ DAP P+INTCNG+YC F PN+P+ P MWTEN+ GWF ++G
Sbjct: 204 NMADSFNIGVPWIMCQQPDAPQPMINTCNGWYCHDFEPNNPNSPKMWTENWVGWFKNWGG 263
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
P R ED+A++VARFFETGGTFQNYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG
Sbjct: 264 KDPHRTAEDIAYSVARFFETGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYGN 323
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAA-FLANYDSS 360
I QPKWGHL+ELH +K E L + + + LG+ ++A +Y ++ND ++ FL N +++
Sbjct: 324 IAQPKWGHLKELHLVLKSMENSLTNGNVSKIDLGSYVKATVY--ATNDSSSCFLTNTNTT 381
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
+DA VTF GN Y +PAWSVSILPDC+ +NTAKV Q + +++N E +
Sbjct: 382 TDATVTFKGNTYNVPAWSVSILPDCQTEEYNTAKVNVQTS----IMVKRENKAEDEPEAL 437
Query: 421 AFSWYEEKV--GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM---PGQGKEVFLN 475
+ W E V + G S + + +Q D+SDYLWY + + P L
Sbjct: 438 KWVWRAENVHNSLIGKSSVSKNTIVDQKIAANDSSDYLWYMTRLDINQKDPVWTNNTILR 497
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
I GH FVN + + + + N I+L G N + +LS+ VGLQNYG +
Sbjct: 498 INGTGHVIHAFVNGEHIGSHWATYGIHNDQFETNIKLKHGRNDISLLSVTVGLQNYGKEY 557
Query: 536 DVAGAGLFSVI-LIDLKNGK---RDLSSGEWIYQVGVEG---EYIGLDKISLANSSFWKQ 588
D GL S I LI K + +DLSS +W Y+VG+ G ++ D A+SS W +
Sbjct: 558 DKWQDGLVSPIELIGTKGDETIIKDLSSHKWTYKVGLHGWENKFFSQDTF-FASSSKW-E 615
Query: 589 GSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
+ LP+NK L WYKTTF AP P+ ++L MGKG AWVNG S+GRYW +Y A GC+
Sbjct: 616 SNELPINKMLTWYKTTFKAPLESDPIVVDLQGMGKGYAWVNGHSLGRYWPSYNADEDGCS 675
Query: 649 KK-CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
CDYRG Y+ +KC +CG+P+Q YH+PR ++ G N LV+ EE+GG+PS+I+ T
Sbjct: 676 DDPCDYRGEYNDTKCVSNCGKPSQRWYHVPRDFIEDGVNTLVLFEEIGGNPSQINFQTVI 735
Query: 708 GQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
C+ E + + L+C G I+ I FAS+G P+G CG+F
Sbjct: 736 VGSACANAYE------------------NKTLELSC-HGRSISDIKFASFGNPQGTCGAF 776
Query: 768 RPGAC--HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCSI 821
G+C + + L +VQKACVG+ CSI VS G A C ++K LAVEA C+I
Sbjct: 777 TKGSCESNNEALSLVQKACVGKESCSIDVSEKTFG--ATNCGNMVKRLAVEAVCAI 830
>gi|242064502|ref|XP_002453540.1| hypothetical protein SORBIDRAFT_04g007660 [Sorghum bicolor]
gi|241933371|gb|EES06516.1| hypothetical protein SORBIDRAFT_04g007660 [Sorghum bicolor]
Length = 740
Score = 763 bits (1971), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/714 (53%), Positives = 486/714 (68%), Gaps = 34/714 (4%)
Query: 7 YDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQ 66
YDHR+LVI+G+RR+L SGSIHYPRSTPE+WP LI+K+K+GGL+VI+TYVFWN HEP++GQ
Sbjct: 47 YDHRSLVINGRRRILISGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHEPVQGQ 106
Query: 67 YYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFK 126
Y+F R+DLVRFVK V++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI+FRT N PFK
Sbjct: 107 YHFADRYDLVRFVKLVRQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIRFRTDNGPFK 166
Query: 127 EEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVN 186
M++F+ KI+ +MK E LF QGGPII+AQVENE+G +E G G + Y WAA AV
Sbjct: 167 AAMQKFVEKIVSMMKSEGLFEWQGGPIIMAQVENEFGPMESVVGSGAKPYAHWAAQMAVG 226
Query: 187 LNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFR 246
NT VPWVMC+Q+DAPDP+INTCNGFYCD FTPN KP MWTE ++GWF FG A+P R
Sbjct: 227 TNTGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNRKYKPTMWTEAWTGWFTKFGGALPHR 286
Query: 247 PVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPK 306
PVEDLAFAVARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDE+G +RQPK
Sbjct: 287 PVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLLRQPK 346
Query: 307 WGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVT 366
WGHLR+LH+AIK E LIS DPT Q +G +A+I+ + CAAFL+NY + +
Sbjct: 347 WGHLRDLHRAIKQAEPALISGDPTIQSIGNYEKAYIFKSKNGACAAFLSNYHMKTAVKIR 406
Query: 367 FNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW-- 424
F+G Y LPAWS+SILPDCK VFNTA V P K +N +L F+W
Sbjct: 407 FDGRHYDLPAWSISILPDCKTAVFNTATV-------KEPTLLPK-MNPVL----HFAWQS 454
Query: 425 YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVF--------LNI 476
Y E + +F R L EQ++ T D SDYLWYT + + G E F L +
Sbjct: 455 YSEDTNSLDDSAFTRNGLVEQLSLTWDKSDYLWYTTHVSI---GGNEQFLKSGQWPQLTV 511
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
S GH+ VFVN + YG +D N +++ +G N + ILS VGL N G F+
Sbjct: 512 YSAGHSMQVFVNGRSYGSVYGGYDNPKLTFNGHVKMWQGSNKISILSSAVGLPNNGNHFE 571
Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK-QGSTLPV 594
+ G+ V L L GKRDLS +W YQVG++GE +GL ++ +++ W G P
Sbjct: 572 LWNVGVLGPVTLSGLNEGKRDLSHQKWTYQVGLKGESLGLHTVTGSSAVEWAGPGGKQP- 630
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
L W+K F AP G P+AL++ SMGKGQ WVNG GRYWS Y A S C ++C Y
Sbjct: 631 ---LTWHKALFNAPAGSDPVALDMGSMGKGQIWVNGHHAGRYWS-YRAYSGSC-RRCSYA 685
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEEL-GGDPSKISLLTKT 707
G+Y +C +CG +Q YH+PR+W+ P NLLV+ EE GGD + ++L T+T
Sbjct: 686 GTYREDQCLSNCGDISQRWYHVPRSWLKPSGNLLVVLEEYGGGDLAGVTLATRT 739
>gi|357464797|ref|XP_003602680.1| Beta-galactosidase [Medicago truncatula]
gi|355491728|gb|AES72931.1| Beta-galactosidase [Medicago truncatula]
Length = 781
Score = 763 bits (1969), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/727 (53%), Positives = 480/727 (66%), Gaps = 15/727 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ +NV+YD R+L+IDG+R++L S SIHYPRS P +WP LI+ +KEGG++VIETYVFWN H
Sbjct: 23 VGSNVSYDGRSLIIDGQRKLLISASIHYPRSVPAMWPALIQTAKEGGIDVIETYVFWNGH 82
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
E G YYF GRFDLV+F K VQ+AG++L LRIGP+ AEWN+GG PVWLH+IPG FRT
Sbjct: 83 ELSPGNYYFGGRFDLVQFAKVVQDAGMYLILRIGPFVAAEWNFGGVPVWLHYIPGTVFRT 142
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PF M++F I++LMK+E LFASQGGPIIL+Q+ENEYG E Y G+ Y WA
Sbjct: 143 YNQPFMHHMEKFTTYIVNLMKKEKLFASQGGPIILSQIENEYGYYENYYKEDGKKYALWA 202
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV+ NTSVPW+MCQQ DAPDP+I+TCN FYCD FTP SP +P MWTEN+ GWF +FG
Sbjct: 203 AKMAVSQNTSVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPKRPKMWTENWPGWFKTFG 262
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
P RPVED+AF+VARFF+ GG+ NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 263 GRDPHRPVEDVAFSVARFFQKGGSLNNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 322
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
R PKWGHL+ELHKAIKLCE L+ + LG +EA IY SS CAAF++N D
Sbjct: 323 LPRLPKWGHLKELHKAIKLCEHVLLYGKSVNISLGPSVEADIYTDSSGACAAFISNVDDK 382
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN----NGDHPFAQQKNVNELL 416
+D V F Y LPAWSVSILPDCKNVVFNTAKV S N +H QQ + +
Sbjct: 383 NDKKVVFRNASYHLPAWSVSILPDCKNVVFNTAKVSSPTNIVAMIPEH--LQQSDKGQKT 440
Query: 417 LASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKE 471
L F +E GI G FV+ + INTTKDT+DYLW+T SI + + G +
Sbjct: 441 LKWDVF---KENPGIWGKADFVKNGFVDHINTTKDTTDYLWHTTSILIDANEEFLKKGSK 497
Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
L IES GH FVN+K G GN + F I L G N + ILS+ VGLQ
Sbjct: 498 PALLIESKGHTLHAFVNQKYQGTGTGNGSHSAFTFKNPISLRAGKNEIAILSLTVGLQTA 557
Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
G ++D GAG+ SV +I L N DLSS W Y++GV GE++ + + NS W S
Sbjct: 558 GPFYDFIGAGVTSVKIIGLNNRTIDLSSNAWAYKIGVLGEHLSIYQGEGMNSVKWTSTSE 617
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLA-PSTGCTKK 650
P ++L WYK AP G P+ L++ MGKG AW+NG+ IGRYW C ++
Sbjct: 618 PPKGQALTWYKAIVDAPSGDEPVGLDMLYMGKGLAWLNGEEIGRYWPRISEFKKEDCVQE 677
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
CDYRG ++ KC CG+P+Q YH+PR+W P N+LVI EE GGDP+KI+ +
Sbjct: 678 CDYRGKFNPDKCDTGCGEPSQKWYHVPRSWFKPSGNVLVIFEEKGGDPTKITFVRHCHNP 737
Query: 711 ICSFVSE 717
S V E
Sbjct: 738 YSSIVVE 744
>gi|449452767|ref|XP_004144130.1| PREDICTED: beta-galactosidase 15-like [Cucumis sativus]
Length = 827
Score = 763 bits (1969), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/831 (46%), Positives = 526/831 (63%), Gaps = 39/831 (4%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
S V+Y +R + IDG+ ++ SGSIHYPRSTP++WP+LI+KSKEGGL+ IETYVFWN HE
Sbjct: 23 STQVSYTNRGITIDGQPKIFLSGSIHYPRSTPQMWPDLIKKSKEGGLDTIETYVFWNAHE 82
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQ-FRT 120
P+R QY F DLVRF+KT+Q GL+ LRIGPY CAEWNYGGFPVWLH +PGI+ RT
Sbjct: 83 PVRRQYDFSANLDLVRFIKTIQNEGLYAVLRIGPYVCAEWNYGGFPVWLHNLPGIEELRT 142
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
TN F EM+ F I+D+MKQENLFASQGGPIILAQ+ENEYGNV +YG G+ YV W
Sbjct: 143 TNPVFMNEMQNFTTLIVDMMKQENLFASQGGPIILAQIENEYGNVMTSYGDAGKAYVNWC 202
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A+ A + N VPW+MCQQ+DAP+P INTCNG+YCD FTPN+ P MWTEN++GWF S+G
Sbjct: 203 ANMADSQNVGVPWIMCQQDDAPEPTINTCNGWYCDQFTPNNAKSPKMWTENWTGWFKSWG 262
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
P R EDLAF+VARFF+ GGTFQNYYMY GGTNF R AGGP + T+YDY+AP+DEYG
Sbjct: 263 GRDPVRTPEDLAFSVARFFQLGGTFQNYYMYHGGTNFDRMAGGPYITTTYDYNAPLDEYG 322
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+ QPK+GHL++LH A+K E+ L+S + T L + Y + F +N + +
Sbjct: 323 NLNQPKFGHLKQLHAALKSIEKALVSGNVTTTDLTDSVSITEYATDKGK-SCFFSNINET 381
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
+DA V + G + +PAWSVSILPDC+ V+NTAKV +Q + + +N E+L
Sbjct: 382 TDALVNYLGKDFNVPAWSVSILPDCQEEVYNTAKVNTQTSVMVKKENKAENEPEVL---- 437
Query: 421 AFSWYEEKVGIS---GNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM---PGQGKEVFL 474
+ W E + + G L +Q + D SDYLWY S+++ P E+ L
Sbjct: 438 EWMWRPENIDNTARLGKGQVTANKLIDQKDAANDASDYLWYMTSVNLKKKDPIWSNEMTL 497
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
I GH FVN + + + ++D N++ ++++L G N + +LS +GL+NYGA
Sbjct: 498 RINVSGHIVHAFVNGEHIGSQWASYDVYNYIFEQEVKLKPGKNIISLLSATIGLKNYGAQ 557
Query: 535 FDVAGAGLFSVILIDLKNGK----RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
+D+ +G+ + + ++G +DLS+ +W Y+VG+ G L ++ W+ G+
Sbjct: 558 YDLIQSGIVGPVQLIGRHGDETIIKDLSNHKWSYEVGLHGFENRLFSPESRFATKWQSGN 617
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
LPVN+ + WYKTTF P G P+ L+L +GKG AWVNG SIGRYW +++A +
Sbjct: 618 -LPVNRMMTWYKTTFKPPLGTDPVTLDLQGLGKGMAWVNGHSIGRYWPSFIAEDGCSDEP 676
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
CDYRGSY +KC + CG+P Q YH+PR+W++ G+N LV+ EE GG+PS ++ T +
Sbjct: 677 CDYRGSYTNTKCVRDCGKPTQQWYHVPRSWLNEGDNTLVLFEEFGGNPSLVNFKTIAMEK 736
Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
C E + L+C+ G I I FAS+G P G+CG+F G
Sbjct: 737 ACGHAYE------------------KKSLELSCQ-GKEITGIKFASFGDPTGSCGNFSKG 777
Query: 771 ACH--MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
+C D + IV+ C+G+ C I +S G + A G++K LAVEA C
Sbjct: 778 SCEGKNDAMKIVEDLCIGKESCVIDISEDTFGATNCAL-GVVKRLAVEAVC 827
>gi|125581329|gb|EAZ22260.1| hypothetical protein OsJ_05915 [Oryza sativa Japonica Group]
Length = 754
Score = 763 bits (1969), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/703 (52%), Positives = 481/703 (68%), Gaps = 26/703 (3%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+A V+YD R+LVI+G+RR+L SGSIHYPRSTPE+WP LI+K+K+GGL+VI+TYVFWN HE
Sbjct: 35 NAAVSYDRRSLVINGRRRILLSGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHE 94
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P++GQYYF R+DLVRFVK V++AGL++HLRIGPY CAEWN+GGFPVWL ++PG+ FRT
Sbjct: 95 PVQGQYYFSDRYDLVRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVSFRTD 154
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK EM++F+ KI+ +MK E LF QGGPII++QVENE+G +E G G + Y WAA
Sbjct: 155 NGPFKAEMQKFVEKIVSMMKSEGLFEWQGGPIIMSQVENEFGPMESVGGSGAKPYANWAA 214
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV NT VPWVMC+Q+DAPDP+INTCNGFYCD F+PN KP MWTE ++GWF SFG
Sbjct: 215 KMAVGTNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKNYKPSMWTEAWTGWFTSFGG 274
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
VP RPVEDLAFAVARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDE+G
Sbjct: 275 GVPHRPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGL 334
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPKWGHLR+LH+AIK E L+S+DPT + +G+ +A+++ + CAAFL+NY ++
Sbjct: 335 LRQPKWGHLRDLHRAIKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNT 394
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
V FNG Y LPAWS+SILPDCK VFNTA V ++ + +
Sbjct: 395 AVKVRFNGQQYNLPAWSISILPDCKTAVFNTATV------------KEPTLMPKMNPVVR 442
Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ---GKEVFLNI 476
F+W Y E + +F + L EQ++ T D SDYLWYT +++ G+ L +
Sbjct: 443 FAWQSYSEDTNSLSDSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTNDLRSGQSPQLTV 502
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
S GH+ VFVN K YG +D N ++++ +G N + ILS VGL N G F+
Sbjct: 503 YSAGHSMQVFVNGKSYGSVYGGYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFE 562
Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFW-KQGSTLPV 594
G+ V L L G +DLS +W YQVG++GE +GL ++ +++ W G P
Sbjct: 563 NWNVGVLGPVTLSSLNGGTKDLSHQKWTYQVGLKGETLGLQTVTGSSAVEWGGPGGYQP- 621
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
L W+K F AP G P+AL++ SMGKGQ WVNG +GRYWS S GC C Y
Sbjct: 622 ---LTWHKAFFNAPAGNDPVALDMGSMGKGQLWVNGHHVGRYWS--YKASGGC-GGCSYA 675
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGD 697
G+Y KC+ +CG +Q YH+PR+W+ PG NLLV+ EE G +
Sbjct: 676 GTYHEDKCRSNCGDLSQRWYHVPRSWLKPGGNLLVVLEEYGAN 718
>gi|449529387|ref|XP_004171681.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 15-like [Cucumis
sativus]
Length = 827
Score = 762 bits (1968), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/831 (46%), Positives = 526/831 (63%), Gaps = 39/831 (4%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
S V+Y +R + IDG+ ++ SGSIHYPRSTP++WP+LI+KSKEGGL+ IETYVFWN HE
Sbjct: 23 STQVSYTNRGITIDGQPKIFLSGSIHYPRSTPQMWPDLIKKSKEGGLDTIETYVFWNAHE 82
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQ-FRT 120
P+R QY F DLVRF+KT+Q GL+ LRIGPY CAEWNYGGFPVWLH +PGI+ RT
Sbjct: 83 PVRRQYDFSANLDLVRFIKTIQNEGLYAVLRIGPYVCAEWNYGGFPVWLHNLPGIEELRT 142
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
TN F EM+ F I+D+MKQENLFASQGGPIILAQ+ENEYGNV +YG G+ YV W
Sbjct: 143 TNPVFMNEMQNFTTLIVDMMKQENLFASQGGPIILAQIENEYGNVMTSYGDAGKAYVNWC 202
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A+ A + N VPW+MCQQ+DAP+P INTCNG+YCD FTPN+ P MWTEN++GWF S+G
Sbjct: 203 ANMADSQNVGVPWIMCQQDDAPEPTINTCNGWYCDQFTPNNAKSPKMWTENWTGWFKSWG 262
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
P R EDLAF+VARFF+ GGTFQNYYMY GGTNF R AGGP + T+YDY+AP+DEYG
Sbjct: 263 GRDPVRTPEDLAFSVARFFQLGGTFQNYYMYHGGTNFDRMAGGPYITTTYDYNAPLDEYG 322
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+ QPK+GHL++LH A+K E+ L+S + T L + Y + F +N + +
Sbjct: 323 NLNQPKFGHLKQLHAALKSIEKALVSGNVTTTDLTDSVSITEYATDKGK-SCFFSNINET 381
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
+DA V + G + +PAWSVSILPDC+ V+NTAKV +Q + + +N E+L
Sbjct: 382 TDALVNYLGKDFNVPAWSVSILPDCQEEVYNTAKVNTQTSVMVKKENKAENEPEVL---- 437
Query: 421 AFSWYEEKVGIS---GNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM---PGQGKEVFL 474
+ W E + + G L +Q + D SDYLWY S+++ P E+ L
Sbjct: 438 EWMWRPENIDNTARLGKGQVTANKLIDQKDAANDASDYLWYMTSVNLKKKDPIWSNEMTL 497
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
I GH FVN + + + ++D N++ ++++L G N + +LS +GL+NYGA
Sbjct: 498 RINVSGHIVHAFVNGEHIGSQWASYDVYNYIXEQEVKLKPGKNIISLLSATIGLKNYGAQ 557
Query: 535 FDVAGAGLFSVILIDLKNGK----RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
+D+ +G+ + + ++G +DLS+ +W Y+VG+ G L ++ W+ G+
Sbjct: 558 YDLIQSGIVGPVQLIGRHGDETIIKDLSNHKWSYEVGLHGFENRLFSPESRFATKWQSGN 617
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
LPVN+ + WYKTTF P G P+ L+L +GKG AWVNG SIGRYW +++A +
Sbjct: 618 -LPVNRMMTWYKTTFKPPLGTDPVTLDLQGLGKGMAWVNGHSIGRYWPSFIAEDGCSDEP 676
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
CDYRGSY +KC + CG+P Q YH+PR+W++ G+N LV+ EE GG+PS ++ T +
Sbjct: 677 CDYRGSYTNTKCVRDCGKPTQQWYHVPRSWLNEGDNTLVLFEEFGGNPSLVNFKTIAMEK 736
Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
C E + L+C+ G I I FAS+G P G+CG+F G
Sbjct: 737 ACGHAYE------------------KKSLELSCQ-GKEITGIKFASFGDPTGSCGNFSKG 777
Query: 771 ACH--MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
+C D + IV+ C+G+ C I +S G + A G++K LAVEA C
Sbjct: 778 SCEGKNDAMKIVEDLCIGKESCVIDISEDTFGATNCAL-GVVKRLAVEAVC 827
>gi|255543793|ref|XP_002512959.1| beta-galactosidase, putative [Ricinus communis]
gi|223547970|gb|EEF49462.1| beta-galactosidase, putative [Ricinus communis]
Length = 732
Score = 761 bits (1966), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/714 (52%), Positives = 487/714 (68%), Gaps = 18/714 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ NVTYD +AL+I+G++R+L SGSIHYPRSTP++W LI+K+K+GGL+VI+TYVFWN H
Sbjct: 24 IECNVTYDKKALIINGQKRILFSGSIHYPRSTPQMWEGLIQKAKDGGLDVIDTYVFWNLH 83
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G Y FEGR DLV+F+K V +AGL++HLRIGPY C EWN+GGFPVWL +IPG+ FRT
Sbjct: 84 EPSPGNYNFEGRNDLVQFIKLVHKAGLYVHLRIGPYICGEWNFGGFPVWLKYIPGMIFRT 143
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK +M++F KI+ +MK E L+ SQGGPIIL+Q+ENEY + A+G G Y+ WA
Sbjct: 144 DNEPFKLQMQKFTQKIVQMMKDEQLYESQGGPIILSQIENEYEPEDKAFGAAGHAYMTWA 203
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV+LNT VPWVMC++ DAPDP++NTCNGFYCD F+PN KP MWTE ++GWF FG
Sbjct: 204 AHMAVSLNTGVPWVMCKEFDAPDPVVNTCNGFYCDYFSPNKAYKPTMWTEAWTGWFTDFG 263
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ RPVEDLAFAVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 264 GPIHQRPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 323
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IRQPK+GHL++LHKAIKLCE L+SSDP LG+ +AH++ +S DCAAFLANY+
Sbjct: 324 LIRQPKYGHLKDLHKAIKLCERALLSSDPVVTTLGSYEQAHVFSSNSGDCAAFLANYNPK 383
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
+ A VTFN Y LP WSVSILPDCKNVVFNTA+V Q + + + ++ L+
Sbjct: 384 ATAKVTFNNMHYNLPPWSVSILPDCKNVVFNTAEVGVQPSKIQMLPTEARFLSWEALSED 443
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
S ++K+G L EQIN T+D SDYLWYT +H+ + G+ L
Sbjct: 444 ISSVDDDKIGTVAG-------LLEQINVTRDASDYLWYTTGVHISSSETFLDGGQPPILK 496
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKI-ELNEGINTLDILSMMVGLQNYGAW 534
+ S GH VFVN +L YG + ++ +L+ G N + +LS+ VGL N G
Sbjct: 497 VISAGHGIHVFVNGQLSGSVYGTRGNRRISFSGELKQLHAGRNRISLLSVAVGLPNNGPR 556
Query: 535 FDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
F+ G+ ++I L G RDL+ +W Y+VG++GE + L + S W Q S +
Sbjct: 557 FETWNTGVLGPVVIHGLDQGHRDLTWQKWSYKVGLKGEDLNLGSPNSIPSINWMQESAMV 616
Query: 594 VNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
+ L W++ F AP G PLAL+++SM KGQ W+NG SIGRYW+ Y + G C
Sbjct: 617 AERQPLTWHRAFFDAPRGDDPLALDMSSMVKGQVWINGNSIGRYWTVY---ADGNCTACS 673
Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
Y G++ S CQ CGQP Q YHIPR+ + P ENLLV+ EE+GGD SKI L+ +
Sbjct: 674 YSGTFRPSTCQFGCGQPTQKWYHIPRSLLKPTENLLVVFEEIGGDVSKIYLVKR 727
>gi|449485873|ref|XP_004157296.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 7-like [Cucumis
sativus]
Length = 813
Score = 759 bits (1960), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/829 (46%), Positives = 531/829 (64%), Gaps = 40/829 (4%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NV+YD A++I+G+RRV+ SGS+HYPRST +WP+LI+K+K+GGL+ IETY+FW+ HEP
Sbjct: 11 NVSYDSNAIIINGERRVILSGSMHYPRSTEAMWPDLIQKAKDGGLDAIETYIFWDRHEPQ 70
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
R +Y F GR D ++F + VQ+AGL++ +RIGPY CAEWNYGGFP+WLH +PGIQFRT N
Sbjct: 71 RRKYDFTGRLDFIKFFQLVQDAGLYVVMRIGPYVCAEWNYGGFPLWLHNLPGIQFRTDNQ 130
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
+K EM+ F KI+++ KQ NLFASQGGPIILAQ+ENEYGNV YG G+ Y+ W A
Sbjct: 131 VYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGNAGKSYINWCAQM 190
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD-GFTPNSPSKPIMWTENYSGWFLSFGYA 242
A +LN +PW+MCQQ DAP PIINTCNGFYCD F+PN+P P M+TEN+ GWF +G
Sbjct: 191 AESLNIGIPWIMCQQSDAPQPIINTCNGFYCDYDFSPNNPKSPKMFTENWVGWFKKWGDK 250
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
P+R ED+AFAVARFF++GG F NYYMY GGTNFGRTAGGP + TSYDY+AP+DEYG +
Sbjct: 251 DPYRSPEDVAFAVARFFQSGGVFNNYYMYHGGTNFGRTAGGPFITTSYDYNAPLDEYGNL 310
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLE-AHIYHKSSNDCAAFLANYDSSS 361
QPKWGHL++LH +IK+ E+ L +S + QKL + + + +S + FL+N D+ +
Sbjct: 311 NQPKWGHLKQLHASIKMGEKILTNSTRSDQKLXSFVTLTKFSNPTSGERFCFLSNTDNKN 370
Query: 362 DANVTFNGN-VYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
DA + + YF+PAWSVSIL C VFNTAK+ SQ + F + +N E ++
Sbjct: 371 DATIDLQADGKYFVPAWSVSILDGCNKEVFNTAKINSQTS----MFVKVQNKKE----NA 422
Query: 421 AFSWY----EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQG-KEVFLN 475
FSW + + G +F L EQ TT D SDYLWY +I + V L
Sbjct: 423 QFSWVWAPEPMRDTLQGKGTFKANLLLEQKGTTVDFSDYLWYMTNIDSNATSSLQNVTLQ 482
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ + GH FVN++ + + ++ +F+ K I + G NT+ +LS VGL+NY A++
Sbjct: 483 VNTKGHMLHAFVNRRYIGSQWRSNG-QSFVFXKPILIKPGTNTITLLSATVGLKNYDAFY 541
Query: 536 DVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
D G+ + LI N K DLSS W Y+VG+ GE L + + W +
Sbjct: 542 DTVPTGIDGGPIYLIGDGNVKIDLSSNLWSYKVGLNGEMKQLYNPVFSQRTNWSTINQKS 601
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ + + YKT F P G P+ L++ MGKGQAWVNGQSIGR+W +++A + C+ CDY
Sbjct: 602 IGRRMTLYKTNFKTPSGIDPVTLDMQGMGKGQAWVNGQSIGRFWPSFIAGNDSCSTTCDY 661
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
RG+Y+ SKC ++CG P+Q YHIPR+++ N LV+ EE+GG+P ++S+ T T IC
Sbjct: 662 RGAYNPSKCVENCGNPSQRWYHIPRSFLSDDTNTLVLFEEIGGNPQQVSVQTITIGTICG 721
Query: 714 FVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
+E + L+C+ G I+ I FASYG PEG CGSF+ G+ H
Sbjct: 722 NANEGS------------------TLELSCQGGHIISEIQFASYGNPEGKCGSFKQGSWH 763
Query: 774 -MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCSI 821
++ +V+K C+G CSI VS+ G+ G + LA++A CSI
Sbjct: 764 VINSAILVEKLCIGMESCSIDVSAKSFGL--GDVTNISARLAIQALCSI 810
>gi|15241969|ref|NP_200498.1| beta-galactosidase 4 [Arabidopsis thaliana]
gi|75265636|sp|Q9SCV8.1|BGAL4_ARATH RecName: Full=Beta-galactosidase 4; Short=Lactase 4; Flags:
Precursor
gi|6686880|emb|CAB64740.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|8809655|dbj|BAA97206.1| beta-galactosidase [Arabidopsis thaliana]
gi|332009434|gb|AED96817.1| beta-galactosidase 4 [Arabidopsis thaliana]
Length = 724
Score = 758 bits (1957), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/716 (52%), Positives = 495/716 (69%), Gaps = 25/716 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ A+V+YD +A++I+G+RR+L SGSIHYPRSTPE+WP LI+K+KEGGL+VIETYVFWN H
Sbjct: 25 VKASVSYDRKAVIINGQRRILLSGSIHYPRSTPEMWPGLIQKAKEGGLDVIETYVFWNGH 84
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP GQYYF R+DLV+F+K V +AGL+++LRIGPY CAEWN+GGFPVWL F+PG+ FRT
Sbjct: 85 EPSPGQYYFGDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKFVPGMAFRT 144
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK MK+F KI+ +MK E LF +QGGPIILAQ+ENEYG VEW G G+ Y KW
Sbjct: 145 DNEPFKAAMKKFTEKIVWMMKAEKLFQTQGGPIILAQIENEYGPVEWEIGAPGKAYTKWV 204
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A+ L+T VPW+MC+QEDAP PII+TCNG+YC+ F PNS +KP MWTEN++GW+ FG
Sbjct: 205 AQMALGLSTGVPWIMCKQEDAPGPIIDTCNGYYCEDFKPNSINKPKMWTENWTGWYTDFG 264
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
AVP+RPVED+A++VARF + GG+ NYYMY GGTNF RTA G +A+SYDYDAP+DEYG
Sbjct: 265 GAVPYRPVEDIAYSVARFIQKGGSLVNYYMYHGGTNFDRTA-GEFMASSYDYDAPLDEYG 323
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
R+PK+ HL+ LHKAIKL E L+S+D T LGAK EA+++ S + CAAFL+N D +
Sbjct: 324 LPREPKYSHLKALHKAIKLSEPALLSADATVTSLGAKQEAYVFW-SKSSCAAFLSNKDEN 382
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V F G Y LP WSVSILPDCK V+NTAKV + P + ++ +
Sbjct: 383 SAARVLFRGFPYDLPPWSVSILPDCKTEVYNTAKV-------NAPSVHR----NMVPTGT 431
Query: 421 AFSW--YEEKVGISGNR-SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
FSW + E + +F R L EQI+ T D SDY WY I + G+ G
Sbjct: 432 KFSWGSFNEATPTANEAGTFARNGLVEQISMTWDKSDYFWYITDITIGSGETFLKTGDSP 491
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L + S GHA VFVN +L YG D ++KI+L+ G+N + +LS+ VGL N G
Sbjct: 492 LLTVMSAGHALHVFVNGQLSGTAYGGLDHPKLTFSQKIKLHAGVNKIALLSVAVGLPNVG 551
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
F+ G+ V L + +G D+S +W Y++GV+GE + L + ++ W QGS
Sbjct: 552 THFEQWNKGVLGPVTLKGVNSGTWDMSKWKWSYKIGVKGEALSLHTNTESSGVRWTQGSF 611
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
+ + L WYK+TF P G PLAL++ +MGKGQ W+NG++IGR+W AY A G +C
Sbjct: 612 VAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQVWINGRNIGRHWPAYKA--QGSCGRC 669
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
+Y G++DA KC +CG+ +Q YH+PR+W+ +NL+V+ EELGGDP+ ISL+ +T
Sbjct: 670 NYAGTFDAKKCLSNCGEASQRWYHVPRSWLK-SQNLIVVFEELGGDPNGISLVKRT 724
>gi|15451018|gb|AAK96780.1| beta-galactosidase [Arabidopsis thaliana]
gi|17978799|gb|AAL47393.1| beta-galactosidase [Arabidopsis thaliana]
Length = 724
Score = 758 bits (1957), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/716 (52%), Positives = 495/716 (69%), Gaps = 25/716 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ A+V+YD +A++I+G+RR+L SGSIHYPRSTPE+WP LI+K+KEGGL+VIETYVFWN H
Sbjct: 25 VKASVSYDRKAVIINGQRRILLSGSIHYPRSTPEMWPGLIQKAKEGGLDVIETYVFWNGH 84
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP GQYYF R+DLV+F+K V +AGL+++LRIGPY CAEWN+GGFPVWL F+PG+ FRT
Sbjct: 85 EPSPGQYYFGDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKFVPGMAFRT 144
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK MK+F KI+ +MK E LF +QGGPIILAQ+ENEYG VEW G G+ Y KW
Sbjct: 145 DNEPFKAAMKKFTEKIVWMMKAEKLFQTQGGPIILAQIENEYGPVEWEIGAPGKAYTKWV 204
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A+ L+T VPW+MC+QEDAP PII+TCNG+YC+ F PNS +KP MWTEN++GW+ FG
Sbjct: 205 AQMALGLSTGVPWIMCKQEDAPGPIIDTCNGYYCEDFKPNSINKPKMWTENWTGWYTDFG 264
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
AVP+RPVED+A++VARF + GG+ NYYMY GGTNF RTA G +A+SYDYDAP+DEYG
Sbjct: 265 GAVPYRPVEDIAYSVARFIQKGGSLINYYMYHGGTNFDRTA-GEFMASSYDYDAPLDEYG 323
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
R+PK+ HL+ LHKAIKL E L+S+D T LGAK EA+++ S + CAAFL+N D +
Sbjct: 324 LPREPKYSHLKALHKAIKLSEPALLSADATVTSLGAKQEAYVFW-SKSSCAAFLSNKDEN 382
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V F G Y LP WSVSILPDCK V+NTAKV + P + ++ +
Sbjct: 383 SAARVLFRGFPYDLPPWSVSILPDCKTEVYNTAKV-------NAPSVHR----NMVPTGT 431
Query: 421 AFSW--YEEKVGISGNR-SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
FSW + E + +F R L EQI+ T D SDY WY I + G+ G
Sbjct: 432 KFSWGSFNEATPTANEAGTFARNGLVEQISMTWDKSDYFWYITDITIGSGETFLKTGDSP 491
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L + S GHA VFVN +L YG D ++KI+L+ G+N + +LS+ VGL N G
Sbjct: 492 LLTVMSAGHALHVFVNGQLSGTAYGGLDHPKLTFSQKIKLHAGVNKIALLSVAVGLPNVG 551
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
F+ G+ V L + +G D+S +W Y++GV+GE + L + ++ W QGS
Sbjct: 552 THFEQWNKGVLGPVTLKGVNSGTWDMSKWKWSYKIGVKGEALSLHTNTESSGVRWTQGSF 611
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
+ + L WYK+TF P G PLAL++ +MGKGQ W+NG++IGR+W AY A G +C
Sbjct: 612 VAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQVWINGRNIGRHWPAYKA--QGSCGRC 669
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
+Y G++DA KC +CG+ +Q YH+PR+W+ +NL+V+ EELGGDP+ ISL+ +T
Sbjct: 670 NYAGTFDAKKCLSNCGEASQRWYHVPRSWLK-SQNLIVVFEELGGDPNGISLVKRT 724
>gi|449529435|ref|XP_004171705.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 826
Score = 757 bits (1954), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/829 (45%), Positives = 531/829 (64%), Gaps = 36/829 (4%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ NV+YD A++I+G+RR++ SGSIHYPRST E+WP+LI+K+K+GGL+ IETY+FW+ H
Sbjct: 23 IGNNVSYDSNAIIINGERRIIFSGSIHYPRSTEEMWPDLIQKAKDGGLDAIETYIFWDRH 82
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP R +Y F G + +++ + +QEAGL++ +RIGPY CAEWNYGGFP+WLH +PGIQ RT
Sbjct: 83 EPHRRKYDFSGHLNFIKYFQLIQEAGLYVVMRIGPYVCAEWNYGGFPLWLHNMPGIQLRT 142
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N +K EM+ F KI+++ KQ NLFASQGGPIILAQ+ENEYGNV YG G+ Y+ W
Sbjct: 143 NNQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGEAGKTYINWC 202
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A +LN +PW+MCQQ DAP PIINTCNGFYCD FTPN+P+ P M+TEN+ GWF +G
Sbjct: 203 AQMAESLNIGIPWIMCQQSDAPQPIINTCNGFYCDNFTPNNPNSPKMFTENWVGWFKKWG 262
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
P R ED+AF+VARFF++GG NYYMY GGTNFGRT+GGP + TSYDYDAP+DEYG
Sbjct: 263 DKDPHRTAEDVAFSVARFFQSGGILNNYYMYHGGTNFGRTSGGPFITTSYDYDAPLDEYG 322
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLE-AHIYHKSSNDCAAFLANYDS 359
+ QPKWGHL++LH +IKL E+ L +S + Q G+ + + + + FL+N D
Sbjct: 323 NLNQPKWGHLKQLHASIKLGEKILTNSTRSDQDFGSSVTFTKFSNLETGEKFCFLSNADE 382
Query: 360 SSDANVTFNGN-VYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
++DA V G+ YFLPAWSVSIL C +FNTAKV SQ + F +++N E A
Sbjct: 383 NNDAIVDMLGDRKYFLPAWSVSILDGCNKEIFNTAKVSSQTS----LFFKKQNEKE--NA 436
Query: 419 SSAFSWYEE--KVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQG-KEVFLN 475
+++W E + + G +F L EQ T D+SDYLWY +++ + + L
Sbjct: 437 KLSWNWASEPMRDTLQGYGTFKANLLLEQKGATIDSSDYLWYMTNVNSNTTSSLQNLTLQ 496
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ + GH F+N++ + +G++ +F+ K I+L G NT+ +LS VGL+NY A++
Sbjct: 497 VNTKGHVLHAFINRRYIGSQWGSNG-QSFVFEKPIQLKLGTNTITLLSATVGLKNYDAFY 555
Query: 536 DVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
D G+ + LI N DLSS W Y+VG+ GE L +N + W +
Sbjct: 556 DTVPTGIDGGPIYLIGDGNVTTDLSSNLWSYKVGLNGERKQLYNPMFSNRTKWSTLNKKS 615
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ + + W+K TF P G P+ L++ MGKGQAWVNG+SIGR+W +++A + C++ CDY
Sbjct: 616 IGRRMTWFKATFKTPSGTDPVVLDMQGMGKGQAWVNGRSIGRFWPSFIASNDSCSETCDY 675
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
+GSY+ +KC ++CG +Q YHIPR++++ N L++ EE+GG+P +S+ T T IC
Sbjct: 676 KGSYNPNKCVRNCGNSSQRWYHIPRSFMNDSINTLILFEEIGGNPQMVSVQTITIGTICG 735
Query: 714 FVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
+E + L+C+ G I+ I FASYG PEG CGSF+ G
Sbjct: 736 NANEGS------------------TLELSCQGGHVISEIQFASYGHPEGKCGSFQSGLWD 777
Query: 774 M--DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ IV+KAC+G CSI +S +S A P LAV+A CS
Sbjct: 778 VTKSTTIIVEKACIGMKNCSIDISPNLFKLSKVAYP--YAKLAVQALCS 824
>gi|449436000|ref|XP_004135782.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 838
Score = 756 bits (1952), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/830 (46%), Positives = 532/830 (64%), Gaps = 42/830 (5%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NV+YD A++I+G+RRV+ SGS+HYPRST +WP+LI+K+K+GGL+ IETY+FW+ HEP
Sbjct: 36 NVSYDSNAIIINGERRVILSGSMHYPRSTEAMWPDLIQKAKDGGLDAIETYIFWDRHEPQ 95
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
R +Y F GR D ++F + VQ+AGL++ +RIGPY CAEWNYGGFP+WLH +PGIQFRT N
Sbjct: 96 RRKYDFTGRLDFIKFFQLVQDAGLYVVMRIGPYVCAEWNYGGFPLWLHNLPGIQFRTDNQ 155
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
+K EM+ F KI+++ KQ NLFASQGGPIILAQ+ENEYGNV YG G+ Y+ W A
Sbjct: 156 VYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGNAGKSYINWCAQM 215
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD-GFTPNSPSKPIMWTENYSGWFLSFGYA 242
A +LN +PW+MCQQ DAP PIINTCNGFYCD F+PN+P P M+TEN+ GWF +G
Sbjct: 216 AESLNIGIPWIMCQQNDAPQPIINTCNGFYCDYDFSPNNPKSPKMFTENWVGWFKKWGDK 275
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
P+R ED+AFAVARFF++GG F NYYMY GGTNFGRTAGGP + TSYDY+AP+DEYG +
Sbjct: 276 DPYRSPEDVAFAVARFFQSGGVFNNYYMYHGGTNFGRTAGGPFITTSYDYNAPLDEYGNL 335
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLE-AHIYHKSSNDCAAFLANYDSSS 361
QPKWGHL++LH +IK+ E+ L +S + QK+ + + + +S + FL+N D+ +
Sbjct: 336 NQPKWGHLKQLHASIKMGEKILTNSTRSDQKISSFVTLTKFSNPTSGERFCFLSNTDNKN 395
Query: 362 DANVTFNGN-VYF--LPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
DA + + YF +PAWSVSIL C VFNTAK+ SQ + F + +N E
Sbjct: 396 DATIDLQADGKYFVPVPAWSVSILDGCNKEVFNTAKINSQTS----MFVKVQNKKE---- 447
Query: 419 SSAFSWY----EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQG-KEVF 473
++ FSW + + G +F L EQ TT D SDYLWY +I + V
Sbjct: 448 NAQFSWVWAPEPMRDTLQGKGTFKANLLLEQKGTTVDFSDYLWYMTNIDSNATSSLQNVT 507
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L + + GH FVN++ + + ++ +F+ K I + G NT+ +LS VGL+NY A
Sbjct: 508 LQVNTKGHMLHAFVNRRYIGSQWRSNG-QSFVFEKPILIKPGTNTITLLSATVGLKNYDA 566
Query: 534 WFDVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
++D G+ + LI N K DLSS W Y+VG+ GE L + + W +
Sbjct: 567 FYDTVPTGIDGGPIYLIGDGNVKIDLSSNLWSYKVGLNGEMKQLYNPVFSQRTNWSTINQ 626
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
+ + + WYKT+F P G + L++ MGKGQAWVNGQSIGR+W +++A + C+ C
Sbjct: 627 KSIGRRMTWYKTSFKTPSGIDRVTLDMQGMGKGQAWVNGQSIGRFWPSFIASNDSCSTTC 686
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
DYRG+Y+ SKC ++CG P+Q YHIPR+++ N LV+ EE+GG+P ++S+ T T I
Sbjct: 687 DYRGAYNPSKCVENCGNPSQRWYHIPRSFLSDDTNTLVLFEEIGGNPQQVSVQTITIGTI 746
Query: 712 CSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
C +E + L+C+ G I+ I FASYG PEG CGSF+ G+
Sbjct: 747 CGNANEGS------------------TLELSCQGGHIISEIQFASYGNPEGKCGSFKQGS 788
Query: 772 CH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
H ++ +V+K C+G+ CSI VS+ G+ G L LA++A CS
Sbjct: 789 WHVINSAILVEKLCIGRESCSIDVSAKSFGL--GDVTNLSARLAIQALCS 836
>gi|15242897|ref|NP_201186.1| beta-galactosidase 10 [Arabidopsis thaliana]
gi|75171772|sp|Q9FN08.1|BGL10_ARATH RecName: Full=Beta-galactosidase 10; Short=Lactase 10; Flags:
Precursor
gi|10177669|dbj|BAB11029.1| beta-galactosidase [Arabidopsis thaliana]
gi|20260438|gb|AAM13117.1| unknown protein [Arabidopsis thaliana]
gi|34098797|gb|AAQ56781.1| At5g63810 [Arabidopsis thaliana]
gi|332010417|gb|AED97800.1| beta-galactosidase 10 [Arabidopsis thaliana]
Length = 741
Score = 755 bits (1950), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/712 (51%), Positives = 474/712 (66%), Gaps = 17/712 (2%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ANV+YDHR+L I +R+++ S +IHYPRS P +WP L++ +KEGG IE+YVFWN HE
Sbjct: 29 AANVSYDHRSLTIGNRRQLIISAAIHYPRSVPAMWPSLVQTAKEGGCNAIESYVFWNGHE 88
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G+YYF GR+++V+F+K VQ+AG+ + LRIGP+ AEWNYGG PVWLH++PG FR
Sbjct: 89 PSPGKYYFGGRYNIVKFIKIVQQAGMHMILRIGPFVAAEWNYGGVPVWLHYVPGTVFRAD 148
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N P+K M+ F I++L+KQE LFA QGGPIIL+QVENEYG E YG GG+ Y +W+A
Sbjct: 149 NEPWKHYMESFTTYIVNLLKQEKLFAPQGGPIILSQVENEYGYYEKDYGEGGKRYAQWSA 208
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV+ N VPW+MCQQ DAP +I+TCNGFYCD FTPN+P KP +WTEN+ GWF +FG
Sbjct: 209 SMAVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQFTPNTPDKPKIWTENWPGWFKTFGG 268
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
P RP ED+A++VARFF GG+ NYYMY GGTNFGRT+GGP + TSYDY+APIDEYG
Sbjct: 269 RDPHRPAEDVAYSVARFFGKGGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGL 328
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
R PKWGHL++LHKAI L E LIS + + LG LEA +Y SS CAAFL+N D +
Sbjct: 329 PRLPKWGHLKDLHKAIMLSENLLISGEHQNFTLGHSLEADVYTDSSGTCAAFLSNLDDKN 388
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
D V F Y LPAWSVSILPDCK VFNTAKV S+ ++ + + E L +SS
Sbjct: 389 DKAVMFRNTSYHLPAWSVSILPDCKTEVFNTAKVTSKS-------SKVEMLPEDLKSSSG 441
Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
W + EK GI G FV+ +L + INTTKDT+DYLWYT SI V + G L
Sbjct: 442 LKWEVFSEKPGIWGAADFVKNELVDHINTTKDTTDYLWYTTSITVSENEAFLKKGSSPVL 501
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
IES GH VF+NK+ + GN F + K + L G N +D+LSM VGL N G++
Sbjct: 502 FIESKGHTLHVFINKEYLGTATGNGTHVPFKLKKPVALKAGENNIDLLSMTVGLANAGSF 561
Query: 535 FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
++ GAGL SV + G +L++ +W Y++GVEGE++ L K + + W + P
Sbjct: 562 YEWVGAGLTSVSIKGFNKGTLNLTNSKWSYKLGVEGEHLELFKPGNSGAVKWTVTTKPPK 621
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYL---APSTGCTKKC 651
+ L WYK P G P+ L++ SMGKG AW+NG+ IGRYW +P+ C K+C
Sbjct: 622 KQPLTWYKVVIEPPSGSEPVGLDMISMGKGMAWLNGEEIGRYWPRIARKNSPNDECVKEC 681
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISL 703
DYRG + KC CG+P+Q YH+PR+W N LVI EE GG+P KI L
Sbjct: 682 DYRGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGNELVIFEEKGGNPMKIKL 733
>gi|356522906|ref|XP_003530083.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 846
Score = 755 bits (1950), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 398/839 (47%), Positives = 513/839 (61%), Gaps = 51/839 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V+YD RAL IDGKRR+L SGSIHYPRSTPE+WP LIRK+KEGGL+VIETYVFWN HEP R
Sbjct: 28 VSYDERALTIDGKRRILFSGSIHYPRSTPEMWPYLIRKAKEGGLDVIETYVFWNAHEPQR 87
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
QY F DLVRF++T+Q+ GL+ +RIGPY +EWNYGG PVWLH IP ++FRT N
Sbjct: 88 RQYDFSENLDLVRFIRTIQKEGLYAMIRIGPYISSEWNYGGLPVWLHNIPNMEFRTHNRA 147
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
F EEMK F KI+D+M+ E LFA QGGPII+AQ+ENEYGNV AYG G Y+KW A A
Sbjct: 148 FMEEMKTFTRKIVDMMQDETLFAVQGGPIIIAQIENEYGNVMHAYGNNGTQYLKWCAQLA 207
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
+ T VPWVM QQ +AP +I++C+G+YCD F PN KP +WTEN++G + ++G P
Sbjct: 208 DSFETGVPWVMSQQSNAPQFMIDSCDGYYCDQFQPNDNHKPKIWTENWTGGYKNWGTQNP 267
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
RP ED+A+AVARFF+ GGTFQNYYMY GGTNF RTAGGP V TSYDYDAP+DEYG + Q
Sbjct: 268 HRPAEDVAYAVARFFQFGGTFQNYYMYHGGTNFKRTAGGPYVTTSYDYDAPLDEYGNLNQ 327
Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY-HKSSNDCAAFLANYDSSSDA 363
PKWGHLR+LH +K E L H G + A +Y + + C F+ N S DA
Sbjct: 328 PKWGHLRQLHNLLKSKENILTQGSSQHTDYGNMVTATVYTYDGKSTC--FIGNAHQSKDA 385
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
+ F N Y +PAWSVSILP+C + +NTAKV +Q K NE L + +
Sbjct: 386 TINFRNNEYTIPAWSVSILPNCSSEAYNTAKVNTQTT------IMVKKDNEDLEYALRWQ 439
Query: 424 WYEE-----KVG-ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM----PGQGKEVF 473
W +E K G I+G P L +Q T D SDYLWY SI + P KE
Sbjct: 440 WRQEPFVQMKDGQITGIIDLTAPKLLDQKVVTNDFSDYLWYITSIDIKGDDDPSWTKEFR 499
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L + + GH VFVN K V + + F+ KI+L G N + +LS VGL NYG
Sbjct: 500 LRVHTSGHVLHVFVNGKHVGTQHAKNGQFKFVHESKIKLTTGKNEISLLSTTVGLPNYGP 559
Query: 534 WFDVAGAGLFSVILIDLKNGK---------RDLSSGEWIYQVGVEGEYIGLDKISLANSS 584
+FD G+ + + G +DLS +W Y+VG+ GE+ S NS
Sbjct: 560 FFDNIEVGVLGPVQLVAAVGDYDYDDDEIVKDLSKNQWSYKVGLHGEHEM--HYSYENSL 617
Query: 585 FWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPS 644
+P ++ L+WYKTTF +P G P+ ++L+ +GKG AWVNG SIGRYWS+YLA
Sbjct: 618 KTWYTDAVPTDRILVWYKTTFKSPIGDDPVVVDLSGLGKGHAWVNGNSIGRYWSSYLADE 677
Query: 645 TGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVH-PGENLLVIHEELGGDPSKISL 703
GC+ KCDYRG Y ++KC C QP+Q YH+PR+++ +N LV+ EELGG P ++
Sbjct: 678 NGCSPKCDYRGPYTSNKCLSMCAQPSQRWYHVPRSFLRDDDQNTLVLFEELGGQPYYVNF 737
Query: 704 LTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGN 763
LT T +C+ E + + LAC + I+ I FAS+G+P+G
Sbjct: 738 LTVTVGKVCANAYEGNT------------------LELACNKNQVISEIKFASFGLPKGE 779
Query: 764 CGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCSI 821
CGSF+ G C + L ++ C+G+ +CSI VS LG + + LAVEA C I
Sbjct: 780 CGSFQKGNCESSEALSAIKAQCIGKDKCSIQVSERALGPTRCRV-AEDRRLAVEAVCDI 837
>gi|255563853|ref|XP_002522927.1| beta-galactosidase, putative [Ricinus communis]
gi|223537854|gb|EEF39470.1| beta-galactosidase, putative [Ricinus communis]
Length = 803
Score = 755 bits (1949), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/828 (48%), Positives = 508/828 (61%), Gaps = 63/828 (7%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
N+TYD R+L+IDG+R++L S +IHYPRS P +WPEL++ +KEGG++VIETYVFWN HEP
Sbjct: 28 NITYDSRSLIIDGQRKLLISAAIHYPRSVPGMWPELVQTAKEGGVDVIETYVFWNGHEPS 87
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
YYFE R+DLV+FVK VQ+AG++L LRIGP+ AEWN+GG PVWLH++PG FRT N
Sbjct: 88 PSNYYFEKRYDLVKFVKIVQQAGMYLILRIGPFVAAEWNFGGVPVWLHYVPGTVFRTDNY 147
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
FK M++F+ I++LMK+E LFASQGGPIILAQVENEYG E AYG GG+ Y WAA
Sbjct: 148 NFKYHMQKFMTYIVNLMKKEKLFASQGGPIILAQVENEYGFYESAYGEGGKRYAMWAAQM 207
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
AV+ N VPW+MCQQ DAP+ +INTCN FYCD F P P KP +WTEN+ GWF +FG
Sbjct: 208 AVSQNIGVPWIMCQQFDAPNSVINTCNSFYCDQFKPIFPDKPKIWTENWPGWFQTFGAPN 267
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RP ED+AF+VARFF+ GG+ QNYYMY GGTNFGRT+GGP + TSYDY+APIDEYG R
Sbjct: 268 PHRPAEDIAFSVARFFQKGGSVQNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGLAR 327
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
PKW HL+ELHKAIKLCE L++S P + LG EA +Y + S CAAFLAN D +D
Sbjct: 328 LPKWAHLKELHKAIKLCELTLLNSVPVNLSLGPSQEADVYAEESGACAAFLANMDEKNDK 387
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
V F Y LPAWSVSILPDCKNVVFNTAKV SQ + + ++ ++ + A
Sbjct: 388 TVVFRNMSYHLPAWSVSILPDCKNVVFNTAKVNSQTSIVEMVPDDLRSSDK---GTKALK 444
Query: 424 W--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV------MPGQGKEVFLN 475
W + E GI G V+ + INTTKDT+DYLWYT SI V + G+ V L
Sbjct: 445 WETFVENAGIWGTSDLVKNGFVDHINTTKDTTDYLWYTTSIFVGENEEFLKKGGRPVLL- 503
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
IES GHA FVN++L GN + F K + L G N + +LSM VGLQN G+++
Sbjct: 504 IESKGHALHAFVNQELQGTASGNGTHSPFKFKKPVSLVAGKNDIALLSMTVGLQNAGSFY 563
Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
+ GAGL SV + NG DLS+ W Y++G++GE +G+ + W S P +
Sbjct: 564 EWVGAGLTSVKMKGFNNGTIDLSTFNWTYKIGLQGEKLGMYNGIAVETVNWVATSKPPKD 623
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
+ L WYK A + LN W + W
Sbjct: 624 QPLTWYKRQIHARQ-----MLNW-------MWRINSEMILVW------------------ 653
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
T YH+PR+W P N+LVI EE GGDP+KI+ + +C+ V
Sbjct: 654 ----------------TRYHVPRSWFKPSGNILVIFEEKGGDPTKITFSRRKISGVCALV 697
Query: 716 SEADPPPVDSWKPNLGVVSSS--PQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
+E P N G SS+ V L C + I+AI FAS+G P G CGS+ G CH
Sbjct: 698 AEDYPMANLESLENAGSGSSNYKASVHLKCPKSSIISAIKFASFGSPAGACGSYSEGECH 757
Query: 774 -MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ +V+K C+ + +C + V+ S G CPG +K LAVEA CS
Sbjct: 758 DPKSISVVEKVCLNKNQCVVEVTEE--NFSKGLCPGKMKKLAVEAVCS 803
>gi|6686892|emb|CAB64746.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 741
Score = 753 bits (1945), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/712 (51%), Positives = 473/712 (66%), Gaps = 17/712 (2%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ANV+YDHR+L I +R+++ S +IHYPRS P +WP L++ +KEGG IE+YVFWN HE
Sbjct: 29 AANVSYDHRSLTIGNRRQLIISAAIHYPRSVPAMWPSLVQTAKEGGCNAIESYVFWNGHE 88
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G+YYF GR+++V+F+K VQ+AG+ + LRIGP+ AEWNYGG PVWLH++PG FR
Sbjct: 89 PSPGKYYFGGRYNIVKFIKIVQQAGMHMILRIGPFVAAEWNYGGVPVWLHYVPGTVFRAD 148
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N P+K M+ F I++L+KQE LFA QGGPIIL+QVENEYG E YG GG+ Y +W+A
Sbjct: 149 NEPWKHYMESFTTYIVNLLKQEKLFAPQGGPIILSQVENEYGYYEKDYGEGGKRYAQWSA 208
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV+ N VPW+MCQQ DAP +I+TCNGFYCD FTPN+P KP +WTEN+ GWF +FG
Sbjct: 209 SMAVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQFTPNTPDKPKIWTENWPGWFKTFGG 268
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
P RP ED+A++VARFF GG+ NYYMY GGTNFGRT+GGP + TSYDY+APIDEYG
Sbjct: 269 RDPHRPAEDVAYSVARFFGKGGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGL 328
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
R PKWGHL++LHKAI L E LIS + + LG LEA +Y SS CAAFL+N D +
Sbjct: 329 PRLPKWGHLKDLHKAIMLSENLLISGEHQNFTLGHSLEADVYTDSSGTCAAFLSNLDDKN 388
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
D V F Y LPAWSVSILPDCK VFNTAKV S+ ++ + + E L +SS
Sbjct: 389 DKAVMFRNTSYHLPAWSVSILPDCKTEVFNTAKVTSKS-------SKVEMLPEDLKSSSG 441
Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
W + EK GI G FV+ +L + INTTKDT+DYLWYT SI V + G L
Sbjct: 442 LKWEVFSEKPGIWGAADFVKNELVDHINTTKDTTDYLWYTTSITVSENEAFLKKGSSPVL 501
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
IES GH VF+NK+ + GN F + K + L G +D+LSM VGL N G++
Sbjct: 502 FIESKGHTLHVFINKEYLGTATGNGTHVPFKLKKPVALKAGETNIDLLSMTVGLANAGSF 561
Query: 535 FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
++ GAGL SV + G +L++ +W Y++GVEGE++ L K + + W + P
Sbjct: 562 YEWVGAGLTSVSIKGFNKGTLNLTNSKWSYKLGVEGEHLELFKPGNSGAVKWTVTTKPPK 621
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYL---APSTGCTKKC 651
+ L WYK P G P+ L++ SMGKG AW+NG+ IGRYW +P+ C K+C
Sbjct: 622 KQPLTWYKVVIEPPSGSEPVGLDMISMGKGMAWLNGEEIGRYWPRIARKNSPNDECVKEC 681
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISL 703
DYRG + KC CG+P+Q YH+PR+W N LVI EE GG+P KI L
Sbjct: 682 DYRGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGNELVIFEEKGGNPMKIKL 733
>gi|356522904|ref|XP_003530082.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 923
Score = 753 bits (1943), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/839 (47%), Positives = 512/839 (61%), Gaps = 51/839 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V+YD RAL IDGKRR+L S SIHYPRSTPE+WP LIRK+KEGGL+VIETYVFWN HEP R
Sbjct: 28 VSYDERALTIDGKRRILFSASIHYPRSTPEMWPYLIRKAKEGGLDVIETYVFWNAHEPQR 87
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
QY F DLVRF++T+Q+ GL+ +RIGPY +EWNYGG PVWLH IP ++FRT N
Sbjct: 88 RQYEFSENLDLVRFIRTIQKEGLYAMIRIGPYISSEWNYGGLPVWLHNIPNMEFRTHNRA 147
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
F EEMK F KI+D+M+ E LFA QGGPII+AQ+ENEYGNV AYG G Y+KW A A
Sbjct: 148 FMEEMKTFTTKIVDMMQDETLFAVQGGPIIIAQIENEYGNVMHAYGNNGTQYLKWCAQLA 207
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
+ T VPWVM QQ +AP +I++C+G+YCD F PN KP +WTEN++G + ++G P
Sbjct: 208 DSFETGVPWVMSQQSNAPQFMIDSCDGYYCDQFQPNDNHKPKIWTENWTGGYKNWGTQNP 267
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
RP ED+A+AVARFF+ GGTFQNYYMY GGTNF RTAGGP V TSYDYDAP+DEYG + Q
Sbjct: 268 HRPAEDVAYAVARFFQFGGTFQNYYMYHGGTNFKRTAGGPYVTTSYDYDAPLDEYGNLNQ 327
Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY-HKSSNDCAAFLANYDSSSDA 363
PKWGHLR+LH +K E L + G + A +Y + + C F+ N S DA
Sbjct: 328 PKWGHLRQLHNLLKSKENILTQGSSQNTDYGNMVTATVYTYDGKSTC--FIGNAHQSKDA 385
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
+ F N Y +PAWSVSILP+C + +NTAKV +Q K NE L + +
Sbjct: 386 TINFRNNEYTIPAWSVSILPNCSSEAYNTAKVNTQTT------IMVKKDNEDLEYALRWQ 439
Query: 424 WYEE-----KVG-ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM----PGQGKEVF 473
W +E K G I+G P L +Q T D SDYLWY SI + P KE
Sbjct: 440 WRQEPFVQMKDGQITGIIDLTAPKLLDQKVVTNDFSDYLWYITSIDIKGDDDPSWTKEFR 499
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L + + GH VFVN K V + + F+ KI+L G N + +LS VGL NYG
Sbjct: 500 LRVHTSGHVLHVFVNGKHVGTQHAKNGQFKFVHESKIKLTTGKNEISLLSTTVGLPNYGP 559
Query: 534 WFDVAGAGLFSVILIDLKNGK---------RDLSSGEWIYQVGVEGEYIGLDKISLANSS 584
+FD G+ + + G +DLS +W Y+VG+ GE+ S NS
Sbjct: 560 FFDNIEVGVLGPVQLVAAVGDYDYDDDEIVKDLSKNQWSYKVGLHGEHEM--HYSYENSL 617
Query: 585 FWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPS 644
+P ++ L+WYKTTF +P G P+ ++L+ +GKG AWVNG SIGRYWS+YLA
Sbjct: 618 KTWYTDAVPTDRILVWYKTTFKSPIGDDPVVVDLSGLGKGHAWVNGNSIGRYWSSYLADE 677
Query: 645 TGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPG-ENLLVIHEELGGDPSKISL 703
GC+ KCDYRG Y ++KC C QP+Q YH+PR+++ +N LV+ EELGG P ++
Sbjct: 678 NGCSPKCDYRGPYTSNKCLSMCAQPSQRWYHVPRSFLRDNDQNTLVLFEELGGQPYYVNF 737
Query: 704 LTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGN 763
LT T +C+ E + + LAC + I+ I FAS+G+P+G
Sbjct: 738 LTVTVGKVCANAYEGN------------------TLELACNKNQVISEIKFASFGLPKGE 779
Query: 764 CGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCSI 821
CGSF+ G C + L ++ C+G+ +CSI VS LG + + LAVEA C I
Sbjct: 780 CGSFQKGNCESSEALSAIKAQCIGKDKCSIQVSERTLGPTRCRV-AEDRRLAVEAVCDI 837
>gi|297793967|ref|XP_002864868.1| beta-galactosidase 10 [Arabidopsis lyrata subsp. lyrata]
gi|297310703|gb|EFH41127.1| beta-galactosidase 10 [Arabidopsis lyrata subsp. lyrata]
Length = 740
Score = 753 bits (1943), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/712 (51%), Positives = 475/712 (66%), Gaps = 17/712 (2%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ANV+YDHR+L I +R+++ S +IHYPRS P +WP L++ +KEGG IE+YVFWN HE
Sbjct: 28 AANVSYDHRSLSIGNRRQLIISAAIHYPRSVPAMWPSLVQTAKEGGCNAIESYVFWNGHE 87
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P +YYF GR+++V+F+K VQ+AG+ + LRIGP+ AEWNYGG PVWLH++PG FR
Sbjct: 88 PSPRKYYFGGRYNIVKFIKIVQQAGMHMILRIGPFVAAEWNYGGVPVWLHYVPGTVFRAD 147
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N P+K M+ F I++L+K+E LFA QGGPIIL+QVENEYG E YG GG+ Y +W+A
Sbjct: 148 NEPWKHYMESFTTYIVNLLKKEKLFAPQGGPIILSQVENEYGYYEKDYGEGGKRYAQWSA 207
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV+ N VPW+MCQQ DAP +I+TCNGFYCD FTPN+P KP +WTEN+ GWF +FG
Sbjct: 208 SMAVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQFTPNTPDKPKIWTENWPGWFKTFGG 267
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
P RP ED+A++VARFF GG+ NYYMY GGTNFGRT+GGP + TSYDY+APIDEYG
Sbjct: 268 RDPHRPAEDVAYSVARFFGKGGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGL 327
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
R PKWGHL++LHKAI L E LI+ + + LG LEA +Y SS CAAFL+N D +
Sbjct: 328 PRLPKWGHLKDLHKAIMLSENLLINGEHQNFTLGHSLEADVYTDSSGTCAAFLSNLDDKN 387
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
D V F Y LPAWSVSILPDCKN VFNTAKV S+ F++ + + E L +SS
Sbjct: 388 DKTVMFRNTSYHLPAWSVSILPDCKNEVFNTAKVTSK-------FSKVEMLPEDLRSSSG 440
Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
W + EK GI G FV+ +L + INTTKDT+DYLWYT SI V + G L
Sbjct: 441 LKWEVFSEKPGIWGEADFVKNELVDHINTTKDTTDYLWYTTSITVSTNEEFLKKGSPPVL 500
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
IES GH VF+NK+ + GN F + K + L G N +D+LSM VGL N G++
Sbjct: 501 FIESKGHTLHVFINKEYLGTATGNGTHVPFKLKKSVALKAGENNIDLLSMTVGLSNAGSF 560
Query: 535 FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
++ GAGL SV + G +L++ +W Y++GV+G ++ L K + + W + P
Sbjct: 561 YEWVGAGLTSVSIKGFNKGTLNLTNSKWSYKLGVQGVHLELFKPGDSGAVKWTVTTKPPK 620
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYW---SAYLAPSTGCTKKC 651
+ L WYK P G P+ L++ SMGKG AW+NG+ IGRYW + P+ C K+C
Sbjct: 621 KQPLTWYKVVIDPPSGSEPVGLDMMSMGKGMAWLNGEEIGRYWPRIARKSTPNDECVKEC 680
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISL 703
DYRG + KC CG+P+Q YH+PR+W N LVI EE GGDP KI+L
Sbjct: 681 DYRGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGNELVIFEEKGGDPMKITL 732
>gi|1352075|sp|P49676.1|BGAL_BRAOL RecName: Full=Beta-galactosidase; Short=Lactase; Flags: Precursor
gi|669059|emb|CAA59162.1| beta-galactosidase [Brassica oleracea]
Length = 828
Score = 749 bits (1935), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/834 (45%), Positives = 522/834 (62%), Gaps = 45/834 (5%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
S V++D RA+ IDG+RR+L SGSIHYPRST ++WP+LI K+K+GGL+ IETYVFWN HE
Sbjct: 24 STIVSHDERAITIDGQRRILLSGSIHYPRSTSDMWPDLISKAKDGGLDTIETYVFWNAHE 83
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P R QY F G DLVRF+KT+Q AGL+ LRIGPY CAEWNYGGFPVWLH +P ++FRT
Sbjct: 84 PSRRQYDFSGNLDLVRFIKTIQSAGLYSVLRIGPYVCAEWNYGGFPVWLHNMPDMKFRTI 143
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N F EM+ F KI+++MK+E+LFASQGGPIILAQ+ENEYGNV +YG G+ Y+ W A
Sbjct: 144 NPGFMNEMQNFTTKIVNMMKEESLFASQGGPIILAQIENEYGNVISSYGAEGKAYIDWCA 203
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
+ A +L+ VPW+MCQQ AP P+I TCNGFYCD + P++PS P MWTEN++GWF ++G
Sbjct: 204 NMANSLDIGVPWIMCQQPHAPQPMIETCNGFYCDQYKPSNPSSPKMWTENWTGWFKNWGG 263
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
P+R EDLAF+VARFF+TGGTFQNYYMY GGTNFGR AGGP + TSYDYDAP+DEYG
Sbjct: 264 KHPYRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTSYDYDAPLDEYGN 323
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+ QPKWGHL++LH +K E+ L + + LG + A +Y ++ + F+ N ++++
Sbjct: 324 LNQPKWGHLKQLHTLLKSMEKPLTYGNISTIDLGNSVTATVY-STNEKSSCFIGNVNATA 382
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQ-----RNNGDHPFAQQKNVNELL 416
DA V F G Y +PAWSVS+LPDC +NTA+V +Q ++ D P E L
Sbjct: 383 DALVNFKGKDYNVPAWSVSVLPDCDKEAYNTARVNTQTSIITEDSCDEP--------EKL 434
Query: 417 LASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV---MPGQGKEVF 473
+ + +K + G+ + L +Q + T D SDYLWY +H+ P + +
Sbjct: 435 KWTWRPEFTTQKTILKGSGDLIAKGLVDQKDVTNDASDYLWYMTRVHLDKKDPIWSRNMS 494
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L + S H +VN K V + ++ KK+ L G N L +LS+ VGLQNYG
Sbjct: 495 LRVHSNAHVLHAYVNGKYVGNQIVRDNKFDYRFEKKVNLVHGTNHLALLSVSVGLQNYGP 554
Query: 534 WFDVAGAGLFS-VILIDLKNG---KRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG 589
+F+ G+ V L+ K ++DLS +W Y++G+ G L + A K
Sbjct: 555 FFESGPTGINGPVKLVGYKGDETIEKDLSKHQWDYKIGLNGFNHKLFSMKSAGHHHRKWS 614
Query: 590 S-TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
+ LP ++ L WYK F AP GK P+ ++L +GKG+ W+NGQSIGRYW ++ + GCT
Sbjct: 615 TEKLPADRMLSWYKANFKAPLGKDPVIVDLNGLGKGEVWINGQSIGRYWPSFNSSDEGCT 674
Query: 649 KKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVH-PGENLLVIHEELGGDPSKISLLTKT 707
++CDYRG Y + KC CG+P Q YH+PR++++ G N + + EE+GGDPS + T
Sbjct: 675 EECDYRGEYGSDKCAFMCGKPTQRWYHVPRSFLNDKGHNTITLFEEMGGDPSMVKFKTVV 734
Query: 708 GQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
+C+ E + +V L+C I+A+ FAS+G P G CGSF
Sbjct: 735 TGRVCAKAHEHN------------------KVELSCNNR-PISAVKFASFGNPSGQCGSF 775
Query: 768 RPGACH--MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G+C D + +V K CVG++ C++ VSS G S C K L VE C
Sbjct: 776 AAGSCEGAKDAVKVVAKECVGKLNCTMNVSSHKFG-SNLDCGDSPKRLFVEVEC 828
>gi|297793199|ref|XP_002864484.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297310319|gb|EFH40743.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 726
Score = 749 bits (1934), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/718 (51%), Positives = 491/718 (68%), Gaps = 27/718 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ A+V+YD +A++I+G+RR+L SGSIHYPRSTPE+WP LI+K+KEGGL+VIETYVFWN H
Sbjct: 25 VKASVSYDRKAVIINGQRRILLSGSIHYPRSTPEMWPGLIQKAKEGGLDVIETYVFWNGH 84
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP GQYYF R+DLV+F+K V +AGL+++LRIGPY CAEWN+GGFPVWL F+PG+ FRT
Sbjct: 85 EPSPGQYYFGDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKFVPGMAFRT 144
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILA--QVENEYGNVEWAYGVGGELYVK 178
N PFK MK+F KI+ +MK E LF +QGGPIILA Q+ENEYG VEW G G+ Y K
Sbjct: 145 DNEPFKAAMKKFTEKIVWMMKAEKLFQTQGGPIILAQGQIENEYGPVEWEIGAPGKAYTK 204
Query: 179 WAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLS 238
W A A+ L+T VPW+MC+QEDAP PII+TCNG+YC+ F PNS +KP MWTEN++GW+
Sbjct: 205 WVAQMALGLSTGVPWIMCKQEDAPSPIIDTCNGYYCEDFKPNSSNKPKMWTENWTGWYTE 264
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDE 298
FG AVP+RPVED+A++VARF + GG+F NYYMY GGTNF RTA G +A+SYDYDAP+DE
Sbjct: 265 FGGAVPYRPVEDIAYSVARFIQKGGSFVNYYMYHGGTNFDRTA-GEFMASSYDYDAPLDE 323
Query: 299 YGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYD 358
YG R+PK+ HL+ LHK IKL E L+S+D T LGAK EA+++ S + CAAFL+N D
Sbjct: 324 YGLPREPKYSHLKALHKVIKLSEPALLSADATVTSLGAKQEAYVFW-SKSSCAAFLSNKD 382
Query: 359 SSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
SS A V F G Y LP WSVSILPDCK +NTAKV + P + ++
Sbjct: 383 ESSAARVMFRGFPYVLPPWSVSILPDCKTEFYNTAKV-------NAPSVHR----NMVPT 431
Query: 419 SSAFSW--YEEKVGISGNR-SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GK 470
+ FSW + E + +F R L EQI+ T D SDY WY I + G+ G
Sbjct: 432 GARFSWGSFNEATPTANEAGTFARNGLVEQISMTWDKSDYFWYLTDITIGSGETFLKTGD 491
Query: 471 EVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQN 530
+ S GHA VFVN +L YG D +KI+L+ G+N L +LS+ VGL N
Sbjct: 492 FPLFTVMSAGHALHVFVNGQLSGTAYGGLDHPKLTFTQKIKLHAGVNKLALLSVAVGLPN 551
Query: 531 YGAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG 589
G F+ G+ V L + +G D+S +W Y++GV+GE + L + ++ W QG
Sbjct: 552 VGTHFEQWNKGVLGPVTLKGVNSGTWDMSKWKWSYKIGVKGEALSLHTDTESSGVRWTQG 611
Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
S + + L WYK+TF P G PLAL++ +MGKGQ W+NG++IGR+W AY A G
Sbjct: 612 SFVAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQVWINGRNIGRHWPAYKA--QGSCG 669
Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
+C+Y G+++A KC +CG+ +Q YH+PR+W+ +NL+V+ EE GGDP+ ISL+ +T
Sbjct: 670 RCNYAGTFNAKKCLSNCGEASQRWYHVPRSWLK-SQNLIVVFEEWGGDPNGISLVKRT 726
>gi|2209358|gb|AAB61470.1| beta-D-galactosidase [Mangifera indica]
Length = 663
Score = 749 bits (1933), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/644 (55%), Positives = 460/644 (71%), Gaps = 22/644 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ A V+YDH+A++IDG+RR+L SGSIHYPRSTP++WP+LI+K+K+G ++VI+TYVFWN H
Sbjct: 30 VEATVSYDHKAIIIDGQRRILISGSIHYPRSTPQMWPDLIQKAKDG-VDVIQTYVFWNGH 88
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G+YYFE R+DLVRF+K VQ+AGL++HLRIGPY CAEWN+GGFPVWL ++PGI+FRT
Sbjct: 89 EPSPGKYYFEDRYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIEFRT 148
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M++F KI+ +MK E LF +QGGPIIL+Q+ENE+G VEW G G+ Y KWA
Sbjct: 149 DNEPFKAAMQKFTEKIVSMMKAEKLFETQGGPIILSQIENEFGPVEWEIGAPGKAYTKWA 208
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L+T VPWVMC+Q+DAPDP+INTCNGFYC+ F PN +KP MWTEN++GWF +FG
Sbjct: 209 AQMAVGLDTGVPWVMCKQDDAPDPVINTCNGFYCENFVPNQKNKPKMWTENWTGWFTAFG 268
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
P RP ED+AF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 269 GPTPQRPAEDVAFSVARFIQNGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 328
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+R+PKWGHLR+LHKAIKLCE L+S+DPT LG E H+++ S CAAFLANYD++
Sbjct: 329 LLREPKWGHLRDLHKAIKLCESALVSTDPTVTSLGNNQEVHVFNPKSGSCAAFLANYDTT 388
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V F Y LP WS+SILPDCK VFNTA++ +Q + Q V S
Sbjct: 389 SSAKVNFKIMQYELPPWSISILPDCKTAVFNTARLGAQSS-----LKQMTPV-------S 436
Query: 421 AFSW---YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
FSW EE S +++F L EQ+N T+D SDYLWY +I++ + G++
Sbjct: 437 TFSWQSYIEESASSSDDKTFTTDGLWEQLNVTRDASDYLWYMTNINIDSNEGFLKNGQDP 496
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L I S GHA VF+N +L YG D ++ +++ G+N L +LS+ VGLQN G
Sbjct: 497 LLTIWSAGHALHVFINGQLSGTVYGGVDNPKLTFSQNVKMRVGVNQLSLLSISVGLQNVG 556
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
F+ G+ V L L G RDLS +W Y++G++GE + L +S ++S W +GS+
Sbjct: 557 THFEQWNTGVLGPVTLRGLNEGTRDLSKQQWSYKIGLKGEDLSLHTVSGSSSVEWVEGSS 616
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGR 635
L + L WYKTTF AP G PLAL++++MGKG W+N QSIGR
Sbjct: 617 LAQKQPLTWYKTTFNAPAGNEPLALDMSTMGKGLIWINSQSIGR 660
>gi|413926109|gb|AFW66041.1| hypothetical protein ZEAMMB73_706783 [Zea mays]
Length = 785
Score = 746 bits (1927), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/762 (49%), Positives = 487/762 (63%), Gaps = 76/762 (9%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V+YDHR+LVI+G+RR+L SGSIHYPRS PE+WP LI+K+K+GGL+V++TYVFWN HEP +
Sbjct: 40 VSYDHRSLVINGRRRILISGSIHYPRSAPEMWPGLIQKAKDGGLDVVQTYVFWNGHEPAQ 99
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
GQYYF R+DLVRFVK V++AGL++HLR+GPY CAEWN+GGFPVWL ++PGI+FRT N P
Sbjct: 100 GQYYFADRYDLVRFVKLVRQAGLYVHLRVGPYVCAEWNFGGFPVWLKYVPGIRFRTDNGP 159
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK M++F+ KI+ +MK E LF QGGPII+AQVENE+G +E G GG+ Y WAA A
Sbjct: 160 FKAAMQKFVEKIVSMMKSEGLFEWQGGPIIMAQVENEFGPMESVVGSGGKPYAHWAAQMA 219
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
V N VPWVMC+Q+DAPDP+INTCNGFYCD FTPN+ KP MWTE ++GWF FG A P
Sbjct: 220 VGTNAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNNKHKPTMWTEAWTGWFTKFGGAAP 279
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY----- 299
RPVEDLAFAVARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDE+
Sbjct: 280 HRPVEDLAFAVARFVQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGMQWL 339
Query: 300 --------------------------------------------GFIRQPKWGHLRELHK 315
G +RQPKWGHLR +H+
Sbjct: 340 LPSLINLNSHRLPRDICRKSSQCGFYLSVVHTWNFWGGGWVYIAGLLRQPKWGHLRNMHR 399
Query: 316 AIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP 375
AIK E L+S DPT + +G +A+++ + CAAFL+NY S + F+G Y LP
Sbjct: 400 AIKQAEPALVSGDPTIRSIGNYEKAYVFKSKNGACAAFLSNYHVKSAVRIRFDGRHYDLP 459
Query: 376 AWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW--YEEKVGISG 433
AWS+SILPDCK VFNTA V + + ++ F+W Y E
Sbjct: 460 AWSISILPDCKTAVFNTATV-----------KEPTLLPKMSPVMHRFAWQSYSEDTNSLD 508
Query: 434 NRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESLGHAALVFVN 488
+ +F R L EQ++ T D SDYLWYT +++ + G+ L++ S GH+ VFVN
Sbjct: 509 DSAFARDGLIEQLSLTWDKSDYLWYTTHVNIGSNERFLKSGQWPQLSVYSAGHSMQVFVN 568
Query: 489 KKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VIL 547
+ YG +D + +++ +G N + ILS VGL N G F++ G+ V L
Sbjct: 569 GRSYGSVYGGYDNPKLTFSGYVKMWQGSNKISILSSAVGLPNNGDHFELWNVGVLGPVTL 628
Query: 548 IDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK--QGSTLPVNKSLIWYKTTF 605
L GKRDLS WIYQVG++GE +GL ++ +++ W G T P L W+K F
Sbjct: 629 SGLNEGKRDLSHQRWIYQVGLKGESLGLHTVTGSSAVEWAGPGGGTQP----LTWHKALF 684
Query: 606 LAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKH 665
AP G P+AL++ SMGKGQ WVNG+ GRYWS Y A S GC +C Y G+Y +C +
Sbjct: 685 NAPAGSDPVALDMGSMGKGQVWVNGRHAGRYWS-YRAHSRGC-GRCSYAGTYREDQCTSN 742
Query: 666 CGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
CG +Q YH+PR+W+ P NLLV+ EE GGD + +SL T+T
Sbjct: 743 CGDLSQRWYHVPRSWLKPSGNLLVVLEEYGGDLAGVSLATRT 784
>gi|449476344|ref|XP_004154711.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 803
Score = 746 bits (1926), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/835 (45%), Positives = 526/835 (62%), Gaps = 49/835 (5%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ NV+YD A++I+G+RRV+ SGSIHYPRST +WP+LI+K+K+GGL+ IETY+FW+ H
Sbjct: 1 MGDNVSYDSNAIIINGERRVIFSGSIHYPRSTDAMWPDLIQKAKDGGLDAIETYIFWDRH 60
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP R +Y F G + ++F + VQ+AGL++ +RIGPY CAEWNYGGFP+WLH +PGIQ RT
Sbjct: 61 EPQRQKYDFSGHLNFIKFFQLVQDAGLYIVMRIGPYVCAEWNYGGFPLWLHNMPGIQLRT 120
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N +K EM F KI+++ KQ NLFASQGGPIILAQ+ENEYGNV YG G+ Y+ W
Sbjct: 121 DNQVYKNEMLTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGNAGKAYINWC 180
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A +LN VPW+MCQQ DAP PIINTCNGFYCD F+PN+P P M+TEN+ GWF +G
Sbjct: 181 AQMAESLNIGVPWIMCQQSDAPQPIINTCNGFYCDSFSPNNPKSPKMFTENWVGWFKKWG 240
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
P+R ED+AF+VARFF++GG F NYYMY GGTNFGRT+GGP + TSYDY+AP+DEYG
Sbjct: 241 DKDPYRSAEDVAFSVARFFQSGGVFNNYYMYHGGTNFGRTSGGPFITTSYDYNAPLDEYG 300
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLE-AHIYHKSSNDCAAFLANYDS 359
+ QPKWGHL++LH +IKL E+ L + +++ G+ + + ++ + FL+N D
Sbjct: 301 NLNQPKWGHLKQLHSSIKLGEKILTNGTHSNKTFGSFVTLTKFSNPTTKERFCFLSNTDD 360
Query: 360 SSDANVTFNGN-VYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
++DA + + YF+PAWSVSI+ CK VFNTAK+ SQ + F + +N E +
Sbjct: 361 TNDATIDLQADGKYFVPAWSVSIIDGCKKEVFNTAKINSQTS----MFVKVQNEKENVKL 416
Query: 419 SSAFSWYEEKVG--ISGNRSFVRPDLAEQINTTKDTSDYLWY--------TASIHVMPGQ 468
S + W E + + G +F L EQ TT D+SDYLWY T+SIH
Sbjct: 417 S--WVWAPEAMSDTLQGKGTFKENLLLEQKGTTIDSSDYLWYMTNVETNGTSSIH----- 469
Query: 469 GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGL 528
V L + + GH FVN + + +GN+ +F+ K I L G N + +LS VGL
Sbjct: 470 --NVTLQVNTKGHVLHAFVNTRYIGSQWGNNG-QSFVFEKPILLKAGTNIITLLSATVGL 526
Query: 529 QNYGAWFDVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFW 586
+NY A++D G+ + LI N +LSS W Y+VG+ GE L + + W
Sbjct: 527 KNYDAFYDTLPTGIDGGPIYLIGDGNVTTNLSSNLWSYKVGLNGEIKQLYNPVFSQETSW 586
Query: 587 KQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTG 646
+ + + + WYKT+F P G P+ L++ MGKG+AW+NGQSIGR+W +++A +
Sbjct: 587 NTLNKNSIGRRMTWYKTSFKTPSGIDPVTLDMQGMGKGEAWINGQSIGRFWPSFIAGNDN 646
Query: 647 CTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
C++ CDYRG+YD SKC +CG P+Q YHIPR+++ N LV+ EE+GG P ++S+ T
Sbjct: 647 CSETCDYRGAYDPSKCVGNCGNPSQRWYHIPRSFLSNNTNTLVLFEEIGGSPQQVSVQTI 706
Query: 707 TGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGS 766
T IC +E + L+C+ + I+ I FASYG P+G CGS
Sbjct: 707 TIGTICGNANEGS------------------TLELSCQGEYIISEIQFASYGNPKGKCGS 748
Query: 767 FRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
F+ G+ + + +++K C CS+ VS+ G+ G L L V+A CS
Sbjct: 749 FKQGSWDVTNSALLLEKTCKDMKSCSVDVSAKLFGL--GDAVNLSARLVVQALCS 801
>gi|357437609|ref|XP_003589080.1| Beta-galactosidase [Medicago truncatula]
gi|355478128|gb|AES59331.1| Beta-galactosidase [Medicago truncatula]
Length = 718
Score = 746 bits (1925), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/714 (52%), Positives = 484/714 (67%), Gaps = 23/714 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
++A+V+YDH+ALVIDG+RR+L SGSIHYPRSTPE+WP+L +K+K+GGL+VI+TYVFWN H
Sbjct: 21 VTASVSYDHKALVIDGQRRILISGSIHYPRSTPEMWPDLFQKAKDGGLDVIQTYVFWNGH 80
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G Y + R D V+ K Q+A L +HLR+ P + GFPVWL ++PG+ FRT
Sbjct: 81 EPSPGNYTLKDRLDWVKLSKLAQQAVLNVHLRMVP------TFVGFPVWLKYVPGMAFRT 134
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M++F KI+ +MK E+LF +QGGPII++Q+ENEYG VEW G G+ Y KWA
Sbjct: 135 DNEPFKAAMQKFTTKIVTMMKAESLFQTQGGPIIMSQIENEYGPVEWEIGAPGKAYTKWA 194
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L+T VPW MC+QEDAPDP+I+TCNG+YC+ FTPN KP MWTEN+SGW+ FG
Sbjct: 195 AQMAVGLDTGVPWDMCKQEDAPDPVIDTCNGYYCENFTPNENFKPKMWTENWSGWYTDFG 254
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
A+ RP EDLA++VA F + G+F NYYMY GGTNFGRT+ G +ATSYDYDAPIDEYG
Sbjct: 255 GAISHRPTEDLAYSVATFIQNRGSFVNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYG 314
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAK-LEAHIYHKSSNDCAAFLANYDS 359
+PKW HL+ LHKAIK CE LIS DPT LG K LEAH+Y+ +++ CAAFLANYD+
Sbjct: 315 LPNEPKWSHLKNLHKAIKQCEPALISVDPTVTWLGNKNLEAHVYYVNTSICAAFLANYDT 374
Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
S A VTF Y LP WSVSILPDCK VVFNTA V NG H F ++ E
Sbjct: 375 KSAATVTFGNGQYDLPPWSVSILPDCKTVVFNTATV-----NG-HSFHKRMTPVETTFDW 428
Query: 420 SAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
++S EE S + S + L EQIN T+D+SDYLWY +++ P + G+ L
Sbjct: 429 QSYS--EEPAYSSDDDSIIANALWEQINVTRDSSDYLWYLTDVNISPSESFIKNGQFPTL 486
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
I S GH VFVN +L YG D ++ + L G N + +LS+ VGL N G
Sbjct: 487 TINSAGHVLHVFVNGQLSGTVYGGLDNPKVTFSESVNLKVGNNKISLLSVAVGLPNVGLH 546
Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
F+ G+ V L L G RDLS +W Y+VG++GE + L I+ ++S W QGS+L
Sbjct: 547 FETWNVGVLGPVRLKGLDEGTRDLSWQKWSYKVGLKGESLSLHTITGSSSIDWTQGSSLA 606
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ L WYKTTF AP G P+AL+++SMGKG+ W+N QSIGR+W AY+A G +C+Y
Sbjct: 607 KKQPLTWYKTTFDAPSGNDPVALDMSSMGKGEIWINDQSIGRHWPAYIA--HGNCDECNY 664
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
G++ KC+ +CG+P Q YHIPR+W+ N+LV+ EE GGDP+ ISL+ +T
Sbjct: 665 AGTFTNPKCRTNCGEPTQKWYHIPRSWLSSSGNVLVVLEEWGGDPTGISLVKRT 718
>gi|326517964|dbj|BAK07234.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 616
Score = 746 bits (1925), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/618 (58%), Positives = 442/618 (71%), Gaps = 12/618 (1%)
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
QY FEGR DLVRFVK +AGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+ RT N PF
Sbjct: 1 QYDFEGRNDLVRFVKAAADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKLRTDNEPF 60
Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
K EM+RF K++ MK L+ASQGGPIIL+Q+ENEYGN+ +YG G+ Y++WAA AV
Sbjct: 61 KTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIAASYGAAGKSYIRWAAGMAV 120
Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
L+T VPWVMCQQ DAP+P+INTCNGFYCD FTP+ PS+P +WTEN+SGWFLSFG AVP+
Sbjct: 121 ALDTGVPWVMCQQTDAPEPLINTCNGFYCDQFTPSLPSRPKLWTENWSGWFLSFGGAVPY 180
Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQP 305
RP EDLAFAVARF++ GGT QNYYMY GGTNFGR++GGP ++TSYDYDAPIDEYG +RQP
Sbjct: 181 RPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDEYGLVRQP 240
Query: 306 KWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANV 365
KWGHLR++HKAIK+CE LI++DP++ LG EAH+Y KS + CAAFLAN D SD V
Sbjct: 241 KWGHLRDVHKAIKMCEPALIATDPSYMSLGQNAEAHVY-KSGSLCAAFLANIDDQSDKTV 299
Query: 366 TFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQ------RNNGDHPFAQQKNVNELLLAS 419
TFNG Y LPAWSVSILPDCKNVV NTA++ SQ RN G A + E LA+
Sbjct: 300 TFNGKAYKLPAWSVSILPDCKNVVLNTAQINSQVASTQMRNLGFSTQASDGSSVEAELAA 359
Query: 420 SAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ----GKEVFLN 475
S++S+ E VGI+ + +P L EQINTT D SD+LWY+ SI V G+ G + L
Sbjct: 360 SSWSYAVEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVAGGEPYLNGSQSNLL 419
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ SLGH VF+N KL G+ + + + L G N +D+LS VGL NYGA+F
Sbjct: 420 VNSLGHVLQVFINGKLAGSSKGSASSSLISLTTPVTLVTGKNKIDLLSATVGLTNYGAFF 479
Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
D+ GAG+ + + G DLSS EW YQ+G+ GE + L S A S W ++ P N
Sbjct: 480 DLVGAGITGPVKLTGPKGTLDLSSAEWTYQIGLRGEDLHLYNPSEA-SPEWVSDNSYPTN 538
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
L WYK+ F AP G P+A++ MGKG+AWVNGQSIGRYW +AP +GC C+YRG
Sbjct: 539 NPLTWYKSKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNIAPQSGCVNSCNYRG 598
Query: 656 SYDASKCQKHCGQPAQTL 673
SY A+KC K CGQP+Q L
Sbjct: 599 SYSATKCLKKCGQPSQIL 616
>gi|357484445|ref|XP_003612510.1| Beta-galactosidase [Medicago truncatula]
gi|355513845|gb|AES95468.1| Beta-galactosidase [Medicago truncatula]
Length = 828
Score = 744 bits (1920), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/831 (45%), Positives = 517/831 (62%), Gaps = 43/831 (5%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V YD AL+I+G+RR++ SG+IHYPRST ++WP+L++K+K+GGL+ IETY+FW+ HE +R
Sbjct: 25 VKYDSNALIINGERRLIFSGAIHYPRSTVDMWPDLVQKAKDGGLDAIETYIFWDRHEQVR 84
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G+Y F G D V+F KT+QEAGL+ +RIGPY+CAEWNYGGFPVWLH IPGI+ RT N
Sbjct: 85 GRYNFSGNLDFVKFFKTIQEAGLYGIIRIGPYSCAEWNYGGFPVWLHQIPGIEMRTDNAA 144
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
+K EM+ F+ KII++ K+ NLFASQGGPIILAQ+ENEYG++ W + G+ Y+KWAA A
Sbjct: 145 YKNEMQIFVTKIINVAKEANLFASQGGPIILAQIENEYGDIMWNFKEPGKAYIKWAAQMA 204
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
+ N VPW MCQQ DAP PIINTCNG+YC F PN+P P M+TEN+ GWF +G P
Sbjct: 205 LAQNIGVPWFMCQQNDAPQPIINTCNGYYCHNFKPNNPKSPKMFTENWIGWFQKWGERAP 264
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
R ED A+AVARFF+ GG F NYYMY GGTNFGRT+GGP + TSYDYDAPI+EYG + Q
Sbjct: 265 HRTAEDSAYAVARFFQNGGVFNNYYMYHGGTNFGRTSGGPYIITSYDYDAPINEYGNLNQ 324
Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQK-LGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
PK+GHL+ LH+AIKL E+ L + + K LG + Y S FL+N ++D
Sbjct: 325 PKYGHLKFLHEAIKLGEKVLTNYTSRNDKDLGNGITLTTYTNSVGARFCFLSNDKDNTDG 384
Query: 364 NVTF-NGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
NV N YF+PAWSV+IL C VFNTAKV SQ + +K ++ +
Sbjct: 385 NVDLQNDGKYFVPAWSVTILDGCNKEVFNTAKVNSQTS------IMEKKIDNSSTNKLTW 438
Query: 423 SWYEE--KVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQG-KEVFLNIESL 479
+W E K ++G S L EQ T D SDYLWY S+ + L++E+
Sbjct: 439 AWIMEPKKDTMNGRGSIKAHQLLEQKELTLDASDYLWYMTSVDINDTSNWSNANLHVETS 498
Query: 480 GHAALVFVNKKLVAFGYGNHDFA-NFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
GH +VNK+ + GYG+ F NF K++ L G N + +LS VGL NYGA FD
Sbjct: 499 GHTLHGYVNKRYI--GYGHSQFGNNFTYEKQVSLKNGTNIITLLSATVGLANYGARFDEI 556
Query: 539 GAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
G+ V L+ + DLS+G W ++VG+ GE + + W S+ P K
Sbjct: 557 KTGISDGPVKLVGQNSVTIDLSTGNWSFKVGLNGEKRRFYDLQPRSGVAWNT-SSYPTGK 615
Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
L WYKT F +P G P+ ++L +GKG AWVNG+SIGRYW++++ + GC+ CDYRG+
Sbjct: 616 PLTWYKTQFKSPLGPNPIVVDLQGLGKGHAWVNGKSIGRYWTSWITSTAGCSDTCDYRGN 675
Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
Y KC C P+Q YH+PR++++ N L++ EE+GG+P +S LT+T + IC+ V
Sbjct: 676 YKKEKCNTGCASPSQRWYHVPRSFLNDDMNTLILFEEIGGNPQNVSFLTETTKTICANVY 735
Query: 717 EADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH-MD 775
E ++ L+C+ G I +INFAS+G P+G CGSF+ G+ ++
Sbjct: 736 EGG------------------KLELSCQNGQVITSINFASFGNPQGQCGSFKKGSWESLN 777
Query: 776 VLPIVQKACVGQIECSIPVSSAYLGV-------SAGACPGLLKALAVEAHC 819
+++ +C+G+ C V+ GV S + + LAV+A C
Sbjct: 778 SQSMMETSCIGKTGCGFTVTRDMFGVNLDPLSASKASVKDGIPRLAVQATC 828
>gi|356558952|ref|XP_003547766.1| PREDICTED: beta-galactosidase 7-like [Glycine max]
Length = 826
Score = 741 bits (1914), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/832 (45%), Positives = 512/832 (61%), Gaps = 42/832 (5%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ VTYD R+L+I+G+RRV+ SG++HYPRST ++WP++I+K+K+GGL+ IE+YVFW+ H
Sbjct: 24 FATEVTYDARSLIINGERRVIFSGAVHYPRSTVQMWPDIIQKAKDGGLDAIESYVFWDRH 83
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP+R +Y F G D ++F + +QEAGL+ LRIGPY CAEWN+GGFP+WLH +PGI+ RT
Sbjct: 84 EPVRREYDFSGNLDFIKFFQIIQEAGLYAILRIGPYVCAEWNFGGFPLWLHNMPGIELRT 143
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N +K EM+ F KI+++ K+ LFASQGGPIILAQ+ENEYGN+ YG G+ Y+KW
Sbjct: 144 DNPIYKNEMQIFTTKIVNMAKEAKLFASQGGPIILAQIENEYGNIMTDYGEAGKTYIKWC 203
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A+ N VPW+MCQQ DAP P+INTCNG YCD F PN+P P M+TEN+ GWF +G
Sbjct: 204 AQMALAQNIGVPWIMCQQHDAPQPMINTCNGHYCDSFQPNNPKSPKMFTENWIGWFQKWG 263
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
VP R ED AF+VARFF+ GG NYYMY GGTNFGRTAGGP + TSY+YDAP+DEYG
Sbjct: 264 ERVPHRSAEDSAFSVARFFQNGGILNNYYMYHGGTNFGRTAGGPYMTTSYEYDAPLDEYG 323
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+ QPKWGHL++LH AIKL E+ + + T + G ++ Y ++ + FL+N + S
Sbjct: 324 NLNQPKWGHLKQLHAAIKLGEKIITNGTRTDKDFGNEVTLTTYTHTNGERFCFLSNTNDS 383
Query: 361 SDANVTF--NGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
DANV +GN YFLPAWSV+IL C VFNTAKV SQ + V + A
Sbjct: 384 KDANVDLQQDGN-YFLPAWSVTILDGCNKEVFNTAKVNSQTS---------IMVKKSDDA 433
Query: 419 SSAFSWY----EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-GKEVF 473
S+ +W ++K + G +F L EQ T D SDYLWY S+ +
Sbjct: 434 SNKLTWAWIPEKKKDTMHGKGNFKVNQLLEQKELTFDVSDYLWYMTSVDINDTSIWSNAT 493
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L + + GH +VN + V + + NF K + L +G+N + +LS VGL NYGA
Sbjct: 494 LRVNTRGHTLRAYVNGRHVGYKFSQWG-GNFTYEKYVSLKKGLNVITLLSATVGLPNYGA 552
Query: 534 WFDVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
FD G+ V LI N DLS+ W Y++G+ GE L W+ S
Sbjct: 553 KFDKIKTGIAGGPVQLIGNNNETIDLSTNLWSYKIGLNGEKKRLYDPQPRIGVSWRTNSP 612
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
P+ +SL WYK F+AP G P+ ++L +GKG+AWVNGQSIGRYW++++ + GC+ C
Sbjct: 613 YPIGRSLTWYKADFVAPSGNDPVVVDLLGLGKGEAWVNGQSIGRYWTSWITATNGCSDTC 672
Query: 652 DYRGSY-DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
DYRG Y A KC +CG P+Q YH+PR+++ +N LV+ EE+GG+P +S T
Sbjct: 673 DYRGKYVPAQKCNTNCGNPSQRWYHVPRSFLKNDKNTLVLFEEIGGNPQNVSFQTVITGT 732
Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
IC+ V E + L+C+ G I+ I F+S+G P GNCGSF+ G
Sbjct: 733 ICAQVQEG------------------ALLELSCQGGKTISQIQFSSFGNPTGNCGSFKKG 774
Query: 771 ACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGA--CPGLLKALAVEAHC 819
D +V+ ACVG+ C V+ GV+ G + LAV+A C
Sbjct: 775 TWEATDGQSVVEAACVGRNSCGFMVTKEAFGVAIGPMNVDERVARLAVQATC 826
>gi|79517234|ref|NP_568399.4| beta-galactosidase 7 [Arabidopsis thaliana]
gi|152013363|sp|Q9SCV5.2|BGAL7_ARATH RecName: Full=Beta-galactosidase 7; Short=Lactase 7; Flags:
Precursor
gi|332005497|gb|AED92880.1| beta-galactosidase 7 [Arabidopsis thaliana]
Length = 826
Score = 741 bits (1913), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/829 (45%), Positives = 517/829 (62%), Gaps = 38/829 (4%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
S V++D RA+ I+GKRR+L SGSIHYPRST ++WP+LI K+K+GGL+ IETYVFWN HE
Sbjct: 25 STIVSHDERAITINGKRRILLSGSIHYPRSTADMWPDLINKAKDGGLDAIETYVFWNAHE 84
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P R +Y F G D+VRF+KT+Q+AGL+ LRIGPY CAEWNYGGFPVWLH +P ++FRT
Sbjct: 85 PKRREYDFSGNLDVVRFIKTIQDAGLYSVLRIGPYVCAEWNYGGFPVWLHNMPNMKFRTV 144
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N F EM+ F KI+ +MK+E LFASQGGPIILAQ+ENEYGNV +YG G+ Y+ W A
Sbjct: 145 NPSFMNEMQNFTTKIVKMMKEEKLFASQGGPIILAQIENEYGNVISSYGAEGKAYIDWCA 204
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
+ A +L+ VPW+MCQQ +AP P++ TCNGFYCD + P +PS P MWTEN++GWF ++G
Sbjct: 205 NMANSLDIGVPWLMCQQPNAPQPMLETCNGFYCDQYEPTNPSTPKMWTENWTGWFKNWGG 264
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
P+R EDLAF+VARFF+TGGTFQNYYMY GGTNFGR AGGP + TSYDY AP+DE+G
Sbjct: 265 KHPYRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTSYDYHAPLDEFGN 324
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+ QPKWGHL++LH +K E+ L + + LG ++A IY + + F+ N ++++
Sbjct: 325 LNQPKWGHLKQLHTVLKSMEKSLTYGNISRIDLGNSIKATIY-TTKEGSSCFIGNVNATA 383
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
DA V F G Y +PAWSVS+LPDC +NTAKV +Q + ++ + + SA
Sbjct: 384 DALVNFKGKDYHVPAWSVSVLPDCDKEAYNTAKVNTQTSIMTEDSSKPERLEWTWRPESA 443
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV---MPGQGKEVFLNIES 478
+K+ + G+ + L +Q + T D SDYLWY +H+ P + + L + S
Sbjct: 444 -----QKMILKGSGDLIAKGLVDQKDVTNDASDYLWYMTRLHLDKKDPLWSRNMTLRVHS 498
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKI-ELNEGINTLDILSMMVGLQNYGAWFDV 537
H +VN K V + ++ +K+ L G N + +LS+ VGLQNYG +F+
Sbjct: 499 NAHVLHAYVNGKYVGNQFVKDGKFDYRFERKVNHLVHGTNHISLLSVSVGLQNYGPFFES 558
Query: 538 AGAGLFS-VILIDLKNG---KRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
G+ V L+ K ++DLS +W Y++G+ G L I W LP
Sbjct: 559 GPTGINGPVSLVGYKGEETIEKDLSQHQWDYKIGLNGYNDKLFSIKSVGHQKWAN-EKLP 617
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ L WYK F AP GK P+ ++L +GKG+AW+NGQSIGRYW ++ + GC +CDY
Sbjct: 618 TGRMLTWYKAKFKAPLGKEPVIVDLNGLGKGEAWINGQSIGRYWPSFNSSDDGCKDECDY 677
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVH-PGENLLVIHEELGGDPSKISLLTKTGQHIC 712
RG+Y + KC CG+P Q YH+PR++++ G N + + EE+GG+PS ++ T +C
Sbjct: 678 RGAYGSDKCAFMCGKPTQRWYHVPRSFLNASGHNTITLFEEMGGNPSMVNFKTVVVGTVC 737
Query: 713 SFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
+ E + +V L+C I+A+ FAS+G P G+CGSF G C
Sbjct: 738 ARAHEHN------------------KVELSCHNR-PISAVKFASFGNPLGHCGSFAVGTC 778
Query: 773 H--MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
D V K CVG++ C++ VSS G S C K LAVE C
Sbjct: 779 QGDKDAAKTVAKECVGKLNCTVNVSSDTFG-STLDCGDSPKKLAVELEC 826
>gi|225441062|ref|XP_002284027.1| PREDICTED: beta-galactosidase-like [Vitis vinifera]
Length = 833
Score = 739 bits (1909), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/835 (46%), Positives = 515/835 (61%), Gaps = 51/835 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+T D R ++I+G+R++L SGS+HYPRSTPE+WP+LI+KSK+GGL I+TYVFW+ HEP R
Sbjct: 30 ITSDARGIMINGERKILISGSVHYPRSTPEMWPDLIQKSKDGGLNTIDTYVFWDLHEPQR 89
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
QY F G DLVRF+K +Q GL+ LRIGPY CAEW YGGFPVWLH P IQ RT N
Sbjct: 90 RQYDFTGNKDLVRFIKAIQAQGLYAVLRIGPYVCAEWTYGGFPVWLHNQPSIQLRTNNTV 149
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
+ EM+ F I+D+MK+E LFASQGGPII++Q+ENEYGNV AY G Y+ W A A
Sbjct: 150 YMSEMQTFTTMIVDMMKKEQLFASQGGPIIISQIENEYGNVMRAYHDAGVQYINWCAQMA 209
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
L+T VPW+MCQQ++AP P+INTCNG+YCD FTPN+P+ P MWTEN+SGW+ ++G + P
Sbjct: 210 AALDTGVPWIMCQQDNAPQPMINTCNGYYCDQFTPNNPNSPKMWTENWSGWYKNWGGSDP 269
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
R EDLAF+VARF++ GGTFQNYYMY GGTNFGRTAGGP + TSYDYDAP++EYG Q
Sbjct: 270 HRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLNEYGNKNQ 329
Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY-HKSSNDCAAFLANYDSSSDA 363
PKWGHLR+LH + E+ L D + A IY ++ + C F N ++ D
Sbjct: 330 PKWGHLRDLHLLLLSMEKALTYGDVKNVDYETLTSATIYSYQGKSSC--FFGNSNADRDV 387
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
+ + G Y +PAWSVSILPDC N V+NTAKV SQ + F ++ + E S ++
Sbjct: 388 TINYGGVNYTIPAWSVSILPDCSNEVYNTAKVNSQYST----FVKKGSEAENEPNSLQWT 443
Query: 424 WYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM---PGQGKEVFLNIESLG 480
W E + F +L +Q +DTSDYL+Y ++ + P GK++ L++ + G
Sbjct: 444 WRGETIQYITPGRFTASELLDQKTVAEDTSDYLYYMTTVDISNDDPIWGKDLTLSVNTSG 503
Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
H FVN + + + Y F + + L G N + +LS VGL NYG FD+
Sbjct: 504 HILHAFVNGEHIGYQYALLGQFEFQFRRSVTLQLGKNEITLLSATVGLTNYGPDFDMVNQ 563
Query: 541 GLFSVILIDLKNGKRDL-----SSGEWIYQVGVEGEYIGLDKISLANSSF--WKQGSTLP 593
G+ + I NG D+ ++ +W Y+ G+ GE KI L + + WK LP
Sbjct: 564 GIHGPVQIIASNGSADIIKDLSNNNQWAYKAGLNGE---DKKIFLGRARYNQWKS-DNLP 619
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
VN+S +WYK TF AP G+ P+ ++L +GKG+AWVNG S+GRYW +Y+A GC+ +CDY
Sbjct: 620 VNRSFVWYKATFDAPPGEDPVVVDLMGLGKGEAWVNGHSLGRYWPSYIARGEGCSPECDY 679
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
RG Y A KC +CG P+Q YH+PR+++ +N LV+ EE GG+PS ++ T T + C+
Sbjct: 680 RGPYKAEKCNTNCGNPSQRWYHVPRSFLASTDNRLVLFEEFGGNPSSVTFQTVTVGNACA 739
Query: 714 FVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGS------- 766
E + L+C+ G I+ I FAS+G P+G CG
Sbjct: 740 NAREG------------------YTLELSCQ-GRAISGIKFASFGDPQGTCGKPFATGSQ 780
Query: 767 -FRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
F G C D L I+QK CVG+ CSI VS LG C K LAVEA C
Sbjct: 781 VFEKGTCEAADSLSIIQKLCVGKYSCSIDVSEQILG--PAGCTADTKRLAVEAIC 833
>gi|357484129|ref|XP_003612351.1| Beta-galactosidase [Medicago truncatula]
gi|355513686|gb|AES95309.1| Beta-galactosidase [Medicago truncatula]
Length = 806
Score = 738 bits (1906), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/835 (46%), Positives = 514/835 (61%), Gaps = 50/835 (5%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ VTYD AL+I+G+RR++ SG+IHYPRST E+WP+LI+K+K+GGL+ IETY+FW+ H
Sbjct: 6 FATEVTYDSNALIINGERRLIFSGAIHYPRSTVEMWPDLIQKAKDGGLDAIETYIFWDRH 65
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP+R +Y F G D V+F + +Q+AGL+ +RIGPYACAEWN+GGFP WLH +PGI+ RT
Sbjct: 66 EPVRREYNFSGNLDFVKFFQLIQKAGLYAIMRIGPYACAEWNFGGFPSWLHNMPGIELRT 125
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N+ +K EM+ F +I++++K+ LFASQGGPIILAQ+ENEYG++ W Y G+ YV+WA
Sbjct: 126 NNSVYKNEMQNFTTEIVNVVKEAKLFASQGGPIILAQIENEYGDIMWNYKDAGKAYVQWA 185
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A+ N VPW+MCQQ+DAP PIINTCNG+YC F PN+P P ++TEN+ GWF +G
Sbjct: 186 AQMALAQNIGVPWIMCQQQDAPQPIINTCNGYYCHNFQPNNPKSPKIFTENWIGWFQKWG 245
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
VP R ED AF+VARFF+ GG NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 246 ERVPHRSAEDSAFSVARFFQNGGVLNNYYMYHGGTNFGRTAGGPYITTSYDYDAPIDEYG 305
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLIS-SDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
+ QPKWGHL+ LH AIKL E L + S + LG L Y SS FL+N ++
Sbjct: 306 NLNQPKWGHLKNLHAAIKLGENVLTNYSARKDEDLGNGLTLTTYTNSSGARFCFLSNNNN 365
Query: 360 SS-DANVTF-NGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLL 417
+ A V N VY +PAWSVSI+ C VFNTAKV SQ + +K+ N +
Sbjct: 366 TDLGARVDLKNDGVYIVPAWSVSIINGCNQEVFNTAKVNSQTS-----MMVKKSDN---V 417
Query: 418 ASSAFSWYEEKV-----GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-GKE 471
+S+ +W E KV I GN S L EQ T D SDYLWY S +
Sbjct: 418 SSTNLTW-EWKVEPKRDTIHGNGSLKAQKLLEQKELTLDASDYLWYMTSADINDTSIWSN 476
Query: 472 VFLNIESLGHAALVFVNKKLVAF---GYGNHDFANFLINKKIELNEGINTLDILSMMVGL 528
L + + GH+ +VN++ V + YGN F K++ L G N + +LS VGL
Sbjct: 477 ATLRVNTSGHSLHGYVNQRYVGYQFSQYGNQ----FTYEKQVSLKNGTNIITLLSATVGL 532
Query: 529 QNYGAWFDVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFW 586
NYGAWFD G+ V LI N DLS+ W Y++G+ GE L S W
Sbjct: 533 ANYGAWFDDKKTGISGGPVELIGKNNVTMDLSTNLWSYKIGLNGERRHLYDAQQNVSVAW 592
Query: 587 KQGST-LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPST 645
S+ +P+ K LIWY+ F +P G P+ ++L +GKG AWVNG SIGRYWS++++PS
Sbjct: 593 HTNSSYIPIGKPLIWYRAKFKSPFGTNPIVVDLQGLGKGHAWVNGHSIGRYWSSWISPSD 652
Query: 646 GCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
GC+ CDYRG+Y KC +CG P+Q YH+PR++++ N LV+ EE+GG+P + T
Sbjct: 653 GCSDTCDYRGNYVPVKCNTNCGSPSQRWYHVPRSFLNHDMNTLVLFEEIGGNPQSVQFQT 712
Query: 706 KTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCG 765
T IC+ V E Q L+C+ G ++ I FASYG PEG CG
Sbjct: 713 VTTGTICANVYEG------------------AQFELSCQSGQVMSQIQFASYGNPEGQCG 754
Query: 766 SFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
SF+ G + +V+ +CVG+ C V+ GV+ + + LAV+ C
Sbjct: 755 SFKKGNFDAANSQSVVEASCVGKNNCGFNVTKEMFGVTNVSS---IPRLAVQVTC 806
>gi|267026|sp|Q00662.1|BGAL_DIACA RecName: Full=Putative beta-galactosidase; Short=Lactase; AltName:
Full=SR12 protein; Flags: Precursor
gi|18328|emb|CAA40459.1| CARSR12 [Dianthus caryophyllus]
Length = 731
Score = 738 bits (1905), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/714 (50%), Positives = 483/714 (67%), Gaps = 22/714 (3%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
NV YD+RA+ I+ +RR+L SGSIHYPRSTPE+WP++I K+K+ L+VI+TYVFWN HEP
Sbjct: 29 GNVWYDYRAIKINDQRRILLSGSIHYPRSTPEMWPDIIEKAKDSQLDVIQTYVFWNGHEP 88
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G+YYFEGR+DLV+F+K + +AGLF+HLRIGP+ACAEWN+GGFPVWL ++PGI+FRT N
Sbjct: 89 SEGKYYFEGRYDLVKFIKLIHQAGLFVHLRIGPFACAEWNFGGFPVWLKYVPGIEFRTDN 148
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFKE+M+ F KI+D+MK E LF QGGPIIL Q+ENEYG VEW G G+ Y WAA
Sbjct: 149 GPFKEKMQVFTTKIVDMMKAEKLFHWQGGPIILNQIENEYGPVEWEIGAPGKAYTHWAAQ 208
Query: 183 TAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A +LN VPW+MC+Q+ D PD +I+TCNGFYC+GF P SKP MWTEN++GW+ +G
Sbjct: 209 MAQSLNAGVPWIMCKQDSDVPDNVIDTCNGFYCEGFVPKDKSKPKMWTENWTGWYTEYGK 268
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
VP+RP ED+AF+VARF + GG+F NYYM+ GGTNF TA G V+TSYDYDAP+DEYG
Sbjct: 269 PVPYRPAEDVAFSVARFIQNGGSFMNYYMFHGGTNFETTA-GRFVSTSYDYDAPLDEYGL 327
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
R+PK+ HL+ LHKAIK+CE L+SSD LG+ EAH+Y +S CAAFLANYD
Sbjct: 328 PREPKYTHLKNLHKAIKMCEPALVSSDAKVTNLGSNQEAHVYSSNSGSCAAFLANYDPKW 387
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
VTF+G + LPAWS+SILPDCK V+NTA+V + P + + ++++
Sbjct: 388 SVKVTFSGMEFELPAWSISILPDCKKEVYNTARV-------NEPSPKLHSKMTPVISNLN 440
Query: 422 FSWYEEKVGISGNR-SFVRPDLAEQINTTKDTSDYLWYTASIHVMPG------QGKEVFL 474
+ Y ++V + + +F L EQIN T D SDYLWY + V+ G +G E +L
Sbjct: 441 WQSYSDEVPTADSPGTFREKKLYEQINMTWDKSDYLWYMTDV-VLDGNEGFLKKGDEPWL 499
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
+ S GH VFVN +L YG+ ++K+++ G+N + +LS +VGL N G
Sbjct: 500 TVNSAGHVLHVFVNGQLQGHAYGSLAKPQLTFSQKVKMTAGVNRISLLSAVVGLANVGWH 559
Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
F+ G+ V L L G RDL+ W Y++G +GE ++ + S Q
Sbjct: 560 FERYNQGVLGPVTLSGLNEGTRDLTWQYWSYKIGTKGEE---QQVYNSGGSSHVQWGPPA 616
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ L+WYKTTF AP G PLAL+L SMGKGQAW+NGQSIGR+WS +A + C C+Y
Sbjct: 617 WKQPLVWYKTTFDAPGGNDPLALDLGSMGKGQAWINGQSIGRHWSNNIAKGS-CNDNCNY 675
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
G+Y +KC CG+ +Q YH+PR+W+ P NLLV+ EE GGD +SL+ +T
Sbjct: 676 AGTYTETKCLSDCGKSSQKWYHVPRSWLQPRGNLLVVFEEWGGDTKWVSLVKRT 729
>gi|297808143|ref|XP_002871955.1| beta-galactosidase 7 [Arabidopsis lyrata subsp. lyrata]
gi|297317792|gb|EFH48214.1| beta-galactosidase 7 [Arabidopsis lyrata subsp. lyrata]
Length = 826
Score = 737 bits (1902), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/829 (44%), Positives = 516/829 (62%), Gaps = 38/829 (4%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
S V++D RA+ I+GKRR+L SGSIHYPRST ++WP+LI K+K+GGL+ IETYVFWN HE
Sbjct: 25 STIVSHDERAITINGKRRILLSGSIHYPRSTADMWPDLINKAKDGGLDAIETYVFWNAHE 84
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P R +Y F G D+VRF+KT+Q+AGL+ LRIGPY CAEWNYGGFPVWLH +P ++FRT
Sbjct: 85 PKRREYDFSGNLDVVRFIKTIQDAGLYSVLRIGPYVCAEWNYGGFPVWLHNMPNMKFRTV 144
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N F EM+ F KI+++MK+E LFASQGGPIILAQ+ENEYGNV +YG G+ Y+ W A
Sbjct: 145 NPSFMNEMQNFTTKIVEMMKEEKLFASQGGPIILAQIENEYGNVISSYGAAGKAYIDWCA 204
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
+ A +L+ VPW+MCQQ +AP P++ TCNGFYCD + P +PS P MWTEN++GWF ++G
Sbjct: 205 NMANSLDIGVPWLMCQQPNAPQPMLETCNGFYCDQYEPTNPSTPKMWTENWTGWFKNWGG 264
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
P+R EDLAF+VARFF+TGGTFQNYYMY GGTNFGR AGGP + TSYDY APIDE+G
Sbjct: 265 KHPYRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTSYDYHAPIDEFGN 324
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+ QPKWGHL++LH+ +K E+ L + + LG ++A IY + + F+ N ++++
Sbjct: 325 LNQPKWGHLKQLHRVLKSMEKSLTYGNISRIDLGNSIKATIY-TTKEGSSCFIGNVNATA 383
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
+A V F G Y +PAWSVS+LP+C +NTAKV +Q + ++ + + SA
Sbjct: 384 NALVNFKGKDYHVPAWSVSVLPECDKEAYNTAKVNTQTSIMTEDSSKPEKLEWTWRPESA 443
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV---MPGQGKEVFLNIES 478
+K+ + + + L +Q + T D SDYLWY +H+ P + + L + S
Sbjct: 444 -----QKMILKSSGDLIAKGLVDQKDVTNDASDYLWYMTRVHLDKKDPLWSRNMTLRVHS 498
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKI-ELNEGINTLDILSMMVGLQNYGAWFDV 537
H +VN K V + ++ KK+ L G N + +LS+ VGLQNYGA+F+
Sbjct: 499 NAHVLHAYVNGKYVGNQFVKDGKFDYRFEKKVNHLVHGTNHISLLSVSVGLQNYGAFFES 558
Query: 538 AGAGLFS-VILIDLKNG---KRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
G+ V L+ K ++DLS +W Y++G+ G L W P
Sbjct: 559 GPTGINGPVSLVGYKGEETIEKDLSQHQWDYKIGLNGYNNKLFSTKSVGHIKWAN-EMFP 617
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
++ L WYK F AP GK P+ ++ +GKG+AW+NGQSIGRYW ++ + GC +CDY
Sbjct: 618 TSRMLTWYKAKFKAPLGKEPVIVDFNGLGKGEAWINGQSIGRYWPSFNSSDDGCKDECDY 677
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVH-PGENLLVIHEELGGDPSKISLLTKTGQHIC 712
RG Y + KC CG+P Q YH+PR+++ G N + + EE+GG+PS ++ T +C
Sbjct: 678 RGEYGSDKCAFMCGEPTQRWYHVPRSFLKASGHNTITLFEEMGGNPSMVNFKTVVVGTVC 737
Query: 713 SFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
+ E + +V L+C I+A+ FAS+G P G+CG+F G C
Sbjct: 738 ARAHEHN------------------KVELSCHNH-PISAVKFASFGNPVGHCGTFAVGTC 778
Query: 773 H--MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
D + V K CVG++ C+I VSS G S C K LAVE C
Sbjct: 779 QGDKDAVKTVAKECVGKLNCTINVSSDTFG-STLDCGDSPKKLAVELEC 826
>gi|297740029|emb|CBI30211.3| unnamed protein product [Vitis vinifera]
Length = 829
Score = 736 bits (1901), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/832 (46%), Positives = 514/832 (61%), Gaps = 49/832 (5%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+T D R ++I+G+R++L SGS+HYPRSTPE+WP+LI+KSK+GGL I+TYVFW+ HEP R
Sbjct: 30 ITSDARGIMINGERKILISGSVHYPRSTPEMWPDLIQKSKDGGLNTIDTYVFWDLHEPQR 89
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
QY F G DLVRF+K +Q GL+ LRIGPY CAEW YGGFPVWLH P IQ RT N
Sbjct: 90 RQYDFTGNKDLVRFIKAIQAQGLYAVLRIGPYVCAEWTYGGFPVWLHNQPSIQLRTNNTV 149
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
+ EM+ F I+D+MK+E LFASQGGPII++Q+ENEYGNV AY G Y+ W A A
Sbjct: 150 YMSEMQTFTTMIVDMMKKEQLFASQGGPIIISQIENEYGNVMRAYHDAGVQYINWCAQMA 209
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
L+T VPW+MCQQ++AP P+INTCNG+YCD FTPN+P+ P MWTEN+SGW+ ++G + P
Sbjct: 210 AALDTGVPWIMCQQDNAPQPMINTCNGYYCDQFTPNNPNSPKMWTENWSGWYKNWGGSDP 269
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
R EDLAF+VARF++ GGTFQNYYMY GGTNFGRTAGGP + TSYDYDAP++EYG Q
Sbjct: 270 HRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLNEYGNKNQ 329
Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY-HKSSNDCAAFLANYDSSSDA 363
PKWGHLR+LH + E+ L D + A IY ++ + C F N ++ D
Sbjct: 330 PKWGHLRDLHLLLLSMEKALTYGDVKNVDYETLTSATIYSYQGKSSC--FFGNSNADRDV 387
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
+ + G Y +PAWSVSILPDC N V+NTAKV SQ + F ++ + E S ++
Sbjct: 388 TINYGGVNYTIPAWSVSILPDCSNEVYNTAKVNSQYST----FVKKGSEAENEPNSLQWT 443
Query: 424 WYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAA 483
W E + F +L +Q +DTSDYL+Y + + P GK++ L++ + GH
Sbjct: 444 WRGETIQYITPGRFTASELLDQKTVAEDTSDYLYYMTT-NDDPIWGKDLTLSVNTSGHIL 502
Query: 484 LVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLF 543
FVN + + + Y F + + L G N + +LS VGL NYG FD+ G+
Sbjct: 503 HAFVNGEHIGYQYALLGQFEFQFRRSVTLQLGKNEITLLSATVGLTNYGPDFDMVNQGIH 562
Query: 544 SVILIDLKNGKRDL-----SSGEWIYQVGVEGEYIGLDKISLANSSF--WKQGSTLPVNK 596
+ I NG D+ ++ +W Y+ G+ GE KI L + + WK LPVN+
Sbjct: 563 GPVQIIASNGSADIIKDLSNNNQWAYKAGLNGE---DKKIFLGRARYNQWKS-DNLPVNR 618
Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
S +WYK TF AP G+ P+ ++L +GKG+AWVNG S+GRYW +Y+A GC+ +CDYRG
Sbjct: 619 SFVWYKATFDAPPGEDPVVVDLMGLGKGEAWVNGHSLGRYWPSYIARGEGCSPECDYRGP 678
Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
Y A KC +CG P+Q YH+PR+++ +N LV+ EE GG+PS ++ T T + C+
Sbjct: 679 YKAEKCNTNCGNPSQRWYHVPRSFLASTDNRLVLFEEFGGNPSSVTFQTVTVGNACANAR 738
Query: 717 EADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGS--------FR 768
E + L+C+ G I+ I FAS+G P+G CG F
Sbjct: 739 EG------------------YTLELSCQ-GRAISGIKFASFGDPQGTCGKPFATGSQVFE 779
Query: 769 PGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G C D L I+QK CVG+ CSI VS LG C K LAVEA C
Sbjct: 780 KGTCEAADSLSIIQKLCVGKYSCSIDVSEQILG--PAGCTADTKRLAVEAIC 829
>gi|449442765|ref|XP_004139151.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 7-like [Cucumis
sativus]
Length = 803
Score = 735 bits (1898), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/843 (44%), Positives = 522/843 (61%), Gaps = 65/843 (7%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ NV+YD A++I+G+RRV+ SGSIHYPRST +WP+LI+K+K+GGL+ IETY+FW+ H
Sbjct: 1 MGDNVSYDSNAIIINGERRVIFSGSIHYPRSTDAMWPDLIQKAKDGGLDAIETYIFWDRH 60
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP R +Y F G + ++F + VQ+AGL++ +RIGPY CAEWNYGGFP+WLH +PGIQ RT
Sbjct: 61 EPQRQKYDFSGHLNFIKFFQLVQDAGLYIVMRIGPYVCAEWNYGGFPLWLHNMPGIQLRT 120
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N +K EM F KI+++ KQ NLFASQGGPIILAQ+ENEYGNV YG G+ Y+ W
Sbjct: 121 DNQVYKNEMLTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGNAGKAYINWC 180
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A + N VPW+MCQQ DAP PIINTCNGFYCD F+PN+P P M+TEN+ GWF +G
Sbjct: 181 AQMAESFNIGVPWIMCQQSDAPQPIINTCNGFYCDSFSPNNPKSPKMFTENWVGWFKKWG 240
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
P+R ED+AF+VARFF++GG F NYYMY GGTNFGRT+GGP + TSYDY+AP+DEYG
Sbjct: 241 DKDPYRSAEDVAFSVARFFQSGGVFNNYYMYHGGTNFGRTSGGPFITTSYDYNAPLDEYG 300
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY----------HKSSNDC 350
+ QPKWGHL++LH +IKL E+ L + +++ G+ + + + ++ +
Sbjct: 301 NLNQPKWGHLKQLHSSIKLGEKILTNGTHSNKTFGSFVTFKTFGSFVTLTKFSNPTTKER 360
Query: 351 AAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK 410
FL+N + YF+PAWSVSI+ CK VFNTAK+ SQ + F + +
Sbjct: 361 FCFLSNTXKADGK--------YFVPAWSVSIIDGCKKEVFNTAKINSQTS----IFVKVQ 408
Query: 411 NVNELLLASSAFSWYEEKVG--ISGNRSFVRPDLAEQINTTKDTSDYLWY--------TA 460
N E + S + W E + + G +F L EQ TT D+SDYLWY T+
Sbjct: 409 NEKENVKLS--WVWAPEAMSDTLQGKGTFKENLLLEQKGTTIDSSDYLWYMTNVETNGTS 466
Query: 461 SIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLD 520
SIH V L + + GH FVN + + +GN+ +F+ K I L G N +
Sbjct: 467 SIH-------NVTLQVNTKGHVLHAFVNTRYIGSQWGNNG-QSFVFEKPILLKAGTNIIT 518
Query: 521 ILSMMVGLQNYGAWFDVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKI 578
+LS VGL+NY A++D G+ + LI N K DLSS W Y+VG+ GE L
Sbjct: 519 LLSATVGLKNYDAFYDTLPTGIDGGPIYLIGDGNVKIDLSSNLWSYKVGLNGEIKQLYNP 578
Query: 579 SLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
+ + W + + + + WYKT+F P G P+ L++ MGKG+AW+NGQSIGR+W
Sbjct: 579 VFSQETSWNTLNKNSIGRRMTWYKTSFKTPSGIDPVTLDMQGMGKGEAWINGQSIGRFWP 638
Query: 639 AYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDP 698
+++A + C++ CDYRG+YD SKC +CG P+Q YHIPR+++ N LV+ EE+GG P
Sbjct: 639 SFIAGNDNCSETCDYRGAYDPSKCVGNCGNPSQRWYHIPRSFLSNNTNTLVLFEEIGGSP 698
Query: 699 SKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYG 758
++S+ T T IC +E + L+C+ + I+ I FASYG
Sbjct: 699 QQVSVQTITIGTICGNANEGS------------------TLELSCQGEYIISEIQFASYG 740
Query: 759 IPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEA 817
P+G CGSF+ G+ + + +++K C G CS+ VS+ G+ G L L V+A
Sbjct: 741 NPKGKCGSFKQGSWDVTNSALLLEKTCKGMKSCSVDVSAKLFGL--GDAVNLSARLVVQA 798
Query: 818 HCS 820
CS
Sbjct: 799 LCS 801
>gi|224066807|ref|XP_002302225.1| predicted protein [Populus trichocarpa]
gi|222843951|gb|EEE81498.1| predicted protein [Populus trichocarpa]
Length = 798
Score = 730 bits (1884), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/830 (46%), Positives = 525/830 (63%), Gaps = 49/830 (5%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
+NVTYD R+LVI+GK +++ SGSIHYPRSTP++WP LI K++ GGL+ I+TYVFWN HEP
Sbjct: 6 SNVTYDSRSLVINGKHKIIFSGSIHYPRSTPQMWPYLISKARAGGLDAIDTYVFWNLHEP 65
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
+GQY F GR DLVRF+K V GL++ LRIGP+ +EW YGG P WLH +PGI FR+ N
Sbjct: 66 QQGQYDFSGRKDLVRFIKEVHAQGLYVCLRIGPFIESEWTYGGLPFWLHDVPGIVFRSDN 125
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M+R+ I+ ++K E L+ASQGGPIIL+Q+ENEYGNVE A+ G YVKWAA
Sbjct: 126 KPFKYHMERYAKMIVKMLKAEKLYASQGGPIILSQIENEYGNVEAAFHEKGPPYVKWAAK 185
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFG 240
AV L+T VPWVMC+Q+DAPDP+IN CNG C + F+ PNSP KP +WTEN++ + ++G
Sbjct: 186 MAVGLHTGVPWVMCKQDDAPDPVINACNGLRCGETFSGPNSPRKPAIWTENWTSVYQTYG 245
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
R ED+AF A F GG+F NYYMY GGTNFGRTA V TSY AP+DEYG
Sbjct: 246 KETRSRSAEDIAFHAALFIAKGGSFVNYYMYHGGTNFGRTA-AEYVPTSYYDQAPLDEYG 304
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+RQPK GHL+ELH AIKLC + L+S + LG EA + ++S++CAAFL N+D
Sbjct: 305 LLRQPKHGHLKELHAAIKLCRKPLLSRKWINFSLGQLQEAFAFERNSDECAAFLVNHDGR 364
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S+A V F G+ Y LP S+SILP CK V FNTA+V +Q A +++ + S
Sbjct: 365 SNATVHFKGSSYKLPPKSISILPHCKTVAFNTAQVSTQYGTR---LATRRHKFD-----S 416
Query: 421 AFSWYEEKVGI-SGNRSFVRPD-LAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
W E K I S ++S +R + L E +NTTKD+SDYLWYT H V L + S
Sbjct: 417 IEQWKEYKEYIPSFDKSSLRANTLLEHMNTTKDSSDYLWYTFRFHQNSSNAHSV-LTVNS 475
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
LGH FVN + + +G+HD +F + + + L G N + +LS+M GL + GA+ +
Sbjct: 476 LGHNLHAFVNGEFIGSAHGSHDNKSFTLQRSLPLKRGTNYVSLLSVMTGLPDAGAYLERR 535
Query: 539 GAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
AGL V I ++ D ++ W Y+VG+ GE I L + + + ++W + ++ ++ L
Sbjct: 536 VAGLRRVT-IQRQHELHDFTTYLWGYKVGLSGENIQLHRNNASVKAYWSRYAS--SSRPL 592
Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
WYK+ F AP G P+ALNLASMGKG+AWVNG+SIGRYW ++L
Sbjct: 593 TWYKSIFDAPAGNDPVALNLASMGKGEAWVNGRSIGRYWVSFLDSD-------------- 638
Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEA 718
G P QT HIPR+++ P NLLVI EE G+P ISL T + +C VS +
Sbjct: 639 --------GNPYQTWNHIPRSFLKPSGNLLVILEEERGNPLGISLGTMSITKVCGHVSIS 690
Query: 719 DPPPVDSWKPNLGVVSS-------SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
PPPV SW+ + + P+V+L C RG I+++ F+S+G P G+C ++ G+
Sbjct: 691 HPPPVISWQGENQINGTRKRKYGRRPKVQLRCPRGRKISSVLFSSFGTPSGDCETYAIGS 750
Query: 772 CHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CH + V+KAC+G+ CSIPVSS CPG+ K+L V+A C+
Sbjct: 751 CHASNSRATVEKACLGKERCSIPVSSK--NFKGDPCPGIAKSLLVDAKCA 798
>gi|255561536|ref|XP_002521778.1| beta-galactosidase, putative [Ricinus communis]
gi|223538991|gb|EEF40588.1| beta-galactosidase, putative [Ricinus communis]
Length = 828
Score = 728 bits (1879), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/826 (47%), Positives = 509/826 (61%), Gaps = 31/826 (3%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+VTYD R+L++DG+R++L SGSIHYPRSTPE+W LI K+KEGGL+VI+TYVFWN HEP
Sbjct: 23 DVTYDGRSLIVDGQRKLLFSGSIHYPRSTPEMWQSLIAKAKEGGLDVIDTYVFWNLHEPQ 82
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
GQY F GR D+VRF+K VQ GL++ LRIGP+ EW+YGG P WLH IPGI FR+ N
Sbjct: 83 PGQYDFSGRRDIVRFIKEVQAQGLYVCLRIGPFIQGEWSYGGLPFWLHDIPGIVFRSDNE 142
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK +M+ F KI+ +M+ E L+ SQGGPIIL+Q+ENEYG VE AY G YVKWAA
Sbjct: 143 PFKVQMQGFTTKIVTMMQSEKLYVSQGGPIILSQIENEYGTVEEAYHEKGPAYVKWAAQM 202
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGY 241
AV LNT VPWVMC+Q DAPDP+IN CNG C + F PNSP+KP +WTEN++ ++ G
Sbjct: 203 AVGLNTGVPWVMCKQNDAPDPVINACNGLRCAETFVGPNSPNKPAIWTENWTTRYVITGE 262
Query: 242 AVPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ R VED+AF V +F G+F NYYMY GGTNFGRTA V TSY APIDEYG
Sbjct: 263 NIRIRSVEDIAFQVTQFIVAKKGSFVNYYMYHGGTNFGRTASA-FVPTSYYDQAPIDEYG 321
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IRQPKWGHL+E+H AIKLC L+S LG + +A ++ S +CAAFL N D++
Sbjct: 322 LIRQPKWGHLKEMHAAIKLCLTPLLSGGQVTISLGQQQQAFVFTGLSGECAAFLLNNDTA 381
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
+ A+V F Y LP S+SILPDCK V FNTAKV +Q + ++LL
Sbjct: 382 NTASVQFRNASYDLPPNSISILPDCKTVAFNTAKVSTQYT------TRSMTRSKLLDGED 435
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
+ Y+E + S + EQ++TTKD SDYLWYT + LN+ SLG
Sbjct: 436 KWVQYQEAIVNFDETSVKSEAILEQMSTTKDASDYLWYTFRFQ-QESSDTQAVLNVRSLG 494
Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
H FVN + V + G+H F + + L+EG+N + +LS+MVG+ + GA+ + A
Sbjct: 495 HVLHAFVNGQAVGYAQGSHKNPQFTLQSTVSLSEGVNNVSLLSVMVGMPDSGAYMERRAA 554
Query: 541 GLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIW 600
GL V I K G ++ ++ W YQVG+ GE + + ++ W S +N L W
Sbjct: 555 GLRKV-KIQEKEGNKEFTNYSWGYQVGLLGEKLQIFTDQGSSQVQWANFSKNALNP-LTW 612
Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
YKT F AP P+ALNL SMGKG+AWVNGQSIGRYW +Y A Y +
Sbjct: 613 YKTLFDAPLEDAPVALNLGSMGKGEAWVNGQSIGRYWPSYRASDGSSQIWYAYFNTGAIF 672
Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADP 720
+ + Y++PR+++ P NLLV+ EE GG+P +IS+ T + ICS V+ +
Sbjct: 673 RAVR---------YNVPRSFLKPKGNLLVVLEESGGNPLQISVDTASISKICSHVTASHL 723
Query: 721 PPVDSWKP-----NLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCG-SFRPGACHM 774
P V SW N + + P+V+L C I+ I FASYG PEG CG ++ G CH
Sbjct: 724 PLVSSWSKRTNTDNNNSLQARPRVKLDCPSNTKISNILFASYGTPEGTCGDAYAVGMCHS 783
Query: 775 DVL-PIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
IVQKAC+GQ+ CSIPVSS Y G C K+L V A C
Sbjct: 784 SSSEAIVQKACLGQMRCSIPVSSKYFG--GDPCSANEKSLLVVAEC 827
>gi|6686886|emb|CAB64743.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 788
Score = 727 bits (1877), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/815 (45%), Positives = 507/815 (62%), Gaps = 38/815 (4%)
Query: 16 GKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDL 75
GKRR+L SGSIHYPRST ++WP+LI K+K+GGL+ IETYVFWN HEP R +Y F G D+
Sbjct: 1 GKRRILLSGSIHYPRSTADMWPDLINKAKDGGLDAIETYVFWNAHEPKRREYDFSGNLDV 60
Query: 76 VRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAK 135
VRF+KT+Q+AGL+ LRIGPY CAEWNYGGFPVWLH +P ++FRT N F EM+ F K
Sbjct: 61 VRFIKTIQDAGLYSVLRIGPYVCAEWNYGGFPVWLHNMPNMKFRTVNPSFMNEMQNFTTK 120
Query: 136 IIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVM 195
I+ +MK+E LFASQGGPIILAQ+ENEYGNV +YG G+ Y+ W A+ A +L+ VPW+M
Sbjct: 121 IVKMMKEEKLFASQGGPIILAQIENEYGNVISSYGAEGKAYIDWCANMANSLDIGVPWLM 180
Query: 196 CQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAV 255
CQQ +AP P++ TCNGFYCD + P +PS P MWTEN++GWF ++G P+R EDLAF+V
Sbjct: 181 CQQPNAPQPMLETCNGFYCDQYEPTNPSTPKMWTENWTGWFKNWGGKHPYRTAEDLAFSV 240
Query: 256 ARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
ARFF+TGGTFQNYYMY GGTNFGR AGGP + TSYDY AP+DE+G + QPKWGHL++LH
Sbjct: 241 ARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTSYDYHAPLDEFGNLNQPKWGHLKQLHT 300
Query: 316 AIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP 375
+K E+ L + + LG ++A IY + + F+ N ++++DA V F G Y +P
Sbjct: 301 VLKSMEKSLTYGNISRIDLGNSIKATIY-TTKEGSSCFIGNVNATADALVNFKGKDYHVP 359
Query: 376 AWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNR 435
AWSVS+LPDC +NTAKV +Q + ++ + + SA +K+ + G+
Sbjct: 360 AWSVSVLPDCDKEAYNTAKVNTQTSIMTEDSSKPERLEWTWRPESA-----QKMILKGSG 414
Query: 436 SFVRPDLAEQINTTKDTSDYLWYTASIHV---MPGQGKEVFLNIESLGHAALVFVNKKLV 492
+ L +Q + T D SDYLWY +H+ P + + L + S H +VN K V
Sbjct: 415 DLIAKGLVDQKDVTNDASDYLWYMTRLHLDKKDPLWSRNMTLRVHSNAHVLHAYVNGKYV 474
Query: 493 AFGYGNHDFANFLINKKI-ELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VILIDL 550
+ ++ +K+ L G N + +LS+ VGLQNYG +F+ G+ V L+
Sbjct: 475 GNQFVKDGKFDYRFERKVNHLVHGTNHISLLSVSVGLQNYGPFFESGPTGINGPVSLVGY 534
Query: 551 KNG---KRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLA 607
K ++DLS +W Y++G+ G L I W LP + L WYK F A
Sbjct: 535 KGEETIEKDLSQHQWDYKIGLNGYNDKLFSIKSVGHQKWAN-EKLPTGRMLTWYKAKFKA 593
Query: 608 PEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCG 667
P GK P+ ++L +GKG+AW+NGQSIGRYW ++ + GC +CDYRG+Y + KC CG
Sbjct: 594 PLGKEPVIVDLNGLGKGEAWINGQSIGRYWPSFNSSDDGCKDECDYRGAYGSDKCAFMCG 653
Query: 668 QPAQTLYHIPRTWVH-PGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSW 726
+P Q YH+PR++++ G N + + EE+GG+PS ++ T +C+ E +
Sbjct: 654 KPTQRWYHVPRSFLNASGHNTITLFEEMGGNPSMVNFKTVVVGTVCARAHEHN------- 706
Query: 727 KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH--MDVLPIVQKAC 784
+V L+C I+A+ FAS+G P G+CGSF G C D V K C
Sbjct: 707 -----------KVELSCHNR-PISAVKFASFGNPLGHCGSFAVGTCQGDKDAAKTVAKEC 754
Query: 785 VGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
VG++ C++ VSS G S C K LAVE C
Sbjct: 755 VGKLNCTVNVSSDTFG-STLDCGDSPKKLAVELEC 788
>gi|224103199|ref|XP_002312963.1| predicted protein [Populus trichocarpa]
gi|222849371|gb|EEE86918.1| predicted protein [Populus trichocarpa]
Length = 835
Score = 723 bits (1865), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/832 (45%), Positives = 507/832 (60%), Gaps = 52/832 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD R+L+I+GKR +L SGSIHYPRSTP++WPELI K+K GGL VI+TYVFWN HEP +
Sbjct: 31 VTYDERSLIINGKRELLFSGSIHYPRSTPDMWPELILKAKRGGLNVIQTYVFWNIHEPEQ 90
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G++ FEG +DLV+F+KT+ E G+F LR+GP+ AEWN+GG P WL IP I FR+ N P
Sbjct: 91 GKFNFEGPYDLVKFIKTIGENGMFATLRLGPFIQAEWNHGGLPYWLREIPDIIFRSDNAP 150
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK M++F+ KIID+MK+E LFASQGGPIIL+Q+ENEY V+ AY G Y++WA + A
Sbjct: 151 FKHHMEKFVTKIIDMMKEEKLFASQGGPIILSQIENEYNTVQLAYKNLGVSYIQWAGNMA 210
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
+ LNT VPWVMC+Q+DAP P+INTCNG +C D FT PN P+KP +WTEN++ F FG
Sbjct: 211 LGLNTGVPWVMCKQKDAPGPVINTCNGRHCGDTFTGPNKPNKPSLWTENWTAQFRVFGDP 270
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
R ED AF+VAR+F G+ NYYMY GGTNF RTA V T Y +AP+DEYG
Sbjct: 271 PSQRSAEDTAFSVARWFSKNGSLVNYYMYHGGTNFDRTAAS-FVTTRYYDEAPLDEYGLQ 329
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSS 361
R+PKWGHL++LH+A+ LC++ L+ +P QKL A +EA Y + CAAFLA+ +S
Sbjct: 330 REPKWGHLKDLHRALNLCKKALLWGNPNVQKLSADVEARFYEQPGTKVCAAFLASNNSKE 389
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
V F G Y+LPA S+SILPDCK VV+NT V+SQ N+ + F + + N+L
Sbjct: 390 AETVKFRGQEYYLPARSISILPDCKTVVYNTMTVVSQHNSRN--FVKSRKTNKL-----E 442
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEV-----FLNI 476
++ Y E + P E N TKD +DY+W+T +I+V E L +
Sbjct: 443 WNMYSETIPAQLQVDSSLP--KELYNLTKDKTDYVWFTTTINVDRRDMNERKRINPVLRV 500
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
SLGHA + FVN + + +G+ +F++ ++L GIN + +L +VGL + GA+ +
Sbjct: 501 ASLGHAMVAFVNGEFIGSAHGSQIEKSFVLQHSVDLKPGINFVTLLGTLVGLPDSGAYME 560
Query: 537 VAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK--QGSTLPV 594
AG V ++ L G DL+S W +QVG+ GE L W Q + PV
Sbjct: 561 HRYAGPRGVSILGLNTGTLDLTSNGWGHQVGLSGETAKLFTKEGGGKVTWTKVQKAGPPV 620
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
WYKT F APEGK P+A+ + M KG W+NG+SIGRYW Y++P
Sbjct: 621 T----WYKTHFDAPEGKSPVAVRMTGMNKGMIWINGKSIGRYWMTYVSP----------- 665
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
G+P Q+ YHIPR+++ P +NL+VI EE +P KI +LT ICS+
Sbjct: 666 -----------LGEPTQSEYHIPRSYLKPTDNLMVIFEEEEANPEKIEILTVNRDTICSY 714
Query: 715 VSEADPPPVDSWKPNLG-----VVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
V+E PP V SW+ V ++ P L C I A+ FAS+G P G CG +
Sbjct: 715 VTEYHPPSVKSWERKNNKFTPVVDNAKPAAHLKCPNQKKIIAVQFASFGDPLGTCGDYAV 774
Query: 770 GACHMDVLP-IVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G CH V +V++ C+G+ C IP+ CPG+ K LAV+ CS
Sbjct: 775 GTCHSLVSKQVVEEHCLGKTSCDIPIDKGLFAGKKDDCPGISKTLAVQVKCS 826
>gi|238009208|gb|ACR35639.1| unknown [Zea mays]
Length = 677
Score = 721 bits (1862), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/680 (54%), Positives = 464/680 (68%), Gaps = 23/680 (3%)
Query: 156 AQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD 215
A++ENEYGN++ AYG G+ Y++WAA AV+L+T VPWVMCQQ DAPDP+INTCNGFYCD
Sbjct: 6 AKIENEYGNIDSAYGAPGKAYMRWAAGMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCD 65
Query: 216 GFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGT 275
FTPNS +KP MWTEN+SGWFLSFG AVP+RPVEDLAFAVARF++ GGTFQNYYMY GGT
Sbjct: 66 QFTPNSAAKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGT 125
Query: 276 NFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLG 335
N R++GGP +ATSYDYDAPIDEYG +RQPKWGHLR++HKAIKLCE LI++DP++ LG
Sbjct: 126 NLDRSSGGPFIATSYDYDAPIDEYGLVRQPKWGHLRDVHKAIKLCEPALIATDPSYTSLG 185
Query: 336 AKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKV 395
+EA +Y K + CAAFLAN D SD VTFNG +Y LPAWSVSILPDCKNVV NTA++
Sbjct: 186 PNVEAAVY-KVGSVCAAFLANIDGQSDKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQI 244
Query: 396 ISQRNNGDHPFAQQKNVNE------LLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTT 449
SQ + + + NV LA S +S+ E VGI+ + + + L EQINTT
Sbjct: 245 NSQTTGSEMRYLESSNVASDGSFVTPELAVSDWSYAIEPVGITKDNALTKAGLMEQINTT 304
Query: 450 KDTSDYLWYTASIHVMPGQGKEVFLN-------IESLGHAALVFVNKKLVAFGYGNHDFA 502
D SD+LWY+ SI V +G E +LN + SLGH V++N K+ G+ +
Sbjct: 305 ADASDFLWYSTSITV---KGDEPYLNGSQSNLAVNSLGHVLQVYINGKIAGSAQGSASSS 361
Query: 503 NFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEW 562
K IEL G N +D+LS VGL NYGA+FD+ GAG+ + + NG DLSS EW
Sbjct: 362 LISWQKPIELVPGKNKIDLLSATVGLSNYGAFFDLVGAGITGPVKLSGLNGALDLSSAEW 421
Query: 563 IYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMG 622
YQ+G+ GE + L S A S W + P+N LIWYKT F P G P+A++ MG
Sbjct: 422 TYQIGLRGEDLHLYDPSEA-SPEWVSANAYPINHPLIWYKTKFTPPAGDDPVAIDFTGMG 480
Query: 623 KGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVH 682
KG+AWVNGQSIGRYW LAP +GC C+YRG+Y +SKC K CGQP+QTLYH+PR+++
Sbjct: 481 KGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGAYSSSKCLKKCGQPSQTLYHVPRSFLQ 540
Query: 683 PGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLA 742
PG N LV+ E GGDPSKIS + + +C+ VSEA P +DSW + P +RL
Sbjct: 541 PGSNDLVLFEHFGGDPSKISFVMRQTGSVCAQVSEAHPAQIDSWSSQQPMQRYGPALRLE 600
Query: 743 CER-GWHIAAINFASYGIPEGNCGSFRPGAC-HMDVLPIVQKACVGQIECSIPVSSAYLG 800
C + G I+++ FAS+G P G CGS+ G C L IVQ+AC+G CS+PVSS Y G
Sbjct: 601 CPKEGQVISSVKFASFGTPSGTCGSYSHGECSSTQALSIVQEACIGVSSCSVPVSSNYFG 660
Query: 801 VSAGACPGLLKALAVEAHCS 820
C G+ K+LAVEA CS
Sbjct: 661 ---NPCTGVTKSLAVEAACS 677
>gi|224068510|ref|XP_002326135.1| predicted protein [Populus trichocarpa]
gi|222833328|gb|EEE71805.1| predicted protein [Populus trichocarpa]
Length = 824
Score = 718 bits (1853), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/807 (47%), Positives = 490/807 (60%), Gaps = 40/807 (4%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V YD A++I+G+R+++ SGSIHYPRST E+W +LI+K+KEGGL+ IETY+FWN HE R
Sbjct: 30 VEYDSSAVIINGQRKIILSGSIHYPRSTVEMWSDLIQKAKEGGLDTIETYIFWNAHERRR 89
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
+Y F G D V+F + VQEAGL+ LRIGPYACAEWNYGGFPVWLH IP I+FRT N
Sbjct: 90 REYNFTGNLDFVKFFQKVQEAGLYGILRIGPYACAEWNYGGFPVWLHNIPEIKFRTDNEI 149
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK EM+ F KI+++ K+ LFASQGGPIILAQ+ENEYGNV YG G+ YV+W A A
Sbjct: 150 FKNEMQTFTTKIVNMAKEAKLFASQGGPIILAQIENEYGNVMGPYGEAGKSYVQWCAQMA 209
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
V N VPW+MCQQ DAP +INTCNGFYCD FTPNSP P MWTEN++GW+ +G P
Sbjct: 210 VAQNIGVPWIMCQQSDAPSSVINTCNGFYCDTFTPNSPKSPKMWTENWTGWYKKWGQKDP 269
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
R EDLAF+VARFF+ G QNYYMY+GGTNFGRT+GGP +ATSYDYDAP+DEYG + Q
Sbjct: 270 HRTAEDLAFSVARFFQYNGVLQNYYMYYGGTNFGRTSGGPFIATSYDYDAPLDEYGNLNQ 329
Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCA--AFLANYDSSS- 361
PKWGHL+ LH A+KL E+ L +S K + S+ D FL+N
Sbjct: 330 PKWGHLKNLHAALKLGEKILTNSTVKTTKYSDGWVELTTYTSNIDGERLCFLSNTKMDGL 389
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
D ++ +G YF+PAWSVSIL DC +NTAKV Q + ++ + N+ L S
Sbjct: 390 DVDLQQDGK-YFVPAWSVSILQDCNKETYNTAKVNVQTS----LIVKKLHENDTPLKLS- 443
Query: 422 FSWYEE--KVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESL 479
+ W E K + G F L EQ T D SDYLWY S+ K V L ++
Sbjct: 444 WEWAPEPTKAPLHGQGGFKATQLLEQKAATYDESDYLWYMTSVDNNGTASKNVTLRVKYS 503
Query: 480 GHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAG 539
G FVN K + +G F K L G N + +LS VGLQNYG +FD
Sbjct: 504 GQFLHAFVNGKEIGSQHG----YTFTFEKPALLKPGTNIISLLSATVGLQNYGEFFDEGP 559
Query: 540 AGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
G+ V LID N DLSS EW Y+VG+ GE G + + W G+ L V ++
Sbjct: 560 EGIAGGPVELIDSGNTTTDLSSNEWSYKVGLNGEG-GRFYDPTSGRAKWVSGN-LRVGRA 617
Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
+ WYKTTF AP G P+ ++L MGKG AWVNG S+GR+W A GC KCDYRG Y
Sbjct: 618 MTWYKTTFQAPSGTEPVVVDLQGMGKGHAWVNGNSLGRFWPILTADPNGCDGKCDYRGQY 677
Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
KC +CG P Q YH+PR++++ G N L++ EE+GG+PS +S + IC E
Sbjct: 678 KEGKCLSNCGNPTQRWYHVPRSFLNNGSNTLILFEEIGGNPSDVSFQITATETICGNTYE 737
Query: 718 ADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAA-INFASYGIPEG-NCGSFRPGACHMD 775
+ L+C G I + I +AS+G P+G +CGSF+ G+
Sbjct: 738 G------------------TTLELSCNGGRRIISDIQYASFGDPQGSSCGSFQRGSVEAS 779
Query: 776 -VLPIVQKACVGQIECSIPVSSAYLGV 801
V+KAC+G+ CSI VS A GV
Sbjct: 780 RSFSAVEKACMGKESCSINVSKATFGV 806
>gi|358348424|ref|XP_003638247.1| hypothetical protein MTR_122s1070, partial [Medicago truncatula]
gi|355504182|gb|AES85385.1| hypothetical protein MTR_122s1070, partial [Medicago truncatula]
Length = 771
Score = 715 bits (1846), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/740 (51%), Positives = 462/740 (62%), Gaps = 49/740 (6%)
Query: 21 LQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVK 80
L S SIHYPRS P +WP LI+ +KEGG++VIETYVFWN HE G YYF GRFDLV+F K
Sbjct: 1 LISASIHYPRSVP-MWPALIQTAKEGGIDVIETYVFWNGHELSPGNYYFGGRFDLVQFAK 59
Query: 81 TVQEAGLFLHLRIGPYACAEWNYGG---------------------------------FP 107
VQ+AG++L LRIGP+ AEWN+GG P
Sbjct: 60 VVQDAGMYLILRIGPFVAAEWNFGGEKNGVLICEDGEERGYRERADKNNQGNSRVLCGVP 119
Query: 108 VWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEW 167
VWLH+IPG FRT N PF M++F I++LMK+E LFASQGGPIIL+Q+ENEYG E
Sbjct: 120 VWLHYIPGTVFRTYNQPFMHHMEKFTTYIVNLMKKEKLFASQGGPIILSQIENEYGYYEN 179
Query: 168 AYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIM 227
Y G+ Y WAA AV+ NTSVPW+MCQQ DAPDP+I+TCN FYCD FTP SP +P M
Sbjct: 180 YYKEDGKKYALWAAKMAVSQNTSVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPKRPKM 239
Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA 287
WTEN+ GWF +FG P RPVED+AF+VARFF+ GG+ NYYMY GGTNFGRTAGGP +
Sbjct: 240 WTENWPGWFKTFGGRDPHRPVEDVAFSVARFFQKGGSLNNYYMYHGGTNFGRTAGGPFIT 299
Query: 288 TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSS 347
TSYDYDAPIDEYG R PKWGHL+ELHKAIKLCE L+ + LG +EA IY SS
Sbjct: 300 TSYDYDAPIDEYGLPRLPKWGHLKELHKAIKLCEHVLLYGKSVNISLGPSVEADIYTDSS 359
Query: 348 NDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN----NGD 403
CAAF++N D +D V F Y LPAWSVSILPDCKNVVFNTAKV S N +
Sbjct: 360 GACAAFISNVDDKNDKKVVFRNASYHLPAWSVSILPDCKNVVFNTAKVSSPTNIVAMIPE 419
Query: 404 HPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIH 463
H QQ + + L F +E GI G FV+ + INTTKDT+DYLW+T SI
Sbjct: 420 H--LQQSDKGQKTLKWDVF---KENPGIWGKADFVKNGFVDHINTTKDTTDYLWHTTSIL 474
Query: 464 VMPGQ-----GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINT 518
+ + G + L IES GH FVN+K G GN + F I L G N
Sbjct: 475 IDANEEFLKKGSKPALLIESKGHTLHAFVNQKYQGTGTGNGSHSAFTFKNPISLRAGKNE 534
Query: 519 LDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKI 578
+ ILS+ VGLQ G ++D GAG+ SV +I L N DLSS W Y++GV GE++ + +
Sbjct: 535 IAILSLTVGLQTAGPFYDFIGAGVTSVKIIGLNNRTIDLSSNAWAYKIGVLGEHLSIYQG 594
Query: 579 SLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
NS W S P ++L WYK AP G P+ L++ MGKG AW+NG+ IGRYW
Sbjct: 595 EGMNSVKWTSTSEPPKGQALTWYKAIVDAPSGDEPVGLDMLYMGKGLAWLNGEEIGRYWP 654
Query: 639 AYLA-PSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGD 697
C ++CDYRG ++ KC CG+P+Q YH+PR+W P N+LVI EE GGD
Sbjct: 655 RISEFKKEDCVQECDYRGKFNPDKCDTGCGEPSQKWYHVPRSWFKPSGNVLVIFEEKGGD 714
Query: 698 PSKISLLTKTGQHICSFVSE 717
P+KI+ + S V E
Sbjct: 715 PTKITFVRHCHNPYSSIVVE 734
>gi|225459613|ref|XP_002284529.1| PREDICTED: beta-galactosidase 16-like [Vitis vinifera]
Length = 813
Score = 715 bits (1845), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/827 (46%), Positives = 501/827 (60%), Gaps = 46/827 (5%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
+VTYD R+L+I+G+RR+L SGSIHYPRSTPE+WP LI K+KEGG++VIETY FWN HEP
Sbjct: 22 GSVTYDGRSLIINGQRRLLFSGSIHYPRSTPEMWPSLISKAKEGGIDVIETYAFWNQHEP 81
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
+GQY F GR D+V+F K VQ GL+ LRIGP+ +EWNYGG P WLH +PGI +R+ N
Sbjct: 82 KQGQYDFSGRLDIVKFFKEVQAQGLYACLRIGPFIESEWNYGGLPFWLHDVPGIIYRSDN 141
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M+ F KI++LMK ENL+ASQGGPIIL+Q+ENEY NVE A+ G YV+WAA
Sbjct: 142 EPFKFYMQNFTTKIVNLMKSENLYASQGGPIILSQIENEYKNVEAAFHEKGPPYVRWAAK 201
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFG 240
AV+L T VPWVMC+Q+DAPDP+IN CNG C + F PN P+KP +WTEN++ + +G
Sbjct: 202 MAVDLQTGVPWVMCKQDDAPDPVINACNGMKCGETFAGPNKPNKPAIWTENWTSVYEVYG 261
Query: 241 YAVPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
R EDLAF VA F + G+F NYYMY GGTNFGRT+ ++ YD AP+DEY
Sbjct: 262 EDKRGRAAEDLAFQVALFIAKKNGSFINYYMYHGGTNFGRTSSSYVLTAYYD-QAPLDEY 320
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
G IRQPKWGHL+ELH IKLC + L+ + LG EA+++ + S CAAFL N D
Sbjct: 321 GLIRQPKWGHLKELHAVIKLCSDTLLHGVQYNYSLGQLQEAYLFKRPSGQCAAFLVNNDK 380
Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
+ V F Y L A S+SILPDCK + FNTAKV +Q N ++V
Sbjct: 381 RRNVTVLFQNTNYELAANSISILPDCKKIAFNTAKVSTQFNT--------RSVQTRATFG 432
Query: 420 SAFSWYEEKVGIS--GNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIE 477
S W E + GI G L E + TTKD SDYLWYT + + L ++
Sbjct: 433 STKQWSEYREGIPSFGGTPLKASMLLEHMGTTKDASDYLWYTLRF-IQNSSNAQPVLRVD 491
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
SL H FVN K +A +G+H +F + K+ LN G+N + +LS+MVGL + G + +
Sbjct: 492 SLAHVLHAFVNGKYIASAHGSHQNGSFSLVNKVPLNSGLNRISLLSVMVGLPDAGPYLEH 551
Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
AG+ V + D + K D S W YQVG+ GE + + W G
Sbjct: 552 KVAGIRRVEIQDGGDSK-DFSKHPWGYQVGLMGEKSQIYTSPGSQKVQW-HGLGSHGRGP 609
Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
L WYKT F AP G P+ L SMGKG+AWVNGQSIGRYW +YL PS
Sbjct: 610 LTWYKTLFDAPPGNDPVVLFFGSMGKGEAWVNGQSIGRYWVSYLTPS------------- 656
Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
G+P+QT Y++PR +++P NLLV+ EE GDP KIS+ T + ++C V++
Sbjct: 657 ---------GEPSQTWYNVPRAFLNPKGNLLVVQEEESGDPLKISIGTVSVTNVCGHVTD 707
Query: 718 ADPPPVDSWKP----NLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
+ PPP+ SW N P+V+L C +I+ I FAS+G P G C S+ G+CH
Sbjct: 708 SHPPPIISWTTSDDGNESHHGKIPKVQLRCPPSSNISKITFASFGTPVGGCESYAIGSCH 767
Query: 774 M-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
+ L + +KAC+G+ CSIP S G CPG KAL V A C
Sbjct: 768 SPNSLAVAEKACLGKNMCSIPHSLKSFG--DDPCPGTPKALLVAAQC 812
>gi|302141788|emb|CBI18991.3| unnamed protein product [Vitis vinifera]
Length = 821
Score = 714 bits (1843), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/827 (46%), Positives = 501/827 (60%), Gaps = 46/827 (5%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
+VTYD R+L+I+G+RR+L SGSIHYPRSTPE+WP LI K+KEGG++VIETY FWN HEP
Sbjct: 30 GSVTYDGRSLIINGQRRLLFSGSIHYPRSTPEMWPSLISKAKEGGIDVIETYAFWNQHEP 89
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
+GQY F GR D+V+F K VQ GL+ LRIGP+ +EWNYGG P WLH +PGI +R+ N
Sbjct: 90 KQGQYDFSGRLDIVKFFKEVQAQGLYACLRIGPFIESEWNYGGLPFWLHDVPGIIYRSDN 149
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M+ F KI++LMK ENL+ASQGGPIIL+Q+ENEY NVE A+ G YV+WAA
Sbjct: 150 EPFKFYMQNFTTKIVNLMKSENLYASQGGPIILSQIENEYKNVEAAFHEKGPPYVRWAAK 209
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFG 240
AV+L T VPWVMC+Q+DAPDP+IN CNG C + F PN P+KP +WTEN++ + +G
Sbjct: 210 MAVDLQTGVPWVMCKQDDAPDPVINACNGMKCGETFAGPNKPNKPAIWTENWTSVYEVYG 269
Query: 241 YAVPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
R EDLAF VA F + G+F NYYMY GGTNFGRT+ ++ YD AP+DEY
Sbjct: 270 EDKRGRAAEDLAFQVALFIAKKNGSFINYYMYHGGTNFGRTSSSYVLTAYYD-QAPLDEY 328
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
G IRQPKWGHL+ELH IKLC + L+ + LG EA+++ + S CAAFL N D
Sbjct: 329 GLIRQPKWGHLKELHAVIKLCSDTLLHGVQYNYSLGQLQEAYLFKRPSGQCAAFLVNNDK 388
Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
+ V F Y L A S+SILPDCK + FNTAKV +Q N ++V
Sbjct: 389 RRNVTVLFQNTNYELAANSISILPDCKKIAFNTAKVSTQFNT--------RSVQTRATFG 440
Query: 420 SAFSWYEEKVGIS--GNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIE 477
S W E + GI G L E + TTKD SDYLWYT + + L ++
Sbjct: 441 STKQWSEYREGIPSFGGTPLKASMLLEHMGTTKDASDYLWYTLRF-IQNSSNAQPVLRVD 499
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
SL H FVN K +A +G+H +F + K+ LN G+N + +LS+MVGL + G + +
Sbjct: 500 SLAHVLHAFVNGKYIASAHGSHQNGSFSLVNKVPLNSGLNRISLLSVMVGLPDAGPYLEH 559
Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
AG+ V + D + K D S W YQVG+ GE + + W G
Sbjct: 560 KVAGIRRVEIQDGGDSK-DFSKHPWGYQVGLMGEKSQIYTSPGSQKVQW-HGLGSHGRGP 617
Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
L WYKT F AP G P+ L SMGKG+AWVNGQSIGRYW +YL PS
Sbjct: 618 LTWYKTLFDAPPGNDPVVLFFGSMGKGEAWVNGQSIGRYWVSYLTPS------------- 664
Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
G+P+QT Y++PR +++P NLLV+ EE GDP KIS+ T + ++C V++
Sbjct: 665 ---------GEPSQTWYNVPRAFLNPKGNLLVVQEEESGDPLKISIGTVSVTNVCGHVTD 715
Query: 718 ADPPPVDSWKP----NLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
+ PPP+ SW N P+V+L C +I+ I FAS+G P G C S+ G+CH
Sbjct: 716 SHPPPIISWTTSDDGNESHHGKIPKVQLRCPPSSNISKITFASFGTPVGGCESYAIGSCH 775
Query: 774 M-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
+ L + +KAC+G+ CSIP S G CPG KAL V A C
Sbjct: 776 SPNSLAVAEKACLGKNMCSIPHSLKSFG--DDPCPGTPKALLVAAQC 820
>gi|449436074|ref|XP_004135819.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 643
Score = 711 bits (1835), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/650 (53%), Positives = 440/650 (67%), Gaps = 23/650 (3%)
Query: 67 YYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFK 126
Y FE R+DLVRFVK V +AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT N PFK
Sbjct: 6 YNFEDRYDLVRFVKLVHQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIAFRTDNGPFK 65
Query: 127 EEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVN 186
M++F KI+ LMK E L+ SQGGPIIL+Q+ENEYG VEW G G+ Y KWAA A+
Sbjct: 66 AAMQKFTEKIVGLMKGEKLYESQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAAQMALG 125
Query: 187 LNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFR 246
L+T VPWVMC+Q+DAPDP+I+TCNGFYC+ F PN KP MWTE ++GWF FG P+R
Sbjct: 126 LDTGVPWVMCKQDDAPDPVIDTCNGFYCENFKPNKVYKPKMWTEAWTGWFTEFGGPAPYR 185
Query: 247 PVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPK 306
PVED+A++VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG +R+PK
Sbjct: 186 PVEDMAYSVARFIQNGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLREPK 245
Query: 307 WGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVT 366
W HLR+LHKAIKLCE L+S DPT LG+ EAH++ S CAAFLANYD+SS A VT
Sbjct: 246 WSHLRDLHKAIKLCEPALVSVDPTVSYLGSNQEAHVFKTRSGSCAAFLANYDASSSATVT 305
Query: 367 FNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWY- 425
F N Y LP WSVSILPDCK+V+FNTAKV P +Q K + S+FSW
Sbjct: 306 FGNNQYDLPPWSVSILPDCKSVIFNTAKV-------GAPTSQPK-----MTPVSSFSWLS 353
Query: 426 --EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIES 478
EE + L EQI+ T+D++DYLWY I + P + G+ L + S
Sbjct: 354 YNEETASAYTEDTTTMAGLVEQISVTRDSTDYLWYMTDIRIDPNEGFLKSGQWPLLTVFS 413
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
GHA VF+N +L YG + +K + L GIN L ILS+ VGL N G ++
Sbjct: 414 AGHALHVFINGQLSGTTYGGSENYKLTFSKYVNLRAGINKLSILSVAVGLPNGGLHYETW 473
Query: 539 GAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
G+ V L L RD+S +W Y++G++GE + L +S ++S W GS + +
Sbjct: 474 NTGVLGPVTLKGLNEDTRDMSGYKWSYKIGLKGEALNLHSVSGSSSVEWVTGSLVAQKQP 533
Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
L WYKTTF +P+G PLAL+++SMGKGQ W+NGQSIGR+W AY A G KC+Y G +
Sbjct: 534 LTWYKTTFDSPKGNEPLALDMSSMGKGQIWINGQSIGRHWPAYTA--KGSCGKCNYGGIF 591
Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
+ KC +CG+P+Q YH+PR W+ N+LVI EE GG+P ISL+ ++
Sbjct: 592 NEKKCHSNCGEPSQRWYHVPRAWLKSSGNVLVIFEEWGGNPEGISLVKRS 641
>gi|224080622|ref|XP_002306183.1| predicted protein [Populus trichocarpa]
gi|222849147|gb|EEE86694.1| predicted protein [Populus trichocarpa]
Length = 838
Score = 711 bits (1834), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/833 (44%), Positives = 502/833 (60%), Gaps = 53/833 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD R+L+I+GKR +L SGSIHYPRSTPE+WPELI+K+K GGL VI+TYVFWN HEP +
Sbjct: 31 VTYDGRSLIINGKRELLFSGSIHYPRSTPEMWPELIQKAKRGGLNVIQTYVFWNIHEPEQ 90
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G++ FEG +DLV+F+KT+ E G+ +R+GP+ AEWN+GG P WL IP I FR+ N P
Sbjct: 91 GKFNFEGSYDLVKFIKTIGENGMSATIRLGPFIQAEWNHGGLPYWLREIPDIIFRSDNAP 150
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK M+RF+ II+ +K+E LFASQGGPIILAQ+ENEY V+ AY G YV+WA + A
Sbjct: 151 FKLHMERFVTMIINKLKEEKLFASQGGPIILAQIENEYNTVQLAYRNLGVSYVQWAGNMA 210
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
+ L T VPWVMC+Q+DAP P+INTCNG +C D FT PNSP KP +WTEN++ F FG
Sbjct: 211 LGLKTGVPWVMCKQKDAPGPVINTCNGRHCGDTFTGPNSPDKPSLWTENWTAQFRVFGDP 270
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
R ED AF+VAR+F G+ NYYMY GGTNF RTA V T Y +AP+DEYG
Sbjct: 271 PSQRSAEDTAFSVARWFSKNGSLVNYYMYHGGTNFDRTAAS-FVTTRYYDEAPLDEYGLQ 329
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYDSSS 361
R+PKWGHL++LH+A+ LC++ L+ P Q+L A +EA + + +NDCAAFLAN ++
Sbjct: 330 REPKWGHLKDLHRALNLCKKALLWGTPNVQRLSADVEARFFEQPRTNDCAAFLANNNTKD 389
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
VTF G Y+LPA S+SILPDCK VV+NT V+SQ N+ + F + + + L
Sbjct: 390 PETVTFRGKKYYLPAKSISILPDCKTVVYNTMTVVSQHNSRN--FVKSRKTDGKL----- 442
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ---GKEV--FLNI 476
W I N E N TKD +DY W+T +I+V K++ L +
Sbjct: 443 -EWKMFSETIPSNLLVDSRIPRELYNLTKDKTDYAWFTTTINVDRNDLSARKDINPVLRV 501
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
SLGHA + F+N + + +G+ +F++ ++L GIN + +L +VGL + GA+ +
Sbjct: 502 ASLGHAMVAFINGEFIGSAHGSQIEKSFVLQHSVKLKPGINFVTLLGSLVGLPDSGAYME 561
Query: 537 VAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
AG V ++ L G DLSS W +QV + GE + W + VNK
Sbjct: 562 HRYAGPRGVSILGLNTGTLDLSSNGWGHQVALSGETAKVFTKEGGRKVTWTK-----VNK 616
Query: 597 S---LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ WYKT F APEGK P+A+ + M KG W+NG+SIGRYW Y++P
Sbjct: 617 DGPPVTWYKTRFDAPEGKSPVAVRMTGMKKGMIWINGKSIGRYWMNYISP---------- 666
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
G+P Q+ YHIPR+++ P NL+VI EE G P KI +LT ICS
Sbjct: 667 ------------LGEPTQSEYHIPRSYLKPTNNLMVILEEEGASPEKIEILTVNRDTICS 714
Query: 714 FVSEADPPPVDSWKPNLGVVS-----SSPQVRLACERGWHIAAINFASYGIPEGNCGSFR 768
+V+E PP V SW+ + + P RL C I A+ FAS+G P G CG+F
Sbjct: 715 YVTEYHPPNVRSWERKNKKFTPVADDAKPAARLKCPNKKKIVAVQFASFGDPSGTCGNFA 774
Query: 769 PGACHMDVLP-IVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G C + +V++ C+G+ C IP+ CP L K LAV+ CS
Sbjct: 775 VGTCDSPISKQVVEQHCLGKTSCDIPMDKGLFNGKKDNCPNLTKNLAVQVKCS 827
>gi|255575455|ref|XP_002528629.1| beta-galactosidase, putative [Ricinus communis]
gi|223531918|gb|EEF33732.1| beta-galactosidase, putative [Ricinus communis]
Length = 822
Score = 709 bits (1830), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/846 (46%), Positives = 516/846 (60%), Gaps = 61/846 (7%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD+RA+ IDG R+++ SGSIHYPRSTPE+WP+LIRK+KEGGL IETYVFWN HEP +
Sbjct: 7 VTYDNRAIKIDGARKLILSGSIHYPRSTPEMWPQLIRKAKEGGLNTIETYVFWNAHEPHQ 66
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
QY F G DL+RF+KT+++ GL+ LRIGPY CAEWNYGGFPVWLH +PGIQ RT N
Sbjct: 67 RQYDFSGNLDLIRFIKTIRDEGLYAILRIGPYVCAEWNYGGFPVWLHNLPGIQIRTNNEV 126
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
+K EM+ F I+++MK LFASQGGPIIL+Q+ENEYGNV+ +YG G+ YVKW A+ A
Sbjct: 127 YKNEMEIFTTLIVNMMKDGKLFASQGGPIILSQIENEYGNVQSSYGDEGKEYVKWCANLA 186
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
+ VPW+MCQQ DAP P+I++CNGFYCD + N+ S P +WTEN++GWF +G P
Sbjct: 187 ESFKVGVPWIMCQQSDAPSPMIDSCNGFYCDQYYSNNKSLPKIWTENWTGWFQDWGQKNP 246
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
R ED+AFAVARFF+ GG+ NYYMY GGTNFG T GGP + SYDYDAP+DEYG +RQ
Sbjct: 247 HRSAEDVAFAVARFFQLGGSVMNYYMYHGGTNFGTTGGGPYITASYDYDAPLDEYGNLRQ 306
Query: 305 PKWGHLRELHKAIKLCEEYLI------SSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYD 358
PKWGHLR+LH + E+ L S+ P + + + A+ +S F ++ D
Sbjct: 307 PKWGHLRDLHSVLNSMEQTLTYGESKNSNYPDNNNIFITIFAYQGKRS-----CFFSSID 361
Query: 359 SSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
D ++F G YFLPAWSVSILPDC V+NTA V Q + ++ + E
Sbjct: 362 -YKDQTISFEGTDYFLPAWSVSILPDCFTEVYNTATVNVQTSIMENKANAADSFRE--PN 418
Query: 419 SSAFSWYEEKV-GIS------GNRSFVRPDLAEQINTTKDTSDYLW-YTASIHVMP---- 466
S + W EK+ G+S GN + V +L +Q T TSDYLW T H M
Sbjct: 419 SLQWKWRPEKIRGLSLQGDFVGN-TLVANELMDQKAVTNGTSDYLWIMTNYDHNMNDSLW 477
Query: 467 GQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFA--NFLINKKIELNEGINTLDILSM 524
G GK++ L + + GH FVN K V + + +F+ KI+L GIN + ++S+
Sbjct: 478 GAGKDIILQVHTNGHVVHAFVNGKHVGSQSASIESGRFDFVFESKIKLKRGINRISLVSV 537
Query: 525 MVGLQNYGAWFDVAGAGLFSVILI-------DLKNGKRDLSSGEWIYQVGVEGEYIGLDK 577
VGLQNYGA FD A G+ I I + + D+SS W+Y+ G+ GE G
Sbjct: 538 SVGLQNYGANFDTAPTGINGPITIIGRSKLGNQPDVTVDISSNRWVYKTGLHGEDQGFQA 597
Query: 578 ISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYW 637
+ + + L +N+ +WYKT+F AP G+ P+ ++L +GKG AWVNG++IGR+W
Sbjct: 598 VRPRHRRQFYTKHVL-INQPFVWYKTSFNAPLGQDPVVVDLLGLGKGTAWVNGRNIGRFW 656
Query: 638 SAYLAPSTG-CTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGG 696
LAP G C C Y G+Y+ +C CG+P Q YHIPR W+ P +N LV+ EELGG
Sbjct: 657 PKALAPDDGTCNAPCSYIGTYEPKQCVTGCGEPTQRYYHIPRDWLKPEDNKLVLFEELGG 716
Query: 697 DPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFAS 756
P +S+ T T +C E V L+C+ G + I FAS
Sbjct: 717 TPDFVSVQTVTVGKVCVHGYEGH------------------TVELSCQHGRKFSKITFAS 758
Query: 757 YGIPEGNCGSFRPG---ACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKAL 813
+G+P+G CGSF P CH DV IV+KACVG+ CSI +S L + C + L
Sbjct: 759 FGLPQGKCGSFTPSNNHDCHADVSTIVEKACVGKERCSIDISEKAL--APIHCDARIYRL 816
Query: 814 AVEAHC 819
AVEA C
Sbjct: 817 AVEAVC 822
>gi|183238712|gb|ACC60982.1| beta-galactosidase 2 precursor [Petunia x hybrida]
Length = 830
Score = 709 bits (1829), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/832 (44%), Positives = 505/832 (60%), Gaps = 50/832 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD R+++++G+R +L SGSIHYPR PE+WPE+IRK+KEGGL VI+TYVFWN HEP++
Sbjct: 28 VTYDGRSMIVNGERELLFSGSIHYPRMPPEMWPEIIRKAKEGGLNVIQTYVFWNIHEPVQ 87
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
GQ+ FEG +DLV+F+K + E GL++ LRIGPY AEWN GGFP WL +P I FR+ N P
Sbjct: 88 GQFNFEGNYDLVKFIKAIGEQGLYVTLRIGPYIEAEWNQGGFPYWLREVPNITFRSYNEP 147
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
F MK++ +IDL+K+E LFA QGGPII+AQ+ENEY NV+ AY G+ Y++WAA+ A
Sbjct: 148 FIHHMKKYSEMVIDLVKKEKLFAPQGGPIIMAQIENEYNNVQLAYRDNGKKYIEWAANMA 207
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
+L VPW+MC+Q+DAP +INTCNG +C D FT PN P+KP +WTEN++ + +FG
Sbjct: 208 TSLYNGVPWIMCKQKDAPPQVINTCNGRHCADTFTGPNGPNKPSLWTENWTAQYRTFGDP 267
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
R ED+AF+VARFF GT NYYMY+GGTN+GRT+ V T Y +AP+DE+G
Sbjct: 268 PSQRAAEDIAFSVARFFAKNGTLTNYYMYYGGTNYGRTSSS-FVTTRYYDEAPLDEFGLY 326
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYDSSS 361
R+PKW HLR+LH+A++L L+ PT QK+ LE ++ K S DCAAFL N ++
Sbjct: 327 REPKWSHLRDLHRALRLSRRALLWGTPTVQKINQDLEITVFEKPGSTDCAAFLTNNHTTQ 386
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
+ + F G Y+LP SVSILPDCK VV+NT ++SQ N+ + +++ S
Sbjct: 387 PSTIKFRGKDYYLPEKSVSILPDCKTVVYNTQTIVSQHNSRNFITSEK---------SKN 437
Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASI----HVMPGQGKEV-FL 474
W Y+EKV + + E + TKDTSDY WY+ SI H +P + + L
Sbjct: 438 LKWEMYQEKVPTIADLPLKNREPLELYSLTKDTSDYAWYSTSITLERHDLPMRPDILPVL 497
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
I S+GHA FVN + V FG+GN+ +F+ K I L G NT+ IL+ VG N GA+
Sbjct: 498 QIASMGHALAAFVNGEYVGFGHGNNIEKSFVFQKPIILKPGTNTITILAETVGFPNSGAY 557
Query: 535 FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+ AG V + L G D++ W ++VGV GE L A W T P
Sbjct: 558 MEKRFAGPRGVTIQGLMAGTLDITQNNWGHEVGVFGEKQELFTEEGAKKVQWTP-VTGPP 616
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
++ WYKT F APEG P+AL + M KG WVNG+S+GRYW+++L+P
Sbjct: 617 KGAVTWYKTYFDAPEGNNPVALKMDKMEKGMMWVNGKSLGRYWTSFLSP----------- 665
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
GQP Q YHIPR ++ P NLLVI EE GG P+ I + T ICS
Sbjct: 666 -----------LGQPTQAEYHIPRAYLKPTNNLLVIFEETGGHPTNIEVQTVNRDTICSI 714
Query: 715 VSEADPPPVDSWKPN----LGVVSS-SPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
++E PP V SW+ + + VV L C I + FASYG P+G CG+
Sbjct: 715 ITEYHPPHVKSWERSGTDFVAVVEDLKSGAHLTCPDNKIIEKVEFASYGNPDGACGNLFN 774
Query: 770 GACH-MDVLPIVQKACVGQIECSIPVSSA-YLGVSAGACPGLLKALAVEAHC 819
G C+ + L +V++ C+G+ C+IP+ Y S CP + K LAV+ C
Sbjct: 775 GNCNSANSLKVVEQHCLGKNTCTIPIEREIYDEPSKDPCPNIFKTLAVQVKC 826
>gi|413957070|gb|AFW89719.1| hypothetical protein ZEAMMB73_400203 [Zea mays]
Length = 809
Score = 706 bits (1822), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/769 (48%), Positives = 477/769 (62%), Gaps = 82/769 (10%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTP--------------------------EVWPE 38
VTYD +A++IDG+RR+L SGSIHYPRSTP E+W
Sbjct: 27 VTYDKKAVLIDGQRRILFSGSIHYPRSTPDVTAFYKISSPPTIPWRGLWLRIYGSEMWEG 86
Query: 39 LIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYAC 98
LI+K+K+GGL+VI+TYVFWN HEP G G+F R Y
Sbjct: 87 LIQKAKDGGLDVIQTYVFWNGHEPTPGN----------------DSDGIFF--RFEQYYF 128
Query: 99 AEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQ- 157
E GFPVWL ++PGI FRT N PFK M+ F KI+ +MK ENLFASQGGPIIL+Q
Sbjct: 129 EE---SGFPVWLKYVPGISFRTDNEPFKTAMQGFTEKIVGMMKSENLFASQGGPIILSQA 185
Query: 158 --------VENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTC 209
+ENEYG +G G+ Y+ WAA AV L T VPWVMC++EDAPDP+IN C
Sbjct: 186 SIIFSLDLIENEYGPEGREFGAAGQAYINWAAKMAVGLGTGVPWVMCKEEDAPDPVINAC 245
Query: 210 NGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYY 269
NGFYCD F+PN P KP MWTE +SGWF FG + RPVEDLAFAVARF + GG+F NYY
Sbjct: 246 NGFYCDAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAFAVARFVQKGGSFINYY 305
Query: 270 MYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
MY GGTNFGRTAGGP + TSYDYDAPIDEYG +R+PK HL+ELH+A+KLCE+ L+S DP
Sbjct: 306 MYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLVREPKHSHLKELHRAVKLCEQALVSVDP 365
Query: 330 THQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVV 389
LG EA ++ +S + CAAFLANY+S+S A V FN Y LP WS+SILPDCKNVV
Sbjct: 366 AITTLGTMQEARVF-QSPSGCAAFLANYNSNSYAKVVFNNEQYSLPPWSISILPDCKNVV 424
Query: 390 FNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW--YEEKV-GISGNRSFVRPDLAEQI 446
FN+A V Q + +S+ +W Y+E+V ++ L EQ+
Sbjct: 425 FNSATV----------GVQTSQMQMWGDGASSMTWERYDEEVDSLAAAPLLTTTGLLEQL 474
Query: 447 NTTKDTSDYLWYTASIHV------MPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHD 500
N T+D+SDYLWY S+ + + G GK + L+++S GHA VFVN +L YG +
Sbjct: 475 NVTRDSSDYLWYITSVDISSSENFLQGGGKPLSLSVQSAGHALHVFVNGQLQGSAYGTRE 534
Query: 501 FANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VILIDLKNGKRDLSS 559
N L G N + +LS+ GL N G ++ G+ V+L L G RDL+
Sbjct: 535 DRRIKYNGNASLRAGTNKIALLSVACGLPNVGVHYETWNTGVGGPVVLHGLDEGSRDLTW 594
Query: 560 GEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS-LIWYKTTFLAPEGKGPLALNL 618
W YQVG++GE + L+ I ++S W QGS + N+ L WY+ F P G PLAL++
Sbjct: 595 QTWSYQVGLKGEQMNLNSIEGSSSVEWMQGSLIAQNQQPLAWYRAYFETPSGDEPLALDM 654
Query: 619 ASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPR 678
SMGKGQ W+NGQSIGRYW+AY + G K+C Y G++ A KCQ CGQP Q YH+P+
Sbjct: 655 GSMGKGQIWINGQSIGRYWTAY---ADGDCKECSYTGTFRAPKCQSGCGQPTQRWYHVPK 711
Query: 679 TWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWK 727
+W+ P NLLV+ EELGGD SKI+L+ ++ +C+ VSE D P + +W+
Sbjct: 712 SWLQPTRNLLVVFEELGGDSSKIALVKRSVSSVCADVSE-DHPNIKNWQ 759
>gi|302141787|emb|CBI18990.3| unnamed protein product [Vitis vinifera]
Length = 817
Score = 704 bits (1816), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/825 (46%), Positives = 506/825 (61%), Gaps = 44/825 (5%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD R+L+I+G+R++L SGSIHYPRSTPE+WP LI ++K+GG++VIETYVFWN HEP
Sbjct: 28 VTYDGRSLIINGQRKILFSGSIHYPRSTPEMWPSLISQAKQGGIDVIETYVFWNQHEPKP 87
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
GQY F GR D+VRF++ VQ GL+ LRIGP+ AEWNYGGFP WLH +PGI +RT N P
Sbjct: 88 GQYDFSGRRDIVRFIREVQAQGLYACLRIGPFIQAEWNYGGFPFWLHDVPGIVYRTDNEP 147
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK M+ F KI+++MK ENL+ASQGGPIIL Q+ENEY VE +G G+ YV WAA+ A
Sbjct: 148 FKFYMRNFTTKIVEIMKSENLYASQGGPIILQQIENEYKTVEANFGEAGKRYVLWAANMA 207
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
V L T VPWVMC+Q+DAPDP+IN+CNG C + F PNSP+KP +WTEN++ + FG
Sbjct: 208 VGLETGVPWVMCKQDDAPDPVINSCNGRLCGETFAGPNSPNKPAIWTENWTSSYPLFGED 267
Query: 243 VPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
RPVED+AF VA F + G+F NYYMY GGTNFGRTA V T+Y +AP+DEYG
Sbjct: 268 ARPRPVEDIAFHVALFVAKMNGSFINYYMYHGGTNFGRTASA-YVQTAYYDEAPLDEYGL 326
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKL-EAHIYHKSSNDCAAFLANYDSS 360
I+QP WGHL+ELH A+KLC E L+ ++ LG KL EA+++ S CAAFL N DS
Sbjct: 327 IQQPTWGHLKELHAAVKLCSETLLQGAQSNLSLGTKLQEAYVFRGQSGKCAAFLVNNDSR 386
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
+D V F Y LP S+SILPDCKN FNTAK P ++
Sbjct: 387 TDVTVVFQNTSYELPRKSISILPDCKNEAFNTAKA------SFRPGLISIQTVTKFNSTE 440
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
+ Y+E + + S L E +NTTKD SDYLWYT + P G+ V L+ S
Sbjct: 441 QWEEYKESILNFDDTSSRANTLLEHMNTTKDASDYLWYTFRYNNDPSNGQSV-LSTNSRA 499
Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
HA F+N + +G+ +F ++ + GIN + +LS+MVGL + GA+ + A
Sbjct: 500 HALHAFINGRHTGSQHGSSSNLSFSLDNTVSFRAGINNVSLLSVMVGLPDSGAYLERRVA 559
Query: 541 GLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFW-KQGSTLPVNKSLI 599
GL V I +D ++ W YQVG+ GE + + + W K GS+ + L
Sbjct: 560 GLRRV-RIQSNGSLKDFTNNPWGYQVGLLGEKLQIYTDVGSQKVQWSKFGSS--TSGLLT 616
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
WYKT F AP G P+ALNL SM KG+ WVNGQSIGRYW ++L PS
Sbjct: 617 WYKTVFDAPAGNEPVALNLVSMRKGEVWVNGQSIGRYWVSFLTPS--------------- 661
Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEAD 719
G+P+Q YHIPR+++ P NLLV+ EE G P IS+ + IC VSE+
Sbjct: 662 -------GKPSQIWYHIPRSFLKPTGNLLVLLEEETGHPVGISIGKVSIPKICGHVSESH 714
Query: 720 PPPVDS---WKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH-MD 775
PPV S +K + P+V+L C +I+ I FAS+G P G+C S+ G+CH +
Sbjct: 715 LPPVISRVIYKKHENHHGRRPKVQLRCPSNRNISRILFASFGTPSGDCQSYAVGSCHSSN 774
Query: 776 VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
V+KAC+G+ CS+P+S G CPG KAL V+ C+
Sbjct: 775 SRSNVEKACLGKGMCSVPLSYKRFG--GDPCPGTPKALLVDVQCT 817
>gi|297851602|ref|XP_002893682.1| Beta-galactosidase 15 precursor [Arabidopsis lyrata subsp. lyrata]
gi|297339524|gb|EFH69941.1| Beta-galactosidase 15 precursor [Arabidopsis lyrata subsp. lyrata]
Length = 780
Score = 703 bits (1815), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/828 (44%), Positives = 506/828 (61%), Gaps = 83/828 (10%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V++D RA+ IDG RRVL SGSIHYPRST E+WP+LI+K KEGGL+ IETYVFWN HEP R
Sbjct: 23 VSHDGRAITIDGHRRVLLSGSIHYPRSTTEMWPDLIKKGKEGGLDAIETYVFWNAHEPTR 82
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
QY F G DL+RF+KT+Q+ G++ LRIGPY CAEWNYGGFPVWLH +PG++FRTTN
Sbjct: 83 RQYDFSGNLDLIRFLKTIQDEGMYGVLRIGPYVCAEWNYGGFPVWLHNMPGMEFRTTNTA 142
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
F EM+ F I++++K+E LFASQGGPIILAQ+ENEYGNV +YG G+ Y+KW A+ A
Sbjct: 143 FMNEMQNFTTMIVEMVKKEKLFASQGGPIILAQIENEYGNVIGSYGEAGKAYIKWCANMA 202
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
+L+ VPW+MCQQ+DAP P++NTCNG+YCD FTPN+P+ P MWTEN++GW+ ++G P
Sbjct: 203 NSLDVGVPWIMCQQDDAPQPMLNTCNGYYCDNFTPNNPNTPKMWTENWTGWYKNWGGKDP 262
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
R ED+AFAVARFF+ GGTFQNYYMY GGTNF RTAGGP + T+YDYDAP+DE+G + Q
Sbjct: 263 HRTTEDVAFAVARFFQRGGTFQNYYMYHGGTNFDRTAGGPYITTTYDYDAPLDEFGNLNQ 322
Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
PK+GHL++LH + E+ L + + G + A +Y K+ + F+ N + +SDA
Sbjct: 323 PKYGHLKQLHDVLHAMEKTLTYGNISTVDFGNLVTATVY-KTEEGSSCFIGNVNETSDAK 381
Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
+ F G Y +PAWSVSILPDCK +NTAK+ +Q + ++ N E ++ +SW
Sbjct: 382 INFQGTFYDVPAWSVSILPDCKTETYNTAKINTQTS----VMVKKANEAENEPSTLKWSW 437
Query: 425 YEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM---PGQGKEVFLNIES 478
E + + G L +Q + D SDYLWY ++++ P GK + L I S
Sbjct: 438 RPENIDNVLLKGKGESTMRQLFDQKVVSNDESDYLWYMTTVNIKEQDPVWGKNMSLRINS 497
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
H FVN + + + +++ + + N G N + +LS+ VGL NYGA+F+
Sbjct: 498 TAHVLHAFVNGQHIGNYRAENGKFHYVFEQDAKFNPGANVITLLSITVGLPNYGAFFENV 557
Query: 539 GAGLFSVILIDLKNGK----RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
AG+ + I +NG +DLS+ +W Y+ G+ G N F +
Sbjct: 558 PAGITGPVFIIGRNGDETIVKDLSTHKWSYKTGLSG---------FENQLFSSESP---- 604
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
+T+ AP G P+ ++L +GKG AW+NG +IGRYW A+LA GC+ +
Sbjct: 605 --------STWSAPLGSEPVVVDLLGLGKGTAWINGNNIGRYWPAFLADIDGCSAE---- 652
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHP-GENLLVIHEELGGDPSKISLLTKTGQHICS 713
YH+PR++++ G+N LV+ EE+GG+PS ++ T ++C+
Sbjct: 653 -------------------YHVPRSFLNSDGDNTLVLFEEIGGNPSLVNFQTIGVGNVCA 693
Query: 714 FVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
V E + + L+C G I++I FAS+G P GNCGSF G C
Sbjct: 694 NVYEKN------------------VLELSC-NGKPISSIKFASFGNPGGNCGSFEKGTCE 734
Query: 774 M--DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
D I+ + CVG+ +CSI VS G A C GL K LAVEA C
Sbjct: 735 ASNDAAAILTQECVGKEKCSIDVSEKKFG--AADCGGLAKRLAVEAIC 780
>gi|30699255|ref|NP_177866.2| beta-galactosidase 16 [Arabidopsis thaliana]
gi|152013367|sp|Q8GX69.2|BGL16_ARATH RecName: Full=Beta-galactosidase 16; Short=Lactase 16; Flags:
Precursor
gi|332197854|gb|AEE35975.1| beta-galactosidase 16 [Arabidopsis thaliana]
Length = 815
Score = 702 bits (1811), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/831 (46%), Positives = 504/831 (60%), Gaps = 51/831 (6%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
ANVTYD R+L+IDG+ ++L SGSIHY RSTP++WP LI K+K GG++V++TYVFWN HEP
Sbjct: 23 ANVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVVDTYVFWNVHEP 82
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
+GQ+ F G D+V+F+K V+ GL++ LRIGP+ EW+YGG P WLH + GI FRT N
Sbjct: 83 QQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGIVFRTDN 142
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK MKR+ I+ LMK ENL+ASQGGPIIL+Q+ENEYG V A+ G+ YVKW A
Sbjct: 143 EPFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVGRAFRQEGKSYVKWTAK 202
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFG 240
AV L+T VPWVMC+Q+DAPDP++N CNG C + F PNSP+KP +WTEN++ ++ ++G
Sbjct: 203 LAVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSFYQTYG 262
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
R ED+AF VA F G+F NYYMY GGTNFGR A V TSY AP+DEYG
Sbjct: 263 EEPLIRSAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVITSYYDQAPLDEYG 321
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+RQPKWGHL+ELH A+KLCEE L+S T LG A ++ K +N CAA L N D
Sbjct: 322 LLRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFGKKANLCAAILVNQD-K 380
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
++ V F + Y L SVS+LPDCKNV FNTAKV +Q N + + + L +
Sbjct: 381 CESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYN------TRTRKARQNLSSPQ 434
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
+ + E V S L E +NTT+DTSDYLW T +G L + LG
Sbjct: 435 MWEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQTTRFQ--QSEGAPSVLKVNHLG 492
Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
HA FVN + + +G FL+ K + LN G N L +LS+MVGL N GA +
Sbjct: 493 HALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLALLSVMVGLPNSGAHLERRVV 552
Query: 541 GLFSVILIDLKNGKRDL--SSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
G SV + NG+ L ++ W YQVG++GE + + WKQ ++ L
Sbjct: 553 GSRSV---KIWNGRYQLYFNNYSWGYQVGLKGEKFHVYTEDGSAKVQWKQYRD-SKSQPL 608
Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
WYK +F PEG+ P+ALNL SMGKG+AWVNGQSIGRYW ++
Sbjct: 609 TWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIGRYWVSF------------------ 650
Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI-HEELGGDPSKISLLTKTGQHICSFVSE 717
+ G P+Q YHIPR+++ P NLLVI EE G+P I++ T + +C VS
Sbjct: 651 ----HTYKGNPSQIWYHIPRSFLKPNSNLLVILEEEREGNPLGITIDTVSVTEVCGHVSN 706
Query: 718 ADPPPVDS------WKPNLGV-VSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
+P PV S + NL P+V+L C G I+ I FAS+G P G+CGS+ G
Sbjct: 707 TNPHPVISPRKKGLNRKNLTYRYDRKPKVQLQCPTGRKISKILFASFGTPNGSCGSYSIG 766
Query: 771 ACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+CH + L +VQKAC+ + CS+PV S G +CP +K+L V A CS
Sbjct: 767 SCHSPNSLAVVQKACLKKSRCSVPVWSKTFG--GDSCPHTVKSLLVRAQCS 815
>gi|225428017|ref|XP_002278545.1| PREDICTED: beta-galactosidase 13 [Vitis vinifera]
gi|297744615|emb|CBI37877.3| unnamed protein product [Vitis vinifera]
Length = 833
Score = 702 bits (1811), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/832 (43%), Positives = 503/832 (60%), Gaps = 45/832 (5%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ VTYD R+L+++G+R +L SGSIHYPRSTPE+WP++++K+K GGL +I+TYVFWN HE
Sbjct: 29 AKTVTYDGRSLIVNGRRELLFSGSIHYPRSTPEMWPDILQKAKHGGLNLIQTYVFWNIHE 88
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P+ GQ+ FEG +DLV+F+K + + GL+ LRIGP+ AEWN+GGFP WL +P I FR+
Sbjct: 89 PVEGQFNFEGNYDLVKFIKLIGDYGLYATLRIGPFIEAEWNHGGFPYWLREVPDIIFRSY 148
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK M+++ II++MK+ LFA QGGPIILAQ+ENEY +++ AY G YV+WA
Sbjct: 149 NEPFKYHMEKYSRMIIEMMKEAKLFAPQGGPIILAQIENEYNSIQLAYRELGVQYVQWAG 208
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSF 239
AV L VPW+MC+Q+DAPDP+INTCNG +C D FT PN P+KP +WTEN++ + F
Sbjct: 209 KMAVGLGAGVPWIMCKQKDAPDPVINTCNGRHCGDTFTGPNRPNKPSLWTENWTAQYRVF 268
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
G R EDLAF+VARF GT NYYMY GGTNFGRT G V T Y +AP+DEY
Sbjct: 269 GDPPSQRAAEDLAFSVARFISKNGTLANYYMYHGGTNFGRT-GSSFVTTRYYDEAPLDEY 327
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYD 358
G R+PKWGHL++LH A++LC++ L + P +KLG E Y K ++ CAAFL N
Sbjct: 328 GLQREPKWGHLKDLHSALRLCKKALFTGSPGVEKLGKDKEVRFYEKPGTHICAAFLTNNH 387
Query: 359 SSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
S A +TF G YFLP S+SILPDCK VV+NT +V++Q N + F + K N+ L
Sbjct: 388 SREAATLTFRGEEYFLPPHSISILPDCKTVVYNTQRVVAQHNARN--FVKSKIANKNL-- 443
Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGKEV-F 473
+ +E + + + + E N KD SDY W+ SI + +P + +
Sbjct: 444 --KWEMSQEPIPVMTDMKILTKSPMELYNFLKDRSDYAWFVTSIELSNYDLPMKKDIIPV 501
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L I +LGHA L FVN + +G++ NF+ K ++ G N + +L M VGL N GA
Sbjct: 502 LQISNLGHAMLAFVNGNFIGSAHGSNVEKNFVFRKPVKFKAGTNYIALLCMTVGLPNSGA 561
Query: 534 WFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
+ + AG+ SV ++ L G D+++ W QVGV GE++ ++ W
Sbjct: 562 YMEHRYAGIHSVQILGLNTGTLDITNNGWGQQVGVNGEHVKAYTQGGSHRVQWTAAKG-- 619
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
++ WYKT F PEG P+ L + SM KG AWVNG++IGRYW +YL+P
Sbjct: 620 KGPAMTWYKTYFDMPEGNDPVILRMTSMAKGMAWVNGKNIGRYWLSYLSP---------- 669
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
+P+Q+ YH+PR W+ P +NLLVI EE GG+P +I + ICS
Sbjct: 670 ------------LEKPSQSEYHVPRAWLKPSDNLLVIFEETGGNPEEIEVELVNRDTICS 717
Query: 714 FVSEADPPPVDSWKPNLGVVSS-----SPQVRLACERGWHIAAINFASYGIPEGNCGSFR 768
V+E PP V SW+ + + + P+ L C I ++FAS+G P G CG F
Sbjct: 718 IVTEYHPPHVKSWQRHDSKIRAVVDEVKPKGHLKCPNYKVIVKVDFASFGNPLGACGDFE 777
Query: 769 PGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G C + +V++ C+G+ C IP+ + ++GAC + K LAV+ C
Sbjct: 778 MGNCTAPNSKKVVEQHCMGKTTCEIPMEAGIFDGNSGACSDITKTLAVQVRC 829
>gi|449435864|ref|XP_004135714.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cucumis
sativus]
Length = 712
Score = 701 bits (1810), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/713 (49%), Positives = 466/713 (65%), Gaps = 32/713 (4%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD +A++I+ +RR+L SGSIHYPRSTP++WP+LI+K+K+GGL++IETYVFWN HEP
Sbjct: 22 VTYDEKAIIINDQRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDIIETYVFWNGHEPSE 81
Query: 65 GQYYFEG-RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
G+ +E ++ + ++ A L P + GFP+WL F+PGI FRT N
Sbjct: 82 GKVTWEDFLYEQILYINCFHVA-----LFXFPPYFXFQKFSGFPIWLKFVPGIAFRTDNE 136
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK M++F+ KI+D+MK E L+ +QGGPIIL+Q+ENEYG VEW G G+ Y KW A
Sbjct: 137 PFKAAMQKFVTKIVDMMKLEKLYHTQGGPIILSQIENEYGPVEWQIGAPGKSYTKWFAQM 196
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
AV+L T VPWVMC+QEDAPDP+I+TCNGFYC+ F PN KP +WTEN+SGW+ +FG
Sbjct: 197 AVDLKTGVPWVMCKQEDAPDPLIDTCNGFYCENFKPNQIYKPKIWTENWSGWYTAFGGPT 256
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P+RP ED+AF+VARF + G+ NYY+Y GGTNFGRT+ G +ATSYD+DAPIDEYG IR
Sbjct: 257 PYRPPEDVAFSVARFIQNNGSLVNYYVYHGGTNFGRTS-GLFIATSYDFDAPIDEYGLIR 315
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
+PKWGHLR+LHKAIKLCE L+S+DPT LG EA ++ KSS+ CAAFLANYD+S+
Sbjct: 316 EPKWGHLRDLHKAIKLCEPALVSADPTSTWLGKNQEARVF-KSSSACAAFLANYDTSASV 374
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
V F N Y LP WS+SILPDCK V FNTA++ K+ ++ S+F
Sbjct: 375 KVNFWNNPYDLPPWSISILPDCKTVTFNTAQI------------GVKSYEAKMMPISSFG 422
Query: 424 WY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
W EE + + L EQ++ T DT+DYLWY I + + GK L+
Sbjct: 423 WLSYKEEPASAYAKDTTTKDGLVEQVSVTWDTTDYLWYMQDISIDSTEGFLKSGKWPLLS 482
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ S GH VF+N +L YG+ + +K + L +G+N L +LS+ VGL N G F
Sbjct: 483 VNSAGHLLHVFINGQLSGSVYGSLEDPRITFSKYVNLKQGVNKLSMLSVTVGLPNVGLHF 542
Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
D AG+ V L L G RD+S +W Y+VG+ GE + L +NS W +GS L
Sbjct: 543 DTWNAGVLGPVTLKGLNEGTRDMSKYKWSYKVGLSGESLNLYSDKGSNSVQWTKGS-LTQ 601
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
+ L WYKTTF P G PL L+++SM KGQ WVNG+SIGRY+ Y+A G KC Y
Sbjct: 602 KQPLTWYKTTFKTPAGNEPLGLDMSSMSKGQIWVNGRSIGRYFPGYIA--NGKCDKCSYA 659
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
G + KC +CG+P+Q YHIPR W+ P +NLLVI EE+GG P ISL+ +T
Sbjct: 660 GLFTEKKCLGNCGEPSQKWYHIPRDWLSPSDNLLVIFEEIGGSPDGISLVKRT 712
>gi|449517114|ref|XP_004165591.1| PREDICTED: beta-galactosidase 9-like, partial [Cucumis sativus]
Length = 763
Score = 701 bits (1810), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/760 (47%), Positives = 486/760 (63%), Gaps = 48/760 (6%)
Query: 101 WNY-GGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVE 159
W+Y GFP+WL +PGI+FRT N PFKEEM+RF+ KI+DL++ E LF QGGP+I+ QVE
Sbjct: 1 WDYCRGFPLWLRDVPGIEFRTDNAPFKEEMQRFVKKIVDLLRDEKLFCWQGGPVIMLQVE 60
Query: 160 NEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTP 219
NEYGN+E +YG G+ Y+KW + A+ L VPWVMCQQ+DAP IIN+CNG+YCDGF
Sbjct: 61 NEYGNIESSYGKRGQEYIKWVGNMALGLGAEVPWVMCQQKDAPSTIINSCNGYYCDGFKA 120
Query: 220 NSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR 279
NSPSKPI WTEN++GWF S+G P RPVEDLAF+VARFF+ G+FQNYYMYFGGTNFGR
Sbjct: 121 NSPSKPIFWTENWNGWFTSWGERSPHRPVEDLAFSVARFFQREGSFQNYYMYFGGTNFGR 180
Query: 280 TAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKL 338
TAGGP TSYDYD+PIDEYG IR+PKWGHL++LH A+KLCE L+S+D P + KLG K
Sbjct: 181 TAGGPFYITSYDYDSPIDEYGLIREPKWGHLKDLHTALKLCEPALVSADSPQYIKLGPKQ 240
Query: 339 EAHIYHKSSN-------------DCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDC 385
EAH+YH S +C+AFLAN D V FNG Y LP WSVSILPDC
Sbjct: 241 EAHVYHMKSQTDDLTLSKLGTLRNCSAFLANIDERKAVAVKFNGQTYNLPPWSVSILPDC 300
Query: 386 KNVVFNTAKVISQRN----NGDHPFA-------QQKNVNELLLASSAFSWYEEKVGISGN 434
+NVVFNTAKV +Q + P + + NEL + ++++ +E +GI +
Sbjct: 301 QNVVFNTAKVAAQTSIKILELYAPLSANVSLKLHATDQNELSIIANSWMTVKEPIGIWSD 360
Query: 435 RSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGK-------EVFLNIESLGHAALVFV 487
++F + E +N TKD SDYLWY IHV + + I+S+ VFV
Sbjct: 361 QNFTVKGILEHLNVTKDRSDYLWYMTRIHVSNDDIRFWKERNITPTITIDSVRDVFRVFV 420
Query: 488 NKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVI- 546
N KL G + F+ + ++ EG N L +LS +GLQN GA+ + GAG+ I
Sbjct: 421 NGKLTGSAIGQ--WVKFV--QPVQFLEGYNDLLLLSQAMGLQNSGAFIEKDGAGIRGRIK 476
Query: 547 LIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFL 606
L KNG DLS W YQVG++GE++ + + W + S + + WYK F
Sbjct: 477 LTGFKNGDIDLSKSLWTYQVGLKGEFLNFYSLEENEKADWTELSVDAIPSTFTWYKAYFS 536
Query: 607 APEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHC 666
+P+G P+A+NL SMGKGQAWVNG IGRYWS ++P GC +KCDYRG+Y++ KC +C
Sbjct: 537 SPDGTDPVAINLGSMGKGQAWVNGHHIGRYWSV-VSPKDGCPRKCDYRGAYNSGKCATNC 595
Query: 667 GQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPV--- 723
G+P Q+ YHIPR+W+ NLLV+ EE GG+P +I + + IC VSE+ P +
Sbjct: 596 GRPTQSWYHIPRSWLKESSNLLVLFEETGGNPLEIVVKLYSTGVICGQVSESHYPSLRKL 655
Query: 724 -DSWKPNLGVVS--SSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPI 779
+ + + +S ++P++ L C+ G I+++ FASYG P+G+C F G CH + L +
Sbjct: 656 SNDYISDGETLSNRANPEMFLHCDDGHVISSVEFASYGTPQGSCNKFSRGPCHATNSLSV 715
Query: 780 VQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
V +AC+G+ C++ +S++ G C ++K LAVEA C
Sbjct: 716 VSQACLGKNSCTVEISNSAFG--GDPCHSIVKTLAVEARC 753
>gi|218184335|gb|EEC66762.1| hypothetical protein OsI_33138 [Oryza sativa Indica Group]
Length = 828
Score = 699 bits (1805), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/836 (44%), Positives = 504/836 (60%), Gaps = 59/836 (7%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V+YD R+L++DG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL IETYVFWN HEP R
Sbjct: 31 VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
++ FEG +D+VRF K +Q AG++ LRIGPY C EWNYGG PVWL IPGI+FR N P
Sbjct: 91 REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPVWLRDIPGIKFRLHNKP 150
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYG-------NVEWAYGVGGELYV 177
F+ EM+ F I+ MK N+FA QGGPIILAQ+ENEYG N++ A+ Y+
Sbjct: 151 FENEMEAFTTLIVKKMKDANMFAGQGGPIILAQIENEYGYTMLQPENIQSAHE-----YI 205
Query: 178 KWAADTAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWF 236
W AD A N VPW+MCQQ+ D P ++NTCNGFYC + N S P MWTEN++GW+
Sbjct: 206 HWCADMANKQNVGVPWIMCQQDNDVPPNVVNTCNGFYCHEWFSNRTSIPKMWTENWTGWY 265
Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPI 296
+ RP ED+AFAVA FF+ G+ QNYYMY GGTNFGRTAGGP + TSYDYDAP+
Sbjct: 266 RDWDQPEFRRPTEDIAFAVAMFFQMRGSLQNYYMYHGGTNFGRTAGGPYITTSYDYDAPL 325
Query: 297 DEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLAN 356
DEYG +RQPK+GHL+ELH + E+ L+ D G + Y ++ A F+ N
Sbjct: 326 DEYGNLRQPKYGHLKELHSVLMSMEKILLHGDYIDTNYGDNVTVTKYTLNATS-ACFINN 384
Query: 357 YDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELL 416
D NVT +G +FLPAWSVSILPDCK V FN+AK+ +Q + ++ E
Sbjct: 385 RFDDRDVNVTLDGTTHFLPAWSVSILPDCKTVAFNSAKIKTQTT----VMVNKTSMVEQQ 440
Query: 417 LASSAFSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVF 473
+SW E + +F + +L EQI TT D SDYLWY S+ G+G V
Sbjct: 441 TEHFKWSWMPENLRPFMTDEKGNFRKNELLEQIVTTTDQSDYLWYRTSLE-HKGEGSYV- 498
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L + + GH FVN KLV Y ++ F + ++L++G N + +LS VGL+NYG
Sbjct: 499 LYVNTTGHELYAFVNGKLVGQQYSPNENFTFQLKSPVKLHDGKNYISLLSGTVGLRNYGG 558
Query: 534 WFDVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQG 589
F++ AG+ V LID DLS+ W Y+ G+ GEY I LDK + +
Sbjct: 559 SFELLPAGIVGGPVKLIDSSGSAIDLSNNSWSYKAGLAGEYRKIYLDK---PGNKWRSHN 615
Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
ST+P+N+ WYKTTF AP G+ + ++L + KG AWVNG S+GRYW +Y+A
Sbjct: 616 STIPINRPFTWYKTTFQAPAGEDSVVVDLHGLNKGVAWVNGNSLGRYWPSYVAADMPGCH 675
Query: 650 KCDYRGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLL 704
CDYRG + +A KC CG+P+Q LYH+PR+++H GE N L++ EE GGDPS++++
Sbjct: 676 HCDYRGVFKAEVEAQKCLTGCGEPSQQLYHVPRSFLHKGEPNTLILFEEAGGDPSEVAVR 735
Query: 705 TKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGN 763
T +C+ D V L+C G I++++ AS+G+ G
Sbjct: 736 TVVEGSVCASAELGD------------------TVTLSCGAHGRTISSVDVASFGVARGR 777
Query: 764 CGSFRPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
CGS+ G ACVG+ C++ V+ A+ +AG G+ L V+A C
Sbjct: 778 CGSYDGGCDSKVAYDAFAAACVGKESCTVLVTDAF--ANAGCVSGV---LTVQATC 828
>gi|224082320|ref|XP_002306647.1| predicted protein [Populus trichocarpa]
gi|222856096|gb|EEE93643.1| predicted protein [Populus trichocarpa]
Length = 764
Score = 699 bits (1804), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/820 (45%), Positives = 496/820 (60%), Gaps = 59/820 (7%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NVTYD R+L+I+G+ ++L SGSIHYPRSTP++W LI K+K GG++VI+TYVFWN HEP
Sbjct: 1 NVTYDGRSLIINGQHKILFSGSIHYPRSTPDMWSSLISKAKAGGIDVIQTYVFWNLHEPQ 60
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+GQ+YF GR DLVRFVK +Q GL+ LRIGP+ +EW YGG P WLH IPG+ +R+ N
Sbjct: 61 QGQFYFNGRADLVRFVKEIQAQGLYACLRIGPFIESEWTYGGLPFWLHDIPGMVYRSDNQ 120
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK MKRF+++I+ +MK E L+ASQGGPIIL+QVENEY NVE A+ G YV+WAA
Sbjct: 121 PFKYHMKRFVSRIVSMMKSEKLYASQGGPIILSQVENEYKNVEAAFHEKGPSYVRWAALM 180
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGY 241
AVNL T VPWVMC+Q+DAPDP+IN+CNG C + F PNSP+KP +WTE+++ ++ +G
Sbjct: 181 AVNLQTGVPWVMCKQDDAPDPVINSCNGMRCGETFAGPNSPNKPSIWTEDWTSFYQVYGE 240
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
R +D+AF VA F G++ NYYMY GGTNFGRTA + + YD AP+DEYG
Sbjct: 241 ETYMRSAQDIAFHVALFIAKTGSYVNYYMYHGGTNFGRTASAFTITSYYD-QAPLDEYGL 299
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
IRQPKWGHL+ELH AIK C + L+ LG +A+++ +S CAAFL N D
Sbjct: 300 IRQPKWGHLKELHAAIKSCSKLLLHGAHKTFSLGPLQQAYVFQGNSGQCAAFLVNNDGKQ 359
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
+ V F N Y LP S+SILPDCK + FNTAKV +Q + N+ +
Sbjct: 360 EVEVLFQSNSYKLPQKSISILPDCKTMTFNTAKVNAQYT------TRSMKPNQKFNSVGK 413
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGH 481
+ Y E + S L E ++TTKDTSDYLWYT + VF N +S GH
Sbjct: 414 WEEYNEPIPEFDKTSLRANRLLEHMSTTKDTSDYLWYTFRFQQNLPNAQSVF-NAQSHGH 472
Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
+VN FG+G+H +F + + L G N++ +LS VGL + GA+ + AG
Sbjct: 473 VLHAYVNGVHAGFGHGSHQNTSFSLQTTVRLKNGTNSVALLSATVGLPDSGAYLERRVAG 532
Query: 542 LFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWY 601
L V + + +D ++ W YQVG+ GE + + + +N W + L N+ L+WY
Sbjct: 533 LRRVRIQN-----KDFTTYTWGYQVGLLGERLQIYTENGSNKVKWNK---LGTNRPLMWY 584
Query: 602 KTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASK 661
KT F AP G P+ALNL SMGKG+AWVNGQSIGRYW ++
Sbjct: 585 KTLFDAPAGNDPVALNLGSMGKGEAWVNGQSIGRYWVSFHTSQ----------------- 627
Query: 662 CQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPP 721
G P+QT Y+IPR ++ P NLLV+ EE G P I++ T + +C + SE
Sbjct: 628 -----GSPSQTWYNIPRAFLKPTGNLLVLLEEEKGYPPGITVDTVSVTKVCGYASE---- 678
Query: 722 PVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHMDVLPI-V 780
S V+L+C +I++I FAS+G P GNC S+ G CH V
Sbjct: 679 ------------SHLSAVQLSCPLKRNISSIIFASFGTPSGNCESYAIGNCHSSSSKANV 726
Query: 781 QKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+KAC+G+ CSIP S+ + G CPG+ K L VEA C+
Sbjct: 727 EKACIGKRSCSIPQSNHFFG--GDPCPGIPKVLLVEAKCT 764
>gi|297842521|ref|XP_002889142.1| hypothetical protein ARALYDRAFT_476906 [Arabidopsis lyrata subsp.
lyrata]
gi|297334983|gb|EFH65401.1| hypothetical protein ARALYDRAFT_476906 [Arabidopsis lyrata subsp.
lyrata]
Length = 818
Score = 697 bits (1799), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/835 (45%), Positives = 501/835 (60%), Gaps = 54/835 (6%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ANVTYD R+L+IDG+ ++L SGSIHY RSTP++WP LI K+K GG++VI+TYVFWN HE
Sbjct: 22 AANVTYDGRSLIIDGQHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVIDTYVFWNIHE 81
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P +GQ+ F GR D+V+F+K V+ GL++ LRIGP+ EW+YGG P WLH + GI FRT
Sbjct: 82 PQQGQFDFSGRRDIVKFIKEVKAHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGIVFRTD 141
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK MKR+ I+ LMK ENL+ASQGGPIIL+Q+ENEYG V A+ G+ YVKWAA
Sbjct: 142 NEPFKYHMKRYAQMIVKLMKSENLYASQGGPIILSQIENEYGMVARAFRQDGKSYVKWAA 201
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSF 239
AV L+T VPWVMC+Q+DAPDP++N CNG C + F PNSP+KP +WTEN++ ++ ++
Sbjct: 202 KLAVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSFYQTY 261
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
G R ED+AF VA F G+F NYYMY GGTNFGR A V TSY AP+DEY
Sbjct: 262 GEEPLIRSAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVITSYYDQAPLDEY 320
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
G +RQPKWGHL+ELH A+KLCEE L+S T LG A ++ K +N CAA L N D
Sbjct: 321 GLLRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFGKKANLCAALLVNQD- 379
Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
D V F + Y L S+S+LPDCKNV FNTAKV +Q N + + + L +
Sbjct: 380 KCDCTVQFRNSSYRLSPKSISVLPDCKNVAFNTAKVNAQYN------TRTRKPRQNLSSP 433
Query: 420 SAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESL 479
+ + E V S L E +NTT+DTSDYLW T +G L + L
Sbjct: 434 HMWEKFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQTTRFE--QSEGAPSVLKVNHL 491
Query: 480 GHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAG 539
GH FVN++ + +G +FL+ K + LN G N + +LS+MVGL N GA +
Sbjct: 492 GHVLHAFVNERFIGSMHGTFKAHSFLLEKNMSLNNGTNNMALLSVMVGLPNSGAHLERRV 551
Query: 540 AGLFSVILIDLKNGKRDL--SSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
G SV ++ NG L ++ W YQVG++GE + A WKQ ++
Sbjct: 552 VGSRSV---NIWNGSYQLFFNNYSWGYQVGLKGEKYHVYTEDGAKKVQWKQYRD-SKSQP 607
Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
L WYK +F PEG+ P+ALNL SMGKG+AWVNGQSIGRYW ++
Sbjct: 608 LTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIGRYWVSFYTSK------------- 654
Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI-HEELGGDPSKISLLTKTGQHICSFVS 716
G P+Q YHIPR+++ P NLLVI EE G P I++ T + +C VS
Sbjct: 655 ---------GNPSQIWYHIPRSFLKPNSNLLVILEEEREGYPLGITIDTVSVTEVCGHVS 705
Query: 717 EADPPPVDSWKPN----------LGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGS 766
P PV S + P+V+L C G I+ + FA++G P G+CGS
Sbjct: 706 NTHPHPVISPRKKGHNRNEQRHLKYRYDRKPKVQLQCPTGRKISKVLFATFGNPNGSCGS 765
Query: 767 FRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ G+CH + L +VQKAC+ + CS+PV S G CP +K+L V A CS
Sbjct: 766 YSVGSCHSPNSLAVVQKACLRKSRCSVPVWSKTFG--GDLCPQTVKSLLVRAQCS 818
>gi|125556152|gb|EAZ01758.1| hypothetical protein OsI_23787 [Oryza sativa Indica Group]
Length = 828
Score = 696 bits (1795), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/831 (44%), Positives = 493/831 (59%), Gaps = 49/831 (5%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTY+ R+LVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL+ IETYVFWN HEP R
Sbjct: 31 VTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
QY F G +D+VRF K +Q AGL+ LRIGPY C EWNYGG P WL IPG+QFR N P
Sbjct: 91 RQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNAP 150
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAAD 182
F+ EM+ F I++ MK N+FA QGGPIILAQ+ENEYGN+ + Y+ W AD
Sbjct: 151 FENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEYIHWCAD 210
Query: 183 TAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A N VPW+MCQQ+ D P ++NTCNGFYC + PN P +WTEN++GWF ++
Sbjct: 211 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 270
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
R ED+AFAVA FF+ G+ QNYYMY GGTNFGRT+GGP + TSYDYDAP+DEYG
Sbjct: 271 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 330
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPK+GHL++LH IK E+ L+ + K+ Y S A F+ N + +
Sbjct: 331 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDKVTVTKYTLDSTS-ACFINNRNDNM 389
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
D NVT +G + LPAWSVSILPDCK V FN+AK+ +Q + N+ E S
Sbjct: 390 DVNVTLDGTTHLLPAWSVSILPDCKTVAFNSAKIKAQTT----VMVNKANMVEKEPESLK 445
Query: 422 FSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
+SW E + S+ + +L EQI T+ D SDYLWY SI+ +F+N +
Sbjct: 446 WSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSINHKGEASYTLFVN--T 503
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
GH FVN LV + + F + +L++G N + +LS +GL+NYG F+
Sbjct: 504 TGHELYAFVNGMLVGQNHSPNGHFVFQLESPAKLHDGKNYISLLSATIGLKNYGPLFEKM 563
Query: 539 GAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQGSTLPV 594
AG+ V LID DLS+ W Y+ G+ GEY I LDK ++ T+P+
Sbjct: 564 PAGIVGGPVKLIDNNGKGIDLSNSSWSYKAGLAGEYRQIHLDK---PGCTWDNNNGTVPI 620
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
NK WYKTTF AP G+ + ++L + KG AWVNG ++GRYW +Y A G CDYR
Sbjct: 621 NKPFTWYKTTFQAPAGEDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGGCHHCDYR 680
Query: 655 GSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLTKTGQ 709
G + D KC CG+P+Q YH+PR+++ GE N L++ EE GGDPS +S T
Sbjct: 681 GVFQAEGDGQKCLTGCGEPSQRFYHVPRSFLKNGEPNTLILFEEAGGDPSHVSFRTVAAG 740
Query: 710 HICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGNCGSFR 768
+C+ D + L+C + I+AIN S+G+ G CG+++
Sbjct: 741 SVCASAEVGD------------------TITLSCGQHSKTISAINMTSFGVARGQCGAYK 782
Query: 769 PGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G +AC+G+ C++ +++A V+ C L L V+A C
Sbjct: 783 GGCESKAAYKAFTEACLGKESCTVQITNA---VTGSGC--LSNVLTVQASC 828
>gi|330689960|gb|AEC33272.1| beta-galactosidase [Ziziphus jujuba]
Length = 730
Score = 694 bits (1792), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/729 (49%), Positives = 459/729 (62%), Gaps = 27/729 (3%)
Query: 104 GGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYG 163
GGFPVWL ++PGI FRT N PFK M+ F KI+ ++K ENLFASQGGPIIL+Q+ENEYG
Sbjct: 1 GGFPVWLKYVPGISFRTDNGPFKTAMQGFTQKIVQMLKSENLFASQGGPIILSQIENEYG 60
Query: 164 NVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPS 223
A G G Y+ WAA AV LNT VPWVMC+++DAPDP+IN CNGFYCDGF+PN P
Sbjct: 61 PESKALGAAGRSYINWAAKMAVGLNTGVPWVMCKEDDAPDPVINACNGFYCDGFSPNKPY 120
Query: 224 KPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG 283
KPI+WTE +SGWF FG V RPV+DLAFAVARF + GG++ NYYMY GGTNFGRTAGG
Sbjct: 121 KPILWTEAWSGWFTEFGGTVHQRPVQDLAFAVARFIQKGGSYFNYYMYHGGTNFGRTAGG 180
Query: 284 PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY 343
P V TSYDYDAPIDEYG R+PK+ HL+ELHKAIKL E+ L+S+ PT LG +A+IY
Sbjct: 181 PFVTTSYDYDAPIDEYGLTREPKYSHLKELHKAIKLSEDALVSAGPTITSLGTYEQAYIY 240
Query: 344 HKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGD 403
+ CAAFLANY+S S A V FN Y LP WS+SILPDC+NV +NTA V
Sbjct: 241 NSGPRKCAAFLANYNSKSAARVLFNNRHYNLPPWSISILPDCRNVAYNTALV-------- 292
Query: 404 HPFAQQKNVNELLLASSAFSW--YEEKVGISGNRSFVRP-DLAEQINTTKDTSDYLWYTA 460
Q +V+ L +S SW Y+E + R+ + L EQIN T+DTSDYLWY
Sbjct: 293 --GVQTSHVHMLPTGTSLLSWETYDEVISSLDERARMTAVGLLEQINVTRDTSDYLWYMT 350
Query: 461 SIHVMPGQ-----GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEG 515
S+ + + G++ LN++S GHA VF+N + +G + F + L G
Sbjct: 351 SVDISSSESFLRGGQKPTLNVQSAGHAVRVFINGQFSGSAFGTREHRQFTFTGPVNLRAG 410
Query: 516 INTLDILSMMVGLQNYGAWFDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIG 574
N + +LS+ VGL N G +++ G+ + ++ L NGKRDL+ +W YQVG++GE +
Sbjct: 411 SNKISLLSIAVGLPNVGFHYELWETGVLGPVFLNGLDNGKRDLTWQKWSYQVGLKGEAMN 470
Query: 575 LDKISLANSSFWKQGSTLPVN-KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSI 633
L A+S+ W +GS + + L WYK F AP G PLAL+L SMGKGQ +NGQSI
Sbjct: 471 LVTPEGASSADWVRGSLAARSVQPLTWYKAYFNAPNGNEPLALDLRSMGKGQVRINGQSI 530
Query: 634 GRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEE 693
GRYW+AY + G + C Y G P Q YH+PR+W+ P +NLLVI EE
Sbjct: 531 GRYWTAY---AKGDCEACSYTGHSGRQNVNLVVASPTQRWYHVPRSWLKPKQNLLVIFEE 587
Query: 694 LGGDPSKISLLTKTGQHICSFVSEADPPPVD-SWKPNLGVVSSSPQVRLACERGWHIAAI 752
LGGD SKI+LL ++ ++C+ E P S G V L C G I+AI
Sbjct: 588 LGGDASKIALLRRSLTNVCANAFENHPSMAKYSTSSQDGSKVKEATVNLQCGPGQSISAI 647
Query: 753 NFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLK 811
FAS+G P G CGSF G CH + I++K CVGQ CS+ +S++ G A CP +LK
Sbjct: 648 EFASFGTPSGTCGSFHIGTCHAPNSRSIIEKKCVGQKSCSVTISNSIFG--ADPCPNVLK 705
Query: 812 ALAVEAHCS 820
L VEA CS
Sbjct: 706 RLTVEAVCS 714
>gi|222612650|gb|EEE50782.1| hypothetical protein OsJ_31141 [Oryza sativa Japonica Group]
Length = 828
Score = 693 bits (1788), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/836 (44%), Positives = 503/836 (60%), Gaps = 59/836 (7%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V+YD R+L++DG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL IETYVFWN HEP R
Sbjct: 31 VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
++ FEG +D+VRF K +Q AG++ LRIGPY C EWNYGG PVWL IPGI+FR N P
Sbjct: 91 REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPVWLRDIPGIKFRLHNKP 150
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYG-------NVEWAYGVGGELYV 177
F+ M+ F I+ MK N+FA QGGPIILAQ+ENEYG N++ A+ Y+
Sbjct: 151 FENGMEAFTTLIVKKMKDANMFAGQGGPIILAQIENEYGYTMLQPENIQSAHE-----YI 205
Query: 178 KWAADTAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWF 236
W AD A N VPW+MCQQ+ D P ++NTCNGFYC + N S P MWTEN++GW+
Sbjct: 206 HWCADMANKQNVGVPWIMCQQDNDVPPNVVNTCNGFYCHEWFSNRTSIPKMWTENWTGWY 265
Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPI 296
+ RP ED+AFAVA FF+ G+ QNYYMY GGTNFGRTAGGP + TSYDYDAP+
Sbjct: 266 RDWDQPEFRRPTEDIAFAVAMFFQMRGSLQNYYMYHGGTNFGRTAGGPYITTSYDYDAPL 325
Query: 297 DEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLAN 356
DEYG +RQPK+GHL+ELH + E+ L+ D G + Y ++ A F+ N
Sbjct: 326 DEYGNLRQPKYGHLKELHSVLMSMEKILLHGDYIDTNYGDNVTVTKYTLNATS-ACFINN 384
Query: 357 YDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELL 416
D NVT +G +FLPAWSVSILP+CK V FN+AK+ +Q + ++ E
Sbjct: 385 RFDDRDVNVTLDGTTHFLPAWSVSILPNCKTVAFNSAKIKTQTT----VMVNKTSMVEQQ 440
Query: 417 LASSAFSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVF 473
+SW E + +F + +L EQI TT D SDYLWY S+ G+G V
Sbjct: 441 TEHFKWSWMPENLRPFMTDEKGNFRKNELLEQIVTTTDQSDYLWYRTSLE-HKGEGSYV- 498
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L + + GH FVN KLV Y ++ F + ++L++G N + +LS VGL+NYG
Sbjct: 499 LYVNTTGHELYAFVNGKLVGQQYSPNENFTFQLKSPVKLHDGKNYISLLSGTVGLRNYGG 558
Query: 534 WFDVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQG 589
F++ AG+ V LID DLS+ W Y+ G+ GEY I LDK + +
Sbjct: 559 SFELLPAGIVGGPVKLIDSSGSAIDLSNNSWSYKAGLAGEYRKIYLDK---PGNKWRSHN 615
Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
ST+P+N+ WYKTTF AP G+ + ++L + KG AWVNG S+GRYW +Y+A
Sbjct: 616 STIPINRPFTWYKTTFQAPAGEDSVVVDLHGLNKGVAWVNGNSLGRYWPSYVAADMPGCH 675
Query: 650 KCDYRGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLL 704
CDYRG + +A KC CG+P+Q LYH+PR++++ GE N L++ EE GGDPS++++
Sbjct: 676 HCDYRGVFKAEVEAQKCLTGCGEPSQQLYHVPRSFLNKGEPNTLILFEEAGGDPSEVAVR 735
Query: 705 TKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGN 763
T +C+ D V L+C G I++++ AS+G+ G
Sbjct: 736 TVVEGSVCASAEVGD------------------TVTLSCGAHGRTISSVDVASFGVARGR 777
Query: 764 CGSFRPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
CGS+ G ACVG+ C++ V+ A+ +AG G+ L V+A C
Sbjct: 778 CGSYDGGCESKVAYDAFAAACVGKESCTVLVTDAF--ANAGCVSGV---LTVQATC 828
>gi|357453875|ref|XP_003597218.1| Beta-galactosidase [Medicago truncatula]
gi|355486266|gb|AES67469.1| Beta-galactosidase [Medicago truncatula]
Length = 2260
Score = 691 bits (1783), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/491 (66%), Positives = 385/491 (78%), Gaps = 3/491 (0%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
NV YDHRALVIDGKRRVL SGSIHYPRSTP++WP+LI+KSK+GGL+VIETYVFWN HEP
Sbjct: 20 TNVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSKDGGLDVIETYVFWNLHEP 79
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
++GQY F+GR DLV+FVK V EAGL++HLRIGPY C+EWNYGGFP+WLHFIPGI+FRT N
Sbjct: 80 VKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCSEWNYGGFPLWLHFIPGIKFRTDN 139
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK EMKRF KI+DLMKQE L+ASQGGPIIL+Q+ENEYG+++ AYG G+ Y+ WAA
Sbjct: 140 EPFKVEMKRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGDIDSAYGSAGKSYINWAAK 199
Query: 183 TAVNLNTSVPWVMCQQEDAPDPI-INTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A +L+T VPWVMCQQ DAPDPI INTCNGFYCD FTPNS +KP +WTEN+S W+L FG
Sbjct: 200 MATSLDTGVPWVMCQQADAPDPIVINTCNGFYCDQFTPNSKTKPKLWTENWSAWYLLFGG 259
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
P RPVEDLAFAVARFF+ GGTFQNYYMY GGTNF R+ GGP +ATSYD+DAPIDEYG
Sbjct: 260 GFPHRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRSTGGPFIATSYDFDAPIDEYGV 319
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
IRQPKWGHL+++HKAIKLCEE LI+++P LG LEA +Y K+ + CAAFLAN D+ S
Sbjct: 320 IRQPKWGHLKDVHKAIKLCEEALIAAEPKITYLGPNLEAAVY-KTGSVCAAFLANVDAKS 378
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK-NVNELLLASS 420
D V F+GN Y LPAWSVSILPDCKNVV NTAK+ S + K +++ + S
Sbjct: 379 DKTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKINSASTISNFVTESLKEDISSSETSRS 438
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
+SW E VGIS + + L EQIN T D SDYLWY+ S+ + G + L+IESLG
Sbjct: 439 KWSWINEPVGISKDDILSKTGLLEQINITADRSDYLWYSLSVDLKDDPGSQTVLHIESLG 498
Query: 481 HAALVFVNKKL 491
HA F+N KL
Sbjct: 499 HALHAFINGKL 509
Score = 286 bits (733), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 152/319 (47%), Positives = 203/319 (63%), Gaps = 16/319 (5%)
Query: 510 IELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VILIDLKNGKR--DLSSGEWIYQV 566
I + G N +D+LS+ VGLQNYGA+FD GAG+ VIL LKNG + DLSS +W YQV
Sbjct: 1950 ITVLSGKNKIDLLSLTVGLQNYGAFFDTWGAGITGPVILKGLKNGNKTLDLSSRKWTYQV 2009
Query: 567 GVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQA 626
G++GE +GL S +S W +T P + LIWYKT F AP G P+ ++ MGKG+A
Sbjct: 2010 GLKGEDLGL---SSGSSGAWNSKTTFPKKQPLIWYKTNFDAPSGSNPVVIDFTGMGKGEA 2066
Query: 627 WVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGEN 686
WVNGQSIGRYW Y+A + CT C+YRG + +KC +CG+P+QTLYH+P++++ P N
Sbjct: 2067 WVNGQSIGRYWPTYVASNVDCTDSCNYRGPFTQTKCHMNCGKPSQTLYHVPQSFLKPNGN 2126
Query: 687 LLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNL---GVVSSSPQVRLAC 743
LV+ EE GGDP++IS TK +C+ VS++ PP +D W + G V P + L C
Sbjct: 2127 TLVLFEESGGDPTQISFATKQIGSVCAHVSDSHPPQIDLWNQDTESGGKV--GPALLLNC 2184
Query: 744 -ERGWHIAAINFASYGIPEGNCGSFRPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGV 801
I++I FASYG P G CG+F G C + L IV+KAC+G CSI VS+ G
Sbjct: 2185 PNHNQVISSIKFASYGTPLGTCGNFYRGRCSSNKTLSIVKKACIGSRSCSIGVSTDTFG- 2243
Query: 802 SAGACPGLLKALAVEAHCS 820
C G+ K+LAVEA C+
Sbjct: 2244 --DPCKGVPKSLAVEATCA 2260
>gi|26451843|dbj|BAC43014.1| unknown protein [Arabidopsis thaliana]
gi|29029060|gb|AAO64909.1| At1g77410 [Arabidopsis thaliana]
Length = 820
Score = 690 bits (1780), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/812 (46%), Positives = 494/812 (60%), Gaps = 49/812 (6%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
ANVTYD R+L+IDG+ ++L SGSIHY RSTP++WP LI K+K GG++V++TYVFWN HEP
Sbjct: 23 ANVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVVDTYVFWNVHEP 82
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
+GQ+ F G D+V+F+K V+ GL++ LRIGP+ EW+YGG P WLH + GI FRT N
Sbjct: 83 QQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGIVFRTDN 142
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK MKR+ I+ LMK ENL+ASQGGPIIL+Q+ENEYG V A+ G+ YVKW A
Sbjct: 143 EPFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVGRAFRQEGKSYVKWTAK 202
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFG 240
AV L+T VPWVMC+Q+DAPDP++N CNG C + F PNSP+KP +WTEN++ ++ ++G
Sbjct: 203 LAVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSFYQTYG 262
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
R ED+AF VA F G+F NYYMY GGTNFGR A V TSY AP+DEYG
Sbjct: 263 EEPLIRSAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVITSYYDQAPLDEYG 321
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+RQPKWGHL+ELH A+KLCEE L+S T LG A ++ K +N CAA L N D
Sbjct: 322 LLRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFGKKANLCAAILVNQD-K 380
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
++ V F + Y L SVS+LPDCKNV FNTAKV +Q N + + + L +
Sbjct: 381 CESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYN------TRTRKARQNLSSPQ 434
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
+ + E V S L E +NTT+DTSDYLW T +G L + LG
Sbjct: 435 MWEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQTTRFQ--QSEGAPSVLKVNHLG 492
Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
HA FVN + + +G FL+ K + LN G N L +LS+MVGL N GA +
Sbjct: 493 HALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLALLSVMVGLPNSGAHLERRVV 552
Query: 541 GLFSVILIDLKNGKRDL--SSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
G SV + NG+ L ++ W YQVG++GE + + WKQ ++ L
Sbjct: 553 GSRSV---KIWNGRYQLYFNNYSWGYQVGLKGEKFHVYTEDGSAKVQWKQYRD-SKSQPL 608
Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
WYK +F PEG+ P+ALNL SMGKG+AWVNGQSIGRYW ++
Sbjct: 609 TWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIGRYWVSF------------------ 650
Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI-HEELGGDPSKISLLTKTGQHICSFVSE 717
+ G P+Q YHIPR+++ P NLLVI EE G+P I++ T + +C VS
Sbjct: 651 ----HTYKGNPSQIWYHIPRSFLKPNSNLLVILEEEREGNPLGITIDTVSVTEVCGHVSN 706
Query: 718 ADPPPVDS------WKPNLGV-VSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
+P PV S + NL P+V+L C G I+ I FAS+G P G+CGS+ G
Sbjct: 707 TNPHPVISPRKKGLNRKNLTYRYDRKPKVQLQCPTGRKISKILFASFGTPNGSCGSYSIG 766
Query: 771 ACHM-DVLPIVQKACVGQIECSIPVSSAYLGV 801
+CH + L +VQKAC+ + CS+PV S GV
Sbjct: 767 SCHSPNSLAVVQKACLKKSRCSVPVWSKTFGV 798
>gi|115437264|ref|NP_001043252.1| Os01g0533400 [Oryza sativa Japonica Group]
gi|75158475|sp|Q8RUV9.1|BGAL1_ORYSJ RecName: Full=Beta-galactosidase 1; Short=Lactase 1; Flags:
Precursor
gi|20146357|dbj|BAB89138.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|20161405|dbj|BAB90329.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113532783|dbj|BAF05166.1| Os01g0533400 [Oryza sativa Japonica Group]
gi|215767421|dbj|BAG99649.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 827
Score = 687 bits (1774), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/834 (43%), Positives = 497/834 (59%), Gaps = 52/834 (6%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
+V+YD R+LVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL+ IETY+FWN HEP
Sbjct: 29 TSVSYDDRSLVIDGQRRIILSGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYIFWNGHEP 88
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
R QY FEG +D+VRF K +Q AG++ LRIGPY C EWNYGG P WL IPG+QFR N
Sbjct: 89 HRRQYNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHN 148
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWA 180
PF+ EM+ F I++ MK +FA QGGPIILAQ+ENEYGN+ + Y+ W
Sbjct: 149 EPFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIHWC 208
Query: 181 ADTAVNLNTSVPWVMCQQ-EDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSF 239
AD A N VPW+MCQQ +D P ++NTCNGFYC + PN P +WTEN++GWF ++
Sbjct: 209 ADMANKQNVGVPWIMCQQDDDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAW 268
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
R ED+AFAVA FF+ G+ QNYYMY GGTNFGRT+GGP + TSYDYDAP+DEY
Sbjct: 269 DKPDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEY 328
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
G +RQPK+GHL+ELH +K E+ L+ + G + Y S+ A F+ N
Sbjct: 329 GNLRQPKYGHLKELHSVLKSMEKTLVHGEYFDTNYGDNITVTKYTLDSSS-ACFINNRFD 387
Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
D NVT +G + LPAWSVSILPDCK V FN+AK+ +Q + ++ N E S
Sbjct: 388 DKDVNVTLDGATHLLPAWSVSILPDCKTVAFNSAKIKTQTS----VMVKKPNTAEQEQES 443
Query: 420 SAFSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNI 476
+SW E + +F + +L EQI T+ D SDYLWY S++ G+G L +
Sbjct: 444 LKWSWMPENLSPFMTDEKGNFRKNELLEQIVTSTDQSDYLWYRTSLN-HKGEGS-YKLYV 501
Query: 477 ESLGHAALVFVNKKLVAFGY-GNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ GH FVN KL+ + + DF F + ++L++G N + +LS VGL+NYG F
Sbjct: 502 NTTGHELYAFVNGKLIGKNHSADGDFV-FQLESPVKLHDGKNYISLLSATVGLKNYGPSF 560
Query: 536 DVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWK-QGS 590
+ G+ V LID DLS+ W Y+ G+ EY I LDK W
Sbjct: 561 EKMPTGIVGGPVKLIDSNGTAIDLSNSSWSYKAGLASEYRQIHLDKPGYK----WNGNNG 616
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
T+P+N+ WYK TF AP G+ + ++L + KG AWVNG ++GRYW +Y A +
Sbjct: 617 TIPINRPFTWYKATFEAPSGEDAVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMAGCHR 676
Query: 651 CDYRGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLT 705
CDYRG++ D ++C CG+P+Q YH+PR+++ GE N L++ EE GGDPS ++L T
Sbjct: 677 CDYRGAFQAEGDGTRCLTGCGEPSQRYYHVPRSFLAAGEPNTLLLFEEAGGDPSGVALRT 736
Query: 706 KTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCG 765
+C+ D V L+C G +++++ AS+G+ G CG
Sbjct: 737 VVPGAVCTSGEAGDA------------------VTLSCGGGHAVSSVDVASFGVGRGRCG 778
Query: 766 SFRPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
+ G ACVG+ C++ ++ A+ G AG G+ L V+A C
Sbjct: 779 GYEGGCESKAAYEAFTAACVGKESCTVEITGAFAG--AGCLSGV---LTVQATC 827
>gi|75169194|sp|Q9C6W4.1|BGL15_ARATH RecName: Full=Beta-galactosidase 15; Short=Lactase 15; Flags:
Precursor
gi|12597826|gb|AAG60136.1|AC074360_1 hypothetical protein [Arabidopsis thaliana]
Length = 779
Score = 687 bits (1774), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/832 (43%), Positives = 503/832 (60%), Gaps = 91/832 (10%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V++D RA+ IDG RRVL SGSIHYPRST E+WP+LI+K KEG L+ IETYVFWN HEP R
Sbjct: 22 VSHDGRAITIDGHRRVLLSGSIHYPRSTTEMWPDLIKKGKEGSLDAIETYVFWNAHEPTR 81
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
QY F G DL+RF+KT+Q G++ LRIGPY CAEWNYGGFPVWLH +PG++FRTTN
Sbjct: 82 RQYDFSGNLDLIRFLKTIQNEGMYGVLRIGPYVCAEWNYGGFPVWLHNMPGMEFRTTNTA 141
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
F EM+ F I++++K+E LFASQGGPIILAQ+ENEYGNV +YG G+ Y++W A+ A
Sbjct: 142 FMNEMQNFTTMIVEMVKKEKLFASQGGPIILAQIENEYGNVIGSYGEAGKAYIQWCANMA 201
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
+L+ VPW+MCQQ+DAP P++NTCNG+YCD F+PN+P+ P MWTEN++GW+ ++G P
Sbjct: 202 NSLDVGVPWIMCQQDDAPQPMLNTCNGYYCDNFSPNNPNTPKMWTENWTGWYKNWGGKDP 261
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
R ED+AFAVARFF+ GTFQNYYMY GGTNF RTAGGP + T+YDYDAP+DE+G + Q
Sbjct: 262 HRTTEDVAFAVARFFQKEGTFQNYYMYHGGTNFDRTAGGPYITTTYDYDAPLDEFGNLNQ 321
Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
PK+GHL++LH + E+ L + + G + A +Y ++ + F+ N + +SDA
Sbjct: 322 PKYGHLKQLHDVLHAMEKTLTYGNISTVDFGNLVTATVY-QTEEGSSCFIGNVNETSDAK 380
Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
+ F G Y +PAWSVSILPDCK +NTAK+ +Q + ++ N E ++ +SW
Sbjct: 381 INFQGTSYDVPAWSVSILPDCKTETYNTAKINTQTS----VMVKKANEAENEPSTLKWSW 436
Query: 425 YEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM---PGQGKEVFLNIES 478
E + + G L +Q + D SDYLWY ++++ P GK + L I S
Sbjct: 437 RPENIDSVLLKGKGESTMRQLFDQKVVSNDESDYLWYMTTVNLKEQDPVLGKNMSLRINS 496
Query: 479 LGHAALVFVNKKLVAFGYGNHDFAN----FLINKKIELNEGINTLDILSMMVGLQNYGAW 534
H FVN + + GN+ N ++ + + N G N + +LS+ VGL NYGA+
Sbjct: 497 TAHVLHAFVNGQHI----GNYRVENGKFHYVFEQDAKFNPGANVITLLSITVGLPNYGAF 552
Query: 535 FDVAGAGLFSVILIDLKNGK----RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
F+ AG+ + I +NG +DLS+ +W Y+ G+ G N F +
Sbjct: 553 FENFSAGITGPVFIIGRNGDETIVKDLSTHKWSYKTGLSG---------FENQLFSSES- 602
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
+T+ AP G P+ ++L +GKG AW+NG +IGRYW A+L+ GC+ +
Sbjct: 603 -----------PSTWSAPLGSEPVVVDLLGLGKGTAWINGNNIGRYWPAFLSDIDGCSAE 651
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHP-GENLLVIHEELGGDPSKISLLTKTGQ 709
YH+PR++++ G+N LV+ EE+GG+PS ++ T
Sbjct: 652 -----------------------YHVPRSFLNSEGDNTLVLFEEIGGNPSLVNFQTIGVG 688
Query: 710 HICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
+C+ V E + + L+C G I+AI FAS+G P G+CGSF
Sbjct: 689 SVCANVYEKN------------------VLELSC-NGKPISAIKFASFGNPGGDCGSFEK 729
Query: 770 GACHM--DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G C + I+ + CVG+ +CSI VS G A C L K LAVEA C
Sbjct: 730 GTCEASNNAAAILTQECVGKEKCSIDVSEDKFG--AAECGALAKRLAVEAIC 779
>gi|183604889|gb|ACC64531.1| beta-galactosidase 6 [Oryza sativa Indica Group]
Length = 811
Score = 686 bits (1771), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/826 (45%), Positives = 502/826 (60%), Gaps = 47/826 (5%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
L +TYD RALV+ G RR+ SG +HY RSTPE+WP+LI K+K GGL+VI+TYVFWN H
Sbjct: 25 LGREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVH 84
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EPI+GQY FEGR+DLV+F++ +Q GL++ LRIGP+ AEW YGGFP WLH +P I FR+
Sbjct: 85 EPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRS 144
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK+ M+ F+ KI+ +MK E L+ QGGPII++Q+ENEY +E A+G G YV+WA
Sbjct: 145 DNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWA 204
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLS 238
A AV L T VPW+MC+Q DAPDP+INTCNG C + F PNSP+KP +WTEN++ +
Sbjct: 205 AAMAVGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRYPI 264
Query: 239 FGYAVPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPID 297
+G R ED+AFAVA + G+F +YYMY GGTNFGR A V TSY AP+D
Sbjct: 265 YGNDTKLRDPEDIAFAVALYIARKKGSFVSYYMYHGGTNFGRFAAS-YVTTSYYDGAPLD 323
Query: 298 EYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANY 357
EYG I QP WGHLRELH A+K E L+ ++ LG + EAH++ ++ C AFL N+
Sbjct: 324 EYGLIWQPTWGHLRELHCAVKQSSEPLLFGSYSNFSLGQQQEAHVF-ETDFKCVAFLVNF 382
Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKN-VNELL 416
D + V F L S+S+L DC+NVVF TAKV +Q + Q N +N
Sbjct: 383 DQHNTPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNAQHGSRTANAVQSLNDINNWK 442
Query: 417 LASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVF-LN 475
K +GN+ F EQ+ TTKD +DYLWY S G ++ L
Sbjct: 443 AFIEPVPQDLSKSTYTGNQLF------EQLPTTKDETDYLWYIVSYKNRASDGNQIARLY 496
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFA-NFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
++SL H FVN + V +G+HD N ++N + L EG NT+ +LS+MVG + GA+
Sbjct: 497 VKSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEGDNTISLLSVMVGSPDSGAY 556
Query: 535 FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+ G+ +V + + L++ W YQVG+ GE + NS W + L +
Sbjct: 557 MERRTFGIQTVGIQQGQQPMHLLNNDLWGYQVGLFGEKDSIYTQEGPNSVRWMDINNL-I 615
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
L WYKTTF P G + LNL SMGKG+ WVNG+SIGRYW ++ APS
Sbjct: 616 YHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEVWVNGESIGRYWVSFKAPS---------- 665
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
GQP+Q+LYHIPR ++ P +NLLV+ EE+GGDP +I++ T + +C
Sbjct: 666 ------------GQPSQSLYHIPRGFLTPKDNLLVLVEEMGGDPLQITVNTMSVTTVCGN 713
Query: 715 VSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM 774
V E PP+ S P+VR+ C+ G I++I FASYG P G+C SFR G+CH
Sbjct: 714 VDEFSVPPLQS-------RGKVPKVRIWCQGGKRISSIEFASYGNPVGDCRSFRIGSCHA 766
Query: 775 DVL-PIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
+ +V+++C+G+ CSIPV +A G CPG+ K+L V A C
Sbjct: 767 ESSESVVKQSCIGRRGCSIPVMAAKFG--GDPCPGIQKSLLVVADC 810
>gi|356541034|ref|XP_003538988.1| PREDICTED: beta-galactosidase 13-like, partial [Glycine max]
Length = 806
Score = 685 bits (1768), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/831 (41%), Positives = 497/831 (59%), Gaps = 50/831 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD R+L+I+G+R +L SGSIHYPRSTPE W ++ K+++GG+ V++TYVFWN HE +
Sbjct: 9 VTYDGRSLIINGRRELLFSGSIHYPRSTPEEWAGILDKARQGGINVVQTYVFWNIHETEK 68
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G+Y E ++D ++F+K +Q+ G+++ LR+GP+ AEWN+GG P WL +P I FR+ N P
Sbjct: 69 GKYSIEPQYDYIKFIKLIQKKGMYVTLRVGPFIQAEWNHGGLPYWLREVPEIIFRSNNEP 128
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK+ MK++++ +I +K NLFA QGGPIILAQ+ENEY +++ A+ G+ YV+WAA A
Sbjct: 129 FKKHMKKYVSTVIKTVKDANLFAPQGGPIILAQIENEYNHIQRAFREEGDNYVQWAAKMA 188
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
V+L+ VPW+MC+Q DAPDP+IN CNG +C D F+ PN P KP +WTEN++ + FG
Sbjct: 189 VSLDIGVPWIMCKQTDAPDPVINACNGRHCGDTFSGPNKPYKPAIWTENWTAQYRVFGDP 248
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
R ED+AF+VARFF G+ NYYMY GGTNFGRT+ T Y +AP+DEYG
Sbjct: 249 PSQRSAEDIAFSVARFFSKNGSLVNYYMYHGGTNFGRTSSA-FTTTRYYDEAPLDEYGMQ 307
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYDSSS 361
R+PKW HLR++H+A+ LC+ L + T K+ E ++ K SN CAAF+ N +
Sbjct: 308 REPKWSHLRDVHRALSLCKRALFNGASTVTKMSQHHEVIVFEKPGSNLCAAFITNNHTKV 367
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
++F G Y++P S+SILPDCK VVFNT + SQ ++ + + +A++
Sbjct: 368 PTTISFRGTDYYMPPRSISILPDCKTVVFNTQCIASQHSSRNFKRS---------MAAND 418
Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKE-----VFL 474
W Y E + + + E + KDTSDY WYT S+ + P + L
Sbjct: 419 HKWEVYSETIPTTKQIPTHEKNPIELYSLLKDTSDYAWYTTSVELRPEDLPKKNDIPTIL 478
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
I SLGH+ L FVN + + +G+H+ F K + L G+N + IL+ VGL + GA+
Sbjct: 479 RIMSLGHSLLAFVNGEFIGSNHGSHEEKGFEFQKPVTLKVGVNQIAILASTVGLPDSGAY 538
Query: 535 FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+ AG S+ ++ L +GK DL+S W ++VG++GE +G+ + WK+
Sbjct: 539 MEHRFAGPKSIFILGLNSGKMDLTSNGWGHEVGIKGEKLGIFTEEGSKKVQWKEAKG--P 596
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
++ WYKT F PEG P+A+ + MGKG W+NG+SIGR+W +YL+P
Sbjct: 597 GPAVSWYKTNFATPEGTDPVAIRMTGMGKGMVWINGKSIGRHWMSYLSP----------- 645
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
GQP Q+ YHIPRT+ +P +NLLV+ EE +P K+ +LT ICSF
Sbjct: 646 -----------LGQPTQSEYHIPRTYFNPKDNLLVVFEEEIANPEKVEILTVNRDTICSF 694
Query: 715 VSEADPPPVDSW-----KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
V+E PP V SW K V P L C I A+ FAS+G P G CG+F
Sbjct: 695 VTENHPPNVKSWAIKSEKFQAVVNDLVPSASLKCPHQRTIKAVEFASFGDPAGACGAFAL 754
Query: 770 GACHMDVLP-IVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G C+ + IV+K C+G+ C +P+ ACP + KALA++ C
Sbjct: 755 GKCNAPAIKQIVEKQCLGKASCLVPIDKDAFTKGQDACPNVTKALAIQVRC 805
>gi|357455519|ref|XP_003598040.1| Beta-galactosidase [Medicago truncatula]
gi|355487088|gb|AES68291.1| Beta-galactosidase [Medicago truncatula]
Length = 812
Score = 684 bits (1766), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/831 (42%), Positives = 490/831 (58%), Gaps = 69/831 (8%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ V YD A++++G+R+++ SG+IHYPRST ++WP+LI K+K+G L+ IETY+FW+ HE
Sbjct: 23 ATTVEYDSSAIILNGERKLIISGAIHYPRSTSQMWPDLIMKAKDGDLDAIETYIFWDLHE 82
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P+R +Y F G D ++F+K QE GL++ LRIGPY CAEWNYGGFP+WLH +PGIQ RT
Sbjct: 83 PVRRKYDFSGNLDFIKFLKIAQEQGLYVVLRIGPYVCAEWNYGGFPMWLHNMPGIQLRTD 142
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N FKEEMK F KI+ + K+ LFA QGGPIILAQ+ENEYG+V YG G Y+KW A
Sbjct: 143 NAVFKEEMKIFTTKIVTMCKEAGLFAPQGGPIILAQIENEYGDVISHYGEAGNSYIKWCA 202
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
+ A+ N VPW+MC+Q++AP II+TCNG+YCD F PN+P P ++TEN+ GWF +G
Sbjct: 203 EMALAQNIGVPWIMCKQKNAPATIIDTCNGYYCDTFKPNNPKSPKIFTENWVGWFQKWGE 262
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
P R ED AF+VARFF+ GG QNYY+Y GGTNFGRTAGGP + T+YDYDAP+DEYG
Sbjct: 263 RRPHRTAEDSAFSVARFFQNGGALQNYYLYHGGTNFGRTAGGPFIITTYDYDAPLDEYGN 322
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY-HKSSNDCAAFLANYDSS 360
+ +PK+GHL+ LH AIKL E+ L + T + G L Y +K + FL+N +S
Sbjct: 323 LIEPKYGHLKRLHAAIKLGEKVLTNGTATWESHGDSLWMTTYTNKGTGQKFCFLSNSHTS 382
Query: 361 SDANVTFNGN-VYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
DA V + Y++PAWS+S+L DC V+NTAK +Q N K +++ L S
Sbjct: 383 KDAEVDLQQDGKYYVPAWSMSLLQDCNKEVYNTAKTEAQTN------IYMKQLDQKLGNS 436
Query: 420 SAFSWYEEKV--GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMP----GQGKEVF 473
+SW + + G +F L +Q + T SDYLWY + V G+ K
Sbjct: 437 PEWSWTSDPMEDTFQGKGTFTASQLLDQKSVTVGASDYLWYMTEVVVNDTNTWGKAK--- 493
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
+ + + GH +F+N L +G F+ I LN+G N + +LS+ VG NYGA
Sbjct: 494 VQVNTTGHILYLFINGFLTGTQHGTVSQPGFIHEGNISLNQGTNIISLLSVTVGHANYGA 553
Query: 534 WFDVAGAGL----FSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG 589
+FD+ G+ + I+ N DLS W Y+VG+ G WK
Sbjct: 554 FFDMQETGIVGGPVKLFSIENPNNVLDLSKSTWSYKVGINGMTKKFYDPKTTIGVQWKT- 612
Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
+ + + + WYKTTF P+G P+ L+L + KG+AWVNGQSIGRYW A LA + GC+
Sbjct: 613 NNVSIGVPMTWYKTTFKTPDGTNPVVLDLIGLQKGEAWVNGQSIGRYWPAMLAENKGCSD 672
Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
CDYRG Y+A KC CG+P+Q YH+PR++++ N LV+ EE+G D + +
Sbjct: 673 TCDYRGEYNADKCLSGCGEPSQRFYHVPRSFLNNDVNTLVLFEEMGFDATPFN------- 725
Query: 710 HICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
G ++ I FASYG PEG+CGSF+
Sbjct: 726 ------------------------------------GKTMSEIQFASYGDPEGSCGSFKI 749
Query: 770 GACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G +V+KAC+G+ CSI V+S+ + G G LAV+ C
Sbjct: 750 GEWESRYSKTVVEKACIGKQSCSINVTSSTFRLKKGGTNG---QLAVQLSC 797
>gi|125574401|gb|EAZ15685.1| hypothetical protein OsJ_31098 [Oryza sativa Japonica Group]
Length = 824
Score = 684 bits (1765), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/832 (43%), Positives = 488/832 (58%), Gaps = 51/832 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V Y+ R+LVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL+ IETYVFWN HEP R
Sbjct: 27 VAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 86
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
QY FEG +D++RF K +Q AGL+ LRIGPY C EWNYGG P WL IP +QFR N P
Sbjct: 87 RQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQFRMHNAP 146
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAAD 182
F+ EM+ F II+ MK N+FA QGGPIILAQ+ENEYGNV + Y+ W AD
Sbjct: 147 FENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEYIHWCAD 206
Query: 183 TAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A N VPW+MCQQ+ D P ++NTCNGFYC + PN P +WTEN++GWF ++
Sbjct: 207 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 266
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
R ED+AFAVA FF+ G+ QNYYMY GGTNFGRT+GGP + TSYDYDAP+DEYG
Sbjct: 267 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 326
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPK+GHL++LH IK E+ L+ + + Y S A F+ N + +
Sbjct: 327 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDANYSDNVTVTKYTLGSTS-ACFINNRNDNK 385
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
D NVT +GN + LPAWSVSILPDCK V FN+AK+ +Q ++ N+ E S
Sbjct: 386 DLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTT----IMVKKANMVEKEPESLK 441
Query: 422 FSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
+SW E + S+ + +L EQI T+ D SDYLWY S+ +F+N +
Sbjct: 442 WSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDHKGEASYTLFVN--T 499
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
GH FVN LV + + F + ++L++G N + +LS +GL+NYG F+
Sbjct: 500 TGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKNYGPLFEKM 559
Query: 539 GAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQGS-TLP 593
AG+ V LID DLS+ W Y+ G+ GEY I LDK W + T+P
Sbjct: 560 PAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKPGYR----WDNNNGTVP 615
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+N+ WYKTTF AP G+ + ++L + KG AWVNG ++GRYW +Y A G CDY
Sbjct: 616 INRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGGCHHCDY 675
Query: 654 RGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLTKTG 708
RG + D KC CG+P+Q YH+PR+++ GE N L++ EE GGDPS++ +
Sbjct: 676 RGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPSQVIFHSVVA 735
Query: 709 QHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGNCGSF 767
+C D + L+C + I+ I+ S+G+ G CG++
Sbjct: 736 GSVCVSAEVGDA------------------ITLSCGQHSKTISTIDVTSFGVARGQCGAY 777
Query: 768 RPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G +AC+G+ C++ + +A G GL L V+A C
Sbjct: 778 EGGCESKAAYKAFTEACLGKESCTVQIINALTGSG-----GLSGVLTVQASC 824
>gi|115481546|ref|NP_001064366.1| Os10g0330600 [Oryza sativa Japonica Group]
gi|122249227|sp|Q7G3T8.1|BGL13_ORYSJ RecName: Full=Beta-galactosidase 13; Short=Lactase 13; Flags:
Precursor
gi|110288895|gb|AAP53027.2| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113638975|dbj|BAF26280.1| Os10g0330600 [Oryza sativa Japonica Group]
Length = 828
Score = 682 bits (1761), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/832 (43%), Positives = 490/832 (58%), Gaps = 51/832 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V Y+ R+LVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL+ IETYVFWN HEP R
Sbjct: 31 VAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
QY FEG +D++RF K +Q AGL+ LRIGPY C EWNYGG P WL IP +QFR N P
Sbjct: 91 RQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQFRMHNAP 150
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAAD 182
F+ EM+ F II+ MK N+FA QGGPIILAQ+ENEYGNV + Y+ W AD
Sbjct: 151 FENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEYIHWCAD 210
Query: 183 TAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A N VPW+MCQQ+ D P ++NTCNGFYC + PN P +WTEN++GWF ++
Sbjct: 211 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 270
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
R ED+AFAVA FF+ G+ QNYYMY GGTNFGRT+GGP + TSYDYDAP+DEYG
Sbjct: 271 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 330
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPK+GHL++LH IK E+ L+ + + Y S A F+ N + +
Sbjct: 331 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDANYSDNVTVTKYTLGSTS-ACFINNRNDNK 389
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
D NVT +GN + LPAWSVSILPDCK V FN+AK+ +Q ++ N+ E S
Sbjct: 390 DLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTT----IMVKKANMVEKEPESLK 445
Query: 422 FSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
+SW E + S+ + +L EQI T+ D SDYLWY S+ +F+N +
Sbjct: 446 WSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDHKGEASYTLFVN--T 503
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
GH FVN LV + + F + ++L++G N + +LS +GL+NYG F+
Sbjct: 504 TGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKNYGPLFEKM 563
Query: 539 GAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQGS-TLP 593
AG+ V LID DLS+ W Y+ G+ GEY I LDK W + T+P
Sbjct: 564 PAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKPGYR----WDNNNGTVP 619
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+N+ WYKTTF AP G+ + ++L + KG AWVNG ++GRYW +Y A G CDY
Sbjct: 620 INRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGGCHHCDY 679
Query: 654 RGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLTKTG 708
RG + D KC CG+P+Q YH+PR+++ GE N L++ EE GGDPS++ +
Sbjct: 680 RGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPSQVIFHSVVA 739
Query: 709 QHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGNCGSF 767
+C D + L+C + I+ I+ S+G+ G CG++
Sbjct: 740 GSVCVSAEVGDA------------------ITLSCGQHSKTISTIDVTSFGVARGQCGAY 781
Query: 768 RPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G +AC+G+ C++ + +A G +G G+ L V+A C
Sbjct: 782 EGGCESKAAYKAFTEACLGKESCTVQIINALTG--SGCLSGV---LTVQASC 828
>gi|16905220|gb|AAL31090.1|AC091749_19 putative beta-galactosidase [Oryza sativa Japonica Group]
gi|22655745|gb|AAN04162.1| Putative beta-galactosidase [Oryza sativa Japonica Group]
Length = 824
Score = 682 bits (1760), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/832 (43%), Positives = 490/832 (58%), Gaps = 51/832 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V Y+ R+LVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL+ IETYVFWN HEP R
Sbjct: 27 VAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 86
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
QY FEG +D++RF K +Q AGL+ LRIGPY C EWNYGG P WL IP +QFR N P
Sbjct: 87 RQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQFRMHNAP 146
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAAD 182
F+ EM+ F II+ MK N+FA QGGPIILAQ+ENEYGNV + Y+ W AD
Sbjct: 147 FENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEYIHWCAD 206
Query: 183 TAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A N VPW+MCQQ+ D P ++NTCNGFYC + PN P +WTEN++GWF ++
Sbjct: 207 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 266
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
R ED+AFAVA FF+ G+ QNYYMY GGTNFGRT+GGP + TSYDYDAP+DEYG
Sbjct: 267 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 326
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPK+GHL++LH IK E+ L+ + + Y S A F+ N + +
Sbjct: 327 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDANYSDNVTVTKYTLGSTS-ACFINNRNDNK 385
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
D NVT +GN + LPAWSVSILPDCK V FN+AK+ +Q ++ N+ E S
Sbjct: 386 DLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTT----IMVKKANMVEKEPESLK 441
Query: 422 FSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
+SW E + S+ + +L EQI T+ D SDYLWY S+ +F+N +
Sbjct: 442 WSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDHKGEASYTLFVN--T 499
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
GH FVN LV + + F + ++L++G N + +LS +GL+NYG F+
Sbjct: 500 TGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKNYGPLFEKM 559
Query: 539 GAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQGS-TLP 593
AG+ V LID DLS+ W Y+ G+ GEY I LDK W + T+P
Sbjct: 560 PAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKPGYR----WDNNNGTVP 615
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+N+ WYKTTF AP G+ + ++L + KG AWVNG ++GRYW +Y A G CDY
Sbjct: 616 INRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGGCHHCDY 675
Query: 654 RGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLTKTG 708
RG + D KC CG+P+Q YH+PR+++ GE N L++ EE GGDPS++ +
Sbjct: 676 RGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPSQVIFHSVVA 735
Query: 709 QHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGNCGSF 767
+C D + L+C + I+ I+ S+G+ G CG++
Sbjct: 736 GSVCVSAEVGDA------------------ITLSCGQHSKTISTIDVTSFGVARGQCGAY 777
Query: 768 RPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G +AC+G+ C++ + +A G +G G+ L V+A C
Sbjct: 778 EGGCESKAAYKAFTEACLGKESCTVQIINALTG--SGCLSGV---LTVQASC 824
>gi|320170852|gb|EFW47751.1| beta-galactosidase [Capsaspora owczarzaki ATCC 30864]
Length = 851
Score = 681 bits (1758), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/839 (42%), Positives = 498/839 (59%), Gaps = 54/839 (6%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ NVTYD RAL++DG+RR+L +G IHYPRSTPE+WPEL ++K GL+VI+TY+FW+ ++
Sbjct: 47 AMNVTYDSRALLLDGQRRLLIAGCIHYPRSTPEMWPELFARAKANGLDVIQTYLFWDVNQ 106
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G++ RFD VRF+K Q+AGL ++ RIGPY CAEWNYGGFP WL I GI FR
Sbjct: 107 PTPGEFVMTDRFDYVRFIKLAQQAGLMVNFRIGPYVCAEWNYGGFPAWLRQISGIVFRDN 166
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
+ P+ + + ++ K + ++K L A+ GGP+IL Q+ENEYGN+E +Y GG YV+W
Sbjct: 167 DKPWLDVVGPYITKTVQVLKDNKLLAADGGPVILLQIENEYGNIEDSYA-GGPAYVQWCG 225
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A +LN W+MCQQ+DAP I TCNGFYCD + P+ +P+MWTEN+ GWF ++G
Sbjct: 226 QLAASLNAGAQWIMCQQDDAPANTIATCNGFYCDNYVPHK-GQPMMWTENWPGWFQTWGQ 284
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
P RP +D+AFA ARF+ GGT+ +YYMY GGTNFGRTAGGP + TSYDYD +DEYG
Sbjct: 285 PSPHRPAQDVAFAAARFYAKGGTYMSYYMYHGGTNFGRTAGGPGITTSYDYDVALDEYGM 344
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+PK+ HL LH + E ++S + P LG LEAH+++ SS C AFL+N DSS
Sbjct: 345 PSEPKYSHLGSLHAVLHANEHIIMSMNVPAPISLGKNLEAHVFNSSSG-CVAFLSNIDSS 403
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN----------------NGDH 404
DA V FNG + LPAWSVSIL +C ++NTA V + N DH
Sbjct: 404 VDAEVQFNGRTFELPAWSVSILHNCAFAIYNTAAVSAPLNARRMTPLVVHEDAVSDAADH 463
Query: 405 PFAQQKNV-NELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIH 463
+ K E + A S F+ Y E +G + EQINTT DT+DYLWYT + +
Sbjct: 464 RRSLSKGEGQERVGAFSTFASYAETIGRRAEEAVYFTSPQEQINTTNDTTDYLWYTTTYN 523
Query: 464 VMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILS 523
+ L+I ++ V+VN++ V + +NK + L G N +D+LS
Sbjct: 524 SASATSQ--VLSISNVNDVVYVYVNRQFVTMSWSGS------VNKAVPLMAGTNVIDVLS 575
Query: 524 MMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANS 583
GLQNYG + + G+ + K G DL+ W +QVG+ GE +G+ A++
Sbjct: 576 TTFGLQNYGTFLEQVTRGIQGTV----KLGSTDLTQNGWWHQVGLLGEELGIFLPQNASN 631
Query: 584 SFWKQGSTLPVNKSLIWYKTTFLAPE-GKGPLALNLASMGKGQAWVNGQSIGRYWSAYLA 642
W +T N+ L WY+++F P+ + PLAL++ MGKG WVNG ++GRYW + +A
Sbjct: 632 VPWATPAT--TNRGLTWYRSSFDLPQSSQAPLALDMTGMGKGFVWVNGHNLGRYWPSRIA 689
Query: 643 PSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKIS 702
S C CDYRG+YD S+C++ C P+Q YH+PR W+ P NL+V+ EE+GG+P+ IS
Sbjct: 690 DSMAC-DDCDYRGAYDDSRCRQGCNIPSQRYYHVPREWLQPTNNLIVMLEEIGGNPALIS 748
Query: 703 LLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEG 762
L+ + C V E P +L VV L C I + FAS+G P G
Sbjct: 749 LVEREEDISCGAVGEDYP------ADDLSVV-------LGCGLHQTIRRVEFASFGTPVG 795
Query: 763 NCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
C F G+C+ + IV+ C+G+ C +PV+ + G CP K L V+ C+
Sbjct: 796 TCRQFSLGSCNAANSTAIVESLCLGRQACHVPVAINHFG---DPCPDTTKRLFVQVSCA 851
>gi|356509519|ref|XP_003523495.1| PREDICTED: beta-galactosidase 13-like [Glycine max]
Length = 844
Score = 681 bits (1758), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/837 (42%), Positives = 501/837 (59%), Gaps = 53/837 (6%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ NVTYD ++L I+G+R +L SGS+HY RSTP++WP+++ K++ GGL VI+TYVFWN HE
Sbjct: 43 ARNVTYDGKSLFINGRREILFSGSVHYTRSTPDMWPDILDKARRGGLNVIQTYVFWNAHE 102
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G++ F+G +DLV+F++ VQ G+F+ LR+GP+ AEWN+GG P WL +PGI FR+
Sbjct: 103 PEPGKFNFQGNYDLVKFIRLVQAKGMFVTLRVGPFIQAEWNHGGLPYWLREVPGIIFRSD 162
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N P+K MK F++KII +MK E LFA QGGPIILAQ+ENEY +++ AY G+ YV+WAA
Sbjct: 163 NEPYKFHMKAFVSKIIQMMKDEKLFAPQGGPIILAQIENEYNHIQLAYEEKGDSYVQWAA 222
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSF 239
+ AV + VPW+MC+Q DAPDP+IN CNG +C D F PN P KP +WTEN++ +
Sbjct: 223 NMAVATDIGVPWLMCKQRDAPDPVINACNGRHCGDTFAGPNKPYKPAIWTENWTAQYRVH 282
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYD-APIDE 298
G R ED+AF+VARFF G NYYMY GGTNFGRT+ + +T+ YD AP+DE
Sbjct: 283 GDPPSQRSAEDIAFSVARFFSKNGNLVNYYMYHGGTNFGRTSS--VFSTTRYYDEAPLDE 340
Query: 299 YGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANY 357
YG R+PKW HLR++HKA+ LC ++ P+ QKL E + + +N CAAF+ N
Sbjct: 341 YGLPREPKWSHLRDVHKALLLCRRAILGGVPSVQKLNHFHEVRTFERVGTNMCAAFITNN 400
Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLL 417
+ A + F G YFLP S+SILPDCK VVFNT +++SQ N+ ++ E
Sbjct: 401 HTMEPATINFRGTNYFLPPHSISILPDCKTVVFNTQQIVSQHNSRNY---------ERSP 451
Query: 418 ASSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GK 470
A++ F W + E + + P AE + KDT+DY WYT S + G
Sbjct: 452 AANNFHWEMFNEAIPTAKKMPINLPVPAELYSLLKDTTDYAWYTTSFELSQEDMSMKPGV 511
Query: 471 EVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQN 530
L + SLGH+ + FVN +V +G H+ +F + L G N + +LS VGL +
Sbjct: 512 LPVLRVMSLGHSMVAFVNGDIVGTAHGTHEEKSFEFQTPVLLRVGTNYISLLSSTVGLPD 571
Query: 531 YGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
GA+ + AG S+ ++ L G DL+ W ++VG++GE + + S WK
Sbjct: 572 SGAYMEHRYAGPKSINILGLNRGTLDLTRNGWGHRVGLKGEGKKVFSEEGSTSVKWKPLG 631
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
+P ++L WY+T F PEG GP+A+ ++ M KG WVNG +IGRYW +YL+P
Sbjct: 632 AVP--RALSWYRTRFGTPEGTGPVAIRMSGMAKGMVWVNGNNIGRYWMSYLSP------- 682
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
G+P Q+ YHIPR++++P +NLLVI EE P+++ +L
Sbjct: 683 ---------------LGKPTQSEYHIPRSFLNPQDNLLVIFEEEARVPAQVEILNVNRDT 727
Query: 711 ICSFVSEADPPPVDSWKPNLG-----VVSSSPQVRLACERGWHIAAINFASYGIPEGNCG 765
ICS V E DP V+SW G V S +AC G I A+ FAS+G P G CG
Sbjct: 728 ICSVVGERDPANVNSWVSRRGNFHPVVKSVGAAASMACATGKRIVAVEFASFGNPSGYCG 787
Query: 766 SFRPGACHMDVLP-IVQKACVGQIECSIPVSSAYLGVSA-GACPGLLKALAVEAHCS 820
F G+C+ IV++ C+GQ C++ + A + ACP L+K LAV+ C+
Sbjct: 788 DFAMGSCNAAASKQIVERECLGQEACTLALDRAVFNNNGVDACPDLVKQLAVQVRCA 844
>gi|156106159|gb|ABU49386.1| beta-galactosidase 15 [Oryza sativa Indica Group]
Length = 828
Score = 681 bits (1757), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/832 (43%), Positives = 491/832 (59%), Gaps = 51/832 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTY+ R+LVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL+ IETYVFWN HEP R
Sbjct: 31 VTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
QY F G +D+VRF K +Q AGL+ LRIGPY C EWNYGG P WL IPG+QFR N P
Sbjct: 91 RQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNAP 150
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAAD 182
F+ EM+ F I++ MK N+FA QGGPIILAQ+ENEYGN+ + Y+ W AD
Sbjct: 151 FENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEYIHWCAD 210
Query: 183 TAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A N VPW+MCQQ+ D P ++NTCNGFYC + PN P +WTEN++GWF ++
Sbjct: 211 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 270
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
R ED+AFAVA FF+ G+ QNYYMY GGTNFGRT+GGP + TSYDYDAP+DEYG
Sbjct: 271 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 330
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPK+GHL++LH IK E+ L+ + + Y S A F+ N + +
Sbjct: 331 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDNVTVTKYTLGSTS-ACFINNRNDNK 389
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
D NVT +GN + LPAWSVSILPDCK V FN+AK+ +Q ++ N+ E +
Sbjct: 390 DLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTT----IMVKKANMVEKEPENLK 445
Query: 422 FSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
+SW E + S+ + +L EQI T+ D SDYLWY S+ +F+N +
Sbjct: 446 WSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDHKGEASYTLFVN--T 503
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
GH FVN LV + + F + ++L++G N + +LS +GL+NYG F+
Sbjct: 504 TGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKNYGPLFEKM 563
Query: 539 GAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQGS-TLP 593
AG+ V LID DLS+ W Y+ G+ GEY I LDK W + T+P
Sbjct: 564 PAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKPGYR----WDNNNGTVP 619
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+N+ WYKTTF AP G+ + ++L + KG AWVNG ++GRYW +Y A G CDY
Sbjct: 620 INRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGGCHHCDY 679
Query: 654 RGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLTKTG 708
RG + D KC CG+P+Q YH+PR+++ GE N L++ EE GGDPS++ +
Sbjct: 680 RGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPSQVIFHSVVA 739
Query: 709 QHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGNCGSF 767
+C D + L+C + I+ I+ S+G+ G CG++
Sbjct: 740 GSVCVSAEVGDA------------------ITLSCGQHSKTISTIDVTSFGVARGQCGAY 781
Query: 768 RPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G +AC+G+ C++ + +A G +G G+ L V+A C
Sbjct: 782 EGGCESKAAYKAFTEACLGKESCTVQIINALTG--SGCLSGV---LTVQASC 828
>gi|218184317|gb|EEC66744.1| hypothetical protein OsI_33101 [Oryza sativa Indica Group]
Length = 824
Score = 681 bits (1757), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/832 (43%), Positives = 490/832 (58%), Gaps = 51/832 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V Y+ R+LVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL+ IETYVFWN HEP R
Sbjct: 27 VAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 86
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
QY FEG +D++RF K +Q AGL+ LRIGPY C EWNYGG P WL IP +QFR N P
Sbjct: 87 RQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQFRMHNAP 146
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAAD 182
F+ EM+ F II+ MK N+FA QGGPIILAQ+ENEYGNV + Y+ W AD
Sbjct: 147 FENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEYIHWCAD 206
Query: 183 TAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A N VPW+MCQQ+ D P ++NTCNGFYC + PN P +WTEN++GWF ++
Sbjct: 207 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 266
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
R ED+AFAVA FF+ G+ QNYYMY GGTNFGRT+GGP + TSYDYDAP+DEYG
Sbjct: 267 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 326
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPK+GHL++LH IK E+ L+ + + Y S A F+ N + +
Sbjct: 327 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDNVTVTKYTLGSTS-ACFINNRNDNK 385
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
D NVT +GN + LPAWSVSILPDCK V FN+AK+ +Q ++ N+ E +
Sbjct: 386 DLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTT----IMVKKANMVEKEPENLK 441
Query: 422 FSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
+SW E + S+ + +L EQI T+ D SDYLWY S+ +F+N +
Sbjct: 442 WSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDHKGEASYTLFVN--T 499
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
GH FVN LV + + F + ++L++G N + +LS +GL+NYG F+
Sbjct: 500 TGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKNYGPLFEKM 559
Query: 539 GAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQGS-TLP 593
AG+ V LID DLS+ W Y+ G+ GEY I LDK W + T+P
Sbjct: 560 PAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKPGYR----WDNNNGTVP 615
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+N+ WYKTTF AP G+ + ++L + KG AWVNG ++GRYW +Y A G CDY
Sbjct: 616 INRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGGCHHCDY 675
Query: 654 RGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLTKTG 708
RG + D KC CG+P+Q YH+PR+++ GE N L++ EE GGDPS++ +
Sbjct: 676 RGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPSQVIFHSVVA 735
Query: 709 QHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGNCGSF 767
+C D + L+C + I+ I+ S+G+ G CG++
Sbjct: 736 GSVCVSAEVGDA------------------ITLSCGQHSKTISTIDVTSFGVARGQCGAY 777
Query: 768 RPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G +AC+G+ C++ + +A G +G G+ L V+A C
Sbjct: 778 EGGCESKAAYKAFTEACLGKESCTVQIINALTG--SGCLSGV---LTVQASC 824
>gi|357473809|ref|XP_003607189.1| Beta-galactosidase [Medicago truncatula]
gi|355508244|gb|AES89386.1| Beta-galactosidase [Medicago truncatula]
Length = 825
Score = 680 bits (1755), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/834 (42%), Positives = 503/834 (60%), Gaps = 50/834 (5%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ +TYD R+L++DGK + SGSIHYPRSTP++WP+++ K++ GGL +I+TYVFWN HE
Sbjct: 25 AQTITYDGRSLLLDGKGELFFSGSIHYPRSTPDMWPDILDKARRGGLNLIQTYVFWNGHE 84
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P + + FEGR+DLV+F+K VQE G+++ LRIGP+ AEWN+GG P WL +P I FR+
Sbjct: 85 PEKDKVNFEGRYDLVKFLKLVQEKGMYVTLRIGPFIQAEWNHGGLPYWLREVPDIIFRSN 144
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK+ MK +++ +I+ MK+E LFA QGGPIILAQ+ENEY +++ AY G+ YV+WAA
Sbjct: 145 NEPFKKYMKEYVSIVINRMKEEKLFAPQGGPIILAQIENEYNHIQLAYEADGDNYVQWAA 204
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSF 239
AV+L VPWVMC+Q+DAPDP+IN CNG +C D FT PN P KP +WTEN++ + F
Sbjct: 205 KMAVSLYNGVPWVMCKQKDAPDPVINACNGRHCGDTFTGPNKPYKPFIWTENWTAQYRVF 264
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
G R ED+AF+VARFF G+ NYYMY GGTNFGRT T Y +AP+DE+
Sbjct: 265 GDPPSQRSAEDIAFSVARFFSKHGSLVNYYMYHGGTNFGRTTSA-FTTTRYYDEAPLDEF 323
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKS-SNDCAAFLANYD 358
G R+PKW HLR+ HKA+ LC++ L++ PT QK+ E +Y K SN CAAF+ N
Sbjct: 324 GLQREPKWSHLRDAHKAVNLCKKSLLNGVPTTQKISQYHEVIVYEKKESNLCAAFITNNH 383
Query: 359 SSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
+ + ++F G+ YFLP S+SILPDCK VVFNT + SQ ++ F + K N+
Sbjct: 384 TQTAKTLSFRGSDYFLPPRSISILPDCKTVVFNTQNIASQHSS--RHFEKSKTGND---- 437
Query: 419 SSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPG---QGKEV- 472
F W + E + + + AE + KD +DY WYT S+ + P + +V
Sbjct: 438 ---FKWEVFSEPIPSAKELPSKQKLPAELYSLLKDKTDYGWYTTSVELGPEDIPKKSDVA 494
Query: 473 -FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
L I SLGH+ FVN + + +G+H+ F K + G+N + IL+ +VGL +
Sbjct: 495 PVLRILSLGHSLQAFVNGEYIGSKHGSHEEKGFEFQKPVNFKVGVNQIAILANLVGLPDS 554
Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
GA+ + AG ++ ++ L +G DL+S W +QVG++GE + + WK G
Sbjct: 555 GAYMEHRYAGPKTITILGLMSGTIDLTSNGWGHQVGLQGENDSIFTEKGSKKVEWKDGKG 614
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
++ WYKT F PEG P+A+ + M KG WVNG+SIGR+W +YL+P
Sbjct: 615 --KGSTISWYKTNFDTPEGTNPVAIGMEGMAKGMIWVNGESIGRHWMSYLSP-------- 664
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
G+P Q+ YHIPR+++ P +NLLVI EE P KI++LT I
Sbjct: 665 --------------LGKPTQSEYHIPRSFLKPKDNLLVIFEEEAISPDKIAILTVNRDTI 710
Query: 712 CSFVSEADPPPVDSWKPNLGVVSS-----SPQVRLACERGWHIAAINFASYGIPEGNCGS 766
CSF++E PP + S+ + +P+ + C I A+ FAS+G P G CGS
Sbjct: 711 CSFITENHPPNIRSFASKNQKLERVGENLTPEAFITCPDQKKITAVEFASFGDPSGFCGS 770
Query: 767 FRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
F G C+ IV++ C+G+ CS+P+ A CP ++K LA++ C
Sbjct: 771 FIMGKCNAPSSKKIVEQLCLGKPTCSVPMVKATFTGGNDGCPDVVKTLAIQVKC 824
>gi|357133576|ref|XP_003568400.1| PREDICTED: beta-galactosidase 7-like [Brachypodium distachyon]
Length = 821
Score = 680 bits (1754), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/824 (44%), Positives = 499/824 (60%), Gaps = 51/824 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD RAL+++G RR+L SG +HY RSTPE+WP++I K+++GG++VI+TYVFWN HEP++
Sbjct: 39 VTYDGRALLLNGTRRMLFSGEMHYTRSTPEMWPKIIAKARKGGIDVIQTYVFWNVHEPVQ 98
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G+Y FEGR+++V+F++ +Q GL++ LRIGP+ AEW YGGFP WLH +P I FRT N P
Sbjct: 99 GKYNFEGRYNIVKFIREIQAQGLYVSLRIGPFIEAEWKYGGFPFWLHEVPNITFRTDNEP 158
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK+ M+ F+ ++++MK E L+ QGGPII++Q+ENEY VE A+G GG YV+WAA A
Sbjct: 159 FKQHMQGFVTHMVNMMKNEGLYYPQGGPIIISQIENEYQMVEPAFGPGGPRYVQWAASLA 218
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
V L T VPW+MC+Q DAPDPIINTCNG C + F PNSP+KP +WTEN++ + +G
Sbjct: 219 VGLQTGVPWMMCKQNDAPDPIINTCNGLICGETFVGPNSPNKPALWTENWTTRYPIYGND 278
Query: 243 VPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
R D+ FAVA F GG+F +YYMY GGTNFGR A V TSY AP+DEYG
Sbjct: 279 TKLRSTGDITFAVALFIARKGGSFVSYYMYHGGTNFGRFASS-YVTTSYYDGAPLDEYGL 337
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
I QP WGHL+ELH A+KL E L+ ++ LG EAH++ ++ C AFL N+D
Sbjct: 338 IWQPTWGHLKELHAAVKLSSEPLLYGTYSNFSLGEDQEAHVF-ETKLKCVAFLVNFDKHQ 396
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
V F L S+SIL DC+ VVF T KV +Q + Q N
Sbjct: 397 RPTVIFRNISLQLAPKSISILSDCRTVVFETGKVNAQHGSRTAEVVQSLN--------DT 448
Query: 422 FSWYEEKVGISGNRS---FVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKE-VFLNIE 477
+W K I + S + L E ++TTKD +DYLWY AS P V LN+E
Sbjct: 449 HTWKAFKESIPQDISKAAYTGKQLFEHLSTTKDETDYLWYIASYEYRPSDDSHLVLLNVE 508
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLI-NKKIELNEGINTLDILSMMVGLQNYGAWFD 536
S H FVN + V +G+H ++I N I L EG NT+ +L++MVG + GA +
Sbjct: 509 SQAHILHAFVNGEFVGSVHGSHGARGYIILNMTISLKEGQNTISLLNVMVGSPDSGAHME 568
Query: 537 VAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
G+ V + ++ L++ W YQVG+ GE + ++S W + L
Sbjct: 569 RRSFGIHKVSIQQGQHALHLLNNELWGYQVGLFGEGNRIYTQEGSHSVEWTDVNNL-TYL 627
Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
L WY+TTF P G + LNL SMGKG+ W+NG+SIGRYW ++ PS
Sbjct: 628 PLTWYQTTFATPMGNDAVTLNLTSMGKGEVWINGESIGRYWVSFKTPS------------ 675
Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
GQP+Q+LYHIP+ ++ +NLLV+ EE+GG+P +I++ T + +CS V+
Sbjct: 676 ----------GQPSQSLYHIPQHFLKNTDNLLVLVEEMGGNPLQITVNTVSITTVCSSVN 725
Query: 717 EADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHMDV 776
E PPV S P+VRL C++G HI+A+ FASYG P G+C +F G+CH +
Sbjct: 726 ELSAPPVQS-------QGKDPEVRLRCQKGKHISAVEFASYGNPAGDCRTFTIGSCHAES 778
Query: 777 L-PIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
+V++AC+G+ CSIPV G CPG+ K+L V AHC
Sbjct: 779 SESVVKQACIGKRSCSIPVGPGSFG--GDPCPGIQKSLLVVAHC 820
>gi|224135691|ref|XP_002327281.1| predicted protein [Populus trichocarpa]
gi|222835651|gb|EEE74086.1| predicted protein [Populus trichocarpa]
Length = 788
Score = 678 bits (1749), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/828 (45%), Positives = 491/828 (59%), Gaps = 77/828 (9%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+VTYD R+L+IDG+R+++ SGSIHYPRSTPE+WP LI K+KEGGL+ IETYVFWN HEP
Sbjct: 25 DVTYDGRSLIIDGQRKIVFSGSIHYPRSTPEMWPSLIAKAKEGGLDAIETYVFWNVHEPQ 84
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
G Y F G D+VRF+K VQ GL+ LRIGP+ +EW+YGG P WLH IPGI FR+ N
Sbjct: 85 PGHYDFSGGHDIVRFIKEVQAQGLYACLRIGPFIQSEWSYGGLPFWLHDIPGIVFRSDNE 144
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK M+ F AK++ +M+ ENL+ASQGGPIIL+Q+ENEYG V+ AYG G YV+WAA
Sbjct: 145 PFKVYMQNFTAKVVSMMQSENLYASQGGPIILSQIENEYGTVQKAYGQEGLAYVQWAAQM 204
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDG--FTPNSPSKPIMWTENYSGWFLSFGY 241
A L T VPWVMC+Q +AP +IN+CNG C PNSP+KP +WTEN++
Sbjct: 205 AEGLQTGVPWVMCKQNNAPGHVINSCNGMKCGQTFVGPNSPNKPSIWTENWT-------- 256
Query: 242 AVPFRPVEDLAFAVARFFET-GGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
+ ED+AF V F G+F NYYMY GGTNFGRTA V TSY AP+DEYG
Sbjct: 257 ---TQSAEDIAFHVTLFIAAKKGSFVNYYMYHGGTNFGRTASA-FVTTSYYDQAPLDEYG 312
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
QPKWGHL+ELH AIKLC L+S + LG + +A+I++ S +CAAFL N DSS
Sbjct: 313 LTTQPKWGHLKELHAAIKLCSTPLLSGVQVNLYLGPQQQAYIFNAVSGECAAFLINNDSS 372
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
+ A+V F Y LP S+SILPDCKNV + + R G E+L A+
Sbjct: 373 NAASVPFRNASYDLPPMSISILPDCKNV----STQYTTRTMGR---------GEVLDAAD 419
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
+ + E + + S L EQ+NTTKD+SDYLWYT + L++ SLG
Sbjct: 420 VWQEFTEAIPNFDSTSTRSETLLEQMNTTKDSSDYLWYTFRFQ-HESSDTQAILDVSSLG 478
Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
HA FVN + V G+ F + L++GIN + +LS+MVG+ + GA+ + A
Sbjct: 479 HALHAFVNGQAVGSVQGSRKNPRFKFETSVSLSKGINNVSLLSVMVGMPDSGAFLENRAA 538
Query: 541 GLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIW 600
GL +V++ D K D ++ W YQ+G++GE + + ++ WK+ S L W
Sbjct: 539 GLRTVMIRD-KQDNNDFTNYSWGYQIGLQGETLQIYTEQGSSQVQWKKFSN--AGNPLTW 595
Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
YKT AP G P+ LNLASMGKG+AWVNGQSIGRYW PS
Sbjct: 596 YKTQVDAPPGDVPVGLNLASMGKGEAWVNGQSIGRYW-----PS---------------- 634
Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADP 720
YH+PR+++ P NLLV+ EE GG+P ++SL T T +C V+ +
Sbjct: 635 -------------YHVPRSFLKPTGNLLVLQEEEGGNPLQVSLDTVTISQVCGHVTASHL 681
Query: 721 PPVDSW-------KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNC-GSFRPGAC 772
PV SW K V P+V LAC I+ I+FASYG P GNC S G C
Sbjct: 682 APVSSWIEHNQRYKNPAKVSGRRPKVLLACPSKSKISRISFASYGTPLGNCRNSMAVGTC 741
Query: 773 H-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
H + +V++AC+G+++CSIPVS G CP K+L V A C
Sbjct: 742 HSQNSKAVVEEACLGKMKCSIPVSVRQFG--GDPCPAKAKSLMVVAEC 787
>gi|449464182|ref|XP_004149808.1| PREDICTED: beta-galactosidase 16-like [Cucumis sativus]
Length = 801
Score = 677 bits (1747), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/830 (44%), Positives = 506/830 (60%), Gaps = 56/830 (6%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+ TYD R+L+++G+ ++L SGSIHYPRSTP++WP LI K+KEGG++VI+TYVFWN HEP
Sbjct: 15 SATYDGRSLIVNGEHKLLFSGSIHYPRSTPDMWPSLIAKAKEGGIDVIQTYVFWNLHEPQ 74
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+G Y F GR D+VRFVK +Q GL+ LRIGP+ AEW+YGG P WLH + GI +R+ N
Sbjct: 75 QGTYEFSGRRDIVRFVKEIQAQGLYACLRIGPFIEAEWSYGGLPFWLHDVLGIVYRSDNE 134
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK M+ F KI+++MK E L+ASQGGPIIL+Q+ENEY VE A+G G YV+WAA
Sbjct: 135 PFKLHMQNFTTKIVNMMKSEGLYASQGGPIILSQIENEYTLVEAAFGEKGPPYVQWAAKM 194
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGY 241
AV+L T VPW MC+Q DAPDP+INTCNG C + FT PNSP+KP +WTEN++ ++ ++G
Sbjct: 195 AVSLQTGVPWSMCKQNDAPDPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGE 254
Query: 242 AVPFRPVEDLAFAVARFFET-GGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
R E++AF VA F GT+ NYYMY GGTNFGR+A ++ YD +P+DEYG
Sbjct: 255 EPYIRSAEEIAFHVALFIAAKNGTYVNYYMYHGGTNFGRSASAFMITGYYD-QSPLDEYG 313
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
R+PKWGHL+ELH A+KLC L++ ++ LG +EA ++ SN+CAAFL N +
Sbjct: 314 LTREPKWGHLKELHAAVKLCSTPLLTGTKSNFSLGQSVEAIVFKTESNECAAFLVN-RGA 372
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
D+NV F Y LP S+SILPDCKNV FNT +V Q N Q+ ++ E
Sbjct: 373 IDSNVLFQNVTYELPLGSISILPDCKNVAFNTRRVSVQHNTRSMMAVQKFDLLE------ 426
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
+ ++E + + +L E + TTKD SDYLWYT + ++ L ++S
Sbjct: 427 -WEEFKEPIPNIDDTELRANELLEHMGTTKDRSDYLWYTFRVQQDSPDSQQT-LEVDSRA 484
Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
HA FVN +G + F + K I L GIN + +LS+MVGL + GA+ + A
Sbjct: 485 HALHAFVNGDYAGSAHGIYKEKGFSLAKNITLRNGINNISLLSVMVGLPDSGAFLETRVA 544
Query: 541 GLFSVILIDLKNGKRDLSSGEWIYQVGVEGE--YIGLDKISLANSSFWKQGSTLPVNKSL 598
GL V + D S W Y+VG+ GE I LD S +N + + G++ ++ L
Sbjct: 545 GLRRVGI-----QGEDFSEQHWGYKVGLSGEQSQIFLDTGS-SNVQWSRLGNS---SQPL 595
Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
WYKT F AP G P+ALNL SMGKG WVNG+ IGRYW ++L P
Sbjct: 596 TWYKTQFDAPPGDDPIALNLGSMGKGAVWVNGRGIGRYWVSFLTPK-------------- 641
Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEA 718
G+P+Q Y++PR+++ P +N LVI EE G+P +ISL + C VSE+
Sbjct: 642 --------GEPSQKWYNVPRSFLKPTDNQLVILEEETGNPVEISLDSVLITKTCGQVSES 693
Query: 719 DPPPVDSW----KPNLGVV---SSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
P V SW K + V + P+V+L+C I+ I FAS+G P G+C S+ G
Sbjct: 694 HYPLVASWMGAKKQKVRRVKNRTRRPKVQLSCPSKKKISNILFASFGTPSGDCQSYAIGL 753
Query: 772 CHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CH + IV+ AC+G+ +CSIP+S+ L CP + K L V+A C+
Sbjct: 754 CHSPNSRAIVEHACLGRAKCSIPISN--LNFRGDPCPHVTKTLLVDAQCT 801
>gi|242081931|ref|XP_002445734.1| hypothetical protein SORBIDRAFT_07g024870 [Sorghum bicolor]
gi|241942084|gb|EES15229.1| hypothetical protein SORBIDRAFT_07g024870 [Sorghum bicolor]
Length = 844
Score = 675 bits (1741), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/831 (42%), Positives = 494/831 (59%), Gaps = 48/831 (5%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
++YD R+L++DG+R + SGSIHYPRS P++WPELI K+KEGGL IETYVFWN HEP +
Sbjct: 38 ISYDRRSLMVDGRREIFFSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYVFWNIHEPEK 97
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
GQ+ FEGR+D+V+F K +QE +F +R+GP+ AEWN+GG P WL IP I FRT N P
Sbjct: 98 GQFNFEGRYDMVKFFKLIQEHDMFAMVRLGPFIQAEWNHGGLPYWLREIPDIVFRTNNEP 157
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
+K M+ F+ +I +K NLFASQGGPIILAQ+ENEY ++E A+ G Y+ WAA A
Sbjct: 158 YKMHMETFVKIVIKRLKDANLFASQGGPIILAQIENEYQHLEAAFKEEGTKYIHWAAQMA 217
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSK--PIMWTENYSGWFLSFGYA 242
+ N +PW+MC+Q AP +I TCNG C P +K P++WTEN++ + FG
Sbjct: 218 IGTNIGIPWIMCKQTKAPGDVIPTCNGRNCGDTWPGPMNKTMPLLWTENWTAQYRVFGDP 277
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
R ED+AFAVARFF GGT NYYMY GGTNFGRTA ++ YD +AP+DE+G
Sbjct: 278 PSQRSAEDIAFAVARFFSVGGTMTNYYMYHGGTNFGRTAAAFVMPKYYD-EAPLDEFGLY 336
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSS 361
++PKWGHLR+LH A+KLC++ L+ P+ +KLG +LEA ++ C AFL+N+++
Sbjct: 337 KEPKWGHLRDLHLALKLCKKALLWGKPSTEKLGKQLEARVFEIPEQKVCVAFLSNHNTKD 396
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
D +TF G YF+P S+SIL DCK VVF T V +Q N FA Q N N +
Sbjct: 397 DVTLTFRGQPYFVPRHSISILADCKTVVFGTQHVNAQHNQRTFHFADQTNQNNVWQM--- 453
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPG-----QGKEVFLNI 476
+ EEKV A+ N TKD +DY+WYT+S + P + + + +
Sbjct: 454 --FDEEKVPKYKQAKIRTRKAADLYNLTKDKTDYVWYTSSFKLEPDDMPIRRDIKTVVEV 511
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
S GHA++ FVN K G+G F + K +EL +G+N + +L+ +G+ + GA+ +
Sbjct: 512 NSHGHASVAFVNNKFAGCGHGTKMNKAFTLEKPMELKKGVNHVAVLASSMGMMDSGAYLE 571
Query: 537 VAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
AG+ V + L G DL++ W + VG+ GE + S WK +K
Sbjct: 572 HRLAGVDRVQITGLNAGTLDLTNNGWGHIVGLVGEQKEIYTEKGMASVTWKPAVN---DK 628
Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
L WYK F P G+ P+ L++++MGKG +VNGQ IGRYW +Y
Sbjct: 629 PLTWYKRHFDMPSGEDPIVLDMSTMGKGMMYVNGQGIGRYWMSY---------------- 672
Query: 657 YDASKCQKHC-GQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
KH G+P+Q LYHIPR+++ P +N+LV+ EE G P I +LT +IC+++
Sbjct: 673 -------KHALGRPSQQLYHIPRSFLRPKDNVLVLFEEEFGRPDAIMILTVKRDNICTYI 725
Query: 716 SEADPPPVDSWKPNLGVVSSS-----PQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
SE +P + SW+ ++++ + L C I + FASYG P G CG++ G
Sbjct: 726 SERNPAHIKSWERKDSQITATADDLKARATLTCPPKKLIQQVVFASYGNPVGICGNYTIG 785
Query: 771 ACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+CH +V+K+C+G+ C++PVS+ G CPG LAV+A CS
Sbjct: 786 SCHTPRAKEVVEKSCLGKRTCTLPVSADVYGGDVN-CPGTTATLAVQAKCS 835
>gi|357130214|ref|XP_003566745.1| PREDICTED: beta-galactosidase 13-like [Brachypodium distachyon]
Length = 829
Score = 674 bits (1740), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/832 (43%), Positives = 491/832 (59%), Gaps = 49/832 (5%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V Y+ RALVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL+ IETYVFWN HEP
Sbjct: 30 VAYNDRALVIDGQRRIVLSGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPRP 89
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
QY F G +D+VRF K +Q AG++ LRIGPY C EWNYGG P WL IPG+QFR N P
Sbjct: 90 RQYNFAGNYDIVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRMHNQP 149
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAAD 182
F+ EM+ F I++ +K N+FA QGGPIIL+Q+ENEYGN+ Y+ W A
Sbjct: 150 FEHEMETFTTLIVNKLKDANMFAGQGGPIILSQIENEYGNIMANLTDAQSASEYIHWCAA 209
Query: 183 TAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A N VPW+MCQQ+ D P +INTCNGFYC + P P +WTEN++GWF ++
Sbjct: 210 MANKQNVGVPWIMCQQDADVPPNVINTCNGFYCHDWFPKRTDIPKIWTENWTGWFKAWDK 269
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
R +D+AFAVA FF+ G+ QNYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG
Sbjct: 270 PDFHRSAQDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYGN 329
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
IR+PK+GHL++LH +K E+ L+ D + G + Y + F++N
Sbjct: 330 IREPKYGHLKDLHAVLKSMEKILVHGDFSDINYGRNVTVTKYTLDGSS-VCFISNQFDDR 388
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
DAN T +G + +PAWSVS+LPDCK V +NTAK+ +Q + ++ N E +
Sbjct: 389 DANATIDGTTHVVPAWSVSVLPDCKAVAYNTAKIKAQTS----VMVKKPNTVEQEPENLK 444
Query: 422 FSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
+SW E + SF + +L EQI T+ D SDYLWY S G+ K L++ +
Sbjct: 445 WSWMPEHLKPFMTDEKGSFRKNELLEQITTSTDQSDYLWYRTSFE-HKGEAK-YKLSVNT 502
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
GH FVN KL + + F + ++L++G N L +LS +GL+NYGA F++
Sbjct: 503 TGHQIYAFVNGKLAGRQHSPNGAFIFQLESPVKLHDGKNYLSLLSATMGLKNYGALFELM 562
Query: 539 GAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQGSTLPV 594
AG+ V L+D DLS+ W Y+ G+ GE+ I LDK + T+P+
Sbjct: 563 PAGIVGGPVKLVDNNGSTIDLSNSSWSYKAGLAGEHRQIHLDKPGY---KWHGDNGTIPI 619
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
N++ WYK TF AP G+ + +L + KG AWVNG ++GRYW +Y+A G CDYR
Sbjct: 620 NRAFTWYKATFQAPAGEEAVVADLMGLNKGVAWVNGNNLGRYWPSYVAAEMGGCHHCDYR 679
Query: 655 GSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLTKTGQ 709
G++ D KC C +PAQ YH+PR ++ GE N +V+ EE GGDPS++ T
Sbjct: 680 GAFKAEGDGLKCLTGCNEPAQRFYHVPRVFLRAGEPNTVVLFEEAGGDPSRVGFHTVAVG 739
Query: 710 HICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC--ERGWHIAAINFASYGIPEGNCGSF 767
+C +E V L+C +G I++++ ASYG+ G CG++
Sbjct: 740 PVCVEAAE-----------------KGDNVTLSCGQHKGRTISSVDLASYGVTRGQCGAY 782
Query: 768 RPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
+ G +ACVG+ C++ + A+ G AG G+ L V+A C
Sbjct: 783 QGGCESKAAYEAFAEACVGKESCTVQHTDAFSG--AGCQSGV---LTVQATC 829
>gi|224142776|ref|XP_002324727.1| predicted protein [Populus trichocarpa]
gi|222866161|gb|EEF03292.1| predicted protein [Populus trichocarpa]
Length = 749
Score = 673 bits (1736), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/756 (45%), Positives = 466/756 (61%), Gaps = 43/756 (5%)
Query: 35 VWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIG 94
+WPEL +K+KEGG++ IETY+FW+ HEP+R QYYF G D+V+F K QEAGL + LRIG
Sbjct: 1 MWPELFQKAKEGGIDAIETYIFWDRHEPVRRQYYFSGNQDIVKFCKLAQEAGLHVILRIG 60
Query: 95 PYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPII 154
PY CAEW+YGGFP+WLH IPGI+ RT N +K EM+ F KI+D+ K+ LFA QGGPII
Sbjct: 61 PYVCAEWSYGGFPMWLHNIPGIELRTDNEIYKNEMQIFTTKIVDVCKEAKLFAPQGGPII 120
Query: 155 LAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC 214
LAQ+ENEYGNV YG G YV W A AV N VPW+MCQQ +AP P+INTCNGFYC
Sbjct: 121 LAQIENEYGNVMGPYGDAGRRYVNWCAQMAVGQNVGVPWIMCQQSNAPQPMINTCNGFYC 180
Query: 215 DGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGG 274
D F PN+P P MWTEN+SGWF +G P+R EDLAF+VARF + GG +YYMY GG
Sbjct: 181 DQFKPNNPKSPKMWTENWSGWFKLWGGRDPYRTAEDLAFSVARFIQNGGVLNSYYMYHGG 240
Query: 275 TNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKL 334
TNFGRTAGGP + TSYDY+AP+DEYG + QPKWGHL++LH+AIK E L + T +
Sbjct: 241 TNFGRTAGGPYITTSYDYNAPLDEYGNLNQPKWGHLKQLHEAIKQGERILTNGTVTSKNF 300
Query: 335 GAKLEAHIY-HKSSNDCAAFLANYDSSSDANVTFNGN-VYFLPAWSVSILPDCKNVVFNT 392
++ Y ++ + + FL+N + +ANV + Y LPAWSV+IL DC ++NT
Sbjct: 301 WGGVDQTTYTNQGTGERFCFLSNTN-MEEANVDLGQDGKYSLPAWSVTILQDCNKEIYNT 359
Query: 393 AKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEE--KVGISGNRSFVRPDLAEQINTTK 450
AKV +Q + ++ +L +++W E K + G F +L EQ TT
Sbjct: 360 AKVNTQTSIMVKKLHEEDKPVQL-----SWTWAPEPMKGVLQGKGRFRATELLEQKETTV 414
Query: 451 DTSDYLWYTASIHVMPGQGKE---VFLNIESLGHAALVFVNKKLVAFGYGNH-------- 499
DT+DYLWY S+++ K+ V L + + GH +VNKK + +
Sbjct: 415 DTTDYLWYMTSVNLNETTLKKWTNVTLRVGTRGHTLHAYVNKKEIGTQFSKQANAQQSVK 474
Query: 500 -DFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGK--RD 556
D +FL K + L G NT+ +LS VGL NYG ++D G+ + + NGK D
Sbjct: 475 GDDYSFLFEKPVTLTSGTNTISLLSATVGLANYGQYYDKKPVGIAEGPVQLVANGKPFMD 534
Query: 557 LSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLAL 616
L+S +W Y++G+ GE + + ++S + LP +++ WYKTTF +P G P+ +
Sbjct: 535 LTSYQWSYKIGLSGEAKRYNDPNSPHASKFTASDNLPTGRAMTWYKTTFASPSGTEPVVV 594
Query: 617 NLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHI 676
+L MGKG AWVNG+S+GR+W +A + GC CDYRGSY+ KC +CG P+Q YHI
Sbjct: 595 DLLGMGKGHAWVNGKSLGRFWPTQIADAKGCPDTCDYRGSYNGDKCVTNCGNPSQRWYHI 654
Query: 677 PRTWVHP-GENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSS 735
PR++++ G+N L++ EE+GG+P+ +S + IC E
Sbjct: 655 PRSYLNKDGQNTLILFEEVGGNPTNVSFQIVAVETICGNAYEGS---------------- 698
Query: 736 SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
+ L+CE G I+ I FASYG PEG CG+F G+
Sbjct: 699 --TLELSCEGGRTISDIQFASYGDPEGTCGAFMKGS 732
>gi|357467507|ref|XP_003604038.1| Beta-galactosidase [Medicago truncatula]
gi|355493086|gb|AES74289.1| Beta-galactosidase [Medicago truncatula]
Length = 847
Score = 673 bits (1736), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/842 (42%), Positives = 500/842 (59%), Gaps = 54/842 (6%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NVTYD ++L ++G+R +L SGSIHY RSTP+ WP+++ K++ GGL VI+TYVFWN HEP
Sbjct: 34 NVTYDGKSLFVNGRRELLFSGSIHYTRSTPDAWPDILDKARHGGLNVIQTYVFWNAHEPE 93
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+G++ FEG DLV+F++ VQ G+++ LR+GP+ AEWN+GG P WL +PGI FR+ N
Sbjct: 94 QGKFNFEGNNDLVKFIRLVQSKGMYVTLRVGPFIQAEWNHGGLPYWLREVPGIIFRSDNE 153
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
P+K+ MK +++KII +MK E LFA QGGPIILAQ+ENEY +++ AY G+ YV+WAA+
Sbjct: 154 PYKKYMKAYVSKIIQMMKDEKLFAPQGGPIILAQIENEYNHIQLAYEEKGDSYVQWAANM 213
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGY 241
AV L+ VPW+MC+Q+DAPDP+IN CNG +C D F+ PN P KP +WTEN++ + FG
Sbjct: 214 AVALDIGVPWIMCKQKDAPDPVINACNGRHCGDTFSGPNKPYKPSLWTENWTAQYRVFGD 273
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
V R ED+AF+VARFF G NYYMY GGTNFGRT T Y +AP+DEYG
Sbjct: 274 PVSQRSAEDIAFSVARFFSKNGNLVNYYMYHGGTNFGRTTSA-FTTTRYYDEAPLDEYGM 332
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYDSS 360
RQPKW HLR+ HKA+ LC + ++ PT QKL E I+ K ++ C+AF+ N ++
Sbjct: 333 ERQPKWSHLRDAHKALLLCRKAILGGVPTVQKLNDYHEVRIFEKPGTSTCSAFITNNHTN 392
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQ----RNNGDHPFAQ----QKNV 412
A ++F G+ YFLPA S+S+LPDCK VV+NT V++Q + H + Q N
Sbjct: 393 QAATISFRGSNYFLPAHSISVLPDCKTVVYNTQNVMNQLVYYKLISSHLIIKLIVSQHNK 452
Query: 413 NELLLASSA--FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ 468
+ ++ A W + E + S + E KDT+DY WYT S + P
Sbjct: 453 RNFVKSAVANNLKWELFLEAIPSSKKLESNQKIPLELYTLLKDTTDYGWYTTSFELGPED 512
Query: 469 --GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMV 526
K L I SLGH FVN + + +G H+ +F + G N + IL+ V
Sbjct: 513 LPKKSAILRIMSLGHTLSAFVNGQYIGTDHGTHEEKSFEFEQPANFKVGTNYISILATTV 572
Query: 527 GLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFW 586
GL + GA+ + AG S+ ++ L GK +L+ W ++VG+ GE + + + W
Sbjct: 573 GLPDSGAYMEHRYAGPKSISILGLNKGKLELTKNGWGHRVGLRGEQLKVFTEEGSKKVQW 632
Query: 587 K--QGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPS 644
G T ++L W KT F PEG+GP+A+ + MGKG WVNG+SIGR+W ++L+P
Sbjct: 633 DPVTGET----RALSWLKTRFATPEGRGPVAIRMTGMGKGMIWVNGKSIGRHWMSFLSP- 687
Query: 645 TGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLL 704
GQP+Q YHIPR +++ +NLLV+ EE G P KI ++
Sbjct: 688 ---------------------LGQPSQEEYHIPRDYLNAKDNLLVVLEEEKGSPEKIEIM 726
Query: 705 TKTGQHICSFVSEADPPPVDSWKPNLGVV-----SSSPQVRLACERGWHIAAINFASYGI 759
ICS+++E P V+SW G +S PQ L C G I A+ FAS+G
Sbjct: 727 IVDRDTICSYITENSPANVNSWGSKNGEFRSVGKNSGPQASLKCPSGKKIVAVEFASFGN 786
Query: 760 PEGNCGSFRPGACHMDVLP-IVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAH 818
P G CG F G C+ +V+KAC+G+ EC + V+ A + C G + LA++A
Sbjct: 787 PSGYCGDFALGNCNGGAAKGVVEKACLGKEECLVEVNRA--NFNGQGCAGSVNTLAIQAK 844
Query: 819 CS 820
CS
Sbjct: 845 CS 846
>gi|449433325|ref|XP_004134448.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 7-like [Cucumis
sativus]
Length = 803
Score = 672 bits (1735), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/833 (44%), Positives = 504/833 (60%), Gaps = 61/833 (7%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+VTYD R+L I+G+R+++ SG+IHYPRS+P +WP L++K+K GGL IETYVFWN HEP
Sbjct: 15 SVTYDGRSLKINGERKIIISGAIHYPRSSPGMWPMLMKKAKNGGLNAIETYVFWNAHEPQ 74
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
RGQY F G DLV+F+K VQ+ L+ LRIGPY CAEWNYGGFPVWLH +PGI+FRT N
Sbjct: 75 RGQYDFSGNNDLVQFIKAVQKERLYAILRIGPYVCAEWNYGGFPVWLHNLPGIKFRTNNQ 134
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
+K F +L K N+F + +ENE+GNVE +YG G+ YVKW A+
Sbjct: 135 VYKVTFXFFFL-TKNLKKINNMF-------LKNXIENEFGNVEGSYGQEGKEYVKWCAEL 186
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A + N S PW+MCQQ DAP PI+ CN CD F PN+ + P MWTE+++GWF +G
Sbjct: 187 AQSYNLSEPWIMCQQGDAPQPIV--CN---CDQFKPNNKNSPKMWTESWAGWFKGWGERD 241
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P+R EDLAFAVARFF+ GG+ NYYMY GGTNFGR+AGGP + TSYDY+AP+DEYG +
Sbjct: 242 PYRTAEDLAFAVARFFQYGGSLHNYYMYHGGTNFGRSAGGPYITTSYDYNAPLDEYGNMN 301
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY-HKSSNDCAAFLANYDSSSD 362
QPKWGHL++LH+ I+ E+ L D H G A Y +K + C F N +SD
Sbjct: 302 QPKWGHLKQLHELIRSMEKVLTYGDVKHIDTGHSTTATSYTYKGKSSC--FFGN-PENSD 358
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDH-PFAQQKNVNELLLASSA 421
+TF Y +P WSV++LPDCK V+NTAKV +Q + P K+ L
Sbjct: 359 REITFQERKYTVPGWSVTVLPDCKTEVYNTAKVNTQTTIREMVPSLVGKHKKPL-----K 413
Query: 422 FSWYEEKV-------GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM---PGQGKE 471
+ W EK+ ISG+ + L +Q T D+SDYLWY H+ P GK
Sbjct: 414 WQWRNEKIEHLTHEGDISGS-AITANSLIDQKMVTNDSSDYLWYLTGFHLNGNDPLFGKR 472
Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIE-LNEGINTLDILSMMVGLQN 530
V L +++ GH FVN K + +G + +F + KK+ L G N + +LS VGL N
Sbjct: 473 VTLRVKTRGHILHAFVNNKHIGTQFGPYGKYSFTLEKKVRNLRHGFNQIALLSATVGLPN 532
Query: 531 YGAWFDVAGAGLFSVILIDLKNGK--RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQ 588
YGA+++ G++ + + + +GK RDLS+ EWIY+VG++GE W
Sbjct: 533 YGAYYENVEVGIYGPVEL-IADGKTIRDLSTNEWIYKVGLDGEKYEFFDPDHKFRKPW-L 590
Query: 589 GSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
+ LP+N++ WYKT+F P+G+ + ++L MGKGQAWVNG+SIGRYW +YLA GC+
Sbjct: 591 SNNLPLNQNFTWYKTSFSTPKGREGVVVDLMGMGKGQAWVNGKSIGRYWPSYLATENGCS 650
Query: 649 KKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPG-ENLLVIHEELGGDPSKISLLTKT 707
CDYRG+Y SKC +CG+P Q YHIPR++++ G EN L++ EE GG P I + T
Sbjct: 651 SSCDYRGAYYGSKCATNCGKPTQRWYHIPRSYMNDGKENTLILFEEFGGMPLNIEIKTTR 710
Query: 708 GQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
+ +C+ K +LG ++ L C + I F +G P+GNC +F
Sbjct: 711 VKKVCA-------------KVDLG-----SKLELTCHDR-TVKRIIFVGFGNPKGNCNNF 751
Query: 768 RPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G+CH + +++K C+ + +CSI V+ LG++ P LAV+ C
Sbjct: 752 HKGSCHSSEAFSVIEKECLWKRKCSIEVTKDKLGLTGCKNPK-DNWLAVQVSC 803
>gi|414878435|tpg|DAA55566.1| TPA: hypothetical protein ZEAMMB73_938277 [Zea mays]
Length = 774
Score = 672 bits (1733), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/754 (47%), Positives = 471/754 (62%), Gaps = 48/754 (6%)
Query: 105 GFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN 164
GFPVWL +PGI+FRT N P+K EM+ F+ KI+D+MK+E L++ QGGPIIL Q+ENEYGN
Sbjct: 19 GFPVWLRDVPGIEFRTDNEPYKAEMQIFVTKIVDIMKEEKLYSWQGGPIILQQIENEYGN 78
Query: 165 VEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSK 224
++ YG G+ Y+ WAA A+ L+T VPWVMC+Q DAP+ I+NTCN FYCDGF PNS +K
Sbjct: 79 IQGHYGQAGKRYMLWAAQMALALDTGVPWVMCRQTDAPEQILNTCNAFYCDGFKPNSYNK 138
Query: 225 PIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGP 284
P +WTE++ GW+ +G ++P RP +D AFAVARF++ GG+ QNYYMYFGGTNF RTAGGP
Sbjct: 139 PTIWTEDWDGWYADWGESLPHRPAQDSAFAVARFYQRGGSLQNYYMYFGGTNFERTAGGP 198
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSD--PTHQKLGAKLEAHI 342
L TSYDYDAPIDEYG +RQPKWGHL++LH AIKLCE L + D P + KLG EAH+
Sbjct: 199 LQITSYDYDAPIDEYGILRQPKWGHLKDLHAAIKLCESALTAVDGSPHYVKLGPMQEAHV 258
Query: 343 YHK-----------SSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFN 391
Y +S C+AFLAN D A+V G Y LP WSVSILPDC+ V FN
Sbjct: 259 YSSENVHTNGSISGNSQFCSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCETVAFN 318
Query: 392 TAKVISQRN-----NGDHPFAQQKNVNELLLASSAF---SW--YEEKVGISGNRSFVRPD 441
TA+V +Q + +G ++ + L L + +W ++E VGI G F
Sbjct: 319 TARVGTQTSFFNVESGSPSYSSRHKPRILSLIGVPYLSTTWWTFKEPVGIWGEGIFTAQG 378
Query: 442 LAEQINTTKDTSDYLWYTASIHVMP-------GQGKEVFLNIESLGHAALVFVNKKLVAF 494
+ E +N TKD SDYL YT +++ +G L I+ + A VFVN KL
Sbjct: 379 ILEHLNVTKDISDYLSYTTRVNISEEDVLYWNSKGFLPSLTIDQIRDVARVFVNGKLAGS 438
Query: 495 GYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNG 553
G+ +N+ ++L +G+N L +LS +VGLQNYGA+ + GAG V L L NG
Sbjct: 439 KVGHW----VSLNQPLQLVQGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVKLTGLSNG 494
Query: 554 KRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGP 613
DL++ W YQ+G++GE+ + S+ W W+KT F APEG GP
Sbjct: 495 DIDLTNSLWTYQIGLKGEFSRIYSPEYQGSAEWSSMQNDDTVSPFTWFKTMFDAPEGNGP 554
Query: 614 LALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTL 673
+ ++L SMGKGQAWVNG IGRYWS +AP +GC C+Y G+Y SKC+ +CG Q+
Sbjct: 555 VTIDLGSMGKGQAWVNGHLIGRYWS-LVAPESGCPSSCNYAGTYSDSKCRSNCGIATQSW 613
Query: 674 YHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSW------K 727
YHIPR W+ NLLV+ EE GGDPS+ISL + ICS +SE PP+ +W +
Sbjct: 614 YHIPREWLQESGNLLVLFEETGGDPSQISLEVHYTKTICSKISETYYPPLSAWSRAANGR 673
Query: 728 PNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHMD-VLPIVQKACVG 786
P++ V +P++RL C+ G I+ I FASYG P G C +F G CH L +V +AC G
Sbjct: 674 PSVNTV--APELRLQCDDGHVISKITFASYGTPTGGCQNFSVGNCHASTTLDLVVEACEG 731
Query: 787 QIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ C+I V++ G C ++K LAVEA CS
Sbjct: 732 KNRCAISVTNEVFG---DPCRKVVKDLAVEAECS 762
>gi|356532710|ref|XP_003534914.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 650
Score = 671 bits (1731), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/610 (54%), Positives = 418/610 (68%), Gaps = 22/610 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
++A+VTYDH+A+V+DGKRR+L SGSIHYPRSTP++WP+LI+K+K+GGL+VI+TYVFWN H
Sbjct: 21 VTASVTYDHKAIVVDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIQTYVFWNGH 80
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP GQYYFE RFDLV+FVK Q+AGL++HLRIGPY CAEWN GGFPVWL ++PGI FRT
Sbjct: 81 EPSPGQYYFEDRFDLVKFVKLAQQAGLYVHLRIGPYICAEWNLGGFPVWLKYVPGIAFRT 140
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M++F AKI+ LMK+ LF SQGGPIIL+Q+ENEYG VEW G G+ Y KWA
Sbjct: 141 DNEPFKAAMQKFTAKIVSLMKENRLFQSQGGPIILSQIENEYGPVEWEIGAPGKAYTKWA 200
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV L+T VPWVMC+QEDAPDP+I+TCNGFYC+ F PN +KP MWTEN++GW+ FG
Sbjct: 201 AQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGFYCENFKPNKNTKPKMWTENWTGWYTDFG 260
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
AVP RP EDLAF+VARF + GG+F NYYMY GGTNFGRT+GG +ATSYDYDAP+DEYG
Sbjct: 261 GAVPRRPAEDLAFSVARFIQNGGSFVNYYMYHGGTNFGRTSGGLFIATSYDYDAPLDEYG 320
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+PK+ HLR LHKAIK E L+++DP Q LG LEAH++ + CAAF+ANYD+
Sbjct: 321 LENEPKYEHLRALHKAIKQSEPALVATDPKVQSLGYNLEAHVF-SAPGACAAFIANYDTK 379
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A F Y LP WS+SILPDCK VV+NTAKV G + VN S
Sbjct: 380 SYAKAKFGNGQYDLPPWSISILPDCKTVVYNTAKV------GYGWLKKMTPVN------S 427
Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
AF+W EE S S L EQ+N T+D+SDYLWY ++V + G+
Sbjct: 428 AFAWQSYNEEPASSSQADSIAAYALWEQVNVTRDSSDYLWYMTDVNVNANEGFLKNGQSP 487
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L + S GH VF+N +L +G + ++L G N L +LS+ VGL N G
Sbjct: 488 LLTVMSAGHVLHVFINGQLAGTVWGGLGNPKLTFSDNVKLRAGNNKLSLLSVAVGLPNVG 547
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
F+ AG+ V L L G RDLS +W Y+VG++GE + L S ++S W QGS
Sbjct: 548 VHFETWNAGVLGPVTLKGLNEGTRDLSRQKWSYKVGLKGESLSLHTESGSSSVEWIQGSL 607
Query: 592 LPVNKSLIWY 601
+ + L WY
Sbjct: 608 VAKKQPLTWY 617
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 27/36 (75%)
Query: 672 TLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
T YH+PR+W+ G N LV+ EE GGDP+ I+L+ +T
Sbjct: 615 TWYHVPRSWLSSGGNSLVVFEEWGGDPNGIALVKRT 650
>gi|326520505|dbj|BAK07511.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 830
Score = 671 bits (1730), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/836 (43%), Positives = 499/836 (59%), Gaps = 52/836 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V YD RALVIDG+RR+L SGSIHYPRSTPE+WP+LIRK+KEGGL+ IETYVFWN HEP R
Sbjct: 26 VGYDDRALVIDGERRLLISGSIHYPRSTPEMWPDLIRKAKEGGLDAIETYVFWNGHEPRR 85
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
QY FEG +D+VRF K VQ+AG++ LRIGPY C EWNYGG P WL I G+QFR N+P
Sbjct: 86 RQYNFEGSYDIVRFFKEVQDAGMYAILRIGPYICGEWNYGGLPAWLRDISGMQFRMHNHP 145
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAAD 182
F++EM+ F I+D +K+ +FA QGGPIIL+Q+ENEYGN+ + Y+ W A
Sbjct: 146 FEQEMETFTTLIVDKLKEAKMFAGQGGPIILSQIENEYGNIMGKLNNNESASEYIHWCAA 205
Query: 183 TAVNLNTSVPWVMCQQ-EDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A N VPW+MCQQ +D P +INT NGFYC + P P +WTEN++GWF ++
Sbjct: 206 MANKQNVGVPWIMCQQDDDVPSNVINTWNGFYCHDWFPKRTDIPKIWTENWTGWFKAWDK 265
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
R ED+AF+VA FF+T G+ QNYYMY GGTNFGRT+GGP + TSYDYDAP+DEYG
Sbjct: 266 PDFHRSAEDIAFSVAMFFQTRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 325
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
IRQPK+GHL++LH +K E+ L+ D +G + N A F++N
Sbjct: 326 IRQPKYGHLKDLHNVLKSMEKILLHGDYKDTTMGNTNVTVTKYTLDNSSACFISNKFDDK 385
Query: 362 DANVTF-NGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
+ NVT NG + +PAWSVSILPDCK V +N+AK+ +Q + ++ E +
Sbjct: 386 EVNVTLDNGATHTVPAWSVSILPDCKTVAYNSAKIKTQTS-----VMVKRPGAETVTDGL 440
Query: 421 AFSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVF-LNI 476
A+SW E + +F + +L EQI T+ D SDYLWY S +G+ + L++
Sbjct: 441 AWSWMPENLQPFMTDEKGNFRKNELLEQIATSGDQSDYLWYRTSFE---HKGESNYKLHV 497
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
+ GH FVN KLV Y + F + ++L+ G N + +LS +GL+NYGA F+
Sbjct: 498 NTTGHELYAFVNGKLVGRHYSPNGGFAFQMETPVKLHSGKNYISLLSATIGLKNYGALFE 557
Query: 537 VAGAGLFS--VILIDLKNGKR--DLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQG- 589
+ AG+ V L+D DLS+ W Y+ G+ GEY LDK + + S W G
Sbjct: 558 MMPAGIVGGPVKLVDTVTNTTAYDLSNSSWSYKAGLAGEYRETHLDKAN--DRSQWSGGL 615
Query: 590 -STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
T+PV++ WYK TF AP G+ P+ +L +GKG WVNG ++GRYW +Y+A
Sbjct: 616 NGTIPVHRPFTWYKATFEAPAGEEPVVADLLGLGKGVVWVNGNNLGRYWPSYVAADMDGC 675
Query: 649 KKCDYRGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISL 703
++CDYRG++ D KC C +P+Q YH+PR+++ GE N +V+ EE GGDP+++S
Sbjct: 676 QRCDYRGTFKAEGDGQKCLTGCNEPSQRFYHVPRSFIKAGEPNTMVLFEEAGGDPTRVSF 735
Query: 704 LTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGN 763
T C+ + +V LAC G I++++ AS G+ G
Sbjct: 736 HTVAVGAACAEAA-----------------EVGDEVALACSHGRTISSVDVASLGVARGK 778
Query: 764 CGSFRPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
CG+++ G L ACVG+ C++ + + +G G+ L V+A C
Sbjct: 779 CGAYQGGCESKAALAAFTAACVGKESCTVRHTEDFR-AGSGCDSGV---LTVQATC 830
>gi|293332691|ref|NP_001168270.1| beta-galactosidase precursor [Zea mays]
gi|223947135|gb|ACN27651.1| unknown [Zea mays]
gi|414880417|tpg|DAA57548.1| TPA: beta-galactosidase [Zea mays]
Length = 822
Score = 671 bits (1730), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/836 (43%), Positives = 493/836 (58%), Gaps = 50/836 (5%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ VTY+ RALVIDG+RR++ SGSIHYPRSTP++WP+LI K+KEGGL IETYVFWN HE
Sbjct: 20 ATTVTYNDRALVIDGQRRIILSGSIHYPRSTPQMWPDLINKAKEGGLNTIETYVFWNGHE 79
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P R QY FEG +D++RF K +Q AG+ LRIGPY C EWNYGG P WL IPG+QFR
Sbjct: 80 PRRRQYNFEGSYDIIRFFKEIQNAGMHAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLH 139
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKW 179
N PF+ EM+ F I++ MK N+FA QGGPIILAQ+ENEYGN+ + Y+ W
Sbjct: 140 NAPFEREMETFTTLIVNKMKDVNMFAGQGGPIILAQIENEYGNIMGQLKNNQSASQYIHW 199
Query: 180 AADTAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLS 238
AD A VPW+MCQQ+ D P +INTCNGFYC + PN P +WTEN++GWF +
Sbjct: 200 CADMANKQEVGVPWIMCQQDNDVPHNVINTCNGFYCHDWFPNRTGIPKIWTENWTGWFKA 259
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDE 298
+ R ED+AFAVA FF+ G+ NYYMY GGTNFGRT+GGP + TSYDYDAP+DE
Sbjct: 260 WDKPDFHRSAEDIAFAVAMFFQKRGSVHNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDE 319
Query: 299 YGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY-HKSSNDCAAFLANY 357
YG IRQPK+GHL++LH I+ E+ L+ G + Y + S+ C F+ N
Sbjct: 320 YGNIRQPKYGHLKDLHDLIRSMEKILVHGKYNDTSYGKNVTVTKYMYGGSSVC--FINNQ 377
Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLL 417
D VT G + +PAWSVSILP+CK V +NTAK+ +Q + ++ N E
Sbjct: 378 FVDRDMKVTLGGETHLVPAWSVSILPNCKTVAYNTAKIKTQTS----VMVKKANSVEKEP 433
Query: 418 ASSAFSWYEEKVG--ISGNR-SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFL 474
+ +SW E + ++ +R SF + L EQI T+ D SDYLWY S+ G+G L
Sbjct: 434 ETMRWSWMPENLKPFMTDHRGSFRQSQLLEQIATSTDQSDYLWYRTSLE-HKGEGSYT-L 491
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
+ + GH FVN +LV + F + ++L+ G N + +LS VGL+NYG
Sbjct: 492 YVNTSGHEMYAFVNGRLVGQNHSADGAFVFQLQSPVKLHSGKNYVSLLSGTVGLKNYGPS 551
Query: 535 FDVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSF-WK-QGS 590
F++ AG+ V L+ DL+ W Y+ G+ GE L +I L + W+
Sbjct: 552 FELVPAGIAGGPVKLVGTNGTAIDLTKSSWSYKSGLAGE---LRQIHLDKPGYKWQSHNG 608
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
T+PVN+ WYKTTF AP G+ + ++L + KG AWVNG S+GRYW +Y A
Sbjct: 609 TIPVNRPFTWYKTTFEAPAGEEAVVVDLLGLNKGVAWVNGNSLGRYWPSYTAAEMPGCHV 668
Query: 651 CDYRGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLT 705
CDYRG + D +C CG+PAQ YH+PR+++ GE N L++ EE GGDP++ + T
Sbjct: 669 CDYRGKFIAEGDGIRCLTGCGEPAQRFYHVPRSFLRAGEPNTLILFEEAGGDPTRAAFHT 728
Query: 706 KTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGNC 764
+C + V V L+C G +A+++ AS+G+ G+C
Sbjct: 729 VAVGPVC-----------------VAAVELGDDVTLSCGGHGRVVASVDVASFGVARGSC 771
Query: 765 GSFRPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G+++ G L ACVG+ C++ ++A+ G AG G AL V+A CS
Sbjct: 772 GAYKGGCESKAALKAFTDACVGRESCTVKYTAAFAG--AGCQSG---ALTVQATCS 822
>gi|413925747|gb|AFW65679.1| hypothetical protein ZEAMMB73_601729 [Zea mays]
Length = 846
Score = 670 bits (1728), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/832 (42%), Positives = 496/832 (59%), Gaps = 50/832 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V+YD R+L+IDG+R + SGSIHYPRS P++WPELI K+KEGGL IETY+FWN HEP +
Sbjct: 41 VSYDRRSLIIDGRREIFFSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYIFWNIHEPEK 100
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
GQ+ FEGR+D+VRF K +QE ++ +R+GP+ AEWN+GG P WL IP I FRT N P
Sbjct: 101 GQFDFEGRYDIVRFFKLIQEHNMYAMVRLGPFIQAEWNHGGLPYWLREIPDIVFRTNNEP 160
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
+K M+ F+ II +K NLFASQGGPIILAQ+ENEY ++E A+ G Y+KWAA+ A
Sbjct: 161 YKMHMETFVKIIIKRLKDANLFASQGGPIILAQIENEYQHLEAAFKNDGTKYIKWAANMA 220
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFLSFGY 241
++ N +PW+MC+Q AP +I TCNG C G T P + S P++WTEN++ + FG
Sbjct: 221 ISTNVGIPWIMCKQTKAPSDVIPTCNGRNC-GDTWPGPMNKSMPLLWTENWTAQYRVFGD 279
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
R ED+AFAVARFF GGT NYYMY GGTNFGRT+ ++ YD +AP+DE+G
Sbjct: 280 PPSQRSAEDIAFAVARFFSVGGTMTNYYMYHGGTNFGRTSAAFVMPKYYD-EAPLDEFGL 338
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSS 360
++PKWGHLR+LH A+KLC++ L+ + +KLG + EA ++ C AFL+N+++
Sbjct: 339 YKEPKWGHLRDLHLALKLCKKALLWGKTSTEKLGKQFEARVFEIPEQKVCVAFLSNHNTK 398
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
D +TF G YF+P S+SIL DCK VVF T V +Q N FA Q N +
Sbjct: 399 DDVTLTFRGQSYFVPRHSISILADCKTVVFGTQHVNAQHNQRTFHFADQTTQNNVWQM-- 456
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MP-GQGKEVFLN 475
+ EEKV + N TKD +DY+WYT+S + MP + + L
Sbjct: 457 ---FDEEKVPKYKQSKIRLRKAGDLYNLTKDKTDYVWYTSSFKLEADDMPIRRDIKTVLE 513
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ S GHA++ FVN K V G+G F + K ++L +G+N + +L+ +G+ + GA+
Sbjct: 514 VNSHGHASVAFVNTKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAVLASTMGMMDSGAYL 573
Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
+ AG+ V + L G DL++ W + VG+ GE + S WK +
Sbjct: 574 EHRLAGVDRVQIKGLNAGTLDLTNNGWGHIVGLVGEQKQIYTDKGMGSVTWKPAVN---D 630
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
+ L WYK F P G+ P+ L++++MGKG +VNGQ IGRYW +Y
Sbjct: 631 RPLTWYKRHFDMPSGEDPIVLDMSTMGKGLMFVNGQGIGRYWISY--------------- 675
Query: 656 SYDASKCQKHC-GQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
KH G+P+Q LYHIPR+++ +N+LV+ EE G P I +LT +IC+F
Sbjct: 676 --------KHALGRPSQQLYHIPRSFLRQKDNVLVLFEEEFGRPDAIMILTVKRDNICTF 727
Query: 715 VSEADPPPVDSWKPNLGVVSSS-----PQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
+SE +P + SW+ ++ + P+ L C I + FASYG P G CG++
Sbjct: 728 ISERNPAHIKSWERKDSQITVTAADLKPRATLTCSPKKLIQQVVFASYGNPMGICGNYTI 787
Query: 770 GACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G+CH +V+KAC+G+ C++PVS+ G CPG LAV+A CS
Sbjct: 788 GSCHTPRAKELVEKACLGKRICTLPVSADVYGGDVN-CPGTTATLAVQAKCS 838
>gi|22328945|ref|NP_194344.2| beta-galactosidase 12 [Arabidopsis thaliana]
gi|20466292|gb|AAM20463.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|23198118|gb|AAN15586.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332659763|gb|AEE85163.1| beta-galactosidase 12 [Arabidopsis thaliana]
Length = 636
Score = 669 bits (1725), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 320/610 (52%), Positives = 432/610 (70%), Gaps = 21/610 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ A VTYD +A++I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN H
Sbjct: 25 VKAIVTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGH 84
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP GQYYFE R+DLV+F+K VQ+AGL++HLRIGPY CAEWN+GGFPVWL ++PG+ FRT
Sbjct: 85 EPSPGQYYFEDRYDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGMVFRT 144
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M++F KI+ +MK+E LF +QGGPIIL+Q+ENEYG +EW G G+ Y KW
Sbjct: 145 DNEPFKAAMQKFTEKIVRMMKEEKLFETQGGPIILSQIENEYGPIEWEIGAPGKAYTKWV 204
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A+ A L+T VPW+MC+Q+DAP+ IINTCNGFYC+ F PNS +KP MWTEN++GWF FG
Sbjct: 205 AEMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNSDNKPKMWTENWTGWFTEFG 264
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
AVP+RP ED+A +VARF + GG+F NYYMY GGTNF RTA G +ATSYDYDAP+DEYG
Sbjct: 265 GAVPYRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDRTA-GEFIATSYDYDAPLDEYG 323
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
R+PK+ HL+ LHK IKLCE L+S+DPT LG K EAH++ KS + CAAFL+NY++S
Sbjct: 324 LPREPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQEAHVF-KSKSSCAAFLSNYNTS 382
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V F G+ Y LP WSVSILPDCK +NTAKV R + H +++ ++
Sbjct: 383 SAARVLFGGSTYDLPPWSVSILPDCKTEYYNTAKV---RTSSIH--------MKMVPTNT 431
Query: 421 AFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ----GKEVF 473
FSW Y E++ + N +F + L EQI+ T+D +DY WY I + P + G++
Sbjct: 432 PFSWGSYNEEIPSANDNGTFSQDGLVEQISITRDKTDYFWYLTDITISPDEKFLTGEDPL 491
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L I S GHA VFVN +L YG+ + ++KI+L+ G+N L +LS GL N G
Sbjct: 492 LTIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVNKLALLSTAAGLPNVGV 551
Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
++ G+ V L + +G D++ +W Y++G +GE + + ++ +++ WK+GS +
Sbjct: 552 HYETWNTGVLGPVTLNGVNSGTWDMTKWKWSYKIGTKGEALSVHTLAGSSTVEWKEGSLV 611
Query: 593 PVNKSLIWYK 602
+ L WYK
Sbjct: 612 AKKQPLTWYK 621
>gi|357142911|ref|XP_003572734.1| PREDICTED: beta-galactosidase 1-like [Brachypodium distachyon]
Length = 831
Score = 669 bits (1725), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/833 (43%), Positives = 485/833 (58%), Gaps = 51/833 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V+YD RALVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+K+GGL IETYVFWN HEP
Sbjct: 33 VSYDERALVIDGQRRIILSGSIHYPRSTPEMWPDLIQKAKDGGLNTIETYVFWNGHEPRP 92
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
QY FEG +D++RF K VQ+AG++ LRIGPY C EWNYGG P WL IP +QFR N P
Sbjct: 93 RQYNFEGNYDIMRFFKEVQKAGMYAILRIGPYICGEWNYGGLPAWLRDIPDMQFRLHNEP 152
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVE--WAYGVGGELYVKWAAD 182
F+ EM+ F I++ MK N+FA QGGPIIL Q+ENEYGNV+ Y+ W AD
Sbjct: 153 FEREMETFTTLIVNKMKDANMFAGQGGPIILTQIENEYGNVQSNLPDQESATKYIHWCAD 212
Query: 183 TAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A N VPW+MCQQ D P +I TCNGFYC F P + P +WTEN++GWF ++
Sbjct: 213 MANKQNVGVPWIMCQQSNDVPPNVIETCNGFYCHDFKPKGSNMPKIWTENWTGWFKAWDK 272
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
RP ED+A+AVA FF+ G+ QNYYMY GGTNFGRT+GGP + T+YDYDAP+DEYG
Sbjct: 273 PDYHRPAEDVAYAVAMFFQNRGSVQNYYMYHGGTNFGRTSGGPYITTTYDYDAPLDEYGN 332
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
IRQPK+GHL+ LH + E++L+ L K++A Y A F++N +
Sbjct: 333 IRQPKYGHLKALHTVLTSMEKHLVYGQQNETNLDDKVKATKYTLDDGSSACFISNSHDNK 392
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
D NVTF G+ Y +PAWSVS+LPDCK V +NTAKV +Q + ++++ + L S
Sbjct: 393 DVNVTFEGSAYQVPAWSVSVLPDCKTVAYNTAKVKTQTS----VMVKKESAAKGGLKWSW 448
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVF-LNIESLG 480
+ SF +L EQI T D SDYLWY S+ P KE F L + + G
Sbjct: 449 LPEFLRPSFTDSYGSFKSNELLEQIVTGADESDYLWYKTSLTRGP---KEQFTLYVNTTG 505
Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
H FVN +L + + + F + L G N + +LS VGL+NYGA F++ A
Sbjct: 506 HELYAFVNGELAGYKHAVNGPYLFQFEAPVTLKPGKNYISLLSATVGLKNYGASFELMPA 565
Query: 541 GLFS--VILIDLKNGKRDLSSGEWIYQVGVEGE--YIGLDKISLANSSFWKQGSTLPVNK 596
G+ V L+ DLS+ W Y+ G+ GE I LDK L S F +P N+
Sbjct: 566 GIVGGPVKLVSAHGNTIDLSNNTWTYKTGLFGEQKQIHLDKPGLRWSPF-----AVPTNR 620
Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
WYK TF AP G + ++L + KG +VNG ++GRYW +Y+A +CDYRG
Sbjct: 621 PFTWYKATFQAPAGTEAVVVDLVGLNKGVVYVNGHNLGRYWPSYVAGDMDGCHRCDYRGE 680
Query: 657 Y----DASKCQKHCGQPAQTLYHIPRTWV---HPGENLLVIHEELGGDPSKISLLTKTGQ 709
Y + KC CG+ Q YH+PR+++ H N +V+ EE GGDP+K++ T
Sbjct: 681 YVTWNNQEKCLTGCGEVGQRFYHVPRSFLNAAHGAPNTVVLFEEAGGDPAKVNFRTVAVG 740
Query: 710 HICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
+C+ + D V LAC G I++++ AS+G+ G CG++
Sbjct: 741 PVCADAEKGD------------------AVTLACAHGRTISSVDTASFGVSGGQCGAYEG 782
Query: 770 GA-CHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G+ C L + ACVG+ C++ + A+ + C G L V+A CS
Sbjct: 783 GSGCESKPALEAITAACVGKKWCTVSYTDAF---DSADCKG-SGVLTVQATCS 831
>gi|22329897|ref|NP_683341.1| beta-galactosidase 15 [Arabidopsis thaliana]
gi|332193266|gb|AEE31387.1| beta-galactosidase 15 [Arabidopsis thaliana]
Length = 786
Score = 667 bits (1722), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/831 (42%), Positives = 490/831 (58%), Gaps = 105/831 (12%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V++D RA+ IDG RRVL SGSIHYPRST E+WP+LI+K KEG L+ IETYVFWN HEP R
Sbjct: 45 VSHDGRAITIDGHRRVLLSGSIHYPRSTTEMWPDLIKKGKEGSLDAIETYVFWNAHEPTR 104
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
QY F G DL+RF+KT+Q G++ LRIGPY CAEWNYGGFPVWLH +PG++FRTTN
Sbjct: 105 RQYDFSGNLDLIRFLKTIQNEGMYGVLRIGPYVCAEWNYGGFPVWLHNMPGMEFRTTNTA 164
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
F EM+ F I++++K+E LFASQGGPIILAQ+ENEYGNV +YG G+ Y++W A+ A
Sbjct: 165 FMNEMQNFTTMIVEMVKKEKLFASQGGPIILAQIENEYGNVIGSYGEAGKAYIQWCANMA 224
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
+L+ VPW+MCQQ+DAP P++NTCNG+YCD F+PN+P+ P MWTEN++GW+ ++G P
Sbjct: 225 NSLDVGVPWIMCQQDDAPQPMLNTCNGYYCDNFSPNNPNTPKMWTENWTGWYKNWGGKDP 284
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
R ED+AFAVARFF+ GTFQNYYMY GGTNF RTAGGP + T+YDYDAP+DE+G + Q
Sbjct: 285 HRTTEDVAFAVARFFQKEGTFQNYYMYHGGTNFDRTAGGPYITTTYDYDAPLDEFGNLNQ 344
Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
PK+GHL++LH + E+ L + + G + A +Y ++ + F+ N + +SDA
Sbjct: 345 PKYGHLKQLHDVLHAMEKTLTYGNISTVDFGNLVTATVY-QTEEGSSCFIGNVNETSDAK 403
Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
+ F G Y +PAWSVSILPDCK +NTAK+ +Q + ++ N E ++ +SW
Sbjct: 404 INFQGTSYDVPAWSVSILPDCKTETYNTAKINTQTS----VMVKKANEAENEPSTLKWSW 459
Query: 425 YEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM---PGQGKEVFLNIES 478
E + + G L +Q + D SDYLWY ++++ P GK + L I S
Sbjct: 460 RPENIDSVLLKGKGESTMRQLFDQKVVSNDESDYLWYMTTVNLKEQDPVLGKNMSLRINS 519
Query: 479 LGHAALVFVNKKLVAFGYGNHDFAN----FLINKKIELNEGINTLDILSMMVGLQNYGAW 534
H FVN + + GN+ N ++ + + N G N + +LS+ VGL NYGA+
Sbjct: 520 TAHVLHAFVNGQHI----GNYRVENGKFHYVFEQDAKFNPGANVITLLSITVGLPNYGAF 575
Query: 535 FDVAGAGLFSVILIDLKNGK----RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
F+ AG+ + I +NG +DLS+ +W Y+ G+ G N F +
Sbjct: 576 FENFSAGITGPVFIIGRNGDETIVKDLSTHKWSYKTGLSG---------FENQLFSSES- 625
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
+T+ AP G P+ ++L +GKG AW+NG +IGRYW A+L+
Sbjct: 626 -----------PSTWSAPLGSEPVVVDLLGLGKGTAWINGNNIGRYWPAFLSDI------ 668
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
G+N LV+ EE+GG+PS ++ T
Sbjct: 669 --------------------------------DGDNTLVLFEEIGGNPSLVNFQTIGVGS 696
Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
+C+ V E + + L+C G I+AI FAS+G P G+CGSF G
Sbjct: 697 VCANVYEKNV------------------LELSC-NGKPISAIKFASFGNPGGDCGSFEKG 737
Query: 771 ACHM--DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
C + I+ + CVG+ +CSI VS G A C L K LAVEA C
Sbjct: 738 TCEASNNAAAILTQECVGKEKCSIDVSEDKFG--AAECGALAKRLAVEAIC 786
>gi|11079481|gb|AAG29193.1|AC078898_3 beta-galactosidase, putative [Arabidopsis thaliana]
Length = 780
Score = 667 bits (1721), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/831 (45%), Positives = 490/831 (58%), Gaps = 73/831 (8%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
ANVTYD R+L+IDG+ ++L SGSIHY RSTP++WP LI K+K GG++V++TYVFWN HEP
Sbjct: 10 ANVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVVDTYVFWNVHEP 69
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
+GQ+ F G D+V+F+K V+ GL++ LRIGP+ EW+YGG P WLH + GI FRT N
Sbjct: 70 QQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGIVFRTDN 129
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK MKR+ I+ LMK ENL+ASQGGPIIL+Q+ENEYG V A+ G+ YVKW A
Sbjct: 130 EPFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVGRAFRQEGKSYVKWTAK 189
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFG 240
AV L+T VPWVMC+Q+DAPDP++N CNG C + F PNSP+KP +WTEN++
Sbjct: 190 LAVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSL----- 244
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
ED+AF VA F G+F NYYMY GGTNFGR A V TSY AP+DEYG
Sbjct: 245 ------SAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVITSYYDQAPLDEYG 297
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+RQPKWGHL+ELH A+KLCEE L+S T LG A ++ K +N CAA L N D
Sbjct: 298 LLRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFGKKANLCAAILVNQD-K 356
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
++ V F + Y L SVS+LPDCKNV FNTAKV +Q N + + + L +
Sbjct: 357 CESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYN------TRTRKARQNLSSPQ 410
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
+ + E V S L E +NTT+DTSDYLW T +G L + LG
Sbjct: 411 MWEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQTTRFQ--QSEGAPSVLKVNHLG 468
Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
HA FVN + + +G FL+ K + LN G N L +LS+MVGL N GA +
Sbjct: 469 HALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLALLSVMVGLPNSGAHLERRVV 528
Query: 541 GLFSVILIDLKNGKRDL--SSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
G SV + NG+ L ++ W YQVG++GE + + WKQ ++ L
Sbjct: 529 GSRSV---KIWNGRYQLYFNNYSWGYQVGLKGEKFHVYTEDGSAKVQWKQYRD-SKSQPL 584
Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
WYK +F PEG+ P+ALNL SMGKG+AWVNGQSI + +Y
Sbjct: 585 TWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIAMF--SYFR---------------- 626
Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI-HEELGGDPSKISLLTKTGQHICSFVSE 717
YHIPR+++ P NLLVI EE G+P I++ T + +C VS
Sbjct: 627 ---------------YHIPRSFLKPNSNLLVILEEEREGNPLGITIDTVSVTEVCGHVSN 671
Query: 718 ADPPPVDS------WKPNLGV-VSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
+P PV S + NL P+V+L C G I+ I FAS+G P G+CGS+ G
Sbjct: 672 TNPHPVISPRKKGLNRKNLTYRYDRKPKVQLQCPTGRKISKILFASFGTPNGSCGSYSIG 731
Query: 771 ACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+CH + L +VQKAC+ + CS+PV S G +CP +K+L V A CS
Sbjct: 732 SCHSPNSLAVVQKACLKKSRCSVPVWSKTFG--GDSCPHTVKSLLVRAQCS 780
>gi|242057631|ref|XP_002457961.1| hypothetical protein SORBIDRAFT_03g023500 [Sorghum bicolor]
gi|241929936|gb|EES03081.1| hypothetical protein SORBIDRAFT_03g023500 [Sorghum bicolor]
Length = 830
Score = 666 bits (1719), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/836 (43%), Positives = 491/836 (58%), Gaps = 58/836 (6%)
Query: 7 YDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQ 66
Y+ RA+VIDG+RR++ SGSIHYPRSTP++WP+LI K+KEGGL IETYVFWN HEP R Q
Sbjct: 30 YNDRAVVIDGQRRIILSGSIHYPRSTPQMWPDLINKAKEGGLNTIETYVFWNGHEPRRRQ 89
Query: 67 YYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFK 126
Y FEG +D+VRF K +Q AG+ LRIGPY C EWNYGG P WL IPG+QFR N+PF+
Sbjct: 90 YNFEGNYDIVRFFKEIQNAGMHAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNDPFE 149
Query: 127 EEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTA 184
EM+ F I++ MK N+FA QGGPIILAQ+ENEYGN+ + Y+ W AD A
Sbjct: 150 REMETFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGKLENNQSASQYIHWCADMA 209
Query: 185 VNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
VPW+MCQQ+ D P +INTCNGFYC + PN P +WTEN++GWF ++
Sbjct: 210 NKQKIGVPWIMCQQDNDVPHNVINTCNGFYCYDWFPNRTGIPKIWTENWTGWFKAWDKPD 269
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
R ED+AFAVA FF+ G+ NYYMY GGTNFGRT+GGP + TSYDYDAP+DEYG IR
Sbjct: 270 FHRSAEDIAFAVAMFFQKRGSVHNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNIR 329
Query: 304 QPKWGHLRELHKAIKLCEEYLIS---SDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
QPK+GHL++LH +K E+ L+ D +H K + + Y SS F++N
Sbjct: 330 QPKYGHLKDLHNLLKSMEKILVHGEYKDTSHGK-NVTVTKYTYGGSS---VCFISNQFDD 385
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
D NVT G + +PAWSVSILPDCK V +NTAK+ +Q + ++ N E +
Sbjct: 386 RDVNVTLAG-THLVPAWSVSILPDCKTVAYNTAKIKTQTS----VMVKKANSVEKEPEAL 440
Query: 421 AFSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIE 477
+SW E + + SF + L EQI T+ D SDYLWY S+ G+G L +
Sbjct: 441 RWSWMPENLKPFMTDDHGSFRQSRLLEQIATSTDQSDYLWYRTSLE-HKGEGSYT-LYVN 498
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
+ GH FVN KLV ++ F + ++L+ G N + +LS VGL+NYG F++
Sbjct: 499 TTGHKIYAFVNGKLVGQNQSSNGAFVFQLQSPVKLHSGKNYVSLLSGTVGLKNYGPLFEL 558
Query: 538 AGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQGSTLP 593
AG+ V L+ + DL+ W Y+ G+ GE+ I LDK S GS +P
Sbjct: 559 VPAGIAGGPVKLVGANDTAIDLTHSSWSYKSGLAGEHRQIHLDKPGYKWRSHNGSGS-IP 617
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPST-GCTKKCD 652
VN+ WYKTTF AP G + ++L + KG AWVNG S+GRYW +Y A GC CD
Sbjct: 618 VNRPFTWYKTTFAAPAGDEAVVVDLLGLNKGAAWVNGNSLGRYWPSYTAAEMGGCHGACD 677
Query: 653 YRGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLTKT 707
YRG + D +C CG+P+Q YH+PR+++ GE N LV+ EE GGDP++ + T
Sbjct: 678 YRGKFKAEGDGIRCLTGCGEPSQRFYHVPRSFLRAGEPNTLVLFEEAGGDPARAAFHTVA 737
Query: 708 GQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWH---IAAINFASYGIPEGNC 764
H+C +E V L+C G +A+++ AS+G+ G C
Sbjct: 738 VGHVCVAAAEV-----------------GDDVTLSCGGGLGGGVVASVDVASFGVTRGGC 780
Query: 765 GSFRPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKA-LAVEAHC 819
G ++ G L + ACVG+ C++ + A+ G PG L V+A C
Sbjct: 781 GDYQGGCESKAALKAFRDACVGRESCTVKYTPAFAG------PGCQSGKLTVQATC 830
>gi|45758292|gb|AAS76480.1| beta-galactosidase [Gossypium hirsutum]
Length = 843
Score = 666 bits (1718), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/832 (42%), Positives = 496/832 (59%), Gaps = 53/832 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD R+L+I+GKR +L SG+IHYPRSTP++WP+LI+K+K+GG+ IETYVFWN HEP+
Sbjct: 49 VTYDARSLIINGKRELLFSGAIHYPRSTPDMWPDLIKKAKQGGINAIETYVFWNGHEPVE 108
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
GQY FEG FDLV+F+K + E L+ +R+GP+ AEWN+GG P WL +PGI FR+ N P
Sbjct: 109 GQYNFEGEFDLVKFIKLIHEHKLYAVVRVGPFIQAEWNHGGLPYWLREVPGIIFRSDNEP 168
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK+ MKRF+ I+D +KQE LFA QGGPIILAQ+ENEY ++ A+ G+ YV+WA A
Sbjct: 169 FKKHMKRFVTLIVDKLKQEKLFAPQGGPIILAQIENEYNTIQRAFREKGDSYVQWAGKLA 228
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDG--FTPNSPSKPIMWTENYSGWFLSFGYA 242
++LN +VPW+MC+Q DAPDPIINTCNG +C + PN +KP +WTEN++ + FG
Sbjct: 229 LSLNANVPWIMCKQRDAPDPIINTCNGRHCGDTFYGPNKRNKPALWTENWTAQYRVFGDP 288
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
R EDLA++VARFF G+ NYYM++GGTNFGRT+ T Y + P+DE+G
Sbjct: 289 PSQRSAEDLAYSVARFFSKNGSMVNYYMHYGGTNFGRTSAS-FTTTRYYDEGPLDEFGLQ 347
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYDSSS 361
R+PKWGHL+++H+A+ LC+ L PT KLG +A ++ + ++ CAAFLAN ++
Sbjct: 348 REPKWGHLKDVHRALSLCKRALFWGFPTTLKLGPDQQAIVWQQPGTSACAAFLANNNTRL 407
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
+V F G LPA S+S+LPDCK VVFNT V +Q N+ +N +A+
Sbjct: 408 AQHVNFRGQDIRLPARSISVLPDCKTVVFNTQLVTTQHNS--------RNFVRSEIANKN 459
Query: 422 FSWY--EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MP-GQGKEVFL 474
F+W E + F P E + TKDT+DY WYT S+ + +P + L
Sbjct: 460 FNWEMCREVPPVGLGFKFDVP--RELFHLTKDTTDYAWYTTSLLLGRRDLPMKKNVRPVL 517
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
+ SLGH +VN + +G+ +F++ + + L EG N + +L +VGL + GA+
Sbjct: 518 RVASLGHGIHAYVNGEYAGSAHGSKVEKSFVLQRAVSLKEGENHIALLGYLVGLPDSGAY 577
Query: 535 FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+ AG S+ ++ L G D+S W +QVG++GE L + S W +
Sbjct: 578 MEKRFAGPRSITILGLNTGTLDISQNGWGHQVGIDGEKKKLFTEEGSKSVQWTKPDQ--- 634
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
L WYK F APEG P+A+ + MGKG WVNG+SIGRYW+ YL+P
Sbjct: 635 GGPLTWYKGYFDAPEGDNPVAIVMTGMGKGMVWVNGRSIGRYWNNYLSP----------- 683
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
+P Q+ YHIPR ++ P +NL+V+ EE GG+P + ++T ICS
Sbjct: 684 -----------LKKPTQSEYHIPRAYLKP-KNLIVLLEEEGGNPKDVHIVTVNRDTICSA 731
Query: 715 VSEADPPPVDSWKPNLGVVSS-----SPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
VSE PP ++ G + + P+ L C I A+ FASYG P G CG++
Sbjct: 732 VSEIHPPSPRLFETKNGSLQAKVNDLKPRAELKCPGKKQIVAVEFASYGDPFGACGAYFI 791
Query: 770 GACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G C + +V+K C+G+ C IP+ S AC L K LAV+ C+
Sbjct: 792 GNCTAPESKQVVEKYCLGKPSCQIPLDSIPFSNQNDACTHLRKTLAVQLKCA 843
>gi|75141878|sp|Q7XFK2.1|BGL14_ORYSJ RecName: Full=Beta-galactosidase 14; Short=Lactase 14; Flags:
Precursor
gi|15451595|gb|AAK98719.1|AC090483_9 Putative beta-galactosidase [Oryza sativa Japonica Group]
gi|31431327|gb|AAP53122.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 808
Score = 664 bits (1712), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/836 (43%), Positives = 488/836 (58%), Gaps = 79/836 (9%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V+YD R+L++DG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL IETYVFWN HEP R
Sbjct: 31 VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
++ FEG +D+VRF K +Q AG++ LRIGPY C EWNYGG PVWL IPGI+FR N P
Sbjct: 91 REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPVWLRDIPGIKFRLHNKP 150
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYG-------NVEWAYGVGGELYV 177
F+ M+ F I+ MK N+FA QGGPIILAQ+ENEYG N++ A+ Y+
Sbjct: 151 FENGMEAFTTLIVKKMKDANMFAGQGGPIILAQIENEYGYTMLQPENIQSAHE-----YI 205
Query: 178 KWAADTAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWF 236
W AD A N VPW+MCQQ+ D P ++NTCNGFYC + N S P MWTEN++GW+
Sbjct: 206 HWCADMANKQNVGVPWIMCQQDNDVPPNVVNTCNGFYCHEWFSNRTSIPKMWTENWTGWY 265
Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPI 296
+ RP ED+AFAVA FF+ G+ QNYYMY GGTNFGRTAGGP + TSYDYDAP+
Sbjct: 266 RDWDQPEFRRPTEDIAFAVAMFFQMRGSLQNYYMYHGGTNFGRTAGGPYITTSYDYDAPL 325
Query: 297 DEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLAN 356
DEYG +RQPK+GHL+ELH + E+ L+ D G + Y ++ A F+ N
Sbjct: 326 DEYGNLRQPKYGHLKELHSVLMSMEKILLHGDYIDTNYGDNVTVTKYTLNATS-ACFINN 384
Query: 357 YDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELL 416
D NVT +G +FLPAWSVSILP+CK V FN+AK+ +Q + ++ E
Sbjct: 385 RFDDRDVNVTLDGTTHFLPAWSVSILPNCKTVAFNSAKIKTQTT----VMVNKTSMVEQQ 440
Query: 417 LASSAFSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVF 473
+SW E + +F + +L EQI TT D SDYLWY S+ G+G V
Sbjct: 441 TEHFKWSWMPENLRPFMTDEKGNFRKNELLEQIVTTTDQSDYLWYRTSLE-HKGEGSYV- 498
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L + + GH FVN KLV Y ++ F + NYG
Sbjct: 499 LYVNTTGHELYAFVNGKLVGQQYSPNENFTFQLKSP--------------------NYGG 538
Query: 534 WFDVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQG 589
F++ AG+ V LID DLS+ W Y+ G+ GEY I LDK + +
Sbjct: 539 SFELLPAGIVGGPVKLIDSSGSAIDLSNNSWSYKAGLAGEYRKIYLDK---PGNKWRSHN 595
Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
ST+P+N+ WYKTTF AP G+ + ++L + KG AWVNG S+GRYW +Y+A
Sbjct: 596 STIPINRPFTWYKTTFQAPAGEDSVVVDLHGLNKGVAWVNGNSLGRYWPSYVAADMPGCH 655
Query: 650 KCDYRGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLL 704
CDYRG + +A KC CG+P+Q LYH+PR++++ GE N L++ EE GGDPS++++
Sbjct: 656 HCDYRGVFKAEVEAQKCLTGCGEPSQQLYHVPRSFLNKGEPNTLILFEEAGGDPSEVAVR 715
Query: 705 TKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGN 763
T +C+ D V L+C G I++++ AS+G+ G
Sbjct: 716 TVVEGSVCASAEVGD------------------TVTLSCGAHGRTISSVDVASFGVARGR 757
Query: 764 CGSFRPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
CGS+ G ACVG+ C++ V+ A+ +AG G+ L V+A C
Sbjct: 758 CGSYDGGCESKVAYDAFAAACVGKESCTVLVTDAF--ANAGCVSGV---LTVQATC 808
>gi|115477689|ref|NP_001062440.1| Os08g0549200 [Oryza sativa Japonica Group]
gi|75136208|sp|Q6ZJJ0.1|BGL11_ORYSJ RecName: Full=Beta-galactosidase 11; AltName: Full=Lactase 115;
Flags: Precursor
gi|42407808|dbj|BAD08952.1| putative glycosyl hydrolase family 35 (beta-galactosidase) [Oryza
sativa Japonica Group]
gi|113624409|dbj|BAF24354.1| Os08g0549200 [Oryza sativa Japonica Group]
Length = 848
Score = 659 bits (1701), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/842 (42%), Positives = 491/842 (58%), Gaps = 63/842 (7%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+TYD R+L+IDG R + SGSIHYPRS P+ WP+LI K+KEGGL VIE+YVFWN HEP +
Sbjct: 33 ITYDRRSLIIDGHREIFFSGSIHYPRSPPDTWPDLISKAKEGGLNVIESYVFWNGHEPEQ 92
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G Y FEGR+DL++F K +QE ++ +RIGP+ AEWN+GG P WL IP I FRT N P
Sbjct: 93 GVYNFEGRYDLIKFFKLIQEKEMYAIVRIGPFVQAEWNHGGLPYWLREIPDIIFRTNNEP 152
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK+ MK+F+ I++ +K+ LFASQGGPIILAQ+ENEY ++E A+ G Y+ WAA A
Sbjct: 153 FKKYMKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWAAKMA 212
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFLSFGY 241
+ NT VPW+MC+Q AP +I TCNG +C G T P KP++WTEN++ + FG
Sbjct: 213 IATNTGVPWIMCKQTKAPGEVIPTCNGRHC-GDTWPGPADKKKPLLWTENWTAQYRVFGD 271
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
R ED+AF+VARFF GGT NYYMY GGTNFGR ++ YD +AP+DE+G
Sbjct: 272 PPSQRSAEDIAFSVARFFSVGGTMANYYMYHGGTNFGRNGAAFVMPRYYD-EAPLDEFGL 330
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYH-KSSNDCAAFLANYDSS 360
++PKWGHLR+LH A++ C++ L+ +P+ Q LG EA ++ K N C AFL+N+++
Sbjct: 331 YKEPKWGHLRDLHHALRHCKKALLWGNPSVQPLGKLYEARVFEMKEKNVCVAFLSNHNTK 390
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQ---QKNVNELLL 417
D VTF G YF+ S+SIL DCK VVF+T V SQ N FA Q NV E+
Sbjct: 391 EDGTVTFRGQKYFVARRSISILADCKTVVFSTQHVNSQHNQRTFHFADQTVQDNVWEM-- 448
Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGKEV- 472
+ EEK+ S EQ N TKD +DYLWYT S + +P + KEV
Sbjct: 449 ------YSEEKIPRYSKTSIRTQRPLEQYNQTKDKTDYLWYTTSFRLETDDLPYR-KEVK 501
Query: 473 -FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
L + S GHA + FVN V G+G F + K ++L G+N + ILS +GL +
Sbjct: 502 PVLEVSSHGHAIVAFVNDAFVGCGHGTKINKAFTMEKAMDLKVGVNHVAILSSTLGLMDS 561
Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
G++ + AG+++V + L G DL++ W + VG++GE + + WK G
Sbjct: 562 GSYLEHRMAGVYTVTIRGLNTGTLDLTTNGWGHVVGLDGERRRVHSEQGMGAVAWKPGKD 621
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
N+ L WY+ F P G P+ ++L MGKG +VNG+ +GRYW +Y
Sbjct: 622 ---NQPLTWYRRRFDPPSGTDPVVIDLTPMGKGFLFVNGEGLGRYWVSY----------- 667
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
G+P+Q LYH+PR+ + P N L+ EE GG P I +LT +I
Sbjct: 668 -----------HHALGKPSQYLYHVPRSLLRPKGNTLMFFEEEGGKPDAIMILTVKRDNI 716
Query: 712 CSFVSEADPPPVD-SWKPN-----------LGVVSSSPQVRLACERGWHIAAINFASYGI 759
C+F++E +P V SW+ G P L+C I ++ FASYG
Sbjct: 717 CTFMTEKNPAHVRWSWESKDSQPKAVAGAGAGAGGLKPTAVLSCPTKKTIQSVVFASYGN 776
Query: 760 PEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAH 818
P G CG++ G+CH +V+KAC+G+ CS+ VSS G CPG LAV+A
Sbjct: 777 PLGICGNYTVGSCHAPRTKEVVEKACIGRKTCSLVVSSEVYGGDV-HCPGTTGTLAVQAK 835
Query: 819 CS 820
CS
Sbjct: 836 CS 837
>gi|222635782|gb|EEE65914.1| hypothetical protein OsJ_21762 [Oryza sativa Japonica Group]
Length = 579
Score = 659 bits (1699), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 317/569 (55%), Positives = 398/569 (69%), Gaps = 20/569 (3%)
Query: 6 TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
TYDHR+L I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HEP++G
Sbjct: 23 TYDHRSLTINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQG 82
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
QYYF R+DLVRFVK V++AGL+++LRIGPY CAEWNYGGFPVWL ++PGI FRT N PF
Sbjct: 83 QYYFSDRYDLVRFVKLVKQAGLYVNLRIGPYVCAEWNYGGFPVWLKYVPGISFRTDNGPF 142
Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
K M+ F+ KI+ +MK E LF QGGPIILAQVENEYG +E G G + YV WAA AV
Sbjct: 143 KAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGSGAKSYVDWAAKMAV 202
Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
N VPW+MC+Q+DAPDP+INTCNGFYCD FTPNS +KP MWTE +SGWF +FG VP
Sbjct: 203 ATNAGVPWIMCKQDDAPDPVINTCNGFYCDDFTPNSKNKPSMWTEAWSGWFTAFGGTVPQ 262
Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQP 305
RPVEDLAFAVARF + GG+F NYYMY GGTNF RTAGGP +ATSYDYDAPIDEYG +RQP
Sbjct: 263 RPVEDLAFAVARFIQKGGSFINYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQP 322
Query: 306 KWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANV 365
KWGHL LHKAIK E L++ DPT Q +G +A+++ SS DCAAFL+N+ +S+ A V
Sbjct: 323 KWGHLTNLHKAIKQAETALVAGDPTVQNIGNYEKAYVFRSSSGDCAAFLSNFHTSAAARV 382
Query: 366 TFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW- 424
FNG Y LPAWS+S+LPDC+ V+NTA V + + + + F+W
Sbjct: 383 AFNGRRYDLPAWSISVLPDCRTAVYNTATVTAASSPAK------------MNPAGGFTWQ 430
Query: 425 -YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIES 478
Y E +F + L EQ++ T D SDYLWYT +++ G+ G+ L + S
Sbjct: 431 SYGEATNSLDETAFTKDGLVEQLSMTWDKSDYLWYTTYVNIDSGEQFLKSGQWPQLTVYS 490
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
GH+ VFVN + YG +D + +++ +G N + ILS VGL N G ++
Sbjct: 491 AGHSVQVFVNGQYFGNAYGGYDGPKLTYSGYVKMWQGSNKISILSSAVGLPNVGTHYETW 550
Query: 539 GAGLFS-VILIDLKNGKRDLSSGEWIYQV 566
G+ V L L GKRDLS +W YQV
Sbjct: 551 NIGVLGPVTLSGLNEGKRDLSKQKWTYQV 579
>gi|219887949|gb|ACL54349.1| unknown [Zea mays]
gi|414870186|tpg|DAA48743.1| TPA: beta-galactosidase [Zea mays]
Length = 850
Score = 659 bits (1699), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/837 (41%), Positives = 498/837 (59%), Gaps = 58/837 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V+YD R+L+ DG R + SGSIHYPRS P++WPELI K+KEGGL IETYVFWN HEP +
Sbjct: 43 VSYDRRSLMFDGHREIFLSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYVFWNIHEPEK 102
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G++ FEG+ D+VRF + +QE ++ +R+GP+ AEWN+GG P WL IP I FRT N P
Sbjct: 103 GEFNFEGQNDVVRFFQLIQEHDMYAMVRLGPFIQAEWNHGGLPYWLREIPDIVFRTNNEP 162
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
+K M+ F+ II +K NLFASQGGPIILAQ+ENEY ++E A+ G Y+ WAA A
Sbjct: 163 YKMHMETFVKIIIKRLKDANLFASQGGPIILAQIENEYQHMEAAFKDEGTKYINWAAKMA 222
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFLSFGY 241
++ N +PW+MC+Q AP +I TCNG C G T P + S P++WTEN++ + FG
Sbjct: 223 ISTNIGIPWIMCKQTKAPSDVIPTCNGRNC-GDTWPGPTNKSMPLLWTENWTAQYRVFGD 281
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
R ED+AFAVARFF GGT NYYMY GGTNFGRT+ ++ YD +AP+DE+G
Sbjct: 282 PPSQRSAEDIAFAVARFFSVGGTLANYYMYHGGTNFGRTSAAFVMPKYYD-EAPLDEFGL 340
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSS 360
++PKWGHLR+LH+A+KLC++ L+ P+ +KLG +LEA ++ C AFL+N+++
Sbjct: 341 YKEPKWGHLRDLHQALKLCKKALLWGTPSTEKLGKQLEARVFEMPEQKVCVAFLSNHNTK 400
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQ---QKNVNELLL 417
DA +TF G YF+P S+S+L DC+ VVF T V +Q N FA Q NV E+
Sbjct: 401 DDATMTFRGRPYFVPRHSISVLADCETVVFGTQHVNAQHNQRTFHFADQTAQNNVWEMFD 460
Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGK-EV 472
+ + + K+ + + N TKD +DY+WYT+S + MP + +
Sbjct: 461 GENVPKYKQAKIRLR--------KAGDLYNLTKDKTDYVWYTSSFKLEADDMPIRSDIKT 512
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L + S GHA++ FVN K V G+G F + K ++L +G+N + +L+ +G+ + G
Sbjct: 513 VLEVNSHGHASVAFVNNKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAVLASSMGMTDSG 572
Query: 533 AWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
A+ + AG+ V + L G DL++ W + VG+ GE + S WK
Sbjct: 573 AYMEHRLAGVDRVQITGLNAGTLDLTNNGWGHIVGLVGERKQIYTDKGMGSVTWKPAMN- 631
Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
++ L WYK F P G+ P+ L++++MGKG +VNGQ IGRYW +Y
Sbjct: 632 --DRPLTWYKRHFDMPSGEDPVVLDMSTMGKGMMFVNGQGIGRYWISY------------ 677
Query: 653 YRGSYDASKCQKHC-GQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
KH G+P+Q LYH+PR+++ +N+LV+ EE G P I +LT +I
Sbjct: 678 -----------KHALGRPSQQLYHVPRSFLRQKDNMLVLFEEEFGRPDAIMILTVKRDNI 726
Query: 712 CSFVSEADPPPVDSWKPNLGVVSSS-------PQVRLACERGWHIAAINFASYGIPEGNC 764
C+F+SE +P + SW+ +++ + LAC I + FASYG P G C
Sbjct: 727 CTFISERNPAHIMSWERKDSQITAKANADDLRARAALACPPKKLIQQVVFASYGNPAGIC 786
Query: 765 GSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G++ G+CH +V+KAC+G+ C++PV++ G A C G LAV+A CS
Sbjct: 787 GNYTVGSCHTPRAKEVVEKACLGKRVCTLPVAADVYGGDAN-CSGTTATLAVQAKCS 842
>gi|222640983|gb|EEE69115.1| hypothetical protein OsJ_28192 [Oryza sativa Japonica Group]
Length = 848
Score = 659 bits (1699), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/842 (42%), Positives = 490/842 (58%), Gaps = 63/842 (7%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+TYD R+L+IDG R + SGSIHYPRS P+ WP+LI K+KEGGL VIE+YVFWN HEP +
Sbjct: 33 ITYDRRSLIIDGHREIFFSGSIHYPRSPPDTWPDLISKAKEGGLNVIESYVFWNGHEPEQ 92
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G Y FEGR+DL++F K +QE ++ +RIGP+ AEWN+GG P WL IP I FRT N P
Sbjct: 93 GVYNFEGRYDLIKFFKLIQEKEMYAIVRIGPFVQAEWNHGGLPYWLREIPDIIFRTNNEP 152
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK+ MK+F+ I++ +K+ LFASQGGPIILAQ+ENEY ++E A+ G Y+ WAA A
Sbjct: 153 FKKYMKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWAAKMA 212
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFLSFGY 241
+ NT VPW+MC+Q AP +I TCNG +C G T P KP++WTEN++ + FG
Sbjct: 213 IATNTGVPWIMCKQTKAPGEVIPTCNGRHC-GDTWPGPADKKKPLLWTENWTAQYRVFGD 271
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
R ED+AF+VARFF GGT NYYMY GGTNFGR ++ YD +AP DE+G
Sbjct: 272 PPSQRSAEDIAFSVARFFSVGGTMANYYMYHGGTNFGRNGAAFVMPRYYD-EAPFDEFGL 330
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYH-KSSNDCAAFLANYDSS 360
++PKWGHLR+LH A++ C++ L+ +P+ Q LG EA ++ K N C AFL+N+++
Sbjct: 331 YKEPKWGHLRDLHHALRHCKKALLWGNPSVQPLGKLYEARVFEMKEKNVCVAFLSNHNTK 390
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQ---QKNVNELLL 417
D VTF G YF+ S+SIL DCK VVF+T V SQ N FA Q NV E+
Sbjct: 391 EDGTVTFRGQKYFVARRSISILADCKTVVFSTQHVNSQHNQRTFHFADQTVQDNVWEM-- 448
Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGKEV- 472
+ EEK+ S EQ N TKD +DYLWYT S + +P + KEV
Sbjct: 449 ------YSEEKIPRYSKTSIRTQRPLEQYNQTKDKTDYLWYTTSFRLETDDLPYR-KEVK 501
Query: 473 -FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
L + S GHA + FVN V G+G F + K ++L G+N + ILS +GL +
Sbjct: 502 PVLEVSSHGHAIVAFVNDAFVGCGHGTKINKAFTMEKAMDLKVGVNHVAILSSTLGLMDS 561
Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
G++ + AG+++V + L G DL++ W + VG++GE + + WK G
Sbjct: 562 GSYLEHRMAGVYTVTIRGLNTGTLDLTTNGWGHVVGLDGERRRVHSEQGMGAVAWKPGKD 621
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
N+ L WY+ F P G P+ ++L MGKG +VNG+ +GRYW +Y
Sbjct: 622 ---NQPLTWYRRRFDPPSGTDPVVIDLTPMGKGFLFVNGEGLGRYWVSY----------- 667
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
G+P+Q LYH+PR+ + P N L+ EE GG P I +LT +I
Sbjct: 668 -----------HHALGKPSQYLYHVPRSLLRPKGNTLMFFEEEGGKPDAIMILTVKRDNI 716
Query: 712 CSFVSEADPPPVD-SWKPN-----------LGVVSSSPQVRLACERGWHIAAINFASYGI 759
C+F++E +P V SW+ G P L+C I ++ FASYG
Sbjct: 717 CTFMTEKNPAHVRWSWESKDSQPKAVAGAGAGAGGFKPTAVLSCPTKKTIQSVVFASYGN 776
Query: 760 PEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAH 818
P G CG++ G+CH +V+KAC+G+ CS+ VSS G CPG LAV+A
Sbjct: 777 PLGICGNYTVGSCHAPRTKEVVEKACIGRKTCSLVVSSEVYGGDV-HCPGTTGTLAVQAK 835
Query: 819 CS 820
CS
Sbjct: 836 CS 837
>gi|357449773|ref|XP_003595163.1| Beta-galactosidase [Medicago truncatula]
gi|355484211|gb|AES65414.1| Beta-galactosidase [Medicago truncatula]
Length = 607
Score = 658 bits (1697), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 320/575 (55%), Positives = 403/575 (70%), Gaps = 20/575 (3%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
++A+VTYDH+A+VI+GKRR+L SGSIHYPRSTP++WP+LI+K+K+GG++VIETYVFWN H
Sbjct: 24 VTASVTYDHKAIVINGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGVDVIETYVFWNGH 83
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP +G+YYFE RFDLV+F+K VQ+AGL++HLRIGPY CAEWN+GGFPVWL ++PG+ FRT
Sbjct: 84 EPSQGKYYFEDRFDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVAFRT 143
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK M++F KI+ +MK ENLF SQGGPIIL+Q+ENEYG VEW G G+ Y KW
Sbjct: 144 DNEPFKAAMQKFTTKIVSIMKSENLFQSQGGPIILSQIENEYGPVEWEIGAPGKSYTKWF 203
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
+ AV LNT VPWVMC+QEDAPDPII+TCNG+YC+ F+PN KP MWTEN++GW+ FG
Sbjct: 204 SQMAVGLNTGVPWVMCKQEDAPDPIIDTCNGYYCENFSPNKNYKPKMWTENWTGWYTDFG 263
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
AVP+RP EDLAF+VARF + G++ NYYMY GGTNFGRT+ G +ATSYDYDAPIDEYG
Sbjct: 264 TAVPYRPAEDLAFSVARFVQNRGSYVNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYG 323
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
I +PKWGHLR+LHKAIK CE L+S DPT G LE H+Y S CAAFLANYD+
Sbjct: 324 LISEPKWGHLRDLHKAIKQCESALVSVDPTVSWPGKNLEVHLYKTSFGACAAFLANYDTG 383
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S A V F Y LP WS+SILPDCK VFNTAKV + R + + A+S
Sbjct: 384 SWAKVAFGNGHYDLPPWSISILPDCKTEVFNTAKVRAPRVH-----------RSMTPANS 432
Query: 421 AFSW--YEEKVGISGNR-SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
AF+W Y E+ SG S+ L EQ++ T D SDYLWY +++ P + G+
Sbjct: 433 AFNWQSYNEQPAFSGESGSWTANGLLEQLSQTWDKSDYLWYMTDVNISPNEGFIKNGQNP 492
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L S GH VF+N + YG+ D + ++L G N + +LS+ VGL N G
Sbjct: 493 VLTAMSAGHVLHVFINGQFWGTAYGSLDNPKLTFSNSVKLRVGNNKISLLSVAVGLSNVG 552
Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQV 566
++ G+ V L L G RDLS +W Y+V
Sbjct: 553 VHYEKWNVGVLGPVTLKGLNEGTRDLSKQKWSYKV 587
>gi|357142200|ref|XP_003572492.1| PREDICTED: beta-galactosidase 11-like [Brachypodium distachyon]
Length = 823
Score = 657 bits (1694), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/838 (42%), Positives = 492/838 (58%), Gaps = 59/838 (7%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+T+D R+L++DG+R + SGSIHYPRS P +WP+LI ++KEGGL VIE+YVFWN HEP
Sbjct: 15 ITFDRRSLMVDGRRDLFFSGSIHYPRSPPHMWPDLIARAKEGGLNVIESYVFWNGHEPEM 74
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G Y FEGR+D+++F K VQE +F +RIGP+ AEWN+GG P WL +P I FRT N P
Sbjct: 75 GVYNFEGRYDMIKFFKLVQEHEMFAMVRIGPFVQAEWNHGGLPYWLREVPDIIFRTNNEP 134
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK+ M++F+ I++ +K LFASQGGPIILAQ+ENEY ++E A+ G Y+ WAA A
Sbjct: 135 FKKHMQKFVTMIVNKLKDAKLFASQGGPIILAQIENEYQHLEAAFKENGTTYIHWAAKMA 194
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFLSFGY 241
+LN VPW+MC+Q AP +I TCNG +C G T P +KP++WTEN++ + FG
Sbjct: 195 SDLNIGVPWIMCKQTKAPGEVIPTCNGRHC-GDTWPGPTDKNKPLLWTENWTAQYRVFGD 253
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
R ED+AFAVARF+ GGT NYYMY GGTNFGRT ++ YD +AP+DE+G
Sbjct: 254 PPSQRSAEDIAFAVARFYSVGGTMVNYYMYHGGTNFGRTGASFVMPRYYD-EAPLDEFGL 312
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSS 360
++PKWGHLR+LH A++LC++ ++ +P++Q LG EA ++ C AFL+N+++
Sbjct: 313 YKEPKWGHLRDLHHALRLCKKAILWGNPSNQPLGKLYEARLFEIPEQKICVAFLSNHNTK 372
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK---NVNELLL 417
D VTF G YF+P SVSIL DCK VVF+T V SQ N F+ Q NV E+
Sbjct: 373 EDGTVTFRGQQYFVPRRSVSILADCKTVVFSTQHVNSQHNQRTFHFSDQTVQGNVWEMYT 432
Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGKEVF 473
S Y+ N +P E N TKD +DY+WYT S + +P + K+++
Sbjct: 433 ESDKVPTYK-----FTNIRTQKP--LEAYNLTKDKTDYVWYTTSFKLEAEDLPFR-KDIW 484
Query: 474 --LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
L + S GHA + FVN K V G+G F + K IE+ GIN + ILS +G+Q+
Sbjct: 485 PVLEVSSHGHAMVAFVNGKYVGAGHGTKINKAFTMEKPIEVRTGINHVSILSTTLGMQDS 544
Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
G + + AG+ V + L G DL+S W + VG+EGE + W
Sbjct: 545 GVYLEHRQAGIDGVTIQGLNTGTLDLTSNGWGHLVGLEGERRNAHTEKGGDGVQWVPAV- 603
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
++ L WY+ F P G P+ ++++ MGKG +VNG+ +GRYWS+Y
Sbjct: 604 --FDRPLTWYRRRFDIPTGDDPVVIDMSPMGKGVLYVNGEGLGRYWSSY----------- 650
Query: 652 DYRGSYDASKCQKHC-GQPAQTLYHIPRTWVHPGENLLVI-HEELGGDPSKISLLTKTGQ 709
KH G+P+Q LYH+PR ++ P N++ I EE GG P I +LT
Sbjct: 651 ------------KHALGRPSQYLYHVPRCFLKPTGNVMTIFEEEGGGQPDGIMILTVKRD 698
Query: 710 HICSFVSEADPPPVDSWKPNLGVVSS------SPQVRLACERGWHIAAINFASYGIPEGN 763
+ICSF+SE +P V SW+ + S PQ L+C I + FASYG P G
Sbjct: 699 NICSFISEKNPAHVKSWERKDSHLKSVADADLKPQAVLSCPEKKLIQQVVFASYGNPLGI 758
Query: 764 CGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CG++ G CH IV+KACVG+ C + VS G CPG LAV+A CS
Sbjct: 759 CGNYTVGNCHAPKAKEIVEKACVGKKSCVLQVSHEVYGADLN-CPGSTGTLAVQAKCS 815
>gi|326520333|dbj|BAK07425.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 841
Score = 657 bits (1694), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/838 (43%), Positives = 491/838 (58%), Gaps = 62/838 (7%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+TYD R+L+IDG+R + SGSIHYPRS WP+LI ++KEGGL VIE+YVFWN HEP
Sbjct: 36 ITYDRRSLMIDGRREIFFSGSIHYPRSPFHEWPDLIARAKEGGLNVIESYVFWNIHEPEM 95
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G Y FEGR+D+++F K +QE +F +RIGP+ AEWN+GG P WL +P I FRT N P
Sbjct: 96 GVYNFEGRYDMIKFFKLIQEHEMFAMVRIGPFVQAEWNHGGLPYWLREVPDIVFRTDNEP 155
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
+K+ M++F+ +++ +K LFASQGGPIILAQ+ENEY ++E A+ G Y+ WAA A
Sbjct: 156 YKKLMQKFVTLVVNKLKDAKLFASQGGPIILAQIENEYQHMEAAFKENGTRYIDWAAKMA 215
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFLSFGY 241
++ +T VPW+MC+Q AP +I TCNG +C G T P +KP++WTEN++ + FG
Sbjct: 216 ISTSTGVPWIMCKQTKAPAEVIPTCNGRHC-GDTWPGPTDKNKPLLWTENWTAQYRVFGD 274
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
R ED+AFAVARFF GG+ NYYMY GGTNFGRT G V Y +AP+DE+G
Sbjct: 275 PPSQRSAEDIAFAVARFFSVGGSMVNYYMYHGGTNFGRT-GASFVMPRYYDEAPLDEFGM 333
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSS 360
++PKWGHLR+LH A++LC++ L+ +P+ Q LG EA ++ C AFL+N+++
Sbjct: 334 YKEPKWGHLRDLHHALRLCKKALLRGNPSTQPLGKLYEARLFEIPEQKVCVAFLSNHNTK 393
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQ---QKNVNELLL 417
D VTF G YF+P SVSIL DCK VVF+T V +Q N Q NV E+
Sbjct: 394 EDGTVTFRGQQYFVPRRSVSILADCKTVVFSTQHVNAQHNQRTFHLTDQTLQNNVWEMYT 453
Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MP-GQGKEV 472
Y+ + +RS +P E N TKD +DYLWYT S + +P Q +
Sbjct: 454 EGDKVPTYK----FTTDRS-EKP--LEAYNMTKDKTDYLWYTTSFKLEAEDLPFRQDIKP 506
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L S GHA + FVN KLV +G F + K IE+ GIN + ILS +GLQ+ G
Sbjct: 507 VLEASSHGHAMVAFVNGKLVGAAHGTKMNKAFSLEKPIEVRAGINHVSILSSTLGLQDSG 566
Query: 533 AWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQGS 590
A+ + AG+ SV + L G DLSS W + VG++GE +DK WK
Sbjct: 567 AYLEHRQAGVHSVTIQGLNTGTLDLSSNGWGHIVGLDGERKQAHMDK---GGEVQWKPAV 623
Query: 591 -TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
LP L WY+ F P G+ P+ ++L MGKG +VNG+ +GRYWS+Y
Sbjct: 624 FDLP----LTWYRRRFDMPSGEDPVVIDLNPMGKGILFVNGEGLGRYWSSY--------- 670
Query: 650 KCDYRGSYDASKCQKHC-GQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTG 708
KH G+P+Q LYH+PR ++ P N+L I EE GG P I +LT
Sbjct: 671 --------------KHALGRPSQYLYHVPRCFLKPTGNVLTIFEEEGGRPDAIMILTVKR 716
Query: 709 QHICSFVSEADPPPVDSWK---PNLGVVSSS--PQVRLACERGWHIAAINFASYGIPEGN 763
+ICSF+SE +P V SW+ L VV+ P+ L C I + FASYG P G
Sbjct: 717 DNICSFISEKNPGHVRSWERKDSQLTVVADDLKPRAVLTCPEKKTIQQVVFASYGNPLGI 776
Query: 764 CGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CG++ G CH +V+KACVG+ C + VS G CPG LAV+A CS
Sbjct: 777 CGNYTVGNCHTPKAKEVVEKACVGKKSCVLAVSHEVYGGDLN-CPGTTATLAVQAKCS 833
>gi|218188392|gb|EEC70819.1| hypothetical protein OsI_02284 [Oryza sativa Indica Group]
Length = 837
Score = 653 bits (1685), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/734 (46%), Positives = 455/734 (61%), Gaps = 29/734 (3%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
+V+YD R+LVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL+ IETY+FWN HEP
Sbjct: 29 TSVSYDDRSLVIDGQRRIILSGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYIFWNGHEP 88
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
R QY FEG +D+VRF K +Q AG++ LRIGPY C EWNYGG P WL IPG+QFR N
Sbjct: 89 HRRQYNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHN 148
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWA 180
PF+ EM+ F I++ MK +FA QGGPIILAQ+ENEYGN+ + Y+ W
Sbjct: 149 EPFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIHWC 208
Query: 181 ADTAVNLNTSVPWVMCQQ-EDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSF 239
AD A N VPW+MCQQ +D P ++NTCNGFYC + PN P +WTEN++GWF ++
Sbjct: 209 ADMANKQNVGVPWIMCQQDDDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAW 268
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
R ED+AFAVA FF+ G+ QNYYMY GGTNFGRT+GGP + TSYDYDAP+DEY
Sbjct: 269 DKPDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEY 328
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
G +RQPK+GHL+ELH +K E+ L+ + G + Y S+ A F+ N
Sbjct: 329 GNLRQPKYGHLKELHSVLKSMEKTLVHGEYFDTNYGDNITVTKYTLDSSS-ACFINNRFD 387
Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
D NVT +G + LPAWSVSILPDCK V FN+AK+ +Q + ++ N E S
Sbjct: 388 DKDVNVTLDGATHLLPAWSVSILPDCKTVAFNSAKIKTQTS----VMVKKPNTAEQEQES 443
Query: 420 SAFSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNI 476
+SW E + +F + +L EQI T+ D SDYLWY S++ G+G L +
Sbjct: 444 LKWSWMPENLSPFMTDEKGNFRKNELLEQIVTSTDQSDYLWYRTSLN-HKGEGS-YKLYV 501
Query: 477 ESLGHAALVFVNKKLVAFGY-GNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ GH FVN KL+ + + DF F + ++L++G N + +LS VGL+NYG F
Sbjct: 502 NTTGHELYAFVNGKLIGKNHSADGDFV-FQLESPVKLHDGKNYISLLSATVGLKNYGPSF 560
Query: 536 DVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWK-QGS 590
+ G+ V LID DLS+ W Y+ G+ EY I LDK W
Sbjct: 561 EKMPTGIVGGPVKLIDSNGTAIDLSNSSWSYKAGLASEYRQIHLDKPGYK----WNGNNG 616
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
T+P+N+ WYK TF AP G+ + ++L + KG AWVNG ++GRYW +Y A +
Sbjct: 617 TIPINRPFTWYKATFEAPSGEDAVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMAGCHR 676
Query: 651 CDYRGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLT 705
CDYRG++ D ++C CG+P+Q YH+PR+++ GE N L++ EE GGDPS ++L T
Sbjct: 677 CDYRGAFQAEGDGTRCLTGCGEPSQRYYHVPRSFLAAGEPNTLLLFEEAGGDPSGVALRT 736
Query: 706 KTGQHICSFVSEAD 719
+C+ D
Sbjct: 737 VVPGPVCTSGEAGD 750
>gi|290782382|gb|ADD62393.1| beta-galactosidase 3 [Prunus persica]
Length = 683
Score = 653 bits (1685), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/688 (49%), Positives = 443/688 (64%), Gaps = 31/688 (4%)
Query: 146 FASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPI 205
FASQGGPIIL+Q+ENEYG A G G Y+ WAA AV L+T VPWVMC+++DAPDP+
Sbjct: 2 FASQGGPIILSQIENEYGPESKALGAAGHAYINWAAKMAVALDTGVPWVMCKEDDAPDPM 61
Query: 206 INTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTF 265
IN CNGFYCDGF+PN P KP MWTE +SGWF FG + RPV+DLAF+VARF + GG++
Sbjct: 62 INACNGFYCDGFSPNKPYKPTMWTEAWSGWFTEFGGTIHHRPVQDLAFSVARFIQKGGSY 121
Query: 266 QNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLI 325
NYYMY GGTNFGRTAGGP + TSYDYD PIDEYG IRQPK+GHL+ELHKAIKLCE L+
Sbjct: 122 INYYMYHGGTNFGRTAGGPFITTSYDYDVPIDEYGLIRQPKYGHLKELHKAIKLCEHALV 181
Query: 326 SSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDC 385
SSDPT LGA +A++++ CAAFL+N+ S+ A +TFN Y LPAWS+SILPDC
Sbjct: 182 SSDPTVTSLGAYQQAYVFNSGPRRCAAFLSNFHSTG-ARMTFNNMHYDLPAWSISILPDC 240
Query: 386 KNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW--YEEKVGISGNRSFVRP-DL 442
+NVVFNTAKV Q V + S FSW Y+E V RS + L
Sbjct: 241 RNVVFNTAKV----------GVQTSRVQMIPTNSRLFSWQTYDEDVSSLHERSSIAAGGL 290
Query: 443 AEQINTTKDTSDYLWYTASIHVMPGQ---GKEVFLNIESLGHAALVFVNKKLVAFGYGNH 499
EQIN T+DTSDYLWY ++ + + GK+ L ++S GHA VFVN + +G
Sbjct: 291 LEQINVTRDTSDYLWYMTNVDISSSELRGGKKPTLTVQSAGHALHVFVNGQFSGSAFGTR 350
Query: 500 DFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILID-LKNGKRDLS 558
+ F K + L GIN + +LS+ VGL N G ++ G+ + +D L G++DL+
Sbjct: 351 EHRQFTFAKPVHLRAGINKIALLSIAVGLPNVGLHYESWKTGILGPVFLDGLGQGRKDLT 410
Query: 559 SGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK-SLIWYKTTFLAPEGKGPLALN 617
+W +VG++GE + L + +S W +GS K +L WYK F AP G PLAL+
Sbjct: 411 MQKWFNKVGLKGEAMDLVSPNGGSSVDWIRGSLATQTKQTLKWYKAYFNAPGGDEPLALD 470
Query: 618 LASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIP 677
+ SMGKGQ W+NGQSIG+YW AY + G C Y G++ +KCQ CGQP Q YH+P
Sbjct: 471 MRSMGKGQVWINGQSIGKYWMAY---ANGDCSLCSYIGTFRPTKCQLGCGQPTQRWYHVP 527
Query: 678 RTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPP----VDSWKPNLGVV 733
R+W+ P +NL+V+ EELGGDPSKI+L+ ++ +C+ + E P +DS + + +
Sbjct: 528 RSWLKPTQNLVVVFEELGGDPSKITLVKRSVAGVCADLQEHHPNAEKLDIDSHEESKTLH 587
Query: 734 SSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSI 792
+ QV L C G I++I FAS+G P G CGSF+ G CH + IV+K C+G+ C +
Sbjct: 588 QA--QVHLQCVPGQSISSIKFASFGTPTGTCGSFQQGTCHATNSHAIVEKNCIGRESCLV 645
Query: 793 PVSSAYLGVSAGACPGLLKALAVEAHCS 820
VS++ G CP +LK L+VEA CS
Sbjct: 646 TVSNSIFGTD--PCPNVLKRLSVEAVCS 671
>gi|147843477|emb|CAN82062.1| hypothetical protein VITISV_016430 [Vitis vinifera]
Length = 773
Score = 653 bits (1684), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/832 (43%), Positives = 480/832 (57%), Gaps = 101/832 (12%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+T D R ++I+G+R++L SGS+HYPRSTPE+WP+LI+KSK+GGL I+TYVFW+ HEP R
Sbjct: 26 ITSDARGIMINGERKILISGSVHYPRSTPEMWPDLIQKSKDGGLNTIDTYVFWDLHEPQR 85
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
QY F G DLVRF+K +Q GL+ LRIGPY CAEW YGGFPVWLH P IQ RT N
Sbjct: 86 RQYDFTGNKDLVRFIKAIQAQGLYAVLRIGPYVCAEWTYGGFPVWLHNQPSIQLRTNNTV 145
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
+ +ENEYGNV AY G Y+ W A A
Sbjct: 146 Y-------------------------------MIENEYGNVMRAYHDAGVQYINWCAQMA 174
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
L+T VPW+MCQQ++AP P+INTCNG+YCD FTPN+P+ P MWTEN+SGW+ ++G + P
Sbjct: 175 AALDTGVPWIMCQQDNAPQPMINTCNGYYCDQFTPNNPNSPKMWTENWSGWYKNWGGSDP 234
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
R EDLAF+VARF++ GGTFQNYYMY GGTNFGRTAGGP + TSYDYDAP++EYG Q
Sbjct: 235 HRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLNEYGNKNQ 294
Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY-HKSSNDCAAFLANYDSSSDA 363
PKWGHLR+LH + E+ L D + A IY ++ + C F N ++ D
Sbjct: 295 PKWGHLRDLHLLLLSMEKALTYGDVKNVDYETLTSATIYSYQGKSSC--FFGNSNADRDV 352
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
+ + G Y +PAWSVSILPDC N V+NTAKV SQ + F ++ + E S ++
Sbjct: 353 TINYGGVNYTIPAWSVSILPDCSNEVYNTAKVNSQYST----FVKKGSEAENEPNSLQWT 408
Query: 424 WYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAA 483
W E + ++ P + N D +W GK++ L++ + GH
Sbjct: 409 WRGETI------QYITPGSVDISN-----DDPIW-----------GKDLTLSVNTSGHIL 446
Query: 484 LVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLF 543
FVN + + + Y F + I L G N + +LS+ VGL NYG FD+ G+
Sbjct: 447 HAFVNGEHIGYQYALLGQFEFQFRRSITLQLGKNEITLLSVTVGLTNYGPDFDMVNQGIH 506
Query: 544 SVILIDLKNGKRDL-----SSGEWIYQVGVEGEYIGLDKISLANSSF--WKQGSTLPVNK 596
+ I NG D+ ++ +W Y+ G+ GE KI L + + WK LPVN+
Sbjct: 507 GPVQIIASNGSADIIKDLSNNNQWAYKAGLNGE---DKKIFLGRARYNQWKS-DNLPVNR 562
Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
S +WYK TF AP G+ P+ ++L +GKG+AWVNG S+GRYW +Y+A GC+ +CDYRG
Sbjct: 563 SFVWYKATFDAPPGEDPVVVDLMGLGKGEAWVNGHSLGRYWPSYIARGEGCSPECDYRGP 622
Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
Y A KC +CG P+Q YH+PR+++ +N LV+ EE G+PS ++ T T + C+
Sbjct: 623 YKAEKCNTNCGNPSQRWYHVPRSFLASTDNRLVLFEEFXGNPSSVTFQTVTVGNACANAR 682
Query: 717 EADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGS--------FR 768
E + L+C+ G I+ I FAS+G P+G CG F
Sbjct: 683 EG------------------YTLELSCQ-GRAISXIKFASFGDPQGTCGKPFATGSQVFE 723
Query: 769 PGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G C D L I+QK CVG+ CSI VS LG C K LAVEA C
Sbjct: 724 KGTCEAADSLSIIQKLCVGKYSCSIDVSEQILG--PAGCTADTKRLAVEAIC 773
>gi|357464799|ref|XP_003602681.1| Beta-galactosidase [Medicago truncatula]
gi|355491729|gb|AES72932.1| Beta-galactosidase [Medicago truncatula]
Length = 628
Score = 652 bits (1681), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/609 (55%), Positives = 410/609 (67%), Gaps = 14/609 (2%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ +NV+YD R+L+IDG+R++L S SIHYPRS P +WP LI+ +KEGG++VIETYVFWN H
Sbjct: 23 VGSNVSYDGRSLIIDGQRKLLISASIHYPRSVPAMWPALIQTAKEGGIDVIETYVFWNGH 82
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
E G YYF GRFDLV+F K VQ+AG++L LRIGP+ AEWN+GG PVWLH+IPG FRT
Sbjct: 83 ELSPGNYYFGGRFDLVQFAKVVQDAGMYLILRIGPFVAAEWNFGGVPVWLHYIPGTVFRT 142
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PF M++F I++LMK+E LFASQGGPIIL+Q+ENEYG E Y G+ Y WA
Sbjct: 143 YNQPFMHHMEKFTTYIVNLMKKEKLFASQGGPIILSQIENEYGYYENYYKEDGKKYALWA 202
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV+ NTSVPW+MCQQ DAPDP+I+TCN FYCD FTP SP +P MWTEN+ GWF +FG
Sbjct: 203 AKMAVSQNTSVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPKRPKMWTENWPGWFKTFG 262
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
P RPVED+AF+VARFF+ GG+ NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 263 GRDPHRPVEDVAFSVARFFQKGGSLNNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 322
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
R PKWGHL+ELHKAIKLCE L+ + LG +EA IY SS CAAF++N D
Sbjct: 323 LPRLPKWGHLKELHKAIKLCEHVLLYGKSVNISLGPSVEADIYTDSSGACAAFISNVDDK 382
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN----NGDHPFAQQKNVNELL 416
+D V F Y LPAWSVSILPDCKNVVFNTAKV S N +H QQ + +
Sbjct: 383 NDKKVVFRNASYHLPAWSVSILPDCKNVVFNTAKVSSPTNIVAMIPEH--LQQSDKGQKT 440
Query: 417 LASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKE 471
L F +E GI G FV+ + INTTKDT+DYLW+T SI + + G +
Sbjct: 441 LKWDVF---KENPGIWGKADFVKNGFVDHINTTKDTTDYLWHTTSILIDANEEFLKKGSK 497
Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
L IES GH FVN+K G GN + F I L G N + ILS+ VGLQ
Sbjct: 498 PALLIESKGHTLHAFVNQKYQGTGTGNGSHSAFTFKNPISLRAGKNEIAILSLTVGLQTA 557
Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
G ++D GAG+ SV +I L N DLSS W Y++GV GE++ + + NS W S
Sbjct: 558 GPFYDFIGAGVTSVKIIGLNNRTIDLSSNAWAYKIGVLGEHLSIYQGEGMNSVKWTSTSE 617
Query: 592 LPVNKSLIW 600
P ++L W
Sbjct: 618 PPKGQALTW 626
>gi|75116245|sp|Q67VU7.1|BGL10_ORYSJ RecName: Full=Putative beta-galactosidase 10; Short=Lactase 10;
Flags: Precursor
gi|51535501|dbj|BAD37397.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|51535704|dbj|BAD37722.1| putative beta-galactosidase [Oryza sativa Japonica Group]
Length = 809
Score = 647 bits (1669), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/831 (42%), Positives = 475/831 (57%), Gaps = 68/831 (8%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTY+ R+LVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL+ IETYVFWN HEP R
Sbjct: 31 VTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
QY F G +D+VRF K +Q AGL+ LRIGPY C EWNYGG P WL IPG+QFR N P
Sbjct: 91 RQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNAP 150
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAAD 182
F+ EM+ F I++ MK N+FA QGGPIILAQ+ENEYGN+ + Y+ W AD
Sbjct: 151 FENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEYIHWCAD 210
Query: 183 TAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A N VPW+MCQQ+ D P ++NTCNGFYC + PN P +WTEN++GWF ++
Sbjct: 211 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 270
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
R ED+AFAVA FF+ GGP + TSYDYDAP+DEYG
Sbjct: 271 PDFHRSAEDIAFAVAMFFQ-------------------KRGGPYITTSYDYDAPLDEYGN 311
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPK+GHL++LH IK E+ L+ + K+ Y S A F+ N + +
Sbjct: 312 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDKVTVTKYTLDSTS-ACFINNRNDNM 370
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
D NVT +G + LPAWSVSILPDCK V FN+AK+ +Q + + E S
Sbjct: 371 DVNVTLDGTTHLLPAWSVSILPDCKTVAFNSAKIKAQTT----VMVNKAKMVEKEPESLK 426
Query: 422 FSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
+SW E + S+ + +L EQI T+ D SDYLWY SI+ +F+N +
Sbjct: 427 WSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSINHKGEASYTLFVN--T 484
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
GH FVN LV + + F + +L++G N + +LS +GL+NYG F+
Sbjct: 485 TGHELYAFVNGMLVGQNHSPNGHFVFQLESPAKLHDGKNYISLLSATIGLKNYGPLFEKM 544
Query: 539 GAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQGSTLPV 594
AG+ V LID DLS+ W Y+ G+ GEY I LDK ++ T+P+
Sbjct: 545 PAGIVGGPVKLIDNNGKGIDLSNSSWSYKAGLAGEYRQIHLDK---PGCTWDNNNGTVPI 601
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
NK WYKTTF AP G+ + ++L + KG AWVNG ++GRYW +Y A G CDYR
Sbjct: 602 NKPFTWYKTTFQAPAGEDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGGCHHCDYR 661
Query: 655 GSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLTKTGQ 709
G + D KC CG+P+Q YH+PR+++ GE N +++ EE GGDPS +S T
Sbjct: 662 GVFQAEGDGQKCLTGCGEPSQRFYHVPRSFLKNGEPNTVILFEEAGGDPSHVSFRTVAAG 721
Query: 710 HICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGNCGSFR 768
+C+ D + L+C + I+AIN S+G+ G CG+++
Sbjct: 722 SVCASAEVGD------------------TITLSCGQHSKTISAINVTSFGVARGQCGAYK 763
Query: 769 PGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G +AC+G+ C++ +++A V+ C L L V+A C
Sbjct: 764 GGCESKAAYKAFTEACLGKESCTVQITNA---VTGSGC--LSNVLTVQASC 809
>gi|356518798|ref|XP_003528064.1| PREDICTED: beta-galactosidase 6-like [Glycine max]
Length = 717
Score = 644 bits (1660), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 335/705 (47%), Positives = 451/705 (63%), Gaps = 37/705 (5%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD R+L+IDG+R++L SGSIHYPRSTP++WP+LI K+K+GGL+VI+TYVFWN HEP
Sbjct: 27 VTYDGRSLIIDGQRKILFSGSIHYPRSTPQMWPDLIAKAKQGGLDVIQTYVFWNLHEPQP 86
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G Y F GR+DLV F+K +Q GL++ LRIGP+ +EW YGGFP WLH +PGI +RT N P
Sbjct: 87 GMYDFSGRYDLVGFIKEIQAQGLYVCLRIGPFIESEWTYGGFPFWLHDVPGIVYRTDNEP 146
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK M+ F KI+++MK+E L+ASQGGPIIL+Q+ENEY N++ A+G G YV+WAA A
Sbjct: 147 FKFYMQNFTTKIVNMMKEEGLYASQGGPIILSQIENEYQNIQKAFGTAGSQYVQWAAKMA 206
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
V L+T VPW+MC+Q DAPDP+INTCNG C + FT PNSP+KP +WTEN++ ++ +G
Sbjct: 207 VGLDTGVPWIMCKQTDAPDPVINTCNGMRCGETFTGPNSPNKPALWTENWTSFYQVYGGL 266
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
R ED+AF V F G++ NYYMY GGTNFGRT G V T Y AP+DEYG +
Sbjct: 267 PYIRSAEDIAFHVTLFIARNGSYVNYYMYHGGTNFGRT-GSAYVITGYYDQAPLDEYGLL 325
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPKWGHL++LH+ IK C L+ + LG LE +++ + +C AFL N D +
Sbjct: 326 RQPKWGHLKQLHEVIKSCSTTLLQGVQRNFTLGQLLEVYVFEEEKGECVAFLINNDRDNK 385
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A V F + Y L S+SILPDC+NV F+TA V + N + ++N SS
Sbjct: 386 ATVQFRNSSYELLPKSISILPDCQNVTFSTANVNTTSNR--RIISPKQNF------SSVD 437
Query: 423 SWYEEKVGISG--NRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
W + + IS N S L EQ+NTTKD SDYLWYT K L+++S
Sbjct: 438 DWQQFQDVISNFDNTSLKSDSLLEQMNTTKDKSDYLWYTLRFEYNLSCSKPT-LSVQSAA 496
Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
H A FVN + +GNHD +F + + +N+G N L ILS+MVGL + GA+ + A
Sbjct: 497 HVAHAFVNNTYIGGEHGNHDVKSFTLELPVTVNQGTNNLSILSVMVGLPDSGAFLERRFA 556
Query: 541 GLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIW 600
GL SV L + +L++ W YQVG+ GE + + K + + W Q + + ++L W
Sbjct: 557 GLISVELQCSEQESLNLTNSTWGYQVGLMGEQLQVYKEQNNSDTGWSQLGNV-MEQTLFW 615
Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
YKTTF PEG P+ L+L+SMGKG+AWVNG+SIGRYW + +D+
Sbjct: 616 YKTTFDTPEGDDPVVLDLSSMGKGEAWVNGESIGRYWILF----------------HDSK 659
Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
G P+Q+LYH+PR+++ N+LV+ EE GG+P ISL T
Sbjct: 660 ------GNPSQSLYHVPRSFLKDSGNVLVLLEEGGGNPLGISLDT 698
>gi|296082606|emb|CBI21611.3| unnamed protein product [Vitis vinifera]
Length = 729
Score = 644 bits (1660), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/709 (48%), Positives = 443/709 (62%), Gaps = 34/709 (4%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A VTYD R+L+IDG R++L SGSIHYPRSTP++W LI K+KEGG++VI+TYVFWN HEP
Sbjct: 24 AQVTYDGRSLIIDGHRKILFSGSIHYPRSTPQMWASLIAKAKEGGVDVIQTYVFWNRHEP 83
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
GQY F GR+DL +F+K +Q GL+ LRIGP+ +EW+YGG P WLH + GI +RT N
Sbjct: 84 QPGQYDFNGRYDLAKFIKEIQAQGLYACLRIGPFIESEWSYGGLPFWLHDVHGIVYRTDN 143
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M+ F KI++LMK E L+ASQGGPIIL+Q+ENEY N+E A+ G YV+WAA
Sbjct: 144 EPFKFYMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYQNIEAAFNEKGPSYVRWAAK 203
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFG 240
AV L T VPWVMC+Q DAPDP+INTCNG C FT PNSP+KP MWTEN++ ++ FG
Sbjct: 204 MAVELQTGVPWVMCKQSDAPDPVINTCNGMRCGQTFTGPNSPNKPSMWTENWTSFYEVFG 263
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
R ED+AF VA F G++ NYYMY GGTNFGR A + TSY AP+DEYG
Sbjct: 264 GETYLRSAEDIAFHVALFIARNGSYVNYYMYHGGTNFGR-ASSAYIKTSYYDQAPLDEYG 322
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IRQPKWGHL+ELH AI LC L++ ++ LG EA+++ + C AFL N D
Sbjct: 323 LIRQPKWGHLKELHAAITLCSTPLLNGVQSNISLGQLQEAYVFQEEMGGCVAFLVNNDEG 382
Query: 361 SDANVTF-NGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
+++ V F N ++ LP S+SILPDCKNV+FNTAKV S + Q+ + + +
Sbjct: 383 NNSTVLFQNVSIELLPK-SISILPDCKNVIFNTAKVCSSSRQSAYKI-QELSRSCIQSFD 440
Query: 420 SAFSWYEEKVGISG--NRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQG-KEVFLNI 476
+ W E K I + S + E +N TKD SDYLWYT P E L+I
Sbjct: 441 AVDRWEEYKDAIPNFLDTSLKSNMILEHMNMTKDESDYLWYT--FRFQPNSSCTEPLLHI 498
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
ESL HA FVN V +G+HD F I LN +N + ILS+MVG + GA+ +
Sbjct: 499 ESLAHAVHAFVNNIYVGATHGSHDMKGFTFKSPISLNNEMNNISILSVMVGFPDSGAYLE 558
Query: 537 VAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
AGL V + + G D ++ W YQVG+ GE + + K ++ W++ + + N+
Sbjct: 559 SRFAGLTRVEIQCTEKGIYDFANYTWGYQVGLSGEKLHIYKEENLSNVEWRK-TEISTNQ 617
Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
L WYK F P G P+ALNL++MGKG+AWVNGQSIGRYW S
Sbjct: 618 PLTWYKIVFNTPSGDDPVALNLSTMGKGEAWVNGQSIGRYWV-----------------S 660
Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
+ SK G P+QTLYH+PR ++ ENLLV+ EE GDP ISL T
Sbjct: 661 FHNSK-----GDPSQTLYHVPRAFLKTSENLLVLLEEANGDPLHISLET 704
>gi|30679742|ref|NP_179264.2| beta-galactosidase 13 [Arabidopsis thaliana]
gi|75265629|sp|Q9SCU9.1|BGL13_ARATH RecName: Full=Beta-galactosidase 13; Short=Lactase 13; Flags:
Precursor
gi|6686898|emb|CAB64749.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|330251438|gb|AEC06532.1| beta-galactosidase 13 [Arabidopsis thaliana]
Length = 848
Score = 642 bits (1657), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/834 (41%), Positives = 497/834 (59%), Gaps = 54/834 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD +L+I+G R +L SGSIHYPRSTPE+WP +I+++K+GGL I+TYVFWN HEP +
Sbjct: 44 VTYDGTSLIINGNRELLYSGSIHYPRSTPEMWPNIIKRAKQGGLNTIQTYVFWNVHEPEQ 103
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G++ F GR DLV+F+K +++ GL++ LR+GP+ AEW +GG P WL +PGI FRT N P
Sbjct: 104 GKFNFSGRADLVKFIKLIEKNGLYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNEP 163
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FKE +R++ ++D+MK+E LFASQGGPIIL Q+ENEY V+ AY G Y+KWA+
Sbjct: 164 FKEHTERYVKVVLDMMKEEKLFASQGGPIILGQIENEYSAVQRAYKEDGLNYIKWASKLV 223
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
+++ +PWVMC+Q DAPDP+IN CNG +C D F PN +KP +WTEN++ F FG
Sbjct: 224 HSMDLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKDNKPSLWTENWTTQFRVFGDP 283
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
R VED+A++VARFF GT NYYMY GGTNFGRT+ + YD DAP+DE+G
Sbjct: 284 PAQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAHYVTTRYYD-DAPLDEFGLE 342
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSS 361
R+PK+GHL+ LH A+ LC++ L+ P +K + E Y + CAAFLAN ++ +
Sbjct: 343 REPKYGHLKHLHNALNLCKKALLWGQPRVEKPSNETEIRYYEQPGTKVCAAFLANNNTEA 402
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
+ F G Y +P S+SILPDCK VV+NT ++IS + + F + K N+ +
Sbjct: 403 AEKIKFRGKEYLIPHRSISILPDCKTVVYNTGEIISHHTSRN--FMKSKKANK----NFD 456
Query: 422 FSWYEEKV--GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQGKEVFL 474
F + E V I G+ SF+ +L TKD SDY WYT S + +G + L
Sbjct: 457 FKVFTESVPSKIKGD-SFIPVEL---YGLTKDESDYGWYTTSFKIDDNDLSKKKGGKPNL 512
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
I SLGHA V++N + + G+G+H+ +F+ K + L EG N L +L ++ G + G++
Sbjct: 513 RIASLGHALHVWLNGEYLGNGHGSHEEKSFVFQKPVTLKEGENHLTMLGVLTGFPDSGSY 572
Query: 535 FDVAGAGLFSVILIDLKNGKRDLS-SGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
+ G SV ++ L +G DL+ +W +VG+EGE +G+ W++ S
Sbjct: 573 MEHRYTGPRSVSILGLGSGTLDLTEENKWGNKVGMEGERLGIHAEEGLKKVKWEKASG-- 630
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ WY+T F APE + A+ + MGKG WVNG+ +GRYW ++L+P
Sbjct: 631 KEPGMTWYQTYFDAPESQSAAAIRMNGMGKGLIWVNGEGVGRYWMSFLSP---------- 680
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI-HEELGGDPSKISLLTKTGQHIC 712
GQP Q YHIPR+++ P +NLLVI EE P I + +C
Sbjct: 681 ------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELIDFVIVNRDTVC 728
Query: 713 SFVSEADPPPVDSW-KPNLGVVSSSPQVRLA----CERGWHIAAINFASYGIPEGNCGSF 767
S++ E P V W + N V + + V L C I+A+ FAS+G P G CG+F
Sbjct: 729 SYIGENYTPSVRHWTRKNDQVQAITDDVHLTANLKCSGTKKISAVEFASFGNPNGTCGNF 788
Query: 768 RPGACHMDV-LPIVQKACVGQIECSIPVS-SAYLGVSAGACPGLLKALAVEAHC 819
G+C+ V +V+K C+G+ EC IPV+ S + +CP + K LAV+ C
Sbjct: 789 TLGSCNAPVSKKVVEKYCLGKAECVIPVNKSTFEQDKKDSCPKVEKKLAVQVKC 842
>gi|4581116|gb|AAD24606.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 832
Score = 642 bits (1655), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/835 (41%), Positives = 498/835 (59%), Gaps = 54/835 (6%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
++TYD +L+I+G R +L SGSIHYPRSTPE+WP +I+++K+GGL I+TYVFWN HEP
Sbjct: 27 SITYDGTSLIINGNRELLYSGSIHYPRSTPEMWPNIIKRAKQGGLNTIQTYVFWNVHEPE 86
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+G++ F GR DLV+F+K +++ GL++ LR+GP+ AEW +GG P WL +PGI FRT N
Sbjct: 87 QGKFNFSGRADLVKFIKLIEKNGLYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNE 146
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFKE +R++ ++D+MK+E LFASQGGPIIL Q+ENEY V+ AY G Y+KWA+
Sbjct: 147 PFKEHTERYVKVVLDMMKEEKLFASQGGPIILGQIENEYSAVQRAYKEDGLNYIKWASKL 206
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGY 241
+++ +PWVMC+Q DAPDP+IN CNG +C D F PN +KP +WTEN++ F FG
Sbjct: 207 VHSMDLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKDNKPSLWTENWTTQFRVFGD 266
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
R VED+A++VARFF GT NYYMY GGTNFGRT+ + YD DAP+DE+G
Sbjct: 267 PPAQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAHYVTTRYYD-DAPLDEFGL 325
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSS 360
R+PK+GHL+ LH A+ LC++ L+ P +K + E Y + CAAFLAN ++
Sbjct: 326 EREPKYGHLKHLHNALNLCKKALLWGQPRVEKPSNETEIRYYEQPGTKVCAAFLANNNTE 385
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
+ + F G Y +P S+SILPDCK VV+NT ++IS + + F + K N+ +
Sbjct: 386 AAEKIKFRGKEYLIPHRSISILPDCKTVVYNTGEIISHHTSRN--FMKSKKANK----NF 439
Query: 421 AFSWYEEKV--GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQGKEVF 473
F + E V I G+ SF+ +L TKD SDY WYT S + +G +
Sbjct: 440 DFKVFTESVPSKIKGD-SFIPVEL---YGLTKDESDYGWYTTSFKIDDNDLSKKKGGKPN 495
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L I SLGHA V++N + + G+G+H+ +F+ K + L EG N L +L ++ G + G+
Sbjct: 496 LRIASLGHALHVWLNGEYLGNGHGSHEEKSFVFQKPVTLKEGENHLTMLGVLTGFPDSGS 555
Query: 534 WFDVAGAGLFSVILIDLKNGKRDLS-SGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
+ + G SV ++ L +G DL+ +W +VG+EGE +G+ W++ S
Sbjct: 556 YMEHRYTGPRSVSILGLGSGTLDLTEENKWGNKVGMEGERLGIHAEEGLKKVKWEKASG- 614
Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
+ WY+T F APE + A+ + MGKG WVNG+ +GRYW ++L+P
Sbjct: 615 -KEPGMTWYQTYFDAPESQSAAAIRMNGMGKGLIWVNGEGVGRYWMSFLSP--------- 664
Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI-HEELGGDPSKISLLTKTGQHI 711
GQP Q YHIPR+++ P +NLLVI EE P I + +
Sbjct: 665 -------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELIDFVIVNRDTV 711
Query: 712 CSFVSEADPPPVDSW-KPNLGVVSSSPQVRLA----CERGWHIAAINFASYGIPEGNCGS 766
CS++ E P V W + N V + + V L C I+A+ FAS+G P G CG+
Sbjct: 712 CSYIGENYTPSVRHWTRKNDQVQAITDDVHLTANLKCSGTKKISAVEFASFGNPNGTCGN 771
Query: 767 FRPGACHMDV-LPIVQKACVGQIECSIPVS-SAYLGVSAGACPGLLKALAVEAHC 819
F G+C+ V +V+K C+G+ EC IPV+ S + +CP + K LAV+ C
Sbjct: 772 FTLGSCNAPVSKKVVEKYCLGKAECVIPVNKSTFEQDKKDSCPKVEKKLAVQVKC 826
>gi|297798422|ref|XP_002867095.1| beta-galactosidase 11 [Arabidopsis lyrata subsp. lyrata]
gi|297312931|gb|EFH43354.1| beta-galactosidase 11 [Arabidopsis lyrata subsp. lyrata]
Length = 844
Score = 641 bits (1653), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/834 (41%), Positives = 492/834 (58%), Gaps = 54/834 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD +L+IDGKR +L SGSIHYPRSTPE+WP +I+++K+GGL I+TYVFWN HEP +
Sbjct: 40 VTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHEPQQ 99
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G++ F GR DLV+F+K +++ G+++ LR+GP+ AEW +GG P WL +PGI FRT N P
Sbjct: 100 GKFNFSGRADLVKFIKLIEKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNKP 159
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FKE +R++ I+D MK+E LFASQGGPIIL Q+ENEY V+ AY G Y+KWA+
Sbjct: 160 FKEHTERYVRMILDKMKEERLFASQGGPIILGQIENEYSAVQRAYKQDGLNYIKWASKLV 219
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
++ +PWVMC+Q DAPDP+IN CNG +C D F PN +KP +WTEN++ F FG
Sbjct: 220 DSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKENKPSLWTENWTTQFRVFGDP 279
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
R VED+A++VARFF G+ NYYMY GGTNFGRT+ + YD DAP+DEYG
Sbjct: 280 PTQRSVEDIAYSVARFFSKNGSHVNYYMYHGGTNFGRTSAHYVTTRYYD-DAPLDEYGLE 338
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYDSSS 361
R+PK+GHL+ LH A+ LC++ L+ P +K G E Y + + CAAFLAN ++ +
Sbjct: 339 REPKYGHLKHLHSALNLCKKPLLWGQPKTEKPGKDTEIRYYEQPGTKTCAAFLANNNTEA 398
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
+ F G Y + S+SILPDCK VV+NTA+++SQ + + F + K N+
Sbjct: 399 AETIKFKGREYVIAPRSISILPDCKTVVYNTAQIVSQHTSRN--FMKSKKANKKF----D 452
Query: 422 FSWYEEKV--GISGNRSFVRPDLAEQINTTKDTSDYLWYTASI-----HVMPGQGKEVFL 474
F + E + + GN S++ +L TKD +DY WYT S H+ +G + F+
Sbjct: 453 FKVFTETLPSKLEGN-SYIPVEL---YGLTKDKTDYGWYTTSFKVHKNHLPTKKGVKTFV 508
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
I SLGHA +++N + + G+G+H+ +F+ K++ L G N L +L ++ G + G++
Sbjct: 509 RIASLGHALHIWLNGEYLGSGHGSHEEKSFVFQKQVTLKAGENHLIMLGVLTGFPDSGSY 568
Query: 535 FDVAGAGLFSVILIDLKNGKRDLS-SGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
+ G V ++ L +G DL+ S +W ++G+EGE +G+ WK+ +
Sbjct: 569 MEHRYTGPRGVSILGLTSGTLDLTESSKWGNKIGMEGEKLGIHTEEGLKKVEWKKFTGKA 628
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
L WY+ F APE A+ + MGKG WVNG+ +GRYW ++L+P
Sbjct: 629 --PGLTWYQAYFDAPESLNAAAIRMNGMGKGLIWVNGEGVGRYWQSFLSP---------- 676
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI-HEELGGDPSKISLLTKTGQHIC 712
GQP Q YHIPR+++ P +NLLVI EE P + + +C
Sbjct: 677 ------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELMDFVIVNRDTVC 724
Query: 713 SFVSEADPPPVDSWKPNLGVVSS-----SPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
S+V E P V W V + S L C IAA+ FAS+G P G CG+F
Sbjct: 725 SYVGENYTPSVRHWTRKQDQVQAITDNVSLTATLKCSGTKKIAAVEFASFGNPIGVCGNF 784
Query: 768 RPGACHMDVLP-IVQKACVGQIECSIPVS-SAYLGVSAGACPGLLKALAVEAHC 819
G C+ V +++K C+G+ EC IPV+ S + +C + K LAV+ C
Sbjct: 785 TLGTCNAPVSKQVIEKHCLGKAECVIPVNKSTFQQDKKDSCKNVAKTLAVQVKC 838
>gi|225438369|ref|XP_002274012.1| PREDICTED: beta-galactosidase 6-like [Vitis vinifera]
Length = 758
Score = 640 bits (1652), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/713 (48%), Positives = 442/713 (61%), Gaps = 49/713 (6%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A VTYD R+L+IDG R++L SGSIHYPRSTP++W LI K+KEGG++VI+TYVFWN HEP
Sbjct: 60 AQVTYDGRSLIIDGHRKILFSGSIHYPRSTPQMWASLIAKAKEGGVDVIQTYVFWNRHEP 119
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
GQY F GR+DL +F+K +Q GL+ LRIGP+ +EW+YGG P WLH + GI +RT N
Sbjct: 120 QPGQYDFNGRYDLAKFIKEIQAQGLYACLRIGPFIESEWSYGGLPFWLHDVHGIVYRTDN 179
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M+ F KI++LMK E L+ASQGGPIIL+Q+ENEY N+E A+ G YV+WAA
Sbjct: 180 EPFKFYMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYQNIEAAFNEKGPSYVRWAAK 239
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFG 240
AV L T VPWVMC+Q DAPDP+INTCNG C FT PNSP+KP MWTEN++ ++ FG
Sbjct: 240 MAVELQTGVPWVMCKQSDAPDPVINTCNGMRCGQTFTGPNSPNKPSMWTENWTSFYEVFG 299
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
R ED+AF VA F G++ NYYMY GGTNFGR A + TSY AP+DEYG
Sbjct: 300 GETYLRSAEDIAFHVALFIARNGSYVNYYMYHGGTNFGR-ASSAYIKTSYYDQAPLDEYG 358
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IRQPKWGHL+ELH AI LC L++ ++ LG EA+++ + C AFL N D
Sbjct: 359 LIRQPKWGHLKELHAAITLCSTPLLNGVQSNISLGQLQEAYVFQEEMGGCVAFLVNNDEG 418
Query: 361 SDANVTF-NGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
+++ V F N ++ LP S+SILPDCKNV+FNTAK+ + N + +S
Sbjct: 419 NNSTVLFQNVSIELLPK-SISILPDCKNVIFNTAKINTGYN------------ERIATSS 465
Query: 420 SAFS----WYEEKVGISG--NRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQG-KEV 472
+F W E K I + S + E +N TKD SDYLWYT P E
Sbjct: 466 QSFDAVDRWEEYKDAIPNFLDTSLKSNMILEHMNMTKDESDYLWYT--FRFQPNSSCTEP 523
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L+IESL HA FVN V +G+HD F I LN +N + ILS+MVG + G
Sbjct: 524 LLHIESLAHAVHAFVNNIYVGATHGSHDMKGFTFKSPISLNNEMNNISILSVMVGFPDSG 583
Query: 533 AWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
A+ + AGL V + + G D ++ W YQVG+ GE + + K ++ W++ + +
Sbjct: 584 AYLESRFAGLTRVEIQCTEKGIYDFANYTWGYQVGLSGEKLHIYKEENLSNVEWRK-TEI 642
Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
N+ L WYK F P G P+ALNL++MGKG+AWVNGQSIGRYW
Sbjct: 643 STNQPLTWYKIVFNTPSGDDPVALNLSTMGKGEAWVNGQSIGRYWV-------------- 688
Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
S+ SK G P+QTLYH+PR ++ ENLLV+ EE GDP ISL T
Sbjct: 689 ---SFHNSK-----GDPSQTLYHVPRAFLKTSENLLVLLEEANGDPLHISLET 733
>gi|227053532|gb|ACP18874.1| beta-galactosidase pBG(b) [Carica papaya]
Length = 514
Score = 640 bits (1650), Expect = e-180, Method: Compositional matrix adjust.
Identities = 306/497 (61%), Positives = 370/497 (74%), Gaps = 17/497 (3%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A+V+YDH+A+ I+GKRR+L SGSIHYPRSTPE+WP+LI+K+KEGGL+VI+TYVFWN HEP
Sbjct: 19 ASVSYDHKAITINGKRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEP 78
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G+YYF G +DLVRF+K V++AGL++HLRIGPY CAEWN+GGFPVWL +IPGI FRT N
Sbjct: 79 SPGKYYFGGNYDLVRFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGIAFRTNN 138
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M+RF KI+D+MK E LF SQGGPIIL+Q+ENEYG +E+ G G Y +WAA
Sbjct: 139 GPFKAYMQRFTKKIVDMMKAEGLFESQGGPIILSQIENEYGPMEYELGAAGRAYSQWAAQ 198
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
AV L T VPWVMC+Q+DAPDPIIN+CNGFYCD F+PN KP MWTE ++GWF FG A
Sbjct: 199 MAVGLGTGVPWVMCKQDDAPDPIINSCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGA 258
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
VP+RPVEDLAF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG +
Sbjct: 259 VPYRPVEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLV 318
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPKWGHL++LH+AIKLCE L+S DP+ LG EAH++ CAAFLANY+ S
Sbjct: 319 RQPKWGHLKDLHRAIKLCEPALVSGDPSVMPLGRFQEAHVFKSKYGHCAAFLANYNPRSF 378
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A V F Y LP WS+SILPDCKN V+NTA+V +Q A+ K V + AF
Sbjct: 379 AKVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQS-------ARMKMVP--VPIHGAF 429
Query: 423 SWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
SW EE +G RSF L EQINTT+D SDYLWY+ + + P + GK L
Sbjct: 430 SWQAYNEEAPSSNGERSFTTVGLVEQINTTRDVSDYLWYSTDVKIDPDEGFLKTGKYPTL 489
Query: 475 NIESLGHAALVFVNKKL 491
+ S GHA VFVN +L
Sbjct: 490 TVLSAGHALHVFVNDQL 506
>gi|357464801|ref|XP_003602682.1| Beta-galactosidase [Medicago truncatula]
gi|355491730|gb|AES72933.1| Beta-galactosidase [Medicago truncatula]
Length = 719
Score = 640 bits (1650), Expect = e-180, Method: Compositional matrix adjust.
Identities = 333/706 (47%), Positives = 445/706 (63%), Gaps = 37/706 (5%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD R+L+I+G+R +L SGSIHYPRSTP++WP LI K+K+GGL+VI+TYVFWN HEP
Sbjct: 27 VTYDGRSLIINGQRNILFSGSIHYPRSTPQMWPGLIAKAKQGGLDVIQTYVFWNLHEPQP 86
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G+Y F GR DLV F+K + GL++ LRIGP+ +EWNYGGFP WLH +PGI +RT N P
Sbjct: 87 GKYDFSGRNDLVGFIKEIHAQGLYVSLRIGPFIESEWNYGGFPFWLHDVPGIVYRTDNEP 146
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK M+ F KI+++MK+E L+ASQGGPIIL+Q+ENEYGN++ A+G G YV+WAA A
Sbjct: 147 FKFYMQNFTTKIVNMMKEEGLYASQGGPIILSQIENEYGNIQKAFGTAGSQYVEWAAKMA 206
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
V LNT VPWVMC+Q DAPDP+INTCNG C + FT PNSP+KP MWTEN++ ++ +G
Sbjct: 207 VGLNTGVPWVMCKQPDAPDPVINTCNGMRCGETFTGPNSPNKPAMWTENWTSFYQVYGGV 266
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
R ED+AF V F G+F NYYMY GGTNFGRT+ ++ YD AP+DEYG
Sbjct: 267 PYIRSAEDIAFHVTLFVARNGSFVNYYMYHGGTNFGRTSSAYMITGYYD-QAPLDEYGLF 325
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPKWGHL+ELH AIK C L+ + LG E +++ + + CAAFL N D +
Sbjct: 326 RQPKWGHLKELHAAIKSCSTTLLQGVQRNFSLGELQEGYVFEEENGKCAAFLINNDKGNT 385
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
V FN + Y L S+SILPDC+NV FNTA + + N ++ + SS
Sbjct: 386 VTVQFNNSSYKLLPKSISILPDCQNVAFNTAHLNTTSN--------RRIITSRQNFSSVD 437
Query: 423 SW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
W +++ + + S L EQ+NTTKD SDYLWYT + + L+++S
Sbjct: 438 DWKQFQDVIPNFDDTSLRSDSLLEQMNTTKDKSDYLWYTLRLENNLSCNDPI-LHVQSSA 496
Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
H A FVN + +GNHD +F + I LNE N + ILS MVGL + GA+ + A
Sbjct: 497 HVAYAFVNNTYIGGEHGNHDVKSFTLELPITLNERTNNISILSGMVGLPDSGAFLEKRFA 556
Query: 541 GLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK-SLI 599
GL +V L + +L++ W YQVG+ GE + + + W Q + +++ +L
Sbjct: 557 GLNNVELQCSEQESLNLNNSTWGYQVGLLGEQLKVYTEQNSTDIKWTQLGNITIDEVTLT 616
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
WYKTTF P+G P+AL+L+SM KG+AWVNGQSIGRYW +L
Sbjct: 617 WYKTTFDTPKGDDPIALDLSSMAKGEAWVNGQSIGRYWILFLDSK--------------- 661
Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
G P+Q+LYH+PR+++ EN LV+ +E GG+P ISL T
Sbjct: 662 -------GNPSQSLYHVPRSFLKDSENSLVLLDEGGGNPLDISLNT 700
>gi|449454199|ref|XP_004144843.1| PREDICTED: beta-galactosidase 13-like [Cucumis sativus]
gi|449506996|ref|XP_004162905.1| PREDICTED: beta-galactosidase 13-like [Cucumis sativus]
Length = 766
Score = 639 bits (1649), Expect = e-180, Method: Compositional matrix adjust.
Identities = 349/803 (43%), Positives = 474/803 (59%), Gaps = 55/803 (6%)
Query: 35 VWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIG 94
+W +++ K++ GGL VI+TYVFWN HEP+ GQ+ FEG +DLV+F+K + E +++ LR+G
Sbjct: 1 MWSDILDKARRGGLNVIQTYVFWNIHEPVEGQFNFEGNYDLVKFIKLIGEKQMYVTLRVG 60
Query: 95 PYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPII 154
P+ AEWN+GG P WL P I FR+ N+ FK MK+++A I+D+MK+ LFASQGGPI+
Sbjct: 61 PFIQAEWNHGGLPYWLREKPNIIFRSYNSQFKHYMKKYVAMIVDMMKENKLFASQGGPIV 120
Query: 155 LAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC 214
LAQ+ENEY +V+ AY G YV+WAA+ AV L VPW+MC+Q+DAPDP+INTCNG +C
Sbjct: 121 LAQIENEYNHVQLAYDELGVQYVQWAANMAVGLGVGVPWIMCKQKDAPDPVINTCNGRHC 180
Query: 215 -DGFT-PNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYF 272
D FT PN P KP +WTEN++ + FG R ED+AF+VARFF G+ NYYMY
Sbjct: 181 GDTFTGPNKPYKPALWTENWTAQYRVFGDPPSQRAAEDIAFSVARFFSKNGSLVNYYMYH 240
Query: 273 GGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQ 332
GGTNFGRT+ T Y +AP+DE+G R+PKWGHLR++HKA+ LC++ L+ P Q
Sbjct: 241 GGTNFGRTS-AVFTTTRYYDEAPLDEFGLQREPKWGHLRDVHKALNLCKKPLLWGTPGIQ 299
Query: 333 KLGAKLEAHIYHK-SSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFN 391
+G LEA Y K +N CAAFLAN D+ S + F G + LP S+SILPDCK VVFN
Sbjct: 300 VIGKGLEARFYEKPGTNICAAFLANNDTKSAQTINFRGREFLLPPRSISILPDCKTVVFN 359
Query: 392 TAKVISQRNNGDHPFAQQKNVNELLLASSAFSW-YEEKVGISGNRSFVRPDLAEQINTTK 450
T ++SQ N + F KN N+L S S E+V ++ E + K
Sbjct: 360 TETIVSQHNARN--FIPSKNANKLKWKMSPESIPTVEQVPVNNKIPL------ELYSLLK 411
Query: 451 DTSDYLWYTASIHVMPGQGKEV-----FLNIESLGHAALVFVNKKLVAFGYGNHDFANFL 505
DT+DY WYT SI + + L I SLGHA LVFVN + + +G+H+ NF+
Sbjct: 412 DTTDYGWYTTSIELDKEDVSKRPDILPVLRIASLGHAMLVFVNGEYIGTAHGSHEEKNFV 471
Query: 506 INKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQ 565
+ G+N + +L ++VGL + GA+ + AG S+ ++ L G D+S W +Q
Sbjct: 472 FQGSVPFKAGVNNIALLGILVGLPDSGAYMEHRFAGPRSITILGLNTGTLDISKNGWGHQ 531
Query: 566 VGVEGEYIGLDKISLANSSFWKQGSTLPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKG 624
V ++GE + K+ S S + KS L WYKT F APEG P+A+ + MGKG
Sbjct: 532 VALQGEKV---KVFTQGGSHRVDWSEIKEEKSALTWYKTYFDAPEGNDPVAIRMNGMGKG 588
Query: 625 QAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPG 684
Q WVNG+SIGRYW +YL+P T Q+ YHIPR+++ P
Sbjct: 589 QIWVNGKSIGRYWMSYLSPLKLST----------------------QSEYHIPRSFIKPS 626
Query: 685 ENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSW----KPNLGVVSSSPQ-V 739
ENLLVI EE P K+ +L ICSF+++ PP V SW K VV
Sbjct: 627 ENLLVILEEENVTPEKVEILLVNRDTICSFITQYHPPNVKSWERKDKQFRAVVDDVKTGA 686
Query: 740 RLACERGWHIAAINFASYGIPEGNCGSFRPGACH--MDVLPIVQKACVGQIECSIPVSSA 797
L C I I FAS+G P G CG+F G CH D +V++ C+G+ CS+P+ +
Sbjct: 687 HLRCPHDKKITNIEFASFGDPSGVCGNFEHGKCHSSSDTKKLVEQHCLGKENCSVPMDA- 745
Query: 798 YLGVSAGACPGLLKALAVEAHCS 820
C K LA++A CS
Sbjct: 746 -FDNFKNECDS--KTLAIQAKCS 765
>gi|10862896|emb|CAC13966.1| putative beta-galactosidase [Nicotiana tabacum]
Length = 715
Score = 638 bits (1646), Expect = e-180, Method: Compositional matrix adjust.
Identities = 330/710 (46%), Positives = 442/710 (62%), Gaps = 40/710 (5%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ VTYD R+++++G+R +L SGSIHYPR PE+WP++IRK+KEGGL +I+TYVFWN HE
Sbjct: 25 TKGVTYDGRSMIVNGERELLFSGSIHYPRMPPEMWPDIIRKAKEGGLNLIQTYVFWNIHE 84
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P++GQ+ FEG +D+V+F+KT+ E GL++ LRIGPY AEWN GGFP WL +P I FR+
Sbjct: 85 PVQGQFNFEGNYDVVKFIKTIGEQGLYVTLRIGPYIEAEWNQGGFPYWLREVPNITFRSY 144
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PF MK++ +IDLMK+E LFA QGGPII+AQ+ENEY NV+ AY G+ YV+WAA
Sbjct: 145 NEPFIHHMKKYSEMVIDLMKKEKLFAPQGGPIIMAQIENEYNNVQLAYRDNGKKYVEWAA 204
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSF 239
+ A L VPW+MC+Q+DAP +INTCNG +C D FT PN P+KP +WTEN++ + +F
Sbjct: 205 NMATGLYNGVPWIMCKQKDAPAQVINTCNGRHCADTFTGPNGPNKPSLWTENWTAQYRTF 264
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
G R ED+AF+VARFF GT NYYMY+GGTN+GRT G V T Y +AP+DE+
Sbjct: 265 GDPPSQRAAEDIAFSVARFFAKNGTLTNYYMYYGGTNYGRT-GSSFVTTRYYDEAPLDEF 323
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
G R+PKW HLR+LH+A++L L+ P+ QK+ LE +Y K DCAAFL N +
Sbjct: 324 GLYREPKWSHLRDLHRALRLSRRALLWGTPSVQKINQHLEITVYEKPGTDCAAFLTNNHT 383
Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDH-PFAQQKNVNELLLA 418
+ A + F G Y+LP SVSILPDCK + NT ++SQ N+ + P + KN+
Sbjct: 384 TLPATIKFRGREYYLPEKSVSILPDCKLLSTNTQTIVSQHNSRNFLPSEKAKNLK----- 438
Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASI----HVMPGQGKEV-F 473
+ Y+EKV + S + E + TKDTSDY WY+ SI H +P + +
Sbjct: 439 ---WEMYQEKVPTISDLSLKNREPLELYSLTKDTSDYAWYSTSINFDRHDLPMRPDILPV 495
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L I S+GHA FVN + V FG+GN+ +F+ K + L G NT+ IL+ VG N GA
Sbjct: 496 LQIASMGHALSAFVNGEFVGFGHGNNIEKSFVFQKPVILKPGTNTISILAETVGFPNSGA 555
Query: 534 WFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
+ + AG + + L G D++ W ++VGV GE L A W + P
Sbjct: 556 YMEKRFAGPRGITVQGLMAGTLDITQNNWGHEVGVFGEKEQLFTEEGAKKVKWTPVNG-P 614
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
++ WYKT F APEG P+AL + M KG WVNG S+GRYWS++L+P
Sbjct: 615 TKGAVTWYKTYFDAPEGNNPVALKMDKMQKGMMWVNGNSLGRYWSSFLSP---------- 664
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISL 703
GQP Q YHIPR ++ P NLLVI EE GG P I +
Sbjct: 665 ------------LGQPTQFEYHIPRAFLKPTNNLLVIFEETGGHPETIEV 702
>gi|320170654|gb|EFW47553.1| beta-D-galactosidase [Capsaspora owczarzaki ATCC 30864]
Length = 830
Score = 638 bits (1646), Expect = e-180, Method: Compositional matrix adjust.
Identities = 352/847 (41%), Positives = 475/847 (56%), Gaps = 68/847 (8%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ NVTYD RAL+IDG+RR+L SGSIHYPRSTP++WPEL ++K G++VI+TY+FWN +
Sbjct: 24 AMNVTYDSRALLIDGRRRLLVSGSIHYPRSTPDMWPELFARAKANGIDVIQTYLFWNTNV 83
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G++ RFD VRFV+ QEAGL+++ RIGP+ CAEW YGG P WL IP I FR
Sbjct: 84 PTPGEFVMSDRFDYVRFVQLAQEAGLYVNFRIGPFVCAEWTYGGLPAWLRQIPDIMFRDY 143
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
+ P+ + ++ K + ++K L A QGGPIIL Q+ENEYG E Y GG YV+W
Sbjct: 144 DQPWLQVAGEYITKTVQILKDNRLLAGQGGPIILLQIENEYGGTESRYA-GGPQYVEWCG 202
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A NL + W+MC Q DAP II TCN FYCD F P+ P +P MWTEN+ GWF +G
Sbjct: 203 QLAANLTDAAQWIMCSQPDAPANIIATCNAFYCDDFVPH-PGQPSMWTENWPGWFQKWGD 261
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
P RP +D+A+AV R++ GG++ NYYMY GGTNF RTAGGP + T+YDYDA +DEYG
Sbjct: 262 PTPHRPAQDVAYAVTRYYIKGGSYMNYYMYHGGTNFERTAGGPFITTNYDYDASLDEYGM 321
Query: 302 IRQPKWGHLRELHKAIKLCEEYLIS-SDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+PK+ HL +H + E +++ P LG LEAHIY+ SS C AFL+N ++
Sbjct: 322 PNEPKYSHLGSMHAVLHDNEAIMMAVPAPKPISLGTNLEAHIYN-SSVGCVAFLSNNNNK 380
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFA------------- 407
+D V FNG Y LPAWSVS+L C ++NTA V H A
Sbjct: 381 TDVEVQFNGRTYELPAWSVSVLHGCVTAIYNTA-VCRAHQRAPHDAACCARESRRVCDRL 439
Query: 408 -----------QQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYL 456
Q + L L + N++ + EQI+ T D +DYL
Sbjct: 440 PPLRPKARAPCQSGRIRHLCLVVLTSIGPQAPATKYWNKTPL-----EQIDQTLDHTDYL 494
Query: 457 WYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGI 516
WY+ S + L++ + A V+VN K V + + ++ + L G
Sbjct: 495 WYSTSY--VSSSATYAQLSLPQITDVAYVYVNGKFVTVSWSGN------VSATVSLVAGP 546
Query: 517 NTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLD 576
NT+DILS+ +GL N G GL + + G +L+ W +Q GV GE +
Sbjct: 547 NTIDILSLTMGLDNGGDILSEYNCGLLGGVYL----GSVNLTENGWWHQTGVVGERNAIF 602
Query: 577 KISLANSSFWKQGSTLPVNKSLIWYKTTFLAP-EGKGPLALNLASMGKGQAWVNGQSIGR 635
W + L N L WYK++F P + + PLAL+L MGKG WVNG ++GR
Sbjct: 603 LPENLKKVAWTTPAVL--NTGLTWYKSSFDVPRDSQAPLALDLTGMGKGYVWVNGHNLGR 660
Query: 636 YWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
YW LA + C CDYRG+YDA C++ C P+QT YH+PR W+ N+LV+ EE+G
Sbjct: 661 YWPTILATNWPC-DVCDYRGTYDAPHCKQGCNMPSQTHYHVPREWLQAENNVLVLLEEMG 719
Query: 696 GDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFA 755
G+PSKI+L+ + C V E P +L VV L C IA ++FA
Sbjct: 720 GNPSKIALVEREEYVSCGVVGEDYP------ADDLAVV-------LGCGTHQTIAGVDFA 766
Query: 756 SYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLL-KAL 813
SYG P G+C S++ G+CH + IV C G+ CSIPVS+A G CP + K L
Sbjct: 767 SYGTPMGSCRSYQQGSCHASNSTEIVLSLCHGKQACSIPVSAAMFG---NPCPDVTNKRL 823
Query: 814 AVEAHCS 820
AV+ C+
Sbjct: 824 AVQVACA 830
>gi|125536446|gb|EAY82934.1| hypothetical protein OsI_38151 [Oryza sativa Indica Group]
Length = 705
Score = 638 bits (1646), Expect = e-180, Method: Compositional matrix adjust.
Identities = 329/634 (51%), Positives = 421/634 (66%), Gaps = 43/634 (6%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NVTYDHRA++I GKRR+L S +HYPR+TPE+WP LI K KEGG +VIETYVFWN HEP
Sbjct: 63 NVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKFKEGGADVIETYVFWNGHEPA 122
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+GQYYFE RFDLV+F K V GLFL LRIGPYACAEWN+GGFPVWL IPGI+FRT N
Sbjct: 123 KGQYYFEERFDLVKFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTDNE 182
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK EM+ F+ KI+ LMK+E L++ QGGPIIL Q+ENEYGN++ YG G+ Y++WAA
Sbjct: 183 PFKAEMQTFVTKIVTLMKEEKLYSWQGGPIILQQIENEYGNIQGNYGQAGKRYMQWAAQM 242
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A+ L+T +PWVMC+Q DAP+ II+TCN FYCDGF PNS +KP +WTE++ GW+ +G A+
Sbjct: 243 AIGLDTGIPWVMCRQTDAPEEIIDTCNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGGAL 302
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RP ED AFAVARF++ GG+ QNYYMYFGGTNF RTAGGPL TSYDYDAPIDEYG +R
Sbjct: 303 PHRPAEDSAFAVARFYQRGGSLQNYYMYFGGTNFARTAGGPLQITSYDYDAPIDEYGILR 362
Query: 304 QPKWGHLRELHKAIKLCEEYLIS--SDPTHQKLGAKLEAHIYHK-----------SSNDC 350
QPKWGHL++LH AIKLCE LI+ P + KLG+ EAH+Y ++ C
Sbjct: 363 QPKWGHLKDLHTAIKLCEPALIAVVGSPQYIKLGSMQEAHVYSTGEVHTNGSMAGNAQIC 422
Query: 351 AAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN----NGDHPF 406
+AFLAN D A+V G Y LP WSVSILPDC+NV FNTA++ +Q + P
Sbjct: 423 SAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCENVAFNTARIGAQTSVFTVESGSPS 482
Query: 407 AQQKNVNELLLASS-----AFSWY--EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYT 459
++ +L +S + +W+ +E +G G +F + E +N TKD SDYLWYT
Sbjct: 483 RSSRHKPSILSLTSGGPYLSSTWWTSKETIGTWGGNNFAVQGILEHLNVTKDISDYLWYT 542
Query: 460 ASIHVMPG-------QGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIEL 512
+++ +G L I+ + A VFVN KL G+ + + I+L
Sbjct: 543 TRVNISDADVAFWSSKGVLPSLTIDKIRDVARVFVNGKLAGSQVGHW----VSLKQPIQL 598
Query: 513 NEGINTLDILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGE 571
EG+N L +LS +VGLQNYGA+ + GAG V L L +G DL++ W YQVG++GE
Sbjct: 599 VEGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVTLTGLSDGDVDLTNSLWTYQVGLKGE 658
Query: 572 YIGL---DKISLANSSFWKQGSTLPVNKSLIWYK 602
+ + +K A S ++ S P WYK
Sbjct: 659 FSMIYAPEKQGCAGWSRMQKDSVQP----FTWYK 688
>gi|18418558|ref|NP_567973.1| beta-galactosidase 11 [Arabidopsis thaliana]
gi|75202765|sp|Q9SCV1.1|BGL11_ARATH RecName: Full=Beta-galactosidase 11; Short=Lactase 11; Flags:
Precursor
gi|6686894|emb|CAB64747.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332661046|gb|AEE86446.1| beta-galactosidase 11 [Arabidopsis thaliana]
Length = 845
Score = 638 bits (1646), Expect = e-180, Method: Compositional matrix adjust.
Identities = 343/837 (40%), Positives = 492/837 (58%), Gaps = 54/837 (6%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ VTYD +L+IDGKR +L SGSIHYPRSTPE+WP +I+++K+GGL I+TYVFWN HE
Sbjct: 38 NKEVTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHE 97
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P +G++ F GR DLV+F+K +Q+ G+++ LR+GP+ AEW +GG P WL +PGI FRT
Sbjct: 98 PQQGKFNFSGRADLVKFIKLIQKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTD 157
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N FKE +R++ I+D MK+E LFASQGGPIIL Q+ENEY V+ AY G Y+KWA+
Sbjct: 158 NKQFKEHTERYVRMILDKMKEERLFASQGGPIILGQIENEYSAVQRAYKQDGLNYIKWAS 217
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSF 239
+ ++ +PWVMC+Q DAPDP+IN CNG +C D F PN +KP +WTEN++ F F
Sbjct: 218 NLVDSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNRENKPSLWTENWTTQFRVF 277
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
G R VED+A++VARFF GT NYYMY GGTNFGRT+ + YD DAP+DEY
Sbjct: 278 GDPPTQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAHYVTTRYYD-DAPLDEY 336
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYD 358
G ++PK+GHL+ LH A+ LC++ L+ P +K G E Y + + CAAFLAN +
Sbjct: 337 GLEKEPKYGHLKHLHNALNLCKKPLLWGQPKTEKPGKDTEIRYYEQPGTKTCAAFLANNN 396
Query: 359 SSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
+ + + F G Y + S+SILPDCK VV+NTA+++SQ + + F + K N+
Sbjct: 397 TEAAETIKFKGREYVIAPRSISILPDCKTVVYNTAQIVSQHTSRN--FMKSKKANKKF-- 452
Query: 419 SSAFSWYEEKV--GISGNRSFVRPDLAEQINTTKDTSDYLWYTASI-----HVMPGQGKE 471
F + E + + GN S++ +L TKD +DY WYT S H+ +G +
Sbjct: 453 --DFKVFTETLPSKLEGN-SYIPVEL---YGLTKDKTDYGWYTTSFKVHKNHLPTKKGVK 506
Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
F+ I SLGHA ++N + + G+G+H+ +F+ K++ L G N L +L ++ G +
Sbjct: 507 TFVRIASLGHALHAWLNGEYLGSGHGSHEEKSFVFQKQVTLKAGENHLVMLGVLTGFPDS 566
Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLS-SGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
G++ + G + ++ L +G DL+ S +W ++G+EGE +G+ WK+ +
Sbjct: 567 GSYMEHRYTGPRGISILGLTSGTLDLTESSKWGNKIGMEGEKLGIHTEEGLKKVEWKKFT 626
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
L WY+T F APE + + MGKG WVNG+ +GRYW ++L+P
Sbjct: 627 GKA--PGLTWYQTYFDAPESVSAATIRMHGMGKGLIWVNGEGVGRYWQSFLSP------- 677
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI-HEELGGDPSKISLLTKTGQ 709
GQP Q YHIPR+++ P +NLLVI EE P +
Sbjct: 678 ---------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELMDFAIVNRD 722
Query: 710 HICSFVSEADPPPVDSWKPNLGVVSS-----SPQVRLACERGWHIAAINFASYGIPEGNC 764
+CS+V E P V W V + S L C IAA+ FAS+G P G C
Sbjct: 723 TVCSYVGENYTPSVRHWTRKKDQVQAITDNVSLTATLKCSGTKKIAAVEFASFGNPIGVC 782
Query: 765 GSFRPGACHMDV-LPIVQKACVGQIECSIPVS-SAYLGVSAGACPGLLKALAVEAHC 819
G+F G C+ V +++K C+G+ EC IPV+ S + +C ++K LAV+ C
Sbjct: 783 GNFTLGTCNAPVSKQVIEKHCLGKAECVIPVNKSTFQQDKKDSCKNVVKMLAVQVKC 839
>gi|297836382|ref|XP_002886073.1| beta-galactosidase 13 [Arabidopsis lyrata subsp. lyrata]
gi|297331913|gb|EFH62332.1| beta-galactosidase 13 [Arabidopsis lyrata subsp. lyrata]
Length = 848
Score = 637 bits (1644), Expect = e-180, Method: Compositional matrix adjust.
Identities = 348/834 (41%), Positives = 493/834 (59%), Gaps = 54/834 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD +L+I+G R +L SGSIHYPRSTPE+WP +I+++K+GGL I+TYVFWN HEP +
Sbjct: 44 VTYDGTSLIINGNRELLYSGSIHYPRSTPEMWPNIIKRAKQGGLNTIQTYVFWNVHEPEQ 103
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G++ F GR DLV+F+K +++ G+++ LR+GP+ AEW +GG P WL +PGI FRT N P
Sbjct: 104 GKFNFSGRADLVKFIKLIEKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNTP 163
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FKE +R++ I+D MK+E LFASQGGPIIL Q+ENEY V+ AY G Y+KWA+
Sbjct: 164 FKEHTERYVKVILDKMKEEKLFASQGGPIILGQIENEYSAVQRAYKEDGLNYIKWASKLV 223
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
+++ +PWVMC+Q DAPDP+IN CNG +C D F PN +KP +WTEN++ F +G
Sbjct: 224 HSMDLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKENKPSLWTENWTTQFRVYGDP 283
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
R VED+A++VARFF GT NYYMY GGTNFGRT+ + YD DAP+DEYG
Sbjct: 284 PAQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAHYVTTRYYD-DAPLDEYGLE 342
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSS 361
R+PK+GHL+ LH A+ LC++ L+ P +K + E Y + CAAFLAN ++ S
Sbjct: 343 REPKYGHLKHLHNALNLCKKALLWGQPRVEKPSNETEIRYYEQPGTKVCAAFLANNNTES 402
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
+ F G Y +P S+SILPDCK VV+NT ++IS + + F + K N+ +
Sbjct: 403 AEKIKFKGKEYIIPHRSISILPDCKTVVYNTGEIISHHTSRN--FMKSKKANK----NFD 456
Query: 422 FSWYEEKV--GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQGKEVFL 474
F + E V I G+ S++ +L TKD +DY WYT S + +G + L
Sbjct: 457 FKVFTETVPSKIKGD-SYIPVEL---YGLTKDETDYGWYTTSFKIDDNDLSKKKGSKPTL 512
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
I SLGHA V++N + + G+G+H+ +F+ K I L EG N L +L ++ G + G++
Sbjct: 513 RIASLGHALHVWLNGEYLGNGHGSHEEKSFVFQKPISLKEGENHLTMLGVLTGFPDSGSY 572
Query: 535 FDVAGAGLFSVILIDLKNGKRDLS-SGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
+ G SV ++ L +G DL+ +W +VG+EGE +G+ W++ S
Sbjct: 573 MEHRYTGPRSVSILGLGSGTLDLTEENKWGNKVGMEGEKLGIHAEEGLKKVKWQKFSG-- 630
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
L WY+T F APE + A+ + MGKG WVNG+ +GRYW ++L+P
Sbjct: 631 KEPGLTWYQTYFDAPESQSAAAIRMNGMGKGLIWVNGEGVGRYWMSFLSP---------- 680
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI-HEELGGDPSKISLLTKTGQHIC 712
GQP Q YHIPR+++ P +NLLVI EE P I + +C
Sbjct: 681 ------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELIDFVIINRDTVC 728
Query: 713 SFVSEADPPPVDSW-KPNLGVVSSSPQVRLA----CERGWHIAAINFASYGIPEGNCGSF 767
S + E P V W + N V + + V L C I+ + FAS+G P G CG+F
Sbjct: 729 SHIGENYTPSVRHWTRKNDQVQAITDDVHLTASLKCSGTKKISEVEFASFGNPNGTCGNF 788
Query: 768 RPGACHMDV-LPIVQKACVGQIECSIPVS-SAYLGVSAGACPGLLKALAVEAHC 819
G C+ V +V+K C+G+ EC IPV+ S + +CP + K LAV+ C
Sbjct: 789 TLGTCNAPVSKKVVEKYCLGKAECVIPVNKSTFQQDKKDSCPKVEKKLAVQVKC 842
>gi|152013366|sp|Q9SCU8.2|BGL14_ARATH RecName: Full=Beta-galactosidase 14; Short=Lactase 14; Flags:
Precursor
Length = 887
Score = 637 bits (1644), Expect = e-180, Method: Compositional matrix adjust.
Identities = 347/833 (41%), Positives = 487/833 (58%), Gaps = 54/833 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD +L+I+GKR +L SGS+HYPRSTP +WP +I K++ GGL I+TYVFWN HEP +
Sbjct: 41 VTYDGTSLIINGKRELLFSGSVHYPRSTPHMWPSIIDKARIGGLNTIQTYVFWNVHEPEQ 100
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G+Y F+GRFDLV+F+K + E GL++ LR+GP+ AEWN+GG P WL +P + FRT N P
Sbjct: 101 GKYDFKGRFDLVKFIKLIHEKGLYVTLRLGPFIQAEWNHGGLPYWLREVPDVYFRTNNEP 160
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FKE +R++ KI+ +MK+E LFASQGGPIIL Q+ENEY V+ AY GE Y+KWAA+
Sbjct: 161 FKEHTERYVRKILGMMKEEKLFASQGGPIILGQIENEYNAVQLAYKENGEKYIKWAANLV 220
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
++N +PWVMC+Q DAP +IN CNG +C D F PN KP +WTEN++ F FG
Sbjct: 221 ESMNLGIPWVMCKQNDAPGNLINACNGRHCGDTFPGPNRHDKPSLWTENWTTQFRVFGDP 280
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
R VED+AF+VAR+F G+ NYYMY GGTNFGRT+ V T Y DAP+DE+G
Sbjct: 281 PTQRTVEDIAFSVARYFSKNGSHVNYYMYHGGTNFGRTSAH-FVTTRYYDDAPLDEFGLE 339
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYDSSS 361
+ PK+GHL+ +H+A++LC++ L Q LG E Y + + CAAFL+N ++
Sbjct: 340 KAPKYGHLKHVHRALRLCKKALFWGQLRAQTLGPDTEVRYYEQPGTKVCAAFLSNNNTRD 399
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
+ F G Y LP+ S+SILPDCK VV+NTA++++Q + D F + + ++ L
Sbjct: 400 TNTIKFKGQDYVLPSRSISILPDCKTVVYNTAQIVAQHSWRD--FVKSEKTSKGL----K 453
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQ-GKEVFLNI 476
F + E + + + P E TKD +DY WYT S+ + P Q G + L +
Sbjct: 454 FEMFSENIPSLLDGDSLIP--GELYYLTKDKTDYAWYTTSVKIDEDDFPDQKGLKTILRV 511
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
SLGHA +V+VN + +G H+ +F K + G N + IL ++ GL + G++ +
Sbjct: 512 ASLGHALIVYVNGEYAGKAHGRHEMKSFEFAKPVNFKTGDNRISILGVLTGLPDSGSYME 571
Query: 537 VAGAGLFSVILIDLKNGKRDLS-SGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
AG ++ +I LK+G RDL+ + EW + G+EGE + + W++
Sbjct: 572 HRFAGPRAISIIGLKSGTRDLTENNEWGHLAGLEGEKKEVYTEEGSKKVKWEKDGK---R 628
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
K L WYKT F PEG +A+ + +MGKG WVNG +GRYW ++L+P
Sbjct: 629 KPLTWYKTYFETPEGVNAVAIRMKAMGKGLIWVNGIGVGRYWMSFLSP------------ 676
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWV--HPGENLLVI-HEELGGDPSKISLLTKTGQHIC 712
G+P QT YHIPR+++ +N+LVI EE G I + IC
Sbjct: 677 ----------LGEPTQTEYHIPRSFMKGEKKKNMLVILEEEPGVKLESIDFVLVNRDTIC 726
Query: 713 SFVSEADPPPVDSWK-PNLGVVSSSPQVRLA----CERGWHIAAINFASYGIPEGNCGSF 767
S V E P V SWK +VS S +RL C + + FAS+G P G CG+F
Sbjct: 727 SNVGEDYPVSVKSWKREGPKIVSRSKDMRLKAVMRCPPEKQMVEVQFASFGDPTGTCGNF 786
Query: 768 RPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G C +V+K C+G+ CSI V+ G CP ++K LAV+ C
Sbjct: 787 TMGKCSASKSKEVVEKECLGRNYCSIVVARETFG--DKGCPEIVKTLAVQVKC 837
>gi|222424809|dbj|BAH20357.1| AT5G56870 [Arabidopsis thaliana]
Length = 620
Score = 637 bits (1644), Expect = e-180, Method: Compositional matrix adjust.
Identities = 319/635 (50%), Positives = 425/635 (66%), Gaps = 25/635 (3%)
Query: 82 VQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMK 141
V +AGL+++LRIGPY CAEWN+GGFPVWL F+PG+ FRT N PFK MK+F KI+ +MK
Sbjct: 2 VHQAGLYVNLRIGPYVCAEWNFGGFPVWLKFVPGMAFRTDNEPFKAAMKKFTEKIVWMMK 61
Query: 142 QENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDA 201
E LF +QGGPIILAQ+ENEYG VEW G G+ Y KW A A+ L+T VPW+MC+QEDA
Sbjct: 62 AEKLFQTQGGPIILAQIENEYGPVEWEIGAPGKAYTKWVAQMALGLSTGVPWIMCKQEDA 121
Query: 202 PDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFET 261
P PII+TCNG+YC+ F PNS +KP MWTEN++GW+ +FG AVP+RPVED+A++VARF +
Sbjct: 122 PGPIIDTCNGYYCEDFKPNSINKPKMWTENWTGWYTNFGGAVPYRPVEDIAYSVARFIQK 181
Query: 262 GGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCE 321
GG+ NYYMY GGTNF RTA G +A+SYDYDAP+DEYG R+PK+ HL+ LHKAIKL E
Sbjct: 182 GGSLVNYYMYHGGTNFDRTA-GEFMASSYDYDAPLDEYGLPREPKYSHLKALHKAIKLSE 240
Query: 322 EYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSI 381
L+S+D T LGAK EA+++ S + CAAFL+N D +S A V F G Y LP WSVSI
Sbjct: 241 PALLSADATVTSLGAKQEAYVFW-SKSSCAAFLSNKDENSAARVLFRGFPYDLPPWSVSI 299
Query: 382 LPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW--YEEKVGISGNR-SFV 438
LPDCK V+NTAKV + P + ++ + FSW + E + +F
Sbjct: 300 LPDCKTEVYNTAKV-------NAPSVHR----NMVPTGTKFSWGSFNEATPTANEAGTFA 348
Query: 439 RPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESLGHAALVFVNKKLVA 493
R L EQI+ T D SDY WY I + G+ G L + S GHA VFVN +L
Sbjct: 349 RNGLVEQISMTWDKSDYFWYITDITIGSGETFLKTGDSPLLTVMSAGHALHVFVNGQLSG 408
Query: 494 FGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VILIDLKN 552
YG D ++KI+L+ G+N + +LS+ VGL N G F+ G+ V L + +
Sbjct: 409 TAYGGLDHPKLTFSQKIKLHAGVNKIALLSVAVGLPNVGTHFEQWNKGVLGPVTLKGVNS 468
Query: 553 GKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKG 612
G D+S +W Y++GV+GE + L + ++ W QGS + + L WYK+TF P G
Sbjct: 469 GTWDMSKWKWSYKIGVKGEALSLHTNTESSGVRWTQGSFVAKKQPLTWYKSTFATPAGNE 528
Query: 613 PLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQT 672
PLAL++ +MGKGQ W+NG++IGR+W AY A G +C+Y G++DA KC +CG+ +Q
Sbjct: 529 PLALDMNTMGKGQVWINGRNIGRHWPAYKA--QGSCGRCNYAGTFDAKKCLSNCGEASQR 586
Query: 673 LYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
YH+PR+W+ +NL+V+ EELGGDP+ ISL+ +T
Sbjct: 587 WYHVPRSWLK-SQNLIVVFEELGGDPNGISLVKRT 620
>gi|147819335|emb|CAN64508.1| hypothetical protein VITISV_004610 [Vitis vinifera]
Length = 766
Score = 637 bits (1643), Expect = e-180, Method: Compositional matrix adjust.
Identities = 360/825 (43%), Positives = 470/825 (56%), Gaps = 89/825 (10%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
+VTYD R+L+I+G+RR+L SGSIHYPRSTPE+WP LI K+KEGG++VIETY FWN HEP
Sbjct: 22 GSVTYDGRSLIINGQRRLLFSGSIHYPRSTPEMWPSLISKAKEGGIDVIETYAFWNQHEP 81
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
+GQY F GR D+V+F K VQ GL+ LRIGP+ +EWNYGG P WLH +PGI +R+ N
Sbjct: 82 KQGQYDFSGRLDIVKFFKEVQAQGLYACLRIGPFIESEWNYGGLPFWLHDVPGIIYRSDN 141
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M+ F KI++LMK ENL+ASQGGPIIL+Q+ENEY NVE A+ G YV+WAA
Sbjct: 142 EPFKFYMQNFTTKIVNLMKSENLYASQGGPIILSQIENEYKNVEAAFHEKGPPYVRWAAK 201
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
AV+L T++ + + E+ G
Sbjct: 202 MAVDLQTAMRY----------------------------------YGEDKRG-------- 219
Query: 243 VPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
R EDLAF VA F + G+F NYYMY GGTNFGRT+ ++ YD AP+DEYG
Sbjct: 220 ---RAAEDLAFQVALFIAKKNGSFINYYMYHGGTNFGRTSSSYVLTAYYD-QAPLDEYGL 275
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
IRQPKWGHL+ELH IKLC + L+ + LG EA+++ + S CAAFL N D
Sbjct: 276 IRQPKWGHLKELHAVIKLCSDTLLXGVQYNYSLGQLQEAYLFKRPSGQCAAFLVNNDKRR 335
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
+ V F Y L A S+SILPDCK + FNTAKV +Q N ++V S
Sbjct: 336 NVTVLFQNTNYELAANSISILPDCKKIAFNTAKVSTQFNT--------RSVQTRATFGST 387
Query: 422 FSWYEEKVGIS--GNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESL 479
W E + GI G L E + TTKD SDYLWYT + + L ++SL
Sbjct: 388 KQWSEYREGIPSFGGTPLKASMLLEHMGTTKDASDYLWYTLRF-IHNSSNAQPVLRVDSL 446
Query: 480 GHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAG 539
H L FVN K +A +G+H +F + K+ LN G+N + +LS+MVGL + G + +
Sbjct: 447 AHVLLAFVNGKYIASAHGSHQNGSFSLVNKVPLNSGLNRISLLSVMVGLPDAGPYLEHKV 506
Query: 540 AGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLI 599
AG+ V + D +D S W YQVG+ GE + + + W G L
Sbjct: 507 AGIRRVEIQD-GGXSKDFSKHPWGYQVGLMGEKLQIYTSPGSQKVQW-YGLGSHGRGPLT 564
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
WYKT F AP G P+ L SMGKG+AWVNGQSIGRYW +YL PS
Sbjct: 565 WYKTLFDAPRGNDPVVLFFGSMGKGEAWVNGQSIGRYWVSYLTPS--------------- 609
Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEAD 719
G+P+QT Y++PR +++P NLLV+ EE GDP KIS+ T + ++C V+++
Sbjct: 610 -------GEPSQTWYNVPRAFLNPKGNLLVVQEEESGDPLKISIGTVSVTNVCGHVTDSH 662
Query: 720 PPPVDSWKP----NLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM- 774
PPP+ SW N P+V+L C +I+ I FAS+G P G C S+ G+CH
Sbjct: 663 PPPIISWTTSDDGNESHHGKIPKVQLRCPPSSNISKITFASFGTPVGGCESYAIGSCHSP 722
Query: 775 DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
+ L + +KAC+G+ CSIP S G CPG KAL V A C
Sbjct: 723 NSLAVAEKACLGKNXCSIPHSLKSFG--DDPCPGTPKALLVAAQC 765
>gi|413926110|gb|AFW66042.1| hypothetical protein ZEAMMB73_706783 [Zea mays]
Length = 700
Score = 637 bits (1642), Expect = e-180, Method: Compositional matrix adjust.
Identities = 323/658 (49%), Positives = 417/658 (63%), Gaps = 74/658 (11%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V+YDHR+LVI+G+RR+L SGSIHYPRS PE+WP LI+K+K+GGL+V++TYVFWN HEP +
Sbjct: 40 VSYDHRSLVINGRRRILISGSIHYPRSAPEMWPGLIQKAKDGGLDVVQTYVFWNGHEPAQ 99
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
GQYYF R+DLVRFVK V++AGL++HLR+GPY CAEWN+GGFPVWL ++PGI+FRT N P
Sbjct: 100 GQYYFADRYDLVRFVKLVRQAGLYVHLRVGPYVCAEWNFGGFPVWLKYVPGIRFRTDNGP 159
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK M++F+ KI+ +MK E LF QGGPII+AQVENE+G +E G GG+ Y WAA A
Sbjct: 160 FKAAMQKFVEKIVSMMKSEGLFEWQGGPIIMAQVENEFGPMESVVGSGGKPYAHWAAQMA 219
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
V N VPWVMC+Q+DAPDP+INTCNGFYCD FTPN+ KP MWTE ++GWF FG A P
Sbjct: 220 VGTNAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNNKHKPTMWTEAWTGWFTKFGGAAP 279
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY----- 299
RPVEDLAFAVARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDE+
Sbjct: 280 HRPVEDLAFAVARFVQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGMQWL 339
Query: 300 --------------------------------------------GFIRQPKWGHLRELHK 315
G +RQPKWGHLR +H+
Sbjct: 340 LPSLINLNSHRLPRDICRKSSQCGFYLSVVHTWNFWGGGWVYIAGLLRQPKWGHLRNMHR 399
Query: 316 AIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP 375
AIK E L+S DPT + +G +A+++ + CAAFL+NY S + F+G Y LP
Sbjct: 400 AIKQAEPALVSGDPTIRSIGNYEKAYVFKSKNGACAAFLSNYHVKSAVRIRFDGRHYDLP 459
Query: 376 AWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW--YEEKVGISG 433
AWS+SILPDCK VFNTA V + + ++ F+W Y E
Sbjct: 460 AWSISILPDCKTAVFNTATV-----------KEPTLLPKMSPVMHRFAWQSYSEDTNSLD 508
Query: 434 NRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESLGHAALVFVN 488
+ +F R L EQ++ T D SDYLWYT +++ + G+ L++ S GH+ VFVN
Sbjct: 509 DSAFARDGLIEQLSLTWDKSDYLWYTTHVNIGSNERFLKSGQWPQLSVYSAGHSMQVFVN 568
Query: 489 KKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VIL 547
+ YG +D + +++ +G N + ILS VGL N G F++ G+ V L
Sbjct: 569 GRSYGSVYGGYDNPKLTFSGYVKMWQGSNKISILSSAVGLPNNGDHFELWNVGVLGPVTL 628
Query: 548 IDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK--QGSTLPVNKSLIWYKT 603
L GKRDLS WIYQVG++GE +GL ++ +++ W G T P L W+K
Sbjct: 629 SGLNEGKRDLSHQRWIYQVGLKGESLGLHTVTGSSAVEWAGPGGGTQP----LTWHKV 682
>gi|356527530|ref|XP_003532362.1| PREDICTED: beta-galactosidase 6-like [Glycine max]
Length = 673
Score = 637 bits (1642), Expect = e-179, Method: Compositional matrix adjust.
Identities = 337/715 (47%), Positives = 450/715 (62%), Gaps = 50/715 (6%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A VTYD R+L+IDG+R++L SGSIHYPRSTP++WP LI K+KEGGL+VI+TYVFWN HEP
Sbjct: 2 AEVTYDGRSLIIDGQRKILFSGSIHYPRSTPQMWPALISKAKEGGLDVIQTYVFWNLHEP 61
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
GQY F GR+DLVRF+K +Q GL++ LRIGPY +EW YGGFP WLH +P I +RT N
Sbjct: 62 QFGQYDFSGRYDLVRFIKEIQVQGLYVCLRIGPYIESEWTYGGFPFWLHDVPAIVYRTDN 121
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M+ F KI+ +M+ E L+ASQGGPIIL+Q+ENEY NVE A+G G YV+WAA+
Sbjct: 122 QPFKLYMQNFTTKIVSMMQSEGLYASQGGPIILSQIENEYQNVEKAFGEDGSRYVQWAAE 181
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFG 240
AV L T VPW+MC+Q DAPDP+INTCNG C + FT PNSP+KP WTEN++ ++ +G
Sbjct: 182 MAVGLKTGVPWLMCKQTDAPDPLINTCNGMRCGETFTGPNSPNKPAFWTENWTSFYQVYG 241
Query: 241 YAVPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
R ED+AF V F G++ NYYMY GGTN GRT+ ++ + YD AP+DEY
Sbjct: 242 GEPYIRSAEDIAFHVTLFIARKNGSYVNYYMYHGGTNLGRTSSSYVITSYYD-QAPLDEY 300
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
G +RQPKWGHL+ELH AIK C L+ ++ LG E +++ + C AFL N D
Sbjct: 301 GLLRQPKWGHLKELHAAIKSCSTTLLEGKQSNFSLGQLQEGYVFEEEGK-CVAFLVNNDH 359
Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
V F Y LP+ S+SILPDC+NV FNTA V ++ N ++ + + S
Sbjct: 360 VKMFTVQFRNRSYELPSKSISILPDCQNVTFNTATVNTKSN--------RRMTSTIQTFS 411
Query: 420 SAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIE 477
SA W +++ + + + L EQ+N TKD SDYLWYT S E L +
Sbjct: 412 SADKWEQFQDVIPNFDQTTLISNSLLEQMNVTKDKSDYLWYTLS---------ESKLTAQ 462
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
S H F + + +G+HD +F ++LNEG N + ILS+MVGL + GA+ +
Sbjct: 463 SAAHVTHAFADGTYLGGAHGSHDVKSFTTQVPLKLNEGTNNISILSVMVGLPDAGAFLER 522
Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQ-GSTLPVNK 596
AGL + + I DL++ W YQVG+ GE + + + +S W G+T N+
Sbjct: 523 RFAGL-TAVEIQCSEESYDLTNSTWGYQVGLLGEQLEIYEEKSNSSIQWSPLGNT--CNQ 579
Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
+L WYKT F +P+G P+ALNL SMGKGQAWVNG+SIGRYW ++
Sbjct: 580 TLTWYKTAFDSPKGDEPVALNLESMGKGQAWVNGESIGRYWISF---------------- 623
Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
+D+ GQP+QTLYH+PR+++ N LV+ EE GG+P ISL T + +I
Sbjct: 624 HDSK------GQPSQTLYHVPRSFLKDIGNSLVLFEEEGGNPLHISLDTISSTNI 672
>gi|356507439|ref|XP_003522474.1| PREDICTED: beta-galactosidase 6-like [Glycine max]
Length = 717
Score = 635 bits (1638), Expect = e-179, Method: Compositional matrix adjust.
Identities = 329/706 (46%), Positives = 445/706 (63%), Gaps = 33/706 (4%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ VTYD R+L+IDG+R++L SG IHYPRSTP++WP+LI K+K+GGL+VI+TYVFWN HE
Sbjct: 24 AEEVTYDGRSLIIDGQRKILFSGLIHYPRSTPQMWPDLIAKAKQGGLDVIQTYVFWNLHE 83
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G Y F GR+DLV F+K +Q GL++ LRIGP+ +EW YGGFP WLH +PGI +RT
Sbjct: 84 PQPGMYDFRGRYDLVGFIKEIQAQGLYVCLRIGPFIQSEWKYGGFPFWLHDVPGIVYRTD 143
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N FK M+ F KI+++MK+E L+ASQGGPIIL+Q+ENEY N++ A+G G YV+WAA
Sbjct: 144 NESFKFYMQNFTTKIVNMMKEEGLYASQGGPIILSQIENEYQNIQKAFGTAGSQYVQWAA 203
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSF 239
AV LNT VPWVMC+Q DAPDP+INTCNG C + FT PNSP+KP +WTEN++ ++ +
Sbjct: 204 KMAVGLNTGVPWVMCKQTDAPDPVINTCNGMRCGETFTGPNSPNKPALWTENWTSFYQVY 263
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
G R ED+AF V F G++ NYYMY GGTNFGRTA ++ YD AP+DEY
Sbjct: 264 GGLPYIRSAEDIAFHVTLFIARNGSYVNYYMYHGGTNFGRTASAYVITGYYD-QAPLDEY 322
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
G +RQPKWGHL++LH+ IK C L+ + LG E +++ + +C AFL N D
Sbjct: 323 GLLRQPKWGHLKQLHEVIKSCSTTLLQGVQRNFSLGQLQEGYVFEEEKGECVAFLKNNDR 382
Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
+ V F Y L S+SILPDC+NV FNTA V + N + ++N + L
Sbjct: 383 DNKVTVQFRNRSYELLPRSISILPDCQNVAFNTANVNTTSNR--RIISPKQNFSSL---- 436
Query: 420 SAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESL 479
+ +++ + N S L EQ+NTTKD SDYLWYT K L+++S
Sbjct: 437 DDWKQFQDVIPYFDNTSLRSDSLLEQMNTTKDKSDYLWYTLRFEYNLSCRKPT-LSVQSA 495
Query: 480 GHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAG 539
H A F+N + +GNHD +F + + +N+G N L ILS MVGL + GA+ +
Sbjct: 496 AHVAHAFINNTYIGGEHGNHDVKSFTLELPVTVNQGTNNLSILSAMVGLPDSGAFLERRF 555
Query: 540 AGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLI 599
AGL SV L + +L++ W YQVG+ GE + + K + W Q + + + LI
Sbjct: 556 AGLISVELQCSEQESLNLTNSTWGYQVGLLGEQLQVYKKQNNSDIGWSQLGNI-MEQLLI 614
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
WYKTTF PEG P+ L+L+SMGKG+AWVN QSIGRYW + +D+
Sbjct: 615 WYKTTFDTPEGDDPVVLDLSSMGKGEAWVNEQSIGRYWILF----------------HDS 658
Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
G P+Q+LYH+PR+++ N+LV+ EE GG+P ISL T
Sbjct: 659 K------GNPSQSLYHVPRSFLKDTGNVLVLVEEGGGNPLGISLDT 698
>gi|449529068|ref|XP_004171523.1| PREDICTED: beta-galactosidase 16-like [Cucumis sativus]
Length = 756
Score = 634 bits (1636), Expect = e-179, Method: Compositional matrix adjust.
Identities = 352/799 (44%), Positives = 479/799 (59%), Gaps = 56/799 (7%)
Query: 35 VWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIG 94
+WP LI K+KEGG++VI+TYVFWN HEP +G Y F GR D+VRFVK +Q GL+ LRIG
Sbjct: 1 MWPSLIAKAKEGGIDVIQTYVFWNLHEPQQGTYEFSGRRDIVRFVKEIQAQGLYACLRIG 60
Query: 95 PYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPII 154
P+ AEW+YGG P WLH + GI +R+ N PFK M+ F KI+++MK E L+ASQGGPII
Sbjct: 61 PFIEAEWSYGGLPFWLHDVLGIVYRSDNEPFKLHMQNFTTKIVNMMKSEGLYASQGGPII 120
Query: 155 LAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC 214
L+Q+ENEY VE A+G G YV+WAA AV+L T VPW MC+Q DAPDP+INTCNG C
Sbjct: 121 LSQIENEYTLVEAAFGEKGPPYVQWAAKMAVSLQTGVPWSMCKQNDAPDPVINTCNGMRC 180
Query: 215 -DGFT-PNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFET-GGTFQNYYMY 271
+ FT PNSP+KP +WTEN++ ++ ++G R E++AF VA F GT+ NYYMY
Sbjct: 181 GETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIAAKNGTYVNYYMY 240
Query: 272 FGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTH 331
GGTNFGR+A ++ YD +P+DEYG R+PKWGHL+ELH A+KLC L++ ++
Sbjct: 241 HGGTNFGRSASAFMITGYYD-QSPLDEYGLTREPKWGHLKELHAAVKLCSTPLLTGTKSN 299
Query: 332 QKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFN 391
LG +EA ++ SN+CAAFL N + D+NV F Y LP S+SILPDCKNV FN
Sbjct: 300 FSLGQSVEAIVFKTESNECAAFLVN-RGAIDSNVLFQNVTYELPLGSISILPDCKNVAFN 358
Query: 392 TAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKD 451
T +V Q N Q+ ++ E + ++E + + +L E + TTKD
Sbjct: 359 TRRVSVQHNTRSMMAVQKFDLLE-------WEEFKEPIPNIDDTELRANELLEHMGTTKD 411
Query: 452 TSDYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIE 511
SDYLWYT + ++ L ++S HA FVN +G + F + K I
Sbjct: 412 RSDYLWYTFRVQQDSPDSQQT-LEVDSRAHALHAFVNGDYAGSAHGIYKEKGFSLAKNIT 470
Query: 512 LNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGE 571
L GIN + +LS+MVGL + GA+ + AGL V + D S W Y+VG+ GE
Sbjct: 471 LRNGINNISLLSVMVGLPDSGAFLETRVAGLRRVGI-----QGEDFSEQHWGYKVGLSGE 525
Query: 572 --YIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVN 629
I LD S +N + + G++ ++ L WYKT F AP G P+ALNL SMGKG WVN
Sbjct: 526 QSQIFLDTGS-SNVQWSRLGNS---SQPLTWYKTQFDAPPGDDPIALNLGSMGKGAVWVN 581
Query: 630 GQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLV 689
G+ IGRYW ++L P G+P+Q Y++PR+++ P +N LV
Sbjct: 582 GRGIGRYWVSFLTPK----------------------GEPSQKWYNVPRSFLKPTDNQLV 619
Query: 690 IHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSW----KPNLGVV---SSSPQVRLA 742
I EE G+P +ISL + C VSE+ P V SW K + V + P+V+L+
Sbjct: 620 ILEEETGNPVEISLDSVLITKTCGQVSESHYPLVASWMGAKKQKVRRVKNRTRRPKVQLS 679
Query: 743 CERGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGV 801
C I+ I FAS+G P G+C S+ G CH + IV+ AC+G+ +CSIP+S+ L
Sbjct: 680 CPSKKKISNILFASFGTPSGDCQSYAIGLCHSPNSRAIVEHACLGRAKCSIPISN--LNF 737
Query: 802 SAGACPGLLKALAVEAHCS 820
CP + K L V+A C+
Sbjct: 738 RGDPCPHVTKTLLVDAQCT 756
>gi|125597922|gb|EAZ37702.1| hypothetical protein OsJ_22044 [Oryza sativa Japonica Group]
Length = 811
Score = 634 bits (1636), Expect = e-179, Method: Compositional matrix adjust.
Identities = 347/833 (41%), Positives = 473/833 (56%), Gaps = 70/833 (8%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTY+ R+LVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL+ IETYVFWN HEP R
Sbjct: 31 VTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
QY F G +D+VRF K +Q AGL+ LRIGPY C EWNYGG P WL IPG+QFR N P
Sbjct: 91 RQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNAP 150
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAAD 182
F+ EM+ F I++ MK N+FA QGGPIILAQ+ENEYGN+ + Y+ W AD
Sbjct: 151 FENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEYIHWCAD 210
Query: 183 TAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
A N VPW+MCQQ+ D P ++NTCNGFYC + PN P +WTEN++GWF ++
Sbjct: 211 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 270
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
R ED+AFAVA FF+ GGP + TSYDYDAP+DEYG
Sbjct: 271 PDFHRSAEDIAFAVAMFFQ-------------------KRGGPYITTSYDYDAPLDEYGN 311
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
+RQPK+GHL++LH IK E+ L+ + K+ Y S A F+ N + +
Sbjct: 312 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDKVTVTKYTLDSTS-ACFINNRNDNM 370
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
D NVT +G + LPAWSVSILPDCK V FN+AK+ +Q + + E S
Sbjct: 371 DVNVTLDGTTHLLPAWSVSILPDCKTVAFNSAKIKAQTT----VMVNKAKMVEKEPESLK 426
Query: 422 FSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
+SW E + S+ + +L EQI T+ D SDYLWY SI+ +F+N +
Sbjct: 427 WSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSINHKGEASYTLFVN--T 484
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
GH FVN LV + + F + +L++G N + +LS +GL+NYG F+
Sbjct: 485 TGHELYAFVNGMLVGQNHSPNGHFVFQLESPAKLHDGKNYISLLSATIGLKNYGPLFEKM 544
Query: 539 GAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQGSTLPV 594
AG+ V LID DLS+ W Y+ G+ GEY I LDK ++ T+P+
Sbjct: 545 PAGIVGGPVKLIDNNGKGIDLSNSSWSYKAGLAGEYRQIHLDK---PGCTWDNNNGTVPI 601
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPST--GCTKKCD 652
NK WYKTTF AP G+ + ++L + KG AWVNG ++GRYW +Y A +
Sbjct: 602 NKPFTWYKTTFQAPAGEDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAARSMRRLPTTAH 661
Query: 653 YRGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLTKT 707
YRG + D KC CG+P+Q YH+PR+++ GE N +++ EE GGDPS +S T
Sbjct: 662 YRGVFQAEGDGQKCLTGCGEPSQRFYHVPRSFLKNGEPNTVILFEEAGGDPSHVSFRTVA 721
Query: 708 GQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGNCGS 766
+C+ D + L+C + I+AIN S+G+ G CG+
Sbjct: 722 AGSVCASAEVGD------------------TITLSCGQHSKTISAINVTSFGVARGQCGA 763
Query: 767 FRPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
++ G +AC+G+ C++ +++A V+ C L L V+A C
Sbjct: 764 YKGGCESKAAYKAFTEACLGKESCTVQITNA---VTGSGC--LSNVLTVQASC 811
>gi|6686900|emb|CAB64750.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 887
Score = 634 bits (1635), Expect = e-179, Method: Compositional matrix adjust.
Identities = 345/833 (41%), Positives = 485/833 (58%), Gaps = 54/833 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD +L+I+GKR + SGS+HYPRSTP++WP +I K++ GGL I+TYVFWN HEP +
Sbjct: 41 VTYDGTSLIINGKRELFFSGSVHYPRSTPDMWPSIIDKARIGGLNTIQTYVFWNVHEPEQ 100
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G+Y F+GRFDLV+F+K + E GL++ LR+GP+ AEWN+GG P WL +P + FRT N P
Sbjct: 101 GKYDFKGRFDLVKFIKLIHEKGLYVTLRLGPFIQAEWNHGGLPYWLREVPDVYFRTNNEP 160
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FKE +R++ KI+ +MK+E LFASQGGPIIL Q+ENEY V+ AY GE Y+KWAA+
Sbjct: 161 FKEHTERYVRKILGMMKEEKLFASQGGPIILGQIENEYNAVQLAYKENGEKYIKWAANLV 220
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
++N +PWVMC+Q DAP +IN CNG +C D F PN KP +WTEN++ F FG
Sbjct: 221 ESMNLGIPWVMCKQNDAPGNLINACNGRHCGDTFPGPNRHDKPSLWTENWTTQFRVFGDP 280
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
R ED+AF+VAR+F G+ NYYMY GGTNFGRT+ V T Y DAP+DE+G
Sbjct: 281 PTQRTAEDIAFSVARYFSKNGSHVNYYMYHGGTNFGRTSAH-FVTTRYYDDAPLDEFGLE 339
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYDSSS 361
+ PK+GHL+ +H+A++LC++ L Q LG E Y + + CAAFL+N ++
Sbjct: 340 KAPKYGHLKHVHRALRLCKKALFWGQLRAQTLGPDTEVRYYEQPGTKVCAAFLSNNNTRD 399
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
+ F G Y LP+ S+SILPDCK VV+NTA++++Q + D F + + ++ L
Sbjct: 400 TNTIKFKGQDYVLPSRSISILPDCKTVVYNTAQIVAQHSWRD--FVKSEKTSKGL----K 453
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQ-GKEVFLNI 476
F + E + + + P E TKD +DY WYT S+ + P Q G + L +
Sbjct: 454 FEMFSENIPSLLDGDSLIP--GELYYLTKDKTDYAWYTTSVKIDEDDFPDQKGLKTILRV 511
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
SLGHA +V+VN + +G H+ +F K + G N + IL ++ GL + G++ +
Sbjct: 512 ASLGHALIVYVNGEYAGKAHGRHEMKSFEFAKPVNFKTGDNRISILGVLTGLPDSGSYME 571
Query: 537 VAGAGLFSVILIDLKNGKRDLS-SGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
AG ++ +I LK+G RDL+ + EW + G+EGE + + W++
Sbjct: 572 HRFAGPRAISIIGLKSGTRDLTENNEWGHLAGLEGEKKEVYTEEGSKKVKWEKDGE---R 628
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
K L WYKT F PEG +A+ + MGKG WVNG +GRYW ++L+P
Sbjct: 629 KPLTWYKTYFETPEGVNAVAIRMKGMGKGLIWVNGIGVGRYWMSFLSP------------ 676
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWV--HPGENLLVI-HEELGGDPSKISLLTKTGQHIC 712
G+P QT YHIPR+++ +N+LVI EE G I + IC
Sbjct: 677 ----------LGEPTQTEYHIPRSFMKGEKKKNMLVILEEEPGVKLESIDFVLVNRDTIC 726
Query: 713 SFVSEADPPPVDSWK-PNLGVVSSSPQVRLA----CERGWHIAAINFASYGIPEGNCGSF 767
S V E P V SWK +VS S +RL C + + FAS+G P G CG+F
Sbjct: 727 SNVGEDYPVSVKSWKREGPKIVSRSKDMRLKAVMRCPPEKQMVEVQFASFGDPTGTCGNF 786
Query: 768 RPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G C +V+K C+G+ CSI V+ G CP ++K LAV+ C
Sbjct: 787 TMGKCSASKSKEVVEKECLGRNYCSIVVARETFG--DKGCPEIVKTLAVQVKC 837
>gi|108862584|gb|ABA97655.2| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 713
Score = 634 bits (1635), Expect = e-179, Method: Compositional matrix adjust.
Identities = 330/642 (51%), Positives = 422/642 (65%), Gaps = 51/642 (7%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NVTYDHRA++I GKRR+L S +HYPR+TPE+WP LI K KEGG +VIETYVFWN HEP
Sbjct: 63 NVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGADVIETYVFWNGHEPA 122
Query: 64 RGQYYFEGRFDLVRFVKT--------VQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPG 115
+GQYYFE RFDLV+F K V GLFL LRIGPYACAEWN+GGFPVWL IPG
Sbjct: 123 KGQYYFEERFDLVKFAKIDLVKFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPG 182
Query: 116 IQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGEL 175
I+FRT N PFK EM+ F+ KI+ LMK+E L++ QGGPIIL Q+ENEYGN++ YG G+
Sbjct: 183 IEFRTDNEPFKAEMQTFVTKIVTLMKEEKLYSWQGGPIILQQIENEYGNIQGNYGQAGKR 242
Query: 176 YVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGW 235
Y++WAA A+ L+T +PWVMC+Q DAP+ II+TCN FYCDGF PNS +KP +WTE++ GW
Sbjct: 243 YMQWAAQMAIGLDTGIPWVMCRQTDAPEEIIDTCNAFYCDGFKPNSYNKPTIWTEDWDGW 302
Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAP 295
+ +G A+P RP ED AFAVARF++ GG+ QNYYMYFGGTNF RTAGGPL TSYDYDAP
Sbjct: 303 YADWGGALPHRPAEDSAFAVARFYQRGGSLQNYYMYFGGTNFARTAGGPLQITSYDYDAP 362
Query: 296 IDEYGFIRQPKWGHLRELHKAIKLCEEYLISSD--PTHQKLGAKLEAHIYHK-------- 345
IDEYG +RQPKWGHL++LH AIKLCE LI+ D P + KLG+ EAH+Y
Sbjct: 363 IDEYGILRQPKWGHLKDLHTAIKLCEPALIAVDGSPQYIKLGSMQEAHVYSTGEVHTNGS 422
Query: 346 ---SSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN-- 400
++ C+AFLAN D A+V G Y LP WSVSILPDC+NV FNTA++ +Q +
Sbjct: 423 MAGNAQICSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCENVAFNTARIGAQTSVF 482
Query: 401 --NGDHPFAQQKNVNELLLASS-----AFSWY--EEKVGISGNRSFVRPDLAEQINTTKD 451
P ++ +L +S + +W+ +E +G G +F + E +N TKD
Sbjct: 483 TVESGSPSRSSRHKPSILSLTSGGPYLSSTWWTSKETIGTWGGNNFAVQGILEHLNVTKD 542
Query: 452 TSDYLWYTASIHVMPG-------QGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANF 504
SDYLWYT +++ +G L I+ + A VFVN KL G+
Sbjct: 543 ISDYLWYTTRVNISDADVAFWSSKGVLPSLTIDKIRDVARVFVNGKLAGSQVGHW----V 598
Query: 505 LINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWI 563
+ + I+L EG+N L +LS +VGLQNYGA+ + GAG V L L +G DL++ W
Sbjct: 599 SLKQPIQLVEGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVTLTGLSDGDVDLTNSLWT 658
Query: 564 YQVGVEGEYIGL---DKISLANSSFWKQGSTLPVNKSLIWYK 602
YQVG++GE+ + +K A S ++ S P WYK
Sbjct: 659 YQVGLKGEFSMIYAPEKQGCAGWSRMQKDSVQP----FTWYK 696
>gi|356507642|ref|XP_003522573.1| PREDICTED: beta-galactosidase 16-like [Glycine max]
Length = 696
Score = 633 bits (1633), Expect = e-179, Method: Compositional matrix adjust.
Identities = 342/707 (48%), Positives = 441/707 (62%), Gaps = 47/707 (6%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NVTYD R+L+IDG+ ++L SGSIHYPRSTP++WP LI K+KEGGL+VI+TYVFWN HEP
Sbjct: 26 NVTYDGRSLIIDGQHKILFSGSIHYPRSTPQMWPNLIAKAKEGGLDVIQTYVFWNLHEPQ 85
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+GQY F G ++VRF+K +Q GL++ LRIGPY +E YGG P+WLH IPGI FR+ N
Sbjct: 86 QGQYDFRGMRNIVRFIKEIQAQGLYVTLRIGPYIESECTYGGLPLWLHDIPGIVFRSDNE 145
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
FK M+RF AKI++LMK NLFASQGGPIIL+Q+ENEYGNVE A+ G Y++WAA
Sbjct: 146 QFKFHMQRFTAKIVNLMKSANLFASQGGPIILSQIENEYGNVEGAFHEKGLSYIRWAAQM 205
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFLSFG 240
AV L T VPWVMC+Q++APDP+INTCNG C G T PNSP+KP +WTEN++ ++ FG
Sbjct: 206 AVGLQTGVPWVMCKQDNAPDPVINTCNGMQC-GKTFKGPNSPNKPSLWTENWTSFYQVFG 264
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
R ED+A+ VA F G++ NYYMY GGTNF R A +V YD +AP+DEYG
Sbjct: 265 EVPYIRSAEDIAYNVALFIAKRGSYVNYYMYHGGTNFDRIASAFVVTAYYD-EAPLDEYG 323
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+R+PKWGHL+ELH+AIK C L+ T LG + A+++ +SS +CAAFL N +
Sbjct: 324 LVREPKWGHLKELHEAIKSCSNSLLYGTQTSFSLGTQQNAYVFRRSSIECAAFLENTEDR 383
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
S + F Y LP S+SILPDCKNV FNTAKV +Q + + L +S
Sbjct: 384 S-VTIQFQNIPYQLPPNSISILPDCKNVAFNTAKVRAQ---------NARAMKSQLQFNS 433
Query: 421 AFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
A W Y E + + S L +QI+T KDTSDYLWYT ++ + + L+ S
Sbjct: 434 AEKWKVYREAIPSFADTSLRANTLLDQISTAKDTSDYLWYTFRLYDNSANAQSI-LSAYS 492
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
GH FVN LV +G+H +F++ K+ L G+N + LS VGL N GA+ +
Sbjct: 493 HGHVLHAFVNGNLVGSKHGSHKNVSFVMENKLNLISGMNNISFLSATVGLPNSGAYLEGR 552
Query: 539 GAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
AGL S LK RD ++ W YQVG+ GE + + S ++ W+ S L K L
Sbjct: 553 VAGLRS-----LKVQGRDFTNQAWGYQVGLLGEKLQIYTASGSSKVKWE--SFLSSTKPL 605
Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
WYKTTF AP G P+ LNL SMGKG WVNGQ IGRYW ++ P
Sbjct: 606 TWYKTTFDAPVGNDPVVLNLGSMGKGYTWVNGQGIGRYWVSFHTPQ-------------- 651
Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
G P+Q YHIPR+ + NLLV+ EE G+P I+L T
Sbjct: 652 --------GTPSQKWYHIPRSLLKSTGNLLVLLEEETGNPLGITLDT 690
>gi|222631666|gb|EEE63798.1| hypothetical protein OsJ_18622 [Oryza sativa Japonica Group]
Length = 765
Score = 632 bits (1630), Expect = e-178, Method: Compositional matrix adjust.
Identities = 354/826 (42%), Positives = 473/826 (57%), Gaps = 93/826 (11%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
L +TYD RALV+ G RR+ SG +HY RSTPE+WP+LI K+K GGL+VI+TYVFWN H
Sbjct: 25 LGREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVH 84
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EPI+GQY FEGR+DLV+F++ +Q GL++ LRIGP+ AEW YGGFP WLH +P I FR+
Sbjct: 85 EPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRS 144
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK+ M+ F+ KI+ +MK E L+ QGGPII++Q+ENEY +E A+G G YV+WA
Sbjct: 145 DNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWA 204
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLS 238
A AV L T VPW+MC+Q DAPDP+INTCNG C + F PNSP+KP +WTEN++ +
Sbjct: 205 AAMAVGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRYPI 264
Query: 239 FGYAVPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPID 297
+G R ED+AFAVA F G+F +YYMY GGTNFGR A V TSY AP+D
Sbjct: 265 YGNDTKLRAPEDIAFAVALFIARKKGSFVSYYMYHGGTNFGRFAAS-YVTTSYYDGAPLD 323
Query: 298 EYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANY 357
EY F C AFL N+
Sbjct: 324 EYDF-----------------------------------------------KCVAFLVNF 336
Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKN-VNELL 416
D + V F L S+S+L DC+NVVF TAKV +Q + Q N +N
Sbjct: 337 DQHNTPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNAQHGSRTANAVQSLNDINNWK 396
Query: 417 LASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEV-FLN 475
K +GN+ F EQ+ TTKD +DYLWY S G ++ L
Sbjct: 397 AFIEPVPQDLSKSTYTGNQLF------EQLTTTKDETDYLWYIVSYKNRASDGNQIAHLY 450
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFA-NFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
++SL H FVN + V +G+HD N ++N + L EG NT+ +LS+MVG + GA+
Sbjct: 451 VKSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEGDNTISLLSVMVGSPDSGAY 510
Query: 535 FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+ G+ +V + + L++ W YQVG+ GE + NS W + L +
Sbjct: 511 MERRTFGIQTVGIQQGQQPMHLLNNDLWGYQVGLFGEKDSIYTQEGTNSVRWMDINNL-I 569
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
L WYKTTF P G + LNL SMGKG+ WVNG+SIGRYW ++ APS
Sbjct: 570 YHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEVWVNGESIGRYWVSFKAPS---------- 619
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
GQP+Q+LYHIPR ++ P +NLLV+ EE+GGDP +I++ T + +C
Sbjct: 620 ------------GQPSQSLYHIPRGFLTPKDNLLVLVEEMGGDPLQITVNTMSVTTVCGN 667
Query: 715 VSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM 774
V E PP+ S P+VR+ C+ G I++I FASYG P G+C SFR G+CH
Sbjct: 668 VDEFSVPPLQS-------RGKVPKVRIWCQGGNRISSIEFASYGNPVGDCRSFRIGSCHA 720
Query: 775 DVL-PIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
+ +V+++C+G+ CSIPV +A G CPG+ K+L V A C
Sbjct: 721 ESSESVVKQSCIGRRGCSIPVMAAKFG--GDPCPGIQKSLLVVADC 764
>gi|255558624|ref|XP_002520337.1| beta-galactosidase, putative [Ricinus communis]
gi|223540556|gb|EEF42123.1| beta-galactosidase, putative [Ricinus communis]
Length = 771
Score = 631 bits (1628), Expect = e-178, Method: Compositional matrix adjust.
Identities = 360/828 (43%), Positives = 471/828 (56%), Gaps = 102/828 (12%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ NVTYD R+L+I+G+ R+L SGSIHYPRSTPE
Sbjct: 37 AGNVTYDGRSLIINGEHRILFSGSIHYPRSTPE--------------------------- 69
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
Y F+GR DLV+F+ VQ GL+ LRIGP+ EW YGG P WLH + GI FR+
Sbjct: 70 -----YDFDGRKDLVKFLLEVQAQGLYAALRIGPFIEGEWTYGGLPFWLHDVSGIVFRSD 124
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK+ M+RF+ KI+++MK L+ASQGGPII++Q+ENEY NVE A+ G YV WAA
Sbjct: 125 NEPFKKHMQRFVTKIVNMMKYNQLYASQGGPIIISQIENEYQNVETAFHEKGSRYVHWAA 184
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSF 239
+ AV LNT VPWVMC+Q DAPDP+INTCNG C + F PNSP+KP MWTEN++ ++ F
Sbjct: 185 NMAVRLNTGVPWVMCKQTDAPDPVINTCNGMRCGETFAGPNSPNKPSMWTENWTSFYQVF 244
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
G R ED+AF VA F G++ NYYMY GGTNFGRT G V TSY AP+DEY
Sbjct: 245 GGEPYIRTAEDIAFHVALFIARNGSYVNYYMYHGGTNFGRT-GSAFVTTSYYDQAPLDEY 303
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQK--LGAKLEAHIYHKSSNDCAAFLANY 357
G IRQPKWGHL++LH IK C + LI THQ LG EA+++ + S DC AFL N
Sbjct: 304 GLIRQPKWGHLKDLHAKIKSCSKTLIRG--THQTFPLGRLQEAYVFREKSGDCVAFLVNN 361
Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLL 417
D D V F Y LP S+SILPDCK++ FNTAKV +Q +Q+
Sbjct: 362 DGRRDVTVRFQNRSYELPHKSISILPDCKSITFNTAKVNTQYATRSATLSQE-------- 413
Query: 418 ASSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLN 475
SS W Y+E V + S L + ++TTKDTSDYLWYT + + L
Sbjct: 414 FSSVGKWEEYKETVATFDSTSLRAKTLLDHLSTTKDTSDYLWYTFRFQNHFSRPQST-LR 472
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
S GH +VN +G+H+ +F + + L G N + +LS+ VGL + GA+
Sbjct: 473 AYSRGHVLHAYVNGVYAGSAHGSHESTSFTLENSVRLKNGTNNVALLSVTVGLPDSGAYL 532
Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQ--GSTLP 593
+ AGL V + + +D ++ W YQVG+ GE + + + N W + G+T P
Sbjct: 533 ERRVAGLHRVRIQN-----KDFTTYSWGYQVGLLGEKLQIYTDNGLNKVSWNEFRGTTQP 587
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
L WYKT F AP G P+ALNL SMGKG+AWVNGQSIGRYW
Sbjct: 588 ----LTWYKTQFDAPAGSDPIALNLHSMGKGEAWVNGQSIGRYWV--------------- 628
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
S+ SK G P+QT YHIP+++V P NLLV+ EE G P I++ + + +C
Sbjct: 629 --SFSTSK-----GNPSQTRYHIPQSFVKPTGNLLVLLEEEKGYPPGITVDSISISKVCG 681
Query: 714 FVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
VSE S V+L+C +I+ I F+S+G PEGNC + G CH
Sbjct: 682 HVSE----------------SHKSVVQLSCPPNRNISRILFSSFGTPEGNCNQYAIGKCH 725
Query: 774 -MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+ IV+KAC+G+ +C I S+ + G CPG+ K L V+A C+
Sbjct: 726 SSNSRAIVEKACIGKTKCIILRSNRFFG--GDPCPGIRKGLLVDAKCT 771
>gi|224083510|ref|XP_002307056.1| predicted protein [Populus trichocarpa]
gi|222856505|gb|EEE94052.1| predicted protein [Populus trichocarpa]
Length = 715
Score = 631 bits (1627), Expect = e-178, Method: Compositional matrix adjust.
Identities = 334/728 (45%), Positives = 455/728 (62%), Gaps = 45/728 (6%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
+VTYD R+L+IDG+R++L SGSIHYPRSTPE+WP L+ K++EGG++VI+TYVFWN HEP
Sbjct: 23 GDVTYDGRSLIIDGQRKILFSGSIHYPRSTPEMWPSLVAKAREGGVDVIQTYVFWNLHEP 82
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G+Y F GR DLVRF+K +Q GL++ LRIGP+ +EW YGGFP WLH +P I +R+ N
Sbjct: 83 RPGEYDFSGRNDLVRFIKEIQAQGLYVCLRIGPFIESEWTYGGFPFWLHDVPDIVYRSDN 142
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M+ F KI+++MK E L+ASQGGPIIL+Q+ENEY NVE A+ G YV WAA
Sbjct: 143 EPFKFYMQNFTTKIVNMMKSEGLYASQGGPIILSQIENEYQNVEAAFRDKGPPYVIWAAK 202
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFLSF 239
AV L T VPWVMC+Q DAPDP+INTCNG C G T PNSP+KP +WTEN++ ++ +
Sbjct: 203 MAVELQTGVPWVMCKQTDAPDPVINTCNGMRC-GETFGGPNSPTKPSLWTENWTSFYQVY 261
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
G R ED+AF V F G++ NYYM+ GGTNFGRTA ++ + YD AP+DEY
Sbjct: 262 GGEPYIRSAEDIAFHVTLFIAKNGSYINYYMFHGGTNFGRTASAYVITSYYD-QAPLDEY 320
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
G IRQPKWGHL+ELH AIK C ++ ++ LG +A+I+ + CAAFL N D
Sbjct: 321 GLIRQPKWGHLKELHAAIKSCSSTILEGVQSNFSLGQLQQAYIFEEEGAGCAAFLVNNDQ 380
Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
++A V F + L S+S+LPDC+N++FNTAKV ++ N + ++L +
Sbjct: 381 KNNATVEFRNITFELLPKSISVLPDCENIIFNTAKVNAKGNE------ITRTSSQLFDDA 434
Query: 420 SAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQG-KEVFLNIES 478
+ Y + + + + L E +NTTKD SDYLWYT S +P E L++ES
Sbjct: 435 DRWEAYTDVIPNFADTNLKSDTLLEHMNTTKDKSDYLWYTFSF--LPNSSCTEPILHVES 492
Query: 479 LGHAALVFVNKKLVAFGYGNHDFAN-FLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
L H A FVN K +G+ D F + I LN+ +NT+ ILS MVGLQ+ GA+ +
Sbjct: 493 LAHVASAFVNNKYAGSAHGSKDAKGPFTMEAPIVLNDQMNTISILSTMVGLQDSGAFLER 552
Query: 538 AGAGLFSVILIDLKNGKRDL----SSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
AGL V +++ ++++ ++ EW YQ G+ GE + + ++ W + +
Sbjct: 553 RYAGLTRV---EIRCAQQEIYNFTNNYEWGYQAGLSGESLNIYMREHLDNIEWSEVVS-A 608
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
++ L W+K F AP G P+ LNL++MGKG+AWVNGQSIGRYW ++L
Sbjct: 609 TDQPLSWFKIEFDAPTGNDPVVLNLSTMGKGEAWVNGQSIGRYWLSFLTSK--------- 659
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
GQP+QTLYHIPR +++ NLLV+ EE GGDP ISL T + +
Sbjct: 660 -------------GQPSQTLYHIPRAFLNSSGNLLVLLEESGGDPLHISLDTVSRTGLQE 706
Query: 714 FVSEADPP 721
S PP
Sbjct: 707 HASRYHPP 714
>gi|218196839|gb|EEC79266.1| hypothetical protein OsI_20049 [Oryza sativa Indica Group]
Length = 761
Score = 630 bits (1626), Expect = e-178, Method: Compositional matrix adjust.
Identities = 353/826 (42%), Positives = 473/826 (57%), Gaps = 93/826 (11%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
L +TYD RALV+ G RR+ SG +HY RSTPE+WP+LI K+K GGL+VI+TYVFWN H
Sbjct: 21 LGREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVH 80
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EPI+GQY FEGR+DLV+F++ +Q GL++ LRIGP+ AEW YGGFP WLH +P I FR+
Sbjct: 81 EPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRS 140
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK+ M+ F+ KI+ +MK E L+ QGGPII++Q+ENEY +E A+G G YV+WA
Sbjct: 141 DNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWA 200
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLS 238
A AV L T VPW+MC+Q DAPDP+INTCNG C + F PNSP+KP +WTEN++ +
Sbjct: 201 AAMAVGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRYPI 260
Query: 239 FGYAVPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPID 297
+G R ED+AFAVA + G+F +YYMY GGTNFGR A V TSY AP+D
Sbjct: 261 YGNDTKLRDPEDIAFAVALYIARKKGSFVSYYMYHGGTNFGRFAAS-YVTTSYYDGAPLD 319
Query: 298 EYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANY 357
EY F C AFL N+
Sbjct: 320 EYDF-----------------------------------------------KCVAFLVNF 332
Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKN-VNELL 416
D + V F L S+S+L DC+NVVF TAKV +Q + Q N +N
Sbjct: 333 DQHNTPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNAQHGSRTANAVQSLNDINNWK 392
Query: 417 LASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVF-LN 475
K +GN+ F EQ+ TTKD +DYLWY S G ++ L
Sbjct: 393 AFIEPVPQDLSKSTYTGNQLF------EQLTTTKDETDYLWYIVSYKNRASDGNQIARLY 446
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFA-NFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
++SL H FVN + V +G+HD N ++N + L EG NT+ +LS+MVG + GA+
Sbjct: 447 VKSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEGDNTISLLSVMVGSPDSGAY 506
Query: 535 FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
+ G+ +V + + L++ W YQVG+ GE + NS W + L +
Sbjct: 507 MERRTFGIQTVGIQQGQQPMHLLNNDLWGYQVGLFGEKDSIYTQEGPNSVRWMDINNL-I 565
Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
L WYKTTF P G + LNL SMGKG+ WVNG+SIGRYW ++ APS
Sbjct: 566 YHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEVWVNGESIGRYWVSFKAPS---------- 615
Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
GQP+Q+LYHIPR ++ P +NLLV+ EE+GGDP +I++ T + +C
Sbjct: 616 ------------GQPSQSLYHIPRGFLTPKDNLLVLVEEMGGDPLQITVNTMSVTTVCGN 663
Query: 715 VSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM 774
V E PP+ S P+VR+ C+ G I++I FASYG P G+C SFR G+CH
Sbjct: 664 VDEFSVPPLQS-------RGKVPKVRIWCQGGKRISSIEFASYGNPVGDCRSFRIGSCHA 716
Query: 775 DVL-PIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
+ +V+++C+G+ CSIPV +A G CPG+ K+L V A C
Sbjct: 717 ESSESVVKQSCIGRRGCSIPVMAAKFG--GDPCPGIQKSLLVVADC 760
>gi|24417238|gb|AAN60229.1| unknown [Arabidopsis thaliana]
Length = 569
Score = 629 bits (1622), Expect = e-177, Method: Compositional matrix adjust.
Identities = 306/554 (55%), Positives = 388/554 (70%), Gaps = 21/554 (3%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A VTYDH+AL+I+G+RR+L SGSIHYPRSTPE+WP+LI+K+KEGGL+VI+TYVFWN HEP
Sbjct: 27 AVVTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKAKEGGLDVIQTYVFWNGHEP 86
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G YYF+ R+DLV+F K V +AGL+L LRIGPY CAEWN+GGFPVWL ++PG+ FRT N
Sbjct: 87 SPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNFGGFPVWLKYVPGMVFRTDN 146
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M++F KI+D+MK+E LF +QGGPIIL+Q+ENEYG ++W G G+ Y KW A+
Sbjct: 147 EPFKIAMQKFTKKIVDMMKEEKLFETQGGPIILSQIENEYGPMQWEMGAAGKAYSKWTAE 206
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
A+ L+T VPW+MC+QEDAP PII+TCNGFYC+GF PNS +KP +WTEN++GWF FG A
Sbjct: 207 MALGLSTGVPWIMCKQEDAPYPIIDTCNGFYCEGFKPNSDNKPKLWTENWTGWFTEFGGA 266
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
+P RPVED+AF+VARF + GG+F NYYMY GGTNF RTA G +ATSYDYDAPIDEYG +
Sbjct: 267 IPNRPVEDIAFSVARFIQNGGSFMNYYMYXGGTNFDRTA-GVFIATSYDYDAPIDEYGLL 325
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
R+PK+ HL+ELHK IKLCE L+S DPT LG K E H++ KS CAAFL+NYD+SS
Sbjct: 326 REPKYSHLKELHKVIKLCEPALVSVDPTITSLGDKQEIHVF-KSKTSCAAFLSNYDTSSA 384
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
A V F G Y LP WSVSILPDCK +NTAK+ + P K ++ S+ F
Sbjct: 385 ARVMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRA-------PTILMK----MIPTSTKF 433
Query: 423 SWYEEKVGISGNR---SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
SW G + +FV+ L EQI+ T+D +DY WY I + + G L
Sbjct: 434 SWESYNEGSPSSNEAGTFVKDGLVEQISMTRDKTDYFWYFTDITIGSDESFLKTGDNPLL 493
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
I S GHA VFVN L YG + ++ I+L+ GIN L +LS VGL N G
Sbjct: 494 TIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQNIKLSVGINKLALLSTAVGLPNAGVH 553
Query: 535 FDVAGAGLFSVILI 548
++ G+ + +
Sbjct: 554 YETWNTGILGPVTL 567
>gi|356518551|ref|XP_003527942.1| PREDICTED: beta-galactosidase 16-like [Glycine max]
Length = 697
Score = 628 bits (1620), Expect = e-177, Method: Compositional matrix adjust.
Identities = 340/710 (47%), Positives = 444/710 (62%), Gaps = 51/710 (7%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
NVTYD R+L+IDG+ ++L SGSIHYPRSTP++WP LI K+KEGGL+VI+TYVFWN HEP
Sbjct: 26 GNVTYDGRSLIIDGQHKILFSGSIHYPRSTPQMWPNLIAKAKEGGLDVIQTYVFWNLHEP 85
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
+GQY F G ++VRF+K +Q GL++ LRIGPY +E YGG P+WLH IPGI FR+ N
Sbjct: 86 QQGQYDFRGMRNIVRFIKEIQAQGLYVTLRIGPYIESECTYGGLPLWLHDIPGIVFRSDN 145
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
FK M++F AKI++LMK NLFASQGGPIIL+Q+ENEYGNVE A+ G Y++WAA
Sbjct: 146 EQFKFHMQKFSAKIVNLMKSANLFASQGGPIILSQIENEYGNVEGAFHEKGLSYIRWAAQ 205
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFLSF 239
AV L T VPWVMC+Q++APDP+INTCNG C G T PNSP+KP +WTEN++ ++ F
Sbjct: 206 MAVGLQTGVPWVMCKQDNAPDPVINTCNGMQC-GKTFKGPNSPNKPSLWTENWTSFYQVF 264
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
G R ED+A+ VA F G++ NYYMY GGTNF R A ++ YD +AP+DEY
Sbjct: 265 GEVPYIRSAEDIAYNVALFIAKRGSYVNYYMYHGGTNFDRIASAFVITAYYD-EAPLDEY 323
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
G +R+PKWGHL+ELH AIK C ++ T LG + A+++ +SS +CAAFL N +
Sbjct: 324 GLVREPKWGHLKELHAAIKSCSNSILHGTQTSFSLGTQQNAYVFKRSSIECAAFLENTED 383
Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
S + F Y LP S+SILPDCKNV FNTAKV Q + + L +
Sbjct: 384 QS-VTIQFQNIPYQLPPNSISILPDCKNVAFNTAKVSIQN---------ARAMKSQLEFN 433
Query: 420 SAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIE 477
SA +W Y+E + G+ S L +QI+TTKDTSDYLWYT ++ + + L+
Sbjct: 434 SAETWKVYKEAIPSFGDTSLRANTLLDQISTTKDTSDYLWYTFRLYDNSPNAQSI-LSAY 492
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
S GH FVN LV +G+H +F++ K+ L G+N + LS VGL N GA+ +
Sbjct: 493 SHGHVLHAFVNGNLVGSIHGSHKNLSFVMENKLNLINGMNNISFLSATVGLPNSGAYLER 552
Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK--QGSTLPVN 595
AGL S LK RD ++ W YQ+G+ GE + + S ++ W+ Q ST P
Sbjct: 553 RVAGLRS-----LKVQGRDFTNQAWGYQIGLLGEKLQIYTASGSSKVQWESFQSSTKP-- 605
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
L WYKTTF AP G P+ LNL SMGKG W+NGQ IGRYW ++ P
Sbjct: 606 --LTWYKTTFDAPVGNDPVVLNLGSMGKGYTWINGQGIGRYWVSFHTPQ----------- 652
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
G P+Q YHIPR+ + NLLV+ EE G+P I+L T
Sbjct: 653 -----------GTPSQKWYHIPRSLLKSTGNLLVLLEEETGNPLGITLDT 691
>gi|297724143|ref|NP_001174435.1| Os05g0428100 [Oryza sativa Japonica Group]
gi|75137607|sp|Q75HQ3.1|BGAL7_ORYSJ RecName: Full=Beta-galactosidase 7; Short=Lactase 7; Flags:
Precursor
gi|46391137|gb|AAS90664.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|53981746|gb|AAV25023.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|255676388|dbj|BAH93163.1| Os05g0428100 [Oryza sativa Japonica Group]
Length = 775
Score = 625 bits (1611), Expect = e-176, Method: Compositional matrix adjust.
Identities = 354/836 (42%), Positives = 473/836 (56%), Gaps = 103/836 (12%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
L +TYD RALV+ G RR+ SG +HY RSTPE+WP+LI K+K GGL+VI+TYVFWN H
Sbjct: 25 LGREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVH 84
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EPI+GQY FEGR+DLV+F++ +Q GL++ LRIGP+ AEW YGGFP WLH +P I FR+
Sbjct: 85 EPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRS 144
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK+ M+ F+ KI+ +MK E L+ QGGPII++Q+ENEY +E A+G G YV+WA
Sbjct: 145 DNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWA 204
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGW--- 235
A AV L T VPW+MC+Q DAPDP+INTCNG C + F PNSP+KP +WTEN++
Sbjct: 205 AAMAVGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRSNG 264
Query: 236 -------FLSFGYAVPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVA 287
+ +G R ED+AFAVA F G+F +YYMY GGTNFGR A V
Sbjct: 265 QNNSAFSYPIYGNDTKLRAPEDIAFAVALFIARKKGSFVSYYMYHGGTNFGRFAAS-YVT 323
Query: 288 TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSS 347
TSY AP+DEY F
Sbjct: 324 TSYYDGAPLDEYDF---------------------------------------------- 337
Query: 348 NDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFA 407
C AFL N+D + V F L S+S+L DC+NVVF TAKV +Q +
Sbjct: 338 -KCVAFLVNFDQHNTPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNAQHGSRTANAV 396
Query: 408 QQKN-VNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMP 466
Q N +N K +GN+ F EQ+ TTKD +DYLWY S
Sbjct: 397 QSLNDINNWKAFIEPVPQDLSKSTYTGNQLF------EQLTTTKDETDYLWYIVSYKNRA 450
Query: 467 GQGKEV-FLNIESLGHAALVFVNKKLVAFGYGNHDF-ANFLINKKIELNEGINTLDILSM 524
G ++ L ++SL H FVN + V +G+HD N ++N + L EG NT+ +LS+
Sbjct: 451 SDGNQIAHLYVKSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEGDNTISLLSV 510
Query: 525 MVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSS 584
MVG + GA+ + G+ +V + + L++ W YQVG+ GE + NS
Sbjct: 511 MVGSPDSGAYMERRTFGIQTVGIQQGQQPMHLLNNDLWGYQVGLFGEKDSIYTQEGTNSV 570
Query: 585 FWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPS 644
W + L + L WYKTTF P G + LNL SMGKG+ WVNG+SIGRYW ++ APS
Sbjct: 571 RWMDINNL-IYHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEVWVNGESIGRYWVSFKAPS 629
Query: 645 TGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLL 704
GQP+Q+LYHIPR ++ P +NLLV+ EE+GGDP +I++
Sbjct: 630 ----------------------GQPSQSLYHIPRGFLTPKDNLLVLVEEMGGDPLQITVN 667
Query: 705 TKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNC 764
T + +C V E PP+ S P+VR+ C+ G I++I FASYG P G+C
Sbjct: 668 TMSVTTVCGNVDEFSVPPLQS-------RGKVPKVRIWCQGGNRISSIEFASYGNPVGDC 720
Query: 765 GSFRPGACHMDVL-PIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
SFR G+CH + +V+++C+G+ CSIPV +A G CPG+ K+L V A C
Sbjct: 721 RSFRIGSCHAESSESVVKQSCIGRRGCSIPVMAAKFG--GDPCPGIQKSLLVVADC 774
>gi|30697899|ref|NP_568978.2| beta-galactosidase 6 [Arabidopsis thaliana]
gi|75170268|sp|Q9FFN4.1|BGAL6_ARATH RecName: Full=Beta-galactosidase 6; Short=Lactase 6; Flags:
Precursor
gi|10177061|dbj|BAB10473.1| beta-galactosidase [Arabidopsis thaliana]
gi|332010416|gb|AED97799.1| beta-galactosidase 6 [Arabidopsis thaliana]
Length = 718
Score = 624 bits (1608), Expect = e-176, Method: Compositional matrix adjust.
Identities = 329/705 (46%), Positives = 433/705 (61%), Gaps = 36/705 (5%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD R+L+IDG+R++L SGSIHYPRSTPE+WP LI+K+KEGG++VI+TYVFWN HEP
Sbjct: 32 VTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPSLIKKTKEGGIDVIQTYVFWNLHEPKL 91
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
GQY F GR DLV+F+K ++ GL++ LRIGP+ AEWNYGG P WL +PG+ +RT N P
Sbjct: 92 GQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIEAEWNYGGLPFWLRDVPGMVYRTDNEP 151
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK M++F AKI+DLMK E L+ASQGGPIIL+Q+ENEY NVE A+ G Y+KWA A
Sbjct: 152 FKFHMQKFTAKIVDLMKSEGLYASQGGPIILSQIENEYANVEGAFHEKGASYIKWAGQMA 211
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
V L T VPW+MC+ DAPDP+INTCNG C + F PNSP+KP MWTE+++ +F +G
Sbjct: 212 VGLKTGVPWIMCKSPDAPDPVINTCNGMKCGETFPGPNSPNKPKMWTEDWTSFFQVYGKE 271
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
R ED+AF A F G++ NYYMY GGTNFGRT+ + YD AP+DEYG +
Sbjct: 272 PYIRSAEDIAFHAALFVAKNGSYINYYMYHGGTNFGRTSSSYFITGYYD-QAPLDEYGLL 330
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPK+GHL+ELH AIK L+ T LG +A+++ ++N C AFL N D+ +
Sbjct: 331 RQPKYGHLKELHAAIKSSANPLLQGKQTILSLGPMQQAYVFEDANNGCVAFLVNNDAKA- 389
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
+ + F N Y L S+ IL +CKN+++ TAKV + N Q NV + +
Sbjct: 390 SQIQFRNNAYSLSPKSIGILQNCKNLIYETAKVNVKMNTRVTTPVQVFNVPD------NW 443
Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-MPGQGKEVFLNIESLGH 481
+ + E + S L E N TKD +DYLWYT+S + P ++ ES GH
Sbjct: 444 NLFRETIPAFPGTSLKTNALLEHTNLTKDKTDYLWYTSSFKLDSPCTNPSIY--TESSGH 501
Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
VFVN L G+G+ D + + L G N + ILS MVGL + GA+ + G
Sbjct: 502 VVHVFVNNALAGSGHGSRDIRVVKLQAPVSLINGQNNISILSGMVGLPDSGAYMERRSYG 561
Query: 542 LFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST-LPVNKSLIW 600
L V + DLS +W Y VG+ GE + L + N W L N+ L W
Sbjct: 562 LTKVQISCGGTKPIDLSRSQWGYSVGLLGEKVRLYQWKNLNRVKWSMNKAGLIKNRPLAW 621
Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
YKTTF P G GP+ L+++SMGKG+ WVNG+SIGRYW ++L P+
Sbjct: 622 YKTTFDGPNGDGPVGLHMSSMGKGEIWVNGESIGRYWVSFLTPA---------------- 665
Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
GQP+Q++YHIPR ++ P NLLV+ EE GGDP ISL T
Sbjct: 666 ------GQPSQSIYHIPRAFLKPSGNLLVVFEEEGGDPLGISLNT 704
>gi|6686884|emb|CAB64742.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 718
Score = 624 bits (1608), Expect = e-176, Method: Compositional matrix adjust.
Identities = 329/705 (46%), Positives = 433/705 (61%), Gaps = 36/705 (5%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD R+L+IDG+R++L SGSIHYPRSTPE+WP LI+K+KEGG++VI+TYVFWN HEP
Sbjct: 32 VTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPSLIKKTKEGGIDVIQTYVFWNLHEPKL 91
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
GQY F GR DLV+F+K ++ GL++ LRIGP+ AEWNYGG P WL +PG+ +RT N P
Sbjct: 92 GQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIEAEWNYGGLPFWLRDVPGMVYRTDNEP 151
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK M++F AKI+DLMK E L+ASQGGPIIL+Q+ENEY NVE A+ G Y+KWA A
Sbjct: 152 FKFHMQKFTAKIVDLMKSEGLYASQGGPIILSQIENEYANVEGAFHEKGASYIKWAGQMA 211
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
V L T VPW+MC+ DAPDP+INTCNG C + F PNSP+KP MWTE+++ +F +G
Sbjct: 212 VGLKTGVPWIMCKSPDAPDPVINTCNGMKCGETFPGPNSPNKPKMWTEDWTSFFQVYGKE 271
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
R ED+AF A F G++ NYYMY GGTNFGRT+ + YD AP+DEYG +
Sbjct: 272 PYIRSAEDIAFHAALFVAKNGSYINYYMYHGGTNFGRTSSSYFITGYYD-QAPLDEYGLL 330
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPK+GHL+ELH AIK L+ T LG +A+++ ++N C AFL N D+ +
Sbjct: 331 RQPKYGHLKELHAAIKSSANPLLQGKQTILSLGPMQQAYVFEDANNGCVAFLVNNDAKA- 389
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
+ + F N Y L S+ IL +CKN+++ TAKV + N Q NV + +
Sbjct: 390 SQIQFRNNAYSLSPKSIGILQNCKNLIYETAKVNVKMNTRVTTPVQVFNVPD------NW 443
Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-MPGQGKEVFLNIESLGH 481
+ + E + S L E N TKD +DYLWYT+S + P ++ ES GH
Sbjct: 444 NLFRETIPASQAHLLKTNALLEHTNLTKDKTDYLWYTSSFKLDSPCTNPSIY--TESSGH 501
Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
VFVN L G+G+ D + + L G N + ILS MVGL + GA+ + G
Sbjct: 502 VVHVFVNNALAGSGHGSRDIRVVKLQAPVSLINGQNNISILSGMVGLPDSGAYMERRSYG 561
Query: 542 LFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST-LPVNKSLIW 600
L V + DLS +W Y VG+ GE + L + N W L N+ L W
Sbjct: 562 LTKVQISCGGTKPIDLSRSQWGYSVGLLGEKVRLYQWKNLNRVKWSMNKAGLIKNRPLAW 621
Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
YKTTF P G GP+ L+++SMGKG+ WVNG+SIGRYW ++L P+
Sbjct: 622 YKTTFDGPNGDGPVGLHMSSMGKGEIWVNGESIGRYWVSFLTPA---------------- 665
Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
GQP+Q++YHIPR ++ P NLLV+ EE GGDP ISL T
Sbjct: 666 ------GQPSQSIYHIPRAFLKPSGNLLVVFEEEGGDPLGISLNT 704
>gi|110739416|dbj|BAF01618.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 718
Score = 623 bits (1607), Expect = e-175, Method: Compositional matrix adjust.
Identities = 329/705 (46%), Positives = 433/705 (61%), Gaps = 36/705 (5%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD R+L+IDG+R++L SGSIHYPRSTPE+WP LI+K+KEGG++VI+TYVFWN HEP
Sbjct: 32 VTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPSLIKKAKEGGIDVIQTYVFWNLHEPKL 91
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
GQY F GR DLV+F+K ++ GL++ LRIGP+ AEWNYGG P WL +PG+ +RT N P
Sbjct: 92 GQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIEAEWNYGGLPFWLRDVPGMVYRTDNEP 151
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK M++F AKI+DLMK E L+ASQGGPIIL+Q+ENEY NVE A+ G Y+KWA A
Sbjct: 152 FKFHMQKFTAKIVDLMKSEGLYASQGGPIILSQIENEYANVEGAFHEKGASYIKWAGQMA 211
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
V L T VPW+MC+ DAPDP+INTCNG C + F PNSP+KP MWTE+++ +F +G
Sbjct: 212 VGLKTGVPWIMCKSPDAPDPVINTCNGMKCGETFPGPNSPNKPKMWTEDWTSFFQVYGKE 271
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
R ED+AF A F G++ NYYMY GGTNFGRT+ + YD AP+DEYG +
Sbjct: 272 PYIRSAEDIAFHAALFVAKNGSYINYYMYHGGTNFGRTSSSYFITGYYD-QAPLDEYGLL 330
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPK+GHL+ELH AIK L+ T LG +A+++ ++N C AFL N D+ +
Sbjct: 331 RQPKYGHLKELHAAIKSSANPLLQGKQTILSLGPMQQAYVFEDANNGCVAFLVNNDAKA- 389
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
+ + F N Y L S+ IL +CKN+++ TAKV + N Q NV + +
Sbjct: 390 SQIQFRNNAYSLSPKSIGILQNCKNLIYETAKVNVKMNTRVTTPVQVFNVPD------NW 443
Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-MPGQGKEVFLNIESLGH 481
+ + E + S L E N TKD +DYLWYT+S + P ++ ES GH
Sbjct: 444 NLFRETIPAFPGTSLKTNALLEHTNLTKDKTDYLWYTSSFKLDSPCTNPSIY--TESSGH 501
Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
VFVN L G+G+ D + + L G N + ILS MVGL + GA+ + G
Sbjct: 502 VVHVFVNNALAGSGHGSRDIRVVKLQAPVSLINGQNNISILSGMVGLPDSGAYMERRSYG 561
Query: 542 LFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST-LPVNKSLIW 600
L V + DLS +W Y VG+ GE + L + N W L N+ L W
Sbjct: 562 LTKVQISCGGTKPIDLSRSQWGYSVGLLGEKVRLYQWKNLNRVKWSMNKAGLIKNRPLAW 621
Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
YKTTF P G GP+ L+++SMGKG+ WVNG+SIGRYW ++L P+
Sbjct: 622 YKTTFDGPNGDGPVGLHMSSMGKGEIWVNGESIGRYWVSFLTPA---------------- 665
Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
GQP+Q++YHIPR ++ P NLLV+ EE GGDP ISL T
Sbjct: 666 ------GQPSQSIYHIPRAFLKPSGNLLVVFEEEGGDPLGISLNT 704
>gi|297793965|ref|XP_002864867.1| beta-galactosidase 6 [Arabidopsis lyrata subsp. lyrata]
gi|297310702|gb|EFH41126.1| beta-galactosidase 6 [Arabidopsis lyrata subsp. lyrata]
Length = 716
Score = 620 bits (1600), Expect = e-175, Method: Compositional matrix adjust.
Identities = 328/705 (46%), Positives = 430/705 (60%), Gaps = 36/705 (5%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD R+L+IDG+R++L SGSIHYPRSTPE+WP LI+K+KEGG++VI+TYVFWN HEP
Sbjct: 30 VTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPSLIKKTKEGGIDVIQTYVFWNLHEPKL 89
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
GQY F GR DLV+F+K ++ GL++ LRIGP+ AEWNYGG P WL +PG+ +RT N P
Sbjct: 90 GQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIEAEWNYGGLPFWLRDVPGMVYRTDNEP 149
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK M++F KI++LMK E L+ASQGGPIIL+Q+ENEY NVE A+ G Y+KWA A
Sbjct: 150 FKFHMQKFTTKIVNLMKSEGLYASQGGPIILSQIENEYANVEAAFHEKGASYIKWAGQMA 209
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
V L T VPW+MC+ DAPDP+INTCNG C + F PNSP+KP MWTE+++ +F +G
Sbjct: 210 VGLKTGVPWIMCKSPDAPDPVINTCNGMRCGETFPGPNSPNKPKMWTEDWTSFFQVYGTE 269
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
R ED+AF F G++ NYYMY GGTNFGRT+ + YD AP+DEYG +
Sbjct: 270 PYIRSAEDIAFHAVLFIAKNGSYINYYMYHGGTNFGRTSSSYFITGYYD-QAPLDEYGLL 328
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
RQPK+GHL+ELH AIK L+ T LG +A+++ +S+ C AFL N D+
Sbjct: 329 RQPKYGHLKELHAAIKSSANPLLQGKQTILSLGPMQQAYVFEDASSGCVAFLVNNDAKV- 387
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
+ + F + Y L S+ IL +CKN+++ TAKV ++N Q NV E +
Sbjct: 388 SQIQFRKSSYSLSPKSIGILQNCKNLIYETAKVNVEKNKRVTTPVQVFNVPE------KW 441
Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-MPGQGKEVFLNIESLGH 481
+ E + S L E N TKD +DYLWYT+S P ++ IES GH
Sbjct: 442 EGFRETIPAFSGTSLKANALLEHTNLTKDKTDYLWYTSSFKPDSPCTNPSIY--IESSGH 499
Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
VFVN L G+G+ D + L G N++ ILS MVGL + GA+ + G
Sbjct: 500 VVHVFVNNALAGSGHGSRDIKVVKLQVPASLTNGQNSISILSGMVGLPDSGAYMERKSYG 559
Query: 542 LFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST-LPVNKSLIW 600
L V + DLS +W Y VG+ GE + L + N W + L N+ LIW
Sbjct: 560 LTKVQISCGGTKPIDLSGSQWGYSVGLLGEKVRLQQWRNLNRVKWSMNNAGLIKNRPLIW 619
Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
YKT F P G GP+ LN++SMGKG+ WVNG+SIGRYW ++L PS
Sbjct: 620 YKTIFDGPNGDGPVGLNMSSMGKGEIWVNGESIGRYWVSFLTPS---------------- 663
Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
G P+Q++YHIPR ++ P NLLV+ EE GGDP ISL T
Sbjct: 664 ------GHPSQSIYHIPREFLKPSGNLLVVFEEEGGDPLGISLNT 702
>gi|414888321|tpg|DAA64335.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
Length = 837
Score = 619 bits (1596), Expect = e-174, Method: Compositional matrix adjust.
Identities = 327/828 (39%), Positives = 469/828 (56%), Gaps = 46/828 (5%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD R+L+IDGKR + SG+IHYPRS PEVWP+LI ++KEGGL IETY+FWN HEP
Sbjct: 36 VTYDGRSLMIDGKRDLFFSGAIHYPRSPPEVWPKLIERAKEGGLNTIETYIFWNAHEPEP 95
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G+Y FEGRFDL++++K +QE ++ +RIGP+ AEWN+GG P WL I I FR N+P
Sbjct: 96 GKYNFEGRFDLIKYLKMIQEHDMYAIVRIGPFIQAEWNHGGLPYWLREIDHIIFRANNDP 155
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
+K+EM++F+ I+ +K LFASQGGPIIL Q+ENEYGN++ + G+ Y++WAA A
Sbjct: 156 YKKEMEKFVRFIVQKLKDAELFASQGGPIILTQIENEYGNIKKDHATDGDKYLEWAAQMA 215
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
++ T VPW+MC+Q AP +I TCNG +C D +T +KP++WTEN++ F ++G V
Sbjct: 216 LSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWTLRDKNKPMLWTENWTQQFRAYGDQV 275
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
R ED+A+AV RFF GG+ NYYMY GGTNFGRT G V T Y +AP+DEYG +
Sbjct: 276 AMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMYK 334
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYH-KSSNDCAAFLANYDSSSD 362
+PK+GHLR+LH I+ ++ + + + LG EAHI+ N C +FL+N ++ D
Sbjct: 335 EPKFGHLRDLHNVIRSYQKAFLLGKHSSEILGHGYEAHIFELPEENLCLSFLSNNNTGED 394
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
V F G +++P+ SVSIL CKNVV+NT +V Q N + + +E+ ++ +
Sbjct: 395 GTVIFRGEKHYVPSRSVSILAGCKNVVYNTKRVFVQHNERSY------HTSEVTSKNNQW 448
Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGK-EVFLNIE 477
Y EK+ + + EQ N TKD SDYLWYT S + +P + L ++
Sbjct: 449 EMYSEKIPKYRDTKVRMKEPLEQFNQTKDASDYLWYTTSFRLESDDLPFRNDIRPVLQVK 508
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
S H+ + F N V G+ F+ K ++L G+N + +LS +G+++ G
Sbjct: 509 SSAHSMMGFANDAFVGCARGSKQVKGFMFEKPVDLKVGVNHVVLLSSTMGMKDSGGELAE 568
Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
+G+ ++ L G DL W ++ +EGE + WK ++
Sbjct: 569 VKSGIQECLIQGLNTGTLDLQVNGWGHKAALEGEDKEIYSEKGVGKVQWKPAEN---GRA 625
Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
WYK F P+G P+ L+++SM KG +VNG+ +GRYW +Y
Sbjct: 626 ATWYKRYFDEPDGDDPVVLDMSSMDKGMIFVNGEGVGRYWVSY----------------- 668
Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
+ G P+Q LYHIPR ++ +NLLV+ EE G P I + T T IC F+SE
Sbjct: 669 -----RTLAGTPSQALYHIPRPFLKSKDNLLVVFEEEMGKPDGILVQTVTRDDICLFISE 723
Query: 718 ADPPPVDSW-----KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
+P + +W K L S + L C I + FAS+G PEG CG+F G C
Sbjct: 724 HNPGQIKTWDTDGDKIKLIAEDHSRRGTLMCPPEKTIQEVVFASFGNPEGMCGNFTVGTC 783
Query: 773 HM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
H + IV+K C+G+ C +PV G C L V+ C
Sbjct: 784 HTPNAKQIVEKECLGKPSCMLPVDHTVYGADIN-CQSTTATLGVQVRC 830
>gi|357463559|ref|XP_003602061.1| Beta-galactosidase [Medicago truncatula]
gi|355491109|gb|AES72312.1| Beta-galactosidase [Medicago truncatula]
Length = 694
Score = 617 bits (1592), Expect = e-174, Method: Compositional matrix adjust.
Identities = 335/709 (47%), Positives = 438/709 (61%), Gaps = 49/709 (6%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
ANVTYD +LVI+G ++L SGSIHYPRSTP++WP+LI K+KEGGL+VI+TYVFWN HEP
Sbjct: 24 ANVTYDRTSLVINGHHKILFSGSIHYPRSTPQMWPDLISKAKEGGLDVIQTYVFWNLHEP 83
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
+GQY F GRFDLV F+K +Q GL++ LRIGPY +E YGG P+WLH +PGI FRT N
Sbjct: 84 QQGQYEFNGRFDLVGFIKEIQAQGLYVTLRIGPYIESECTYGGLPLWLHDVPGIVFRTDN 143
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
+ FK M+RF KI+++MK NLFASQGGPIIL+Q+ENEYG+++ + G Y+ WAA
Sbjct: 144 DQFKFHMQRFTTKIVNMMKSANLFASQGGPIILSQIENEYGSIQSKFRANGLPYIHWAAQ 203
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC--DGFTPNSPSKPIMWTENYSGWFLSFG 240
AV L T VPW+MC+Q+DAPDP+IN CNG C + PNSP+KP +WTEN++ + +FG
Sbjct: 204 MAVGLQTGVPWMMCKQDDAPDPVINACNGMQCGRNFKGPNSPNKPSLWTENWTSFLQAFG 263
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
A R D+A+ VA F G++ NYYMY GGTNF R A ++ YD +AP+DEYG
Sbjct: 264 GAPYMRSASDIAYNVALFIAKKGSYVNYYMYHGGTNFDRLASAFIITAYYD-EAPLDEYG 322
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+RQPKWGHL+ELH +IK C + L+ T LG++ +A+++ +SS +CAAFL N
Sbjct: 323 LVRQPKWGHLKELHASIKSCSQPLLDGTQTTFSLGSEQQAYVF-RSSTECAAFLEN-SGP 380
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
D + F Y LP S+SILP CKNVVFNT KV Q N + + L +S
Sbjct: 381 RDVTIQFQNISYELPGKSISILPGCKNVVFNTGKVSIQNN--------VRAMKPRLQFNS 432
Query: 421 AFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
A +W Y E + + S L +QI+T KDTSDY+WYT + K V L+I S
Sbjct: 433 AENWKVYTEAIPNFAHTSKRADTLLDQISTAKDTSDYMWYTFRFNNKSPNAKSV-LSIYS 491
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
G F+N L +G+ + + K + L G+N + ILS VGL N GA+ +
Sbjct: 492 QGDVLHSFINGVLTGSAHGSRNNTQVTMKKNVNLINGMNNISILSATVGLPNSGAFLESR 551
Query: 539 GAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK--QGSTLPVNK 596
AGL V + RD SS W YQVG+ GE + + +S ++ WK Q ST P
Sbjct: 552 VAGLRKVEV-----QGRDFSSYSWGYQVGLLGEKLQIFTVSGSSKVQWKSFQSSTKP--- 603
Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
L WY+TTF AP G P+ +NL SMGKG AWVNGQ IGRYW ++ P
Sbjct: 604 -LTWYQTTFHAPAGNDPVVVNLGSMGKGLAWVNGQGIGRYWVSFHKPD------------ 650
Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
G P+Q YHIPR+++ NLLVI EE G+P I+L T
Sbjct: 651 ----------GTPSQQWYHIPRSFLKSTGNLLVILEEETGNPLGITLDT 689
>gi|242090613|ref|XP_002441139.1| hypothetical protein SORBIDRAFT_09g021140 [Sorghum bicolor]
gi|241946424|gb|EES19569.1| hypothetical protein SORBIDRAFT_09g021140 [Sorghum bicolor]
Length = 784
Score = 612 bits (1579), Expect = e-172, Method: Compositional matrix adjust.
Identities = 350/820 (42%), Positives = 467/820 (56%), Gaps = 81/820 (9%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
V+ D RALV+DG RR+L +G +HY RSTPE+WP+LI K+KEGGL++I+TYVFWN HEP+
Sbjct: 41 QVSLDARALVVDGTRRLLFAGEMHYTRSTPEMWPKLIAKAKEGGLDMIQTYVFWNVHEPV 100
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+GQY FEGR+DLVRF+K +Q GL++ LRIGP+ +EW YGGFP WLH +P I FR+ N
Sbjct: 101 QGQYNFEGRYDLVRFIKEIQAQGLYVSLRIGPFIESEWKYGGFPFWLHDVPNITFRSDNE 160
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK+ M+RF+ I+++MK E L+ QGGPII +Q+ENEY VE A+G G+ YV WAA
Sbjct: 161 PFKQHMQRFVTDIVNMMKHEGLYYPQGGPIITSQIENEYQMVEHAFGSSGQRYVSWAAAM 220
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
AV+ T VPW MC+Q DAPDP++ G + + P N S +L +G
Sbjct: 221 AVDRQTGVPWTMCKQNDAPDPVV----GIHSHTIPLDFP--------NASRNYLIYGNDT 268
Query: 244 PFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
R ED+AFAV F G++ +YYMY GGTNFGR A V TSY AP+DEYG I
Sbjct: 269 KLRSPEDIAFAVVYFIARKNGSYVSYYMYHGGTNFGRFASS-YVTTSYYDAAPLDEYGLI 327
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
QP WGHLRELH A+K E L+ ++ LG + EAHI+ S C AFL N+D
Sbjct: 328 WQPTWGHLRELHAAVKQSSEPLLFGTYSYLSLGQEQEAHIFETES-QCVAFLVNFDRHHI 386
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQ-KNVNELLLASSA 421
+ V F L S+SIL DCK VVF TAKV +Q + Q ++N
Sbjct: 387 SEVVFRNISLELAPKSISILSDCKRVVFETAKVTAQHGSRTAEEVQSFSDINTWTAFKEP 446
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGH 481
K SGNR F E ++TTKD +DYLWY + F NI LG
Sbjct: 447 IPQDVSKAMYSGNRLF------EHLSTTKDDTDYLWYIVGL----------FHNI--LGR 488
Query: 482 AALVFVNKKLVAFGYGNHDF-ANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
+G+H AN ++N I L EG NT+ +LS MVG + GA +
Sbjct: 489 I-------------HGSHGGPANIILNTNISLKEGPNTISLLSAMVGSPDSGAHMERRVF 535
Query: 541 GLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIW 600
GL V + + + L++ W YQVG+ GE + + S W L + L W
Sbjct: 536 GLQKVSIQQGQEPENLLNNELWGYQVGLFGERNSIYTQEGSKSVEWTTIYNLAYSP-LTW 594
Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
YKTTF P G + LNL MGKG+ WVNG+SIGRYW ++ APS
Sbjct: 595 YKTTFSTPAGNDAVTLNLTGMGKGEVWVNGESIGRYWVSFKAPS---------------- 638
Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADP 720
G P+Q+LYHIPR +++P +N+LV+ EE+GG+P +I++ T + +C V+E
Sbjct: 639 ------GNPSQSLYHIPRQFLNPQDNILVLFEEMGGNPQQITVNTVSVTRVCVNVNELS- 691
Query: 721 PPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPI 779
P+L + P V L C+ G I+AI FASYG P G+C R G+CH +
Sbjct: 692 ------APSLQYKNKEPAVDLRCQEGKQISAIEFASYGNPIGDCKKIRFGSCHAGSSESV 745
Query: 780 VQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
V++AC+G+ CSIP++ G CPG+ K+L V A+C
Sbjct: 746 VKQACLGKSGCSIPITPIKFG--GDPCPGIKKSLLVVANC 783
>gi|357154419|ref|XP_003576777.1| PREDICTED: beta-galactosidase 12-like [Brachypodium distachyon]
Length = 835
Score = 611 bits (1576), Expect = e-172, Method: Compositional matrix adjust.
Identities = 327/828 (39%), Positives = 468/828 (56%), Gaps = 46/828 (5%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V+YD R+L+IDGKR + SG+IHYPRS PE+WP+L+ ++K+GGL IETYVFWN HEP
Sbjct: 33 VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWPKLLDRAKDGGLNTIETYVFWNAHEPEP 92
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G+Y FEGR DL++F+K +Q+ ++ +RIGP+ AEWN+GG P WL IP I FR N P
Sbjct: 93 GKYNFEGRCDLIKFLKLIQDNDMYAVIRIGPFIQAEWNHGGLPYWLREIPHIIFRANNEP 152
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
+K+EM++F+ I+ +K ++FASQGGPIILAQ+ENEYGN++ + G+ Y++WAA+ A
Sbjct: 153 YKKEMEKFVRFIVQKLKDADMFASQGGPIILAQIENEYGNIKKDHITDGDKYLEWAAEMA 212
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
++ N +PW+MC+Q AP +I TCNG +C D +T +KP +WTEN++ F +FG
Sbjct: 213 LSTNIGIPWIMCKQTTAPGVVIPTCNGRHCGDTWTLRDKNKPRLWTENWTAQFRAFGDQA 272
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
R ED+A++V RFF GGT NYYMY+GGTNFGRT G V T Y +APIDEYG +
Sbjct: 273 AVRSAEDIAYSVLRFFAKGGTLVNYYMYYGGTNFGRT-GASYVLTGYYDEAPIDEYGLNK 331
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYH-KSSNDCAAFLANYDSSSD 362
+PK+GHLR+LHK IK + + + + LG EAH Y N C AF++N ++ D
Sbjct: 332 EPKFGHLRDLHKLIKSYHKAFLVGKQSFELLGHGYEAHNYELPEENLCLAFISNNNTGED 391
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
V F G Y++P+ SVSIL DC +VV+NT +V Q + A + N + +
Sbjct: 392 GTVMFRGKKYYIPSRSVSILADCNHVVYNTKRVFVQHSERSFHTADESTKN------NVW 445
Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MP-GQGKEVFLNIE 477
Y E + S + EQ N TKD SDYLWYT S + +P + + ++
Sbjct: 446 EMYSEPIPRYKVTSVRTKEPLEQYNLTKDKSDYLWYTTSFRLEADDLPFRRDIRPVVQVK 505
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
S HA + FVN G G+ FL K I+L GIN L +LS +G+++ G
Sbjct: 506 SSAHAMMGFVNDAFAGSGRGSKKDKGFLFEKPIDLRIGINHLALLSSSMGMKDSGGELVE 565
Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
G+ ++ L G DL W +++ ++GE + + WK +
Sbjct: 566 VKGGIQDCMIQGLNTGTLDLQGNGWGHKINLDGEDKEIYTEKGMGTVKWKPAEN---GHA 622
Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
+ WY+ F P+G P+ L+++SM KG +VNG+ +GRYW++Y
Sbjct: 623 VTWYRRYFDEPDGDDPVVLDMSSMSKGMIFVNGEGVGRYWTSY----------------- 665
Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
+ G P+Q+LYHIPR ++ +NLLV+ EE G P I + T IC +SE
Sbjct: 666 -----KTIAGLPSQSLYHIPRPFLKSKKNLLVVFEEEIGKPEGILIQTVRRDDICFLMSE 720
Query: 718 ADPPPVDSWKPNLGVV-----SSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
+P V +W + G + S + L C I + FAS+G PEG CG+F G C
Sbjct: 721 HNPAQVKTWDADGGQIKLIAEDHSSRGILTCPHKKTIEEVVFASFGNPEGACGNFTAGTC 780
Query: 773 HM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
H + V K C+G+ C +P+ G CP LAV+ C
Sbjct: 781 HTPNAKEFVAKECLGKKSCVLPLIHTLYGADIN-CPTTTATLAVQVRC 827
>gi|222642000|gb|EEE70132.1| hypothetical protein OsJ_30164 [Oryza sativa Japonica Group]
Length = 838
Score = 610 bits (1573), Expect = e-171, Method: Compositional matrix adjust.
Identities = 328/831 (39%), Positives = 468/831 (56%), Gaps = 48/831 (5%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V+YD R+L+IDGKR + SG+IHYPRS PE+W +L++ +K GGL IETYVFWN HEP
Sbjct: 36 VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWNGHEPEP 95
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G+YYFEGRFDL+RF+ +++ ++ +RIGP+ AEWN+GG P WL I I FR N P
Sbjct: 96 GKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHIIFRANNEP 155
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK EM++F+ I+ +K +FA QGGPIIL+Q+ENEYGN++ V G+ Y++WAA+ A
Sbjct: 156 FKREMEKFVRFIVQKLKDAEMFAPQGGPIILSQIENEYGNIKKDRKVEGDKYLEWAAEMA 215
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
++ VPWVMC+Q AP +I TCNG +C D +T +KP +WTEN++ F +FG +
Sbjct: 216 ISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDKNKPRLWTENWTAQFRTFGDQL 275
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
R ED+A+AV RFF GGT NYYMY GGTNFGRT G V T Y +AP+DEYG +
Sbjct: 276 AQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMCK 334
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSSD 362
+PK+GHLR+LH IK + + + + LG EAH Y + C +FL+N ++ D
Sbjct: 335 EPKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSNNNTGED 394
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
V F G +++P+ SVSIL DCK VV+NT +V Q + + + N + +
Sbjct: 395 GTVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHSERSFHTTDETSKNNV------W 448
Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MP-GQGKEVFLNIE 477
Y E + EQ N TKDTSDYLWYT S + +P + + I+
Sbjct: 449 EMYSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDIRPVIQIK 508
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
S HA + F N V G G+ +F+ K ++L GIN + +LS +G+++ G
Sbjct: 509 STAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKDSGGELVE 568
Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST-LPVNK 596
G+ ++ L G DL W ++ +EGE + WK LP+
Sbjct: 569 VKGGIQDCVVQGLNTGTLDLQGNGWGHKARLEGEDKEIYTEKGMAQFQWKPAENDLPIT- 627
Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
WYK F P+G P+ ++++SM KG +VNG+ IGRYW++++ +
Sbjct: 628 ---WYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWTSFITLA------------ 672
Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
G P+Q++YHIPR ++ P NLL+I EE G P I + T IC F+S
Sbjct: 673 ----------GHPSQSVYHIPRAFLKPKGNLLIIFEEELGKPGGILIQTVRRDDICVFIS 722
Query: 717 EADPPPVDSWKPNLGVV-----SSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
E +P + +W+ + G + +S + L C I + FAS+G PEG CG+F G
Sbjct: 723 EHNPAQIKTWESDGGQIKLIAEDTSTRGTLNCPPKRTIQEVVFASFGNPEGACGNFTAGT 782
Query: 772 CHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCSI 821
CH D IV+K C+G+ C +PV + G CP LAV+ C +
Sbjct: 783 CHTPDAKAIVEKECLGKESCVLPVVNTVYGADIN-CPATTATLAVQVRCKV 832
>gi|413949218|gb|AFW81867.1| hypothetical protein ZEAMMB73_495459 [Zea mays]
Length = 759
Score = 607 bits (1566), Expect = e-171, Method: Compositional matrix adjust.
Identities = 354/832 (42%), Positives = 476/832 (57%), Gaps = 102/832 (12%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ VTY+ RALV+DG RR+L +G +HYPRSTPE+WP+LI K+KEGGL+VI+TYVFWN H
Sbjct: 14 VRGEVTYEQRALVLDGARRMLFAGEMHYPRSTPEMWPKLIAKAKEGGLDVIQTYVFWNVH 73
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EPI+GQY FEGR+DLVRF+K +Q GL++ LRIGP+ +EW YGGFP WLH +P I FR+
Sbjct: 74 EPIQGQYNFEGRYDLVRFIKEIQAQGLYVSLRIGPFIESEWKYGGFPFWLHDVPNITFRS 133
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK+ M+RF+ I+++MK E L+ QGGPII +Q+ENEY VE A+G G+ YV WA
Sbjct: 134 DNEPFKQHMQRFVTDIVNMMKHEGLYYPQGGPIITSQIENEYQMVEPAFGSSGQRYVSWA 193
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A AV+L T VPW MC+Q DAPDP++ +S + P+ + +N S +L +G
Sbjct: 194 AAMAVDLQTGVPWTMCKQNDAPDPVVGI-----------HSYTIPVNF-QNDSRNYLIYG 241
Query: 241 YAVPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
R +D+ FAVA F G++ +YYMY GGTNFGR A V TSY AP+DEY
Sbjct: 242 NDTKLRSPQDITFAVALFIARKNGSYVSYYMYHGGTNFGRFASS-YVTTSYYDGAPLDEY 300
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
G I QP WGHLRELH A+K E L+ ++ +G + EAHI+ ++ C AFL N+D
Sbjct: 301 GLIWQPTWGHLRELHAAVKQSSEPLLFGTYSNLSIGQEQEAHIF-ETETQCVAFLVNFDQ 359
Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
+ V F L S+SIL DCK VVF TAKV +Q + + E+ S
Sbjct: 360 HHISEVVFRNISLELAPKSISILLDCKQVVFETAKVNAQHGS--------RTAEEVQSFS 411
Query: 420 SAFSW--YEE-------KVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGK 470
+W ++E K SGNR F E ++TTKD +DYLWY +
Sbjct: 412 DISTWKAFKEPIPQDVSKSAYSGNRLF------EHLSTTKDATDYLWYIVGL-------- 457
Query: 471 EVFLNIESLGHAALVFVNKKLVAFGYGNHDF-ANFLINKKIELNEGINTLDILSMMVGLQ 529
FLNI LG +G+H AN + + I L EG NT+ +LS MVG
Sbjct: 458 --FLNI--LGRI-------------HGSHGGPANIIFSTNISLQEGPNTISLLSAMVGSP 500
Query: 530 NYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSF--WK 587
+ GA + G+ V + + + L++ W YQVG+ GE + I +S W
Sbjct: 501 DSGAHMERRVFGIRKVSIQQGQEPENLLNNELWGYQVGLFGE---RNNIYTQDSKITEWT 557
Query: 588 QGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGC 647
L + L WYKTTF P G + LNL MGKG+ WVNG+SIGRYW ++ APS
Sbjct: 558 TIDNLTYSP-LTWYKTTFSTPVGNDAVTLNLTGMGKGEVWVNGESIGRYWVSFKAPS--- 613
Query: 648 TKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
G P+Q+LYHIPR +++P +N LV+ EE+GG+P I++ T +
Sbjct: 614 -------------------GNPSQSLYHIPREFLNPQDNTLVLFEEMGGNPQLITVNTMS 654
Query: 708 GQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
+C V+E P+L P V L C G HI+AI FASYG P G+C F
Sbjct: 655 VSRVCGNVNELS-------APSLQYKDKEPAVDLWCPEGKHISAIEFASYGGPTGDCKKF 707
Query: 768 RPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAH 818
G CH +V++AC+G+ CS+PV+ G CPG+ K+L V A+
Sbjct: 708 GFGRCHAGSSESVVKQACLGKSGCSVPVTPIKFG--GDPCPGIQKSLLVVAN 757
>gi|152013365|sp|Q0IZZ8.2|BGL12_ORYSJ RecName: Full=Beta-galactosidase 12; Short=Lactase 12; Flags:
Precursor
Length = 911
Score = 605 bits (1560), Expect = e-170, Method: Compositional matrix adjust.
Identities = 327/826 (39%), Positives = 466/826 (56%), Gaps = 48/826 (5%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V+YD R+L+IDGKR + SG+IHYPRS PE+W +L++ +K GGL IETYVFWN HEP
Sbjct: 36 VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWNGHEPEP 95
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G+YYFEGRFDL+RF+ +++ ++ +RIGP+ AEWN+GG P WL I I FR N P
Sbjct: 96 GKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHIIFRANNEP 155
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK EM++F+ I+ +K +FA QGGPIIL+Q+ENEYGN++ V G+ Y++WAA+ A
Sbjct: 156 FKREMEKFVRFIVQKLKDAEMFAPQGGPIILSQIENEYGNIKKDRKVEGDKYLEWAAEMA 215
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
++ VPWVMC+Q AP +I TCNG +C D +T +KP +WTEN++ F +FG +
Sbjct: 216 ISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDKNKPRLWTENWTAQFRTFGDQL 275
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
R ED+A+AV RFF GGT NYYMY GGTNFGRT G V T Y +AP+DEYG +
Sbjct: 276 AQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMCK 334
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSSD 362
+PK+GHLR+LH IK + + + + LG EAH Y + C +FL+N ++ D
Sbjct: 335 EPKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSNNNTGED 394
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
V F G +++P+ SVSIL DCK VV+NT +V Q + + + N + +
Sbjct: 395 GTVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHSERSFHTTDETSKN------NVW 448
Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MP-GQGKEVFLNIE 477
Y E + EQ N TKDTSDYLWYT S + +P + + I+
Sbjct: 449 EMYSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDIRPVIQIK 508
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
S HA + F N V G G+ +F+ K ++L GIN + +LS +G+++ G
Sbjct: 509 STAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKDSGGELVE 568
Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST-LPVNK 596
G+ ++ L G DL W ++ +EGE + WK LP+
Sbjct: 569 VKGGIQDCVVQGLNTGTLDLQGNGWGHKARLEGEDKEIYTEKGMAQFQWKPAENDLPIT- 627
Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
WYK F P+G P+ ++++SM KG +VNG+ IGRYW++++ +
Sbjct: 628 ---WYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWTSFITLA------------ 672
Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
G P+Q++YHIPR ++ P NLL+I EE G P I + T IC F+S
Sbjct: 673 ----------GHPSQSVYHIPRAFLKPKGNLLIIFEEELGKPGGILIQTVRRDDICVFIS 722
Query: 717 EADPPPVDSWKPNLGVV-----SSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
E +P + +W+ + G + +S + L C I + FAS+G PEG CG+F G
Sbjct: 723 EHNPAQIKTWESDGGQIKLIAEDTSTRGTLNCPPKRTIQEVVFASFGNPEGACGNFTAGT 782
Query: 772 CHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVE 816
CH D IV+K C+G+ C +PV + G CP LAV+
Sbjct: 783 CHTPDAKAIVEKECLGKESCVLPVVNTVYGADIN-CPATTATLAVQ 827
>gi|357437611|ref|XP_003589081.1| Beta-galactosidase [Medicago truncatula]
gi|355478129|gb|AES59332.1| Beta-galactosidase [Medicago truncatula]
Length = 589
Score = 604 bits (1557), Expect = e-170, Method: Compositional matrix adjust.
Identities = 308/599 (51%), Positives = 396/599 (66%), Gaps = 17/599 (2%)
Query: 116 IQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGEL 175
+ FRT N PFK M++F KI+ +MK E+LF +QGGPII++Q+ENEYG VEW G G+
Sbjct: 1 MAFRTDNEPFKAAMQKFTTKIVTMMKAESLFQTQGGPIIMSQIENEYGPVEWEIGAPGKA 60
Query: 176 YVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGW 235
Y KWAA AV L+T VPW MC+QEDAPDP+I+TCNG+YC+ FTPN KP MWTEN+SGW
Sbjct: 61 YTKWAAQMAVGLDTGVPWDMCKQEDAPDPVIDTCNGYYCENFTPNENFKPKMWTENWSGW 120
Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAP 295
+ FG A+ RP EDLA++VA F + G+F NYYMY GGTNFGRT+ G +ATSYDYDAP
Sbjct: 121 YTDFGGAISHRPTEDLAYSVATFIQNRGSFVNYYMYHGGTNFGRTSSGLFIATSYDYDAP 180
Query: 296 IDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAK-LEAHIYHKSSNDCAAFL 354
IDEYG +PKW HL+ LHKAIK CE LIS DPT LG K LEAH+Y+ +++ CAAFL
Sbjct: 181 IDEYGLPNEPKWSHLKNLHKAIKQCEPALISVDPTVTWLGNKNLEAHVYYVNTSICAAFL 240
Query: 355 ANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNE 414
ANYD+ S A VTF Y LP WSVSILPDCK VVFNTA V NG H F ++ E
Sbjct: 241 ANYDTKSAATVTFGNGQYDLPPWSVSILPDCKTVVFNTATV-----NG-HSFHKRMTPVE 294
Query: 415 LLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----G 469
++S EE S + S + L EQIN T+D+SDYLWY +++ P + G
Sbjct: 295 TTFDWQSYS--EEPAYSSDDDSIIANALWEQINVTRDSSDYLWYLTDVNISPSESFIKNG 352
Query: 470 KEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQ 529
+ L I S GH VFVN +L YG D ++ + L G N + +LS+ VGL
Sbjct: 353 QFPTLTINSAGHVLHVFVNGQLSGTVYGGLDNPKVTFSESVNLKVGNNKISLLSVAVGLP 412
Query: 530 NYGAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQ 588
N G F+ G+ V L L G RDLS +W Y+VG++GE + L I+ ++S W Q
Sbjct: 413 NVGLHFETWNVGVLGPVRLKGLDEGTRDLSWQKWSYKVGLKGESLSLHTITGSSSIDWTQ 472
Query: 589 GSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
GS+L + L WYKTTF AP G P+AL+++SMGKG+ W+N QSIGR+W AY+A G
Sbjct: 473 GSSLAKKQPLTWYKTTFDAPSGNDPVALDMSSMGKGEIWINDQSIGRHWPAYIA--HGNC 530
Query: 649 KKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
+C+Y G++ KC+ +CG+P Q YHIPR+W+ N+LV+ EE GGDP+ ISL+ +T
Sbjct: 531 DECNYAGTFTNPKCRTNCGEPTQKWYHIPRSWLSSSGNVLVVLEEWGGDPTGISLVKRT 589
>gi|414888322|tpg|DAA64336.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
Length = 822
Score = 601 bits (1550), Expect = e-169, Method: Compositional matrix adjust.
Identities = 321/827 (38%), Positives = 461/827 (55%), Gaps = 59/827 (7%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD R+L+IDGKR + SG+IHYPRS PEVWP+LI ++KEGGL IETY+FWN HEP
Sbjct: 36 VTYDGRSLMIDGKRDLFFSGAIHYPRSPPEVWPKLIERAKEGGLNTIETYIFWNAHEPEP 95
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G+Y FEGRFDL++++K +QE ++ +RIGP+ AEWN+GG P WL I I FR N+P
Sbjct: 96 GKYNFEGRFDLIKYLKMIQEHDMYAIVRIGPFIQAEWNHGGLPYWLREIDHIIFRANNDP 155
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
+K+EM++F+ I+ +K LFASQGGPIIL Q+ENEYGN++ + G+ Y++WAA A
Sbjct: 156 YKKEMEKFVRFIVQKLKDAELFASQGGPIILTQIENEYGNIKKDHATDGDKYLEWAAQMA 215
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
++ T VPW+MC+Q AP +I TCNG +C D +T +KP++WTEN++ F ++G V
Sbjct: 216 LSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWTLRDKNKPMLWTENWTQQFRAYGDQV 275
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
R ED+A+AV RFF GG+ NYYMY GGTNFGRT G V T Y +AP+DEYG +
Sbjct: 276 AMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMYK 334
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYH-KSSNDCAAFLANYDSSSD 362
+PK+GHLR+LH I+ ++ + + + LG EAHI+ N C +FL+N ++ D
Sbjct: 335 EPKFGHLRDLHNVIRSYQKAFLLGKHSSEILGHGYEAHIFELPEENLCLSFLSNNNTGED 394
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
V F G +++P+ SVSIL CKNVV+NT +V Q N + + +E+ ++ +
Sbjct: 395 GTVIFRGEKHYVPSRSVSILAGCKNVVYNTKRVFVQHNERSY------HTSEVTSKNNQW 448
Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGK-EVFLNIE 477
Y EK+ + + EQ N TKD SDYLWYT S + +P + L ++
Sbjct: 449 EMYSEKIPKYRDTKVRMKEPLEQFNQTKDASDYLWYTTSFRLESDDLPFRNDIRPVLQVK 508
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
S H+ + F N V G+ F+ K ++L G+N + +LS +G+++ G
Sbjct: 509 SSAHSMMGFANDAFVGCARGSKQVKGFMFEKPVDLKVGVNHVVLLSSTMGMKDSGGELAE 568
Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
+G+ ++ L G DL W ++ +EGE + WK ++
Sbjct: 569 VKSGIQECLIQGLNTGTLDLQVNGWGHKAALEGEDKEIYSEKGVGKVQWKPAEN---GRA 625
Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
WYK F P+G P+ L+++SM KG +VNG+ +GRYW +Y
Sbjct: 626 ATWYKRYFDEPDGDDPVVLDMSSMDKGMIFVNGEGVGRYWVSY----------------- 668
Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
+ G P+Q LYHIPR ++ +NLLV+ EE G P I + T T IC F+SE
Sbjct: 669 -----RTLAGTPSQALYHIPRPFLKSKDNLLVVFEEEMGKPDGILVQTVTRDDICLFISE 723
Query: 718 ADPPPVDSW-----KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
+P + +W K L S + L C I + FAS+G PEG CG+F
Sbjct: 724 HNPGQIKTWDTDGDKIKLIAEDHSRRGTLMCPPEKTIQEVVFASFGNPEGMCGNF----- 778
Query: 773 HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
C+G+ C +PV G C L V+ C
Sbjct: 779 ---------TECLGKPSCMLPVDHTVYGADIN-CQSTTATLGVQVRC 815
>gi|449451942|ref|XP_004143719.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 613
Score = 598 bits (1543), Expect = e-168, Method: Compositional matrix adjust.
Identities = 298/617 (48%), Positives = 403/617 (65%), Gaps = 11/617 (1%)
Query: 35 VWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIG 94
+WP+LI+K+K+GGL+ IETY+FW+ HEP R +Y F GR D ++F + +Q+AGL++ +RIG
Sbjct: 1 MWPDLIQKAKDGGLDAIETYIFWDRHEPQRRKYDFSGRLDFIKFFQLIQDAGLYVVMRIG 60
Query: 95 PYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPII 154
PY CAEWNYGGFPVWLH +PGIQ RT N +K EM+ F KI+++ KQ NLFASQGGPII
Sbjct: 61 PYVCAEWNYGGFPVWLHNMPGIQLRTNNQVYKNEMQTFTTKIVNMCKQANLFASQGGPII 120
Query: 155 LAQVENEYGNVEW-AYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFY 213
LAQ+ENEYGNV AYG G+ Y+ W A A +LN VPW+MCQQ DAP P+INTCNGFY
Sbjct: 121 LAQIENEYGNVMTPAYGDAGKAYINWCAQMAESLNIGVPWIMCQQSDAPQPMINTCNGFY 180
Query: 214 CDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFG 273
CD FTPN+P P M+TEN+ GWF +G P+R ED+AF+VARFF++GG F NYYMY G
Sbjct: 181 CDNFTPNNPKSPKMFTENWVGWFKKWGDKDPYRTAEDVAFSVARFFQSGGVFNNYYMYHG 240
Query: 274 GTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQK 333
GTNFGRT+GGP + TSYDY+AP+DEYG + QPKWGHL++LH +IKL E+ L +S ++Q
Sbjct: 241 GTNFGRTSGGPFITTSYDYNAPLDEYGNLNQPKWGHLKQLHASIKLGEKILTNSTRSNQN 300
Query: 334 LGAKLE-AHIYHKSSNDCAAFLANYDSSSDANVTFNGN-VYFLPAWSVSILPDCKNVVFN 391
G+ + + ++ + FL+N D +DA + + YF+PAWSVSIL C V+N
Sbjct: 301 FGSSVTLTKFSNPTTGERFCFLSNTDGKNDATIDLQEDGKYFVPAWSVSILDGCNKEVYN 360
Query: 392 TAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKD 451
TAKV SQ + F +++N E S A++ K + GN F L EQ T D
Sbjct: 361 TAKVNSQTS----MFVKEQNEKENAQLSWAWAPEPMKDTLQGNGKFAANLLLEQKRVTVD 416
Query: 452 TSDYLWYTASIHVMPGQG-KEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKI 510
SDY WY + + V L + + GH FVNK+ + +G++ +F+ K I
Sbjct: 417 FSDYFWYMTKVDTNGTSSLQNVTLQVNTKGHVLHAFVNKRYIGSKWGSNG-QSFVFEKPI 475
Query: 511 ELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGV 568
L GINT+ +LS VGL+NY A++D+ G+ + LI N DLSS W Y+VG+
Sbjct: 476 LLKSGINTITLLSATVGLKNYDAFYDMVPTGIDGGPIYLIGDGNVTTDLSSNLWSYKVGL 535
Query: 569 EGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWV 628
GE + + + W + + + + WYKT+F P G P+ L++ MGKGQAWV
Sbjct: 536 NGEMKQIYNPVFSQRTNWIPLNQKSIGRRMTWYKTSFKTPAGIDPVVLDMQGMGKGQAWV 595
Query: 629 NGQSIGRYWSAYLAPST 645
NGQSIGR+W +++ T
Sbjct: 596 NGQSIGRFWPSFIXKFT 612
>gi|22329242|ref|NP_195571.2| beta-galactosidase 14 [Arabidopsis thaliana]
gi|332661551|gb|AEE86951.1| beta-galactosidase 14 [Arabidopsis thaliana]
Length = 988
Score = 594 bits (1532), Expect = e-167, Method: Compositional matrix adjust.
Identities = 327/803 (40%), Positives = 461/803 (57%), Gaps = 54/803 (6%)
Query: 35 VWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIG 94
+WP +I K++ GGL I+TYVFWN HEP +G+Y F+GRFDLV+F+K + E GL++ LR+G
Sbjct: 1 MWPSIIDKARIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHEKGLYVTLRLG 60
Query: 95 PYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPII 154
P+ AEWN+GG P WL +P + FRT N PFKE +R++ KI+ +MK+E LFASQGGPII
Sbjct: 61 PFIQAEWNHGGLPYWLREVPDVYFRTNNEPFKEHTERYVRKILGMMKEEKLFASQGGPII 120
Query: 155 LAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC 214
L Q+ENEY V+ AY GE Y+KWAA+ ++N +PWVMC+Q DAP +IN CNG +C
Sbjct: 121 LGQIENEYNAVQLAYKENGEKYIKWAANLVESMNLGIPWVMCKQNDAPGNLINACNGRHC 180
Query: 215 -DGFT-PNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYF 272
D F PN KP +WTEN++ F FG R VED+AF+VAR+F G+ NYYMY
Sbjct: 181 GDTFPGPNRHDKPSLWTENWTTQFRVFGDPPTQRTVEDIAFSVARYFSKNGSHVNYYMYH 240
Query: 273 GGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQ 332
GGTNFGRT+ V T Y DAP+DE+G + PK+GHL+ +H+A++LC++ L Q
Sbjct: 241 GGTNFGRTSAH-FVTTRYYDDAPLDEFGLEKAPKYGHLKHVHRALRLCKKALFWGQLRAQ 299
Query: 333 KLGAKLEAHIYHKSSND-CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFN 391
LG E Y + CAAFL+N ++ + F G Y LP+ S+SILPDCK VV+N
Sbjct: 300 TLGPDTEVRYYEQPGTKVCAAFLSNNNTRDTNTIKFKGQDYVLPSRSISILPDCKTVVYN 359
Query: 392 TAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKD 451
TA++++Q + D F + + ++ L F + E + + + P E TKD
Sbjct: 360 TAQIVAQHSWRD--FVKSEKTSKGL----KFEMFSENIPSLLDGDSLIP--GELYYLTKD 411
Query: 452 TSDYLWYTASIHV----MPGQ-GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLI 506
+DY WYT S+ + P Q G + L + SLGHA +V+VN + +G H+ +F
Sbjct: 412 KTDYAWYTTSVKIDEDDFPDQKGLKTILRVASLGHALIVYVNGEYAGKAHGRHEMKSFEF 471
Query: 507 NKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLS-SGEWIYQ 565
K + G N + IL ++ GL + G++ + AG ++ +I LK+G RDL+ + EW +
Sbjct: 472 AKPVNFKTGDNRISILGVLTGLPDSGSYMEHRFAGPRAISIIGLKSGTRDLTENNEWGHL 531
Query: 566 VGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQ 625
G+EGE + + W++ K L WYKT F PEG +A+ + +MGKG
Sbjct: 532 AGLEGEKKEVYTEEGSKKVKWEKDGK---RKPLTWYKTYFETPEGVNAVAIRMKAMGKGL 588
Query: 626 AWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWV--HP 683
WVNG +GRYW ++L+P G+P QT YHIPR+++
Sbjct: 589 IWVNGIGVGRYWMSFLSP----------------------LGEPTQTEYHIPRSFMKGEK 626
Query: 684 GENLLVI-HEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWK-PNLGVVSSSPQVRL 741
+N+LVI EE G I + ICS V E P V SWK +VS S +RL
Sbjct: 627 KKNMLVILEEEPGVKLESIDFVLVNRDTICSNVGEDYPVSVKSWKREGPKIVSRSKDMRL 686
Query: 742 A----CERGWHIAAINFASYGIPEGNCGSFRPGACHMD-VLPIVQKACVGQIECSIPVSS 796
C + + FAS+G P G CG+F G C +V+K C+G+ CSI V+
Sbjct: 687 KAVMRCPPEKQMVEVQFASFGDPTGTCGNFTMGKCSASKSKEVVEKECLGRNYCSIVVAR 746
Query: 797 AYLGVSAGACPGLLKALAVEAHC 819
G CP ++K LAV+ C
Sbjct: 747 ETFGDK--GCPEIVKTLAVQVKC 767
>gi|57283676|emb|CAG30724.1| putative beta-galactosidase precursor [Hordeum vulgare]
Length = 833
Score = 593 bits (1529), Expect = e-166, Method: Compositional matrix adjust.
Identities = 324/825 (39%), Positives = 460/825 (55%), Gaps = 44/825 (5%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V+YD R+L+IDGKR + SG+IHYPRS P++W +L++ +K+GGL IETYVFWN HEP
Sbjct: 35 VSYDERSLLIDGKRDLFFSGAIHYPRSPPDMWHKLLKTAKDGGLNTIETYVFWNAHEPEP 94
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G+Y FEGR DL++F+K +Q ++ +RIGP+ AEWN+GG P WL IP I FR N P
Sbjct: 95 GKYNFEGRNDLIKFLKLIQSHDMYALVRIGPFIQAEWNHGGLPYWLREIPHIIFRANNEP 154
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
+K+EM++F+ I+ +K +FASQGGP+ILAQ+ENEYGN++ + V G+ Y++WAA A
Sbjct: 155 YKKEMEKFVRFIVQKLKDAEMFASQGGPVILAQIENEYGNIKKDHIVEGDKYLEWAAQMA 214
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
++ NT VPW+MC+Q AP +I TCNG +C D +T +KP +WTEN++ F +FG +
Sbjct: 215 ISTNTGVPWIMCKQSTAPGEVIPTCNGRHCGDTWTLKDKNKPRLWTENWTAQFRAFGDQL 274
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYM-YFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
R ED+A++V RFF GGT NYYM Y+GGTNFGRT G V T Y + P+DE
Sbjct: 275 ALRSAEDIAYSVLRFFAKGGTLVNYYMQYYGGTNFGRT-GASYVLTGYYDEGPVDEC-MP 332
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSS 361
+ PK+GHLR+LH IK + + + L EAH + C AF++N ++
Sbjct: 333 KAPKYGHLRDLHNLIKSYSRAFLEGKQSFELLAHGYEAHNFEIPEEKLCLAFISNNNTGE 392
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
D V F G+ Y++P+ SVSIL DCK+VV+NT +V Q + AQ+ L S+A
Sbjct: 393 DGTVNFRGDKYYIPSRSVSILADCKHVVYNTKRVFVQHSERSFHTAQK------LAKSNA 446
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGK-EVFLNIESLG 480
+ Y E + S + EQ N TKD SDYL + +P +G + ++S
Sbjct: 447 WEMYSEPIPRYKLTSIRNKEPMEQYNLTKDDSDYLCFRLEADDLPFRGDIRPVVQVKSTS 506
Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
HA + FVN G G+ F+ I L GIN L +LS +G+++ G
Sbjct: 507 HALMGFVNDAFAGNGRGSKKEKGFMFETPINLRIGINHLALLSSSMGMKDSGGELVEVKG 566
Query: 541 GLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIW 600
G+ + L G DL W ++V +EGE + + W +T +++ W
Sbjct: 567 GIQDCTIQGLNTGTLDLQVNGWGHKVKLEGEVKEIYTEKGMGAVKWVPATT---GRAVTW 623
Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
YK F P+G+ P+ L++ SMGKG +VNG+ +GRYW +Y
Sbjct: 624 YKRYFDEPDGEDPVVLDMTSMGKGMIFVNGEGMGRYWPSYRTVG---------------- 667
Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADP 720
G P+Q +YHIPR ++ P NLLVI EE G P I + T IC F+SE +P
Sbjct: 668 ------GVPSQAMYHIPRPFLKPKNNLLVIFEEELGKPEGILIQTVRRDDICVFISEHNP 721
Query: 721 PPVDSWKPNLGVV-----SSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM- 774
+ +W + G + S + L C I + FAS+G PEG+C +F G CH
Sbjct: 722 AQIKTWDKDGGQIKLIAEDHSTRGILKCPPKKTIQEVVFASFGNPEGSCANFTAGTCHTP 781
Query: 775 DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
+ IV K C+G+ C +PV G CP LAV+ C
Sbjct: 782 NAKDIVAKECLGKKSCVLPVLHTVYGADIN-CPTTTATLAVQVRC 825
>gi|147843186|emb|CAN82672.1| hypothetical protein VITISV_014349 [Vitis vinifera]
Length = 710
Score = 592 bits (1525), Expect = e-166, Method: Compositional matrix adjust.
Identities = 326/713 (45%), Positives = 422/713 (59%), Gaps = 76/713 (10%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A VTYD R+L+IDG R++L SGSIHYPRSTP++W LI K+KEGG++VI+TYVFWN HEP
Sbjct: 24 AQVTYDGRSLIIDGHRKILFSGSIHYPRSTPQMWASLIAKAKEGGVDVIQTYVFWNRHEP 83
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
GQY F GR+DL +F+K +Q GL+ LRIGP+ +EW+YGG P WLH + GI +RT N
Sbjct: 84 QPGQYDFNGRYDLXKFIKEIQAQGLYACLRIGPFIESEWSYGGLPFWLHDVHGIVYRTDN 143
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M+ F KI++LMK E L+ASQGGPIIL+Q+ENEY N+E A+ G YV+WAA
Sbjct: 144 EPFKFYMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYQNIEAAFNEKGPSYVRWAAK 203
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFG 240
AV L T VPWVMC+Q DAPDP+INTCNG C FT PNSP+KP MWTEN++ ++ FG
Sbjct: 204 MAVELQTGVPWVMCKQSDAPDPVINTCNGMRCGQTFTGPNSPNKPSMWTENWTSFYEVFG 263
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
R ED+AF VA F G++ NYYM
Sbjct: 264 GETYLRSAEDIAFHVALFIARNGSYVNYYM----------------------------VS 295
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
IRQPKWGHL+ELH AI LC L++ ++ LG EA+++ + C AFL N D
Sbjct: 296 LIRQPKWGHLKELHAAITLCSTPLLNGVQSNISLGQLQEAYVFQEEMGGCVAFLVNNDEG 355
Query: 361 SDANVTF-NGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
+++ V F N ++ LP S+SILPDCKNV+FNTAK+ + N + +S
Sbjct: 356 NNSTVLFQNVSIELLPK-SISILPDCKNVIFNTAKINTGYN------------ERITTSS 402
Query: 420 SAFS----WYEEKVGISG--NRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQG-KEV 472
+F W E K I + S + E +N TKD SDYLWYT P E
Sbjct: 403 QSFDAVDRWEEYKDAIPNFLDTSLKSNMILEHMNMTKDESDYLWYT--FRFQPNSSCTEP 460
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L+IESL HA FVN V +G+HD F I LN +N + ILS+MVG + G
Sbjct: 461 LLHIESLAHAVHAFVNNIYVGATHGSHDMKGFTFKSPISLNNEMNNISILSVMVGFPDSG 520
Query: 533 AWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
A+ + AGL V + + G D ++ W YQVG+ GE + + K ++ W++ + +
Sbjct: 521 AYLESRFAGLTRVEIQCTEKGIYDFANYTWGYQVGLSGEKLHIYKEENLSNVEWRK-TEI 579
Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
N+ L WYK F P G P+ALNL++MGKG+AWVNGQSIGRYW
Sbjct: 580 STNQPLTWYKIVFNTPSGDDPVALNLSTMGKGEAWVNGQSIGRYWV-------------- 625
Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
S+ SK G P+QTLYH+PR ++ ENLLV+ EE GDP ISL T
Sbjct: 626 ---SFHNSK-----GDPSQTLYHVPRAFLKTSENLLVLLEEANGDPLHISLET 670
>gi|357520325|ref|XP_003630451.1| Beta-galactosidase [Medicago truncatula]
gi|355524473|gb|AET04927.1| Beta-galactosidase [Medicago truncatula]
Length = 706
Score = 592 bits (1525), Expect = e-166, Method: Compositional matrix adjust.
Identities = 329/720 (45%), Positives = 431/720 (59%), Gaps = 59/720 (8%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
ANVTYD +LVI+G ++L SGSIHYPRSTP++WP+LI K+KEGGL+VI+TYVFWN HEP
Sbjct: 24 ANVTYDRTSLVINGHHKILFSGSIHYPRSTPQMWPDLISKAKEGGLDVIQTYVFWNLHEP 83
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
+GQY F GRFDLV F+K +Q GL++ LRIGPY +E YGG P+WLH +PGI FRT N
Sbjct: 84 QQGQYEFNGRFDLVGFIKEIQAQGLYVTLRIGPYIESECTYGGLPLWLHDVPGIVFRTDN 143
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
+ FK M+RF KI+++MK NLFASQGGPIIL+Q+ENEYG+++ + G Y+ WAA
Sbjct: 144 DQFKFHMQRFTTKIVNMMKSANLFASQGGPIILSQIENEYGSIQSKFRANGLPYIHWAAQ 203
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC--DGFTPNSPSKPIMWTENYSGWFLSFG 240
AV L T VPW+MC+Q+DAPDP+IN CNG C + PNSP+KP +WTEN++ + +FG
Sbjct: 204 MAVGLQTGVPWMMCKQDDAPDPVINACNGMQCGRNFKGPNSPNKPSLWTENWTSFLQAFG 263
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
A R D+A+ VA F G++ NYYMY GGTNF R A ++ YD +AP+DEYG
Sbjct: 264 GAPYMRSASDIAYNVALFIAKKGSYVNYYMYHGGTNFDRLASAFIITAYYD-EAPLDEYG 322
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLAN---- 356
+RQPKWGHL+ELH +IK C + L+ T LG++ + I ++SS + +
Sbjct: 323 LVRQPKWGHLKELHASIKSCSQPLLDGTQTTFSLGSEQQV-IKNESSWTYFPLMFSEVPQ 381
Query: 357 -------YDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQ 409
D + F Y LP S+SILP CKNVVFNT KV Q N
Sbjct: 382 NVLLSWKISGPRDVTIQFQNISYELPGKSISILPGCKNVVFNTGKVSIQNN--------V 433
Query: 410 KNVNELLLASSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPG 467
+ + L +SA +W Y E + + S L +QI+T KDTSDY+WYT +
Sbjct: 434 RAMKPRLQFNSAENWKVYTEAIPNFAHTSKRADTLLDQISTAKDTSDYMWYTFRFNNKSP 493
Query: 468 QGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVG 527
K V L+I S G F+N L +G+ + + K + L G+N + ILS VG
Sbjct: 494 NAKSV-LSIYSQGDVLHSFINGVLTGSAHGSRNNTQVTMKKNVNLINGMNNISILSATVG 552
Query: 528 LQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK 587
L N GA+ + AGL V + RD SS W YQVG+ GE + + +S ++ WK
Sbjct: 553 LPNSGAFLESRVAGLRKVEV-----QGRDFSSYSWGYQVGLLGEKLQIFTVSGSSKVQWK 607
Query: 588 --QGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPST 645
Q ST P L WY+TTF AP G P+ +NL SMGKG AWVNGQ IGRYW ++ P
Sbjct: 608 SFQSSTKP----LTWYQTTFHAPAGNDPVVVNLGSMGKGLAWVNGQGIGRYWVSFHKPD- 662
Query: 646 GCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
G P+Q YHIPR+++ NLLVI EE G+P I+L T
Sbjct: 663 ---------------------GTPSQQWYHIPRSFLKSTGNLLVILEEETGNPLGITLDT 701
>gi|242045426|ref|XP_002460584.1| hypothetical protein SORBIDRAFT_02g031260 [Sorghum bicolor]
gi|241923961|gb|EER97105.1| hypothetical protein SORBIDRAFT_02g031260 [Sorghum bicolor]
Length = 803
Score = 587 bits (1513), Expect = e-165, Method: Compositional matrix adjust.
Identities = 312/828 (37%), Positives = 460/828 (55%), Gaps = 80/828 (9%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD R+L+IDGKR + SG+IHYPRS PEVWP+L+ ++KEGGL IETY+FWN HEP
Sbjct: 36 VTYDARSLLIDGKRDLFFSGAIHYPRSPPEVWPKLLDRAKEGGLNTIETYIFWNAHEPEP 95
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G+Y FEGR DLV+F+K +QE G++ +RIGP+ AEWN+GG P WL I I FR N+P
Sbjct: 96 GKYNFEGRLDLVKFLKMIQEHGMYAIVRIGPFIQAEWNHGGLPYWLREIDHIIFRANNDP 155
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
+K+EM+++ ++ +K LFASQGGP+IL Q+ENEYGN++ + + G+ Y++WAA A
Sbjct: 156 YKKEMEKWTRFVVQKLKDAELFASQGGPVILTQIENEYGNIKKDHKIEGDKYLEWAAQMA 215
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
++ T VPW+MC+Q AP +I TCNG +C D +T +KP++WTEN++ F ++G +
Sbjct: 216 LSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWTLRDKNKPMLWTENWTQQFRAYGDQL 275
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
R ED+A+AV RFF GG+ NYYMY GGTNFGRT+ ++ YD +AP+DEYG +
Sbjct: 276 AMRSAEDIAYAVLRFFAKGGSMVNYYMYHGGTNFGRTSASYVLTGYYD-EAPLDEYGMYK 334
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYH-KSSNDCAAFLANYDSSSD 362
+PK+GHLR+LH I+ ++ +S + + LG EA I+ N C +FL+N ++ D
Sbjct: 335 EPKFGHLRDLHNVIRSYQKAFLSGKHSSEILGHGYEAQIFELPEENLCLSFLSNNNTGED 394
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
V F G +++P+ SVSIL CK+VV+NT +V Q + + + +E+ ++ +
Sbjct: 395 GTVIFRGVKHYVPSRSVSILAGCKDVVYNTKRVFVQHSERSY------HTSEVTSKNNQW 448
Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGK-EVFLNIE 477
Y E V + + EQ N TKD SDYLWYT S + +P +G L ++
Sbjct: 449 EMYSEMVPKYKDTKIRTKEPLEQYNQTKDASDYLWYTTSFRLESDDLPFRGDIRPVLQVK 508
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
S H+ + F N V GN F+ K ++L G+N + +LS +G+++ G
Sbjct: 509 SSAHSMIGFANDAFVGSARGNKQVKGFMFEKPVDLKAGVNHVVLLSSTMGMKDSGGELAE 568
Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
G+ ++ L G DL W
Sbjct: 569 VKGGIQECLIQGLNTGTLDLQVNGW----------------------------------- 593
Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
+K F P+G P+ L+++SM KG +VNG+ IGRYW ++
Sbjct: 594 --GHKRYFDEPDGDDPIVLDMSSMSKGMIFVNGEGIGRYWVSF----------------- 634
Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
+ G P+Q +YHIPR ++ P +NLLV+ EE G P I + T T IC +SE
Sbjct: 635 -----RTLAGTPSQAVYHIPRPFLKPKDNLLVVFEEEMGKPDGILVQTVTRDDICLLISE 689
Query: 718 ADPPPVDSWKPN---LGVVSSSPQVR--LACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
+P + +W + + +++ VR L C I + FAS+G P+G CG+F G C
Sbjct: 690 HNPGQIKTWDTDGVKIKLIAEDHSVRGTLMCPPEKIIQEVVFASFGNPDGMCGNFTVGTC 749
Query: 773 HM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
H + IV+K C+G+ C +PV G C L V+ C
Sbjct: 750 HTPNAKQIVEKECLGKPSCMLPVDHTVYGADIN-CQSTTGTLGVQVRC 796
>gi|238481152|ref|NP_001154292.1| beta-galactosidase 14 [Arabidopsis thaliana]
gi|332661552|gb|AEE86952.1| beta-galactosidase 14 [Arabidopsis thaliana]
Length = 1052
Score = 585 bits (1507), Expect = e-164, Method: Compositional matrix adjust.
Identities = 324/803 (40%), Positives = 456/803 (56%), Gaps = 50/803 (6%)
Query: 31 STPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLH 90
S +WP +I K++ GGL I+TYVFWN HEP +G+Y F+GRFDLV+F+K + E GL++
Sbjct: 65 SRKHMWPSIIDKARIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHEKGLYVT 124
Query: 91 LRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQG 150
LR+GP+ AEWN+GG P WL +P + FRT N PFKE +R++ KI+ +MK+E LFASQG
Sbjct: 125 LRLGPFIQAEWNHGGLPYWLREVPDVYFRTNNEPFKEHTERYVRKILGMMKEEKLFASQG 184
Query: 151 GPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCN 210
GPIIL Q+ENEY V+ AY GE Y+KWAA+ ++N +PWVMC+Q DAP +IN CN
Sbjct: 185 GPIILGQIENEYNAVQLAYKENGEKYIKWAANLVESMNLGIPWVMCKQNDAPGNLINACN 244
Query: 211 GFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNY 268
G +C D F PN KP +WTEN++ F FG R VED+AF+VAR+F G+ NY
Sbjct: 245 GRHCGDTFPGPNRHDKPSLWTENWTTQFRVFGDPPTQRTVEDIAFSVARYFSKNGSHVNY 304
Query: 269 YMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSD 328
YMY GGTNFGRT+ V T Y DAP+DE+G + PK+GHL+ +H+A++LC++ L
Sbjct: 305 YMYHGGTNFGRTSAH-FVTTRYYDDAPLDEFGLEKAPKYGHLKHVHRALRLCKKALFWGQ 363
Query: 329 PTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKN 387
Q LG E Y + CAAFL+N ++ + F G Y LP+ S+SILPDCK
Sbjct: 364 LRAQTLGPDTEVRYYEQPGTKVCAAFLSNNNTRDTNTIKFKGQDYVLPSRSISILPDCKT 423
Query: 388 VVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQIN 447
VV+NTA++++Q + D F + + ++ L F + E + + + P E
Sbjct: 424 VVYNTAQIVAQHSWRD--FVKSEKTSKGL----KFEMFSENIPSLLDGDSLIP--GELYY 475
Query: 448 TTKDTSDYLWYTASIHVMPGQ-GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLI 506
TKD +DY P Q G + L + SLGHA +V+VN + +G H+ +F
Sbjct: 476 LTKDKTDYACVKIDEDDFPDQKGLKTILRVASLGHALIVYVNGEYAGKAHGRHEMKSFEF 535
Query: 507 NKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLS-SGEWIYQ 565
K + G N + IL ++ GL + G++ + AG ++ +I LK+G RDL+ + EW +
Sbjct: 536 AKPVNFKTGDNRISILGVLTGLPDSGSYMEHRFAGPRAISIIGLKSGTRDLTENNEWGHL 595
Query: 566 VGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQ 625
G+EGE + + W++ K L WYKT F PEG +A+ + +MGKG
Sbjct: 596 AGLEGEKKEVYTEEGSKKVKWEKDGK---RKPLTWYKTYFETPEGVNAVAIRMKAMGKGL 652
Query: 626 AWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWV--HP 683
WVNG +GRYW ++L+P G+P QT YHIPR+++
Sbjct: 653 IWVNGIGVGRYWMSFLSP----------------------LGEPTQTEYHIPRSFMKGEK 690
Query: 684 GENLLVI-HEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWK-PNLGVVSSSPQVRL 741
+N+LVI EE G I + ICS V E P V SWK +VS S +RL
Sbjct: 691 KKNMLVILEEEPGVKLESIDFVLVNRDTICSNVGEDYPVSVKSWKREGPKIVSRSKDMRL 750
Query: 742 A----CERGWHIAAINFASYGIPEGNCGSFRPGACHMD-VLPIVQKACVGQIECSIPVSS 796
C + + FAS+G P G CG+F G C +V+K C+G+ CSI V+
Sbjct: 751 KAVMRCPPEKQMVEVQFASFGDPTGTCGNFTMGKCSASKSKEVVEKECLGRNYCSIVVAR 810
Query: 797 AYLGVSAGACPGLLKALAVEAHC 819
G CP ++K LAV+ C
Sbjct: 811 ETFGDK--GCPEIVKTLAVQVKC 831
>gi|222616997|gb|EEE53129.1| hypothetical protein OsJ_35927 [Oryza sativa Japonica Group]
Length = 740
Score = 583 bits (1503), Expect = e-163, Method: Compositional matrix adjust.
Identities = 316/669 (47%), Positives = 410/669 (61%), Gaps = 78/669 (11%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NVTYDHRA++I GKRR+L S +HYPR+TPE+WP LI K KEGG +VIETYVFWN HEP
Sbjct: 63 NVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGADVIETYVFWNGHEPA 122
Query: 64 RGQYYFEGRFDLVRFVK----------------TVQEAG-------------------LF 88
+GQYYFE RFDLV+F K +E G +
Sbjct: 123 KGQYYFEERFDLVKFAKIDLVKFAKLMWPSLIAKCKEGGADVIETYVFWNGHEPAKGQYY 182
Query: 89 LHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFAS 148
R P + GFPVWL IPGI+FRT N PFK EM+ F+ KI+ LMK+E L++
Sbjct: 183 FEERFDPVKFEKHVIFGFPVWLRDIPGIEFRTDNEPFKAEMQTFVTKIVTLMKEEKLYSW 242
Query: 149 QGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINT 208
QGGPIIL Q+ENEYGN++ YG G+ Y++WAA A+ L+T +PWVMC+Q DAP+ II+T
Sbjct: 243 QGGPIILQQIENEYGNIQGNYGQAGKRYMQWAAQMAIGLDTGIPWVMCRQTDAPEEIIDT 302
Query: 209 CNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNY 268
CN FYCDGF PNS +KP +WTE++ GW+ +G A+P RP ED AFAVARF++ GG+ QNY
Sbjct: 303 CNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGGALPHRPAEDSAFAVARFYQRGGSLQNY 362
Query: 269 YMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSD 328
YMYFGGTNF RTAGGPL TSYDYDAPIDEYG +RQPKWGHL++LH AIKLCE LI+ D
Sbjct: 363 YMYFGGTNFARTAGGPLQITSYDYDAPIDEYGILRQPKWGHLKDLHTAIKLCEPALIAVD 422
Query: 329 --PTHQKLGAKLEAHIYHK-----------SSNDCAAFLANYDSSSDANVTFNGNVYFLP 375
P + KLG+ EAH+Y ++ C+AFLAN D A+V G Y LP
Sbjct: 423 GSPQYIKLGSMQEAHVYSTGEVHTNGSMAGNAQICSAFLANIDEHKYASVWIFGKSYSLP 482
Query: 376 AWSVSILPDCKNVVFNTAKVISQRN----NGDHPFAQQKNVNELLLASS-----AFSWY- 425
WSVSILPDC+NV FNTA++ +Q + P ++ +L +S + +W+
Sbjct: 483 PWSVSILPDCENVAFNTARIGAQTSVFTVESGSPSRSSRHKPSILSLTSGGPYLSSTWWT 542
Query: 426 -EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPG-------QGKEVFLNIE 477
+E +G G +F + E +N TKD SDYLWYT +++ +G L I+
Sbjct: 543 SKETIGTWGGNNFAVQGILEHLNVTKDISDYLWYTTRVNISDADVAFWSSKGVLPSLTID 602
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
+ A VFVN KL G+ + + I+L EG+N L +LS +VGLQNYGA+ +
Sbjct: 603 KIRDVARVFVNGKLAGSQVGHW----VSLKQPIQLVEGLNELTLLSEIVGLQNYGAFLEK 658
Query: 538 AGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGL---DKISLANSSFWKQGSTLP 593
GAG V L L +G DL++ W YQVG++GE+ + +K A S ++ S P
Sbjct: 659 DGAGFRGQVTLTGLSDGDVDLTNSLWTYQVGLKGEFSMIYAPEKQGCAGWSRMQKDSVQP 718
Query: 594 VNKSLIWYK 602
WYK
Sbjct: 719 ----FTWYK 723
>gi|222424922|dbj|BAH20412.1| AT3G13750 [Arabidopsis thaliana]
Length = 625
Score = 583 bits (1502), Expect = e-163, Method: Compositional matrix adjust.
Identities = 303/636 (47%), Positives = 396/636 (62%), Gaps = 20/636 (3%)
Query: 194 VMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAF 253
V+C+Q+DAPDPIIN CNGFYCD F+PN KP MWTE ++GWF FG VP+RP ED+AF
Sbjct: 1 VLCKQDDAPDPIINACNGFYCDYFSPNKAYKPKMWTEAWTGWFTKFGGPVPYRPAEDMAF 60
Query: 254 AVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLREL 313
+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG RQPKWGHL++L
Sbjct: 61 SVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLERQPKWGHLKDL 120
Query: 314 HKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYF 373
H+AIKLCE L+S +PT LG EAH+Y S C+AFLANY+ S A V+F N Y
Sbjct: 121 HRAIKLCEPALVSGEPTRMPLGNYQEAHVYKSKSGACSAFLANYNPKSYAKVSFGNNHYN 180
Query: 374 LPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISG 433
LP WS+SILPDCKN V+NTA+V +Q ++ K V + ++ Y E
Sbjct: 181 LPPWSISILPDCKNTVYNTARVGAQT-------SRMKMVRVPVHGGLSWQAYNEDPSTYI 233
Query: 434 NRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESLGHAALVFVN 488
+ SF L EQINTT+DTSDYLWY + V + G L + S GHA VF+N
Sbjct: 234 DESFTMVGLVEQINTTRDTSDYLWYMTDVKVDANEGFLRNGDLPTLTVLSAGHAMHVFIN 293
Query: 489 KKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VIL 547
+L YG+ D K + L G N + ILS+ VGL N G F+ AG+ V L
Sbjct: 294 GQLSGSAYGSLDSPKLTFRKGVNLRAGFNKIAILSIAVGLPNVGPHFETWNAGVLGPVSL 353
Query: 548 IDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLA 607
L G+RDLS +W Y+VG++GE + L +S ++S W +G+ + + L WYKTTF A
Sbjct: 354 NGLNGGRRDLSWQKWTYKVGLKGESLSLHSLSGSSSVEWAEGAFVAQKQPLTWYKTTFSA 413
Query: 608 PEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCG 667
P G PLA+++ SMGKGQ W+NGQS+GR+W AY A G +C Y G++ KC ++CG
Sbjct: 414 PAGDSPLAVDMGSMGKGQIWINGQSLGRHWPAYKA--VGSCSECSYTGTFREDKCLRNCG 471
Query: 668 QPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWK 727
+ +Q YH+PR+W+ P NLLV+ EE GGDP+ I+L+ + +C+ + E V+
Sbjct: 472 EASQRWYHVPRSWLKPSGNLLVVFEEWGGDPNGITLVRREVDSVCADIYEWQSTLVNYQL 531
Query: 728 PNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKAC 784
G V+ P+ L C G I + FAS+G PEG CGS+R G+CH K C
Sbjct: 532 HASGKVNKPLHPKAHLQCGPGQKITTVKFASFGTPEGTCGSYRQGSCHAHHSYDAFNKLC 591
Query: 785 VGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
VGQ CS+ V+ G CP ++K LAVEA C+
Sbjct: 592 VGQNWCSVTVAPEMFG--GDPCPNVMKKLAVEAVCA 625
>gi|57283683|emb|CAG30731.1| beta-galactosidase precursor [Triticum monococcum]
Length = 839
Score = 582 bits (1501), Expect = e-163, Method: Compositional matrix adjust.
Identities = 315/830 (37%), Positives = 458/830 (55%), Gaps = 46/830 (5%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
VTYD +L+IDG+R + SG+IHYPRS ++WP+L++ +KEGGL IETYVFWN HEP
Sbjct: 36 TTVTYDKYSLMIDGRRELFFSGAIHYPRSPTQMWPKLLKTAKEGGLNTIETYVFWNAHEP 95
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G++ FEGR D+++F+K +Q G++ +RIGP+ EWN+G P WL IP I FR N
Sbjct: 96 EPGKFNFEGRNDMIKFLKLIQSFGMYAIVRIGPFIQGEWNHGALPYWLREIPHIIFRANN 155
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
P+K EM++F+ I+ ++K ENLFASQGG +ILAQ+ENEYGN++ + G+ Y++WAA+
Sbjct: 156 EPYKREMEKFVRFIVQMLKDENLFASQGGNVILAQIENEYGNIKKDHITEGDKYLEWAAE 215
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFTPNSPSKPIMWTENYSGWFLSFGY 241
A++ N VPW+MC+Q AP +I TCNG +C D + +KP +WTEN++ F +FG
Sbjct: 216 MAISTNIGVPWIMCKQSTAPGVVIPTCNGRHCGDTWIMKDENKPHLWTENWTAQFRAFGN 275
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
+ R ED+A++V RFF GGT NYYMY+GGTNFGRT G V T Y + PIDEYG
Sbjct: 276 DLAQRSAEDIAYSVLRFFAKGGTLVNYYMYYGGTNFGRT-GASYVLTGYYDEGPIDEYGM 334
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSS 360
+ PK+GHLR+LH IK + + + LG EA + C AF++N ++
Sbjct: 335 PKAPKYGHLRDLHNVIKSYSRAFLEGKQSFELLGQGYEARNFEIPEEKLCLAFISNNNTG 394
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
D V F G+ Y++P+ SVSIL DCK+VV+NT +V Q + A++ N +
Sbjct: 395 EDGTVIFRGDKYYIPSRSVSILADCKHVVYNTKRVFVQHSERSFHKAEKATKN------N 448
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGK-EVFLN 475
+ + E + + + EQ N TKD SDYLWYT S + +P +G +
Sbjct: 449 VWEMFSELIPRYKQTTIRNKEPLEQYNQTKDQSDYLWYTTSFRLEADDLPIRGDIRPVIA 508
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
++S HA + FVN G+G+ F I L G+N L +LS +G+++ G
Sbjct: 509 VKSTAHAMVGFVNDAFAGNGHGSKKEKFFTFETPISLRLGVNHLALLSSSMGMKDSGGEL 568
Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
G+ + L G DL W ++ +EGE + + W +
Sbjct: 569 VELKGGIQDCTIQGLNTGTLDLQINGWGHKAKLEGEVKEIYTEKGMGAVKWVPAVS---G 625
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
+++ WYK F P+G P+ L++ SM KG +VNG+ +GRYW++Y P
Sbjct: 626 QAVTWYKRYFDEPDGDDPVVLDMTSMCKGMIFVNGEGMGRYWTSYKTPGKVA-------- 677
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
+Q +YHIPRT++ NLLV+ EE G P I + T IC F+
Sbjct: 678 --------------SQAVYHIPRTFLKSKNNLLVVFEEELGKPEGILIQTVRRDDICVFI 723
Query: 716 SEADPPPVDSWKPNLG---VVSSSPQVR--LACERGWHIAAINFASYGIPEGNCGSFRPG 770
SE +P + W + G +++ R L C I + FAS+G P G+C +F G
Sbjct: 724 SEHNPAQIKPWDEHGGQIKLIAEDHNTRGFLNCPPKKIIQEVVFASFGNPVGSCANFTVG 783
Query: 771 ACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
CH + IV+K C+G+ C +PV + G CP LAV+ C
Sbjct: 784 TCHTPNAKEIVEKECLGKKGCVLPVLHTFYGADIN-CPTTTATLAVQVRC 832
>gi|222618606|gb|EEE54738.1| hypothetical protein OsJ_02090 [Oryza sativa Japonica Group]
Length = 713
Score = 579 bits (1493), Expect = e-162, Method: Compositional matrix adjust.
Identities = 303/666 (45%), Positives = 400/666 (60%), Gaps = 54/666 (8%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
+V+YD R+LVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL+ IETY+FWN HEP
Sbjct: 29 TSVSYDDRSLVIDGQRRIILSGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYIFWNGHEP 88
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
R QY FEG +D+VRF K +Q AG++ LRIGPY C EWNYGG P WL IPG+QFR N
Sbjct: 89 HRRQYNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHN 148
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWA 180
PF+ EM+ F I++ MK +FA QGGPIILAQ+ENEYGN+ + Y+ W
Sbjct: 149 EPFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIHWC 208
Query: 181 ADTAVNLNTSVPWVMCQQ-EDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSF 239
AD A N VPW+MCQQ +D P ++NTCNGFYC + PN P +WTEN++GWF ++
Sbjct: 209 ADMANKQNVGVPWIMCQQDDDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAW 268
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
R ED+AFAVA FF+ G+ QNYYMY GGTNFGRT+GGP + TSYDYDAP+DEY
Sbjct: 269 DKPDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEY 328
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
G +RQPK+GHL+ELH +K E+ L+ + G + Y S+ A F+ N
Sbjct: 329 GNLRQPKYGHLKELHSVLKSMEKTLVHGEYFDTNYGDNITVTKYTLDSSS-ACFINNRFD 387
Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
D NVT +G + LPAWSVSILPDCK V FN+AK+ +Q + ++ N E S
Sbjct: 388 DKDVNVTLDGATHLLPAWSVSILPDCKTVAFNSAKIKTQTS----VMVKKPNTAEQEQES 443
Query: 420 SAFSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNI 476
+SW E + +F + +L EQI T+ D SDYLWY S++ G+G L +
Sbjct: 444 LKWSWMPENLSPFMTDEKGNFRKNELLEQIVTSTDQSDYLWYRTSLN-HKGEGS-YKLYV 501
Query: 477 ESLGHAALVFVNKKLVAFGY-GNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ GH FVN KL+ + + DF F + ++L++G N + +LS VGL+NYG F
Sbjct: 502 NTTGHELYAFVNGKLIGKNHSADGDFV-FQLESPVKLHDGKNYISLLSATVGLKNYGPSF 560
Query: 536 DVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
+ G+ V LID DLS+ W
Sbjct: 561 EKMPTGIVGGPVKLIDSNGTAIDLSNSSWS------------------------------ 590
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
YK TF AP G+ P+ ++L + KG AWVNG ++GRYW +Y A +CDY
Sbjct: 591 -------YKATFEAPSGEDPVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMAGCHRCDY 643
Query: 654 RGSYDA 659
RG++ A
Sbjct: 644 RGAFQA 649
>gi|326496501|dbj|BAJ94712.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 672
Score = 578 bits (1491), Expect = e-162, Method: Compositional matrix adjust.
Identities = 304/643 (47%), Positives = 415/643 (64%), Gaps = 17/643 (2%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD RALV++G RR+L SG +HY RSTPE+WP+LI +K+GGL+VI+TYVFWN HEP++
Sbjct: 40 VTYDGRALVVNGTRRMLFSGEMHYTRSTPEMWPKLIANAKKGGLDVIQTYVFWNVHEPVQ 99
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
GQY F+GR+DLV+F++ +Q GL++ LRIGP+ AEW YGGFP WLH +P I FRT N P
Sbjct: 100 GQYNFQGRYDLVKFIREIQTQGLYVSLRIGPFIEAEWKYGGFPFWLHDVPNITFRTDNEP 159
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK+ M+RF+ +I+++MK E L+ QGGPII++Q+ENEY VE A+G GG YV+WAA+ A
Sbjct: 160 FKQHMQRFVTQIVNMMKHEGLYYPQGGPIIISQIENEYQMVEPAFGSGGPRYVRWAAEMA 219
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
V L T VPW+MC+Q DAPDPIINTCNG C + F PNSP+KP +WTEN++ + +G
Sbjct: 220 VGLQTGVPWMMCKQNDAPDPIINTCNGLICGETFVGPNSPTKPALWTENWTTRYPIYGND 279
Query: 243 VPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
R ED+AFAVA F G+F +YYMY GGTNFGR A V TSY AP+DEYG
Sbjct: 280 TKLRSTEDIAFAVALFIARKKGSFVSYYMYHGGTNFGRFASS-YVTTSYYDGAPLDEYGL 338
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
I +P WGHLRELH A+KL E L+ ++ LG + EAHI+ ++ C AFL N+D
Sbjct: 339 IWRPTWGHLRELHAAVKLSSEALLFGRYSNFSLGPEQEAHIF-ETELKCVAFLVNFDKHQ 397
Query: 362 DANVTFNGNVYF-LPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
V F N+YF L S+S+L +C+ VVF TA+V +Q ++ V E L
Sbjct: 398 TPTVVFR-NIYFQLAPKSISVLSECRTVVFETARVNAQYG------SRTAEVVESLNDIH 450
Query: 421 AFSWYEEKVGISGNRS-FVRPDLAEQINTTKDTSDYLWYTASIHVMPG-QGKEVFLNIES 478
+ ++E + +++ + L E ++ TKD +DYLWY S +P G+ V LN+ES
Sbjct: 451 TWKAFKEPIPEDISKAVYTGNQLFEHLSMTKDETDYLWYIVSYEYIPSDDGQLVLLNVES 510
Query: 479 LGHAALVFVNKKLVAFGYGNHDF-ANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
H FVN + +G+HD N ++N I LNEG NT+ +LS+MVG + GA +
Sbjct: 511 RAHVLHAFVNTEYAGSVHGSHDGPGNIILNTNISLNEGQNTISLLSVMVGSPDSGAHMER 570
Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
G+ V + + L++ W YQVG+ GE + ++S+ W + + L +
Sbjct: 571 RSFGIHKVSIQQGQQPLHLLNNELWAYQVGLYGEANRIYTQEESSSAEWTEINNLTYHP- 629
Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAY 640
WYKTTF P G +ALNL SMGKG+ WVNG+S+GRYW ++
Sbjct: 630 FTWYKTTFATPVGNDVVALNLTSMGKGEVWVNGESLGRYWVSF 672
>gi|12323389|gb|AAG51670.1|AC010704_14 putative beta-galactosidase, 3' partial; 3669-1 [Arabidopsis
thaliana]
Length = 636
Score = 574 bits (1479), Expect = e-161, Method: Compositional matrix adjust.
Identities = 305/628 (48%), Positives = 396/628 (63%), Gaps = 18/628 (2%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
ANVTYD R+L+IDG+ ++L SGSIHY RSTP++WP LI K+K GG++V++TYVFWN HEP
Sbjct: 23 ANVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVVDTYVFWNVHEP 82
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
+GQ+ F G D+V+F+K V+ GL++ LRIGP+ EW+YGG P WLH + GI FRT N
Sbjct: 83 QQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGIVFRTDN 142
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK MKR+ I+ LMK ENL+ASQGGPIIL+Q+ENEYG V A+ G+ YVKW A
Sbjct: 143 EPFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVGRAFRQEGKSYVKWTAK 202
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFG 240
AV L+T VPWVMC+Q+DAPDP++N CNG C + F PNSP+KP +WTEN++ ++ ++G
Sbjct: 203 LAVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSFYQTYG 262
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
R ED+AF VA F G+F NYYMY GGTNFGR A V TSY AP+DEYG
Sbjct: 263 EEPLIRSAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVITSYYDQAPLDEYG 321
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+RQPKWGHL+ELH A+KLCEE L+S T LG A ++ K +N CAA L N D
Sbjct: 322 LLRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFGKKANLCAAILVNQD-K 380
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
++ V F + Y L SVS+LPDCKNV FNTAKV +Q N + + + L +
Sbjct: 381 CESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYN------TRTRKARQNLSSPQ 434
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
+ + E V S L E +NTT+DTSDYLW T +G L + LG
Sbjct: 435 MWEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQTTRFQ--QSEGAPSVLKVNHLG 492
Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
HA FVN + + +G FL+ K + LN G N L +LS+MVGL N GA +
Sbjct: 493 HALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLALLSVMVGLPNSGAHLERRVV 552
Query: 541 GLFSVILIDLKNGKRDL--SSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
G SV + NG+ L ++ W YQVG++GE + + WKQ ++ L
Sbjct: 553 GSRSV---KIWNGRYQLYFNNYSWGYQVGLKGEKFHVYTEDGSAKVQWKQYRD-SKSQPL 608
Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQA 626
WYK +F PEG+ P+ALNL SMGKG+A
Sbjct: 609 TWYKASFDTPEGEDPVALNLGSMGKGEA 636
>gi|147768425|emb|CAN73625.1| hypothetical protein VITISV_026637 [Vitis vinifera]
Length = 767
Score = 565 bits (1455), Expect = e-158, Method: Compositional matrix adjust.
Identities = 321/832 (38%), Positives = 445/832 (53%), Gaps = 111/832 (13%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ VTYD R+L+++G+R +L SGSIHYPRSTPE
Sbjct: 29 AKTVTYDGRSLIVNGRRELLFSGSIHYPRSTPE--------------------------- 61
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
+ FEG +DLV+F+K + + GL+ LRIGP+ AEWN+GGFP WL +P I FR+
Sbjct: 62 -----FNFEGNYDLVKFIKLIGDYGLYATLRIGPFIEAEWNHGGFPYWLREVPDIIFRSY 116
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK M+++ II++MK+ LFA QGGPIILAQ+ENEY +++ AY G YV+WA
Sbjct: 117 NEPFKYHMEKYSRMIIEMMKEAKLFAPQGGPIILAQIENEYNSIQLAYKELGVQYVQWAG 176
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSF 239
AV L VPW+MC+Q+DAPDP+INTCNG +C D FT PN P+KP +WTEN++ + F
Sbjct: 177 KMAVGLGAGVPWIMCKQKDAPDPVINTCNGRHCGDTFTGPNRPNKPSLWTENWTAQYRVF 236
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
G R EDLAF+VARF GT NYYMY GGTNFGRT G V T Y +AP+DEY
Sbjct: 237 GDPPSQRAAEDLAFSVARFISKNGTLANYYMYHGGTNFGRT-GSSFVTTRYYDEAPLDEY 295
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYD 358
G R+PKWGHL++LH A++LC++ L + P +KLG E Y K ++ CAAFL N
Sbjct: 296 GLQREPKWGHLKDLHSALRLCKKALFTGSPGVEKLGKDKEVRFYEKPGTHICAAFLTNNH 355
Query: 359 SSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
S A +TF G YFLP S+SILPDCK VV+NT +V++Q N + F + K N+ L
Sbjct: 356 SREAATLTFRGEEYFLPPHSISILPDCKTVVYNTQRVVAQHNARN--FVKSKIANKNL-- 411
Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGKEV-F 473
+ +E + + + + E KD SDY W+ SI + +P + +
Sbjct: 412 --KWEMSQEPIPVMTDMKILTKSPMELYXFLKDRSDYAWFVTSIELSNYDLPMKKDIIPV 469
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L I +LGHA L FVN + +G++ NF+ K ++ +G N L ++
Sbjct: 470 LQISNLGHAMLAFVNGNFIGSAHGSNVEKNFVFRKPVKF-QGRNKLHCPAV--------- 519
Query: 534 WFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
+D G+ SV ++ L G D+++ W QVGV GE++ ++ W
Sbjct: 520 -YDSGTTGIHSVQILGLNTGTLDITNNGWGQQVGVNGEHVKAYTQGGSHRVQWTAAKG-- 576
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
++ WYKT F PEG P+ L + SM KG NG
Sbjct: 577 KGPAMTWYKTYFDMPEGNDPVILRMTSMAKG----NGLE--------------------- 611
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
YH+PR W+ P +NLLVI EE GG+P +I ICS
Sbjct: 612 --------------------YHVPRAWLKPSDNLLVIFEETGGNPEEIEXELVNRDTICS 651
Query: 714 FVSEADPPPVDSWKPNLGVVSS-----SPQVRLACERGWHIAAINFASYGIPEGNCGSFR 768
V+E PP V SW+ + + + P+ L C I ++FAS+G P G CG F
Sbjct: 652 IVTEYHPPHVKSWQRHDSKIRAVVDEVKPKGHLKCPNYKVIVKVDFASFGNPLGACGDFE 711
Query: 769 PGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G C + +V++ C G+ C IP+ + ++GAC + K LAV+ C
Sbjct: 712 MGNCTAPNSKKVVEQHCXGKTTCEIPMEAGIFXGNSGACSDITKTLAVQVRC 763
>gi|449519864|ref|XP_004166954.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 3-like, partial
[Cucumis sativus]
Length = 635
Score = 564 bits (1454), Expect = e-158, Method: Compositional matrix adjust.
Identities = 298/643 (46%), Positives = 393/643 (61%), Gaps = 29/643 (4%)
Query: 191 VPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVED 250
VPWVMC+Q+DAPDP+INTCNGFYCD F+PN P KP WTE ++ WF +FG RPVED
Sbjct: 3 VPWVMCKQDDAPDPMINTCNGFYCDYFSPNKPYKPNFWTEAWTAWFNNFGGPNHKRPVED 62
Query: 251 LAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHL 310
LAF VARF + GG+ NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG IRQPK+GHL
Sbjct: 63 LAFGVARFIQKGGSLVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIRQPKFGHL 122
Query: 311 RELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGN 370
+ LH A+KLCE+ L++ +P L +A ++ SS DCAAFL+NY S++ A VTFNG
Sbjct: 123 KRLHDAVKLCEKALLTGEPHDYTLATYQKAKVFSSSSGDCAAFLSNYHSNNTARVTFNGR 182
Query: 371 VYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW--YEEK 428
Y LP WS+SILPDCK+V++NTA+V Q N ++ L +FSW Y E
Sbjct: 183 HYTLPPWSISILPDCKSVIYNTAQVQVQTN----------QLSFLPTKVESFSWETYNEN 232
Query: 429 V-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESLGHA 482
+ I + S L EQ+ TKD SDYLWYT S++V P + GK L S GH
Sbjct: 233 ISSIEEDSSMSYDGLLEQLTITKDNSDYLWYTTSVNVDPNESYLRGGKFPTLTATSKGHG 292
Query: 483 ALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGL 542
VF+N KL +G HD + F +I L G+N + +LS+ GL N G ++ G+
Sbjct: 293 MHVFINGKLAGSSFGTHDNSKFTFTGRINLQAGVNKVSLLSIAGGLPNNGPHYEEREMGV 352
Query: 543 FSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN-KSLIW 600
+ I L GK DLS +W Y+VG++GE + L S + W + S N + L W
Sbjct: 353 LGPVAIHGLDXGKMDLSRQKWSYKVGLKGENMNLGSPSSVQAVDWAKDSLKQENAQPLTW 412
Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
YK F APEG PLAL++ SM KGQ W+NGQ++GRYW+ + + CT C Y G+Y
Sbjct: 413 YKAYFDAPEGDEPLALDMGSMQKGQVWINGQNVGRYWT--ITANGNCT-DCSYSGTYRPR 469
Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADP 720
KCQ CGQP Q YH+PR+W+ P +NL+V+ EE+GG+PS+ISL+ ++ IC+ S+ P
Sbjct: 470 KCQFGCGQPTQQWYHVPRSWLMPTKNLIVVFEEVGGNPSRISLVKRSVTSICTEASQYRP 529
Query: 721 PPVD-SWKPNLGVVSSSP--QVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHMDVL 777
+ N G ++ ++ L C G I+AI FAS+G P G CGS + G CH
Sbjct: 530 VIKNVHMHQNNGELNEQNVLKINLHCAAGQFISAIKFASFGTPSGACGSHKQGTCHSPKS 589
Query: 778 P-IVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
++QK CVG+ C + ++ G CP L K L+ E C
Sbjct: 590 DYVLQKLCVGRQRCLATIPTSIFG--EDPCPNLRKKLSAEVVC 630
>gi|414870185|tpg|DAA48742.1| TPA: hypothetical protein ZEAMMB73_126543 [Zea mays]
Length = 706
Score = 559 bits (1441), Expect = e-156, Method: Compositional matrix adjust.
Identities = 285/648 (43%), Positives = 403/648 (62%), Gaps = 25/648 (3%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V+YD R+L+ DG R + SGSIHYPRS P++WPELI K+KEGGL IETYVFWN HEP +
Sbjct: 43 VSYDRRSLMFDGHREIFLSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYVFWNIHEPEK 102
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G++ FEG+ D+VRF + +QE ++ +R+GP+ AEWN+GG P WL IP I FRT N P
Sbjct: 103 GEFNFEGQNDVVRFFQLIQEHDMYAMVRLGPFIQAEWNHGGLPYWLREIPDIVFRTNNEP 162
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
+K M+ F+ II +K NLFASQGGPIILAQ+ENEY ++E A+ G Y+ WAA A
Sbjct: 163 YKMHMETFVKIIIKRLKDANLFASQGGPIILAQIENEYQHMEAAFKDEGTKYINWAAKMA 222
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFLSFGY 241
++ N +PW+MC+Q AP +I TCNG C G T P + S P++WTEN++ + FG
Sbjct: 223 ISTNIGIPWIMCKQTKAPSDVIPTCNGRNC-GDTWPGPTNKSMPLLWTENWTAQYRVFGD 281
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
R ED+AFAVARFF GGT NYYMY GGTNFGRT+ ++ YD +AP+DE+G
Sbjct: 282 PPSQRSAEDIAFAVARFFSVGGTLANYYMYHGGTNFGRTSAAFVMPKYYD-EAPLDEFGL 340
Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSS 360
++PKWGHLR+LH+A+KLC++ L+ P+ +KLG +LEA ++ C AFL+N+++
Sbjct: 341 YKEPKWGHLRDLHQALKLCKKALLWGTPSTEKLGKQLEARVFEMPEQKVCVAFLSNHNTK 400
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQ---QKNVNELLL 417
DA +TF G YF+P S+S+L DC+ VVF T V +Q N FA Q NV E+
Sbjct: 401 DDATMTFRGRPYFVPRHSISVLADCETVVFGTQHVNAQHNQRTFHFADQTAQNNVWEMFD 460
Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGK-EV 472
+ + + K+ + + N TKD +DY+WYT+S + MP + +
Sbjct: 461 GENVPKYKQAKIRLR--------KAGDLYNLTKDKTDYVWYTSSFKLEADDMPIRSDIKT 512
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
L + S GHA++ FVN K V G+G F + K ++L +G+N + +L+ +G+ + G
Sbjct: 513 VLEVNSHGHASVAFVNNKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAVLASSMGMTDSG 572
Query: 533 AWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
A+ + AG+ V + L G DL++ W + VG+ GE + S WK
Sbjct: 573 AYMEHRLAGVDRVQITGLNAGTLDLTNNGWGHIVGLVGERKQIYTDKGMGSVTWKPAMN- 631
Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAY 640
++ L WYK F P G+ P+ L++++MGKG +VNGQ IGRYW +Y
Sbjct: 632 --DRPLTWYKRHFDMPSGEDPVVLDMSTMGKGMMFVNGQGIGRYWISY 677
>gi|218202538|gb|EEC84965.1| hypothetical protein OsI_32205 [Oryza sativa Indica Group]
Length = 807
Score = 557 bits (1436), Expect = e-156, Method: Compositional matrix adjust.
Identities = 311/831 (37%), Positives = 445/831 (53%), Gaps = 79/831 (9%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V+YD R+L+IDGKR + SG+IHYPRS PE+W +L++ +K GGL IETYVFWN HEP
Sbjct: 36 VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWNGHEPEP 95
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G+YYFEGRFDL+RF+ +++ ++ +RIGP+ AEWN+GG P WL I I FR N P
Sbjct: 96 GKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHIIFRANNEP 155
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK +ENEYGN++ V G+ Y++WAA+ A
Sbjct: 156 FK-------------------------------IENEYGNIKKDRKVEGDKYLEWAAEMA 184
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
++ VPWVMC+Q AP +I TCNG +C D +T +KP +WTEN++ F +FG +
Sbjct: 185 ISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDKNKPRLWTENWTAQFRTFGDQL 244
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
R ED+A+AV RFF GGT NYYMY GGTNFGRT G V T Y +AP+DEYG +
Sbjct: 245 AQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMCK 303
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSSD 362
+PK+GHLR+LH IK + + + + LG EAH Y + C +FL+N ++ D
Sbjct: 304 EPKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSNNNTGED 363
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
V F G +++P+ SVSIL DCK VV+NT +V Q + + + N + +
Sbjct: 364 GTVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHSERSFHTTDETSKNNV------W 417
Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MP-GQGKEVFLNIE 477
Y E + EQ N TKDTSDYLWYT S + +P + + I+
Sbjct: 418 EMYSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDIRPVIQIK 477
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
S HA + F N V G G+ +F+ K ++L GIN + +LS +G+++ G
Sbjct: 478 STAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKDSGGELVE 537
Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST-LPVNK 596
G+ ++ L G DL ++ +EGE + WK LP+
Sbjct: 538 VKGGIQDCVVQGLNTGTLDLQGNGRGHKARLEGEDKEIYTEKGMAQFQWKPAENDLPIT- 596
Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
WYK F P+G P+ ++++SM KG +VNG+ IGRYW++++ +
Sbjct: 597 ---WYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWTSFITLA------------ 641
Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
G P+Q++YHIPR ++ P NLL+I EE G P I + T IC F+S
Sbjct: 642 ----------GHPSQSVYHIPRAFLKPKGNLLIIFEEELGKPGGILIQTVRRDDICVFIS 691
Query: 717 EADPPPVDSWKPNLGVV-----SSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
E +P + +W+ + G + +S + L C I + FAS+G PEG CG+F G
Sbjct: 692 EHNPAQIKTWESDGGQIKLIAEDTSTRGTLNCPPQRTIQEVVFASFGNPEGACGNFTAGT 751
Query: 772 CHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCSI 821
CH D +V+K C+G+ C +PV + G CP LAV+ C +
Sbjct: 752 CHTPDAKAVVEKECLGKESCVLPVVNTVYGADIN-CPATTATLAVQVRCKV 801
>gi|4467146|emb|CAB37515.1| galactosidase like protein [Arabidopsis thaliana]
gi|7270842|emb|CAB80523.1| galactosidase like protein [Arabidopsis thaliana]
Length = 1036
Score = 555 bits (1430), Expect = e-155, Method: Compositional matrix adjust.
Identities = 310/772 (40%), Positives = 437/772 (56%), Gaps = 54/772 (6%)
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
QY F+GRFDLV+F+K + E GL++ LR+GP+ AEWN+GG P WL +P + FRT N PF
Sbjct: 80 QYDFKGRFDLVKFIKLIHEKGLYVTLRLGPFIQAEWNHGGLPYWLREVPDVYFRTNNEPF 139
Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
KE +R++ KI+ +MK+E LFASQGGPIIL Q+ENEY V+ AY GE Y+KWAA+
Sbjct: 140 KEHTERYVRKILGMMKEEKLFASQGGPIILGQIENEYNAVQLAYKENGEKYIKWAANLVE 199
Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYAV 243
++N +PWVMC+Q DAP +IN CNG +C D F PN KP +WTEN++ F FG
Sbjct: 200 SMNLGIPWVMCKQNDAPGNLINACNGRHCGDTFPGPNRHDKPSLWTENWTTQFRVFGDPP 259
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
R VED+AF+VAR+F G+ NYYMY GGTNFGRT+ V T Y DAP+DE+G +
Sbjct: 260 TQRTVEDIAFSVARYFSKNGSHVNYYMYHGGTNFGRTSAH-FVTTRYYDDAPLDEFGLEK 318
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSSD 362
PK+GHL+ +H+A++LC++ L Q LG E Y + CAAFL+N ++
Sbjct: 319 APKYGHLKHVHRALRLCKKALFWGQLRAQTLGPDTEVRYYEQPGTKVCAAFLSNNNTRDT 378
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
+ F G Y LP+ S+SILPDCK VV+NTA++++Q + D F + + ++ L F
Sbjct: 379 NTIKFKGQDYVLPSRSISILPDCKTVVYNTAQIVAQHSWRD--FVKSEKTSKGL----KF 432
Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQ-GKEVFLNIE 477
+ E + + + P E TKD +DY WYT S+ + P Q G + L +
Sbjct: 433 EMFSENIPSLLDGDSLIP--GELYYLTKDKTDYAWYTTSVKIDEDDFPDQKGLKTILRVA 490
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
SLGHA +V+VN + +G H+ +F K + G N + IL ++ GL + G++ +
Sbjct: 491 SLGHALIVYVNGEYAGKAHGRHEMKSFEFAKPVNFKTGDNRISILGVLTGLPDSGSYMEH 550
Query: 538 AGAGLFSVILIDLKNGKRDLS-SGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
AG ++ +I LK+G RDL+ + EW + G+EGE + + W++ K
Sbjct: 551 RFAGPRAISIIGLKSGTRDLTENNEWGHLAGLEGEKKEVYTEEGSKKVKWEKDGK---RK 607
Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
L WYKT F PEG +A+ + +MGKG WVNG +GRYW ++L+P
Sbjct: 608 PLTWYKTYFETPEGVNAVAIRMKAMGKGLIWVNGIGVGRYWMSFLSP------------- 654
Query: 657 YDASKCQKHCGQPAQTLYHIPRTWV--HPGENLLVI-HEELGGDPSKISLLTKTGQHICS 713
G+P QT YHIPR+++ +N+LVI EE G I + ICS
Sbjct: 655 ---------LGEPTQTEYHIPRSFMKGEKKKNMLVILEEEPGVKLESIDFVLVNRDTICS 705
Query: 714 FVSEADPPPVDSWK-PNLGVVSSSPQVRLA----CERGWHIAAINFASYGIPEGNCGSFR 768
V E P V SWK +VS S +RL C + + FAS+G P G CG+F
Sbjct: 706 NVGEDYPVSVKSWKREGPKIVSRSKDMRLKAVMRCPPEKQMVEVQFASFGDPTGTCGNFT 765
Query: 769 PGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
G C +V+K C+G+ CSI V+ G CP ++K LAV+ C
Sbjct: 766 MGKCSASKSKEVVEKECLGRNYCSIVVARETFG--DKGCPEIVKTLAVQVKC 815
>gi|2924512|emb|CAA17766.1| beta-galactosidase-like protein [Arabidopsis thaliana]
gi|7270452|emb|CAB80218.1| beta-galactosidase-like protein [Arabidopsis thaliana]
Length = 831
Score = 544 bits (1402), Expect = e-152, Method: Compositional matrix adjust.
Identities = 313/844 (37%), Positives = 458/844 (54%), Gaps = 101/844 (11%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD +L+IDGKR +L SGSIHYPRSTPE+WP +I+++K+GGL I+TYVFWN HEP +
Sbjct: 54 VTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHEPQQ 113
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G++ F GR DLV+F+K +Q+ G+++ LR+GP+ AEW +G + H +R
Sbjct: 114 GKFNFSGRADLVKFIKLIQKNGMYVTLRLGPFIQAEWTHGYITRYDHKNIAGAYR----- 168
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
++ENEY V+ AY G Y+KWA++
Sbjct: 169 --------------------------------KIENEYSAVQRAYKQDGLNYIKWASNLV 196
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
++ +PWVMC+Q DAPDP+IN CNG +C D F PN +KP +WTEN++ F FG
Sbjct: 197 DSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNRENKPSLWTENWTTQFRVFGDP 256
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
R VED+A++VARFF GT NYYMY GGTNFGRT+ + YD DAP+DEYG
Sbjct: 257 PTQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAHYVTTRYYD-DAPLDEYGLE 315
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYDSSS 361
++PK+GHL+ LH A+ LC++ L+ P +K G E Y + + CAAFLAN ++ +
Sbjct: 316 KEPKYGHLKHLHNALNLCKKPLLWGQPKTEKPGKDTEIRYYEQPGTKTCAAFLANNNTEA 375
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
+ F G Y + S+SILPDCK VV+NTA+++SQ + + F + K N+
Sbjct: 376 AETIKFKGREYVIAPRSISILPDCKTVVYNTAQIVSQHTSRN--FMKSKKANKKF----D 429
Query: 422 FSWYEEKV--GISGNRSFVRPDLAEQINTTKDTSDYLWYTASI-----HVMPGQGKEVFL 474
F + E + + GN S++ +L TKD +DY WYT S H+ +G + F+
Sbjct: 430 FKVFTETLPSKLEGN-SYIPVEL---YGLTKDKTDYGWYTTSFKVHKNHLPTKKGVKTFV 485
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
I SLGHA ++N + + G+G+H+ +F+ K++ L G N L +L ++ G + G++
Sbjct: 486 RIASLGHALHAWLNGEYLGSGHGSHEEKSFVFQKQVTLKAGENHLVMLGVLTGFPDSGSY 545
Query: 535 FDVAGAGLFSVILIDLKNGKRDLS-SGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
+ G + ++ L +G DL+ S +W ++G+EGE +G+ WK+ +
Sbjct: 546 MEHRYTGPRGISILGLTSGTLDLTESSKWGNKIGMEGEKLGIHTEEGLKKVEWKKFTGKA 605
Query: 594 VNKSLIWY----------KTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAP 643
L WY +T F APE + + MGKG WVNG+ +GRYW ++L+P
Sbjct: 606 --PGLTWYQKFSKECETLQTYFDAPESVSAATIRMHGMGKGLIWVNGEGVGRYWQSFLSP 663
Query: 644 STGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI-HEELGGDPSKIS 702
GQP Q YHIPR+++ P +NLLVI EE P +
Sbjct: 664 ----------------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELMD 701
Query: 703 LLTKTGQHICSFVSEADPPPVDSWKPNLGVVSS-----SPQVRLACERGWHIAAINFASY 757
+CS+V E P V W V + S L C IAA+ FAS+
Sbjct: 702 FAIVNRDTVCSYVGENYTPSVRHWTRKKDQVQAITDNVSLTATLKCSGTKKIAAVEFASF 761
Query: 758 GIPEGNCGSFRPGACHMDVLP-IVQKACVGQIECSIPVS-SAYLGVSAGACPGLLKALAV 815
G P G CG+F G C+ V +++K C+G+ EC IPV+ S + +C ++K LAV
Sbjct: 762 GNPIGVCGNFTLGTCNAPVSKQVIEKHCLGKAECVIPVNKSTFQQDKKDSCKNVVKMLAV 821
Query: 816 EAHC 819
+ C
Sbjct: 822 QVKC 825
>gi|326500386|dbj|BAK06282.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 846
Score = 534 bits (1375), Expect = e-149, Method: Compositional matrix adjust.
Identities = 294/767 (38%), Positives = 418/767 (54%), Gaps = 46/767 (5%)
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
Q FEGR DL++F+K +Q ++ +RIGP+ AEWN+GG P WL IP I FR N P+
Sbjct: 105 QVQFEGRNDLIKFLKLIQSHDMYALVRIGPFIQAEWNHGGLPYWLREIPHIIFRANNEPY 164
Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
K+EM++F+ I+ +K +FASQGGP+ILAQ+ENEYGN++ + V G+ Y++WAA A+
Sbjct: 165 KKEMEKFVRFIVQKLKDAEMFASQGGPVILAQIENEYGNIKKDHIVEGDKYLEWAAQMAI 224
Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
+ NT VPW+MC+Q AP +I TCNG +C D +T +KP +WTEN++ F +FG +
Sbjct: 225 STNTGVPWIMCKQSTAPGEVIPTCNGRHCGDTWTLKDKNKPRLWTENWTAQFRAFGDQLA 284
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
R ED+A++V RFF GGT NYYMY+GGTNFGRT G V T Y + P+DEYG +
Sbjct: 285 LRSAEDIAYSVLRFFAKGGTLVNYYMYYGGTNFGRT-GASYVLTGYYDEGPVDEYGMPKA 343
Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSSDA 363
PK+GHLR+LH IK + + + L EAH + C AF++N ++ D
Sbjct: 344 PKYGHLRDLHNLIKSYSRAFLEGKQSFELLAHGYEAHNFEIPEEKLCLAFISNNNTGEDG 403
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
V F G+ Y++P+ SVSIL DCK+VV+NT +V Q + AQ+ L S+A+
Sbjct: 404 TVNFRGDKYYIPSRSVSILADCKHVVYNTKRVFVQHSERSFHTAQK------LAKSNAWE 457
Query: 424 WYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGK-EVFLNIES 478
Y E + S + EQ N TKD SDYLWYT S + +P +G + ++S
Sbjct: 458 MYSEPIPRYKLTSIRNKEPMEQYNLTKDDSDYLWYTTSFRLEADDLPFRGDIRPVVQVKS 517
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
HA + FVN G G+ F+ I L GIN L +LS +G+++ G
Sbjct: 518 TSHALMGFVNDAFAGNGRGSKKEKGFMFETPINLRIGINHLALLSSSMGMKDSGGELVEV 577
Query: 539 GAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
G+ + L G DL W ++V +EGE + + W +T +++
Sbjct: 578 KGGIQDCTIQGLNTGTLDLQVNGWGHKVKLEGEVKEIYTEKGMGAVKWVPATT---GRAV 634
Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
WYK F P+G+ P+ L++ SMGKG +VNG+ +GRYW +Y
Sbjct: 635 TWYKRYFDEPDGEDPVVLDMTSMGKGMIFVNGEGMGRYWPSYRTVG-------------- 680
Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEA 718
G P+Q +YHIPR ++ P NLLVI EE G P I + T IC F+SE
Sbjct: 681 --------GVPSQAMYHIPRPFLKPKNNLLVIFEEELGKPEGILIQTVRRDDICVFISEH 732
Query: 719 DPPPVDSWKPNLG---VVSSSPQVR--LACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
+P + +W + G V++ R L C I + FAS+G PEG+C +F G+CH
Sbjct: 733 NPAQIKTWDKDGGQIKVIAEDHSTRGILKCPPKKTIQEVVFASFGNPEGSCANFTAGSCH 792
Query: 774 M-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
+ IV K C+G+ C +PV G CP LAV+ C
Sbjct: 793 TPNAKDIVAKECLGKKSCVLPVLHTVYGADIN-CPTTTATLAVQVRC 838
>gi|449445172|ref|XP_004140347.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 493
Score = 531 bits (1367), Expect = e-148, Method: Compositional matrix adjust.
Identities = 251/467 (53%), Positives = 328/467 (70%), Gaps = 7/467 (1%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
L NV+YD A++I+G+RR++ SGSIHYPRST +WP+LI+K+K+GGL+ IETY+FW+ H
Sbjct: 18 LGDNVSYDSNAIIINGERRIIFSGSIHYPRSTEAMWPDLIQKAKDGGLDAIETYIFWDRH 77
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP R +Y F GR D ++F + +Q+AGL++ +RIGPY CAEWNYGGFPVWLH +PGIQ RT
Sbjct: 78 EPQRRKYDFSGRLDFIKFFQLIQDAGLYVVMRIGPYVCAEWNYGGFPVWLHNMPGIQLRT 137
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEW-AYGVGGELYVKW 179
N +K EM+ F KI+++ KQ NLFASQGGPIILAQ+ENEYGNV AYG G+ Y+ W
Sbjct: 138 NNQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPAYGDAGKAYINW 197
Query: 180 AADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSF 239
A A +LN VPW+MCQQ DAP PIINTCNGFYCD FTPN+P P M+TEN+ GWF +
Sbjct: 198 CAQMAESLNIGVPWIMCQQSDAPQPIINTCNGFYCDNFTPNNPKSPKMFTENWVGWFKKW 257
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
G P+R ED+AF+VARFF++GG F NYYMY GGTNFGRT+GGP + TSYDY+AP+DEY
Sbjct: 258 GDKDPYRTAEDVAFSVARFFQSGGVFNNYYMYHGGTNFGRTSGGPFITTSYDYNAPLDEY 317
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLE-AHIYHKSSNDCAAFLANYD 358
G + QPKWGHL++LH +IKL E+ L + T+Q G+ + ++ ++ + FL+N D
Sbjct: 318 GNLNQPKWGHLKQLHASIKLGEKILTNGTHTNQNFGSSVTLTKFFNPTTGERFCFLSNTD 377
Query: 359 SSSDANVTFNGN-VYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLL 417
+DA + + YF+PAWSVSIL C V+NTAKV SQ + F +++N E
Sbjct: 378 GKNDATIDLQADGKYFVPAWSVSILDGCNKEVYNTAKVNSQTS----MFVKEQNEKENAQ 433
Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV 464
S A++ K + GN F EQ T D SDY WY ++
Sbjct: 434 LSWAWAPEPMKDTLQGNGKFAANLFLEQKRVTADFSDYFWYMTNVDT 480
>gi|110741385|dbj|BAF02242.1| putative galactosidase [Arabidopsis thaliana]
Length = 592
Score = 530 bits (1364), Expect = e-147, Method: Compositional matrix adjust.
Identities = 280/603 (46%), Positives = 369/603 (61%), Gaps = 20/603 (3%)
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV 286
MWTE ++GWF FG VP+RP ED+AF+VARF + GG+F NYYMY GGTNFGRTAGGP +
Sbjct: 1 MWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFI 60
Query: 287 ATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKS 346
ATSYDYDAP+DEYG RQPKWGHL++LH+AIKLCE L+S +PT LG EAH+Y
Sbjct: 61 ATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTRMPLGNYQEAHVYKSK 120
Query: 347 SNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPF 406
S C+AFLANY+ S A V+F N Y LP WS+SILPDCKN V+NTA+V +Q
Sbjct: 121 SGACSAFLANYNPKSYAKVSFGNNHYNLPPWSISILPDCKNTVYNTARVGAQT------- 173
Query: 407 AQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMP 466
++ K V + ++ Y E + SF L EQINTT+DTSDYLWY + V
Sbjct: 174 SRMKMVRVPVHGGLSWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWYMTDVKVDA 233
Query: 467 GQ-----GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDI 521
+ G L + S GHA VF+N +L YG+ D K + L G N + I
Sbjct: 234 NEGFLRNGDLPTLTVLSAGHAMHVFINGQLSGSAYGSLDSPKLTFRKGVNLRAGFNKIAI 293
Query: 522 LSMMVGLQNYGAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISL 580
LS+ VGL N G F+ AG+ V L L G+RDLS +W Y+VG++GE + L +S
Sbjct: 294 LSIAVGLPNVGPHFETWNAGVLGPVSLNGLNGGRRDLSWQKWTYKVGLKGESLSLHSLSG 353
Query: 581 ANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAY 640
++S W +G+ + + L WYKTTF AP G PLA+++ SMGKGQ W+NGQS+GR+W AY
Sbjct: 354 SSSVEWAEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQSLGRHWPAY 413
Query: 641 LAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSK 700
A G +C Y G++ KC ++CG+ +Q YH+PR+W+ P NLLV+ EE GGDP+
Sbjct: 414 KA--VGSCSECSYTGTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVFEEWGGDPNG 471
Query: 701 ISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYG 758
I+L+ + +C+ + E V+ G V+ P+ L C G I + FAS+G
Sbjct: 472 ITLVRREVDSVCADIYEWQSTLVNYQLHASGKVNKPLHPKAHLQCGPGQKITTVKFASFG 531
Query: 759 IPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEA 817
PEG CGS+R G+CH K CVGQ CS+ V+ G CP ++K LAVEA
Sbjct: 532 TPEGTCGSYRQGSCHAHHSYDAFNKLCVGQNWCSVTVAPEMFG--GDPCPNVMKKLAVEA 589
Query: 818 HCS 820
C+
Sbjct: 590 VCA 592
>gi|449526237|ref|XP_004170120.1| PREDICTED: beta-galactosidase 7-like, partial [Cucumis sativus]
Length = 706
Score = 523 bits (1346), Expect = e-145, Method: Compositional matrix adjust.
Identities = 291/686 (42%), Positives = 405/686 (59%), Gaps = 54/686 (7%)
Query: 158 VENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGF 217
+ENE+GNVE +YG G+ YVKW A+ A + N S PW+MCQQ DAP PIINTCNGFYCD F
Sbjct: 1 IENEFGNVEGSYGQEGKEYVKWCAELAQSYNLSEPWIMCQQGDAPQPIINTCNGFYCDQF 60
Query: 218 TPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNF 277
PN+ + P MWTE+++GWF +G P+R EDLAFAVARFF+ GG+ NYYMY GGTNF
Sbjct: 61 KPNNKNSPKMWTESWAGWFKGWGERDPYRTAEDLAFAVARFFQYGGSLHNYYMYHGGTNF 120
Query: 278 GRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAK 337
GR+AGGP + TSYDY+AP+DEYG + QPKWGHL++LH+ I+ E+ L D H G
Sbjct: 121 GRSAGGPYITTSYDYNAPLDEYGNMNQPKWGHLKQLHELIRSMEKVLTYGDVKHIDTGHS 180
Query: 338 LEAHIY-HKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVI 396
A Y +K + C F N +SD +TF Y +P WSV++LPDCK V+NTAKV
Sbjct: 181 TTATSYTYKGKSSC--FFGN-PENSDREITFQERKYTVPGWSVTVLPDCKTEVYNTAKVN 237
Query: 397 SQRNNGDH-PFAQQKNVNELLLASSAFSWYEEKV-------GISGNRSFVRPDLAEQINT 448
+Q + P K+ L + W EK+ ISG+ + L +Q
Sbjct: 238 TQTTIREMVPSLVGKHKKPL-----KWQWRNEKIEHLTHEGDISGS-AITANSLIDQKMV 291
Query: 449 TKDTSDYLWYTASIHVM---PGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFL 505
T D+SDYLWY H+ P GK V L +++ GH FVN K + +G + +F
Sbjct: 292 TNDSSDYLWYLTGFHLNGNDPLFGKRVTLRVKTRGHILHAFVNNKHIGTQFGPYGKYSFT 351
Query: 506 INKKIE-LNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGK--RDLSSGEW 562
+ KK+ L G N + +LS VGL NYGA+++ G++ + + + +GK RDLS+ EW
Sbjct: 352 LEKKVRNLRHGFNQIALLSATVGLPNYGAYYENVEVGIYGPVEL-IADGKTIRDLSTNEW 410
Query: 563 IYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMG 622
IY+VG++GE W + LP+N++ WYKT+F P+G+ + ++L MG
Sbjct: 411 IYKVGLDGEKYEFFDPDHKFRKPW-LSNNLPLNQNFTWYKTSFSTPKGREGVVVDLMGMG 469
Query: 623 KGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVH 682
KGQAWVNG+SIGRYW +YLA GC+ CDYRG+Y SKC +CG+P Q YHIPR++++
Sbjct: 470 KGQAWVNGKSIGRYWPSYLATENGCSSSCDYRGAYYGSKCATNCGKPTQRWYHIPRSYMN 529
Query: 683 PG-ENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRL 741
G EN L++ EE GG P I + T + +C+ K +LG ++ L
Sbjct: 530 DGKENTLILFEEFGGMPLNIEIKTTRVKKVCA-------------KVDLG-----SKLEL 571
Query: 742 ACERGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLG 800
C + I F +G P+GNC +F G+CH + +++K C+ + +CSI V+ LG
Sbjct: 572 TCHDR-TVKRIIFVGFGNPKGNCNNFHKGSCHSSEAFSVIEKECLWKRKCSIEVTKDKLG 630
Query: 801 VSAGACPGLLKALAVE------AHCS 820
++ P LAV+ +HCS
Sbjct: 631 LTGCKNPK-DNWLAVQPFWHHKSHCS 655
>gi|110737487|dbj|BAF00686.1| beta-galactosidase [Arabidopsis thaliana]
Length = 532
Score = 521 bits (1343), Expect = e-145, Method: Compositional matrix adjust.
Identities = 264/530 (49%), Positives = 336/530 (63%), Gaps = 17/530 (3%)
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
AV+ N VPW+MCQQ DAP +I+TCNGFYCD FTPN+P KP +WTEN+ GWF +FG
Sbjct: 2 AVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQFTPNTPDKPKIWTENWPGWFKTFGGRD 61
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RP ED+A++VARFF GG+ NYYMY GGTNFGRT+GGP + TSYDY+APIDEYG R
Sbjct: 62 PHRPAEDVAYSVARFFGKGGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGLPR 121
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
PKWGHL++LHKAI L E LIS + + LG LEA +Y SS CAAFL+N D +D
Sbjct: 122 LPKWGHLKDLHKAIMLSENLLISGEHQNFTLGHSLEADVYTDSSGTCAAFLSNLDDKNDK 181
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
V F Y LPAWSVSILPDCK VFNTAKV S+ ++ + + E L +SS
Sbjct: 182 AVMFRNTSYHLPAWSVSILPDCKTEVFNTAKVTSKS-------SKVEMLPEDLKSSSGLK 234
Query: 424 W--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
W + EK GI G FV+ +L + INTTKDT+DYLWYT SI V + G L I
Sbjct: 235 WEVFSEKPGIWGAADFVKNELVDHINTTKDTTDYLWYTTSITVSENEAFLKKGSSPVLFI 294
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
ES GH VF+NK+ + GN F + K + L G N +D+LSM VGL N G++++
Sbjct: 295 ESKGHTLHVFINKEYLGTATGNGTHVPFKLKKPVALKAGENNIDLLSMTVGLANAGSFYE 354
Query: 537 VAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
GAGL SV + G +L++ +W Y++GVEGE++ L K + + W + P +
Sbjct: 355 WVGAGLTSVSIKGFNKGTLNLTNSKWSYKLGVEGEHLELFKPGNSGAVKWTVTTKPPKKQ 414
Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYL---APSTGCTKKCDY 653
L WYK P G P+ L++ SMGKG AW+NG+ IGRYW +P+ C K+CDY
Sbjct: 415 PLTWYKVVIEPPSGSEPVGLDMISMGKGMAWLNGEEIGRYWPRIARKNSPNDECVKECDY 474
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISL 703
RG + KC CG+P+Q YH+PR+W N LVI EE GG+P KI L
Sbjct: 475 RGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGNELVIFEEKGGNPMKIKL 524
>gi|298205211|emb|CBI17270.3| unnamed protein product [Vitis vinifera]
Length = 1064
Score = 515 bits (1327), Expect = e-143, Method: Compositional matrix adjust.
Identities = 233/340 (68%), Positives = 280/340 (82%), Gaps = 1/340 (0%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NV+YDHRAL+IDGKRR+L S IHYPR+TPE+WP+LI KSKEGG +VI+TYVFWN HEP+
Sbjct: 28 NVSYDHRALLIDGKRRMLVSAGIHYPRATPEMWPDLIAKSKEGGADVIQTYVFWNGHEPV 87
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
R QY FEGR+D+V+FVK V +GL+LHLRIGPY CAEWN+GGFPVWL IPGI+FRT N
Sbjct: 88 RRQYNFEGRYDIVKFVKLVGSSGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTDNA 147
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PFK+EM+RF+ KI+DLM++E LF+ QGGPII+ Q+ENEYGNVE ++G G+ YVKWAA
Sbjct: 148 PFKDEMQRFVKKIVDLMQKEMLFSWQGGPIIMLQIENEYGNVESSFGQRGKDYVKWAARM 207
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A+ L+ VPWVMCQQ DAPD IIN CNGFYCD F PNS +KP +WTE+++GWF S+G
Sbjct: 208 ALELDAGVPWVMCQQADAPDIIINACNGFYCDAFWPNSANKPKLWTEDWNGWFASWGGRT 267
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RPVED+AFAVARFF+ GG+F NYYMYFGGTNFGR++GGP TSYDYDAPIDEYG +
Sbjct: 268 PKRPVEDIAFAVARFFQRGGSFHNYYMYFGGTNFGRSSGGPFYVTSYDYDAPIDEYGLLS 327
Query: 304 QPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHI 342
QPKWGHL+ELH AIKLCE L++ D P + KLG E +
Sbjct: 328 QPKWGHLKELHAAIKLCEPALVAVDSPQYIKLGPMQEVGV 367
Score = 347 bits (891), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 206/511 (40%), Positives = 286/511 (55%), Gaps = 41/511 (8%)
Query: 334 LGAKLEAHIYH--------KSSN--DCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILP 383
+ K AH+Y +S N C+AFLAN D A+VTF G +Y LP WSVSILP
Sbjct: 561 MDTKQTAHVYRVKESLYSTQSGNGSSCSAFLANIDEHKTASVTFLGQIYKLPPWSVSILP 620
Query: 384 DCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLA 443
DC+ VFNTAKV +Q + N++ + +E + + +F +
Sbjct: 621 DCRTTVFNTAKVGAQTS---------IKTNKISYVPKTWMTLKEPISVWSENNFTIQGVL 671
Query: 444 EQINTTKDTSDYLWYTASIHVMPG-----QGKEV--FLNIESLGHAALVFVNKKLVAFGY 496
E +N TKD SDYLW I+V + +V L+I+S+ +FVN +L+
Sbjct: 672 EHLNVTKDHSDYLWRITRINVSAEDISFWEENQVSPTLSIDSMRDILHIFVNGQLIGSVI 731
Query: 497 GNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKR 555
G+ + + I+L +G N L +LS VGLQNYGA+ + GAG V L KNG+
Sbjct: 732 GHW----VKVVQPIQLLQGYNDLVLLSQTVGLQNYGAFLEKDGAGFKGQVKLTGFKNGEI 787
Query: 556 DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLA 615
DLS W YQVG+ GE+ + I + + W + + WYKT F AP G+ P+A
Sbjct: 788 DLSEYSWTYQVGLRGEFQKIYMIDESEKAEWTDLTPDASPSTFTWYKTFFDAPNGENPVA 847
Query: 616 LNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYH 675
L+L SMGKGQAWVNG IGRYW+ +AP GC KCDYRG Y SKC +CG P Q YH
Sbjct: 848 LDLGSMGKGQAWVNGHHIGRYWTR-VAPKDGC-GKCDYRGHYHTSKCATNCGNPTQIWYH 905
Query: 676 IPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSS 735
IPR+W+ NLLV+ EE GG P +IS+ +++ Q IC+ VSE+ P + +W P+ + +
Sbjct: 906 IPRSWLQASNNLLVLFEETGGKPFEISVKSRSTQTICAEVSESHYPSLQNWSPSDFIDQN 965
Query: 736 S-----PQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIE 789
S P++ L C+ G I++I FASYG P+G+C F G CH + L +V KAC G+
Sbjct: 966 SKNKMTPEMHLQCDDGHTISSIEFASYGTPQGSCQMFSQGQCHAPNSLALVSKACQGKGS 1025
Query: 790 CSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
C I + ++ G C G++K LAVEA C+
Sbjct: 1026 CVIRILNSAFG--GDPCRGIVKTLAVEAKCA 1054
>gi|33521216|gb|AAQ21370.1| beta-galactosidase [Sandersonia aurantiaca]
Length = 568
Score = 513 bits (1320), Expect = e-142, Method: Compositional matrix adjust.
Identities = 268/595 (45%), Positives = 363/595 (61%), Gaps = 47/595 (7%)
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P RP ED+AFAVARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG +R
Sbjct: 1 PHRPAEDIAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLR 60
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
+PKWGHLR+LH+AIKLCE L+S DPT +G ++H++ + CAAFL+NYDS S A
Sbjct: 61 EPKWGHLRDLHRAIKLCEPALVSGDPTVTSIGHYQQSHVFRSKAGACAAFLSNYDSGSYA 120
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
V FNG Y +P WS+SILPDCK VFNTA++ +Q + +A + FS
Sbjct: 121 RVVFNGIHYDIPPWSISILPDCKTTVFNTARIGAQTSQLKMEWAGK------------FS 168
Query: 424 W--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
W Y E +RSF + L EQI+ T+D +DYLWYT +++ + G L +
Sbjct: 169 WESYNEDTNSFDDRSFTKVGLVEQISMTRDNTDYLWYTTYVNIGENEGFLKNGHYPVLTV 228
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
S GH+ +++N +L YG + ++L G N + ILS+ VGL N G F+
Sbjct: 229 NSAGHSMHIYINGQLTGTIYGALENPKLTYTGSVKLWAGSNKISILSVAVGLPNIGGHFE 288
Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
G+ V L L GKRDLS +WIYQ+G++GE + L +S ++S W S
Sbjct: 289 TWNTGVLGPVTLSGLNEGKRDLSWQKWIYQIGLKGEALNLHTLSGSSSVEWGGPSQ---K 345
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
+SL WYKT+F AP G PLAL++ SMGKGQ W+NGQS+GRYW AY A +G CDYRG
Sbjct: 346 QSLTWYKTSFNAPAGNDPLALDMGSMGKGQVWINGQSVGRYWPAYKA--SGSCGGCDYRG 403
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
+Y+ KCQ +CG+ Q YH+PR+W++P NLLV+ EE GGDPS IS++ + + +C+ +
Sbjct: 404 TYNEKKCQSNCGESTQRWYHVPRSWLNPTGNLLVVFEEWGGDPSGISMVRRKVESVCAEI 463
Query: 716 SEADPPPVDSWKPNLGVVSSS----PQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
+E W+PN+ V + + L+C G + I FAS+G P+G CG+F G
Sbjct: 464 AE--------WQPNMDNVHTGNYGRSKAHLSCAPGQKMTNIKFASFGTPQGTCGAFSEGT 515
Query: 772 CH-------MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
CH + ++Q C+GQ C++ V+ G CPG +K LAVEA C
Sbjct: 516 CHAHKSYDAFEKESLLQN-CIGQQSCAVLVAPEVFG--GDPCPGTMKKLAVEAIC 567
>gi|110739914|dbj|BAF01862.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 578
Score = 512 bits (1319), Expect = e-142, Method: Compositional matrix adjust.
Identities = 275/584 (47%), Positives = 357/584 (61%), Gaps = 30/584 (5%)
Query: 251 LAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHL 310
LAF VARF + GG+F NYYMY GGTNFGRTAGGP V TSYDYDAPIDEYG IRQPK+GHL
Sbjct: 1 LAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYGLIRQPKYGHL 60
Query: 311 RELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGN 370
+ELH+AIK+CE+ L+S+DP +G K +AH+Y S DC+AFLANYD+ S A V FN
Sbjct: 61 KELHRAIKMCEKALVSADPVVTSIGNKQQAHVYSAESGDCSAFLANYDTESAARVLFNNV 120
Query: 371 VYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW---YEE 427
Y LP WS+SILPDC+N VFNTAKV Q + + KN F W E+
Sbjct: 121 HYNLPPWSISILPDCRNAVFNTAKVGVQTSQMEMLPTDTKN----------FQWESYLED 170
Query: 428 KVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESLGHA 482
+ + +F L EQIN T+DTSDYLWY S+ + + G+ L I+S GHA
Sbjct: 171 LSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSVDIGDSESFLHGGELPTLIIQSTGHA 230
Query: 483 ALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGL 542
+FVN +L +G F KI L+ G N + +LS+ VGL N G F+ G+
Sbjct: 231 VHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSVAVGLPNVGGHFESWNTGI 290
Query: 543 FS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS-TLPVNKSLIW 600
V L L GK DLS +W YQVG++GE + L + S W S T+ + L W
Sbjct: 291 LGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMNLAFPTNTPSIGWMDASLTVQKPQPLTW 350
Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
+KT F APEG PLAL++ MGKGQ WVNG+SIGRYW+A+ +TG C Y G+Y +
Sbjct: 351 HKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAF---ATGDCSHCSYTGTYKPN 407
Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADP 720
KCQ CGQP Q YH+PR W+ P +NLLVI EELGG+PS +SL+ ++ +C+ VSE
Sbjct: 408 KCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSGVCAEVSEYH- 466
Query: 721 PPVDSWKPN---LGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHMDV- 776
P + +W+ G P+V L C G IA+I FAS+G P G CGS++ G CH
Sbjct: 467 PNIKNWQIESYGKGQTFHRPKVHLKCSPGQAIASIKFASFGTPLGTCGSYQQGECHAATS 526
Query: 777 LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
I+++ CVG+ C++ +S++ G CP +LK L VEA C+
Sbjct: 527 YAILERKCVGKARCAVTISNSNFG--KDPCPNVLKRLTVEAVCA 568
>gi|359477955|ref|XP_003632046.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 10-like [Vitis
vinifera]
Length = 563
Score = 511 bits (1316), Expect = e-142, Method: Compositional matrix adjust.
Identities = 270/546 (49%), Positives = 346/546 (63%), Gaps = 24/546 (4%)
Query: 35 VWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIG 94
+W L++ +KEGG++VIETYVF N HE YYF G +DL++FVK VQ+AG++L L IG
Sbjct: 1 MWSGLVKTAKEGGIDVIETYVFQNGHELSPSNYYFGGWYDLLKFVKIVQQAGMYLILHIG 60
Query: 95 PYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPII 154
P+ EWN+GG P+WLH++P F+T + PFK M++F+ I+++MK++ LFASQGGPII
Sbjct: 61 PFVATEWNFGGVPIWLHYVPRTIFQTNSKPFKYHMQKFMTLIVNIMKKDKLFASQGGPII 120
Query: 155 LAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC 214
L QVENEYG+ + Y GG+ YV WAA+ ++ N VPW+MCQ + DP+INTCN FYC
Sbjct: 121 LTQVENEYGDTKRIYEDGGKPYVMWAANMVLSHNIGVPWIMCQXYASSDPMINTCNSFYC 180
Query: 215 DGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGG 274
D FTPNSPSK MWTEN+ WF +FG + R ED+AF+VA FF NYYMY GG
Sbjct: 181 DQFTPNSPSKAQMWTENWPRWFKTFGASNSHRLHEDIAFSVALFFFPKSX--NYYMYHGG 238
Query: 275 TNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKL 334
TNFG T+GGP + T+Y+Y+APIDEYG R PK GHL+EL +AIK CE L+ +P + L
Sbjct: 239 TNFGCTSGGPFITTTYNYNAPIDEYGLARLPKCGHLKELRRAIKSCEHVLLYGEPINLXL 298
Query: 335 GAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAK 394
G E +Y S AAF++N D D + F Y +PAWSVSILPDCKNVVFNTAK
Sbjct: 299 GPSQEVDVYADSLGGYAAFISNVDEKEDKMIVFQNXSYHVPAWSVSILPDCKNVVFNTAK 358
Query: 395 VISQRNNGDHPFAQQKNVNELLLAS--------SAFSW--YEEKVGISGNRSFVRPDLAE 444
V+SQ +Q + V E L S W + EK GI G FV+ +
Sbjct: 359 VVSQ-------ISQVEMVLEDLQPSLVPSNKDLKGLXWKTFVEKAGIWGEADFVKNGFVD 411
Query: 445 QINTTKDTSDYLWYTASIHVMPGQG--KEV---FLNIESLGHAALVFVNKKLVAFGYGNH 499
INTTKDT+D LWYT SI V + KE+ L +ES GHA FVN+KL GN
Sbjct: 412 HINTTKDTTDXLWYTVSITVGESENFLKEISQPILLVESKGHALHAFVNQKLQGSASGNG 471
Query: 500 DFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSS 559
+ F I L G N + +LSM VGLQN +++ GA L SV + L NG DLS+
Sbjct: 472 SHSPFKFECPISLKAGKNEIVVLSMTVGLQNEIPFYEWVGARLTSVKIKGLNNGIMDLST 531
Query: 560 GEWIYQ 565
WIY+
Sbjct: 532 YPWIYK 537
>gi|281205901|gb|EFA80090.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
PN500]
Length = 727
Score = 502 bits (1293), Expect = e-139, Method: Compositional matrix adjust.
Identities = 284/711 (39%), Positives = 403/711 (56%), Gaps = 42/711 (5%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
NV+YDHR+L+I+G+R++L S SIHYPR+TP +W ++ +K G+++IETY FWN HEP
Sbjct: 42 NVSYDHRSLIINGERKLLLSASIHYPRATPSMWRPVLEATKAAGIDLIETYTFWNLHEPT 101
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
G Y FEG ++ F+ E GL++ +R GPY CAEWNYGGFP WL I GI FR N
Sbjct: 102 PGTYNFEGNANVTAFLDICAELGLYVTVRFGPYVCAEWNYGGFPFWLKEIDGIVFRDYNQ 161
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
PF ++M ++ I++ ++ +AS GGPIILAQVENEYG +E AYG G Y WAA
Sbjct: 162 PFMDQMSNWMTYIVNYLRP--YYASNGGPIILAQVENEYGWLEAAYGASGTKYALWAAQF 219
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYC----DGFTPNSPSKPIMWTENYSGWFLSF 239
A +L+ +PW+MC Q+D +INTCNGFYC D P++P WTEN+ GWF ++
Sbjct: 220 ANSLDIGIPWIMCSQDDIAT-VINTCNGFYCHDWIDVHWTAYPNQPAFWTENWPGWFQNW 278
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
VP RPV+D+ ++VAR+ GG+ NYYM+FGGT FGR GGP + TSYDYD IDEY
Sbjct: 279 EGGVPHRPVQDVLYSVARWIAYGGSMMNYYMWFGGTTFGRWTGGPFITTSYDYDGAIDEY 338
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQ-KLGAKLE-AHIYHKSSNDCAAFLANY 357
G+ +PK+ E H I E ++S +P LG +E +H Y + + +FLAN+
Sbjct: 339 GYPYEPKYSQSLEFHTIIHAYEHIILSMNPPKPILLGENVEISHFYSVETGESFSFLANF 398
Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVI-SQRNNGDHPFAQQKNVNELL 416
++ V +NG + + WSV +L + ++ +A I S P +N+ +
Sbjct: 399 GATGVQTVQWNGITFKVQPWSVQLLYNNVSIFDTSATPIGSPVPKQFTPIKSFENIGQ-- 456
Query: 417 LASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNI 476
+ E ++ P EQ++ T+D +DYLWY I V + NI
Sbjct: 457 --------WSESFDLTFTNYSETP--MEQLSLTRDQTDYLWYVTKIEVNRVGAQLSLPNI 506
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
+ H VFV+ + +A G G N +N I + G +TL +L VGL NY +
Sbjct: 507 SDMVH---VFVDNQYIATGRGP---TNITLNSTIGV--GGHTLQVLHTKVGLVNYAEHME 558
Query: 537 VAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
AG+F + +D D+SS W + V+GE + L + + S W + + N
Sbjct: 559 ATVAGIFEPVTLD----SVDISSNGWSMKPFVQGETLQLYNPNHSGSVQW---TNVTGNP 611
Query: 597 SLIWYKTTF-LAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
L WYK F L LAL++ M KG +VNG +IGRYW LA + GC C Y+G
Sbjct: 612 PLTWYKFNFNLELSSNMSLALDMLGMTKGMIFVNGYNIGRYW---LALAYGC-NPCTYQG 667
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
Y S CQ CG+P+Q YH+P W+ GEN +VI EE+ G+P I+L+ +
Sbjct: 668 GYSPSMCQLGCGEPSQQYYHVPTDWLMNGENEIVIFEEVYGNPEAITLVQR 718
>gi|297789001|ref|XP_002862517.1| hypothetical protein ARALYDRAFT_333310 [Arabidopsis lyrata subsp.
lyrata]
gi|297308086|gb|EFH38775.1| hypothetical protein ARALYDRAFT_333310 [Arabidopsis lyrata subsp.
lyrata]
Length = 534
Score = 496 bits (1276), Expect = e-137, Method: Compositional matrix adjust.
Identities = 265/539 (49%), Positives = 349/539 (64%), Gaps = 31/539 (5%)
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
G +RQPKWGHLR+LHKAIKLCE+ LI++DPT LG+ LEA +Y +S CAAFLAN +
Sbjct: 9 GLLRQPKWGHLRDLHKAIKLCEDALIATDPTISSLGSNLEAAVYKTASGSCAAFLANVGT 68
Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHP--FAQQK---NVNE 414
SDA V+FNG Y LPAWSVSILPDCKNV FNTAK+ N+ P FA+Q +
Sbjct: 69 KSDATVSFNGESYHLPAWSVSILPDCKNVAFNTAKI----NSATEPTAFARQSLKPDGGS 124
Query: 415 LLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQG 469
S +S+ +E +GIS +F++P L EQINTT D SDYLWY+ + + +G
Sbjct: 125 SAELGSEWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRMDIKGDETFLDEG 184
Query: 470 KEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQ 529
+ L+IESLG F+N KL G+G + ++ I L G NT+D+LS+ VGL
Sbjct: 185 SKAVLHIESLGQVVYAFINGKLAGSGHGKQKIS---LDIPINLVAGKNTVDLLSVTVGLA 241
Query: 530 NYGAWFDVAGAGLFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWK 587
NYGA+FD+ GAG+ V L K G DL+S +W YQVG++GE GL + +SS W
Sbjct: 242 NYGAFFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLGAV---DSSEWV 298
Query: 588 QGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGC 647
S LP + LIWYKTTF AP G P+A++ KG AWVNGQSIGRYW +A + GC
Sbjct: 299 SKSPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTVKGIAWVNGQSIGRYWPTSIAGNGGC 358
Query: 648 TKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK- 706
T CDYRGSY A+KC K+CG+P+QTLYH+PR+W+ P N LV+ EE+GGDP++IS TK
Sbjct: 359 TDSCDYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNTLVLFEEMGGDPTQISFGTKQ 418
Query: 707 TGQHICSFVSEADPPPVDSWKPNLGVVS---SSPQVRLACERGWH-IAAINFASYGIPEG 762
TG ++C VS++ PPPVD+W + + + + P + L C I++I FAS+G P+G
Sbjct: 419 TGSNLCLTVSQSHPPPVDTWTSDSKISNRNRTRPVLSLQCPVSTQVISSIKFASFGTPKG 478
Query: 763 NCGSFRPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CGSF G+C+ L +VQKAC+G C+I VS+ G C G++K+LAVEA CS
Sbjct: 479 TCGSFTSGSCNSSRSLSLVQKACIGSRSCNIEVSTRVFGE---PCRGVVKSLAVEASCS 534
>gi|15027869|gb|AAK76465.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 621
Score = 487 bits (1253), Expect = e-134, Method: Compositional matrix adjust.
Identities = 262/647 (40%), Positives = 373/647 (57%), Gaps = 38/647 (5%)
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A +L+ VPW+MCQQ +AP P++ TCNGFYCD + P +PS P MWTEN++GWF ++G
Sbjct: 2 ANSLDIGVPWLMCQQPNAPQPMLETCNGFYCDQYEPTNPSTPKMWTENWTGWFKNWGGKH 61
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P+R EDLAF+VARFF+TGGTFQNYYMY GGTNFGR AGGP + TSYDY AP+DE+G +
Sbjct: 62 PYRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTSYDYHAPLDEFGNLN 121
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
QPKWGHL++LH +K E+ L + + LG ++A IY + + F+ N ++++DA
Sbjct: 122 QPKWGHLKQLHTVLKSMEKSLTYGNISRIDLGNSIKATIY-TTKEGSSCFIGNVNATADA 180
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
V F G Y +PAWSVS+LPDC +NTAKV +Q + ++ + + SA
Sbjct: 181 LVNFKGKDYHVPAWSVSVLPDCDKEAYNTAKVNTQTSIMTEDSSKPERLEWTWRPESA-- 238
Query: 424 WYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV---MPGQGKEVFLNIESLG 480
+K+ + G+ + L +Q + T D SDYLWY +H+ P + + L + S
Sbjct: 239 ---QKMILKGSGDLIAKGLVDQKDVTNDASDYLWYMTRLHLDKKDPLWSRNMTLRVHSNA 295
Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKI-ELNEGINTLDILSMMVGLQNYGAWFDVAG 539
H +VN K V + ++ +K+ L G N + +LS+ VGLQNYG +F+
Sbjct: 296 HVLHAYVNGKYVGNQFVKDGKFDYRFERKVNHLVHGTNHISLLSVSVGLQNYGPFFESGP 355
Query: 540 AGLFS-VILIDLKNG---KRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
G+ V L+ K ++DLS +W Y++G+ G L I W LP
Sbjct: 356 TGINGPVSLVGYKGEETIEKDLSQHQWDYKIGLNGYNDKLFSIKSVGHQKWAN-EKLPTG 414
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
+ L WYK F AP GK P+ ++L +GKG+AW+NGQSIGRYW ++ + GC KCDYRG
Sbjct: 415 RMLTWYKAKFKAPLGKEPVIVDLNGLGKGEAWINGQSIGRYWPSFNSSDDGCKDKCDYRG 474
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVH-PGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
+Y + KC CG+P Q YH+PR++++ G N + + EE+GG+PS ++ T +C+
Sbjct: 475 AYGSDKCAFMCGKPTQRWYHVPRSFLNASGHNTITLFEEMGGNPSMVNFKTVVVGTVCAR 534
Query: 715 VSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH- 773
E + +V L+C I+A+ FAS+G P G+CGSF G C
Sbjct: 535 AHEHN------------------KVELSCHNR-PISAVKFASFGNPLGHCGSFAVGTCQG 575
Query: 774 -MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
D V K CVG++ C++ VSS G S C K LAVE C
Sbjct: 576 DKDAAKTVAKECVGKLNCTVNVSSDTFG-STLDCGDSPKKLAVELEC 621
>gi|115445061|ref|NP_001046310.1| Os02g0219200 [Oryza sativa Japonica Group]
gi|113535841|dbj|BAF08224.1| Os02g0219200, partial [Oryza sativa Japonica Group]
Length = 500
Score = 484 bits (1247), Expect = e-134, Method: Compositional matrix adjust.
Identities = 250/518 (48%), Positives = 328/518 (63%), Gaps = 26/518 (5%)
Query: 197 QQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVA 256
+Q+DAPDP+INTCNGFYCD F+PN KP MWTE ++GWF SFG VP RPVEDLAFAVA
Sbjct: 1 KQDDAPDPVINTCNGFYCDYFSPNKNYKPSMWTEAWTGWFTSFGGGVPHRPVEDLAFAVA 60
Query: 257 RFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKA 316
RF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDE+G +RQPKWGHLR+LH+A
Sbjct: 61 RFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLLRQPKWGHLRDLHRA 120
Query: 317 IKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPA 376
IK E L+S+DPT + +G+ +A+++ + CAAFL+NY ++ V FNG Y LPA
Sbjct: 121 IKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNTAVKVRFNGQQYNLPA 180
Query: 377 WSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW--YEEKVGISGN 434
WS+SILPDCK VFNTA V ++ + + F+W Y E +
Sbjct: 181 WSISILPDCKTAVFNTATV------------KEPTLMPKMNPVVRFAWQSYSEDTNSLSD 228
Query: 435 RSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ---GKEVFLNIESLGHAALVFVNKKL 491
+F + L EQ++ T D SDYLWYT +++ G+ L + S GH+ VFVN K
Sbjct: 229 SAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTNDLRSGQSPQLTVYSAGHSMQVFVNGKS 288
Query: 492 VAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VILIDL 550
YG +D N ++++ +G N + ILS VGL N G F+ G+ V L L
Sbjct: 289 YGSVYGGYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFENWNVGVLGPVTLSSL 348
Query: 551 KNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFW-KQGSTLPVNKSLIWYKTTFLAPE 609
G +DLS +W YQVG++GE +GL ++ +++ W G P L W+K F AP
Sbjct: 349 NGGTKDLSHQKWTYQVGLKGETLGLHTVTGSSAVEWGGPGGYQP----LTWHKAFFNAPA 404
Query: 610 GKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQP 669
G P+AL++ SMGKGQ WVNG +GRYWS S GC C Y G+Y KC+ +CG
Sbjct: 405 GNDPVALDMGSMGKGQLWVNGHHVGRYWS--YKASGGC-GGCSYAGTYHEDKCRSNCGDL 461
Query: 670 AQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
+Q YH+PR+W+ PG NLLV+ EE GGD + +SL T+T
Sbjct: 462 SQRWYHVPRSWLKPGGNLLVVLEEYGGDLAGVSLATRT 499
>gi|414888319|tpg|DAA64333.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
gi|414888320|tpg|DAA64334.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
Length = 592
Score = 477 bits (1228), Expect = e-131, Method: Compositional matrix adjust.
Identities = 236/532 (44%), Positives = 337/532 (63%), Gaps = 14/532 (2%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD R+L+IDGKR + SG+IHYPRS PEVWP+LI ++KEGGL IETY+FWN HEP
Sbjct: 36 VTYDGRSLMIDGKRDLFFSGAIHYPRSPPEVWPKLIERAKEGGLNTIETYIFWNAHEPEP 95
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G+Y FEGRFDL++++K +QE ++ +RIGP+ AEWN+GG P WL I I FR N+P
Sbjct: 96 GKYNFEGRFDLIKYLKMIQEHDMYAIVRIGPFIQAEWNHGGLPYWLREIDHIIFRANNDP 155
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
+K+EM++F+ I+ +K LFASQGGPIIL Q+ENEYGN++ + G+ Y++WAA A
Sbjct: 156 YKKEMEKFVRFIVQKLKDAELFASQGGPIILTQIENEYGNIKKDHATDGDKYLEWAAQMA 215
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
++ T VPW+MC+Q AP +I TCNG +C D +T +KP++WTEN++ F ++G V
Sbjct: 216 LSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWTLRDKNKPMLWTENWTQQFRAYGDQV 275
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
R ED+A+AV RFF GG+ NYYMY GGTNFGRT G V T Y +AP+DEYG +
Sbjct: 276 AMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMYK 334
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYH-KSSNDCAAFLANYDSSSD 362
+PK+GHLR+LH I+ ++ + + + LG EAHI+ N C +FL+N ++ D
Sbjct: 335 EPKFGHLRDLHNVIRSYQKAFLLGKHSSEILGHGYEAHIFELPEENLCLSFLSNNNTGED 394
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
V F G +++P+ SVSIL CKNVV+NT +V Q N + + +E+ ++ +
Sbjct: 395 GTVIFRGEKHYVPSRSVSILAGCKNVVYNTKRVFVQHNERSY------HTSEVTSKNNQW 448
Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGK-EVFLNIE 477
Y EK+ + + EQ N TKD SDYLWYT S + +P + L ++
Sbjct: 449 EMYSEKIPKYRDTKVRMKEPLEQFNQTKDASDYLWYTTSFRLESDDLPFRNDIRPVLQVK 508
Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQ 529
S H+ + F N V G+ F+ K ++L G+N + +LS +G++
Sbjct: 509 SSAHSMMGFANDAFVGCARGSKQVKGFMFEKPVDLKVGVNHVVLLSSTMGMK 560
>gi|330804272|ref|XP_003290121.1| hypothetical protein DICPUDRAFT_48969 [Dictyostelium purpureum]
gi|325079786|gb|EGC33370.1| hypothetical protein DICPUDRAFT_48969 [Dictyostelium purpureum]
Length = 735
Score = 471 bits (1211), Expect = e-129, Method: Compositional matrix adjust.
Identities = 279/719 (38%), Positives = 402/719 (55%), Gaps = 42/719 (5%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP- 62
N+TYDHR+L+I+G+R++L SGS+HYPR++ W E+++ SK G+++IETY+FWN H+P
Sbjct: 41 NITYDHRSLIINGERKLLVSGSVHYPRASVSKWNEILKSSKLAGVDIIETYIFWNVHQPN 100
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
++Y E ++ F+ +E LF++LRIGPY CAEWNYGGFP+WL I GI FR N
Sbjct: 101 TPNEFYLEDNANITLFLDLCKENELFVNLRIGPYVCAEWNYGGFPIWLKNIEGIVFRDYN 160
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PF + M ++ ++D K ++ FA GGPII+AQ+ENEYG +E YG G Y WA +
Sbjct: 161 QPFMDAMSTWVTMVVD--KLQDYFAPNGGPIIIAQIENEYGWLENEYGASGREYALWAIN 218
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNS----PSKPIMWTENYSGWFLS 238
A +LN +PW+MC QED D INTCNGFYC + P +P WTEN+ GWF +
Sbjct: 219 FAKSLNIGIPWIMCAQEDI-DSAINTCNGFYCHDWIDRHWNAFPDQPAFWTENWVGWFEN 277
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDE 298
+G AVP RPV+D+ F+ ARF GG+ NYYM+FGGTNFGR+ GGP + TSY+YDAP+DE
Sbjct: 278 WGQAVPKRPVQDMLFSSARFIAYGGSLFNYYMWFGGTNFGRSVGGPWIITSYEYDAPLDE 337
Query: 299 YGFIRQPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYHKSSNDCAAFLANY 357
+GF +PK+ + H I E ++ D PT L EAH Y + FL N+
Sbjct: 338 FGFPNEPKYSMSTQFHFVIHKYESIIMGMDPPTPVPLSNISEAHPYGED----LVFLTNF 393
Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLL 417
D + + G Y L WSV I+ +VVF+T+ V + Q K+V +
Sbjct: 394 GLVIDY-IQWQGTNYTLQPWSVVIVY-SGSVVFDTSYVPDEYIKPSTR-DQFKDVPNAIN 450
Query: 418 ASSAFSWYEEKVGISGNRSFVRPDL-AEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNI 476
S S+ E N + + EQIN T DT+DYLWYT +I + + L I
Sbjct: 451 YDSILSFSEWGQSDIINDCIINNESPLEQINLTNDTTDYLWYTTNITL----NETTTLTI 506
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGIN-TLDILSMMVGLQNYGAWF 535
E++ VF+N G+ + N IN L IL+M +GL+NY A
Sbjct: 507 ENMYDFCHVFLNGAYQGNGWSPVAYITLE-----PTNGNINYQLQILTMTMGLENYAAHM 561
Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
+ GL I + G+ ++++ +W + G+ GE + + ++ W Q
Sbjct: 562 ESYSRGLLGSISL----GQTNITNNQWSMKPGILGEKLQIYNEYSSSKVNW-QPYNPSAT 616
Query: 596 KSLIWYKTT-----FLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
+S+ WY+ + LN+ SM KG +VNG +IGRY+ A + CT K
Sbjct: 617 QSMTWYQFNISLDGLSSDPSSNAYVLNMTSMNKGFVYVNGFNIGRYF-LMEATQSNCTLK 675
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGEN----LLVIHEELGGDPSKISLLT 705
DY G Y S + C +P+Q+LYHIP W+ ++ +++ EE+ GDP+KI LL+
Sbjct: 676 QDYIGIYTPSNNRIDCNEPSQSLYHIPLDWLFLQQDKQYATVILFEEVNGDPTKIQLLS 734
>gi|14517399|gb|AAK62590.1| At2g32810/F24L7.5 [Arabidopsis thaliana]
gi|25090389|gb|AAN72290.1| At2g32810/F24L7.5 [Arabidopsis thaliana]
Length = 585
Score = 463 bits (1192), Expect = e-127, Method: Compositional matrix adjust.
Identities = 267/586 (45%), Positives = 347/586 (59%), Gaps = 46/586 (7%)
Query: 270 MYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSD- 328
MYFGGTNFGRT+GGP TSYDYDAP+DEYG +PKWGHL++LH AIKLCE L+++D
Sbjct: 1 MYFGGTNFGRTSGGPFYITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAADA 60
Query: 329 PTHQKLGAKLEAHIYHKSSND----CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPD 384
P ++KLG+K EAHIYH CAAFLAN D A+V FNG Y LP WSVSILPD
Sbjct: 61 PQYRKLGSKQEAHIYHGDGETGGKVCAAFLANIDEHKSAHVKFNGQSYTLPPWSVSILPD 120
Query: 385 CKNVVFNTAKVISQRNNGDHPFAQ---------QKNVNELLLASSAFSWY--EEKVGISG 433
C++V FNTAKV +Q + A+ QK V + ++ + SW +E +GI G
Sbjct: 121 CRHVAFNTAKVGAQTSVKTVESARPSLGSMSILQKVVRQDNVSYISKSWMALKEPIGIWG 180
Query: 434 NRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-------GKEVFLNIESLGHAALVF 486
+F L E +N TKD SDYLW+ I V G ++I+S+ VF
Sbjct: 181 ENNFTFQGLLEHLNVTKDRSDYLWHKTRISVSEDDISFWKKNGPNSTVSIDSMRDVLRVF 240
Query: 487 VNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLF-SV 545
VNK+L G+ A + + +G N L +L+ VGLQNYGA+ + GAG
Sbjct: 241 VNKQLAGSIVGHWVKAV----QPVRFIQGNNDLLLLTQTVGLQNYGAFLEKDGAGFRGKA 296
Query: 546 ILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS---LIWYK 602
L KNG DLS W YQVG++GE DKI + + STL + S +WYK
Sbjct: 297 KLTGFKNGDLDLSKSSWTYQVGLKGE---ADKIYTVEHNEKAEWSTLETDASPSIFMWYK 353
Query: 603 TTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKC 662
T F P G P+ LNL SMG+GQAWVNGQ IGRYW+ ++ GC + CDYRG+Y++ KC
Sbjct: 354 TYFDPPAGTDPVVLNLESMGRGQAWVNGQHIGRYWN-IISQKDGCDRTCDYRGAYNSDKC 412
Query: 663 QKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPP 722
+CG+P QT YH+PR+W+ P NLLV+ EE GG+P KIS+ T T +C VSE+ PP
Sbjct: 413 TTNCGKPTQTRYHVPRSWLKPSSNLLVLFEETGGNPFKISVKTVTAGILCGQVSESHYPP 472
Query: 723 VDSWKP------NLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-D 775
+ W + + S +P+V L CE G I++I FASYG P G+C F G CH +
Sbjct: 473 LRKWSTPDYINGTMSINSVAPEVHLHCEDGHVISSIEFASYGTPRGSCDGFSIGKCHASN 532
Query: 776 VLPIVQKACVGQIECSIPVS-SAYLGVSAGACPGLLKALAVEAHCS 820
L IV +AC G+ C I VS +A++ + C G LK LAV + CS
Sbjct: 533 SLSIVSEACKGRNSCFIEVSNTAFI---SDPCSGTLKTLAVMSRCS 575
>gi|413922056|gb|AFW61988.1| hypothetical protein ZEAMMB73_453254 [Zea mays]
Length = 326
Score = 461 bits (1187), Expect = e-127, Method: Compositional matrix adjust.
Identities = 207/299 (69%), Positives = 248/299 (82%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+A V+YDHRA+VI+G+RR+L SGSIHYPRSTPE+WP L++K+K+GGL+V++TYVFWN HE
Sbjct: 25 NAAVSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTYVFWNGHE 84
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P+RGQYYF R+DLVRFVK ++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 85 PVRGQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTD 144
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PFK M+ F+ KI+ +MK E LF QGGPIILAQVENEYG +E G G + Y WAA
Sbjct: 145 NGPFKAAMQAFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGAGAKPYANWAA 204
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
AV VPWVMC+Q+DAPDP+INTCNGFYCD F+PNS SKP MWTE ++GWF +FG
Sbjct: 205 KMAVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNSNSKPTMWTEAWTGWFTAFGG 264
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
AVP RPVED+AFAVARF + GG+F NYYMY GGTNF RT+GGP +ATSYDYDAPIDEYG
Sbjct: 265 AVPHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYG 323
>gi|226532830|ref|NP_001140495.1| uncharacterized protein LOC100272556 precursor [Zea mays]
gi|194699714|gb|ACF83941.1| unknown [Zea mays]
gi|195659509|gb|ACG49222.1| hypothetical protein [Zea mays]
gi|414881558|tpg|DAA58689.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 346
Score = 459 bits (1182), Expect = e-126, Method: Compositional matrix adjust.
Identities = 202/295 (68%), Positives = 246/295 (83%)
Query: 6 TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
TYD +A+V++G+RR+L SGSIHYPRS PE+WP+LI+K+K+GGL+V++TYVFWN HEP R
Sbjct: 30 TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
QYYFEGR+DLV F+K V++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT N PF
Sbjct: 90 QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPF 149
Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
K EM+ F KI+D+MK E LF QGGPIIL+Q+ENE+G +EW G + Y WAA+ AV
Sbjct: 150 KAEMQNFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 209
Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
LNTSVPWVMC+++DAPDPIINTCNGFYCD F+PN P KP MWTE ++ W+ FG VP
Sbjct: 210 ALNTSVPWVMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPH 269
Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
RPVEDLA+ VA+F + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG
Sbjct: 270 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYG 324
>gi|328872959|gb|EGG21326.1| glycoside hydrolase family 35 protein [Dictyostelium fasciculatum]
Length = 759
Score = 459 bits (1181), Expect = e-126, Method: Compositional matrix adjust.
Identities = 272/727 (37%), Positives = 406/727 (55%), Gaps = 69/727 (9%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V YD R+L I+G+R+++ SGSIHYPRSTP +WP LI+KSK+ G+ +IETYVFWN H+P
Sbjct: 46 VEYDQRSLKINGERKLMISGSIHYPRSTPSMWPSLIKKSKDAGINMIETYVFWNLHQPNN 105
Query: 65 GQYY-FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
Q Y FEG ++ F+ Q+ GL++HLRIGPY CAEWNYGG P WL IPGI FR N
Sbjct: 106 SQEYNFEGNANITHFLDLCQQEGLYVHLRIGPYVCAEWNYGGIPSWLRNIPGIVFRDYNQ 165
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
P+ EM ++ I++ +K FAS GGPIILAQVENEYG +E YG G+LY +WA
Sbjct: 166 PWMTEMASWMTFIVNYLKP--YFASNGGPIILAQVENEYGWLENEYGDSGKLYAEWAISF 223
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNS----PSKPIMWTENYSGWFLSF 239
A +LN +PW MCQQ D D INTCNGFYC + P++P +TEN++GW +
Sbjct: 224 AKSLNIGIPWTMCQQNDIDDA-INTCNGFYCHDWIQYHFQVYPNQPAFFTENWAGWIQYY 282
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
VP RP EDL ++VAR+F GG+ NYYM+ GGT F R + + SYDYDA +DEY
Sbjct: 283 SEGVPHRPTEDLLYSVARWFSRGGSLMNYYMWHGGTTFARYS-STFLTNSYDYDAALDEY 341
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAK-------LEAHIYHKSSN---D 349
G+ +PK+ L +LH + L+SS + + +E Y+ + N +
Sbjct: 342 GYEAEPKYSALAQLHSVLSQYSYILLSSGEVARPVNISNITTCNTIEIIQYNTTINGTLE 401
Query: 350 CAAFLANYDSSSDANV--TFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFA 407
F+ N+ SS A V +NG + WSV IL + + V+ + Q+ + F
Sbjct: 402 TITFVTNFGVSSSAPVQLNWNGQTITVNPWSVLILYNNQTVI--DTSYVKQQYSAQKEFY 459
Query: 408 QQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDL-AEQINTTKDTSDYLWYTASIHVMP 466
Q K V +L++S + E +G+ + V +L +EQ++ T D +DYL
Sbjct: 460 QSKRVKNVLVSS-----WTEPIGVGNYSNVVTANLPSEQLDLTLDQTDYL---------- 504
Query: 467 GQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMV 526
N + + + ++++ + ++ G+ A+F+++ K + G + L ILS+ +
Sbjct: 505 -------CNADDMIY---IYIDGEYQSWSRGSP--AHFVLDTKFGI--GTHKLSILSLTM 550
Query: 527 GLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFW 586
GL +YG+ F+ GL + + G +D+++ W + + GE G+ S + + W
Sbjct: 551 GLISYGSHFESYKRGLNGTVTL----GTQDITNNGWSMRPYLVGEMQGIQ--SNPHLTSW 604
Query: 587 KQGSTLPVNKSLIWYKTTFLAP---EGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAP 643
+ L +N+ L WYK + + AL++ M KG VNG SIGRYW L
Sbjct: 605 SINNELSINQPLTWYKLNLIIQSEIQDTSSFALDMIGMNKGFIIVNGNSIGRYW---LTL 661
Query: 644 STGCTKKCDYRGS-YDASKCQKHCGQPAQTLYHIPRTWVH--PGE-NLLVIHEELGGDPS 699
GC C+Y G Y C+ CG+P++ YH+P +++ P + N +++ EEL GDP+
Sbjct: 662 GWGCGSGCNYTGDGYQGYLCRTGCGEPSERYYHVPNDYLYLEPNQLNEIIVFEELSGDPN 721
Query: 700 KISLLTK 706
I L+ +
Sbjct: 722 SIQLVQR 728
>gi|449436076|ref|XP_004135820.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 486
Score = 459 bits (1181), Expect = e-126, Method: Compositional matrix adjust.
Identities = 207/323 (64%), Positives = 261/323 (80%), Gaps = 3/323 (0%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
+VTYDH+A++I+G+RR+L SGSIHYPRSTP++WP+LI+K+K+GGL++IETYVFWN HEP
Sbjct: 20 GSVTYDHKAIIINGRRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDIIETYVFWNGHEP 79
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G+YYFE R+DLVRF+K VQ+AGL++HLRIGPY CAEWNYGGFP+WL F+PGI FRT N
Sbjct: 80 SPGKYYFEERYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYGGFPIWLKFVPGIAFRTDN 139
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M++F+ KI+D+MK E LF +QGGPIIL+Q+ENEYG VEW G G+ Y KWAA
Sbjct: 140 APFKAAMQKFVYKIVDMMKWEKLFHTQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAAQ 199
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
AV L T VPWVMC+QEDAPDP+I+TCNGFYC+ F PN KP +WTEN+SGW+ +FG
Sbjct: 200 MAVGLKTGVPWVMCKQEDAPDPLIDTCNGFYCENFKPNQIYKPKIWTENWSGWYTAFGGP 259
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
P+RP ED+AF+VARF + GG+ NYYMY GGTNFGRT+ G V TSYD+DAPIDEYG +
Sbjct: 260 TPYRPPEDVAFSVARFIQNGGSLVNYYMYHGGTNFGRTS-GLFVTTSYDFDAPIDEYGLL 318
Query: 303 RQPKWG--HLRELHKAIKLCEEY 323
R+P G L+ L++ + +Y
Sbjct: 319 REPILGPVTLKGLNEGTRDMSKY 341
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 80/166 (48%), Positives = 104/166 (62%), Gaps = 4/166 (2%)
Query: 542 LFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWY 601
L V L L G RD+S +W Y+VG+ GE + L + +NS W +GS + L WY
Sbjct: 323 LGPVTLKGLNEGTRDMSKYKWSYKVGLRGEILNLYSVKGSNSVQWMKGSFQ--KQPLTWY 380
Query: 602 KTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASK 661
KTTF P G PLAL+++SM KGQ WVNG+SIGRY+ Y+A G KC Y G + K
Sbjct: 381 KTTFNTPAGNEPLALDMSSMSKGQIWVNGRSIGRYFPGYIA--RGKCNKCSYTGFFTEKK 438
Query: 662 CQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
C +CG P+Q YHIPR W+ P NLL+I EE+GG+P ISL+ +T
Sbjct: 439 CLWNCGGPSQKWYHIPRDWLSPNGNLLIILEEIGGNPQGISLVKRT 484
>gi|218201568|gb|EEC83995.1| hypothetical protein OsI_30162 [Oryza sativa Indica Group]
Length = 1078
Score = 459 bits (1181), Expect = e-126, Method: Compositional matrix adjust.
Identities = 272/712 (38%), Positives = 376/712 (52%), Gaps = 90/712 (12%)
Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN 188
MK+F+ I++ +K+ LFASQGGPIILAQ+ENEY ++E A+ G Y+ WAA A+ N
Sbjct: 426 MKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWAAKMAIATN 485
Query: 189 TSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFLSFGYAVPF 245
T VPW+MC+Q AP +I TCNG +C G T P KP++WTEN++ + FG
Sbjct: 486 TGVPWIMCKQTKAPGEVIPTCNGRHC-GDTWPGPADKKKPLLWTENWTAQYRVFGDPPSQ 544
Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQP 305
R ED+AF+VARFF GGT NYYMY GGTNFGR ++ YD +AP+DE+G ++P
Sbjct: 545 RSAEDIAFSVARFFSVGGTMANYYMYHGGTNFGRNGAAFVMPRYYD-EAPLDEFGLYKEP 603
Query: 306 KWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYH-KSSNDCAAFLANYDSSSDAN 364
KWGHLR+LH A++ C++ L+ +P+ Q LG EA ++ K N C AFL+N+++ D
Sbjct: 604 KWGHLRDLHHALRHCKKALLWGNPSVQPLGKLYEARVFEMKEKNVCVAFLSNHNTKEDGT 663
Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQ---QKNVNELLLASSA 421
VTF G YF+ S+SIL DCK VVF+T V SQ N FA Q NV E+
Sbjct: 664 VTFRGQKYFVARRSISILADCKTVVFSTQHVNSQHNQRTFHFADQTVQDNVWEM------ 717
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGH 481
+ EEK+ S EQ N TKD +DYLWYT S L + L +
Sbjct: 718 --YSEEKIPRYSKTSIRTQRPLEQYNQTKDKTDYLWYTTSFR----------LETDDLPY 765
Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
V L G G +F + K ++L G+N + ILS +GL + G++ + AG
Sbjct: 766 RKE--VKPVLEGAGTGRRSTRSFTMEKAMDLKVGVNHVAILSSTLGLMDSGSYLEHRMAG 823
Query: 542 LFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWY 601
+++V + L G DL++ W G G D N+ L WY
Sbjct: 824 VYTVTIRGLNTGTLDLTTNGW-------GHVPGKD------------------NQPLTWY 858
Query: 602 KTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASK 661
+ F P G P+ ++L MGKG +VNG+ +GRYW +Y
Sbjct: 859 RRRFDPPSGTDPVVIDLTPMGKGFLFVNGEGLGRYWVSY--------------------- 897
Query: 662 CQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPP 721
G+P+Q LYH+PR+ + P N L+ EE GG P I +LT +IC+F++E +P
Sbjct: 898 -HHALGKPSQYLYHVPRSLLRPKGNTLMFFEEEGGKPDAIMILTVKRDNICTFMTEKNPA 956
Query: 722 PVD-SWKPN-----------LGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
V SW+ G P L+C I ++ FASYG P G CG++
Sbjct: 957 HVRWSWESKDSQPKAVAGAGAGAGGLKPTAVLSCPTKKTIQSVVFASYGNPLGICGNYTV 1016
Query: 770 GACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G+CH +V+KAC+G+ CS+ VSS G CPG LAV+A CS
Sbjct: 1017 GSCHAPRTKEVVEKACIGRKTCSLVVSSEVYGGDV-HCPGTTGTLAVQAKCS 1067
Score = 357 bits (915), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 190/427 (44%), Positives = 250/427 (58%), Gaps = 75/427 (17%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+TYD R+L+IDG R + SGSIHYPRS P+ WP+LI K+KEGGL VIE+YVFWN HEP +
Sbjct: 33 ITYDRRSLIIDGHREIFFSGSIHYPRSPPDTWPDLISKAKEGGLNVIESYVFWNGHEPEQ 92
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHF----IPGIQFRT 120
G Y FEGR+DL++F K +QE ++ +RIGP+ AEWN+G H IP I FRT
Sbjct: 93 GVYNFEGRYDLIKFFKLIQEKEMYAIVRIGPFVQAEWNHG---FVCHIGSGEIPDIIFRT 149
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK+ MK+F+ I++ +K+ LFASQGGPIILAQ+ENEY ++E A+ G Y+ WA
Sbjct: 150 NNEPFKKYMKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWA 209
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFL 237
A A+ NT VPW+MC+Q AP +I TCNG +C G T P KP++WTEN++ +
Sbjct: 210 AKMAIATNTGVPWIMCKQTKAPGEVIPTCNGRHC-GDTWPGPADKKKPLLWTENWTAQYR 268
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYM--------------------------- 270
FG R ED+AF+VARFF GGT NYYM
Sbjct: 269 VFGDPPSQRSAEDIAFSVARFFSVGGTMANYYMVVLNSNSNLFLTKKRDEISDRTDTGGF 328
Query: 271 -------YFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEY 323
Y GGTNFGR ++ YD +AP+DE+G ++PKWGHLR+LH A++ C++
Sbjct: 329 TCVNNQQYHGGTNFGRNGAAFVMPRYYD-EAPLDEFGLYKEPKWGHLRDLHHALRHCKKA 387
Query: 324 LISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILP 383
L+ +P+ Q LG KL G YF+ S+SIL
Sbjct: 388 LLWGNPSVQPLG-KLT----------------------------RGQKYFVARRSISILA 418
Query: 384 DCKNVVF 390
DCK V +
Sbjct: 419 DCKTVKY 425
>gi|238009746|gb|ACR35908.1| unknown [Zea mays]
Length = 346
Score = 457 bits (1177), Expect = e-126, Method: Compositional matrix adjust.
Identities = 201/295 (68%), Positives = 245/295 (83%)
Query: 6 TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
TYD +A+V++G+RR+L SGSIHYPRS PE+WP+LI+K+K+GGL+V++TYVFWN HEP R
Sbjct: 30 TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
QYYFEGR+DLV F+K V++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI RT N PF
Sbjct: 90 QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISLRTDNEPF 149
Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
K EM+ F KI+D+MK E LF QGGPIIL+Q+ENE+G +EW G + Y WAA+ AV
Sbjct: 150 KAEMQNFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 209
Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
LNTSVPWVMC+++DAPDPIINTCNGFYCD F+PN P KP MWTE ++ W+ FG VP
Sbjct: 210 ALNTSVPWVMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPH 269
Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
RPVEDLA+ VA+F + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG
Sbjct: 270 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYG 324
>gi|66808929|ref|XP_638187.1| glycoside hydrolase family 35 protein [Dictyostelium discoideum
AX4]
gi|74853739|sp|Q54MV6.1|BGAL2_DICDI RecName: Full=Probable beta-galactosidase 2; Short=Lactase 2;
Flags: Precursor
gi|60466604|gb|EAL64656.1| glycoside hydrolase family 35 protein [Dictyostelium discoideum
AX4]
Length = 761
Score = 457 bits (1176), Expect = e-125, Method: Compositional matrix adjust.
Identities = 271/741 (36%), Positives = 420/741 (56%), Gaps = 62/741 (8%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD R+L+I+G+R++L SGSIHYPR++ E+WP ++++SK+ G+++I+TY+FWN H+P
Sbjct: 40 VTYDGRSLIINGERKLLFSGSIHYPRTSEEMWPIILKQSKDAGIDIIDTYIFWNIHQPNS 99
Query: 65 -GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+YYF+G ++ +F+ +E L+++LRIGPY CAEW YGGFP+WL IP I +R N
Sbjct: 100 PSEYYFDGNANITKFLDLCKEFDLYVNLRIGPYVCAEWTYGGFPIWLKEIPNIVYRDYNQ 159
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
+ EM ++ ++ + +N FA GGPIILAQVENEYG +E YG+ G Y KW+ D
Sbjct: 160 QWMNEMSIWMEFVVKYL--DNYFAPNGGPIILAQVENEYGWLEQEYGINGTEYAKWSIDF 217
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNS----PSKPIMWTENYSGWFLSF 239
A +LN +PW+MCQQ D + INTCNG+YC + + P++P WTEN+ GWF ++
Sbjct: 218 AKSLNIGIPWIMCQQNDI-ESAINTCNGYYCHDWISSHWEQFPNQPSFWTENWIGWFENW 276
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
G A P RPV+D+ ++ ARF GG+ NYYM+FGGTNFGRT+GGP + TSYDYDAP+DE+
Sbjct: 277 GQAKPKRPVQDILYSNARFIAYGGSLINYYMWFGGTNFGRTSGGPWIITSYDYDAPLDEF 336
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQK--LGAKLEAHIYHKSSNDCAAFLANY 357
G +PK+ + H+ + E L+++ P L +E H Y + +F+ NY
Sbjct: 337 GQPNEPKFSLSSKFHQVLHAIESDLLNNQPPKSPTFLSQFIEVHQYGIN----LSFITNY 392
Query: 358 DSSSDANVT-FNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELL 416
+S+ + + Y + WSV I+ + + ++F+T+ + ++ K +N+ +
Sbjct: 393 GTSTTPKIIQWMNQTYTIQPWSVLIIYNNE-ILFDTSFIPPNTLFNNNTINNFKPINQNI 451
Query: 417 LAS----SAF---SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQG 469
+ S S F S G + + V P EQ+ TKDTSDY WY+ ++
Sbjct: 452 IQSIFQISDFNLNSGGGGGDGDGNSVNSVSP--IEQLLITKDTSDYCWYSTNVTTTSLSY 509
Query: 470 KE---VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINT----LDIL 522
E +FL I +F++ + Y F+ L +++LN N+ L IL
Sbjct: 510 NEKGNIFLTITEFYDYVHIFIDNE-----YQGSAFSPSLC--QLQLNPINNSTTFQLQIL 562
Query: 523 SMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLAN 582
SM +GL+NY + + G+ ILI G ++L++ +W+ + G+ GE I + + N
Sbjct: 563 SMTIGLENYASHMENYTRGILGSILI----GSQNLTNNQWLMKSGLIGENIKI--FNNDN 616
Query: 583 SSFWKQGSTLP----VNKSLIWYKTTF----LAPEGKGPL-ALNLASMGKGQAWVNGQSI 633
+ W+ + + K L WYK L + + AL+++SM KG WVNG SI
Sbjct: 617 TINWQTSPSSSSSSLIQKPLTWYKLNISLVGLPIDISSTVYALDMSSMNKGMIWVNGYSI 676
Query: 634 GRYWSAYLAPST---GCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGE----- 685
GRYW S + Y G YD S + C +P+Q++Y +P W+
Sbjct: 677 GRYWLIEATQSICNQSAIENYSYIGEYDPSNYRIDCNKPSQSIYSVPIDWLFNNNYNNQY 736
Query: 686 NLLVIHEELGGDPSKISLLTK 706
++I EEL G+P++I LL+
Sbjct: 737 ATIIIIEELNGNPNEIQLLSN 757
>gi|449468694|ref|XP_004152056.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 338
Score = 457 bits (1175), Expect = e-125, Method: Compositional matrix adjust.
Identities = 201/321 (62%), Positives = 255/321 (79%), Gaps = 1/321 (0%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ NV+YD AL+I+G+RR++ SGSIHYPRST +WP+LI+K+K+GGL+ IETY+FW+ H
Sbjct: 18 IGDNVSYDSNALIINGERRIIFSGSIHYPRSTEAMWPDLIQKAKDGGLDAIETYIFWDRH 77
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP R +Y F GR D ++F + +Q+AGL++ +RIGPY CAEWNYGGFPVWLH +PGIQ RT
Sbjct: 78 EPQRRKYDFSGRLDFIKFFQLIQDAGLYVVMRIGPYVCAEWNYGGFPVWLHNMPGIQLRT 137
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEW-AYGVGGELYVKW 179
N +K EM+ F KI+++ KQ NLFASQGGPIILAQ+ENEYGNV AYG G+ Y+ W
Sbjct: 138 NNQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPAYGDAGKAYINW 197
Query: 180 AADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSF 239
A A +LN VPW+MCQQ DAP P+INTCNGFYCD FTPN+P P M+TEN+ GWF +
Sbjct: 198 CAQMAESLNIGVPWIMCQQSDAPQPMINTCNGFYCDNFTPNNPKSPKMFTENWVGWFKKW 257
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
G P+R ED+AF+VARFF++GG F NYYMY GGTNFGRT+GGP + TSYDY+AP+DEY
Sbjct: 258 GDKDPYRTAEDVAFSVARFFQSGGVFNNYYMYHGGTNFGRTSGGPFITTSYDYNAPLDEY 317
Query: 300 GFIRQPKWGHLRELHKAIKLC 320
G + QPKWGHL++LH +I +C
Sbjct: 318 GNLNQPKWGHLKQLHASIXIC 338
>gi|413954365|gb|AFW87014.1| beta-galactosidase [Zea mays]
Length = 473
Score = 456 bits (1174), Expect = e-125, Method: Compositional matrix adjust.
Identities = 237/489 (48%), Positives = 309/489 (63%), Gaps = 25/489 (5%)
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV 286
MWTE ++GWF +FG AVP RPVED+AFAVARF + GG+F NYYMY GGTNF RT+GGP +
Sbjct: 1 MWTEAWTGWFTAFGGAVPHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFI 60
Query: 287 ATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKS 346
ATSYDYDAPIDEYG +RQPKWGHLR+LHKAIK E L+S DPT Q LG +A+++ S
Sbjct: 61 ATSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSS 120
Query: 347 SNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPF 406
CAAFL+NY +S+ A V FNG Y LPAWS+S+LPDCK VFNTA V P
Sbjct: 121 GGACAAFLSNYHTSAAARVVFNGRRYDLPAWSISVLPDCKAAVFNTATV-------SEPS 173
Query: 407 AQQKNVNELLLASSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV 464
A + + + FSW Y E R+F + L EQ++ T D SDYLWYT +++
Sbjct: 174 APAR-----MSPAGGFSWQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNI 228
Query: 465 MPGQ-----GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTL 519
+ G+ L I S GH+ VFVN + YG +D + +++ +G N +
Sbjct: 229 NSNEQFLKSGQWPQLTIYSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKI 288
Query: 520 DILSMMVGLQNYGAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKI 578
ILS VGL N G ++ G+ V L L GKRDLS +W YQ+G+ GE +G+ +
Sbjct: 289 SILSAAVGLPNQGTHYETWNVGVLGPVTLSGLNEGKRDLSDQKWTYQIGLHGESLGVQSV 348
Query: 579 SLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
+ ++S W + + L W+K F AP G P+AL++ SMGKGQAWVNG+ IGRYWS
Sbjct: 349 AGSSSVEWGSAAG---KQPLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYWS 405
Query: 639 AYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDP 698
Y A S+GC C Y G+Y +KCQ CG +Q YH+PR+W++P NLLV+ EE GGD
Sbjct: 406 -YKASSSGC-GGCSYAGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVMLEEFGGDL 463
Query: 699 SKISLLTKT 707
S + L+T+T
Sbjct: 464 SGVKLVTRT 472
>gi|255550371|ref|XP_002516236.1| beta-galactosidase, putative [Ricinus communis]
gi|223544722|gb|EEF46238.1| beta-galactosidase, putative [Ricinus communis]
Length = 775
Score = 452 bits (1162), Expect = e-124, Method: Compositional matrix adjust.
Identities = 259/628 (41%), Positives = 359/628 (57%), Gaps = 43/628 (6%)
Query: 206 INTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTF 265
INTCNG+YCD F PN+P P M+TEN+SGW+ +G +R ED+AF+VARF + GG F
Sbjct: 164 INTCNGYYCDTFKPNNPKSPKMFTENWSGWYKLWGGKTSYRTAEDMAFSVARFVQAGGVF 223
Query: 266 QNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLI 325
NYYMY+GGTNFGRTAGGP + SYDYD+P+DEYG + QPKWGHL++LH +IKL E+ +
Sbjct: 224 NNYYMYYGGTNFGRTAGGPYITASYDYDSPLDEYGNLNQPKWGHLKQLHASIKLGEKIIT 283
Query: 326 SSDPTHQKLGAKLEAHIY-HKSSNDCAAFLANYDSSSDANVTF--NGNVYFLPAWSVSIL 382
+ T + A ++ Y + ++ + FL+N + +DA++ +GN Y +PAWSVSIL
Sbjct: 284 NGTVTIKNFQAGVDLTAYTNNATRERFCFLSNIN-IADAHIDLQQDGN-YTIPAWSVSIL 341
Query: 383 PDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEE--KVGISGNRSFVRP 440
+C +FNTAKV +Q + + L ++ W E K + G F
Sbjct: 342 QNCSKEIFNTAKVNTQTSLMVKKLYENDKPTNL-----SWVWAPEPMKDTLLGKGRFRTS 396
Query: 441 DLAEQINTTKDTSDYLWYTASIHVMPG--QGKEVFLNIESLGHAALVFVNKKLVAFGYGN 498
L +Q TT D SDYLWY S + Q V L + S GH +VNKKL+ G
Sbjct: 397 QLLDQKETTVDASDYLWYMTSFDMNKNTLQWTNVTLRVTSRGHVLHAYVNKKLIV-GSQL 455
Query: 499 HDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGK--RD 556
F K + L G N + +LS VGL NYG++FD G+ + + NGK D
Sbjct: 456 VIQGEFTFEKPVTLKPGNNVISLLSATVGLANYGSFFDKTPVGIVDGPVQLMANGKPVMD 515
Query: 557 LSSGEWIYQVGVEGEYIGL-DKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLA 615
LSS W Y++G+ GE D S N W + + + + WYKTTF +P G P+
Sbjct: 516 LSSNLWSYKIGLNGEAKRFYDPTSRHNK--WSAANGVSTARPMTWYKTTFSSPSGTDPVV 573
Query: 616 LNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYH 675
++L MGKG AW NG+S+GRYW + +A + GC+ CDYRG Y+A KC ++CG P Q YH
Sbjct: 574 VDLQGMGKGHAWANGKSLGRYWPSQIANANGCSGTCDYRGPYNAGKCTRNCGIPTQRWYH 633
Query: 676 IPRTWVHP-GENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVS 734
+PR++++ G+N L++ EE+GGDPS IS T + IC E
Sbjct: 634 VPRSFLNSNGKNTLILFEEVGGDPSGISFQIVTTETICGNAYEGS--------------- 678
Query: 735 SSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIP 793
+ L+C+ G I+ I FASYG P+G C SF+ G+ M+ + +VQK CVG+ CSI
Sbjct: 679 ---TLELSCQGGRTISEIQFASYGNPQGTCSSFKKGSFDAMNSVQMVQKECVGKDSCSII 735
Query: 794 VSSAYLGVSAGACPGLL-KALAVEAHCS 820
S V+ G+ K LAV+AHCS
Sbjct: 736 ASDETFMVNEPQ--GISNKRLAVQAHCS 761
Score = 187 bits (474), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 78/133 (58%), Positives = 103/133 (77%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ V YD AL+I+G+R+++ SG+IHYPRSTPE+WPELI K+K+GGL+ IETYVFW+ HE
Sbjct: 22 ATTVEYDSNALIINGERKIIFSGAIHYPRSTPEMWPELINKAKDGGLDAIETYVFWDRHE 81
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P+R QY F G D+V+F + +QEAGL++ LRIGPY CAEWNYGGFP+WLH PG++ RT
Sbjct: 82 PVRRQYDFSGNLDIVKFFRVIQEAGLYVILRIGPYVCAEWNYGGFPMWLHNTPGVELRTD 141
Query: 122 NNPFKEEMKRFLA 134
N +K + F
Sbjct: 142 NEIYKVPLLIFFV 154
>gi|414881559|tpg|DAA58690.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 342
Score = 449 bits (1154), Expect = e-123, Method: Compositional matrix adjust.
Identities = 200/295 (67%), Positives = 243/295 (82%), Gaps = 4/295 (1%)
Query: 6 TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
TYD +A+V++G+RR+L SGSIHYPRS PE+WP+LI+K+K+GGL+V++TYVFWN HEP R
Sbjct: 30 TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
QYYFEGR+DLV F+K V++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT N PF
Sbjct: 90 QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPF 149
Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
K F KI+D+MK E LF QGGPIIL+Q+ENE+G +EW G + Y WAA+ AV
Sbjct: 150 KN----FTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 205
Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
LNTSVPWVMC+++DAPDPIINTCNGFYCD F+PN P KP MWTE ++ W+ FG VP
Sbjct: 206 ALNTSVPWVMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPH 265
Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
RPVEDLA+ VA+F + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG
Sbjct: 266 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYG 320
>gi|3850659|emb|CAA10064.1| beta galactosidase [Carica papaya]
Length = 347
Score = 436 bits (1122), Expect = e-119, Method: Compositional matrix adjust.
Identities = 209/358 (58%), Positives = 253/358 (70%), Gaps = 15/358 (4%)
Query: 104 GGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYG 163
GGFPVWL ++PGI FRT N PFK M++F KI+ +MK E LF +QGGPIIL+Q+ENE+G
Sbjct: 1 GGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQTQGGPIILSQIENEFG 60
Query: 164 NVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPS 223
VEW G G+ Y KWAA AV L+T VPW+MC+QEDAPDP+I+TCNGFYC+ F PN
Sbjct: 61 PVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDY 120
Query: 224 KPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG 283
KP MWTE ++GW+ FG AVP RP ED+AF+VARF + GG+F NYYMY GGTNFGRTAGG
Sbjct: 121 KPKMWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQGGGSFLNYYMYHGGTNFGRTAGG 180
Query: 284 PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY 343
P +ATSYDYDAP+DEYG R+PKWGHLR+LHKAIK CE L+S DP+ KLG+ EAH++
Sbjct: 181 PFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKSCESALVSVDPSVTKLGSNQEAHVF 240
Query: 344 HKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGD 403
KS +DCAAFLANYD+ V+F G Y LP WS+SILPDCK V+NTAKV SQ +
Sbjct: 241 -KSESDCAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGSQSSQ-- 297
Query: 404 HPFAQQKNVNELLLASSAFSWYE---EKVGISGNRSFVRPDLAEQINTTKDTSDYLWY 458
++ S F W E + L EQIN T+DT+DYLWY
Sbjct: 298 ---------VQMTPVHSGFPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWY 346
>gi|328873276|gb|EGG21643.1| hypothetical protein DFA_01529 [Dictyostelium fasciculatum]
Length = 827
Score = 436 bits (1121), Expect = e-119, Method: Compositional matrix adjust.
Identities = 262/725 (36%), Positives = 389/725 (53%), Gaps = 43/725 (5%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V+YD+RA++I+G+R++L S SIHYPRST +WP++++++K G+ IETY+FWN H+P
Sbjct: 32 VSYDNRAIIINGERKLLYSASIHYPRSTRTMWPDILKRTKAAGINTIETYIFWNLHQPTP 91
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
Y FEG D+ F+ +E G + +R GPY CAEWN GG P WL +PGI +RT N P
Sbjct: 92 DTYDFEGSSDVKHFLDLCKEEGFHVIVRFGPYVCAEWNNGGLPSWLKAVPGIVYRTHNEP 151
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYG-VGGELYVKWAADT 183
F EMK+++ I+ + + +A GGPII+AQ+ENEYG +E+ Y GG YV WA
Sbjct: 152 FMREMKKWMDYIVHYLS--DYYAPNGGPIIMAQIENEYGWLEYEYREQGGPEYVDWAVKL 209
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTP----NSPSKPIMWTENYSGWFLSF 239
A + NT +PW+MCQQ D +INTCNGFYC + P +P +TE ++GW F
Sbjct: 210 AKSYNTGIPWIMCQQNTRSD-VINTCNGFYCHDWLQYHQRTFPDQPAFFTELWTGWPQYF 268
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
P RP D+ ++ ARF+ GG NYYM+ GGT FGR P + TSYDYDAP+DEY
Sbjct: 269 EEGFPTRPTVDVLYSAARFYSRGGGMVNYYMWHGGTTFGRFT-SPFLTTSYDYDAPLDEY 327
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSD--PTHQKLGAKLEAHIYHKSSNDCAAFLANY 357
GF ++PK+ L +LH ++ ++ P I +K + FL N+
Sbjct: 328 GFPQEPKYSMLTKLHVTLEKYSSVILHDPNVPPPYVFPDNTVEMIEYKKDAESVVFLVNW 387
Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPF---------AQ 408
D + V NG + WSV I + +VF+T ++ + + PF A
Sbjct: 388 DDTFAKQVDMNGKNVKINQWSVQIYYN-NELVFDTFEIPANLTRPNPPFKPIAKTSLDAT 446
Query: 409 QKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ 468
+ L + SW E ++ N S P Q+ T D SDY+WY I + +
Sbjct: 447 AAATSRTGLVNLVSSWNEPFSFLTYNASSQTP--TAQLKLTGDNSDYIWYETEIDLT--K 502
Query: 469 GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGL 528
E+ +S + VFV+ + + + G+ A F N K + G +TL IL +G+
Sbjct: 503 TDEILYLYKSYDF-SYVFVDGQFLYWHRGSPIQAYF--NGKFPV--GKHTLQILCAAMGV 557
Query: 529 QNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQ 588
+YGA + GL I + G ++++ W + + GE +GL + ++ W
Sbjct: 558 PSYGAHIEQHERGLTGDIFL----GSKNITDNGWKMRPFLSGELLGLH--ASPSTVKWSP 611
Query: 589 GSTLPVNKSLIWYKTTFLAPEGK-GP-LALNLASMGKGQAWVNGQSIGRYWSAYLAPSTG 646
S + WYK P + GP AL+L SM KG +VNG SIGRYW A
Sbjct: 612 VSKGTAGSGVTWYKFNVKTPSFEDGPAFALDLKSMWKGLVFVNGNSIGRYWVA----KGW 667
Query: 647 CTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWV-HPGENLLVIHEELGGDPSKISLLT 705
C +KC+ G YD C+++CG+ +Q YH+P+ ++ +N ++I EEL GDP I L+
Sbjct: 668 CEEKCNQTGLYDNYGCRENCGESSQRYYHVPKDFLKESSDNEVIIFEELQGDPYSIELVQ 727
Query: 706 KTGQH 710
+ ++
Sbjct: 728 RNTEY 732
>gi|217075793|gb|ACJ86256.1| unknown [Medicago truncatula]
Length = 268
Score = 419 bits (1078), Expect = e-114, Method: Compositional matrix adjust.
Identities = 192/251 (76%), Positives = 218/251 (86%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
NV YDHRALVIDGKRRVL SGSIHYPRSTP++WP+LI+KSK+GGL+VIETYVFWN H
Sbjct: 18 FCTNVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSKDGGLDVIETYVFWNLH 77
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP++GQY F+GR DLV+FVK V EAGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+FRT
Sbjct: 78 EPVKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRT 137
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK EMKRF AKI+DLMKQE L+ASQGGPIIL+Q+ENEYGN++ YG G+ Y+ WA
Sbjct: 138 DNEPFKAEMKRFTAKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSHYGSAGKSYINWA 197
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A +L+T VPWVMCQQ DAPDPIINTCNGFYCD FTPNS +KP MWTEN+SGWFLSFG
Sbjct: 198 AKMATSLDTGVPWVMCQQGDAPDPIINTCNGFYCDQFTPNSNTKPKMWTENWSGWFLSFG 257
Query: 241 YAVPFRPVEDL 251
AVP RPVE L
Sbjct: 258 GAVPHRPVEIL 268
>gi|281209972|gb|EFA84140.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
PN500]
Length = 707
Score = 409 bits (1052), Expect = e-111, Method: Compositional matrix adjust.
Identities = 244/627 (38%), Positives = 364/627 (58%), Gaps = 36/627 (5%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+S VTYD R+L+I+G+R++ SGS+HYPRSTP +W +++ SK G+ +I+TYVFW+ H
Sbjct: 104 VSYKVTYDGRSLLINGERKLFVSGSVHYPRSTPTIWKKVLALSKNSGINMIDTYVFWDLH 163
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP RG Y FEG +L F+ Q+ GLF++LRIGPY CAEWNYGG P+WL IPGI+ R
Sbjct: 164 EPQRGVYNFEGNANLKHFLDLCQQNGLFVNLRIGPYICAEWNYGGLPIWLKDIPGIKMRD 223
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N + EE++R++ I+D + FA QGGPI+LAQ+ENEY V+W Y G + W
Sbjct: 224 FNTQYMEEVERWMKFIVDYL--HGYFAPQGGPIVLAQIENEYNWVQWRYQESGRKFAHWC 281
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT----PNSPSKPIMWTENYSGWF 236
AD A L+ +PW+MCQQ+D P +INTCNG+YC + N +P ++TEN+SGWF
Sbjct: 282 ADLANRLDIGIPWIMCQQDDIPT-VINTCNGYYCHEWINFHWNNFKDQPPLFTENWSGWF 340
Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPI 296
++ AV RPV DL ++ AR+F +GG NYYM+ GGTNFGR + GP++A SYDYDAP+
Sbjct: 341 NNWVNAVRHRPVADLLYSAARWFASGGALMNYYMWHGGTNFGRKS-GPMIALSYDYDAPL 399
Query: 297 DEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLAN 356
+EYG R PK+ R+ +K I E+ L+S P A + I++++ N+ A+F+ N
Sbjct: 400 NEYGNPRNPKYSQTRDFNKLILSLEDILLSQYPPTPIFLANNISVIHYRNGNNSASFIIN 459
Query: 357 YDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELL 416
+ + ++ V F G YF A+SV IL + + VF++++ + RN D + N+
Sbjct: 460 SNENGNSKVMFEGRSYFSYAYSVQILKNYVS-VFDSSQ--NPRNYTDTVVESEPNIP--- 513
Query: 417 LASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASI-HVMPGQGKEVFLN 475
A+S S + E+ S L EQ+N TKD +DY+WYT I H G+ +V +N
Sbjct: 514 FANSIISKHVERFDFE--ESLYDNRLMEQLNLTKDETDYIWYTTMINHDQDGEILKV-IN 570
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
+ H VFV+ V ++ L + L G +TL +L +G+Q+Y
Sbjct: 571 KTDIVH---VFVDSYYVG-----TIMSDSLAITGVPL--GPSTLQLLHTKMGIQHYELHM 620
Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKIS---LANSSFWKQGSTL 592
+ AG+ + G ++++ W + V E + D I + S ++ + +
Sbjct: 621 ENTKAGILGPVYY----GDIEITNQMWGSKPFVSSEKVITDPIQSKFVRWSPLDRKPNEV 676
Query: 593 PVNKSLIWYK-TTFLAPEGKGPLALNL 618
+ L WYK F+ E K P +L L
Sbjct: 677 FYSVPLTWYKFIFFIDSEAKLPTSLAL 703
>gi|115480419|ref|NP_001063803.1| Os09g0539200 [Oryza sativa Japonica Group]
gi|113632036|dbj|BAF25717.1| Os09g0539200 [Oryza sativa Japonica Group]
Length = 446
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 197/393 (50%), Positives = 267/393 (67%), Gaps = 3/393 (0%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V+YD R+L+IDGKR + SG+IHYPRS PE+W +L++ +K GGL IETYVFWN HEP
Sbjct: 36 VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWNGHEPEP 95
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G+YYFEGRFDL+RF+ +++ ++ +RIGP+ AEWN+GG P WL I I FR N P
Sbjct: 96 GKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHIIFRANNEP 155
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FK EM++F+ I+ +K +FA QGGPIIL+Q+ENEYGN++ V G+ Y++WAA+ A
Sbjct: 156 FKREMEKFVRFIVQKLKDAEMFAPQGGPIILSQIENEYGNIKKDRKVEGDKYLEWAAEMA 215
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
++ VPWVMC+Q AP +I TCNG +C D +T +KP +WTEN++ F +FG +
Sbjct: 216 ISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDKNKPRLWTENWTAQFRTFGDQL 275
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
R ED+A+AV RFF GGT NYYMY GGTNFGRT G V T Y +AP+DEYG +
Sbjct: 276 AQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMCK 334
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSSD 362
+PK+GHLR+LH IK + + + + LG EAH Y + C +FL+N ++ D
Sbjct: 335 EPKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSNNNTGED 394
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKV 395
V F G +++P+ SVSIL DCK VV+NT +V
Sbjct: 395 GTVVFRGEKFYVPSRSVSILADCKTVVYNTKRV 427
>gi|195615772|gb|ACG29716.1| beta-galactosidase precursor [Zea mays]
Length = 450
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 217/465 (46%), Positives = 288/465 (61%), Gaps = 24/465 (5%)
Query: 251 LAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHL 310
+AFAVARF + GG+F NYYMY GGTNF RT+GGP +ATSYDYDAPIDEYG +RQPKWGHL
Sbjct: 1 MAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGLLRQPKWGHL 60
Query: 311 RELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGN 370
R+LHKAIK E L+S DPT Q LG +A+++ S CAAFL+NY +S+ A V FNG
Sbjct: 61 RDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSSGGACAAFLSNYHTSAAARVVFNGR 120
Query: 371 VYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW--YEEK 428
Y LPAWS+S+LPDCK VFNTA V P A + + + FSW Y E
Sbjct: 121 RYDLPAWSISVLPDCKAAVFNTATV-------SEPSAPAR-----MSPAGGFSWQSYSEA 168
Query: 429 VGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESLGHAA 483
R+F + L EQ++ T D SDYLWYT +++ + G+ L + S GH+
Sbjct: 169 TNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQLTVYSAGHSL 228
Query: 484 LVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLF 543
VFVN + YG +D + +++ +G N + ILS VGL N G ++ G+
Sbjct: 229 QVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGTHYETWNVGVL 288
Query: 544 S-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYK 602
V L L GKRDLS+ +W YQ+G+ GE +G+ ++ ++S W + + L W+K
Sbjct: 289 GPVTLSGLNEGKRDLSNQKWTYQIGLHGESLGVQSVAGSSSVEWGSAAG---KQPLTWHK 345
Query: 603 TTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKC 662
F AP G P+AL++ SMGKGQAWVNG+ IGRYWS Y A S+G C Y G+Y +KC
Sbjct: 346 AYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYWS-YKASSSGGCGGCSYAGTYSETKC 404
Query: 663 QKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
Q CG +Q YH+PR+W++P NLLV+ EE GGD + L+T+T
Sbjct: 405 QTGCGDVSQRYYHVPRSWLNPSGNLLVLLEEFGGDLPGVKLVTRT 449
>gi|414881560|tpg|DAA58691.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 655
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 230/547 (42%), Positives = 311/547 (56%), Gaps = 34/547 (6%)
Query: 283 GPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHI 342
G V Y D + G +R+PKWGHL+ELHKAIKLCE L++ DP LG +A +
Sbjct: 132 GADVQMPYRLDHILVADGLLREPKWGHLKELHKAIKLCEPALVAGDPIVTSLGNAQQASV 191
Query: 343 YHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNG 402
+ S++ C AFL N D S A V+FNG Y LP WS+SILPDCK V+NTA V SQ
Sbjct: 192 FRSSTDACVAFLENKDKVSYARVSFNGMHYDLPPWSISILPDCKTTVYNTASVGSQ---- 247
Query: 403 DHPFAQQKNVNELLLASSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTA 460
+Q K + + F+W Y E + G+ SF L EQIN T+D +DYLWYT
Sbjct: 248 ---ISQMK-----MEWAGGFTWQSYNEDINSLGDESFATVGLLEQINVTRDNTDYLWYTT 299
Query: 461 SIHVMPGQ-----GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEG 515
+ + + GK L + S GHA +FVN +L YG+ + + ++L G
Sbjct: 300 YVDIAQDEQFLSNGKNPMLTVMSAGHALHIFVNGQLTGTVYGSVEDPKLTYSGNVKLWSG 359
Query: 516 INTLDILSMMVGLQNYGAWFDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIG 574
NT+ LS+ VGL N G F+ AG+ + +D L G+RDL+ +W Y+VG++GE +
Sbjct: 360 SNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNEGRRDLTWQKWTYKVGLKGEALS 419
Query: 575 LDKISLANSSFWKQGSTLPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSI 633
L +S ++S W + PV K L WYK F AP+G PLAL+++SMGKGQ W+NGQ I
Sbjct: 420 LHSLSGSSSVEWGE----PVQKQPLSWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGI 475
Query: 634 GRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEE 693
GRYW Y A +G CDYRG YD KCQ +CG +Q YH+PR+W++P NLLVI EE
Sbjct: 476 GRYWPGYKA--SGTCGICDYRGEYDEKKCQTNCGDSSQRWYHVPRSWLNPTGNLLVIFEE 533
Query: 694 LGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAIN 753
GGDP+ IS++ + IC+ VSE P + +W+ +V L C+ G + I
Sbjct: 534 WGGDPTGISMVKRIAGSICADVSEWQ-PSMANWRTK---GYEKAKVHLQCDHGRKMTHIK 589
Query: 754 FASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKA 812
FAS+G P+G+CGS+ G CH I K+C+GQ C + V G CPG +K
Sbjct: 590 FASFGTPQGSCGSYSEGGCHAHKSYDIFWKSCIGQERCGVSVVPDAFG--GDPCPGTMKR 647
Query: 813 LAVEAHC 819
VEA C
Sbjct: 648 AVVEAIC 654
>gi|34481809|emb|CAD44190.1| putative beta-galactosidase [Mangifera indica]
gi|34481811|emb|CAD44191.1| putative beta-galactosidase [Mangifera indica]
Length = 286
Score = 400 bits (1027), Expect = e-108, Method: Compositional matrix adjust.
Identities = 180/286 (62%), Positives = 217/286 (75%)
Query: 100 EWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVE 159
EWN+GGFPVWL F+PGI FRT N PFK M+ F KI+ +MK E LF SQGGPIIL+Q+E
Sbjct: 1 EWNFGGFPVWLKFVPGISFRTDNEPFKRAMQNFTQKIVQMMKDEKLFESQGGPIILSQIE 60
Query: 160 NEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTP 219
NEY +G GE Y+ WAA A LNT VPWVMC++ DAPDP+INTCNGFYCD F+P
Sbjct: 61 NEYEPERMKFGSAGEAYMNWAAQMATGLNTGVPWVMCKEYDAPDPVINTCNGFYCDKFSP 120
Query: 220 NSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR 279
N P KP +WTE ++GWF FG + RPVEDLAFAVARF + GG+F NYYMY GGTNFGR
Sbjct: 121 NKPFKPKLWTEAWTGWFTEFGGPIYQRPVEDLAFAVARFIQAGGSFVNYYMYHGGTNFGR 180
Query: 280 TAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLE 339
TAGGP + TSYDYDAPIDEYG IR+PK+ HL+ELH+A+KLCE L+ +DP LG +
Sbjct: 181 TAGGPFITTSYDYDAPIDEYGLIRRPKYDHLKELHQAVKLCETALLYADPYVMSLGNYEQ 240
Query: 340 AHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDC 385
AH++ +S CAAFL+N++S S A VTFN ++LP WS+SILPDC
Sbjct: 241 AHVFSSTSGGCAAFLSNFNSKSSARVTFNRKHFYLPPWSISILPDC 286
>gi|34481839|emb|CAD44519.1| putative beta-galactosidase [Carica papaya]
Length = 285
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 179/286 (62%), Positives = 218/286 (76%), Gaps = 1/286 (0%)
Query: 100 EWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVE 159
EWN+GGFPVWL ++PGIQFRT N PFK +M++F KI+++MK E LF Q GPII++Q+E
Sbjct: 1 EWNFGGFPVWLKYVPGIQFRTDNGPFKAQMQKFTEKIVNMMKAEKLFEPQEGPIIMSQIE 60
Query: 160 NEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTP 219
NEYG +EW G G+ Y KWAA AV L T VPW+MC+QEDAPDPII+TCNGFYC+ F P
Sbjct: 61 NEYGPIEWEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQEDAPDPIIDTCNGFYCENFMP 120
Query: 220 NSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR 279
N+ KP M+TE ++GW+ FG VP+RP ED+A++VARF + G+F NYYMY GGTNFGR
Sbjct: 121 NANYKPKMFTEAWTGWYTEFGGPVPYRPAEDMAYSVARFIQNRGSFINYYMYHGGTNFGR 180
Query: 280 TAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLE 339
TAGGP +ATSYDYDAP+DEYG R+PKWGHLR+LHK IKLCE L+S DP LG+ E
Sbjct: 181 TAGGPFIATSYDYDAPLDEYGLGREPKWGHLRDLHKTIKLCEPSLVSVDPKVTSLGSNQE 240
Query: 340 AHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDC 385
AH++ ++ CAAFLANYD VTF Y LP WSVSILPDC
Sbjct: 241 AHVFWTKTS-CAAFLANYDLKYSVRVTFQNLPYDLPPWSVSILPDC 285
>gi|359476803|ref|XP_003631891.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 11-like [Vitis
vinifera]
Length = 722
Score = 390 bits (1003), Expect = e-105, Method: Compositional matrix adjust.
Identities = 263/729 (36%), Positives = 366/729 (50%), Gaps = 124/729 (17%)
Query: 110 LHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAY 169
L+ I F + P ++ MKRF IID+M +E ASQGGPIILA V++ A+
Sbjct: 99 LNVIHTYAFWNLHEPVQDHMKRFTRMIIDMMSKEKXIASQGGPIILALVDSAI-----AF 153
Query: 170 GVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIM 227
G V WA AV L T +P VMC+Q+DAPDP+INTC G C D FT PN P+K +
Sbjct: 154 KEMGTRCVHWAGTMAVGLKTGIPXVMCKQKDAPDPVINTCKGRNCGDTFTGPNRPNKRSV 213
Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA 287
+ + G + FG R EDLAF+ F GT NYYMY+ TNFGRT
Sbjct: 214 -SNHXLGMYRVFGDPPSQRAAEDLAFSX--FISKNGTLANYYMYYSVTNFGRTTSS-FAT 269
Query: 288 TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-S 346
T Y +AP+DEYG R+ KWGHLR+LH A++L ++ L+ + QKLG LEA IY K
Sbjct: 270 TCYYDEAPLDEYGLPRETKWGHLRDLHAALRLSKKALLWGVTSAQKLGEDLEARIYEKPG 329
Query: 347 SNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPF 406
SN CA FL N + + T G+ Y+LP S+S LPDCK VVFNT V+SQ +
Sbjct: 330 SNICATFLLNNITRTPTTTTLRGSKYYLPQHSISNLPDCKTVVFNTQTVVSQ-------Y 382
Query: 407 AQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-- 464
+ KN+ + ++ A YEE + +S V E + TKDT+DYLWYT +I +
Sbjct: 383 SVNKNL-QWXMSQDALPTYEE--CPTKTKSPV-----ELMTMTKDTTDYLWYTTNIELAR 434
Query: 465 --MPGQGKEVFL--NIESLGHAALVFVNKKLVAF-----GYGNHDFANFLINKKIELNEG 515
+P + K+V + +LGH F+N + + F +G++ +F+ NK I L G
Sbjct: 435 TGLPFR-KDVLRVPQVSNLGHVMHAFLNGEYMEFYLTGTRHGSNVEKSFVFNKPITLKAG 493
Query: 516 INTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGL 575
+N + L VGL + G++ + AG+ +V + L DL W
Sbjct: 494 LNQIAPLGATVGLPDSGSYMEHRLAGVHNVAIQGLNTRTIDLPKNGW------------- 540
Query: 576 DKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGR 635
+K F APEG P+AL L++M KG AW+NG+SI
Sbjct: 541 ------------------------GHKAYFDAPEGDVPVALELSTMAKGMAWINGKSIDX 576
Query: 636 YWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
YW +YL+P G+P+Q++YH+PR ++ +NLLV+ EE G
Sbjct: 577 YWVSYLSP----------------------LGKPSQSVYHVPRAFLKTSDNLLVLFEETG 614
Query: 696 GDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFA 755
+P I +LT IC ++SE P V SWK A +
Sbjct: 615 RNPDGIEILTLNRDTICCYISEHHPTHVRSWKRE---------------------ASDIQ 653
Query: 756 SYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYL---GVSAGACPGLLK 811
+G P G C F PG C + +V+K C+G+ CSIPV + G+S G+ K
Sbjct: 654 IFGDPTGTCXEFIPGNCAAPNSXKVVEKHCLGKSSCSIPVEQEIVSKDGISISGS-GITK 712
Query: 812 ALAVEAHCS 820
ALAV+ C+
Sbjct: 713 ALAVQVLCA 721
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 37/60 (61%), Positives = 49/60 (81%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V+YD R L+++GKR +L SGSIHYPRS PE+WP++I K++ GGL VI TY FWN HEP++
Sbjct: 56 VSYDGRPLIVNGKRELLFSGSIHYPRSIPEMWPDIIXKARHGGLNVIHTYAFWNLHEPVQ 115
>gi|357483613|ref|XP_003612093.1| Beta-galactosidase [Medicago truncatula]
gi|355513428|gb|AES95051.1| Beta-galactosidase [Medicago truncatula]
Length = 504
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 223/516 (43%), Positives = 295/516 (57%), Gaps = 31/516 (6%)
Query: 319 LCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWS 378
+CE+ LIS+DP LG +A++Y S DC+AFL+NYDS S A V FN Y LP WS
Sbjct: 1 MCEKALISTDPVVTSLGNFQQAYVYTTESGDCSAFLSNYDSKSSARVMFNNMHYNLPPWS 60
Query: 379 VSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW--YEEKVGISGNRS 436
VSILPDC+N VFNTAKV Q + L S FSW +EE S +
Sbjct: 61 VSILPDCRNAVFNTAKV----------GVQTSQMQMLPTNSERFSWESFEEDTSSSSATT 110
Query: 437 FVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESLGHAALVFVNKKL 491
L EQIN T+DTSDYLWY S+ V + GK L ++S GHA VF+N +L
Sbjct: 111 ITASGLLEQINVTRDTSDYLWYITSVDVGSSESFLHGGKLPSLIVQSTGHAVHVFINGRL 170
Query: 492 VAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILID-L 550
YG + F + L G NT+ +LS+ VGL N G F+ G+ ++I L
Sbjct: 171 SGSAYGTREDRRFRYTGDVNLRAGTNTIALLSVAVGLPNVGGHFETWNTGILGPVVIHGL 230
Query: 551 KNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS-TLPVNKSLIWYKTTFLAPE 609
GK DLS +W YQVG++GE + L +S W Q + + N+ L W+KT F APE
Sbjct: 231 DKGKLDLSWQKWTYQVGLKGEAMNLASPDGISSVEWMQSAVVVQRNQPLTWHKTFFDAPE 290
Query: 610 GKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQP 669
G+ PLAL++ MGKGQ W+NG SIGRYW+A +TG C+Y GS+ KCQ CGQP
Sbjct: 291 GEEPLALDMDGMGKGQIWINGISIGRYWTAI---ATGSCNDCNYAGSFRPPKCQLGCGQP 347
Query: 670 AQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPP----PVDS 725
Q YH+PR+W+ NLLV+ EELGGDPSKISL ++ +C+ VSE P +DS
Sbjct: 348 TQRWYHVPRSWLKQNHNLLVVFEELGGDPSKISLAKRSVSSVCADVSEYHPNLKNWHIDS 407
Query: 726 WKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKAC 784
+ + P+V L C G I++I FAS+G P G CGS+ GACH I+++ C
Sbjct: 408 YGKSENF--RPPKVHLHCNPGQAISSIKFASFGTPLGTCGSYEQGACHSSSSYDILEQKC 465
Query: 785 VGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
+G+ C + VS++ G CP +LK L+VEA C+
Sbjct: 466 IGKPRCIVTVSNSNFG--RDPCPNVLKRLSVEAVCA 499
>gi|19386854|dbj|BAB86232.1| putative beta-D-galactosidase [Oryza sativa Japonica Group]
Length = 774
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 173/289 (59%), Positives = 213/289 (73%), Gaps = 20/289 (6%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+++VTYDHR+L+I G+RR+L S SIHYPRS PE+WP+L+ ++K+GG + +ETYVFWN HE
Sbjct: 35 NSSVTYDHRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHE 94
Query: 62 PIRGQ--------------------YYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEW 101
P +GQ YYFE RFDLVRF K V++AGL++ LRIGP+ AEW
Sbjct: 95 PAQGQVRAASPKFVMDLACSIRDKPYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEW 154
Query: 102 NYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENE 161
+GG PVWLH+ PG FRT N PFK MKRF I+D+MK+E FASQGG IILAQVENE
Sbjct: 155 TFGGVPVWLHYAPGTVFRTNNEPFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENE 214
Query: 162 YGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNS 221
YG++E AYG G + Y WAA A+ NT VPW+MCQQ DAPDP+INTCN FYCD F PNS
Sbjct: 215 YGDMEQAYGAGAKPYAMWAASMALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQFKPNS 274
Query: 222 PSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYM 270
P+KP WTEN+ GWF +FG + P RP ED+AF+VARFF GG+ QNYY+
Sbjct: 275 PTKPKFWTENWPGWFQTFGESNPHRPPEDVAFSVARFFGKGGSLQNYYV 323
Score = 352 bits (902), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 204/497 (41%), Positives = 269/497 (54%), Gaps = 62/497 (12%)
Query: 340 AHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQR 399
A +Y S C AFL+N DS D VTF Y LPAWSVSILPDCKNV FNTAKV SQ
Sbjct: 324 ADVYTDQSGGCVAFLSNVDSEKDKVVTFQSRSYDLPAWSVSILPDCKNVAFNTAKVRSQT 383
Query: 400 NNGDHPFAQQKNVNELLLASSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLW 457
D V L +S W + EK GI GN VR + INTTKD++DYLW
Sbjct: 384 LMMDM-------VPANLESSKVDGWSIFREKYGIWGNIDLVRNGFVDHINTTKDSTDYLW 436
Query: 458 YTASIHVMPGQ--GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEG 515
YT S V G L+IES GHA F+N +L+ YGN +NF + + L G
Sbjct: 437 YTTSFDVDGSHLAGGNHVLHIESKGHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAG 496
Query: 516 INTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGL 575
N L +LSM VGLQN G ++ AGAG+ SV + ++N DLSS +W Y+V V+
Sbjct: 497 KNKLSLLSMTVGLQNGGPMYEWAGAGITSVKISGMENRIIDLSSNKWEYKVNVD------ 550
Query: 576 DKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGR 635
P+G P+ L++ SMGKG AW+NG +IGR
Sbjct: 551 -------------------------------VPQGDDPVGLDMQSMGKGLAWLNGNAIGR 579
Query: 636 YWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
YW S CT CDYRG++ +KC++ CGQP Q YH+PR+W HP N LVI EE G
Sbjct: 580 YWPRISPVSDRCTSSCDYRGTFSPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKG 639
Query: 696 GDPSKISLLTKTGQHICSFVSEADPP-PVDSWKPNL-GVVSSSPQVRLACERGWHIAAIN 753
GDP+KI+ +T +CSFVSE P ++SW N + +V+L+C +G I+++
Sbjct: 640 GDPTKITFSRRTVASVCSFVSEHYPSIDLESWDRNTQNDGRDAAKVQLSCPKGKSISSVK 699
Query: 754 FASYGIPEGNCGSFRPGACHM-DVLPIVQK---------ACVGQIECSIPVSSAYLGVSA 803
F S+G P G C S++ G+CH + + +V+K AC+ C++ +S G
Sbjct: 700 FVSFGNPSGTCRSYQQGSCHHPNSISVVEKGTLGWAHRRACLNMNGCTVSLSDE--GFGE 757
Query: 804 GACPGLLKALAVEAHCS 820
CPG+ K LA+EA CS
Sbjct: 758 DLCPGVTKTLAIEADCS 774
>gi|373853838|ref|ZP_09596637.1| glycoside hydrolase family 35 [Opitutaceae bacterium TAV5]
gi|372473365|gb|EHP33376.1| glycoside hydrolase family 35 [Opitutaceae bacterium TAV5]
Length = 744
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 260/786 (33%), Positives = 377/786 (47%), Gaps = 127/786 (16%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
V++DHRAL++DG+R ++ SG++HYPRSTP +WP ++R ++ GL +ETY+FWN HE
Sbjct: 2 TVSFDHRALLLDGRRTLVLSGAVHYPRSTPAMWPRILRHMRQSGLNTVETYIFWNLHERR 61
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
RG F GR DLVRF + Q GL + LRIGPY CAE NYGG P WL +P I+ RT N
Sbjct: 62 RGVLDFSGRLDLVRFCRLAQAEGLNVILRIGPYICAETNYGGLPGWLRDVPDIRMRTDNE 121
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
FK E R++ + ++++ L A GGP+ILAQ+ENEY N+ YG G Y++W+ +
Sbjct: 122 AFKREKARWVRLVAEVIRP--LCAPNGGPVILAQIENEYDNIAATYGEDGRRYLRWSVEL 179
Query: 184 AVNLNTSVPWVMC--------QQEDA---PDPIINTCNGFYCDGFT----PNSPSKPIMW 228
A +L +PWV C ++DA + T N F P +P +W
Sbjct: 180 AQSLGLGIPWVTCAAGRAAEAGEKDAVASAGDSLETLNAFRAHEIIGQHFREHPEQPALW 239
Query: 229 TENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVAT 288
TEN++GW+ ++G +P R E+LA+A ARFF GG+ NY+++ GGTNFGR G L+ T
Sbjct: 240 TENWAGWYQTWGGVLPKREPEELAYATARFFAAGGSGVNYFLWHGGTNFGRD-GMYLLTT 298
Query: 289 SYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSN 348
+Y++ P+DEYG + K HL L+KA+ C + +++S+ G + + SS
Sbjct: 299 AYEFGGPLDEYG-LPTTKARHLARLNKALAACADKILASERPRAITGERNGLLKFQYSSG 357
Query: 349 DCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQ 408
F + + + V NG V + S + P V T K R FA
Sbjct: 358 --LTFWCDDVARTVRIVGKNGEVLYDS--SARVAP-----VRRTWKASGVR------FAP 402
Query: 409 QKNVNELLLASSAFSW-YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV--- 464
E L A +W E + ++ + EQ+ TKD +DY WY +I V
Sbjct: 403 WGWRAEPLPA----AWPAEAQSAVTARKPL------EQLLLTKDETDYCWYETAIVVEGS 452
Query: 465 -------------------------------MPGQGKEV------FLNIESLGHAALVFV 487
+ G EV L + + VF+
Sbjct: 453 GDVLVAGRDGSPAGLERGALARVGRRGRRPSIAGLASEVPANTVNTLRLTRVADIVHVFI 512
Query: 488 NKKLVAFG-------YGNHDFANF-----LINKKIELNEGINTLDILSMMVGLQNYGAW- 534
+ VA G D F L K + + G + L +L +GL G W
Sbjct: 513 DGTFVATTPTPLRERRGKMDAGLFTQTFELDLKALRITPGKHRLSLLCCALGLIK-GDWM 571
Query: 535 -----FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG 589
+ GL++ + NGK+ GEW +Q G+ GE G + + WK
Sbjct: 572 IGYENMALEKKGLWAPVFW---NGKK--LEGEWRHQPGLLGERCGFADPAAGSLLAWKTA 626
Query: 590 STLP---VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYW----SAYLA 642
+ L W++TTF P+G GP AL+L MGKG AW+NG IGRYW + +
Sbjct: 627 KAATGRGARRPLRWWRTTFTRPKGHGPWALDLGGMGKGMAWINGHCIGRYWLLADTDPMG 686
Query: 643 PSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHP--GENLLVIHEELGGDPSK 700
P K GS A+ P Q YH+P W+ G + LV+ EELGGDP+
Sbjct: 687 PWMAWMK-----GSLTAAPSSG----PTQRYYHVPDDWLRTDGGPDTLVLFEELGGDPAT 737
Query: 701 ISLLTK 706
+ L+ +
Sbjct: 738 VRLVRR 743
>gi|323371174|gb|ADX59436.1| beta-galactosidase [Coffea arabica]
Length = 338
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 183/339 (53%), Positives = 230/339 (67%), Gaps = 28/339 (8%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
V+YD R+L+I+G+R++L SGSIHYPRSTP++WP LI K+K GGL+VIETYVFWN HEP
Sbjct: 26 GQVSYDGRSLIIEGQRKLLFSGSIHYPRSTPDMWPSLISKAKHGGLDVIETYVFWNLHEP 85
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
GQY F+GR ++VRF++ +Q GL+ +RIGP+ AEW YGG P WLH +PGI +R+ N
Sbjct: 86 RHGQYDFKGRHNIVRFIREIQAHGLYAFIRIGPFIEAEWTYGGLPFWLHDVPGIVYRSDN 145
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M+ F KI++L K E L+A QGGPIIL Q+ENEY N E A+ G YV+WAA
Sbjct: 146 EPFKYHMQNFTTKIVNLFKSEGLYAPQGGPIILQQIENEYKNAERAFHEKGPPYVQWAAA 205
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFG 240
AV L T VPWVMC+Q+DAPDP+INTCNG C + F PNSP+KP +WT+N++
Sbjct: 206 MAVGLQTGVPWVMCKQDDAPDPVINTCNGRTCGETFVGPNSPNKPAIWTDNWTSL----- 260
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
G+F NYYMY GGTNFGRT G V TSY +APIDEYG
Sbjct: 261 --------------------KNGSFVNYYMYHGGTNFGRT-GSAFVLTSYYDEAPIDEYG 299
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLE 339
IRQPKWGHL++LH IK C + L+ + LG + E
Sbjct: 300 LIRQPKWGHLKQLHSVIKSCSQTLLHGVISVSPLGQQQE 338
>gi|188501572|gb|ACD54699.1| beta-D-galactosidase [Adineta vaga]
Length = 735
Score = 370 bits (951), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 251/739 (33%), Positives = 375/739 (50%), Gaps = 76/739 (10%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+V+YDHRA+ I+G R +L SG IHYPRSTP +WP L+ K+KE GL I+TYVFWN HE
Sbjct: 33 HVSYDHRAITINGNRTLLFSGVIHYPRSTPAMWPYLMSKAKEQGLNTIQTYVFWNMHEQK 92
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
RG Y F GR +L F++ AGLF++LR+GPY CAEW+YG PVWL+ IP I FR++N+
Sbjct: 93 RGTYDFSGRANLSLFLQEAANAGLFVNLRLGPYVCAEWDYGALPVWLNNIPNIAFRSSND 152
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
+K EMKRFL+ II + + A GGPIILAQ+ENEYG + A YV W
Sbjct: 153 AWKSEMKRFLSDII--VYVDGFLAKNGGPIILAQIENEYGGNDRA-------YVDWCGSL 203
Query: 184 AVN--LNTSVPWVMCQQEDAPDPIINTCNGFYC--DGFTPNS----PSKPIMWTENYSGW 235
N +T +PW+MC A + I TCNG C DG+ P++P+++TEN+ GW
Sbjct: 204 VSNDFASTQIPWIMCNGL-AANSTIETCNGCNCFDDGWMDRHRRTYPNQPLLFTENW-GW 261
Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAP 295
F +G + R EDLA++VA +F GG + YYM+ GG ++GRT GG + T+Y D
Sbjct: 262 FQGWGEGLGIRTPEDLAYSVAEWFANGGAYHAYYMWHGGNHYGRT-GGSGLTTAYSDDVI 320
Query: 296 IDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPT----------HQKLGAKLEAHIYHK 345
+ G +PK+ HL L + + + L+S D +G + + Y
Sbjct: 321 LRADGTPNEPKFTHLNRLQRLLASQAQVLLSQDSARLPIPYWDGKQWSVGTQQMVYSYPP 380
Query: 346 SSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHP 405
S F+ N ++ V FN + SV I + +++++N+A V
Sbjct: 381 S----IQFVIN-QAAFSLFVLFNKQNISIAGQSVQIYDNNEHLLWNSADV-------SGI 428
Query: 406 FAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM 465
F + +++ + Y E +S V EQ+N T D + YLWY ++ +
Sbjct: 429 FRNNTFLVPIVVGPLDWQVYSEPF-LSDLPVIVASTPLEQLNLTNDETIYLWYRRNVSLS 487
Query: 466 PGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELN------EGINTL 519
+ + ++ + F++++ V + + +H A IN I LN
Sbjct: 488 QPSAQTIVQVQTRRANSLIFFMDRQFVGY-FDDHSHAQGTINVNITLNLSQFLPNQQYLF 546
Query: 520 DILSMMVGLQNYGAWFDVAGAGLFSV--ILIDLKNGKRDLSSGE---WIYQVGVEGEYIG 574
+ILS+ +G+ N+ G G F I+ ++ G + L E W +Q G+ GE
Sbjct: 547 EILSVSLGIDNFN-----IGPGSFEYKGIVGNVSLGGQSLVGDEASIWEHQKGLFGEAYQ 601
Query: 575 LDKISLANSSFWKQGSTLPVNKSLIWYKTTF----LAPE--GKGPLALNLASMGKGQAWV 628
+ + + W T +NKS+ W++T F L E P+ L+ + +G A+V
Sbjct: 602 IYTEQGSKTVEWNPRWTTAINKSVTWFQTRFDLNHLVREDLNANPVLLDAFGLNRGHAFV 661
Query: 629 NGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLL 688
NG IG YW L T K C + Q +C QP+Q YHIP W+ P NLL
Sbjct: 662 NGNDIGLYW---LIEGTCQNKLC------CCLQNQTNCQQPSQRYYHIPSDWLKPTNNLL 712
Query: 689 VIHEELGG-DPSKISLLTK 706
+ EE+G P + L+ +
Sbjct: 713 TVFEEIGASSPKSVGLVQR 731
>gi|62869849|gb|AAY18075.1| beta-galactosidase, partial [Carica papaya]
Length = 263
Score = 368 bits (944), Expect = 9e-99, Method: Compositional matrix adjust.
Identities = 168/264 (63%), Positives = 202/264 (76%), Gaps = 1/264 (0%)
Query: 120 TTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKW 179
T N PFK M++F KI+ +MK E LF SQGGPIIL+Q+ENE+G VEW G G+ Y KW
Sbjct: 1 TDNEPFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKW 60
Query: 180 AADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSF 239
AA AV LNT VPW+MC+QEDAPDP+I+TCNGFYC+ FTPN KP MWTE ++GW+ F
Sbjct: 61 AARMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCENFTPNKNYKPKMWTEVWTGWYTEF 120
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
G AVP RP EDLAF++AR + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEY
Sbjct: 121 GGAVPTRPAEDLAFSIARLIQKGGSFVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEY 180
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
G R+PKWGHLR+LHKAIK E L+S++P+ LG EAH++ KS + CAAFLANYD+
Sbjct: 181 GLPREPKWGHLRDLHKAIKSSESALVSAEPSVTSLGNSQEAHVF-KSKSGCAAFLANYDT 239
Query: 360 SSDANVTFNGNVYFLPAWSVSILP 383
S A V+F Y LP WS+SILP
Sbjct: 240 KSSAKVSFGNGQYELPPWSISILP 263
>gi|217070894|gb|ACJ83807.1| unknown [Medicago truncatula]
Length = 283
Score = 365 bits (938), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 190/282 (67%), Positives = 216/282 (76%), Gaps = 7/282 (2%)
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A +L+T VPW+MCQQ +APDPIINTCN FYCD FTPNS +KP MWTEN+SGWFL+FG AV
Sbjct: 2 ATSLDTGVPWIMCQQANAPDPIINTCNSFYCDQFTPNSDNKPKMWTENWSGWFLAFGGAV 61
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P+RPVEDLAFAVARFF+ GGTFQNYYMY GGTNFGRT GGP ++TSYDYDAPIDEYG IR
Sbjct: 62 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFGRTTGGPFISTSYDYDAPIDEYGDIR 121
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
QPKWGHL++LHKAIKLCEE LI+SDPT G LE +Y K+ C+AFLAN SDA
Sbjct: 122 QPKWGHLKDLHKAIKLCEEALIASDPTITSPGPNLETAVY-KTGAVCSAFLANI-GMSDA 179
Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQ---KNVNELLLASS 420
VTFNGN Y LP WSVSILPDCKNVV NTAKV + FA + + V+ L +SS
Sbjct: 180 TVTFNGNSYHLPGWSVSILPDCKNVVLNTAKVNTASMISS--FATESLKEKVDSLDSSSS 237
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASI 462
+SW E VGIS +F + L EQINTT D SDYLWY+ SI
Sbjct: 238 GWSWISEPVGISTPDAFTKSGLLEQINTTADRSDYLWYSLSI 279
>gi|62869847|gb|AAY18074.1| beta-galactosidase [Carica papaya]
Length = 263
Score = 364 bits (935), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 168/264 (63%), Positives = 200/264 (75%), Gaps = 1/264 (0%)
Query: 120 TTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKW 179
T N PFK M++F KI+ +MK E LF SQGGPIIL+Q+ENE+G VEW G G+ Y KW
Sbjct: 1 TDNEPFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKW 60
Query: 180 AADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSF 239
AA AV LNT VPW+MC+QEDAPDP+I+TCNGFYC+ FTPN KP MWTE ++GW+ F
Sbjct: 61 AARMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCENFTPNKNYKPKMWTEVWTGWYTEF 120
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
G AVP RP EDLAF++ARF + GG+ NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEY
Sbjct: 121 GGAVPTRPAEDLAFSIARFIQKGGSSVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEY 180
Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
G R+PKWGHLR LHKAIK E L+S++P+ LG EAH + KS + CAAFLANYD+
Sbjct: 181 GLPREPKWGHLRNLHKAIKSSESALVSAEPSVTSLGNSQEAHAF-KSKSGCAAFLANYDT 239
Query: 360 SSDANVTFNGNVYFLPAWSVSILP 383
S A V+F Y LP WS+SILP
Sbjct: 240 KSSAKVSFGNGQYELPPWSISILP 263
>gi|56550179|emb|CAE51355.1| putative beta-galactosidase [Musa acuminata]
Length = 281
Score = 362 bits (930), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 172/286 (60%), Positives = 206/286 (72%), Gaps = 5/286 (1%)
Query: 100 EWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVE 159
EWN+GGFPVWL ++PGI FRT N PFK M +F KI+ +MK E LF SQGGPIIL+Q+E
Sbjct: 1 EWNFGGFPVWLKYVPGINFRTDNGPFKAAMAKFTEKIVAMMKSEGLFESQGGPIILSQIE 60
Query: 160 NEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTP 219
NEYG VE+ G + Y+ WAA AV LNT VPWVMC+Q+DAPDP+IN CNGFYCD F+P
Sbjct: 61 NEYGPVEYYGGTAAKNYLSWAAQMAVGLNTRVPWVMCKQDDAPDPVINACNGFYCDYFSP 120
Query: 220 NSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR 279
N P KP MWTE ++GWF F V + A V R + T + GTNFGR
Sbjct: 121 NKPYKPTMWTEAWTGWFTGFRGPVLTDCEDCFAVQVIRRWILVTTIVPW-----GTNFGR 175
Query: 280 TAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLE 339
TAGGP ++TSYDYDAPIDEYG +RQPKWGHLR+LHKAIK+CE L+S DPT KLG E
Sbjct: 176 TAGGPFISTSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKMCEPALVSGDPTVTKLGNYQE 235
Query: 340 AHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDC 385
AH+Y S CAAFL+N++ S A+VTFNG Y +P+WS+SILPDC
Sbjct: 236 AHVYRSKSGSCAAFLSNFNPHSYASVTFNGMKYNIPSWSISILPDC 281
>gi|84468366|dbj|BAE71266.1| putative beta-galactosidase [Trifolium pratense]
Length = 425
Score = 361 bits (926), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 188/423 (44%), Positives = 249/423 (58%), Gaps = 11/423 (2%)
Query: 292 YDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCA 351
YDAP+DEYG R PKWGHL++LHKAIKLCE L+ + LG +EA +Y SS CA
Sbjct: 1 YDAPVDEYGLPRLPKWGHLKDLHKAIKLCEHVLLYGKSVNVSLGPSVEADVYTDSSGACA 60
Query: 352 AFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKN 411
AF+AN D +D V F Y +PAWSVSILPDCKNVV+NTAKV +Q N +
Sbjct: 61 AFIANVDDKNDKTVEFRNASYHIPAWSVSILPDCKNVVYNTAKVTTQTNK---IAMIPEK 117
Query: 412 VNELLLASSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----- 464
+ + F W ++E GI G FV + INTTKDT+DYLW+T SI +
Sbjct: 118 LQQSDKGQKTFKWDVWKENPGIWGKPDFVINGFVDHINTTKDTTDYLWHTTSISIDENEE 177
Query: 465 MPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSM 524
+ +G + L IES GHA FVN+K YGN + F I L G N + +LS+
Sbjct: 178 LLKKGSKPVLVIESKGHALHAFVNQKYQGTAYGNGSHSAFTFKNPISLKAGKNEIALLSL 237
Query: 525 MVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSS 584
VGLQ G ++D GAG+ SV + L N DLSS W Y++GV+GE++ + + + NS
Sbjct: 238 TVGLQTAGPFYDFVGAGVTSVKIKGLNNKTIDLSSNAWTYKIGVQGEHLKIYQGNGLNSV 297
Query: 585 FWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLA-P 643
W S P ++L WYK AP G P+ L++ MGKG AW+NG+ IGRYW
Sbjct: 298 SWTSTSEPPKGQTLTWYKAIVDAPPGDEPVGLDMLYMGKGFAWLNGEGIGRYWPRISEFK 357
Query: 644 STGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISL 703
C ++CDYRG ++ KC CG+P+Q YH+PR+W P N+LV EE GGDP+KI+
Sbjct: 358 KEDCVEECDYRGKFNPDKCDTGCGEPSQKWYHVPRSWFKPSGNVLVFFEEKGGDPTKITF 417
Query: 704 LTK 706
+ +
Sbjct: 418 VRR 420
>gi|188501582|gb|ACD54708.1| beta-D-galactosidase-like protein [Adineta vaga]
Length = 735
Score = 360 bits (925), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 252/740 (34%), Positives = 371/740 (50%), Gaps = 80/740 (10%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V+YDHRA+ I+G R +L SG IHYPRSTP +WP L+ K+KE GL I+TYVFWN HE R
Sbjct: 34 VSYDHRAITINGNRTLLFSGVIHYPRSTPAMWPYLMSKAKEQGLNTIQTYVFWNIHEQKR 93
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G Y F GR +L F++ AGLF++LR+GPY CAEW+YG PVWL+ IP I FR++N+
Sbjct: 94 GTYDFSGRANLSLFLQEAANAGLFVNLRLGPYVCAEWDYGALPVWLNNIPNIAFRSSNDA 153
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
+K EMKRFL+ II + + A GGPIILAQ+ENEYG + A YV W
Sbjct: 154 WKSEMKRFLSDII--VYVDGFLAKNGGPIILAQIENEYGGNDRA-------YVDWCGSLV 204
Query: 185 VN--LNTSVPWVMCQQEDAPDPIINTCNGFYC--DGFTPNS----PSKPIMWTENYSGWF 236
N +T +PW+MC A + I TCNG C DG+ P++P+++TEN+ GWF
Sbjct: 205 SNDFASTQIPWIMCNGL-AANSTIETCNGCNCFDDGWMDRHRRTYPNQPLLFTENW-GWF 262
Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPI 296
+G + R EDLA++VA +F GG + YYM+ GG ++GRT GG + T+Y D +
Sbjct: 263 QGWGEGLGIRTPEDLAYSVAEWFANGGAYHAYYMWHGGNHYGRT-GGSGLTTAYSDDVIL 321
Query: 297 DEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKL----------GAKLEAHIYHKS 346
G +PK+ HL L + + + L+S D + G + + Y S
Sbjct: 322 RADGTPNEPKFTHLNRLQRLLASQAQVLLSQDSNRLSIPYWNGKQWTVGTQQMVYSYPPS 381
Query: 347 SNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKV--ISQRNNGDH 404
F+ N ++ V FN + SV I +++++N+A V IS+ N
Sbjct: 382 ----VQFVIN-QAAFSLFVLFNKQNISIAGQSVQIYDYNEHLLWNSADVSGISRNNTFLV 436
Query: 405 PFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV 464
P +++ + Y E S V EQ+N T D + YLWY ++ +
Sbjct: 437 P---------IVVGPLDWQVYSEPF-TSDLPVIVASTPLEQLNLTNDETIYLWYRRNVSL 486
Query: 465 MPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELN------EGINT 518
+ + ++ L F++++ V + + +H IN I LN
Sbjct: 487 SQPSVQTIVQVQTRRANSLLFFMDRQFVGY-FDDHSHTQGTINVNITLNLSQFLPNQQYI 545
Query: 519 LDILSMMVGLQNYGAWFDVAGAGLFSV--ILIDLKNGKRDLSSGE---WIYQVGVEGEYI 573
+ILS+ +G+ N+ G G F I+ ++ G + L E W +Q G+ GE
Sbjct: 546 FEILSVSLGIDNFN-----IGPGSFEYKGIVGNVSLGGQSLVGDEASIWEHQKGLFGEAH 600
Query: 574 GLDKISLANSSFWKQGSTLPVNKSLIWYKTTF----LAPE--GKGPLALNLASMGKGQAW 627
+ + + W T +NK + W++T F LA E P+ L+ +G A+
Sbjct: 601 QIYTEQGSKTVEWNPKWTTVINKPVTWFQTRFDLNHLAREDLNANPILLDAFGFNRGHAF 660
Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
VNG IG YW L T C + Q +C QP+Q YHI W+ P NL
Sbjct: 661 VNGNDIGLYW---LIEGTCQNNLC------CCLQNQTNCQQPSQRYYHISSDWLKPTNNL 711
Query: 688 LVIHEELGG-DPSKISLLTK 706
L + EE+G P + L+ +
Sbjct: 712 LTVFEEIGASSPKSVGLVQR 731
>gi|391229102|ref|ZP_10265308.1| beta-galactosidase [Opitutaceae bacterium TAV1]
gi|391218763|gb|EIP97183.1| beta-galactosidase [Opitutaceae bacterium TAV1]
Length = 743
Score = 357 bits (916), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 262/791 (33%), Positives = 373/791 (47%), Gaps = 138/791 (17%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
V++DHRAL++DG+R ++ SG++HYPRSTP +WP ++R ++ GL +ETY+FWN HE
Sbjct: 2 TVSFDHRALLLDGRRTLVLSGAVHYPRSTPAMWPRILRHMRQSGLNTVETYIFWNLHERR 61
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
RG F GR DLVRF + Q GL + LRIGPY CAE NYGG P WL +P I+ RT N
Sbjct: 62 RGVLDFSGRLDLVRFCRLAQAEGLNVILRIGPYICAETNYGGLPGWLRDVPDIRMRTDNE 121
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
FK E R++ + ++++ L A GGP+ILAQ+ENEY N+ YG G Y++W+ +
Sbjct: 122 AFKREKARWVRLVAEVIRP--LCAPNGGPVILAQIENEYDNIAATYGEDGRRYLRWSVEL 179
Query: 184 AVNLNTSVPWVMC--------QQEDA---PDPIINTCNGFYCDGFT----PNSPSKPIMW 228
A +L +PWV C ++DA + T N F P +P +W
Sbjct: 180 AQSLGLGIPWVTCAAGRAAEAGEKDAVASAGDSLETLNAFRAHEIIGQHFREHPEQPALW 239
Query: 229 TENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVAT 288
TEN++GW+ ++G +P R E+LA+A ARFF GG+ NY+++ GGTNFGR G L+ T
Sbjct: 240 TENWAGWYQTWGGVLPKREPEELAYATARFFAAGGSGVNYFLWHGGTNFGRD-GMYLLTT 298
Query: 289 SYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP-THQKLGAKLEAHIYHKSS 347
+Y++ P+DEYG R E L S P +K +E YH S
Sbjct: 299 AYEFGGPLDEYGLPTTKARHLARLNAALAACAGELLASERPGVVEKSSGVVE---YHYDS 355
Query: 348 NDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFA 407
F+ + + + V +G V + SV + P V K R FA
Sbjct: 356 G--LVFVCDDTARAVRIVKKSGEVLYDS--SVRVAP-----VRRAWKSSGVR------FA 400
Query: 408 QQKNVNELLLASSAFSW-YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-- 464
E L A +W E + ++ + EQ+ TKD +DY WY +I V
Sbjct: 401 PWGWRAEPLPA----AWPAEAQSAVTARKPL------EQLLPTKDETDYCWYETAIVVEG 450
Query: 465 --------------------------------MPGQGKEV------FLNIESLGHAALVF 486
+ G EV L + + VF
Sbjct: 451 SGDVLVAGRDGSPAGLERGALARVGRRGRRPSIAGLASEVPANTVNTLRLTRVADIVHVF 510
Query: 487 VNKKLVAFG-------YGNHDFANF-----LINKKIELNEGINTLDILSMMVGLQNYGAW 534
++ VA G D F L K + + G + L +L +GL G W
Sbjct: 511 IDGTFVATTPTPLRERRGKMDAGLFTQTFELDLKALRITPGKHRLSLLCCALGLIK-GDW 569
Query: 535 ------FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK- 587
+ GL++ + NGK+ GEW +Q G+ GE G + + WK
Sbjct: 570 MIGYENMALEKKGLWAPVFW---NGKK--LEGEWRHQPGLLGERCGFADPAAGSLLAWKT 624
Query: 588 ------QGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYL 641
+G+ P+N W++TTF P+G GP AL+L MGKG W+NG IGRYW L
Sbjct: 625 AKAATGRGARRPLN----WWRTTFTRPKGHGPWALDLGGMGKGFCWINGHCIGRYW---L 677
Query: 642 APSTGCTKKCDYRGSYDA----SKCQKHCGQPAQTLYHIPRTWVHP--GENLLVIHEELG 695
P T D G + A S G P Q YH+P W+ G + LV+ EELG
Sbjct: 678 LPDT------DPMGPWMAWMKGSLTAAPSGGPTQRYYHVPDDWLRTDGGPDTLVLFEELG 731
Query: 696 GDPSKISLLTK 706
GDP+ + L+ +
Sbjct: 732 GDPATVRLVRR 742
>gi|414590082|tpg|DAA40653.1| TPA: hypothetical protein ZEAMMB73_851266 [Zea mays]
Length = 580
Score = 348 bits (894), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 202/606 (33%), Positives = 306/606 (50%), Gaps = 45/606 (7%)
Query: 226 IMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL 285
++WTEN++ F ++G V R ED+A+AV RFF GG+ NYYMY GGTNFGRT G
Sbjct: 1 MLWTENWTQQFRAYGDQVAMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASY 59
Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK 345
V T Y +AP+DEYG ++PK+GHLR+LH I+ ++ + + + LG EAHI+
Sbjct: 60 VLTGYYDEAPMDEYGMYKEPKFGHLRDLHNVIRSYQKAFLWGQHSSEILGHGYEAHIFEL 119
Query: 346 SSND-CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDH 404
C +FL+N ++ D V F G+ +++P+ SVSIL CKNVV+NT +V Q +
Sbjct: 120 PEEKLCLSFLSNNNTGEDGTVIFRGDKHYVPSRSVSILAGCKNVVYNTKRVFVQHSE--- 176
Query: 405 PFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV 464
+ + +++ ++ + + E + + + EQ N TKD +DYLWYT S +
Sbjct: 177 ---RSFHTSDVTSKNNQWEMFSETIPKYRDTKVRTKEPLEQYNQTKDDTDYLWYTTSFRL 233
Query: 465 ----MPGQGK-EVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTL 519
+P + L ++S HA + F N V GN F+ K ++L G+N +
Sbjct: 234 ESDDLPFRNDIRPVLQVKSSAHAMMGFANDAFVGCARGNKQVKGFMFEKPVDLKVGVNHV 293
Query: 520 DILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKIS 579
+LS +G+++ G G+ ++ L G DL W ++ +EGEY +
Sbjct: 294 VLLSSTMGMKDSGGELAEVKGGIQECLIQGLNTGTLDLQVNGWGHKAALEGEYKEIYSEK 353
Query: 580 LANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSA 639
WK +++ WYK F P+G P+ L+++SM KG +VNG+ +GRYW +
Sbjct: 354 GLGKVQWKPAEN---DRAATWYKRYFDEPDGDDPVVLDMSSMSKGMIFVNGEGVGRYWVS 410
Query: 640 YLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPS 699
Y + G P+Q +YHIPR ++ +NLLVI EE G P
Sbjct: 411 Y----------------------RTLAGTPSQAVYHIPRPFLKSKDNLLVIFEEEMGKPD 448
Query: 700 KISLLTKTGQHICSFVSEADPPPVDSW-----KPNLGVVSSSPQVRLACERGWHIAAINF 754
I + T T IC F+SE +P + +W K L S + L C I + F
Sbjct: 449 GILVQTVTRDDICLFISEHNPGQIKTWDTDGDKIKLIAEDHSRRGTLTCPPEKTIQEVVF 508
Query: 755 ASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKAL 813
AS+G P+G CG+F G CH + IV+K C+G+ C +PV G C L
Sbjct: 509 ASFGNPDGMCGNFTVGTCHTPNAKQIVEKECLGKPSCMLPVDHTVYGADIN-CQSTTATL 567
Query: 814 AVEAHC 819
V+ C
Sbjct: 568 GVQVRC 573
>gi|293331757|ref|NP_001169479.1| uncharacterized protein LOC100383352 [Zea mays]
gi|224029591|gb|ACN33871.1| unknown [Zea mays]
Length = 580
Score = 348 bits (892), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 207/608 (34%), Positives = 305/608 (50%), Gaps = 49/608 (8%)
Query: 226 IMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL 285
++WTEN++ F ++G V R ED+A+AV RFF GG+ NYYMY GGTNFGRT G
Sbjct: 1 MLWTENWTQQFRAYGDQVAMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASY 59
Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK 345
V T Y +AP+DEYG ++PK+GHLR+LH I+ ++ + + + LG EAHI+
Sbjct: 60 VLTGYYDEAPMDEYGMYKEPKFGHLRDLHNVIRSYQKAFLWGQHSSEILGHGYEAHIFEL 119
Query: 346 SSND-CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGD- 403
C +FL+N ++ D V F G+ +++P+ SVSIL CKNVV+NT +V Q +
Sbjct: 120 PEEKLCLSFLSNNNTGEDGTVIFRGDKHYVPSRSVSILAGCKNVVYNTKRVFVQHSERSF 179
Query: 404 HPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVR-PDLAEQINTTKDTSDYLWYTASI 462
H N+ ++S Y + + VR + EQ N TKD +DYLWYT S
Sbjct: 180 HTSDVTSKNNQWEMSSETIPKYRD--------TKVRTKEPLEQYNQTKDDTDYLWYTTSF 231
Query: 463 HV----MPGQGK-EVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGIN 517
+ +P + L ++S HA + F N V GN F+ K ++L G+N
Sbjct: 232 RLESDDLPFRNDIRPVLQVKSSAHAMMGFANDAFVGCARGNKQVKGFMFEKPVDLKVGVN 291
Query: 518 TLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDK 577
+ +LS +G+++ G G+ ++ L G DL W ++ +EGEY +
Sbjct: 292 HVVLLSSTMGMKDSGGELAEVKGGIQECLIQGLNTGTLDLQVNGWGHKAALEGEYKEIYS 351
Query: 578 ISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYW 637
WK +++ WYK F P+G P+ L+++SM KG +VNG+ +GRYW
Sbjct: 352 EKGLGKVQWKPAEN---DRAATWYKRYFDEPDGDDPVVLDMSSMSKGMIFVNGEGVGRYW 408
Query: 638 SAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGD 697
+Y + G P+Q +YHIPR ++ +NLLVI EE G
Sbjct: 409 VSY----------------------RTLAGTPSQAVYHIPRPFLKSKDNLLVIFEEEMGK 446
Query: 698 PSKISLLTKTGQHICSFVSEADPPPVDSW-----KPNLGVVSSSPQVRLACERGWHIAAI 752
P I + T T IC F+SE +P + +W K L S + L C I +
Sbjct: 447 PDGILVQTVTRDDICLFISEHNPGQIKTWDTDGDKIKLIAEDHSRRGTLTCPPEKTIQEV 506
Query: 753 NFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLK 811
FAS+G P+G CG+F G CH + IV+K C+G+ C +PV G C
Sbjct: 507 VFASFGNPDGMCGNFTVGTCHTPNAKQIVEKECLGKPSCMLPVDHTVYGADIN-CQSTTA 565
Query: 812 ALAVEAHC 819
L V+ C
Sbjct: 566 TLGVQVRC 573
>gi|227204157|dbj|BAH56930.1| AT4G35010 [Arabidopsis thaliana]
Length = 377
Score = 344 bits (882), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 163/295 (55%), Positives = 212/295 (71%), Gaps = 3/295 (1%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ VTYD +L+IDGKR +L SGSIHYPRSTPE+WP +I+++K+GGL I+TYVFWN HE
Sbjct: 38 NKEVTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHE 97
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P +G++ F GR DLV+F+K +Q+ G+++ LR+GP+ AEW +GG P WL +PGI FRT
Sbjct: 98 PQQGKFNFSGRADLVKFIKLIQKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTD 157
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N FKE +R++ I+D MK+E LFASQGGPIIL Q+ENEY V+ AY G Y+KWA+
Sbjct: 158 NKQFKEHTERYVRMILDKMKEERLFASQGGPIILGQIENEYSAVQRAYKQDGLNYIKWAS 217
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSF 239
+ ++ +PWVMC+Q DAPDP+IN CNG +C D F PN +KP +WTEN++ F F
Sbjct: 218 NLVDSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNRENKPSLWTENWTTQFRVF 277
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDA 294
G R VED+A++VARFF GT NYYMY GGTNFGRT+ V T Y DA
Sbjct: 278 GDPPTQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYEDA 331
>gi|56550181|emb|CAE51356.1| putative beta-galactosidase [Musa AAB Group]
Length = 282
Score = 342 bits (876), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 169/291 (58%), Positives = 203/291 (69%), Gaps = 14/291 (4%)
Query: 100 EWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVE 159
EWN+GGFPVWL ++PGI FRT N PFK M +F KI+ +MK E LF SQGGPIIL+Q+E
Sbjct: 1 EWNFGGFPVWLKYVPGINFRTDNGPFKAAMAKFTEKIVAMMKSEGLFESQGGPIILSQIE 60
Query: 160 NEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTP 219
NEYG VE+ G + Y+ WAA AV LNT VPWVMC+Q+DAPDP+IN NGFYCD F+P
Sbjct: 61 NEYGPVEYYGGAAAKNYLSWAAQMAVGLNTGVPWVMCKQDDAPDPVINAGNGFYCDYFSP 120
Query: 220 NSPSKPIMWTENYSGWFLSFGYAVPFRPVED-----LAFAVARFFETGGTFQNYYMYFGG 274
NS + + G L + VP F V + E G F+NYYMY GG
Sbjct: 121 NS-------LKTFFGG-LKLDWLVPVSGSSSSQTVRTGFCVQVYTE-GWIFRNYYMYHGG 171
Query: 275 TNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKL 334
TNFGRTAGG ++TSYDYDAPIDEY +RQPKWGHLR+LHKAIK+CE L+S DPT KL
Sbjct: 172 TNFGRTAGGLFISTSYDYDAPIDEYVLLRQPKWGHLRDLHKAIKMCEPALVSGDPTVTKL 231
Query: 335 GAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDC 385
G EAH+Y S CAAFL+N++ S A+VTFNG Y +P+WS+SILPDC
Sbjct: 232 GNYQEAHVYRSKSGSCAAFLSNFNPHSYASVTFNGMKYNIPSWSISILPDC 282
>gi|356503083|ref|XP_003520341.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 15-like [Glycine
max]
Length = 482
Score = 341 bits (874), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 150/319 (47%), Positives = 207/319 (64%), Gaps = 5/319 (1%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ V+YD + +I+ ++ ++ SG +HYP ST ++WP + ++ K GGL+ IE+Y+FW+ H
Sbjct: 5 FATEVSYDAHSHIINEEKHIIFSGVVHYPXSTVDLWPAIFKRXKYGGLDAIESYIFWDRH 64
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP+R +Y G D + F+K +QEA L+ LRIGPY C WN+GGF +WLH +P I+ R
Sbjct: 65 EPVRREYDCSGNLDFIDFLKLIQEAELYFILRIGPYVCEXWNFGGFSLWLHNMPEIELRI 124
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N K EM+ F KI+++ K+ LFA GGPIIL +ENEYGN+ Y + Y+KW
Sbjct: 125 DNPIXKNEMQIFTTKIVNMAKEAKLFAPXGGPIILTPIENEYGNIMTDYREARKPYIKWC 184
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
A A+ N VPW+MC DAP P+INTCNG YCD F PN+P M+ F +G
Sbjct: 185 AQMALTQNIGVPWIMCXXRDAPQPMINTCNGHYCDSFXPNNPKSSKMFRX-----FQKWG 239
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
VP + E+ F+VARFF++GG NYYMY GGTNFG GGP + SY+YDAP+DEYG
Sbjct: 240 ERVPHKSAEESTFSVARFFQSGGILNNYYMYHGGTNFGHMVGGPYMTASYEYDAPLDEYG 299
Query: 301 FIRQPKWGHLRELHKAIKL 319
+ +PKW H ++LHK +
Sbjct: 300 NLNKPKWEHFKQLHKELTF 318
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 40/68 (58%), Gaps = 1/68 (1%)
Query: 733 VSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECS 791
V+ Q+ +C+ G I+ I FAS+G PEGNCGSF+ G D +V+ AC+G+ C
Sbjct: 415 VNEGAQLDPSCQIGKTISQIQFASFGNPEGNCGSFKGGTWEATDSQSVVEVACIGRNSCG 474
Query: 792 IPVSSAYL 799
V+ ++
Sbjct: 475 FTVTKRHI 482
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/43 (51%), Positives = 30/43 (69%)
Query: 605 FLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGC 647
F AP G P+ ++L GK QAWVNG+SIG YWS+++ + GC
Sbjct: 363 FEAPFGIDPMVMDLQDSGKRQAWVNGKSIGCYWSSWITNTNGC 405
>gi|255563859|ref|XP_002522930.1| beta-galactosidase, putative [Ricinus communis]
gi|223537857|gb|EEF39473.1| beta-galactosidase, putative [Ricinus communis]
Length = 450
Score = 333 bits (854), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 199/494 (40%), Positives = 270/494 (54%), Gaps = 62/494 (12%)
Query: 158 VENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGF 217
+ENEYGN+E A+ G YV WAA AV+L T VPW+MC+Q DAPDP+INTCNG C G
Sbjct: 1 IENEYGNIEAAFHEKGSSYVHWAAKMAVDLQTGVPWIMCKQIDAPDPVINTCNGMKC-GE 59
Query: 218 T---PNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGG 274
T PNSP+KP +WTEN++ ++ +G R +D+AF VA F G++ NYYMY GG
Sbjct: 60 TFGGPNSPNKPSLWTENWTSFYQVYGGEPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGG 119
Query: 275 TNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKL 334
TNFGRTA ++ YD AP+DEYG IRQPKWGHL+ELH IK C L+ T+ +
Sbjct: 120 TNFGRTAAAYVITGYYD-QAPLDEYGLIRQPKWGHLKELHAVIKSCSTTLLEGVQTNLSV 178
Query: 335 GAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAK 394
G +A+++ C AFL N D S +A V F + L S+SILPDC N++FNTAK
Sbjct: 179 GQLQQAYMFEAQGGGCVAFLVNND-SVNATVGFRNKSFELLPKSISILPDCDNIIFNTAK 237
Query: 395 VISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGN--RSFVRPD-LAEQINTTKD 451
V + N + +S + +E+ + + N S ++ D L E +NTTKD
Sbjct: 238 VNAGSN------------RRITTSSKKLNTWEKYIDVIPNYSDSTIKSDTLLEHMNTTKD 285
Query: 452 TSDYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHD-FANFLINKKI 510
SDYLWYT S K + L++ESL H A FVN K +G+ + F++ I
Sbjct: 286 KSDYLWYTFSFQPNLSCTKPL-LHVESLAHVAYAFVNNKYSGSAHGSKNGKVPFIMEVPI 344
Query: 511 ELNEG--INTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGV 568
L++ N + ILS++VGL VG+
Sbjct: 345 VLDDDGLSNNISILSVLVGL------------------------------------SVGL 368
Query: 569 EGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWV 628
GE + L W + + + + L W+K F P+G P+ LNLA+M KG+AWV
Sbjct: 369 LGETLQLYGKEHLEMVKWSKAD-ISIAQPLTWFKLEFDTPKGNDPVVLNLATMSKGEAWV 427
Query: 629 NGQSIGRYWSAYLA 642
NGQSIGRYW ++L
Sbjct: 428 NGQSIGRYWISFLT 441
>gi|348687417|gb|EGZ27231.1| hypothetical protein PHYSODRAFT_553859 [Phytophthora sojae]
Length = 825
Score = 331 bits (849), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 236/760 (31%), Positives = 370/760 (48%), Gaps = 107/760 (14%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+V+Y R IDG+R +L GSIHYPRS+ W L+R +K GL IE YVFWN HE
Sbjct: 86 SVSYSARGFEIDGRRTLLLGGSIHYPRSSEGEWETLLRAAKRDGLNHIEMYVFWNLHEQE 145
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
RG + F G + RF + E GLFLH+R GPY CAEW+ GG P+WL++IPG++ R++N
Sbjct: 146 RGVFNFAGNANATRFYELAAEVGLFLHVRFGPYVCAEWSNGGLPLWLNWIPGMKVRSSNA 205
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
P++ EM+RF+ +++L + A GGPII+AQ+ENE+ + YV+W D
Sbjct: 206 PWQWEMERFVTYMVELSRP--FLAKNGGPIIMAQIENEFA-------MHDPEYVEWCGDL 256
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGF----TPNSPSKPIMWTENYSGWFLSF 239
L+TS+PWVMC A + I+ +CNG C F PS P++WTE+ GWF ++
Sbjct: 257 VKRLDTSIPWVMCYANAAENTIL-SCNGNDCVDFAVKHVKERPSDPLVWTED-EGWFQTW 314
Query: 240 GYAV--PF----RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYD 293
P R ED+A+AVAR+F GG NYYMY GG NFGR A V T Y
Sbjct: 315 AKDKKNPLPNDQRTAEDMAYAVARWFAVGGAAHNYYMYHGGNNFGRAASAG-VTTKYADG 373
Query: 294 APIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSD----------PTHQKLG--AKLEAH 341
+ G +PK HLR+LH+A+ C + L+ +D PTH + + L+
Sbjct: 374 VNLHSDGLSNEPKRSHLRKLHEALIDCNDILMRNDRQLLHPHELAPTHGETAEASSLQQR 433
Query: 342 IYHKSSNDCAAFLANYDSSSDANVT--FNGNVYFLPAWSVSILPDCKNVVFNTAKVISQR 399
+ + D +A ++ +D VT F N Y L S+ I+ D ++FNTA V
Sbjct: 434 AFIYGAEDGPNQVAFLENQADKKVTVVFRDNKYELAPTSMMIIKDGA-LLFNTADV---- 488
Query: 400 NNGDHPFAQQKNVNELLLASSA--FSWYEEKVG-ISGNRSFVRPDLAEQINTTKDTSDYL 456
P + ++ A++ +W E V ++ R V EQ+ T D SDYL
Sbjct: 489 -RKSFPGTVHRAYTPIVQAATLQWETWSELNVSSLTPRRRVVAERPVEQLRLTADRSDYL 547
Query: 457 WYTASIHVMPGQ------GKEVFLNIESLGHAALV------FVNKKLVAFGYGN--HDFA 502
Y + V P + + S ++++ + ++ +A+ GN +F
Sbjct: 548 TYETTFTVDPADTPIDIDSDASTVKVTSCEASSIIAFVDGWLIGERNLAYPGGNCSKEF- 606
Query: 503 NFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSG-E 561
F + I++ ++L ++S+ +G+ + G+ G V G+++L+ G +
Sbjct: 607 RFSLPTNIDVTRQ-HSLKLVSVSLGIYSLGSNHTKGLTGKVRV-------GRKNLAKGHQ 658
Query: 562 WIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN--KSLIWYKTTFLAPEGKGP------ 613
W + GE + + + +S W + + + + WY T+F P + P
Sbjct: 659 WEMYPTLVGEQLEIYRPEWLSSVPWTPVPRVVASGRQLMSWYWTSFSYPAFELPAEADPV 718
Query: 614 -----LALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQ 668
+ L+ + +G+A++NG +GRYW G+
Sbjct: 719 SEPFSILLDCIGLTRGRAYINGHDLGRYW------------------------LVNDEGE 754
Query: 669 PAQTLYHIPRTW-VHPGENLLVIHEELGGDPSKISLLTKT 707
Q YH+PR W V N+LV+ +ELGG + + L++ +
Sbjct: 755 FVQRYYHVPRDWLVKDQANVLVVFDELGGSVADVRLVSSS 794
>gi|320129049|gb|ADW19770.1| beta-galactosidase [Fragaria chiloensis]
Length = 219
Score = 329 bits (843), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 144/219 (65%), Positives = 179/219 (81%)
Query: 34 EVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRI 93
E+WP+LI+++K+GGL+VI+TYVFWN HEP G+YYFE +DLV+F+K VQ+AGL++HLRI
Sbjct: 1 EMWPDLIQRAKDGGLDVIQTYVFWNGHEPSPGKYYFEDNYDLVKFIKLVQQAGLYVHLRI 60
Query: 94 GPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPI 153
GPY CAEWN+GGFPVWL +IPGIQFRT N PFK++M+RF KI+++MK E LF S GGPI
Sbjct: 61 GPYVCAEWNFGGFPVWLKYIPGIQFRTDNGPFKDQMQRFTTKIVNMMKAERLFESHGGPI 120
Query: 154 ILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFY 213
IL+Q+ENEYG +E+ G G+ Y WAA AV L T VPWVMC+Q+DAPDP+IN CNGFY
Sbjct: 121 ILSQIENEYGPMEYEIGAPGKAYTDWAAQMAVGLGTGVPWVMCKQDDAPDPVINACNGFY 180
Query: 214 CDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLA 252
CD F+PN KP MWTE ++GWF FG AVP+RP EDLA
Sbjct: 181 CDYFSPNKAYKPKMWTEAWTGWFTEFGGAVPYRPAEDLA 219
>gi|325183103|emb|CCA17560.1| betagalactosidase putative [Albugo laibachii Nc14]
Length = 811
Score = 325 bits (832), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 249/752 (33%), Positives = 362/752 (48%), Gaps = 96/752 (12%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+V Y R VIDGK +L GSIHY RSTP+ W L+ K+KE GL +++ Y+FWN+HEP
Sbjct: 98 DVKYTKRGFVIDGKASILLGGSIHYARSTPDTWDSLLAKAKEDGLNLVQLYIFWNFHEPR 157
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
RG +YF R +L F + V GLF+HLR GPY CAEWN GG P+WL IPG++ R+ +
Sbjct: 158 RGSFYFADRGNLTHFFERVVAHGLFVHLRFGPYVCAEWNRGGLPLWLDRIPGMKVRSNSE 217
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
+++EM R + +I+L + F+ GGPII+AQ+ENEY + YV W +
Sbjct: 218 SWRQEMNRIILIMINLARP--YFSVNGGPIIMAQIENEYNGHD-------PTYVAWLSQL 268
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNS----PSKPIMWTEN---YSGWF 236
L +PW MC A + I+TCN C F + PS+P++WTEN Y W
Sbjct: 269 VRKLGIGIPWTMCNGASAVN-TISTCNDNDCFQFAEKNAKVFPSQPLVWTENEAWYEKWA 327
Query: 237 ---LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYD 293
++ R E +A+ VAR+F GG NYYMY GG NFGRTA V T Y
Sbjct: 328 TKNIAQDGQNDQRSPEQVAYVVARWFAVGGAMHNYYMYHGGNNFGRTASAG-VTTMYADG 386
Query: 294 APIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP--THQK-LGAK------LEAHIYH 344
A + G +PK HLR+LH + C + L+S++ H K LG + A+IY
Sbjct: 387 AILHHDGLDNEPKRSHLRKLHHTLIRCNKALLSNERQLNHAKPLGPEGKNAYTQRAYIYG 446
Query: 345 KSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDH 404
S FL N + A + Y LP ++ IL D NV++NT+ V +G
Sbjct: 447 NCS-----FLENTHAIHRACFRYQLKEYCLPPQTIVIL-DHNNVLYNTSDV-----SGTL 495
Query: 405 PFAQQKNVNELL--LASSAFSWYEEKVGISGNRSFVRPDL-AEQINTTKDTSDYLWYTAS 461
++ + L+ S W E V R + D EQ+ T+DT+DYL Y
Sbjct: 496 GSRSTRSFSPLIRFRKSDWKIWSEWDVNPHNVRDQIVNDSPLEQLLVTQDTTDYLMYQNE 555
Query: 462 IH---VMPGQGK---EVFLNIESLGHAALVFVNKKLVA---FGYGNHDFANFLINKKIEL 512
+ P + K + I ++ LVF+N + + Y D +N L
Sbjct: 556 VRWGSNGPTKNKMKSSILKFISCDANSFLVFINGEFIGEQHLAYPGDDCSNIFRFDLGPL 615
Query: 513 NE-GIN-TLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSG---EWIYQVG 567
+ G N TL ILS+ +G+ + G + G+ S + ID +R L G W+ G
Sbjct: 616 GKYGANLTLSILSISLGIHSLG---EKHQKGIVSDVQID----ERSLVYGPHERWVMFSG 668
Query: 568 VEGEYIGLDKISLANSSFWKQGSTLPVNK-SLIWYKTTFLAPE----GKGPLALNLASMG 622
+ GE + L +NS W+ + K + WY T F+ + + + L+ M
Sbjct: 669 LIGELLKLYDPMWSNSVPWRNLNVQTDRKRTSKWYMTKFVLKQLDWDTETSVLLDCKGMN 728
Query: 623 KGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVH 682
+G+ ++NG +GRYW ++ D G+Y Q Y IP W+H
Sbjct: 729 RGRIYLNGHDLGRYW---------LIRRSD--GAY------------VQRYYTIPVAWLH 765
Query: 683 PG--ENLLVIHEELGGDP-SKISLLTKTGQHI 711
N LVI EEL + + ++T T + I
Sbjct: 766 AANKSNYLVIFEELRNETIESMRIVTSTMRRI 797
>gi|300121971|emb|CBK22545.2| unnamed protein product [Blastocystis hominis]
Length = 721
Score = 323 bits (828), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 226/739 (30%), Positives = 356/739 (48%), Gaps = 96/739 (12%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD R+ +DGKR + +GS+HYPR+TPE+W ++ ++ E GL +I+ Y FWN HEP++
Sbjct: 35 VTYDERSFFLDGKRSIFLAGSVHYPRATPEMWDTILDQAVEDGLNLIQIYTFWNLHEPVK 94
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
GQY +EG D+ F++ + GLF+++RIGPY CAEW+ GG PVW++++ G++ R N+
Sbjct: 95 GQYNWEGIADIRLFLQKCADRGLFVNMRIGPYVCAEWDNGGIPVWVNYLDGVRLRANNDV 154
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
+K+EM ++ + D + + FA +GGPII +Q+ENE W G Y+ W + A
Sbjct: 155 WKKEMGDWMKVLTDYTR--DFFADRGGPIIFSQIENEL----WG---GAREYIDWCGEFA 205
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSP-------SKPIMWTENYSGWFL 237
+L +VPW+MC D + IN CNG C + + +P WTEN GWF
Sbjct: 206 ESLELNVPWMMCNG-DTSEKTINACNGNDCSSYLESHGQSGRILVDQPGCWTEN-EGWFQ 263
Query: 238 SFGYAVP---------FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVAT 288
G A R ED F V +F + GG++ NYYM+FGG ++G+ AG +
Sbjct: 264 IHGAASAERDDYEGWDARSAEDYTFNVLKFMDRGGSYHNYYMWFGGNHYGKWAGNGMT-N 322
Query: 289 SYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP---THQKLGAKLEAHIYHK 345
Y I +PK H ++H+ + E L++ + L ++
Sbjct: 323 WYTNGVMIHSDTLPNEPKHSHTAKMHRMLANIAEVLLNDKAQVNNQKHLNCDNCNAFEYR 382
Query: 346 SSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHP 405
+ +F+ N S+D V + VY LPAWS+ +L + NV+F T V P
Sbjct: 383 YGDRLVSFVENNKGSADK-VIYRDIVYELPAWSMIVLDEYDNVLFETNNV--------KP 433
Query: 406 FAQQK--NVNELLLASSAFSWYEEKVGI---SGNRSFVRPDLAEQINTTKDTSDYLWYTA 460
+ + + E L F ++ E V R V P EQ+N T+D +++L+Y
Sbjct: 434 VNKHRVYHCEEKL----EFEYWNEPVSTLSQEAPRVVVSPKANEQLNMTRDLTEFLYYET 489
Query: 461 SIHVMPGQGKEVFLNIESLGHAALV-FVNKKLVAFG--YGNHDFANFLINKKIELNEGIN 517
+ E L+I A V +V+ V + +HD +N ++ +G +
Sbjct: 490 EVEFPQ---DECTLSIGGTDANAFVAYVDDHFVGSDDEHTHHD-GWHTMNINMKSGKGKH 545
Query: 518 TLDILSMMVGLQN------YGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGE 571
L +LS +G+ N +W G+ I K D+ + EW + G+ GE
Sbjct: 546 KLVLLSESLGVSNGMDSNLDPSWASSRLKGICGWI----KLCGNDIFNQEWKHYPGLVGE 601
Query: 572 YIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEG--KG-PLALNLASMGKGQAWV 628
+ + WK S + +L WY++TF P+G +G + L M +GQA+V
Sbjct: 602 AKQVFTDEGMKTVTWK--SDVENADNLAWYRSTFKTPQGLKRGIEVLLRPEGMNRGQAYV 659
Query: 629 NGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWV--HPGEN 686
NG +IGRYW + G+ Q YHIP+ W+ EN
Sbjct: 660 NGHNIGRYW-----------------------MIKDGNGEYTQGYYHIPKDWLKGEGEEN 696
Query: 687 LLVIHEELGGDPSKISLLT 705
+LV+ E LG +++ T
Sbjct: 697 VLVLGETLGASDPSVTICT 715
>gi|281202334|gb|EFA76539.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
PN500]
Length = 611
Score = 320 bits (820), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 206/591 (34%), Positives = 316/591 (53%), Gaps = 47/591 (7%)
Query: 127 EEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVN 186
E RF+ K + E FA+ GGPII++QVENEYG V+ YG G Y +W+A A +
Sbjct: 2 ESWMRFITKYL-----ERHFAANGGPIIMSQVENEYGWVQERYGESGTKYAQWSARLAQS 56
Query: 187 LNTSVPWVMCQQEDAPDPIINTCNGFYC----DGFTPNSPSKPIMWTENYSGWFLSFGYA 242
LN VPW+MCQQ+D D +INTCNGFYC +G P++P +TEN+ GWF + +
Sbjct: 57 LNVGVPWIMCQQDDI-DSVINTCNGFYCHDWIEGHWARYPNQPAFFTENWPGWFQQWKQS 115
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
P RPVED+ +AV +F GG+ NYYM+ GGTNFGRT+ P+V SYDYDA +DEYG
Sbjct: 116 TPHRPVEDVLYAVGNWFARGGSLMNYYMWHGGTNFGRTS-SPMVVNSYDYDAALDEYGNP 174
Query: 303 RQPKWGHLRELHKAIKLCEEYLISSD--PTHQKLGAKLEAHIYHKS-SNDCAAFLANYDS 359
+PK+ H + + ++ +++ P + LG + IYH + + +FL N
Sbjct: 175 SEPKYSHAAKFNNLLQKYSHIFLNAPEIPRSEYLGGS--SSIYHYTFGGESLSFLINNHE 232
Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKV--ISQRNNGDHPFAQQKNVNELLL 417
S+ ++ +NG + + WSV +L + + VF++A +S+ F+ + N +
Sbjct: 233 SALNDIVWNGQNHIIKPWSVHLLYN-NHTVFDSAATPEVSKLAMTSKRFSPVNSFNNAYI 291
Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVF-LNI 476
+ W EE + ++ + +P EQ++ T D +DYLWY I++ +G EVF N+
Sbjct: 292 S----QWVEE-IDMTDSTWSSKP--LEQLSLTHDKTDYLWYVTEINLQV-RGAEVFTTNV 343
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
+ HA +++ K + + + F N K ++ G + L IL+ +G+Q+Y +
Sbjct: 344 SDVLHA---YIDGKYQSTIWSANPF-----NIKSDIPLGWHKLQILNSKLGVQHYTVDME 395
Query: 537 VAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
GL I + G D+++ W + V GE + + + W S V +
Sbjct: 396 KVTGGLLGNIWV----GGTDITNNGWSMKPYVNGERLAIYNPNNIFKVDWSSFSG--VQQ 449
Query: 597 SLIWYKTTFLAPEGKGP-LALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
L WYK FL +LN++ M KG W+NG+ + RYW + GC C Y+G
Sbjct: 450 PLTWYKINFLHELSPNKHYSLNMSGMNKGMIWLNGKHVARYW---ITKGWGC-NGCSYQG 505
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
Y C +CG+P+Q YH+P+ W+ G NLLVI EE+GG+P I L K
Sbjct: 506 GYTDQLCSTNCGEPSQINYHLPQDWLIEGANLLVIFEEVGGNPKSIKLEEK 556
>gi|359496328|ref|XP_003635211.1| PREDICTED: beta-galactosidase 6-like [Vitis vinifera]
gi|296080974|emb|CBI18606.3| unnamed protein product [Vitis vinifera]
Length = 198
Score = 320 bits (819), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 147/200 (73%), Positives = 171/200 (85%), Gaps = 3/200 (1%)
Query: 621 MGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTW 680
MGKGQAWVNGQSIGRYW AYLAPSTGCT CDYRG+YDASKC ++CGQPAQTLYHIPRTW
Sbjct: 1 MGKGQAWVNGQSIGRYWPAYLAPSTGCTTNCDYRGAYDASKCLRNCGQPAQTLYHIPRTW 60
Query: 681 VHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVR 740
VH G+NLLV+HEELGGDPSKISLLT+TGQ +C+ VSEADPPP DSW+PNL +S S QVR
Sbjct: 61 VHSGKNLLVLHEELGGDPSKISLLTRTGQEVCAHVSEADPPPADSWQPNLEFMSQSSQVR 120
Query: 741 LACERGWHIAAINFASYGIPEGNCGSFRPGACHMDVLPIVQKACVGQIECSIPVSSAYLG 800
L CE+GWHI+ INFAS+G P G+CG+F PG CH +VL +VQ+AC+GQ C+IPVS+A LG
Sbjct: 121 LTCEQGWHISMINFASFGTPRGHCGTFNPGNCHANVLSVVQQACIGQEGCAIPVSTARLG 180
Query: 801 VSAGACPGLLKALAVEAHCS 820
CPG+LK+LA+EA CS
Sbjct: 181 ---DPCPGVLKSLAIEALCS 197
>gi|217075721|gb|ACJ86220.1| unknown [Medicago truncatula]
Length = 208
Score = 300 bits (767), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 135/185 (72%), Positives = 159/185 (85%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
+NVTYDH+ALVIDGKRRVL SGSIHYPRSTP++WP+LI+KSK+GG++VIETYVFWN HEP
Sbjct: 24 SNVTYDHKALVIDGKRRVLMSGSIHYPRSTPQMWPDLIQKSKDGGIDVIETYVFWNLHEP 83
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
+RGQY FEGR DLV FVK V AGL++HLRIGPY CAEWNYGGFP+WLHFI GI+FRT N
Sbjct: 84 VRGQYNFEGRGDLVGFVKVVAAAGLYVHLRIGPYVCAEWNYGGFPLWLHFIAGIKFRTNN 143
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK EMKRF AKI+D+MKQENL+ASQGGPIIL+Q+ENEYGN++ + Y+ WAA
Sbjct: 144 EPFKAEMKRFTAKIVDMMKQENLYASQGGPIILSQIENEYGNIDTHDARAAKSYIDWAAS 203
Query: 183 TAVNL 187
A +L
Sbjct: 204 MATSL 208
>gi|302144233|emb|CBI23471.3| unnamed protein product [Vitis vinifera]
Length = 315
Score = 296 bits (759), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 137/154 (88%), Positives = 145/154 (94%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
VTYDHRALVIDGKRRVLQSGSIHYPRS PEVWPE+IRKSKEGGL+VIETYVFWN HEP+
Sbjct: 159 TVTYDHRALVIDGKRRVLQSGSIHYPRSMPEVWPEIIRKSKEGGLDVIETYVFWNNHEPV 218
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
RG+YYFEGRFDLVRFVKTVQEAGL +HLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN+
Sbjct: 219 RGEYYFEGRFDLVRFVKTVQEAGLLVHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTND 278
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQ 157
FK EMKRFLAKI+ LMK+ NLFA QGGPIILAQ
Sbjct: 279 LFKNEMKRFLAKIVSLMKEANLFAPQGGPIILAQ 312
>gi|68161830|emb|CAJ09952.1| beta-galactosidase [Mangifera indica]
Length = 362
Score = 296 bits (758), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 158/375 (42%), Positives = 222/375 (59%), Gaps = 23/375 (6%)
Query: 330 THQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVV 389
T LG E H+++ S CAAFLANYD++S A V F Y LP WS+SILPDCK V
Sbjct: 1 TVTSLGNNQEVHVFNPKSGSCAAFLANYDTTSSAKVNFQNMQYELPPWSISILPDCKTAV 60
Query: 390 FNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW---YEEKVGISGNRSFVRPDLAEQI 446
FNTA++ +Q + Q V S FSW EE S +++F L EQ+
Sbjct: 61 FNTARLGAQSS-----LKQMTPV-------STFSWQSYIEESASSSDDKTFTTDGLWEQL 108
Query: 447 NTTKDTSDYLWYTASIHVMPGQG-----KEVFLNIESLGHAALVFVNKKLVAFGYGNHDF 501
N T+D SDYLWY +I++ +G ++ L I S GHA VF+N +L YG D
Sbjct: 109 NVTRDASDYLWYMTNINIDSNEGFLKNGQDPLLTIWSAGHALHVFINGQLSGTVYGGVDN 168
Query: 502 ANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VILIDLKNGKRDLSSG 560
++ +++ G+N L +LS+ VGLQN G F+ G+ V L L G RDLS
Sbjct: 169 PKLTFSQNVKMRVGVNQLSLLSISVGLQNVGTHFEQWNTGVLGPVTLRGLNEGTRDLSKQ 228
Query: 561 EWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLAS 620
+W Y++G++GE + L +S ++S W +GS+L + L WYKTTF AP G PLAL++++
Sbjct: 229 QWSYKIGLKGEDLSLHTVSGSSSVEWVEGSSLAQKQPLTWYKTTFNAPAGNEPLALDMST 288
Query: 621 MGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTW 680
MGKG W+N QSIGR+W Y+A G +C+Y G+Y KC +CGQP+Q YH+PR+W
Sbjct: 289 MGKGLIWINSQSIGRHWPGYIAH--GSCGECNYAGTYTDKKCHTNCGQPSQRWYHVPRSW 346
Query: 681 VHPGENLLVIHEELG 695
++P NLLV+ + +G
Sbjct: 347 LNPTGNLLVVLKRVG 361
>gi|359496728|ref|XP_002268994.2| PREDICTED: beta-galactosidase 6-like, partial [Vitis vinifera]
Length = 177
Score = 295 bits (755), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 137/154 (88%), Positives = 145/154 (94%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
VTYDHRALVIDGKRRVLQSGSIHYPRS PEVWPE+IRKSKEGGL+VIETYVFWN HEP+
Sbjct: 24 TVTYDHRALVIDGKRRVLQSGSIHYPRSMPEVWPEIIRKSKEGGLDVIETYVFWNNHEPV 83
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
RG+YYFEGRFDLVRFVKTVQEAGL +HLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN+
Sbjct: 84 RGEYYFEGRFDLVRFVKTVQEAGLLVHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTND 143
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQ 157
FK EMKRFLAKI+ LMK+ NLFA QGGPIILAQ
Sbjct: 144 LFKNEMKRFLAKIVSLMKEANLFAPQGGPIILAQ 177
>gi|3388167|gb|AAC28739.1| beta-galactosidase [Carica papaya]
Length = 203
Score = 294 bits (752), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 130/204 (63%), Positives = 162/204 (79%), Gaps = 1/204 (0%)
Query: 29 PRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLF 88
PRSTPE+WP+LI+ +KEGGL+VI+TYVFWN HEP G YYFE R+D V+F+K V +AGL+
Sbjct: 1 PRSTPEMWPDLIQNAKEGGLDVIQTYVFWNGHEPSPGNYYFEDRYDPVKFIKLVHQAGLY 60
Query: 89 LHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFAS 148
+HLRIGPY C EWN+GGFPVWL ++PGIQFRT N PFK +M++F KI+++MK E LF
Sbjct: 61 VHLRIGPYICGEWNFGGFPVWLKYVPGIQFRTDNGPFKAQMQKFTEKIVNMMKAEKLFEP 120
Query: 149 QGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINT 208
QGGP I++Q+E EYG + W G G+ Y KWAA AV L T VPW+MC+QEDAPDPII+T
Sbjct: 121 QGGP-IMSQIEIEYGPIGWEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQEDAPDPIIDT 179
Query: 209 CNGFYCDGFTPNSPSKPIMWTENY 232
CNGFYC+ F PN+ KP MWTE +
Sbjct: 180 CNGFYCENFMPNANYKPKMWTEAW 203
>gi|356544613|ref|XP_003540743.1| PREDICTED: beta-galactosidase 8-like [Glycine max]
Length = 288
Score = 293 bits (750), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 157/285 (55%), Positives = 192/285 (67%), Gaps = 5/285 (1%)
Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAP 295
F+SFG VP RPVEDLAFAVARF++ GGTFQNYYM+ GGTNFGRT GGP ++TSYD+D P
Sbjct: 6 FVSFGDVVPHRPVEDLAFAVARFYQRGGTFQNYYMFHGGTNFGRTTGGPFISTSYDFDTP 65
Query: 296 IDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLA 355
IDEYG IRQPKW HL+ +HKAIKLCE+ L+++ PT LG +EA +Y+ + AAFLA
Sbjct: 66 IDEYGIIRQPKWDHLKNVHKAIKLCEKALLATGPTITYLGPNIEAAVYNIGAV-SAAFLA 124
Query: 356 NYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK-NVNE 414
N + +DA V+FNGN Y LPAW VS LPDCK+VV NTAK+ S K V
Sbjct: 125 NI-AKTDAKVSFNGNSYHLPAWYVSTLPDCKSVVLNTAKINSASMISSFTTESLKEEVGS 183
Query: 415 LLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFL 474
L + S +SW E +GIS SF + L EQINTT D SDYLWY++SI + E L
Sbjct: 184 LDDSGSGWSWISEPIGISKAHSFSKFWLLEQINTTADRSDYLWYSSSIDL--DAATETVL 241
Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTL 519
+IESLGHA FVN KL G GNH+ + ++ I L G NT+
Sbjct: 242 HIESLGHALHAFVNGKLAGSGTGNHEKVSVKVDIPITLVYGKNTI 286
>gi|452819191|gb|EME26260.1| beta-galactosidase [Galdieria sulphuraria]
Length = 652
Score = 290 bits (743), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 183/516 (35%), Positives = 273/516 (52%), Gaps = 42/516 (8%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+A VT+D RA+VIDGKR +L GS HYP+ E WP+ + +K+ GL +E Y+FWN HE
Sbjct: 3 TAQVTFDKRAVVIDGKRTILYCGSYHYPKIHYEHWPQALELAKDCGLNCLEVYIFWNVHE 62
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
+G Y+FE ++ RF++ QE GL + LR+GPY CAE +YGGFP WL IPGI+FRT
Sbjct: 63 KKKGVYHFEREGNIFRFLQLAQERGLKVILRMGPYICAETSYGGFPYWLREIPGIEFRTY 122
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
N PF +EMKR+L I ++K+ L+ +GGPIIL Q+ENEY V YG G+ Y+ W
Sbjct: 123 NEPFMKEMKRWLTDINRMLKENKLYHQKGGPIILVQIENEYDIVSSIYGAAGQKYLHWCY 182
Query: 182 DTAVNLNTSVPWVMCQQED-----APDPIINTCNGFY----CDGFTPNSPSKPIMWTENY 232
+ + + W+ + + + D I T N FY D P +P++WTE +
Sbjct: 183 E--LYKEGASEWLTSKDSEYFRVASIDKSIETINDFYGHRRIDSLKALKPHQPLLWTEFW 240
Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDY 292
GW+ + A RPV+D+ +A ARF GG+ NYYM+ GGT+FG A T YD+
Sbjct: 241 IGWYNIWRGAQRQRPVDDVIYAAARFIAQGGSGMNYYMFHGGTHFGNLAMYG-QTTGYDF 299
Query: 293 DAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYHK-SSNDC 350
DAP+D YG + K+ L++L+ + E L+S D P QKL + + + S D
Sbjct: 300 DAPVDSYGRPTE-KFERLKQLNHCLSNLEYILLSQDEPEVQKLTPNVNVYRWKDIESGDE 358
Query: 351 AAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK 410
+F+ N D S + V L SV I N +V N + QK
Sbjct: 359 CSFVCN-DQRSQSYVIVAERAVCLKPLSVKIY-------LNHEEVFDSSQNSYN--VSQK 408
Query: 411 NVNELLLASSAFSWYEEKVGISGNR-------SFVRPDLAEQINTTKDTSDYLWYTASIH 463
+ + L + W ++ I F P + + ++ T+D +DY+WYT
Sbjct: 409 SYHRLDYVCN--EWKTMQIPIPSKEKKDKEHFEFSFPHIPDMLHITQDETDYMWYTGVGT 466
Query: 464 VM-PGQGK------EVFLNIESLGHAALVFVNKKLV 492
+ P +G+ ++ + +E+ + VF+N+K V
Sbjct: 467 IYCPFKGENTPHCLKIHMELEAADYVH-VFLNRKYV 501
>gi|301123859|ref|XP_002909656.1| beta-galactosidase, putative [Phytophthora infestans T30-4]
gi|262100418|gb|EEY58470.1| beta-galactosidase, putative [Phytophthora infestans T30-4]
Length = 706
Score = 286 bits (732), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 199/581 (34%), Positives = 294/581 (50%), Gaps = 73/581 (12%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+VTY R IDGK+ +L GSIHYPRS+P W +L+R++K GL IE YVFWN HE
Sbjct: 84 SVTYSPRGFEIDGKQTLLLGGSIHYPRSSPGEWEQLLREAKRDGLNHIEMYVFWNLHEQE 143
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
RG + F G ++ RF + E GLFLH+R GPY CAEWN GG P+WL++IPG++ R++N
Sbjct: 144 RGVFNFAGNANITRFYELAAEVGLFLHVRFGPYVCAEWNNGGLPLWLNWIPGMEVRSSNA 203
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
P++ EM+RF+ +++L + A GGPII+AQ+ENE+ W Y+ W +
Sbjct: 204 PWQREMERFIRYMVELSRP--FLAKNGGPIIMAQIENEFA---WH----DPEYIAWCGNL 254
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGF----TPNSPSKPIMWTENYSGWFLSF 239
L+TS+PWVMC A + I+ +CN C F PS P++WTE+ GWF ++
Sbjct: 255 VKQLDTSIPWVMCYANAAENTIL-SCNDDDCVDFAVKHVKERPSDPLVWTED-EGWFQTW 312
Query: 240 --GYAVPF----RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYD 293
P R ED+A+AVAR+F GG NYYMY GG N+GR A V T Y
Sbjct: 313 QKDKKNPLPNDQRSPEDVAYAVARWFAVGGAAHNYYMYHGGNNYGRAASAG-VTTMYADG 371
Query: 294 APIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAK---LEAHIYHKSSNDC 350
+ G +PK HLR+LH+A+ C + L+ +D Q L + L K+S+
Sbjct: 372 VNLHSDGLSNEPKRTHLRKLHEALIECNDVLLRND--RQVLNPRELPLVDEQTVKASSQQ 429
Query: 351 AAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK 410
AF+ ++ + + ++F+TA V P Q +
Sbjct: 430 RAFVYGPEAEPNQDGA---------------------ILFDTADV-----RKSFPGRQHR 463
Query: 411 NVNELLLASSAF--SWYEEKVGISGNRSFVRPDLA-EQINTTKDTSDYLWYTASIHVMPG 467
L+ AS+ +W E V + R V D EQ+ T D SDYL Y + P
Sbjct: 464 TYTPLVKASALAWKAWSELNVSSTTPRRRVVADQPIEQLRLTADQSDYLTYETTF--TPK 521
Query: 468 QGKEV--------FLNIESLGHAALV---FVNKKLVAFGYGN--HDFANFLINKKIELNE 514
Q +V + E+ ALV + ++ +A+ GN +F+ F + IE+
Sbjct: 522 QLSDVDDDMWTVKVTSCEASSIIALVDGWLIGERNLAYPGGNCSKEFS-FHLPASIEVGR 580
Query: 515 GINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKR 555
+ L ++S+ +G+ + G+ G + DL G+R
Sbjct: 581 Q-HDLKLVSVSLGIYSLGSNHSKGVTGSVRIGHKDLARGQR 620
>gi|217075791|gb|ACJ86255.1| unknown [Medicago truncatula]
Length = 267
Score = 285 bits (730), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 146/271 (53%), Positives = 184/271 (67%), Gaps = 4/271 (1%)
Query: 270 MYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
MY GGTNF R+ GGP +ATSYDYDAPIDEYG IRQ KWGHL++++KAIKLCEE LI++DP
Sbjct: 1 MYHGGTNFDRSTGGPFIATSYDYDAPIDEYGIIRQQKWGHLKDVYKAIKLCEEALITTDP 60
Query: 330 THQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVV 389
LG LEA +Y K+ + CAAFLAN D+ +D V F+GN Y LPAWSVS+LPDCKNVV
Sbjct: 61 KISSLGQNLEAAVY-KTGSVCAAFLANVDTKNDKTVNFSGNSYHLPAWSVSMLPDCKNVV 119
Query: 390 FNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTT 449
NTAK+ S + F + +++ L +SS +SW E VGIS + + L EQINTT
Sbjct: 120 LNTAKINSASAISN--FVTE-DISSLETSSSKWSWINEPVGISKDDILSKTGLLEQINTT 176
Query: 450 KDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKK 509
D SDYLWY+ S+ + G + L+IESLGH F+N KL GN D + ++
Sbjct: 177 ADRSDYLWYSLSLDLADDPGSQTVLHIESLGHTLHAFINGKLAGNQAGNSDKSKLNVDIP 236
Query: 510 IELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
I L G N +D+LS+ VGLQNYGA+FD GA
Sbjct: 237 IALVSGKNKIDLLSLTVGLQNYGAFFDTVGA 267
>gi|16649045|gb|AAL24374.1| beta-galactosidase [Arabidopsis thaliana]
gi|20260008|gb|AAM13351.1| beta-galactosidase [Arabidopsis thaliana]
Length = 420
Score = 284 bits (727), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 171/438 (39%), Positives = 233/438 (53%), Gaps = 34/438 (7%)
Query: 270 MYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
MY GGTNFGRT+ + YD AP+DEYG +RQPK+GHL+ELH AIK L+
Sbjct: 1 MYHGGTNFGRTSSSYFITGYYD-QAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQ 59
Query: 330 THQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVV 389
T LG +A+++ ++N C AFL N D+ + + + F N Y L S+ IL +CKN++
Sbjct: 60 TILSLGPMQQAYVFEDANNGCVAFLVNNDAKA-SQIQFRNNAYSLSPKSIGILQNCKNLI 118
Query: 390 FNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTT 449
+ TAKV + N Q NV + ++ + E + S L E N T
Sbjct: 119 YETAKVNVKMNTRVTTPVQVFNVPD------NWNLFRETIPAFPGTSLKTNALLEHTNLT 172
Query: 450 KDTSDYLWYTASIHV-MPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINK 508
KD +DYLWYT+S + P ++ ES GH VFVN L G+G+ D +
Sbjct: 173 KDKTDYLWYTSSFKLDSPCTNPSIY--TESSGHVVHVFVNNALAGSGHGSRDIRVVKLQA 230
Query: 509 KIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGV 568
+ L G N + ILS MVGL + GA+ + GL V + DLS +W Y VG+
Sbjct: 231 PVSLINGQNNISILSGMVGLPDSGAYMERRSYGLTKVQISCGGTKPIDLSRSQWGYSVGL 290
Query: 569 EGEYIGLDKISLANSSFWKQGST-LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAW 627
GE + L + N W L N+ L WYKTTF P G GP+ L+++SMGKG+ W
Sbjct: 291 LGEKVRLYQWKNLNRVKWSMNKAGLIKNRPLAWYKTTFDGPNGDGPVGLHMSSMGKGEIW 350
Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
VNG+SIGRYW ++L P+ GQP+Q++YHIPR ++ P NL
Sbjct: 351 VNGESIGRYWVSFLTPA----------------------GQPSQSIYHIPRAFLKPSGNL 388
Query: 688 LVIHEELGGDPSKISLLT 705
LV+ EE GGDP ISL T
Sbjct: 389 LVVFEEEGGDPLGISLNT 406
>gi|452821358|gb|EME28389.1| beta-galactosidase [Galdieria sulphuraria]
Length = 1171
Score = 281 bits (720), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 171/482 (35%), Positives = 246/482 (51%), Gaps = 54/482 (11%)
Query: 19 RVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRF 78
R+L SIHYPR P W +LI +KE G+ IETYVFWN HE +G Y F GR DL F
Sbjct: 476 RILFPASIHYPRCQPSDWQQLIEFAKEAGINCIETYVFWNQHEKEKGVYDFSGRLDLFGF 535
Query: 79 VKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIID 138
++T+ +AGL+ LRIGPY CAE ++GGFP WL I GI+FRT N PF+ E R++ +++
Sbjct: 536 IRTIAKAGLYALLRIGPYICAETHFGGFPHWLRDIDGIEFRTQNEPFQRESSRWVRFLVE 595
Query: 139 LMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQ 198
+ N F SQGGPI++ Q ENEY + YG G Y+KW ++ A +L VP MC+
Sbjct: 596 KLNSNNCFYSQGGPIVMVQFENEYKLIGQNYGEAGLNYLKWCSELAKDLQLPVPLFMCK- 654
Query: 199 EDAPDPIINTCNGFYCDGFTPNS----PSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFA 254
+ + ++ T N FY N P++P +WTE ++GW+ +G A RP +DL +A
Sbjct: 655 -GSIENVLETINDFYGHQEMENHHREYPNQPAIWTECWTGWYDVWGSAHHIRPCKDLFYA 713
Query: 255 VARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELH 314
V RFF GG NYYM+ GGTN+ + A L TSYDYDAPIDEYG + K+ L+ +H
Sbjct: 714 VLRFFAQGGKGINYYMFHGGTNYDQLAMY-LQTTSYDYDAPIDEYGR-KTKKYFGLQYIH 771
Query: 315 KAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCA------------AFLANYDSSSD 362
+ + E++ S L KLEA I H ++ F N +S
Sbjct: 772 RQL---EQHFAS-------LALKLEAPIAHSYEDNYVWIFIWEEQGSNCIFFCNDHPTST 821
Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
V + Y L SV ++ D ++ + ++ QK + + + + +
Sbjct: 822 KQVQWKEQEYCLAPLSVQMVVDHHRLILKSDQLFVDEE------LIQKELKPISVTTEEW 875
Query: 423 SW--YEEKVGISG----------------NRSFVRPDLAEQINTTKDTSDYLWYTASIHV 464
+W Y+E + + N E + T +DY WY A +
Sbjct: 876 TWQYYKENIPTTDITSSASQSSSISSLSSNTEIETQVPVEMLRYTGTATDYAWYIAHYQI 935
Query: 465 MP 466
P
Sbjct: 936 DP 937
>gi|414879451|tpg|DAA56582.1| TPA: hypothetical protein ZEAMMB73_811947 [Zea mays]
Length = 249
Score = 281 bits (719), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 123/205 (60%), Positives = 160/205 (78%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
VTYD RAL++DG RR+L SG +HYPRSTPE+WP+LI K+K+GGL+VI+TYVFWN HEP
Sbjct: 36 GEVTYDGRALILDGARRMLFSGDMHYPRSTPEMWPDLIAKAKKGGLDVIQTYVFWNAHEP 95
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
++GQ+ FEGR+DLV+F++ + GL++ LRIGP+ +EW YGG P WL IP I FR+ N
Sbjct: 96 VQGQFNFEGRYDLVKFIREIHAQGLYVSLRIGPFVESEWKYGGLPFWLRGIPNITFRSDN 155
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
PFK M++F+ KI++LMK E LF QGGPII++Q+ENEY VE A+ G YV WAA
Sbjct: 156 EPFKRHMQKFVTKIVNLMKDERLFYPQGGPIIISQIENEYKLVEAAFHSKGSSYVHWAAA 215
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIIN 207
AVNL T VPW+MC+Q+DAPDPI++
Sbjct: 216 MAVNLQTGVPWMMCKQDDAPDPIVS 240
>gi|183604891|gb|ACC64532.1| beta-galactosidase 6 inactive isoform [Oryza sativa Indica Group]
Length = 244
Score = 280 bits (716), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 122/205 (59%), Positives = 157/205 (76%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
L +TYD RALV+ G RR+ SG +HY RSTPE+WP+LI K+K GGL+VI+TYVFWN H
Sbjct: 25 LGREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVH 84
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EPI+GQY FEGR+DLV+F++ +Q GL++ LRIGP+ AEW YGGFP WLH +P I FR+
Sbjct: 85 EPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRS 144
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
N PFK+ M+ F+ KI+ +MK E L+ QGGPII++Q+ENEY +E A+G G YV+WA
Sbjct: 145 DNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWA 204
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPI 205
A AV L T VPW+MC+Q DAPDP+
Sbjct: 205 AAMAVGLQTGVPWMMCKQNDAPDPV 229
>gi|212723424|ref|NP_001132807.1| uncharacterized protein LOC100194296 [Zea mays]
gi|194695440|gb|ACF81804.1| unknown [Zea mays]
Length = 467
Score = 275 bits (703), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 165/488 (33%), Positives = 255/488 (52%), Gaps = 52/488 (10%)
Query: 350 CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQ 409
C AFL+N+++ DA +TF G YF+P S+S+L DC+ VVF T V +Q N FA Q
Sbjct: 7 CVAFLSNHNTKDDATMTFRGRPYFVPRHSISVLADCETVVFGTQHVNAQHNQRTFHFADQ 66
Query: 410 ---KNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-- 464
NV E+ + + + K+ + + N TKD +DY+WYT+S +
Sbjct: 67 TAQNNVWEMFDGENVPKYKQAKIRLR--------KAGDLYNLTKDKTDYVWYTSSFKLEA 118
Query: 465 --MPGQGK-EVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDI 521
MP + + L + S GHA++ FVN K V G+G F + K ++L +G+N + +
Sbjct: 119 DDMPIRSDIKTVLEVNSHGHASVAFVNNKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAV 178
Query: 522 LSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLA 581
L+ +G+ + GA+ + AG+ V + L G DL++ W + VG+ GE +
Sbjct: 179 LASSMGMTDSGAYMEHRLAGVDRVQITGLNAGTLDLTNNGWGHIVGLVGERKQIYTDKGM 238
Query: 582 NSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYL 641
S WK ++ L WYK F P G+ P+ L++++MGKG +VNGQ IGRYW +Y
Sbjct: 239 GSVTWKPAMN---DRPLTWYKRHFDMPSGEDPVVLDMSTMGKGMMFVNGQGIGRYWISY- 294
Query: 642 APSTGCTKKCDYRGSYDASKCQKHC-GQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSK 700
KH G+P+Q LYH+PR+++ +N+LV+ EE G P
Sbjct: 295 ----------------------KHALGRPSQQLYHVPRSFLRQKDNMLVLFEEEFGRPDA 332
Query: 701 ISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSS-------PQVRLACERGWHIAAIN 753
I +LT +IC+F+SE +P + SW+ +++ + LAC I +
Sbjct: 333 IMILTVKRDNICTFISERNPAHIMSWERKDSQITAKANADDLRARAALACPPKKLIQQVV 392
Query: 754 FASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKA 812
FASYG P G CG++ G+CH +V+KAC+G+ C++PV++ G A C G
Sbjct: 393 FASYGNPAGICGNYTVGSCHTPRAKEVVEKACLGKRVCTLPVAADVYGGDAN-CSGTTAT 451
Query: 813 LAVEAHCS 820
LAV+A CS
Sbjct: 452 LAVQAKCS 459
>gi|351722837|ref|NP_001235722.1| lectin [Glycine max]
gi|217314871|gb|ACK36970.1| lectin [Glycine max]
Length = 447
Score = 274 bits (701), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 159/428 (37%), Positives = 239/428 (55%), Gaps = 31/428 (7%)
Query: 408 QQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPG 467
Q ++ N+ S ++ +E + I SF + E +N TKD SDYLWY+ ++V
Sbjct: 21 QLRHQNDFYYISKSWMTTKEPLNIWSKSSFTVEGIWEHLNVTKDQSDYLWYSTRVYVSDS 80
Query: 468 -----QGKEVF--LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLD 520
+ +V L I+ + VF+N +L+ + I ++ G N
Sbjct: 81 DILFWEENDVHPKLTIDGVRDILRVFINGQLIV--------KDEQFKAVISVSIGKNDCT 132
Query: 521 ILSMMVGLQNYGAWFDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKIS 579
S + NYGA+ + GAG+ I I +NG DLS W YQVG++GE++
Sbjct: 133 AGS----INNYGAFLEKDGAGIRGKIKITGFENGDIDLSKSLWTYQVGLQGEFLKFYSEE 188
Query: 580 LANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSA 639
NS W + + + + WYKT F P G P+AL+ SMGKGQAWVNGQ IGRYW+
Sbjct: 189 NENSE-WVELTPDAIPSTFTWYKTYFDVPGGIDPVALDFKSMGKGQAWVNGQHIGRYWTR 247
Query: 640 YLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPS 699
++P +GC + CDYRG+Y++ KC +CG+P QTLYH+PR+W+ NLLVI EE GG+P
Sbjct: 248 -VSPKSGCQQVCDYRGAYNSDKCSTNCGKPTQTLYHVPRSWLKATNNLLVILEETGGNPF 306
Query: 700 KISLLTKTGQHICSFVSEADPPPV------DSWKPNLGVVSSSPQVRLACERGWHIAAIN 753
+IS+ + + IC+ VSE++ PP+ D + + P++ L C++G I+++
Sbjct: 307 EISVKLHSSRIICAQVSESNYPPLQKLVNADLIGEEVSANNMIPELHLHCQQGHTISSVA 366
Query: 754 FASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKA 812
FAS+G P G+C +F G CH + IV +AC G+ CSI +S + GV CPG++K
Sbjct: 367 FASFGTPGGSCQNFSRGNCHAPSSMSIVSEACQGKRSCSIKISDSAFGVD--PCPGVVKT 424
Query: 813 LAVEAHCS 820
L+VEA C+
Sbjct: 425 LSVEARCT 432
>gi|452825532|gb|EME32528.1| beta-galactosidase [Galdieria sulphuraria]
Length = 752
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 227/766 (29%), Positives = 354/766 (46%), Gaps = 91/766 (11%)
Query: 6 TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
++D RA+ ++GKR +L GS+ YP+ W ++ +KE GL ++ YVFWN HE RG
Sbjct: 8 SFDSRAITLNGKRTLLLGGSLQYPKIHHTQWNNTLKLAKECGLNFLDIYVFWNVHEKKRG 67
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
+ F D+ RF++ + GL + LR+GPY CAE +YGGFP WL IPGIQFRT N+PF
Sbjct: 68 IFTFTEEADIFRFLQMAHQHGLLVMLRLGPYICAETSYGGFPCWLREIPGIQFRTYNDPF 127
Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
E+KR+L I L+K++ LF QGGPI+L Q+ENEY V GE Y+ W +
Sbjct: 128 MREVKRWLFYITTLLKEKRLFFPQGGPIVLVQLENEYDLVSKIQLSKGEQYLNWYNELYR 187
Query: 186 NLNTSVPWVMCQQEDAPDPI---------------------INTCNGFY----CDGFTPN 220
L VP +MC+ +P+ + I T N FY
Sbjct: 188 ELAFDVPLIMCR--SSPEEVGEFCSCSKEPELSTIASVETCIETFNSFYGHKKIADLRRR 245
Query: 221 SPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRT 280
P +PI+WTE + GW+ + A R ED+ +A RF GG +YYM+ GGT+F
Sbjct: 246 KPHQPILWTEFWIGWYDIWTSAPRKRSTEDVIYAALRFIAQGGAGFSYYMFHGGTHFNNL 305
Query: 281 AGGPLVATSYDYDAPIDEYGFIRQPKWGH--LRELHKAIKLCEEYLISSD-PTHQKLGAK 337
A TSY +D+PIDEYG +P + L+ ++ + +L+S D P L +
Sbjct: 306 AMYS-QTTSYYFDSPIDEYG---RPSFLFYMLKRINHILHQFSSHLLSQDHPQVLHLLPQ 361
Query: 338 LEAHIYHK-SSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVI 396
+ A I+ + SS +FL N DS A + F ++ + SV++ + ++F++
Sbjct: 362 VVAFIWQEHSSQQSLSFLCN-DSEQIAYIMFQQSMMKMNPLSVAVFLE-NELLFDS---- 415
Query: 397 SQRNNGDHPFAQQKNVNELLLAS-SAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDY 455
S + PF K + F +S + F + L + ++ T+D +DY
Sbjct: 416 SSGYDWQIPFRDFKPLERAYFRELKTFQLDIPIPPLSSSCDFSQ--LPDMLSVTQDETDY 473
Query: 456 LWYTASIHVMPGQGKE-----VFLNIESLGHAALVFVNKKLVAFGYGNHD---FAN---- 503
+WY +S +P KE V L IE + +F+N++ + + D FAN
Sbjct: 474 MWYISSA-TLPVSSKEFTCEKVLLQIE-MADLIHLFINQQYMGSSWIKIDDERFANGKNG 531
Query: 504 ----------------FLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVIL 547
F N K+ ++ + +L ++ L GA + GLF +
Sbjct: 532 FRFSIEFENSVYPQPVFSSNSKLYVSILVCSLGLIKGEFQLWK-GATMEKEKKGLFKQPI 590
Query: 548 IDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL----IWYKT 603
I +L + + L + S+F K+ + V+K L +YK
Sbjct: 591 IHFVVKHSELETETIPLSFTSSWAMMPLSIMKDHQSAFVKEYNIKNVDKPLSLGPTYYKQ 650
Query: 604 TFLAPEG-----KGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
T + + K L ++ +SM KG N GRY+S + G + R S
Sbjct: 651 TVIINKAMIDALKWGLVIDFSSMTKGIFRWNSFCCGRYYSIQVL---GKERDPSLRNS-- 705
Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLL 704
+ H + Q YHIP+ V N L + EE+GG+ ++ +L
Sbjct: 706 -PVQEDHLFKSTQRYYHIPKG-VLQERNELEVFEEIGGNFMQLRIL 749
>gi|116782829|gb|ABK22678.1| unknown [Picea sitchensis]
Length = 317
Score = 272 bits (695), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 149/317 (47%), Positives = 196/317 (61%), Gaps = 11/317 (3%)
Query: 510 IELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVE 569
I L G N + +LS+MVGL N G F+ AG+ +V L K+G RDLS W YQ+G+
Sbjct: 6 ISLIPGTNDIALLSVMVGLPNSGGHFERKIAGISTVTLRGFKDGTRDLSQELWTYQIGLL 65
Query: 570 GEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVN 629
GE + S W ST N L WYK P+G P+ L+L+SMGKGQAW+N
Sbjct: 66 GEMSTIYSDVGFISVNWTSSST--PNPPLTWYKAVIDVPDGDEPVILDLSSMGKGQAWIN 123
Query: 630 GQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLV 689
G+ IGRYW ++LAP C+K CDYRG+Y KC +CGQP+QTLYH+PR+W+ P NLLV
Sbjct: 124 GEHIGRYWISFLAPLGDCSK-CDYRGNYSLHKCATNCGQPSQTLYHVPRSWLRPTGNLLV 182
Query: 690 IHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSW---KPNLGVVSSS--PQVRLACE 744
+ EE GGDPSK+SLLT++ +C+ E PP + SW K N V+ + P ++L C
Sbjct: 183 LFEETGGDPSKVSLLTRSIDSVCAHAFETHPPSIQSWQKTKVNSEVLRENVEPSLQLDCS 242
Query: 745 RGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSA 803
G I++I FAS+G P+G CG+F G CH ++ V+KAC+GQ CSI S G
Sbjct: 243 VGRRISSIKFASFGNPKGVCGNFMKGTCHSVESEKAVEKACLGQHGCSITNSPKEFG--G 300
Query: 804 GACPGLLKALAVEAHCS 820
AC G +K+LAVEA CS
Sbjct: 301 DACVGTVKSLAVEATCS 317
>gi|10047451|gb|AAG12249.1|AF184080_1 beta-galactosidase [Prunus armeniaca]
Length = 376
Score = 269 bits (688), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 148/359 (41%), Positives = 213/359 (59%), Gaps = 14/359 (3%)
Query: 469 GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGL 528
GK+ L ++S GHA VFVN + +G + F K + L GIN + +LS+ VGL
Sbjct: 13 GKKPTLTVQSAGHALHVFVNGQFSGSAFGTREQRQFTFAKPVHLRAGINKIALLSIAVGL 72
Query: 529 QNYGAWFDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK 587
N G ++ G+ + +D L G++DL+ +W +VG++GE + L + +S W
Sbjct: 73 PNVGLHYESWKTGILGPVFLDGLGQGRKDLTMQKWFNKVGLKGEAMDLVSPNGGSSVDWI 132
Query: 588 QGSTLPVNK-SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTG 646
+GS K +L WYK F AP G PLAL++ SMGKGQ W+NGQSIGRYW AY + G
Sbjct: 133 RGSLATQTKQTLKWYKAYFNAPGGDEPLALDMRSMGKGQVWINGQSIGRYWMAY---ANG 189
Query: 647 CTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
C Y G++ +KCQ CGQP Q YH+PR+W+ P +NL+V+ EELGGDPSKI+L+ +
Sbjct: 190 DCSLCSYIGTFRPTKCQLGCGQPTQRWYHVPRSWLKPTKNLMVMFEELGGDPSKITLVKR 249
Query: 707 TGQHICSFVSEADPPP----VDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEG 762
+ +C+ + E P +DS + + + + QV L C G I++I FAS+G P G
Sbjct: 250 SVAGVCADLQEHHPNAEKFDIDSHEESKTLHQA--QVHLQCVPGQSISSIKFASFGTPTG 307
Query: 763 NCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
CGSF+ G CH + IV+K C+G+ C + VS++ G CP +LK L+VEA CS
Sbjct: 308 TCGSFQQGTCHATNSHAIVEKNCIGRESCLVTVSNSIFGTD--PCPNVLKRLSVEAVCS 364
>gi|297797852|ref|XP_002866810.1| hypothetical protein ARALYDRAFT_912308 [Arabidopsis lyrata subsp.
lyrata]
gi|297312646|gb|EFH43069.1| hypothetical protein ARALYDRAFT_912308 [Arabidopsis lyrata subsp.
lyrata]
Length = 448
Score = 258 bits (660), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 125/266 (46%), Positives = 167/266 (62%), Gaps = 22/266 (8%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
VTYD +L+I+GKR +L S S+HYPRSTP++WP +I K++ GGL I+TYVFWN HEP
Sbjct: 42 VTYDGTSLIINGKRELLFSVSVHYPRSTPDMWPSIIDKARIGGLNTIQTYVFWNVHEPEH 101
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
+Y F+GRFDLV F+K +QE GL++ LR+GP+ AEWN+GG P WL +P + FRT N P
Sbjct: 102 RKYDFKGRFDLVTFIKLIQEKGLYVTLRLGPFIQAEWNHGGLPYWLREVPEVYFRTDNEP 161
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
FKE +R++ KI+ +MK+E L ASQ L ENE V+ AY GE Y+KWAA+
Sbjct: 162 FKEHTERYVRKILGMMKEEKLLASQRRSHHLG-TENECNAVQLAYKENGERYIKWAANLV 220
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
++ +PWVMC+Q +A D +IN CNG +C F G
Sbjct: 221 ESMKLGIPWVMCKQNNASDNLINACNGRHC---------------------FEFLGILQL 259
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYM 270
ED+AF+VAR+F G+ NYYM
Sbjct: 260 IEQSEDIAFSVARYFSKNGSHVNYYM 285
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 53/107 (49%), Gaps = 8/107 (7%)
Query: 674 YHIPRTWV--HPGENLLVI-HEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNL 730
YHIPR+++ +N+LVI EE G I + ICS+V E P V SWK
Sbjct: 290 YHIPRSFMKEEKKKNMLVILEEEPGVKLEAIDFVLVNRDTICSYVGEDYPVSVKSWKRER 349
Query: 731 -GVVSSSPQVRLA----CERGWHIAAINFASYGIPEGNCGSFRPGAC 772
+ S S +RL C + A+ FAS+G P G CG+F G C
Sbjct: 350 PKIASRSKDMRLKAVMKCPPEKQMVAVEFASFGDPTGTCGNFTMGKC 396
>gi|413925746|gb|AFW65678.1| hypothetical protein ZEAMMB73_601729 [Zea mays]
Length = 402
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 144/380 (37%), Positives = 213/380 (56%), Gaps = 15/380 (3%)
Query: 267 NYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLIS 326
NYYMY GGTNFGRT+ ++ YD +AP+DE+G ++PKWGHLR+LH A+KLC++ L+
Sbjct: 3 NYYMYHGGTNFGRTSAAFVMPKYYD-EAPLDEFGLYKEPKWGHLRDLHLALKLCKKALLW 61
Query: 327 SDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDC 385
+ +KLG + EA ++ C AFL+N+++ D +TF G YF+P S+SIL DC
Sbjct: 62 GKTSTEKLGKQFEARVFEIPEQKVCVAFLSNHNTKDDVTLTFRGQSYFVPRHSISILADC 121
Query: 386 KNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQ 445
K VVF T V +Q N FA Q N + + EEKV +
Sbjct: 122 KTVVFGTQHVNAQHNQRTFHFADQTTQNNVWQM-----FDEEKVPKYKQSKIRLRKAGDL 176
Query: 446 INTTKDTSDYLWYTASIHV----MP-GQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHD 500
N TKD +DY+WYT+S + MP + + L + S GHA++ FVN K V G+G
Sbjct: 177 YNLTKDKTDYVWYTSSFKLEADDMPIRRDIKTVLEVNSHGHASVAFVNTKFVGCGHGTKM 236
Query: 501 FANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSG 560
F + K ++L +G+N + +L+ +G+ + GA+ + AG+ V + L G DL++
Sbjct: 237 NKAFTLEKPMDLKKGVNHVAVLASTMGMMDSGAYLEHRLAGVDRVQIKGLNAGTLDLTNN 296
Query: 561 EWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLAS 620
W + VG+ GE + S WK ++ L WYK F P G+ P+ L++++
Sbjct: 297 GWGHIVGLVGEQKQIYTDKGMGSVTWKPAVN---DRPLTWYKRHFDMPSGEDPIVLDMST 353
Query: 621 MGKGQAWVNGQSIGRYWSAY 640
MGKG +VNGQ IGRYW +Y
Sbjct: 354 MGKGLMFVNGQGIGRYWISY 373
>gi|356554933|ref|XP_003545795.1| PREDICTED: beta-galactosidase 15-like [Glycine max]
Length = 288
Score = 252 bits (643), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 117/188 (62%), Positives = 143/188 (76%), Gaps = 1/188 (0%)
Query: 153 IILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGF 212
++L V G +E YG GG+ Y KWAA A++L VPWVMC+Q+DAP II+TCN +
Sbjct: 32 LVLGTVSLGVGAIENEYGKGGKEYRKWAAKKALSLGVGVPWVMCRQQDAPYDIIDTCNAY 91
Query: 213 YCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYF 272
YCDGF PNS +KP MWTEN+ GW+ +G +P RPVEDLAFAVA FF+ GG+FQNYYMYF
Sbjct: 92 YCDGFKPNSHNKPTMWTENWDGWYTQWGERLPHRPVEDLAFAVACFFQRGGSFQNYYMYF 151
Query: 273 GGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSD-PTH 331
G TNFGRTAGGPL TSYDY A IDEYG +R+PKWGHL++LH A+KLCE L+++D PT+
Sbjct: 152 GRTNFGRTAGGPLQITSYDYVASIDEYGQLREPKWGHLKDLHAALKLCEPALVATDSPTY 211
Query: 332 QKLGAKLE 339
KLG E
Sbjct: 212 IKLGPNQE 219
>gi|294948459|ref|XP_002785761.1| beta-galactosidase, putative [Perkinsus marinus ATCC 50983]
gi|239899809|gb|EER17557.1| beta-galactosidase, putative [Perkinsus marinus ATCC 50983]
Length = 770
Score = 251 bits (640), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 204/702 (29%), Positives = 316/702 (45%), Gaps = 111/702 (15%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP- 62
+VTYD RA IDG R +L GSIHYPR + W ++ + GL ++ YVFWNYHEP
Sbjct: 50 SVTYDSRAFKIDGVRTLLLGGSIHYPRVAVDEWEPMLEEMGRDGLNHVQLYVFWNYHEPR 109
Query: 63 ----------IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHF 112
+ +Y F GR DL+ F++ + LF+ LRIGPY CAEW +GG P+WL
Sbjct: 110 PPRYDQLKDRLEHKYDFSGRGDLLGFIRAAAKKDLFVSLRIGPYVCAEWAFGGLPLWLRD 169
Query: 113 IPGIQFRT--------------------TNNPFKEEMKRFLAKIIDLMKQENLFASQGGP 152
+ G+ FR+ + +P+++ M F+ +I ++K+ NL A+QGGP
Sbjct: 170 VEGMCFRSICGYNGSPGKCKPWEGGKFRSCDPWRKYMADFVMEIGRMVKEANLMAAQGGP 229
Query: 153 IILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGF 212
+IL Q+ENEYG+ + G Y+ W + + L VPWVMC A + +N CNG
Sbjct: 230 VILGQLENEYGH----HSDAGRAYIDWVGELSFGLGLDVPWVMCNGISA-NGTLNVCNGD 284
Query: 213 YC-DGFTPNS----PSKPIMWTENYSGWFLSFGYAV--PFRPVEDLAFAVARFFETGGTF 265
C D + + P +P+ WTEN GWF ++G AV R E++A+ +A++ GG+
Sbjct: 285 DCADEYKTDHDKRWPDEPLGWTEN-EGWFDTWGGAVGNSKRSAEEMAYVLAKWVAVGGSH 343
Query: 266 QNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLI 325
NYYM++GG + + L +Y G +PK HL+ LH+ + L+
Sbjct: 344 HNYYMWYGGNHLAQWGAASLT-NAYADGVNFHSNGLPNEPKRSHLQRLHEVLGKLNGELM 402
Query: 326 SSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYD-SSSDANVTFNGNVYFLPAWSVSIL-P 383
+ H + +LE + AFL S S V + Y + V ++ P
Sbjct: 403 QVEDRHSVMPVQLENGVEVYEWTAGLAFLHRPACSGSPVEVHYAKATYSIACREVLVVDP 462
Query: 384 DCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLA 443
V+F TA V + P + V L A +S +E++ + G + +
Sbjct: 463 SSSTVLFATASV-------EPPPELVRRVVATLTADR-WSMRKEEL-LHGMATVEGREPV 513
Query: 444 EQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFAN 503
E + + +DY+ Y ++ G V L I+S ++ + + D A+
Sbjct: 514 EHLRVSGLDTDYVTYKTTVTATEGV-TNVSLEIDS-----------RISQVFHVSVDNAS 561
Query: 504 FLINKKIELNEGINT------------------LDILSMMVGLQN---YGAWFDVAGAGL 542
L +++N+G NT L ILS +G++N YGA L
Sbjct: 562 SLAATVMDVNKG-NTEWTAVAQLHNLTAGRTYDLWILSESLGVENGMLYGA-PAATEPSL 619
Query: 543 FSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG-STLPVNKSL--I 599
I D++ ++ + G W G++GE G QG + LP SL
Sbjct: 620 QKGIFGDIRLNEKSIRKGRWSMVKGLDGEVDG------------GQGKAELPCCDSLGPA 667
Query: 600 WYKTTFLAPEGKGP-----LALNLASMGKGQAWVNGQSIGRY 636
W+ F + L L L G W+NG IGR+
Sbjct: 668 WFVAGFTLHSVRSKSISLTLPLGLPQQAGGHIWLNGVDIGRW 709
>gi|449018329|dbj|BAM81731.1| probable beta-galactosidase [Cyanidioschyzon merolae strain 10D]
Length = 777
Score = 246 bits (628), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 141/339 (41%), Positives = 194/339 (57%), Gaps = 27/339 (7%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE-- 61
+TYD R+L I+GK SG++HY RS P WP++ R + GL +ETYVFW HE
Sbjct: 9 EITYDSRSLRINGKPFFCLSGAVHYVRSHPSAWPQIFRCMRRDGLNTVETYVFWGDHEFE 68
Query: 62 -----PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFI--- 113
+ F G DLVRF++ + GL LR+GPY CAE NYGGFP WL +
Sbjct: 69 PPEMPDAEPRADFSGPRDLVRFLRCAKLHGLNAILRLGPYVCAEVNYGGFPWWLRQVCEK 128
Query: 114 ---PGIQFRTTNNPFKEEMKRFLAKIID-LMKQENLFASQGGPIILAQVENEYGNVEWAY 169
++FRT + + +++R+L ++D ++K +FA QGGP+ILAQ+ENEY + +Y
Sbjct: 129 GSSKPVRFRTWDPAYCAQVERWLKYLVDHVLKPARVFAPQGGPVILAQIENEYAMIAESY 188
Query: 170 GVGGELYVKWAADTAVNLNTSVPWVMC----QQEDAPDPIINTCNGFYCDGFTPN----- 220
G G+ Y+ W A A L VP VMC Q+E +I T N FY +
Sbjct: 189 GPDGQQYLDWIASLANQLALGVPLVMCYGASQRESGR--VIETINAFYAHEHVESLRRAQ 246
Query: 221 -SPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR 279
+ +P++WTE ++GW+ +G R DLA+AV RF GG NYYMYFGGTN+ R
Sbjct: 247 GANPQPLLWTECWTGWYDVWGAPHHRRDAADLAYAVLRFLAAGGAGINYYMYFGGTNWRR 306
Query: 280 TAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK 318
L ATSYDYDAP++EY + K HLR LH++I+
Sbjct: 307 ENTMYLQATSYDYDAPLNEY-VMETTKSRHLRRLHESIQ 344
>gi|449534351|ref|XP_004174126.1| PREDICTED: beta-galactosidase-like, partial [Cucumis sativus]
Length = 154
Score = 245 bits (626), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 106/154 (68%), Positives = 136/154 (88%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+VTYDH+A++I+G+RR+L SGSIHYPRSTP++WP+LI+K+K+GGL++IETYVFWN HEP
Sbjct: 1 SVTYDHKAIIINGQRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDIIETYVFWNGHEPS 60
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+YYFE R+DLVRF+K VQ+AGL++HLRIGPY CAEWNYGGFP+WL F+PGI FRT N
Sbjct: 61 PDKYYFEERYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYGGFPLWLKFVPGIAFRTDNA 120
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQ 157
PFK M++F+ KI+D+MK E LF +QGGPIIL+Q
Sbjct: 121 PFKAAMQKFVYKIVDMMKWEKLFHTQGGPIILSQ 154
>gi|343963202|gb|AEM72517.1| beta-galactosidase [Diospyros kaki]
Length = 172
Score = 243 bits (621), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 113/172 (65%), Positives = 131/172 (76%)
Query: 104 GGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYG 163
GGFPVWL ++PGI FRT N PFK M+ F KI++LMK ENLF SQGGPIIL+Q+ENEYG
Sbjct: 1 GGFPVWLKYVPGISFRTDNEPFKNAMQGFTEKIVNLMKSENLFESQGGPIILSQIENEYG 60
Query: 164 NVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPS 223
G G YV WAA+ AV L T VPWVMC++EDAPDP+INTCNGFYCD F+PN P
Sbjct: 61 PQGKILGDAGHKYVTWAANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDSFSPNRPY 120
Query: 224 KPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGT 275
KP +WTE +SGWF FG + RPV+DLAFAVARF + GG+F NYYMY GGT
Sbjct: 121 KPTIWTEAWSGWFTEFGGPIHERPVQDLAFAVARFIQKGGSFFNYYMYHGGT 172
>gi|183604893|gb|ACC64533.1| beta-galactosidase 11 [Oryza sativa Indica Group]
Length = 446
Score = 243 bits (619), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 149/472 (31%), Positives = 225/472 (47%), Gaps = 45/472 (9%)
Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
D V F G +++P+ SVSIL DCK VV+NT +V Q + + + N +
Sbjct: 2 DGTVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHSERSFHTTDETSKNNV------ 55
Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MP-GQGKEVFLNI 476
+ Y E + EQ N TKDTSDYLWYT S + +P + + I
Sbjct: 56 WEMYSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDIRPVIQI 115
Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
+S HA + F N V G G+ +F+ K ++L GIN + +LS +G+++ G
Sbjct: 116 KSTAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKDSGGELV 175
Query: 537 VAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST-LPVN 595
G+ ++ L G DL W ++ +EGE + WK LP+
Sbjct: 176 EVKGGIQDCVVQGLNTGTLDLQGNGWGHKARLEGEDKEIYTEKGMAQFQWKPAENDLPIT 235
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
WYK F P+G P+ ++++SM KG +VNG+ IGRYW++++ +
Sbjct: 236 ----WYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWTSFITLA----------- 280
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
G P+Q++YHIPR ++ P NLL+I EE G P I + T IC F+
Sbjct: 281 -----------GHPSQSVYHIPRAFLKPKGNLLIIFEEELGKPGGILIQTVRRDDICVFI 329
Query: 716 SEADPPPVDSWKPNLGVV-----SSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
SE +P + +W+ + G + +S + L C I + FAS+G PEG CG+F G
Sbjct: 330 SEHNPAQIKTWESDGGQIKLIAEDTSTRGTLNCPPKRTIQEVVFASFGNPEGACGNFTAG 389
Query: 771 ACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCSI 821
CH D IV+K C+G+ C +PV + G CP LAV+ C +
Sbjct: 390 TCHTPDAKAIVEKECLGKESCVLPVVNTVYGADIN-CPATTATLAVQVRCKV 440
>gi|16973314|emb|CAC84109.1| putative galactosidae, partial [Gossypium hirsutum]
Length = 383
Score = 240 bits (612), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 148/416 (35%), Positives = 218/416 (52%), Gaps = 44/416 (10%)
Query: 294 APIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAA 352
P+DE+G R+PKWGHL+++H+A+ LC+ L PT KLG +A ++ + ++ CAA
Sbjct: 4 GPLDEFGLQREPKWGHLKDVHRALSLCKRALFWGFPTTLKLGPDQQAIVWQQPGTSACAA 63
Query: 353 FLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNV 412
LAN ++ +V F G LPA S+S+LPDCK VVFNT V +Q N+ +N
Sbjct: 64 LLANNNTRLAQHVNFRGQDIRLPARSISVLPDCKTVVFNTQLVTTQHNS--------RNF 115
Query: 413 NELLLASSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MP 466
+A+ F+W Y E + F P E + TKDT+DY WYT S+ + +P
Sbjct: 116 VRSEIANKNFNWEMYREVPPVGLGFKFDVP--RELFHLTKDTTDYAWYTTSLLLGRRDLP 173
Query: 467 GQGK-EVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMM 525
+ L + SLGH +VN + +G+ +F+ + L EG N + +L +
Sbjct: 174 MKKNVRPVLRVASLGHGIHAYVNGEYAGSAHGSKVEKSFVCRELSSLKEGENHIALLGYL 233
Query: 526 VGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSF 585
VGL + GA+ + AG S+ ++ L G D+S W +QVG +GE L + S
Sbjct: 234 VGLPDSGAYMEKRFAGPRSITILGLNTGTLDISQNGWGHQVGTDGEKKKLFTEEGSKSVQ 293
Query: 586 WKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPST 645
W + L WYK F APEG P+A+ + MGKG WVNG+SIGRYW+ YL+P
Sbjct: 294 WTKPDQ---GGPLTWYKGYFDAPEGDNPVAIVMTGMGKGMVWVNGRSIGRYWNNYLSP-- 348
Query: 646 GCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKI 701
+P Q+ YHIPR ++ P +NL+V+ EE GG+P +
Sbjct: 349 --------------------LKKPTQSEYHIPRAYLKP-KNLIVLLEEEGGNPKDV 383
>gi|218188529|gb|EEC70956.1| hypothetical protein OsI_02569 [Oryza sativa Indica Group]
Length = 480
Score = 238 bits (607), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 133/326 (40%), Positives = 182/326 (55%), Gaps = 16/326 (4%)
Query: 496 YGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILID-LKNGK 554
YG+ D ++L G NT+ LS+ VGL N G F+ AG+ + +D L G+
Sbjct: 168 YGSVDDPKLTYTGNVKLWAGSNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNEGR 227
Query: 555 RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPL 614
RDL+ +W YQVG++GE L +S +++ W + N + F AP+G PL
Sbjct: 228 RDLTWQKWTYQVGLKGESTTLHSLSGSSTVEWGEPVQNASNMAF------FNAPDGDEPL 281
Query: 615 ALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLY 674
AL+++SMGKGQ W+NGQ IGRYW Y A +G CDYRG YD +KCQ +CG +Q Y
Sbjct: 282 ALDMSSMGKGQIWINGQGIGRYWPGYKA--SGNCGTCDYRGEYDETKCQTNCGDSSQRWY 339
Query: 675 HIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVS 734
H+PR+W+ P NLLVI EE GGDP+ IS++ ++ +C+ VSE P + +W
Sbjct: 340 HVPRSWLSPTGNLLVIFEEWGGDPTGISMVKRSIGSVCADVSEWQ-PSMKNWHTK---DY 395
Query: 735 SSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIP 793
+V L C+ G I I FAS+G P+G+CGS+ G CH I K CVGQ C +
Sbjct: 396 EKAKVHLQCDNGQKITEIKFASFGTPQGSCGSYTEGGCHAHKSYDIFWKNCVGQERCGVS 455
Query: 794 VSSAYLGVSAGACPGLLKALAVEAHC 819
V G CPG +K VEA C
Sbjct: 456 VVPEIFG--GDPCPGTMKRAVVEAIC 479
Score = 204 bits (518), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 113/151 (74%)
Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN 188
M++F KI+++MK E LF QGGPIIL+Q+ENE+G +EW G + Y WAA+ AV LN
Sbjct: 1 MQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAVALN 60
Query: 189 TSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPV 248
TSVPW+MC+++DAPDPIINTCNGFYCD F+PN P KP MWTE ++ W+ FG VP RPV
Sbjct: 61 TSVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTAWYTGFGIPVPHRPV 120
Query: 249 EDLAFAVARFFETGGTFQNYYMYFGGTNFGR 279
EDLA+ VA+F + GG+F NYYM+ F +
Sbjct: 121 EDLAYGVAKFIQKGGSFVNYYMFLNLRGFTK 151
>gi|357483853|ref|XP_003612213.1| Beta-galactosidase [Medicago truncatula]
gi|355513548|gb|AES95171.1| Beta-galactosidase [Medicago truncatula]
Length = 418
Score = 237 bits (604), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 132/319 (41%), Positives = 179/319 (56%), Gaps = 39/319 (12%)
Query: 24 GSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQ 83
GS+HYPR PE+WP++ +K+K Q+ FEG +DL++F+K +
Sbjct: 11 GSVHYPRCPPEMWPDIFKKAK---------------------QFNFEGNYDLIKFIKMIG 49
Query: 84 EAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQE 143
HL + ++ E P+WL IP I FR+ N PF M++F II M+ E
Sbjct: 50 IMICMQHLEL-VHSLKE-----LPIWLREIPNIIFRSDNQPFMYHMEQFTKMIIKKMRDE 103
Query: 144 NLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPD 203
F + Q+ENE+ V+ AY G YV+W + AV L+T VPW+MC+Q +A
Sbjct: 104 KFFPRK-------QIENEHTAVQQAYKEHGMRYVQWEGNMAVGLDTGVPWIMCKQVNALG 156
Query: 204 PIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFET 261
P++NTCNG YC D F+ PN S + +Y + +FG R ED+A AVARFF
Sbjct: 157 PVMNTCNGRYCGDTFSGPNKNSHLNIHLRHYR--YRAFGDPPSERTAEDIAIAVARFFSK 214
Query: 262 GGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCE 321
GT NYYMY+GGTNFGRT+ V T Y +API EYG R+PKWGH R+LH A+KLC+
Sbjct: 215 KGTMANYYMYYGGTNFGRTSSS-FVTTQYYDEAPIVEYGLPREPKWGHFRDLHDALKLCQ 273
Query: 322 EYLISSDPTHQKLGAKLEA 340
+ L+ Q LG LE
Sbjct: 274 KALLWGTQPVQMLGKDLEV 292
Score = 86.7 bits (213), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 47/121 (38%), Positives = 59/121 (48%), Gaps = 5/121 (4%)
Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
D QK G LYH PR + P N LV+ EE+GG I +LT ICS E
Sbjct: 289 DLEVGQKQFGSYVSMLYHTPRAILQPKNNFLVVLEEMGGKLDGIEILTVNRDTICSIAGE 348
Query: 718 ADPPPVDSWKPNLGVVSSS-----PQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
PP V++W GV+ ++ P L C I ++FASYG P GNCG F G C
Sbjct: 349 HYPPNVETWSRYKGVIRTNVDTPKPAANLVCLDNKTITQVDFASYGDPVGNCGHFILGKC 408
Query: 773 H 773
+
Sbjct: 409 N 409
>gi|343963204|gb|AEM72518.1| beta-galactosidase [Diospyros kaki]
Length = 173
Score = 236 bits (601), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 104/169 (61%), Positives = 127/169 (75%)
Query: 105 GFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN 164
GF ++PGI FRT N PFK M++F KI+++MK E LF QGGPII++Q+ENEYG
Sbjct: 3 GFSCLAQYVPGIAFRTDNGPFKAAMQKFTEKIVNMMKSEKLFEPQGGPIIMSQIENEYGP 62
Query: 165 VEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSK 224
VEW G G+ Y KWAA AV LNT VPW+MC+QEDAPDP+I+TCNGFYC+GF PN K
Sbjct: 63 VEWEIGAPGKSYTKWAAQMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKNYK 122
Query: 225 PIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFG 273
P MWTEN++GW+ FG P+RPVEDLAF+VARF + G+F NYYMY G
Sbjct: 123 PKMWTENWTGWYTKFGGPAPYRPVEDLAFSVARFIQNNGSFVNYYMYHG 171
>gi|62529271|gb|AAX84941.1| beta-galactosidase [Prunus persica]
Length = 287
Score = 236 bits (601), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 130/295 (44%), Positives = 175/295 (59%), Gaps = 15/295 (5%)
Query: 283 GPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHI 342
GP +ATSYDYDAP+DEYG R+PKWGHLR+LHKAIK E L+S++P+ LG EAH+
Sbjct: 1 GPFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKSSESALVSAEPSVTSLGNGQEAHV 60
Query: 343 YHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNG 402
+ KS + CAAFLANYD+ S A V+F Y LP WS+SILPDCK V+NTA++ SQ
Sbjct: 61 F-KSKSGCAAFLANYDTKSSAKVSFGNGQYELPPWSISILPDCKTAVYNTARLGSQ---- 115
Query: 403 DHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASI 462
+ Q + + A S+ EE + + L EQIN T+DT+DYLWY I
Sbjct: 116 ----SSQMKMTPVKSALPWQSFVEESASSDESDTTTLDGLWEQINVTRDTTDYLWYMTDI 171
Query: 463 HVMPGQ-----GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGIN 517
+ P + G+ L I S GHA VF+N +L YG + ++ ++L GIN
Sbjct: 172 TISPDEGFIKRGESPLLTIYSAGHALHVFINGQLSGTVYGALENPKLTFSQNVKLRSGIN 231
Query: 518 TLDILSMMVGLQNYGAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGE 571
L +LS+ VGL N G F+ AG+ V L L +G D+S +W Y+ G++GE
Sbjct: 232 KLALLSISVGLPNVGLHFETWNAGVLGPVTLKGLNSGTWDMSRWKWTYKTGLKGE 286
>gi|3021342|emb|CAA06310.1| beta-galactosidase [Cicer arietinum]
Length = 307
Score = 233 bits (593), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 127/298 (42%), Positives = 174/298 (58%), Gaps = 11/298 (3%)
Query: 419 SSAFSW--YEEKVGISG-NRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQG-----K 470
SSAF W Y E SG + S L EQI T+D+SDYLWY +++ P +G +
Sbjct: 12 SSAFDWQSYNEAPASSGIDDSTTANALLEQIKVTRDSSDYLWYMTDVNISPNEGFIKNGQ 71
Query: 471 EVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQN 530
L S GH VFVN + YG + + ++L G N + +LS+ VGL N
Sbjct: 72 YPVLTAMSAGHVLHVFVNGQFSGTAYGGLENPKLTFSNSVKLRVGNNKISLLSVAVGLSN 131
Query: 531 YGAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG 589
G ++ G+ V L L G RDLS +W Y++G++GE + L + ++S W +G
Sbjct: 132 VGLHYETWNVGVLGPVTLKGLNEGTRDLSGQKWSYKIGLKGETLNLHTLIGSSSVQWTKG 191
Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
S+L + L WYK TF AP G PLAL+++SMGKG+ WVNG+SIGR+W AY+A G
Sbjct: 192 SSLVEKQPLTWYKATFDAPAGNDPLALDMSSMGKGEIWVNGESIGRHWPAYIA--RGSCG 249
Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
C+Y G++ KC+ CGQP Q YHIPR+WV+P N LV+ EE GGDPS ISL+ +T
Sbjct: 250 GCNYAGTFTDKKCRTSCGQPTQKWYHIPRSWVNPRGNFLVVLEEWGGDPSGISLVKRT 307
>gi|147778844|emb|CAN67049.1| hypothetical protein VITISV_001154 [Vitis vinifera]
Length = 317
Score = 229 bits (583), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 128/298 (42%), Positives = 174/298 (58%), Gaps = 23/298 (7%)
Query: 530 NYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQ 588
NYGA+ + GAG V L KNG+ DLS W YQVG+ GE+ + I + + W
Sbjct: 26 NYGAFLEKDGAGFKGQVKLTGFKNGEIDLSEYSWTYQVGLRGEFQKIYMIDESEKAEWTD 85
Query: 589 GSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
+ + WYKT F AP G+ P+AL+L SMGKGQAWVNG IGRYW+ +AP GC
Sbjct: 86 LTPDASPSTFTWYKTFFDAPNGENPVALDLGSMGKGQAWVNGHHIGRYWT-RVAPKDGC- 143
Query: 649 KKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTG 708
KCDYRG Y SK YHIPR+W+ NLLV+ EE GG P +IS+ +++
Sbjct: 144 GKCDYRGHYHTSK------------YHIPRSWLQASNNLLVLFEETGGKPFEISVKSRST 191
Query: 709 QHICSFVSEADPPPVDSWKPNLGVVSSS-----PQVRLACERGWHIAAINFASYGIPEGN 763
Q IC+ VSE+ P + +W P+ + +S P++ L C+ G I++I FASYG P+G+
Sbjct: 192 QTICAEVSESHYPSLQNWSPSDFIDQNSKNKMTPEMHLQCDDGHTISSIEFASYGTPQGS 251
Query: 764 CGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
C F G CH + L +V KAC G+ C I + ++ G C G++K LAVEA C+
Sbjct: 252 CQMFSQGQCHAPNSLALVSKACQGKGSCVIRILNSAFG--GDPCRGIVKTLAVEAKCA 307
>gi|300122832|emb|CBK23839.2| unnamed protein product [Blastocystis hominis]
Length = 601
Score = 228 bits (580), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 185/649 (28%), Positives = 294/649 (45%), Gaps = 88/649 (13%)
Query: 91 LRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQG 150
+RIGPY CAEW+ GG PVW++++ G++ R N+ +K+EM ++ + D + + FA +G
Sbjct: 1 MRIGPYVCAEWDNGGIPVWVNYLDGVRLRANNDVWKKEMGDWMKVLTDYTR--DFFADRG 58
Query: 151 GPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCN 210
GPII +Q+ENE W G Y+ W + A +L +VPW+MC D + IN CN
Sbjct: 59 GPIIFSQIENEL----WG---GAREYIDWCGEFAESLELNVPWMMCNG-DTSEKTINACN 110
Query: 211 GFYCDGFTPNSP-------SKPIMWTENYSGWFLSFGYAVP---------FRPVEDLAFA 254
G C + + +P WTEN GWF G A R ED F
Sbjct: 111 GNDCSSYLESHGQSGRILVDQPGCWTEN-EGWFQIHGAASAERDDYEGWDARSAEDYTFN 169
Query: 255 VARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELH 314
V +F + GG++ NYYM+FGG ++G+ AG + Y I +PK H ++H
Sbjct: 170 VLKFMDRGGSYHNYYMWFGGNHYGKWAGNGMT-NWYTNGVMIHSDTLPNEPKHSHTAKMH 228
Query: 315 KAIKLCEEYLISSDP---THQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNV 371
+ + E L++ + L ++ + +F+ N S+D V + V
Sbjct: 229 RMLANIAEVLLNDKAQVNNQKHLNCDNCNAFEYRYGDRLVSFVENSKGSADK-VIYRDIV 287
Query: 372 YFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK--NVNELLLASSAFSWYEEKV 429
Y LPAWS+ +L + NV+F T V P + + + E L F ++ E V
Sbjct: 288 YELPAWSMIVLDEYDNVLFETNNV--------KPVNKHRVYHCEEKL----EFEYWNEPV 335
Query: 430 GI---SGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAALV- 485
R V P EQ+N T+D +++L+Y + E L+I A V
Sbjct: 336 STLSQEAPRVVVSPKANEQLNMTRDLTEFLYYETEVEFPQ---DECTLSIGGTDANAFVA 392
Query: 486 FVNKKLVAFG--YGNHDFANFLINKKIELNEGINTLDILSMMVGLQN-YGAWFDVAGA-G 541
+V+ V + +HD +N ++ +G + L +LS +G+ N + D + A
Sbjct: 393 YVDDHFVGSDDEHTHHD-GWHTMNINMKSGKGKHKLVLLSESLGVSNGMDSNLDPSWASS 451
Query: 542 LFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWY 601
I +K D+ + EW + G+ GE + + WK S + +L WY
Sbjct: 452 RLKGICGWIKLCGNDIFNQEWKHYPGLVGEAKQVFTDEGMKTVTWK--SDVENADNLAWY 509
Query: 602 KTTFLAPEG--KG-PLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
++TF P+G +G + L M +GQA+ NG +IGRYW
Sbjct: 510 RSTFKTPQGLKRGIEVLLRPEGMNRGQAYANGHNIGRYW--------------------- 548
Query: 659 ASKCQKHCGQPAQTLYHIPRTWV--HPGENLLVIHEELGGDPSKISLLT 705
+ G+ Q YHIP+ W+ EN+LV+ E LG +++ T
Sbjct: 549 --MIKDGNGEYTQGFYHIPKDWLKGEGEENVLVLGETLGASDPSVTICT 595
>gi|62321607|dbj|BAD95183.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 275
Score = 224 bits (572), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 121/270 (44%), Positives = 166/270 (61%), Gaps = 11/270 (4%)
Query: 556 DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS-TLPVNKSLIWYKTTFLAPEGKGPL 614
DLS +W YQVG++GE + L + S W S T+ + L W+KT F APEG PL
Sbjct: 2 DLSWQKWTYQVGLKGEAMNLAFPTNTPSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEPL 61
Query: 615 ALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLY 674
AL++ MGKGQ WVNG+SIGRYW+A+ +TG C Y G+Y +KCQ CGQP Q Y
Sbjct: 62 ALDMEGMGKGQIWVNGESIGRYWTAF---ATGDCSHCSYTGTYKPNKCQTGCGQPTQRWY 118
Query: 675 HIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPN---LG 731
H+PR W+ P +NLLVI EELGG+PS +SL+ ++ +C+ VSE P + +W+ G
Sbjct: 119 HVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSGVCAEVSEYH-PNIKNWQIESYGKG 177
Query: 732 VVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHMDV-LPIVQKACVGQIEC 790
P+V L C G IA+I FAS+G P G CGS++ G CH I+++ CVG+ C
Sbjct: 178 QTFHRPKVHLKCSPGQAIASIKFASFGTPLGTCGSYQQGECHAATSYAILERKCVGKARC 237
Query: 791 SIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
++ +S++ G CP +LK L VEA C+
Sbjct: 238 AVTISNSNFG--KDPCPNVLKRLTVEAVCA 265
>gi|223945899|gb|ACN27033.1| unknown [Zea mays]
Length = 296
Score = 223 bits (567), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 122/295 (41%), Positives = 171/295 (57%), Gaps = 13/295 (4%)
Query: 421 AFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVF 473
FSW Y E R+F + L EQ++ T D SDYLWYT +++ + G+
Sbjct: 6 GFSWQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQ 65
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L I S GH+ VFVN + YG +D + +++ +G N + ILS VGL N G
Sbjct: 66 LTIYSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGT 125
Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
++ G+ V L L GKRDLS +W YQ+G+ GE +G+ ++ ++S W +
Sbjct: 126 HYETWNVGVLGPVTLSGLNEGKRDLSDQKWTYQIGLHGESLGVQSVAGSSSVEWGSAAG- 184
Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
+ L W+K F AP G P+AL++ SMGKGQAWVNG+ IGRYWS Y A S+GC C
Sbjct: 185 --KQPLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYWS-YKASSSGC-GGCS 240
Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
Y G+Y +KCQ CG +Q YH+PR+W++P NLLV+ EE GGD S + L+T+T
Sbjct: 241 YAGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVMLEEFGGDLSGVKLVTRT 295
>gi|297734971|emb|CBI17333.3| unnamed protein product [Vitis vinifera]
Length = 447
Score = 221 bits (564), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 156/433 (36%), Positives = 221/433 (51%), Gaps = 36/433 (8%)
Query: 195 MCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLA 252
MC+Q+DAPDP+INTC G C D FT PN P+K + TE L +
Sbjct: 1 MCKQKDAPDPVINTCKGRNCGDTFTGPNRPNKRSVSTEYLETPHLKGQQKI--------- 51
Query: 253 FAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRE 312
+ F GT NYYMY+ TNFGRT T Y +AP+DEYG R+ KWGHLR+
Sbjct: 52 -LHSLFISKNGTLANYYMYYSVTNFGRTTSS-FATTCYYDEAPLDEYGLPRETKWGHLRD 109
Query: 313 LHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYDSSSDANVTFNGNV 371
LH A++L ++ L+ + QKLG LEA IY K SN CA FL N + + T G+
Sbjct: 110 LHAALRLSKKALLWGVTSAQKLGEDLEARIYEKPGSNICATFLLNNITRTPTTTTLRGSK 169
Query: 372 YFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGI 431
Y+LP S+S LPDCK VVFNT V S N PF+ ++NE + + A YEE
Sbjct: 170 YYLPQHSISNLPDCKTVVFNTQTVAS--NYLIFPFSMFDSLNEPNMKTDALPTYEE--CP 225
Query: 432 SGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKK- 490
+ +S V E + TKDT+DYLWYT V+ + +LGH F+N +
Sbjct: 226 TKTKSPV-----ELMTMTKDTTDYLWYTTKKDVL------RVPQVSNLGHVMHAFLNGEY 274
Query: 491 -----LVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSV 545
L +G++ +F+ NK I L G+N + L VGL + G++ + AG+ +V
Sbjct: 275 VMEFYLTGTRHGSNVEKSFVFNKPITLKAGLNQIAPLGATVGLPDSGSYMEHRLAGVHNV 334
Query: 546 ILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS-LIWYKTT 604
+ L DL W ++VG+ G+ + L + S + + L + + L+ ++ T
Sbjct: 335 AIQGLNTRTIDLPKNGWGHKVGLNGDKLHLFTQPPSQSVYHVPRAFLKTSDNLLVLFEET 394
Query: 605 FLAPEGKGPLALN 617
P+G L LN
Sbjct: 395 GRNPDGIEILTLN 407
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/59 (44%), Positives = 38/59 (64%)
Query: 669 PAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWK 727
P+Q++YH+PR ++ +NLLV+ EE G +P I +LT IC ++SE P V SWK
Sbjct: 369 PSQSVYHVPRAFLKTSDNLLVLFEETGRNPDGIEILTLNRDTICCYISEHHPTHVRSWK 427
>gi|62319263|dbj|BAD94489.1| beta-galactosidase [Arabidopsis thaliana]
Length = 172
Score = 221 bits (562), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 101/159 (63%), Positives = 126/159 (79%), Gaps = 1/159 (0%)
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
A+ L+T VPW+MC+QEDAP PII+TCNG+YC+ F PNS +KP MWTEN++GW+ FG AV
Sbjct: 2 ALGLSTGVPWIMCKQEDAPGPIIDTCNGYYCEDFKPNSINKPKMWTENWTGWYTDFGGAV 61
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
P+RPVED+A++VARF + GG+ NYYMY GGTNF RTA G +A+SYDYDAP+DEYG R
Sbjct: 62 PYRPVEDIAYSVARFIQKGGSLVNYYMYHGGTNFDRTA-GEFMASSYDYDAPLDEYGLPR 120
Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHI 342
+PK+ HL+ LHKAIKL E L+S+D T LGAK E I
Sbjct: 121 EPKYSHLKALHKAIKLSEPALLSADATVTSLGAKQEVTI 159
>gi|62321782|dbj|BAD95407.1| galactosidase [Arabidopsis thaliana]
Length = 270
Score = 216 bits (549), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 116/274 (42%), Positives = 165/274 (60%), Gaps = 7/274 (2%)
Query: 550 LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPE 609
L G+RDLS +W Y+VG++GE + L +S ++S W +G+ + + L WYKTTF AP
Sbjct: 1 LNGGRRDLSWQKWTYKVGLKGESLSLHSLSGSSSVEWAEGAFVAQKQPLTWYKTTFSAPA 60
Query: 610 GKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQP 669
G PLA+++ SMGKGQ W+NGQS+GR+W AY A G +C Y G++ KC ++CG+
Sbjct: 61 GDSPLAVDMGSMGKGQIWINGQSLGRHWPAYKA--VGSCSECSYTGTFREDKCLRNCGEA 118
Query: 670 AQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPN 729
+Q YH+PR+W+ P NLLV+ EE GGDP+ I+L+ + +C+ + E V+
Sbjct: 119 SQRWYHVPRSWLKPSGNLLVVFEEWGGDPNGITLVRREVDSVCADIYEWQSTLVNYQLHA 178
Query: 730 LGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVG 786
G V+ P+ L C G I + FAS+G PEG CGS+R G+CH K CVG
Sbjct: 179 SGKVNKPLHPKAHLQCGPGQKITTVKFASFGTPEGTCGSYRQGSCHAHHSYDAFNKLCVG 238
Query: 787 QIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
Q CS+ V+ G CP ++K LAVEA C+
Sbjct: 239 QNWCSVTVAPEMFG--GDPCPNVMKKLAVEAVCA 270
>gi|293334807|ref|NP_001170541.1| uncharacterized protein LOC100384558 [Zea mays]
gi|238005922|gb|ACR33996.1| unknown [Zea mays]
Length = 345
Score = 211 bits (536), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 123/357 (34%), Positives = 189/357 (52%), Gaps = 34/357 (9%)
Query: 471 EVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQN 530
+ L + S GHA++ FVN K V G+G F + K ++L +G+N + +L+ +G+ +
Sbjct: 8 KTVLEVNSHGHASVAFVNTKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAVLASTMGMMD 67
Query: 531 YGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
GA+ + AG+ V + L G DL++ W + VG+ GE + S WK
Sbjct: 68 SGAYLEHRLAGVDRVQIKGLNAGTLDLTNNGWGHIVGLVGEQKQIYTDKGMGSVTWKPAV 127
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
++ L WYK F P G+ P+ L++++MGKG +VNGQ IGRYW +Y
Sbjct: 128 N---DRPLTWYKRHFDMPSGEDPIVLDMSTMGKGLMFVNGQGIGRYWISY---------- 174
Query: 651 CDYRGSYDASKCQKHC-GQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
KH G+P+Q LYHIPR+++ +N+LV+ EE G P I +LT
Sbjct: 175 -------------KHALGRPSQQLYHIPRSFLRQKDNVLVLFEEEFGRPDAIMILTVKRD 221
Query: 710 HICSFVSEADPPPVDSWKPNLGVVSSS-----PQVRLACERGWHIAAINFASYGIPEGNC 764
+IC+F+SE +P + SW+ ++ + P+ L C I + FASYG P G C
Sbjct: 222 NICTFISERNPAHIKSWERKDSQITVTAADLKPRATLTCSPKKLIQQVVFASYGNPMGIC 281
Query: 765 GSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
G++ G+CH +V+KAC+G+ C++PVS+ G CPG LAV+A CS
Sbjct: 282 GNYTIGSCHTPRAKELVEKACLGKRICTLPVSADVYGGDVN-CPGTTATLAVQAKCS 337
>gi|125536445|gb|EAY82933.1| hypothetical protein OsI_38150 [Oryza sativa Indica Group]
Length = 314
Score = 209 bits (533), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 108/224 (48%), Positives = 141/224 (62%), Gaps = 9/224 (4%)
Query: 602 KTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASK 661
+T F P+G P+A++L SMGKGQAWVNG IGRYWS +AP +GC+ C Y G+Y+ K
Sbjct: 83 ETMFSTPKGTDPVAIDLGSMGKGQAWVNGHLIGRYWS-LVAPESGCSSSCYYPGAYNERK 141
Query: 662 CQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPP 721
CQ +CG P Q YHIPR W+ +NLLV+ EE GGDPS ISL + +CS +SE P
Sbjct: 142 CQSNCGMPTQNWYHIPREWLKESDNLLVLFEETGGDPSLISLEAHYAKAVCSRISENYYP 201
Query: 722 PVDSW----KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DV 776
P+ +W V +++P++RL C+ G I+ I FASYG P G C +F G CH
Sbjct: 202 PLSAWSHLSSGRASVNAATPELRLQCDDGHVISEITFASYGTPSGGCLNFSKGNCHASST 261
Query: 777 LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
L +V +ACVG +C+I VS+ G C G+LK LAVEA CS
Sbjct: 262 LDLVTEACVGNTKCAISVSNDVFG---DPCRGVLKDLAVEAKCS 302
>gi|222616996|gb|EEE53128.1| hypothetical protein OsJ_35926 [Oryza sativa Japonica Group]
Length = 314
Score = 209 bits (533), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 108/224 (48%), Positives = 141/224 (62%), Gaps = 9/224 (4%)
Query: 602 KTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASK 661
+T F P+G P+A++L SMGKGQAWVNG IGRYWS +AP +GC+ C Y G+Y+ K
Sbjct: 83 ETMFSTPKGTDPVAIDLGSMGKGQAWVNGHLIGRYWS-LVAPESGCSSSCYYPGAYNERK 141
Query: 662 CQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPP 721
CQ +CG P Q YHIPR W+ +NLLV+ EE GGDPS ISL + +CS +SE P
Sbjct: 142 CQSNCGMPTQNWYHIPREWLKESDNLLVLFEETGGDPSLISLEAHYAKTVCSRISENYYP 201
Query: 722 PVDSW----KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DV 776
P+ +W V +++P++RL C+ G I+ I FASYG P G C +F G CH
Sbjct: 202 PLSAWSHLSSGRASVNAATPELRLQCDDGHVISEITFASYGTPSGGCLNFSKGNCHASST 261
Query: 777 LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
L +V +ACVG +C+I VS+ G C G+LK LAVEA CS
Sbjct: 262 LDLVTEACVGNTKCAISVSNDVFG---DPCRGVLKDLAVEAKCS 302
>gi|77554857|gb|ABA97653.1| Galactose binding lectin domain containing protein, expressed
[Oryza sativa Japonica Group]
Length = 317
Score = 209 bits (533), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 108/224 (48%), Positives = 141/224 (62%), Gaps = 9/224 (4%)
Query: 602 KTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASK 661
+T F P+G P+A++L SMGKGQAWVNG IGRYWS +AP +GC+ C Y G+Y+ K
Sbjct: 83 ETMFSTPKGTDPVAIDLGSMGKGQAWVNGHLIGRYWS-LVAPESGCSSSCYYPGAYNERK 141
Query: 662 CQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPP 721
CQ +CG P Q YHIPR W+ +NLLV+ EE GGDPS ISL + +CS +SE P
Sbjct: 142 CQSNCGMPTQNWYHIPREWLKESDNLLVLFEETGGDPSLISLEAHYAKTVCSRISENYYP 201
Query: 722 PVDSW----KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DV 776
P+ +W V +++P++RL C+ G I+ I FASYG P G C +F G CH
Sbjct: 202 PLSAWSHLSSGRASVNAATPELRLQCDDGHVISEITFASYGTPSGGCLNFSKGNCHASST 261
Query: 777 LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
L +V +ACVG +C+I VS+ G C G+LK LAVEA CS
Sbjct: 262 LDLVTEACVGNTKCAISVSNDVFG---DPCRGVLKDLAVEAKCS 302
>gi|217075719|gb|ACJ86219.1| unknown [Medicago truncatula]
Length = 200
Score = 207 bits (528), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 103/203 (50%), Positives = 135/203 (66%), Gaps = 6/203 (2%)
Query: 621 MGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTW 680
MGKG+AWVNGQSIGRYW Y++P++GCT C+YRG+Y ASKC K+CG+P+QTLYH+PR W
Sbjct: 1 MGKGEAWVNGQSIGRYWPTYISPNSGCTDSCNYRGTYSASKCLKNCGKPSQTLYHVPRAW 60
Query: 681 VHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNL-GVVSSSPQV 739
+ P N V+ EE GGDP+KIS TK + +CS V+E+ PPPVD+W N P +
Sbjct: 61 LKPDSNTFVLFEESGGDPTKISFGTKQIESVCSHVTESHPPPVDTWNSNAESERKVGPVL 120
Query: 740 RLACE-RGWHIAAINFASYGIPEGNCGSFRPGACHMD-VLPIVQKACVGQIECSIPVSSA 797
L C I++I FAS+G P CG++ G+C + L IVQKAC+G C+I VS
Sbjct: 121 SLECPYPNQAISSIKFASFGTPRRTCGNYNHGSCSSNRALSIVQKACIGSSSCNIGVSIN 180
Query: 798 YLGVSAGACPGLLKALAVEAHCS 820
G C G+ K+LAVEA C+
Sbjct: 181 TFG---NPCRGVTKSLAVEAACT 200
>gi|1669595|dbj|BAA13685.1| AR782 [Arabidopsis thaliana]
Length = 206
Score = 206 bits (525), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 105/208 (50%), Positives = 142/208 (68%), Gaps = 9/208 (4%)
Query: 619 ASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPR 678
A GKG AWVNGQSIGRYW +A + GCT+ CDYRGSY A+KC K+CG+P+QTLYH+PR
Sbjct: 2 AGTGKGIAWVNGQSIGRYWPTSIAGNGGCTESCDYRGSYRANKCLKNCGKPSQTLYHVPR 61
Query: 679 TWVHPGENLLVIHEELGGDPSKISLLTK-TGQHICSFVSEADPPPVDSWKPNLGVVS--- 734
+W+ P N+LV+ EE+GGDP++IS TK TG ++C VS++ PPPVD+W + + +
Sbjct: 62 SWLKPSGNILVLFEEMGGDPTQISFATKQTGSNLCLTVSQSHPPPVDTWTSDSKISNRNR 121
Query: 735 SSPQVRLACERGWH-IAAINFASYGIPEGNCGSFRPGACHMD-VLPIVQKACVGQIECSI 792
+ P + L C I +I FAS+G P+G CGSF G C+ L +VQKAC+G C++
Sbjct: 122 TRPVLSLKCPISTQVIFSIKFASFGTPKGTCGSFTQGHCNSSRSLSLVQKACIGLRSCNV 181
Query: 793 PVSSAYLGVSAGACPGLLKALAVEAHCS 820
VS+ G C G++K+LAVEA CS
Sbjct: 182 EVSTRVFGE---PCRGVVKSLAVEASCS 206
>gi|298205259|emb|CBI17318.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 203 bits (516), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 126/354 (35%), Positives = 174/354 (49%), Gaps = 63/354 (17%)
Query: 35 VWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIG 94
+W L++ +KEGG++VIETYVF N HE YYF G +DL++FVK VQ+AG++L L IG
Sbjct: 1 MWSGLVKTAKEGGIDVIETYVFQNGHELSPSNYYFGGWYDLLKFVKIVQQAGMYLILHIG 60
Query: 95 PYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPII 154
P+ EWN+G F+T + PFK M++F+ I+++MK++ LFASQGGPII
Sbjct: 61 PFVATEWNFGTI-----------FQTNSKPFKYHMQKFMTLIVNIMKKDKLFASQGGPII 109
Query: 155 LAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPI-INTCNGFY 213
L Q +NEYG+ + Y GG+ YV WAA+ ++ N VPW+MCQ I I G Y
Sbjct: 110 LTQAKNEYGDTKRIYEDGGKPYVMWAANMVLSHNIGVPWIMCQYSYVDIYIYIVKKEGLY 169
Query: 214 CDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYM--- 270
+ I+ T + + +P L + G YM
Sbjct: 170 SLSYQ----YALILSTLVTHSIVTNSHQILQAKPKCGLKIGLDGLKHLGHRILTDYMKIL 225
Query: 271 ----------------YFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELH 314
Y GGTNFG T+GGP + T+Y+Y+APIDEYG R PK
Sbjct: 226 LFLLLFFFFQKVNYYMYHGGTNFGCTSGGPFITTTYNYNAPIDEYGLARLPK-------- 277
Query: 315 KAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFN 368
C P+ E +Y S AAF++N D D + F
Sbjct: 278 -----C--------PSQ-------EVDVYADSLGGYAAFISNVDEKEDKMIVFQ 311
>gi|223942939|gb|ACN25553.1| unknown [Zea mays]
Length = 199
Score = 201 bits (512), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 106/202 (52%), Positives = 135/202 (66%), Gaps = 5/202 (2%)
Query: 621 MGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTW 680
MGKG+AWVNGQSIGRYW LAP +GC C+YRG+Y +SKC K CGQP+QTLYH+PR++
Sbjct: 1 MGKGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGAYSSSKCLKKCGQPSQTLYHVPRSF 60
Query: 681 VHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVR 740
+ PG N LV+ E GGDPSKIS + + +C+ VSEA P +DSW + P +R
Sbjct: 61 LQPGSNDLVLFEHFGGDPSKISFVMRQTGSVCAQVSEAHPAQIDSWSSQQPMQRYGPALR 120
Query: 741 LACER-GWHIAAINFASYGIPEGNCGSFRPGAC-HMDVLPIVQKACVGQIECSIPVSSAY 798
L C + G I+++ FAS+G P G CGS+ G C L IVQ+AC+G CS+PVSS Y
Sbjct: 121 LECPKEGQVISSVKFASFGTPSGTCGSYSHGECSSTQALSIVQEACIGVSSCSVPVSSNY 180
Query: 799 LGVSAGACPGLLKALAVEAHCS 820
G C G+ K+LAVEA CS
Sbjct: 181 FG---NPCTGVTKSLAVEAACS 199
>gi|166092020|gb|ABY82047.1| beta-galactosidase [Hymenaea courbaril var. stilbocarpa]
Length = 138
Score = 199 bits (506), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 90/138 (65%), Positives = 107/138 (77%)
Query: 157 QVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDG 216
Q+ENEYG VEW G+ Y WAA AV LNT VPWVMC+Q+DAPDP+I+TCNG+YC+
Sbjct: 1 QIENEYGPVEWEIRAPGKAYTAWAAKMAVGLNTGVPWVMCKQDDAPDPVIDTCNGYYCEN 60
Query: 217 FTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTN 276
FTPN KP MWTEN+SGW+ +G AVP RPVED+A++V RF + GG+F NYYMY GGTN
Sbjct: 61 FTPNKNYKPKMWTENWSGWYTEYGGAVPKRPVEDIAYSVTRFIQNGGSFVNYYMYHGGTN 120
Query: 277 FGRTAGGPLVATSYDYDA 294
FGRT G +ATSYDYDA
Sbjct: 121 FGRTYSGLFIATSYDYDA 138
>gi|217070908|gb|ACJ83814.1| unknown [Medicago truncatula]
Length = 200
Score = 197 bits (501), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 102/205 (49%), Positives = 137/205 (66%), Gaps = 10/205 (4%)
Query: 621 MGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTW 680
MGKG+AWVNGQSIGRYW Y+A + GCT C+YRG Y +SKC+K+CG+P+QTLYH+PR++
Sbjct: 1 MGKGEAWVNGQSIGRYWPTYVASNAGCTDSCNYRGPYTSSKCRKNCGKPSQTLYHVPRSF 60
Query: 681 VHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNL---GVVSSSP 737
+ P N LV+ EE GGDP++IS TK + +CS VS++ PP +D W + G V P
Sbjct: 61 LKPNGNTLVLFEENGGDPTQISFATKQLESVCSHVSDSHPPQIDLWNQDTESGGKV--GP 118
Query: 738 QVRLAC-ERGWHIAAINFASYGIPEGNCGSFRPGACHMD-VLPIVQKACVGQIECSIPVS 795
+ L+C I++I FASYG P G CG+F G C + L IV+KAC+G CS+ VS
Sbjct: 119 ALLLSCPNHNQVISSIKFASYGTPLGTCGNFYRGRCSSNKALSIVKKACIGSRSCSVGVS 178
Query: 796 SAYLGVSAGACPGLLKALAVEAHCS 820
+ G C G+ K+LAVEA C+
Sbjct: 179 TDTFG---DPCRGVPKSLAVEATCA 200
>gi|388518087|gb|AFK47105.1| unknown [Lotus japonicus]
Length = 220
Score = 188 bits (478), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 95/207 (45%), Positives = 132/207 (63%), Gaps = 10/207 (4%)
Query: 621 MGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTW 680
MGKGQAWVNG IGRYW+ ++P +GC + CDYRG+Y++ KC +CG+P QTLYH+PR+W
Sbjct: 1 MGKGQAWVNGHHIGRYWT-RVSPKSGCEQVCDYRGAYNSDKCTTNCGKPTQTLYHVPRSW 59
Query: 681 VHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPV------DSWKPNLGVVS 734
+ +NLLVI EE GG+P +IS+ + + +C+ VSE+ P+ D + S
Sbjct: 60 LKASDNLLVIFEETGGNPFRISVKLHSARIVCAKVSESHYQPLHKLMNADLIGHEVSANS 119
Query: 735 SSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIP 793
P++ L C+ G I++I FASYG PEG+C SF G CH + IV KAC G+ CSI
Sbjct: 120 MIPELHLRCQDGRIISSITFASYGNPEGSCQSFSRGNCHAPSSMAIVSKACQGKRSCSIK 179
Query: 794 VSSAYLGVSAGACPGLLKALAVEAHCS 820
+S G C G++K L+VEA C+
Sbjct: 180 ISDTIFG--GDPCQGVMKTLSVEARCT 204
>gi|256831356|ref|YP_003160083.1| beta-galactosidase [Jonesia denitrificans DSM 20603]
gi|256684887|gb|ACV07780.1| Beta-galactosidase [Jonesia denitrificans DSM 20603]
Length = 584
Score = 185 bits (469), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 126/379 (33%), Positives = 183/379 (48%), Gaps = 33/379 (8%)
Query: 10 RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
R +DG+ + SG+IHY R P+ W + IRK++ GL IETYV WN+H P R +++
Sbjct: 9 RDFTLDGEPFQIISGAIHYFRVHPDSWRDRIRKARLMGLNTIETYVAWNFHAPSRDEFHT 68
Query: 70 EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
+G DL RF+ +QE GL +R GPY CAEW+ GG P WL P I R+++ + E+
Sbjct: 69 DGARDLGRFLDIIQEEGLRAIVRPGPYICAEWDNGGLPTWLTATPDIVVRSSDPTYLTEV 128
Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
+R+L + +++ + + GGPIIL QVENEYG AYG Y+ + NL
Sbjct: 129 ERYLEHLAPIVEPRQI--NHGGPIILMQVENEYG----AYG-NDRAYLTHLTNVYRNLGF 181
Query: 190 SVPWVMCQQ------EDAPDPIINTCNGF------YCDGFTPNSPSKPIMWTENYSGWFL 237
VP Q P ++T F + + P+M +E + GWF
Sbjct: 182 VVPLTTVDQPMDDMLAHGTLPDLHTTGSFGSRIDERLATLREHQTTGPLMCSEFWIGWFD 241
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PLVATSY 290
+G V D A A+ R G + N YM+ GGTNFG T G PLV TSY
Sbjct: 242 HWGAHHHTTDVADAANALDRLLGAGASV-NIYMFHGGTNFGFTNGANDKGVYQPLV-TSY 299
Query: 291 DYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDC 350
DYDAP+ E G+ + W + + + E P + L A+ + H+
Sbjct: 300 DYDAPLAEDGYPTEKYWAFREVIARYAPVPAEV-----PAERPLVAERSVPLTHRVGWLD 354
Query: 351 AAFLANYDSSSDANVTFNG 369
+ + D+ TF+G
Sbjct: 355 VPLDVDEAVTCDSPATFDG 373
>gi|449532986|ref|XP_004173458.1| PREDICTED: beta-galactosidase-like, partial [Cucumis sativus]
Length = 213
Score = 182 bits (463), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 97/213 (45%), Positives = 132/213 (61%), Gaps = 5/213 (2%)
Query: 496 YGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VILIDLKNGK 554
YG+ + +K + L +G+N L +LS+ VGL N G FD AG+ V L L G
Sbjct: 3 YGSLEDPRITFSKYVNLKQGVNKLSMLSVTVGLPNVGLHFDTWNAGVLGPVTLKGLNEGT 62
Query: 555 RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPL 614
RD+S +W Y+VG++GE + L + +NS W +GS + L WYKTTF P G PL
Sbjct: 63 RDMSKYKWSYKVGLKGEILNLYSVKGSNSVQWMKGSF--QKQPLTWYKTTFNTPAGNEPL 120
Query: 615 ALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLY 674
AL+++SM KGQ WVNG+SIGRY+ Y+A +G KC Y G + KC +CG P+Q Y
Sbjct: 121 ALDMSSMSKGQIWVNGRSIGRYFPGYIA--SGKCNKCSYTGFFTEKKCLWNCGGPSQKWY 178
Query: 675 HIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
HIPR W+ P NLL+I EE+GG+P ISL+ +T
Sbjct: 179 HIPRDWLSPNGNLLIILEEIGGNPQGISLVKRT 211
>gi|284030079|ref|YP_003380010.1| beta-galactosidase [Kribbella flavida DSM 17836]
gi|283809372|gb|ADB31211.1| Beta-galactosidase [Kribbella flavida DSM 17836]
Length = 582
Score = 182 bits (462), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 114/308 (37%), Positives = 159/308 (51%), Gaps = 28/308 (9%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DG+ + SG++HY R P++W + I K++ GL IETYV WN H P RG + +G
Sbjct: 11 FLLDGEPFRILSGALHYFRVHPDLWADRIDKARRMGLNTIETYVPWNAHSPRRGVFDTDG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF++ V AGL+ +R GPY CAEW+ GG P WL PG+ R F +++
Sbjct: 71 MLDLGRFLEQVAAAGLYAIVRPGPYICAEWDNGGLPAWLFQEPGVGVRRYEPRFLAAVEQ 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+L +++DL++ L QGGP++L QVENEYG A+G E Y++ A +V
Sbjct: 131 YLEQVLDLVRP--LQVDQGGPVLLLQVENEYG----AFGNDPE-YLEAVAGMIRKAGITV 183
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDG------------FTPNSPSKPIMWTENYSGWFLSF 239
P V Q +G G + P+ P+M E + GWF +
Sbjct: 184 PLVTVDQPTGEMLAAGGLDGVLRTGSFGSRSAERLATLREHQPTGPLMCMEFWDGWFDHW 243
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PLVATSYDY 292
G VED A + G + N YM+ GGTNFG T+G P V TSYDY
Sbjct: 244 GGPHHTTSVEDAARELDALLAAGASV-NIYMFHGGTNFGLTSGADDKGVFRPTV-TSYDY 301
Query: 293 DAPIDEYG 300
DAP+DE G
Sbjct: 302 DAPLDEAG 309
>gi|2289790|dbj|BAA21669.1| beta-galactosidase [Bacillus circulans]
Length = 586
Score = 181 bits (460), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 120/362 (33%), Positives = 179/362 (49%), Gaps = 36/362 (9%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
+ +TYD + ++DGK L SG++HY R+ PE W + + K K G +ETYV WN HEP
Sbjct: 2 SQLTYDD-SFLLDGKEIRLLSGAMHYFRTVPEYWEDRLLKLKACGFNTVETYVAWNLHEP 60
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
GQ+ FEG D+VRF+KT ++ GL + +R GP+ CAEW +GGFP WL +P I+ R N
Sbjct: 61 EEGQFVFEGIADIVRFIKTAEKVGLHVIVRPGPFICAEWEFGGFPYWLLTVPNIKLRCFN 120
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
P+ E++ + + + ++ L +S GGPII Q+ENEYG ++G + Y+++ D
Sbjct: 121 QPYLEKVDAYFDVLFERLRP--LLSSNGGPIIALQIENEYG----SFG-NDQKYLQYLRD 173
Query: 183 TAVNLNTSVPWVMCQQEDAPDP----------IINTCN-GFYCDG----FTPNSPSKPIM 227
+ V + D P+P I T N G + P+ P+M
Sbjct: 174 ---GIKKRVGNELLFTSDGPEPSMLSGGMIEGIFETVNFGSRAESAFAQLKQYQPNAPLM 230
Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG---- 283
E + GWF +G R E + + + G+ N+YM GGTNFG G
Sbjct: 231 CMEFWHGWFDHWGEEHHTRSAESVVETLEEILKQNGSV-NFYMAHGGTNFGFYNGANHNE 289
Query: 284 ----PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLE 339
P + TSYDYD + E G + + + + K + L E L + P K
Sbjct: 290 TDYQPTI-TSYDYDGLLTESGDVTEKFYAVRKVFEKYVDLPELNLPAPIPKRLFGKVKFT 348
Query: 340 AH 341
H
Sbjct: 349 EH 350
>gi|320536152|ref|ZP_08036203.1| glycosyl hydrolase family 35 [Treponema phagedenis F0421]
gi|320147005|gb|EFW38570.1| glycosyl hydrolase family 35 [Treponema phagedenis F0421]
Length = 857
Score = 181 bits (459), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 110/335 (32%), Positives = 172/335 (51%), Gaps = 23/335 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+ +D + +IDGKR+ + S ++HY R W +IRK++ GG IETY+ WNYHE
Sbjct: 2 IQFDSNSWIIDGKRKFIISAAVHYFRLPRAEWAAVIRKARLGGCNAIETYIAWNYHETAE 61
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
Q+ F G DL F + G+++ +R GPY CAEW++GG P +L+ GI++R +N
Sbjct: 62 EQWDFSGDKDLAAFFAICHDEGMYVIVRPGPYICAEWDFGGLPYYLNNTDGIEYRCSNAA 121
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
+++ ++R+ +I+ ++++ L GG II+ Q+ENEY A+G ++++ +
Sbjct: 122 YEQAVRRYFERIMPIIRRYQL--GSGGSIIMVQIENEYH----AFGKKDLAHIRFLEELT 175
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDG------FTPNSPSKPIMWTENYSGWFLS 238
+VP V C A + N F+ +P+ E + GW
Sbjct: 176 RGFGITVPLVSCY--GAGRNTVEMRN-FWSGAERAAAVLRERQSGQPLGIMEFWIGWVEH 232
Query: 239 F-GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNF----GRTAGGP--LVATSYD 291
+ G +P E + ++G F NYYMYFGG+NF GRT G + SYD
Sbjct: 233 WGGEPQKHKPAEAVLSHCFEALKSGFVFFNYYMYFGGSNFGSWGGRTIGAHKIFMTQSYD 292
Query: 292 YDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLIS 326
YDAP+DE+GF K+ L LH I E L +
Sbjct: 293 YDAPLDEFGF-ETEKYRLLAVLHTFIAWLENDLTA 326
>gi|219117911|ref|XP_002179741.1| beta-galactosidase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408794|gb|EEC48727.1| beta-galactosidase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 951
Score = 180 bits (456), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 222/835 (26%), Positives = 345/835 (41%), Gaps = 157/835 (18%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+V+YD RA+ I+ KR +L SGS+H R+T W + ++ GL +I Y+FW H+
Sbjct: 149 SVSYDERAIRINDKRVLLLSGSMHPVRATRGTWEHALDEAVYNGLNMITVYIFWGAHQSF 208
Query: 64 RGQYY-----------FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHF 112
R + E +++L +++ GLF+H+RIGPYAC E+ YGG P WL
Sbjct: 209 RDEPLNWSLDGSSIGPKESQWELADALRSAANRGLFIHVRIGPYACGEYTYGGIPEWLPL 268
Query: 113 IPG-IQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENE---------- 161
++ R N P+ + M+ F+A I + NL+A QGGPI++AQ+ENE
Sbjct: 269 QSSTMRMRRLNRPWLDAMEGFVAATITYLSSFNLWAHQGGPILIAQIENELGSGVDGSAA 328
Query: 162 --------------------------YGNV---EWAYGVGGEL-------YVKWAADTAV 185
YG++ + G+ EL Y W +
Sbjct: 329 ANYVVLERDEFNDDKHEDSHLLQLDRYGHILENASSRGMDSELRNATVQDYADWCGNLVA 388
Query: 186 NLNTSVPWVMCQQEDAPDPI--INTCNGF-----YCDGFTPNSPSKPIMWTENYSGWFLS 238
L +V W MC A + I N NG Y D +P +WTE+ G F
Sbjct: 389 RLAPNVIWTMCNGLSAENTISTFNGNNGIDWLEKYGDSGRIQV-DQPAIWTEDEGG-FQL 446
Query: 239 FGYAVPFRPVE--------DLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSY 290
+G P +P + +A ++F GGT NYYM++GG N GR++ ++ +Y
Sbjct: 447 WG-DQPSKPSDYFWGRTSRAMATDALQWFARGGTHLNYYMWWGGYNRGRSSAAGIM-NAY 504
Query: 291 DYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAH------IYH 344
DA + G R PK+ H LH I L+ + PT A +E +
Sbjct: 505 ATDAFLCSSGQRRHPKYDHFLALHLVIADIAAILLHA-PTSLLKNASVEIMDGDDWIVGD 563
Query: 345 KSSNDCAAFLANYDS------SSDANVTFNGN----------VYFLPAWSVSILPDCKNV 388
L +DS +DAN T V+ + +S I+ D V
Sbjct: 564 NQRQFLYQVLDTHDSKQVIFLENDANTTEMARLTGAKADDSLVFVMKPYSSQIVIDGI-V 622
Query: 389 VFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNR-SFVRPDLAEQIN 447
F+++ + ++ + + V LL + SW E G ++ + V + EQ N
Sbjct: 623 AFDSSTISTKAMSFRRTLHYEPAV--LLHLT---SWSEPIAGADTDQNAHVSTEPLEQTN 677
Query: 448 TTKD---TSDYLWY--TASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFA 502
+SDY WY I V+ Q K +++ E A VF++ + NH A
Sbjct: 678 LNSKASISSDYAWYGTDVKIDVVLSQVK-LYIGTEK-ATALAVFIDGAFIGEA-NNHQHA 734
Query: 503 N--FLINKKIE-LNEGINTLDILSMMVGLQNY-GAWFDVAGA---GLFSVILID--LKNG 553
+++ +IE L G + L IL +G N G W + A G+ +LI L +
Sbjct: 735 EGPTVLSIEIESLAAGTHRLAILCESLGYHNLIGRWGAITTAKPKGITGNVLIGSPLLSE 794
Query: 554 KRDLSSGE--WIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGK 611
L G W G+ E + L SF + + +W F +P+
Sbjct: 795 NISLVDGRQMWWSLPGLSVERKAA-RHGLRRESF-EDAAQAEAGLHPLWSSVLFTSPQFD 852
Query: 612 G---PLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQ 668
L L+L S G+G W+NG+ +GRYW+ T DY
Sbjct: 853 STVHSLFLDLTS-GRGHLWLNGKDLGRYWNI-----TRGNSWNDY--------------- 891
Query: 669 PAQTLYHIPRTWVH-PGE-NLLVIHEELGGDPSKISLLTKTGQ--HICSFVSEAD 719
+Q Y +P ++H G+ N L++ + LGGD S LL + + F E D
Sbjct: 892 -SQRYYFLPADFLHLDGQLNELILFDMLGGDHSAARLLLSSIEESETSKFSDEVD 945
>gi|126347898|emb|CAJ89618.1| putative beta-galactosidase [Streptomyces ambofaciens ATCC 23877]
Length = 615
Score = 178 bits (452), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 115/339 (33%), Positives = 169/339 (49%), Gaps = 36/339 (10%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+A +T+ H A + G+ + SGS+HY R PE W + + + GL ++TYV WN+HE
Sbjct: 22 TATLTHTHGAFLRRGRPHRVLSGSLHYFRVHPEQWADRLDRLAALGLNTVDTYVPWNFHE 81
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
G+ F+G DL RFV+ Q AGL + +R GPY CAEW+ GG P WL PG++ R
Sbjct: 82 RRPGEARFDGWRDLARFVRLAQRAGLDVMVRPGPYICAEWDNGGLPAWLTGTPGMRLRAG 141
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
+ P+ + + R+ ++ + + L A GGP++ Q+ENEYG+ YG YV+W
Sbjct: 142 HQPYLDAVARWFDALVPRVAE--LQAVHGGPVVAVQIENEYGS----YG-DDHAYVRWVR 194
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPII---NTCNGFYCDG------------FTPNSPSKPI 226
D V+ + + D P P++ T G P +P
Sbjct: 195 DALVDRGIT---ELLYTADGPTPLMLDGGTVPGELAAATFGSRAAEAAALLRSRRPGEPF 251
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
+ E ++GWF +G R + A V + GG+ + YM GGTNFG AG
Sbjct: 252 LCAEFWNGWFDHWGEKHHVRSRDGAAQEVEEILDAGGSV-SLYMAHGGTNFGLWAGANHD 310
Query: 284 -----PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAI 317
P V TSYD DAP+ E+G + PK+ LRE A+
Sbjct: 311 GGVLRPTV-TSYDSDAPVSEHGAL-TPKFHALRERFAAL 347
>gi|326331074|ref|ZP_08197372.1| beta-galactosidase [Nocardioidaceae bacterium Broad-1]
gi|325951115|gb|EGD43157.1| beta-galactosidase [Nocardioidaceae bacterium Broad-1]
Length = 586
Score = 177 bits (450), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 114/309 (36%), Positives = 165/309 (53%), Gaps = 30/309 (9%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DG+ + SG++HY R P+ W + I K++ GL IETYV WN H P G + +G
Sbjct: 11 FLLDGEPFRILSGALHYFRVHPDQWADRIEKARLMGLNTIETYVPWNAHSPRPGVFDTDG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF++ V++AG++ +R GP+ CAEW+ GG P WL PG+ R F +E+++
Sbjct: 71 ILDLPRFLRLVKDAGMYAIVRPGPFICAEWDNGGLPPWLFREPGVGIRRHEPRFLDEVEK 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+L +++ L++ + GGP++L QVENEYG AYG + Y++ AD V
Sbjct: 131 YLHQVLALVRPHQV--DLGGPVLLVQVENEYG----AYGDDRD-YLQAVADMIRGAGIDV 183
Query: 192 PWVMCQQE-DAP------DPIINTCNGFYCDG------FTPNSPSKPIMWTENYSGWFLS 238
P V Q DA D ++ T + F D + P+ P+M E + GWF
Sbjct: 184 PLVTVDQPVDAMLAAGGLDGVLRTSS-FGSDSANRLRTLRDHQPTGPLMCMEFWDGWFDH 242
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PLVATSYD 291
+G PVE A + G + N YM+ GGTNFG T+G P V TSYD
Sbjct: 243 WGGRHHTTPVEQAAEELDALLAAGASV-NVYMFHGGTNFGLTSGANDKGIYRPTV-TSYD 300
Query: 292 YDAPIDEYG 300
YDAP+DE G
Sbjct: 301 YDAPLDEAG 309
>gi|399022099|ref|ZP_10724178.1| beta-galactosidase [Chryseobacterium sp. CF314]
gi|398085466|gb|EJL76124.1| beta-galactosidase [Chryseobacterium sp. CF314]
Length = 618
Score = 177 bits (449), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 122/335 (36%), Positives = 174/335 (51%), Gaps = 39/335 (11%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++ GK + SG +HYPR E W ++ K GL + TYVFWNYHE G++ F G
Sbjct: 34 FLLSGKPFTIYSGEMHYPRVPSEYWKHRLQMMKSMGLNTVTTYVFWNYHEEEPGKWNFSG 93
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL +F+KT QEAGL++ +R GPY CAEW +GG+P WL ++ RT N F ++ +
Sbjct: 94 EKDLKKFIKTAQEAGLYVIIRPGPYVCAEWEFGGYPWWLQKDKNLEIRTDNKAFLKQCEN 153
Query: 132 FLAKIIDLMKQ-ENLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWA---ADTAVN 186
+ I +L KQ L + GGP+I+ Q ENE+G+ V + E + K++ D V
Sbjct: 154 Y---INELAKQIIPLQINNGGPVIMVQAENEFGSYVAQRKDISLEQHKKYSHKIKDFLVK 210
Query: 187 LNTSVPWVMCQ-----QEDAPDPIINTCNGFYCDGFTPNSPSK---------PIMWTENY 232
+VP+ +E + + + T NG +G N K P M E Y
Sbjct: 211 SGITVPFFTSDGSWLFKEGSIEGALPTANG---EGDVDNLRKKINEFNNGKGPYMVAEYY 267
Query: 233 SGWFLSFGYAVPFRPV--EDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA--- 287
GW +A PF V ED+ + + G +F NYYM GGTNFG T+G
Sbjct: 268 PGWLDH--WAEPFVKVSTEDVVKQTELYIKNGISF-NYYMIHGGTNFGFTSGANYDKNHD 324
Query: 288 -----TSYDYDAPIDEYGFIRQPKWGHLRELHKAI 317
TSYDYDAPI+E G++ PK+ LR++ + I
Sbjct: 325 IQPDLTSYDYDAPINEAGWV-TPKFNALRDIFQKI 358
Score = 44.3 bits (103), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 89/390 (22%), Positives = 145/390 (37%), Gaps = 66/390 (16%)
Query: 317 IKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPA 376
+K+ E ++ + K G ++ H +N ANYD + D Y P
Sbjct: 279 VKVSTEDVVKQTELYIKNGISFNYYMIHGGTNFGFTSGANYDKNHDIQPDLTSYDYDAPI 338
Query: 377 WSVS-ILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNR 435
+ P FN + I Q+ N K + + + F+ + +
Sbjct: 339 NEAGWVTPK-----FNALRDIFQKINRQRLPEVPKPMKVITIPEIKFTKINSLFDVIQQQ 393
Query: 436 SFVRPDLAEQINTTKDTS---DYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLV 492
+P + Q T +D + Y+ Y + +GK L I+ L A V+VN++
Sbjct: 394 ---KPIIHNQPLTFEDLNIGNGYIMYRRKFN-KDQKGK---LEIKGLRDYANVYVNERW- 445
Query: 493 AFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKN 552
G + N + IE+ G + L+IL +G NYGA G+ S ++I N
Sbjct: 446 ---QGELNRVNKKYDLDIEIKAG-DRLEILVENMGRINYGAEIVHNLKGIISPVII---N 498
Query: 553 GKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKG 612
G SG W E L + ++ N S + + F E G
Sbjct: 499 GSE--ISGNW--------EMFPLPFDQFPKHKYQQKDIA---NNSPVISEAEFKLDE-TG 544
Query: 613 PLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQT 672
L++ GKG ++NG++IGRYWS P QT
Sbjct: 545 DTFLDMRKFGKGIVFINGRNIGRYWSK---------------------------AGPQQT 577
Query: 673 LYHIPRTWVHPGENLLVIHEELGGDPSKIS 702
LY +P W+ G+N + I E++ S I+
Sbjct: 578 LY-VPGVWLKKGKNGIQIFEQIFEGSSSIN 606
>gi|62321383|dbj|BAD94714.1| beta-galactosidase [Arabidopsis thaliana]
Length = 199
Score = 177 bits (449), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 88/202 (43%), Positives = 134/202 (66%), Gaps = 4/202 (1%)
Query: 507 NKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG-LFSVILIDLKNGKRDLSSGEWIYQ 565
++KI+L+ G+N + +LS+ VGL N G F+ G L V L + +G D+S +W Y+
Sbjct: 1 SQKIKLHAGVNKIALLSVAVGLPNVGTHFEQWNKGALGPVTLKGVNSGTWDMSKWKWSYK 60
Query: 566 VGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQ 625
+GV+GE + L + ++ W QGS + + L WYK+TF P G PLAL++ +MGKGQ
Sbjct: 61 IGVKGEALSLHTNTESSGVRWTQGSFVAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQ 120
Query: 626 AWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGE 685
W+NG++IGR+W AY A G +C+Y G++DA KC +CG+ +Q YH+PR+W+ +
Sbjct: 121 VWINGRNIGRHWPAYKA--QGSCGRCNYAGTFDAKKCLSNCGEASQRWYHVPRSWLKS-Q 177
Query: 686 NLLVIHEELGGDPSKISLLTKT 707
NL+V+ EELGGDP+ ISL+ +T
Sbjct: 178 NLIVVFEELGGDPNGISLVKRT 199
>gi|340370414|ref|XP_003383741.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Amphimedon
queenslandica]
Length = 689
Score = 177 bits (448), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 115/337 (34%), Positives = 174/337 (51%), Gaps = 35/337 (10%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
++ D + I GK+ + SGSIHY R P+ W + ++K K GL ++TYV WN HEP+
Sbjct: 71 LSLDEDSFYIRGKKTHILSGSIHYFRVVPDYWTDRLKKLKAMGLNTVDTYVSWNLHEPMP 130
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G++ F G ++ F+K L + +R GPY C+EW+ GG P WL P ++ R+ P
Sbjct: 131 GEFDFSGLLNIHEFIKIAHSLELNVIVRPGPYICSEWDNGGLPAWLLHDPNMKIRSNYKP 190
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYG---VGGELYVKWAA 181
+++ +KRF K+ +++ L +S GGPII QVENEY AYG G ++++ A
Sbjct: 191 YQDAVKRFFTKLFEILTP--LQSSYGGPIIAFQVENEYA----AYGPRNATGRHHMQYLA 244
Query: 182 DTAVNLNTSVPWVMCQQED--------APDPIINTCNGFYCD------GFTPNSPSKPIM 227
+ +L ++ ++ AP+ + T N F D P+KP +
Sbjct: 245 NLMRSLGAVELFITSDGQNDIKASSDMAPNNALLTVN-FQNDPSEALNKLLLVQPNKPPL 303
Query: 228 WTENYSGWFLSFGYAVPFRPV--EDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL 285
E ++GWF +G R + L + + GG+F N YM+ GGTNFG G +
Sbjct: 304 VMEYWTGWFDHWGRRHLERTLSPSQLIVNIGTILQMGGSF-NLYMFHGGTNFGFMNGANI 362
Query: 286 V-------ATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
TSYDYDAP+ E G I + K+ LREL K
Sbjct: 363 EGGEYRPDVTSYDYDAPLSEAGDITK-KYTLLRELLK 398
>gi|443684013|gb|ELT88070.1| hypothetical protein CAPTEDRAFT_181391 [Capitella teleta]
Length = 655
Score = 176 bits (447), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 118/339 (34%), Positives = 172/339 (50%), Gaps = 43/339 (12%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
A ++GK+ +L SG++HY R PE W + + K K GL +ETYV WN HE +RG + F
Sbjct: 10 AFFLNGKKTLLLSGAVHYFRVVPEYWRDRLLKVKAAGLNCVETYVAWNAHEAVRGTFDFS 69
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G DL RF++ Q+ GL++ LR GPY C+EW++GG P WL P ++ RT+ P+ E +
Sbjct: 70 GILDLRRFIQIAQDVGLYVLLRPGPYICSEWDFGGLPSWLLHDPEMKVRTSYPPYLEAVD 129
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYG-------------NVEWAYGVGGELYV 177
+LAKI+ L+ +L S+GGPII Q+ENEYG N YG+ L+
Sbjct: 130 AYLAKILPLVN--DLQMSKGGPIIAVQLENEYGSYGDDLDYKLFLKNQFIKYGIEELLF- 186
Query: 178 KWAADTAVNL-NTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPN--SPSKPIMWTENYSG 234
+D + N +P V+ A G+ + N P P+M E +SG
Sbjct: 187 --TSDNGTGIQNGPIPGVL-----ATTNFQEQEQGYLMFEYLRNIKQPGLPMMVMEFWSG 239
Query: 235 WFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG----------- 283
WF +G + V ++ G+ N+YM+ GGTNFG AG
Sbjct: 240 WFDHWGEQHNLCHHAEF-IDVFKWILLEGSSVNFYMFHGGTNFGFMAGANEDFGATNEGG 298
Query: 284 --PLVA--TSYDYDAPIDEYGFIRQPKWGHLRELHKAIK 318
P A TSYDYD P+ E G + + K+ +R + +K
Sbjct: 299 GEPYAADTTSYDYDCPVSESGQLNE-KFYEIRNILSEMK 336
>gi|443689405|gb|ELT91801.1| hypothetical protein CAPTEDRAFT_23316, partial [Capitella teleta]
Length = 596
Score = 176 bits (445), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 119/349 (34%), Positives = 178/349 (51%), Gaps = 35/349 (10%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
+ +DG+R + SGS HY R+ P +W + + + K GL + TYV WN+HEP +GQ+
Sbjct: 8 SFYLDGRRFKIFSGSFHYFRTHPLLWGDRLLRMKAAGLNTVMTYVPWNFHEPRKGQFTLG 67
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN-NPFKEEM 129
G +DLV F++ VQ+ GL+L +R GPY CAEW +GGFP WL P + RT++ P+ E+
Sbjct: 68 GLYDLVSFMEQVQKVGLYLIVRPGPYICAEWEFGGFPSWLLRDPKMNLRTSSYTPYLNEV 127
Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD--TAVNL 187
K++L+++ ++ + GGPII QVENE+G + GV Y+++ ++ NL
Sbjct: 128 KQYLSQLFAVLTK--FTYKHGGPIIAFQVENEFG----SKGVHDPEYLQFLVTQYSSWNL 181
Query: 188 N----TSVPWVMCQQEDAPDPI----INTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSF 239
N TS PD + +N + P +P+M TE ++GWF +
Sbjct: 182 NELLFTSDGKKYLSNGTLPDVLATINLNDHAKEDLEELKEFQPERPLMVTEFWAGWFDHW 241
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG---------------GP 284
G +L + + N+YM+ GGTNFG G GP
Sbjct: 242 GEEHHHYGTTELERELEAILSLNASV-NFYMFIGGTNFGFWNGANYLSYNKDKEASLLGP 300
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQK 333
V TSYDYDA + E+G ++ PK+ +R L K L L PT K
Sbjct: 301 TV-TSYDYDAAVSEWGHVK-PKYNVIRNLLKKYSLTPLDLPDVPPTPMK 347
>gi|224152391|ref|XP_002337230.1| predicted protein [Populus trichocarpa]
gi|222838524|gb|EEE76889.1| predicted protein [Populus trichocarpa]
Length = 144
Score = 175 bits (444), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 75/127 (59%), Positives = 102/127 (80%), Gaps = 1/127 (0%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ NV+YD R+L+I+G+R++L S +IHYPRS P +WPEL++ +KEGG++VIETYVFWN H
Sbjct: 17 FAGNVSYDSRSLIINGERKLLISAAIHYPRSVPAMWPELVKTAKEGGVDVIETYVFWNVH 76
Query: 61 EPIR-GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFR 119
+P +Y+F+GRFDLV+F+ VQEAG++L LRIGP+ AEWN+GG PVWLH++ G FR
Sbjct: 77 QPTSPSEYHFDGRFDLVKFINIVQEAGMYLILRIGPFVAAEWNFGGIPVWLHYVNGTVFR 136
Query: 120 TTNNPFK 126
T N FK
Sbjct: 137 TDNYNFK 143
>gi|365876141|ref|ZP_09415664.1| beta-galactosidase [Elizabethkingia anophelis Ag1]
gi|442588464|ref|ZP_21007275.1| putative exported beta-galactosidase [Elizabethkingia anophelis
R26]
gi|365756153|gb|EHM98069.1| beta-galactosidase [Elizabethkingia anophelis Ag1]
gi|442561698|gb|ELR78922.1| putative exported beta-galactosidase [Elizabethkingia anophelis
R26]
Length = 628
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 116/343 (33%), Positives = 167/343 (48%), Gaps = 44/343 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++GK + SG +HYPR E W ++ K GL + TYVFWNYHE G++ + G
Sbjct: 36 FLLNGKLFSIHSGEMHYPRIPQEYWKHRLQMMKAMGLNAVTTYVFWNYHEENPGKWNWSG 95
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL +F+KT QE GL++ +R GPY CAEW +GG+P WL I G++ R NN F E ++
Sbjct: 96 EKDLKKFIKTAQEVGLYVIIRPGPYVCAEWEFGGYPWWLQNIKGLKIREDNNLFLAETQK 155
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNT- 189
++ ++ + +K +L + GGP+I+ Q ENE+G+ V + + + A L
Sbjct: 156 YITQLYNQVK--DLQITNGGPVIMVQAENEFGSFVAQRKDIPLASHRTYNAKIVKQLKDA 213
Query: 190 --SVP-------WVM-----------CQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWT 229
SVP W+ ED + + N + N+ P M
Sbjct: 214 GFSVPMFTSDGSWLFEGGSVVGALPTANGEDNIENLKKIVNQY-------NNNQGPYMVA 266
Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-- 287
E Y GW + P +A ++ + +F NYYM GGTNFG T G
Sbjct: 267 EFYPGWLAHWAEKFPRVDAGTVARQTDKYLKNDVSF-NYYMVHGGTNFGFTNGANYDKNH 325
Query: 288 ------TSYDYDAPIDEYGFIRQPKWGHLREL---HKAIKLCE 321
TSYDYDAPI E G+ R PK+ LR + H KL E
Sbjct: 326 DIQPDLTSYDYDAPITEAGW-RTPKYDSLRAVISKHTKAKLPE 367
>gi|323358527|ref|YP_004224923.1| beta-galactosidase [Microbacterium testaceum StLB037]
gi|323274898|dbj|BAJ75043.1| beta-galactosidase [Microbacterium testaceum StLB037]
Length = 574
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 113/312 (36%), Positives = 155/312 (49%), Gaps = 36/312 (11%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DG+ + SG++HY R PE W + IR +K GL IETYV WN HEP+RG++ G
Sbjct: 11 FLLDGRPHQVISGTLHYFRIHPEHWADRIRTAKAMGLNTIETYVAWNAHEPVRGEWDATG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+ + GL +R GPY CAEW+ GG PVWL PGI R + F E +
Sbjct: 71 WNDLGRFLDLIAAEGLHAIVRPGPYICAEWHNGGLPVWLTSTPGIGIRRSEPQFVEAVSE 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+L ++ +++ + +GG ++L Q+ENEYG AYG E Y++ + +V
Sbjct: 131 YLRRVYEIVAPRQI--DRGGNVVLVQIENEYG----AYGSDKE-YLRELVRVTKDAGITV 183
Query: 192 PWVMCQQ------EDAPDPIINTCNGF------YCDGFTPNSPSKPIMWTENYSGWFLSF 239
P Q E P ++ F + P+ P+M +E + GWF +
Sbjct: 184 PLTTVDQPMPWMLEAGSLPELHLTGSFGSRSAERLATLREHQPTGPLMCSEFWDGWFDWW 243
Query: 240 G----YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PLVAT 288
G P DL +A G N YM GGTNFG T G P+V T
Sbjct: 244 GSIHHTTDPAASAHDLDVLLA-----AGASVNIYMVHGGTNFGTTNGANDKGRFDPIV-T 297
Query: 289 SYDYDAPIDEYG 300
SYDYDAPIDE G
Sbjct: 298 SYDYDAPIDESG 309
>gi|125526285|gb|EAY74399.1| hypothetical protein OsI_02287 [Oryza sativa Indica Group]
Length = 255
Score = 174 bits (440), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 88/199 (44%), Positives = 109/199 (54%), Gaps = 48/199 (24%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+V+YD R+LVIDG+RR++ SGSIHYPRSTPE
Sbjct: 29 SVSYDDRSLVIDGQRRIILSGSIHYPRSTPE----------------------------- 59
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+Q AG++ LRIGPY C EWNYGG P WL IPG+QFR N
Sbjct: 60 -----------------EIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNE 102
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAA 181
PF+ EM+ F I++ MK +FA QGGPIILAQ+ENEYGN+ + Y+ W A
Sbjct: 103 PFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIHWCA 162
Query: 182 DTAVNLNTSVPWVMCQQED 200
D A N VPW+MCQQ+D
Sbjct: 163 DMANKQNVGVPWIMCQQDD 181
>gi|255550369|ref|XP_002516235.1| beta-galactosidase, putative [Ricinus communis]
gi|223544721|gb|EEF46237.1| beta-galactosidase, putative [Ricinus communis]
Length = 451
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 137/474 (28%), Positives = 204/474 (43%), Gaps = 76/474 (16%)
Query: 270 MYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
MY GGTNF R +GGP++ TSYDYDAP+DEYG + QPKWGHLR+LH I L +L S
Sbjct: 38 MYHGGTNFRRMSGGPMIVTSYDYDAPLDEYGNLNQPKWGHLRDLHVRILL---HLSQSRG 94
Query: 330 THQKLGAKLEAHIY-HKSSNDCAAFLANYDSSSDANVTFNGN-VYFLPAWSVSILPDCKN 387
L Y + ++ + FL+N ++ DAN+ + ++F+PAW
Sbjct: 95 LGFATVYALNLTTYINNATGERFCFLSNTKTNEDANIDLQQDGIFFVPAW---------- 144
Query: 388 VVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQIN 447
+ + +++V Q+ N A + L + F ++ V D+ +
Sbjct: 145 IYYYSSRV--QQGNFQQCKATSDETDYLRYITRYFDFFTVSV----------KDVHSRCQ 192
Query: 448 TTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLIN 507
+T ++ P + ++ + H+ + G ++ F
Sbjct: 193 QCNNTEEHDLACDFFGTSPACSCQSAARLQQVFHSIYNLTS--------GKQNYGEFF-- 242
Query: 508 KKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVG 567
E EGI LS W G G + L D +G RD+
Sbjct: 243 --DEGPEGIAGAADLSS-------NQWAYKIGLGGEAKRLYDPNSGHRDV---------- 283
Query: 568 VEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAW 627
++ + LPV +++ WYKTTF P G PL LNL MGKG AW
Sbjct: 284 ------------------FRTSAILPVGRAMTWYKTTFHVPSGTDPLVLNLQGMGKGHAW 325
Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
VNG S+GR+W A TG + CDYRG YD KC +CG P Q HI T++ G +
Sbjct: 326 VNGHSLGRFWPMQSADPTGYSGSCDYRGKYDKDKCLTNCGNPTQRWKHIA-TFMPNGRII 384
Query: 688 LVIHEELGGDPSKISLLTKTGQHICSFVSEA-DPPPVDSWKPNLGVVSSSPQVR 740
VI G+P + G ++ + A + V +LGV S+ V+
Sbjct: 385 SVIQFASFGNPEGTCGSLQKGDFEAAYTAFAVEKACVGKESCSLGVSESTLGVK 438
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 97/282 (34%), Positives = 128/282 (45%), Gaps = 73/282 (25%)
Query: 521 ILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISL 580
I ++ G QNYG +FD G+ G DLSS +W Y++G+ GE L +
Sbjct: 228 IYNLTSGKQNYGEFFDEGPEGI---------AGAADLSSNQWAYKIGLGGEAKRLYDPNS 278
Query: 581 ANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAY 640
+ ++ + LPV +++ WYKTTF P G PL LNL MGKG AWVNG S+GR+W
Sbjct: 279 GHRDVFRTSAILPVGRAMTWYKTTFHVPSGTDPLVLNLQGMGKGHAWVNGHSLGRFWPMQ 338
Query: 641 LAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSK 700
A TG + CDYRG YD KC +CG P Q W
Sbjct: 339 SADPTGYSGSCDYRGKYDKDKCLTNCGNPTQR-------W-------------------- 371
Query: 701 ISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIP 760
+HI +F+ P R+ I+ I FAS+G P
Sbjct: 372 --------KHIATFM---------------------PNGRI-------ISVIQFASFGNP 395
Query: 761 EGNCGSFRPGACHMDVLPI-VQKACVGQIECSIPVSSAYLGV 801
EG CGS + G V+KACVG+ CS+ VS + LGV
Sbjct: 396 EGTCGSLQKGDFEAAYTAFAVEKACVGKESCSLGVSESTLGV 437
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 16/26 (61%), Positives = 21/26 (80%)
Query: 139 LMKQENLFASQGGPIILAQVENEYGN 164
+ K+ LFAS GGPI+ AQ+EN+YGN
Sbjct: 1 MAKEAKLFASSGGPIVFAQIENDYGN 26
>gi|289768016|ref|ZP_06527394.1| beta-galactosidase [Streptomyces lividans TK24]
gi|289698215|gb|EFD65644.1| beta-galactosidase [Streptomyces lividans TK24]
Length = 595
Score = 172 bits (436), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 174/351 (49%), Gaps = 37/351 (10%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
+ ++Y L+ +G+ L +GS+HY R P W + +R+ GL ++TYV WN+HE
Sbjct: 4 STLSYTDGTLLRNGRPHRLLAGSLHYFRVHPGHWADRLRRLAALGLNAVDTYVPWNFHER 63
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G F+G DL RF++ QE GL + +R GPY CAEW+ GG P WL PG++ RT++
Sbjct: 64 TAGDIRFDGPRDLARFIRLAQEEGLDVVVRPGPYICAEWDNGGLPAWLTGTPGMRLRTSH 123
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
P+ E + R+ ++ + + L A +GGP++ Q+ENEYG+ YG YV+ D
Sbjct: 124 GPYLEAVDRWFDALVPRIAE--LQAGRGGPVVAVQIENEYGS----YG-DDRAYVRHIRD 176
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD------GFTPN---------SPSKPIM 227
V + + D P P++ + G P+ P++P
Sbjct: 177 ALVARGIT---ELLYTADGPTPLMQDGGALPGELAAATFGSRPDRAAALLRSRRPAEPFF 233
Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG---- 283
E ++GWF +G RP A + + GG+ + YM GGTNFG AG
Sbjct: 234 CAEFWNGWFDHWGDKHHVRPAPSAAEDLGGILDEGGSV-SLYMAHGGTNFGLWAGANHEG 292
Query: 284 ----PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAI-KLCEEYLISSDP 329
P V TSYD DAPI E G + PK+ LR+ A+ + + +DP
Sbjct: 293 GTIRPTV-TSYDSDAPIAENGAL-TPKFFALRDRLTALGTVAARRPLPADP 341
>gi|380694789|ref|ZP_09859648.1| beta-galactosidase [Bacteroides faecis MAJ27]
Length = 781
Score = 172 bits (436), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 113/320 (35%), Positives = 166/320 (51%), Gaps = 19/320 (5%)
Query: 10 RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
+ +++G+ V+++ IHYPR E W I+ SK G+ I YVFWN+HEP G+Y F
Sbjct: 33 KTFLLNGEPFVVKAAEIHYPRIPKEYWEHRIKMSKALGMNTICLYVFWNFHEPEEGKYDF 92
Query: 70 EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
G+ D+ F + QE G+++ +R GPY CAEW GG P WL I+ R + + E +
Sbjct: 93 TGQKDIAAFCRMAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKEDIKLREQDPYYMERV 152
Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--VEWAYGVGGELYVKWAADTAVNL 187
K F+ ++ + +L S+GG II+ QVENEYG+ ++ Y VK A T V L
Sbjct: 153 KLFMNEVGKQLA--DLQISKGGNIIMVQVENEYGSFGIDKPYIAAIRDMVKQAGFTGVPL 210
Query: 188 NTSVPWVMCQQEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWTENYSGWFLSFG 240
W + +A D ++ T N G D P+ P+M +E +SGWF +G
Sbjct: 211 -FQCDWNSNFENNALDDLLWTVNFGTGANIDQQFERLKELRPNTPLMCSEFWSGWFDHWG 269
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-----VATSYDYDAP 295
R E+L + + +F + YM GGT+FG G TSYDYDAP
Sbjct: 270 AKHETRSAEELVKGMKEMLDRNISF-SLYMTHGGTSFGHWGGANFPNFSPTCTSYDYDAP 328
Query: 296 IDEYGFIRQPKWGHLRELHK 315
I+E G + PK+ +R+L K
Sbjct: 329 INESGKV-TPKFLEVRDLLK 347
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 59/236 (25%), Positives = 95/236 (40%), Gaps = 53/236 (22%)
Query: 464 VMPGQGKEVFLNIESLGHAALVFVN-KKLVAFGYGNHDFANFLINKKIELNEGINTLDIL 522
+P +E L I A VF+N KKL + L K E + LDIL
Sbjct: 410 TLPASKEEQTLIITEAHDWAQVFLNGKKLATLSRLKGEGTVILPPMKEE-----SRLDIL 464
Query: 523 SMMVGLQNYG-AWFDVAGAGLFSVILIDLKNGKRDLSS-GEW-IYQVGVEGEYIGLDKIS 579
+G N+G +D G ++L++ +++S +W +Y + V+ +
Sbjct: 465 VEAMGRMNFGKGIYDWKGI----TEKVELQSNDGNITSLKDWQVYNIPVDYSFA------ 514
Query: 580 LANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSA 639
N + K+ +T K +Y+ TF + G LN+ + KG W+NG ++GRYW
Sbjct: 515 -QNKKYEKRDNT---EKYPAYYRGTFTL-DKVGDTFLNMMNWSKGMVWINGHAVGRYWEI 569
Query: 640 YLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
P QTLY +P W+ G+N +VI + G
Sbjct: 570 ----------------------------GPQQTLY-VPGCWLKEGDNEVVILDMAG 596
>gi|21224660|ref|NP_630439.1| beta-galactosidase [Streptomyces coelicolor A3(2)]
gi|3367753|emb|CAA20078.1| beta-galactosidase [Streptomyces coelicolor A3(2)]
Length = 595
Score = 172 bits (435), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 113/338 (33%), Positives = 169/338 (50%), Gaps = 36/338 (10%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
+ ++Y L+ +G+ L +GS+HY R P W + +R+ GL ++TYV WN+HE
Sbjct: 4 STLSYTDGTLLRNGRPHRLLAGSLHYFRVHPGHWADRLRRLAALGLNAVDTYVPWNFHER 63
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G F+G DL RF++ QE GL + +R GPY CAEW+ GG P WL PG++ RT++
Sbjct: 64 TAGDIRFDGPRDLARFIRLAQEEGLDVVVRPGPYICAEWDNGGLPAWLTGTPGMRLRTSH 123
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
P+ E + R+ ++ + + L A +GGP++ Q+ENEYG+ YG YV+ D
Sbjct: 124 GPYLEAVDRWFDALVPRIAE--LQAGRGGPVVAVQIENEYGS----YG-DDRAYVRHIRD 176
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD------GFTPN---------SPSKPIM 227
V + + D P P++ + G P+ P++P
Sbjct: 177 ALVARGIT---ELLYTADGPTPLMQDGGALPGELAAATFGSRPDRAAALLRSRRPAEPFF 233
Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG---- 283
E ++GWF +G RP A + + GG+ + YM GGTNFG AG
Sbjct: 234 CAEFWNGWFDHWGDKHHVRPAPSAAEDLGGILDEGGSV-SLYMAHGGTNFGLWAGANHEG 292
Query: 284 ----PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAI 317
P V TSYD DAPI E G + PK+ LR+ A+
Sbjct: 293 GTIRPTV-TSYDSDAPIAENGAL-TPKFFALRDRLTAL 328
>gi|410865123|ref|YP_006979734.1| Beta-galactosidase [Propionibacterium acidipropionici ATCC 4875]
gi|410821764|gb|AFV88379.1| Beta-galactosidase [Propionibacterium acidipropionici ATCC 4875]
Length = 591
Score = 172 bits (435), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 116/343 (33%), Positives = 162/343 (47%), Gaps = 28/343 (8%)
Query: 6 TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
T ++DG+ + SG+IHY R P+ W + I K++ GL IETYV WN HEP+ G
Sbjct: 5 TIGEHDFLLDGRPHRILSGAIHYFRIHPDQWADRIHKARLMGLNTIETYVAWNAHEPVEG 64
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
Q+ +EG DL F+K V + G+ +R PY CAEW+ GG P WL R F
Sbjct: 65 QWSWEGGLDLAAFLKAVADEGMHAIVRPAPYICAEWDNGGLPAWLFGEKAAGVRRDEPVF 124
Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
++ +L ++ +++ E L GGP+IL Q+ENEYG AYG E Y++ D
Sbjct: 125 MAAVQAYLRRVYEVI--EPLQIHHGGPVILVQIENEYG----AYGSDPE-YLRKLVDITS 177
Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDG-FTPNSPSK-----------PIMWTENYS 233
+ +VP Q + + G G F SP + P+M E ++
Sbjct: 178 SAGITVPLTTVDQPEDGMLAAGSLPGLLRTGSFGSRSPERLATLRRHQPTGPLMCMEYWN 237
Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PLV 286
GWF +G E A + +G + N YM GGTNFG T G P+V
Sbjct: 238 GWFDDWGTPHHTTDAEASAADLDALLGSGASV-NLYMLCGGTNFGLTNGANDKGTYEPIV 296
Query: 287 ATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
TSYDYDAP+DE G W + + +L E S P
Sbjct: 297 -TSYDYDAPLDEAGHPTAKYWAFREVIGRYTELPGEVPPGSSP 338
>gi|334338180|ref|YP_004543332.1| glycoside hydrolase family protein [Isoptericola variabilis 225]
gi|334108548|gb|AEG45438.1| glycoside hydrolase family 35 [Isoptericola variabilis 225]
Length = 603
Score = 172 bits (435), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 109/311 (35%), Positives = 151/311 (48%), Gaps = 29/311 (9%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DG+ + SG++HY R P+ W + IRK++ GL +ETYV WN H P RG + G
Sbjct: 11 FLLDGRSLQIVSGALHYFRVHPDQWADRIRKARLLGLNTVETYVAWNVHSPERGVFDTSG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
R DL RF+ V GL +R GPY CAEW GG P WL P + R F E +
Sbjct: 71 RRDLARFLDLVAAEGLHAIVRPGPYICAEWTGGGLPAWLFADPEVGVRRAEPRFLEAIGE 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYG----VGGELYVKWAADTAVNL 187
+ A ++ ++ + + ++GGP+++ QVENEYG AYG V E Y++ AD
Sbjct: 131 YYAALLPIVAERQV--TRGGPVLMVQVENEYG----AYGDDPPVERERYLRALADMIRAQ 184
Query: 188 NTSVPWVMCQQED------APDPIINTCNGFYCDG------FTPNSPSKPIMWTENYSGW 235
VP Q + P + T F + P+ P+M E + GW
Sbjct: 185 GIDVPLFTSDQANDHHLSRGSLPELLTTANFGSRATERLAILRKHQPTGPLMCMEFWDGW 244
Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVATS 289
F S G P E A + G + N YM GGTNFG T+G + TS
Sbjct: 245 FDSAGLHHHTTPPEANARDLDDLLAAGASV-NLYMLHGGTNFGLTSGANDKGVYRPITTS 303
Query: 290 YDYDAPIDEYG 300
YDYDAP+ E+G
Sbjct: 304 YDYDAPLSEHG 314
>gi|413922057|gb|AFW61989.1| hypothetical protein ZEAMMB73_453254 [Zea mays]
Length = 139
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 73/103 (70%), Positives = 94/103 (91%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+A V+YDHRA+VI+G+RR+L SGSIHYPRSTPE+WP L++K+K+GGL+V++TYVFWN HE
Sbjct: 25 NAAVSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTYVFWNGHE 84
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYG 104
P+RGQYYF R+DLVRFVK ++AGL++HLRIGPY CAEWN+G
Sbjct: 85 PVRGQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFG 127
>gi|156382804|ref|XP_001632742.1| predicted protein [Nematostella vectensis]
gi|156219802|gb|EDO40679.1| predicted protein [Nematostella vectensis]
Length = 612
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 115/329 (34%), Positives = 162/329 (49%), Gaps = 25/329 (7%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
S + + R +DGK + SG++HY R P+ W + I K K GL +ETYV WN HE
Sbjct: 39 SKGLVANGRHFTMDGKPFTILSGAMHYFRIPPQYWEDRIVKLKAMGLNTVETYVSWNLHE 98
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
I+G + F+ D+V F+KT Q+ L++ +R GPY CAEW+ GG P WL P I R+
Sbjct: 99 EIQGDFNFKDGLDIVEFIKTAQKHDLYVIMRPGPYICAEWDLGGLPSWLLHNPNIYLRSL 158
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVE----WAYGVGGELYV 177
+ F + RF ++I + S GGPII Q+ENEY + + + + E+ +
Sbjct: 159 DPIFMKATLRFFDELIPRLIDYQY--SNGGPIIAWQIENEYLSYDNSSAYMRKLQQEMVI 216
Query: 178 KWAADTAVNLNTSVPWVMCQQEDAPDP-IINTCN-----GFYCDGFTPNSPSKPIMWTEN 231
+ + + W M ++ P ++ T N G P+ P+M TE
Sbjct: 217 RGVKELL--FTSDGIWQMQIEKKYSLPGVLKTVNFQRNETNILKGLRKLQPNMPLMVTEF 274
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------- 283
+SGWF +G VE A + + NYYM GGTNFG G
Sbjct: 275 WSGWFDHWGEDKHVLTVEKAAERTKNILKMESSI-NYYMLHGGTNFGFMNGANAENGKYK 333
Query: 284 PLVATSYDYDAPIDEYGFIRQPKWGHLRE 312
P + TSYDYDAPI E G I PK+ LRE
Sbjct: 334 PTI-TSYDYDAPISESGDI-TPKYRELRE 360
>gi|269794634|ref|YP_003314089.1| beta-galactosidase [Sanguibacter keddieii DSM 10542]
gi|269096819|gb|ACZ21255.1| beta-galactosidase [Sanguibacter keddieii DSM 10542]
Length = 586
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 103/308 (33%), Positives = 153/308 (49%), Gaps = 26/308 (8%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DGK + SG++HY R P++W + I K++ GL IETYV WN H P RG++ +G
Sbjct: 8 FLLDGKPFRILSGALHYFRVHPDLWADRIHKARLMGLNTIETYVPWNAHAPQRGEFRTDG 67
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF++ V+ G+ +R GPY CAEW+ GG P WL P + R + E +
Sbjct: 68 ALDLERFLRLVEAEGMLAIVRPGPYICAEWDNGGLPGWLFRDPAVGVRRDEPLYMEAVSE 127
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+L ++DL+ + +GGP++L QVENEYG AYG +Y++ + +V
Sbjct: 128 YLGTVLDLVAPFQV--DRGGPVVLVQVENEYG----AYG-SDHVYLEKLMALTRSHGITV 180
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDG------------FTPNSPSKPIMWTENYSGWFLSF 239
P Q + +G + G + P+ P+M E + GWF +
Sbjct: 181 PLTSIDQPSGTMLADGSIDGLHRTGSFGSRSAERLATLREHQPTGPLMCAEFWDGWFDHW 240
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVATSYDYD 293
G +D A + G + N YM+ GGTNFG T+G TSYDYD
Sbjct: 241 GAHHHTTSAQDAARELDELLAAGASV-NIYMFHGGTNFGFTSGANDKGVYQPTTTSYDYD 299
Query: 294 APIDEYGF 301
AP+ E G+
Sbjct: 300 APLAEDGY 307
>gi|383112460|ref|ZP_09933253.1| hypothetical protein BSGG_0667 [Bacteroides sp. D2]
gi|313693132|gb|EFS29967.1| hypothetical protein BSGG_0667 [Bacteroides sp. D2]
Length = 782
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 113/318 (35%), Positives = 162/318 (50%), Gaps = 19/318 (5%)
Query: 10 RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
+ +++GK V+++ IHYPR E W I+ K G+ I YVFWN+HEP G+Y F
Sbjct: 33 KTFLLNGKPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMNTICLYVFWNFHEPEEGKYDF 92
Query: 70 EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
G+ D+ F + QE G+++ +R GPY CAEW GG P WL I+ R + + E +
Sbjct: 93 TGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIKLREQDPYYMERV 152
Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--VEWAYGVGGELYVKWAADTAVNL 187
K F+ ++ + +L S+GG II+ QVENEYG+ ++ Y VK A T V L
Sbjct: 153 KLFMNEVGKQLT--DLQISKGGNIIMVQVENEYGSFGIDKPYIAEIRDIVKQAGFTGVPL 210
Query: 188 NTSVPWVMCQQEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWTENYSGWFLSFG 240
W + +A D ++ T N G D P P+M +E +SGWF +G
Sbjct: 211 -FQCDWNSNFENNALDDLLWTINFGTGANIDDQFKRLQELRPDIPLMCSEFWSGWFDHWG 269
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-----VATSYDYDAP 295
R EDL + + +F + YM GGT+FG G TSYDYDAP
Sbjct: 270 AKHETRSAEDLVKGMKEMLDRNISF-SLYMTHGGTSFGHWGGANFPNFSPTCTSYDYDAP 328
Query: 296 IDEYGFIRQPKWGHLREL 313
I+E G + PK+ +R L
Sbjct: 329 INESGKV-TPKYFEVRNL 345
Score = 43.9 bits (102), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 61/233 (26%), Positives = 85/233 (36%), Gaps = 47/233 (20%)
Query: 464 VMPGQGKEVFLNIESLGHAALVFVN-KKLVAFGYGNHDFANFLINKKIELNEGINTLDIL 522
+P +E L I A VF++ KKL + L K EG LDIL
Sbjct: 410 TLPASKEEQTLTITEAHDWAQVFLDGKKLATLSRLKGEGTVILPPMK----EGAQ-LDIL 464
Query: 523 SMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLAN 582
+G N+G G+ + + NG +Y + V D N
Sbjct: 465 VEAMGRMNFGKGI-YDWKGITEKVEVQSNNGVITSLKNWKVYNIPV-------DYAFAQN 516
Query: 583 SSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLA 642
F KQ + K +Y+ TF + G LN+ + KG WVNG +IGRYW
Sbjct: 517 KKFVKQDNP---QKYPAYYRGTFTL-DKTGDTFLNMTNWSKGMVWVNGYAIGRYWEI--- 569
Query: 643 PSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
P QTLY +P W+ GEN ++I + G
Sbjct: 570 -------------------------GPQQTLY-VPGCWLKKGENEVIILDMAG 596
>gi|423295816|ref|ZP_17273943.1| hypothetical protein HMPREF1070_02608 [Bacteroides ovatus
CL03T12C18]
gi|392671544|gb|EIY65016.1| hypothetical protein HMPREF1070_02608 [Bacteroides ovatus
CL03T12C18]
Length = 782
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 113/318 (35%), Positives = 162/318 (50%), Gaps = 19/318 (5%)
Query: 10 RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
+ +++GK V+++ IHYPR E W I+ K G+ I YVFWN+HEP G+Y F
Sbjct: 33 KTFLLNGKPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMNTICLYVFWNFHEPEEGKYDF 92
Query: 70 EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
G+ D+ F + QE G+++ +R GPY CAEW GG P WL I+ R + + E +
Sbjct: 93 TGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIKLREQDPYYMERV 152
Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--VEWAYGVGGELYVKWAADTAVNL 187
K F+ ++ + +L S+GG II+ QVENEYG+ ++ Y VK A T V L
Sbjct: 153 KLFMNEVGKQLA--DLQISKGGNIIMVQVENEYGSFGIDKPYIAEIRDIVKQAGFTGVPL 210
Query: 188 NTSVPWVMCQQEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWTENYSGWFLSFG 240
W + +A D ++ T N G D P P+M +E +SGWF +G
Sbjct: 211 -FQCDWNSNFENNALDDLLWTINFGTGANIDDQFKRLQELRPDIPLMCSEFWSGWFDHWG 269
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-----VATSYDYDAP 295
R EDL + + +F + YM GGT+FG G TSYDYDAP
Sbjct: 270 AKHETRSAEDLVKGMKEMLDRNISF-SLYMTHGGTSFGHWGGANFPNFSPTCTSYDYDAP 328
Query: 296 IDEYGFIRQPKWGHLREL 313
I+E G + PK+ +R L
Sbjct: 329 INESGKV-TPKYFEVRNL 345
Score = 43.5 bits (101), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 47/177 (26%), Positives = 66/177 (37%), Gaps = 41/177 (23%)
Query: 519 LDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKI 578
LDIL +G N+G G+ + + NG +Y + V D
Sbjct: 461 LDILVEAMGRMNFGKGI-YDWKGITEKVEVQSNNGVITSLKNWKVYNIPV-------DYA 512
Query: 579 SLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
N F KQ + K +Y+ TF + G LN+ + KG WVNG +IGRYW
Sbjct: 513 FAQNKKFVKQDNP---QKYPAYYRGTFTL-DKTGDTFLNMTNWSKGMVWVNGYAIGRYWE 568
Query: 639 AYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
P QTLY +P W+ GEN ++I + G
Sbjct: 569 I----------------------------GPQQTLY-VPGCWLKKGENEVIILDMAG 596
>gi|294633111|ref|ZP_06711670.1| beta-galactosidase [Streptomyces sp. e14]
gi|292830892|gb|EFF89242.1| beta-galactosidase [Streptomyces sp. e14]
Length = 606
Score = 169 bits (429), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 111/326 (34%), Positives = 160/326 (49%), Gaps = 35/326 (10%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+A +T+ L+ G+ + SGS+HY R P W + + + GL ++TYV WN+HE
Sbjct: 14 AATLTHAGGTLLRAGRPHRILSGSLHYFRVHPGQWADRLARLAALGLNTVDTYVPWNFHE 73
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
G F+G DL RFV+ QE GL + +R GPY CAEW+ GG P WL PG++ RT+
Sbjct: 74 RTPGDVRFDGWRDLDRFVRLAQETGLDVIVRPGPYICAEWDNGGLPAWLTGTPGMRPRTS 133
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
+ PF + R+ ++I + L A +GGP++ Q+ENEYG+ YG G+ YV+W
Sbjct: 134 HPPFLAAVARWFDQLIPRIAA--LQAGRGGPVVAVQIENEYGS----YGDDGD-YVRWVR 186
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD------GFTPNS---------PSKPI 226
D + + D P ++ + G P P +P
Sbjct: 187 DALTARGVT---ELLYTADGPTELMLDAGAVEGELAAATFGSRPEQAARLLRSRRPEEPF 243
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
E ++GWF +G RP A V R GG+ + YM GGTNFG AG
Sbjct: 244 FCAEFWNGWFDHWGEQHHVRPARSAADDVGRILGAGGSL-SLYMAHGGTNFGLWAGANHD 302
Query: 284 -----PLVATSYDYDAPIDEYGFIRQ 304
P V TSYD DAP+ E+G + +
Sbjct: 303 GDRLQPTV-TSYDSDAPVAEHGALTE 327
>gi|336417631|ref|ZP_08597952.1| hypothetical protein HMPREF1017_05060 [Bacteroides ovatus
3_8_47FAA]
gi|335935372|gb|EGM97326.1| hypothetical protein HMPREF1017_05060 [Bacteroides ovatus
3_8_47FAA]
Length = 782
Score = 169 bits (429), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 112/318 (35%), Positives = 162/318 (50%), Gaps = 19/318 (5%)
Query: 10 RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
+ +++GK V+++ IHYPR E W I+ K G+ I YVFWN+HEP G+Y F
Sbjct: 33 KTFLLNGKPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMNTICLYVFWNFHEPEEGKYDF 92
Query: 70 EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
G+ D+ F + QE G+++ +R GPY CAEW GG P WL I+ R + + E +
Sbjct: 93 TGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIKLREQDPYYMERV 152
Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--VEWAYGVGGELYVKWAADTAVNL 187
K F+ ++ + +L ++GG II+ QVENEYG+ ++ Y VK A T V L
Sbjct: 153 KLFMNEVGKQLT--DLQINKGGNIIMVQVENEYGSFGIDKPYIAEIRDIVKQAGFTGVPL 210
Query: 188 NTSVPWVMCQQEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWTENYSGWFLSFG 240
W + +A D ++ T N G D P P+M +E +SGWF +G
Sbjct: 211 -FQCDWNSNFENNALDDLLWTINFGTGANIDDQFKRLQELRPDIPLMCSEFWSGWFDHWG 269
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-----VATSYDYDAP 295
R EDL + + +F + YM GGT+FG G TSYDYDAP
Sbjct: 270 AKHETRSAEDLVKGMKEMLDRNISF-SLYMTHGGTSFGHWGGANFPNFSPTCTSYDYDAP 328
Query: 296 IDEYGFIRQPKWGHLREL 313
I+E G + PK+ +R L
Sbjct: 329 INESGKV-TPKYFEVRNL 345
Score = 47.8 bits (112), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 63/233 (27%), Positives = 87/233 (37%), Gaps = 47/233 (20%)
Query: 464 VMPGQGKEVFLNIESLGHAALVFVN-KKLVAFGYGNHDFANFLINKKIELNEGINTLDIL 522
+P +E L I A VF++ KKL + L K EG LDIL
Sbjct: 410 TLPASKEEQTLTITEAHDWAQVFLDGKKLATLSRLKGEGTVILPPMK----EGAQ-LDIL 464
Query: 523 SMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLAN 582
+G N+G G+ + I NG +Y + V D N
Sbjct: 465 VEAMGRMNFGKGI-YDWKGITEKVEIQSNNGVITSLKNWKVYNIPV-------DYAFAQN 516
Query: 583 SSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLA 642
F KQ + L K +Y+ TF+ + G LN+ + KG WVNG +IGRYW
Sbjct: 517 KEFMKQDNPL---KYPAYYRGTFML-DKTGDTFLNMTNWSKGMVWVNGYAIGRYWEI--- 569
Query: 643 PSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
P QTLY +P W+ GEN ++I + G
Sbjct: 570 -------------------------GPQQTLY-VPGCWLKKGENEVIILDMAG 596
>gi|229084352|ref|ZP_04216632.1| Beta-galactosidase [Bacillus cereus Rock3-44]
gi|228698892|gb|EEL51597.1| Beta-galactosidase [Bacillus cereus Rock3-44]
Length = 867
Score = 169 bits (429), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 119/399 (29%), Positives = 188/399 (47%), Gaps = 26/399 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+TYD ++ I KR + S +IHY R W +++ K+K GG IETY+ WN+HE
Sbjct: 2 ITYDKKSWKIHNKRIFILSAAIHYFRLPKAEWDDVLEKAKAGGCNTIETYIPWNFHEMKE 61
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G++ F G DL F++ GL++ R GPY CAEW++GGFP WL IQ+R+
Sbjct: 62 GEWDFSGDKDLAHFLQLCANKGLYVIARPGPYICAEWDFGGFPWWLSTKKDIQYRSAQPS 121
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
F + ++ ++I ++ + L ++ G +I+ Q+ENE+ AYG + Y+++ D
Sbjct: 122 FLHYVDQYFDQVISIIDEYQL--TKNGSVIMVQIENEFQ----AYGKPDKKYMEYLRDGM 175
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCN-----GFYCDGFTPNSPSKPIMWTENYSGWFLSF 239
+ VP+V C A D + N + +P E + GWF +
Sbjct: 176 IARGIEVPFVTCY--GAVDGAVEFRNFWSGANRAAEILDERFADQPKGVMEFWIGWFEHW 233
Query: 240 -GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNF----GRTAGGPLV-ATSYDYD 293
G + E L + G T NYYMYFGGTNF GRT + T+YDYD
Sbjct: 234 GGNKANQKTPEQLERECYQLLRNGFTTINYYMYFGGTNFDHWGGRTVSEQVFCTTTYDYD 293
Query: 294 APIDEYGFIRQP--KWGHLRELHKAIKLCEEYLISSDPTHQ--KLGAKLEAHIYHKSSND 349
IDEY QP K+ L+ H +K E +++ + KL + L++ +
Sbjct: 294 VAIDEY---LQPTRKYEVLKRYHLFVKWLEPLFTNAEQANSDVKLSSDLKSGRIVSPHGE 350
Query: 350 CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNV 388
N + ++V + + ++LP +NV
Sbjct: 351 VLFIENNRNERIQSHVKHGNELVPFTIEANAVLPIVRNV 389
>gi|254443764|ref|ZP_05057240.1| Glycosyl hydrolases family 35 [Verrucomicrobiae bacterium DG1235]
gi|198258072|gb|EDY82380.1| Glycosyl hydrolases family 35 [Verrucomicrobiae bacterium DG1235]
Length = 792
Score = 169 bits (428), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 112/325 (34%), Positives = 165/325 (50%), Gaps = 35/325 (10%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DG+ ++ G +HY R E W I + G+ + Y+FWNYHE G++ +EG
Sbjct: 48 FLLDGEPIQIRCGELHYSRVPREYWKHRIEMIRAMGMNAVCVYLFWNYHEREEGEFTWEG 107
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
+ D+V F + QEAGL++ LR GPY+CAEW GG P WL IQ RTT+ F +
Sbjct: 108 QADVVEFCRLAQEAGLWVVLRPGPYSCAEWEMGGLPWWLLKHDDIQLRTTDKRFISAARN 167
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
++A++ + NL S+GGPI++ QVENEYG YG E Y+ ++ ++ V
Sbjct: 168 YMAEVGRTLG--NLQVSRGGPILMVQVENEYG----FYGSDPE-YMGAIRESLIDAGFEV 220
Query: 192 PWVMCQQEDAPDPIINTCNGFYCD-------GFTPNS---------PSKPIMWTENYSGW 235
P C +P + G+ D G P S + P+M E Y GW
Sbjct: 221 PLFAC------NPPYHLERGYRDDLFQVVNFGSEPESAFAELRKVQATGPLMCGEFYPGW 274
Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV----ATSYD 291
F ++G +E+ A+ R E +F + YM GGT FG AG +SYD
Sbjct: 275 FDTWGNPHHTGKIENYTGALGRMMEMRASF-SIYMAHGGTTFGFWAGADRPFKPDTSSYD 333
Query: 292 YDAPIDEYGFIRQPKWGHLRELHKA 316
YDAP+ E G+ P++ LREL ++
Sbjct: 334 YDAPVSEAGWT-TPQYFRLRELMQS 357
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 62/263 (23%), Positives = 104/263 (39%), Gaps = 60/263 (22%)
Query: 468 QGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVG 527
+G V L ++ VFV+ + + G D + + I + TL+IL +G
Sbjct: 422 KGPAVTLKAAAVNDFGWVFVDGEPM----GTFDRRSRTFSIDIPKRDSPATLEILVYAMG 477
Query: 528 LQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK 587
N+G + V L+D K R L G + + ++ +Y+ K A+
Sbjct: 478 RINFGPEVHDRKGLIGPVELVDEKGRARQLK-GWKHHSLPMDDDYLASLKYQAASE---- 532
Query: 588 QGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGC 647
KS ++++ F E G L+L+S GKG W+NG ++GRYW+
Sbjct: 533 -------EKSPAFWRSEFELKE-TGDTFLDLSSWGKGAVWINGYALGRYWNI-------- 576
Query: 648 TKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
P QT+Y +P W+ G N +V+ + LG + I+ L K
Sbjct: 577 --------------------GPTQTMY-VPGPWLKEGRNEIVVLDLLGPESPVIAGLEK- 614
Query: 708 GQHICSFVSEADPPPVDSWKPNL 730
P +D+ +P L
Sbjct: 615 -------------PVLDTLRPEL 624
>gi|449493221|ref|XP_002196735.2| PREDICTED: beta-galactosidase [Taeniopygia guttata]
Length = 636
Score = 169 bits (428), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 175/662 (26%), Positives = 269/662 (40%), Gaps = 127/662 (19%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
S + YD V DGK SGSIHY R P W + + K K GL+ I+TYV WNYHE
Sbjct: 8 SFGIDYDSNCFVKDGKPFRYISGSIHYSRVPPYYWKDRLLKMKMAGLDAIQTYVPWNYHE 67
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G Y F G DL F++ + GL + LR GPY CAEW+ GG P WL I R++
Sbjct: 68 PQMGTYDFFGGKDLQYFLQLANDTGLLVILRAGPYICAEWDMGGLPAWLLEKKSIVLRSS 127
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAYGVG------ 172
++ + E ++R++ ++ M+ GGPII+ QVENEYG+ ++ Y
Sbjct: 128 DSDYLEAVERWMGVLLPKMRP--YLYQNGGPIIMVQVENEYGSYFACDYNYLRFLLKLFR 185
Query: 173 ---GELYVKWAADTAVNLNT---SVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPI 226
G+ V + D A + ++ + + AP N F + P P+
Sbjct: 186 LHLGDEVVLFTTDGASQFHLKCGALQGLYATVDFAPG--ANVTAAFLAQ--RSSEPKGPL 241
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL- 285
+ +E Y+GW +G+ P + +A + +G N YM+ GGTNF G +
Sbjct: 242 VNSEFYTGWLDHWGHHHSVVPAQTIAKTLNEILASGANV-NLYMFIGGTNFAYWNGANMP 300
Query: 286 ---VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHI 342
TSYDYDAP+ E G + + K+ LR++ K E L + PT K +
Sbjct: 301 YMPQPTSYDYDAPLSEAGDLTE-KYFALRKVIGMYKQLPEGL--TPPTTPKFAY---GKV 354
Query: 343 YHKSSNDCAAFLANYDSSSDANVTF-----NGNVYFLPAWSVSILPDCKNVVFNTAKVIS 397
+ + L S T+ YF + LP KN V T +S
Sbjct: 355 RLQKAGTVLEVLDGLSRSGPVRSTYPLTFVELKQYFGYVLYRTTLP--KNCVEPTP--LS 410
Query: 398 QRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLW 457
NG H +R++V D Q +D S
Sbjct: 411 SPLNGVH-----------------------------DRAYVSVDGVPQGVLERDKS---- 437
Query: 458 YTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGIN 517
I++ G + + +E++G V FG N+DF + N + +
Sbjct: 438 --LKINITGQAGASLDILVENMGR----------VNFGRYNNDFKGLVSNLTLAQD---- 481
Query: 518 TLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDK 577
++VG + Y D+ GA + +I + L + KR + +
Sbjct: 482 ------VLVGWEIYP--LDIDGAVNYDIIYL-LHHPKRS-----------------AIKE 515
Query: 578 ISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYW 637
+S +F+ ++P P+ +N KGQ W+NG ++GRYW
Sbjct: 516 LSYEVPTFYTGTLSIPGG-----------IPDLPQDTYVNFPGWTKGQIWINGFNLGRYW 564
Query: 638 SA 639
A
Sbjct: 565 PA 566
>gi|255691973|ref|ZP_05415648.1| glycosyl hydrolase [Bacteroides finegoldii DSM 17565]
gi|260622382|gb|EEX45253.1| glycosyl hydrolase family 35 [Bacteroides finegoldii DSM 17565]
Length = 782
Score = 169 bits (427), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 112/318 (35%), Positives = 161/318 (50%), Gaps = 19/318 (5%)
Query: 10 RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
+ +++G V+++ IHYPR E W I+ K G+ I YVFWN+HEP G+Y F
Sbjct: 33 KTFLLNGNPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMNTICLYVFWNFHEPEEGKYDF 92
Query: 70 EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
G+ D+ F + QE G+++ +R GPY CAEW GG P WL I+ R + + E +
Sbjct: 93 TGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIKLREQDPYYMERV 152
Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--VEWAYGVGGELYVKWAADTAVNL 187
K F+ ++ + +L S+GG II+ QVENEYG+ ++ Y VK A T V L
Sbjct: 153 KLFMNEVGKQLT--DLQISKGGNIIMVQVENEYGSFGIDKPYIAEIRDIVKQAGFTGVPL 210
Query: 188 NTSVPWVMCQQEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWTENYSGWFLSFG 240
W + +A D ++ T N G D P P+M +E +SGWF +G
Sbjct: 211 -FQCDWNSNFENNALDDLLWTINFGTGANIDDQFKRLQELRPDIPLMCSEFWSGWFDHWG 269
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-----VATSYDYDAP 295
R EDL + + +F + YM GGT+FG G TSYDYDAP
Sbjct: 270 AKHETRSAEDLVKGMKEMLDRNISF-SLYMTHGGTSFGHWGGANFPNFSPTCTSYDYDAP 328
Query: 296 IDEYGFIRQPKWGHLREL 313
I+E G + PK+ +R L
Sbjct: 329 INESGKV-TPKYFEVRNL 345
Score = 43.9 bits (102), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 47/177 (26%), Positives = 66/177 (37%), Gaps = 41/177 (23%)
Query: 519 LDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKI 578
LDIL +G N+G G+ + + NG +Y + V D
Sbjct: 461 LDILVEAMGRMNFGKGI-YDWKGITEKVEVQSNNGVITSLKNWKVYNIPV-------DYA 512
Query: 579 SLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
N F KQ + K +Y+ TF + G LN+ + KG WVNG +IGRYW
Sbjct: 513 FAQNKKFVKQDNP---QKYPAYYRGTFTL-DKTGDTFLNMTTWSKGMVWVNGYAIGRYWE 568
Query: 639 AYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
P QTLY +P W+ GEN ++I + G
Sbjct: 569 I----------------------------GPQQTLY-VPGCWLKKGENEVIILDMAG 596
>gi|29345700|ref|NP_809203.1| beta-galactosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|383123143|ref|ZP_09943828.1| hypothetical protein BSIG_0114 [Bacteroides sp. 1_1_6]
gi|29337593|gb|AAO75397.1| beta-galactosidase precursor [Bacteroides thetaiotaomicron
VPI-5482]
gi|251841761|gb|EES69841.1| hypothetical protein BSIG_0114 [Bacteroides sp. 1_1_6]
Length = 779
Score = 169 bits (427), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 112/321 (34%), Positives = 162/321 (50%), Gaps = 27/321 (8%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
+++G+ V+++ IHYPR E W I+ K G+ I YVFWN+HEP G+Y F
Sbjct: 34 TFLLNGEPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMNTICLYVFWNFHEPEEGRYDFA 93
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F + QE G+++ +R GPY CAEW GG P WL I+ R + + E +K
Sbjct: 94 GQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIKLREQDPYYMERVK 153
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
FL ++ + +L S+GG II+ QVENEYG A+G+ + Y+ D T
Sbjct: 154 LFLNEVGKQLA--DLQISKGGNIIMVQVENEYG----AFGI-DKPYISEIRDMVKQAGFT 206
Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
VP C + +A D ++ T N G D P P+M +E +SGWF
Sbjct: 207 GVPLFQCDWNSNFENNALDDLLWTINFGTGANIDEQFKRLKELRPDTPLMCSEFWSGWFD 266
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-----VATSYDY 292
+G R E+L + + +F + YM GGT+FG G TSYDY
Sbjct: 267 HWGAKHETRSAEELVKGMKEMLDRNISF-SLYMTHGGTSFGHWGGANFPNFSPTCTSYDY 325
Query: 293 DAPIDEYGFIRQPKWGHLREL 313
DAPI+E G + PK+ +R L
Sbjct: 326 DAPINESGKV-TPKYLEVRNL 345
Score = 45.8 bits (107), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 78/191 (40%), Gaps = 49/191 (25%)
Query: 512 LNEGINTLDILSMMVGLQNYG-AWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVE 569
L EG + LDIL +G N+G +D G ++L++ K +W +Y + V+
Sbjct: 455 LKEG-DRLDILVEAMGRMNFGKGIYDWKGI----TEKVELQSDKGVELVKDWQVYTIPVD 509
Query: 570 GEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVN 629
S A +KQ +Y++TF E G LN+ + KG WVN
Sbjct: 510 --------YSFARDKQYKQQEN--AENQPAYYRSTFNLNE-LGDTFLNMMNWSKGMVWVN 558
Query: 630 GQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLV 689
G +IGRYW P QTLY +P W+ GEN ++
Sbjct: 559 GHAIGRYWEI----------------------------GPQQTLY-VPGCWLKKGENEII 589
Query: 690 IHEELGGDPSK 700
I + G PSK
Sbjct: 590 ILDMAG--PSK 598
>gi|84494646|ref|ZP_00993765.1| beta-galactosidase [Janibacter sp. HTCC2649]
gi|84384139|gb|EAQ00019.1| beta-galactosidase [Janibacter sp. HTCC2649]
Length = 592
Score = 169 bits (427), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 117/345 (33%), Positives = 169/345 (48%), Gaps = 47/345 (13%)
Query: 19 RVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRF 78
RVL SG+IHY R P++W + +R+ GL +ETYV WN+HE +RG+ F G DL RF
Sbjct: 25 RVL-SGAIHYFRIHPDLWEDRLRRLAAMGLNTVETYVAWNFHERVRGEIDFTGPRDLARF 83
Query: 79 VKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIID 138
+ + GL + +R GPY CAEW++GG P WL PGI RT++ F + + ++
Sbjct: 84 ISLAGDLGLDVIVRPGPYICAEWDFGGLPAWLMTEPGIALRTSDPAFLAAVDDWFDAVVP 143
Query: 139 LMKQENLFASQGGPIILAQVENEYGNVEWAYGVGG---ELYVKWAADTAVNLNTSVPWVM 195
+++ L + GGP++ QVENEYG +YG E K D ++ V+
Sbjct: 144 VIRP--LLTTAGGPVVAVQVENEYG----SYGDDAAYLEHCRKGLLDRGID-------VL 190
Query: 196 CQQEDAPDP----------IINTCN-GFYCD----GFTPNSPSKPIMWTENYSGWFLSFG 240
D P P ++ T N G D P+ P M E ++GWF +G
Sbjct: 191 LFTSDGPGPDWLDNGTIPGVLATVNFGSRTDEAFAELRKVQPAGPDMVMEYWNGWFDHWG 250
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-------VATSYDYD 293
R V+D A + GG+ N+YM GGTNFG +G + TSYDYD
Sbjct: 251 EPHHVRDVDDAAGVLDDVLRAGGSV-NFYMAHGGTNFGLWSGANVEDGKLQPTVTSYDYD 309
Query: 294 APIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKL 338
A + E G + PK+ RE + Y +++ P L A+L
Sbjct: 310 AAVGEAGEL-TPKFHAFRE------VISRYAVTALPELPPLPARL 347
>gi|256376699|ref|YP_003100359.1| beta-galactosidase [Actinosynnema mirum DSM 43827]
gi|255921002|gb|ACU36513.1| Beta-galactosidase [Actinosynnema mirum DSM 43827]
Length = 579
Score = 169 bits (427), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 104/308 (33%), Positives = 157/308 (50%), Gaps = 28/308 (9%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DG+ + +G++HY R P++W + I K++ GL IETY WN HEP+ G Y F G
Sbjct: 11 FLLDGRPHRVLAGALHYFRVHPDLWADRIEKARLMGLNTIETYTPWNLHEPVEGAYDFTG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF++ V +AG+ +R GPY CAEW+ GG P WL+ P + R + + +
Sbjct: 71 MLDLERFLRLVADAGMHAIVRPGPYICAEWDNGGLPAWLYRDPEVGVRRSEPRYLGAVSA 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+L ++ D++ L +GGP++L Q+ENEYG AYG + Y++ D +V
Sbjct: 131 YLRRVYDVVTP--LQIDRGGPVVLVQIENEYG----AYG-SDKFYLRHLVDLTRECGITV 183
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDG------------FTPNSPSKPIMWTENYSGWFLSF 239
P Q + + + G + P+ P+M +E ++GWF +
Sbjct: 184 PLTTVDQPTDEMLSQGSLDCLHRTGSFGSRATERLATLRRHQPTGPLMCSEFWNGWFDHW 243
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PLVATSYDY 292
G ED A + G + N YM+ GGTNFG T+G P + TSYDY
Sbjct: 244 GDRHHTTSAEDSAAELDALLAAGASV-NIYMFHGGTNFGLTSGANDKGVYQPTI-TSYDY 301
Query: 293 DAPIDEYG 300
DAP+DE G
Sbjct: 302 DAPLDEAG 309
>gi|410456453|ref|ZP_11310314.1| beta-galactosidase [Bacillus bataviensis LMG 21833]
gi|409928122|gb|EKN65245.1| beta-galactosidase [Bacillus bataviensis LMG 21833]
Length = 867
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 122/400 (30%), Positives = 187/400 (46%), Gaps = 28/400 (7%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+TYD ++ I +R + S +IHY R W E++ K+K GG IETY+ WN+HE
Sbjct: 2 ITYDKKSWKIHNERVFILSAAIHYFRLPRAEWNEVLDKAKAGGCNTIETYIPWNFHEMNE 61
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G++ F G DL F + + L++ R GPY CAEW++GGFP WL IQ+R+
Sbjct: 62 GEWDFSGDKDLAHFFQLCADKELYVIARPGPYICAEWDFGGFPWWLSTKKDIQYRSAQPA 121
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
F + ++ ++I ++ + L ++ G +I+ QVENE+ AYG + Y+++ D
Sbjct: 122 FLHYVDQYFDRVIPIIDEYQL--TKNGTVIMVQVENEFQ----AYGKPDKPYMEYIRDGM 175
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDG------FTPNSPSKPIMWTENYSGWFLS 238
VP V C A + + N F+ P +P E + GWF
Sbjct: 176 KARGIDVPLVTCY--GAVEGAVEFRN-FWSHSKHAAAILDERFPDQPKGVMEFWIGWFEQ 232
Query: 239 F-GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNF----GRTAG-GPLVATSYDY 292
+ G + E L + G T NYYMYFGGTNF GRT G L T+YDY
Sbjct: 233 WGGNKADQKTPEQLERECYQLLSNGFTAINYYMYFGGTNFDHWGGRTVGEQTLCTTTYDY 292
Query: 293 DAPIDEYGFIRQP--KWGHLRELHKAIKLCEEYLISSDP--THQKLGAKLEAHIYHKSSN 348
D IDEY QP K+ L+ H +K E ++ + KL + L++
Sbjct: 293 DVAIDEY---LQPTRKYEVLKRYHSFVKWLEPLFTDAEKVASDMKLPSDLKSERIASPYG 349
Query: 349 DCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNV 388
+ N + ++V + + ++LP +NV
Sbjct: 350 EVIFIENNRNERIQSHVKHGYDQILFTIEANTVLPIVRNV 389
>gi|328958462|ref|YP_004375848.1| beta-galactosidase [Carnobacterium sp. 17-4]
gi|328674786|gb|AEB30832.1| beta-galactosidase [Carnobacterium sp. 17-4]
Length = 589
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 178/661 (26%), Positives = 262/661 (39%), Gaps = 157/661 (23%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG++HY R PE W + K G +ETY+ WN HEP G+Y F G
Sbjct: 10 FLLNGEPFKITSGAVHYFRVLPEDWYHSLYNLKALGFNTVETYIPWNVHEPKEGEYQFSG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++D+ +FV+ +E GLF+ LR PY CAEW +GG P WL + R+++ F E++ R
Sbjct: 70 QWDIKKFVQLAEELGLFVILRPSPYICAEWEFGGLPAWLLTYKDMLIRSSDPVFIEKVSR 129
Query: 132 FLAKIIDLMKQEN-LFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
+ +L+KQ L GGP+I+ Q+ENEYG +YG E Y++ + + L +
Sbjct: 130 YYK---ELLKQITPLQVDHGGPVIMMQLENEYG----SYGEDKE-YLRTLYELMLKLGVT 181
Query: 191 VP-------WVMCQQE-DAPDPIINTCNGF---------YCDGFTPNSPSK-PIMWTENY 232
+P W Q+ D I T F F + K P+M E +
Sbjct: 182 IPIFTSDGAWRATQEAGTMTDLDILTTGNFGSRSKENFKELKEFHESKGKKWPLMCMEYW 241
Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGPL 285
GWF + + R +L V E G N YM+ GGTNFG R
Sbjct: 242 DGWFNRWNDPIIKRDALELTQDVKEALEIGSL--NLYMFHGGTNFGFMNGCSARLRKDLP 299
Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK 345
TSYDYDAP++E G +PT + K ++ +
Sbjct: 300 QVTSYDYDAPLNEQG---------------------------NPTEKYFALK---NMMQE 329
Query: 346 SSNDCAAF--LANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGD 403
S D L DS S N+ G V L + I+++
Sbjct: 330 SFPDIEQHPPLVK-DSMSITNIQVGGKVSLL----------------SIVDRIAKKQESR 372
Query: 404 HPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIH 463
+P K + EL ++ G + RS+V+ D E+ D SD L +
Sbjct: 373 YP----KTMEEL----------GQQYGYTLYRSYVKKDSDEEFYRVIDGSDRLHF----- 413
Query: 464 VMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILS 523
FLN E + + +K+ A G N LD+L
Sbjct: 414 ---------FLNEEKIATQYQEEIGEKIYASPIS-----------------GSNQLDVLV 447
Query: 524 MMVGLQNYGAWF--DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLD---KI 578
+G NYG D G+ ++ DL E LD +
Sbjct: 448 ENMGRVNYGHKLLADTQQKGIRRGVMSDL--------------HFITNWEQYSLDFSEPL 493
Query: 579 SLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
S+ WK+ S S YK T APE +N+ GKG VNG +IGR+W+
Sbjct: 494 SIDFDKEWKENSP-----SFYQYKVTIDAPEDT---FINMELFGKGIVLVNGFNIGRFWN 545
Query: 639 A 639
Sbjct: 546 V 546
>gi|357626884|gb|EHJ76789.1| putative carbamoyl-phosphate synthase large chain [Danaus
plexippus]
Length = 2861
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/334 (34%), Positives = 162/334 (48%), Gaps = 56/334 (16%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ N++ ++DGK + SGS+HY R E W + +RK + GL + TYV W+ HE
Sbjct: 52 ARNISIVGDDFMLDGKPLRIVSGSVHYYRLPAEYWRDRLRKIRAAGLNAVSTYVEWSSHE 111
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRT 120
G Y FEG D+ RF+K E L++ LR GPY CAE + GG P W L P I+ RT
Sbjct: 112 EEEGAYSFEGDKDIARFLKIAAEENLYVLLRPGPYICAERDLGGLPYWLLSKYPDIKLRT 171
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
T+ F E K+++AK+ + +K GGPIIL QVENEYG +YG E Y+K
Sbjct: 172 TDGNFIAETKKWMAKLFEEVKP--FLLGNGGPIILVQVENEYG----SYGASKE-YMKQI 224
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNG----FYCDG----------FTPNS----- 221
D + EDA ++ T +G ++ DG F P +
Sbjct: 225 RDI----------IKSHVEDA--ALLYTTDGPYRSYFIDGSISGTLTTIDFGPTTSVINT 272
Query: 222 --------PSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFG 273
P P+M +E Y GW + + + + F + E N+Y++FG
Sbjct: 273 FKELRAYMPVGPLMNSEFYPGWLTHWSEHIQQVSTDRVTFTLRDMLENKINL-NFYVFFG 331
Query: 274 GTNFGRTAGG-------PLVATSYDYDAPIDEYG 300
GTNF T+G P + TSYDYDAP+ E G
Sbjct: 332 GTNFEFTSGANYGRFYQPDI-TSYDYDAPLSEAG 364
>gi|255550379|ref|XP_002516240.1| beta-galactosidase, putative [Ricinus communis]
gi|223544726|gb|EEF46242.1| beta-galactosidase, putative [Ricinus communis]
Length = 216
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 88/207 (42%), Positives = 114/207 (55%), Gaps = 46/207 (22%)
Query: 171 VGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTE 230
G+ Y+ W +D A +L+ VPW++CQQ DAP P+INTC G+YCD FTPN+ + P WTE
Sbjct: 55 TAGKAYLDWCSDMAESLDIGVPWIICQQRDAPQPMINTCYGWYCDQFTPNTANSPKKWTE 114
Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-VATS 289
N++GWF S+G P R E +AFAVARFF+ FQN YMY GGTNFGRTAGGP TS
Sbjct: 115 NWTGWFKSWGDKDPHRTAEGVAFAVARFFQ----FQNCYMYHGGTNFGRTAGGPYSTTTS 170
Query: 290 YDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND 349
+DYDAP+DE+ I H + +
Sbjct: 171 HDYDAPLDEHVTI-----------------------------------------HATEKE 189
Query: 350 CAAFLANYDSSSDANVTFNGNVYFLPA 376
+ F N + +SDA + F G Y +PA
Sbjct: 190 SSCFFGNINETSDAVIEFRGAKYKIPA 216
>gi|255652865|ref|NP_001157373.1| beta-galactosidase [Bombyx mori]
gi|239938036|gb|ACS36117.1| beta-galactosidase [Bombyx mori]
Length = 606
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 120/346 (34%), Positives = 167/346 (48%), Gaps = 26/346 (7%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
N++ +IDGK + SGS+HY R W + + K K GL + TYV W+YHEP
Sbjct: 5 NISIVGDKFMIDGKPLHIISGSLHYFRVPAVYWRDRLHKFKAAGLNTVATYVEWSYHEPE 64
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRTTN 122
QY FEG DLVRFV+T E GL + LR+GPY CAE + GG P W L P I+ RTT+
Sbjct: 65 EKQYNFEGDRDLVRFVQTAAEVGLHVLLRVGPYICAERDLGGLPYWLLGKYPNIKLRTTD 124
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYG--NVEWAYGVG-GELYVKW 179
F E +L K+ + + +L GGPIIL QVENEYG + + AY +L
Sbjct: 125 KDFIAESDIWLKKLFE--QVSHLLFGNGGPIILVQVENEYGSYDSDLAYKEKMRDLISAH 182
Query: 180 AADTAVNLNTSVPWVMCQQEDAPDPIINTCNGF---------YCDGFTPNSPSKPIMWTE 230
D A+ T P ++ P ++ F + F P+M +E
Sbjct: 183 VGDKALLYTTDGPSLVGA---GMIPGVHATIDFGVTSQPTEQFDSLFHLRPAPGPLMNSE 239
Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA--- 287
Y GW +G + D+ + R N+Y++FGG+NF T+G
Sbjct: 240 FYPGWLTHWGERMARVGTNDIVLTL-RNMIVNKIHVNFYVFFGGSNFEFTSGANFDGTYQ 298
Query: 288 ---TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPT 330
TSYDYDAP+ E G PK+ +RE K + +E + P+
Sbjct: 299 PDITSYDYDAPLSEAG-DPTPKYYAIRETLKQLNFVDEKIEPPQPS 343
Score = 44.7 bits (104), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMG--KGQAWVNGQSIGRYW 637
V + +Y+ TF+ PEG+ PL L + G KG WVNG ++GRYW
Sbjct: 502 VTQGPTFYEGTFVLPEGQKPLDTFLDTTGWDKGYVWVNGHNLGRYW 547
>gi|227538632|ref|ZP_03968681.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33300]
gi|227241551|gb|EEI91566.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33300]
Length = 638
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 123/348 (35%), Positives = 172/348 (49%), Gaps = 44/348 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
V DGK + SG +HY R + W ++ K GL + TYVFWN+HE G + FEG
Sbjct: 41 FVYDGKTTRILSGEMHYARIPHQYWKHRLQMVKSMGLNTVATYVFWNFHEESPGNWNFEG 100
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL F+KT E GL + LR GPYACAEW++GG+P WL I G++ R N F E K+
Sbjct: 101 DHDLAAFIKTAGEVGLHVILRPGPYACAEWDFGGYPWWLQKIDGLEIRRDNAKFLEYTKK 160
Query: 132 FLAKIIDLMKQE--NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLN 188
+ ID + +E +L + GGPII+ Q ENE+G+ V + E + + A L
Sbjct: 161 Y----IDRLAKEVGSLQITNGGPIIMVQAENEFGSYVSQRKDIPLEEHKAYNAKIKKQLE 216
Query: 189 TS---VPWVMCQ-----QEDAPDPIINTCNG--------FYCDGFTPNSPSKPIMWTENY 232
+ VP + A + T NG D + N+ P M E Y
Sbjct: 217 EAGFNVPLFTSDGSWLFEGGAIPGALPTANGENNISNLKKVVDQY--NNNQGPYMVAEFY 274
Query: 233 SGWFLSFGYAVPFRPVE--DLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------- 283
GW +A PF V+ +A ++ + +F NYYM GGTNFG T+G
Sbjct: 275 PGWLDH--WAEPFAKVDAGRIARQTEKYLQNDISF-NYYMVHGGTNFGFTSGANYNNKSD 331
Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
P + TSYDYDAPI E G+ PK+ +R + I+ +Y + + P
Sbjct: 332 IQPDI-TSYDYDAPISEAGW-ATPKYDSIRTV---IQKYADYTVPAVP 374
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 70/262 (26%), Positives = 110/262 (41%), Gaps = 61/262 (23%)
Query: 444 EQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFAN 503
EQ+N + Y+ Y+ + P GK L I+ L A+V+++ V G N F N
Sbjct: 412 EQLN---QANGYVLYSKQFN-QPINGK---LKIDGLRDFAVVYIDGTKV--GELNRVFKN 462
Query: 504 FLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLS-SGEW 562
+ ++ I N +TL IL +G NYG+ G+ S +LI+ D+ +G+W
Sbjct: 463 YEMDIDIPFN---STLQILVENMGRINYGSEMIHNHKGIISPVLIN------DMEITGDW 513
Query: 563 IYQV-------GVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLA 615
Q + G+ + + N+S + PV Y+ TF E G
Sbjct: 514 TMQQLPMDKVPDLAGKQTAAIQNTKTNASKIAALTGQPV-----LYQGTFDLKE-IGDTF 567
Query: 616 LNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYH 675
+++ GKG ++NG +IGRYW TG P TLY
Sbjct: 568 IDMEKWGKGIVFINGINIGRYW------KTG----------------------PQHTLY- 598
Query: 676 IPRTWVHPGENLLVIHEELGGD 697
IP ++ G N +VI E+L +
Sbjct: 599 IPAPYLKKGSNSIVIFEQLNDE 620
>gi|443697452|gb|ELT97928.1| hypothetical protein CAPTEDRAFT_112460 [Capitella teleta]
Length = 651
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 106/300 (35%), Positives = 147/300 (49%), Gaps = 17/300 (5%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+D K + SG++HY R PE W + + + K GL +ETYV WN HE I G++ F G
Sbjct: 65 LDNKELRILSGAMHYFRIVPEYWLDRLTRMKAAGLNTVETYVPWNLHEEIHGEFVFTGML 124
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
D+ RFV ++ GL + LR GP+ C+EW +GG P WL P + R+T PF + + ++
Sbjct: 125 DIRRFVAIAEKVGLLVILRPGPFICSEWEFGGLPSWLLRDPQMDVRSTYRPFMDAARSYM 184
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVN-LNTSVP 192
+I + E++ GGPII Q+ENEYG+ EL + L TS
Sbjct: 185 RSLISEL--EDMQYQYGGPIIAMQIENEYGSYSDDVNYMQELKNIMTDSGVIEILFTSDN 242
Query: 193 WVMCQQEDAPDPIINTC------NGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFR 246
Q P + T G D P KP+M E +SGWF +
Sbjct: 243 KHGLQPGRVPGVFMTTNFKNTNEGGRMFDKLHELQPGKPLMVMEFWSGWFDHWEEKHHTM 302
Query: 247 PVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVATSYDYDAPIDEYG 300
+E+ A AV + G + N YM+ GGTNFG G P V TSYDYD+P+ E G
Sbjct: 303 SLEEYASAVEYILQQGSSI-NLYMFHGGTNFGFLNGANTEPYLPTV-TSYDYDSPLSEAG 360
>gi|300775043|ref|ZP_07084906.1| beta-galactosidase [Chryseobacterium gleum ATCC 35910]
gi|300506858|gb|EFK37993.1| beta-galactosidase [Chryseobacterium gleum ATCC 35910]
Length = 621
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 119/334 (35%), Positives = 166/334 (49%), Gaps = 37/334 (11%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++GK + SG IHYPR W + K GL + TYVFWNYHE G++ F G
Sbjct: 38 FLLNGKPFTIYSGEIHYPRVPSAYWKHRLEMMKAMGLNTVTTYVFWNYHEEAPGKWNFSG 97
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL +F+KT QE GL++ +R GPY CAEW +GG+P WL ++ R N F EE +
Sbjct: 98 EKDLQKFIKTAQETGLYVIIRPGPYVCAEWEFGGYPWWLQKNKELEIRRDNKAFSEECWK 157
Query: 132 F---LAKIIDLMKQENLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWA---ADTA 184
+ LAK I M+ N GGP+I+ Q ENE+G+ V + E + K++ +
Sbjct: 158 YISQLAKQITPMQITN-----GGPVIMVQAENEFGSYVAQRKDIPLEEHRKYSHKIKEML 212
Query: 185 VNLNTSVPWVMCQ-----QEDAPDPIINTCNGFYCDGFTPNSPSK------PIMWTENYS 233
+ SVP + + + + T NG S ++ P M E Y
Sbjct: 213 LKSGISVPLFTSDGSSLFKGGSVEGALPTANGESDIDVLKKSINEYNGGKGPYMIAEYYP 272
Query: 234 GWFLSFGYAVPFRPV--EDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA---- 287
GW +A PF V E++ + E G +F NYYM GGTNFG T+G
Sbjct: 273 GWLDH--WAEPFVKVSTEEVVKQTNLYIENGVSF-NYYMIHGGTNFGFTSGANYDKDHDI 329
Query: 288 ----TSYDYDAPIDEYGFIRQPKWGHLRELHKAI 317
TSYDYDAPI E G+ PK+ LR++ + I
Sbjct: 330 QPDLTSYDYDAPISEAGWA-TPKYNALRKIFQKI 362
>gi|300726558|ref|ZP_07060002.1| beta-galactosidase [Prevotella bryantii B14]
gi|299776172|gb|EFI72738.1| beta-galactosidase [Prevotella bryantii B14]
Length = 781
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 108/313 (34%), Positives = 152/313 (48%), Gaps = 32/313 (10%)
Query: 9 HRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYY 68
H+ +++GK +++ +HYPR W I+ K G+ I YVFWN HE G++
Sbjct: 35 HKTFLLNGKPFTVKAAELHYPRIPRPYWEHRIKMCKALGMNAICIYVFWNIHEQKEGEFN 94
Query: 69 FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEE 128
F G D+ F + Q+ G+++ +R GPY CAEW GG P WL I+ R + F E
Sbjct: 95 FTGNNDVAEFCRLAQKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIKLRERDPYFMER 154
Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGE--------LYVKWA 180
+K F K+ + + L +GGPII+ QVENEYG +YG+ + L W
Sbjct: 155 VKIFEDKVAEQLAP--LTIQRGGPIIMVQVENEYG----SYGIDKQYVGEIRDMLRQGWG 208
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWTENYS 233
D + W + D +I T N G D P P+M +E +S
Sbjct: 209 NDVKM---FQCDWSSNFTHNGLDDLIWTMNFGTGANIDNQFKKLKSLRPDAPLMCSEFWS 265
Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVA 287
GWF +G RP +D+ + G +F + YM GGT+FG AG P V
Sbjct: 266 GWFDKWGARHETRPAQDMVNNIDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFQPDV- 323
Query: 288 TSYDYDAPIDEYG 300
TSYDYDAPI+EYG
Sbjct: 324 TSYDYDAPINEYG 336
>gi|300770171|ref|ZP_07080050.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33861]
gi|300762647|gb|EFK59464.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33861]
Length = 638
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 123/348 (35%), Positives = 172/348 (49%), Gaps = 44/348 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
V DGK + SG +HY R + W ++ K GL + TYVFWN+HE G + FEG
Sbjct: 41 FVYDGKATRILSGEMHYARIPHQYWKHRLQMVKSMGLNTVATYVFWNFHEESPGNWNFEG 100
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL F+KT E GL + LR GPYACAEW++GG+P WL I G++ R N F E K+
Sbjct: 101 DHDLAAFIKTAGEVGLHVILRPGPYACAEWDFGGYPWWLQKIDGLEIRRDNAKFLEYTKK 160
Query: 132 FLAKIIDLMKQE--NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLN 188
+ ID + +E +L + GGPII+ Q ENE+G+ V + E + + A L
Sbjct: 161 Y----IDRLAKEVGSLQITNGGPIIMVQAENEFGSYVSQRKDIPLEEHKAYNAKIKKQLE 216
Query: 189 TS---VPWVMCQ-----QEDAPDPIINTCNG--------FYCDGFTPNSPSKPIMWTENY 232
+ VP + A + T NG D + N+ P M E Y
Sbjct: 217 EAGFNVPLFTSDGSWLFEGGAIPGALPTANGENNISNLKKVVDQY--NNNQGPYMVAEFY 274
Query: 233 SGWFLSFGYAVPFRPVE--DLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------- 283
GW +A PF V+ +A ++ + +F NYYM GGTNFG T+G
Sbjct: 275 PGWLDH--WAEPFAKVDAGRIARQTEKYLQNDISF-NYYMVHGGTNFGFTSGANYNNKSD 331
Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
P + TSYDYDAPI E G+ PK+ +R + I+ +Y + + P
Sbjct: 332 IQPDI-TSYDYDAPISEAGWTT-PKYDSIRTV---IQKYADYTVPAIP 374
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 73/256 (28%), Positives = 110/256 (42%), Gaps = 68/256 (26%)
Query: 455 YLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNE 514
Y+ Y+ + P GK L I+ L A+V+++ V G N F N+ ++ I N
Sbjct: 420 YVLYSKQFN-QPINGK---LKIDGLRDFAVVYIDGTKV--GELNRVFKNYEMDIDIPFN- 472
Query: 515 GINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLS-SGEWIYQVGVEGEYI 573
+TL IL +G NYG+ G+ S +LI+ D+ +G+W Q +
Sbjct: 473 --STLQILVENMGRINYGSEIIHNHKGIISPVLIN------DMEITGDWTMQ------QL 518
Query: 574 GLDKI-SLANSSFWKQGSTL---PVNKSLI--------WYKTTFLAPEGKGPLALNLASM 621
+DK+ LA KQ +T+ VN S I Y+ TF E G +++
Sbjct: 519 PMDKVPDLAG----KQTATIQNTKVNTSKIATLKGQPVLYQGTFDLKE-IGDTFIDMEKW 573
Query: 622 GKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWV 681
GKG ++NG +IGRYW TG P TLY IP ++
Sbjct: 574 GKGIVFINGINIGRYW------KTG----------------------PQHTLY-IPGPYL 604
Query: 682 HPGENLLVIHEELGGD 697
G N +VI E+L +
Sbjct: 605 KKGSNSIVIFEQLNDE 620
>gi|334134215|ref|ZP_08507725.1| putative beta-galactosidase [Paenibacillus sp. HGF7]
gi|333608023|gb|EGL19327.1| putative beta-galactosidase [Paenibacillus sp. HGF7]
Length = 940
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 115/338 (34%), Positives = 164/338 (48%), Gaps = 26/338 (7%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
V YD + +IDG+R + S ++HY R W E++ KSKE G IETYV WN+HE
Sbjct: 5 RVQYDRNSWIIDGRRVFILSAAVHYFRLPRAEWAEVLDKSKEAGCNCIETYVPWNWHEEE 64
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
GQ+ F G DL F+ E GL++ +R GPY CAEW+ GG P WL P +Q+R +
Sbjct: 65 EGQWDFSGDKDLGAFLDLCAERGLYVIVRPGPYICAEWDMGGLPYWLERKPDMQYRKFHR 124
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
F + + +++ ++ L S G +I+ QVENE+ A G + Y+++ D
Sbjct: 125 EFLHYVDLYWDRLVPVVLPRLL--SNSGTVIMVQVENEFQ----ALGKPDKAYMEYLRDG 178
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGF-----YCDGFTPNSPSKPIMWTENYSGWFLS 238
+ VP V C A D + N + + +P E + GWF
Sbjct: 179 LIERGIDVPLVTCY--GAVDGAVEFRNFWSHAEEHARTLEERFADQPKGVLEFWIGWFEQ 236
Query: 239 FGYAVPFRPVEDLAFAVAR----FFETGGTFQNYYMYFGGTNF----GRTAGG-PLVATS 289
+G R + A V R G T NYYM+FGGTNF GRT G + TS
Sbjct: 237 WGGP---RANQKTASQVERKTYELIREGFTAINYYMFFGGTNFGHWGGRTIGEHTFMTTS 293
Query: 290 YDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISS 327
YDYDA +DEY K+ L+ +H ++ E L +
Sbjct: 294 YDYDAALDEY-LRPTAKYKALKLVHDFVRWMEPLLTET 330
>gi|402813167|ref|ZP_10862762.1| beta-galactosidase Bga [Paenibacillus alvei DSM 29]
gi|402509110|gb|EJW19630.1| beta-galactosidase Bga [Paenibacillus alvei DSM 29]
Length = 580
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 115/349 (32%), Positives = 173/349 (49%), Gaps = 39/349 (11%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
++Y+ + +++GK L SG++HY R PE W + +RK K G +ETY+ WN HEP
Sbjct: 4 LSYEDQHFMLEGKPIQLISGAVHYFRIVPEYWEDRLRKVKAMGCNCVETYIAWNVHEPRD 63
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
GQ+ F+G D+V F++ Q L + +R PY CAEW +GG P WL I+ R ++
Sbjct: 64 GQFNFDGIADVVEFIRIAQRVDLLVIVRPSPYICAEWEFGGMPAWL-LKEDIRLRCSDPR 122
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
F E++ + +I +K L ++ GGPII Q+ENEYG +YG + Y++ +
Sbjct: 123 FLEKVSAYYDALIPQLKP--LLSTSGGPIIAVQIENEYG----SYG-NDQAYLQALRNML 175
Query: 185 VNLNTSV-------PWVMCQQEDAPDPIINTCNGFYCDGFTPN---------SPSKPIMW 228
V V P Q + ++ T N G P P+ P+M
Sbjct: 176 VERGIDVLLFTSDGPADDMLQGGMTEGVLATVNF----GSRPKEAFGKLEEYQPNAPLMC 231
Query: 229 TENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG----- 283
E ++GWF + R ED A + G + N+YM GGTNFG ++G
Sbjct: 232 MEYWNGWFDHWFEEHHTRSAEDAAQVLDEMLSMGASV-NFYMLHGGTNFGFSSGANHGGR 290
Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIKLCEEYLISSDP 329
P V TSYDYD+ I E G I PK+ R+ + K + L E+ + + P
Sbjct: 291 YKPTV-TSYDYDSAISEAGDI-TPKYQLFRKVIGKYVSLSEDDMPQNTP 337
>gi|312378199|gb|EFR24839.1| hypothetical protein AND_10320 [Anopheles darlingi]
Length = 639
Score = 166 bits (419), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 113/327 (34%), Positives = 163/327 (49%), Gaps = 37/327 (11%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
S + Y+ V+DGK +GS HY R+ P+ W +R + GGL ++ YV W+ H
Sbjct: 23 SFTIDYERDTFVMDGKDFRYVAGSFHYFRALPQTWRTKLRTLRAGGLNAVDLYVQWSLHN 82
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWL-HFIPGIQFRT 120
P G Y +EG ++ ++ E L++ LR GPY CAE + GG P WL + PGIQ RT
Sbjct: 83 PRDGVYSWEGIANVTDIIEAAIEEDLYVILRPGPYICAEIDNGGLPYWLFNKYPGIQVRT 142
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYV--- 177
++ + E+K++ +++ M E GGPII+ Q+ENEYG A+G + Y+
Sbjct: 143 SDANYLAEVKKWYGELMSRM--EPYMYGNGGPIIMVQIENEYG----AFGKCDKPYLNFL 196
Query: 178 -----KWAADTAVNLNTSVPW---VMCQQEDAPDPIINTCNGFYCDGFTPN--------S 221
++ D AV P+ + C Q D I T G D
Sbjct: 197 KEETNRYVQDKAVLFTVDRPYDDEIGCGQIDG--VFITTDFGLMTDEEVDTHAAKVRSYQ 254
Query: 222 PSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTA 281
P P++ TE Y+GW + + RP LA + + + G ++YMYFGGTNFG A
Sbjct: 255 PKGPLVNTEFYTGWLTHWQESNQRRPAGPLAATLRKMLKDGWNV-DFYMYFGGTNFGFWA 313
Query: 282 G------GPLVA--TSYDYDAPIDEYG 300
G G +A TSYDYDAP+DE G
Sbjct: 314 GANDWGLGKYMADITSYDYDAPMDEAG 340
>gi|251795198|ref|YP_003009929.1| beta-galactosidase [Paenibacillus sp. JDR-2]
gi|247542824|gb|ACS99842.1| Beta-galactosidase [Paenibacillus sp. JDR-2]
Length = 584
Score = 166 bits (419), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 105/333 (31%), Positives = 166/333 (49%), Gaps = 39/333 (11%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+T + L+++ + + +G+IHY R PE W + + K K G +ETYV WN+HEP
Sbjct: 4 LTIQGKQLMLNDRPFRIIAGAIHYFRVVPEYWRDRLLKLKACGFNTVETYVPWNFHEPEE 63
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G++ FEG DL +F+ E GL+ +R PY CAEW +GG P WL PG++ R + P
Sbjct: 64 GRFVFEGMADLEKFIALAGELGLYAIVRPSPYICAEWEFGGLPAWLLKDPGMRLRCSYKP 123
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
F ++ + ++I + +++GGP+I Q+ENEYG +YG + Y+ + +
Sbjct: 124 FLDKADAYYDELIPRLTP--FLSTKGGPLIAMQIENEYG----SYG-NDKTYLNYLKEAL 176
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDG-----------------FTPNSPSKPIM 227
V V+ D P+ + G +G P +P+M
Sbjct: 177 VKRGVD---VLLFTSDGPEDFM--LQGGMVEGVWETVNFGSRSAEAFAKLQEYQPDQPLM 231
Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG---- 283
E ++GWF +G R D+A + G + N+YM+ GGTNFG +G
Sbjct: 232 CMEFWNGWFDHWGETHHTRGAADVALVLDEMLAAGASV-NFYMFHGGTNFGFFSGANYTD 290
Query: 284 ---PLVATSYDYDAPIDEYGFIRQPKWGHLREL 313
P V TSYDYD+P+ E G + + K+ +RE+
Sbjct: 291 RLLPTV-TSYDYDSPLSESGELTE-KYYAVREV 321
>gi|260813304|ref|XP_002601358.1| hypothetical protein BRAFLDRAFT_114709 [Branchiostoma floridae]
gi|229286653|gb|EEN57370.1| hypothetical protein BRAFLDRAFT_114709 [Branchiostoma floridae]
Length = 638
Score = 165 bits (418), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 109/324 (33%), Positives = 161/324 (49%), Gaps = 23/324 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+DGK + SG+IHY R E W + + K K GL +ETYV WN HEP +G++ F G
Sbjct: 18 FTLDGKPVQILSGAIHYFRVPREYWRDRMLKLKACGLNTLETYVCWNLHEPEKGKFDFTG 77
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ +++ GL++ R GPY CAEW+YGG P WL P +Q RTT P+ E ++R
Sbjct: 78 MLDIAAYLREAANLGLWVIFRPGPYICAEWDYGGLPSWLLRDPNMQVRTTYQPYMEAVER 137
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
F ++ ++K +GGPII QVENEYG+ + Y + ++ + L +
Sbjct: 138 FFDALLPIVKP--FQYKEGGPIIAMQVENEYGSYARDDKYLTAVKQAIQKRGIEELLLTS 195
Query: 190 SVPWVMCQQEDAPDPIINTCNGFY-----CDGFTPNSPSKPIMWTENYSGWFLSFG---Y 241
+ + ++ T N + P++P M E +SGWF +G +
Sbjct: 196 DGGQIERLERGCIPGVLMTANFNFNPKKQLGALKKLQPNRPQMVMEFWSGWFDHWGRDHH 255
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV------ATSYDYDAP 295
+ E L + RF + N+YM+ GGTNFG G + TSYDYDAP
Sbjct: 256 KLHVEKFEQLLGDILRFPSS----VNFYMFHGGTNFGFMNGANYINGYKPDVTSYDYDAP 311
Query: 296 IDEYGFIRQPKWGHLRELHKAIKL 319
+ E G PK+ REL K + +
Sbjct: 312 LSEAG-DPTPKYYKTRELLKTLAM 334
>gi|398787680|ref|ZP_10550020.1| beta-galactosidase [Streptomyces auratus AGR0001]
gi|396992782|gb|EJJ03876.1| beta-galactosidase [Streptomyces auratus AGR0001]
Length = 603
Score = 165 bits (418), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 110/339 (32%), Positives = 164/339 (48%), Gaps = 36/339 (10%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+T + ++DGK + SG+ HY R+ P+ W + + + + GL +ETYV WN+H+P
Sbjct: 27 LTIRGKEFLLDGKPFRILSGAFHYFRTHPQDWRDRLMRMRAMGLNTVETYVAWNFHQPDE 86
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
+ F G D+V FV+T E GL + +R GPY CAEW++GG P WL R ++
Sbjct: 87 KEADFTGWRDVVAFVRTADEVGLKVIVRPGPYICAEWDFGGLPAWLLKDKDAPLRRSDPA 146
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYG-------------NVEWAYGV 171
F+ + + A++ L + +L A++GGPII QVENEYG + A G+
Sbjct: 147 FERAVDAWFAEL--LPRFVDLQATRGGPIIAMQVENEYGSYGDDHAYLEHLRDTMRAQGI 204
Query: 172 GGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTEN 231
G L+ A S+P ++ DP G + + P KP+ TE
Sbjct: 205 DGLLFCSNGATQEALKAGSLPDLLSTVNFGGDP-----TGPFAE-LRAFQPDKPLFCTEF 258
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------ 285
+ GWF +G A V + E G + N+YM GGTNFG +AG L
Sbjct: 259 WDGWFDHWGERHRTTDPAQTAADVEKMLEAGASI-NFYMAVGGTNFGWSAGANLSGSGYQ 317
Query: 286 -VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEY 323
TSYDYD+PI E G + + + HK + +Y
Sbjct: 318 PTVTSYDYDSPISESGELTE-------KFHKVRDVLGKY 349
>gi|260912222|ref|ZP_05918774.1| beta-galactosidase [Prevotella sp. oral taxon 472 str. F0295]
gi|260633656|gb|EEX51794.1| beta-galactosidase [Prevotella sp. oral taxon 472 str. F0295]
Length = 627
Score = 165 bits (418), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 113/337 (33%), Positives = 160/337 (47%), Gaps = 50/337 (14%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE- 70
V +GK L SG +HY R W ++ K GL + TYVFWNYHE G++ ++
Sbjct: 42 FVYNGKPMQLHSGEMHYARVPAPYWRHRMKMMKAMGLNAVATYVFWNYHETEPGKWDWKT 101
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G +L +FVKT E G+ + LR GPY CAEW++GG+P WL G+ R N PF + +
Sbjct: 102 GNRNLRQFVKTAAEEGMLVILRPGPYCCAEWDFGGYPWWLSKAKGLVIRADNQPFLDSCR 161
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
++ ++ M+ +L ++GGPII+ Q ENE+G+ YV D + + +
Sbjct: 162 VYINQLASQMR--DLQITKGGPIIMVQAENEFGS-----------YVAQRKDVPLESHRA 208
Query: 191 VPWVMCQQ--EDAPDPIINTCNGFY------CDGFTP------------------NSPSK 224
+ QQ + D + T +G + +G P N
Sbjct: 209 YSAKIKQQLIDAGFDVPLFTSDGSWLFKGGTIEGALPTANGENDIEKLKKVVNEYNGGKG 268
Query: 225 PIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGP 284
P M E Y GW + P E + A++ E G +F NYYM GGTNFG T+G
Sbjct: 269 PYMVAEFYPGWLSHWAEPFPQVSTESIVKQTAKYLENGVSF-NYYMVHGGTNFGFTSGAN 327
Query: 285 LVA--------TSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDAPI E G+ PK+ LR L
Sbjct: 328 YTTATNLQSDLTSYDYDAPISEAGW-NTPKYDALRAL 363
Score = 44.7 bits (104), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 60/222 (27%), Positives = 94/222 (42%), Gaps = 55/222 (24%)
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGIN-TLDILSMMVGLQNYG 532
+ I L ALV+VN + V G D + + IE+N N LDIL +G NYG
Sbjct: 437 MKIAGLADYALVYVNGQKV----GELDRVSDV--DSIEINMPFNGVLDILVENMGRINYG 490
Query: 533 AWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGST 591
A + G+ ++ID + +G W +Y++ + E ++ + AN
Sbjct: 491 ARIPQSIKGINGPVVID-----GNEITGNWQMYKLPMN-EAPDVNALPTAN--------- 535
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
NK L + + G LN+ + GKG ++NG ++GRYW
Sbjct: 536 ---NKGLPTLYSGTFNLDTTGDTFLNMETWGKGIVFINGFNLGRYWK------------- 579
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEE 693
RG P QTLY +P ++ GEN +V+ E+
Sbjct: 580 --RG-------------PQQTLY-LPGCFLKKGENKIVVFEQ 605
>gi|158301280|ref|XP_550752.3| AGAP002055-PA [Anopheles gambiae str. PEST]
gi|157012394|gb|EAL38488.3| AGAP002055-PA [Anopheles gambiae str. PEST]
Length = 657
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 112/327 (34%), Positives = 163/327 (49%), Gaps = 37/327 (11%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
S + Y+ V+DGK +GS HY R+ PE W +R + GGL ++ YV W+ H
Sbjct: 42 SFKIDYERDTFVMDGKDFRYVAGSFHYFRALPETWRTKLRTLRAGGLNAVDLYVQWSLHN 101
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWL-HFIPGIQFRT 120
P G Y +EG ++ ++ E L++ LR GPY CAE + GG P WL + PGI RT
Sbjct: 102 PRDGVYNWEGIANVTDIIEAAIEEDLYVILRPGPYICAEIDNGGLPYWLFNKYPGIAVRT 161
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYV--- 177
++ + EE++++ +++ M E GGPII+ Q+ENEYG A+G + Y+
Sbjct: 162 SDANYLEEVRKWYGELMSRM--EPYMYGNGGPIIMVQIENEYG----AFGKCDKPYLNFL 215
Query: 178 -----KWAADTAVNLNTSVPW---VMCQQEDAPDPIINTCNGFYCDGFTPN--------S 221
++ D AV P+ + C Q D I T G +
Sbjct: 216 KQQTERYVQDKAVLFTVDRPYDDEIGCGQIDG--VFITTDFGLMTEEEVDTHAAKVRSYQ 273
Query: 222 PSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTA 281
P P++ TE Y+GW + + RP + LA + + G ++YMYFGGTNFG A
Sbjct: 274 PKGPLVNTEFYTGWLTHWQESNQRRPAQPLAATLRKMLRDGWNV-DFYMYFGGTNFGFWA 332
Query: 282 G------GPLVA--TSYDYDAPIDEYG 300
G G +A TSYDYDAP+DE G
Sbjct: 333 GANDWGLGKYMADITSYDYDAPMDEAG 359
>gi|254384398|ref|ZP_04999740.1| beta-galactosidase [Streptomyces sp. Mg1]
gi|194343285|gb|EDX24251.1| beta-galactosidase [Streptomyces sp. Mg1]
Length = 588
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 105/304 (34%), Positives = 151/304 (49%), Gaps = 26/304 (8%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DG+ + SG +HY R P +W + + K++ GL +ETYV WN H+P ++ +G
Sbjct: 18 LDGEPFRILSGGLHYFRVHPGLWRDRLHKARLMGLNTVETYVPWNLHQPRPDEFRMDGGL 77
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL RF+ GL + LR GPY CAEW GG P WL P ++ R+ + F + +
Sbjct: 78 DLPRFLDLAAAEGLHVLLRPGPYICAEWEGGGLPSWLLADPAMRLRSRDPNFLAAVDDYF 137
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
+++ + + AS+GGP++ QVENEYG AYG Y++ AD+ VP
Sbjct: 138 RRLLPPL--HDRLASRGGPVLAVQVENEYG----AYG-DDTAYLEHLADSLRRHGVDVPL 190
Query: 194 VMCQQ-----EDAPDPIINTCN-----GFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
C Q A ++ T N + PS P++ TE + GWF +G
Sbjct: 191 FTCDQPADLERGALAGVLATANFGSRPAAHLATLRTARPSAPLLCTEFWIGWFDRWGGNH 250
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PLVATSYDYDAPI 296
R E + + TG + N+YM+ GGTNFG G P V TSYDYDAP+
Sbjct: 251 VVRDAEQASQELDELLATGASV-NFYMFHGGTNFGFMNGANDKHTYRPTV-TSYDYDAPL 308
Query: 297 DEYG 300
DE G
Sbjct: 309 DEAG 312
>gi|196002910|ref|XP_002111322.1| hypothetical protein TRIADDRAFT_1215 [Trichoplax adhaerens]
gi|190585221|gb|EDV25289.1| hypothetical protein TRIADDRAFT_1215, partial [Trichoplax
adhaerens]
Length = 543
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 117/318 (36%), Positives = 164/318 (51%), Gaps = 40/318 (12%)
Query: 21 LQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVK 80
++SG+IHY R PE W + + K K GL +ETYV WN HEP+ GQ+ + G ++ +F+
Sbjct: 13 IRSGAIHYFRVVPEYWRDRLLKMKAFGLNTVETYVPWNLHEPVPGQFDYTGILNVRKFIL 72
Query: 81 TVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLM 140
QE G ++ LR GPY CAEW +GG P WL +Q R+T PFK+ + RF I +
Sbjct: 73 LAQELGFYVILRPGPYICAEWEFGGMPSWLLSDKNMQVRSTYKPFKDAVNRFFDGFIPEI 132
Query: 141 KQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVN------LNTSVPWV 194
K +L AS+GGPII QVENEYG +YG E Y+++ D +N L TS
Sbjct: 133 K--SLQASKGGPIIAVQVENEYG----SYG-SDEEYMQFIRDALINRGIVELLVTSDNSE 185
Query: 195 MCQQEDAPDPIINTCNGFYCDGFTPNSPS-------KPIMWTENYSGWFLSFGYAVPFRP 247
+ AP ++ T N G + S P + E +SGWF +G
Sbjct: 186 GIKHGGAPG-VLKTYN---FQGHAKSHLSILERLQDAPSIVMEFWSGWFDHWGEKN--HQ 239
Query: 248 VEDLAFAVARF---FETGGTFQNYYMYFGGTNFGRTAGGPLV---------ATSYDYDAP 295
V +A F + +F N+Y++ GGTNFG G + TSYDYDAP
Sbjct: 240 VHTIAHVTNTFKDILDCDASF-NFYVFHGGTNFGFMNGANFIDFFSYYLPTVTSYDYDAP 298
Query: 296 IDEYGFIRQPKWGHLREL 313
+ E G I + K+ LR++
Sbjct: 299 LSEAGDITE-KYMELRKI 315
>gi|296081427|emb|CBI16778.3| unnamed protein product [Vitis vinifera]
Length = 242
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 77/124 (62%), Positives = 90/124 (72%), Gaps = 4/124 (3%)
Query: 206 INTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTF 265
INTCN FYCD FTPNSP+KP MWTEN+ GW +FG P P ED+ F+VARFF
Sbjct: 120 INTCNSFYCDQFTPNSPNKPKMWTENWPGWSKTFGALDPHGPREDIVFSVARFFWK---- 175
Query: 266 QNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLI 325
NYYM GGTNFGRT+GGP + T+YDY+APIDEYG R PK GHL+EL +AIK CE L+
Sbjct: 176 VNYYMDHGGTNFGRTSGGPFITTTYDYNAPIDEYGLARLPKCGHLKELRRAIKSCEHVLL 235
Query: 326 SSDP 329
+P
Sbjct: 236 YGEP 239
>gi|386725149|ref|YP_006191475.1| beta-galactosidase [Paenibacillus mucilaginosus K02]
gi|384092274|gb|AFH63710.1| beta-galactosidase [Paenibacillus mucilaginosus K02]
Length = 591
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 108/305 (35%), Positives = 150/305 (49%), Gaps = 28/305 (9%)
Query: 15 DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
DG+ L SG+IHY R PE W + +RK K G +ETYV WN HEP G++ FEG D
Sbjct: 14 DGEELRLYSGAIHYFRIVPEYWEDRLRKLKACGFNTVETYVPWNLHEPQEGRFVFEGMAD 73
Query: 75 LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
L RF++ GL + +R PY CAEW +GG P WL PG++ R + + ++ +
Sbjct: 74 LERFIRLAGRLGLHVIVRPSPYICAEWEFGGLPAWLLAEPGMKLRCADPLYLSKVDAYYD 133
Query: 135 KIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW- 193
++I + L + GGP+IL QVENEYG +YG + Y++ D V VP
Sbjct: 134 ELIPRLVP--LLCTSGGPVILVQVENEYG----SYG-SDKAYLEHLRDGLVRRGIDVPLF 186
Query: 194 -------VMCQQEDAPDPIINTCN--GFYCDGFTP---NSPSKPIMWTENYSGWFLSFGY 241
M Q P ++ T N + F P P+M E ++GWF +
Sbjct: 187 TSDGPTDAMLQGGSLPG-VLATVNFGSRTAESFAKLREYQPQGPLMCMEYWNGWFDHWME 245
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------TSYDYDAP 295
R D A E G + N+YM+ GGTNFG G + TSYDYD+P
Sbjct: 246 EHHQRDAADAARVFGEMLEAGASV-NFYMFHGGTNFGFYNGANHIKTYEPTITSYDYDSP 304
Query: 296 IDEYG 300
+ E+G
Sbjct: 305 LTEWG 309
>gi|429739263|ref|ZP_19273023.1| glycosyl hydrolase family 35 [Prevotella saccharolytica F0055]
gi|429157228|gb|EKX99829.1| glycosyl hydrolase family 35 [Prevotella saccharolytica F0055]
Length = 786
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 114/324 (35%), Positives = 161/324 (49%), Gaps = 27/324 (8%)
Query: 10 RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
+ +++GK V+++ +HYPR W IR K G+ I YVFWN HE G++ F
Sbjct: 35 KTFLLNGKPFVIKAAELHYPRIPRPYWEHRIRMCKALGMNTICLYVFWNIHEQQEGKFNF 94
Query: 70 EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
G D+ F + Q+ GL++ +R GPY CAEW GG P WL I+ R + F E +
Sbjct: 95 TGNNDVAAFCRLAQKHGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLRERDPYFMERV 154
Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
K F ++ + + L +GGPII+ QVENEYG +YGV E YV D +
Sbjct: 155 KVFEQQVGNQLAP--LTIDKGGPIIMVQVENEYG----SYGVDKE-YVSQIRDIVRSSGF 207
Query: 190 S------VPWVMCQQEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWF 236
W +++ D +I T N G D P P M +E +SGWF
Sbjct: 208 DKVALFQCDWASNFEKNGLDDLIWTMNFGTGANIDEQFKRLGELRPQSPKMCSEFWSGWF 267
Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA---TSYD 291
+G RP +++ + G +F + YM GGT+FG AG P A TSYD
Sbjct: 268 DKWGARHETRPAKNMVAGIDEMLTKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYD 326
Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
YDAPI+EYG + PK+ LR + +
Sbjct: 327 YDAPINEYG-LATPKYYELRAMMQ 349
>gi|347967091|ref|XP_001689312.2| AGAP002056-PA [Anopheles gambiae str. PEST]
gi|333469762|gb|EDO63217.2| AGAP002056-PA [Anopheles gambiae str. PEST]
Length = 629
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 112/334 (33%), Positives = 157/334 (47%), Gaps = 33/334 (9%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
++ YD+ V+DGK +GS HY R+ PE WP ++R + GL I TYV W+ H P
Sbjct: 27 SIDYDNDTFVMDGKPFQYVAGSFHYFRALPESWPSILRSMRAAGLNAITTYVEWSLHNPK 86
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRTTN 122
Y ++G D+ F++ AGL++ LR GPY CAE + GGFP W LH P I RT +
Sbjct: 87 EDVYNWQGMADIEHFLELADSAGLYVILRPGPYICAERDMGGFPSWLLHKYPDILLRTND 146
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
+ E++ + A++ L + + QGGPII+ QVENEYG ++ Y+ W D
Sbjct: 147 LRYLREVRTWYAQL--LSRVQRFLVGQGGPIIMVQVENEYG----SFYACDHKYLNWLRD 200
Query: 183 --------TAVNLNTSVPWV--------MCQQEDAPDPIINTCNGFYCDGFTPNSPSKPI 226
AV + P + + D + NGF+ P P+
Sbjct: 201 ETERYVMGNAVLFTNNGPGLEGCGAIEHVLSSLDFGPGTEDEINGFWST-LRKTQPKGPL 259
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV 286
+ E Y GW + R F N YM+FGGTN+G TAG +
Sbjct: 260 VNAEYYPGWLTHWQEPHMARTDTKPVVDSLDFMLRNKVNVNIYMFFGGTNYGFTAGANNM 319
Query: 287 A--------TSYDYDAPIDEYGFIRQPKWGHLRE 312
TSYDYDAP+DE G PK+ LR+
Sbjct: 320 GAGGYAADLTSYDYDAPLDESG-DPTPKYFALRD 352
>gi|224027078|ref|ZP_03645444.1| hypothetical protein BACCOPRO_03839 [Bacteroides coprophilus DSM
18228]
gi|224020314|gb|EEF78312.1| hypothetical protein BACCOPRO_03839 [Bacteroides coprophilus DSM
18228]
Length = 783
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 113/324 (34%), Positives = 159/324 (49%), Gaps = 29/324 (8%)
Query: 9 HRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYY 68
++ +++GK ++++ IHY R E W I K G+ I Y FWN HE G++
Sbjct: 37 NKEFLLNGKPFLIKAAEIHYTRIPAEYWEHRIEMCKALGMNTICIYAFWNIHEQRPGEFD 96
Query: 69 FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEE 128
FEG+ D+ RF + Q+ G+++ LR GPY C+EW GG P WL I RT++ F E
Sbjct: 97 FEGQNDVARFCRLAQKHGMYIMLRPGPYVCSEWEMGGLPWWLLKKKDIALRTSDPYFLER 156
Query: 129 MKRFLAKIIDLMKQ-ENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNL 187
K F+ +L KQ +L A +GG II+ QVENEYG AY E Y+ D
Sbjct: 157 TKIFMN---ELGKQLADLQAPRGGNIIMVQVENEYG----AYAEDKE-YIASIRDIVRGA 208
Query: 188 N-TSVPWVMCQ-----QEDAPDPIINTCN-GFYCD------GFTPNSPSKPIMWTENYSG 234
T VP C Q + D ++ T N G D P P+M +E +SG
Sbjct: 209 GFTDVPLFQCDWASTFQRNGLDDLLWTINFGTGADIDQQFKALREARPETPLMCSEYWSG 268
Query: 235 WFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATS 289
WF +G RP + + + + +F + YM GGT FG G + +S
Sbjct: 269 WFDHWGRKHETRPADVMVKGIKDMMDRNISF-SLYMTHGGTTFGHWGGANSPSYSAMCSS 327
Query: 290 YDYDAPIDEYGFIRQPKWGHLREL 313
YDYDAPI E G+ PK+ LR+L
Sbjct: 328 YDYDAPISEAGWA-TPKYYQLRDL 350
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 50/107 (46%), Gaps = 30/107 (28%)
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
+Y+TTF + G L++++ GKG WVNG ++GR+W
Sbjct: 536 YYRTTFTL-DKTGDTFLDMSTWGKGMVWVNGHAMGRFWKI-------------------- 574
Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
P QTL+ +P W+ G+N +V+ + LG D +KI L +
Sbjct: 575 --------GPQQTLF-MPGCWLKKGKNEIVVLDLLGPDETKIEGLKQ 612
>gi|156552637|ref|XP_001603160.1| PREDICTED: beta-galactosidase-like [Nasonia vitripennis]
Length = 629
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 113/330 (34%), Positives = 167/330 (50%), Gaps = 22/330 (6%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
S + Y++ ++DGK SGS HY R+ + W ++RK + GGL + TYV W+ HE
Sbjct: 30 SFAIDYENDQFLLDGKPFRYVSGSFHYFRTPRQHWRGILRKMRAGGLNAVSTYVEWSMHE 89
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRT 120
P Q+ ++G D+V F+K QE LF+ LR GPY CAE ++GGFP W L +P I+ RT
Sbjct: 90 PEFDQWVWDGDADIVEFIKIAQEEDLFVILRPGPYICAERDFGGFPYWLLSRVPDIKLRT 149
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV----EWAYGVGGELY 176
+ + +RFL +I L + + L GGPII+ QVENEYG+ + E++
Sbjct: 150 KDERYVFYAERFLNEI--LRRTKPLLRGNGGPIIMVQVENEYGSFYACDDQYKSKMYEIF 207
Query: 177 VKWAADTAVNLNT---SVPWVMCQQEDAPDPIINTCNG----FYCDGFTPNSPSKPIMWT 229
+ + AV T + + C I+ NG F SP P++ +
Sbjct: 208 HRHVKNDAVLFTTDGSARSMLKCGSIPGVYATIDFGNGANVPFNYKIMREFSPKGPLVNS 267
Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL---- 285
E Y GW +G + ++A + + N YMY+GGTNF T+G +
Sbjct: 268 EYYPGWLTHWGESFQRVNSHNVAKTLDEMLAYNVSV-NIYMYYGGTNFAFTSGANINEHY 326
Query: 286 --VATSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDAP+ E G PK+ LR++
Sbjct: 327 WPQLTSYDYDAPLTEAG-DPTPKYFELRDV 355
>gi|300789308|ref|YP_003769599.1| beta-galactosidase [Amycolatopsis mediterranei U32]
gi|384152800|ref|YP_005535616.1| beta-galactosidase [Amycolatopsis mediterranei S699]
gi|399541188|ref|YP_006553850.1| beta-galactosidase [Amycolatopsis mediterranei S699]
gi|299798822|gb|ADJ49197.1| beta-galactosidase [Amycolatopsis mediterranei U32]
gi|340530954|gb|AEK46159.1| beta-galactosidase [Amycolatopsis mediterranei S699]
gi|398321958|gb|AFO80905.1| beta-galactosidase [Amycolatopsis mediterranei S699]
Length = 584
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 106/321 (33%), Positives = 162/321 (50%), Gaps = 29/321 (9%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DG+ + SG++HY R P++W + I K++ GL IETYV WN H P G + G
Sbjct: 11 FLLDGRPFRILSGALHYFRVHPDLWADRIDKARRMGLNTIETYVAWNAHAPEPGTFDLSG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF++ V +AG++ +R GPY CAEW+ GG P WL P + R + + ++
Sbjct: 71 GLDLDRFLRLVADAGMYAIVRPGPYICAEWDNGGLPAWLFRDPSVGVRRYEPKYLDAVRE 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+L K+ +++ + +GGP++L QVENEYG A+G + Y+K A+ +V
Sbjct: 131 YLTKVYEVVVPHQI--DRGGPVLLVQVENEYG----AFG-DDKRYLKALAEHTREAGVTV 183
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDG------------FTPNSPSKPIMWTENYSGWFLSF 239
P Q + +G + + P+ P+M +E ++GWF +
Sbjct: 184 PLTTVDQPTPEMLEAGSLDGLHRTASFGSGAEARLAILRAHQPTGPLMCSEFWNGWFDHW 243
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PLVATSYDY 292
G D A + G + N YM+ GGTNFG T G PL+ TSYDY
Sbjct: 244 GAHHHTTSAADSAAELDALLAAGASV-NLYMFHGGTNFGLTNGANDKGVYQPLI-TSYDY 301
Query: 293 DAPIDEYGFIRQPKWGHLREL 313
DAP+DE G PK+ R++
Sbjct: 302 DAPLDEAG-DPTPKYHAFRDV 321
>gi|329960238|ref|ZP_08298680.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
gi|328532911|gb|EGF59688.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
Length = 778
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 110/323 (34%), Positives = 157/323 (48%), Gaps = 27/323 (8%)
Query: 9 HRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYY 68
++ ++DGK ++++ +HY R E W I+ K G+ I Y FWN HE G++
Sbjct: 35 NQTFLLDGKPFIIKAAEMHYTRIPAEYWEHRIQMCKALGMNTICIYAFWNIHEQRPGEFD 94
Query: 69 FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEE 128
F+G+ D+ F + Q+ G+++ LR GPY C+EW GG P WL IQ RT + F E
Sbjct: 95 FKGQNDIAEFCRLAQKNGMYIMLRPGPYVCSEWEMGGLPWWLLKKKDIQLRTNDPYFLER 154
Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN 188
K F+ +I + +L A +GG II+ QVENEYG Y V E Y+ D
Sbjct: 155 TKLFMNEIGKQLA--DLQAPRGGNIIMVQVENEYG----GYAVNKE-YIANVRDIVRGAG 207
Query: 189 -TSVPWVMCQ-----QEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWTENYSGW 235
T VP C Q + D ++ T N G D P P+M +E +SGW
Sbjct: 208 FTDVPLFQCDWSSTFQLNGLDDLLWTINFGTGANIDAQFKSLKEARPDAPLMCSEFWSGW 267
Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSY 290
F +G R E + + + +F + YM GGT FG G + +SY
Sbjct: 268 FDHWGRKHETRDAETMVSGLKDMLDRNISF-SLYMAHGGTTFGHWGGANCPPYSAMCSSY 326
Query: 291 DYDAPIDEYGFIRQPKWGHLREL 313
DYDAPI E G+ PK+ LRE+
Sbjct: 327 DYDAPISEAGWA-TPKYYKLREM 348
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 63/151 (41%), Gaps = 35/151 (23%)
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
+Y+ +F E G + L++ + GKG WVNG++IGR+W
Sbjct: 532 YYRASFNLKE-TGDVFLDMQTWGKGMVWVNGKAIGRFWEI-------------------- 570
Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEAD 719
P QTLY +P W+ G+N +V+ + LG D ++I L Q I + +
Sbjct: 571 --------GPQQTLY-MPGCWLKKGKNEIVVLDLLGPDKAEIKGLK---QPILDMLRSEE 618
Query: 720 PPPVDSWKPNLGVVSSSPQV--RLACERGWH 748
P NL + + P +A GW
Sbjct: 619 PLTHRKEGENLNLKNEKPVAAGEMAAGNGWQ 649
>gi|379722393|ref|YP_005314524.1| beta-galactosidase [Paenibacillus mucilaginosus 3016]
gi|378571065|gb|AFC31375.1| beta-galactosidase [Paenibacillus mucilaginosus 3016]
Length = 591
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 108/305 (35%), Positives = 150/305 (49%), Gaps = 28/305 (9%)
Query: 15 DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
DG+ L SG+IHY R PE W + +RK K G +ETYV WN HEP G++ FEG D
Sbjct: 14 DGEEIRLYSGAIHYFRIVPEYWEDRLRKLKACGFNTVETYVPWNLHEPQEGRFVFEGMAD 73
Query: 75 LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
L RF++ GL + +R PY CAEW +GG P WL PG++ R + + ++ +
Sbjct: 74 LERFIRLAGRLGLHVIVRPSPYICAEWEFGGLPAWLLAEPGMKLRCADPLYLSKVDAYYD 133
Query: 135 KIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWV 194
++I + L + GGP+IL QVENEYG +YG + Y++ D V VP
Sbjct: 134 ELIPRLVP--LLCTSGGPVILVQVENEYG----SYG-SDKAYLEHLRDGLVRRGIDVPLF 186
Query: 195 --------MCQQEDAPDPIINTCN--GFYCDGFTP---NSPSKPIMWTENYSGWFLSFGY 241
M Q P ++ T N + F P P+M E ++GWF +
Sbjct: 187 TSDGPTDSMLQGGSLPG-VLATVNFGSRTAESFAKLREYQPQGPLMCMEYWNGWFDHWME 245
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------TSYDYDAP 295
R D A E G + N+YM+ GGTNFG G + TSYDYD+P
Sbjct: 246 EHHQRDAADAARVFGEMLEAGASV-NFYMFHGGTNFGFHNGANHIKTYEPTITSYDYDSP 304
Query: 296 IDEYG 300
+ E+G
Sbjct: 305 LTEWG 309
>gi|288928311|ref|ZP_06422158.1| beta-galactosidase (Lactase) [Prevotella sp. oral taxon 317 str.
F0108]
gi|288331145|gb|EFC69729.1| beta-galactosidase (Lactase) [Prevotella sp. oral taxon 317 str.
F0108]
Length = 674
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 113/337 (33%), Positives = 159/337 (47%), Gaps = 50/337 (14%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE- 70
V +GK L SG +HY R W ++ K GL + TYVFWNYHE G++ ++
Sbjct: 89 FVYNGKPMQLHSGEMHYARVPAPYWRHRMKMMKAMGLNAVATYVFWNYHETEPGKWDWKT 148
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G +L +FVKT E G+ + LR GPY CAEW +GG+P WL G+ R N PF + +
Sbjct: 149 GNRNLRQFVKTAAEEGMLVILRPGPYCCAEWEFGGYPWWLSKAKGLVIRADNQPFLDSCR 208
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
++ ++ M+ +L ++GGPII+ Q ENE+G+ YV D + + +
Sbjct: 209 VYINQLASQMR--DLQITKGGPIIMVQAENEFGS-----------YVAQRKDIPLETHRA 255
Query: 191 VPWVMCQQ--EDAPDPIINTCNGFY------CDGFTP------------------NSPSK 224
+ QQ + D + T +G + +G P N
Sbjct: 256 YSAKIKQQLLDAGFDVPLFTSDGSWLFKGGTIEGALPTANGESDIEKLKKVVNEYNGGKG 315
Query: 225 PIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGP 284
P M E Y GW + P E + A++ E G +F NYYM GGTNFG T+G
Sbjct: 316 PYMVAEFYPGWLSHWAEPFPQVSTESIVKQTAKYLENGISF-NYYMVHGGTNFGFTSGAN 374
Query: 285 LVA--------TSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDAPI E G+ PK+ LR L
Sbjct: 375 YTTATNLQPDLTSYDYDAPISEAGW-NTPKYDALRAL 410
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 62/229 (27%), Positives = 95/229 (41%), Gaps = 55/229 (24%)
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGIN-TLDILSMMVGLQNY 531
L + L ALV+VN + V G D + + IE+N N LDIL +G NY
Sbjct: 483 MLKVAGLADYALVYVNGQKV----GELDRVSDV--DSIEINVPFNGVLDILVENMGRINY 536
Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGS 590
GA + G+ ++ID + +G W +Y++ + E ++ + AN
Sbjct: 537 GARITQSIKGINGPVVID-----GNEITGNWQMYKLPMN-EVPDVNALPTAN-------- 582
Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
NK L + + G LN+ + GKG +VNG ++GRYW
Sbjct: 583 ----NKGLPTLYSGTFNLDTTGDTFLNMETWGKGIVFVNGINLGRYWK------------ 626
Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPS 699
RG P QTLY +P ++ GEN +V+ E+ P
Sbjct: 627 ---RG-------------PQQTLY-LPGCFLKKGENKIVVFEQQNDTPQ 658
>gi|156376589|ref|XP_001630442.1| predicted protein [Nematostella vectensis]
gi|156217463|gb|EDO38379.1| predicted protein [Nematostella vectensis]
Length = 570
Score = 164 bits (414), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 105/302 (34%), Positives = 145/302 (48%), Gaps = 25/302 (8%)
Query: 33 PEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLR 92
PE W + + K K GL +ETYV WN HE ++ + F+ D+V+FVK Q GL++ +R
Sbjct: 2 PEYWKDRLVKLKAMGLNTVETYVAWNLHEQVQDNFKFKDELDIVKFVKLAQRLGLYVIIR 61
Query: 93 IGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGP 152
GPY CAEW+ GG P WL P ++ RT+ PF E + R+ K+ L+ L QGGP
Sbjct: 62 PGPYICAEWDLGGLPSWLLSDPEMKLRTSYGPFMEAVDRYFQKLFPLLTP--LQYCQGGP 119
Query: 153 IILAQVENEYGNVEWAYGVG-GELYVKWAADTAVNLNTSVPWVMCQQEDAP-DPIINTCN 210
II Q+ENEY + + + EL K V + + + P + ++ T N
Sbjct: 120 IIAWQIENEYSSFDKKVDMTYMELLQKMMVKNGVTEMLLMSDNLFSMKTHPINLVLKTIN 179
Query: 211 -----GFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTF 265
P KP+M TE + GWF +G P E L + F G +
Sbjct: 180 LQKNVKDALLQLKEIQPDKPLMVTEFWPGWFDVWGAKHHILPTEKLIKEIKDLFSLGASI 239
Query: 266 QNYYMYFGGTNFGRTAGGPLVA--------------TSYDYDAPIDEYGFIRQPKWGHLR 311
N+YM+ GGTNFG G TSYDYDAP+ E G I PK+ LR
Sbjct: 240 -NFYMFHGGTNFGFMNGASFTPSGVSVLEGDYQPDITSYDYDAPLSESGDI-TPKYKALR 297
Query: 312 EL 313
+
Sbjct: 298 KF 299
>gi|337749468|ref|YP_004643630.1| beta-galactosidase [Paenibacillus mucilaginosus KNP414]
gi|336300657|gb|AEI43760.1| Beta-galactosidase [Paenibacillus mucilaginosus KNP414]
Length = 591
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 108/305 (35%), Positives = 150/305 (49%), Gaps = 28/305 (9%)
Query: 15 DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
DG+ L SG+IHY R PE W + +RK K G +ETYV WN HEP G++ FEG D
Sbjct: 14 DGEEIRLYSGAIHYFRIVPEYWEDRLRKLKACGFNTVETYVPWNLHEPQEGRFVFEGMAD 73
Query: 75 LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
L RF++ GL + +R PY CAEW +GG P WL PG++ R + + ++ +
Sbjct: 74 LERFIRLAGRLGLHVIVRPSPYICAEWEFGGLPAWLLAEPGMKLRCADPLYLSKVDAYYD 133
Query: 135 KIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWV 194
++I + L + GGP+IL QVENEYG +YG + Y++ D V VP
Sbjct: 134 ELIPRLVP--LLCTSGGPVILVQVENEYG----SYG-SDKAYLEHLRDGLVRRGIDVPLF 186
Query: 195 --------MCQQEDAPDPIINTCN--GFYCDGFTP---NSPSKPIMWTENYSGWFLSFGY 241
M Q P ++ T N + F P P+M E ++GWF +
Sbjct: 187 TSDGPTDSMLQGGSLPG-VLATVNFGSRTAESFAKLREYQPQGPLMCMEYWNGWFDHWME 245
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------TSYDYDAP 295
R D A E G + N+YM+ GGTNFG G + TSYDYD+P
Sbjct: 246 EHHQRDAADAARVFGEMLEAGASV-NFYMFHGGTNFGFYNGANHIKTYEPTITSYDYDSP 304
Query: 296 IDEYG 300
+ E+G
Sbjct: 305 LTEWG 309
>gi|156375241|ref|XP_001629990.1| predicted protein [Nematostella vectensis]
gi|156217002|gb|EDO37927.1| predicted protein [Nematostella vectensis]
Length = 578
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 107/297 (36%), Positives = 152/297 (51%), Gaps = 20/297 (6%)
Query: 33 PEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLR 92
PE W + ++K K GL +ETYV WN HE ++ + F+ D+V+FV QE GL + +R
Sbjct: 2 PEYWADRLKKLKAMGLNTVETYVAWNLHEQVKENFKFKDEVDIVKFVNLAQELGLHVIIR 61
Query: 93 IGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGP 152
GPY C+EW+ GG P WL P ++ R+T PF E ++++ +K+ L+ L S+GGP
Sbjct: 62 PGPYICSEWDLGGLPSWLLNDPNMRLRSTYGPFMEAVEKYFSKLFALLTP--LQFSRGGP 119
Query: 153 IILAQVENEYGNVEWAYG-----VGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIIN 207
II QVENEY +V+ + +L +K A + + V +
Sbjct: 120 IIAWQVENEYASVQEEVDNHYMELLHKLMLKNGATELLFTSDDVGYTKRYPIKLDGGKYM 179
Query: 208 TCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQN 267
+ N ++C F P KPIM TE +SGWF +G E + G N
Sbjct: 180 SFNKWFC-LFLHFQPDKPIMVTEYWSGWFDHWGEKHHVLNTERKMINEVKDILDMGASIN 238
Query: 268 YYMYFGGTNFG-----RTAGGPL------VATSYDYDAPIDEYGFIRQPKWGHLREL 313
+YM+ GGTNFG TAG + TSYDYDAP+ E G I PK+ LR+L
Sbjct: 239 FYMFHGGTNFGFMNGANTAGNRIDDGYQPDVTSYDYDAPLSEAGDI-TPKYKALRKL 294
>gi|157106611|ref|XP_001649403.1| beta-galactosidase [Aedes aegypti]
gi|108879822|gb|EAT44047.1| AAEL004580-PA [Aedes aegypti]
Length = 656
Score = 163 bits (413), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 114/328 (34%), Positives = 162/328 (49%), Gaps = 39/328 (11%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
S + YD V+DGK +GS HY R+ P+ W ++ + GGL ++ YV W+ H
Sbjct: 42 SFTIDYDRDTFVMDGKDFRYVAGSFHYFRALPQTWRTKLKTLRAGGLNAVDLYVQWSLHN 101
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHF-IPGIQFRT 120
P QY ++G ++ ++ EA L++ LR GPY CAE + GG P WL PGIQ RT
Sbjct: 102 PKENQYVWDGIANIKDVIEAAIEADLYVILRPGPYICAEIDNGGLPYWLFTKYPGIQVRT 161
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFA-SQGGPIILAQVENEYGNVEWAYGVGGELYV-- 177
++ + +E+ + K LM Q + GGPII+ Q+ENEYG A+G + Y+
Sbjct: 162 SDANYLKEVATWYEK---LMSQLTPYMYGNGGPIIMVQLENEYG----AFGKCDKPYLNF 214
Query: 178 ------KWAADTAVNLNTSVPW---VMCQQEDAPDPIINTCNGFYCD--------GFTPN 220
K+ AV P+ + C Q P + T G D
Sbjct: 215 LKEETEKYTQGKAVLFTVDRPYGNEMECGQ--VPGVFVTTDFGLMTDEEVDTHKAKLRSV 272
Query: 221 SPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRT 280
P+ P++ TE Y+GW + + RP E LA + + G ++YMYFGGTNFG
Sbjct: 273 QPNGPLVNTEFYTGWLTHWQESNQRRPAEPLANTLRKMLHDGWNV-DFYMYFGGTNFGFW 331
Query: 281 AG------GPLVA--TSYDYDAPIDEYG 300
AG G +A TSYDYDAP+DE G
Sbjct: 332 AGANDWGLGKYMADITSYDYDAPMDEAG 359
>gi|325297293|ref|YP_004257210.1| glycoside hydrolase family protein [Bacteroides salanitronis DSM
18170]
gi|324316846|gb|ADY34737.1| glycoside hydrolase family 35 [Bacteroides salanitronis DSM 18170]
Length = 784
Score = 163 bits (412), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 126/374 (33%), Positives = 178/374 (47%), Gaps = 34/374 (9%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
+++G+ V+++ +HYPR W I++ K G+ I YVFWN+HE G++ F
Sbjct: 39 TFLLNGEPFVVKAAELHYPRIPRAYWEHRIKQCKALGMNTICLYVFWNFHEEKPGEFDFT 98
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ DL F + Q+ +++ LR GPY CAEW GG P WL I+ R + F E +
Sbjct: 99 GQKDLAEFCRLCQKNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLREDDPYFLERVA 158
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
F ++ + + L +GGPII+ QVENEYG +YG E YV D
Sbjct: 159 IFEKEVANQVA--GLTIQKGGPIIMVQVENEYG----SYGESKE-YVAKIRDIVRGNFGD 211
Query: 191 VPWVMCQ-----QEDAPDPIINTCN---GFYCD-GFTP---NSPSKPIMWTENYSGWFLS 238
V C Q +A D ++ T N G D F P P P+M +E +SGWF
Sbjct: 212 VTLFQCDWASNFQLNALDDLVWTMNFGTGANIDEQFAPLKKVRPDSPLMCSEFWSGWFDK 271
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA---TSYDYD 293
+G R +D+ + G +F + YM GGTN+G AG P A TSYDYD
Sbjct: 272 WGANHETRAADDMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYD 330
Query: 294 APIDEYGFIRQPKWGHLREL-------HKAIKLCEEYLISSDPTHQKLG-AKLEAHIYHK 345
API E G I PK+ LRE K K+ ++ S P + A L A++
Sbjct: 331 APISESGKI-TPKYEKLRETLAKYMDGKKQAKVPDDIPTISVPAFEFTEVAPLFANLPEP 389
Query: 346 SSNDCAAFLANYDS 359
S+D + YD
Sbjct: 390 KSDDTIRTMEEYDQ 403
Score = 42.4 bits (98), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 63/235 (26%), Positives = 87/235 (37%), Gaps = 47/235 (20%)
Query: 464 VMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILS 523
+P + L + A +F++ K + G D N I LDIL
Sbjct: 413 TLPKIDRSATLTVTEAHDYAQIFIDGKYI----GKLDRRNGEKQLDIPACAEGAQLDILV 468
Query: 524 MMVGLQNYG-AWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGE-YIGLDKISL 580
+G N+G A D G ++LKNG R W +Y + E Y GL K
Sbjct: 469 EAMGRINFGRAIKDFKGI----TEKVELKNGGRTTELKGWKVYNLEDRYEGYKGL-KFEP 523
Query: 581 ANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAY 640
S QG +P Y+ TF E G LN + GKG +VNG IGR W
Sbjct: 524 LKSVKDAQGQRVPGC-----YRATFHV-EKPGDTFLNFETWGKGLVYVNGYGIGRIWEI- 576
Query: 641 LAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
P QTLY +P W+ GEN +++ + +G
Sbjct: 577 ---------------------------GPQQTLY-MPGCWLKEGENEILVFDIVG 603
>gi|170034404|ref|XP_001845064.1| beta-galactosidase [Culex quinquefasciatus]
gi|167875697|gb|EDS39080.1| beta-galactosidase [Culex quinquefasciatus]
Length = 650
Score = 162 bits (410), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 114/325 (35%), Positives = 160/325 (49%), Gaps = 39/325 (12%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+ YD V+DGK SGS HY R+ P+ W +R + GGL ++ YV W+ H P
Sbjct: 37 IDYDRDTFVMDGKDFRYVSGSFHYFRALPQTWRSKLRTMRAGGLNAVDLYVQWSLHNPKD 96
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWL-HFIPGIQFRTTNN 123
QY ++G ++ ++ E L++ LR GPY CAE + GG P WL + PGIQ R ++
Sbjct: 97 NQYVWDGIANITDVIEAAIEEDLYVILRPGPYICAEIDNGGLPYWLFNKYPGIQVRISDA 156
Query: 124 PFKEEMKRFLAKIIDLMKQENLFA-SQGGPIILAQVENEYGNVEWAYGVGGELYV----- 177
+ +E+K + K LM Q + GGPII+ Q+ENEYG A+G + Y+
Sbjct: 157 NYIKEVKIWYEK---LMSQLTPYMYGNGGPIIMVQLENEYG----AFGKCDKQYLNVLKE 209
Query: 178 ---KWAADTAVNLNTSVPW---VMCQQEDAPDPIINTCNGFYCDGFTPN--------SPS 223
K+ AV P+ ++C Q P I T G D P
Sbjct: 210 ETEKYTQGKAVLFTVDRPYDDELVCGQ--IPGVFITTDFGLMTDDEVDTHAAKVRSIQPK 267
Query: 224 KPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG- 282
P++ TE Y+GW + RP LA + + + G ++YMYFGGTNFG AG
Sbjct: 268 GPLVNTEFYTGWLTHWQEKNQRRPAGPLAATLRKMLKDGWNV-DFYMYFGGTNFGFWAGA 326
Query: 283 -----GPLVA--TSYDYDAPIDEYG 300
G +A TSYDYDAP+DE G
Sbjct: 327 NDWGLGKYMADITSYDYDAPMDEAG 351
>gi|15228075|ref|NP_178493.1| glycosyl hydrolase family 35 protein [Arabidopsis thaliana]
gi|20198172|gb|AAM15443.1| predicted protein [Arabidopsis thaliana]
gi|330250699|gb|AEC05793.1| glycosyl hydrolase family 35 protein [Arabidopsis thaliana]
Length = 469
Score = 162 bits (410), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 114/375 (30%), Positives = 166/375 (44%), Gaps = 89/375 (23%)
Query: 270 MYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
MY G TNF RTAGGP + T+YDYDAP+DE+G + QPK+GHL++LH E+ L +
Sbjct: 23 MYHGHTNFDRTAGGPFITTTYDYDAPLDEFGNLNQPKYGHLKQLHDVFHAMEKTLTYGNI 82
Query: 330 THQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVV 389
+ G + +Y ++ + F+ N +A + F G Y +PAW VSILPDCK
Sbjct: 83 STADFGNLVMTTVY-QTEEGSSCFIGNV----NAKINFQGTSYDVPAWYVSILPDCKTES 137
Query: 390 FNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTT 449
+NTAK + R S F N +
Sbjct: 138 YNTAKRMKLR------------------TSLRFK-----------------------NVS 156
Query: 450 KDTSDYLWYTASIHVM---PGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFAN--- 503
D SD+LWY ++++ P GK + L I S H FVN + GN+ N
Sbjct: 157 NDESDFLWYMTTVNLKEQDPAWGKNMSLRINSTAHVLHGFVNGQHT----GNYRVENGKF 212
Query: 504 -FLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEW 562
++ + + N G+N + +LS+ V L NYGA+F+ AG+ + I +NG +
Sbjct: 213 HYVFEQDAKFNPGVNVITLLSVTVDLPNYGAFFENVPAGITGPVFIIGRNGDETV----- 267
Query: 563 IYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMG 622
V + G K+ T F AP G P+ ++L G
Sbjct: 268 ---VKYLSTHNGATKL------------------------TIFKAPLGSEPVVVDLLGFG 300
Query: 623 KGQAWVNGQSIGRYW 637
KG+A +N GRYW
Sbjct: 301 KGKASINENYTGRYW 315
>gi|218260271|ref|ZP_03475643.1| hypothetical protein PRABACTJOHN_01305, partial [Parabacteroides
johnsonii DSM 18315]
gi|218224641|gb|EEC97291.1| hypothetical protein PRABACTJOHN_01305 [Parabacteroides johnsonii
DSM 18315]
Length = 539
Score = 162 bits (410), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 111/323 (34%), Positives = 158/323 (48%), Gaps = 27/323 (8%)
Query: 9 HRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYY 68
++ ++DGK V+++ IHY R E W I+ K G+ I Y FWN HE G++
Sbjct: 36 NKTFLLDGKPFVIKAAEIHYTRIPAEYWEHRIQLCKALGMNTICIYAFWNIHEQKPGEFD 95
Query: 69 FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEE 128
F G+ D+ F + Q+ +++ LR GPY C+EW GG P WL I+ RT + F E
Sbjct: 96 FSGQNDIAAFCRLAQKYDMYIMLRPGPYVCSEWEMGGLPWWLLKKDDIKLRTNDPYFLER 155
Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN 188
K F+ +I + +L ++GG II+ QVENEYG +Y E Y+ D
Sbjct: 156 TKLFMNEIGKQLA--DLQITKGGNIIMVQVENEYG----SYATDKE-YIANIRDIVKGAG 208
Query: 189 -TSVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGW 235
T VP C Q +A D ++ T N G D P+ P+M +E +SGW
Sbjct: 209 FTDVPLFQCDWSSNFQNNALDDLVWTINFGTGANIDEQFKKLKEVRPNTPLMCSEFWSGW 268
Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSY 290
F +G R E + + + G +F + YM GGT FG G + +SY
Sbjct: 269 FDHWGRKHETRDAETMVSGLKDMLDRGISF-SLYMTHGGTTFGHWGGANSPAYSAMCSSY 327
Query: 291 DYDAPIDEYGFIRQPKWGHLREL 313
DYDAPI E G+ PK+ LREL
Sbjct: 328 DYDAPISEAGWT-TPKYFKLREL 349
>gi|332030018|gb|EGI69843.1| Beta-galactosidase [Acromyrmex echinatior]
Length = 594
Score = 162 bits (410), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 113/332 (34%), Positives = 163/332 (49%), Gaps = 29/332 (8%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+V Y++ ++DGK SGS HY R+ + W + +RK + GL I TYV W+ HEP
Sbjct: 1 DVDYENNQFLLDGKPFQYVSGSFHYFRTPRQYWRDRLRKMRAAGLNAISTYVEWSLHEPE 60
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRTTN 122
GQ+ + G DLV F+ QE LF+ LR GPY CAE + GG P W L +P I RT +
Sbjct: 61 PGQFNWTGDADLVNFLNIAQEEDLFVLLRPGPYICAERDMGGLPYWLLREVPNINLRTKD 120
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYG-----NVEWAYGVGGELYV 177
F +L +I L K L GGPII+ Q+ENEYG ++E+ + E++V
Sbjct: 121 ADFVRYATLYLNEI--LSKIRPLLRGNGGPIIMVQIENEYGSYYACDIEYM-DMLKEVFV 177
Query: 178 KWAADTAVNLNT---SVPWVMCQQEDAPDPII------NTCNGFYCDGFTPNSPSKPIMW 228
K + A+ T + + C + N N F P P++
Sbjct: 178 KKVGNKALLYTTDGAAASLLRCGFISGAYATVDFGTASNVTNSFLSMRLY--QPRGPLVN 235
Query: 229 TENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA- 287
+E Y GW +G E + ++ G + N+YM++GGTNFG T+G A
Sbjct: 236 SEFYPGWLTHWGEPFQRTKTEAIVKSLEEMLALGASV-NFYMFYGGTNFGFTSGANGGAG 294
Query: 288 ------TSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDAP+ E G PK+ +R++
Sbjct: 295 VYNPQLTSYDYDAPLTEAG-DPTPKYFAIRDV 325
>gi|313231409|emb|CBY08524.1| unnamed protein product [Oikopleura dioica]
Length = 493
Score = 162 bits (409), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 107/321 (33%), Positives = 152/321 (47%), Gaps = 30/321 (9%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
A +DG++ L SGSIHY R E W + + K K GL +E YV WN HEP G++ F
Sbjct: 62 AFWLDGEKITLVSGSIHYFRVPNEYWLDRLTKLKYAGLNTVELYVSWNLHEPYSGEFNFS 121
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G D+VRF++ E GL + R GPY CAEW +GG P WL ++ RTT + E ++
Sbjct: 122 GDLDVVRFIEMAGELGLHVLFRPGPYICAEWEWGGHPYWLLHDTDMKVRTTYPGYLEAVE 181
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVG--GELYVKWAADTAVN-- 186
+F +++ + +L GGPII Q+ENEY A+ +G ++ W T +
Sbjct: 182 KFYSELFG--RVNHLMYRNGGPIIAVQIENEYAGFADAFEIGPLDPGFLTWLRQTIKDQQ 239
Query: 187 -----LNTSVPWVMCQQEDAPDPI-INTCN----GFYCDGFTPNSPSKPIMWTENYSGWF 236
+ W + E DP +N + ++ + N P KP M E +SGWF
Sbjct: 240 CEELLFTSDGGWDFYKYELEGDPYGLNFDDVLRANYWLNILENNQPGKPKMVMEWWSGWF 299
Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL----------- 285
+GY + + + NYYM+ GGTNFG G
Sbjct: 300 DFWGYHHQGTTADSFEENLRAILSQNASV-NYYMFHGGTNFGYMNGANFNTNDQTNDLEY 358
Query: 286 --VATSYDYDAPIDEYGFIRQ 304
V TSYDYD P+ E G I +
Sbjct: 359 QPVVTSYDYDCPLSEEGRITK 379
>gi|336319932|ref|YP_004599900.1| Beta-galactosidase [[Cellvibrio] gilvus ATCC 13127]
gi|336103513|gb|AEI11332.1| Beta-galactosidase [[Cellvibrio] gilvus ATCC 13127]
Length = 586
Score = 162 bits (409), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 104/307 (33%), Positives = 149/307 (48%), Gaps = 26/307 (8%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DG+ + SG++HY R P++W + IRK++ GL IETYV WN H P RG + G
Sbjct: 11 FLLDGEPLQILSGALHYFRVHPDLWADRIRKARLMGLNTIETYVAWNAHAPERGVFDLTG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+ V GL +R GPY CAEW+ GG P WL PG+ RT + E +
Sbjct: 71 NLDLGRFLDLVAAEGLHAIVRPGPYICAEWDNGGLPAWLMATPGVGVRTAEPQYLEAIAG 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ +I+ ++ + ++GGP+++ QVENEYG AYG + Y++ V
Sbjct: 131 YYDEILAVVAPRQV--TRGGPVLMVQVENEYG----AYGDDAD-YLRALVTMMRERGIEV 183
Query: 192 PWVMCQQEDAPD------PIINTCNGF------YCDGFTPNSPSKPIMWTENYSGWFLSF 239
P C Q + P ++ F + + P+ P+M E + GWF S+
Sbjct: 184 PLTTCDQANDEMLGRGGLPELHKTATFGSRSPERLETLRRHQPTGPLMCMEYWDGWFDSW 243
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGP------LVATSYDYD 293
G A A + G N YM+ GGTN G T G + TSYDYD
Sbjct: 244 GEQH-HTTDAAEAAADLDLLLSQGASANLYMFHGGTNLGFTNGANDKGTYLPITTSYDYD 302
Query: 294 APIDEYG 300
AP+ E G
Sbjct: 303 APLAEDG 309
>gi|365118603|ref|ZP_09337115.1| hypothetical protein HMPREF1033_00461 [Tannerella sp.
6_1_58FAA_CT1]
gi|363649320|gb|EHL88436.1| hypothetical protein HMPREF1033_00461 [Tannerella sp.
6_1_58FAA_CT1]
Length = 823
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 112/322 (34%), Positives = 151/322 (46%), Gaps = 25/322 (7%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
+++GK ++++ +HYPR W + I+ K G+ I YVFWN HEP G++ F
Sbjct: 74 TFLLNGKPFIIRAAELHYPRIPKPYWEQRIKLCKALGMNTICLYVFWNLHEPRPGEFDFT 133
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ DL F + Q+ +++ LR GPY CAEW GG P WL I+ R + F E +
Sbjct: 134 GQNDLAAFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLREADPYFIERVN 193
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
F ++ + L GGPII+ QVENEYG +YG E YV D
Sbjct: 194 IFEQEVARQVG--GLTIQNGGPIIMVQVENEYG----SYGESKE-YVSLIRDIVRTNFGD 246
Query: 191 VPWVMCQ------QEDAPDPI--INTCNGFYCD----GFTPNSPSKPIMWTENYSGWFLS 238
V C + PD + IN G D G P P+M +E +SGWF
Sbjct: 247 VTLFQCDWASNFTKNALPDLLWTINFGTGANIDQQFAGLKKLRPDSPLMCSEFWSGWFDK 306
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA---TSYDYD 293
+G RP D+ + G +F + YM GGTN+G AG P A TSYDYD
Sbjct: 307 WGANHETRPASDMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYD 365
Query: 294 APIDEYGFIRQPKWGHLRELHK 315
API E G W + L K
Sbjct: 366 APISESGQTTPKYWALRKTLGK 387
>gi|423342145|ref|ZP_17319860.1| hypothetical protein HMPREF1077_01290 [Parabacteroides johnsonii
CL02T12C29]
gi|409219016|gb|EKN11981.1| hypothetical protein HMPREF1077_01290 [Parabacteroides johnsonii
CL02T12C29]
Length = 779
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 111/323 (34%), Positives = 158/323 (48%), Gaps = 27/323 (8%)
Query: 9 HRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYY 68
++ ++DGK V+++ IHY R E W I+ K G+ I Y FWN HE G++
Sbjct: 36 NKTFLLDGKPFVIKAAEIHYTRIPAEYWEHRIQLCKALGMNTICIYAFWNIHEQKPGEFD 95
Query: 69 FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEE 128
F G+ D+ F + Q+ +++ LR GPY C+EW GG P WL I+ RT + F E
Sbjct: 96 FSGQNDIAAFCRLAQKYDMYIMLRPGPYVCSEWEMGGLPWWLLKKDDIKLRTNDPYFLER 155
Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN 188
K F+ +I + +L ++GG II+ QVENEYG +Y E Y+ D
Sbjct: 156 TKLFMNEIGKQLA--DLQITKGGNIIMVQVENEYG----SYATDKE-YIANIRDIVKGAG 208
Query: 189 -TSVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGW 235
T VP C Q +A D ++ T N G D P+ P+M +E +SGW
Sbjct: 209 FTDVPLFQCDWSSNFQNNALDDLVWTINFGTGANIDEQFKKLKEVRPNTPLMCSEFWSGW 268
Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSY 290
F +G R E + + + G +F + YM GGT FG G + +SY
Sbjct: 269 FDHWGRKHETRDAETMVSGLKDMLDRGISF-SLYMTHGGTTFGHWGGANSPAYSAMCSSY 327
Query: 291 DYDAPIDEYGFIRQPKWGHLREL 313
DYDAPI E G+ PK+ LREL
Sbjct: 328 DYDAPISEAGWT-TPKYFKLREL 349
Score = 47.0 bits (110), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 49/107 (45%), Gaps = 30/107 (28%)
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
+Y+ TF E G + L++ + GKG WVNG++IGR+W
Sbjct: 533 YYRATFNLEEA-GDVFLDMQTWGKGMVWVNGKAIGRFWEI-------------------- 571
Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
P QTL+ +P W+ GEN +++ + LG + + I L K
Sbjct: 572 --------GPQQTLF-MPGCWLKKGENEIIVLDLLGPEKATIKGLDK 609
>gi|327282153|ref|XP_003225808.1| PREDICTED: beta-galactosidase-like [Anolis carolinensis]
Length = 649
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 115/335 (34%), Positives = 164/335 (48%), Gaps = 23/335 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+ Y H + DG+ SGSIHY R W + + K K GL+ I+TYV WN+HEP R
Sbjct: 32 IDYGHNCFLKDGQPFRYISGSIHYSRIPRYYWKDRLLKMKMAGLDAIQTYVPWNFHEPER 91
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G Y F G DL F++ QE GL + LR GPY CAEW+ GG P WL I R+++
Sbjct: 92 GVYNFTGDRDLEYFLQLAQEVGLLVILRAGPYICAEWDMGGLPAWLLEKESIVLRSSDPD 151
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAY-GVGGELYVKWA 180
+ + ++ + MK GGPII+ QVENEYG+ ++ Y L+ ++
Sbjct: 152 YLTAVGSWMGIFLPKMKPH--LYQNGGPIIMVQVENEYGSYFACDFDYLRYLQNLFRQYL 209
Query: 181 ADTAVNLNT---SVPWVMCQQEDAP------DPIINTCNGFYCDGFTPNSPSKPIMWTEN 231
D V T S+ ++ C P N F T P P++ +E
Sbjct: 210 GDEVVLFTTDGASMFYLRCGALQGLYSTVDFGPGRNVTAAFSTQRHT--EPKGPLVNSEF 267
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA-- 287
Y+GW +G+ P +A +++ +G N YM+ GGTNFG G P +A
Sbjct: 268 YTGWLDHWGHRHITVPASIVAKSLSEILASGANV-NMYMFIGGTNFGYWNGANMPYMAQP 326
Query: 288 TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEE 322
TSYDYDAP+ E G + + K+ +RE+ K E
Sbjct: 327 TSYDYDAPLSEAGDLTE-KYFAIREVIGMFKKLPE 360
>gi|329927236|ref|ZP_08281534.1| beta-galactosidase [Paenibacillus sp. HGF5]
gi|328938636|gb|EGG35019.1| beta-galactosidase [Paenibacillus sp. HGF5]
Length = 587
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 104/296 (35%), Positives = 144/296 (48%), Gaps = 26/296 (8%)
Query: 23 SGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTV 82
SG+IHY R PE W + + K + GL +ETY+ WN HEP GQ+ F+G DL RFV+
Sbjct: 23 SGAIHYFRVVPEYWEDRLMKLRSCGLNTVETYIPWNLHEPKEGQFVFDGIADLERFVRIA 82
Query: 83 QEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQ 142
+ GL + LR PY CAEW +GG P WL P IQ R + + E++ ++ ++I +
Sbjct: 83 GDLGLHVILRPSPYICAEWEFGGLPSWLLQNPDIQLRCMDPVYLEKVDQYYDELIPRLVP 142
Query: 143 ENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV-------PWVM 195
L S+GGP+I Q+ENEYG +YG Y+++ D + V P
Sbjct: 143 --LLTSKGGPVIAMQIENEYG----SYG-NDTAYLEYLKDGLIKRGVDVLLFTSDGPTDG 195
Query: 196 CQQEDAPDPIINTCN-----GFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVED 250
Q A ++ T N D P P+M E ++GWF + R ED
Sbjct: 196 MLQGGAVPGVLATVNFGSRTKEAFDKLREYRPEDPLMCMEYWNGWFDHWLKPHHTRDAED 255
Query: 251 LAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSYDYDAPIDEYG 300
A + + N+YM+ GGTNFG G TSYDYDAP+ E G
Sbjct: 256 AAAVFKEMLDLNASV-NFYMFHGGTNFGFYNGANFHEKYEPTLTSYDYDAPLSECG 310
>gi|257413247|ref|ZP_04742461.2| beta-galactosidase [Roseburia intestinalis L1-82]
gi|257204151|gb|EEV02436.1| beta-galactosidase [Roseburia intestinalis L1-82]
Length = 588
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 118/351 (33%), Positives = 166/351 (47%), Gaps = 47/351 (13%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK + SG+IHY R PE W + + K K G +ETY+ WN HEP +G+++FEG
Sbjct: 19 LDGKPFQIISGAIHYFRIVPEYWQDRLEKLKAMGCNTVETYIPWNMHEPKKGEFHFEGML 78
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRF- 132
D+ RFVKT QE GL++ LR PY CAEW +GG P WL G++ R + PF + ++ +
Sbjct: 79 DIERFVKTAQELGLYVILRPSPYICAEWEFGGLPAWLLAEDGMKLRVSYPPFLKHVQDYY 138
Query: 133 ---LAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
L KI+ + GGP+IL QVENEYG Y Y+ D
Sbjct: 139 DVLLKKIVPYQ------INYGGPVILMQVENEYG-----YYANDREYLLAMRDKMQKGGV 187
Query: 190 SVPWVMCQQEDAPDPIINTCNGFYCDGFTP--NSPSK---------------PIMWTENY 232
VP V + P NG + +G P N SK P+M TE +
Sbjct: 188 VVPLVT-----SDGPFEENLNGGHLEGALPTGNFGSKTEERFEVLKKYTDGGPLMCTEFW 242
Query: 233 SGWFLSFGYAVPFR-PVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV----- 286
GWF +G +E+ + + E G N YM+ GGTNFG G
Sbjct: 243 VGWFDHWGNGGHMTGNLEESVKDLDKMLELGHV--NIYMFEGGTNFGFMNGSNYYDELTP 300
Query: 287 -ATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGA 336
TSYDYDA + E G I + K+ R++ + E +++ + G+
Sbjct: 301 DVTSYDYDALLTEDGQITE-KYRRYRDVIAKYREIPEVTFTTEIKRKAYGS 350
>gi|157106609|ref|XP_001649402.1| beta-galactosidase [Aedes aegypti]
gi|108879821|gb|EAT44046.1| AAEL004575-PA [Aedes aegypti]
Length = 648
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 112/336 (33%), Positives = 171/336 (50%), Gaps = 37/336 (11%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+ Y++ ++DG +GS HY R+ P+ W +++ + GL + TYV W+ H P +
Sbjct: 36 IDYENNTFLLDGAPFQYIAGSFHYFRALPQAWGPILKSMRAAGLNAVTTYVEWSLHNPKK 95
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRTTNN 123
G Y ++G D+ RFV+ Q L + LR GPY CAE + GGFP W L+ PGIQ RT +
Sbjct: 96 GVYNWDGMADIERFVQLAQNEDLLVILRPGPYICAERDMGGFPYWLLNKYPGIQLRTADV 155
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD- 182
+ E++ + A++ + E F GGPII+ QVENEYG ++ Y+KW D
Sbjct: 156 AYLREVRTWYAELFS--RLEPYFYGNGGPIIMVQVENEYG----SFFACDYKYMKWLRDE 209
Query: 183 -------TAVNLNTSVPWVMCQQEDAPDPIINTCN-----GFYCDGFTPN----SPSKPI 226
AV + P + Q D +++T + DG+ + P P+
Sbjct: 210 TERYVRGKAVLFTNNGPGL--TQCGGIDGVLSTLDFGPGTALEIDGYWKDLRKLQPKGPL 267
Query: 227 MWTENYSGWFLSFGYAVPFR-PVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG--- 282
+ E Y GW + R P+E + ++ R+ + N YM++GGTNFG TAG
Sbjct: 268 VNAEYYPGWLTHWQEQQMARSPIEPVVTSL-RYMLSSKVNVNIYMFYGGTNFGFTAGANE 326
Query: 283 ---GPLVA--TSYDYDAPIDEYGFIRQPKWGHLREL 313
G + TSYDYDAP+DE G PK+ +R++
Sbjct: 327 QGPGRFIPDITSYDYDAPLDESG-DPTPKYEAIRKV 361
>gi|291535092|emb|CBL08204.1| Beta-galactosidase [Roseburia intestinalis M50/1]
gi|291539606|emb|CBL12717.1| Beta-galactosidase [Roseburia intestinalis XB6B4]
Length = 581
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 118/351 (33%), Positives = 166/351 (47%), Gaps = 47/351 (13%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK + SG+IHY R PE W + + K K G +ETY+ WN HEP +G+++FEG
Sbjct: 12 LDGKPFQIISGAIHYFRIVPEYWQDRLEKLKAMGCNTVETYIPWNMHEPKKGEFHFEGML 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRF- 132
D+ RFVKT QE GL++ LR PY CAEW +GG P WL G++ R + PF + ++ +
Sbjct: 72 DIERFVKTAQELGLYVILRPSPYICAEWEFGGLPAWLLAEDGMKLRVSYPPFLKHVQDYY 131
Query: 133 ---LAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
L KI+ + GGP+IL QVENEYG Y Y+ D
Sbjct: 132 DVLLKKIVPYQ------INYGGPVILMQVENEYG-----YYANDREYLLAMRDKMQKGGV 180
Query: 190 SVPWVMCQQEDAPDPIINTCNGFYCDGFTP--NSPSK---------------PIMWTENY 232
VP V + P NG + +G P N SK P+M TE +
Sbjct: 181 VVPLVT-----SDGPFEENLNGGHLEGALPTGNFGSKTEERFEVLKKYTDGGPLMCTEFW 235
Query: 233 SGWFLSFGYAVPFR-PVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV----- 286
GWF +G +E+ + + E G N YM+ GGTNFG G
Sbjct: 236 VGWFDHWGNGGHMTGNLEESVKDLDKMLELGHV--NIYMFEGGTNFGFMNGSNYYDELTP 293
Query: 287 -ATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGA 336
TSYDYDA + E G I + K+ R++ + E +++ + G+
Sbjct: 294 DVTSYDYDALLTEDGQITE-KYRRYRDVIAKYREIPEVTFTTEIKRKAYGS 343
>gi|345003968|ref|YP_004806822.1| glycoside hydrolase family protein [Streptomyces sp. SirexAA-E]
gi|344319594|gb|AEN14282.1| glycoside hydrolase family 35 [Streptomyces sp. SirexAA-E]
Length = 602
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 113/347 (32%), Positives = 168/347 (48%), Gaps = 29/347 (8%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A +TY L+ G+ + +G++HY R P+ W + + + GL ++TY+ WN+HE
Sbjct: 7 ALLTYSEGTLLRAGRPHQVLAGTLHYFRVHPDQWHDRLERLAAMGLNTVDTYIAWNFHER 66
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G++ F+G D+ RFV+T Q GL + +R GPY CAEW+ GG P WL PG++ R++
Sbjct: 67 RTGEHRFDGWRDIERFVRTAQRTGLDVIVRPGPYICAEWDNGGLPAWLTDRPGMRPRSSY 126
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
P+ +E+ R+ +I + +L A++GGP++ QVENEYG+ YG Y++W D
Sbjct: 127 APYLDEVARWFDVLIPRIA--DLQAARGGPVVAVQVENEYGS----YG-DDHAYMRWVHD 179
Query: 183 TAVNLNTS--------VPWVMCQQEDAPDPIINTCNGFYCDG----FTPNSPSKPIMWTE 230
+ +M P + G D +P + E
Sbjct: 180 ALAGRGVTELLYTADGPTELMLDGGSLPGVLATATLGSRADQAAQLLRTRRSGEPFLCAE 239
Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG-----GPL 285
++GWF +G R V A A+ GG+ + Y GGTNFG AG G L
Sbjct: 240 FWNGWFDHWGEKHHTRSVGSAAAALDEILAKGGSV-SLYPAHGGTNFGLWAGANHADGAL 298
Query: 286 --VATSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIKLCEEYLISSDP 329
TSYD DAPI E+G PK+ R+ L A E L S P
Sbjct: 299 QPTVTSYDSDAPIAEHG-APTPKFHAFRDRLLAATGAAERELPRSRP 344
>gi|387791561|ref|YP_006256626.1| beta-galactosidase [Solitalea canadensis DSM 3403]
gi|379654394|gb|AFD07450.1| beta-galactosidase [Solitalea canadensis DSM 3403]
Length = 619
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 115/339 (33%), Positives = 165/339 (48%), Gaps = 42/339 (12%)
Query: 8 DHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQY 67
++ A V DGK + SG +H+ R E W ++ K GL + TYVFWNYHE G +
Sbjct: 29 ENGAFVYDGKPVQIHSGEMHFARVPQEYWRHRLKMMKAMGLNSVATYVFWNYHETAPGVW 88
Query: 68 YFE-GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFK 126
F+ G ++ F+K E GL + LR GPYACAEW YGG+P +L + G++ R N F
Sbjct: 89 DFKTGNKNISEFIKIAGEEGLMVILRPGPYACAEWEYGGYPWFLQNVEGLEVRRNNPKFL 148
Query: 127 EEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV-------------EWAYGVGG 173
K ++ + +K + + ++GGPII+ Q ENE+G+ ++ +
Sbjct: 149 AACKEYIDHLAKEVKNQQI--TKGGPIIMVQAENEFGSYVAQRKDIPLAEHKAYSSAIKA 206
Query: 174 ELYVKWAADTAVNLNTSV-PWVMCQQEDAPDPIINTCNG--------FYCDGFTPNSPSK 224
+L AA V L TS W+ + + + + T NG D + N
Sbjct: 207 QLL---AAGFDVPLFTSDGSWLF--EGGSIENCLPTANGEDNIENLKKVVDQY--NGGKG 259
Query: 225 PIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGP 284
P M E Y GW + P P ED+ ++ + +F NYYM GGTNFG T+G
Sbjct: 260 PYMVAEFYPGWLDHWAEPFPKVPTEDVVKQTEKYLQNNVSF-NYYMVHGGTNFGYTSGAN 318
Query: 285 LVA--------TSYDYDAPIDEYGFIRQPKWGHLRELHK 315
TSYDYDAPI E G+ PK+ +REL K
Sbjct: 319 YDKNHDIQPDMTSYDYDAPISEAGW-ATPKYIAIRELMK 356
Score = 46.2 bits (108), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 68/240 (28%), Positives = 97/240 (40%), Gaps = 59/240 (24%)
Query: 455 YLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNE 514
Y+ Y+ + P GK L + L ALV+VN + VA + + N E++
Sbjct: 413 YVLYSRKFN-QPISGK---LELNGLRDYALVYVNGEKVA------ELNRYYKNYSCEIDV 462
Query: 515 GIN-TLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEY 572
N TLDI +G NYGA G+ S ++I NG SG W +Y+
Sbjct: 463 PFNATLDIFVENMGRINYGAKITENNKGIISPVVI---NGTE--ISGNWKMYK------- 510
Query: 573 IGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQS 632
+ L+K S K+ + PV K TF E G L++ + GKG +VNG
Sbjct: 511 MPLEKQEEVASIKAKEVKSQPV-----VLKGTFNLTE-TGDTFLDMEAWGKGIVFVNGYH 564
Query: 633 IGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHE 692
+GRYW+ P QTLY +P W+ G N + I E
Sbjct: 565 LGRYWNV----------------------------GPQQTLY-LPGCWLKKGANEITIVE 595
>gi|383128326|gb|AFG44819.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
gi|383128328|gb|AFG44820.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
gi|383128336|gb|AFG44824.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
gi|383128338|gb|AFG44825.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
Length = 157
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 76/153 (49%), Positives = 104/153 (67%), Gaps = 5/153 (3%)
Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
VNG+SIGRYW +Y+A GCT CDYRG+Y +SKC +CGQP+Q LYH+PR+W+ P N+
Sbjct: 1 VNGKSIGRYWPSYIASQGGCTDSCDYRGAYSSSKCLTNCGQPSQKLYHVPRSWIQPTGNV 60
Query: 688 LVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPN----LGVVSSSPQVRLAC 743
LV+ EELGGDP++IS + ++ +C+ VSE PPV SWK + L V +++L C
Sbjct: 61 LVLFEELGGDPTQISFMARSVGTVCARVSETHLPPVGSWKSSATSGLKVNKPKAELQLHC 120
Query: 744 ERGWH-IAAINFASYGIPEGNCGSFRPGACHMD 775
H I +I FAS+G P G+CGSF G C+ +
Sbjct: 121 PSSGHLIKSIKFASFGTPTGHCGSFTYGHCNTN 153
>gi|427385726|ref|ZP_18882033.1| hypothetical protein HMPREF9447_03066 [Bacteroides oleiciplenus YIT
12058]
gi|425726765|gb|EKU89628.1| hypothetical protein HMPREF9447_03066 [Bacteroides oleiciplenus YIT
12058]
Length = 1106
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 112/326 (34%), Positives = 157/326 (48%), Gaps = 39/326 (11%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
+ +++GK V+++ +HYPR W + I+ K G+ + YVFWN HEP G Y F
Sbjct: 356 SFLLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGTYDFT 415
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
+ DL F + Q+ +++ LR GPY CAEW GG P WL I+ R ++ F E +
Sbjct: 416 EQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFIERVN 475
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--------------VEWAYGVGGELY 176
F + +K +L + GGPII+ QVENEYG+ V +G L+
Sbjct: 476 LFEEAVAKQVK--DLTIANGGPIIMVQVENEYGSYGADKGYVSQIRDIVRTHFGNDIALF 533
Query: 177 -VKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD----GFTPNSPSKPIMWTEN 231
WA++ +N + W M N G D P+ P+M +E
Sbjct: 534 QCDWASNFTLNGLDDLIWTM-----------NFGTGANVDQQFAKLKKLRPNSPLMCSEF 582
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA-- 287
+SGWF +G RP ED+ + G +F + YM GGTN+G AG P A
Sbjct: 583 WSGWFDKWGANHETRPAEDMIKGIDDMLSRGISF-SLYMTHGGTNWGHWAGANSPGFAPD 641
Query: 288 -TSYDYDAPIDEYGFIRQPKWGHLRE 312
TSYDYDAPI E G PK+ LRE
Sbjct: 642 VTSYDYDAPISESGQT-TPKYWKLRE 666
>gi|261407762|ref|YP_003244003.1| beta-galactosidase [Paenibacillus sp. Y412MC10]
gi|261284225|gb|ACX66196.1| Beta-galactosidase [Paenibacillus sp. Y412MC10]
Length = 587
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 104/296 (35%), Positives = 144/296 (48%), Gaps = 26/296 (8%)
Query: 23 SGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTV 82
SG+IHY R PE W + + K + GL +ETY+ WN HEP GQ+ F+G DL RFV+
Sbjct: 23 SGAIHYFRVVPEYWEDRLMKLRSCGLNTVETYIPWNLHEPKEGQFVFDGIADLERFVRIA 82
Query: 83 QEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQ 142
+ GL + LR PY CAEW +GG P WL P IQ R + + E++ ++ ++I +
Sbjct: 83 GDLGLHVILRPSPYICAEWEFGGLPSWLLQNPDIQLRCMDPVYLEKVDQYYDELIPRLVP 142
Query: 143 ENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV-------PWVM 195
L S+GGP+I Q+ENEYG +YG Y+++ D + V P
Sbjct: 143 --LLTSKGGPVIAMQIENEYG----SYG-NDTAYLEYLKDGLIKRGVDVLLFTSDGPTDG 195
Query: 196 CQQEDAPDPIINTCN-----GFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVED 250
Q A ++ T N D P P+M E ++GWF + R ED
Sbjct: 196 MLQGGAVPGVLATVNFGSRTKEAFDKLREYRPEDPLMCMEYWNGWFDHWLKPHHTRDAED 255
Query: 251 LAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSYDYDAPIDEYG 300
A + + N+YM+ GGTNFG G TSYDYDAP+ E G
Sbjct: 256 AAAVFKEMLDLNASV-NFYMFHGGTNFGFYNGANFHEKYEPTLTSYDYDAPLSECG 310
>gi|361068121|gb|AEW08372.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
gi|383128330|gb|AFG44821.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
gi|383128334|gb|AFG44823.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
Length = 157
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 76/153 (49%), Positives = 103/153 (67%), Gaps = 5/153 (3%)
Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
VNG+SIGRYW +Y+A GCT CDYRG+Y +SKC +CGQP+Q LYH+PR+W+ P N+
Sbjct: 1 VNGKSIGRYWPSYIASQGGCTDSCDYRGAYSSSKCLTNCGQPSQKLYHVPRSWIQPTGNV 60
Query: 688 LVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPN----LGVVSSSPQVRLAC 743
LV+ EELGGDP++IS + ++ +C+ VSE PPV SWK + L V +++L C
Sbjct: 61 LVLFEELGGDPTQISFMARSVGTVCARVSETHLPPVGSWKSSATSGLKVNKPKAELQLHC 120
Query: 744 ERGWH-IAAINFASYGIPEGNCGSFRPGACHMD 775
H I +I FAS+G P G CGSF G C+ +
Sbjct: 121 PSSGHLIKSIKFASFGTPTGRCGSFTYGHCNTN 153
>gi|319934802|ref|ZP_08009247.1| beta-galactosidase [Coprobacillus sp. 29_1]
gi|319810179|gb|EFW06541.1| beta-galactosidase [Coprobacillus sp. 29_1]
Length = 589
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 166/332 (50%), Gaps = 37/332 (11%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DGK + SG+IHY R P+ W + + K G +ETY+ WN HEP G++ F+G
Sbjct: 10 FIVDGKPIKILSGAIHYFRIVPKHWEDSLYNLKALGFNTVETYIPWNLHEPKEGEFDFQG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+V F+K QE L + +R PY CAEW +GG P WL + R+ + E++K
Sbjct: 70 IKDVVSFIKKAQEMELMVIVRPSPYICAEWEFGGLPAWLLTYDNLHLRSDCPRYLEKVKN 129
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ ++ ++ +L ++QGGPII+ QVENE+G+ + Y+K ++L V
Sbjct: 130 YYEVLLPMLT--SLQSTQGGPIIMMQVENEFGSFS-----NNKTYLKKLKKIMLDLGVEV 182
Query: 192 P-------WVMCQQEDA--PDPIINTCN-GFY-------CDGFTPNSPSK-PIMWTENYS 233
P W + + D ++ T N G + + F N K P+M E +
Sbjct: 183 PLFTSDGSWQQALESGSLIDDDVLVTANFGSHSHENLDVLEQFMANHQKKWPLMSMEFWD 242
Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PL 285
GWF +G + R +DLA V G N YM+ GGTNFG G P
Sbjct: 243 GWFNRWGEEIITRDAQDLANCVKELLTRGSI--NLYMFHGGTNFGFMNGCSARGQKDLPQ 300
Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLRELHKAI 317
V TSYDYDA + E G I + K+ ++++ K +
Sbjct: 301 V-TSYDYDALLTEAGDITE-KYQCVKKVMKEL 330
>gi|395803570|ref|ZP_10482814.1| beta-galactosidase [Flavobacterium sp. F52]
gi|395434124|gb|EJG00074.1| beta-galactosidase [Flavobacterium sp. F52]
Length = 617
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 113/339 (33%), Positives = 165/339 (48%), Gaps = 56/339 (16%)
Query: 15 DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE-GRF 73
DGK + SG +HY R E W ++ K GL + TYVFWNYHE G + F+ G
Sbjct: 37 DGKIIKIHSGEMHYERIPKEYWRHRLQMLKAMGLNTVATYVFWNYHEIEPGVWDFKTGNR 96
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL F++ + GL++ LR GPYAC EW +GG+P WL P + RT N F + K +L
Sbjct: 97 DLAEFLRIAKSEGLYVILRPGPYACGEWEFGGYPWWLQNNPDLVIRTNNKAFLDACKTYL 156
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVP- 192
+ ++K FA+QGGPII+ Q ENE+G+ YV D + + +
Sbjct: 157 EHLYAVVKGN--FANQGGPIIMVQAENEFGS-----------YVSQRTDISAEDHKAYKT 203
Query: 193 --WVMCQQEDAPDPIINT-----CNGFYCDGFTPNSPSK------------------PIM 227
+ + ++ P+P + G +G P + + P M
Sbjct: 204 AIYNILKETGFPEPFFTSDGSWLFEGGMVEGVLPTANGESNIENLKKQVDKYHKGQGPYM 263
Query: 228 WTENYSGWFLSFGYAVPFRPV--EDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-- 283
E Y GW +A PF + E++A ++ + G +F NYYM GGTNFG T+G
Sbjct: 264 VAEFYPGWLDH--WAEPFVKIGSEEIASQTKKYLDAGVSF-NYYMAHGGTNFGFTSGANY 320
Query: 284 -------PLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
P + TSYDYDAPI E G+ PK+ +R++ +
Sbjct: 321 NEESDIQPDI-TSYDYDAPISEAGW-ATPKFMAIRDVMQ 357
>gi|340346435|ref|ZP_08669560.1| family 35 glycosyl hydrolase [Prevotella dentalis DSM 3688]
gi|339611892|gb|EGQ16709.1| family 35 glycosyl hydrolase [Prevotella dentalis DSM 3688]
Length = 859
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 112/334 (33%), Positives = 160/334 (47%), Gaps = 44/334 (13%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
+++GK V+++ +HYPR W + I+ K G+ + YVFWN HE GQ+ F
Sbjct: 100 TFLLNGKPFVVKAAEVHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQREGQFDFT 159
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL I+ R + F E ++
Sbjct: 160 GQNDVAAFCRLAQQNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREQDPYFMERVE 219
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
F K+ + + L +GGPII+ QVENEYG +YG + YV D +
Sbjct: 220 LFEQKVAEQLAP--LTIRRGGPIIMVQVENEYG----SYGED-KAYVSQIRDVLRRYWSL 272
Query: 191 VPWVMCQQEDAPDPIINTCNGFYCDGFTPN---------------------------SPS 223
P + E A P++ C+ + FT N P
Sbjct: 273 SPTGEGRGE-AASPLMFQCD--WSSNFTRNGLDDLVWTMNFGTGANINDQFRRLGELRPD 329
Query: 224 KPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG 283
P M +E +SGWF +G RP D+ + G +F + YM GGT+FG AG
Sbjct: 330 APKMCSEFWSGWFDKWGARHETRPARDMVAGIDEMLSKGISF-SLYMTHGGTSFGHWAGA 388
Query: 284 --PLVA---TSYDYDAPIDEYGFIRQPKWGHLRE 312
P A TSYDYDAPI+EYG PK+ LR+
Sbjct: 389 NSPGFAPDVTSYDYDAPINEYGQA-TPKFWELRK 421
>gi|433651261|ref|YP_007277640.1| beta-galactosidase [Prevotella dentalis DSM 3688]
gi|433301794|gb|AGB27610.1| beta-galactosidase [Prevotella dentalis DSM 3688]
Length = 797
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 112/334 (33%), Positives = 160/334 (47%), Gaps = 44/334 (13%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
+++GK V+++ +HYPR W + I+ K G+ + YVFWN HE GQ+ F
Sbjct: 38 TFLLNGKPFVVKAAEVHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQREGQFDFT 97
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL I+ R + F E ++
Sbjct: 98 GQNDVAAFCRLAQQNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREQDPYFMERVE 157
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
F K+ + + L +GGPII+ QVENEYG +YG + YV D +
Sbjct: 158 LFEQKVAEQLAP--LTIRRGGPIIMVQVENEYG----SYGED-KAYVSQIRDVLRRYWSL 210
Query: 191 VPWVMCQQEDAPDPIINTCNGFYCDGFTPN---------------------------SPS 223
P + E A P++ C+ + FT N P
Sbjct: 211 SPTGEGRGE-AASPLMFQCD--WSSNFTRNGLDDLVWTMNFGTGANINDQFRRLGELRPD 267
Query: 224 KPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG 283
P M +E +SGWF +G RP D+ + G +F + YM GGT+FG AG
Sbjct: 268 APKMCSEFWSGWFDKWGARHETRPARDMVAGIDEMLSKGISF-SLYMTHGGTSFGHWAGA 326
Query: 284 --PLVA---TSYDYDAPIDEYGFIRQPKWGHLRE 312
P A TSYDYDAPI+EYG PK+ LR+
Sbjct: 327 NSPGFAPDVTSYDYDAPINEYGQA-TPKFWELRK 359
>gi|440800373|gb|ELR21412.1| lysosomal betagalactosidase, partial [Acanthamoeba castellanii str.
Neff]
Length = 604
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 108/309 (34%), Positives = 150/309 (48%), Gaps = 30/309 (9%)
Query: 15 DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
DG+ + SGSIHY RS PE WP +R + GL + TYV WN HEP GQY F GR D
Sbjct: 36 DGQEFRIVSGSIHYFRSLPEQWPARLRTLRSCGLNTVTTYVPWNLHEPTPGQYDFSGRLD 95
Query: 75 LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
+VRF++ Q+ G + +R PY CAE +GG P WL G+Q R ++ + + + FL
Sbjct: 96 IVRFIEAAQQEGFLVIVRPPPYICAELEFGGLPAWLLNEEGLQLRCSDPKYLKRVDSFLD 155
Query: 135 KIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSVP 192
+ ++ S+GGPII QVENEYG+ + Y EL + A+ +++
Sbjct: 156 HFLPMLATYQY--SRGGPIIAMQVENEYGSYGNDHLYLRHLELKFRQHQIDAILFSSNGA 213
Query: 193 WVMCQQEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWTENYSGWFLSFG----Y 241
A ++ T N G +G PS P+ TE + GWF +G
Sbjct: 214 GDQMFVGGALPSLLRTVNFGTGADVEGNLKVLRKYQPSGPLFVTEFWDGWFDHWGEEHHT 273
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG----------PLVATSYD 291
P + ++ L + + N YM FGGTNFG T G TSYD
Sbjct: 274 TTPTQSMKTLEAIL-----SNNASVNLYMAFGGTNFGFTNGANKGYGETDPYQPTTTSYD 328
Query: 292 YDAPIDEYG 300
YDAP++E G
Sbjct: 329 YDAPVNESG 337
>gi|260804659|ref|XP_002597205.1| hypothetical protein BRAFLDRAFT_203307 [Branchiostoma floridae]
gi|229282468|gb|EEN53217.1| hypothetical protein BRAFLDRAFT_203307 [Branchiostoma floridae]
Length = 608
Score = 160 bits (404), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 116/337 (34%), Positives = 167/337 (49%), Gaps = 46/337 (13%)
Query: 8 DHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQY 67
D IDGK L SG++HY R PE W + + K K GL +ETYV WN HEP + Y
Sbjct: 26 DGANFTIDGKPVRLLSGAMHYFRVVPEYWRDRMLKMKAAGLNTLETYVPWNLHEPEKYTY 85
Query: 68 YFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKE 127
FEG DL R++ E GL++ LR GPY CAEW +GG P WL ++ RTT F +
Sbjct: 86 NFEGILDLGRYLDIAHEVGLWVILRPGPYICAEWEFGGIPGWLAYVKE-HVRTTRPMFID 144
Query: 128 EMK----RFLAKIIDLMKQENLFASQGGPIILAQVENEYG----NVEWAYGVG------G 173
++ R LA+++ + GGPII Q+ENEYG + E+ + G
Sbjct: 145 PVEVWFGRLLAEVVPRQ------YTNGGPIIAVQIENEYGGFSNSTEYMERLKKILESRG 198
Query: 174 ELYVKWAAD-TAVNLNTSVPWVMCQ---QEDAPDPIINTCNGFYCDGFTPNSPSKPIMWT 229
+ + + +D ++ +P V+ Q +A D + P +P+M
Sbjct: 199 IVELLFTSDGKGALISGGIPGVLKTVNFQNNASDKL---------QKLKEIQPDRPMMVM 249
Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFG---------R 279
E ++GWF +G +E +F + F+ G N+YM+ GGTNFG +
Sbjct: 250 EYWTGWFDHWGEDHHLYRLESESFVHSVFYILDAGASVNFYMFHGGTNFGFMNGANTRYK 309
Query: 280 TAGGPL-VATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
+ G L TSYDYDAPI E G + PK+ +RE+ K
Sbjct: 310 SGGRTLPTITSYDYDAPISETGDL-TPKYFKIREILK 345
>gi|376338072|gb|AFB33581.1| hypothetical protein 2_7725_01, partial [Pinus mugo]
gi|376338074|gb|AFB33582.1| hypothetical protein 2_7725_01, partial [Pinus mugo]
Length = 157
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 103/153 (67%), Gaps = 5/153 (3%)
Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
VNG+SIGRYW +Y+A +GCT CDYRG+Y +SKC +CGQP+Q LYH+PR+W+ N+
Sbjct: 1 VNGKSIGRYWPSYIASQSGCTDSCDYRGAYSSSKCLTNCGQPSQKLYHVPRSWIQSTGNV 60
Query: 688 LVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPN----LGVVSSSPQVRLAC 743
LV+ EELGGDP++IS + ++ +C+ VSE PPV SWK + L V +++L C
Sbjct: 61 LVLFEELGGDPTQISFMARSVGTVCARVSETHLPPVGSWKSSATSVLKVNKPKAELQLHC 120
Query: 744 ERGWH-IAAINFASYGIPEGNCGSFRPGACHMD 775
H I +I FAS+G P G CGSF G C+ +
Sbjct: 121 PSSGHLIKSIKFASFGTPTGRCGSFTYGHCNTN 153
>gi|251799202|ref|YP_003013933.1| beta-galactosidase [Paenibacillus sp. JDR-2]
gi|247546828|gb|ACT03847.1| Beta-galactosidase [Paenibacillus sp. JDR-2]
Length = 604
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 108/337 (32%), Positives = 163/337 (48%), Gaps = 41/337 (12%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
+ +T+ + +DG+ + SG+IHY R PE W + + K K G +ETY+ WN HEP
Sbjct: 2 SRLTWKDQKYRLDGEEFRILSGAIHYFRVVPEYWEDRLLKLKACGFNTVETYIPWNLHEP 61
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G + F+G D+ RF++T GL + +R PY CAEW +GG P WL + R +
Sbjct: 62 REGSFRFDGFADVARFIETAGRLGLHVIVRPSPYICAEWEFGGLPAWL-LKSSMGLRCMD 120
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
N + E++ R+ ++I + L S+GGPII QVENEYG +YG Y+ + D
Sbjct: 121 NEYLEKVDRYYDELIPRLLP--LLDSRGGPIIAVQVENEYG----SYG-NDTAYLAYLRD 173
Query: 183 TAVNLNTSVPWVMCQQEDAPDPII--NTCNGFYCD------------GFTPNSPSKPIMW 228
+ V ++ + D ++ T G + + +P+M
Sbjct: 174 GLIR--RGVDCLLFTSDGPTDEMLLGGTVEGLHATVNFGSRVAESLAKYREYRQDEPLMV 231
Query: 229 TENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG----- 283
E + GWF + R D+A + E G + N YM+ GGTNFG +G
Sbjct: 232 MEYWLGWFDHWRKPHHVREAGDVANVLDEMLEQGASV-NLYMFHGGTNFGFYSGANYGEH 290
Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK 318
P + TSYDYDAP+ E WG + E +KAI+
Sbjct: 291 YEPTI-TSYDYDAPLTE--------WGDITEKYKAIR 318
>gi|340722578|ref|XP_003399681.1| PREDICTED: beta-galactosidase-like [Bombus terrestris]
Length = 646
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 106/325 (32%), Positives = 162/325 (49%), Gaps = 35/325 (10%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
S V Y++ ++DGK SGS HY R+ + W + +RK + GL + TYV W+ H
Sbjct: 30 FSFEVDYENNQFLLDGKPFRYISGSFHYFRTPRQYWRDRLRKMRAAGLNAVSTYVEWSLH 89
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFR 119
+P ++++ G D++ F+ QE GLF+ LR GPY CAE ++GG P W L +P I+ R
Sbjct: 90 QPTENEWHWTGDADVIEFINIAQEEGLFVLLRPGPYICAERDFGGLPYWLLARVPDIKLR 149
Query: 120 TTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--------------V 165
T ++ + + ++ +L +I+D K + GGPII+ QVENEYG+ +
Sbjct: 150 TNDSRYMKYVEIYLNEILD--KVQPYLRGNGGPIIMVQVENEYGSYACDREYLSRLRDIM 207
Query: 166 EWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKP 225
G LY A+ + +P V + P+ N F + P P
Sbjct: 208 RQKIGTKALLYSTDGANANMLRCGFIPEVYATVDFGPN--TNVTKNF--EIMRMYQPRGP 263
Query: 226 IMWTENYSGWFLSFGYAVPFRPVE--DLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG 283
++ +E Y GW + PF+ V+ + + G + N YM++GGTNFG TAG
Sbjct: 264 LVNSEFYPGWLTH--WREPFQRVQTATVTKTLDEMLSLGASV-NIYMFYGGTNFGYTAGA 320
Query: 284 --------PLVATSYDYDAPIDEYG 300
P + TSYDYDAP+ E G
Sbjct: 321 NGGHNAYNPQL-TSYDYDAPLTEAG 344
>gi|376338078|gb|AFB33584.1| hypothetical protein 2_7725_01, partial [Pinus mugo]
Length = 157
Score = 159 bits (403), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 103/153 (67%), Gaps = 5/153 (3%)
Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
VNG+SIGRYW +Y+A +GCT CDYRG+Y +SKC +CGQP+Q LYH+PR+W+ N+
Sbjct: 1 VNGKSIGRYWPSYIASQSGCTDSCDYRGAYSSSKCLTNCGQPSQKLYHVPRSWIQSTGNV 60
Query: 688 LVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPN----LGVVSSSPQVRLAC 743
LV+ EELGGDP++IS + ++ +C+ VSE PPV SWK + L V +++L C
Sbjct: 61 LVLFEELGGDPTQISFMARSVGTVCARVSETHLPPVGSWKSSATSGLKVNKPKAELQLHC 120
Query: 744 ERGWH-IAAINFASYGIPEGNCGSFRPGACHMD 775
H I +I FAS+G P G CGSF G C+ +
Sbjct: 121 PSSGHLIKSIKFASFGTPTGRCGSFTYGHCNTN 153
>gi|220914306|ref|YP_002489615.1| beta-galactosidase [Arthrobacter chlorophenolicus A6]
gi|219861184|gb|ACL41526.1| Beta-galactosidase [Arthrobacter chlorophenolicus A6]
Length = 586
Score = 159 bits (403), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 107/310 (34%), Positives = 154/310 (49%), Gaps = 28/310 (9%)
Query: 10 RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
R ++DG+ + SG+IHY R P++W + IRK++ GL IETYV WN H G +
Sbjct: 9 RDFLLDGEPFRILSGAIHYFRVHPDLWADRIRKARLMGLNTIETYVPWNEHSSTPGAFRT 68
Query: 70 EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
+G DL RF+ V G+ +R GPY CAEW+ GG P WL P I R++ + +
Sbjct: 69 DGGLDLGRFLDLVAAEGMQGIVRPGPYICAEWDNGGLPAWLFTDPSIGVRSSEPGYLAAV 128
Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
F+ +++ ++ + + ++GGP+IL Q+ENEYG AYG + Y++ DTA
Sbjct: 129 DGFMDRLLPIVVERQI--TRGGPVILFQIENEYG----AYG-SDKAYLQHLVDTATRAGV 181
Query: 190 SVPWVMCQQ------EDAPDPIINTCNGFYCDG------FTPNSPSKPIMWTENYSGWFL 237
VP C Q ED P ++ F P P+M E ++GWF
Sbjct: 182 EVPLFTCDQPFETMIEDGSLPGLHKTGTFGSRADERLAFLRERQPDGPLMCAEFWNGWFD 241
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PLVATSY 290
++G A + G + N YM+ GGTNFG T G P + TSY
Sbjct: 242 NWGTHHHTTDAAASAAELDALLAAGASV-NIYMFHGGTNFGFTNGANDKGIYEPTI-TSY 299
Query: 291 DYDAPIDEYG 300
DYDAP+ E G
Sbjct: 300 DYDAPLSEDG 309
>gi|393782614|ref|ZP_10370797.1| hypothetical protein HMPREF1071_01665 [Bacteroides salyersiae
CL02T12C01]
gi|392672841|gb|EIY66307.1| hypothetical protein HMPREF1071_01665 [Bacteroides salyersiae
CL02T12C01]
Length = 605
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 113/325 (34%), Positives = 160/325 (49%), Gaps = 39/325 (12%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE-GR 72
+D K + SG IH R E W + I+ K G + Y+ WNYHE G + F+ G
Sbjct: 41 LDDKPFQIISGEIHPSRIPAEYWKQRIQMIKAMGCNTVACYIMWNYHESEPGVFDFQTGN 100
Query: 73 FDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRF 132
+L +F++TVQ+ G+FL R GPY C EW++GG P +L IP I+ R + + ++R+
Sbjct: 101 KNLEKFIQTVQDEGMFLLFRPGPYVCGEWDFGGLPPYLLSIPDIKIRCMDTRYTAAVERY 160
Query: 133 LAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVP 192
+ KI ++K+ + + GGPII+ QVENEYG +YG +Y+KW D + VP
Sbjct: 161 VDKIAPIIKKYEI--TNGGPIIMVQVENEYG----SYG-NDRIYMKWMHDLWRDKGIEVP 213
Query: 193 WVMCQQEDAPDPII---NTCNGFYCDGFTPNS------------PSKPIMWTENYSGWFL 237
+ D P + T G G P + P + +E Y GW
Sbjct: 214 FYTA---DGATPYMLEAGTLPGVAI-GLDPAASKAEFDEALKVHPDASVFCSELYPGWLT 269
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG---------PLVAT 288
+ +E + V + G +F NYY+ GGTNFG AG P V T
Sbjct: 270 HWREEWQHPSIEKITTDVKWLLDNGKSF-NYYVIHGGTNFGFWAGANSPQPGTYQPDV-T 327
Query: 289 SYDYDAPIDEYGFIRQPKWGHLREL 313
SYDYDAPI+E G PK+ LREL
Sbjct: 328 SYDYDAPINEMG-QATPKYMALREL 351
>gi|296399387|gb|ADH10509.1| galactosidase, beta 1, 5 prime [Zonotrichia albicollis]
Length = 571
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 110/342 (32%), Positives = 166/342 (48%), Gaps = 27/342 (7%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
S + Y+ + V DGK SGSIHY R W + + K K GL+ I+TYV WNYHE
Sbjct: 6 SFGIDYESNSFVKDGKPFRYISGSIHYSRVPSYYWKDRLLKMKMAGLDAIQTYVPWNYHE 65
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G Y F G DL F++ + GL + LR GPY CAEW+ GG P WL I R++
Sbjct: 66 PRMGTYDFFGGKDLEYFLQLANDTGLLVILRAGPYICAEWDMGGLPAWLLEKKSIVLRSS 125
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAYGVG------ 172
++ + E ++R++ ++ M+ GGPII+ QVENEYG+ ++ Y
Sbjct: 126 DSDYLEAVERWMGVLLPKMRP--YLYQNGGPIIMVQVENEYGSYFACDYDYLRFLLKLFR 183
Query: 173 ---GELYVKWAADTAVNLNT---SVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPI 226
G+ V + D A + ++ + + AP N F + P P+
Sbjct: 184 LHLGDEVVLFTTDGASQFHLKCGALQGLYATVDFAPGG--NVTAAFLAQ--RSSEPMGPL 239
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL- 285
+ +E Y+GW +G+ P E +A + G N YM+ GGTNF G +
Sbjct: 240 VNSEFYTGWLDHWGHRHSVVPAETVAKTLNEILARGANV-NLYMFIGGTNFAYWNGANMP 298
Query: 286 ---VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYL 324
TSYDYDAP+ E G + + K+ +R++ + + +L
Sbjct: 299 YMPQPTSYDYDAPLSEAGDLTE-KYFTIRKVIGMVSVPRTFL 339
>gi|374606374|ref|ZP_09679251.1| beta-galactosidase [Paenibacillus dendritiformis C454]
gi|374388019|gb|EHQ59464.1| beta-galactosidase [Paenibacillus dendritiformis C454]
Length = 583
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 115/349 (32%), Positives = 168/349 (48%), Gaps = 35/349 (10%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
++YD + + L SG+IHY R P W + +RK K G IETYV WN HEP
Sbjct: 2 TTLSYDQGQFTMGDRPIQLISGAIHYFRVVPAYWEDRLRKIKAMGCNCIETYVAWNLHEP 61
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G+++FEG D+ FV+ E GL++ +R PY CAEW +GG P WL ++ R +
Sbjct: 62 REGEFHFEGMSDVAEFVRLAGELGLYVIVRPSPYICAEWEFGGLPAWL-LKDDMRLRCND 120
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
F E++ + ++ + L A++GGPII Q+ENEYG +YG + Y++ A
Sbjct: 121 PRFLEKVAAYYDALLPQLTP--LLATKGGPIIAVQIENEYG----SYG-NDQAYLQ--AQ 171
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIIN--TCNGFYC------------DGFTPNSPSKPIMW 228
A+ + V ++ + D ++ G D P P+M
Sbjct: 172 RAMLIERGVDVLLFTSDGPQDDMLQGGMAEGVLATVNFGSRPKEAFDKLKEYQPDGPLMC 231
Query: 229 TENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG----- 283
E ++GWF + R ED A + G + N+YM GGTNFG +G
Sbjct: 232 MEYWNGWFDHWFEQHHTRDAEDAARVLDDMLGMGASV-NFYMVHGGTNFGFGSGANHSDK 290
Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIKLCEEYLISSDP 329
P V TSYDYDA I E G + PK+ RE + K + L E L ++ P
Sbjct: 291 YEPTV-TSYDYDAAISEAGDL-TPKYHAFREVIGKYVSLPEGDLPANTP 337
>gi|257869131|ref|ZP_05648784.1| 35 glycosylhydrolase [Enterococcus gallinarum EG2]
gi|257803295|gb|EEV32117.1| 35 glycosylhydrolase [Enterococcus gallinarum EG2]
Length = 584
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 170/639 (26%), Positives = 260/639 (40%), Gaps = 143/639 (22%)
Query: 23 SGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTV 82
SGSIHY R P W + + K + G +ETYV WN HEP G++ F DL RF++
Sbjct: 21 SGSIHYFRVVPAYWRDRLEKLRLMGCNTVETYVPWNMHEPQEGKFDFSDNLDLRRFIQLA 80
Query: 83 QEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQ 142
QE GL++ LR PY CAEW +GG P WL P ++ R PF E++ R+ ++ +
Sbjct: 81 QEVGLYVILRPAPYICAEWEFGGLPYWLLKDPFMKIRFDYPPFMEKIARYFTQLFSQVS- 139
Query: 143 ENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA------VNLNTSV-PWVM 195
+L +Q GPI++ QVENEYG +YG + Y++ +A+ V+L TS PW+
Sbjct: 140 -DLQITQEGPILMMQVENEYG----SYG-NDKSYLRKSAELMRHNGIDVSLFTSDGPWLD 193
Query: 196 CQQ----EDAPDPIINTCNGFYCDGFTP----NSPSKPIMWTENYSGWFLSFGYAVPF-R 246
+ +D P IN C + F + +P+M E + GWF ++G
Sbjct: 194 MLENGSIKDIALPTIN-CGSDIQENFRKLQEFHGKKQPLMVMEFWIGWFDAWGDDKHHTT 252
Query: 247 PVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV------ATSYDYDAPIDEYG 300
V D A + E G N YM+ GGTNFG G TSYDYDA + E+G
Sbjct: 253 SVTDAANELRDCLEAGSV--NIYMFHGGTNFGFMNGANYYEKLSPDVTSYDYDALLSEWG 310
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+ PK+ +++ I + +++ T + G
Sbjct: 311 DV-TPKYEAFQQVIGEITEIPSFPLTTKITKRAYG------------------------- 344
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
+W VS + +F T ISQ ++P ELL ++
Sbjct: 345 ---------------SWKVS----QRVSLFETLASISQPVKHNYPLTM-----ELLDQAT 380
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
+ +Y ++G S + DY ++ +
Sbjct: 381 GYVYYRSQIGKS-----------------RVIEDYR----------------LIHCQDRA 407
Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
H F+N +L Y K + L E N L IL +G NY +
Sbjct: 408 HT---FINNQLQFIQYDQE----IGQKKTLTLTEESNELGILVENMGRVNYSVQMNHQYK 460
Query: 541 GLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLI 599
G+ +++ NG EW IY + ++ LD++ S W+ G
Sbjct: 461 GIKDGVIV---NGA---FQSEWEIYSLPMD----NLDQVDF--SGHWQTGQP-------S 501
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
+ K +F E + L GKG +NG +IGR+W
Sbjct: 502 FSKVSFQVDECADTF-VELPGWGKGFIVINGHNIGRFWE 539
>gi|383128332|gb|AFG44822.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
Length = 157
Score = 159 bits (402), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 5/153 (3%)
Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
VNG+SIGRYW +Y+A GCT CDYRG+Y +SKC +CGQP+Q LYH+PR+W+ P N+
Sbjct: 1 VNGKSIGRYWPSYIASQGGCTDSCDYRGAYSSSKCLTNCGQPSQKLYHVPRSWIQPTGNV 60
Query: 688 LVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPN----LGVVSSSPQVRLAC 743
LV+ EELGGDP++IS + ++ +C+ VSE PPV SWK + L V +++L C
Sbjct: 61 LVLFEELGGDPTQISFMARSVGTVCARVSETHLPPVGSWKSSATSGLKVNKPKAELQLHC 120
Query: 744 ERGWH-IAAINFASYGIPEGNCGSFRPGACHMD 775
H I +I F S+G P G CGSF G C+ +
Sbjct: 121 PSSGHLIKSIKFVSFGTPTGRCGSFTYGHCNTN 153
>gi|297842039|ref|XP_002888901.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297334742|gb|EFH65160.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 686
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 113/351 (32%), Positives = 161/351 (45%), Gaps = 39/351 (11%)
Query: 15 DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
DG + G +HY R PE W + + ++K GL I+ YV WN HEP G+ FEG D
Sbjct: 72 DGNHFQIIGGDLHYFRVLPEYWEDRLLRAKALGLNTIQVYVPWNLHEPKPGKMVFEGIGD 131
Query: 75 LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFI-PGIQFRTTNNPFKEEMKRFL 133
LV F+K + + LR GPY C EW+ GGFP WL + P +Q RT++ + + ++R+
Sbjct: 132 LVSFLKLCDKLDFMVMLRAGPYICGEWDLGGFPAWLLSVKPRLQLRTSDPAYLKLVERWW 191
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGN-----------VEWAYGVGGELYVKWAAD 182
+ L K L S GGP+I+ Q+ENEYG+ V A G G+ + + D
Sbjct: 192 GVL--LPKIFPLIYSNGGPVIMVQIENEYGSYGNDKAYLRKLVSMARGHLGDDIIVYTTD 249
Query: 183 ---------TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYS 233
V ++ V D P PI F G S P + +E Y+
Sbjct: 250 GGTKETLEKGTVPVDDVYSAVDFTTGDDPWPIFELQKKFNAPG------SSPPLSSEFYT 303
Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------ 287
GW +G + E A ++ + G+ YM GGTNFG G +
Sbjct: 304 GWLTHWGEKIAKTDAEFTATSLEKILSRNGS-AVLYMVHGGTNFGFYNGANTGSEESDYK 362
Query: 288 ---TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLG 335
TSYDYDAPI E G I PK+ L+ + K + +I S+ + G
Sbjct: 363 PDLTSYDYDAPIKESGDIDNPKFRALQRVIKKYNVASHSIIPSNKQRKAYG 413
>gi|383128340|gb|AFG44826.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
Length = 157
Score = 159 bits (401), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 104/153 (67%), Gaps = 5/153 (3%)
Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
VNG+SIGRYW +Y+A GCT CDYRG+Y +SKC +CG+P+Q LYH+PR+W+ P N+
Sbjct: 1 VNGKSIGRYWPSYIASQGGCTDSCDYRGAYSSSKCLTNCGKPSQKLYHVPRSWIQPTGNV 60
Query: 688 LVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPN----LGVVSSSPQVRLAC 743
LV+ EELGGDP++IS + ++ +C+ VSE PPV SWK + L V +++L C
Sbjct: 61 LVLFEELGGDPTQISFMARSVGTVCARVSETHLPPVGSWKSSATSGLKVNKPKGELQLHC 120
Query: 744 ERGWH-IAAINFASYGIPEGNCGSFRPGACHMD 775
H I +I FAS+G P G+CGSF G C+ +
Sbjct: 121 PSSGHLIKSIKFASFGTPTGHCGSFTYGHCNTN 153
>gi|375146511|ref|YP_005008952.1| glycoside hydrolase family protein [Niastella koreensis GR20-10]
gi|361060557|gb|AEV99548.1| glycoside hydrolase family 35 [Niastella koreensis GR20-10]
Length = 920
Score = 159 bits (401), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 106/312 (33%), Positives = 149/312 (47%), Gaps = 32/312 (10%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
A ++DG+ + SG +HYPR E W + +RK+K GL I TYVFWN HEP +G+Y F
Sbjct: 346 AFLLDGQPFQIISGEMHYPRVPREAWRDRMRKAKAMGLNTIGTYVFWNLHEPQKGKYDFS 405
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G D+ FVKT QE GL++ LR PY CAEW +GG+P WL I G++ R+ + + K
Sbjct: 406 GNNDIAAFVKTAQEEGLWVILRPSPYVCAEWEFGGYPYWLQNIKGLEVRSKEPQYLQAYK 465
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYG-------VGGELYVKWAADT 183
++ ++ + L + GG I++ QVENEYG AYG + L+++ D
Sbjct: 466 NYIMQVGKQLAP--LQVNHGGNILMVQVENEYG----AYGSDREYLDINRRLFIEAGFDG 519
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTP------NSPSKPIMWTENYSGWFL 237
L T P + + P + + NG N P E Y WF
Sbjct: 520 L--LYTCDPEPFLAKGNLPGKLFTSINGLDKPARIKQLIKQNNEGKGPYFVAEWYPAWFD 577
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG---------PLVAT 288
+G P E + G + N YM+ GGT G P + +
Sbjct: 578 WWGTQHHKVPAEKYTPGLDSVLSAGMSV-NMYMFHGGTTRDFMNGANYNDQNPYEPQI-S 635
Query: 289 SYDYDAPIDEYG 300
SYDYDAP+DE G
Sbjct: 636 SYDYDAPLDEAG 647
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 68/241 (28%), Positives = 102/241 (42%), Gaps = 57/241 (23%)
Query: 469 GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIEL---NEGINTLDILSMM 525
G+E L I+ L LVF+N K ++ L I L +E I LDIL
Sbjct: 728 GREGALKIKDLRDYGLVFINGKRISV------LDRRLKQDSIWLKLPDEKIQ-LDILVEN 780
Query: 526 VGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSF 585
+G NYG + G+ + NGK +G ++++ + L+ ++L NS
Sbjct: 781 LGRINYGPYLLKNKKGITEGVSF---NGKE--LTGWQMFKL----PFNDLNSVALKNS-- 829
Query: 586 WKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPST 645
K S PV K K TF + + G LNL + GKG WVNG ++GRYW+
Sbjct: 830 -KTLSGAPVLK-----KGTF-SLQTVGDTYLNLGNWGKGVVWVNGHNLGRYWNI------ 876
Query: 646 GCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
P QTLY +P W+ G N +++ E L + S++ +
Sbjct: 877 ----------------------GPQQTLY-VPVEWLKKGGNEIIVLELLKPEQSQLQAVD 913
Query: 706 K 706
K
Sbjct: 914 K 914
>gi|256840666|ref|ZP_05546174.1| glycoside hydrolase, family 35 [Parabacteroides sp. D13]
gi|256737938|gb|EEU51264.1| glycoside hydrolase, family 35 [Parabacteroides sp. D13]
Length = 768
Score = 159 bits (401), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 112/331 (33%), Positives = 162/331 (48%), Gaps = 43/331 (12%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
++GK + SG +HYPR + W +R + GL + TYVFWN HE G++ FEG
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
+L +++ E GL + LR GPY CAEW +GG+P WL IPG++ R N F + K ++
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS-- 190
K+ + + +L S+GGPII+ Q ENE+G+ V + E + ++ A L +
Sbjct: 159 DKLYEQVG--DLQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLADAGF 216
Query: 191 -VPWVMCQQ----EDAPDP-IINTCNG------------FYCDGFTPNSPSKPIMWTENY 232
VP E P + T NG Y G P M E Y
Sbjct: 217 NVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVG------PYMVAEFY 270
Query: 233 SGWFLSFGYAVPFRPVED--LAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA--- 287
GW + +A PF + D +A + + +F N+YM GGTNFG T+G
Sbjct: 271 PGWLMH--WAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHD 327
Query: 288 -----TSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDAPI E G++ PK+ +R +
Sbjct: 328 IQPDLTSYDYDAPISEAGWV-TPKFDSIRNV 357
>gi|261880887|ref|ZP_06007314.1| family 35 glycosyl hydrolase [Prevotella bergensis DSM 17361]
gi|270332394|gb|EFA43180.1| family 35 glycosyl hydrolase [Prevotella bergensis DSM 17361]
Length = 789
Score = 159 bits (401), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 105/319 (32%), Positives = 157/319 (49%), Gaps = 20/319 (6%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
+++ + V+++ +HYPR W I+ K G+ I YVFWN HE G++ F
Sbjct: 38 TFLLNNRPFVVKAAELHYPRIPRAYWDHRIKMCKALGMNTICLYVFWNIHEQREGEFDFS 97
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G D+ F + Q+ G+++ +R GPY CAEW GG P WL I+ R ++ F E ++
Sbjct: 98 GNSDVAAFCRLTQKNGMYIIVRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFMERVE 157
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYG----NVEWAYGVGGELYVKWAADTAVN 186
F K+ + + L GGPII+ QVENEYG + ++ + L W +
Sbjct: 158 IFEQKVAEQLAP--LTIQNGGPIIMVQVENEYGSYGEDKKYVGQIRDVLRKYWYTNGRGP 215
Query: 187 LNTSVPWVMCQQEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWTENYSGWFLSF 239
W +++ + +I T N G D P P M +E +SGWF +
Sbjct: 216 ALFQCDWASNFEKNGLEDLIWTMNFGTGANIDAQFMRLGELRPDAPKMCSEFWSGWFDKW 275
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA---TSYDYDA 294
G RP +D+ + G +F + YM GGT+FG AG P A TSYDYDA
Sbjct: 276 GARHETRPAKDMVAGIDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDA 334
Query: 295 PIDEYGFIRQPKWGHLREL 313
PI+EYG + PK+ LR++
Sbjct: 335 PINEYGQV-TPKFWELRKM 352
>gi|423331257|ref|ZP_17309041.1| hypothetical protein HMPREF1075_01054 [Parabacteroides distasonis
CL03T12C09]
gi|409230553|gb|EKN23415.1| hypothetical protein HMPREF1075_01054 [Parabacteroides distasonis
CL03T12C09]
Length = 768
Score = 159 bits (401), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 112/331 (33%), Positives = 162/331 (48%), Gaps = 43/331 (12%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
++GK + SG +HYPR + W +R + GL + TYVFWN HE G++ FEG
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
+L +++ E GL + LR GPY CAEW +GG+P WL IPG++ R N F + K ++
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS-- 190
K+ + + +L S+GGPII+ Q ENE+G+ V + E + ++ A L +
Sbjct: 159 DKLYEQVG--DLQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLADAGF 216
Query: 191 -VPWVMCQQ----EDAPDP-IINTCNG------------FYCDGFTPNSPSKPIMWTENY 232
VP E P + T NG Y G P M E Y
Sbjct: 217 NVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVG------PYMVAEFY 270
Query: 233 SGWFLSFGYAVPFRPVED--LAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA--- 287
GW + +A PF + D +A + + +F N+YM GGTNFG T+G
Sbjct: 271 PGWLMH--WAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHD 327
Query: 288 -----TSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDAPI E G++ PK+ +R +
Sbjct: 328 IQPDLTSYDYDAPISEAGWV-TPKFDSIRNV 357
>gi|301309736|ref|ZP_07215675.1| beta-galactosidase (Lactase) [Bacteroides sp. 20_3]
gi|423340209|ref|ZP_17317948.1| hypothetical protein HMPREF1059_03873 [Parabacteroides distasonis
CL09T03C24]
gi|300831310|gb|EFK61941.1| beta-galactosidase (Lactase) [Bacteroides sp. 20_3]
gi|409227644|gb|EKN20540.1| hypothetical protein HMPREF1059_03873 [Parabacteroides distasonis
CL09T03C24]
Length = 765
Score = 159 bits (401), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 112/331 (33%), Positives = 162/331 (48%), Gaps = 43/331 (12%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
++GK + SG +HYPR + W +R + GL + TYVFWN HE G++ FEG
Sbjct: 36 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 95
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
+L +++ E GL + LR GPY CAEW +GG+P WL IPG++ R N F + K ++
Sbjct: 96 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 155
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS-- 190
K+ + + +L S+GGPII+ Q ENE+G+ V + E + ++ A L +
Sbjct: 156 DKLYEQVG--DLQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLADAGF 213
Query: 191 -VPWVMCQQ----EDAPDP-IINTCNG------------FYCDGFTPNSPSKPIMWTENY 232
VP E P + T NG Y G P M E Y
Sbjct: 214 NVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVG------PYMVAEFY 267
Query: 233 SGWFLSFGYAVPFRPVED--LAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA--- 287
GW + +A PF + D +A + + +F N+YM GGTNFG T+G
Sbjct: 268 PGWLMH--WAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHD 324
Query: 288 -----TSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDAPI E G++ PK+ +R +
Sbjct: 325 IQPDLTSYDYDAPISEAGWV-TPKFDSIRNV 354
>gi|376338076|gb|AFB33583.1| hypothetical protein 2_7725_01, partial [Pinus mugo]
Length = 157
Score = 159 bits (401), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 75/150 (50%), Positives = 101/150 (67%), Gaps = 5/150 (3%)
Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
VNG+SIGRYW +Y+A +GCT CDYRG+Y +SKC +CGQP+Q LYH+PR+W+ N+
Sbjct: 1 VNGKSIGRYWPSYIASQSGCTDSCDYRGAYSSSKCLTNCGQPSQKLYHVPRSWIQSTGNV 60
Query: 688 LVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPN----LGVVSSSPQVRLAC 743
LV+ EELGGDP++IS + ++ +C+ VSE PPV SWK + L V +++L C
Sbjct: 61 LVLFEELGGDPTQISFMARSVGTVCARVSETHLPPVGSWKSSATSGLKVNKPKAELQLHC 120
Query: 744 ERGWH-IAAINFASYGIPEGNCGSFRPGAC 772
H I +I FAS+G P G CGSF G C
Sbjct: 121 PSSGHLIKSIKFASFGTPTGRCGSFTYGHC 150
>gi|296399420|gb|ADH10537.1| galactosidase, beta 1, 5 prime [Zonotrichia albicollis]
Length = 571
Score = 158 bits (400), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 110/342 (32%), Positives = 165/342 (48%), Gaps = 27/342 (7%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
S + Y+ + V DGK SGSIHY R W + + K K GL+ I+TYV WNYHE
Sbjct: 6 SFGIDYESNSFVKDGKPFRYISGSIHYSRVPSYYWKDRLLKMKMAGLDAIQTYVPWNYHE 65
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P G Y F G DL F++ + GL + LR GPY CAEW+ GG P WL I R++
Sbjct: 66 PRMGTYDFFGGKDLEYFLQLANDTGLLVILRAGPYICAEWDMGGLPAWLLEKKSIVLRSS 125
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAYGVG------ 172
++ + E ++R++ ++ M+ GGPII+ QVENEYG+ ++ Y
Sbjct: 126 DSDYLEAVERWMGVLLPKMRP--YLYQNGGPIIMVQVENEYGSYFACDYDYLRFLLKLFR 183
Query: 173 ---GELYVKWAADTAVNLNT---SVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPI 226
G V + D A + ++ + + AP N F + P P+
Sbjct: 184 LHLGHEVVLFTTDGASQFHLKCGALQGLYATVDFAPGG--NVTAAFLAQ--RSSEPMGPL 239
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL- 285
+ +E Y+GW +G+ P E +A + G N YM+ GGTNF G +
Sbjct: 240 VNSEFYTGWLDHWGHRHSVVPAETVAKTLNEILARGANV-NLYMFIGGTNFAYWNGANMP 298
Query: 286 ---VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYL 324
TSYDYDAP+ E G + + K+ +R++ + + +L
Sbjct: 299 YMPQPTSYDYDAPLSEAGDLTE-KYFTIRKVIGMVSVPRTFL 339
>gi|423226297|ref|ZP_17212763.1| hypothetical protein HMPREF1062_04949 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392629725|gb|EIY23731.1| hypothetical protein HMPREF1062_04949 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 1106
Score = 158 bits (400), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 111/326 (34%), Positives = 156/326 (47%), Gaps = 39/326 (11%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
+++GK V+++ +HYPR W + I+ K G+ + YVFWN HEP G Y F
Sbjct: 356 TFLLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGVYDFT 415
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
+ DL F + Q+ +++ LR GPY CAEW GG P WL ++ R ++ F E +
Sbjct: 416 EQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDVRLRESDPYFIERVA 475
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--------------VEWAYGVGGELY 176
F + +K +L + GGPII+ QVENEYG+ V +G G L+
Sbjct: 476 LFEEAVAKQVK--DLTIANGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANFGNGIALF 533
Query: 177 -VKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD----GFTPNSPSKPIMWTEN 231
WA++ +N + W M N G D P+ P+M +E
Sbjct: 534 QCDWASNFTLNGLDDLIWTM-----------NFGTGANVDQQFAKLKQLRPNSPLMCSEF 582
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA-- 287
+SGWF +G RP D+ + G +F + YM GGTN+G AG P A
Sbjct: 583 WSGWFDKWGANHETRPAADMIKGIDDMLSRGISF-SLYMTHGGTNWGHWAGANSPGFAPD 641
Query: 288 -TSYDYDAPIDEYGFIRQPKWGHLRE 312
TSYDYDAPI E G PK+ LRE
Sbjct: 642 VTSYDYDAPISESGQT-TPKYWALRE 666
>gi|150008152|ref|YP_001302895.1| beta-glycosidase [Parabacteroides distasonis ATCC 8503]
gi|149936576|gb|ABR43273.1| glycoside hydrolase family 35, candidate beta-glycosidase
[Parabacteroides distasonis ATCC 8503]
Length = 768
Score = 158 bits (400), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 112/331 (33%), Positives = 162/331 (48%), Gaps = 43/331 (12%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
++GK + SG +HYPR + W +R + GL + TYVFWN HE G++ FEG
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
+L +++ E GL + LR GPY CAEW +GG+P WL IPG++ R N F + K ++
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS-- 190
K+ + + +L S+GGPII+ Q ENE+G+ V + E + ++ A L +
Sbjct: 159 DKLYEQVG--DLQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLADAGF 216
Query: 191 -VPWVMCQQ----EDAPDP-IINTCNG------------FYCDGFTPNSPSKPIMWTENY 232
VP E P + T NG Y G P M E Y
Sbjct: 217 NVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVG------PYMVAEFY 270
Query: 233 SGWFLSFGYAVPFRPVED--LAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA--- 287
GW + +A PF + D +A + + +F N+YM GGTNFG T+G
Sbjct: 271 PGWLMH--WAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHD 327
Query: 288 -----TSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDAPI E G++ PK+ +R +
Sbjct: 328 IQPDLTSYDYDAPISEAGWV-TPKFDSIRNV 357
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 67/256 (26%), Positives = 105/256 (41%), Gaps = 52/256 (20%)
Query: 455 YLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNE 514
Y+ Y+ + P +G+ L I L A ++V+ + V G N F + + I N
Sbjct: 416 YVLYSTHFN-QPLKGR---LEIPGLRDYATIYVDGERV--GELNRCFNQYAMEIDIPFNA 469
Query: 515 GINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYI 573
TLDIL +G NYG G+ S + I NG +W +Y++ ++
Sbjct: 470 ---TLDILVENMGRINYGEEIVRNTKGIISSVKI---NGS---EISDWKMYKLPMDR--- 517
Query: 574 GLDKISLANSSFWKQGS--TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQ 631
+ + +K GS + + Y+ TF + G +++ GKG ++NG
Sbjct: 518 -MPALVSGEPYVYKNGSPEVAALGNKPVLYEGTFHLSD-TGDTFIDMEDWGKGIIFINGV 575
Query: 632 SIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIH 691
+IGRYW A P QTLY IP W++ GEN +VI+
Sbjct: 576 NIGRYWYA----------------------------GPQQTLY-IPGVWLNKGENKIVIY 606
Query: 692 EELGGDPSKISLLTKT 707
E+L D KT
Sbjct: 607 EQLNNDRKSSVRTVKT 622
>gi|357014284|ref|ZP_09079283.1| beta-galactosidase [Paenibacillus elgii B69]
Length = 591
Score = 158 bits (400), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 107/313 (34%), Positives = 153/313 (48%), Gaps = 40/313 (12%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DG+ L SG+IHY R PE W + + K K G +ETY+ WN HEP GQ+ F+G
Sbjct: 13 LDGESIRLVSGAIHYFRVVPEYWRDRLLKLKACGFNTVETYIPWNLHEPKPGQFRFDGLA 72
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
D+VRFV+ E GL + +R PY CAEW +GG P WL PG++ R + P+ + + +
Sbjct: 73 DVVRFVEIAGEVGLHVIVRPSPYICAEWEFGGLPAWLLADPGMRVRCMHRPYLDRVDAYY 132
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
+ L + L + GGPII Q+ENEYG +YG Y+ + D +
Sbjct: 133 D--VLLPLLKPLLCTNGGPIIAMQIENEYG----SYG-NDRAYLVYLKDAMLQRGMD--- 182
Query: 194 VMCQQEDAPDP----------IINTCN-------GFYCDGFTPNSPSKPIMWTENYSGWF 236
V+ D P+ ++ T N F + P PIM E ++GWF
Sbjct: 183 VLLFTSDGPEHFMLQGGMIPGVLETVNFGSRAEEAF--EMLRKYQPDGPIMCMEYWNGWF 240
Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG---------PLVA 287
+G R +D+A G + N+YM+ GGTNFG +G P +
Sbjct: 241 DHWGEQHHTRDAKDVADVFDDMLRLGASV-NFYMFHGGTNFGYMSGANCPQRDHYEPTI- 298
Query: 288 TSYDYDAPIDEYG 300
TSYDYD P++E G
Sbjct: 299 TSYDYDVPLNESG 311
>gi|255015104|ref|ZP_05287230.1| beta-glycosidase [Bacteroides sp. 2_1_7]
gi|410104527|ref|ZP_11299440.1| hypothetical protein HMPREF0999_03212 [Parabacteroides sp. D25]
gi|409234336|gb|EKN27166.1| hypothetical protein HMPREF0999_03212 [Parabacteroides sp. D25]
Length = 768
Score = 158 bits (400), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 112/331 (33%), Positives = 162/331 (48%), Gaps = 43/331 (12%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
++GK + SG +HYPR + W +R + GL + TYVFWN HE G++ FEG
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
+L +++ E GL + LR GPY CAEW +GG+P WL IPG++ R N F + K ++
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS-- 190
K+ + + +L S+GGPII+ Q ENE+G+ V + E + ++ A L +
Sbjct: 159 DKLYEQVG--DLQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLADAGF 216
Query: 191 -VPWVMCQQ----EDAPDP-IINTCNG------------FYCDGFTPNSPSKPIMWTENY 232
VP E P + T NG Y G P M E Y
Sbjct: 217 NVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVG------PYMVAEFY 270
Query: 233 SGWFLSFGYAVPFRPVED--LAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA--- 287
GW + +A PF + D +A + + +F N+YM GGTNFG T+G
Sbjct: 271 PGWLMH--WAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHD 327
Query: 288 -----TSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDAPI E G++ PK+ +R +
Sbjct: 328 IQPDLTSYDYDAPISEAGWV-TPKFDSIRNV 357
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 69/259 (26%), Positives = 106/259 (40%), Gaps = 58/259 (22%)
Query: 455 YLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNE 514
Y+ Y+ + P +G+ L I L A ++V+ + V G N F + + I N
Sbjct: 416 YVLYSTHFN-QPLKGR---LEIPGLRDYATIYVDGERV--GELNRCFNQYAMEIDIPFNA 469
Query: 515 GINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYI 573
TLDIL +G NYG G+ S + I NG +W +Y+ +
Sbjct: 470 ---TLDILVENMGRINYGEEIVRNTKGIISSVKI---NGS---EISDWKMYK-------L 513
Query: 574 GLDKISLANSS---FWKQGS--TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWV 628
+D++ S +K GS + + Y+ TF + G +++ GKG ++
Sbjct: 514 PMDRMPALVSDEPYVYKNGSPEVAALGNKPVLYEGTFHLSD-TGDTFIDMEDWGKGIIFI 572
Query: 629 NGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLL 688
NG +IGRYW A P QTLY IP W++ GEN +
Sbjct: 573 NGVNIGRYWYA----------------------------GPQQTLY-IPGVWLNKGENKI 603
Query: 689 VIHEELGGDPSKISLLTKT 707
VI+E+L D KT
Sbjct: 604 VIYEQLNNDRKSSVRTVKT 622
>gi|298376422|ref|ZP_06986377.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_19]
gi|298266300|gb|EFI07958.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_19]
Length = 768
Score = 158 bits (400), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 112/331 (33%), Positives = 162/331 (48%), Gaps = 43/331 (12%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
++GK + SG +HYPR + W +R + GL + TYVFWN HE G++ FEG
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
+L +++ E GL + LR GPY CAEW +GG+P WL IPG++ R N F + K ++
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS-- 190
K+ + + +L S+GGPII+ Q ENE+G+ V + E + ++ A L +
Sbjct: 159 DKLYEQVG--DLQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLADAGF 216
Query: 191 -VPWVMCQQ----EDAPDP-IINTCNG------------FYCDGFTPNSPSKPIMWTENY 232
VP E P + T NG Y G P M E Y
Sbjct: 217 NVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVG------PYMVAEFY 270
Query: 233 SGWFLSFGYAVPFRPVED--LAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA--- 287
GW + +A PF + D +A + + +F N+YM GGTNFG T+G
Sbjct: 271 PGWLMH--WAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHD 327
Query: 288 -----TSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDAPI E G++ PK+ +R +
Sbjct: 328 IQPDLTSYDYDAPISEAGWV-TPKFDSIRNV 357
>gi|357050010|ref|ZP_09111224.1| hypothetical protein HMPREF9478_01207 [Enterococcus saccharolyticus
30_1]
gi|355382493|gb|EHG29591.1| hypothetical protein HMPREF9478_01207 [Enterococcus saccharolyticus
30_1]
Length = 584
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 167/639 (26%), Positives = 257/639 (40%), Gaps = 143/639 (22%)
Query: 23 SGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTV 82
SGSIHY R P W + + K + G +ETYV WN HEP G++ F DL RF++
Sbjct: 21 SGSIHYFRVVPAYWRDRLEKLRLMGCNTVETYVPWNMHEPQEGKFDFSDNLDLRRFIQLA 80
Query: 83 QEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQ 142
QE GL++ LR PY CAEW +GG P WL P ++ R PF E++ R+ ++ +
Sbjct: 81 QEVGLYVILRPAPYICAEWEFGGLPYWLLKDPFMKIRFDYPPFMEKIARYFTQLFSQVS- 139
Query: 143 ENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV-------PWVM 195
+L +Q GPI++ QVENEYG +YG + Y++ +A+ + V PW+
Sbjct: 140 -DLQITQEGPILMMQVENEYG----SYG-NDKSYLRKSAELMRHNGIDVPLFTSDGPWLD 193
Query: 196 CQQ----EDAPDPIINTCNGFYCDGFTP----NSPSKPIMWTENYSGWFLSFGYAVPF-R 246
+ +D P IN C + F + +P+M E + GWF ++G
Sbjct: 194 MLENGSIKDIALPTIN-CGSDIQENFRKLQEFHGKKQPLMVMEFWIGWFDAWGDDKHHTT 252
Query: 247 PVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV------ATSYDYDAPIDEYG 300
V D A + E G N YM+ GGTNFG G TSYDYDA + E+G
Sbjct: 253 SVTDAANELRDCLEAGSV--NIYMFHGGTNFGFMNGANYYEKLLPDVTSYDYDALLSEWG 310
Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
+ PK+ +++ I + +++ T + G
Sbjct: 311 DV-TPKYEAFQQVIGEITEIPSFPLTTKITKRAYG------------------------- 344
Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
+W VS + +F T ISQ ++P ELL ++
Sbjct: 345 ---------------SWKVS----QRVSLFETLASISQPVKHNYPLTM-----ELLDQAT 380
Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
+ +Y ++G S + DY ++ +
Sbjct: 381 GYVYYRSQIGKS-----------------RVIEDYR----------------LIHCQDRA 407
Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
H F+N +L Y K + L E N L IL +G NY +
Sbjct: 408 HT---FINNQLQFIQYDQE----IGQKKTLTLTEESNELGILVENMGRVNYSVQMNHQYK 460
Query: 541 GLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLI 599
G+ +++ NG EW IY + ++ LD++ S W+ G
Sbjct: 461 GIKDGVIV---NGA---FQSEWEIYSLPMD----NLDQVDF--SGHWQTGQP-------S 501
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
+ K +F E + L GKG +NG +IGR+W
Sbjct: 502 FSKVSFQVDECADTF-VELPGWGKGFIVINGHNIGRFWE 539
>gi|257875465|ref|ZP_05655118.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC20]
gi|257809631|gb|EEV38451.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC20]
Length = 585
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 110/309 (35%), Positives = 153/309 (49%), Gaps = 32/309 (10%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+D K + SG+IHY R PE W + + K + G +ETYV WN HE G Y F+G
Sbjct: 12 LDNKPFKVISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFDGIL 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL RF++T QE GL++ LR PY CAEW +GG P WL P ++ R PF E++ R+
Sbjct: 72 DLRRFIQTAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYF 131
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVN------L 187
A + ++ +L +QGGPII+ QVENEYG +Y E K A + +
Sbjct: 132 AHLFPQVR--DLQITQGGPIIMMQVENEYG----SYANDKEYLRKMVAAMRQHGVETPLV 185
Query: 188 NTSVPWVMCQQ----EDAPDPIINTCNGFYCDGFTP----NSPSKPIMWTENYSGWFLSF 239
+ PW + +D P IN C + F + +P+M E + GWF ++
Sbjct: 186 TSDGPWHDMLENGSIKDLALPTIN-CGSNIKENFEKLRKFHGEKRPLMVMEFWIGWFDAW 244
Query: 240 GYAVPF-RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG-------GPLVATSYD 291
G ++D + G N YM+ GGTNFG G P V TSYD
Sbjct: 245 GDDQHHTTSIQDAVKELQDCLALGSV--NIYMFHGGTNFGFMNGSNYYERLAPDV-TSYD 301
Query: 292 YDAPIDEYG 300
YDA + E+G
Sbjct: 302 YDALLTEWG 310
>gi|239986962|ref|ZP_04707626.1| putative beta-galactosidase [Streptomyces roseosporus NRRL 11379]
Length = 606
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 126/398 (31%), Positives = 179/398 (44%), Gaps = 55/398 (13%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+ T D DGK L SG++HY R E W + GL +ETYV WN HEP
Sbjct: 3 DFTVDDDGFRFDGKPVRLLSGALHYFRVHEEQWGHRLAVLAAMGLNCVETYVPWNLHEPR 62
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
G+ G L RF+ V+ AGL+ +R GPY CAEW GG PVW+ G + RT +
Sbjct: 63 EGEVRDVG--ALGRFLDAVERAGLWAIVRPGPYICAEWENGGLPVWVTGRFGRRVRTRDA 120
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
++ ++R+ +++ + + + +GGP+IL Q ENEYG+ +G +Y++W A
Sbjct: 121 EYRAVVERWFRELLPQVVERQVV--RGGPVILVQAENEYGS----FG-SDAVYLEWLAGL 173
Query: 184 AVNLNTSVPWVMCQQEDAPDP----------IINTCN-------GFYCDGFTPNSPSKPI 226
+VP D P+ ++ T N GF + P P+
Sbjct: 174 LRECGVTVPLFTS---DGPEDHMLTGGSVPGLLATANFGSGAREGFAV--LRRHQPKGPL 228
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNF----GRTAG 282
M E + GWF +G R E+ A A+ E G + N YM GGTNF G G
Sbjct: 229 MCMEFWCGWFDHWGAEPVLRDAEEAAGALREILECGASV-NIYMAHGGTNFAGWAGANRG 287
Query: 283 GPL-------VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEY----LISSDPTH 331
GPL TSYDYDAP+DEYG + + H K+ E Y L + P
Sbjct: 288 GPLQDGEFQPTVTSYDYDAPVDEYGRATE-------KFHLFRKVLEGYAQRPLPALPPEP 340
Query: 332 QKLGAKLEAHIYHKSS-NDCAAFLANYDSSSDANVTFN 368
Q L + A + + D L + ++ S TF
Sbjct: 341 QGLAGPVRAELTGWAGLGDVLEALGDPETESGVPPTFE 378
>gi|445497922|ref|ZP_21464777.1| beta-galactosidase BgaC [Janthinobacterium sp. HH01]
gi|444787917|gb|ELX09465.1| beta-galactosidase BgaC [Janthinobacterium sp. HH01]
Length = 624
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 108/324 (33%), Positives = 157/324 (48%), Gaps = 34/324 (10%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DG+ V++SG +HYPR W E +R ++ GL + TY FW+ HEP GQ+ F G+
Sbjct: 42 LDGQPFVIRSGEMHYPRIPRAAWRERLRMARAMGLNTVTTYAFWSQHEPEPGQWSFSGQN 101
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL F+KT E GL + LR GPY CAE ++GGFP WL G++ R+ + + R+
Sbjct: 102 DLRTFIKTAAEEGLNVVLRPGPYVCAEVDFGGFPAWLMRTQGLRVRSMDARYLAASARYF 161
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
++ + +L +S+GGPI++ Q+ENEYG+ YG + Y++ P
Sbjct: 162 KRLA--QEVADLQSSRGGPILMLQLENEYGS----YGRDHD-YLRAVRTQMRQAGFDAPL 214
Query: 194 VMCQQ-----------EDAPDPIINTCNG-----FYCDGFTPNSPSKPIMWTENYSGWFL 237
D P ++N G P P M E ++GWF
Sbjct: 215 FTSDGGAGRLFEGGTLADVP-AVVNFGGGADDAQASVQELAAWRPHGPRMAGEYWAGWFD 273
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA--------TS 289
+G + E+ A V R G +F N YM+ GGT+FG AG TS
Sbjct: 274 HWGEQHHTQSPEEAARTVERMLSQGVSF-NLYMFHGGTSFGWLAGANYSGSEPYQPDTTS 332
Query: 290 YDYDAPIDEYGFIRQPKWGHLREL 313
YDYDA +DE G PK+ LR++
Sbjct: 333 YDYDAALDEAGRP-TPKYFALRDV 355
>gi|402304595|ref|ZP_10823662.1| glycosyl hydrolase family 35 [Prevotella sp. MSX73]
gi|400380871|gb|EJP33679.1| glycosyl hydrolase family 35 [Prevotella sp. MSX73]
Length = 778
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 111/320 (34%), Positives = 157/320 (49%), Gaps = 19/320 (5%)
Query: 6 TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
T + +++GK V+++ +HYPR W I+ K G+ + YVFWN HE G
Sbjct: 22 TVGDKTFLLNGKPFVVKAAELHYPRIPRPYWEHRIKMCKALGMNTVCLYVFWNIHEQQEG 81
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
++ F G D+ F + Q GL++ +R GPY CAEW GG P WL I+ R + F
Sbjct: 82 KFDFTGNNDVAEFCRLAQRNGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREPDPYF 141
Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADT 183
E +K F K+ + + +L GGPII+ QVENEYG+ AY V+ +
Sbjct: 142 MERVKLFERKVGEQLA--SLTIQNGGPIIMVQVENEYGSYGKNKAYVSAIRDIVRRSGFD 199
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCN-GFYCD------GFTPNSPSKPIMWTENYSGWF 236
V L W +++ D ++ T N G D P+ P M +E +SGWF
Sbjct: 200 KVTL-FQCDWASNFEKNGLDDLVWTMNFGTGADIDQQFRRLGELRPNAPQMCSEFWSGWF 258
Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA---TSYD 291
+G RP + + + G +F + YM GGT+FG AG P A TSYD
Sbjct: 259 DKWGARHETRPAKAMVEGIDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYD 317
Query: 292 YDAPIDEYGFIRQPKWGHLR 311
YDAPI+EYG PK+ LR
Sbjct: 318 YDAPINEYGQA-TPKYWELR 336
>gi|421514041|ref|ZP_15960756.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
gi|401672838|gb|EJS79281.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
Length = 611
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 122/353 (34%), Positives = 173/353 (49%), Gaps = 45/353 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DGK L SG+IHY R TP W + + K G IETY+ WN HEP+ G Y FEG
Sbjct: 10 FLVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+V FV QE GL + LR Y CAEW +GG P WL ++ R+T+ F +++
Sbjct: 70 MKDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWL-LKEHVRLRSTDPRFIAKVRT 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + + L K L + GGP+I+ QVENEYG +YG+ E Y++ V
Sbjct: 129 YFSVL--LPKLVPLQVTHGGPVIMMQVENEYG----SYGMEKE-YLRQTKQVMEEFGIDV 181
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGFTP---NSPSK-----------------PIMWTEN 231
P + + A + +++ D F S SK PIM E
Sbjct: 182 P--LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEY 239
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR----TAGGPL-- 285
+ GWF +G + R +DLA V G N YM+ GGTNFG +A G L
Sbjct: 240 WDGWFNRWGEPIIKRDGQDLANEVKDMLALGSL--NLYMFHGGTNFGFYNGCSARGVLDL 297
Query: 286 -VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLGA 336
TSYDYDA + E G + K+ H++ +AIK +C E + ++P + G+
Sbjct: 298 PQVTSYDYDALLTEAGEPTE-KYFHVQ---RAIKEVCPE-VWQAEPRRKTFGS 345
>gi|229545563|ref|ZP_04434288.1| possible beta-galactosidase [Enterococcus faecalis TX1322]
gi|256619317|ref|ZP_05476163.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|256853375|ref|ZP_05558745.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
gi|256964870|ref|ZP_05569041.1| beta-galactosidase [Enterococcus faecalis HIP11704]
gi|257090147|ref|ZP_05584508.1| beta-galactosidase [Enterococcus faecalis CH188]
gi|294614275|ref|ZP_06694194.1| glycosyl hydrolase, family 35 [Enterococcus faecium E1636]
gi|307272958|ref|ZP_07554205.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
gi|307277803|ref|ZP_07558888.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|307291733|ref|ZP_07571605.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
gi|384518848|ref|YP_005706153.1| beta-galactosidase [Enterococcus faecalis 62]
gi|422685728|ref|ZP_16743941.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
gi|422689100|ref|ZP_16747212.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
gi|422720655|ref|ZP_16777264.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|422731066|ref|ZP_16787446.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
gi|422739263|ref|ZP_16794446.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
gi|430849460|ref|ZP_19467237.1| glycosyl hydrolase [Enterococcus faecium E1185]
gi|229309303|gb|EEN75290.1| possible beta-galactosidase [Enterococcus faecalis TX1322]
gi|256598844|gb|EEU18020.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|256711834|gb|EEU26872.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
gi|256955366|gb|EEU71998.1| beta-galactosidase [Enterococcus faecalis HIP11704]
gi|256998959|gb|EEU85479.1| beta-galactosidase [Enterococcus faecalis CH188]
gi|291592934|gb|EFF24524.1| glycosyl hydrolase, family 35 [Enterococcus faecium E1636]
gi|306497185|gb|EFM66730.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
gi|306505543|gb|EFM74728.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|306510572|gb|EFM79595.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
gi|315029440|gb|EFT41372.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
gi|315032046|gb|EFT43978.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|315144925|gb|EFT88941.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
gi|315162898|gb|EFU06915.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
gi|315577862|gb|EFU90053.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
gi|323480981|gb|ADX80420.1| beta-galactosidase [Enterococcus faecalis 62]
gi|430537598|gb|ELA77922.1| glycosyl hydrolase [Enterococcus faecium E1185]
Length = 611
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 122/353 (34%), Positives = 173/353 (49%), Gaps = 45/353 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DGK L SG+IHY R TP W + + K G IETY+ WN HEP+ G Y FEG
Sbjct: 10 FLVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+V FV QE GL + LR Y CAEW +GG P WL ++ R+T+ F +++
Sbjct: 70 MKDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWL-LKEHVRLRSTDPRFIAKVRT 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + + L K L + GGP+I+ QVENEYG +YG+ E Y++ V
Sbjct: 129 YFSVL--LPKLVPLQVTHGGPVIMMQVENEYG----SYGMEKE-YLRQTKQVMEEFGIDV 181
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGFTP---NSPSK-----------------PIMWTEN 231
P + + A + +++ D F S SK PIM E
Sbjct: 182 P--LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEY 239
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR----TAGGPL-- 285
+ GWF +G + R +DLA V G N YM+ GGTNFG +A G L
Sbjct: 240 WDGWFNRWGEPIIKRDGQDLANEVKDMLALGSL--NLYMFHGGTNFGFYNGCSARGVLDL 297
Query: 286 -VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLGA 336
TSYDYDA + E G + K+ H++ +AIK +C E + ++P + G+
Sbjct: 298 PQVTSYDYDALLTEAGEPTE-KYFHVQ---RAIKEVCPE-VWQAEPRRKTFGS 345
>gi|312903586|ref|ZP_07762766.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
gi|310633462|gb|EFQ16745.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
Length = 611
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 122/353 (34%), Positives = 173/353 (49%), Gaps = 45/353 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DGK L SG+IHY R TP W + + K G IETY+ WN HEP+ G Y FEG
Sbjct: 10 FLVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+V FV QE GL + LR Y CAEW +GG P WL ++ R+T+ F +++
Sbjct: 70 MKDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWL-LKEHVRLRSTDPRFIAKVRT 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + + L K L + GGP+I+ QVENEYG +YG+ E Y++ V
Sbjct: 129 YFSVL--LPKLVPLQVTHGGPVIMMQVENEYG----SYGMEKE-YLRQTKQVMEEFGIDV 181
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGFTP---NSPSK-----------------PIMWTEN 231
P + + A + +++ D F S SK PIM E
Sbjct: 182 P--LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEY 239
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR----TAGGPL-- 285
+ GWF +G + R +DLA V G N YM+ GGTNFG +A G L
Sbjct: 240 WDGWFNRWGEPIIKRDGQDLANEVKDMLALGSL--NLYMFHGGTNFGFYNGCSARGVLDL 297
Query: 286 -VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLGA 336
TSYDYDA + E G + K+ H++ +AIK +C E + ++P + G+
Sbjct: 298 PQVTSYDYDALLTEAGEPTE-KYFHVQ---RAIKEVCPE-VWQAEPRRKTFGS 345
>gi|154490061|ref|ZP_02030322.1| hypothetical protein PARMER_00290 [Parabacteroides merdae ATCC
43184]
gi|423723056|ref|ZP_17697209.1| hypothetical protein HMPREF1078_01269 [Parabacteroides merdae
CL09T00C40]
gi|154089210|gb|EDN88254.1| glycosyl hydrolase family 35 [Parabacteroides merdae ATCC 43184]
gi|409241481|gb|EKN34249.1| hypothetical protein HMPREF1078_01269 [Parabacteroides merdae
CL09T00C40]
Length = 780
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 110/317 (34%), Positives = 154/317 (48%), Gaps = 19/317 (5%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
++DGK V+++ IHY R E W I+ K G+ I Y FWN HE G++ F+
Sbjct: 39 TFLLDGKPFVIKAAEIHYTRIPAEYWQHRIQMCKALGMNTICIYAFWNIHEQKPGEFDFK 98
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F + Q+ G+++ LR GPY C+EW GG P WL I+ RT + F E K
Sbjct: 99 GQNDIAAFCRLAQKEGMYIMLRPGPYVCSEWEMGGLPWWLLKKEDIKLRTNDPYFLERTK 158
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYG--NVEWAYGVGGELYVKWAADTAVNLN 188
F+ +I + +L ++GG II+ QVENEYG + AY VK A T V L
Sbjct: 159 LFMNEIGKQLA--DLQVTRGGNIIMVQVENEYGAYATDKAYIANIRDAVKAAGFTDVPL- 215
Query: 189 TSVPWVMCQQEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWTENYSGWFLSFGY 241
W Q + D ++ T N G D P P+M +E +SGWF +G
Sbjct: 216 FQCDWSSTFQLNGLDDLVWTINFGTGANIDAQFKKLKEARPDAPLMCSEFWSGWFDHWGR 275
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDYDAPI 296
R + + + +F + YM GGT FG G + +SYDYDAPI
Sbjct: 276 KHETRDAGVMVSGIKDMLDRHISF-SLYMAHGGTTFGHWGGANSPAYSAMCSSYDYDAPI 334
Query: 297 DEYGFIRQPKWGHLREL 313
E G+ PK+ LREL
Sbjct: 335 SEAGWA-TPKYYKLREL 350
Score = 43.1 bits (100), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 45/96 (46%), Gaps = 30/96 (31%)
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
+Y+TTF E G + L++ + GKG WVNG+++GR+W
Sbjct: 534 YYRTTFELDE-VGDVFLDMQTWGKGMVWVNGKAMGRFWEI-------------------- 572
Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
P QTL+ +P W+ G+N ++I + LG
Sbjct: 573 --------GPQQTLF-MPGCWLKKGKNEIIILDLLG 599
>gi|315606512|ref|ZP_07881527.1| family 35 glycosyl hydrolase [Prevotella buccae ATCC 33574]
gi|315251918|gb|EFU31892.1| family 35 glycosyl hydrolase [Prevotella buccae ATCC 33574]
Length = 787
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 111/320 (34%), Positives = 157/320 (49%), Gaps = 19/320 (5%)
Query: 6 TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
T + +++GK V+++ +HYPR W I+ K G+ + YVFWN HE G
Sbjct: 31 TVGDKTFLLNGKPFVVKAAELHYPRIPRPYWEHRIKMCKALGMNTVCLYVFWNIHEQQEG 90
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
++ F G D+ F + Q GL++ +R GPY CAEW GG P WL I+ R + F
Sbjct: 91 RFDFTGNNDVAEFCRLAQRNGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREPDPYF 150
Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADT 183
E +K F K+ + + +L GGPII+ QVENEYG+ AY V+ +
Sbjct: 151 MERVKLFERKVGEQLA--SLTIQNGGPIIMVQVENEYGSYGENKAYVSAIRDIVRQSGFD 208
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCN-GFYCD------GFTPNSPSKPIMWTENYSGWF 236
V L W +++ D ++ T N G D P+ P M +E +SGWF
Sbjct: 209 KVTL-FQCDWASNFEKNGLDDLVWTMNFGTGADIDQQFRRLGELRPNAPQMCSEFWSGWF 267
Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA---TSYD 291
+G RP + + + G +F + YM GGT+FG AG P A TSYD
Sbjct: 268 DKWGARHETRPAKAMVEGIDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYD 326
Query: 292 YDAPIDEYGFIRQPKWGHLR 311
YDAPI+EYG PK+ LR
Sbjct: 327 YDAPINEYGQA-TPKYWELR 345
>gi|445495533|ref|ZP_21462577.1| beta-galactosidase Bga [Janthinobacterium sp. HH01]
gi|444791694|gb|ELX13241.1| beta-galactosidase Bga [Janthinobacterium sp. HH01]
Length = 586
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 108/306 (35%), Positives = 154/306 (50%), Gaps = 36/306 (11%)
Query: 19 RVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRF 78
RVL SG++HY R PE+W + + K K GL +ETYV WN HEP GQ+ +EG DL F
Sbjct: 23 RVL-SGALHYFRVLPELWEDRLLKLKAMGLNTVETYVAWNLHEPAAGQFRYEGGLDLAAF 81
Query: 79 VKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIID 138
++ + GL++ +R GP+ CAEW +GG P WL P ++ R P+ E ++RF ++
Sbjct: 82 IRLAESLGLYVIVRPGPFICAEWEFGGLPAWLLADPYMEVRCCYQPYLEAVRRFYDDLLP 141
Query: 139 LMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQ 198
+ + +GGPI+ QVENEYG +YG +LY+ W + L+ V ++
Sbjct: 142 RLLPLQI--QRGGPILAMQVENEYG----SYG-SDQLYLTWL--RRLMLDGGVETLLFTS 192
Query: 199 EDAPDPII---NTCNGFYCDGFTPNS-----------PSKPIMWTENYSGWFLSFGYAVP 244
+ A D ++ + F + P P+M E ++GWF +G
Sbjct: 193 DGATDHMLKHGTLAQVWKSANFGSRAEEEFAKLREYQPDGPLMCMEFWNGWFDHWGEPHH 252
Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG----------PLVATSYDYDA 294
R D A A+ R G N YM+ GGTNFG G P V SYDYDA
Sbjct: 253 TRDAADAADALERIMACGAHV-NVYMFHGGTNFGFMNGANTDLLTRDYQPTV-NSYDYDA 310
Query: 295 PIDEYG 300
P+DE G
Sbjct: 311 PLDETG 316
>gi|373460889|ref|ZP_09552639.1| hypothetical protein HMPREF9944_00903 [Prevotella maculosa OT 289]
gi|371954714|gb|EHO72523.1| hypothetical protein HMPREF9944_00903 [Prevotella maculosa OT 289]
Length = 780
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 109/327 (33%), Positives = 159/327 (48%), Gaps = 27/327 (8%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
+ T +++G+ V+++ +HYPR W + I+ K G+ + YVFWN HE
Sbjct: 26 GDFTVGKNTFLLNGRPFVIKAAELHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQ 85
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
GQ+ F G D+ F + + G+++ +R GPY CAEW GG P WL ++ R +
Sbjct: 86 REGQFDFTGNNDVAAFCRLAHKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDVRLREDD 145
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELY------ 176
F +K F A++ + L GGPII+ QVENEYG +YG+ +
Sbjct: 146 PYFMARVKAFEAEVGRQLAP--LTIQNGGPIIMVQVENEYG----SYGINKKYVSEIRDI 199
Query: 177 VKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWT 229
VK + V L W + + D ++ T N G D P P+M +
Sbjct: 200 VKASGFDKVTL-FQCDWASNFEHNGLDDLVWTMNFGTGANIDEQFRRLKQLRPEAPLMCS 258
Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA 287
E +SGWF +G RP +D+ + G +F + YM GGT+FG AG P A
Sbjct: 259 EFWSGWFDKWGARHETRPAKDMVEGIDEMLRKGISF-SLYMTHGGTSFGHWAGANSPGFA 317
Query: 288 ---TSYDYDAPIDEYGFIRQPKWGHLR 311
TSYDYDAPI+EYG + PK+ LR
Sbjct: 318 PDVTSYDYDAPINEYG-MPTPKFFALR 343
Score = 40.4 bits (93), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 42/100 (42%), Gaps = 29/100 (29%)
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
K I Y + + G LNL GKGQ +VNG ++GR+W
Sbjct: 535 KQNIGYYRGYFDLKKTGDTFLNLEQWGKGQVYVNGHALGRFW------------------ 576
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
H G P QTLY +P W+ G N +++ + +G
Sbjct: 577 ---------HIG-PQQTLY-LPGCWLKKGRNEIIVLDVVG 605
>gi|29376389|ref|NP_815543.1| glycosyl hydrolase [Enterococcus faecalis V583]
gi|227519038|ref|ZP_03949087.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|227553661|ref|ZP_03983710.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|256961654|ref|ZP_05565825.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|293383358|ref|ZP_06629271.1| beta-galactosidase [Enterococcus faecalis R712]
gi|293388990|ref|ZP_06633475.1| beta-galactosidase [Enterococcus faecalis S613]
gi|312907816|ref|ZP_07766806.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|312910433|ref|ZP_07769280.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|422714340|ref|ZP_16771066.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|422715597|ref|ZP_16772313.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|424676484|ref|ZP_18113355.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|424681702|ref|ZP_18118489.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|424685588|ref|ZP_18122282.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|424686206|ref|ZP_18122874.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|424690524|ref|ZP_18127059.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|424694932|ref|ZP_18131318.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|424696643|ref|ZP_18132984.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|424700339|ref|ZP_18136532.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|424703758|ref|ZP_18139884.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|424712611|ref|ZP_18144783.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|424718249|ref|ZP_18147501.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|424721894|ref|ZP_18150963.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|424723972|ref|ZP_18152924.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|424733572|ref|ZP_18162127.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|424741709|ref|ZP_18170052.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|424751990|ref|ZP_18179997.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
gi|29343852|gb|AAO81613.1| glycosyl hydrolase, family 35 [Enterococcus faecalis V583]
gi|227073538|gb|EEI11501.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|227177203|gb|EEI58175.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|256952150|gb|EEU68782.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|291079149|gb|EFE16513.1| beta-galactosidase [Enterococcus faecalis R712]
gi|291081771|gb|EFE18734.1| beta-galactosidase [Enterococcus faecalis S613]
gi|310626177|gb|EFQ09460.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|311289706|gb|EFQ68262.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|315575942|gb|EFU88133.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|315580774|gb|EFU92965.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|402350621|gb|EJU85522.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|402356496|gb|EJU91227.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|402358329|gb|EJU93003.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|402364102|gb|EJU98549.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|402367740|gb|EJV02077.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|402369105|gb|EJV03397.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|402374029|gb|EJV08075.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|402377412|gb|EJV11319.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|402379869|gb|EJV13650.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|402382152|gb|EJV15835.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|402384002|gb|EJV17579.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|402390099|gb|EJV23464.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|402391584|gb|EJV24885.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|402396442|gb|EJV29504.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|402401146|gb|EJV33935.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|402404973|gb|EJV37581.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
Length = 611
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 122/353 (34%), Positives = 173/353 (49%), Gaps = 45/353 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DGK L SG+IHY R TP W + + K G IETY+ WN HEP+ G Y FEG
Sbjct: 10 FLVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+V FV QE GL + LR Y CAEW +GG P WL ++ R+T+ F +++
Sbjct: 70 MKDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWL-LKEHVRLRSTDPRFIAKVRT 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + + L K L + GGP+I+ QVENEYG +YG+ E Y++ V
Sbjct: 129 YFSVL--LPKLVPLQVTHGGPVIMMQVENEYG----SYGMEKE-YLRQTKQVMEEFGIDV 181
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGFTP---NSPSK-----------------PIMWTEN 231
P + + A + +++ D F S SK PIM E
Sbjct: 182 P--LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEY 239
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR----TAGGPL-- 285
+ GWF +G + R +DLA V G N YM+ GGTNFG +A G L
Sbjct: 240 WDGWFNRWGEPIIKRDGQDLANEVKDMLALGSL--NLYMFHGGTNFGFYNGCSARGVLDL 297
Query: 286 -VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLGA 336
TSYDYDA + E G + K+ H++ +AIK +C E + ++P + G+
Sbjct: 298 PQVTSYDYDALLTEAGEPTE-KYFHVQ---RAIKEVCPE-VWQAEPRRKTFGS 345
>gi|423346501|ref|ZP_17324189.1| hypothetical protein HMPREF1060_01861 [Parabacteroides merdae
CL03T12C32]
gi|409219652|gb|EKN12612.1| hypothetical protein HMPREF1060_01861 [Parabacteroides merdae
CL03T12C32]
Length = 780
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 110/317 (34%), Positives = 154/317 (48%), Gaps = 19/317 (5%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
++DGK V+++ IHY R E W I+ K G+ I Y FWN HE G++ F+
Sbjct: 39 TFLLDGKPFVIKAAEIHYTRIPAEYWQHRIQMCKALGMNTICIYAFWNIHEQKPGEFDFK 98
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F + Q+ G+++ LR GPY C+EW GG P WL I+ RT + F E K
Sbjct: 99 GQNDIAAFCRLAQKEGMYIMLRPGPYVCSEWEMGGLPWWLLKKEDIKLRTNDPYFLERTK 158
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYG--NVEWAYGVGGELYVKWAADTAVNLN 188
F+ +I + +L ++GG II+ QVENEYG + AY VK A T V L
Sbjct: 159 LFMNEIGKQLA--DLQVTRGGNIIMVQVENEYGAYATDKAYIANIRDAVKAAGFTDVPL- 215
Query: 189 TSVPWVMCQQEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWTENYSGWFLSFGY 241
W Q + D ++ T N G D P P+M +E +SGWF +G
Sbjct: 216 FQCDWSSTFQLNGLDDLVWTINFGTGANIDAQFKKLKEARPDAPLMCSEFWSGWFDHWGR 275
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDYDAPI 296
R + + + +F + YM GGT FG G + +SYDYDAPI
Sbjct: 276 KHETRDAGVMVSGIKDMLDRHISF-SLYMAHGGTTFGHWGGANSPAYSAMCSSYDYDAPI 334
Query: 297 DEYGFIRQPKWGHLREL 313
E G+ PK+ LREL
Sbjct: 335 SEAGWA-TPKYYKLREL 350
Score = 43.1 bits (100), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 45/96 (46%), Gaps = 30/96 (31%)
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
+Y+TTF E G + L++ + GKG WVNG+++GR+W
Sbjct: 534 YYRTTFELDE-VGDVFLDMQTWGKGMVWVNGKAMGRFWEI-------------------- 572
Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
P QTL+ +P W+ G+N ++I + LG
Sbjct: 573 --------GPQQTLF-MPGCWLKKGKNEIIILDLLG 599
>gi|395775444|ref|ZP_10455959.1| glycosyl hydrolase family 42 [Streptomyces acidiscabies 84-104]
Length = 587
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 107/319 (33%), Positives = 154/319 (48%), Gaps = 29/319 (9%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
++G+ + SG++HY R P+ W + +RK++ GL +ETYV WN H+P G +G
Sbjct: 13 LNGEPFRIISGALHYFRVHPDQWADRLRKARLMGLNTVETYVPWNLHQPEPGTLVLDGLL 72
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL RF++ GL + LR GPY CAEW+ GG P WL +Q R+++ F + R+L
Sbjct: 73 DLPRFLRLAHAEGLKVLLRPGPYICAEWDGGGLPHWLMSESDVQLRSSDPKFTAIIDRYL 132
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
++ + A GGP+I QVENEYG AYG E Y+K+ + +
Sbjct: 133 DLLLPPLLPH--MAESGGPVIAVQVENEYG----AYGNDAE-YLKYLVEAFRSRGIEELL 185
Query: 194 VMCQQEDAPDPIINTCNGFYCDG------------FTPNSPSKPIMWTENYSGWFLSFGY 241
C Q + + G G + P P+M E + GWF +G
Sbjct: 186 FTCDQVNPEHQQAGSIPGVLSTGTFGGKIETALATLRAHQPEGPLMCAEFWIGWFDHWGG 245
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG-------GPLVATSYDYDA 294
R D+A + + G + N YM+ GGTNFG T G P + TSYDYDA
Sbjct: 246 PHHTRDTADVAADLDKLLAAGASV-NIYMFHGGTNFGLTNGANHHHTYAPTI-TSYDYDA 303
Query: 295 PIDEYGFIRQPKWGHLREL 313
P+ E G PK+ RE+
Sbjct: 304 PLTENG-DPGPKYHAFREV 321
>gi|414160019|ref|ZP_11416290.1| hypothetical protein HMPREF9310_00664 [Staphylococcus simulans
ACS-120-V-Sch1]
gi|410878669|gb|EKS26539.1| hypothetical protein HMPREF9310_00664 [Staphylococcus simulans
ACS-120-V-Sch1]
Length = 597
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 106/314 (33%), Positives = 156/314 (49%), Gaps = 34/314 (10%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DGK + SG+IHY R PE W + K G +ETYV WN+HE + G++ F G
Sbjct: 10 FMLDGKPLKILSGAIHYFRVLPEDWEHSLYNLKALGFNAVETYVPWNFHETVEGEFDFSG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ RF+ T + GL++ +R PY CAEW +GG P WL P ++ R+ + F E ++R
Sbjct: 70 TKDIKRFIHTAEAIGLYVIIRPSPYICAEWEFGGLPAWLLTKPNLRVRSRDPQFLEYVER 129
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ ++ +++ L GPI++ QVENEYG +YG + Y+ A + +V
Sbjct: 130 YYDRLFEILTP--LQIDHHGPILMMQVENEYG----SYG-EDKTYLSALARMMRDRGVTV 182
Query: 192 P-------WVMCQQED--APDPIINTCN-GFYCDGFTPN--------SPSKPIMWTENYS 233
P W C + A II T N G N + P+M E +
Sbjct: 183 PLFTSDGSWQQCLEAGSLAEADIIPTGNFGSKSQKRLDNLHKFHQQFGKTWPLMSMEFWD 242
Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR----TAGGPL---V 286
GWF +G + R ++L + + G N YM+ GGTNFG +A G +
Sbjct: 243 GWFNRWGDRIITRQSDELIDEIGEVLKRGSI--NLYMFHGGTNFGFWNGCSARGRIDLPQ 300
Query: 287 ATSYDYDAPIDEYG 300
TSYDYDAP+DE G
Sbjct: 301 VTSYDYDAPLDEAG 314
>gi|257865837|ref|ZP_05645490.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC30]
gi|257872172|ref|ZP_05651825.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC10]
gi|257799771|gb|EEV28823.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC30]
gi|257806336|gb|EEV35158.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC10]
Length = 585
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 122/387 (31%), Positives = 180/387 (46%), Gaps = 40/387 (10%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+D K + SG+IHY R PE W + + K + G +ETYV WN HE G Y F+G
Sbjct: 12 LDNKPLKVISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFDGIL 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL RF++T QE GL++ LR PY CAEW +GG P WL P ++ R PF E++ R+
Sbjct: 72 DLRRFIQTAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYF 131
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVN------L 187
A + ++ +L +QGGPII+ QVENEYG +Y E K A + +
Sbjct: 132 AHLFPQVR--DLQITQGGPIIMMQVENEYG----SYANDKEYLRKMVAAMRQHGVETPLV 185
Query: 188 NTSVPWVMCQQ----EDAPDPIINTCNGFYCDGFTP----NSPSKPIMWTENYSGWFLSF 239
+ PW + +D P IN C + F + +P+M E + GWF ++
Sbjct: 186 TSDGPWHDMLENGSIKDLALPTIN-CGSNIKENFEKLRRFHGEKRPLMVMEFWIGWFDAW 244
Query: 240 GYAVPF-RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG-------GPLVATSYD 291
G +D + G N YM+ GGTNFG G P V TSYD
Sbjct: 245 GDDQHHTTSTQDAVKELQDCLALGSV--NIYMFHGGTNFGFMNGSNYYERLAPDV-TSYD 301
Query: 292 YDAPIDEYGFIRQP--KWGHLRELHKAIKLCEEYLISSDPTHQKLGA---KLEAHIYHKS 346
YDA + E+G +P K+ +++ E+ +S + + G K ++
Sbjct: 302 YDALLTEWG---EPTAKYQAFKKVIADYAEIPEFPLSMEIERKAYGTFSVKERVSLFSTI 358
Query: 347 SNDCAAFLANYDSSSDANVTFNGNVYF 373
++NY S +A G +Y+
Sbjct: 359 DTISQPIISNYPLSMEACNQATGYIYY 385
>gi|307275710|ref|ZP_07556850.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
gi|306507586|gb|EFM76716.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
Length = 611
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 122/353 (34%), Positives = 173/353 (49%), Gaps = 45/353 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DGK L SG+IHY R TP W + + K G IETY+ WN HEP+ G Y FEG
Sbjct: 10 FLVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+V FV QE GL + LR Y CAEW +GG P WL ++ R+T+ F +++
Sbjct: 70 MKDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWL-LKEHVRLRSTDPRFIAKVRT 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + + L K L + GGP+I+ QVENEYG +YG+ E Y++ V
Sbjct: 129 YFSVL--LPKLVPLQVTHGGPVIMMQVENEYG----SYGMEKE-YLRQTKQVMEEFGIDV 181
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGFTP---NSPSK-----------------PIMWTEN 231
P + + A + +++ D F S SK PIM E
Sbjct: 182 P--LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEY 239
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR----TAGGPL-- 285
+ GWF +G + R +DLA V G N YM+ GGTNFG +A G L
Sbjct: 240 WDGWFNRWGEPIIKRDGQDLANEVKDMLALGSL--NLYMFHGGTNFGFYNGCSARGVLDL 297
Query: 286 -VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLGA 336
TSYDYDA + E G + K+ H++ +AIK +C E + ++P + G+
Sbjct: 298 PQVTSYDYDALLTEAGEPTE-KYFHVQ---RAIKEVCPE-VWQAEPRRKTFGS 345
>gi|224536014|ref|ZP_03676553.1| hypothetical protein BACCELL_00878 [Bacteroides cellulosilyticus
DSM 14838]
gi|224522370|gb|EEF91475.1| hypothetical protein BACCELL_00878 [Bacteroides cellulosilyticus
DSM 14838]
Length = 1106
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 111/326 (34%), Positives = 155/326 (47%), Gaps = 39/326 (11%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
+++GK V+++ +HYPR W + I+ K G+ + YVFWN HEP G Y F
Sbjct: 356 TFLLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGVYDFT 415
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
+ DL F + Q+ +++ LR GPY CAEW GG P WL ++ R ++ F E +
Sbjct: 416 EQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDVRLRESDPYFIERVA 475
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--------------VEWAYGVGGELY 176
F + +K NL + GGPII+ QVENEYG+ V +G L+
Sbjct: 476 LFEEAVAKQVK--NLTIANGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANFGNDIALF 533
Query: 177 -VKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD----GFTPNSPSKPIMWTEN 231
WA++ +N + W M N G D P+ P+M +E
Sbjct: 534 QCDWASNFTLNGLDDLIWTM-----------NFGTGANVDQQFAKLKQLRPNSPLMCSEF 582
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA-- 287
+SGWF +G RP D+ + G +F + YM GGTN+G AG P A
Sbjct: 583 WSGWFDKWGANHETRPAADMIKGIDDMLSRGISF-SLYMTHGGTNWGHWAGANSPGFAPD 641
Query: 288 -TSYDYDAPIDEYGFIRQPKWGHLRE 312
TSYDYDAPI E G PK+ LRE
Sbjct: 642 VTSYDYDAPISESGQT-TPKYWALRE 666
>gi|153806012|ref|ZP_01958680.1| hypothetical protein BACCAC_00257 [Bacteroides caccae ATCC 43185]
gi|149130689|gb|EDM21895.1| glycosyl hydrolase family 35 [Bacteroides caccae ATCC 43185]
Length = 774
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 112/329 (34%), Positives = 163/329 (49%), Gaps = 32/329 (9%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+ D +DGK L G +HY R E W + +++++ GL I YVFWN+HE
Sbjct: 28 RIKIDGGTFNVDGKDVQLICGEMHYARIPHEYWRDRLKRARAMGLNTISVYVFWNFHERQ 87
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
G++ F G+ D+ FV+ QE GL++ LR GPYACAEW++GG+P WL + +R+ +
Sbjct: 88 PGEFDFSGQADVAEFVRLAQEEGLYVILRPGPYACAEWDFGGYPSWLLKEKDMVYRSKDP 147
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
F E +R++ + + L + GG I++ QVENEYG +Y E Y+ D
Sbjct: 148 RFLEYCERYIKALGKQLAP--LTVNNGGNILMVQVENEYG----SYAADKE-YLAALRDM 200
Query: 184 AVNLNTSVPWVMCQ---QEDAP--DPIINTCNGFYCDG----FTPNSPSKPIMWTENYSG 234
+ +VP C Q +A D + T NG + + P P E Y
Sbjct: 201 IKDAGFNVPLFTCDGGGQVEAGHIDGALPTLNGVFSEDIFKIIDKYHPGGPYFVAEFYPA 260
Query: 235 WFLSFGY---AVPF-RPVEDLAFAVARFFETGGTFQNYYMYFGGTNF-----GRTAGG-P 284
WF +G V + RP E L + + + G + YM+ GGTNF TAGG
Sbjct: 261 WFDVWGQRHSTVDYKRPAEQLDWMLGQ-----GVSVSMYMFHGGTNFWYMNGANTAGGYR 315
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDAP+ E+G PK+ RE+
Sbjct: 316 PQPTSYDYDAPLGEWGNC-YPKYYAFREV 343
>gi|423219555|ref|ZP_17206051.1| hypothetical protein HMPREF1061_02824 [Bacteroides caccae
CL03T12C61]
gi|392624760|gb|EIY18838.1| hypothetical protein HMPREF1061_02824 [Bacteroides caccae
CL03T12C61]
Length = 774
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 112/329 (34%), Positives = 163/329 (49%), Gaps = 32/329 (9%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+ D +DGK L G +HY R E W + +++++ GL I YVFWN+HE
Sbjct: 28 RIKIDGGTFNVDGKDVQLICGEMHYARIPHEYWRDRLKRARAMGLNTISVYVFWNFHERQ 87
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
G++ F G+ D+ FV+ QE GL++ LR GPYACAEW++GG+P WL + +R+ +
Sbjct: 88 PGEFDFSGQADVAEFVRLAQEEGLYVILRPGPYACAEWDFGGYPSWLLKEKDMVYRSKDP 147
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
F E +R++ + + L + GG I++ QVENEYG +Y E Y+ D
Sbjct: 148 RFLEYCERYIKALGKQLAP--LTVNNGGNILMVQVENEYG----SYAADKE-YLAALRDM 200
Query: 184 AVNLNTSVPWVMCQ---QEDAP--DPIINTCNGFYCDG----FTPNSPSKPIMWTENYSG 234
+ +VP C Q +A D + T NG + + P P E Y
Sbjct: 201 IKDAGFNVPLFTCDGGGQVEAGHIDGALPTLNGVFSEDIFKIIDKYHPGGPYFVAEFYPA 260
Query: 235 WFLSFGY---AVPF-RPVEDLAFAVARFFETGGTFQNYYMYFGGTNF-----GRTAGG-P 284
WF +G V + RP E L + + + G + YM+ GGTNF TAGG
Sbjct: 261 WFDVWGQRHSTVDYKRPAEQLDWMLGQ-----GVSVSMYMFHGGTNFWYMNGANTAGGYR 315
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDAP+ E+G PK+ RE+
Sbjct: 316 PQPTSYDYDAPLGEWGNC-YPKYYAFREV 343
>gi|189096261|pdb|3D3A|A Chain A, Crystal Structure Of A Beta-Galactosidase From Bacteroides
Thetaiotaomicron
Length = 612
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 110/321 (34%), Positives = 156/321 (48%), Gaps = 27/321 (8%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
+++G+ V+++ IHYPR E W I+ K G I YVFWN+HEP G+Y F
Sbjct: 14 TFLLNGEPFVVKAAEIHYPRIPKEYWEHRIKXCKALGXNTICLYVFWNFHEPEEGRYDFA 73
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F + QE G ++ +R GPY CAEW GG P WL I+ R + + E +K
Sbjct: 74 GQKDIAAFCRLAQENGXYVIVRPGPYVCAEWEXGGLPWWLLKKKDIKLREQDPYYXERVK 133
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
FL ++ + +L S+GG II QVENEYG A+G+ + Y+ D T
Sbjct: 134 LFLNEVGKQLA--DLQISKGGNIIXVQVENEYG----AFGI-DKPYISEIRDXVKQAGFT 186
Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
VP C + +A D ++ T N G D P P+ +E +SGWF
Sbjct: 187 GVPLFQCDWNSNFENNALDDLLWTINFGTGANIDEQFKRLKELRPDTPLXCSEFWSGWFD 246
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-----VATSYDY 292
+G R E+L + +F Y + GGT+FG G TSYDY
Sbjct: 247 HWGAKHETRSAEELVKGXKEXLDRNISFSLYXTH-GGTSFGHWGGANFPNFSPTCTSYDY 305
Query: 293 DAPIDEYGFIRQPKWGHLREL 313
DAPI+E G + PK+ +R L
Sbjct: 306 DAPINESGKV-TPKYLEVRNL 325
Score = 43.5 bits (101), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 66/234 (28%), Positives = 90/234 (38%), Gaps = 54/234 (23%)
Query: 470 KEVFLNIESLGHAALVFVN-KKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGL 528
KE L I A VF+N KKL + K L EG + LDIL G
Sbjct: 396 KEQTLLITEAHDWAQVFLNGKKLATLSR----LKGEGVVKLPPLKEG-DRLDILVEAXGR 450
Query: 529 QNYG-AWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFW 586
N+G +D G ++L++ K +W +Y + V+ S A +
Sbjct: 451 XNFGKGIYDWKGI----TEKVELQSDKGVELVKDWQVYTIPVD--------YSFARDKQY 498
Query: 587 KQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTG 646
KQ +Y++TF E G LN + KG WVNG +IGRYW
Sbjct: 499 KQQEN--AENQPAYYRSTFNLNE-LGDTFLNXXNWSKGXVWVNGHAIGRYWEI------- 548
Query: 647 CTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSK 700
P QTLY +P W+ GEN ++I + G PSK
Sbjct: 549 ---------------------GPQQTLY-VPGCWLKKGENEIIILDXAG--PSK 578
>gi|71896501|ref|NP_001026163.1| beta-galactosidase precursor [Gallus gallus]
gi|53129216|emb|CAG31369.1| hypothetical protein RCJMB04_5i4 [Gallus gallus]
Length = 385
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 111/328 (33%), Positives = 160/328 (48%), Gaps = 27/328 (8%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+ YD V DG SGSIHY R W + + K K GL I+TYV WNYHEP
Sbjct: 27 IDYDCNCFVKDGHPFRYISGSIHYSRVPRYYWKDRLLKMKMAGLNAIQTYVPWNYHEPQM 86
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G Y F G DL F++ E GL + LR GPY CAEW+ GG P WL I R++++
Sbjct: 87 GVYDFSGDRDLEYFLQLASETGLLVILRAGPYICAEWDMGGLPAWLLEKESIVLRSSDSD 146
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAYGVG--------- 172
+ +++++ ++ MK GGPII+ QVENEYG+ ++ Y
Sbjct: 147 YLTAVEKWMGVLLPKMKPH--LYHNGGPIIMVQVENEYGSYFACDYDYLRSLLKIFRQHL 204
Query: 173 GELYVKWAADTAVNLNT---SVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWT 229
G+ V + D A + ++ + + AP N F + P+ P++ +
Sbjct: 205 GDEVVLFTTDGASQFHLKCGALQGLYATVDFAPGG--NVTAAFLAQ--RSSEPTGPLVNS 260
Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA 287
E Y+GW +G+ P E +A + G N YM+ GGTNF G P ++
Sbjct: 261 EFYTGWLDHWGHRHIVVPSETIAKTLNEILARGANV-NLYMFIGGTNFAYWNGANMPYMS 319
Query: 288 --TSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDAP+ E G + + K+ LRE+
Sbjct: 320 QPTSYDYDAPLSEAGDLTE-KYFALREV 346
>gi|225872977|ref|YP_002754436.1| beta-galactosidase [Acidobacterium capsulatum ATCC 51196]
gi|225792973|gb|ACO33063.1| beta-galactosidase [Acidobacterium capsulatum ATCC 51196]
Length = 619
Score = 157 bits (397), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 107/316 (33%), Positives = 163/316 (51%), Gaps = 40/316 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DGK + SGSIH+ R W + +RK++ GL I YVFWN EP RGQ+ F G
Sbjct: 45 FILDGKPVQIISGSIHFARVPRAEWGDRLRKARAMGLNAISVYVFWNVQEPHRGQWDFSG 104
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++D+ RF++ Q+AGL++ LR GPYACAEW+ GG+P WL ++ R+++ + +
Sbjct: 105 QYDVARFIRMAQQAGLYVILRPGPYACAEWSMGGYPAWLWKDGRVKIRSSDPAYLHAAQD 164
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAY-----------GVGGELYVK 178
++ + +K L + GGPII QVENEYG+ AY G+GG V
Sbjct: 165 YMDHLGQQLKP--LLWTHGGPIIAVQVENEYGSFGKSRAYLEEVRRMVAGAGLGG--VVL 220
Query: 179 WAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLS 238
+ AD + S+P + + P + N + P+S K + E Y GWF
Sbjct: 221 YTADGPGLWSGSLPELPEAIDVGPGGVENGVKQLLA--YRPHS--KLVYVAEYYPGWFDQ 276
Query: 239 FG----YAVPFRP-VEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------ 287
+G + P + ++DL + ++R + N YM+ GGT++G G A
Sbjct: 277 WGQPHHHGAPLKEQLKDLRWILSRGYSV-----NLYMFHGGTDWGFMNGANDNAADTDYA 331
Query: 288 ---TSYDYDAPIDEYG 300
TSYDY AP++E G
Sbjct: 332 PQTTSYDYAAPLNEAG 347
>gi|242004937|ref|XP_002423332.1| beta-galactosidase precursor, putative [Pediculus humanus corporis]
gi|212506351|gb|EEB10594.1| beta-galactosidase precursor, putative [Pediculus humanus corporis]
Length = 596
Score = 157 bits (397), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 121/342 (35%), Positives = 162/342 (47%), Gaps = 52/342 (15%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK SGS HY R + W + +RK K GL + TYV W+ HE + G Y FEG
Sbjct: 1 MDGKPFQYVSGSAHYFRMPNQYWRDRLRKIKAAGLNAVSTYVEWSQHERVPGVYDFEGDL 60
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFI-PGIQFRTTNNPFKEEMKRF 132
D+ RFV+ QE GLF+ LR GPY CAE + GG P WL P IQ R+++ + ++R+
Sbjct: 61 DVKRFVEMAQEEGLFVILRPGPYICAERDMGGLPYWLMTKHPDIQLRSSDFFYTYYVQRW 120
Query: 133 LAKIIDLMKQENLFASQGGPIILAQVENEYGNVE--------WAYGVGGELYVKWAADTA 184
+ K+ L K +L+ +GGPIIL QVENEYG+ W + E +V + A
Sbjct: 121 MDKL--LGKFTDLWYGKGGPIILVQVENEYGSYHSCDYNHTYWLRNLF-EKHVDYNAVLF 177
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCD-GFTPNS-------------PSKPIMWTE 230
S ++ C + G Y F PNS PS P++ +E
Sbjct: 178 TTDGASRNFLKCGK----------IPGVYATVDFGPNSNVSKMFEAQREFEPSGPLVNSE 227
Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA--- 287
Y GW +G R R N+YM++GG+NFG TAG
Sbjct: 228 YYPGWLTHWGEKKHARQDTKDVVKTLREMLNEKANVNFYMFYGGSNFGFTAGANQFGSIY 287
Query: 288 ----TSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYL 324
TSYDYDAPI E G L + + AIK + EEY
Sbjct: 288 QSDITSYDYDAPISE--------AGDLTDKYYAIKNVLEEYF 321
>gi|329962091|ref|ZP_08300102.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
gi|328530739|gb|EGF57597.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
Length = 632
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 109/338 (32%), Positives = 157/338 (46%), Gaps = 49/338 (14%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
V DGK + SG +HY R + W ++ K GL + TYVFWN HEP G++ F G
Sbjct: 34 FVYDGKAIRIISGEMHYARIPHQYWRHRMKMLKAMGLNAVATYVFWNLHEPEPGKWDFSG 93
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
+L +++ E GL + LR GPY CAEW +GG+P WL + G++ R N F + K
Sbjct: 94 DRNLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNVEGMELRRDNEQFLKYTKL 153
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+L ++ + + L +QGGPII+ Q ENE+G+ YV D + + +
Sbjct: 154 YLERLYKEVGK--LQITQGGPIIMVQGENEFGS-----------YVSQRKDITLEEHRAY 200
Query: 192 PWVMCQQ--EDAPDPIINTCNG--FYCDGFTP----------------------NSPSKP 225
+ +Q E D + T +G + G+ P N P
Sbjct: 201 NAKIIKQLKEVGFDVPMFTSDGSWLFEGGYVPGALPTANGENNIENLKKVVNQYNGGQGP 260
Query: 226 IMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL 285
M E Y GW + P +A ++ G +F NYYM GGTNFG T+G
Sbjct: 261 YMVAEFYPGWLAHWCEPHPQVKASTIARQTEKYLANGVSF-NYYMVHGGTNFGFTSGANY 319
Query: 286 VA--------TSYDYDAPIDEYGFIRQPKWGHLRELHK 315
TSYDYDAPI E G++ PK+ +R + K
Sbjct: 320 DKKHDIQPDLTSYDYDAPISEAGWV-TPKFDSIRNVIK 356
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 86/367 (23%), Positives = 130/367 (35%), Gaps = 62/367 (16%)
Query: 335 GAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP----AWSVSILPDCKNVVF 390
G ++ H +N ANYD D Y P W +NV+
Sbjct: 297 GVSFNYYMVHGGTNFGFTSGANYDKKHDIQPDLTSYDYDAPISEAGWVTPKFDSIRNVI- 355
Query: 391 NTAKVISQRNNGDHPFAQQKNVNELL-LASSAFSWYEEKVGISGNRSFVRPDLAEQINTT 449
+ D+P + L+ + S + + I+ + V+ D
Sbjct: 356 --------KRYVDYPLPEAPKAFPLIEIPSIELQQVADLLAITETQEAVQGDKPLTFEEL 407
Query: 450 KDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKK 509
Y+ Y + P GK L IE L A V+V+ + V G N + ++ +
Sbjct: 408 NQGYGYVLYRRHFN-QPISGK---LTIEGLRDYATVYVDGEFV--GRLNRYNKKYSMDIE 461
Query: 510 IELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVE 569
I N L+IL +G NYG+ G+ S + ID + GEW
Sbjct: 462 IPFN---GNLEILVENMGRINYGSEIVHNNKGIISPVKID-----DNFIEGEWEMTKLPM 513
Query: 570 GEYIGLDKI--SLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAW 627
E +K+ + S + L SL YK TF E G L++ GKG +
Sbjct: 514 SEVPAFEKMPANTVTSIMGSSANALVGKPSL--YKGTFTLQE-TGDTFLDMKDWGKGIVF 570
Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
VNG +IGRYW P QTL+ +P W+ G N
Sbjct: 571 VNGINIGRYWQV----------------------------GPQQTLF-VPGVWLKKGINE 601
Query: 688 LVIHEEL 694
+VI ++L
Sbjct: 602 IVIFDQL 608
>gi|345487997|ref|XP_001602984.2| PREDICTED: beta-galactosidase-like [Nasonia vitripennis]
Length = 638
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 109/361 (30%), Positives = 169/361 (46%), Gaps = 66/361 (18%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+ +++ ++DGK SGS HY R+ + W + +RK + GL + TYV W+ H+P
Sbjct: 32 IDFENNQFLLDGKPFRYVSGSFHYFRTPKQYWRDRLRKMRAAGLNALSTYVEWSLHQPEP 91
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRTTNN 123
++ ++G DLV+F++ QE LF+ LR GPY CAE +GGFP W L+ +PGI+ RT +
Sbjct: 92 NKWVWDGDADLVKFLQLAQEEDLFVLLRPGPYICAEREFGGFPYWLLNLVPGIKLRTNDT 151
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
+ E + +L +++ +K L GGPII+ QVENEYG+ + D
Sbjct: 152 RYLEYAEEYLNQVLTRVKP--LLRGNGGPIIMVQVENEYGS-----------FHACDKDY 198
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPN--- 220
L + Q D ++ T +G Y T N
Sbjct: 199 MTKLKN-----IIQNHVGTDALLYTTDGSYRQALRCGPVSGAYATIDFGTSSNVTQNFNL 253
Query: 221 ----SPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFE---TGGTFQNYYMYFG 273
P P++ +E Y GW + PF VE F + + + + G N YM++G
Sbjct: 254 MREFEPKGPLVNSEFYPGWLSH--WEEPFERVE--TFKITKMLDEMLSLGASVNMYMFYG 309
Query: 274 GTNFGRTAGGPLV------ATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISS 327
GTNF ++G + TSYDYDAP+ E G + + H+ K+ +YL
Sbjct: 310 GTNFAFSSGANIFDNYTPDLTSYDYDAPLSEAGDLTA-------KYHEIKKIISKYLPIP 362
Query: 328 D 328
D
Sbjct: 363 D 363
>gi|348529664|ref|XP_003452333.1| PREDICTED: beta-galactosidase-like [Oreochromis niloticus]
Length = 651
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 112/329 (34%), Positives = 160/329 (48%), Gaps = 19/329 (5%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
S V Y + DG++ SGSIHY R W + + K GL I+TYV WNYHE
Sbjct: 25 SFTVDYQNDCFRKDGEKFQYISGSIHYNRIPRVYWKDRLLKMYMAGLNAIQTYVPWNYHE 84
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
+ G Y F G DL F+K Q+ GL + LR GPY CAEW+ GG P WL I R+T
Sbjct: 85 EVPGLYNFSGDRDLEHFLKLAQDVGLLVILRPGPYICAEWDMGGLPAWLLKKKDIVLRST 144
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAYGVG-GELYV 177
+ + + +++ K++ ++K GGPII QVENEYG+ ++ Y +L+
Sbjct: 145 DPDYIAAVDKWMGKLLPMIKP--YLYQNGGPIITVQVENEYGSYFACDYNYMRHLSKLFR 202
Query: 178 KWAADTAVNLNTS---VPWVMCQQ-EDAPDPIINTCNGFYCDGFTPN---SPSKPIMWTE 230
+ D V T + ++ C +D + F P P P++ +E
Sbjct: 203 SYLGDEVVLFTTDGAGLGYLKCGSIQDLYATVDFGPGANVTAAFEPQRQVQPHGPLVNSE 262
Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG--RTAGGPLVA- 287
Y+GW +G +A A++ G N YM+ GGTNFG A P A
Sbjct: 263 FYTGWLDHWGSRHSVVSPTQVAKALSEMLLMGANV-NLYMFIGGTNFGYWNGANTPYAAQ 321
Query: 288 -TSYDYDAPIDEYGFIRQPKWGHLRELHK 315
TSYDYDAP+ E G + + K+ +RE+ K
Sbjct: 322 PTSYDYDAPLTEAGDLTE-KYFAIREVIK 349
>gi|323449959|gb|EGB05843.1| hypothetical protein AURANDRAFT_66064 [Aureococcus anophagefferens]
Length = 1630
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 122/390 (31%), Positives = 178/390 (45%), Gaps = 49/390 (12%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
++ D R+L+++G R +L SGSIHYPRSTP +WP+L +++ GL IE+Y FWN H
Sbjct: 1037 SIARDGRSLLVNGSRVLLLSGSIHYPRSTPAMWPKLFAEARANGLNAIESYAFWNKHSAT 1096
Query: 64 R---GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFP------------V 108
R Y F G DL F+ E LF+ R GPY CAEW GG P
Sbjct: 1097 RYGAYDYGFNGDVDL--FLSLAAEHDLFVLWRFGPYVCAEWPAGGIPARAPRRAVFASNA 1154
Query: 109 WLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWA 168
W+H +PG++ RT N + E R++ ++ E + G ++ENEYG +
Sbjct: 1155 WIHDVPGMKTRTNNTAWLNETGRWMRDHFAVI--EPHLSRNGAS---NRIENEYGGSKSD 1209
Query: 169 YGVGGELYVKWAADTAVNLNTSVPWVMCQ--QEDAPDPIINTCNGFYCDG-------FTP 219
YV A + + W+MC APD ++T NG D P
Sbjct: 1210 AAA--VAYVDALDALADAVAPELVWMMCGFVSLVAPD-ALHTGNGCPHDQGPASAHVVVP 1266
Query: 220 NSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR 279
+P W W+ ++G RP D+A+ VA + TGG N+YM+ GG ++G
Sbjct: 1267 PAPGADPAWYTEDELWYDAWGLPSLARPPADVAYGVASYVATGGAMHNFYMWHGGNHYGN 1326
Query: 280 --TA----GG------PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISS 327
TA GG P Y AP+ G +P + HL +H + E L+ +
Sbjct: 1327 WSTATPDLGGASSPEPPASQVRYANAAPLRSDGSRHEPLFSHLAAVHGTLDAYAEVLLGA 1386
Query: 328 DPTHQKLGAKLEA--HIYH-KSSNDCAAFL 354
P + + A H Y K +ND A+ +
Sbjct: 1387 TPEALATPSCVAACPHAYFLKFANDTASVV 1416
>gi|355690250|gb|AER99094.1| galactosidase, beta 1 [Mustela putorius furo]
Length = 648
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 112/343 (32%), Positives = 162/343 (47%), Gaps = 31/343 (9%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+ Y H + DG+ SGSIHY R W + + K K GL I+TYV WN+HEP
Sbjct: 22 KIDYHHNRFLKDGQPFRYISGSIHYSRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQ 81
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
GQY F G D+ F+K E GL + LR GPY CAEW+ GG P WL I R+++
Sbjct: 82 PGQYKFSGEQDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDP 141
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
+ + ++L ++ MK L GGPII QVENEYG +Y Y+++
Sbjct: 142 DYLAAVDKWLGVLLPRMKP--LLYQNGGPIITVQVENEYG----SYFTCDYDYLRFLQKL 195
Query: 184 -AVNLNTSVPWVMCQQEDAPDPIIN--TCNGFYCD-GFTPNS-------------PSKPI 226
+L V ++ + A +P + G Y F P + P P+
Sbjct: 196 FHYHLGKDV--LLFTTDGALEPFLQCGALQGLYATVDFGPGANITAAFEVQRKSEPKGPL 253
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL- 285
+ +E Y+GW +G E +A ++ G N YM+ GGTNF G +
Sbjct: 254 VNSEFYTGWLDHWGQPHSTVKTEVVASSLHDILARGANV-NLYMFIGGTNFAYWNGANMP 312
Query: 286 ---VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLI 325
TSYDYDAP+ E G + + K+ LR++ + + E +I
Sbjct: 313 YKAQPTSYDYDAPLSEAGDLTE-KYFALRDVIRKFEKVPEGVI 354
>gi|307188518|gb|EFN73255.1| Beta-galactosidase [Camponotus floridanus]
Length = 624
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 107/323 (33%), Positives = 153/323 (47%), Gaps = 32/323 (9%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
S V Y++ ++DGK SGS HY R+ + W + +RK + GL + TYV W+ HE
Sbjct: 31 SFGVDYENNQFLLDGKPFRYVSGSFHYFRAPRQYWRDRLRKMRAAGLNAVSTYVEWSLHE 90
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWL-HFIPGIQFRT 120
P GQ+ + G DL+ F+ QE LF+ LR GPY CAE + GG P WL P I+ RT
Sbjct: 91 PEPGQFNWAGDADLIEFLNIAQEEDLFVLLRPGPYICAERDLGGLPYWLLREAPDIKLRT 150
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYG-----NVEWA------- 168
+ F + +L ++++ +K L GGPII+ Q+ENEYG + E+
Sbjct: 151 KDAAFMKYATAYLNQVLEKVKP--LLRGNGGPIIMVQIENEYGSYNACDTEYTDMLKEII 208
Query: 169 ---YGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKP 225
G LY A ++ VP + +N N F P P
Sbjct: 209 VGKVGSKALLYTTDGASASLLRCGFVPGAYATIDFGTS--VNVTNSF--QSMRLYQPRGP 264
Query: 226 IMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-- 283
++ +E Y GW +G E + + G + N YM++GGTNFG T+G
Sbjct: 265 LVNSEFYPGWLTHWGETFQRVKTEAVTKTLREMLALGASV-NIYMFYGGTNFGFTSGANG 323
Query: 284 ------PLVATSYDYDAPIDEYG 300
P + TSYDYDAP+ E G
Sbjct: 324 GVGAYSPQI-TSYDYDAPLTEAG 345
>gi|336063700|ref|YP_004558559.1| beta-galactosidase [Streptococcus pasteurianus ATCC 43144]
gi|334281900|dbj|BAK29473.1| beta-galactosidase precursor [Streptococcus pasteurianus ATCC
43144]
Length = 595
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 112/320 (35%), Positives = 152/320 (47%), Gaps = 43/320 (13%)
Query: 10 RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
+ +DGK + SGSIHY R P+ W + + K G +ETYV WN HEP G++ F
Sbjct: 8 ESFFLDGKPFKILSGSIHYFRIHPDDWYQSLYNLKALGFNTVETYVPWNLHEPREGEFDF 67
Query: 70 EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
G DL RF+ QE GL+ +R PY CAEW +GG P WL G++ R+ + F + +
Sbjct: 68 TGILDLERFLTIAQELGLYAIVRPSPYICAEWEFGGLPAWL-LEKGVRVRSQDKDFLQVV 126
Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
KR+ +I + + L QGG I++ QVENEYG +YG ++Y++ + L
Sbjct: 127 KRYYEALIPRLIKHQL--DQGGNILMFQVENEYG----SYG-EDKVYLRELKQMMLELGL 179
Query: 190 SVPWVMCQQEDAP------------DPIINTCNGFYCDG----------FTPNSPSKPIM 227
P+ D P D ++ T N F F P+M
Sbjct: 180 EEPFFTS---DGPWHTALRAGSLIEDDVLVTGN-FGSKAKENFASMEMFFQQYGKKWPLM 235
Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RT 280
E + GWF +G V R E+LA AV E G N YM+ GGTNFG R
Sbjct: 236 CMEFWDGWFNRWGEPVIKRDPEELADAVMEAIEIGSI--NLYMFHGGTNFGFMNGCSARK 293
Query: 281 AGGPLVATSYDYDAPIDEYG 300
TSYDYDA +DE G
Sbjct: 294 QTDLPQVTSYDYDAILDEAG 313
>gi|420262409|ref|ZP_14765050.1| beta-galactosidase [Enterococcus sp. C1]
gi|394770166|gb|EJF49970.1| beta-galactosidase [Enterococcus sp. C1]
Length = 585
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 112/313 (35%), Positives = 155/313 (49%), Gaps = 40/313 (12%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+D K + SG+IHY R PE W + + K + G +ETYV WN HE G Y FEG
Sbjct: 12 LDNKPFKVISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFEGIL 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL RF++T QE GL++ LR PY CAEW +GG P WL P ++ R PF E++ R+
Sbjct: 72 DLRRFIQTAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYF 131
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVN------L 187
A + ++ +L +QGGPI++ QVENEYG +Y E K A +
Sbjct: 132 AHLFPQVR--DLQITQGGPILMMQVENEYG----SYANDKEYLRKMVAAMRQQGVETPLV 185
Query: 188 NTSVPWVMCQQ----EDAPDPIINTCNGFYCDGFTP----NSPSKPIMWTENYSGWFLSF 239
+ PW + +D P IN C + F + +P+M E + GWF ++
Sbjct: 186 TSDGPWHDMLENGSIKDLALPTIN-CGSNIKENFEKLRRFHGEKRPLMVMEFWIGWFDAW 244
Query: 240 G-----YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG-------GPLVA 287
G V++L +A G+ N YM+ GGTNFG G P V
Sbjct: 245 GDDHHHTTSTADAVKELQDCLAE-----GSV-NIYMFHGGTNFGFMNGSNYYERLAPDV- 297
Query: 288 TSYDYDAPIDEYG 300
TSYDYDA + E+G
Sbjct: 298 TSYDYDALLTEWG 310
>gi|350418578|ref|XP_003491903.1| PREDICTED: beta-galactosidase-like [Bombus impatiens]
Length = 646
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 114/359 (31%), Positives = 174/359 (48%), Gaps = 45/359 (12%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
S V Y++ ++DGK SGS HY R+ + W + ++K + GL + TYV WN H
Sbjct: 30 FSFEVDYENNQFLLDGKPFRYISGSFHYFRTPRQYWRDRLKKMRAAGLNAVSTYVEWNLH 89
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFR 119
+P ++++ G D+V F+ QE GLF+ LR GPY CAE ++GG P W L +P I R
Sbjct: 90 QPTENEWHWTGDADVVEFINIAQEEGLFVLLRPGPYICAERDFGGLPYWLLGRVPDINLR 149
Query: 120 TTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKW 179
T + + + ++ ++ +++D K + GGPII+ QVENEYG +Y E ++
Sbjct: 150 TNDPRYMKYVEIYINEVLD--KVQPYLRGNGGPIIMVQVENEYG----SYACDTEYLIRL 203
Query: 180 AADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD-----GFTPNS------------- 221
+ T + D +P + C GF + F N+
Sbjct: 204 RDIMRQKIGTK---ALLYSTDGSNPNMLRC-GFVPEVYATVDFGTNTNVTKNFEIMRMYQ 259
Query: 222 PSKPIMWTENYSGWFLSFGYAVPFRPVE--DLAFAVARFFETGGTFQNYYMYFGGTNFGR 279
P P++ +E Y GW + PF+ V+ + + G + N YM++GGTNFG
Sbjct: 260 PRGPLVNSEFYPGWLSH--WREPFQRVQTATVTKTLDEMLSLGASV-NIYMFYGGTNFGY 316
Query: 280 TAGG--------PLVATSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIKLCEEYLISSDP 329
TAG P + TSYDYDAP+ E G PK+ +R + K + L L S P
Sbjct: 317 TAGANGGHNAYNPQL-TSYDYDAPLTEAG-DPTPKYFAIRNVISKYLPLPNVPLPSPSP 373
>gi|189463987|ref|ZP_03012772.1| hypothetical protein BACINT_00322 [Bacteroides intestinalis DSM
17393]
gi|189438560|gb|EDV07545.1| glycosyl hydrolase family 35 [Bacteroides intestinalis DSM 17393]
Length = 1106
Score = 156 bits (394), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 110/326 (33%), Positives = 155/326 (47%), Gaps = 39/326 (11%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
+++GK V+++ +HYPR W + I+ K G+ + YVFWN HEP G Y F
Sbjct: 356 TFLLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGVYDFT 415
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
+ DL F + Q+ +++ LR GPY CAEW GG P WL ++ R ++ F E +
Sbjct: 416 EQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDVRLRESDPYFIERVA 475
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--------------VEWAYGVGGELY 176
F + +K +L + GGPII+ QVENEYG+ V +G L+
Sbjct: 476 LFEEAVAKQVK--DLTIANGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANFGNDIALF 533
Query: 177 -VKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD----GFTPNSPSKPIMWTEN 231
WA++ +N + W M N G D P+ P+M +E
Sbjct: 534 QCDWASNFTLNGLDDLIWTM-----------NFGTGANVDQQFAKLKQLRPNSPLMCSEF 582
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA-- 287
+SGWF +G RP D+ + G +F + YM GGTN+G AG P A
Sbjct: 583 WSGWFDKWGANHETRPAADMIKGIDDMLSRGISF-SLYMTHGGTNWGHWAGANSPGFAPD 641
Query: 288 -TSYDYDAPIDEYGFIRQPKWGHLRE 312
TSYDYDAPI E G PK+ LRE
Sbjct: 642 VTSYDYDAPISESGQT-TPKYWALRE 666
>gi|288926246|ref|ZP_06420171.1| beta-galactosidase (Lactase) [Prevotella buccae D17]
gi|288336937|gb|EFC75298.1| beta-galactosidase (Lactase) [Prevotella buccae D17]
Length = 791
Score = 156 bits (394), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 110/320 (34%), Positives = 156/320 (48%), Gaps = 19/320 (5%)
Query: 6 TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
T + +++GK V+++ +HYPR W I+ K G+ + YVFWN HE G
Sbjct: 35 TVGDKTFLLNGKPFVVKAAELHYPRIPRPYWEHRIKMCKALGMNTVCLYVFWNIHEQQEG 94
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
++ F D+ F + Q GL++ +R GPY CAEW GG P WL I+ R + F
Sbjct: 95 KFDFTDNNDVAEFCRLAQRNGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREPDPYF 154
Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADT 183
E +K F K+ + + +L GGPII+ QVENEYG+ AY V+ +
Sbjct: 155 MERVKLFERKVGEQLA--SLTIQNGGPIIMVQVENEYGSYGENKAYVSAIRDIVRQSGFD 212
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCN-GFYCD------GFTPNSPSKPIMWTENYSGWF 236
V L W +++ D ++ T N G D P+ P M +E +SGWF
Sbjct: 213 KVTL-FQCDWASNFEKNGLDDLVWTMNFGTGADIDQQFRRLGELRPNAPQMCSEFWSGWF 271
Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA---TSYD 291
+G RP + + + G +F + YM GGT+FG AG P A TSYD
Sbjct: 272 DKWGARHETRPAKTMVEGIDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYD 330
Query: 292 YDAPIDEYGFIRQPKWGHLR 311
YDAPI+EYG PK+ LR
Sbjct: 331 YDAPINEYGQA-TPKYWELR 349
>gi|320106923|ref|YP_004182513.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
gi|319925444|gb|ADV82519.1| glycoside hydrolase family 35 [Terriglobus saanensis SP1PR4]
Length = 633
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 115/332 (34%), Positives = 162/332 (48%), Gaps = 34/332 (10%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
V DH L +G+ L SG +HY R E W ++ +K GL + TY+FWN HEP
Sbjct: 43 RVAGDHFEL--NGEPVQLLSGEMHYARIPREYWRARLQMAKAMGLNTVATYIFWNVHEPK 100
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIP--GIQFRTT 121
G Y F G D+ FVK QE GL + LR GPYACAEW +GG+P WL P G R+
Sbjct: 101 PGVYDFSGNHDVAAFVKMAQEEGLNVILRAGPYACAEWEFGGYPSWLMKDPKMGSALRSN 160
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
+ + ++R++ ++ M L S GGPI+ QVENEYG+ +G G + Y+
Sbjct: 161 DEVYMAPVERWIKRLGQEMVP--LLISNGGPIVAVQVENEYGD----FG-GDKKYLAHML 213
Query: 182 DTAVN----------LNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNS---PSKPIMW 228
+ N ++ S V E P +N G G T + P +P+
Sbjct: 214 EIFQNAGFKDSFLYTVDPSKALVNGSLEGLPSG-VNFGVGNAERGLTALAHLRPGQPLFA 272
Query: 229 TENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA- 287
+E + GWF +G+ RP+ +A + + N YM+ GGT+FG +G
Sbjct: 273 SEYWPGWFDHWGHPHETRPIPPQLKDIAYTLDHKSSI-NIYMFHGGTSFGFMSGASWTGG 331
Query: 288 ------TSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDAP+DE G PK+ R+L
Sbjct: 332 EYLPDVTSYDYDAPLDEAGH-PTPKFYAYRDL 362
>gi|325569852|ref|ZP_08145846.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
gi|325156975|gb|EGC69143.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
Length = 585
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 112/313 (35%), Positives = 155/313 (49%), Gaps = 40/313 (12%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+D K + SG+IHY R PE W + + K + G +ETYV WN HE G Y FEG
Sbjct: 12 LDKKPFKVISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFEGIL 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL RF++T QE GL++ LR PY CAEW +GG P WL P ++ R PF E++ R+
Sbjct: 72 DLRRFIQTAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYF 131
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVN------L 187
A + ++ +L +QGGPI++ QVENEYG +Y E K A +
Sbjct: 132 AHLFPQVR--DLQITQGGPILMMQVENEYG----SYANDKEYLRKMVAAMRQQGVETPLV 185
Query: 188 NTSVPWVMCQQ----EDAPDPIINTCNGFYCDGFTP----NSPSKPIMWTENYSGWFLSF 239
+ PW + +D P IN C + F + +P+M E + GWF ++
Sbjct: 186 TSDGPWHDMLENGTIKDLALPTIN-CGSNIKENFEKLRRFHGEKRPLMVMEFWIGWFDAW 244
Query: 240 G-----YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG-------GPLVA 287
G V++L +A G+ N YM+ GGTNFG G P V
Sbjct: 245 GDDHHHTTSTADAVKELQDCLAE-----GSV-NIYMFHGGTNFGFMNGSNYYERLAPDV- 297
Query: 288 TSYDYDAPIDEYG 300
TSYDYDA + E+G
Sbjct: 298 TSYDYDALLTEWG 310
>gi|334348881|ref|XP_001378605.2| PREDICTED: beta-galactosidase-like [Monodelphis domestica]
Length = 658
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 118/348 (33%), Positives = 164/348 (47%), Gaps = 25/348 (7%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+ Y+ + DGK SGSIHY R W + + K K GL I+TYV WN+HEP+
Sbjct: 50 IDYERDQFLKDGKPFRYISGSIHYSRIPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPLP 109
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G Y F +DL F++ E GL + LR GPY CAEW+ GG P WL I R+++
Sbjct: 110 GVYRFSDDYDLEYFLQLAHEIGLLVILRPGPYICAEWDMGGLPAWLLTKKSIVLRSSDPD 169
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAY-GVGGELYVKWA 180
+ E +++L ++ MK GGPII QVENEYG+ ++ Y +L+ K
Sbjct: 170 YLAETEKWLGVLLPKMKP--YLYQNGGPIITVQVENEYGSYFTCDYNYLRFLQQLFHKHL 227
Query: 181 ADTAVNLNT---SVPWVMCQQEDAPDPII------NTCNGFYCDGFTPNSPSKPIMWTEN 231
+ V T S ++ C + N F T P P++ +E
Sbjct: 228 GEEVVLFTTDGASEDYLKCGTLQGLYATVDFGTNHNITEAFQSQRKT--EPKGPLVNSEF 285
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA-- 287
Y+GW +G A + + ++ G N YM+ GGTNFG G P A
Sbjct: 286 YTGWLDHWGEAHETVDTKAIISSLNDMLSQGANV-NMYMFIGGTNFGFWNGANIPYAAQP 344
Query: 288 TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLG 335
TSYDYDAP+ E G + + K+ LREL + E LI PT K
Sbjct: 345 TSYDYDAPLSEAGDLTE-KYFALRELIGKFEKLPEGLIP--PTTPKFA 389
>gi|163790001|ref|ZP_02184436.1| glycosyl hydrolase, family 35 [Carnobacterium sp. AT7]
gi|159874701|gb|EDP68770.1| glycosyl hydrolase, family 35 [Carnobacterium sp. AT7]
Length = 595
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 104/316 (32%), Positives = 152/316 (48%), Gaps = 34/316 (10%)
Query: 10 RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
+++G+ + SG+IHY R PE W + K G +ETY+ WN HE +Y F
Sbjct: 8 EEFLLNGEPFKIISGAIHYFRILPEDWYHSLYNLKALGFNTVETYIPWNVHETKEREYDF 67
Query: 70 EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
G+ D+ RFV+T +E GLF+ LR PY CAEW +GG P WL ++ R+++ F E++
Sbjct: 68 SGQLDIQRFVQTAKELGLFVILRPSPYICAEWEFGGLPAWLLTYKNMRIRSSDPQFIEKV 127
Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
+ K+ + + L + GGP+I+ Q+ENEYG +YG E Y+K + + L
Sbjct: 128 SSYYKKLFEQIVP--LQVTSGGPVIMMQLENEYG----SYGEDKE-YLKTLYELMLELGV 180
Query: 190 SVP-------WVMCQQEDAPDPIINTCNGFYCDGFTPN---------SPSK--PIMWTEN 231
+VP W Q+ + G + N S K P+M E
Sbjct: 181 TVPIFTSDGAWKATQEAGTMTDLDILTTGNFGSQSKENFKNLKEFHESKGKNWPLMCMEY 240
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
+ GWF + + R +DL V + G N YM+ GGTNFG R
Sbjct: 241 WGGWFNRWNDPIIKRDAQDLTNDVKEALKIGSL--NLYMFHGGTNFGFMNGCSARLGKDL 298
Query: 285 LVATSYDYDAPIDEYG 300
TSYDYDAP++E G
Sbjct: 299 PQLTSYDYDAPLNEQG 314
>gi|449664450|ref|XP_002165261.2| PREDICTED: beta-galactosidase-like [Hydra magnipapillata]
Length = 589
Score = 156 bits (394), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 98/317 (30%), Positives = 158/317 (49%), Gaps = 25/317 (7%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+ Y++ + DG SGSIHY R + W + + K ++ GL I+TY+ WN+HEP
Sbjct: 25 IDYENNKFLKDGTEFRYISGSIHYMRVPEDYWEDRLSKIRKAGLNAIQTYIPWNFHEPTE 84
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPG---IQFRTT 121
G + F G+ ++ +F+K Q+ L + LR GPY CAEW +GGFP WL G +Q RT+
Sbjct: 85 GNFQFGGQQNVFKFLKLAQKYDLLVILRPGPYICAEWEFGGFPYWLLKKVGNKTMQLRTS 144
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV----EWAYGVGGELYV 177
+N + ++++ +++ ++ ++ GGPII QVENEYG+ E+ Y + ++
Sbjct: 145 DNLYLQKVENYMSVLLSGLRP--YLYENGGPIITVQVENEYGSYGCDHEYMYKLES-IFR 201
Query: 178 KWAADTAVNLNTSVPWVMCQQEDAPDPIINTCN-------GFYCDGFTPNSPSKPIMWTE 230
K+ + + T + P+ T + Y D P P++ +E
Sbjct: 202 KYLGENVILFTTDGAGDSYLKCGTIKPLFATVDFGPTAEPKLYFDIQRKYQPLGPLVNSE 261
Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA--- 287
Y+GW +G +ED+ + + + N YM+ GGTNFG G +
Sbjct: 262 FYTGWLDHWGGQHAHTSLEDVTDTLDKMLSLNASV-NMYMFEGGTNFGFMNGANQDSNSL 320
Query: 288 ----TSYDYDAPIDEYG 300
TSYDYDAP+ E G
Sbjct: 321 QPQPTSYDYDAPLSEAG 337
>gi|330997880|ref|ZP_08321714.1| putative beta-galactosidase [Paraprevotella xylaniphila YIT 11841]
gi|329569484|gb|EGG51254.1| putative beta-galactosidase [Paraprevotella xylaniphila YIT 11841]
Length = 786
Score = 156 bits (394), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 106/321 (33%), Positives = 162/321 (50%), Gaps = 23/321 (7%)
Query: 9 HRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYY 68
++ +++GK ++++ +HYPR W + I+ K G+ + YVFWN HE G++
Sbjct: 40 NKTFLLNGKPFIIKAAEVHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQEEGKFD 99
Query: 69 FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEE 128
F G D+ F++ QE GL++ +R GPY CAEW GG P WL I+ R + F E
Sbjct: 100 FTGNNDVAEFIRLAQENGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREQDPYFMER 159
Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVN 186
+ F K+ + + +L +GGPII+ QVENEYG+ + Y G ++ + V
Sbjct: 160 YRIFAKKLGEQIG--DLTIEKGGPIIMVQVENEYGSYGEDKPYVSGIRDIIRDSGFDKVT 217
Query: 187 LNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNS--------PSKPIMWTENYSGWFLS 238
L W ++ D ++ T N F N P P M +E +SGWF
Sbjct: 218 L-FQCDWSSNFTKNGLDDLVWTMN-FGTGANIENEFKKLGELRPESPQMCSEFWSGWFDK 275
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVATSYDY 292
+G R +++ + + G +F + YM GGT++G AG P V TSYDY
Sbjct: 276 WGGRHETRGSKEMVGGLKEMLDKGISF-SLYMTHGGTSWGHWAGANSPGFSPDV-TSYDY 333
Query: 293 DAPIDEYGFIRQPKWGHLREL 313
DAPI+E G + PK+ LRE+
Sbjct: 334 DAPINEAGQV-TPKYMELREM 353
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 66/253 (26%), Positives = 98/253 (38%), Gaps = 66/253 (26%)
Query: 466 PGQGKEVFLNIESLGHAALVFVNKKLV-AFGYGNHDFANFLINKKIELNEGINTLDILSM 524
P + L I A VF+N KL+ + NH+ L K EG + LDIL
Sbjct: 418 PAVPTQSILTITDAHDFAQVFINGKLIGSIDRRNHEKTMLLPAMK----EG-DQLDILVE 472
Query: 525 MVGLQNYG-AWFDVAG----------AGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYI 573
+G N+G A D G S + ++LKN + S YQV + +Y+
Sbjct: 473 AMGRINFGRAIKDFKGITEKVELSYTMNTGSQVTVNLKNWQIYTLSDS--YQVQKDMKYV 530
Query: 574 GLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSI 633
L + Y+ TF + G LNL + GKGQ +VNG +I
Sbjct: 531 PLKDQKVPGC-----------------YRATFNLKK-TGDTFLNLETWGKGQVYVNGHAI 572
Query: 634 GRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEE 693
GR+W P QTLY +P W+ GEN +++ +
Sbjct: 573 GRFWKI----------------------------GPQQTLY-MPGCWLKKGENEIIVQDI 603
Query: 694 LGGDPSKISLLTK 706
+G + + L+K
Sbjct: 604 VGPQETVVEGLSK 616
>gi|348508362|ref|XP_003441723.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Oreochromis
niloticus]
Length = 605
Score = 156 bits (394), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 121/368 (32%), Positives = 167/368 (45%), Gaps = 58/368 (15%)
Query: 8 DHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQY 67
D ++GK + GS+HY R W + + K K GL + TYV WN HEP RG +
Sbjct: 10 DSSQFTLEGKPFRILGGSVHYFRVPRAYWEDRLLKMKACGLNTLTTYVPWNLHEPERGTF 69
Query: 68 YFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKE 127
F+ + DL +V + GL++ LR GPY CAEW+ GG P WL +Q RTT F
Sbjct: 70 NFQDQLDLKAYVSLAAQLGLWVILRPGPYICAEWDLGGLPSWLLQDEEMQLRTTYPGFVN 129
Query: 128 EMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNL 187
+ + K+I ++K L GGPII QVENEYG+ +A D
Sbjct: 130 AVNLYFDKLISVIKP--LMFEGGGPIIAVQVENEYGS--------------FAKD----- 168
Query: 188 NTSVPWVM-CQQEDAPDPIINTCN---GFYCDGF-----TPN---------------SPS 223
+ +P++ C Q ++ T + G C G T N P
Sbjct: 169 DKYMPFIKNCLQSRGIKELLMTSDNWEGLRCGGVEGALKTVNLQRLSFGAIQHLADIQPQ 228
Query: 224 KPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG 283
KP+M E +SGWF +G ED+ V+ + G + N YM+ GGT FG G
Sbjct: 229 KPLMVMEYWSGWFDVWGEHHHVFYAEDMLAVVSEILDRGVSI-NLYMFHGGTTFGFMNGA 287
Query: 284 ------PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYL--ISSDPTHQKLG 335
TSYDYDAP+ E G PK+ HLR L E+L + S P + G
Sbjct: 288 MDFGTYKSQVTSYDYDAPLSEAGDC-TPKYHHLRNLFSQYH--SEHLPGVPSSPERKAYG 344
Query: 336 -AKLEAHI 342
A ++ H+
Sbjct: 345 PALIQQHL 352
>gi|299142590|ref|ZP_07035721.1| beta-galactosidase (Lactase) [Prevotella oris C735]
gi|298576025|gb|EFI47900.1| beta-galactosidase (Lactase) [Prevotella oris C735]
Length = 823
Score = 156 bits (394), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 109/328 (33%), Positives = 158/328 (48%), Gaps = 27/328 (8%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
+ T +++G+ V+++ +HYPR W + I+ K G+ + YVFWN HE
Sbjct: 67 GDFTVGKNTFLLNGQPFVVKAAELHYPRIPRPYWEQRIKMCKSLGMNTVCLYVFWNIHEQ 126
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G++ F G D+ F + Q+ G+++ +R GPY CAEW GG P WL I+ R +
Sbjct: 127 QEGKFDFTGNNDVAAFCRLAQKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREDD 186
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELY------ 176
F +K F A++ + L GGPII+ QVENEYG +YGV +
Sbjct: 187 PYFMARVKAFEAEVGRQLAP--LTIQNGGPIIMVQVENEYG----SYGVNKKYVSQIRDI 240
Query: 177 VKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWT 229
VK + V L W + + D ++ T N G D P P+M +
Sbjct: 241 VKASGFDKVTL-FQCDWASNFENNGLDDLVWTMNFGTGSNIDAQFKRLKQLRPDAPLMCS 299
Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA 287
E +SGWF +G RP + + + +F + YM GGT+FG AG P A
Sbjct: 300 EFWSGWFDKWGARHETRPAKAMVEGIDEMLSKNISF-SLYMTHGGTSFGHWAGANSPGFA 358
Query: 288 ---TSYDYDAPIDEYGFIRQPKWGHLRE 312
TSYDYDAPI+EYG PK+ LR+
Sbjct: 359 PDVTSYDYDAPINEYGHA-TPKFWELRK 385
>gi|223982755|ref|ZP_03632983.1| hypothetical protein HOLDEFILI_00257 [Holdemania filiformis DSM
12042]
gi|223965255|gb|EEF69539.1| hypothetical protein HOLDEFILI_00257 [Holdemania filiformis DSM
12042]
Length = 592
Score = 155 bits (393), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 103/326 (31%), Positives = 155/326 (47%), Gaps = 34/326 (10%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DG+ L SG++HY R PE W + + K K G +ETY+ WNYHEP +GQ+ F G
Sbjct: 10 FMLDGQPVKLISGALHYFRIVPEYWQDRLEKLKNMGCNCVETYIPWNYHEPKKGQFDFSG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
R D+ RFV+ Q GL++ LR PY CAEW +GG P WL ++ R+T P+ + +
Sbjct: 70 RKDVARFVRKAQALGLWVILRPTPYICAEWEFGGLPAWLLADDSMRVRSTYQPYLDAVDA 129
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGN-------------VEWAYGVGGELYV- 177
+ A++ +++ LF + GGP+++ Q+ENEYG+ + +G ++
Sbjct: 130 YYAELFKVIRP--LFFTHGGPVLMCQIENEYGSFGNDKQYLKAIKRLMEKHGCDVPMFTS 187
Query: 178 ----KWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYS 233
+ D LN V D I F D N P+M E +
Sbjct: 188 DGGWREVLDAGTLLNEGV-LPTANFGSRTDEQIGALRQFMND----NDIHGPLMCMEFWI 242
Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTN------FGRTAGGPLVA 287
GWF ++G + R ++ A + G N YM+ GGTN G
Sbjct: 243 GWFNNWGSPLKTRDAKEAADELDAMLRQGSV--NIYMFHGGTNPEFYNGCSYHNGMDPQI 300
Query: 288 TSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDY AP+ E+G K+ RE+
Sbjct: 301 TSYDYAAPLTEWG-TEAEKYAAFREV 325
>gi|311281324|ref|YP_003943555.1| glycoside hydrolase [Enterobacter cloacae SCF1]
gi|308750519|gb|ADO50271.1| glycoside hydrolase family 35 [Enterobacter cloacae SCF1]
Length = 591
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 100/313 (31%), Positives = 152/313 (48%), Gaps = 30/313 (9%)
Query: 9 HRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYY 68
+ L+ DGK L SG+IHY R P+ W + K G +ETY+ WN H+P ++
Sbjct: 7 EKNLLQDGKPVQLISGAIHYFRLVPQYWEHSLNNLKALGANCVETYLPWNIHQPDPERFC 66
Query: 69 FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEE 128
F G D+ RF+ Q GLF+ LR PY CAEW +GG P WL P ++ R++ F +
Sbjct: 67 FTGMADVERFIALAQRKGLFVILRPSPYICAEWEFGGLPAWLLRDPSMRVRSSQPAFLQA 126
Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN 188
++R+ A+++ + +GGP+++ Q+ENEYG ++G + Y++ A
Sbjct: 127 VERYYAELLPRLAPWQY--DRGGPVVMMQLENEYG----SFG-NDKAYLRTLAAMMRRYG 179
Query: 189 TSVP-------WVMCQQEDA--PDPIINTCN-----GFYCDGFTPNSPSKPIMWTENYSG 234
SVP W Q + D ++ T N D P +P+M E ++G
Sbjct: 180 VSVPLFTSDGAWQEALQAGSLCEDNVLATANFGSRSAESLDNLAAFQPERPLMCLEFWNG 239
Query: 235 WFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV-------A 287
WF +G A+ R +D+ + N YM+ GGTNFG G +
Sbjct: 240 WFNRYGDAIIRRDADDVGQEIRTLLTRASI--NIYMFQGGTNFGFMNGCSVRGDKDLPQV 297
Query: 288 TSYDYDAPIDEYG 300
TSYDYDA + E+G
Sbjct: 298 TSYDYDALLSEWG 310
Score = 39.3 bits (90), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 53/197 (26%), Positives = 81/197 (41%), Gaps = 50/197 (25%)
Query: 512 LNEGINTLDILSMMVGLQNYGAWF--DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVE 569
L E N LD+L +G NYG GL ++IDL L + I+ + ++
Sbjct: 432 LREADNVLDLLIENMGRVNYGPRLLAPTQRKGLRGGLVIDLH-----LETDWDIFPLPLD 486
Query: 570 GEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVN 629
+D + S+ W+ + +Y+ F A + L+ S+GKG A++N
Sbjct: 487 N----IDDVDF--SAGWQP-------QQPAFYEYCF-AIDSPADTFLDTRSLGKGVAFIN 532
Query: 630 GQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLV 689
G ++GRYW YRG P LY IP + GEN L+
Sbjct: 533 GFNLGRYW---------------YRG-------------PLGYLY-IPAPLLKQGENRLI 563
Query: 690 IHEELGGDPSKISLLTK 706
I E G + ++LL K
Sbjct: 564 IFETEGVEVGALALLNK 580
>gi|297204198|ref|ZP_06921595.1| beta-galactosidase [Streptomyces sviceus ATCC 29083]
gi|197714112|gb|EDY58146.1| beta-galactosidase [Streptomyces sviceus ATCC 29083]
Length = 588
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 110/328 (33%), Positives = 156/328 (47%), Gaps = 29/328 (8%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+T ++ G+ + SG++HY R P+ W + +RK++ GL IETY+ WN HEP
Sbjct: 7 LTTSSDGFLLHGEPFRIISGAMHYFRIHPDQWTDRLRKARLMGLNTIETYLPWNLHEPEP 66
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G +G DL R+++ Q+ GL + LR GP+ CAEW+ GG P WL P I+ R+++
Sbjct: 67 GTLVLDGFLDLPRWLRLAQDEGLHVLLRPGPFICAEWDDGGLPAWLLADPDIRLRSSDPR 126
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
F +L +++ ++ A+ GGP+I QVENEYG AYG Y+K
Sbjct: 127 FTGAFDGYLDQLLPALRP--FMAAHGGPVIAVQVENEYG----AYG-DDTAYLKHVHQAL 179
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCD------------GFTPNSPSKPIMWTENY 232
+ C Q A T G + P P+M +E +
Sbjct: 180 RDRGVEELLYTCDQASAEHLAAGTLPGTLATATFGSRVEENLAALRTHQPEGPLMCSEFW 239
Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PL 285
GWF +G R D A + R G + N YM+ GGTNFG T G P
Sbjct: 240 VGWFDHWGGPHHVRSAADAAADLDRLLSAGASV-NIYMFHGGTNFGFTNGANHKHAYEPT 298
Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLREL 313
V TSYDYDAP+ E G PK+ RE+
Sbjct: 299 V-TSYDYDAPLTESG-DPGPKYHAFREV 324
>gi|294672870|ref|YP_003573486.1| beta-galactosidase [Prevotella ruminicola 23]
gi|294473700|gb|ADE83089.1| putative beta-galactosidase [Prevotella ruminicola 23]
Length = 787
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 107/337 (31%), Positives = 163/337 (48%), Gaps = 37/337 (10%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ + T ++ +++G+ V+++ +HYPR W I+ K G+ + YVFWN HE
Sbjct: 21 AGDFTVGNKTFLLNGEPFVVKAAEVHYPRIPRPYWEHRIKMCKALGMNTLCIYVFWNIHE 80
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
GQ+ F D+ F + Q+ G+++ +R GPY CAEW GG P WL I+ R
Sbjct: 81 QREGQFDFTDNNDVAEFCRLAQKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLRER 140
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV-----------EWAYG 170
+ F E +K F K+ + + L GGPII+ QVENEYG+ + G
Sbjct: 141 DPYFLERVKIFEQKVGEQLAP--LTIQNGGPIIMVQVENEYGSYGEDKPYVSEIRDCLRG 198
Query: 171 VGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCN---GFYCD----GFTPNSPS 223
+ GE + D + N + + D ++ T N G D P+
Sbjct: 199 IYGEKLTLFQCDWSSNF----------ERNGLDDLVWTMNFGTGANIDHEFARLKQLRPN 248
Query: 224 KPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG 283
P+M +E +SGWF +G RP +D+ + +F + YM GGT+FG AG
Sbjct: 249 APLMCSEFWSGWFDKWGANHETRPAKDMVDGMDEMLSKNISF-SLYMTHGGTSFGHWAGA 307
Query: 284 --PLVA---TSYDYDAPIDEYGFIRQPKWGHLRELHK 315
P A TSYDYDAPI+EYG + K+ LR++ +
Sbjct: 308 NSPGFAPDVTSYDYDAPINEYGGTTE-KFFQLRKMMQ 343
>gi|299148656|ref|ZP_07041718.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
gi|298513417|gb|EFI37304.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
Length = 778
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 107/323 (33%), Positives = 158/323 (48%), Gaps = 34/323 (10%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
I+GK L G +HYPR E W + +++++ GL + YVFWN+HE G++ F
Sbjct: 38 TFTIEGKDIQLICGEMHYPRIPHEYWRDRLKRARAMGLNTVSAYVFWNFHERQPGEFDFS 97
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F++T QE GL++ LR GPY CAEW++GG+P WL + +R+ + F +
Sbjct: 98 GQADIAEFIRTAQEEGLYVILRPGPYVCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCE 157
Query: 131 RFLAKIIDLMKQEN-LFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
R+ I +L KQ + L + GG II+ QVENEYG+ G Y+ D
Sbjct: 158 RY---IKELGKQLSPLTINNGGNIIMVQVENEYGSYAADKG-----YLAAIRDMIKEAGF 209
Query: 190 SVPWVMCQ-----QEDAPDPIINTCNGFYCDG----FTPNSPSKPIMWTENYSGWFLSFG 240
+VP C + + + T NG + + P E Y WF +G
Sbjct: 210 NVPLFTCDGGGQVEAGHTEGALPTLNGVFGEDIFKVIDKYQKGGPYFVAEFYPAWFDEWG 269
Query: 241 Y---AVPF-RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSY 290
+V + RP E L + ++ G + YM+ GGTNF T G TSY
Sbjct: 270 RRHSSVAYERPAEQLDWMLSH-----GVSVSMYMFHGGTNFEYTNGANTGGGYQPQPTSY 324
Query: 291 DYDAPIDEYGFIRQPKWGHLREL 313
DYDAP+ E+G PK+ RE+
Sbjct: 325 DYDAPLGEWGNC-YPKYHAFREV 346
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 64/274 (23%), Positives = 111/274 (40%), Gaps = 52/274 (18%)
Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
++ F+ E K S +F + +E + + +D Y + GK+ + I+
Sbjct: 366 TTTFATVELKESASLRTAFHQTTQSENVLSMEDLGVDFGYIHYQTTLQKAGKQKLV-IQD 424
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
L A++ ++ K VA ++ + +N +++ TL+IL G NYG
Sbjct: 425 LRDYAVILIDGKQVASLDRRYNQNSMTLN----VSKTPATLEILVENTGRVNYGPDILFN 480
Query: 539 GAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
G+ S +L G L+ G + L K ++ F + +P
Sbjct: 481 RKGITSQVLW----GNEKLT--------GWSITPLPLYKEKVSEMEFGETIKGVPA---- 524
Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
++K TF E KG ++++ GKG WVNG+S+GR+W+
Sbjct: 525 -FHKGTFTV-EKKGDCFVDMSQWGKGAVWVNGKSLGRFWNI------------------- 563
Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHE 692
P QTLY +P W+ GEN +V+ E
Sbjct: 564 ---------GPQQTLY-LPAPWLKEGENEIVVFE 587
>gi|411007376|ref|ZP_11383705.1| beta-galactosidase [Streptomyces globisporus C-1027]
Length = 606
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 111/314 (35%), Positives = 152/314 (48%), Gaps = 43/314 (13%)
Query: 15 DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
DGK L SG++HY R E W + GL +ETYV WN HEP G+ G
Sbjct: 14 DGKPVRLLSGALHYFRVHEEQWEHRLAMLAAMGLNCVETYVPWNLHEPREGEVRDVG--A 71
Query: 75 LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
L RF+ V+ AGL+ +R GPY CAEW GG PVW+ G + RT + ++ ++R+
Sbjct: 72 LGRFLDAVERAGLWAIVRPGPYICAEWENGGLPVWVTGRFGRRVRTRDAEYRAVVERWFR 131
Query: 135 KIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWV 194
+++ + Q + +GGP+IL Q ENEYG+ +G +Y++W A +VP
Sbjct: 132 ELLPQVVQRQVV--RGGPVILVQAENEYGS----FGSDA-VYLEWLAGLLRECGVTVPLF 184
Query: 195 MCQQEDAPDP----------IINTCN-------GFYCDGFTPNSPSKPIMWTENYSGWFL 237
D P+ ++ T N GF + P P+M E + GWF
Sbjct: 185 TS---DGPEDHMLTGGSVPGLLATANFGSGAREGFEV--LRRHQPKGPLMCMEFWCGWFD 239
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNF----GRTAGGPL-------V 286
+G R E+ A A+ E G + N YM GGTNF G GGPL
Sbjct: 240 HWGAEPVLRDAEEAAGALREILECGASV-NVYMAHGGTNFAGWAGANRGGPLQDGEFQPT 298
Query: 287 ATSYDYDAPIDEYG 300
TSYDYDAP+DEYG
Sbjct: 299 VTSYDYDAPVDEYG 312
>gi|297727459|ref|NP_001176093.1| Os10g0340600 [Oryza sativa Japonica Group]
gi|255679317|dbj|BAH94821.1| Os10g0340600 [Oryza sativa Japonica Group]
Length = 143
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 67/104 (64%), Positives = 86/104 (82%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V+YD R+L++DG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL IETYVFWN HEP R
Sbjct: 31 VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPV 108
++ FEG +D+VRF K +Q AG++ LRIGPY C EWNYG P+
Sbjct: 91 REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGYMPM 134
>gi|306832839|ref|ZP_07465973.1| beta-galactosidase [Streptococcus bovis ATCC 700338]
gi|304424978|gb|EFM28110.1| beta-galactosidase [Streptococcus bovis ATCC 700338]
Length = 595
Score = 155 bits (392), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 112/320 (35%), Positives = 152/320 (47%), Gaps = 43/320 (13%)
Query: 10 RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
+ +DGK + SGSIHY R P+ W + + K G +ETYV WN HEP G++ F
Sbjct: 8 ESFFLDGKPFKILSGSIHYFRIHPDDWYQSLYNLKALGFNTVETYVPWNLHEPREGEFDF 67
Query: 70 EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
G DL RF+ QE GL+ +R PY CAEW +GG P WL G++ R+ + F + +
Sbjct: 68 TGILDLERFLTIAQELGLYAIVRPSPYICAEWEFGGLPAWL-LEKGVRVRSQDKGFLQVV 126
Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
KR+ +I + + L QGG I++ QVENEYG +YG ++Y++ + L
Sbjct: 127 KRYYEVLIPRLIKHQL--DQGGNILMFQVENEYG----SYG-EDKVYLRELKQMMLELGL 179
Query: 190 SVPWVMCQQEDAP------------DPIINTCNGFYCDG----------FTPNSPSKPIM 227
P+ D P D ++ T N F F P+M
Sbjct: 180 EEPFFTS---DGPWHTALRAGSLIEDDVLVTGN-FGSKAKENFASMEMFFQQYGKKWPLM 235
Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RT 280
E + GWF +G V R E+LA AV E G N YM+ GGTNFG R
Sbjct: 236 CMEFWDGWFNRWGEPVIKRDPEELADAVMEAIEIGSI--NLYMFHGGTNFGFMNGCSARK 293
Query: 281 AGGPLVATSYDYDAPIDEYG 300
TSYDYDA +DE G
Sbjct: 294 QTDLPQVTSYDYDAILDEAG 313
>gi|198433885|ref|XP_002127100.1| PREDICTED: similar to galactosidase, beta 1-like 2 [Ciona
intestinalis]
Length = 658
Score = 155 bits (392), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 112/339 (33%), Positives = 164/339 (48%), Gaps = 33/339 (9%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
+ +T + +DGK + SG++HY R E W + + K K GL IETYV WN HEP
Sbjct: 56 SGLTAQGKTFKLDGKPMTIISGAVHYFRMPREYWRDRLMKMKACGLNTIETYVPWNLHEP 115
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
I G+Y F G DLV F+ + ++ LR GPY C+EW +GG P WL P ++ RT
Sbjct: 116 IPGKYNFTGDLDLVHFILLAHKLEFYVLLRPGPYICSEWEFGGLPSWLLRDPKMKVRTMY 175
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
P+ + ++ ++ +K L GGPII Q++NEYG +Y + Y+ + +
Sbjct: 176 PPYIAAVTKYFNYLLPFVKP--LQYQYGGPIIAFQLDNEYG----SYFKDAD-YLPYLKE 228
Query: 183 TAVN------LNTSVPWVMCQQEDAPDPIINTCNGFYCDG-FTPNS---PSKPIMWTENY 232
N L S +Q+ P ++ T N + FT S P P+M E +
Sbjct: 229 FLQNKGIIELLFISDSIEGLRQQTIPG-VLKTVNFKRMENHFTDLSNMQPDAPLMVMEFW 287
Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PL 285
+GWF +G V++ + F GG+ N+YM+FGGTNFG G
Sbjct: 288 TGWFDWWGEKHHILTVQEFGETLNEIFSQGGSV-NFYMFFGGTNFGFMNGAYKDGTGFHA 346
Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYL 324
TSYDYDA I E G + + + KA ++ E Y
Sbjct: 347 DITSYDYDALIAENGDLTE-------KYFKAKQIIEHYF 378
>gi|326922161|ref|XP_003207320.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Meleagris
gallopavo]
Length = 643
Score = 155 bits (392), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 115/350 (32%), Positives = 167/350 (47%), Gaps = 29/350 (8%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+ YD V DG+ SGSIHY R W + + K K GL+ I+TYV WNYHE
Sbjct: 18 IDYDCNCFVKDGRPFRYISGSIHYSRVPRYYWKDRLLKMKMAGLDAIQTYVPWNYHETQM 77
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G Y F G DL F++ E GL + LR GPY CAEW+ GG P WL I R++++
Sbjct: 78 GVYDFSGDRDLEYFLQLASETGLLVILRAGPYICAEWDMGGLPAWLLEKESIVLRSSDSD 137
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAYGVG--------- 172
+ +++++ ++ MK GGPII+ QVENEYG+ ++ Y
Sbjct: 138 YLTAVEKWMGVLLPKMKPH--LYQNGGPIIMVQVENEYGSYFACDYDYLRSLLKIFRQHL 195
Query: 173 GELYVKWAADTAVNLNT---SVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWT 229
G+ V + D A + ++ + + AP N F + P+ P++ +
Sbjct: 196 GDEVVLFTTDGASQFHLKCGALQGLYATVDFAPGG--NVTAAFLAQ--RSSEPTGPLVNS 251
Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG--RTAGGPLVA 287
E Y+GW +G+ P + +A + G N YM+ GGTNF A P ++
Sbjct: 252 EFYTGWLDHWGHRHAVVPSQTIAKTLNEILARGANV-NLYMFIGGTNFAYWNGANMPYMS 310
Query: 288 --TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLG 335
TSYDYDAP+ E G + + K+ LRE+ E LI PT K
Sbjct: 311 QPTSYDYDAPLSEAGDLTE-KYFALREVIGMYNQLPEGLIP--PTTSKFA 357
>gi|18410234|ref|NP_565051.1| beta-galactosidase 17 [Arabidopsis thaliana]
gi|75163694|sp|Q93Z24.1|BGL17_ARATH RecName: Full=Beta-galactosidase 17; Short=Lactase 17; Flags:
Precursor
gi|16648842|gb|AAL25611.1| At1g72990/F3N23_19 [Arabidopsis thaliana]
gi|22655360|gb|AAM98272.1| At1g72990/F3N23_19 [Arabidopsis thaliana]
gi|332197279|gb|AEE35400.1| beta-galactosidase 17 [Arabidopsis thaliana]
Length = 697
Score = 155 bits (392), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 113/332 (34%), Positives = 160/332 (48%), Gaps = 41/332 (12%)
Query: 15 DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
DG R + G +HY R PE W + + ++ GL I+ YV WN HEP G+ FEG D
Sbjct: 73 DGNRFQIIGGDLHYFRVLPEYWEDRLLRANALGLNTIQVYVPWNLHEPKPGKMVFEGIGD 132
Query: 75 LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFI-PGIQFRTTNNPFKEEMKRFL 133
LV F+K ++ + LR GPY C EW+ GGFP WL + P +Q RT++ + + ++R+
Sbjct: 133 LVSFLKLCEKLDFLVMLRAGPYICGEWDLGGFPAWLLAVKPRLQLRTSDPVYLKLVERWW 192
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGN-----------VEWAYGVGGELYVKWAAD 182
+ L K L S GGP+I+ Q+ENEYG+ V A G G+ + + D
Sbjct: 193 DVL--LPKVFPLLYSNGGPVIMVQIENEYGSYGNDKAYLRKLVSMARGHLGDDIIVYTTD 250
Query: 183 --TAVNLNT-SVP------WVMCQQEDAPDPIINTCNGFYCDGFTPNSPSK-PIMWTENY 232
T L+ +VP V D P PI F N+P + P + +E Y
Sbjct: 251 GGTKETLDKGTVPVADVYSAVDFSTGDDPWPIFKLQKKF-------NAPGRSPPLSSEFY 303
Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA----- 287
+GW +G + E A ++ + G+ YM GGTNFG G +
Sbjct: 304 TGWLTHWGEKITKTDAEFTAASLEKILSRNGS-AVLYMVHGGTNFGFYNGANTGSEESDY 362
Query: 288 ----TSYDYDAPIDEYGFIRQPKWGHLRELHK 315
TSYDYDAPI E G I PK+ L+ + K
Sbjct: 363 KPDLTSYDYDAPIKESGDIDNPKFQALQRVIK 394
>gi|237721434|ref|ZP_04551915.1| beta-galactosidase [Bacteroides sp. 2_2_4]
gi|293370839|ref|ZP_06617384.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
gi|229449230|gb|EEO55021.1| beta-galactosidase [Bacteroides sp. 2_2_4]
gi|292634055|gb|EFF52599.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
Length = 777
Score = 155 bits (392), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 107/323 (33%), Positives = 158/323 (48%), Gaps = 34/323 (10%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
I+GK L G +HYPR E W + +++++ GL + YVFWN+HE G++ F
Sbjct: 38 TFTIEGKDIQLICGEMHYPRIPHEYWRDRLKRARAMGLNTVSAYVFWNFHERQPGEFDFS 97
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F++T QE GL++ LR GPY CAEW++GG+P WL + +R+ + F +
Sbjct: 98 GQADIAEFIRTAQEEGLYVILRPGPYVCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCE 157
Query: 131 RFLAKIIDLMKQEN-LFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
R+ I +L KQ + L + GG II+ QVENEYG+ G Y+ D
Sbjct: 158 RY---IKELGKQLSPLTINNGGNIIMVQVENEYGSYAADKG-----YLAAIRDMIKEAGF 209
Query: 190 SVPWVMCQ-----QEDAPDPIINTCNGFYCDG----FTPNSPSKPIMWTENYSGWFLSFG 240
+VP C + + + T NG + + P E Y WF +G
Sbjct: 210 NVPLFTCDGGGQVEAGHTEGALPTLNGVFGEDIFKVIDKYQKGGPYFVAEFYPAWFDEWG 269
Query: 241 Y---AVPF-RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSY 290
+V + RP E L + ++ G + YM+ GGTNF T G TSY
Sbjct: 270 RRHSSVAYERPAEQLDWMLSH-----GVSVSMYMFHGGTNFEYTNGANTGGGYQPQPTSY 324
Query: 291 DYDAPIDEYGFIRQPKWGHLREL 313
DYDAP+ E+G PK+ RE+
Sbjct: 325 DYDAPLGEWGNC-YPKYHAFREV 346
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 64/274 (23%), Positives = 111/274 (40%), Gaps = 52/274 (18%)
Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
++ F+ E K S +F + +E + + +D Y + GK+ + I+
Sbjct: 366 TTTFATVELKESASLRTAFHQTTQSENVLSMEDLGVDFGYIHYQTTLQKAGKQKLV-IQD 424
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
L A++ ++ K VA ++ + +N +++ TL+IL G NYG
Sbjct: 425 LRDYAVILIDGKQVASLDRRYNQNSMTLN----VSKTPATLEILVENTGRVNYGPDILFN 480
Query: 539 GAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
G+ S +L G L+ G + L K ++ F + +P
Sbjct: 481 RKGITSQVLW----GNEKLT--------GWSITPLPLYKEKVSEMEFGETIKGVPA---- 524
Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
++K TF E KG ++++ GKG WVNG+S+GR+W+
Sbjct: 525 -FHKGTFTV-EKKGDCFVDMSQWGKGAVWVNGKSLGRFWNI------------------- 563
Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHE 692
P QTLY +P W+ GEN +V+ E
Sbjct: 564 ---------GPQQTLY-LPAPWLKEGENEIVVFE 587
>gi|393785841|ref|ZP_10373985.1| hypothetical protein HMPREF1068_00265 [Bacteroides nordii
CL02T12C05]
gi|392660955|gb|EIY54552.1| hypothetical protein HMPREF1068_00265 [Bacteroides nordii
CL02T12C05]
Length = 605
Score = 155 bits (392), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 111/324 (34%), Positives = 155/324 (47%), Gaps = 37/324 (11%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE-GR 72
+D K + SG IH R E W + I+ K G + Y+ WNYHE G + F+ G
Sbjct: 41 LDDKPFQIISGEIHPSRIPAEYWKQRIQMIKAMGCNTVACYIMWNYHESEPGVFDFQTGN 100
Query: 73 FDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRF 132
DL +F++TVQE +FL R GPY C EW++GG P +L P I+ R + + ++R+
Sbjct: 101 KDLEKFIRTVQEEDMFLLFRPGPYVCGEWDFGGLPAYLLSTPDIKIRCMDPRYTTAVERY 160
Query: 133 LAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVP 192
I ++K+ + + GGPII+ QVENEYG +YG Y+KW D + VP
Sbjct: 161 ATAIAPIIKKYEV--TNGGPIIMVQVENEYG----SYG-NDRTYMKWIHDLWRDKGIEVP 213
Query: 193 WVMCQQEDAPDPII---NTCNGFYCDGFTPNS------------PSKPIMWTENYSGWFL 237
+ D P + T G G P + P + +E Y GW
Sbjct: 214 FYTA---DGATPYMLEAGTLPGVAI-GLDPAASKAEFDEALKVHPDASVFCSELYPGWLT 269
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG----PLV----ATS 289
+ +E + V + G +F NYY+ GGTNFG AG P + TS
Sbjct: 270 HWRENWQHPSIEKITTDVKWLLDNGKSF-NYYVIHGGTNFGFWAGANSPQPGIYQPDVTS 328
Query: 290 YDYDAPIDEYGFIRQPKWGHLREL 313
YDYDAPI+E G PK+ LREL
Sbjct: 329 YDYDAPINEMG-QATPKYMALREL 351
>gi|383114571|ref|ZP_09935333.1| hypothetical protein BSGG_1258 [Bacteroides sp. D2]
gi|382948460|gb|EFS30558.2| hypothetical protein BSGG_1258 [Bacteroides sp. D2]
Length = 775
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 110/323 (34%), Positives = 161/323 (49%), Gaps = 34/323 (10%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
I+GK L G +HYPR E W + +++++ GL + YVFWN+HE G++ F
Sbjct: 36 TFTIEGKDIQLICGEMHYPRIPHEYWRDRLKRARAMGLNTVSAYVFWNFHERQPGEFDFS 95
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F++T QE GL++ LR GPY CAEW++GG+P WL + +R+ + F +
Sbjct: 96 GQADIAEFIRTAQEEGLYVILRPGPYVCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCE 155
Query: 131 RFLAKIIDLMKQEN-LFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
R+ I +L KQ + L + GG II+ QVENEYG +Y E Y+ D
Sbjct: 156 RY---IKELGKQLSPLTINNGGNIIMVQVENEYG----SYAADKE-YLAAIRDMIKEAGF 207
Query: 190 SVPWVMCQ---QEDA--PDPIINTCNGFYCDGF----TPNSPSKPIMWTENYSGWFLSFG 240
+VP C Q +A + + T NG + + P E Y WF +G
Sbjct: 208 NVPLFTCDGGGQVEAGHVEGALPTLNGVFGEDIFKVVDKYQKGGPYFVAEFYPAWFDEWG 267
Query: 241 Y---AVPF-RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSY 290
+V + RP E L + ++ G + YM+ GGTNF T G TSY
Sbjct: 268 RRHSSVAYERPAEQLDWMLSH-----GVSVSMYMFHGGTNFEYTNGANTGGGYQPQPTSY 322
Query: 291 DYDAPIDEYGFIRQPKWGHLREL 313
DYDAP+ E+G PK+ RE+
Sbjct: 323 DYDAPLGEWGNC-YPKYHAFREV 344
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 63/274 (22%), Positives = 109/274 (39%), Gaps = 52/274 (18%)
Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
++ F+ E K +F +E + + +D Y + GK+ + I+
Sbjct: 364 TTTFATVELKESAPLRTAFHPTTQSENVLSMEDLGVDFGYIHYQTTLQKAGKQKLV-IQD 422
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
L A++ ++ K VA ++ + +N +++ TL+IL G NYG
Sbjct: 423 LRDYAVILIDGKQVASLDRRYNQNSVTLN----VSKTPATLEILVENTGRVNYGPDILFN 478
Query: 539 GAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
G+ S +L G L+ G + L K ++ F + +P
Sbjct: 479 RKGITSQVLW----GNEKLT--------GWSITPLPLYKEKVSEMEFGETIKGVPA---- 522
Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
++K TF E KG ++++ GKG WVNG+S+GR+W+
Sbjct: 523 -FHKGTFTV-EKKGDCFVDMSQWGKGAVWVNGKSLGRFWNI------------------- 561
Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHE 692
P QTLY +P W+ GEN +V+ E
Sbjct: 562 ---------GPQQTLY-LPAPWLKEGENEIVVFE 585
>gi|317504905|ref|ZP_07962857.1| family 35 glycosyl hydrolase [Prevotella salivae DSM 15606]
gi|315663982|gb|EFV03697.1| family 35 glycosyl hydrolase [Prevotella salivae DSM 15606]
Length = 784
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 110/328 (33%), Positives = 158/328 (48%), Gaps = 27/328 (8%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
+ T +++G+ V+++ +HYPR W + I+ K G+ I YVFWN HE
Sbjct: 28 GDFTAGKNTFLLNGQPFVVKAAELHYPRIPRPYWDQRIKMCKALGMNTICLYVFWNIHEQ 87
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
+Y F G D+ F + Q+ G+++ +R GPY CAEW GG P WL I+ R +
Sbjct: 88 QESKYDFTGNNDVAAFCRLAQKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREDD 147
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELY------ 176
F +K F A++ + L GGPII+ QVENEYG +YGV +
Sbjct: 148 PYFLARVKAFEAEVGRQLAP--LTIQNGGPIIMVQVENEYG----SYGVNKQYVSQIRDI 201
Query: 177 VKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWT 229
VK + V L W +++ D ++ T N G D P P+M +
Sbjct: 202 VKASGFDKVTL-FQCDWASNFEKNGLDDLLWTMNFGTGSNIDAQFKRLKQLRPETPLMCS 260
Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA 287
E +SGWF +G RP + + + +F + YM GGT+FG AG P A
Sbjct: 261 EFWSGWFDKWGARHETRPAKAMVEGINEMLSKNISF-SLYMTHGGTSFGHWAGANSPGFA 319
Query: 288 ---TSYDYDAPIDEYGFIRQPKWGHLRE 312
TSYDYDAPI+EYG PK+ LR+
Sbjct: 320 PDVTSYDYDAPINEYGHA-TPKFWELRK 346
Score = 42.4 bits (98), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 58/241 (24%), Positives = 91/241 (37%), Gaps = 53/241 (21%)
Query: 465 MPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSM 524
+P K L++ A +F++ KL+ G D + K+ + TL IL
Sbjct: 413 LPQIEKSSRLSLNEAHDYAQIFIDNKLI----GTIDRTKNEKSIKLPPVKQGATLTILIE 468
Query: 525 MVGLQNYG-AWFDVAGAGLFSVILIDLKNGKRDLSS--GEWI-------YQVGVEGEYIG 574
+G N+G A D G + + ID + D+S W+ YQ
Sbjct: 469 AMGRINFGRAVKDFKG--ITESVTIDTEMNGHDVSYHLKNWVIAPIPDSYQTAQHA---- 522
Query: 575 LDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIG 634
DK+ N F S + + I Y + + G LNL GKGQ +VNG ++G
Sbjct: 523 FDKLDETNRCF----SPINFSSPSIGYYRGYFNLKKVGDTFLNLEQWGKGQVYVNGHALG 578
Query: 635 RYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEEL 694
R+W P QTLY +P W+ G N +++ + +
Sbjct: 579 RFWRI----------------------------GPQQTLY-LPGCWLKKGRNEIIVMDIV 609
Query: 695 G 695
G
Sbjct: 610 G 610
>gi|182414740|ref|YP_001819806.1| beta-galactosidase [Opitutus terrae PB90-1]
gi|177841954|gb|ACB76206.1| Beta-galactosidase [Opitutus terrae PB90-1]
Length = 799
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 112/338 (33%), Positives = 157/338 (46%), Gaps = 27/338 (7%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
A ++DG+ ++ G +H PR E W ++ K GL + Y+FWN HEP G++ +
Sbjct: 53 AFLLDGQPFQIRCGELHAPRVPREYWRHRLQMVKAMGLNTVCAYLFWNMHEPRPGEFDWS 112
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D F + Q AGL++ LR GPYACAEW GG P WL I+ RT + F E +
Sbjct: 113 GQADAAAFCREAQAAGLWVILRPGPYACAEWEMGGLPWWLLKHDEIKLRTRDPRFIEAAR 172
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
R+L ++ + L S+GGPI++ QVENE+G + Y+ ++
Sbjct: 173 RYLQEVGRELGP--LQVSRGGPILMVQVENEHG-----FYADDPAYMGDIRQALLDAGFD 225
Query: 191 VPWVMCQQEDA------PD--PIIN----TCNGFYCDGFTPNSPSKPIMWTENYSGWFLS 238
VP C PD P++N GF P+ P+M E Y GWF +
Sbjct: 226 VPLFACNPTQQVRRGYRPDLFPVVNFGTDPAGGFRA--LREILPTGPLMCGEFYPGWFDT 283
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV----ATSYDYDA 294
+G E + TG +F + YM GGT FG G +SYDYDA
Sbjct: 284 WGAPHHTGQTERYLTDLDYMLRTGASF-SIYMAHGGTTFGFWTGADRPFKPDTSSYDYDA 342
Query: 295 PIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQ 332
PI E G+ PK+ R L L EE L P H+
Sbjct: 343 PISEAGWA-TPKFEQSRALLSKYLLPEETLPEPAPRHR 379
Score = 42.7 bits (99), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 57/230 (24%), Positives = 87/230 (37%), Gaps = 53/230 (23%)
Query: 518 TLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDK 577
TLDIL +G N+G V L +R+L G I+++ ++ +G
Sbjct: 474 TLDILVEAMGRVNFGVEVHDRKGIHGPVTLTASGQPRRELR-GWQIFRLPLDQPMLG--- 529
Query: 578 ISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYW 637
+L +Q T P W T + E G L++ GKG WVNG ++GRYW
Sbjct: 530 -TLRYQPTGEQERTSPA--PAFWRATVKV--EQPGDCFLDMRPWGKGFVWVNGHNLGRYW 584
Query: 638 SAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGD 697
+ P QT+Y +P W+ G+N +V+ + +G
Sbjct: 585 NI----------------------------GPQQTMY-VPAPWLKAGDNEIVVLDLIGPA 615
Query: 698 PSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVV-SSSPQVRLACERG 746
I+ L D P +D +P L S QV L + G
Sbjct: 616 NPVIAAL--------------DQPILDQLRPKLDFAPSRRRQVTLRADFG 651
>gi|195069729|ref|XP_001997012.1| GH25263 [Drosophila grimshawi]
gi|193895091|gb|EDV93957.1| GH25263 [Drosophila grimshawi]
Length = 619
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 108/320 (33%), Positives = 154/320 (48%), Gaps = 28/320 (8%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V Y++ + DG+ SGS HY R+ PE W +R + GL + TYV W+ H P
Sbjct: 28 VDYENDRFLKDGQPFRFISGSFHYFRAHPETWSRHLRTMRAAGLNAVTTYVEWSLHNPRD 87
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRTTNN 123
G Y + G DL RF++ + L + LR GPY CAE + GGFP W L PGIQ RT +
Sbjct: 88 GVYVWTGIADLERFIRLAVDEDLLVILRPGPYICAERDMGGFPYWLLKKYPGIQLRTADI 147
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD- 182
+ E++ + A++ +++ GGPII+ QVENEYG +Y Y W D
Sbjct: 148 NYLSEVRIWYAQL--MVRMSPFLYGNGGPIIMVQVENEYG----SYFACDVNYRNWLRDE 201
Query: 183 TAVNLNTSVPWV-MCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
T ++N +C + D P P++ E Y GW +
Sbjct: 202 TQSHVNGCFGHNGLCATSNLKDTWAR---------LRRFEPKGPLVNAEYYPGWLTHWTE 252
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG------GPLVA--TSYDYD 293
+ + + E+G + N+YM++GGTNFG TAG G +A TSYDYD
Sbjct: 253 PMANVSTDSITGTFIDMLESGASV-NFYMFYGGTNFGFTAGANDNNPGKYIADITSYDYD 311
Query: 294 APIDEYGFIRQPKWGHLREL 313
AP+ E G PK+ LR +
Sbjct: 312 APMTEAG-DPTPKYMALRRI 330
>gi|332879232|ref|ZP_08446929.1| putative beta-galactosidase [Capnocytophaga sp. oral taxon 329 str.
F0087]
gi|357048073|ref|ZP_09109651.1| putative beta-galactosidase [Paraprevotella clara YIT 11840]
gi|332682652|gb|EGJ55552.1| putative beta-galactosidase [Capnocytophaga sp. oral taxon 329 str.
F0087]
gi|355529138|gb|EHG98592.1| putative beta-galactosidase [Paraprevotella clara YIT 11840]
Length = 786
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 105/326 (32%), Positives = 159/326 (48%), Gaps = 33/326 (10%)
Query: 9 HRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYY 68
++ +++GK ++++ +HYPR W + I+ K G+ + YVFWN HE G++
Sbjct: 40 NKTFLLNGKPFIIKAAEVHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQEEGKFD 99
Query: 69 FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEE 128
F G D+ F++ QE GL++ +R GPY CAEW GG P WL I+ R + F E
Sbjct: 100 FTGNNDVAEFIRLAQENGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREQDPYFMER 159
Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV---------------EWAYGVGG 173
+ F K+ + + +L +GGPII+ QVENEYG+ + +
Sbjct: 160 YRIFAQKLGEQIG--DLTIEKGGPIIMVQVENEYGSYGEDKPYVSAIRDIIRDSGFDKVT 217
Query: 174 ELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYS 233
W+++ N + W M A N N F G P P M +E +S
Sbjct: 218 LFQCDWSSNFTKNGLDDLVWTMNFGTGA-----NIENEFKKLGEL--RPESPQMCSEFWS 270
Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVA 287
GWF +G R +++ + + G +F + YM GGT++G AG P V
Sbjct: 271 GWFDKWGGRHETRGSKEMVGGLKEMLDKGISF-SLYMTHGGTSWGHWAGANSPGFSPDV- 328
Query: 288 TSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDAPI+E G + PK+ LRE+
Sbjct: 329 TSYDYDAPINEAGQV-TPKYMELREM 353
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 66/253 (26%), Positives = 98/253 (38%), Gaps = 66/253 (26%)
Query: 466 PGQGKEVFLNIESLGHAALVFVNKKLV-AFGYGNHDFANFLINKKIELNEGINTLDILSM 524
P + L I A VF+N KL+ + NH+ L K EG + LDIL
Sbjct: 418 PAVPTQSVLTITDAHDFAQVFINGKLIGSIDRRNHEKTMLLPAMK----EG-DQLDILVE 472
Query: 525 MVGLQNYG-AWFDVAG----------AGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYI 573
+G N+G A D G S + ++LKN + S YQV + +Y+
Sbjct: 473 AMGRINFGRAIKDFKGITEKVELSYTMNTGSQVTVNLKNWQIYTLSDS--YQVQKDMKYV 530
Query: 574 GLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSI 633
L + Y+ TF + G LNL + GKGQ +VNG +I
Sbjct: 531 PLKDQKVPGC-----------------YRATFNLKK-TGDTFLNLETWGKGQVYVNGHAI 572
Query: 634 GRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEE 693
GR+W P QTLY +P W+ GEN +++ +
Sbjct: 573 GRFWKI----------------------------GPQQTLY-MPGCWLKKGENEIIVQDI 603
Query: 694 LGGDPSKISLLTK 706
+G + + L+K
Sbjct: 604 VGPQETVVEGLSK 616
>gi|228918502|ref|ZP_04081945.1| Beta-galactosidase [Bacillus thuringiensis serovar pulsiensis BGSC
4CC1]
gi|228841118|gb|EEM86317.1| Beta-galactosidase [Bacillus thuringiensis serovar pulsiensis BGSC
4CC1]
Length = 591
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 107/317 (33%), Positives = 153/317 (48%), Gaps = 36/317 (11%)
Query: 10 RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
+ ++DG+ + SG++HY R PE W + K G +ETYV WN HEP G + F
Sbjct: 8 KDFMLDGEPIKIISGALHYFRIVPEYWDHSLYNLKALGCNTVETYVPWNMHEPKEGVFNF 67
Query: 70 EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
EG DLV++V+ Q+ GL + LR PY CAEW +GG P WL I+ R+ N F ++
Sbjct: 68 EGIADLVKYVQLAQKYGLMVILRPTPYICAEWEFGGLPAWLLKYRDIRVRSNTNLFLNKV 127
Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
+ F ++ L+ +L GGPII+ QVENEYG ++G E YV+ +L
Sbjct: 128 ENFYKVLLPLVT--SLQVENGGPIIMMQVENEYG----SFGNDKE-YVRSIKKLMRDLGV 180
Query: 190 SVP-------WVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTEN 231
+VP W + + D ++ T N N P+M E
Sbjct: 181 TVPLFTSDGAWQEALESGSLIDDDVLVTGNFGSRSNENLNALESFIKENKKEWPLMCMEF 240
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------- 283
+ GWF +G + R +LA V + N+YM+ GGTNFG G
Sbjct: 241 WDGWFNRWGMEIIRRDSSELAEEVKELLKRASI--NFYMFQGGTNFGFMNGCSSRENVDL 298
Query: 284 PLVATSYDYDAPIDEYG 300
P + TSYDYDA + E+G
Sbjct: 299 PQI-TSYDYDALLTEWG 314
>gi|333377694|ref|ZP_08469427.1| hypothetical protein HMPREF9456_01022 [Dysgonomonas mossii DSM
22836]
gi|332883714|gb|EGK03994.1| hypothetical protein HMPREF9456_01022 [Dysgonomonas mossii DSM
22836]
Length = 630
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 109/331 (32%), Positives = 159/331 (48%), Gaps = 35/331 (10%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
V DGK + SG +HYPR + W ++ K GL + TYVFWN HEP G++ F G
Sbjct: 35 FVYDGKPVRIISGEMHYPRIPHQYWRHRMQMLKAMGLNAVATYVFWNIHEPEPGKWDFTG 94
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
+L ++K E GL + LR GPY CAEW +GG+P WL + G++ R N F + +
Sbjct: 95 DKNLAEYIKIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNVEGLELRRDNEQFLKYTQL 154
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS 190
++ ++ + NL ++GGPI++ Q ENE+G+ V + E + ++ A L +
Sbjct: 155 YINRLYKEVG--NLQITKGGPIVMVQAENEFGSYVSQRKDIPLEEHRRYNAKIVQQLKDA 212
Query: 191 ---VP-------WVMCQQEDAPDPIINTCNG--------FYCDGFTPNSPSKPIMWTENY 232
VP W+ + A + T NG D + N P M E Y
Sbjct: 213 GFDVPSFTSDGSWLF--EGGAVPGALPTANGESNIENLKKAVDKY--NGGQGPYMVAEFY 268
Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA----- 287
GW + P +A ++ + + NYYM GGTNFG T+G
Sbjct: 269 PGWLAHWLEPHPQISATSIARQTEKYLQNNVSI-NYYMVHGGTNFGFTSGANYDKKHDIQ 327
Query: 288 ---TSYDYDAPIDEYGFIRQPKWGHLRELHK 315
TSYDYDAPI E G++ PK+ LR + K
Sbjct: 328 PDLTSYDYDAPISEAGWV-TPKYDSLRNVIK 357
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 69/225 (30%), Positives = 100/225 (44%), Gaps = 50/225 (22%)
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L I L A+++ N + V G N F I+ I N +TL+IL +G NYG+
Sbjct: 427 LKINGLRDYAIIYANDEKV--GELNRYFNQDSIDVDIPFN---STLEILVENMGRINYGS 481
Query: 534 WFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
G+ S ++I NG G+W +YQ+ ++ E K+ NS F G+T
Sbjct: 482 EIVHNTKGIISPVII---NGME--IEGDWQMYQIPMD-EAPDFSKMQ-KNSVF---GNTE 531
Query: 593 PVNKSLI----WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
K L+ YK TF E G L++ GKG ++NG++IGRYW
Sbjct: 532 SAAKRLLGAPALYKGTFNLTE-TGDTFLDMEDWGKGIVFINGKNIGRYW----------- 579
Query: 649 KKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEE 693
H G P QTLY +P W+ G+N +VI E+
Sbjct: 580 ----------------HVG-PQQTLY-VPGVWLKKGQNEIVIFEQ 606
>gi|260592848|ref|ZP_05858306.1| beta-galactosidase [Prevotella veroralis F0319]
gi|260535218|gb|EEX17835.1| beta-galactosidase [Prevotella veroralis F0319]
Length = 621
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 111/327 (33%), Positives = 158/327 (48%), Gaps = 30/327 (9%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQY-YFE 70
+ DGK + SG +HY R W ++ K GL + TY+FWN+HE G + +
Sbjct: 36 FIYDGKPIQIHSGEMHYARVPAPYWRHRMKMMKAMGLNAVATYIFWNHHETSPGVWDWTT 95
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G +L +F+KT E GL + LR GPY CAEW +GG+P WL + RT N PF + +
Sbjct: 96 GTHNLRQFIKTAGEEGLMVILRPGPYCCAEWEFGGYPWWLPKAKDLVIRTDNKPFLDSCR 155
Query: 131 RFLAKIIDLMKQE-NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNL- 187
++ + L KQ +L +QGGP+I+ Q ENE+G+ V + E + ++AA L
Sbjct: 156 VYINQ---LAKQVLDLQVTQGGPVIMVQAENEFGSYVAQRKDIPLETHKRYAAQIRQQLL 212
Query: 188 --NTSVPWVMCQ-----QEDAPDPIINTCNGF-YCDGFTP-----NSPSKPIMWTENYSG 234
+VP + A + + T NG D + P M E Y G
Sbjct: 213 DAGFTVPMFTSDGSWLFKGGAIEGALPTANGEGDIDKLKKVVNEYHGGVGPYMVAEFYPG 272
Query: 235 WFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV-------- 286
W + P E + ++ + G +F NYYM GGTNFG +AG
Sbjct: 273 WLSHWAEPFPRVSTESVVKQTKKYLDNGISF-NYYMVHGGTNFGFSAGANYSNATNIQPD 331
Query: 287 ATSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDAPI E G+ PK+ LR+L
Sbjct: 332 MTSYDYDAPISEAGWA-TPKYNALRDL 357
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 58/225 (25%), Positives = 91/225 (40%), Gaps = 57/225 (25%)
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGIN-TLDILSMMVGLQNY 531
+ ++ L A+V+VN G + +E++ N TLDIL +G NY
Sbjct: 430 LMRMKGLADYAVVYVN------GEKKGELNKVFDKDSMEIDIPFNSTLDILVENMGRINY 483
Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG-- 589
GA + G+ I ID + +GEW + L S+ +++ G
Sbjct: 484 GARIVQSSKGITRPITID-----DNEITGEW--------QMYPLPMASMPDTNRLPAGYK 530
Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
+ LPV Y +F + G L++A GKG +VNG ++GRYW
Sbjct: 531 AGLPV-----LYSGSFNL-DKVGDTFLDMAQWGKGIVFVNGINLGRYWKV---------- 574
Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEEL 694
P QTLY +P ++ G+N +VI E+L
Sbjct: 575 ------------------GPQQTLY-LPGCYLKKGKNDIVIFEQL 600
>gi|301767332|ref|XP_002919083.1| PREDICTED: beta-galactosidase-like [Ailuropoda melanoleuca]
Length = 668
Score = 154 bits (390), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 112/344 (32%), Positives = 161/344 (46%), Gaps = 26/344 (7%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+ Y H + DG+ SGSIHY R W + + K K GL I++YV WN+HEP
Sbjct: 35 IDYSHNRFLKDGRPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQSYVPWNFHEPQP 94
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
GQY F G D+ F+K E GL + LR GPY CAEW+ GG P WL I R+++
Sbjct: 95 GQYQFSGEHDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDPD 154
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV-------------EWAYGV 171
+ + ++L ++ MK L GGPII QVENEYG+ + Y +
Sbjct: 155 YLAAVDKWLGVLLPKMKP--LLYQNGGPIITVQVENEYGSYFSCDYDHLRFLQKLFHYHL 212
Query: 172 GGELYVKWAADTAVNLNTSVPWVMCQQEDAP-DPIINTCNGFYCDGFTPNSPSKPIMWTE 230
G ++ + + D A + + P N F + P P++ +E
Sbjct: 213 GNDVLL-FTTDGAHEMFLKCGALQGLYATVDFGPGANITAAFEIQ--RKSEPRGPLVNSE 269
Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL----V 286
Y+GW +G E +A A+ G N YM+ GGTNF G +
Sbjct: 270 FYTGWLDHWGQPHSTAKTEVVASALHEILSRGANV-NLYMFIGGTNFAYWNGANMPYQAQ 328
Query: 287 ATSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIKLCEEYLISSDP 329
TSYDYDAP+ E G + + K+ LR+ + K K+ E ++ S P
Sbjct: 329 PTSYDYDAPLSEAGDLTE-KYFALRDVIRKFEKVPEGFIPPSTP 371
>gi|167856235|ref|ZP_02478970.1| beta-galactosidase [Haemophilus parasuis 29755]
gi|167852655|gb|EDS23934.1| beta-galactosidase [Haemophilus parasuis 29755]
Length = 596
Score = 154 bits (390), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 105/320 (32%), Positives = 156/320 (48%), Gaps = 40/320 (12%)
Query: 9 HRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYY 68
+ +++GK + SG++HY R PE W + + K G +ETYV WN H+P Q+
Sbjct: 7 EKDFLLNGKPFKILSGAVHYFRIVPEYWYKTLYNLKAMGCNTVETYVPWNLHQPQPDQFN 66
Query: 69 FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEE 128
F R DLV+F++T ++ GL++ LR PY CAEW +GG P WL IP I+ R + F E
Sbjct: 67 FSKRADLVKFLQTAKDLGLYVILRPTPYICAEWEFGGLPAWLLNIPNIRLRQNDPLFIAE 126
Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN 188
+ R+ +++ + + +QGG I++ Q+ENEYG ++G + Y++ +
Sbjct: 127 IDRYFQELLPRIAPYQI--TQGGNILMMQIENEYG----SFG-NDKNYLRAILALMLIHG 179
Query: 189 TSVP-------WVMCQQEDA--PDPIINTCN------------GFYCDGFTPNSPSKPIM 227
+VP W + A D I+ T N Y D + S P+M
Sbjct: 180 VNVPLFTSDGAWQNALEAGALIEDDILPTGNFGSRSNENLDELQRYIDK---HGKSYPLM 236
Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RT 280
E + GWF + V R +DLA E N+YM+ GGTNFG R
Sbjct: 237 CMEFWDGWFNRWKEPVIRRDAQDLADCTKELLERASI--NFYMFQGGTNFGFWNGCSARL 294
Query: 281 AGGPLVATSYDYDAPIDEYG 300
TSYDYDAP+ E+G
Sbjct: 295 DTDLPQVTSYDYDAPVHEWG 314
>gi|160887166|ref|ZP_02068169.1| hypothetical protein BACOVA_05182 [Bacteroides ovatus ATCC 8483]
gi|156107577|gb|EDO09322.1| glycosyl hydrolase family 35 [Bacteroides ovatus ATCC 8483]
Length = 777
Score = 154 bits (390), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 110/323 (34%), Positives = 160/323 (49%), Gaps = 34/323 (10%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
I+GK L G +HYPR E W + ++++ GL + YVFWN+HE G++ F
Sbjct: 38 TFTIEGKDIQLICGEMHYPRIPHEYWRDRLKRASAMGLNTVSAYVFWNFHERQPGEFDFS 97
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F++T QE GL++ LR GPY CAEW++GG+P WL + +R+ + F +
Sbjct: 98 GQADIAEFIRTAQEEGLYVILRPGPYVCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCE 157
Query: 131 RFLAKIIDLMKQEN-LFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
R+ I +L KQ + L + GG II+ QVENEYG +Y E Y+ D
Sbjct: 158 RY---IKELGKQLSPLTINNGGNIIMVQVENEYG----SYAADKE-YLAAIRDMIKEAGF 209
Query: 190 SVPWVMCQ---QEDA--PDPIINTCNGFYCDGF----TPNSPSKPIMWTENYSGWFLSFG 240
+VP C Q +A + + T NG + + P E Y WF +G
Sbjct: 210 NVPLFTCDGGGQVEAGHVEGALPTLNGVFGEDIFKVVDKYQKGGPYFVAEFYPAWFDEWG 269
Query: 241 Y---AVPF-RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSY 290
+V + RP E L + ++ G + YM+ GGTNF T G TSY
Sbjct: 270 RRHSSVAYERPAEQLDWMLSH-----GVSVSMYMFHGGTNFEYTNGANTGGGYQPQPTSY 324
Query: 291 DYDAPIDEYGFIRQPKWGHLREL 313
DYDAP+ E+G PK+ RE+
Sbjct: 325 DYDAPLGEWGNCY-PKYHAFREV 346
Score = 46.2 bits (108), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 63/274 (22%), Positives = 110/274 (40%), Gaps = 52/274 (18%)
Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
++ F+ E K +F + +E + + +D Y + GK+ + I+
Sbjct: 366 TTTFATVELKESAPLRTAFHQTTQSENVLSMEDLGVDFGYIHYQTTLQKAGKQKLV-IQD 424
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
L A++ ++ K VA ++ + +N +++ TL+IL G NYG
Sbjct: 425 LRDYAVILIDGKQVASLDRRYNQNSVTLN----VSKTPATLEILVENTGRVNYGPDILFN 480
Query: 539 GAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
G+ S +L G L+ G + L K ++ F + +P
Sbjct: 481 RKGITSQVLW----GNEKLA--------GWSITPLPLYKEKVSEMEFGETIKGVPA---- 524
Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
++K TF E KG ++++ GKG WVNG+S+GR+W+
Sbjct: 525 -FHKGTFTV-EKKGDCFVDMSQWGKGAVWVNGKSLGRFWNI------------------- 563
Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHE 692
P QTLY +P W+ GEN +V+ E
Sbjct: 564 ---------GPQQTLY-LPAPWLKEGENEIVVFE 587
>gi|195054633|ref|XP_001994229.1| GH23545 [Drosophila grimshawi]
gi|193896099|gb|EDV94965.1| GH23545 [Drosophila grimshawi]
Length = 639
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 112/335 (33%), Positives = 158/335 (47%), Gaps = 36/335 (10%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
V Y++ + DG+ SGS HY R+ PE W +R + GL + TYV W+ H P
Sbjct: 27 TVDYENDRFLKDGQPFRFISGSFHYFRAHPETWSRHLRTMRAAGLNAVTTYVEWSLHNPR 86
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRTTN 122
G Y + G DL RF++ + L + LR GPY CAE + GGFP W L PGIQ RT +
Sbjct: 87 DGVYVWTGIADLERFIRLAVDEDLLVILRPGPYICAERDMGGFPYWLLKKYPGIQLRTAD 146
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
+ E++ + A++ +++ GGPII+ QVENEYG +Y Y W D
Sbjct: 147 INYLSEVRIWYAQL--MVRMSPFLYGNGGPIIMVQVENEYG----SYFACDVNYRNWLRD 200
Query: 183 -TAVNLNTSVPWVMCQQEDAPDPI----INTCNGFYCDGFTPN-----------SPSKPI 226
T ++N + D P + I G T N P P+
Sbjct: 201 ETQSHVNGK---AVLFTNDGPSVLRCGKIQGVLATMDFGATSNLKDTWARLRRFEPKGPL 257
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG---- 282
+ E Y GW + + + + E+G + N+YM++GGTNFG TAG
Sbjct: 258 VNAEYYPGWLTHWTEPMANVSTDSITGTFIDMLESGASV-NFYMFYGGTNFGFTAGANDN 316
Query: 283 --GPLVA--TSYDYDAPIDEYGFIRQPKWGHLREL 313
G +A TSYDYDAP+ E G PK+ LR +
Sbjct: 317 NPGKYIADITSYDYDAPMTEAG-DPTPKYMALRRI 350
>gi|281352249|gb|EFB27833.1| hypothetical protein PANDA_007660 [Ailuropoda melanoleuca]
Length = 626
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 112/344 (32%), Positives = 161/344 (46%), Gaps = 26/344 (7%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+ Y H + DG+ SGSIHY R W + + K K GL I++YV WN+HEP
Sbjct: 8 IDYSHNRFLKDGRPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQSYVPWNFHEPQP 67
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
GQY F G D+ F+K E GL + LR GPY CAEW+ GG P WL I R+++
Sbjct: 68 GQYQFSGEHDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDPD 127
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV-------------EWAYGV 171
+ + ++L ++ MK L GGPII QVENEYG+ + Y +
Sbjct: 128 YLAAVDKWLGVLLPKMKP--LLYQNGGPIITVQVENEYGSYFSCDYDHLRFLQKLFHYHL 185
Query: 172 GGELYVKWAADTAVNLNTSVPWVMCQQEDAP-DPIINTCNGFYCDGFTPNSPSKPIMWTE 230
G ++ + + D A + + P N F + P P++ +E
Sbjct: 186 GNDVLL-FTTDGAHEMFLKCGALQGLYATVDFGPGANITAAFEIQ--RKSEPRGPLVNSE 242
Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL----V 286
Y+GW +G E +A A+ G N YM+ GGTNF G +
Sbjct: 243 FYTGWLDHWGQPHSTAKTEVVASALHEILSRGANV-NLYMFIGGTNFAYWNGANMPYQAQ 301
Query: 287 ATSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIKLCEEYLISSDP 329
TSYDYDAP+ E G + + K+ LR+ + K K+ E ++ S P
Sbjct: 302 PTSYDYDAPLSEAGDLTE-KYFALRDVIRKFEKVPEGFIPPSTP 344
>gi|383812458|ref|ZP_09967896.1| glycosyl hydrolase family 35 [Prevotella sp. oral taxon 306 str.
F0472]
gi|383355018|gb|EID32564.1| glycosyl hydrolase family 35 [Prevotella sp. oral taxon 306 str.
F0472]
Length = 608
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 110/327 (33%), Positives = 159/327 (48%), Gaps = 30/327 (9%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQY-YFE 70
+ DGK + SG +HY R W ++ K GL + TY+FWN+HE G + +
Sbjct: 28 FIYDGKPTQIHSGEMHYARVPAPYWRHRMKMMKAMGLNAVATYIFWNHHETSPGVWDWST 87
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G +L +F+KT E GL + LR GPY CAEW +GG+P WL + RT N PF + +
Sbjct: 88 GTHNLRQFIKTAGEEGLMVILRPGPYCCAEWEFGGYPWWLPKNKDLVIRTDNKPFLDSCR 147
Query: 131 RFLAKIIDLMKQE-NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTA---V 185
++ + L KQ +L +QGGP+I+ Q ENE+G+ V + E + ++AA +
Sbjct: 148 VYINQ---LAKQVLDLQVTQGGPVIMVQAENEFGSYVAQRKDIPLETHKRYAAQIRQLLL 204
Query: 186 NLNTSVPWVMCQ-----QEDAPDPIINTCNGF-YCDGFTP-----NSPSKPIMWTENYSG 234
+ +VP + A + + T NG D + P M E Y G
Sbjct: 205 DAGFTVPMFTSDGSWLFKGGAIEGALPTANGEGDIDKLKKVVNEYHGGVGPYMVAEFYPG 264
Query: 235 WFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV-------- 286
W + P E + ++ + G +F NYYM GGTNFG +AG
Sbjct: 265 WLSHWAEPFPRVSTESVVKQTKKYLDNGISF-NYYMVHGGTNFGFSAGANYSNATNIQPD 323
Query: 287 ATSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDAPI E G+ PK+ LR+L
Sbjct: 324 MTSYDYDAPISEAGW-ATPKYNALRDL 349
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 58/225 (25%), Positives = 92/225 (40%), Gaps = 57/225 (25%)
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGIN-TLDILSMMVGLQNY 531
+ ++ L A+V+VN G + +E++ N TLDIL +G NY
Sbjct: 422 LMRMKGLADYAIVYVN------GEKKGELNKVFDKDSMEIDIPFNSTLDILVENMGRINY 475
Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG-- 589
GA + G+ I ID + +GEW + L S+ +++ G
Sbjct: 476 GARIVESAKGITRPITID-----DNEITGEW--------QMYPLPMASMPDTNRLPAGYK 522
Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
+ +PV Y +F E G L++A GKG +VNG ++GRYW
Sbjct: 523 AGMPV-----LYSGSFNL-EKVGDTFLDMAHWGKGIVFVNGINLGRYWKV---------- 566
Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEEL 694
P QTLY +P +++ G+N +VI E+L
Sbjct: 567 ------------------GPQQTLY-LPGCYLNKGKNDIVIFEQL 592
>gi|3025876|gb|AAC12775.1| lysosomal beta-galactosidase [Canis lupus familiaris]
Length = 662
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 115/348 (33%), Positives = 164/348 (47%), Gaps = 32/348 (9%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+ Y H + DG+ SGSIHY R W + + K K GL I+TYV WN+HEP
Sbjct: 28 TIDYSHNRFLKDGQPFRYISGSIHYSRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQ 87
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
GQY F G D+ F+K E GL + LR GPY CAEW+ GG P WL I R+++
Sbjct: 88 PGQYQFSGEQDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDP 147
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
+ + ++L ++ MK L GGPII QVENEYG +Y Y+++
Sbjct: 148 DYLAAVDKWLGVLLPKMKP--LLYQNGGPIITMQVENEYG----SYFTCDYDYLRFLQKL 201
Query: 184 -AVNLNTSVPWVMCQQEDAPDPIIN--TCNGFYCD-GFTPNS-------------PSKPI 226
+L V ++ + A + + G Y F P + P P+
Sbjct: 202 FHHHLGNDV--LLFTTDGANEKFLQCGALQGLYATVDFGPGANITAAFQIQRKSEPKGPL 259
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL- 285
+ +E Y+GW +G E +A ++ G N YM+ GGTNF G +
Sbjct: 260 VNSEFYTGWLDHWGQPHSTVRTEVVASSLHDILAHGANV-NLYMFIGGTNFAYWNGANMP 318
Query: 286 ---VATSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIKLCEEYLISSDP 329
TSYDYDAP+ E G + + K+ LRE + K K+ E ++ S P
Sbjct: 319 YQAQPTSYDYDAPLSEAGDLTE-KYFALREVIRKFEKVPEGFIPPSTP 365
>gi|228950355|ref|ZP_04112522.1| Beta-galactosidase [Bacillus thuringiensis serovar monterrey BGSC
4AJ1]
gi|228809313|gb|EEM55767.1| Beta-galactosidase [Bacillus thuringiensis serovar monterrey BGSC
4AJ1]
Length = 591
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 107/317 (33%), Positives = 156/317 (49%), Gaps = 36/317 (11%)
Query: 10 RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
+ ++DG+ + SG++HY R PE W + K G +ETYV WN HEP G + F
Sbjct: 8 KDFMLDGEPIKIISGALHYFRIVPEYWDHSLYNLKALGCNTVETYVPWNIHEPKEGVFNF 67
Query: 70 EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
EG DLV++V+ Q+ GL + LR PY CAEW +GG P WL I+ R+ N F +++
Sbjct: 68 EGIADLVKYVQLAQKYGLMVILRPTPYICAEWEFGGLPAWLLKYKDIRVRSNTNLFLDKV 127
Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
+ F ++ ++ L GGPII+ QVENEYG ++G E YV+ +L+
Sbjct: 128 ENFYKVLLPMVTP--LQVENGGPIIMMQVENEYG----SFGNDKE-YVRSIKKIMRDLDV 180
Query: 190 SVP-------WVMCQQEDA--PDPIINTCN-GFYCDG--------FTPNSPSKPIMWTEN 231
+VP W + + D ++ T N G + N P+M E
Sbjct: 181 TVPLFTSDGAWQEALESGSLIDDDVLVTGNFGSRSNENLNELESFIKENKKEWPLMCMEF 240
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------- 283
+ GWF +G + R +LA V + N+YM+ GGTNFG G
Sbjct: 241 WDGWFNRWGMEIIRRDGSELAEEVKELLKRASI--NFYMFQGGTNFGFMNGCSSRENVDL 298
Query: 284 PLVATSYDYDAPIDEYG 300
P + TSYDYDA + E+G
Sbjct: 299 PQI-TSYDYDALLTEWG 314
>gi|423295092|ref|ZP_17273219.1| hypothetical protein HMPREF1070_01884 [Bacteroides ovatus
CL03T12C18]
gi|392673998|gb|EIY67449.1| hypothetical protein HMPREF1070_01884 [Bacteroides ovatus
CL03T12C18]
Length = 775
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 110/323 (34%), Positives = 160/323 (49%), Gaps = 34/323 (10%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
I+GK L G +HYPR E W + ++++ GL + YVFWN+HE G++ F
Sbjct: 36 TFTIEGKDIQLICGEMHYPRIPHEYWRDRLKRASAMGLNTVSAYVFWNFHERQPGEFDFS 95
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F++T QE GL++ LR GPY CAEW++GG+P WL + +R+ + F +
Sbjct: 96 GQADIAEFIRTAQEEGLYVILRPGPYVCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCE 155
Query: 131 RFLAKIIDLMKQEN-LFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
R+ I +L KQ + L + GG II+ QVENEYG +Y E Y+ D
Sbjct: 156 RY---IKELGKQLSPLTINNGGNIIMVQVENEYG----SYAADKE-YLAAIRDMIKEAGF 207
Query: 190 SVPWVMCQ---QEDA--PDPIINTCNGFYCDGF----TPNSPSKPIMWTENYSGWFLSFG 240
+VP C Q +A + + T NG + + P E Y WF +G
Sbjct: 208 NVPLFTCDGGGQVEAGHVEGALPTLNGVFGEDIFKVVDKYQKGGPYFVAEFYPAWFDEWG 267
Query: 241 Y---AVPF-RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSY 290
+V + RP E L + ++ G + YM+ GGTNF T G TSY
Sbjct: 268 RRHSSVAYERPAEQLDWMLSH-----GVSVSMYMFHGGTNFEYTNGANTGGGYQPQPTSY 322
Query: 291 DYDAPIDEYGFIRQPKWGHLREL 313
DYDAP+ E+G PK+ RE+
Sbjct: 323 DYDAPLGEWGNCY-PKYHAFREV 344
Score = 46.2 bits (108), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 63/274 (22%), Positives = 110/274 (40%), Gaps = 52/274 (18%)
Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
++ F+ E K +F + +E + + +D Y + GK+ + I+
Sbjct: 364 TTTFATVELKESAPLRTAFHQTTQSENVLSMEDLGVDFGYIHYQTTLQKAGKQKLV-IQD 422
Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
L A++ ++ K VA ++ + +N +++ TL+IL G NYG
Sbjct: 423 LRDYAVILIDGKQVASLDRRYNQNSVTLN----VSKTPATLEILVENTGRVNYGPDILFN 478
Query: 539 GAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
G+ S +L G L+ G + L K ++ F + +P
Sbjct: 479 RKGITSQVLW----GNEKLA--------GWSITPLPLYKEKVSEMEFGETIKGVPA---- 522
Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
++K TF E KG ++++ GKG WVNG+S+GR+W+
Sbjct: 523 -FHKGTFTV-EKKGDCFVDMSQWGKGAVWVNGKSLGRFWNI------------------- 561
Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHE 692
P QTLY +P W+ GEN +V+ E
Sbjct: 562 ---------GPQQTLY-LPAPWLKEGENEIVVFE 585
>gi|257143787|emb|CAZ44333.1| beta-D-galactosidase [Paenibacillus thiaminolyticus]
Length = 583
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 113/349 (32%), Positives = 167/349 (47%), Gaps = 35/349 (10%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
++YD + + L SG+IHY R P W + +RK K G IETYV WN HEP
Sbjct: 2 TTLSYDEGQFKMGDRPIQLISGAIHYFRIVPAYWEDRLRKIKAMGCNCIETYVAWNVHEP 61
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G+++FE D+ FV+ E GL++ +R PY CAEW +GG P WL ++ R +
Sbjct: 62 REGEFHFERMADVAEFVRLAGELGLYVIVRPSPYICAEWEFGGLPAWL-LKDDMRLRCND 120
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
F E++ + ++ + L A++GGPII Q+ENEYG +YG + Y++ A
Sbjct: 121 PRFLEKVSAYYDALLPQLTP--LLATKGGPIIAVQIENEYG----SYG-NDQAYLQ--AQ 171
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIIN--TCNGFYC------------DGFTPNSPSKPIMW 228
A+ + V ++ + D ++ G D P P+M
Sbjct: 172 RAMLIERGVDVLLFTSDGPQDDMLQGGMAEGVLATVNFGSRPKEAFDKLKEYQPDGPLMC 231
Query: 229 TENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG----- 283
E ++GWF + R +D A + G + N+YM GGTNFG +G
Sbjct: 232 MEYWNGWFDHWFEPHHTRDAKDAARVLDDMLGMGASV-NFYMVHGGTNFGFGSGANHSDK 290
Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIKLCEEYLISSDP 329
P V TSYDYDA I E G + PK+ RE + K + L E L ++ P
Sbjct: 291 YEPTV-TSYDYDAAISEAGDL-TPKYHAFREVIGKYVSLPEGELPANTP 337
>gi|313245457|emb|CBY40184.1| unnamed protein product [Oikopleura dioica]
Length = 620
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 106/330 (32%), Positives = 153/330 (46%), Gaps = 52/330 (15%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+++YD + + + L SGS+HY R + W + + K K GL + TYV WN HEP
Sbjct: 9 SLSYDSKNFYLGEEPTQLLSGSVHYFRIPKKYWYDRLAKLKSAGLNGVTTYVPWNLHEPE 68
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
G++ F G D+V F+ + LF+ LR GPY C+EW +GG P WL ++ RT +
Sbjct: 69 PGEFSFSGELDIVHFINIARTLDLFVILRPGPYICSEWEWGGLPAWLLRDSFMKVRTNYS 128
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
+ +KRF ++I L+K + + GGPI+ QVENEYG +A
Sbjct: 129 GYITAVKRFFGQLIPLIKYQQ--SKYGGPIVAVQVENEYG--------------MYAGQD 172
Query: 184 AVNLNTSVPWVMCQQEDAPDPII---------NTCNGFYCDG-----FTPNS-------- 221
+LNT + + E +P+ N N Y DG F N
Sbjct: 173 GAHLNTLAE--LLKNEGIVEPLFTSDGSSVWDNEKNTIYEDGLKSVNFKSNPEKHLKSLR 230
Query: 222 ---PSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG 278
P +P+ E ++GWF +G D + + + N+YM+ GGTNFG
Sbjct: 231 GHFPEQPLWVMEFWAGWFDWWGEGRNLFDNSDFQKNLDVILDHKASL-NFYMFHGGTNFG 289
Query: 279 RTAGGPLVA--------TSYDYDAPIDEYG 300
T GG +A TSYDYD PI E G
Sbjct: 290 FTNGGLTIARGYYTADVTSYDYDCPISEAG 319
>gi|219870459|ref|YP_002474834.1| beta-galactosidase [Haemophilus parasuis SH0165]
gi|219690663|gb|ACL31886.1| beta-galactosidase, glucosyl hydrolase family protein [Haemophilus
parasuis SH0165]
Length = 596
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 105/320 (32%), Positives = 156/320 (48%), Gaps = 40/320 (12%)
Query: 9 HRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYY 68
+ +++GK + SG++HY R PE W + + K G +ETYV WN H+P Q+
Sbjct: 7 EKDFLLNGKPFKILSGAVHYFRIVPEYWYKTLYNLKAMGCNTVETYVPWNLHQPQPDQFN 66
Query: 69 FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEE 128
F R DLV+F++T ++ GL++ LR PY CAEW +GG P WL IP I+ R + F E
Sbjct: 67 FSKRADLVKFLQTAKDLGLYVILRPTPYICAEWEFGGLPAWLLNIPNIRLRQNDPLFIAE 126
Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN 188
+ R+ +++ + + +QGG I++ Q+ENEYG ++G + Y++ +
Sbjct: 127 IDRYFQELLPRIAPYQI--TQGGNILMMQIENEYG----SFG-NDKNYLRAIRALMLIHG 179
Query: 189 TSVP-------WVMCQQEDA--PDPIINTCN------------GFYCDGFTPNSPSKPIM 227
+VP W + A D I+ T N Y D + S P+M
Sbjct: 180 VNVPLFTSDGAWQNALEAGALIEDDILPTGNFGSRSNENLDELQRYIDK---HGKSYPLM 236
Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RT 280
E + GWF + V R +DLA E N+YM+ GGTNFG R
Sbjct: 237 CMEFWDGWFNRWKEPVIRRDAQDLANCTKELLERASI--NFYMFQGGTNFGFWNGCSARL 294
Query: 281 AGGPLVATSYDYDAPIDEYG 300
TSYDYDAP+ E+G
Sbjct: 295 DTDLPQVTSYDYDAPVHEWG 314
>gi|313241555|emb|CBY33800.1| unnamed protein product [Oikopleura dioica]
Length = 571
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 105/331 (31%), Positives = 153/331 (46%), Gaps = 25/331 (7%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+T D +DGK + SG+IHY R + W ++ + GL I+ Y+ WN HE R
Sbjct: 8 LTADGDTFKLDGKDFRILSGAIHYFRIPKQSWKHRLQSVVDCGLNTIDVYIPWNLHEKER 67
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G + F G DLV F E GL + R GPY C+EW++GG P WL P + R+
Sbjct: 68 GNFDFGGELDLVEFFTIAAEMGLKVLCRPGPYICSEWDWGGLPSWLLKDPKMHIRSNYCG 127
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
++ + + +K++ L+ L S GGPII QVENEYG+ Y ++ W AD
Sbjct: 128 YQAAVSSYFSKLLPLLAP--LQHSNGGPIIAFQVENEYGD----YVDKDNEHLPWLADLM 181
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNS-----PSKPIMWTENYSGWFLSF 239
+ + + I N TP S P+KP++ TE ++GWF +
Sbjct: 182 KSHGLFELFFISDGGHT----IRKANMLKLTKSTPISLKSLQPNKPMLVTEFWAGWFDYW 237
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV--------ATSYD 291
G+ + + + G + N+YM+ GGTNFG G + TSYD
Sbjct: 238 GHGRNLLNNDVFEKTLKEILKRGASV-NFYMFHGGTNFGFMNGAIELEKGYYTADVTSYD 296
Query: 292 YDAPIDEYGFIRQPKWGHLRELHKAIKLCEE 322
YD P+DE G R KW ++ K E
Sbjct: 297 YDCPVDESG-NRTEKWEIIKRCLDVQKTSSE 326
>gi|395541292|ref|XP_003772579.1| PREDICTED: beta-galactosidase [Sarcophilus harrisii]
Length = 673
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 114/353 (32%), Positives = 159/353 (45%), Gaps = 44/353 (12%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+ Y+ + DGK SGSIHY R W + + K K GL IETYV WN+HEP
Sbjct: 63 IDYEGDQFLKDGKPFRYISGSIHYSRIPRFYWKDRLFKMKMAGLNAIETYVPWNFHEPFP 122
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
GQY F G DL F++ V E GL + LR GPY CAEW+ GG PVWL I R+++
Sbjct: 123 GQYQFSGEQDLEYFLQLVHEVGLLVILRPGPYICAEWDMGGLPVWLLEKKSIFLRSSDPD 182
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN-------------------- 164
+ + + ++L ++ MK GGPII QVENEYG+
Sbjct: 183 YLKAVDKWLEVLLPKMKP--YLYQNGGPIITVQVENEYGSYFACDYNYLRFLLKVFRQHL 240
Query: 165 ----VEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPN 220
V + GE Y+K T +L +V + N F
Sbjct: 241 GEEVVLFTTDGAGENYLK--CGTLQDLYATVDFGTSS---------NITQAFMIQRKV-- 287
Query: 221 SPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRT 280
P P++ +E Y+GW +G + +++ ++ G N YM+ GGTNFG
Sbjct: 288 EPKGPLVNSEFYTGWLDHWGESHQTVSTKNIVASLTDMLSRGANV-NLYMFIGGTNFGFW 346
Query: 281 AGGPL----VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
G + TSYDYDAP+ E G + + + + K KL E + S P
Sbjct: 347 NGANMPYLPQPTSYDYDAPLSEAGDLTEKYYAVREAIGKFEKLPEGPIPPSTP 399
>gi|326779952|ref|ZP_08239217.1| glycoside hydrolase family 35 [Streptomyces griseus XylebKG-1]
gi|326660285|gb|EGE45131.1| glycoside hydrolase family 35 [Streptomyces griseus XylebKG-1]
Length = 648
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 124/384 (32%), Positives = 177/384 (46%), Gaps = 47/384 (12%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK L SG++HY R W + GL +ETYV WN HEP G+ G
Sbjct: 13 LDGKPVRLLSGALHYFRVHEAQWEHRLAMLAAMGLNCVETYVPWNLHEPREGEVRDVG-- 70
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
L RF+ V+ AGL+ +R GPY CAEW GG PVW+ G + RT + ++ ++R+
Sbjct: 71 ALGRFLDAVERAGLWAIVRPGPYICAEWENGGLPVWVTGRFGRRVRTRDAAYRAVVERWF 130
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
+++ + + + S+GGP++L Q ENEYG+ YG +Y++W A +VP
Sbjct: 131 RELLPQVVRRQV--SRGGPVVLVQAENEYGS----YG-SDAVYLEWLAGLLRQCGVTVPL 183
Query: 194 VMCQQEDAPDP----------IINTCN-------GFYCDGFTPNSPSKPIMWTENYSGWF 236
D P+ ++ T N GF + P P+M E + GWF
Sbjct: 184 FTS---DGPEDHMLTGGSVPGLLATANFGSGAREGFKV--LRRHQPGGPLMCMEFWCGWF 238
Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GP-------L 285
+G R E A A+ E G + N YM GGTNFG AG GP
Sbjct: 239 DHWGAEPVRRDPEQAAGALREILECGASV-NVYMAHGGTNFGGWAGANRSGPHQDESFQP 297
Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK 345
TSYDYDAP+DEYG + K+ RE+ +A E L + P L + +
Sbjct: 298 TVTSYDYDAPVDEYGRATE-KFRLFREVLEA--YAEGPLPALPPEPVGLAGPVRVELAEW 354
Query: 346 SS-NDCAAFLANYDSSSDANVTFN 368
+S D L + ++ S TF
Sbjct: 355 ASLGDVLEVLGDPETESGVPATFE 378
>gi|83415088|ref|NP_001032730.1| beta-galactosidase precursor [Canis lupus familiaris]
gi|94730362|sp|Q9TRY9.3|BGAL_CANFA RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; Flags: Precursor
gi|76470548|gb|ABA43388.1| lysosomal beta-galactosidase [Canis lupus familiaris]
Length = 668
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 114/346 (32%), Positives = 159/346 (45%), Gaps = 28/346 (8%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+ Y H + DG+ SGSIHY R W + + K K GL I+TYV WN+HEP
Sbjct: 34 TIDYSHNRFLKDGQPFRYISGSIHYSRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQ 93
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
GQY F G D+ F+K E GL + LR GPY CAEW+ GG P WL I R+++
Sbjct: 94 PGQYQFSGEQDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDP 153
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
+ + ++L ++ MK L GGPII QVENEYG +Y Y+++
Sbjct: 154 DYLAAVDKWLGVLLPKMKP--LLYQNGGPIITMQVENEYG----SYFTCDYDYLRFLQKL 207
Query: 184 -AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD-GFTPNS-------------PSKPIMW 228
+L V + G Y F P + P P++
Sbjct: 208 FHHHLGNDVLLFTTDGANEKFLQCGALQGLYATVDFGPGANITAAFQIQRKSEPKGPLVN 267
Query: 229 TENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL--- 285
+E Y+GW +G E +A ++ G N YM+ GGTNF G +
Sbjct: 268 SEFYTGWLDHWGQPHSTVRTEVVASSLHDILAHGANV-NLYMFIGGTNFAYWNGANMPYQ 326
Query: 286 -VATSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIKLCEEYLISSDP 329
TSYDYDAP+ E G + + K+ LRE + K K+ E ++ S P
Sbjct: 327 AQPTSYDYDAPLSEAGDLTE-KYFALREVIRKFEKVPEGFIPPSTP 371
>gi|431593417|ref|ZP_19521746.1| beta-galactosidase [Enterococcus faecium E1861]
gi|430591294|gb|ELB29332.1| beta-galactosidase [Enterococcus faecium E1861]
Length = 595
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 108/314 (34%), Positives = 148/314 (47%), Gaps = 34/314 (10%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DG + SG+IHY R P W + K G +ETY+ WN HEP G + F G
Sbjct: 10 FLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++VRFVK QE L + LR Y CAEW +GG P WL P I+ R+T+ F E++K
Sbjct: 70 FKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDIRVRSTDPRFMEKLKN 129
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ Q+ENEYG +YG+ Y++ + + + V
Sbjct: 130 YYQVL--LPKLAPLQITQGGPVIMMQLENEYG----SYGMEKS-YLRQTKELMLAHSIDV 182
Query: 192 PWVMCQ---QEDAPDPIINTCNGFYCDGFTPNSPSK---------------PIMWTENYS 233
P E I+ + F F +S PIM E +
Sbjct: 183 PLFTSDGAWLEVLDAGILIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWD 242
Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------ 287
GWF +G + R E+LA V E G N YM+ GGTNFG G
Sbjct: 243 GWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQ 300
Query: 288 -TSYDYDAPIDEYG 300
TSYDYDA ++E G
Sbjct: 301 ITSYDYDALLNEAG 314
>gi|29376349|ref|NP_815503.1| glycosyl hydrolase [Enterococcus faecalis V583]
gi|256961697|ref|ZP_05565868.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|257419527|ref|ZP_05596521.1| beta-galactosidase [Enterococcus faecalis T11]
gi|29343812|gb|AAO81573.1| glycosyl hydrolase, family 35 [Enterococcus faecalis V583]
gi|256952193|gb|EEU68825.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|257161355|gb|EEU91315.1| beta-galactosidase [Enterococcus faecalis T11]
Length = 594
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 108/325 (33%), Positives = 158/325 (48%), Gaps = 27/325 (8%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 10 FLLNGQSFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
+ + R ++LA +V G N YM+ GGTNFG G P + TSY
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 303
Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
DYDAP+DE G + + + LH+
Sbjct: 304 DYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|329960218|ref|ZP_08298660.1| beta-galactosidase domain protein [Bacteroides fluxus YIT 12057]
gi|328532891|gb|EGF59668.1| beta-galactosidase domain protein [Bacteroides fluxus YIT 12057]
Length = 1104
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 108/325 (33%), Positives = 153/325 (47%), Gaps = 38/325 (11%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
+++GK V+++ +HYPR W + I+ K G+ I YVFWN HEP G + F
Sbjct: 355 TFLLNGKPFVVKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHEPQPGVFDFT 414
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ DL F + ++ +++ LR GPY CAEW GG P WL I+ R ++ F E +
Sbjct: 415 GQNDLAEFCRLCRQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFIERVG 474
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELY-------------- 176
F + + + ++ GGPII+ QVENEYG+ G ++
Sbjct: 475 IFEKAVAEQVA--DMTIQNGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVTLFQ 532
Query: 177 VKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD-GFTPNS---PSKPIMWTENY 232
WA++ N + W M N G D F P P P+M +E +
Sbjct: 533 CDWASNFTKNGLHDLVWTM-----------NFGTGANIDQQFAPLKKLRPDSPLMCSEFW 581
Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA--- 287
SGWF +G RP D+ + G +F + YM GGTN+G AG P A
Sbjct: 582 SGWFDKWGANHETRPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDV 640
Query: 288 TSYDYDAPIDEYGFIRQPKWGHLRE 312
TSYDYDAPI E G PK+ LR+
Sbjct: 641 TSYDYDAPISESGQT-TPKYWELRK 664
>gi|227518994|ref|ZP_03949043.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|227553614|ref|ZP_03983663.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|293383402|ref|ZP_06629315.1| beta-galactosidase [Enterococcus faecalis R712]
gi|293388945|ref|ZP_06633430.1| beta-galactosidase [Enterococcus faecalis S613]
gi|312907770|ref|ZP_07766761.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|312910388|ref|ZP_07769235.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|422714384|ref|ZP_16771110.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|422715641|ref|ZP_16772357.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|424676529|ref|ZP_18113400.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|424681657|ref|ZP_18118444.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|424683847|ref|ZP_18120597.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|424686250|ref|ZP_18122918.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|424690479|ref|ZP_18127014.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|424695572|ref|ZP_18131955.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|424696689|ref|ZP_18133030.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|424699924|ref|ZP_18136135.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|424703062|ref|ZP_18139196.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|424707441|ref|ZP_18143425.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|424716899|ref|ZP_18146197.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|424720477|ref|ZP_18149578.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|424724025|ref|ZP_18152974.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|424733616|ref|ZP_18162171.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|424744084|ref|ZP_18172389.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|424750408|ref|ZP_18178472.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
gi|227073566|gb|EEI11529.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|227177262|gb|EEI58234.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|291079193|gb|EFE16557.1| beta-galactosidase [Enterococcus faecalis R712]
gi|291081726|gb|EFE18689.1| beta-galactosidase [Enterococcus faecalis S613]
gi|310626798|gb|EFQ10081.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|311289661|gb|EFQ68217.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|315575986|gb|EFU88177.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|315580706|gb|EFU92897.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|402350756|gb|EJU85654.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|402356541|gb|EJU91272.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|402364212|gb|EJU98655.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|402364322|gb|EJU98764.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|402367784|gb|EJV02121.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|402368267|gb|EJV02587.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|402375423|gb|EJV09410.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|402377018|gb|EJV10929.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|402385039|gb|EJV18580.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|402385067|gb|EJV18607.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|402386247|gb|EJV19753.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|402391229|gb|EJV24540.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|402392948|gb|EJV26178.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|402396006|gb|EJV29081.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|402399507|gb|EJV32379.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|402406707|gb|EJV39253.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
Length = 604
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 20 FLLNGQSFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
+ + R ++LA +V G N YM+ GGTNFG G TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314
Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
YDAP+DE G + + + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|347967093|ref|XP_320991.5| AGAP002058-PA [Anopheles gambiae str. PEST]
gi|333469761|gb|EAA01064.5| AGAP002058-PA [Anopheles gambiae str. PEST]
Length = 630
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 107/329 (32%), Positives = 159/329 (48%), Gaps = 24/329 (7%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
++ + + DG+ SGS HY R+ PE W ++R + GL + TY+ W+ HEP+
Sbjct: 33 DIDFQNDTFTKDGQPFQFISGSFHYFRALPESWRHILRSMRAAGLNTVMTYIEWSLHEPM 92
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHF-IPGIQFRTTN 122
GQY +EG +L F++ Q LF+ LR GPY CAE + GGFP WL P I+ RT +
Sbjct: 93 PGQYQWEGIANLEEFIEIAQSENLFVILRPGPYICAERDMGGFPHWLLTKYPSIKLRTYD 152
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGE----LYVK 178
+ E++ + +++ + + GGP+I+ +ENEYG+ + G + L V
Sbjct: 153 TDYLREVQNWYNQLMPRLVR--YLYGNGGPVIMVSIENEYGSFKACDGQYMQFLKNLTVH 210
Query: 179 WAADTAVNLNTSVPWVM-CQQEDAPDP-----IINTCNGFYCDGFTPNSPSKPIMWTENY 232
+ D AV P ++ C P I N N F+ P P++ E Y
Sbjct: 211 FVQDKAVLFTNDGPELLKCGSIPGILPTLDFGITNNPNAFWQQ-LRKYLPKGPLVNAEYY 269
Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA----- 287
GW L+ R + + N+YM+FGGTNFG TAG V
Sbjct: 270 PGW-LTHWMEPTARVDAGMVVNTLKLMLNQKANVNFYMFFGGTNFGFTAGANDVGPGKYS 328
Query: 288 ---TSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDAP+DE G PK+ +R++
Sbjct: 329 ADITSYDYDAPLDEAG-DPTPKYFAIRKV 356
>gi|193695178|ref|XP_001948549.1| PREDICTED: beta-galactosidase-like [Acyrthosiphon pisum]
Length = 640
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 111/340 (32%), Positives = 167/340 (49%), Gaps = 36/340 (10%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V Y+ + DG+ SG +HY R W + I+K K GL I TYV W+ HEP
Sbjct: 31 VDYEKNEFLKDGEVFRYVSGDLHYFRVPKSYWKDRIQKIKAAGLNAITTYVEWSLHEPFP 90
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRTTNN 123
G Y FEG DL F+K +Q+ G++L LR GPY CAE ++GGFP W L+ P RT ++
Sbjct: 91 GTYNFEGMADLEYFIKLIQDEGMYLLLRPGPYICAERDFGGFPYWLLNVTPKGSLRTNDS 150
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVK----- 178
+K+ + ++ + ++ M Q +L+ + GG II+ QVENEYG+ +A +L+++
Sbjct: 151 SYKKYVSQWFSVLMKKM-QPHLYGN-GGNIIMVQVENEYGSY-YACDSDYKLWLRDLLKG 207
Query: 179 WAADTAVNLNTSVPWVMCQQED---APDPIIN-------TCNGFYCDGFTPN-SPSKPIM 227
+ D A+ + C+Q D P P + + N C F N P +
Sbjct: 208 YVEDKALLYTIDI----CRQRDFDCGPIPEVYATVDFGISVNAATCFDFLKNYQKGGPSV 263
Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-- 285
+E Y GW + P +D+ + +F ++YM+ GGTNFG T+G
Sbjct: 264 NSEFYPGWLAHWQEPHPKVNSDDVVNHMKSMLSLNASF-SFYMFHGGTNFGFTSGANTNE 322
Query: 286 ---------VATSYDYDAPIDEYGFIRQPKWGHLRELHKA 316
TSYDYDAPI E G + + + + L A
Sbjct: 323 SDANIGYLPQLTSYDYDAPITEAGDLTEKYFKIKQTLENA 362
>gi|423301385|ref|ZP_17279409.1| hypothetical protein HMPREF1057_02550 [Bacteroides finegoldii
CL09T03C10]
gi|408471986|gb|EKJ90515.1| hypothetical protein HMPREF1057_02550 [Bacteroides finegoldii
CL09T03C10]
Length = 779
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 107/323 (33%), Positives = 156/323 (48%), Gaps = 27/323 (8%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
++DGK V+++ +HY R W I K G+ I Y+FWN HE G++ F
Sbjct: 36 TFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFT 95
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL I RT + + E +
Sbjct: 96 GQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIALRTLDPYYMERVG 155
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
F+ ++ + L ++GG II+ QVENEYG +YG+ + YV D T
Sbjct: 156 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGI-NKPYVSAVRDLVRESGFT 208
Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
VP C +A D +I T N G D P P+M +E +SGWF
Sbjct: 209 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 268
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
+G RP +D+ + + +F + YM GGT FG G + +SYDY
Sbjct: 269 HWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 327
Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
DAPI E G+ + K+ LR+L K
Sbjct: 328 DAPISEAGWTTE-KYFLLRDLLK 349
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 52/117 (44%), Gaps = 37/117 (31%)
Query: 587 KQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTG 646
KQ T+P +YK TF + G L++++ GKG WVNG ++GR+W
Sbjct: 525 KQLPTMPA-----YYKGTFKL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI------- 571
Query: 647 CTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISL 703
P QTL+ +P W+ GEN +++ + G P+K S+
Sbjct: 572 ---------------------GPQQTLF-MPGCWLKKGENEILVLDLKG--PAKASI 604
>gi|313231869|emb|CBY08981.1| unnamed protein product [Oikopleura dioica]
Length = 664
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 106/330 (32%), Positives = 153/330 (46%), Gaps = 52/330 (15%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+++YD + + + L SGS+HY R + W + + K K GL + TYV WN HEP
Sbjct: 53 SLSYDSKNFYLGEEPTQLLSGSVHYFRIPKKYWYDRLAKLKSAGLNGVTTYVPWNLHEPE 112
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
G++ F G D+V F+ + LF+ LR GPY C+EW +GG P WL ++ RT +
Sbjct: 113 PGEFSFSGELDIVHFINIARTLDLFVILRPGPYICSEWEWGGLPPWLLRDSFMKVRTNYS 172
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
+ +KRF ++I L+K + + GGPI+ QVENEYG +A
Sbjct: 173 GYITAVKRFFGQLIPLIKYQQ--SKYGGPIVAVQVENEYG--------------MYAGQD 216
Query: 184 AVNLNTSVPWVMCQQEDAPDPII---------NTCNGFYCDG-----FTPNS-------- 221
+LNT + + E +P+ N N Y DG F N
Sbjct: 217 GAHLNTLAE--LLKNEGIVEPLFTSDGSSVWDNEKNTIYEDGLKSVNFKSNPEKHLKSLR 274
Query: 222 ---PSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG 278
P +P+ E ++GWF +G D + + + N+YM+ GGTNFG
Sbjct: 275 GHFPEQPLWVMEFWAGWFDWWGEGRNLFDNSDFQKNLDVILDHKASL-NFYMFHGGTNFG 333
Query: 279 RTAGGPLVA--------TSYDYDAPIDEYG 300
T GG +A TSYDYD PI E G
Sbjct: 334 FTNGGLTIARGYYTADVTSYDYDCPISEAG 363
>gi|257899628|ref|ZP_05679281.1| glycosyl hydrolase [Enterococcus faecium Com15]
gi|257837540|gb|EEV62614.1| glycosyl hydrolase [Enterococcus faecium Com15]
Length = 595
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 109/314 (34%), Positives = 147/314 (46%), Gaps = 34/314 (10%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DG + SG+IHY R P W + K G +ETY+ WN HEP G + F G
Sbjct: 10 FLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+V+FVK QE L + LR Y CAEW +GG P WL P I+ R+T+ F E++K
Sbjct: 70 FKDIVQFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDIRVRSTDPRFMEKLKN 129
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ Q+ENEYG +YG+ Y++ + + + V
Sbjct: 130 YYQVL--LPKLAPLQITQGGPVIMMQLENEYG----SYGMEKS-YLRQTKELMLAHSIDV 182
Query: 192 PWVM-----CQQEDAPDPIINTC------------NGFYCDGFTPNSPSK-PIMWTENYS 233
P + DA I N F N PIM E +
Sbjct: 183 PLFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWD 242
Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------ 287
GWF +G + R E+LA V E G N YM+ GGTNFG G
Sbjct: 243 GWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQ 300
Query: 288 -TSYDYDAPIDEYG 300
TSYDYDA ++E G
Sbjct: 301 ITSYDYDALLNEAG 314
>gi|419799561|ref|ZP_14324899.1| glycosyl hydrolase family 35 [Streptococcus parasanguinis F0449]
gi|385697826|gb|EIG28233.1| glycosyl hydrolase family 35 [Streptococcus parasanguinis F0449]
Length = 595
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 107/307 (34%), Positives = 150/307 (48%), Gaps = 25/307 (8%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+ G+ + SG+IHY R P W + K G +ETYV WN HEP +GQ+ F GR
Sbjct: 12 LKGQPFKILSGAIHYFRIDPADWYHSLFNLKALGFNTVETYVPWNVHEPRKGQFDFSGRL 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL RF++T Q GL++ +R P+ CAEW +GG P WL ++ R+++ F E + R+
Sbjct: 72 DLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWL-LEEDMRIRSSDPVFIEAVDRYY 130
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
++ L+ + + QGGPI++ QVENEYG+ + AY +K T +
Sbjct: 131 DHLLGLLTRYQV--DQGGPILMMQVENEYGSYGEDKAYLRAIRDLMKEKGVTCPLFTSDG 188
Query: 192 PWVMCQQED--APDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
PW + D + T N G + F P+M E + GWF +
Sbjct: 189 PWRATLRAGNLIEDDLFVTGNFGSKAAYNFGQMQEFFDEYGKKWPLMCMEFWDGWFTRWK 248
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
V R E+LA AV E G N YM+ GGTNFG G G L TSYDY
Sbjct: 249 EPVIQREPEELAEAVHEVLELGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYG 306
Query: 294 APIDEYG 300
A ++E G
Sbjct: 307 ALLNEQG 313
>gi|384513478|ref|YP_005708571.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|430361754|ref|ZP_19426831.1| putative beta-galactosidase [Enterococcus faecalis OG1X]
gi|327535367|gb|AEA94201.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|429512307|gb|ELA01915.1| putative beta-galactosidase [Enterococcus faecalis OG1X]
Length = 604
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L GG I++ Q+ENEYG+ E AY + TA+ +
Sbjct: 139 YYDVLMEKIVPHQLV--NGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTALFFTS 196
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
+ + R ++LA +V G N YM+ GGTNFG G TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314
Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
YDAP+DE G + + + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|300861196|ref|ZP_07107283.1| putative beta-galactosidase [Enterococcus faecalis TUSoD Ef11]
gi|428767294|ref|YP_007153405.1| beta-galactosidase [Enterococcus faecalis str. Symbioflor 1]
gi|300850235|gb|EFK77985.1| putative beta-galactosidase [Enterococcus faecalis TUSoD Ef11]
gi|427185467|emb|CCO72691.1| beta-galactosidase [Enterococcus faecalis str. Symbioflor 1]
Length = 594
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
+ + R ++LA +V G N YM+ GGTNFG G TSYD
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 304
Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
YDAP+DE G + + + LH+
Sbjct: 305 YDAPLDEQGNPTEKYFALQKMLHE 328
>gi|410100792|ref|ZP_11295748.1| hypothetical protein HMPREF1076_04926 [Parabacteroides goldsteinii
CL02T12C30]
gi|409214073|gb|EKN07084.1| hypothetical protein HMPREF1076_04926 [Parabacteroides goldsteinii
CL02T12C30]
Length = 779
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 103/322 (31%), Positives = 154/322 (47%), Gaps = 27/322 (8%)
Query: 10 RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
+ +++GK ++++ IHY R E W I+ K G+ I Y FWN HE G++ F
Sbjct: 37 KTFLLNGKPFIIKAAEIHYTRIPVEYWEHRIQMCKALGMNTICIYAFWNIHEQKPGEFDF 96
Query: 70 EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
G+ D+ F + Q+ G+++ LR GPY C+EW GG P WL IQ RT + F E
Sbjct: 97 SGQNDIAAFCRLAQKNGMYIMLRPGPYVCSEWEMGGLPWWLLKKEDIQLRTNDPYFIERT 156
Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN- 188
+ ++ +I + + ++GG II+ QVENEYG+ + Y+ D +
Sbjct: 157 RIYMNEIGKQLADRQI--TRGGNIIMVQVENEYGSY-----ATDKSYIAKNRDILRDAGF 209
Query: 189 TSVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWF 236
T VP C +A D ++ T N G D P+ P+M +E +SGWF
Sbjct: 210 TDVPLFQCDWSSNFLNNALDDLVWTVNFGTGANIDEQFKKLKEVRPNTPLMCSEFWSGWF 269
Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYD 291
+G R E + + + +F + YM GGT FG G + +SYD
Sbjct: 270 DHWGRKHETRDAETMIAGLRDMLDRNISF-SLYMTHGGTTFGHWGGANSPAYSAMCSSYD 328
Query: 292 YDAPIDEYGFIRQPKWGHLREL 313
YDAPI E G+ PK+ LRE
Sbjct: 329 YDAPISEAGWA-TPKYHKLREF 349
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 59/245 (24%), Positives = 99/245 (40%), Gaps = 59/245 (24%)
Query: 464 VMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILS 523
+P L I+ + A VF++ KL+ G + F I K+ LDIL
Sbjct: 414 TLPAVKAGTTLLIDEVHDWAQVFIDGKLI--GRLDRRRGEFTI--KLPATAAGARLDILI 469
Query: 524 MMVGLQNYGAWFDVA---GAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISL 580
+G N FD A G+ + +++ ++ +L + +Y + V+
Sbjct: 470 EAMGRVN----FDKAIHDRKGITNKVVLITESSSDELKDWQ-VYNLPVD----------- 513
Query: 581 ANSSFWKQGSTLPVNK--SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
SF K P K + +Y+ TF E G + L++ + GKG WVNG+++GR+W
Sbjct: 514 --YSFVKDKKYTPGKKIEAPAYYRATF-NLETPGDVFLDMQTWGKGMVWVNGKAMGRFWE 570
Query: 639 AYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDP 698
P QTL+ +P W+ GEN +++ + G P
Sbjct: 571 I----------------------------GPQQTLF-MPGCWLKKGENEIIVLDLKG--P 599
Query: 699 SKISL 703
K S+
Sbjct: 600 EKASV 604
>gi|256959208|ref|ZP_05563379.1| beta-galactosidase [Enterococcus faecalis DS5]
gi|256949704|gb|EEU66336.1| beta-galactosidase [Enterococcus faecalis DS5]
Length = 594
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 108/325 (33%), Positives = 158/325 (48%), Gaps = 27/325 (8%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
+ + R ++LA +V G N YM+ GGTNFG G P + TSY
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 303
Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
DYDAP+DE G + + + LH+
Sbjct: 304 DYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|336428330|ref|ZP_08608312.1| hypothetical protein HMPREF0994_04318 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336005980|gb|EGN36021.1| hypothetical protein HMPREF0994_04318 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 583
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 107/309 (34%), Positives = 147/309 (47%), Gaps = 34/309 (11%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK + SG++HY R PE W + + K K G +ETYV WN HEP +G++ FEG
Sbjct: 14 LDGKPFKIISGAVHYFRIVPEYWRDRLEKLKAMGANTVETYVPWNMHEPQKGKFVFEGML 73
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
D+ RF+ QE GL++ +R PY CAEW +GG P WL G++ R PF E ++ +
Sbjct: 74 DISRFILLAQELGLYVIVRPSPYICAEWEFGGLPAWLLKEDGMRLRGCYEPFLEAVREYY 133
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
+ + ++ L GGP+IL QVENEYG Y Y++ ++ VP
Sbjct: 134 SVLFPILVP--LQIHHGGPVILMQVENEYG-----YYGDDTRYMETMKQLMLDNGAEVPL 186
Query: 194 VMCQQEDAPDPIINTCNGFYCDGFTPNSPSK---------------PIMWTENYSGWFLS 238
V D P +C T N SK P+M TE + GWF
Sbjct: 187 VTS---DGPMDESLSCGRLPGVLPTGNFGSKTEERFEVLKKYTEGGPLMCTEFWVGWFDH 243
Query: 239 FGYAVPFR-PVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV------ATSYD 291
+G R +E+ + + E G N YM+ GGTNFG G TSYD
Sbjct: 244 WGNGGHMRGNLEESTKDLDKMLEMGHV--NIYMFEGGTNFGFMNGSNYYDELTPDVTSYD 301
Query: 292 YDAPIDEYG 300
YDA + E G
Sbjct: 302 YDAVLTEAG 310
>gi|430368510|ref|ZP_19428251.1| beta-galactosidase [Enterococcus faecalis M7]
gi|429516266|gb|ELA05760.1| beta-galactosidase [Enterococcus faecalis M7]
Length = 594
Score = 153 bits (387), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L GG I++ Q+ENEYG+ E AY + TA+ +
Sbjct: 129 YYDVLMEKIVPHQLV--NGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTALFFTS 186
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
+ + R ++LA +V G N YM+ GGTNFG G TSYD
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 304
Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
YDAP+DE G + + + LH+
Sbjct: 305 YDAPLDEQGNPTEKYFALQKMLHE 328
>gi|255975619|ref|ZP_05426205.1| beta-galactosidase [Enterococcus faecalis T2]
gi|256619294|ref|ZP_05476140.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|256853354|ref|ZP_05558724.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
gi|421514060|ref|ZP_15960775.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
gi|255968491|gb|EET99113.1| beta-galactosidase [Enterococcus faecalis T2]
gi|256598821|gb|EEU17997.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|256711813|gb|EEU26851.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
gi|401672857|gb|EJS79300.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
Length = 594
Score = 153 bits (387), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 108/325 (33%), Positives = 158/325 (48%), Gaps = 27/325 (8%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
+ + R ++LA +V G N YM+ GGTNFG G P + TSY
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 303
Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
DYDAP+DE G + + + LH+
Sbjct: 304 DYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|255692586|ref|ZP_05416261.1| beta-galactosidase [Bacteroides finegoldii DSM 17565]
gi|260621643|gb|EEX44514.1| glycosyl hydrolase family 35 [Bacteroides finegoldii DSM 17565]
Length = 779
Score = 153 bits (386), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 107/323 (33%), Positives = 156/323 (48%), Gaps = 27/323 (8%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
++DGK V+++ +HY R W I K G+ I Y+FWN HE G++ F
Sbjct: 36 TFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFT 95
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL I RT + + E +
Sbjct: 96 GQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKRDIALRTLDPYYMERVG 155
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
F+ ++ + L ++GG II+ QVENEYG +YG+ + YV D T
Sbjct: 156 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGI-NKPYVSAVRDLVRESGFT 208
Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
VP C +A D +I T N G D P P+M +E +SGWF
Sbjct: 209 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 268
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
+G RP +D+ + + +F + YM GGT FG G + +SYDY
Sbjct: 269 HWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 327
Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
DAPI E G+ + K+ LR+L K
Sbjct: 328 DAPISEAGWTTE-KYFLLRDLLK 349
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 52/117 (44%), Gaps = 37/117 (31%)
Query: 587 KQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTG 646
KQ T+P +YK TF + G L++++ GKG WVNG ++GR+W
Sbjct: 525 KQLPTMPA-----YYKGTFKL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI------- 571
Query: 647 CTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISL 703
P QTL+ +P W+ GEN +++ + G P+K S+
Sbjct: 572 ---------------------GPQQTLF-MPGCWLKKGENEILVLDLKG--PAKASI 604
>gi|404372285|ref|ZP_10977584.1| hypothetical protein CSBG_00400 [Clostridium sp. 7_2_43FAA]
gi|226911573|gb|EEH96774.1| hypothetical protein CSBG_00400 [Clostridium sp. 7_2_43FAA]
Length = 593
Score = 153 bits (386), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 105/314 (33%), Positives = 155/314 (49%), Gaps = 38/314 (12%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
ID + + SG++HY R P W + + K G +ETY+ WN HEP G++ FEG
Sbjct: 12 IDDNKFKILSGAVHYFRIHPSQWGDTLFNLKALGFNTVETYIPWNIHEPYEGKFDFEGIK 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
D+ +F+K ++ GL++ LR PY CAEW +GG P WL I+ R++++ F E+++ +
Sbjct: 72 DIEKFIKISEKLGLYVILRPTPYICAEWEFGGLPAWLLKDKEIKLRSSDDNFIEKLRNYY 131
Query: 134 AKII-DLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVP 192
++ L+K + ++GGP+++ QVENEYG +YG E Y++ A VP
Sbjct: 132 NDLLPRLVKYQ---VTKGGPVLMMQVENEYG----SYGNEKE-YLRIVASIMKENGVDVP 183
Query: 193 -------WVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSG 234
W+ + + D I + N D N PIM E + G
Sbjct: 184 LFTSDGTWIEALECGSLIEDDIFVSGNFGSKSKENCDMLKDFILKNGKEWPIMCMEYWDG 243
Query: 235 WFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLV 286
WF +G + R DLA V + G N YM+ GGTNFG G P V
Sbjct: 244 WFNRWGEDIIRRDSIDLAEDVKEMLKIGSI--NLYMFRGGTNFGFMNGCSARGNNDLPQV 301
Query: 287 ATSYDYDAPIDEYG 300
TSYDYDA + E+G
Sbjct: 302 -TSYDYDAILTEWG 314
>gi|422708708|ref|ZP_16766236.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
gi|315036693|gb|EFT48625.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
Length = 604
Score = 153 bits (386), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
+ + R ++LA +V G N YM+ GGTNFG G TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314
Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
YDAP+DE G + + + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|298481696|ref|ZP_06999887.1| beta-galactosidase (Lactase) [Bacteroides sp. D22]
gi|298272237|gb|EFI13807.1| beta-galactosidase (Lactase) [Bacteroides sp. D22]
Length = 778
Score = 153 bits (386), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 107/323 (33%), Positives = 155/323 (47%), Gaps = 27/323 (8%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
++DGK V+++ +HY R W I K G+ I Y+FWN HE G++ F
Sbjct: 35 TFLLDGKPFVVKAAELHYTRIPQAYWSHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFS 94
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F K Q+ G+++ +R GPY CAEW GG P WL + RT + + E +
Sbjct: 95 GQNDIAAFCKLAQQHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVG 154
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
F+ ++ + L ++GG II+ QVENEYG +YG + YV D T
Sbjct: 155 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGT-DKPYVSAVRDLVRESGFT 207
Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
VP C +A D +I T N G D P P+M +E +SGWF
Sbjct: 208 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
+G RP +D+ + + +F + YM GGT FG G + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326
Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
DAPI E G+ + K+ LR+L K
Sbjct: 327 DAPISEAGWTTE-KFFLLRDLLK 348
Score = 40.8 bits (94), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 29/104 (27%), Positives = 48/104 (46%), Gaps = 32/104 (30%)
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
+YK+TF + G L++++ GKG WVNG ++GR+W
Sbjct: 532 YYKSTF-KLDKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------------- 570
Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISL 703
P QTL+ +P W+ GEN +++ + G P+K S+
Sbjct: 571 --------GPQQTLF-MPGCWLKEGENEILVLDLKG--PAKASI 603
>gi|255972505|ref|ZP_05423091.1| beta-galactosidase [Enterococcus faecalis T1]
gi|257422333|ref|ZP_05599323.1| glycosyl hydrolase [Enterococcus faecalis X98]
gi|255963523|gb|EET95999.1| beta-galactosidase [Enterococcus faecalis T1]
gi|257164157|gb|EEU94117.1| glycosyl hydrolase [Enterococcus faecalis X98]
Length = 594
Score = 153 bits (386), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 108/325 (33%), Positives = 158/325 (48%), Gaps = 27/325 (8%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
+ + R ++LA +V G N YM+ GGTNFG G P + TSY
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 303
Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
DYDAP+DE G + + + LH+
Sbjct: 304 DYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|62955063|ref|NP_001017547.1| beta-galactosidase precursor [Danio rerio]
gi|62089564|gb|AAH92166.1| Galactosidase, beta 1 [Danio rerio]
gi|182890870|gb|AAI65636.1| Glb1 protein [Danio rerio]
Length = 651
Score = 153 bits (386), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 109/337 (32%), Positives = 161/337 (47%), Gaps = 25/337 (7%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+V Y + DG+ SGSIHY R W + + K GL I+TYV WN+HE +
Sbjct: 27 SVDYHRNCFLKDGEPFRYISGSIHYSRIPRVYWKDRLLKMYMAGLNAIQTYVPWNFHEAV 86
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
GQY F G DL +F++ Q+ GL + +R GPY CAEW+ GG P WL I R+++
Sbjct: 87 PGQYDFSGDRDLEQFLQLCQDIGLLVIMRPGPYICAEWDMGGLPAWLLKKKDIVLRSSDP 146
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAYGVG-GELYVKW 179
+ + +++ K++ ++K+ GGPII QVENEYG+ ++ Y +L+ +
Sbjct: 147 DYLAAVDKWMGKLLPIIKR--YLYQNGGPIITVQVENEYGSYFACDFNYMRHLSQLFRFY 204
Query: 180 AADTAVNLNTS---VPWVMCQQEDAP------DPIINTCNGFYCDGFTPNSPSKPIMWTE 230
+ AV T + ++ C P N F P P++ +E
Sbjct: 205 LGEEAVLFTTDGAGLGYLKCGSLQGLYATVDFGPGANVTAAFEAQRHV--EPRGPLVNSE 262
Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-----RTAGGPL 285
Y GW +G P + + E G N YM+ GGTNFG T GP
Sbjct: 263 FYPGWLDHWGEKHSVVPTSAVVKTLNEILEIGANV-NLYMFIGGTNFGYWNGANTPYGPQ 321
Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEE 322
TSYDYD+P+ E G + + K+ +RE+ K K E
Sbjct: 322 -PTSYDYDSPLTEAGDLTE-KYFAIREVIKMYKDVPE 356
>gi|251798103|ref|YP_003012834.1| beta-galactosidase [Paenibacillus sp. JDR-2]
gi|247545729|gb|ACT02748.1| Beta-galactosidase [Paenibacillus sp. JDR-2]
Length = 919
Score = 153 bits (386), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 104/337 (30%), Positives = 166/337 (49%), Gaps = 23/337 (6%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V Y+ + I+G++ L S +IHY R E W E++ K+K G+ ++TY WN HEP
Sbjct: 18 VQYNAFSYNINGEQVFLNSAAIHYFRMPKEEWREVLVKAKLAGMNCVDTYFAWNVHEPEE 77
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G++ FEG D F+ E GL++ R GP+ CAEW++GGFP WL+ ++FR +
Sbjct: 78 GEWNFEGDNDCGAFLDLCHELGLWVIARPGPFICAEWDFGGFPYWLNTKKDMKFRAFDMQ 137
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
+ + R++ +II +++ + A GG +IL QVENEYG + A Y+ D
Sbjct: 138 YLTYVDRYMDRIIPIIRDREINA--GGSVILVQVENEYGYL--ASDEVARDYMLHLRDVM 193
Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCN-----GFYCDGFTPNSPSKPIMWTENYSGWFLSF 239
++ VP + C + + N + + P P + TE ++GWF +
Sbjct: 194 LDRGVMVPLITCV--GGAEGTVEGANFWSGADHHYNNLVQKQPDTPKIVTEFWTGWFEHW 251
Query: 240 GYAVPFRPVEDLAFAVARFFET---GGTFQNYYM----YFGGTNFGRTAGGP--LVATSY 290
G P + A R E+ G T ++YM G GRT G + TSY
Sbjct: 252 G--APAATQKTAALYEKRMLESLRAGFTGVSHYMFFGGTNFGGYGGRTVGASDIFMVTSY 309
Query: 291 DYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISS 327
DYDAP+ EYG + K+ + + ++ E L+++
Sbjct: 310 DYDAPLSEYGRVTD-KYNTAKRMSYFVQATESVLLNA 345
>gi|229549776|ref|ZP_04438501.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|312950913|ref|ZP_07769823.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|422692785|ref|ZP_16750800.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|422706430|ref|ZP_16764128.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
gi|422727290|ref|ZP_16783733.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0312]
gi|229305045|gb|EEN71041.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|310631062|gb|EFQ14345.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|315152244|gb|EFT96260.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|315156045|gb|EFU00062.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
gi|315157806|gb|EFU01823.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0312]
Length = 604
Score = 153 bits (386), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
+ + R ++LA +V G N YM+ GGTNFG G TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314
Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
YDAP+DE G + + + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|425056292|ref|ZP_18459750.1| putative beta-galactosidase [Enterococcus faecium 505]
gi|403032128|gb|EJY43702.1| putative beta-galactosidase [Enterococcus faecium 505]
Length = 595
Score = 153 bits (386), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 109/314 (34%), Positives = 147/314 (46%), Gaps = 34/314 (10%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DG + SG+IHY R P W + K G +ETY+ WN HEP G + F G
Sbjct: 10 FLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+V+FVK QE L + LR Y CAEW +GG P WL P I+ R+T+ F E++K
Sbjct: 70 FKDVVQFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLKN 129
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ Q+ENEYG +YG+ Y++ + + + V
Sbjct: 130 YYQVL--LPKLAPLQITQGGPVIMMQLENEYG----SYGMEKS-YLRQTKELMLAHSIDV 182
Query: 192 PWVM-----CQQEDAPDPIINTC------------NGFYCDGFTPNSPSK-PIMWTENYS 233
P + DA I N F N PIM E +
Sbjct: 183 PLFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWD 242
Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------ 287
GWF +G + R E+LA V E G N YM+ GGTNFG G
Sbjct: 243 GWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQ 300
Query: 288 -TSYDYDAPIDEYG 300
TSYDYDA ++E G
Sbjct: 301 ITSYDYDALLNEAG 314
>gi|57619080|ref|NP_001009860.1| beta-galactosidase precursor [Felis catus]
gi|5915775|sp|O19015.1|BGAL_FELCA RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; Flags: Precursor
gi|2547317|gb|AAB81350.1| lysosomal beta-galactosidase [Felis catus]
Length = 669
Score = 153 bits (386), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 114/345 (33%), Positives = 163/345 (47%), Gaps = 28/345 (8%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+ Y H + DG+ SGSIHY R W + + K K GL I+TYV WN+HEP
Sbjct: 35 IDYGHNRFLKDGQPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQP 94
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
GQY F G D+ F+K E GL + LR GPY CAEW+ GG P WL I R+++
Sbjct: 95 GQYQFSGEHDVEYFLKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDPD 154
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAY----------GV 171
+ + ++L ++ MK L GGPII QVENEYG+ ++ Y +
Sbjct: 155 YLAAVDKWLGVLLPKMKP--LLYQNGGPIITVQVENEYGSYFTCDYDYLRFLQRRFRDHL 212
Query: 172 GGE--LYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWT 229
GG+ L+ A ++ + + PD N F + P P++ +
Sbjct: 213 GGDVLLFTTDGAHEKFLQCGALQGIYATVDFGPD--ANITAAFQIQ--RKSEPRGPLVNS 268
Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL---- 285
E Y+GW +G E +A ++ G N YM+ GGTNF G +
Sbjct: 269 EFYTGWLDHWGQPHSRVRTEVVASSLHDVLAHGANV-NLYMFIGGTNFAYWNGANIPYQP 327
Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIKLCEEYLISSDP 329
TSYDYDAP+ E G + K+ LR+ + K K+ E ++ S P
Sbjct: 328 QPTSYDYDAPLSEAGDLTD-KYFALRDVIRKFEKVPEGFIPPSTP 371
>gi|227552575|ref|ZP_03982624.1| possible beta-galactosidase [Enterococcus faecium TX1330]
gi|257896912|ref|ZP_05676565.1| glycosyl hydrolase [Enterococcus faecium Com12]
gi|293379016|ref|ZP_06625170.1| glycosyl hydrolase family 35 [Enterococcus faecium PC4.1]
gi|431750982|ref|ZP_19539676.1| beta-galactosidase [Enterococcus faecium E2620]
gi|227178324|gb|EEI59296.1| possible beta-galactosidase [Enterococcus faecium TX1330]
gi|257833477|gb|EEV59898.1| glycosyl hydrolase [Enterococcus faecium Com12]
gi|292642358|gb|EFF60514.1| glycosyl hydrolase family 35 [Enterococcus faecium PC4.1]
gi|430616240|gb|ELB53164.1| beta-galactosidase [Enterococcus faecium E2620]
Length = 595
Score = 153 bits (386), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 109/314 (34%), Positives = 147/314 (46%), Gaps = 34/314 (10%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DG + SG+IHY R P W + K G +ETY+ WN HEP G + F G
Sbjct: 10 FLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++VRFVK QE L + LR Y CAEW +GG P WL P I+ R+T+ F E++K
Sbjct: 70 FKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLKN 129
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ Q+ENEYG +YG+ Y++ + + + V
Sbjct: 130 YYQVL--LPKLAPLQITQGGPVIMMQLENEYG----SYGMEKS-YLRQTKELMLAHSIDV 182
Query: 192 PWVM-----CQQEDAPDPIINTC------------NGFYCDGFTPNSPSK-PIMWTENYS 233
P + DA I N F N PIM E +
Sbjct: 183 PLFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWD 242
Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------ 287
GWF +G + R E+LA V E G N YM+ GGTNFG G
Sbjct: 243 GWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQ 300
Query: 288 -TSYDYDAPIDEYG 300
TSYDYDA ++E G
Sbjct: 301 ITSYDYDALLNEAG 314
>gi|307275736|ref|ZP_07556876.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
gi|307277830|ref|ZP_07558914.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|307291757|ref|ZP_07571629.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
gi|422685752|ref|ZP_16743965.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
gi|422720681|ref|ZP_16777290.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|422739238|ref|ZP_16794421.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
gi|306497209|gb|EFM66754.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
gi|306505227|gb|EFM74413.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|306507612|gb|EFM76742.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
gi|315029464|gb|EFT41396.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
gi|315032072|gb|EFT44004.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|315144900|gb|EFT88916.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
Length = 604
Score = 153 bits (386), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
+ + R ++LA +V G N YM+ GGTNFG G TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314
Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
YDAP+DE G + + + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|431741495|ref|ZP_19530400.1| beta-galactosidase [Enterococcus faecium E2039]
gi|430601673|gb|ELB39267.1| beta-galactosidase [Enterococcus faecium E2039]
Length = 595
Score = 153 bits (386), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 109/314 (34%), Positives = 147/314 (46%), Gaps = 34/314 (10%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DG + SG+IHY R P W + K G +ETY+ WN HEP G + F G
Sbjct: 10 FLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++VRFVK QE L + LR Y CAEW +GG P WL P I+ R+T+ F E++K
Sbjct: 70 FKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDIRVRSTDPRFMEKLKN 129
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ Q+ENEYG +YG+ Y++ + + + V
Sbjct: 130 YYQVL--LPKLAPLQITQGGPVIMMQLENEYG----SYGMEKS-YLRQTKELMLAHSIDV 182
Query: 192 PWVM-----CQQEDAPDPIINTC------------NGFYCDGFTPNSPSK-PIMWTENYS 233
P + DA I N F N PIM E +
Sbjct: 183 PLFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWD 242
Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------ 287
GWF +G + R E+LA V E G N YM+ GGTNFG G
Sbjct: 243 GWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQ 300
Query: 288 -TSYDYDAPIDEYG 300
TSYDYDA ++E G
Sbjct: 301 ITSYDYDALLNEAG 314
>gi|2623150|gb|AAB86405.1| mutant lysosomal beta-galactosidase [Felis catus]
Length = 669
Score = 153 bits (386), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 114/345 (33%), Positives = 163/345 (47%), Gaps = 28/345 (8%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+ Y H + DG+ SGSIHY R W + + K K GL I+TYV WN+HEP
Sbjct: 35 IDYGHNRFLKDGQPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQP 94
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
GQY F G D+ F+K E GL + LR GPY CAEW+ GG P WL I R+++
Sbjct: 95 GQYQFSGEHDVEYFLKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDPD 154
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAY----------GV 171
+ + ++L ++ MK L GGPII QVENEYG+ ++ Y +
Sbjct: 155 YLAAVDKWLGVLLPKMKP--LLYQNGGPIITVQVENEYGSYFTCDYDYLRFLQRRFRDHL 212
Query: 172 GGE--LYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWT 229
GG+ L+ A ++ + + PD N F + P P++ +
Sbjct: 213 GGDVLLFTTDGAHEKFLQCGALQGIYATVDFGPD--ANITAAFQIQ--RKSEPRGPLVNS 268
Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL---- 285
E Y+GW +G E +A ++ G N YM+ GGTNF G +
Sbjct: 269 EFYTGWLDHWGQPHSRVRTEVVASSLHDVLAHGANV-NLYMFIGGTNFAYWNGANIPYQP 327
Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIKLCEEYLISSDP 329
TSYDYDAP+ E G + K+ LR+ + K K+ E ++ S P
Sbjct: 328 QPTSYDYDAPLSEAGDLTD-KYFALRDVIRKFEKVPEGFIPPSTP 371
>gi|257888197|ref|ZP_05667850.1| glycosyl hydrolase [Enterococcus faecium 1,141,733]
gi|431040248|ref|ZP_19492755.1| beta-galactosidase [Enterococcus faecium E1590]
gi|431763679|ref|ZP_19552228.1| beta-galactosidase [Enterococcus faecium E3548]
gi|257824251|gb|EEV51183.1| glycosyl hydrolase [Enterococcus faecium 1,141,733]
gi|430562100|gb|ELB01353.1| beta-galactosidase [Enterococcus faecium E1590]
gi|430622052|gb|ELB58793.1| beta-galactosidase [Enterococcus faecium E3548]
Length = 595
Score = 153 bits (386), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 109/314 (34%), Positives = 147/314 (46%), Gaps = 34/314 (10%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DG + SG+IHY R P W + K G +ETY+ WN HEP G + F G
Sbjct: 10 FLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++VRFVK QE L + LR Y CAEW +GG P WL P I+ R+T+ F E++K
Sbjct: 70 FKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLKN 129
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ Q+ENEYG +YG+ Y++ + + + V
Sbjct: 130 YYQVL--LPKLAPLQITQGGPVIMMQLENEYG----SYGMEKS-YLRQTKELMLAHSIDV 182
Query: 192 PWVM-----CQQEDAPDPIINTC------------NGFYCDGFTPNSPSK-PIMWTENYS 233
P + DA I N F N PIM E +
Sbjct: 183 PLFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWD 242
Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------ 287
GWF +G + R E+LA V E G N YM+ GGTNFG G
Sbjct: 243 GWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQ 300
Query: 288 -TSYDYDAPIDEYG 300
TSYDYDA ++E G
Sbjct: 301 ITSYDYDALLNEAG 314
>gi|424764212|ref|ZP_18191655.1| putative beta-galactosidase [Enterococcus faecium TX1337RF]
gi|402420907|gb|EJV53177.1| putative beta-galactosidase [Enterococcus faecium TX1337RF]
Length = 595
Score = 153 bits (386), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 109/314 (34%), Positives = 147/314 (46%), Gaps = 34/314 (10%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DG + SG+IHY R P W + K G +ETY+ WN HEP G + F G
Sbjct: 10 FLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++VRFVK QE L + LR Y CAEW +GG P WL P I+ R+T+ F E++K
Sbjct: 70 FKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLKN 129
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ Q+ENEYG +YG+ Y++ + + + V
Sbjct: 130 YYQVL--LPKLAPLQITQGGPVIMMQLENEYG----SYGMEKS-YLRQTKELMLAHSIDV 182
Query: 192 PWVM-----CQQEDAPDPIINTC------------NGFYCDGFTPNSPSK-PIMWTENYS 233
P + DA I N F N PIM E +
Sbjct: 183 PLFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWD 242
Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------ 287
GWF +G + R E+LA V E G N YM+ GGTNFG G
Sbjct: 243 GWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQ 300
Query: 288 -TSYDYDAPIDEYG 300
TSYDYDA ++E G
Sbjct: 301 ITSYDYDALLNEAG 314
>gi|293570811|ref|ZP_06681858.1| beta-galactosidase [Enterococcus faecium E980]
gi|430840422|ref|ZP_19458347.1| beta-galactosidase [Enterococcus faecium E1007]
gi|431064256|ref|ZP_19493603.1| beta-galactosidase [Enterococcus faecium E1604]
gi|431124630|ref|ZP_19498626.1| beta-galactosidase [Enterococcus faecium E1613]
gi|431738579|ref|ZP_19527522.1| beta-galactosidase [Enterococcus faecium E1972]
gi|291609079|gb|EFF38354.1| beta-galactosidase [Enterococcus faecium E980]
gi|430495187|gb|ELA71394.1| beta-galactosidase [Enterococcus faecium E1007]
gi|430566915|gb|ELB06003.1| beta-galactosidase [Enterococcus faecium E1613]
gi|430568897|gb|ELB07927.1| beta-galactosidase [Enterococcus faecium E1604]
gi|430597307|gb|ELB35110.1| beta-galactosidase [Enterococcus faecium E1972]
Length = 595
Score = 153 bits (386), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 109/314 (34%), Positives = 147/314 (46%), Gaps = 34/314 (10%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DG + SG+IHY R P W + K G +ETY+ WN HEP G + F G
Sbjct: 10 FLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++VRFVK QE L + LR Y CAEW +GG P WL P I+ R+T+ F E++K
Sbjct: 70 FKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDIRVRSTDPRFMEKLKN 129
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ Q+ENEYG +YG+ Y++ + + + V
Sbjct: 130 YYQVL--LPKLAPLQITQGGPVIMMQLENEYG----SYGMEKS-YLRQTKELMLAHSIDV 182
Query: 192 PWVM-----CQQEDAPDPIINTC------------NGFYCDGFTPNSPSK-PIMWTENYS 233
P + DA I N F N PIM E +
Sbjct: 183 PLFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWD 242
Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------ 287
GWF +G + R E+LA V E G N YM+ GGTNFG G
Sbjct: 243 GWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQ 300
Query: 288 -TSYDYDAPIDEYG 300
TSYDYDA ++E G
Sbjct: 301 ITSYDYDALLNEAG 314
>gi|257082326|ref|ZP_05576687.1| beta-galactosidase [Enterococcus faecalis E1Sol]
gi|256990356|gb|EEU77658.1| beta-galactosidase [Enterococcus faecalis E1Sol]
Length = 594
Score = 153 bits (386), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 108/325 (33%), Positives = 158/325 (48%), Gaps = 27/325 (8%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
+ + R ++LA +V G N YM+ GGTNFG G P + TSY
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 303
Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
DYDAP+DE G + + + LH+
Sbjct: 304 DYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|327283884|ref|XP_003226670.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Anolis
carolinensis]
Length = 584
Score = 153 bits (386), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 115/330 (34%), Positives = 156/330 (47%), Gaps = 33/330 (10%)
Query: 8 DHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQY 67
D + L+ + R+L GS+HY R E W + + K K GL + TYV WN HE IRG++
Sbjct: 17 DTQFLLEERPFRIL-GGSLHYFRIPREYWKDRLMKMKACGLNTVTTYVPWNLHEAIRGKF 75
Query: 68 YFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKE 127
F G DL F+K +E GL++ LR GPY C+EW+ GG P WL P +Q RTT F E
Sbjct: 76 DFSGNLDLQVFIKMAEEVGLWVILRPGPYICSEWDLGGLPSWLLQDPEMQLRTTYRGFTE 135
Query: 128 EMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNL 187
+ + ++I + L GGPII QVENEYG+ +A Y+K A L
Sbjct: 136 AVDNYFDRLIP--QVVPLQYKYGGPIIAVQVENEYGS--YAQDPSYMTYIKMA------L 185
Query: 188 NTSVPWVMCQQEDAPDPIIN--------TCNGFYCDGF------TPNSPSKPIMWTENYS 233
+ M D D +++ T N D T P M E ++
Sbjct: 186 TSRKIVEMLMTSDNHDGLVSGTVDGALATINFQKLDTAIMVFLSTDQRNKMPKMVMEYWT 245
Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVA 287
GWF S+G +D+ V + + G + N YM+ GGTNFG G
Sbjct: 246 GWFDSWGGLHHVFDADDMVQTVGKVIKLGASI-NLYMFHGGTNFGFLNGAQHSNEYKSTI 304
Query: 288 TSYDYDAPIDEYGFIRQPKWGHLRELHKAI 317
TSYDYDA + E G K+ LR+L I
Sbjct: 305 TSYDYDAVLTESGDYTS-KFFKLRQLFTDI 333
>gi|322437493|ref|YP_004219583.1| glycoside hydrolase family protein [Granulicella tundricola
MP5ACTX9]
gi|321165386|gb|ADW71089.1| glycoside hydrolase family 35 [Granulicella tundricola MP5ACTX9]
Length = 607
Score = 153 bits (386), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 102/333 (30%), Positives = 158/333 (47%), Gaps = 27/333 (8%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ +T D + ++DG+ L SG +HYPR W + +RK++ GL + Y FWN+HE
Sbjct: 23 THRLTTDPQHFLLDGQPFQLISGEMHYPRIPRAAWRDRLRKARAMGLNAVTVYAFWNFHE 82
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
G + F G+ D+ FV+ Q+ GLF+ LR GPY CAEW+ GG+P WL P + R+
Sbjct: 83 EEEGHFDFTGQRDIAEFVRIAQQEGLFVILRPGPYVCAEWDLGGYPSWLLKSPAVNLRSL 142
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
++ + +++ + + L A++GGPI+ QVENEYG+ + + Y+
Sbjct: 143 DSRYIAAADKWMKALGQQLAP--LQAAKGGPILAVQVENEYGSFPDSAQPNAQAYLDRVH 200
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIIN------TCNGFYCDGFTPNS--------PSKPIM 227
L+ + D D + T Y G + S P+ I
Sbjct: 201 QMV--LDAGFKDSLLYTGDGADVLARGTFADLTAGIDYGTGDSARSIALYKKFRPNTNIY 258
Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-- 285
E + GWF +G V +GG+ + YM GGT+FG G +
Sbjct: 259 TAEYWDGWFDHWGAKHEVVDASIHLKEVHDVLTSGGSI-SLYMLHGGTSFGWMNGANIDH 317
Query: 286 -----VATSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDAPIDE G +R P++ +R++
Sbjct: 318 NHYEPDVTSYDYDAPIDEAGQLR-PEYFAMRKV 349
>gi|270295887|ref|ZP_06202087.1| beta-galactosidase [Bacteroides sp. D20]
gi|270273291|gb|EFA19153.1| beta-galactosidase [Bacteroides sp. D20]
Length = 1106
Score = 152 bits (385), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 110/330 (33%), Positives = 150/330 (45%), Gaps = 41/330 (12%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
+++GK V+++ +HYPR W + I+ K G+ I YVFWN HE G + F
Sbjct: 357 TFLLNGKPFVIKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHESQPGVFDFT 416
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ DL F + Q+ +++ LR GPY CAEW GG P WL I+ R ++ F E +
Sbjct: 417 GQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFMERVG 476
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--------------VEWAYGVGGELY 176
F + + + + GGPII+ QVENEYG+ V Y
Sbjct: 477 IFEKAVAEQVA--GMTIQNGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVALFQ 534
Query: 177 VKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD-GFTPNS---PSKPIMWTENY 232
WA++ N + W M N G D F P P P+M +E +
Sbjct: 535 CDWASNFTKNGLHDLVWTM-----------NFGTGANIDQQFAPLKKLRPDSPLMCSEFW 583
Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA--- 287
SGWF +G RP D+ + G +F + YM GGTN+G AG P A
Sbjct: 584 SGWFDKWGANHETRPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDV 642
Query: 288 TSYDYDAPIDEYGFIRQPKWGHLRELHKAI 317
TSYDYDAPI E G W EL KA+
Sbjct: 643 TSYDYDAPISESGQTTPKYW----ELRKAL 668
>gi|317479674|ref|ZP_07938798.1| glycosyl hydrolase family 35 [Bacteroides sp. 4_1_36]
gi|316904175|gb|EFV26005.1| glycosyl hydrolase family 35 [Bacteroides sp. 4_1_36]
Length = 1106
Score = 152 bits (385), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 110/330 (33%), Positives = 150/330 (45%), Gaps = 41/330 (12%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
+++GK V+++ +HYPR W + I+ K G+ I YVFWN HE G + F
Sbjct: 357 TFLLNGKPFVIKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHESQPGVFDFT 416
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ DL F + Q+ +++ LR GPY CAEW GG P WL I+ R ++ F E +
Sbjct: 417 GQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFMERVG 476
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--------------VEWAYGVGGELY 176
F + + + + GGPII+ QVENEYG+ V Y
Sbjct: 477 IFEKAVAEQVA--GMTIQNGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVALFQ 534
Query: 177 VKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD-GFTPNS---PSKPIMWTENY 232
WA++ N + W M N G D F P P P+M +E +
Sbjct: 535 CDWASNFTKNGLHDLVWTM-----------NFGTGANIDQQFAPLKKLRPDSPLMCSEFW 583
Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA--- 287
SGWF +G RP D+ + G +F + YM GGTN+G AG P A
Sbjct: 584 SGWFDKWGANHETRPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDV 642
Query: 288 TSYDYDAPIDEYGFIRQPKWGHLRELHKAI 317
TSYDYDAPI E G W EL KA+
Sbjct: 643 TSYDYDAPISESGQTTPKYW----ELRKAL 668
>gi|297194215|ref|ZP_06911613.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
gi|197722531|gb|EDY66439.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
Length = 590
Score = 152 bits (385), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 110/330 (33%), Positives = 147/330 (44%), Gaps = 40/330 (12%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DG+ L SG++HY R PE WP +R + GL+ +ETYV WN HEP G+Y F+G
Sbjct: 11 LDGRPLRLLSGALHYFRVLPEQWPHRLRMLRAMGLDTVETYVPWNLHEPRPGEYDFDGIA 70
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGI-QFRTTNNPFKEEMKRF 132
DL RF+ +EAGL +R PY CAEW GG P WL P + R + + + R+
Sbjct: 71 DLDRFLHATREAGLHAIVRPSPYICAEWENGGLPWWLLADPEVGALRCQDPAYLAHVDRW 130
Query: 133 LAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVP 192
++I ++ + S+GG +++ QVENEYG+ G Y++ A VP
Sbjct: 131 FDRLIPVVAAHQV--SRGGNVLMVQVENEYGSYGTDTG-----YLEHLAAGLRARGIDVP 183
Query: 193 WVMCQQEDAPDPIINTCNGFYCDGFTPN---------------SPSKPIMWTENYSGWFL 237
D PD T T N P P M E + GWF
Sbjct: 184 LFTS---DGPDDFFLTGGALPGHLATVNFGSRPKEALADLARLRPDDPAMCMEFWCGWFD 240
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------------PL 285
+G R D A + G + N YM GGTNF AG P
Sbjct: 241 HWGTDHVVRDPADAAGVLEELLAAGASV-NVYMAHGGTNFSTWAGANTEDPAAGTGYRPT 299
Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
V TSYDYDAP+DE G + W L +
Sbjct: 300 V-TSYDYDAPVDERGAATEKFWAFREVLER 328
>gi|408677368|ref|YP_006877195.1| Beta-galactosidase, partial [Streptomyces venezuelae ATCC 10712]
gi|328881697|emb|CCA54936.1| Beta-galactosidase, partial [Streptomyces venezuelae ATCC 10712]
Length = 611
Score = 152 bits (385), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 114/333 (34%), Positives = 159/333 (47%), Gaps = 44/333 (13%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DG+ L SG++HY R E W + + GL +ETYV WN HEP G+Y
Sbjct: 11 FLLDGRPVRLLSGALHYFRVREEQWEHRLGMLRAMGLNCVETYVPWNLHEPEPGRYADVA 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
L RF+ V AG++ +R GPY CAEW GG P WL G + R+ + F ++
Sbjct: 71 --ALGRFLDAVARAGMWAIVRPGPYICAEWENGGLPHWLTGPLGRRVRSFDPEFLAPVEA 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ +++ + + + +GGP++L QVENEYG+ YG Y++W A+ +V
Sbjct: 129 WFRRLLPQVVERQI--DRGGPVVLVQVENEYGS----YG-SDRAYLEWLAELLRGCGVAV 181
Query: 192 PWVMCQQEDAPDP----------IINTCN-------GFYCDGFTPNSPSKPIMWTENYSG 234
P D P+ ++ T N GF + PS P+M E + G
Sbjct: 182 PLFTS---DGPEDHMLTGGSVPGVLATANFGSGAREGFAT--LRRHQPSGPLMCMEFWCG 236
Query: 235 WFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG---------GPL 285
WF +G R D A A+ E G + N YM GGTNFG AG GPL
Sbjct: 237 WFDHWGTEHAVRDAADAAEALREILECGASV-NVYMAHGGTNFGGFAGANRAGELHDGPL 295
Query: 286 VA--TSYDYDAPIDEYGFIRQPKWGHLRELHKA 316
A TSYDYDAP+DE G + W RE+ A
Sbjct: 296 RATVTSYDYDAPVDEAGRPTEKFW-RFREVLAA 327
>gi|295086466|emb|CBK67989.1| Beta-galactosidase [Bacteroides xylanisolvens XB1A]
Length = 778
Score = 152 bits (385), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 107/323 (33%), Positives = 154/323 (47%), Gaps = 27/323 (8%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
++DGK V+++ +HY R W I K G+ I Y+FWN HE G++ F
Sbjct: 35 TFLLDGKPFVVKAAELHYTRIPQAYWSHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFS 94
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F K Q+ G+++ +R GPY CAEW GG P WL + RT + + E +
Sbjct: 95 GQNDIAAFCKLAQQHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVG 154
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
F+ ++ + L +GG II+ QVENEYG +YG + YV D T
Sbjct: 155 IFMKEVGKQLAP--LQVDKGGNIIMVQVENEYG----SYGT-DKPYVSAVRDLVRESGFT 207
Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
VP C +A D +I T N G D P P+M +E +SGWF
Sbjct: 208 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
+G RP +D+ + + +F + YM GGT FG G + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326
Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
DAPI E G+ + K+ LR+L K
Sbjct: 327 DAPISEAGWTTE-KFFLLRDLLK 348
Score = 40.8 bits (94), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 32/121 (26%), Positives = 51/121 (42%), Gaps = 33/121 (27%)
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
+YK+TF + G L++++ GKG WVNG ++GR+W
Sbjct: 532 YYKSTFKL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------------- 570
Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEAD 719
P QTL+ +P W+ GEN +++ + G + I L K I + E
Sbjct: 571 --------GPQQTLF-MPGCWLKEGENEILVLDLKGPAKASIKGLKKP---ILDMLREKA 618
Query: 720 P 720
P
Sbjct: 619 P 619
>gi|336404675|ref|ZP_08585368.1| hypothetical protein HMPREF0127_02681 [Bacteroides sp. 1_1_30]
gi|335941579|gb|EGN03432.1| hypothetical protein HMPREF0127_02681 [Bacteroides sp. 1_1_30]
Length = 778
Score = 152 bits (385), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 107/323 (33%), Positives = 154/323 (47%), Gaps = 27/323 (8%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
++DGK V+++ +HY R W I K G+ I Y+FWN HE G++ F
Sbjct: 35 TFLLDGKPFVVKAAELHYTRIPQAYWSHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFS 94
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F K Q+ G+++ +R GPY CAEW GG P WL + RT + + E +
Sbjct: 95 GQNDIAAFCKLAQQHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVG 154
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
F+ ++ + L +GG II+ QVENEYG +YG + YV D T
Sbjct: 155 IFMKEVGKQLAP--LQVDKGGNIIMVQVENEYG----SYGT-DKPYVSAVRDLVRESGFT 207
Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
VP C +A D +I T N G D P P+M +E +SGWF
Sbjct: 208 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
+G RP +D+ + + +F + YM GGT FG G + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326
Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
DAPI E G+ + K+ LR+L K
Sbjct: 327 DAPISEAGWTTE-KFFLLRDLLK 348
Score = 40.8 bits (94), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 29/104 (27%), Positives = 48/104 (46%), Gaps = 32/104 (30%)
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
+YK+TF + G L++++ GKG WVNG ++GR+W
Sbjct: 532 YYKSTFKL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------------- 570
Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISL 703
P QTL+ +P W+ GEN +++ + G P+K S+
Sbjct: 571 --------GPQQTLF-MPGCWLKEGENEILVLDLKG--PAKASI 603
>gi|257416321|ref|ZP_05593315.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
gi|257158149|gb|EEU88109.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
Length = 594
Score = 152 bits (385), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 108/325 (33%), Positives = 158/325 (48%), Gaps = 27/325 (8%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
+ + R ++LA +V G N YM+ GGTNFG G P + TSY
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 303
Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
DYDAP+DE G + + + LH+
Sbjct: 304 DYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|257087085|ref|ZP_05581446.1| beta-galactosidase [Enterococcus faecalis D6]
gi|256995115|gb|EEU82417.1| beta-galactosidase [Enterococcus faecalis D6]
Length = 594
Score = 152 bits (385), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 108/325 (33%), Positives = 158/325 (48%), Gaps = 27/325 (8%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
+ + R ++LA +V G N YM+ GGTNFG G P + TSY
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 303
Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
DYDAP+DE G + + + LH+
Sbjct: 304 DYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|384518826|ref|YP_005706131.1| beta-galactosidase [Enterococcus faecalis 62]
gi|323480959|gb|ADX80398.1| beta-galactosidase [Enterococcus faecalis 62]
Length = 594
Score = 152 bits (385), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 108/325 (33%), Positives = 158/325 (48%), Gaps = 27/325 (8%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
+ + R ++LA +V G N YM+ GGTNFG G P + TSY
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 303
Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
DYDAP+DE G + + + LH+
Sbjct: 304 DYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|423303842|ref|ZP_17281841.1| hypothetical protein HMPREF1072_00781 [Bacteroides uniformis
CL03T00C23]
gi|423307438|ref|ZP_17285428.1| hypothetical protein HMPREF1073_00178 [Bacteroides uniformis
CL03T12C37]
gi|392687173|gb|EIY80470.1| hypothetical protein HMPREF1072_00781 [Bacteroides uniformis
CL03T00C23]
gi|392690047|gb|EIY83318.1| hypothetical protein HMPREF1073_00178 [Bacteroides uniformis
CL03T12C37]
Length = 1106
Score = 152 bits (385), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 110/330 (33%), Positives = 150/330 (45%), Gaps = 41/330 (12%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
+++GK V+++ +HYPR W + I+ K G+ I YVFWN HE G + F
Sbjct: 357 TFLLNGKPFVIKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHESQPGVFDFT 416
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ DL F + Q+ +++ LR GPY CAEW GG P WL I+ R ++ F E +
Sbjct: 417 GQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFMERVG 476
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--------------VEWAYGVGGELY 176
F + + + + GGPII+ QVENEYG+ V Y
Sbjct: 477 IFEKAVAEQVA--GMTIQNGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVALFQ 534
Query: 177 VKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD-GFTPNS---PSKPIMWTENY 232
WA++ N + W M N G D F P P P+M +E +
Sbjct: 535 CDWASNFTKNGLHDLVWTM-----------NFGTGANIDQQFAPLKKLRPDSPLMCSEFW 583
Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA--- 287
SGWF +G RP D+ + G +F + YM GGTN+G AG P A
Sbjct: 584 SGWFDKWGANHETRPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDV 642
Query: 288 TSYDYDAPIDEYGFIRQPKWGHLRELHKAI 317
TSYDYDAPI E G W EL KA+
Sbjct: 643 TSYDYDAPISESGQTTPKYW----ELRKAL 668
>gi|422722062|ref|ZP_16778639.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2137]
gi|424672983|ref|ZP_18109926.1| putative beta-galactosidase [Enterococcus faecalis 599]
gi|315027959|gb|EFT39891.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2137]
gi|402352793|gb|EJU87629.1| putative beta-galactosidase [Enterococcus faecalis 599]
Length = 604
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
+ + R ++LA +V G N YM+ GGTNFG G TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314
Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
YDAP+DE G + + + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|307289344|ref|ZP_07569299.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
gi|422704713|ref|ZP_16762523.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
gi|306499711|gb|EFM69073.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
gi|315163744|gb|EFU07761.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
Length = 604
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
+ + R ++LA +V G N YM+ GGTNFG G TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314
Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
YDAP+DE G + + + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|257079244|ref|ZP_05573605.1| beta-galactosidase [Enterococcus faecalis JH1]
gi|294780244|ref|ZP_06745615.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
gi|397700110|ref|YP_006537898.1| beta-galactosidase [Enterococcus faecalis D32]
gi|256987274|gb|EEU74576.1| beta-galactosidase [Enterococcus faecalis JH1]
gi|294452672|gb|EFG21103.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
gi|397336749|gb|AFO44421.1| beta-galactosidase [Enterococcus faecalis D32]
Length = 594
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 108/325 (33%), Positives = 158/325 (48%), Gaps = 27/325 (8%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
+ + R ++LA +V G N YM+ GGTNFG G P + TSY
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 303
Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
DYDAP+DE G + + + LH+
Sbjct: 304 DYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|257084951|ref|ZP_05579312.1| beta-galactosidase [Enterococcus faecalis Fly1]
gi|256992981|gb|EEU80283.1| beta-galactosidase [Enterococcus faecalis Fly1]
Length = 594
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 108/325 (33%), Positives = 158/325 (48%), Gaps = 27/325 (8%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
+ + R ++LA +V G N YM+ GGTNFG G P + TSY
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 303
Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
DYDAP+DE G + + + LH+
Sbjct: 304 DYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|256762786|ref|ZP_05503366.1| beta-galactosidase [Enterococcus faecalis T3]
gi|256684037|gb|EEU23732.1| beta-galactosidase [Enterococcus faecalis T3]
Length = 594
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 108/325 (33%), Positives = 158/325 (48%), Gaps = 27/325 (8%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
+ + R ++LA +V G N YM+ GGTNFG G P + TSY
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 303
Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
DYDAP+DE G + + + LH+
Sbjct: 304 DYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|422695218|ref|ZP_16753206.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
gi|315147501|gb|EFT91517.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
Length = 604
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
+ + R ++LA +V G N YM+ GGTNFG G TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314
Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
YDAP+DE G + + + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|365860016|ref|ZP_09399844.1| putative beta-galactosidase [Streptomyces sp. W007]
gi|364010544|gb|EHM31456.1| putative beta-galactosidase [Streptomyces sp. W007]
Length = 645
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 125/384 (32%), Positives = 176/384 (45%), Gaps = 47/384 (12%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK L SG++HY R W + GL +ETYV WN HEP G+ G
Sbjct: 13 LDGKPVRLLSGALHYFRVHEAQWEHRLAMLAAMGLNCVETYVPWNLHEPREGEVRDVG-- 70
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
L RF+ V+ AGL+ +R GPY CAEW GG PVW+ G + RT + ++ ++R+
Sbjct: 71 ALGRFLDAVERAGLWAIVRPGPYICAEWENGGLPVWVTGRFGRRVRTRDAAYRAVVERWF 130
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
+++ + Q + S+GGP+IL Q ENEYG+ YG +Y++W A +VP
Sbjct: 131 RELLPQVVQRQV--SRGGPVILVQAENEYGS----YGSDA-VYLEWLAGLLRQCGVTVPL 183
Query: 194 VMCQQEDAPDP----------IINTCN-------GFYCDGFTPNSPSKPIMWTENYSGWF 236
D P+ ++ T N GF + P P+M E + GWF
Sbjct: 184 FTS---DGPEDHMLTGGSVPGLLATANFGSGAREGFEV--LLRHQPRGPLMCMEFWCGWF 238
Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GP-------L 285
+G R E A A+ E G + N YM GGTNFG AG GP
Sbjct: 239 DHWGAEPVRRDPEQAAGALREVLECGASV-NIYMAHGGTNFGGWAGANRSGPHQDESFQP 297
Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK 345
TSYDYDAP+DEYG + K+ RE+ +A E L + P L + +
Sbjct: 298 TVTSYDYDAPVDEYGRATE-KFRLFREVLEA--YAEGPLPALPPEPVGLAGPVRVELAEW 354
Query: 346 SS-NDCAAFLANYDSSSDANVTFN 368
+ D L + ++ S TF
Sbjct: 355 AGLGDVLEALGDPETESGVPPTFE 378
>gi|182439300|ref|YP_001827019.1| beta-galactosidase [Streptomyces griseus subsp. griseus NBRC 13350]
gi|178467816|dbj|BAG22336.1| putative beta-galactosidase [Streptomyces griseus subsp. griseus
NBRC 13350]
Length = 630
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 123/384 (32%), Positives = 176/384 (45%), Gaps = 47/384 (12%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK L SG++HY R W + GL +ETYV WN HEP G+ G
Sbjct: 13 LDGKPVRLLSGALHYFRVHEAQWEHRLAMLAAMGLNCVETYVPWNLHEPREGEVRDVG-- 70
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
L RF+ V+ AGL+ +R GPY CAEW GG PVW+ G + RT + ++ ++R+
Sbjct: 71 ALGRFLDAVERAGLWAIVRPGPYICAEWENGGLPVWVTGRFGRRVRTRDAAYRAVVERWF 130
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
+++ + + + S+GGP++L Q ENEYG+ YG +Y++W A +VP
Sbjct: 131 RELLPQVVRRQV--SRGGPVVLVQAENEYGS----YG-SDAVYLEWLAGLLRQCGVTVPL 183
Query: 194 VMCQQEDAPDP----------IINTCN-------GFYCDGFTPNSPSKPIMWTENYSGWF 236
D P+ ++ T N GF + P P+M E + GWF
Sbjct: 184 FTS---DGPEDHMLTGGSVPGLLATANFGSGAREGFAV--LRRHQPGGPLMCMEFWCGWF 238
Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GP-------L 285
+G R E A A+ E G + N YM GGTNFG AG GP
Sbjct: 239 DHWGAEPVRRDPEQAAGALREILECGASV-NVYMAHGGTNFGGWAGANRSGPHQDESFQP 297
Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK 345
TSYDYDAP+DEYG + K+ RE+ +A E L + P L + +
Sbjct: 298 TVTSYDYDAPVDEYGRATE-KFRLFREVLEA--YAEGPLPALPPEPVGLAGPVRVELAEW 354
Query: 346 SS-NDCAAFLANYDSSSDANVTFN 368
+ D L + ++ S TF
Sbjct: 355 APLGDVLEVLGDPETESGVPATFE 378
>gi|160890905|ref|ZP_02071908.1| hypothetical protein BACUNI_03350 [Bacteroides uniformis ATCC 8492]
gi|156859904|gb|EDO53335.1| glycosyl hydrolase family 35 [Bacteroides uniformis ATCC 8492]
Length = 1106
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 110/330 (33%), Positives = 150/330 (45%), Gaps = 41/330 (12%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
+++GK V+++ +HYPR W + I+ K G+ I YVFWN HE G + F
Sbjct: 357 TFLLNGKPFVIKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHESQPGVFDFT 416
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ DL F + Q+ +++ LR GPY CAEW GG P WL I+ R ++ F E +
Sbjct: 417 GQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFMERVG 476
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--------------VEWAYGVGGELY 176
F + + + + GGPII+ QVENEYG+ V Y
Sbjct: 477 IFEKAVAEQVA--GMTIQNGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVALFQ 534
Query: 177 VKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD-GFTPNS---PSKPIMWTENY 232
WA++ N + W M N G D F P P P+M +E +
Sbjct: 535 CDWASNFTKNGLHDLVWTM-----------NFGTGANIDQQFAPLKKLRPDSPLMCSEFW 583
Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA--- 287
SGWF +G RP D+ + G +F + YM GGTN+G AG P A
Sbjct: 584 SGWFDKWGANHETRPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDV 642
Query: 288 TSYDYDAPIDEYGFIRQPKWGHLRELHKAI 317
TSYDYDAPI E G W EL KA+
Sbjct: 643 TSYDYDAPISESGQTTPKYW----ELRKAL 668
>gi|423215069|ref|ZP_17201597.1| hypothetical protein HMPREF1074_03129 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692332|gb|EIY85570.1| hypothetical protein HMPREF1074_03129 [Bacteroides xylanisolvens
CL03T12C04]
Length = 778
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 107/323 (33%), Positives = 154/323 (47%), Gaps = 27/323 (8%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
++DGK V+++ +HY R W I K G+ I Y+FWN HE G++ F
Sbjct: 35 TFLLDGKPFVVKAAELHYTRIPQAYWSHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFA 94
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F K Q+ G+++ +R GPY CAEW GG P WL + RT + + E +
Sbjct: 95 GQNDIAAFCKLAQQHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVG 154
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
F+ ++ + L +GG II+ QVENEYG +YG + YV D T
Sbjct: 155 IFMKEVGKQLAP--LQVDKGGNIIMVQVENEYG----SYGT-DKPYVSAVRDLVRESGFT 207
Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
VP C +A D +I T N G D P P+M +E +SGWF
Sbjct: 208 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
+G RP +D+ + + +F + YM GGT FG G + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326
Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
DAPI E G+ + K+ LR+L K
Sbjct: 327 DAPISEAGWTTE-KFFLLRDLLK 348
Score = 40.8 bits (94), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 29/104 (27%), Positives = 48/104 (46%), Gaps = 32/104 (30%)
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
+YK+TF + G L++++ GKG WVNG ++GR+W
Sbjct: 532 YYKSTF-KLDKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------------- 570
Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISL 703
P QTL+ +P W+ GEN +++ + G P+K S+
Sbjct: 571 --------GPQQTLF-MPGCWLKEGENEILVLDLKG--PAKASI 603
>gi|380693434|ref|ZP_09858293.1| beta-galactosidase [Bacteroides faecis MAJ27]
Length = 778
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 106/324 (32%), Positives = 155/324 (47%), Gaps = 27/324 (8%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
++DGK V+++ +HY R W I K G+ I Y+FWN HE G++ F
Sbjct: 35 TFLLDGKPFVVKAAELHYTRIPQAYWDHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFT 94
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL + RT + + E +
Sbjct: 95 GQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVG 154
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
F+ ++ + L ++GG II+ QVENEYG +YG + YV D T
Sbjct: 155 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGT-DKPYVSAVRDLVRESGFT 207
Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
VP C +A D +I T N G D P P+M +E +SGWF
Sbjct: 208 DVPLFQCDWSSNFTRNALDDLIWTINFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
+G RP +D+ + + +F + YM GGT FG G + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKEMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326
Query: 293 DAPIDEYGFIRQPKWGHLRELHKA 316
DAPI E G+ + K+ LR+L K
Sbjct: 327 DAPISEAGWTTE-KYFLLRDLLKT 349
>gi|322390566|ref|ZP_08064082.1| beta-galactosidase [Streptococcus parasanguinis ATCC 903]
gi|321142719|gb|EFX38181.1| beta-galactosidase [Streptococcus parasanguinis ATCC 903]
Length = 595
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 108/318 (33%), Positives = 152/318 (47%), Gaps = 41/318 (12%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
A + G+ + SG+IHY R P W + K G +ETYV WN HEP +GQ+ F
Sbjct: 9 AFYLKGQPFKILSGAIHYFRIDPADWYHSLYNLKALGFNTVETYVPWNAHEPRKGQFDFS 68
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
GR DL RF++T Q GL++ +R P+ CAEW +GG P WL ++ R+++ F E +
Sbjct: 69 GRLDLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWL-LEEDLRIRSSDPAFIEAVD 127
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
R+ +++ L+ + QGGPI++ QVENEYG +YG + Y++ D +
Sbjct: 128 RYYDRLLGLLTPYQV--DQGGPILMMQVENEYG----SYGEDKD-YLRAIRDLMKEKGVT 180
Query: 191 VPWVMCQQEDAP------------DPIINTCN---------GFYCDGFTPNSPSKPIMWT 229
P D P + + T N G + F P+M
Sbjct: 181 CPLFTS---DGPWRATLRAGTLIEEDLFVTGNFGSKAAYNFGQMKEFFDEYGKRWPLMCM 237
Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL 285
E + GWF + V R E+LA AV E G N YM+ GGTNFG G G L
Sbjct: 238 EFWDGWFTRWKEPVIQRDPEELAEAVHEVLELGSI--NLYMFHGGTNFGFMNGCSARGTL 295
Query: 286 ---VATSYDYDAPIDEYG 300
TSYDY A ++E G
Sbjct: 296 DLPQVTSYDYGALLNEQG 313
>gi|422866702|ref|ZP_16913314.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
gi|329578150|gb|EGG59560.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
Length = 604
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
+ + R ++LA +V G N YM+ GGTNFG G TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314
Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
YDAP+DE G + + + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|307269354|ref|ZP_07550702.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
gi|306514322|gb|EFM82889.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
Length = 604
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
+ + R ++LA +V G N YM+ GGTNFG G TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314
Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
YDAP+DE G + + + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|312901788|ref|ZP_07761056.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
gi|311291123|gb|EFQ69679.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
Length = 604
Score = 152 bits (384), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
+ + R ++LA +V G N YM+ GGTNFG G TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314
Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
YDAP+DE G + + + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|431758215|ref|ZP_19546843.1| beta-galactosidase [Enterococcus faecium E3083]
gi|430617878|gb|ELB54742.1| beta-galactosidase [Enterococcus faecium E3083]
Length = 595
Score = 152 bits (384), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 108/314 (34%), Positives = 147/314 (46%), Gaps = 34/314 (10%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DG + SG+IHY R P W + K G +ETY+ WN HEP G + F G
Sbjct: 10 FLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++VRFVK QE L + LR Y CAEW +GG P WL P I+ R+T+ F E++K
Sbjct: 70 FKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLKN 129
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ Q+ENEYG +YG+ Y++ + + + +
Sbjct: 130 YYQVL--LPKLAPLQITQGGPVIMMQLENEYG----SYGMEKS-YLRQTKELMLAHSIDI 182
Query: 192 PWVM-----CQQEDAPDPIINTC------------NGFYCDGFTPNSPSK-PIMWTENYS 233
P + DA I N F N PIM E +
Sbjct: 183 PLFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWD 242
Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------ 287
GWF +G + R E+LA V E G N YM+ GGTNFG G
Sbjct: 243 GWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQ 300
Query: 288 -TSYDYDAPIDEYG 300
TSYDYDA ++E G
Sbjct: 301 ITSYDYDALLNEAG 314
>gi|195342884|ref|XP_002038028.1| GM17976 [Drosophila sechellia]
gi|194132878|gb|EDW54446.1| GM17976 [Drosophila sechellia]
Length = 672
Score = 152 bits (384), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 107/325 (32%), Positives = 158/325 (48%), Gaps = 39/325 (12%)
Query: 6 TYDHRA--LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
T DH A ++DG+ SGS HY R+ PE W +R + GL ++TYV W+ H P
Sbjct: 47 TIDHEANTFLLDGQPFRYVSGSFHYFRAVPESWRSRLRTMRASGLNALDTYVEWSLHNPH 106
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHF-IPGIQFRTTN 122
G+Y +EG DLV+F++ QE ++ LR GPY CAE + GG P WL P I+ RT +
Sbjct: 107 DGEYNWEGIADLVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFTKYPSIKMRTND 166
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
+ E+ ++ A+++ + ++LF GG II+ QVENEYG+ + Y+ W D
Sbjct: 167 PNYISEVGKWYAELMP--RLQHLFVGNGGKIIMVQVENEYGDYACDHD-----YLNWLRD 219
Query: 183 --------TAVNLNTSVP--WVMCQQ-------EDAPDPIINTCNGFYCDGFTPNSPSKP 225
A+ +P + C + D IN + + P+ P
Sbjct: 220 ETEKYVSGKALLFTVDIPNEKMSCGKIENVFATTDFGIDRINEIDKIWA-MLRALQPTGP 278
Query: 226 IMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-- 283
++ +E Y GW + R +++A A+ + N YM+FGGTNFG TAG
Sbjct: 279 LVNSEFYPGWLTHWQEQNQRRDGQEVANALRTILSYNASV-NLYMFFGGTNFGFTAGANY 337
Query: 284 --------PLVATSYDYDAPIDEYG 300
TSYDYDA +DE G
Sbjct: 338 NLDGGIGYAADITSYDYDAVMDEAG 362
>gi|339640120|ref|ZP_08661564.1| glycosyl hydrolase family 35 [Streptococcus sp. oral taxon 056 str.
F0418]
gi|339453389|gb|EGP66004.1| glycosyl hydrolase family 35 [Streptococcus sp. oral taxon 056 str.
F0418]
Length = 595
Score = 152 bits (384), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 115/349 (32%), Positives = 163/349 (46%), Gaps = 40/349 (11%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK + SG+I Y R P+ W E + K G +ETY+ W+ HEP GQ+ +G
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRETLHNLKALGYNTVETYIPWSLHEPQEGQFVTDGLL 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
D + VQE GL L +R PY CAE+++GG P WL PG++FR + F E++ RF
Sbjct: 72 DFEAYFDLVQEMGLHLIVRPTPYICAEFDFGGMPPWLLNYPGMRFRVNDALFLEKVSRFY 131
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVP- 192
+ + ++GGPI++ QVENEYG +Y E Y++ A + SVP
Sbjct: 132 DWLFPKLLPYQF--TEGGPILMMQVENEYG----SYAEDKE-YMRNIAKMMRDRGVSVPL 184
Query: 193 ------WVMCQQEDA--PDPIINTCN-GFYCDGFTPN--------SPSKPIMWTENYSGW 235
W+ + D I T N G T N P+M TE + GW
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGNFGSQAKENTDNLRAFMERHGKKWPLMCTEFWDGW 244
Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG--------RTAGGPLVA 287
F +G + R EDLA V G N ++ GGTNFG +T P +
Sbjct: 245 FSRWGEEIVRRDAEDLAQDVKEMMRIGSM--NLFLLRGGTNFGFISGCSARKTRDLPQI- 301
Query: 288 TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGA 336
TSYD+DAP+ E+G + + R H+ E+ DP +K A
Sbjct: 302 TSYDFDAPVTEWGVPTEKYYAVQRVTHELFPELEQ----MDPIIRKARA 346
>gi|256964894|ref|ZP_05569065.1| beta-galactosidase [Enterococcus faecalis HIP11704]
gi|256955390|gb|EEU72022.1| beta-galactosidase [Enterococcus faecalis HIP11704]
Length = 594
Score = 152 bits (384), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 108/324 (33%), Positives = 157/324 (48%), Gaps = 25/324 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 129 YYDVLMEKIVPHQLV--NGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYD 291
+ + R ++LA +V G N YM+ GGTNFG G G + TSYD
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 304
Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
YDAP+DE G + + + LH+
Sbjct: 305 YDAPLDEQGNPTEKYFALQKMLHE 328
>gi|313237463|emb|CBY12650.1| unnamed protein product [Oikopleura dioica]
Length = 583
Score = 152 bits (384), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 105/340 (30%), Positives = 153/340 (45%), Gaps = 31/340 (9%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+T D +DGK + SG+IHY R + W ++ + GL I+ Y+ WN HE R
Sbjct: 8 LTADGDTFKLDGKDFRILSGAIHYFRIPKQSWKHRLQSVVDCGLNTIDVYIPWNLHEKER 67
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
G + F G DLV F E GL + R GPY C+EW++GG P WL P + R+
Sbjct: 68 GNFDFAGELDLVEFFTIAAEMGLKVLCRPGPYICSEWDWGGLPSWLLKDPKMHIRSNYCG 127
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--------VEW------AYG 170
++ + + +K++ L+ L S GGPII QVENEYG+ + W ++G
Sbjct: 128 YQAAVSSYFSKLLPLLAP--LQHSNGGPIIAFQVENEYGDYVDKDNEHLPWLADLMKSHG 185
Query: 171 VGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTE 230
+ ++ T N Q ++ F PN KP++ TE
Sbjct: 186 LFELFFISDGGHTIRKANMLKVRSTAQLNSGSFQLL--AKAFSLKSLQPN---KPMLVTE 240
Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV---- 286
++GWF +G+ E + + G + N+YM+ GGTNFG G +
Sbjct: 241 FWAGWFDYWGHGRNLLNNEVFEKTLKEILKRGASV-NFYMFHGGTNFGFMNGAIELEKGY 299
Query: 287 ----ATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEE 322
TSYDYD P+DE G R KW +R K E
Sbjct: 300 YTADVTSYDYDCPVDESG-NRTEKWEIIRRCLNVQKTSSE 338
>gi|320109257|ref|YP_004184847.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
gi|319927778|gb|ADV84853.1| glycoside hydrolase family 35 [Terriglobus saanensis SP1PR4]
Length = 640
Score = 152 bits (384), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 117/342 (34%), Positives = 165/342 (48%), Gaps = 54/342 (15%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK + +G +HY R W + ++K+K GL I TYVFWN HEP G Y F G+
Sbjct: 35 LDGKPFRILTGEMHYARIPRARWDDAMQKAKALGLNAITTYVFWNVHEPRPGVYDFTGQN 94
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL ++ Q AGL + LR GPYACAEW +GG+P WL P + R+++ F + + ++
Sbjct: 95 DLGEYLAAAQRAGLKVILRPGPYACAEWEFGGYPAWLIKDPTVVVRSSDPKFMKPVAKWF 154
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAY-----------GVGGELYVKWA 180
++ + + A+ GGPII QVENEYG+ + AY G+GG+ K
Sbjct: 155 HRLG--QEVQPYLAANGGPIIAVQVENEYGSFGNDHAYMEQMKDLVISSGIGGKNPKKAV 212
Query: 181 --------ADTAVNLNTSVPWVMCQQEDAPD--PIINTCNG------FYCDGFTPNSPSK 224
DT L T+ V P+ ++N G + F PN P
Sbjct: 213 DEDGKNVPQDTGTMLYTADGGVQLPNGTLPELPAVVNFGGGQAKSELARYEAFRPNGPR- 271
Query: 225 PIMWTENYSGWFLSFG----YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRT 280
M E ++GWF +G V + + + R + + YM +GGT+FG
Sbjct: 272 --MVGEYWAGWFDHWGNNHQKTNAAEQVAEYEYMLKRGYSV-----SLYMLYGGTSFGWM 324
Query: 281 AGG---------PLVATSYDYDAPIDEYGFIRQPKWGHLREL 313
AG P V TSYDYDAPIDE G PK+ LRE+
Sbjct: 325 AGANSGDKAPYEPDV-TSYDYDAPIDERGN-PTPKYFALREV 364
>gi|346320352|gb|EGX89953.1| beta-calactosidase, putative [Cordyceps militaris CM01]
Length = 633
Score = 152 bits (384), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 106/320 (33%), Positives = 159/320 (49%), Gaps = 31/320 (9%)
Query: 6 TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
+Y+ +++G+ + G + R PE W ++ ++ GL I +Y++WN HEP G
Sbjct: 30 SYNRTDFLLNGQPFQIIGGQMDPQRILPEYWTHRLKMARAMGLNTIFSYLYWNLHEPRPG 89
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
+ F GR D+ RF + Q+ GL + LR GPY C E ++GGFP WL +PG+ R N PF
Sbjct: 90 AWDFSGRNDVARFFRLAQQEGLRVVLRPGPYICGERDWGGFPAWLSQVPGMAVRQNNRPF 149
Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
+ K ++ ++ + Q L +QGGPI++AQ+ENEYG ++G AA
Sbjct: 150 LDAAKSYIDRLGKELGQ--LQITQGGPILMAQLENEYG----SFGTDKTYLAALAAMLRE 203
Query: 186 NLNTSV--------PWVMCQQEDAPDPII--NTCNGFYCDGFTPNSPSK--PIMWTENYS 233
N + + ++ Q +I ++ +GF P+ P + E Y
Sbjct: 204 NFDVFLYTNDGGGQSYLEGGQLHGVLAVIDGDSQSGFAARDKYVTDPTSLGPQLNGEYYI 263
Query: 234 GWFLSFGYAVPFRPV----EDLAFAVARFFET--GGTFQNYYMYFGGTNFGRTAG----- 282
W +G P + + D+A AVA T GG + YM+ GGTNFG G
Sbjct: 264 SWIDQWGSDYPHQQIAGSQADVAKAVADLDWTLAGGYSFSIYMFHGGTNFGFENGGIRDD 323
Query: 283 GPLVA--TSYDYDAPIDEYG 300
GPL A TSYDY AP+DE G
Sbjct: 324 GPLAAMTTSYDYGAPLDESG 343
>gi|298386767|ref|ZP_06996322.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
gi|298260441|gb|EFI03310.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
Length = 778
Score = 152 bits (384), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 106/324 (32%), Positives = 155/324 (47%), Gaps = 27/324 (8%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
++DGK V+++ +HY R W I K G+ I Y+FWN HE G++ F
Sbjct: 35 TFLLDGKPFVVKAAELHYTRIPQAYWDHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFT 94
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL + RT + + E +
Sbjct: 95 GQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVG 154
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
F+ ++ + L ++GG II+ QVENEYG +YG + YV D T
Sbjct: 155 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGT-DKPYVSAVRDLVRESGFT 207
Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
VP C +A D +I T N G D P P+M +E +SGWF
Sbjct: 208 DVPLFQCDWSSNFTRNALDDLIWTINFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
+G RP +D+ + + +F + YM GGT FG G + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKEMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326
Query: 293 DAPIDEYGFIRQPKWGHLRELHKA 316
DAPI E G+ + K+ LR+L K
Sbjct: 327 DAPISEAGWTTE-KYYLLRDLLKT 349
>gi|29349062|ref|NP_812565.1| beta-galactosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|383124327|ref|ZP_09944991.1| hypothetical protein BSIG_3645 [Bacteroides sp. 1_1_6]
gi|29340969|gb|AAO78759.1| beta-galactosidase precursor [Bacteroides thetaiotaomicron
VPI-5482]
gi|251839176|gb|EES67260.1| hypothetical protein BSIG_3645 [Bacteroides sp. 1_1_6]
Length = 778
Score = 152 bits (384), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 106/324 (32%), Positives = 155/324 (47%), Gaps = 27/324 (8%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
++DGK V+++ +HY R W I K G+ I Y+FWN HE G++ F
Sbjct: 35 TFLLDGKPFVVKAAELHYTRIPQAYWDHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFT 94
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL + RT + + E +
Sbjct: 95 GQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVG 154
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
F+ ++ + L ++GG II+ QVENEYG +YG + YV D T
Sbjct: 155 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGT-DKPYVSAVRDLVRESGFT 207
Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
VP C +A D +I T N G D P P+M +E +SGWF
Sbjct: 208 DVPLFQCDWSSNFTRNALDDLIWTINFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
+G RP +D+ + + +F + YM GGT FG G + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKEMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326
Query: 293 DAPIDEYGFIRQPKWGHLRELHKA 316
DAPI E G+ + K+ LR+L K
Sbjct: 327 DAPISEAGWTTE-KYYLLRDLLKT 349
>gi|307272985|ref|ZP_07554232.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
gi|306510599|gb|EFM79622.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
Length = 604
Score = 152 bits (384), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 107/324 (33%), Positives = 155/324 (47%), Gaps = 25/324 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 139 YYDVLMEKIVPHQLV--NGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
+ + R ++LA +V G N YM+ GGTNFG G TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314
Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
YDAP+DE G + + + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|325261840|ref|ZP_08128578.1| glycosyl hydrolase, family 35 [Clostridium sp. D5]
gi|324033294|gb|EGB94571.1| glycosyl hydrolase, family 35 [Clostridium sp. D5]
Length = 581
Score = 152 bits (384), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 108/326 (33%), Positives = 156/326 (47%), Gaps = 43/326 (13%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
ID ++ + SG +HY R E W + + K K G +ETY+ WN HE +G++ FEG
Sbjct: 12 IDNQKVKIISGGVHYFRIMAEYWKDCLLKLKAFGCNTVETYIPWNLHEKEKGEFCFEGNL 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
D+ +FV ++ GL++ LR PY CAEW +GG P WL G++ R + PF + ++ +
Sbjct: 72 DITKFVHIAKDLGLYVILRPSPYICAEWEFGGLPYWLLKEDGMRLRCSYKPFLKHVEEYY 131
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
++ +++ L ++GGP+I+ QVENEYG Y LY+K D V+ VP
Sbjct: 132 HRLFEVIAP--LQYTKGGPVIMMQVENEYG-----YYGNDTLYLKTLQDFMVSYGCEVPL 184
Query: 194 VMCQQEDAPDPIINTCNGFYCDGFTPNSPS---------------KPIMWTENYSGWFLS 238
V D P C T N S KP+M E + GWF S
Sbjct: 185 VTS---DGPWGDAFDCGKLEGVLQTGNFGSKSRQQLQIMRDKIGNKPLMCMEFWVGWFDS 241
Query: 239 FGYAV-----PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV------A 287
+G P + E+L E+G N YM+ GGTNFG G
Sbjct: 242 WGQTEHKQEDPNKNAENL----DEILESGHV--NIYMFMGGTNFGFMNGSNYYDVLTPDV 295
Query: 288 TSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDA + E G + PK+ L+ +
Sbjct: 296 TSYDYDALLTEAGDL-TPKYELLKNV 320
>gi|67078211|ref|YP_245831.1| beta-galactosidase [Bacillus cereus E33L]
gi|66970517|gb|AAY60493.1| beta-galactosidase [Bacillus cereus E33L]
Length = 598
Score = 152 bits (383), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 107/317 (33%), Positives = 154/317 (48%), Gaps = 36/317 (11%)
Query: 10 RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
+ ++DG+ + SG++HY R PE W + K G +ETYV WN HEP G + F
Sbjct: 8 KDFMLDGEPIKIISGALHYFRIVPEYWDHSLYNLKALGCNTVETYVPWNMHEPKEGIFNF 67
Query: 70 EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
EG DLV++V+ Q+ GL + LR PY CAEW +GG P WL I+ R+ N F ++
Sbjct: 68 EGIADLVKYVQLAQKYGLMVILRPTPYICAEWEFGGLPAWLLKYKDIRVRSNTNLFLNKV 127
Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
+ F ++ ++ L GGPII+ QVENEYG ++G E YV+ +L
Sbjct: 128 ENFYKVLLPMVTP--LQVENGGPIIMMQVENEYG----SFGNDKE-YVRNIKKLMRDLGV 180
Query: 190 SVP-------WVMCQQEDA--PDPIINTCN-GFYCDG--------FTPNSPSKPIMWTEN 231
+VP W + + D ++ T N G + N P+M E
Sbjct: 181 TVPLFTSDGAWQEALESGSLIDDDVLVTGNFGSRSNENLNELESFIKENKKEWPLMCMEF 240
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------- 283
+ GWF +G + R +LA V + N+YM+ GGTNFG G
Sbjct: 241 WDGWFNRWGMEIIRRDGSELAEEVKELLKRASI--NFYMFQGGTNFGFMNGCSSRENVDL 298
Query: 284 PLVATSYDYDAPIDEYG 300
P + TSYDYDA + E+G
Sbjct: 299 PQI-TSYDYDALLTEWG 314
>gi|424759896|ref|ZP_18187551.1| putative beta-galactosidase [Enterococcus faecalis R508]
gi|402403967|gb|EJV36601.1| putative beta-galactosidase [Enterococcus faecalis R508]
Length = 604
Score = 152 bits (383), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 109/324 (33%), Positives = 156/324 (48%), Gaps = 26/324 (8%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
L+ D ++L SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 21 LLNDQPFKIL-SGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
+ + R ++LA +V G N YM+ GGTNFG G TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314
Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
YDAP+DE G + + + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|222526932|ref|YP_002571403.1| beta-galactosidase [Chloroflexus sp. Y-400-fl]
gi|222450811|gb|ACM55077.1| Beta-galactosidase [Chloroflexus sp. Y-400-fl]
Length = 917
Score = 152 bits (383), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 100/313 (31%), Positives = 151/313 (48%), Gaps = 17/313 (5%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ +V + +DGK L SG +HY R W L+ +++ GL I+T + WN H
Sbjct: 21 MQHSVRVHRNGIELDGKPFYLLSGCVHYFRWPRAEWRPLLEQARWAGLNTIDTVIPWNRH 80
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G++ F DL F+ E GL +R GPY CAEW GG P WL ++ R+
Sbjct: 81 EPQPGEFDFSEEADLGAFLDLCHELGLKAIVRPGPYICAEWENGGLPAWLTASGDMRLRS 140
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGV-GGELYVKW 179
+ F++ + R+ ++ ++ GGPIIL Q+ENE+ WA GV G + + +
Sbjct: 141 DDPAFRDAVLRWFDTLMPILVPRQY--PHGGPIILCQIENEH----WASGVYGADTHQQT 194
Query: 180 AADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNS---PSKPIMWTENYSGWF 236
A A+ VP C P + P P++ +E +SGWF
Sbjct: 195 LAQAALERGIVVPQYTCVGAMPGYPEFRNGWSGIAEKLVQTRQLWPDNPLIVSELWSGWF 254
Query: 237 LSF-GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNF----GRTAGGPLV--ATS 289
++ G+ + L + + G +++M+ GGTNF GRT GG L+ TS
Sbjct: 255 DNWGGHRQTRKTAAKLDMTLHQLTAVGCAGFSHWMWAGGTNFGFWGGRTVGGDLIHMTTS 314
Query: 290 YDYDAPIDEYGFI 302
YDYDAP+DEYG +
Sbjct: 315 YDYDAPVDEYGRL 327
>gi|237719727|ref|ZP_04550208.1| beta-galactosidase [Bacteroides sp. 2_2_4]
gi|229450996|gb|EEO56787.1| beta-galactosidase [Bacteroides sp. 2_2_4]
Length = 778
Score = 152 bits (383), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 107/323 (33%), Positives = 155/323 (47%), Gaps = 27/323 (8%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
++DGK V+++ +HY R W I K G+ I Y+FWN HE G++ F
Sbjct: 35 TFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFS 94
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL I RT + + E +
Sbjct: 95 GQNDIATFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIALRTLDPYYMERVG 154
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
F+ ++ + L ++GG II+ QVENEYG +YG+ + YV D T
Sbjct: 155 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGI-DKPYVSAVRDLVRESGFT 207
Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
VP C +A D +I T N G D P P+M +E +SGWF
Sbjct: 208 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
+G RP +D+ + + +F + YM GGT FG G + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326
Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
DAPI E G+ K+ LR+L K
Sbjct: 327 DAPISEPGWTTD-KFFLLRDLLK 348
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 47/107 (43%), Gaps = 30/107 (28%)
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
+YK+TF + G L++++ GKG WVNG ++GR+W
Sbjct: 532 YYKSTFTL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------------- 570
Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
P QTL+ +P W+ GEN +++ + G + I L K
Sbjct: 571 --------GPQQTLF-MPGCWLKEGENEILVLDLKGPTRASIKGLKK 608
>gi|422701998|ref|ZP_16759838.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
gi|315169479|gb|EFU13496.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
Length = 604
Score = 152 bits (383), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 108/325 (33%), Positives = 158/325 (48%), Gaps = 27/325 (8%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQVFFEEHGKKWPLMCMEFWDGWFNR 256
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
+ + R ++LA +V G N YM+ GGTNFG G P + TSY
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 313
Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
DYDAP+DE G + + + LH+
Sbjct: 314 DYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|86142033|ref|ZP_01060557.1| putative exported beta-galactosidase [Leeuwenhoekiella blandensis
MED217]
gi|85831596|gb|EAQ50052.1| putative exported beta-galactosidase [Leeuwenhoekiella blandensis
MED217]
Length = 620
Score = 152 bits (383), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 119/344 (34%), Positives = 167/344 (48%), Gaps = 40/344 (11%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
A+ ++ + V +GK + SG +HY R E W I+ K GL I TYVFWNYH
Sbjct: 26 DASFKIENGSFVYNGKPTPIYSGEMHYERIPKEYWRHRIQMMKAMGLNTIATYVFWNYHN 85
Query: 62 PIRGQYYFE-GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
P G + FE G ++ F+K +E +F+ LR GPYAC EW +GG+P +L IPG++ R
Sbjct: 86 PAPGVWDFESGNRNVAEFIKIAKEEEMFVILRPGPYACGEWEFGGYPWFLQNIPGLKVRE 145
Query: 121 TNNPFKEEMKRFLAKIIDLMKQ-ENLFASQGGPIILAQVENEYGNV-----------EWA 168
N F K + I +L KQ L + GG II+ QVENE+G+ A
Sbjct: 146 NNAQFLAACKEY---INELAKQVAPLQVNNGGNIIMTQVENEFGSYVAQREDIAPEDHKA 202
Query: 169 YGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGF-YCDGFTP-----NSP 222
Y +K A A + W+ + + + ++ T NG D N+
Sbjct: 203 YKEAIFKMLKDAGFQAPFFTSDGAWLF--EGGSLEGVLPTANGEGNIDNLKKVVNKFNNN 260
Query: 223 SKPIMWTENYSGWFLSFGYAVPFRPVE--DLAFAVARFFETGGTFQNYYMYFGGTNFGRT 280
P M E Y GW +A PF + D+A + + G F N+YM GGTNFG T
Sbjct: 261 EGPYMVAEFYPGWLDH--WAEPFVKISASDIAKQTEVYLKNGVNF-NFYMAHGGTNFGFT 317
Query: 281 AGG---------PLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
+G P + TSYDYDAPI E G++ PK+ +R L +
Sbjct: 318 SGANYNDEHDIQPDI-TSYDYDAPISEAGWVT-PKYDSIRALMQ 359
Score = 44.7 bits (104), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 57/221 (25%), Positives = 89/221 (40%), Gaps = 52/221 (23%)
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L + L A V+VN K V G N F ++ + KI N +L+IL +G NYGA
Sbjct: 431 LKVPGLRDFATVYVNGKKV--GELNRVFNSYEMPIKIPFN---GSLEILVENMGRINYGA 485
Query: 534 WFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
G+ + + I+ +++ G +Y+ + + NS+ K G +
Sbjct: 486 EIVNNLKGITAPVSIN----DYEITGGWEMYKAPF------AEVPEVINSTEVKTGRPVV 535
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ S K +G LN++ MGKG +VNG ++GRYW
Sbjct: 536 YSGSFDLKK--------QGDTFLNMSEMGKGIVFVNGHNLGRYWKV-------------- 573
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEEL 694
P QTLY +P W+ N + I E+L
Sbjct: 574 --------------GPQQTLY-VPGCWLKKKGNTITIFEQL 599
>gi|423294349|ref|ZP_17272476.1| hypothetical protein HMPREF1070_01141 [Bacteroides ovatus
CL03T12C18]
gi|392675540|gb|EIY68981.1| hypothetical protein HMPREF1070_01141 [Bacteroides ovatus
CL03T12C18]
Length = 778
Score = 152 bits (383), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 107/323 (33%), Positives = 155/323 (47%), Gaps = 27/323 (8%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
++DGK V+++ +HY R W I K G+ I Y+FWN HE G++ F
Sbjct: 35 TFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFS 94
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL I RT + + E +
Sbjct: 95 GQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIALRTLDPYYMERVG 154
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
F+ ++ + L ++GG II+ QVENEYG +YG+ + YV D T
Sbjct: 155 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGI-DKPYVSAVRDLVRESGFT 207
Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
VP C +A D +I T N G D P P+M +E +SGWF
Sbjct: 208 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
+G RP +D+ + + +F + YM GGT FG G + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326
Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
DAPI E G+ K+ LR+L K
Sbjct: 327 DAPISEPGWTTD-KFFLLRDLLK 348
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 47/107 (43%), Gaps = 30/107 (28%)
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
+YK+TF + G L++++ GKG WVNG ++GR+W
Sbjct: 532 YYKSTFTL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------------- 570
Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
P QTL+ +P W+ GEN +++ + G + I L K
Sbjct: 571 --------GPQQTLF-MPGCWLKEGENEILVLDLKGPTRASIKGLKK 608
>gi|319893645|ref|YP_004150520.1| beta-galactosidase 3 [Staphylococcus pseudintermedius HKU10-03]
gi|386318129|ref|YP_006014292.1| glycosyl hydrolase [Staphylococcus pseudintermedius ED99]
gi|317163341|gb|ADV06884.1| Beta-galactosidase 3 [Staphylococcus pseudintermedius HKU10-03]
gi|323463300|gb|ADX75453.1| glycosyl hydrolase, family 35 [Staphylococcus pseudintermedius
ED99]
Length = 590
Score = 152 bits (383), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 107/315 (33%), Positives = 153/315 (48%), Gaps = 34/315 (10%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
++D K + SG+IHY R + W + + K G +ETYV WN+HE I +Y F+
Sbjct: 9 TFLLDDKPIKILSGAIHYFRIPKDDWEDSLYNLKALGFNTVETYVPWNFHETIENEYDFK 68
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G DL F++ + GL++ +R PY CAEW +GGFP WL ++ R+ + + E++K
Sbjct: 69 GHKDLKHFIELAAKLGLYVIVRPSPYICAEWEFGGFPAWLLNDRTMRIRSRDEKYLEKVK 128
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
++ ++ ++ L QGGPII+ QVENEYG ++G + Y++ A +
Sbjct: 129 KYYHELFKILTP--LQIDQGGPIIMMQVENEYG----SFGQDHD-YLRSLAHMMREEGVT 181
Query: 191 VP-------WVMCQQ-----EDAPDPIIN----TCNGFYCDGFTPNSPSK--PIMWTENY 232
VP W C + ED P N T F SK P+M E +
Sbjct: 182 VPFFTSDGAWDQCLRAGSLIEDDILPTGNFGSRTVQNFENLKTFQQEFSKKWPLMCMEFW 241
Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGPL 285
GWF +G V R +DLA V + G N YM+ GGTNFG R
Sbjct: 242 DGWFNRWGEPVIKRDSDDLAEEVRDAVKLGSL--NLYMFHGGTNFGFWNGCSARGTKDLP 299
Query: 286 VATSYDYDAPIDEYG 300
TSYDY AP+DE G
Sbjct: 300 QVTSYDYHAPLDEAG 314
>gi|384248639|gb|EIE22122.1| hypothetical protein COCSUDRAFT_1093, partial [Coccomyxa
subellipsoidea C-169]
Length = 632
Score = 152 bits (383), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 111/328 (33%), Positives = 157/328 (47%), Gaps = 37/328 (11%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK + SGS+HY R P W + + ++K GL + YV WN HEP GQY ++G
Sbjct: 28 MDGKPFRIISGSLHYHRIHPAQWKDRMLRTKALGLNTLSVYVPWNLHEPFPGQYNWDGFA 87
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPG---------IQFRTTNNP 124
DL ++ QE GL++ LR GPY CAEW++GGFP WL + R+ +
Sbjct: 88 DLEAYLALAQEQGLYVLLRPGPYICAEWDFGGFPWWLASSKAGLCSTSSHSVTLRSDDPA 147
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYG----NVEWAYGVGGELYVKWA 180
+ E + R+ + L K S+GG I++ QVENE+G N ++ + G +
Sbjct: 148 YLELVDRWWKVL--LPKIGRFLYSRGGNILMVQVENEFGFVGPNEKYMRHLVGTVRAS-L 204
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTC---------NGFYCDGFTPNSPSK-PIMWTE 230
D A+ T P + + D +++ N + N+P K P M +E
Sbjct: 205 GDDALIYTTDPPPNIAKGTLPGDEVLSVVDFGAGWFDLNWAFSQQRAMNAPGKSPPMCSE 264
Query: 231 NYSGWFLSFGYAVPFRPVE---DLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-- 285
Y+GW +G + V+ D V F G+ N YM GGTNFG TAGG +
Sbjct: 265 FYTGWLTRWGEKMANTSVDQFLDTLHGVLGFANNTGSV-NLYMVHGGTNFGFTAGGSIDN 323
Query: 286 -----VATSYDYDAPIDEYGFIRQPKWG 308
TSYDYDAPI E G QP G
Sbjct: 324 GVYWACITSYDYDAPISEAGDTGQPGIG 351
>gi|163848976|ref|YP_001637020.1| beta-galactosidase [Chloroflexus aurantiacus J-10-fl]
gi|163670265|gb|ABY36631.1| Beta-galactosidase [Chloroflexus aurantiacus J-10-fl]
Length = 897
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 100/313 (31%), Positives = 151/313 (48%), Gaps = 17/313 (5%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
+ +V + +DGK L SG +HY R W L+ +++ GL I+T + WN H
Sbjct: 1 MQHSVRVHRNGIELDGKPFYLLSGCVHYFRWPRAEWRPLLEQARWAGLNTIDTVIPWNRH 60
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G++ F DL F+ E GL +R GPY CAEW GG P WL ++ R+
Sbjct: 61 EPQPGEFDFSEEADLGAFLDLCHELGLKAIVRPGPYICAEWENGGLPAWLTASGDMRLRS 120
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGV-GGELYVKW 179
+ F++ + R+ ++ ++ GGPIIL Q+ENE+ WA GV G + + +
Sbjct: 121 DDPAFRDAVLRWFDTLMPILVPRQY--PHGGPIILCQIENEH----WASGVYGADTHQQT 174
Query: 180 AADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNS---PSKPIMWTENYSGWF 236
A A+ VP C P + P P++ +E +SGWF
Sbjct: 175 LAQAALERGIVVPQYTCVGAMPGYPEFRNGWSGIAEKLVQTRQLWPDNPLIVSELWSGWF 234
Query: 237 LSF-GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNF----GRTAGGPLV--ATS 289
++ G+ + L + + G +++M+ GGTNF GRT GG L+ TS
Sbjct: 235 DNWGGHRQTRKTAAKLDMTLHQLTAVGCAGFSHWMWAGGTNFGFWGGRTVGGDLIHMTTS 294
Query: 290 YDYDAPIDEYGFI 302
YDYDAP+DEYG +
Sbjct: 295 YDYDAPVDEYGRL 307
>gi|393780989|ref|ZP_10369190.1| hypothetical protein HMPREF1071_00058 [Bacteroides salyersiae
CL02T12C01]
gi|392677324|gb|EIY70741.1| hypothetical protein HMPREF1071_00058 [Bacteroides salyersiae
CL02T12C01]
Length = 776
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 106/325 (32%), Positives = 158/325 (48%), Gaps = 27/325 (8%)
Query: 10 RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
+ +++G+ ++++ +HY R W I+ K G+ I YVFWN HE GQ+ F
Sbjct: 32 KTFLLNGEPFIVKAAELHYTRIPQPYWEHRIKMCKALGMNTICLYVFWNIHEQEEGQFDF 91
Query: 70 EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL I RT + + E +
Sbjct: 92 TGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIALRTLDPYYMERV 151
Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN- 188
F+ K+ + + L ++GG II+ QVENEYG +YG + YV D
Sbjct: 152 GIFMKKVGEQLVP--LQITRGGNIIMVQVENEYG----SYGT-DKPYVSAIRDMVRGAGF 204
Query: 189 TSVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWF 236
T VP C +A D ++ T N G D P P+M +E +SGWF
Sbjct: 205 TEVPLFQCDWSSNFTNNALDDLLWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWF 264
Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYD 291
+G RP +D+ + + +F + YM GGT FG G + +SYD
Sbjct: 265 DHWGRKHETRPAKDMVQGLKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYD 323
Query: 292 YDAPIDEYGFIRQPKWGHLRELHKA 316
YDAPI E G+ + K+ LR+L K
Sbjct: 324 YDAPISEAGWTTE-KYFLLRDLLKG 347
>gi|164519029|ref|NP_001019529.2| beta-galactosidase-1-like protein 3 precursor [Rattus norvegicus]
Length = 644
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 107/320 (33%), Positives = 155/320 (48%), Gaps = 29/320 (9%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++G + ++ GSIHY R E W + + K + G + TY+ WN HE RG++ F
Sbjct: 69 FTLEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSE 128
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL +V + GL++ LR GPY CAE + GG P WL PG RTTN F E + +
Sbjct: 129 ILDLEAYVLLAKTLGLWVILRPGPYICAEVDLGGLPSWLLRNPGSNLRTTNKDFIEAVDK 188
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ +I K L +GGP+I QVENEYG+ + Y+++ LN +
Sbjct: 189 YFDHLIP--KILPLQYRRGGPVIAVQVENEYGSFR-----NDKNYMEYIKKAL--LNRGI 239
Query: 192 PWVMCQQEDAPDPIINTCNG---------FYCDGFTP---NSPSKPIMWTENYSGWFLSF 239
++ ++ I + G F D F KPIM E ++GW+ S+
Sbjct: 240 VELLLTSDNESGIRIGSVKGALATINVNSFIKDSFVKLHRMQNDKPIMIMEYWTGWYDSW 299
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVATSYDYD 293
G + ++ + RFF G +F N YM+ GGTNFG GG V TSYDYD
Sbjct: 300 GSKHTEKSANEIRRTIYRFFSYGLSF-NVYMFHGGTNFGFINGGYHENGHTNVVTSYDYD 358
Query: 294 APIDEYGFIRQPKWGHLREL 313
A + E G + K+ LR+L
Sbjct: 359 AVLSEAGDYTE-KYFKLRKL 377
>gi|307707961|ref|ZP_07644436.1| beta-galactosidase [Streptococcus mitis NCTC 12261]
gi|307616026|gb|EFN95224.1| beta-galactosidase [Streptococcus mitis NCTC 12261]
Length = 595
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 104/307 (33%), Positives = 151/307 (49%), Gaps = 25/307 (8%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK + SG+IHY R PE W + K G +ETYV WN HEP G+++FEG
Sbjct: 12 LDGKSFKILSGAIHYFRVPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFHFEGAL 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL RF++T Q+ GL+ +R P+ CAEW +GG P WL ++ R+++ + E + R+
Sbjct: 72 DLERFLQTAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKDMRLRSSDPAYIEAVGRYY 130
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
+++ + L +GG I++ QVENEYG+ + AY ++ T +
Sbjct: 131 DQLLSRLVPHLL--DKGGNILMMQVENEYGSYGEDKAYLRAIRQLMEERGVTCPLFTSDG 188
Query: 192 PWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
PW + D + T N + F + P+M E + GWF +
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
+ R ++LA AV E G N YM+ GGTNFG G G L TSYDYD
Sbjct: 249 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYD 306
Query: 294 APIDEYG 300
A +DE G
Sbjct: 307 ALLDEEG 313
>gi|383110805|ref|ZP_09931623.1| hypothetical protein BSGG_1915 [Bacteroides sp. D2]
gi|313694380|gb|EFS31215.1| hypothetical protein BSGG_1915 [Bacteroides sp. D2]
Length = 778
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 107/323 (33%), Positives = 155/323 (47%), Gaps = 27/323 (8%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
++DGK V+++ +HY R W I K G+ I Y+FWN HE G++ F
Sbjct: 35 TFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFS 94
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL I RT + + E +
Sbjct: 95 GQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIALRTLDPYYMERVG 154
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
F+ ++ + L ++GG II+ QVENEYG +YG+ + YV D T
Sbjct: 155 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGI-DKPYVSAVRDLVRESGFT 207
Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
VP C +A D +I T N G D P P+M +E +SGWF
Sbjct: 208 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
+G RP +D+ + + +F + YM GGT FG G + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326
Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
DAPI E G+ K+ LR+L K
Sbjct: 327 DAPISEPGWTTD-KFFLLRDLLK 348
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 47/107 (43%), Gaps = 30/107 (28%)
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
+YK+TF + G L++++ GKG WVNG ++GR+W
Sbjct: 532 YYKSTFTL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------------- 570
Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
P QTL+ +P W+ GEN +++ + G + I L K
Sbjct: 571 --------GPQQTLF-MPGCWLKEGENEILVLDLKGPTRASIKGLKK 608
>gi|337283005|ref|YP_004622476.1| beta-galactosidase [Streptococcus parasanguinis ATCC 15912]
gi|335370598|gb|AEH56548.1| beta-galactosidase [Streptococcus parasanguinis ATCC 15912]
Length = 595
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 106/307 (34%), Positives = 149/307 (48%), Gaps = 25/307 (8%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+ G+ + SG+IHY R P W + K G +ETYV WN HEP +GQ+ F GR
Sbjct: 12 LKGQPFKILSGAIHYFRIDPADWYHSLFNLKALGFNTVETYVPWNVHEPRKGQFDFSGRL 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL RF++T Q GL++ +R P+ CAEW +GG P WL ++ R+++ F E + R+
Sbjct: 72 DLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWL-LEEDMRIRSSDPAFIEAVDRYY 130
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
++ L+ + QGGPI++ QVENEYG+ + AY +K T +
Sbjct: 131 DHLLGLLTPYQV--DQGGPILMMQVENEYGSYGEDKAYLRAIRDLMKKKGVTCPLFTSDG 188
Query: 192 PWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
PW + + + T N G + F P+M E + GWF +
Sbjct: 189 PWRAALRAGTLIEEDLFVTGNFGSKAAYNFGQMQEFFDEYGKKWPLMCMEFWDGWFTRWK 248
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
V R E+LA AV E G N YM+ GGTNFG G G L TSYDY
Sbjct: 249 EPVIQREPEELAEAVHEVLELGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYG 306
Query: 294 APIDEYG 300
A ++E G
Sbjct: 307 ALLNEQG 313
>gi|164519028|ref|NP_001106794.1| beta-galactosidase-1-like protein 3 precursor [Mus musculus]
Length = 662
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 109/320 (34%), Positives = 155/320 (48%), Gaps = 29/320 (9%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++G + ++ GSIHY R E W + + K + G + TY+ WN HE RG++ F
Sbjct: 69 FTLEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSE 128
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL +V + GL++ LR GPY CAE + GG P WL P RTTN F E + +
Sbjct: 129 ILDLEAYVLLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDK 188
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ +I K L GGP+I QVENEYG+ + Y+K A L +
Sbjct: 189 YFDHLIP--KILPLQYRHGGPVIAVQVENEYGSFQKDRNYMN--YLKKAL-----LKRGI 239
Query: 192 PWVMCQQEDAPDPIINTCNG----FYCDGFTPNS--------PSKPIMWTENYSGWFLSF 239
++ +D I + NG + FT +S KPIM E ++GW+ S+
Sbjct: 240 VELLLTSDDKDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYDSW 299
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSYDYD 293
G + E++ V +F G +F N YM+ GGTNFG GG V TSYDYD
Sbjct: 300 GSKHIEKSAEEIRHTVYKFISYGLSF-NMYMFHGGTNFGFINGGRYENHHISVVTSYDYD 358
Query: 294 APIDEYGFIRQPKWGHLREL 313
A + E G + K+ LR+L
Sbjct: 359 AVLSEAGDYTE-KYFKLRKL 377
>gi|256393561|ref|YP_003115125.1| beta-galactosidase [Catenulispora acidiphila DSM 44928]
gi|256359787|gb|ACU73284.1| Beta-galactosidase [Catenulispora acidiphila DSM 44928]
Length = 584
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 102/303 (33%), Positives = 146/303 (48%), Gaps = 24/303 (7%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DG+ + SG +HY R P W + +RK++ GL I+TY+ WN HE G + F G
Sbjct: 13 LDGQPFRIVSGGLHYFRVHPAQWSDRLRKARLMGLNTIDTYIPWNLHERRPGTFDFGGIL 72
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL F+ GL + LR GPY C EW GG P WL P + R+T+ F + ++ +L
Sbjct: 73 DLAAFLDAAAAEGLHVLLRPGPYICGEWEGGGLPSWLLADPDLALRSTDPAFLQAVEAYL 132
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
I+ ++ ++GGP+I QVENEYG AYG Y++ + + VP+
Sbjct: 133 DAIMPIVLPR--LGTRGGPVIAVQVENEYG----AYG-SDTAYMERLYEALTSRGIDVPF 185
Query: 194 VMCQQ----EDAPDPIINTCNGF------YCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
Q D P + F P+ P+M E ++GWF +G
Sbjct: 186 FTSDQPNDLADGALPGVLATANFGGKVTASLAALRAQQPTGPLMCAEFWNGWFDYWGGTH 245
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVATSYDYDAPID 297
R ED A+ + G + N+YM+ GGTNFG T G TSYDYD+P+D
Sbjct: 246 AQRSAEDAGAALEEMLQAGASV-NFYMFHGGTNFGFTNGANDKGTYRATVTSYDYDSPLD 304
Query: 298 EYG 300
E G
Sbjct: 305 EAG 307
>gi|195030628|ref|XP_001988170.1| GH10713 [Drosophila grimshawi]
gi|193904170|gb|EDW03037.1| GH10713 [Drosophila grimshawi]
Length = 680
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 108/324 (33%), Positives = 156/324 (48%), Gaps = 36/324 (11%)
Query: 8 DHRA--LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
DH A +++GK SGS HY R+ P+ W +R + GL ++TYV W+ H P G
Sbjct: 59 DHVANTFLMNGKPFRYVSGSFHYFRALPDAWRSRLRTMRASGLNALDTYVEWSLHNPHDG 118
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHF-IPGIQFRTTNNP 124
+Y +EG D+VRF++ QE ++ LR GPY CAE + GG P WL P I+ RT +
Sbjct: 119 EYDWEGIADIVRFLEIAQEEDFYIVLRPGPYICAERDNGGLPHWLFTKYPDIKVRTNDPN 178
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD-- 182
+ E+ ++ A+++ +K +L GG II+ QVENEYG AY Y+ W D
Sbjct: 179 YIAEVGKWYAQLMPRLK--HLLFGNGGKIIMVQVENEYG----AYHACDHDYLNWLRDET 232
Query: 183 ------TAVNLNTSVPWVMCQQEDAPDPIINTCNG----FYCDG----FTPNSPSKPIMW 228
A+ +P + T G F D P+ P++
Sbjct: 233 DKYVENKALLFTVDIPNERMHCGKIDNVFATTDFGIDRIFEIDKIWELLRGIQPTGPLVN 292
Query: 229 TENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG----- 283
+E Y GW + R +++A A+ + G + N YM+FGGTNFG TAG
Sbjct: 293 SEFYPGWLTHWQEMNQRRDGKEVADALKKILSYGASV-NLYMFFGGTNFGFTAGANYDLD 351
Query: 284 -----PLVATSYDYDAPIDEYGFI 302
TSYDYDA +DE G +
Sbjct: 352 GGIGYAADITSYDYDAVMDEAGGV 375
>gi|307710114|ref|ZP_07646558.1| beta-galactosidase [Streptococcus mitis SK564]
gi|307619094|gb|EFN98226.1| beta-galactosidase [Streptococcus mitis SK564]
Length = 595
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 104/307 (33%), Positives = 150/307 (48%), Gaps = 25/307 (8%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK + SG+IHY R PE W + K G +ETYV WN HEP G+++FEG
Sbjct: 12 LDGKSFKILSGAIHYFRVPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFHFEGAL 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL RF++T Q+ GL+ +R P+ CAEW +GG P WL ++ R+++ + E + R+
Sbjct: 72 DLERFLQTAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKNMRLRSSDPAYIEAVGRYY 130
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
+++ + L GG I++ QVENEYG+ + AY ++ T +
Sbjct: 131 DQLLSRLVPHLL--DNGGNILMMQVENEYGSYGEDKAYLRAIRQLMEERGVTCPLFTSDG 188
Query: 192 PWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
PW + D + T N + F + P+M E + GWF +
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
+ R ++LA AV E G N YM+ GGTNFG G G L TSYDYD
Sbjct: 249 EPIITRDPKELAEAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYD 306
Query: 294 APIDEYG 300
A +DE G
Sbjct: 307 ALLDEEG 313
>gi|348573621|ref|XP_003472589.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
3-like [Cavia porcellus]
Length = 679
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 109/325 (33%), Positives = 151/325 (46%), Gaps = 19/325 (5%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+A+ T ++G + ++ GSIHY R E W + + K K G + TY+ WN HE
Sbjct: 92 TASTTKGRAHFTLEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYIPWNLHE 151
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P RG++ F G DL FV E GL++ LR GPY CAE + GG P WL P Q RTT
Sbjct: 152 PQRGKFVFSGNLDLEAFVLLAAEIGLWVILRPGPYICAEIDLGGLPSWLLQNPKTQLRTT 211
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
F + + + ++ M L GGP+I QVENEYG+ L
Sbjct: 212 ERTFVDAVDAYFDHLMRRMVP--LQYHHGGPVIAVQVENEYGSFNRDGQYMAYLKEALLK 269
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTC-------NGFYCDGFTPNSPSKPIMWTENYSG 234
V L + + + ++ T N FY KPI+ E + G
Sbjct: 270 RGIVELLFTCDYYKDVVNGSLKGVLATVNLGSLGKNSFY--QLLQVQSHKPILIMEYWVG 327
Query: 235 WFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR------TAGGPLVAT 288
W+ S+G + ++A V+ F + G +F N YM+ GGTNFG G V T
Sbjct: 328 WYDSWGLPHANKSAAEVAHTVSTFIKNGISF-NVYMFHGGTNFGFINAAGIVEGRRSVTT 386
Query: 289 SYDYDAPIDEYGFIRQPKWGHLREL 313
SYDYDA + E G + K+ LREL
Sbjct: 387 SYDYDAVLSEAGDYTE-KYFKLREL 410
>gi|81889875|sp|Q5XIL5.1|GLBL3_RAT RecName: Full=Beta-galactosidase-1-like protein 3
gi|53734228|gb|AAH83665.1| Galactosidase, beta 1-like 3 [Rattus norvegicus]
Length = 631
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 107/320 (33%), Positives = 155/320 (48%), Gaps = 29/320 (9%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++G + ++ GSIHY R E W + + K + G + TY+ WN HE RG++ F
Sbjct: 56 FTLEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSE 115
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL +V + GL++ LR GPY CAE + GG P WL PG RTTN F E + +
Sbjct: 116 ILDLEAYVLLAKTLGLWVILRPGPYICAEVDLGGLPSWLLRNPGSNLRTTNKDFIEAVDK 175
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ +I K L +GGP+I QVENEYG+ + Y+++ LN +
Sbjct: 176 YFDHLIP--KILPLQYRRGGPVIAVQVENEYGSFR-----NDKNYMEYIKKAL--LNRGI 226
Query: 192 PWVMCQQEDAPDPIINTCNG---------FYCDGFTP---NSPSKPIMWTENYSGWFLSF 239
++ ++ I + G F D F KPIM E ++GW+ S+
Sbjct: 227 VELLLTSDNESGIRIGSVKGALATINVNSFIKDSFVKLHRMQNDKPIMIMEYWTGWYDSW 286
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVATSYDYD 293
G + ++ + RFF G +F N YM+ GGTNFG GG V TSYDYD
Sbjct: 287 GSKHTEKSANEIRRTIYRFFSYGLSF-NVYMFHGGTNFGFINGGYHENGHTNVVTSYDYD 345
Query: 294 APIDEYGFIRQPKWGHLREL 313
A + E G + K+ LR+L
Sbjct: 346 AVLSEAGDYTE-KYFKLRKL 364
>gi|387878583|ref|YP_006308886.1| Beta-galactosidase 3 [Streptococcus parasanguinis FW213]
gi|386792040|gb|AFJ25075.1| Beta-galactosidase 3 [Streptococcus parasanguinis FW213]
Length = 595
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 107/315 (33%), Positives = 151/315 (47%), Gaps = 41/315 (13%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+ G+ + SG+IHY R P W + K G +ETYV WN HEP +GQ+ F GR
Sbjct: 12 LKGQPFKILSGAIHYFRIDPADWYHSLFNLKALGFNTVETYVPWNVHEPRKGQFDFSGRL 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL RF++ Q GL++ +R P+ CAEW +GG P WL ++ R+++ F E + R+
Sbjct: 72 DLERFIQIAQSLGLYMIVRPSPFICAEWEFGGLPAWL-LEEDMRIRSSDPAFIEAVDRYY 130
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
++ L+ + + QGGPI++ QVENEYG +YG ++Y++ D + P
Sbjct: 131 DHLLGLLTRYQV--DQGGPILMMQVENEYG----SYG-EDKVYLRAIRDLMKKKGVTCPL 183
Query: 194 VMCQQEDAP------------DPIINTCN---------GFYCDGFTPNSPSKPIMWTENY 232
D P D + T N G + F P+M E +
Sbjct: 184 FTS---DGPWRATLRAGTLIEDDLFVTGNFGSKAAYNFGQMQEFFDEYGKKWPLMCMEFW 240
Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL--- 285
GWF + V R E+LA AV E G N YM+ GGTNFG G G L
Sbjct: 241 DGWFTRWKEPVIQREPEELAEAVHEVLELGSI--NLYMFHGGTNFGFMNGCSARGTLDLP 298
Query: 286 VATSYDYDAPIDEYG 300
TSYDY A ++E G
Sbjct: 299 QVTSYDYGALLNEQG 313
>gi|148693363|gb|EDL25310.1| mCG125130, isoform CRA_b [Mus musculus]
Length = 688
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 109/320 (34%), Positives = 155/320 (48%), Gaps = 29/320 (9%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++G + ++ GSIHY R E W + + K + G + TY+ WN HE RG++ F
Sbjct: 95 FTLEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSE 154
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL +V + GL++ LR GPY CAE + GG P WL P RTTN F E + +
Sbjct: 155 ILDLEAYVLLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDK 214
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ +I K L GGP+I QVENEYG+ + Y+K A L +
Sbjct: 215 YFDHLIP--KILPLQYRHGGPVIAVQVENEYGSFQKDRNYMN--YLKKAL-----LKRGI 265
Query: 192 PWVMCQQEDAPDPIINTCNG----FYCDGFTPNS--------PSKPIMWTENYSGWFLSF 239
++ +D I + NG + FT +S KPIM E ++GW+ S+
Sbjct: 266 VELLLTSDDKDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYDSW 325
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSYDYD 293
G + E++ V +F G +F N YM+ GGTNFG GG V TSYDYD
Sbjct: 326 GSKHIEKSAEEIRHTVYKFISYGLSF-NMYMFHGGTNFGFINGGRYENHHISVVTSYDYD 384
Query: 294 APIDEYGFIRQPKWGHLREL 313
A + E G + K+ LR+L
Sbjct: 385 AVLSEAGDYTE-KYFKLRKL 403
>gi|256424388|ref|YP_003125041.1| beta-galactosidase [Chitinophaga pinensis DSM 2588]
gi|256039296|gb|ACU62840.1| Beta-galactosidase [Chitinophaga pinensis DSM 2588]
Length = 586
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 114/329 (34%), Positives = 166/329 (50%), Gaps = 40/329 (12%)
Query: 10 RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
+ ++D K + SG +H R E W I+ +K G I YVFWNYHE G++ F
Sbjct: 17 KDFLLDSKPYQIISGEMHPARIPKEYWRHRIQMAKAMGCNTIAAYVFWNYHEQEEGKFDF 76
Query: 70 --EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKE 127
E R D+V F+K VQE G+++ LR GPY CAEW +GG P +L IP I+ R + +
Sbjct: 77 TSENR-DIVAFIKMVQEEGMWVMLRPGPYVCAEWEFGGLPPYLLRIPDIKVRCMDPRYIA 135
Query: 128 EMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNL 187
+R++ + + +K L + GGPI++ QVENEYG ++G E +K D V
Sbjct: 136 ATERYIKALSEEVKP--LQITNGGPIVMVQVENEYG----SFGNDREYMLK-VKDMWVQN 188
Query: 188 NTSVPW--------VMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWF 236
+VP+ + + P I +G F +P P +E+Y GW
Sbjct: 189 GINVPFYTADGPVSALLEAGSVPGAAIGLDSGSSEGDFAAAEKQNPDVPSFSSESYPGWL 248
Query: 237 LSFG--YAVPFRP--VEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------P 284
+G +A P + V+++ F +T +F N Y+ GGTNFG TAG P
Sbjct: 249 THWGEKWARPDKAGIVKEVKF----LMDTKRSF-NLYVIHGGTNFGFTAGANSGGKGYEP 303
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLREL 313
+ TSYDYDAPI+E G K+ LR+L
Sbjct: 304 DL-TSYDYDAPINEQGDTTA-KYNALRDL 330
>gi|422729668|ref|ZP_16786066.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
gi|315149788|gb|EFT93804.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
Length = 604
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 159/324 (49%), Gaps = 26/324 (8%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
L+ D ++L SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 21 LLNDQPFKIL-SGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNF----GRTAGGPL---VATSYD 291
+ + R ++LA +V G N YM+ GGTNF G +A G + TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFEFMNGCSARGTIDLPQITSYD 314
Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
YDAP+DE G + + + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|143955283|sp|A2RSQ1.1|GLBL3_MOUSE RecName: Full=Beta-galactosidase-1-like protein 3
gi|124297651|gb|AAI32201.1| Glb1l3 protein [Mus musculus]
gi|124297899|gb|AAI32203.1| Glb1l3 protein [Mus musculus]
Length = 649
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 109/320 (34%), Positives = 155/320 (48%), Gaps = 29/320 (9%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++G + ++ GSIHY R E W + + K + G + TY+ WN HE RG++ F
Sbjct: 56 FTLEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSE 115
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL +V + GL++ LR GPY CAE + GG P WL P RTTN F E + +
Sbjct: 116 ILDLEAYVLLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDK 175
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ +I K L GGP+I QVENEYG+ + Y+K A L +
Sbjct: 176 YFDHLIP--KILPLQYRHGGPVIAVQVENEYGSFQKDRNYMN--YLKKAL-----LKRGI 226
Query: 192 PWVMCQQEDAPDPIINTCNG----FYCDGFTPNS--------PSKPIMWTENYSGWFLSF 239
++ +D I + NG + FT +S KPIM E ++GW+ S+
Sbjct: 227 VELLLTSDDKDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYDSW 286
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSYDYD 293
G + E++ V +F G +F N YM+ GGTNFG GG V TSYDYD
Sbjct: 287 GSKHIEKSAEEIRHTVYKFISYGLSF-NMYMFHGGTNFGFINGGRYENHHISVVTSYDYD 345
Query: 294 APIDEYGFIRQPKWGHLREL 313
A + E G + K+ LR+L
Sbjct: 346 AVLSEAGDYTE-KYFKLRKL 364
>gi|167524869|ref|XP_001746770.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775040|gb|EDQ88666.1| predicted protein [Monosiga brevicollis MX1]
Length = 600
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 104/304 (34%), Positives = 152/304 (50%), Gaps = 32/304 (10%)
Query: 23 SGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGR-FDLVRFVKT 81
SGS+HY R E W + + +K GL I TYV WN+HE G + FE DL RF+
Sbjct: 70 SGSLHYFRIPAEYWLDRLEMAKHMGLNTISTYVPWNFHEVGPGSFDFETHAHDLARFLNL 129
Query: 82 VQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMK 141
E GL + +R PY CAEW++GG P L P ++ R++N+ F +E++R+ ++ +++
Sbjct: 130 AHEVGLRVLIRPSPYICAEWDFGGLPARLMANPDLELRSSNDAFLDEVERYYDALMPILR 189
Query: 142 QENLFASQGGPIILAQVENEYGNVEWAYGVG--------------GELYVKWAADTAVNL 187
L AS GGPII VENEYG +YG G + + D A L
Sbjct: 190 P--LQASNGGPIIAFYVENEYG----SYGADRDYLQALVAMMRDRGIVEQMFTCDNAQGL 243
Query: 188 NTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRP 247
+ Q + D + + D P +P+M +E ++GWF G
Sbjct: 244 SRGALPGALQTINFQDNVER-----HLDQLAHFQPDQPLMVSEYWTGWFDHDGEEHHTFD 298
Query: 248 VEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA--TSYDYDAPIDEYGFIR 303
EDL + + + G +F N Y++ GGT+FG AG P TSYDYDAP+ E+G +
Sbjct: 299 SEDLVEGLQKILDRGASF-NLYVFHGGTSFGWNAGANSPYAPDITSYDYDAPLSEHGQV- 356
Query: 304 QPKW 307
PK+
Sbjct: 357 TPKY 360
>gi|445493871|ref|ZP_21460915.1| beta-galactosidase [Janthinobacterium sp. HH01]
gi|444790032|gb|ELX11579.1| beta-galactosidase [Janthinobacterium sp. HH01]
Length = 783
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 107/309 (34%), Positives = 159/309 (51%), Gaps = 29/309 (9%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DGK ++ G +H+ R E WP ++ K GL + Y+FWNYHE G++ + G
Sbjct: 41 FLLDGKPLQIRCGEMHFARVPREYWPHRLKAIKAMGLNTVCAYLFWNYHEWREGKFDWSG 100
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRTTNNP-FKEEM 129
+ D V F + ++ GL++ LR GPYACAEW GG P W L+ PG F T +P F +
Sbjct: 101 QRDAVEFCRLARQEGLWVILRPGPYACAEWEMGGLPWWLLNKHPGDAFLRTRDPAFVDPA 160
Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGEL-YVKWAADTAVNLN 188
+R+L ++ ++ + +QGGPI++ QVENEYG G +L Y++ ++
Sbjct: 161 RRYLREVGRVLAPMQV--TQGGPILMVQVENEYGF------FGEDLEYMRTMRQALLDAR 212
Query: 189 TSVPWVMCQQEDAPD----PIINTCNGFYCD---GFTPNSPSK--PIMWTENYSGWFLSF 239
VP C +A P + T F D GF + + P+M E YSGWF ++
Sbjct: 213 FDVPLFQCNPTNAVAKTHLPGMLTVANFGSDPAGGFKALAAVQQAPLMCGEYYSGWFDTW 272
Query: 240 GYAVPFRPVEDLAFAV--ARFFETGGTFQNYYMYFGGTNFGRTAG--GPLV--ATSYDYD 293
G P R ++ + V + G+F + YM GGT F G P TSYDYD
Sbjct: 273 GN--PHRRGDNTSAVVDIQAMLKANGSF-SLYMAHGGTTFSLWGGCDRPFRPDTTSYDYD 329
Query: 294 APIDEYGFI 302
API E G++
Sbjct: 330 APISEAGWV 338
>gi|384108880|ref|ZP_10009768.1| Beta-galactosidase [Treponema sp. JC4]
gi|383869584|gb|EID85195.1| Beta-galactosidase [Treponema sp. JC4]
Length = 592
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 109/352 (30%), Positives = 170/352 (48%), Gaps = 32/352 (9%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
++DGK + SGSIHY R PE W + + K K G +ETY+ WN EP +G++ F+
Sbjct: 9 TFLLDGKPFQIISGSIHYFRVVPEYWQDRLEKLKNMGCNTVETYIPWNITEPRKGEFCFD 68
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G D +F+ Q+ GL+ +R PY CAEW GG P W+ +PG++ R N P+ + ++
Sbjct: 69 GLCDFEKFLDLAQKLGLYAIVRPSPYICAEWELGGLPSWIFTVPGLEPRCKNEPYYQNVR 128
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYG--NVEWAYGVGGELYVKWAADTAVNLN 188
+ ++ + + +GG IIL Q+ENEYG + +Y E ++ T +
Sbjct: 129 DYYKVLLPRLVNHQI--DKGGNIILMQIENEYGYYGKDMSYMHFLEGLMREGGITVPFVT 186
Query: 189 TSVPWVMCQQEDAPDPIINTCN-GFYCDGFTPNSPSK--------PIMWTENYSGWFLSF 239
+ PW D + T N G + N P+M E + GWF ++
Sbjct: 187 SDGPWGKMFIHGQCDGALPTGNFGSHARPLFANMKRMMKKTGNRGPLMCMEFWIGWFDAW 246
Query: 240 G-----YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV------AT 288
G + R ++DL + + + G N+YM+ GGTNFG G T
Sbjct: 247 GNKEHKTSKLKRNIKDLNYMLKK----GNV--NFYMFHGGTNFGFMNGSNYFTKLTPDTT 300
Query: 289 SYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEA 340
SYDYDAP+ E G I + K+ + + K + EE +S+ QK K++A
Sbjct: 301 SYDYDAPLSEDGKITE-KYRTFQSIIKKYRDFEEMPLSTK-IEQKAYGKVKA 350
>gi|160885481|ref|ZP_02066484.1| hypothetical protein BACOVA_03481 [Bacteroides ovatus ATCC 8483]
gi|423290348|ref|ZP_17269197.1| hypothetical protein HMPREF1069_04240 [Bacteroides ovatus
CL02T12C04]
gi|156109103|gb|EDO10848.1| glycosyl hydrolase family 35 [Bacteroides ovatus ATCC 8483]
gi|392665735|gb|EIY59258.1| hypothetical protein HMPREF1069_04240 [Bacteroides ovatus
CL02T12C04]
Length = 778
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 106/323 (32%), Positives = 155/323 (47%), Gaps = 27/323 (8%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
++DGK V+++ +HY R W I K G+ I Y+FWN HE G++ F
Sbjct: 35 TFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFS 94
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL + RT + + E +
Sbjct: 95 GQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVG 154
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
F+ ++ + L ++GG II+ QVENEYG +YG + YV D T
Sbjct: 155 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGT-DKPYVSAVRDLVRESGFT 207
Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
VP C +A D +I T N G D P P+M +E +SGWF
Sbjct: 208 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
+G RP +D+ + + +F + YM GGT FG G + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326
Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
DAPI E G+ + K+ LR+L K
Sbjct: 327 DAPISEAGWTTE-KFFLLRDLLK 348
Score = 41.2 bits (95), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 81/197 (41%), Gaps = 47/197 (23%)
Query: 512 LNEGINTLDILSMMVGLQNYG-AWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVE 569
L +GI LDIL +G N+ + D G ++L +G + W +Y V+
Sbjct: 457 LKKGIQ-LDILVEAMGRVNFDKSIHDRKGI----TEKVELISGNQTKELKNWTVYNFPVD 511
Query: 570 GEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVN 629
+I K S K T+P +YK+TF + G L++++ GKG WVN
Sbjct: 512 YSFIKDKKYSDT-----KILPTMPA-----YYKSTFTL-DKVGDTFLDMSTWGKGMVWVN 560
Query: 630 GQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLV 689
G ++GR+W P QTL+ +P W+ GEN ++
Sbjct: 561 GHAMGRFWEI----------------------------GPQQTLF-MPGCWLKEGENEIL 591
Query: 690 IHEELGGDPSKISLLTK 706
+ + G + I L K
Sbjct: 592 VLDLKGPTRASIKGLKK 608
>gi|153807689|ref|ZP_01960357.1| hypothetical protein BACCAC_01971 [Bacteroides caccae ATCC 43185]
gi|149130051|gb|EDM21263.1| glycosyl hydrolase family 35 [Bacteroides caccae ATCC 43185]
Length = 775
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 113/330 (34%), Positives = 161/330 (48%), Gaps = 34/330 (10%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
V ++ I+GK L G +HYPR E W + + +++ GL + YVFWN+HE
Sbjct: 29 QVKIENGTFNINGKDVQLICGEMHYPRIPHEYWRDRLHRARAMGLNTVSAYVFWNFHERQ 88
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
G + F G+ D+ FV+ QE GL++ LR GPY CAEW++GG+P WL + +R+ +
Sbjct: 89 PGVFDFSGQADIAEFVRIAQEEGLYVILRPGPYVCAEWDFGGYPSWLLKEKDLTYRSKDP 148
Query: 124 PFKEEMKRFLAKIIDLMKQ-ENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
F +R+ I +L KQ L + GG II+ QVENEYG +Y E Y+ D
Sbjct: 149 RFMSYCERY---IKELGKQLAPLTINNGGNIIMVQVENEYG----SYAADKE-YLAAIRD 200
Query: 183 TAVNLNTSVPWVMCQ---QEDAPD--PIINTCNGFYCDGF----TPNSPSKPIMWTENYS 233
+VP C Q +A + T NG + + P P E Y
Sbjct: 201 MLQEAGFNVPLFTCDGGGQVEAGHIAGALPTLNGVFGEDIFKIVDKYHPGGPYFVAEFYP 260
Query: 234 GWFLSFGY---AVPF-RPVEDLAFAVARFFETGGTFQNYYMYFGGTNF-----GRTAGG- 283
WF +G +V + RP E L + + G + YM+ GGTNF T+GG
Sbjct: 261 AWFDEWGKRHSSVAYERPAEQLDWMLGH-----GVSVSMYMFHGGTNFWYMNGANTSGGF 315
Query: 284 PLVATSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDAP+ E+G PK+ RE+
Sbjct: 316 RPQPTSYDYDAPLGEWGNCY-PKYHAFREI 344
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 68/277 (24%), Positives = 118/277 (42%), Gaps = 58/277 (20%)
Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTS---DYLWYTASIHVMPGQGKEVFLN 475
++ F+ E K +F + +E + + +D Y+ Y +I PG+ K L
Sbjct: 364 TTTFATVELKESAPLTTAFHQTIQSEDVLSMEDVGADFGYIHYQTTIKT-PGKQK---LI 419
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
I+ L A++ V+ K VA + D + +++++ TL+IL G NYG
Sbjct: 420 IQDLRDYAVILVDGKQVA----SLDRRYNQNSTTLDIHKVPATLEILVENTGRVNYGPDI 475
Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
G+ S +L G L+ G + L K +++ SF ++ +P
Sbjct: 476 LFNRKGITSQVLW----GNEKLT--------GWSITPLPLYKEEVSSLSFGQEIKGVPA- 522
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
+++ TF+ E +G ++++ GKG WVNG+S+GR+W+
Sbjct: 523 ----FHRGTFII-EQQGDCFVDMSQWGKGAVWVNGKSLGRFWNI---------------- 561
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHE 692
P QTLY IP W+ GEN +V+ E
Sbjct: 562 ------------GPQQTLY-IPAPWLKKGENEIVVFE 585
>gi|414880685|tpg|DAA57816.1| TPA: putative RAN GTPase activating family protein [Zea mays]
Length = 598
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 102/292 (34%), Positives = 146/292 (50%), Gaps = 54/292 (18%)
Query: 269 YMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSD 328
+ Y GGTNFGRT+GGP + TSYDYDAP+DEYG IRQPK+GHL++LH I+ E+ L+
Sbjct: 308 FKYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNIRQPKYGHLKDLHDLIRSMEKILV--- 364
Query: 329 PTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNV 388
H K Y+ +S A + D VT +G + +PAWSVSILPDCK V
Sbjct: 365 --HGK---------YNDTSYGKNAIFVD----RDVKVTLSGGTHLVPAWSVSILPDCKTV 409
Query: 389 VFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVG--ISGNR-SFVRPDLAEQ 445
+NTAK+ +Q + ++ N E + +SW E + ++ +R SF L EQ
Sbjct: 410 AYNTAKIKTQTS----VMVKKANSVEKEPEALRWSWMPENLKPFMTDHRDSFRHSQLLEQ 465
Query: 446 INTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVN----------------K 489
I T+ D SDYLWY S+ G+G L + + GH + +
Sbjct: 466 ITTSTDQSDYLWYRTSLE-HKGEGSYT-LYVNTSGHEMAKLLGRWSVRLPAPVSGEAPLR 523
Query: 490 KLVAFGYGNHDFAN-----------FLINKKIELNEGINTLDILSMMVGLQN 530
K + F H F + ++L+ G N + +LS VGL++
Sbjct: 524 KELRFSPQRHSRTQGQNYSADGAFVFQLQSPVKLHSGKNYVSLLSGTVGLKS 575
>gi|312903555|ref|ZP_07762735.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
gi|422689128|ref|ZP_16747240.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
gi|422731840|ref|ZP_16788189.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
gi|310633431|gb|EFQ16714.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
gi|315162138|gb|EFU06155.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
gi|315577890|gb|EFU90081.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
Length = 604
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 106/324 (32%), Positives = 156/324 (48%), Gaps = 25/324 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV W+ HEP +G ++FEG
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWDLHEPQKGTFHFEG 79
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
+ + R ++LA +V G N YM+ GGTNFG G TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314
Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
YDAP+DE G + + + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|442626280|ref|NP_001260120.1| beta galactosidase, isoform B [Drosophila melanogaster]
gi|440213416|gb|AGB92656.1| beta galactosidase, isoform B [Drosophila melanogaster]
Length = 670
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 106/325 (32%), Positives = 158/325 (48%), Gaps = 39/325 (12%)
Query: 6 TYDHRA--LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
T DH A ++DG+ SGS HY R+ PE W +R + GL ++TYV W+ H P
Sbjct: 45 TIDHEANTFMLDGQPFRYVSGSFHYFRAVPESWRSRLRTMRASGLNALDTYVEWSLHNPH 104
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHF-IPGIQFRTTN 122
G+Y +EG D+V+F++ QE ++ LR GPY CAE + GG P WL P I+ RT +
Sbjct: 105 DGEYNWEGIADVVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFTKYPSIKMRTND 164
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
+ E+ ++ A+++ + ++LF GG II+ QVENEYG+ + Y+ W D
Sbjct: 165 PNYISEVGKWYAELMP--RLQHLFVGNGGKIIMVQVENEYGDYACDHD-----YLNWLRD 217
Query: 183 --------TAVNLNTSVP--WVMCQQ-------EDAPDPIINTCNGFYCDGFTPNSPSKP 225
A+ +P + C + D IN + + P+ P
Sbjct: 218 ETEKYVSGKALLFTVDIPNEKMSCGKIENVFATTDFGIDRINEIDKIWA-MLRALQPTGP 276
Query: 226 IMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-- 283
++ +E Y GW + R +++A A+ + N YM+FGGTNFG TAG
Sbjct: 277 LVNSEFYPGWLTHWQEQNQRRDGQEVANALRTILSYNASV-NLYMFFGGTNFGFTAGANY 335
Query: 284 --------PLVATSYDYDAPIDEYG 300
TSYDYDA +DE G
Sbjct: 336 NLDGGIGYAADITSYDYDAVMDEAG 360
>gi|24582088|ref|NP_608978.2| beta galactosidase, isoform A [Drosophila melanogaster]
gi|21430516|gb|AAM50936.1| LP09580p [Drosophila melanogaster]
gi|22945722|gb|AAF52321.2| beta galactosidase, isoform A [Drosophila melanogaster]
Length = 672
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 106/325 (32%), Positives = 158/325 (48%), Gaps = 39/325 (12%)
Query: 6 TYDHRA--LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
T DH A ++DG+ SGS HY R+ PE W +R + GL ++TYV W+ H P
Sbjct: 47 TIDHEANTFMLDGQPFRYVSGSFHYFRAVPESWRSRLRTMRASGLNALDTYVEWSLHNPH 106
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHF-IPGIQFRTTN 122
G+Y +EG D+V+F++ QE ++ LR GPY CAE + GG P WL P I+ RT +
Sbjct: 107 DGEYNWEGIADVVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFTKYPSIKMRTND 166
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
+ E+ ++ A+++ + ++LF GG II+ QVENEYG+ + Y+ W D
Sbjct: 167 PNYISEVGKWYAELMP--RLQHLFVGNGGKIIMVQVENEYGDYACDHD-----YLNWLRD 219
Query: 183 --------TAVNLNTSVP--WVMCQQ-------EDAPDPIINTCNGFYCDGFTPNSPSKP 225
A+ +P + C + D IN + + P+ P
Sbjct: 220 ETEKYVSGKALLFTVDIPNEKMSCGKIENVFATTDFGIDRINEIDKIWA-MLRALQPTGP 278
Query: 226 IMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-- 283
++ +E Y GW + R +++A A+ + N YM+FGGTNFG TAG
Sbjct: 279 LVNSEFYPGWLTHWQEQNQRRDGQEVANALRTILSYNASV-NLYMFFGGTNFGFTAGANY 337
Query: 284 --------PLVATSYDYDAPIDEYG 300
TSYDYDA +DE G
Sbjct: 338 NLDGGIGYAADITSYDYDAVMDEAG 362
>gi|257090118|ref|ZP_05584479.1| beta-galactosidase [Enterococcus faecalis CH188]
gi|256998930|gb|EEU85450.1| beta-galactosidase [Enterococcus faecalis CH188]
Length = 594
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 107/325 (32%), Positives = 158/325 (48%), Gaps = 27/325 (8%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV W+ HEP +G ++FEG
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWDLHEPQKGTFHFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
+ + R ++LA +V G N YM+ GGTNFG G P + TSY
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 303
Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
DYDAP+DE G + + + LH+
Sbjct: 304 DYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|313149603|ref|ZP_07811796.1| glycoside hydrolase family 35 [Bacteroides fragilis 3_1_12]
gi|313138370|gb|EFR55730.1| glycoside hydrolase family 35 [Bacteroides fragilis 3_1_12]
Length = 628
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 112/346 (32%), Positives = 162/346 (46%), Gaps = 46/346 (13%)
Query: 15 DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
+GK + SG +HY R + W ++ K GL + TYVFWN HEP G++ F G +
Sbjct: 37 NGKVTPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 75 LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
L F+KT E G+ + LR GPY CAEW +GG+P WL + G++ R N F + K +
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAY-- 154
Query: 135 KIIDLMKQE--NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS- 190
ID + +E NL ++GGPI++ Q ENE+G+ V + E + + A L +
Sbjct: 155 --IDRLYKEVGNLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAG 212
Query: 191 --VPWVMCQ-----QEDAPDPIINTCNG------------FYCDGFTPNSPSKPIMWTEN 231
VP + A + T NG Y DG P M E
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVNQYHDG------KGPYMVAEF 266
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA---- 287
Y GW + P +A ++ + +F N+YM GGTNFG T+G
Sbjct: 267 YPGWLSHWAEPFPQVGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDI 325
Query: 288 ----TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
TSYDYDAPI E G++ PK+ +R + I+ +Y + P
Sbjct: 326 QPDLTSYDYDAPISEAGWV-TPKYDSIRNV---IRKYVKYTVPEAP 367
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 89/372 (23%), Positives = 142/372 (38%), Gaps = 59/372 (15%)
Query: 341 HIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP----AWSVSILPDCKNVVFNTAKVI 396
++ H +N ANYD D Y P W +NV+ K
Sbjct: 303 YMVHGGTNFGFTSGANYDKKRDIQPDLTSYDYDAPISEAGWVTPKYDSIRNVIRKYVKYT 362
Query: 397 SQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYL 456
+P + ++ +L + ++ E++ +S + P EQ+N Y+
Sbjct: 363 VPEAPAPNPVIEIPSI-KLTKVADVLAFAEKQKPVSADT----PLTFEQLN---QGYGYV 414
Query: 457 WYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGI 516
YT + P G L I L A+V+V+ + V G N + + + ++ N
Sbjct: 415 LYTRHFN-QPISGT---LEIPGLRDYAVVYVDGEQV--GVLNRNTKTYSMEIEVPFNA-- 466
Query: 517 NTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGL 575
TL IL +G NYG+ G+ S + I K +GEW +YQ+ + E L
Sbjct: 467 -TLQILVENMGRINYGSEIVHNTKGIISPVKIAGKE-----ITGEWDMYQLPM-SEMPDL 519
Query: 576 DKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGR 635
K+ A++ + + Y+ TF + G +++ + GKG +VNG +IGR
Sbjct: 520 AKLK-ADAHANVPAEAAKLKGCPVLYEGTFTL-DNVGDTFIDMENWGKGIIFVNGVNIGR 577
Query: 636 YWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
YW P QTLY IP W+ G N +VI E+L
Sbjct: 578 YWKV----------------------------GPQQTLY-IPGVWLKKGTNKIVIFEQLN 608
Query: 696 GDPSKISLLTKT 707
P KT
Sbjct: 609 EVPQAEVKTVKT 620
>gi|257876100|ref|ZP_05655753.1| glycosyl hydrolase [Enterococcus casseliflavus EC20]
gi|257810266|gb|EEV39086.1| glycosyl hydrolase [Enterococcus casseliflavus EC20]
Length = 591
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 118/352 (33%), Positives = 169/352 (48%), Gaps = 43/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DGK L SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 10 FLLDGKPIKLISGAIHYFRMTPAQWTDSLYNLKALGANTVETYIPWNLHEPREGVYDFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ FVK Q GL + LR Y CAEW +GG P WL P ++ R+T+ F +++
Sbjct: 70 MKDICAFVKQAQALGLMVILRPSVYICAEWEFGGLPAWLLNEP-MRLRSTDPRFMAKVRN 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L + GGP+I+ QVENEYG +YG+ + Y++ + V
Sbjct: 129 YFQVL--LPKLVPLQITHGGPVIMMQVENEYG----SYGM-EKAYLRQTKELMEEYGIDV 181
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF-TPNSPSK-------------------PIMWTEN 231
P + + A + +++ D F T N S+ PIM E
Sbjct: 182 P--LFTSDGAWEEVLDAGTLIEDDVFVTGNFGSRSKENAAVMKEFMAKHGKNWPIMCMEY 239
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR----TAGGPL-- 285
+ GWF +G + R +DLA V G N YM+ GGTNFG +A G L
Sbjct: 240 WDGWFNRWGEPIIKRAGQDLANEVKEMLAVGSL--NLYMFHGGTNFGFYNGCSARGALDL 297
Query: 286 -VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGA 336
+SYDYDA + E G +P ++ KAIK + ++P ++L A
Sbjct: 298 PQVSSYDYDALLTEAG---EPT-DKYYQVQKAIKEACPEVWQANPRTKQLAA 345
>gi|1911627|gb|AAB50770.1| beta-galactosidase [dogs, spleen, Peptide Partial, 667 aa]
Length = 667
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 114/349 (32%), Positives = 161/349 (46%), Gaps = 34/349 (9%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+ Y H + DG+ SGSIHY W + + K K GL I+TYV WN+HEP
Sbjct: 33 TIDYSHNRFLKDGQPFRYISGSIHYSHVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQ 92
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
GQY F G D+ F+K E GL + LR GPY CAEW+ GG P WL I R+++
Sbjct: 93 PGQYQFSGEQDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDP 152
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
+ + ++L ++ MK L GGPII QVENEYG +Y Y+++
Sbjct: 153 DYLAAVDKWLGVLLPKMKP--LLYQNGGPIITMQVENEYG----SYFTCDYDYLRFLQKL 206
Query: 184 -AVNLNTSVPWVMCQQEDAPDPIINTC---NGFYCD-GFTP-------------NSPSKP 225
+L V+ D + + C G Y F P + P P
Sbjct: 207 FHHHLGND---VLLFTTDGANELFLQCGALQGLYATVDFGPGANITAAFQIQRKSEPKGP 263
Query: 226 IMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL 285
++ +E Y+GW +G E +A ++ G N YM+ GGTNF G +
Sbjct: 264 LVNSEFYTGWLDHWGQPHSTVRTEVVASSLHDILAHGANV-NLYMFIGGTNFAYWNGANM 322
Query: 286 ----VATSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIKLCEEYLISSDP 329
TSYDYDAP+ E + + K+ LRE + K K+ E ++ S P
Sbjct: 323 PYQAQPTSYDYDAPLSEAADLTE-KYFALREVIRKFEKVPEGFIPPSTP 370
>gi|423248537|ref|ZP_17229553.1| hypothetical protein HMPREF1066_00563 [Bacteroides fragilis
CL03T00C08]
gi|423253485|ref|ZP_17234416.1| hypothetical protein HMPREF1067_01060 [Bacteroides fragilis
CL03T12C07]
gi|392657385|gb|EIY51022.1| hypothetical protein HMPREF1067_01060 [Bacteroides fragilis
CL03T12C07]
gi|392659750|gb|EIY53368.1| hypothetical protein HMPREF1066_00563 [Bacteroides fragilis
CL03T00C08]
Length = 773
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 106/332 (31%), Positives = 159/332 (47%), Gaps = 29/332 (8%)
Query: 10 RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
R +++G V+++ +HY R W I K G+ I Y+FWNYHE G++ F
Sbjct: 31 RTFLLNGNPFVVKAAELHYARIPEPYWEHRILMCKALGMNTICLYMFWNYHEQQEGKFDF 90
Query: 70 EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
G ++ +F K Q+ G+++ LR GPYACAEW GG P WL ++ R+ N F E
Sbjct: 91 SGEKNVAKFCKLAQKHGMYIILRPGPYACAEWEMGGLPWWLLKEKDMKVRSLNPYFMERT 150
Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGG-------ELYVKWAAD 182
+ F+ ++ + L + GG II+ QVENE+G YGV ++ + D
Sbjct: 151 EIFMKELGKQLAPLQL--ANGGNIIMVQVENEFG----GYGVDKPYMTAIRDIVCRAGFD 204
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGW 235
+V W + +A D ++ T N G D + P P+M +E +SGW
Sbjct: 205 KSVLFQCD--WDSTFELNALDDLLWTLNFGTGANIDKEFKKLSTVRPDTPLMCSEFWSGW 262
Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSY 290
F +G RP E + + + +F + YM GGT FG G + +SY
Sbjct: 263 FDHWGRKHETRPAEKMVEGIKDMLDRNISF-SLYMTHGGTTFGHWGGANSPTYSAMCSSY 321
Query: 291 DYDAPIDEYGFIRQPKWGHLRELHKAIKLCEE 322
DYDAPI E G+ PK+ L+EL + EE
Sbjct: 322 DYDAPISEAGWT-TPKYYLLQELLGKYRSPEE 352
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 56/254 (22%), Positives = 99/254 (38%), Gaps = 50/254 (19%)
Query: 440 PDL--AEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYG 497
PD +EQ+ +D + +P L I A ++ + KL+ +
Sbjct: 382 PDFVQSEQVKPMEDFNQGWGSILYRTTLPATEANTLLRITEAHDWAQIYADGKLLGYLDR 441
Query: 498 NHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDL 557
D ++ +L EG LDI +G N+G+ V LI K K+ +
Sbjct: 442 RKDDNQVILP---QLPEGTQ-LDIWVEAMGRVNFGSTVHDRKGITEKVELI--KPDKQAV 495
Query: 558 SSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLAL 616
+ W +Y + V+ ++ K S +NS + +YK TF + G +
Sbjct: 496 TLKNWKVYSIPVDYKFAARKKYS-SNSR----------PEGPAYYKATFNLTK-TGDTFI 543
Query: 617 NLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHI 676
++++ GKG WVNG ++GR+W P QTL+ +
Sbjct: 544 DMSTWGKGMVWVNGHALGRFWEI----------------------------GPQQTLF-L 574
Query: 677 PRTWVHPGENLLVI 690
P W+ G+N +++
Sbjct: 575 PGCWLKKGKNEIIV 588
>gi|414879450|tpg|DAA56581.1| TPA: hypothetical protein ZEAMMB73_811947 [Zea mays]
Length = 154
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 63/102 (61%), Positives = 85/102 (83%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
VTYD RAL++DG RR+L SG +HYPRSTPE+WP+LI K+K+GGL+VI+TYVFWN HEP
Sbjct: 36 GEVTYDGRALILDGARRMLFSGDMHYPRSTPEMWPDLIAKAKKGGLDVIQTYVFWNAHEP 95
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYG 104
++GQ+ FEGR+DLV+F++ + GL++ LRIGP+ +EW YG
Sbjct: 96 VQGQFNFEGRYDLVKFIREIHAQGLYVSLRIGPFVESEWKYG 137
>gi|332672111|ref|YP_004455119.1| beta-galactosidase [Cellulomonas fimi ATCC 484]
gi|332341149|gb|AEE47732.1| Beta-galactosidase [Cellulomonas fimi ATCC 484]
Length = 583
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 101/307 (32%), Positives = 149/307 (48%), Gaps = 33/307 (10%)
Query: 15 DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
DG + SG++HY R P+ W + + +++E GL IETY+ WN H P RG++ +G D
Sbjct: 14 DGTPVRILSGALHYFRHHPDQWRDRLTRARELGLNTIETYIPWNAHSPARGEFRTDGILD 73
Query: 75 LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
L RF+ V G++ +R GPY CAEW GG P WL F G R + ++ +
Sbjct: 74 LGRFLDEVAAQGMWAIVRPGPYICAEWTGGGLPGWL-FTAGAAVRRHEPTYLAAIQDYYE 132
Query: 135 KIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGE---LYVKWAADTAV-----N 186
+ ++ + +GGP++L QVENEYG AYG + VK ++ +
Sbjct: 133 AVAGIVAPRQV--DRGGPVVLVQVENEYG----AYGDDKDYLRALVKLLRESGITTPLTT 186
Query: 187 LNTSVPWVMCQQEDAPDPIINTCNGFYCDG------FTPNSPSKPIMWTENYSGWFLSFG 240
++ PW++ E+ P ++ F + P+ P+M E + GWF S+G
Sbjct: 187 IDQPEPWML---ENGSLPELHKTGSFGSRAAERLATLREHQPTGPLMCAEFWDGWFDSWG 243
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PLVATSYDYD 293
A + G + N YM GGTNFG T G P+V TSYDYD
Sbjct: 244 LHHHTTDAAASAHELDTLLAAGASV-NLYMVCGGTNFGFTNGANDKGTYVPIV-TSYDYD 301
Query: 294 APIDEYG 300
AP+DE G
Sbjct: 302 APLDEAG 308
>gi|350588684|ref|XP_003130139.3| PREDICTED: galactosidase, beta 1-like 3 [Sus scrofa]
Length = 656
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 111/318 (34%), Positives = 152/318 (47%), Gaps = 17/318 (5%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++G ++ GSIHY R E W + + K K G + TYV WN HEP RG++ F G
Sbjct: 82 FTLEGHEFLILGGSIHYFRVPRESWRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSG 141
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ F+ E GL++ LR GPY C+E + GG P L P Q RTTN+ F E +
Sbjct: 142 NLDMEAFILLAAEVGLWVILRPGPYICSEIDLGGLPSRLLQDPTSQLRTTNHSFIEAVDE 201
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+L +I + L +GGPII QVENEYG+ L+ V L +
Sbjct: 202 YLDHLI--ARVVPLQYRKGGPIIAVQVENEYGSFHKDEAYMPYLHKALLKRGIVELLLTS 259
Query: 192 PWVMCQQEDAPDPIINTCN------GFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
+ ++ T N G + D + S +KPI+ E + GWF ++G
Sbjct: 260 DNTNEVLKGHIKGVLATVNMKSFKEGEFKDLYQVQS-NKPILIMEFWVGWFDTWGNKHAV 318
Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSYDYDAPIDEY 299
R D+ + F +F N YM+ GGTNFG G V TSYDYDA + E
Sbjct: 319 RDAIDVENTIFDFIRLEISF-NVYMFHGGTNFGFMNGATYFEQHRGVVTSYDYDAVLTEA 377
Query: 300 GFIRQPKWGHLRELHKAI 317
G PK+ LREL K+I
Sbjct: 378 G-DYTPKFFKLRELFKSI 394
>gi|422698394|ref|ZP_16756303.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1346]
gi|315173078|gb|EFU17095.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1346]
Length = 604
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 106/324 (32%), Positives = 155/324 (47%), Gaps = 25/324 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N F + P+M E + GWF
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFDMMQAFFEEHGKKWPLMCMEFWDGWFNR 256
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVAT-------SYD 291
+ + R ++LA +V G N YM+ GGTNFG G T SYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314
Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
YDAP+DE G + + + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|422735885|ref|ZP_16792151.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
gi|315167420|gb|EFU11437.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
Length = 604
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 106/324 (32%), Positives = 155/324 (47%), Gaps = 25/324 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R P W + K G +ETYV WN HEP +G ++FEG
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+K QE GL+ +R PY CAEW +GGFP WL PG + R+ N + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
+ +++ + L + GG I++ Q+ENEYG+ E AY + TA +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196
Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
PW + + D I+ T N G F + P+M E + GWF
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
+ + R ++LA +V G N YM+ GG NFG G TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGINFGFMNGCSARGTIDLPQITSYD 314
Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
YDAP+DE G + + + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|297194972|ref|ZP_06912370.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
gi|297152570|gb|EFH31854.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
Length = 599
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 110/335 (32%), Positives = 160/335 (47%), Gaps = 36/335 (10%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A+ T ++DG+ L SG++HY R W + + GL +ETYV WN HEP
Sbjct: 9 ADFTVGDTDFLLDGRPVRLLSGALHYFRVHEGQWGHRLAMLRAMGLNCVETYVPWNLHEP 68
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G+Y +G L RF+ V AG++ +R GPY CAEW GG P WL G + RT +
Sbjct: 69 EPGRYADDG--ALGRFLDAVHAAGMWAIVRPGPYICAEWENGGLPFWLTGRVGRRVRTED 126
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN-----------VEW--AY 169
+ ++R+ +++ + + + ++GGP+++ QVENEYG+ VE +
Sbjct: 127 PEYLGHVERWFTRLLPQVVEREI--TRGGPVVMVQVENEYGSYGSDGGYLRQLVELLRSC 184
Query: 170 GVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWT 229
GVG L+ + + SVP V+ + G + P+ P+M
Sbjct: 185 GVGVPLFTSDGPEDHMLSGGSVPGVLATVN------FGSGAGEAFAALRRHRPTGPLMCM 238
Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL 285
E + GWF +G R ED A A+ E G + N YM GGT+FG AG G L
Sbjct: 239 EFWCGWFEHWGAEPARRDAEDAARALREILEAGASV-NVYMAHGGTSFGGWAGANRSGEL 297
Query: 286 -------VATSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDAP+DE G + W RE+
Sbjct: 298 HDGVLEPTVTSYDYDAPVDEAGRPTEKFW-RFREV 331
>gi|256423546|ref|YP_003124199.1| beta-galactosidase [Chitinophaga pinensis DSM 2588]
gi|256038454|gb|ACU61998.1| Beta-galactosidase [Chitinophaga pinensis DSM 2588]
Length = 610
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 102/316 (32%), Positives = 152/316 (48%), Gaps = 31/316 (9%)
Query: 6 TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
T A ++DGK + SG IHYPR E W + ++ +K GL I TYVFWN HEP +G
Sbjct: 27 TLGDTAFLLDGKPLQMISGEIHYPRVPRECWRDRMKMAKAMGLNTIGTYVFWNVHEPEKG 86
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
QY F G D+ FVK +E L++ LR PY CAEW +GG+P WL I G++ R+ +
Sbjct: 87 QYDFSGNNDIAAFVKMAKEEDLWVVLRPSPYVCAEWEFGGYPYWLQEIKGLKVRSKEPQY 146
Query: 126 KEEMKRFLAKIIDLMKQEN-LFASQGGPIILAQVENEYGNV---EWAYGVGGELYVKWAA 181
E + + I+ + KQ + L + GG I++ Q+ENEYG+ + + +++V+
Sbjct: 147 LEAYRNY---IMAVGKQLSPLLVTHGGNILMVQIENEYGSYSDDKDYLDINRKMFVEAGF 203
Query: 182 D--------TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYS 233
D A N +P ++ DP+ + + P W +
Sbjct: 204 DGLLYTCDPKAAIKNGHLPGLLPAINGVDDPL--QVKQLINENHSGKGPYYIAEWYPAWF 261
Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG---------P 284
W+ + + VP+R +V G N YM+ GGT G G P
Sbjct: 262 DWWGTKHHTVPYRQYLGKLDSVL----AAGISINMYMFHGGTTRGFMNGANANDADPYEP 317
Query: 285 LVATSYDYDAPIDEYG 300
+ +SYDYDAP+DE G
Sbjct: 318 QI-SSYDYDAPLDEAG 332
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 51/224 (22%), Positives = 85/224 (37%), Gaps = 51/224 (22%)
Query: 469 GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGL 528
G++ L ++ L +V VN K G D + + ++L G LD+L +G
Sbjct: 413 GRKGLLQLKELRDYCVVMVNGKRA----GVLDRRSKRDSIALDLPAGKVKLDLLVENLGR 468
Query: 529 QNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQ 588
N+G + G+ +L D + K G + + DK+ + K
Sbjct: 469 INFGPYLLSNRKGITEKVLFDRQELK------------GWQQYGLPFDKLPAVAAKGIKA 516
Query: 589 GSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
G+ +P Y+ + G L++++ GKG W+NG +GRYW
Sbjct: 517 GANVPT------YRQGTFTLDKTGDTWLDMSNWGKGAVWINGHHLGRYWQV--------- 561
Query: 649 KKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHE 692
P QT+Y +P W+ G N +VI E
Sbjct: 562 -------------------GPQQTIY-VPAEWLKKGMNDIVIME 585
>gi|293370654|ref|ZP_06617206.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
gi|292634388|gb|EFF52925.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
Length = 778
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 107/323 (33%), Positives = 155/323 (47%), Gaps = 27/323 (8%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
++DGK V+++ +HY R W I K G+ I Y+FWN HE G++ F
Sbjct: 35 TFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFS 94
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL I RT + + E +
Sbjct: 95 GQNDIATFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIALRTLDPYYMERVG 154
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
F+ ++ + L ++GG II+ QVENEYG +YG+ + YV D S
Sbjct: 155 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGI-DKPYVSAVRDLVRESGFS 207
Query: 191 -VPWVMCQ-----QEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWTENYSGWFL 237
VP C +A D +I T N G D P P+M +E +SGWF
Sbjct: 208 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKRLKELRPETPLMCSEFWSGWFD 267
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
+G RP +D+ + + +F + YM GGT FG G + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326
Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
DAPI E G+ K+ LR+L K
Sbjct: 327 DAPISEPGWTTD-KFFLLRDLLK 348
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 47/107 (43%), Gaps = 30/107 (28%)
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
+YK+TF + G L++++ GKG WVNG ++GR+W
Sbjct: 532 YYKSTFTL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------------- 570
Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
P QTL+ +P W+ GEN +++ + G + I L K
Sbjct: 571 --------GPQQTLF-MPGCWLKEGENEILVLDLKGPTRASIKGLKK 608
>gi|290956543|ref|YP_003487725.1| glycosyl hydrolase family 42 [Streptomyces scabiei 87.22]
gi|260646069|emb|CBG69162.1| putative glycosyl hydrolase (family 42) [Streptomyces scabiei
87.22]
Length = 591
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 108/330 (32%), Positives = 156/330 (47%), Gaps = 31/330 (9%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP-I 63
+T +++G+ + SG++HY R P++W + +RK++ GL +ETYV WN H+P
Sbjct: 6 LTTSSDGFLLNGEPFRIVSGAMHYFRIHPDLWADRLRKARLMGLNTVETYVPWNLHQPDP 65
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+G DL R++ + GL + LR GPY CAEW+ GG P WL PGI+ R+++
Sbjct: 66 DSPLVLDGLLDLPRYLSLARAEGLHVLLRPGPYICAEWDGGGLPSWLTSDPGIRLRSSDP 125
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
F + + +L I L A+ GGP+I QVENEYG AYG Y+K
Sbjct: 126 RFTDALDGYLD--ILLPPLLPYMAANGGPVIAVQVENEYG----AYG-DDTAYLKHVHQA 178
Query: 184 AVNLNTSVPWVMCQQEDA---------PDPIINTCNGFYCD----GFTPNSPSKPIMWTE 230
C Q + P + G + + P P+M +E
Sbjct: 179 LRARGVEELLFTCDQAGSGHHLAAGSLPGVLSTATFGGKIEESLAALRAHMPEGPLMCSE 238
Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG-------G 283
+ GWF +G R E A + + G + N YM+ GGTNFG T G
Sbjct: 239 FWIGWFDHWGEEHHVRDAESAAADLDKLLAAGASV-NIYMFHGGTNFGFTNGANHDQCYA 297
Query: 284 PLVATSYDYDAPIDEYGFIRQPKWGHLREL 313
P+V TSYDYDA + E G PK+ RE+
Sbjct: 298 PIV-TSYDYDAALTESG-DPGPKYHAFREV 325
>gi|336412039|ref|ZP_08592497.1| hypothetical protein HMPREF1018_04515 [Bacteroides sp. 2_1_56FAA]
gi|423261296|ref|ZP_17242197.1| hypothetical protein HMPREF1055_04474 [Bacteroides fragilis
CL07T00C01]
gi|423267821|ref|ZP_17246801.1| hypothetical protein HMPREF1056_04488 [Bacteroides fragilis
CL07T12C05]
gi|423272270|ref|ZP_17251238.1| hypothetical protein HMPREF1079_04320 [Bacteroides fragilis
CL05T00C42]
gi|423276726|ref|ZP_17255658.1| hypothetical protein HMPREF1080_04311 [Bacteroides fragilis
CL05T12C13]
gi|423283105|ref|ZP_17261990.1| hypothetical protein HMPREF1204_01528 [Bacteroides fragilis HMW
615]
gi|335939211|gb|EGN01088.1| hypothetical protein HMPREF1018_04515 [Bacteroides sp. 2_1_56FAA]
gi|387774329|gb|EIK36442.1| hypothetical protein HMPREF1055_04474 [Bacteroides fragilis
CL07T00C01]
gi|392695462|gb|EIY88674.1| hypothetical protein HMPREF1079_04320 [Bacteroides fragilis
CL05T00C42]
gi|392695591|gb|EIY88799.1| hypothetical protein HMPREF1056_04488 [Bacteroides fragilis
CL07T12C05]
gi|392696055|gb|EIY89256.1| hypothetical protein HMPREF1080_04311 [Bacteroides fragilis
CL05T12C13]
gi|404581379|gb|EKA86078.1| hypothetical protein HMPREF1204_01528 [Bacteroides fragilis HMW
615]
Length = 628
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 120/384 (31%), Positives = 177/384 (46%), Gaps = 47/384 (12%)
Query: 15 DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
+GK + SG +HY R + W ++ K GL + TYVFWN HEP G++ F G +
Sbjct: 37 NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 75 LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
L F+KT E G+ + LR GPY CAEW +GG+P WL + G++ R N F + K +
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAY-- 154
Query: 135 KIIDLMKQE--NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS- 190
ID + +E +L ++GGPI++ Q ENE+G+ V + E + + A L +
Sbjct: 155 --IDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAG 212
Query: 191 --VPWVMCQ-----QEDAPDPIINTCNG------------FYCDGFTPNSPSKPIMWTEN 231
VP + A + T NG Y DG P M E
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQYHDG------KGPYMVAEF 266
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA---- 287
Y GW + P +A ++ + +F N+YM GGTNFG T+G
Sbjct: 267 YPGWLSHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDI 325
Query: 288 ----TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY 343
TSYDYDAPI E G++ PK+ +R + IK +Y I P + ++ +
Sbjct: 326 QPDMTSYDYDAPISEAGWV-TPKYDSIRNV---IKKYVKYTIPEAPAPNPV-IEIPSIQL 380
Query: 344 HKSSNDCAAFLANYDSSSDANVTF 367
+K ++ A SSD +TF
Sbjct: 381 NKVADVLAFAEKQKPVSSDTPLTF 404
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 88/371 (23%), Positives = 143/371 (38%), Gaps = 57/371 (15%)
Query: 341 HIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP----AWSVSILPDCKNVVFNTAKVI 396
++ H +N ANYD D Y P W +NV+ K
Sbjct: 303 YMVHGGTNFGFTSGANYDKKRDIQPDMTSYDYDAPISEAGWVTPKYDSIRNVIKKYVKYT 362
Query: 397 SQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYL 456
+P + ++ +L + ++ E++ +S + P EQ+N Y+
Sbjct: 363 IPEAPAPNPVIEIPSI-QLNKVADVLAFAEKQKPVSSDT----PLTFEQLN---QGYGYV 414
Query: 457 WYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGI 516
YT + P G L I L A+V+V+ + V G N + + + ++ N
Sbjct: 415 LYTRHFN-QPISGT---LEIPGLRDYAVVYVDGEQV--GVLNRNTKTYSMEIEVPFNA-- 466
Query: 517 NTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLD 576
TL IL +G NYG+ G+ S + I +++ G +YQ+ ++ E L
Sbjct: 467 -TLQILVENMGRINYGSEIVHNTKGIISPVQI----AGKEIVGGWDMYQLPMD-EMPDLT 520
Query: 577 KISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRY 636
K+ A++ + + Y+ TF + G +++ S GKG +VNG +IGRY
Sbjct: 521 KLK-ADTHKNVPSEVAKLKGCPVLYEGTFTL-DKVGDTFMDMESWGKGIVFVNGVNIGRY 578
Query: 637 WSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGG 696
W P QTLY IP W+ GEN +VI E+L
Sbjct: 579 WKV----------------------------GPQQTLY-IPGVWLKKGENKIVIFEQLNE 609
Query: 697 DPSKISLLTKT 707
P KT
Sbjct: 610 TPQTEVKTVKT 620
>gi|265767790|ref|ZP_06095322.1| beta-galactosidase [Bacteroides sp. 2_1_16]
gi|263252462|gb|EEZ23990.1| beta-galactosidase [Bacteroides sp. 2_1_16]
Length = 628
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 120/384 (31%), Positives = 177/384 (46%), Gaps = 47/384 (12%)
Query: 15 DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
+GK + SG +HY R + W ++ K GL + TYVFWN HEP G++ F G +
Sbjct: 37 NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 75 LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
L F+KT E G+ + LR GPY CAEW +GG+P WL + G++ R N F + K +
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAY-- 154
Query: 135 KIIDLMKQE--NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS- 190
ID + +E +L ++GGPI++ Q ENE+G+ V + E + + A L +
Sbjct: 155 --IDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAG 212
Query: 191 --VPWVMCQ-----QEDAPDPIINTCNG------------FYCDGFTPNSPSKPIMWTEN 231
VP + A + T NG Y DG P M E
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQYHDG------KGPYMVAEF 266
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA---- 287
Y GW + P +A ++ + +F N+YM GGTNFG T+G
Sbjct: 267 YPGWLSHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDI 325
Query: 288 ----TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY 343
TSYDYDAPI E G++ PK+ +R + IK +Y I P + ++ +
Sbjct: 326 QPDMTSYDYDAPISEAGWV-TPKYDSIRNV---IKKYVKYTIPEAPAPNPV-IEIPSIQL 380
Query: 344 HKSSNDCAAFLANYDSSSDANVTF 367
+K ++ A SSD +TF
Sbjct: 381 NKVADVLAFAEKQKPVSSDTPLTF 404
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 87/371 (23%), Positives = 143/371 (38%), Gaps = 57/371 (15%)
Query: 341 HIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP----AWSVSILPDCKNVVFNTAKVI 396
++ H +N ANYD D Y P W +NV+ K
Sbjct: 303 YMVHGGTNFGFTSGANYDKKRDIQPDMTSYDYDAPISEAGWVTPKYDSIRNVIKKYVKYT 362
Query: 397 SQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYL 456
+P + ++ +L + ++ E++ +S + P EQ+N Y+
Sbjct: 363 IPEAPAPNPVIEIPSI-QLNKVADVLAFAEKQKPVSSDT----PLTFEQLN---QGYGYV 414
Query: 457 WYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGI 516
YT + P G L I L A+V+V+ + V G N + + + ++ N
Sbjct: 415 LYTRHFN-QPISGT---LEIPGLRDYAVVYVDGEQV--GVLNRNTKTYSVEIEVPFNA-- 466
Query: 517 NTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLD 576
TL IL +G NYG+ G+ S + I +++ G +YQ+ ++ E L
Sbjct: 467 -TLQILVENMGRINYGSEIVHNTKGIISPVQI----AGKEIVGGWDMYQLPMD-EMPDLT 520
Query: 577 KISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRY 636
K+ A++ + + Y+ TF + G +++ S GKG +VNG +IGRY
Sbjct: 521 KLK-ADTHKNVPSEVAKLKGCPVLYEGTFTL-DKVGDTFMDMESWGKGIVFVNGVNIGRY 578
Query: 637 WSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGG 696
W P QTLY +P W+ GEN +VI E+L
Sbjct: 579 WKV----------------------------GPQQTLY-VPGVWLKKGENKIVIFEQLNE 609
Query: 697 DPSKISLLTKT 707
P KT
Sbjct: 610 TPQTEVKTVKT 620
>gi|321461557|gb|EFX72588.1| hypothetical protein DAPPUDRAFT_58801 [Daphnia pulex]
Length = 648
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 104/328 (31%), Positives = 159/328 (48%), Gaps = 47/328 (14%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF- 69
+++GK + SG++HY R P W + +RK + G+ V+ETYV WN HEP + + F
Sbjct: 35 GFLLNGKPFRIFSGAVHYFRVHPAYWRDRLRKLRAAGITVVETYVAWNLHEPQKNVFDFG 94
Query: 70 EGR------FDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
+G DL F++T E LF+ LR GPY C+EW++GG P WL P + RT+
Sbjct: 95 KGNNDMSIFLDLKLFIQTAYEEDLFVILRPGPYICSEWDFGGLPSWLLRDPTMHVRTSYG 154
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQG-GPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
P+ + + ++L K+ +L+ +S G GPII QVENEYG+ + + Y++ +D
Sbjct: 155 PYVDRVDKYLEKLSNLVNHMQFTSSYGKGPIIAFQVENEYGSFGYQDHPRDKAYLQHLSD 214
Query: 183 TAVNLNTSVPWVMCQQEDAP---------DPIINTCNGFYCDGFTPN-------SPSKPI 226
+L + D+P ++ T N + G T P+ P+
Sbjct: 215 KMKSLGLK---ELFFTSDSPAGYLDWGSIPGVLQTAN--FQSGATQEFKMLQELQPNMPL 269
Query: 227 MWTENYSGWF----LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG 282
M TE +SGWF F + + E + F + ++YM+ GGTNFG G
Sbjct: 270 MVTEFWSGWFDHWTQDFRKGLKLKDFETSLMEILSFDAS----VSFYMFHGGTNFGFMNG 325
Query: 283 GPLVA----------TSYDYDAPIDEYG 300
+ TSYDYDAP+ E G
Sbjct: 326 ANVRKEYPGGYLPDITSYDYDAPLSEAG 353
>gi|375360076|ref|YP_005112848.1| putative exported beta-galactosidase [Bacteroides fragilis 638R]
gi|383119863|ref|ZP_09940600.1| hypothetical protein BSHG_4164 [Bacteroides sp. 3_2_5]
gi|251944025|gb|EES84544.1| hypothetical protein BSHG_4164 [Bacteroides sp. 3_2_5]
gi|301164757|emb|CBW24316.1| putative exported beta-galactosidase [Bacteroides fragilis 638R]
Length = 628
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 120/384 (31%), Positives = 177/384 (46%), Gaps = 47/384 (12%)
Query: 15 DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
+GK + SG +HY R + W ++ K GL + TYVFWN HEP G++ F G +
Sbjct: 37 NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 75 LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
L F+KT E G+ + LR GPY CAEW +GG+P WL + G++ R N F + K +
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAY-- 154
Query: 135 KIIDLMKQE--NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS- 190
ID + +E +L ++GGPI++ Q ENE+G+ V + E + + A L +
Sbjct: 155 --IDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAG 212
Query: 191 --VPWVMCQ-----QEDAPDPIINTCNG------------FYCDGFTPNSPSKPIMWTEN 231
VP + A + T NG Y DG P M E
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQYHDG------KGPYMVAEF 266
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA---- 287
Y GW + P +A ++ + +F N+YM GGTNFG T+G
Sbjct: 267 YPGWLSHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDI 325
Query: 288 ----TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY 343
TSYDYDAPI E G++ PK+ +R + IK +Y I P + ++ +
Sbjct: 326 QPDMTSYDYDAPISEAGWV-TPKYDSIRNV---IKKYVKYTIPEAPAPNPV-IEIPSIQL 380
Query: 344 HKSSNDCAAFLANYDSSSDANVTF 367
+K ++ A SSD +TF
Sbjct: 381 NKVADVLAFAEKQKPVSSDTPLTF 404
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 87/371 (23%), Positives = 143/371 (38%), Gaps = 57/371 (15%)
Query: 341 HIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP----AWSVSILPDCKNVVFNTAKVI 396
++ H +N ANYD D Y P W +NV+ K
Sbjct: 303 YMVHGGTNFGFTSGANYDKKRDIQPDMTSYDYDAPISEAGWVTPKYDSIRNVIKKYVKYT 362
Query: 397 SQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYL 456
+P + ++ +L + ++ E++ +S + P EQ+N Y+
Sbjct: 363 IPEAPAPNPVIEIPSI-QLNKVADVLAFAEKQKPVSSDT----PLTFEQLN---QGYGYV 414
Query: 457 WYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGI 516
YT + P G L I L A+V+V+ + V G N + + + ++ N
Sbjct: 415 LYTRHFN-QPISGT---LEIPGLRDYAVVYVDGEQV--GVLNRNTKTYSMEIEVPFNA-- 466
Query: 517 NTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLD 576
TL IL +G NYG+ G+ S + I +++ G +YQ+ ++ E L
Sbjct: 467 -TLQILVENMGRINYGSEIVHNTKGIISPVQI----AGKEIVGGWDMYQLPMD-EMPDLT 520
Query: 577 KISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRY 636
K+ A++ + + Y+ TF + G +++ S GKG +VNG +IGRY
Sbjct: 521 KLK-ADTHKNVPSEVAKLKGCPVLYEGTFTL-DKVGDTFMDMESWGKGIVFVNGVNIGRY 578
Query: 637 WSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGG 696
W P QTLY +P W+ GEN +VI E+L
Sbjct: 579 WKV----------------------------GPQQTLY-VPGVWLKKGENKIVIFEQLNE 609
Query: 697 DPSKISLLTKT 707
P KT
Sbjct: 610 TPQTEVKTVKT 620
>gi|324507659|gb|ADY43243.1| Beta-galactosidase [Ascaris suum]
Length = 655
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 104/340 (30%), Positives = 164/340 (48%), Gaps = 32/340 (9%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
S ++ + ++DG+ SGSIHY R P+ W + + + + GL I+ Y+ WN+HE
Sbjct: 31 SFSIDPQNNVFLLDGRSFRYISGSIHYFRVHPDQWNDRLSRMRAAGLNAIQFYIPWNFHE 90
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
G++ F+G ++ F++ + L+ +RIGPY CAEW GG P WL I+ RT+
Sbjct: 91 IYEGKHRFDGSRNITHFLQLAMQNELYALVRIGPYICAEWENGGAPWWLLKYKDIKMRTS 150
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
+ F + +KR+ ++ ++K GGPI++ Q+ENEYG+ + ++++ A
Sbjct: 151 DKRFLDAVKRWFDVLLPILKPN--LRKNGGPILMLQLENEYGSFDGGCDRNYTIFLRDLA 208
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCN---GFYCD-GFTPNS---------------P 222
+ V+ D D C G Y F P S P
Sbjct: 209 RRHFGDD-----VVLYTTDGGDDFYLKCGTIPGVYATVDFGPASSEAIDHCFASQRQYEP 263
Query: 223 SKPIMWTENYSGWFLSFGYAVP-FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTA 281
P++ +E Y GWFL++ +PV ++ FE G F NYYM+ GGTNF
Sbjct: 264 HGPLVNSEFYPGWFLTWSQKERGDQPVHNVINGSKYMFEKGANF-NYYMFHGGTNFAFWN 322
Query: 282 GGPL---VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK 318
GG + TSYDY AP+ E I K+ +R+ K I+
Sbjct: 323 GGATKTAITTSYDYFAPLSEAADITD-KYLAIRDWIKTIE 361
>gi|60683238|ref|YP_213382.1| beta-galactosidase [Bacteroides fragilis NCTC 9343]
gi|60494672|emb|CAH09473.1| putative exported beta-galactosidase [Bacteroides fragilis NCTC
9343]
Length = 628
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 120/384 (31%), Positives = 177/384 (46%), Gaps = 47/384 (12%)
Query: 15 DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
+GK + SG +HY R + W ++ K GL + TYVFWN HEP G++ F G +
Sbjct: 37 NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 75 LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
L F+KT E G+ + LR GPY CAEW +GG+P WL + G++ R N F + K +
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAY-- 154
Query: 135 KIIDLMKQE--NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS- 190
ID + +E +L ++GGPI++ Q ENE+G+ V + E + + A L +
Sbjct: 155 --IDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAG 212
Query: 191 --VPWVMCQ-----QEDAPDPIINTCNG------------FYCDGFTPNSPSKPIMWTEN 231
VP + A + T NG Y DG P M E
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQYHDG------KGPYMVAEF 266
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA---- 287
Y GW + P +A ++ + +F N+YM GGTNFG T+G
Sbjct: 267 YPGWLSHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDI 325
Query: 288 ----TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY 343
TSYDYDAPI E G++ PK+ +R + IK +Y I P + ++ +
Sbjct: 326 QPDMTSYDYDAPISEAGWV-TPKYDSIRNV---IKKYVKYTIPEAPAPNPV-IEIPSIQL 380
Query: 344 HKSSNDCAAFLANYDSSSDANVTF 367
+K ++ A SSD +TF
Sbjct: 381 NKVADVLAFAEKQKPVSSDTPLTF 404
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 87/371 (23%), Positives = 143/371 (38%), Gaps = 57/371 (15%)
Query: 341 HIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP----AWSVSILPDCKNVVFNTAKVI 396
++ H +N ANYD D Y P W +NV+ K
Sbjct: 303 YMVHGGTNFGFTSGANYDKKRDIQPDMTSYDYDAPISEAGWVTPKYDSIRNVIKKYVKYT 362
Query: 397 SQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYL 456
+P + ++ +L + ++ E++ +S + P EQ+N Y+
Sbjct: 363 IPEAPAPNPVIEIPSI-QLNKVADVLAFAEKQKPVSSDT----PLTFEQLN---QGYGYV 414
Query: 457 WYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGI 516
YT + P G L I L A+V+V+ + V G N + + + ++ N
Sbjct: 415 LYTRHFN-QPISGT---LEIPGLRDYAVVYVDGEQV--GVLNRNTKTYSMEIEVPFNA-- 466
Query: 517 NTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLD 576
TL IL +G NYG+ G+ S + I +++ G +YQ+ ++ E L
Sbjct: 467 -TLQILVENMGRINYGSEIVHNTKGIISPVQI----AGKEIVGGWDMYQLPMD-EMPDLT 520
Query: 577 KISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRY 636
K+ A++ + + Y+ TF + G +++ S GKG +VNG +IGRY
Sbjct: 521 KLK-ADTHKNVPSEVAKLKGCPVLYEGTFTL-DKVGDTFMDMESWGKGIVFVNGVNIGRY 578
Query: 637 WSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGG 696
W P QTLY +P W+ GEN +VI E+L
Sbjct: 579 WKV----------------------------GPQQTLY-VPGVWLKKGENKIVIFEQLNE 609
Query: 697 DPSKISLLTKT 707
P KT
Sbjct: 610 TPQTEVKTVKT 620
>gi|333384209|ref|ZP_08475850.1| hypothetical protein HMPREF9455_04016 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826788|gb|EGJ99602.1| hypothetical protein HMPREF9455_04016 [Dysgonomonas gadei ATCC
BAA-286]
Length = 632
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 110/351 (31%), Positives = 161/351 (45%), Gaps = 48/351 (13%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
V DGK + SG +HYPR + W ++ K GL + TYVFWN HEP G++ F
Sbjct: 37 FVYDGKPVRIISGEMHYPRIPHQYWRHRMQMLKAMGLNAVATYVFWNAHEPEPGKWDFTE 96
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
+L ++K E GL + LR GPY CAEW +GG+P WL + ++ R N E+ +
Sbjct: 97 DKNLAEYIKIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNVEEMELRRDN----EQFLK 152
Query: 132 FLAKIIDLMKQE--NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLN 188
+ I+ + QE NL ++GGPII+ Q ENE+G+ V + E + ++ A L
Sbjct: 153 YTQLYINRLYQEVGNLQITKGGPIIMVQAENEFGSYVSQRKDIPLEEHRRYNAKIVQQLK 212
Query: 189 T-------------------SVPWVM--CQQEDAPDPIINTCNGFYCDGFTPNSPSKPIM 227
T +VP + E D + N + N P M
Sbjct: 213 TAGFDIPSFTSDGSWLFEGGAVPGALPTANGESNIDNLKKVVNRY-------NGGQGPYM 265
Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA 287
E Y GW + P +A ++ + + NYYM GGTNFG T+G
Sbjct: 266 VAEFYPGWLAHWVEPHPQVSATSVARQTEKYLQNDVSI-NYYMVHGGTNFGFTSGANYDK 324
Query: 288 --------TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPT 330
TSYDYDAP+ E G++ PK+ LR + I+ +Y + P+
Sbjct: 325 KHDIQPDLTSYDYDAPVSEAGWV-TPKFDSLRNV---IQKYVDYTLPEAPS 371
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 66/243 (27%), Positives = 105/243 (43%), Gaps = 54/243 (22%)
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
L I+ L A V+ N + G N F + ++ + N +TL+IL +G NYG+
Sbjct: 429 LEIKGLRDYATVYTNDEKA--GELNRYFNKYTMDIDVPFN---STLEILVENMGRINYGS 483
Query: 534 WFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEG--EYIGLDKISL--ANSSFWKQG 589
G+ S + I+ ++ G + + ++ ++ +D+ S+ N S K
Sbjct: 484 EIIHNTKGIISPVRIN----DMEIEGGWQMISIPMDKAPDFSKMDQASVYDNNESAIKSL 539
Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
+ PV YK TF E G +N+ GKG ++NG++IGRYW
Sbjct: 540 AGKPV-----LYKGTFNLTE-TGDTFINMEDWGKGIIFINGKNIGRYW------------ 581
Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDP------SKISL 703
Y G P QTLY IP W+ GEN ++I E+L P +K+ +
Sbjct: 582 ---YVG-------------PQQTLY-IPGVWLKKGENKIIIFEQLNDKPHTEVRTTKVPV 624
Query: 704 LTK 706
L K
Sbjct: 625 LAK 627
>gi|299147339|ref|ZP_07040404.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
gi|298514617|gb|EFI38501.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
Length = 778
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 107/323 (33%), Positives = 155/323 (47%), Gaps = 27/323 (8%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
++DGK V+++ +HY R W I K G+ I Y+FWN HE G++ F
Sbjct: 35 TFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFS 94
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL I RT + + E +
Sbjct: 95 GQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIALRTLDPYYMERVG 154
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
F+ ++ + L ++GG II+ QVENEYG +YG+ + YV D S
Sbjct: 155 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGI-DKPYVSAVRDLVRESGFS 207
Query: 191 -VPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
VP C +A D +I T N G D P P+M +E +SGWF
Sbjct: 208 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
+G RP +D+ + + +F + YM GGT FG G + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326
Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
DAPI E G+ K+ LR+L K
Sbjct: 327 DAPISEPGWTTD-KFFLLRDLLK 348
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 53/128 (41%), Gaps = 32/128 (25%)
Query: 579 SLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
S N + LP + +YK+TF + G L++++ GKG WVNG ++GR+W
Sbjct: 513 SFINDKKYSDTKILPTMPA--YYKSTFTL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWE 569
Query: 639 AYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDP 698
P QTL+ +P W+ GEN +++ + G
Sbjct: 570 I----------------------------GPQQTLF-MPGCWLKEGENEILVLDLKGPTR 600
Query: 699 SKISLLTK 706
+ I L K
Sbjct: 601 ASIKGLKK 608
>gi|427399434|ref|ZP_18890672.1| hypothetical protein HMPREF9710_00268 [Massilia timonae CCUG 45783]
gi|425721626|gb|EKU84536.1| hypothetical protein HMPREF9710_00268 [Massilia timonae CCUG 45783]
Length = 786
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 105/308 (34%), Positives = 154/308 (50%), Gaps = 28/308 (9%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DGK ++ G +H+ R E W ++ K GL + Y+FWNYHE G++ + G
Sbjct: 42 FLLDGKPLQIRCGEMHFSRVPREYWTHRLKTIKAMGLNSVCAYLFWNYHEWREGRFDWAG 101
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQF-RTTNNPFKEEMK 130
+ D F + Q+ GL++ LR GPYACAEW GG P WL PG F R+T+ F +
Sbjct: 102 QRDAAEFCRLAQQEGLWVILRPGPYACAEWEMGGLPWWLLKQPGDAFLRSTSEAFLAPSR 161
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
R+L ++ ++ + + ++GGPI++ QVENEYG YG + Y++ ++
Sbjct: 162 RWLREVGRVLGPQQV--TRGGPILMVQVENEYG----FYGEDLD-YMRALRQAVLDAGFD 214
Query: 191 VPWVMCQQEDAPD----PIINTCNGFYCDGFTPNSPSK--------PIMWTENYSGWFLS 238
VP C +A P + + F G P + K P+M E YSGWF +
Sbjct: 215 VPLFQCNPTNAVAKTHIPELYSVANF---GSNPEAGFKALAEVQQGPLMCGEYYSGWFDT 271
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG--GPLV--ATSYDYDA 294
+G VE+ + G+F + YM GGT FG G P TSYDYDA
Sbjct: 272 WGAPHRRGGVENAVADIRTMLAANGSF-SLYMAHGGTTFGLWGGCDRPFRPDTTSYDYDA 330
Query: 295 PIDEYGFI 302
PI E G+I
Sbjct: 331 PISEAGWI 338
>gi|312866933|ref|ZP_07727144.1| putative beta-galactosidase [Streptococcus parasanguinis F0405]
gi|311097415|gb|EFQ55648.1| putative beta-galactosidase [Streptococcus parasanguinis F0405]
Length = 595
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 106/318 (33%), Positives = 152/318 (47%), Gaps = 41/318 (12%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
A + G+ + SG+IHY R P W + K G +ETY+ WN HEP +GQ+ F
Sbjct: 9 AFYLKGQPFKILSGAIHYFRIDPADWYHSLYNLKALGFNTVETYIPWNAHEPRKGQFDFS 68
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
GR DL RF++T Q GL++ +R P+ CAEW +GG P WL ++ R+++ F E +
Sbjct: 69 GRLDLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWL-LEEDLRIRSSDPAFIEAVD 127
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
R+ +++ L+ + +GGPI++ QVENEYG +YG + Y++ D +
Sbjct: 128 RYYDRLLGLLTPYQV--DRGGPILMMQVENEYG----SYGEDKD-YLRAIRDLMKEKGVT 180
Query: 191 VPWVMCQQEDAP------------DPIINTCN---------GFYCDGFTPNSPSKPIMWT 229
P D P + + T N G + F P+M
Sbjct: 181 CPLFTS---DGPWRATLRAGTLIEEDLFVTGNFGSKAAYNFGQMKEFFDEYGKRWPLMCM 237
Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL 285
E + GWF + V R E+LA AV E G N YM+ GGTNFG G G L
Sbjct: 238 EFWDGWFTRWKEPVIQRDPEELAEAVHEVLELGSI--NLYMFHGGTNFGFMNGCSARGTL 295
Query: 286 ---VATSYDYDAPIDEYG 300
TSYDY A ++E G
Sbjct: 296 DLPQVTSYDYGALLNEQG 313
>gi|384512509|ref|YP_005707602.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|430358961|ref|ZP_19425649.1| beta-galactosidase [Enterococcus faecalis OG1X]
gi|327534398|gb|AEA93232.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|429513519|gb|ELA03099.1| beta-galactosidase [Enterococcus faecalis OG1X]
Length = 592
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 10 FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++ FV+ ++ L + LR Y CAEW +GG P WL G++ R+T+ F +++
Sbjct: 70 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKKKGVRLRSTDPIFMTKVRN 129
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ QVENEYG +YG+ + Y++ L V
Sbjct: 130 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 182
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
P + + A + +++ D F T + P+M E
Sbjct: 183 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 240
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
+ GWF +G V R DLA V G N YM+ GGTNFG R A
Sbjct: 241 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 298
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
TSYDYDA + E G + + + KAIK +C E + + P +KLG
Sbjct: 299 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 345
>gi|53715303|ref|YP_101295.1| beta-galactosidase [Bacteroides fragilis YCH46]
gi|52218168|dbj|BAD50761.1| beta-galactosidase precursor [Bacteroides fragilis YCH46]
Length = 628
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 120/384 (31%), Positives = 177/384 (46%), Gaps = 47/384 (12%)
Query: 15 DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
+GK + SG +HY R + W ++ K GL + TYVFWN HEP G++ F G +
Sbjct: 37 NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 75 LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
L F+KT E G+ + LR GPY CAEW +GG+P WL + G++ R N F + K +
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAY-- 154
Query: 135 KIIDLMKQE--NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNT-- 189
ID + +E +L ++GGPI++ Q ENE+G+ V + E + + A L
Sbjct: 155 --IDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADVG 212
Query: 190 -SVPWVMCQ-----QEDAPDPIINTCNG------------FYCDGFTPNSPSKPIMWTEN 231
+VP + A + T NG Y DG P M E
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQYHDG------KGPYMVAEF 266
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA---- 287
Y GW + P +A ++ + +F N+YM GGTNFG T+G
Sbjct: 267 YPGWLSHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDI 325
Query: 288 ----TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY 343
TSYDYDAPI E G++ PK+ +R + IK +Y I P + ++ +
Sbjct: 326 QPDMTSYDYDAPISEAGWV-TPKYDSIRNV---IKKYVKYTIPEAPAPNPV-IEIPSIQL 380
Query: 344 HKSSNDCAAFLANYDSSSDANVTF 367
+K ++ A SSD +TF
Sbjct: 381 NKVADVLAFAEKQKPVSSDTPLTF 404
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 87/371 (23%), Positives = 143/371 (38%), Gaps = 57/371 (15%)
Query: 341 HIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP----AWSVSILPDCKNVVFNTAKVI 396
++ H +N ANYD D Y P W +NV+ K
Sbjct: 303 YMVHGGTNFGFTSGANYDKKRDIQPDMTSYDYDAPISEAGWVTPKYDSIRNVIKKYVKYT 362
Query: 397 SQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYL 456
+P + ++ +L + ++ E++ +S + P EQ+N Y+
Sbjct: 363 IPEAPAPNPVIEIPSI-QLNKVADVLAFAEKQKPVSSDT----PLTFEQLN---QGYGYV 414
Query: 457 WYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGI 516
YT + P G L I L A+V+V+ + V G N + + + ++ N
Sbjct: 415 LYTRHFN-QPISGT---LEIPGLRDYAVVYVDGEQV--GVLNRNTKTYSMEIEVPFNA-- 466
Query: 517 NTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLD 576
TL IL +G NYG+ G+ S + I +++ G +YQ+ ++ E L
Sbjct: 467 -TLQILVENMGRINYGSEIVHNTKGIISPVQI----AGKEIVGGWDMYQLPMD-EMPDLT 520
Query: 577 KISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRY 636
K+ A++ + + Y+ TF + G +++ S GKG +VNG +IGRY
Sbjct: 521 KLK-ADTHKNVPSEVAKLKGCPVLYEGTFTL-DKVGDTFMDMESWGKGIVFVNGVNIGRY 578
Query: 637 WSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGG 696
W P QTLY +P W+ GEN +VI E+L
Sbjct: 579 WKV----------------------------GPQQTLY-VPGVWLKKGENKIVIFEQLNE 609
Query: 697 DPSKISLLTKT 707
P KT
Sbjct: 610 TPQTEVKTVKT 620
>gi|414156558|ref|ZP_11412859.1| hypothetical protein HMPREF9186_01279 [Streptococcus sp. F0442]
gi|410869551|gb|EKS17511.1| hypothetical protein HMPREF9186_01279 [Streptococcus sp. F0442]
Length = 595
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 109/315 (34%), Positives = 152/315 (48%), Gaps = 41/315 (13%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+ G+ + SG+IHY R P W + K G +ETYV WN HEP +GQ+ F GR
Sbjct: 12 LKGQPFKILSGAIHYFRIDPTDWYHSLYNLKALGFNTVETYVPWNAHEPKKGQFDFSGRL 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL RF++T Q GL++ +R P+ CAEW +GG P WL ++ R+++ F E + R+
Sbjct: 72 DLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWL-LEEDLRIRSSDPAFIEAIDRYY 130
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
+++ L+ + +GGPI++ QVENEYG +YG + Y++ D + P
Sbjct: 131 DRLLGLLTPYQV--DRGGPILMMQVENEYG----SYGEDKD-YLRAIRDLMKEKGVTCPL 183
Query: 194 VMCQQEDAP-DPIINTCNGFYCDGF-TPNSPSK-------------------PIMWTENY 232
D P + T D F T N SK P+M E +
Sbjct: 184 FTS---DGPWRATLRTGTLIEEDLFVTGNFGSKAAYNFGQMKEFFNEYGKKWPLMCMEFW 240
Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL--- 285
GWF + V R E+LA AV E G N YM+ GGTNFG G G L
Sbjct: 241 DGWFTRWKEPVIQRDPEELAEAVHEVLELGSI--NLYMFHGGTNFGFMNGCSARGTLDLP 298
Query: 286 VATSYDYDAPIDEYG 300
TSYDY A ++E G
Sbjct: 299 QVTSYDYGALLNEQG 313
>gi|392950288|ref|ZP_10315845.1| Beta-galactosidase 3 [Lactobacillus pentosus KCA1]
gi|392434570|gb|EIW12537.1| Beta-galactosidase 3 [Lactobacillus pentosus KCA1]
Length = 588
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 98/318 (30%), Positives = 148/318 (46%), Gaps = 37/318 (11%)
Query: 10 RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
+ +++G+ + SG++HY R P W + + K K GL +ETY+ WN HEP GQ+ F
Sbjct: 10 KEFLLNGQPFKIYSGAVHYFRIAPSEWRDTLEKLKAAGLNTVETYIPWNVHEPQEGQFVF 69
Query: 70 EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
E R+D+ +FVK Q GL++ LR PY CAEW +GG P WL P + R+ F E++
Sbjct: 70 EDRYDIGKFVKLAQSIGLYVILRPSPYICAEWEFGGLPAWLLRYPDMVVRSNTPRFMEKV 129
Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
+ + ++ L + GGP+++ QVENEYG ++G + Y++
Sbjct: 130 ANYYEALFKVLVP--LQITHGGPVLMMQVENEYG----SFG-NDKAYLRHVKSLMETNGV 182
Query: 190 SVPWVMC----QQEDAPDPIINTCNGFYCDGFTPNSPSK---------------PIMWTE 230
VP QQ +I + F F S P+M E
Sbjct: 183 DVPLFTADGSWQQALKAGSLIED-DVFVTANFGSKSRENLAELRQFMLMHHKNWPLMCME 241
Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG--------RTAG 282
+ GWF + + R + +A + +F N YM+ GGTNFG +
Sbjct: 242 FWDGWFNRWQEEIVTRSADSFQTDLAELVKEQASF-NLYMFRGGTNFGFFNGCSSRQNVD 300
Query: 283 GPLVATSYDYDAPIDEYG 300
P + TSYDYDA + E G
Sbjct: 301 YPQI-TSYDYDAVLHEDG 317
>gi|322703307|gb|EFY94918.1| beta-calactosidase, putative [Metarhizium anisopliae ARSEF 23]
Length = 645
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 100/323 (30%), Positives = 150/323 (46%), Gaps = 31/323 (9%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
N TYD ++DG L G + R P W + ++ +K GL I +YVFWN EP
Sbjct: 32 GNFTYDRHNFLLDGVPIQLIGGQMDPQRIPPAYWTQRLQMAKAMGLNTIFSYVFWNNIEP 91
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G + F+GR D+ RF++ Q+ GL++ LR GPY C E +GGFP WL IPG+ R N
Sbjct: 92 TEGSWDFDGRNDIARFLRLAQQEGLYVVLRPGPYICGEHEWGGFPSWLAQIPGMAVRQNN 151
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAY----------G 170
PF + + +L ++ + ++ SQGGP+++ Q+ENEYG+ + AY
Sbjct: 152 KPFLDASRNYLEQLGKHLAATHI--SQGGPVLMTQLENEYGSFGKDKAYLRAMADMLKAN 209
Query: 171 VGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTE 230
G LY + S+ ++ + + P + + D + P + E
Sbjct: 210 FDGFLYTNDGGGKSYLDGGSLHGILAETDGDPKTGFAARDQYVTD----PTMLGPQLDGE 265
Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFE------TGGTFQNYYMYFGGTNFGRTAGG- 283
Y W + P++ A R + G + YM+ GGTN+G GG
Sbjct: 266 YYVTWIDDWSSNSPYQYTSGRPDATKRVLDDLDWILAGNNSFSIYMFHGGTNWGFENGGI 325
Query: 284 ------PLVATSYDYDAPIDEYG 300
V TSYDY AP+DE G
Sbjct: 326 WVDNRLNAVTTSYDYGAPLDESG 348
>gi|334330512|ref|XP_001374407.2| PREDICTED: beta-galactosidase-1-like protein 2 [Monodelphis
domestica]
Length = 673
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 109/320 (34%), Positives = 158/320 (49%), Gaps = 31/320 (9%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G R + GSIHY R E W + + K K GL + TY+ WN HEP RG++ F G
Sbjct: 90 FLLEGSRFRIFGGSIHYFRVPREYWKDRLLKLKACGLNTLTTYIPWNLHEPERGKFNFSG 149
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ FV+ + GL++ LR GPY C+EW+ GG P WL ++ RTT F + +
Sbjct: 150 NLDVEAFVQMAADIGLWVILRPGPYICSEWDLGGLPSWLLQDSSMELRTTYVGFIKAVDL 209
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGN----------VEWAYGVGGELYVKWAA 181
+ ++I + L +QGGPII QVENEYG+ ++ A G + + +
Sbjct: 210 YFNQLIP--RVVPLQYTQGGPIIAVQVENEYGSYDKDPNYMPYIKMALLKRGIVELLMTS 267
Query: 182 DTAVNLNTS-VPWVMC--QQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLS 238
D L+ V V+ ++ I N Y F N KP M TE ++GWF +
Sbjct: 268 DNKDGLSGGYVEGVLATINLKNVDSIIFN-----YLQSFQDN---KPTMVTEFWTGWFDT 319
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------TSYDY 292
+G +D+ +V+ + G + N YM+ GGTNFG G TSYDY
Sbjct: 320 WGGPHHIVDADDVMVSVSSIIQMGASL-NLYMFHGGTNFGFMNGAQHFTDYQADVTSYDY 378
Query: 293 DAPIDEYGFIRQPKWGHLRE 312
DA + E G PK+ LRE
Sbjct: 379 DAILTEAG-DYTPKFFKLRE 397
>gi|335430223|ref|ZP_08557118.1| beta-galactosidase Bga35A [Haloplasma contractile SSD-17B]
gi|334888639|gb|EGM26936.1| beta-galactosidase Bga35A [Haloplasma contractile SSD-17B]
Length = 587
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 101/299 (33%), Positives = 139/299 (46%), Gaps = 30/299 (10%)
Query: 23 SGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTV 82
+G +HY R+ + W + + K K G +ETYV WN HE +G Y F G D+ F++
Sbjct: 22 AGGMHYFRTMKDSWKDRLIKLKAMGCNTVETYVPWNMHEAKKGVYAFNGNLDIKAFIELA 81
Query: 83 QEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQ 142
Q LF+ +R PY CAEW +GG P WL PG++ RT PF + +K + + ++
Sbjct: 82 QSLELFVIVRPSPYICAEWEFGGLPAWLLKDPGMKVRTVYKPFMKHVKEYFEVLFKILAP 141
Query: 143 ENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQ----- 197
L Q GPIIL Q+ENEYG YG E Y+ + T+VP V
Sbjct: 142 --LQIDQDGPIILMQIENEYG----YYGNDKE-YLSTLLKIMRDFGTTVPVVTSDGPWGE 194
Query: 198 -------QEDAPDPIINTCNGF--YCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF-RP 247
D P +N G + + F +KP+M E + GWF ++G R
Sbjct: 195 ALDAGSLLADVSLPTMNFGTGAKEHIENFKEKYVNKPVMCMEFWVGWFDAWGDDRHHTRD 254
Query: 248 VEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV------ATSYDYDAPIDEYG 300
D A + G N YM+ GGTNFG G + TSYDYDA + E G
Sbjct: 255 ASDAANELRDILNEGSV--NIYMFHGGTNFGFMNGANDLEELKPDVTSYDYDAILTECG 311
>gi|325567414|ref|ZP_08144081.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
gi|325158847|gb|EGC70993.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
Length = 591
Score = 150 bits (378), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 120/354 (33%), Positives = 171/354 (48%), Gaps = 47/354 (13%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DGK L SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 10 FLLDGKPIKLISGAIHYFRMTPAQWTDSLYNLKALGANTVETYIPWNLHEPREGVYDFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ FVK Q GL + LR Y CAEW +GG P WL P ++ R+T+ F +++
Sbjct: 70 MKDICAFVKQAQTLGLMVILRPSVYICAEWEFGGLPAWLLNEP-MRLRSTDPRFMAKVRN 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L + GGP+I+ QVENEYG +YG+ + Y++ + V
Sbjct: 129 YFQVL--LPKLVPLQITHGGPVIMMQVENEYG----SYGM-EKAYLRQTKELMEEYGIDV 181
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF-TPNSPSK-------------------PIMWTEN 231
P + + A + +++ D F T N S+ PIM E
Sbjct: 182 P--LFTSDGAWEEVLDAGTLIEDDIFVTGNFGSRSKENAAVMKEFMAKHGKNWPIMCMEY 239
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR----TAGGPL-- 285
+ GWF +G + R +DLA V G N YM+ GGTNFG +A G L
Sbjct: 240 WDGWFNRWGEPIIKRDGQDLANEVKEMLAVGSL--NLYMFHGGTNFGFYNGCSARGALDL 297
Query: 286 -VATSYDYDAPIDEYGFIRQP--KWGHLRELHKAIKLCEEYLISSDPTHQKLGA 336
+SYDYDA + E G +P K+ H++ KAIK + + P ++L A
Sbjct: 298 PQVSSYDYDALLTEAG---EPTDKYYHVQ---KAIKEACPEVWQAKPRTKQLAA 345
>gi|417918764|ref|ZP_12562312.1| glycosyl hydrolase family 35 [Streptococcus parasanguinis SK236]
gi|342827747|gb|EGU62128.1| glycosyl hydrolase family 35 [Streptococcus parasanguinis SK236]
Length = 595
Score = 150 bits (378), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 107/318 (33%), Positives = 152/318 (47%), Gaps = 41/318 (12%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
A + G+ + SG+IHY R P W + K G +ETYV WN HEP +GQ+ F
Sbjct: 9 AFYLKGQPFKILSGAIHYFRIDPADWYHSLYNLKALGFNTVETYVPWNAHEPRKGQFDFS 68
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
GR DL RF++T Q GL++ +R P+ CAEW +GG P WL ++ R+++ F E +
Sbjct: 69 GRLDLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWL-LEEDLRIRSSDPVFIEAVD 127
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
R+ +++ L+ + +GGPI++ QVENEYG +YG + Y++ D +
Sbjct: 128 RYYDRLLGLLTPYQV--DRGGPILMMQVENEYG----SYGEDKD-YLRAIRDLMKEKGVT 180
Query: 191 VPWVMCQQEDAP------------DPIINTCN---------GFYCDGFTPNSPSKPIMWT 229
P D P + + T N G + F P+M
Sbjct: 181 CPLFTS---DGPWRATLRAGTLIEEDLFVTGNFGSKATYNFGQMKEFFDEYGKRWPLMCM 237
Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL 285
E + GWF + V R E+LA AV E G N YM+ GGTNFG G G L
Sbjct: 238 EFWDGWFTRWKEPVIQRDPEELAEAVHEVLELGSI--NLYMFHGGTNFGFMNGCSARGTL 295
Query: 286 ---VATSYDYDAPIDEYG 300
TSYDY A ++E G
Sbjct: 296 DLPQVTSYDYGALLNEQG 313
>gi|195388836|ref|XP_002053084.1| GJ23531 [Drosophila virilis]
gi|194151170|gb|EDW66604.1| GJ23531 [Drosophila virilis]
Length = 640
Score = 150 bits (378), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 112/331 (33%), Positives = 154/331 (46%), Gaps = 30/331 (9%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
V Y++ + DG SGS HY R+ P+ W +R + GL + TYV W+ H P
Sbjct: 28 VDYENDRFLKDGLPFRFISGSFHYFRAHPDTWSRHLRTMRAAGLNAVTTYVEWSLHNPRD 87
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRTTNN 123
G Y + G DL RF++ + L + LR GPY CAE + GGFP W L+ PGIQ RT +
Sbjct: 88 GVYVWNGIADLERFIRLAVDEDLLVILRPGPYICAERDMGGFPYWLLNKFPGIQLRTADI 147
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKW---- 179
+ E++ + A++ + + GGPII+ QVENEYG +Y Y W
Sbjct: 148 NYLSEVRIWYAQL--MTRIVPYLYGNGGPIIMVQVENEYG----SYFACDVNYRNWLRDE 201
Query: 180 ----AADTAVNLNTSVPWVM----CQQEDAPDPIINTCN-GFYCDGFTPNSPSKPIMWTE 230
D AV P V+ Q A T N P P++ E
Sbjct: 202 TQSHVKDNAVLFTNDGPTVLRCGKIQNVLATMDFGATTNLKDIWSKLRRYEPKGPLVNAE 261
Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG------GP 284
Y GW + + E + E+G + N+YM++GGTNFG TAG G
Sbjct: 262 YYPGWLTHWTEPMANVSTEAITGTFIDMLESGASV-NFYMFYGGTNFGFTAGANDNGPGN 320
Query: 285 LVA--TSYDYDAPIDEYGFIRQPKWGHLREL 313
+A TSYDYDAP+ E G PK+ LR +
Sbjct: 321 YIADITSYDYDAPMTEAG-DPTPKYMALRHI 350
>gi|354581347|ref|ZP_09000251.1| Beta-galactosidase [Paenibacillus lactis 154]
gi|353201675|gb|EHB67128.1| Beta-galactosidase [Paenibacillus lactis 154]
Length = 587
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 106/335 (31%), Positives = 165/335 (49%), Gaps = 37/335 (11%)
Query: 23 SGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTV 82
SG+IHY R PE W + + K K GL +ETY+ WN+HEP G++ F G D+ F+
Sbjct: 22 SGAIHYFRVVPEYWEDRLLKLKACGLNTVETYIPWNWHEPDEGRFNFSGMADIEAFITLA 81
Query: 83 QEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQ 142
+ GL + +R PY CAEW +GG P WL P +Q R + F +++ + ++I +
Sbjct: 82 GKLGLHVIVRPSPYICAEWEFGGLPAWLLQDPHMQLRCLDPKFLKKVDAYYDELIPRLVP 141
Query: 143 ENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAP 202
L ++ GGPII Q+ENEYG +YG Y+++ + + V ++ +
Sbjct: 142 --LLSTNGGPIIAVQIENEYG----SYG-NDTAYLQYLQEALIARGVDV--LLFTSDGPT 192
Query: 203 DPIIN--TCNGFYCDGFTPNSPSK------------PIMWTENYSGWFLSFGYAVPFRPV 248
D ++ T G + PS+ P+M E ++GWF + R
Sbjct: 193 DGMLQGGTVPGVTATVNFGSRPSEAFAKLREYRSEDPLMCMEYWNGWFDHWMKPHHTRDS 252
Query: 249 EDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PLVATSYDYDAPIDEYGF 301
ED A A G + N+YM+ GGTNFG G P + TSYDYDAP+ E G
Sbjct: 253 EDAASVFAEMLALGASV-NFYMFHGGTNFGFYNGANYHDKYEPTI-TSYDYDAPLSECGD 310
Query: 302 IRQPKWGHLREL---HKAIKLCEEYLISSDPTHQK 333
+ K+ +R++ H+ ++L + + DP +K
Sbjct: 311 VTT-KYEAVRQVIAKHQGVELGDLPAL-PDPVRKK 343
>gi|195116355|ref|XP_002002721.1| GI11295 [Drosophila mojavensis]
gi|193913296|gb|EDW12163.1| GI11295 [Drosophila mojavensis]
Length = 678
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 106/336 (31%), Positives = 160/336 (47%), Gaps = 44/336 (13%)
Query: 2 SANVTYDHRA--LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNY 59
+ + + DH+A +++GK +GS HY R+ PE W +R + GL ++TYV W+
Sbjct: 49 TTSFSIDHQANTFLLNGKPFRYVAGSFHYFRALPEAWRNRLRTMRAAGLNALDTYVEWSL 108
Query: 60 HEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHF-IPGIQF 118
H P G+Y +EG DLV+F++ QE ++ LR GPY CAE + GG P WL P I+
Sbjct: 109 HNPHDGEYNWEGIADLVKFLEIAQEEDFYIVLRPGPYICAERDNGGLPHWLFTKYPDIKV 168
Query: 119 RTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVK 178
RT + + E+ ++ A+++ +K +L GG II+ QVENEY AY Y+
Sbjct: 169 RTNDPRYIAEVSKWYAELMPRLK--HLLIGNGGKIIMVQVENEYA----AYYACDHDYLN 222
Query: 179 WAAD--------TAVNLNTSVP--WVMCQQEDAPDPIINTCNGFYCDG----------FT 218
W D A+ +P + C + D + F D
Sbjct: 223 WLRDETDKYVENKALLFTVDIPNERMHCGKIDN----VFATTDFGIDRIHEIDQIWKYLR 278
Query: 219 PNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG 278
P+ P++ +E Y GW + R +++A A+ + N YM+FGGTNFG
Sbjct: 279 SVQPTGPLVNSEFYPGWLTHWQEMNQRRDPQEVASALKTILSYNASV-NLYMFFGGTNFG 337
Query: 279 RTAGG----------PLVATSYDYDAPIDEYGFIRQ 304
TAG TSYDYDA +DE G + +
Sbjct: 338 FTAGANYDLDGSIGYTADITSYDYDAVMDEAGGVTK 373
Score = 40.0 bits (92), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 54/181 (29%), Positives = 79/181 (43%), Gaps = 36/181 (19%)
Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGI-NTLDILSMMVGLQN 530
L +E L A VFV+++LV G + + + + L++G +TL +L G N
Sbjct: 459 TLLQVEDLRDRAHVFVDQQLV--GTLSREARIY----ALPLSKGWGSTLQLLVENQGRIN 512
Query: 531 YGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
Y D G +F + + L NG L +W L+ I++ N W+Q
Sbjct: 513 YDRANDTKG--IFGKVTLQLHNGGA-LPLEDWTTTA------YPLEAITIEN---WRQ-- 558
Query: 591 TLPVNKSL--------------IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRY 636
LP N +L I Y +F E G LN A GKG A+VNG ++GRY
Sbjct: 559 KLPENAALDSSIAKQRLLRSGPILYTGSFQVSE-VGDTYLNPAGWGKGVAYVNGFNLGRY 617
Query: 637 W 637
W
Sbjct: 618 W 618
>gi|423217397|ref|ZP_17203893.1| hypothetical protein HMPREF1061_00666 [Bacteroides caccae
CL03T12C61]
gi|392628556|gb|EIY22582.1| hypothetical protein HMPREF1061_00666 [Bacteroides caccae
CL03T12C61]
Length = 775
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 113/330 (34%), Positives = 160/330 (48%), Gaps = 34/330 (10%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
V ++ I+GK L G +HYPR E W + + ++ GL + YVFWN+HE
Sbjct: 29 QVKIENGTFNINGKDVQLICGEMHYPRIPHEYWRDRLHRAHAMGLNTVSAYVFWNFHERQ 88
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
G + F G+ D+ FV+ QE GL++ LR GPY CAEW++GG+P WL + +R+ +
Sbjct: 89 PGVFDFSGQADIAEFVRIAQEEGLYVILRPGPYVCAEWDFGGYPSWLLKEKDLTYRSKDP 148
Query: 124 PFKEEMKRFLAKIIDLMKQ-ENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
F +R+ I +L KQ L + GG II+ QVENEYG +Y E Y+ D
Sbjct: 149 RFMSYCERY---IKELGKQLAPLTINNGGNIIMVQVENEYG----SYAADKE-YLAAIRD 200
Query: 183 TAVNLNTSVPWVMCQ---QEDAPD--PIINTCNGFYCDGF----TPNSPSKPIMWTENYS 233
+VP C Q +A + T NG + + P P E Y
Sbjct: 201 MLQEAGFNVPLFTCDGGGQVEAGHIAGALPTLNGVFGEDIFKIVDKYHPGGPYFVAEFYP 260
Query: 234 GWFLSFGY---AVPF-RPVEDLAFAVARFFETGGTFQNYYMYFGGTNF-----GRTAGG- 283
WF +G +V + RP E L + + G + YM+ GGTNF T+GG
Sbjct: 261 AWFDEWGKRHSSVAYERPAEQLDWMLGH-----GVSVSMYMFHGGTNFWYMNGANTSGGF 315
Query: 284 PLVATSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDAP+ E+G PK+ RE+
Sbjct: 316 RPQPTSYDYDAPLGEWGNCY-PKYHAFREI 344
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 68/277 (24%), Positives = 118/277 (42%), Gaps = 58/277 (20%)
Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSD---YLWYTASIHVMPGQGKEVFLN 475
++ F+ E K +F + +E + + +D Y+ Y +I PG+ K L
Sbjct: 364 TTTFATVELKESAPLTTAFHQTIQSEDVLSMEDVGTDFGYIHYQTTIKT-PGKQK---LI 419
Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
I+ L A++ V+ K VA + D + +++++ TL+IL G NYG
Sbjct: 420 IQDLRDYAVILVDGKQVA----SLDRRYNQNSTTLDIHKVPATLEILVENTGRVNYGPDI 475
Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
G+ S +L G L+ G + L K +++ SF ++ +P
Sbjct: 476 LFNRKGITSQVLW----GNEKLT--------GWSITPLPLYKEEVSSLSFGQEIKGVPA- 522
Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
+++ TF+ E +G ++++ GKG WVNG+S+GR+W+
Sbjct: 523 ----FHRGTFII-EQQGDCFVDMSQWGKGAVWVNGKSLGRFWNI---------------- 561
Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHE 692
P QTLY IP W+ GEN +V+ E
Sbjct: 562 ------------GPQQTLY-IPAPWLKKGENEIVVFE 585
>gi|424760912|ref|ZP_18188500.1| putative beta-galactosidase [Enterococcus faecalis R508]
gi|402402633|gb|EJV35336.1| putative beta-galactosidase [Enterococcus faecalis R508]
Length = 593
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++ FV+ ++ L + LR Y CAEW +GG P WL G++ R+T+ F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ QVENEYG +YG+ + Y++ L V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
P + + A + +++ D F T + P+M E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
+ GWF +G V R DLA V G N YM+ GGTNFG R A
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
TSYDYDA + E G + + + KAIK +C E + + P +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346
>gi|354466872|ref|XP_003495895.1| PREDICTED: beta-galactosidase-1-like protein 3-like [Cricetulus
griseus]
Length = 761
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 107/317 (33%), Positives = 154/317 (48%), Gaps = 23/317 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+DG + ++ GSIHY R E W + + K + G + TY+ WN HE RG + F
Sbjct: 186 FTLDGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQNRGTFDFSE 245
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL +V GL++ LR GPY CAE + GG P WL P +Q RTT F + + +
Sbjct: 246 ILDLEAYVSLAATLGLWVILRPGPYICAEVDLGGLPSWLLGYPELQLRTTQQEFLDAVDK 305
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGEL--YVKWAADTA--VNL 187
+ +I + +GGP+I Q+ENEYG ++ G+ Y+K A V L
Sbjct: 306 YFDHLIPRILPLQYL--RGGPVIAVQIENEYG----SFSKDGDYMEYIKEALQKRGIVEL 359
Query: 188 NTSVPWVMCQQEDAPDPIINTCN--GFYCDGFTP---NSPSKPIMWTENYSGWFLSFGYA 242
+ Q + + T N F D F KPIM E ++GWF ++G
Sbjct: 360 LLTSDNHKGIQTGSVKGALTTINMASFEKDSFIKLLQMQNDKPIMVMEYWTGWFDTWGRE 419
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVATSYDYDAPI 296
+ E++ + V+RF + G +F N YM+ GGTNFG G V TSYDYDA +
Sbjct: 420 HNVKSAEEIRYTVSRFIKYGISF-NMYMFHGGTNFGFINGAFHYDKHSSVVTSYDYDAVL 478
Query: 297 DEYGFIRQPKWGHLREL 313
E G + K+ LR+L
Sbjct: 479 TEAGDYTE-KYFKLRKL 494
>gi|257870316|ref|ZP_05649969.1| glycosyl hydrolase [Enterococcus gallinarum EG2]
gi|257804480|gb|EEV33302.1| glycosyl hydrolase [Enterococcus gallinarum EG2]
Length = 593
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 110/314 (35%), Positives = 149/314 (47%), Gaps = 35/314 (11%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G L SG+IHY R P+ W + K G +ETYV WN HEP +G + FEG
Sbjct: 10 FLMNGSPFKLLSGAIHYFRVHPDDWEHSLYNLKALGFNTVETYVPWNLHEPHKGLFQFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL RF+ QE GL++ LR PY CAEW +GG P WL G + R + + +
Sbjct: 70 ILDLERFLSLAQELGLYVILRPSPYICAEWEFGGLPAWLLKESG-RLRACDPSYLAHVAE 128
Query: 132 F----LAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVG-GELYVKWAADTA 184
+ L KII S GG I++ QVENEYG+ E AY E+ + D
Sbjct: 129 YYDVLLPKIIPYQ------LSHGGNILMIQVENEYGSYGEEKAYLRAIKEMLINRGIDMP 182
Query: 185 VNLNTSVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYS 233
+ + PW + + D ++ T N D F ++ P+M E +
Sbjct: 183 L-FTSDGPWQAALRAGSLIEDDVLVTGNFGSRAKENFAAMQDFFDQHNKKWPLMCMEFWD 241
Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGPLV 286
GWF + + R +DLA +V E G N YM+ GGTNFG R A
Sbjct: 242 GWFNRWNEPIIRRDPDDLAESVKEALEIGSV--NLYMFHGGTNFGFMNGCSARGAVDLPQ 299
Query: 287 ATSYDYDAPIDEYG 300
TSYDYDAP+DE G
Sbjct: 300 VTSYDYDAPLDEQG 313
>gi|418977089|ref|ZP_13524926.1| glycosyl hydrolase family 35 [Streptococcus mitis SK575]
gi|383350422|gb|EID28291.1| glycosyl hydrolase family 35 [Streptococcus mitis SK575]
Length = 601
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 104/307 (33%), Positives = 149/307 (48%), Gaps = 25/307 (8%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK + SG+IHY R PE W + K G +ETYV WN HEP G++ FEG
Sbjct: 18 LDGKSFKILSGAIHYFRVPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFRFEGAL 77
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL RF++T Q+ GL+ +R P+ CAEW +GG P WL ++ R+++ + E + R+
Sbjct: 78 DLERFLQTAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKDMRIRSSDPAYIEAVGRYY 136
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
+++ + L GG I++ QVENEYG+ + AY ++ T +
Sbjct: 137 DQLLSRLVPHLL--DNGGNILMMQVENEYGSYGEDKAYLRAIRQLMEERGVTCPLFTSDG 194
Query: 192 PWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
PW + D + T N + F + P+M E + GWF +
Sbjct: 195 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 254
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
+ R ++LA AV E G N YM+ GGTNFG G G L TSYDYD
Sbjct: 255 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYD 312
Query: 294 APIDEYG 300
A +DE G
Sbjct: 313 ALLDEEG 319
>gi|424687003|ref|ZP_18123658.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|402366194|gb|EJV00591.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
Length = 593
Score = 149 bits (377), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++ FV+ ++ L + LR Y CAEW +GG P WL G++ R+T+ F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ QVENEYG +YG+ + Y++ L V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
P + + A + +++ D F T + P+M E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
+ GWF +G V R DLA V G N YM+ GGTNFG R A
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
TSYDYDA + E G + + + KAIK +C E + + P +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346
>gi|229548754|ref|ZP_04437479.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|257421063|ref|ZP_05598053.1| glycosyl hydrolase [Enterococcus faecalis X98]
gi|312951816|ref|ZP_07770707.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|422691033|ref|ZP_16749073.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|422707894|ref|ZP_16765431.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
gi|229306094|gb|EEN72090.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|257162887|gb|EEU92847.1| glycosyl hydrolase [Enterococcus faecalis X98]
gi|310630219|gb|EFQ13502.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|315154243|gb|EFT98259.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|315154885|gb|EFT98901.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
Length = 593
Score = 149 bits (377), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++ FV+ ++ L + LR Y CAEW +GG P WL G++ R+T+ F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ QVENEYG +YG+ + Y++ L V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
P + + A + +++ D F T + P+M E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
+ GWF +G V R DLA V G N YM+ GGTNFG R A
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
TSYDYDA + E G + + + KAIK +C E + + P +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346
>gi|257418414|ref|ZP_05595408.1| beta-galactosidase [Enterococcus faecalis T11]
gi|257160242|gb|EEU90202.1| beta-galactosidase [Enterococcus faecalis T11]
Length = 592
Score = 149 bits (377), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 10 FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++ FV+ ++ L + LR Y CAEW +GG P WL G++ R+T+ F +++
Sbjct: 70 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 129
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ QVENEYG +YG+ + Y++ L V
Sbjct: 130 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 182
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
P + + A + +++ D F T + P+M E
Sbjct: 183 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 240
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
+ GWF +G V R DLA V G N YM+ GGTNFG R A
Sbjct: 241 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 298
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
TSYDYDA + E G + + + KAIK +C E + + P +KLG
Sbjct: 299 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 345
>gi|257415380|ref|ZP_05592374.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
gi|257157208|gb|EEU87168.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
Length = 593
Score = 149 bits (377), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++ FV+ ++ L + LR Y CAEW +GG P WL G++ R+T+ F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ QVENEYG +YG+ + Y++ L V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
P + + A + +++ D F T + P+M E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
+ GWF +G V R DLA V G N YM+ GGTNFG R A
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
TSYDYDA + E G + + + KAIK +C E + + P +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346
>gi|443718372|gb|ELU09030.1| hypothetical protein CAPTEDRAFT_226658 [Capitella teleta]
Length = 347
Score = 149 bits (377), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 71/154 (46%), Positives = 102/154 (66%), Gaps = 2/154 (1%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
A ++GK+ +L SG++HY R PE W + + K K GL +ETYV WN HE +RG + F
Sbjct: 10 AFFLNGKKTLLLSGAVHYFRVVPEYWRDRLLKVKAAGLNCVETYVAWNAHEAVRGTFDFS 69
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G DL RF++ Q+ GL++ LR GPY C+EW++GG P WL P ++ RT+ P+ E +
Sbjct: 70 GILDLRRFIQIAQDVGLYVLLRPGPYICSEWDFGGLPSWLLHDPEMKVRTSYPPYLEAVD 129
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGN 164
+LAKI+ L+ +L S+GGPII Q+ENEYG+
Sbjct: 130 AYLAKILPLVN--DLQMSKGGPIIAVQLENEYGS 161
>gi|227517783|ref|ZP_03947832.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|424678087|ref|ZP_18114931.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|424681129|ref|ZP_18117923.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|424685648|ref|ZP_18122340.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|424689662|ref|ZP_18126226.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|424693525|ref|ZP_18129955.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|424698239|ref|ZP_18134537.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|424701365|ref|ZP_18137539.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|424702750|ref|ZP_18138894.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|424711867|ref|ZP_18144074.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|424717978|ref|ZP_18147248.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|424722429|ref|ZP_18151489.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|424723619|ref|ZP_18152577.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|424733091|ref|ZP_18161660.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|424746203|ref|ZP_18174452.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|424755204|ref|ZP_18183090.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
gi|227074744|gb|EEI12707.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|402351976|gb|EJU86842.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|402352513|gb|EJU87362.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|402358223|gb|EJU92905.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|402367111|gb|EJV01460.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|402371797|gb|EJV05943.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|402373001|gb|EJV07093.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|402373959|gb|EJV08006.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|402382684|gb|EJV16335.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|402383232|gb|EJV16843.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|402386182|gb|EJV19689.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|402388743|gb|EJV22170.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|402392403|gb|EJV25665.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|402397550|gb|EJV30559.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|402397571|gb|EJV30579.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|402401167|gb|EJV33955.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
Length = 593
Score = 149 bits (377), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++ FV+ ++ L + LR Y CAEW +GG P WL G++ R+T+ F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ QVENEYG +YG+ + Y++ L V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
P + + A + +++ D F T + P+M E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
+ GWF +G V R DLA V G N YM+ GGTNFG R A
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
TSYDYDA + E G + + + KAIK +C E + + P +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346
>gi|194857009|ref|XP_001968877.1| GG24263 [Drosophila erecta]
gi|190660744|gb|EDV57936.1| GG24263 [Drosophila erecta]
Length = 672
Score = 149 bits (377), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 106/327 (32%), Positives = 158/327 (48%), Gaps = 43/327 (13%)
Query: 6 TYDHRA--LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
T DH A ++DG+ SGS HY R+ PE W +R + GL ++TYV W+ H P
Sbjct: 47 TIDHAANTFMLDGQPFRYVSGSFHYFRAVPESWRSRLRTMRASGLNALDTYVEWSLHNPH 106
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHF-IPGIQFRTTN 122
G+Y +EG D+V+F++ Q+ ++ LR GPY CAE + GG P WL P I+ RT +
Sbjct: 107 DGEYNWEGIADVVKFLEIAQQEDFYIILRPGPYICAERDNGGLPHWLFAKYPSIKMRTND 166
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
+ E+ ++ A+++ + ++LF GG II+ QVENEYG+ + Y+ W D
Sbjct: 167 PNYIAEVGKWYAELMP--RLQHLFVGNGGKIIMVQVENEYGDYACDHD-----YLNWLRD 219
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTC----NGFYCDGFTPN---------------SPS 223
T + D P+ + +C N F F + P+
Sbjct: 220 ETEKYVTGKALLFTV--DIPNEKM-SCGKIENVFATTDFGIDRINEIDQIWAMLRTLQPT 276
Query: 224 KPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG 283
P++ +E Y GW + R +++A A+ + N YM+FGGTNFG TAG
Sbjct: 277 GPLVNSEFYPGWLTHWQEQNQRRDGQEVANALRTILSYNASV-NLYMFFGGTNFGFTAGA 335
Query: 284 ----------PLVATSYDYDAPIDEYG 300
TSYDYDA +DE G
Sbjct: 336 NYNLDGGIGYAADITSYDYDAVMDEAG 362
>gi|417846883|ref|ZP_12492867.1| beta-galactosidase [Streptococcus mitis SK1073]
gi|339458003|gb|EGP70556.1| beta-galactosidase [Streptococcus mitis SK1073]
Length = 595
Score = 149 bits (377), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 103/307 (33%), Positives = 150/307 (48%), Gaps = 25/307 (8%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK + SG+IHY R PE W + K G +ETYV WN HEP G+++FEG
Sbjct: 12 LDGKSFKILSGAIHYFRVPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFHFEGAL 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL RF++T Q+ GL+ +R P+ CAEW +GG P WL ++ R+++ + + + R+
Sbjct: 72 DLERFLQTAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKDMRLRSSDPAYIDAVGRYY 130
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
+++ + L GG I++ QVENEYG+ + AY ++ T +
Sbjct: 131 DQLLSRLVPHLL--DNGGNILIMQVENEYGSYGEDKAYLRAIRQLMEERGVTCPLFTSDG 188
Query: 192 PWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
PW + D + T N + F + P+M E + GWF +
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
+ R ++LA AV E G N YM+ GGTNFG G G L TSYDYD
Sbjct: 249 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYD 306
Query: 294 APIDEYG 300
A +DE G
Sbjct: 307 ALLDEEG 313
>gi|422727867|ref|ZP_16784288.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
gi|315151617|gb|EFT95633.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
Length = 593
Score = 149 bits (377), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++ FV+ ++ L + LR Y CAEW +GG P WL G++ R+T+ F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ QVENEYG +YG+ + Y++ L V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
P + + A + +++ D F T + P+M E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
+ GWF +G V R DLA V G N YM+ GGTNFG R A
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
TSYDYDA + E G + + + KAIK +C E + + P +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346
>gi|224542300|ref|ZP_03682839.1| hypothetical protein CATMIT_01478 [Catenibacterium mitsuokai DSM
15897]
gi|224524842|gb|EEF93947.1| glycosyl hydrolase family 35 [Catenibacterium mitsuokai DSM 15897]
Length = 577
Score = 149 bits (377), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 159/327 (48%), Gaps = 35/327 (10%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+IDG++ + SG++HY R PE W + + K+ G +ETY+ WN HEP +G++ F+G
Sbjct: 10 FIIDGQKTKIISGAVHYFRIVPEYWEDTLLDLKDMGCNAVETYIPWNLHEPYKGKFDFDG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
+ D+ F++ ++ GL++ +R PY C+EW GG P WL I+ RT ++ + + ++
Sbjct: 70 QKDVCAFLELAKKLGLYVIIRPSPYICSEWELGGLPAWLLKDSDIRLRTNDSVYMKHLEE 129
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ A ++ ++ + + ++ G IILAQ+ENEYG +Y + Y+K V
Sbjct: 130 YYAVLLPMIAKYQI--NREGTIILAQLENEYG----SYNQDKD-YLKALLKMMREYGIEV 182
Query: 192 PWVMCQ---QEDAPDPIINTCNGFYCDGFTPNSPSK---------------PIMWTENYS 233
P +E + + F F N+ PIM E +
Sbjct: 183 PIFTADGTWEEALEAGSLFEEDVFPTGNFGSNAKENIAVLKEFMKKHQIVAPIMCMEFWD 242
Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGPLV 286
GWF + + R E+L + + G N+YM+ GGTNFG R
Sbjct: 243 GWFNRWNMEIVKRDPEELVQSAKEMIDLGSI--NFYMFHGGTNFGWMNGCSARKEHDLPQ 300
Query: 287 ATSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDA + EYG + K+ LR++
Sbjct: 301 ITSYDYDAILTEYG-AKTEKYHLLRKM 326
>gi|423280524|ref|ZP_17259436.1| hypothetical protein HMPREF1203_03653 [Bacteroides fragilis HMW
610]
gi|404583731|gb|EKA88404.1| hypothetical protein HMPREF1203_03653 [Bacteroides fragilis HMW
610]
Length = 628
Score = 149 bits (377), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 111/346 (32%), Positives = 162/346 (46%), Gaps = 46/346 (13%)
Query: 15 DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
+GK + SG +HY R + W ++ K GL + TYVFWN HEP G++ F G +
Sbjct: 37 NGKVTPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 75 LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
L F+KT E G+ + LR GPY CAEW +GG+P WL + G++ R N F + K +
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAY-- 154
Query: 135 KIIDLMKQE--NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS- 190
ID + +E +L ++GGPI++ Q ENE+G+ V + E + + A L +
Sbjct: 155 --IDRLYKEVGDLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAG 212
Query: 191 --VPWVMCQ-----QEDAPDPIINTCNG------------FYCDGFTPNSPSKPIMWTEN 231
VP + A + T NG Y DG P M E
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVNQYHDG------KGPYMVAEF 266
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA---- 287
Y GW + P +A ++ + +F N+YM GGTNFG T+G
Sbjct: 267 YPGWLSHWAEPFPQVGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDI 325
Query: 288 ----TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
TSYDYDAPI E G++ PK+ +R + I+ +Y + P
Sbjct: 326 QPDLTSYDYDAPISEAGWV-TPKYDSIRNV---IRKYVKYTVPEAP 367
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 89/372 (23%), Positives = 142/372 (38%), Gaps = 59/372 (15%)
Query: 341 HIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP----AWSVSILPDCKNVVFNTAKVI 396
++ H +N ANYD D Y P W +NV+ K
Sbjct: 303 YMVHGGTNFGFTSGANYDKKRDIQPDLTSYDYDAPISEAGWVTPKYDSIRNVIRKYVKYT 362
Query: 397 SQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYL 456
+P + ++ +L + ++ E++ +S + P EQ+N Y+
Sbjct: 363 VPEAPAPNPVIEIPSI-KLTKVADVLAFAEKQKPVSADT----PLTFEQLN---QGYGYV 414
Query: 457 WYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGI 516
YT + P G L I L A+V+V+ + V G N + + + ++ N
Sbjct: 415 LYTRHFN-QPISGT---LEIPGLRDYAVVYVDGEQV--GVLNRNTKTYSMEIEVPFNA-- 466
Query: 517 NTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGL 575
TL IL +G NYG+ G+ S + I K +GEW +YQ+ + E L
Sbjct: 467 -TLQILVENMGRINYGSEIVHNTKGIISPVKIAGKE-----ITGEWDMYQLPM-SEMPDL 519
Query: 576 DKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGR 635
K+ A++ + + Y+ TF + G +++ + GKG +VNG +IGR
Sbjct: 520 AKLK-ADAHANVPAEAAKLKGCPVLYEGTFTL-DNVGDTFIDMENWGKGIIFVNGVNIGR 577
Query: 636 YWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
YW P QTLY IP W+ G N +VI E+L
Sbjct: 578 YWKV----------------------------GPQQTLY-IPGVWLKKGTNKIVIFEQLN 608
Query: 696 GDPSKISLLTKT 707
P KT
Sbjct: 609 EVPQAEVKTVKT 620
>gi|257083732|ref|ZP_05578093.1| beta-galactosidase [Enterococcus faecalis Fly1]
gi|256991762|gb|EEU79064.1| beta-galactosidase [Enterococcus faecalis Fly1]
Length = 593
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++ FV+ ++ L + LR Y CAEW +GG P WL G++ R+T+ F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ QVENEYG +YG+ + Y++ L V
Sbjct: 131 YFQVL--LPKLSPLQITQGGPVIMMQVENEYG----SYGM-EKAYLQQTKQIMEELGIEV 183
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
P + + A + +++ D F T + P+M E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLRKFMTRHGKKWPLMCMEY 241
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
+ GWF +G V R DLA V G N YM+ GGTNFG R A
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
TSYDYDA + E G + + + KAIK +C E + + P +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346
>gi|194221516|ref|XP_001490197.2| PREDICTED: beta-galactosidase-like [Equus caballus]
Length = 641
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 109/342 (31%), Positives = 162/342 (47%), Gaps = 20/342 (5%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+ Y H + DG+ SGSIHY R W + + K K GL I+TYV WN+HEP
Sbjct: 12 KIDYSHNRFLKDGQPFRYISGSIHYFRIPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQ 71
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
GQY F D+ F++ E GL + LR GPY CAEW+ GG P WL I R+++
Sbjct: 72 PGQYQFSEDHDVEYFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKQSIVLRSSDP 131
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAY-GVGGELYVKW 179
+ + ++L ++ MK L GGPII QVENEYG+ ++ Y +L+ +
Sbjct: 132 DYLAAVDKWLGVLLPKMKP--LLYQNGGPIITVQVENEYGSYFTCDYDYLRFLQKLFHQH 189
Query: 180 AADTAVNLNTS---VPWVMCQQEDAPDPIINTCNGFYCDGF----TPNSPSKPIMWTENY 232
D + T ++ C ++ +G + P P++ +E Y
Sbjct: 190 LGDDVLLFTTDGIFQKFLKCGALQGLYATVDFGSGINVTAAFQIQRKSEPRGPLINSEFY 249
Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL----VAT 288
+GW +G + +A + +G N YM+ GGTNF G L T
Sbjct: 250 TGWLDHWGQRHSKAKTDVVASTLYDILASGANV-NMYMFIGGTNFAYWNGANLPYQPQPT 308
Query: 289 SYDYDAPIDEYGFIRQPKWGHLRELHKAI-KLCEEYLISSDP 329
SYDYDAP+ E G + + K+ LR++ K K+ E ++ S P
Sbjct: 309 SYDYDAPLSEAGDLTE-KYFALRDVIKKFEKVPEGFIPPSTP 349
>gi|29375402|ref|NP_814556.1| glycosyl hydrolase [Enterococcus faecalis V583]
gi|29342862|gb|AAO80626.1| glycosyl hydrolase, family 35 [Enterococcus faecalis V583]
Length = 592
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 10 FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++ FV+ ++ L + LR Y CAEW +GG P WL G++ R+T+ F +++
Sbjct: 70 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 129
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ QVENEYG +YG+ + Y++ L V
Sbjct: 130 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 182
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
P + + A + +++ D F T + P+M E
Sbjct: 183 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 240
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
+ GWF +G V R DLA V G N YM+ GGTNFG R A
Sbjct: 241 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 298
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
TSYDYDA + E G + + + KAIK +C E + + P +KLG
Sbjct: 299 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 345
>gi|424665121|ref|ZP_18102157.1| hypothetical protein HMPREF1205_00996 [Bacteroides fragilis HMW
616]
gi|404574985|gb|EKA79730.1| hypothetical protein HMPREF1205_00996 [Bacteroides fragilis HMW
616]
Length = 628
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 111/346 (32%), Positives = 162/346 (46%), Gaps = 46/346 (13%)
Query: 15 DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
+GK + SG +HY R + W ++ K GL + TYVFWN HEP G++ F G +
Sbjct: 37 NGKVTPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 75 LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
L F+KT E G+ + LR GPY CAEW +GG+P WL + G++ R N F + K +
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAY-- 154
Query: 135 KIIDLMKQE--NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS- 190
ID + +E +L ++GGPI++ Q ENE+G+ V + E + + A L +
Sbjct: 155 --IDRLYKEVGDLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAG 212
Query: 191 --VPWVMCQ-----QEDAPDPIINTCNG------------FYCDGFTPNSPSKPIMWTEN 231
VP + A + T NG Y DG P M E
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVNQYHDG------KGPYMVAEF 266
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA---- 287
Y GW + P +A ++ + +F N+YM GGTNFG T+G
Sbjct: 267 YPGWLSHWAEPFPQVGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDI 325
Query: 288 ----TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
TSYDYDAPI E G++ PK+ +R + I+ +Y + P
Sbjct: 326 QPDLTSYDYDAPISEAGWV-TPKYDSIRNV---IRKYVKYTVPEAP 367
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 89/372 (23%), Positives = 142/372 (38%), Gaps = 59/372 (15%)
Query: 341 HIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP----AWSVSILPDCKNVVFNTAKVI 396
++ H +N ANYD D Y P W +NV+ K
Sbjct: 303 YMVHGGTNFGFTSGANYDKKRDIQPDLTSYDYDAPISEAGWVTPKYDSIRNVIRKYVKYT 362
Query: 397 SQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYL 456
+P + ++ +L + ++ E++ +S + F EQ+N Y+
Sbjct: 363 VPEAPAPNPVIEIPSI-KLTKVADVLAFAEKQKPVSADTPFT----FEQLN---QGYGYV 414
Query: 457 WYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGI 516
YT + P G L I L A+V+V+ + V G N + + + ++ N
Sbjct: 415 LYTRHFN-QPISGT---LEIPGLRDYAVVYVDGEQV--GVLNRNTKTYSMEIEVPFNA-- 466
Query: 517 NTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGL 575
TL IL +G NYG+ G+ S + I K +GEW +YQ+ + E L
Sbjct: 467 -TLQILVENMGRINYGSEIVHNTKGIISPVKIAGKE-----ITGEWDMYQLPM-SEMPDL 519
Query: 576 DKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGR 635
K+ A++ + + Y+ TF + G +++ + GKG +VNG +IGR
Sbjct: 520 AKLK-ADAHANVPAEAAKLKGCPVLYEGTFTL-DNVGDTFIDMENWGKGIIFVNGVNIGR 577
Query: 636 YWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
YW P QTLY IP W+ G N +VI E+L
Sbjct: 578 YWKV----------------------------GPQQTLY-IPGVWLKKGTNKIVIFEQLN 608
Query: 696 GDPSKISLLTKT 707
P KT
Sbjct: 609 EVPQAEVKTVKT 620
>gi|195473731|ref|XP_002089146.1| GE18961 [Drosophila yakuba]
gi|194175247|gb|EDW88858.1| GE18961 [Drosophila yakuba]
Length = 672
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 105/323 (32%), Positives = 157/323 (48%), Gaps = 39/323 (12%)
Query: 8 DHRA--LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
DH A ++DG+ SGS HY R+ PE W +R + GL ++TYV W+ H P G
Sbjct: 49 DHEANTFMLDGQPFRYVSGSFHYFRAVPESWRSRLRTMRASGLNALDTYVEWSLHNPHDG 108
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHF-IPGIQFRTTNNP 124
+Y +EG D+V+F++ QE ++ LR GPY CAE + GG P WL P I+ RT +
Sbjct: 109 EYNWEGIADVVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFAKYPSIKMRTNDPN 168
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD-- 182
+ E+ ++ A+++ + ++LF GG II+ QVENEYG+ + Y+ W D
Sbjct: 169 YISEVGKWYAELMP--RLQHLFVGNGGKIIMVQVENEYGDYACDHD-----YLNWLRDET 221
Query: 183 ------TAVNLNTSVP--WVMCQQ-------EDAPDPIINTCNGFYCDGFTPNSPSKPIM 227
A+ +P + C + D IN + + P+ P++
Sbjct: 222 EKYVSGKALLFTVDIPNEKMSCGKIENVFATTDFGIDRINEIDKIWA-MLRALQPTGPLV 280
Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG---- 283
+E Y GW + R +++A A+ + N YM+FGGTNFG TAG
Sbjct: 281 NSEFYPGWLTHWQEQNQRRDGQEVANALRTILSYNASV-NLYMFFGGTNFGFTAGANYNL 339
Query: 284 ------PLVATSYDYDAPIDEYG 300
TSYDYDA +DE G
Sbjct: 340 DGGIGYAADITSYDYDAVMDEAG 362
>gi|1352080|sp|P48982.1|BGAL_XANMN RecName: Full=Beta-galactosidase; Short=Lactase; Flags: Precursor
gi|1045034|gb|AAC41485.1| beta-galactosidase [Xanthomonas axonopodis pv. manihotis]
Length = 598
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 116/320 (36%), Positives = 152/320 (47%), Gaps = 25/320 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
V DGK L SG+IH+ R W + ++K++ GL +ETYVFWN EP +GQ+ F G
Sbjct: 37 FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 96
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ FVK GL + LR GPYACAEW GG+P WL I+ R+ + F +
Sbjct: 97 NNDVAAFVKEAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQA 156
Query: 132 FLAKIIDLMKQ-ENLFASQGGPIILAQVENEYGNV--EWAYGVGGE-LYVKWAADTAVNL 187
+L L KQ + L GGPII QVENEYG+ + AY +YVK D A+ L
Sbjct: 157 YLDA---LAKQVQPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-L 212
Query: 188 NTSVPWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
TS M PD ++N G D P +P M E ++GWF +G
Sbjct: 213 FTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGKP 272
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL----------VATSYDY 292
A G + N YM+ GGT+FG G TSYDY
Sbjct: 273 HAATDARQQAEEFEWILRQGHS-ANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDY 331
Query: 293 DAPIDEYGFIRQPKWGHLRE 312
DA +DE G PK+ +R+
Sbjct: 332 DAILDEAGHP-TPKFALMRD 350
>gi|334138027|ref|ZP_08511451.1| beta-galactosidase [Paenibacillus sp. HGF7]
gi|333604560|gb|EGL15950.1| beta-galactosidase [Paenibacillus sp. HGF7]
Length = 601
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 111/325 (34%), Positives = 156/325 (48%), Gaps = 38/325 (11%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
L+ D R++ SG++HY R PE W + + K K G +ETYV WN HEP G++ F G
Sbjct: 12 LLNDKPLRII-SGALHYFRVVPEYWRDRLLKMKACGCNTVETYVAWNVHEPEEGKFDFGG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D++ FV+ E GL + +R PY CAEW +GG P WL +Q R ++ +
Sbjct: 71 IADVIAFVELAGELGLHVIVRPSPYICAEWEFGGLPAWLLKDSEMQLRCSD-------PK 123
Query: 132 FLAKI-----IDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVN 186
FLAK+ + L K L + GGPII QVENEYG +YG + Y+ + D +
Sbjct: 124 FLAKVDAYYDVLLPKFVPLLCTNGGPIIAMQVENEYG----SYG-NDKAYLGYLRDGMIA 178
Query: 187 LNTSVPWV--------MCQQEDAPDPIINTCNGFYCD----GFTPNSPSKPIMWTENYSG 234
V M Q PD + G + F P +P+M E ++G
Sbjct: 179 RGIDVLLFTSDGPTDEMLQGGTLPDVLATVNFGSRPEESFAKFREYRPDEPLMCMEFWNG 238
Query: 235 WFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV------AT 288
WF + R ED A + G + N+YM+ GGTNFG +G + T
Sbjct: 239 WFDHWMEEHHTRDGEDAARVLDDMLGAGASV-NFYMFHGGTNFGFYSGANHIKTYEPTVT 297
Query: 289 SYDYDAPIDEYGFIRQPKWGHLREL 313
SYDYDAP+ E G + K+ RE+
Sbjct: 298 SYDYDAPLTERGDL-TAKYEAFREV 321
>gi|327260596|ref|XP_003215120.1| PREDICTED: beta-galactosidase-like [Anolis carolinensis]
Length = 679
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 102/320 (31%), Positives = 146/320 (45%), Gaps = 27/320 (8%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
S ++ Y + + DG + SGSIHY R W + + K GL ++ Y+ WNYHE
Sbjct: 70 SFSIDYTDKCFLKDGVKFRYISGSIHYFRIPRAYWKDRLLKMYMSGLNAVQIYIPWNYHE 129
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P+ G Y F+G DL F+ L + LR GPY CAEW GG P WL P I RT+
Sbjct: 130 PLSGVYNFDGDRDLEGFLDLAANFDLLVILRPGPYICAEWEMGGIPSWLLAKPNIILRTS 189
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
+ F + + ++ + ++ +K GG II QVENEYG+ Y + A
Sbjct: 190 DPDFLQAVDKWFSVLLPKIKPH--LYINGGNIISVQVENEYGSY---YACDYDYLRHLEA 244
Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD-GFTP-------------NSPSKPIM 227
L V + + T +G Y F P + P+ P++
Sbjct: 245 VFRSYLGKKVVLFTTDGTKESELLCGTLHGLYTTVDFGPEENVTEAFEKQRIHEPNGPLV 304
Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-- 285
+E Y+GW +G + ED+A + + E G N YM+ GGTNFG +G
Sbjct: 305 NSEYYTGWLDYWGEPHSTKSAEDVARGLEKMLELGANV-NMYMFQGGTNFGYWSGADYNN 363
Query: 286 -----VATSYDYDAPIDEYG 300
+ TSYDYDAP+ E G
Sbjct: 364 GIYNPITTSYDYDAPLSEAG 383
>gi|255973889|ref|ZP_05424475.1| beta-galactosidase [Enterococcus faecalis T2]
gi|307284354|ref|ZP_07564519.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|255966761|gb|EET97383.1| beta-galactosidase [Enterococcus faecalis T2]
gi|306503294|gb|EFM72546.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
Length = 593
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++ FV+ ++ L + LR Y CAEW +GG P WL G++ R+T+ F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ QVENEYG +YG+ + Y++ L V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTRQIMEELGIEV 183
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
P + + A + +++ D F T + P+M E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
+ GWF +G V R DLA V G N YM+ GGTNFG R A
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
TSYDYDA + E G + + + KAIK +C E + + P +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346
>gi|420261585|ref|ZP_14764229.1| glycosyl hydrolase [Enterococcus sp. C1]
gi|394771519|gb|EJF51280.1| glycosyl hydrolase [Enterococcus sp. C1]
Length = 591
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 118/352 (33%), Positives = 168/352 (47%), Gaps = 43/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DGK L SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 10 FLLDGKPIKLISGAIHYFRMTPVQWTDSLYNLKALGANTVETYIPWNLHEPREGVYDFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ FVK Q GL + LR Y CAEW +GG P WL P ++ R+T+ F +++
Sbjct: 70 MKDICAFVKQAQTIGLMVILRPSVYICAEWEFGGLPAWLLNEP-MRLRSTDPRFMAKVRN 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L + GGP+I+ QVENEYG +YG+ + Y++ + V
Sbjct: 129 YFQVL--LPKLVPLQITHGGPVIMMQVENEYG----SYGM-EKAYLRQTKELMEEYGIDV 181
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF-TPNSPSK-------------------PIMWTEN 231
P + + A + +++ D F T N S+ PIM E
Sbjct: 182 P--LFTSDGAWEEVLDAGTLIEDDIFVTGNFGSRSKENAAVMKEFMAKHGKNWPIMCMEY 239
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR----TAGGPL-- 285
+ GWF +G + R +DLA V G N YM+ GGTNFG +A G L
Sbjct: 240 WDGWFNRWGEPIIKRDGQDLANEVKEMLAVGSL--NLYMFHGGTNFGFYNGCSARGALDL 297
Query: 286 -VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGA 336
+SYDYDA + E G +P ++ KAIK + + P ++L A
Sbjct: 298 PQVSSYDYDALLTEAG---EPT-DKYYQVQKAIKEACPEVWQAKPRTKQLAA 345
>gi|297483826|ref|XP_002693891.1| PREDICTED: galactosidase, beta 1-like 3 [Bos taurus]
gi|296479482|tpg|DAA21597.1| TPA: galactosidase, beta 1-like [Bos taurus]
Length = 899
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 113/328 (34%), Positives = 153/328 (46%), Gaps = 37/328 (11%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++G ++ GS+HY R W + + K + G + TYV WN HEP RG + F G
Sbjct: 321 FTLEGHEFLILGGSVHYFRVPRASWRDRLLKLRACGFNTVTTYVPWNLHEPERGTFDFSG 380
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL F+ +E GL++ LR GPY C+E + GG P WL P Q RTTN F + +
Sbjct: 381 NLDLEAFILLAEEVGLWVILRPGPYICSEMDLGGLPSWLLQDPTSQLRTTNRSFVNAVNK 440
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAY-----------GVGGELYVK 178
+ +I + QGGPII QVENEYG + AY G+GG L
Sbjct: 441 YFDHLIPRVALLQYL--QGGPIIAVQVENEYGFFYKDEAYMPYLLQALQQRGIGGLLLT- 497
Query: 179 WAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGW 235
A T + + V+ IN GF D F KPI+ E + GW
Sbjct: 498 -ADSTEEVMRGHIKGVLAS--------IN-MKGFKVDSFKHLYKLQRHKPILIMEFWVGW 547
Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATS 289
F ++G V ++ +V+ F G +F N YM+ GGTNFG G V TS
Sbjct: 548 FDTWGIDHRVMGVNEVEKSVSEFIRYGISF-NVYMFHGGTNFGFMNGATSFEKHRGVTTS 606
Query: 290 YDYDAPIDEYGFIRQPKWGHLRELHKAI 317
YDYDA + E G K+ LR L ++I
Sbjct: 607 YDYDAVLTEAGDYTA-KYFMLRSLFESI 633
>gi|397699203|ref|YP_006536991.1| beta-galactosidase [Enterococcus faecalis D32]
gi|397335842|gb|AFO43514.1| beta-galactosidase [Enterococcus faecalis D32]
Length = 593
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++ FV+ ++ L + LR Y CAEW +GG P WL G++ R+T+ F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ QVENEYG +YG+ + Y++ L V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
P + + A + +++ D F T + P+M E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLRKFMTRHGKKWPLMCMEY 241
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
+ GWF +G V R DLA V G N YM+ GGTNFG R A
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
TSYDYDA + E G + + + KAIK +C E + + P +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346
>gi|312901648|ref|ZP_07760918.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
gi|311291259|gb|EFQ69815.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
Length = 593
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++ FV+ ++ L + LR Y CAEW +GG P WL G++ R+T+ F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ QVENEYG +YG+ + Y++ L V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
P + + A + +++ D F T + P+M E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
+ GWF +G V R DLA V G N YM+ GGTNFG R A
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
TSYDYDA + E G + + + KAIK +C E + + P +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346
>gi|256761574|ref|ZP_05502154.1| beta-galactosidase [Enterococcus faecalis T3]
gi|422736227|ref|ZP_16792491.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
gi|256682825|gb|EEU22520.1| beta-galactosidase [Enterococcus faecalis T3]
gi|315166978|gb|EFU10995.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
Length = 593
Score = 149 bits (376), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++ FV+ ++ L + LR Y CAEW +GG P WL G++ R+T+ F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ QVENEYG +YG+ + Y++ L V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
P + + A + +++ D F T + P+M E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
+ GWF +G V R DLA V G N YM+ GGTNFG R A
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
TSYDYDA + E G + + + KAIK +C E + + P +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346
>gi|255971270|ref|ZP_05421856.1| beta-galactosidase [Enterococcus faecalis T1]
gi|255962288|gb|EET94764.1| beta-galactosidase [Enterococcus faecalis T1]
Length = 593
Score = 149 bits (376), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++ FV+ ++ L + LR Y CAEW +GG P WL G++ R+T+ F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ QVENEYG +YG+ + Y++ L V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTRQIMEELGIEV 183
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
P + + A + +++ D F T + P+M E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
+ GWF +G V R DLA V G N YM+ GGTNFG R A
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
TSYDYDA + E G + + + KAIK +C E + + P +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346
>gi|227554928|ref|ZP_03984975.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|422713751|ref|ZP_16770500.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|422716430|ref|ZP_16773136.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|227175936|gb|EEI56908.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|315575268|gb|EFU87459.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|315581351|gb|EFU93542.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
Length = 593
Score = 149 bits (376), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++ FV+ ++ L + LR Y CAEW +GG P WL G++ R+T+ F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ QVENEYG +YG+ + Y++ L V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
P + + A + +++ D F T + P+M E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
+ GWF +G V R DLA V G N YM+ GGTNFG R A
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
TSYDYDA + E G + + + KAIK +C E + + P +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346
>gi|21243811|ref|NP_643393.1| beta-galactosidase [Xanthomonas axonopodis pv. citri str. 306]
gi|390989312|ref|ZP_10259611.1| beta-galactosidase [Xanthomonas axonopodis pv. punicae str. LMG
859]
gi|21109406|gb|AAM37929.1| beta-galactosidase [Xanthomonas axonopodis pv. citri str. 306]
gi|372556070|emb|CCF66586.1| beta-galactosidase [Xanthomonas axonopodis pv. punicae str. LMG
859]
Length = 613
Score = 149 bits (376), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 112/319 (35%), Positives = 152/319 (47%), Gaps = 23/319 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
V DGK L SG+IH+ R W + ++K++ GL +ETYVFWN EP +GQ+ F G
Sbjct: 39 FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 98
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ FV+ GL + LR GPYACAEW GG+P WL I+ R+ + F +
Sbjct: 99 HNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQA 158
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGE-LYVKWAADTAVNLN 188
+L + + + + L GGPII QVENEYG+ + AY +YVK D A+ L
Sbjct: 159 YLDALANQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LF 215
Query: 189 TSVPWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
TS M PD ++N G D P +P M E ++GWF +G
Sbjct: 216 TSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGKPH 275
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL----------VATSYDYD 293
A G + N YM+ GGT+FG G TSYDYD
Sbjct: 276 AATDARQQAEEFEWILRQGHS-ANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYD 334
Query: 294 APIDEYGFIRQPKWGHLRE 312
A +DE G PK+ +R+
Sbjct: 335 AILDEAGHP-TPKFALMRD 352
>gi|294627330|ref|ZP_06705916.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 11122]
gi|292598412|gb|EFF42563.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 11122]
Length = 613
Score = 149 bits (376), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 116/322 (36%), Positives = 156/322 (48%), Gaps = 29/322 (9%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
V DGK L SG+IH+ R W + ++K++ GL +ETYVFWN EP +GQ+ F G
Sbjct: 39 FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 98
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ FV+ GL + LR GPYACAEW GG+P WL I+ R+ + F +
Sbjct: 99 NNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQA 158
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGE-LYVKWAADTAVNLN 188
+L + + + + L GGPII QVENEYG+ + AY +YVK D A+ L
Sbjct: 159 YLDALANQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LF 215
Query: 189 TSVPWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
TS M PD ++N G D P +P M E ++GWF +G
Sbjct: 216 TSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGK-- 273
Query: 244 PFRPVEDLAFAVARFFE---TGGTFQNYYMYFGGTNFGRTAGGPL----------VATSY 290
P + A A FE G N YM+ GGT+FG G TSY
Sbjct: 274 PHAATD--ARQQAEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSY 331
Query: 291 DYDAPIDEYGFIRQPKWGHLRE 312
DYDA +DE G PK+ +R+
Sbjct: 332 DYDAILDEAGHP-TPKFALMRD 352
>gi|358415935|ref|XP_600640.6| PREDICTED: uncharacterized protein LOC522360 [Bos taurus]
Length = 1360
Score = 149 bits (376), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 114/328 (34%), Positives = 154/328 (46%), Gaps = 37/328 (11%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++G ++ GS+HY R W + + K + G + TYV WN HEP RG + F G
Sbjct: 321 FTLEGHEFLILGGSVHYFRVPRASWRDRLLKLRACGFNTVTTYVPWNLHEPERGTFDFSG 380
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL F+ +E GL++ LR GPY C+E + GG P WL P Q RTTN F + +
Sbjct: 381 NLDLEAFILLAEEVGLWVILRPGPYICSEMDLGGLPSWLLQDPTSQLRTTNRSFVNAVNK 440
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAY-----------GVGGELYVK 178
+ +I + L QGGPII QVENEYG + AY G+GG L
Sbjct: 441 YFDHLIP--RVALLQYLQGGPIIAVQVENEYGFFYKDEAYMPYLLQALQQRGIGGLLLT- 497
Query: 179 WAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGW 235
A T + + V+ IN GF D F KPI+ E + GW
Sbjct: 498 -ADSTEEVMRGHIKGVLAS--------IN-MKGFKVDSFKHLYKLQRHKPILIMEFWVGW 547
Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATS 289
F ++G V ++ +V+ F G +F N YM+ GGTNFG G V TS
Sbjct: 548 FDTWGIDHRVMGVNEVEKSVSEFIRYGISF-NVYMFHGGTNFGFMNGATSFEKHRGVTTS 606
Query: 290 YDYDAPIDEYGFIRQPKWGHLRELHKAI 317
YDYDA + E G K+ LR L ++I
Sbjct: 607 YDYDAVLTEAGDYTA-KYFMLRSLFESI 633
>gi|148273884|ref|YP_001223445.1| putative beta-galactosidase [Clavibacter michiganensis subsp.
michiganensis NCPPB 382]
gi|147831814|emb|CAN02784.1| putative beta-galactosidase [Clavibacter michiganensis subsp.
michiganensis NCPPB 382]
Length = 599
Score = 149 bits (376), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 100/305 (32%), Positives = 144/305 (47%), Gaps = 26/305 (8%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DG+ + +G++HY R P+ W + IRK++ GL+ IETYV WN H P RG +
Sbjct: 20 LDGRPHRVIAGALHYFRVHPDQWADRIRKARLMGLDTIETYVAWNAHSPERGAFDTSAGL 79
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL RF+ V G+ +R GPY CAEW+ GG P WL P + R + + + FL
Sbjct: 80 DLGRFLDLVHAEGMHAIVRPGPYICAEWDGGGLPGWLFEDPAVGVRRSEPLYLAAVDEFL 139
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
++ +++ + GGP+IL Q+ENEYG AYG + Y++ D VP
Sbjct: 140 RRVYEIVAPRQI--DMGGPVILVQIENEYG----AYGDDAD-YLRHLVDLTRESGIIVPL 192
Query: 194 VMCQQEDAPDPIINTCNGFYCDG------------FTPNSPSKPIMWTENYSGWFLSFGY 241
Q + + + G + P+ P+M +E + GWF +G
Sbjct: 193 TTVDQPTDEMLSRGSLDELHRTGSFGSRATERLATLRRHQPTGPLMCSEFWDGWFDHWGE 252
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVATSYDYDAP 295
D A + G + N YM+ GGTNFG T G TSYDYDAP
Sbjct: 253 HHHTTSAADAAAELDALLAAGASV-NIYMFHGGTNFGFTNGANHKGTYQSHVTSYDYDAP 311
Query: 296 IDEYG 300
+DE G
Sbjct: 312 LDETG 316
>gi|313241117|emb|CBY33414.1| unnamed protein product [Oikopleura dioica]
Length = 608
Score = 149 bits (376), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 109/347 (31%), Positives = 159/347 (45%), Gaps = 38/347 (10%)
Query: 13 VIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGR 72
+ K R+L SGS+HY R E W + + K K GL ++TY+ WN HEP G + FE
Sbjct: 12 LFKSKTRIL-SGSLHYFRVPKEYWRDRLEKLKGAGLNTVQTYIGWNLHEPREGDFIFEDE 70
Query: 73 FDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN-PFKEEMKR 131
D+ F+K ++ GL++ +R GPY CAEW +GGFP WL + R T + + ++
Sbjct: 71 LDVSEFLKIAKDVGLYVIMRPGPYICAEWEWGGFPAWLLTKENMIVRQTKSEAYLAAVQN 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + ++ S+GGPII QVENEY +Y E Y+ W + ++
Sbjct: 131 WFTVLFSQLRDHQW--SRGGPIISIQVENEYA----SYNKDSE-YLPWVKNLLTDVGKCF 183
Query: 192 PWVMCQQED--------APDPII-----NTCNGF-YCDGFTPNSPSKPIMWTENYSGWFL 237
+ + + PD + + N F D PN +P M TE ++GWF
Sbjct: 184 LLKIINETNFFLKGAHLLPDTFLTANFQSVGNAFEVLDKLQPN---RPKMVTEFWAGWFD 240
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG---------GPLVAT 288
+G R G+ N YM+ GGT+FG AG G T
Sbjct: 241 HWGQQGHSTLSPTTFNKTMREILNAGSSVNQYMFHGGTSFGWMAGSNWLSKKQRGTSDTT 300
Query: 289 SYDYDAPIDEYGFIRQPKWGHLRELHKAI--KLCEEYLISSDPTHQK 333
SYDYDAP+ E G + + KW RE+ K K + + P QK
Sbjct: 301 SYDYDAPLSESGDLTE-KWNVTREIIKEFFPKYINDSYVFRRPEIQK 346
>gi|256959941|ref|ZP_05564112.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|293384307|ref|ZP_06630193.1| beta-galactosidase [Enterococcus faecalis R712]
gi|293388457|ref|ZP_06632963.1| beta-galactosidase [Enterococcus faecalis S613]
gi|312907112|ref|ZP_07766105.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|312979309|ref|ZP_07791007.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|256950437|gb|EEU67069.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|291078380|gb|EFE15744.1| beta-galactosidase [Enterococcus faecalis R712]
gi|291082147|gb|EFE19110.1| beta-galactosidase [Enterococcus faecalis S613]
gi|310626889|gb|EFQ10172.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|311287903|gb|EFQ66459.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
Length = 593
Score = 149 bits (376), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++ FV+ ++ L + LR Y CAEW +GG P WL G++ R+T+ F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ QVENEYG +YG+ + Y++ L V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLQQTKQIMEELGIEV 183
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
P + + A + +++ D F T + P+M E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLRKFMTRHGKKWPLMCMEY 241
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
+ GWF +G V R DLA V G N YM+ GGTNFG R A
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
TSYDYDA + E G + + + KAIK +C E + + P +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346
>gi|156408171|ref|XP_001641730.1| predicted protein [Nematostella vectensis]
gi|156228870|gb|EDO49667.1| predicted protein [Nematostella vectensis]
Length = 647
Score = 149 bits (376), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 107/333 (32%), Positives = 163/333 (48%), Gaps = 25/333 (7%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
LS ++ YD+ + DGK SG +HY R W + + K K G+ ++TYV WN H
Sbjct: 18 LSFSIDYDNNCFMKDGKPFRYISGGMHYFRVPQYYWKDRLLKLKASGMNTVQTYVPWNLH 77
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EPI QY F G +L F++ Q L + LR GPY CAEW++GG P WL P I R+
Sbjct: 78 EPIPKQYNFAGNANLTSFLEIAQSLDLLVILRPGPYICAEWDFGGLPGWLLKDPSIVIRS 137
Query: 121 TN-NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAYGVG-GEL 175
+ + E + +++ ++ L+K GGP+I+ QVENEYG+ + Y + +L
Sbjct: 138 SQGKAYMEAVDAWMSVLLPLVKP--FLYENGGPVIMVQVENEYGDYIHCDHQYMLHLQQL 195
Query: 176 YVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSP---------SKPI 226
+ D + T + E P + T F + P+ P P+
Sbjct: 196 FRYHLTDDIILFTTDDGSNLTAIECGTLPSLYTTVDFGANT-DPSIPFANQRKLQQKGPL 254
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL- 285
+ +E Y+GW +G R + +A A+ + + N YM+ GGTNFG +G
Sbjct: 255 VNSEFYTGWLDYWGTPHQTRTSKVVADALDKILALNASV-NLYMFEGGTNFGFWSGADFH 313
Query: 286 -----VATSYDYDAPIDEYGFIRQPKWGHLREL 313
V TSYDYDAP+ E G + + K+ +RE+
Sbjct: 314 GQYQPVPTSYDYDAPLTEAGDLTE-KYHAIREV 345
>gi|383939096|ref|ZP_09992284.1| glycosyl hydrolase family 35 [Streptococcus pseudopneumoniae SK674]
gi|418972932|ref|ZP_13520979.1| glycosyl hydrolase family 35 [Streptococcus pseudopneumoniae ATCC
BAA-960]
gi|383350776|gb|EID28631.1| glycosyl hydrolase family 35 [Streptococcus pseudopneumoniae ATCC
BAA-960]
gi|383714006|gb|EID70024.1| glycosyl hydrolase family 35 [Streptococcus pseudopneumoniae SK674]
Length = 595
Score = 149 bits (376), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 104/307 (33%), Positives = 150/307 (48%), Gaps = 25/307 (8%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK + SG+IHY R PE W + K G +ETYV WN HEP G+++FEG
Sbjct: 12 LDGKPFKILSGAIHYFRIPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFHFEGAL 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL RF++T Q+ GL+ +R P+ CAEW +GG P WL ++ R+++ + E + R+
Sbjct: 72 DLERFLQTAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKDMRIRSSDPAYIEAVGRYY 130
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
+++ + L GG I++ QVENEYG+ + AY ++ T +
Sbjct: 131 DQLLPRLVPHLL--DNGGNILMMQVENEYGSYGEDKAYLRAIRQLMEERGVTCPLFTSDG 188
Query: 192 PWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
PW + D + T N + F + P+M E + GWF +
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
+ R ++LA AV E G N YM+ GGTNFG G G L TSYDYD
Sbjct: 249 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYD 306
Query: 294 APIDEYG 300
A +DE G
Sbjct: 307 ALLDEEG 313
>gi|418518035|ref|ZP_13084189.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB1386]
gi|410705285|gb|EKQ63761.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB1386]
Length = 613
Score = 149 bits (376), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 116/322 (36%), Positives = 156/322 (48%), Gaps = 29/322 (9%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
V DGK L SG+IH+ R W + ++K++ GL +ETYVFWN EP +GQ+ F G
Sbjct: 39 FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 98
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ FV+ GL + LR GPYACAEW GG+P WL I+ R+ + F +
Sbjct: 99 HNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQA 158
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGE-LYVKWAADTAVNLN 188
+L + + + + L GGPII QVENEYG+ + AY +YVK D A+ L
Sbjct: 159 YLDALANQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LF 215
Query: 189 TSVPWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
TS M PD ++N G D P +P M E ++GWF +G
Sbjct: 216 TSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGK-- 273
Query: 244 PFRPVEDLAFAVARFFE---TGGTFQNYYMYFGGTNFGRTAGGPL----------VATSY 290
P + A A FE G N YM+ GGT+FG G TSY
Sbjct: 274 PHAATD--ARQQAEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSY 331
Query: 291 DYDAPIDEYGFIRQPKWGHLRE 312
DYDA +DE G PK+ +R+
Sbjct: 332 DYDAILDEAGHP-TPKFALMRD 352
>gi|423259078|ref|ZP_17240001.1| hypothetical protein HMPREF1055_02278 [Bacteroides fragilis
CL07T00C01]
gi|423263951|ref|ZP_17242954.1| hypothetical protein HMPREF1056_00641 [Bacteroides fragilis
CL07T12C05]
gi|387776658|gb|EIK38758.1| hypothetical protein HMPREF1055_02278 [Bacteroides fragilis
CL07T00C01]
gi|392706217|gb|EIY99340.1| hypothetical protein HMPREF1056_00641 [Bacteroides fragilis
CL07T12C05]
Length = 773
Score = 149 bits (376), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 105/332 (31%), Positives = 158/332 (47%), Gaps = 29/332 (8%)
Query: 10 RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
R +++G V+++ +HY R W I K G+ I Y+FWNYHE G++ F
Sbjct: 31 RTFLLNGNPFVVKAAELHYARIPEPYWEHRILMCKALGMNTICLYMFWNYHEQQEGKFDF 90
Query: 70 EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
G ++ +F K Q+ G+++ LR GPY CAEW GG P WL ++ R+ N F E
Sbjct: 91 SGEKNVAKFCKLAQKHGMYIILRPGPYVCAEWEMGGLPWWLLKEKDMKVRSLNPYFMERT 150
Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGG-------ELYVKWAAD 182
+ F+ ++ + L + GG II+ QVENE+G YGV ++ + D
Sbjct: 151 EIFMKELGKQLAPLQL--ANGGNIIMVQVENEFG----GYGVDKPYMTAIRDIVCRAGFD 204
Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGW 235
+V W + +A D ++ T N G D + P P+M +E +SGW
Sbjct: 205 KSVLFQCD--WDSTFELNALDDLLWTLNFGTGANIDKEFKKLSTVRPDTPLMCSEFWSGW 262
Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSY 290
F +G RP E + + + +F + YM GGT FG G + +SY
Sbjct: 263 FDHWGRKHETRPAEKMVEGIKDMLDRNISF-SLYMTHGGTTFGHWGGANSPTYSAMCSSY 321
Query: 291 DYDAPIDEYGFIRQPKWGHLRELHKAIKLCEE 322
DYDAPI E G+ PK+ L+EL + EE
Sbjct: 322 DYDAPISEAGWT-TPKYYLLQELLGKYRSPEE 352
Score = 41.2 bits (95), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 56/254 (22%), Positives = 99/254 (38%), Gaps = 50/254 (19%)
Query: 440 PDL--AEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYG 497
PD +EQ+ +D + +P L I A ++ + KL+ +
Sbjct: 382 PDFVQSEQVKPMEDFNQGWGSILYRTTLPATEANTLLRITEAHDWAQIYADGKLLGYLDR 441
Query: 498 NHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDL 557
D ++ +L EG LDI +G N+G+ V LI K K+ +
Sbjct: 442 RKDDNQVILP---QLPEGTQ-LDIWVEAMGRVNFGSTVHDRKGITEKVELI--KPDKQAV 495
Query: 558 SSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLAL 616
+ W +Y + V+ ++ K S +NS + +YK TF + G +
Sbjct: 496 TLKNWKVYSIPVDYKFAARKKYS-SNSR----------PEGPAYYKATFNLTK-TGDTFI 543
Query: 617 NLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHI 676
++++ GKG WVNG ++GR+W P QTL+ +
Sbjct: 544 DMSTWGKGMVWVNGHALGRFWEI----------------------------GPQQTLF-L 574
Query: 677 PRTWVHPGENLLVI 690
P W+ G+N +++
Sbjct: 575 PGCWLKKGKNEIIV 588
>gi|418519416|ref|ZP_13085468.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB2388]
gi|410704860|gb|EKQ63339.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB2388]
Length = 613
Score = 149 bits (376), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 112/319 (35%), Positives = 152/319 (47%), Gaps = 23/319 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
V DGK L SG+IH+ R W + ++K++ GL +ETYVFWN EP +GQ+ F G
Sbjct: 39 FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 98
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ FV+ GL + LR GPYACAEW GG+P WL I+ R+ + F +
Sbjct: 99 HNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQA 158
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGE-LYVKWAADTAVNLN 188
+L + + + + L GGPII QVENEYG+ + AY +YVK D A+ L
Sbjct: 159 YLDALANQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LF 215
Query: 189 TSVPWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
TS M PD ++N G D P +P M E ++GWF +G
Sbjct: 216 TSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGKPH 275
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL----------VATSYDYD 293
A G + N YM+ GGT+FG G TSYDYD
Sbjct: 276 AATDARQQAEEFEWILRQGHS-ANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYD 334
Query: 294 APIDEYGFIRQPKWGHLRE 312
A +DE G PK+ +R+
Sbjct: 335 AILDEAGHP-TPKFALMRD 352
>gi|342162833|ref|YP_004767472.1| beta-galactosidase [Streptococcus pseudopneumoniae IS7493]
gi|341932715|gb|AEL09612.1| beta-galactosidase (Lactase) [Streptococcus pseudopneumoniae
IS7493]
Length = 595
Score = 149 bits (376), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 104/307 (33%), Positives = 150/307 (48%), Gaps = 25/307 (8%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK + SG+IHY R PE W + K G +ETYV WN HEP G+++FEG
Sbjct: 12 LDGKPFKILSGAIHYFRIPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFHFEGAL 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL RF++T Q+ GL+ +R P+ CAEW +GG P WL ++ R+++ + E + R+
Sbjct: 72 DLERFLQTAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKDMRIRSSDPAYIEAVGRYY 130
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
+++ + L GG I++ QVENEYG+ + AY ++ T +
Sbjct: 131 DQLLPRLVPHLL--DNGGNILMMQVENEYGSYGEDKAYLRAIRQLMEERGVTCPLFTSDG 188
Query: 192 PWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
PW + D + T N + F + P+M E + GWF +
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
+ R ++LA AV E G N YM+ GGTNFG G G L TSYDYD
Sbjct: 249 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYD 306
Query: 294 APIDEYG 300
A +DE G
Sbjct: 307 ALLDEEG 313
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 55/205 (26%), Positives = 83/205 (40%), Gaps = 57/205 (27%)
Query: 513 NEGINTLDILSMMVGLQNYGAWF--DVAGAGLFSVILIDLK---NGKRDLSSGEWIYQVG 567
+G++ LDIL +G NYG F D G+ + + DL N K Y +
Sbjct: 437 KKGLSRLDILIENMGRVNYGHKFLADTQRKGIRTGVCKDLHFLLNWKH--------YPLP 488
Query: 568 VEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAW 627
++ +KI + W QG +Y F E K L+L+ GKG A+
Sbjct: 489 LDNP----EKIDFSKG--WTQGQP-------AFYAYDFTVEEPKDTY-LDLSEFGKGVAF 534
Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
VNGQ++GR+W+ P +LY IP +++ G N
Sbjct: 535 VNGQNLGRFWNV----------------------------GPTLSLY-IPHSYLKEGANR 565
Query: 688 LVIHEELGGDPSKISLLTK-TGQHI 711
++I E G +I L K T +HI
Sbjct: 566 IIIFETEGQYKEEIHLTRKPTLKHI 590
>gi|417848939|ref|ZP_12494871.1| beta-galactosidase [Streptococcus mitis SK1080]
gi|339457687|gb|EGP70254.1| beta-galactosidase [Streptococcus mitis SK1080]
Length = 595
Score = 149 bits (376), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 104/307 (33%), Positives = 149/307 (48%), Gaps = 25/307 (8%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK + SG+IHY R PE W + K G +ETYV WN HEP G+++FEG
Sbjct: 12 LDGKSFKILSGAIHYFRVPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFHFEGAL 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL RF++T Q+ GL+ +R P+ CAEW +GG P WL ++ R+++ + E + R+
Sbjct: 72 DLERFLQTAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKDMRIRSSDPAYIEAVGRYY 130
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
++ + L GG I++ QVENEYG+ + AY ++ T +
Sbjct: 131 DQLFPRLVPHLL--DNGGNILMMQVENEYGSYGEDKAYLRAIRQLMEERGVTCPLFTSDG 188
Query: 192 PWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
PW + D + T N + F + P+M E + GWF +
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
+ R ++LA AV E G N YM+ GGTNFG G G L TSYDYD
Sbjct: 249 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYD 306
Query: 294 APIDEYG 300
A +DE G
Sbjct: 307 ALLDEEG 313
>gi|298384202|ref|ZP_06993762.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
gi|383123627|ref|ZP_09944306.1| hypothetical protein BSIG_3219 [Bacteroides sp. 1_1_6]
gi|251839745|gb|EES67828.1| hypothetical protein BSIG_3219 [Bacteroides sp. 1_1_6]
gi|298262481|gb|EFI05345.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
Length = 624
Score = 149 bits (376), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 106/338 (31%), Positives = 160/338 (47%), Gaps = 50/338 (14%)
Query: 16 GKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDL 75
G+ + SG +HY R + W ++ K GL + TYVFWN HE G++ F G +L
Sbjct: 35 GEEIPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94
Query: 76 VRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAK 135
+++ E G+ + LR GPY CAEW +GG+P WL IPG++ R N F + K+++ +
Sbjct: 95 AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNTEFLKYTKKYIDR 154
Query: 136 IIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV-PWV 194
+ + + +L ++GGPII+ Q ENE+G+ YV D + + S +
Sbjct: 155 LYEEVG--DLQCTKGGPIIMVQCENEFGS-----------YVSQRKDIPLEEHRSYNAKI 201
Query: 195 MCQQEDA--PDPIINTCNGFYCDGF-------TPNSPSK----------------PIMWT 229
Q DA P+ + + +G T N S P M
Sbjct: 202 KGQLADAGFTIPLFTSDGSWLFEGGCVAGALPTANGESDIANLKKVVNQYHGDKGPYMVA 261
Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-- 287
E YSGW +G P ++A + + +F N+YM GGTNFG T+G
Sbjct: 262 EFYSGWLSHWGEPFPQVSASEIARQTEAYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKR 320
Query: 288 ------TSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIK 318
TSYDYDAPI E G++ PK+ +R + K +K
Sbjct: 321 DIQPDLTSYDYDAPISEAGWL-TPKYDSIRSVIQKYVK 357
>gi|170034400|ref|XP_001845062.1| beta-galactosidase [Culex quinquefasciatus]
gi|167875695|gb|EDS39078.1| beta-galactosidase [Culex quinquefasciatus]
Length = 611
Score = 149 bits (375), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 115/394 (29%), Positives = 182/394 (46%), Gaps = 46/394 (11%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
++ Y+ ++DG+ SGS HY R+ P W ++R + GL + TY+ W+ HEP
Sbjct: 10 SIDYERDTFLLDGEPFRFISGSFHYFRALPGSWRHILRAMRAAGLNAVMTYIEWSTHEPT 69
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRTTN 122
G Y + DL +F++ +E L++ LR GPY CAE + GGFP W L P I+ RT +
Sbjct: 70 EGDYRWNEIADLEQFIRIAEEENLYVILRPGPYICAERDMGGFPYWLLTKFPNIKLRTQD 129
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA- 181
+ + E++++ + ++ +++ +GGP+I+ +ENEYG ++ + Y+K+
Sbjct: 130 SDYMREVQKWYSVLMPRIQK--YLYGRGGPVIMVSIENEYG----SFSACDKTYLKFLKN 183
Query: 182 --------DTAVNLNTSVPWVMCQQEDAPDPIINTCN-------GFYCDGFTPNSPSKPI 226
D + N + C + I+ T + Y P P+
Sbjct: 184 MTESYIQYDAVLFTNDGPEQLNCGRIPG---ILATLDFGSTGSPERYWQKLRKVQPKGPL 240
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTA----G 282
+ E Y GW + + R R G N+YM+FGGTNF TA G
Sbjct: 241 VNAEFYPGWLTHWMEPMA-RTATGPVVDTLRLMLNQGANVNFYMFFGGTNFAFTAGANDG 299
Query: 283 GP----LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKL-GAK 337
GP TSYDYDAP+DE G PK+ LR++ + E P QKL K
Sbjct: 300 GPGKFNTDITSYDYDAPLDEAG-DPTPKYFALRDV-----ILEYMPDPGVPVPQKLPKMK 353
Query: 338 LEAHIYHK----SSNDCAAFLANYDSSSDANVTF 367
L + +SN+ LA Y ++D ++F
Sbjct: 354 LPPVTLTQYGFLTSNEARQALAKYIFTNDRTLSF 387
>gi|387790696|ref|YP_006255761.1| beta-galactosidase [Solitalea canadensis DSM 3403]
gi|379653529|gb|AFD06585.1| beta-galactosidase [Solitalea canadensis DSM 3403]
Length = 790
Score = 149 bits (375), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 104/309 (33%), Positives = 150/309 (48%), Gaps = 28/309 (9%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++GK ++++G IH+PR E W I+ K G+ I Y+FWN+HE Q+ F G
Sbjct: 45 FLLNGKPFLIRAGEIHFPRIPREYWDHRIKLCKAMGMNTICIYLFWNFHEQKPDQFDFTG 104
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
+ D+ FVK VQ G++ +R GPYACAEW+ GG P WL P ++ RT + + M+R
Sbjct: 105 QKDVAAFVKLVQANGMYCIVRPGPYACAEWDMGGLPWWLLKKPDLKVRTLEDRY--FMER 162
Query: 132 FLAKIIDLMKQENLFASQ-GGPIILAQVENEYGNVEWAYGVGGELY------VKWAADTA 184
+ ++ KQ L Q GG II+ QVENEY A+G E +K A
Sbjct: 163 SAKYLKEVGKQLALLQIQNGGNIIMVQVENEYA----AFGNSAEYMDANRKNLKDAGFNK 218
Query: 185 VNLNTSVPWVMCQQEDAPDP----IINTCNGFYCD----GFTPNSPSKPIMWTENYSGWF 236
V L W DP +N G D GF P+ P+M +E ++GWF
Sbjct: 219 VQL-MRCDWSSTFNSYITDPEVAITLNFGAGSDVDKQFKGFQEKHPTAPLMCSEYWTGWF 277
Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYD 291
+G R + ++ + +F + YM GGT FG+ G + SYD
Sbjct: 278 DHWGRPHETRSINSFIGSLKDMMDRKISF-SLYMAHGGTTFGQWGGANSPPYSAMVASYD 336
Query: 292 YDAPIDEYG 300
Y+API E G
Sbjct: 337 YNAPIGEQG 345
Score = 43.9 bits (102), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 48/167 (28%), Positives = 70/167 (41%), Gaps = 23/167 (13%)
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG- 532
L I + A VF+N KLV G D +I + LDIL G N+G
Sbjct: 432 LIITEVHDWAQVFINGKLV----GKLDRRRADSTIEIPATKAGAVLDILVEATGRVNFGE 487
Query: 533 AWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGST 591
A D G +++ +G W +Y V+ ++ AN+ F KQ
Sbjct: 488 AVIDRKGI----TEKVEISDGSTVQELKNWTVYNFPVDYQF-------QANAKFVKQKVN 536
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
P WY+ F + G ++L++ GKG WVNG +IGR+W
Sbjct: 537 GPA-----WYRAKFNLNQ-TGDTYIDLSTWGKGMIWVNGYNIGRFWK 577
>gi|313238883|emb|CBY13879.1| unnamed protein product [Oikopleura dioica]
Length = 601
Score = 149 bits (375), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 105/327 (32%), Positives = 153/327 (46%), Gaps = 36/327 (11%)
Query: 13 VIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGR 72
+ K R+L SGS+HY R E W + + K K GL ++TY+ WN HEP G + FE
Sbjct: 12 LFKSKTRIL-SGSLHYFRVPKEYWRDRLEKLKGAGLNTVQTYIGWNLHEPREGDFIFEDE 70
Query: 73 FDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN-PFKEEMKR 131
D+ F+K ++ GL++ +R GPY CAEW +GGFP WL + R T + + ++
Sbjct: 71 LDVSEFLKIAKDVGLYVIMRPGPYICAEWEWGGFPAWLLTKENMIVRQTKSEAYLAAVQN 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + ++ S+GGPII QVENEY +Y E Y+ W + ++
Sbjct: 131 WFTVLFSQLRDHQW--SRGGPIISIQVENEYA----SYNKDSE-YLPWVKNLLTDVGKCF 183
Query: 192 PWVMCQQED--------APDPII-----NTCNGF-YCDGFTPNSPSKPIMWTENYSGWFL 237
+ + + PD + + N F D PN +P M TE ++GWF
Sbjct: 184 LLKIINETNFFLKGAHLLPDTFLTANFQSVGNAFEVLDKLQPN---RPKMVTEFWAGWFD 240
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG---------GPLVAT 288
+G R G+ N YM+ GGT+FG AG G T
Sbjct: 241 HWGQQGHSLLSPTTFNKTMREILNAGSSVNQYMFHGGTSFGWMAGSNWLSKKQRGTSDTT 300
Query: 289 SYDYDAPIDEYGFIRQPKWGHLRELHK 315
SYDYDAP+ E G + + KW RE+ K
Sbjct: 301 SYDYDAPLSESGDLTE-KWNVTREIIK 326
>gi|256396208|ref|YP_003117772.1| beta-galactosidase [Catenulispora acidiphila DSM 44928]
gi|256362434|gb|ACU75931.1| Beta-galactosidase [Catenulispora acidiphila DSM 44928]
Length = 625
Score = 149 bits (375), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 102/335 (30%), Positives = 160/335 (47%), Gaps = 40/335 (11%)
Query: 10 RALVIDGKRRV-------LQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
R L IDG R + + S +IHY R P++W + +++ + G +E Y+ WN+H+P
Sbjct: 5 RVLTIDGGRFLRGGREHRIVSAAIHYFRIHPDLWRDRLQRLRAMGCNTVECYIAWNFHQP 64
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
F+G D+ FV+ E G + R GPY CAEW++GG P WL ++ RTT+
Sbjct: 65 TPAAPRFDGWRDVAGFVRLAGELGFDVIARPGPYICAEWDFGGLPAWLLADENVRLRTTD 124
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGEL------- 175
+ + + ++I ++ + L A++GGP++ Q+ENEYG+ +G +
Sbjct: 125 PVYLAAVDAWFDELIPVLAE--LQATRGGPVVAVQIENEYGS----FGADPDYLDHLRKG 178
Query: 176 YVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD----GFTPNSPSKPIMWTEN 231
++ DT + + +M PD + G D P P + E
Sbjct: 179 LIERGVDTLLFTSDGPQELMLAGGTVPDVLATVNFGSRADEAFATLRRVRPDDPPVCMEF 238
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------- 283
++GWF FG R +D A ++ GG+ N+YM GGTNFG AG
Sbjct: 239 WNGWFDHFGEPHHTRSAQDAARSLDEILAAGGSV-NFYMGHGGTNFGFWAGANHSGVGTG 297
Query: 284 -----PLVATSYDYDAPIDEYGFIRQPKWGHLREL 313
P + TSYDYDAP+ E G + PK+ RE+
Sbjct: 298 DPGYQPTI-TSYDYDAPVGEAGEL-TPKFHLFREV 330
>gi|319900291|ref|YP_004160019.1| Beta-galactosidase [Bacteroides helcogenes P 36-108]
gi|319415322|gb|ADV42433.1| Beta-galactosidase [Bacteroides helcogenes P 36-108]
Length = 629
Score = 149 bits (375), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 101/336 (30%), Positives = 159/336 (47%), Gaps = 49/336 (14%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
++GK+ + SG +HY R + W ++ K GL + TYVFWN+HE G++ F G
Sbjct: 38 LNGKQTPILSGEMHYARIPHQYWRHRLQMMKGMGLNAVATYVFWNHHETEPGKWDFTGDK 97
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
+L ++KT E G+ + LR GPY CAEW +GG+P WL +PG++ R N F + + ++
Sbjct: 98 NLAEYIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVPGMEIRRDNPQFLKHTEAYI 157
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
++ + +L ++GGPI++ Q ENE+G+ YV D + + +
Sbjct: 158 QRLYKEVG--HLQCTKGGPIVMVQCENEFGS-----------YVAQRKDITLQEHRAYNA 204
Query: 194 VMCQQ--EDAPDPIINTCNGFY------CDGFTPNSPSK------------------PIM 227
+ QQ + D + T +G + +G P + + P M
Sbjct: 205 KIKQQLADAGFDVPLFTSDGSWLFEGGSTEGALPTANGETDIANLKKVVNQYHGGQGPYM 264
Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA 287
E Y GW + P +A + + +F N YM GGTNFG T+G
Sbjct: 265 VAEFYPGWLSHWAEPFPQVSASSVARTTESYLKNDVSF-NVYMVHGGTNFGFTSGANYDK 323
Query: 288 --------TSYDYDAPIDEYGFIRQPKWGHLRELHK 315
TSYDYDAPI E G++ PK+ +R + K
Sbjct: 324 KRDIQPDLTSYDYDAPISEAGWV-TPKYDSIRAVIK 358
>gi|315500613|ref|YP_004089415.1| glycoside hydrolase family 35 [Asticcacaulis excentricus CB 48]
gi|315418625|gb|ADU15264.1| glycoside hydrolase family 35 [Asticcacaulis excentricus CB 48]
Length = 785
Score = 149 bits (375), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 116/367 (31%), Positives = 169/367 (46%), Gaps = 37/367 (10%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DG+ ++ G +H+PR E WP ++ K GL + Y+FWNYHE GQ+ +EG
Sbjct: 42 FLLDGRPIQIRCGEMHFPRVPREYWPHRLKMIKAMGLNAVCAYLFWNYHEWNEGQFDWEG 101
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQF-RTTNNPFKEEMK 130
+ D F + Q+ GL++ LR GPYACAEW GG P WL G F RT F
Sbjct: 102 QRDAAAFCRMAQKEGLWVILRPGPYACAEWEMGGLPWWLLKAEGDAFLRTRAEAFTGPAH 161
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYG----NVEWAYGV-------GGELYVKW 179
R++ ++ + L ++GGPI++ QVENEYG ++E+ G+ G ++ +
Sbjct: 162 RWIEEVGRHLGP--LQVTKGGPILMVQVENEYGFFGNDLEYLQGMRKAVEQAGFDVPLFQ 219
Query: 180 AADTAVNLNTSVPWVMCQQEDAPDPI--INTCNGFYCDGFTPNSPSKPIMWTENYSGWFL 237
T V T +P ++ DP NT P+M E YSGWF
Sbjct: 220 CNPTHVVAKTHIPELLSVANFGNDPETGFNTLRAVQ---------RAPLMCGEYYSGWFD 270
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG--GPLV--ATSYDYD 293
+G V+ + + G+F + YM GGT+FG G P TSYDYD
Sbjct: 271 VWGAGHRTGGVQSSVADIKWMLQQNGSF-SLYMAHGGTSFGLWGGCDRPFQPDTTSYDYD 329
Query: 294 APIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAF 353
API E G I + K+ R + E L + P + + S +CA
Sbjct: 330 APISEAGRIGE-KFEAYRSAMRPFLKAGERLPAPPPQKDTMA------LAPFSLEECAPV 382
Query: 354 LANYDSS 360
A Y S+
Sbjct: 383 SAGYTSN 389
>gi|410972395|ref|XP_003992645.1| PREDICTED: beta-galactosidase-1-like protein 3 [Felis catus]
Length = 664
Score = 149 bits (375), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 111/324 (34%), Positives = 160/324 (49%), Gaps = 29/324 (8%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+ G + ++ GSIHY R E W + + K K G + TYV WN HEP RG++ F G
Sbjct: 91 FTLGGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTLTTYVPWNLHEPQRGKFDFSG 150
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL FV E GL++ LR GPY C+E + GG P WL P + RTT F E + +
Sbjct: 151 NLDLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPKMILRTTYKGFVEAVNK 210
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ +I + L + GPII QVENEYG+ +A Y++ A L +
Sbjct: 211 YFDHLIS--RVVPLQYRKRGPIIAVQVENEYGS--FAEDKDYMPYIQKAL-----LERGI 261
Query: 192 PWVMCQQEDAP-------DPIINTCN--GFYCDGFTPNSP---SKPIMWTENYSGWFLSF 239
++ +DA + ++ T N F + F S +KPIM E + GWF ++
Sbjct: 262 VELLMTSDDAKHMLKGYIEGVLATINMNTFQINDFKQLSQVQRNKPIMVMEFWVGWFDTW 321
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSYDYD 293
G + ED+ V++F + +F N YM+ GGTNFG G V TSYDYD
Sbjct: 322 GGKHMIKNAEDVEDTVSKFITSEISF-NVYMFHGGTNFGFMNGATYFGKHRGVVTSYDYD 380
Query: 294 APIDEYGFIRQPKWGHLRELHKAI 317
A + E G + K+ LR+L ++
Sbjct: 381 AVLTEAGDYTE-KYFKLRKLFGSV 403
>gi|422694237|ref|ZP_16752232.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
gi|315148319|gb|EFT92335.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
Length = 593
Score = 149 bits (375), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 112/352 (31%), Positives = 165/352 (46%), Gaps = 44/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++ FV+ ++ L + LR Y CAEW +GG P WL G++ R+T+ F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ QVENEYG +YG+ + Y++ L V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
P + + A + +++ D F T + P+M E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
+ GWF +G V R DLA V G N YM+ GGTNFG R
Sbjct: 242 WDGWFNRWGEPVIHREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGEKDL 299
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
TSYDYDA + E G + + + KAIK +C E + + P +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346
>gi|60683116|ref|YP_213260.1| glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
gi|60494550|emb|CAH09349.1| putative glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
Length = 769
Score = 149 bits (375), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 107/334 (32%), Positives = 154/334 (46%), Gaps = 31/334 (9%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ N T +++GK +++ +HY R W I K G+ I YVFWN HE
Sbjct: 18 AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
GQ+ F G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL I RT
Sbjct: 78 QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTL 137
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
+ F E F+ ++ + L ++GG II+ QVENEYG AY V + YV
Sbjct: 138 DPYFMERTAIFMKEVGKQLAP--LQITRGGNIIMVQVENEYG----AYAV-DKPYVSAIR 190
Query: 182 DTAVNLN-TSVPWVMCQ-----QEDAPDPIINTCNGFYCDG---------FTPNSPSKPI 226
D + T VP C + D ++ T N + G P P+
Sbjct: 191 DIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTIN--FGTGANIEQQFKRLKEARPETPL 248
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
M +E +SGWF +G RP + + + + +F + YM GGT FG G
Sbjct: 249 MCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNP 307
Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
+ +SYDYDAPI E G+ K+ LR+L K
Sbjct: 308 SYSAMCSSYDYDAPISEPGWTTD-KYFQLRDLLK 340
Score = 43.9 bits (102), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 67/162 (41%), Gaps = 46/162 (28%)
Query: 548 IDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGS--TLPVNKSLIWYKTT 604
++L GK+ W +Y V+ S +K G+ T+P +Y+TT
Sbjct: 481 VELVRGKQSAELKNWTVYSFPVD--------YSFVQDKRYKNGTAQTMPA-----YYRTT 527
Query: 605 FLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQK 664
F + G L++++ GKG WVNG +IGR+W
Sbjct: 528 FRL-DKVGDTFLDMSTWGKGMVWVNGLAIGRFWEI------------------------- 561
Query: 665 HCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
P QTL+ +P W+ GEN +++ + G + + I L K
Sbjct: 562 ---GPQQTLF-MPGCWLKEGENEIIVLDLKGPEKASIRGLKK 599
>gi|375359947|ref|YP_005112719.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
gi|301164628|emb|CBW24187.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
Length = 769
Score = 149 bits (375), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 107/334 (32%), Positives = 154/334 (46%), Gaps = 31/334 (9%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ N T +++GK +++ +HY R W I K G+ I YVFWN HE
Sbjct: 18 AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
GQ+ F G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL I RT
Sbjct: 78 QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTL 137
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
+ F E F+ ++ + L ++GG II+ QVENEYG AY V + YV
Sbjct: 138 DPYFMERTAIFMKEVGKQLAP--LQITRGGNIIMVQVENEYG----AYAV-DKPYVSAIR 190
Query: 182 DTAVNLN-TSVPWVMCQ-----QEDAPDPIINTCNGFYCDG---------FTPNSPSKPI 226
D + T VP C + D ++ T N + G P P+
Sbjct: 191 DIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTIN--FGTGANIEQQFKRLKEARPETPL 248
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
M +E +SGWF +G RP + + + + +F + YM GGT FG G
Sbjct: 249 MCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNP 307
Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
+ +SYDYDAPI E G+ K+ LR+L K
Sbjct: 308 SYSAMCSSYDYDAPISEPGWTTD-KYFQLRDLLK 340
Score = 43.9 bits (102), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 67/162 (41%), Gaps = 46/162 (28%)
Query: 548 IDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGS--TLPVNKSLIWYKTT 604
++L GK+ W +Y V+ S +K G+ T+P +Y+TT
Sbjct: 481 VELVRGKQSAELKNWTVYSFPVD--------YSFVQDKRYKNGTAQTMPA-----YYRTT 527
Query: 605 FLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQK 664
F + G L++++ GKG WVNG +IGR+W
Sbjct: 528 FRL-DKVGDTFLDMSTWGKGMVWVNGLAIGRFWEI------------------------- 561
Query: 665 HCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
P QTL+ +P W+ GEN +++ + G + + I L K
Sbjct: 562 ---GPQQTLF-MPGCWLKEGENEIIVLDLKGPEKASIRGLKK 599
>gi|265767009|ref|ZP_06094838.1| beta-galactosidase [Bacteroides sp. 2_1_16]
gi|263253386|gb|EEZ24862.1| beta-galactosidase [Bacteroides sp. 2_1_16]
Length = 769
Score = 149 bits (375), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 107/334 (32%), Positives = 154/334 (46%), Gaps = 31/334 (9%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ N T +++GK +++ +HY R W I K G+ I YVFWN HE
Sbjct: 18 AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
GQ+ F G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL I RT
Sbjct: 78 QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTL 137
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
+ F E F+ ++ + L ++GG II+ QVENEYG AY V + YV
Sbjct: 138 DPYFMERTAIFMKEVGKQLAP--LQITRGGNIIMVQVENEYG----AYAV-DKPYVSAIR 190
Query: 182 DTAVNLN-TSVPWVMCQ-----QEDAPDPIINTCNGFYCDG---------FTPNSPSKPI 226
D + T VP C + D ++ T N + G P P+
Sbjct: 191 DIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTIN--FGTGANIEQQFKRLKEARPETPL 248
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
M +E +SGWF +G RP + + + + +F + YM GGT FG G
Sbjct: 249 MCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNP 307
Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
+ +SYDYDAPI E G+ K+ LR+L K
Sbjct: 308 SYSAMCSSYDYDAPISEPGWTTD-KYFQLRDLLK 340
Score = 43.9 bits (102), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 67/162 (41%), Gaps = 46/162 (28%)
Query: 548 IDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGS--TLPVNKSLIWYKTT 604
++L GK+ W +Y V+ S +K G+ T+P +Y+TT
Sbjct: 481 VELVRGKQSAELKNWTVYSFPVD--------YSFVQDKRYKNGTAQTMPA-----YYRTT 527
Query: 605 FLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQK 664
F + G L++++ GKG WVNG +IGR+W
Sbjct: 528 FRL-DKVGDTFLDMSTWGKGMVWVNGLAIGRFWEI------------------------- 561
Query: 665 HCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
P QTL+ +P W+ GEN +++ + G + + I L K
Sbjct: 562 ---GPQQTLF-MPGCWLKEGENEIIVLDLKGPEKASIRGLKK 599
>gi|257866484|ref|ZP_05646137.1| glycosyl hydrolase [Enterococcus casseliflavus EC30]
gi|257873001|ref|ZP_05652654.1| glycosyl hydrolase [Enterococcus casseliflavus EC10]
gi|257800442|gb|EEV29470.1| glycosyl hydrolase [Enterococcus casseliflavus EC30]
gi|257807165|gb|EEV35987.1| glycosyl hydrolase [Enterococcus casseliflavus EC10]
Length = 591
Score = 149 bits (375), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 117/352 (33%), Positives = 168/352 (47%), Gaps = 43/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DGK L SG+IHY R T W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 10 FLLDGKPIKLISGAIHYFRMTSAQWADSLYNLKALGANTVETYIPWNLHEPREGVYDFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ FVK Q GL + LR Y CAEW +GG P WL P ++ R+T+ F +++
Sbjct: 70 MKDIFAFVKQAQALGLMVILRPSVYICAEWEFGGLPAWLLNEP-MRLRSTDPRFMAKVRN 128
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L + GGP+I+ QVENEYG +YG+ + Y++ + V
Sbjct: 129 YFQVL--LPKLVPLQITHGGPVIMMQVENEYG----SYGM-EKAYLRQTKELMEECGIDV 181
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF-TPNSPSK-------------------PIMWTEN 231
P + + A + +++ D F T N S+ PIM E
Sbjct: 182 P--LFTSDGAWEEVLDAGTLIEDDVFVTGNFGSRSKENAAVMKEFMAKHGKNWPIMCMEY 239
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL-- 285
+ GWF +G + R +DLA V G N YM+ GGTNFG + G G L
Sbjct: 240 WDGWFNRWGEPIIKRDGQDLANEVKEMLAVGSL--NLYMFHGGTNFGFSNGCSARGALDL 297
Query: 286 -VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGA 336
+SYDYDA + E G +P ++ KAIK + ++P ++L A
Sbjct: 298 PQVSSYDYDALLTEAG---EPT-DKYYQVQKAIKEACPEVWQANPRTKQLAA 345
>gi|225872227|ref|YP_002753682.1| glycosyl hydrolase [Acidobacterium capsulatum ATCC 51196]
gi|225791474|gb|ACO31564.1| glycosyl hydrolase, family 35 [Acidobacterium capsulatum ATCC
51196]
Length = 664
Score = 148 bits (374), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 106/305 (34%), Positives = 149/305 (48%), Gaps = 19/305 (6%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
V+DG+ + SG +HY R W ++ +K GL I TYVFWN HEP G++ F G
Sbjct: 37 FVLDGQPFQIISGEMHYERIPRAYWKARLQMAKAMGLNTIATYVFWNLHEPEPGKFDFSG 96
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFR-TTNNPFKEEMK 130
DL +F++ Q+ GL + LR GPY+CAEW +GGFP WL P +Q +N+P E MK
Sbjct: 97 NADLAQFIRDAQQTGLKVLLRAGPYSCAEWEFGGFPAWLMKNPKMQTALRSNDP--EFMK 154
Query: 131 RFLAKIIDLMKQ-ENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNL 187
I+ L ++ L GGPII Q+ENEYG+ + AY + A T L
Sbjct: 155 PAEQWILRLGREVAPLQVGYGGPIIGVQIENEYGDFGGDAAYLEHLKKIFLKAGFTQSLL 214
Query: 188 NTSVPWVMCQQEDAPD--PIINTCNGFYC---DGFTPNSPSKPIMWTENYSGWFLSFGYA 242
T+ P + P +N G D +P++ +E ++GWF +G
Sbjct: 215 YTANPSRALVRGSIPGVYSAVNFAPGHAAQALDSLAQLRAGQPLLSSEYWTGWFDHWGEP 274
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV-------ATSYDYDAP 295
+P+ L + G N YM+ GGT+FG +G TSYDY AP
Sbjct: 275 HQSKPLS-LQVKDFNYILRHGAGVNLYMFHGGTSFGMMSGSSWTKHQFLPDVTSYDYGAP 333
Query: 296 IDEYG 300
+DE G
Sbjct: 334 LDEAG 338
>gi|219847209|ref|YP_002461642.1| beta-galactosidase [Chloroflexus aggregans DSM 9485]
gi|219541468|gb|ACL23206.1| Beta-galactosidase [Chloroflexus aggregans DSM 9485]
Length = 898
Score = 148 bits (374), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 100/315 (31%), Positives = 151/315 (47%), Gaps = 17/315 (5%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
++A V + + +D + L SG IHY R W L+ +++ GL I+T + WN H
Sbjct: 1 MNATVRVGRQGIELDSRPFYLLSGCIHYFRWPRAEWRPLLEQARWAGLNTIDTVIPWNRH 60
Query: 61 EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
EP G + F DL F+ + GL + +R GPY CAEW GG P WL ++ RT
Sbjct: 61 EPQPGVFDFADEADLGAFLDLCHDLGLKVIVRPGPYICAEWENGGLPAWLTANGDLRLRT 120
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGV-GGELYVKW 179
+ F + R+ ++ ++ ++GGPIIL Q+ENE+ WA GV G + + +
Sbjct: 121 NDPVFLSAVLRWFDTLMPILVPRQ--HTRGGPIILCQIENEH----WASGVYGADEHQQT 174
Query: 180 AADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNS---PSKPIMWTENYSGWF 236
A A VP C P + P P++ +E +SGWF
Sbjct: 175 LARAAFERGIEVPQYTCMGATPGYPEFRNGWSGIAEKLVQTRQLWPDNPLIVSELWSGWF 234
Query: 237 LSF-GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNF----GRTAGGPLV--ATS 289
++ G+ + L + + G +++M+ GGTNF GRT GG L+ T
Sbjct: 235 DNWGGHRQTRKSAAKLDMILHQLTAVGCAGFSHWMWAGGTNFGYWGGRTVGGDLIHMTTG 294
Query: 290 YDYDAPIDEYGFIRQ 304
YDYDAPIDEYG + +
Sbjct: 295 YDYDAPIDEYGRLTE 309
>gi|53715181|ref|YP_101173.1| beta-galactosidase [Bacteroides fragilis YCH46]
gi|52218046|dbj|BAD50639.1| beta-galactosidase precursor [Bacteroides fragilis YCH46]
Length = 769
Score = 148 bits (374), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 107/334 (32%), Positives = 154/334 (46%), Gaps = 31/334 (9%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ N T +++GK +++ +HY R W I K G+ I YVFWN HE
Sbjct: 18 AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
GQ+ F G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL I RT
Sbjct: 78 QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTL 137
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
+ F E F+ ++ + L ++GG II+ QVENEYG AY V + YV
Sbjct: 138 DPYFMERTAIFMKEVGKQLAP--LQITRGGNIIMVQVENEYG----AYAV-DKPYVSAIR 190
Query: 182 DTAVNLN-TSVPWVMCQ-----QEDAPDPIINTCNGFYCDG---------FTPNSPSKPI 226
D + T VP C + D ++ T N + G P P+
Sbjct: 191 DIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTIN--FGTGANIEQQFKRLREARPETPL 248
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
M +E +SGWF +G RP + + + + +F + YM GGT FG G
Sbjct: 249 MCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNP 307
Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
+ +SYDYDAPI E G+ K+ LR+L K
Sbjct: 308 SYSAMCSSYDYDAPISEPGWTTD-KYFQLRDLLK 340
Score = 43.9 bits (102), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 67/162 (41%), Gaps = 46/162 (28%)
Query: 548 IDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGS--TLPVNKSLIWYKTT 604
++L GK+ W +Y V+ S +K G+ T+P +Y+TT
Sbjct: 481 VELVRGKQSAELKNWTVYSFPVD--------YSFVQDKRYKNGTAQTMPA-----YYRTT 527
Query: 605 FLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQK 664
F + G L++++ GKG WVNG +IGR+W
Sbjct: 528 FRL-DKVGDTFLDMSTWGKGMVWVNGLAIGRFWEI------------------------- 561
Query: 665 HCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
P QTL+ +P W+ GEN +++ + G + + I L K
Sbjct: 562 ---GPQQTLF-MPGCWLKEGENEIIVLDLKGPEKASIRGLKK 599
>gi|423285593|ref|ZP_17264475.1| hypothetical protein HMPREF1204_04013 [Bacteroides fragilis HMW
615]
gi|404579108|gb|EKA83826.1| hypothetical protein HMPREF1204_04013 [Bacteroides fragilis HMW
615]
Length = 769
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 107/334 (32%), Positives = 154/334 (46%), Gaps = 31/334 (9%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ N T +++GK +++ +HY R W I K G+ I YVFWN HE
Sbjct: 18 AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
GQ+ F G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL I RT
Sbjct: 78 QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTL 137
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
+ F E F+ ++ + L ++GG II+ QVENEYG AY V + YV
Sbjct: 138 DPYFMERTAIFMKEVGKQLAP--LQITRGGNIIMVQVENEYG----AYAV-DKPYVSAIR 190
Query: 182 DTAVNLN-TSVPWVMCQ-----QEDAPDPIINTCNGFYCDG---------FTPNSPSKPI 226
D + T VP C + D ++ T N + G P P+
Sbjct: 191 DIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTIN--FGTGANIEQQFKRLREARPETPL 248
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
M +E +SGWF +G RP + + + + +F + YM GGT FG G
Sbjct: 249 MCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNP 307
Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
+ +SYDYDAPI E G+ K+ LR+L K
Sbjct: 308 SYSAMCSSYDYDAPISEPGWTTD-KYFQLRDLLK 340
Score = 43.9 bits (102), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 58/237 (24%), Positives = 96/237 (40%), Gaps = 55/237 (23%)
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG- 532
+ I + A VF + KL+A + F + + L +G +DIL +G N+
Sbjct: 414 MKITEVHDWAQVFADGKLLA--RLDRRRGEFALQLPV-LKKGTR-IDILVEAMGRVNFDE 469
Query: 533 AWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGS- 590
+ D G ++L GK+ W +Y V+ S +K G+
Sbjct: 470 SIHDRKGI----TEKVELVRGKQSAELKNWTVYSFPVD--------YSFVQDKRYKNGTA 517
Query: 591 -TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
T+P +Y+TTF + G L++++ GKG WVNG +IGR+W
Sbjct: 518 QTMPA-----YYRTTFRL-DKVGDTFLDMSTWGKGMVWVNGLAIGRFWEI---------- 561
Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
P QTL+ +P W+ GEN +++ + G + + I L K
Sbjct: 562 ------------------GPQQTLF-MPGCWLKEGENEIIVLDLKGPEKASIRGLKK 599
>gi|423260608|ref|ZP_17241530.1| hypothetical protein HMPREF1055_03807 [Bacteroides fragilis
CL07T00C01]
gi|423266742|ref|ZP_17245744.1| hypothetical protein HMPREF1056_03431 [Bacteroides fragilis
CL07T12C05]
gi|387775162|gb|EIK37271.1| hypothetical protein HMPREF1055_03807 [Bacteroides fragilis
CL07T00C01]
gi|392699974|gb|EIY93143.1| hypothetical protein HMPREF1056_03431 [Bacteroides fragilis
CL07T12C05]
Length = 769
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 107/334 (32%), Positives = 154/334 (46%), Gaps = 31/334 (9%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ N T +++GK +++ +HY R W I K G+ I YVFWN HE
Sbjct: 18 AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
GQ+ F G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL I RT
Sbjct: 78 QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTL 137
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
+ F E F+ ++ + L ++GG II+ QVENEYG AY V + YV
Sbjct: 138 DPYFMERTAIFMKEVGKQLAP--LQITRGGNIIMVQVENEYG----AYAV-DKPYVSAIR 190
Query: 182 DTAVNLN-TSVPWVMCQ-----QEDAPDPIINTCNGFYCDG---------FTPNSPSKPI 226
D + T VP C + D ++ T N + G P P+
Sbjct: 191 DIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTIN--FGTGANIEQQFKRLREARPETPL 248
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
M +E +SGWF +G RP + + + + +F + YM GGT FG G
Sbjct: 249 MCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNP 307
Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
+ +SYDYDAPI E G+ K+ LR+L K
Sbjct: 308 SYSAMCSSYDYDAPISEPGWTTD-KYFQLRDLLK 340
Score = 43.9 bits (102), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 58/237 (24%), Positives = 96/237 (40%), Gaps = 55/237 (23%)
Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG- 532
+ I + A VF + KL+A + F + + L +G +DIL +G N+
Sbjct: 414 MKITEVHDWAQVFADGKLLA--RLDRRRGEFALQLPV-LKKGTR-IDILVEAMGRVNFDE 469
Query: 533 AWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGS- 590
+ D G ++L GK+ W +Y V+ S +K G+
Sbjct: 470 SIHDRKGI----TEKVELVRGKQSAELKNWTVYSFPVD--------YSFVQDKRYKNGTA 517
Query: 591 -TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
T+P +Y+TTF + G L++++ GKG WVNG +IGR+W
Sbjct: 518 QTMPA-----YYRTTFRL-DKVGDTFLDMSTWGKGMVWVNGLAIGRFWEI---------- 561
Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
P QTL+ +P W+ GEN +++ + G + + I L K
Sbjct: 562 ------------------GPQQTLF-MPGCWLKEGENEIIVLDLKGPEKASIRGLKK 599
>gi|431919435|gb|ELK17954.1| Beta-galactosidase [Pteropus alecto]
Length = 675
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 111/344 (32%), Positives = 154/344 (44%), Gaps = 26/344 (7%)
Query: 5 VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
+ Y+H + DG+ SGSIHY R W + + K K GL I+ YV WN+HEP
Sbjct: 54 IDYNHNRFLKDGQPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQVYVPWNFHEPQP 113
Query: 65 GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
GQY F D+ F++ E L + LR GPY CAEW GG P WL GI R+++
Sbjct: 114 GQYQFSEDHDVEHFIQLAHELTLLVILRPGPYICAEWEMGGLPAWLLQKEGIILRSSDPD 173
Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT- 183
+ E + ++L I+ MK GGPII QVENEYG +Y Y+++ +
Sbjct: 174 YLEAVDKWLGVILPKMKP--FLYQNGGPIITVQVENEYG----SYFTCDYDYLRFLQKSF 227
Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD-GFTPNS-------------PSKPIMWT 229
+L V P T G Y F P + P P++ +
Sbjct: 228 RYHLGNDVILFTTDGVYKDLPHCGTLQGLYSTVDFGPGANITDAFLLQRKYEPKGPLINS 287
Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA 287
E Y+GW +G E + ++ G N YM+ GGTNF G P A
Sbjct: 288 EFYTGWLDHWGQPHSTVTTEAVVSSLHDILAHGANV-NLYMFIGGTNFAYWNGANIPYQA 346
Query: 288 --TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
TSYDYDAP+ E G + + + + K K+ E + S P
Sbjct: 347 QPTSYDYDAPLSEAGDLTKKYFAVRDVIQKFQKVPEGPIPPSTP 390
>gi|198475912|ref|XP_002132214.1| GA25341 [Drosophila pseudoobscura pseudoobscura]
gi|198137462|gb|EDY69616.1| GA25341 [Drosophila pseudoobscura pseudoobscura]
Length = 672
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 105/328 (32%), Positives = 158/328 (48%), Gaps = 45/328 (13%)
Query: 6 TYDHRA--LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
T DH + V++G+ +GS HY R+ PE W +R + GL ++TYV W+ H P
Sbjct: 48 TIDHESNSFVLNGEPFRYVAGSFHYFRAVPEAWRSRLRTMRASGLNAVDTYVEWSLHNPH 107
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWL-HFIPGIQFRTTN 122
G Y +EG D+V+F++ QE ++ LR GPY CAE + GG P WL P I+ RT++
Sbjct: 108 DGVYNWEGIADVVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFKKYPSIKMRTSD 167
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
+ + E+ ++ A+++ + ++L GG II+ QVENEYG+ E + Y+ W D
Sbjct: 168 SNYMAEVGKWYAELMP--RLQHLLIGNGGKIIMVQVENEYGDYEC-----DKDYLNWLRD 220
Query: 183 --------TAVNLNTSVP--WVMCQQEDAPDPIINTCNGFYCDG----------FTPNSP 222
A+ T +P + C + D + F D P
Sbjct: 221 ETEKYVNGNALLFTTDIPNERMSCGKIDN----VFATTDFGIDRIHEIDDIWAMLRKLQP 276
Query: 223 SKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG 282
+ P++ +E Y GW + R + +A A+ + N YM+FGGTNFG TAG
Sbjct: 277 TGPLVNSEFYPGWLTHWQEMNQRRDGQVVADALKTILSYNASV-NLYMFFGGTNFGFTAG 335
Query: 283 G----------PLVATSYDYDAPIDEYG 300
TSYDYDA +DE G
Sbjct: 336 ANYNLDGGVGYAADITSYDYDAVMDEAG 363
>gi|78048770|ref|YP_364945.1| beta-galactosidase [Xanthomonas campestris pv. vesicatoria str.
85-10]
gi|78037200|emb|CAJ24945.1| beta-galactosidase [Xanthomonas campestris pv. vesicatoria str.
85-10]
Length = 650
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 115/320 (35%), Positives = 152/320 (47%), Gaps = 25/320 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
V DGK L SG+IH+ R W + ++K++ GL +ETYVFWN EP +GQ+ F G
Sbjct: 76 FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 135
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ FV+ GL + LR GPYACAEW GG+P WL I+ R+ + F +
Sbjct: 136 NNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQS 195
Query: 132 FLAKIIDLMKQ-ENLFASQGGPIILAQVENEYGNV--EWAYGVGGE-LYVKWAADTAVNL 187
+L L KQ + L GGPII QVENEYG+ + AY +YVK D A+ L
Sbjct: 196 YLDA---LAKQVQPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-L 251
Query: 188 NTSVPWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
TS M PD ++N G D P +P M E ++GWF +G
Sbjct: 252 FTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGKP 311
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL----------VATSYDY 292
A G + N YM+ GGT+FG G TSYDY
Sbjct: 312 HAATDARQQAEEFEWILRQGHS-ANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDY 370
Query: 293 DAPIDEYGFIRQPKWGHLRE 312
DA +DE G PK+ +R+
Sbjct: 371 DAILDEAGHP-TPKFALMRD 389
>gi|195146534|ref|XP_002014239.1| GL19091 [Drosophila persimilis]
gi|194106192|gb|EDW28235.1| GL19091 [Drosophila persimilis]
Length = 672
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 107/328 (32%), Positives = 159/328 (48%), Gaps = 45/328 (13%)
Query: 6 TYDHRA--LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
T DH + V++G+ +GS HY R+ PE W +R + GL ++TYV W+ H P
Sbjct: 48 TIDHESNSFVLNGEPFRYVAGSFHYFRAVPEAWRSRLRTMRASGLNAVDTYVEWSLHNPH 107
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWL-HFIPGIQFRTTN 122
G Y +EG D+V+F++ QE ++ LR GPY CAE + GG P WL P I+ RT++
Sbjct: 108 DGVYNWEGIADVVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFKKYPSIKMRTSD 167
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
+ + E+ ++ A+++ + ++L GG II+ QVENEYG+ E + Y+ W D
Sbjct: 168 SNYMAEVGKWYAELMP--RLQHLLIGNGGKIIMVQVENEYGDYEC-----DKDYLNWLRD 220
Query: 183 TA---VNLN-----TSVP--WVMCQQEDAPDPIINTCNGFYCDG----------FTPNSP 222
VN N T +P + C + D + F D P
Sbjct: 221 ETEKYVNRNALLFTTDIPNERMSCGKIDN----VFATTDFGIDRIHEIDDIWTMLRKLQP 276
Query: 223 SKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG 282
+ P++ +E Y GW + R + +A A+ + N YM+FGGTNFG TAG
Sbjct: 277 TGPLVNSEFYPGWLTHWQEMNQRRDGQVVADALKTILSYNASV-NLYMFFGGTNFGFTAG 335
Query: 283 G----------PLVATSYDYDAPIDEYG 300
TSYDYDA +DE G
Sbjct: 336 ANYNLDGGIGYAADITSYDYDAVMDEAG 363
>gi|16611713|gb|AAL27306.1|AF376481_1 BgaC [Carnobacterium maltaromaticum]
Length = 586
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 109/306 (35%), Positives = 147/306 (48%), Gaps = 44/306 (14%)
Query: 23 SGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTV 82
SG+IHY R PE W ++ K G +ETYV WN HEP +GQY F DL RF++
Sbjct: 21 SGAIHYFRVVPEYWEHRLKLLKNMGCNTVETYVAWNQHEPKKGQYVFSDALDLRRFIQLA 80
Query: 83 QEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRF----LAKIID 138
GL + LR PY CAE+ +GG P WL ++ R+T PF E ++ + ++ID
Sbjct: 81 DSLGLKVILRPSPYICAEFEFGGLPAWLLKDRHMRVRSTYPPFMERVRLYYRELFKEVID 140
Query: 139 LMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGE-LYVKWAADTAVNLNTSVPWVMCQ 197
L + GGPIIL QVENEYG G G E Y++ +VP V
Sbjct: 141 LQ------ITSGGPIILMQVENEYG------GYGSEKKYLQELVTMMKENGVTVPLVTSD 188
Query: 198 ------------QEDAPDPIINTCNGFYCDGFTPNSPSK----PIMWTENYSGWFLSFGY 241
QE A P +N C + F + K P+M E + GWF ++
Sbjct: 189 GPWGDMLENGSLQESAL-PTVN-CGSAIPEHFDRLAAFKQKKGPLMVMEYWIGWFDAWQD 246
Query: 242 AVPFRP-VEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPLV--ATSYDYDA 294
V+ ++ + G N+YM+ GGTNFG G G L+ TSYDYDA
Sbjct: 247 KKHHTTDVKSSVESLEEILKRGSV--NFYMFHGGTNFGFMNGANYYGKLLPDTTSYDYDA 304
Query: 295 PIDEYG 300
P++EYG
Sbjct: 305 PLNEYG 310
>gi|134096920|ref|YP_001102581.1| beta-galactosidase [Saccharopolyspora erythraea NRRL 2338]
gi|291006638|ref|ZP_06564611.1| beta-galactosidase [Saccharopolyspora erythraea NRRL 2338]
gi|133909543|emb|CAL99655.1| beta-galactosidase [Saccharopolyspora erythraea NRRL 2338]
Length = 594
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 111/333 (33%), Positives = 164/333 (49%), Gaps = 34/333 (10%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
A +T ++DG+ + +G +HY R+ P+ W + + + GL ++TYV WN+HEP
Sbjct: 15 AGLTVRGNEFLLDGEPFRIIAGEMHYFRTHPDQWRNRLDRMRALGLNSVDTYVAWNFHEP 74
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
RG+ F G D+VRFV+T EAGL + +R GPY CAEW++GG P WL R ++
Sbjct: 75 RRGEVDFTGWRDVVRFVETAAEAGLKVIIRPGPYICAEWDFGGLPAWLLESGNPPLRCSD 134
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA- 181
+ E R+ ++ L + L A++GGP++ QVENEYG +YG + A
Sbjct: 135 PAYTELTLRWFDEL--LPRLAPLQATRGGPVLAFQVENEYG----SYGNDQTHLEQLRAG 188
Query: 182 ------DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTP------NSPSKPIMWT 229
D+ + + M + + PD + T N F D P P P+ T
Sbjct: 189 MLERGIDSLLFCSNGPSDYMLRGGNLPD-TLATVN-FAGDPTAPFEALREYQPEGPLWCT 246
Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------ 283
E + GWF +G + A V R G + + YM GGTNFG AG
Sbjct: 247 EFWDGWFDHWGEEHHTTDPVETAGHVDRMLAAGASV-SLYMAVGGTNFGWWAGANYDTSK 305
Query: 284 ----PLVATSYDYDAPIDEYGFIRQPKWGHLRE 312
P + TSYDYD+PI E G + + K+ +RE
Sbjct: 306 DQYQPTI-TSYDYDSPIGEAGELTE-KFQRIRE 336
>gi|423252157|ref|ZP_17233159.1| hypothetical protein HMPREF1066_04169 [Bacteroides fragilis
CL03T00C08]
gi|423252477|ref|ZP_17233408.1| hypothetical protein HMPREF1067_00052 [Bacteroides fragilis
CL03T12C07]
gi|392647903|gb|EIY41596.1| hypothetical protein HMPREF1066_04169 [Bacteroides fragilis
CL03T00C08]
gi|392660553|gb|EIY54162.1| hypothetical protein HMPREF1067_00052 [Bacteroides fragilis
CL03T12C07]
Length = 628
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 119/384 (30%), Positives = 176/384 (45%), Gaps = 47/384 (12%)
Query: 15 DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
+GK + SG +HY R + W ++ K GL + TYVFWN HEP G++ F G +
Sbjct: 37 NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 75 LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
L F+K E G+ + LR GPY CAEW +GG+P WL + G++ R N F + K +
Sbjct: 97 LAEFIKIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAY-- 154
Query: 135 KIIDLMKQE--NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS- 190
ID + +E +L ++GGPI++ Q ENE+G+ V + E + + A L +
Sbjct: 155 --IDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAG 212
Query: 191 --VPWVMCQ-----QEDAPDPIINTCNG------------FYCDGFTPNSPSKPIMWTEN 231
VP + A + T NG Y DG P M E
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQYHDG------KGPYMVAEF 266
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA---- 287
Y GW + P +A ++ + +F N+YM GGTNFG T+G
Sbjct: 267 YPGWLSHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDI 325
Query: 288 ----TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY 343
TSYDYDAPI E G++ PK+ +R + IK +Y I P + ++ +
Sbjct: 326 QPDMTSYDYDAPISEAGWV-TPKYDSIRNV---IKKYVKYTIPEAPAPNPV-IEIPSIQL 380
Query: 344 HKSSNDCAAFLANYDSSSDANVTF 367
+K ++ A SSD +TF
Sbjct: 381 NKVADVLAFAEKQKPVSSDTPLTF 404
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 87/371 (23%), Positives = 143/371 (38%), Gaps = 57/371 (15%)
Query: 341 HIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP----AWSVSILPDCKNVVFNTAKVI 396
++ H +N ANYD D Y P W +NV+ K
Sbjct: 303 YMVHGGTNFGFTSGANYDKKRDIQPDMTSYDYDAPISEAGWVTPKYDSIRNVIKKYVKYT 362
Query: 397 SQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYL 456
+P + ++ +L + ++ E++ +S + P EQ+N Y+
Sbjct: 363 IPEAPAPNPVIEIPSI-QLNKVADVLAFAEKQKPVSSDT----PLTFEQLN---QGYGYV 414
Query: 457 WYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGI 516
YT + P G L I L A+V+V+ + V G N + + + ++ N
Sbjct: 415 LYTRHFN-QPISGT---LEIPGLRDYAVVYVDGEQV--GVLNRNTKTYSMEIEVPFNA-- 466
Query: 517 NTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLD 576
TL IL +G NYG+ G+ S + I +++ G +YQ+ ++ E L
Sbjct: 467 -TLQILVENMGRINYGSEIVHNTKGIISPVQI----AGKEIVGGWDMYQLPMD-EMPDLT 520
Query: 577 KISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRY 636
K+ A++ + + Y+ TF + G +++ S GKG +VNG +IGRY
Sbjct: 521 KLK-ADTHKNVPSEVAKLKGCPVLYEGTFTL-DKVGDTFMDMESWGKGIVFVNGVNIGRY 578
Query: 637 WSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGG 696
W P QTLY +P W+ GEN +VI E+L
Sbjct: 579 WKV----------------------------GPQQTLY-VPGVWLKKGENKIVIFEQLNE 609
Query: 697 DPSKISLLTKT 707
P KT
Sbjct: 610 TPQTEVKTVKT 620
>gi|325845662|ref|ZP_08168945.1| putative beta-galactosidase [Turicibacter sp. HGF1]
gi|325488263|gb|EGC90689.1| putative beta-galactosidase [Turicibacter sp. HGF1]
Length = 589
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 104/316 (32%), Positives = 149/316 (47%), Gaps = 38/316 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DGK + SG+IHY R P+ W + K G +ETYV WN HE GQ+ F G
Sbjct: 10 FLVDGKPTRIMSGAIHYFRIMPDHWEHSLYNLKALGFNTVETYVPWNLHEMREGQFDFTG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DLV FVK +E GL + LR GPY CAEW GG P WL ++ R + F E+++
Sbjct: 70 GKDLVSFVKKAEEIGLMVILRPGPYICAEWENGGLPAWLLNYHDMKIRCDDELFLEKVEN 129
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ ++ L+ L ++GGP+I+ QVENEYG+ +LY++ + V
Sbjct: 130 YFKVLLPLIVP--LQVTKGGPVIMVQVENEYGSFS-----NDKLYLRALKKMIEDAGIDV 182
Query: 192 P-------W--VMCQQEDAPDPIINTCNGFYCDG---------FTPNSPSK-PIMWTENY 232
P W + + ++ T N F G F K P+M E +
Sbjct: 183 PLFTSDGAWEQALMSGTLIEEEVLVTAN-FGSRGNENFDVLQSFMEKHDKKWPLMCMEFW 241
Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------P 284
GWF + + R +++ + + G N YM+ GGTNFG G P
Sbjct: 242 CGWFNRWNEDIILRDADEVMTCMKELLQRGSL--NLYMFHGGTNFGFMNGSCAGKIGNLP 299
Query: 285 LVATSYDYDAPIDEYG 300
V TSYDYDA + E+G
Sbjct: 300 QV-TSYDYDAFLTEWG 314
>gi|383116237|ref|ZP_09936989.1| hypothetical protein BSHG_3290 [Bacteroides sp. 3_2_5]
gi|251945420|gb|EES85858.1| hypothetical protein BSHG_3290 [Bacteroides sp. 3_2_5]
Length = 769
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 106/334 (31%), Positives = 154/334 (46%), Gaps = 31/334 (9%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ N T +++GK +++ +HY R W I K G+ I YVFWN HE
Sbjct: 18 AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
GQ+ F G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL I RT
Sbjct: 78 QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTL 137
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
+ F E F+ ++ + L ++GG II+ QVENEYG AY V + Y+
Sbjct: 138 DPYFMERTAIFMKEVGKQLAP--LQITRGGNIIMVQVENEYG----AYAV-DKPYISAIR 190
Query: 182 DTAVNLN-TSVPWVMCQ-----QEDAPDPIINTCNGFYCDG---------FTPNSPSKPI 226
D + T VP C + D ++ T N + G P P+
Sbjct: 191 DIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTIN--FGTGANIEQQFKRLKEARPETPL 248
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
M +E +SGWF +G RP + + + + +F + YM GGT FG G
Sbjct: 249 MCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNP 307
Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
+ +SYDYDAPI E G+ K+ LR+L K
Sbjct: 308 SYSAMCSSYDYDAPISEPGWTTD-KYFQLRDLLK 340
Score = 43.9 bits (102), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 67/162 (41%), Gaps = 46/162 (28%)
Query: 548 IDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGS--TLPVNKSLIWYKTT 604
++L GK+ W +Y V+ S +K G+ T+P +Y+TT
Sbjct: 481 VELVRGKQSAELKNWTVYSFPVD--------YSFVQDKRYKNGTAQTMPA-----YYRTT 527
Query: 605 FLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQK 664
F + G L++++ GKG WVNG +IGR+W
Sbjct: 528 FRL-DKVGDTFLDMSTWGKGMVWVNGLAIGRFWEI------------------------- 561
Query: 665 HCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
P QTL+ +P W+ GEN +++ + G + + I L K
Sbjct: 562 ---GPQQTLF-MPGCWLKEGENEIIVLDLKGPEKASIRGLKK 599
>gi|422700666|ref|ZP_16758509.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
gi|315170851|gb|EFU14868.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
Length = 593
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 112/352 (31%), Positives = 166/352 (47%), Gaps = 44/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++ FV+ ++ L + LR Y CAEW +GG P WL G++ R+T+ F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K + +QGGP+I+ QVENEYG +YG+ + Y++ L V
Sbjct: 131 YFQVL--LPKLAPMQITQGGPVIMMQVENEYG----SYGM-EKAYLQQTKQIMEELGIEV 183
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
P + + A + +++ D F T + P+M E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
+ GWF +G V R DLA V G N YM+ GGTNFG R A
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
TSYDYDA + E G + + + KAIK +C E + + P +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346
>gi|320162379|ref|YP_004175604.1| beta-galactosidase [Anaerolinea thermophila UNI-1]
gi|319996233|dbj|BAJ65004.1| beta-galactosidase [Anaerolinea thermophila UNI-1]
Length = 583
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 105/331 (31%), Positives = 160/331 (48%), Gaps = 29/331 (8%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
+ +T + +DG+ + +G++HY R P W + + K K GL +ETYV WN HEP
Sbjct: 2 STLTIEGDHFELDGEPFRILAGAMHYFRVHPAYWKDRLLKLKAMGLNTVETYVAWNLHEP 61
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G+++F ++ R+++ E GL++ +R GPY CAEW GG P WL P ++ R
Sbjct: 62 HEGEFHFGDWLNIERYIELAGELGLYVIVRPGPYICAEWEMGGLPAWLLKDPQMKLRCMY 121
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGG-------EL 175
P+ + + + ++++ + L +++GGPII QVENEYG +YG EL
Sbjct: 122 QPYLDAVGEYFSQLMHRLVP--LQSTRGGPIIAMQVENEYG----SYGNDTRYLKYLEEL 175
Query: 176 YVKWAADTAVNLNTSVPWVMCQQEDAPD--PIINTCN--GFYCDGFTPNSPSKPIMWTEN 231
+ D + V M Q P +N N G + P++ E
Sbjct: 176 LRQCGVDVLLFTADGVADEMMQYGSLPHLFKAVNFGNRPGDAFEKLREYQTGGPLLVAEF 235
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------- 283
+ GWF +G R ++A + G + N YM+ GGTNFG G
Sbjct: 236 WDGWFDHWGERHHTRSAGEVARVLDDLLSEGASV-NLYMFHGGTNFGFMNGANAFPSPHY 294
Query: 284 -PLVATSYDYDAPIDEYGFIRQPKWGHLREL 313
P V TSYDYDAP+ E G I PK+ +RE+
Sbjct: 295 TPTV-TSYDYDAPLSECGNI-TPKYEAMREV 323
>gi|289166983|ref|YP_003445250.1| beta-galactosidase 3 [Streptococcus mitis B6]
gi|288906548|emb|CBJ21380.1| beta-galactosidase 3 [Streptococcus mitis B6]
Length = 595
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 104/307 (33%), Positives = 149/307 (48%), Gaps = 25/307 (8%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK + SG+IHY R PE W + K G +ETYV WN HEP G+++FEG
Sbjct: 12 LDGKPFKILSGAIHYFRIPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFHFEGAL 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL RF++T Q+ GL+ +R P+ CAEW +GG P WL ++ R++ + E + R+
Sbjct: 72 DLERFLQTAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKNMRIRSSAPAYIEAVGRYY 130
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
+++ + L GG I++ QVENEYG+ + AY ++ T +
Sbjct: 131 DQLLSRLVPHLL--DNGGNILMMQVENEYGSYGEDKAYLRAIRQLMEDRGVTCPLFTSDG 188
Query: 192 PWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
PW + D + T N + F + P+M E + GWF +
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
+ R ++LA AV E G N YM+ GGTNFG G G L TSYDYD
Sbjct: 249 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYD 306
Query: 294 APIDEYG 300
A +DE G
Sbjct: 307 ALLDEEG 313
>gi|12852936|dbj|BAB29584.1| unnamed protein product [Mus musculus]
Length = 586
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 108/312 (34%), Positives = 151/312 (48%), Gaps = 29/312 (9%)
Query: 20 VLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFV 79
++ GSIHY R E W + + K + G + TY+ WN HE RG++ F DL +V
Sbjct: 1 MIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEILDLEAYV 60
Query: 80 KTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDL 139
+ GL++ LR GPY CAE + GG P WL P RTTN F E + ++ +I
Sbjct: 61 LLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDKYFDHLIP- 119
Query: 140 MKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQE 199
K L GGP+I QVENEYG+ + Y+K A L + ++ +
Sbjct: 120 -KILPLQYRHGGPVIAVQVENEYGSFQKDRNYMN--YLKKAL-----LKRGIVELLLTSD 171
Query: 200 DAPDPIINTCNG----FYCDGFTPNS--------PSKPIMWTENYSGWFLSFGYAVPFRP 247
D I + NG + FT +S KPIM E ++GW+ S+G +
Sbjct: 172 DKDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYDSWGSKHIEKS 231
Query: 248 VEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSYDYDAPIDEYGF 301
E++ V +F G +F N YM+ GGTNFG GG V TSYDYDA + E G
Sbjct: 232 AEEIRHTVYKFISYGLSF-NMYMFHGGTNFGFINGGRYENHHISVVTSYDYDAVLSEAGD 290
Query: 302 IRQPKWGHLREL 313
+ K+ LR+L
Sbjct: 291 YTE-KYFKLRKL 301
>gi|403528012|ref|YP_006662899.1| beta-galactosidase GLB [Arthrobacter sp. Rue61a]
gi|403230439|gb|AFR29861.1| beta-galactosidase GLB [Arthrobacter sp. Rue61a]
Length = 598
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 110/350 (31%), Positives = 163/350 (46%), Gaps = 58/350 (16%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+A ++Y L G+ + +G+IHY R P++W + +R+ K G ++TYV WN+H+
Sbjct: 3 NALLSYHDAVLYRSGEPYRILAGAIHYFRVHPDLWQDRLRRLKAMGANTVDTYVAWNFHQ 62
Query: 62 PIRGQY-YFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
P R + F G DL RF+ E GL + +R GPY CAEW+ GGFP WL IPGI R
Sbjct: 63 PKRDEAPDFSGWQDLGRFMDLAAEEGLDVIVRPGPYICAEWDNGGFPSWLTGIPGIGLRC 122
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
+ F ++ + ++ ++ S GGP++ Q+ENEYG +YG E Y++W
Sbjct: 123 MDPVFTAAIEEWFDHLLPIVASRQ--TSAGGPVVAVQIENEYG----SYGDDHE-YIRWN 175
Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNG---FYCDG--------------------- 216
+E ++ T +G ++ DG
Sbjct: 176 R-------------RALEERGITELLFTADGGTDYFLDGGAVEGTWATATLGSRGDEAVA 222
Query: 217 -FTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGT 275
+ P +P E + GWF +G R ED A + + GG+ YM GGT
Sbjct: 223 TWQRRRPGEPFFNVEFWGGWFDHWGEHHHGRDAEDAALEARKMLDLGGSL-CAYMAHGGT 281
Query: 276 NFGRTAGG--------PLVATSYDYDAPIDEYGFIRQPKWGHLR-ELHKA 316
NFG +G P V TSYD DAPI E G + PK+ R E ++A
Sbjct: 282 NFGLRSGSNHDGTMLQPTV-TSYDSDAPIAENGAL-TPKFHAFRKEFYRA 329
>gi|449458169|ref|XP_004146820.1| PREDICTED: beta-galactosidase 17-like [Cucumis sativus]
Length = 719
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 112/353 (31%), Positives = 166/353 (47%), Gaps = 45/353 (12%)
Query: 15 DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
DGK + G +HY R+ PE W + + ++K GL I+TY+ WN HEP G + F G +
Sbjct: 79 DGKPFQIIGGDLHYFRTLPEYWEDRLLRAKALGLNTIQTYIPWNLHEPKPGNFTFNGIAN 138
Query: 75 LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRTTNNPFKEEMKRFL 133
+V F++ Q+ + LR GPY CAEW+ GGFP W L +P + R+++ + + ++R+
Sbjct: 139 IVSFIQLCQKLDFLVLLRPGPYICAEWDLGGFPAWLLSKMPASRLRSSDPGYLQWVERWW 198
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGN-----------VEWAYGVGGELYVKWAAD 182
I L K L + GGPII+ Q+ENE+G+ V A G G+ + + D
Sbjct: 199 GII--LPKVAPLLYNNGGPIIMVQIENEFGSYGDDQAYLHHLVALARGYLGDEIILYTTD 256
Query: 183 ---------TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSK-PIMWTENY 232
+ N V + P PI N F N P K P + E Y
Sbjct: 257 GGTRETLEKGTIRGNAVFSAVDFSTGERPWPIFNLQKEF-------NPPGKSPPLTAEFY 309
Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNF----GRTAGGPLV-- 286
+GW +G + A A+ G+ YM GGTNF G G ++
Sbjct: 310 TGWLTHWGENIATTDANSTAAALNEILAGKGS-AVLYMAHGGTNFGFYNGANTGNDVLDY 368
Query: 287 ---ATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPT-HQKLG 335
TSYDYDAPI E G + K+ +R + I+ LI S P+ ++K+G
Sbjct: 369 KPDLTSYDYDAPIKESGDVDNAKYEAIR---RVIQHYSGALIPSVPSNNEKIG 418
>gi|293376766|ref|ZP_06622988.1| glycosyl hydrolase family 35 [Turicibacter sanguinis PC909]
gi|292644632|gb|EFF62720.1| glycosyl hydrolase family 35 [Turicibacter sanguinis PC909]
Length = 589
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 104/316 (32%), Positives = 149/316 (47%), Gaps = 38/316 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DGK + SG+IHY R P+ W + K G +ETYV WN HE GQ+ F G
Sbjct: 10 FLVDGKPTRIMSGAIHYFRIMPDHWEHSLYNLKALGFNTVETYVPWNLHEMREGQFDFTG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DLV FVK +E GL + LR GPY CAEW GG P WL ++ R + F E+++
Sbjct: 70 GKDLVSFVKKAEEIGLMVILRPGPYICAEWENGGLPAWLLNYHDMKIRCDDELFLEKVEN 129
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ ++ L+ L ++GGP+I+ QVENEYG+ +LY++ + V
Sbjct: 130 YFKVLLPLIVP--LQVTKGGPVIMVQVENEYGSFS-----NDKLYLRALKKMIEDAGIDV 182
Query: 192 P-------W--VMCQQEDAPDPIINTCNGFYCDG---------FTPNSPSK-PIMWTENY 232
P W + + ++ T N F G F K P+M E +
Sbjct: 183 PLFTSDGAWEQALMSGTLIEEEVLVTAN-FGSRGNENFDVLQSFMEKHDKKWPLMCMEFW 241
Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------P 284
GWF + + R +++ + + G N YM+ GGTNFG G P
Sbjct: 242 CGWFNRWNEDIILRDADEVMTCMKELLQRGSL--NLYMFHGGTNFGFMNGSCAGKIGNLP 299
Query: 285 LVATSYDYDAPIDEYG 300
V TSYDYDA + E+G
Sbjct: 300 QV-TSYDYDAFLTEWG 314
>gi|58581392|ref|YP_200408.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae KACC 10331]
gi|58425986|gb|AAW75023.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae KACC 10331]
Length = 651
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 113/310 (36%), Positives = 149/310 (48%), Gaps = 28/310 (9%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
V DGK L SG+IH+ R W + ++K++ GL +ETYVFWN EP +GQ+ F G
Sbjct: 77 FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 136
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ FV+ GL + LR GPYACAEW GG+P WL I+ R+ + F +
Sbjct: 137 NNDVAAFVQEAAAQGLNVILRPGPYACAEWEAGGYPAWLFGQGNIRVRSRDPRFLAASQA 196
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGE-LYVKWAADTAVNLN 188
+L + + + L GGPII QVENEYG+ + AY +YVK D A+ L
Sbjct: 197 YLDAVAKQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LF 253
Query: 189 TSVPWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
TS M PD ++N G D P +P M E ++GWF +G
Sbjct: 254 TSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIAFRPDQPRMVGEYWAGWFDHWGK-- 311
Query: 244 PFRPVEDLAFAVARFFE---TGGTFQNYYMYFGGTNFGRTAGGPL----------VATSY 290
P + A A FE G N YM+ GGT+FG G TSY
Sbjct: 312 PHAATD--ATQQAEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSY 369
Query: 291 DYDAPIDEYG 300
DYDA +DE G
Sbjct: 370 DYDAIVDEAG 379
>gi|424665378|ref|ZP_18102414.1| hypothetical protein HMPREF1205_01253 [Bacteroides fragilis HMW
616]
gi|404574622|gb|EKA79370.1| hypothetical protein HMPREF1205_01253 [Bacteroides fragilis HMW
616]
Length = 624
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 110/340 (32%), Positives = 158/340 (46%), Gaps = 54/340 (15%)
Query: 16 GKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDL 75
G+ + SG +HY R + W ++ K GL + TYVFWN HE G++ F G +L
Sbjct: 35 GETTPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94
Query: 76 VRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAK 135
+++ E G+ + LR GPY CAEW +GG+P WL IPG++ R N F + K++
Sbjct: 95 AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNTEFLKYTKKY--- 151
Query: 136 IIDLMKQE--NLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV-P 192
ID + QE L ++GGPII+ Q ENE+G+ YV D + + S
Sbjct: 152 -IDRLYQEVGPLQCTKGGPIIMVQCENEFGS-----------YVSQRKDISFEEHRSYNA 199
Query: 193 WVMCQQEDA--PDPIINTCNGFYCDGF-------TPNSPSK----------------PIM 227
+ Q DA P+ + + +G T N S P M
Sbjct: 200 KIKGQLADAGFTVPLFTSDGSWLFEGGCVAGALPTANGESDIANLKKVVNQYHGGKGPYM 259
Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA 287
E Y GW +G P ++A + + +F N+YM GGTNFG T+G
Sbjct: 260 VAEFYPGWLSHWGEPFPQVSASEIARQTEAYLQNNVSF-NFYMVHGGTNFGFTSGANYDK 318
Query: 288 --------TSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIK 318
TSYDYDAPI E G+I PK+ +R + K +K
Sbjct: 319 KRDIQPDLTSYDYDAPISEAGWI-TPKYDSIRSVIQKYVK 357
>gi|332375542|gb|AEE62912.1| unknown [Dendroctonus ponderosae]
Length = 454
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 116/354 (32%), Positives = 168/354 (47%), Gaps = 52/354 (14%)
Query: 1 LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
L+AN ++ ++ K + SG++HY R W + +RK + GL +ETYV WN H
Sbjct: 27 LNANQSF----FTLNDKLIKIYSGAMHYFRVPRPYWRDRLRKIRAAGLNTVETYVPWNLH 82
Query: 61 EPIRGQY-------YFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFI 113
EP G++ FE L F+ +E LF+ LR GPY C+E+N GGFP WL
Sbjct: 83 EPENGKFDFGEGGSEFEDFLHLEEFLNAAKEEDLFVILRTGPYICSEYNSGGFPSWLLRE 142
Query: 114 PGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQ-GGPIILAQVENEYGNVEWAYGVG 172
+ FRT+ + + + RF ++ L+ F Q GGP+I QVENEYGN+E
Sbjct: 143 KPMGFRTSEENYMKFVTRFFNVVLTLLAA---FQFQLGGPVIAFQVENEYGNLENGAAFQ 199
Query: 173 ---------GELYVK-------WAADTAVNLNTS--VPWVMCQQEDAPDPIINTCNGFYC 214
+L++K +AD+ + TS +P + Q + D +N N
Sbjct: 200 PDKVYMEELRQLFLKNGIVELLTSADSPLWKGTSGTLPGELFQTANFGDNAVNQLNK--L 257
Query: 215 DGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGG 274
+ F P +P+M E + GWF + G + ED + F +F N YM+ GG
Sbjct: 258 EEF---QPGRPLMVMEYWIGWFDNVGGEHSVKSDEDSRRVLEDIFSKNASF-NAYMFHGG 313
Query: 275 TNFGRTAGGPL------------VATSYDYDAPIDEYGFIRQPKWGHLRELHKA 316
TNF G L + TSYDYDAPI E G R K+ ++EL A
Sbjct: 314 TNFWFNNGANLDNDLMDNSGYTAITTSYDYDAPISESGGYRN-KYFIVKELVAA 366
>gi|281422858|ref|ZP_06253857.1| beta-galactosidase [Prevotella copri DSM 18205]
gi|281403124|gb|EFB33804.1| beta-galactosidase [Prevotella copri DSM 18205]
Length = 788
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 103/329 (31%), Positives = 156/329 (47%), Gaps = 33/329 (10%)
Query: 6 TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
T + +++GK V+++ +HYPR W I+ K G+ + YVFWN HE G
Sbjct: 33 TTGDKTFLLNGKPFVVKAAELHYPRIPRAYWEHRIKMCKALGMNTVCLYVFWNIHEQEEG 92
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
++ F G D+ F + Q+ G+++ +R GPY CAEW GG P WL I+ R + F
Sbjct: 93 KFDFTGNNDVAAFCRLAQKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREQDPYF 152
Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV---------------EWAYG 170
+ ++ F ++ + L GGPII+ QVENEYG+ + +
Sbjct: 153 MQRVEIFEKEVGKQLAP--LTIQNGGPIIMVQVENEYGSYGKDKPYVSAIRDIVRKSGFD 210
Query: 171 VGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTE 230
W+++ N + W M A N F G P+ P M +E
Sbjct: 211 KVSLFQCDWSSNFLNNGLDDLTWTMNFGTGA-----NIDQQFKRLGEV--RPNAPKMCSE 263
Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------P 284
+SGWF +G RP +D+ + G +F + YM GGT+FG AG P
Sbjct: 264 FWSGWFDKWGARHETRPAKDMVEGMDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFQP 322
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLREL 313
V TSYDYDAPI+E+G + PK+ L+++
Sbjct: 323 DV-TSYDYDAPINEWG-LATPKFYELQKM 349
>gi|194761012|ref|XP_001962726.1| GF14288 [Drosophila ananassae]
gi|190616423|gb|EDV31947.1| GF14288 [Drosophila ananassae]
Length = 661
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 105/324 (32%), Positives = 155/324 (47%), Gaps = 37/324 (11%)
Query: 6 TYDHRA--LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
T DH A ++DG+ SGS HY R+ PE W +R + GL ++TYV W+ H P
Sbjct: 36 TIDHEANSFMLDGEPFRYVSGSFHYFRAVPEAWRSRLRTMRASGLNALDTYVEWSLHNPH 95
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWL-HFIPGIQFRTTN 122
+Y +EG D+V+F++ QE ++ LR GPY CAE + GG P WL P I+ RT +
Sbjct: 96 EDEYNWEGIADVVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFKKYPSIKMRTND 155
Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
+ E+ ++ A++ + + ++L GG II+ QVENEYG+ + Y+ W D
Sbjct: 156 PDYIAEVGKWYAQL--MPRLQHLLVGNGGKIIMVQVENEYGDYACDHD-----YLNWLRD 208
Query: 183 --------TAVNLNTSVP--WVMCQQED----APDPIINTCNGF--YCDGFTPNSPSKPI 226
A+ +P + C + D D I+ N P+ P+
Sbjct: 209 ETEKYVSGKALLFTVDIPNEKMSCGKIDNVFATTDFGIDRINEIDEIWKMLRVQQPTGPL 268
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
+ +E Y GW + R + +A A+ + N YM+FGGTNFG TAG
Sbjct: 269 VNSEFYPGWLTHWQEQNQRRDGQVVADALKTILSYNASV-NLYMFFGGTNFGFTAGANYD 327
Query: 284 -------PLVATSYDYDAPIDEYG 300
TSYDYDA +DE G
Sbjct: 328 LDGGIGYAADITSYDYDAVMDEAG 351
>gi|381169756|ref|ZP_09878919.1| beta-galactosidase [Xanthomonas citri pv. mangiferaeindicae LMG
941]
gi|380689774|emb|CCG35406.1| beta-galactosidase [Xanthomonas citri pv. mangiferaeindicae LMG
941]
Length = 613
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 115/319 (36%), Positives = 155/319 (48%), Gaps = 29/319 (9%)
Query: 15 DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
DGK L SG+IH+ R W + ++K++ GL +ETYVFWN EP +GQ+ F G D
Sbjct: 42 DGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGHND 101
Query: 75 LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
+ FV+ GL + LR GPYACAEW GG+P WL I+ R+ + F + +L
Sbjct: 102 VAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLD 161
Query: 135 KIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGE-LYVKWAADTAVNLNTSV 191
+ + + + L GGPII QVENEYG+ + AY +YVK D A+ L TS
Sbjct: 162 ALANQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LFTSD 218
Query: 192 PWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFR 246
M PD ++N G D P +P M E ++GWF +G P
Sbjct: 219 GADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGK--PHA 276
Query: 247 PVEDLAFAVARFFE---TGGTFQNYYMYFGGTNFGRTAGGPL----------VATSYDYD 293
+ A A FE G N YM+ GGT+FG G TSYDYD
Sbjct: 277 ATD--ARQQAEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYD 334
Query: 294 APIDEYGFIRQPKWGHLRE 312
A +DE G PK+ +R+
Sbjct: 335 AILDEAGHP-TPKFALMRD 352
>gi|432894411|ref|XP_004075980.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Oryzias
latipes]
Length = 640
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 108/318 (33%), Positives = 148/318 (46%), Gaps = 35/318 (11%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
S+N T + + +I G GSIHY R W + + K K GL + TYV WN HE
Sbjct: 52 SSNFTLERKPFLILG-------GSIHYFRVPKAYWEDRLLKLKACGLNTLTTYVPWNLHE 104
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
P RG + FEG DL ++ G+++ LR GPY CAEW+ GG P WL ++ RTT
Sbjct: 105 PERGVFDFEGELDLEAYLGLAASLGIWVILRPGPYICAEWDLGGLPSWLLRDQNMRLRTT 164
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
F + + +I K S+GGPII QVENEYG +Y + E Y+ +
Sbjct: 165 YPGFTAAVDSYFDHLIK--KVAPYQYSRGGPIIAVQVENEYG----SYAMDEE-YMPFIK 217
Query: 182 DTAVNLNTSVPWVMCQQED-----APDPIINTCNGFYCDG-----FTPNSPSKPIMWTEN 231
+ ++ + V +D + T N D P KP M E
Sbjct: 218 EALLSRGITELLVTSDNKDGLKLGGVKGALETINFQKLDPEEIKYLEKIQPQKPKMVMEY 277
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNF---------GRTAG 282
+SGWF +G P E++ V + + N YM+ GGTNF GR +
Sbjct: 278 WSGWFDLWGGLHHVFPAEEMMAVVTEILKLDMSI-NLYMFHGGTNFGFMSGAFAVGRPSP 336
Query: 283 GPLVATSYDYDAPIDEYG 300
P+V TSYDYDAP+ E G
Sbjct: 337 APMV-TSYDYDAPLSEAG 353
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 55/224 (24%), Positives = 90/224 (40%), Gaps = 48/224 (21%)
Query: 475 NIESLGHAALVFVNKKLV-AFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
++ ++ ALVFV K+ V Y + + I +G TL +L G NYG
Sbjct: 446 SLNNIRDRALVFVEKQFVGVLDYKEQELS-------IPDGKGKRTLGLLVENCGRVNYGK 498
Query: 534 WFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
D GL I ++ N RD I+ + ++ +++ L +S+ WK P
Sbjct: 499 TLDEQRKGLVGDIQLN-ANILRDFM----IHSLDMKPDFVS----RLQSSAQWKSMREKP 549
Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
+ +++T L L KG +VNG+++GRYWS
Sbjct: 550 SFPA--FFQTKLYLSSSPKDTFLKLPGWSKGVVFVNGKNLGRYWSV-------------- 593
Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGD 697
P QTLY +P W++ +N +++ EEL D
Sbjct: 594 --------------GPQQTLY-VPGAWLNRWDNEIIVFEELETD 622
>gi|423251759|ref|ZP_17232772.1| hypothetical protein HMPREF1066_03782 [Bacteroides fragilis
CL03T00C08]
gi|423255080|ref|ZP_17236010.1| hypothetical protein HMPREF1067_02654 [Bacteroides fragilis
CL03T12C07]
gi|392649184|gb|EIY42863.1| hypothetical protein HMPREF1066_03782 [Bacteroides fragilis
CL03T00C08]
gi|392652521|gb|EIY46180.1| hypothetical protein HMPREF1067_02654 [Bacteroides fragilis
CL03T12C07]
Length = 769
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 106/334 (31%), Positives = 154/334 (46%), Gaps = 31/334 (9%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ N T +++GK +++ +HY R W I K G+ I YVFWN HE
Sbjct: 18 AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
GQ+ F G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL I RT
Sbjct: 78 QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTL 137
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
+ F E F+ ++ + L ++GG II+ QVENEYG AY V + YV
Sbjct: 138 DPYFMERTAIFMKEVGKQLAP--LQITRGGNIIMVQVENEYG----AYAV-DKPYVSAIR 190
Query: 182 DTAVNLN-TSVPWVMCQ-----QEDAPDPIINTCNGFYCDG---------FTPNSPSKPI 226
D + T VP C + D ++ T N + G P P+
Sbjct: 191 DIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTIN--FGTGANIEQQFKRLKEARPETPL 248
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
M +E +SGWF +G RP + + + + +F + YM GGT FG G
Sbjct: 249 MCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNP 307
Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
+ +SYDYDAPI E G+ K+ LR+L +
Sbjct: 308 SYSAMCSSYDYDAPISEPGWTTD-KYFQLRDLLR 340
Score = 43.9 bits (102), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 67/162 (41%), Gaps = 46/162 (28%)
Query: 548 IDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGS--TLPVNKSLIWYKTT 604
++L GK+ W +Y V+ S +K G+ T+P +Y+TT
Sbjct: 481 VELVRGKQSAELKNWTVYSFPVD--------YSFVQDKRYKNGTAQTMPA-----YYRTT 527
Query: 605 FLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQK 664
F + G L++++ GKG WVNG +IGR+W
Sbjct: 528 FRL-DKVGDTFLDMSTWGKGMVWVNGLAIGRFWEI------------------------- 561
Query: 665 HCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
P QTL+ +P W+ GEN +++ + G + + I L K
Sbjct: 562 ---GPQQTLF-MPGCWLKEGENEIIVLDLKGPEKASIRGLKK 599
>gi|53715536|ref|YP_101528.1| beta-galactosidase [Bacteroides fragilis YCH46]
gi|60683489|ref|YP_213633.1| beta-galactosidase [Bacteroides fragilis NCTC 9343]
gi|375360299|ref|YP_005113071.1| putative beta-galactosidase [Bacteroides fragilis 638R]
gi|423280737|ref|ZP_17259649.1| hypothetical protein HMPREF1203_03866 [Bacteroides fragilis HMW
610]
gi|52218401|dbj|BAD50994.1| beta-galactosidase precursor [Bacteroides fragilis YCH46]
gi|60494923|emb|CAH09735.1| putative beta-galactosidase [Bacteroides fragilis NCTC 9343]
gi|301164980|emb|CBW24544.1| putative beta-galactosidase [Bacteroides fragilis 638R]
gi|404583944|gb|EKA88617.1| hypothetical protein HMPREF1203_03866 [Bacteroides fragilis HMW
610]
Length = 624
Score = 148 bits (373), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 110/340 (32%), Positives = 158/340 (46%), Gaps = 54/340 (15%)
Query: 16 GKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDL 75
G+ + SG +HY R + W ++ K GL + TYVFWN HE G++ F G +L
Sbjct: 35 GETTPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94
Query: 76 VRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAK 135
+++ E G+ + LR GPY CAEW +GG+P WL IPG++ R N F + K++
Sbjct: 95 AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNTEFLKYTKKY--- 151
Query: 136 IIDLMKQE--NLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV-P 192
ID + QE L ++GGPII+ Q ENE+G+ YV D + + S
Sbjct: 152 -IDRLYQEVGPLQCTKGGPIIMVQCENEFGS-----------YVSQRKDISFEEHRSYNA 199
Query: 193 WVMCQQEDA--PDPIINTCNGFYCDGF-------TPNSPSK----------------PIM 227
+ Q DA P+ + + +G T N S P M
Sbjct: 200 KIKGQLADAGFTVPLFTSDGSWLFEGGCVAGALPTANGESDIANLKKVVNQYHGGKGPYM 259
Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA 287
E Y GW +G P ++A + + +F N+YM GGTNFG T+G
Sbjct: 260 VAEFYPGWLSHWGEPFPQVSASEIARQTEAYLQNDVSF-NFYMVHGGTNFGFTSGANYDK 318
Query: 288 --------TSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIK 318
TSYDYDAPI E G+I PK+ +R + K +K
Sbjct: 319 KRDIQPDLTSYDYDAPISEAGWI-TPKYDSIRSVIQKYVK 357
>gi|325925751|ref|ZP_08187124.1| beta-galactosidase [Xanthomonas perforans 91-118]
gi|325543808|gb|EGD15218.1| beta-galactosidase [Xanthomonas perforans 91-118]
Length = 611
Score = 148 bits (373), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 114/320 (35%), Positives = 152/320 (47%), Gaps = 25/320 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
V DGK + SG+IH+ R W + ++K++ GL +ETYVFWN EP +GQ+ F G
Sbjct: 37 FVRDGKPYQVLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 96
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ FV+ GL + LR GPYACAEW GG+P WL I+ R+ + F +
Sbjct: 97 NNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQS 156
Query: 132 FLAKIIDLMKQ-ENLFASQGGPIILAQVENEYGNV--EWAYGVGGE-LYVKWAADTAVNL 187
+L L KQ + L GGPII QVENEYG+ + AY +YVK D A+ L
Sbjct: 157 YLDA---LAKQVQPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-L 212
Query: 188 NTSVPWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
TS M PD ++N G D P +P M E ++GWF +G
Sbjct: 213 FTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGKP 272
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL----------VATSYDY 292
A G + N YM+ GGT+FG G TSYDY
Sbjct: 273 HAATDARQQAEEFEWILRQGHS-ANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDY 331
Query: 293 DAPIDEYGFIRQPKWGHLRE 312
DA +DE G PK+ +R+
Sbjct: 332 DAILDEAGHP-TPKFALMRD 350
>gi|84623327|ref|YP_450699.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae MAFF 311018]
gi|188577369|ref|YP_001914298.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae PXO99A]
gi|84367267|dbj|BAE68425.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae MAFF 311018]
gi|188521821|gb|ACD59766.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae PXO99A]
Length = 613
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 113/310 (36%), Positives = 149/310 (48%), Gaps = 28/310 (9%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
V DGK L SG+IH+ R W + ++K++ GL +ETYVFWN EP +GQ+ F G
Sbjct: 39 FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 98
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ FV+ GL + LR GPYACAEW GG+P WL I+ R+ + F +
Sbjct: 99 NNDVAAFVQEAAAQGLNVILRPGPYACAEWEAGGYPAWLFGQGNIRVRSRDPRFLAASQA 158
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGE-LYVKWAADTAVNLN 188
+L + + + L GGPII QVENEYG+ + AY +YVK D A+ L
Sbjct: 159 YLDAVAKQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LF 215
Query: 189 TSVPWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
TS M PD ++N G D P +P M E ++GWF +G
Sbjct: 216 TSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIAFRPDQPRMVGEYWAGWFDHWGK-- 273
Query: 244 PFRPVEDLAFAVARFFE---TGGTFQNYYMYFGGTNFGRTAGGPL----------VATSY 290
P + A A FE G N YM+ GGT+FG G TSY
Sbjct: 274 PHAATD--ATQQAEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSY 331
Query: 291 DYDAPIDEYG 300
DYDA +DE G
Sbjct: 332 DYDAIVDEAG 341
>gi|307705099|ref|ZP_07641979.1| beta-galactosidase [Streptococcus mitis SK597]
gi|307621359|gb|EFO00416.1| beta-galactosidase [Streptococcus mitis SK597]
Length = 595
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 104/307 (33%), Positives = 149/307 (48%), Gaps = 25/307 (8%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK + SG+IHY R PE W + K G +ETYV WN HEP G+++FEG
Sbjct: 12 LDGKSFKILSGAIHYFRVPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFHFEGAL 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL RF++T Q+ GL+ +R P+ CAEW +GG P WL ++ R+++ + E + R+
Sbjct: 72 DLERFLQTAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKDMRIRSSDPAYIEAVGRYY 130
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
++ + L GG I++ QVENEYG+ + AY ++ T +
Sbjct: 131 DQLFPRLVPHLL--DNGGNILMMQVENEYGSYGEDKAYLRVIRQLMEERGVTCPLFTSDG 188
Query: 192 PWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
PW + D + T N + F + P+M E + GWF +
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
+ R ++LA AV E G N YM+ GGTNFG G G L TSYDYD
Sbjct: 249 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYD 306
Query: 294 APIDEYG 300
A +DE G
Sbjct: 307 ALLDEEG 313
>gi|195108029|ref|XP_001998595.1| GI23552 [Drosophila mojavensis]
gi|193915189|gb|EDW14056.1| GI23552 [Drosophila mojavensis]
Length = 641
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 110/341 (32%), Positives = 162/341 (47%), Gaps = 44/341 (12%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
S V Y++ + DG+ +GS HY R+ P+ W +R + GL + TYV W+ H
Sbjct: 25 SFTVDYENDRFLKDGRPFHFIAGSFHYFRAHPDTWSRHLRTMRAAGLNAVTTYVEWSLHN 84
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRT 120
P G Y + G DL RF++ + L + LR GPY CAE + GGFP W L+ PGIQ RT
Sbjct: 85 PRDGVYVWTGIADLERFIRLAVDEDLLVILRPGPYICAERDMGGFPYWLLNKFPGIQLRT 144
Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKW- 179
+ + E++ + +++ + + GGPII+ QVENEYG +Y Y W
Sbjct: 145 ADINYLSEVRIWYSQL--MARIGPYLYGNGGPIIMVQVENEYG----SYFACDANYRNWL 198
Query: 180 -------AADTAVNLNTSVPWVM-CQQEDAPDPIINTCNGFYCDGFTPN----------- 220
D+AV P V+ C + ++ T + G T N
Sbjct: 199 RDETQNHVKDSAVLFTNDGPGVLRCGKIQG---VLATMDF----GATSNLKDVWAKLRQY 251
Query: 221 SPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRT 280
P P++ E Y GW + + + ++G + N+YM++GGTNFG T
Sbjct: 252 QPKGPLVNAEYYPGWLTHWTEPMANVSTSAITGTFIDMLDSGASV-NFYMFYGGTNFGFT 310
Query: 281 AG------GPLVA--TSYDYDAPIDEYGFIRQPKWGHLREL 313
AG G +A TSYDYDAP+ E G PK+ LR++
Sbjct: 311 AGANDNGPGNYIADITSYDYDAPMTEAG-DPTPKYMALRQI 350
>gi|384420175|ref|YP_005629535.1| beta-galactosidase [Xanthomonas oryzae pv. oryzicola BLS256]
gi|353463088|gb|AEQ97367.1| beta-galactosidase [Xanthomonas oryzae pv. oryzicola BLS256]
Length = 613
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 113/310 (36%), Positives = 149/310 (48%), Gaps = 28/310 (9%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
V DGK L SG+IH+ R W + ++K++ GL +ETYVFWN EP +GQ+ F G
Sbjct: 39 FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 98
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ FV+ GL + LR GPYACAEW GG+P WL I+ R+ + F +
Sbjct: 99 NNDVAAFVQEAAAQGLNVILRPGPYACAEWEAGGYPAWLFGQGNIRVRSRDPRFLAASQA 158
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGE-LYVKWAADTAVNLN 188
+L + + + L GGPII QVENEYG+ + AY +YVK D A+ L
Sbjct: 159 YLDAVAKQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LF 215
Query: 189 TSVPWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
TS M PD ++N G D P +P M E ++GWF +G
Sbjct: 216 TSDGAEMLANGTLPDTLAVVNFAPGEAKSAFDKLIAFRPDQPRMVGEYWAGWFDHWGK-- 273
Query: 244 PFRPVEDLAFAVARFFE---TGGTFQNYYMYFGGTNFGRTAGGPL----------VATSY 290
P + A A FE G N YM+ GGT+FG G TSY
Sbjct: 274 PHAATD--ATQQAEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSY 331
Query: 291 DYDAPIDEYG 300
DYDA +DE G
Sbjct: 332 DYDAIVDEAG 341
>gi|395816938|ref|XP_003781939.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase [Otolemur
garnettii]
Length = 669
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 108/338 (31%), Positives = 156/338 (46%), Gaps = 27/338 (7%)
Query: 4 NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
+ Y + DG+ SGSIHY R W + + K K GL I+TYV WN+HEP
Sbjct: 33 KIDYSRDRFLKDGQPFRYISGSIHYSRLPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQ 92
Query: 64 RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
G+Y F D+ F++ E GL + LR GPY CAEW+ GG P WL + R+++
Sbjct: 93 PGKYQFSEDHDVEYFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKESMILRSSDP 152
Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVK----- 178
+ + ++L ++ MK L GGPII QVENEYG +Y Y++
Sbjct: 153 DYLAAVDKWLGVLLPKMKP--LLYQNGGPIISVQVENEYG----SYFTCDHDYMRFLLKR 206
Query: 179 ---WAADTAVNLNTS---VPWVMCQQEDAPDPIINTCNGFYCDGF----TPNSPSKPIMW 228
+ D V T ++ C ++ G + P P++
Sbjct: 207 FRYYLGDDVVLFTTDGIFEKYLNCGALQGLYATVDFGTGVNITAAFKLQRKSEPKGPLIN 266
Query: 229 TENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLV 286
+E Y+GW +G ED+AF++ G + N YM+ GGTNF G P
Sbjct: 267 SEFYTGWLDHWGQPHSTVKTEDVAFSLFDILARGASV-NLYMFTGGTNFAYWNGANIPYS 325
Query: 287 A--TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEE 322
A TSYDYDAP+ E G + + K+ LR + + K E
Sbjct: 326 AQPTSYDYDAPLSEAGDLTE-KYFALRSVIQKFKETPE 362
>gi|423260402|ref|ZP_17241324.1| hypothetical protein HMPREF1055_03601 [Bacteroides fragilis
CL07T00C01]
gi|423266536|ref|ZP_17245538.1| hypothetical protein HMPREF1056_03225 [Bacteroides fragilis
CL07T12C05]
gi|387774956|gb|EIK37065.1| hypothetical protein HMPREF1055_03601 [Bacteroides fragilis
CL07T00C01]
gi|392699768|gb|EIY92937.1| hypothetical protein HMPREF1056_03225 [Bacteroides fragilis
CL07T12C05]
Length = 624
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 110/340 (32%), Positives = 158/340 (46%), Gaps = 54/340 (15%)
Query: 16 GKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDL 75
G+ + SG +HY R + W ++ K GL + TYVFWN HE G++ F G +L
Sbjct: 35 GETTPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94
Query: 76 VRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAK 135
+++ E G+ + LR GPY CAEW +GG+P WL IPG++ R N F + K++
Sbjct: 95 AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNTEFLKYTKKY--- 151
Query: 136 IIDLMKQE--NLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV-P 192
ID + QE L ++GGPII+ Q ENE+G+ YV D + + S
Sbjct: 152 -IDRLYQEVGPLQCTKGGPIIMVQCENEFGS-----------YVSQRKDISFEEHRSYNA 199
Query: 193 WVMCQQEDA--PDPIINTCNGFYCDGF-------TPNSPSK----------------PIM 227
+ Q DA P+ + + +G T N S P M
Sbjct: 200 KIKGQLADAGFTVPLFTSDGSWLFEGGCVAGALPTANGESDIANLKKVVNQYHGGKGPYM 259
Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA 287
E Y GW +G P ++A + + +F N+YM GGTNFG T+G
Sbjct: 260 VAEFYPGWLSHWGEPFPQVSASEIARQTEAYLQNDVSF-NFYMVHGGTNFGFTSGANYDK 318
Query: 288 --------TSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIK 318
TSYDYDAPI E G+I PK+ +R + K +K
Sbjct: 319 KRDIQPDLTSYDYDAPISEAGWI-TPKYDSIRSVIQKYVK 357
>gi|336410484|ref|ZP_08590961.1| hypothetical protein HMPREF1018_02978 [Bacteroides sp. 2_1_56FAA]
gi|335944314|gb|EGN06136.1| hypothetical protein HMPREF1018_02978 [Bacteroides sp. 2_1_56FAA]
Length = 769
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 107/334 (32%), Positives = 154/334 (46%), Gaps = 31/334 (9%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ N T +++GK +++ +HY R W I K G+ I YVFWN HE
Sbjct: 18 AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
GQ+ F G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL I RT
Sbjct: 78 QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTL 137
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
+ F E F+ ++ + L ++GG II+ QVENEYG AY V + YV
Sbjct: 138 DPYFMERTAIFMKEVGKQLAP--LQITRGGNIIMVQVENEYG----AYAV-DKPYVSAIR 190
Query: 182 DTAVNLN-TSVPWVMCQ-----QEDAPDPIINTCNGFYCDG---------FTPNSPSKPI 226
D + T VP C + D ++ T N + G P P+
Sbjct: 191 DIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTIN--FGTGANIEQQFKRLREARPETPL 248
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
M +E +SGWF +G RP + + + + +F + YM GGT FG G
Sbjct: 249 MCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNP 307
Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
+ +SYDYDAPI E G+ K+ LR+L K
Sbjct: 308 SYSAMCSSYDYDAPISEPGWTTD-KYFLLRDLLK 340
Score = 43.9 bits (102), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 67/162 (41%), Gaps = 46/162 (28%)
Query: 548 IDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGS--TLPVNKSLIWYKTT 604
++L GK+ W +Y V+ S +K G+ T+P +Y+TT
Sbjct: 481 VELVRGKQSAELKNWTVYSFPVD--------YSFVQDKRYKNGTAQTMPA-----YYRTT 527
Query: 605 FLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQK 664
F + G L++++ GKG WVNG +IGR+W
Sbjct: 528 FRL-DKVGDTFLDMSTWGKGMVWVNGLAIGRFWEI------------------------- 561
Query: 665 HCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
P QTL+ +P W+ GEN +++ + G + + I L K
Sbjct: 562 ---GPQQTLF-MPGCWLKEGENEIIVLDLKGPEKASIRGLKK 599
>gi|423270210|ref|ZP_17249181.1| hypothetical protein HMPREF1079_02263 [Bacteroides fragilis
CL05T00C42]
gi|423276168|ref|ZP_17255110.1| hypothetical protein HMPREF1080_03763 [Bacteroides fragilis
CL05T12C13]
gi|392698134|gb|EIY91316.1| hypothetical protein HMPREF1079_02263 [Bacteroides fragilis
CL05T00C42]
gi|392699308|gb|EIY92489.1| hypothetical protein HMPREF1080_03763 [Bacteroides fragilis
CL05T12C13]
Length = 769
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 106/334 (31%), Positives = 154/334 (46%), Gaps = 31/334 (9%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ N T +++GK +++ +HY R W I K G+ I YVFWN HE
Sbjct: 18 AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
GQ+ F G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL I RT
Sbjct: 78 QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTL 137
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
+ F E F+ ++ + L ++GG II+ QVENEYG AY V + YV
Sbjct: 138 DPYFMERTAIFMKEVGKQLAP--LQITRGGNIIMVQVENEYG----AYAV-DKPYVSAIR 190
Query: 182 DTAVNLN-TSVPWVMCQ-----QEDAPDPIINTCNGFYCDG---------FTPNSPSKPI 226
D + T VP C + D ++ T N + G P P+
Sbjct: 191 DIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTIN--FGTGANIEQQFKRLREARPETPL 248
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
M +E +SGWF +G RP + + + + +F + YM GGT FG G
Sbjct: 249 MCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNP 307
Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
+ +SYDYDAPI E G+ K+ LR+L +
Sbjct: 308 SYSAMCSSYDYDAPISEPGWTTD-KYFQLRDLLR 340
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 61/238 (25%), Positives = 97/238 (40%), Gaps = 57/238 (23%)
Query: 474 LNIESLGHAALVFVNKKLVA-FGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
+ I + A VFV+ KL+A +FA L L +G +DIL +G N+
Sbjct: 414 MKITEVHDWAQVFVDGKLLARLDRRRGEFALQLP----ALKKGTR-IDILVEAMGRVNFD 468
Query: 533 -AWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGS 590
+ D G ++L GK+ W +Y V+ S +K G+
Sbjct: 469 ESIHDRKGI----TEKVELVRGKQSAELKNWTVYSFPVD--------YSFVQDKRYKNGT 516
Query: 591 --TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
T+P +Y+TTF + G L++++ GKG WVNG +IGR+W
Sbjct: 517 ARTMPA-----YYRTTFRL-DKVGDTFLDMSTWGKGMVWVNGLAIGRFWEI--------- 561
Query: 649 KKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
P QTL+ +P W+ GEN +++ + G + + I L K
Sbjct: 562 -------------------GPQQTLF-MPGCWLKEGENEIIVLDLKGPEKASIRGLKK 599
>gi|307289489|ref|ZP_07569436.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
gi|422703871|ref|ZP_16761687.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
gi|306499556|gb|EFM68926.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
gi|315164595|gb|EFU08612.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
Length = 593
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 112/352 (31%), Positives = 165/352 (46%), Gaps = 44/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++ FV+ ++ L + LR Y CAEW +GG P WL ++ R+T+ F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKSVRLRSTDPIFMTKVRN 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ QVENEYG +YG+ + Y++ L V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
P + + A + +++ D F T + P+M E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
+ GWF +G V R DLA V G N YM+ GGTNFG R A
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
TSYDYDA + E G + + + KAIK +C E + + P +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346
>gi|139439964|ref|ZP_01773301.1| Hypothetical protein COLAER_02339 [Collinsella aerofaciens ATCC
25986]
gi|133774730|gb|EBA38550.1| glycosyl hydrolase family 35 [Collinsella aerofaciens ATCC 25986]
Length = 598
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 102/329 (31%), Positives = 147/329 (44%), Gaps = 33/329 (10%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++D + + SG+IHY R P W + K G +ETYV WN HEP G + F G
Sbjct: 10 FLLDDEPFTILSGAIHYMRVHPSDWHHSLYNLKALGFNTVETYVPWNLHEPKPGVFDFSG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL F+ GL+ +R P+ CAEW +GG P WL ++ R+++ F + +
Sbjct: 70 SIDLAAFLDEAASLGLYAIVRPSPFICAEWEFGGMPAWLLREHDMRPRSSDPKFLAHVAQ 129
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ ++ ++ + +GG II+ QVENEYG+ + Y++ V SV
Sbjct: 130 YYDHLMPILVSRQI--DKGGNIIMMQVENEYGSY-----CEDKDYLRAIRRLMVERGVSV 182
Query: 192 -------PWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSK-----------PIMWTENYS 233
PW C + C G + N + P+M E +
Sbjct: 183 PLCTSDGPWRGCLRAGTLIDDDVLCTGNFGSHAKENFEALSAFHKEHGKQWPLMCMELWD 242
Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGPLV 286
GWF +G V R EDLA V E GG+ N YM+ GGTNFG R
Sbjct: 243 GWFNRYGENVIRRDPEDLASCVREVLELGGSL-NLYMFHGGTNFGFMNGCSARHTHDLHQ 301
Query: 287 ATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
TSYDYDAP+DE G + + R +H+
Sbjct: 302 VTSYDYDAPLDEQGNPTEKYFAIQRTVHE 330
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 82/335 (24%), Positives = 126/335 (37%), Gaps = 78/335 (23%)
Query: 393 AKVISQRNNGDHPFAQQKNVNELL--------LASSAFSWYE----EKVGISGNRSFVRP 440
A + Q N + FA Q+ V+EL L AFS + E+V + +
Sbjct: 309 APLDEQGNPTEKYFAIQRTVHELYPDIAQSKPLTKKAFSMPDISVSERVSLFNVLDILSE 368
Query: 441 DLAEQ----INTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGY 496
+ Q + + Y YT ++ + E + + A +FVN VA Y
Sbjct: 369 PIEAQYPMPMEEMGQSYGYTLYTTTVE--RDRADEERIRVIDARDRAQMFVNGDKVATQY 426
Query: 497 GNHDFANFLINKKIE--LNEGINTLDILSMMVGLQNYGAWF--DVAGAGLFSVILIDLKN 552
H I + I L N LD+L+ +G NYG D G+ + + +DL
Sbjct: 427 QEH------IGEDIHCVLPCEHNRLDVLTEDMGRVNYGHKLLADTQHKGIRTGVCVDLH- 479
Query: 553 GKRDLSSGEWIYQVGVEGEYIGLDKI-SLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGK 611
+ G E + LD I +L S+ W +G +Y+ F E
Sbjct: 480 -----------FVTGWEMRCLPLDNIDNLDYSAGWVEGQP-------SFYRAKFDISEPA 521
Query: 612 GPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQ 671
++ GKG A+VNG ++GR+W P
Sbjct: 522 DTF-IDTTGFGKGVAFVNGTNVGRFWDK----------------------------GPIM 552
Query: 672 TLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
TLY +P +HPG N LV+ E G +KISL ++
Sbjct: 553 TLY-VPHGLLHPGTNELVMFETEGVYDAKISLRSE 586
>gi|423278914|ref|ZP_17257828.1| hypothetical protein HMPREF1203_02045 [Bacteroides fragilis HMW
610]
gi|404585906|gb|EKA90510.1| hypothetical protein HMPREF1203_02045 [Bacteroides fragilis HMW
610]
Length = 769
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 105/332 (31%), Positives = 153/332 (46%), Gaps = 31/332 (9%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
+ N T +++GK +++ +HY R W I K G+ I YVFWN HE
Sbjct: 18 AQNFTIGKNTFLLNGKSFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
G++ F G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL I RT
Sbjct: 78 QTEGKFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTL 137
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
+ F E F+ ++ + L ++GG II+ QVENEYG AY V + YV
Sbjct: 138 DPYFMERTAIFMKEVGKQLAP--LQITRGGNIIMVQVENEYG----AYAV-DKPYVSAIR 190
Query: 182 DTAVNLN-TSVPWVMCQ-----QEDAPDPIINTCNGFYCDG---------FTPNSPSKPI 226
D + T VP C + D ++ T N + G P P+
Sbjct: 191 DIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTIN--FGTGANIEQQFKRLKEARPDTPL 248
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
M +E +SGWF +G RP + + + + +F + YM GGT FG G
Sbjct: 249 MCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNP 307
Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLREL 313
+ +SYDYDAPI E G+ K+ LR+L
Sbjct: 308 AYSAMCSSYDYDAPISEPGWATD-KYFQLRDL 338
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 34/130 (26%), Positives = 55/130 (42%), Gaps = 37/130 (28%)
Query: 579 SLANSSFWKQGS--TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRY 636
S +K G+ T+P +YK TF + G L++++ GKG WVNG +IGR+
Sbjct: 505 SFVQDKKYKSGTAQTMPA-----YYKATFHLDKA-GDTFLDMSTWGKGMVWVNGIAIGRF 558
Query: 637 WSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGG 696
W P QTL+ +P W+ GEN +++ + G
Sbjct: 559 WEI----------------------------GPQQTLF-MPGCWLKEGENEIIVLDLKGP 589
Query: 697 DPSKISLLTK 706
+ + + L K
Sbjct: 590 EKASVRGLKK 599
>gi|328956117|ref|YP_004373450.1| beta-galactosidase [Coriobacterium glomerans PW2]
gi|328456441|gb|AEB07635.1| Beta-galactosidase [Coriobacterium glomerans PW2]
Length = 597
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 101/314 (32%), Positives = 156/314 (49%), Gaps = 38/314 (12%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DG+ ++SG+IHY R P+ W + K G +ETY+ WN HEP + ++
Sbjct: 12 MDGRPFQIRSGAIHYFRLHPDDWEHSLYNLKAMGFNTVETYIPWNMHEPHKDEFRITAET 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
D RF+ + GL+ +R P+ CAEW +GG P WL G++ R+ + F E + +
Sbjct: 72 DFERFLGLASDLGLWAIVRPSPFICAEWEFGGLPAWLLAERGMRIRSNDPRFLERLALYY 131
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYG----NVEWAYGVGGELYVKWAADTAVNLNT 189
++ + + + ++G II+ Q+ENEYG + ++ V +L V+ D V L T
Sbjct: 132 DMLMPHLAKHQI--TRGANIIMMQIENEYGSYCEDSDYMRSV-RDLMVERGID--VKLCT 186
Query: 190 SV-PWVMCQQEDA--PDPIINTCN-------------GFYCDGFTPNSPSKPIMWTENYS 233
S PW CQ+ + D ++ T N GF+ + + + P+M E ++
Sbjct: 187 SDGPWRACQRAGSLIEDNVLATGNFGSHATENFAALKGFHKE----HGKTWPLMCMEFWA 242
Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV------- 286
GWF +G +V R E+LA +V G N YM+ GGTNFG G
Sbjct: 243 GWFNRWGESVVRRDPEELARSVREALREGSI--NLYMFHGGTNFGFMNGCSARHDHDLHQ 300
Query: 287 ATSYDYDAPIDEYG 300
TSYDYDAP+DE G
Sbjct: 301 ITSYDYDAPLDEAG 314
>gi|319945941|ref|ZP_08020191.1| beta-galactosidase [Streptococcus australis ATCC 700641]
gi|417919516|ref|ZP_12563047.1| glycosyl hydrolase family 35 [Streptococcus australis ATCC 700641]
gi|319748006|gb|EFW00250.1| beta-galactosidase [Streptococcus australis ATCC 700641]
gi|342832897|gb|EGU67186.1| glycosyl hydrolase family 35 [Streptococcus australis ATCC 700641]
Length = 595
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 105/323 (32%), Positives = 153/323 (47%), Gaps = 44/323 (13%)
Query: 23 SGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTV 82
SG+IHY R E W + K G +ETYV WN HEP RG ++FEG DL F++
Sbjct: 21 SGAIHYFRIDREDWYHSLYNLKALGFNTVETYVPWNAHEPQRGHFHFEGNLDLEHFIQVA 80
Query: 83 QEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQ 142
QE L++ LR P+ C+EW +GG P WL ++ R+++ F EE+ R+ +++ + +
Sbjct: 81 QELDLYVILRPSPFICSEWEFGGLPAWL-IEKDLRIRSSDPAFLEEVARYYDELLPRVAK 139
Query: 143 ENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAP 202
L +GG I++ QVENEYG +YG + Y++ D + + + P D P
Sbjct: 140 YQL--DRGGNILMMQVENEYG----SYG-EDKAYLRAIRDLMIERDITCPLFTS---DGP 189
Query: 203 DPIINTCNGFYCDG---------------------FTPNSPSKPIMWTENYSGWFLSFGY 241
DG F + P+M E + GWF +
Sbjct: 190 WRATLRAGTLIEDGLFVTGNFGSRANYNFSQMKEFFAEHDRKWPLMCMEFWDGWFNRWKE 249
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG--------RTAGGPLVATSYDYD 293
+ R E+LA AV + G N YM+ GGTNFG T P V TSYDYD
Sbjct: 250 PIIKRDPEELAEAVHEVLQEGSI--NLYMFHGGTNFGFMNGCSARGTVDLPQV-TSYDYD 306
Query: 294 APIDEYGFIRQPKWGHLRELHKA 316
A +DE G PK+ ++++ K
Sbjct: 307 ALLDEQG-NPTPKYDAVKKMMKT 328
>gi|294665218|ref|ZP_06730516.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 10535]
gi|292605006|gb|EFF48359.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 10535]
Length = 613
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 115/322 (35%), Positives = 156/322 (48%), Gaps = 29/322 (9%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
V DGK L SG+IH+ R W + ++K++ GL +ETYVFWN EP +GQ+ F G
Sbjct: 39 FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 98
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ FV+ GL + LR GPYACAEW GG+P WL I+ R+ + F +
Sbjct: 99 NNDVAAFVREAAAQGLNIILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQA 158
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGE-LYVKWAADTAVNLN 188
+L + + + + L GGPII QVENEYG+ + AY +YVK D A+ L
Sbjct: 159 YLDALANQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LF 215
Query: 189 TSVPWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
TS M PD ++N G D P +P M E ++GWF +G
Sbjct: 216 TSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGK-- 273
Query: 244 PFRPVEDLAFAVARFFE---TGGTFQNYYMYFGGTNFGRTAGGPL----------VATSY 290
P + A A FE G + YM+ GGT+FG G TSY
Sbjct: 274 PHAATD--ARQQAEEFEWILRQGHSASLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSY 331
Query: 291 DYDAPIDEYGFIRQPKWGHLRE 312
DYDA +DE G PK+ +R+
Sbjct: 332 DYDAILDEAGHP-TPKFALMRD 352
>gi|357050580|ref|ZP_09111778.1| hypothetical protein HMPREF9478_01761 [Enterococcus saccharolyticus
30_1]
gi|355381233|gb|EHG28360.1| hypothetical protein HMPREF9478_01761 [Enterococcus saccharolyticus
30_1]
Length = 593
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 109/314 (34%), Positives = 148/314 (47%), Gaps = 35/314 (11%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G L SG+IHY R P+ W + K G +ETYV WN HEP +G + FEG
Sbjct: 10 FLMNGSPFKLLSGAIHYFRVHPDDWRHSLYNLKALGFNTVETYVPWNLHEPHKGLFQFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL F+ QE GL++ LR PY CAEW +GG P WL G + R + + +
Sbjct: 70 ILDLEHFLSLAQELGLYVILRPSPYICAEWEFGGLPAWLLKESG-RLRACDPSYLAHVAE 128
Query: 132 F----LAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVG-GELYVKWAADTA 184
+ L KII S GG I++ QVENEYG+ E AY E+ + D
Sbjct: 129 YYDVLLPKIIPYQ------LSHGGNILMIQVENEYGSYGEEKAYLRAIKEMLINRGIDMP 182
Query: 185 VNLNTSVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYS 233
+ + PW + + D ++ T N D F ++ P+M E +
Sbjct: 183 L-FTSDGPWQAALRAGSLIEDDVLVTGNFGSRAKENFAAMQDFFDQHNKKWPLMCMEFWD 241
Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGPLV 286
GWF + + R +DLA +V E G N YM+ GGTNFG R A
Sbjct: 242 GWFNRWNEPIIRRDPDDLAESVKEALEIGSV--NLYMFHGGTNFGFMNGCSARGAVDLPQ 299
Query: 287 ATSYDYDAPIDEYG 300
TSYDYDAP+DE G
Sbjct: 300 VTSYDYDAPLDEQG 313
>gi|417923406|ref|ZP_12566873.1| glycosyl hydrolase family 35 [Streptococcus mitis SK569]
gi|342837055|gb|EGU71256.1| glycosyl hydrolase family 35 [Streptococcus mitis SK569]
Length = 595
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 103/307 (33%), Positives = 149/307 (48%), Gaps = 25/307 (8%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK + SG+IHY R PE W + K G +ETYV WN HEP G++ FEG
Sbjct: 12 LDGKPFKILSGAIHYFRIPPEDWSHSLYNLKALGFNTVETYVAWNLHEPREGEFNFEGAL 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL RF++ Q+ GL+ +R P+ CAEW +GG P WL ++ R+++ + E + R+
Sbjct: 72 DLERFLQIAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKDMRIRSSDPAYIEAVGRYY 130
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
+++ + L +GG I++ QVENEYG+ + AY ++ T +
Sbjct: 131 DQLLSRLVPHLL--DKGGNILMMQVENEYGSYGEDKAYLRAIRHLMEERGVTCPLFTSDG 188
Query: 192 PWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
PW + D + T N + F + P+M E + GWF +
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
+ R ++LA AV E G N YM+ GGTNFG G G L TSYDYD
Sbjct: 249 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYD 306
Query: 294 APIDEYG 300
A +DE G
Sbjct: 307 ALLDEEG 313
>gi|453049630|gb|EME97211.1| beta-galactosidase [Streptomyces mobaraensis NBRC 13819 = DSM
40847]
Length = 584
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 109/313 (34%), Positives = 147/313 (46%), Gaps = 39/313 (12%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
IDG+ L SG++HY R WP + + GL +ETYV WN HEP+ G+ + G
Sbjct: 13 IDGREVRLLSGALHYFRVHEGHWPHRLAMLRAMGLNCVETYVPWNRHEPVEGRLHDVG-- 70
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
+L RF+ AGL+ +R GPY CAEW GG P WL G + RT++ F + +L
Sbjct: 71 ELGRFLDAAGAAGLYAIVRPGPYVCAEWENGGLPHWLTGRLGRRVRTSDPEFLRAVDGWL 130
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
+ + +GGP++L QVENEYG+ YG + Y++ + VP
Sbjct: 131 EAVGAELTGRQF--GRGGPVVLVQVENEYGS----YG-SDQPYLEHLVGRLRDSGVVVPL 183
Query: 194 VMCQQEDAPDPIINTCNGFYCDGFTPN---------------SPSKPIMWTENYSGWFLS 238
V D P+ + T T N P+ P+M E + GWF
Sbjct: 184 VTS---DGPEDHMLTGGTVPGATATVNFGSGAREAFRVLRRHRPAGPLMCMEFWCGWFAH 240
Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG---------GPL--VA 287
+G A R + A A+ E G + N YM GGTNFG AG G L
Sbjct: 241 WGGAPAARDAGEAAEALREVLECGASV-NVYMAHGGTNFGGWAGANRAGAEHRGALRPTT 299
Query: 288 TSYDYDAPIDEYG 300
TSYDYDAP+DEYG
Sbjct: 300 TSYDYDAPVDEYG 312
>gi|422877900|ref|ZP_16924370.1| beta-galactosidase [Streptococcus sanguinis SK1056]
gi|332358593|gb|EGJ36417.1| beta-galactosidase [Streptococcus sanguinis SK1056]
Length = 592
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 107/333 (32%), Positives = 155/333 (46%), Gaps = 32/333 (9%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK + SG+I Y R P+ W + + K G +ETY+ W HEP GQ+ EG
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
D + K V+E GL+L +R PY CAE+++GG P WL P ++ R + F E++ F
Sbjct: 72 DFEAYFKLVKEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
+ + + QGGPI++ QVENEYG+ + AY +K T +
Sbjct: 132 DWLFPKLLPYQ--SDQGGPILMMQVENEYGSYAEDKAYMRSIAQMMKVRGVTVPLFTSDG 189
Query: 192 PWVMCQQ-----ED--------APDPIINTCNGFYCDGFTPNSPSK-PIMWTENYSGWFL 237
W+ + ED P NT N F K P+M TE + GWF
Sbjct: 190 TWIEALESGTLIEDDIFVTGNFGSQPKENTDN---LRAFMERYGKKWPLMCTEFWDGWFS 246
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG--------RTAGGPLVATS 289
+ + +R EDLA V + G N ++ GGTNFG +T P + TS
Sbjct: 247 RWSEEIVWREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQI-TS 303
Query: 290 YDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEE 322
YD+DAPI E+G + + R H+ E+
Sbjct: 304 YDFDAPITEWGQPTEKYYAVQRVTHEVFPELEQ 336
>gi|188990653|ref|YP_001902663.1| beta-galactosidase [Xanthomonas campestris pv. campestris str.
B100]
gi|167732413|emb|CAP50607.1| exported beta-galactosidase [Xanthomonas campestris pv. campestris]
Length = 680
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 109/320 (34%), Positives = 152/320 (47%), Gaps = 25/320 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
V DGK + SG+IH+ R W + ++K++ GL +ETYVFWN EP +GQ+ F
Sbjct: 106 FVRDGKPYQVLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFNA 165
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ FV+ GL + LR GPYACAEW GG+P WL I+ R+ + F +
Sbjct: 166 NNDVAAFVREAAAQGLNVILRPGPYACAEWETGGYPAWLFGKDNIRVRSRDPRFLAASQA 225
Query: 132 FLAKIIDLMKQEN-LFASQGGPIILAQVENEYGNVEWAYGVGGE---LYVKWAADTAVNL 187
+L + KQ + L GGPII QVENEYG+ + + + +YVK D A+ L
Sbjct: 226 YLDAV---SKQVHPLLNHNGGPIIAVQVENEYGSYDDDHAYMADNRAMYVKAGFDDAL-L 281
Query: 188 NTSVPWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
TS M PD ++N G D P +P M E ++GWF +G
Sbjct: 282 FTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGKP 341
Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL----------VATSYDY 292
+ + G + N YM+ GGT+FG G TSYDY
Sbjct: 342 HASTDAKQQTEELEWILRQGHS-ANLYMFIGGTSFGFMNGANFQGNPSDHYAPQTTSYDY 400
Query: 293 DAPIDEYGFIRQPKWGHLRE 312
DA +DE G PK+ +R+
Sbjct: 401 DAILDEAGRA-TPKFALMRD 419
>gi|15837442|ref|NP_298130.1| beta-galactosidase [Xylella fastidiosa 9a5c]
gi|9105744|gb|AAF83650.1|AE003923_8 beta-galactosidase [Xylella fastidiosa 9a5c]
Length = 612
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 107/313 (34%), Positives = 149/313 (47%), Gaps = 34/313 (10%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+ DG+ L SG+IH+ R W + ++K++ GL +ETYVFWN E GQ+ F G
Sbjct: 35 FIRDGRPYQLISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVELREGQFDFTG 94
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ FV+ GL + LR GPY CAEW GGFP WL P ++ R+ + F + +R
Sbjct: 95 NNDISAFVREAASQGLNVILRPGPYVCAEWEAGGFPAWLFADPTLRVRSQDPRFLDASQR 154
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAY-----------GVGGELYVK 178
+L + ++ L GGPII QVENEYG+ + Y G+GG L
Sbjct: 155 YLEALGTQVRP--LLNGNGGPIIAVQVENEYGSYGDDHGYLQAVRALFIKAGLGGALL-- 210
Query: 179 WAADTAVNL-NTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFL 237
+ AD A L N ++P V+ AP D P +P + E ++GWF
Sbjct: 211 FTADGAQMLGNGTLPDVLAAVNVAPGEAKQA-----LDKLATFHPGQPQLVGEYWAGWFD 265
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-----RTAGGPL-----VA 287
+G + A + G + N YM+ GGT+FG GGP
Sbjct: 266 QWGKPHAQTDAKQQADEIEWMLRQGHSI-NLYMFVGGTSFGFMNGANFQGGPSDHYSPQT 324
Query: 288 TSYDYDAPIDEYG 300
TSYDYDA +DE G
Sbjct: 325 TSYDYDAALDEAG 337
>gi|345880280|ref|ZP_08831835.1| hypothetical protein HMPREF9431_00499 [Prevotella oulorum F0390]
gi|343923634|gb|EGV34320.1| hypothetical protein HMPREF9431_00499 [Prevotella oulorum F0390]
Length = 621
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 104/334 (31%), Positives = 159/334 (47%), Gaps = 50/334 (14%)
Query: 15 DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE-GRF 73
DGK + SG +HY R W ++ K GL + +YVFWN+HE G + ++ G
Sbjct: 39 DGKPTQIHSGELHYARVPAPYWRHRLQMMKAMGLNAVTSYVFWNHHETSPGVWDWQTGNH 98
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
++ F+K E GL + LR GPY CAEW +GG+P WL G+ RT N PF + + ++
Sbjct: 99 NIRNFIKIAGEEGLMVILRPGPYCCAEWEFGGYPWWLPKAKGLVIRTDNKPFLDSCRVYI 158
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTSVP 192
++ + ++ +L ++GGP+++ Q ENE+G+ V + E++ K+AA
Sbjct: 159 NQLANQVR--DLQITKGGPVVMVQAENEFGSYVAQRKDIPLEVHKKYAAQ---------- 206
Query: 193 WVMCQQEDAP-DPIINTCNGFY------CDGFTPNSPSK------------------PIM 227
+ Q DA D + T +G + +G P + + P M
Sbjct: 207 -IRQQLLDAGFDIPMFTSDGSWLFKGGSIEGALPTANGEGNIEKLKQVVNEYHGGVGPYM 265
Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV- 286
E Y GW + P E + ++ + G +F NYYM GGTNFG T G
Sbjct: 266 VAEFYPGWLSHWAEPFPRVSTESVVKQTKKYLDNGVSF-NYYMVHGGTNFGFTTGANYSN 324
Query: 287 -------ATSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDAPI E G+ + K+ +R L
Sbjct: 325 ATNLQPDMTSYDYDAPISEAGWATE-KYNAIRAL 357
>gi|302526862|ref|ZP_07279204.1| beta-galactosidase [Streptomyces sp. AA4]
gi|302435757|gb|EFL07573.1| beta-galactosidase [Streptomyces sp. AA4]
Length = 609
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 100/314 (31%), Positives = 148/314 (47%), Gaps = 35/314 (11%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DGK + SG+IHY R P+ W + + + K GL +ETYV WN+H+P G+ F G
Sbjct: 40 FLLDGKPFQIVSGAIHYFRLRPDQWHDRLSRLKALGLNTVETYVAWNFHQPTPGRADFRG 99
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL F++T E G + +R PY CAEW +GG P WL ++ R + + + +
Sbjct: 100 DRDLPAFIRTAGELGFQVIVRPSPYICAEWEFGGLPAWLLADRNMELRCADPAYLKAVDA 159
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAY-----------GVGGELYVK 178
+ ++I + L A GGPI+ Q+ENEYG+ + +Y G+ L+V
Sbjct: 160 WYDQLIPQLTP--LEAQHGGPIVAVQIENEYGSYGNDTSYLAHLRDSLRSRGITSLLFVA 217
Query: 179 WAADTAVNLNTSVPWVM--CQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWF 236
A +P + + P P I F P P+M E + GWF
Sbjct: 218 DGASEFFMRFGELPGTLEAGTGDGDPAPSIAALKAF--------RPGAPVMMAEYWDGWF 269
Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVAT 288
+G + A + + TG + N YM GGTN+G TAG P V T
Sbjct: 270 DHWGEPHHTTDPQQTAAHIDQLLATGASV-NLYMACGGTNYGFTAGANTSGLQYQPTV-T 327
Query: 289 SYDYDAPIDEYGFI 302
SYDYD+P+ E G +
Sbjct: 328 SYDYDSPVGEAGDV 341
>gi|315499712|ref|YP_004088515.1| glycoside hydrolase family 35 [Asticcacaulis excentricus CB 48]
gi|315417724|gb|ADU14364.1| glycoside hydrolase family 35 [Asticcacaulis excentricus CB 48]
Length = 613
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 110/352 (31%), Positives = 162/352 (46%), Gaps = 39/352 (11%)
Query: 3 ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
+ T ++DG+ L +G +HYPR E+W + +RK K GL + TY FW+ HE
Sbjct: 30 SRFTIKDDQFLLDGQPLHLMAGEMHYPRIPRELWRDRLRKLKALGLNTLSTYTFWSAHEK 89
Query: 63 IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
G Y F G D+ +VK QE GL + LR GPYACAEW+ GG+P W P I+ R+ +
Sbjct: 90 KPGVYDFSGNLDVAAWVKMAQEEGLHVLLRPGPYACAEWDNGGYPAWFLNDPDIRPRSLD 149
Query: 123 ----NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYG-------------NV 165
P + +KR ++ +L +GGP+++ Q+ENEYG +
Sbjct: 150 PRYMGPSGQWLKRLGQEVA------HLEIDKGGPVLMTQIENEYGSYGNDLNYMRAVRDQ 203
Query: 166 EWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKP 225
A G G+LY A AV N ++P + + G + + P
Sbjct: 204 VRAAGFSGQLYTVDGA--AVIENGALPELFNGINFG---TYDKAEGEFAR-YAKFKTKGP 257
Query: 226 IMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL 285
M TE + GWF FG + L ++ + +F ++YM GGT+F AG
Sbjct: 258 RMCTELWGGWFDHFGEVHSNMEISPLMESLKWMLDNRISF-SFYMLHGGTSFAFDAGANF 316
Query: 286 VAT--------SYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
T SYDYDA +DE G + PK+ REL + E + +P
Sbjct: 317 HKTHGYQPDISSYDYDAMLDEAGRV-TPKYEAARELFRRYLPPERFTALPEP 367
Score = 42.7 bits (99), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 61/130 (46%), Gaps = 21/130 (16%)
Query: 509 KIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGV 568
++ L G +TLD+L +G NYG GL + + NGK L+ W +Q
Sbjct: 455 EVSLKAG-DTLDLLIDAMGHVNYGDQIGKDQKGLIGPVTL---NGK-PLTG--WTHQG-- 505
Query: 569 EGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWV 628
+ LD +S+ F +Q P +Y+ TF E G L+L GKG WV
Sbjct: 506 ----VPLDDLSVLR--FKRQRVNGPA-----FYRGTFETSEA-GFTFLDLRGWGKGYVWV 553
Query: 629 NGQSIGRYWS 638
NG ++GRYWS
Sbjct: 554 NGHNLGRYWS 563
>gi|237734327|ref|ZP_04564808.1| beta-galactosidase [Mollicutes bacterium D7]
gi|365831197|ref|ZP_09372750.1| hypothetical protein HMPREF1021_01514 [Coprobacillus sp. 3_3_56FAA]
gi|374624872|ref|ZP_09697289.1| hypothetical protein HMPREF0978_00609 [Coprobacillus sp.
8_2_54BFAA]
gi|229382557|gb|EEO32648.1| beta-galactosidase [Coprobacillus sp. D7]
gi|365262188|gb|EHM92085.1| hypothetical protein HMPREF1021_01514 [Coprobacillus sp. 3_3_56FAA]
gi|373916155|gb|EHQ47903.1| hypothetical protein HMPREF0978_00609 [Coprobacillus sp.
8_2_54BFAA]
Length = 584
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 102/334 (30%), Positives = 154/334 (46%), Gaps = 43/334 (12%)
Query: 9 HRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYY 68
++ I+G + + SG++HY R PE W + + K G +ETYV WN HEP +G+Y
Sbjct: 7 NKEFFINGNKVKIISGAVHYFRIVPEYWRDTLLDLKAMGCNTVETYVPWNLHEPYQGKYD 66
Query: 69 FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEE 128
F G D+ F+K +E LF+ LR PY CAEW GG P WL P I+ RT + + +
Sbjct: 67 FSGIKDIETFLKLAEELELFVILRASPYICAEWEMGGLPAWLLKYPRIRLRTNDKQYLKC 126
Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN 188
+ ++ + ++ + + + +Q GPIILAQ+ENEYG +YG E Y+
Sbjct: 127 LDQYFSILLPKLSKYQI--TQNGPIILAQLENEYG----SYGEDKE-YLLAVYQMMRKYG 179
Query: 189 TSVPWV--------------MCQQEDAP--------DPIINTCNGFYCDGFTPNSPSKPI 226
VP + +++ P I F + + P+
Sbjct: 180 IEVPLFTADGTWHEALNAGSLLEKKVFPTGNFGSQAKENITVLKKF----MESHQITAPL 235
Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------R 279
M E + GWF + + R ++ + G N+YM+ GGTNFG R
Sbjct: 236 MCMEFWDGWFNRWNQEIIKRDPQEFVNSAQEMLSLGSV--NFYMFQGGTNFGWMNGCSAR 293
Query: 280 TAGGPLVATSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDA + EYG + K+ LRE+
Sbjct: 294 KEHDLPQITSYDYDAILTEYG-AKTEKYHLLREV 326
>gi|395520729|ref|XP_003764476.1| PREDICTED: beta-galactosidase-1-like protein 2 [Sarcophilus
harrisii]
Length = 704
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 107/322 (33%), Positives = 158/322 (49%), Gaps = 25/322 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G + GSIHY R E W + + K K GL + TY+ WN HEP RG++ F G
Sbjct: 122 FLLEGSHFQIFGGSIHYFRVPREYWRDRLLKLKACGLNTLTTYIPWNLHEPERGKFNFSG 181
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ FV+ + GL++ LR GPY C+EW+ GG P WL ++ RTT F + + R
Sbjct: 182 NLDVEAFVQMAADIGLWVILRPGPYICSEWDLGGLPSWLLQDSSMELRTTYAGFLKAVDR 241
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ +I + L QGGPII QVENEYG+ + Y+ + ++ +
Sbjct: 242 YFNHLIP--RVVPLQYKQGGPIIAVQVENEYGSYD-----KDSNYMPYIKKALMSRGINE 294
Query: 192 PWVMCQQEDA-----PDPIINTCNGFYCDGFTPN-----SPSKPIMWTENYSGWFLSFGY 241
+ +D + ++ T N + D N +KP M TE ++GWF ++G
Sbjct: 295 LLMTSDNKDGLSGGYLEGVLATVNLKHVDSMIFNYLHSFQENKPTMVTEYWTGWFDTWGG 354
Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPLVA--TSYDYDAP 295
+D+ V+ + G + N YM+ GGTNFG G G +A TSYDYDA
Sbjct: 355 PHNIVDADDVVVTVSSIIQMGASL-NLYMFHGGTNFGFMNGAQHFGEYLADVTSYDYDAI 413
Query: 296 IDEYGFIRQPKWGHLRELHKAI 317
+ E G PK+ LRE I
Sbjct: 414 LTEAG-DYTPKFFKLREFFSTI 434
>gi|294779195|ref|ZP_06744602.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
gi|294453706|gb|EFG22101.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
Length = 592
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 112/352 (31%), Positives = 165/352 (46%), Gaps = 44/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 10 FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++ FV+ ++ L + LR Y CAEW +GG P WL ++ R+T+ F +++
Sbjct: 70 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKDVRLRSTDPIFMTKVRN 129
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ QVENEYG +YG+ + Y++ L V
Sbjct: 130 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 182
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
P + + A + +++ D F T + P+M E
Sbjct: 183 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 240
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
+ GWF +G V R DLA V G N YM+ GGTNFG R A
Sbjct: 241 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 298
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
TSYDYDA + E G + + + KAIK +C E + + P +KLG
Sbjct: 299 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 345
>gi|322378066|ref|ZP_08052553.1| glycosyl hydrolase, family 35 [Streptococcus sp. M334]
gi|321281048|gb|EFX58061.1| glycosyl hydrolase, family 35 [Streptococcus sp. M334]
Length = 595
Score = 147 bits (371), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 103/307 (33%), Positives = 150/307 (48%), Gaps = 25/307 (8%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK + SG+IHY R PE W + K G +ETYV WN HEP G+++FEG
Sbjct: 12 LDGKSFKILSGAIHYFRVPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFHFEGAQ 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL RF++ Q+ GL+ +R P+ CAEW +GG P WL ++ R+++ + E + R+
Sbjct: 72 DLERFLQIAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKDMRIRSSDPAYIEAVGRYY 130
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
+++ + L +GG I++ QVENEYG+ + AY ++ T +
Sbjct: 131 DQLLPRLVPHLL--DKGGNILMMQVENEYGSYGEDKAYLRAIRQLMEERGVTCPLFTSDG 188
Query: 192 PWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
PW + D + T N + F + P+M E + GWF +
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248
Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
+ R ++LA AV E G N YM+ GGTNFG G G L TSYDYD
Sbjct: 249 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYD 306
Query: 294 APIDEYG 300
A +DE G
Sbjct: 307 ALLDEEG 313
>gi|194213011|ref|XP_001503026.2| PREDICTED: beta-galactosidase-1-like protein 3-like [Equus
caballus]
Length = 880
Score = 147 bits (371), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 108/324 (33%), Positives = 155/324 (47%), Gaps = 29/324 (8%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++G + ++ GSIHY R E W + + K K G + TYV WN HEP RG++ F G
Sbjct: 248 FTLEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPERGRFDFSG 307
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL FV T E GL++ LR GPY C+E + GG P L P + RTT+ F E + +
Sbjct: 308 NLDLEAFVLTAAEIGLWVILRPGPYICSEIDLGGLPSRLLQDPQVNLRTTDKGFVEAVDK 367
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ +I + +L +GGPII QVENEYG+ + Y+ + L +
Sbjct: 368 YFDHLIS--RVVHLQYRKGGPIIAVQVENEYGSF-----YKDKDYMPYLQQAL--LKRGI 418
Query: 192 PWVMCQQEDAPDPIINTCNG---------FYCDGFT---PNSPSKPIMWTENYSGWFLSF 239
++ ++ D + G F D F KPIM E + GWF ++
Sbjct: 419 VELLLTSDNVDDVLKGYIKGVLATINMKKFRKDAFQHLYKVQRDKPIMIMEYWVGWFDTW 478
Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVATSYDYD 293
G + D+ V+ F + +F N YM+ GGTNFG G V TSYDYD
Sbjct: 479 GSKHEVKDAGDVKNTVSEFIKFEISF-NVYMFHGGTNFGFINGAINFVKHAGVVTSYDYD 537
Query: 294 APIDEYGFIRQPKWGHLRELHKAI 317
A + E G + K+ LR+L +I
Sbjct: 538 AVLTEAGDYTK-KYFKLRKLFGSI 560
>gi|256957323|ref|ZP_05561494.1| beta-galactosidase [Enterococcus faecalis DS5]
gi|257077681|ref|ZP_05572042.1| beta-galactosidase [Enterococcus faecalis JH1]
gi|307270129|ref|ZP_07551446.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
gi|422710565|ref|ZP_16767610.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
gi|422721468|ref|ZP_16778057.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|422867159|ref|ZP_16913760.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
gi|256947819|gb|EEU64451.1| beta-galactosidase [Enterococcus faecalis DS5]
gi|256985711|gb|EEU73013.1| beta-galactosidase [Enterococcus faecalis JH1]
gi|306513498|gb|EFM82113.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
gi|315031294|gb|EFT43226.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|315035298|gb|EFT47230.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
gi|329577710|gb|EGG59137.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
Length = 593
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 112/352 (31%), Positives = 165/352 (46%), Gaps = 44/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++ FV+ ++ L + LR Y CAEW +GG P WL ++ R+T+ F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKDVRLRSTDPIFMTKVRN 130
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ QVENEYG +YG+ + Y++ L V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
P + + A + +++ D F T + P+M E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
+ GWF +G V R DLA V G N YM+ GGTNFG R A
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
TSYDYDA + E G + + + KAIK +C E + + P +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346
>gi|423212381|ref|ZP_17198910.1| hypothetical protein HMPREF1074_00442 [Bacteroides xylanisolvens
CL03T12C04]
gi|392694827|gb|EIY88053.1| hypothetical protein HMPREF1074_00442 [Bacteroides xylanisolvens
CL03T12C04]
Length = 725
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 106/308 (34%), Positives = 155/308 (50%), Gaps = 34/308 (11%)
Query: 26 IHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEA 85
+HYPR E W + +++++ GL + YVFWN+HE G++ F G+ D+ FV+T QE
Sbjct: 1 MHYPRIPHEYWRDRLKRARAMGLNTVSAYVFWNFHERQPGEFDFTGQADIAEFVRTAQEE 60
Query: 86 GLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQ-EN 144
GL++ LR GPY CAEW++GG+P WL + +R+ + F +R+ I +L KQ +
Sbjct: 61 GLYVILRPGPYVCAEWDFGGYPSWLLKEKDMIYRSKDPRFLSYCERY---IKELGKQLSS 117
Query: 145 LFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQ---QEDA 201
L + GG II+ QVENEYG +Y E Y+ D +VP C Q +A
Sbjct: 118 LTINNGGNIIMVQVENEYG----SYAADKE-YLAAIRDMIKEAGFNVPLFTCDGGGQVEA 172
Query: 202 P--DPIINTCNGFYCDGF----TPNSPSKPIMWTENYSGWFLSFGY---AVPF-RPVEDL 251
+ + T NG + + P E Y WF +G +V + RP E L
Sbjct: 173 GHIEGALPTLNGVFGEDIFKVVDNYHKGGPYFVAEFYPAWFDEWGKRHSSVAYERPAEQL 232
Query: 252 AFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSYDYDAPIDEYGFIRQP 305
+ ++ G + YM+ GGTNF T G TSYDYDAP+ E+G P
Sbjct: 233 DWMLSH-----GVSVSMYMFHGGTNFWYTNGANTGGGYQPQPTSYDYDAPLGEWGNC-YP 286
Query: 306 KWGHLREL 313
K+ RE+
Sbjct: 287 KYHAFREV 294
>gi|414888317|tpg|DAA64331.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
Length = 284
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 93/298 (31%), Positives = 134/298 (44%), Gaps = 32/298 (10%)
Query: 528 LQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK 587
LQ+ G +G+ ++ L G DL W ++ +EGE + WK
Sbjct: 6 LQDSGGELAEVKSGIQECLIQGLNTGTLDLQVNGWGHKAALEGEDKEIYSEKGVGKVQWK 65
Query: 588 QGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGC 647
++ WYK F P+G P+ L+++SM KG +VNG+ +GRYW +Y
Sbjct: 66 PAEN---GRAATWYKRYFDEPDGDDPVVLDMSSMDKGMIFVNGEGVGRYWVSY------- 115
Query: 648 TKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
+ G P+Q LYHIPR ++ +NLLV+ EE G P I + T T
Sbjct: 116 ---------------RTLAGTPSQALYHIPRPFLKSKDNLLVVFEEEMGKPDGILVQTVT 160
Query: 708 GQHICSFVSEADPPPVDSW-----KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEG 762
IC F+SE +P + +W K L S + L C I + FAS+G PEG
Sbjct: 161 RDDICLFISEHNPGQIKTWDTDGDKIKLIAEDHSRRGTLMCPPEKTIQEVVFASFGNPEG 220
Query: 763 NCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
CG+F G CH + IV+K C+G+ C +PV G C L V+ C
Sbjct: 221 MCGNFTVGTCHTPNAKQIVEKECLGKPSCMLPVDHTVYGADIN-CQSTTATLGVQVRC 277
>gi|187736173|ref|YP_001878285.1| beta-galactosidase [Akkermansia muciniphila ATCC BAA-835]
gi|187426225|gb|ACD05504.1| Beta-galactosidase [Akkermansia muciniphila ATCC BAA-835]
Length = 780
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 105/320 (32%), Positives = 156/320 (48%), Gaps = 36/320 (11%)
Query: 6 TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
+ + ++DGK + SG +HYPR + W + ++ K G+ + TY+FWN HEP G
Sbjct: 35 STNQENFLMDGKPVKIISGEMHYPRVPRQHWKDRFQRIKAMGMNTVCTYLFWNVHEPEPG 94
Query: 66 QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
++ F G D V F+K Q+AGL++ +R GPY CAEW +GGFP WL ++ R+ + F
Sbjct: 95 KWDFSGNLDFVEFIKEAQKAGLWVIVRPGPYVCAEWEFGGFPGWLLKDEDLKVRSQDPRF 154
Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
E +L K+ ++ E L ++GGPII+AQVENEYG +YG + YVK D
Sbjct: 155 LEPAMAYLKKVCSML--EPLQITKGGPIIMAQVENEYG----SYGSDKD-YVKKHLDV-- 205
Query: 186 NLNTSVPWVMCQQEDAPD-------------PIIN---TCNGFYCDGFTPNSPSKPIMWT 229
+ +P V+ D P+ P +N G + + + P +
Sbjct: 206 -IRKELPGVVPFTSDGPNDWMIKNGTLPGVVPAMNFGGGAKGAFAN-LEKHKGKTPRING 263
Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------ 283
E + GWF +G E + E + N +M GGT+FG G
Sbjct: 264 EFWVGWFDHWGKPKNGGSTEGFNRDLKWMLENNVS-PNLFMAHGGTSFGFMNGANWEGAY 322
Query: 284 -PLVATSYDYDAPIDEYGFI 302
P V T+YDY API E G +
Sbjct: 323 TPDV-TNYDYGAPISENGTL 341
>gi|419456662|ref|ZP_13996611.1| beta-galactosidase family protein [Streptococcus pneumoniae
GA02254]
gi|379533348|gb|EHY98561.1| beta-galactosidase family protein [Streptococcus pneumoniae
GA02254]
Length = 595
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 105/322 (32%), Positives = 151/322 (46%), Gaps = 55/322 (17%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK + SG+IHY R PE W + K G +ETYV WN HEP G+++FEG
Sbjct: 12 LDGKSFKILSGAIHYFRVPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFHFEGDL 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
DL +F++ Q+ GL+ +R P+ CAEW +GG P WL ++ R+++ + E + R+
Sbjct: 72 DLEKFLQIAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKNMRIRSSDPAYIEAVGRYY 130
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAY-----------GVGGELYVK-- 178
+++ + L GG I++ QVENEYG+ + AY GV L+
Sbjct: 131 DQLLPRLVSRLL--DNGGNILMMQVENEYGSYGEDKAYLRAIRQLMEECGVTCPLFTSDG 188
Query: 179 -WAADTAV------------NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKP 225
W A V N + P+ Q ++ F + P
Sbjct: 189 PWRATLKVGTLIEEDLFVTGNFGSKAPYNFSQMQEF---------------FDEHGKKWP 233
Query: 226 IMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG--- 282
+M E + GWF + + R ++LA AV E G N YM+ GGTNFG G
Sbjct: 234 LMCMEFWDGWFNRWKEPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSA 291
Query: 283 -GPL---VATSYDYDAPIDEYG 300
G L TSYDYDA +DE G
Sbjct: 292 RGTLDLPQVTSYDYDALLDEEG 313
Score = 40.8 bits (94), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 55/205 (26%), Positives = 82/205 (40%), Gaps = 57/205 (27%)
Query: 513 NEGINTLDILSMMVGLQNYGAWF--DVAGAGLFSVILIDLK---NGKRDLSSGEWIYQVG 567
+G++ LDIL +G NYG F D G+ + + DL N K Y +
Sbjct: 437 KKGLSRLDILIENMGRVNYGHKFLADTQRKGIRTGVCKDLHFLLNWKH--------YPLP 488
Query: 568 VEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAW 627
++ +KI + W QG +Y F E K L+L+ GKG A+
Sbjct: 489 LDNP----EKIDFSKG--WTQGQP-------AFYAYDFTVEEPKDTY-LDLSEFGKGVAF 534
Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
VNGQ++GR+W+ P +LY IP ++ G N
Sbjct: 535 VNGQNLGRFWNV----------------------------GPTLSLY-IPHCYLKEGANR 565
Query: 688 LVIHEELGGDPSKISLLTK-TGQHI 711
++I E G +I L K T +HI
Sbjct: 566 IIIFETEGQYKEEIHLTRKPTLKHI 590
>gi|325922356|ref|ZP_08184130.1| beta-galactosidase [Xanthomonas gardneri ATCC 19865]
gi|325547138|gb|EGD18218.1| beta-galactosidase [Xanthomonas gardneri ATCC 19865]
Length = 613
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 109/319 (34%), Positives = 150/319 (47%), Gaps = 23/319 (7%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
V DGK L SG+IH+ R E W + ++K++ GL +ETYVFWN EP +GQ+ F G
Sbjct: 39 FVRDGKPYQLLSGAIHFQRIPREYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFAG 98
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ FV+ GL + LR GPY CAEW GG+P WL I+ R+ + F +
Sbjct: 99 NNDVAAFVREAAAQGLNVILRPGPYTCAEWEAGGYPAWLFGKDNIRVRSRDPRFLAASQA 158
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGE---LYVKWAADTAVNLN 188
+L + + L GGPII QVENEYG+ + + + +YVK D A+ L
Sbjct: 159 YLDAVSKQV--HPLLNHNGGPIIAVQVENEYGSYDDDHAYMADNRAMYVKAGFDDAL-LF 215
Query: 189 TSVPWVMCQQEDAPD--PIINTCNGFYCDGF---TPNSPSKPIMWTENYSGWFLSFGYAV 243
TS M PD ++N G F P +P M E ++GWF +G
Sbjct: 216 TSDGADMLANGTLPDTLAVVNFAPGEAKTAFEKLIKFRPEQPRMVGEYWAGWFDHWGKPH 275
Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL----------VATSYDYD 293
+ G + N YM+ GGT+FG G TSYDYD
Sbjct: 276 ASTDAKQQTEEFEWILRQGHS-ANLYMFIGGTSFGFMNGANFQGNPSDHYAPQTTSYDYD 334
Query: 294 APIDEYGFIRQPKWGHLRE 312
A +DE G PK+ +R+
Sbjct: 335 AILDEAGRP-TPKFALMRD 352
>gi|295113973|emb|CBL32610.1| Beta-galactosidase [Enterococcus sp. 7L76]
Length = 592
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 112/352 (31%), Positives = 165/352 (46%), Gaps = 44/352 (12%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+++G+ + SG+IHY R TP W + + K G +ETY+ WN HEP G Y FEG
Sbjct: 10 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 69
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
++ FV+ ++ L + LR Y CAEW +GG P WL ++ R+T+ F +++
Sbjct: 70 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKDVRLRSTDPIFMTKVRN 129
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+ + L K L +QGGP+I+ QVENEYG +YG+ + Y++ L V
Sbjct: 130 YFQVL--LPKLAPLQITQGGPVIMIQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 182
Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
P + + A + +++ D F T + P+M E
Sbjct: 183 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 240
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
+ GWF +G V R DLA V G N YM+ GGTNFG R A
Sbjct: 241 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 298
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
TSYDYDA + E G + + + KAIK +C E + + P +KLG
Sbjct: 299 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 345
>gi|324509196|gb|ADY43870.1| Beta-galactosidase [Ascaris suum]
Length = 639
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 102/337 (30%), Positives = 164/337 (48%), Gaps = 26/337 (7%)
Query: 2 SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
S ++ Y ++ ++DG+ SGSIHY R P+ W + + + + GL I+ Y+ WN+HE
Sbjct: 26 SFSIDYVNKRFLLDGQPFRYISGSIHYFRVHPDQWNDRLSRMRAAGLNAIQFYIPWNFHE 85
Query: 62 PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
G F+G ++ RF+ + L+ +RIGPY C EW GG P WL I+ RT+
Sbjct: 86 IYEGVIGFDGGRNITRFLSLAAQNELYALVRIGPYICGEWENGGLPWWLLKYDDIKMRTS 145
Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV-------------EWA 168
+ F ++R+ ++ ++K GGPI++ QVENEYG+ +
Sbjct: 146 DKRFIRAVERWFGVLLPILKPS--LRKNGGPILMIQVENEYGSFTEGCDRKYTTFLRDLT 203
Query: 169 YGVGGELYVKWAADTAVNLNT---SVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKP 225
G+ V + D A N + S+P V + P+ F P+ P
Sbjct: 204 IKHLGDDVVLYTTDGANNQSLKCGSIPGVFATVDFGPNSEEQIDKNFATQ--RSYEPNGP 261
Query: 226 IMWTENYSGWFLSFGYAVPFRP-VEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGP 284
++ +E Y GW +++ P V+++ F+ G +F NYYM++GGTNF G
Sbjct: 262 LVNSEFYPGWIVTWSQKGRIDPSVDEIINGSKYMFKLGASF-NYYMFYGGTNFAFWNGAE 320
Query: 285 L---VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK 318
V TSYDY AP+ E I + K+ +R K+I+
Sbjct: 321 TTSAVITSYDYFAPLTEAADINE-KFVAIRNWIKSIE 356
>gi|397689967|ref|YP_006527221.1| Beta-galactosidase [Melioribacter roseus P3M]
gi|395811459|gb|AFN74208.1| Beta-galactosidase [Melioribacter roseus P3M]
Length = 772
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 102/314 (32%), Positives = 149/314 (47%), Gaps = 36/314 (11%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++DGK ++ G +H+ R E W I+ K G+ I Y+FWN+HE G + ++G
Sbjct: 31 FLLDGKPFQIRCGELHFARIPKEYWRHRIKMMKAMGMNTICAYLFWNFHERTPGNFKWDG 90
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ +F K QE GL++ LR GPY CAEW GG P WL I+ RT + F +
Sbjct: 91 EADVAQFCKIAQEEGLWVILRPGPYVCAEWEMGGLPWWLLKNENIKLRTKDPLFINASRN 150
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
+L ++ ++ L + GGPIIL QVENE+G + Y+ D + +V
Sbjct: 151 YLMEVGRVLAP--LQITNGGPIILVQVENEHG-----FYADDPEYMGIIKDAILEAGFNV 203
Query: 192 PWVMCQQEDAPDPIINTCNGFYCD-------GFTPNS---------PSKPIMWTENYSGW 235
P C +P + G+ D G P P P+M E YSGW
Sbjct: 204 PLFAC------NPTYHLEKGYRKDIFPVVNFGSNPEEAFRALRKILPEGPLMCGEFYSGW 257
Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSY 290
F ++G F ++ + +TG +F + YM GGT FG AG P V +SY
Sbjct: 258 FDTWGNPHTFGEIDRYLKDMEYMLKTGASF-SIYMAHGGTTFGFWAGADRPFKPDV-SSY 315
Query: 291 DYDAPIDEYGFIRQ 304
DY AP+ E G+ +
Sbjct: 316 DYGAPVTEAGWTSE 329
>gi|422861007|ref|ZP_16907651.1| beta-galactosidase [Streptococcus sanguinis SK330]
gi|327468658|gb|EGF14137.1| beta-galactosidase [Streptococcus sanguinis SK330]
Length = 592
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 108/338 (31%), Positives = 155/338 (45%), Gaps = 42/338 (12%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK + SG+I Y R P+ W + + K G +ETY+ W HEP GQ+ EG
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
D + K V+E GL+L +R PY CAE+++GG P WL P ++ R + F E++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVP- 192
+ + + QGGPI++ QVENEYG+ + Y++ A SVP
Sbjct: 132 DWLFPKLLPYQ--SDQGGPILMMQVENEYGSY-----AEDKAYMRSIAQMMKVRGVSVPL 184
Query: 193 ------WVMCQQ-----ED--------APDPIINTCNGFYCDGFTPNSPSK-PIMWTENY 232
W+ + ED P NT N F K P+M TE +
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDN---LRAFMERYGKKWPLMCTEFW 241
Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG--------RTAGGP 284
GWF + + R EDLA V + G N ++ GGTNFG +T P
Sbjct: 242 DGWFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLP 299
Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEE 322
+ TSYD+DAPI E+G + + R H+ E+
Sbjct: 300 QI-TSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELEQ 336
>gi|282859441|ref|ZP_06268546.1| glycosyl hydrolase family 35 [Prevotella bivia JCVIHMP010]
gi|424900868|ref|ZP_18324410.1| beta-galactosidase [Prevotella bivia DSM 20514]
gi|282587669|gb|EFB92869.1| glycosyl hydrolase family 35 [Prevotella bivia JCVIHMP010]
gi|388593068|gb|EIM33307.1| beta-galactosidase [Prevotella bivia DSM 20514]
Length = 622
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 106/332 (31%), Positives = 154/332 (46%), Gaps = 42/332 (12%)
Query: 15 DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQY-YFEGRF 73
DGK + SG +HY R W ++ K GL V+ +YVFWN+HE G + + G
Sbjct: 40 DGKPLQIYSGELHYARVPAPYWRHRLQMMKAMGLNVVTSYVFWNHHEVAPGVWDWSTGNH 99
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
+L FVKT E G+ + LR GPY CAEW +GG+P WL G+ RT N PF + + ++
Sbjct: 100 NLREFVKTAAEEGMKVILRPGPYCCAEWEFGGYPWWLPKTKGLVVRTDNQPFLDSCRVYI 159
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAA-------DTAV 185
++ ++ +L ++GGPII+ Q ENE+G+ V + E + ++A D
Sbjct: 160 NQLASQVR--DLQVTKGGPIIMVQAENEFGSYVAQRPDIPLETHKAYSAKIRQQLLDAGF 217
Query: 186 NL---NTSVPWVM-----------CQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTEN 231
N+ + W+ ED D + N ++ P M E
Sbjct: 218 NIPMFTSDGSWLFKGGVIEGVLPTANGEDNIDNLKKVVNEYHGG-------QGPYMVAEF 270
Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------ 285
Y GW + P + ++ + +F NYYM GGTNFG AG
Sbjct: 271 YPGWLSHWAEKFPQVSTTSVVTQTKKYLDNKVSF-NYYMVHGGTNFGFMAGANCDNIHKL 329
Query: 286 --VATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
TSYDYDAPI E G++ K+ LR L K
Sbjct: 330 QPDMTSYDYDAPISEAGWVTD-KYTALRNLMK 360
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 58/226 (25%), Positives = 93/226 (41%), Gaps = 53/226 (23%)
Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
+ I L A ++VN + V G N F + I N TLDIL G NYG
Sbjct: 431 MMKIPGLADYATIYVNGERV--GELNRVFGKHEMEIDIPFNA---TLDILVENWGRINYG 485
Query: 533 AWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGST 591
+ + G+ I I+ ++ +G W +Y++ ++ + D ++NS S
Sbjct: 486 KFIVNSTKGITLPITIN-----DNVITGSWQMYKLPMDKQ---PDLTDISNS----YNSG 533
Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
LPV Y +F + + G L++ GKG +VNG ++GRYW
Sbjct: 534 LPV-----LYSGSF-SVDKVGDTFLDMEKWGKGIVFVNGVNLGRYWRI------------ 575
Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGD 697
P TLY +P ++ GEN +V+ E+L +
Sbjct: 576 ----------------GPQHTLY-LPGCFLKQGENKIVVFEQLNDE 604
>gi|242077941|ref|XP_002443739.1| hypothetical protein SORBIDRAFT_07g001163 [Sorghum bicolor]
gi|241940089|gb|EES13234.1| hypothetical protein SORBIDRAFT_07g001163 [Sorghum bicolor]
Length = 111
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 61/109 (55%), Positives = 86/109 (78%)
Query: 34 EVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRI 93
++WP+LI K+KEGGL+VI+TYVFWN HEP++GQY FEGR+D VRF+K +Q GL+++LRI
Sbjct: 1 QMWPKLIAKAKEGGLDVIQTYVFWNVHEPVQGQYNFEGRYDFVRFIKEIQGQGLYVNLRI 60
Query: 94 GPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQ 142
GP+ +EW YGGFP WLH +P I FR+ N PFK ++ L +++ L++
Sbjct: 61 GPFIESEWKYGGFPFWLHDVPNITFRSDNEPFKPSVRNMLGELVSLLEH 109
>gi|336415312|ref|ZP_08595652.1| hypothetical protein HMPREF1017_02760 [Bacteroides ovatus
3_8_47FAA]
gi|335940908|gb|EGN02770.1| hypothetical protein HMPREF1017_02760 [Bacteroides ovatus
3_8_47FAA]
Length = 778
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 106/323 (32%), Positives = 154/323 (47%), Gaps = 27/323 (8%)
Query: 11 ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
++DGK V+++ +HY R W I K G+ I Y+FWN HE G++ F
Sbjct: 35 TFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFS 94
Query: 71 GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
G+ D+ F + Q+ G+++ +R GPY CAEW GG P WL I RT + + E +
Sbjct: 95 GQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIALRTLDPYYMERVG 154
Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
F+ ++ + L ++GG II+ QVENEYG +YG+ + YV D S
Sbjct: 155 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGI-DKPYVSAVRDLVRESGFS 207
Query: 191 -VPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
VP C +A D +I T N G D P P+M +E +SGWF
Sbjct: 208 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
+G R +D+ + + +F + YM GGT FG G + +SYDY
Sbjct: 268 HWGRKHETRLAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326
Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
DAPI E G+ K+ LR+L K
Sbjct: 327 DAPISEPGWTTD-KFFLLRDLLK 348
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 47/107 (43%), Gaps = 30/107 (28%)
Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
+YK+TF + G L++++ GKG WVNG ++GR+W
Sbjct: 532 YYKSTFTL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------------- 570
Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
P QTL+ +P W+ GEN +++ + G + I L K
Sbjct: 571 --------GPQQTLF-MPGCWLKEGENEILVLDLKGPTRASIKGLKK 608
>gi|397498227|ref|XP_003819886.1| PREDICTED: beta-galactosidase-1-like protein 3 [Pan paniscus]
Length = 653
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 105/318 (33%), Positives = 153/318 (48%), Gaps = 17/318 (5%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
++G + ++ GSIHY R E W + + K K G + TYV WN HEP RG++ F G
Sbjct: 80 FTLEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSG 139
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
DL FV E GL++ LR GPY C+E + GG P WL P + RTTN F E +++
Sbjct: 140 NLDLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPRLLLRTTNKSFIEAVEK 199
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNL---N 188
+ +I + L QGGP+I QVENEYG+ L+ V L +
Sbjct: 200 YFDHLIP--RVIPLQYRQGGPVIAVQVENEYGSFNKDKTYMPYLHKALLRRGIVELLLTS 257
Query: 189 TSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFLSFGYAVPF 245
V+ IN + D F KP++ E + GWF +G
Sbjct: 258 DGEKHVLSGHTKGVLAAIN-LQKLHQDTFNQLHKIQRDKPLLIMEYWVGWFDRWGDKHHV 316
Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSYDYDAPIDEY 299
+ +++ AV+ F + +F N YM+ GGTNFG G + TSYDYDA + E
Sbjct: 317 KDAKEVEHAVSEFIKYEISF-NVYMFHGGTNFGFMNGATYFGKHSGIVTSYDYDAVLTEA 375
Query: 300 GFIRQPKWGHLRELHKAI 317
G + K+ L++L +++
Sbjct: 376 GDYTE-KYLKLQKLFQSV 392
>gi|71275091|ref|ZP_00651378.1| Beta-galactosidase [Xylella fastidiosa Dixon]
gi|170731075|ref|YP_001776508.1| beta-galactosidase [Xylella fastidiosa M12]
gi|71163900|gb|EAO13615.1| Beta-galactosidase [Xylella fastidiosa Dixon]
gi|71730559|gb|EAO32637.1| Beta-galactosidase [Xylella fastidiosa Ann-1]
gi|167965868|gb|ACA12878.1| Beta-galactosidase [Xylella fastidiosa M12]
Length = 612
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 110/326 (33%), Positives = 156/326 (47%), Gaps = 35/326 (10%)
Query: 12 LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
+ DG+ L SG+IH+ R W + ++K++ GL +ETYVFWN E GQ+ F G
Sbjct: 35 FIRDGRPYQLISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVELREGQFDFTG 94
Query: 72 RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
D+ FV+ GL + LR GPY CAEW GGFP WL P ++ R+ + F + +R
Sbjct: 95 NNDIGAFVREAASQGLNVILRPGPYVCAEWEAGGFPAWLFADPTLRVRSQDPRFLDASQR 154
Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAY-----------GVGGELYVK 178
+L + ++ L S GGPII QVENEYG+ + Y G+GG L
Sbjct: 155 YLEALGTQVRP--LLNSNGGPIIAMQVENEYGSYGDDHGYLQAVRALFIKAGLGGALL-- 210
Query: 179 WAADTAVNL-NTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFL 237
+ +D A L N ++P V+ AP D P +P + E ++GWF
Sbjct: 211 FTSDGAQMLGNGTLPDVLAAVNVAPGEAKQA-----LDKLATFHPGQPQLVGEYWAGWFD 265
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-----RTAGGP-----LVA 287
+G + A + G + N YM+ GGT+FG GGP
Sbjct: 266 QWGKPHAQTDAKQQADEIEWMLRQGHSI-NLYMFVGGTSFGFMNGANFQGGPGDHYSPQT 324
Query: 288 TSYDYDAPIDEYGFIRQPKWGHLREL 313
TSYDYDA +DE G PK+ R++
Sbjct: 325 TSYDYDAALDEAGRP-MPKFALFRDV 349
>gi|422824944|ref|ZP_16873129.1| beta-galactosidase [Streptococcus sanguinis SK405]
gi|422827211|ref|ZP_16875390.1| beta-galactosidase [Streptococcus sanguinis SK678]
gi|422857055|ref|ZP_16903709.1| beta-galactosidase [Streptococcus sanguinis SK1]
gi|324992224|gb|EGC24146.1| beta-galactosidase [Streptococcus sanguinis SK405]
gi|324994315|gb|EGC26229.1| beta-galactosidase [Streptococcus sanguinis SK678]
gi|327459541|gb|EGF05887.1| beta-galactosidase [Streptococcus sanguinis SK1]
Length = 592
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 107/333 (32%), Positives = 154/333 (46%), Gaps = 32/333 (9%)
Query: 14 IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
+DGK + SG+I Y R P+ W + + K G +ETY+ W HEP GQ+ EG
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFKAEGML 71
Query: 74 DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
D + K V+E GL+L +R PY CAE+++GG P WL P ++ R + F E++ F
Sbjct: 72 DFEAYFKLVKETGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131
Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
+ + + QGGPI++ QVENEYG+ + AY +K T +
Sbjct: 132 DWLFPKLLPYQ--SDQGGPILMMQVENEYGSYAEDKAYMRSIAQMMKVRGVTVPLFTSDG 189
Query: 192 PWVMCQQ-----ED--------APDPIINTCNGFYCDGFTPNSPSK-PIMWTENYSGWFL 237
W+ + ED P NT N F K P+M TE + GWF
Sbjct: 190 TWIEALESGTLIEDDIFVTGNFGSQPKENTDN---LRAFMERYGKKWPLMCTEFWDGWFS 246
Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG--------RTAGGPLVATS 289
+ + R EDLA V + G N ++ GGTNFG +T P + TS
Sbjct: 247 RWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQI-TS 303
Query: 290 YDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEE 322
YD+DAPI E+G + + R H+ E+
Sbjct: 304 YDFDAPITEWGQPTEKYYAVQRVTHEVFPELEQ 336
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.137 0.439
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,516,308,602
Number of Sequences: 23463169
Number of extensions: 675811089
Number of successful extensions: 1256876
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2107
Number of HSP's successfully gapped in prelim test: 195
Number of HSP's that attempted gapping in prelim test: 1243565
Number of HSP's gapped (non-prelim): 5395
length of query: 821
length of database: 8,064,228,071
effective HSP length: 151
effective length of query: 670
effective length of database: 8,816,256,848
effective search space: 5906892088160
effective search space used: 5906892088160
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 81 (35.8 bits)