BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 038226
(849 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|84579373|dbj|BAE72075.1| pear beta-galactosidase3 [Pyrus communis]
Length = 894
Score = 1409 bits (3647), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 664/852 (77%), Positives = 733/852 (86%), Gaps = 25/852 (2%)
Query: 8 RALLQCLALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAG 67
R L CLA+ + +A+ +FKPFNVSYDHRA+IIDG RRML+SAG
Sbjct: 12 RCLFLCLAVQF---------------ALEAAAEYFKPFNVSYDHRALIIDGKRRMLVSAG 56
Query: 68 IHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSS 127
IHYPRATPEMWPDLIAKSKEGG DVI+TY FW+ HE +RGQYNF+G+ DIVKF LVG+S
Sbjct: 57 IHYPRATPEMWPDLIAKSKEGGVDVIQTYAFWSGHEPVRGQYNFEGRYDIVKFANLVGAS 116
Query: 128 GLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEML 187
GLYL LRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA FKEEMQRFVKK+VDLM+EE L
Sbjct: 117 GLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNALFKEEMQRFVKKMVDLMQEEEL 176
Query: 188 FSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENI 247
SWQGGPIIMLQIENEYGN+E +GQ+GK+Y+KWAA MALGLGAGVPWVMCKQ DAP +I
Sbjct: 177 LSWQGGPIIMLQIENEYGNIEGQFGQKGKEYIKWAAEMALGLGAGVPWVMCKQVDAPGSI 236
Query: 248 IDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
IDACNGYYCDGYKPNSYNKPT+WTE+WDGWY +WGGRLPHRPVEDLAFAVARF+QRGGSF
Sbjct: 237 IDACNGYYCDGYKPNSYNKPTMWTEDWDGWYASWGGRLPHRPVEDLAFAVARFYQRGGSF 296
Query: 308 MNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALV 367
NYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALV
Sbjct: 297 QNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALV 356
Query: 368 AADSAQYIKLGQNQEAHVYRAN---------RYGSQSNCSAFLANIDEHTAASVTFLGQS 418
AADS YIKLG QEAHVYR N YGSQ +CSAFLANIDEH AASVTFLGQ
Sbjct: 357 AADSPNYIKLGPKQEAHVYRMNSHTEGLNITSYGSQISCSAFLANIDEHKAASVTFLGQK 416
Query: 419 YTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSK 478
Y LPPWSVSILPDCRN V+NTAKV +QTSIKTVEF LPL IS QQ + ++ +K
Sbjct: 417 YNLPPWSVSILPDCRNVVYNTAKVGAQTSIKTVEFDLPLYSGISSQQQFITKNDDLFITK 476
Query: 479 SWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPT 538
SWMTVKEP+GVWSENNFTVQGILEHLNVTKD SDYLWHIT+I+VS+DDISFW+ N +
Sbjct: 477 SWMTVKEPVGVWSENNFTVQGILEHLNVTKDQSDYLWHITRIFVSEDDISFWEKNNISAA 536
Query: 539 VTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEK 598
V+IDSMRDVLRVF+NGQLTGSVIGHWVKV QPV+F GYNDL+LL+QTVGLQNYGAFLEK
Sbjct: 537 VSIDSMRDVLRVFVNGQLTGSVIGHWVKVEQPVKFLKGYNDLVLLTQTVGLQNYGAFLEK 596
Query: 599 DGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENE-AEWTDLTRDGIPS 657
DGAGFRGQ+KLTGFKNGDID SK+LWTYQVGLKGEF +IY+IEENE A W +L+ D PS
Sbjct: 597 DGAGFRGQIKLTGFKNGDIDFSKLLWTYQVGLKGEFLKIYTIEENEKASWAELSPDDDPS 656
Query: 658 TFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAY 717
TF WYKTYFD+P G DPVALDLGSMGKGQAWVNGHHIGRYWT+VAP+ GC + CDYRGAY
Sbjct: 657 TFIWYKTYFDSPAGTDPVALDLGSMGKGQAWVNGHHIGRYWTLVAPEDGCPEICDYRGAY 716
Query: 718 NSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSE 777
+SDKC+ NCG PTQT YHVPRSWLQ+S+NLLVI EETGGNPF+IS+KLRS ++C QVSE
Sbjct: 717 DSDKCSFNCGKPTQTLYHVPRSWLQSSSNLLVILEETGGNPFDISIKLRSAGVLCAQVSE 776
Query: 778 SHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGN 837
SHYPPV+KW N SVD K+++N + PEMHL CQDG+ ISSIEFASYGTPQG CQKFS GN
Sbjct: 777 SHYPPVQKWFNPDSVDEKITVNDLTPEMHLQCQDGFTISSIEFASYGTPQGSCQKFSMGN 836
Query: 838 CHAPMSLSVVSE 849
CHA S S+VS+
Sbjct: 837 CHATNSSSIVSK 848
>gi|61162194|dbj|BAD91079.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 903
Score = 1405 bits (3638), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 666/853 (78%), Positives = 734/853 (86%), Gaps = 26/853 (3%)
Query: 8 RALLQCLALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAG 67
R L CLA+ + +A+ +FKPFNVSYDHRA+IIDG RRML+SAG
Sbjct: 12 RCLFLCLAVQF---------------ALEAAAEYFKPFNVSYDHRALIIDGKRRMLVSAG 56
Query: 68 IHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSS 127
IHYPRATPEMWPDLIAKSKEGG DVI+TY FW+ HE +RGQYNF+G+ DIVKF LVG+S
Sbjct: 57 IHYPRATPEMWPDLIAKSKEGGVDVIQTYAFWSGHEPVRGQYNFEGRYDIVKFANLVGAS 116
Query: 128 GLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEML 187
GLYL LRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA FKEEMQRFVKK+VDLM+EE L
Sbjct: 117 GLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNALFKEEMQRFVKKMVDLMQEEEL 176
Query: 188 FSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENI 247
SWQGGPIIM+QIENEYGN+E +GQ+GK+Y+KWAA MALGLGAGVPWVMCKQ DAP +I
Sbjct: 177 LSWQGGPIIMMQIENEYGNIEGQFGQKGKEYIKWAAEMALGLGAGVPWVMCKQVDAPGSI 236
Query: 248 IDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
IDACNGYYCDGYKPNSYNKPTLWTE+WDGWY +WGGRLPHRPVEDLAFAVARF+QRGGSF
Sbjct: 237 IDACNGYYCDGYKPNSYNKPTLWTEDWDGWYASWGGRLPHRPVEDLAFAVARFYQRGGSF 296
Query: 308 MNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALV 367
NYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALV
Sbjct: 297 QNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALV 356
Query: 368 AADSAQYIKLGQNQEAHVYRAN---------RYGSQSNCSAFLANIDEHTAASVTFLGQS 418
AADS YIKLG QEAHVYR N YGSQ +CSAFLANIDEH AASVTFLGQ
Sbjct: 357 AADSPNYIKLGPKQEAHVYRVNSHTEGLNITSYGSQISCSAFLANIDEHKAASVTFLGQK 416
Query: 419 YTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSK 478
Y LPPWSVSILPDCRN V+NTAKV +QTSIKTVEF LPL IS QQ + ++ +K
Sbjct: 417 YNLPPWSVSILPDCRNVVYNTAKVGAQTSIKTVEFDLPLYSGISSQQQFITKNDDLFITK 476
Query: 479 SWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPT 538
SWMTVKEP+GVWSENNFTVQGILEHLNVTKD SDYLWHIT+I+VS+DDISFW+ N +
Sbjct: 477 SWMTVKEPVGVWSENNFTVQGILEHLNVTKDQSDYLWHITRIFVSEDDISFWEKNNISAA 536
Query: 539 VTIDSMRDVLRVFINGQLT-GSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLE 597
V+IDSMRDVLRVF+NGQLT GSVIGHWVKV QPV+F GYNDL+LL+QTVGLQNYGAFLE
Sbjct: 537 VSIDSMRDVLRVFVNGQLTEGSVIGHWVKVEQPVKFLKGYNDLVLLTQTVGLQNYGAFLE 596
Query: 598 KDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENE-AEWTDLTRDGIP 656
KDGAGFRGQ+KLTGFKNGDIDLSK+LWTYQVGLKGEF +IY+IEENE A W +L+ D P
Sbjct: 597 KDGAGFRGQIKLTGFKNGDIDLSKLLWTYQVGLKGEFFKIYTIEENEKAGWAELSPDDDP 656
Query: 657 STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGA 716
STF WYKTYFD+P G DPVALDLGSMGKGQAWVNGHHIGRYWT+VAP+ GC + CDYRGA
Sbjct: 657 STFIWYKTYFDSPAGTDPVALDLGSMGKGQAWVNGHHIGRYWTLVAPEDGCPEICDYRGA 716
Query: 717 YNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVS 776
YNSDKC+ NCG PTQT YHVPRSWLQ+S+NLLVI EETGGNPF+IS+KLRS ++C QVS
Sbjct: 717 YNSDKCSFNCGKPTQTLYHVPRSWLQSSSNLLVILEETGGNPFDISIKLRSAGVLCAQVS 776
Query: 777 ESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRG 836
ESHYPPV+KW N SVD K+++N + PEMHL CQDG+ ISSIEFASYGTPQG CQKFS G
Sbjct: 777 ESHYPPVQKWFNPDSVDEKITVNDLTPEMHLQCQDGFTISSIEFASYGTPQGSCQKFSMG 836
Query: 837 NCHAPMSLSVVSE 849
NCHA S S+VS+
Sbjct: 837 NCHATNSSSIVSK 849
>gi|224129140|ref|XP_002328900.1| predicted protein [Populus trichocarpa]
gi|222839330|gb|EEE77667.1| predicted protein [Populus trichocarpa]
Length = 891
Score = 1398 bits (3619), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 656/846 (77%), Positives = 736/846 (86%), Gaps = 16/846 (1%)
Query: 14 LALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRA 73
L +S + + ++I + +SS+ FF+PFNV+YDHRA+IIDG RR+L SAGIHYPRA
Sbjct: 7 LKISFFQFLSFYLIIQFTLISSN----FFEPFNVTYDHRALIIDGRRRILNSAGIHYPRA 62
Query: 74 TPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQL 133
TPEMWPDLIAKSKEGGADV++TYVFW HE ++GQY F+G+ D+VKFVKLVG SGLYL L
Sbjct: 63 TPEMWPDLIAKSKEGGADVVQTYVFWGGHEPVKGQYYFEGRYDLVKFVKLVGESGLYLHL 122
Query: 134 RIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGG 193
RIGPYVCAEWNFGGFPVWLRD+PG+ FRT+NAPFKEEMQ+FV KIVDLMREEML SWQGG
Sbjct: 123 RIGPYVCAEWNFGGFPVWLRDVPGVVFRTDNAPFKEEMQKFVTKIVDLMREEMLLSWQGG 182
Query: 194 PIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNG 253
PIIM QIENEYGN+E S+GQ GK+Y+KWAA MAL L AGVPWVMCKQTDAPENIIDACNG
Sbjct: 183 PIIMFQIENEYGNIEHSFGQGGKEYMKWAAGMALALDAGVPWVMCKQTDAPENIIDACNG 242
Query: 254 YYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMY 313
YYCDG+KPNS KP WTE+WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF NYYMY
Sbjct: 243 YYCDGFKPNSPKKPIFWTEDWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFQNYYMY 302
Query: 314 FGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQ 373
FGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQ
Sbjct: 303 FGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQ 362
Query: 374 YIKLGQNQEAHVYRA---------NRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPW 424
YIKLG QEAHVY ++YGSQS CSAFLANIDE AA+V FLGQS+TLPPW
Sbjct: 363 YIKLGPKQEAHVYGGSLSIQGMNFSQYGSQSKCSAFLANIDERQAATVRFLGQSFTLPPW 422
Query: 425 SVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVK 484
SVSILPDCRNTVFNTAKV++QT IKTVEF LPLS N S+ Q +++++ S S SW+ K
Sbjct: 423 SVSILPDCRNTVFNTAKVAAQTHIKTVEFVLPLS-NSSLLPQFIVQNEDSPQSTSWLIAK 481
Query: 485 EPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSM 544
EPI +WSE NFTV+GILEHLNVTKD SDYLW+ T+IYVSDDDI+FW+ N+V P V+IDSM
Sbjct: 482 EPITLWSEENFTVKGILEHLNVTKDESDYLWYFTRIYVSDDDIAFWEKNKVSPAVSIDSM 541
Query: 545 RDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFR 604
RDVLRVFINGQLTGSV+GHWVK VQPV+FQ GYN+L+LLSQTVGLQNYGAFLE+DGAGF+
Sbjct: 542 RDVLRVFINGQLTGSVVGHWVKAVQPVQFQKGYNELVLLSQTVGLQNYGAFLERDGAGFK 601
Query: 605 GQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEA-EWTDLTRDGIPSTFTWYK 663
GQ+KLTGFKNGDIDLS + WTYQVGLKGEF ++YS +NE EW++L D PSTFTWYK
Sbjct: 602 GQIKLTGFKNGDIDLSNLSWTYQVGLKGEFLKVYSTGDNEKFEWSELAVDATPSTFTWYK 661
Query: 664 TYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCT 723
T+FDAP G+DPVALDLGSMGKGQAWVNGHHIGRYWTVV+PK GC +CDYRGAY+S KC
Sbjct: 662 TFFDAPSGVDPVALDLGSMGKGQAWVNGHHIGRYWTVVSPKDGC-GSCDYRGAYSSGKCR 720
Query: 724 TNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPV 783
TNCGNPTQTWYHVPR+WL+ASNNLLV+FEETGGNPFEISVKLRS +++C QVSESHYPP+
Sbjct: 721 TNCGNPTQTWYHVPRAWLEASNNLLVVFEETGGNPFEISVKLRSAKVICAQVSESHYPPL 780
Query: 784 RKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMS 843
RKWS + G +S N M PEMHL CQDG+I+SSIEFASYGTP G CQKFSRGNCHA S
Sbjct: 781 RKWSRADLTGGNISRNDMTPEMHLKCQDGHIMSSIEFASYGTPNGSCQKFSRGNCHASNS 840
Query: 844 LSVVSE 849
SVV+E
Sbjct: 841 SSVVTE 846
>gi|255554022|ref|XP_002518051.1| beta-galactosidase, putative [Ricinus communis]
gi|223542647|gb|EEF44184.1| beta-galactosidase, putative [Ricinus communis]
Length = 897
Score = 1397 bits (3616), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 656/822 (79%), Positives = 725/822 (88%), Gaps = 11/822 (1%)
Query: 38 ASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYV 97
++ FFKPFNVSYDHRA+IIDG+RRMLIS GIHYPRATP+MWPDLIAKSKEGG DVI+TYV
Sbjct: 31 SANFFKPFNVSYDHRALIIDGHRRMLISGGIHYPRATPQMWPDLIAKSKEGGVDVIQTYV 90
Query: 98 FWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPG 157
FWN HE ++GQY F+G+ D+VKFVKLVG SGLYL LRIGPYVCAEWNFGGFPVWLRDIPG
Sbjct: 91 FWNGHEPVKGQYIFEGQYDLVKFVKLVGVSGLYLHLRIGPYVCAEWNFGGFPVWLRDIPG 150
Query: 158 IEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKD 217
I FRT+N+PF EEMQ+FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGN+E S+G GK+
Sbjct: 151 IVFRTDNSPFMEEMQQFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNIEHSFGPGGKE 210
Query: 218 YVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGW 277
YVKWAA MALGLGAGVPWVMC+QTDAP +IIDACN YYCDGYKPNS KP LWTE+WDGW
Sbjct: 211 YVKWAARMALGLGAGVPWVMCRQTDAPGSIIDACNEYYCDGYKPNSNKKPILWTEDWDGW 270
Query: 278 YTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAP 337
YTTWGG LPHRPVEDLAFAVARFFQRGGSF NYYMYFGGTNF RT+GGPFYITSYDYDAP
Sbjct: 271 YTTWGGSLPHRPVEDLAFAVARFFQRGGSFQNYYMYFGGTNFARTAGGPFYITSYDYDAP 330
Query: 338 IDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRAN-------- 389
IDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLG QEAHVYRAN
Sbjct: 331 IDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGSKQEAHVYRANVHAEGQNL 390
Query: 390 -RYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSI 448
++GSQS CSAFLANIDEH A +V FLGQSYTLPPWSVS+LPDCRN VFNTAKV++QTSI
Sbjct: 391 TQHGSQSKCSAFLANIDEHKAVTVRFLGQSYTLPPWSVSVLPDCRNAVFNTAKVAAQTSI 450
Query: 449 KTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTK 508
K++E +LP IS P+Q M +++ S S SWMTVKEPI VWS NNFTV+GILEHLNVTK
Sbjct: 451 KSMELALPQFSGISAPKQLMAQNEGSYMSSSWMTVKEPISVWSGNNFTVEGILEHLNVTK 510
Query: 509 DYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVV 568
D+SDYLW+ T+IYVSDDDI+FW+ N V P + IDSMRDVLRVFINGQLTGSVIG W+KVV
Sbjct: 511 DHSDYLWYFTRIYVSDDDIAFWEENNVHPAIKIDSMRDVLRVFINGQLTGSVIGRWIKVV 570
Query: 569 QPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQV 628
QPV+FQ GYN+L+LLSQTVGLQNYGAFLE+DGAGFRG KLTGF++GDIDLS + WTYQV
Sbjct: 571 QPVQFQKGYNELVLLSQTVGLQNYGAFLERDGAGFRGHTKLTGFRDGDIDLSNLEWTYQV 630
Query: 629 GLKGEFQQIYSIEENE-AEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQA 687
GL+GE Q+IY+ E NE AEWTDLT D IPSTFTWYKTYFDAP G DPVALDLGSMGKGQA
Sbjct: 631 GLQGENQKIYTTENNEKAEWTDLTLDDIPSTFTWYKTYFDAPSGADPVALDLGSMGKGQA 690
Query: 688 WVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNL 747
WVN HHIGRYWT+VAP+ GCQ CDYRGAYNS+KC TNCG PTQ WYH+PRSWLQ SNNL
Sbjct: 691 WVNDHHIGRYWTLVAPEEGCQ-KCDYRGAYNSEKCRTNCGKPTQIWYHIPRSWLQPSNNL 749
Query: 748 LVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHL 807
LVIFEETGGNPFEIS+KLRS +VC QVSE+HYPP+++W ++ + G +S M PE+ L
Sbjct: 750 LVIFEETGGNPFEISIKLRSASVVCAQVSETHYPPLQRWIHTDFIYGNVSGKDMTPEIQL 809
Query: 808 HCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
CQDGY+ISSIEFASYGTPQG CQKFSRGNCHAP SLSVVS+
Sbjct: 810 RCQDGYVISSIEFASYGTPQGSCQKFSRGNCHAPNSLSVVSK 851
>gi|225433463|ref|XP_002263385.1| PREDICTED: beta-galactosidase 9-like [Vitis vinifera]
Length = 882
Score = 1363 bits (3527), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 639/815 (78%), Positives = 719/815 (88%), Gaps = 9/815 (1%)
Query: 42 FKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNA 101
F PFNVSYDHRA++IDG RRML+SAGIHYPRATPEMWPDLIAKSKEGGADVI+TYVFWN
Sbjct: 24 FAPFNVSYDHRALLIDGKRRMLVSAGIHYPRATPEMWPDLIAKSKEGGADVIQTYVFWNG 83
Query: 102 HESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFR 161
HE +R QYNF+G+ DIVKFVKLVGSSGLYL LRIGPYVCAEWNFGGFPVWLRDIPGIEFR
Sbjct: 84 HEPVRRQYNFEGRYDIVKFVKLVGSSGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIEFR 143
Query: 162 TNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKW 221
T+NAPFK+EMQRFVKKIVDLM++EMLFSWQGGPIIMLQIENEYGN+ESS+GQ+GKDYVKW
Sbjct: 144 TDNAPFKDEMQRFVKKIVDLMQKEMLFSWQGGPIIMLQIENEYGNVESSFGQRGKDYVKW 203
Query: 222 AASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTW 281
AA MAL L AGVPWVMC+Q DAP+ II+ACNG+YCD + PNS NKP LWTE+W+GW+ +W
Sbjct: 204 AARMALELDAGVPWVMCQQADAPDIIINACNGFYCDAFWPNSANKPKLWTEDWNGWFASW 263
Query: 282 GGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEY 341
GGR P RPVED+AFAVARFFQRGGSF NYYMYFGGTNFGR+SGGPFY+TSYDYDAPIDEY
Sbjct: 264 GGRTPKRPVEDIAFAVARFFQRGGSFHNYYMYFGGTNFGRSSGGPFYVTSYDYDAPIDEY 323
Query: 342 GLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYR------ANRYGSQS 395
GLLS+PKWGHLK+LHAAIKLCEPALVA DS QYIKLG QEAHVYR + + G+ S
Sbjct: 324 GLLSQPKWGHLKELHAAIKLCEPALVAVDSPQYIKLGPMQEAHVYRVKESLYSTQSGNGS 383
Query: 396 NCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSL 455
+CSAFLANIDEH ASVTFLGQ Y LPPWSVSILPDCR TVFNTAKV +QTSIKTVEF L
Sbjct: 384 SCSAFLANIDEHKTASVTFLGQIYKLPPWSVSILPDCRTTVFNTAKVGAQTSIKTVEFDL 443
Query: 456 PLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLW 515
PL NISV Q M+++K+S K+WMT+KEPI VWSENNFT+QG+LEHLNVTKD+SDYLW
Sbjct: 444 PLVRNISVTQPLMVQNKISYVPKTWMTLKEPISVWSENNFTIQGVLEHLNVTKDHSDYLW 503
Query: 516 HITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQS 575
IT+I VS +DISFW+ N+V PT++IDSMRD+L +F+NGQL GSVIGHWVKVVQP++
Sbjct: 504 RITRINVSAEDISFWEENQVSPTLSIDSMRDILHIFVNGQLIGSVIGHWVKVVQPIQLLQ 563
Query: 576 GYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQ 635
GYNDL+LLSQTVGLQNYGAFLEKDGAGF+GQVKLTGFKNG+IDLS+ WTYQVGL+GEFQ
Sbjct: 564 GYNDLVLLSQTVGLQNYGAFLEKDGAGFKGQVKLTGFKNGEIDLSEYSWTYQVGLRGEFQ 623
Query: 636 QIYSIEENE-AEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHI 694
+IY I+E+E AEWTDLT D PSTFTWYKT+FDAP+G +PVALDLGSMGKGQAWVNGHHI
Sbjct: 624 KIYMIDESEKAEWTDLTPDASPSTFTWYKTFFDAPNGENPVALDLGSMGKGQAWVNGHHI 683
Query: 695 GRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEET 754
GRYWT VAPK GC CDYRG Y++ KC TNCGNPTQ WYH+PRSWLQASNNLLV+FEET
Sbjct: 684 GRYWTRVAPKDGC-GKCDYRGHYHTSKCATNCGNPTQIWYHIPRSWLQASNNLLVLFEET 742
Query: 755 GGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYI 814
GG PFEISVK RST+ +C +VSESHYP ++ WS S +D + S NKM PEMHL C DG+
Sbjct: 743 GGKPFEISVKSRSTQTICAEVSESHYPSLQNWSPSDFID-QNSKNKMTPEMHLQCDDGHT 801
Query: 815 ISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
ISSIEFASYGTPQG CQ FS+G CHAP SL++VS+
Sbjct: 802 ISSIEFASYGTPQGSCQMFSQGQCHAPNSLALVSK 836
>gi|297826725|ref|XP_002881245.1| hypothetical protein ARALYDRAFT_902346 [Arabidopsis lyrata subsp.
lyrata]
gi|297327084|gb|EFH57504.1| hypothetical protein ARALYDRAFT_902346 [Arabidopsis lyrata subsp.
lyrata]
Length = 887
Score = 1334 bits (3452), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 620/829 (74%), Positives = 705/829 (85%), Gaps = 5/829 (0%)
Query: 22 MMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDL 81
+++ ++++ VS S FFKPFNVSYDHRA+II RRML+SAGIHYPRATPEMW DL
Sbjct: 17 LIIALLVYFPIVSGS----FFKPFNVSYDHRALIIADKRRMLVSAGIHYPRATPEMWSDL 72
Query: 82 IAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCA 141
I KSKEGGADVI+TYVFW+ HE ++GQYNF+G+ D+VKFVKL+GSSGLYL LRIGPYVCA
Sbjct: 73 IEKSKEGGADVIQTYVFWSGHEPVKGQYNFEGRYDLVKFVKLIGSSGLYLHLRIGPYVCA 132
Query: 142 EWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIE 201
EWNFGGFPVWLRDIPGI+FRT+N PFK+EMQ+FV KIVDLMR+ LF WQGGPIIMLQIE
Sbjct: 133 EWNFGGFPVWLRDIPGIQFRTDNEPFKKEMQKFVTKIVDLMRDAKLFCWQGGPIIMLQIE 192
Query: 202 NEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKP 261
NEYG++E SYGQ+GKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDG+KP
Sbjct: 193 NEYGDVEKSYGQKGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGFKP 252
Query: 262 NSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGR 321
NS KP LWTE+WDGWYT WGG LPHRP EDLAFAVARF+QRGGSF NYYMYFGGTNFGR
Sbjct: 253 NSQMKPILWTEDWDGWYTKWGGSLPHRPAEDLAFAVARFYQRGGSFQNYYMYFGGTNFGR 312
Query: 322 TSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQ 381
TSGGPFYITSYDYDAP+DEYGL SEPKWGHLKDLHAAIKLCEPALVAAD+ QY KLG NQ
Sbjct: 313 TSGGPFYITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAADAPQYRKLGSNQ 372
Query: 382 EAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAK 441
EAH+YR + C+AFLANIDEH +A V F GQSYTLPPWSVSILPDCR+ FNTAK
Sbjct: 373 EAHIYRGDGETGGKVCAAFLANIDEHKSAHVKFNGQSYTLPPWSVSILPDCRHVAFNTAK 432
Query: 442 VSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGIL 501
V +QTS+KTVE + P + S+ Q+ + + +S SKSWM +KEPIG+W ENNFT QG+L
Sbjct: 433 VGAQTSVKTVESARPSLGSKSILQKVVRQDNVSYISKSWMALKEPIGIWGENNFTFQGLL 492
Query: 502 EHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVI 561
EHLNVTKD SDYLWH T+I VS+DDISFWK N PTV+IDSMRDVLRVF+N QL+GSV+
Sbjct: 493 EHLNVTKDRSDYLWHKTRITVSEDDISFWKKNGANPTVSIDSMRDVLRVFVNKQLSGSVV 552
Query: 562 GHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSK 621
GHWVK VQPV F G NDL+LL+QTVGLQNYGAFLEKDGAGFRG+ KLTGFKNGD+DL+K
Sbjct: 553 GHWVKAVQPVRFMQGNNDLLLLTQTVGLQNYGAFLEKDGAGFRGKAKLTGFKNGDMDLAK 612
Query: 622 ILWTYQVGLKGEFQQIYSIEENE-AEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLG 680
WTYQVGLKGE ++IY++E NE AEW+ L D PS F WYKTYFD P G DPV LDL
Sbjct: 613 SSWTYQVGLKGEAEKIYTVEHNEKAEWSTLETDASPSIFMWYKTYFDTPAGTDPVVLDLE 672
Query: 681 SMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSW 740
SMGKGQAWVNGHHIGRYW +++ K GC+ TCDYRGAY SDKCTTNCG PTQT YHVPRSW
Sbjct: 673 SMGKGQAWVNGHHIGRYWNIISQKDGCERTCDYRGAYYSDKCTTNCGKPTQTRYHVPRSW 732
Query: 741 LQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINK 800
L+ S+NLLV+FEETGGNPF ISVK + I+C QV ESHYPP+RKWS ++G +SIN
Sbjct: 733 LKPSSNLLVLFEETGGNPFNISVKTVTAGILCGQVLESHYPPLRKWSTPDYINGTMSINS 792
Query: 801 MAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+APE++LHC+DG++ISSIEFASYGTP+G C +FS G CHA SLS+VSE
Sbjct: 793 VAPEVYLHCEDGHVISSIEFASYGTPRGSCDRFSIGKCHASNSLSIVSE 841
>gi|18403090|ref|NP_565755.1| beta galactosidase 9 [Arabidopsis thaliana]
gi|75265632|sp|Q9SCV3.1|BGAL9_ARATH RecName: Full=Beta-galactosidase 9; Short=Lactase 9; Flags:
Precursor
gi|6686890|emb|CAB64745.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|20197062|gb|AAC04500.2| putative beta-galactosidase [Arabidopsis thaliana]
gi|330253650|gb|AEC08744.1| beta galactosidase 9 [Arabidopsis thaliana]
Length = 887
Score = 1333 bits (3450), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 619/829 (74%), Positives = 703/829 (84%), Gaps = 5/829 (0%)
Query: 22 MMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDL 81
+++ ++++ +S S +FKPFNVSYDHRA+II G RRML+SAGIHYPRATPEMW DL
Sbjct: 17 LIIALLVYFPILSGS----YFKPFNVSYDHRALIIAGKRRMLVSAGIHYPRATPEMWSDL 72
Query: 82 IAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCA 141
IAKSKEGGADV++TYVFWN HE ++GQYNF+G+ D+VKFVKL+GSSGLYL LRIGPYVCA
Sbjct: 73 IAKSKEGGADVVQTYVFWNGHEPVKGQYNFEGRYDLVKFVKLIGSSGLYLHLRIGPYVCA 132
Query: 142 EWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIE 201
EWNFGGFPVWLRDIPGIEFRT+N PFK+EMQ+FV KIVDLMRE LF WQGGPIIMLQIE
Sbjct: 133 EWNFGGFPVWLRDIPGIEFRTDNEPFKKEMQKFVTKIVDLMREAKLFCWQGGPIIMLQIE 192
Query: 202 NEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKP 261
NEYG++E SYGQ+GKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDG+KP
Sbjct: 193 NEYGDVEKSYGQKGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGFKP 252
Query: 262 NSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGR 321
NS KP LWTE+WDGWYT WGG LPHRP EDLAFAVARF+QRGGSF NYYMYFGGTNFGR
Sbjct: 253 NSRTKPVLWTEDWDGWYTKWGGSLPHRPAEDLAFAVARFYQRGGSFQNYYMYFGGTNFGR 312
Query: 322 TSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQ 381
TSGGPFYITSYDYDAP+DEYGL SEPKWGHLKDLHAAIKLCEPALVAAD+ QY KLG Q
Sbjct: 313 TSGGPFYITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAADAPQYRKLGSKQ 372
Query: 382 EAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAK 441
EAH+Y + C+AFLANIDEH +A V F GQSYTLPPWSVSILPDCR+ FNTAK
Sbjct: 373 EAHIYHGDGETGGKVCAAFLANIDEHKSAHVKFNGQSYTLPPWSVSILPDCRHVAFNTAK 432
Query: 442 VSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGIL 501
V +QTS+KTVE + P ++S+ Q+ + + +S SKSWM +KEPIG+W ENNFT QG+L
Sbjct: 433 VGAQTSVKTVESARPSLGSMSILQKVVRQDNVSYISKSWMALKEPIGIWGENNFTFQGLL 492
Query: 502 EHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVI 561
EHLNVTKD SDYLWH T+I VS+DDISFWK N TV+IDSMRDVLRVF+N QL GS++
Sbjct: 493 EHLNVTKDRSDYLWHKTRISVSEDDISFWKKNGPNSTVSIDSMRDVLRVFVNKQLAGSIV 552
Query: 562 GHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSK 621
GHWVK VQPV F G NDL+LL+QTVGLQNYGAFLEKDGAGFRG+ KLTGFKNGD+DLSK
Sbjct: 553 GHWVKAVQPVRFIQGNNDLLLLTQTVGLQNYGAFLEKDGAGFRGKAKLTGFKNGDLDLSK 612
Query: 622 ILWTYQVGLKGEFQQIYSIEENE-AEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLG 680
WTYQVGLKGE +IY++E NE AEW+ L D PS F WYKTYFD P G DPV L+L
Sbjct: 613 SSWTYQVGLKGEADKIYTVEHNEKAEWSTLETDASPSIFMWYKTYFDPPAGTDPVVLNLE 672
Query: 681 SMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSW 740
SMG+GQAWVNG HIGRYW +++ K GC TCDYRGAYNSDKCTTNCG PTQT YHVPRSW
Sbjct: 673 SMGRGQAWVNGQHIGRYWNIISQKDGCDRTCDYRGAYNSDKCTTNCGKPTQTRYHVPRSW 732
Query: 741 LQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINK 800
L+ S+NLLV+FEETGGNPF+ISVK + I+C QVSESHYPP+RKWS ++G +SIN
Sbjct: 733 LKPSSNLLVLFEETGGNPFKISVKTVTAGILCGQVSESHYPPLRKWSTPDYINGTMSINS 792
Query: 801 MAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+APE+HLHC+DG++ISSIEFASYGTP+G C FS G CHA SLS+VSE
Sbjct: 793 VAPEVHLHCEDGHVISSIEFASYGTPRGSCDGFSIGKCHASNSLSIVSE 841
>gi|334184642|ref|NP_001189660.1| beta galactosidase 9 [Arabidopsis thaliana]
gi|330253651|gb|AEC08745.1| beta galactosidase 9 [Arabidopsis thaliana]
Length = 859
Score = 1331 bits (3445), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 619/829 (74%), Positives = 703/829 (84%), Gaps = 5/829 (0%)
Query: 22 MMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDL 81
+++ ++++ +S S +FKPFNVSYDHRA+II G RRML+SAGIHYPRATPEMW DL
Sbjct: 17 LIIALLVYFPILSGS----YFKPFNVSYDHRALIIAGKRRMLVSAGIHYPRATPEMWSDL 72
Query: 82 IAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCA 141
IAKSKEGGADV++TYVFWN HE ++GQYNF+G+ D+VKFVKL+GSSGLYL LRIGPYVCA
Sbjct: 73 IAKSKEGGADVVQTYVFWNGHEPVKGQYNFEGRYDLVKFVKLIGSSGLYLHLRIGPYVCA 132
Query: 142 EWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIE 201
EWNFGGFPVWLRDIPGIEFRT+N PFK+EMQ+FV KIVDLMRE LF WQGGPIIMLQIE
Sbjct: 133 EWNFGGFPVWLRDIPGIEFRTDNEPFKKEMQKFVTKIVDLMREAKLFCWQGGPIIMLQIE 192
Query: 202 NEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKP 261
NEYG++E SYGQ+GKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDG+KP
Sbjct: 193 NEYGDVEKSYGQKGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGFKP 252
Query: 262 NSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGR 321
NS KP LWTE+WDGWYT WGG LPHRP EDLAFAVARF+QRGGSF NYYMYFGGTNFGR
Sbjct: 253 NSRTKPVLWTEDWDGWYTKWGGSLPHRPAEDLAFAVARFYQRGGSFQNYYMYFGGTNFGR 312
Query: 322 TSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQ 381
TSGGPFYITSYDYDAP+DEYGL SEPKWGHLKDLHAAIKLCEPALVAAD+ QY KLG Q
Sbjct: 313 TSGGPFYITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAADAPQYRKLGSKQ 372
Query: 382 EAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAK 441
EAH+Y + C+AFLANIDEH +A V F GQSYTLPPWSVSILPDCR+ FNTAK
Sbjct: 373 EAHIYHGDGETGGKVCAAFLANIDEHKSAHVKFNGQSYTLPPWSVSILPDCRHVAFNTAK 432
Query: 442 VSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGIL 501
V +QTS+KTVE + P ++S+ Q+ + + +S SKSWM +KEPIG+W ENNFT QG+L
Sbjct: 433 VGAQTSVKTVESARPSLGSMSILQKVVRQDNVSYISKSWMALKEPIGIWGENNFTFQGLL 492
Query: 502 EHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVI 561
EHLNVTKD SDYLWH T+I VS+DDISFWK N TV+IDSMRDVLRVF+N QL GS++
Sbjct: 493 EHLNVTKDRSDYLWHKTRISVSEDDISFWKKNGPNSTVSIDSMRDVLRVFVNKQLAGSIV 552
Query: 562 GHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSK 621
GHWVK VQPV F G NDL+LL+QTVGLQNYGAFLEKDGAGFRG+ KLTGFKNGD+DLSK
Sbjct: 553 GHWVKAVQPVRFIQGNNDLLLLTQTVGLQNYGAFLEKDGAGFRGKAKLTGFKNGDLDLSK 612
Query: 622 ILWTYQVGLKGEFQQIYSIEENE-AEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLG 680
WTYQVGLKGE +IY++E NE AEW+ L D PS F WYKTYFD P G DPV L+L
Sbjct: 613 SSWTYQVGLKGEADKIYTVEHNEKAEWSTLETDASPSIFMWYKTYFDPPAGTDPVVLNLE 672
Query: 681 SMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSW 740
SMG+GQAWVNG HIGRYW +++ K GC TCDYRGAYNSDKCTTNCG PTQT YHVPRSW
Sbjct: 673 SMGRGQAWVNGQHIGRYWNIISQKDGCDRTCDYRGAYNSDKCTTNCGKPTQTRYHVPRSW 732
Query: 741 LQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINK 800
L+ S+NLLV+FEETGGNPF+ISVK + I+C QVSESHYPP+RKWS ++G +SIN
Sbjct: 733 LKPSSNLLVLFEETGGNPFKISVKTVTAGILCGQVSESHYPPLRKWSTPDYINGTMSINS 792
Query: 801 MAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+APE+HLHC+DG++ISSIEFASYGTP+G C FS G CHA SLS+VSE
Sbjct: 793 VAPEVHLHCEDGHVISSIEFASYGTPRGSCDGFSIGKCHASNSLSIVSE 841
>gi|449433177|ref|XP_004134374.1| PREDICTED: beta-galactosidase 9-like [Cucumis sativus]
Length = 890
Score = 1323 bits (3425), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 609/842 (72%), Positives = 710/842 (84%), Gaps = 14/842 (1%)
Query: 18 VYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEM 77
+ +M + + IHL VS FFKPFNVSYDHRA+IIDG RRMLISAG+HYPRA+PEM
Sbjct: 8 IVQLMSLTLTIHLLVVSGE----FFKPFNVSYDHRALIIDGKRRMLISAGVHYPRASPEM 63
Query: 78 WPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGP 137
WPD+I KSKEGGADVI++YVFWN HE +GQYNF G+ D+VKF++LVGSSGLYL LRIGP
Sbjct: 64 WPDIIEKSKEGGADVIQSYVFWNGHEPTKGQYNFDGRYDLVKFIRLVGSSGLYLHLRIGP 123
Query: 138 YVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIM 197
YVCAEWNFGGFP+WLRD+PGIEFRT+NAPFKEEMQRFVKKIVDL+R+E LF WQGGP+IM
Sbjct: 124 YVCAEWNFGGFPLWLRDVPGIEFRTDNAPFKEEMQRFVKKIVDLLRDEKLFCWQGGPVIM 183
Query: 198 LQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCD 257
LQ+ENEYGN+ESSYG++G++Y+KW +MALGLGA VPWVMC+Q DAP II++CNGYYCD
Sbjct: 184 LQVENEYGNIESSYGKRGQEYIKWVGNMALGLGAEVPWVMCQQKDAPSTIINSCNGYYCD 243
Query: 258 GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGT 317
G+K NS +KP WTENW+GW+T+WG R PHRPVEDLAF+VARFFQR GSF NYYMYFGGT
Sbjct: 244 GFKANSPSKPIFWTENWNGWFTSWGERSPHRPVEDLAFSVARFFQREGSFQNYYMYFGGT 303
Query: 318 NFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKL 377
NFGRT+GGPFYITSYDYD+PIDEYGL+ EPKWGHLKDLH A+KLCEPALV+ADS QYIKL
Sbjct: 304 NFGRTAGGPFYITSYDYDSPIDEYGLIREPKWGHLKDLHTALKLCEPALVSADSPQYIKL 363
Query: 378 GQNQEAHVYRA---------NRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSI 428
G QEAHVY ++ G+ NCSAFLANIDE A +V F GQ+Y LPPWSVSI
Sbjct: 364 GPKQEAHVYHMKSQTDDLTLSKLGTLRNCSAFLANIDERKAVAVKFNGQTYNLPPWSVSI 423
Query: 429 LPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIG 488
LPDC+N VFNTAKV++QTSIK +E PLS N+S+ + +++LS + SWMTVKEPIG
Sbjct: 424 LPDCQNVVFNTAKVAAQTSIKILELYAPLSANVSLKLHATDQNELSIIANSWMTVKEPIG 483
Query: 489 VWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVL 548
+WS+ NFTV+GILEHLNVTKD SDYLW++T+I+VS+DDI FWK + PT+TIDS+RDV
Sbjct: 484 IWSDQNFTVKGILEHLNVTKDRSDYLWYMTRIHVSNDDIRFWKERNITPTITIDSVRDVF 543
Query: 549 RVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVK 608
RVF+NG+LTGS IG WVK VQPV+F GYNDL+LLSQ +GLQN GAF+EKDGAG RG++K
Sbjct: 544 RVFVNGKLTGSAIGQWVKFVQPVQFLEGYNDLLLLSQAMGLQNSGAFIEKDGAGIRGRIK 603
Query: 609 LTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENE-AEWTDLTRDGIPSTFTWYKTYFD 667
LTGFKNGDIDLSK LWTYQVGLKGEF YS+EENE A+WT+L+ D IPSTFTWYK YF
Sbjct: 604 LTGFKNGDIDLSKSLWTYQVGLKGEFLNFYSLEENEKADWTELSVDAIPSTFTWYKAYFS 663
Query: 668 APDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCG 727
+PDG DPVA++LGSMGKGQAWVNGHHIGRYW+VV+PK GC CDYRGAYNS KC TNCG
Sbjct: 664 SPDGTDPVAINLGSMGKGQAWVNGHHIGRYWSVVSPKDGCPRKCDYRGAYNSGKCATNCG 723
Query: 728 NPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWS 787
PTQ+WYH+PRSWL+ S+NLLV+FEETGGNP EI VKL ST ++C QVSESHYP +RK S
Sbjct: 724 RPTQSWYHIPRSWLKESSNLLVLFEETGGNPLEIVVKLYSTGVICGQVSESHYPSLRKLS 783
Query: 788 NSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
N Y DG+ N+ PEM LHC DG++ISS+EFASYGTPQG C KFSRG CHA SLSVV
Sbjct: 784 NDYISDGETLSNRANPEMFLHCDDGHVISSVEFASYGTPQGSCNKFSRGPCHATNSLSVV 843
Query: 848 SE 849
S+
Sbjct: 844 SQ 845
>gi|34148077|gb|AAQ62586.1| putative beta-galactosidase [Glycine max]
Length = 909
Score = 1322 bits (3422), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 614/826 (74%), Positives = 696/826 (84%), Gaps = 9/826 (1%)
Query: 33 VSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADV 92
V + +FKPFNVSYDHRA+I++G RR LISAGIHYPRATPEMWPDLIAKSKEGGADV
Sbjct: 33 VRVTEGEEYFKPFNVSYDHRALILNGKRRFLISAGIHYPRATPEMWPDLIAKSKEGGADV 92
Query: 93 IETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWL 152
IETYVFWN HE +RGQYNF+G+ D+VKFV+L S GLY LRIGPY CAEWNFGGFPVWL
Sbjct: 93 IETYVFWNGHEPVRGQYNFEGRYDLVKFVRLAASHGLYFFLRIGPYACAEWNFGGFPVWL 152
Query: 153 RDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYG 212
RDIPGIEFRTNNAPFKEEM+RFV K+V+LMREE LFSWQGGPII+LQIENEYGN+E+SYG
Sbjct: 153 RDIPGIEFRTNNAPFKEEMKRFVSKVVNLMREERLFSWQGGPIILLQIENEYGNIENSYG 212
Query: 213 QQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTE 272
+ GK+Y+KWAA MAL LGAGVPWVMC+Q DAP +IID CN YYCDG+KPNS+NKPT+WTE
Sbjct: 213 KGGKEYMKWAAKMALSLGAGVPWVMCRQQDAPYDIIDTCNAYYCDGFKPNSHNKPTMWTE 272
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSY 332
NWDGWYT WG RLPHRPVEDLAFAVARFFQRGGSF NYYMYFGGTNFGRT+GGP ITSY
Sbjct: 273 NWDGWYTQWGERLPHRPVEDLAFAVARFFQRGGSFQNYYMYFGGTNFGRTAGGPLQITSY 332
Query: 333 DYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRAN--- 389
DYDAPIDEYGLL EPKWGHLKDLHAA+KLCEPALVA DS YIKLG QEAHVY+AN
Sbjct: 333 DYDAPIDEYGLLREPKWGHLKDLHAALKLCEPALVATDSPTYIKLGPKQEAHVYQANVHL 392
Query: 390 ------RYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVS 443
+ S S CSAFLANIDE A+VTF GQ YT+PPWSVS+LPDCRNTVFNTAKV
Sbjct: 393 EGLNLSMFESSSICSAFLANIDEWKEATVTFRGQRYTIPPWSVSVLPDCRNTVFNTAKVR 452
Query: 444 SQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEH 503
+QTS+K VE LP NI QQ ++ SKSWMT KEP+ +WS+++FTV+GI EH
Sbjct: 453 AQTSVKLVESYLPTVSNIFPAQQLRHQNDFYYISKSWMTTKEPLNIWSKSSFTVEGIWEH 512
Query: 504 LNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGH 563
LNVTKD SDYLW+ T++YVSD DI FW+ N+V P +TID +RD+LRVFINGQL G+V+GH
Sbjct: 513 LNVTKDQSDYLWYSTRVYVSDSDILFWEENDVHPKLTIDGVRDILRVFINGQLIGNVVGH 572
Query: 564 WVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKIL 623
W+KVVQ ++F GYNDL LL+QTVGLQNYGAFLEKDGAG RG++K+TGF+NGDIDLSK L
Sbjct: 573 WIKVVQTLQFLPGYNDLTLLTQTVGLQNYGAFLEKDGAGIRGKIKITGFENGDIDLSKSL 632
Query: 624 WTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMG 683
WTYQVGL+GEF + YS E +EW +LT D IPSTFTWYKTYFD P GIDPVALD SMG
Sbjct: 633 WTYQVGLQGEFLKFYSEENENSEWVELTPDAIPSTFTWYKTYFDVPGGIDPVALDFKSMG 692
Query: 684 KGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQA 743
KGQAWVNG HIGRYWT V+PK GCQ CDYRGAYNSDKC+TNCG PTQT YHVPRSWL+A
Sbjct: 693 KGQAWVNGQHIGRYWTRVSPKSGCQQVCDYRGAYNSDKCSTNCGKPTQTLYHVPRSWLKA 752
Query: 744 SNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAP 803
+NNLLVI EETGGNPFEISVKL S+RI+C QVSES+YPP++K N+ + ++S N M P
Sbjct: 753 TNNLLVILEETGGNPFEISVKLHSSRIICAQVSESNYPPLQKLVNADLIGEEVSANNMIP 812
Query: 804 EMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
E+HLHCQ G+ ISS+ FAS+GTP G CQ FSRGNCHAP S+S+VSE
Sbjct: 813 ELHLHCQQGHTISSVAFASFGTPGGSCQNFSRGNCHAPSSMSIVSE 858
>gi|114217393|dbj|BAF31232.1| beta-D-galactosidase [Persea americana]
Length = 889
Score = 1309 bits (3387), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 604/838 (72%), Positives = 708/838 (84%), Gaps = 12/838 (1%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
+M ++ + + ++ + FFKPFNVSYDHRA+IIDG RRMLIS+GIHYPRATPEMWPDLI
Sbjct: 7 IMEFLLVVMTLQIAACTEFFKPFNVSYDHRALIIDGKRRMLISSGIHYPRATPEMWPDLI 66
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
AKSKEGGAD+I+TY FWN HE IRGQYNF+G+ DIVKF+KL GS+GLY LRIGPYVCAE
Sbjct: 67 AKSKEGGADLIQTYAFWNGHEPIRGQYNFEGRYDIVKFIKLAGSAGLYFHLRIGPYVCAE 126
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
WNFGGFPVWLRDIPGIEFRT+NAP+K+EMQRFVKKIVDLMR+EMLFSWQGGPII+LQIEN
Sbjct: 127 WNFGGFPVWLRDIPGIEFRTDNAPYKDEMQRFVKKIVDLMRQEMLFSWQGGPIILLQIEN 186
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPN 262
EYGN+E YGQ+GKDYVKWAA MA+GLGAGVPWVMC+QTDAPENIIDACN +YCDG+KPN
Sbjct: 187 EYGNIERLYGQRGKDYVKWAADMAIGLGAGVPWVMCRQTDAPENIIDACNAFYCDGFKPN 246
Query: 263 SYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRT 322
SY KP LWTE+W+GWYT+WGGR+PHRPVED AFAVARFFQRGGS+ NYYM+FGGTNFGRT
Sbjct: 247 SYRKPALWTEDWNGWYTSWGGRVPHRPVEDNAFAVARFFQRGGSYHNYYMFFGGTNFGRT 306
Query: 323 SGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSA-QYIKLGQNQ 381
SGGPFY+TSYDYDAPIDEYGLLS+PKWGHLKDLH+AIKLCEPALVA D A QYI+LG Q
Sbjct: 307 SGGPFYVTSYDYDAPIDEYGLLSQPKWGHLKDLHSAIKLCEPALVAVDDAPQYIRLGPMQ 366
Query: 382 EAHVYRANRY---------GSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDC 432
EAHVYR + Y G+ + CSAFLANIDEH +A+V FLGQ Y+LPPWSVSILPDC
Sbjct: 367 EAHVYRHSSYVEDQSSSTLGNGTLCSAFLANIDEHNSANVKFLGQVYSLPPWSVSILPDC 426
Query: 433 RNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSE 492
+N FNTAKV+SQ S+KTVEFS P N + P ++ + S +WM +KEPIG W
Sbjct: 427 KNVAFNTAKVASQISVKTVEFSSPFIENTTEPGYLLLHDGVHHISTNWMILKEPIGEWGG 486
Query: 493 NNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFI 552
NNFT +GILEHLNVTKD SDYLW+I ++++SD+DISFW+ +EV P + IDSMRDV+R+F+
Sbjct: 487 NNFTAEGILEHLNVTKDTSDYLWYIMRLHISDEDISFWEASEVSPKLIIDSMRDVVRIFV 546
Query: 553 NGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGF 612
NGQL GS +G WV+V QPV+ GYN+L +LS+TVGLQNYGAFLEKDGAGF+GQ+KLTG
Sbjct: 547 NGQLAGSHVGRWVRVEQPVDLVQGYNELAILSETVGLQNYGAFLEKDGAGFKGQIKLTGL 606
Query: 613 KNGDIDLSKILWTYQVGLKGEFQQIYSIEENE-AEWTDLTRDGIPSTFTWYKTYFDAPDG 671
K+G+ DL+ LW YQVGL+GEF +I+S+EE+E A+W DL D +PS FTWYKT+FDAP G
Sbjct: 607 KSGEYDLTNSLWVYQVGLRGEFMKIFSLEEHESADWVDLPNDSVPSAFTWYKTFFDAPQG 666
Query: 672 IDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQ 731
DPV+L LGSMGKGQAWVNGH IGRYW++VAP GCQ +CDYRGAY+ KC TNCG PTQ
Sbjct: 667 KDPVSLYLGSMGKGQAWVNGHSIGRYWSLVAPVDGCQ-SCDYRGAYHESKCATNCGKPTQ 725
Query: 732 TWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYS 791
+WYH+PRSWLQ S NLLVIFEETGGNP EISVKL ST +C +VSESHYPP+ WS+
Sbjct: 726 SWYHIPRSWLQPSKNLLVIFEETGGNPLEISVKLHSTSSICTKVSESHYPPLHLWSHKDI 785
Query: 792 VDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
V+GK+SI+ PE+HL C +G ISSI FAS+GTPQG CQ+FS+G+CHAP S SVVSE
Sbjct: 786 VNGKVSISNAVPEIHLQCDNGQRISSIMFASFGTPQGSCQRFSQGDCHAPNSFSVVSE 843
>gi|357518749|ref|XP_003629663.1| Beta-galactosidase [Medicago truncatula]
gi|355523685|gb|AET04139.1| Beta-galactosidase [Medicago truncatula]
Length = 912
Score = 1293 bits (3345), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 613/836 (73%), Positives = 695/836 (83%), Gaps = 19/836 (2%)
Query: 31 SCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGA 90
S + + + + +FKPFNV+YDHRA+IIDG+RRMLISAGIHYPRATPEMWPDLIAK+KEGG
Sbjct: 34 SIIVAGAEAAWFKPFNVTYDHRALIIDGHRRMLISAGIHYPRATPEMWPDLIAKAKEGGV 93
Query: 91 DVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPV 150
DVIETYVFWN H+ ++GQYNF+G+ D+VKF KLV S+GLY LRIGPY CAEWNFGGFPV
Sbjct: 94 DVIETYVFWNGHQPVKGQYNFEGRYDLVKFAKLVASNGLYFFLRIGPYACAEWNFGGFPV 153
Query: 151 WLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQ------IENEY 204
WLRDIPGIEFRTNNAPFKEEM+RFV K+V+LMREEMLFSWQGGPII+LQ IENEY
Sbjct: 154 WLRDIPGIEFRTNNAPFKEEMKRFVSKVVNLMREEMLFSWQGGPIILLQVRREYGIENEY 213
Query: 205 GNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSY 264
GN+ESSYG +GK+YVKWAASMAL LGAGVPWVMCKQ DAP +IID CN YYCDG+KPNS
Sbjct: 214 GNLESSYGNEGKEYVKWAASMALSLGAGVPWVMCKQPDAPYDIIDTCNAYYCDGFKPNSR 273
Query: 265 NKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG 324
NKP WTENWDGWYT WG RLPHRPVEDLAFAVARFFQRGGS NYYMYFGGTNFGRT+G
Sbjct: 274 NKPIFWTENWDGWYTQWGERLPHRPVEDLAFAVARFFQRGGSLQNYYMYFGGTNFGRTAG 333
Query: 325 GPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAH 384
GP ITSYDYDAPIDEYGLL+EPKWGHLKDLHAA+KLCEPALVAADS YIKLG QEAH
Sbjct: 334 GPLQITSYDYDAPIDEYGLLNEPKWGHLKDLHAALKLCEPALVAADSPTYIKLGSKQEAH 393
Query: 385 VYRANRYGSQSN---------CSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNT 435
VY+ N + N CSAFLANIDE AA+VTF GQ+YTLPPWSVSILPDCR+
Sbjct: 394 VYQENVHREGLNLSISQISNKCSAFLANIDERKAATVTFRGQTYTLPPWSVSILPDCRSA 453
Query: 436 VFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNF 495
+FNTAKV +QTS+K V +LPL+ N+ + QQS+ + +S SKSWMT KEPI +W ++F
Sbjct: 454 IFNTAKVGAQTSVKLVGSNLPLTSNLLLSQQSIDHNGISHISKSWMTTKEPINIWINSSF 513
Query: 496 TVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQ 555
T +GI EHLNVTKD SDYLW+ T+IYVSD DI FWK N P + IDS+RD+LRVF+NGQ
Sbjct: 514 TAEGIWEHLNVTKDQSDYLWYSTRIYVSDGDILFWKENAAHPKLAIDSVRDILRVFVNGQ 573
Query: 556 LTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG 615
L G+V+GHWVK VQ ++FQ GYNDL LL+QTVGLQNYGAF+EKDGAG RG +K+TGF+NG
Sbjct: 574 LIGNVVGHWVKAVQTLQFQPGYNDLTLLTQTVGLQNYGAFIEKDGAGIRGTIKITGFENG 633
Query: 616 DIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPV 675
IDLSK LWTYQVGL+GEF + Y+ E A W +LT D IPSTFTWYKTYFD P G DPV
Sbjct: 634 HIDLSKPLWTYQVGLQGEFLKFYNEESENAGWVELTPDAIPSTFTWYKTYFDVPGGNDPV 693
Query: 676 ALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYH 735
ALDL SMGKGQAWVNGHHIGRYWT V+PK GCQ CDYRGAY+SDKCTTNCG PTQT YH
Sbjct: 694 ALDLESMGKGQAWVNGHHIGRYWTRVSPKTGCQ-VCDYRGAYDSDKCTTNCGKPTQTLYH 752
Query: 736 VPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGK 795
VPRSWL+ASNN LVI EETGGNP ISVKL S IVC QVS+S+YPP++K N+ S+ G+
Sbjct: 753 VPRSWLKASNNFLVILEETGGNPLGISVKLHSASIVCAQVSQSYYPPMQKLLNA-SLLGQ 811
Query: 796 --LSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+S N M PEM+L C+DG IISSI FAS+GTP G CQ FSRGNCHAP S S+VS+
Sbjct: 812 QEVSSNDMIPEMNLRCRDGNIISSITFASFGTPGGSCQSFSRGNCHAPSSKSIVSK 867
>gi|332105893|gb|AEE01408.1| beta-galactosidase STBG2 [Solanum lycopersicum]
Length = 892
Score = 1263 bits (3269), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 589/839 (70%), Positives = 686/839 (81%), Gaps = 13/839 (1%)
Query: 19 YPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMW 78
+P+++ ++ IH V A +FKPFNV+YD+RA+II G RRMLISAGIHYPRATPEMW
Sbjct: 13 FPLILTVLTIHFVIV----AGEYFKPFNVTYDNRALIIGGKRRMLISAGIHYPRATPEMW 68
Query: 79 PDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPY 138
P LIA+SKEGGADVIETY FWN HE RGQYNF+G+ DIVKF KLVGS GL+L +RIGPY
Sbjct: 69 PTLIARSKEGGADVIETYTFWNGHEPTRGQYNFEGRYDIVKFAKLVGSHGLFLFIRIGPY 128
Query: 139 VCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIML 198
CAEWNFGGFP+WLRDIPGIEFRT+NAPFKEEM+R+VKKIVDLM E LFSWQGGPII+L
Sbjct: 129 ACAEWNFGGFPIWLRDIPGIEFRTDNAPFKEEMERYVKKIVDLMISESLFSWQGGPIILL 188
Query: 199 QIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDG 258
QIENEYGN+ES++G +GK Y+KWAA MA+GLGAGVPWVMC+QTDAPE IID CN YYCDG
Sbjct: 189 QIENEYGNVESTFGPKGKLYMKWAAEMAVGLGAGVPWVMCRQTDAPEYIIDTCNAYYCDG 248
Query: 259 YKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTN 318
+ PNS KP +WTENW+GW+ WG RLP+RP ED+AFA+ARFFQRGGS NYYMYFGGTN
Sbjct: 249 FTPNSEKKPKIWTENWNGWFADWGERLPYRPSEDIAFAIARFFQRGGSLQNYYMYFGGTN 308
Query: 319 FGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLG 378
FGRT+GGP ITSYDYDAP+DEYGLL +PKWGHLKDLHAAIKLCEPALVAADS QYIKLG
Sbjct: 309 FGRTAGGPTQITSYDYDAPLDEYGLLRQPKWGHLKDLHAAIKLCEPALVAADSPQYIKLG 368
Query: 379 QNQEAHVYR--ANRYG-----SQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPD 431
QEAHVYR +N G ++ C+AF+ANIDEH +A+V F GQ +TLPPWSVSILPD
Sbjct: 369 PKQEAHVYRGTSNNIGQYMSLNEGICAAFIANIDEHESATVKFYGQEFTLPPWSVSILPD 428
Query: 432 CRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWS 491
CRNT FNTAKV +QTSIKTV N S+ Q + +SKL S S+SWMT+KEP+GVW
Sbjct: 429 CRNTAFNTAKVGAQTSIKTVGSDSVSVGNNSLFLQVITKSKLESFSQSWMTLKEPLGVWG 488
Query: 492 ENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVF 551
+ NFT +GILEHLNVTKD SDYLW++T+IY+SDDDISFW+ N+V PT+ IDSMRD +R+F
Sbjct: 489 DKNFTSKGILEHLNVTKDQSDYLWYLTRIYISDDDISFWEENDVSPTIDIDSMRDFVRIF 548
Query: 552 INGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTG 611
+NGQL GSV G W+KVVQPV+ GYND++LLS+TVGLQNYGAFLEKDGAGF+GQ+KLTG
Sbjct: 549 VNGQLAGSVKGKWIKVVQPVKLVQGYNDILLLSETVGLQNYGAFLEKDGAGFKGQIKLTG 608
Query: 612 FKNGDIDLSKILWTYQVGLKGEFQQIYSIEENE-AEWTDLTRDGIPSTFTWYKTYFDAPD 670
K+GDI+L+ LWTYQVGL+GEF ++Y + E A WT+ PS F+WYKT FDAP
Sbjct: 609 CKSGDINLTTSLWTYQVGLRGEFLEVYDVNSTESAGWTEFPTGTTPSVFSWYKTKFDAPG 668
Query: 671 GIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPT 730
G DPVALD SMGKGQAWVNGHH+GRYWT+VAP GC TCDYRGAY+SDKC TNCG T
Sbjct: 669 GTDPVALDFSSMGKGQAWVNGHHVGRYWTLVAPNNGCGRTCDYRGAYHSDKCRTNCGEIT 728
Query: 731 QTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSY 790
Q WYH+PRSWL+ NN+LVIFEE PF+IS+ RST +C QVSE HYPP+ KWS+S
Sbjct: 729 QAWYHIPRSWLKTLNNVLVIFEEIDKTPFDISISTRSTETICAQVSEKHYPPLHKWSHS- 787
Query: 791 SVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
D KLS+ PEMHL C +G+ ISSIEFASYG+P G CQKFS+G CHA SLSVVS+
Sbjct: 788 EFDRKLSLMDKTPEMHLQCDEGHTISSIEFASYGSPNGSCQKFSQGKCHAANSLSVVSQ 846
>gi|350537549|ref|NP_001234298.1| beta-galactosidase precursor [Solanum lycopersicum]
gi|7939617|gb|AAF70821.1|AF154420_1 beta-galactosidase [Solanum lycopersicum]
Length = 892
Score = 1205 bits (3118), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 567/842 (67%), Positives = 668/842 (79%), Gaps = 19/842 (2%)
Query: 19 YPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMW 78
+P+++ ++ IH V A +FKPFNV+YD+RA+II G RRMLISAGIHYPRATPEMW
Sbjct: 13 FPLILTVLTIHFVIV----AGEYFKPFNVTYDNRALIIGGKRRMLISAGIHYPRATPEMW 68
Query: 79 PDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPY 138
P LIA+SKEGGADVIETY FWN HE RGQYNF+G+ DIVKF KLVGS GL+L +RIGPY
Sbjct: 69 PTLIARSKEGGADVIETYTFWNGHEPTRGQYNFEGRYDIVKFAKLVGSHGLFLFIRIGPY 128
Query: 139 VCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIML 198
CAEWNFGGFP+WLRDIPGIEFRT+NAPFKEEM+R+VKKIVDLM E LFSWQGGPII+L
Sbjct: 129 ACAEWNFGGFPIWLRDIPGIEFRTDNAPFKEEMERYVKKIVDLMISESLFSWQGGPIILL 188
Query: 199 QIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDG 258
QIENEYGN+ESS+G +GK Y+KWAA MA+GLGAGVPWVMC+QTDAPE IID CN YYCDG
Sbjct: 189 QIENEYGNVESSFGPKGKLYMKWAAEMAVGLGAGVPWVMCRQTDAPEYIIDTCNAYYCDG 248
Query: 259 YKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTN 318
+ PNS KP +WTENW+GW+ WG RLP+RP ED+AFA+ARFFQRGGS NYYMYFGGTN
Sbjct: 249 FTPNSEKKPKIWTENWNGWFADWGERLPYRPSEDIAFAIARFFQRGGSLQNYYMYFGGTN 308
Query: 319 FGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLG 378
FGRT+GGP ITSYDYDAP+DEYGLL +PKWGHLKDLHAAIKLCEPALVAADS QYIKLG
Sbjct: 309 FGRTAGGPTQITSYDYDAPLDEYGLLRQPKWGHLKDLHAAIKLCEPALVAADSPQYIKLG 368
Query: 379 QNQEAHVYR--ANRYG-----SQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPD 431
QEAHVYR +N G ++ C+AF+ANIDEH +A+V F GQ +TLPPWSV
Sbjct: 369 PKQEAHVYRGTSNNIGQYMSLNEGICAAFIANIDEHESATVKFYGQEFTLPPWSVVFCQI 428
Query: 432 CRNTVFNTAKVSSQTSIK---TVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIG 488
+ + + K + F L + I + +++ S S+SWMT+KEP+G
Sbjct: 429 AEIQLSTQLRWGHKLQSKQWAQILFQLGI---ILCFYKLSLKASSESFSQSWMTLKEPLG 485
Query: 489 VWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVL 548
VW + NFT +GILEHLNVTKD SDYLW++T+IY+SDDDISFW+ N+V PT+ IDSMRD +
Sbjct: 486 VWGDKNFTSKGILEHLNVTKDQSDYLWYLTRIYISDDDISFWEENDVSPTIDIDSMRDFV 545
Query: 549 RVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVK 608
R+F+NGQL GSV G W+KVVQPV+ GYND++LLS+TVGLQNYGAFLEKDGAGF+GQ+K
Sbjct: 546 RIFVNGQLAGSVKGKWIKVVQPVKLVQGYNDILLLSETVGLQNYGAFLEKDGAGFKGQIK 605
Query: 609 LTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENE-AEWTDLTRDGIPSTFTWYKTYFD 667
LTG K+GDI+L+ LWTYQVGL+GEF ++Y + E A WT+ PS F+WYKT FD
Sbjct: 606 LTGCKSGDINLTTSLWTYQVGLRGEFLEVYDVNSTESAGWTEFPTGTTPSVFSWYKTKFD 665
Query: 668 APDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCG 727
AP G DPVALD SMGKGQAWVNGHH+GRYWT+VAP GC TCDYRGAY+SDKC TNCG
Sbjct: 666 APGGTDPVALDFSSMGKGQAWVNGHHVGRYWTLVAPNNGCGRTCDYRGAYHSDKCRTNCG 725
Query: 728 NPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWS 787
TQ WYH+PRSWL+ NN+LVIFEET PF+IS+ RST +C QVSE HYPP+ KWS
Sbjct: 726 EITQAWYHIPRSWLKTLNNVLVIFEETDKTPFDISISTRSTETICAQVSEKHYPPLHKWS 785
Query: 788 NSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
+S D KLS+ PEMHL C +G+ ISSIEFASYG+P G CQKFS+G CHA SLSVV
Sbjct: 786 HS-EFDRKLSLMDKTPEMHLQCDEGHTISSIEFASYGSPNGSCQKFSQGKCHAANSLSVV 844
Query: 848 SE 849
S+
Sbjct: 845 SQ 846
>gi|357153898|ref|XP_003576603.1| PREDICTED: beta-galactosidase 15-like [Brachypodium distachyon]
Length = 908
Score = 1150 bits (2976), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 527/826 (63%), Positives = 640/826 (77%), Gaps = 11/826 (1%)
Query: 33 VSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADV 92
V + FF+PFNVSYDHRA+ + G RRML+SAG+HYPRATPEMWP +IAK KEGGADV
Sbjct: 38 VGKGTDGLFFEPFNVSYDHRAVRVGGERRMLVSAGVHYPRATPEMWPSIIAKCKEGGADV 97
Query: 93 IETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWL 152
IETY+FWN HE +GQY F+ + D+V+F+KLV + GL+L LRIGPY CAEWNFGGFPVWL
Sbjct: 98 IETYIFWNGHEPAKGQYYFEERFDLVRFIKLVAAEGLFLFLRIGPYACAEWNFGGFPVWL 157
Query: 153 RDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYG 212
RDIPGIEFRT+N P+K EMQ FV KIVD+M++E L+SWQGGPII+ QIENEYGN++ YG
Sbjct: 158 RDIPGIEFRTDNEPYKAEMQTFVTKIVDMMKDEKLYSWQGGPIILQQIENEYGNIQGKYG 217
Query: 213 QQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTE 272
Q GK Y++WAA MALGL G+PWVMC+QTDAPE I+D CN +YCDG+KPNSYNKPT+WTE
Sbjct: 218 QAGKRYMQWAAQMALGLDTGIPWVMCRQTDAPEQILDTCNAFYCDGFKPNSYNKPTIWTE 277
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSY 332
+WDGWY WGG LPHRP ED AFAVARF+QRGGS NYYMYFGGTNF RT+GGP ITSY
Sbjct: 278 DWDGWYADWGGPLPHRPAEDSAFAVARFYQRGGSLQNYYMYFGGTNFARTAGGPLQITSY 337
Query: 333 DYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAAD-SAQYIKLGQNQEAHVYRANRY 391
DYDAPI+EYG+L +PKWGHLKDLH AIKLCEPAL+A D S QY+KLG QEAH+Y + +
Sbjct: 338 DYDAPINEYGMLRQPKWGHLKDLHTAIKLCEPALIAVDGSPQYVKLGSMQEAHIYSSAKV 397
Query: 392 -------GSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSS 444
G+ CSAFLANIDEH SV G+SY LPPWSVSILPDC N FNTA+V +
Sbjct: 398 HTNGSTAGNAQICSAFLANIDEHKYVSVWIFGKSYNLPPWSVSILPDCENVAFNTARVGA 457
Query: 445 QTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHL 504
QTS+ T E P + P + + S S +W T KE IG W + +F QGILEHL
Sbjct: 458 QTSVFTFESGSPSHSSRREPSVLLPGVRGSYLSSTWWTSKETIGTWGDGSFATQGILEHL 517
Query: 505 NVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW 564
NVTKD SDYLW+ T + +SD+D++FW + V P++ ID +RDV RVF+NG+L GS +GHW
Sbjct: 518 NVTKDISDYLWYTTSVNISDEDVAFWSSKGVLPSLIIDQIRDVARVFVNGKLAGSQVGHW 577
Query: 565 VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILW 624
V + QP++F G N+L LLS+ VGLQNYGAFLEKDGAGF+GQVKLTG NGD DL+ W
Sbjct: 578 VSLKQPIQFVRGLNELTLLSEIVGLQNYGAFLEKDGAGFKGQVKLTGLSNGDTDLTNSAW 637
Query: 625 TYQVGLKGEFQQIYSIEENE-AEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMG 683
TYQVGLKGEF IY+ E+ E AEW+ + D I S FTWYKT DAP+G DPVA+DLGSMG
Sbjct: 638 TYQVGLKGEFSMIYTPEKQECAEWSAMQTDNIQSPFTWYKTMVDAPEGTDPVAIDLGSMG 697
Query: 684 KGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQA 743
KGQAWVNG IGRYW++VAP+ GC +C+Y GAY+ KC +NCG PTQ+WYH+PR WLQ
Sbjct: 698 KGQAWVNGRLIGRYWSLVAPESGCPSSCNYPGAYSETKCQSNCGMPTQSWYHIPREWLQE 757
Query: 744 SNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAP 803
SNNLLV+FEETGG+P +IS+++ T+ +C ++SE++YPP+ W S+ G++S++ +AP
Sbjct: 758 SNNLLVLFEETGGDPSKISLEVHYTKTICSRISENYYPPLSAW--SWLDTGRVSVDSVAP 815
Query: 804 EMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
E+ L C DGY IS I FASYGTP G CQ FS+G CHA +L V+E
Sbjct: 816 ELLLRCDDGYEISRITFASYGTPSGGCQNFSKGKCHAASTLDFVTE 861
>gi|115488372|ref|NP_001066673.1| Os12g0429200 [Oryza sativa Japonica Group]
gi|122234131|sp|Q0INM3.1|BGL15_ORYSJ RecName: Full=Beta-galactosidase 15; Short=Lactase 15; Flags:
Precursor
gi|113649180|dbj|BAF29692.1| Os12g0429200 [Oryza sativa Japonica Group]
Length = 919
Score = 1150 bits (2975), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 533/818 (65%), Positives = 632/818 (77%), Gaps = 12/818 (1%)
Query: 41 FFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWN 100
FF+PFNV+YDHRA++I G RRML+SAG+HYPRATPEMWP LIAK KEGGADVIETYVFWN
Sbjct: 58 FFEPFNVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGADVIETYVFWN 117
Query: 101 AHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEF 160
HE +GQY F+ + D+VKF KLV + GL+L LRIGPY CAEWNFGGFPVWLRDIPGIEF
Sbjct: 118 GHEPAKGQYYFEERFDLVKFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIEF 177
Query: 161 RTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVK 220
RT+N PFK EMQ FV KIV LM+EE L+SWQGGPII+ QIENEYGN++ +YGQ GK Y++
Sbjct: 178 RTDNEPFKAEMQTFVTKIVTLMKEEKLYSWQGGPIILQQIENEYGNIQGNYGQAGKRYMQ 237
Query: 221 WAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTT 280
WAA MA+GL G+PWVMC+QTDAPE IID CN +YCDG+KPNSYNKPT+WTE+WDGWY
Sbjct: 238 WAAQMAIGLDTGIPWVMCRQTDAPEEIIDTCNAFYCDGFKPNSYNKPTIWTEDWDGWYAD 297
Query: 281 WGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDE 340
WGG LPHRP ED AFAVARF+QRGGS NYYMYFGGTNF RT+GGP ITSYDYDAPIDE
Sbjct: 298 WGGALPHRPAEDSAFAVARFYQRGGSLQNYYMYFGGTNFARTAGGPLQITSYDYDAPIDE 357
Query: 341 YGLLSEPKWGHLKDLHAAIKLCEPALVAAD-SAQYIKLGQNQEAHVYRANRY-------G 392
YG+L +PKWGHLKDLH AIKLCEPAL+A D S QYIKLG QEAHVY G
Sbjct: 358 YGILRQPKWGHLKDLHTAIKLCEPALIAVDGSPQYIKLGSMQEAHVYSTGEVHTNGSMAG 417
Query: 393 SQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVE 452
+ CSAFLANIDEH ASV G+SY+LPPWSVSILPDC N FNTA++ +QTS+ TVE
Sbjct: 418 NAQICSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCENVAFNTARIGAQTSVFTVE 477
Query: 453 FSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSD 512
P + P + S S +W T KE IG W NNF VQGILEHLNVTKD SD
Sbjct: 478 SGSPSRSSRHKPSILSLTSGGPYLSSTWWTSKETIGTWGGNNFAVQGILEHLNVTKDISD 537
Query: 513 YLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVE 572
YLW+ T++ +SD D++FW + V P++TID +RDV RVF+NG+L GS +GHWV + QP++
Sbjct: 538 YLWYTTRVNISDADVAFWSSKGVLPSLTIDKIRDVARVFVNGKLAGSQVGHWVSLKQPIQ 597
Query: 573 FQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKG 632
G N+L LLS+ VGLQNYGAFLEKDGAGFRGQV LTG +GD+DL+ LWTYQVGLKG
Sbjct: 598 LVEGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVTLTGLSDGDVDLTNSLWTYQVGLKG 657
Query: 633 EFQQIYSIEENE-AEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNG 691
EF IY+ E+ A W+ + +D + FTWYKT F P G DPVA+DLGSMGKGQAWVNG
Sbjct: 658 EFSMIYAPEKQGCAGWSRMQKDSV-QPFTWYKTMFSTPKGTDPVAIDLGSMGKGQAWVNG 716
Query: 692 HHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIF 751
H IGRYW++VAP+ GC +C Y GAYN KC +NCG PTQ WYH+PR WL+ S+NLLV+F
Sbjct: 717 HLIGRYWSLVAPESGCSSSCYYPGAYNERKCQSNCGMPTQNWYHIPREWLKESDNLLVLF 776
Query: 752 EETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQD 811
EETGG+P IS++ + VC ++SE++YPP+ WS+ S G+ S+N PE+ L C D
Sbjct: 777 EETGGDPSLISLEAHYAKTVCSRISENYYPPLSAWSHLSS--GRASVNAATPELRLQCDD 834
Query: 812 GYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
G++IS I FASYGTP G C FS+GNCHA +L +V+E
Sbjct: 835 GHVISEITFASYGTPSGGCLNFSKGNCHASSTLDLVTE 872
>gi|242084926|ref|XP_002442888.1| hypothetical protein SORBIDRAFT_08g004410 [Sorghum bicolor]
gi|241943581|gb|EES16726.1| hypothetical protein SORBIDRAFT_08g004410 [Sorghum bicolor]
Length = 923
Score = 1140 bits (2950), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 528/819 (64%), Positives = 641/819 (78%), Gaps = 15/819 (1%)
Query: 41 FFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWN 100
FF+PFNV+YDHRA+I+ G RRML+SAG+HYPRATPEMWP LIAK+KEGG DVIETY+FWN
Sbjct: 63 FFEPFNVTYDHRALILGGKRRMLVSAGLHYPRATPEMWPSLIAKAKEGGVDVIETYIFWN 122
Query: 101 AHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEF 160
HE +GQY F+G+ DIV+F KLV + GL+L LRIGPY CAEWNFGGFPVWLRDIPGIEF
Sbjct: 123 GHEPAKGQYYFEGRFDIVRFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIEF 182
Query: 161 RTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVK 220
RT+N P+K EMQ FV KIVD+M+EE L+SWQGGPII+ QIENEYGN++ YGQ GK Y++
Sbjct: 183 RTDNEPYKAEMQNFVTKIVDIMKEEKLYSWQGGPIILQQIENEYGNIQGKYGQAGKRYMQ 242
Query: 221 WAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTT 280
WAA MAL L GVPWVMC+QTDAPE I+D CN +YCDG+KPNSYNKPT+WTE+WDGWY
Sbjct: 243 WAAQMALALDTGVPWVMCRQTDAPEQILDTCNAFYCDGFKPNSYNKPTIWTEDWDGWYAD 302
Query: 281 WGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDE 340
WG LPHRP +D AFAVARF+QRGGSF NYYMYFGGTNF RT+GGP ITSYDYDAPIDE
Sbjct: 303 WGEALPHRPAQDSAFAVARFYQRGGSFQNYYMYFGGTNFERTAGGPLQITSYDYDAPIDE 362
Query: 341 YGLLSEPKWGHLKDLHAAIKLCEPALVAAD-SAQYIKLGQNQEAHVYRANRY-------G 392
YG+L +PKWGHLKDLHAAIKLCEPAL A D S +YIKLG QEAHVY + G
Sbjct: 363 YGILRQPKWGHLKDLHAAIKLCEPALTAVDGSPRYIKLGPMQEAHVYSSENVHTNGSISG 422
Query: 393 SQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVE 452
+ CSAFLANIDEH ASV G+SY+LPPWSVSILPDC FNTA+V +QTS VE
Sbjct: 423 NAQFCSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCETVAFNTARVGTQTSFFNVE 482
Query: 453 FSLPLSPNISVPQ-QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYS 511
P + P+ S+ LSST W KEP+G+WSE+ F QGILEHLNVTKD S
Sbjct: 483 SGSPSYSSRHKPRILSLGGPYLSST---WWASKEPVGIWSEDIFAAQGILEHLNVTKDIS 539
Query: 512 DYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPV 571
DYL + T++ +SD+D+ +W + + P++TID +RDV+R+F+NG+L GS +GHWV + QP+
Sbjct: 540 DYLSYTTRVNISDEDVLYWNSEGLLPSLTIDQIRDVVRIFVNGKLAGSQVGHWVSLNQPL 599
Query: 572 EFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLK 631
+ G N+L LLS+ VGLQNYGAFLEKDGAGFRGQVKLTG NGDIDL+ LWTYQ+GLK
Sbjct: 600 QLVQGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVKLTGLSNGDIDLTNSLWTYQIGLK 659
Query: 632 GEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVN 690
GEF +IYS E + A W+ + D S FTW+KT FDAP+G PVA+DLGSMGKGQAWVN
Sbjct: 660 GEFSRIYSPEKQGSAGWSSMQNDDTLSPFTWFKTTFDAPEGNGPVAIDLGSMGKGQAWVN 719
Query: 691 GHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVI 750
GH IGRYW++VAP+ GC +C+Y G Y KC +NCG TQ+WYH+PR WLQ S+NLLV+
Sbjct: 720 GHLIGRYWSLVAPESGCPSSCNYAGNYGDSKCRSNCGIATQSWYHIPREWLQESDNLLVL 779
Query: 751 FEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQ 810
FEETGG+P +IS+++ T+ +C ++SE++YPP+ WS + +G+ S+N +APE+ L C
Sbjct: 780 FEETGGDPSQISLEVHYTKTICSKISETYYPPLSAWSR--AANGRPSVNTVAPELRLQCD 837
Query: 811 DGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+G++IS I FASYGTP G CQ FS GNCHA +L +V+E
Sbjct: 838 EGHVISKITFASYGTPTGDCQNFSVGNCHASTTLDLVAE 876
>gi|414878434|tpg|DAA55565.1| TPA: hypothetical protein ZEAMMB73_938277 [Zea mays]
Length = 918
Score = 1130 bits (2924), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 526/828 (63%), Positives = 634/828 (76%), Gaps = 16/828 (1%)
Query: 33 VSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADV 92
V TFF+PFNV+YDHRA+I+ G RRML+SAG+HYPRATPEMWP LIAK KEGG D
Sbjct: 49 VGGDDGGTFFEPFNVTYDHRALILGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGVDA 108
Query: 93 IETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWL 152
IETYVFWN HE +GQY F+G+ DIV+F KLV + GL+L LRIGPY CAEWNFGGFPVWL
Sbjct: 109 IETYVFWNGHEPAKGQYYFEGRFDIVRFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWL 168
Query: 153 RDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYG 212
RD+PGIEFRT+N P+K EMQ FV KIVD+M+EE L+SWQGGPII+ QIENEYGN++ YG
Sbjct: 169 RDVPGIEFRTDNEPYKAEMQIFVTKIVDIMKEEKLYSWQGGPIILQQIENEYGNIQGHYG 228
Query: 213 QQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTE 272
Q GK Y+ WAA MAL L GVPWVMC+QTDAPE I++ CN +YCDG+KPNSYNKPT+WTE
Sbjct: 229 QAGKRYMLWAAQMALALDTGVPWVMCRQTDAPEQILNTCNAFYCDGFKPNSYNKPTIWTE 288
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSY 332
+WDGWY WG LPHRP +D AFAVARF+QRGGS NYYMYFGGTNF RT+GGP ITSY
Sbjct: 289 DWDGWYADWGESLPHRPAQDSAFAVARFYQRGGSLQNYYMYFGGTNFERTAGGPLQITSY 348
Query: 333 DYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAAD-SAQYIKLGQNQEAHVYRANRY 391
DYDAPIDEYG+L +PKWGHLKDLHAAIKLCE AL A D S Y+KLG QEAHVY +
Sbjct: 349 DYDAPIDEYGILRQPKWGHLKDLHAAIKLCESALTAVDGSPHYVKLGPMQEAHVYSSENV 408
Query: 392 -------GSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSS 444
G+ CSAFLANIDEH ASV G+SY+LPPWSVSILPDC FNTA+V +
Sbjct: 409 HTNGSISGNSQFCSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCETVAFNTARVGT 468
Query: 445 QTSIKTVEFSLPLSPNISVPQQSMIESKLSS--TSKSWMTVKEPIGVWSENNFTVQGILE 502
QTS VE SP+ S + I S + S +W T KEP+G+W E FT QGILE
Sbjct: 469 QTSFFNVESG---SPSYSSRHKPRILSLIGVPYLSTTWWTFKEPVGIWGEGIFTAQGILE 525
Query: 503 HLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG 562
HLNVTKD SDYL + T++ +S++D+ +W + P++TID +RDV RVF+NG+L GS +G
Sbjct: 526 HLNVTKDISDYLSYTTRVNISEEDVLYWNSKGFLPSLTIDQIRDVARVFVNGKLAGSKVG 585
Query: 563 HWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKI 622
HWV + QP++ G N+L LLS+ VGLQNYGAFLEKDGAGFRGQVKLTG NGDIDL+
Sbjct: 586 HWVSLNQPLQLVQGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVKLTGLSNGDIDLTNS 645
Query: 623 LWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGS 681
LWTYQ+GLKGEF +IYS E + AEW+ + D S FTW+KT FDAP+G PV +DLGS
Sbjct: 646 LWTYQIGLKGEFSRIYSPEYQGSAEWSSMQNDDTVSPFTWFKTMFDAPEGNGPVTIDLGS 705
Query: 682 MGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWL 741
MGKGQAWVNGH IGRYW++VAP+ GC +C+Y G Y+ KC +NCG TQ+WYH+PR WL
Sbjct: 706 MGKGQAWVNGHLIGRYWSLVAPESGCPSSCNYAGTYSDSKCRSNCGIATQSWYHIPREWL 765
Query: 742 QASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKM 801
Q S NLLV+FEETGG+P +IS+++ T+ +C ++SE++YPP+ WS + +G+ S+N +
Sbjct: 766 QESGNLLVLFEETGGDPSQISLEVHYTKTICSKISETYYPPLSAWSR--AANGRPSVNTV 823
Query: 802 APEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
APE+ L C DG++IS I FASYGTP G CQ FS GNCHA +L +V E
Sbjct: 824 APELRLQCDDGHVISKITFASYGTPTGGCQNFSVGNCHASTTLDLVVE 871
>gi|449517114|ref|XP_004165591.1| PREDICTED: beta-galactosidase 9-like, partial [Cucumis sativus]
Length = 763
Score = 1121 bits (2899), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 516/713 (72%), Positives = 602/713 (84%), Gaps = 10/713 (1%)
Query: 147 GFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGN 206
GFP+WLRD+PGIEFRT+NAPFKEEMQRFVKKIVDL+R+E LF WQGGP+IMLQ+ENEYGN
Sbjct: 6 GFPLWLRDVPGIEFRTDNAPFKEEMQRFVKKIVDLLRDEKLFCWQGGPVIMLQVENEYGN 65
Query: 207 MESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNK 266
+ESSYG++G++Y+KW +MALGLGA VPWVMC+Q DAP II++CNGYYCDG+K NS +K
Sbjct: 66 IESSYGKRGQEYIKWVGNMALGLGAEVPWVMCQQKDAPSTIINSCNGYYCDGFKANSPSK 125
Query: 267 PTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGP 326
P WTENW+GW+T+WG R PHRPVEDLAF+VARFFQR GSF NYYMYFGGTNFGRT+GGP
Sbjct: 126 PIFWTENWNGWFTSWGERSPHRPVEDLAFSVARFFQREGSFQNYYMYFGGTNFGRTAGGP 185
Query: 327 FYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVY 386
FYITSYDYD+PIDEYGL+ EPKWGHLKDLH A+KLCEPALV+ADS QYIKLG QEAHVY
Sbjct: 186 FYITSYDYDSPIDEYGLIREPKWGHLKDLHTALKLCEPALVSADSPQYIKLGPKQEAHVY 245
Query: 387 RA---------NRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVF 437
++ G+ NCSAFLANIDE A +V F GQ+Y LPPWSVSILPDC+N VF
Sbjct: 246 HMKSQTDDLTLSKLGTLRNCSAFLANIDERKAVAVKFNGQTYNLPPWSVSILPDCQNVVF 305
Query: 438 NTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTV 497
NTAKV++QTSIK +E PLS N+S+ + +++LS + SWMTVKEPIG+WS+ NFTV
Sbjct: 306 NTAKVAAQTSIKILELYAPLSANVSLKLHATDQNELSIIANSWMTVKEPIGIWSDQNFTV 365
Query: 498 QGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLT 557
+GILEHLNVTKD SDYLW++T+I+VS+DDI FWK + PT+TIDS+RDV RVF+NG+LT
Sbjct: 366 KGILEHLNVTKDRSDYLWYMTRIHVSNDDIRFWKERNITPTITIDSVRDVFRVFVNGKLT 425
Query: 558 GSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI 617
GS IG WVK VQPV+F GYNDL+LLSQ +GLQN GAF+EKDGAG RG++KLTGFKNGDI
Sbjct: 426 GSAIGQWVKFVQPVQFLEGYNDLLLLSQAMGLQNSGAFIEKDGAGIRGRIKLTGFKNGDI 485
Query: 618 DLSKILWTYQVGLKGEFQQIYSIEENE-AEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVA 676
DLSK LWTYQVGLKGEF YS+EENE A+WT+L+ D IPSTFTWYK YF +PDG DPVA
Sbjct: 486 DLSKSLWTYQVGLKGEFLNFYSLEENEKADWTELSVDAIPSTFTWYKAYFSSPDGTDPVA 545
Query: 677 LDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHV 736
++LGSMGKGQAWVNGHHIGRYW+VV+PK GC CDYRGAYNS KC TNCG PTQ+WYH+
Sbjct: 546 INLGSMGKGQAWVNGHHIGRYWSVVSPKDGCPRKCDYRGAYNSGKCATNCGRPTQSWYHI 605
Query: 737 PRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKL 796
PRSWL+ S+NLLV+FEETGGNP EI VKL ST ++C QVSESHYP +RK SN Y DG+
Sbjct: 606 PRSWLKESSNLLVLFEETGGNPLEIVVKLYSTGVICGQVSESHYPSLRKLSNDYISDGET 665
Query: 797 SINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
N+ PEM LHC DG++ISS+EFASYGTPQG C KFSRG CHA SLSVVS+
Sbjct: 666 LSNRANPEMFLHCDDGHVISSVEFASYGTPQGSCNKFSRGPCHATNSLSVVSQ 718
>gi|168045621|ref|XP_001775275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673356|gb|EDQ59880.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 916
Score = 972 bits (2513), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 464/832 (55%), Positives = 587/832 (70%), Gaps = 24/832 (2%)
Query: 31 SCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGA 90
S + ++ + KP NV+YD RA++IDG RRMLISAGIHYPRATPEMWP +I +K+GGA
Sbjct: 16 SVTAFTTRACVRKPVNVTYDQRAVLIDGERRMLISAGIHYPRATPEMWPSIIQHAKDGGA 75
Query: 91 DVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPV 150
DV++TYVFWN HE +GQYNF+G+ D+VKF+KLV +GLY LRIGPYVCAEWNFGGFP
Sbjct: 76 DVVQTYVFWNGHEPEQGQYNFEGRYDLVKFIKLVKQAGLYFHLRIGPYVCAEWNFGGFPY 135
Query: 151 WLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESS 210
WL++IPGI FRT+N PFK MQ F KIV+LM+E LFSWQGGPIIM QIENEYG++ES
Sbjct: 136 WLKEIPGIVFRTDNEPFKVAMQGFTSKIVNLMKENELFSWQGGPIIMAQIENEYGDIESQ 195
Query: 211 YGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLW 270
+G GK YV+WAA MAL L VPW+MCKQ DAP NII+ CNG+YCDG+KPN+ KP LW
Sbjct: 196 FGDGGKRYVQWAADMALSLDTRVPWIMCKQEDAPANIINTCNGFYCDGWKPNTALKPILW 255
Query: 271 TENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYIT 330
TE+W+GW+ WG PHRPVED AFAVARFFQRGGSF NYYMYFGGTNF RT+GGPF T
Sbjct: 256 TEDWNGWFQNWGQAAPHRPVEDNAFAVARFFQRGGSFQNYYMYFGGTNFARTAGGPFMTT 315
Query: 331 SYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSA-QYIKLGQNQEAHVYRAN 389
+YDYDAPIDEYGL+ +PKWGHLKDLHAAIKLCEPAL A D+ Q +G NQEAH Y AN
Sbjct: 316 TYDYDAPIDEYGLIRQPKWGHLKDLHAAIKLCEPALTAVDTVPQSTWIGSNQEAHEYSAN 375
Query: 390 RYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIK 449
+C+AFLANID + +V F G+SY LP WSVSILPDC+N FNTA++ +QT++
Sbjct: 376 -----GHCAAFLANIDSENSVTVQFQGESYVLPAWSVSILPDCKNVAFNTAQIGAQTTVT 430
Query: 450 TVEFSLPLSP-NISVPQQSMIESKLSS----TSKSWMTVKEPIGVWSENNFTVQGILEHL 504
+ + S +I +P +++ +S + W EP G+ +LE L
Sbjct: 431 RMRIAPSNSRGDIFLPSNTLVHDHISDGGVFANLKWQASAEPFGIRGSGTTVSNSLLEQL 490
Query: 505 NVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW 564
N+TKD SDYLW+ T I ++ + ++ + + + +MRD + +F+NG+L GS +G
Sbjct: 491 NITKDTSDYLWYSTSITITSEGVTS-DVSGTEANLVLGTMRDAVHIFVNGKLAGSAMGWN 549
Query: 565 VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILW 624
++VVQP+ + G N + LLS T+GLQNYGA+LE GAG RG V +TG G++ LS W
Sbjct: 550 IQVVQPITLKDGKNSIDLLSMTLGLQNYGAYLETWGAGIRGSVSVTGLPYGNLSLSTAEW 609
Query: 625 TYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGK 684
+YQVGL+GE +++ + D + S TWYKT FDAP G DPVALDLGSMGK
Sbjct: 610 SYQVGLRGEELKLFHNGTADGFSWDSSSFTNASYLTWYKTTFDAPGGTDPVALDLGSMGK 669
Query: 685 GQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTW-------YHVP 737
GQAW+NGHH+GRY+ +VAP+ GC+ TCDYRGAYN++KC TNCG P+Q W YH+P
Sbjct: 670 GQAWINGHHLGRYFLMVAPQSGCE-TCDYRGAYNTNKCRTNCGEPSQRWQVIHFQMYHIP 728
Query: 738 RSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLS 797
R+WLQA+ NLLV+FEE GG+ ++SV RS VC ++ES PP+R W S+D +
Sbjct: 729 RAWLQATGNLLVLFEEIGGDISKVSVVTRSAHAVCAHINESQPPPIRTWRPHRSID---A 785
Query: 798 INKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
N A EM L C G I+ I+FAS+G P+G C F G CHA S+ V +
Sbjct: 786 FNNPA-EMLLECAAGQHITKIKFASFGNPRGSCGHFQHGTCHANKSMEAVRK 836
>gi|168001886|ref|XP_001753645.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695052|gb|EDQ81397.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 929
Score = 968 bits (2502), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 468/819 (57%), Positives = 578/819 (70%), Gaps = 21/819 (2%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
KP NV+YD RA+II+G RRMLISAGIHYPRATPEMWP L+ KSKEGGADV+++YVFWN H
Sbjct: 31 KPINVTYDQRALIINGQRRMLISAGIHYPRATPEMWPSLVQKSKEGGADVVQSYVFWNGH 90
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E +GQYNF+G+ D+VKF+K+V +GLY LRIGPYVCAEWNFGGFP WL+DIPGI FRT
Sbjct: 91 EPKQGQYNFEGRYDLVKFIKVVQQAGLYFHLRIGPYVCAEWNFGGFPYWLKDIPGIVFRT 150
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWA 222
+N PFK M+ FV KIV+LM+E LF+WQGGPIIM QIENEYGN+E ++G GK Y WA
Sbjct: 151 DNEPFKVAMEGFVSKIVNLMKENQLFAWQGGPIIMAQIENEYGNIEWAFGDGGKRYAMWA 210
Query: 223 ASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWG 282
A +ALGL AGVPWVMC+Q DAP NII+ CNGYYCDG+K N+ KP WTE+W+GW+ WG
Sbjct: 211 AELALGLDAGVPWVMCQQDDAPGNIINTCNGYYCDGFKANTATKPAFWTEDWNGWFQYWG 270
Query: 283 GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYG 342
+PHRPVED AFA+ARFFQRGGSF NYYMYFGGTNF RT+GGPF TSYDYDAP+DEYG
Sbjct: 271 QSVPHRPVEDNAFAIARFFQRGGSFQNYYMYFGGTNFARTAGGPFMTTSYDYDAPLDEYG 330
Query: 343 LLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIK-LGQNQEAHVYRANRYGSQSNCSAFL 401
L+ +PKWGHL+DLHAAIKLCEPAL A D LG N EAHVY + C+AFL
Sbjct: 331 LIRQPKWGHLRDLHAAIKLCEPALTAVDEVPLSTWLGPNVEAHVYSG-----RGQCAAFL 385
Query: 402 ANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEF-SLPLSPN 460
ANID A+V F G++Y LPPWSVSILPDC+N VFNTA+V +QT++ + L
Sbjct: 386 ANIDSWKIATVQFKGKAYVLPPWSVSILPDCKNVVFNTAQVGAQTTLTRMTIVRSKLEGE 445
Query: 461 ISVPQQSMIESKLSSTSKS---WMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHI 517
+ +P + + S S W EP+G+ +LE LN+TKD +DYLW+
Sbjct: 446 VVMPSNMLRKHAPESIVGSGLKWEASVEPVGIRGAATLVSNRLLEQLNITKDSTDYLWYS 505
Query: 518 TQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGY 577
I VS + ++ + + + + SMRD + +F+N QL GS +G V+VVQPV + G
Sbjct: 506 ISIKVSVEAVTALSKTKSQAILVLGSMRDAVHIFVNRQLVGSAMGSDVQVVQPVPLKEGK 565
Query: 578 NDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI 637
ND+ LLS TVGLQNYGA+LE GAG RG L G +G +DLS W+YQVG++GE +++
Sbjct: 566 NDIDLLSMTVGLQNYGAYLETWGAGIRGSALLRGLPSGVLDLSTERWSYQVGIQGEEKRL 625
Query: 638 YSIEENEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGR 696
+ + +W + S TWYKT FDAP G DPVALDLGSMGKGQAWVNGHH+GR
Sbjct: 626 FETGTADGIQWDSSSSFPNASALTWYKTTFDAPKGTDPVALDLGSMGKGQAWVNGHHMGR 685
Query: 697 YW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTW-----YHVPRSWLQASNNLLVI 750
YW +V+A + GC TCDYRGAY++DKC TNCG P+Q W YH+PR+WLQ SNNLLV+
Sbjct: 686 YWPSVLASQSGC-STCDYRGAYDADKCRTNCGKPSQRWQYVDMYHIPRAWLQLSNNLLVL 744
Query: 751 FEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQ 810
FEE GG+ ++S+ RS VC V ES PPV W + S+D +++ + E L C
Sbjct: 745 FEEIGGDVSKVSLVTRSAPAVCTHVHESQPPPVLFWPANSSMD---AMSSRSGEAVLECI 801
Query: 811 DGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
G I I+FAS+G P+G C F RG CHA SL V +
Sbjct: 802 AGQHIRHIKFASFGNPKGSCGNFQRGTCHAMKSLEVARK 840
>gi|414878435|tpg|DAA55566.1| TPA: hypothetical protein ZEAMMB73_938277 [Zea mays]
Length = 774
Score = 953 bits (2464), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 446/714 (62%), Positives = 542/714 (75%), Gaps = 16/714 (2%)
Query: 147 GFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGN 206
GFPVWLRD+PGIEFRT+N P+K EMQ FV KIVD+M+EE L+SWQGGPII+ QIENEYGN
Sbjct: 19 GFPVWLRDVPGIEFRTDNEPYKAEMQIFVTKIVDIMKEEKLYSWQGGPIILQQIENEYGN 78
Query: 207 MESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNK 266
++ YGQ GK Y+ WAA MAL L GVPWVMC+QTDAPE I++ CN +YCDG+KPNSYNK
Sbjct: 79 IQGHYGQAGKRYMLWAAQMALALDTGVPWVMCRQTDAPEQILNTCNAFYCDGFKPNSYNK 138
Query: 267 PTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGP 326
PT+WTE+WDGWY WG LPHRP +D AFAVARF+QRGGS NYYMYFGGTNF RT+GGP
Sbjct: 139 PTIWTEDWDGWYADWGESLPHRPAQDSAFAVARFYQRGGSLQNYYMYFGGTNFERTAGGP 198
Query: 327 FYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAAD-SAQYIKLGQNQEAHV 385
ITSYDYDAPIDEYG+L +PKWGHLKDLHAAIKLCE AL A D S Y+KLG QEAHV
Sbjct: 199 LQITSYDYDAPIDEYGILRQPKWGHLKDLHAAIKLCESALTAVDGSPHYVKLGPMQEAHV 258
Query: 386 YRANRY-------GSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFN 438
Y + G+ CSAFLANIDEH ASV G+SY+LPPWSVSILPDC FN
Sbjct: 259 YSSENVHTNGSISGNSQFCSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCETVAFN 318
Query: 439 TAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSS--TSKSWMTVKEPIGVWSENNFT 496
TA+V +QTS VE SP+ S + I S + S +W T KEP+G+W E FT
Sbjct: 319 TARVGTQTSFFNVESG---SPSYSSRHKPRILSLIGVPYLSTTWWTFKEPVGIWGEGIFT 375
Query: 497 VQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQL 556
QGILEHLNVTKD SDYL + T++ +S++D+ +W + P++TID +RDV RVF+NG+L
Sbjct: 376 AQGILEHLNVTKDISDYLSYTTRVNISEEDVLYWNSKGFLPSLTIDQIRDVARVFVNGKL 435
Query: 557 TGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGD 616
GS +GHWV + QP++ G N+L LLS+ VGLQNYGAFLEKDGAGFRGQVKLTG NGD
Sbjct: 436 AGSKVGHWVSLNQPLQLVQGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVKLTGLSNGD 495
Query: 617 IDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPV 675
IDL+ LWTYQ+GLKGEF +IYS E + AEW+ + D S FTW+KT FDAP+G PV
Sbjct: 496 IDLTNSLWTYQIGLKGEFSRIYSPEYQGSAEWSSMQNDDTVSPFTWFKTMFDAPEGNGPV 555
Query: 676 ALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYH 735
+DLGSMGKGQAWVNGH IGRYW++VAP+ GC +C+Y G Y+ KC +NCG TQ+WYH
Sbjct: 556 TIDLGSMGKGQAWVNGHLIGRYWSLVAPESGCPSSCNYAGTYSDSKCRSNCGIATQSWYH 615
Query: 736 VPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGK 795
+PR WLQ S NLLV+FEETGG+P +IS+++ T+ +C ++SE++YPP+ WS + +G+
Sbjct: 616 IPREWLQESGNLLVLFEETGGDPSQISLEVHYTKTICSKISETYYPPLSAWSR--AANGR 673
Query: 796 LSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
S+N +APE+ L C DG++IS I FASYGTP G CQ FS GNCHA +L +V E
Sbjct: 674 PSVNTVAPELRLQCDDGHVISKITFASYGTPTGGCQNFSVGNCHASTTLDLVVE 727
>gi|116787095|gb|ABK24373.1| unknown [Picea sitchensis]
Length = 861
Score = 927 bits (2397), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 452/812 (55%), Positives = 573/812 (70%), Gaps = 23/812 (2%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YDHR+++IDG RR+LIS IHYPR+TPEMWPD+I K+K+GG DVIE+YVFWN HE
Sbjct: 30 NVTYDHRSLLIDGQRRVLISGSIHYPRSTPEMWPDIIQKAKDGGLDVIESYVFWNMHEPK 89
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+ +Y F+ + D+VKFVK+V +GL + LRIGPY CAEWN+GGFPVWL IPGI FRT+N
Sbjct: 90 QNEYYFEDRFDLVKFVKIVQQAGLLVHLRIGPYACAEWNYGGFPVWLHLIPGIHFRTDNE 149
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK EMQRF KIVD+M++E LF+ QGGPII+ QIENEYGN++ YG GK YVKWAASM
Sbjct: 150 PFKNEMQRFTAKIVDMMKQEKLFASQGGPIILAQIENEYGNIDGPYGAAGKSYVKWAASM 209
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL GVPWVMC+Q DAP+ II+ CNG+YCD + PNS NKP +WTENW GW+ ++GGRL
Sbjct: 210 AVGLNTGVPWVMCQQADAPDPIINTCNGFYCDAFTPNSPNKPKMWTENWSGWFLSFGGRL 269
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P RP EDLAF+VARFFQRGG+F NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYG++
Sbjct: 270 PFRPTEDLAFSVARFFQRGGTFQNYYMYHGGTNFGRTTGGPFIATSYDYDAPIDEYGIVR 329
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLK+LH AIKLCE ALV A+S Y LG EAHVY C+AFLAN +
Sbjct: 330 QPKWGHLKELHKAIKLCEAALVNAES-NYTSLGSGLEAHVYSP----GSGTCAAFLANSN 384
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSP-NISVP 464
+ A+V F G SY LP WSVSILPDC+N VFNTAK+ SQT+ S+ ++P N+ +
Sbjct: 385 TQSDATVKFNGNSYHLPAWSVSILPDCKNVVFNTAKIGSQTT------SVQMNPANLILA 438
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
+ ++ S+ + SW + E IG+ N F+ G+LE +N T D SDYLW+ T I V D
Sbjct: 439 GSNSMKGTDSANAASWSWLHEQIGIGGSNTFSKPGLLEQINTTVDSSDYLWYTTSIQVDD 498
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDL 580
++ + N +P + + S+ L VFING+ G G + + P+ +SG N++
Sbjct: 499 NEP--FLHNGTQPVLHVQSLGHALHVFINGEFAGRGAGSSSSSKIALQTPITLKSGKNNI 556
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS TVGLQNYG+F + GAG G V L GFK+G+ DLS WTYQ+GL GE IYS
Sbjct: 557 DLLSITVGLQNYGSFFDTWGAGITGPVILQGFKDGEHDLSTQQWTYQIGLTGEQLGIYSG 616
Query: 641 E-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW- 698
+ + A+W + WYKT FDAP G DPVAL+L MGKG AWVNG IGRYW
Sbjct: 617 DTKASAQWVAGSDLPTKQPMIWYKTNFDAPSGNDPVALNLLGMGKGVAWVNGQSIGRYWP 676
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
+ +A + GC D+CDYRGAY+S KC TNCG P+Q YHVPRSW+Q + N+LV+FEE GG+P
Sbjct: 677 SYIASQSGCTDSCDYRGAYSSTKCQTNCGQPSQKLYHVPRSWIQPTGNVLVLFEELGGDP 736
Query: 759 FEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDG-YIISS 817
+IS RS +C QVSE+H PPV W +S + L +NK E+ LHC ++I S
Sbjct: 737 TQISFMTRSVGSLCAQVSETHLPPVDSWKSSAT--SGLEVNKPKAELQLHCPSSRHLIKS 794
Query: 818 IEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
I+FAS+GT +G C F+ G+C+ ++S+V E
Sbjct: 795 IKFASFGTSKGSCGSFTYGHCNTNSTMSIVEE 826
>gi|14970841|emb|CAC44501.1| beta-galactosidase [Fragaria x ananassa]
Length = 840
Score = 905 bits (2340), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 449/812 (55%), Positives = 560/812 (68%), Gaps = 45/812 (5%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYDHRA++IDG RR+L+S IHYPR+TPEMWPDLI KSK+GG DVIETYVFWN HE +R
Sbjct: 30 VSYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 89
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQYNF+G+ND+V FVK V +GLY+ LRIGPYVCAEWN+GGFP+WL IPGI+ RT+N P
Sbjct: 90 GQYNFEGRNDLVGFVKAVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKLRTDNEP 149
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
+K EM RF KIV++M+ E L++ QGGPII+ QIENEYGN++ +YG K Y+ WAA+MA
Sbjct: 150 YKAEMHRFTAKIVEMMKNEKLYASQGGPIILSQIENEYGNIDKAYGPAAKTYINWAANMA 209
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+ L GVPWVMC+Q DAP ++I+ CNG+YCD + PNS + P +WTENW GW+ ++GG +P
Sbjct: 210 VSLDTGVPWVMCQQADAPSSVINTCNGFYCDQFSPNSNSTPKIWTENWSGWFLSFGGAVP 269
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
RPVEDLAFAVARF+QRGG+F NYYMY GGTNFGR+SGGPF TSYDYDAP+DEYGLL +
Sbjct: 270 QRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSSGGPFIATSYDYDAPLDEYGLLRQ 329
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHLKD+H AIKLCEPA+VA D LGQN EA VY+ + S CSAFLAN+D
Sbjct: 330 PKWGHLKDVHKAIKLCEPAMVATDPT-ISSLGQNIEAAVYK-----TGSVCSAFLANVDT 383
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+ A+VTF G SY LP WSVSILPDC+N V NTAK+++ T + P+ +
Sbjct: 384 KSDATVTFNGNSYQLPAWSVSILPDCKNVVINTAKINTATMV----------PSFTRQSI 433
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
S + W + EP+G+ + FT G+LE +N T D SDYLW+ T I V
Sbjct: 434 SADVEPTEAVGSGWSWINEPVGISKGDAFTRVGLLEQINTTADKSDYLWYSTSIDVKGG- 492
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLIL 582
+ + + S+ L F+NG+L GS G+ V V PVEF SG N + L
Sbjct: 493 --------YKADLHVQSLGHALHAFVNGKLAGSGTGNSGNAKVSVEIPVEFASGKNTIDL 544
Query: 583 LSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG-DIDLSKILWTYQVGLKGEFQQIYSIE 641
LS TVGLQNYGAF + GAG G V+L G NG IDLS WTYQ+GLKGE + + S
Sbjct: 545 LSLTVGLQNYGAFFDLVGAGITGPVQLKGSANGTTIDLSSQQWTYQIGLKGEDEDLPS-- 602
Query: 642 ENEAEWTDLTRDGIPST--FTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW- 698
++W +++ +P TWYKT FDAP G +PVALD MGKG+AWVNG IGRYW
Sbjct: 603 -GSSQW--ISQPTLPKNQPLTWYKTQFDAPGGSNPVALDFTGMGKGEAWVNGQSIGRYWP 659
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
T VAPK GC D C+YRGAY++DKC NCG P+Q YHVPRSW+++S N LV+FEE GG+P
Sbjct: 660 TNVAPKTGCTD-CNYRGAYSADKCRKNCGMPSQKLYHVPRSWMKSSGNTLVLFEEVGGDP 718
Query: 759 FEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQ-DGYIISS 817
++S R +C VSESH PV WS+ D K +K P + L C +ISS
Sbjct: 719 TQLSFATRQVESLCSHVSESHPSPVDMWSS----DSKAG-SKSRPRLSLECPFPNQVISS 773
Query: 818 IEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
I+FASYG P G C FS G+C + +LS+V +
Sbjct: 774 IKFASYGRPSGTCGSFSHGSCRSSRALSIVQK 805
>gi|125536446|gb|EAY82934.1| hypothetical protein OsI_38151 [Oryza sativa Indica Group]
Length = 705
Score = 904 bits (2336), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/632 (66%), Positives = 493/632 (78%), Gaps = 10/632 (1%)
Query: 41 FFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWN 100
FF+PFNV+YDHRA++I G RRML+SAG+HYPRATPEMWP LIAK KEGGADVIETYVFWN
Sbjct: 58 FFEPFNVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKFKEGGADVIETYVFWN 117
Query: 101 AHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEF 160
HE +GQY F+ + D+VKF KLV + GL+L LRIGPY CAEWNFGGFPVWLRDIPGIEF
Sbjct: 118 GHEPAKGQYYFEERFDLVKFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIEF 177
Query: 161 RTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVK 220
RT+N PFK EMQ FV KIV LM+EE L+SWQGGPII+ QIENEYGN++ +YGQ GK Y++
Sbjct: 178 RTDNEPFKAEMQTFVTKIVTLMKEEKLYSWQGGPIILQQIENEYGNIQGNYGQAGKRYMQ 237
Query: 221 WAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTT 280
WAA MA+GL G+PWVMC+QTDAPE IID CN +YCDG+KPNSYNKPT+WTE+WDGWY
Sbjct: 238 WAAQMAIGLDTGIPWVMCRQTDAPEEIIDTCNAFYCDGFKPNSYNKPTIWTEDWDGWYAD 297
Query: 281 WGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDE 340
WGG LPHRP ED AFAVARF+QRGGS NYYMYFGGTNF RT+GGP ITSYDYDAPIDE
Sbjct: 298 WGGALPHRPAEDSAFAVARFYQRGGSLQNYYMYFGGTNFARTAGGPLQITSYDYDAPIDE 357
Query: 341 YGLLSEPKWGHLKDLHAAIKLCEPALVA-ADSAQYIKLGQNQEAHVYRANRY-------G 392
YG+L +PKWGHLKDLH AIKLCEPAL+A S QYIKLG QEAHVY G
Sbjct: 358 YGILRQPKWGHLKDLHTAIKLCEPALIAVVGSPQYIKLGSMQEAHVYSTGEVHTNGSMAG 417
Query: 393 SQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVE 452
+ CSAFLANIDEH ASV G+SY+LPPWSVSILPDC N FNTA++ +QTS+ TVE
Sbjct: 418 NAQICSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCENVAFNTARIGAQTSVFTVE 477
Query: 453 FSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSD 512
P + P + S S +W T KE IG W NNF VQGILEHLNVTKD SD
Sbjct: 478 SGSPSRSSRHKPSILSLTSGGPYLSSTWWTSKETIGTWGGNNFAVQGILEHLNVTKDISD 537
Query: 513 YLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVE 572
YLW+ T++ +SD D++FW + V P++TID +RDV RVF+NG+L GS +GHWV + QP++
Sbjct: 538 YLWYTTRVNISDADVAFWSSKGVLPSLTIDKIRDVARVFVNGKLAGSQVGHWVSLKQPIQ 597
Query: 573 FQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKG 632
G N+L LLS+ VGLQNYGAFLEKDGAGFRGQV LTG +GD+DL+ LWTYQVGLKG
Sbjct: 598 LVEGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVTLTGLSDGDVDLTNSLWTYQVGLKG 657
Query: 633 EFQQIYSIEENE-AEWTDLTRDGIPSTFTWYK 663
EF IY+ E+ A W+ + +D + FTWYK
Sbjct: 658 EFSMIYAPEKQGCAGWSRMQKDSV-QPFTWYK 688
>gi|108862584|gb|ABA97655.2| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 713
Score = 901 bits (2329), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/640 (66%), Positives = 494/640 (77%), Gaps = 18/640 (2%)
Query: 41 FFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWN 100
FF+PFNV+YDHRA++I G RRML+SAG+HYPRATPEMWP LIAK KEGGADVIETYVFWN
Sbjct: 58 FFEPFNVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGADVIETYVFWN 117
Query: 101 AHESIRGQYNFK--------GKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWL 152
HE +GQY F+ K D+VKF KLV + GL+L LRIGPY CAEWNFGGFPVWL
Sbjct: 118 GHEPAKGQYYFEERFDLVKFAKIDLVKFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWL 177
Query: 153 RDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYG 212
RDIPGIEFRT+N PFK EMQ FV KIV LM+EE L+SWQGGPII+ QIENEYGN++ +YG
Sbjct: 178 RDIPGIEFRTDNEPFKAEMQTFVTKIVTLMKEEKLYSWQGGPIILQQIENEYGNIQGNYG 237
Query: 213 QQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTE 272
Q GK Y++WAA MA+GL G+PWVMC+QTDAPE IID CN +YCDG+KPNSYNKPT+WTE
Sbjct: 238 QAGKRYMQWAAQMAIGLDTGIPWVMCRQTDAPEEIIDTCNAFYCDGFKPNSYNKPTIWTE 297
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSY 332
+WDGWY WGG LPHRP ED AFAVARF+QRGGS NYYMYFGGTNF RT+GGP ITSY
Sbjct: 298 DWDGWYADWGGALPHRPAEDSAFAVARFYQRGGSLQNYYMYFGGTNFARTAGGPLQITSY 357
Query: 333 DYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAAD-SAQYIKLGQNQEAHVYRANRY 391
DYDAPIDEYG+L +PKWGHLKDLH AIKLCEPAL+A D S QYIKLG QEAHVY
Sbjct: 358 DYDAPIDEYGILRQPKWGHLKDLHTAIKLCEPALIAVDGSPQYIKLGSMQEAHVYSTGEV 417
Query: 392 -------GSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSS 444
G+ CSAFLANIDEH ASV G+SY+LPPWSVSILPDC N FNTA++ +
Sbjct: 418 HTNGSMAGNAQICSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCENVAFNTARIGA 477
Query: 445 QTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHL 504
QTS+ TVE P + P + S S +W T KE IG W NNF VQGILEHL
Sbjct: 478 QTSVFTVESGSPSRSSRHKPSILSLTSGGPYLSSTWWTSKETIGTWGGNNFAVQGILEHL 537
Query: 505 NVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW 564
NVTKD SDYLW+ T++ +SD D++FW + V P++TID +RDV RVF+NG+L GS +GHW
Sbjct: 538 NVTKDISDYLWYTTRVNISDADVAFWSSKGVLPSLTIDKIRDVARVFVNGKLAGSQVGHW 597
Query: 565 VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILW 624
V + QP++ G N+L LLS+ VGLQNYGAFLEKDGAGFRGQV LTG +GD+DL+ LW
Sbjct: 598 VSLKQPIQLVEGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVTLTGLSDGDVDLTNSLW 657
Query: 625 TYQVGLKGEFQQIYSIEENE-AEWTDLTRDGIPSTFTWYK 663
TYQVGLKGEF IY+ E+ A W+ + +D + FTWYK
Sbjct: 658 TYQVGLKGEFSMIYAPEKQGCAGWSRMQKDSV-QPFTWYK 696
>gi|242036283|ref|XP_002465536.1| hypothetical protein SORBIDRAFT_01g040750 [Sorghum bicolor]
gi|241919390|gb|EER92534.1| hypothetical protein SORBIDRAFT_01g040750 [Sorghum bicolor]
Length = 860
Score = 900 bits (2327), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 436/810 (53%), Positives = 567/810 (70%), Gaps = 26/810 (3%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YDHRA++IDG RR+L+S IHYPR+TP+MWP +I K+K+GG DVIETYVFW+ HE +
Sbjct: 36 NVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGIIQKAKDGGLDVIETYVFWDIHEPV 95
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
RGQY+F+G+ D+ FVK V +GLY+ LRIGPYVCAEWN+GGFP+WL IPGI+FRT+N
Sbjct: 96 RGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTDNE 155
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK EMQRF K+VD M+ L++ QGGPII+ QIENEYGN++S+YG GK Y++WAA M
Sbjct: 156 PFKTEMQRFTAKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWAAGM 215
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ L GVPWVMC+QTDAP+ +I+ CNG+YCD + PNS KP +WTENW GW+ ++GG +
Sbjct: 216 AISLDTGVPWVMCQQTDAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGWFLSFGGAV 275
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RPVEDLAFAVARF+QRGG+F NYYMY GGTN R+SGGPF TSYDYDAPIDEYGL+
Sbjct: 276 PYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAPIDEYGLVR 335
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPKWGHL+D+H AIKLCEPAL+A D + Y LGQN EA VY+ + S C+AFLANID
Sbjct: 336 EPKWGHLRDVHKAIKLCEPALIATDPS-YTSLGQNAEAAVYK-----TGSVCAAFLANID 389
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ +VTF G+ Y LP WSVSILPDC+N V NTA+++SQ + + + L +
Sbjct: 390 GQSDKTVTFNGRMYRLPAWSVSILPDCKNVVLNTAQINSQVTSSEMRY---LESSNMASD 446
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
S I +L+ + W EP+G+ +N T G++E +N T D SD+LW+ T I V D
Sbjct: 447 GSFITPELAVS--GWSYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITVKGD 504
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVV----QPVEFQSGYNDLI 581
+ N + + ++S+ VL+V+ING++ GS G + +P+E G N +
Sbjct: 505 EPYL---NGSQSNLVVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKID 561
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS TVGL NYGAF + GAG G VKL+G NG +DLS WTYQ+GL+GE +Y
Sbjct: 562 LLSATVGLSNYGAFFDLVGAGITGPVKLSG-TNGALDLSSAEWTYQIGLRGEDLHLYDPS 620
Query: 642 ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TV 700
E EW I WYKT F P G DPVA+D MGKG+AWVNG IGRYW T
Sbjct: 621 EASPEWVSANAYPINQPLIWYKTKFTPPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTN 680
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
+AP+ GC ++C+YRG+YNS+KC CG P+QT YHVPRS+LQ +N +V+FE+ GG+P +
Sbjct: 681 LAPQSGCVNSCNYRGSYNSNKCLKKCGQPSQTLYHVPRSFLQPGSNDIVLFEQFGGDPSK 740
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHC-QDGYIISSIE 819
IS +R T VC QVSE H + W++S + ++ + PE+ L C +DG +ISSI+
Sbjct: 741 ISFVIRQTGSVCAQVSEEHPAQIDSWNSS-----QQTMQRYGPELRLECPKDGQVISSIK 795
Query: 820 FASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
FAS+GTP G C +S G C + +LSVV E
Sbjct: 796 FASFGTPSGTCGSYSHGECSSTQALSVVQE 825
>gi|326506982|dbj|BAJ95568.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 853
Score = 900 bits (2325), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/813 (53%), Positives = 570/813 (70%), Gaps = 32/813 (3%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YDHRA++IDG RR+L+S IHYPR+TP+MWP L+ K+K+GG DV+ETYVFW+ HE +
Sbjct: 29 NVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLMQKAKDGGLDVVETYVFWDVHEPV 88
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
RGQY+F+G+ND+V+FVK +GLY+ LRIGPYVCAEWN+GGFP+WL IPGI+ RT+N
Sbjct: 89 RGQYDFEGRNDLVRFVKAAADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKLRTDNE 148
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK EMQRF +K+V M+ L++ QGGPII+ QIENEYGN+ +SYG GK Y++WAA M
Sbjct: 149 PFKTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIAASYGAAGKSYIRWAAGM 208
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ L GVPWVMC+QTDAPE +I+ CNG+YCD + P+ ++P LWTENW GW+ ++GG +
Sbjct: 209 AVALDTGVPWVMCQQTDAPEPLINTCNGFYCDQFTPSLPSRPKLWTENWSGWFLSFGGAV 268
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RP EDLAFAVARF+QRGG+ NYYMY GGTNFGR+SGGPF TSYDYDAPIDEYGL+
Sbjct: 269 PYRPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDEYGLVR 328
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHL+D+H AIK+CEPAL+A D + Y+ LGQN EAHVY+ S S C+AFLANID
Sbjct: 329 QPKWGHLRDVHKAIKMCEPALIATDPS-YMSLGQNAEAHVYK-----SGSLCAAFLANID 382
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQ---TSIKTVEFSLPLSPNIS 462
+ + +VTF G++Y LP WSVSILPDC+N V NTA+++SQ T ++ + FS S
Sbjct: 383 DQSDKTVTFNGKAYKLPAWSVSILPDCKNVVLNTAQINSQVASTQMRNLGFSTQAS---- 438
Query: 463 VPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
S +E++L+++ SW EP+G+ EN T G++E +N T D SD+LW+ T I V
Sbjct: 439 --DGSSVEAELAAS--SWSYAVEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVV 494
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQL----TGSVIGHWVKVVQPVEFQSGYN 578
+ + N + + ++S+ VL+VFING+L GS + + PV +G N
Sbjct: 495 AGGEPYL---NGSQSNLLVNSLGHVLQVFINGKLAGSSKGSASSSLISLTTPVTLVTGKN 551
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIY 638
+ LLS TVGL NYGAF + GAG G VKLTG K G +DLS WTYQ+GL+GE +Y
Sbjct: 552 KIDLLSATVGLTNYGAFFDLVGAGITGPVKLTGPK-GTLDLSSAEWTYQIGLRGEDLHLY 610
Query: 639 SIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
+ E EW + TWYK+ F AP G DPVA+D MGKG+AWVNG IGRYW
Sbjct: 611 NPSEASPEWVSDNSYPTNNPLTWYKSKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYW 670
Query: 699 -TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
T +AP+ GC ++C+YRG+Y++ KC CG P+Q YHVPRS+LQ +N +V+FE+ GGN
Sbjct: 671 PTNIAPQSGCVNSCNYRGSYSATKCLKKCGQPSQILYHVPRSFLQPGSNDIVLFEQFGGN 730
Query: 758 PFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHC-QDGYIIS 816
P +IS + T VC VSE H + W V + + + P + L C ++G +IS
Sbjct: 731 PSKISFTTKQTESVCAHVSEDHPDQIDSW-----VSSQQKLQRSGPALRLECPKEGQVIS 785
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
SI+FAS+GTP G C +S G C + +L+V E
Sbjct: 786 SIKFASFGTPSGTCGSYSHGECSSSQALAVAQE 818
>gi|356550171|ref|XP_003543462.1| PREDICTED: beta-galactosidase 8-like isoform 1 [Glycine max]
Length = 840
Score = 896 bits (2315), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 453/834 (54%), Positives = 571/834 (68%), Gaps = 41/834 (4%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
+++++ L C+ S T F NV YDHRA++IDG RR+LIS IHYPR+TPEMWPDLI
Sbjct: 6 IVLVLFWLLCIHSP---TLFCA-NVEYDHRALVIDGKRRVLISGSIHYPRSTPEMWPDLI 61
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
KSK+GG DVIETYVFWN +E +RGQY+F G+ D+VKFVK V ++GLY+ LRIGPYVCAE
Sbjct: 62 QKSKDGGLDVIETYVFWNLNEPVRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAE 121
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
WN+GGFP+WL IPGI+FRT+N PFK EM+RF KIVD+++EE L++ QGGP+I+ QIEN
Sbjct: 122 WNYGGFPLWLHFIPGIKFRTDNEPFKAEMKRFTAKIVDMIKEENLYASQGGPVILSQIEN 181
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPN 262
EYGN++S+YG GK Y+KWAA+MA L GVPWVMC+Q DAP+ II+ CNG+YCD + PN
Sbjct: 182 EYGNIDSAYGAAGKSYIKWAATMATSLDTGVPWVMCQQADAPDPIINTCNGFYCDQFTPN 241
Query: 263 SYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRT 322
S KP +WTENW GW+ +GG +P+RPVEDLAFAVARFFQRGG+F NYYMY GGTNF RT
Sbjct: 242 SNTKPKMWTENWSGWFLPFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRT 301
Query: 323 SGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQE 382
SGGPF TSYDYDAPIDEYG++ +PKWGHLK++H AIKLCE AL+A D LG N E
Sbjct: 302 SGGPFIATSYDYDAPIDEYGIIRQPKWGHLKEVHKAIKLCEEALIATDPT-ITSLGPNLE 360
Query: 383 AHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKV 442
A VY+ + S C+AFLAN+D + +V F G SY LP WSVSILPDC+N V NTAK+
Sbjct: 361 AAVYK-----TGSVCAAFLANVDTKSDVTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKI 415
Query: 443 SSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILE 502
+S ++I + ++ + S+ SST SW++ EP+G+ ++F G+LE
Sbjct: 416 NSASAISSF--------TTESLKEDIGSSEASSTGWSWIS--EPVGISKADSFPQTGLLE 465
Query: 503 HLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG 562
+N T D SDYLW+ I D S + + I+S+ L FING+L GS G
Sbjct: 466 QINTTADKSDYLWYSLSIDYKGDAGS-------QTVLHIESLGHALHAFINGKLAGSQTG 518
Query: 563 HWVK----VVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGD-I 617
+ K V PV +G N + LLS TVGLQNYGAF + GAG G V L G NG+ +
Sbjct: 519 NSGKYKFTVDIPVTLVAGKNTIDLLSLTVGLQNYGAFFDTWGAGITGPVILKGLANGNTL 578
Query: 618 DLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVAL 677
DLS WTYQVGLKGE + S + +W + WYKT F AP G DPVA+
Sbjct: 579 DLSYQKWTYQVGLKGEDLGLSS--GSSGQWNSQSTFPKNQPLIWYKTTFAAPSGSDPVAI 636
Query: 678 DLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHV 736
D MGKG+AWVNG IGRYW T VA GC D+C+YRG Y++ KC NCG P+QT YHV
Sbjct: 637 DFTGMGKGEAWVNGQSIGRYWPTYVASDAGCTDSCNYRGPYSASKCRRNCGKPSQTLYHV 696
Query: 737 PRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKL 796
PRSWL+ S N+LV+FEE GG+P +IS + T +C VS+SH PPV W NS + G+
Sbjct: 697 PRSWLKPSGNILVLFEEKGGDPTQISFVTKQTESLCAHVSDSHPPPVDLW-NSDTESGR- 754
Query: 797 SINKMAPEMHLHC-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
K+ P + L C D +ISSI+FASYGTP G C F G C + +LS+V +
Sbjct: 755 ---KVGPVLSLTCPHDNQVISSIKFASYGTPLGTCGNFYHGRCSSNKALSIVQK 805
>gi|30683905|ref|NP_850121.1| beta-galactosidase 8 [Arabidopsis thaliana]
gi|152013364|sp|Q9SCV4.2|BGAL8_ARATH RecName: Full=Beta-galactosidase 8; Short=Lactase 8; AltName:
Full=Protein AR782; Flags: Precursor
gi|330253033|gb|AEC08127.1| beta-galactosidase 8 [Arabidopsis thaliana]
Length = 852
Score = 891 bits (2302), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 438/813 (53%), Positives = 560/813 (68%), Gaps = 35/813 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YDHRA++IDG R++LIS IHYPR+TPEMWP+LI KSK+GG DVIETYVFW+ HE
Sbjct: 31 NVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHEPE 90
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+ +YNF+G+ D+VKFVKL +GLY+ LRIGPYVCAEWN+GGFPVWL +PGI+FRT+N
Sbjct: 91 KNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTDNE 150
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFKEEMQRF KIVDLM++E L++ QGGPII+ QIENEYGN++S+YG K Y+KW+ASM
Sbjct: 151 PFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSASM 210
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
AL L GVPW MC+QTDAP+ +I+ CNG+YCD + PNS NKP +WTENW GW+ +G
Sbjct: 211 ALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGDPS 270
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RPVEDLAFAVARF+QRGG+F NYYMY GGTNF RTSGGP TSYDYDAPIDEYGLL
Sbjct: 271 PYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYGLLR 330
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHL+DLH AIKLCE AL+A D LG N EA VY+ +C+AFLAN+D
Sbjct: 331 QPKWGHLRDLHKAIKLCEDALIATDPT-ITSLGSNLEAAVYKTE----SGSCAAFLANVD 385
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ A+VTF G+SY LP WSVSILPDC+N FNTAK++S T + + +
Sbjct: 386 TKSDATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKINSATE------------STAFAR 433
Query: 466 QSMIESKLSSTS--KSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
QS+ SS W +KEPIG+ + F G+LE +N T D SDYLW+ + +
Sbjct: 434 QSLKPDGGSSAELGSQWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIK 493
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQ---PVEFQSGYNDL 580
D+ + + + I+S+ V+ FING+L GS GH + + P+ +G N +
Sbjct: 494 GDET--FLDEGSKAVLHIESLGQVVYAFINGKLAGS--GHGKQKISLDIPINLVTGTNTI 549
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG-DIDLSKILWTYQVGLKGEFQQIYS 639
LLS TVGL NYGAF + GAG G V L K G IDL+ WTYQVGLKGE + +
Sbjct: 550 DLLSVTVGLANYGAFFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLAT 609
Query: 640 IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW- 698
++ +EW + WYKT FDAP G +PVA+D GKG AWVNG IGRYW
Sbjct: 610 VD--SSEWVSKSPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWP 667
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
T +A GGC ++CDYRG+Y ++KC NCG P+QT YHVPRSWL+ S N+LV+FEE GG+P
Sbjct: 668 TSIAGNGGCTESCDYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDP 727
Query: 759 FEISVKLRST-RIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQ-DGYIIS 816
+IS + T +C VS+SH PPV W++ + + N+ P + L C +I
Sbjct: 728 TQISFATKQTGSNLCLTVSQSHPPPVDTWTSDSKISNR---NRTRPVLSLKCPISTQVIF 784
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
SI+FAS+GTP+G C F++G+C++ SLS+V +
Sbjct: 785 SIKFASFGTPKGTCGSFTQGHCNSSRSLSLVQK 817
>gi|6686888|emb|CAB64744.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 852
Score = 890 bits (2301), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 438/813 (53%), Positives = 560/813 (68%), Gaps = 35/813 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YDHRA++IDG R++LIS IHYPR+TPEMWP+LI KSK+GG DVIETYVFW+ HE
Sbjct: 31 NVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHEPE 90
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+ +YNF+G+ D+VKFVKL +GLY+ LRIGPYVCAEWN+GGFPVWL +PGI+FRT+N
Sbjct: 91 KNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTDNE 150
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFKEEMQRF KIVDLM++E L++ QGGPII+ QIENEYGN++S+YG K Y+KW+ASM
Sbjct: 151 PFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSASM 210
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
AL L GVPW MC+QTDAP+ +I+ CNG+YCD + PNS NKP +WTENW GW+ +G
Sbjct: 211 ALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGDPS 270
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RPVEDLAFAVARF+QRGG+F NYYMY GGTNF RTSGGP TSYDYDAPIDEYGLL
Sbjct: 271 PYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYGLLR 330
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHL+DLH AIKLCE AL+A D LG N EA VY+ +C+AFLAN+D
Sbjct: 331 QPKWGHLRDLHKAIKLCEDALIATDPT-ITSLGSNLEAAVYKTE----SGSCAAFLANVD 385
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ A+VTF G+SY LP WSVSILPDC+N FNTAK++S T + + +
Sbjct: 386 TKSDATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKINSATE------------STAFAR 433
Query: 466 QSMIESKLSSTS--KSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
QS+ SS W +KEPIG+ + F G+LE +N T D SDYLW+ + +
Sbjct: 434 QSLKPDGGSSAELGSQWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIK 493
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQ---PVEFQSGYNDL 580
D+ + + + I+S+ V+ FING+L GS GH + + P+ +G N +
Sbjct: 494 GDET--FLDEGSKAVLHIESLGQVVYAFINGKLAGS--GHGKQKISLDIPINLVTGTNTI 549
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG-DIDLSKILWTYQVGLKGEFQQIYS 639
LLS TVGL NYGAF + GAG G V L K G IDL+ WTYQVGLKGE + +
Sbjct: 550 DLLSVTVGLANYGAFFDLMGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLAT 609
Query: 640 IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW- 698
++ +EW + WYKT FDAP G +PVA+D GKG AWVNG IGRYW
Sbjct: 610 VD--SSEWVSKSPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWP 667
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
T +A GGC ++CDYRG+Y ++KC NCG P+QT YHVPRSWL+ S N+LV+FEE GG+P
Sbjct: 668 TSIAGNGGCTESCDYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDP 727
Query: 759 FEISVKLRST-RIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQ-DGYIIS 816
+IS + T +C VS+SH PPV W++ + + N+ P + L C +I
Sbjct: 728 TQISFATKQTGSNLCLTVSQSHPPPVDTWTSDSKISNR---NRTRPVLSLKCPISTQVIF 784
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
SI+FAS+GTP+G C F++G+C++ SLS+V +
Sbjct: 785 SIKFASFGTPKGTCGSFTQGHCNSSRSLSLVQK 817
>gi|334184536|ref|NP_001189624.1| beta-galactosidase 8 [Arabidopsis thaliana]
gi|330253034|gb|AEC08128.1| beta-galactosidase 8 [Arabidopsis thaliana]
Length = 846
Score = 890 bits (2300), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 438/813 (53%), Positives = 560/813 (68%), Gaps = 35/813 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YDHRA++IDG R++LIS IHYPR+TPEMWP+LI KSK+GG DVIETYVFW+ HE
Sbjct: 25 NVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHEPE 84
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+ +YNF+G+ D+VKFVKL +GLY+ LRIGPYVCAEWN+GGFPVWL +PGI+FRT+N
Sbjct: 85 KNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTDNE 144
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFKEEMQRF KIVDLM++E L++ QGGPII+ QIENEYGN++S+YG K Y+KW+ASM
Sbjct: 145 PFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSASM 204
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
AL L GVPW MC+QTDAP+ +I+ CNG+YCD + PNS NKP +WTENW GW+ +G
Sbjct: 205 ALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGDPS 264
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RPVEDLAFAVARF+QRGG+F NYYMY GGTNF RTSGGP TSYDYDAPIDEYGLL
Sbjct: 265 PYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYGLLR 324
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHL+DLH AIKLCE AL+A D LG N EA VY+ +C+AFLAN+D
Sbjct: 325 QPKWGHLRDLHKAIKLCEDALIATDPT-ITSLGSNLEAAVYKTE----SGSCAAFLANVD 379
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ A+VTF G+SY LP WSVSILPDC+N FNTAK++S T + + +
Sbjct: 380 TKSDATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKINSATE------------STAFAR 427
Query: 466 QSMIESKLSSTS--KSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
QS+ SS W +KEPIG+ + F G+LE +N T D SDYLW+ + +
Sbjct: 428 QSLKPDGGSSAELGSQWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIK 487
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQ---PVEFQSGYNDL 580
D+ + + + I+S+ V+ FING+L GS GH + + P+ +G N +
Sbjct: 488 GDET--FLDEGSKAVLHIESLGQVVYAFINGKLAGS--GHGKQKISLDIPINLVTGTNTI 543
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG-DIDLSKILWTYQVGLKGEFQQIYS 639
LLS TVGL NYGAF + GAG G V L K G IDL+ WTYQVGLKGE + +
Sbjct: 544 DLLSVTVGLANYGAFFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLAT 603
Query: 640 IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW- 698
++ +EW + WYKT FDAP G +PVA+D GKG AWVNG IGRYW
Sbjct: 604 VD--SSEWVSKSPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWP 661
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
T +A GGC ++CDYRG+Y ++KC NCG P+QT YHVPRSWL+ S N+LV+FEE GG+P
Sbjct: 662 TSIAGNGGCTESCDYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDP 721
Query: 759 FEISVKLRST-RIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQ-DGYIIS 816
+IS + T +C VS+SH PPV W++ + + N+ P + L C +I
Sbjct: 722 TQISFATKQTGSNLCLTVSQSHPPPVDTWTSDSKISNR---NRTRPVLSLKCPISTQVIF 778
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
SI+FAS+GTP+G C F++G+C++ SLS+V +
Sbjct: 779 SIKFASFGTPKGTCGSFTQGHCNSSRSLSLVQK 811
>gi|226503159|ref|NP_001146370.1| uncharacterized protein LOC100279948 precursor [Zea mays]
gi|219886857|gb|ACL53803.1| unknown [Zea mays]
gi|414865885|tpg|DAA44442.1| TPA: beta-galactosidase [Zea mays]
Length = 852
Score = 890 bits (2299), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/810 (53%), Positives = 561/810 (69%), Gaps = 27/810 (3%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YDHRA++IDG RR+L+S IHYPR+TP+MWP LI K+K+GG DVIETYVFW+ HE +
Sbjct: 29 NVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKAKDGGLDVIETYVFWDIHEPV 88
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
RGQY+F+G+ D+ FVK V +GLY+ LRIGPYVCAEWN+GGFP+WL IPGI+FRT+N
Sbjct: 89 RGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTDNE 148
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK EMQRF K+VD M+ L++ QGGPII+ QIENEYGN++S+YG GK Y++WAA M
Sbjct: 149 PFKAEMQRFTAKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAPGKAYMRWAAGM 208
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ L GVPWVMC+Q DAP+ +I+ CNG+YCD + PNS KP +WTENW GW+ ++GG +
Sbjct: 209 AVSLDTGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGWFLSFGGAV 268
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RPVEDLAFAVARF+QRGG+F NYYMY GGTN R+SGGPF TSYDYDAPIDEYGL+
Sbjct: 269 PYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAPIDEYGLVR 328
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHL+D+H AIKLCEPAL+A D + Y LG N EA VY+ S C+AFLANID
Sbjct: 329 QPKWGHLRDVHKAIKLCEPALIATDPS-YTSLGPNVEAAVYKVG-----SVCAAFLANID 382
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ +VTF G+ Y LP WSVSILPDC+N V NTA+++SQT+ + + L +
Sbjct: 383 GQSDKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQINSQTTGSEMRY---LESSNVASD 439
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
S + +L+ + W EP+G+ +N T G++E +N T D SD+LW+ T I V D
Sbjct: 440 GSFVTPELAVS--DWSYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITVKGD 497
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVV----QPVEFQSGYNDLI 581
+ N + + ++S+ VL+V+ING++ GS G + +P+E G N +
Sbjct: 498 EPYL---NGSQSNLAVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKID 554
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS TVGL NYGAF + GAG G VKL+G NG +DLS WTYQ+GL+GE +Y
Sbjct: 555 LLSATVGLSNYGAFFDLVGAGITGPVKLSGL-NGALDLSSAEWTYQIGLRGEDLHLYDPS 613
Query: 642 ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TV 700
E EW I WYKT F P G DPVA+D MGKG+AWVNG IGRYW T
Sbjct: 614 EASPEWVSANAYPINHPLIWYKTKFTPPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTN 673
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
+AP+ GC ++C+YRGAY+S KC CG P+QT YHVPRS+LQ +N LV+FE GG+P +
Sbjct: 674 LAPQSGCVNSCNYRGAYSSSKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEHFGGDPSK 733
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHC-QDGYIISSIE 819
IS +R T VC QVSE+H + WS+ + + + P + L C ++G +ISS++
Sbjct: 734 ISFVMRQTGSVCAQVSEAHPAQIDSWSS------QQPMQRYGPALRLECPKEGQVISSVK 787
Query: 820 FASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
FAS+GTP G C +S G C + +LS+V E
Sbjct: 788 FASFGTPSGTCGSYSHGECSSTQALSIVQE 817
>gi|115451981|ref|NP_001049591.1| Os03g0255100 [Oryza sativa Japonica Group]
gi|108707232|gb|ABF95027.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113548062|dbj|BAF11505.1| Os03g0255100 [Oryza sativa Japonica Group]
gi|215695246|dbj|BAG90437.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 956
Score = 889 bits (2298), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 426/810 (52%), Positives = 563/810 (69%), Gaps = 24/810 (2%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YDHRA++IDG RR+L+S IHYPR+TP+MWP LI KSK+GG DVIETYVFW+ HE++
Sbjct: 130 NVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFWDIHEAV 189
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
RGQY+F+G+ D+V+FVK V +GLY+ LRIGPYVCAEWN+GGFPVWL +PGI+FRT+N
Sbjct: 190 RGQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTDNE 249
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
FK EMQRF +K+VD M+ L++ QGGPII+ QIENEYGN++S+YG GK Y++WAA M
Sbjct: 250 AFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWAAGM 309
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ L GVPWVMC+Q+DAP+ +I+ CNG+YCD + PNS +KP +WTENW GW+ ++GG +
Sbjct: 310 AVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLSFGGAV 369
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RP EDLAFAVARF+QRGG+F NYYMY GGTNFGR++GGPF TSYDYDAPIDEYG++
Sbjct: 370 PYRPAEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEYGMVR 429
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHL+D+H AIKLCEPAL+AA+ + Y LGQN EA VY+ S C+AFLAN+D
Sbjct: 430 QPKWGHLRDVHKAIKLCEPALIAAEPS-YSSLGQNTEATVYQT---ADNSICAAFLANVD 485
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ +V F G +Y LP WSVSILPDC+N V NTA+++SQ + + L +I
Sbjct: 486 AQSDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMR---SLGSSIQDTD 542
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
S+I +L++ W EP+G+ EN T G++E +N T D SD+LW+ T I V D
Sbjct: 543 DSLITPELATA--GWSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVKGD 600
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQL----TGSVIGHWVKVVQPVEFQSGYNDLI 581
+ N + + ++S+ VL+++ING+L GS + + PV G N +
Sbjct: 601 EPYL---NGSQSNLLVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKID 657
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS TVGL NYGAF + GAG G VKL+G NG ++LS WTYQ+GL+GE +Y+
Sbjct: 658 LLSTTVGLSNYGAFFDLVGAGVTGPVKLSG-PNGALNLSSTDWTYQIGLRGEDLHLYNPS 716
Query: 642 ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TV 700
E EW WYKT F AP G DPVA+D MGKG+AWVNG IGRYW T
Sbjct: 717 EASPEWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTN 776
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
+AP+ GC ++C+YRGAY+S+KC CG P+QT YHVPRS+LQ +N LV+FE+ GG+P
Sbjct: 777 LAPQSGCVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSM 836
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHC-QDGYIISSIE 819
IS R T +C VSE H + W + + + P + L C ++G +IS+I+
Sbjct: 837 ISFTTRQTSSICAHVSEMHPAQIDSW-----ISPQQTSQTQGPALRLECPREGQVISNIK 891
Query: 820 FASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
FAS+GTP G C ++ G C + +L+VV E
Sbjct: 892 FASFGTPSGTCGNYNHGECSSSQALAVVQE 921
>gi|297822423|ref|XP_002879094.1| beta-glactosidase 8 [Arabidopsis lyrata subsp. lyrata]
gi|297324933|gb|EFH55353.1| beta-glactosidase 8 [Arabidopsis lyrata subsp. lyrata]
Length = 846
Score = 889 bits (2298), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 443/837 (52%), Positives = 571/837 (68%), Gaps = 41/837 (4%)
Query: 22 MMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDL 81
M M++++ L + +++A NV+YDHRA++IDG R++LIS IHYPR+TPEMWP+L
Sbjct: 7 MEMILLLILQIMMAATA------VNVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPEL 60
Query: 82 IAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCA 141
I KSK+GG DVIETYVFW+ HE + +YNF+G+ D+VKFVKLV +GLY+ LRIGPYVCA
Sbjct: 61 IKKSKDGGLDVIETYVFWSGHEPEKNKYNFEGRYDLVKFVKLVEEAGLYVHLRIGPYVCA 120
Query: 142 EWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIE 201
EWN+GGFPVWL +PGI+FRT+N PFKEEMQRF KIVDLM++E L++ QGGPII+ QIE
Sbjct: 121 EWNYGGFPVWLHFVPGIKFRTDNEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIE 180
Query: 202 NEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKP 261
NEYGN++S+YG K Y+KW+ASMAL L GVPW MC+Q DAP+ +I+ CNG+YCD + P
Sbjct: 181 NEYGNIDSAYGAAAKIYIKWSASMALSLDTGVPWNMCQQADAPDPMINTCNGFYCDQFTP 240
Query: 262 NSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGR 321
NS +KP +WTENW GW+ +G P+RPVEDLAFAVARF+QRGG+F NYYMY GGTNF R
Sbjct: 241 NSNSKPKMWTENWSGWFLGFGDPSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDR 300
Query: 322 TSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQ 381
TSGGP TSYDYDAPIDEYGLL +PKWGHL+DLH AIKLCE AL+A D LG N
Sbjct: 301 TSGGPLISTSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKLCEDALIATDPT-ISSLGSNL 359
Query: 382 EAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAK 441
EA VY+ + +C+AFLAN+ + A+V+F G+SY LP WSVSILPDC+N FNTAK
Sbjct: 360 EAAVYKT----ASGSCAAFLANVGTKSDATVSFNGESYHLPAWSVSILPDCKNVAFNTAK 415
Query: 442 VSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTS--KSWMTVKEPIGVWSENNFTVQG 499
++S T + +QS+ SS W +KEPIG+ + F G
Sbjct: 416 INSATE------------PTAFARQSLKPDGGSSAELGSEWSYIKEPIGISKADAFLKPG 463
Query: 500 ILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGS 559
+LE +N T D SDYLW+ ++ + D+ + + + I+S+ V+ FING+L GS
Sbjct: 464 LLEQINTTADKSDYLWYSLRMDIKGDET--FLDEGSKAVLHIESLGQVVYAFINGKLAGS 521
Query: 560 VIGHWVKVVQ---PVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG- 615
GH + + P+ +G N + LLS TVGL NYGAF + GAG G V L K G
Sbjct: 522 --GHGKQKISLDIPINLAAGKNTVDLLSVTVGLANYGAFFDLVGAGITGPVTLKSAKGGS 579
Query: 616 DIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPV 675
IDL+ WTYQVGLKGE + +++ +EW + WYKT FDAP G +PV
Sbjct: 580 SIDLASQQWTYQVGLKGEDTGLATVD--SSEWVSKSPLPTKQPLIWYKTTFDAPSGSEPV 637
Query: 676 ALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWY 734
A+D GKG AWVNG IGRYW T +A GGC D+CDYRG+Y ++KC NCG P+QT Y
Sbjct: 638 AIDFTGTGKGIAWVNGQSIGRYWPTSIAGNGGCTDSCDYRGSYRANKCLKNCGKPSQTLY 697
Query: 735 HVPRSWLQASNNLLVIFEETGGNPFEISVKLRST-RIVCEQVSESHYPPVRKWSNSYSVD 793
HVPRSWL+ S N LV+FEE GG+P +IS + T +C VS+SH PPV W++ +
Sbjct: 698 HVPRSWLKPSGNTLVLFEEMGGDPTQISFGTKQTGSNLCLMVSQSHPPPVDTWTSDSKIS 757
Query: 794 GKLSINKMAPEMHLHCQ-DGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+ N+ P + L C +ISSI+FAS+GTPQG C F+ G+C++ SLSVV +
Sbjct: 758 NR---NRTRPVLSLKCPVSTQVISSIKFASFGTPQGTCGSFTHGHCNSSRSLSVVQK 811
>gi|359478691|ref|XP_002285084.2| PREDICTED: beta-galactosidase 8-like [Vitis vinifera]
gi|297746241|emb|CBI16297.3| unnamed protein product [Vitis vinifera]
Length = 846
Score = 889 bits (2297), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 442/812 (54%), Positives = 551/812 (67%), Gaps = 35/812 (4%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YDHRA++IDG RR+LIS IHYPR+TP+MWPDLI KSK+GG DVIETYVFWN HE +R
Sbjct: 26 VTYDHRALVIDGKRRVLISGSIHYPRSTPDMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 85
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
QY+FKG+ND+VKFVK V +GLY+ LRIGPYVCAEWN+GGFP+WL IPGI+FRT+N P
Sbjct: 86 RQYDFKGRNDLVKFVKTVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIQFRTDNGP 145
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FKEEMQ F KIVD+M++E L++ QGGPII+ QIENEYGN++S+YG K Y++WAASMA
Sbjct: 146 FKEEMQIFTAKIVDMMKKENLYASQGGPIILSQIENEYGNIDSAYGSAAKSYIQWAASMA 205
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
L GVPWVMC+Q DAP+ +I+ CNG+YCD + PNS KP +WTENW GW+ ++GG +P
Sbjct: 206 TSLDTGVPWVMCQQADAPDPMINTCNGFYCDQFTPNSVKKPKMWTENWTGWFLSFGGAVP 265
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
+RPVED+AFAVARFFQ GG+F NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGLL +
Sbjct: 266 YRPVEDIAFAVARFFQLGGTFQNYYMYHGGTNFGRTTGGPFIATSYDYDAPIDEYGLLRQ 325
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHLKDLH AIKLCE AL+A D LG N EA VY+ +C+AFLAN+
Sbjct: 326 PKWGHLKDLHKAIKLCEAALIATDPT-ITSLGTNLEASVYKTG----TGSCAAFLANVRT 380
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
++ A+V F G SY LP WSVSILPDC+N NTA+++ S+ + P QQ
Sbjct: 381 NSDATVNFSGNSYHLPAWSVSILPDCKNVALNTAQIN----------SMAVMPRFM--QQ 428
Query: 467 SMIESKLSSTS--KSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
S+ SS W V EP+G+ N FT G+LE +N+T D SDYLW+ +
Sbjct: 429 SLKNDIDSSDGFQSGWSWVDEPVGISKNNAFTKLGLLEQINITADKSDYLWYSLSTEIQG 488
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDL 580
D+ ++ + ++S+ L FING+L GS G+ V V PV G N +
Sbjct: 489 DEPFLEDGSQT--VLHVESLGHALHAFINGKLAGSGTGNSGNAKVTVDIPVTLIHGKNTI 546
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG-DIDLSKILWTYQVGLKGEFQQIYS 639
LLS TVGLQNYGAF +K GAG G +KL G NG +DLS WTYQVGL+GE + S
Sbjct: 547 DLLSLTVGLQNYGAFYDKQGAGITGPIKLKGLANGTTVDLSSQQWTYQVGLQGEELGLPS 606
Query: 640 IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
+ ++W + WYKT FDAP G DPVALD MGKG+AWVNG IGRYW
Sbjct: 607 --GSSSKWVAGSTLPKKQPLIWYKTTFDAPAGNDPVALDFMGMGKGEAWVNGQSIGRYWP 664
Query: 700 V-VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
V+ GGC +C+YRG Y+S+KC NCG P+Q YHVPRSWLQ S N LV+FEE GG+P
Sbjct: 665 AYVSSNGGCTSSCNYRGPYSSNKCLKNCGKPSQQLYHVPRSWLQPSGNTLVLFEEIGGDP 724
Query: 759 FEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQ-DGYIISS 817
+IS + +C +VSE H PV W + + K +P + L C +ISS
Sbjct: 725 TQISFATKQVESLCSRVSEYHPLPVDMWGSDLTTG-----RKSSPMLSLECPFPNQVISS 779
Query: 818 IEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
I+FAS+GTP+G C FS C + +LS+V E
Sbjct: 780 IKFASFGTPRGTCGSFSHSKCSSRTALSIVQE 811
>gi|152013362|sp|Q10NX8.2|BGAL6_ORYSJ RecName: Full=Beta-galactosidase 6; Short=Lactase 6; Flags:
Precursor
Length = 858
Score = 889 bits (2297), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 426/810 (52%), Positives = 563/810 (69%), Gaps = 24/810 (2%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YDHRA++IDG RR+L+S IHYPR+TP+MWP LI KSK+GG DVIETYVFW+ HE++
Sbjct: 32 NVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFWDIHEAV 91
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
RGQY+F+G+ D+V+FVK V +GLY+ LRIGPYVCAEWN+GGFPVWL +PGI+FRT+N
Sbjct: 92 RGQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTDNE 151
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
FK EMQRF +K+VD M+ L++ QGGPII+ QIENEYGN++S+YG GK Y++WAA M
Sbjct: 152 AFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWAAGM 211
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ L GVPWVMC+Q+DAP+ +I+ CNG+YCD + PNS +KP +WTENW GW+ ++GG +
Sbjct: 212 AVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLSFGGAV 271
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RP EDLAFAVARF+QRGG+F NYYMY GGTNFGR++GGPF TSYDYDAPIDEYG++
Sbjct: 272 PYRPAEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEYGMVR 331
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHL+D+H AIKLCEPAL+AA+ + Y LGQN EA VY+ S C+AFLAN+D
Sbjct: 332 QPKWGHLRDVHKAIKLCEPALIAAEPS-YSSLGQNTEATVYQT---ADNSICAAFLANVD 387
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ +V F G +Y LP WSVSILPDC+N V NTA+++SQ + + L +I
Sbjct: 388 AQSDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMR---SLGSSIQDTD 444
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
S+I +L++ W EP+G+ EN T G++E +N T D SD+LW+ T I V D
Sbjct: 445 DSLITPELATA--GWSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVKGD 502
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQL----TGSVIGHWVKVVQPVEFQSGYNDLI 581
+ N + + ++S+ VL+++ING+L GS + + PV G N +
Sbjct: 503 EPYL---NGSQSNLLVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKID 559
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS TVGL NYGAF + GAG G VKL+G NG ++LS WTYQ+GL+GE +Y+
Sbjct: 560 LLSTTVGLSNYGAFFDLVGAGVTGPVKLSG-PNGALNLSSTDWTYQIGLRGEDLHLYNPS 618
Query: 642 ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TV 700
E EW WYKT F AP G DPVA+D MGKG+AWVNG IGRYW T
Sbjct: 619 EASPEWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTN 678
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
+AP+ GC ++C+YRGAY+S+KC CG P+QT YHVPRS+LQ +N LV+FE+ GG+P
Sbjct: 679 LAPQSGCVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSM 738
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHC-QDGYIISSIE 819
IS R T +C VSE H + W + + + P + L C ++G +IS+I+
Sbjct: 739 ISFTTRQTSSICAHVSEMHPAQIDSW-----ISPQQTSQTQGPALRLECPREGQVISNIK 793
Query: 820 FASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
FAS+GTP G C ++ G C + +L+VV E
Sbjct: 794 FASFGTPSGTCGNYNHGECSSSQALAVVQE 823
>gi|357453873|ref|XP_003597217.1| Beta-galactosidase [Medicago truncatula]
gi|355486265|gb|AES67468.1| Beta-galactosidase [Medicago truncatula]
Length = 833
Score = 889 bits (2296), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 439/812 (54%), Positives = 552/812 (67%), Gaps = 42/812 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV YDHRA++IDG RR+LIS IHYPR+TP+MWPDLI KSK+GG DVIETYVFWN HE +
Sbjct: 21 NVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSKDGGLDVIETYVFWNLHEPV 80
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+GQY+F G+ D+VKFVK V +GLY+ LRIGPYVCAEWN+GGFP+WL IPGI+FRT+N
Sbjct: 81 KGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTDNE 140
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK EM+RF KIVDLM++E L++ QGGPII+ QIENEYGN++S YG GK Y+ WAA M
Sbjct: 141 PFKAEMKRFTAKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSHYGSAGKSYINWAAKM 200
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A L GVPWVMC+Q DAP+ II+ CNG+YCD + PNS KP +WTENW GW+ ++GG +
Sbjct: 201 ATSLDTGVPWVMCQQGDAPDPIINTCNGFYCDQFTPNSNTKPKMWTENWSGWFLSFGGAV 260
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRPVEDLAFAVARFFQRGG+F NYYMY GGTNF R++GGPF TSYDYDAPIDEYG++
Sbjct: 261 PHRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRSTGGPFIATSYDYDAPIDEYGIIR 320
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+ KWGHLKD+H AIKLCE AL+A D + LGQN EA VY+ + S C+AFLAN+D
Sbjct: 321 QQKWGHLKDVHKAIKLCEEALIATD-PKISSLGQNLEAAVYK-----TGSVCAAFLANVD 374
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+V F G SY LP WSVSILPDC+N V NTAK++S ++I ++ +I
Sbjct: 375 TKNDKTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKINSASAISNF-----VTEDI---- 425
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
S L ++S W + EP+G+ ++ + G+LE +N T D SDYLW+ + ++DD
Sbjct: 426 -----SSLETSSSKWSWINEPVGISKDDILSKTGLLEQINTTADRSDYLWYSLSLDLADD 480
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYNDLI 581
S + + I+S+ L FING+L G+ G+ K V P+ SG N +
Sbjct: 481 PGS-------QTVLHIESLGHALHAFINGKLAGNQAGNSDKSKLNVDIPIALVSGKNKID 533
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGD--IDLSKILWTYQVGLKGEFQQIYS 639
LLS TVGLQNYGAF + GAG G V L G KNG+ +DLS WTYQ+GLKGE +
Sbjct: 534 LLSLTVGLQNYGAFFDTVGAGITGPVILKGLKNGNNTLDLSSRKWTYQIGLKGE--DLGL 591
Query: 640 IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW- 698
+ W + WYKT FDAP G +PVA+D MGKG+AWVNG IGRYW
Sbjct: 592 SSGSSGGWNSQSTYPKNQPLVWYKTNFDAPSGSNPVAIDFTGMGKGEAWVNGQSIGRYWP 651
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
T VA GC D+C+YRG Y S KC NCG P+QT YHVPRS+L+ + N LV+FEE GG+P
Sbjct: 652 TYVASNAGCTDSCNYRGPYTSSKCRKNCGKPSQTLYHVPRSFLKPNGNTLVLFEENGGDP 711
Query: 759 FEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQD-GYIISS 817
+IS + VC VS+SH P + W+ GK+ P + L C + +ISS
Sbjct: 712 TQISFATKQLESVCSHVSDSHPPQIDLWNQDTESGGKV-----GPALLLSCPNHNQVISS 766
Query: 818 IEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
I+FASYGTP G C F RG C + +LS+V +
Sbjct: 767 IKFASYGTPLGTCGNFYRGRCSSNKALSIVKK 798
>gi|357113057|ref|XP_003558321.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 6-like
[Brachypodium distachyon]
Length = 852
Score = 888 bits (2294), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 421/810 (51%), Positives = 557/810 (68%), Gaps = 26/810 (3%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YDHRA++IDG RR+L+S IHYPR+TP+MWP L+ K+K+GG DV+ETYVFW+ HE+
Sbjct: 28 NVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLMQKAKDGGLDVVETYVFWDIHETA 87
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
QY+F+G+ D+V+FVK +GLY+ LRIGPYVCAEWN+GGFP+WL IPGI+FRT+N
Sbjct: 88 TXQYDFEGRKDLVRFVKAAADTGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTDNE 147
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK EMQRF +K+V M+ L++ QGGPII+ QIENEYGN++S+YG GK Y++WAA M
Sbjct: 148 PFKTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIRWAAGM 207
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ L GVPWVMC+Q DAP+ +I+ CNG+YCD + PNS +KP LWTENW GW+ ++GG +
Sbjct: 208 AVALDTGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSNSKPKLWTENWSGWFLSFGGAV 267
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RP EDLAFAVARF+QRGG+ NYYMY GGTNFGR+SGGPF TSYDYDAPIDEYGL+
Sbjct: 268 PYRPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDEYGLVR 327
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLKD+H AIK CEPAL+A D + Y+ +GQN EAHVY+A S C+AFLAN+D
Sbjct: 328 QPKWGHLKDVHKAIKQCEPALIATDPS-YMSMGQNAEAHVYKAG-----SVCAAFLANMD 381
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ +VTF G +Y LP WSVSILPDC+N V NTA+++SQT+ + L +
Sbjct: 382 TQSDKTVTFNGNAYKLPAWSVSILPDCKNVVLNTAQINSQTTTSEMR---SLGSSTKASD 438
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
S IE++L+ + W EP+G+ +EN T G++E +N T D SD+LW+ T + V
Sbjct: 439 GSSIETELALS--GWSYAIEPVGITTENALTKPGLMEQINTTADASDFLWYSTSVVVKGG 496
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLI 581
+ N + + ++S+ VL+ +ING+ GS G + + P+ G N +
Sbjct: 497 EPYL---NGSQSNLLVNSLGHVLQAYINGKFAGSAKGSATSSLISLQTPITLVPGKNKID 553
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS TVGL NYGAF + GAG G VKL+G K G +DLS WTYQVGL+GE +Y+
Sbjct: 554 LLSGTVGLSNYGAFFDLVGAGITGPVKLSGPK-GVLDLSSTDWTYQVGLRGEGLHLYNPS 612
Query: 642 ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TV 700
E EW WYK+ F P G DPVA+D MGKG+AWVNG IGRYW T
Sbjct: 613 EASPEWVSDKAYPTNQPLIWYKSKFTTPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTN 672
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
+AP+ GC ++C+YRG Y+S KC CG P+QT YHVPRS+LQ +N +V+FE+ GG+P +
Sbjct: 673 LAPQSGCVNSCNYRGPYSSSKCLKKCGQPSQTLYHVPRSFLQPGSNDIVLFEQFGGDPSK 732
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHC-QDGYIISSIE 819
IS + T VC VSE H + W + + + + P + L C + G +ISSI+
Sbjct: 733 ISFTTKQTASVCAHVSEDHPDQIDSW-----ISPQQKVQRSGPALRLECPKAGQVISSIK 787
Query: 820 FASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
FAS+GTP G C ++ G C +P +L+V E
Sbjct: 788 FASFGTPSGTCGNYNHGECSSPQALAVAQE 817
>gi|61162203|dbj|BAD91083.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 842
Score = 887 bits (2293), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 441/813 (54%), Positives = 549/813 (67%), Gaps = 37/813 (4%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YDHRA++IDG RR+L+S IHYPR+TPEMWPDLI KSK+GG DVIETYVFWN HE++R
Sbjct: 22 VTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEAVR 81
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY+F G+ D+VKFVK V +GLY+ LRIGPYVCAEWN+GGFP+WL IPGI+ RT+N P
Sbjct: 82 GQYDFGGRKDLVKFVKTVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIQLRTDNEP 141
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK EMQRF KIVD+M++E L++ QGGPII+ QIENEYGN++ +YG + Y+KWAA MA
Sbjct: 142 FKAEMQRFTAKIVDMMKKEKLYASQGGPIILSQIENEYGNIDRAYGAAAQTYIKWAADMA 201
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPN-SYNKPTLWTENWDGWYTTWGGRL 285
+ L GVPWVMC+Q DAP ++I CNG+YCD + P +P +WTENW GW+ ++GG +
Sbjct: 202 VSLDTGVPWVMCQQDDAPPSVISTCNGFYCDQWTPRLPEKRPKMWTENWSGWFLSFGGAV 261
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P RPVEDLAFAVARFFQRGG+F NYYMY GGTNFGR++GGPF TSYDYDAPIDEYGLL
Sbjct: 262 PQRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEYGLLR 321
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLKD+H AIKLCE A+VA D +Y G N EA VY+ S C+AFLAN D
Sbjct: 322 QPKWGHLKDVHKAIKLCEEAMVATD-PKYSSFGPNVEATVYKTG-----SACAAFLANSD 375
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ A+VTF G SY LP WSVSILPDC+N V NTAK++S I S
Sbjct: 376 TKSDATVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSAAMIP------------SFMH 423
Query: 466 QSMIESKLSSTS--KSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
S+++ SS + W + EP+G+ ++ FT G+LE +N T D SDYLW+ I V+
Sbjct: 424 HSVLDDIDSSEALGSGWSWINEPVGISKKDAFTRVGLLEQINTTADKSDYLWYSLSIDVT 483
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVI----GHWVKVVQPVEFQSGYND 579
D ++ + ++S+ L FING+ G I + V PV F SG N
Sbjct: 484 SSDTFLQDGSQT--ILHVESLGHALHAFINGKPAGRGIITANNGKISVDIPVTFASGKNT 541
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG-DIDLSKILWTYQVGLKGEFQQIY 638
+ LLS T+GLQNYGAF +K GAG G V+L G KNG DLS WTYQ+GL+GE
Sbjct: 542 IDLLSLTIGLQNYGAFFDKSGAGITGPVQLKGLKNGTTTDLSSQRWTYQIGLQGE--DSG 599
Query: 639 SIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
+ ++W TWYK F+APDG +PVALD MGKG+AWVNG IGRYW
Sbjct: 600 FSSGSSSQWISQPTLPKKQPLTWYKATFNAPDGSNPVALDFTGMGKGEAWVNGQSIGRYW 659
Query: 699 -TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
T AP GC D+C++RG Y+S+KC NCG P+Q YHVPRSWL+ S N LV+FEE GG+
Sbjct: 660 PTNNAPTSGCPDSCNFRGPYDSNKCRKNCGKPSQELYHVPRSWLKPSGNTLVLFEEIGGD 719
Query: 758 PFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQ-DGYIIS 816
P +IS R +C VSESH PV WS+ D K K+ P + L C +IS
Sbjct: 720 PTQISFATRQIESLCSHVSESHPSPVDTWSS----DSKAG-RKLGPVLSLECPFPNQVIS 774
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
SI+FASYG PQG C FS G C + +LS+V +
Sbjct: 775 SIKFASYGKPQGTCGSFSHGQCKSTSALSIVQK 807
>gi|356539132|ref|XP_003538054.1| PREDICTED: beta-galactosidase 8-like [Glycine max]
Length = 836
Score = 887 bits (2293), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 447/835 (53%), Positives = 565/835 (67%), Gaps = 47/835 (5%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
++++++ C+ + S+ NV+YDHRA++IDG RR+L+S IHYPR+TPEMWPDLI
Sbjct: 6 ILLVLLWFFCIYAPSSFGA----NVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLI 61
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
KSK+GG DVIETYVFWN HE +RGQYNF+G+ D+VKFVK+V ++GLY+ LRIGPY CAE
Sbjct: 62 QKSKDGGLDVIETYVFWNLHEPVRGQYNFEGRGDLVKFVKVVAAAGLYVHLRIGPYACAE 121
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
WN+GGFP+WL IPGI+FRT+N PF+ EM++F KIVDLM++E L++ QGGPII+ QIEN
Sbjct: 122 WNYGGFPLWLHFIPGIQFRTDNKPFEAEMKQFTAKIVDLMKQENLYASQGGPIILSQIEN 181
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPN 262
EYGN+E+ YG K Y+KWAASMA LG GVPWVMC+Q +AP+ II+ACNG+YCD +KPN
Sbjct: 182 EYGNIEADYGPAAKSYIKWAASMATSLGTGVPWVMCQQQNAPDPIINACNGFYCDQFKPN 241
Query: 263 SYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRT 322
S KP +WTE + GW+ +G +PHRPVEDLAFAVARF+QRGG+F NYYMY GGTNFGR
Sbjct: 242 SNTKPKIWTEGYTGWFLAFGDAVPHRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRA 301
Query: 323 SGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQE 382
SGGPF +SYDYDAPIDEYG + +PKWGHLKD+H AIKLCE AL+A D LG N E
Sbjct: 302 SGGPFVASSYDYDAPIDEYGFIRQPKWGHLKDVHKAIKLCEEALIATDPT-ITSLGPNIE 360
Query: 383 AHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKV 442
A VY+ C+AFLANI + A+VTF G SY LP WSVSILPDC+N V NTAK+
Sbjct: 361 AAVYKTGVV-----CAAFLANI-ATSDATVTFNGNSYHLPAWSVSILPDCKNVVLNTAKI 414
Query: 443 SSQTSIKTVEFSLPLSPNISVPQQSMIE-SKLSSTSKSWMTVKEPIGVWSENNFTVQGIL 501
+S + I S +S+ + L + W + EPIG+ ++F+ G+L
Sbjct: 415 TSASMIS------------SFTTESLKDVGSLDDSGSRWSWISEPIGISKADSFSTFGLL 462
Query: 502 EHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVI 561
E +N T D SDYLW+ I + +F + I S+ L FING+L GS
Sbjct: 463 EQINTTADRSDYLWYSLSIDLDAGAQTF---------LHIKSLGHALHAFINGKLAGSGT 513
Query: 562 GHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG-D 616
G+ V+V P+ SG N + LLS TVGLQNYGAF + GAG G V L KNG +
Sbjct: 514 GNHEKANVEVDIPITLVSGKNTIDLLSLTVGLQNYGAFFDTWGAGITGPVILKCLKNGSN 573
Query: 617 IDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVA 676
+DLS WTYQVGLK E + S +W + TWYKT F AP G +PVA
Sbjct: 574 VDLSSKQWTYQVGLKNEDLGLSS--GCSGQWNSQSTLPTNQPLTWYKTNFVAPSGNNPVA 631
Query: 677 LDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYH 735
+D MGKG+AWVNG IGRYW T +PKGGC D+C+YRGAY++ KC NCG P+QT YH
Sbjct: 632 IDFTGMGKGEAWVNGQSIGRYWPTYASPKGGCTDSCNYRGAYDASKCLKNCGKPSQTLYH 691
Query: 736 VPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGK 795
VPRSWL+ N LV+FEE+GGNP +IS + VC VSESH PPV W NS + G+
Sbjct: 692 VPRSWLRPDRNTLVLFEESGGNPKQISFATKQIGSVCSHVSESHPPPVDSW-NSNTESGR 750
Query: 796 LSINKMAPEMHLHCQ-DGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
K+ P + L C ++SSI+FAS+GTP G C F G C + +LS+V +
Sbjct: 751 ----KVVPVVSLECPYPNQVVSSIKFASFGTPLGTCGNFKHGLCSSNKALSIVQK 801
>gi|356550173|ref|XP_003543463.1| PREDICTED: beta-galactosidase 8-like isoform 2 [Glycine max]
Length = 830
Score = 887 bits (2291), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 454/834 (54%), Positives = 562/834 (67%), Gaps = 51/834 (6%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
+++++ L C+ S T F NV YDHRA++IDG RR+LIS IHYPR+TPEMWPDLI
Sbjct: 6 IVLVLFWLLCIHSP---TLFCA-NVEYDHRALVIDGKRRVLISGSIHYPRSTPEMWPDLI 61
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
KSK+GG DVIETYVFWN +E +RGQY+F G+ D+VKFVK V ++GLY+ LRIGPYVCAE
Sbjct: 62 QKSKDGGLDVIETYVFWNLNEPVRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAE 121
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
WN+GGFP+WL IPGI+FRT+N PFK EM+RF KIVD+++EE L++ QGGP+I+ QIEN
Sbjct: 122 WNYGGFPLWLHFIPGIKFRTDNEPFKAEMKRFTAKIVDMIKEENLYASQGGPVILSQIEN 181
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPN 262
EYGN++S+YG GK Y+KWAA+MA L GVPWVMC+Q DAP+ II+ CNG+YCD + PN
Sbjct: 182 EYGNIDSAYGAAGKSYIKWAATMATSLDTGVPWVMCQQADAPDPIINTCNGFYCDQFTPN 241
Query: 263 SYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRT 322
S KP +WTENW GW+ +GG +P+RPVEDLAFAVARFFQRGG+F NYYMY GGTNF RT
Sbjct: 242 SNTKPKMWTENWSGWFLPFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRT 301
Query: 323 SGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQE 382
SGGPF TSYDYDAPIDEYG++ +PKWGHLK++H AIKLCE AL+A D LG N E
Sbjct: 302 SGGPFIATSYDYDAPIDEYGIIRQPKWGHLKEVHKAIKLCEEALIATDPT-ITSLGPNLE 360
Query: 383 AHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKV 442
A VY+ + S C+AFLAN+D + +V F G SY LP WSVSILPDC+N V NTAKV
Sbjct: 361 AAVYK-----TGSVCAAFLANVDTKSDVTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKV 415
Query: 443 SSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILE 502
I SM SST SW + EP+G+ ++F G+LE
Sbjct: 416 CLTNFI------------------SMFMWLPSSTGWSW--ISEPVGISKADSFPQTGLLE 455
Query: 503 HLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG 562
+N T D SDYLW+ I D S + + I+S+ L FING+L GS G
Sbjct: 456 QINTTADKSDYLWYSLSIDYKGDAGS-------QTVLHIESLGHALHAFINGKLAGSQTG 508
Query: 563 HWVK----VVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGD-I 617
+ K V PV +G N + LLS TVGLQNYGAF + GAG G V L G NG+ +
Sbjct: 509 NSGKYKFTVDIPVTLVAGKNTIDLLSLTVGLQNYGAFFDTWGAGITGPVILKGLANGNTL 568
Query: 618 DLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVAL 677
DLS WTYQVGLKGE + S + +W + WYKT F AP G DPVA+
Sbjct: 569 DLSYQKWTYQVGLKGEDLGLSS--GSSGQWNSQSTFPKNQPLIWYKTTFAAPSGSDPVAI 626
Query: 678 DLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHV 736
D MGKG+AWVNG IGRYW T VA GC D+C+YRG Y++ KC NCG P+QT YHV
Sbjct: 627 DFTGMGKGEAWVNGQSIGRYWPTYVASDAGCTDSCNYRGPYSASKCRRNCGKPSQTLYHV 686
Query: 737 PRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKL 796
PRSWL+ S N+LV+FEE GG+P +IS + T +C VS+SH PPV W NS + G+
Sbjct: 687 PRSWLKPSGNILVLFEEKGGDPTQISFVTKQTESLCAHVSDSHPPPVDLW-NSDTESGR- 744
Query: 797 SINKMAPEMHLHC-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
K+ P + L C D +ISSI+FASYGTP G C F G C + +LS+V +
Sbjct: 745 ---KVGPVLSLTCPHDNQVISSIKFASYGTPLGTCGNFYHGRCSSNKALSIVQK 795
>gi|385203117|gb|ADO34790.3| beta-galactosidase STBG5 [Solanum lycopersicum]
Length = 852
Score = 886 bits (2290), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 436/846 (51%), Positives = 571/846 (67%), Gaps = 45/846 (5%)
Query: 13 CLALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPR 72
CL++ +M++ ++ L C+ +S + NV+YDHRA+++DG RR+LIS IHYPR
Sbjct: 8 CLSV----IMLVFGVVFLHCLVMTSFAA-----NVTYDHRALVVDGRRRVLISGSIHYPR 58
Query: 73 ATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQ 132
+TP+MWPDLI KSK+GG DVIETYVFWN HE +R QY+F+G+ D++ FVKLV +GL++
Sbjct: 59 STPDMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYDFEGRKDLINFVKLVEKAGLFVH 118
Query: 133 LRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQG 192
+RIGPYVCAEWN+GGFP+WL IPGIEFRT+N PFK EM+RF KIVD++++E L++ QG
Sbjct: 119 IRIGPYVCAEWNYGGFPLWLHFIPGIEFRTDNEPFKAEMKRFTAKIVDMIKQENLYASQG 178
Query: 193 GPIIMLQIENEYGN--MESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDA 250
GP+I+ QIENEYGN +ES YG + K YV WAASMA L GVPWVMC+Q DAP ++I+
Sbjct: 179 GPVILSQIENEYGNGDIESRYGPRAKPYVNWAASMATSLNTGVPWVMCQQPDAPPSVINT 238
Query: 251 CNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNY 310
CNG+YCD +K NS P +WTENW GW+ ++GG +P+RPVED+AFAVARFFQRGG+F NY
Sbjct: 239 CNGFYCDQFKQNSDKTPKMWTENWTGWFLSFGGPVPYRPVEDIAFAVARFFQRGGTFQNY 298
Query: 311 YMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAAD 370
YMY GGTNFGRTSGGPF TSYDYDAP+DEYGL+++PKWGHLKDLH AIKLCE A+VA +
Sbjct: 299 YMYHGGTNFGRTSGGPFIATSYDYDAPLDEYGLINQPKWGHLKDLHKAIKLCEAAMVATE 358
Query: 371 SAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILP 430
LG N E VY+ + S C+AFLAN + A+V+F G SY LPPWSVSILP
Sbjct: 359 -PNITSLGSNIEVSVYKTD-----SQCAAFLANTATQSDAAVSFNGNSYHLPPWSVSILP 412
Query: 431 DCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVW 490
DC+N F+TAK++S ++I T V + S ++ S S W +V EP+G+
Sbjct: 413 DCKNVAFSTAKINSASTISTF-----------VTRSSEADASGGSLS-GWTSVNEPVGIS 460
Query: 491 SENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRV 550
+EN FT G+LE +N T D SDYLW+ + + +D+ F + + + ++ VL
Sbjct: 461 NENAFTRMGLLEQINTTADKSDYLWYSLSVNIKNDE-PFLQDGSAT-VLHVKTLGHVLHA 518
Query: 551 FINGQLTGSVIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQ 606
+ING+L+GS G+ + PV G N + LLS TVGLQNYGAF + GAG G
Sbjct: 519 YINGKLSGSGKGNSRHSNFTIEVPVTLVPGENKIDLLSATVGLQNYGAFFDLKGAGITGP 578
Query: 607 VKLTGFKNGD-IDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTY 665
V+L GFKNG DLS WTYQVGLKGE + W T WYK
Sbjct: 579 VQLKGFKNGSTTDLSSKQWTYQVGLKGE--DLGLSNGGSTLWKSQTALPTNQPLIWYKAS 636
Query: 666 FDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV-VAPKGGCQDTCDYRGAYNSDKCTT 724
FDAP G P+++D MGKG+AWVNG IGR+W +AP GC D C+YRG YN++KC
Sbjct: 637 FDAPAGDTPLSMDFTGMGKGEAWVNGQSIGRFWPAYIAPNDGCTDPCNYRGGYNAEKCLK 696
Query: 725 NCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVR 784
NCG P+Q YHVPRSWL++S N+LV+FEE GG+P ++S R + VC ++S++H P+
Sbjct: 697 NCGKPSQLLYHVPRSWLKSSGNVLVLFEEMGGDPTKLSFATREIQSVCSRISDAHPLPID 756
Query: 785 KWSNSYSVDGKLSINKMAPEMHLHC-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMS 843
W++ K P + L C +ISSI+FAS+GTPQG C F G C + +
Sbjct: 757 MWASEDDAR-----KKSGPTLSLECPHPNQVISSIKFASFGTPQGTCGSFIHGRCSSSNA 811
Query: 844 LSVVSE 849
LS+V +
Sbjct: 812 LSIVKK 817
>gi|224106752|ref|XP_002314274.1| predicted protein [Populus trichocarpa]
gi|222850682|gb|EEE88229.1| predicted protein [Populus trichocarpa]
Length = 849
Score = 886 bits (2289), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 440/815 (53%), Positives = 559/815 (68%), Gaps = 42/815 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YDHRA++IDG RR+L+S IHYPR+T EMW DLI KSK+GG DVIETYVFWNAHE +
Sbjct: 31 NVTYDHRALLIDGKRRVLVSGSIHYPRSTVEMWADLIQKSKDGGLDVIETYVFWNAHEPV 90
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+ QYNF+G+ D+VKF+KLVG +GLY LRIGPYVCAEWN+GGFP+WL +PGI+FRT+N
Sbjct: 91 QNQYNFEGRYDLVKFIKLVGEAGLYAHLRIGPYVCAEWNYGGFPLWLHFVPGIKFRTDNE 150
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK EMQRF KIVD+M++E L++ QGGPII+ QIENEYGN++SSYG K Y+ WAASM
Sbjct: 151 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSSYGPAAKSYINWAASM 210
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ L GVPWVMC+Q DAP+ II+ CNG+YCD + PNS NKP +WTENW GW+ ++GG +
Sbjct: 211 AVSLDTGVPWVMCQQADAPDPIINTCNGFYCDQFTPNSKNKPKMWTENWSGWFLSFGGAV 270
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RPVEDLAFAVARF+Q GG+F NYYMY GGTNFGR++GGPF TSYDYDAP+DEYGL
Sbjct: 271 PYRPVEDLAFAVARFYQLGGTFQNYYMYHGGTNFGRSTGGPFISTSYDYDAPLDEYGLTR 330
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLKDLH +IKLCE ALVA D LGQN EA VY+ CSAFLAN
Sbjct: 331 QPKWGHLKDLHKSIKLCEEALVATDPVTS-SLGQNLEATVYKTG----TGLCSAFLANFG 385
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ +V F G SY LP WSVSILPDC+N NTAK++S T I PN
Sbjct: 386 T-SDKTVNFNGNSYNLPGWSVSILPDCKNVALNTAKINSMTVI----------PNFV--H 432
Query: 466 QSMIESKLSSTS--KSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
QS+I S+ + SW + EP+G+ + F G+LE +N T D SDYLW+ +
Sbjct: 433 QSLIGDADSADTLGSSWSWIYEPVGISKNDAFVKPGLLEQINTTADKSDYLWYSLSTVIK 492
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYND 579
D++ ++ + ++S+ L F+NG+L GS G+ V V PV G N
Sbjct: 493 DNEPFLEDGSQT--VLHVESLGHALHAFVNGKLAGSGTGNAGNAKVAVEIPVTLLPGKNT 550
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG-DIDLSKILWTYQVGLKGEFQQIY 638
+ LLS T GLQNYGAF E +GAG G VKL G KNG +DLS + WTYQ+GLKGE
Sbjct: 551 IDLLSLTAGLQNYGAFFELEGAGITGPVKLEGLKNGTTVDLSSLQWTYQIGLKGE---EL 607
Query: 639 SIEENEAEWTDLTRDGIPST--FTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGR 696
+ ++W +T+ +P+ WYKT F+AP G DP+A+D MGKG+AWVNG IGR
Sbjct: 608 GLSSGNSQW--VTQPALPTKQPLIWYKTSFNAPAGNDPIAIDFSGMGKGEAWVNGQSIGR 665
Query: 697 YW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
YW T V+P GC + C+YRG+Y+S KC NC P+QT YHVPRSW+++S N LV+FEE G
Sbjct: 666 YWPTKVSPTSGCSN-CNYRGSYSSSKCLKNCAKPSQTLYHVPRSWVESSGNTLVLFEEIG 724
Query: 756 GNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQ-DGYI 814
G+P +I+ + + +C VSESH PV WS++ + K P + L C +
Sbjct: 725 GDPTQIAFATKQSASLCSHVSESHPLPVDMWSSNSEAE-----RKAGPVLSLECPFPNQV 779
Query: 815 ISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
ISSI+FAS+GTP+G C FS G C + +LS+V +
Sbjct: 780 ISSIKFASFGTPRGTCGSFSHGQCKSTRALSIVQK 814
>gi|356539454|ref|XP_003538213.1| PREDICTED: beta-galactosidase 8-like [Glycine max]
Length = 838
Score = 885 bits (2287), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 439/813 (53%), Positives = 556/813 (68%), Gaps = 44/813 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YDHRA++IDG RR+L+S IHYPR+TPEMWPDLI KSK+GG DVIETYVFWN HE +
Sbjct: 26 NVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+GQYNF+G+ D+VKFVK V ++GLY+ LRIGPY CAEWN+GGFP+WL IPGI+FRT+N
Sbjct: 86 QGQYNFEGRADLVKFVKAVAAAGLYVHLRIGPYACAEWNYGGFPLWLHFIPGIQFRTDNK 145
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PF+ EM+RF KIVD+M++E L++ QGGPII+ Q+ENEYGN++++YG K Y+KWAASM
Sbjct: 146 PFEAEMKRFTVKIVDMMKQESLYASQGGPIILSQVENEYGNIDAAYGPAAKSYIKWAASM 205
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A L GVPWVMC+Q DAP+ II+ CNG+YCD + PNS KP +WTENW GW+ ++GG +
Sbjct: 206 ATSLDTGVPWVMCQQADAPDPIINTCNGFYCDQFTPNSNAKPKMWTENWSGWFLSFGGAV 265
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RPVEDLAFAVARF+QRGG+F NYYMY GGTNFGRT+GGPF TSYDYDAPID+YG++
Sbjct: 266 PYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRTTGGPFISTSYDYDAPIDQYGIIR 325
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLKD+H AIKLCE AL+A D G N EA VY+ + S C+AFLANI
Sbjct: 326 QPKWGHLKDVHKAIKLCEEALIATDPT-ITSPGPNIEAAVYK-----TGSICAAFLANI- 378
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ A+VTF G SY LP WSVSILPDC+N V NTAK++S + I S
Sbjct: 379 ATSDATVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSASMIS------------SFTT 426
Query: 466 QSMIE--SKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
+S E L + W + EPIG+ ++F+ G+LE +N T D SDYLW+ I V
Sbjct: 427 ESFKEEVGSLDDSGSGWSWISEPIGISKSDSFSKFGLLEQINTTADKSDYLWYSISIDVE 486
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYND 579
D S + + I+S+ L FING++ GS G+ V V PV +G N
Sbjct: 487 GDSGS-------QTVLHIESLGHALHAFINGKIAGSGTGNSGKAKVNVDIPVTLVAGKNS 539
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGD-IDLSKILWTYQVGLKGEFQQIY 638
+ LLS TVGLQNYGAF + GAG G V L G KNG +DLS WTYQVGLK ++ +
Sbjct: 540 IDLLSLTVGLQNYGAFFDTWGAGITGPVILKGLKNGSTVDLSSQQWTYQVGLK--YEDLG 597
Query: 639 SIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
+ +W + + WYKT F AP G +PVA+D MGKG+AWVNG IGRYW
Sbjct: 598 PSNGSSGQWNSQSTLPTNQSLIWYKTNFVAPSGSNPVAIDFTGMGKGEAWVNGQSIGRYW 657
Query: 699 -TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
T V+P GGC D+C+YRGAY+S KC NCG P+QT YH+PRSWLQ +N LV+FEE+GG+
Sbjct: 658 PTYVSPNGGCTDSCNYRGAYSSSKCLKNCGKPSQTLYHIPRSWLQPDSNTLVLFEESGGD 717
Query: 758 PFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQ-DGYIIS 816
P +IS + +C VSESH PPV W++ K+ P + L C +IS
Sbjct: 718 PTQISFATKQIGSMCSHVSESHPPPVDLWNSDKG-------RKVGPVLSLECPYPNQLIS 770
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
SI+FAS+GTP G C F G C + +LS+V +
Sbjct: 771 SIKFASFGTPYGTCGNFKHGRCRSNKALSIVQK 803
>gi|4510395|gb|AAD21482.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 839
Score = 885 bits (2287), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 435/811 (53%), Positives = 554/811 (68%), Gaps = 38/811 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YDHRA++IDG R++LIS IHYPR+TPEMWP+LI KSK+GG DVIETYVFW+ HE
Sbjct: 25 NVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHEPE 84
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+ +YNF+G+ D+VKFVKL +GLY+ LRIGPYVCAEWN+GGFPVWL +PGI+FRT+N
Sbjct: 85 KNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTDNE 144
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFKEEMQRF KIVDLM++E L++ QGGPII+ QIENEYGN++S+YG K Y+KW+ASM
Sbjct: 145 PFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSASM 204
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
AL L GVPW MC+QTDAP+ +I+ CNG+YCD + PNS NKP +WTENW GW+ +G
Sbjct: 205 ALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGDPS 264
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RPVEDLAFAVARF+QRGG+F NYYMY GGTNF RTSGGP TSYDYDAPIDEYGLL
Sbjct: 265 PYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYGLLR 324
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHL+DLH AIKLCE AL+A D LG N EA VY+ +C+AFLAN+D
Sbjct: 325 QPKWGHLRDLHKAIKLCEDALIATDPT-ITSLGSNLEAAVYKTE----SGSCAAFLANVD 379
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ A+VTF G+SY LP WSVSILPDC+N FNTAKV + KT +
Sbjct: 380 TKSDATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKVKFNSISKTPD------------- 426
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
+ W +KEPIG+ + F G+LE +N T D SDYLW+ + + D
Sbjct: 427 ----GGSSAELGSQWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGD 482
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQ---PVEFQSGYNDLIL 582
+ + + + I+S+ V+ FING+L GS GH + + P+ +G N + L
Sbjct: 483 ET--FLDEGSKAVLHIESLGQVVYAFINGKLAGS--GHGKQKISLDIPINLVTGTNTIDL 538
Query: 583 LSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG-DIDLSKILWTYQVGLKGEFQQIYSIE 641
LS TVGL NYGAF + GAG G V L K G IDL+ WTYQVGLKGE + +++
Sbjct: 539 LSVTVGLANYGAFFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLATVD 598
Query: 642 ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TV 700
+EW + WYKT FDAP G +PVA+D GKG AWVNG IGRYW T
Sbjct: 599 --SSEWVSKSPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTS 656
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
+A GGC ++CDYRG+Y ++KC NCG P+QT YHVPRSWL+ S N+LV+FEE GG+P +
Sbjct: 657 IAGNGGCTESCDYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQ 716
Query: 761 ISVKLRST-RIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQ-DGYIISSI 818
IS + T +C VS+SH PPV W++ + + N+ P + L C +I SI
Sbjct: 717 ISFATKQTGSNLCLTVSQSHPPPVDTWTSDSKISNR---NRTRPVLSLKCPISTQVIFSI 773
Query: 819 EFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+FAS+GTP+G C F++G+C++ SLS+V +
Sbjct: 774 KFASFGTPKGTCGSFTQGHCNSSRSLSLVQK 804
>gi|350537827|ref|NP_001234312.1| TBG5 protein precursor [Solanum lycopersicum]
gi|7939623|gb|AAF70824.1|AF154423_1 putative beta-galactosidase [Solanum lycopersicum]
Length = 852
Score = 884 bits (2285), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 436/846 (51%), Positives = 570/846 (67%), Gaps = 45/846 (5%)
Query: 13 CLALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPR 72
CL++ +M++ ++ L C+ +S + NV+YDHRA+++DG RR+LIS IHYPR
Sbjct: 8 CLSV----IMLVFGVVFLHCLVMTSFAA-----NVTYDHRALVVDGRRRVLISGSIHYPR 58
Query: 73 ATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQ 132
+TP+MWPDLI KSK+GG DVIETYVFWN HE +R QY+F+G+ D++ FVKLV +GL++
Sbjct: 59 STPDMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYDFEGRKDLINFVKLVERAGLFVH 118
Query: 133 LRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQG 192
+RIGPYVCAEWN+GGFP+WL IPGIEFRT+N PFK EM+RF KIVD++++E L++ QG
Sbjct: 119 IRIGPYVCAEWNYGGFPLWLHFIPGIEFRTDNEPFKAEMKRFTAKIVDMIKQENLYASQG 178
Query: 193 GPIIMLQIENEYGN--MESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDA 250
GP+I+ QIENEYGN +ES YG + K YV WAASMA L GVPWVMC+Q DAP ++I+
Sbjct: 179 GPVILSQIENEYGNGDIESRYGPRAKPYVNWAASMATSLNTGVPWVMCQQPDAPPSVINT 238
Query: 251 CNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNY 310
CNG+YCD +K NS P +WTENW GW+ ++GG +P+RPVED+AFAVARFFQRGG+F NY
Sbjct: 239 CNGFYCDQFKQNSDKTPKMWTENWTGWFLSFGGPVPYRPVEDIAFAVARFFQRGGTFQNY 298
Query: 311 YMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAAD 370
YMY GGTNFGRTSGGPF TSYDYDAP+DEYGL+++PKWGHLKDLH AIKLCE A+VA +
Sbjct: 299 YMYHGGTNFGRTSGGPFIATSYDYDAPLDEYGLINQPKWGHLKDLHKAIKLCEAAMVATE 358
Query: 371 SAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILP 430
LG N E VY+ + S C+AFLAN + A+V+F G SY LPPWSVSILP
Sbjct: 359 -PNVTSLGSNIEVSVYKTD-----SQCAAFLANTATQSDAAVSFNGNSYHLPPWSVSILP 412
Query: 431 DCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVW 490
DC+N F+TAK++S ++I T V + S ++ S S W +V EP+G+
Sbjct: 413 DCKNVAFSTAKINSASTISTF-----------VTRSSEADASGGSLS-GWTSVNEPVGIS 460
Query: 491 SENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRV 550
+EN FT G+LE +N T D SDYLW+ + + +D+ F + + + ++ VL
Sbjct: 461 NENAFTRMGLLEQINTTADKSDYLWYSLSVNIKNDE-PFLQDGSAT-VLHVKTLGHVLHA 518
Query: 551 FINGQLTGSVIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQ 606
+ING+L+GS G+ + PV G N + LLS TVGLQNYGAF + GAG G
Sbjct: 519 YINGRLSGSGKGNSRHSNFTIEVPVTLVPGENKIDLLSATVGLQNYGAFFDLKGAGITGP 578
Query: 607 VKLTGFKNGD-IDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTY 665
V+L GFKNG DLS WTYQVGLKGE + W T WYK
Sbjct: 579 VQLKGFKNGSTTDLSSKQWTYQVGLKGE--DLGLSNGGSTLWKSQTALPTNQPLIWYKAS 636
Query: 666 FDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV-VAPKGGCQDTCDYRGAYNSDKCTT 724
FDAP G P+++D MGKG+AWVNG IGR+W +AP GC D C+YRG YN++KC
Sbjct: 637 FDAPAGDTPLSMDFTGMGKGEAWVNGQSIGRFWPAYIAPNDGCTDPCNYRGGYNAEKCLK 696
Query: 725 NCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVR 784
NCG P+Q YHVPRSWL++S N+LV+FEE GG+P ++S R + VC + S++H P+
Sbjct: 697 NCGKPSQLLYHVPRSWLKSSGNVLVLFEEMGGDPTKLSFATREIQSVCSRTSDAHPLPID 756
Query: 785 KWSNSYSVDGKLSINKMAPEMHLHC-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMS 843
W++ K P + L C +ISSI+FAS+GTPQG C F G C + +
Sbjct: 757 MWASEDDAR-----KKSGPTLSLECPHPNQVISSIKFASFGTPQGTCGSFIHGRCSSSNA 811
Query: 844 LSVVSE 849
LS+V +
Sbjct: 812 LSIVKK 817
>gi|125583741|gb|EAZ24672.1| hypothetical protein OsJ_08441 [Oryza sativa Japonica Group]
Length = 861
Score = 884 bits (2284), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 426/813 (52%), Positives = 563/813 (69%), Gaps = 27/813 (3%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YDHRA++IDG RR+L+S IHYPR+TP+MWP LI KSK+GG DVIETYVFW+ HE++
Sbjct: 32 NVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFWDIHEAV 91
Query: 106 RGQ---YNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
RGQ Y+F+G+ D+V+FVK V +GLY+ LRIGPYVCAEWN+GGFPVWL +PGI+FRT
Sbjct: 92 RGQAQQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRT 151
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWA 222
+N FK EMQRF +K+VD M+ L++ QGGPII+ QIENEYGN++S+YG GK Y++WA
Sbjct: 152 DNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWA 211
Query: 223 ASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWG 282
A MA+ L GVPWVMC+Q+DAP+ +I+ CNG+YCD + PNS +KP +WTENW GW+ ++G
Sbjct: 212 AGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLSFG 271
Query: 283 GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYG 342
G +P+RP EDLAFAVARF+QRGG+F NYYMY GGTNFGR++GGPF TSYDYDAPIDEYG
Sbjct: 272 GAVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEYG 331
Query: 343 LLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLA 402
++ +PKWGHL+D+H AIKLCEPAL+AA+ + Y LGQN EA VY+ S C+AFLA
Sbjct: 332 MVRQPKWGHLRDVHKAIKLCEPALIAAEPS-YSSLGQNTEATVYQT---ADNSICAAFLA 387
Query: 403 NIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNIS 462
N+D + +V F G +Y LP WSVSILPDC+N V NTA+++SQ + + L +I
Sbjct: 388 NVDAQSDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMR---SLGSSIQ 444
Query: 463 VPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
S+I +L++ W EP+G+ EN T G++E +N T D SD+LW+ T I V
Sbjct: 445 DTDDSLITPELATA--GWSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVV 502
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQL----TGSVIGHWVKVVQPVEFQSGYN 578
D+ N + + ++S+ VL+++ING+L GS + + PV G N
Sbjct: 503 KGDEPYL---NGSQSNLLVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKN 559
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIY 638
+ LLS TVGL NYGAF + GAG G VKL+G NG ++LS WTYQ+GL+GE +Y
Sbjct: 560 KIDLLSTTVGLSNYGAFFDLVGAGVTGPVKLSG-PNGALNLSSTDWTYQIGLRGEDLHLY 618
Query: 639 SIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
+ E EW WYKT F AP G DPVA+D MGKG+AWVNG IGRYW
Sbjct: 619 NPSEASPEWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYW 678
Query: 699 -TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
T +AP+ GC ++C+YRGAY+S+KC CG P+QT YHVPRS+LQ +N LV+FE+ GG+
Sbjct: 679 PTNLAPQSGCVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGD 738
Query: 758 PFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHC-QDGYIIS 816
P IS R T +C VSE H + W + + + P + L C ++G +IS
Sbjct: 739 PSMISFTTRQTSSICAHVSEMHPAQIDSW-----ISPQQTSQTQGPALRLECPREGQVIS 793
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+I+FAS+GTP G C ++ G C + +L+VV E
Sbjct: 794 NIKFASFGTPSGTCGNYNHGECSSSQALAVVQE 826
>gi|255578884|ref|XP_002530296.1| beta-galactosidase, putative [Ricinus communis]
gi|223530194|gb|EEF32103.1| beta-galactosidase, putative [Ricinus communis]
Length = 842
Score = 884 bits (2284), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 438/812 (53%), Positives = 549/812 (67%), Gaps = 36/812 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YDHRA++IDG RR+LIS IHYPR+TPEMWP LI KSK+GG DVIETYVFWN HE +
Sbjct: 24 NVTYDHRALLIDGKRRVLISGSIHYPRSTPEMWPGLIQKSKDGGLDVIETYVFWNGHEPV 83
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
R QYNF+G+ D+VKFVKLV +GLY+ +RIGPYVCAEWN+GGFP+WL IPGI+FRT+N
Sbjct: 84 RNQYNFEGRYDLVKFVKLVAEAGLYVHIRIGPYVCAEWNYGGFPLWLHFIPGIKFRTDNE 143
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK EMQRF KIVD+M++E L++ QGGPII+ QIENEYGN++S++G K Y+ WAA M
Sbjct: 144 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAFGPAAKTYINWAAGM 203
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ L GVPWVMC+Q DAP+ +I+ CNG+YCD + PNS NKP +WTENW GW+ ++GG +
Sbjct: 204 AISLDTGVPWVMCQQADAPDPVINTCNGFYCDQFTPNSKNKPKMWTENWSGWFQSFGGAV 263
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RPVEDLAFAVARF+Q G+F NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGLL
Sbjct: 264 PYRPVEDLAFAVARFYQLSGTFQNYYMYHGGTNFGRTTGGPFISTSYDYDAPLDEYGLLR 323
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLKD+H AIKLCE AL+A D LG N EA VY+ + S C+AFLANI
Sbjct: 324 QPKWGHLKDVHKAIKLCEEALIATDPTT-TSLGSNLEATVYK-----TGSLCAAFLANI- 376
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
T +VTF G SY LP WSVSILPDC+N NTAK++S T + S +
Sbjct: 377 ATTDKTVTFNGNSYNLPAWSVSILPDCKNVALNTAKINSVTIVP------------SFAR 424
Query: 466 QSMIESKLSSTS--KSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
QS++ SS + W + EP+G+ + F G+LE +N T D SDYLW+ +
Sbjct: 425 QSLVGDVDSSKAIGSGWSWINEPVGISKNDAFVKSGLLEQINTTADKSDYLWYSLSTNIK 484
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGH----WVKVVQPVEFQSGYND 579
D+ ++ + ++S+ L FING+L GS G V V P+ G N
Sbjct: 485 GDEPFLEDGSQT--VLHVESLGHALHAFINGKLAGSGTGKSSNAKVTVDIPITLTPGKNT 542
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS 639
+ LLS TVGLQNYGAF E GAG G VKL +DLS WTYQ+GLKGE
Sbjct: 543 IDLLSLTVGLQNYGAFYELTGAGITGPVKLKAQNGNTVDLSSQQWTYQIGLKGE--DSGI 600
Query: 640 IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW- 698
+ +EW WYKT FDAP G DPVA+D MGKG+AWVNG IGRYW
Sbjct: 601 SSGSSSEWVSQPTLPKNQPLIWYKTSFDAPAGNDPVAIDFTGMGKGEAWVNGQSIGRYWP 660
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
T V+P GC D+C+YRG Y+S+KC NCG P+QT+YH+PRSW+++S N+LV+ EE GG+P
Sbjct: 661 TNVSPSSGCADSCNYRGGYSSNKCLKNCGKPSQTFYHIPRSWIKSSGNILVLLEEIGGDP 720
Query: 759 FEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHC-QDGYIISS 817
+I+ R +C VSESH PV W N+ S GK S P + L C +ISS
Sbjct: 721 TQIAFATRQVGSLCSHVSESHPQPVDMW-NTDSEGGKRS----GPVLSLQCPHPDKVISS 775
Query: 818 IEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
I+FAS+GTP G C +S G C + +LS+V +
Sbjct: 776 IKFASFGTPHGSCGSYSHGKCSSTSALSIVQK 807
>gi|125543160|gb|EAY89299.1| hypothetical protein OsI_10800 [Oryza sativa Indica Group]
Length = 861
Score = 883 bits (2281), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/813 (52%), Positives = 562/813 (69%), Gaps = 27/813 (3%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YDHRA++IDG RR+L+S IHYPR+TP+MWP LI KSK+GG DVIETYVFW+ HE +
Sbjct: 32 NVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFWDIHEPV 91
Query: 106 RGQ---YNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
RGQ Y+F+G+ D+V+FVK V +GLY+ LRIGPYVCAEWN+GGFPVWL +PGI+FRT
Sbjct: 92 RGQAQQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRT 151
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWA 222
+N FK EMQRF +K+VD M+ L++ QGGPII+ QIENEYGN++S+YG GK Y++WA
Sbjct: 152 DNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWA 211
Query: 223 ASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWG 282
A MA+ L GVPWVMC+Q+DAP+ +I+ CNG+YCD + PNS +KP +WTENW GW+ ++G
Sbjct: 212 AGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLSFG 271
Query: 283 GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYG 342
G +P+RP EDLAFAVARF+QRGG+F NYYMY GGTNFGR++GGPF TSYDYDAPIDEYG
Sbjct: 272 GAVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEYG 331
Query: 343 LLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLA 402
++ +PKWGHL+D+H AIKLCEPAL+AA+ + Y LGQN EA VY+ S C+AFLA
Sbjct: 332 MVRQPKWGHLRDVHKAIKLCEPALIAAEPS-YSSLGQNTEATVYQT---ADNSICAAFLA 387
Query: 403 NIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNIS 462
N+D + +V F G +Y LP WSVSILPDC+N V NTA+++SQ + + L +I
Sbjct: 388 NVDAQSDKAVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMR---SLGSSIQ 444
Query: 463 VPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
S+I +L++ W EP+G+ EN T G++E +N T D SD+LW+ T I V
Sbjct: 445 DTDDSLITPELATA--GWSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVV 502
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQL----TGSVIGHWVKVVQPVEFQSGYN 578
D+ N + + ++S+ VL+V+ING+L GS + + PV G N
Sbjct: 503 KGDEPYL---NGSQSNLLVNSLGHVLQVYINGKLAGSAKGSASSSLISLQTPVTLVPGKN 559
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIY 638
+ LLS TVGL NYGAF + GAG G VKL+G NG ++LS WTYQ+GL+GE +Y
Sbjct: 560 KIDLLSTTVGLSNYGAFFDLIGAGVTGPVKLSG-PNGALNLSSTDWTYQIGLRGEDLHLY 618
Query: 639 SIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
+ E EW WYKT F AP G DPVA+D MGKG+AWVNG IGRYW
Sbjct: 619 NPSEASPEWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYW 678
Query: 699 -TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
T +AP+ GC ++C+YRGAY+S+KC CG P+QT YHVPRS+LQ +N LV+FE+ GG+
Sbjct: 679 PTNLAPQSGCVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGD 738
Query: 758 PFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHC-QDGYIIS 816
P IS R T +C VSE H + W + + + P + L C ++G +IS
Sbjct: 739 PSMISFTTRQTSSICAHVSEMHPAQIDSW-----ISPQQTSQTPGPALRLECPREGQVIS 793
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+I+FAS+GTP G C ++ G C + +L+VV E
Sbjct: 794 NIKFASFGTPSGTCGNYNHGECSSSQALAVVQE 826
>gi|148906967|gb|ABR16628.1| unknown [Picea sitchensis]
Length = 836
Score = 881 bits (2276), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 434/808 (53%), Positives = 534/808 (66%), Gaps = 37/808 (4%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YDH+A++I+G RR+LIS IHYPR+T EMWPDL K+K+GG DVI+TYVFWN HE
Sbjct: 25 VTYDHKALVINGERRILISGSIHYPRSTAEMWPDLFRKAKDGGLDVIQTYVFWNMHEPSP 84
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G YNF+G+ D+VKFVKL +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N P
Sbjct: 85 GNYNFEGRFDLVKFVKLAQEAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 144
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK M+ F KK+VDLM+ E LF QGGPII+ Q+ENEY E YG G Y+ WAA MA
Sbjct: 145 FKNAMEGFTKKVVDLMKSEGLFESQGGPIILAQVENEYKPEEMEYGLAGAQYMNWAAQMA 204
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+G+ GVPWVMCKQ DAP+ +I+ CNG+YCD + PN KPT+WTE W GWYT +GG P
Sbjct: 205 VGMDTGVPWVMCKQDDAPDPVINTCNGFYCDNFVPNKPYKPTMWTEAWSGWYTEFGGASP 264
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
HRPVEDLAFAVARFF +GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGL+ +
Sbjct: 265 HRPVEDLAFAVARFFVKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLIRQ 324
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHLK+LH AIKLCEPALV+ D LG Q+A+VY A NC+AF+ N D
Sbjct: 325 PKWGHLKELHKAIKLCEPALVSGDPV-VTSLGHFQQAYVYSA----GAGNCAAFIVNYDS 379
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
++ V F GQ Y + PWSVSILPDCRN VFNTAKV QTS + ++P
Sbjct: 380 NSVGRVIFNGQRYKIAPWSVSILPDCRNVVFNTAKVDVQTS------QMKMTP------- 426
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
W ++ E I + +N+ + G+LE +N+T+D +DYLW+IT + V D+D
Sbjct: 427 --------VGGFGWESIDENIASFEDNSISAVGLLEQINITRDNTDYLWYITSVEV-DED 477
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGH----WVKVVQPVEFQSGYNDLIL 582
F K N P +T+ S D L VFIN L GS G V+ V G N + L
Sbjct: 478 EPFIK-NGGLPVLTVQSAGDALHVFINDDLAGSQYGRKENPKVRFSSGVRLNVGTNKISL 536
Query: 583 LSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEE 642
LS TVGLQN G E AG G + L+GFK+G DLS W+YQ+GLKGE +++ +
Sbjct: 537 LSMTVGLQNIGPHFEMANAGVLGPITLSGFKDGTRDLSSQRWSYQIGLKGETMNLHTSGD 596
Query: 643 NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVA 702
N EW WYK FDAP G DP+ LDL SMGKGQAWVNG IGRYW
Sbjct: 597 NTVEWMKGVAVPQSQPLRWYKAEFDAPAGEDPLGLDLSSMGKGQAWVNGQSIGRYWPSYL 656
Query: 703 PKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEIS 762
+G C D C Y G Y KC TNCG +Q WYHVPRSWLQ S N LV+FEE GGNP +S
Sbjct: 657 AEGVCSDGCSYEGTYRPHKCDTNCGQSSQRWYHVPRSWLQPSGNTLVLFEEIGGNPSGVS 716
Query: 763 VKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKM-APEMHLHCQDGYIISSIEFA 821
+ RS VC VSESH + W ++ + K+ P++HL C G IS+I+FA
Sbjct: 717 LVTRSVDSVCAHVSESHSQSINFW----RLESTDQVQKLHIPKVHLQCSKGQRISAIKFA 772
Query: 822 SYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
S+GTPQG C F +G+CH+P S++ + +
Sbjct: 773 SFGTPQGLCGSFQQGDCHSPNSVATIQK 800
>gi|356543464|ref|XP_003540180.1| PREDICTED: beta-galactosidase 8-like isoform 1 [Glycine max]
Length = 840
Score = 880 bits (2274), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 446/839 (53%), Positives = 566/839 (67%), Gaps = 45/839 (5%)
Query: 20 PMMMMMMMIHLSCVSSSSASTFFKPF--NVSYDHRAIIIDGNRRMLISAGIHYPRATPEM 77
P +++++ L C+ + K F NV YDHRA++IDG RR+LIS IHYPR+TPEM
Sbjct: 3 PAQIVLVLFWLLCIHTP------KLFCANVEYDHRALVIDGKRRVLISGSIHYPRSTPEM 56
Query: 78 WPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGP 137
WPDLI KSK+GG DVIETYVFWN HE +RGQY+F G+ D+VKFVK V ++GLY+ LRIGP
Sbjct: 57 WPDLIQKSKDGGLDVIETYVFWNLHEPVRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGP 116
Query: 138 YVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIM 197
YVCAEWN+GGFPVWL IPGI+FRT+N PFK EM+RF KIVD++++E L++ QGGP+I+
Sbjct: 117 YVCAEWNYGGFPVWLHFIPGIKFRTDNEPFKAEMKRFTAKIVDMIKQEKLYASQGGPVIL 176
Query: 198 LQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCD 257
QIENEYGN++++YG GK Y+KWAA+MA L GVPWVMC Q DAP+ II+ NG+Y D
Sbjct: 177 SQIENEYGNIDTAYGAAGKSYIKWAATMATSLDTGVPWVMCLQADAPDPIINTWNGFYGD 236
Query: 258 GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGT 317
+ PNS KP +WTENW GW+ +GG +P+RPVEDLAFAVARFFQRGG+F NYYMY GGT
Sbjct: 237 EFTPNSNTKPKMWTENWSGWFLVFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGT 296
Query: 318 NFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKL 377
NF R SGGPF TSYDYDAPIDEYG++ +PKWGHLK++H AIKLCE AL+A D L
Sbjct: 297 NFDRASGGPFIATSYDYDAPIDEYGIIRQPKWGHLKEVHKAIKLCEEALIATDPT-ITSL 355
Query: 378 GQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVF 437
G N EA VY+ S C+AFLAN+ + +V F G SY LP WSVSILPDC++ V
Sbjct: 356 GPNLEAAVYKTG-----SVCAAFLANVGTKSDVTVNFSGNSYHLPAWSVSILPDCKSVVL 410
Query: 438 NTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTV 497
NTAK++S ++I + ++ + S+ SST SW++ EP+G+ ++F+
Sbjct: 411 NTAKINSASAISSF--------TTESSKEDIGSSEASSTGWSWIS--EPVGISKTDSFSQ 460
Query: 498 QGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLT 557
G+LE +N T D SDYLW+ I D S + + I+S+ L FING+L
Sbjct: 461 TGLLEQINTTADKSDYLWYSLSIDYKADASS-------QTVLHIESLGHALHAFINGKLA 513
Query: 558 GSVIGHWVK----VVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFK 613
GS G+ K V PV +G N + LLS TVGLQNYGAF + G G G V L GF
Sbjct: 514 GSQPGNSGKYKFTVDIPVTLVAGKNTIDLLSLTVGLQNYGAFFDTWGVGITGPVILKGFA 573
Query: 614 NGD-IDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGI 672
NG+ +DLS WTYQVGL+GE + S + +W + TWYKT F AP G
Sbjct: 574 NGNTLDLSSQKWTYQVGLQGEDLGLSS--GSSGQWNLQSTFPKNQPLTWYKTTFSAPSGS 631
Query: 673 DPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQ 731
DPVA+D MGKG+AWVNG IGRYW T VA C D+C+YRG Y++ KC NC P+Q
Sbjct: 632 DPVAIDFTGMGKGEAWVNGQRIGRYWPTYVASDASCTDSCNYRGPYSASKCRKNCEKPSQ 691
Query: 732 TWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYS 791
T YHVPRSWL+ S N+LV+FEE GG+P +IS + T +C VS+SH PPV W NS +
Sbjct: 692 TLYHVPRSWLKPSGNILVLFEERGGDPTQISFVTKQTESLCAHVSDSHPPPVDLW-NSET 750
Query: 792 VDGKLSINKMAPEMHLHC-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
G+ K+ P + L C D +ISSI+FASYGTP G C F G C + +LS+V +
Sbjct: 751 ESGR----KVGPVLSLTCPHDNQVISSIKFASYGTPLGTCGNFYHGRCSSNKALSIVQK 805
>gi|56201401|dbj|BAD20774.2| beta-galactosidase [Raphanus sativus]
Length = 851
Score = 878 bits (2268), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 443/809 (54%), Positives = 549/809 (67%), Gaps = 29/809 (3%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YDHRA++IDG R++LIS IHYPR+TPEMWPDLI KSK+GG DVIETYVFWN HE
Sbjct: 32 SVTYDHRALVIDGKRKILISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNGHEPE 91
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+ +YNF+G+ D+VKFVKL +GLY+ LRIGPY CAEWN+GGFPVWL +PGI+FRT+N
Sbjct: 92 KNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYACAEWNYGGFPVWLHFVPGIKFRTDNE 151
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK EMQRF KIVDLM++E L++ QGGPII+ QIENEYGN++SSYG GK Y+KW+ASM
Sbjct: 152 PFKAEMQRFTAKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSSYGAAGKSYMKWSASM 211
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
AL L GVPW MC+Q DAP+ II+ CNG+YCD + PNS NKP +WTENW GW+ +G
Sbjct: 212 ALSLDTGVPWNMCQQGDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGEPS 271
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RPVEDLAFAVARFFQRGG+F NYYMY GGTNF RTSGGP TSYDYDAPIDEYGLL
Sbjct: 272 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFERTSGGPLISTSYDYDAPIDEYGLLR 331
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHL+DLH AIKLCE AL+A D + LG N EA VY+ S +C+AFLANI
Sbjct: 332 QPKWGHLRDLHKAIKLCEDALIATD-PKITSLGSNLEAAVYKT----STGSCAAFLANIG 386
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ A+VTF G+SY LP WSVSILPDC+N FNTAK++S T T L PN
Sbjct: 387 TKSDATVTFNGKSYRLPAWSVSILPDCKNVAFNTAKINSATE-STAFARQSLKPNADS-- 443
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
++L S W +KEP+G+ + F G+LE +N T D SDYLW+ ++ + D
Sbjct: 444 ----SAELGS---QWSYIKEPVGISKADAFVKPGLLEQINTTADKSDYLWYSLRMDIKGD 496
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG-HWVKVVQPVEFQSGYNDLILLS 584
+ + + + + S+ ++ FING+L GS G + + P+ +G N + LLS
Sbjct: 497 ET--FLDEGSKAVLHVQSIGQLVYAFINGKLAGSGNGKQKISLDIPINLVTGKNTIDLLS 554
Query: 585 QTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG-DIDLSKILWTYQVGLKGEFQQIYSIEEN 643
TVGL NYG F + GAG G V L K G DLS WTYQVGLKGE + + S +
Sbjct: 555 VTVGLANYGPFFDLTGAGITGPVSLKSAKTGSSTDLSSQQWTYQVGLKGEDKGLGS--GD 612
Query: 644 EAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TVVA 702
+EW + WYKT FDAP G DPVA+D GKG AWVNG IGRYW T +A
Sbjct: 613 SSEWVSNSPLPTSQPLIWYKTTFDAPSGSDPVAIDFTGTGKGIAWVNGQSIGRYWPTSIA 672
Query: 703 PKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEIS 762
GC +CDYRG+Y S+KC NCG P+QT YHVPRSW++ S N LV+ EE GG+P +IS
Sbjct: 673 RTDGCVGSCDYRGSYRSNKCLKNCGKPSQTLYHVPRSWIKPSGNTLVLLEEMGGDPTKIS 732
Query: 763 VKLRST-RIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQ-DGYIISSIEF 820
+ T +C VS+SH PV W + D K S N+ +P + L C +ISSI F
Sbjct: 733 FATKQTGSNLCLTVSQSHPAPVDTWIS----DSKFS-NRTSPVLSLKCPVSTQVISSIRF 787
Query: 821 ASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
AS+GTP G C FS G+C + SLSVV +
Sbjct: 788 ASFGTPTGTCGSFSYGHCSSARSLSVVQK 816
>gi|449525184|ref|XP_004169598.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 8-like [Cucumis
sativus]
Length = 844
Score = 876 bits (2264), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 436/818 (53%), Positives = 556/818 (67%), Gaps = 36/818 (4%)
Query: 40 TFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFW 99
+F NV+YDHRA++IDG R++L+S +HYPR+TPEMWP +I KSK+GG DVIETYVFW
Sbjct: 20 SFSLAVNVTYDHRALVIDGKRKVLVSGSLHYPRSTPEMWPGIIQKSKDGGLDVIETYVFW 79
Query: 100 NAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIE 159
N HE +R QY+F+G+ D+VKF+KLVG++GLY+ +RIGPYVCAEWN+GGFPVWL +PG++
Sbjct: 80 NLHEPVRNQYDFEGRKDLVKFIKLVGAAGLYVHVRIGPYVCAEWNYGGFPVWLHFVPGVQ 139
Query: 160 FRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYV 219
FRT+N PFK EM+RF KIVD++++E L++ QGGPII+ QIENEYGN++SS+G K YV
Sbjct: 140 FRTDNEPFKAEMKRFTAKIVDVLKQEKLYASQGGPIILSQIENEYGNVQSSFGSAAKSYV 199
Query: 220 KWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYT 279
+WAA+MA L GVPWVMC Q DAP+ II+ CNG+YCD + PNS NKP +WTENW GW+
Sbjct: 200 QWAATMATSLNTGVPWVMCNQPDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFL 259
Query: 280 TWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPID 339
++GG LP+RPVEDLAFAVARF+Q GGS NYYMY GGTNFGRTSGGPF TSYDYDAPID
Sbjct: 260 SFGGALPYRPVEDLAFAVARFYQTGGSLQNYYMYHGGTNFGRTSGGPFIATSYDYDAPID 319
Query: 340 EYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSA 399
EYGL+ +PKWGHL+D+H AIK+CE ALV+ D A LG N EA VY+ S S CSA
Sbjct: 320 EYGLVRQPKWGHLRDVHKAIKMCEEALVSTDPA-VTSLGPNLEATVYK-----SGSQCSA 373
Query: 400 FLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSP 459
FLAN+D + +VTF G SY LP WSVSILPDC+N V NTAK++S T+ + + PL
Sbjct: 374 FLANVDTQSDKTVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSVTTRPSFS-NQPLKV 432
Query: 460 NISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQ 519
++S + + W + EPIG+ N+F G+ E +N T D SDYLW+
Sbjct: 433 DVSASE---------AFDSGWSWIDEPIGISKNNSFANLGLSEQINTTADKSDYLWYSLS 483
Query: 520 IYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQ----PVEFQS 575
+ D+ + N + +DS+ VL VFIN +L GS G P+
Sbjct: 484 TDIKGDEP--YLANGSNTVLHVDSLGHVLHVFINKKLAGSGKGSGGSSKVSLDIPITLVP 541
Query: 576 GYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG-DIDLSKILWTYQVGLKGEF 634
G N + LLS TVGLQNYGAF E GAG G VKL KN +DLS WTYQ+GL+GE
Sbjct: 542 GKNTIDLLSLTVGLQNYGAFFELRGAGVTGPVKLENXKNNITVDLSSGQWTYQIGLEGED 601
Query: 635 QQIYSIEENEAEWTDLTRDGIPST--FTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGH 692
+ S + ++W L++ +P TWYKT FDAP G DP+ALD GKG+AW+NGH
Sbjct: 602 LGLPS--GSTSQW--LSQPNLPKNKPLTWYKTTFDAPAGSDPLALDFTGFGKGEAWINGH 657
Query: 693 HIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
IGRYW G C CDY+GAY+++KC NCG P+QT YHVP+SWL+ + N LV+FE
Sbjct: 658 SIGRYWPSYIASGQCTSYCDYKGAYSANKCLRNCGKPSQTLYHVPQSWLKPTGNTLVLFE 717
Query: 753 ETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQD- 811
E G +P ++ + +C VSESH PPV WS+ D K K P + L C
Sbjct: 718 EIGSDPTRLTFASKQLGSLCSHVSESHPPPVEMWSS----DSKQ--QKTGPVLSLECPSP 771
Query: 812 GYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+ISSI+FAS+GTP+G C FS G C +LS+V +
Sbjct: 772 SQVISSIKFASFGTPRGTCGSFSHGQCSTRNALSIVQK 809
>gi|449462081|ref|XP_004148770.1| PREDICTED: beta-galactosidase 8-like [Cucumis sativus]
Length = 844
Score = 876 bits (2263), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 436/818 (53%), Positives = 556/818 (67%), Gaps = 36/818 (4%)
Query: 40 TFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFW 99
+F NV+YDHRA++IDG R++L+S +HYPR+TPEMWP +I KSK+GG DVIETYVFW
Sbjct: 20 SFSLAVNVTYDHRALVIDGKRKVLVSGSLHYPRSTPEMWPGIIQKSKDGGLDVIETYVFW 79
Query: 100 NAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIE 159
N HE +R QY+F+G+ D+VKF+KLVG++GLY+ +RIGPYVCAEWN+GGFPVWL +PG++
Sbjct: 80 NLHEPVRNQYDFEGRKDLVKFIKLVGAAGLYVHVRIGPYVCAEWNYGGFPVWLHFVPGVQ 139
Query: 160 FRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYV 219
FRT+N PFK EM+RF KIVD++++E L++ QGGPII+ QIENEYGN++SS+G K YV
Sbjct: 140 FRTDNEPFKAEMKRFTAKIVDVLKQEKLYASQGGPIILSQIENEYGNVQSSFGSAAKSYV 199
Query: 220 KWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYT 279
+WAA+MA L GVPWVMC Q DAP+ II+ CNG+YCD + PNS NKP +WTENW GW+
Sbjct: 200 QWAATMATSLNTGVPWVMCNQPDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFL 259
Query: 280 TWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPID 339
++GG LP+RPVEDLAFAVARF+Q GGS NYYMY GGTNFGRTSGGPF TSYDYDAPID
Sbjct: 260 SFGGALPYRPVEDLAFAVARFYQTGGSLQNYYMYHGGTNFGRTSGGPFIATSYDYDAPID 319
Query: 340 EYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSA 399
EYGL+ +PKWGHL+D+H AIK+CE ALV+ D A LG N EA VY+ S S CSA
Sbjct: 320 EYGLVRQPKWGHLRDVHKAIKMCEEALVSTDPA-VTSLGPNLEATVYK-----SGSQCSA 373
Query: 400 FLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSP 459
FLAN+D + +VTF G SY LP WSVSILPDC+N V NTAK++S T+ + + PL
Sbjct: 374 FLANVDTQSDKTVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSVTTRPSFS-NQPLKV 432
Query: 460 NISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQ 519
++S + + W + EPIG+ N+F G+ E +N T D SDYLW+
Sbjct: 433 DVSASE---------AFDSGWSWIDEPIGISKNNSFANLGLSEQINTTADKSDYLWYSLS 483
Query: 520 IYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQ----PVEFQS 575
+ D+ + N + +DS+ VL VFIN +L GS G P+
Sbjct: 484 TDIKGDEP--YLANGSNTVLHVDSLGHVLHVFINKKLAGSGKGSGGSSKVSLDIPITLVP 541
Query: 576 GYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG-DIDLSKILWTYQVGLKGEF 634
G N + LLS TVGLQNYGAF E GAG G VKL KN +DLS WTYQ+GL+GE
Sbjct: 542 GKNTIDLLSLTVGLQNYGAFFELRGAGVTGPVKLENQKNNITVDLSSGQWTYQIGLEGED 601
Query: 635 QQIYSIEENEAEWTDLTRDGIPST--FTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGH 692
+ S + ++W L++ +P TWYKT FDAP G DP+ALD GKG+AW+NGH
Sbjct: 602 LGLPS--GSTSQW--LSQPNLPKNKPLTWYKTTFDAPAGSDPLALDFTGFGKGEAWINGH 657
Query: 693 HIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
IGRYW G C CDY+GAY+++KC NCG P+QT YHVP+SWL+ + N LV+FE
Sbjct: 658 SIGRYWPSYIASGQCTSYCDYKGAYSANKCLRNCGKPSQTLYHVPQSWLKPTGNTLVLFE 717
Query: 753 ETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQD- 811
E G +P ++ + +C VSESH PPV WS+ D K K P + L C
Sbjct: 718 EIGSDPTRLTFASKQLGSLCSHVSESHPPPVEMWSS----DSKQ--QKTGPVLSLECPSP 771
Query: 812 GYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+ISSI+FAS+GTP+G C FS G C +LS+V +
Sbjct: 772 SQVISSIKFASFGTPRGTCGSFSHGQCSTRNALSIVQK 809
>gi|61614851|gb|AAQ21371.2| beta-galactosidase [Sandersonia aurantiaca]
Length = 818
Score = 874 bits (2259), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/803 (53%), Positives = 551/803 (68%), Gaps = 28/803 (3%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
+IDG RR+LIS IHYPR+TPEMWPDLI KSK GG D+IETYVFW+ HE ++GQY+F+G+
Sbjct: 1 VIDGTRRVLISGSIHYPRSTPEMWPDLIDKSKSGGLDIIETYVFWDLHEPLQGQYDFQGR 60
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
D+V+F+K VG +GLY+ LRIGPY CAEWN+GGFP+WL IPGI+FRT+N PFK+EMQRF
Sbjct: 61 KDLVRFIKTVGEAGLYVHLRIGPYACAEWNYGGFPLWLHFIPGIKFRTDNKPFKDEMQRF 120
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
KIVDLM++E L++ QGGPII+ QIENEYGN++ +YG K Y+ WAASMA L GVP
Sbjct: 121 TTKIVDLMKQENLYASQGGPIILSQIENEYGNIDFAYGAAAKSYINWAASMATSLDTGVP 180
Query: 235 WVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLA 294
WVMC+QTDAP+ II+ CNG+YCD + PNS NKP +WTENW GW+ ++GG +P RPVEDLA
Sbjct: 181 WVMCQQTDAPDPIINTCNGFYCDQFSPNSNNKPKIWTENWSGWFLSFGGPVPQRPVEDLA 240
Query: 295 FAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKD 354
FAVARFFQRGG+F NYYMY G NFG TSGGPF TSYDYDAPIDEYG+ +PKWGHLK+
Sbjct: 241 FAVARFFQRGGTFQNYYMYTWGNNFGHTSGGPFIATSYDYDAPIDEYGITRQPKWGHLKE 300
Query: 355 LHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTF 414
LH AIKLCEPALVA D ++LG N EAHVY+ + C+AFLANI + A+VTF
Sbjct: 301 LHKAIKLCEPALVATDH-HTLRLGPNLEAHVYKT----ASGVCAAFLANIGTQSDATVTF 355
Query: 415 LGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEF--SLPLSPNISVPQQSMIESK 472
G+SY+LP WSVSILPDCR VFNTA+++SQ +++ S L+ + + + +S
Sbjct: 356 NGKSYSLPAWSVSILPDCRTVVFNTAQINSQAIHSEMKYLNSESLTSDQQIGSSEVFQSD 415
Query: 473 LSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKT 532
W V EP+G+ N G+LE +N T D SDYLW+ I + D+ + +
Sbjct: 416 -------WSFVIEPVGISKSNAIRKTGLLEQINTTADVSDYLWYSISIAIDGDEP--FLS 466
Query: 533 NEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW--VKVV--QPVEFQSGYNDLILLSQTVG 588
N + + +S+ VL F+NG+L GS IG+ K++ + + G N + LLS TVG
Sbjct: 467 NGTQSNLHAESLGHVLHAFVNGKLAGSGIGNSGNAKIIFEKLIMLTPGNNSIDLLSATVG 526
Query: 589 LQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWT 648
LQNYGAF + GAG G VKL G +NG +DLS WTYQ+GLKGE ++ + ++W
Sbjct: 527 LQNYGAFFDLMGAGITGPVKLKG-QNGTLDLSSNAWTYQIGLKGEDLSLHENSGDVSQWI 585
Query: 649 DLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGC 707
+ WYKT F+APDG DPVA+D MGKG+AWVNG IGRYW T +P+ GC
Sbjct: 586 SESTLPKNQPLIWYKTTFNAPDGNDPVAIDFTGMGKGEAWVNGQSIGRYWPTYSSPQNGC 645
Query: 708 QDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRS 767
C+YRG Y++ KC NCG P+Q YHVPRS++Q+ +N LV+FEE GG+P +IS+ +
Sbjct: 646 STACNYRGPYSASKCIKNCGKPSQILYHVPRSFIQSESNTLVLFEEMGGDPTQISLATKQ 705
Query: 768 TRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQ-DGYIISSIEFASYGTP 826
+C VSESH PV W S GK K P + L C +ISSI+FAS+GTP
Sbjct: 706 MTSLCAHVSESHPAPVDTWL-SLQQKGK----KSGPTIQLECPYPNQVISSIKFASFGTP 760
Query: 827 QGRCQKFSRGNCHAPMSLSVVSE 849
G C F+ C + L+VV +
Sbjct: 761 SGMCGSFNHSQCSSASVLAVVQK 783
>gi|356543466|ref|XP_003540181.1| PREDICTED: beta-galactosidase 8-like isoform 2 [Glycine max]
Length = 848
Score = 873 bits (2255), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 445/847 (52%), Positives = 567/847 (66%), Gaps = 53/847 (6%)
Query: 20 PMMMMMMMIHLSCVSSSSASTFFKPF--NVSYDHRAIIIDGNRRMLISAGIHYPRATPEM 77
P +++++ L C+ + K F NV YDHRA++IDG RR+LIS IHYPR+TPEM
Sbjct: 3 PAQIVLVLFWLLCIHTP------KLFCANVEYDHRALVIDGKRRVLISGSIHYPRSTPEM 56
Query: 78 WPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGP 137
WPDLI KSK+GG DVIETYVFWN HE +RGQY+F G+ D+VKFVK V ++GLY+ LRIGP
Sbjct: 57 WPDLIQKSKDGGLDVIETYVFWNLHEPVRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGP 116
Query: 138 YVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIM 197
YVCAEWN+GGFPVWL IPGI+FRT+N PFK EM+RF KIVD++++E L++ QGGP+I+
Sbjct: 117 YVCAEWNYGGFPVWLHFIPGIKFRTDNEPFKAEMKRFTAKIVDMIKQEKLYASQGGPVIL 176
Query: 198 LQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCD 257
QIENEYGN++++YG GK Y+KWAA+MA L GVPWVMC Q DAP+ II+ NG+Y D
Sbjct: 177 SQIENEYGNIDTAYGAAGKSYIKWAATMATSLDTGVPWVMCLQADAPDPIINTWNGFYGD 236
Query: 258 GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGT 317
+ PNS KP +WTENW GW+ +GG +P+RPVEDLAFAVARFFQRGG+F NYYMY GGT
Sbjct: 237 EFTPNSNTKPKMWTENWSGWFLVFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGT 296
Query: 318 NFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKL 377
NF R SGGPF TSYDYDAPIDEYG++ +PKWGHLK++H AIKLCE AL+A D L
Sbjct: 297 NFDRASGGPFIATSYDYDAPIDEYGIIRQPKWGHLKEVHKAIKLCEEALIATDPT-ITSL 355
Query: 378 GQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVF 437
G N EA VY+ + S C+AFLAN+ + +V F G SY LP WSVSILPDC++ V
Sbjct: 356 GPNLEAAVYK-----TGSVCAAFLANVGTKSDVTVNFSGNSYHLPAWSVSILPDCKSVVL 410
Query: 438 NTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTV 497
NTAK++S ++I + ++ + S+ SST SW++ EP+G+ ++F+
Sbjct: 411 NTAKINSASAISSF--------TTESSKEDIGSSEASSTGWSWIS--EPVGISKTDSFSQ 460
Query: 498 QGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLT 557
G+LE +N T D SDYLW+ I D S + + I+S+ L FING+L
Sbjct: 461 TGLLEQINTTADKSDYLWYSLSIDYKADASS-------QTVLHIESLGHALHAFINGKLA 513
Query: 558 GS--------VIGHWVK----VVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRG 605
G +I + K V PV +G N + LLS TVGLQNYGAF + G G G
Sbjct: 514 GKYKLKHSQLIICNSGKYKFTVDIPVTLVAGKNTIDLLSLTVGLQNYGAFFDTWGVGITG 573
Query: 606 QVKLTGFKNGD-IDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKT 664
V L GF NG+ +DLS WTYQVGL+GE + S + +W + TWYKT
Sbjct: 574 PVILKGFANGNTLDLSSQKWTYQVGLQGEDLGLSS--GSSGQWNLQSTFPKNQPLTWYKT 631
Query: 665 YFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCT 723
F AP G DPVA+D MGKG+AWVNG IGRYW T VA C D+C+YRG Y++ KC
Sbjct: 632 TFSAPSGSDPVAIDFTGMGKGEAWVNGQRIGRYWPTYVASDASCTDSCNYRGPYSASKCR 691
Query: 724 TNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPV 783
NC P+QT YHVPRSWL+ S N+LV+FEE GG+P +IS + T +C VS+SH PPV
Sbjct: 692 KNCEKPSQTLYHVPRSWLKPSGNILVLFEERGGDPTQISFVTKQTESLCAHVSDSHPPPV 751
Query: 784 RKWSNSYSVDGKLSINKMAPEMHLHC-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPM 842
W NS + G+ K+ P + L C D +ISSI+FASYGTP G C F G C +
Sbjct: 752 DLW-NSETESGR----KVGPVLSLTCPHDNQVISSIKFASYGTPLGTCGNFYHGRCSSNK 806
Query: 843 SLSVVSE 849
+LS+V +
Sbjct: 807 ALSIVQK 813
>gi|357472237|ref|XP_003606403.1| Beta-galactosidase [Medicago truncatula]
gi|355507458|gb|AES88600.1| Beta-galactosidase [Medicago truncatula]
Length = 839
Score = 870 bits (2248), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 436/813 (53%), Positives = 555/813 (68%), Gaps = 42/813 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YDHRA++IDG RR+L+S IHYPR+TP+MWPDLI KSK+GG DVIETYVFWN HE +
Sbjct: 25 NVTYDHRALVIDGKRRVLMSGSIHYPRSTPQMWPDLIQKSKDGGIDVIETYVFWNLHEPV 84
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
RGQYNF+G+ D+V FVK V ++GLY+ LRIGPYVCAEWN+GGFP+WL I GI+FRTNN
Sbjct: 85 RGQYNFEGRGDLVGFVKAVAAAGLYVHLRIGPYVCAEWNYGGFPLWLHFIAGIKFRTNNE 144
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK EM+RF KIVD+M++E L++ QGGPII+ QIENEYGN+++ + K Y+ WAASM
Sbjct: 145 PFKAEMKRFTAKIVDMMKQENLYASQGGPIILSQIENEYGNIDTHDARAAKSYIDWAASM 204
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A L GVPW+MC+Q +AP+ II+ CN +YCD + PNS NKP +WTENW GW+ +GG +
Sbjct: 205 ATSLDTGVPWIMCQQANAPDPIINTCNSFYCDQFTPNSDNKPKMWTENWSGWFLAFGGAV 264
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RPVEDLAFAVARFFQRGG+F NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYG +
Sbjct: 265 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFGRTTGGPFISTSYDYDAPIDEYGDIR 324
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLKDLH AIKLCE AL+A+D G N E VY+ + + CSAFLANI
Sbjct: 325 QPKWGHLKDLHKAIKLCEEALIASDPT-ITSPGPNLETAVYK-----TGAVCSAFLANIG 378
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ A+VTF G SY LP WSVSILPDC+N V NTAKV++ + I S
Sbjct: 379 -MSDATVTFNGNSYHLPGWSVSILPDCKNVVLNTAKVNTASMIS------------SFAT 425
Query: 466 QSMIES--KLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
+S+ E L S+S W + EP+G+ + + FT G+LE +N T D SDYLW+ I
Sbjct: 426 ESLKEKVDSLDSSSSGWSWISEPVGISTPDAFTKSGLLEQINTTADRSDYLWYSLSIVYE 485
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYND 579
D+ +P + I+S+ L F+NG+L GS G V V P+ +G N
Sbjct: 486 DNAGD-------QPVLHIESLGHALHAFVNGKLAGSKAGSSGNAKVNVDIPITLVTGKNT 538
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG-DIDLSKILWTYQVGLKGEFQQIY 638
+ LLS TVGLQNYGAF + GAG G V L G KNG +DL+ WTYQVGL+GEF +
Sbjct: 539 IDLLSLTVGLQNYGAFYDTVGAGITGPVILKGLKNGSSVDLTSQQWTYQVGLQGEFVGLS 598
Query: 639 SIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
S N +W + TWYKT F AP G +PVA+D MGKG+AWVNG IGRYW
Sbjct: 599 S--GNVGQWNSQSNLPANQPLTWYKTNFVAPSGSNPVAIDFTGMGKGEAWVNGQSIGRYW 656
Query: 699 -TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
T ++P GC D+C+YRG Y++ KC NCG P+QT YHVPR+WL+ +N V+FEE+GG+
Sbjct: 657 PTYISPNSGCTDSCNYRGTYSASKCLKNCGKPSQTLYHVPRAWLKPDSNTFVLFEESGGD 716
Query: 758 PFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQ-DGYIIS 816
P +IS + VC V+ESH PPV W+++ S K+ P + L C IS
Sbjct: 717 PTKISFGTKQIESVCSHVTESHPPPVDTWNSNAE-----SERKVGPVLSLECPYPNQAIS 771
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
SI+FAS+GTP+G C ++ G+C + +LS+V +
Sbjct: 772 SIKFASFGTPRGTCGNYNHGSCSSNRALSIVQK 804
>gi|302759477|ref|XP_002963161.1| hypothetical protein SELMODRAFT_404798 [Selaginella moellendorffii]
gi|300168429|gb|EFJ35032.1| hypothetical protein SELMODRAFT_404798 [Selaginella moellendorffii]
Length = 874
Score = 866 bits (2237), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 436/842 (51%), Positives = 549/842 (65%), Gaps = 75/842 (8%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
N+SYDHRAIII G RR+LIS +HYPRA+P+MWP LI +KEGG D+I+TYVFW+ HE
Sbjct: 22 NISYDHRAIIIGGQRRILISGCLHYPRASPQMWPALIRNAKEGGLDMIDTYVFWDGHEPS 81
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G YNF+G+ D+++F+KLV +GLY+ LRIGPYVCAEWNFGGFP WL +PGI+FRT+N
Sbjct: 82 PGIYNFQGRYDLIRFLKLVHQAGLYVNLRIGPYVCAEWNFGGFPAWLLKLPGIQFRTHNR 141
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
F+++M+ FV+KIVD+++ E LF+ QGGP++ QIENEYGN++ SYG GK Y+ WAA M
Sbjct: 142 AFEDKMEEFVRKIVDMVKSEQLFASQGGPVLFSQIENEYGNVQGSYGTNGKTYMLWAARM 201
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A L GVPW+MCKQ DAP+ II+ CNGYYCDG+KPNS +KP +WTENW GWY WG
Sbjct: 202 AKDLETGVPWIMCKQPDAPDYIINTCNGYYCDGWKPNSRDKPAMWTENWSGWYQLWGEAA 261
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYM------------------YFGGTNFGRTSGGPF 327
P+R VED+AFAVARFFQRGG NYYM YFGGTNFGRTSGGPF
Sbjct: 262 PYRTVEDVAFAVARFFQRGGVAQNYYMVRMLHDLEQHLLMPERCQYFGGTNFGRTSGGPF 321
Query: 328 YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQE---AH 384
TSYDYDAP+DE+G+L +PKWGHLK+LHAA+KLCE AL + D Y LG+ QE AH
Sbjct: 322 ITTSYDYDAPLDEFGMLRQPKWGHLKELHAALKLCETALTSNDPL-YYTLGRMQEMVQAH 380
Query: 385 VY-----RANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNT 439
VY AN + C+AFLANID ++ASV F G Y LPPWSVSILPDCRN VFNT
Sbjct: 381 VYSDGSLEANFSNLATPCAAFLANIDT-SSASVKFGGNVYNLPPWSVSILPDCRNVVFNT 439
Query: 440 AKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSK------SWMTVKEPIGVWSEN 493
A+VS+QTS+ + ++V + S+IE S + +W +EP+G N
Sbjct: 440 AQVSAQTSVTKM---------VAVQKPSLIEEVSGSYTPGLVEQLAWEWFQEPVGGSGIN 490
Query: 494 NFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFIN 553
+LE ++ T D +DYLW+ T+ +SD ++ K + P + I SMRD++ +F+N
Sbjct: 491 KILAHALLEQISTTNDSTDYLWYSTRFEISDQEL---KGGD--PVLVITSMRDMVHIFVN 545
Query: 554 GQLTGSVI-----GHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVK 608
G+ GS G + +V QP+ ++G N L +LS TVGLQNYGA LE GAG G V
Sbjct: 546 GEFAGSTSTLKSGGLYARVQQPIHLKAGVNHLAILSATVGLQNYGAHLETHGAGITGSVW 605
Query: 609 LTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDA 668
+ G G +L+ LW +QVGL GE I W+ T WYK F+
Sbjct: 606 IQGLSTGTRNLTSALWLHQVGLNGEHDAI--------TWSSTTSLPFFQPLVWYKANFNI 657
Query: 669 PDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCG 727
PDG DPVA+ LGSMGKGQAWVNGH +GR+W + AP GC D CDYRG Y S KC + CG
Sbjct: 658 PDGDDPVAIHLGSMGKGQAWVNGHSLGRFWPAITAPSTGCSDRCDYRGTYYSSKCLSGCG 717
Query: 728 NPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWS 787
P+Q WYHVPR WL N LV+ EE GGN +S R VC QVSE PPV ++S
Sbjct: 718 LPSQEWYHVPREWLVNEKNTLVLLEEIGGNVSGVSFASRVVDRVCAQVSEYSLPPVAQFS 777
Query: 788 NSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
+ PE+ L C G ISSI FAS+G P+GRC F +G+CHA S ++V
Sbjct: 778 S-------------LPELGLSCSPGQFISSIFFASFGNPKGRCGAFQKGSCHALESETIV 824
Query: 848 SE 849
+
Sbjct: 825 EK 826
>gi|302799737|ref|XP_002981627.1| hypothetical protein SELMODRAFT_421090 [Selaginella moellendorffii]
gi|300150793|gb|EFJ17442.1| hypothetical protein SELMODRAFT_421090 [Selaginella moellendorffii]
Length = 874
Score = 865 bits (2236), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 435/842 (51%), Positives = 552/842 (65%), Gaps = 75/842 (8%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
N+SYDHRAIII G RR+LIS IHYPRA+P+MWP LI +KEGG D+I+TYVFW+ HE
Sbjct: 22 NISYDHRAIIIGGQRRILISGCIHYPRASPQMWPALIRNAKEGGLDMIDTYVFWDGHEPS 81
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G YNF+G+ D+++F+KLV +GLY+ LRIGPYVCAEWNFGGFP WL +PGI+FRT+N
Sbjct: 82 PGIYNFQGRYDLIRFLKLVHQAGLYVNLRIGPYVCAEWNFGGFPAWLLKLPGIQFRTHNR 141
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
F+++M+ FV+KIVD+++ E LF+ QGGP++ QIENEYGN++ SYG GK Y+ WAA M
Sbjct: 142 AFEDKMEEFVRKIVDMVKSEQLFASQGGPVLFSQIENEYGNVQGSYGINGKTYMLWAARM 201
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A L GVPW+MCKQ DAP+ II+ CNGYYCDG+KPNS +KP +WTENW GWY +WG
Sbjct: 202 AKDLETGVPWIMCKQPDAPDYIINTCNGYYCDGWKPNSRDKPAMWTENWSGWYQSWGEAA 261
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYM------------------YFGGTNFGRTSGGPF 327
P+R VED+AFAVARFFQRGG NYYM YFGGTNFGRTSGGPF
Sbjct: 262 PYRTVEDVAFAVARFFQRGGVAQNYYMVRTLHDLEQRLLMPERCQYFGGTNFGRTSGGPF 321
Query: 328 YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQE---AH 384
TSYDYDAP+DE+G+L +PKWGHLK+LHAA+KLCE AL + D Y LG+ QE AH
Sbjct: 322 ITTSYDYDAPLDEFGMLRQPKWGHLKELHAALKLCETALTSNDPV-YYTLGRMQEMVQAH 380
Query: 385 VY-----RANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNT 439
VY AN + C+AFLANID ++ASV F G+ Y LPPWSVSILPDCRN VFNT
Sbjct: 381 VYSDGSLEANFSNLATPCAAFLANIDT-SSASVKFGGKVYNLPPWSVSILPDCRNVVFNT 439
Query: 440 AKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSK------SWMTVKEPIGVWSEN 493
A+VS+QTS+ + ++V + S+IE S + +W +EP+G N
Sbjct: 440 AQVSAQTSVTKM---------VAVQKPSLIEEVSGSYTPGLVEQLAWEWFQEPVGGSGIN 490
Query: 494 NFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFIN 553
+LE ++ T D +DY+W+ T+ + D ++ K + P + I SMRD++ +F+N
Sbjct: 491 KILAHALLEQISTTNDSTDYMWYSTRFEILDQEL---KGGD--PVLVITSMRDMVHIFVN 545
Query: 554 GQLTGSVI-----GHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVK 608
G+ GS G + +V QP+ ++G N L +LS TVGLQNYGA LE GAG G +
Sbjct: 546 GEFAGSTSTLKSGGLYARVQQPIHLKAGVNHLAILSATVGLQNYGAHLETHGAGITGSIW 605
Query: 609 LTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDA 668
+ G G +L+ LW +QVGL GE I W+ T WYK F+
Sbjct: 606 IQGLSTGTRNLTSALWLHQVGLNGEHDAI--------TWSSTTSLPFFQPLVWYKANFNI 657
Query: 669 PDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVV-APKGGCQDTCDYRGAYNSDKCTTNCG 727
PDG DPVA+ LGSMGKGQAWVNGH +GR+W V+ AP GC D CDYRG Y S KC ++CG
Sbjct: 658 PDGDDPVAIHLGSMGKGQAWVNGHSLGRFWPVITAPSTGCSDRCDYRGTYYSSKCLSSCG 717
Query: 728 NPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWS 787
P+Q WYHVPR WL N LV+ EE GGN +S R VC QVSE PPV ++S
Sbjct: 718 LPSQEWYHVPREWLVNEKNTLVLLEEIGGNVSGVSFASRVVDRVCAQVSEYSLPPVAQFS 777
Query: 788 NSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
+ PE+ L C G ISSI FAS+G P+GRC F +G+CHA S ++V
Sbjct: 778 S-------------LPELGLSCSPGQFISSIFFASFGNPKGRCGAFQKGSCHALESETIV 824
Query: 848 SE 849
+
Sbjct: 825 EK 826
>gi|357453869|ref|XP_003597215.1| Beta-galactosidase [Medicago truncatula]
gi|355486263|gb|AES67466.1| Beta-galactosidase [Medicago truncatula]
Length = 866
Score = 860 bits (2223), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/835 (51%), Positives = 551/835 (65%), Gaps = 61/835 (7%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV YDHRA++IDG RR+LIS IHYPR+TP+MWPDLI KSK+GG DVIETYVFWN HE +
Sbjct: 21 NVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSKDGGLDVIETYVFWNLHEPV 80
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+GQY+F G+ D+VKFVK V +GLY+ LRIGPYVCAEWN+GGFP+WL IPGI+FRT+N
Sbjct: 81 KGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTDNE 140
Query: 166 PFK--EEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAA 223
PFK EM+RF KIVDLM++E L++ QGGPII+ QIENEYG+++S+YG GK Y+ WAA
Sbjct: 141 PFKVEAEMKRFTAKIVDLMKQEKLYASQGGPIILSQIENEYGDIDSAYGSAGKSYINWAA 200
Query: 224 SMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGG 283
MA L GVPWVMC+Q DAP++II+ CNG+YCD + PNS KP +WTENW WY +GG
Sbjct: 201 KMATSLDTGVPWVMCQQEDAPDSIINTCNGFYCDQFTPNSNTKPKMWTENWSAWYLLFGG 260
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYM---------------------YFGGTNFGRT 322
PHRPVEDLAFAVARFFQRGG+F NYYM Y GGTNF R+
Sbjct: 261 GFPHRPVEDLAFAVARFFQRGGTFQNYYMVLQPEMFFTSSIYYMVLFLRPYHGGTNFDRS 320
Query: 323 SGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQE 382
+GGPF TSYD+DAPIDEYG++ +PKWGHLKDLH A+KLCE AL+A + + LG N E
Sbjct: 321 TGGPFIATSYDFDAPIDEYGIIRQPKWGHLKDLHKAVKLCEEALIATE-PKITSLGPNLE 379
Query: 383 AHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKV 442
A VY+ + S C+AFLAN+D + +V F G SY LP WSVSILPDC+N V NTAK+
Sbjct: 380 AAVYK-----TGSVCAAFLANVDTKSDKTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKI 434
Query: 443 SSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILE 502
+S ++I N S L ++S W + EP+G+ ++ F+ G+LE
Sbjct: 435 NSASAIS----------NFVTKSSKEDISSLETSSSKWSWINEPVGISKDDIFSKTGLLE 484
Query: 503 HLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG 562
+N+T D SDYLW+ + + DD S + + I+S+ L F+NG+L GS G
Sbjct: 485 QINITADRSDYLWYSLSVDLKDDLGS-------QTVLHIESLGHALHAFVNGKLAGSHTG 537
Query: 563 HWVK----VVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGD-- 616
+ K V P++ G N + LLS TVGLQNYGAF ++ GAG G V L G KNG+
Sbjct: 538 NKDKPKLNVDIPIKVIYGNNQIDLLSLTVGLQNYGAFFDRWGAGITGPVTLKGLKNGNNT 597
Query: 617 IDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVA 676
+DLS WTYQVGLKGE + S + W + WYKT FDAP G +PVA
Sbjct: 598 LDLSSQKWTYQVGLKGEDLGLSS--GSSEGWNSQSTFPKNQPLIWYKTNFDAPSGSNPVA 655
Query: 677 LDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYH 735
+D MGKG+AWVNG IGRYW T VA C D+C+YRG + KC NCG P+QT YH
Sbjct: 656 IDFTGMGKGEAWVNGQSIGRYWPTYVASNADCTDSCNYRGPFTQTKCHMNCGKPSQTLYH 715
Query: 736 VPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGK 795
VPRS+L+ + N LV+FEE GG+P +I+ + +C VS+SH P + W+ +
Sbjct: 716 VPRSFLKPNGNTLVLFEENGGDPTQIAFATKQLESLCAHVSDSHPPQIDLWNQDTT---- 771
Query: 796 LSINKMAPEMHLHCQD-GYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
S K+ P + L+C + +I SI+FASYGTP G C F RG C + +LS+V +
Sbjct: 772 -SWGKVGPALLLNCPNHNQVIFSIKFASYGTPLGTCGNFYRGRCSSNKALSIVKK 825
>gi|302814772|ref|XP_002989069.1| hypothetical protein SELMODRAFT_269483 [Selaginella moellendorffii]
gi|300143170|gb|EFJ09863.1| hypothetical protein SELMODRAFT_269483 [Selaginella moellendorffii]
Length = 722
Score = 858 bits (2216), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 411/730 (56%), Positives = 514/730 (70%), Gaps = 38/730 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
V+YDHR +II+G RMLISA IHYPRA P+MW LI+ +K GG DVIETYVFW+ H+
Sbjct: 23 TVAYDHRGLIINGQHRMLISASIHYPRAAPQMWSQLISNAKAGGIDVIETYVFWDGHQPT 82
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
R YNF+G+ D+V FVKLV +GLY LRIGPYVCAEWN GGFPVWL+D+PGIEFRTNN
Sbjct: 83 RDTYNFEGRFDLVSFVKLVHEAGLYANLRIGPYVCAEWNLGGFPVWLKDVPGIEFRTNNQ 142
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK EMQ FV+KIV +M+ + LF+ QGGPII+ QIENEYGN++++YG GK+Y++WAA+M
Sbjct: 143 PFKAEMQAFVEKIVAMMKHDKLFAPQGGPIILAQIENEYGNIDAAYGAAGKEYMEWAANM 202
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A GLG GVPW+MC+Q+DAP+ I+D CNG+YCD + PN+ KP +WTENW GW+ WG
Sbjct: 203 AQGLGTGVPWIMCQQSDAPDYILDTCNGFYCDAWAPNNKKKPKMWTENWSGWFQKWGEAS 262
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRPVED+AFAVARFFQRGGSF NYYMYFGGTNFGR+SGGP+ TSYDYDAPIDE+G++
Sbjct: 263 PHRPVEDVAFAVARFFQRGGSFQNYYMYFGGTNFGRSSGGPYVTTSYDYDAPIDEFGVIR 322
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLK LHAAIKLCE AL + D YI LGQ QEAHVY + G+ C+AFLANID
Sbjct: 323 QPKWGHLKQLHAAIKLCEAALGSNDPT-YISLGQLQEAHVYGSTSSGA---CAAFLANID 378
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ A+V F ++Y LP WSVSILPDC+ NTAKV QT++ T+
Sbjct: 379 SSSDATVKFNSRTYLLPAWSVSILPDCKTVSHNTAKVHVQTAMPTM-------------- 424
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
K S T +W + EP+GVWS++ +LE +N TKD SDYLW+ T + +S
Sbjct: 425 ------KPSITGLAWESYPEPVGVWSDSGIVASALLEQINTTKDTSDYLWYTTSLDISQA 478
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGH----WVKVVQPVEFQSGYNDLI 581
D + K ++++SMRDV+ VF+NG+L GS + V QP+E SG+N L
Sbjct: 479 DAASGKA-----LLSLESMRDVVHVFVNGKLAGSASTKGTQLYAAVEQPIELASGHNSLA 533
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
+L TVGLQNYG F+E GAG G V + G +G IDL+ W +QVGLKGE I++
Sbjct: 534 ILCATVGLQNYGPFIETWGAGINGSVIVKGLPSGQIDLTAEEWIHQVGLKGESLAIFTES 593
Query: 642 ENE-AEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-T 699
++ W+ G WYK +FD+P G DPVALDL SMGKGQAW+NG IGR+W +
Sbjct: 594 GSQRVRWSSAVPQG--QALVWYKAHFDSPSGNDPVALDLESMGKGQAWINGQSIGRFWPS 651
Query: 700 VVAPK-GGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
+ AP GC TCDYRG+Y+S KC + CG P+Q WYHVPRSWLQ S NL+V+FEE GG P
Sbjct: 652 LRAPDTAGCPQTCDYRGSYSSSKCRSGCGQPSQRWYHVPRSWLQDSGNLVVLFEEEGGKP 711
Query: 759 FEISVKLRST 768
+S R+
Sbjct: 712 SGVSFVTRTV 721
>gi|114217395|dbj|BAF31233.1| beta-D-galactosidase [Persea americana]
Length = 849
Score = 855 bits (2208), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/809 (53%), Positives = 539/809 (66%), Gaps = 38/809 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYDH+AIII+G RR+LIS IHYPR+TPEMWPDLI K+K+GG DVI+TYVFWN HE
Sbjct: 38 SVSYDHKAIIINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPS 97
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+Y F+G+ D+VKF+KLV +GLY+ LRIGPY CAEWNFGGFPVWL+ IPGI FRT+N
Sbjct: 98 PGEYYFEGRYDLVKFIKLVKEAGLYVHLRIGPYACAEWNFGGFPVWLKYIPGISFRTDNE 157
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK M F KKIVD+M+EE LF QGGPII+ QIENEYG +E G G+ Y KWAA+M
Sbjct: 158 PFKTAMAGFTKKIVDMMKEEELFETQGGPIILSQIENEYGPVEWEIGAPGQAYTKWAANM 217
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GLG GVPWVMCKQ DAP+ II+ CN +YCD + PN KPT+WTE W W+T +GG +
Sbjct: 218 AVGLGTGVPWVMCKQDDAPDPIINTCNDHYCDWFSPNKNYKPTMWTEAWTSWFTAFGGPV 277
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RP ED+AFA+A+F QRGGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGL+
Sbjct: 278 PYRPAEDMAFAIAKFIQRGGSFINYYMYHGGTNFGRTAGGPFVATSYDYDAPIDEYGLIR 337
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLKDLH AIK+CE ALV+ D LG +QE+HV+++ +C+AFLAN D
Sbjct: 338 QPKWGHLKDLHKAIKMCEAALVSGDPI-VTSLGSSQESHVFKS----ESGDCAAFLANYD 392
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
E + A V F G Y LPPWS+SILPDC NTVFNTA+V +QTS
Sbjct: 393 EKSFAKVAFQGMHYNLPPWSISILPDCVNTVFNTARVGAQTS------------------ 434
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
SM + ++ SW T E + + + T++G+LE +NVT+D +DYLW+ T I + D
Sbjct: 435 -SMTMTSVNPDGFSWETYNEETASYDDASITMEGLLEQINVTRDVTDYLWYTTDITI-DP 492
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLI 581
+ F K E P +T+ S L +FING+L+G+V G + V+ +G N +
Sbjct: 493 NEGFLKNGEY-PVLTVMSAGHALHIFINGELSGTVYGSVDNPKLTYTGSVKLLAGNNKIS 551
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
+LS VGL N GA E G G V L G G DLS W+Y++GLKGE Q++S+
Sbjct: 552 VLSIAVGLPNIGAHFETWNTGVLGPVVLNGLNEGRRDLSWQNWSYKIGLKGEALQLHSLT 611
Query: 642 -ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ EW+ L P TWYKT F+AP+G P ALD+ MGKGQ W+NG IGRYW
Sbjct: 612 GSSSVEWSSLIAQKQP--LTWYKTTFNAPEGNGPFALDMSMMGKGQIWINGQSIGRYWPA 669
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
G C + C Y G YN KC NCG +Q WYHVP SWL + NLLV+FEE GG+P
Sbjct: 670 YKAYGNCGE-CSYTGRYNEKKCLANCGEASQRWYHVPSSWLYPTANLLVVFEEWGGDPTG 728
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEF 820
IS+ R+T C +SE H P +RKW + D + P+ HL C DG ISSI+F
Sbjct: 729 ISLVRRTTGSACAFISEWH-PTLRKW---HIKDYGRAERPRRPKAHLSCADGQKISSIKF 784
Query: 821 ASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
AS+GTPQG C F+ G+CHA S + +
Sbjct: 785 ASFGTPQGVCGNFTEGSCHAHKSYDIFEK 813
>gi|414865886|tpg|DAA44443.1| TPA: hypothetical protein ZEAMMB73_968467 [Zea mays]
Length = 830
Score = 853 bits (2204), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/810 (51%), Positives = 545/810 (67%), Gaps = 49/810 (6%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YDHRA++IDG RR+L+S IHYPR+TP+MWP LI K+K+GG DVIETYVFW+ HE +
Sbjct: 29 NVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKAKDGGLDVIETYVFWDIHEPV 88
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
RGQY+F+G+ D+ FVK V +GLY+ LRIGPYVCAEWN+GGFP+WL IPGI+FRT+N
Sbjct: 89 RGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTDNE 148
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK EMQRF KI ENEYGN++S+YG GK Y++WAA M
Sbjct: 149 PFKAEMQRFTAKI----------------------ENEYGNIDSAYGAPGKAYMRWAAGM 186
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ L GVPWVMC+Q DAP+ +I+ CNG+YCD + PNS KP +WTENW GW+ ++GG +
Sbjct: 187 AVSLDTGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGWFLSFGGAV 246
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RPVEDLAFAVARF+QRGG+F NYYMY GGTN R+SGGPF TSYDYDAPIDEYGL+
Sbjct: 247 PYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAPIDEYGLVR 306
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHL+D+H AIKLCEPAL+A D + Y LG N EA VY+ S C+AFLANID
Sbjct: 307 QPKWGHLRDVHKAIKLCEPALIATDPS-YTSLGPNVEAAVYKVG-----SVCAAFLANID 360
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ +VTF G+ Y LP WSVSILPDC+N V NTA+++SQT+ + + L +
Sbjct: 361 GQSDKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQINSQTTGSEMRY---LESSNVASD 417
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
S + +L+ + W EP+G+ +N T G++E +N T D SD+LW+ T I V D
Sbjct: 418 GSFVTPELAVS--DWSYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITVKGD 475
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVV----QPVEFQSGYNDLI 581
+ N + + ++S+ VL+V+ING++ GS G + +P+E G N +
Sbjct: 476 EPYL---NGSQSNLAVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKID 532
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS TVGL NYGAF + GAG G VKL+G NG +DLS WTYQ+GL+GE +Y
Sbjct: 533 LLSATVGLSNYGAFFDLVGAGITGPVKLSGL-NGALDLSSAEWTYQIGLRGEDLHLYDPS 591
Query: 642 ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TV 700
E EW I WYKT F P G DPVA+D MGKG+AWVNG IGRYW T
Sbjct: 592 EASPEWVSANAYPINHPLIWYKTKFTPPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTN 651
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
+AP+ GC ++C+YRGAY+S KC CG P+QT YHVPRS+LQ +N LV+FE GG+P +
Sbjct: 652 LAPQSGCVNSCNYRGAYSSSKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEHFGGDPSK 711
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHC-QDGYIISSIE 819
IS +R T VC QVSE+H + WS+ + + + P + L C ++G +ISS++
Sbjct: 712 ISFVMRQTGSVCAQVSEAHPAQIDSWSS------QQPMQRYGPALRLECPKEGQVISSVK 765
Query: 820 FASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
FAS+GTP G C +S G C + +LS+V E
Sbjct: 766 FASFGTPSGTCGSYSHGECSSTQALSIVQE 795
>gi|255546097|ref|XP_002514108.1| beta-galactosidase, putative [Ricinus communis]
gi|223546564|gb|EEF48062.1| beta-galactosidase, putative [Ricinus communis]
Length = 840
Score = 853 bits (2204), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/803 (53%), Positives = 537/803 (66%), Gaps = 40/803 (4%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYDHRAI I+G RR+LIS IHYPR+TPEMWPDLI K+K+GG DVI+TYVFWN HE
Sbjct: 30 VSYDHRAITINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPSP 89
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G Y F+ + D+VKF+K+V ++GLY+ LRIGPY+CAEWNFGGFPVWL+ +PGIEFRT+N P
Sbjct: 90 GNYYFEDRYDLVKFIKVVQAAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIEFRTDNGP 149
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ+F +KIV +M+ E LF QGGPII+ QIENE+G +E G GK Y KWAA MA
Sbjct: 150 FKAAMQKFTEKIVSMMKSEKLFESQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAADMA 209
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+ LG GVPWVMCKQ DAP+ +I+ CNG+YC+ +KPN KP LWTENW GWYT +GG +P
Sbjct: 210 VKLGTGVPWVMCKQDDAPDPVINTCNGFYCENFKPNKDYKPKLWTENWTGWYTEFGGAVP 269
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
+RP EDLAF+VARF Q GGSFMNYYMY GGTNFGRTS G F TSYDYDAP+DEYGL +
Sbjct: 270 YRPAEDLAFSVARFIQNGGSFMNYYMYHGGTNFGRTSAGLFIATSYDYDAPLDEYGLTRD 329
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHL+DLH AIKLCEPALV+ D LG NQEAHV++ S+S+C+AFLAN D
Sbjct: 330 PKWGHLRDLHKAIKLCEPALVSVDPT-VKSLGSNQEAHVFQ-----SKSSCAAFLANYDT 383
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSP-NISVPQ 465
+ VTF Y LPPWS+SILPDC+ VFNTA++ +Q+S + ++P ++
Sbjct: 384 KYSVKVTFGNGQYDLPPWSISILPDCKTAVFNTARLGAQSS------QMKMTPVGGALSW 437
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
QS IE + ++++ T++G+ E +NVT+D SDYLW++T + + D
Sbjct: 438 QSYIEEAATG--------------YTDDTTTLEGLWEQINVTRDASDYLWYMTNVNI-DS 482
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLI 581
D F K + P +TI S L VFINGQL G+V G + Q V+ +G N +
Sbjct: 483 DEGFLKNGD-SPVLTIFSAGHSLHVFINGQLAGTVYGSLENPKLTFSQNVKLTAGINKIS 541
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS VGL N G EK AG G V L G G DLS W+Y++GLKGE ++++
Sbjct: 542 LLSVAVGLPNVGVHFEKWNAGILGPVTLKGLNEGTRDLSGWKWSYKIGLKGEALSLHTVT 601
Query: 642 -ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ EW + + TWYK FDAP+G DPVALD+ SMGKGQ WVNG IGR+W
Sbjct: 602 GSSSVEWVEGSLSAKKQPLTWYKATFDAPEGNDPVALDMSSMGKGQIWVNGQSIGRHWPA 661
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
+G C C+Y G Y+ KC +NCG P+Q WYHVPRSWL S NLLV+FEE GG P
Sbjct: 662 YTARGSCS-ACNYAGTYDDKKCRSNCGEPSQRWYHVPRSWLNPSGNLLVVFEEWGGEPSG 720
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEF 820
IS+ R+T VC + E P ++ W G+L + + P+ HL C G IS I+F
Sbjct: 721 ISLVKRTTGSVCADIFEGQ-PALKNW--QMIALGRL--DHLQPKAHLWCPHGQKISKIKF 775
Query: 821 ASYGTPQGRCQKFSRGNCHAPMS 843
ASYG+PQG C F G+CHA S
Sbjct: 776 ASYGSPQGTCGSFKAGSCHAHKS 798
>gi|222616997|gb|EEE53129.1| hypothetical protein OsJ_35927 [Oryza sativa Japonica Group]
Length = 740
Score = 849 bits (2194), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/667 (61%), Positives = 483/667 (72%), Gaps = 45/667 (6%)
Query: 41 FFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWN 100
FF+PFNV+YDHRA++I G RRML+SAG+HYPRATPEMWP LIAK KEGGADVIETYVFWN
Sbjct: 58 FFEPFNVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGADVIETYVFWN 117
Query: 101 AHESIRGQYNFKGKNDIVKFVK--LVGSSGL----------------------------- 129
HE +GQY F+ + D+VKF K LV + L
Sbjct: 118 GHEPAKGQYYFEERFDLVKFAKIDLVKFAKLMWPSLIAKCKEGGADVIETYVFWNGHEPA 177
Query: 130 ----YLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREE 185
Y + R P + GFPVWLRDIPGIEFRT+N PFK EMQ FV KIV LM+EE
Sbjct: 178 KGQYYFEERFDPVKFEKHVIFGFPVWLRDIPGIEFRTDNEPFKAEMQTFVTKIVTLMKEE 237
Query: 186 MLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPE 245
L+SWQGGPII+ QIENEYGN++ +YGQ GK Y++WAA MA+GL G+PWVMC+QTDAPE
Sbjct: 238 KLYSWQGGPIILQQIENEYGNIQGNYGQAGKRYMQWAAQMAIGLDTGIPWVMCRQTDAPE 297
Query: 246 NIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGG 305
IID CN +YCDG+KPNSYNKPT+WTE+WDGWY WGG LPHRP ED AFAVARF+QRGG
Sbjct: 298 EIIDTCNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGGALPHRPAEDSAFAVARFYQRGG 357
Query: 306 SFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPA 365
S NYYMYFGGTNF RT+GGP ITSYDYDAPIDEYG+L +PKWGHLKDLH AIKLCEPA
Sbjct: 358 SLQNYYMYFGGTNFARTAGGPLQITSYDYDAPIDEYGILRQPKWGHLKDLHTAIKLCEPA 417
Query: 366 LVAAD-SAQYIKLGQNQEAHVYRANRY-------GSQSNCSAFLANIDEHTAASVTFLGQ 417
L+A D S QYIKLG QEAHVY G+ CSAFLANIDEH ASV G+
Sbjct: 418 LIAVDGSPQYIKLGSMQEAHVYSTGEVHTNGSMAGNAQICSAFLANIDEHKYASVWIFGK 477
Query: 418 SYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTS 477
SY+LPPWSVSILPDC N FNTA++ +QTS+ TVE P + P + S S
Sbjct: 478 SYSLPPWSVSILPDCENVAFNTARIGAQTSVFTVESGSPSRSSRHKPSILSLTSGGPYLS 537
Query: 478 KSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRP 537
+W T KE IG W NNF VQGILEHLNVTKD SDYLW+ T++ +SD D++FW + V P
Sbjct: 538 STWWTSKETIGTWGGNNFAVQGILEHLNVTKDISDYLWYTTRVNISDADVAFWSSKGVLP 597
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLE 597
++TID +RDV RVF+NG+L GS +GHWV + QP++ G N+L LLS+ VGLQNYGAFLE
Sbjct: 598 SLTIDKIRDVARVFVNGKLAGSQVGHWVSLKQPIQLVEGLNELTLLSEIVGLQNYGAFLE 657
Query: 598 KDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENE-AEWTDLTRDGIP 656
KDGAGFRGQV LTG +GD+DL+ LWTYQVGLKGEF IY+ E+ A W+ + +D +
Sbjct: 658 KDGAGFRGQVTLTGLSDGDVDLTNSLWTYQVGLKGEFSMIYAPEKQGCAGWSRMQKDSV- 716
Query: 657 STFTWYK 663
FTWYK
Sbjct: 717 QPFTWYK 723
>gi|183238710|gb|ACC60981.1| beta-galactosidase 1 precursor [Petunia x hybrida]
Length = 842
Score = 848 bits (2191), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 421/825 (51%), Positives = 549/825 (66%), Gaps = 44/825 (5%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
M +++++ SCV S AS VSYDH+AII++G RR+LIS IHYPR+TPEMWPD
Sbjct: 12 MWNVLLVLLSSCVFSGLAS-------VSYDHKAIIVNGQRRILISGSIHYPRSTPEMWPD 64
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
LI K+KEGG DVI+TYVFWN HE +G+Y F+ + D+VKF+KLV +GLY+ LR+GPY C
Sbjct: 65 LIQKAKEGGVDVIQTYVFWNGHEPEQGKYYFEERYDLVKFIKLVHQAGLYVNLRVGPYAC 124
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEWNFGGFPVWL+ +PGI FRT+N PFK MQ+F KIV++M+ E L+ QGGPII+ QI
Sbjct: 125 AEWNFGGFPVWLKYVPGISFRTDNEPFKAAMQKFTTKIVNMMKAERLYESQGGPIILSQI 184
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYK 260
ENEYG +E +G+QGK Y +WAA MAL LG GVPW+MCKQ DAP+ +I+ CNG+YCD +
Sbjct: 185 ENEYGPLEVRFGEQGKSYAEWAAKMALDLGTGVPWLMCKQDDAPDPVINTCNGFYCDYFY 244
Query: 261 PNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG 320
PN KP +WTE W W+T +G +P+RPVEDLAF VA F Q GGSF+NYYMY GGTNFG
Sbjct: 245 PNKAYKPKIWTEAWTAWFTEFGSPVPYRPVEDLAFGVANFIQTGGSFINYYMYHGGTNFG 304
Query: 321 RTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQN 380
RT+GGPF TSYDYDAP+DE+GLL +PKWGHLKDLH AIKLCEPALV+ D LG
Sbjct: 305 RTAGGPFVATSYDYDAPLDEFGLLRQPKWGHLKDLHRAIKLCEPALVSGDPT-VTALGNY 363
Query: 381 QEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTA 440
Q+AHV+R+ + C+AFLAN D ++ A+V F + Y LPPWS+SILPDC++TV+NTA
Sbjct: 364 QKAHVFRS----TSGACAAFLANNDPNSFATVAFGNKHYNLPPWSISILPDCKHTVYNTA 419
Query: 441 KVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGI 500
+V +Q+++ + ++P ++ SW + + + +N FTV G+
Sbjct: 420 RVGAQSAL------MKMTP--------------ANEGYSWQSYNDQTAFYDDNAFTVVGL 459
Query: 501 LEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSV 560
LE LN T+D SDYLW++T + + D F ++ P +T+ S D L VF+NGQL G+V
Sbjct: 460 LEQLNTTRDVSDYLWYMTDVKI-DPSEGFLRSGN-WPWLTVSSAGDALHVFVNGQLAGTV 517
Query: 561 IGHWVK----VVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGD 616
G K + V ++G N + LLS VGL N G E G G V L+G G
Sbjct: 518 YGSLKKQKITFSKAVNLRAGVNKISLLSIAVGLPNIGPHFETWNTGVLGPVSLSGLDEGK 577
Query: 617 IDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPV 675
DL+ W+Y+VGLKGE ++S+ + EW + + TWYKT F+AP G +P+
Sbjct: 578 RDLTWQKWSYKVGLKGEALNLHSLSGSSSVEWVEGSLVAQRQPLTWYKTTFNAPAGNEPL 637
Query: 676 ALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYH 735
ALD+ SMGKGQ W+NG IGRYW G C D C+Y G +N KC +NCG+ +Q WYH
Sbjct: 638 ALDMNSMGKGQVWINGQSIGRYWPGYKASGTC-DACNYAGPFNEKKCLSNCGDASQRWYH 696
Query: 736 VPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGK 795
VPRSWL + NLLV+FEE GG+P IS+ R VC ++E P + W GK
Sbjct: 697 VPRSWLHPTGNLLVVFEEWGGDPNGISLVKRELASVCADINEWQ-PQLVNW--QLQASGK 753
Query: 796 LSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHA 840
+ + P+ HL C G I+SI+FAS+GTPQG C FS G+CHA
Sbjct: 754 VD-KPLRPKAHLSCTSGQKITSIKFASFGTPQGVCGSFSEGSCHA 797
>gi|297738667|emb|CBI27912.3| unnamed protein product [Vitis vinifera]
Length = 833
Score = 848 bits (2191), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/823 (51%), Positives = 546/823 (66%), Gaps = 45/823 (5%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
+++ ++ S VS SAS V+YD R+ II+G R++LIS IHYPR+TPEMWPDLI
Sbjct: 6 LVVFILIFSWVSHGSAS-------VTYDKRSFIINGQRKILISGSIHYPRSTPEMWPDLI 58
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K+K+GG DVI+TYVFWN HE RG+Y F+G+ D+V+F+K+V ++GLY+ LRIGPY+CAE
Sbjct: 59 QKAKDGGLDVIQTYVFWNGHEPSRGKYYFEGRYDLVRFIKVVQAAGLYVHLRIGPYICAE 118
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
WNFGGFPVWL+ +PGI FRT+N PFK MQ F +KIVD+M+ E LF QGGPIIM QIEN
Sbjct: 119 WNFGGFPVWLKYVPGIAFRTDNGPFKVAMQGFTQKIVDMMKSEKLFQPQGGPIIMSQIEN 178
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPN 262
EYG +E G GK Y KWAA MA+ LG GVPWVMCKQ DAP+ +IDACNG+YC+ + PN
Sbjct: 179 EYGPVEYEIGAPGKAYTKWAAEMAVQLGTGVPWVMCKQEDAPDPVIDACNGFYCENFFPN 238
Query: 263 SYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRT 322
KP ++TE W GWYT +GG +P+RP EDLA++VARF Q GSF+NYYMY GGTNFGRT
Sbjct: 239 KDYKPKMFTEAWTGWYTEFGGAIPNRPAEDLAYSVARFIQNRGSFINYYMYHGGTNFGRT 298
Query: 323 SGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQE 382
+GGPF TSYDYDAPIDEYGL SEPKWGHL+DLH AIKLCEPALV+AD LG N E
Sbjct: 299 AGGPFISTSYDYDAPIDEYGLPSEPKWGHLRDLHKAIKLCEPALVSADPT-VTYLGTNLE 357
Query: 383 AHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKV 442
AHVY+A C+AFLAN D ++A VTF Y LPPWSVSILPDC+N VFNTA++
Sbjct: 358 AHVYKAK----SGACAAFLANYDPKSSAKVTFGNTQYDLPPWSVSILPDCKNVVFNTARI 413
Query: 443 SSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILE 502
+Q+S + ++P + QS E S+ ++E+ T+ G+LE
Sbjct: 414 GAQSS------QMKMNPVSTFSWQSYNEETASA--------------YTEDTTTMDGLLE 453
Query: 503 HLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG 562
+N+T+D +DYLW++T++++ D+ F KT + P +T+ S L VFINGQL+G+V G
Sbjct: 454 QINITRDTTDYLWYMTEVHIKPDE-GFLKTGQY-PVLTVMSAGHALHVFINGQLSGTVYG 511
Query: 563 HW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDID 618
V V+ G N + LLS +GL N G E AG G V L G G +D
Sbjct: 512 ELSNPKVTFSDNVKLTVGTNKISLLSVAMGLPNVGLHFETWNAGVLGPVTLKGLNEGTVD 571
Query: 619 LSKILWTYQVGLKGEFQQIYSIEENEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVAL 677
+S W+Y++GLKGE + +I + + EW + + TWYKT F+AP G DP+AL
Sbjct: 572 MSSWKWSYKIGLKGEALNLQAITGSSSDEWVEGSLLAQKQPLTWYKTTFNAPGGNDPLAL 631
Query: 678 DLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVP 737
D+ SMGKGQ W+NG IGR+W G C + C+Y G +N KC T CG P+Q WYHVP
Sbjct: 632 DMSSMGKGQIWINGESIGRHWPAYTAHGNC-NGCNYAGIFNDKKCQTGCGGPSQRWYHVP 690
Query: 738 RSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLS 797
RSWL+ S N L++FEE GGNP I++ R+ VC + E P K S + G
Sbjct: 691 RSWLKPSGNQLIVFEELGGNPAGITLVKRTMDRVCADIFEGQ--PSLKNSQ---IIGSSK 745
Query: 798 INKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHA 840
+N + + HL C G IS I+FAS+G PQG C F G+CHA
Sbjct: 746 VNSLQSKAHLWCAPGLKISKIQFASFGVPQGTCGSFREGSCHA 788
>gi|225444920|ref|XP_002282132.1| PREDICTED: beta-galactosidase [Vitis vinifera]
Length = 836
Score = 847 bits (2189), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/823 (51%), Positives = 546/823 (66%), Gaps = 45/823 (5%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
+++ ++ S VS SAS V+YD R+ II+G R++LIS IHYPR+TPEMWPDLI
Sbjct: 9 LVVFILIFSWVSHGSAS-------VTYDKRSFIINGQRKILISGSIHYPRSTPEMWPDLI 61
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K+K+GG DVI+TYVFWN HE RG+Y F+G+ D+V+F+K+V ++GLY+ LRIGPY+CAE
Sbjct: 62 QKAKDGGLDVIQTYVFWNGHEPSRGKYYFEGRYDLVRFIKVVQAAGLYVHLRIGPYICAE 121
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
WNFGGFPVWL+ +PGI FRT+N PFK MQ F +KIVD+M+ E LF QGGPIIM QIEN
Sbjct: 122 WNFGGFPVWLKYVPGIAFRTDNGPFKVAMQGFTQKIVDMMKSEKLFQPQGGPIIMSQIEN 181
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPN 262
EYG +E G GK Y KWAA MA+ LG GVPWVMCKQ DAP+ +IDACNG+YC+ + PN
Sbjct: 182 EYGPVEYEIGAPGKAYTKWAAEMAVQLGTGVPWVMCKQEDAPDPVIDACNGFYCENFFPN 241
Query: 263 SYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRT 322
KP ++TE W GWYT +GG +P+RP EDLA++VARF Q GSF+NYYMY GGTNFGRT
Sbjct: 242 KDYKPKMFTEAWTGWYTEFGGAIPNRPAEDLAYSVARFIQNRGSFINYYMYHGGTNFGRT 301
Query: 323 SGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQE 382
+GGPF TSYDYDAPIDEYGL SEPKWGHL+DLH AIKLCEPALV+AD LG N E
Sbjct: 302 AGGPFISTSYDYDAPIDEYGLPSEPKWGHLRDLHKAIKLCEPALVSADPT-VTYLGTNLE 360
Query: 383 AHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKV 442
AHVY+A C+AFLAN D ++A VTF Y LPPWSVSILPDC+N VFNTA++
Sbjct: 361 AHVYKAK----SGACAAFLANYDPKSSAKVTFGNTQYDLPPWSVSILPDCKNVVFNTARI 416
Query: 443 SSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILE 502
+Q+S + ++P + QS E S+ ++E+ T+ G+LE
Sbjct: 417 GAQSS------QMKMNPVSTFSWQSYNEETASA--------------YTEDTTTMDGLLE 456
Query: 503 HLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG 562
+N+T+D +DYLW++T++++ D+ F KT + P +T+ S L VFINGQL+G+V G
Sbjct: 457 QINITRDTTDYLWYMTEVHIKPDE-GFLKTGQY-PVLTVMSAGHALHVFINGQLSGTVYG 514
Query: 563 HW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDID 618
V V+ G N + LLS +GL N G E AG G V L G G +D
Sbjct: 515 ELSNPKVTFSDNVKLTVGTNKISLLSVAMGLPNVGLHFETWNAGVLGPVTLKGLNEGTVD 574
Query: 619 LSKILWTYQVGLKGEFQQIYSIEENEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVAL 677
+S W+Y++GLKGE + +I + + EW + + TWYKT F+AP G DP+AL
Sbjct: 575 MSSWKWSYKIGLKGEALNLQAITGSSSDEWVEGSLLAQKQPLTWYKTTFNAPGGNDPLAL 634
Query: 678 DLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVP 737
D+ SMGKGQ W+NG IGR+W G C + C+Y G +N KC T CG P+Q WYHVP
Sbjct: 635 DMSSMGKGQIWINGESIGRHWPAYTAHGNC-NGCNYAGIFNDKKCQTGCGGPSQRWYHVP 693
Query: 738 RSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLS 797
RSWL+ S N L++FEE GGNP I++ R+ VC + E P K S + G
Sbjct: 694 RSWLKPSGNQLIVFEELGGNPAGITLVKRTMDRVCADIFEGQ--PSLKNSQ---IIGSSK 748
Query: 798 INKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHA 840
+N + + HL C G IS I+FAS+G PQG C F G+CHA
Sbjct: 749 VNSLQSKAHLWCAPGLKISKIQFASFGVPQGTCGSFREGSCHA 791
>gi|224087947|ref|XP_002308268.1| predicted protein [Populus trichocarpa]
gi|222854244|gb|EEE91791.1| predicted protein [Populus trichocarpa]
Length = 838
Score = 847 bits (2189), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/804 (52%), Positives = 534/804 (66%), Gaps = 40/804 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYDH+A+II+G RR+LIS IHYPR+TPEMWPDLI K+K+GG DVI+TYVFWN HE
Sbjct: 27 SVSYDHKAVIINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGVDVIQTYVFWNGHEPS 86
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G Y F+ + D+VKF+KLV +GLYL LRIGPY+CAEWNFGGFPVWL+ +PGIEFRT+N
Sbjct: 87 PGNYYFEDRYDLVKFIKLVQQAGLYLHLRIGPYICAEWNFGGFPVWLKYVPGIEFRTDNG 146
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ+F +KIV +M+ E LF QGGPII+ QIENEYG +E G GK Y KWAA M
Sbjct: 147 PFKAAMQKFTEKIVGMMKSEKLFENQGGPIILSQIENEYGPVEWEIGAPGKAYTKWAADM 206
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ LG GVPW+MCKQ DAP+ +ID CNG+YC+ +KPN KP +WTE W GWYT +GG +
Sbjct: 207 AVKLGTGVPWIMCKQEDAPDPMIDTCNGFYCENFKPNKDYKPKIWTEAWTGWYTEFGGAV 266
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRP ED+AF+VARF Q GGS++NYYMY GGTNFGRT+GGPF TSYDYDAP+DE+GL
Sbjct: 267 PHRPAEDMAFSVARFIQNGGSYINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEFGLPR 326
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPKWGHL+DLH AIKLCEPALV+ D LG NQEAHV++ S+S C+AFLAN D
Sbjct: 327 EPKWGHLRDLHKAIKLCEPALVSVDPT-VTSLGSNQEAHVFK-----SKSVCAAFLANYD 380
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ VTF Y LPPWSVSILPDC+ V+NTA++ SQ+S Q
Sbjct: 381 TKYSVKVTFGNGQYELPPWSVSILPDCKTAVYNTARLGSQSS-----------------Q 423
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENN-FTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
M+ +S+S SW + E +++ T+ G+ E +NVT+D +DYLW++T + + D
Sbjct: 424 MKMVP---ASSSFSWQSYNEETASADDDDTTTMNGLWEQINVTRDATDYLWYLTDVKI-D 479
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDL 580
D F K+ + P +TI S L VFINGQL G+ G + Q ++ G N +
Sbjct: 480 ADEGFLKSGQ-NPLLTIFSAGHALHVFINGQLAGTAYGGLSNPKLTFSQNIKLTEGINKI 538
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS VGL N G E AG G + L G G DLS W+Y++GLKGE +++
Sbjct: 539 SLLSVAVGLPNVGLHFETWNAGVLGPITLKGLNEGTRDLSGQKWSYKIGLKGESLSLHTA 598
Query: 641 EENEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
+E+ EW + + TWYKT FDAP G DP+ALD+ SMGKGQ W+NG +IGR+W
Sbjct: 599 SGSESVEWVEGSLLAQKQALTWYKTAFDAPQGNDPLALDMSSMGKGQMWINGQNIGRHWP 658
Query: 700 VVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPF 759
G C D C+Y G ++ KC TNCG P+Q WYHVPRSWL+ S NLL +FEE GG+P
Sbjct: 659 GYIAHGSCGD-CNYAGTFDDKKCRTNCGEPSQRWYHVPRSWLKPSGNLLAVFEEWGGDPT 717
Query: 760 EISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIE 819
IS R+T VC + E P ++ W S GK + P+ HL C G IS I+
Sbjct: 718 GISFVKRTTASVCADIFEGQ-PALKNWQAIAS--GK--VISPQPKAHLWCPTGQKISQIK 772
Query: 820 FASYGTPQGRCQKFSRGNCHAPMS 843
FAS+G PQG C F G+CHA S
Sbjct: 773 FASFGMPQGTCGSFREGSCHAHKS 796
>gi|168008096|ref|XP_001756743.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691981|gb|EDQ78340.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 836
Score = 847 bits (2187), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/806 (50%), Positives = 545/806 (67%), Gaps = 36/806 (4%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYDHRA+ +DG RRML+S IHYPR+TP MWP LIAK+KEGG DVI+TYVFWN HE R
Sbjct: 28 VSYDHRALKLDGQRRMLVSGSIHYPRSTPLMWPGLIAKAKEGGLDVIQTYVFWNGHEPTR 87
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G YN+ G+ ++ KF++LV +G+Y+ LRIGPYVCAEWN GGFP WLR IPGIEFRT+N P
Sbjct: 88 GVYNYAGRYNLPKFIRLVYEAGMYVNLRIGPYVCAEWNSGGFPAWLRFIPGIEFRTDNEP 147
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK E QRFV +V ++ E LF+WQGGPIIM QIENEYGN+++SYG+ G+ Y+ W A+MA
Sbjct: 148 FKNETQRFVNHLVRKLKREKLFAWQGGPIIMAQIENEYGNIDASYGEAGQRYLNWIANMA 207
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+ VPW+MC+Q +AP+ +I+ CNG+YCDG++PNS +KP WTENW GW+ +WGG P
Sbjct: 208 VATNTSVPWIMCQQPEAPQLVINTCNGFYCDGWRPNSEDKPAFWTENWTGWFQSWGGGAP 267
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
RPV+D+AF+VARFF++GGSFMNYYMY GGTNF RT G TSYDYDAPIDEY + +
Sbjct: 268 TRPVQDIAFSVARFFEKGGSFMNYYMYHGGTNFERT-GVESVTTSYDYDAPIDEYD-VRQ 325
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQY-IKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
PKWGHLKDLHAA+KLCEPALV D+ I LG NQEAHVY++ S C+AFLA+ D
Sbjct: 326 PKWGHLKDLHAALKLCEPALVEVDTVPTGISLGPNQEAHVYQS----SSGTCAAFLASWD 381
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ + VTF GQ Y LP WSVSILPDC++ VFNTAKV +Q+ I T++ ++P++
Sbjct: 382 TNDSL-VTFQGQPYDLPAWSVSILPDCKSVVFNTAKVGAQSVIMTMQGAVPVT------- 433
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
+W++ EP+G W + F+ G+LE + TKD +DYLW++T + V++
Sbjct: 434 -------------NWVSYHEPLGPWG-SVFSTNGLLEQIATTKDTTDYLWYMTNVQVAES 479
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQ 585
D+ + + T+ + S+RD F+NG TG+ ++ QP+ + G N++ +LS
Sbjct: 480 DV---RNISAQATLVMSSLRDAAHTFVNGFYTGTSHQQFMHARQPISLRPGSNNITVLSM 536
Query: 586 TVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEEN-E 644
T+GLQ YG FLE + AG + V++ +G I+L WTYQVGL+GE +Q++ + +
Sbjct: 537 TMGLQGYGPFLENEKAGIQYGVRIEDLPSGTIELGGSTWTYQVGLQGESKQLFEVNGSLT 596
Query: 645 AEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TVVAP 703
AEW ++ + W KT FD P G +ALDL SMGKG WVNG ++GRYW + A
Sbjct: 597 AEWNTISEVSDQNFLFWIKTRFDMPAGNGSIALDLSSMGKGVVWVNGVNLGRYWSSFTAQ 656
Query: 704 KGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
+ GC +CDYRG+Y KC T C P+Q WYH+PR WL NN +V+FEE GGNP +IS+
Sbjct: 657 RDGCDASCDYRGSYTQSKCLTKCNQPSQNWYHIPRQWLLPKNNFIVLFEEKGGNPKDISI 716
Query: 764 KLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASY 823
R + +C +S+SH P S++ L+ + + L C +G IS I FASY
Sbjct: 717 ATRMPQQICSHISQSHPFPFSL--TSWTKRDNLTSTLLRAPLTLECAEGQQISRICFASY 774
Query: 824 GTPQGRCQKFSRGNCHAPMSLSVVSE 849
GTP G C+ F +CHA S V+++
Sbjct: 775 GTPSGDCEGFVLSSCHANTSYDVLTK 800
>gi|359482511|ref|XP_002279310.2| PREDICTED: beta-galactosidase-like [Vitis vinifera]
Length = 828
Score = 844 bits (2181), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 419/808 (51%), Positives = 536/808 (66%), Gaps = 39/808 (4%)
Query: 42 FKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNA 101
F+ +NVSYD RAI+I+G RR+LIS IHYPR++PEMWPDLI K+KEGG DVI+TYVFWN
Sbjct: 12 FQAWNVSYDRRAIVINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNG 71
Query: 102 HESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFR 161
HE +G+Y F+G+ D+V+F+KLV +GLY+ LRIGPYVCAEWNFGGFPVWL+ + GI FR
Sbjct: 72 HEPSQGKYYFEGRYDLVRFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVQGINFR 131
Query: 162 TNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKW 221
TNN PFK MQRF KKIVD+M+ E LF QGGPII+ QIENEYG ME G G+ Y +W
Sbjct: 132 TNNEPFKWHMQRFTKKIVDMMKSEGLFESQGGPIILSQIENEYGPMEYEIGAPGRAYTEW 191
Query: 222 AASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTW 281
AA MA+GLG GVPWVMCKQ DAP+ II+ CNG+YCD + PN KP +WTE W GW+T +
Sbjct: 192 AAKMAVGLGTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEF 251
Query: 282 GGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEY 341
GG +PHRP EDLAF+VARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DE+
Sbjct: 252 GGAVPHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEF 311
Query: 342 GLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFL 401
GLL +PKWGHLKDLH AIKLCEPAL++ D LG +EAHV+ + C+AFL
Sbjct: 312 GLLRQPKWGHLKDLHRAIKLCEPALISGDPT-VTSLGNYEEAHVFHSK----SGACAAFL 366
Query: 402 ANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNI 461
AN + + A V+F Y LPPWS+SILPDC+NTV+NTA++ +Q++ ++ ++P
Sbjct: 367 ANYNPRSYAKVSFRNMHYNLPPWSISILPDCKNTVYNTARLGAQSA------TMKMTP-- 418
Query: 462 SVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIY 521
S W + E + +++F G+LE +N T+D SDYLW+ T +
Sbjct: 419 ------------VSGRFGWQSYNEETASYDDSSFAAVGLLEQINTTRDVSDYLWYSTDVK 466
Query: 522 VSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGY 577
+ ++ F K+ P +T+ S L VFING+L+G+ G + Q V+ ++G
Sbjct: 467 IGYNE-GFLKSGRY-PVLTVLSAGHALHVFINGRLSGTAYGSLENPKLTFSQGVKLRAGV 524
Query: 578 NDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE-FQQ 636
N + LLS VGL N G E AG G V L G G DLS W+Y+VGLKGE
Sbjct: 525 NTIALLSIAVGLPNVGPHFETWNAGVLGPVSLNGLNEGRRDLSWQKWSYKVGLKGEALSL 584
Query: 637 IYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGR 696
+ EW + + TWYKT F+AP G P+ALD+GSMGKGQ W+NG ++GR
Sbjct: 585 HSLSGSSSVEWVEGSLMARGQPLTWYKTTFNAPGGNTPLALDMGSMGKGQIWINGQNVGR 644
Query: 697 YWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG 756
YW GGC D C+Y G Y+ KC +NCG P+Q WYHVP SWL + NLLV+FEE+GG
Sbjct: 645 YWPAYKATGGCGD-CNYAGTYSEKKCLSNCGEPSQRWYHVPHSWLSPTGNLLVVFEESGG 703
Query: 757 NPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINK-MAPEMHLHCQDGYII 815
NP IS+ R VC + E + P +Y + +NK + P+ HL C G I
Sbjct: 704 NPAGISLVEREIESVCADIYE--WQPTLM---NYEMQASGKVNKPLRPKAHLWCAPGQKI 758
Query: 816 SSIEFASYGTPQGRCQKFSRGNCHAPMS 843
SSI+FAS+GTP+G C + G+CHA S
Sbjct: 759 SSIKFASFGTPEGVCGSYREGSCHAHKS 786
>gi|118488890|gb|ABK96254.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 846
Score = 842 bits (2176), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/801 (52%), Positives = 533/801 (66%), Gaps = 37/801 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYD +AI I+G RR+LIS IHYPR++PEMWPDLI K+KEGG DVI+TYVFWN HE
Sbjct: 32 SVSYDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPS 91
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+Y F+G D+VKFVKL +GLY+ LRIGPY+CAEWNFGGFPVWL+ IPGI FRT+N
Sbjct: 92 PGKYYFEGNYDLVKFVKLAKEAGLYVHLRIGPYICAEWNFGGFPVWLKYIPGINFRTDNG 151
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK +MQ+F KIV++M+ E LF QGGPII+ QIENEYG ME G GK Y KWAA M
Sbjct: 152 PFKAQMQKFTTKIVNMMKAERLFETQGGPIILSQIENEYGPMEYEIGSPGKAYTKWAAEM 211
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL GVPWVMCKQ DAP+ II+ CNG+YCD + PN KP +WTE W GW+T +GG +
Sbjct: 212 AVGLRTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTQFGGPV 271
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRP ED+AF+VARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGLL
Sbjct: 272 PHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLLR 331
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLKDLH AIKLCEPALV+ D A I LG QEAHV+ G C+AFLAN
Sbjct: 332 QPKWGHLKDLHRAIKLCEPALVSGD-ATVIPLGNYQEAHVFNYKAGG----CAAFLANYH 386
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ + A V+F Y LPPWS+SILPDC+NTV+NTA+V +Q++ + ++P VP
Sbjct: 387 QRSFAKVSFRNMHYNLPPWSISILPDCKNTVYNTARVGAQSA------RMKMTP---VPM 437
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
SW E ++ FT+ G+LE +N T+D SDYLW++T +++ D
Sbjct: 438 HGGF---------SWQAYNEEPSASGDSTFTMVGLLEQINTTRDVSDYLWYMTDVHI-DP 487
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLI 581
F ++ + P + + S L VFINGQL+G+ G + Q V+ ++G N +
Sbjct: 488 SEGFLRSGKY-PVLGVLSAGHALHVFINGQLSGTAYGSLDFPKLTFTQGVKLRAGVNKIS 546
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS VGL N G E AG G V L G G DLS W+Y++GL GE ++SI
Sbjct: 547 LLSIAVGLPNVGPHFETWNAGILGPVTLNGLNEGRRDLSWQKWSYKIGLHGEALGLHSIS 606
Query: 642 -ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ EW + + +WYKT F+AP G P+ALD+GSMGKGQ W+NG H+GR+W
Sbjct: 607 GSSSVEWAEGSLVAQRQPLSWYKTTFNAPAGNSPLALDMGSMGKGQIWINGQHVGRHWPA 666
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
G C D C Y G YN KC+TNCG +Q WYHVP+SWL+ + NLLV+FEE GG+P
Sbjct: 667 YKASGTCGD-CSYIGTYNEKKCSTNCGEASQRWYHVPQSWLKPTGNLLVVFEEWGGDPNG 725
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINK-MAPEMHLHCQDGYIISSIE 819
IS+ R VC + E + P +Y + +NK + P+ HL C G I SI+
Sbjct: 726 ISLVRRDVDSVCADIYE--WQPTLM---NYQMQASGKVNKPLRPKAHLSCGPGQKIRSIK 780
Query: 820 FASYGTPQGRCQKFSRGNCHA 840
FAS+GTP+G C + +G+CHA
Sbjct: 781 FASFGTPEGVCGSYRQGSCHA 801
>gi|224134551|ref|XP_002327432.1| predicted protein [Populus trichocarpa]
gi|222835986|gb|EEE74407.1| predicted protein [Populus trichocarpa]
Length = 839
Score = 842 bits (2174), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 419/801 (52%), Positives = 533/801 (66%), Gaps = 37/801 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYD +AI I+G RR+LIS IHYPR++PEMWPDLI K+KEGG DVI+TYVFWN HE
Sbjct: 25 SVSYDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPS 84
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+Y F+G D+VKFVKL +GLY+ LRIGPY+CAEWNFGGFPVWL+ IPGI FRT+N
Sbjct: 85 PGKYYFEGNYDLVKFVKLAKEAGLYVHLRIGPYICAEWNFGGFPVWLKYIPGINFRTDNG 144
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK +MQ+F K+V++M+ E LF QGGPII+ QIENEYG ME G GK Y KWAA M
Sbjct: 145 PFKAQMQKFTTKVVNMMKAERLFETQGGPIILSQIENEYGPMEYEIGSPGKAYTKWAAEM 204
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL GVPWVMCKQ DAP+ II+ CNG+YCD + PN KP +WTE W GW+T +GG +
Sbjct: 205 AVGLRTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTQFGGPV 264
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRP ED+AF+VARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGLL
Sbjct: 265 PHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLLR 324
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLKDLH AIKLCEPALV+ D A I LG QEAHV+ G C+AFLAN
Sbjct: 325 QPKWGHLKDLHRAIKLCEPALVSGD-ATVIPLGNYQEAHVFNYKAGG----CAAFLANYH 379
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ + A V+F Y LPPWS+SILPDC+NTV+NTA+V +Q++ + ++P VP
Sbjct: 380 QRSFAKVSFRNMHYNLPPWSISILPDCKNTVYNTARVGAQSA------RMKMTP---VPM 430
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
SW E ++ FT+ G+LE +N T+D SDYLW++T +++ D
Sbjct: 431 HGGF---------SWQAYNEEPSASGDSTFTMVGLLEQINTTRDVSDYLWYMTDVHI-DP 480
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLI 581
F ++ + P + + S L VFINGQL+G+ G + Q V+ ++G N +
Sbjct: 481 SEGFLRSGKY-PVLGVLSAGHALHVFINGQLSGTAYGSLDFPKLTFTQGVKLRAGVNKIS 539
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS VGL N G E AG G V L G G DLS W+Y++GL GE ++SI
Sbjct: 540 LLSIAVGLPNVGPHFETWNAGILGPVTLNGLNEGRRDLSWQKWSYKIGLHGEALGLHSIS 599
Query: 642 -ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ EW + + +WYKT F+AP G P+ALD+GSMGKGQ W+NG H+GR+W
Sbjct: 600 GSSSVEWAEGSLVAQRQPLSWYKTTFNAPAGNSPLALDMGSMGKGQIWINGQHVGRHWPA 659
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
G C D C Y G YN KC+TNCG +Q WYHVP+SWL+ + NLLV+FEE GG+P
Sbjct: 660 YKASGTCGD-CSYIGTYNEKKCSTNCGEASQRWYHVPQSWLKPTGNLLVVFEEWGGDPNG 718
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINK-MAPEMHLHCQDGYIISSIE 819
IS+ R VC + E + P +Y + +NK + P+ HL C G I SI+
Sbjct: 719 ISLVRRDVDSVCADIYE--WQPTLM---NYQMQASGKVNKPLRPKAHLSCGPGQKIRSIK 773
Query: 820 FASYGTPQGRCQKFSRGNCHA 840
FAS+GTP+G C + +G+CHA
Sbjct: 774 FASFGTPEGVCGSYRQGSCHA 794
>gi|297829920|ref|XP_002882842.1| hypothetical protein ARALYDRAFT_897617 [Arabidopsis lyrata subsp.
lyrata]
gi|297328682|gb|EFH59101.1| hypothetical protein ARALYDRAFT_897617 [Arabidopsis lyrata subsp.
lyrata]
Length = 847
Score = 841 bits (2172), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 419/800 (52%), Positives = 529/800 (66%), Gaps = 37/800 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYD RAI I+G RR+LIS IHYPR+TPEMWPDLI K+KEGG DVI+TYVFWN HE
Sbjct: 33 SVSYDSRAITINGKRRILISGSIHYPRSTPEMWPDLIRKAKEGGLDVIQTYVFWNGHEPS 92
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+Y F+G D+V+FVKLV SGLYL LRIGPYVCAEWNFGGFPVWL+ IPGI FRT+N
Sbjct: 93 PGKYYFEGNYDLVRFVKLVQQSGLYLHLRIGPYVCAEWNFGGFPVWLKYIPGISFRTDNG 152
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK +MQRF KIV++M+ E LF QGGPII+ QIENEYG ME G G+ Y WAA M
Sbjct: 153 PFKAQMQRFTTKIVNMMKAERLFESQGGPIILSQIENEYGPMEYELGAPGRSYTNWAAKM 212
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GLG GVPWVMCKQ DAP+ II+ACNG+YCD + PN KP +WTE W GW+T +GG +
Sbjct: 213 AVGLGTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKMWTEAWTGWFTKFGGPV 272
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RP ED+AF+VARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL
Sbjct: 273 PYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLER 332
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLKDLH AIKLCEPALV+ + + + LG QEAHVY+A CSAFLAN +
Sbjct: 333 QPKWGHLKDLHRAIKLCEPALVSGEPTR-MPLGNYQEAHVYKAK----SGACSAFLANYN 387
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ A V+F Y LPPWS+SILPDC+NTV+NTA+V +QTS + + VP
Sbjct: 388 PKSYAKVSFGSNHYNLPPWSISILPDCKNTVYNTARVGAQTSRMKM---------VRVPV 438
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
+ SW E + + +FT+ G++E +N T+D SDYLW++T + + D
Sbjct: 439 HGGL---------SWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWYMTDVKI-DA 488
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLI 581
+ F + ++ PT+T+ S + VFINGQL+GS G + + V ++G+N +
Sbjct: 489 NEGFLRNGDL-PTLTVLSAGHAMHVFINGQLSGSAYGSLDSPKLTFRKGVNLRAGFNKIA 547
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE-FQQIYSI 640
+LS VGL N G E AG G V L G G DLS WTY+VGLKGE
Sbjct: 548 ILSIAVGLPNVGPHFETWNAGVLGPVSLNGLSGGRRDLSWQKWTYKVGLKGESLSLHSLS 607
Query: 641 EENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ EW + TWYKT F AP G P+A+D+GSMGKGQ W+NG +GR+W
Sbjct: 608 GSSSVEWAEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQSLGRHWPA 667
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
G C + C Y G + DKC NCG +Q WYHVPRSWL+ S NLLV+FEE GG+P
Sbjct: 668 YKAVGSCSE-CSYTGTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVFEEWGGDPNG 726
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINK-MAPEMHLHCQDGYIISSIE 819
IS+ R VC + E V +Y + +NK + P++HL C G I++++
Sbjct: 727 ISLVRREVDSVCADIYEWQSTLV-----NYQLHASGKVNKPLHPKVHLQCGPGQKITTVK 781
Query: 820 FASYGTPQGRCQKFSRGNCH 839
FAS+GTP+G C + +G+CH
Sbjct: 782 FASFGTPEGTCGSYRQGSCH 801
>gi|15231354|ref|NP_187988.1| beta galactosidase 1 [Arabidopsis thaliana]
gi|75274602|sp|Q9SCW1.1|BGAL1_ARATH RecName: Full=Beta-galactosidase 1; Short=Lactase 1; Flags:
Precursor
gi|6686874|emb|CAB64737.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|9294020|dbj|BAB01923.1| beta-galactosidase [Arabidopsis thaliana]
gi|332641886|gb|AEE75407.1| beta galactosidase 1 [Arabidopsis thaliana]
Length = 847
Score = 841 bits (2172), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/846 (50%), Positives = 546/846 (64%), Gaps = 50/846 (5%)
Query: 1 MHSKKNNRALLQCLALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNR 60
M SK N A+ +A++ + + ++ L C S S VSYD RAI I+G R
Sbjct: 1 MGSKPN--AMKNVVAMAA--VSALFLLGFLVCSVSGS---------VSYDSRAITINGKR 47
Query: 61 RMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKF 120
R+LIS IHYPR+TPEMWPDLI K+KEGG DVI+TYVFWN HE G+Y F+G D+VKF
Sbjct: 48 RILISGSIHYPRSTPEMWPDLIRKAKEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKF 107
Query: 121 VKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVD 180
VKLV SGLYL LRIGPYVCAEWNFGGFPVWL+ IPGI FRT+N PFK +MQRF KIV+
Sbjct: 108 VKLVQQSGLYLHLRIGPYVCAEWNFGGFPVWLKYIPGISFRTDNGPFKAQMQRFTTKIVN 167
Query: 181 LMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQ 240
+M+ E LF QGGPII+ QIENEYG ME G G+ Y WAA MA+GLG GVPWVMCKQ
Sbjct: 168 MMKAERLFESQGGPIILSQIENEYGPMEYELGAPGRSYTNWAAKMAVGLGTGVPWVMCKQ 227
Query: 241 TDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARF 300
DAP+ II+ACNG+YCD + PN KP +WTE W GW+T +GG +P+RP ED+AF+VARF
Sbjct: 228 DDAPDPIINACNGFYCDYFSPNKAYKPKMWTEAWTGWFTKFGGPVPYRPAEDMAFSVARF 287
Query: 301 FQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIK 360
Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL +PKWGHLKDLH AIK
Sbjct: 288 IQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLERQPKWGHLKDLHRAIK 347
Query: 361 LCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYT 420
LCEPALV+ + + + LG QEAHVY++ CSAFLAN + + A V+F Y
Sbjct: 348 LCEPALVSGEPTR-MPLGNYQEAHVYKSK----SGACSAFLANYNPKSYAKVSFGNNHYN 402
Query: 421 LPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSW 480
LPPWS+SILPDC+NTV+NTA+V +QTS + + VP + SW
Sbjct: 403 LPPWSISILPDCKNTVYNTARVGAQTSRMKM---------VRVPVHGGL---------SW 444
Query: 481 MTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVT 540
E + + +FT+ G++E +N T+D SDYLW++T + V D + F + ++ PT+T
Sbjct: 445 QAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWYMTDVKV-DANEGFLRNGDL-PTLT 502
Query: 541 IDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFL 596
+ S + VFINGQL+GS G + + V ++G+N + +LS VGL N G
Sbjct: 503 VLSAGHAMHVFINGQLSGSAYGSLDSPKLTFRKGVNLRAGFNKIAILSIAVGLPNVGPHF 562
Query: 597 EKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE-FQQIYSIEENEAEWTDLTRDGI 655
E AG G V L G G DLS WTY+VGLKGE + EW +
Sbjct: 563 ETWNAGVLGPVSLNGLNGGRRDLSWQKWTYKVGLKGESLSLHSLSGSSSVEWAEGAFVAQ 622
Query: 656 PSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRG 715
TWYKT F AP G P+A+D+GSMGKGQ W+NG +GR+W G C + C Y G
Sbjct: 623 KQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQSLGRHWPAYKAVGSCSE-CSYTG 681
Query: 716 AYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQV 775
+ DKC NCG +Q WYHVPRSWL+ S NLLV+FEE GG+P I++ R VC +
Sbjct: 682 TFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVFEEWGGDPNGITLVRREVDSVCADI 741
Query: 776 SESHYPPVRKWSNSYSVDGKLSINK-MAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFS 834
E V +Y + +NK + P+ HL C G I++++FAS+GTP+G C +
Sbjct: 742 YEWQSTLV-----NYQLHASGKVNKPLHPKAHLQCGPGQKITTVKFASFGTPEGTCGSYR 796
Query: 835 RGNCHA 840
+G+CHA
Sbjct: 797 QGSCHA 802
>gi|297743077|emb|CBI35944.3| unnamed protein product [Vitis vinifera]
Length = 841
Score = 840 bits (2171), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 416/801 (51%), Positives = 532/801 (66%), Gaps = 39/801 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYD RAI+I+G RR+LIS IHYPR++PEMWPDLI K+KEGG DVI+TYVFWN HE
Sbjct: 29 SVSYDRRAIVINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPS 88
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+G+Y F+G+ D+V+F+KLV +GLY+ LRIGPYVCAEWNFGGFPVWL+ + GI FRTNN
Sbjct: 89 QGKYYFEGRYDLVRFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVQGINFRTNNE 148
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQRF KKIVD+M+ E LF QGGPII+ QIENEYG ME G G+ Y +WAA M
Sbjct: 149 PFKWHMQRFTKKIVDMMKSEGLFESQGGPIILSQIENEYGPMEYEIGAPGRAYTEWAAKM 208
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GLG GVPWVMCKQ DAP+ II+ CNG+YCD + PN KP +WTE W GW+T +GG +
Sbjct: 209 AVGLGTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGAV 268
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRP EDLAF+VARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DE+GLL
Sbjct: 269 PHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEFGLLR 328
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLKDLH AIKLCEPAL++ D LG +EAHV+ + C+AFLAN +
Sbjct: 329 QPKWGHLKDLHRAIKLCEPALISGDPT-VTSLGNYEEAHVFHSK----SGACAAFLANYN 383
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ A V+F Y LPPWS+SILPDC+NTV+NTA++ +Q++ ++ ++P
Sbjct: 384 PRSYAKVSFRNMHYNLPPWSISILPDCKNTVYNTARLGAQSA------TMKMTP------ 431
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
S W + E + +++F G+LE +N T+D SDYLW+ T + + +
Sbjct: 432 --------VSGRFGWQSYNEETASYDDSSFAAVGLLEQINTTRDVSDYLWYSTDVKIGYN 483
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLI 581
+ F K+ P +T+ S L VFING+L+G+ G + Q V+ ++G N +
Sbjct: 484 E-GFLKSGRY-PVLTVLSAGHALHVFINGRLSGTAYGSLENPKLTFSQGVKLRAGVNTIA 541
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE-FQQIYSI 640
LLS VGL N G E AG G V L G G DLS W+Y+VGLKGE
Sbjct: 542 LLSIAVGLPNVGPHFETWNAGVLGPVSLNGLNEGRRDLSWQKWSYKVGLKGEALSLHSLS 601
Query: 641 EENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ EW + + TWYKT F+AP G P+ALD+GSMGKGQ W+NG ++GRYW
Sbjct: 602 GSSSVEWVEGSLMARGQPLTWYKTTFNAPGGNTPLALDMGSMGKGQIWINGQNVGRYWPA 661
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
GGC D C+Y G Y+ KC +NCG P+Q WYHVP SWL + NLLV+FEE+GGNP
Sbjct: 662 YKATGGCGD-CNYAGTYSEKKCLSNCGEPSQRWYHVPHSWLSPTGNLLVVFEESGGNPAG 720
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINK-MAPEMHLHCQDGYIISSIE 819
IS+ R VC + E + P +Y + +NK + P+ HL C G ISSI+
Sbjct: 721 ISLVEREIESVCADIYE--WQPTLM---NYEMQASGKVNKPLRPKAHLWCAPGQKISSIK 775
Query: 820 FASYGTPQGRCQKFSRGNCHA 840
FAS+GTP+G C + G+CHA
Sbjct: 776 FASFGTPEGVCGSYREGSCHA 796
>gi|20260596|gb|AAM13196.1| galactosidase, putative [Arabidopsis thaliana]
Length = 847
Score = 840 bits (2169), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/846 (50%), Positives = 546/846 (64%), Gaps = 50/846 (5%)
Query: 1 MHSKKNNRALLQCLALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNR 60
M SK N A+ +A++ + + ++ L C S S VSYD RAI I+G R
Sbjct: 1 MGSKPN--AMKNVVAMAA--VSALFLLGFLVCSVSGS---------VSYDSRAITINGKR 47
Query: 61 RMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKF 120
R+LIS IHYPR+TPEMWPDLI K+KEGG DVI+TYVFWN HE G+Y F+G D+VKF
Sbjct: 48 RILISGSIHYPRSTPEMWPDLIRKAKEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKF 107
Query: 121 VKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVD 180
VKLV SGLYL LRIGPYVCAEWNFGGFPVWL+ IPGI FRT+N PFK +MQRF KIV+
Sbjct: 108 VKLVQQSGLYLHLRIGPYVCAEWNFGGFPVWLKYIPGISFRTDNGPFKAQMQRFTTKIVN 167
Query: 181 LMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQ 240
+M+ E LF QGGPII+ QIENEYG ME G G+ Y WAA MA+GLG GVPWVMCKQ
Sbjct: 168 MMKAERLFESQGGPIILSQIENEYGPMEYELGAPGRSYTNWAAKMAVGLGTGVPWVMCKQ 227
Query: 241 TDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARF 300
DAP+ II+ACNG+YCD + PN KP +WTE W GW+T +GG +P+RP ED+AF+VARF
Sbjct: 228 DDAPDPIINACNGFYCDYFSPNKAYKPKMWTEAWTGWFTKFGGPVPYRPAEDMAFSVARF 287
Query: 301 FQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIK 360
Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL +PKWGHLKDLH AIK
Sbjct: 288 IQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLERQPKWGHLKDLHRAIK 347
Query: 361 LCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYT 420
LCEPALV+ + + + LG QEAHVY++ CSAFLAN + + A V+F Y
Sbjct: 348 LCEPALVSGEPTR-MPLGNYQEAHVYKSK----SGACSAFLANYNPKSYAKVSFGNNHYN 402
Query: 421 LPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSW 480
LPPWS+SILPDC+NTV+NTA+V +QTS + + VP + SW
Sbjct: 403 LPPWSISILPDCKNTVYNTARVGAQTSRMKM---------VRVPVHGGL---------SW 444
Query: 481 MTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVT 540
E + + +FT+ G++E +N T+D SDYLW++T + V D + F + ++ PT+T
Sbjct: 445 QAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWYMTDVKV-DANEGFLRNGDL-PTLT 502
Query: 541 IDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFL 596
+ S + +FINGQL+GS G + + V ++G+N + +LS VGL N G
Sbjct: 503 VLSAGHAMHLFINGQLSGSAYGSLDSPKLTFRKGVNLRAGFNKIAILSIAVGLPNVGPHF 562
Query: 597 EKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE-FQQIYSIEENEAEWTDLTRDGI 655
E AG G V L G G DLS WTY+VGLKGE + EW +
Sbjct: 563 ETWNAGVLGPVSLNGLNGGRRDLSWQKWTYKVGLKGESLSLHSLSGSSSVEWAEGAFVAQ 622
Query: 656 PSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRG 715
TWYKT F AP G P+A+D+GSMGKGQ W+NG +GR+W G C + C Y G
Sbjct: 623 KQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQSLGRHWPAYKAVGSCSE-CSYTG 681
Query: 716 AYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQV 775
+ DKC NCG +Q WYHVPRSWL+ S NLLV+FEE GG+P I++ R VC +
Sbjct: 682 TFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVFEEWGGDPNGITLVRREVDSVCADI 741
Query: 776 SESHYPPVRKWSNSYSVDGKLSINK-MAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFS 834
E V +Y + +NK + P+ HL C G I++++FAS+GTP+G C +
Sbjct: 742 YEWQSTLV-----NYQLHASGKVNKPLHPKAHLQCGPGQKITTVKFASFGTPEGTCGSYR 796
Query: 835 RGNCHA 840
+G+CHA
Sbjct: 797 QGSCHA 802
>gi|449458175|ref|XP_004146823.1| PREDICTED: beta-galactosidase 1-like [Cucumis sativus]
gi|449515710|ref|XP_004164891.1| PREDICTED: beta-galactosidase 1-like [Cucumis sativus]
Length = 841
Score = 839 bits (2168), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 421/800 (52%), Positives = 532/800 (66%), Gaps = 35/800 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYD +AIII+G+RR+LIS IHYPR+T EMWPDLI K+KEGG DVIETYVFWN HE
Sbjct: 27 SVSYDSKAIIINGHRRILISGSIHYPRSTSEMWPDLIQKAKEGGLDVIETYVFWNGHEPE 86
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+Y F+G D+V+FVKLV +GLY+ LRIGPYVCAEWNFGGFPVWL+ IPGI FRT+NA
Sbjct: 87 PGKYYFEGNYDLVRFVKLVHQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGISFRTDNA 146
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK +M+RF +KIV++M+ E L+ QGGPII+ QIENEYG ME G GK Y KWAA M
Sbjct: 147 PFKFQMERFTRKIVNMMKAERLYESQGGPIILSQIENEYGPMEYELGAPGKAYSKWAAQM 206
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
ALGLG GVPWVMCKQ DAP+ II+ CNG+YCD + PN KP +WTE W GW+T +GG +
Sbjct: 207 ALGLGTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTQFGGAV 266
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRP ED+AFAVARF Q+GG+ +NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGLL
Sbjct: 267 PHRPAEDMAFAVARFIQKGGALINYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLR 326
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLKDL+ AIKLCEPALV+ D +LG QEAHV+++ C+AFL+N +
Sbjct: 327 QPKWGHLKDLNRAIKLCEPALVSGDPI-VTRLGNYQEAHVFKS----KSGACAAFLSNYN 381
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ A+V F Y +PPWS+SILPDC+NTVFNTA+V +QT+I + +SP VP
Sbjct: 382 PRSYATVAFGNMHYNIPPWSISILPDCKNTVFNTARVGAQTAI------MKMSP---VPM 432
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
S SW E ++E FT G+LE +N T+D +DYLW+ T +++ D
Sbjct: 433 HE---------SFSWQAYNEEPASYNEKAFTTVGLLEQINTTRDATDYLWYTTDVHI-DA 482
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLI 581
+ F ++ + P +T+ S + VF+NGQL G+ G + + V ++G N +
Sbjct: 483 NEGFLRSGKY-PVLTVLSAGHAMHVFVNGQLAGTAYGSLDFPKLTFSRGVNLRAGNNKIA 541
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE-FQQIYSI 640
LLS VGL N G E AG G V L G G DL+ WTY++GL GE
Sbjct: 542 LLSIAVGLPNVGPHFEMWNAGILGPVNLNGLDEGRRDLTWQKWTYKIGLDGEAMSLHSLS 601
Query: 641 EENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ EW + TW+KT F+AP G P+ALD+GSMGKGQ W+NG +GRYW
Sbjct: 602 GSSSVEWIQGSLVAQKQPLTWFKTTFNAPAGNSPLALDMGSMGKGQIWLNGQSLGRYWPA 661
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
G C +CDY G YN KC++NCG +Q WYHVPRSWL + NLLV+FEE GG+P
Sbjct: 662 YKSTGSC-GSCDYTGTYNEKKCSSNCGEASQRWYHVPRSWLNPTGNLLVVFEEWGGDPNG 720
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEF 820
I + R VC ++E P + W S GK++ + P+ HL C G ISS++F
Sbjct: 721 IHLVRRDVDSVCVNINEWQ-PTLMNWQMQSS--GKVN-KPLRPKAHLSCGPGQKISSVKF 776
Query: 821 ASYGTPQGRCQKFSRGNCHA 840
AS+GTP+G C F G+CHA
Sbjct: 777 ASFGTPEGECGSFREGSCHA 796
>gi|1168654|sp|P45582.1|BGAL_ASPOF RecName: Full=Beta-galactosidase; Short=Lactase; Flags: Precursor
gi|452712|emb|CAA54525.1| beta-galactosidase [Asparagus officinalis]
Length = 832
Score = 838 bits (2164), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/810 (51%), Positives = 532/810 (65%), Gaps = 50/810 (6%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YDH+++II+G RR+LIS IHYPR+TPEMWPDLI K+K+GG DVI+TYVFWN HE
Sbjct: 26 SVTYDHKSVIINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPS 85
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
GQY F G+ D+V+F+KLV +GLY LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N
Sbjct: 86 PGQYYFGGRYDLVRFLKLVKQAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGIHFRTDNG 145
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK M +F +KIV +M+ E L+ QGGPII+ QIENEYG +E G GK Y WAA M
Sbjct: 146 PFKAAMGKFTEKIVSMMKAEGLYETQGGPIILSQIENEYGPVEYYDGAAGKSYTNWAAKM 205
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL GVPWVMCKQ DAP+ +I+ CNG+YCD + PN NKP +WTE W GW+T +GG +
Sbjct: 206 AVGLNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKDNKPKMWTEAWTGWFTGFGGAV 265
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P RP ED+AFAVARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGLL
Sbjct: 266 PQRPAEDMAFAVARFIQKGGSFINYYMYHGGTNFGRTAGGPFISTSYDYDAPIDEYGLLR 325
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHL+DLH AIKLCEPALV+ + LGQNQE++VYR S+S+C+AFLAN +
Sbjct: 326 QPKWGHLRDLHKAIKLCEPALVSGEPT-ITSLGQNQESYVYR-----SKSSCAAFLANFN 379
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
A+VTF G Y LPPWSVSILPDC+ TVFNTA+V +QT+ +++
Sbjct: 380 SRYYATVTFNGMHYNLPPWSVSILPDCKTTVFNTARVGAQTTTMKMQYLGGF-------- 431
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
SW E ++N FT G++E L+ T D SDYLW+ T + ++ +
Sbjct: 432 -------------SWKAYTEDTDALNDNTFTKDGLVEQLSTTWDRSDYLWYTTYVDIAKN 478
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLI 581
+ F KT + P +T+ S + VFINGQL+G+ G + + +G N +
Sbjct: 479 E-EFLKTGKY-PYLTVMSAGHAVHVFINGQLSGTAYGSLDNPKLTYSGSAKLWAGSNKIS 536
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
+LS +VGL N G E G G V LTG G DLS WTYQ+GL GE ++S+
Sbjct: 537 ILSVSVGLPNVGNHFETWNTGVLGPVTLTGLNEGKRDLSLQKWTYQIGLHGETLSLHSLT 596
Query: 642 -ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ EW + ++ TWYKT+F+AP G +P+ALD+ +MGKGQ W+NG IGRYW
Sbjct: 597 GSSNVEWGEASQK---QPLTWYKTFFNAPPGNEPLALDMNTMGKGQIWINGQSIGRYWPA 653
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
G C +CDYRG YN KC +NCG +Q WYHVPRSWL + N LV+ EE GG+P
Sbjct: 654 YKASGSC-GSCDYRGTYNEKKCLSNCGEASQRWYHVPRSWLIPTGNFLVVLEEWGGDPTG 712
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKW-SNSYSVDGKLSINKMAPEMHLHCQDGYIISSIE 819
IS+ RS VC +V E P + W + +Y P++HL C G +S I+
Sbjct: 713 ISMVKRSVASVCAEVEELQ-PTMDNWRTKAYG----------RPKVHLSCDPGQKMSKIK 761
Query: 820 FASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
FAS+GTPQG C FS G+CHA S +
Sbjct: 762 FASFGTPQGTCGSFSEGSCHAHKSYDAFEQ 791
>gi|356556730|ref|XP_003546676.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 840
Score = 838 bits (2164), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/836 (51%), Positives = 541/836 (64%), Gaps = 46/836 (5%)
Query: 10 LLQCLALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIH 69
++ CL L + M + +++ S + S+ AS VSYD +AI I+G RR+LIS IH
Sbjct: 1 MVICLKLII--MWNVALLLVFSLIGSAKAS-------VSYDSKAITINGQRRILISGSIH 51
Query: 70 YPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGL 129
YPR+TPEMWPDLI K+K+GG DVI+TYVFWN HE G+Y F+G D+VKF+KLV +GL
Sbjct: 52 YPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFIKLVQQAGL 111
Query: 130 YLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFS 189
Y+ LRIGPYVCAEWNFGGFPVWL+ IPGI FRT+N PFK +MQ+F KIVDLM+ E L+
Sbjct: 112 YVHLRIGPYVCAEWNFGGFPVWLKYIPGISFRTDNEPFKHQMQKFTTKIVDLMKAERLYE 171
Query: 190 WQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIID 249
QGGPIIM QIENEYG ME G GK Y KWAA MA+GLG GVPWVMCKQ D P+ +I+
Sbjct: 172 SQGGPIIMSQIENEYGPMEYEIGAAGKAYTKWAAEMAMGLGTGVPWVMCKQDDTPDPLIN 231
Query: 250 ACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMN 309
CNG+YCD + PN KP +WTE W GW+T +GG +PHRP EDLAF+VARF Q+GGSF+N
Sbjct: 232 TCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGPVPHRPAEDLAFSVARFIQKGGSFIN 291
Query: 310 YYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAA 369
YYMY GGTNFGRT+GGPF TSYDYDAP+DEYGLL +PKWGHLKDLH AIKLCEPALV+
Sbjct: 292 YYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSG 351
Query: 370 DSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSIL 429
D K+G QEAHV+++ C+AFLAN + + A+V F Y LPPWS+SIL
Sbjct: 352 DPT-VTKIGNYQEAHVFKSK----SGACAAFLANYNPKSYATVAFGNMHYNLPPWSISIL 406
Query: 430 PDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGV 489
PDC+NTV+NTA+V SQ++ Q M + SW++ E
Sbjct: 407 PDCKNTVYNTARVGSQSA-----------------QMKMTRVPIHG-GFSWLSFNEETTT 448
Query: 490 WSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLR 549
+++FT+ G+LE LN T+D SDYLW+ T + V D + F + N P +T+ S L
Sbjct: 449 TDDSSFTMTGLLEQLNTTRDLSDYLWYSTDV-VLDPNEGFLR-NGKDPVLTVFSAGHALH 506
Query: 550 VFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRG 605
VFINGQL+G+ G + + V+ ++G N + LLS VGL N G E AG G
Sbjct: 507 VFINGQLSGTAYGSLEFPKLTFNEGVKLRAGVNKISLLSVAVGLPNVGPHFETWNAGVLG 566
Query: 606 QVKLTGFKNGDIDLSKILWTYQVGLKGEF-QQIYSIEENEAEWTDLTRDGIPSTFTWYKT 664
+ L+G G DLS W+Y+VGLKGE + EW + TWYKT
Sbjct: 567 PISLSGLNEGRRDLSWQKWSYKVGLKGEILSLHSLSGSSSVEWIQGSLVSQRQPLTWYKT 626
Query: 665 YFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTT 724
FDAP G P+ALD+ SMGKGQ W+NG ++GRYW G C D CDY G YN +KC +
Sbjct: 627 TFDAPAGTAPLALDMDSMGKGQVWLNGQNLGRYWPAYKASGTC-DYCDYAGTYNENKCRS 685
Query: 725 NCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVR 784
NCG +Q WYHVP+SWL+ + NLLV+FEE GG+P I + R VC + E +
Sbjct: 686 NCGEASQRWYHVPQSWLKPTGNLLVVFEELGGDPNGIFLVRRDIDSVCADIYEWQPNLI- 744
Query: 785 KWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHA 840
S GK + P++HL C G ISSI+FAS+GTP G C F G+CHA
Sbjct: 745 --SYQMQTSGKAPVR---PKVHLSCSPGQKISSIKFASFGTPAGSCGNFHEGSCHA 795
>gi|108707233|gb|ABF95028.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 796
Score = 838 bits (2164), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/779 (51%), Positives = 535/779 (68%), Gaps = 24/779 (3%)
Query: 77 MWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIG 136
MWP LI KSK+GG DVIETYVFW+ HE++RGQY+F+G+ D+V+FVK V +GLY+ LRIG
Sbjct: 1 MWPGLIQKSKDGGLDVIETYVFWDIHEAVRGQYDFEGRKDLVRFVKAVADAGLYVHLRIG 60
Query: 137 PYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPII 196
PYVCAEWN+GGFPVWL +PGI+FRT+N FK EMQRF +K+VD M+ L++ QGGPII
Sbjct: 61 PYVCAEWNYGGFPVWLHFVPGIKFRTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPII 120
Query: 197 MLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYC 256
+ QIENEYGN++S+YG GK Y++WAA MA+ L GVPWVMC+Q+DAP+ +I+ CNG+YC
Sbjct: 121 LSQIENEYGNIDSAYGAAGKAYMRWAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYC 180
Query: 257 DGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGG 316
D + PNS +KP +WTENW GW+ ++GG +P+RP EDLAFAVARF+QRGG+F NYYMY GG
Sbjct: 181 DQFTPNSKSKPKMWTENWSGWFLSFGGAVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGG 240
Query: 317 TNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIK 376
TNFGR++GGPF TSYDYDAPIDEYG++ +PKWGHL+D+H AIKLCEPAL+AA+ + Y
Sbjct: 241 TNFGRSTGGPFIATSYDYDAPIDEYGMVRQPKWGHLRDVHKAIKLCEPALIAAEPS-YSS 299
Query: 377 LGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTV 436
LGQN EA VY+ S C+AFLAN+D + +V F G +Y LP WSVSILPDC+N V
Sbjct: 300 LGQNTEATVYQT---ADNSICAAFLANVDAQSDKTVKFNGNTYKLPAWSVSILPDCKNVV 356
Query: 437 FNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFT 496
NTA+++SQ + + L +I S+I +L++ W EP+G+ EN T
Sbjct: 357 LNTAQINSQVTTSEMR---SLGSSIQDTDDSLITPELATA--GWSYAIEPVGITKENALT 411
Query: 497 VQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQL 556
G++E +N T D SD+LW+ T I V D+ N + + ++S+ VL+++ING+L
Sbjct: 412 KPGLMEQINTTADASDFLWYSTSIVVKGDEPYL---NGSQSNLLVNSLGHVLQIYINGKL 468
Query: 557 ----TGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGF 612
GS + + PV G N + LLS TVGL NYGAF + GAG G VKL+G
Sbjct: 469 AGSAKGSASSSLISLQTPVTLVPGKNKIDLLSTTVGLSNYGAFFDLVGAGVTGPVKLSG- 527
Query: 613 KNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGI 672
NG ++LS WTYQ+GL+GE +Y+ E EW WYKT F AP G
Sbjct: 528 PNGALNLSSTDWTYQIGLRGEDLHLYNPSEASPEWVSDNAYPTNQPLIWYKTKFTAPAGD 587
Query: 673 DPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQ 731
DPVA+D MGKG+AWVNG IGRYW T +AP+ GC ++C+YRGAY+S+KC CG P+Q
Sbjct: 588 DPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGAYSSNKCLKKCGQPSQ 647
Query: 732 TWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYS 791
T YHVPRS+LQ +N LV+FE+ GG+P IS R T +C VSE H + W
Sbjct: 648 TLYHVPRSFLQPGSNDLVLFEQFGGDPSMISFTTRQTSSICAHVSEMHPAQIDSW----- 702
Query: 792 VDGKLSINKMAPEMHLHC-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+ + + P + L C ++G +IS+I+FAS+GTP G C ++ G C + +L+VV E
Sbjct: 703 ISPQQTSQTQGPALRLECPREGQVISNIKFASFGTPSGTCGNYNHGECSSSQALAVVQE 761
>gi|350537661|ref|NP_001234303.1| beta-galactosidase precursor [Solanum lycopersicum]
gi|7939619|gb|AAF70822.1|AF154421_1 beta-galactosidase [Solanum lycopersicum]
gi|4138137|emb|CAA10173.1| ss-galactosidase [Solanum lycopersicum]
Length = 838
Score = 838 bits (2164), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 416/802 (51%), Positives = 531/802 (66%), Gaps = 41/802 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYDHRAII++G RR+LIS +HYPR+TPEMWP +I K+KEGG DVI+TYVFWN HE
Sbjct: 26 SVSYDHRAIIVNGQRRILISGSVHYPRSTPEMWPGIIQKAKEGGVDVIQTYVFWNGHEPQ 85
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+G+Y F+G+ D+VKF+KLV +GLY+ LR+GPY CAEWNFGGFPVWL+ +PGI FRT+N
Sbjct: 86 QGKYYFEGRYDLVKFIKLVHQAGLYVHLRVGPYACAEWNFGGFPVWLKYVPGISFRTDNG 145
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ+F KIV++M+ E L+ QGGPII+ QIENEYG ME G GK Y +WAA M
Sbjct: 146 PFKAAMQKFTAKIVNMMKAERLYETQGGPIILSQIENEYGPMEWELGAPGKSYAQWAAKM 205
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL GVPWVMCKQ DAP+ II+ACNG+YCD + PN KP +WTE W W+T +G +
Sbjct: 206 AVGLDTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKIWTEAWTAWFTGFGNPV 265
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RP EDLAF+VA+F Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGLL
Sbjct: 266 PYRPAEDLAFSVAKFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLLR 325
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLKDLH AIKLCEPALV+ D A LG QEAHV+R+ +C+AFLAN D
Sbjct: 326 QPKWGHLKDLHRAIKLCEPALVSGDPA-VTALGHQQEAHVFRSK----AGSCAAFLANYD 380
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+H+ A+V+F + Y LPPWS+SILPDC+NTVFNTA++ +Q++
Sbjct: 381 QHSFATVSFANRHYNLPPWSISILPDCKNTVFNTARIGAQSA------------------ 422
Query: 466 QSMIESKLSSTSKS--WMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
+ K++ S+ W + E + +++FTV G+LE +N T+D SDYLW+ T + +
Sbjct: 423 ----QMKMTPVSRGLPWQSFNEETSSYEDSSFTVVGLLEQINTTRDVSDYLWYSTDVKI- 477
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYND 579
D F + + P +TI S L VF+NGQL G+ G K + V ++G N
Sbjct: 478 DSREKFLRGGK-WPWLTIMSAGHALHVFVNGQLAGTAYGSLEKPKLTFSKAVNLRAGVNK 536
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE-FQQIY 638
+ LLS VGL N G E AG G V LTG G DL+ W+Y+VGLKGE
Sbjct: 537 ISLLSIAVGLPNIGPHFETWNAGVLGPVSLTGLDEGKRDLTWQKWSYKVGLKGEALSLHS 596
Query: 639 SIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
+ EW + + TWYK+ F+AP G DP+ALDL +MGKGQ W+NG +GRYW
Sbjct: 597 LSGSSSVEWVEGSLVAQRQPLTWYKSTFNAPAGNDPLALDLNTMGKGQVWINGQSLGRYW 656
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
G C C+Y G +N KC +NCG +Q WYHVPRSWL + NLLV+FEE GG P
Sbjct: 657 PGYKASGNC-GACNYAGWFNEKKCLSNCGEASQRWYHVPRSWLYPTGNLLVLFEEWGGEP 715
Query: 759 FEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSI 818
IS+ R VC ++E P + W S GK+ + P+ HL C G I+SI
Sbjct: 716 HGISLVKREVASVCADINEWQ-PQLVNWQMQAS--GKVD-KPLRPKAHLSCASGQKITSI 771
Query: 819 EFASYGTPQGRCQKFSRGNCHA 840
+FAS+GTPQG C F G+CHA
Sbjct: 772 KFASFGTPQGVCGSFREGSCHA 793
>gi|356526021|ref|XP_003531618.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 843
Score = 838 bits (2164), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/830 (50%), Positives = 544/830 (65%), Gaps = 42/830 (5%)
Query: 16 LSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATP 75
L V+ + +++++ S + +SAS VSYDH+AIII+G RR+L+S IHYPR+TP
Sbjct: 6 LKVWNVPLLLVVFACSLLGQASAS-------VSYDHKAIIINGQRRILLSGSIHYPRSTP 58
Query: 76 EMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRI 135
EMWPDLI K+KEGG DVI+TYVFWN HE G+Y F G D+V+F+KLV +GLY+ LRI
Sbjct: 59 EMWPDLIQKAKEGGLDVIQTYVFWNGHEPSPGKYYFGGNYDLVRFIKLVQQAGLYVNLRI 118
Query: 136 GPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPI 195
GPYVCAEWNFGGFPVWL+ IPGI FRT+N PFK +M++F KKIVD+M+ E LF QGGPI
Sbjct: 119 GPYVCAEWNFGGFPVWLKYIPGISFRTDNGPFKFQMEKFTKKIVDMMKAERLFESQGGPI 178
Query: 196 IMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYY 255
I+ QIENEYG ME G G+ Y +WAA MA+GLG GVPW+MCKQ DAP+ II+ CNG+Y
Sbjct: 179 ILSQIENEYGPMEYEIGAPGRSYTQWAAHMAVGLGTGVPWIMCKQDDAPDPIINTCNGFY 238
Query: 256 CDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFG 315
CD + PN KP +WTE W GW+T +GG +PHRP EDLAF++ARF Q+GGSF+NYYMY G
Sbjct: 239 CDYFSPNKAYKPKMWTEAWTGWFTEFGGAVPHRPAEDLAFSIARFIQKGGSFVNYYMYHG 298
Query: 316 GTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYI 375
GTNFGRT+GGPF TSYDYDAP+DEYGL +PKWGHLKDLH AIKLCEPALV+ DS
Sbjct: 299 GTNFGRTAGGPFIATSYDYDAPLDEYGLARQPKWGHLKDLHRAIKLCEPALVSGDSTVQ- 357
Query: 376 KLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNT 435
+LG +EAHV+R+ C+AFLAN + + A+V F Q Y LPPWS+SILP+C++T
Sbjct: 358 RLGNYEEAHVFRSK----SGACAAFLANYNPQSYATVAFGNQHYNLPPWSISILPNCKHT 413
Query: 436 VFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNF 495
V+NTA+V SQ++ + VP + SW E +++F
Sbjct: 414 VYNTARVGSQSTTMKM---------TRVPIHGGL---------SWKAFNEETTTTDDSSF 455
Query: 496 TVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQ 555
TV G+LE +N T+D SDYLW+ T + ++ ++ F + N P +T+ S L VFIN Q
Sbjct: 456 TVTGLLEQINATRDLSDYLWYSTDVVINSNE-GFLR-NGKNPVLTVLSAGHALHVFINNQ 513
Query: 556 LTGSVIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTG 611
L+G+ G + + V ++G N + LLS VGL N G E+ AG G + L+G
Sbjct: 514 LSGTAYGSLEAPKLTFSESVRLRAGVNKISLLSVAVGLPNVGPHFERWNAGVLGPITLSG 573
Query: 612 FKNGDIDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPD 670
G DL+ W+Y+VGLKGE ++S+ + EW TWYKT FDAP
Sbjct: 574 LNEGRRDLTWQKWSYKVGLKGEALNLHSLSGSSSVEWLQGFLVSRRQPLTWYKTTFDAPA 633
Query: 671 GIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPT 730
G+ P+ALD+GSMGKGQ W+NG +GRYW G C C+Y G YN KC +NCG +
Sbjct: 634 GVAPLALDMGSMGKGQVWINGQSLGRYWPAYKASGSC-GYCNYAGTYNEKKCGSNCGEAS 692
Query: 731 QTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSY 790
Q WYHVP SWL+ S NLLV+FEE GG+P I + R VC + E V S
Sbjct: 693 QRWYHVPHSWLKPSGNLLVVFEELGGDPNGIFLVRRDIDSVCADIYEWQPNLV---SYEM 749
Query: 791 SVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHA 840
GK+ + + P+ HL C G ISSI+FAS+GTP G C + G+CHA
Sbjct: 750 QASGKVR-SPVRPKAHLSCGPGQKISSIKFASFGTPVGSCGSYREGSCHA 798
>gi|308550948|gb|ADO34788.1| beta-galactosidase STBG3 [Solanum lycopersicum]
Length = 838
Score = 837 bits (2162), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 416/802 (51%), Positives = 531/802 (66%), Gaps = 41/802 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYDHRAII++G RR+LIS +HYPR+TPEMWP +I K+KEGG DVI+TYVFWN HE
Sbjct: 26 SVSYDHRAIIVNGQRRILISGSVHYPRSTPEMWPGIIQKAKEGGVDVIQTYVFWNGHEPQ 85
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+G+Y F+G+ D+VKF+KLV +GLY+ LR+GPY CAEWNFGGFPVWL+ +PGI FRT+N
Sbjct: 86 QGKYYFEGRYDLVKFIKLVHQAGLYVHLRVGPYACAEWNFGGFPVWLKYVPGISFRTDNG 145
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ+F KIV++M+ E L+ QGGPII+ QIENEYG ME G GK Y +WAA M
Sbjct: 146 PFKAAMQKFTAKIVNMMKAERLYETQGGPIILSQIENEYGPMEWELGAPGKSYAQWAAKM 205
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL GVPWVMCKQ DAP+ II+ACNG+YCD + PN KP +WTE W W+T +G +
Sbjct: 206 AVGLDTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKIWTEAWTAWFTGFGNPV 265
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RP EDLAF+VA+F Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGLL
Sbjct: 266 PYRPAEDLAFSVAKFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLLR 325
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLKDLH AIKLCEPALV+ D A LG QEAHV+R+ +C+AFLAN D
Sbjct: 326 QPKWGHLKDLHRAIKLCEPALVSGDPA-VTALGHQQEAHVFRSK----AGSCAAFLANYD 380
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+H+ A+V+F + Y LPPWS+SILPDC+NTVFNTA++ +Q++
Sbjct: 381 QHSFATVSFANRHYNLPPWSISILPDCKNTVFNTARIGAQSA------------------ 422
Query: 466 QSMIESKLSSTSKS--WMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
+ K++ S+ W + E + +++FTV G+LE +N T+D SDYLW+ T + +
Sbjct: 423 ----QMKMTPVSRGLPWQSFNEETSSYEDSSFTVVGLLEQINTTRDVSDYLWYSTDVKI- 477
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYND 579
D F + + P +TI S L VF+NGQL G+ G K + V ++G N
Sbjct: 478 DSREKFLRGGK-WPWLTIMSAGHALHVFVNGQLAGTAYGSLEKPKLTFSKAVNLRAGVNK 536
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE-FQQIY 638
+ LLS VGL N G E AG G V LTG G DL+ W+Y+VGLKGE
Sbjct: 537 ISLLSIAVGLPNIGPHFETWNAGVLGPVSLTGLDEGKRDLTWQKWSYKVGLKGEALSLHS 596
Query: 639 SIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
+ EW + + TWYK+ F+AP G DP+ALDL +MGKGQ W+NG +GRYW
Sbjct: 597 LSGSSSVEWVEGSLVAQRQPLTWYKSTFNAPAGNDPLALDLNTMGKGQVWINGQSLGRYW 656
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
G C C+Y G +N KC +NCG +Q WYHVPRSWL + NLLV+FEE GG P
Sbjct: 657 PGYKASGNC-GACNYAGWFNEKKCLSNCGEASQRWYHVPRSWLYPTGNLLVLFEEWGGEP 715
Query: 759 FEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSI 818
IS+ R VC ++E P + W S GK+ + P+ HL C G I+SI
Sbjct: 716 HGISLVKREVASVCADINEWQ-PQLVNWQMQAS--GKVD-KPLRPKAHLSCAPGQKITSI 771
Query: 819 EFASYGTPQGRCQKFSRGNCHA 840
+FAS+GTPQG C F G+CHA
Sbjct: 772 KFASFGTPQGVCGSFREGSCHA 793
>gi|255560830|ref|XP_002521428.1| beta-galactosidase, putative [Ricinus communis]
gi|223539327|gb|EEF40918.1| beta-galactosidase, putative [Ricinus communis]
Length = 841
Score = 837 bits (2161), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/801 (52%), Positives = 528/801 (65%), Gaps = 36/801 (4%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYDHRA++IDG RR+L S IHYPR TPE+WPD+I KSKEGG DVIETYVFWN HE ++
Sbjct: 30 VSYDHRALVIDGKRRVLQSGSIHYPRTTPEVWPDIIRKSKEGGLDVIETYVFWNYHEPVK 89
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY F+G+ D+V+FVK + +GL + LRIGPY CAEWN+GGFP+WL IPGI+FRT N
Sbjct: 90 GQYYFEGRFDLVRFVKTIQEAGLLVHLRIGPYACAEWNYGGFPLWLHFIPGIQFRTTNEL 149
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FKEEM+ F+ KIV++M+EE LF+ QGGPII+ Q+ENEYGN+E +YG G+ YVKWAA A
Sbjct: 150 FKEEMKLFLTKIVNMMKEENLFASQGGPIILAQVENEYGNVEWAYGAAGELYVKWAAETA 209
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+ L VPWVMC Q DAP+ II+ CNG+YCD + PNS +KP +WTEN+ GW+ ++G +P
Sbjct: 210 VSLNTSVPWVMCAQVDAPDPIINTCNGFYCDRFSPNSPSKPKMWTENYSGWFLSFGYAIP 269
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
+RPVEDLAFAVARFF+ GG+F NYYMYFGGTNFGRT+GGP TSYDYDAPIDEYG + +
Sbjct: 270 YRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 329
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHL+DLH AIK CE L+++D +LG N EAH+Y Y S ++C+AFLAN D
Sbjct: 330 PKWGHLRDLHKAIKQCEEHLISSDPIHQ-QLGNNLEAHIY----YKSSNDCAAFLANYDS 384
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+ A+VTF G Y LP WSVSILPDC+N +FNTAKV L L +
Sbjct: 385 SSDANVTFNGNIYFLPAWSVSILPDCKNVIFNTAKV----------LILNLGDDFFAHST 434
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
S+ E L SW KE +G+W N+FT G+LE +N TKD SD+LW+ T I V+ D
Sbjct: 435 SVNEIPLEQIVWSWY--KEEVGIWGNNSFTAPGLLEQINTTKDISDFLWYSTSISVNADQ 492
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGH---WVKVVQPVEFQSGYNDLILL 583
+ + I+S+ VF+N L G H + + + G N L LL
Sbjct: 493 VK-------DIILNIESLGHAALVFVNKVLVGKYGNHDDASFSLTEKISLIEGNNTLDLL 545
Query: 584 SQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE-E 642
S +G+QNYG + + GAG V L G IDLS WTYQVGL+GE+ + +
Sbjct: 546 SMMIGVQNYGPWFDVQGAGIYA-VLLVGQSKVKIDLSSEKWTYQVGLEGEYFGLDKVSLA 604
Query: 643 NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV-V 701
N + WT I + WYK F AP+G P+AL+L MGKGQAWVNG IGRYW +
Sbjct: 605 NSSLWTQGASPPINKSLIWYKGTFVAPEGKGPLALNLAGMGKGQAWVNGQSIGRYWPAYL 664
Query: 702 APKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEI 761
+P GC D+CDYRGAY+S KC CG P QT YH+PR+W+ NLLV+ EE GG+P +I
Sbjct: 665 SPSTGCNDSCDYRGAYDSFKCLKKCGQPAQTLYHIPRTWVHPGENLLVLHEELGGDPSKI 724
Query: 762 SVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFA 821
SV R+ +C VSE PP W +S + PE+ L C+ G+ I SI FA
Sbjct: 725 SVLTRTGHEICSIVSEDDPPPADSWKSSSEFKSQ------NPEVRLTCEQGWHIKSINFA 778
Query: 822 SYGTPQGRCQKFSRGNCHAPM 842
S+GTP G C F+ G+CHA M
Sbjct: 779 SFGTPAGICGTFNPGSCHADM 799
>gi|302824860|ref|XP_002994069.1| hypothetical protein SELMODRAFT_187747 [Selaginella moellendorffii]
gi|300138075|gb|EFJ04856.1| hypothetical protein SELMODRAFT_187747 [Selaginella moellendorffii]
Length = 741
Score = 837 bits (2161), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 408/747 (54%), Positives = 510/747 (68%), Gaps = 55/747 (7%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
V+YDHR +II+G RMLISA IHYPRA P+MW LI+ +K GG DVIETYVFW+ H+
Sbjct: 25 TVAYDHRGLIINGQHRMLISASIHYPRAAPQMWSQLISNAKAGGIDVIETYVFWDGHQPT 84
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
R YNF+G+ D+V FVKLV +GLY LRIGPYVCAEWN GGFPVWL+D+ GIEFRTNN
Sbjct: 85 RDTYNFEGRFDLVSFVKLVHEAGLYANLRIGPYVCAEWNLGGFPVWLKDVAGIEFRTNNQ 144
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK EMQ FV+KIV +M+ + LF+ QGGPII+ QIENEYGN++++YG GK+Y+ WAA+M
Sbjct: 145 PFKAEMQTFVEKIVAMMKHDKLFAPQGGPIILAQIENEYGNIDAAYGAAGKEYMVWAANM 204
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
+ GLG GVPW+MC+Q+DAP+ I+D CNG+YCD + PN+ KP +WTENW GW+ WG
Sbjct: 205 SQGLGTGVPWIMCQQSDAPDYILDTCNGFYCDAWAPNNKKKPKMWTENWSGWFQKWGEAS 264
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRPVED+AFAVARFFQRGGSF NYYMYFGGTNFGR+SGGP+ TSYDYDAPIDE+G++
Sbjct: 265 PHRPVEDVAFAVARFFQRGGSFQNYYMYFGGTNFGRSSGGPYVTTSYDYDAPIDEFGVIR 324
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLK LHAAIKLCE AL + D YI LGQ QEAHVY + G+ C+AFLANID
Sbjct: 325 QPKWGHLKQLHAAIKLCEAALGSNDPT-YISLGQLQEAHVYGSTSSGA---CAAFLANID 380
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ A+V F ++Y LP WSVSILPDC+ NTAKV QT++ T+
Sbjct: 381 SSSDATVKFNSRTYLLPAWSVSILPDCKTVSHNTAKVDVQTAMPTM-------------- 426
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
K S T +W + EP+GVWS++ +LE +N TKD SDYLW+ T + +S
Sbjct: 427 ------KPSITGLAWESYPEPVGVWSDSGIVASALLEQINTTKDTSDYLWYTTSLDISQA 480
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGH----WVKVVQPVEFQSGYNDLI 581
D + K + ++SMRDV+ VF+NG+L GS + V QP+E SG+N L
Sbjct: 481 DAASGKA-----LLYLESMRDVVHVFVNGKLAGSASTKGTQLYAAVEQPIELASGHNSLA 535
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
+L TVGLQNYG F+E GAG G V + G +G IDL+ W +QVGLKGE I++
Sbjct: 536 ILCATVGLQNYGPFIETWGAGINGSVIVKGLPSGQIDLTAEEWIHQVGLKGESLAIFTES 595
Query: 642 ENE-AEWTDLTRDGIPSTFTWYK-----------------TYFDAPDGIDPVALDLGSMG 683
++ W+ G WYK +FD+P G DPVALDL SMG
Sbjct: 596 GSQRVRWSSAVPQG--QALVWYKVIFQHHGITCIVWIAMQAHFDSPSGNDPVALDLESMG 653
Query: 684 KGQAWVNGHHIGRYW-TVVAPK-GGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWL 741
KGQAW+NG IGR+W ++ AP GC TCDYRG+Y+S KC + CG P+Q WYHVPRSWL
Sbjct: 654 KGQAWINGQSIGRFWPSLRAPDTAGCPQTCDYRGSYSSSKCRSGCGQPSQRWYHVPRSWL 713
Query: 742 QASNNLLVIFEETGGNPFEISVKLRST 768
Q NL+V+FEE GG P +S R+
Sbjct: 714 QDGGNLVVLFEEEGGKPSGVSFVTRTV 740
>gi|14970839|emb|CAC44500.1| beta-galactosidase [Fragaria x ananassa]
Length = 843
Score = 836 bits (2159), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/815 (51%), Positives = 534/815 (65%), Gaps = 42/815 (5%)
Query: 31 SCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGA 90
SC +S AS VSYD +AI+I+G RR+LIS IHYPR+TPEMWPDLI ++K+GG
Sbjct: 21 SCFASVRAS-------VSYDSKAIVINGQRRILISGSIHYPRSTPEMWPDLIQRAKDGGL 73
Query: 91 DVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPV 150
DVI+TYVFWN HE G+Y F+ D+VKF+KLV +GLY+ LRIGPYVCAEWNFGGFPV
Sbjct: 74 DVIQTYVFWNGHEPSPGKYYFEDNYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPV 133
Query: 151 WLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESS 210
WL+ +PGI+FRT+N PFK++MQRF KIV++M+ E LF GGPII+ QIENEYG ME
Sbjct: 134 WLKYVPGIQFRTDNGPFKDQMQRFTTKIVNMMKAERLFESHGGPIILSQIENEYGPMEYE 193
Query: 211 YGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLW 270
G GK Y WAA MA+GLG GVPWVMCKQ DAP+ +I+ACNG+YCD + PN KP +W
Sbjct: 194 IGAPGKAYTDWAAQMAVGLGTGVPWVMCKQDDAPDPVINACNGFYCDYFSPNKAYKPKMW 253
Query: 271 TENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYIT 330
TE W GW+T +GG +P+RP EDLAF+VA+F Q+GG+F+NYYMY GGTNFGRT+GGPF T
Sbjct: 254 TEAWTGWFTEFGGAVPYRPAEDLAFSVAKFLQKGGAFINYYMYHGGTNFGRTAGGPFIAT 313
Query: 331 SYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANR 390
SYDYDAP+DEYGLL +PKWGHLKDLH AIKLCEPALV++D LG QEAHV+++N
Sbjct: 314 SYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSSDPT-VTPLGTYQEAHVFKSN- 371
Query: 391 YGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKT 450
C+AFLAN + + A V F Y LPPWS+SILPDC+NTV+NTA++ +QT+
Sbjct: 372 ---SGACAAFLANYNRKSFAKVAFGNMHYNLPPWSISILPDCKNTVYNTARIGAQTA--- 425
Query: 451 VEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDY 510
+P P I SW + +S+ +FT G+LE +N+T+D
Sbjct: 426 -RMKMPRVP---------IHGGF-----SWQAYNDETATYSDTSFTTAGLLEQINITRDA 470
Query: 511 SDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VK 566
+DYLW++T + + D F ++ P +T+ S LRVFINGQL G+ G +
Sbjct: 471 TDYLWYMTDVKI-DPSEDFLRSGNY-PVLTVLSAGHALRVFINGQLAGTAYGSLETPKLT 528
Query: 567 VVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTY 626
Q V ++G N + LLS VGL N G E AG G V L G G DLS W+Y
Sbjct: 529 FKQGVNLRAGINQIALLSIAVGLPNVGPHFETWNAGILGPVILNGLNEGRRDLSWQKWSY 588
Query: 627 QVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKG 685
++GLKGE ++S+ + EWT+ + TWYKT F+ P G P+ALD+GSMGKG
Sbjct: 589 KIGLKGEALSLHSLTGSSSVEWTEGSFVAQRQPLTWYKTTFNRPAGNSPLALDMGSMGKG 648
Query: 686 QAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASN 745
Q W+N IGRYW G C + C+Y G ++ KC +NCG +Q WYHVPRSWL +
Sbjct: 649 QVWINDRSIGRYWPAYKASGTCGE-CNYAGTFSEKKCLSNCGEASQRWYHVPRSWLNPTG 707
Query: 746 NLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEM 805
NLLV+ EE GG+P I + R VC + E P + W V G+++ + P+
Sbjct: 708 NLLVVLEEWGGDPNGIFLVRREVDSVCADIYEWQ-PNLMSW--QMQVSGRVN-KPLRPKA 763
Query: 806 HLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHA 840
HL C G ISSI+FAS+GTP+G C F G CHA
Sbjct: 764 HLSCGPGQKISSIKFASFGTPEGVCGSFREGGCHA 798
>gi|356550446|ref|XP_003543598.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 841
Score = 835 bits (2158), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/839 (51%), Positives = 543/839 (64%), Gaps = 45/839 (5%)
Query: 10 LLQCLALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIH 69
++ CL L + M + +++ S + S+ AS VSYD +AI I+G RR+LIS IH
Sbjct: 1 MVMCLKLKLI-MWNVALLLAFSLIGSAKAS-------VSYDSKAITINGQRRILISGSIH 52
Query: 70 YPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGL 129
YPR+TPEMWPDLI K+K+GG DVI+TYVFWN HE G+Y F+G D+VKF+KLV +GL
Sbjct: 53 YPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFIKLVQQAGL 112
Query: 130 YLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFS 189
Y+ LRIGPYVCAEWNFGGFPVWL+ IPGI FRT+N PFK +MQ+F KIVDLM+ E L+
Sbjct: 113 YVHLRIGPYVCAEWNFGGFPVWLKYIPGISFRTDNEPFKVQMQKFTTKIVDLMKAERLYE 172
Query: 190 WQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIID 249
QGGPIIM QIENEYG ME G GK Y KWAA MA+ LG GVPW+MCKQ D P+ +I+
Sbjct: 173 SQGGPIIMSQIENEYGPMEYEIGAAGKAYTKWAAEMAMELGTGVPWIMCKQDDTPDPLIN 232
Query: 250 ACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMN 309
CNG+YCD + PN KP +WTE W GW+T +GG +PHRP EDLAF+VARF Q+GGSF+N
Sbjct: 233 TCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGPVPHRPAEDLAFSVARFIQKGGSFIN 292
Query: 310 YYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAA 369
YYMY GGTNFGRT+GGPF TSYDYDAP+DEYGLL +PKWGHLKDLH AIKLCEPALV+
Sbjct: 293 YYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSG 352
Query: 370 DSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSIL 429
D K+G QEAHV+++ C+AFLAN + + A+V F Y LPPWS+SIL
Sbjct: 353 DPT-VTKIGNYQEAHVFKS----MSGACAAFLANYNPKSYATVAFGNMHYNLPPWSISIL 407
Query: 430 PDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGV 489
P+C+NTV+NTA+V SQ++ Q M + SW++ E
Sbjct: 408 PNCKNTVYNTARVGSQSA-----------------QMKMTRVPIHG-GLSWLSFNEETTT 449
Query: 490 WSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLR 549
+++FT+ G+LE LN T+D SDYLW+ T + V D + F + N P +T+ S L
Sbjct: 450 TDDSSFTMTGLLEQLNTTRDLSDYLWYSTDV-VLDPNEGFLR-NGKDPVLTVFSAGHALH 507
Query: 550 VFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRG 605
VFINGQL+G+ G + + V+ ++G N + LLS VGL N G E AG G
Sbjct: 508 VFINGQLSGTAYGSLEFPKLTFNEGVKLRTGVNKISLLSVAVGLPNVGPHFETWNAGVLG 567
Query: 606 QVKLTGFKNGDIDLSKILWTYQVGLKGE-FQQIYSIEENEAEWTDLTRDGIPSTFTWYKT 664
+ L+G G DLS W+Y+VGLKGE + EW + TWYKT
Sbjct: 568 PISLSGLNEGRRDLSWQKWSYKVGLKGETLSLHSLGGSSSVEWIQGSLVSQRQPLTWYKT 627
Query: 665 YFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTT 724
FDAPDG P+ALD+ SMGKGQ W+NG ++GRYW G C D CDY G YN +KC +
Sbjct: 628 TFDAPDGTAPLALDMNSMGKGQVWLNGQNLGRYWPAYKASGTC-DYCDYAGTYNENKCRS 686
Query: 725 NCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVR 784
NCG +Q WYHVP+SWL+ + NLLV+FEE GG+ IS+ R VC + E +
Sbjct: 687 NCGEASQRWYHVPQSWLKPTGNLLVVFEELGGDLNGISLVRRDIDSVCADIYEWQPNLI- 745
Query: 785 KWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMS 843
S GK + P++HL C G ISSI+FAS+GTP G C F G+CHA MS
Sbjct: 746 --SYQMQTSGKAPVR---PKVHLSCSPGQKISSIKFASFGTPVGSCGNFHEGSCHAHMS 799
>gi|157313304|gb|ABV32545.1| beta-galactosidase protein 2 [Prunus persica]
Length = 841
Score = 835 bits (2156), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 422/800 (52%), Positives = 525/800 (65%), Gaps = 35/800 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYD +AI+I+G RR+LIS IHYPR++PEMWPDLI K+KEGG DVI+TYVFWN HE
Sbjct: 27 SVSYDSKAIVINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPS 86
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+Y F+ D+VKF+KL+ +GLY+ LRIGPYVCAEWNFGGFPVWL+ IPGI+FRT+N
Sbjct: 87 PGKYYFEDNYDLVKFIKLIQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGIQFRTDNG 146
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK +MQRF KIV++M+ E LF QGGPII+ QIENEYG ME G GK Y WAA M
Sbjct: 147 PFKAQMQRFTTKIVNMMKAERLFQSQGGPIILSQIENEYGPMEYELGAPGKVYTDWAAHM 206
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
ALGLG GVPWVMCKQ DAP+ II+ACNG+YCD + PN KP +WTE W GWYT +GG +
Sbjct: 207 ALGLGTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKMWTEAWTGWYTEFGGAV 266
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P RP EDLAF+VARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGLL
Sbjct: 267 PSRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLLR 326
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLKDLH AIKLCEPALV+AD LG QEAHV+++ C+AFLAN +
Sbjct: 327 QPKWGHLKDLHRAIKLCEPALVSADPT-VTPLGTYQEAHVFKSK----SGACAAFLANYN 381
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ A V F Y LPPWS+SILPDC+NTV+NTA+V +Q++ + +P VP
Sbjct: 382 PRSFAKVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQSA----QMKMP-----RVPL 432
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
SW + +++ +FT G+LE +N T+D SDYLW++T + + D
Sbjct: 433 HGAF---------SWQAYNDETATYADTSFTTAGLLEQINTTRDSSDYLWYLTDVKI-DP 482
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLI 581
+ F ++ + P +TI S LRVFINGQL G+ G + Q V ++G N +
Sbjct: 483 NEEFLRSGKY-PVLTILSAGHALRVFINGQLAGTSYGSLEFPKLTFSQGVNLRAGINQIA 541
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE-FQQIYSI 640
LLS VGL N G E AG G V L G G DLS W+Y+VGLKGE
Sbjct: 542 LLSIAVGLPNVGPHFETWNAGVLGPVILNGLNEGRRDLSWQKWSYKVGLKGEALSLHSLS 601
Query: 641 EENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ EW + TWYKT F+AP G P+ALD+GSMGKGQ W+NG IGRYW
Sbjct: 602 GSSSVEWIQGSLVTRRQPLTWYKTTFNAPAGNSPLALDMGSMGKGQVWINGRSIGRYWPA 661
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
G C C+Y G+Y+ KC +NCG +Q WYHVPR+WL + NLLV+ EE GG+P
Sbjct: 662 YKASGSC-GACNYAGSYHEKKCLSNCGEASQRWYHVPRTWLNPTGNLLVVLEEWGGDPNG 720
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEF 820
I + R +C + E P + W S GK+ + P+ HL C G ISSI+F
Sbjct: 721 IFLVRREIDSICADIYEWQ-PNLMSWQMQAS--GKVK-KPVRPKAHLSCGPGQKISSIKF 776
Query: 821 ASYGTPQGRCQKFSRGNCHA 840
AS+GTP+G C F G+CHA
Sbjct: 777 ASFGTPEGGCGSFREGSCHA 796
>gi|302789848|ref|XP_002976692.1| hypothetical protein SELMODRAFT_268001 [Selaginella moellendorffii]
gi|300155730|gb|EFJ22361.1| hypothetical protein SELMODRAFT_268001 [Selaginella moellendorffii]
Length = 802
Score = 834 bits (2155), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 416/802 (51%), Positives = 535/802 (66%), Gaps = 71/802 (8%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NVSYDHR++I++G RR+L+S +HYPRATPEMWP +I K+KEGG DVIETYVFW+ HE
Sbjct: 19 NVSYDHRSLILNGKRRILLSGSVHYPRATPEMWPGIIQKAKEGGLDVIETYVFWDRHEPS 78
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
GQY F+G+ D+VKFVKLV +GL + LRIGPYVCAEWN GGFP+WLRDIP I FRT+N
Sbjct: 79 PGQYYFEGRYDLVKFVKLVQQAGLLVNLRIGPYVCAEWNLGGFPIWLRDIPHIVFRTDNE 138
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK+ MQ F+ KIV++M+EE LF+ QGGPII+ Q+ENEYGN++S YG+ G Y+ WAA M
Sbjct: 139 PFKKYMQSFLTKIVNMMKEENLFASQGGPIILAQVENEYGNVDSHYGEAGVRYINWAAEM 198
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A GVPW+MC Q+ PE IID CNG YCDG+ P Y KPT+WTE++ GW+T +G L
Sbjct: 199 AQAQNTGVPWIMCAQSKVPEYIIDTCNGMYCDGWNPTLYKKPTMWTESYTGWFTYYGWPL 258
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRPVED+AFAVARFF+RGGSF NYYMYFGGTNFGRTSGGP+ +SYDYDAP+DEYG+
Sbjct: 259 PHRPVEDIAFAVARFFERGGSFHNYYMYFGGTNFGRTSGGPYVASSYDYDAPLDEYGMQH 318
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
PKWGHLKDLH +KL E +++++ Q+ +LG NQEAHVY YG + C AFLAN+D
Sbjct: 319 LPKWGHLKDLHETLKLGEEVILSSE-GQHSELGPNQEAHVY---SYG--NGCVAFLANVD 372
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
V F SY+LP WSVSI+ DC+ FN+AKV SQ+++ + ++P
Sbjct: 373 SMNDTVVEFRNVSYSLPAWSVSIVLDCKTVAFNSAKVKSQSAV------VSMNP------ 420
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
S +S SW + EP+G+ S ++F + +LE + TKD SDYLW+ T+
Sbjct: 421 --------SKSSLSWTSFDEPVGI-SGSSFKAKQLLEQMETTKDTSDYLWYTTR------ 465
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQL------TGSVIGHWVKVVQPVEFQSGYND 579
+ T ++I+SMRDV+ +F+NGQ + SV+ + V P++ G N
Sbjct: 466 ----YATGTGSTWLSIESMRDVVHIFVNGQFQSSWHTSKSVL--YNSVEAPIKLAPGSNT 519
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS 639
+ LLS TVGLQN+GAF+E AG G + L G GD +LSK WTYQVGLKGE ++++
Sbjct: 520 IALLSATVGLQNFGAFIETWSAGLSGSLILKGLPGGDQNLSKQEWTYQVGLKGEDLKLFT 579
Query: 640 IEENEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
+E + + W+ ++ TWY T FDAP G DPVALDL SMGKGQAWVNG IGRYW
Sbjct: 580 VEGSRSVNWSAVSTK---KPLTWYMTEFDAPPGDDPVALDLASMGKGQAWVNGQSIGRYW 636
Query: 699 TVV-APKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
A C ++CDYRG+Y+ +KC T CG +Q WYHVPRSW++ NLLV+FEETGG+
Sbjct: 637 PAYKAADSVCPESCDYRGSYDQNKCLTGCGQSSQRWYHVPRSWMKPRGNLLVLFEETGGD 696
Query: 758 PFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISS 817
P I RST ++C +V ESH V+ W + +IS
Sbjct: 697 PSSIDFVTRSTNVICARVYESHPASVKLWCPG---------------------EKQVISQ 735
Query: 818 IEFASYGTPQGRCQKFSRGNCH 839
I FAS G P+G C F G+CH
Sbjct: 736 IRFASLGNPEGSCGSFKEGSCH 757
>gi|312283357|dbj|BAJ34544.1| unnamed protein product [Thellungiella halophila]
Length = 856
Score = 834 bits (2154), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/827 (51%), Positives = 533/827 (64%), Gaps = 37/827 (4%)
Query: 27 MIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSK 86
+I C+ F V+YD +A++I+G RR+L S IHYPR+TP+MW LI K+K
Sbjct: 13 LILWCCLGLLILGVGFVQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEGLIQKAK 72
Query: 87 EGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFG 146
+GG DVIETYVFWN HE G+Y+F+G+ND+V+FVK + +GLY LRIGPYVCAEWNFG
Sbjct: 73 DGGIDVIETYVFWNLHEPSPGKYDFEGRNDLVRFVKAIHKAGLYAHLRIGPYVCAEWNFG 132
Query: 147 GFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGN 206
GFPVWL+ +PGI FRT+N PFK M+ F ++IV+LM+ E LF QGGPII+ QIENEYG
Sbjct: 133 GFPVWLKYVPGISFRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGR 192
Query: 207 MESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNK 266
G +G +Y+ WAA MA+ GVPWVMCK+ DAP+ +I CNG+YCD + PN K
Sbjct: 193 QGQILGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVISTCNGFYCDSFAPNKPYK 252
Query: 267 PTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGP 326
PT+WTE W GW+T +GG + HRPV+DLAFAVARF Q+GGSF+NYYMY GGTNFGRT+GGP
Sbjct: 253 PTIWTEAWSGWFTEFGGPMHHRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGP 312
Query: 327 FYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVY 386
F TSYDYDAPIDEYGL+ +PK+GHLK+LH AIK+CE ALV+ D LG Q+AHVY
Sbjct: 313 FVTTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCEKALVSTDPV-VTSLGNKQQAHVY 371
Query: 387 RANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQT 446
+ +CSAFLAN D +AA V F Y LPPWS+SILPDCRN VFNTAKV QT
Sbjct: 372 SSE----SGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQT 427
Query: 447 SIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNV 506
S + LP S S QS +E LSS S + FT QG+LE +NV
Sbjct: 428 SQMEM---LPTSTG-SFQWQSYLED-LSSLDDS-------------STFTTQGLLEQINV 469
Query: 507 TKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG---- 562
T+D SDYLW++T + + + + SF E+ PT+ I S + +F+NGQL+GS G
Sbjct: 470 TRDTSDYLWYMTSVDIGETE-SFLHGGEL-PTLIIQSTGHAVHIFVNGQLSGSAFGTRQN 527
Query: 563 HWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKI 622
+ SG N + LLS VGL N G E G G V L G G DLS
Sbjct: 528 RRFTYKGKINLHSGTNRIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGKRDLSWQ 587
Query: 623 LWTYQVGLKGEFQQI-YSIEENEAEWTDLTRD-GIPSTFTWYKTYFDAPDGIDPVALDLG 680
WTYQVGLKGE + Y W D + P TW+KTYFDAP+G +P+ALD+
Sbjct: 588 KWTYQVGLKGEAMNLAYPTNTPSFGWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDME 647
Query: 681 SMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSW 740
MGKGQ WVNG IGRYWT A G C C Y G Y +KC + CG PTQ WYHVPRSW
Sbjct: 648 GMGKGQIWVNGESIGRYWTAFA-TGDCGH-CSYTGTYKPNKCNSGCGQPTQKWYHVPRSW 705
Query: 741 LQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINK 800
L+ S NLLVIFEE GGNP +S+ RS VC +VSE H P ++ W G+
Sbjct: 706 LKPSQNLLVIFEELGGNPSTVSLVKRSVSGVCAEVSEYH-PNIKNWQIESYGKGQ---TF 761
Query: 801 MAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
P++HL C G IS+I+FAS+GTP G C + +G+CHA S +++
Sbjct: 762 RRPKVHLKCSPGQAISAIKFASFGTPLGTCGSYQQGDCHAATSYAIL 808
>gi|165906266|gb|ABY71826.1| beta-galactosidase [Prunus salicina]
Length = 836
Score = 833 bits (2153), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/802 (52%), Positives = 534/802 (66%), Gaps = 44/802 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYDH+AIII+G +R+LIS IHYPR+TPEMWPDLI KSK+GG DVI+TYVFWN HE
Sbjct: 27 SVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKSKDGGLDVIQTYVFWNGHEPS 86
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+Y F+ + D+VKF+KLV +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N
Sbjct: 87 PGKYYFEDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGIVFRTDNE 146
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ+F +KIV +M+ E LF QGGPII+ QIENE+G +E G GK Y KWAA M
Sbjct: 147 PFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAAQM 206
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL GVPW+MCKQ DAP+ +ID CNG+YC+ + PN KP +WTE W GWYT +GG +
Sbjct: 207 AVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCENFTPNKNYKPKMWTEVWTGWYTEFGGAV 266
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P RP EDLAF++ARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL
Sbjct: 267 PTRPAEDLAFSIARFIQKGGSFVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGLPR 326
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPKWGHL+DLH AIK E ALV+A+ + LG +QEAHV++ S+S C+AFLAN D
Sbjct: 327 EPKWGHLRDLHKAIKSSESALVSAEPS-VTSLGNSQEAHVFK-----SKSGCAAFLANYD 380
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSP-NISVP 464
++A V+F Y LPPWS+SILPDCR V+NTA++ SQ+S + ++P ++P
Sbjct: 381 TKSSAKVSFGNGQYELPPWSISILPDCRTAVYNTARLGSQSS------QMKMTPVKSALP 434
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
QS IE SS + T+ G+ E +NVT+D +DY W++T I +S
Sbjct: 435 WQSFIEESASSD--------------ESDTTTLDGLWEQINVTRDTTDYSWYMTDITISP 480
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDL 580
D+ F K E P +TI S L VFINGQL+G+V G + Q V+ +SG N L
Sbjct: 481 DE-GFIKRGE-SPLLTIYSAGHALHVFINGQLSGTVYGALENPKLTFSQNVKLRSGINKL 538
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS +VGL N G E AG G V L G +G D+S+ WTY+VGLKGE ++++
Sbjct: 539 ALLSISVGLPNVGLHFETWNAGVLGPVTLKGLNSGTWDMSRWKWTYKVGLKGEALGLHTV 598
Query: 641 E-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
+ EW + TWY+ F+AP G P+ALD+ SMGKGQ W+NG IGR+W
Sbjct: 599 SGSSSVEWAEGPSMAQKQPLTWYRATFNAPPGNGPLALDMSSMGKGQIWINGQSIGRHWP 658
Query: 700 VVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPF 759
+G C + C Y G Y+ KC T+CG P+Q WYHVPRSWL S NLLV+FEE GG+P
Sbjct: 659 AYTARGNCGN-CYYAGTYDDKKCRTHCGEPSQRWYHVPRSWLTTSGNLLVVFEEWGGDPT 717
Query: 760 EISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKM-APEMHLHCQDGYIISSI 818
+IS+ R T VC + E + + KL+ K+ P+ HL C G +IS I
Sbjct: 718 KISLVERRTSSVCADIFEGQ--------PTLTNSQKLASGKLNRPKAHLWCPPGQVISDI 769
Query: 819 EFASYGTPQGRCQKFSRGNCHA 840
+FASYG QG C F G+CHA
Sbjct: 770 KFASYGLSQGTCGSFQEGSCHA 791
>gi|357130338|ref|XP_003566806.1| PREDICTED: beta-galactosidase 2-like [Brachypodium distachyon]
Length = 831
Score = 833 bits (2152), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 421/812 (51%), Positives = 530/812 (65%), Gaps = 54/812 (6%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD +A++++G RR+L+S IHYPR+ PEMWPDLI K+K+GG DV++TYVFWN HE
Sbjct: 29 VTYDRKAVVVNGQRRILLSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSP 88
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY+F+G+ D+V F+KLV +GLY+ LRIGPYVCAEWNFGGFP+WL+ +PGI FRT+N P
Sbjct: 89 GQYHFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPIWLKYVPGISFRTDNEP 148
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK EMQ+F KIV +M+ E LF WQGGPII+ QIENE+G +E G+ KDY WAA+MA
Sbjct: 149 FKAEMQKFTTKIVQMMKSERLFEWQGGPIILSQIENEFGPLEWDQGEPAKDYASWAANMA 208
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+ L GVPW+MCK+ DAP+ II+ CNG+YCD + PN +KPT+WTE W WYT +G +P
Sbjct: 209 MALNTGVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTAWYTGFGIPVP 268
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
HRPVEDLA+ VA+F Q+GGSF+NYYMY GGTNF RT+GGPF TSYDYDAP+DEYGLL E
Sbjct: 269 HRPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFERTAGGPFIATSYDYDAPLDEYGLLRE 328
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHLK+LH AIKLCEPALVAAD LG Q+A V+R+ S C+AFL N +
Sbjct: 329 PKWGHLKELHRAIKLCEPALVAADPI-LSSLGNAQKASVFRS----STGACAAFLENKHK 383
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+ A V+F G Y LPPWS+SILPDC+ TVFNTA+V SQ S +E++ L
Sbjct: 384 LSYARVSFNGMHYDLPPWSISILPDCKTTVFNTARVGSQISQMKMEWAGGL--------- 434
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSE-NNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
+W + E I +SE +FT G+LE +N+T+D +DYLW+ T + V+ D
Sbjct: 435 ------------TWQSYNEEINSFSELESFTTVGLLEQINMTRDNTDYLWYTTYVDVAKD 482
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQP-------VEFQSGYN 578
+ T+ P +T+ S L VFINGQL+G+V G V P V+ SG N
Sbjct: 483 EQFL--TSGKNPKLTVMSAGHALHVFINGQLSGTVYG---SVENPKLTYTGKVKLWSGSN 537
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE-FQQI 637
+ LS VGL N G E AG G V L G G DL+ WTYQVGLKGE
Sbjct: 538 TISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNEGKRDLTWQKWTYQVGLKGEAMSLH 597
Query: 638 YSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
+ EW + + TWYK +F+APDG +P+ALD+ SMGKGQ W+NG IGRY
Sbjct: 598 SLSGSSSVEWGEPVQK---QPLTWYKAFFNAPDGDEPLALDMNSMGKGQIWINGQGIGRY 654
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
W G C CDYRG YN KC TNCG+P+Q WYHVPR WL + NLLVIFEE GG+
Sbjct: 655 WPGYKASGTCGH-CDYRGEYNETKCQTNCGDPSQRWYHVPRPWLNPTGNLLVIFEEWGGD 713
Query: 758 PFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISS 817
P IS+ R+T VC VSE P ++ W + E+HL C G I+
Sbjct: 714 PTGISMVKRTTGSVCADVSEWQ-PSIKNWRTK---------DYEKAEVHLQCDHGRKITE 763
Query: 818 IEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
I+FAS+GTPQG C +S G CHA S + +
Sbjct: 764 IKFASFGTPQGSCGNYSEGGCHAHRSYDIFKK 795
>gi|157313306|gb|ABV32546.1| beta-galactosidase protein 1 [Prunus persica]
Length = 836
Score = 833 bits (2152), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 419/802 (52%), Positives = 534/802 (66%), Gaps = 44/802 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYDH+AIII+G +R+LIS IHYPR+TPEMWPDLI KSK+GG DVI+TYVFWN HE
Sbjct: 27 SVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKSKDGGLDVIQTYVFWNGHEPS 86
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+Y F+ + D+VKF+KLV +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N
Sbjct: 87 PGKYYFEDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGIVFRTDNE 146
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ+F +KIV +M+ E LF QGGPII+ QIENE+G +E G GK Y KWAA M
Sbjct: 147 PFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAAQM 206
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL GVPW+MCKQ DAP+ +ID CNG+YC+ + PN KP +WTE W GWYT +GG +
Sbjct: 207 AVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCENFTPNKNYKPKMWTEVWTGWYTEFGGAV 266
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P RP EDLAF++ARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL
Sbjct: 267 PTRPAEDLAFSIARFIQKGGSFVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGLPR 326
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPKWGHL+DLH AIK E ALV+A+ + LG QEAHV++ S+S C+AFLAN D
Sbjct: 327 EPKWGHLRDLHKAIKSSESALVSAEPS-VTSLGNGQEAHVFK-----SKSGCAAFLANYD 380
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSP-NISVP 464
++A V+F Y LPPW +SILPDC+ V+NTA++ SQ+S + ++P ++P
Sbjct: 381 TKSSAKVSFGNGQYELPPWPISILPDCKTAVYNTARLGSQSS------QMKMTPVKSALP 434
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
QS +E SS + T+ G+ E +NVT+D +DYLW++T I +S
Sbjct: 435 WQSFVEESASSD--------------ESDTTTLDGLWEQINVTRDTTDYLWYMTDITISP 480
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDL 580
D+ F K E P +TI S L VFINGQL+G+V G + Q V+ +SG N L
Sbjct: 481 DE-GFIKRGE-SPLLTIYSAGHALHVFINGQLSGTVYGALENPKLTFSQNVKPRSGINKL 538
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS +VGL N G E AG G V L G +G D+S+ WTY++GLKGE ++++
Sbjct: 539 ALLSISVGLPNVGLHFETWNAGVLGPVTLKGLNSGTWDMSRWKWTYKIGLKGEALGLHTV 598
Query: 641 E-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
+ EW + TWYK F+AP G P+ALD+ SMGKGQ W+NG IGR+W
Sbjct: 599 SGSSSVEWAEGPSMAQKQPLTWYKATFNAPPGNGPLALDMSSMGKGQIWINGQSIGRHWP 658
Query: 700 VVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPF 759
+G C + C Y G Y+ KC T+CG P+Q WYHVPRSWL S NLLV+FEE GG+P
Sbjct: 659 AYTARGNCGN-CYYAGTYDDKKCRTHCGEPSQRWYHVPRSWLTPSGNLLVVFEEWGGDPT 717
Query: 760 EISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKM-APEMHLHCQDGYIISSI 818
+IS+ R T VC + E + + KL+ K+ P+ HL C G +IS I
Sbjct: 718 KISLVERRTSSVCADIFEGQ--------PTLTNSQKLASGKLNRPKAHLWCPPGQVISDI 769
Query: 819 EFASYGTPQGRCQKFSRGNCHA 840
+FASYG PQG C F G+CHA
Sbjct: 770 KFASYGLPQGTCGSFQEGSCHA 791
>gi|30690633|ref|NP_849506.1| beta-galactosidase 3 [Arabidopsis thaliana]
gi|332661247|gb|AEE86647.1| beta-galactosidase 3 [Arabidopsis thaliana]
Length = 855
Score = 833 bits (2151), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 419/813 (51%), Positives = 528/813 (64%), Gaps = 39/813 (4%)
Query: 42 FKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNA 101
F V+YD +A++I+G RR+L S IHYPR+TP+MW DLI K+K+GG DVIETYVFWN
Sbjct: 28 FVQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGIDVIETYVFWNL 87
Query: 102 HESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFR 161
HE G+Y+F+G+ND+V+FVK + +GLY LRIGPYVCAEWNFGGFPVWL+ +PGI FR
Sbjct: 88 HEPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFR 147
Query: 162 TNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKW 221
T+N PFK M+ F ++IV+LM+ E LF QGGPII+ QIENEYG G +G +Y+ W
Sbjct: 148 TDNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQLLGAEGHNYMTW 207
Query: 222 AASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTW 281
AA MA+ GVPWVMCK+ DAP+ +I+ CNG+YCD + PN KP +WTE W GW+T +
Sbjct: 208 AAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNKPYKPLIWTEAWSGWFTEF 267
Query: 282 GGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEY 341
GG + HRPV+DLAF VARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEY
Sbjct: 268 GGPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEY 327
Query: 342 GLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFL 401
GL+ +PK+GHLK+LH AIK+CE ALV+AD +G Q+AHVY A +CSAFL
Sbjct: 328 GLIRQPKYGHLKELHRAIKMCEKALVSADPV-VTSIGNKQQAHVYSAE----SGDCSAFL 382
Query: 402 ANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNI 461
AN D +AA V F Y LPPWS+SILPDCRN VFNTAKV QTS
Sbjct: 383 ANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQTS-------------- 428
Query: 462 SVPQQSMIESKLSSTSKSWMTVKEPIGVWSENN-FTVQGILEHLNVTKDYSDYLWHITQI 520
Q M+ + + + W + E + +++ FT G+LE +NVT+D SDYLW++T +
Sbjct: 429 ---QMEMLPT--DTKNFQWESYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSV 483
Query: 521 YVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSG 576
+ D + SF E+ PT+ I S + +F+NGQL+GS G + SG
Sbjct: 484 DIGDSE-SFLHGGEL-PTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSG 541
Query: 577 YNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQ 636
N + LLS VGL N G E G G V L G G +DLS WTYQVGLKGE
Sbjct: 542 TNRIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMN 601
Query: 637 I-YSIEENEAEWTDLTRD-GIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHI 694
+ + W D + P TW+KTYFDAP+G +P+ALD+ MGKGQ WVNG I
Sbjct: 602 LAFPTNTPSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESI 661
Query: 695 GRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEET 754
GRYWT A G C C Y G Y +KC T CG PTQ WYHVPR+WL+ S NLLVIFEE
Sbjct: 662 GRYWTAFA-TGDCSH-CSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEEL 719
Query: 755 GGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYI 814
GGNP +S+ RS VC +VSE H P ++ W G+ P++HL C G
Sbjct: 720 GGNPSTVSLVKRSVSGVCAEVSEYH-PNIKNWQIESYGKGQ---TFHRPKVHLKCSPGQA 775
Query: 815 ISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
I+SI+FAS+GTP G C + +G CHA S +++
Sbjct: 776 IASIKFASFGTPLGTCGSYQQGECHAATSYAIL 808
>gi|449491392|ref|XP_004158882.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 854
Score = 832 bits (2150), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/816 (51%), Positives = 532/816 (65%), Gaps = 41/816 (5%)
Query: 42 FKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNA 101
F +V+YD +AI+I+G RR+L S IHYPR+TPEMW LI K+KEGG DV+ETYVFWN
Sbjct: 24 FVQCSVTYDRKAILINGQRRVLFSGSIHYPRSTPEMWEGLIQKAKEGGLDVVETYVFWNV 83
Query: 102 HESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFR 161
HE G YNF+G+ D+ +F+K + +GLY LRIGPYVCAEWNFGGFPVWL+ +PGI FR
Sbjct: 84 HEPSPGNYNFEGRYDLARFIKTIQKAGLYANLRIGPYVCAEWNFGGFPVWLKYVPGISFR 143
Query: 162 TNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKW 221
T+N PFK MQ F +KIV LM+ E LF QGGPII+ QIENEYG +G G++Y+ W
Sbjct: 144 TDNEPFKRAMQGFTEKIVGLMKSENLFESQGGPIILSQIENEYGVQSKLFGAAGQNYMTW 203
Query: 222 AASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTW 281
AA MA+GLG GVPWVMCK+ DAP+ +I+ CNG+YCD + PN KPT+WTE W GW+ +
Sbjct: 204 AAKMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDAFSPNRPYKPTMWTEAWSGWFNEF 263
Query: 282 GGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEY 341
GG + RPV+DLAFAVARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEY
Sbjct: 264 GGPIHQRPVQDLAFAVARFIQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEY 323
Query: 342 GLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFL 401
GL+ +PK+GHLK+LH A+K+CE ALV+AD LG +Q+A+VY + NC+AFL
Sbjct: 324 GLIRQPKYGHLKELHRAVKMCEKALVSADPI-VTSLGSSQQAYVYTS----ESGNCAAFL 378
Query: 402 ANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNI 461
+N D +AA V F Y LPPWS+SILPDCRN VFNTAKV QTS
Sbjct: 379 SNYDTDSAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTS-------------- 424
Query: 462 SVPQQSMIESKLSSTSKSWMTVKEPIGVWSEN-NFTVQGILEHLNVTKDYSDYLWHITQI 520
Q M+ + +S W + E + ++ T G+LE +NVTKD SDYLW+IT +
Sbjct: 425 ---QLEMLPT--NSPMLLWESYNEDVSAEDDSTTMTASGLLEQINVTKDTSDYLWYITSV 479
Query: 521 YVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSG 576
+ + SF E+ PT+ + S + +FING+L+GS G V F++G
Sbjct: 480 DIGSTE-SFLHGGEL-PTLIVQSTGHAVHIFINGRLSGSAFGSRENRRFTYTGKVNFRAG 537
Query: 577 YNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQ 636
N + LLS VGL N G E G G V L G G +DLS WTY+VGLKGE
Sbjct: 538 RNTIALLSVAVGLPNVGGHFETWNTGILGPVALHGLDQGKLDLSWAKWTYKVGLKGEAMN 597
Query: 637 IYSIEE-NEAEWTDLTRDG-IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHI 694
+ S + EW + + P TW+K+ FDAP+G +P+A+D+ MGKGQ W+NG I
Sbjct: 598 LVSPNGISSVEWMEGSLAAQAPQPLTWHKSNFDAPEGDEPLAIDMRGMGKGQIWINGVSI 657
Query: 695 GRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEET 754
GRYWT A G C D C+Y G + KC CG PTQ WYHVPR+WL+ +NLLV+FEE
Sbjct: 658 GRYWTAYA-TGNC-DKCNYAGTFRPPKCQQGCGQPTQRWYHVPRAWLKPKDNLLVVFEEL 715
Query: 755 GGNPFEISVKLRSTRIVCEQVSESHYPPVRKWS-NSYSVDGKLSINKMAPEMHLHCQDGY 813
GGNP IS+ RS VC VSE H P ++ W SY L P++HL C GY
Sbjct: 716 GGNPTSISLVKRSVTGVCADVSEYH-PTLKNWHIESYGKSEDLH----RPKVHLKCSAGY 770
Query: 814 IISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
I+SI+FAS+GTP G C + +G CHAPMS ++ +
Sbjct: 771 SITSIKFASFGTPLGTCGSYQQGTCHAPMSYDILEK 806
>gi|18419821|ref|NP_568001.1| beta-galactosidase 3 [Arabidopsis thaliana]
gi|75202767|sp|Q9SCV9.1|BGAL3_ARATH RecName: Full=Beta-galactosidase 3; Short=Lactase 3; Flags:
Precursor
gi|6686878|emb|CAB64739.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|15810493|gb|AAL07134.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|20259271|gb|AAM14371.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332661246|gb|AEE86646.1| beta-galactosidase 3 [Arabidopsis thaliana]
Length = 856
Score = 832 bits (2150), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 419/813 (51%), Positives = 528/813 (64%), Gaps = 39/813 (4%)
Query: 42 FKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNA 101
F V+YD +A++I+G RR+L S IHYPR+TP+MW DLI K+K+GG DVIETYVFWN
Sbjct: 28 FVQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGIDVIETYVFWNL 87
Query: 102 HESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFR 161
HE G+Y+F+G+ND+V+FVK + +GLY LRIGPYVCAEWNFGGFPVWL+ +PGI FR
Sbjct: 88 HEPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFR 147
Query: 162 TNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKW 221
T+N PFK M+ F ++IV+LM+ E LF QGGPII+ QIENEYG G +G +Y+ W
Sbjct: 148 TDNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQLLGAEGHNYMTW 207
Query: 222 AASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTW 281
AA MA+ GVPWVMCK+ DAP+ +I+ CNG+YCD + PN KP +WTE W GW+T +
Sbjct: 208 AAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNKPYKPLIWTEAWSGWFTEF 267
Query: 282 GGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEY 341
GG + HRPV+DLAF VARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEY
Sbjct: 268 GGPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEY 327
Query: 342 GLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFL 401
GL+ +PK+GHLK+LH AIK+CE ALV+AD +G Q+AHVY A +CSAFL
Sbjct: 328 GLIRQPKYGHLKELHRAIKMCEKALVSADPV-VTSIGNKQQAHVYSAE----SGDCSAFL 382
Query: 402 ANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNI 461
AN D +AA V F Y LPPWS+SILPDCRN VFNTAKV QTS
Sbjct: 383 ANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQTS-------------- 428
Query: 462 SVPQQSMIESKLSSTSKSWMTVKEPIGVWSENN-FTVQGILEHLNVTKDYSDYLWHITQI 520
Q M+ + + + W + E + +++ FT G+LE +NVT+D SDYLW++T +
Sbjct: 429 ---QMEMLPT--DTKNFQWESYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSV 483
Query: 521 YVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSG 576
+ D + SF E+ PT+ I S + +F+NGQL+GS G + SG
Sbjct: 484 DIGDSE-SFLHGGEL-PTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSG 541
Query: 577 YNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQ 636
N + LLS VGL N G E G G V L G G +DLS WTYQVGLKGE
Sbjct: 542 TNRIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMN 601
Query: 637 I-YSIEENEAEWTDLTRD-GIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHI 694
+ + W D + P TW+KTYFDAP+G +P+ALD+ MGKGQ WVNG I
Sbjct: 602 LAFPTNTPSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESI 661
Query: 695 GRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEET 754
GRYWT A G C C Y G Y +KC T CG PTQ WYHVPR+WL+ S NLLVIFEE
Sbjct: 662 GRYWTAFA-TGDCSH-CSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEEL 719
Query: 755 GGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYI 814
GGNP +S+ RS VC +VSE H P ++ W G+ P++HL C G
Sbjct: 720 GGNPSTVSLVKRSVSGVCAEVSEYH-PNIKNWQIESYGKGQ---TFHRPKVHLKCSPGQA 775
Query: 815 ISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
I+SI+FAS+GTP G C + +G CHA S +++
Sbjct: 776 IASIKFASFGTPLGTCGSYQQGECHAATSYAIL 808
>gi|297798272|ref|XP_002867020.1| hypothetical protein ARALYDRAFT_491000 [Arabidopsis lyrata subsp.
lyrata]
gi|297312856|gb|EFH43279.1| hypothetical protein ARALYDRAFT_491000 [Arabidopsis lyrata subsp.
lyrata]
Length = 853
Score = 832 bits (2149), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/812 (52%), Positives = 527/812 (64%), Gaps = 37/812 (4%)
Query: 42 FKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNA 101
F V+YD +A++I+G RR+L S IHYPR+TP+MW LI K+K+GG DVIETYVFWN
Sbjct: 25 FVQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEGLIQKAKDGGIDVIETYVFWNL 84
Query: 102 HESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFR 161
HE G+Y+F+G+ND+V+FVK + +GLY LRIGPYVCAEWNFGGFPVWL+ +PGI FR
Sbjct: 85 HEPTPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFR 144
Query: 162 TNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKW 221
T+N PFK M+ F ++IV+LM+ E LF QGGPII+ QIENEYG G +G +Y+ W
Sbjct: 145 TDNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQLLGAEGHNYMTW 204
Query: 222 AASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTW 281
AA MA+ GVPWVMCK+ DAP+ +I+ CNG+YCD + PN KP +WTE W GW+T +
Sbjct: 205 AAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNKPYKPLIWTEAWSGWFTEF 264
Query: 282 GGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEY 341
GG + HRPV+DLAF VARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEY
Sbjct: 265 GGPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEY 324
Query: 342 GLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFL 401
GL+ EPK+GHLK+LH AIK+CE ALV+AD +G Q+AHVY A +CSAFL
Sbjct: 325 GLIREPKYGHLKELHRAIKMCEKALVSADPV-VTSIGNKQQAHVYSA----ESGDCSAFL 379
Query: 402 ANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNI 461
AN D +AA V F Y LPPWS+SILPDCRN VFNTAKV QTS + + +
Sbjct: 380 ANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQTS----QMEMLPTDTK 435
Query: 462 SVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIY 521
+ QS +E LSS S + FT QG+LE +NVT+D SDYLW++T +
Sbjct: 436 NFQWQSYLED-LSSLDDS-------------STFTTQGLLEQINVTRDTSDYLWYMTSVD 481
Query: 522 VSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGY 577
+ D + SF E+ PT+ I S + +F+NGQL+GS G + SG
Sbjct: 482 IGDTE-SFLHGGEL-PTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGT 539
Query: 578 NDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI 637
N + LLS VGL N G E G G V L G G DLS WTYQVGLKGE +
Sbjct: 540 NRIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGKRDLSWQKWTYQVGLKGEAMNL 599
Query: 638 -YSIEENEAEWTDLTRD-GIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
+ W D + P TW+KTYFDAP+G +P+ALD+ MGKGQ WVNG IG
Sbjct: 600 AFPTNTRSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIG 659
Query: 696 RYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
RYWT A G C C Y G Y +KC T CG PTQ +YHVPRSWL+ S NLLVIFEE G
Sbjct: 660 RYWTAFA-TGDCSQ-CSYTGTYKPNKCQTGCGQPTQRYYHVPRSWLKPSQNLLVIFEELG 717
Query: 756 GNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYII 815
GNP +S+ RS VC +VSE H P ++ W G+ P++HL C G I
Sbjct: 718 GNPSSVSLVKRSVSGVCAEVSEYH-PNIKNWQIESYGKGQ---TFHRPKVHLKCSPGQAI 773
Query: 816 SSIEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
+SI+FAS+GTP G C + +G CHA S +++
Sbjct: 774 ASIKFASFGTPLGTCGSYQQGECHAATSYAIL 805
>gi|4006924|emb|CAB16852.1| beta-galactosidase like protein [Arabidopsis thaliana]
gi|7270584|emb|CAB80302.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 853
Score = 832 bits (2148), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 419/813 (51%), Positives = 528/813 (64%), Gaps = 39/813 (4%)
Query: 42 FKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNA 101
F V+YD +A++I+G RR+L S IHYPR+TP+MW DLI K+K+GG DVIETYVFWN
Sbjct: 25 FVQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGIDVIETYVFWNL 84
Query: 102 HESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFR 161
HE G+Y+F+G+ND+V+FVK + +GLY LRIGPYVCAEWNFGGFPVWL+ +PGI FR
Sbjct: 85 HEPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFR 144
Query: 162 TNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKW 221
T+N PFK M+ F ++IV+LM+ E LF QGGPII+ QIENEYG G +G +Y+ W
Sbjct: 145 TDNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQLLGAEGHNYMTW 204
Query: 222 AASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTW 281
AA MA+ GVPWVMCK+ DAP+ +I+ CNG+YCD + PN KP +WTE W GW+T +
Sbjct: 205 AAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNKPYKPLIWTEAWSGWFTEF 264
Query: 282 GGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEY 341
GG + HRPV+DLAF VARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEY
Sbjct: 265 GGPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEY 324
Query: 342 GLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFL 401
GL+ +PK+GHLK+LH AIK+CE ALV+AD +G Q+AHVY A +CSAFL
Sbjct: 325 GLIRQPKYGHLKELHRAIKMCEKALVSADPV-VTSIGNKQQAHVYSA----ESGDCSAFL 379
Query: 402 ANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNI 461
AN D +AA V F Y LPPWS+SILPDCRN VFNTAKV QTS
Sbjct: 380 ANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQTS-------------- 425
Query: 462 SVPQQSMIESKLSSTSKSWMTVKEPIGVWSENN-FTVQGILEHLNVTKDYSDYLWHITQI 520
Q M+ + + + W + E + +++ FT G+LE +NVT+D SDYLW++T +
Sbjct: 426 ---QMEMLPT--DTKNFQWESYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSV 480
Query: 521 YVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSG 576
+ D + SF E+ PT+ I S + +F+NGQL+GS G + SG
Sbjct: 481 DIGDSE-SFLHGGEL-PTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSG 538
Query: 577 YNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQ 636
N + LLS VGL N G E G G V L G G +DLS WTYQVGLKGE
Sbjct: 539 TNRIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMN 598
Query: 637 I-YSIEENEAEWTDLTRD-GIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHI 694
+ + W D + P TW+KTYFDAP+G +P+ALD+ MGKGQ WVNG I
Sbjct: 599 LAFPTNTPSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESI 658
Query: 695 GRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEET 754
GRYWT A G C C Y G Y +KC T CG PTQ WYHVPR+WL+ S NLLVIFEE
Sbjct: 659 GRYWTAFA-TGDCSH-CSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEEL 716
Query: 755 GGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYI 814
GGNP +S+ RS VC +VSE H P ++ W G+ P++HL C G
Sbjct: 717 GGNPSTVSLVKRSVSGVCAEVSEYH-PNIKNWQIESYGKGQ---TFHRPKVHLKCSPGQA 772
Query: 815 ISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
I+SI+FAS+GTP G C + +G CHA S +++
Sbjct: 773 IASIKFASFGTPLGTCGSYQQGECHAATSYAIL 805
>gi|449464526|ref|XP_004149980.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 854
Score = 831 bits (2147), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/816 (51%), Positives = 532/816 (65%), Gaps = 41/816 (5%)
Query: 42 FKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNA 101
F +V+YD +AI+I+G RR+L S IHYPR+TPEMW LI K+KEGG DV+ETYVFWN
Sbjct: 24 FVQCSVTYDRKAILINGQRRVLFSGSIHYPRSTPEMWEGLIQKAKEGGLDVVETYVFWNV 83
Query: 102 HESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFR 161
HE G YNF+G+ D+V+F+K + +GLY LRIGPYVCAEWNFGGFPVWL+ +PGI FR
Sbjct: 84 HEPSPGNYNFEGRYDLVRFIKTIQKAGLYANLRIGPYVCAEWNFGGFPVWLKYVPGISFR 143
Query: 162 TNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKW 221
T+N PFK MQ F +KIV LM+ E LF QGGPII+ QIENEYG +G G++Y+ W
Sbjct: 144 TDNEPFKRAMQGFTEKIVGLMKSENLFESQGGPIILSQIENEYGVQSKLFGAAGQNYMTW 203
Query: 222 AASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTW 281
AA MA+GLG GVPWVMCK+ DAP+ +I+ CNG+YCD + PN KPT+WTE W GW+ +
Sbjct: 204 AAKMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDAFSPNRPYKPTMWTEAWSGWFNEF 263
Query: 282 GGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEY 341
GG + RPV+DLAFAVA F Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEY
Sbjct: 264 GGPIHQRPVQDLAFAVALFIQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEY 323
Query: 342 GLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFL 401
GL+ +PK+GHLK+LH A+K+CE ALV+AD LG +Q+A+VY + NC+AFL
Sbjct: 324 GLIRQPKYGHLKELHRAVKMCEKALVSADPI-VTSLGSSQQAYVYTS----ESGNCAAFL 378
Query: 402 ANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNI 461
+N D +AA V F Y LPPWS+SILPDCRN VFNTAKV QTS
Sbjct: 379 SNYDTDSAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTS-------------- 424
Query: 462 SVPQQSMIESKLSSTSKSWMTVKEPIGVWSEN-NFTVQGILEHLNVTKDYSDYLWHITQI 520
Q M+ + +S W + E + ++ T G+LE +NVTKD SDYLW+IT +
Sbjct: 425 ---QLEMLPT--NSPMLLWESYNEDVSAEDDSTTMTASGLLEQINVTKDTSDYLWYITSV 479
Query: 521 YVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSG 576
+ + SF E+ PT+ + S + +FING+L+GS G V F++G
Sbjct: 480 DIGSTE-SFLHGGEL-PTLIVQSTGHAVHIFINGRLSGSAFGSRENRRFTYTGKVNFRAG 537
Query: 577 YNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQ 636
N + LLS VGL N G E G G V L G G +DLS WTY+VGLKGE
Sbjct: 538 RNTIALLSVAVGLPNVGGHFETWNTGILGPVALHGLDQGKLDLSWAKWTYKVGLKGEAMN 597
Query: 637 IYSIEE-NEAEWTDLTRDG-IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHI 694
+ S + EW + + P TW+K+ FDAP+G +P+A+D+ MGKGQ W+NG I
Sbjct: 598 LVSPNGISSVEWMEGSLAAQAPQPLTWHKSNFDAPEGDEPLAIDMRGMGKGQIWINGVSI 657
Query: 695 GRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEET 754
GRYWT A G C D C+Y G + KC CG PTQ WYHVPR+WL+ +NLLV+FEE
Sbjct: 658 GRYWTAYA-TGNC-DKCNYAGTFRPPKCQQGCGQPTQRWYHVPRAWLKPKDNLLVVFEEL 715
Query: 755 GGNPFEISVKLRSTRIVCEQVSESHYPPVRKWS-NSYSVDGKLSINKMAPEMHLHCQDGY 813
GGNP IS+ RS VC VSE H P ++ W SY L P++HL C GY
Sbjct: 716 GGNPTSISLVKRSVTGVCADVSEYH-PTLKNWHIESYGKSEDLH----RPKVHLKCSAGY 770
Query: 814 IISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
I+SI+FAS+GTP G C + +G CHAPMS ++ +
Sbjct: 771 SITSIKFASFGTPLGTCGSYQQGTCHAPMSYDILEK 806
>gi|302782774|ref|XP_002973160.1| hypothetical protein SELMODRAFT_413650 [Selaginella moellendorffii]
gi|300158913|gb|EFJ25534.1| hypothetical protein SELMODRAFT_413650 [Selaginella moellendorffii]
Length = 805
Score = 830 bits (2145), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/804 (51%), Positives = 536/804 (66%), Gaps = 72/804 (8%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NVSYDHR++I++G RR+L+S +HYPRATPEMWP +I K+KEGG DVIETYVFW+ HE
Sbjct: 19 NVSYDHRSLILNGKRRILLSGSVHYPRATPEMWPGIIQKAKEGGLDVIETYVFWDRHEPS 78
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
GQY F+G+ D+VKFVKLV +GL + LRIGPYVCAEWN GGFP+WLRDIP I FRT+N
Sbjct: 79 PGQYYFEGRYDLVKFVKLVQQAGLLMNLRIGPYVCAEWNLGGFPIWLRDIPHIVFRTDNE 138
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK+ MQ F+ KIV++M+EE LF+ QGGPII+ Q+ENEYGN++S YG+ G Y+ WAA M
Sbjct: 139 PFKKYMQSFLTKIVNMMKEENLFASQGGPIILAQVENEYGNVDSHYGEAGVRYINWAAEM 198
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A GVPW+MC Q+ PE IID CNG YCDG+ P Y KPT+WTE++ GW+T +G +
Sbjct: 199 AQAQNTGVPWIMCAQSKVPEYIIDTCNGMYCDGWNPILYKKPTMWTESYTGWFTYYGWPI 258
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYM--YFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
PHRPVED+AFAVARFF+RGGSF NYYM YFGGTNFGRTSGGP+ +SYDYDAP+DEYG+
Sbjct: 259 PHRPVEDIAFAVARFFERGGSFHNYYMVWYFGGTNFGRTSGGPYVASSYDYDAPLDEYGM 318
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
PKWGHLKDLH +KL E +++++ Q+ +LG NQEAHVY YG + C AFLAN
Sbjct: 319 QHLPKWGHLKDLHETLKLGEEVILSSE-GQHSELGPNQEAHVY---SYG--NGCVAFLAN 372
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISV 463
+D V F SY+LP WSVSIL DC+ FN+AKV SQ+++ + +SP
Sbjct: 373 VDSMNDTVVEFRNVSYSLPAWSVSILLDCKTVAFNSAKVKSQSAV------VSMSP---- 422
Query: 464 PQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
S ++ SW + EP+G+ S ++F + +LE + TKD SDYLW+ T + +
Sbjct: 423 ----------SKSTLSWTSFDEPVGI-SGSSFKAKQLLEQMETTKDTSDYLWYTTSVEAT 471
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQL------TGSVIGHWVKVVQPVEFQSGY 577
S W ++I+SMRDV+ +F+NGQ + SV+ + V P+ G
Sbjct: 472 GTG-STW--------LSIESMRDVVHIFVNGQFQSSWHTSKSVL--YNSVEAPITLAPGS 520
Query: 578 NDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI 637
N + LLS TVGLQN+GAF+E AG G + L G GD +LSK WTYQVGLKGE ++
Sbjct: 521 NTIALLSATVGLQNFGAFIETWSAGLSGSLILKGLPGGDQNLSKQEWTYQVGLKGEDLKL 580
Query: 638 YSIEENEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGR 696
+++E + + W+ ++ + TWY T FDAP G DPVALDL SMGKGQAWVNG IGR
Sbjct: 581 FTVEGSRSVNWSAVSTE---KPLTWYMTEFDAPPGDDPVALDLASMGKGQAWVNGQSIGR 637
Query: 697 YWTVV-APKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
YW A C ++CDYRG+Y+ +KC T CG +Q WYHVPRSW++ NLLV+FEETG
Sbjct: 638 YWPAYKAADSVCPESCDYRGSYDQNKCLTGCGQSSQRWYHVPRSWMKPRGNLLVLFEETG 697
Query: 756 GNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYII 815
G+P I RST ++C +V ESH V+ W + +I
Sbjct: 698 GDPSSIDFVTRSTNVICARVYESHPASVKLWCPG---------------------EKQVI 736
Query: 816 SSIEFASYGTPQGRCQKFSRGNCH 839
S I FAS G P+G C F G+CH
Sbjct: 737 SQIRFASLGNPEGSCGSFKEGSCH 760
>gi|227053553|gb|ACP18875.1| beta-galactosidase pBG(a) [Carica papaya]
Length = 836
Score = 830 bits (2144), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 419/801 (52%), Positives = 534/801 (66%), Gaps = 36/801 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYDH+AI I+G RR+L+S IHYPR+TPEMWPDLI K+KEGG DVI+TYVFWN HE
Sbjct: 20 SVSYDHKAITINGKRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPS 79
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+Y F G D+V+F+KLV +GLY+ LRIGPYVCAEWNFGGFPVWL+ IPGI FRTNN
Sbjct: 80 PGKYYFGGNYDLVRFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGIAFRTNNG 139
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQRF KKIVD+M+ E LF QGGPII+ QIENEYG ME G G+ Y +WAA M
Sbjct: 140 PFKAYMQRFTKKIVDMMKAEGLFESQGGPIILSQIENEYGPMEYELGAAGRAYSQWAAQM 199
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GLG GVPWVMCKQ DAP+ II++CNG+YCD + PN KP +WTE W GW+T +GG +
Sbjct: 200 AVGLGTGVPWVMCKQDDAPDPIINSCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGAV 259
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RPVEDLAF+VARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL+
Sbjct: 260 PYRPVEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLVR 319
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLKDLH AIKLCEPALV+ D + + LG+ QEAHV+++ +YG +C+AFLAN +
Sbjct: 320 QPKWGHLKDLHRAIKLCEPALVSGDPS-VMPLGRFQEAHVFKS-KYG---HCAAFLANYN 374
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ A V F Y LPPWS+SILPDC+NTV+NTA+V +Q++ + +P+ + +
Sbjct: 375 PRSFAKVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQSARMKM---VPVPIHGAFSW 431
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
Q+ E SS E +FT G++E +N T+D SDYLW+ T + + D
Sbjct: 432 QAYNEEAPSSN--------------GERSFTTVGLVEQINTTRDVSDYLWYSTDVKI-DP 476
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLI 581
D F KT + PT+T+ S L VF+N QL+G+ G + + V ++G N +
Sbjct: 477 DEGFLKTGKY-PTLTVLSAGHALHVFVNDQLSGTAYGSLEFPKITFSKGVNLRAGINKIS 535
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE-FQQIYSI 640
+LS VGL N G E AG G V L G G DLS W+Y+VG++GE
Sbjct: 536 ILSIAVGLPNVGPHFETWNAGVLGPVTLNGLNEGRRDLSWQKWSYKVGVEGEAMSLHSLS 595
Query: 641 EENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ EWT + TW+KT F+AP G P+ALD+ SMGKGQ W+NG IGR+W
Sbjct: 596 GSSSVEWTAGSFVARRQPLTWFKTTFNAPAGNSPLALDMNSMGKGQIWINGKSIGRHWPA 655
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
G C CDY G +N KC +NCG +Q WYHVPRSW + NLLV+FEE GG+P
Sbjct: 656 YKASGSC-GWCDYAGTFNEKKCLSNCGEASQRWYHVPRSWPNPTGNLLVVFEEWGGDPNG 714
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINK-MAPEMHLHCQDGYIISSIE 819
IS+ R VC + E + P +Y + +NK + P+ HL C G ISS++
Sbjct: 715 ISLVRREVDSVCADIYE--WQPTLM---NYQMQASGKVNKPLRPKAHLQCGPGQKISSVK 769
Query: 820 FASYGTPQGRCQKFSRGNCHA 840
FAS+GTP+G C + G+CHA
Sbjct: 770 FASFGTPEGACGSYREGSCHA 790
>gi|297735069|emb|CBI17431.3| unnamed protein product [Vitis vinifera]
Length = 845
Score = 830 bits (2143), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/814 (51%), Positives = 532/814 (65%), Gaps = 45/814 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YD +AI+I+G RR+LIS IHYPR+TP+MW D+I K+K+GG DV+ETYVFWN HE
Sbjct: 27 SVTYDRKAIVINGQRRILISGSIHYPRSTPDMWEDIIQKAKDGGLDVVETYVFWNVHEPS 86
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G YNF+G+ D+V+F++ V +GLY LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N
Sbjct: 87 PGSYNFEGRYDLVRFIRTVQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 146
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ F +KIV LM+ E LF QGGPII+ QIENEYG G G DY+ WAA+M
Sbjct: 147 PFKRAMQGFTEKIVGLMKSERLFESQGGPIILSQIENEYGVQSKLLGDAGHDYMTWAANM 206
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GLG GVPWVMCK+ DAP+ +I+ CNG+YCD + PN KPT+WTE W GW+ +GG L
Sbjct: 207 AVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDAFSPNKPYKPTIWTEAWSGWFNEFGGPL 266
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
RPV+DLAFAVARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGL+
Sbjct: 267 HQRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLVR 326
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PK+GHLK+LH +IKLCE ALV+AD LG Q+AHVY ++ +C+AFL+N D
Sbjct: 327 QPKYGHLKELHRSIKLCERALVSADPI-VSSLGSFQQAHVYSSD----AGDCAAFLSNYD 381
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
++A V F Y LPPWS+SILPDCRN VFNTAKV QT+ + LP + +
Sbjct: 382 TKSSARVMFNNMHYNLPPWSISILPDCRNAVFNTAKVGVQTAHMEM---LPTNAEM---- 434
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENN-FTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
SW + E I +++ FT G+LE +NVT+D SDYLW+IT+I +
Sbjct: 435 ------------LSWESYDEDISSLDDSSTFTTLGLLEQINVTRDASDYLWYITRIDIGS 482
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDL 580
+ SF + E+ PT+ + + + VFINGQLTGS G + V +G N +
Sbjct: 483 SE-SFLRGGEL-PTLILQTTGHAVHVFINGQLTGSAFGTREYRRFTFTEKVNLHAGTNTI 540
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS VGL N G E G G V L G G DLS WTY+VGLKGE + +
Sbjct: 541 ALLSVAVGLPNVGGHFETWNTGILGPVALHGLNQGKWDLSWQRWTYKVGLKGEAMNL--V 598
Query: 641 EENEAEWTDLTRDGIPST----FTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGR 696
N D + + + TW+K +F+AP+G +P+ALD+ MGKGQ W+NG IGR
Sbjct: 599 SPNGISSVDWMQGSLAAQRQQPLTWHKAFFNAPEGDEPLALDMEGMGKGQVWINGQSIGR 658
Query: 697 YWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG 756
YWT A G CQ C Y G Y KC CG PTQ WYHVPRSWL+ + NLLV+FEE GG
Sbjct: 659 YWTAYA-NGNCQG-CSYSGTYRPPKCQLGCGQPTQRWYHVPRSWLKPTQNLLVVFEELGG 716
Query: 757 NPFEISVKLRSTRIVCEQVSESHYPPVRKWS-NSYSVDGKLSINKMAPEMHLHCQDGYII 815
+P IS+ RS VC V E H P ++ W SY +L P++HL C G I
Sbjct: 717 DPSRISLVRRSMTSVCADVFEYH-PNIKNWHIESYGKTEELH----KPKVHLRCGPGQSI 771
Query: 816 SSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
SSI+FASYGTP G C F +G CHAP S ++V +
Sbjct: 772 SSIKFASYGTPLGTCGSFEQGPCHAPDSYAIVEK 805
>gi|359476858|ref|XP_002274449.2| PREDICTED: beta-galactosidase 3 [Vitis vinifera]
Length = 898
Score = 829 bits (2142), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/814 (51%), Positives = 532/814 (65%), Gaps = 45/814 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YD +AI+I+G RR+LIS IHYPR+TP+MW D+I K+K+GG DV+ETYVFWN HE
Sbjct: 80 SVTYDRKAIVINGQRRILISGSIHYPRSTPDMWEDIIQKAKDGGLDVVETYVFWNVHEPS 139
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G YNF+G+ D+V+F++ V +GLY LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N
Sbjct: 140 PGSYNFEGRYDLVRFIRTVQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 199
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ F +KIV LM+ E LF QGGPII+ QIENEYG G G DY+ WAA+M
Sbjct: 200 PFKRAMQGFTEKIVGLMKSERLFESQGGPIILSQIENEYGVQSKLLGDAGHDYMTWAANM 259
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GLG GVPWVMCK+ DAP+ +I+ CNG+YCD + PN KPT+WTE W GW+ +GG L
Sbjct: 260 AVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDAFSPNKPYKPTIWTEAWSGWFNEFGGPL 319
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
RPV+DLAFAVARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGL+
Sbjct: 320 HQRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLVR 379
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PK+GHLK+LH +IKLCE ALV+AD LG Q+AHVY ++ +C+AFL+N D
Sbjct: 380 QPKYGHLKELHRSIKLCERALVSADPI-VSSLGSFQQAHVYSSD----AGDCAAFLSNYD 434
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
++A V F Y LPPWS+SILPDCRN VFNTAKV QT+ + LP + +
Sbjct: 435 TKSSARVMFNNMHYNLPPWSISILPDCRNAVFNTAKVGVQTAHMEM---LPTNAEM---- 487
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENN-FTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
SW + E I +++ FT G+LE +NVT+D SDYLW+IT+I +
Sbjct: 488 ------------LSWESYDEDISSLDDSSTFTTLGLLEQINVTRDASDYLWYITRIDIGS 535
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDL 580
+ SF + E+ PT+ + + + VFINGQLTGS G + V +G N +
Sbjct: 536 SE-SFLRGGEL-PTLILQTTGHAVHVFINGQLTGSAFGTREYRRFTFTEKVNLHAGTNTI 593
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS VGL N G E G G V L G G DLS WTY+VGLKGE + +
Sbjct: 594 ALLSVAVGLPNVGGHFETWNTGILGPVALHGLNQGKWDLSWQRWTYKVGLKGEAMNL--V 651
Query: 641 EENEAEWTDLTRDGIPST----FTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGR 696
N D + + + TW+K +F+AP+G +P+ALD+ MGKGQ W+NG IGR
Sbjct: 652 SPNGISSVDWMQGSLAAQRQQPLTWHKAFFNAPEGDEPLALDMEGMGKGQVWINGQSIGR 711
Query: 697 YWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG 756
YWT A G CQ C Y G Y KC CG PTQ WYHVPRSWL+ + NLLV+FEE GG
Sbjct: 712 YWTAYA-NGNCQG-CSYSGTYRPPKCQLGCGQPTQRWYHVPRSWLKPTQNLLVVFEELGG 769
Query: 757 NPFEISVKLRSTRIVCEQVSESHYPPVRKWS-NSYSVDGKLSINKMAPEMHLHCQDGYII 815
+P IS+ RS VC V E H P ++ W SY +L P++HL C G I
Sbjct: 770 DPSRISLVRRSMTSVCADVFEYH-PNIKNWHIESYGKTEELH----KPKVHLRCGPGQSI 824
Query: 816 SSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
SSI+FASYGTP G C F +G CHAP S ++V +
Sbjct: 825 SSIKFASYGTPLGTCGSFEQGPCHAPDSYAIVEK 858
>gi|449457508|ref|XP_004146490.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
gi|449500002|ref|XP_004160975.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 846
Score = 829 bits (2141), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 421/810 (51%), Positives = 526/810 (64%), Gaps = 44/810 (5%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD +AI+I+G RR+L S IHYPR+TPEMW DLI K+K GG DV+ETYVFWN HE
Sbjct: 27 VTYDRKAILINGQRRILFSGSIHYPRSTPEMWEDLILKAKNGGLDVVETYVFWNVHEPYP 86
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G YNF+G+ D+V+F+K + +GLY LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N
Sbjct: 87 GIYNFEGRFDLVRFIKTIQKAGLYANLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEA 146
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ F +KIV LM+ E LF QGGPII+ QIENEYG +G+ G +Y+ WAA+MA
Sbjct: 147 FKNAMQGFTEKIVALMKSENLFESQGGPIILAQIENEYGTESKLFGEAGYNYMTWAANMA 206
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+GL GVPWVMCK+ DAP+ +I+ CNG+YCD + PN KPT+WTE W GW++ +GG L
Sbjct: 207 VGLQTGVPWVMCKEADAPDPVINTCNGFYCDTFSPNKPYKPTMWTEAWTGWFSEFGGPLH 266
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
RPV+DLAFAVARF QRGGS +NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGLL +
Sbjct: 267 QRPVQDLAFAVARFIQRGGSLVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLLRQ 326
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PK+GHLK+LH AIK+CEPALV+AD LG Q+AHVY + G C+AFL+N D
Sbjct: 327 PKYGHLKELHRAIKMCEPALVSADPI-VTSLGDYQQAHVYSSESGG----CAAFLSNYDT 381
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+ A V F + Y LPPWS+SILPDC+N VFNTAKV QT+ Q
Sbjct: 382 KSFARVLFNNRHYNLPPWSISILPDCKNAVFNTAKVGVQTA-----------------QM 424
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENN-FTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
M+ ++ ST+ SW + E I + + T G+LE +NVT+D SDYLW+IT + +S
Sbjct: 425 GMLPAE--STTLSWESYFEDISALDDRSMMTSPGLLEQINVTRDTSDYLWYITSVDISSS 482
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLI 581
+ F E+ PT+ + S + VFINGQL+GSV G V +G N +
Sbjct: 483 E-PFLHGGEL-PTLLVQSTGHAVHVFINGQLSGSVSGSRKSRRFTYSGKVNLHAGTNKIG 540
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS VGL N G E G G V L G + G DLS WTY+VGLKGE + S
Sbjct: 541 LLSVAVGLPNVGGHFETWNTGILGPVVLYGLRQGKWDLSSQKWTYKVGLKGEAMNLISPS 600
Query: 642 E-NEAEWTDLTRDG-IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
+ EW + P TW+K YFDAP+G +P+ALD+ MGKGQ W+NG IGRYWT
Sbjct: 601 GFSPVEWMQASLAAQTPQPLTWHKAYFDAPEGEEPLALDMEGMGKGQIWINGQSIGRYWT 660
Query: 700 VVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPF 759
A +G C C+Y A+ KC CG PTQ WYHVPRSWL+ NLLV+FEE GGNP
Sbjct: 661 AYA-RGNC-SRCNYATAFRPPKCQLGCGQPTQRWYHVPRSWLRPEQNLLVVFEEVGGNPS 718
Query: 760 EISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIE 819
IS+ R VC VSE H P + W ++ + P++HL C G ISSI+
Sbjct: 719 RISIVKRLVTSVCADVSEFH-PTFKNW--------HITAKFITPKVHLSCDPGQYISSIK 769
Query: 820 FASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
FAS+GTP G C + +G CHAP S ++ +
Sbjct: 770 FASFGTPLGTCGSYQQGTCHAPSSSGILEK 799
>gi|350539595|ref|NP_001234465.1| beta-galactosidase precursor [Solanum lycopersicum]
gi|1352077|sp|P48980.1|BGAL_SOLLC RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; AltName:
Full=Exo-(1-->4)-beta-D-galactanase; Flags: Precursor
gi|6649906|gb|AAF21626.1|AF023847_1 beta-galactosidase precursor [Solanum lycopersicum]
gi|971485|emb|CAA58734.1| putative beta-galactosidase/galactanase [Solanum lycopersicum]
gi|4138139|emb|CAA10174.1| ss-galactosidase [Solanum lycopersicum]
Length = 835
Score = 828 bits (2140), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 411/805 (51%), Positives = 530/805 (65%), Gaps = 41/805 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYDH+AII++G R++LIS IHYPR+TPEMWPDLI K+KEGG DVI+TYVFWN HE
Sbjct: 23 SVSYDHKAIIVNGQRKILISGSIHYPRSTPEMWPDLIQKAKEGGVDVIQTYVFWNGHEPE 82
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+Y F+ + D+VKF+K+V +GLY+ LRIGPY CAEWNFGGFPVWL+ +PGI FRTNN
Sbjct: 83 EGKYYFEERYDLVKFIKVVQEAGLYVHLRIGPYACAEWNFGGFPVWLKYVPGISFRTNNE 142
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ+F KIVD+M+ E L+ QGGPII+ QIENEYG ME G+ GK Y +WAA M
Sbjct: 143 PFKAAMQKFTTKIVDMMKAEKLYETQGGPIILSQIENEYGPMEWELGEPGKVYSEWAAKM 202
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ LG GVPW+MCKQ D P+ II+ CNG+YCD + PN NKP +WTE W W+T +GG +
Sbjct: 203 AVDLGTGVPWIMCKQDDVPDPIINTCNGFYCDYFTPNKANKPKMWTEAWTAWFTEFGGPV 262
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RP ED+AFAVARF Q GGSF+NYYMY GGTNFGRTSGGPF TSYDYDAP+DE+G L
Sbjct: 263 PYRPAEDMAFAVARFIQTGGSFINYYMYHGGTNFGRTSGGPFIATSYDYDAPLDEFGSLR 322
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLKDLH AIKLCEPALV+ D LG QEA V+++ C+AFLAN +
Sbjct: 323 QPKWGHLKDLHRAIKLCEPALVSVDPT-VTSLGNYQEARVFKS----ESGACAAFLANYN 377
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+H+ A V F Y LPPWS+SILPDC+NTV+NTA+V +Q++
Sbjct: 378 QHSFAKVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQSA------------------ 419
Query: 466 QSMIESKLSSTSK--SWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
+ K++ S+ SW + E ++ FTV G+LE +N+T+D SDYLW++T I +
Sbjct: 420 ----QMKMTPVSRGFSWESFNEDAASHEDDTFTVVGLLEQINITRDVSDYLWYMTDIEI- 474
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYND 579
D + + P +T+ S L VF+NGQL G+V G + + ++G N
Sbjct: 475 -DPTEGFLNSGNWPWLTVFSAGHALHVFVNGQLAGTVYGSLENPKLTFSNGINLRAGVNK 533
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS 639
+ LLS VGL N G E AG G V L G G DL+ W Y+VGLKGE ++S
Sbjct: 534 ISLLSIAVGLPNVGPHFETWNAGVLGPVSLNGLNEGTRDLTWQKWFYKVGLKGEALSLHS 593
Query: 640 IEENEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
+ + + EW + + +WYKT F+APDG +P+ALD+ +MGKGQ W+NG +GR+W
Sbjct: 594 LSGSPSVEWVEGSLVAQKQPLSWYKTTFNAPDGNEPLALDMNTMGKGQVWINGQSLGRHW 653
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
G C C+Y G ++ KC TNCG +Q WYHVPRSWL + NLLV+FEE GG+P
Sbjct: 654 PAYKSSGSC-SVCNYTGWFDEKKCLTNCGEGSQRWYHVPRSWLYPTGNLLVVFEEWGGDP 712
Query: 759 FEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSI 818
+ I++ R VC + E P + W V GK + P+ HL C G ISSI
Sbjct: 713 YGITLVKREIGSVCADIYEWQ-PQLLNWQR--LVSGKFD-RPLRPKAHLKCAPGQKISSI 768
Query: 819 EFASYGTPQGRCQKFSRGNCHAPMS 843
+FAS+GTP+G C F +G+CHAP S
Sbjct: 769 KFASFGTPEGVCGNFQQGSCHAPRS 793
>gi|224082924|ref|XP_002306893.1| predicted protein [Populus trichocarpa]
gi|222856342|gb|EEE93889.1| predicted protein [Populus trichocarpa]
Length = 853
Score = 828 bits (2138), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 419/809 (51%), Positives = 525/809 (64%), Gaps = 37/809 (4%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD +AIIIDG RR+LIS IHYPR+TP+MW DL+ K+K+GG DVI+TYVFWN HE
Sbjct: 28 VTYDKKAIIIDGQRRILISGSIHYPRSTPDMWEDLVQKAKDGGLDVIDTYVFWNVHEPSP 87
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G YNF+G+ D+V+F+K V GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N P
Sbjct: 88 GNYNFEGRFDLVRFIKTVQKGGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGP 147
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ F +KIV +M++E LF QGGPII QIENEYG ++G G Y+ WAA MA
Sbjct: 148 FKAAMQGFTQKIVQMMKDERLFQSQGGPIIFSQIENEYGPESRAFGAAGHSYINWAAQMA 207
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+GL GVPWVMCK+ DAP+ +I+ CNG+YCD + PN KPT+WTE W GW+T +GG
Sbjct: 208 VGLKTGVPWVMCKEDDAPDPVINTCNGFYCDAFSPNKPYKPTMWTEAWSGWFTEFGGAFH 267
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
HRPV+DLAFAVARF Q+GGSF+NYYMY GGTNFGR++GGPF TSYDYDAPIDEYGL+ E
Sbjct: 268 HRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGLIRE 327
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PK+GHLK+LH AIKLCE LV++D + LG Q+AHV+ + + +CSAFLAN
Sbjct: 328 PKYGHLKELHRAIKLCEHELVSSDPTITL-LGTYQQAHVFSSGK----RSCSAFLANYHT 382
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+AA V F Y LPPWS+SILPDCRN VFNTAKV QTS
Sbjct: 383 QSAARVMFNNMHYVLPPWSISILPDCRNVVFNTAKVGVQTS-----------------HV 425
Query: 467 SMIESKLSSTSKSWMTVKEPI-GVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
M+ + S SW + E I + + + T G++E +NVT+D +DYLW+IT + ++
Sbjct: 426 QMLPT--GSRFFSWESYDEDISSLGASSRMTALGLMEQINVTRDTTDYLWYITSVNINPS 483
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLI 581
+ SF + + PT+T++S L VFINGQ +GS G PV ++G N +
Sbjct: 484 E-SFLRGGQ-WPTLTVESAGHALHVFINGQFSGSAFGTRENREFTFTGPVNLRAGTNRIA 541
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS VGL N G E G G V L G G+ DL+ W+YQVGLKGE + S
Sbjct: 542 LLSIAVGLPNVGVHYETWKTGILGPVMLHGLNQGNKDLTWQQWSYQVGLKGEAMNLVSPN 601
Query: 642 E-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ +W + WYK YFDAP G +P+ALD+ SMGKGQ W+NG IGRYW
Sbjct: 602 RASSVDWIQGSLATRQQPLKWYKAYFDAPGGNEPLALDMRSMGKGQVWINGQSIGRYWLS 661
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
A KG C +C Y G + KC CG PTQ WYHVPRSWL+ NLLVIFEE GG+ +
Sbjct: 662 YA-KGDC-SSCGYSGTFRPPKCQLGCGQPTQRWYHVPRSWLKPKQNLLVIFEELGGDASK 719
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEF 820
IS+ RST VC E H+P + ++ +G+ N ++HL C G IS+I F
Sbjct: 720 ISLVKRSTTSVCADAFE-HHPTIENYNT--ESNGESERNLHQAKVHLRCAPGQSISAINF 776
Query: 821 ASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
AS+GTP G C F G CHAP S SVV +
Sbjct: 777 ASFGTPTGTCGSFQEGTCHAPNSHSVVEK 805
>gi|224116208|ref|XP_002317239.1| predicted protein [Populus trichocarpa]
gi|222860304|gb|EEE97851.1| predicted protein [Populus trichocarpa]
Length = 849
Score = 827 bits (2137), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 406/809 (50%), Positives = 540/809 (66%), Gaps = 37/809 (4%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YDH+A++IDG RR+L S IHYPR TPE+WP++I KSKEGG DVIETYVFWN HE +R
Sbjct: 36 VTYDHKALVIDGKRRVLQSGSIHYPRTTPEVWPEIIRKSKEGGLDVIETYVFWNYHEPVR 95
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY F+G+ D+V+FVK V +GL++ LRIGPY CAEWN+GGFP+WL IPG++FRT+N
Sbjct: 96 GQYYFEGRFDLVRFVKTVQEAGLFVHLRIGPYACAEWNYGGFPLWLHFIPGVQFRTSNDI 155
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK M+ F+ KIVDLM+++ LF+ QGGPII+ Q+ENEYGN++ +YG G+ YVKWAA A
Sbjct: 156 FKNAMKSFLTKIVDLMKDDNLFASQGGPIILAQVENEYGNVQWAYGVGGELYVKWAAETA 215
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+ L VPWVMC Q DAP+ +I+ CNG+YCD + PNS +KP +WTEN+ GW+ +G +P
Sbjct: 216 ISLNTTVPWVMCVQEDAPDPVINTCNGFYCDQFTPNSPSKPKMWTENYSGWFLAFGYAVP 275
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
+RPVEDLAFAVARFF+ GGSF NYYMYFGGTNFGRT+GGP TSYDYDAPIDEYG + +
Sbjct: 276 YRPVEDLAFAVARFFEYGGSFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 335
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHL+DLH+AIK CE LV++D + +LG EAHVY Y ++C+AFLAN D
Sbjct: 336 PKWGHLRDLHSAIKQCEEYLVSSDPV-HQQLGNKLEAHVY----YKHSNDCAAFLANYDS 390
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+ A+VTF G +Y LP WSVSIL DC+N +FNTAKV +Q I FS +
Sbjct: 391 GSDANVTFNGNTYFLPAWSVSILADCKNVIFNTAKVVTQRHIGDALFS----------RS 440
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
+ ++ L + S W KE +G+W N+FT G+LE +N TKD SD+LW+ T +YV
Sbjct: 441 TTVDGNLVAASP-WSWYKEEVGIWGNNSFTKPGLLEQINTTKDTSDFLWYSTSLYVE--- 496
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLIL 582
+ + I+S+ VF+N + G+ + + + + G N L +
Sbjct: 497 ----AGQDKEHLLNIESLGHAALVFVNKRFVAFGYGNHDDASFSLTREISLEEGNNTLDV 552
Query: 583 LSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE- 641
LS +G+QNYG + + GAG V L DLS WTYQVGL+GE+ + ++
Sbjct: 553 LSMLIGVQNYGPWFDVQGAGIH-SVFLVDLHKSKKDLSSGKWTYQVGLEGEYLGLDNVSL 611
Query: 642 ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV- 700
N + W+ T + + WYK AP+G P+AL+L SMGKGQAW+NG IGRYW+
Sbjct: 612 ANSSLWSQGTSLPVNKSLIWYKATIIAPEGNGPLALNLASMGKGQAWINGQSIGRYWSAY 671
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
++P GC D CDYRGAYNS KC CG P QT YH+PR+W+ NLLV+ EE GG+P +
Sbjct: 672 LSPSAGCTDNCDYRGAYNSFKCQKKCGQPAQTLYHIPRTWVHPGENLLVLHEELGGDPSQ 731
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEF 820
IS+ R+ + +C VSE PP W + L +PE+ L C+ G+ I++I F
Sbjct: 732 ISLLTRTGQDICSIVSEDDPPPADSWKPN------LEFMSQSPEVRLTCEHGWHIAAINF 785
Query: 821 ASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
AS+GTP+G+C F+ GNCHA M L++V +
Sbjct: 786 ASFGTPEGKCGTFTPGNCHADM-LTIVQK 813
>gi|359474925|ref|XP_002263382.2| PREDICTED: beta-galactosidase 3-like [Vitis vinifera]
gi|297744764|emb|CBI38026.3| unnamed protein product [Vitis vinifera]
Length = 846
Score = 827 bits (2136), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/814 (51%), Positives = 534/814 (65%), Gaps = 45/814 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YD +A+II+G RR+L S IHYPR+TP+MW LI K+K+GG D I+TYVFWN HE
Sbjct: 26 SVTYDRKALIINGQRRILFSGSIHYPRSTPQMWEGLIQKAKDGGLDAIDTYVFWNLHEPS 85
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+YNF+G+ D+V+F+KL+ +GLY+ LRIGPY+CAEWNFGGFPVWL+ +PG+ FRT+N
Sbjct: 86 PGKYNFEGRYDLVRFIKLIQKAGLYVHLRIGPYICAEWNFGGFPVWLKFVPGVSFRTDNE 145
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQRF +KIV +M+ E LF QGGPII+ QIENEYG+ ++G G Y+ WAA M
Sbjct: 146 PFKMAMQRFTQKIVQMMKNEKLFESQGGPIIISQIENEYGHESRAFGAPGYAYLTWAAKM 205
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ + GVPWVMCK+ DAP+ +I+ CNG+YCD + PN NKPTLWTE W GW+T + G +
Sbjct: 206 AVAMDTGVPWVMCKEDDAPDPVINTCNGFYCDYFSPNKPNKPTLWTEAWSGWFTEFAGPI 265
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
RPVEDL+FAV RF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGL+
Sbjct: 266 QQRPVEDLSFAVTRFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIR 325
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PK+GHLK+LH AIKLCE AL++AD A+ LG +A V+ Y C+AFL+N +
Sbjct: 326 QPKYGHLKELHKAIKLCERALLSADPAE-TSLGTYAKAQVF----YSESGGCAAFLSNYN 380
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+AA VTF Y L PWS+SILPDC+N VFNTA V QTS Q
Sbjct: 381 PTSAARVTFNSMHYNLAPWSISILPDCKNVVFNTATVGVQTS-----------------Q 423
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENN-FTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
M+ + +S SW T E I +++ TV G+LE LNVT+D SDYLW+ T+I +S
Sbjct: 424 MQMLPT--NSELLSWETFNEDISSADDDSTITVVGLLEQLNVTRDTSDYLWYSTRIDISS 481
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDL 580
+ SF + PT+ + S + VFING L+GS G V Q+G N +
Sbjct: 482 SE-SFLHGGQ-HPTLIVQSTGHAMHVFINGHLSGSAFGTREDRRFTFTGDVNLQTGSNII 539
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
+LS VGL N G E G G V L G G DLS W+YQVGLKGE + +
Sbjct: 540 SVLSIAVGLPNNGPHFETWSTGVLGPVVLHGLDEGKKDLSWQKWSYQVGLKGEAMNL--V 597
Query: 641 EENEAEWTDLTRDGI----PSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGR 696
N D + + TWYK YFDAPDG +P+ALD+GSMGKGQ W+NG IGR
Sbjct: 598 SPNVISNIDWMKGSLFAQKQQPLTWYKAYFDAPDGDEPLALDMGSMGKGQVWINGQSIGR 657
Query: 697 YWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG 756
YWT A KG C C Y G + + KC CG PTQ WYHVPRSWL+ + NLLV+FEE GG
Sbjct: 658 YWTAYA-KGNCSG-CSYSGTFRTTKCQFGCGQPTQRWYHVPRSWLKPTQNLLVLFEELGG 715
Query: 757 NPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMA-PEMHLHCQDGYII 815
+ +IS RS VC +VSE H+P ++ W ++ + +M+ P++HLHC G I
Sbjct: 716 DASKISFMKRSVTTVCAEVSE-HHPNIKNW----HIESQERPEEMSKPKVHLHCASGQSI 770
Query: 816 SSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
S+I+FAS+GTP G C F +G CHAP S +V+ +
Sbjct: 771 SAIKFASFGTPSGTCGNFQKGTCHAPTSQAVLEK 804
>gi|356522482|ref|XP_003529875.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 845
Score = 827 bits (2135), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/800 (51%), Positives = 528/800 (66%), Gaps = 35/800 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYDH+AI I+G RR+L+S IHYPR+TPEMWPDLI K+KEGG DVI+TYVFWN HE
Sbjct: 31 SVSYDHKAITINGQRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPS 90
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+Y F G D+V+F+KLV +GLY+ LRIGPYVCAEWNFGGFPVWL+ IPGI FRT+N
Sbjct: 91 PGKYYFGGNYDLVRFIKLVQQAGLYVNLRIGPYVCAEWNFGGFPVWLKYIPGISFRTDNG 150
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK +M++F KKIVD+M+ E LF QGGPII+ QIENEYG ME G G+ Y +WAA M
Sbjct: 151 PFKFQMEKFTKKIVDMMKAERLFESQGGPIILSQIENEYGPMEYEIGAPGRAYTQWAAHM 210
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GLG GVPW+MCKQ DAP+ II+ CNG+YCD + PN KP +WTE W GW+T +GG +
Sbjct: 211 AVGLGTGVPWIMCKQEDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGAV 270
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRP EDLAF++ARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL
Sbjct: 271 PHRPAEDLAFSIARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLPR 330
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLKDLH AIKLCEPALV+ D +LG +EAHV+R+ C+AFLAN +
Sbjct: 331 QPKWGHLKDLHRAIKLCEPALVSGDPTVQ-QLGNYEEAHVFRSK----SGACAAFLANYN 385
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ A+V F Q Y LPPWS+SILP+C++TV+NTA+V SQ++ + VP
Sbjct: 386 PQSYATVAFGNQRYNLPPWSISILPNCKHTVYNTARVGSQSTTMKM---------TRVPI 436
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
+ SW E +++FTV G+LE +N T+D SDYLW+ T + ++ +
Sbjct: 437 HGGL---------SWKAFNEETTTTDDSSFTVTGLLEQINATRDLSDYLWYSTDVVINSN 487
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLI 581
+ F + N P +T+ S L VFIN QL+G+ G + + V ++G N +
Sbjct: 488 E-GFLR-NGKNPVLTVLSAGHALHVFINNQLSGTAYGSLEAPKLTFSESVRLRAGVNKIS 545
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS VGL N G E+ AG G + L+G G DL+ W+Y+VGLKGE ++S+
Sbjct: 546 LLSVAVGLPNVGPHFERWNAGVLGPITLSGLNEGRRDLTWQKWSYKVGLKGEALNLHSLS 605
Query: 642 -ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ EW TWYKT FDAP G+ P+ALD+GSMGKGQ W+NG +GRYW
Sbjct: 606 GSSSVEWLQGFLVSRRQPLTWYKTTFDAPAGVAPLALDMGSMGKGQVWINGQSLGRYWPA 665
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
G C C+Y G YN KC +NCG +Q WYHVP SWL+ + NLLV+FEE GG+P
Sbjct: 666 YKASGSC-GYCNYAGTYNEKKCGSNCGQASQRWYHVPHSWLKPTGNLLVVFEELGGDPNG 724
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEF 820
I + R VC + E V S GK+ + + P+ HL C G ISSI+F
Sbjct: 725 IFLVRRDIDSVCADIYEWQPNLV---SYDMQASGKVR-SPVRPKAHLSCGPGQKISSIKF 780
Query: 821 ASYGTPQGRCQKFSRGNCHA 840
AS+GTP G C + G+CHA
Sbjct: 781 ASFGTPVGSCGNYREGSCHA 800
>gi|115437888|ref|NP_001043405.1| Os01g0580200 [Oryza sativa Japonica Group]
gi|75272679|sp|Q8W0A1.1|BGAL2_ORYSJ RecName: Full=Beta-galactosidase 2; Short=Lactase 2; Flags:
Precursor
gi|18461259|dbj|BAB84455.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113532936|dbj|BAF05319.1| Os01g0580200 [Oryza sativa Japonica Group]
gi|215736924|dbj|BAG95853.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 827
Score = 827 bits (2135), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/807 (51%), Positives = 525/807 (65%), Gaps = 53/807 (6%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
+YD +A++++G RR+LIS IHYPR+TPEMWPDLI K+K+GG DV++TYVFWN HE G
Sbjct: 27 TYDRKAVVVNGQRRILISGSIHYPRSTPEMWPDLIEKAKDGGLDVVQTYVFWNGHEPSPG 86
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
QY F+G+ D+V F+KLV +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N PF
Sbjct: 87 QYYFEGRYDLVHFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPF 146
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
K EMQ+F KIV++M+ E LF WQGGPII+ QIENE+G +E G+ K Y WAA+MA+
Sbjct: 147 KAEMQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 206
Query: 228 GLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPH 287
L VPW+MCK+ DAP+ II+ CNG+YCD + PN +KPT+WTE W WYT +G +PH
Sbjct: 207 ALNTSVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTAWYTGFGIPVPH 266
Query: 288 RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEP 347
RPVEDLA+ VA+F Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGLL EP
Sbjct: 267 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLREP 326
Query: 348 KWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEH 407
KWGHLK LH AIKLCEPALVA D LG Q++ V+R+ S C+AFL N D+
Sbjct: 327 KWGHLKQLHKAIKLCEPALVAGDPI-VTSLGNAQKSSVFRS----STGACAAFLENKDKV 381
Query: 408 TAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQS 467
+ A V F G Y LPPWS+SILPDC+ TVFNTA+V SQ S +E++
Sbjct: 382 SYARVAFNGMHYDLPPWSISILPDCKTTVFNTARVGSQISQMKMEWAGGF---------- 431
Query: 468 MIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDI 527
+W + E I + E+ T G+LE +NVT+D +DYLW+ T + V+ D+
Sbjct: 432 -----------AWQSYNEEINSFGEDPLTTVGLLEQINVTRDNTDYLWYTTYVDVAQDEQ 480
Query: 528 SFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQP-------VEFQSGYNDL 580
+N +T+ S L +FINGQL G+V G V P V+ +G N +
Sbjct: 481 FL--SNGENLKLTVMSAGHALHIFINGQLKGTVYG---SVDDPKLTYTGNVKLWAGSNTI 535
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LS VGL N G E AG G V L G G DL+ WTYQVGLKGE ++S+
Sbjct: 536 SCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNEGRRDLTWQKWTYQVGLKGESMSLHSL 595
Query: 641 E-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
+ EW + + TWYK +F+APDG +P+ALD+ SMGKGQ W+NG IGRYW
Sbjct: 596 SGSSTVEWGEPVQK---QPLTWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWP 652
Query: 700 VVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPF 759
G C TCDYRG Y+ KC TNCG+ +Q WYHVPRSWL + NLLVIFEE GG+P
Sbjct: 653 GYKASGNC-GTCDYRGEYDETKCQTNCGDSSQRWYHVPRSWLSPTGNLLVIFEEWGGDPT 711
Query: 760 EISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIE 819
IS+ RS VC VSE P ++ W K+ HL C +G I+ I+
Sbjct: 712 GISMVKRSIGSVCADVSEWQ-PSMKNWHTKDYEKAKV---------HLQCDNGQKITEIK 761
Query: 820 FASYGTPQGRCQKFSRGNCHAPMSLSV 846
FAS+GTPQG C ++ G CHA S +
Sbjct: 762 FASFGTPQGSCGSYTEGGCHAHKSYDI 788
>gi|357454655|ref|XP_003597608.1| Beta-galactosidase [Medicago truncatula]
gi|124360385|gb|ABN08398.1| D-galactoside/L-rhamnose binding SUEL lectin; Galactose-binding
like [Medicago truncatula]
gi|355486656|gb|AES67859.1| Beta-galactosidase [Medicago truncatula]
Length = 841
Score = 826 bits (2134), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/801 (52%), Positives = 519/801 (64%), Gaps = 37/801 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYD +AI I+G R+LIS IHYPR+TPEMWPDLI K+KEGG DVI+TYVFWN HE
Sbjct: 27 SVSYDSKAITINGQSRILISGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPS 86
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+Y F+G D+VKF+KLV +GLY+ LRIGPYVCAEWNFGGFPVWL+ IPGI FRT+N
Sbjct: 87 PGKYYFEGNYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGISFRTDNE 146
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK +MQ+F +KIVD+M+ + LF QGGPIIM QIENEYG ME G GK Y KWAA M
Sbjct: 147 PFKFQMQKFTEKIVDMMKADRLFESQGGPIIMSQIENEYGPMEYEIGAPGKSYTKWAADM 206
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GLG GVPW+MCKQ DAP+ +I+ CNG+YCD + PN KP +WTE W GW+T +GG +
Sbjct: 207 AVGLGTGVPWIMCKQDDAPDPVINTCNGFYCDYFSPNKDYKPKMWTEAWTGWFTEFGGPV 266
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRP ED+AF+VARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGLL
Sbjct: 267 PHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLLQ 326
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLKDLH AIKL EPAL++ D ++G QEAHV+++ C+AFL N +
Sbjct: 327 QPKWGHLKDLHRAIKLSEPALISGDPT-VTRIGNYQEAHVFKSK----SGACAAFLGNYN 381
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
A+V F Y LPPWS+SILPDC+NTV+NTA+V SQ++ Q
Sbjct: 382 PKAFATVAFGNMHYNLPPWSISILPDCKNTVYNTARVGSQSA-----------------Q 424
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
M + SW E +++FT+ G+LE LN T+D +DYLW+ T + V D
Sbjct: 425 MKMTRVPIHG-GLSWQVFTEQTASTDDSSFTMTGLLEQLNTTRDLTDYLWYSTDV-VIDP 482
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLI 581
+ F ++ + P +T+ S L VFIN QL+G++ G + Q V+ G N +
Sbjct: 483 NEGFLRSGK-DPVLTVLSAGHALHVFINSQLSGTIYGSLEFPKLTFSQNVKLIPGVNKIS 541
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE-FQQIYSI 640
LLS VGL N G E AG G + L G G DLS W+Y+VGL GE
Sbjct: 542 LLSVAVGLPNVGPHFETWNAGVLGPITLNGLDEGRRDLSWQKWSYKVGLHGEALSLHSLG 601
Query: 641 EENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ EW + TWYKT FDAPDGI P ALD+GSMGKGQ W+NG ++GRYW
Sbjct: 602 GSSSVEWVQGSLVSRMQPLTWYKTTFDAPDGIAPFALDMGSMGKGQVWLNGQNLGRYWPA 661
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
G C D CDY G YN +KC +NCG +Q WYHVP SWL + NLLV+FEE GG+P
Sbjct: 662 YKASGTC-DNCDYAGTYNENKCRSNCGEASQRWYHVPHSWLIPTGNLLVVFEELGGDPNG 720
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINK-MAPEMHLHCQDGYIISSIE 819
I + R VC + E + SY + NK + P+ HL C G ISSI+
Sbjct: 721 IFLVRRDIDSVCADIYEWQPNLI-----SYQMQTSGKTNKPVRPKAHLSCGPGQKISSIK 775
Query: 820 FASYGTPQGRCQKFSRGNCHA 840
FAS+GTP G C F G+CHA
Sbjct: 776 FASFGTPVGSCGNFHEGSCHA 796
>gi|255572957|ref|XP_002527409.1| beta-galactosidase, putative [Ricinus communis]
gi|223533219|gb|EEF34975.1| beta-galactosidase, putative [Ricinus communis]
Length = 845
Score = 825 bits (2131), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/798 (51%), Positives = 525/798 (65%), Gaps = 37/798 (4%)
Query: 49 YDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQ 108
YD +AI I+G RR+LIS IHYPR++PEMWPDLI K+KEGG DVI+TYVFWN HE G+
Sbjct: 34 YDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPSPGK 93
Query: 109 YNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFK 168
Y F+G D+VKF+KLV +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N PFK
Sbjct: 94 YYFEGNYDLVKFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGINFRTDNGPFK 153
Query: 169 EEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALG 228
+MQRF KIV++M+ E LF QGGPII+ QIENEYG ME G G+ Y KWAA MA+G
Sbjct: 154 AQMQRFTTKIVNMMKAERLFESQGGPIILSQIENEYGPMEYELGAPGQAYSKWAAKMAVG 213
Query: 229 LGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHR 288
LG GVPWVMCKQ DAP+ +I+ CNG+YCD + PN KP +WTE W GW+T +GG +P+R
Sbjct: 214 LGTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKPYKPKMWTEAWTGWFTEFGGAVPYR 273
Query: 289 PVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPK 348
P EDLAF+VARF Q+GG+F+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGLL +PK
Sbjct: 274 PAEDLAFSVARFIQKGGAFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLLRQPK 333
Query: 349 WGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHT 408
WGHLKDLH AIKLCEPALV+ + + LG QEAHV+++ C+AFLAN ++ +
Sbjct: 334 WGHLKDLHRAIKLCEPALVSG-APSVMPLGNYQEAHVFKS----KSGACAAFLANYNQRS 388
Query: 409 AASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSM 468
A V+F Y LPPWS+SILPDC+NTV+NTA++ +Q++ + +SP +P +
Sbjct: 389 FAKVSFGNMHYNLPPWSISILPDCKNTVYNTARIGAQSA------RMKMSP---IPMRGG 439
Query: 469 IESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDIS 528
SW E +N F + G+LE +N T+D SDYLW+ T + + D +
Sbjct: 440 F---------SWQAYSEEASTEGDNTFMMVGLLEQINTTRDVSDYLWYSTDVRI-DSNEG 489
Query: 529 FWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLILLS 584
F ++ + P +T+ S L VF+NGQL+G+ G + Q V+ ++G N + LLS
Sbjct: 490 FLRSGKY-PVLTVLSAGHALHVFVNGQLSGTAYGSLESPKLTFSQGVKMRAGINRIYLLS 548
Query: 585 QTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE-FQQIYSIEEN 643
VGL N G E AG G V L G G DLS WTY++GL GE +
Sbjct: 549 IAVGLPNVGPHFETWNAGVLGPVTLNGLNEGRRDLSWQKWTYKIGLHGEALSLHSLSGSS 608
Query: 644 EAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAP 703
EW + WYKT F+AP G P+ALD+GSMGKGQ W+NG +GRYW
Sbjct: 609 SVEWAQGSFVSRKQPLMWYKTTFNAPAGNSPLALDMGSMGKGQVWINGQSVGRYWPAYKA 668
Query: 704 KGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
G C C+Y G +N KC TNCG +Q WYHVPRSWL + NLLV+FEE GG+P IS+
Sbjct: 669 SGNC-GVCNYAGTFNEKKCLTNCGEASQRWYHVPRSWLNTAGNLLVVFEEWGGDPNGISL 727
Query: 764 KLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINK-MAPEMHLHCQDGYIISSIEFAS 822
R VC + E + P +Y + +NK + P++HL C G IS I+FAS
Sbjct: 728 VRREVDSVCADIYE--WQPTLM---NYMMQSSGKVNKPLRPKVHLQCGAGQKISLIKFAS 782
Query: 823 YGTPQGRCQKFSRGNCHA 840
+GTP+G C + +G+CHA
Sbjct: 783 FGTPEGVCGSYRQGSCHA 800
>gi|356502950|ref|XP_003520277.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 848
Score = 824 bits (2128), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 421/814 (51%), Positives = 528/814 (64%), Gaps = 45/814 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YD +A++I+G RR+L S IHYPR+TP+MW DLI K+KEGG DV+ETYVFWN HE
Sbjct: 26 SVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLILKAKEGGIDVVETYVFWNVHEPS 85
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G YNF+G+ D+V+FVK + +GLY LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N
Sbjct: 86 PGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 145
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ F +KIV +M+ E LF QGGPII+ QIENEYG G G++YV WAA M
Sbjct: 146 PFKRAMQGFTEKIVGMMKSERLFESQGGPIILSQIENEYGAQSKLQGAAGQNYVNWAAKM 205
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ +G GVPWVMCK+ DAP+ +I+ CNG+YCD + PN KP +WTE W GW+T +GG +
Sbjct: 206 AVEMGTGVPWVMCKEDDAPDPVINTCNGFYCDKFTPNRPYKPMIWTEAWSGWFTEFGGPI 265
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
RPV+DLAFA ARF RGGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL+
Sbjct: 266 HKRPVQDLAFAAARFIIRGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLIR 325
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PK+GHLK+LH AIK+CE ALV+ D LG+ Q+AHVY +C+AFL+N D
Sbjct: 326 QPKYGHLKELHRAIKMCERALVSTDPI-VTSLGEFQQAHVYTTE----SGDCAAFLSNYD 380
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
++A V F Y+LPPWSVSILPDCRN VFNTAKV QTS Q
Sbjct: 381 SKSSARVMFNNMHYSLPPWSVSILPDCRNVVFNTAKVGVQTS-----------------Q 423
Query: 466 QSMIESKLSSTSKSWMTVKEPI-GVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
M+ + ++ SW + E I V + T G+LE +NVTKD SDYLW+IT + +
Sbjct: 424 MQMLPT--NTQLFSWESFDEDIYSVDESSAITAPGLLEQINVTKDASDYLWYITSVDIGS 481
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDL 580
+ SF + E+ PT+ + S + VFINGQL+GS G V +G N +
Sbjct: 482 SE-SFLRGGEL-PTLIVQSTGHAVHVFINGQLSGSAFGTREYRRFTYTGKVNLLAGINRI 539
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS +GL N G E G G V L G G DLS WTYQVGLKGE + S
Sbjct: 540 ALLSVAIGLPNVGEHFESWSTGILGPVALHGLDKGKWDLSGQKWTYQVGLKGEAMDLASP 599
Query: 641 EE-NEAEWTD---LTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGR 696
+ W + + P TW+KTYFDAP+G +P+ALD+ MGKGQ W+NG IGR
Sbjct: 600 NGISSVAWMQSAIVVQRNQP--LTWHKTYFDAPEGDEPLALDMEGMGKGQIWINGQSIGR 657
Query: 697 YWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG 756
YWT A G C D C+Y G++ KC CG PTQ WYHVPRSWL+ + NLLVIFEE GG
Sbjct: 658 YWTAFA-TGNCND-CNYAGSFRPPKCQLGCGQPTQRWYHVPRSWLKTTQNLLVIFEELGG 715
Query: 757 NPFEISVKLRSTRIVCEQVSESHYPPVRKWS-NSYSVDGKLSINKMAPEMHLHCQDGYII 815
NP +IS+ RS VC VSE H P ++ W SY + P++HLHC G I
Sbjct: 716 NPSKISLVKRSVSSVCADVSEYH-PNIKNWHIESYGKSEEFR----PPKVHLHCSPGQTI 770
Query: 816 SSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
SSI+FAS+GTP G C + +G CH+P S ++ +
Sbjct: 771 SSIKFASFGTPLGTCGNYEQGACHSPASYVILEK 804
>gi|316995681|emb|CAA07236.2| beta-galactosidase precursor [Cicer arietinum]
Length = 839
Score = 823 bits (2127), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 415/803 (51%), Positives = 526/803 (65%), Gaps = 35/803 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYD++AI I+G R++L+S IHYPR+TPEMWPDLI K+KEGG DVI+TYVFWN HE
Sbjct: 25 SVSYDYKAITINGQRKILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPS 84
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+Y F+G D+VKF++LV +GLY+ LRIGPY CAEWNFGGFPVWL+ IPGI FRT+N
Sbjct: 85 PGKYYFEGNYDLVKFIRLVQQAGLYVHLRIGPYACAEWNFGGFPVWLKYIPGISFRTDNG 144
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK +MQ+F KIV++M+ E L+ QGGPII+ QIENEYG ME G GK Y +WAA M
Sbjct: 145 PFKFQMQKFTTKIVNIMKAERLYESQGGPIILSQIENEYGPMEYELGAPGKAYAQWAAHM 204
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GLG GVPWVMCKQ DAP+ +I+ CNG+YCD + PN KP +WTE W GW+T +GG +
Sbjct: 205 AIGLGTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTGFGGTV 264
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRP EDLAF+VARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGLL
Sbjct: 265 PHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLLR 324
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLKDLH AIKLCEPALV+AD +LG QEAHV+++ C+AFLAN +
Sbjct: 325 QPKWGHLKDLHRAIKLCEPALVSADPT-VTRLGNYQEAHVFKSK----SGACAAFLANYN 379
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
H+ ++V F Q Y LPPWS+SILP+C++TV+NTA++ SQ++ Q
Sbjct: 380 PHSYSTVAFGNQHYNLPPWSISILPNCKHTVYNTARLGSQSA-----------------Q 422
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
M + SW E +++FTV G+LE +N T+D SDYLW+ T + ++ D
Sbjct: 423 MKMTRVPIHG-GLSWKAFNEETTTTDDSSFTVTGLLEQINATRDLSDYLWYSTDVVINPD 481
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLI 581
+ F N P +T+ S L VFINGQL+G+V G + + V ++G N +
Sbjct: 482 EGYF--RNGKNPVLTVLSAGHALHVFINGQLSGTVYGSLDFPKLTFSESVNLRAGVNKIS 539
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE-FQQIYSI 640
LLS VGL N G E AG G + L G G DL+ W+Y+VGLKGE
Sbjct: 540 LLSVAVGLPNVGPHFETWNAGVLGPITLNGLNEGRRDLTWQKWSYKVGLKGEDLSLHSLS 599
Query: 641 EENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ +W TWYKT FDAP G+ P+ALD+ SMGKGQ W+NG +GRYW
Sbjct: 600 GSSSVDWLQGYLVSRRQPLTWYKTTFDAPAGVAPLALDMNSMGKGQVWLNGQSLGRYWPA 659
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
G C D C+Y G YN KC TNCG +Q WYHVP SWL+ + NLLV+FEE GG+P
Sbjct: 660 YKATGSC-DYCNYAGTYNEKKCGTNCGEASQRWYHVPHSWLKPTGNLLVMFEELGGDPNG 718
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEF 820
+ + R VC + E V S GK+S ++P+ HL C G ISSI+F
Sbjct: 719 VFLVRRDIDSVCADIYEWQPNLV---SYQMQASGKVS-RPVSPKAHLSCGPGQKISSIKF 774
Query: 821 ASYGTPQGRCQKFSRGNCHAPMS 843
AS+GTP G C + G+CHA S
Sbjct: 775 ASFGTPVGSCGNYREGSCHAHKS 797
>gi|356561185|ref|XP_003548865.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 848
Score = 823 bits (2127), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/814 (51%), Positives = 530/814 (65%), Gaps = 45/814 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YD +AI+I+G RR+L S IHYPR+TP+MW DLI K+KEGG DV+ETYVFWN HE
Sbjct: 26 SVTYDRKAILINGQRRILFSGSIHYPRSTPDMWEDLILKAKEGGLDVVETYVFWNVHEPS 85
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G YNF+G+ D+V+FVK + +GLY LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N
Sbjct: 86 PGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 145
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ F +KIV +M+ E LF QGGPII+ QIENEYG G G++YV WAA M
Sbjct: 146 PFKTAMQGFTEKIVGMMKSERLFESQGGPIILSQIENEYGAQSKLQGDAGQNYVNWAAKM 205
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ +G GVPWVMCK+ DAP+ +I+ CNG+YCD + PN KP +WTE W GW+T +GG +
Sbjct: 206 AVEMGTGVPWVMCKEDDAPDPVINTCNGFYCDKFTPNRPYKPMIWTEAWSGWFTEFGGPI 265
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
RPV+DLAFAVARF RGGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL+
Sbjct: 266 HKRPVQDLAFAVARFIIRGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLIR 325
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PK+GHLK+LH AIK+CE ALV+ D LG++Q+AHVY +C+AFL+N D
Sbjct: 326 QPKYGHLKELHRAIKMCERALVSTDPI-ITSLGESQQAHVYTTE----SGDCAAFLSNYD 380
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
++A V F Y LPPWSVSILPDCRN VFNTAKV QTS Q
Sbjct: 381 SKSSARVMFNNMHYNLPPWSVSILPDCRNVVFNTAKVGVQTS-----------------Q 423
Query: 466 QSMIESKLSSTSKSWMTVKEPI-GVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
M+ + ++ SW + E + V + G+LE +NVTKD SDYLW+IT + +
Sbjct: 424 MQMLPT--NTQLFSWESFDEDVYSVDDSSAIMAPGLLEQINVTKDASDYLWYITSVDIGS 481
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDL 580
+ SF + E+ PT+ + S + VFINGQL+GS G V ++G N +
Sbjct: 482 SE-SFLRGGEL-PTLIVQSRGHAVHVFINGQLSGSAYGTREYRRFMYTGKVNLRAGINRI 539
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS +GL N G E G G V L G G DLS WTYQVGLKGE + S
Sbjct: 540 ALLSVAIGLPNVGEHFESWSTGILGPVALHGLDQGKWDLSGQKWTYQVGLKGEAMDLASP 599
Query: 641 EE-NEAEWTD---LTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGR 696
+ W + + P TW+KT+FDAP+G +P+ALD+ MGKGQ W+NG IGR
Sbjct: 600 NGISSVAWMQSAIVVQRNQP--LTWHKTHFDAPEGDEPLALDMEGMGKGQIWINGQSIGR 657
Query: 697 YWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG 756
YWT A G C D C+Y G++ KC CG PTQ WYHVPRSWL+ + NLLVIFEE GG
Sbjct: 658 YWTTFA-TGNCND-CNYAGSFRPPKCQLGCGQPTQRWYHVPRSWLKPTQNLLVIFEELGG 715
Query: 757 NPFEISVKLRSTRIVCEQVSESHYPPVRKWS-NSYSVDGKLSINKMAPEMHLHCQDGYII 815
NP +IS+ RS VC VSE H P ++ W SY + P++HLHC G I
Sbjct: 716 NPSKISLVKRSVSSVCADVSEYH-PNIKNWHIESYGKSEEFH----PPKVHLHCSPGQTI 770
Query: 816 SSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
SSI+FAS+GTP G C + +G CH+P S +++ +
Sbjct: 771 SSIKFASFGTPLGTCGNYEQGACHSPASYAILEK 804
>gi|356540789|ref|XP_003538867.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 853
Score = 823 bits (2127), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 416/816 (50%), Positives = 532/816 (65%), Gaps = 47/816 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YD +AI+I+G RR+L S IHYPR+TP+MW DLI K+KEGG DVIETY+FWN HE
Sbjct: 31 SVTYDRKAILINGQRRILFSGSIHYPRSTPDMWEDLIYKAKEGGLDVIETYIFWNVHEPS 90
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
RG YNF+G+ D+V+FVK + +GLY LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N
Sbjct: 91 RGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 150
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK+ MQ F +KIV +M+ E L+ QGGPII+ QIENEYG G G++YV WAA M
Sbjct: 151 PFKKAMQGFTEKIVGMMKSERLYESQGGPIILSQIENEYGAQSKLLGPAGQNYVNWAAKM 210
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ G GVPWVMCK+ DAP+ +I+ CNG+YCD + PN KP++WTE W GW++ +GG
Sbjct: 211 AVETGTGVPWVMCKEDDAPDPVINTCNGFYCDYFTPNKPYKPSIWTEAWSGWFSEFGGPN 270
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
RPV+DLAF VARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL+
Sbjct: 271 HERPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLIR 330
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PK+GHLK+LH AIK+CE ALV+AD A +G Q+AHVY +C+AFL+N D
Sbjct: 331 QPKYGHLKELHKAIKMCERALVSADPA-VTSMGNFQQAHVYTTK----SGDCAAFLSNFD 385
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
++ V F Y LPPWS+SILPDCRN VFNTAKV QTS Q
Sbjct: 386 TKSSVRVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTS-----------------Q 428
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENN---FTVQGILEHLNVTKDYSDYLWHITQIYV 522
M+ + ++ SW + E I + + T G+LE +NVT+D SDYLW+IT + +
Sbjct: 429 MQMLPT--NTHMFSWESFDEDISSLDDGSAITITTSGLLEQINVTRDTSDYLWYITSVDI 486
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYN 578
+ SF + ++ PT+ + S + VFINGQL+GS G + V ++G N
Sbjct: 487 GSSE-SFLRGGKL-PTLIVQSTGHAVHVFINGQLSGSAYGTREDRRFRYTGTVNLRAGTN 544
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIY 638
+ LLS VGL N G E G G V L G G +DLS WTYQVGLKGE +
Sbjct: 545 RIALLSVAVGLPNVGGHFETWNTGILGPVVLRGLNQGKLDLSWQKWTYQVGLKGEAMNLA 604
Query: 639 SIEE-NEAEWTD---LTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHI 694
S + EW ++ P TW+KTYFDAPDG +P+ALD+ MGKGQ W+NG I
Sbjct: 605 SPNGISSVEWMQSALVSEKNQP--LTWHKTYFDAPDGDEPLALDMEGMGKGQIWINGLSI 662
Query: 695 GRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEET 754
GRYWT AP G + C Y G + KC CG PTQ WYHVPRSWL+ ++NLLV+FEE
Sbjct: 663 GRYWT--APAAGICNGCSYAGTFRPPKCQVGCGQPTQRWYHVPRSWLKPNHNLLVVFEEL 720
Query: 755 GGNPFEISVKLRSTRIVCEQVSESHYPPVRKWS-NSYSVDGKLSINKMAPEMHLHCQDGY 813
GG+P +IS+ RS +C VSE H P +R W +SY + P++HLHC
Sbjct: 721 GGDPSKISLVKRSVSSICADVSEYH-PNIRNWHIDSYGKSEEFH----PPKVHLHCSPSQ 775
Query: 814 IISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
ISSI+FAS+GTP G C + +G CH+P S + + +
Sbjct: 776 AISSIKFASFGTPLGTCGNYEKGVCHSPTSYATLEK 811
>gi|356496697|ref|XP_003517202.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 849
Score = 823 bits (2125), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 421/815 (51%), Positives = 532/815 (65%), Gaps = 45/815 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YD +AI+I+G RR+L S IHYPR+TP+MW DLI K+KEGG DVIETYVFWN HE
Sbjct: 31 SVTYDRKAILINGQRRILFSGSIHYPRSTPDMWEDLIYKAKEGGLDVIETYVFWNVHEPS 90
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
RG YNF+G+ D+V+FVK + +GLY LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N
Sbjct: 91 RGNYNFEGRYDLVRFVKTIQKAGLYANLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 150
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK+ MQ F +KIV +M+ E L+ QGGPII+ QIENEYG G G++YV WAA M
Sbjct: 151 PFKKAMQGFTEKIVGMMKSERLYESQGGPIILSQIENEYGAQSKLLGSAGQNYVNWAAKM 210
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ G GVPWVMCK+ DAP+ +I+ CNG+YCD + PN KP++WTE W GW++ +GG
Sbjct: 211 AVETGTGVPWVMCKEDDAPDPVINTCNGFYCDYFTPNKPYKPSIWTEAWSGWFSEFGGPN 270
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
RPV+DLAF VARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL+
Sbjct: 271 HERPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLIR 330
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PK+GHLK+LH AIK+CE ALV+ D A LG Q+AHVY A +C+AFL+N D
Sbjct: 331 QPKYGHLKELHKAIKMCERALVSTDPA-VTSLGNFQQAHVYSAK----SGDCAAFLSNFD 385
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
++ V F Y LPPWS+SILPDCRN VFNTAKV QTS Q
Sbjct: 386 TKSSVRVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTS-----------------Q 428
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENN---FTVQGILEHLNVTKDYSDYLWHITQIYV 522
M+ + ++ SW + E I + + T G+LE +NVT+D SDYLW+IT + +
Sbjct: 429 MQMLPT--NTRMFSWESFDEDISSLDDGSSITTTTSGLLEQINVTRDTSDYLWYITSVDI 486
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYN 578
+ SF + ++ PT+ + S + VFINGQL+GS G V ++G N
Sbjct: 487 GSSE-SFLRGGKL-PTLIVQSTGHAVHVFINGQLSGSAYGTREDRRFTYTGTVNLRAGTN 544
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIY 638
+ LLS VGL N G E G G V L GF G +DLS WTYQVGLKGE +
Sbjct: 545 RIALLSVAVGLPNVGGHFETWNTGILGPVVLRGFDQGKLDLSWQKWTYQVGLKGEAMNLA 604
Query: 639 SIEE-NEAEW--TDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
S + EW + L D TW+KTYFDAPDG +P+ALD+ MGKGQ W+NG IG
Sbjct: 605 SPNGISSVEWMQSALVSDK-NQPLTWHKTYFDAPDGDEPLALDMEGMGKGQIWINGLSIG 663
Query: 696 RYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
RYWT +A G C + C Y G + KC CG PTQ WYHVPRSWL+ +NLLV+FEE G
Sbjct: 664 RYWTALA-AGNC-NGCSYAGTFRPPKCQVGCGQPTQRWYHVPRSWLKPDHNLLVVFEELG 721
Query: 756 GNPFEISVKLRSTRIVCEQVSESHYPPVRKWS-NSYSVDGKLSINKMAPEMHLHCQDGYI 814
G+P +IS+ RS VC VSE H P +R W +SY + P++HLHC G
Sbjct: 722 GDPSKISLVKRSVSSVCADVSEYH-PNIRNWHIDSYGKSEEFH----PPKVHLHCSPGQT 776
Query: 815 ISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
ISSI+FAS+GTP G C + +G CH+ S + + +
Sbjct: 777 ISSIKFASFGTPLGTCGNYEKGVCHSSTSHATLEK 811
>gi|114217397|dbj|BAF31234.1| beta-D-galactosidase [Persea americana]
Length = 849
Score = 822 bits (2124), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/837 (51%), Positives = 537/837 (64%), Gaps = 55/837 (6%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
+ ++++H + S V+YD +AIII+G R++LIS IHYPR+TP+MW L+
Sbjct: 16 LFLLVLHFQLIQCS----------VTYDRKAIIINGQRKILISGSIHYPRSTPDMWEGLM 65
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K+K+GG DVI+TYVFWN HE G YNF+G+ D+V+FVK V +GLY+ LRIGPYVCAE
Sbjct: 66 QKAKDGGLDVIQTYVFWNVHEPSPGNYNFEGRYDLVRFVKTVQKAGLYMHLRIGPYVCAE 125
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
WNFGGFPVWL+ +PGI FRT+N PFK MQ F +KIV +M+ E LF QGGPII+ QIEN
Sbjct: 126 WNFGGFPVWLKYVPGISFRTDNEPFKMAMQGFTEKIVQMMKSESLFESQGGPIILSQIEN 185
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPN 262
EYG+ + G G Y+ WAA MA+GL GVPWVMCK+ DAP+ +I+ CNG+YCD + PN
Sbjct: 186 EYGSESKALGAPGHAYMTWAAKMAVGLRTGVPWVMCKEDDAPDPVINTCNGFYCDAFTPN 245
Query: 263 SYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRT 322
KPT+WTE W GW+T +GG + RPVEDLAFAVARF Q+GGSF+NYYMY GGTNFGRT
Sbjct: 246 KPYKPTMWTEAWSGWFTEFGGTVHERPVEDLAFAVARFIQKGGSFINYYMYHGGTNFGRT 305
Query: 323 SGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQE 382
+GGPF TSYDYDAPIDEYGL+ +PK+GHLK+LH AIKLCEPAL++AD LG Q+
Sbjct: 306 AGGPFITTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKLCEPALISADPI-VTSLGPYQQ 364
Query: 383 AHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKV 442
+HV+ + G C+AFL+N + ++ A V F Y+LPPWS+SILPDCRN VFNTAKV
Sbjct: 365 SHVFSSGTGG----CAAFLSNYNPNSVARVMFNNMHYSLPPWSISILPDCRNVVFNTAKV 420
Query: 443 SSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNF-TVQGIL 501
QTS S E+KL SW E I +N+ T G+L
Sbjct: 421 GVQTSQM---------------HMSAGETKL----LSWEMYDEDIASLGDNSMITAVGLL 461
Query: 502 EHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVI 561
E LNVT+D SDYLW++T + +S + S P +T+ S L V+INGQL+GS
Sbjct: 462 EQLNVTRDTSDYLWYMTSVDISPSESSLRGGRP--PVLTVQSAGHALHVYINGQLSGSAH 519
Query: 562 G----HWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI 617
G V ++G N + LLS V L N G E G G V L G G
Sbjct: 520 GSRENRRFTFTGDVNMRAGINRIALLSIAVELPNVGLHYESTNTGVLGPVVLHGLDQGKR 579
Query: 618 DLSKILWTYQVGLKGEFQQIYSIEE-NEAEWTD---LTRDGIPSTFTWYKTYFDAPDGID 673
DL+ W+YQVGLKGE + + + EW T+ P TWYK YF+AP G +
Sbjct: 580 DLTWQKWSYQVGLKGEAMNLVAPSGISYVEWMQASFATQKLQP--LTWYKAYFNAPGGDE 637
Query: 674 PVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTW 733
P+ALDLGSMGKGQ W+NG IGRYWT A G C + C Y G Y + KC T CG PTQ W
Sbjct: 638 PLALDLGSMGKGQVWINGESIGRYWTAAA-NGDC-NHCSYAGTYRAPKCQTGCGQPTQRW 695
Query: 734 YHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWS-NSYSV 792
YHVPRSWLQ + NLLVIFEE GG+ IS+ RS VC VSE H P ++ W SY
Sbjct: 696 YHVPRSWLQPTKNLLVIFEEIGGDASGISLVKRSVSSVCADVSEWH-PTIKNWHIESYGR 754
Query: 793 DGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+L P++HL C G IS+I+FAS+GTP G C F +G CH+P S +++ +
Sbjct: 755 SEELH----RPKVHLRCAMGQSISAIKFASFGTPLGTCGSFQQGPCHSPNSHAILEK 807
>gi|61162201|dbj|BAD91082.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 854
Score = 822 bits (2123), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 419/811 (51%), Positives = 526/811 (64%), Gaps = 40/811 (4%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD +AI+I+G RR+LIS IHYPR+TPEMW DLI K+K+GG DV+ETYVFWN HE
Sbjct: 28 VTYDRKAIVINGQRRILISGSIHYPRSTPEMWEDLIQKAKDGGLDVVETYVFWNVHEPTP 87
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G YNF+G+ D+V+F+K + +GLY LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N P
Sbjct: 88 GNYNFEGRYDLVRFLKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 147
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ F +KIV LM+ E LF QGGPII+ QIENEYG +G G +Y+ WAA MA
Sbjct: 148 FKRAMQGFTQKIVGLMKSESLFESQGGPIILSQIENEYGAQSKLFGAAGHNYITWAAEMA 207
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+GL GVPWVMCK+ DAP+ +I+ CNG+YCD + PN KPT+WTE W GW+T +GG +
Sbjct: 208 VGLDTGVPWVMCKEEDAPDPVINTCNGFYCDSFSPNRPYKPTIWTETWSGWFTEFGGPIH 267
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
RPV+DLA+AVA F Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL+ +
Sbjct: 268 QRPVQDLAYAVATFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLIRQ 327
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PK+GHLK+LH AIK+CE ALV+AD LG Q+A+VY + +CSAFL+N D
Sbjct: 328 PKYGHLKELHKAIKMCERALVSADPI-ITSLGNFQQAYVYTSE----SGDCSAFLSNHDS 382
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+AA V F Y LPPWS+SILPDCRN VFNTAKV QTS + L NI +
Sbjct: 383 KSAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSQMQM-----LPTNIPMLSW 437
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
+ L+S S + T G+LE +NVT+D +DYLW+IT + + D
Sbjct: 438 ESYDEDLTSMDDS-------------STMTAPGLLEQINVTRDSTDYLWYITSVDI-DSS 483
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLIL 582
SF E+ PT+ + S + +FINGQLTGS G V ++G N + L
Sbjct: 484 ESFLHGGEL-PTLIVQSTGHAVHIFINGQLTGSAFGTRESRRFTYTGKVNLRAGTNKIAL 542
Query: 583 LSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEE 642
LS VGL N G E G G V L G G DLS WTYQVGLKGE + S
Sbjct: 543 LSVAVGLPNVGGHFEAWNTGILGPVALHGLNQGKWDLSWQKWTYQVGLKGEAMNLVSQNA 602
Query: 643 -NEAEWT--DLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
+ EW L TW+KT F+ P+G +P+ALD+ MGKGQ W+NG IGRYWT
Sbjct: 603 FSSVEWISGSLIAQKKQQPLTWHKTIFNEPEGSEPLALDMEGMGKGQIWINGQSIGRYWT 662
Query: 700 VVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPF 759
A G C + C Y G + KC + CG PTQ +YHVPRSWL+ + NLLV+FEE GG+P
Sbjct: 663 AFA-NGNC-NGCSYAGGFRPTKCQSGCGKPTQRYYHVPRSWLKPTQNLLVLFEELGGDPS 720
Query: 760 EISVKLRSTRIVCEQVSESHYPPVRKWS-NSYSVDGKLSINKMAPEMHLHCQDGYIISSI 818
IS+ R+ VC +V+E H P ++ W SY GK+ + +P++HL C G ISSI
Sbjct: 721 RISLVKRAVSSVCSEVAEYH-PTIKNWHIESY---GKVE-DFHSPKVHLRCNPGQAISSI 775
Query: 819 EFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+FAS+GTP G C + G CHA S SVV +
Sbjct: 776 KFASFGTPLGTCGSYQEGTCHATTSYSVVQK 806
>gi|14517399|gb|AAK62590.1| At2g32810/F24L7.5 [Arabidopsis thaliana]
gi|25090389|gb|AAN72290.1| At2g32810/F24L7.5 [Arabidopsis thaliana]
Length = 585
Score = 820 bits (2119), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/539 (71%), Positives = 442/539 (82%), Gaps = 1/539 (0%)
Query: 312 MYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADS 371
MYFGGTNFGRTSGGPFYITSYDYDAP+DEYGL SEPKWGHLKDLHAAIKLCEPALVAAD+
Sbjct: 1 MYFGGTNFGRTSGGPFYITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAADA 60
Query: 372 AQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPD 431
QY KLG QEAH+Y + C+AFLANIDEH +A V F GQSYTLPPWSVSILPD
Sbjct: 61 PQYRKLGSKQEAHIYHGDGETGGKVCAAFLANIDEHKSAHVKFNGQSYTLPPWSVSILPD 120
Query: 432 CRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWS 491
CR+ FNTAKV +QTS+KTVE + P ++S+ Q+ + + +S SKSWM +KEPIG+W
Sbjct: 121 CRHVAFNTAKVGAQTSVKTVESARPSLGSMSILQKVVRQDNVSYISKSWMALKEPIGIWG 180
Query: 492 ENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVF 551
ENNFT QG+LEHLNVTKD SDYLWH T+I VS+DDISFWK N TV+IDSMRDVLRVF
Sbjct: 181 ENNFTFQGLLEHLNVTKDRSDYLWHKTRISVSEDDISFWKKNGPNSTVSIDSMRDVLRVF 240
Query: 552 INGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTG 611
+N QL GS++GHWVK VQPV F G NDL+LL+QTVGLQNYGAFLEKDGAGFRG+ KLTG
Sbjct: 241 VNKQLAGSIVGHWVKAVQPVRFIQGNNDLLLLTQTVGLQNYGAFLEKDGAGFRGKAKLTG 300
Query: 612 FKNGDIDLSKILWTYQVGLKGEFQQIYSIEENE-AEWTDLTRDGIPSTFTWYKTYFDAPD 670
FKNGD+DLSK WTYQVGLKGE +IY++E NE AEW+ L D PS F WYKTYFD P
Sbjct: 301 FKNGDLDLSKSSWTYQVGLKGEADKIYTVEHNEKAEWSTLETDASPSIFMWYKTYFDPPA 360
Query: 671 GIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPT 730
G DPV L+L SMG+GQAWVNG HIGRYW +++ K GC TCDYRGAYNSDKCTTNCG PT
Sbjct: 361 GTDPVVLNLESMGRGQAWVNGQHIGRYWNIISQKDGCDRTCDYRGAYNSDKCTTNCGKPT 420
Query: 731 QTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSY 790
QT YHVPRSWL+ S+NLLV+FEETGGNPF+ISVK + I+C QVSESHYPP+RKWS
Sbjct: 421 QTRYHVPRSWLKPSSNLLVLFEETGGNPFKISVKTVTAGILCGQVSESHYPPLRKWSTPD 480
Query: 791 SVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
++G +SIN +APE+HLHC+DG++ISSIEFASYGTP+G C FS G CHA SLS+VSE
Sbjct: 481 YINGTMSINSVAPEVHLHCEDGHVISSIEFASYGTPRGSCDGFSIGKCHASNSLSIVSE 539
>gi|57232107|gb|AAW47739.1| beta-galactosidase [Prunus persica]
Length = 853
Score = 820 bits (2119), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/813 (51%), Positives = 528/813 (64%), Gaps = 45/813 (5%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD RAI+I+G RR+LIS IHYPR+TPEMW DLI K+K+GG DV+ETYVFWN HE
Sbjct: 28 VTYDRRAIVINGQRRILISGSIHYPRSTPEMWEDLIQKAKDGGLDVVETYVFWNVHEPSP 87
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G YNFKG+ D+V+F+K + +GLY LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N P
Sbjct: 88 GNYNFKGRYDLVRFLKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 147
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ F +KIV LM+ E LF QGGPII+ QIENEYG +G G +Y+ WAA+MA
Sbjct: 148 FKRAMQGFTEKIVGLMKSEKLFESQGGPIILSQIENEYGAQSKLFGAAGHNYMTWAANMA 207
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+GLG GVPWVMCK+ DAP+ +I+ CNG+YCD + PN KPT+WTE W GW++ +GG +
Sbjct: 208 VGLGTGVPWVMCKEEDAPDPVINTCNGFYCDSFAPNKPYKPTIWTEAWSGWFSEFGGPIH 267
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
RPV+DLA+AVARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL+ +
Sbjct: 268 QRPVQDLAYAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLIRQ 327
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PK+GHLK+LH AIK+CE ALV+AD LG Q+A+VY + +CSAFL+N D
Sbjct: 328 PKYGHLKELHRAIKMCERALVSADPI-ITSLGNFQQAYVYTSE----SGDCSAFLSNHDS 382
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+AA V F Y LPPWS+SILPDCRN VFNTAKV QTS Q
Sbjct: 383 KSAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTS-----------------QM 425
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENN-FTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
M+ + + SW + E I +++ T G+LE +NVT+D +DYLW+ T + +
Sbjct: 426 GMLPTNIQML--SWESYDEDITSLDDSSTITAPGLLEQINVTRDSTDYLWYKTSVDIGSS 483
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLI 581
+ SF + E+ PT+ + S + +FINGQL+GS G V +G N +
Sbjct: 484 E-SFLRGGEL-PTLIVQSTGHAVHIFINGQLSGSSFGTRESRRFTYTGKVNLHAGTNRIA 541
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS VGL N G E G G V L G G DLS WTYQVGLKGE + +
Sbjct: 542 LLSVAVGLPNVGGHFEAWNTGILGPVALHGLDQGKWDLSWQKWTYQVGLKGEAMNL--VS 599
Query: 642 ENEAEWTDLTRDGIPST----FTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
N D R + + TW+KT F+AP+G +P+ALD+ MGKGQ W+NG IGRY
Sbjct: 600 PNSISSVDWMRGSLAAQKQQPLTWHKTLFNAPEGDEPLALDMEGMGKGQIWINGQSIGRY 659
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
WT A G C + C Y G + KC CG PTQ YHVPRSWL+ NLLVIFEE GG+
Sbjct: 660 WTAFA-NGNC-NGCSYAGGFRPPKCQVGCGQPTQRVYHVPRSWLKPMQNLLVIFEEFGGD 717
Query: 758 PFEISVKLRSTRIVCEQVSESHYPPVRKWS-NSYSVDGKLSINKMAPEMHLHCQDGYIIS 816
P IS+ RS VC +V+E H P ++ W SY GK + +P++HL C G IS
Sbjct: 718 PSRISLVKRSVSSVCAEVAEYH-PTIKNWHIESY---GKAE-DFHSPKVHLRCNPGQAIS 772
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
SI+FAS+GTP G C + G CHA S SV+ +
Sbjct: 773 SIKFASFGTPLGTCGSYQEGTCHAATSYSVLQK 805
>gi|357131396|ref|XP_003567324.1| PREDICTED: beta-galactosidase 3-like [Brachypodium distachyon]
Length = 916
Score = 818 bits (2114), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 411/812 (50%), Positives = 517/812 (63%), Gaps = 42/812 (5%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD R++II G RR+LIS IHYPR+ P MWP L+A++K+GGAD IETYVFWN HE+
Sbjct: 102 VTYDGRSLIISGRRRLLISTSIHYPRSVPAMWPKLVAEAKDGGADCIETYVFWNGHETAP 161
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G+Y F+ + D+V+F K+V +GLYL LRIGP+V AEWNFGG PVWL IPG FRTNN P
Sbjct: 162 GEYYFEDRFDLVRFAKVVKDAGLYLMLRIGPFVAAEWNFGGVPVWLHYIPGAVFRTNNEP 221
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK M+ F KIVD+M+ E F+ QGG II+ QIENEYG+ E +YG GK Y WAASMA
Sbjct: 222 FKSHMKSFTTKIVDMMKRERFFASQGGHIILAQIENEYGDTEQAYGADGKAYAMWAASMA 281
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
L GVPW+MC+Q DAPE++I+ CN +YCD +K NS KP +WTENW GW+ T+G P
Sbjct: 282 LAQNTGVPWIMCQQYDAPEHVINTCNSFYCDQFKTNSPTKPKIWTENWPGWFQTFGESNP 341
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
HRP ED+AF+VARFFQ+GGS NYY+Y GGTNFGRT+GGPF TSYDYDAPIDEYGL
Sbjct: 342 HRPPEDVAFSVARFFQKGGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGLTRL 401
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKW HL+DLH +IKLCE +L+ + + LG QEA VY + G C AFLANID
Sbjct: 402 PKWAHLRDLHKSIKLCEHSLLYGNLTS-LSLGTKQEADVYTDHSGG----CVAFLANIDP 456
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
VTF + Y LP WSVSILPDC+N VFNTAKV SQT +
Sbjct: 457 ENDTVVTFRSRQYDLPAWSVSILPDCKNAVFNTAKVQSQTLM-----------------V 499
Query: 467 SMIESKLSSTSKS-WMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
M+ L ST W +E G+W +N+F G ++H+N TKD +DYLWH T V
Sbjct: 500 DMVPETLQSTKPDRWSIFREKTGIWDKNDFIRNGFVDHINTTKDSTDYLWHTTSFNVDRS 559
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYNDLI 581
+ TN R ++IDS + F+N +L GS G+ K V P++ + G N++
Sbjct: 560 ----YPTNGNRELLSIDSKGHAVHAFLNNELIGSAYGNGSKSSFNVHMPIKLKPGKNEIA 615
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS TVGLQN G E GAG V ++G KNG IDLS W Y++GL+GE ++ +
Sbjct: 616 LLSMTVGLQNAGPHYEWVGAGLT-SVNISGMKNGSIDLSSNNWAYKIGLEGEHYGLFKPD 674
Query: 642 E-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ N W+ + TWYK D P G DPV +D+ SMGKG AW+NG+ IGRYW
Sbjct: 675 QGNNQRWSPQSEPPKGQPLTWYKVNVDVPQGDDPVGIDMQSMGKGLAWLNGNAIGRYWPR 734
Query: 701 VAPKGG-CQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPF 759
+ C +C+YRG +N KC T CG PTQ WYHVPRSW S N LV+FEE GG+P
Sbjct: 735 TSSSDDRCTPSCNYRGPFNPSKCRTGCGKPTQRWYHVPRSWFHPSGNTLVVFEEQGGDPT 794
Query: 760 EISVKLRSTRIVCEQVSESHYPPV--RKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISS 817
+I+ R VC VSE +YP + W S S DGK + ++ L C G ISS
Sbjct: 795 KITFSRRVATKVCSFVSE-NYPSIDLESWDKSISDDGKDTA-----KVQLSCPKGKNISS 848
Query: 818 IEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
++FAS+G P G C+ + +G CH P SLSVV +
Sbjct: 849 VKFASFGDPSGTCRSYQQGRCHHPSSLSVVEK 880
>gi|218188525|gb|EEC70952.1| hypothetical protein OsI_02561 [Oryza sativa Indica Group]
Length = 822
Score = 818 bits (2114), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/801 (51%), Positives = 522/801 (65%), Gaps = 41/801 (5%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
++YD +A++++G RR+LIS IHYPR+TPEMWPDLI K+K+GG DV++TYVFWN HE
Sbjct: 23 LTYDRKAVVVNGQRRILISGSIHYPRSTPEMWPDLIEKAKDGGLDVVQTYVFWNGHEPSP 82
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY F+G+ D+V F+KLV +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N P
Sbjct: 83 GQYYFEGRYDLVHFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 142
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK EMQ+F KIV++M+ E LF WQGGPII+ QIENE+G +E G+ K Y WAA+MA
Sbjct: 143 FKAEMQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMA 202
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+ L GVPW+MCK+ DAP+ II+ CNG+YCD + PN +KPT+WTE W WYT +G +P
Sbjct: 203 VALNTGVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTAWYTGFGIPVP 262
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
HRPVEDLA+ VA+F Q+GGSF+NYYM+ GGTNFGRT+GGPF TSYDYDAPIDEYGLL E
Sbjct: 263 HRPVEDLAYGVAKFIQKGGSFVNYYMFHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLRE 322
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHLK LH AIKLCEPALVA D LG Q++ V+R+ S C+AFL N D+
Sbjct: 323 PKWGHLKQLHKAIKLCEPALVAGDPI-VTSLGNAQKSSVFRS----STGACAAFLDNKDK 377
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+ A V F G Y LPPWS+SILPDC+ TVFNTA+V SQ S +E++
Sbjct: 378 VSYARVAFNGMHYDLPPWSISILPDCKTTVFNTARVGSQISQMKMEWAGGF--------- 428
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
+W + E I + E+ FT G+LE +NVT+D +DYLW+ T + V+ DD
Sbjct: 429 ------------AWQSYNEEINSFGEDPFTTVGLLEQINVTRDNTDYLWYTTYVDVAQDD 476
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQT 586
+ TV + ++L + G + GSV + V+ +G N + LS
Sbjct: 477 QFLSNGENPKLTVMCFLILNILFNLLAGTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIA 536
Query: 587 VGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE-ENEA 645
VGL N G E AG G V L G G DL+ WTYQVGLKGE ++S+ +
Sbjct: 537 VGLPNVGEHFETWNAGILGPVTLDGLNEGRRDLTWQKWTYQVGLKGESMSLHSLSGSSTV 596
Query: 646 EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKG 705
EW + + TWYK +F+APDG +P+ALD+ SMGKGQ W+NG IGRYW G
Sbjct: 597 EWGEPVQK---QPLTWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKASG 653
Query: 706 GCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKL 765
C TCDYRG Y+ KC TNCG+ +Q WYHVPRSWL + NLLVIFEE GG+P IS+
Sbjct: 654 NC-GTCDYRGEYDETKCQTNCGDSSQRWYHVPRSWLSPTGNLLVIFEEWGGDPTGISMVK 712
Query: 766 RSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGT 825
RS VC VSE P ++ W K+ HL C +G I+ I+FAS+GT
Sbjct: 713 RSIGSVCADVSEWQ-PSMKNWHTKDYEKAKV---------HLQCDNGQKITEIKFASFGT 762
Query: 826 PQGRCQKFSRGNCHAPMSLSV 846
PQG C +S G CHA S +
Sbjct: 763 PQGSCGSYSEGGCHAHKSYDI 783
>gi|357483611|ref|XP_003612092.1| Beta-galactosidase [Medicago truncatula]
gi|355513427|gb|AES95050.1| Beta-galactosidase [Medicago truncatula]
Length = 843
Score = 818 bits (2113), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/813 (51%), Positives = 537/813 (66%), Gaps = 44/813 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YD +AIII+G RR+L S IHYPR+TP+MW DLI K+KEGG DVIETYVFWN HE
Sbjct: 25 DVTYDRKAIIINGQRRILFSGSIHYPRSTPDMWEDLIYKAKEGGLDVIETYVFWNVHEPS 84
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G YNF+G+ND+V+F++ V +GLY LRIGPYVCAEWNFGGFPVWL+ +PGI FR +N
Sbjct: 85 PGNYNFEGRNDLVRFIQTVHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRQDNE 144
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK+ MQ F +KIV +M+ E L+ QGGPII+ QIENEYG G G +Y+ WAA M
Sbjct: 145 PFKKAMQGFTEKIVGMMKSERLYESQGGPIILSQIENEYGAQSKMLGPVGYNYMSWAAKM 204
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ +G GVPW+MCK+ DAP+ +I+ CNG+YCD + PN KPT+WTE W GW++ +GG +
Sbjct: 205 AVEMGTGVPWIMCKEDDAPDPVINTCNGFYCDKFTPNKPYKPTMWTEAWSGWFSEFGGPI 264
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
RPV+DLAFAVARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL+
Sbjct: 265 HKRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLIR 324
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PK+GHLK+LH AIK+CE AL++ D LG Q+A+VY +CSAFL+N D
Sbjct: 325 QPKYGHLKELHKAIKMCEKALISTDPV-VTSLGNFQQAYVYTT----ESGDCSAFLSNYD 379
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
++A V F Y LPPWSVSILPDCRN VFNTAKV QTS Q
Sbjct: 380 SKSSARVMFNNMHYNLPPWSVSILPDCRNAVFNTAKVGVQTS-----------------Q 422
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
M+ + +S SW + +E S T G+LE +NVT+D SDYLW+IT + V
Sbjct: 423 MQMLPT--NSERFSWESFEEDTSSSSATTITASGLLEQINVTRDTSDYLWYITSVDVGSS 480
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLI 581
+ SF ++ P++ + S + VFING+L+GS G + V ++G N +
Sbjct: 481 E-SFLHGGKL-PSLIVQSTGHAVHVFINGRLSGSAYGTREDRRFRYTGDVNLRAGTNTIA 538
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS VGL N G E G G V + G G +DLS WTYQVGLKGE + S +
Sbjct: 539 LLSVAVGLPNVGGHFETWNTGILGPVVIHGLDKGKLDLSWQKWTYQVGLKGEAMNLASPD 598
Query: 642 E-NEAEWTD---LTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
+ EW + + P TW+KT+FDAP+G +P+ALD+ MGKGQ W+NG IGRY
Sbjct: 599 GISSVEWMQSAVVVQRNQP--LTWHKTFFDAPEGEEPLALDMDGMGKGQIWINGISIGRY 656
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
WT +A G C D C+Y G++ KC CG PTQ WYHVPRSWL+ ++NLLV+FEE GG+
Sbjct: 657 WTAIA-TGSCND-CNYAGSFRPPKCQLGCGQPTQRWYHVPRSWLKQNHNLLVVFEELGGD 714
Query: 758 PFEISVKLRSTRIVCEQVSESHYPPVRKWS-NSYSVDGKLSINKMAPEMHLHCQDGYIIS 816
P +IS+ RS VC VSE H P ++ W +SY GK S N P++HLHC G IS
Sbjct: 715 PSKISLAKRSVSSVCADVSEYH-PNLKNWHIDSY---GK-SENFRPPKVHLHCNPGQAIS 769
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
SI+FAS+GTP G C + +G CH+ S ++ +
Sbjct: 770 SIKFASFGTPLGTCGSYEQGACHSSSSYDILEQ 802
>gi|2961390|emb|CAA18137.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 853
Score = 818 bits (2112), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/816 (51%), Positives = 526/816 (64%), Gaps = 51/816 (6%)
Query: 42 FKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNA 101
F V+YD +A++I+G RR+L S IHYPR+TP+MW DLI K+K+GG DVIETYVFWN
Sbjct: 28 FVQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGIDVIETYVFWNL 87
Query: 102 HESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFR 161
HE G+Y+F+G+ND+V+FVK + +GLY LRIGPYVCAEWNFGGFPVWL+ +PGI FR
Sbjct: 88 HEPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFR 147
Query: 162 TNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKW 221
T+N PFK M+ F ++IV+LM+ E LF QGGPII+ QIENEYG G +G +Y+ W
Sbjct: 148 TDNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQLLGAEGHNYMTW 207
Query: 222 AASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTW 281
AA MA+ GVPWVMCK+ DAP+ +I+ CNG+YCD + PN KP +WTE W GW+T +
Sbjct: 208 AAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNKPYKPLIWTEAWSGWFTEF 267
Query: 282 GGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEY 341
GG + HRPV+DLAF VARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEY
Sbjct: 268 GGPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEY 327
Query: 342 GLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYR---ANRYGSQS-NC 397
GL+ +PK+GHLK+LH AIK+CE ALV+AD +G Q+ +Y A+ Y ++S +C
Sbjct: 328 GLIRQPKYGHLKELHRAIKMCEKALVSADPV-VTSIGNKQQVWIYYERFAHVYSAESGDC 386
Query: 398 SAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPL 457
SAFLAN D +AA V F Y LPPWS+SILPDCRN VFNTAKVS
Sbjct: 387 SAFLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVS-------------- 432
Query: 458 SPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHI 517
+ +S +E LSS S + FT G+LE +NVT+D SDYLW++
Sbjct: 433 ----NFQWESYLED-LSSLDDS-------------STFTTHGLLEQINVTRDTSDYLWYM 474
Query: 518 TQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEF 573
T + + D + SF E+ PT+ I S + +F+NGQL+GS G +
Sbjct: 475 TSVDIGDSE-SFLHGGEL-PTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINL 532
Query: 574 QSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE 633
SG N + LLS VGL N G E G G V L G G +DLS WTYQVGLKGE
Sbjct: 533 HSGTNRIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGKMDLSWQKWTYQVGLKGE 592
Query: 634 FQQI-YSIEENEAEWTDLTRD-GIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNG 691
+ + W D + P TW+KTYFDAP+G +P+ALD+ MGKGQ WVNG
Sbjct: 593 AMNLAFPTNTPSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNG 652
Query: 692 HHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIF 751
IGRYWT A G C C Y G Y +KC T CG PTQ WYHVPR+WL+ S NLLVIF
Sbjct: 653 ESIGRYWTAFA-TGDCSH-CSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVIF 710
Query: 752 EETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQD 811
EE GGNP +S+ RS VC +VSE H P ++ W G+ P++HL C
Sbjct: 711 EELGGNPSTVSLVKRSVSGVCAEVSEYH-PNIKNWQIESYGKGQ---TFHRPKVHLKCSP 766
Query: 812 GYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
G I+SI+FAS+GTP G C + +G CHA S +++
Sbjct: 767 GQAIASIKFASFGTPLGTCGSYQQGECHAATSYAIL 802
>gi|222618730|gb|EEE54862.1| hypothetical protein OsJ_02342 [Oryza sativa Japonica Group]
Length = 839
Score = 818 bits (2112), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/819 (50%), Positives = 525/819 (64%), Gaps = 65/819 (7%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPE------------MWPDLIAKSKEGGADVIET 95
+YD +A++++G RR+LIS IHYPR+TPE MWPDLI K+K+GG DV++T
Sbjct: 27 TYDRKAVVVNGQRRILISGSIHYPRSTPEARRTRFPFLLLTMWPDLIEKAKDGGLDVVQT 86
Query: 96 YVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDI 155
YVFWN HE GQY F+G+ D+V F+KLV +GLY+ LRIGPYVCAEWNFGGFPVWL+ +
Sbjct: 87 YVFWNGHEPSPGQYYFEGRYDLVHFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYV 146
Query: 156 PGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQG 215
PGI FRT+N PFK EMQ+F KIV++M+ E LF WQGGPII+ QIENE+G +E G+
Sbjct: 147 PGISFRTDNEPFKAEMQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPA 206
Query: 216 KDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWD 275
K Y WAA+MA+ L VPW+MCK+ DAP+ II+ CNG+YCD + PN +KPT+WTE W
Sbjct: 207 KAYASWAANMAVALNTSVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWT 266
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYD 335
WYT +G +PHRPVEDLA+ VA+F Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYD
Sbjct: 267 AWYTGFGIPVPHRPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYD 326
Query: 336 APIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQS 395
APIDEYGLL EPKWGHLK LH AIKLCEPALVA D LG Q++ V+R+ S
Sbjct: 327 APIDEYGLLREPKWGHLKQLHKAIKLCEPALVAGDPI-VTSLGNAQKSSVFRS----STG 381
Query: 396 NCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSL 455
C+AFL N D+ + A V F G Y LPPWS+SILPDC+ TVFNTA+V SQ S +E++
Sbjct: 382 ACAAFLENKDKVSYARVAFNGMHYDLPPWSISILPDCKTTVFNTARVGSQISQMKMEWAG 441
Query: 456 PLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLW 515
+W + E I + E+ T G+LE +NVT+D +DYLW
Sbjct: 442 GF---------------------AWQSYNEEINSFGEDPLTTVGLLEQINVTRDNTDYLW 480
Query: 516 HITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQP----- 570
+ T + V+ D+ +N +T+ S L +FINGQL G+V G V P
Sbjct: 481 YTTYVDVAQDEQFL--SNGENLKLTVMSAGHALHIFINGQLKGTVYG---SVDDPKLTYT 535
Query: 571 --VEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQV 628
V+ +G N + LS VGL N G E AG G V L G G DL+ WTYQV
Sbjct: 536 GNVKLWAGSNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNEGRRDLTWQKWTYQV 595
Query: 629 GLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQA 687
GLKGE ++S+ + EW + + TWYK +F+APDG +P+ALD+ SMGKGQ
Sbjct: 596 GLKGESMSLHSLSGSSTVEWGEPVQK---QPLTWYKAFFNAPDGDEPLALDMSSMGKGQI 652
Query: 688 WVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNL 747
W+NG IGRYW G C TCDYRG Y+ KC TNCG+ +Q WYHVPRSWL + NL
Sbjct: 653 WINGQGIGRYWPGYKASGNC-GTCDYRGEYDETKCQTNCGDSSQRWYHVPRSWLSPTGNL 711
Query: 748 LVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHL 807
LVIFEE GG+P IS+ RS VC VSE P ++ W K+ HL
Sbjct: 712 LVIFEEWGGDPTGISMVKRSIGSVCADVSEWQ-PSMKNWHTKDYEKAKV---------HL 761
Query: 808 HCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSV 846
C +G I+ I+FAS+GTPQG C ++ G CHA S +
Sbjct: 762 QCDNGQKITEIKFASFGTPQGSCGSYTEGGCHAHKSYDI 800
>gi|255538780|ref|XP_002510455.1| beta-galactosidase, putative [Ricinus communis]
gi|223551156|gb|EEF52642.1| beta-galactosidase, putative [Ricinus communis]
Length = 846
Score = 818 bits (2112), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 415/813 (51%), Positives = 531/813 (65%), Gaps = 46/813 (5%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD +AIII+G RR+LIS IHYPR+TPEMW DLI K+K+GG DVI+TYVFW+ HE+
Sbjct: 28 VTYDKKAIIINGQRRILISGSIHYPRSTPEMWEDLIQKAKDGGLDVIDTYVFWDVHETSP 87
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G YNF G+ D+V+F+K V GLY LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N P
Sbjct: 88 GNYNFDGRYDLVRFIKTVQKVGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 147
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ F +KIV +M+ E LF+ QGGPII+ QIENEYG + G G+ Y+ WAA MA
Sbjct: 148 FKAAMQGFTQKIVQMMKNENLFASQGGPIILSQIENEYGPESRALGAAGRSYINWAAKMA 207
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+GL GVPWVMCK+ DAP+ +I+ CNG+YCD + PN KPTLWTE W GW+T +GG +
Sbjct: 208 VGLDTGVPWVMCKEDDAPDPMINTCNGFYCDAFAPNKPYKPTLWTEAWSGWFTEFGGPIH 267
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
RPVEDLAFAVARF Q+GGS+ NYYMY GGTNFGR++GGPF TSYDYDAPIDEYGL+ E
Sbjct: 268 QRPVEDLAFAVARFIQKGGSYFNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGLIRE 327
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PK+GHLK LH AIKLCE ALV++D + LG Q+AHV+ + R +C+AFLAN +
Sbjct: 328 PKYGHLKALHKAIKLCEHALVSSDPS-ITSLGTYQQAHVFSSGR-----SCAAFLANYNA 381
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+AA V F Y LPPWS+SILPDCRN VFNTA+V +QT + + P
Sbjct: 382 KSAARVMFNNMHYDLPPWSISILPDCRNVVFNTARVGAQT------LRMQMLPT------ 429
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENN-FTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
S SW T E I ++++ T G+LE +NVT+D SDYLW++T + +S
Sbjct: 430 -------GSELFSWETYDEEISSLTDSSRITALGLLEQINVTRDTSDYLWYLTSVDISPS 482
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLI 581
+ +F + N +P++T+ S L VFINGQ +GS G + PV ++G N +
Sbjct: 483 E-AFLR-NGQKPSLTVQSAGHGLHVFINGQFSGSAFGTRENRQLTFTGPVNLRAGTNRIA 540
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS VGL N G E G +G V L G G DL+ W+YQVGLKGE + +
Sbjct: 541 LLSIAVGLPNVGLHYETWKTGVQGPVLLNGLNQGKKDLTWQKWSYQVGLKGEAMNL--VS 598
Query: 642 ENEAEWTDLTRDGIPST----FTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
N D + S+ W+K YFDAP G +P+ALD+ SMGKGQ W+NG IGRY
Sbjct: 599 PNGVSSVDWIEGSLASSQGQALKWHKAYFDAPRGNEPLALDMRSMGKGQVWINGQSIGRY 658
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
W A KG C ++C Y + KC CG PTQ WYHVPRSWL+ + NLLV+FEE GG+
Sbjct: 659 WMAYA-KGDC-NSCSYIWTFRPSKCQLGCGEPTQRWYHVPRSWLKPTKNLLVVFEELGGD 716
Query: 758 PFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKM-APEMHLHCQDGYIIS 816
+IS+ RS VC E H+P + +Y+ G +K+ ++HL C G I+
Sbjct: 717 ASKISLVKRSIEGVCADAYE-HHPATK----NYNTGGNDESSKLHQAKIHLRCAPGQFIA 771
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+I+FAS+GTP G C F +G CHAP + SV+ +
Sbjct: 772 AIKFASFGTPSGTCGSFQQGTCHAPNTHSVIEK 804
>gi|326512146|dbj|BAJ96054.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 847
Score = 817 bits (2111), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 411/810 (50%), Positives = 527/810 (65%), Gaps = 43/810 (5%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD +A++I+G RR+L S IHYPR+TPEMW LI K+K+GG DVI+TYVFWN HE
Sbjct: 32 VTYDRKAVLINGQRRILFSGSIHYPRSTPEMWEGLIQKAKDGGLDVIQTYVFWNGHEPTP 91
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G YNF+G+ D+VKF+K +GL++ LRIGPY+C EWNFGGFPVWL+ +PGI FRT+N P
Sbjct: 92 GSYNFEGRYDLVKFIKTAQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 151
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ F +KIV +M+ E LF+ QGGPII+ QIENEYG E +G GK Y WAA MA
Sbjct: 152 FKAAMQGFTEKIVGMMKSEELFASQGGPIILSQIENEYGPEEKEFGAAGKSYSDWAAKMA 211
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+GL GVPWVMCKQ DAP+ +I+ACNG+YCD + PN+ +KPT+WTE W GW+T +GG +
Sbjct: 212 VGLDTGVPWVMCKQEDAPDPVINACNGFYCDAFTPNTPSKPTMWTEAWTGWFTEFGGTIR 271
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
RPVEDL+FAVARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL E
Sbjct: 272 KRPVEDLSFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 331
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PK+GHLK+LH AIKLCE ALV+ D LG QEAHVYR S S C+AFLAN +
Sbjct: 332 PKYGHLKELHKAIKLCEQALVSVDPT-VTSLGSMQEAHVYR-----SPSGCAAFLANYNS 385
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
++ A + F + Y+LPPWS+SILPDC+ V+NTA V QTS Q
Sbjct: 386 NSHAKIVFDNEHYSLPPWSISILPDCKTVVYNTATVGVQTS-----------------QM 428
Query: 467 SMIESKLSSTSKSWMTVKEPIG-VWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
M SS W E +G + + T G+LE LN T+D SDYLW++T + VS
Sbjct: 429 QMWSDGASSM--MWERYDEEVGSLAAAPLLTTTGLLEQLNATRDTSDYLWYMTSVDVSPS 486
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLI 581
+ S + ++T+ S L +F+NGQL GS G + V+ ++G N +
Sbjct: 487 EKSLQGGKPL--SLTVQSAGHALHIFVNGQLQGSASGTREDKRISYKGDVKLRAGTNKIS 544
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS GL N G E G G V L G G DL+ WTYQVGLKGE + S+E
Sbjct: 545 LLSVACGLPNIGVHYETWNTGVNGPVVLHGLDEGSRDLTWQTWTYQVGLKGEQMNLNSLE 604
Query: 642 -ENEAEWTD---LTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
+ EW + ++ +P WY+ YFD P G +P+ALD+GSMGKGQ W+NG IGRY
Sbjct: 605 GASSVEWMQGSLIAQNQMP--LAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRY 662
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
++ G C+D C Y G++ + KC CG PTQ WYHVP+SWLQ + NLLV+FEE GG+
Sbjct: 663 -SLAYATGDCKD-CSYTGSFRAIKCQAGCGQPTQRWYHVPKSWLQPTRNLLVVFEELGGD 720
Query: 758 PFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISS 817
+IS+ RS VC VSE H P ++ W S + K + + ++HL C G IS+
Sbjct: 721 TSKISLVKRSVSNVCADVSEFH-PSIKNWQTENSGEAKPELRR--SKVHLRCAPGQSISA 777
Query: 818 IEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
I+FAS+GTP G C F +G CH+ S +V+
Sbjct: 778 IKFASFGTPLGTCGSFEQGQCHSTKSQTVL 807
>gi|61162196|dbj|BAD91080.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 851
Score = 817 bits (2110), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/811 (51%), Positives = 523/811 (64%), Gaps = 30/811 (3%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NVSYD R++IIDG R++LISA IHYPR+ PEMWP L+ +KEGG DVIETYVFWN HE
Sbjct: 28 NVSYDSRSLIIDGQRKLLISAAIHYPRSVPEMWPKLVQTAKEGGVDVIETYVFWNGHEPS 87
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G Y F G+ D+VKFVK+V +G++L LRIGP+V AEW FGG PVWL +PG FRT N
Sbjct: 88 PGNYYFGGRYDLVKFVKIVEQAGMHLILRIGPFVAAEWYFGGIPVWLHYVPGTVFRTENK 147
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ+F IVDLM++E F+ QGGPII+ Q+ENEYG E YG+ GK Y WAASM
Sbjct: 148 PFKYHMQKFTTFIVDLMKQEKFFASQGGPIILAQVENEYGYYEKDYGEGGKQYAMWAASM 207
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ GVPW+MC+Q DAPE++I+ CN +YCD + P NKP +WTENW GW+ T+GG
Sbjct: 208 AVSQNIGVPWIMCQQFDAPESVINTCNSFYCDQFTPIYQNKPKIWTENWPGWFKTFGGWN 267
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRP ED+AF+VARFFQ+GGS NYYMY GGTNFGRTSGGPF TSYDY+APIDEYGL
Sbjct: 268 PHRPAEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGLPR 327
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
PKWGHLK LH AIKLCE ++ + + LG + EA V+ S C+AF+AN+D
Sbjct: 328 LPKWGHLKQLHRAIKLCEHIMLNSQPTN-VSLGPSLEADVFT----NSSGACAAFIANMD 382
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ +V F SY LP WSVSILPDC+N VFNTAKV SQ+S+ VE LP S +SV
Sbjct: 383 DKNDKTVEFRNMSYHLPAWSVSILPDCKNVVFNTAKVGSQSSV--VEM-LPESLQLSVGS 439
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
+ S W E G+W E +F G+++H+N TK +DYLW+ T I V ++
Sbjct: 440 -----ADKSLKDLKWDVFVEKAGIWGEADFVKSGLVDHINTTKFTTDYLWYTTSILVGEN 494
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWV----KVVQPVEFQSGYNDLI 581
+ F K P + I+S + F+N +L S G+ K+ P+ + G ND+
Sbjct: 495 E-EFLKKGS-SPVLLIESKGHAVHAFVNQELQASAAGNGTHFPFKLKAPISLKEGKNDIA 552
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS TVGLQN G+F E GAG VK+ GF NG IDLS WTY++GL+GE Q + E
Sbjct: 553 LLSMTVGLQNAGSFYEWVGAGLT-SVKIQGFNNGTIDLSAYNWTYKIGLEGEHQGLDKEE 611
Query: 642 E-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
W + TWYK D P G DPV LD+ MGKG AW+NG IGRYW
Sbjct: 612 GFGNVNWISASEPPKEQPLTWYKVIVDPPPGDDPVGLDMIHMGKGLAWLNGEEIGRYWPR 671
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
P GC C+YRG ++ DKC T CG PTQ WYHVPRSW + S N+LVIFEE GG+P +
Sbjct: 672 KGPLHGCVKECNYRGKFDPDKCNTGCGEPTQRWYHVPRSWFKQSGNVLVIFEEKGGDPSK 731
Query: 761 ISVKLRSTRIVCEQVSESHYPPV--RKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSI 818
I R VC V+E +YP + W+ DG S NK +HL C + ISS+
Sbjct: 732 IEFSRRKITGVCALVAE-NYPSIDLESWN-----DGSGS-NKTVATIHLGCPEDTHISSV 784
Query: 819 EFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+FAS+G P G C+ +++G+CH P S+SVV +
Sbjct: 785 KFASFGNPTGACRSYTQGDCHDPNSISVVEK 815
>gi|326515822|dbj|BAK07157.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 847
Score = 815 bits (2106), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/810 (50%), Positives = 526/810 (64%), Gaps = 43/810 (5%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD +A++I+G RR+L S IHYPR+TPEMW LI K+K+GG DVI+TYVFWN HE
Sbjct: 32 VTYDRKAVLINGQRRILFSGSIHYPRSTPEMWEGLIQKAKDGGLDVIQTYVFWNGHEPTP 91
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G YNF+G+ D+VKF+K +GL++ LRIGPY+C EWNFGGFPVWL+ +PGI FRT+N P
Sbjct: 92 GSYNFEGRYDLVKFIKTAQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 151
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ F +KIV +M+ E LF+ QGGPII+ QIENEYG E +G GK Y WAA MA
Sbjct: 152 FKAAMQGFTEKIVGMMKSEELFASQGGPIILSQIENEYGPEEKEFGAAGKSYSDWAAKMA 211
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+GL GVPWVMCKQ DAP+ +I+ACNG+YCD + PN+ +KPT+WTE W GW+T +GG +
Sbjct: 212 VGLDTGVPWVMCKQEDAPDPVINACNGFYCDAFTPNTPSKPTMWTEAWTGWFTEFGGTIR 271
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
RPVEDL+FAVARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL E
Sbjct: 272 KRPVEDLSFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 331
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PK+GHLK+LH AIKLCE ALV+ D LG QEAHVYR S S C+AFLAN +
Sbjct: 332 PKYGHLKELHKAIKLCEQALVSVDPT-VTSLGSMQEAHVYR-----SPSGCAAFLANYNS 385
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
++ A + F + Y+LPPWS+SILPDC+ V+NTA V QTS Q
Sbjct: 386 NSHAKIVFDNEHYSLPPWSISILPDCKTVVYNTATVGVQTS-----------------QM 428
Query: 467 SMIESKLSSTSKSWMTVKEPIG-VWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
M SS W E +G + + T G+LE LN T+D SDYLW++T + VS
Sbjct: 429 QMWSDGASSM--MWERYDEEVGSLAAAPLLTTTGLLEQLNATRDTSDYLWYMTSVDVSPS 486
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLI 581
+ S + ++T+ S L +F+NGQL GS G + V+ ++G N +
Sbjct: 487 EKSLQGGKPL--SLTVQSAGHALHIFVNGQLQGSASGTREDKRISYKGDVKLRAGTNKIS 544
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS GL N G E G G V L G G DL+ WTYQVGLKGE + S+E
Sbjct: 545 LLSVACGLPNIGVHYETWNTGVNGPVVLHGLDEGSRDLTWQTWTYQVGLKGEQMNLNSLE 604
Query: 642 -ENEAEWTD---LTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
+ EW + ++ +P WY+ YFD P G +P+ALD+GSMGKGQ W+NG IGRY
Sbjct: 605 GASSVEWMQGSLIAQNQMP--LAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRY 662
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
++ G C+D C Y G++ + KC CG PTQ WYHVP+ WLQ + NLLV+FEE GG+
Sbjct: 663 -SLAYATGDCKD-CSYTGSFRAIKCQAGCGQPTQRWYHVPKPWLQPTRNLLVVFEELGGD 720
Query: 758 PFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISS 817
+IS+ RS VC VSE H P ++ W S + K + + ++HL C G IS+
Sbjct: 721 TSKISLVKRSVSNVCADVSEFH-PSIKNWQTENSGEAKPELRR--SKVHLRCAPGQSISA 777
Query: 818 IEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
I+FAS+GTP G C F +G CH+ S +V+
Sbjct: 778 IKFASFGTPLGTCGSFEQGQCHSTKSQTVL 807
>gi|224094887|ref|XP_002310279.1| predicted protein [Populus trichocarpa]
gi|222853182|gb|EEE90729.1| predicted protein [Populus trichocarpa]
Length = 847
Score = 814 bits (2103), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 416/835 (49%), Positives = 534/835 (63%), Gaps = 49/835 (5%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
++ +++ L C S V+YD +AI+I+G RR+L S IHYPR+TP+MW DLI
Sbjct: 12 LVFLVVFLGCSELIQCS-------VTYDRKAIMINGQRRILFSGSIHYPRSTPDMWEDLI 64
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K+K+GG DVIETYVFWN HE G Y+F+G+ DIV+F+K + +GLY LRIGPYVCAE
Sbjct: 65 QKAKDGGIDVIETYVFWNVHEPTPGNYHFEGRYDIVRFMKTIQRAGLYAHLRIGPYVCAE 124
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
WNFGGFPVWL+ +PGI FRT+N PFK MQ F +KIV LM+ E LF QGGPII+ QIEN
Sbjct: 125 WNFGGFPVWLKYVPGISFRTDNEPFKRAMQGFTEKIVGLMKAENLFESQGGPIILSQIEN 184
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPN 262
EYG +G G +Y+ WAA+MA+ G GVPWVMCK+ DAP+ +I+ CNG+YCD + PN
Sbjct: 185 EYGVQSKLFGAAGYNYMTWAANMAIQTGTGVPWVMCKEDDAPDPVINTCNGFYCDSFAPN 244
Query: 263 SYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRT 322
KPT+WTE W GW++ +GG + RPV+DLAFAVA+F Q+GGSF+NYYM+ GGTNFGR+
Sbjct: 245 KPYKPTIWTEAWSGWFSEFGGTIHQRPVQDLAFAVAKFIQKGGSFINYYMFHGGTNFGRS 304
Query: 323 SGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQE 382
+GGPF TSYDYDAPIDEYGL+ +PK+GHLK+LH +IK+CE ALV+ D +LG Q+
Sbjct: 305 AGGPFITTSYDYDAPIDEYGLIRQPKYGHLKELHRSIKMCERALVSVDPI-VTQLGTYQQ 363
Query: 383 AHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKV 442
HVY +C+AFLAN D +AA V F Y LPPWS+SILPDCRN VFNTAKV
Sbjct: 364 VHVYSTE----SGDCAAFLANYDTKSAARVLFNNMHYNLPPWSISILPDCRNVVFNTAKV 419
Query: 443 SSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENN-FTVQGIL 501
QTS Q M+ ++ SW + E I +++ FT G+L
Sbjct: 420 GVQTS-----------------QMEMLP---TNGIFSWESYDEDISSLDDSSTFTTAGLL 459
Query: 502 EHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVI 561
E +NVT+D SDYLW++T + + + SF E+ PT+ I S + +FINGQL+GS
Sbjct: 460 EQINVTRDASDYLWYMTSVDIGSSE-SFLHGGEL-PTLIIQSTGHAVHIFINGQLSGSAF 517
Query: 562 G----HWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI 617
G V + G N + LLS VGL N G E G G V L G G
Sbjct: 518 GTRENRRFTYTGKVNLRPGTNRIALLSVAVGLPNVGGHYESWNTGILGPVALHGLDQGKW 577
Query: 618 DLSKILWTYQVGLKGEFQQIYSIEE-NEAEWTDLTRDG-IPSTFTWYKTYFDAPDGIDPV 675
DLS WTYQVGLKGE + S + EW + P TW+K YF+AP+G +P+
Sbjct: 578 DLSWQKWTYQVGLKGEAMNLLSPDSVTSVEWMQSSLAAQRPQPLTWHKAYFNAPEGDEPL 637
Query: 676 ALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYH 735
ALD+ MGKGQ W+NG IGRYWT A G C + C Y G + KC CG PTQ WYH
Sbjct: 638 ALDMEGMGKGQIWINGQSIGRYWTAYA-SGNC-NGCSYAGTFRPTKCQLGCGQPTQRWYH 695
Query: 736 VPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWS-NSYSVDG 794
VPRSWL+ +NNLLV+FEE GG+P IS+ RS VC +VSE H P ++ W SY
Sbjct: 696 VPRSWLKPTNNLLVVFEELGGDPSRISLVKRSLASVCAEVSEFH-PTIKNWQIESYGRAE 754
Query: 795 KLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+ +P++HL C G I+SI+FAS+GTP G C + +G CHA S +++ +
Sbjct: 755 EFH----SPKVHLRCSGGQSITSIKFASFGTPLGTCGSYQQGACHASTSYAILEK 805
>gi|356508931|ref|XP_003523206.1| PREDICTED: beta-galactosidase 10-like [Glycine max]
Length = 843
Score = 812 bits (2098), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 406/811 (50%), Positives = 520/811 (64%), Gaps = 31/811 (3%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NVSYD R+++IDG R++LISA IHYPR+ P MWP L+ +KEGG DVIETYVFWN HE
Sbjct: 21 NVSYDGRSLLIDGQRKLLISASIHYPRSVPAMWPGLVQTAKEGGVDVIETYVFWNGHELS 80
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G Y F G+ D+VKF K V +G+YL LRIGP+V AEWNFGG PVWL +PG FRT N
Sbjct: 81 PGNYYFGGRFDLVKFAKTVQQAGMYLILRIGPFVAAEWNFGGVPVWLHYVPGTVFRTYNQ 140
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PF MQ+F IV+LM++E LF+ QGGPII+ QIENEYG E+ Y + GK Y WAA M
Sbjct: 141 PFMYHMQKFTTYIVNLMKQEKLFASQGGPIILSQIENEYGYYENFYKEDGKKYALWAAKM 200
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ GVPW+MC+Q DAP+ +ID CN +YCD + P S N+P +WTENW GW+ T+GGR
Sbjct: 201 AVSQNTGVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPNRPKIWTENWPGWFKTFGGRD 260
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRP ED+AF+VARFFQ+GGS NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL
Sbjct: 261 PHRPAEDVAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYDAPVDEYGLPR 320
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
PKWGHLK+LH AIKLCE L+ S I LG + EA VY S C+AF++N+D
Sbjct: 321 LPKWGHLKELHRAIKLCEHVLLNGKSVN-ISLGPSVEADVYT----DSSGACAAFISNVD 375
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ +V F SY LP WSVSILPDC+N VFNTAKV+SQT++ +++
Sbjct: 376 DKNDKTVEFRNASYHLPAWSVSILPDCKNVVFNTAKVTSQTNV------------VAMIP 423
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
+S+ +S S W VKE G+W + +F G ++ +N TKD +DYLWH T I+VS++
Sbjct: 424 ESLQQSDKGVNSLKWDIVKEKPGIWGKADFVKSGFVDLINTTKDTTDYLWHTTSIFVSEN 483
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWV----KVVQPVEFQSGYNDLI 581
+ F K +P + I+S L F+N + G+ G+ P+ ++G N++
Sbjct: 484 E-EFLKKGS-KPVLLIESTGHALHAFVNQEYQGTGTGNGTHSPFSFKNPISLRAGKNEIA 541
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LL TVGLQ G F + GAG VK+ G KNG IDLS WTY++G++GE+ ++Y
Sbjct: 542 LLCLTVGLQTAGPFYDFIGAGLT-SVKIKGLKNGTIDLSSYAWTYKIGVQGEYLRLYQGN 600
Query: 642 E-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
N+ WT + TWYK DAP G +PV LD+ MGKG AW+NG IGRYW
Sbjct: 601 GLNKVNWTSTSEPQKMQPLTWYKAIVDAPPGDEPVGLDMLHMGKGLAWLNGEEIGRYWPR 660
Query: 701 VA--PKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
+ C CDYRG +N DKC T CG PTQ WYHVPRSW + S N+LV+FEE GG+P
Sbjct: 661 KSEFKSEDCVKECDYRGKFNPDKCDTGCGEPTQRWYHVPRSWFKPSGNILVLFEEKGGDP 720
Query: 759 FEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSI 818
+I R C V+E YP V S + K+ NK P HL C IS++
Sbjct: 721 EKIKFVRRKVSGACALVAED-YPSVGLLSQG---EDKIQNNKNVPFAHLTCPSNTRISAV 776
Query: 819 EFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+FAS+GTP G C + +G+CH P S ++V +
Sbjct: 777 KFASFGTPSGSCGSYLKGDCHDPNSSTIVEK 807
>gi|414881557|tpg|DAA58688.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 830
Score = 811 bits (2096), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 411/807 (50%), Positives = 516/807 (63%), Gaps = 53/807 (6%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
+YD +A++++G RR+L+S IHYPR+ PEMWPDLI K+K+GG DV++TYVFWN HE R
Sbjct: 30 TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
QY F+G+ D+V F+KLV +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N PF
Sbjct: 90 QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPF 149
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
K EMQ F KIVD+M+ E LF WQGGPII+ QIENE+G +E G+ K Y WAA+MA+
Sbjct: 150 KAEMQNFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 209
Query: 228 GLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPH 287
L VPWVMCK+ DAP+ II+ CNG+YCD + PN +KPT+WTE W WYT +G +PH
Sbjct: 210 ALNTSVPWVMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPH 269
Query: 288 RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEP 347
RPVEDLA+ VA+F Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGLL EP
Sbjct: 270 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLREP 329
Query: 348 KWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEH 407
KWGHLK+LH AIKLCEPALVA D LG Q+A V+R+ S C AFL N D+
Sbjct: 330 KWGHLKELHKAIKLCEPALVAGDPI-VTSLGNAQQASVFRS----STDACVAFLENKDKV 384
Query: 408 TAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQS 467
+ A V+F G Y LPPWS+SILPDC+ TV+NTA V SQ S +E++
Sbjct: 385 SYARVSFNGMHYDLPPWSISILPDCKTTVYNTASVGSQISQMKMEWAGGF---------- 434
Query: 468 MIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDI 527
+W + E I + +F G+LE +NVT+D +DYLW+ T + ++ D+
Sbjct: 435 -----------TWQSYNEDINSLGDESFATVGLLEQINVTRDNTDYLWYTTYVDIAQDEQ 483
Query: 528 SFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQP-------VEFQSGYNDL 580
+N P +T+ S L +F+NGQLTG+V G V P V+ SG N +
Sbjct: 484 FL--SNGKNPMLTVMSAGHALHIFVNGQLTGTVYG---SVEDPKLTYSGNVKLWSGSNTI 538
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE-FQQIYS 639
LS VGL N G E AG G V L G G DL+ WTY+VGLKGE
Sbjct: 539 SCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNEGRRDLTWQKWTYKVGLKGEALSLHSL 598
Query: 640 IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
+ EW + + +WYK +F+APDG +P+ALD+ SMGKGQ W+NG IGRYW
Sbjct: 599 SGSSSVEWGEPVQK---QPLSWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWP 655
Query: 700 VVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPF 759
G C CDYRG Y+ KC TNCG+ +Q WYHVPRSWL + NLLVIFEE GG+P
Sbjct: 656 GYKASGTC-GICDYRGEYDEKKCQTNCGDSSQRWYHVPRSWLNPTGNLLVIFEEWGGDPT 714
Query: 760 EISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIE 819
IS+ R +C VSE P + W K+ HL C G ++ I+
Sbjct: 715 GISMVKRIAGSICADVSEWQ-PSMANWRTKGYEKAKV---------HLQCDHGRKMTHIK 764
Query: 820 FASYGTPQGRCQKFSRGNCHAPMSLSV 846
FAS+GTPQG C +S G CHA S +
Sbjct: 765 FASFGTPQGSCGSYSEGGCHAHKSYDI 791
>gi|61162208|dbj|BAD91085.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 848
Score = 811 bits (2094), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 411/825 (49%), Positives = 531/825 (64%), Gaps = 39/825 (4%)
Query: 32 CVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGAD 91
C+ SS NV YD +A++IDG RR+L S IHYPR+TPEMW LI K+K+GG D
Sbjct: 16 CIVWSSVYVEVTKCNVVYDRKALVIDGQRRLLFSGSIHYPRSTPEMWEGLIQKAKDGGLD 75
Query: 92 VIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVW 151
I+TYVFWN HE G YNF+G+ND+V+F+K V +GLY+ LRIGPY+C+EWNFGGFPVW
Sbjct: 76 AIDTYVFWNLHEPSPGNYNFEGRNDLVRFIKTVHKAGLYVHLRIGPYICSEWNFGGFPVW 135
Query: 152 LRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSY 211
L+ +PGI FRT+N PFK MQ+F +K+V LM+ E LF QGGPII+ QIENEY ++
Sbjct: 136 LKFVPGISFRTDNEPFKSAMQKFTQKVVQLMKNEKLFESQGGPIILSQIENEYEPESKAF 195
Query: 212 GQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWT 271
G G Y+ WAA MA+G+G GVPWVMCK+ DAP+ +I+ CNG+YCD + PN KPT+WT
Sbjct: 196 GASGYAYMTWAAKMAVGMGTGVPWVMCKEDDAPDPVINTCNGFYCDYFSPNKPYKPTMWT 255
Query: 272 ENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITS 331
E W GW+T +GG + RPVEDL FAVARF Q+GGSF+NYYMY GGTNFGRT+GGPF TS
Sbjct: 256 EAWSGWFTEFGGPIYQRPVEDLTFAVARFIQKGGSFINYYMYHGGTNFGRTAGGPFITTS 315
Query: 332 YDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRY 391
YDYDAPIDEYGL+ PK+GHLK+LH A+KLCE AL+ AD LG ++AHV+ +++
Sbjct: 316 YDYDAPIDEYGLIRRPKYGHLKELHKAVKLCELALLNADPT-VTTLGSYEQAHVF-SSKS 373
Query: 392 GSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTV 451
GS + FL+N + +A VTF ++ LPPWS+SILPDC+N FNTA+V QTS
Sbjct: 374 GSG---AVFLSNFNTKSATKVTFNNMNFHLPPWSISILPDCKNVAFNTARVGVQTS---- 426
Query: 452 EFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPI-GVWSENNFTVQGILEHLNVTKDY 510
Q ++ + +S SW E + V + TV G+L+ LN+T+D
Sbjct: 427 -------------QTQLL--RTNSELHSWGIFNEDVSSVAGDTTITVTGLLDQLNITRDS 471
Query: 511 SDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVK 566
SDYLW+ T + + D SF + P++T+ S D + VFIN QL+GS G
Sbjct: 472 SDYLWYTTSVDI-DPSESFLGGGQ-HPSLTVQSAGDAMHVFINDQLSGSASGTREHRRFT 529
Query: 567 VVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTY 626
V +G N + LLS VGL N G E G G V L G +G DLS W+Y
Sbjct: 530 FTGNVNLHAGLNKISLLSIAVGLANNGPHFETRNTGVLGPVALHGLDHGTRDLSWQKWSY 589
Query: 627 QVGLKGEFQQIYSIEENEA-EW-TDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGK 684
QVGLKGE + S A +W T TWYK YFD P+G +P+ALD+GSMGK
Sbjct: 590 QVGLKGEATNLDSPNSISAVDWMTGSLVAQKQQPLTWYKAYFDEPNGDEPLALDMGSMGK 649
Query: 685 GQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQAS 744
GQ W+NG IGRYWT+ A C C Y G + KC C +PTQ WYHVPRSWL+ S
Sbjct: 650 GQVWINGQSIGRYWTIYA-DSDC-SACTYSGTFRPKKCQFGCQHPTQQWYHVPRSWLKPS 707
Query: 745 NNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPE 804
NLLV+FEE GG+ ++++ +S VC +VSE+H P + W G+ + + PE
Sbjct: 708 KNLLVVFEEIGGDVSKVALVKKSVTSVCAEVSENH-PRITNWHT--ESHGQTEV-QQKPE 763
Query: 805 MHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+ LHC DG+ IS+I+F+S+GTP G C KF G CHAP S +V+ +
Sbjct: 764 ISLHCTDGHSISAIKFSSFGTPSGSCGKFQHGTCHAPNSNAVLQK 808
>gi|357113908|ref|XP_003558743.1| PREDICTED: beta-galactosidase 5-like [Brachypodium distachyon]
Length = 839
Score = 810 bits (2093), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/809 (51%), Positives = 524/809 (64%), Gaps = 41/809 (5%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD +A++IDG RR+L S IHYPR+TPEMW L K+K+GG DVI+TYVFWN HE
Sbjct: 27 VTYDKKAVLIDGQRRILFSGSIHYPRSTPEMWEGLFQKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G YNF+G+ D+VKF+K +GL++ LRIGPY+C EWNFGGFPVWL+ +PGI FRT+N P
Sbjct: 87 GNYNFEGRYDLVKFIKTAQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ F +KIV +M+ E LF+ QGGPII+ QIENEYG S+G GK Y WAA MA
Sbjct: 147 FKTAMQGFTEKIVGMMKSEELFASQGGPIILSQIENEYGPEGKSFGAAGKSYSNWAAKMA 206
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+GL GVPWVMCKQ DAP+ +I+ACNG+YCD + PN KPT+WTE W GW+T +GG +
Sbjct: 207 VGLDTGVPWVMCKQDDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWTGWFTEFGGTIR 266
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
RPVEDL+FAVARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL E
Sbjct: 267 KRPVEDLSFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 326
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PK+GHLK+LH A+KLCEPALV+ D A LG QEAHV+R S S+C+AFLAN +
Sbjct: 327 PKYGHLKELHRAVKLCEPALVSVDPA-VTTLGSMQEAHVFR-----SPSSCAAFLANYNS 380
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
++ A+V F + Y+LPPWS+SILPDC+ VFNTA V QTS Q
Sbjct: 381 NSHANVVFNNEHYSLPPWSISILPDCKTVVFNTATVGVQTS-----------------QM 423
Query: 467 SMIESKLSSTSKSWMTVKEPIG-VWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
M +S W E +G + + T G+LE LNVT+D SDYLW+IT + VS
Sbjct: 424 QMWAD--GESSMMWERYDEEVGSLAAAPLLTTTGLLEQLNVTRDSSDYLWYITSVDVSPS 481
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLI 581
+ F + E ++T+ S L +FINGQL GS G ++G N +
Sbjct: 482 E-KFLQGGEPL-SLTVQSAGHALHIFINGQLQGSASGTREAKKFSYKGNANLRAGTNKIA 539
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS GL N G E G G V L G G DL+ W+YQVGLKGE + S+E
Sbjct: 540 LLSIACGLPNVGVHYETWNTGIVGPVVLHGLDVGSRDLTWQTWSYQVGLKGEQMNLNSLE 599
Query: 642 -ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ EW + + +WY+ YFD P G +P+ALD+GSMGKGQ W+NG IGRY T
Sbjct: 600 GASSVEWMQGSLLA-QAPLSWYRAYFDTPTGDEPLALDMGSMGKGQIWINGQSIGRYSTS 658
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
A G C+ C Y G+Y + KC CG PTQ WYHVP+SWLQ S NLLV+FEE GG+ +
Sbjct: 659 YA-SGDCK-ACSYAGSYRAPKCQAGCGQPTQRWYHVPKSWLQPSRNLLVVFEELGGDSSK 716
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEF 820
IS+ RS VC VSE H ++ W ++ + P++HL C G IS+I+F
Sbjct: 717 ISLVKRSVSSVCADVSEYH-TNIKNW----QIENAGEVEFHRPKVHLRCAPGQTISAIKF 771
Query: 821 ASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
AS+GTP G C F +G+CH+ S +V+ +
Sbjct: 772 ASFGTPLGTCGNFQQGDCHSTKSHAVLEK 800
>gi|61162206|dbj|BAD91084.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 852
Score = 810 bits (2091), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/822 (50%), Positives = 529/822 (64%), Gaps = 48/822 (5%)
Query: 38 ASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYV 97
AS V+YD +AI+I+G RR+LIS IHYPR+TPEMW LI K+K+GG DVI+TYV
Sbjct: 21 ASELIHCTTVTYDKKAILINGQRRLLISGSIHYPRSTPEMWEGLIQKAKDGGLDVIDTYV 80
Query: 98 FWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPG 157
FWN HE G Y F+G+ D+V+F+K V +GL+L LRIGPYVCAEWNFGGFPVWL+ +PG
Sbjct: 81 FWNGHEPSPGNYYFEGRYDLVRFIKTVQKAGLFLHLRIGPYVCAEWNFGGFPVWLKYVPG 140
Query: 158 IEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKD 217
I FRT+N PFK MQ F +KIV +M+ E LF+ QGGPII+ QIENEYG + G G++
Sbjct: 141 ISFRTDNGPFKVAMQGFTQKIVQMMKNEKLFASQGGPIILSQIENEYGPERKALGAPGQN 200
Query: 218 YVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGW 277
Y+ WAA MA+GL GVPWVMCK+ DAP+ +I+ACNG+YCDG+ PN KPT+WTE W GW
Sbjct: 201 YINWAAKMAVGLDTGVPWVMCKEDDAPDPMINACNGFYCDGFTPNKPYKPTMWTEAWSGW 260
Query: 278 YTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAP 337
+ +GG + HRPV+DLAFAVARF QRGGS++NYYMY GGTNFGRT+GGPF TSYDYDAP
Sbjct: 261 FLEFGGTIHHRPVQDLAFAVARFIQRGGSYVNYYMYHGGTNFGRTAGGPFITTSYDYDAP 320
Query: 338 IDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNC 397
IDEYGL+ +PK+GHLK+LH AIKLCE +L++++ LG +A+V+ + C
Sbjct: 321 IDEYGLIRQPKYGHLKELHKAIKLCEHSLLSSEPT-VTSLGTYHQAYVFNS----GPRRC 375
Query: 398 SAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPL 457
+AFL+N A VTF + Y LPPWSVSILPDCRN V+NTAKV QTS
Sbjct: 376 AAFLSNFHS-VEARVTFNNKHYDLPPWSVSILPDCRNEVYNTAKVGVQTS---------- 424
Query: 458 SPNISVPQQSMIESKLSSTSKSWMTVKEPI-GVWSENNFTVQGILEHLNVTKDYSDYLWH 516
MI + +S SW T E I V ++ G+LE +NVT+D SDYLW+
Sbjct: 425 -------HVQMIPT--NSRLFSWQTYDEDISSVHERSSIPAIGLLEQINVTRDTSDYLWY 475
Query: 517 ITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVE 572
+T + +S D+S K +PT+T+ S L VF+NGQ +GS G PV
Sbjct: 476 MTNVDISSSDLSGGK----KPTLTVQSAGHALHVFVNGQFSGSAFGTREQRQFTFADPVN 531
Query: 573 FQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKG 632
+G N + LLS VGL N G E G +G V L G NG DL+ W +VGLKG
Sbjct: 532 LHAGINRIALLSIAVGLPNVGLHYESWKTGIQGPVFLDGLGNGKKDLTLHKWFNKVGLKG 591
Query: 633 EFQQIYSIEENEAEWTDLTRDGIPS----TFTWYKTYFDAPDGIDPVALDLGSMGKGQAW 688
E + + N A R + + T WYK YF+AP G +P+ALD+ MGKGQ W
Sbjct: 592 EAMNL--VSPNGASSVGWIRRSLATQTKQTLKWYKAYFNAPGGNEPLALDMRRMGKGQVW 649
Query: 689 VNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLL 748
+NG IGRYW A KG C +C Y G + KC +CG PTQ WYHVPRSWL+ + NL+
Sbjct: 650 INGQSIGRYWMAYA-KGDC-SSCSYIGTFRPTKCQLHCGRPTQRWYHVPRSWLKPTQNLV 707
Query: 749 VIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKM-APEMHL 807
V+FEE GG+P +I++ RS VC + E+H + ++ VDG + ++HL
Sbjct: 708 VVFEELGGDPSKITLVRRSVAGVCGDLHENH-----PNAENFDVDGNEDSKTLHQAQVHL 762
Query: 808 HCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
HC G ISSI+FAS+GTP G C F +G CHA S +VV +
Sbjct: 763 HCAPGQSISSIKFASFGTPSGTCGSFQQGTCHATNSHAVVEK 804
>gi|356564794|ref|XP_003550633.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 839
Score = 810 bits (2091), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/840 (49%), Positives = 535/840 (63%), Gaps = 54/840 (6%)
Query: 7 NRALLQCLALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISA 66
+R L++ L + +++++ + C ++S V+YDH+AI+++G RR+LIS
Sbjct: 3 SRVLIENLPRGNFCTLLLVLWV---CAVTAS---------VTYDHKAIVVNGQRRILISG 50
Query: 67 GIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGS 126
IHYPR+TPEMWPDLI K+K+GG DVI+TYVFWN HE G+Y F+ + D+VKF+KLV
Sbjct: 51 SIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQ 110
Query: 127 SGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEM 186
+GLY+ LRIGPY+CAEWNFGGFPVWL+ +PGI FRT+N PFK MQ+F +KIV +M+EE
Sbjct: 111 AGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSIMKEEK 170
Query: 187 LFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPEN 246
LF QGGPIIM QIENEYG +E G GK Y KW + MA+GL GVPW+MCKQ D P+
Sbjct: 171 LFQTQGGPIIMSQIENEYGPVEWEIGAPGKAYTKWFSQMAVGLDTGVPWIMCKQQDTPDP 230
Query: 247 IIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGS 306
+ID CNGYYC+ + PN KP +WTENW GWYT +GG +P RP ED+AF+VARF Q GGS
Sbjct: 231 LIDTCNGYYCENFTPNKKYKPKMWTENWTGWYTEFGGAVPRRPAEDMAFSVARFVQNGGS 290
Query: 307 FMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
F+NYYMY GGTNF RTS G F TSYDYD PIDEYGLL+EPKWGHL+DLH AIKLCEPAL
Sbjct: 291 FVNYYMYHGGTNFDRTSSGLFIATSYDYDGPIDEYGLLNEPKWGHLRDLHKAIKLCEPAL 350
Query: 367 VAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSV 426
V+ D G N E HV++ + C+AFLAN D ++ASV F Y LPPWS+
Sbjct: 351 VSVDPTVTWP-GNNLEVHVFK-----TSGACAAFLANYDTKSSASVKFGNGQYDLPPWSI 404
Query: 427 SILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEP 486
SILPDC+ VFNTA++ +Q+S+ + + N + QS E SS
Sbjct: 405 SILPDCKTAVFNTARLGAQSSLMKMT-----AVNSAFDWQSYNEEPASSN---------- 449
Query: 487 IGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRD 546
+++ T + E +NVT+D +DYLW++T + + D + F K N P +T+ S
Sbjct: 450 ----EDDSLTAYALWEQINVTRDSTDYLWYMTDVNI-DANEGFIK-NGQSPVLTVMSAGH 503
Query: 547 VLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAG 602
VL V IN QL+G+V G H + V+ + G N + LLS VGL N G E AG
Sbjct: 504 VLHVLINDQLSGTVYGGLDSHKLTFSDSVKLRVGNNKISLLSIAVGLPNVGPHFETWNAG 563
Query: 603 FRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTW 661
G V L G G DLSK W+Y++GLKGE + ++ + EW + W
Sbjct: 564 VLGPVTLKGLNEGTRDLSKQKWSYKIGLKGEALNLNTVSGSSSVEWVQGSLLAKQQPLAW 623
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDK 721
YKT F P G DP+ALD+ SMGKGQAW+NG IGR+W +G C D C Y G Y K
Sbjct: 624 YKTTFSTPAGNDPLALDMISMGKGQAWINGRSIGRHWPGYIARGNCGD-CYYAGTYTDKK 682
Query: 722 CTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYP 781
C TNCG P+Q WYH+PRSWL S N LV+FEE GG+P I++ R+T VC + +
Sbjct: 683 CRTNCGEPSQRWYHIPRSWLNPSGNYLVVFEEWGGDPTGITLVKRTTASVCADIYQGQ-- 740
Query: 782 PVRKWSNSYSVD-GKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHA 840
P K N +D GK+ + P+ HL C G IS I+FASYG PQG C F G+CHA
Sbjct: 741 PTLK--NRQMLDSGKV----VRPKAHLWCPPGKNISQIKFASYGLPQGTCGNFREGSCHA 794
>gi|449459196|ref|XP_004147332.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
gi|449497145|ref|XP_004160325.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 844
Score = 810 bits (2091), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 406/810 (50%), Positives = 522/810 (64%), Gaps = 29/810 (3%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YD R++IIDG+R++LISA IHYPR+ P MWP LI +KEGG DVIETYVFWN HE
Sbjct: 21 NVTYDRRSLIIDGHRKLLISASIHYPRSVPAMWPSLIQNAKEGGVDVIETYVFWNGHELS 80
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
Y+F G+ D+VKF+ +V ++GLYL LRIGP+V AEWNFGG PVWL IP FRT+NA
Sbjct: 81 PDNYHFDGRFDLVKFINIVHNAGLYLILRIGPFVAAEWNFGGVPVWLHYIPNTVFRTDNA 140
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
FK MQ+F IV LM++E LF+ QGGPII+ Q+ENEYG++E YG+ GK Y WAA M
Sbjct: 141 SFKFYMQKFTTYIVSLMKKEKLFASQGGPIILSQVENEYGDIERVYGEGGKPYAMWAAQM 200
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ GVPW+MC+Q DAP+ +I+ CN +YCD + PNS NKP +WTENW GW+ T+G R
Sbjct: 201 AVSQNIGVPWIMCQQYDAPDPVINTCNSFYCDQFTPNSPNKPKMWTENWPGWFKTFGARD 260
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRP ED+AF+VARFFQ+GGS NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGL
Sbjct: 261 PHRPPEDIAFSVARFFQKGGSLQNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLPR 320
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
PKWGHLK+LH AIKL E L+ ++ Y+ LG + EA VY S C+AF+ANID
Sbjct: 321 LPKWGHLKELHRAIKLTERVLLNSEPT-YVSLGPSLEADVYT----DSSGACAAFIANID 375
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
E +V F SY LP WSVSILPDC+N VFNTA + SQT++ + + P P
Sbjct: 376 EKDDKTVQFRNISYHLPAWSVSILPDCKNVVFNTAMIRSQTAM------VEMVPEELQPS 429
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
L + W E G+W + +F +++HLN TKD +DYLW+ T I+V+++
Sbjct: 430 ADATNKDLKAL--KWEVFVEQPGIWGKADFVKNVLVDHLNTTKDTTDYLWYTTSIFVNEN 487
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGH----WVKVVQPVEFQSGYNDLI 581
+ F K ++ P + ++S L FIN +L S G+ K Q + ++G N++
Sbjct: 488 E-KFLKGSQ--PVLVVESKGHALHAFINKKLQVSATGNGSDITFKFKQAISLKAGKNEIA 544
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS TVGLQN G F E GAG +V + GF NG +DLS W+Y++GL+GE IY +
Sbjct: 545 LLSMTVGLQNAGPFYEWVGAGL-SKVVIEGFNNGPVDLSSYAWSYKIGLQGEHLGIYKPD 603
Query: 642 E-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-T 699
+W TWYK D P G +PV LD+ MGKG AW+NG IGRYW T
Sbjct: 604 GIKNVKWLSSREPPKQQPLTWYKVILDPPSGNEPVGLDMVHMGKGLAWLNGEEIGRYWPT 663
Query: 700 VVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPF 759
+ C CDYRG + DKC T CG PTQ WYHVPRSW + S N+LVIFEE GG+P
Sbjct: 664 KSSIHDVCVQKCDYRGKFRPDKCLTGCGEPTQRWYHVPRSWFKPSGNILVIFEEKGGDPT 723
Query: 760 EISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIE 819
+I + R +C + E H P + WS + +V+ K + L C D I+ I+
Sbjct: 724 QIRLSKRKVLGICAHLGEGH-PSIESWSEAENVE-----RKSKATVDLKCPDNGRIAKIK 777
Query: 820 FASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
FAS+GTPQG C +S G+CH P S+S+V +
Sbjct: 778 FASFGTPQGSCGSYSIGDCHDPNSISLVEK 807
>gi|242055159|ref|XP_002456725.1| hypothetical protein SORBIDRAFT_03g041450 [Sorghum bicolor]
gi|241928700|gb|EES01845.1| hypothetical protein SORBIDRAFT_03g041450 [Sorghum bicolor]
Length = 843
Score = 808 bits (2087), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 408/812 (50%), Positives = 530/812 (65%), Gaps = 40/812 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YDHR++II G RR++IS IHYPR+ PEMWP L+A++K+GGAD IETYVFWN HE
Sbjct: 28 NVTYDHRSLIISGRRRLIISTSIHYPRSVPEMWPKLVAEAKDGGADCIETYVFWNGHEIA 87
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
GQY F+ + D+V+FVK+V +GL L LRIGP+V AEWNFGG PVWL +PG FRT+N
Sbjct: 88 PGQYYFEDRFDLVRFVKVVKDAGLLLILRIGPFVAAEWNFGGVPVWLHYVPGTVFRTDNE 147
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM-ESSYGQQGKDYVKWAAS 224
PFK M+ F IV++M++E LF+ QGG II+ QIENEYG+ E +Y GK Y WAAS
Sbjct: 148 PFKSHMKSFTTYIVNMMKKEQLFASQGGNIILAQIENEYGDYYEQAYAPGGKPYAMWAAS 207
Query: 225 MALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGR 284
MA+ GVPW+MC+++DAP+ +I++CNG+YCDG++PNS KP LWTENW GW+ T+G
Sbjct: 208 MAVAQNTGVPWIMCQESDAPDPVINSCNGFYCDGFQPNSPTKPKLWTENWPGWFQTFGES 267
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
PHRP ED+AFAVARFF++GGS NYY+Y GGTNFGRT+GGPF TSYDYDAPIDEYGL
Sbjct: 268 NPHRPPEDVAFAVARFFEKGGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGLR 327
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
PKW HL+DLH +I+LCE L+ ++ ++ LG QEA +Y G C AFLANI
Sbjct: 328 RFPKWAHLRDLHKSIRLCEHTLLYGNTT-FLSLGPKQEADIYSDQSGG----CVAFLANI 382
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
D VTF + Y LP WSVSILPDCRN VFNTAKV SQTS+ + VP
Sbjct: 383 DSANDKVVTFRNRQYDLPAWSVSILPDCRNVVFNTAKVQSQTSMVAM-----------VP 431
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
ES +S + W +E G+W +N+F G ++H+N TKD +DYLW+ T V
Sbjct: 432 -----ESLQASKPERWNIFRERTGIWGKNDFVRNGFVDHINTTKDSTDYLWYTTSFSV-- 484
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYNDL 580
D S+ K + V + IDS + F+N + GS G+ + V P+ ++G N+L
Sbjct: 485 -DESYSKGSHV--VLNIDSKGHGVHAFLNNEFIGSAYGNGSQSSFSVKLPINLRTGKNEL 541
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS TVGLQN G E GAGF V ++G +NG I+LS W Y++GL+GE+ ++
Sbjct: 542 ALLSMTVGLQNAGFSYEWIGAGFT-NVNISGVRNGTINLSSNNWAYKIGLEGEYYSLFKP 600
Query: 641 EE-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
++ N W + TWYK D P G DPV +D+ SMGKG W+NG+ IGRYW
Sbjct: 601 DQRNNQRWIPQSEPPKNQPLTWYKVNVDVPQGDDPVGIDMQSMGKGLVWLNGNAIGRYWP 660
Query: 700 VVAP-KGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
+ C +CDYRG +N +KC T CG PTQ WYH+PRSW S N+LVIFEE GG+P
Sbjct: 661 RTSSIDDRCTPSCDYRGEFNPNKCRTGCGQPTQRWYHIPRSWFHPSGNILVIFEEKGGDP 720
Query: 759 FEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAP-EMHLHCQDGYIISS 817
+I+ R+ VC VSE H+P + + S DG + +P + L C G ISS
Sbjct: 721 TKITFSRRAVTSVCSFVSE-HFPSI----DLESWDGSATNEGTSPAKAQLSCPIGKNISS 775
Query: 818 IEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
++FAS GTP G C+ + +G+CH P SLSVV +
Sbjct: 776 LKFASLGTPSGTCRSYQKGSCHHPNSLSVVEK 807
>gi|359480881|ref|XP_003632537.1| PREDICTED: beta-galactosidase 3-like [Vitis vinifera]
gi|296082595|emb|CBI21600.3| unnamed protein product [Vitis vinifera]
Length = 847
Score = 807 bits (2085), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/811 (50%), Positives = 520/811 (64%), Gaps = 28/811 (3%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YD R++IIDG R++LISA IHYPR+ P MWP L+ +KEGG DVIETYVFWN HE
Sbjct: 22 NVTYDRRSLIIDGQRKLLISASIHYPRSVPGMWPGLVKTAKEGGIDVIETYVFWNGHELS 81
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
Y F G+ D++KFVK+V + +YL LR+GP+V AEWNFGG PVWL +PG FRTN+
Sbjct: 82 PDNYYFGGRYDLLKFVKIVQQARMYLILRVGPFVAAEWNFGGVPVWLHYVPGTVFRTNSE 141
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ+F+ IV++M++E LF+ QGGPII+ Q+ENEYG+ E YG GK Y WAA+M
Sbjct: 142 PFKYHMQKFMTLIVNIMKKEKLFASQGGPIILAQVENEYGDTERIYGDGGKPYAMWAANM 201
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
AL GVPW+MC+Q DAP+ +I+ CN +YCD + PNS NKP +WTENW GW+ T+G
Sbjct: 202 ALSQNIGVPWIMCQQYDAPDPVINTCNSFYCDQFTPNSPNKPKMWTENWPGWFKTFGAPD 261
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRP ED+AF+VARFFQ+GGS NYYMY GGTNFGRTSGGPF TSYDY+APIDEYGL
Sbjct: 262 PHRPHEDIAFSVARFFQKGGSLQNYYMYHGGTNFGRTSGGPFITTSYDYNAPIDEYGLAR 321
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
PKWGHLK+LH AIK CE L+ + + LG +QE VY S C+AF++N+D
Sbjct: 322 LPKWGHLKELHRAIKSCEHVLLYGEPIN-LSLGPSQEVDVYT----DSSGGCAAFISNVD 376
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTS-IKTVEFSLPLSPNISVP 464
E + F SY +P WSVSILPDC+N VFNTAKV SQTS ++ V L
Sbjct: 377 EKEDKIIVFQNVSYHVPAWSVSILPDCKNVVFNTAKVGSQTSQVEMVPEEL--------- 427
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
Q S++ S W T E G+W E +F G ++H+N TKD +DYLW+ + V +
Sbjct: 428 QPSLVPSNKDLKGLQWETFVEKAGIWGEADFVKNGFVDHINTTKDTTDYLWYTVSLTVGE 487
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDL 580
+ +F K +P + ++S L F+N +L GS G+ K P+ ++G ND+
Sbjct: 488 SE-NFLKEIS-QPVLLVESKGHALHAFVNQKLQGSASGNGSHSPFKFECPISLKAGKNDI 545
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS TVGLQN G F E GAG VK+ G NG +DLS WTY++GL+GE IY
Sbjct: 546 ALLSMTVGLQNAGPFYEWVGAGLT-SVKIKGLNNGIMDLSTYTWTYKIGLQGEHLLIYKP 604
Query: 641 EE-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
E N +W TWYK D P G +P+ LD+ MGKG AW+NG IGRYW
Sbjct: 605 EGLNSVKWLSTPEPPKQQPLTWYKAVVDPPSGNEPIGLDMVHMGKGLAWLNGEEIGRYWP 664
Query: 700 VVAP-KGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
+ C CDYRG + +KC+T CG PTQ WYHVPRSW + S N+LVIFEE GG+P
Sbjct: 665 RKSSIHDKCVQECDYRGKFMPNKCSTGCGEPTQRWYHVPRSWFKPSGNILVIFEEKGGDP 724
Query: 759 FEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSI 818
+I R T VC VSE H P + S+ D + NK +HL C + ISS+
Sbjct: 725 TKIRFSRRKTTGVCALVSEDH--PTYELE-SWHKDANEN-NKNKATIHLKCPENTHISSV 780
Query: 819 EFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+FASYGTP G+C +S+G+CH P S SVV +
Sbjct: 781 KFASYGTPTGKCGSYSQGDCHDPNSASVVEK 811
>gi|255546099|ref|XP_002514109.1| beta-galactosidase, putative [Ricinus communis]
gi|223546565|gb|EEF48063.1| beta-galactosidase, putative [Ricinus communis]
Length = 827
Score = 806 bits (2083), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/798 (51%), Positives = 527/798 (66%), Gaps = 44/798 (5%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V YDH+AI I+ RR+LIS IHYPR+TPEMWP LI K+KEGG +VI+TYVFWN HE
Sbjct: 25 VWYDHKAITINNQRRILISGSIHYPRSTPEMWPGLIQKAKEGGIEVIQTYVFWNGHEPSP 84
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY F+ + D+VKF+KLV +GLY+ LRIGPYVCAEWNFGGFP+WL+ +PGIEFRT+N P
Sbjct: 85 GQYYFQDRYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPMWLKYVPGIEFRTDNGP 144
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ+FV IV++M+E+ LF QGGPII+ QIENEYG +E + G GK Y KWAA+MA
Sbjct: 145 FKAAMQKFVTLIVNMMKEQKLFQTQGGPIILSQIENEYGPVEWTIGAPGKAYTKWAAAMA 204
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
GL GVPW+MCKQ DAP+ ID CNG+YC+GYKPN+YNKP +WTENW GWYT WG +P
Sbjct: 205 TGLNTGVPWIMCKQEDAPDPTIDTCNGFYCEGYKPNNYNKPKVWTENWTGWYTEWGASVP 264
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
+RP ED AF+VARF GSF+NYYMY GGTNF RT+ G F TSYDYDAP+DEYGL +
Sbjct: 265 YRPPEDTAFSVARFIAASGSFVNYYMYHGGTNFDRTA-GLFMATSYDYDAPLDEYGLTHD 323
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHL+DLH AIK E ALV+AD I LG+NQEAHV++ S+ C+AFLAN D
Sbjct: 324 PKWGHLRDLHRAIKQSERALVSADPT-VISLGKNQEAHVFQ-----SKMGCAAFLANYDT 377
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+A V F + Y+LP WS+S+LPDC+ V+NTAK+S+Q+ T ++ +P++ S Q
Sbjct: 378 QYSARVNFWNKPYSLPRWSISVLPDCKTVVYNTAKISAQS---TQKWMMPVASGFS--WQ 432
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
S I+ + P+G +S FT G+ E +T D +DYLW++T + ++ ++
Sbjct: 433 SHID-------------EVPVG-YSAGTFTKVGLWEQKYLTGDKTDYLWYMTDVTINSNE 478
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLIL 582
F ++ + P +T+ S VL VFING L GS G + Q V+ G N + L
Sbjct: 479 -GFLRSGK-NPFLTVASAGHVLHVFINGHLAGSAYGSLENPKLTFSQNVKLVGGVNKIAL 536
Query: 583 LSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEE 642
LS TVGL N G + G G V L G G +D++K W+Y++GLKGE +++S
Sbjct: 537 LSATVGLANVGVHYDTWNVGVLGPVTLQGLNQGTLDMTKWKWSYKIGLKGEDLKLFSGGA 596
Query: 643 NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVA 702
N W + + TWYKT+ +AP G DPVAL +GSMGKGQ ++NG IGR+W
Sbjct: 597 N-VGWAQGAQLAKKTPLTWYKTFINAPPGNDPVALYMGSMGKGQMYINGRSIGRHWPAYT 655
Query: 703 PKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEIS 762
KG C+D CDY G Y+ KC + CG P Q WYHVPRSWL+ + NLLV+FEE GG+P IS
Sbjct: 656 AKGNCKD-CDYAGYYDDQKCRSGCGQPPQQWYHVPRSWLKPTGNLLVVFEEMGGDPTGIS 714
Query: 763 VKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFAS 822
+ R VC + + P ++ W+ + V P+ HL C G S I FAS
Sbjct: 715 LVKRVVGSVCADIDDDQ-PEMKSWTENIPV---------TPKAHLWCPPGQKFSKIVFAS 764
Query: 823 YGTPQGRCQKFSRGNCHA 840
YG PQGRC + +G CHA
Sbjct: 765 YGWPQGRCGAYRQGKCHA 782
>gi|350537913|ref|NP_001234317.1| TBG6 protein precursor [Solanum lycopersicum]
gi|7939625|gb|AAF70825.1|AF154424_1 putative beta-galactosidase [Solanum lycopersicum]
Length = 845
Score = 806 bits (2083), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/814 (50%), Positives = 528/814 (64%), Gaps = 45/814 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YD +AI+I+G RR+L S IHYPR+TPEMW DLI K+KEGG DV+ETYVFWN HE
Sbjct: 27 DVTYDRKAIVINGQRRLLFSGSIHYPRSTPEMWEDLINKAKEGGLDVVETYVFWNVHEPS 86
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G YNF+G+ D+V+FVK + +GLY LRIGPYVCAEWNFGGFPVWL+ +PGI FR +N
Sbjct: 87 PGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRADNE 146
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK M+ + +KIV+LM+ LF QGGPII+ QIENEYG G G Y WAA+M
Sbjct: 147 PFKNAMKGYAEKIVNLMKSHNLFESQGGPIILSQIENEYGPQAKVLGAPGHQYSTWAANM 206
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL GVPWVMCK+ DAP+ +I+ CNG+YCD + PN KP +WTE W GW++ +GG L
Sbjct: 207 AVGLDTGVPWVMCKEEDAPDPVINTCNGFYCDNFFPNKPYKPAIWTEAWSGWFSEFGGPL 266
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
RPV+DLAFAVA+F QRGGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGL+
Sbjct: 267 HQRPVQDLAFAVAQFIQRGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIR 326
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PK+GHLK+LH A+K+CE ++V+AD A LG Q+A+VY + G C+AFL+N D
Sbjct: 327 QPKYGHLKELHRAVKMCEKSIVSADPA-ITSLGNLQQAYVYSSETGG----CAAFLSNND 381
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+AA V F Y LPPWS+SILPDCRN VFNTAKV QTS +
Sbjct: 382 WKSAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTS-----------------K 424
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQ-GILEHLNVTKDYSDYLWHITQIYVSD 524
M+ + +S SW T E I +++ G+LE +NVT+D SDYLW+IT + +
Sbjct: 425 MEMLPT--NSEMLSWETYSEDISALDDSSSIRSFGLLEQINVTRDTSDYLWYITSVDIGS 482
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDL 580
+ SF E+ PT+ +++ + VFINGQL+GS G V ++G N +
Sbjct: 483 TE-SFLHGGEL-PTLIVETTGHAMHVFINGQLSGSAFGTRKNRRFVFKGKVNLRAGSNRI 540
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS VGL N G E G G V + G +G DLS WTYQVGLKGE + S
Sbjct: 541 ALLSVAVGLPNIGGHFETWSTGVLGPVAIQGLDHGKWDLSWAKWTYQVGLKGEAMNLVST 600
Query: 641 EENEA-EWTD---LTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGR 696
A +W + + P TW+K YF+ P+G +P+ALD+ SMGKGQ W+NG IGR
Sbjct: 601 NGISAVDWMQGSLIAQKQQP--LTWHKAYFNTPEGDEPLALDMSSMGKGQVWINGQSIGR 658
Query: 697 YWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG 756
YWT A G C + C Y G + KC CG PTQ WYHVPRSWL+ + NLLV+FEE GG
Sbjct: 659 YWTAYA-TGDC-NGCQYSGVFRPPKCQLGCGEPTQKWYHVPRSWLKPTQNLLVLFEELGG 716
Query: 757 NPFEISVKLRSTRIVCEQVSESHYPPVRKWS-NSYSVDGKLSINKMAPEMHLHCQDGYII 815
+P IS+ RS VC V+E H P ++ W +Y + + P++ +HC G I
Sbjct: 717 DPTRISLVKRSVTNVCSNVAEYH-PNIKNWQIENYGKTEEFHL----PKVRIHCAPGQSI 771
Query: 816 SSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
SSI+FAS+GTP G C F +G CHAP S +VV +
Sbjct: 772 SSIKFASFGTPLGTCGSFKQGTCHAPDSHAVVEK 805
>gi|414879448|tpg|DAA56579.1| TPA: beta-galactosidase isoform 1 [Zea mays]
gi|414879449|tpg|DAA56580.1| TPA: beta-galactosidase isoform 2 [Zea mays]
Length = 844
Score = 806 bits (2081), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/813 (49%), Positives = 522/813 (64%), Gaps = 41/813 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YDHR++II G RR++IS IHYPR+ PEMWP L+A++K+GGAD IETYVFWN HE
Sbjct: 28 NVTYDHRSLIISGRRRLVISTSIHYPRSVPEMWPKLVAEAKDGGADCIETYVFWNGHEIA 87
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
GQY F+ + D+V+FVK+V +GL L LRIGPYV AEWN+GG PVWL +PG FRTNN
Sbjct: 88 PGQYYFEDRFDLVRFVKVVRDAGLLLILRIGPYVAAEWNYGGVPVWLHYVPGTVFRTNNE 147
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM-ESSYGQQGKDYVKWAAS 224
PFK ++ F IVD+M++E LF+ QGG II+ QIENEYG+ E +YG GK Y WAAS
Sbjct: 148 PFKNHVKSFTTYIVDMMKKEQLFASQGGNIILAQIENEYGDYYEQAYGAGGKPYAMWAAS 207
Query: 225 MALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGR 284
MAL GVPW+MC+++DAP+ +I++CNG+YCDG++PNS KP +WTENW GW+ T+G
Sbjct: 208 MALAQNTGVPWIMCQESDAPDPVINSCNGFYCDGFQPNSPTKPKIWTENWPGWFQTFGES 267
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
PHRP ED+AFAVARFF++GGS NYY+Y GGTNFGRT+GGPF TSYDYDAPIDEYGL
Sbjct: 268 NPHRPPEDVAFAVARFFEKGGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGLR 327
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
PKW HL+DLH +I+LCE L+ ++ ++ LG QEA +Y G C AFLANI
Sbjct: 328 RFPKWAHLRDLHKSIRLCEHTLLYGNTT-FLSLGPKQEADIYSDQSGG----CVAFLANI 382
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
D VTF + Y LP WSVSILPDCRN VFNTAKV SQTS+ T+ VP
Sbjct: 383 DSANDKVVTFRNRQYDLPAWSVSILPDCRNVVFNTAKVQSQTSMVTM-----------VP 431
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
ES +S + W +E G+W +N+F G ++H+N TKD +DYLW+ T V
Sbjct: 432 -----ESLQASKPERWSIFRERTGIWGKNDFVRNGFVDHINTTKDSTDYLWYTTSFSVDG 486
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYNDL 580
+ + + IDS + F+N L GS G+ + V P+ ++G N+L
Sbjct: 487 S----YSSKGSHAVLNIDSNGHGVHAFLNNVLIGSAYGNGSQSRFSVKLPINLRTGKNEL 542
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS TVGLQN G E GAGF V ++G + G IDLS W Y++GL+GE+ ++
Sbjct: 543 ALLSMTVGLQNAGFAYEWIGAGFT-NVNISGVRTGTIDLSSNNWAYKIGLEGEYYNLFKP 601
Query: 641 EE-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
++ N W + TWYK D P G DPV +D+ SMGKG AW+NG+ IGRYW
Sbjct: 602 DQTNNQRWIPQSEPPKNQPLTWYKVNVDVPQGDDPVGIDMQSMGKGLAWLNGNAIGRYWP 661
Query: 700 VVAP-KGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
+ C +C+YRG + DKC T CG PTQ WYH+PRSW S N+LV+FEE GG+P
Sbjct: 662 RTSSINDRCTPSCNYRGTFIPDKCRTGCGQPTQRWYHIPRSWFHPSGNILVVFEEKGGDP 721
Query: 759 FEISVKLRSTRIVCEQVSESHYPPV--RKWSNSYSVDGKLSINKMAPEMHLHCQDGYIIS 816
+I+ R+ VC VSE H+P + W S +G + L C +G IS
Sbjct: 722 TKITFSRRAVTSVCSFVSE-HFPSIDLESWDESAMTEG-----TPPAKAQLFCPEGKSIS 775
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
S++FAS G P G C+ + G CH P SLSVV +
Sbjct: 776 SVKFASLGNPSGTCRSYQMGRCHHPNSLSVVEK 808
>gi|225458151|ref|XP_002280715.1| PREDICTED: beta-galactosidase 3 [Vitis vinifera]
gi|302142564|emb|CBI19767.3| unnamed protein product [Vitis vinifera]
Length = 854
Score = 805 bits (2079), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/812 (50%), Positives = 520/812 (64%), Gaps = 41/812 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YD +AI+I+G RR+LIS IHYPR+TP+MW DLI K+K+GG DVI+TY+FWN HE
Sbjct: 28 SVTYDKKAIVINGQRRILISGSIHYPRSTPDMWEDLIRKAKDGGLDVIDTYIFWNVHEPS 87
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G YNF+G+ D+V+F+K V GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRTNN
Sbjct: 88 PGNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGFPVWLKFVPGISFRTNNE 147
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ F +KIV +M+ E LF+ QGGPII+ QIENEYG G G Y+ WAA M
Sbjct: 148 PFKMAMQGFTQKIVHMMKSENLFASQGGPIILSQIENEYGPESRELGAAGHAYINWAAKM 207
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL GVPWVMCK+ DAP+ +I+ACNG+YCD + PN KP +WTE W GW+T +GG +
Sbjct: 208 AVGLDTGVPWVMCKEDDAPDPVINACNGFYCDAFSPNKPYKPRIWTEAWSGWFTEFGGTI 267
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
RPV+DLAF VARF Q GGSF+NYYMY GGTNFGR++GGPF TSYDYDAPIDEYGL+
Sbjct: 268 HRRPVQDLAFGVARFIQNGGSFVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGLIR 327
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PK+GHLK+LH AIKLCE A+V+AD I LG Q+AHV+ + R NC+AFL+N +
Sbjct: 328 QPKYGHLKELHKAIKLCEHAVVSADPT-VISLGSYQQAHVFSSGR----GNCAAFLSNYN 382
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
++A V F Y LP WS+SILPDCR VFNTA+V QTS + + P
Sbjct: 383 PKSSARVIFNNVHYDLPAWSISILPDCRTVVFNTARVGVQTS------HMRMFPT----- 431
Query: 466 QSMIESKLSSTSKSWMTVKEPI-GVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
SKL SW T E I + S T G+LE +N+T+D +DYLW++T + + D
Sbjct: 432 ----NSKL----HSWETYGEDISSLGSSGTMTAGGLLEQINITRDSTDYLWYMTSVNI-D 482
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDL 580
SF + + PT+T+ S + VFINGQ +GS G +G N +
Sbjct: 483 SSESFLRRGQT-PTLTVQSKGHAVHVFINGQYSGSAYGTRENRKFTYTGAANLHAGTNRI 541
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS VGL N G E G G V L G G DLS W+YQVGLKGE + S
Sbjct: 542 ALLSIAVGLPNVGLHFETWKTGILGPVLLHGIDQGKRDLSWQKWSYQVGLKGEAMNLVSP 601
Query: 641 EENEA-EWT--DLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
A EW L G WYK YF+AP+G +P+ALD+ SMGKGQ W+NG IGRY
Sbjct: 602 NGVSAVEWVRGSLAAQG-QQPLKWYKAYFNAPEGDEPLALDMRSMGKGQVWINGQSIGRY 660
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
W A KG C + C Y G Y KC CG+PTQ WYHVPRSWL+ + NLL+IFEE GG+
Sbjct: 661 WMAYA-KGDC-NVCSYSGTYRPPKCQHGCGHPTQRWYHVPRSWLKPTQNLLIIFEELGGD 718
Query: 758 PFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISS 817
+I++ R+ + VC +E H+P + W + S +HL C G IS+
Sbjct: 719 ASKIALMKRAMKSVCADANE-HHPTLENWHTESPSE---SEELHEASVHLQCAPGQSIST 774
Query: 818 IEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
I FAS+GTP G C F +G CHAP S +++ +
Sbjct: 775 IMFASFGTPSGTCGSFQKGTCHAPNSQAILEK 806
>gi|15081596|gb|AAK81874.1| putative beta-galactosidase BG1 [Vitis vinifera]
Length = 854
Score = 805 bits (2079), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/812 (50%), Positives = 520/812 (64%), Gaps = 41/812 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YD +AI+I+G RR+LIS IHYPR+TP+MW DLI K+K+GG DVI+TY+FWN HE
Sbjct: 28 SVTYDKKAIVINGQRRILISGSIHYPRSTPDMWEDLIRKAKDGGLDVIDTYIFWNVHEPS 87
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G YNF+G+ D+V+F+K V GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRTNN
Sbjct: 88 PGNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGFPVWLKFVPGISFRTNNE 147
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ F +KIV +M+ E LF+ QGGPII+ QIENEYG G G Y+ WAA M
Sbjct: 148 PFKMAMQGFTQKIVHMMKSENLFASQGGPIILSQIENEYGPESRELGAAGHAYINWAAKM 207
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL GVPWVMCK+ DAP+ +I+ACNG+YCD + PN KP +WTE W GW+T +GG +
Sbjct: 208 AVGLDTGVPWVMCKEDDAPDPVINACNGFYCDAFSPNKPYKPRIWTEAWSGWFTEFGGTI 267
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
RPV+DLAF VARF Q GGSF+NYYMY GGTNFGR++GGPF TSYDYDAPIDEYGL+
Sbjct: 268 HRRPVQDLAFGVARFIQNGGSFVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGLIR 327
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PK+GHLK+LH AIKLCE A+V+AD I LG Q+AHV+ + R NC+AFL+N +
Sbjct: 328 QPKYGHLKELHKAIKLCEHAVVSADPT-VISLGSYQQAHVFSSGR----GNCAAFLSNYN 382
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
++A V F Y LP WS+SILPDCR VFNTA+V QTS + + P
Sbjct: 383 PKSSARVIFNNVHYDLPAWSISILPDCRTVVFNTARVGVQTS------HMRMFPT----- 431
Query: 466 QSMIESKLSSTSKSWMTVKEPI-GVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
SKL SW T E I + S T G+LE +N+T+D +DYLW++T + + D
Sbjct: 432 ----NSKL----HSWETYGEDISSLGSSGTMTAGGLLEQINITRDSTDYLWYMTSVNI-D 482
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDL 580
SF + + PT+T+ S + VFINGQ +GS G +G N +
Sbjct: 483 SSESFLRRGQT-PTLTVQSKGHAVHVFINGQYSGSAYGTRENRKFTYTGAANLHAGTNRI 541
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS VGL N G E G G V L G G DLS W+YQVGLKGE + S
Sbjct: 542 ALLSIAVGLPNVGLHFETWKTGILGPVLLHGIDQGKRDLSWQKWSYQVGLKGEAMNLVSP 601
Query: 641 EENEA-EWT--DLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
A EW L G WYK YF+AP+G +P+ALD+ SMGKGQ W+NG IGRY
Sbjct: 602 NGVSAVEWVRGSLAAQG-QQPLKWYKAYFNAPEGDEPLALDMRSMGKGQVWINGQSIGRY 660
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
W A KG C + C Y G Y KC CG+PTQ WYHVPRSWL+ + NLL+IFEE GG+
Sbjct: 661 WMAYA-KGDC-NVCSYSGTYRPPKCQHGCGHPTQRWYHVPRSWLKPTQNLLIIFEELGGD 718
Query: 758 PFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISS 817
+I++ R+ + VC +E H+P + W + S +HL C G IS+
Sbjct: 719 ASKIALMKRAMKSVCADANE-HHPTLENWHTESPSE---SEELHQASVHLQCAPGQSIST 774
Query: 818 IEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
I FAS+GTP G C F +G CHAP S +++ +
Sbjct: 775 IMFASFGTPSGTCGSFQKGTCHAPNSQAILEK 806
>gi|147818153|emb|CAN78072.1| hypothetical protein VITISV_013292 [Vitis vinifera]
Length = 854
Score = 805 bits (2079), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/812 (50%), Positives = 517/812 (63%), Gaps = 41/812 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YD +AI+I+G RR+LIS IHYPR+TP+MW DLI K+K+GG DVI+TY+FWN HE
Sbjct: 28 SVTYDKKAIVINGQRRILISGSIHYPRSTPDMWEDLIRKAKDGGLDVIDTYIFWNVHEPS 87
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G YNF+G+ D+V+F+K V GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRTNN
Sbjct: 88 PGNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGFPVWLKFVPGISFRTNNE 147
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ F +KIV +M+ E LF+ QGGPII+ QIENEYG G G Y+ WAA M
Sbjct: 148 PFKMAMQGFTQKIVHMMKSENLFASQGGPIILSQIENEYGPESRELGAAGHAYINWAAKM 207
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL GVPWVMCK+ DAP+ +I+ACNG+YCD + PN KP +WTE W GW+T +GG +
Sbjct: 208 AVGLDTGVPWVMCKEDDAPDPVINACNGFYCDAFSPNKPYKPRIWTEAWSGWFTEFGGTI 267
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
RPV+DLAF VARF Q GGSF+NYYMY GGTNFGR++GGPF TSYDYDAPIDEYGL+
Sbjct: 268 HRRPVQDLAFGVARFIQNGGSFVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGLIR 327
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PK+GHLK+LH AIKLCE A+V+AD I LG Q+AHV+ + R NC+AFL+N +
Sbjct: 328 QPKYGHLKELHKAIKLCEHAVVSADPT-VISLGSYQQAHVFSSGR----GNCAAFLSNYN 382
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
++A V F Y LP WS+SILPDCR VFNTA+V QTS
Sbjct: 383 PKSSARVIFNNVHYDLPAWSISILPDCRTVVFNTARVGVQTS------------------ 424
Query: 466 QSMIESKLSSTSKSWMTVKEPI-GVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
M +S SW T E I + S T G+LE +N+T+D +DYLW++T + + D
Sbjct: 425 -HMRMFPTNSKLHSWETYGEDISSLGSSGTMTAGGLLEQINITRDSTDYLWYMTSVNI-D 482
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDL 580
SF + + PT+T+ S + VFINGQ +GS G +G N +
Sbjct: 483 SSESFLRRGQT-PTLTVQSKGHAVHVFINGQYSGSAYGTRENRKFTYTGAANLHAGTNRI 541
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS VGL N G E G G V L G G DLS W+YQVGLKGE + S
Sbjct: 542 ALLSIAVGLPNVGLHFETWKTGILGPVLLHGIDQGKRDLSWQKWSYQVGLKGEAMNLVSP 601
Query: 641 EENEA-EWT--DLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
A EW L G WYK YF+AP+G +P+ALD+ SMGKGQ W+NG IGRY
Sbjct: 602 NGVSAVEWVRGSLAAQG-QQPLKWYKAYFNAPEGDEPLALDMRSMGKGQVWINGQSIGRY 660
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
W A KG C + C Y G Y KC CG+PTQ WYHVPRSWL+ + NLL+IFEE GG+
Sbjct: 661 WMAYA-KGDC-NVCSYSGTYRPPKCQHGCGHPTQRWYHVPRSWLKPTQNLLIIFEELGGD 718
Query: 758 PFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISS 817
+I++ R+ + VC +E H+P + W + S +HL C G IS+
Sbjct: 719 ASKIALMKRAMKSVCADANE-HHPTLENWHTESPSE---SEELHZASVHLQCAPGQSIST 774
Query: 818 IEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
I FAS+GTP G C F +G CHAP S +++ +
Sbjct: 775 IMFASFGTPSGTCGSFQKGTCHAPNSQAILEK 806
>gi|308550954|gb|ADO34791.1| beta-galactosidase STBG6 [Solanum lycopersicum]
Length = 845
Score = 805 bits (2078), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/814 (50%), Positives = 526/814 (64%), Gaps = 45/814 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YD AI+I+G RR+L S IHYPR+TPEMW DLI K+KEGG DV+ETYVFWN HE
Sbjct: 27 DVTYDREAIVINGQRRLLFSGSIHYPRSTPEMWEDLINKAKEGGLDVVETYVFWNVHEPS 86
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G YNF+G+ D+V+FVK + +GLY LRIGPYVCAEWNFGGFPVWL+ +PGI FR +N
Sbjct: 87 PGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRADNE 146
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK M+ + +KIV+LM+ LF QGGPII+ QIENEYG G G Y WAA+M
Sbjct: 147 PFKNAMKGYAEKIVNLMKSHNLFESQGGPIILSQIENEYGPQAKVLGAPGHQYSTWAANM 206
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL GVPWVMCK+ DAP+ +I+ CNG+YCD + PN KP WTE W GW++ +GG L
Sbjct: 207 AVGLDTGVPWVMCKEEDAPDPVINTCNGFYCDNFFPNKPYKPATWTEAWSGWFSEFGGPL 266
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
RPV+DLAFAVA+F QRGGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGL+
Sbjct: 267 HQRPVQDLAFAVAQFIQRGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIR 326
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PK+GHLK+LH A+K+CE ++V+AD A LG Q+A+VY + G C+AFL+N D
Sbjct: 327 QPKYGHLKELHRAVKMCEKSIVSADPA-ITSLGNLQQAYVYSSETGG----CAAFLSNND 381
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+AA V F Y LPPWS+SILPDCRN VFNTAKV QTS +
Sbjct: 382 WKSAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTS-----------------K 424
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQ-GILEHLNVTKDYSDYLWHITQIYVSD 524
M+ + +S SW T E I +++ G+LE +NVT+D SDYLW+IT + +
Sbjct: 425 MEMLPT--NSEMLSWETYSEDISALDDSSSIRSFGLLEQINVTRDTSDYLWYITSVDIGS 482
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDL 580
+ SF E+ PT+ +++ + VFINGQL+GS G V ++G N +
Sbjct: 483 TE-SFLHGGEL-PTLIVETTGHAMHVFINGQLSGSAFGTRKNRRFVFKGKVNLRAGSNRI 540
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS VGL N G E G G V + G +G DLS WTYQVGLKGE + S
Sbjct: 541 ALLSVAVGLPNIGGHFETWSTGVLGPVAIQGLDHGKWDLSWAKWTYQVGLKGEAMNLVST 600
Query: 641 EENEA-EWTD---LTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGR 696
A +W + + P TW+K YF+ P+G +P+ALD+ SMGKGQ W+NG IGR
Sbjct: 601 NGISAVDWMQGSLIAQKQQP--LTWHKAYFNTPEGDEPLALDMSSMGKGQVWINGQSIGR 658
Query: 697 YWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG 756
YWT A G C + C Y G + KC CG PTQ WYHVPRSWL+ + NLLV+FEE GG
Sbjct: 659 YWTAYA-TGDC-NGCQYSGVFRPPKCQLGCGEPTQKWYHVPRSWLKPTQNLLVLFEELGG 716
Query: 757 NPFEISVKLRSTRIVCEQVSESHYPPVRKWS-NSYSVDGKLSINKMAPEMHLHCQDGYII 815
+P IS+ RS VC V+E H P ++ W +Y + + P++ +HC G I
Sbjct: 717 DPTRISLVKRSVTNVCSNVAEYH-PNIKNWQIENYGKTEEFHL----PKVRIHCAPGQSI 771
Query: 816 SSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
SSI+FAS+GTP G C F +G CHAP S +VV +
Sbjct: 772 SSIKFASFGTPLGTCGSFKQGTCHAPDSHAVVEK 805
>gi|356518796|ref|XP_003528063.1| PREDICTED: beta-galactosidase 10-like [Glycine max]
Length = 898
Score = 804 bits (2077), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/811 (49%), Positives = 516/811 (63%), Gaps = 31/811 (3%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NVSYD R++IID R++LISA IHYPR+ P MWP L+ +KEGG DVIETYVFWN HE
Sbjct: 76 NVSYDGRSLIIDAQRKLLISASIHYPRSVPAMWPGLVQTAKEGGVDVIETYVFWNGHELS 135
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G Y F G+ D+VKF + V +G+YL LRIGP+V AEWNFGG PVWL +PG FRT N
Sbjct: 136 PGNYYFGGRFDLVKFAQTVQQAGMYLILRIGPFVAAEWNFGGVPVWLHYVPGTVFRTYNQ 195
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PF MQ+F IV+LM++E LF+ QGGPII+ QIENEYG E+ Y + GK Y WAA M
Sbjct: 196 PFMYHMQKFTTYIVNLMKQEKLFASQGGPIILAQIENEYGYYENFYKEDGKKYALWAAKM 255
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ GVPW+MC+Q DAP+ +ID CN +YCD + P S N+P +WTENW GW+ T+GGR
Sbjct: 256 AVSQNTGVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPNRPKIWTENWPGWFKTFGGRD 315
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRP ED+AF+VARFFQ+GGS NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL
Sbjct: 316 PHRPAEDVAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYDAPVDEYGLPR 375
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
PKWGHLK+LH AIKLCE L+ S I LG + EA VY S C+AF++N+D
Sbjct: 376 LPKWGHLKELHRAIKLCEHVLLNGKSVN-ISLGPSVEADVYT----DSSGACAAFISNVD 430
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ +V F S+ LP WSVSILPDC+N VFNTAKV+SQTS+ +++
Sbjct: 431 DKNDKTVEFRNASFHLPAWSVSILPDCKNVVFNTAKVTSQTSV------------VAMVP 478
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
+S+ +S S W VKE G+W + +F G ++ +N TKD +DYLWH T I+VS++
Sbjct: 479 ESLQQSDKVVNSFKWDIVKEKPGIWGKADFVKNGFVDLINTTKDTTDYLWHTTSIFVSEN 538
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKV----VQPVEFQSGYNDLI 581
+ K N +P + I+S L F+N + G+ G+ P+ ++G N++
Sbjct: 539 EEFLKKGN--KPVLLIESTGHALHAFVNQEYEGTGSGNGTHAPFTFKNPISLRAGKNEIA 596
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LL TVGLQ G F + GAG VK+ G NG IDLS WTY++G++GE+ ++Y
Sbjct: 597 LLCLTVGLQTAGPFYDFVGAGLT-SVKIKGLNNGTIDLSSYAWTYKIGVQGEYLRLYQGN 655
Query: 642 E-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
N WT + TWYK DAP G +PV LD+ MGKG AW+NG IGRYW
Sbjct: 656 GLNNVNWTSTSEPPKMQPLTWYKAIVDAPPGDEPVGLDMLHMGKGLAWLNGEEIGRYWPR 715
Query: 701 VA--PKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
+ C CDYRG +N DKC T CG PTQ WYHVPRSW + S N+LV+FEE GG+P
Sbjct: 716 KSEFKSEDCVKECDYRGKFNPDKCDTGCGEPTQRWYHVPRSWFKPSGNILVLFEEKGGDP 775
Query: 759 FEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSI 818
+I R C V+E YP V S + K+ NK P L C IS++
Sbjct: 776 EKIKFVRRKVSGACALVAED-YPSVALVSQG---EDKIQSNKNIPFARLACPGNTRISAV 831
Query: 819 EFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+FAS+G+P G C + +G+CH P S ++V +
Sbjct: 832 KFASFGSPSGTCGSYLKGDCHDPNSSTIVEK 862
>gi|226494417|ref|NP_001151478.1| LOC100285111 precursor [Zea mays]
gi|195647054|gb|ACG42995.1| beta-galactosidase precursor [Zea mays]
Length = 844
Score = 803 bits (2073), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/815 (49%), Positives = 524/815 (64%), Gaps = 45/815 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YDHR++II G RR++IS IHYPR+ PEMWP L+A++K+GGAD IETYVFWN HE
Sbjct: 28 NVTYDHRSLIISGRRRLVISTSIHYPRSVPEMWPKLVAEAKDGGADCIETYVFWNGHEIA 87
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
GQY F+ + D+V+FVK+V +GL L LRIGPYV AEWN+GG PVWL +PG FRTNN
Sbjct: 88 PGQYYFEDRFDLVRFVKVVRDAGLLLILRIGPYVAAEWNYGGVPVWLHYVPGTVFRTNNE 147
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM-ESSYGQQGKDYVKWAAS 224
PFK M+ F IVD+M++E LF+ QGG II+ QIENEYG+ E +YG GK Y WAAS
Sbjct: 148 PFKNHMKSFTTYIVDMMKKEQLFASQGGNIILAQIENEYGDYYEQAYGAGGKPYAMWAAS 207
Query: 225 MALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGR 284
MAL GVPW+MC+++DAP+ +I++CNG+YCDG++PNS KP +WTENW GW+ T+G
Sbjct: 208 MALAQNTGVPWIMCQESDAPDPVINSCNGFYCDGFQPNSPTKPKIWTENWPGWFQTFGES 267
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
PHRP ED+AFAVARFF++GGS NYY+Y GGTNFGRT+GGPF TSYDYDAPIDEYGL
Sbjct: 268 NPHRPPEDVAFAVARFFEKGGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGLR 327
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
PKW HL++LH +I+LCE L+ ++ ++ LG QEA +Y G C AFLANI
Sbjct: 328 RFPKWAHLRELHKSIRLCEHTLLYGNTT-FLSLGPKQEADIYSDQSGG----CVAFLANI 382
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
D VTF + Y LP WSVSILPDCRN VFNTAKV SQTS+ T+ VP
Sbjct: 383 DSANDKVVTFRNRQYDLPAWSVSILPDCRNVVFNTAKVQSQTSMVTM-----------VP 431
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
ES +S + W +E G+W +N+F G ++H+N TKD +DYLW+ T V
Sbjct: 432 -----ESLQASKPERWSIFRERTGIWGKNDFVRNGFVDHINTTKDSTDYLWYTTSFSVDG 486
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYNDL 580
+ + + IDS + F+N L GS G+ + V + ++G N+L
Sbjct: 487 S----YSSKGSHAVLNIDSNGHGVHAFLNNVLIGSAYGNGSQSRFSVKLTINLRTGKNEL 542
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS TVGLQN G E GAGF V ++G + G IDLS W Y++GL+GE+ ++
Sbjct: 543 ALLSMTVGLQNAGFAYEWIGAGFT-NVNISGVRTGIIDLSSNNWAYKIGLEGEYYNLFKP 601
Query: 641 EE-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
++ N W + TWYK D P G DPV +D+ SMGKG AW+NG+ IGRYW
Sbjct: 602 DQTNNQRWIPQSEPPKNQPLTWYKVNVDVPQGDDPVGIDMQSMGKGLAWLNGNAIGRYWP 661
Query: 700 VVAP-KGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
+ C +C+YRG + DKC T CG PTQ WYH+PRSW S N+LV+FEE GG+P
Sbjct: 662 RTSSINDRCTPSCNYRGTFIPDKCRTGCGQPTQRWYHIPRSWFHPSGNILVVFEEKGGDP 721
Query: 759 FEISVKLRSTRIVCEQVSESHYPPV--RKWSNSYSVDGKLSINKMAP--EMHLHCQDGYI 814
+I+ R+ VC VSE H+P + W S ++N+ P + L C +G
Sbjct: 722 TKITFSRRAVTSVCSFVSE-HFPSIDLESWDES-------AMNEGTPPAKAQLSCPEGKS 773
Query: 815 ISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
ISS++FAS G P G C+ + G CH P SLSVV +
Sbjct: 774 ISSVKFASLGNPSGTCRSYQMGRCHHPNSLSVVEK 808
>gi|449460229|ref|XP_004147848.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
gi|449476862|ref|XP_004154857.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 844
Score = 802 bits (2072), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/841 (48%), Positives = 535/841 (63%), Gaps = 52/841 (6%)
Query: 16 LSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATP 75
SV + + + L S+ +T V+YD +AI+I+G RR+LIS IHYPR+TP
Sbjct: 4 FSVSSFLFFVFLAALLGFRSTQCTT------VTYDKKAILINGQRRILISGSIHYPRSTP 57
Query: 76 EMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRI 135
EMW DL+ K+K+GG DV++TYVFWN HE G Y+F+G+ D+V+F+K GLY+ LRI
Sbjct: 58 EMWDDLMQKAKDGGLDVVDTYVFWNVHEPSPGNYDFEGRYDLVRFIKTAQRVGLYVHLRI 117
Query: 136 GPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPI 195
GPYVCAEWNFGGFPVWL+ +PGI FRT+N PFK MQ F +KIV +M+ E LF+ QGGPI
Sbjct: 118 GPYVCAEWNFGGFPVWLKYVPGISFRTDNGPFKMAMQGFTQKIVQMMKSEKLFASQGGPI 177
Query: 196 IMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYY 255
I+ QIENEYG + G G Y+ WAA MA+GL GVPWVMCK+ DAP+ +I++CNG+Y
Sbjct: 178 ILSQIENEYGPQSKALGAAGHAYMNWAAKMAVGLNTGVPWVMCKEDDAPDPVINSCNGFY 237
Query: 256 CDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFG 315
CD + PN KPTLWTE W GW+T +GG + RPV+DLAFAVARF Q+GGS NYYMY G
Sbjct: 238 CDYFSPNKPYKPTLWTEAWSGWFTEFGGPVYGRPVQDLAFAVARFVQKGGSLFNYYMYHG 297
Query: 316 GTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYI 375
GTNFGRT+GGPF TSYDYDAP+DEYG+L +PK+GHLK+LH AIKLCE ALV++D
Sbjct: 298 GTNFGRTAGGPFITTSYDYDAPLDEYGMLRQPKYGHLKNLHRAIKLCEHALVSSDPT-VT 356
Query: 376 KLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNT 435
LG ++AHV+ + C+AFLAN ++AA+V F Y LP WS+SILPDC+
Sbjct: 357 SLGAYEQAHVFSSG----PGRCAAFLANYHTNSAATVVFNNMRYALPAWSISILPDCKRV 412
Query: 436 VFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSK-SWMTVKEPI-GVWSEN 493
VFNTA+V + + Q M L + SK SW T E + +
Sbjct: 413 VFNTAQV-----------------GVHIAQTQM----LPTISKLSWETYNEDTYSLGGSS 451
Query: 494 NFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFIN 553
TV G+LE +NVT+D SDYLW++T + +S + +F + + +PT+++ S + VFIN
Sbjct: 452 RMTVAGLLEQINVTRDTSDYLWYMTSVGISSSE-AFLRGGQ-KPTLSVRSAGHAVHVFIN 509
Query: 554 GQLTGSVIGH----WVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKL 609
GQ +GS G P+ ++G N + LLS VGL N G EK G G + +
Sbjct: 510 GQFSGSAYGSREHPAFTYTGPINLRAGMNKIALLSIAVGLPNVGLHFEKWQTGILGPISI 569
Query: 610 TGFKNGDIDLSKILWTYQVGLKGEFQQIYS-IEENEAEWTDLTRDGIPSTFTWYKTYFDA 668
+G G DL+ W+YQVGLKGE + S E +W + TWYK F+A
Sbjct: 570 SGLNGGKKDLTWQKWSYQVGLKGEAMNLVSPTEATSVDWIKGSLLQGQRPLTWYKASFNA 629
Query: 669 PDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGN 728
P G +P+ALDL SMGKGQAW+NG IGRYW A KGGC C Y G Y C CG
Sbjct: 630 PRGNEPLALDLRSMGKGQAWINGQSIGRYWMAYA-KGGC-SRCTYAGTYRPPTCENGCGQ 687
Query: 729 PTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSN 788
PTQ WYHVPRSWL+ +NN+LV+FEE GG+ +IS+ RS +C + E H ++
Sbjct: 688 PTQRWYHVPRSWLKPTNNVLVLFEELGGDASKISLMRRSVTGLCGEAVEYHAK-----ND 742
Query: 789 SYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVS 848
SY ++ N+ +HL C G +IS+I+FAS+GTP G C + +G CHAP S +++
Sbjct: 743 SYIIES----NEELDSLHLQCNPGQVISAIKFASFGTPSGTCGSYQKGTCHAPDSHAIIE 798
Query: 849 E 849
+
Sbjct: 799 K 799
>gi|308550956|gb|ADO34792.1| beta-galactosidase STBG7 [Solanum lycopersicum]
Length = 870
Score = 801 bits (2070), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/837 (48%), Positives = 527/837 (62%), Gaps = 35/837 (4%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
M +M L+ V +S+ +T +V+YD R++II+G R++LISA IHYPR+ P MWP L+
Sbjct: 23 MTVMSSSLAAVDASNVTTIGTD-SVTYDRRSLIINGQRKLLISASIHYPRSVPAMWPGLV 81
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
+KEGG DVIETYVFWN HE G Y F G+ D+VKF K++ +G+Y+ LRIGP+V AE
Sbjct: 82 RLAKEGGVDVIETYVFWNGHEPSPGNYYFGGRFDLVKFCKIIQQAGMYMILRIGPFVAAE 141
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
WNFGG PVWL +PG FRT++ PFK MQ+F+ V+LM+ E LF+ QGGPII+ Q+EN
Sbjct: 142 WNFGGLPVWLHYVPGTTFRTDSEPFKYHMQKFMTYTVNLMKRERLFASQGGPIILSQVEN 201
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPN 262
EYG E++YG+ GK Y WAA MAL GVPW+MC+Q DAP+ +ID CN +YCD +KP
Sbjct: 202 EYGYYENAYGEGGKRYALWAAKMALSQNTGVPWIMCQQYDAPDPVIDTCNSFYCDQFKPI 261
Query: 263 SYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRT 322
S NKP +WTENW GW+ T+G R PHRP ED+A++VARFFQ+GGS NYYMY GGTNFGRT
Sbjct: 262 SPNKPKIWTENWPGWFKTFGARDPHRPAEDVAYSVARFFQKGGSVQNYYMYHGGTNFGRT 321
Query: 323 SGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQE 382
+GGPF TSYDYDAPIDEYGL PKWGHLK+LH IK CE AL+ D + LG QE
Sbjct: 322 AGGPFITTSYDYDAPIDEYGLPRFPKWGHLKELHKVIKSCEHALLNNDPT-LLSLGPLQE 380
Query: 383 AHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKV 442
A VY + C+AFLAN+D+ V F SY LP WSVSILPDC+N FNTAKV
Sbjct: 381 ADVYE----DASGACAAFLANMDDKNDKVVQFRHVSYHLPAWSVSILPDCKNVAFNTAKV 436
Query: 443 SSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILE 502
QTSI + + L P S P++ + S W KE GVW +FT G ++
Sbjct: 437 GCQTSIVNMA-PIDLHPTASSPKRDI-------KSLQWEVFKETAGVWGVADFTKNGFVD 488
Query: 503 HLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG 562
H+N TKD +DYLW+ T I+V ++ F + N + ++S + VFIN +L S G
Sbjct: 489 HINTTKDATDYLWYTTSIFVHAEE-DFLR-NRGTAMLFVESKGHAMHVFINKKLQASASG 546
Query: 563 HWV----KVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDID 618
+ K P+ ++G N++ LLS TVGLQ GAF E GAG VK+ GFK G +D
Sbjct: 547 NGTVPQFKFGTPIALKAGKNEIALLSMTVGLQTAGAFYEWIGAG-PTSVKVAGFKTGTMD 605
Query: 619 LSKILWTYQVGLKGEFQQIY-SIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVAL 677
L+ WTY++GL+GE +I S W ++ TWYK DAP G +PVAL
Sbjct: 606 LTASAWTYKIGLQGEHLRIQKSYNLKSKIWAPTSQPPKQQPLTWYKAVVDAPPGNEPVAL 665
Query: 678 DLGSMGKGQAWVNGHHIGRYWTVVAPK-GGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHV 736
D+ MGKG AW+NG IGRYW K C CDYRG +N DKC T CG PTQ WYHV
Sbjct: 666 DMIHMGKGMAWLNGQEIGRYWPRRTSKYENCVTQCDYRGKFNPDKCVTGCGQPTQRWYHV 725
Query: 737 PRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVD--- 793
PRSW + S N+L+IFEE GG+P +I +R C +S H S+ V+
Sbjct: 726 PRSWFKPSGNVLIIFEEIGGDPSQIRFSMRKVSGACGHLSVDH--------PSFDVENLQ 777
Query: 794 -GKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
++ +K P + L C ISS++FAS+G P G C + G+CH S ++V +
Sbjct: 778 GSEIESDKNRPTLSLKCPTNTNISSVKFASFGNPNGTCGSYMLGDCHDQNSAALVEK 834
>gi|115450935|ref|NP_001049068.1| Os03g0165400 [Oryza sativa Japonica Group]
gi|122247496|sp|Q10RB4.1|BGAL5_ORYSJ RecName: Full=Beta-galactosidase 5; Short=Lactase 5; Flags:
Precursor
gi|108706354|gb|ABF94149.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113547539|dbj|BAF10982.1| Os03g0165400 [Oryza sativa Japonica Group]
gi|215717073|dbj|BAG95436.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 841
Score = 801 bits (2070), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/812 (50%), Positives = 521/812 (64%), Gaps = 45/812 (5%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD +A+++DG RR+L S IHYPR+TPEMW LI K+K+GG DVI+TYVFWN HE
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G YNF+G+ D+V+F+K V +G+++ LRIGPY+C EWNFGGFPVWL+ +PGI FRT+N P
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ F +KIV +M+ E LF+ QGGPII+ QIENEYG +G GK Y+ WAA MA
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGKAYINWAAKMA 206
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+GL GVPWVMCK+ DAP+ +I+ACNG+YCD + PN KPT+WTE W GW+T +GG +
Sbjct: 207 VGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSGWFTEFGGTIR 266
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
RPVEDLAF VARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL E
Sbjct: 267 QRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 326
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PK+GHLK+LH A+KLCE LV+AD LG QEAHV+R S S C+AFLAN +
Sbjct: 327 PKFGHLKELHRAVKLCEQPLVSADPT-VTTLGSMQEAHVFR-----SSSGCAAFLANYNS 380
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
++ A V F ++Y+LPPWS+SILPDC+N VFNTA V QT+ Q
Sbjct: 381 NSYAKVIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTN-----------------QM 423
Query: 467 SMIESKLSSTSKSWMTVKEPI-GVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
M ++S W E + + + T G+LE LNVT+D SDYLW+IT + V
Sbjct: 424 QMWAD--GASSMMWEKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPS 481
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLI 581
+ + ++T+ S L VFINGQL GS G + ++G N +
Sbjct: 482 EKFLQGGTPL--SLTVQSAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVA 539
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS GL N G E G G V + G G DL+ W+YQVGLKGE + S+E
Sbjct: 540 LLSVACGLPNVGVHYETWNTGVVGPVVIHGLDEGSRDLTWQTWSYQVGLKGEQMNLNSLE 599
Query: 642 -ENEAEWTD---LTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
EW + ++ P WY+ YFD P G +P+ALD+GSMGKGQ W+NG IGRY
Sbjct: 600 GSGSVEWMQGSLVAQNQQP--LAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRY 657
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
WT A +G C+ C Y G+Y + KC CG PTQ WYHVPRSWLQ + NLLV+FEE GG+
Sbjct: 658 WTAYA-EGDCKG-CHYTGSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGD 715
Query: 758 PFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISS 817
+I++ R+ VC VSE H P ++ W + + K +HL C G IS+
Sbjct: 716 SSKIALAKRTVSGVCADVSEYH-PNIKNWQIESYGEPEFHTAK----VHLKCAPGQTISA 770
Query: 818 IEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
I+FAS+GTP G C F +G CH+ S SV+ +
Sbjct: 771 IKFASFGTPLGTCGTFQQGECHSINSNSVLEK 802
>gi|350537729|ref|NP_001234307.1| beta-galactosidase, chloroplastic precursor [Solanum lycopersicum]
gi|7939621|gb|AAF70823.1|AF154422_1 beta-galactosidase [Solanum lycopersicum]
Length = 870
Score = 801 bits (2068), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/837 (48%), Positives = 527/837 (62%), Gaps = 35/837 (4%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
M +M L+ V +S+ +T +V+YD R++II+G R++LISA IHYPR+ P MWP L+
Sbjct: 23 MTVMSSSLAAVDASNVTTIGTD-SVTYDRRSLIINGQRKLLISASIHYPRSVPAMWPGLV 81
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
+KEGG DVIETYVFWN HE G Y F G+ D+VKF K++ +G+Y+ LRIGP+V AE
Sbjct: 82 RLAKEGGVDVIETYVFWNGHEPSPGNYYFGGRFDLVKFCKIIQQAGMYMILRIGPFVAAE 141
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
WNFGG PVWL +PG FRT++ PFK MQ+F+ V+LM+ E LF+ QGGPII+ Q+EN
Sbjct: 142 WNFGGLPVWLHYVPGTTFRTDSEPFKYHMQKFMTYTVNLMKRERLFASQGGPIILSQVEN 201
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPN 262
EYG E++YG+ GK Y WAA MAL GVPW+MC+Q DAP+ +ID CN +YCD +KP
Sbjct: 202 EYGYYENAYGEGGKRYALWAAKMALSQNTGVPWIMCQQYDAPDPVIDTCNSFYCDQFKPI 261
Query: 263 SYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRT 322
S NKP +WTENW GW+ T+G R PHRP ED+A++VARFFQ+GGS NYYMY GGTNFGRT
Sbjct: 262 SPNKPKIWTENWPGWFKTFGARDPHRPAEDVAYSVARFFQKGGSVQNYYMYHGGTNFGRT 321
Query: 323 SGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQE 382
+GGPF TSYDYDAPIDEYGL PKWGHLK+LH IK CE AL+ D + LG QE
Sbjct: 322 AGGPFITTSYDYDAPIDEYGLPRFPKWGHLKELHKVIKSCEHALLNNDPT-LLSLGPLQE 380
Query: 383 AHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKV 442
A VY + C+AFLAN+D+ V F SY LP WSVSILPDC+N FNTAKV
Sbjct: 381 ADVYE----DASGACAAFLANMDDKNDKVVQFRHVSYHLPAWSVSILPDCKNVAFNTAKV 436
Query: 443 SSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILE 502
QTSI + + L P S P++ + S W KE GVW +FT G ++
Sbjct: 437 GCQTSIVNMA-PIDLHPTASSPKRDI-------KSLQWEVFKETAGVWGVADFTKNGFVD 488
Query: 503 HLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG 562
H+N TKD +DYLW+ T I+V ++ F + N + ++S + VFIN +L S G
Sbjct: 489 HINTTKDATDYLWYTTSIFVHAEE-DFLR-NRGTAMLFVESKGHAMHVFINKKLQASASG 546
Query: 563 HWV----KVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDID 618
+ K P+ ++G N++ LLS TVGLQ GAF E GAG VK+ GFK G +D
Sbjct: 547 NGTVPQFKFGTPIALKAGKNEISLLSMTVGLQTAGAFYEWIGAG-PTSVKVAGFKTGTMD 605
Query: 619 LSKILWTYQVGLKGEFQQIY-SIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVAL 677
L+ WTY++GL+GE +I S W ++ TWYK DAP G +PVAL
Sbjct: 606 LTASAWTYKIGLQGEHLRIQKSYNLKSKIWAPTSQPPKQQPLTWYKAVVDAPPGNEPVAL 665
Query: 678 DLGSMGKGQAWVNGHHIGRYWTVVAPK-GGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHV 736
D+ MGKG AW+NG IGRYW K C CDYRG +N DKC T CG PTQ WYHV
Sbjct: 666 DMIHMGKGMAWLNGQEIGRYWPRRTSKYENCVTQCDYRGKFNPDKCVTGCGQPTQRWYHV 725
Query: 737 PRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVD--- 793
PRSW + S N+L+IFEE GG+P +I +R C +S H S+ V+
Sbjct: 726 PRSWFKPSGNVLIIFEEIGGDPSQIRFSMRKVSGACGHLSVDH--------PSFDVENLQ 777
Query: 794 -GKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
++ +K P + L C ISS++FAS+G P G C + G+CH S ++V +
Sbjct: 778 GSEIENDKNRPTLSLKCPTNTNISSVKFASFGNPNGTCGSYMLGDCHDQNSAALVEK 834
>gi|108706355|gb|ABF94150.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 819
Score = 800 bits (2067), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/812 (50%), Positives = 521/812 (64%), Gaps = 45/812 (5%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD +A+++DG RR+L S IHYPR+TPEMW LI K+K+GG DVI+TYVFWN HE
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G YNF+G+ D+V+F+K V +G+++ LRIGPY+C EWNFGGFPVWL+ +PGI FRT+N P
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ F +KIV +M+ E LF+ QGGPII+ QIENEYG +G GK Y+ WAA MA
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGKAYINWAAKMA 206
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+GL GVPWVMCK+ DAP+ +I+ACNG+YCD + PN KPT+WTE W GW+T +GG +
Sbjct: 207 VGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSGWFTEFGGTIR 266
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
RPVEDLAF VARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL E
Sbjct: 267 QRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 326
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PK+GHLK+LH A+KLCE LV+AD LG QEAHV+R S S C+AFLAN +
Sbjct: 327 PKFGHLKELHRAVKLCEQPLVSADPT-VTTLGSMQEAHVFR-----SSSGCAAFLANYNS 380
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
++ A V F ++Y+LPPWS+SILPDC+N VFNTA V QT+ Q
Sbjct: 381 NSYAKVIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTN-----------------QM 423
Query: 467 SMIESKLSSTSKSWMTVKEPI-GVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
M ++S W E + + + T G+LE LNVT+D SDYLW+IT + V
Sbjct: 424 QMWAD--GASSMMWEKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPS 481
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLI 581
+ + ++T+ S L VFINGQL GS G + ++G N +
Sbjct: 482 EKFLQGGTPL--SLTVQSAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVA 539
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS GL N G E G G V + G G DL+ W+YQVGLKGE + S+E
Sbjct: 540 LLSVACGLPNVGVHYETWNTGVVGPVVIHGLDEGSRDLTWQTWSYQVGLKGEQMNLNSLE 599
Query: 642 -ENEAEWTD---LTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
EW + ++ P WY+ YFD P G +P+ALD+GSMGKGQ W+NG IGRY
Sbjct: 600 GSGSVEWMQGSLVAQNQQP--LAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRY 657
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
WT A +G C+ C Y G+Y + KC CG PTQ WYHVPRSWLQ + NLLV+FEE GG+
Sbjct: 658 WTAYA-EGDCKG-CHYTGSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGD 715
Query: 758 PFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISS 817
+I++ R+ VC VSE H P ++ W + + K +HL C G IS+
Sbjct: 716 SSKIALAKRTVSGVCADVSEYH-PNIKNWQIESYGEPEFHTAK----VHLKCAPGQTISA 770
Query: 818 IEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
I+FAS+GTP G C F +G CH+ S SV+ +
Sbjct: 771 IKFASFGTPLGTCGTFQQGECHSINSNSVLEK 802
>gi|414864995|tpg|DAA43552.1| TPA: hypothetical protein ZEAMMB73_935084 [Zea mays]
Length = 845
Score = 799 bits (2064), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/812 (50%), Positives = 519/812 (63%), Gaps = 45/812 (5%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
+YD +A++IDG RR+L S IHYPR+TP+MW LI K+K+GG DVI+TYVFWN HE G
Sbjct: 30 TYDKKAVLIDGQRRILFSGSIHYPRSTPDMWEGLIQKAKDGGLDVIQTYVFWNGHEPTPG 89
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
Y F+ + D+V+FVK V +GL++ LRIGPY+C EWNFGGFPVWL+ +PGI FRT+N PF
Sbjct: 90 NYYFEERYDLVRFVKTVQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEPF 149
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
K MQ F +KIV +M+ E LF+ QGGPII+ QIENEYG +G G+ Y+ WAA MA+
Sbjct: 150 KTAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGQAYINWAAKMAV 209
Query: 228 GLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPH 287
GL GVPWVMCK+ DAP+ +I+ACNG+YCD + PN KPT+WTE W GW+T +GG +
Sbjct: 210 GLDTGVPWVMCKEEDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQ 269
Query: 288 RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEP 347
RPVEDLAFAVARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGL+ EP
Sbjct: 270 RPVEDLAFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIREP 329
Query: 348 KWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEH 407
K HLK+LH A+KLCE ALV+ D LG QEAHV+R S S C+AFLAN + +
Sbjct: 330 KHSHLKELHRAVKLCEQALVSVDPT-ITTLGTMQEAHVFR-----SPSGCAAFLANYNSN 383
Query: 408 TAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQS 467
+ A V F + Y+LPPWS+SILPDC+N VFN+A V QTS Q
Sbjct: 384 SHAKVVFNNEQYSLPPWSISILPDCKNVVFNSATVGVQTS-----------------QMQ 426
Query: 468 MIESKLSSTSKSWMTVKEPI-GVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
M +TS W E + + + T G+LE LNVT+D SDYLW+IT + +S +
Sbjct: 427 MWGD--GATSMMWERYDEEVDSLAAAPLLTTTGLLEQLNVTRDSSDYLWYITSVDISPSE 484
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLIL 582
+F + P++++ S L VF+NGQL GS G +K V ++G N + L
Sbjct: 485 -NFLQGGGKPPSLSVQSAGHALHVFVNGQLQGSSYGTREDRRIKYNGNVNLRAGTNKIAL 543
Query: 583 LSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE- 641
LS GL N G E G G V L G G DL+ W+YQVGLKGE + S+E
Sbjct: 544 LSVACGLPNVGVHYETWNTGVGGPVVLHGLNEGSRDLTWQTWSYQVGLKGEQMNLNSVEG 603
Query: 642 ENEAEWTD---LTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
EW + + P WYK YF+ P G +P+ALD+GSMGKGQ W+NG IGRYW
Sbjct: 604 SGSVEWMQGSLIAQKQQP--LAWYKAYFETPSGDEPLALDMGSMGKGQVWINGQSIGRYW 661
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEET-GGN 757
T A G C+ C Y G + + KC CG PTQ WYHVPRSWLQ S NLLV+ EE GG+
Sbjct: 662 TAYA-DGDCKG-CSYTGTFRAPKCQAGCGQPTQRWYHVPRSWLQPSRNLLVVLEELGGGD 719
Query: 758 PFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISS 817
+I++ RS VC VSE H P ++KW ++ ++HL C G IS+
Sbjct: 720 SSKIALAKRSVSSVCADVSEDH-PNIKKW----QIESYGEREHRRAKVHLRCAHGQSISA 774
Query: 818 IEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
I FAS+GTP G C F +G CH+ S +V+ +
Sbjct: 775 IRFASFGTPVGTCGNFQQGGCHSASSHAVLEK 806
>gi|242036825|ref|XP_002465807.1| hypothetical protein SORBIDRAFT_01g046160 [Sorghum bicolor]
gi|241919661|gb|EER92805.1| hypothetical protein SORBIDRAFT_01g046160 [Sorghum bicolor]
Length = 842
Score = 799 bits (2063), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/830 (49%), Positives = 532/830 (64%), Gaps = 46/830 (5%)
Query: 30 LSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGG 89
L C + + V+YD +A++IDG RR+L S IHYPR+TP+MW LI K+K+GG
Sbjct: 10 LGCAVAVAVLAAAVECAVTYDKKAVLIDGQRRILFSGSIHYPRSTPDMWEGLIQKAKDGG 69
Query: 90 ADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFP 149
DVI+TYVFWN HE G Y F+ + D+V+F+K V +GL++ LRIGPY+C EWNFGGFP
Sbjct: 70 LDVIQTYVFWNGHEPTPGNYYFEERYDLVRFIKTVQKAGLFVHLRIGPYICGEWNFGGFP 129
Query: 150 VWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMES 209
VWL+ +PGI FRT+N PFK MQ F +KIV +M+ E LF+ QGGPII+ QIENEYG
Sbjct: 130 VWLKYVPGISFRTDNEPFKTAMQGFTEKIVGMMKSEKLFASQGGPIILSQIENEYGPEGK 189
Query: 210 SYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTL 269
G G+ Y+ WAA MA+GLG GVPWVMCK+ DAP+ +I+ACNG+YCD + PN KPT+
Sbjct: 190 ELGAAGQAYINWAAKMAIGLGTGVPWVMCKEEDAPDPVINACNGFYCDAFSPNKPYKPTM 249
Query: 270 WTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYI 329
WTE W GW+T +GG + RPVEDLAFAVARF Q+GGSF+NYYMY GGTNFGRT+GGPF
Sbjct: 250 WTEAWSGWFTEFGGTIRQRPVEDLAFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFIT 309
Query: 330 TSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRAN 389
TSYDYDAPIDEYGL+ EPK HLK+LH A+KLCE ALV+ D A LG QEAHV+R
Sbjct: 310 TSYDYDAPIDEYGLVREPKHSHLKELHRAVKLCEQALVSVDPA-ITTLGTMQEAHVFR-- 366
Query: 390 RYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIK 449
S S C+AFLAN + ++ A V F + Y+LPPWS+SILPDC+N VFN+A V QTS
Sbjct: 367 ---SPSGCAAFLANYNSNSYAKVVFNNEQYSLPPWSISILPDCKNVVFNSATVGVQTS-- 421
Query: 450 TVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPI-GVWSENNFTVQGILEHLNVTK 508
Q M ++S W E + + + T G+LE LNVT+
Sbjct: 422 ---------------QMQMWGD--GASSMMWERYDEEVDSLAAAPLLTTTGLLEQLNVTR 464
Query: 509 DYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HW 564
D SDYLW+IT + +S + +F + ++++ S L VF+NG+L GS G
Sbjct: 465 DSSDYLWYITSVDISPSE-NFLQGGGKPLSLSVLSAGHALHVFVNGELQGSAYGTREDRR 523
Query: 565 VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILW 624
+K ++G N + LLS GL N G E G G V L G G DL+ W
Sbjct: 524 IKYNGNANLRAGTNKIALLSVACGLPNVGVHYETWNTGVGGPVGLHGLNEGSRDLTWQTW 583
Query: 625 TYQVGLKGEFQQIYSIE-ENEAEWTD---LTRDGIPSTFTWYKTYFDAPDGIDPVALDLG 680
+YQVGLKGE + S+E EW + ++ P +WY+ YF+ P G +P+ALD+G
Sbjct: 584 SYQVGLKGEQMNLNSLEGSTSVEWMQGSLIAQNQQP--LSWYRAYFETPSGDEPLALDMG 641
Query: 681 SMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSW 740
SMGKGQ W+NG IGRYWT A G C++ C Y G + + KC CG PTQ WYHVPRSW
Sbjct: 642 SMGKGQIWINGQSIGRYWTAYA-DGDCKE-CSYTGTFRAPKCQAGCGQPTQRWYHVPRSW 699
Query: 741 LQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWS-NSYSVDGKLSIN 799
LQ + NLLV+FEE GG+ +I++ RS VC VSE H P ++ W SY G+ +
Sbjct: 700 LQPTRNLLVVFEELGGDSSKIALVKRSVSSVCADVSEDH-PNIKNWQIESY---GEREYH 755
Query: 800 KMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+ ++HL C G IS+I+FAS+GTP G C F +G+CH+ S +V+ +
Sbjct: 756 RA--KVHLRCSPGQSISAIKFASFGTPMGTCGNFQQGDCHSANSHTVLEK 803
>gi|224128630|ref|XP_002329051.1| predicted protein [Populus trichocarpa]
gi|222839722|gb|EEE78045.1| predicted protein [Populus trichocarpa]
Length = 830
Score = 799 bits (2063), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 411/827 (49%), Positives = 532/827 (64%), Gaps = 58/827 (7%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
++ ++ S V S +AS VSYD +AI I+G RR+LIS IHYPR++PEMWPDLI
Sbjct: 8 VVFLVFLASLVCSVTAS-------VSYDSKAITINGQRRILISGSIHYPRSSPEMWPDLI 60
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K+KEGG DVI+TYVFWN HE G+Y F+G D+VKFVKLV +GLY+ LRIGPY+CAE
Sbjct: 61 QKAKEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFVKLVKEAGLYVNLRIGPYICAE 120
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEE---MQRFVKKIVDLMREEMLFSWQGGPIIMLQ 199
WNFG +F+ PF+ E M++F KIV++M+ E LF QGGPII+ Q
Sbjct: 121 WNFGH-----------QFQNGQWPFQGEAAQMRKFTTKIVNMMKAERLFESQGGPIILSQ 169
Query: 200 IENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGY 259
IENEYG ME G G+ Y KWAA MA+GL GVPWVMCKQ DAP+ II+ CNG+YCD +
Sbjct: 170 IENEYGPMEYELGSPGQAYTKWAAQMAVGLRTGVPWVMCKQDDAPDPIINTCNGFYCDYF 229
Query: 260 KPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNF 319
PN KP +WTE W GW+T +GG +PHRP ED+AF+VARF Q+GGSF+NYYMY GGTNF
Sbjct: 230 SPNKAYKPKMWTEAWTGWFTQFGGPVPHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNF 289
Query: 320 GRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQ 379
GRT+GGPF TSYDYDAP+DEYGLL +PKWGHLKDLH AIKLCEPALV+ D A I LG
Sbjct: 290 GRTAGGPFIATSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGD-ATVIPLGN 348
Query: 380 NQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNT 439
QEAHV+ G C+AFLAN + + A V+F Y LPPWS+SILPDC+NTV+NT
Sbjct: 349 YQEAHVFNYKAGG----CAAFLANYHQRSFAKVSFRNMHYNLPPWSISILPDCKNTVYNT 404
Query: 440 AKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQG 499
A+V +Q++ ++ ++P VP + SW T E +N FT+ G
Sbjct: 405 ARVGAQSA------TIKMTP---VPMHGGL---------SWQTYNEEPSSSGDNTFTMVG 446
Query: 500 ILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGS 559
+LE +N T+D SDYLW++T +++ D F K+ + P +T+ S L VFINGQL+G+
Sbjct: 447 LLEQINTTRDVSDYLWYMTDVHI-DPSEGFLKSGKY-PVLTVLSAGHALHVFINGQLSGT 504
Query: 560 VIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG 615
G + Q V ++G N + LLS VGL N G E AG G V L G G
Sbjct: 505 AYGSLDFPKLTFSQGVSLRAGVNKISLLSIAVGLPNVGPHFETWNAGILGPVTLNGLNEG 564
Query: 616 DIDLSKILWTYQVGLKGE-FQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDP 674
+DLS W+Y++GL GE + EW + + +WYKT F+AP G P
Sbjct: 565 RMDLSWQKWSYKIGLHGEALSLHSISGSSSVEWAEGSLVAQKQPLSWYKTTFNAPAGNSP 624
Query: 675 VALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWY 734
+ALD+GSMGKGQ W+NG H+GR+W G C + C Y G YN +KC+TNCG +Q WY
Sbjct: 625 LALDMGSMGKGQIWINGQHVGRHWPAYKASGTCGE-CTYIGTYNENKCSTNCGEASQRWY 683
Query: 735 HVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDG 794
HVP+SWL+ + NLLV+FEE GG+P +S+ R VC + E + P +Y +
Sbjct: 684 HVPQSWLKPTGNLLVVFEEWGGDPNGVSLVRREVDSVCADIYE--WQPTLM---NYQMQA 738
Query: 795 KLSINK-MAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHA 840
+NK + P+ HL C G I SI+FAS+GTP+G C +++G+CHA
Sbjct: 739 SGKVNKPLRPKAHLSCGPGQKIRSIKFASFGTPEGVCGSYNQGSCHA 785
>gi|326534200|dbj|BAJ89450.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 763
Score = 798 bits (2062), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/751 (51%), Positives = 515/751 (68%), Gaps = 32/751 (4%)
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
QY+F+G+ND+V+FVK +GLY+ LRIGPYVCAEWN+GGFP+WL IPGI+ RT+N PF
Sbjct: 1 QYDFEGRNDLVRFVKAAADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKLRTDNEPF 60
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
K EMQRF +K+V M+ L++ QGGPII+ QIENEYGN+ +SYG GK Y++WAA MA+
Sbjct: 61 KTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIAASYGAAGKSYIRWAAGMAV 120
Query: 228 GLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPH 287
L GVPWVMC+QTDAPE +I+ CNG+YCD + P+ ++P LWTENW GW+ ++GG +P+
Sbjct: 121 ALDTGVPWVMCQQTDAPEPLINTCNGFYCDQFTPSLPSRPKLWTENWSGWFLSFGGAVPY 180
Query: 288 RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEP 347
RP EDLAFAVARF+QRGG+ NYYMY GGTNFGR+SGGPF TSYDYDAPIDEYGL+ +P
Sbjct: 181 RPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDEYGLVRQP 240
Query: 348 KWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEH 407
KWGHL+D+H AIK+CEPAL+A D + Y+ LGQN EAHVY+ S S C+AFLANID+
Sbjct: 241 KWGHLRDVHKAIKMCEPALIATDPS-YMSLGQNAEAHVYK-----SGSLCAAFLANIDDQ 294
Query: 408 TAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQ---TSIKTVEFSLPLSPNISVP 464
+ +VTF G++Y LP WSVSILPDC+N V NTA+++SQ T ++ + FS S
Sbjct: 295 SDKTVTFNGKAYKLPAWSVSILPDCKNVVLNTAQINSQVASTQMRNLGFSTQAS------ 348
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
S +E++L+++ SW EP+G+ EN T G++E +N T D SD+LW+ T I V+
Sbjct: 349 DGSSVEAELAAS--SWSYAVEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVAG 406
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQL----TGSVIGHWVKVVQPVEFQSGYNDL 580
+ N + + ++S+ VL+VFING+L GS + + PV +G N +
Sbjct: 407 GEPYL---NGSQSNLPVNSLGHVLQVFINGKLAGSSKGSASSSLISLTTPVTLVTGKNKI 463
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS TVGL NYGAF + GAG G VKLTG K G +DLS WTYQ+GL+GE +Y+
Sbjct: 464 DLLSATVGLTNYGAFFDLVGAGITGPVKLTGPK-GTLDLSSAEWTYQIGLRGEDLHLYNP 522
Query: 641 EENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-T 699
E EW + TWYK+ F AP G DPVA+D MGKG+AWVNG IGRYW T
Sbjct: 523 SEASPEWVSDNSYPTNNPLTWYKSKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPT 582
Query: 700 VVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPF 759
+AP+ C ++C+YRG+Y++ KC CG P+Q YHVPRS+LQ +N +V+FE+ GGNP
Sbjct: 583 NIAPQSDCVNSCNYRGSYSATKCLKKCGQPSQILYHVPRSFLQPGSNDIVLFEQFGGNPS 642
Query: 760 EISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHC-QDGYIISSI 818
+IS + T VC VSE H + W V + + + P + L C ++G +ISSI
Sbjct: 643 KISFTTKQTESVCAHVSEDHPDQIDSW-----VSSQQKLQRSGPALRLECPKEGQVISSI 697
Query: 819 EFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+FAS+GTP G C +S G C + +L+V E
Sbjct: 698 KFASFGTPSGTCGSYSHGECSSSQALAVAQE 728
>gi|215734965|dbj|BAG95687.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 919
Score = 798 bits (2061), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 400/813 (49%), Positives = 516/813 (63%), Gaps = 43/813 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YDHR++II G RR+LIS IHYPR+ PEMWP L+A++K+GGAD +ETYVFWN HE
Sbjct: 105 SVTYDHRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHEPA 164
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+GQY F+ + D+V+F K+V +GLY+ LRIGP+V AEW FGG PVWL PG FRTNN
Sbjct: 165 QGQYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWTFGGVPVWLHYAPGTVFRTNNE 224
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK M+RF IVD+M++E F+ QGG II+ Q+ENEYG+ME +YG K Y WAASM
Sbjct: 225 PFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEYGDMEQAYGAGAKPYAMWAASM 284
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
AL GVPW+MC+Q DAP+ +I+ CN +YCD +KPNS KP WTENW GW+ T+G
Sbjct: 285 ALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQFKPNSPTKPKFWTENWPGWFQTFGESN 344
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRP ED+AF+VARFF +GGS NYY+Y GGTNFGRT+GGPF TSYDYDAPIDEYGL
Sbjct: 345 PHRPPEDVAFSVARFFGKGGSLQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGLRR 404
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
PKW HL+DLH +IKL E L+ +S+ ++ LG QEA VY G C AFL+N+D
Sbjct: 405 LPKWAHLRDLHKSIKLGEHTLLYGNSS-FVSLGPQQEADVYTDQSGG----CVAFLSNVD 459
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
VTF +SY LP WSVSILPDC+N FNTAKV SQT +
Sbjct: 460 SEKDKVVTFQSRSYDLPAWSVSILPDCKNVAFNTAKVRSQTLM----------------- 502
Query: 466 QSMIESKL-SSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
M+ + L SS W +E G+W + G ++H+N TKD +DYLW+ T V
Sbjct: 503 MDMVPANLESSKVDGWSIFREKYGIWGNIDLVRNGFVDHINTTKDSTDYLWYTTSFDVDG 562
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYNDL 580
++ N V + I+S ++ F+N +L GS G+ K V PV ++G N L
Sbjct: 563 SHLA--GGNHV---LHIESKGHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAGKNKL 617
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS TVGLQN G E GAG VK++G +N IDLS W Y++GL+GE+ ++
Sbjct: 618 SLLSMTVGLQNGGPMYEWAGAGIT-SVKISGMENRIIDLSSNKWEYKIGLEGEYYSLFKA 676
Query: 641 EE-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
++ + W + TWYK D P G DPV LD+ SMGKG AW+NG+ IGRYW
Sbjct: 677 DKGKDIRWMPQSEPPKNQPMTWYKVNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWP 736
Query: 700 VVAP-KGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
++P C +CDYRG ++ +KC CG PTQ WYHVPRSW S N LVIFEE GG+P
Sbjct: 737 RISPVSDRCTSSCDYRGTFSPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDP 796
Query: 759 FEISVKLRSTRIVCEQVSESHYPPV--RKWSNSYSVDGKLSINKMAPEMHLHCQDGYIIS 816
+I+ R+ VC VSE HYP + W + DG + A ++ L C G IS
Sbjct: 797 TKITFSRRTVASVCSFVSE-HYPSIDLESWDRNTQNDG-----RDAAKVQLSCPKGKSIS 850
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
S++F S+G P G C+ + +G+CH P S+SVV +
Sbjct: 851 SVKFVSFGNPSGTCRSYQQGSCHHPNSISVVEK 883
>gi|242053381|ref|XP_002455836.1| hypothetical protein SORBIDRAFT_03g025990 [Sorghum bicolor]
gi|241927811|gb|EES00956.1| hypothetical protein SORBIDRAFT_03g025990 [Sorghum bicolor]
Length = 785
Score = 798 bits (2060), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 406/791 (51%), Positives = 506/791 (63%), Gaps = 53/791 (6%)
Query: 64 ISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKL 123
+S +HYPR+ PEMWPDLI K+K+GG DV++TYVFWN HE RGQY F+G+ D+V F+KL
Sbjct: 1 MSGSVHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRGQYYFEGRYDLVHFIKL 60
Query: 124 VGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMR 183
V +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N PFK EMQ+F KIVD+M+
Sbjct: 61 VKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKAEMQKFTTKIVDMMK 120
Query: 184 EEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDA 243
E LF WQGGPII+ QIENE+G +E G+ K Y WAA+MA+ L VPWVMCK+ DA
Sbjct: 121 SEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAVALNTSVPWVMCKEDDA 180
Query: 244 PENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQR 303
P+ II+ CNG+YCD + PN +KPT+WTE W WYT +G +PHRPVEDLA+ VA+F Q+
Sbjct: 181 PDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPHRPVEDLAYGVAKFIQK 240
Query: 304 GGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCE 363
GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGLL EPKWGHLK+LH AIKLCE
Sbjct: 241 GGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLREPKWGHLKELHKAIKLCE 300
Query: 364 PALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPP 423
PALVA D LG Q+A V+R+ S C AFL N D+ + A V+F G Y LPP
Sbjct: 301 PALVAGDPI-VTSLGNAQQASVFRS----STDACVAFLENKDKVSYARVSFNGMHYNLPP 355
Query: 424 WSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTV 483
WS+SILPDC+ TV+NTA+V SQ S +E++ +W +
Sbjct: 356 WSISILPDCKTTVYNTARVGSQISQMKMEWAGGF---------------------TWQSY 394
Query: 484 KEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDS 543
E I + +F G+LE +NVT+D +DYLW+ T + V+ D+ +N P +T+ S
Sbjct: 395 NEDINSLGDESFVTVGLLEQINVTRDNTDYLWYTTYVDVAQDEQFL--SNGKNPVLTVMS 452
Query: 544 MRDVLRVFINGQLTGSVIGHWVKVVQP-------VEFQSGYNDLILLSQTVGLQNYGAFL 596
L +F+NGQLTG+V G V P V+ G N + LS VGL N G
Sbjct: 453 AGHALHIFVNGQLTGTVYG---SVDDPKLTYRGNVKLWPGSNTISCLSIAVGLPNVGEHF 509
Query: 597 EKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE-FQQIYSIEENEAEWTDLTRDGI 655
E AG G V L G G DL+ WTY+VGLKGE + EW + +
Sbjct: 510 ETWNAGILGPVTLDGLNEGRRDLTWQKWTYKVGLKGEDLSLHSLSGSSSVEWGEPMQK-- 567
Query: 656 PSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRG 715
TWYK +F+APDG +P+ALD+ SMGKGQ W+NG IGRYW G C CDYRG
Sbjct: 568 -QPLTWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKASGTC-GICDYRG 625
Query: 716 AYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQV 775
Y+ KC TNCG+ +Q WYHVPRSWL + NLLVIFEE GG+P IS+ R+T +C V
Sbjct: 626 EYDEKKCQTNCGDSSQRWYHVPRSWLNPTGNLLVIFEEWGGDPTGISMVKRTTGSICADV 685
Query: 776 SESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSR 835
SE P + W K+ HL C G ++ I+FAS+GTPQG C +S
Sbjct: 686 SEWQ-PSMTNWRTKDYEKAKI---------HLQCDHGRKMTDIKFASFGTPQGSCGSYSE 735
Query: 836 GNCHAPMSLSV 846
G CHA S +
Sbjct: 736 GGCHAHKSYDI 746
>gi|20514290|gb|AAM22973.1|AF499737_1 beta-galactosidase [Oryza sativa Japonica Group]
gi|21070357|gb|AAM34271.1|AF508799_1 beta-galactosidase [Oryza sativa Japonica Group]
Length = 843
Score = 797 bits (2059), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/814 (50%), Positives = 522/814 (64%), Gaps = 47/814 (5%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD +A+++DG RR+L S IHYPR+TPEMW LI K+K+GG DVI+TYVFWN HE
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G YNF+G+ D+V+F+K V +G+++ LRIGPY+C EWNFGGFPVWL+ +PGI FRT+N P
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ F +KIV +M+ E LF+ QGGPII+ QIENEYG +G GK Y+ WAA MA
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGKAYINWAAKMA 206
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+GL GVPWVMCK+ DAP+ +I+ACNG+YCD + PN KPT+WTE W GW+T +GG +
Sbjct: 207 VGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSGWFTEFGGTIR 266
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
RPVEDLAF VARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL E
Sbjct: 267 QRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 326
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PK+GHLK+LH A+KLCE LV+AD LG QEAHV+R S S C+AFLAN +
Sbjct: 327 PKFGHLKELHRAVKLCEQPLVSADPT-VTTLGSMQEAHVFR-----SSSGCAAFLANYNS 380
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
++ A V F ++Y+LPPWS+SILPDC+N VFNTA V QT+ Q
Sbjct: 381 NSYAKVIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTN-----------------QM 423
Query: 467 SMIESKLSSTSKSWMTVKEPI-GVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
M ++S W E + + + T G+LE LNVT+D SDYLW+IT++ V
Sbjct: 424 QMWAD--GASSMMWEKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITRVEVDPS 481
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLI 581
+ + ++T+ S L VFINGQL GS G + ++G N +
Sbjct: 482 EKFLQGGTPL--SLTVQSAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVA 539
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTY--QVGLKGEFQQIYS 639
LLS GL N G E G G V + G G DL+ W+Y QVGLKGE + S
Sbjct: 540 LLSVACGLPNVGVHYETWNTGVVGPVVIHGLDEGSRDLTWQTWSYQFQVGLKGEQMNLNS 599
Query: 640 IE-ENEAEWTD---LTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
+E EW + ++ P WY+ YFD P G +P+ALD+GSMGKGQ W+NG IG
Sbjct: 600 LEGSGSVEWMQGSLVAQNQQP--LAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIG 657
Query: 696 RYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
RYWT A +G C+ C Y G+Y + KC CG PTQ WYHVPRSWLQ + NLLV+FEE G
Sbjct: 658 RYWTAYA-EGDCKG-CHYTGSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELG 715
Query: 756 GNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYII 815
G+ +I++ R+ VC VSE H P ++ W + + K +HL C G I
Sbjct: 716 GDSSKIALAKRTVSGVCADVSEYH-PNIKNWQIESYGEPEFHTAK----VHLKCAPGQTI 770
Query: 816 SSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
S+I+FAS+GTP G C F +G CH+ S SV+ +
Sbjct: 771 SAIKFASFGTPLGTCGTFQQGECHSINSNSVLEK 804
>gi|115441369|ref|NP_001044964.1| Os01g0875500 [Oryza sativa Japonica Group]
gi|75103778|sp|Q5N8X6.1|BGAL3_ORYSJ RecName: Full=Beta-galactosidase 3; Short=Lactase 3; Flags:
Precursor
gi|56784847|dbj|BAD82087.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113534495|dbj|BAF06878.1| Os01g0875500 [Oryza sativa Japonica Group]
gi|222619622|gb|EEE55754.1| hypothetical protein OsJ_04267 [Oryza sativa Japonica Group]
Length = 851
Score = 797 bits (2058), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 400/813 (49%), Positives = 516/813 (63%), Gaps = 43/813 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YDHR++II G RR+LIS IHYPR+ PEMWP L+A++K+GGAD +ETYVFWN HE
Sbjct: 37 SVTYDHRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHEPA 96
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+GQY F+ + D+V+F K+V +GLY+ LRIGP+V AEW FGG PVWL PG FRTNN
Sbjct: 97 QGQYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWTFGGVPVWLHYAPGTVFRTNNE 156
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK M+RF IVD+M++E F+ QGG II+ Q+ENEYG+ME +YG K Y WAASM
Sbjct: 157 PFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEYGDMEQAYGAGAKPYAMWAASM 216
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
AL GVPW+MC+Q DAP+ +I+ CN +YCD +KPNS KP WTENW GW+ T+G
Sbjct: 217 ALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQFKPNSPTKPKFWTENWPGWFQTFGESN 276
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRP ED+AF+VARFF +GGS NYY+Y GGTNFGRT+GGPF TSYDYDAPIDEYGL
Sbjct: 277 PHRPPEDVAFSVARFFGKGGSLQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGLRR 336
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
PKW HL+DLH +IKL E L+ +S+ ++ LG QEA VY G C AFL+N+D
Sbjct: 337 LPKWAHLRDLHKSIKLGEHTLLYGNSS-FVSLGPQQEADVYTDQSGG----CVAFLSNVD 391
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
VTF +SY LP WSVSILPDC+N FNTAKV SQT +
Sbjct: 392 SEKDKVVTFQSRSYDLPAWSVSILPDCKNVAFNTAKVRSQTLM----------------- 434
Query: 466 QSMIESKL-SSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
M+ + L SS W +E G+W + G ++H+N TKD +DYLW+ T V
Sbjct: 435 MDMVPANLESSKVDGWSIFREKYGIWGNIDLVRNGFVDHINTTKDSTDYLWYTTSFDVDG 494
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYNDL 580
++ N V + I+S ++ F+N +L GS G+ K V PV ++G N L
Sbjct: 495 SHLA--GGNHV---LHIESKGHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAGKNKL 549
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS TVGLQN G E GAG VK++G +N IDLS W Y++GL+GE+ ++
Sbjct: 550 SLLSMTVGLQNGGPMYEWAGAGIT-SVKISGMENRIIDLSSNKWEYKIGLEGEYYSLFKA 608
Query: 641 EE-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
++ + W + TWYK D P G DPV LD+ SMGKG AW+NG+ IGRYW
Sbjct: 609 DKGKDIRWMPQSEPPKNQPMTWYKVNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWP 668
Query: 700 VVAP-KGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
++P C +CDYRG ++ +KC CG PTQ WYHVPRSW S N LVIFEE GG+P
Sbjct: 669 RISPVSDRCTSSCDYRGTFSPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDP 728
Query: 759 FEISVKLRSTRIVCEQVSESHYPPV--RKWSNSYSVDGKLSINKMAPEMHLHCQDGYIIS 816
+I+ R+ VC VSE HYP + W + DG + A ++ L C G IS
Sbjct: 729 TKITFSRRTVASVCSFVSE-HYPSIDLESWDRNTQNDG-----RDAAKVQLSCPKGKSIS 782
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
S++F S+G P G C+ + +G+CH P S+SVV +
Sbjct: 783 SVKFVSFGNPSGTCRSYQQGSCHHPNSISVVEK 815
>gi|218189464|gb|EEC71891.1| hypothetical protein OsI_04635 [Oryza sativa Indica Group]
Length = 851
Score = 796 bits (2055), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 400/813 (49%), Positives = 516/813 (63%), Gaps = 43/813 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YD R++II G RR+LIS IHYPR+ PEMWP L+A++K+GGAD +ETYVFWN HE
Sbjct: 37 SVTYDQRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHEPA 96
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+GQY F+ + D+V+F K+V +GLY+ LRIGP+V AEW FGG PVWL PG FRTNN
Sbjct: 97 QGQYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWTFGGVPVWLHYAPGTVFRTNNE 156
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK M+RF IVD+M++E F+ QGG II+ Q+ENEYG+ME +YG K Y WAASM
Sbjct: 157 PFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEYGDMEQAYGAGAKPYAMWAASM 216
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
AL GVPW+MC+Q DAP+ +I+ CN +YCD +KPNS KP WTENW GW+ T+G
Sbjct: 217 ALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQFKPNSPTKPKFWTENWPGWFQTFGESN 276
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRP ED+AF+VARFF +GGS NYY+Y GGTNFGRT+GGPF TSYDYDAPIDEYGL
Sbjct: 277 PHRPPEDVAFSVARFFGKGGSLQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGLRR 336
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
PKW HL+DLH +IKL E L+ +S+ ++ LG QEA VY G C AFL+N+D
Sbjct: 337 LPKWAHLRDLHKSIKLGEHTLLYGNSS-FVSLGPQQEADVYTDQSGG----CVAFLSNVD 391
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
VTF +SY LP WSVSILPDC+N FNTAKV SQT +
Sbjct: 392 SEKDKVVTFQSRSYDLPAWSVSILPDCKNVAFNTAKVRSQTLM----------------- 434
Query: 466 QSMIESKL-SSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
M+ + L SS W +E G+W + G ++H+N TKD +DYLW+ T V
Sbjct: 435 MDMVPANLESSKVDGWSIFREKYGIWGNIDLVRNGFVDHINTTKDSTDYLWYTTSFDVDG 494
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYNDL 580
++ N V + I+S ++ F+N +L GS G+ K V PV ++G N L
Sbjct: 495 SHLA--GGNHV---LHIESKGHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAGKNKL 549
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS TVGLQN G E GAG VK++G +N IDLS W Y++GL+GE+ ++
Sbjct: 550 SLLSMTVGLQNGGPMYEWAGAGIT-SVKISGMENRIIDLSSNKWEYKIGLEGEYYSLFKA 608
Query: 641 EE-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
++ + W + TWYK D P G DPV LD+ SMGKG AW+NG+ IGRYW
Sbjct: 609 DKGKDIRWMPQSEPPKNQPMTWYKVNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWP 668
Query: 700 VVAP-KGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
++P C +CDYRG ++ +KC CG PTQ WYHVPRSW S N LVIFEE GG+P
Sbjct: 669 RISPVSDRCTSSCDYRGTFSPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDP 728
Query: 759 FEISVKLRSTRIVCEQVSESHYPPV--RKWSNSYSVDGKLSINKMAPEMHLHCQDGYIIS 816
+I+ R+ VC VSE HYP + W + DG + A ++ L C G IS
Sbjct: 729 TKITFSRRTVASVCSFVSE-HYPSIDLESWDRNTQNDG-----RDAAKVQLSCPKGKSIS 782
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
S++FAS+G P G C+ + +G+CH P S+SVV +
Sbjct: 783 SVKFASFGNPSGTCRSYQQGSCHHPNSISVVEK 815
>gi|449489867|ref|XP_004158444.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cucumis
sativus]
Length = 725
Score = 794 bits (2050), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/759 (52%), Positives = 514/759 (67%), Gaps = 44/759 (5%)
Query: 16 LSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATP 75
L + +M++ + + L SS AS V+YDH+AIII+G RR+LIS IHYPR+ P
Sbjct: 2 LKMSKIMVVFLGLFLWVCSSVMAS-------VTYDHKAIIINGRRRILISGSIHYPRSIP 54
Query: 76 EMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRI 135
+MWPDLI K+K+GG DVIETYVFWN HE GQYNF+ + D+V+FVKLV +GLY+ LRI
Sbjct: 55 QMWPDLIQKAKDGGLDVIETYVFWNGHEPSPGQYNFEDRYDLVRFVKLVHQAGLYVHLRI 114
Query: 136 GPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPI 195
GPYVCAEWNFGGFPVWL+ +PGI FRT+N PFK MQ+F +KIV LM+ E L+ QGGPI
Sbjct: 115 GPYVCAEWNFGGFPVWLKYVPGIAFRTDNGPFKAAMQKFTEKIVGLMKGEKLYESQGGPI 174
Query: 196 IMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYY 255
I+ QIENEYG +E G GK Y KWAA MALGL GVPWVMCKQ DAP+ +ID CNG+Y
Sbjct: 175 ILSQIENEYGPVEWEIGAPGKSYTKWAAQMALGLNTGVPWVMCKQDDAPDPVIDTCNGFY 234
Query: 256 CDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFG 315
C+ +KPN KP +WTE W GW+T +GG P+RPVED+A++VARF Q GGSF+NYYMY G
Sbjct: 235 CENFKPNKVYKPKMWTEAWTGWFTEFGGPAPYRPVEDMAYSVARFIQNGGSFINYYMYHG 294
Query: 316 GTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAAD-SAQY 374
GTNFGRT+GGPF TSYDYDAPIDEYGLL EPKW HL+DLH AIKLCEPALV+ D + Y
Sbjct: 295 GTNFGRTAGGPFIATSYDYDAPIDEYGLLREPKWSHLRDLHKAIKLCEPALVSVDPTVSY 354
Query: 375 IKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRN 434
LG NQEAHV++ R GS C+AFLAN D ++A+VTF Y LPPWSVSILPDC++
Sbjct: 355 --LGSNQEAHVFK-TRSGS---CAAFLANYDASSSATVTFGNNQYDLPPWSVSILPDCKS 408
Query: 435 TVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIG-VWSEN 493
+FNTAKV + T S P+ + + +S SW++ E ++E+
Sbjct: 409 VIFNTAKVGAPT---------------SQPKMTPV------SSFSWLSYNEETASAYTED 447
Query: 494 NFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFIN 553
T+ G++E ++VT+D +DYLW++T I + D + F K+ + P +T+ S L VFIN
Sbjct: 448 TTTMAGLVEQISVTRDSTDYLWYMTDIRI-DPNEGFLKSGQ-WPLLTVFSAGHALHVFIN 505
Query: 554 GQLTGSVIG----HWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKL 609
GQL+G+ G + + + V ++G N L +LS VGL N G E G G V L
Sbjct: 506 GQLSGTTYGGSENYKLTFSKYVNLRAGINKLSILSVAVGLPNGGLHYETWNTGVLGPVTL 565
Query: 610 TGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDA 668
G D+S W+Y++GLKGE ++S+ + EW + TWYKT FD+
Sbjct: 566 KGLNEDTRDMSGYKWSYKIGLKGEALNLHSVSGSSSVEWVTGSLVAQKQPLTWYKTTFDS 625
Query: 669 PDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGN 728
P G +P+ALD+ SMGKGQ W+NG IGR+W KG C C+Y G +N KC + CG
Sbjct: 626 PKGNEPLALDMSSMGKGQIWINGQSIGRHWPAYTAKGSC-GKCNYGGIFNEKKCHSXCGE 684
Query: 729 PTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRS 767
P+Q WYHVPR+WL++S N+LVIFEE GGNP IS+ RS
Sbjct: 685 PSQRWYHVPRAWLKSSGNVLVIFEEWGGNPEGISLVKRS 723
>gi|222624250|gb|EEE58382.1| hypothetical protein OsJ_09539 [Oryza sativa Japonica Group]
Length = 851
Score = 793 bits (2049), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/822 (49%), Positives = 521/822 (63%), Gaps = 55/822 (6%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD +A+++DG RR+L S IHYPR+TPEMW LI K+K+GG DVI+TYVFWN HE
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G YNF+G+ D+V+F+K V +G+++ LRIGPY+C EWNFGGFPVWL+ +PGI FRT+N P
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQ----------IENEYGNMESSYGQQGK 216
FK MQ F +KIV +M+ E LF+ QGGPII+ Q IENEYG +G GK
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQASAKLCFPCHIENEYGPEGKEFGAAGK 206
Query: 217 DYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDG 276
Y+ WAA MA+GL GVPWVMCK+ DAP+ +I+ACNG+YCD + PN KPT+WTE W G
Sbjct: 207 AYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSG 266
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDA 336
W+T +GG + RPVEDLAF VARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDA
Sbjct: 267 WFTEFGGTIRQRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDA 326
Query: 337 PIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSN 396
P+DEYGL EPK+GHLK+LH A+KLCE LV+AD LG QEAHV+R S S
Sbjct: 327 PLDEYGLAREPKFGHLKELHRAVKLCEQPLVSADPT-VTTLGSMQEAHVFR-----SSSG 380
Query: 397 CSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLP 456
C+AFLAN + ++ A V F ++Y+LPPWS+SILPDC+N VFNTA V QT+
Sbjct: 381 CAAFLANYNSNSYAKVIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTN--------- 431
Query: 457 LSPNISVPQQSMIESKLSSTSKSWMTVKEPI-GVWSENNFTVQGILEHLNVTKDYSDYLW 515
Q M ++S W E + + + T G+LE LNVT+D SDYLW
Sbjct: 432 --------QMQMWAD--GASSMMWEKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLW 481
Query: 516 HITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPV 571
+IT + V + + ++T+ S L VFINGQL GS G +
Sbjct: 482 YITSVEVDPSEKFLQGGTPL--SLTVQSAGHALHVFINGQLQGSAYGTREDRKISYSGNA 539
Query: 572 EFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLK 631
++G N + LLS GL N G E G G V + G G DL+ W+YQVGLK
Sbjct: 540 NLRAGTNKVALLSVACGLPNVGVHYETWNTGVVGPVVIHGLDEGSRDLTWQTWSYQVGLK 599
Query: 632 GEFQQIYSIE-ENEAEWTD---LTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQA 687
GE + S+E EW + ++ P WY+ YFD P G +P+ALD+GSMGKGQ
Sbjct: 600 GEQMNLNSLEGSGSVEWMQGSLVAQNQQP--LAWYRAYFDTPSGDEPLALDMGSMGKGQI 657
Query: 688 WVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNL 747
W+NG IGRYWT A +G C+ C Y G+Y + KC CG PTQ WYHVPRSWLQ + NL
Sbjct: 658 WINGQSIGRYWTAYA-EGDCKG-CHYTGSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNL 715
Query: 748 LVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHL 807
LV+FEE GG+ +I++ R+ VC VSE H P ++ W + + K +HL
Sbjct: 716 LVVFEELGGDSSKIALAKRTVSGVCADVSEYH-PNIKNWQIESYGEPEFHTAK----VHL 770
Query: 808 HCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
C G IS+I+FAS+GTP G C F +G CH+ S SV+ +
Sbjct: 771 KCAPGQTISAIKFASFGTPLGTCGTFQQGECHSINSNSVLEK 812
>gi|326503960|dbj|BAK02766.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 845
Score = 793 bits (2048), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/811 (48%), Positives = 515/811 (63%), Gaps = 40/811 (4%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YDHR+++I G RR+LISA IHYPR+ P MWP L+A++KEGGAD IETYVFWN HE+
Sbjct: 31 VTYDHRSLVISGRRRLLISASIHYPRSVPAMWPKLVAEAKEGGADCIETYVFWNGHETAP 90
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G+Y F+ + D+V+F ++V +GL+L LRIGP+V AEWNFGG P WL IPG FRTNN P
Sbjct: 91 GKYYFEDRFDLVQFARVVKDAGLFLMLRIGPFVAAEWNFGGVPAWLHYIPGTVFRTNNEP 150
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK M+ F KIVD+M+E+ F+ QGG II+ QIENEYG + +YG GK Y WA SMA
Sbjct: 151 FKSHMKSFTTKIVDMMKEQRFFASQGGHIILAQIENEYGYYQQAYGAGGKAYAMWAGSMA 210
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
GVPW+MC+Q D P+ +I+ CN +YCD +KPNS +P +WTENW GW+ T+G P
Sbjct: 211 QAQNTGVPWIMCQQYDVPDRVINTCNSFYCDQFKPNSPTQPKIWTENWPGWFQTFGESNP 270
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
HRP ED+AF+VARFF +GGS NYY+Y GGTNF RT+GGPF TSYDYDAPIDEYGL
Sbjct: 271 HRPPEDVAFSVARFFGKGGSVQNYYVYHGGTNFDRTAGGPFITTSYDYDAPIDEYGLRRL 330
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKW HLK+LH +IKLCE +L+ +S + LG QEA VY + G C AFLANID
Sbjct: 331 PKWAHLKELHQSIKLCEHSLLFGNST-LLSLGPQQEADVYTDHSGG----CVAFLANIDS 385
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
VTF + Y LP WSVSILPDC+N VFNTAKV SQT + + +P + S P Q
Sbjct: 386 EKDRVVTFRNRQYDLPAWSVSILPDCKNVVFNTAKVRSQTLMVDM---VPGTLQASKPDQ 442
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
W E IGVW +N+F ++H+N TKD +DYLWH T V +
Sbjct: 443 -------------WSIFTERIGVWDKNDFVRNEFVDHINTTKDSTDYLWHTTSFDVDRN- 488
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYNDLIL 582
+ ++ P + IDS + F+N L GS G+ + P+ ++G N++ +
Sbjct: 489 ---YPSSGNHPVLNIDSKGHAVHAFLNNMLIGSAYGNGSESSFSAHMPINLKAGKNEIAI 545
Query: 583 LSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEE 642
LS TVGL++ G + E GAG V ++G KNG DLS W Y+VGL+GE ++ ++
Sbjct: 546 LSMTVGLKSAGPYYEWVGAGLT-SVNISGMKNGTTDLSSNNWAYKVGLEGEHYGLFKHDQ 604
Query: 643 -NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVV 701
N W ++ TWYK D P G DPV LD+ SMGKG W+NG+ IGRYW
Sbjct: 605 GNNQRWRPQSQPPKHQPLTWYKVNVDVPQGDDPVGLDMQSMGKGLVWLNGNAIGRYWPRT 664
Query: 702 APKGG-CQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
+P C +CDYRG ++ +KC CG PTQ WYHVPRSW S N LV+FEE GG+P +
Sbjct: 665 SPTNDRCTTSCDYRGKFSPNKCRVGCGKPTQRWYHVPRSWFHPSGNTLVVFEEQGGDPTK 724
Query: 761 ISVKLRSTRIVCEQVSESHYPPV--RKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSI 818
I+ R VC VSE +YP + W S S DG++ A ++ L C G ISS+
Sbjct: 725 ITFSRRVATSVCSFVSE-NYPSIDLESWDKSISDDGRV-----AAKVQLSCPKGKNISSV 778
Query: 819 EFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+FAS+G P G C+ + +G+CH P S+SVV +
Sbjct: 779 KFASFGDPSGTCRSYQQGSCHHPDSVSVVEK 809
>gi|218192153|gb|EEC74580.1| hypothetical protein OsI_10152 [Oryza sativa Indica Group]
Length = 851
Score = 793 bits (2047), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/820 (49%), Positives = 520/820 (63%), Gaps = 55/820 (6%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD +A+++DG RR+L S IHYPR+TPEMW LI K+K+GG DVI+TYVFWN HE
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G YNF+G+ D+V+F+K V +G+++ LRIGPY+C EWNFGGFPVWL+ +PGI FRT+N P
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQ----------IENEYGNMESSYGQQGK 216
FK MQ F +KIV +M+ E LF+ QGGPII+ Q IENEYG +G GK
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQASAKLCFPCHIENEYGPEGKEFGAAGK 206
Query: 217 DYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDG 276
Y+ WAA MA+GL GVPWVMCK+ DAP+ +I+ACNG+YCD + PN KPT+WTE W G
Sbjct: 207 AYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSG 266
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDA 336
W+T +GG + RPVEDLAF VARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDA
Sbjct: 267 WFTEFGGTIRQRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDA 326
Query: 337 PIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSN 396
P+DEYGL EPK+GHLK+LH A+KLCE LV+AD LG QEAHV+R S S
Sbjct: 327 PLDEYGLAREPKFGHLKELHRAVKLCEQPLVSADPT-VTTLGSMQEAHVFR-----SSSG 380
Query: 397 CSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLP 456
C+AFLAN + ++ A V F ++Y+LPPWS+SILPDC+N VFNTA V QT+
Sbjct: 381 CAAFLANYNSNSYAKVIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTN--------- 431
Query: 457 LSPNISVPQQSMIESKLSSTSKSWMTVKEPI-GVWSENNFTVQGILEHLNVTKDYSDYLW 515
Q M ++S W E + + + T G+LE LNVT+D SDYLW
Sbjct: 432 --------QMQMWAD--GASSMMWEKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLW 481
Query: 516 HITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPV 571
+IT + V + + ++T+ S L VFINGQL GS G +
Sbjct: 482 YITSVEVDPSEKFLQGGTPL--SLTVQSAGHALHVFINGQLQGSAYGTREDRKISYSGNA 539
Query: 572 EFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLK 631
++G N + LLS GL N G E G G V + G G DL+ W+YQVGLK
Sbjct: 540 NLRAGTNKVALLSVACGLPNVGVHYETWNTGVVGPVVIHGLDEGSRDLTWQTWSYQVGLK 599
Query: 632 GEFQQIYSIE-ENEAEWTD---LTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQA 687
GE + S+E EW + ++ P WY+ YFD P G +P+ALD+GSMGKGQ
Sbjct: 600 GEQMNLNSLEGSGSVEWMQGSLVAQNQQP--LAWYRAYFDTPSGDEPLALDMGSMGKGQI 657
Query: 688 WVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNL 747
W+NG IGRYWT A +G C+ C Y G+Y + KC CG PTQ WYHVPRSWLQ + NL
Sbjct: 658 WINGQSIGRYWTAYA-EGDCKG-CHYTGSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNL 715
Query: 748 LVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHL 807
LV+FEE GG+ +I++ R+ VC VSE H P ++ W + + K +HL
Sbjct: 716 LVVFEELGGDSSKIALAKRTVSGVCADVSEYH-PNIKNWQIESYGEPEFHTAK----VHL 770
Query: 808 HCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
C G IS+I+FAS+GTP G C F +G CH+ S SV+
Sbjct: 771 KCAPGQTISAIKFASFGTPLGTCGTFQQGECHSINSNSVL 810
>gi|33521214|gb|AAQ21369.1| beta-galactosidase [Sandersonia aurantiaca]
Length = 826
Score = 791 bits (2043), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/812 (50%), Positives = 519/812 (63%), Gaps = 54/812 (6%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV YD RAI I+G RR+L+S IHYPR+TPEMWPDLI K+K+GG DVI+TYVFWN HE
Sbjct: 25 NVWYDSRAITINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPS 84
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+Y F+G D+V+F+KLV GLYL LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N
Sbjct: 85 PGKYYFEGNYDLVRFIKLVQQGGLYLHLRIGPYVCAEWNFGGFPVWLKYVPGIHFRTDNE 144
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK EM++F IV++M+ E LF WQGGPII+ QIENE+G +E G K Y WAA M
Sbjct: 145 PFKAEMEKFTSHIVNMMKAEKLFHWQGGPIILSQIENEFGPLEYDQGAPAKAYAAWAAKM 204
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ L GVPWVMCK+ DAP+ +I+ NG+Y DG+ PN KP +WTENW GW+T +G +
Sbjct: 205 AVDLETGVPWVMCKEDDAPDPVINTWNGFYADGFYPNKRYKPMMWTENWTGWFTGYGVPV 264
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRPVEDLAF+VA+F Q+GGS++NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYG+L
Sbjct: 265 PHRPVEDLAFSVAKFVQKGGSYVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGMLR 324
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PK+GHL DLH AIKLCEPALV+ LG NQE++V+R+N C+AFLAN D
Sbjct: 325 QPKYGHLTDLHKAIKLCEPALVSGYPV-VTSLGNNQESNVFRSN----SGACAAFLANYD 379
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
A+VTF G Y LPPWS+SILPDC+ TVFNTA+V +QT+ Q
Sbjct: 380 TKYYATVTFNGMRYNLPPWSISILPDCKTTVFNTARVGAQTT-----------------Q 422
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
M + SW++ E + +FT G++E +++T+D +DYLW+ T YV+ D
Sbjct: 423 MQMT----TVGGFSWVSYNEDPNSIDDGSFTKLGLVEQISMTRDSTDYLWYTT--YVNID 476
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQP-------VEFQSGYN 578
+ N P +T S L VFINGQL G+ G V P V+ +G N
Sbjct: 477 QNEQFLKNGQYPVLTAQSAGHSLHVFINGQLIGTAYG---SVEDPRLTYTGNVKLFAGSN 533
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIY 638
+ LS VGL N G E G G V L G G DL+ WTY++GLKGE ++
Sbjct: 534 KISFLSIAVGLPNVGEHFETWNTGLLGPVTLNGLNEGKRDLTWQKWTYKIGLKGEALSLH 593
Query: 639 SIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
++ + EW D +R WYK +F+AP G +P+ALD+ +MGKGQ W+NG IGRY
Sbjct: 594 TLSGSSNVEWGDASRK---QPLAWYKGFFNAPGGSEPLALDMSTMGKGQVWINGQSIGRY 650
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
W +G C CDY G Y KC +NCG+ +Q WYHVPRSWL + NL+V+FEE GG
Sbjct: 651 WPAYKARGSCPK-CDYEGTYEETKCQSNCGDSSQRWYHVPRSWLNPTGNLIVVFEEWGGE 709
Query: 758 PFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISS 817
P IS+ RS R C VS+ P + W Y+ ++HL C G ++
Sbjct: 710 PTGISLVKRSMRSACAYVSQGQ-PSMNNWHTKYA----------ESKVHLSCDPGLKMTQ 758
Query: 818 IEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
I+FASYGTPQG C+ +S G CHA S + +
Sbjct: 759 IKFASYGTPQGACESYSEGRCHAHKSYDIFQK 790
>gi|12583687|dbj|BAB21492.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 731
Score = 791 bits (2042), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/754 (52%), Positives = 509/754 (67%), Gaps = 41/754 (5%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
M +++ SC+ S+++++ VSYDH+AIII+G +R+LIS IHYPR+TPEMWPDLI
Sbjct: 8 MWSILLLFSCIFSAASAS------VSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLI 61
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K+K+GG DVI+TYVFWN HE G+Y F+ + D+VKF+KLV +GL++ LRIGPYVCAE
Sbjct: 62 QKAKDGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAE 121
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
WNFGGFPVWL+ +PGI FRT+N PFK MQ+F +KIV +M+ E LF QGGPII+ QIEN
Sbjct: 122 WNFGGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQTQGGPIILSQIEN 181
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPN 262
E+G +E G GK Y KWAA MA+GL GVPW+MCKQ DAP+ +ID CNG+YC+ +KPN
Sbjct: 182 EFGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPN 241
Query: 263 SYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRT 322
KP +WTE W GWYT +GG +P RP ED+AF+VARF Q GGSF+NYYMY GGTNFGRT
Sbjct: 242 KDYKPKMWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQSGGSFLNYYMYHGGTNFGRT 301
Query: 323 SGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQE 382
+GGPF TSYDYDAP+DEYGLL EPKWGHL+DLH AIK CE ALV+ D + KLG NQE
Sbjct: 302 AGGPFMATSYDYDAPLDEYGLLREPKWGHLRDLHKAIKSCESALVSVDPS-VTKLGSNQE 360
Query: 383 AHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKV 442
AHV++ S+S+C+AFLAN D + V+F G Y LPPWS+SILPDC+ V++TAKV
Sbjct: 361 AHVFK-----SESDCAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYSTAKV 415
Query: 443 SSQTSIKTVEFSLPLSP-NISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGIL 501
SQ+S + ++P + P QS IE SS T+ G+
Sbjct: 416 GSQSS------QVQMTPVHSGFPWQSFIEETTSSDETD--------------TTTLDGLY 455
Query: 502 EHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVI 561
E +N+T+D +DYLW++T I + D+ +F K N P +TI S L VFINGQL+G+V
Sbjct: 456 EQINITRDTTDYLWYMTDITIGSDE-AFLK-NGKSPLLTIFSAGHALNVFINGQLSGTVY 513
Query: 562 GHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI 617
G + Q V +SG N L LLS +VGL N G E AG G + L G +G
Sbjct: 514 GSLENPKLSFSQNVNLRSGINKLALLSISVGLPNVGTHFETWNAGVLGPITLKGLNSGTW 573
Query: 618 DLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVA 676
D+S WTY+ GLKGE ++++ + EW + TWYK F+AP G P+A
Sbjct: 574 DMSGWKWTYKTGLKGEALGLHTVTGSSSVEWVEGPSMAKKQPLTWYKATFNAPPGDAPLA 633
Query: 677 LDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHV 736
LD+GSMGKGQ W+NG +GR+W +G C D C Y G Y+ KC T+CG P+Q WYH+
Sbjct: 634 LDMGSMGKGQIWINGQSVGRHWPGYIARGSCGD-CSYAGTYDDKKCRTHCGEPSQRWYHI 692
Query: 737 PRSWLQASNNLLVIFEETGGNPFEISVKLRSTRI 770
PRSWL + NLLV+FEE GG+P IS+ R T +
Sbjct: 693 PRSWLTPNGNLLVVFEEWGGDPSRISLVERGTAL 726
>gi|84579369|dbj|BAE72073.1| pear beta-galactosidase1 [Pyrus communis]
Length = 731
Score = 790 bits (2041), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/759 (52%), Positives = 511/759 (67%), Gaps = 41/759 (5%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
M +++ SC+ S+++++ VSYDH+AIII+G +R+LIS IHYPR+TPEMWPDLI
Sbjct: 8 MWSILLLFSCIFSAASAS------VSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLI 61
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K+K+GG DVI+TYVFWN HE G+Y F+ + D+VKF+KLV +GL++ LRIGPYVCAE
Sbjct: 62 QKAKDGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAE 121
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
WNFGGFPVWL+ +PGI FRT+N PFK MQ+F +KIV +M+ E LF QGGPII+ QIEN
Sbjct: 122 WNFGGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQSQGGPIILSQIEN 181
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPN 262
E+G +E G GK Y KWAA MA+GL GVPW+MCKQ DAP+ +ID CNG+YC+ +KPN
Sbjct: 182 EFGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPN 241
Query: 263 SYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRT 322
KP +WTE W GWYT +GG +P RP ED+AF+VARF Q GGSF+NYYMY GGTNFGRT
Sbjct: 242 KDYKPKMWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQSGGSFLNYYMYHGGTNFGRT 301
Query: 323 SGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQE 382
+GGPF TSYDYDAP+DEYGL EPKWGHL+DLH AIK CE ALV+ D + KLG NQE
Sbjct: 302 AGGPFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKPCESALVSVDPS-VTKLGSNQE 360
Query: 383 AHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKV 442
AHV++ S+S+C+AFLAN D + V+F G Y LPPWS+SILPDC+ V+NTAKV
Sbjct: 361 AHVFK-----SESDCAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKV 415
Query: 443 SSQTSIKTVEFSLPLSP-NISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGIL 501
SQ+S + ++P + P QS IE SS T+ G+
Sbjct: 416 GSQSS------QVQMTPVHSGFPWQSFIEETTSSDETD--------------TTTLDGLY 455
Query: 502 EHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVI 561
E +N+T+D +DYLW++T I + D+ +F K N P +TI S L VFINGQL+G+V
Sbjct: 456 EQINITRDTTDYLWYMTDITIGSDE-AFLK-NGKSPLLTISSAGHALNVFINGQLSGTVY 513
Query: 562 GHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI 617
G + Q V +SG N L LLS +VGL N G E AG G + L G +G
Sbjct: 514 GSLENPKLSFSQNVNLRSGINKLALLSISVGLPNVGTHFETWNAGVLGPITLKGLNSGTW 573
Query: 618 DLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVA 676
D+S WTY+ GLKGE ++++ + EW + TWYK F+AP G P+A
Sbjct: 574 DMSGWKWTYKTGLKGEALGLHTVTGSSSVEWVEGPSMAKKQPLTWYKATFNAPPGDAPLA 633
Query: 677 LDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHV 736
LD+GSMGKGQ W+NG +GR+W +G C D C Y G Y+ KC T+CG P+Q WYH+
Sbjct: 634 LDMGSMGKGQIWINGQSVGRHWPGYIARGSCGD-CSYAGTYDDKKCRTHCGEPSQRWYHI 692
Query: 737 PRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQV 775
PRSWL + NLLV+FEE GG+P IS+ R T + +++
Sbjct: 693 PRSWLTPTGNLLVVFEEWGGDPSGISLVERGTALDAKKL 731
>gi|1352078|sp|P48981.1|BGAL_MALDO RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; AltName:
Full=Exo-(1-->4)-beta-D-galactanase; Flags: Precursor
gi|507278|gb|AAA62324.1| b-galactosidase-related protein; putative [Malus x domestica]
Length = 731
Score = 788 bits (2036), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/759 (52%), Positives = 509/759 (67%), Gaps = 41/759 (5%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
M +++ SC+ S+++++ VSYDH+AIII+G +R+LIS IHYPR+TPEMWPDLI
Sbjct: 8 MWSILLLFSCIFSAASAS------VSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLI 61
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K+K+GG DVI+TYVFWN HE G Y F+ + D+VKF+KLV GL++ LRIGPYVCAE
Sbjct: 62 QKAKDGGLDVIQTYVFWNGHEPSPGNYYFEERYDLVKFIKLVQQEGLFVNLRIGPYVCAE 121
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
WNFGGFPVWL+ +PGI FRT+N PFK MQ+F +KIV +M+ E LF QGGPII+ QIEN
Sbjct: 122 WNFGGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQTQGGPIILSQIEN 181
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPN 262
E+G +E G GK Y KWAA MA+GL GVPW+MCKQ DAP+ +ID CNG+YC+ +KPN
Sbjct: 182 EFGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPN 241
Query: 263 SYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRT 322
KP +WTE W GWYT +GG +P RP ED+AF+VARF Q GGSF+NYYMY GGTNFGRT
Sbjct: 242 KDYKPKMWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQSGGSFLNYYMYHGGTNFGRT 301
Query: 323 SGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQE 382
+GGPF TSYDYDAP+DEYGL EPKWGHL+DLH AIK CE ALV+ D + KLG NQE
Sbjct: 302 AGGPFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKSCESALVSVDPS-VTKLGSNQE 360
Query: 383 AHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKV 442
AHV++ S+S+C+AFLAN D + V+F G Y LPPWS+SILPDC+ V+NTAKV
Sbjct: 361 AHVFK-----SESDCAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKV 415
Query: 443 SSQTSIKTVEFSLPLSP-NISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGIL 501
SQ+S + ++P + P QS IE SS T+ G+
Sbjct: 416 GSQSS------QVQMTPVHSGFPWQSFIEETTSSDETD--------------TTTLDGLY 455
Query: 502 EHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVI 561
E +N+T+D +DYLW++T I + D+ +F K N P +TI S L VFINGQL+G+V
Sbjct: 456 EQINITRDTTDYLWYMTDITIGSDE-AFLK-NGKSPLLTIFSAGHALNVFINGQLSGTVY 513
Query: 562 GHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI 617
G + Q V +SG N L LLS +VGL N G E AG G + L G +G
Sbjct: 514 GSLENPKLSFSQNVNLRSGINKLALLSISVGLPNVGTHFETWNAGVLGPITLKGLNSGTW 573
Query: 618 DLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVA 676
D+S WTY+ GLKGE ++++ + EW + TWYK F+AP G P+A
Sbjct: 574 DMSGWKWTYKTGLKGEALGLHTVTGSSSVEWVEGPSMAEKQPLTWYKATFNAPPGDAPLA 633
Query: 677 LDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHV 736
LD+GSMGKGQ W+NG +GR+W +G C D C Y G Y+ KC T+CG P+Q WYH+
Sbjct: 634 LDMGSMGKGQIWINGQSVGRHWPGYIARGSCGD-CSYAGTYDDKKCRTHCGEPSQRWYHI 692
Query: 737 PRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQV 775
PRSWL + NLLV+FEE GG+P IS+ R T + +++
Sbjct: 693 PRSWLTPTGNLLVVFEEWGGDPSRISLVERGTALDAKKL 731
>gi|51507377|emb|CAH18936.1| beta-galactosidase [Pyrus communis]
Length = 724
Score = 788 bits (2036), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/759 (52%), Positives = 510/759 (67%), Gaps = 41/759 (5%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
M +++ SC+ S+++++ VSYDH+AIII+G +R+LIS IHYPR+TPEMWPDLI
Sbjct: 1 MWSILLLFSCIFSAASAS------VSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLI 54
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K+K+GG DVI+TYVFWN HE G+Y F+ + D+VKF+KLV +GL++ LRIGPYVCAE
Sbjct: 55 QKAKDGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAE 114
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
WNFGGFPVWL+ +PGI FRT+N PFK MQ+F +KIV +M+ E LF QGGPII+ QIEN
Sbjct: 115 WNFGGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQSQGGPIILSQIEN 174
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPN 262
E+G +E G GK Y KWAA MA+GL GVPW+MCKQ DAP+ +ID CNG+YC+ +KPN
Sbjct: 175 EFGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPN 234
Query: 263 SYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRT 322
KP +WTE W GWYT +GG +P RP ED+AF+VARF Q GGSF+NYYMY GGTNFGRT
Sbjct: 235 KDYKPKMWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQSGGSFLNYYMYHGGTNFGRT 294
Query: 323 SGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQE 382
+GGPF TSYDYDAP+DEYGL EPKWGHL+DLH AIK CE ALV+ D + KLG NQE
Sbjct: 295 AGGPFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKPCESALVSVDPS-VTKLGSNQE 353
Query: 383 AHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKV 442
AHV++ S+S+C+AFLAN D + V+F G Y LPPWS+SILPDC+ V+NTAKV
Sbjct: 354 AHVFK-----SESDCAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKV 408
Query: 443 SSQTSIKTVEFSLPLSP-NISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGIL 501
SQ+S + ++P + P QS IE SS + G+
Sbjct: 409 GSQSS------QVQMTPVHSGFPWQSFIEETTSSDETD--------------TTYMDGLY 448
Query: 502 EHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVI 561
E +N+T+D +DYLW++T I + D+ +F K N P +TI S L VFINGQL+G+V
Sbjct: 449 EQINITRDTTDYLWYMTDITIGSDE-AFLK-NGKSPLLTISSAGHALNVFINGQLSGTVY 506
Query: 562 GHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI 617
G + Q V +SG N L LLS +VGL N G E AG G + L G +G
Sbjct: 507 GSLENPKLSFSQNVNLRSGINKLALLSISVGLPNVGTHFETWNAGVLGPITLKGLNSGTW 566
Query: 618 DLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVA 676
D+S WTY+ GLKGE ++++ + EW + TW+K F+AP G P+A
Sbjct: 567 DMSGWKWTYKTGLKGEALGLHTVTGSSSVEWVEGPSMAKKQPLTWHKATFNAPPGDAPLA 626
Query: 677 LDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHV 736
LD+GSMGKGQ W+NG +GR+W +G C D C Y G Y+ KC T+CG P+Q WYH+
Sbjct: 627 LDMGSMGKGQIWINGQSVGRHWPGYIARGSCGD-CSYAGTYDDKKCRTHCGEPSQRWYHI 685
Query: 737 PRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQV 775
PRSWL + NLLV+FEE GG+P IS+ R T + +++
Sbjct: 686 PRSWLTPTGNLLVVFEEWGGDPSGISLVERGTALDAKKL 724
>gi|449464712|ref|XP_004150073.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 848
Score = 787 bits (2033), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/811 (49%), Positives = 524/811 (64%), Gaps = 38/811 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YD +A+II+G R++L S IHYPR+ P+MW LI K+K GG DV++TYVFWN HE
Sbjct: 29 NVTYDGKALIINGQRKILFSGSIHYPRSVPDMWESLIEKAKMGGLDVVDTYVFWNLHEPS 88
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G Y+F+G+ND+VKF+KLV +GLY+ LRIGPY+C EWNFGGFP WL+ +PGI FRT+N
Sbjct: 89 PGIYDFEGRNDLVKFIKLVEKAGLYVHLRIGPYICGEWNFGGFPAWLKFVPGISFRTDNE 148
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK M +F KKIV +M++E LF QGGPII+ QIENEY + +G+ G Y+ WAA M
Sbjct: 149 PFKLAMAKFTKKIVQMMKDERLFQSQGGPIILSQIENEYETEDKVFGEAGFAYMNWAAKM 208
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ + GVPWVMCKQ DAP+ +I+ CNG+YCD + PN KP WTE W W+ +GG
Sbjct: 209 AVQMDTGVPWVMCKQDDAPDPMINTCNGFYCDYFSPNKPYKPNFWTEAWTAWFNNFGGPN 268
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
RPVEDLAF VARF Q+GGS +NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGL+
Sbjct: 269 HKRPVEDLAFGVARFIQKGGSLVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIR 328
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PK+GHLK LH A+KLCE AL+ + Y L Q+A V+ + S +C+AFL+N
Sbjct: 329 QPKFGHLKRLHDAVKLCEKALLTGEPHDYT-LATYQKAKVFSS----SSGDCAAFLSNYH 383
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ A VTF G+ YTLPPWS+SILPDC++ ++NTA+V QT+ Q
Sbjct: 384 SNNTARVTFNGRHYTLPPWSISILPDCKSVIYNTAQVQVQTN-----------------Q 426
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSEN-NFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
S + +K+ S SW T E I E+ + + G+LE L +TKD SDYLW+ T + V D
Sbjct: 427 LSFLPTKVESF--SWETYNENISSIEEDSSMSYDGLLEQLTITKDNSDYLWYTTSVNV-D 483
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDL 580
+ S+ + + PT+T S + VFING+L GS G + Q+G N +
Sbjct: 484 PNESYLRGGKF-PTLTATSKGHGMHVFINGKLAGSSFGTHDNSKFTFTGRINLQAGVNKV 542
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS GL N G E+ G G V + G G +DLS+ W+Y+VGLKGE + S
Sbjct: 543 SLLSIAGGLPNNGPHYEEREMGVLGPVAIHGLDKGKMDLSRQKWSYKVGLKGENMNLGSP 602
Query: 641 EENEA-EWT-DLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
+A +W D + TWYK YFDAP+G +P+ALD+GSM KGQ W+NG ++GRYW
Sbjct: 603 SSVQAVDWAKDSLKQENAQPLTWYKAYFDAPEGDEPLALDMGSMQKGQVWINGQNVGRYW 662
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
T+ A G C D C Y G Y KC CG PTQ WYHVPRSWL + NL+V+FEE GGNP
Sbjct: 663 TITA-NGNCTD-CSYSGTYRPRKCQFGCGQPTQQWYHVPRSWLMPTKNLIVVFEEVGGNP 720
Query: 759 FEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSI 818
IS+ RS +C + S+ Y PV K + + +G+L+ + +++LHC G IS+I
Sbjct: 721 SRISLVKRSVTSICTEASQ--YRPVIKNVHMHQNNGELNEQNVL-KINLHCAAGQFISAI 777
Query: 819 EFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+FAS+GTP G C +G CH+P S V+ +
Sbjct: 778 KFASFGTPSGACGSHKQGTCHSPKSDYVLQK 808
>gi|448278449|gb|AGE44111.1| beta-galactosidase 101 [Malus x domestica]
Length = 725
Score = 783 bits (2021), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/750 (52%), Positives = 499/750 (66%), Gaps = 41/750 (5%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
M +++ LSC+ S+++++ V YDH+AIII+G RR+LIS IHYPR+TPEMWPDLI
Sbjct: 8 MWSILLLLSCIFSAASAS------VGYDHKAIIINGQRRILISGSIHYPRSTPEMWPDLI 61
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K+K GG DVI+TYVFWN HE G+Y F+ + D+VKF+KLV +GL++ LRIGPYVCAE
Sbjct: 62 QKAKAGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAE 121
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
WNFGGFP+WL+ +PGI FRT+N PFK MQ+F +KIV++M+ E LF +GGPII+ QIEN
Sbjct: 122 WNFGGFPIWLKYVPGIAFRTDNEPFKAAMQKFTEKIVNMMKAEKLFQTEGGPIILSQIEN 181
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPN 262
EYG +E G GK Y KWAA MA+GL GVPW+MCKQ DAP+ +ID CNGYYC+ +KPN
Sbjct: 182 EYGPVEWEIGAPGKAYTKWAAQMAVGLNTGVPWIMCKQEDAPDPVIDTCNGYYCENFKPN 241
Query: 263 SYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRT 322
KP +WTE W GWYT +GG +P RPVEDLAF+VARF Q GGSF NYYMY GGTNFGRT
Sbjct: 242 KVYKPKMWTEVWTGWYTEFGGAIPTRPVEDLAFSVARFIQSGGSFFNYYMYHGGTNFGRT 301
Query: 323 SGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQE 382
+GGPF TSYDYDAP+DEYGLL +PKWGHLKDLH AIK CE ALVA D + KLG NQE
Sbjct: 302 AGGPFMATSYDYDAPLDEYGLLQQPKWGHLKDLHKAIKSCEYALVAVDPS-VTKLGNNQE 360
Query: 383 AHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKV 442
AHV+ ++S C+AFLAN D V+F Y LPPWS+SILPDC+ VFNTAKV
Sbjct: 361 AHVFN-----TKSGCAAFLANYDTKYPVRVSFGQGQYDLPPWSISILPDCKTAVFNTAKV 415
Query: 443 SSQTSIKTVEFSLPLSPNIS-VPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGIL 501
+ +TS + + P S +P QS IE +S T+ G+
Sbjct: 416 TWKTS------QVQMKPVYSRLPWQSFIEETTTSD--------------ESGTTTLDGLY 455
Query: 502 EHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVI 561
E + +T+D +DYLW++T I + D+ N P +TI S L VFINGQL+G+V
Sbjct: 456 EQIYMTRDATDYLWYMTDITIGSDEAFL--NNGKFPLLTIFSACHALHVFINGQLSGTVY 513
Query: 562 GHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI 617
G + Q V+ + G N L LLS +VGL N G E AG G + L G G
Sbjct: 514 GSLENPKLTFSQNVKLRPGINKLALLSISVGLPNVGTHFETWNAGVLGPISLKGLNTGTW 573
Query: 618 DLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVA 676
D+S+ WTY++G+KGE ++++ + +W + TWYK F+AP G P+A
Sbjct: 574 DMSRWKWTYKIGMKGEALGLHTVTGSSSVDWAEGPSMAKKQPLTWYKATFNAPPGHAPLA 633
Query: 677 LDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHV 736
LD+GSMGKGQ W+NG +GR+W +G C TC+Y G + KC T CG P+Q WYH+
Sbjct: 634 LDMGSMGKGQIWINGQSVGRHWPGYIAQGSC-GTCNYAGTFYDKKCRTYCGKPSQRWYHI 692
Query: 737 PRSWLQASNNLLVIFEETGGNPFEISVKLR 766
PRSWL + NLLV+FEE GG+P +S+ R
Sbjct: 693 PRSWLTPTGNLLVVFEEWGGDPQWMSLVER 722
>gi|3299896|gb|AAC25984.1| beta-galactosidase [Solanum lycopersicum]
Length = 724
Score = 778 bits (2009), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/728 (53%), Positives = 494/728 (67%), Gaps = 34/728 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYD RAIII+G R++LIS IHYPR+TP+MWPDLI K+K+GG DVIETYVFWN HE
Sbjct: 24 SVSYDDRAIIINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGHEPS 83
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+YNF+G+ D+V+F+K+V +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PG+EFRTNN
Sbjct: 84 PGKYNFEGRYDLVRFIKMVQRAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGMEFRTNNQ 143
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ FV+KIV++M+ E LF QGGPIIM QIENEYG +E G GK Y KWAA M
Sbjct: 144 PFKVAMQGFVQKIVNMMKSENLFESQGGPIIMAQIENEYGPVEWEIGAPGKAYTKWAAQM 203
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL GVPW+MCKQ DAP+ +ID CNG+YC+G++PN KP +WTE W GWYT +GG +
Sbjct: 204 AVGLKTGVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKPYKPKMWTEVWTGWYTKFGGPI 263
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P RP ED+AF+VARF Q GSF NYYMY GGTNFGRTS G F TSYDYDAP+DEYGLL+
Sbjct: 264 PQRPAEDIAFSVARFVQNNGSFFNYYMYHGGTNFGRTSSGLFIATSYDYDAPLDEYGLLN 323
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPK+GHL+DLH AIKL EPALV++ +A LG NQEAHVYR+ C+AFL+N D
Sbjct: 324 EPKYGHLRDLHKAIKLSEPALVSSYAA-VTSLGSNQEAHVYRS----KSGACAAFLSNYD 378
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ VTF + Y LPPWS+SILPDC+ V+NTA+V+SQ+S S+ ++P
Sbjct: 379 SRYSVKVTFQNRPYNLPPWSISILPDCKTAVYNTAQVNSQSS------SIKMTP------ 426
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENN-FTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
+ SW + E +++ T G+ E NVT+D SDYLW++T + ++
Sbjct: 427 --------AGGGLSWQSYNEETPTADDSDTLTANGLWEQKNVTRDSSDYLWYMTNVNIAS 478
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDL 580
++ F K N P +T+ S VL VF+NG+L+G+V G + V+ ++G N +
Sbjct: 479 NE-GFLK-NGKDPYLTVMSAGHVLHVFVNGKLSGTVYGTLDNPKLTYSGNVKLRAGINKI 536
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE-FQQIYS 639
LLS +VGL N G + AG G V L+G G +L+K W+Y+VGLKGE
Sbjct: 537 SLLSVSVGLPNVGVHYDTWNAGVLGPVTLSGLNEGSRNLAKQKWSYKVGLKGESLSLHSL 596
Query: 640 IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
+ EW + TWYK F+AP G DP+ALD+ SMGKGQ W+NG +GR+W
Sbjct: 597 SGSSSVEWVRGSLMAQKQPLTWYKATFNAPGGNDPLALDMASMGKGQIWINGEGVGRHWP 656
Query: 700 VVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPF 759
+G C C Y G +N KC TNCG P+Q WYHVPRSWL+ S NLLV+FEE GGNP
Sbjct: 657 GYIAQGDCSK-CSYAGTFNEKKCQTNCGQPSQRWYHVPRSWLKPSGNLLVVFEEWGGNPT 715
Query: 760 EISVKLRS 767
IS+ RS
Sbjct: 716 GISLVRRS 723
>gi|449435860|ref|XP_004135712.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 723
Score = 777 bits (2007), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/758 (51%), Positives = 501/758 (66%), Gaps = 42/758 (5%)
Query: 16 LSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATP 75
L + +M++ + + L SS AS V+YDH+A++IDG RR+LIS IHYPR+TP
Sbjct: 2 LKMSKIMVVFLGLVLWVCSSVMAS-------VTYDHKALVIDGKRRILISGSIHYPRSTP 54
Query: 76 EMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRI 135
+MWPDLI K+K+GG DVIETYVFWN HE GQY F+ + ++V+FVKLV +GLY+ LRI
Sbjct: 55 QMWPDLIQKAKDGGLDVIETYVFWNGHEPSPGQYYFEDRYELVRFVKLVQQAGLYVHLRI 114
Query: 136 GPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPI 195
GPYVCAEWNFGGFPVWL+ +PGI FRT+N PFK MQ+F KIV +M+ E L+ QGGPI
Sbjct: 115 GPYVCAEWNFGGFPVWLKYVPGIAFRTDNGPFKAAMQKFTAKIVSMMKGEKLYHSQGGPI 174
Query: 196 IMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYY 255
I+ QIENEYG +E G GK Y KWAA MALGL GVPWVMCKQ DAP+ +ID CNG+Y
Sbjct: 175 ILSQIENEYGPVEWEIGAPGKSYTKWAAQMALGLDTGVPWVMCKQEDAPDPMIDTCNGFY 234
Query: 256 CDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFG 315
C+ ++PN KP +WTE W GW+T +GG +P+RPVEDLA+AVARF Q GS +NYYMY G
Sbjct: 235 CENFEPNKAYKPKMWTEAWTGWFTEFGGPVPYRPVEDLAYAVARFIQNRGSLINYYMYHG 294
Query: 316 GTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYI 375
GTNFGRT+GGPF TSYDYDAPIDEYGL+ +PKWGHL+DLH AIKLCEPALV+ D
Sbjct: 295 GTNFGRTAGGPFIATSYDYDAPIDEYGLIRQPKWGHLRDLHKAIKLCEPALVSVDPT-VS 353
Query: 376 KLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNT 435
LG QEAHVY C+AFLAN D T+ VTF Y LPPWSVSILPDC+
Sbjct: 354 SLGSKQEAHVYNTR----SGECAAFLANYDPSTSVRVTFGNHPYDLPPWSVSILPDCKTV 409
Query: 436 VFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIG-VWSENN 494
VFNTAKV++ + P+ + I +S SW + E ++++
Sbjct: 410 VFNTAKVNAPSYW---------------PKMTPI------SSFSWHSYNEETASAYADDT 448
Query: 495 FTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFING 554
T+ G++E +++T+D +DYLW++T I + D + F K+ + P +TI S L VFING
Sbjct: 449 TTMAGLVEQISITRDATDYLWYMTDIRI-DSNEGFLKSGQ-WPLLTIFSAGHALHVFING 506
Query: 555 QLTGSVIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLT 610
QL+G+V G + + V + G N L +LS VGL N G E AG G V L
Sbjct: 507 QLSGTVYGGLDNPKLTFSKYVNLRPGVNKLSMLSVAVGLPNVGVHFETWNAGILGPVTLK 566
Query: 611 GFKNGDIDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAP 669
G G D+S W+Y+VGLKGE ++++ + EW + TWYKT F+AP
Sbjct: 567 GLNEGTRDMSGYKWSYKVGLKGEALNLHTVSGSSSVEWMTGSLVSQKQPLTWYKTTFNAP 626
Query: 670 DGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNP 729
G +P+ALD+GSMGKGQ W+NG IGR+W +G C C Y G + KC +CG P
Sbjct: 627 GGNEPLALDMGSMGKGQVWINGESIGRHWPAYTARGSC-GKCYYGGIFTEKKCHFSCGEP 685
Query: 730 TQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRS 767
+Q WYHVPR+WL+ S N+LVIFEE GGNP IS+ RS
Sbjct: 686 SQRWYHVPRAWLKPSGNILVIFEEWGGNPDGISLVKRS 723
>gi|297816572|ref|XP_002876169.1| AT3g52840/F8J2_10 [Arabidopsis lyrata subsp. lyrata]
gi|297322007|gb|EFH52428.1| AT3g52840/F8J2_10 [Arabidopsis lyrata subsp. lyrata]
Length = 728
Score = 776 bits (2005), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/732 (53%), Positives = 492/732 (67%), Gaps = 41/732 (5%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YDH+A+II+G RR+LIS IHYPR+TPEMWPDLI K+KEGG DVI+TYVFWN HE
Sbjct: 29 VTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKAKEGGLDVIQTYVFWNGHEPSP 88
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G Y F+ + D+VKF KLV +GLYL LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N P
Sbjct: 89 GNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNFGGFPVWLKYVPGIVFRTDNEP 148
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQRF KKIVD+M+EE LF QGGPII+ QIENEYG ME G GK Y KW A MA
Sbjct: 149 FKIAMQRFTKKIVDMMKEEKLFETQGGPIILSQIENEYGPMEWEMGAAGKAYSKWTAEMA 208
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
LGL GVPW+MCKQ DAP IID CNG+YC+G+KPNS NKP LWTENW GW+T +GG +P
Sbjct: 209 LGLSTGVPWIMCKQEDAPYPIIDTCNGFYCEGFKPNSDNKPKLWTENWTGWFTEFGGAIP 268
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
+RPVED+AF+VARF Q GGSF+NYYMY+GGTNF RT+ G F TSYDYDAP+DEYGLL E
Sbjct: 269 NRPVEDIAFSVARFIQNGGSFLNYYMYYGGTNFDRTA-GVFIATSYDYDAPLDEYGLLRE 327
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PK+ HLK+LH IKLCEPALV+ D LG QE HV++ S+++C+AFL+N D
Sbjct: 328 PKYSHLKELHKVIKLCEPALVSVDPT-ITSLGDKQEVHVFK-----SKTSCAAFLSNYDT 381
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+AA + F G Y LPPWSVSILPDC+ +NTAK+ + T + + VP
Sbjct: 382 SSAARIMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRAPTILMKM-----------VP-- 428
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENN---FTVQGILEHLNVTKDYSDYLWHITQIYVS 523
+ST SW + E G S N+ F G++E +++T+D +DY W++T I +
Sbjct: 429 -------TSTKFSWESYNE--GSPSSNDDGTFVKDGLVEQISMTRDKTDYFWYLTDITIG 479
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYND 579
D+ SF KT + P +TI S L VF+NG L G+ G + Q ++ G N
Sbjct: 480 SDE-SFLKTGD-DPLLTIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQKIKLSVGINK 537
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS 639
L LLS VGL N G E G G V L G +G D+SK W+Y++G++GE ++
Sbjct: 538 LALLSTAVGLPNAGVHYETWNTGVLGPVTLKGVNSGTWDMSKWKWSYKIGIRGEAMSFHT 597
Query: 640 IEENEAE--WTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
I + A W + TWYK+ FD P G +P+ALD+ +MGKGQ WVNGH+IGR+
Sbjct: 598 IAGSSAVKWWIKGSFVVKKEPLTWYKSSFDTPKGNEPLALDMNTMGKGQVWVNGHNIGRH 657
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
W +G C C+Y G YN KC ++CG P+Q WYHVPRSWL+ NLLVIFEE GG+
Sbjct: 658 WPAYTARGNC-GRCNYAGIYNEKKCLSHCGEPSQRWYHVPRSWLKPFGNLLVIFEEWGGD 716
Query: 758 PFEISVKLRSTR 769
P IS+ R+ +
Sbjct: 717 PSGISLVKRTAK 728
>gi|414864994|tpg|DAA43551.1| TPA: beta-galactosidase [Zea mays]
Length = 897
Score = 775 bits (2002), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/864 (47%), Positives = 519/864 (60%), Gaps = 97/864 (11%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPE------------------------------- 76
+YD +A++IDG RR+L S IHYPR+TP+
Sbjct: 30 TYDKKAVLIDGQRRILFSGSIHYPRSTPDVISCILQNLSFFFSPLLPRGGGEFMAVVSCV 89
Query: 77 ---------------------MWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
MW LI K+K+GG DVI+TYVFWN HE G Y F+ +
Sbjct: 90 LDAMLSKANCFPTLAVPLYSTMWEGLIQKAKDGGLDVIQTYVFWNGHEPTPGNYYFEERY 149
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+V+FVK V +GL++ LRIGPY+C EWNFGGFPVWL+ +PGI FRT+N PFK MQ F
Sbjct: 150 DLVRFVKTVQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEPFKTAMQGFT 209
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
+KIV +M+ E LF+ QGGPII+ QIENEYG +G G+ Y+ WAA MA+GL GVPW
Sbjct: 210 EKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGQAYINWAAKMAVGLDTGVPW 269
Query: 236 VMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAF 295
VMCK+ DAP+ +I+ACNG+YCD + PN KPT+WTE W GW+T +GG + RPVEDLAF
Sbjct: 270 VMCKEEDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAF 329
Query: 296 AVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDL 355
AVARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGL+ EPK HLK+L
Sbjct: 330 AVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIREPKHSHLKEL 389
Query: 356 HAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFL 415
H A+KLCE ALV+ D LG QEAHV+R S S C+AFLAN + ++ A V F
Sbjct: 390 HRAVKLCEQALVSVDPT-ITTLGTMQEAHVFR-----SPSGCAAFLANYNSNSHAKVVFN 443
Query: 416 GQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSS 475
+ Y+LPPWS+SILPDC+N VFN+A V QTS Q M +
Sbjct: 444 NEQYSLPPWSISILPDCKNVVFNSATVGVQTS-----------------QMQMWGD--GA 484
Query: 476 TSKSWMTVKEPI-GVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNE 534
TS W E + + + T G+LE LNVT+D SDYLW+IT + +S + +F +
Sbjct: 485 TSMMWERYDEEVDSLAAAPLLTTTGLLEQLNVTRDSSDYLWYITSVDISPSE-NFLQGGG 543
Query: 535 VRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLILLSQTVGLQ 590
P++++ S L VF+NGQL GS G +K V ++G N + LLS GL
Sbjct: 544 KPPSLSVQSAGHALHVFVNGQLQGSSYGTREDRRIKYNGNVNLRAGTNKIALLSVACGLP 603
Query: 591 NYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTD 649
N G E G G V L G G DL+ W+YQVGLKGE + S+E EW
Sbjct: 604 NVGVHYETWNTGVGGPVVLHGLNEGSRDLTWQTWSYQVGLKGEQMNLNSVEGSGSVEWMQ 663
Query: 650 ---LTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGG 706
+ + P WYK YF+ P G +P+ALD+GSMGKGQ W+NG IGRYWT A G
Sbjct: 664 GSLIAQKQQP--LAWYKAYFETPSGDEPLALDMGSMGKGQVWINGQSIGRYWTAYA-DGD 720
Query: 707 CQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEET-GGNPFEISVKL 765
C+ C Y G + + KC CG PTQ WYHVPRSWLQ S NLLV+ EE GG+ +I++
Sbjct: 721 CKG-CSYTGTFRAPKCQAGCGQPTQRWYHVPRSWLQPSRNLLVVLEELGGGDSSKIALAK 779
Query: 766 RSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGT 825
RS VC VSE H P ++KW ++ ++HL C G IS+I FAS+GT
Sbjct: 780 RSVSSVCADVSEDH-PNIKKW----QIESYGEREHRRAKVHLRCAHGQSISAIRFASFGT 834
Query: 826 PQGRCQKFSRGNCHAPMSLSVVSE 849
P G C F +G CH+ S +V+ +
Sbjct: 835 PVGTCGNFQQGGCHSASSHAVLEK 858
>gi|449489943|ref|XP_004158465.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 1225
Score = 775 bits (2002), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/785 (50%), Positives = 509/785 (64%), Gaps = 45/785 (5%)
Query: 16 LSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATP 75
L + +M++ + + L SS AS V+YDH+A++IDG RR+LIS IHYPR+TP
Sbjct: 2 LKMSKIMVVFLGLVLWVCSSVMAS-------VTYDHKALVIDGKRRILISGSIHYPRSTP 54
Query: 76 EMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRI 135
+MWPDLI K+K+GG DVIETYVFWN HE GQY F+ + ++V+FVKLV +GLY+ LRI
Sbjct: 55 QMWPDLIQKAKDGGLDVIETYVFWNGHEPSPGQYYFEDRYELVRFVKLVQQAGLYVHLRI 114
Query: 136 GPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPI 195
GPYVCAEWNFGGFPVWL+ +PGI FRT+N PFK MQ+F KIV +M+ E L+ QGGPI
Sbjct: 115 GPYVCAEWNFGGFPVWLKYVPGIAFRTDNGPFKAAMQKFTAKIVSMMKGEKLYHSQGGPI 174
Query: 196 IMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYY 255
I+ QIENEYG +E G GK Y KWAA MALGL GVPWVMCKQ DAP+ +ID CNG+Y
Sbjct: 175 ILSQIENEYGPVEWEIGAPGKSYTKWAAQMALGLDTGVPWVMCKQEDAPDPMIDTCNGFY 234
Query: 256 CDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFG 315
C+ ++PN KP +WTE W GW+T +GG +P+RPVEDLA+AVARF Q GS +NYYMY G
Sbjct: 235 CENFEPNKAYKPKMWTEAWTGWFTEFGGPVPYRPVEDLAYAVARFIQNRGSLINYYMYHG 294
Query: 316 GTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYI 375
GTNFGRT+GGPF TSYDYDAPIDEYGL+ +PKWGHL+DLH AIKLCEPALV+ D
Sbjct: 295 GTNFGRTAGGPFIATSYDYDAPIDEYGLIRQPKWGHLRDLHKAIKLCEPALVSVDPT-VS 353
Query: 376 KLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNT 435
LG QEAHVY C+AFLAN D T+ VTF Y LPPWSVSILPDC+
Sbjct: 354 SLGSKQEAHVYNTR----SGECAAFLANYDPSTSVRVTFGNHPYDLPPWSVSILPDCKTV 409
Query: 436 VFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIG-VWSENN 494
VFNTAKV++ + P+ + I +S SW + E ++++
Sbjct: 410 VFNTAKVNAPSYW---------------PKMTPI------SSFSWHSYNEETASAYADDT 448
Query: 495 FTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFING 554
T+ G++E +++T+D +DYLW++T I + D + F K+ + P +TI S L VFING
Sbjct: 449 TTMAGLVEQISITRDATDYLWYMTDIRI-DSNEGFLKSGQ-WPLLTIFSAGHALHVFING 506
Query: 555 QLTGSVIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLT 610
QL+G+V G + + V + G N L +LS VGL N G E AG G V L
Sbjct: 507 QLSGTVYGGLDNPKLTFSKYVNLRPGVNKLSMLSVAVGLPNVGVHFETWNAGILGPVTLK 566
Query: 611 GFKNGDIDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAP 669
G G D+S W+Y+VGLKGE ++++ + EW + TWYKT F+AP
Sbjct: 567 GLNEGTRDMSGYKWSYKVGLKGEALNLHTVSGSSSVEWMTGSLVSQKQPLTWYKTTFNAP 626
Query: 670 DGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNP 729
G +P+ALD+GSMGKGQ W+NG IGR+W +G C C Y G + KC +CG P
Sbjct: 627 GGNEPLALDMGSMGKGQVWINGESIGRHWPAYTARGSC-GKCYYGGIFTEKKCHFSCGEP 685
Query: 730 TQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLR---STRIVCEQVSESHYPPVRKW 786
+Q WYHVPR+WL+ S N+LVIFEE GGNP IS+ R CE + + W
Sbjct: 686 SQRWYHVPRAWLKPSGNILVIFEEWGGNPDGISLVKRIDTCNGFYCENFKPNQIYKPKIW 745
Query: 787 SNSYS 791
+ ++S
Sbjct: 746 TENWS 750
Score = 475 bits (1223), Expect = e-131, Method: Compositional matrix adjust.
Identities = 251/527 (47%), Positives = 327/527 (62%), Gaps = 33/527 (6%)
Query: 248 IDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
ID CNG+YC+ +KPN KP +WTENW GWYT +GG P+RP ED+AF+VARF Q GGS
Sbjct: 723 IDTCNGFYCENFKPNQIYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQNGGSL 782
Query: 308 MNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALV 367
+NYYMY GGTNFGRTS G F TSYD+DAPIDEYGLL EPKWGHL+DLH AIKLCEPALV
Sbjct: 783 VNYYMYHGGTNFGRTS-GLFVTTSYDFDAPIDEYGLLREPKWGHLRDLHKAIKLCEPALV 841
Query: 368 AADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVS 427
+AD LG++QEA V+++ S C+AFLAN D V F Y LPPWS+S
Sbjct: 842 SADPTS-TWLGKDQEARVFKS----SSGACAAFLANYDTSAFVRVNFWNHPYDLPPWSIS 896
Query: 428 ILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTV--KE 485
ILPDC+ FNTA+V P + +P +++ +K++ S W +E
Sbjct: 897 ILPDCKTVTFNTARVRRD-------------PKLFIP--NLLMAKMTPISSFWWLSYKEE 941
Query: 486 PIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMR 545
P ++++ T G++E ++VT D +DYLW++T I + D F K+ + P +T++S
Sbjct: 942 PASAYAKDTTTKDGLVEQVSVTWDTTDYLWYMTDIRI-DSTEGFLKSGQ-WPLLTVNSAG 999
Query: 546 DVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGA 601
+L VFINGQL+GSV G + + V + G N L +LS TVGL N G + A
Sbjct: 1000 HILHVFINGQLSGSVYGSLEDPRITFSKYVNLKQGVNKLSMLSVTVGLPNVGLHFDTWNA 1059
Query: 602 GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFT 660
G G V L G G D+SK W+Y+VGL+GE +YS++ N +W + P T
Sbjct: 1060 GVLGPVTLKGLNEGTRDMSKYKWSYKVGLRGEILNLYSVKGSNSVQWMKGSFQKQP--LT 1117
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
WYKT F+ P G +P+ALD+ SM KGQ WVNG IGRY+ G C + C Y G +
Sbjct: 1118 WYKTTFNTPAGNEPLALDMSSMSKGQIWVNGRSIGRYFPGYIASGKC-NKCSYTGFFTEK 1176
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRS 767
KC NCG P+Q WYH+PR WL + NLL+I EE GGNP IS+ R+
Sbjct: 1177 KCLWNCGGPSQKWYHIPRDWLSPNGNLLIILEEIGGNPQGISLVKRT 1223
>gi|224096113|ref|XP_002310540.1| predicted protein [Populus trichocarpa]
gi|222853443|gb|EEE90990.1| predicted protein [Populus trichocarpa]
Length = 827
Score = 775 bits (2001), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 398/815 (48%), Positives = 507/815 (62%), Gaps = 54/815 (6%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NVSYD R++II+G R++LISA IHYPR+ P MWP+L+ +KEGG DVIETYVFWN H+
Sbjct: 20 NVSYDSRSLIINGERKLLISAAIHYPRSVPAMWPELVKTAKEGGVDVIETYVFWNVHQPT 79
Query: 106 R-GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNN 164
+Y+F G+ D+VKF+ +V +G+YL LRIGP+V AEWNFGG PVWL + G FRT+N
Sbjct: 80 SPSEYHFDGRFDLVKFINIVQEAGMYLILRIGPFVAAEWNFGGIPVWLHYVNGTVFRTDN 139
Query: 165 APFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQ--IENEYGNMESSYGQQGKDYVKWA 222
FK M+ F IV LM++E LF+ QGGPII+ Q +ENEYG E +YG+ GK Y WA
Sbjct: 140 YNFKYYMEEFTTYIVKLMKKEKLFASQGGPIILSQAKVENEYGYYEGAYGEGGKRYAAWA 199
Query: 223 ASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWG 282
A MA+ GVPW+MC+Q DAP ++I+ CN +YCD +KP +KP +WTENW GW+ T+G
Sbjct: 200 AQMAVSQNTGVPWIMCQQFDAPPSVINTCNSFYCDQFKPIFPDKPKIWTENWPGWFQTFG 259
Query: 283 GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYG 342
PHRP ED+AF+VARFFQ+GGS NYYMY GGTNFGRT+GGPF TSYDY+APIDEYG
Sbjct: 260 APNPHRPAEDVAFSVARFFQKGGSVQNYYMYHGGTNFGRTAGGPFITTSYDYEAPIDEYG 319
Query: 343 LLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLA 402
L PKWGHLK+LH AIKLCE L+ + + LG +QEA VY + C AFLA
Sbjct: 320 LPRLPKWGHLKELHKAIKLCEHVLLNSKPVN-LSLGPSQEADVYA----DASGGCVAFLA 374
Query: 403 NIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNIS 462
NID+ +V F SY LP WSVSILPDC+N V+NTAK
Sbjct: 375 NIDDKNDKTVDFQNVSYKLPAWSVSILPDCKNVVYNTAK--------------------- 413
Query: 463 VPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
K S + W E G+W E +F G ++H+N TKD +DYLW+ T I V
Sbjct: 414 --------QKDGSKALKWEVFVEKAGIWGEPDFMKNGFVDHINTTKDTTDYLWYTTSIVV 465
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYN 578
+++ F K P + I+SM L F+N +L GS G+ K P+ ++G N
Sbjct: 466 GENE-EFLKEGR-HPVLLIESMGHALHAFVNQELQGSASGNGSHSPFKFKNPISLKAGNN 523
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIY 638
++ LLS TVGL N G+F E GAG V++ GF NG +DLS W Y++GL+GE IY
Sbjct: 524 EIALLSMTVGLPNAGSFYEWVGAGLT-SVRIEGFNNGTVDLSHFNWIYKIGLQGEKLGIY 582
Query: 639 SIEE-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
E N W + TWYK D P G +PV LD+ MGKG AW+NG IGRY
Sbjct: 583 KPEGVNSVSWVATSEPPKKQPLTWYKVVLDPPAGNEPVGLDMLHMGKGLAWLNGEEIGRY 642
Query: 698 W---TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEET 754
W + V K C CDYRG + DKC T CG PTQ WYHVPRSW + S NLLVIFEE
Sbjct: 643 WPRKSSVHEK--CVTECDYRGKFMPDKCFTGCGQPTQRWYHVPRSWFKPSGNLLVIFEEK 700
Query: 755 GGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYI 814
GG+P +I+ R +C ++E + RK S G + N A +HL C +
Sbjct: 701 GGDPEKITFSRRKMSSICALIAEDYPSADRK---SLQEAGSKNSNSKA-SVHLGCPQNAV 756
Query: 815 ISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
IS+++FAS+GTP G+C +S G CH P S+SVV +
Sbjct: 757 ISAVKFASFGTPTGKCGSYSEGECHDPNSISVVEK 791
>gi|186510990|ref|NP_190852.2| beta-galactosidase 2 [Arabidopsis thaliana]
gi|332278160|sp|Q9LFA6.2|BGAL2_ARATH RecName: Full=Beta-galactosidase 2; Short=Lactase 2; Flags:
Precursor
gi|13605857|gb|AAK32914.1|AF367327_1 AT3g52840/F8J2_10 [Arabidopsis thaliana]
gi|6686876|emb|CAB64738.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|23308221|gb|AAN18080.1| At3g52840/F8J2_10 [Arabidopsis thaliana]
gi|332645478|gb|AEE78999.1| beta-galactosidase 2 [Arabidopsis thaliana]
Length = 727
Score = 775 bits (2000), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/752 (52%), Positives = 500/752 (66%), Gaps = 43/752 (5%)
Query: 26 MMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKS 85
+++ + C SS ST V+YDH+A+II+G RR+LIS IHYPR+TPEMWPDLI K+
Sbjct: 11 IILAILCFSSLIHST---EAVVTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKA 67
Query: 86 KEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNF 145
KEGG DVI+TYVFWN HE G Y F+ + D+VKF KLV +GLYL LRIGPYVCAEWNF
Sbjct: 68 KEGGLDVIQTYVFWNGHEPSPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNF 127
Query: 146 GGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYG 205
GGFPVWL+ +PG+ FRT+N PFK MQ+F KKIVD+M+EE LF QGGPII+ QIENEYG
Sbjct: 128 GGFPVWLKYVPGMVFRTDNEPFKIAMQKFTKKIVDMMKEEKLFETQGGPIILSQIENEYG 187
Query: 206 NMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYN 265
M+ G GK Y KW A MALGL GVPW+MCKQ DAP IID CNG+YC+G+KPNS N
Sbjct: 188 PMQWEMGAAGKAYSKWTAEMALGLSTGVPWIMCKQEDAPYPIIDTCNGFYCEGFKPNSDN 247
Query: 266 KPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG 325
KP LWTENW GW+T +GG +P+RPVED+AF+VARF Q GGSFMNYYMY+GGTNF RT+ G
Sbjct: 248 KPKLWTENWTGWFTEFGGAIPNRPVEDIAFSVARFIQNGGSFMNYYMYYGGTNFDRTA-G 306
Query: 326 PFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHV 385
F TSYDYDAPIDEYGLL EPK+ HLK+LH IKLCEPALV+ D LG QE HV
Sbjct: 307 VFIATSYDYDAPIDEYGLLREPKYSHLKELHKVIKLCEPALVSVDPT-ITSLGDKQEIHV 365
Query: 386 YRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQ 445
++ S+++C+AFL+N D +AA V F G Y LPPWSVSILPDC+ +NTAK+ +
Sbjct: 366 FK-----SKTSCAAFLSNYDTSSAARVMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRAP 420
Query: 446 TSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSEN---NFTVQGILE 502
T + + +P +ST SW + E G S N F G++E
Sbjct: 421 TILMKM-----------IP---------TSTKFSWESYNE--GSPSSNEAGTFVKDGLVE 458
Query: 503 HLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG 562
+++T+D +DY W+ T I + D+ SF KT + P +TI S L VF+NG L G+ G
Sbjct: 459 QISMTRDKTDYFWYFTDITIGSDE-SFLKTGD-NPLLTIFSAGHALHVFVNGLLAGTSYG 516
Query: 563 HW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDID 618
+ Q ++ G N L LLS VGL N G E G G V L G +G D
Sbjct: 517 ALSNSKLTFSQNIKLSVGINKLALLSTAVGLPNAGVHYETWNTGILGPVTLKGVNSGTWD 576
Query: 619 LSKILWTYQVGLKGEFQQIYSIEENEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVAL 677
+SK W+Y++GL+GE ++++ + A +W TWYK+ FD P G +P+AL
Sbjct: 577 MSKWKWSYKIGLRGEAMSLHTLAGSSAVKWWIKGFVVKKQPLTWYKSSFDTPRGNEPLAL 636
Query: 678 DLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVP 737
D+ +MGKGQ WVNGH+IGR+W +G C C+Y G YN KC ++CG P+Q WYHVP
Sbjct: 637 DMNTMGKGQVWVNGHNIGRHWPAYTARGNC-GRCNYAGIYNEKKCLSHCGEPSQRWYHVP 695
Query: 738 RSWLQASNNLLVIFEETGGNPFEISVKLRSTR 769
RSWL+ NLLVIFEE GG+P IS+ R+ +
Sbjct: 696 RSWLKPFGNLLVIFEEWGGDPSGISLVKRTAK 727
>gi|18148449|dbj|BAB83260.1| beta-D-galactosidase [Persea americana]
Length = 766
Score = 774 bits (1999), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/765 (51%), Positives = 502/765 (65%), Gaps = 48/765 (6%)
Query: 30 LSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGG 89
SC+ S++ S V+YD +AI+I+G RR+LIS IHYPR+TPEMWPDLI K+KEGG
Sbjct: 27 FSCLPSATCS-------VTYDRKAIVINGQRRILISGSIHYPRSTPEMWPDLIQKAKEGG 79
Query: 90 ADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFP 149
DVI+TYVFW+ HE G+Y F+G+ D+VKF+KLV +GLY+ LRIGPY+CAEWN GGFP
Sbjct: 80 LDVIQTYVFWDGHEPSPGKYYFEGRYDLVKFIKLVKQAGLYVNLRIGPYICAEWNLGGFP 139
Query: 150 VWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMES 209
VWL+ IPGI FRT+N PFK M F KKIV++M+ E LF QGGPIIM QIENEYG +E
Sbjct: 140 VWLKYIPGISFRTDNEPFKRYMAGFTKKIVEMMKAESLFEPQGGPIIMSQIENEYGPVEW 199
Query: 210 SYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTL 269
G GK Y +WAASMA+ L GVPW+MCKQ + P+ II+ CNG+YCD +KPN KP +
Sbjct: 200 EIGAIGKVYTRWAASMAVNLNTGVPWIMCKQDEVPDPIINTCNGFYCDWFKPNKDYKPIM 259
Query: 270 WTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYI 329
WTE W GW+T +GG +P+RPVED+A+AV +F Q+GGSF+NYYMY GGTNFGRT+GGPF
Sbjct: 260 WTELWTGWFTAFGGPVPYRPVEDVAYAVVKFIQKGGSFINYYMYHGGTNFGRTAGGPFIA 319
Query: 330 TSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRAN 389
TSYDYDAP+DEYGL EPKWGHL+DLH AIK+CEPALV+ D K+G +QEAHV++
Sbjct: 320 TSYDYDAPLDEYGLKREPKWGHLRDLHRAIKMCEPALVSNDPT-VTKIGDSQEAHVFKF- 377
Query: 390 RYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIK 449
CSAFL N DE VTF G Y LPPWS+SILPDC N V+NT +V +QTS+
Sbjct: 378 ---ESGACSAFLENKDETNFVKVTFQGMQYELPPWSISILPDCVNVVYNTGRVGTQTSMM 434
Query: 450 TVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKD 509
T M+ + S+ SW + E ++E + T++G+ E +++TKD
Sbjct: 435 T-----------------MLSA--SNNEFSWASYNEDTASYNEESMTIEGLSEQISITKD 475
Query: 510 YSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQ 569
+DYL + T + + ++ F K E P +T++S L+VF+NGQL+G+ G V
Sbjct: 476 STDYLRYTTDVTIGQNE-GFLKNGEY-PVLTVNSAGHALQVFVNGQLSGTAYG---SVND 530
Query: 570 P-------VEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKI 622
P V+ +G N + LLS VGL N G E G G V L G G DLS
Sbjct: 531 PRLTFSGKVKLWAGNNKISLLSSAVGLPNVGTHFETWNYGVLGPVTLNGLNEGKRDLSLQ 590
Query: 623 LWTYQVGLKGEFQQIYS-IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGS 681
W+Y+VG+ GE Q++S + EW T P FTWYKT F+AP G DP+ALD+ +
Sbjct: 591 KWSYKVGVIGEALQLHSPTGSSSVEWGSSTSKIQP--FTWYKTTFNAPGGNDPLALDMNT 648
Query: 682 MGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWL 741
MGKGQ W+NG IGRYW G C C Y G Y+ KC NCG +Q WYH+PRSWL
Sbjct: 649 MGKGQIWINGQSIGRYWPAYKANGKC-SACHYTGWYDEKKCGFNCGEASQRWYHIPRSWL 707
Query: 742 QASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKW 786
+ NLLV+FEE GG+P I++ R+ C ++E H P V+ W
Sbjct: 708 NPTGNLLVVFEEWGGDPTGITLVRRTIGSACAYINEWH-PTVKNW 751
>gi|168045683|ref|XP_001775306.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673387|gb|EDQ59911.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 831
Score = 774 bits (1998), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/811 (48%), Positives = 524/811 (64%), Gaps = 48/811 (5%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYD RA+ +DGNRRML+S IHYPR+TP MWP LIAK+K+GG DVI+TYVFW+ HE +
Sbjct: 25 VSYDQRALKLDGNRRMLVSGSIHYPRSTPTMWPGLIAKAKKGGLDVIQTYVFWSGHEPTQ 84
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G YNF G+ D+ KF++LV +G+Y+ LRIGPYVCAEWNFGGFP WLR +PGIEFRT+N
Sbjct: 85 GVYNFAGRYDLPKFLRLVHEAGMYVNLRIGPYVCAEWNFGGFPGWLRFLPGIEFRTDNES 144
Query: 167 FKEEM-QRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
FK + F ++ + S+ +I QIENEYG++++ YG+ G+ Y+ W A+M
Sbjct: 145 FKVHLSHSFTSSLISVYSR----SFNIQLVICAQIENEYGSIDAVYGEAGQKYLNWIANM 200
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ VPW+MC Q DAP ++ID CNG+YCDG++PNS KP LWTENW GW+ +WG
Sbjct: 201 AVATNISVPWIMCNQPDAPPSVIDTCNGFYCDGFRPNSEGKPALWTENWTGWFQSWGEGA 260
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P RPV+D+AFAVARFFQ+GGSFM+YYMY GGTNF R S T+YDYDAPIDEYG +
Sbjct: 261 PTRPVQDIAFAVARFFQKGGSFMHYYMYHGGTNFER-SAMEGVTTNYDYDAPIDEYGDVR 319
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSA-QYIKLGQNQEAHVYRANRYGSQSNCSAFLAN- 403
+PKWGHLKDLHAA+KLCE LV D+ I LG QEAHVY + S C+AFLA+
Sbjct: 320 QPKWGHLKDLHAALKLCELCLVGVDTVPSEISLGPYQEAHVYNS----STGACAAFLASW 375
Query: 404 -IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNIS 462
D+ T V F GQSY LP WSVSILPDC++ VFNTAKV Q+ T++ ++P++
Sbjct: 376 GTDDST---VLFQGQSYDLPAWSVSILPDCKSVVFNTAKVGVQSMTMTMQSAIPVT---- 428
Query: 463 VPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
+W++ +EP+ W + F+ ++E + TKD +DYLW+ T + V
Sbjct: 429 ----------------NWVSYREPLEPWG-STFSTNELVEQIATTKDTTDYLWYTTNVEV 471
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLIL 582
++ D + T+ + +RD +F+N LTG+ H + Q + + G N + +
Sbjct: 472 AESDA---PNGLAQATLVMSYLRDAAHIFVNKWLTGTKSAHGSEASQSISLRPGINSVKV 528
Query: 583 LSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEE 642
LS T GLQ G FLEK+ AG + +++ G +G I + + WTYQVGL+GE +++ E
Sbjct: 529 LSMTTGLQGTGPFLEKEKAGIQFGIRVEGLPSGAIIMQRNTWTYQVGLQGENNRLF--ES 586
Query: 643 N---EAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW- 698
N A W+ T + +W+KT FD P+ VALDL SMGKGQ WVNG ++GRYW
Sbjct: 587 NGSLSAVWSTSTDVSNQMSLSWFKTTFDMPERNGTVALDLSSMGKGQVWVNGINLGRYWS 646
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
+ +A GC D CDYRG+++ KC T CG P+Q+WYHVPR WL + NLLV+FEE GNP
Sbjct: 647 SCIAHTDGCVDNCDYRGSHSESKCLTKCGQPSQSWYHVPREWLLSKQNLLVLFEEQEGNP 706
Query: 759 FEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSI 818
I++ R + +C ++SESH P+ S+S + S +AP + L C DG IS I
Sbjct: 707 EAITIAPRIPQHICSRMSESHPFPI-PLSSSTKRGSQTSTPPIAP-LALECADGQHISRI 764
Query: 819 EFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
FASYGTP G C F +CHA S V+S+
Sbjct: 765 SFASYGTPSGDCGDFKLSSCHANSSKDVLSK 795
>gi|350538173|ref|NP_001234842.1| ss-galactosidase precursor [Solanum lycopersicum]
gi|4138141|emb|CAA10175.1| ss-galactosidase [Solanum lycopersicum]
Length = 724
Score = 774 bits (1998), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/728 (53%), Positives = 493/728 (67%), Gaps = 34/728 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYD RAIII+G R++LIS IHYPR+TP+MWPDLI K+K+GG DVIETYVFWN H
Sbjct: 24 SVSYDDRAIIINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGHGPS 83
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+YNF+G+ D+V+F+K+V +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PG+EFRTNN
Sbjct: 84 PGKYNFEGRYDLVRFIKMVQRAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGMEFRTNNQ 143
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK M+ FV+KIV++M+ E LF QGGPIIM QIENEYG +E G GK Y KWAA M
Sbjct: 144 PFKVAMRGFVQKIVNMMKSENLFESQGGPIIMAQIENEYGPVEWEIGAPGKAYTKWAAQM 203
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL GVPW+MCKQ DAP+ +ID CNG+YC+G++PN KP +WTE W GWYT +GG +
Sbjct: 204 AVGLKTGVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKPYKPKMWTEVWTGWYTKFGGPI 263
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P RP ED+AF+VARF Q GSF NYYMY GGTNFGRTS G F TSYDYDAP+DEYGLL+
Sbjct: 264 PQRPAEDIAFSVARFVQNNGSFFNYYMYHGGTNFGRTSSGLFIATSYDYDAPLDEYGLLN 323
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPK+GHL+DLH AIKL EPALV++ +A LG NQEAHVYR+ C+AFL+N D
Sbjct: 324 EPKYGHLRDLHKAIKLSEPALVSSYAA-VTSLGSNQEAHVYRS----KSGACAAFLSNYD 378
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ VTF + Y LPPWS+SILPDC+ V+NTA+V+SQ+S S+ ++P
Sbjct: 379 SRYSVKVTFQNRPYNLPPWSISILPDCKTAVYNTAQVNSQSS------SIKMTP------ 426
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENN-FTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
+ SW + E +++ T G+ E NVT+D SDYLW++T + ++
Sbjct: 427 --------AGGGLSWQSYNEETPTADDSDTLTANGLWEQKNVTRDSSDYLWYMTNVNIAS 478
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDL 580
++ F K N P +T+ S VL VF+NG+L+G+V G + V+ ++G N +
Sbjct: 479 NE-GFLK-NGKDPYLTVMSAGHVLHVFVNGKLSGTVYGTLDNPKLTYSGNVKLRAGINKI 536
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE-FQQIYS 639
LLS +VGL N G + AG G V L+G G +L+K W+Y+VGLKGE
Sbjct: 537 SLLSVSVGLPNVGVHYDTWNAGVLGPVTLSGLNEGSRNLAKQKWSYKVGLKGESLSLHSL 596
Query: 640 IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
+ EW + TWYK F+AP G DP+ALD+ SMGKGQ W+NG +GR+W
Sbjct: 597 SGSSSVEWVRGSLVAQKQPLTWYKATFNAPGGNDPLALDMASMGKGQIWINGEGVGRHWP 656
Query: 700 VVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPF 759
+G C C Y G +N KC TNCG P+Q WYHVPRSWL+ S NLLV+FEE GGNP
Sbjct: 657 GYIAQGDCSK-CSYAGTFNEKKCQTNCGQPSQRWYHVPRSWLKPSGNLLVVFEEWGGNPT 715
Query: 760 EISVKLRS 767
IS+ RS
Sbjct: 716 GISLVRRS 723
>gi|14970843|emb|CAC44502.1| beta-galactosidase [Fragaria x ananassa]
Length = 722
Score = 773 bits (1996), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/750 (51%), Positives = 496/750 (66%), Gaps = 42/750 (5%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
M ++ +S +SS+ AS V YDHRAII++G RR+LIS IHYPR+TPEMWPDL+
Sbjct: 10 MFFLLFLVSWLSSALAS-------VGYDHRAIIVNGKRRILISGSIHYPRSTPEMWPDLL 62
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K+K+GG DV++TYVFWN HE G+Y F+ + D+VKF+KL GLY+ LRIGPY+CAE
Sbjct: 63 QKAKDGGLDVLQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLAQQHGLYVHLRIGPYICAE 122
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
WNFGGFPVWL+ +PGI FRT+N PF M++F +KIV +M+ E LF QGGPII+ QIEN
Sbjct: 123 WNFGGFPVWLKYVPGIAFRTDNRPFMAAMEKFTQKIVYMMKAERLFQTQGGPIILSQIEN 182
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPN 262
EYG +E G GK Y +WAA MA+GL GVPWVMCKQ DAP+ IID CNG+YC+ + PN
Sbjct: 183 EYGPVEWEIGAPGKSYTQWAAKMAVGLNTGVPWVMCKQEDAPDPIIDTCNGFYCENFTPN 242
Query: 263 SYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRT 322
KP +WTE W GWYT +GG +P RP +DLAF+VARF Q GGSF NYYMY GGTNFGRT
Sbjct: 243 KNYKPKMWTEIWTGWYTEFGGAVPTRPAQDLAFSVARFIQNGGSFANYYMYHGGTNFGRT 302
Query: 323 SGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQE 382
+GGPF TSYDYDAP+DEYGL EPK+ HLK +H AIK+ EPAL+A D+A KLG NQE
Sbjct: 303 AGGPFIATSYDYDAPLDEYGLPREPKYSHLKYMHKAIKMAEPALLATDAA-VSKLGNNQE 361
Query: 383 AHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKV 442
AHVY+ S+S C+AFLAN D VTF + Y LPPWS+SILPDC+ VFNTA+V
Sbjct: 362 AHVYQ-----SRSGCAAFLANYDTKYPVRVTFWNKQYNLPPWSISILPDCKTEVFNTARV 416
Query: 443 SSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILE 502
K ++P + Q+ IE +S +N FT G+ E
Sbjct: 417 GQSPPTK-------MTPVAHLSWQAYIEDVATSA--------------DDNAFTSVGLRE 455
Query: 503 HLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG 562
+++T D +DYLW++T I + ++ F +T + PT+ +DS L VFINGQL+GS G
Sbjct: 456 QISLTWDNTDYLWYMTDITIGPNE-QFLRTGKY-PTLKVDSAGHALHVFINGQLSGSAYG 513
Query: 563 HW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDID 618
++ Q V+ ++G N L LLS +VGL N G E G G V L G +G D
Sbjct: 514 TLAFPKLEFNQGVKLRAGINKLALLSVSVGLANVGLHFETWNTGVLGPVTLAGVNSGTWD 573
Query: 619 LSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVAL 677
+++ WTY++G++GE ++++ + EW + TWYK +AP G P+AL
Sbjct: 574 MTRWQWTYKIGMRGEDMSLHTVSGSSSVEWVQGSLLAQYRPLTWYKAILNAPPGNAPLAL 633
Query: 678 DLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVP 737
D+GSMGKGQ W+NG IGR+W G C C Y G Y +KC TNCG P+Q WYHVP
Sbjct: 634 DMGSMGKGQMWINGQSIGRHWPAYKAHGSC-GACYYAGTYTENKCRTNCGQPSQRWYHVP 692
Query: 738 RSWLQASNNLLVIFEETGGNPFEISVKLRS 767
RSWL++S NLLV+FEE GG+P +IS+ RS
Sbjct: 693 RSWLKSSGNLLVVFEEWGGDPTKISLVARS 722
>gi|186461094|gb|ACC78255.1| beta-galactosidase [Carica papaya]
Length = 721
Score = 773 bits (1996), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/725 (52%), Positives = 493/725 (68%), Gaps = 33/725 (4%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYDH+AIII+G RR+LIS IHYPR+TP+MWPDLI +KEGG DVI+TYVFWN HE
Sbjct: 23 VSYDHKAIIINGRRRILISGSIHYPRSTPQMWPDLIQNAKEGGLDVIQTYVFWNGHEPSP 82
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G Y F+ + D+VKF+KLV +GLY+ LRIGPY+C EWNFGGFPVWL+ +PGI+FRT+N P
Sbjct: 83 GNYYFEDRYDLVKFIKLVHQAGLYVHLRIGPYICGEWNFGGFPVWLKYVPGIQFRTDNGP 142
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK +MQ+F +KIV++M+ E LF QGGPIIM QIENEYG +E G GK Y KWAA MA
Sbjct: 143 FKAQMQKFTEKIVNMMKAEKLFEPQGGPIIMSQIENEYGPIEWEIGAPGKAYTKWAAQMA 202
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+GLG GVPW+MCKQ DAP+ IID CNG+YC+ + PN+ KP ++TE W GWYT +GG +P
Sbjct: 203 VGLGTGVPWIMCKQEDAPDPIIDTCNGFYCENFMPNANYKPKMFTEAWTGWYTEFGGPVP 262
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
+RP ED+A++VARF Q GSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL E
Sbjct: 263 YRPAEDMAYSVARFIQNRGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLRRE 322
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHL+DLH IKLCEP+LV+ D + LG NQEAHV+ ++++C+AFLAN D
Sbjct: 323 PKWGHLRDLHKTIKLCEPSLVSVD-PKVTSLGSNQEAHVFW-----TKTSCAAFLANYDL 376
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+ VTF Y LPPWSVSILPDC+ VFNTAKV SQ S+ + ++ N + Q
Sbjct: 377 KYSVRVTFQNLPYDLPPWSVSILPDCKTVVFNTAKVVSQGSLAKM-----IAVNSAFSWQ 431
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
S E S+ + FT G+ E ++VT+D +DYLW++T + + D+
Sbjct: 432 SYNEETPSANYDA--------------VFTKDGLWEQISVTRDATDYLWYMTDVTIGPDE 477
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLIL 582
+F K N P +T+ S L VF+NGQL+G+V G + V+ ++G N + L
Sbjct: 478 -AFLK-NGQDPILTVMSAGHALHVFVNGQLSGTVYGQLENPKLAFSGKVKLRAGVNKVSL 535
Query: 583 LSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE- 641
LS VGL N G E AG G V L G +G D+SK W+Y++GLKGE ++++
Sbjct: 536 LSIAVGLPNVGLHFETWNAGVLGPVTLKGVNSGTWDMSKWKWSYKIGLKGEALSLHTVSG 595
Query: 642 ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVV 701
+ EW + + WYKT F+AP G DP+ALD+ SMGKGQ W+NG IGR+W
Sbjct: 596 SSSVEWVEGSLLAQRQPLIWYKTTFNAPVGNDPLALDMNSMGKGQIWINGQSIGRHWPGY 655
Query: 702 APKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEI 761
+G C C+Y G Y+ KC +NCG +Q WYHVPRSWL + NLLV+FEE GG+P +I
Sbjct: 656 KARGSC-GACNYAGIYDEKKCHSNCGKASQRWYHVPRSWLNPTANLLVVFEEWGGDPTKI 714
Query: 762 SVKLR 766
S+ R
Sbjct: 715 SLVKR 719
>gi|242093394|ref|XP_002437187.1| hypothetical protein SORBIDRAFT_10g022620 [Sorghum bicolor]
gi|241915410|gb|EER88554.1| hypothetical protein SORBIDRAFT_10g022620 [Sorghum bicolor]
Length = 725
Score = 772 bits (1993), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/727 (52%), Positives = 482/727 (66%), Gaps = 37/727 (5%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYDHRA++I+G RR+LIS IHYPR+TPEMWPDL+ K+K+GG DV++TYVFWN HE +
Sbjct: 31 VSYDHRAVVINGQRRILISGSIHYPRSTPEMWPDLLQKAKDGGLDVVQTYVFWNGHEPQQ 90
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY F + D+V+FVKL +GL++ LRIGPYVCAEWNFGGFPVWL+ +PG+ FRT+NAP
Sbjct: 91 GQYYFGDRYDLVRFVKLAKQAGLFVHLRIGPYVCAEWNFGGFPVWLKYVPGVSFRTDNAP 150
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ FV+KIV +M+ E LF WQGGPII+ Q+ENEYG MES G K Y WAA MA
Sbjct: 151 FKAAMQAFVEKIVSMMKAEGLFEWQGGPIILAQVENEYGPMESVMGGGAKPYANWAAKMA 210
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+ GAGVPWVMCKQ DAP+ +I+ CNG+YCD + PNS +KPT+WTE W GW+T +GG +P
Sbjct: 211 VATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNSNSKPTMWTEAWTGWFTAFGGAVP 270
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
HRPVED+AFAVARF Q+GGSF+NYYMY GGTNF RTSGGPF TSYDYDAPIDEYGLL +
Sbjct: 271 HRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGLLRQ 330
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHL+DLH AIK EPALV+ D +G ++A+VY++ S C+AFL+N
Sbjct: 331 PKWGHLRDLHKAIKQAEPALVSGDPT-IQTIGNYEKAYVYKS----SSGACAAFLSNYHT 385
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+ AA V F G+ Y LP WS+S+LPDCR VFNTA VSS P +P P
Sbjct: 386 NAAARVVFNGRRYDLPAWSISVLPDCRTAVFNTATVSS-----------PSAPARMTPAG 434
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
SW + E + FT G++E L++T D SDYLW+ T + ++ ++
Sbjct: 435 GF----------SWQSYSEATNSLDDRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNE 484
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLIL 582
F K+ + P +TI S L+VF+NGQ G+ G + + V+ G N + +
Sbjct: 485 -QFLKSGQ-WPQLTIYSAGHALQVFVNGQSYGAAYGGYDSPKLTYSGYVKMWQGSNKISI 542
Query: 583 LSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE- 641
LS VGL N G E G G V L+G G DLS WTYQ+GL GE ++S+
Sbjct: 543 LSAAVGLPNQGTHYEAWNVGVLGPVTLSGLNEGKRDLSNQKWTYQIGLHGESLGVHSVAG 602
Query: 642 ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVV 701
+ EW TW+K YF+AP G PVALD+ SMGKGQAWVNGHHIGRYW+
Sbjct: 603 SSSVEWGSAAGK---QPLTWHKAYFNAPSGNAPVALDMSSMGKGQAWVNGHHIGRYWSYK 659
Query: 702 APKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEI 761
A G C C Y G Y+ KC T CG+ +Q +YHVPRSWL S NLLV+ EE GG+ +
Sbjct: 660 ATGGSC-GGCSYAGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVVLEEFGGDLSGV 718
Query: 762 SVKLRST 768
+ R+T
Sbjct: 719 KLVTRTT 725
>gi|357438127|ref|XP_003589339.1| Beta-galactosidase [Medicago truncatula]
gi|355478387|gb|AES59590.1| Beta-galactosidase [Medicago truncatula]
Length = 745
Score = 771 bits (1992), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/729 (52%), Positives = 489/729 (67%), Gaps = 40/729 (5%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD +AIII+G RR+LIS IHYPR+TPEMW DLI K+K+GG DVI+TYVFWN HE
Sbjct: 29 VTYDRKAIIINGQRRILISGSIHYPRSTPEMWEDLIQKAKDGGLDVIDTYVFWNVHEPSP 88
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G YNF+G+ D+V+F+K V GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N P
Sbjct: 89 GNYNFEGRYDLVQFIKTVQKKGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGP 148
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ F +KIV +M+ E LF QGGPII+ QIENEYG + G G Y WAA MA
Sbjct: 149 FKAAMQGFTQKIVQMMKNEKLFQSQGGPIILSQIENEYGPQGRALGASGHAYSNWAAKMA 208
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+GLG GVPWVMCK+ DAP+ +I+ACNG+YCD + PN KP LWTE+W GW++ +GG P
Sbjct: 209 VGLGTGVPWVMCKEDDAPDPVINACNGFYCDDFSPNKPYKPKLWTESWSGWFSEFGGSNP 268
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
RPVEDLAFAVARF Q+GGSF NYYMY GGTNFGR++GGPF TSYDYDAPIDEYGLL E
Sbjct: 269 QRPVEDLAFAVARFIQKGGSFFNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGLLRE 328
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PK+GHLKDLH AIK CE ALV++D LG ++AHV + S + C+AFLAN
Sbjct: 329 PKYGHLKDLHKAIKQCEHALVSSDPT-VTSLGAYEQAHV-----FSSGTTCAAFLANYHS 382
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
++AA VTF + Y LPPWS+SILPDCR VFNTA++ Q S Q
Sbjct: 383 NSAARVTFNNRHYDLPPWSISILPDCRTDVFNTARMRFQPS-----------------QI 425
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENN-FTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
M+ S +S SW T E + +E++ T +LE ++ T+D SDYLW+IT + +S
Sbjct: 426 QMLPS--NSKLLSWETYDEDVSSLAESSRITASRLLEQIDATRDTSDYLWYITSVDISSS 483
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLI 581
+ SF + +P++++ S D + VFING+ +GS G P++ ++G N +
Sbjct: 484 E-SFLRGRN-KPSISVHSSGDAVHVFINGKFSGSAFGTREDRSFTFNGPIDLRAGTNKIA 541
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS VGL N G E +G G V L +G DL+ W+YQVGLKGE + +
Sbjct: 542 LLSVAVGLPNGGIHFESWKSGITGPVLLHDLDHGQKDLTGQKWSYQVGLKGEAMNL--VS 599
Query: 642 ENEAEWTDLTRDGIPS----TFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
N D + + S W+K +F+AP+G++P+ALD+ SMGKGQ W+NG IGRY
Sbjct: 600 PNGVSSVDWVSESLASQNQPQLKWHKAHFNAPNGVEPLALDMSSMGKGQVWINGQSIGRY 659
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
W V A KG C ++C+Y G Y KC CG PTQ WYHVPRSWL+ NNL+V+FEE GGN
Sbjct: 660 WMVYA-KGNC-NSCNYAGTYRQAKCQVGCGQPTQRWYHVPRSWLKPKNNLMVVFEELGGN 717
Query: 758 PFEISVKLR 766
P++IS+ R
Sbjct: 718 PWKISLVKR 726
>gi|7529708|emb|CAB86888.1| beta-galactosidase precursor-like protein [Arabidopsis thaliana]
Length = 727
Score = 771 bits (1991), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/752 (52%), Positives = 499/752 (66%), Gaps = 43/752 (5%)
Query: 26 MMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKS 85
+++ + C SS ST V+YDH+A+II+G RR+LIS IHYPR+TPEMWPDLI K+
Sbjct: 11 IILAILCFSSLIHST---EAVVTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKA 67
Query: 86 KEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNF 145
KEGG DVI+TYVFWN HE G Y F+ + D+VKF KLV +GLYL LRIGPYVCAEWNF
Sbjct: 68 KEGGLDVIQTYVFWNGHEPSPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNF 127
Query: 146 GGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYG 205
GGFPVWL+ +PG+ FRT+N PFK MQ+F KKIVD+M+EE LF QGGPII+ QIENEYG
Sbjct: 128 GGFPVWLKYVPGMVFRTDNEPFKIAMQKFTKKIVDMMKEEKLFETQGGPIILSQIENEYG 187
Query: 206 NMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYN 265
M+ G GK Y KW A MALGL GVPW+M KQ DAP IID CNG+YC+G+KPNS N
Sbjct: 188 PMQWEMGAAGKAYSKWTAEMALGLSTGVPWIMSKQEDAPYPIIDTCNGFYCEGFKPNSDN 247
Query: 266 KPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG 325
KP LWTENW GW+T +GG +P+RPVED+AF+VARF Q GGSFMNYYMY+GGTNF RT+ G
Sbjct: 248 KPKLWTENWTGWFTEFGGAIPNRPVEDIAFSVARFIQNGGSFMNYYMYYGGTNFDRTA-G 306
Query: 326 PFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHV 385
F TSYDYDAPIDEYGLL EPK+ HLK+LH IKLCEPALV+ D LG QE HV
Sbjct: 307 VFIATSYDYDAPIDEYGLLREPKYSHLKELHKVIKLCEPALVSVDPT-ITSLGDKQEIHV 365
Query: 386 YRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQ 445
++ S+++C+AFL+N D +AA V F G Y LPPWSVSILPDC+ +NTAK+ +
Sbjct: 366 FK-----SKTSCAAFLSNYDTSSAARVMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRAP 420
Query: 446 TSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSEN---NFTVQGILE 502
T + + +P +ST SW + E G S N F G++E
Sbjct: 421 TILMKM-----------IP---------TSTKFSWESYNE--GSPSSNEAGTFVKDGLVE 458
Query: 503 HLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG 562
+++T+D +DY W+ T I + D+ SF KT + P +TI S L VF+NG L G+ G
Sbjct: 459 QISMTRDKTDYFWYFTDITIGSDE-SFLKTGD-NPLLTIFSAGHALHVFVNGLLAGTSYG 516
Query: 563 HW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDID 618
+ Q ++ G N L LLS VGL N G E G G V L G +G D
Sbjct: 517 ALSNSKLTFSQNIKLSVGINKLALLSTAVGLPNAGVHYETWNTGILGPVTLKGVNSGTWD 576
Query: 619 LSKILWTYQVGLKGEFQQIYSIEENEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVAL 677
+SK W+Y++GL+GE ++++ + A +W TWYK+ FD P G +P+AL
Sbjct: 577 MSKWKWSYKIGLRGEAMSLHTLAGSSAVKWWIKGFVVKKQPLTWYKSSFDTPRGNEPLAL 636
Query: 678 DLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVP 737
D+ +MGKGQ WVNGH+IGR+W +G C C+Y G YN KC ++CG P+Q WYHVP
Sbjct: 637 DMNTMGKGQVWVNGHNIGRHWPAYTARGNC-GRCNYAGIYNEKKCLSHCGEPSQRWYHVP 695
Query: 738 RSWLQASNNLLVIFEETGGNPFEISVKLRSTR 769
RSWL+ NLLVIFEE GG+P IS+ R+ +
Sbjct: 696 RSWLKPFGNLLVIFEEWGGDPSGISLVKRTAK 727
>gi|3869280|gb|AAC77377.1| beta-galactosidase precursor [Carica papaya]
Length = 721
Score = 771 bits (1991), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/725 (52%), Positives = 492/725 (67%), Gaps = 33/725 (4%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYDH+AIII+G RR+LIS IHYPR+TP+MWPDLI +KEGG DVI+TYVFWN HE
Sbjct: 23 VSYDHKAIIINGRRRILISGSIHYPRSTPQMWPDLIQNAKEGGLDVIQTYVFWNGHEPSP 82
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G Y F+ + D+VKF+KLV +GLY+ LRI PY+C EWNFGGFPVWL+ +PGI+FRT+N P
Sbjct: 83 GNYYFEDRYDLVKFIKLVHQAGLYVHLRISPYICGEWNFGGFPVWLKYVPGIQFRTDNGP 142
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK +MQ+F +KIV++M+ E LF QGGPIIM QIENEYG +E G GK Y KWAA MA
Sbjct: 143 FKAQMQKFTEKIVNMMKAEKLFEPQGGPIIMSQIENEYGPIEWEIGAPGKAYTKWAAQMA 202
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+GLG GVPW+MCKQ DAP+ IID CNG+YC+ + PN+ KP ++TE W GWYT +GG +P
Sbjct: 203 VGLGTGVPWIMCKQEDAPDPIIDTCNGFYCENFMPNANYKPKMFTEAWTGWYTEFGGPVP 262
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
+RP ED+A++VARF Q GSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL E
Sbjct: 263 YRPAEDMAYSVARFIQNRGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLRRE 322
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHL+DLH IKLCEP+LV+ D + LG NQEAHV+ ++++C+AFLAN D
Sbjct: 323 PKWGHLRDLHKTIKLCEPSLVSVD-PKVTSLGSNQEAHVFW-----TKTSCAAFLANYDL 376
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+ VTF Y LPPWSVSILPDC+ VFNTAKV SQ S+ + ++ N + Q
Sbjct: 377 KYSVRVTFQNLPYDLPPWSVSILPDCKTVVFNTAKVVSQGSLAKM-----IAVNSAFSWQ 431
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
S E S+ + FT G+ E ++VT+D +DYLW++T + + D+
Sbjct: 432 SYNEETPSANYDA--------------VFTKDGLWEQISVTRDATDYLWYMTDVTIGPDE 477
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLIL 582
+F K N P +T+ S L VF+NGQL+G+V G + V+ ++G N + L
Sbjct: 478 -AFLK-NGQDPILTVMSAGHALHVFVNGQLSGTVYGQLENPKLAFSGKVKLRAGVNKVSL 535
Query: 583 LSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE- 641
LS VGL N G E AG G V L G +G D+SK W+Y++GLKGE ++++
Sbjct: 536 LSIAVGLPNVGLHFETWNAGVLGPVTLKGVNSGTWDMSKWKWSYKIGLKGEALSLHTVSG 595
Query: 642 ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVV 701
+ EW + + WYKT F+AP G DP+ALD+ SMGKGQ W+NG IGR+W
Sbjct: 596 SSSVEWVEGSLLAQRQPLIWYKTTFNAPVGNDPLALDMNSMGKGQIWINGQSIGRHWPGY 655
Query: 702 APKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEI 761
+G C C+Y G Y+ KC +NCG +Q WYHVPRSWL + NLLV+FEE GG+P +I
Sbjct: 656 KARGSC-GACNYAGIYDEKKCHSNCGKASQRWYHVPRSWLNPTANLLVVFEEWGGDPTKI 714
Query: 762 SVKLR 766
S+ R
Sbjct: 715 SLVKR 719
>gi|308550950|gb|ADO34789.1| beta-galactosidase STBG4 [Solanum lycopersicum]
Length = 724
Score = 771 bits (1991), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/728 (52%), Positives = 493/728 (67%), Gaps = 34/728 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYD RAIII+G R++LIS IHYPR+TP+MWPDLI K+K+GG DVIETYVFWN HE
Sbjct: 24 SVSYDDRAIIINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGHEPS 83
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+YNF+G+ D+V+F+K+V +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PG+EFRTNN
Sbjct: 84 PGKYNFEGRYDLVRFIKMVQRAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGMEFRTNNQ 143
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ FV+KIV++M+ E LF QGGPIIM QIENEYG +E G GK Y KWAA M
Sbjct: 144 PFKVAMQGFVQKIVNMMKSENLFESQGGPIIMAQIENEYGPVEWEIGAPGKAYTKWAAQM 203
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL GVPW+MCK+ DAP+ +ID CNG+YC+G++PN KP +WTE W GWYT +GG +
Sbjct: 204 AVGLKTGVPWIMCKREDAPDPVIDTCNGFYCEGFRPNKPYKPKMWTEVWTGWYTKFGGPI 263
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P RP ED+AF+VARF Q GSF NYYMY GGTNFGRTS G F TSYDYDAP+DEYGLL+
Sbjct: 264 PQRPAEDIAFSVARFVQNNGSFFNYYMYHGGTNFGRTSSGLFIATSYDYDAPLDEYGLLN 323
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPK+GHL+DLH AIKL EPALV++ +A LG NQEAHVYR+ C+AFL+N D
Sbjct: 324 EPKYGHLRDLHKAIKLSEPALVSSYAA-VTSLGSNQEAHVYRS----KSGACAAFLSNYD 378
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ VTF + Y LPPWS+SILPDC+ V+NTA+V+SQ+S S+ ++P
Sbjct: 379 SRYSVKVTFQNRPYNLPPWSISILPDCKTAVYNTAQVNSQSS------SIKMTP------ 426
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENN-FTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
+ SW + E +++ T G+ E NVT+D SDYLW++T + ++
Sbjct: 427 --------AGGGLSWQSYNEETPTADDSDTLTANGLWEQKNVTRDSSDYLWYMTNVNIAS 478
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDL 580
++ F + N P +T+ S VL VF+NG+L+G+V G + V+ ++G N +
Sbjct: 479 NE-GFLR-NGKDPYLTVMSAGHVLHVFVNGKLSGTVYGTLDNPKLTYSGNVKLRAGINKI 536
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE-FQQIYS 639
LLS +VGL N G + AG G V L+G G +L+K W+Y+VGLKGE
Sbjct: 537 SLLSVSVGLPNVGVHYDTWNAGVLGPVTLSGLNEGSRNLAKQKWSYKVGLKGESLSLHSL 596
Query: 640 IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
+ EW + TWYK F+AP G DP+AL + SMGKGQ W+NG +GR+W
Sbjct: 597 SGSSSVEWVRGSLVAQKQPLTWYKATFNAPGGNDPLALGMASMGKGQIWINGEGVGRHWP 656
Query: 700 VVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPF 759
+G C C Y G +N KC TNCG P+Q W+HVPRSWL+ S NLLV+FEE GGNP
Sbjct: 657 GYIAQGDCSK-CSYAGTFNEKKCQTNCGQPSQRWHHVPRSWLKPSGNLLVVFEEWGGNPT 715
Query: 760 EISVKLRS 767
IS+ RS
Sbjct: 716 GISLVRRS 723
>gi|20384648|gb|AAK31801.1| beta-galactosidase [Citrus sinensis]
Length = 737
Score = 770 bits (1989), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/744 (52%), Positives = 492/744 (66%), Gaps = 33/744 (4%)
Query: 30 LSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGG 89
L +S S F +VSYDH+A+II+G +R+LIS IHYPR+TPEMWPDLI K+K+GG
Sbjct: 22 LVLLSFCSWEISFVKASVSYDHKAVIINGQKRILISGSIHYPRSTPEMWPDLIQKAKDGG 81
Query: 90 ADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFP 149
DVI+TYVFWN HE +G Y F+ + D+V+F+KLV +GLY+ LRIGPYVCAEWN+GGFP
Sbjct: 82 LDVIQTYVFWNGHEPTQGNYYFQDRYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYGGFP 141
Query: 150 VWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMES 209
VWL+ +PGIEFRT+N PFK M +F +KIV +M+ E LF QGGPII+ QIENE+G +E
Sbjct: 142 VWLKYVPGIEFRTDNGPFKAAMHKFTEKIVSMMKAEKLFQTQGGPIILSQIENEFGPVEW 201
Query: 210 SYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTL 269
G GK Y KWAA MA+GL GVPWVMCKQ DAP+ +I+ CNG+YC+ + PN KP +
Sbjct: 202 DIGAPGKAYAKWAAQMAVGLNTGVPWVMCKQDDAPDPVINTCNGFYCEKFVPNQNYKPKM 261
Query: 270 WTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYI 329
WTE W GW+T +G +P RP EDL F+VARF Q GGSF+NYYMY GGTNFGRTSGG F
Sbjct: 262 WTEAWTGWFTEFGSAVPTRPAEDLVFSVARFIQSGGSFINYYMYHGGTNFGRTSGG-FVA 320
Query: 330 TSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRAN 389
TSYDYDAPIDEYGLL+EPKWGHL+ LH AIKLCEPALV+ D LG+NQEAHV+ +
Sbjct: 321 TSYDYDAPIDEYGLLNEPKWGHLRGLHKAIKLCEPALVSVDPT-VKSLGENQEAHVFNS- 378
Query: 390 RYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIK 449
C+AFLAN D +A V+F Y LPPWS+S+LPDC+ VFNTA+V Q+S K
Sbjct: 379 ---ISGKCAAFLANYDTTFSAKVSFGNAQYDLPPWSISVLPDCKTAVFNTARVGVQSSQK 435
Query: 450 TVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKD 509
+P+ S QS IE SST +N FT G+ E + +T D
Sbjct: 436 KF---VPVINAFS--WQSYIEETASST--------------DDNTFTKDGLWEQVYLTAD 476
Query: 510 YSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----V 565
SDYLW++T + + ++ F K N P +TI S L+VFINGQL+G+V G +
Sbjct: 477 ASDYLWYMTDVNIGSNE-GFLK-NGQDPLLTIWSAGHALQVFINGQLSGTVYGSLENPKL 534
Query: 566 KVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWT 625
+ V+ ++G N + LLS +VGL N G EK AG G V L G G D+SK WT
Sbjct: 535 TFSKNVKLRAGVNKISLLSTSVGLPNVGTHFEKWNAGVLGPVTLKGLNEGTRDISKQKWT 594
Query: 626 YQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGK 684
Y++GLKGE ++++ + EW TWYKT F+ P G DP+ALD+G+MGK
Sbjct: 595 YKIGLKGEALSLHTVSGSSSVEWAQGASLAQKQPMTWYKTTFNVPPGNDPLALDMGAMGK 654
Query: 685 GQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQAS 744
G W+NG IGR+W G C C+Y G Y KC T CG P+Q WYHVPRS L+ S
Sbjct: 655 GMVWINGQSIGRHWPGYIGNGNC-GGCNYAGTYTEKKCRTYCGKPSQRWYHVPRSRLKPS 713
Query: 745 NNLLVIFEETGGNPFEISVKLRST 768
NLLV+FEE GG P IS+ R+T
Sbjct: 714 GNLLVVFEEWGGEPHWISLLKRTT 737
>gi|7682680|gb|AAF67342.1| beta galactosidase [Vigna radiata]
Length = 739
Score = 770 bits (1987), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/730 (53%), Positives = 489/730 (66%), Gaps = 40/730 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YD +AIII+G RR+LIS IHYPR+TPEMW DLI K+K GG D I+TYVFWN HE
Sbjct: 27 SVTYDRKAIIINGQRRILISGSIHYPRSTPEMWEDLIRKAKGGGLDAIDTYVFWNVHEPS 86
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G YNF+G+ D+V+F+K V GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N
Sbjct: 87 PGIYNFEGRYDLVRFIKTVQRVGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNG 146
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ F +KIV +M+ E LF QGGPII+ QIENEYG+ G G Y WAA M
Sbjct: 147 PFKAAMQGFTQKIVQMMKNEKLFQSQGGPIILSQIENEYGSESKQLGGAGYAYTNWAAKM 206
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL GVPWVMCKQ DAP+ +I+ACNG+YCD + PN KPTLWTE+W GW+T +GG +
Sbjct: 207 AVGLNTGVPWVMCKQDDAPDPVINACNGFYCDYFSPNKPYKPTLWTESWSGWFTEFGGPI 266
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
RPV+DLAFAVARF Q+GGS++NYYMY GGTNFGR++GGPF TSYDYDAPIDEYGL+
Sbjct: 267 YQRPVQDLAFAVARFIQKGGSYINYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGLIR 326
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPK+GHL DLH AIK CE ALV++D LG ++AHV+ + C+AFLAN
Sbjct: 327 EPKYGHLMDLHKAIKQCERALVSSDPT-VTSLGAYEQAHVFSSK----NGACAAFLANYH 381
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
++AA VTF + Y LPPWS+SILPDC+ VFNTA+V QT+ +
Sbjct: 382 SNSAARVTFNNRKYDLPPWSISILPDCKTDVFNTARVRFQTT-----------------K 424
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENN-FTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
M+ S +S SW T E + SE++ T G+LE LN T+D SDYLW+IT + +S
Sbjct: 425 IQMLPS--NSKLFSWETYDEDVSSLSESSKITASGLLEQLNATRDTSDYLWYITSVDISS 482
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDL 580
+ SF + +P++++ S + VFINGQ GS G PV ++G N +
Sbjct: 483 SE-SFLRGGN-KPSISVHSAGHAVHVFINGQFLGSAFGTSEDRSCTFNGPVNLRAGTNKI 540
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS VGL N G E AG G V L G +G DL+ W+YQ+GLKGE + +
Sbjct: 541 ALLSVAVGLPNVGFHFETWKAGITG-VLLYGLDHGQKDLTWQKWSYQIGLKGEAMNL--V 597
Query: 641 EENEAEWTDLTRDGI----PSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGR 696
N D RD + S W+K YF+APDG++P+ALDL SMGKGQ W+NG IGR
Sbjct: 598 SPNGVSSVDWVRDSLDVRSQSQLKWHKAYFNAPDGVEPLALDLSSMGKGQVWINGQSIGR 657
Query: 697 YWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG 756
YW V A KG C ++C+Y G Y KC CG PTQ WYHVPRSWL+ +NNL+V+ EE GG
Sbjct: 658 YWMVYA-KGAC-NSCNYAGTYRPAKCQLGCGQPTQQWYHVPRSWLKPTNNLIVLLEELGG 715
Query: 757 NPFEISVKLR 766
NP++IS++ R
Sbjct: 716 NPWKISLQKR 725
>gi|357139090|ref|XP_003571118.1| PREDICTED: beta-galactosidase 4-like [Brachypodium distachyon]
Length = 787
Score = 769 bits (1986), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/726 (51%), Positives = 484/726 (66%), Gaps = 38/726 (5%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYDHR+++I+G RR+LIS IHYPR+TPEMWP LI K+K+GG DV++TYVFWN HE ++
Sbjct: 94 VSYDHRSLVINGRRRILISGSIHYPRSTPEMWPGLIQKAKDGGLDVVQTYVFWNGHEPVK 153
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY F + D+++FVKLV +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N P
Sbjct: 154 GQYYFSDRYDLIRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGP 213
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK EMQRFV+KIV +M+ E LF WQGGPIIM Q+ENE+G MES+ G K Y WAA MA
Sbjct: 214 FKAEMQRFVEKIVSMMKSERLFEWQGGPIIMSQVENEFGPMESAGGVGAKPYANWAAKMA 273
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+ GVPWVMCKQ DAP+ +I+ CNG+YCD + PN NKP +WTE W GW+T++GG +P
Sbjct: 274 VATNTGVPWVMCKQEDAPDPVINTCNGFYCDYFTPNKKNKPAMWTEAWTGWFTSFGGAVP 333
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
HRPVED+AFAVARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDE+GLL +
Sbjct: 334 HRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVATSYDYDAPIDEFGLLRQ 393
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHL+DLH AIK EP LV+ D LG ++A+V+++ C+AFL+N
Sbjct: 394 PKWGHLRDLHKAIKQAEPTLVSGDPT-IQSLGNYEKAYVFKSK----NGACAAFLSNYHM 448
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
++A V F G+ Y LP WS+SILPDC+ VFNTA V T + + P +
Sbjct: 449 NSAVKVRFNGRHYDLPAWSISILPDCKTVVFNTATVKEPTLLPK------MHPVVRF--- 499
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
+W + E ++ FT G++E L++T D SDYLW+ T + + +
Sbjct: 500 ------------TWQSYSEDTNSLDDSAFTKDGLVEQLSMTWDKSDYLWYTTFVNIGPGE 547
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLIL 582
+S N P +T+ S ++VF+NG+ GSV G + + V+ G N + +
Sbjct: 548 LS---KNGQWPQLTVYSAGHSMQVFVNGKSYGSVYGGFENPKLTYDGHVKMWQGSNKISI 604
Query: 583 LSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEE 642
LS VGL N G E+ G G V L+G G DLS WTYQVGLKGE I+++
Sbjct: 605 LSSAVGLPNVGDHFERWNVGVLGPVTLSGLSEGKRDLSHQKWTYQVGLKGESLGIHTVSG 664
Query: 643 NEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVV 701
+ A EW G TW+K F+AP G DPVALD+GSMGKGQ WVNGHH+GRYW+
Sbjct: 665 SSAVEWGG---PGSKQPLTWHKALFNAPSGSDPVALDMGSMGKGQMWVNGHHVGRYWSYK 721
Query: 702 APKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEI 761
AP GC C Y G Y DKC ++CG +Q WYHVPRSWL+ NLLV+ EE GG+ +
Sbjct: 722 APSRGC-GGCSYAGTYREDKCRSSCGELSQRWYHVPRSWLKPGGNLLVVLEEYGGDVAGV 780
Query: 762 SVKLRS 767
++ R+
Sbjct: 781 TLATRT 786
>gi|61162199|dbj|BAD91081.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 725
Score = 769 bits (1986), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/750 (51%), Positives = 498/750 (66%), Gaps = 41/750 (5%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
M +++ SC+ S+++++ V YDH+AIII+G RR+LIS IHYPR+TP MWPDLI
Sbjct: 8 MWSILLLFSCIFSAASAS------VGYDHKAIIINGQRRILISGSIHYPRSTPGMWPDLI 61
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K+K GG DVI+TYVFWN HE G+Y F+ + D+VKF+KLV +GL++ LRIGPYVCAE
Sbjct: 62 QKAKAGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAE 121
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
WNFGGFP+WL+ +PGI FRT+N PFK MQ+F +KIV++M+ E LF QGGPII+ QIEN
Sbjct: 122 WNFGGFPIWLKYVPGIAFRTDNEPFKAAMQKFTEKIVNMMKAEKLFQTQGGPIILSQIEN 181
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPN 262
E+G +E G GK Y KWAA MA+GL GVPW+MCKQ DAP+ +ID CNGYYC+ +KPN
Sbjct: 182 EFGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGYYCENFKPN 241
Query: 263 SYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRT 322
KP +WTE W GWYT +GG +P RP EDLAF+VARF Q GGSF NYYMY GGTNFGRT
Sbjct: 242 KVYKPKMWTEVWTGWYTEFGGAIPTRPAEDLAFSVARFIQSGGSFFNYYMYHGGTNFGRT 301
Query: 323 SGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQE 382
+GGPF TSYDYDAP+DEYGLL +PKWGHL+DLH AIK CE ALVA D + KLG NQE
Sbjct: 302 AGGPFMATSYDYDAPLDEYGLLQQPKWGHLRDLHKAIKSCEHALVAVDPS-VTKLGNNQE 360
Query: 383 AHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKV 442
AHV+ S+S C+AFLAN D + V+F Y LPPWS+SILPDC+ VFNTAKV
Sbjct: 361 AHVFN-----SKSGCAAFLANHDTKYSVRVSFGHGQYDLPPWSISILPDCKTAVFNTAKV 415
Query: 443 SSQTSIKTVEFSLPLSPNIS-VPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGIL 501
+ + S + + P S +P QS IE +S T+ G+
Sbjct: 416 AWKAS------EVQMKPVYSRLPWQSFIEETTTSDETG--------------TTTLDGLY 455
Query: 502 EHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVI 561
E + +T+D +DYLW++T I + D+ +F K + P +TI S L VFINGQL+G+V
Sbjct: 456 EQIYMTRDATDYLWYMTDITIGSDE-AFLKNGKF-PLLTIFSAGHALHVFINGQLSGTVY 513
Query: 562 GHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI 617
G + Q V+ + G N L LLS +VGL N G E G G + L G G
Sbjct: 514 GSLENPKLTFSQNVKLRPGINKLALLSISVGLPNVGTHFETWNTGVLGPISLKGLNTGTW 573
Query: 618 DLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVA 676
D+S+ WTY++G+KGE ++++ + +W + TWYK FDAP G P+A
Sbjct: 574 DMSRWKWTYKIGMKGESLGLHTVTGSSSVDWAEGPSMAQKQPLTWYKATFDAPPGHAPLA 633
Query: 677 LDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHV 736
LD+GSMGKGQ W+NG +GR+W +G C + C Y G +N KC T CG P+Q WYH+
Sbjct: 634 LDMGSMGKGQIWINGQSVGRHWPGYIAQGSCGN-CYYAGTFNDKKCRTYCGKPSQRWYHI 692
Query: 737 PRSWLQASNNLLVIFEETGGNPFEISVKLR 766
PRSWL + NLLV+FEE GG+P +S+ R
Sbjct: 693 PRSWLTPTGNLLVVFEEWGGDPSWMSLVER 722
>gi|54111247|dbj|BAC10578.2| beta-galactosidase [Capsicum annuum]
Length = 724
Score = 767 bits (1981), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/728 (52%), Positives = 488/728 (67%), Gaps = 34/728 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NVSYD RAI+I+G R++LIS IHYPR+TP+MWPDLI K+K+GG DVIETYVFWN HE
Sbjct: 24 NVSYDDRAIVINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGHEPS 83
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+YNF+G+ D+VKF+KLV +GLY+ LRIGPY+CAEWNFGG PVWL+ + G+EFRT+N
Sbjct: 84 PGKYNFEGRYDLVKFIKLVQGAGLYVNLRIGPYICAEWNFGGLPVWLKYVSGMEFRTDNQ 143
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ FV+KIV +M+ E LF QGGPIIM QIENEYG +E G GK Y KWAA M
Sbjct: 144 PFKVAMQGFVQKIVSMMKSEKLFEPQGGPIIMAQIENEYGPVEWEIGAPGKAYTKWAAQM 203
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL VPW+MCKQ DAP+ +ID CNG+YC+G++PN KP +WTE W GW+T +GG +
Sbjct: 204 AVGLKTDVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKPYKPKMWTEVWTGWFTKFGGPI 263
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P RP ED+AF+VARF Q GS+ NYYMY GGTNFGRTS G F TSYDYDAPIDEYGLL+
Sbjct: 264 PQRPAEDIAFSVARFVQNNGSYFNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYGLLN 323
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPK+GHL++LH AIK CEPALV++ LG NQEAHVYR+ C+AFL+N D
Sbjct: 324 EPKYGHLRELHKAIKQCEPALVSS-YPTVTSLGSNQEAHVYRSK----SGACAAFLSNYD 378
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ V+F Y LPPWS+SILPDC+ V+NTAKVSSQ S S+ ++P
Sbjct: 379 AKYSVRVSFQNLPYDLPPWSISILPDCKTVVYNTAKVSSQGS------SIKMTP------ 426
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENN-FTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
+ SW + E +++ G+ E NVT+D SDYLW++T I ++
Sbjct: 427 --------AGGGLSWQSYNEDTPTADDSDTLRANGLWEQRNVTRDSSDYLWYMTDINIAS 478
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDL 580
++ F K+ + P +T+ S VL VF+NG+L G+V G + V+ +G N +
Sbjct: 479 NE-GFLKSGK-DPYLTVMSAGHVLHVFVNGKLAGTVYGALDNPKLTYSGNVKLNAGINKI 536
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS +VGL N G + AG G V L+G G DL+K W+Y+VGLKGE ++++
Sbjct: 537 SLLSVSVGLPNVGVHYDTWNAGVLGPVTLSGLNEGSRDLAKQKWSYKVGLKGESLSLHTL 596
Query: 641 E-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
+ EW + TWYK F AP G +P+ALD+ SMGKGQ W+NG +GR+W
Sbjct: 597 SGSSSVEWVQGSLVARTQPLTWYKATFSAPGGNEPLALDMASMGKGQIWINGEGVGRHWP 656
Query: 700 VVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPF 759
A +G C C Y G +N KC TNCG P+Q WYHVPRSWL+ S NLLV+FEE GG+P
Sbjct: 657 GYAAQGDCSK-CSYAGTFNEKKCQTNCGQPSQRWYHVPRSWLKTSGNLLVVFEEWGGDPT 715
Query: 760 EISVKLRS 767
IS+ RS
Sbjct: 716 GISLVRRS 723
>gi|75134155|sp|Q6Z6K4.1|BGAL4_ORYSJ RecName: Full=Beta-galactosidase 4; Short=Lactase 4; Flags:
Precursor
gi|46805855|dbj|BAD17189.1| putative beta-galactosidase precursor [Oryza sativa Japonica Group]
Length = 729
Score = 767 bits (1981), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/727 (52%), Positives = 489/727 (67%), Gaps = 40/727 (5%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYD R+++I+G RR+L+S IHYPR+TPEMWP LI K+K+GG DVI+TYVFWN HE ++
Sbjct: 38 VSYDRRSLVINGRRRILLSGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHEPVQ 97
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY F + D+V+FVKLV +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PG+ FRT+N P
Sbjct: 98 GQYYFSDRYDLVRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVSFRTDNGP 157
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK EMQ+FV+KIV +M+ E LF WQGGPIIM Q+ENE+G MES G K Y WAA MA
Sbjct: 158 FKAEMQKFVEKIVSMMKSEGLFEWQGGPIIMSQVENEFGPMESVGGSGAKPYANWAAKMA 217
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+G GVPWVMCKQ DAP+ +I+ CNG+YCD + PN KP++WTE W GW+T++GG +P
Sbjct: 218 VGTNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKNYKPSMWTEAWTGWFTSFGGGVP 277
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
HRPVEDLAFAVARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDE+GLL +
Sbjct: 278 HRPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLLRQ 337
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHL+DLH AIK EP LV+AD +G ++A+V++A C+AFL+N
Sbjct: 338 PKWGHLRDLHRAIKQAEPVLVSADPT-IESIGSYEKAYVFKAK----NGACAAFLSNYHM 392
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+TA V F GQ Y LP WS+SILPDC+ VFNTA V T + ++P +
Sbjct: 393 NTAVKVRFNGQQYNLPAWSISILPDCKTAVFNTATVKEPTLMPK------MNPVVRF--- 443
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
+W + E S++ FT G++E L++T D SDYLW+ T + + +D
Sbjct: 444 ------------AWQSYSEDTNSLSDSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTND 491
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLIL 582
+ ++ P +T+ S ++VF+NG+ GSV G + + V+ G N + +
Sbjct: 492 LRSGQS----PQLTVYSAGHSMQVFVNGKSYGSVYGGYDNPKLTYNGRVKMWQGSNKISI 547
Query: 583 LSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEE 642
LS VGL N G E G G V L+ G DLS WTYQVGLKGE ++++
Sbjct: 548 LSSAVGLPNVGNHFENWNVGVLGPVTLSSLNGGTKDLSHQKWTYQVGLKGETLGLHTVTG 607
Query: 643 NEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVV 701
+ A EW G TW+K +F+AP G DPVALD+GSMGKGQ WVNGHH+GRYW+
Sbjct: 608 SSAVEWGG---PGGYQPLTWHKAFFNAPAGNDPVALDMGSMGKGQLWVNGHHVGRYWSYK 664
Query: 702 APKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEI 761
A GGC C Y G Y+ DKC +NCG+ +Q WYHVPRSWL+ NLLV+ EE GG+ +
Sbjct: 665 A-SGGC-GGCSYAGTYHEDKCRSNCGDLSQRWYHVPRSWLKPGGNLLVVLEEYGGDLAGV 722
Query: 762 SVKLRST 768
S+ R+T
Sbjct: 723 SLATRTT 729
>gi|13936236|gb|AAK40304.1| beta-galactosidase [Capsicum annuum]
Length = 724
Score = 767 bits (1980), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/728 (52%), Positives = 488/728 (67%), Gaps = 34/728 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NVSYD RAI+I+G R++LIS IHYPR+TP+MWPDLI K+K+GG DVIETYVFWN HE
Sbjct: 24 NVSYDDRAIVINGKRKILISGSIHYPRSTPQMWPDLIEKAKDGGLDVIETYVFWNGHEPS 83
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+YNF+G+ D+VKF+KLV +GLY+ LRIGPY+CAEWNFGG PVWL+ + G+EFRT+N
Sbjct: 84 PGKYNFEGRYDLVKFIKLVQGAGLYVNLRIGPYICAEWNFGGLPVWLKYVSGMEFRTDNQ 143
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ FV+KIV +M+ E LF QGGPIIM QIENEYG +E G GK Y KWAA M
Sbjct: 144 PFKVAMQGFVQKIVSMMKSEKLFEPQGGPIIMAQIENEYGPVEWEIGAPGKAYTKWAAQM 203
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL VPW+MCKQ DAP+ +ID CNG+YC+G++PN KP +WTE W GW+T +GG +
Sbjct: 204 AVGLKTDVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKPYKPKMWTEVWTGWFTKFGGPI 263
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P RP ED+AF+VARF Q GS+ NYYMY GGTNFGRTS G F TSYDYDAPIDEYGLL+
Sbjct: 264 PQRPAEDIAFSVARFVQNNGSYFNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYGLLN 323
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPK+GHL++LH AIK CEPALV++ LG NQEAHVYR+ C+AFL+N D
Sbjct: 324 EPKYGHLRELHKAIKQCEPALVSS-YPTVTSLGSNQEAHVYRSK----SGACAAFLSNYD 378
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ V+F Y LPPWS+SILPDC+ V+NTAKVSSQ S S+ ++P
Sbjct: 379 AKYSVRVSFQNLPYDLPPWSISILPDCKTVVYNTAKVSSQGS------SIKMTP------ 426
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENN-FTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
+ SW + E +++ G+ E NVT+D SDYLW++T + ++
Sbjct: 427 --------AGGGLSWQSYNEDTPTADDSDTLRANGLWEQRNVTRDSSDYLWYMTDVNIAS 478
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDL 580
++ F K+ + P +T+ S VL VF+NG+L G+V G + V+ +G N +
Sbjct: 479 NE-GFLKSGK-DPYLTVMSAGHVLHVFVNGKLAGTVYGALDNPKLTYSGNVKLNAGINKI 536
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS +VGL N G + AG G V L+G G DL+K W+Y+VGLKGE ++++
Sbjct: 537 SLLSVSVGLPNVGVHYDTWNAGVLGPVTLSGLNEGSRDLAKQKWSYKVGLKGESLSLHTL 596
Query: 641 E-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
+ EW + TWYK F AP G +P+ALD+ SMGKGQ W+NG +GR+W
Sbjct: 597 SGSSSVEWVQGSLVARTQPLTWYKATFSAPGGNEPLALDMASMGKGQIWINGEGVGRHWP 656
Query: 700 VVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPF 759
A +G C C Y G +N KC TNCG P+Q WYHVPRSWL+ S NLLV+FEE GG+P
Sbjct: 657 GYAAQGDCSK-CSYAGTFNEKKCQTNCGQPSQRWYHVPRSWLKTSGNLLVVFEEWGGDPT 715
Query: 760 EISVKLRS 767
IS+ RS
Sbjct: 716 GISLVRRS 723
>gi|449452747|ref|XP_004144120.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 782
Score = 767 bits (1980), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/730 (53%), Positives = 489/730 (66%), Gaps = 38/730 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YDH+AIII+G RR+LIS IHYPR+TP+MWPDLI K+K+GG D+IETYVFWN HE
Sbjct: 83 SVTYDHKAIIINGQRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDIIETYVFWNGHEPS 142
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+Y F+ + D+V+F+KLV +GLY+ LRIGPYVCAEWN+GGFP+WL+ +PGI FRT+NA
Sbjct: 143 PGKYYFEERYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYGGFPLWLKFVPGIAFRTDNA 202
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ+FV KIVD+M+ E LF QGGPII+ QIENEYG +E G GK Y KWAA M
Sbjct: 203 PFKAAMQKFVYKIVDMMKWEKLFHTQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAAQM 262
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL GVPWVMCKQ DAP+ +ID CNG+YC+ +KPN KP +WTENW GWYT +GG
Sbjct: 263 AVGLKTGVPWVMCKQEDAPDPLIDTCNGFYCENFKPNQIYKPKIWTENWSGWYTAFGGPT 322
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RP ED+AF+VARF Q GGS +NYYMY GGTNFGRTS G F TSYD+DAPIDEYGLL
Sbjct: 323 PYRPPEDVAFSVARFIQNGGSLVNYYMYHGGTNFGRTS-GLFVTTSYDFDAPIDEYGLLR 381
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPKWGHL+DLH AIKLCEPALV+AD LG+NQEA V+++ S C+AFLAN D
Sbjct: 382 EPKWGHLRDLHKAIKLCEPALVSADPTS-TWLGKNQEARVFKS----SSGACAAFLANYD 436
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSL-PLSPNISVP 464
V F Y LPPWS+SILPDC+ FNT S Q +K+ E + P+S
Sbjct: 437 TSAFVRVNFWNHPYDLPPWSISILPDCKTVTFNTG--SLQIGVKSYEAKMTPIS------ 488
Query: 465 QQSMIESKLSSTSKSWMTVK-EPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
S W++ K EP ++++ T G++E ++VT D +DYLW+I I +
Sbjct: 489 ------------SFWWLSYKEEPASAYAQDTTTKDGLVEQVSVTWDTTDYLWYILSIRI- 535
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYND 579
D F K+ + P +T++S +L VFINGQL+GSV G + + V + G N
Sbjct: 536 DSTEGFLKSGQ-WPLLTVNSAGHILHVFINGQLSGSVYGSLEDPRITFSKYVNLKQGVNK 594
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS 639
L +LS TVGL N G + AG G V L G G D+SK W+Y+VGL+GE +YS
Sbjct: 595 LSMLSVTVGLPNVGLHFDTWNAGVLGPVTLKGLNEGTRDMSKYKWSYKVGLRGEILNLYS 654
Query: 640 IE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
++ N +W + P TWYKT F+ P G +P+ALD+ SM KGQ WVNG IGRY+
Sbjct: 655 VKGSNSVQWMKGSFQKQP--LTWYKTTFNTPAGNEPLALDMSSMSKGQIWVNGRSIGRYF 712
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
+G C + C Y G + KC NCG P+Q WYH+PR WL + NLL+I EE GGNP
Sbjct: 713 PGYIARGKC-NKCSYTGFFTEKKCLWNCGGPSQKWYHIPRDWLSPNGNLLIILEEIGGNP 771
Query: 759 FEISVKLRST 768
IS+ R+
Sbjct: 772 QGISLVKRTA 781
>gi|3860321|emb|CAA10128.1| beta-galactosidase [Cicer arietinum]
Length = 745
Score = 767 bits (1980), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/731 (52%), Positives = 490/731 (67%), Gaps = 39/731 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YD +AIII+G RR+LIS IHYPR+TPEMW DLI K+K GG DVI+TYVFWN HE
Sbjct: 27 SVTYDRKAIIINGQRRILISGSIHYPRSTPEMWEDLIQKAKVGGLDVIDTYVFWNVHEPS 86
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
YNF+G+ D+V+F+K V GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N
Sbjct: 87 PSNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNG 146
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ F +KIV +M+ E LF QGGPII+ QIENEYG + G G Y WAA M
Sbjct: 147 PFKAAMQGFTQKIVQMMKNEKLFQSQGGPIILSQIENEYGPQGRALGAVGHAYSNWAAKM 206
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GLG GVPWVMCK+ DAP+ +I++CNG+YCD + PN KP LWTE+W GW++ +GG +
Sbjct: 207 AVGLGTGVPWVMCKEDDAPDPVINSCNGFYCDDFSPNKPYKPKLWTESWSGWFSEFGGPV 266
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P RP +DLAFAVARF Q+GGSF NYYMY GGTNFGR++GGPF TSYDYDAPIDEYGLL
Sbjct: 267 PQRPAQDLAFAVARFIQKGGSFFNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGLLR 326
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPK+GHLKDLH AIK CE ALV++D LG ++AHV+ + G+Q+ C+AFLAN
Sbjct: 327 EPKYGHLKDLHKAIKQCEHALVSSDPT-VTSLGAYEQAHVFSS---GTQT-CAAFLANYH 381
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
++AA VTF + Y LPPWS+SILPDC+ VFNTA+V Q S +
Sbjct: 382 SNSAARVTFNNRHYDLPPWSISILPDCKTDVFNTARVRFQNS-----------------K 424
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENN-FTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
M+ S +S SW T E + +E++ T G+LE +N T+D SDYLW+IT + +S
Sbjct: 425 IQMLPS--NSKLLSWETYDEDVSSLAESSRITASGLLEQINATRDTSDYLWYITSVDISP 482
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDL 580
+ SF + +P++++ S D + VFING+ +GS G P+ +G N +
Sbjct: 483 SE-SFLRGGN-KPSISVHSSGDAVHVFINGKFSGSAFGTREQRSCTFNGPINLHAGTNKI 540
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS VGL N G E G G + L G +G DL+ W+YQVGLKGE + +
Sbjct: 541 ALLSVAVGLPNGGIHFESWKTGITGPILLHGLDHGQKDLTWQKWSYQVGLKGEAMNL--V 598
Query: 641 EENEAEWTDLTRDGIPS----TFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGR 696
N D R+ + S W+K YF+APDG + +ALD+ MGKGQ W+NG IGR
Sbjct: 599 SPNGVSSVDWVRESLASQNQPQLKWHKAYFNAPDGNEALALDMSGMGKGQVWINGQSIGR 658
Query: 697 YWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG 756
YW V A KG C ++C+Y G Y KC CG PTQ WYHVPRSWL+ +NNL+V+FEE GG
Sbjct: 659 YWLVYA-KGNC-NSCNYAGTYRQAKCQLGCGQPTQRWYHVPRSWLKPTNNLMVVFEELGG 716
Query: 757 NPFEISVKLRS 767
NP++IS+ R+
Sbjct: 717 NPWKISLVKRT 727
>gi|84579371|dbj|BAE72074.1| pear beta-galactosidase2 [Pyrus communis]
Length = 725
Score = 766 bits (1978), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/750 (51%), Positives = 497/750 (66%), Gaps = 41/750 (5%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
M +++ SC+ S+++++ V YDH+AIII+G RR+LIS IHYPR+TP MWPDLI
Sbjct: 8 MWSILLLFSCIFSAASAS------VGYDHKAIIINGQRRILISGSIHYPRSTPGMWPDLI 61
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K+K GG DVI+TYVFWN HE G+Y F+ + D+VKF+KLV +GL++ LRIGPYVCAE
Sbjct: 62 QKAKAGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAE 121
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
WNFGGFP+WL+ +PGI FRT+N PFK MQ+F +KIV++M+ E LF QGGPII+ QIEN
Sbjct: 122 WNFGGFPIWLKYVPGIAFRTDNEPFKAAMQKFTEKIVNMMKAEKLFQTQGGPIILSQIEN 181
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPN 262
E+G +E G GK Y KWAA MA+GL GVPW+MCKQ DAP+ +ID CNGYYC+ +KPN
Sbjct: 182 EFGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGYYCENFKPN 241
Query: 263 SYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRT 322
KP +WTE W GWYT +GG +P RP EDLAF+VARF Q GGSF NYYMY GGTNFGRT
Sbjct: 242 KVYKPKMWTEVWTGWYTEFGGAIPTRPAEDLAFSVARFIQSGGSFFNYYMYHGGTNFGRT 301
Query: 323 SGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQE 382
+GGPF TSYDYDAP+DEYGLL +PKWGHL+DLH AIK CE ALVA D + KLG NQE
Sbjct: 302 AGGPFMATSYDYDAPLDEYGLLQQPKWGHLRDLHKAIKSCEHALVAVDPS-VTKLGNNQE 360
Query: 383 AHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKV 442
AHV+ S+S C+AFLAN D + V+F Y LPPWS+SILPDC+ VFNTAKV
Sbjct: 361 AHVFN-----SKSGCAAFLANYDTKYSVRVSFGHGQYDLPPWSISILPDCKTAVFNTAKV 415
Query: 443 SSQTSIKTVEFSLPLSPNIS-VPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGIL 501
+ + S + + P S +P QS IE +S T+ G+
Sbjct: 416 AWKAS------EVQMKPVYSRLPWQSFIEETTTSDETG--------------TTTLDGLY 455
Query: 502 EHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVI 561
E + +T+D +DYLW++T I + D+ +F K + P +TI S L VFINGQL+G+V
Sbjct: 456 EQIYMTRDATDYLWYMTDITIGSDE-AFLKNGKF-PLLTIFSAGHALHVFINGQLSGTVY 513
Query: 562 GHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI 617
G + Q V+ + G N L LLS +VGL N G E G G + L G G
Sbjct: 514 GSLENPKLTFSQNVKLRPGINKLALLSISVGLPNVGTHFETWNTGVLGPISLKGLNTGTW 573
Query: 618 DLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVA 676
D+S+ WTY++G+KGE ++++ + +W + TWYK FDAP G P+A
Sbjct: 574 DMSRWKWTYKIGMKGESLGLHTVTGSSSVDWAEGPSMAQKQPLTWYKATFDAPPGHAPLA 633
Query: 677 LDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHV 736
LD+GSMGKGQ W+NG +GR+W +G C + C Y G +N KC T CG P+Q W H+
Sbjct: 634 LDMGSMGKGQIWINGQSVGRHWPGYIAQGSCGN-CYYAGTFNDKKCRTYCGKPSQRWCHI 692
Query: 737 PRSWLQASNNLLVIFEETGGNPFEISVKLR 766
PRSWL + NLLV+FEE GG+P +S+ R
Sbjct: 693 PRSWLTPTGNLLVVFEEWGGDPSWMSLVER 722
>gi|212274513|ref|NP_001130532.1| uncharacterized protein LOC100191631 precursor [Zea mays]
gi|194689400|gb|ACF78784.1| unknown [Zea mays]
gi|224030521|gb|ACN34336.1| unknown [Zea mays]
gi|413922054|gb|AFW61986.1| beta-galactosidase isoform 1 [Zea mays]
gi|413922055|gb|AFW61987.1| beta-galactosidase isoform 2 [Zea mays]
gi|413954366|gb|AFW87015.1| beta-galactosidase isoform 1 [Zea mays]
gi|413954367|gb|AFW87016.1| beta-galactosidase isoform 2 [Zea mays]
Length = 722
Score = 764 bits (1974), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/727 (52%), Positives = 476/727 (65%), Gaps = 37/727 (5%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYDHRA++I+G RR+LIS IHYPR+TPEMWP L+ K+K+GG DV++TYVFWN HE +R
Sbjct: 28 VSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTYVFWNGHEPVR 87
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY F + D+V+FVKL +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N P
Sbjct: 88 GQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGP 147
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ FV+KIV +M+ E LF WQGGPII+ Q+ENEYG MES G K Y WAA MA
Sbjct: 148 FKAAMQAFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGAGAKPYANWAAKMA 207
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+ GAGVPWVMCKQ DAP+ +I+ CNG+YCD + PNS +KPT+WTE W GW+T +GG +P
Sbjct: 208 VATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNSNSKPTMWTEAWTGWFTAFGGAVP 267
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
HRPVED+AFAVARF Q+GGSF+NYYMY GGTNF RTSGGPF TSYDYDAPIDEYGLL +
Sbjct: 268 HRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGLLRQ 327
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHL+DLH AIK EPALV+ D LG ++A+V+++ S C+AFL+N
Sbjct: 328 PKWGHLRDLHKAIKQAEPALVSGDPT-IQSLGNYEKAYVFKS----SGGACAAFLSNYHT 382
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
AA V F G+ Y LP WS+S+LPDC+ VFNTA VS P +P P
Sbjct: 383 SAAARVVFNGRRYDLPAWSISVLPDCKAAVFNTATVSE-----------PSAPARMSPAG 431
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
SW + E FT G++E L++T D SDYLW+ T + ++ ++
Sbjct: 432 GF----------SWQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNE 481
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLIL 582
F K+ + P +TI S L+VF+NGQ G+V G + + V+ G N + +
Sbjct: 482 -QFLKSGQ-WPQLTIYSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISI 539
Query: 583 LSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE- 641
LS VGL N G E G G V L+G G DLS WTYQ+GL GE + S+
Sbjct: 540 LSAAVGLPNQGTHYETWNVGVLGPVTLSGLNEGKRDLSDQKWTYQIGLHGESLGVQSVAG 599
Query: 642 ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVV 701
+ EW TW+K YF AP G PVALD+GSMGKGQAWVNG HIGRYW+
Sbjct: 600 SSSVEWGSAAGK---QPLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYWSYK 656
Query: 702 APKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEI 761
A GC C Y G Y+ KC T CG+ +Q +YHVPRSWL S NLLV+ EE GG+ +
Sbjct: 657 ASSSGC-GGCSYAGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVMLEEFGGDLSGV 715
Query: 762 SVKLRST 768
+ R+
Sbjct: 716 KLVTRTA 722
>gi|152013361|sp|A2X2H7.1|BGAL4_ORYSI RecName: Full=Beta-galactosidase 4; Short=Lactase 4; Flags:
Precursor
gi|125538642|gb|EAY85037.1| hypothetical protein OsI_06394 [Oryza sativa Indica Group]
Length = 729
Score = 764 bits (1974), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/727 (52%), Positives = 488/727 (67%), Gaps = 40/727 (5%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYD R+++I+G RR+L+S IHYPR+TPEMWP LI K+K+GG DVI+TYVFWN HE ++
Sbjct: 38 VSYDRRSLVINGRRRILLSGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHEPVQ 97
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY F + D+V+FVKLV +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PG+ FRT+N P
Sbjct: 98 GQYYFSDRYDLVRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVSFRTDNGP 157
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK EMQ+FV+KIV +M+ E LF WQGGPIIM Q+ENE+G MES G K Y WAA MA
Sbjct: 158 FKAEMQKFVEKIVSMMKSEGLFEWQGGPIIMSQVENEFGPMESVGGSGAKPYANWAAKMA 217
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+ GVPWVMCKQ DAP+ +I+ CNG+YCD + PN KP++WTE W GW+T++GG +P
Sbjct: 218 VRTNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKNYKPSMWTEAWTGWFTSFGGGVP 277
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
HRPVEDLAFAVARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDE+GLL +
Sbjct: 278 HRPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLLRQ 337
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHL+DLH AIK EP LV+AD +G ++A+V++A C+AFL+N
Sbjct: 338 PKWGHLRDLHRAIKQAEPVLVSADPT-IESIGSYEKAYVFKAK----NGACAAFLSNYHM 392
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+TA V F GQ Y LP WS+SILPDC+ VFNTA V T + ++P +
Sbjct: 393 NTAVKVRFNGQQYNLPAWSISILPDCKTAVFNTATVKEPTLMPK------MNPVVRF--- 443
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
+W + E S++ FT G++E L++T D SDYLW+ T + + +D
Sbjct: 444 ------------AWQSYSEDTNSLSDSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTND 491
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLIL 582
+ ++ P +T+ S ++VF+NG+ GSV G + + V+ G N + +
Sbjct: 492 LRSGQS----PQLTVYSAGHSMQVFVNGKSYGSVYGGYDNPKLTYNGRVKMWQGSNKISI 547
Query: 583 LSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEE 642
LS VGL N G E G G V L+ G DLS WTYQVGLKGE ++++
Sbjct: 548 LSSAVGLPNVGNHFENWNVGVLGPVTLSSLNGGTKDLSHQKWTYQVGLKGETLGLHTVTG 607
Query: 643 NEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVV 701
+ A EW G TW+K +F+AP G DPVALD+GSMGKGQ WVNGHH+GRYW+
Sbjct: 608 SSAVEWGG---PGGYQPLTWHKAFFNAPAGNDPVALDMGSMGKGQLWVNGHHVGRYWSYK 664
Query: 702 APKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEI 761
A GGC C Y G Y+ DKC +NCG+ +Q WYHVPRSWL+ NLLV+ EE GG+ +
Sbjct: 665 A-SGGC-GGCSYAGTYHEDKCRSNCGDLSQRWYHVPRSWLKPGGNLLVVLEEYGGDLAGV 722
Query: 762 SVKLRST 768
S+ R+T
Sbjct: 723 SLATRTT 729
>gi|7682677|gb|AAF67341.1| beta galactosidase [Vigna radiata]
Length = 721
Score = 764 bits (1974), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/750 (51%), Positives = 489/750 (65%), Gaps = 43/750 (5%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
++M++ C ++S V+YDH+AI+IDG RR+LIS IHYPR+TP+MWPDLI
Sbjct: 10 VLMLLFFWVCGVTAS---------VTYDHKAIVIDGKRRILISGSIHYPRSTPQMWPDLI 60
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K+K+GG DVI+TYVFWN HE G+Y F+ + D+V+FVKL +GLY+ LRIGPY+CAE
Sbjct: 61 QKAKDGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVRFVKLAQQAGLYVHLRIGPYICAE 120
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
WNFGGFPVWL+ +PGI FRT+N PFK MQ+F KIV LM+EE LF QGGPII+ QIEN
Sbjct: 121 WNFGGFPVWLKYVPGIAFRTDNEPFKAAMQKFTAKIVSLMKEERLFQSQGGPIILSQIEN 180
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPN 262
EYG +E G GK Y KWAA MA+GL GVPWVMCKQ DAP+ +ID CNG+YC+ +KPN
Sbjct: 181 EYGPVEWEIGAPGKSYTKWAAQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGFYCENFKPN 240
Query: 263 SYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRT 322
KP +WTENW GWYT +GG P RP EDLAF+VARF Q GGSF+NYYMY GGTNFGRT
Sbjct: 241 KNTKPKMWTENWTGWYTDFGGASPIRPAEDLAFSVARFIQNGGSFVNYYMYHGGTNFGRT 300
Query: 323 SGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQE 382
SGG F TSYDYDAP+DEYGL +EPKWGHL+ LH AIK EPALV+ D + LG N E
Sbjct: 301 SGGLFIATSYDYDAPLDEYGLQNEPKWGHLRALHKAIKQSEPALVSTD-PKVTSLGYNLE 359
Query: 383 AHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKV 442
AHV+ + C+AF+AN D ++A TF Y LPPWS+SILPDC+ V+NTA+V
Sbjct: 360 AHVFS-----TPGACAAFIANYDTKSSAKATFGSGQYDLPPWSISILPDCKTVVYNTARV 414
Query: 443 SSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILE 502
N V + + + S + S + +EP +++ + + E
Sbjct: 415 G----------------NGWVKKMTPVNSGFAWQSYN----EEPASSSQDDSIAAEALWE 454
Query: 503 HLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG 562
+NVT+D SDYLW++T +Y++ ++ F K N P +T+ S +L VFINGQL+G+V G
Sbjct: 455 QVNVTRDSSDYLWYMTDVYINGNE-GFLK-NGRSPVLTVMSAGHLLHVFINGQLSGTVYG 512
Query: 563 HW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDID 618
+ V + G N L LLS VGL N G E AG G V L G G D
Sbjct: 513 GLGNPKLTFSDNVNLRVGNNKLSLLSVAVGLPNVGVHFETWNAGVLGPVTLKGLNEGTRD 572
Query: 619 LSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVAL 677
LS+ W+Y+VGLKGE +++ + EW + TWYK F AP G DP+AL
Sbjct: 573 LSRQKWSYKVGLKGEALNLHTESGSSSVEWIQGSLVAKKQPLTWYKATFSAPAGNDPLAL 632
Query: 678 DLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVP 737
DLGSMGKG+ WVNG IGR+W G C + C+Y G Y KC TNCG P+Q WYHVP
Sbjct: 633 DLGSMGKGEVWVNGRSIGRHWPGYIAHGSC-NACNYAGYYTDQKCRTNCGKPSQRWYHVP 691
Query: 738 RSWLQASNNLLVIFEETGGNPFEISVKLRS 767
RSWL + N LV+FEE GG+P I++ R+
Sbjct: 692 RSWLNSGGNSLVVFEEWGGDPNGIALVKRT 721
>gi|326497687|dbj|BAK05933.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 716
Score = 764 bits (1974), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/725 (52%), Positives = 484/725 (66%), Gaps = 36/725 (4%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
SYDHRA++I+G RR+L+S IHYPR+TPEMWPDLI K+K+GG DVI+TYVFWN HE RG
Sbjct: 24 SYDHRAVVINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPARG 83
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
QY+F + D+V+FVKL +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N PF
Sbjct: 84 QYHFADRYDLVRFVKLARQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGPF 143
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
K EMQRFV+KIV +M+ E LF WQGGPII+ Q+ENEYG MES+ G K Y WAA+MA+
Sbjct: 144 KAEMQRFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESAMGAGAKPYANWAANMAV 203
Query: 228 GLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPH 287
AGVPWVMCKQ DAP+ +I+ CNG+YCD + PNS +KPT+WTE W GW+T +GG +PH
Sbjct: 204 ATDAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNSNSKPTMWTEAWTGWFTAFGGPVPH 263
Query: 288 RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEP 347
RPVED+AFAVARF Q+GGSF+NYYMY GGTNF RT+GGPF TSYDYDAPIDEYGL+ +P
Sbjct: 264 RPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLIRQP 323
Query: 348 KWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEH 407
KWGHL+DLH AIK EPALV+ D ++G ++A+V+++ S C+AFL+N
Sbjct: 324 KWGHLRDLHKAIKQAEPALVSGDPT-IQRIGNYEKAYVFKS----STGACAAFLSNYHTS 378
Query: 408 TAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQS 467
+AA + + G+ Y LP WS+SILPDC+ VFNTA V P +P P
Sbjct: 379 SAARIVYNGRRYDLPAWSISILPDCKTAVFNTATVKE-----------PTAPAKMNPAGG 427
Query: 468 MIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDI 527
+W + E + FT G++E L++T D SDYLW+ T + + D
Sbjct: 428 F----------AWQSYSEDTNALDSSAFTKDGLVEQLSMTWDKSDYLWYTTYVNI-DSSE 476
Query: 528 SFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLILL 583
F KT + P +TI+S ++VF+NGQ G G + + +PV+ G N + +L
Sbjct: 477 QFLKTGQ-WPQLTINSAGHSVQVFVNGQSFGVAYGGYNSPKLTYSKPVKMWQGSNKISIL 535
Query: 584 SQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEEN 643
S +GL N G E G G V L+G G DLS WTYQ+GLKGE + SI +
Sbjct: 536 SSAMGLPNQGTHYEAWNVGVLGPVTLSGLNQGKRDLSNQKWTYQIGLKGESLGVNSISGS 595
Query: 644 EAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAP 703
+ P TW+K YF AP G PVALD+GSMGKGQ WVNG++ GRYW+ A
Sbjct: 596 SSVEWSSASGAQP--LTWHKAYFAAPAGSAPVALDMGSMGKGQIWVNGNNAGRYWSYRA- 652
Query: 704 KGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
G C C Y G ++ KC TNCG+ +Q WYHVPRSWL+ S NLLV+ EE GG+ +++
Sbjct: 653 SGSC-GGCSYAGTFSEAKCQTNCGDISQRWYHVPRSWLKPSGNLLVVLEEFGGDLSGVTL 711
Query: 764 KLRST 768
R+T
Sbjct: 712 MTRTT 716
>gi|356556286|ref|XP_003546457.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 721
Score = 764 bits (1972), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/730 (53%), Positives = 477/730 (65%), Gaps = 40/730 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YDH+AI++DG RR+LIS IHYPR+TP+MWPDLI K+K+GG DVI+TYVFWN HE
Sbjct: 24 SVTYDHKAIVVDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIQTYVFWNGHEPS 83
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
GQY F+ + D+VKFVKLV +GLY+ LRIGPY+CAEWNFGGFPVWL+ +PGI FRT+N
Sbjct: 84 PGQYYFEDRFDLVKFVKLVQQAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIAFRTDNE 143
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ+F KIV LM+E LF QGGPIIM QIENEYG +E G GK Y KWAA M
Sbjct: 144 PFKAAMQKFTAKIVSLMKENRLFQSQGGPIIMSQIENEYGPVEWEIGAPGKAYTKWAAQM 203
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL GVPWVMCKQ DAP+ +ID CNGYYC+ +KPN KP +WTENW GWYT +GG +
Sbjct: 204 AVGLDTGVPWVMCKQEDAPDPVIDTCNGYYCENFKPNKNTKPKMWTENWTGWYTDFGGAV 263
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P RP EDLAF+VARF Q GGSF+NYYMY GGTNFGRTSGG F TSYDYDAP+DEYGL +
Sbjct: 264 PRRPAEDLAFSVARFIQNGGSFVNYYMYHGGTNFGRTSGGLFIATSYDYDAPLDEYGLQN 323
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPK+ HL++LH AIK CEPALVA D + LG N EAHV+ + C+AF+AN D
Sbjct: 324 EPKYEHLRNLHKAIKQCEPALVATD-PKVQSLGYNLEAHVFS-----TPGACAAFIANYD 377
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ A TF Y LPPWS+SILPDC+ V+NTAKV + K N +
Sbjct: 378 TKSYAKATFGNGQYDLPPWSISILPDCKTVVYNTAKVGNSWLKKMTPV------NSAFAW 431
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
QS E SS+ ++ + E +NVT+D SDYLW++T +Y++ +
Sbjct: 432 QSYNEEPASSS--------------QADSIAAYALWEQVNVTRDSSDYLWYMTDVYINAN 477
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQP-------VEFQSGYN 578
+ F K N P +T S VL VFIN QL G+V W + P V+ + G N
Sbjct: 478 E-GFLK-NGQSPVLTAMSAGHVLHVFINDQLAGTV---WGGLANPKLTFSDNVKLRVGNN 532
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIY 638
L LLS VGL N G E AG G V L G G DLS W+Y+VGLKGE ++
Sbjct: 533 KLSLLSVAVGLPNVGVHFETWNAGVLGPVTLKGLNEGTRDLSSQKWSYKVGLKGESLSLH 592
Query: 639 SIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
+ + EW + TWYKT F AP G DP+ALDLGSMGKG+ WVNG IGR+
Sbjct: 593 TESGSSSVEWIRGSLVAKKQPLTWYKTTFSAPAGNDPLALDLGSMGKGEVWVNGRSIGRH 652
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
W G C + C+Y G Y KC TNCG P+Q WYHVPRSWL + N LV+FEE GG+
Sbjct: 653 WPGYIAHGSC-NACNYAGFYTDTKCRTNCGQPSQRWYHVPRSWLSSGGNSLVVFEEWGGD 711
Query: 758 PFEISVKLRS 767
P I++ R+
Sbjct: 712 PNGIALVKRT 721
>gi|318136780|gb|ADV41669.1| beta-D-galactosidase [Actinidia deliciosa var. deliciosa]
Length = 728
Score = 763 bits (1971), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/759 (50%), Positives = 493/759 (64%), Gaps = 41/759 (5%)
Query: 16 LSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATP 75
L + + + + + L C S +AS V+YD +AI I+G RR+L S IHYPR+TP
Sbjct: 5 LRIKVLFVCVGLFFLLCCCSVTAS-------VTYDGKAIKINGQRRILFSGSIHYPRSTP 57
Query: 76 EMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRI 135
EMWP LI K+KEGG DVI+TYVFWN HE GQY F+G+ D+V+F+KL +GLY+ LRI
Sbjct: 58 EMWPGLIQKAKEGGLDVIQTYVFWNGHEPSPGQYYFEGRYDLVRFIKLAQQAGLYVHLRI 117
Query: 136 GPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPI 195
G YVCAEWNFGGFPVWL+ +PGI FRT+N PFK MQ+F +KIV+LM+ E LF QGGPI
Sbjct: 118 GLYVCAEWNFGGFPVWLKYVPGIAFRTDNGPFKAAMQKFTEKIVNLMKSEKLFESQGGPI 177
Query: 196 IMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYY 255
IM QIENEYG +E G GK Y KWAA MA+GL GVPW+MCKQ DAP+ IID CNG+Y
Sbjct: 178 IMSQIENEYGPVEWEIGAPGKAYTKWAAEMAVGLDTGVPWIMCKQEDAPDPIIDTCNGFY 237
Query: 256 CDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFG 315
C+G+ PN KP +WTE W GWYT +GG + +RPVEDLA++VARF Q GSF+NYYMY G
Sbjct: 238 CEGFTPNKNYKPKMWTEAWTGWYTEFGGPIHNRPVEDLAYSVARFIQNNGSFVNYYMYHG 297
Query: 316 GTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYI 375
GTNFGRT+ G F TSYDYDAPIDEYGL EPKWGHL+DLH AIKLCEP+LV+A
Sbjct: 298 GTNFGRTAAGLFVATSYDYDAPIDEYGLPREPKWGHLRDLHKAIKLCEPSLVSA-YPTVT 356
Query: 376 KLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNT 435
G+N E HV++ S+S+C+AFLAN D + A VTF Y LPPWS+SILPDC+N
Sbjct: 357 WPGKNLEVHVFK-----SKSSCAAFLANYDPSSPAKVTFQNMQYDLPPWSISILPDCKNA 411
Query: 436 VFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMT-VKEPIGVWSENN 494
VFNTA+VSS++S M + +S + SW + ++E + +
Sbjct: 412 VFNTARVSSKSS-------------------QMKMTPVSGGAFSWQSYIEETVSADDSDT 452
Query: 495 FTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFING 554
G+ E +++T+D SDYLW++T + + ++ F K N P +T+ S L VFING
Sbjct: 453 IAKNGLWEQISITRDGSDYLWYLTDVNIHPNE-GFLK-NGQSPVLTVMSAGHALHVFING 510
Query: 555 QLTGSVIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLT 610
QL G+V G + V+ ++G N + LLS VGL N G E G G V L
Sbjct: 511 QLAGTVYGSLENPKLTFSNNVKLRAGINKISLLSAAVGLPNVGLHFETWNTGVLGPVTLK 570
Query: 611 GFKNGDIDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAP 669
G G DL+K W+Y+VGLKGE ++++ + EW + TWYK F+AP
Sbjct: 571 GLNEGTRDLTKQKWSYKVGLKGEDLSLHTLSGSSSVEWVQGSLLAQKQPLTWYKATFNAP 630
Query: 670 DGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNP 729
+G DP+ALD+ +MGKGQ W+NG IGR+W G C C Y G Y KC +NCG
Sbjct: 631 EGNDPLALDMNTMGKGQIWINGESIGRHWPEYKASGNC-GGCSYAGIYTEKKCLSNCGEA 689
Query: 730 TQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRST 768
+Q WYHVPRSWL+ S N LV+FEE GG+P IS R+T
Sbjct: 690 SQRWYHVPRSWLKPSGNFLVVFEELGGDPTGISFVRRTT 728
>gi|3641865|emb|CAA09457.1| beta-galactosidase [Cicer arietinum]
Length = 723
Score = 763 bits (1971), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/726 (53%), Positives = 478/726 (65%), Gaps = 32/726 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YDH+ I+IDG RR+LIS IHYPR+TPEMWP L K+KEGG DVI+TYVFWN HE
Sbjct: 24 SVTYDHKTIVIDGQRRILISGSIHYPRSTPEMWPALFQKAKEGGLDVIQTYVFWNGHEPS 83
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+Y F+ + D+VKF+KL +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N
Sbjct: 84 PGKYYFEDRFDLVKFIKLAQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 143
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ+F KIV +M+ E LF QGGPIIM QIENEYG +E + G GK Y WAA M
Sbjct: 144 PFKAAMQKFTTKIVSMMKAENLFQNQGGPIIMSQIENEYGPVEWNIGAPGKAYTNWAAQM 203
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL GVPW MCKQ DAP+ +ID CNGYYC+ + PN KP +WTENW GWYT +G +
Sbjct: 204 AVGLDTGVPWDMCKQEDAPDPVIDTCNGYYCENFTPNKNYKPKMWTENWSGWYTDFGNAI 263
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
+RPVEDLA++VARF Q GSF+NYYMY GGTNFGRTS G F TSYDYDAPIDEYGL +
Sbjct: 264 CYRPVEDLAYSVARFIQNRGSFVNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYGLTN 323
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPKW HL+DLH AIK CEPALV+ D LG EAHVY S C+AFLAN D
Sbjct: 324 EPKWSHLRDLHKAIKQCEPALVSVDPT-ITSLGNKLEAHVYST----GTSVCAAFLANYD 378
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+AA+VTF Y LPPWSVSILPDC+ VFNTAKV +Q+S KT+ +S N +
Sbjct: 379 TKSAATVTFGNGKYDLPPWSVSILPDCKTDVFNTAKVGAQSSQKTM-----ISTNSTFDW 433
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
QS IE EP +++ T + + E +NVT+D SDYLW++T + +S +
Sbjct: 434 QSYIE--------------EPAFSSEDDSITAEALWEQINVTRDSSDYLWYLTDVNISPN 479
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLI 581
+ F K N P + + S VL VF+NGQL+G+V G + V G N +
Sbjct: 480 E-DFIK-NGQYPILNVMSAGHVLHVFVNGQLSGTVYGVLDNPKLTFSNSVNLTVGNNKIS 537
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS VGL N G E G G V L G G DLS W+Y+VGLKGE +++I
Sbjct: 538 LLSVAVGLPNVGLHFETWNVGVLGPVTLKGLNEGTRDLSWQKWSYKVGLKGESLSLHTIT 597
Query: 642 ENEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ +WT + TWYK F+AP G DP+ LD+ SMGKG+ WVN IGR+W
Sbjct: 598 GGSSVDWTQGSLLAKKQPLTWYKATFNAPAGNDPLGLDMSSMGKGEIWVNDQSIGRHWPG 657
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
G C D CDY G + + KC TNCGNPTQTWYH+PRSWL + N+LV+ EE GG+P
Sbjct: 658 YIAHGSCGD-CDYAGTFTNTKCRTNCGNPTQTWYHIPRSWLNPTGNVLVVLEEWGGDPSG 716
Query: 761 ISVKLR 766
IS+ R
Sbjct: 717 ISLLKR 722
>gi|125581329|gb|EAZ22260.1| hypothetical protein OsJ_05915 [Oryza sativa Japonica Group]
Length = 754
Score = 761 bits (1964), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/716 (52%), Positives = 481/716 (67%), Gaps = 40/716 (5%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYD R+++I+G RR+L+S IHYPR+TPEMWP LI K+K+GG DVI+TYVFWN HE ++
Sbjct: 38 VSYDRRSLVINGRRRILLSGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHEPVQ 97
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY F + D+V+FVKLV +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PG+ FRT+N P
Sbjct: 98 GQYYFSDRYDLVRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVSFRTDNGP 157
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK EMQ+FV+KIV +M+ E LF WQGGPIIM Q+ENE+G MES G K Y WAA MA
Sbjct: 158 FKAEMQKFVEKIVSMMKSEGLFEWQGGPIIMSQVENEFGPMESVGGSGAKPYANWAAKMA 217
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+G GVPWVMCKQ DAP+ +I+ CNG+YCD + PN KP++WTE W GW+T++GG +P
Sbjct: 218 VGTNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKNYKPSMWTEAWTGWFTSFGGGVP 277
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
HRPVEDLAFAVARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDE+GLL +
Sbjct: 278 HRPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLLRQ 337
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHL+DLH AIK EP LV+AD +G ++A+V++A C+AFL+N
Sbjct: 338 PKWGHLRDLHRAIKQAEPVLVSADPT-IESIGSYEKAYVFKAK----NGACAAFLSNYHM 392
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+TA V F GQ Y LP WS+SILPDC+ VFNTA V T + ++P +
Sbjct: 393 NTAVKVRFNGQQYNLPAWSISILPDCKTAVFNTATVKEPTLMPK------MNPVVRF--- 443
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
+W + E S++ FT G++E L++T D SDYLW+ T + + +D
Sbjct: 444 ------------AWQSYSEDTNSLSDSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTND 491
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLIL 582
+ ++ P +T+ S ++VF+NG+ GSV G + + V+ G N + +
Sbjct: 492 LRSGQS----PQLTVYSAGHSMQVFVNGKSYGSVYGGYDNPKLTYNGRVKMWQGSNKISI 547
Query: 583 LSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEE 642
LS VGL N G E G G V L+ G DLS WTYQVGLKGE + ++
Sbjct: 548 LSSAVGLPNVGNHFENWNVGVLGPVTLSSLNGGTKDLSHQKWTYQVGLKGETLGLQTVTG 607
Query: 643 NEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVV 701
+ A EW G TW+K +F+AP G DPVALD+GSMGKGQ WVNGHH+GRYW+
Sbjct: 608 SSAVEWGG---PGGYQPLTWHKAFFNAPAGNDPVALDMGSMGKGQLWVNGHHVGRYWSYK 664
Query: 702 APKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
A GGC C Y G Y+ DKC +NCG+ +Q WYHVPRSWL+ NLLV+ EE G N
Sbjct: 665 A-SGGC-GGCSYAGTYHEDKCRSNCGDLSQRWYHVPRSWLKPGGNLLVVLEEYGAN 718
>gi|242064502|ref|XP_002453540.1| hypothetical protein SORBIDRAFT_04g007660 [Sorghum bicolor]
gi|241933371|gb|EES06516.1| hypothetical protein SORBIDRAFT_04g007660 [Sorghum bicolor]
Length = 740
Score = 761 bits (1964), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/726 (51%), Positives = 482/726 (66%), Gaps = 38/726 (5%)
Query: 49 YDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQ 108
YDHR+++I+G RR+LIS IHYPR+TPEMWP LI K+K+GG DVI+TYVFWN HE ++GQ
Sbjct: 47 YDHRSLVINGRRRILISGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHEPVQGQ 106
Query: 109 YNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFK 168
Y+F + D+V+FVKLV +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N PFK
Sbjct: 107 YHFADRYDLVRFVKLVRQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIRFRTDNGPFK 166
Query: 169 EEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALG 228
MQ+FV+KIV +M+ E LF WQGGPIIM Q+ENE+G MES G K Y WAA MA+G
Sbjct: 167 AAMQKFVEKIVSMMKSEGLFEWQGGPIIMAQVENEFGPMESVVGSGAKPYAHWAAQMAVG 226
Query: 229 LGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHR 288
GVPWVMCKQ DAP+ +I+ CNG+YCD + PN KPT+WTE W GW+T +GG LPHR
Sbjct: 227 TNTGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNRKYKPTMWTEAWTGWFTKFGGALPHR 286
Query: 289 PVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPK 348
PVEDLAFAVARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDE+GLL +PK
Sbjct: 287 PVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLLRQPK 346
Query: 349 WGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHT 408
WGHL+DLH AIK EPAL++ D +G ++A+++++ C+AFL+N T
Sbjct: 347 WGHLRDLHRAIKQAEPALISGDPT-IQSIGNYEKAYIFKSK----NGACAAFLSNYHMKT 401
Query: 409 AASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSM 468
A + F G+ Y LP WS+SILPDC+ VFNTA V T + + L
Sbjct: 402 AVKIRFDGRHYDLPAWSISILPDCKTAVFNTATVKEPTLLPKMNPVLHF----------- 450
Query: 469 IESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDIS 528
+W + E ++ FT G++E L++T D SDYLW+ T + + ++
Sbjct: 451 ----------AWQSYSEDTNSLDDSAFTRNGLVEQLSLTWDKSDYLWYTTHVSIGGNE-Q 499
Query: 529 FWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLILLS 584
F K+ + P +T+ S ++VF+NG+ GSV G + + V+ G N + +LS
Sbjct: 500 FLKSGQ-WPQLTVYSAGHSMQVFVNGRSYGSVYGGYDNPKLTFNGHVKMWQGSNKISILS 558
Query: 585 QTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENE 644
VGL N G E G G V L+G G DLS WTYQVGLKGE ++++ +
Sbjct: 559 SAVGLPNNGNHFELWNVGVLGPVTLSGLNEGKRDLSHQKWTYQVGLKGESLGLHTVTGSS 618
Query: 645 A-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAP 703
A EW G TW+K F+AP G DPVALD+GSMGKGQ WVNGHH GRYW+ A
Sbjct: 619 AVEWAG---PGGKQPLTWHKALFNAPAGSDPVALDMGSMGKGQIWVNGHHAGRYWSYRAY 675
Query: 704 KGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE-IS 762
G C+ C Y G Y D+C +NCG+ +Q WYHVPRSWL+ S NLLV+ EE GG ++
Sbjct: 676 SGSCR-RCSYAGTYREDQCLSNCGDISQRWYHVPRSWLKPSGNLLVVLEEYGGGDLAGVT 734
Query: 763 VKLRST 768
+ R+T
Sbjct: 735 LATRTT 740
>gi|293332101|ref|NP_001168664.1| uncharacterized protein LOC100382452 [Zea mays]
gi|223950023|gb|ACN29095.1| unknown [Zea mays]
Length = 815
Score = 758 bits (1958), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/783 (50%), Positives = 495/783 (63%), Gaps = 45/783 (5%)
Query: 77 MWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIG 136
MW LI K+K+GG DVI+TYVFWN HE G Y F+ + D+V+FVK V +GL++ LRIG
Sbjct: 29 MWEGLIQKAKDGGLDVIQTYVFWNGHEPTPGNYYFEERYDLVRFVKTVQKAGLFVHLRIG 88
Query: 137 PYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPII 196
PY+C EWNFGGFPVWL+ +PGI FRT+N PFK MQ F +KIV +M+ E LF+ QGGPII
Sbjct: 89 PYICGEWNFGGFPVWLKYVPGISFRTDNEPFKTAMQGFTEKIVGMMKSENLFASQGGPII 148
Query: 197 MLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYC 256
+ QIENEYG +G G+ Y+ WAA MA+GL GVPWVMCK+ DAP+ +I+ACNG+YC
Sbjct: 149 LSQIENEYGPEGKEFGAAGQAYINWAAKMAVGLDTGVPWVMCKEEDAPDPVINACNGFYC 208
Query: 257 DGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGG 316
D + PN KPT+WTE W GW+T +GG + RPVEDLAFAVARF Q+GGSF+NYYMY GG
Sbjct: 209 DAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAFAVARFVQKGGSFINYYMYHGG 268
Query: 317 TNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIK 376
TNFGRT+GGPF TSYDYDAPIDEYGL+ EPK HLK+LH A+KLCE ALV+ D
Sbjct: 269 TNFGRTAGGPFITTSYDYDAPIDEYGLIREPKHSHLKELHRAVKLCEQALVSVDPT-ITT 327
Query: 377 LGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTV 436
LG QEAHV+R S S C+AFLAN + ++ A V F + Y+LPPWS+SILPDC+N V
Sbjct: 328 LGTMQEAHVFR-----SPSGCAAFLANYNSNSHAKVVFNNEQYSLPPWSISILPDCKNVV 382
Query: 437 FNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPI-GVWSENNF 495
FN+A V QTS Q M +TS W E + + +
Sbjct: 383 FNSATVGVQTS-----------------QMQMWGD--GATSMMWERYDEEVDSLAAAPLL 423
Query: 496 TVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQ 555
T G+LE LNVT+D SDYLW+IT + +S + +F + P++++ S L VF+NGQ
Sbjct: 424 TTTGLLEQLNVTRDSSDYLWYITSVDISPSE-NFLQGGGKPPSLSVQSAGHALHVFVNGQ 482
Query: 556 LTGSVIG----HWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTG 611
L GS G +K V ++G N + LLS GL N G E G G V L G
Sbjct: 483 LQGSSYGTREDRRIKYNGNVNLRAGTNKIALLSVACGLPNVGVHYETWNTGVGGPVVLHG 542
Query: 612 FKNGDIDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTD---LTRDGIPSTFTWYKTYFD 667
G DL+ W+YQVGLKGE + S+E EW + + P WYK YF+
Sbjct: 543 LNEGSRDLTWQTWSYQVGLKGEQMNLNSVEGSGSVEWMQGSLIAQKQQP--LAWYKAYFE 600
Query: 668 APDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCG 727
P G +P+ALD+GSMGKGQ W+NG IGRYWT A G C+ C Y G + + KC CG
Sbjct: 601 TPSGDEPLALDMGSMGKGQVWINGQSIGRYWTAYA-DGDCKG-CSYTGTFRAPKCQAGCG 658
Query: 728 NPTQTWYHVPRSWLQASNNLLVIFEET-GGNPFEISVKLRSTRIVCEQVSESHYPPVRKW 786
PTQ WYHVPRSWLQ S NLLV+ EE GG+ +I++ RS VC VSE H P ++KW
Sbjct: 659 QPTQRWYHVPRSWLQPSRNLLVVLEELGGGDSSKIALAKRSVSSVCADVSEDH-PNIKKW 717
Query: 787 SNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSV 846
++ ++HL C G IS+I FAS+GTP G C F +G CH+ S +V
Sbjct: 718 ----QIESYGEREHRRAKVHLRCAHGQSISAIRFASFGTPVGTCGNFQQGGCHSASSHAV 773
Query: 847 VSE 849
+ +
Sbjct: 774 LEK 776
>gi|357124047|ref|XP_003563718.1| PREDICTED: beta-galactosidase 9-like isoform 1 [Brachypodium
distachyon]
Length = 719
Score = 758 bits (1957), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/735 (51%), Positives = 481/735 (65%), Gaps = 40/735 (5%)
Query: 41 FFKPFN--VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVF 98
F P N VSYDH+AI+I+G RR+L+S IHYPR+TPEMWPDLI K+K+GG DVI+TYVF
Sbjct: 18 FASPANAAVSYDHKAIVINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVF 77
Query: 99 WNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGI 158
WN HE ++GQY F + D+V+FVKL +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI
Sbjct: 78 WNGHEPVQGQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGI 137
Query: 159 EFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDY 218
FRT+N PFK MQ FV+KIV +M+ E LF WQGGPII+ Q+ENEYG MES G K Y
Sbjct: 138 SFRTDNGPFKAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGGGAKPY 197
Query: 219 VKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWY 278
WAA MA+ GAGVPWVMCKQ DAP+ +I+ CNG+YCD + PNS KP +WTE W GW+
Sbjct: 198 ANWAAKMAVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNSNGKPNMWTEAWSGWF 257
Query: 279 TTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPI 338
T +GG +PHRPVEDLAFAVARF Q+GGSF+NYYMY GGTNF RT+GGPF TSYDYDAPI
Sbjct: 258 TAFGGAVPHRPVEDLAFAVARFVQKGGSFVNYYMYHGGTNFDRTAGGPFIATSYDYDAPI 317
Query: 339 DEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCS 398
DEYGLL +PKWGHL+DLH AIK EPA+V+ D +G ++A+V+++ S C+
Sbjct: 318 DEYGLLRQPKWGHLRDLHKAIKQAEPAMVSGDPT-IQSIGNYEKAYVFKS----STGACA 372
Query: 399 AFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLS 458
AFL+N + A V + G+ Y LP WS+SILPDC+ V+NTA V P +
Sbjct: 373 AFLSNYHTSSPAKVVYNGRRYELPAWSISILPDCKTAVYNTATVKE-----------PSA 421
Query: 459 PNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHIT 518
P P SW + E ++ FT G++E L++T D SD+LW+ T
Sbjct: 422 PAKMNPAGGF----------SWQSYSEDTNSLDDSAFTKDGLVEQLSMTWDKSDFLWYTT 471
Query: 519 QIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQ 574
+ + D F K+ + P +TI+S L+VF+NGQ G+ G + + + V+
Sbjct: 472 YVNI-DSSEQFLKSGQ-WPQLTINSAGHTLQVFVNGQSYGAGYGGYDSPKLSYSKYVKMW 529
Query: 575 SGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEF 634
G N + +LS VGL N G E G G V L+G G DLS WTYQ+GLKGE
Sbjct: 530 QGSNKISILSSAVGLANQGTHYENWNVGVLGPVTLSGLNQGKRDLSNQKWTYQIGLKGES 589
Query: 635 QQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHH 693
++SI + EW TW+K YF AP G PVALD+GSMGKGQ WVNG +
Sbjct: 590 LGVHSITGSSSVEWGSANG---AQPLTWHKAYFSAPAGGAPVALDMGSMGKGQIWVNGRN 646
Query: 694 IGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEE 753
GRYW+ A G C +C Y G Y+ KC TNCG+ +Q WYHVPRSWL S NLLV+ EE
Sbjct: 647 AGRYWSYKA-SGSC-GSCSYTGTYSETKCQTNCGDISQRWYHVPRSWLNPSGNLLVVLEE 704
Query: 754 TGGNPFEISVKLRST 768
GG+ + + R+T
Sbjct: 705 FGGDLSGVKLMTRTT 719
>gi|357124049|ref|XP_003563719.1| PREDICTED: beta-galactosidase 9-like isoform 2 [Brachypodium
distachyon]
Length = 721
Score = 758 bits (1957), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/735 (52%), Positives = 484/735 (65%), Gaps = 38/735 (5%)
Query: 41 FFKPFN--VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVF 98
F P N VSYDH+AI+I+G RR+L+S IHYPR+TPEMWPDLI K+K+GG DVI+TYVF
Sbjct: 18 FASPANAAVSYDHKAIVINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVF 77
Query: 99 WNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGI 158
WN HE ++GQY F + D+V+FVKL +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI
Sbjct: 78 WNGHEPVQGQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGI 137
Query: 159 EFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDY 218
FRT+N PFK MQ FV+KIV +M+ E LF WQGGPII+ Q+ENEYG MES G K Y
Sbjct: 138 SFRTDNGPFKAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGGGAKPY 197
Query: 219 VKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWY 278
WAA MA+ GAGVPWVMCKQ DAP+ +I+ CNG+YCD + PNS KP +WTE W GW+
Sbjct: 198 ANWAAKMAVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNSNGKPNMWTEAWSGWF 257
Query: 279 TTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPI 338
T +GG +PHRPVEDLAFAVARF Q+GGSF+NYYMY GGTNF RT+GGPF TSYDYDAPI
Sbjct: 258 TAFGGAVPHRPVEDLAFAVARFVQKGGSFVNYYMYHGGTNFDRTAGGPFIATSYDYDAPI 317
Query: 339 DEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCS 398
DEYGLL +PKWGHL+DLH AIK EPA+V+ D +G ++A+V+++ S C+
Sbjct: 318 DEYGLLRQPKWGHLRDLHKAIKQAEPAMVSGDPT-IQSIGNYEKAYVFKS----STGACA 372
Query: 399 AFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLS 458
AFL+N + A V + G+ Y LP WS+SILPDC+ V+NTA V K E L ++
Sbjct: 373 AFLSNYHTSSPAKVVYNGRRYELPAWSISILPDCKTAVYNTATVRQ----KWKEKKLWMN 428
Query: 459 PNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHIT 518
P + SW + E ++ FT G++E L++T D SD+LW+ T
Sbjct: 429 P---------------AGGFSWQSYSEDTNSLDDSAFTKDGLVEQLSMTWDKSDFLWYTT 473
Query: 519 QIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQ 574
+ + D F K+ + P +TI+S L+VF+NGQ G+ G + + + V+
Sbjct: 474 YVNI-DSSEQFLKSGQ-WPQLTINSAGHTLQVFVNGQSYGAGYGGYDSPKLSYSKYVKMW 531
Query: 575 SGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEF 634
G N + +LS VGL N G E G G V L+G G DLS WTYQ+GLKGE
Sbjct: 532 QGSNKISILSSAVGLANQGTHYENWNVGVLGPVTLSGLNQGKRDLSNQKWTYQIGLKGES 591
Query: 635 QQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHH 693
++SI + EW TW+K YF AP G PVALD+GSMGKGQ WVNG +
Sbjct: 592 LGVHSITGSSSVEWGSANG---AQPLTWHKAYFSAPAGGAPVALDMGSMGKGQIWVNGRN 648
Query: 694 IGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEE 753
GRYW+ A G C +C Y G Y+ KC TNCG+ +Q WYHVPRSWL S NLLV+ EE
Sbjct: 649 AGRYWSYKA-SGSC-GSCSYTGTYSETKCQTNCGDISQRWYHVPRSWLNPSGNLLVVLEE 706
Query: 754 TGGNPFEISVKLRST 768
GG+ + + R+T
Sbjct: 707 FGGDLSGVKLMTRTT 721
>gi|356502277|ref|XP_003519946.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 835
Score = 758 bits (1956), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/820 (48%), Positives = 511/820 (62%), Gaps = 62/820 (7%)
Query: 39 STFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVF 98
S + +VSYD RAI IDG R++L S IHYPR+T EMWP LI KSKEGG DVIETYVF
Sbjct: 19 SIAIEAIDVSYDGRAITIDGKRKILFSGSIHYPRSTAEMWPSLIEKSKEGGLDVIETYVF 78
Query: 99 WNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGI 158
WN HE GQY+F G D+V+F+K + + GLY LRIGPYVCAEWN+GGFPVWL +IP I
Sbjct: 79 WNVHEPHPGQYDFSGNLDLVRFIKTIQNQGLYAVLRIGPYVCAEWNYGGFPVWLHNIPNI 138
Query: 159 EFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDY 218
EFRTNNA F++EM++F IVD+MR E LF+ QGGPII+ QIENEYGN+ SYGQ GK+Y
Sbjct: 139 EFRTNNAIFEDEMKKFTTLIVDMMRHEKLFASQGGPIILAQIENEYGNIMGSYGQNGKEY 198
Query: 219 VKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWY 278
V+W A +A GVPW+MC+Q+DAP+ +I+ CNG+YCD + PNS NKP +WTE+W GW+
Sbjct: 199 VQWCAQLAQSYQIGVPWIMCQQSDAPDPLINTCNGFYCDQWHPNSNNKPKMWTEDWTGWF 258
Query: 279 TTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPI 338
WGG PHR ED+AFAV RFFQ GG+F NYYMY GGTNFGRTSGGP+ TSYDYDAP+
Sbjct: 259 MHWGGPTPHRTAEDVAFAVGRFFQYGGTFQNYYMYHGGTNFGRTSGGPYITTSYDYDAPL 318
Query: 339 DEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCS 398
+EYG L++PKWGHLK LH +K E L S++ I G A ++ Y QS C
Sbjct: 319 NEYGDLNQPKWGHLKRLHEVLKSVETTLTMG-SSRNIDYGNQMTATIF---SYAGQSVC- 373
Query: 399 AFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLS 458
FL N A++ F YT+P WSVSILPDC V+NTAKV++QTSI T+
Sbjct: 374 -FLGNAHPSMDANINFQNTQYTIPAWSVSILPDCYTEVYNTAKVNAQTSIMTINN----E 428
Query: 459 PNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHIT 518
+ ++ Q M E+ L V T +L+ V D SDYLW+IT
Sbjct: 429 NSYALDWQWMPETHLEQMKDG--------KVLGSVAITAPRLLDQ-KVANDTSDYLWYIT 479
Query: 519 QIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVV----QPVEFQ 574
+ V D ++R +++ VL VF+NG GS + K ++ +
Sbjct: 480 SVDVKQGDPILSHDLKIR----VNTKGHVLHVFVNGAHIGSQYATYGKYTFTFEADIKLK 535
Query: 575 SGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGD---IDLSKILWTYQVGLK 631
G N++ L+S TVGL NYGA+ + G G V+L +G D+S +W Y+VG+
Sbjct: 536 LGKNEISLVSGTVGLPNYGAYFDNIHVGVTG-VQLVSQNDGSEVTKDISTNVWHYKVGMH 594
Query: 632 GEFQQIYSIEENEAEWTDLTRDGIPS--TFTWYKTYFDAPDGIDPVALDLGSMGKGQAWV 689
GE ++YS + EW +G+ + F WYKT F P G D V LDL +GKGQAWV
Sbjct: 595 GENVKLYSPSRSTEEW---FTNGLQAHKIFMWYKTTFRTPVGTDSVVLDLKGLGKGQAWV 651
Query: 690 NGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQAS-NNL 747
NG++IGRYW + +A + GC TCDYRG Y S+KCTTNCGNPTQ WYHVP S+L+ +N
Sbjct: 652 NGNNIGRYWVSYLAGEDGCSSTCDYRGTYRSNKCTTNCGNPTQRWYHVPDSFLRDGLDNT 711
Query: 748 LVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHL 807
LV+FEE GGNPF++ + + C + E H E+ L
Sbjct: 712 LVVFEEQGGNPFQVKIATVTIAKACAKAYEGH------------------------ELEL 747
Query: 808 HCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
C++ +IS I+FAS+G P+G C F +G+C + +LS+V
Sbjct: 748 ACKENQVISEIKFASFGVPEGECGSFKKGHCESSDTLSIV 787
>gi|30687121|ref|NP_849553.1| beta-galactosidase 12 [Arabidopsis thaliana]
gi|75265630|sp|Q9SCV0.1|BGL12_ARATH RecName: Full=Beta-galactosidase 12; Short=Lactase 12; Flags:
Precursor
gi|6686896|emb|CAB64748.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332659762|gb|AEE85162.1| beta-galactosidase 12 [Arabidopsis thaliana]
Length = 728
Score = 757 bits (1954), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/729 (50%), Positives = 488/729 (66%), Gaps = 35/729 (4%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD +A+II+G RR+L+S IHYPR+TPEMWPDLI K+K+GG DVI+TYVFWN HE
Sbjct: 29 VTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPSP 88
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY F+ + D+VKF+K+V +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PG+ FRT+N P
Sbjct: 89 GQYYFEDRYDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGMVFRTDNEP 148
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ+F +KIV +M+EE LF QGGPII+ QIENEYG +E G GK Y KW A MA
Sbjct: 149 FKAAMQKFTEKIVRMMKEEKLFETQGGPIILSQIENEYGPIEWEIGAPGKAYTKWVAEMA 208
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
GL GVPW+MCKQ DAP +II+ CNG+YC+ +KPNS NKP +WTENW GW+T +GG +P
Sbjct: 209 QGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNSDNKPKMWTENWTGWFTEFGGAVP 268
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
+RP ED+A +VARF Q GGSF+NYYMY GGTNF RT+ G F TSYDYDAP+DEYGL E
Sbjct: 269 YRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDRTA-GEFIATSYDYDAPLDEYGLPRE 327
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PK+ HLK LH IKLCEPALV+AD LG QEAHV++ S+S+C+AFL+N +
Sbjct: 328 PKYSHLKRLHKVIKLCEPALVSADPT-VTSLGDKQEAHVFK-----SKSSCAAFLSNYNT 381
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+AA V F G +Y LPPWSVSILPDC+ +NTAKV ++T + + P
Sbjct: 382 SSAARVLFGGSTYDLPPWSVSILPDCKTEYYNTAKV----QVRTSSIHMKMVP------- 430
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSEN-NFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
++T SW + E I ++N F+ G++E +++T+D +DY W++T I +S D
Sbjct: 431 -------TNTPFSWGSYNEEIPSANDNGTFSQDGLVEQISITRDKTDYFWYLTDITISPD 483
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYNDLI 581
+ T E P +TI S L VF+NGQL G+ G K Q ++ +G N L
Sbjct: 484 EKFL--TGE-DPLLTIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVNKLA 540
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS GL N G E G G V L G +G D++K W+Y++G KGE ++++
Sbjct: 541 LLSTAAGLPNVGVHYETWNTGVLGPVTLNGVNSGTWDMTKWKWSYKIGTKGEALSVHTLA 600
Query: 642 -ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ EW + + TWYK+ FD+P G +P+ALD+ +MGKGQ W+NG +IGR+W
Sbjct: 601 GSSTVEWKEGSLVAKKQPLTWYKSTFDSPTGNEPLALDMNTMGKGQMWINGQNIGRHWPA 660
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
+G C+ C Y G + KC +NCG +Q WYHVPRSWL+ +NNL+++ EE GG P
Sbjct: 661 YTARGKCE-RCSYAGTFTEKKCLSNCGEASQRWYHVPRSWLKPTNNLVIVLEEWGGEPNG 719
Query: 761 ISVKLRSTR 769
IS+ R+ +
Sbjct: 720 ISLVKRTAK 728
>gi|195617466|gb|ACG30563.1| beta-galactosidase precursor [Zea mays]
Length = 723
Score = 757 bits (1954), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/727 (52%), Positives = 475/727 (65%), Gaps = 36/727 (4%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYDHRA++I+G RR+LIS IHYPR+TPEMWP L+ K+K+GG DV++TYVFWN HE +R
Sbjct: 28 VSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTYVFWNGHEPVR 87
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY F + D+V+FVKL +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N P
Sbjct: 88 GQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGP 147
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ FV+KIV +M+ E LF WQGGPII+ Q+ENEYG MES G K Y WAA MA
Sbjct: 148 FKAAMQAFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGAGAKPYANWAAKMA 207
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+ GAGVPWVMCKQ DAP+ +I+ CNG+YCD + PNS +KPT+WTE W GW+T +GG +P
Sbjct: 208 VATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNSNSKPTMWTEAWTGWFTAFGGAVP 267
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
HRPVED+AFAVARF Q+GGSF+NYYMY GGTNF RTSGGPF TSYDYDAPIDEYGLL +
Sbjct: 268 HRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGLLRQ 327
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHL+DLH AIK EPALV+ D LG ++A+V+++ S C+AFL+N
Sbjct: 328 PKWGHLRDLHKAIKQAEPALVSGDPT-IQSLGNYEKAYVFKS----SGGACAAFLSNYHT 382
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
AA V F G+ Y LP WS+S+LPDC+ VFNTA VS P +P P
Sbjct: 383 SAAARVVFNGRRYDLPAWSISVLPDCKAAVFNTATVSE-----------PSAPARMSPAG 431
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
SW + E FT G++E L++T D SDYLW+ T + ++ ++
Sbjct: 432 GF----------SWQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNE 481
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLIL 582
F K+ + P +T+ S L+VF+NGQ G+V G + + V+ G N + +
Sbjct: 482 -QFLKSGQ-WPQLTVYSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISI 539
Query: 583 LSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE- 641
LS VGL N G E G G V L+G G DLS WTYQ+GL GE + S+
Sbjct: 540 LSAAVGLPNQGTHYETWNVGVLGPVTLSGLNEGKRDLSNQKWTYQIGLHGESLGVQSVAG 599
Query: 642 ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVV 701
+ EW TW+K YF AP G PVALD+GSMGKGQAWVNG HIGRYW+
Sbjct: 600 SSSVEWGSAAGK---QPLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYWSYK 656
Query: 702 APKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEI 761
A G C Y G Y+ KC T CG+ +Q +YHVPRSWL S NLLV+ EE GG+ +
Sbjct: 657 ASSSGGCGGCSYAGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVLLEEFGGDLPGV 716
Query: 762 SVKLRST 768
+ R+
Sbjct: 717 KLVTRTA 723
>gi|380450408|gb|AFD54987.1| beta-galactosidase [Momordica charantia]
Length = 719
Score = 756 bits (1952), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/727 (51%), Positives = 484/727 (66%), Gaps = 36/727 (4%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD +AIII+G RR+L+S IHYPR+TP+MWP LI +K+GG D+IETYVFWN HE +
Sbjct: 22 VTYDQKAIIINGKRRILVSGSIHYPRSTPQMWPSLIQNAKDGGLDIIETYVFWNGHEPTQ 81
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G+Y F+ + D+V+F+KLV +GLY+ LRIGPYVCAEWN+GGFP+WL+ +PGI FRT N P
Sbjct: 82 GKYYFEDRYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYGGFPIWLKHVPGIVFRTENEP 141
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ+F +KIV +M+ E L+ QGGPII+ QIENEYG +E G GK Y KWAA MA
Sbjct: 142 FKAAMQKFTEKIVGMMKSEKLYESQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAAQMA 201
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
LGL GVPWVMCKQ DAP+ +ID CNG+YC+ +KPN NKP +WTE W GWYT +GG +P
Sbjct: 202 LGLDTGVPWVMCKQEDAPDPVIDTCNGFYCENFKPNRENKPKIWTEVWSGWYTAFGGAVP 261
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
+RP EDLAF+VARF Q GGS NYYMY GGTNFGR+S G F SYD+DAPIDEYGL E
Sbjct: 262 YRPAEDLAFSVARFVQNGGSLFNYYMYHGGTNFGRSS-GLFIANSYDFDAPIDEYGLKRE 320
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKW HL+DLH AIKLCEPALV+AD LG+N EA V+++ S C+AFLAN D
Sbjct: 321 PKWEHLRDLHKAIKLCEPALVSAD-PNVTWLGKNLEARVFKS----SSGACAAFLANYDI 375
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
T++ V+F Y LPPWS+SIL DC++ +FNTA++ +Q S P +
Sbjct: 376 STSSKVSFWNTQYDLPPWSISILSDCKSAIFNTARIGAQ----------------SAPMK 419
Query: 467 SMIESKLSSTSKSWMTVKEPIGV-WSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
M+ S S W++ KE + ++ + T G++E +N T D +DYLW++T I + D
Sbjct: 420 MMLVS-----SFWWLSYKEEVASGYATDTTTKDGLVEQVNFTWDSTDYLWYMTDIQI-DP 473
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLI 581
+ +F K+ + P + I S VL VF+NGQL+G+V G V + V ++G N L
Sbjct: 474 NEAFIKSGQ-WPLLNISSAGHVLHVFVNGQLSGTVYGSLENPKVAFSKYVNLKAGVNKLS 532
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI- 640
+LS TVGL N G E AG G V L G G D+S W+++VGLKGE +++I
Sbjct: 533 MLSVTVGLPNVGLHFESWNAGVLGPVTLKGLNEGIRDMSGYKWSHKVGLKGENMNLHTIG 592
Query: 641 EENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
N +W + TWYKT F+ P G +P+ALD+ SMGKGQ W+NG IGRYW
Sbjct: 593 GSNSVQWAKGSGLVQKQPLTWYKTNFNTPAGNEPLALDMSSMGKGQIWINGRSIGRYWPA 652
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
A G C C Y G + KC +NCG P+Q WYHVPR WL++ N LV+FEE GGNP
Sbjct: 653 YAASGSC-GKCSYAGIFTEKKCLSNCGQPSQKWYHVPREWLESKGNFLVVFEELGGNPGG 711
Query: 761 ISVKLRS 767
IS+ RS
Sbjct: 712 ISLVKRS 718
>gi|224077880|ref|XP_002305449.1| predicted protein [Populus trichocarpa]
gi|222848413|gb|EEE85960.1| predicted protein [Populus trichocarpa]
Length = 731
Score = 756 bits (1951), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/728 (52%), Positives = 484/728 (66%), Gaps = 35/728 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YD +A+II+G R++L S IHYPR+TPEMW LI K+K+GG DVI+TYVFWN HE
Sbjct: 27 NVTYDKKALIINGQRKVLFSGSIHYPRSTPEMWEGLIQKAKDGGLDVIDTYVFWNLHEPS 86
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G YNF G+ D+V+F+KLV +GLY+ LRIGPY+CAEWNFGGFPVWL+ +PGI FRT+N
Sbjct: 87 PGNYNFDGRYDLVRFIKLVHEAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGISFRTDNE 146
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ+F +KIV +M++E LF QGGPII+ QIENEY ++G G Y+ WAA M
Sbjct: 147 PFKSAMQKFTQKIVQMMKDENLFESQGGPIILSQIENEYEPESKAFGSPGHAYMTWAAHM 206
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ + GVPWVMCK+ DAP+ +I+ CNG+YCD + PN KPT+WTE W GW+T +GG
Sbjct: 207 AISMDTGVPWVMCKEFDAPDPVINTCNGFYCDYFSPNKPYKPTMWTEAWTGWFTDFGGPN 266
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
RP EDLAFAVARF Q+GGS +NYYMY GGTNFGRTSGGPF TSYDYDAPIDEYGL+
Sbjct: 267 HQRPAEDLAFAVARFIQKGGSLVNYYMYHGGTNFGRTSGGPFITTSYDYDAPIDEYGLIR 326
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PK+GHLK+LH AIKLCE AL+AADS LG ++AHV+ ++ G C+AFL+N +
Sbjct: 327 QPKYGHLKELHKAIKLCEKALLAADST-VTSLGSYEQAHVFSSDSGG----CAAFLSNYN 381
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
AA V F Y+LPPWS+SILPDC+N VFNTA V QTS Q
Sbjct: 382 TKQAARVKFNNIQYSLPPWSISILPDCKNVVFNTAHVGVQTS-----------------Q 424
Query: 466 QSMIESKLSSTSKSWMTVKEPI-GVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
M+ + S SW T E I V + TV G+LE LN+T+D SDYLW+ T +++S
Sbjct: 425 VHMLPT--DSELLSWETFNEDISSVDDDKMITVAGLLEQLNITRDTSDYLWYTTSVHISS 482
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDL 580
+ SF + + P +T+ S L VFING+L+GS G + ++F +G N +
Sbjct: 483 SE-SFLRGGRL-PVLTVQSAGHALHVFINGELSGSAHGTREQRRFTFTEDMKFHAGKNRI 540
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS VGL N G E G G V L G G DL+ W+Y+VGLKGE + S
Sbjct: 541 SLLSVAVGLPNNGPRFETWNTGILGPVTLHGLDEGQRDLTWQKWSYKVGLKGEDMNLRSR 600
Query: 641 EE-NEAEWTDLT-RDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
+ + +W + G TWYK YF++P G DP+ALD+GSMGKGQ W+NGH IGRYW
Sbjct: 601 KSVSLVDWIQGSLMVGKQQPLTWYKAYFNSPKGDDPLALDMGSMGKGQVWINGHSIGRYW 660
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
T+ A +G C C Y + +C CG PTQ WYHVPRSWL+++ NLLV+FEE GG+
Sbjct: 661 TLYA-EGNCSG-CSYSATFRPARCQLGCGQPTQKWYHVPRSWLKSTRNLLVLFEEIGGDA 718
Query: 759 FEISVKLR 766
IS+ R
Sbjct: 719 SRISLVKR 726
>gi|356502275|ref|XP_003519945.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 835
Score = 756 bits (1951), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 398/840 (47%), Positives = 517/840 (61%), Gaps = 66/840 (7%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
M M + L + S+ S + +VSYD RAI IDG R++L S IHYPR+T EMWP
Sbjct: 1 MGKMGSITTLLLLCSALISIAIEAIDVSYDGRAITIDGKRKILFSGSIHYPRSTAEMWPS 60
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
LI KSKEGG DVIETYVFWN HE GQY+F G D+V+F+K + + GL+ LRIGPYVC
Sbjct: 61 LIEKSKEGGLDVIETYVFWNVHEPHPGQYDFSGNLDLVRFIKTIQNQGLHAVLRIGPYVC 120
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEWN+GGFPVWL +IP IEFRTNNA F++EM++F IVD+MR E LF+ QGGPII+ QI
Sbjct: 121 AEWNYGGFPVWLHNIPNIEFRTNNAIFEDEMKKFTTLIVDMMRHEKLFASQGGPIILAQI 180
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYK 260
ENEYGN+ SYGQ GK+YV+W A +A GVPW+MC+Q+D P+ +I+ CNG+YCD +
Sbjct: 181 ENEYGNIMGSYGQNGKEYVQWCAQLAQSYQIGVPWIMCQQSDTPDPLINTCNGFYCDQWH 240
Query: 261 PNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG 320
PNS NKP +WTE+W GW+ WGG PHR ED+AFAV RFFQ GG+F NYYMY GGTNFG
Sbjct: 241 PNSNNKPKMWTEDWTGWFMHWGGPTPHRTAEDVAFAVGRFFQYGGTFQNYYMYHGGTNFG 300
Query: 321 RTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQN 380
RTSGGP+ TSYDYDAP++EYG L++PKWGHLK LH +K E L S++ I G
Sbjct: 301 RTSGGPYITTSYDYDAPLNEYGDLNQPKWGHLKRLHEVLKSVETTLTMG-SSRNIDYGNQ 359
Query: 381 QEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTA 440
A ++ Y QS C FL N A++ F YT+P WSVSILPDC V+NTA
Sbjct: 360 MTATIF---SYAGQSVC--FLGNAHPSMDANINFQNTQYTIPAWSVSILPDCYTEVYNTA 414
Query: 441 KVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGI 500
KV++QTSI T+ + ++ Q M E+ L V T +
Sbjct: 415 KVNAQTSIMTINN----ENSYALDWQWMPETHLEQMKDG--------KVLGSVAITAPRL 462
Query: 501 LEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSV 560
L+ V D SDYLW+IT + V D ++R +++ VL VF+NG GS
Sbjct: 463 LDQ-KVANDTSDYLWYITSVDVKQGDPILSHDLKIR----VNTKGHVLHVFVNGAHIGSQ 517
Query: 561 IGHWVKVVQPVEFQS------GYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKN 614
+ K P F++ G N++ L+S TVGL NYGA+ + G G V+L +
Sbjct: 518 YATYGKY--PFTFEADIKLKLGKNEISLVSGTVGLPNYGAYFDNIHVGVTG-VQLVSQND 574
Query: 615 GD---IDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPS--TFTWYKTYFDAP 669
G D+S +W Y+VG+ GE ++YS + EW +G+ + F WYKT F P
Sbjct: 575 GSEVTKDISTNVWHYKVGMHGENVKLYSPSRSSEEW---FTNGLQAHKIFMWYKTTFRTP 631
Query: 670 DGIDPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGN 728
G D V LDL +GKGQAWVNG++IGRYW + +A + GC TCDYRG Y S+KCTTNCGN
Sbjct: 632 VGTDSVVLDLKGLGKGQAWVNGNNIGRYWVSYLAGEDGCSSTCDYRGTYRSNKCTTNCGN 691
Query: 729 PTQTWYHVPRSWLQAS-NNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWS 787
PTQ WYHVP S+L+ +N LV+FEE GGNPF++ + + C + E H
Sbjct: 692 PTQRWYHVPDSFLRDGLDNTLVVFEEQGGNPFQVKIATVTIAKACAKAYEGH-------- 743
Query: 788 NSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
E+ L C++ +IS I FAS+G P+G C F +G+C + +LS+V
Sbjct: 744 ----------------ELELACKENQVISEIRFASFGVPEGECGSFKKGHCESSDTLSIV 787
>gi|356509960|ref|XP_003523710.1| PREDICTED: beta-galactosidase 3-like isoform 1 [Glycine max]
Length = 736
Score = 754 bits (1948), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/735 (51%), Positives = 491/735 (66%), Gaps = 51/735 (6%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YD ++++I+G RR+LIS IHYPR+TPEMW DLI K+K GG DVI+TYVFW+ HE
Sbjct: 29 NVTYDRKSLLINGQRRILISGSIHYPRSTPEMWEDLIWKAKHGGLDVIDTYVFWDVHEPS 88
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G Y+F+G+ D+V+F+K V GLY LRIGPYVCAEWNFGG PVWL+ +PG+ FRT+N
Sbjct: 89 PGNYDFEGRYDLVRFIKTVQKVGLYANLRIGPYVCAEWNFGGIPVWLKYVPGVSFRTDNE 148
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ F +KIV +M+ E LF QGGPII+ QIENEYG S G G+ YV WAASM
Sbjct: 149 PFKAAMQGFTQKIVQMMKSEKLFQSQGGPIILSQIENEYG--PESRGAAGRAYVNWAASM 206
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GLG GVPWVMCK+ DAP+ +I++CNG+YCD + PN KP++WTE W GW+T +GG +
Sbjct: 207 AVGLGTGVPWVMCKENDAPDPVINSCNGFYCDDFSPNKPYKPSMWTETWSGWFTEFGGPI 266
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
RPVEDL+FAVARF Q+GGS++NYYMY GGTNFGR++GGPF TSYDYDAPIDEYGL+
Sbjct: 267 HQRPVEDLSFAVARFIQKGGSYVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGLIR 326
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PK+ HLK+LH AIK CE ALV+ D + LG +AHV+ + C+AFLAN +
Sbjct: 327 QPKYSHLKELHKAIKRCEHALVSLDPT-VLSLGTLLQAHVFSSG----TGTCAAFLANYN 381
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+AA+VTF + Y LPPWS+SILPDC+ VFNTAKV Q S + LP+ P +
Sbjct: 382 AQSAATVTFNNRHYDLPPWSISILPDCKIDVFNTAKVRVQPSQVKM---LPVKPKLF--- 435
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENN-FTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
SW + E + +E++ T G+LE LNVT+D SDYLW+IT + +S
Sbjct: 436 -------------SWESYDEDLSSLAESSRITAPGLLEQLNVTRDTSDYLWYITSVDISS 482
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDL 580
+ SF + + +P++ + S + VF+NGQ +GS G PV+ ++G N +
Sbjct: 483 SE-SFLRGGQ-KPSINVQSAGHAVHVFVNGQFSGSAFGTREQRSCTYNGPVDLRAGANKI 540
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS- 639
LLS TVGLQN G E AG G V L G G DL+ W+Y+VGL+GE + S
Sbjct: 541 ALLSVTVGLQNVGRHYETWEAGITGPVLLHGLDQGQKDLTWNKWSYKVGLRGEAMNLVSP 600
Query: 640 --------IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNG 691
++E++A + S WYK YFDAP G +P+ALDL SMGKGQ W+NG
Sbjct: 601 NGVSSVDWVQESQATQSR-------SQLKWYKAYFDAPGGKEPLALDLESMGKGQVWING 653
Query: 692 HHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIF 751
IGRYW A KG C ++C Y G + KC CG PTQ WYHVPRSWL+ + NL+V+F
Sbjct: 654 QSIGRYWMAYA-KGDC-NSCTYSGTFRPVKCQLGCGQPTQRWYHVPRSWLKPTKNLIVVF 711
Query: 752 EETGGNPFEISVKLR 766
EE GGNP++IS+ R
Sbjct: 712 EELGGNPWKISLVKR 726
>gi|193850557|gb|ACF22882.1| beta-galactosidase [Glycine max]
Length = 721
Score = 754 bits (1947), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/752 (51%), Positives = 484/752 (64%), Gaps = 44/752 (5%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
+++MM+ + + V++S V+YDH+AI++DG RR+LIS IHYPR+TP+MWPD
Sbjct: 9 VVLMMLCLWVCGVTAS----------VTYDHKAIVVDGKRRILISGSIHYPRSTPQMWPD 58
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
LI K+K+GG DVI+TYVFWN HE GQY F+ + D+VKFVKL +GLY+ LRIGPY+C
Sbjct: 59 LIQKAKDGGLDVIQTYVFWNGHEPSPGQYYFEDRFDLVKFVKLAQQAGLYVHLRIGPYIC 118
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEWN GGFPVWL+ +PGI FRT+N PFK MQ+F KIV LM+E LF QGGPII+ QI
Sbjct: 119 AEWNLGGFPVWLKYVPGIAFRTDNEPFKAAMQKFTAKIVSLMKENRLFQSQGGPIILSQI 178
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYK 260
ENEYG +E G GK Y KWAA MA+GL GVPWVMCKQ DAP+ +ID CNG+YC+ +K
Sbjct: 179 ENEYGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGFYCENFK 238
Query: 261 PNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG 320
PN KP +WTENW GWYT +GG +P RP EDLAF+VARF Q GGSF+NYYMY GGTNFG
Sbjct: 239 PNKNTKPKMWTENWTGWYTDFGGAVPRRPAEDLAFSVARFIQNGGSFVNYYMYHGGTNFG 298
Query: 321 RTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQN 380
RTSGG F TSYDYDAP+DEYGL +EPK+ HL+ LH AIK EPALVA D + LG N
Sbjct: 299 RTSGGLFIATSYDYDAPLDEYGLENEPKYEHLRALHKAIKQSEPALVATD-PKVQSLGYN 357
Query: 381 QEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTA 440
EAHV+ A C+AF+AN D + A F Y LPPWS+SILPDC+ V+NTA
Sbjct: 358 LEAHVFSA-----PGACAAFIANYDTKSYAKAKFGNGQYDLPPWSISILPDCKTVVYNTA 412
Query: 441 KVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGI 500
KV K N + QS E SS+ ++ +
Sbjct: 413 KVGYGWLKKMTPV------NSAFAWQSYNEEPASSS--------------QADSIAAYAL 452
Query: 501 LEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSV 560
E +NVT+D SDYLW++T + V+ ++ F K N P +T+ S VL VFINGQL G+V
Sbjct: 453 WEQVNVTRDSSDYLWYMTDVNVNANE-GFLK-NGQSPLLTVMSAGHVLHVFINGQLAGTV 510
Query: 561 IGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGD 616
G + V+ ++G N L LLS VGL N G E AG G V L G G
Sbjct: 511 WGGLGNPKLTFSDNVKLRAGNNKLSLLSVAVGLPNVGVHFETWNAGVLGPVTLKGLNEGT 570
Query: 617 IDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPV 675
DLS+ W+Y+VGLKGE +++ + EW + TWYKT F AP G DP+
Sbjct: 571 RDLSRQKWSYKVGLKGESLSLHTESGSSSVEWIQGSLVAKKQPLTWYKTTFSAPAGNDPL 630
Query: 676 ALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYH 735
ALDLGSMGKG+ WVNG IGR+W G C + C+Y G Y KC TNCG P+Q WYH
Sbjct: 631 ALDLGSMGKGEVWVNGRSIGRHWPGYIAHGSC-NACNYAGYYTDTKCRTNCGQPSQRWYH 689
Query: 736 VPRSWLQASNNLLVIFEETGGNPFEISVKLRS 767
VPRSWL + N LV+FEE GG+P I++ R+
Sbjct: 690 VPRSWLSSGGNSLVVFEEWGGDPNGIALVKRT 721
>gi|125555810|gb|EAZ01416.1| hypothetical protein OsI_23450 [Oryza sativa Indica Group]
Length = 717
Score = 754 bits (1947), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/726 (52%), Positives = 477/726 (65%), Gaps = 38/726 (5%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
+YDHR++ I+G RR+LIS IHYPR+TPEMWPDLI K+K+GG DVI+TYVFWN HE ++G
Sbjct: 25 TYDHRSLTINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQG 84
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
QY F + D+V+FVKLV +GLY+ LRIGPYVCAEWN+GGFPVWL+ +PGI FRT+N PF
Sbjct: 85 QYYFSDRYDLVRFVKLVKQAGLYVNLRIGPYVCAEWNYGGFPVWLKYVPGISFRTDNGPF 144
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
K MQ FV+KIV +M+ E LF WQGGPII+ Q+ENEYG MES G K YV WAA MA+
Sbjct: 145 KAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGSGAKSYVDWAAKMAV 204
Query: 228 GLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPH 287
AGVPW+MCKQ DAP+ +I+ CNG+YCD + PNS NKP++WTE W GW+T +GG +P
Sbjct: 205 ATNAGVPWIMCKQDDAPDPVINTCNGFYCDDFTPNSKNKPSMWTEAWSGWFTAFGGTVPQ 264
Query: 288 RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEP 347
RPVEDLAFAVARF Q+GGSF+NYYMY GGTNF RT+GGPF TSYDYDAPIDEYGLL +P
Sbjct: 265 RPVEDLAFAVARFIQKGGSFINYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQP 324
Query: 348 KWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEH 407
KWGHL +LH AIK EPALVA D +G ++A+V+R+ S +C+AFL+N
Sbjct: 325 KWGHLTNLHKAIKQAEPALVAGDPT-VQNIGNYEKAYVFRS----SSGDCAAFLSNFHTS 379
Query: 408 TAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQS 467
AA V F G+ Y LP WS+S+LPDCR V+NTA V++ +S P N
Sbjct: 380 AAARVAFNGRRYDLPAWSISVLPDCRTAVYNTATVTAASS--------PAKMN------- 424
Query: 468 MIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDI 527
+ +W + E E FT G++E L++T D SDYLW+ T + + D
Sbjct: 425 ------PAGGFTWQSYGEATNSLDETAFTKDGLVEQLSMTWDKSDYLWYTTYVNI-DSGE 477
Query: 528 SFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLILL 583
F K+ + P +T+ S ++VF+NGQ G+ G + + V+ G N + +L
Sbjct: 478 QFLKSGQ-WPQLTVYSAGHSVQVFVNGQYFGNAYGGYDGPKLTYSGYVKMWQGSNKISIL 536
Query: 584 SQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE-FQQIYSIEE 642
S VGL N G E G G V L+G G DLSK WTYQ+GLKGE
Sbjct: 537 SSAVGLPNVGTHYETWNIGVLGPVTLSGLNEGKRDLSKQKWTYQIGLKGEKLGVHSVSGS 596
Query: 643 NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVA 702
+ EW TW++ YF+AP G PVALDLGSMGKGQAWVNGH IGRYW+ A
Sbjct: 597 SSVEWGGAAGK---QPVTWHRAYFNAPAGGAPVALDLGSMGKGQAWVNGHLIGRYWSYKA 653
Query: 703 PKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEIS 762
G C C Y G Y+ KC NCG+ +Q WYHVPRSWL S NL+V+ EE GG+ ++
Sbjct: 654 -SGNC-GGCSYAGTYSEKKCQANCGDASQRWYHVPRSWLNPSGNLVVLLEEFGGDLSGVT 711
Query: 763 VKLRST 768
+ R+T
Sbjct: 712 LMTRTT 717
>gi|4538943|emb|CAB39679.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|7269465|emb|CAB79469.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 729
Score = 754 bits (1946), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/730 (50%), Positives = 488/730 (66%), Gaps = 36/730 (4%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD +A+II+G RR+L+S IHYPR+TPEMWPDLI K+K+GG DVI+TYVFWN HE
Sbjct: 29 VTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPSP 88
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY F+ + D+VKF+K+V +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PG+ FRT+N P
Sbjct: 89 GQYYFEDRYDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGMVFRTDNEP 148
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ+F +KIV +M+EE LF QGGPII+ QIENEYG +E G GK Y KW A MA
Sbjct: 149 FKAAMQKFTEKIVRMMKEEKLFETQGGPIILSQIENEYGPIEWEIGAPGKAYTKWVAEMA 208
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
GL GVPW+MCKQ DAP +II+ CNG+YC+ +KPNS NKP +WTENW GW+T +GG +P
Sbjct: 209 QGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNSDNKPKMWTENWTGWFTEFGGAVP 268
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
+RP ED+A +VARF Q GGSF+NYYMY GGTNF RT+ G F TSYDYDAP+DEYGL E
Sbjct: 269 YRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDRTA-GEFIATSYDYDAPLDEYGLPRE 327
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PK+ HLK LH IKLCEPALV+AD LG QEAHV++ S+S+C+AFL+N +
Sbjct: 328 PKYSHLKRLHKVIKLCEPALVSADPT-VTSLGDKQEAHVFK-----SKSSCAAFLSNYNT 381
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+AA V F G +Y LPPWSVSILPDC+ +NTAKV ++T + + P
Sbjct: 382 SSAARVLFGGSTYDLPPWSVSILPDCKTEYYNTAKV----QVRTSSIHMKMVP------- 430
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSEN-NFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
++T SW + E I ++N F+ G++E +++T+D +DY W++T I +S D
Sbjct: 431 -------TNTPFSWGSYNEEIPSANDNGTFSQDGLVEQISITRDKTDYFWYLTDITISPD 483
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYNDLI 581
+ T E P +TI S L VF+NGQL G+ G K Q ++ +G N L
Sbjct: 484 EKFL--TGE-DPLLTIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVNKLA 540
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTY-QVGLKGEFQQIYSI 640
LLS GL N G E G G V L G +G D++K W+Y Q+G KGE ++++
Sbjct: 541 LLSTAAGLPNVGVHYETWNTGVLGPVTLNGVNSGTWDMTKWKWSYKQIGTKGEALSVHTL 600
Query: 641 E-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
+ EW + + TWYK+ FD+P G +P+ALD+ +MGKGQ W+NG +IGR+W
Sbjct: 601 AGSSTVEWKEGSLVAKKQPLTWYKSTFDSPTGNEPLALDMNTMGKGQMWINGQNIGRHWP 660
Query: 700 VVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPF 759
+G C+ C Y G + KC +NCG +Q WYHVPRSWL+ +NNL+++ EE GG P
Sbjct: 661 AYTARGKCE-RCSYAGTFTEKKCLSNCGEASQRWYHVPRSWLKPTNNLVIVLEEWGGEPN 719
Query: 760 EISVKLRSTR 769
IS+ R+ +
Sbjct: 720 GISLVKRTAK 729
>gi|6686882|emb|CAB64741.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 732
Score = 752 bits (1941), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/762 (50%), Positives = 490/762 (64%), Gaps = 45/762 (5%)
Query: 14 LALS-VYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPR 72
L LS + ++ M+I S + SS V+YD +AI+I+G+RR+L+S IHYPR
Sbjct: 6 LVLSKILTFLLTTMLIGSSVIQCSS---------VTYDKKAIVINGHRRILLSGSIHYPR 56
Query: 73 ATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQ 132
+TPEMW DLI K+K+GG DVI+TYVFWN HE G YNF+G+ D+V+F+K + GLY+
Sbjct: 57 STPEMWEDLIKKAKDGGLDVIDTYVFWNGHEPSPGTYNFEGRYDLVRFIKTIQEVGLYVH 116
Query: 133 LRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQG 192
LRIGPYVCAEWNFGGFPVWL+ + GI FRT+N PFK MQ F +KIV +M+E F+ QG
Sbjct: 117 LRIGPYVCAEWNFGGFPVWLKYVDGISFRTDNGPFKSAMQGFTEKIVQMMKEHRFFASQG 176
Query: 193 GPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACN 252
GPII+ QIENE+ G G YV WAA MA+GL GVPWVMCK+ DAP+ II+ CN
Sbjct: 177 GPIILSQIENEFEPDLKGLGPAGHSYVNWAAKMAVGLNTGVPWVMCKEDDAPDPIINTCN 236
Query: 253 GYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYM 312
G+YCD + PN KPT+WTE W GW+T +GG +P RPVEDLAF VARF Q+GGS++NYYM
Sbjct: 237 GFYCDYFTPNKPYKPTMWTEAWSGWFTEFGGTVPKRPVEDLAFGVARFIQKGGSYINYYM 296
Query: 313 YFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSA 372
Y GGTNFGRT+GGPF TSYDYDAPIDEYGL+ EPK+ HLK LH AIK CE ALV++D
Sbjct: 297 YHGGTNFGRTAGGPFITTSYDYDAPIDEYGLVQEPKYSHLKQLHQAIKQCEAALVSSD-P 355
Query: 373 QYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDC 432
KLG +EAHV+ A + +C AFL N + A V F + YTLP WS+SILPDC
Sbjct: 356 HVTKLGNYEEAHVFTAGK----GSCVAFLTNYHMNAPAKVVFNNRHYTLPAWSISILPDC 411
Query: 433 RNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSE 492
RN VFNTA V+++TS + VP S++ S ++ + T P
Sbjct: 412 RNVVFNTATVAAKTSHVQM-----------VPSGSILYS-VARYDEDIATYGNP------ 453
Query: 493 NNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFI 552
T +G+LE +NVT+D +DYLW+ T + + + SF + + PT+T+DS + VF+
Sbjct: 454 GTITARGLLEQVNVTRDTTDYLWYTTSVDIKASE-SFLRGGK-WPTLTVDSAGHAVHVFV 511
Query: 553 NGQLTGSVIG----HWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVK 608
NG GS G V + G N + LLS VGL N G E G G V
Sbjct: 512 NGHFYGSAFGTRENRKFSFSSQVNLRGGANKIALLSVAVGLPNVGPHFETWATGIVGSVA 571
Query: 609 LTGFKNGDIDLSKILWTYQVGLKGEFQQIYS-IEENEAEWT--DLTRDGIPSTFTWYKTY 665
L G G+ DLS WTYQ GL+GE + S E++ +W L + TWYK Y
Sbjct: 572 LHGLDEGNKDLSWQKWTYQAGLRGESMNLVSPTEDSSVDWIKGSLAKQN-KQPLTWYKAY 630
Query: 666 FDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTN 725
FDAP G +P+ALDL SMGKGQAW+NG IGRYW A KG C +C+Y G Y +KC +
Sbjct: 631 FDAPRGNEPLALDLKSMGKGQAWINGQSIGRYWMAFA-KGDC-GSCNYAGTYRQNKCQSG 688
Query: 726 CGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRS 767
CG PTQ WYHVPRSWL+ NLLV+FEE GG+ ++SV RS
Sbjct: 689 CGEPTQRWYHVPRSWLKPKGNLLVLFEELGGDISKVSVVKRS 730
>gi|297846860|ref|XP_002891311.1| hypothetical protein ARALYDRAFT_473836 [Arabidopsis lyrata subsp.
lyrata]
gi|297337153|gb|EFH67570.1| hypothetical protein ARALYDRAFT_473836 [Arabidopsis lyrata subsp.
lyrata]
Length = 732
Score = 752 bits (1941), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/763 (50%), Positives = 494/763 (64%), Gaps = 47/763 (6%)
Query: 14 LALS-VYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPR 72
L LS + ++ M+I S + SS V+YD +AI+I+G+RR+L+S IHYPR
Sbjct: 6 LVLSKILTFLLTTMLIGSSMIQCSS---------VTYDKKAIVINGHRRILLSGSIHYPR 56
Query: 73 ATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQ 132
+TPEMW DLI K+K+GG DVI+TYVFWN HE G YNF+G+ D+V+F+K + GLY+
Sbjct: 57 STPEMWEDLIKKAKDGGLDVIDTYVFWNGHEPSPGTYNFEGRYDLVRFIKTIQEVGLYVH 116
Query: 133 LRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQG 192
LRIGPYVCAEWNFGGFPVWL+ + GI FRT+N PFK MQ F +KIV +M+E F+ QG
Sbjct: 117 LRIGPYVCAEWNFGGFPVWLKYVDGISFRTDNGPFKAAMQGFTEKIVQMMKEHRFFASQG 176
Query: 193 GPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACN 252
GPII+ QIENE+ G G YV WAA MA+GL GVPWVMCK+ DAP+ II++CN
Sbjct: 177 GPIILSQIENEFEPELKGLGPAGHSYVNWAAKMAVGLNTGVPWVMCKEDDAPDPIINSCN 236
Query: 253 GYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYM 312
G+YCD + PN KPT+WTE W GW+T +GG +P RPVEDLAF VARF Q+GGS++NYYM
Sbjct: 237 GFYCDYFTPNKPYKPTMWTEAWSGWFTEFGGTIPKRPVEDLAFGVARFIQKGGSYINYYM 296
Query: 313 YFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSA 372
Y GGTNFGRT+GGPF TSYDYDAPIDEYGL+ EPK+ HLK LH AIK CE ALV++D
Sbjct: 297 YHGGTNFGRTAGGPFITTSYDYDAPIDEYGLVQEPKYSHLKQLHQAIKQCEAALVSSD-P 355
Query: 373 QYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDC 432
KLG +EAHV+ A + +C AFL N + A V F + YTLP WS+SILPDC
Sbjct: 356 HVTKLGNYEEAHVFTAGK----GSCVAFLTNYHMNAPAKVVFNNRHYTLPAWSISILPDC 411
Query: 433 RNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSE 492
RN VFNTA V+++TS + + P+ S+ L S ++ E I + +
Sbjct: 412 RNVVFNTATVAAKTS------HVQMMPSGSI---------LYSVAR----YDEDIATYGD 452
Query: 493 N-NFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVF 551
T +G+LE +NVT+D +DYLW+ T + + + SF + + PT+T+DS + VF
Sbjct: 453 RGTITARGLLEQVNVTRDTTDYLWYTTSVDIKASE-SFLRGGK-WPTLTVDSAGHAVHVF 510
Query: 552 INGQLTGSVIG----HWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQV 607
+NG GS G V + G N + LLS VGL N G E G G V
Sbjct: 511 VNGHFYGSAFGTRENRKFSFSSQVNLRGGANRIALLSVAVGLPNVGPHFETWATGIVGSV 570
Query: 608 KLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS-IEENEAEWT--DLTRDGIPSTFTWYKT 664
L G G+ DLS WTYQ GL+GE ++ S E++ +W L + TWYK
Sbjct: 571 VLHGLDEGNKDLSWQKWTYQAGLRGEAMKLVSPTEDSSVDWIKGSLAKQN-KQPLTWYKA 629
Query: 665 YFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTT 724
YFDAP G +P+ALDL SMGKGQAW+NG IGRYW A KG C +C+Y G Y +KC +
Sbjct: 630 YFDAPRGNEPLALDLKSMGKGQAWINGQSIGRYWMAFA-KGNC-GSCNYAGTYRQNKCQS 687
Query: 725 NCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRS 767
CG PTQ WYHVPRSWL+ NLLV+FEE GG+ ++SV RS
Sbjct: 688 GCGEPTQRWYHVPRSWLKPRGNLLVLFEELGGDISKVSVVKRS 730
>gi|15451018|gb|AAK96780.1| beta-galactosidase [Arabidopsis thaliana]
gi|17978799|gb|AAL47393.1| beta-galactosidase [Arabidopsis thaliana]
Length = 724
Score = 751 bits (1939), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/753 (50%), Positives = 501/753 (66%), Gaps = 46/753 (6%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
+ ++ + LSC+ +S VSYD +A+II+G RR+L+S IHYPR+TPEMWP
Sbjct: 12 FLAILCCLSLSCIVKAS---------VSYDRKAVIINGQRRILLSGSIHYPRSTPEMWPG 62
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
LI K+KEGG DVIETYVFWN HE GQY F + D+VKF+KLV +GLY+ LRIGPYVC
Sbjct: 63 LIQKAKEGGLDVIETYVFWNGHEPSPGQYYFGDRYDLVKFIKLVHQAGLYVNLRIGPYVC 122
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEWNFGGFPVWL+ +PG+ FRT+N PFK M++F +KIV +M+ E LF QGGPII+ QI
Sbjct: 123 AEWNFGGFPVWLKFVPGMAFRTDNEPFKAAMKKFTEKIVWMMKAEKLFQTQGGPIILAQI 182
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYK 260
ENEYG +E G GK Y KW A MALGL GVPW+MCKQ DAP IID CNGYYC+ +K
Sbjct: 183 ENEYGPVEWEIGAPGKAYTKWVAQMALGLSTGVPWIMCKQEDAPGPIIDTCNGYYCEDFK 242
Query: 261 PNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG 320
PNS NKP +WTENW GWYT +GG +P+RPVED+A++VARF Q+GGS +NYYMY GGTNF
Sbjct: 243 PNSINKPKMWTENWTGWYTDFGGAVPYRPVEDIAYSVARFIQKGGSLINYYMYHGGTNFD 302
Query: 321 RTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQN 380
RT+ G F +SYDYDAP+DEYGL EPK+ HLK LH AIKL EPAL++AD A LG
Sbjct: 303 RTA-GEFMASSYDYDAPLDEYGLPREPKYSHLKALHKAIKLSEPALLSAD-ATVTSLGAK 360
Query: 381 QEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTA 440
QEA+V+ S+S+C+AFL+N DE++AA V F G Y LPPWSVSILPDC+ V+NTA
Sbjct: 361 QEAYVFW-----SKSSCAAFLSNKDENSAARVLFRGFPYDLPPWSVSILPDCKTEVYNTA 415
Query: 441 KVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSE-NNFTVQG 499
KV+ +P++ ++M+ + T SW + E +E F G
Sbjct: 416 KVN--------------APSV---HRNMVP---TGTKFSWGSFNEATPTANEAGTFARNG 455
Query: 500 ILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGS 559
++E +++T D SDY W+IT I + + +F KT + P +T+ S L VF+NGQL+G+
Sbjct: 456 LVEQISMTWDKSDYFWYITDITIGSGE-TFLKTGD-SPLLTVMSAGHALHVFVNGQLSGT 513
Query: 560 VIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG 615
G + Q ++ +G N + LLS VGL N G E+ G G V L G +G
Sbjct: 514 AYGGLDHPKLTFSQKIKLHAGVNKIALLSVAVGLPNVGTHFEQWNKGVLGPVTLKGVNSG 573
Query: 616 DIDLSKILWTYQVGLKGEFQQIYS-IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDP 674
D+SK W+Y++G+KGE +++ E + WT + TWYK+ F P G +P
Sbjct: 574 TWDMSKWKWSYKIGVKGEALSLHTNTESSGVRWTQGSFVAKKQPLTWYKSTFATPAGNEP 633
Query: 675 VALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWY 734
+ALD+ +MGKGQ W+NG +IGR+W +G C C+Y G +++ KC +NCG +Q WY
Sbjct: 634 LALDMNTMGKGQVWINGRNIGRHWPAYKAQGSC-GRCNYAGTFDAKKCLSNCGEASQRWY 692
Query: 735 HVPRSWLQASNNLLVIFEETGGNPFEISVKLRS 767
HVPRSWL+ S NL+V+FEE GG+P IS+ R+
Sbjct: 693 HVPRSWLK-SQNLIVVFEELGGDPNGISLVKRT 724
>gi|115468642|ref|NP_001057920.1| Os06g0573600 [Oryza sativa Japonica Group]
gi|75112285|sp|Q5Z7L0.1|BGAL9_ORYSJ RecName: Full=Beta-galactosidase 9; Short=Lactase 9; Flags:
Precursor
gi|54291174|dbj|BAD61846.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113595960|dbj|BAF19834.1| Os06g0573600 [Oryza sativa Japonica Group]
Length = 715
Score = 751 bits (1939), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/726 (52%), Positives = 476/726 (65%), Gaps = 38/726 (5%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
+YDHR++ I+G RR+LIS IHYPR+TPEMWPDLI K+K+GG DVI+TYVFWN HE ++G
Sbjct: 23 TYDHRSLTINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQG 82
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
QY F + D+V+FVKLV +GLY+ LRIGPYVCAEWN+GGFPVWL+ +PGI FRT+N PF
Sbjct: 83 QYYFSDRYDLVRFVKLVKQAGLYVNLRIGPYVCAEWNYGGFPVWLKYVPGISFRTDNGPF 142
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
K MQ FV+KIV +M+ E LF WQGGPII+ Q+ENEYG MES G K YV WAA MA+
Sbjct: 143 KAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGSGAKSYVDWAAKMAV 202
Query: 228 GLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPH 287
AGVPW+MCKQ DAP+ +I+ CNG+YCD + PNS NKP++WTE W GW+T +GG +P
Sbjct: 203 ATNAGVPWIMCKQDDAPDPVINTCNGFYCDDFTPNSKNKPSMWTEAWSGWFTAFGGTVPQ 262
Query: 288 RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEP 347
RPVEDLAFAVARF Q+GGSF+NYYMY GGTNF RT+GGPF TSYDYDAPIDEYGLL +P
Sbjct: 263 RPVEDLAFAVARFIQKGGSFINYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQP 322
Query: 348 KWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEH 407
KWGHL +LH AIK E ALVA D +G ++A+V+R+ S +C+AFL+N
Sbjct: 323 KWGHLTNLHKAIKQAETALVAGDPT-VQNIGNYEKAYVFRS----SSGDCAAFLSNFHTS 377
Query: 408 TAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQS 467
AA V F G+ Y LP WS+S+LPDCR V+NTA V++ +S P N
Sbjct: 378 AAARVAFNGRRYDLPAWSISVLPDCRTAVYNTATVTAASS--------PAKMN------- 422
Query: 468 MIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDI 527
+ +W + E E FT G++E L++T D SDYLW+ T + + D
Sbjct: 423 ------PAGGFTWQSYGEATNSLDETAFTKDGLVEQLSMTWDKSDYLWYTTYVNI-DSGE 475
Query: 528 SFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLILL 583
F K+ + P +T+ S ++VF+NGQ G+ G + + V+ G N + +L
Sbjct: 476 QFLKSGQ-WPQLTVYSAGHSVQVFVNGQYFGNAYGGYDGPKLTYSGYVKMWQGSNKISIL 534
Query: 584 SQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE-FQQIYSIEE 642
S VGL N G E G G V L+G G DLSK WTYQ+GLKGE
Sbjct: 535 SSAVGLPNVGTHYETWNIGVLGPVTLSGLNEGKRDLSKQKWTYQIGLKGEKLGVHSVSGS 594
Query: 643 NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVA 702
+ EW TW++ YF+AP G PVALDLGSMGKGQAWVNGH IGRYW+ A
Sbjct: 595 SSVEWGGAAGK---QPVTWHRAYFNAPAGGAPVALDLGSMGKGQAWVNGHLIGRYWSYKA 651
Query: 703 PKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEIS 762
G C C Y G Y+ KC NCG+ +Q WYHVPRSWL S NL+V+ EE GG+ ++
Sbjct: 652 -SGNC-GGCSYAGTYSEKKCQANCGDASQRWYHVPRSWLNPSGNLVVLLEEFGGDLSGVT 709
Query: 763 VKLRST 768
+ R+T
Sbjct: 710 LMTRTT 715
>gi|15241969|ref|NP_200498.1| beta-galactosidase 4 [Arabidopsis thaliana]
gi|75265636|sp|Q9SCV8.1|BGAL4_ARATH RecName: Full=Beta-galactosidase 4; Short=Lactase 4; Flags:
Precursor
gi|6686880|emb|CAB64740.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|8809655|dbj|BAA97206.1| beta-galactosidase [Arabidopsis thaliana]
gi|332009434|gb|AED96817.1| beta-galactosidase 4 [Arabidopsis thaliana]
Length = 724
Score = 751 bits (1938), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/753 (50%), Positives = 501/753 (66%), Gaps = 46/753 (6%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
+ ++ + LSC+ +S VSYD +A+II+G RR+L+S IHYPR+TPEMWP
Sbjct: 12 FLAILCCLSLSCIVKAS---------VSYDRKAVIINGQRRILLSGSIHYPRSTPEMWPG 62
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
LI K+KEGG DVIETYVFWN HE GQY F + D+VKF+KLV +GLY+ LRIGPYVC
Sbjct: 63 LIQKAKEGGLDVIETYVFWNGHEPSPGQYYFGDRYDLVKFIKLVHQAGLYVNLRIGPYVC 122
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEWNFGGFPVWL+ +PG+ FRT+N PFK M++F +KIV +M+ E LF QGGPII+ QI
Sbjct: 123 AEWNFGGFPVWLKFVPGMAFRTDNEPFKAAMKKFTEKIVWMMKAEKLFQTQGGPIILAQI 182
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYK 260
ENEYG +E G GK Y KW A MALGL GVPW+MCKQ DAP IID CNGYYC+ +K
Sbjct: 183 ENEYGPVEWEIGAPGKAYTKWVAQMALGLSTGVPWIMCKQEDAPGPIIDTCNGYYCEDFK 242
Query: 261 PNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG 320
PNS NKP +WTENW GWYT +GG +P+RPVED+A++VARF Q+GGS +NYYMY GGTNF
Sbjct: 243 PNSINKPKMWTENWTGWYTDFGGAVPYRPVEDIAYSVARFIQKGGSLVNYYMYHGGTNFD 302
Query: 321 RTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQN 380
RT+ G F +SYDYDAP+DEYGL EPK+ HLK LH AIKL EPAL++AD A LG
Sbjct: 303 RTA-GEFMASSYDYDAPLDEYGLPREPKYSHLKALHKAIKLSEPALLSAD-ATVTSLGAK 360
Query: 381 QEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTA 440
QEA+V+ S+S+C+AFL+N DE++AA V F G Y LPPWSVSILPDC+ V+NTA
Sbjct: 361 QEAYVFW-----SKSSCAAFLSNKDENSAARVLFRGFPYDLPPWSVSILPDCKTEVYNTA 415
Query: 441 KVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSE-NNFTVQG 499
KV+ +P++ ++M+ + T SW + E +E F G
Sbjct: 416 KVN--------------APSV---HRNMVP---TGTKFSWGSFNEATPTANEAGTFARNG 455
Query: 500 ILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGS 559
++E +++T D SDY W+IT I + + +F KT + P +T+ S L VF+NGQL+G+
Sbjct: 456 LVEQISMTWDKSDYFWYITDITIGSGE-TFLKTGD-SPLLTVMSAGHALHVFVNGQLSGT 513
Query: 560 VIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG 615
G + Q ++ +G N + LLS VGL N G E+ G G V L G +G
Sbjct: 514 AYGGLDHPKLTFSQKIKLHAGVNKIALLSVAVGLPNVGTHFEQWNKGVLGPVTLKGVNSG 573
Query: 616 DIDLSKILWTYQVGLKGEFQQIYS-IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDP 674
D+SK W+Y++G+KGE +++ E + WT + TWYK+ F P G +P
Sbjct: 574 TWDMSKWKWSYKIGVKGEALSLHTNTESSGVRWTQGSFVAKKQPLTWYKSTFATPAGNEP 633
Query: 675 VALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWY 734
+ALD+ +MGKGQ W+NG +IGR+W +G C C+Y G +++ KC +NCG +Q WY
Sbjct: 634 LALDMNTMGKGQVWINGRNIGRHWPAYKAQGSC-GRCNYAGTFDAKKCLSNCGEASQRWY 692
Query: 735 HVPRSWLQASNNLLVIFEETGGNPFEISVKLRS 767
HVPRSWL+ S NL+V+FEE GG+P IS+ R+
Sbjct: 693 HVPRSWLK-SQNLIVVFEELGGDPNGISLVKRT 724
>gi|356509962|ref|XP_003523711.1| PREDICTED: beta-galactosidase 3-like isoform 2 [Glycine max]
Length = 729
Score = 751 bits (1938), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/735 (51%), Positives = 488/735 (66%), Gaps = 58/735 (7%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YD ++++I+G RR+LIS IHYPR+TPEMW DLI K+K GG DVI+TYVFW+ HE
Sbjct: 29 NVTYDRKSLLINGQRRILISGSIHYPRSTPEMWEDLIWKAKHGGLDVIDTYVFWDVHEPS 88
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G Y+F+G+ D+V+F+K V GLY LRIGPYVCAEWNFGG PVWL+ +PG+ FRT+N
Sbjct: 89 PGNYDFEGRYDLVRFIKTVQKVGLYANLRIGPYVCAEWNFGGIPVWLKYVPGVSFRTDNE 148
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ F +KIV +M+ E LF QGGPII+ QIENEYG S G G+ YV WAASM
Sbjct: 149 PFKAAMQGFTQKIVQMMKSEKLFQSQGGPIILSQIENEYG--PESRGAAGRAYVNWAASM 206
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GLG GVPWVMCK+ DAP+ +I++CNG+YCD + PN KP++WTE W GW+T +GG +
Sbjct: 207 AVGLGTGVPWVMCKENDAPDPVINSCNGFYCDDFSPNKPYKPSMWTETWSGWFTEFGGPI 266
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
RPVEDL+FAVARF Q+GGS++NYYMY GGTNFGR++GGPF TSYDYDAPIDEYGL+
Sbjct: 267 HQRPVEDLSFAVARFIQKGGSYVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGLIR 326
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PK+ HLK+LH AIK CE ALV+ D + LG +AHV+ + C+AFLAN +
Sbjct: 327 QPKYSHLKELHKAIKRCEHALVSLDPT-VLSLGTLLQAHVFSSG----TGTCAAFLANYN 381
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+AA+VTF + Y LPPWS+SILPDC+ VFNTAKV LP+ P +
Sbjct: 382 AQSAATVTFNNRHYDLPPWSISILPDCKIDVFNTAKVK----------MLPVKPKLF--- 428
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENN-FTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
SW + E + +E++ T G+LE LNVT+D SDYLW+IT + +S
Sbjct: 429 -------------SWESYDEDLSSLAESSRITAPGLLEQLNVTRDTSDYLWYITSVDISS 475
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDL 580
+ SF + + +P++ + S + VF+NGQ +GS G PV+ ++G N +
Sbjct: 476 SE-SFLRGGQ-KPSINVQSAGHAVHVFVNGQFSGSAFGTREQRSCTYNGPVDLRAGANKI 533
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS- 639
LLS TVGLQN G E AG G V L G G DL+ W+Y+VGL+GE + S
Sbjct: 534 ALLSVTVGLQNVGRHYETWEAGITGPVLLHGLDQGQKDLTWNKWSYKVGLRGEAMNLVSP 593
Query: 640 --------IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNG 691
++E++A + S WYK YFDAP G +P+ALDL SMGKGQ W+NG
Sbjct: 594 NGVSSVDWVQESQATQSR-------SQLKWYKAYFDAPGGKEPLALDLESMGKGQVWING 646
Query: 692 HHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIF 751
IGRYW A KG C ++C Y G + KC CG PTQ WYHVPRSWL+ + NL+V+F
Sbjct: 647 QSIGRYWMAYA-KGDC-NSCTYSGTFRPVKCQLGCGQPTQRWYHVPRSWLKPTKNLIVVF 704
Query: 752 EETGGNPFEISVKLR 766
EE GGNP++IS+ R
Sbjct: 705 EELGGNPWKISLVKR 719
>gi|297799386|ref|XP_002867577.1| beta-galactosidase 12 [Arabidopsis lyrata subsp. lyrata]
gi|297313413|gb|EFH43836.1| beta-galactosidase 12 [Arabidopsis lyrata subsp. lyrata]
Length = 728
Score = 751 bits (1938), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/729 (50%), Positives = 484/729 (66%), Gaps = 35/729 (4%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD +A+II+G RR+L+S IHYPR+TPEMWPDLI K+K+GG DVI+TYVFWN HE
Sbjct: 29 VTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPSP 88
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY F+ + D+VKF+KLV +GLY+ LRIGPYVCAEWNFGGFPVWL+ +P + FRT+N P
Sbjct: 89 GQYYFEDRYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPDMVFRTDNEP 148
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ+F +KIV +M+EE LF QGGPII+ QIENEYG +E G GK Y KW A MA
Sbjct: 149 FKAAMQKFTEKIVGMMKEEKLFETQGGPIILSQIENEYGPIEWEIGAPGKAYTKWVAKMA 208
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
GL GVPW+MCKQ DAP +II+ CNG+YC+ +KPNS KP +WTENW GW+T +GG +P
Sbjct: 209 QGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNSDKKPKMWTENWTGWFTEFGGAVP 268
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
+RP ED+A +VARF Q GGSF+NYYMY GGTNF RT+ G F TSYDYDAP+DEYGL E
Sbjct: 269 YRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDRTA-GEFIATSYDYDAPLDEYGLPRE 327
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PK+ HLK LH IKLCEPALV+AD LG QEA V++ SQS+C+AFL+N +
Sbjct: 328 PKYSHLKRLHKVIKLCEPALVSADPT-VTSLGDKQEAQVFK-----SQSSCAAFLSNYNT 381
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+AA V+F G +Y LPPWSVSILPDC+ +NTAKV +TS ++ + VP
Sbjct: 382 SSAARVSFGGSTYDLPPWSVSILPDCKTEYYNTAKVQVRTS--SIHMKM-------VPTN 432
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSEN-NFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
++ SW + E I ++N F+ G++E +++T+D +DY W++T I +S D
Sbjct: 433 TLF---------SWGSYNEEIPSANDNGTFSQDGLVEQISITRDKTDYFWYLTDITISPD 483
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYNDLI 581
+ T E P + I S L VF+NGQL G+ G K Q ++ +G N L
Sbjct: 484 EKFL--TGE-DPLLNIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVNKLA 540
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS GL N G E G G V L G +G D+S+ W+Y++G KGE I+++
Sbjct: 541 LLSIAAGLPNVGVHYETWNTGVLGPVTLKGVNSGTWDMSQWKWSYKIGTKGEALSIHTVT 600
Query: 642 -ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ EW + TWYK+ FD P G +P+ALD+ +MGKGQ W+NG +IGR+W
Sbjct: 601 GSSTVEWKQGSLVATKQPLTWYKSTFDTPAGNEPLALDMNTMGKGQTWINGQNIGRHWPA 660
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
+G C+ C Y G + +KC +NCG +Q WYHVPRSWL+ +NNL+V+ EE GG P
Sbjct: 661 YTARGKCE-RCSYAGTFTENKCLSNCGEASQRWYHVPRSWLKPTNNLVVVLEEWGGEPNG 719
Query: 761 ISVKLRSTR 769
IS+ R +
Sbjct: 720 ISLVKRRAK 728
>gi|15219534|ref|NP_175127.1| beta-galactosidase 5 [Arabidopsis thaliana]
gi|75192251|sp|Q9MAJ7.1|BGAL5_ARATH RecName: Full=Beta-galactosidase 5; Short=Lactase 5; Flags:
Precursor
gi|7767665|gb|AAF69162.1|AC007915_14 F27F5.20 [Arabidopsis thaliana]
gi|17979002|gb|AAL47461.1| At1g45130/F27F5_20 [Arabidopsis thaliana]
gi|20334754|gb|AAM16238.1| At1g45130/F27F5_20 [Arabidopsis thaliana]
gi|332193961|gb|AEE32082.1| beta-galactosidase 5 [Arabidopsis thaliana]
Length = 732
Score = 750 bits (1937), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/763 (50%), Positives = 489/763 (64%), Gaps = 47/763 (6%)
Query: 14 LALS-VYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPR 72
L LS + ++ M+I S + SS V+YD +AI+I+G+RR+L+S IHYPR
Sbjct: 6 LVLSKILTFLLTTMLIGSSVIQCSS---------VTYDKKAIVINGHRRILLSGSIHYPR 56
Query: 73 ATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQ 132
+TPEMW DLI K+K+GG DVI+TYVFWN HE G YNF+G+ D+V+F+K + GLY+
Sbjct: 57 STPEMWEDLIKKAKDGGLDVIDTYVFWNGHEPSPGTYNFEGRYDLVRFIKTIQEVGLYVH 116
Query: 133 LRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQG 192
LRIGPYVCAEWNFGGFPVWL+ + GI FRT+N PFK MQ F +KIV +M+E F+ QG
Sbjct: 117 LRIGPYVCAEWNFGGFPVWLKYVDGISFRTDNGPFKSAMQGFTEKIVQMMKEHRFFASQG 176
Query: 193 GPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACN 252
GPII+ QIENE+ G G YV WAA MA+GL GVPWVMCK+ DAP+ II+ CN
Sbjct: 177 GPIILSQIENEFEPDLKGLGPAGHSYVNWAAKMAVGLNTGVPWVMCKEDDAPDPIINTCN 236
Query: 253 GYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYM 312
G+YCD + PN KPT+WTE W GW+T +GG +P RPVEDLAF VARF Q+GGS++NYYM
Sbjct: 237 GFYCDYFTPNKPYKPTMWTEAWSGWFTEFGGTVPKRPVEDLAFGVARFIQKGGSYINYYM 296
Query: 313 YFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSA 372
Y GGTNFGRT+GGPF TSYDYDAPIDEYGL+ EPK+ HLK LH AIK CE ALV++D
Sbjct: 297 YHGGTNFGRTAGGPFITTSYDYDAPIDEYGLVQEPKYSHLKQLHQAIKQCEAALVSSD-P 355
Query: 373 QYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDC 432
KLG +EAHV+ A + +C AFL N + A V F + YTLP WS+SILPDC
Sbjct: 356 HVTKLGNYEEAHVFTAGK----GSCVAFLTNYHMNAPAKVVFNNRHYTLPAWSISILPDC 411
Query: 433 RNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVW-S 491
RN VFNTA V+++TS + VP S++ S E I + +
Sbjct: 412 RNVVFNTATVAAKTSHVQM-----------VPSGSILYSV--------ARYDEDIATYGN 452
Query: 492 ENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVF 551
T +G+LE +NVT+D +DYLW+ T + + + SF + + PT+T+DS + VF
Sbjct: 453 RGTITARGLLEQVNVTRDTTDYLWYTTSVDIKASE-SFLRGGK-WPTLTVDSAGHAVHVF 510
Query: 552 INGQLTGSVIG----HWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQV 607
+NG GS G V + G N + LLS VGL N G E G G V
Sbjct: 511 VNGHFYGSAFGTRENRKFSFSSQVNLRGGANKIALLSVAVGLPNVGPHFETWATGIVGSV 570
Query: 608 KLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS-IEENEAEWT--DLTRDGIPSTFTWYKT 664
L G G+ DLS WTYQ GL+GE + S E++ +W L + TWYK
Sbjct: 571 VLHGLDEGNKDLSWQKWTYQAGLRGESMNLVSPTEDSSVDWIKGSLAKQN-KQPLTWYKA 629
Query: 665 YFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTT 724
YFDAP G +P+ALDL SMGKGQAW+NG IGRYW A KG C +C+Y G Y +KC +
Sbjct: 630 YFDAPRGNEPLALDLKSMGKGQAWINGQSIGRYWMAFA-KGDC-GSCNYAGTYRQNKCQS 687
Query: 725 NCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRS 767
CG PTQ WYHVPRSWL+ NLLV+FEE GG+ ++SV RS
Sbjct: 688 GCGEPTQRWYHVPRSWLKPKGNLLVLFEELGGDISKVSVVKRS 730
>gi|357449771|ref|XP_003595162.1| Beta-galactosidase [Medicago truncatula]
gi|124360798|gb|ABN08770.1| Galactose-binding like [Medicago truncatula]
gi|355484210|gb|AES65413.1| Beta-galactosidase [Medicago truncatula]
Length = 726
Score = 750 bits (1937), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/728 (52%), Positives = 479/728 (65%), Gaps = 34/728 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YDH+AI+I+G RR+LIS IHYPR+TP+MWPDLI K+K+GG DVIETYVFWN HE
Sbjct: 27 SVTYDHKAIVINGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGVDVIETYVFWNGHEPS 86
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+G+Y F+ + D+VKF+K+V +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PG+ FRT+N
Sbjct: 87 QGKYYFEDRFDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVAFRTDNE 146
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ+F KIV +M+ E LF QGGPII+ QIENEYG +E G GK Y KW + M
Sbjct: 147 PFKAAMQKFTTKIVSIMKSENLFQSQGGPIILSQIENEYGPVEWEIGAPGKSYTKWFSQM 206
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL GVPWVMCKQ DAP+ IID CNGYYC+ + PN KP +WTENW GWYT +G +
Sbjct: 207 AVGLNTGVPWVMCKQEDAPDPIIDTCNGYYCENFSPNKNYKPKMWTENWTGWYTDFGTAV 266
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RP EDLAF+VARF Q GS++NYYMY GGTNFGRTS G F TSYDYDAPIDEYGL+S
Sbjct: 267 PYRPAEDLAFSVARFVQNRGSYVNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYGLIS 326
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPKWGHL+DLH AIK CE ALV+ D G+N E H+Y+ + +G+ C+AFLAN D
Sbjct: 327 EPKWGHLRDLHKAIKQCESALVSVDPTVSWP-GKNLEVHLYKTS-FGA---CAAFLANYD 381
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ A V F Y LPPWS+SILPDC+ VFNTAKV + +++ N +
Sbjct: 382 TGSWAKVAFGNGHYDLPPWSISILPDCKTEVFNTAKVRAPRVHRSMT-----PANSAFNW 436
Query: 466 QSMIES-KLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
QS E S S SW T G+LE L+ T D SDYLW++T + +S
Sbjct: 437 QSYNEQPAFSGESGSW---------------TANGLLEQLSQTWDKSDYLWYMTDVNISP 481
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDL 580
++ F K N P +T S VL VFINGQ G+ G + V+ + G N +
Sbjct: 482 NE-GFIK-NGQNPVLTAMSAGHVLHVFINGQFWGTAYGSLDNPKLTFSNSVKLRVGNNKI 539
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS VGL N G EK G G V L G G DLSK W+Y++GLKGE +++
Sbjct: 540 SLLSVAVGLSNVGVHYEKWNVGVLGPVTLKGLNEGTRDLSKQKWSYKIGLKGESLNLHTT 599
Query: 641 E-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
+ +WT + TWYKT F+AP G DP+ALD+ SMGKG+ WVNG IGR+W
Sbjct: 600 SGSSSVKWTQGSFLSKKQPLTWYKTTFNAPAGNDPLALDMSSMGKGEIWVNGQSIGRHWP 659
Query: 700 VVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPF 759
+G C +C+Y G + KC TNCG PTQ WYH+PRSWL S N+LV+ EE GG+P
Sbjct: 660 AYIARGNC-GSCNYAGTFTDKKCRTNCGQPTQKWYHIPRSWLNPSGNVLVVLEEWGGDPT 718
Query: 760 EISVKLRS 767
IS+ R+
Sbjct: 719 GISLVKRT 726
>gi|3860420|emb|CAA09467.1| exo galactanase [Lupinus angustifolius]
Length = 730
Score = 749 bits (1934), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/767 (50%), Positives = 488/767 (63%), Gaps = 46/767 (5%)
Query: 7 NRALLQCLALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISA 66
+R +++ L M+++++ C ++S V+YDH+AI+I+G RR+LIS
Sbjct: 4 SRIVMESLMSRRNFHMVLLLLFFWVCYVTAS---------VTYDHKAIMINGQRRILISG 54
Query: 67 GIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGS 126
IHYPR+TP+MWPDLI K+K+GG DVIETYVFWN HE G+Y F+ + D+V F+KLV
Sbjct: 55 SIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGHEPSPGKYYFEDRFDLVGFIKLVQQ 114
Query: 127 SGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEM 186
+GL++ LRIGP++CAEWNFGGFPVWL+ +PGI FRT+N PFKE MQ+F +KIV++M+ E
Sbjct: 115 AGLFVHLRIGPFICAEWNFGGFPVWLKYVPGIAFRTDNEPFKEAMQKFTEKIVNIMKAEK 174
Query: 187 LFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPEN 246
LF QGGPII+ QIENEYG +E G GK Y KWAA MA+GL GVPWVMCKQ DAP+
Sbjct: 175 LFQSQGGPIILSQIENEYGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWVMCKQEDAPDP 234
Query: 247 IIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGS 306
IID CNG+YC+ + PN KP LWTENW GWYT +GG P+RP ED+AF+VARF Q GS
Sbjct: 235 IIDTCNGFYCENFTPNKNYKPKLWTENWTGWYTAFGGATPYRPAEDIAFSVARFIQNRGS 294
Query: 307 FMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
NYYMY GGTNFGRTS G F TSYDYDAPIDEYGLL+EPKWGHL++LH AIK CE AL
Sbjct: 295 LFNYYMYHGGTNFGRTSNGLFVATSYDYDAPIDEYGLLNEPKWGHLRELHRAIKQCESAL 354
Query: 367 VAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSV 426
V+ D G+N E H+Y+ ++S C+AFLAN + + V F Y LPPWS+
Sbjct: 355 VSVDPTVSWP-GKNLEVHLYK-----TESACAAFLANYNTDYSTQVKFGNGQYDLPPWSI 408
Query: 427 SILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEP 486
SILPDC+ VFNTAKV+S P P S +W + E
Sbjct: 409 SILPDCKTEVFNTAKVNS-----------PRLHRKMTPVNSAF---------AWQSYNEE 448
Query: 487 IGVWSENN-FTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMR 545
SEN+ T + E + VT+D SDYLW++T + + +DI K P +T S
Sbjct: 449 PASSSENDPVTGYALWEQVGVTRDSSDYLWYLTDVNIGPNDIKDGK----WPVLTAMSAG 504
Query: 546 DVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGA 601
VL VFINGQ G+ G + Q V + G N + LLS +VGL N G E
Sbjct: 505 HVLNVFINGQYAGTAYGSLDDPRLTFSQSVNLRVGNNKISLLSVSVGLANVGTHFETWNT 564
Query: 602 GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS-IEENEAEWTDLTRDGIPSTFT 660
G G V LTG +G DLSK W+Y++GLKGE +++ N EW +
Sbjct: 565 GVLGPVTLTGLSSGTWDLSKQKWSYKIGLKGESLSLHTEAGSNSVEWVQGSLVAKKQPLA 624
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
WYKT F AP G DP+ALDLGSMGKG+ WVNG IGR+W +G C + C+Y G Y
Sbjct: 625 WYKTTFSAPAGNDPLALDLGSMGKGEVWVNGQSIGRHWPGNKARGNCGN-CNYAGTYTDT 683
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRS 767
KC NCG P+Q WYHVPRSWL++ N LV+ EE GG+P I++ R+
Sbjct: 684 KCLANCGQPSQRWYHVPRSWLRSGGNYLVVLEEWGGDPNGIALVERT 730
>gi|267026|sp|Q00662.1|BGAL_DIACA RecName: Full=Putative beta-galactosidase; Short=Lactase; AltName:
Full=SR12 protein; Flags: Precursor
gi|18328|emb|CAA40459.1| CARSR12 [Dianthus caryophyllus]
Length = 731
Score = 749 bits (1933), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/760 (50%), Positives = 496/760 (65%), Gaps = 47/760 (6%)
Query: 17 SVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPE 76
+V MM++ + + ++ +S NV YD+RAI I+ RR+L+S IHYPR+TPE
Sbjct: 8 NVMKMMLVYVFVLITLISCVYG-------NVWYDYRAIKINDQRRILLSGSIHYPRSTPE 60
Query: 77 MWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIG 136
MWPD+I K+K+ DVI+TYVFWN HE G+Y F+G+ D+VKF+KL+ +GL++ LRIG
Sbjct: 61 MWPDIIEKAKDSQLDVIQTYVFWNGHEPSEGKYYFEGRYDLVKFIKLIHQAGLFVHLRIG 120
Query: 137 PYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPII 196
P+ CAEWNFGGFPVWL+ +PGIEFRT+N PFKE+MQ F KIVD+M+ E LF WQGGPII
Sbjct: 121 PFACAEWNFGGFPVWLKYVPGIEFRTDNGPFKEKMQVFTTKIVDMMKAEKLFHWQGGPII 180
Query: 197 MLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQ-TDAPENIIDACNGYY 255
+ QIENEYG +E G GK Y WAA MA L AGVPW+MCKQ +D P+N+ID CNG+Y
Sbjct: 181 LNQIENEYGPVEWEIGAPGKAYTHWAAQMAQSLNAGVPWIMCKQDSDVPDNVIDTCNGFY 240
Query: 256 CDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFG 315
C+G+ P +KP +WTENW GWYT +G +P+RP ED+AF+VARF Q GGSFMNYYM+ G
Sbjct: 241 CEGFVPKDKSKPKMWTENWTGWYTEYGKPVPYRPAEDVAFSVARFIQNGGSFMNYYMFHG 300
Query: 316 GTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYI 375
GTNF T+ G F TSYDYDAP+DEYGL EPK+ HLK+LH AIK+CEPALV++D A+
Sbjct: 301 GTNF-ETTAGRFVSTSYDYDAPLDEYGLPREPKYTHLKNLHKAIKMCEPALVSSD-AKVT 358
Query: 376 KLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNT 435
LG NQEAHVY +N +C+AFLAN D + VTF G + LP WS+SILPDC+
Sbjct: 359 NLGSNQEAHVYSSN----SGSCAAFLANYDPKWSVKVTFSGMEFELPAWSISILPDCKKE 414
Query: 436 VFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNF 495
V+NTA+V ++ S K P+ N++ S S T P F
Sbjct: 415 VYNTARV-NEPSPKLHSKMTPVISNLN----------WQSYSDEVPTADSP------GTF 457
Query: 496 TVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQ 555
+ + E +N+T D SDYLW++T + V D + F K + P +T++S VL VF+NGQ
Sbjct: 458 REKKLYEQINMTWDKSDYLWYMTDV-VLDGNEGFLKKGD-EPWLTVNSAGHVLHVFVNGQ 515
Query: 556 LTGSVIGHWVK----VVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTG 611
L G G K Q V+ +G N + LLS VGL N G E+ G G V L+G
Sbjct: 516 LQGHAYGSLAKPQLTFSQKVKMTAGVNRISLLSAVVGLANVGWHFERYNQGVLGPVTLSG 575
Query: 612 FKNGDIDLSKILWTYQVGLKGEFQQIY-SIEENEAEWTDLTRDGIPS---TFTWYKTYFD 667
G DL+ W+Y++G KGE QQ+Y S + +W G P+ WYKT FD
Sbjct: 576 LNEGTRDLTWQYWSYKIGTKGEEQQVYNSGGSSHVQW------GPPAWKQPLVWYKTTFD 629
Query: 668 APDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCG 727
AP G DP+ALDLGSMGKGQAW+NG IGR+W+ KG C D C+Y G Y KC ++CG
Sbjct: 630 APGGNDPLALDLGSMGKGQAWINGQSIGRHWSNNIAKGSCNDNCNYAGTYTETKCLSDCG 689
Query: 728 NPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRS 767
+Q WYHVPRSWLQ NLLV+FEE GG+ +S+ R+
Sbjct: 690 KSSQKWYHVPRSWLQPRGNLLVVFEEWGGDTKWVSLVKRT 729
>gi|16604400|gb|AAL24206.1| At1g45130/F27F5_20 [Arabidopsis thaliana]
Length = 732
Score = 748 bits (1932), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/763 (50%), Positives = 488/763 (63%), Gaps = 47/763 (6%)
Query: 14 LALS-VYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPR 72
L LS + ++ M+I S + SS V+YD +AI+I+G+RR+L+S IHYPR
Sbjct: 6 LVLSKILTFLLTTMLIGSSVIQCSS---------VTYDKKAIVINGHRRILLSGSIHYPR 56
Query: 73 ATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQ 132
+TPEMW DLI K+K+GG DVI+TYVFWN HE G YNF+G+ D+V+F+K + GLY+
Sbjct: 57 STPEMWEDLIKKAKDGGLDVIDTYVFWNGHEPSPGTYNFEGRYDLVRFIKTIQEVGLYVH 116
Query: 133 LRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQG 192
LRIGPYVCAEWNFGGFPVWL+ + GI FRT+N PFK MQ F +KIV +M+E F+ QG
Sbjct: 117 LRIGPYVCAEWNFGGFPVWLKYVDGISFRTDNGPFKSAMQGFTEKIVQMMKEHRFFASQG 176
Query: 193 GPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACN 252
GPII+ QIENE+ G G YV WAA MA+GL GVPWVMCK+ DAP+ II+ CN
Sbjct: 177 GPIILSQIENEFEPDLKGLGPAGHSYVNWAAKMAVGLNTGVPWVMCKEDDAPDPIINTCN 236
Query: 253 GYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYM 312
G+YCD + PN KPT+WTE W GW+T +GG +P RPVEDLAF VARF Q+GGS++NYYM
Sbjct: 237 GFYCDYFTPNKPYKPTMWTEAWSGWFTEFGGTVPKRPVEDLAFGVARFIQKGGSYINYYM 296
Query: 313 YFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSA 372
Y GGTNFGRT+GGPF TSYDYDAPIDEYGL+ EPK+ HLK LH AIK CE ALV++D
Sbjct: 297 YHGGTNFGRTAGGPFITTSYDYDAPIDEYGLVQEPKYSHLKQLHQAIKQCEAALVSSD-P 355
Query: 373 QYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDC 432
KLG +EAHV+ A + +C AFL N + A V F + YTLP WS+SILPDC
Sbjct: 356 HVTKLGNYEEAHVFTAGK----GSCVAFLTNYHMNAPAKVVFNNRHYTLPAWSISILPDC 411
Query: 433 RNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVW-S 491
RN VFNTA V+++TS + VP S++ S E I + +
Sbjct: 412 RNVVFNTATVAAKTSHVQM-----------VPSGSILYSV--------ARYDEDIATYGN 452
Query: 492 ENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVF 551
T +G+LE +NVT+D +DYLW+ T + + + SF + + PT+T+DS + VF
Sbjct: 453 RGTITARGLLEQVNVTRDTTDYLWYTTSVDIKASE-SFLRGGK-WPTLTVDSAGHAVHVF 510
Query: 552 INGQLTGSVIG----HWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQV 607
+NG GS G V + G N + LLS VGL N G E G G V
Sbjct: 511 VNGHFYGSAFGTRENRKFSFSSQVNLRGGANKIALLSVAVGLPNVGPHFETWATGIVGSV 570
Query: 608 KLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS-IEENEAEWT--DLTRDGIPSTFTWYKT 664
L G G+ DLS WTYQ GL+GE + S E++ +W L + TWYK
Sbjct: 571 VLHGLDEGNKDLSWQKWTYQAGLRGESMNLVSPTEDSSVDWIKGSLAKQN-KQPLTWYKA 629
Query: 665 YFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTT 724
YFD P G +P+ALDL SMGKGQAW+NG IGRYW A KG C +C+Y G Y +KC +
Sbjct: 630 YFDVPRGNEPLALDLKSMGKGQAWINGQSIGRYWMAFA-KGDC-GSCNYAGTYRQNKCQS 687
Query: 725 NCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRS 767
CG PTQ WYHVPRSWL+ NLLV+FEE GG+ ++SV RS
Sbjct: 688 GCGEPTQRWYHVPRSWLKPKGNLLVLFEELGGDISKVSVVKRS 730
>gi|68161828|emb|CAJ09953.1| beta-galactosidase [Mangifera indica]
Length = 827
Score = 748 bits (1932), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/841 (46%), Positives = 514/841 (61%), Gaps = 81/841 (9%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
++ + I LSC +NVS+D RAIIIDG RR+L+S IHYPR+TPEMWPD
Sbjct: 10 LLFQAVFISLSCA-----------YNVSHDGRAIIIDGQRRVLLSGSIHYPRSTPEMWPD 58
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
LI K+KEGG D IETYVFWNAHE R QY+F G D+++F+K + GLY LRIGPYVC
Sbjct: 59 LIRKAKEGGLDAIETYVFWNAHEPARRQYDFSGHLDLIRFIKTIQDEGLYAVLRIGPYVC 118
Query: 141 AEWNFGGFPVWLRDIPGI-EFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQ 199
AEWN+GGFPVWL ++PG+ EFRT N F EMQ F IVD++++E LF+ QGGPII+ Q
Sbjct: 119 AEWNYGGFPVWLHNMPGVQEFRTVNEVFMNEMQNFTTLIVDMVKQEKLFASQGGPIIIAQ 178
Query: 200 IENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGY 259
IENEYGNM S+YG GK Y+ W A MA L GVPW+MC+++DAP+ +I+ CNG+YCD +
Sbjct: 179 IENEYGNMISNYGDAGKVYIDWCAKMAESLDIGVPWIMCQESDAPQPMINTCNGWYCDSF 238
Query: 260 KPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNF 319
PN N P +WTENW GW+ +WGG+ PHR EDLAF+VARFFQ GG+F NYYMY GGTNF
Sbjct: 239 TPNDPNSPKMWTENWTGWFKSWGGKDPHRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNF 298
Query: 320 GRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQ 379
GRTSGGP+ TSYDYDAP+DE+G L++PKWGHLK+LH +K E L + + G
Sbjct: 299 GRTSGGPYLTTSYDYDAPLDEFGNLNQPKWGHLKELHTVLKAMEKTLTHGNVST-TDFGN 357
Query: 380 NQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNT 439
+ A V Y ++ S F N + A++TF G Y +P WSVSILPDC+ +NT
Sbjct: 358 SVTATV-----YATEEGSSCFFGNANTTGDATITFQGSDYVVPAWSVSILPDCKTEAYNT 412
Query: 440 AKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWM--TVKEPIGVWSENNFTV 497
AKV++QTS+ + P Q+ E++ SS W + EP+ V + +F+
Sbjct: 413 AKVNTQTSVI-----------VKKPNQA--ENEPSSLKWVWRPEAIDEPV-VQGKGSFSA 458
Query: 498 QGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLT 557
+++ V D SDYLW++T + + DDI W N T+ +++ VL F+NG+
Sbjct: 459 SFLIDQ-KVINDASDYLWYMTSVDLKPDDI-IWSDNM---TLRVNTTGIVLHAFVNGEHV 513
Query: 558 GSVIGHWVK-------VVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLT 610
GS W K Q V+ G N + LLS TVGLQNYG + AG G V+L
Sbjct: 514 GS---QWTKYGVFKDVFQQQVKLNPGKNQISLLSVTVGLQNYGPMFDMVQAGITGPVELI 570
Query: 611 GFKNGDI---DLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPST--FTWYKTY 665
G K + DLS WTY+VGL G + + + E + + +PS TWYKT
Sbjct: 571 GQKGDETVIKDLSCHKWTYEVGLTGLEDNKFYSKASTNETCGWSAENVPSNSKMTWYKTT 630
Query: 666 FDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGC-QDTCDYRGAYNSDKCT 723
F AP G DPV LDL MGKG AWVNG+++GRYW + +A GC D CDYRG Y+++KC
Sbjct: 631 FKAPLGNDPVVLDLQGMGKGFAWVNGYNLGRYWPSYLAEADGCSSDPCDYRGQYDNNKCV 690
Query: 724 TNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPV 783
TNCG P+Q WYHVPRS+LQ N LV+FEE GGNP++++ + VC E
Sbjct: 691 TNCGQPSQRWYHVPRSFLQDGENTLVLFEEFGGNPWQVNFQTLVVGSVCGNAHEKK---- 746
Query: 784 RKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMS 843
+ L C +G IS+I+FAS+G PQG C F G C
Sbjct: 747 --------------------TLELSC-NGRPISAIKFASFGDPQGTCGSFQAGTCQTEQD 785
Query: 844 L 844
+
Sbjct: 786 I 786
>gi|3641863|emb|CAA06309.1| beta-galactosidase [Cicer arietinum]
Length = 730
Score = 748 bits (1932), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/751 (50%), Positives = 483/751 (64%), Gaps = 38/751 (5%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
M++ ++ C+ S + +V+YDH+AI+I+G RR+LIS IHYPR+TP+MWPDLI
Sbjct: 12 MVIGLVLFLCLFVFSVTA-----SVTYDHKAIVINGQRRILISGSIHYPRSTPQMWPDLI 66
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K+K+GG DVI+TYVFWN HE G Y F+ + D+VKFVK+V +GLY+ LRIGPYVCAE
Sbjct: 67 QKAKDGGVDVIQTYVFWNGHEPSPGNYYFEDRFDLVKFVKVVQQAGLYVNLRIGPYVCAE 126
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
WNFGGFPVWL+ +PG+ FRT+N PFK MQ+F KIV +M+ E LF QGGPIIM QIEN
Sbjct: 127 WNFGGFPVWLKYVPGVAFRTDNEPFKAAMQKFTAKIVSMMKAENLFESQGGPIIMSQIEN 186
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPN 262
EYG +E G GK Y KW + MA+GL GVPW+MCKQ DAP+ IID CNGYYC+ + PN
Sbjct: 187 EYGPVEWEIGAPGKAYTKWFSQMAIGLDTGVPWIMCKQEDAPDPIIDTCNGYYCENFTPN 246
Query: 263 SYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRT 322
KP +WTENW GWYT +G +P+RP +D+AF+VARF Q GS++NYYMY GGTNFGRT
Sbjct: 247 KNYKPKMWTENWSGWYTDFGSAVPYRPAQDVAFSVARFIQNRGSYVNYYMYHGGTNFGRT 306
Query: 323 SGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQE 382
S G F TSYDYDAPIDEYGLLSEPKWGHL++LH AIK CEP LV+ D G+N E
Sbjct: 307 SAGLFIATSYDYDAPIDEYGLLSEPKWGHLRNLHKAIKQCEPILVSVDPTVSWP-GKNLE 365
Query: 383 AHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKV 442
HVY+ S C+AFLAN D + A VTF Y LPPWS+SILPDC+ VFNTAKV
Sbjct: 366 VHVYKT----STGACAAFLANYDTTSPAKVTFGNGQYDLPPWSISILPDCKTAVFNTAKV 421
Query: 443 SSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKE-PIGVWSENNFTVQGIL 501
+ S F ++P S++ W + E P +++ T +L
Sbjct: 422 GTVPS-----FHRKMTP--------------VSSAFDWQSYNEAPASSGIDDSTTANALL 462
Query: 502 EHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVI 561
E + VT+D SDYLW++T + +S ++ F K + P +T S VL VF+NGQ +G+
Sbjct: 463 EQIKVTRDSSDYLWYMTDVNISPNE-GFIKNGQY-PVLTAMSAGHVLHVFVNGQFSGTAY 520
Query: 562 GHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI 617
G + V+ + G N + LLS VGL N G E G G V L G G
Sbjct: 521 GGLENPKLTFSNSVKLRVGNNKISLLSVAVGLSNVGLHYETWNVGVLGPVTLKGLNEGTR 580
Query: 618 DLSKILWTYQVGLKGEFQQIYS-IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVA 676
DLS W+Y++GLKGE +++ I + +WT + TWYK FDAP G DP+A
Sbjct: 581 DLSGQKWSYKIGLKGETLNLHTLIGSSSVQWTKGSSLVKKQPLTWYKATFDAPAGNDPLA 640
Query: 677 LDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHV 736
LD+ SMGKG+ WVNG IGR+W +G C C+Y G + KC T+CG PTQ WYH+
Sbjct: 641 LDMSSMGKGEIWVNGESIGRHWPAYIARGSCGG-CNYAGTFTDKKCRTSCGQPTQKWYHI 699
Query: 737 PRSWLQASNNLLVIFEETGGNPFEISVKLRS 767
PRSW+ N LV+ EE GG+P IS+ R+
Sbjct: 700 PRSWVNPRGNFLVVLEEWGGDPSGISLVKRT 730
>gi|356564721|ref|XP_003550597.1| PREDICTED: beta-galactosidase 7-like [Glycine max]
Length = 831
Score = 748 bits (1932), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/809 (47%), Positives = 512/809 (63%), Gaps = 65/809 (8%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NVS+D RAI IDG RR+LIS IHYPR+TPEMWP+LI K+KEGG D IETYVFWNAHE
Sbjct: 29 NVSHDGRAIKIDGKRRVLISGSIHYPRSTPEMWPELIQKAKEGGLDAIETYVFWNAHEPS 88
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
R Y+F G NDI++F+K + SGLY LRIGPYVCAEWN+GG PVW+ ++P +E RT N+
Sbjct: 89 RRVYDFSGNNDIIRFLKTIQESGLYGVLRIGPYVCAEWNYGGIPVWVHNLPDVEIRTANS 148
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
F EMQ F IVD++++E LF+ QGGPII+ QIENEYGN+ S YG GK Y+ W A+M
Sbjct: 149 VFMNEMQNFTTLIVDMLKKEKLFASQGGPIILTQIENEYGNVISQYGDAGKAYMNWCANM 208
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A L GVPW+MC+++DAP+ +I+ CNG+YCD ++PNS+N P +WTENW GW+ WGGR
Sbjct: 209 AESLKVGVPWIMCQESDAPQPMINTCNGWYCDNFEPNSFNSPKMWTENWIGWFKNWGGRD 268
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHR ED+AFAVARFFQ GG+F NYYMY GGTNFGRT+GGP+ TSYDYDAP+DEYG ++
Sbjct: 269 PHRTAEDVAFAVARFFQTGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYGNIA 328
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLK+LH+A+K E AL + + ++ LG + + +Y N + S FL+N +
Sbjct: 329 QPKWGHLKELHSALKAMEEALTSGNVSE-TDLGNSVKVTIYATN-----GSSSCFLSNTN 382
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
A++TF G +YT+P WSVSILPDC++ +NTAKV QTS+ T E
Sbjct: 383 TTADATLTFRGNNYTVPAWSVSILPDCQHEEYNTAKVKEQTSVMTKE------------- 429
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
S E + + W + + ++N + +L+ + D SDYLW++T+++V D
Sbjct: 430 NSKAEKEAAILKWVWRSENIDKALHGKSNVSAHRLLDQKDAANDASDYLWYMTKLHVKHD 489
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWV-------KVVQPVEFQSGYN 578
D W N T+ I+ V+ F+NG+ S HW K ++ + G N
Sbjct: 490 D-PVWSENM---TLRINGSGHVIHAFVNGEYIDS---HWATYGIHNDKFEPKIKLKHGTN 542
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI---DLSKILWTYQVGLKGEFQ 635
+ LLS TVGLQNYGAF + AG G ++L K + +LS W+Y++GL G
Sbjct: 543 TISLLSVTVGLQNYGAFFDTWHAGLVGPIELVSVKGEETIIKNLSSHKWSYKIGLHGWDH 602
Query: 636 QIYSIEENEAEWTDLTRDGIPST--FTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHH 693
+++S + A + + +P+ TWYKT F AP G DPV +DL MGKG AWVNG +
Sbjct: 603 KLFSDDSPFAAQSKWESEKLPTNRMLTWYKTTFKAPLGTDPVVVDLQGMGKGYAWVNGKN 662
Query: 694 IGRYW-TVVAPKGGCQDT-CDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIF 751
IGR W + A + GC D CDYRG Y+ KC TNCG PTQ WYHVPRS+L+ N LV+F
Sbjct: 663 IGRIWPSYNAEEDGCSDEPCDYRGEYSDSKCVTNCGKPTQRWYHVPRSYLKDGANTLVLF 722
Query: 752 EETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQD 811
E GGNP ++ + VC +N+Y NK + L CQ
Sbjct: 723 AELGGNPSLVNFQTVVVGNVC--------------ANAYE-------NKT---LELSCQ- 757
Query: 812 GYIISSIEFASYGTPQGRCQKFSRGNCHA 840
G IS+I+FAS+G P+G C F+ G+C +
Sbjct: 758 GRKISAIKFASFGDPKGVCGAFTNGSCES 786
>gi|356529081|ref|XP_003533125.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 832
Score = 745 bits (1924), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/822 (47%), Positives = 504/822 (61%), Gaps = 70/822 (8%)
Query: 45 FNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHES 104
F VSYD RAI IDG R++L S IHYPR+T EMWP LI K+KEGG DVIETYVFWNAHE
Sbjct: 20 FEVSYDSRAITIDGKRKVLFSGSIHYPRSTAEMWPSLINKAKEGGLDVIETYVFWNAHEP 79
Query: 105 IRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNN 164
QY+F G D+VKF+K + GLY LRIGPYVCAEWN+GGFPVWL ++P +EFRTNN
Sbjct: 80 QPRQYDFSGNLDLVKFIKTIQKEGLYAMLRIGPYVCAEWNYGGFPVWLHNMPNMEFRTNN 139
Query: 165 APFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAAS 224
+ EMQ F IVD MR E LF+ QGGPII+ QIENEYGN+ S YG+ GK YV+W A
Sbjct: 140 TAYMNEMQTFTTLIVDKMRHENLFASQGGPIILAQIENEYGNIMSEYGENGKQYVQWCAQ 199
Query: 225 MALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGR 284
+A GVPWVMC+Q+DAP+ II+ CNG+YCD + PNS +KP +WTENW GW+ WGG
Sbjct: 200 LAESYKIGVPWVMCQQSDAPDPIINTCNGWYCDQFSPNSKSKPKMWTENWTGWFKNWGGP 259
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
+PHR D+A+AVARFFQ GG+F NYYMY GGTNFGRTSGGP+ TSYDYDAP+DEYG
Sbjct: 260 IPHRTARDVAYAVARFFQYGGTFQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNK 319
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
++PKWGHLK LH +K E L + + G A VY Y +S C FL N
Sbjct: 320 NQPKWGHLKQLHELLKSMEDVLTQG-TTNHTDYGNLLTATVY---NYSGKSAC--FLGNA 373
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
+ A++ F Y +P WSVSILP+C N V+NTAK+++QTSI ++ + S N P
Sbjct: 374 NSSNDATIMFQSTQYIVPAWSVSILPNCVNEVYNTAKINAQTSIMVMKDN--KSDNEEEP 431
Query: 465 QQSMIESKLSSTSKSWMTVKEPI------GVWSENNFTVQGILEHLNVTKDYSDYLWHIT 518
++ +W + EP V + +L+ VT D SDYLW+IT
Sbjct: 432 HSTL----------NWQWMHEPHVQMKDGQVLGSVSRKAAQLLDQKVVTNDTSDYLWYIT 481
Query: 519 QIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVV----QPVEFQ 574
+ +S++D + + + + VL VF+NG G G K ++ +
Sbjct: 482 SVDISEND-------PIWSKIRVSTNGHVLHVFVNGAQAGYQYGQNGKYSFTYEAKIKLK 534
Query: 575 SGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGD---IDLSKILWTYQVGLK 631
G N++ LLS TVGL NYGA G G V+L +N D++ W Y+VGL
Sbjct: 535 KGTNEISLLSGTVGLPNYGAHFSNVSVGVCGPVQLVALQNNTEVVKDITNNTWNYKVGLH 594
Query: 632 GEFQQIYSIEENEAEWTDLTRDGIPS--TFTWYKTYFDAPDGIDPVALDLGSMGKGQAWV 689
GE ++Y E N+ W +G+P+ F WYKT F +P G DPV +DL + KGQAWV
Sbjct: 595 GEIVKLYCPENNKG-W---NTNGLPTNRVFVWYKTLFKSPKGTDPVVVDLKGLKKGQAWV 650
Query: 690 NGHHIGRYWT-VVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASN-NL 747
NG++IGRYWT +A GC TC+YRG Y+SDKC T CG PTQ WYHVPRS+L+ N N
Sbjct: 651 NGNNIGRYWTRYLADDNGCTATCNYRGPYSSDKCITKCGRPTQRWYHVPRSFLRQDNQNT 710
Query: 748 LVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHL 807
LV+FEE GG+P E+ +C +NSY +G + + L
Sbjct: 711 LVLFEEFGGHPNEVKFATVMVEKIC--------------ANSY--EGNV--------LEL 746
Query: 808 HCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
C++ +IS I+FAS+G P+G C F + C +P +LS++S+
Sbjct: 747 SCREEQVISKIKFASFGVPEGECGSFKKSQCESPNALSILSK 788
>gi|297793199|ref|XP_002864484.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297310319|gb|EFH40743.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 726
Score = 742 bits (1916), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/755 (50%), Positives = 495/755 (65%), Gaps = 48/755 (6%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
++++ + L C+ +S VSYD +A+II+G RR+L+S IHYPR+TPEMWP
Sbjct: 12 FLVILCCLSLVCIVKAS---------VSYDRKAVIINGQRRILLSGSIHYPRSTPEMWPG 62
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
LI K+KEGG DVIETYVFWN HE GQY F + D+VKF+KLV +GLY+ LRIGPYVC
Sbjct: 63 LIQKAKEGGLDVIETYVFWNGHEPSPGQYYFGDRYDLVKFIKLVHQAGLYVNLRIGPYVC 122
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIML-- 198
AEWNFGGFPVWL+ +PG+ FRT+N PFK M++F +KIV +M+ E LF QGGPII+
Sbjct: 123 AEWNFGGFPVWLKFVPGMAFRTDNEPFKAAMKKFTEKIVWMMKAEKLFQTQGGPIILAQG 182
Query: 199 QIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDG 258
QIENEYG +E G GK Y KW A MALGL GVPW+MCKQ DAP IID CNGYYC+
Sbjct: 183 QIENEYGPVEWEIGAPGKAYTKWVAQMALGLSTGVPWIMCKQEDAPSPIIDTCNGYYCED 242
Query: 259 YKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTN 318
+KPNS NKP +WTENW GWYT +GG +P+RPVED+A++VARF Q+GGSF+NYYMY GGTN
Sbjct: 243 FKPNSSNKPKMWTENWTGWYTEFGGAVPYRPVEDIAYSVARFIQKGGSFVNYYMYHGGTN 302
Query: 319 FGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLG 378
F RT+ G F +SYDYDAP+DEYGL EPK+ HLK LH IKL EPAL++AD A LG
Sbjct: 303 FDRTA-GEFMASSYDYDAPLDEYGLPREPKYSHLKALHKVIKLSEPALLSAD-ATVTSLG 360
Query: 379 QNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFN 438
QEA+V+ S+S+C+AFL+N DE +AA V F G Y LPPWSVSILPDC+ +N
Sbjct: 361 AKQEAYVFW-----SKSSCAAFLSNKDESSAARVMFRGFPYVLPPWSVSILPDCKTEFYN 415
Query: 439 TAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSE-NNFTV 497
TAKV++ + + + VP + SW + E +E F
Sbjct: 416 TAKVNAPSVHRNM-----------VPTGARF---------SWGSFNEATPTANEAGTFAR 455
Query: 498 QGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLT 557
G++E +++T D SDY W++T I + + +F KT + P T+ S L VF+NGQL+
Sbjct: 456 NGLVEQISMTWDKSDYFWYLTDITIGSGE-TFLKTGDF-PLFTVMSAGHALHVFVNGQLS 513
Query: 558 GSVIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFK 613
G+ G + Q ++ +G N L LLS VGL N G E+ G G V L G
Sbjct: 514 GTAYGGLDHPKLTFTQKIKLHAGVNKLALLSVAVGLPNVGTHFEQWNKGVLGPVTLKGVN 573
Query: 614 NGDIDLSKILWTYQVGLKGEFQQIYS-IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGI 672
+G D+SK W+Y++G+KGE +++ E + WT + TWYK+ F P G
Sbjct: 574 SGTWDMSKWKWSYKIGVKGEALSLHTDTESSGVRWTQGSFVAKKQPLTWYKSTFATPAGN 633
Query: 673 DPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQT 732
+P+ALD+ +MGKGQ W+NG +IGR+W +G C C+Y G +N+ KC +NCG +Q
Sbjct: 634 EPLALDMNTMGKGQVWINGRNIGRHWPAYKAQGSC-GRCNYAGTFNAKKCLSNCGEASQR 692
Query: 733 WYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRS 767
WYHVPRSWL+ S NL+V+FEE GG+P IS+ R+
Sbjct: 693 WYHVPRSWLK-SQNLIVVFEEWGGDPNGISLVKRT 726
>gi|449527779|ref|XP_004170887.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cucumis
sativus]
Length = 716
Score = 742 bits (1915), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/756 (50%), Positives = 491/756 (64%), Gaps = 49/756 (6%)
Query: 20 PMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWP 79
P +++ + L+ V S+ + V+YD +AIII+ RR+LIS IHYPR+TP+MWP
Sbjct: 2 PKTVLLFLSLLTWVGSTIGA-------VTYDEKAIIINDQRRILISGSIHYPRSTPQMWP 54
Query: 80 DLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYV 139
DLI K+K+GG D+IETYVFWN HE G+Y F+ + D+V F+KLV +GLY+ LRIGPYV
Sbjct: 55 DLIQKAKDGGLDIIETYVFWNGHEPSEGKYYFEERYDLVGFIKLVQKAGLYVHLRIGPYV 114
Query: 140 CAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQ 199
CAEWN+GGFP+WL+ +PGI FRT+N PFK MQ+FV KIVD+M+ E L+ QGGPII+ Q
Sbjct: 115 CAEWNYGGFPIWLKFVPGIAFRTDNEPFKAAMQKFVTKIVDMMKLEKLYHTQGGPIILSQ 174
Query: 200 IENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGY 259
IENEYG +E G GK Y KW A MA+ L GVPWVMCKQ DAP+ +ID CNG+YC+ +
Sbjct: 175 IENEYGPVEWQIGAPGKSYTKWFAQMAVDLKTGVPWVMCKQEDAPDPLIDTCNGFYCENF 234
Query: 260 KPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNF 319
KPN KP +WTENW GWYT +GG P+RP ED+AF+VARF Q GS +NYY+Y GGTNF
Sbjct: 235 KPNQIYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQNNGSLVNYYVYHGGTNF 294
Query: 320 GRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQ 379
GRTS G F TSYD+DAPIDEYGL+ EPKWGHL+DLH AIK CEPALV+AD LG+
Sbjct: 295 GRTS-GLFIATSYDFDAPIDEYGLIREPKWGHLRDLHKAIKSCEPALVSADPT-ITWLGK 352
Query: 380 NQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNT 439
NQEA V++ S S C+AFLAN D + V F Y LPPWS+SILPDC FNT
Sbjct: 353 NQEARVFK-----SSSACAAFLANYDTSASVKVNFWNNPYDLPPWSISILPDCXTVTFNT 407
Query: 440 AKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVK-EPIGVWSENNFTVQ 498
A+V ++ + +P+S S W++ K EP ++++ T
Sbjct: 408 AQVGVKSYQAKM---MPIS------------------SFGWLSYKEEPASAYAKDTTTKA 446
Query: 499 GILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG 558
G++E +++T D +DYLW++ I + D F K+ + P ++++S +L VFINGQL+G
Sbjct: 447 GLVEQVSITWDTTDYLWYMQDISI-DSTEGFLKSGK-WPLLSVNSAGHLLHVFINGQLSG 504
Query: 559 SVIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKN 614
SV G + + V+ + G N L +LS TVGL N G + AG G V L G
Sbjct: 505 SVYGSLEDPAITFSKNVDLKQGVNKLSMLSVTVGLPNVGLHFDTWNAGVLGPVTLEGLNE 564
Query: 615 GDIDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWT--DLTRDGIPSTFTWYKTYFDAPDG 671
G D+SK W+Y+VGL GE +YS + N +WT LT+ TWYKT F P G
Sbjct: 565 GTRDMSKYKWSYKVGLSGESLNLYSDKGSNSVQWTKGSLTQK---QPLTWYKTTFKTPAG 621
Query: 672 IDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQ 731
+P+ LD+ SM KGQ W+NG IGRY+ G C D C Y G + KC NCG P+Q
Sbjct: 622 NEPLGLDMSSMSKGQIWINGQSIGRYFPGYIANGKC-DKCSYAGLFTEKKCLGNCGEPSQ 680
Query: 732 TWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRS 767
WYH+PR WL S+NLLVIFEE GG+P IS+ R+
Sbjct: 681 KWYHIPRDWLSPSDNLLVIFEEIGGSPDGISLVKRT 716
>gi|224053294|ref|XP_002297749.1| predicted protein [Populus trichocarpa]
gi|222845007|gb|EEE82554.1| predicted protein [Populus trichocarpa]
Length = 823
Score = 737 bits (1902), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/819 (45%), Positives = 512/819 (62%), Gaps = 72/819 (8%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD RAIIIDG R+L+S IHYPR+T +MWPDL+ KS+EGG D IETYVFW++HE R
Sbjct: 25 VTYDGRAIIIDGKHRLLVSGSIHYPRSTAQMWPDLVKKSREGGLDAIETYVFWDSHEPAR 84
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
+Y+F G D+++F+K + GLY LRIGPYVCAEWN+GGFPVWL ++PG++ RT N
Sbjct: 85 REYDFSGNLDLIRFLKTIQDEGLYAVLRIGPYVCAEWNYGGFPVWLHNMPGVQMRTANDV 144
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
F EM+ F IV+++++E LF+ QGGP+I+ QIENEYGN+ SSYG +GK Y++W A+MA
Sbjct: 145 FMNEMRNFTTLIVNMVKQENLFASQGGPVILAQIENEYGNVMSSYGDEGKAYIEWCANMA 204
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
L GVPW+MC+Q+DAPE +I+ CNG+YCD + PN P +WTENW GW+ +WGG+ P
Sbjct: 205 QSLHIGVPWLMCQQSDAPEPMINTCNGWYCDQFTPNRPTSPKMWTENWTGWFKSWGGKDP 264
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
HR EDLAF+VARF+Q GG+F NYYMY GGTNFGRT+GGP+ TSYDYDAP+DEYG L++
Sbjct: 265 HRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYGNLNQ 324
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHLK+LH + E L + + + G + +Y ++ S FL N D
Sbjct: 325 PKWGHLKELHDVLHSMEDTLTRGNISS-VDFGNSVSGTIYS-----TEKGSSCFLTNTDS 378
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
++ F G Y +P WSVSILPDC++ V+NTAKVS+QTS+ V ++
Sbjct: 379 RNDTTINFQGLDYEVPAWSVSILPDCQDVVYNTAKVSAQTSVM-------------VKKK 425
Query: 467 SMIESKLSSTSKSWM-TVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
++ E + ++ + SW + ++ + +V IL+ + D SDYL+++T + + +D
Sbjct: 426 NVAEDEPAALTWSWRPETNDKSILFGKGEVSVNQILDQKDAANDLSDYLFYMTSVSLKED 485
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK-------VVQPVEFQSGYN 578
D W N T+ I VL VF+NG+ GS W K Q ++ G N
Sbjct: 486 D-PIWGDNM---TLRITGSGQVLHVFVNGEFIGS---QWAKYGVFDYVFEQQIKLNKGKN 538
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI---DLSKILWTYQVGLKGEFQ 635
+ LLS TVG NYGA + AG RG V+L G+ + +I DLS W+Y+VGL+G Q
Sbjct: 539 TITLLSATVGFANYGANFDLTQAGVRGPVELVGYHDDEIIIKDLSSHKWSYKVGLEGLRQ 598
Query: 636 QIYSIEENEAEWTDLTRDGIPST--FTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHH 693
+YS + ++W +D P+ FTWYK F AP G DPV +DL +GKG AWVNG+
Sbjct: 599 NLYS--SDSSKW---QQDNYPTNKMFTWYKATFKAPLGTDPVVVDLLGLGKGLAWVNGNS 653
Query: 694 IGRYWTVVAPKGGCQ-DTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWL-QASNNLLVIF 751
IGRYW + GC D CDYRG+Y+++KC TNCG PTQ WYHVPRS+L +N LV+F
Sbjct: 654 IGRYWPSFIAEDGCSLDPCDYRGSYDNNKCVTNCGKPTQRWYHVPRSFLNNEGDNTLVLF 713
Query: 752 EETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQD 811
EE GG+P ++ + + C E ++ L CQ
Sbjct: 714 EEFGGDPSSVNFQTTAIGSACVNAEEKK------------------------KIELSCQ- 748
Query: 812 GYIISSIEFASYGTPQGRCQKFSRGNCHAPM-SLSVVSE 849
G IS+I+FAS+G P G C FS+G C A +LS+V +
Sbjct: 749 GRPISAIKFASFGNPLGTCGSFSKGTCEASNDALSIVQK 787
>gi|15242897|ref|NP_201186.1| beta-galactosidase 10 [Arabidopsis thaliana]
gi|75171772|sp|Q9FN08.1|BGL10_ARATH RecName: Full=Beta-galactosidase 10; Short=Lactase 10; Flags:
Precursor
gi|10177669|dbj|BAB11029.1| beta-galactosidase [Arabidopsis thaliana]
gi|20260438|gb|AAM13117.1| unknown protein [Arabidopsis thaliana]
gi|34098797|gb|AAQ56781.1| At5g63810 [Arabidopsis thaliana]
gi|332010417|gb|AED97800.1| beta-galactosidase 10 [Arabidopsis thaliana]
Length = 741
Score = 736 bits (1900), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/774 (47%), Positives = 496/774 (64%), Gaps = 43/774 (5%)
Query: 7 NRALLQCLALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISA 66
NR + +A + ++++M+ L S A+ NVSYDHR++ I R+++ISA
Sbjct: 2 NRVTTESIASTA----ILVVMVFLFSWRSIEAA------NVSYDHRSLTIGNRRQLIISA 51
Query: 67 GIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGS 126
IHYPR+ P MWP L+ +KEGG + IE+YVFWN HE G+Y F G+ +IVKF+K+V
Sbjct: 52 AIHYPRSVPAMWPSLVQTAKEGGCNAIESYVFWNGHEPSPGKYYFGGRYNIVKFIKIVQQ 111
Query: 127 SGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEM 186
+G+++ LRIGP+V AEWN+GG PVWL +PG FR +N P+K M+ F IV+L+++E
Sbjct: 112 AGMHMILRIGPFVAAEWNYGGVPVWLHYVPGTVFRADNEPWKHYMESFTTYIVNLLKQEK 171
Query: 187 LFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPEN 246
LF+ QGGPII+ Q+ENEYG E YG+ GK Y +W+ASMA+ GVPW+MC+Q DAP
Sbjct: 172 LFAPQGGPIILSQVENEYGYYEKDYGEGGKRYAQWSASMAVSQNIGVPWMMCQQWDAPPT 231
Query: 247 IIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGS 306
+I CNG+YCD + PN+ +KP +WTENW GW+ T+GGR PHRP ED+A++VARFF +GGS
Sbjct: 232 VISTCNGFYCDQFTPNTPDKPKIWTENWPGWFKTFGGRDPHRPAEDVAYSVARFFGKGGS 291
Query: 307 FMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
NYYMY GGTNFGRTSGGPF TSYDY+APIDEYGL PKWGHLKDLH AI L E L
Sbjct: 292 VHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGLPRLPKWGHLKDLHKAIMLSENLL 351
Query: 367 VAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSV 426
++ + Q LG + EA VY S C+AFL+N+D+ +V F SY LP WSV
Sbjct: 352 ISGEH-QNFTLGHSLEADVYT----DSSGTCAAFLSNLDDKNDKAVMFRNTSYHLPAWSV 406
Query: 427 SILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEP 486
SILPDC+ VFNTAKV+S++S VE LP E SS+ W E
Sbjct: 407 SILPDCKTEVFNTAKVTSKSS--KVEM-LP-------------EDLKSSSGLKWEVFSEK 450
Query: 487 IGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRD 546
G+W +F +++H+N TKD +DYLW+ T I VS+++ +F K P + I+S
Sbjct: 451 PGIWGAADFVKNELVDHINTTKDTTDYLWYTTSITVSENE-AFLKKGS-SPVLFIESKGH 508
Query: 547 VLRVFINGQLTGSVIGHWV----KVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAG 602
L VFIN + G+ G+ K+ +PV ++G N++ LLS TVGL N G+F E GAG
Sbjct: 509 TLHVFINKEYLGTATGNGTHVPFKLKKPVALKAGENNIDLLSMTVGLANAGSFYEWVGAG 568
Query: 603 FRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEA-EWTDLTRDGIPSTFTW 661
V + GF G ++L+ W+Y++G++GE +++ + A +WT T+ TW
Sbjct: 569 LT-SVSIKGFNKGTLNLTNSKWSYKLGVEGEHLELFKPGNSGAVKWTVTTKPPKKQPLTW 627
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVA----PKGGCQDTCDYRGAY 717
YK + P G +PV LD+ SMGKG AW+NG IGRYW +A P C CDYRG +
Sbjct: 628 YKVVIEPPSGSEPVGLDMISMGKGMAWLNGEEIGRYWPRIARKNSPNDECVKECDYRGKF 687
Query: 718 NSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIV 771
DKC T CG P+Q WYHVPRSW ++S N LVIFEE GGNP +I + R +V
Sbjct: 688 MPDKCLTGCGEPSQRWYHVPRSWFKSSGNELVIFEEKGGNPMKIKLSKRKVSVV 741
>gi|413926109|gb|AFW66041.1| hypothetical protein ZEAMMB73_706783 [Zea mays]
Length = 785
Score = 736 bits (1899), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/776 (48%), Positives = 485/776 (62%), Gaps = 84/776 (10%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYDHR+++I+G RR+LIS IHYPR+ PEMWP LI K+K+GG DV++TYVFWN HE +
Sbjct: 40 VSYDHRSLVINGRRRILISGSIHYPRSAPEMWPGLIQKAKDGGLDVVQTYVFWNGHEPAQ 99
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY F + D+V+FVKLV +GLY+ LR+GPYVCAEWNFGGFPVWL+ +PGI FRT+N P
Sbjct: 100 GQYYFADRYDLVRFVKLVRQAGLYVHLRVGPYVCAEWNFGGFPVWLKYVPGIRFRTDNGP 159
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ+FV+KIV +M+ E LF WQGGPIIM Q+ENE+G MES G GK Y WAA MA
Sbjct: 160 FKAAMQKFVEKIVSMMKSEGLFEWQGGPIIMAQVENEFGPMESVVGSGGKPYAHWAAQMA 219
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+G AGVPWVMCKQ DAP+ +I+ CNG+YCD + PN+ +KPT+WTE W GW+T +GG P
Sbjct: 220 VGTNAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNNKHKPTMWTEAWTGWFTKFGGAAP 279
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEY----- 341
HRPVEDLAFAVARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDE+
Sbjct: 280 HRPVEDLAFAVARFVQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGMQWL 339
Query: 342 --------------------------------------------GLLSEPKWGHLKDLHA 357
GLL +PKWGHL+++H
Sbjct: 340 LPSLINLNSHRLPRDICRKSSQCGFYLSVVHTWNFWGGGWVYIAGLLRQPKWGHLRNMHR 399
Query: 358 AIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQ 417
AIK EPALV+ D +G ++A+V+++ C+AFL+N +A + F G+
Sbjct: 400 AIKQAEPALVSGDPT-IRSIGNYEKAYVFKSK----NGACAAFLSNYHVKSAVRIRFDGR 454
Query: 418 SYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTS 477
Y LP WS+SILPDC+ VFNTA V T + P+ S + +
Sbjct: 455 HYDLPAWSISILPDCKTAVFNTATVKEPTLL---------------PKMSPVMHRF---- 495
Query: 478 KSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRP 537
+W + E ++ F G++E L++T D SDYLW+ T + + ++ F K+ + P
Sbjct: 496 -AWQSYSEDTNSLDDSAFARDGLIEQLSLTWDKSDYLWYTTHVNIGSNE-RFLKSGQW-P 552
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYG 593
+++ S ++VF+NG+ GSV G + + V+ G N + +LS VGL N G
Sbjct: 553 QLSVYSAGHSMQVFVNGRSYGSVYGGYDNPKLTFSGYVKMWQGSNKISILSSAVGLPNNG 612
Query: 594 AFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEA-EWTDLTR 652
E G G V L+G G DLS W YQVGLKGE ++++ + A EW
Sbjct: 613 DHFELWNVGVLGPVTLSGLNEGKRDLSHQRWIYQVGLKGESLGLHTVTGSSAVEWAG--P 670
Query: 653 DGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCD 712
G TW+K F+AP G DPVALD+GSMGKGQ WVNG H GRYW+ A GC C
Sbjct: 671 GGGTQPLTWHKALFNAPAGSDPVALDMGSMGKGQVWVNGRHAGRYWSYRAHSRGC-GRCS 729
Query: 713 YRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRST 768
Y G Y D+CT+NCG+ +Q WYHVPRSWL+ S NLLV+ EE GG+ +S+ R+T
Sbjct: 730 YAGTYREDQCTSNCGDLSQRWYHVPRSWLKPSGNLLVVLEEYGGDLAGVSLATRTT 785
>gi|255550373|ref|XP_002516237.1| beta-galactosidase, putative [Ricinus communis]
gi|223544723|gb|EEF46239.1| beta-galactosidase, putative [Ricinus communis]
Length = 825
Score = 734 bits (1895), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/809 (46%), Positives = 514/809 (63%), Gaps = 68/809 (8%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
+S+D RAI IDG RR+L+S IHYPR+TP+MWPDLI KSKEGG D IETYVFWN HE R
Sbjct: 25 ISHDGRAITIDGKRRVLLSGSIHYPRSTPQMWPDLIKKSKEGGLDAIETYVFWNVHEPSR 84
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
QY+F G D+V+F+K V GLY LRIGPYVCAEWN+GGFPVWL ++PGIE RT N+
Sbjct: 85 RQYDFGGNLDLVRFIKAVQDEGLYAVLRIGPYVCAEWNYGGFPVWLHNMPGIELRTANSI 144
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
F EMQ F IVD+M++E LF+ QGGPII+ Q+ENEYGN+ SSYG GK Y+ W A+MA
Sbjct: 145 FMNEMQNFTSLIVDMMKQEQLFASQGGPIIIAQVENEYGNVMSSYGAAGKAYIDWCANMA 204
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
L GVPW+MC+Q+DAP+ +I+ CNG+YCD + P++ N P +WTENW GW+ +WGG+ P
Sbjct: 205 ESLNIGVPWIMCQQSDAPDPMINTCNGWYCDQFTPSNPNSPKMWTENWTGWFKSWGGKDP 264
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
HR ED+AFAVARFFQ GG+F NYYMY GGTNFGRT+GGP+ TSYDYDAP+DE+G L++
Sbjct: 265 HRTAEDVAFAVARFFQTGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEFGNLNQ 324
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHLK LH + E L + + + + A +Y ++ +S+C FL+N +E
Sbjct: 325 PKWGHLKQLHDVLHSMEEILTSG-TVSSVDYDNSVTATIYATDK---ESSC--FLSNANE 378
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+ A++ F G +YT+P WSVSILPDC N +NTAKV +QTS+ V +
Sbjct: 379 TSDATIEFKGTTYTIPAWSVSILPDCANVGYNTAKVKTQTSVM-------------VKRD 425
Query: 467 SMIESKLSSTSKSWM--TVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
+ E + +S + SW V + + + + + + I++ V D SDYLW++T + +
Sbjct: 426 NKAEDEPTSLNWSWRPENVDKTV-LLGQGHIHAKQIVDQKAVANDASDYLWYMTSVDLKK 484
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGS-----VIGHWVKVVQPVEFQSGYND 579
DD+ + K +R I+ +L ++NG+ GS + ++V + V+ + G N
Sbjct: 485 DDLIWSKDMSIR----INGSGHILHAYVNGEYLGSQWSEYSVSNYV-FEKSVKLKHGRNL 539
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI---DLSKILWTYQVGLKGEFQQ 636
+ LLS TVGL NYGA + AG G V+L G K + DLS W+Y+VGL G +
Sbjct: 540 ITLLSATVGLANYGANYDLIQAGILGPVELVGRKGDETIIKDLSNNRWSYKVGLLGLEDK 599
Query: 637 IY-SIEENEAEWTDLTRDGIPST--FTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHH 693
+Y S ++ ++W + +P+ TWYKT F AP G DPV LDL +GKG AW+NG+
Sbjct: 600 LYLSDSKHASKWQE---QELPTNKMLTWYKTTFKAPLGTDPVVLDLQGLGKGMAWINGNS 656
Query: 694 IGRYW-TVVAPKGGCQ-DTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIF 751
IGRYW + +A GC D CDYRG Y+++KC +NCG PTQ WYHVPRS+LQ + N LV+F
Sbjct: 657 IGRYWPSFLAEDDGCSTDLCDYRGPYDNNKCVSNCGKPTQRWYHVPRSFLQDNENTLVLF 716
Query: 752 EETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQD 811
EE GGNP +++ + T + C E + + C +
Sbjct: 717 EEFGGNPSQVNFQTVVTGVACVSGDEGEV------------------------VEISC-N 751
Query: 812 GYIISSIEFASYGTPQGRCQKFSRGNCHA 840
G IS+++FAS+G PQG C +G+C
Sbjct: 752 GQSISAVQFASFGDPQGTCGSSVKGSCEG 780
>gi|6686892|emb|CAB64746.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 741
Score = 734 bits (1894), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/774 (47%), Positives = 495/774 (63%), Gaps = 43/774 (5%)
Query: 7 NRALLQCLALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISA 66
NR + +A + ++++M+ L S A+ NVSYDHR++ I R+++ISA
Sbjct: 2 NRVTTESIASTA----ILVVMVFLFSWRSIEAA------NVSYDHRSLTIGNRRQLIISA 51
Query: 67 GIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGS 126
IHYPR+ P MWP L+ +KEGG + IE+YVFWN HE G+Y F G+ +IVKF+K+V
Sbjct: 52 AIHYPRSVPAMWPSLVQTAKEGGCNAIESYVFWNGHEPSPGKYYFGGRYNIVKFIKIVQQ 111
Query: 127 SGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEM 186
+G+++ LRIGP+V AEWN+GG PVWL +PG FR +N P+K M+ F IV+L+++E
Sbjct: 112 AGMHMILRIGPFVAAEWNYGGVPVWLHYVPGTVFRADNEPWKHYMESFTTYIVNLLKQEK 171
Query: 187 LFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPEN 246
LF+ QGGPII+ Q+ENEYG E YG+ GK Y +W+ASMA+ GVPW+MC+Q DAP
Sbjct: 172 LFAPQGGPIILSQVENEYGYYEKDYGEGGKRYAQWSASMAVSQNIGVPWMMCQQWDAPPT 231
Query: 247 IIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGS 306
+I CNG+YCD + PN+ +KP +WTENW GW+ T+GGR PHRP ED+A++VARFF +GGS
Sbjct: 232 VISTCNGFYCDQFTPNTPDKPKIWTENWPGWFKTFGGRDPHRPAEDVAYSVARFFGKGGS 291
Query: 307 FMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
NYYMY GGTNFGRTSGGPF TSYDY+APIDEYGL PKWGHLKDLH AI L E L
Sbjct: 292 VHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGLPRLPKWGHLKDLHKAIMLSENLL 351
Query: 367 VAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSV 426
++ + Q LG + EA VY S C+AFL+N+D+ +V F SY LP WSV
Sbjct: 352 ISGEH-QNFTLGHSLEADVYT----DSSGTCAAFLSNLDDKNDKAVMFRNTSYHLPAWSV 406
Query: 427 SILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEP 486
SILPDC+ VFNTAKV+S++S VE LP E SS+ W E
Sbjct: 407 SILPDCKTEVFNTAKVTSKSS--KVEM-LP-------------EDLKSSSGLKWEVFSEK 450
Query: 487 IGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRD 546
G+W +F +++H+N TKD +DYLW+ T I VS+++ +F K P + I+S
Sbjct: 451 PGIWGAADFVKNELVDHINTTKDTTDYLWYTTSITVSENE-AFLKKGS-SPVLFIESKGH 508
Query: 547 VLRVFINGQLTGSVIGHWV----KVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAG 602
L VFIN + G+ G+ K+ +PV ++G ++ LLS TVGL N G+F E GAG
Sbjct: 509 TLHVFINKEYLGTATGNGTHVPFKLKKPVALKAGETNIDLLSMTVGLANAGSFYEWVGAG 568
Query: 603 FRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEA-EWTDLTRDGIPSTFTW 661
V + GF G ++L+ W+Y++G++GE +++ + A +WT T+ TW
Sbjct: 569 LT-SVSIKGFNKGTLNLTNSKWSYKLGVEGEHLELFKPGNSGAVKWTVTTKPPKKQPLTW 627
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVA----PKGGCQDTCDYRGAY 717
YK + P G +PV LD+ SMGKG AW+NG IGRYW +A P C CDYRG +
Sbjct: 628 YKVVIEPPSGSEPVGLDMISMGKGMAWLNGEEIGRYWPRIARKNSPNDECVKECDYRGKF 687
Query: 718 NSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIV 771
DKC T CG P+Q WYHVPRSW ++S N LVIFEE GGNP +I + R +V
Sbjct: 688 MPDKCLTGCGEPSQRWYHVPRSWFKSSGNELVIFEEKGGNPMKIKLSKRKVSVV 741
>gi|357464797|ref|XP_003602680.1| Beta-galactosidase [Medicago truncatula]
gi|355491728|gb|AES72931.1| Beta-galactosidase [Medicago truncatula]
Length = 781
Score = 732 bits (1889), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/724 (51%), Positives = 468/724 (64%), Gaps = 27/724 (3%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NVSYD R++IIDG R++LISA IHYPR+ P MWP LI +KEGG DVIETYVFWN HE
Sbjct: 26 NVSYDGRSLIIDGQRKLLISASIHYPRSVPAMWPALIQTAKEGGIDVIETYVFWNGHELS 85
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G Y F G+ D+V+F K+V +G+YL LRIGP+V AEWNFGG PVWL IPG FRT N
Sbjct: 86 PGNYYFGGRFDLVQFAKVVQDAGMYLILRIGPFVAAEWNFGGVPVWLHYIPGTVFRTYNQ 145
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PF M++F IV+LM++E LF+ QGGPII+ QIENEYG E+ Y + GK Y WAA M
Sbjct: 146 PFMHHMEKFTTYIVNLMKKEKLFASQGGPIILSQIENEYGYYENYYKEDGKKYALWAAKM 205
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ VPW+MC+Q DAP+ +ID CN +YCD + P S +P +WTENW GW+ T+GGR
Sbjct: 206 AVSQNTSVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPKRPKMWTENWPGWFKTFGGRD 265
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRPVED+AF+VARFFQ+GGS NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGL
Sbjct: 266 PHRPVEDVAFSVARFFQKGGSLNNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLPR 325
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
PKWGHLK+LH AIKLCE L+ S I LG + EA +Y S C+AF++N+D
Sbjct: 326 LPKWGHLKELHKAIKLCEHVLLYGKSVN-ISLGPSVEADIYT----DSSGACAAFISNVD 380
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ V F SY LP WSVSILPDC+N VFNTAKVSS T+I + +P+
Sbjct: 381 DKNDKKVVFRNASYHLPAWSVSILPDCKNVVFNTAKVSSPTNIVAM-----------IPE 429
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
K T K W KE G+W + +F G ++H+N TKD +DYLWH T I + D
Sbjct: 430 HLQQSDKGQKTLK-WDVFKENPGIWGKADFVKNGFVDHINTTKDTTDYLWHTTSILI-DA 487
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGH----WVKVVQPVEFQSGYNDLI 581
+ F K +P + I+S L F+N + G+ G+ P+ ++G N++
Sbjct: 488 NEEFLKKGS-KPALLIESKGHTLHAFVNQKYQGTGTGNGSHSAFTFKNPISLRAGKNEIA 546
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
+LS TVGLQ G F + GAG VK+ G N IDLS W Y++G+ GE IY E
Sbjct: 547 ILSLTVGLQTAGPFYDFIGAGVT-SVKIIGLNNRTIDLSSNAWAYKIGVLGEHLSIYQGE 605
Query: 642 -ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
N +WT + TWYK DAP G +PV LD+ MGKG AW+NG IGRYW
Sbjct: 606 GMNSVKWTSTSEPPKGQALTWYKAIVDAPSGDEPVGLDMLYMGKGLAWLNGEEIGRYWPR 665
Query: 701 VA--PKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
++ K C CDYRG +N DKC T CG P+Q WYHVPRSW + S N+LVIFEE GG+P
Sbjct: 666 ISEFKKEDCVQECDYRGKFNPDKCDTGCGEPSQKWYHVPRSWFKPSGNVLVIFEEKGGDP 725
Query: 759 FEIS 762
+I+
Sbjct: 726 TKIT 729
>gi|359484258|ref|XP_002276918.2| PREDICTED: beta-galactosidase 7-like [Vitis vinifera]
gi|297738528|emb|CBI27773.3| unnamed protein product [Vitis vinifera]
Length = 835
Score = 731 bits (1888), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/824 (45%), Positives = 501/824 (60%), Gaps = 73/824 (8%)
Query: 32 CVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGAD 91
CV + VSYD RA+IIDG RR+L S IHYPR+TPEMWPDLI K+K GG D
Sbjct: 25 CVLFVLLNVLASAVEVSYDGRALIIDGKRRVLQSGSIHYPRSTPEMWPDLIRKAKAGGLD 84
Query: 92 VIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVW 151
IETYVFWN HE +R +Y+F G D+++F++ + + GLY LRIGPYVCAEW +GGFP+W
Sbjct: 85 AIETYVFWNVHEPLRREYDFSGNLDLIRFIQTIQAEGLYAVLRIGPYVCAEWTYGGFPMW 144
Query: 152 LRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSY 211
L ++PGIEFRT N F EMQ F IVD+ ++E LF+ QGGPII+ QIENEYGN+ + Y
Sbjct: 145 LHNMPGIEFRTANKVFMNEMQNFTTLIVDMAKQEKLFASQGGPIIIAQIENEYGNIMAPY 204
Query: 212 GQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWT 271
G GK YV W A+MA L GVPW+MC+Q+DAP+ +I+ CNG+YCD + PN+ N P +WT
Sbjct: 205 GDAGKVYVDWCAAMANSLDIGVPWIMCQQSDAPQPMINTCNGWYCDSFTPNNPNSPKMWT 264
Query: 272 ENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITS 331
ENW GW+ WGG+ PHR EDL+++VARFFQ GG+F NYYMY GGTNFGR +GGP+ TS
Sbjct: 265 ENWTGWFKNWGGKDPHRTAEDLSYSVARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTS 324
Query: 332 YDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRY 391
YDYDAP+DE+G L++PKWGHLKDLH +K E L + I +G + E V Y
Sbjct: 325 YDYDAPLDEFGNLNQPKWGHLKDLHTVLKSMEETLTEGNITT-IDMGNSVEVTV-----Y 378
Query: 392 GSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTV 451
+Q S F +N + A+ T+ G YT+P WSVSILPDC+ V+NTAKV++QTS+
Sbjct: 379 ATQKVSSCFFSNSNTTNDATFTYGGTEYTVPAWSVSILPDCKKEVYNTAKVNAQTSVM-- 436
Query: 452 EFSLPLSPNISVPQQSMIESKLSSTSKSWM-TVKEPIGVWSENNFTVQGILEHLNVTKDY 510
V ++ E + +S SW + + V + + +++ T D
Sbjct: 437 -----------VKNKNEAEDQPASLKWSWRPEMIDDTAVLGKGQVSANRLIDQ-KTTNDR 484
Query: 511 SDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK---- 566
SDYLW++ + +S+DD+ W N T+ +++ +L ++NG+ GS W
Sbjct: 485 SDYLWYMNSVDLSEDDL-VWTDNM---TLRVNATGHILHAYVNGEYLGS---QWATNGIF 537
Query: 567 ---VVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI---DLS 620
+ V+ + G N + LLS T+G QNYGAF + +G G V++ G K + DLS
Sbjct: 538 NYVFEEKVKLKPGKNLIALLSATIGFQNYGAFYDLVQSGISGPVEIVGRKGDETIIKDLS 597
Query: 621 KILWTYQVGLKGEFQQIYSIEENEAEWTD----LTRDGIPSTFTWYKTYFDAPDGIDPVA 676
W+Y+VG+ G ++Y E+ +W + L R+ TWYKT F AP G D V
Sbjct: 598 SHKWSYKVGMHGMAMKLYD-PESPYKWEEGNVPLNRN-----LTWYKTTFKAPLGTDAVV 651
Query: 677 LDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHV 736
+DL +GKG+AWVNG +GRYW + GC TCDYRG Y + KC NCGNPTQ WYHV
Sbjct: 652 VDLQGLGKGEAWVNGQSLGRYWPSSIAEDGCNATCDYRGPYTNTKCVRNCGNPTQRWYHV 711
Query: 737 PRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKL 796
PRS+L A N LV+FEE GGNP ++ + + C E++
Sbjct: 712 PRSFLTADENTLVLFEEFGGNPSLVNFQTVTIGTACGNAYENNV---------------- 755
Query: 797 SINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHA 840
+ L CQ+ IS I+FAS+G PQG C FS+G+C
Sbjct: 756 --------LELACQN-RPISDIKFASFGDPQGSCGSFSKGSCEG 790
>gi|356545784|ref|XP_003541315.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 826
Score = 730 bits (1885), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/854 (45%), Positives = 531/854 (62%), Gaps = 78/854 (9%)
Query: 11 LQCLALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHY 70
+ L+LSV+ +++ I + V VS+D RAIIIDG RR+L+S IHY
Sbjct: 1 MNFLSLSVWFCFVILSFIGSNAVE------------VSHDGRAIIIDGKRRVLLSGSIHY 48
Query: 71 PRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLY 130
PR+TPEMWP+LI K+KEGG D IETYVFWNAHE R Y+F G NDI++F+K + SGLY
Sbjct: 49 PRSTPEMWPELIQKAKEGGLDAIETYVFWNAHEPSRRVYDFSGNNDIIRFLKTIQESGLY 108
Query: 131 LQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSW 190
LRIGPYVCAEWN+GG PVW+ ++P +E RT N+ + EMQ F IVD++++E LF+
Sbjct: 109 GVLRIGPYVCAEWNYGGIPVWVHNLPDVEIRTANSVYMNEMQNFTTLIVDMVKKEKLFAS 168
Query: 191 QGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDA 250
QGGPII+ QIENEYGN+ S YG GK Y+ W A+MA L GVPW+MC+++DAP+++I+
Sbjct: 169 QGGPIILTQIENEYGNVISHYGDAGKAYMNWCANMAESLNVGVPWIMCQESDAPQSMINT 228
Query: 251 CNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNY 310
CNG+YCD ++PN+ + P +WTENW GW+ WGGR PHR ED+AFAVARFFQ GG+F NY
Sbjct: 229 CNGFYCDNFEPNNPSSPKMWTENWVGWFKNWGGRDPHRTAEDVAFAVARFFQTGGTFQNY 288
Query: 311 YMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAAD 370
YMY GGTNF RT+GGP+ TSYDYDAP+DEYG +++PKWGHLK+LH +K E L + +
Sbjct: 289 YMYHGGTNFDRTAGGPYITTSYDYDAPLDEYGNIAQPKWGHLKELHNVLKSMEETLTSGN 348
Query: 371 SAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILP 430
++ G + +A +Y N + S FL++ + T A++TF G++YT+P WSVSILP
Sbjct: 349 VSE-TDFGNSVKATIYATN-----GSSSCFLSSTNTTTDATLTFRGKNYTVPAWSVSILP 402
Query: 431 DCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVW 490
DC + +NTAKV+ QTS+ V + S E + ++ W + +
Sbjct: 403 DCEHEEYNTAKVNVQTSVM-------------VKENSKAEEEATALKWVWRSENIDNALH 449
Query: 491 SENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRV 550
++N + +L+ + D SDYLW++T+++V DD W N T+ I+S V+
Sbjct: 450 GKSNVSANRLLDQKDAANDASDYLWYMTKLHVKHDD-PVWGENM---TLRINSSGHVIHA 505
Query: 551 FINGQLTGSVIGHWV-------KVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGF 603
F+NG+ GS HW K ++ + G N + LLS TVGLQNYGAF + AG
Sbjct: 506 FVNGEHIGS---HWATYGIHNDKFEPKIKLKHGTNTISLLSVTVGLQNYGAFFDTWHAGL 562
Query: 604 RGQVKLTGFKNGDI---DLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPS--T 658
++L K + +LS W+Y+VGL G +++S + A + +P+
Sbjct: 563 VEPIELVSVKGDETIIKNLSSNKWSYKVGLHGWDHKLFSDDSPFAAPNKWESEKLPTDRM 622
Query: 659 FTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDT-CDYRGA 716
TWYKT F+AP G DPV +DL MGKG AWVNG +IGR W + A + GC D CDYRG
Sbjct: 623 LTWYKTTFNAPLGTDPVVVDLQGMGKGYAWVNGQNIGRIWPSYNAEEDGCSDEPCDYRGE 682
Query: 717 YNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVS 776
Y KC TNCG PTQ WYHVPRS+L+ N LV+F E GGNP +++ + VC
Sbjct: 683 YTDSKCVTNCGKPTQRWYHVPRSYLKDGANNLVLFAELGGNPSQVNFQTVVVGTVC---- 738
Query: 777 ESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRG 836
+N+Y NK + L CQ G IS+I+FAS+G P+G C F+ G
Sbjct: 739 ----------ANAYE-------NKT---LELSCQ-GRKISAIKFASFGDPEGVCGAFTNG 777
Query: 837 NCHAPM-SLSVVSE 849
+C + +LS+V +
Sbjct: 778 SCESKSNALSIVQK 791
>gi|334305536|gb|AEG76892.1| putative beta-galactosidase [Linum usitatissimum]
gi|334305538|gb|AEG76893.1| putative beta-galactosidase [Linum usitatissimum]
Length = 731
Score = 728 bits (1878), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/726 (50%), Positives = 466/726 (64%), Gaps = 34/726 (4%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD +AII++G RR+LI+ IHYPR+TPEMWPDLI K+K+GG DVI+TYVFWN HE
Sbjct: 31 VTYDGKAIIVNGQRRILIAGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPSP 90
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G Y F+ + D+VKFVK+V +GLY+ LRIGPY CAEWNFGGFPVWL+ +PG+ FRT+N P
Sbjct: 91 GNYYFEDRFDLVKFVKVVQQAGLYVNLRIGPYACAEWNFGGFPVWLKYVPGMSFRTDNEP 150
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ+F +KIV++M++E LF QGGPII+ QIENEYG +E GK Y +WAA MA
Sbjct: 151 FKAAMQKFTEKIVNMMKQEQLFEPQGGPIILSQIENEYGPIEWELKAPGKAYAQWAAQMA 210
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+GL GVPW+ CKQ DAP+ +ID CN YYC+ + PN KP +WTE W W+T+WG +
Sbjct: 211 VGLNTGVPWIACKQEDAPDPLIDTCNAYYCEKFTPNKSYKPKMWTEAWTAWFTSWGNPVL 270
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
+RP ED AF+V +F Q GGS+ NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL ++
Sbjct: 271 YRPAEDQAFSVLKFIQSGGSYANYYMYHGGTNFGRTAGGPFVATSYDYDAPLDEYGLTND 330
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PK+ HLK +H AIK E ALV+AD A LG NQEAHVY S S C+AFLAN D
Sbjct: 331 PKYTHLKHMHKAIKQSEKALVSAD-ATVTSLGTNQEAHVYS-----SSSGCAAFLANYDV 384
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+ V F Y LP WS+SILPDC+ V+NTAKV + K ++P
Sbjct: 385 SYSVKVNFGSGQYDLPAWSISILPDCKTEVYNTAKVLAPRVHKK------MTPLGGFTWD 438
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
S I+ S ++ + T G+ E L +TKD SDYLW++ + + D+
Sbjct: 439 SYIDEVASG--------------FASDTTTEDGLWEQLYMTKDSSDYLWYMQDVKIGSDE 484
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGH----WVKVVQPVEFQSGYNDLIL 582
TN P + + S L VF+NG+L GS G + Q V+ G N + L
Sbjct: 485 AFL--TNGKDPFLNVQSAGHFLNVFVNGKLIGSAYGSNDNPKLTFSQSVKLNVGVNKIAL 542
Query: 583 LSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE- 641
LS +VGL N G E G G V LTG G +D++K W+Y+VG++GE Q+ ++
Sbjct: 543 LSASVGLANVGLHFENYNVGVLGPVTLTGLNQGTVDMTKWKWSYKVGVQGEKLQLNTVAG 602
Query: 642 ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVV 701
+ EW + TWYK+ F+AP+G DPVALD+ SMGKGQ W+NG IGRYW
Sbjct: 603 SSSVEWVKGSMLAKKQPLTWYKSTFNAPEGNDPVALDMISMGKGQIWINGQGIGRYWPAY 662
Query: 702 APKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEI 761
+G C C Y G + KC T CG PTQ WYHVPRSWL+ + NLLV+FEE GG+P I
Sbjct: 663 TAQGNC-GGCSYGGYFTEKKCLTGCGQPTQRWYHVPRSWLKPTGNLLVVFEEWGGDPTGI 721
Query: 762 SVKLRS 767
S+ R+
Sbjct: 722 SMVKRT 727
>gi|1352075|sp|P49676.1|BGAL_BRAOL RecName: Full=Beta-galactosidase; Short=Lactase; Flags: Precursor
gi|669059|emb|CAA59162.1| beta-galactosidase [Brassica oleracea]
Length = 828
Score = 726 bits (1875), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/840 (44%), Positives = 522/840 (62%), Gaps = 67/840 (7%)
Query: 22 MMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDL 81
++ + +I ++ S++++ VS+D RAI IDG RR+L+S IHYPR+T +MWPDL
Sbjct: 8 LLSLFLILITSFGSANSTI------VSHDERAITIDGQRRILLSGSIHYPRSTSDMWPDL 61
Query: 82 IAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCA 141
I+K+K+GG D IETYVFWNAHE R QY+F G D+V+F+K + S+GLY LRIGPYVCA
Sbjct: 62 ISKAKDGGLDTIETYVFWNAHEPSRRQYDFSGNLDLVRFIKTIQSAGLYSVLRIGPYVCA 121
Query: 142 EWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIE 201
EWN+GGFPVWL ++P ++FRT N F EMQ F KIV++M+EE LF+ QGGPII+ QIE
Sbjct: 122 EWNYGGFPVWLHNMPDMKFRTINPGFMNEMQNFTTKIVNMMKEESLFASQGGPIILAQIE 181
Query: 202 NEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKP 261
NEYGN+ SSYG +GK Y+ W A+MA L GVPW+MC+Q AP+ +I+ CNG+YCD YKP
Sbjct: 182 NEYGNVISSYGAEGKAYIDWCANMANSLDIGVPWIMCQQPHAPQPMIETCNGFYCDQYKP 241
Query: 262 NSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGR 321
++ + P +WTENW GW+ WGG+ P+R EDLAF+VARFFQ GG+F NYYMY GGTNFGR
Sbjct: 242 SNPSSPKMWTENWTGWFKNWGGKHPYRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFGR 301
Query: 322 TSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQ 381
+GGP+ TSYDYDAP+DEYG L++PKWGHLK LH +K E L + + I LG +
Sbjct: 302 VAGGPYITTSYDYDAPLDEYGNLNQPKWGHLKQLHTLLKSMEKPLTYGNIST-IDLGNSV 360
Query: 382 EAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAK 441
A VY N +S+C F+ N++ A V F G+ Y +P WSVS+LPDC +NTA+
Sbjct: 361 TATVYSTNE---KSSC--FIGNVNATADALVNFKGKDYNVPAWSVSVLPDCDKEAYNTAR 415
Query: 442 VSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGIL 501
V++QTSI T + + P+ KL T + T ++ I + + +G++
Sbjct: 416 VNTQTSIITED-------SCDEPE------KLKWTWRPEFTTQKTI-LKGSGDLIAKGLV 461
Query: 502 EHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVI 561
+ +VT D SDYLW++T++++ D W N ++ + S VL ++NG+ G+ I
Sbjct: 462 DQKDVTNDASDYLWYMTRVHLDKKD-PIWSRNM---SLRVHSNAHVLHAYVNGKYVGNQI 517
Query: 562 GHWVK----VVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI 617
K + V G N L LLS +VGLQNYG F E G G VKL G+K +
Sbjct: 518 VRDNKFDYRFEKKVNLVHGTNHLALLSVSVGLQNYGPFFESGPTGINGPVKLVGYKGDET 577
Query: 618 ---DLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPS--TFTWYKTYFDAPDGI 672
DLSK W Y++GL G +++S++ + + +P+ +WYK F AP G
Sbjct: 578 IEKDLSKHQWDYKIGLNGFNHKLFSMKSAGHHHRKWSTEKLPADRMLSWYKANFKAPLGK 637
Query: 673 DPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQ 731
DPV +DL +GKG+ W+NG IGRYW + + GC + CDYRG Y SDKC CG PTQ
Sbjct: 638 DPVIVDLNGLGKGEVWINGQSIGRYWPSFNSSDEGCTEECDYRGEYGSDKCAFMCGKPTQ 697
Query: 732 TWYHVPRSWLQ-ASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSY 790
WYHVPRS+L +N + +FEE GG+P + K T VC + E +
Sbjct: 698 RWYHVPRSFLNDKGHNTITLFEEMGGDPSMVKFKTVVTGRVCAKAHEHN----------- 746
Query: 791 SVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCH-APMSLSVVSE 849
++ L C + IS+++FAS+G P G+C F+ G+C A ++ VV++
Sbjct: 747 -------------KVELSCNN-RPISAVKFASFGNPSGQCGSFAAGSCEGAKDAVKVVAK 792
>gi|357437609|ref|XP_003589080.1| Beta-galactosidase [Medicago truncatula]
gi|355478128|gb|AES59331.1| Beta-galactosidase [Medicago truncatula]
Length = 718
Score = 726 bits (1875), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/727 (51%), Positives = 467/727 (64%), Gaps = 37/727 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYDH+A++IDG RR+LIS IHYPR+TPEMWPDL K+K+GG DVI+TYVFWN HE
Sbjct: 24 SVSYDHKALVIDGQRRILISGSIHYPRSTPEMWPDLFQKAKDGGLDVIQTYVFWNGHEPS 83
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G Y K + D VK KL + L + LR+ P F GFPVWL+ +PG+ FRT+N
Sbjct: 84 PGNYTLKDRLDWVKLSKLAQQAVLNVHLRMVP------TFVGFPVWLKYVPGMAFRTDNE 137
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ+F KIV +M+ E LF QGGPIIM QIENEYG +E G GK Y KWAA M
Sbjct: 138 PFKAAMQKFTTKIVTMMKAESLFQTQGGPIIMSQIENEYGPVEWEIGAPGKAYTKWAAQM 197
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL GVPW MCKQ DAP+ +ID CNGYYC+ + PN KP +WTENW GWYT +GG +
Sbjct: 198 AVGLDTGVPWDMCKQEDAPDPVIDTCNGYYCENFTPNENFKPKMWTENWSGWYTDFGGAI 257
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
HRP EDLA++VA F Q GSF+NYYMY GGTNFGRTS G F TSYDYDAPIDEYGL +
Sbjct: 258 SHRPTEDLAYSVATFIQNRGSFVNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYGLPN 317
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPKW HLK+LH AIK CEPAL++ D +N EAHVY N S C+AFLAN D
Sbjct: 318 EPKWSHLKNLHKAIKQCEPALISVDPTVTWLGNKNLEAHVYYVN----TSICAAFLANYD 373
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+AA+VTF Y LPPWSVSILPDC+ VFNTA V+ + F ++P
Sbjct: 374 TKSAATVTFGNGQYDLPPWSVSILPDCKTVVFNTATVNGHS------FHKRMTP------ 421
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
+E+ S S +EP +++ + E +NVT+D SDYLW++T + +S
Sbjct: 422 ---VETTFDWQSYS----EEPAYSSDDDSIIANALWEQINVTRDSSDYLWYLTDVNISPS 474
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLI 581
+ SF K + PT+TI+S VL VF+NGQL+G+V G V + V + G N +
Sbjct: 475 E-SFIKNGQF-PTLTINSAGHVLHVFVNGQLSGTVYGGLDNPKVTFSESVNLKVGNNKIS 532
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS VGL N G E G G V+L G G DLS W+Y+VGLKGE +++I
Sbjct: 533 LLSVAVGLPNVGLHFETWNVGVLGPVRLKGLDEGTRDLSWQKWSYKVGLKGESLSLHTIT 592
Query: 642 ENEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ + +WT + TWYKT FDAP G DPVALD+ SMGKG+ W+N IGR+W
Sbjct: 593 GSSSIDWTQGSSLAKKQPLTWYKTTFDAPSGNDPVALDMSSMGKGEIWINDQSIGRHWPA 652
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
G C D C+Y G + + KC TNCG PTQ WYH+PRSWL +S N+LV+ EE GG+P
Sbjct: 653 YIAHGNC-DECNYAGTFTNPKCRTNCGEPTQKWYHIPRSWLSSSGNVLVVLEEWGGDPTG 711
Query: 761 ISVKLRS 767
IS+ R+
Sbjct: 712 ISLVKRT 718
>gi|297793967|ref|XP_002864868.1| beta-galactosidase 10 [Arabidopsis lyrata subsp. lyrata]
gi|297310703|gb|EFH41127.1| beta-galactosidase 10 [Arabidopsis lyrata subsp. lyrata]
Length = 740
Score = 726 bits (1875), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/735 (49%), Positives = 477/735 (64%), Gaps = 33/735 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NVSYDHR++ I R+++ISA IHYPR+ P MWP L+ +KEGG + IE+YVFWN HE
Sbjct: 30 NVSYDHRSLSIGNRRQLIISAAIHYPRSVPAMWPSLVQTAKEGGCNAIESYVFWNGHEPS 89
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+Y F G+ +IVKF+K+V +G+++ LRIGP+V AEWN+GG PVWL +PG FR +N
Sbjct: 90 PRKYYFGGRYNIVKFIKIVQQAGMHMILRIGPFVAAEWNYGGVPVWLHYVPGTVFRADNE 149
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
P+K M+ F IV+L+++E LF+ QGGPII+ Q+ENEYG E YG+ GK Y +W+ASM
Sbjct: 150 PWKHYMESFTTYIVNLLKKEKLFAPQGGPIILSQVENEYGYYEKDYGEGGKRYAQWSASM 209
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ GVPW+MC+Q DAP +I CNG+YCD + PN+ +KP +WTENW GW+ T+GGR
Sbjct: 210 AVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQFTPNTPDKPKIWTENWPGWFKTFGGRD 269
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRP ED+A++VARFF +GGS NYYMY GGTNFGRTSGGPF TSYDY+APIDEYGL
Sbjct: 270 PHRPAEDVAYSVARFFGKGGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGLPR 329
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
PKWGHLKDLH AI L E L+ + Q LG + EA VY S C+AFL+N+D
Sbjct: 330 LPKWGHLKDLHKAIMLSENLLINGEH-QNFTLGHSLEADVYT----DSSGTCAAFLSNLD 384
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ +V F SY LP WSVSILPDC+N VFNTAKV+S+ S VE LP
Sbjct: 385 DKNDKTVMFRNTSYHLPAWSVSILPDCKNEVFNTAKVTSKFS--KVEM-LP--------- 432
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
E SS+ W E G+W E +F +++H+N TKD +DYLW+ T I VS +
Sbjct: 433 ----EDLRSSSGLKWEVFSEKPGIWGEADFVKNELVDHINTTKDTTDYLWYTTSITVSTN 488
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWV----KVVQPVEFQSGYNDLI 581
+ F K P + I+S L VFIN + G+ G+ K+ + V ++G N++
Sbjct: 489 E-EFLKKGS-PPVLFIESKGHTLHVFINKEYLGTATGNGTHVPFKLKKSVALKAGENNID 546
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS TVGL N G+F E GAG V + GF G ++L+ W+Y++G++G +++
Sbjct: 547 LLSMTVGLSNAGSFYEWVGAGLT-SVSIKGFNKGTLNLTNSKWSYKLGVQGVHLELFKPG 605
Query: 642 ENEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
++ A +WT T+ TWYK D P G +PV LD+ SMGKG AW+NG IGRYW
Sbjct: 606 DSGAVKWTVTTKPPKKQPLTWYKVVIDPPSGSEPVGLDMMSMGKGMAWLNGEEIGRYWPR 665
Query: 701 VA----PKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG 756
+A P C CDYRG + DKC T CG P+Q WYHVPRSW ++S N LVIFEE GG
Sbjct: 666 IARKSTPNDECVKECDYRGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGNELVIFEEKGG 725
Query: 757 NPFEISVKLRSTRIV 771
+P +I++ R +V
Sbjct: 726 DPMKITLSKRKVSVV 740
>gi|449529435|ref|XP_004171705.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 826
Score = 725 bits (1871), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/813 (45%), Positives = 497/813 (61%), Gaps = 60/813 (7%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NVSYD AIII+G RR++ S IHYPR+T EMWPDLI K+K+GG D IETY+FW+ HE
Sbjct: 26 NVSYDSNAIIINGERRIIFSGSIHYPRSTEEMWPDLIQKAKDGGLDAIETYIFWDRHEPH 85
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
R +Y+F G + +K+ +L+ +GLY+ +RIGPYVCAEWN+GGFP+WL ++PGI+ RTNN
Sbjct: 86 RRKYDFSGHLNFIKYFQLIQEAGLYVVMRIGPYVCAEWNYGGFPLWLHNMPGIQLRTNNQ 145
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
+K EMQ F KIV++ ++ LF+ QGGPII+ QIENEYGN+ + YG+ GK Y+ W A M
Sbjct: 146 VYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGEAGKTYINWCAQM 205
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A L G+PW+MC+Q+DAP+ II+ CNG+YCD + PN+ N P ++TENW GW+ WG +
Sbjct: 206 AESLNIGIPWIMCQQSDAPQPIINTCNGFYCDNFTPNNPNSPKMFTENWVGWFKKWGDKD 265
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHR ED+AF+VARFFQ GG NYYMY GGTNFGRTSGGPF TSYDYDAP+DEYG L+
Sbjct: 266 PHRTAEDVAFSVARFFQSGGILNNYYMYHGGTNFGRTSGGPFITTSYDYDAPLDEYGNLN 325
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLK LHA+IKL E L +S + + + +N + C FL+N D
Sbjct: 326 QPKWGHLKQLHASIKLGEKILT--NSTRSDQDFGSSVTFTKFSNLETGEKFC--FLSNAD 381
Query: 406 EHTAASVTFLG-QSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
E+ A V LG + Y LP WSVSIL C +FNTAKVSSQTS+
Sbjct: 382 ENNDAIVDMLGDRKYFLPAWSVSILDGCNKEIFNTAKVSSQTSL------------FFKK 429
Query: 465 QQSMIESKLSSTSKSWMTVKEPI--GVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
Q +KL SW EP+ + F +LE T D SDYLW++T +
Sbjct: 430 QNEKENAKL-----SWNWASEPMRDTLQGYGTFKANLLLEQKGATIDSSDYLWYMTNVNS 484
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGS---VIGHWVKVVQPVEFQSGYND 579
+ ++ T+ +++ VL FIN + GS G +P++ + G N
Sbjct: 485 NT------TSSLQNLTLQVNTKGHVLHAFINRRYIGSQWGSNGQSFVFEKPIQLKLGTNT 538
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFR-GQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIY 638
+ LLS TVGL+NY AF + G G + L G N DLS LW+Y+VGL GE +Q+Y
Sbjct: 539 ITLLSATVGLKNYDAFYDTVPTGIDGGPIYLIGDGNVTTDLSSNLWSYKVGLNGERKQLY 598
Query: 639 S-IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
+ + N +W+ L + I TW+K F P G DPV LD+ MGKGQAWVNG IGR+
Sbjct: 599 NPMFSNRTKWSTLNKKSIGRRMTWFKATFKTPSGTDPVVLDMQGMGKGQAWVNGRSIGRF 658
Query: 698 W-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG 756
W + +A C +TCDY+G+YN +KC NCGN +Q WYH+PRS++ S N L++FEE GG
Sbjct: 659 WPSFIASNDSCSETCDYKGSYNPNKCVRNCGNSSQRWYHIPRSFMNDSINTLILFEEIGG 718
Query: 757 NPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIIS 816
NP +SV+ + +C +E + L CQ G++IS
Sbjct: 719 NPQMVSVQTITIGTICGNANE------------------------GSTLELSCQGGHVIS 754
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
I+FASYG P+G+C F G S +++ E
Sbjct: 755 EIQFASYGHPEGKCGSFQSGLWDVTKSTTIIVE 787
>gi|449485873|ref|XP_004157296.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 7-like [Cucumis
sativus]
Length = 813
Score = 723 bits (1866), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/819 (45%), Positives = 502/819 (61%), Gaps = 61/819 (7%)
Query: 41 FFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWN 100
F K NVSYD AIII+G RR+++S +HYPR+T MWPDLI K+K+GG D IETY+FW+
Sbjct: 6 FCKGDNVSYDSNAIIINGERRVILSGSMHYPRSTEAMWPDLIQKAKDGGLDAIETYIFWD 65
Query: 101 AHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEF 160
HE R +Y+F G+ D +KF +LV +GLY+ +RIGPYVCAEWN+GGFP+WL ++PGI+F
Sbjct: 66 RHEPQRRKYDFTGRLDFIKFFQLVQDAGLYVVMRIGPYVCAEWNYGGFPLWLHNLPGIQF 125
Query: 161 RTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVK 220
RT+N +K EMQ F KIV++ ++ LF+ QGGPII+ QIENEYGN+ + YG GK Y+
Sbjct: 126 RTDNQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGNAGKSYIN 185
Query: 221 WAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCD-GYKPNSYNKPTLWTENWDGWYT 279
W A MA L G+PW+MC+Q+DAP+ II+ CNG+YCD + PN+ P ++TENW GW+
Sbjct: 186 WCAQMAESLNIGIPWIMCQQSDAPQPIINTCNGFYCDYDFSPNNPKSPKMFTENWVGWFK 245
Query: 280 TWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPID 339
WG + P+R ED+AFAVARFFQ GG F NYYMY GGTNFGRT+GGPF TSYDY+AP+D
Sbjct: 246 KWGDKDPYRSPEDVAFAVARFFQSGGVFNNYYMYHGGTNFGRTAGGPFITTSYDYNAPLD 305
Query: 340 EYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSA 399
EYG L++PKWGHLK LHA+IK+ E L + + KL + +N + C
Sbjct: 306 EYGNLNQPKWGHLKQLHASIKMGEKILTNSTRSDQ-KLXSFVTLTKF-SNPTSGERFC-- 361
Query: 400 FLANIDEHTAASVTFLGQ-SYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLS 458
FL+N D A++ Y +P WSVSIL C VFNTAK++SQTS+
Sbjct: 362 FLSNTDNKNDATIDLQADGKYFVPAWSVSILDGCNKEVFNTAKINSQTSMFV-------- 413
Query: 459 PNISVPQQSMIESKLSSTSKSWMTVKEPI--GVWSENNFTVQGILEHLNVTKDYSDYLWH 516
+++K + SW+ EP+ + + F +LE T D+SDYLW+
Sbjct: 414 ---------KVQNKKENAQFSWVWAPEPMRDTLQGKGTFKANLLLEQKGTTVDFSDYLWY 464
Query: 517 ITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGS---VIGHWVKVVQPVEF 573
+T I D + + ++ T+ +++ +L F+N + GS G +P+
Sbjct: 465 MTNI---DSNAT---SSLQNVTLQVNTKGHMLHAFVNRRYIGSQWRSNGQSFVFXKPILI 518
Query: 574 QSGYNDLILLSQTVGLQNYGAFLEKDGAGFR-GQVKLTGFKNGDIDLSKILWTYQVGLKG 632
+ G N + LLS TVGL+NY AF + G G + L G N IDLS LW+Y+VGL G
Sbjct: 519 KPGTNTITLLSATVGLKNYDAFYDTVPTGIDGGPIYLIGDGNVKIDLSSNLWSYKVGLNG 578
Query: 633 EFQQIYS-IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNG 691
E +Q+Y+ + W+ + + I T YKT F P GIDPV LD+ MGKGQAWVNG
Sbjct: 579 EMKQLYNPVFSQRTNWSTINQKSIGRRMTLYKTNFKTPSGIDPVTLDMQGMGKGQAWVNG 638
Query: 692 HHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVI 750
IGR+W + +A C TCDYRGAYN KC NCGNP+Q WYH+PRS+L N LV+
Sbjct: 639 QSIGRFWPSFIAGNDSCSTTCDYRGAYNPSKCVENCGNPSQRWYHIPRSFLSDDTNTLVL 698
Query: 751 FEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQ 810
FEE GGNP ++SV+ + +C +E + L CQ
Sbjct: 699 FEEIGGNPQQVSVQTITIGTICGNANE------------------------GSTLELSCQ 734
Query: 811 DGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
G+IIS I+FASYG P+G+C F +G+ H S +V +
Sbjct: 735 GGHIISEIQFASYGNPEGKCGSFKQGSWHVINSAILVEK 773
>gi|255543793|ref|XP_002512959.1| beta-galactosidase, putative [Ricinus communis]
gi|223547970|gb|EEF49462.1| beta-galactosidase, putative [Ricinus communis]
Length = 732
Score = 722 bits (1863), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/733 (50%), Positives = 483/733 (65%), Gaps = 44/733 (6%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YD +A+II+G +R+L S IHYPR+TP+MW LI K+K+GG DVI+TYVFWN HE
Sbjct: 27 NVTYDKKALIINGQKRILFSGSIHYPRSTPQMWEGLIQKAKDGGLDVIDTYVFWNLHEPS 86
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G YNF+G+ND+V+F+KLV +GLY+ LRIGPY+C EWNFGGFPVWL+ IPG+ FRT+N
Sbjct: 87 PGNYNFEGRNDLVQFIKLVHKAGLYVHLRIGPYICGEWNFGGFPVWLKYIPGMIFRTDNE 146
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK +MQ+F +KIV +M++E L+ QGGPII+ QIENEY + ++G G Y+ WAA M
Sbjct: 147 PFKLQMQKFTQKIVQMMKDEQLYESQGGPIILSQIENEYEPEDKAFGAAGHAYMTWAAHM 206
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ L GVPWVMCK+ DAP+ +++ CNG+YCD + PN KPT+WTE W GW+T +GG +
Sbjct: 207 AVSLNTGVPWVMCKEFDAPDPVVNTCNGFYCDYFSPNKAYKPTMWTEAWTGWFTDFGGPI 266
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
RPVEDLAFAVARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGL+
Sbjct: 267 HQRPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIR 326
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PK+GHLKDLH AIKLCE AL+++D LG ++AHV+ +N +C+AFLAN +
Sbjct: 327 QPKYGHLKDLHKAIKLCERALLSSDPV-VTTLGSYEQAHVFSSN----SGDCAAFLANYN 381
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
A VTF Y LPPWSVSILPDC+N VFNTA+V Q S + + P
Sbjct: 382 PKATAKVTFNNMHYNLPPWSVSILPDCKNVVFNTAEVGVQPS------KIQMLPT----- 430
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNF-TVQGILEHLNVTKDYSDYLWHITQIYVSD 524
E++ SW + E I ++ TV G+LE +NVT+D SDYLW+ T +++S
Sbjct: 431 ----EARF----LSWEALSEDISSVDDDKIGTVAGLLEQINVTRDASDYLWYTTGVHISS 482
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEF-------QSGY 577
+ +F + P + + S + VF+NGQL+GSV G + + + F +G
Sbjct: 483 SE-TFLDGGQ-PPILKVISAGHGIHVFVNGQLSGSVYG--TRGNRRISFSGELKQLHAGR 538
Query: 578 NDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI 637
N + LLS VGL N G E G G V + G G DL+ W+Y+VGLKGE +
Sbjct: 539 NRISLLSVAVGLPNNGPRFETWNTGVLGPVVIHGLDQGHRDLTWQKWSYKVGLKGEDLNL 598
Query: 638 YSIEE----NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHH 693
S N + + + + P TW++ +FDAP G DP+ALD+ SM KGQ W+NG+
Sbjct: 599 GSPNSIPSINWMQESAMVAERQP--LTWHRAFFDAPRGDDPLALDMSSMVKGQVWINGNS 656
Query: 694 IGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEE 753
IGRYWTV A G C C Y G + C CG PTQ WYH+PRS L+ + NLLV+FEE
Sbjct: 657 IGRYWTVYA-DGNCT-ACSYSGTFRPSTCQFGCGQPTQKWYHIPRSLLKPTENLLVVFEE 714
Query: 754 TGGNPFEISVKLR 766
GG+ +I + R
Sbjct: 715 IGGDVSKIYLVKR 727
>gi|449436000|ref|XP_004135782.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 838
Score = 721 bits (1861), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/850 (44%), Positives = 511/850 (60%), Gaps = 69/850 (8%)
Query: 15 ALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRAT 74
++ V M+ + S V + + F K NVSYD AIII+G RR+++S +HYPR+T
Sbjct: 5 SIIVISKMLNHQWLVFSLVVTLACFYFCKGDNVSYDSNAIIINGERRVILSGSMHYPRST 64
Query: 75 PEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLR 134
MWPDLI K+K+GG D IETY+FW+ HE R +Y+F G+ D +KF +LV +GLY+ +R
Sbjct: 65 EAMWPDLIQKAKDGGLDAIETYIFWDRHEPQRRKYDFTGRLDFIKFFQLVQDAGLYVVMR 124
Query: 135 IGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGP 194
IGPYVCAEWN+GGFP+WL ++PGI+FRT+N +K EMQ F KIV++ ++ LF+ QGGP
Sbjct: 125 IGPYVCAEWNYGGFPLWLHNLPGIQFRTDNQVYKNEMQTFTTKIVNMCKQANLFASQGGP 184
Query: 195 IIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGY 254
II+ QIENEYGN+ + YG GK Y+ W A MA L G+PW+MC+Q DAP+ II+ CNG+
Sbjct: 185 IILAQIENEYGNVMTPYGNAGKSYINWCAQMAESLNIGIPWIMCQQNDAPQPIINTCNGF 244
Query: 255 YCD-GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMY 313
YCD + PN+ P ++TENW GW+ WG + P+R ED+AFAVARFFQ GG F NYYMY
Sbjct: 245 YCDYDFSPNNPKSPKMFTENWVGWFKKWGDKDPYRSPEDVAFAVARFFQSGGVFNNYYMY 304
Query: 314 FGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALV-AADSA 372
GGTNFGRT+GGPF TSYDY+AP+DEYG L++PKWGHLK LHA+IK+ E L + S
Sbjct: 305 HGGTNFGRTAGGPFITTSYDYNAPLDEYGNLNQPKWGHLKQLHASIKMGEKILTNSTRSD 364
Query: 373 QYIK--LGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQS---YTLPPWSVS 427
Q I + + ++ R+ FL+N D A++ +P WSVS
Sbjct: 365 QKISSFVTLTKFSNPTSGERF-------CFLSNTDNKNDATIDLQADGKYFVPVPAWSVS 417
Query: 428 ILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPI 487
IL C VFNTAK++SQTS+ +++K + SW+ EP+
Sbjct: 418 ILDGCNKEVFNTAKINSQTSMFV-----------------KVQNKKENAQFSWVWAPEPM 460
Query: 488 --GVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMR 545
+ + F +LE T D+SDYLW++T I D + + ++ T+ +++
Sbjct: 461 RDTLQGKGTFKANLLLEQKGTTVDFSDYLWYMTNI---DSNAT---SSLQNVTLQVNTKG 514
Query: 546 DVLRVFINGQLTGS---VIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAG 602
+L F+N + GS G +P+ + G N + LLS TVGL+NY AF + G
Sbjct: 515 HMLHAFVNRRYIGSQWRSNGQSFVFEKPILIKPGTNTITLLSATVGLKNYDAFYDTVPTG 574
Query: 603 FR-GQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS-IEENEAEWTDLTRDGIPSTFT 660
G + L G N IDLS LW+Y+VGL GE +Q+Y+ + W+ + + I T
Sbjct: 575 IDGGPIYLIGDGNVKIDLSSNLWSYKVGLNGEMKQLYNPVFSQRTNWSTINQKSIGRRMT 634
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNS 719
WYKT F P GID V LD+ MGKGQAWVNG IGR+W + +A C TCDYRGAYN
Sbjct: 635 WYKTSFKTPSGIDRVTLDMQGMGKGQAWVNGQSIGRFWPSFIASNDSCSTTCDYRGAYNP 694
Query: 720 DKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESH 779
KC NCGNP+Q WYH+PRS+L N LV+FEE GGNP ++SV+ + +C +E
Sbjct: 695 SKCVENCGNPSQRWYHIPRSFLSDDTNTLVLFEEIGGNPQQVSVQTITIGTICGNANE-- 752
Query: 780 YPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCH 839
+ L CQ G+IIS I+FASYG P+G+C F +G+ H
Sbjct: 753 ----------------------GSTLELSCQGGHIISEIQFASYGNPEGKCGSFKQGSWH 790
Query: 840 APMSLSVVSE 849
S +V +
Sbjct: 791 VINSAILVEK 800
>gi|357484129|ref|XP_003612351.1| Beta-galactosidase [Medicago truncatula]
gi|355513686|gb|AES95309.1| Beta-galactosidase [Medicago truncatula]
Length = 806
Score = 719 bits (1857), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/812 (45%), Positives = 493/812 (60%), Gaps = 62/812 (7%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD A+II+G RR++ S IHYPR+T EMWPDLI K+K+GG D IETY+FW+ HE +R
Sbjct: 10 VTYDSNALIINGERRLIFSGAIHYPRSTVEMWPDLIQKAKDGGLDAIETYIFWDRHEPVR 69
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
+YNF G D VKF +L+ +GLY +RIGPY CAEWNFGGFP WL ++PGIE RTNN+
Sbjct: 70 REYNFSGNLDFVKFFQLIQKAGLYAIMRIGPYACAEWNFGGFPSWLHNMPGIELRTNNSV 129
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
+K EMQ F +IV++++E LF+ QGGPII+ QIENEYG++ +Y GK YV+WAA MA
Sbjct: 130 YKNEMQNFTTEIVNVVKEAKLFASQGGPIILAQIENEYGDIMWNYKDAGKAYVQWAAQMA 189
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
L GVPW+MC+Q DAP+ II+ CNGYYC ++PN+ P ++TENW GW+ WG R+P
Sbjct: 190 LAQNIGVPWIMCQQQDAPQPIINTCNGYYCHNFQPNNPKSPKIFTENWIGWFQKWGERVP 249
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
HR ED AF+VARFFQ GG NYYMY GGTNFGRT+GGP+ TSYDYDAPIDEYG L++
Sbjct: 250 HRSAEDSAFSVARFFQNGGVLNNYYMYHGGTNFGRTAGGPYITTSYDYDAPIDEYGNLNQ 309
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHLK+LHAAIKL E L + + LG Y N G++ FL+N +
Sbjct: 310 PKWGHLKNLHAAIKLGENVLTNYSARKDEDLGNGLTLTTY-TNSSGAR---FCFLSNNNN 365
Query: 407 HTAASVTFLGQS--YTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
+ L Y +P WSVSI+ C VFNTAKV+SQTS+ +
Sbjct: 366 TDLGARVDLKNDGVYIVPAWSVSIINGCNQEVFNTAKVNSQTSMMVKK------------ 413
Query: 465 QQSMIESKLSSTSKSWMTVKEPI--GVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
+SST+ +W EP + + Q +LE +T D SDYLW++T +
Sbjct: 414 -----SDNVSSTNLTWEWKVEPKRDTIHGNGSLKAQKLLEQKELTLDASDYLWYMTSADI 468
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG---SVIGHWVKVVQPVEFQSGYND 579
+D S W +R +++ L ++N + G S G+ + V ++G N
Sbjct: 469 --NDTSIWSNATLR----VNTSGHSLHGYVNQRYVGYQFSQYGNQFTYEKQVSLKNGTNI 522
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFR-GQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIY 638
+ LLS TVGL NYGA+ + G G V+L G N +DLS LW+Y++GL GE + +Y
Sbjct: 523 ITLLSATVGLANYGAWFDDKKTGISGGPVELIGKNNVTMDLSTNLWSYKIGLNGERRHLY 582
Query: 639 SIEEN-EAEW-TDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGR 696
++N W T+ + I WY+ F +P G +P+ +DL +GKG AWVNGH IGR
Sbjct: 583 DAQQNVSVAWHTNSSYIPIGKPLIWYRAKFKSPFGTNPIVVDLQGLGKGHAWVNGHSIGR 642
Query: 697 YWTV-VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
YW+ ++P GC DTCDYRG Y KC TNCG+P+Q WYHVPRS+L N LV+FEE G
Sbjct: 643 YWSSWISPSDGCSDTCDYRGNYVPVKCNTNCGSPSQRWYHVPRSFLNHDMNTLVLFEEIG 702
Query: 756 GNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYII 815
GNP + + +T +C V E + L CQ G ++
Sbjct: 703 GNPQSVQFQTVTTGTICANVYE------------------------GAQFELSCQSGQVM 738
Query: 816 SSIEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
S I+FASYG P+G+C F +GN A S SVV
Sbjct: 739 SQIQFASYGNPEGQCGSFKKGNFDAANSQSVV 770
>gi|356558952|ref|XP_003547766.1| PREDICTED: beta-galactosidase 7-like [Glycine max]
Length = 826
Score = 719 bits (1857), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/812 (46%), Positives = 498/812 (61%), Gaps = 65/812 (8%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD R++II+G RR++ S +HYPR+T +MWPD+I K+K+GG D IE+YVFW+ HE +R
Sbjct: 28 VTYDARSLIINGERRVIFSGAVHYPRSTVQMWPDIIQKAKDGGLDAIESYVFWDRHEPVR 87
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
+Y+F G D +KF +++ +GLY LRIGPYVCAEWNFGGFP+WL ++PGIE RT+N
Sbjct: 88 REYDFSGNLDFIKFFQIIQEAGLYAILRIGPYVCAEWNFGGFPLWLHNMPGIELRTDNPI 147
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
+K EMQ F KIV++ +E LF+ QGGPII+ QIENEYGN+ + YG+ GK Y+KW A MA
Sbjct: 148 YKNEMQIFTTKIVNMAKEAKLFASQGGPIILAQIENEYGNIMTDYGEAGKTYIKWCAQMA 207
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
L GVPW+MC+Q DAP+ +I+ CNG+YCD ++PN+ P ++TENW GW+ WG R+P
Sbjct: 208 LAQNIGVPWIMCQQHDAPQPMINTCNGHYCDSFQPNNPKSPKMFTENWIGWFQKWGERVP 267
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
HR ED AF+VARFFQ GG NYYMY GGTNFGRT+GGP+ TSY+YDAP+DEYG L++
Sbjct: 268 HRSAEDSAFSVARFFQNGGILNNYYMYHGGTNFGRTAGGPYMTTSYEYDAPLDEYGNLNQ 327
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHLK LHAAIKL E + + + K N+ + G + FL+N ++
Sbjct: 328 PKWGHLKQLHAAIKLGEK--IITNGTRTDKDFGNEVTLTTYTHTNGER---FCFLSNTND 382
Query: 407 HTAASVTFLGQ-SYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
A+V +Y LP WSV+IL C VFNTAKV+SQTSI
Sbjct: 383 SKDANVDLQQDGNYFLPAWSVTILDGCNKEVFNTAKVNSQTSI----------------- 425
Query: 466 QSMIESKLSSTSK---SWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
M++ +++K +W+ K+ + + NF V +LE +T D SDYLW++T + +
Sbjct: 426 --MVKKSDDASNKLTWAWIPEKKKDTMHGKGNFKVNQLLEQKELTFDVSDYLWYMTSVDI 483
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW---VKVVQPVEFQSGYND 579
+D S W +R +++ LR ++NG+ G W + V + G N
Sbjct: 484 --NDTSIWSNATLR----VNTRGHTLRAYVNGRHVGYKFSQWGGNFTYEKYVSLKKGLNV 537
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFR-GQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIY 638
+ LLS TVGL NYGA +K G G V+L G N IDLS LW+Y++GL GE +++Y
Sbjct: 538 ITLLSATVGLPNYGAKFDKIKTGIAGGPVQLIGNNNETIDLSTNLWSYKIGLNGEKKRLY 597
Query: 639 SIEEN-EAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
+ W + I + TWYK F AP G DPV +DL +GKG+AWVNG IGRY
Sbjct: 598 DPQPRIGVSWRTNSPYPIGRSLTWYKADFVAPSGNDPVVVDLLGLGKGEAWVNGQSIGRY 657
Query: 698 WTV-VAPKGGCQDTCDYRGAY-NSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
WT + GC DTCDYRG Y + KC TNCGNP+Q WYHVPRS+L+ N LV+FEE G
Sbjct: 658 WTSWITATNGCSDTCDYRGKYVPAQKCNTNCGNPSQRWYHVPRSFLKNDKNTLVLFEEIG 717
Query: 756 GNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYII 815
GNP +S + T +C QV E G L + L CQ G I
Sbjct: 718 GNPQNVSFQTVITGTICAQVQE----------------GAL--------LELSCQGGKTI 753
Query: 816 SSIEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
S I+F+S+G P G C F +G A SVV
Sbjct: 754 SQIQFSSFGNPTGNCGSFKKGTWEATDGQSVV 785
>gi|255550411|ref|XP_002516256.1| beta-galactosidase, putative [Ricinus communis]
gi|223544742|gb|EEF46258.1| beta-galactosidase, putative [Ricinus communis]
Length = 848
Score = 717 bits (1852), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/814 (45%), Positives = 497/814 (61%), Gaps = 64/814 (7%)
Query: 40 TFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFW 99
TF VS+D RAI IDG RR+LIS IHYPR+T EMWPDLI KSKEGG D IETYVFW
Sbjct: 40 TFVSATIVSHDGRAITIDGKRRVLISGSIHYPRSTAEMWPDLIKKSKEGGLDAIETYVFW 99
Query: 100 NAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIE 159
N+HE R QY+F G D+V+F+K + + GLY LRIGPYVCAEWN+GGFP+WL ++PG E
Sbjct: 100 NSHEPSRRQYDFSGNLDLVRFIKTIQAEGLYAVLRIGPYVCAEWNYGGFPMWLHNLPGCE 159
Query: 160 FRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYV 219
RT N+ F EMQ F IVD+M++E LF+ QGGPII+ Q+ENEYGN+ S+YG GK Y+
Sbjct: 160 LRTANSVFMNEMQNFTSLIVDMMKDENLFASQGGPIILAQVENEYGNVMSAYGAAGKTYI 219
Query: 220 KWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYT 279
W ++MA L GVPW+MC+Q+DAP+ +I+ CNG+YCD + PN+ N P +WTENW GW+
Sbjct: 220 DWCSNMAESLDIGVPWIMCQQSDAPQPMINTCNGWYCDQFTPNNANSPKMWTENWTGWFK 279
Query: 280 TWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPID 339
+WGG+ PHR ED+AFAVARFFQ GG+F NYYMY GGTNFGRT+GGP+ TSYDYDAP+D
Sbjct: 280 SWGGKDPHRTAEDVAFAVARFFQTGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLD 339
Query: 340 EYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSA 399
EYG L++PKWGHLK LH + E L + + I + A +Y ++ +S C
Sbjct: 340 EYGNLNQPKWGHLKQLHDILHSMEYTLTHGNIST-IDYDNSVTATIYATDK---ESAC-- 393
Query: 400 FLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSP 459
F N +E + A++ F G Y +P WSVSILPDC N +NTAKV +QT+I
Sbjct: 394 FFGNANETSDATIVFKGTEYNVPAWSVSILPDCENVGYNTAKVKTQTAIM---------- 443
Query: 460 NISVPQQSMIESKLSSTSKSWMTVK-EPIGVWSENNFTVQGILEHLNVTKDYSDYLWHIT 518
V Q++ E + SS SW+ + + + + +++ D SDYLW++T
Sbjct: 444 ---VKQKNEAEDQPSSLKWSWIPENTHTTSLLGKGHAHARQLIDQKAAANDASDYLWYMT 500
Query: 519 QIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQ 574
+++ DD W ++ ++ ++ VL ++NG+ GS + + ++ +
Sbjct: 501 SLHIKKDD-PVWSSDM---SLRVNGSGHVLHAYVNGKHLGSQFAKYGVFSYVFEKSLKLR 556
Query: 575 SGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGD---IDLSKILWTYQVGLK 631
G N + LLS TVGLQNYG + G G V++ G + + DLS W+Y VGL
Sbjct: 557 PGKNVISLLSATVGLQNYGPMFDLVQTGIPGPVEIIGHRGDEKVVKDLSSHKWSYSVGLN 616
Query: 632 GEFQQIYSIEENEA-EWTDLTRDGIPST--FTWYKTYFDAPDGIDPVALDLGSMGKGQAW 688
G ++YS A W + +P+ WYKT F AP G DPV LDL MGKG AW
Sbjct: 617 GFHNELYSSNSRHASRWVE---QDLPTNKMMIWYKTTFKAPLGKDPVVLDLQGMGKGFAW 673
Query: 689 VNGHHIGRYW-TVVAPKGGCQ-DTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNN 746
VNG++IGRYW + +A + GC + CDYRGAY+++KC TNCG PTQ WYHVPRS+ N
Sbjct: 674 VNGNNIGRYWPSFLAEEDGCSTEVCDYRGAYDNNKCVTNCGKPTQRWYHVPRSFFNDYEN 733
Query: 747 LLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMH 806
LV+FEE GGNP ++ + + V E +
Sbjct: 734 TLVLFEEFGGNPAGVNFQTVTVGKVSGSAGEGE------------------------TIE 769
Query: 807 LHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHA 840
L C +G IS+IEFAS+G PQG + +G C
Sbjct: 770 LSC-NGKSISAIEFASFGDPQGTSGAYVKGTCEG 802
>gi|79517234|ref|NP_568399.4| beta-galactosidase 7 [Arabidopsis thaliana]
gi|152013363|sp|Q9SCV5.2|BGAL7_ARATH RecName: Full=Beta-galactosidase 7; Short=Lactase 7; Flags:
Precursor
gi|332005497|gb|AED92880.1| beta-galactosidase 7 [Arabidopsis thaliana]
Length = 826
Score = 717 bits (1851), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/840 (43%), Positives = 513/840 (61%), Gaps = 73/840 (8%)
Query: 14 LALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRA 73
+ + + ++ + I ++ +S + ++ VS+D RAI I+G RR+L+S IHYPR+
Sbjct: 1 MKMKHFTRLLSLFFILITSLSLAKSTI------VSHDERAITINGKRRILLSGSIHYPRS 54
Query: 74 TPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQL 133
T +MWPDLI K+K+GG D IETYVFWNAHE R +Y+F G D+V+F+K + +GLY L
Sbjct: 55 TADMWPDLINKAKDGGLDAIETYVFWNAHEPKRREYDFSGNLDVVRFIKTIQDAGLYSVL 114
Query: 134 RIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGG 193
RIGPYVCAEWN+GGFPVWL ++P ++FRT N F EMQ F KIV +M+EE LF+ QGG
Sbjct: 115 RIGPYVCAEWNYGGFPVWLHNMPNMKFRTVNPSFMNEMQNFTTKIVKMMKEEKLFASQGG 174
Query: 194 PIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNG 253
PII+ QIENEYGN+ SSYG +GK Y+ W A+MA L GVPW+MC+Q +AP+ +++ CNG
Sbjct: 175 PIILAQIENEYGNVISSYGAEGKAYIDWCANMANSLDIGVPWLMCQQPNAPQPMLETCNG 234
Query: 254 YYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMY 313
+YCD Y+P + + P +WTENW GW+ WGG+ P+R EDLAF+VARFFQ GG+F NYYMY
Sbjct: 235 FYCDQYEPTNPSTPKMWTENWTGWFKNWGGKHPYRTAEDLAFSVARFFQTGGTFQNYYMY 294
Query: 314 FGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQ 373
GGTNFGR +GGP+ TSYDY AP+DE+G L++PKWGHLK LH +K E +L + ++
Sbjct: 295 HGGTNFGRVAGGPYITTSYDYHAPLDEFGNLNQPKWGHLKQLHTVLKSMEKSLTYGNISR 354
Query: 374 YIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCR 433
I LG + +A +Y ++ S F+ N++ A V F G+ Y +P WSVS+LPDC
Sbjct: 355 -IDLGNSIKATIYT-----TKEGSSCFIGNVNATADALVNFKGKDYHVPAWSVSVLPDCD 408
Query: 434 NTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSEN 493
+NTAKV++QTSI T + S P +E S M +K
Sbjct: 409 KEAYNTAKVNTQTSIMTEDSSKP----------ERLEWTWRPESAQKMILK------GSG 452
Query: 494 NFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFIN 553
+ +G+++ +VT D SDYLW++T++++ D W N T+ + S VL ++N
Sbjct: 453 DLIAKGLVDQKDVTNDASDYLWYMTRLHLDKKD-PLWSRNM---TLRVHSNAHVLHAYVN 508
Query: 554 GQLTGSVIGHWVKVVQPVEFQ-----SGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVK 608
G+ G+ K E + G N + LLS +VGLQNYG F E G G V
Sbjct: 509 GKYVGNQFVKDGKFDYRFERKVNHLVHGTNHISLLSVSVGLQNYGPFFESGPTGINGPVS 568
Query: 609 LTGFKNGDI---DLSKILWTYQVGLKGEFQQIYSIEE-NEAEWTDLTRDGIPS--TFTWY 662
L G+K + DLS+ W Y++GL G +++SI+ +W + + +P+ TWY
Sbjct: 569 LVGYKGEETIEKDLSQHQWDYKIGLNGYNDKLFSIKSVGHQKWAN---EKLPTGRMLTWY 625
Query: 663 KTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDK 721
K F AP G +PV +DL +GKG+AW+NG IGRYW + + GC+D CDYRGAY SDK
Sbjct: 626 KAKFKAPLGKEPVIVDLNGLGKGEAWINGQSIGRYWPSFNSSDDGCKDECDYRGAYGSDK 685
Query: 722 CTTNCGNPTQTWYHVPRSWLQAS-NNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHY 780
C CG PTQ WYHVPRS+L AS +N + +FEE GGNP ++ K VC + E +
Sbjct: 686 CAFMCGKPTQRWYHVPRSFLNASGHNTITLFEEMGGNPSMVNFKTVVVGTVCARAHEHN- 744
Query: 781 PPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHA 840
++ L C + IS+++FAS+G P G C F+ G C
Sbjct: 745 -----------------------KVELSCHN-RPISAVKFASFGNPLGHCGSFAVGTCQG 780
>gi|356522906|ref|XP_003530083.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 846
Score = 717 bits (1851), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/848 (44%), Positives = 507/848 (59%), Gaps = 80/848 (9%)
Query: 22 MMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDL 81
+ M ++ LS +S + VSYD RA+ IDG RR+L S IHYPR+TPEMWP L
Sbjct: 8 LSAMFLLCLSLISIA-----INALEVSYDERALTIDGKRRILFSGSIHYPRSTPEMWPYL 62
Query: 82 IAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCA 141
I K+KEGG DVIETYVFWNAHE R QY+F D+V+F++ + GLY +RIGPY+ +
Sbjct: 63 IRKAKEGGLDVIETYVFWNAHEPQRRQYDFSENLDLVRFIRTIQKEGLYAMIRIGPYISS 122
Query: 142 EWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIE 201
EWN+GG PVWL +IP +EFRT+N F EEM+ F +KIVD+M++E LF+ QGGPII+ QIE
Sbjct: 123 EWNYGGLPVWLHNIPNMEFRTHNRAFMEEMKTFTRKIVDMMQDETLFAVQGGPIIIAQIE 182
Query: 202 NEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKP 261
NEYGN+ +YG G Y+KW A +A GVPWVM +Q++AP+ +ID+C+GYYCD ++P
Sbjct: 183 NEYGNVMHAYGNNGTQYLKWCAQLADSFETGVPWVMSQQSNAPQFMIDSCDGYYCDQFQP 242
Query: 262 NSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGR 321
N +KP +WTENW G Y WG + PHRP ED+A+AVARFFQ GG+F NYYMY GGTNF R
Sbjct: 243 NDNHKPKIWTENWTGGYKNWGTQNPHRPAEDVAYAVARFFQFGGTFQNYYMYHGGTNFKR 302
Query: 322 TSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQ 381
T+GGP+ TSYDYDAP+DEYG L++PKWGHL+ LH +K E L S+Q+ G
Sbjct: 303 TAGGPYVTTSYDYDAPLDEYGNLNQPKWGHLRQLHNLLKSKENILTQG-SSQHTDYGNMV 361
Query: 382 EAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAK 441
A VY Y +S C F+ N + A++ F YT+P WSVSILP+C + +NTAK
Sbjct: 362 TATVY---TYDGKSTC--FIGNAHQSKDATINFRNNEYTIPAWSVSILPNCSSEAYNTAK 416
Query: 442 VSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSEN------NF 495
V++QT+I + + L + W +EP + +
Sbjct: 417 VNTQTTIMVKKDNEDLEYAL-----------------RWQWRQEPFVQMKDGQITGIIDL 459
Query: 496 TVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQ 555
T +L+ VT D+SDYLW+IT I + DD W T E R + + + VL VF+NG+
Sbjct: 460 TAPKLLDQKVVTNDFSDYLWYITSIDIKGDDDPSW-TKEFR--LRVHTSGHVLHVFVNGK 516
Query: 556 LTGS--VIGHWVKVVQ--PVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTG 611
G+ K V ++ +G N++ LLS TVGL NYG F + G G V+L
Sbjct: 517 HVGTQHAKNGQFKFVHESKIKLTTGKNEISLLSTTVGLPNYGPFFDNIEVGVLGPVQLVA 576
Query: 612 ------FKNGDI--DLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPS--TFTW 661
+ + +I DLSK W+Y+VGL GE + YS E + W D +P+ W
Sbjct: 577 AVGDYDYDDDEIVKDLSKNQWSYKVGLHGEHEMHYSYENSLKTW---YTDAVPTDRILVW 633
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSD 720
YKT F +P G DPV +DL +GKG AWVNG+ IGRYW + +A + GC CDYRG Y S+
Sbjct: 634 YKTTFKSPIGDDPVVVDLSGLGKGHAWVNGNSIGRYWSSYLADENGCSPKCDYRGPYTSN 693
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASN-NLLVIFEETGGNPFEISVKLRSTRIVCEQVSESH 779
KC + C P+Q WYHVPRS+L+ + N LV+FEE GG P+ ++ + VC E +
Sbjct: 694 KCLSMCAQPSQRWYHVPRSFLRDDDQNTLVLFEELGGQPYYVNFLTVTVGKVCANAYEGN 753
Query: 780 YPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCH 839
+ L C +IS I+FAS+G P+G C F +GNC
Sbjct: 754 ------------------------TLELACNKNQVISEIKFASFGLPKGECGSFQKGNCE 789
Query: 840 APMSLSVV 847
+ +LS +
Sbjct: 790 SSEALSAI 797
>gi|356522904|ref|XP_003530082.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 923
Score = 716 bits (1849), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/840 (45%), Positives = 503/840 (59%), Gaps = 77/840 (9%)
Query: 30 LSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGG 89
L C+S S + VSYD RA+ IDG RR+L SA IHYPR+TPEMWP LI K+KEGG
Sbjct: 13 LLCLSLISIA--INALEVSYDERALTIDGKRRILFSASIHYPRSTPEMWPYLIRKAKEGG 70
Query: 90 ADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFP 149
DVIETYVFWNAHE R QY F D+V+F++ + GLY +RIGPY+ +EWN+GG P
Sbjct: 71 LDVIETYVFWNAHEPQRRQYEFSENLDLVRFIRTIQKEGLYAMIRIGPYISSEWNYGGLP 130
Query: 150 VWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMES 209
VWL +IP +EFRT+N F EEM+ F KIVD+M++E LF+ QGGPII+ QIENEYGN+
Sbjct: 131 VWLHNIPNMEFRTHNRAFMEEMKTFTTKIVDMMQDETLFAVQGGPIIIAQIENEYGNVMH 190
Query: 210 SYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTL 269
+YG G Y+KW A +A GVPWVM +Q++AP+ +ID+C+GYYCD ++PN +KP +
Sbjct: 191 AYGNNGTQYLKWCAQLADSFETGVPWVMSQQSNAPQFMIDSCDGYYCDQFQPNDNHKPKI 250
Query: 270 WTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYI 329
WTENW G Y WG + PHRP ED+A+AVARFFQ GG+F NYYMY GGTNF RT+GGP+
Sbjct: 251 WTENWTGGYKNWGTQNPHRPAEDVAYAVARFFQFGGTFQNYYMYHGGTNFKRTAGGPYVT 310
Query: 330 TSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRAN 389
TSYDYDAP+DEYG L++PKWGHL+ LH +K E L S+Q G A VY
Sbjct: 311 TSYDYDAPLDEYGNLNQPKWGHLRQLHNLLKSKENILTQG-SSQNTDYGNMVTATVY--- 366
Query: 390 RYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIK 449
Y +S C F+ N + A++ F YT+P WSVSILP+C + +NTAKV++QT+I
Sbjct: 367 TYDGKSTC--FIGNAHQSKDATINFRNNEYTIPAWSVSILPNCSSEAYNTAKVNTQTTIM 424
Query: 450 TVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSEN------NFTVQGILEH 503
+ + L + W +EP + + T +L+
Sbjct: 425 VKKDNEDLEYAL-----------------RWQWRQEPFVQMKDGQITGIIDLTAPKLLDQ 467
Query: 504 LNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGS--VI 561
VT D+SDYLW+IT I + DD W T E R + + + VL VF+NG+ G+
Sbjct: 468 KVVTNDFSDYLWYITSIDIKGDDDPSW-TKEFR--LRVHTSGHVLHVFVNGKHVGTQHAK 524
Query: 562 GHWVKVVQ--PVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTG------FK 613
K V ++ +G N++ LLS TVGL NYG F + G G V+L +
Sbjct: 525 NGQFKFVHESKIKLTTGKNEISLLSTTVGLPNYGPFFDNIEVGVLGPVQLVAAVGDYDYD 584
Query: 614 NGDI--DLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPS--TFTWYKTYFDAP 669
+ +I DLSK W+Y+VGL GE + YS E + W D +P+ WYKT F +P
Sbjct: 585 DDEIVKDLSKNQWSYKVGLHGEHEMHYSYENSLKTW---YTDAVPTDRILVWYKTTFKSP 641
Query: 670 DGIDPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGN 728
G DPV +DL +GKG AWVNG+ IGRYW + +A + GC CDYRG Y S+KC + C
Sbjct: 642 IGDDPVVVDLSGLGKGHAWVNGNSIGRYWSSYLADENGCSPKCDYRGPYTSNKCLSMCAQ 701
Query: 729 PTQTWYHVPRSWLQASN-NLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWS 787
P+Q WYHVPRS+L+ ++ N LV+FEE GG P+ ++ + VC E +
Sbjct: 702 PSQRWYHVPRSFLRDNDQNTLVLFEELGGQPYYVNFLTVTVGKVCANAYEGN-------- 753
Query: 788 NSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
+ L C +IS I+FAS+G P+G C F +GNC + +LS +
Sbjct: 754 ----------------TLELACNKNQVISEIKFASFGLPKGECGSFQKGNCESSEALSAI 797
>gi|2209358|gb|AAB61470.1| beta-D-galactosidase [Mangifera indica]
Length = 663
Score = 716 bits (1849), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/681 (53%), Positives = 456/681 (66%), Gaps = 40/681 (5%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
M M+++M+ S V A+ VSYDH+AIIIDG RR+LIS IHYPR+TP+MWPD
Sbjct: 15 MFMLLLMLFSSWVCFVEAT-------VSYDHKAIIIDGQRRILISGSIHYPRSTPQMWPD 67
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
LI K+K+G DVI+TYVFWN HE G+Y F+ + D+V+F+KLV +GLY+ LRIGPYVC
Sbjct: 68 LIQKAKDG-VDVIQTYVFWNGHEPSPGKYYFEDRYDLVRFIKLVQQAGLYVHLRIGPYVC 126
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEWNFGGFPVWL+ +PGIEFRT+N PFK MQ+F +KIV +M+ E LF QGGPII+ QI
Sbjct: 127 AEWNFGGFPVWLKYVPGIEFRTDNEPFKAAMQKFTEKIVSMMKAEKLFETQGGPIILSQI 186
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYK 260
ENE+G +E G GK Y KWAA MA+GL GVPWVMCKQ DAP+ +I+ CNG+YC+ +
Sbjct: 187 ENEFGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWVMCKQDDAPDPVINTCNGFYCENFV 246
Query: 261 PNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG 320
PN NKP +WTENW GW+T +GG P RP ED+AF+VARF Q GGSF+NYYMY GGTNFG
Sbjct: 247 PNQKNKPKMWTENWTGWFTAFGGPTPQRPAEDVAFSVARFIQNGGSFVNYYMYHGGTNFG 306
Query: 321 RTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQN 380
RT+GGPF TSYDYDAP+DEYGLL EPKWGHL+DLH AIKLCE ALV+ D LG N
Sbjct: 307 RTAGGPFIATSYDYDAPLDEYGLLREPKWGHLRDLHKAIKLCESALVSTDPT-VTSLGNN 365
Query: 381 QEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTA 440
QE HV+ +C+AFLAN D ++A V F Y LPPWS+SILPDC+ VFNTA
Sbjct: 366 QEVHVFNPK----SGSCAAFLANYDTTSSAKVNFKIMQYELPPWSISILPDCKTAVFNTA 421
Query: 441 KVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGI 500
++ +Q+S+K ++P + QS IE SS+ + FT G+
Sbjct: 422 RLGAQSSLKQ------MTPVSTFSWQSYIEESASSS--------------DDKTFTTDGL 461
Query: 501 LEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSV 560
E LNVT+D SDYLW++T I + D + F K N P +TI S L VFINGQL+G+V
Sbjct: 462 WEQLNVTRDASDYLWYMTNINI-DSNEGFLK-NGQDPLLTIWSAGHALHVFINGQLSGTV 519
Query: 561 IGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGD 616
G + Q V+ + G N L LLS +VGLQN G E+ G G V L G G
Sbjct: 520 YGGVDNPKLTFSQNVKMRVGVNQLSLLSISVGLQNVGTHFEQWNTGVLGPVTLRGLNEGT 579
Query: 617 IDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPV 675
DLSK W+Y++GLKGE ++++ + EW + + TWYKT F+AP G +P+
Sbjct: 580 RDLSKQQWSYKIGLKGEDLSLHTVSGSSSVEWVEGSSLAQKQPLTWYKTTFNAPAGNEPL 639
Query: 676 ALDLGSMGKGQAWVNGHHIGR 696
ALD+ +MGKG W+N IGR
Sbjct: 640 ALDMSTMGKGLIWINSQSIGR 660
>gi|255563853|ref|XP_002522927.1| beta-galactosidase, putative [Ricinus communis]
gi|223537854|gb|EEF39470.1| beta-galactosidase, putative [Ricinus communis]
Length = 803
Score = 715 bits (1846), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/809 (47%), Positives = 493/809 (60%), Gaps = 74/809 (9%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
N++YD R++IIDG R++LISA IHYPR+ P MWP+L+ +KEGG DVIETYVFWN HE
Sbjct: 28 NITYDSRSLIIDGQRKLLISAAIHYPRSVPGMWPELVQTAKEGGVDVIETYVFWNGHEPS 87
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
Y F+ + D+VKFVK+V +G+YL LRIGP+V AEWNFGG PVWL +PG FRT+N
Sbjct: 88 PSNYYFEKRYDLVKFVKIVQQAGMYLILRIGPFVAAEWNFGGVPVWLHYVPGTVFRTDNY 147
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
FK MQ+F+ IV+LM++E LF+ QGGPII+ Q+ENEYG ES+YG+ GK Y WAA M
Sbjct: 148 NFKYHMQKFMTYIVNLMKKEKLFASQGGPIILAQVENEYGFYESAYGEGGKRYAMWAAQM 207
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ GVPW+MC+Q DAP ++I+ CN +YCD +KP +KP +WTENW GW+ T+G
Sbjct: 208 AVSQNIGVPWIMCQQFDAPNSVINTCNSFYCDQFKPIFPDKPKIWTENWPGWFQTFGAPN 267
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRP ED+AF+VARFFQ+GGS NYYMY GGTNFGRTSGGPF TSYDY+APIDEYGL
Sbjct: 268 PHRPAEDIAFSVARFFQKGGSVQNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGLAR 327
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
PKW HLK+LH AIKLCE L+ + + LG +QEA VY A G+ C+AFLAN+D
Sbjct: 328 LPKWAHLKELHKAIKLCELTLLNSVPVN-LSLGPSQEADVY-AEESGA---CAAFLANMD 382
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
E +V F SY LP WSVSILPDC+N VFNTAKV+SQTSI + +
Sbjct: 383 EKNDKTVVFRNMSYHLPAWSVSILPDCKNVVFNTAKVNSQTSI------------VEMVP 430
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
+ S + + W T E G+W ++ G ++H+N TKD +DYLW+ T I+V ++
Sbjct: 431 DDLRSSDKGTKALKWETFVENAGIWGTSDLVKNGFVDHINTTKDTTDYLWYTTSIFVGEN 490
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWV----KVVQPVEFQSGYNDLI 581
+ F K RP + I+S L F+N +L G+ G+ K +PV +G ND+
Sbjct: 491 E-EFLKKGG-RPVLLIESKGHALHAFVNQELQGTASGNGTHSPFKFKKPVSLVAGKNDIA 548
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS-I 640
LLS TVGLQN G+F E GAG VK+ GF NG IDLS WTY++GL+GE +Y+ I
Sbjct: 549 LLSMTVGLQNAGSFYEWVGAGLT-SVKMKGFNNGTIDLSTFNWTYKIGLQGEKLGMYNGI 607
Query: 641 EENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
W ++ TWYK A ++ W+ W +
Sbjct: 608 AVETVNWVATSKPPKDQPLTWYKRQIHARQMLN--------------WM--------WRI 645
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
NS+ T YHVPRSW + S N+LVIFEE GG+P +
Sbjct: 646 -----------------NSEMIL------VWTRYHVPRSWFKPSGNILVIFEEKGGDPTK 682
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEF 820
I+ R VC V+E YP S G S N A +HL C IIS+I+F
Sbjct: 683 ITFSRRKISGVCALVAED-YPMANL--ESLENAGSGSSNYKA-SVHLKCPKSSIISAIKF 738
Query: 821 ASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
AS+G+P G C +S G CH P S+SVV +
Sbjct: 739 ASFGSPAGACGSYSEGECHDPKSISVVEK 767
>gi|297808143|ref|XP_002871955.1| beta-galactosidase 7 [Arabidopsis lyrata subsp. lyrata]
gi|297317792|gb|EFH48214.1| beta-galactosidase 7 [Arabidopsis lyrata subsp. lyrata]
Length = 826
Score = 715 bits (1845), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/807 (45%), Positives = 497/807 (61%), Gaps = 67/807 (8%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VS+D RAI I+G RR+L+S IHYPR+T +MWPDLI K+K+GG D IETYVFWNAHE R
Sbjct: 28 VSHDERAITINGKRRILLSGSIHYPRSTADMWPDLINKAKDGGLDAIETYVFWNAHEPKR 87
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
+Y+F G D+V+F+K + +GLY LRIGPYVCAEWN+GGFPVWL ++P ++FRT N
Sbjct: 88 REYDFSGNLDVVRFIKTIQDAGLYSVLRIGPYVCAEWNYGGFPVWLHNMPNMKFRTVNPS 147
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
F EMQ F KIV++M+EE LF+ QGGPII+ QIENEYGN+ SSYG GK Y+ W A+MA
Sbjct: 148 FMNEMQNFTTKIVEMMKEEKLFASQGGPIILAQIENEYGNVISSYGAAGKAYIDWCANMA 207
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
L GVPW+MC+Q +AP+ +++ CNG+YCD Y+P + + P +WTENW GW+ WGG+ P
Sbjct: 208 NSLDIGVPWLMCQQPNAPQPMLETCNGFYCDQYEPTNPSTPKMWTENWTGWFKNWGGKHP 267
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
+R EDLAF+VARFFQ GG+F NYYMY GGTNFGR +GGP+ TSYDY APIDE+G L++
Sbjct: 268 YRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTSYDYHAPIDEFGNLNQ 327
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHLK LH +K E +L + ++ I LG + +A +Y ++ S F+ N++
Sbjct: 328 PKWGHLKQLHRVLKSMEKSLTYGNISR-IDLGNSIKATIYT-----TKEGSSCFIGNVNA 381
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
A V F G+ Y +P WSVS+LP+C +NTAKV++QTSI T + S P
Sbjct: 382 TANALVNFKGKDYHVPAWSVSVLPECDKEAYNTAKVNTQTSIMTEDSSKP---------- 431
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
+E S M +K S + +G+++ +VT D SDYLW++T++++ D
Sbjct: 432 EKLEWTWRPESAQKMILK------SSGDLIAKGLVDQKDVTNDASDYLWYMTRVHLDKKD 485
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVE-----FQSGYNDLI 581
W N T+ + S VL ++NG+ G+ K E G N +
Sbjct: 486 -PLWSRNM---TLRVHSNAHVLHAYVNGKYVGNQFVKDGKFDYRFEKKVNHLVHGTNHIS 541
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI---DLSKILWTYQVGLKGEFQQIY 638
LLS +VGLQNYGAF E G G V L G+K + DLS+ W Y++GL G +++
Sbjct: 542 LLSVSVGLQNYGAFFESGPTGINGPVSLVGYKGEETIEKDLSQHQWDYKIGLNGYNNKLF 601
Query: 639 SIEE-NEAEWTDLTRDGIPST--FTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
S + +W + + P++ TWYK F AP G +PV +D +GKG+AW+NG IG
Sbjct: 602 STKSVGHIKWAN---EMFPTSRMLTWYKAKFKAPLGKEPVIVDFNGLGKGEAWINGQSIG 658
Query: 696 RYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQAS-NNLLVIFEE 753
RYW + + GC+D CDYRG Y SDKC CG PTQ WYHVPRS+L+AS +N + +FEE
Sbjct: 659 RYWPSFNSSDDGCKDECDYRGEYGSDKCAFMCGEPTQRWYHVPRSFLKASGHNTITLFEE 718
Query: 754 TGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGY 813
GGNP ++ K VC + E NK+ H H
Sbjct: 719 MGGNPSMVNFKTVVVGTVCARAHEH--------------------NKVELSCHNH----- 753
Query: 814 IISSIEFASYGTPQGRCQKFSRGNCHA 840
IS+++FAS+G P G C F+ G C
Sbjct: 754 PISAVKFASFGNPVGHCGTFAVGTCQG 780
>gi|449452767|ref|XP_004144130.1| PREDICTED: beta-galactosidase 15-like [Cucumis sativus]
Length = 827
Score = 712 bits (1837), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/810 (46%), Positives = 487/810 (60%), Gaps = 70/810 (8%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSY +R I IDG ++ +S IHYPR+TP+MWPDLI KSKEGG D IETYVFWNAHE +R
Sbjct: 26 VSYTNRGITIDGQPKIFLSGSIHYPRSTPQMWPDLIKKSKEGGLDTIETYVFWNAHEPVR 85
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIE-FRTNNA 165
QY+F D+V+F+K + + GLY LRIGPYVCAEWN+GGFPVWL ++PGIE RT N
Sbjct: 86 RQYDFSANLDLVRFIKTIQNEGLYAVLRIGPYVCAEWNYGGFPVWLHNLPGIEELRTTNP 145
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
F EMQ F IVD+M++E LF+ QGGPII+ QIENEYGN+ +SYG GK YV W A+M
Sbjct: 146 VFMNEMQNFTTLIVDMMKQENLFASQGGPIILAQIENEYGNVMTSYGDAGKAYVNWCANM 205
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A GVPW+MC+Q DAPE I+ CNG+YCD + PN+ P +WTENW GW+ +WGGR
Sbjct: 206 ADSQNVGVPWIMCQQDDAPEPTINTCNGWYCDQFTPNNAKSPKMWTENWTGWFKSWGGRD 265
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P R EDLAF+VARFFQ GG+F NYYMY GGTNF R +GGP+ T+YDY+AP+DEYG L+
Sbjct: 266 PVRTPEDLAFSVARFFQLGGTFQNYYMYHGGTNFDRMAGGPYITTTYDYNAPLDEYGNLN 325
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PK+GHLK LHAA+K E ALV+ + + Y + S F +NI+
Sbjct: 326 QPKFGHLKQLHAALKSIEKALVSGN------VTTTDLTDSVSITEYATDKGKSCFFSNIN 379
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
E T A V +LG+ + +P WSVSILPDC+ V+NTAKV++QTS+ + N + +
Sbjct: 380 ETTDALVNYLGKDFNVPAWSVSILPDCQEEVYNTAKVNTQTSV------MVKKENKAENE 433
Query: 466 QSMIESKLSSTSKSWMTVKEPI---GVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
++E WM E I + T +++ + D SDYLW++T + +
Sbjct: 434 PEVLE---------WMWRPENIDNTARLGKGQVTANKLIDQKDAANDASDYLWYMTSVNL 484
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK-------VVQPVEFQS 575
D W +NE+ T+ I+ ++ F+NG+ GS W Q V+ +
Sbjct: 485 KKKD-PIW-SNEM--TLRINVSGHIVHAFVNGEHIGS---QWASYDVYNYIFEQEVKLKP 537
Query: 576 GYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGD----IDLSKILWTYQVGLK 631
G N + LLS T+GL+NYGA + +G G V+L G ++GD DLS W+Y+VGL
Sbjct: 538 GKNIISLLSATIGLKNYGAQYDLIQSGIVGPVQLIG-RHGDETIIKDLSNHKWSYEVGLH 596
Query: 632 GEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNG 691
G +++S E A + TWYKT F P G DPV LDL +GKG AWVNG
Sbjct: 597 GFENRLFSPESRFATKWQSGNLPVNRMMTWYKTTFKPPLGTDPVTLDLQGLGKGMAWVNG 656
Query: 692 HHIGRYWTVVAPKGGCQDT-CDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVI 750
H IGRYW + GC D CDYRG+Y + KC +CG PTQ WYHVPRSWL +N LV+
Sbjct: 657 HSIGRYWPSFIAEDGCSDEPCDYRGSYTNTKCVRDCGKPTQQWYHVPRSWLNEGDNTLVL 716
Query: 751 FEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQ 810
FEE GGNP ++ K + C E + L CQ
Sbjct: 717 FEEFGGNPSLVNFKTIAMEKACGHAYEKK------------------------SLELSCQ 752
Query: 811 DGYIISSIEFASYGTPQGRCQKFSRGNCHA 840
G I+ I+FAS+G P G C FS+G+C
Sbjct: 753 -GKEITGIKFASFGDPTGSCGNFSKGSCEG 781
>gi|449529387|ref|XP_004171681.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 15-like [Cucumis
sativus]
Length = 827
Score = 711 bits (1836), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/810 (46%), Positives = 487/810 (60%), Gaps = 70/810 (8%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSY +R I IDG ++ +S IHYPR+TP+MWPDLI KSKEGG D IETYVFWNAHE +R
Sbjct: 26 VSYTNRGITIDGQPKIFLSGSIHYPRSTPQMWPDLIKKSKEGGLDTIETYVFWNAHEPVR 85
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIE-FRTNNA 165
QY+F D+V+F+K + + GLY LRIGPYVCAEWN+GGFPVWL ++PGIE RT N
Sbjct: 86 RQYDFSANLDLVRFIKTIQNEGLYAVLRIGPYVCAEWNYGGFPVWLHNLPGIEELRTTNP 145
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
F EMQ F IVD+M++E LF+ QGGPII+ QIENEYGN+ +SYG GK YV W A+M
Sbjct: 146 VFMNEMQNFTTLIVDMMKQENLFASQGGPIILAQIENEYGNVMTSYGDAGKAYVNWCANM 205
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A GVPW+MC+Q DAPE I+ CNG+YCD + PN+ P +WTENW GW+ +WGGR
Sbjct: 206 ADSQNVGVPWIMCQQDDAPEPTINTCNGWYCDQFTPNNAKSPKMWTENWTGWFKSWGGRD 265
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P R EDLAF+VARFFQ GG+F NYYMY GGTNF R +GGP+ T+YDY+AP+DEYG L+
Sbjct: 266 PVRTPEDLAFSVARFFQLGGTFQNYYMYHGGTNFDRMAGGPYITTTYDYNAPLDEYGNLN 325
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PK+GHLK LHAA+K E ALV+ + + Y + S F +NI+
Sbjct: 326 QPKFGHLKQLHAALKSIEKALVSGN------VTTTDLTDSVSITEYATDKGKSCFFSNIN 379
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
E T A V +LG+ + +P WSVSILPDC+ V+NTAKV++QTS+ + N + +
Sbjct: 380 ETTDALVNYLGKDFNVPAWSVSILPDCQEEVYNTAKVNTQTSV------MVKKENKAENE 433
Query: 466 QSMIESKLSSTSKSWMTVKEPI---GVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
++E WM E I + T +++ + D SDYLW++T + +
Sbjct: 434 PEVLE---------WMWRPENIDNTARLGKGQVTANKLIDQKDAANDASDYLWYMTSVNL 484
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK-------VVQPVEFQS 575
D W +NE+ T+ I+ ++ F+NG+ GS W Q V+ +
Sbjct: 485 KKKD-PIW-SNEM--TLRINVSGHIVHAFVNGEHIGS---QWASYDVYNYIXEQEVKLKP 537
Query: 576 GYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGD----IDLSKILWTYQVGLK 631
G N + LLS T+GL+NYGA + +G G V+L G ++GD DLS W+Y+VGL
Sbjct: 538 GKNIISLLSATIGLKNYGAQYDLIQSGIVGPVQLIG-RHGDETIIKDLSNHKWSYEVGLH 596
Query: 632 GEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNG 691
G +++S E A + TWYKT F P G DPV LDL +GKG AWVNG
Sbjct: 597 GFENRLFSPESRFATKWQSGNLPVNRMMTWYKTTFKPPLGTDPVTLDLQGLGKGMAWVNG 656
Query: 692 HHIGRYWTVVAPKGGCQDT-CDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVI 750
H IGRYW + GC D CDYRG+Y + KC +CG PTQ WYHVPRSWL +N LV+
Sbjct: 657 HSIGRYWPSFIAEDGCSDEPCDYRGSYTNTKCVRDCGKPTQQWYHVPRSWLNEGDNTLVL 716
Query: 751 FEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQ 810
FEE GGNP ++ K + C E + L CQ
Sbjct: 717 FEEFGGNPSLVNFKTIAMEKACGHAYEKK------------------------SLELSCQ 752
Query: 811 DGYIISSIEFASYGTPQGRCQKFSRGNCHA 840
G I+ I+FAS+G P G C FS+G+C
Sbjct: 753 -GKEITGIKFASFGDPTGSCGNFSKGSCEG 781
>gi|357450109|ref|XP_003595331.1| Beta-galactosidase [Medicago truncatula]
gi|355484379|gb|AES65582.1| Beta-galactosidase [Medicago truncatula]
Length = 830
Score = 709 bits (1831), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/808 (45%), Positives = 498/808 (61%), Gaps = 65/808 (8%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VS+D RAI IDG RR+LIS IHYPR+TP+MWPDLI K+KEGG D IETYVFWNAHE IR
Sbjct: 27 VSHDGRAIKIDGKRRVLISGSIHYPRSTPQMWPDLIKKAKEGGLDAIETYVFWNAHEPIR 86
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
+Y+F G ND+++F+K + GL+ LRIGPYVCAEWN+GG PVW+ ++PG+E RT N
Sbjct: 87 REYDFSGNNDLIRFLKTIQDEGLFAVLRIGPYVCAEWNYGGIPVWVYNLPGVEIRTANKV 146
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
F EMQ F IVD++R+E LF+ QGGPII+ QIENEYGN+ S+YG +GK Y+ W A+MA
Sbjct: 147 FMNEMQNFTTLIVDMVRKEKLFASQGGPIILSQIENEYGNVMSAYGDEGKAYINWCANMA 206
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
GVPW+MC+Q DAP+ +I+ CNG+YC ++PN+ N P +WTENW GW+ WGG+ P
Sbjct: 207 DSFNIGVPWIMCQQPDAPQPMINTCNGWYCHDFEPNNPNSPKMWTENWVGWFKNWGGKDP 266
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
HR ED+A++VARFF+ GG+F NYYMY GGTNFGRT+GGP+ TSYDYDAP+DEYG +++
Sbjct: 267 HRTAEDIAYSVARFFETGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYGNIAQ 326
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHLK+LH +K E +L + ++ I LG +A VY N + S FL N +
Sbjct: 327 PKWGHLKELHLVLKSMENSLTNGNVSK-IDLGSYVKATVYATN-----DSSSCFLTNTNT 380
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
T A+VTF G +Y +P WSVSILPDC+ +NTAKV+ QTSI V ++
Sbjct: 381 TTDATVTFKGNTYNVPAWSVSILPDCQTEEYNTAKVNVQTSIM-------------VKRE 427
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
+ E + + W + +++ + I++ D SDYLW++T++ ++ D
Sbjct: 428 NKAEDEPEALKWVWRAENVHNSLIGKSSVSKNTIVDQKIAANDSSDYLWYMTRLDINQKD 487
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWV-------KVVQPVEFQSGYND 579
W N + + I+ V+ F+NG+ GS HW + ++ + G ND
Sbjct: 488 -PVWTNNTI---LRINGTGHVIHAFVNGEHIGS---HWATYGIHNDQFETNIKLKHGRND 540
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI---DLSKILWTYQVGLKGEFQQ 636
+ LLS TVGLQNYG +K G ++L G K + DLS WTY+VGL G +
Sbjct: 541 ISLLSVTVGLQNYGKEYDKWQDGLVSPIELIGTKGDETIIKDLSSHKWTYKVGLHGWENK 600
Query: 637 IYSIEENEAEWTDLTRDGIP--STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHI 694
+S + A + + +P TWYKT F AP DP+ +DL MGKG AWVNGH +
Sbjct: 601 FFSQDTFFASSSKWESNELPINKMLTWYKTTFKAPLESDPIVVDLQGMGKGYAWVNGHSL 660
Query: 695 GRYW-TVVAPKGGCQDT-CDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
GRYW + A + GC D CDYRG YN KC +NCG P+Q WYHVPR +++ N LV+FE
Sbjct: 661 GRYWPSYNADEDGCSDDPCDYRGEYNDTKCVSNCGKPSQRWYHVPRDFIEDGVNTLVLFE 720
Query: 753 ETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDG 812
E GGNP +I+ + C E+ + L C G
Sbjct: 721 EIGGNPSQINFQTVIVGSACANAYENK------------------------TLELSCH-G 755
Query: 813 YIISSIEFASYGTPQGRCQKFSRGNCHA 840
IS I+FAS+G PQG C F++G+C +
Sbjct: 756 RSISDIKFASFGNPQGTCGAFTKGSCES 783
>gi|449476344|ref|XP_004154711.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 803
Score = 708 bits (1827), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/801 (44%), Positives = 492/801 (61%), Gaps = 62/801 (7%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NVSYD AIII+G RR++ S IHYPR+T MWPDLI K+K+GG D IETY+FW+ HE
Sbjct: 4 NVSYDSNAIIINGERRVIFSGSIHYPRSTDAMWPDLIQKAKDGGLDAIETYIFWDRHEPQ 63
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
R +Y+F G + +KF +LV +GLY+ +RIGPYVCAEWN+GGFP+WL ++PGI+ RT+N
Sbjct: 64 RQKYDFSGHLNFIKFFQLVQDAGLYIVMRIGPYVCAEWNYGGFPLWLHNMPGIQLRTDNQ 123
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
+K EM F KIV++ ++ LF+ QGGPII+ QIENEYGN+ + YG GK Y+ W A M
Sbjct: 124 VYKNEMLTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGNAGKAYINWCAQM 183
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A L GVPW+MC+Q+DAP+ II+ CNG+YCD + PN+ P ++TENW GW+ WG +
Sbjct: 184 AESLNIGVPWIMCQQSDAPQPIINTCNGFYCDSFSPNNPKSPKMFTENWVGWFKKWGDKD 243
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+R ED+AF+VARFFQ GG F NYYMY GGTNFGRTSGGPF TSYDY+AP+DEYG L+
Sbjct: 244 PYRSAEDVAFSVARFFQSGGVFNNYYMYHGGTNFGRTSGGPFITTSYDYNAPLDEYGNLN 303
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLK LH++IKL E L + K + +N + C FL+N D
Sbjct: 304 QPKWGHLKQLHSSIKLGEKILTNGTHSN--KTFGSFVTLTKFSNPTTKERFC--FLSNTD 359
Query: 406 EHTAASVTFLGQ-SYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
+ A++ Y +P WSVSI+ C+ VFNTAK++SQTS+
Sbjct: 360 DTNDATIDLQADGKYFVPAWSVSIIDGCKKEVFNTAKINSQTSMFV-------------- 405
Query: 465 QQSMIESKLSSTSKSWMTVKEPIG--VWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
++++ + SW+ E + + + F +LE T D SDYLW++T +
Sbjct: 406 ---KVQNEKENVKLSWVWAPEAMSDTLQGKGTFKENLLLEQKGTTIDSSDYLWYMTNVET 462
Query: 523 SDDDISFWKTNEVRP-TVTIDSMRDVLRVFINGQLTGSVIGHWVKVV---QPVEFQSGYN 578
+ T+ + T+ +++ VL F+N + GS G+ + +P+ ++G N
Sbjct: 463 NG-------TSSIHNVTLQVNTKGHVLHAFVNTRYIGSQWGNNGQSFVFEKPILLKAGTN 515
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFR-GQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI 637
+ LLS TVGL+NY AF + G G + L G N +LS LW+Y+VGL GE +Q+
Sbjct: 516 IITLLSATVGLKNYDAFYDTLPTGIDGGPIYLIGDGNVTTNLSSNLWSYKVGLNGEIKQL 575
Query: 638 YS-IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGR 696
Y+ + E W L ++ I TWYKT F P GIDPV LD+ MGKG+AW+NG IGR
Sbjct: 576 YNPVFSQETSWNTLNKNSIGRRMTWYKTSFKTPSGIDPVTLDMQGMGKGEAWINGQSIGR 635
Query: 697 YW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
+W + +A C +TCDYRGAY+ KC NCGNP+Q WYH+PRS+L + N LV+FEE G
Sbjct: 636 FWPSFIAGNDNCSETCDYRGAYDPSKCVGNCGNPSQRWYHIPRSFLSNNTNTLVLFEEIG 695
Query: 756 GNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYII 815
G+P ++SV+ + +C +E + L CQ YII
Sbjct: 696 GSPQQVSVQTITIGTICGNANE------------------------GSTLELSCQGEYII 731
Query: 816 SSIEFASYGTPQGRCQKFSRG 836
S I+FASYG P+G+C F +G
Sbjct: 732 SEIQFASYGNPKGKCGSFKQG 752
>gi|224068510|ref|XP_002326135.1| predicted protein [Populus trichocarpa]
gi|222833328|gb|EEE71805.1| predicted protein [Populus trichocarpa]
Length = 824
Score = 707 bits (1825), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/835 (45%), Positives = 487/835 (58%), Gaps = 64/835 (7%)
Query: 24 MMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIA 83
++++ L + S SA+ V YD A+II+G R++++S IHYPR+T EMW DLI
Sbjct: 8 ILLIASLGLIGSCSAAAAAA-AAVEYDSSAVIINGQRKIILSGSIHYPRSTVEMWSDLIQ 66
Query: 84 KSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEW 143
K+KEGG D IETY+FWNAHE R +YNF G D VKF + V +GLY LRIGPY CAEW
Sbjct: 67 KAKEGGLDTIETYIFWNAHERRRREYNFTGNLDFVKFFQKVQEAGLYGILRIGPYACAEW 126
Query: 144 NFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENE 203
N+GGFPVWL +IP I+FRT+N FK EMQ F KIV++ +E LF+ QGGPII+ QIENE
Sbjct: 127 NYGGFPVWLHNIPEIKFRTDNEIFKNEMQTFTTKIVNMAKEAKLFASQGGPIILAQIENE 186
Query: 204 YGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNS 263
YGN+ YG+ GK YV+W A MA+ GVPW+MC+Q+DAP ++I+ CNG+YCD + PNS
Sbjct: 187 YGNVMGPYGEAGKSYVQWCAQMAVAQNIGVPWIMCQQSDAPSSVINTCNGFYCDTFTPNS 246
Query: 264 YNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTS 323
P +WTENW GWY WG + PHR EDLAF+VARFFQ G NYYMY+GGTNFGRTS
Sbjct: 247 PKSPKMWTENWTGWYKKWGQKDPHRTAEDLAFSVARFFQYNGVLQNYYMYYGGTNFGRTS 306
Query: 324 GGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEA 383
GGPF TSYDYDAP+DEYG L++PKWGHLK+LHAA+KL E L + E
Sbjct: 307 GGPFIATSYDYDAPLDEYGNLNQPKWGHLKNLHAALKLGEKILTNSTVKTTKYSDGWVEL 366
Query: 384 HVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVS 443
Y +N G + FL+N Y +P WSVSIL DC +NTAKV+
Sbjct: 367 TTYTSNIDGER---LCFLSNTKMDGLDVDLQQDGKYFVPAWSVSILQDCNKETYNTAKVN 423
Query: 444 SQTSI---KTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIG--VWSENNFTVQ 498
QTS+ K E PL KL SW EP + + F
Sbjct: 424 VQTSLIVKKLHENDTPL--------------KL-----SWEWAPEPTKAPLHGQGGFKAT 464
Query: 499 GILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG 558
+LE T D SDYLW++T + D++ T T+ + L F+NG+ G
Sbjct: 465 QLLEQKAATYDESDYLWYMTSV---DNN----GTASKNVTLRVKYSGQFLHAFVNGKEIG 517
Query: 559 SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFR-GQVKLTGFKNGDI 617
S G+ +P + G N + LLS TVGLQNYG F ++ G G V+L N
Sbjct: 518 SQHGYTFTFEKPALLKPGTNIISLLSATVGLQNYGEFFDEGPEGIAGGPVELIDSGNTTT 577
Query: 618 DLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVAL 677
DLS W+Y+VGL GE + Y A+W + TWYKT F AP G +PV +
Sbjct: 578 DLSSNEWSYKVGLNGEGGRFYDPTSGRAKWVSGNLR-VGRAMTWYKTTFQAPSGTEPVVV 636
Query: 678 DLGSMGKGQAWVNGHHIGRYWTVV-APKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHV 736
DL MGKG AWVNG+ +GR+W ++ A GC CDYRG Y KC +NCGNPTQ WYHV
Sbjct: 637 DLQGMGKGHAWVNGNSLGRFWPILTADPNGCDGKCDYRGQYKEGKCLSNCGNPTQRWYHV 696
Query: 737 PRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKL 796
PRS+L +N L++FEE GGNP ++S ++ +T +C N+Y
Sbjct: 697 PRSFLNNGSNTLILFEEIGGNPSDVSFQITATETIC--------------GNTYE----- 737
Query: 797 SINKMAPEMHLHCQDG-YIISSIEFASYGTPQG-RCQKFSRGNCHAPMSLSVVSE 849
+ L C G IIS I++AS+G PQG C F RG+ A S S V +
Sbjct: 738 -----GTTLELSCNGGRRIISDIQYASFGDPQGSSCGSFQRGSVEASRSFSAVEK 787
>gi|225428017|ref|XP_002278545.1| PREDICTED: beta-galactosidase 13 [Vitis vinifera]
gi|297744615|emb|CBI37877.3| unnamed protein product [Vitis vinifera]
Length = 833
Score = 706 bits (1821), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/813 (44%), Positives = 506/813 (62%), Gaps = 62/813 (7%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD R++I++G R +L S IHYPR+TPEMWPD++ K+K GG ++I+TYVFWN HE +
Sbjct: 32 VTYDGRSLIVNGRRELLFSGSIHYPRSTPEMWPDILQKAKHGGLNLIQTYVFWNIHEPVE 91
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQ+NF+G D+VKF+KL+G GLY LRIGP++ AEWN GGFP WLR++P I FR+ N P
Sbjct: 92 GQFNFEGNYDLVKFIKLIGDYGLYATLRIGPFIEAEWNHGGFPYWLREVPDIIFRSYNEP 151
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK M+++ + I+++M+E LF+ QGGPII+ QIENEY +++ +Y + G YV+WA MA
Sbjct: 152 FKYHMEKYSRMIIEMMKEAKLFAPQGGPIILAQIENEYNSIQLAYRELGVQYVQWAGKMA 211
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGGR 284
+GLGAGVPW+MCKQ DAP+ +I+ CNG +C D + PN NKP+LWTENW Y +G
Sbjct: 212 VGLGAGVPWIMCKQKDAPDPVINTCNGRHCGDTFTGPNRPNKPSLWTENWTAQYRVFGDP 271
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
R EDLAF+VARF + G+ NYYMY GGTNFGRT G F T Y +AP+DEYGL
Sbjct: 272 PSQRAAEDLAFSVARFISKNGTLANYYMYHGGTNFGRT-GSSFVTTRYYDEAPLDEYGLQ 330
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
EPKWGHLKDLH+A++LC+ AL S KLG+++E Y + G+ C+AFL N
Sbjct: 331 REPKWGHLKDLHSALRLCKKALFTG-SPGVEKLGKDKEVRFYE--KPGTHI-CAAFLTNN 386
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
AA++TF G+ Y LPP S+SILPDC+ V+NT +V +Q + +
Sbjct: 387 HSREAATLTFRGEEYFLPPHSISILPDCKTVVYNTQRVVAQHNAR--------------- 431
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
+ ++SK+++ + W +EPI V ++ + +E N KD SDY W +T I +S+
Sbjct: 432 --NFVKSKIANKNLKWEMSQEPIPVMTDMKILTKSPMELYNFLKDRSDYAWFVTSIELSN 489
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYNDL 580
D+ K ++ P + I ++ + F+NG GS G V+ +PV+F++G N +
Sbjct: 490 YDLPMKK--DIIPVLQISNLGHAMLAFVNGNFIGSAHGSNVEKNFVFRKPVKFKAGTNYI 547
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LL TVGL N GA++E AG V++ G G +D++ W QVG+ GE + Y+
Sbjct: 548 ALLCMTVGLPNSGAYMEHRYAGIH-SVQILGLNTGTLDITNNGWGQQVGVNGEHVKAYTQ 606
Query: 641 E-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW- 698
+ +WT G TWYKTYFD P+G DPV L + SM KG AWVNG +IGRYW
Sbjct: 607 GGSHRVQWTAAKGKG--PAMTWYKTYFDMPEGNDPVILRMTSMAKGMAWVNGKNIGRYWL 664
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
+ ++P P+Q+ YHVPR+WL+ S+NLLVIFEETGGNP
Sbjct: 665 SYLSP----------------------LEKPSQSEYHVPRAWLKPSDNLLVIFEETGGNP 702
Query: 759 FEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLS--INKMAPEMHLHCQDGYIIS 816
EI V+L + +C V+E H P V+ W D K+ ++++ P+ HL C + +I
Sbjct: 703 EEIEVELVNRDTICSIVTEYHPPHVKSWQRH---DSKIRAVVDEVKPKGHLKCPNYKVIV 759
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
++FAS+G P G C F GNC AP S VV +
Sbjct: 760 KVDFASFGNPLGACGDFEMGNCTAPNSKKVVEQ 792
>gi|357484445|ref|XP_003612510.1| Beta-galactosidase [Medicago truncatula]
gi|355513845|gb|AES95468.1| Beta-galactosidase [Medicago truncatula]
Length = 828
Score = 704 bits (1818), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/812 (45%), Positives = 492/812 (60%), Gaps = 65/812 (8%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V YD A+II+G RR++ S IHYPR+T +MWPDL+ K+K+GG D IETY+FW+ HE +R
Sbjct: 25 VKYDSNALIINGERRLIFSGAIHYPRSTVDMWPDLVQKAKDGGLDAIETYIFWDRHEQVR 84
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G+YNF G D VKF K + +GLY +RIGPY CAEWN+GGFPVWL IPGIE RT+NA
Sbjct: 85 GRYNFSGNLDFVKFFKTIQEAGLYGIIRIGPYSCAEWNYGGFPVWLHQIPGIEMRTDNAA 144
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
+K EMQ FV KI+++ +E LF+ QGGPII+ QIENEYG++ ++ + GK Y+KWAA MA
Sbjct: 145 YKNEMQIFVTKIINVAKEANLFASQGGPIILAQIENEYGDIMWNFKEPGKAYIKWAAQMA 204
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
L GVPW MC+Q DAP+ II+ CNGYYC +KPN+ P ++TENW GW+ WG R P
Sbjct: 205 LAQNIGVPWFMCQQNDAPQPIINTCNGYYCHNFKPNNPKSPKMFTENWIGWFQKWGERAP 264
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
HR ED A+AVARFFQ GG F NYYMY GGTNFGRTSGGP+ ITSYDYDAPI+EYG L++
Sbjct: 265 HRTAEDSAYAVARFFQNGGVFNNYYMYHGGTNFGRTSGGPYIITSYDYDAPINEYGNLNQ 324
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PK+GHLK LH AIKL E L S LG Y N G++ FL+N +
Sbjct: 325 PKYGHLKFLHEAIKLGEKVLTNYTSRNDKDLGNGITLTTY-TNSVGAR---FCFLSNDKD 380
Query: 407 HTAASVTFLGQ-SYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+T +V Y +P WSV+IL C VFNTAKV+SQTSI
Sbjct: 381 NTDGNVDLQNDGKYFVPAWSVTILDGCNKEVFNTAKVNSQTSI----------------- 423
Query: 466 QSMIESKL--SSTSK-SWMTVKEPIG--VWSENNFTVQGILEHLNVTKDYSDYLWHITQI 520
+E K+ SST+K +W + EP + + +LE +T D SDYLW++T +
Sbjct: 424 ---MEKKIDNSSTNKLTWAWIMEPKKDTMNGRGSIKAHQLLEQKELTLDASDYLWYMTSV 480
Query: 521 YVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG---SVIGHWVKVVQPVEFQSGY 577
++D +N + +++ L ++N + G S G+ + V ++G
Sbjct: 481 DIND------TSNWSNANLHVETSGHTLHGYVNKRYIGYGHSQFGNNFTYEKQVSLKNGT 534
Query: 578 NDLILLSQTVGLQNYGAFLEKDGAGFR-GQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQ 636
N + LLS TVGL NYGA ++ G G VKL G + IDLS W+++VGL GE ++
Sbjct: 535 NIITLLSATVGLANYGARFDEIKTGISDGPVKLVGQNSVTIDLSTGNWSFKVGLNGEKRR 594
Query: 637 IYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGR 696
Y ++ + + TWYKT F +P G +P+ +DL +GKG AWVNG IGR
Sbjct: 595 FYDLQPRSGVAWNTSSYPTGKPLTWYKTQFKSPLGPNPIVVDLQGLGKGHAWVNGKSIGR 654
Query: 697 YWTV-VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
YWT + GC DTCDYRG Y +KC T C +P+Q WYHVPRS+L N L++FEE G
Sbjct: 655 YWTSWITSTAGCSDTCDYRGNYKKEKCNTGCASPSQRWYHVPRSFLNDDMNTLILFEEIG 714
Query: 756 GNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYII 815
GNP +S +T+ +C V E GKL L CQ+G +I
Sbjct: 715 GNPQNVSFLTETTKTICANVYEG---------------GKL---------ELSCQNGQVI 750
Query: 816 SSIEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
+SI FAS+G PQG+C F +G+ + S S++
Sbjct: 751 TSINFASFGNPQGQCGSFKKGSWESLNSQSMM 782
>gi|449442765|ref|XP_004139151.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 7-like [Cucumis
sativus]
Length = 803
Score = 704 bits (1817), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/806 (44%), Positives = 487/806 (60%), Gaps = 72/806 (8%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NVSYD AIII+G RR++ S IHYPR+T MWPDLI K+K+GG D IETY+FW+ HE
Sbjct: 4 NVSYDSNAIIINGERRVIFSGSIHYPRSTDAMWPDLIQKAKDGGLDAIETYIFWDRHEPQ 63
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
R +Y+F G + +KF +LV +GLY+ +RIGPYVCAEWN+GGFP+WL ++PGI+ RT+N
Sbjct: 64 RQKYDFSGHLNFIKFFQLVQDAGLYIVMRIGPYVCAEWNYGGFPLWLHNMPGIQLRTDNQ 123
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
+K EM F KIV++ ++ LF+ QGGPII+ QIENEYGN+ + YG GK Y+ W A M
Sbjct: 124 VYKNEMLTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGNAGKAYINWCAQM 183
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A GVPW+MC+Q+DAP+ II+ CNG+YCD + PN+ P ++TENW GW+ WG +
Sbjct: 184 AESFNIGVPWIMCQQSDAPQPIINTCNGFYCDSFSPNNPKSPKMFTENWVGWFKKWGDKD 243
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+R ED+AF+VARFFQ GG F NYYMY GGTNFGRTSGGPF TSYDY+AP+DEYG L+
Sbjct: 244 PYRSAEDVAFSVARFFQSGGVFNNYYMYHGGTNFGRTSGGPFITTSYDYNAPLDEYGNLN 303
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLK LH++IKL E L + +GS + F +
Sbjct: 304 QPKWGHLKQLHSSIKLGEKILTNGTHSN------KTFGSFVTFKTFGSFVTLTKFS---N 354
Query: 406 EHTAASVTFLGQS------YTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSP 459
T FL + Y +P WSVSI+ C+ VFNTAK++SQTSI
Sbjct: 355 PTTKERFCFLSNTXKADGKYFVPAWSVSIIDGCKKEVFNTAKINSQTSIFV--------- 405
Query: 460 NISVPQQSMIESKLSSTSKSWMTVKEPIG--VWSENNFTVQGILEHLNVTKDYSDYLWHI 517
++++ + SW+ E + + + F +LE T D SDYLW++
Sbjct: 406 --------KVQNEKENVKLSWVWAPEAMSDTLQGKGTFKENLLLEQKGTTIDSSDYLWYM 457
Query: 518 TQIYVSDDDISFWKTNEVRP-TVTIDSMRDVLRVFINGQLTGSVIGHWVKVV---QPVEF 573
T + + T+ + T+ +++ VL F+N + GS G+ + +P+
Sbjct: 458 TNVETNG-------TSSIHNVTLQVNTKGHVLHAFVNTRYIGSQWGNNGQSFVFEKPILL 510
Query: 574 QSGYNDLILLSQTVGLQNYGAFLEKDGAGFRG-QVKLTGFKNGDIDLSKILWTYQVGLKG 632
++G N + LLS TVGL+NY AF + G G + L G N IDLS LW+Y+VGL G
Sbjct: 511 KAGTNIITLLSATVGLKNYDAFYDTLPTGIDGGPIYLIGDGNVKIDLSSNLWSYKVGLNG 570
Query: 633 EFQQIYS-IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNG 691
E +Q+Y+ + E W L ++ I TWYKT F P GIDPV LD+ MGKG+AW+NG
Sbjct: 571 EIKQLYNPVFSQETSWNTLNKNSIGRRMTWYKTSFKTPSGIDPVTLDMQGMGKGEAWING 630
Query: 692 HHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVI 750
IGR+W + +A C +TCDYRGAY+ KC NCGNP+Q WYH+PRS+L + N LV+
Sbjct: 631 QSIGRFWPSFIAGNDNCSETCDYRGAYDPSKCVGNCGNPSQRWYHIPRSFLSNNTNTLVL 690
Query: 751 FEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQ 810
FEE GG+P ++SV+ + +C +E + L CQ
Sbjct: 691 FEEIGGSPQQVSVQTITIGTICGNANE------------------------GSTLELSCQ 726
Query: 811 DGYIISSIEFASYGTPQGRCQKFSRG 836
YIIS I+FASYG P+G+C F +G
Sbjct: 727 GEYIISEIQFASYGNPKGKCGSFKQG 752
>gi|326517964|dbj|BAK07234.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 616
Score = 702 bits (1813), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/632 (53%), Positives = 446/632 (70%), Gaps = 26/632 (4%)
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
QY+F+G+ND+V+FVK +GLY+ LRIGPYVCAEWN+GGFP+WL IPGI+ RT+N PF
Sbjct: 1 QYDFEGRNDLVRFVKAAADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKLRTDNEPF 60
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
K EMQRF +K+V M+ L++ QGGPII+ QIENEYGN+ +SYG GK Y++WAA MA+
Sbjct: 61 KTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIAASYGAAGKSYIRWAAGMAV 120
Query: 228 GLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPH 287
L GVPWVMC+QTDAPE +I+ CNG+YCD + P+ ++P LWTENW GW+ ++GG +P+
Sbjct: 121 ALDTGVPWVMCQQTDAPEPLINTCNGFYCDQFTPSLPSRPKLWTENWSGWFLSFGGAVPY 180
Query: 288 RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEP 347
RP EDLAFAVARF+QRGG+ NYYMY GGTNFGR+SGGPF TSYDYDAPIDEYGL+ +P
Sbjct: 181 RPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDEYGLVRQP 240
Query: 348 KWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEH 407
KWGHL+D+H AIK+CEPAL+A D + Y+ LGQN EAHVY+ S S C+AFLANID+
Sbjct: 241 KWGHLRDVHKAIKMCEPALIATDPS-YMSLGQNAEAHVYK-----SGSLCAAFLANIDDQ 294
Query: 408 TAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQ---TSIKTVEFSLPLSPNISVP 464
+ +VTF G++Y LP WSVSILPDC+N V NTA+++SQ T ++ + FS S
Sbjct: 295 SDKTVTFNGKAYKLPAWSVSILPDCKNVVLNTAQINSQVASTQMRNLGFSTQAS------ 348
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
S +E++L+++ SW EP+G+ EN T G++E +N T D SD+LW+ T I V+
Sbjct: 349 DGSSVEAELAAS--SWSYAVEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVAG 406
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQL----TGSVIGHWVKVVQPVEFQSGYNDL 580
+ N + + ++S+ VL+VFING+L GS + + PV +G N +
Sbjct: 407 GEPYL---NGSQSNLLVNSLGHVLQVFINGKLAGSSKGSASSSLISLTTPVTLVTGKNKI 463
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS TVGL NYGAF + GAG G VKLTG K G +DLS WTYQ+GL+GE +Y+
Sbjct: 464 DLLSATVGLTNYGAFFDLVGAGITGPVKLTGPK-GTLDLSSAEWTYQIGLRGEDLHLYNP 522
Query: 641 EENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-T 699
E EW + TWYK+ F AP G DPVA+D MGKG+AWVNG IGRYW T
Sbjct: 523 SEASPEWVSDNSYPTNNPLTWYKSKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPT 582
Query: 700 VVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQ 731
+AP+ GC ++C+YRG+Y++ KC CG P+Q
Sbjct: 583 NIAPQSGCVNSCNYRGSYSATKCLKKCGQPSQ 614
>gi|6686886|emb|CAB64743.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 788
Score = 702 bits (1813), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/796 (45%), Positives = 490/796 (61%), Gaps = 67/796 (8%)
Query: 58 GNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDI 117
G RR+L+S IHYPR+T +MWPDLI K+K+GG D IETYVFWNAHE R +Y+F G D+
Sbjct: 1 GKRRILLSGSIHYPRSTADMWPDLINKAKDGGLDAIETYVFWNAHEPKRREYDFSGNLDV 60
Query: 118 VKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKK 177
V+F+K + +GLY LRIGPYVCAEWN+GGFPVWL ++P ++FRT N F EMQ F K
Sbjct: 61 VRFIKTIQDAGLYSVLRIGPYVCAEWNYGGFPVWLHNMPNMKFRTVNPSFMNEMQNFTTK 120
Query: 178 IVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVM 237
IV +M+EE LF+ QGGPII+ QIENEYGN+ SSYG +GK Y+ W A+MA L GVPW+M
Sbjct: 121 IVKMMKEEKLFASQGGPIILAQIENEYGNVISSYGAEGKAYIDWCANMANSLDIGVPWLM 180
Query: 238 CKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAV 297
C+Q +AP+ +++ CNG+YCD Y+P + + P +WTENW GW+ WGG+ P+R EDLAF+V
Sbjct: 181 CQQPNAPQPMLETCNGFYCDQYEPTNPSTPKMWTENWTGWFKNWGGKHPYRTAEDLAFSV 240
Query: 298 ARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHA 357
ARFFQ GG+F NYYMY GGTNFGR +GGP+ TSYDY AP+DE+G L++PKWGHLK LH
Sbjct: 241 ARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTSYDYHAPLDEFGNLNQPKWGHLKQLHT 300
Query: 358 AIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQ 417
+K E +L + ++ I LG + +A +Y ++ S F+ N++ A V F G+
Sbjct: 301 VLKSMEKSLTYGNISR-IDLGNSIKATIYT-----TKEGSSCFIGNVNATADALVNFKGK 354
Query: 418 SYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTS 477
Y +P WSVS+LPDC +NTAKV++QTSI T + S P +E S
Sbjct: 355 DYHVPAWSVSVLPDCDKEAYNTAKVNTQTSIMTEDSSKP----------ERLEWTWRPES 404
Query: 478 KSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRP 537
M +K + +G+++ +VT D SDYLW++T++++ D W N
Sbjct: 405 AQKMILK------GSGDLIAKGLVDQKDVTNDASDYLWYMTRLHLDKKD-PLWSRNM--- 454
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQ-----SGYNDLILLSQTVGLQNY 592
T+ + S VL ++NG+ G+ K E + G N + LLS +VGLQNY
Sbjct: 455 TLRVHSNAHVLHAYVNGKYVGNQFVKDGKFDYRFERKVNHLVHGTNHISLLSVSVGLQNY 514
Query: 593 GAFLEKDGAGFRGQVKLTGFKNGDI---DLSKILWTYQVGLKGEFQQIYSIEE-NEAEWT 648
G F E G G V L G+K + DLS+ W Y++GL G +++SI+ +W
Sbjct: 515 GPFFESGPTGINGPVSLVGYKGEETIEKDLSQHQWDYKIGLNGYNDKLFSIKSVGHQKWA 574
Query: 649 DLTRDGIPS--TFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKG 705
+ + +P+ TWYK F AP G +PV +DL +GKG+AW+NG IGRYW + +
Sbjct: 575 N---EKLPTGRMLTWYKAKFKAPLGKEPVIVDLNGLGKGEAWINGQSIGRYWPSFNSSDD 631
Query: 706 GCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQAS-NNLLVIFEETGGNPFEISVK 764
GC+D CDYRGAY SDKC CG PTQ WYHVPRS+L AS +N + +FEE GGNP ++ K
Sbjct: 632 GCKDECDYRGAYGSDKCAFMCGKPTQRWYHVPRSFLNASGHNTITLFEEMGGNPSMVNFK 691
Query: 765 LRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYG 824
VC + E + ++ L C + IS+++FAS+G
Sbjct: 692 TVVVGTVCARAHEHN------------------------KVELSCHN-RPISAVKFASFG 726
Query: 825 TPQGRCQKFSRGNCHA 840
P G C F+ G C
Sbjct: 727 NPLGHCGSFAVGTCQG 742
>gi|225441062|ref|XP_002284027.1| PREDICTED: beta-galactosidase-like [Vitis vinifera]
Length = 833
Score = 702 bits (1813), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/823 (44%), Positives = 494/823 (60%), Gaps = 74/823 (8%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
++ D R I+I+G R++LIS +HYPR+TPEMWPDLI KSK+GG + I+TYVFW+ HE R
Sbjct: 30 ITSDARGIMINGERKILISGSVHYPRSTPEMWPDLIQKSKDGGLNTIDTYVFWDLHEPQR 89
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
QY+F G D+V+F+K + + GLY LRIGPYVCAEW +GGFPVWL + P I+ RTNN
Sbjct: 90 RQYDFTGNKDLVRFIKAIQAQGLYAVLRIGPYVCAEWTYGGFPVWLHNQPSIQLRTNNTV 149
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
+ EMQ F IVD+M++E LF+ QGGPII+ QIENEYGN+ +Y G Y+ W A MA
Sbjct: 150 YMSEMQTFTTMIVDMMKKEQLFASQGGPIIISQIENEYGNVMRAYHDAGVQYINWCAQMA 209
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
L GVPW+MC+Q +AP+ +I+ CNGYYCD + PN+ N P +WTENW GWY WGG P
Sbjct: 210 AALDTGVPWIMCQQDNAPQPMINTCNGYYCDQFTPNNPNSPKMWTENWSGWYKNWGGSDP 269
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
HR EDLAF+VARF+Q GG+F NYYMY GGTNFGRT+GGP+ TSYDYDAP++EYG ++
Sbjct: 270 HRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLNEYGNKNQ 329
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHL+DLH + E AL D + + A Y Q S F N +
Sbjct: 330 PKWGHLRDLHLLLLSMEKALTYGD------VKNVDYETLTSATIYSYQGKSSCFFGNSNA 383
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
++ + G +YT+P WSVSILPDC N V+NTAKV+SQ S + S
Sbjct: 384 DRDVTINYGGVNYTIPAWSVSILPDCSNEVYNTAKVNSQYSTFVKKGS------------ 431
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
E++ S W E I + FT +L+ V +D SDYL+++T + +S+DD
Sbjct: 432 ---EAENEPNSLQWTWRGETIQYITPGRFTASELLDQKTVAEDTSDYLYYMTTVDISNDD 488
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG---SVIGHW-VKVVQPVEFQSGYNDLIL 582
W + T+++++ +L F+NG+ G +++G + + + V Q G N++ L
Sbjct: 489 -PIWGKDL---TLSVNTSGHILHAFVNGEHIGYQYALLGQFEFQFRRSVTLQLGKNEITL 544
Query: 583 LSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKIL-----WTYQVGLKGEFQQI 637
LS TVGL NYG + G G V++ NG D+ K L W Y+ GL GE ++I
Sbjct: 545 LSATVGLTNYGPDFDMVNQGIHGPVQIIA-SNGSADIIKDLSNNNQWAYKAGLNGEDKKI 603
Query: 638 YSIEENEAEWTDLTRDGIP--STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
+ +W D +P +F WYK FDAP G DPV +DL +GKG+AWVNGH +G
Sbjct: 604 FLGRARYNQWKS---DNLPVNRSFVWYKATFDAPPGEDPVVVDLMGLGKGEAWVNGHSLG 660
Query: 696 RYWTVVAPKG-GCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEET 754
RYW +G GC CDYRG Y ++KC TNCGNP+Q WYHVPRS+L +++N LV+FEE
Sbjct: 661 RYWPSYIARGEGCSPECDYRGPYKAEKCNTNCGNPSQRWYHVPRSFLASTDNRLVLFEEF 720
Query: 755 GGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYI 814
GGNP ++ + + C E + + L CQ G
Sbjct: 721 GGNPSSVTFQTVTVGNACANAREGY------------------------TLELSCQ-GRA 755
Query: 815 ISSIEFASYGTPQGRC--------QKFSRGNCHAPMSLSVVSE 849
IS I+FAS+G PQG C Q F +G C A SLS++ +
Sbjct: 756 ISGIKFASFGDPQGTCGKPFATGSQVFEKGTCEAADSLSIIQK 798
>gi|298205211|emb|CBI17270.3| unnamed protein product [Vitis vinifera]
Length = 1064
Score = 699 bits (1803), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/480 (70%), Positives = 389/480 (81%), Gaps = 29/480 (6%)
Query: 377 LGQNQEAHVYR------ANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILP 430
+ Q AHVYR + + G+ S+CSAFLANIDEH ASVTFLGQ Y LPPWSVSILP
Sbjct: 561 MDTKQTAHVYRVKESLYSTQSGNGSSCSAFLANIDEHKTASVTFLGQIYKLPPWSVSILP 620
Query: 431 DCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVW 490
DCR TVFNTAKV +QTSIKT +K+S K+WMT+KEPI VW
Sbjct: 621 DCRTTVFNTAKVGAQTSIKT--------------------NKISYVPKTWMTLKEPISVW 660
Query: 491 SENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRV 550
SENNFT+QG+LEHLNVTKD+SDYLW IT+I VS +DISFW+ N+V PT++IDSMRD+L +
Sbjct: 661 SENNFTIQGVLEHLNVTKDHSDYLWRITRINVSAEDISFWEENQVSPTLSIDSMRDILHI 720
Query: 551 FINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLT 610
F+NGQL GSVIGHWVKVVQP++ GYNDL+LLSQTVGLQNYGAFLEKDGAGF+GQVKLT
Sbjct: 721 FVNGQLIGSVIGHWVKVVQPIQLLQGYNDLVLLSQTVGLQNYGAFLEKDGAGFKGQVKLT 780
Query: 611 GFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENE-AEWTDLTRDGIPSTFTWYKTYFDAP 669
GFKNG+IDLS+ WTYQVGL+GEFQ+IY I+E+E AEWTDLT D PSTFTWYKT+FDAP
Sbjct: 781 GFKNGEIDLSEYSWTYQVGLRGEFQKIYMIDESEKAEWTDLTPDASPSTFTWYKTFFDAP 840
Query: 670 DGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNP 729
+G +PVALDLGSMGKGQAWVNGHHIGRYWT VAPK GC CDYRG Y++ KC TNCGNP
Sbjct: 841 NGENPVALDLGSMGKGQAWVNGHHIGRYWTRVAPKDGC-GKCDYRGHYHTSKCATNCGNP 899
Query: 730 TQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNS 789
TQ WYH+PRSWLQASNNLLV+FEETGG PFEISVK RST+ +C +VSESHYP ++ WS S
Sbjct: 900 TQIWYHIPRSWLQASNNLLVLFEETGGKPFEISVKSRSTQTICAEVSESHYPSLQNWSPS 959
Query: 790 YSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+D S NKM PEMHL C DG+ ISSIEFASYGTPQG CQ FS+G CHAP SL++VS+
Sbjct: 960 DFIDQN-SKNKMTPEMHLQCDDGHTISSIEFASYGTPQGSCQMFSQGQCHAPNSLALVSK 1018
Score = 640 bits (1651), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 290/344 (84%), Positives = 318/344 (92%)
Query: 42 FKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNA 101
F PFNVSYDHRA++IDG RRML+SAGIHYPRATPEMWPDLIAKSKEGGADVI+TYVFWN
Sbjct: 24 FAPFNVSYDHRALLIDGKRRMLVSAGIHYPRATPEMWPDLIAKSKEGGADVIQTYVFWNG 83
Query: 102 HESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFR 161
HE +R QYNF+G+ DIVKFVKLVGSSGLYL LRIGPYVCAEWNFGGFPVWLRDIPGIEFR
Sbjct: 84 HEPVRRQYNFEGRYDIVKFVKLVGSSGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIEFR 143
Query: 162 TNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKW 221
T+NAPFK+EMQRFVKKIVDLM++EMLFSWQGGPIIMLQIENEYGN+ESS+GQ+GKDYVKW
Sbjct: 144 TDNAPFKDEMQRFVKKIVDLMQKEMLFSWQGGPIIMLQIENEYGNVESSFGQRGKDYVKW 203
Query: 222 AASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTW 281
AA MAL L AGVPWVMC+Q DAP+ II+ACNG+YCD + PNS NKP LWTE+W+GW+ +W
Sbjct: 204 AARMALELDAGVPWVMCQQADAPDIIINACNGFYCDAFWPNSANKPKLWTEDWNGWFASW 263
Query: 282 GGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEY 341
GGR P RPVED+AFAVARFFQRGGSF NYYMYFGGTNFGR+SGGPFY+TSYDYDAPIDEY
Sbjct: 264 GGRTPKRPVEDIAFAVARFFQRGGSFHNYYMYFGGTNFGRSSGGPFYVTSYDYDAPIDEY 323
Query: 342 GLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHV 385
GLLS+PKWGHLK+LHAAIKLCEPALVA DS QYIKLG QE V
Sbjct: 324 GLLSQPKWGHLKELHAAIKLCEPALVAVDSPQYIKLGPMQEVGV 367
>gi|414865884|tpg|DAA44441.1| TPA: hypothetical protein ZEAMMB73_968467 [Zea mays]
Length = 641
Score = 698 bits (1802), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/623 (54%), Positives = 436/623 (69%), Gaps = 19/623 (3%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YDHRA++IDG RR+L+S IHYPR+TP+MWP LI K+K+GG DVIETYVFW+ HE +
Sbjct: 29 NVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKAKDGGLDVIETYVFWDIHEPV 88
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
RGQY+F+G+ D+ FVK V +GLY+ LRIGPYVCAEWN+GGFP+WL IPGI+FRT+N
Sbjct: 89 RGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTDNE 148
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK EMQRF K+VD M+ L++ QGGPII+ QIENEYGN++S+YG GK Y++WAA M
Sbjct: 149 PFKAEMQRFTAKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAPGKAYMRWAAGM 208
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ L GVPWVMC+Q DAP+ +I+ CNG+YCD + PNS KP +WTENW GW+ ++GG +
Sbjct: 209 AVSLDTGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGWFLSFGGAV 268
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RPVEDLAFAVARF+QRGG+F NYYMY GGTN R+SGGPF TSYDYDAPIDEYGL+
Sbjct: 269 PYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAPIDEYGLVR 328
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHL+D+H AIKLCEPAL+A D + Y LG N EA VY+ S C+AFLANID
Sbjct: 329 QPKWGHLRDVHKAIKLCEPALIATDPS-YTSLGPNVEAAVYKVG-----SVCAAFLANID 382
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ +VTF G+ Y LP WSVSILPDC+N V NTA+++SQT+ + + L +
Sbjct: 383 GQSDKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQINSQTTGSEMRY---LESSNVASD 439
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
S + +L+ + W EP+G+ +N T G++E +N T D SD+LW+ T I V D
Sbjct: 440 GSFVTPELAVS--DWSYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITVKGD 497
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVV----QPVEFQSGYNDLI 581
+ N + + ++S+ VL+V+ING++ GS G + +P+E G N +
Sbjct: 498 EPYL---NGSQSNLAVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKID 554
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS TVGL NYGAF + GAG G VKL+G NG +DLS WTYQ+GL+GE +Y
Sbjct: 555 LLSATVGLSNYGAFFDLVGAGITGPVKLSGL-NGALDLSSAEWTYQIGLRGEDLHLYDPS 613
Query: 642 ENEAEWTDLTRDGIPSTFTWYKT 664
E EW I WYK
Sbjct: 614 EASPEWVSANAYPINHPLIWYKV 636
>gi|297740029|emb|CBI30211.3| unnamed protein product [Vitis vinifera]
Length = 829
Score = 696 bits (1797), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/823 (44%), Positives = 492/823 (59%), Gaps = 78/823 (9%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
++ D R I+I+G R++LIS +HYPR+TPEMWPDLI KSK+GG + I+TYVFW+ HE R
Sbjct: 30 ITSDARGIMINGERKILISGSVHYPRSTPEMWPDLIQKSKDGGLNTIDTYVFWDLHEPQR 89
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
QY+F G D+V+F+K + + GLY LRIGPYVCAEW +GGFPVWL + P I+ RTNN
Sbjct: 90 RQYDFTGNKDLVRFIKAIQAQGLYAVLRIGPYVCAEWTYGGFPVWLHNQPSIQLRTNNTV 149
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
+ EMQ F IVD+M++E LF+ QGGPII+ QIENEYGN+ +Y G Y+ W A MA
Sbjct: 150 YMSEMQTFTTMIVDMMKKEQLFASQGGPIIISQIENEYGNVMRAYHDAGVQYINWCAQMA 209
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
L GVPW+MC+Q +AP+ +I+ CNGYYCD + PN+ N P +WTENW GWY WGG P
Sbjct: 210 AALDTGVPWIMCQQDNAPQPMINTCNGYYCDQFTPNNPNSPKMWTENWSGWYKNWGGSDP 269
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
HR EDLAF+VARF+Q GG+F NYYMY GGTNFGRT+GGP+ TSYDYDAP++EYG ++
Sbjct: 270 HRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLNEYGNKNQ 329
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PKWGHL+DLH + E AL D + + A Y Q S F N +
Sbjct: 330 PKWGHLRDLHLLLLSMEKALTYGD------VKNVDYETLTSATIYSYQGKSSCFFGNSNA 383
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
++ + G +YT+P WSVSILPDC N V+NTAKV+SQ S + S
Sbjct: 384 DRDVTINYGGVNYTIPAWSVSILPDCSNEVYNTAKVNSQYSTFVKKGS------------ 431
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
E++ S W E I + FT +L+ V +D SDYL+++T +DD
Sbjct: 432 ---EAENEPNSLQWTWRGETIQYITPGRFTASELLDQKTVAEDTSDYLYYMT---TNDDP 485
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG---SVIGHW-VKVVQPVEFQSGYNDLIL 582
I W + T+++++ +L F+NG+ G +++G + + + V Q G N++ L
Sbjct: 486 I--WGKDL---TLSVNTSGHILHAFVNGEHIGYQYALLGQFEFQFRRSVTLQLGKNEITL 540
Query: 583 LSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKIL-----WTYQVGLKGEFQQI 637
LS TVGL NYG + G G V++ NG D+ K L W Y+ GL GE ++I
Sbjct: 541 LSATVGLTNYGPDFDMVNQGIHGPVQIIA-SNGSADIIKDLSNNNQWAYKAGLNGEDKKI 599
Query: 638 YSIEENEAEWTDLTRDGIP--STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
+ +W D +P +F WYK FDAP G DPV +DL +GKG+AWVNGH +G
Sbjct: 600 FLGRARYNQWKS---DNLPVNRSFVWYKATFDAPPGEDPVVVDLMGLGKGEAWVNGHSLG 656
Query: 696 RYWTVVAPKG-GCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEET 754
RYW +G GC CDYRG Y ++KC TNCGNP+Q WYHVPRS+L +++N LV+FEE
Sbjct: 657 RYWPSYIARGEGCSPECDYRGPYKAEKCNTNCGNPSQRWYHVPRSFLASTDNRLVLFEEF 716
Query: 755 GGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYI 814
GGNP ++ + + C E + + L CQ G
Sbjct: 717 GGNPSSVTFQTVTVGNACANAREGY------------------------TLELSCQ-GRA 751
Query: 815 ISSIEFASYGTPQGRC--------QKFSRGNCHAPMSLSVVSE 849
IS I+FAS+G PQG C Q F +G C A SLS++ +
Sbjct: 752 ISGIKFASFGDPQGTCGKPFATGSQVFEKGTCEAADSLSIIQK 794
>gi|449436074|ref|XP_004135819.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 643
Score = 696 bits (1797), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/666 (52%), Positives = 450/666 (67%), Gaps = 37/666 (5%)
Query: 109 YNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFK 168
YNF+ + D+V+FVKLV +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N PFK
Sbjct: 6 YNFEDRYDLVRFVKLVHQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIAFRTDNGPFK 65
Query: 169 EEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALG 228
MQ+F +KIV LM+ E L+ QGGPII+ QIENEYG +E G GK Y KWAA MALG
Sbjct: 66 AAMQKFTEKIVGLMKGEKLYESQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAAQMALG 125
Query: 229 LGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHR 288
L GVPWVMCKQ DAP+ +ID CNG+YC+ +KPN KP +WTE W GW+T +GG P+R
Sbjct: 126 LDTGVPWVMCKQDDAPDPVIDTCNGFYCENFKPNKVYKPKMWTEAWTGWFTEFGGPAPYR 185
Query: 289 PVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPK 348
PVED+A++VARF Q GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGLL EPK
Sbjct: 186 PVEDMAYSVARFIQNGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLREPK 245
Query: 349 WGHLKDLHAAIKLCEPALVAAD-SAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEH 407
W HL+DLH AIKLCEPALV+ D + Y LG NQEAHV++ R GS C+AFLAN D
Sbjct: 246 WSHLRDLHKAIKLCEPALVSVDPTVSY--LGSNQEAHVFKT-RSGS---CAAFLANYDAS 299
Query: 408 TAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQS 467
++A+VTF Y LPPWSVSILPDC++ +FNTAKV + T S P+ +
Sbjct: 300 SSATVTFGNNQYDLPPWSVSILPDCKSVIFNTAKVGAPT---------------SQPKMT 344
Query: 468 MIESKLSSTSKSWMTVKEPIG-VWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
+ +S SW++ E ++E+ T+ G++E ++VT+D +DYLW++T I + D +
Sbjct: 345 PV------SSFSWLSYNEETASAYTEDTTTMAGLVEQISVTRDSTDYLWYMTDIRI-DPN 397
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLIL 582
F K+ + P +T+ S L VFINGQL+G+ G + + + V ++G N L +
Sbjct: 398 EGFLKSGQ-WPLLTVFSAGHALHVFINGQLSGTTYGGSENYKLTFSKYVNLRAGINKLSI 456
Query: 583 LSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE- 641
LS VGL N G E G G V L G D+S W+Y++GLKGE ++S+
Sbjct: 457 LSVAVGLPNGGLHYETWNTGVLGPVTLKGLNEDTRDMSGYKWSYKIGLKGEALNLHSVSG 516
Query: 642 ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVV 701
+ EW + TWYKT FD+P G +P+ALD+ SMGKGQ W+NG IGR+W
Sbjct: 517 SSSVEWVTGSLVAQKQPLTWYKTTFDSPKGNEPLALDMSSMGKGQIWINGQSIGRHWPAY 576
Query: 702 APKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEI 761
KG C C+Y G +N KC +NCG P+Q WYHVPR+WL++S N+LVIFEE GGNP I
Sbjct: 577 TAKGSC-GKCNYGGIFNEKKCHSNCGEPSQRWYHVPRAWLKSSGNVLVIFEEWGGNPEGI 635
Query: 762 SVKLRS 767
S+ RS
Sbjct: 636 SLVKRS 641
>gi|255561536|ref|XP_002521778.1| beta-galactosidase, putative [Ricinus communis]
gi|223538991|gb|EEF40588.1| beta-galactosidase, putative [Ricinus communis]
Length = 828
Score = 693 bits (1788), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/817 (45%), Positives = 494/817 (60%), Gaps = 60/817 (7%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YD R++I+DG R++L S IHYPR+TPEMW LIAK+KEGG DVI+TYVFWN HE
Sbjct: 23 DVTYDGRSLIVDGQRKLLFSGSIHYPRSTPEMWQSLIAKAKEGGLDVIDTYVFWNLHEPQ 82
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
GQY+F G+ DIV+F+K V + GLY+ LRIGP++ EW++GG P WL DIPGI FR++N
Sbjct: 83 PGQYDFSGRRDIVRFIKEVQAQGLYVCLRIGPFIQGEWSYGGLPFWLHDIPGIVFRSDNE 142
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK +MQ F KIV +M+ E L+ QGGPII+ QIENEYG +E +Y ++G YVKWAA M
Sbjct: 143 PFKVQMQGFTTKIVTMMQSEKLYVSQGGPIILSQIENEYGTVEEAYHEKGPAYVKWAAQM 202
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDG--YKPNSYNKPTLWTENWDGWYTTWGG 283
A+GL GVPWVMCKQ DAP+ +I+ACNG C PNS NKP +WTENW Y G
Sbjct: 203 AVGLNTGVPWVMCKQNDAPDPVINACNGLRCAETFVGPNSPNKPAIWTENWTTRYVITGE 262
Query: 284 RLPHRPVEDLAFAVARFF-QRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYG 342
+ R VED+AF V +F + GSF+NYYMY GGTNFGRT+ F TSY APIDEYG
Sbjct: 263 NIRIRSVEDIAFQVTQFIVAKKGSFVNYYMYHGGTNFGRTASA-FVPTSYYDQAPIDEYG 321
Query: 343 LLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLA 402
L+ +PKWGHLK++HAAIKLC L++ I LGQ Q+A V+ G C+AFL
Sbjct: 322 LIRQPKWGHLKEMHAAIKLCLTPLLSGGQVT-ISLGQQQQAFVFT----GLSGECAAFLL 376
Query: 403 NIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNIS 462
N D ASV F SY LPP S+SILPDC+ FNTAKVS+Q + +
Sbjct: 377 NNDTANTASVQFRNASYDLPPNSISILPDCKTVAFNTAKVSTQYTTR------------- 423
Query: 463 VPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
SM SKL W+ +E I + E + + ILE ++ TKD SDYLW+ +
Sbjct: 424 ----SMTRSKLLDGEDKWVQYQEAIVNFDETSVKSEAILEQMSTTKDASDYLWYTFRFQQ 479
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGH----WVKVVQPVEFQSGYN 578
D + + + S+ VL F+NGQ G G + V G N
Sbjct: 480 ESSD--------TQAVLNVRSLGHVLHAFVNGQAVGYAQGSHKNPQFTLQSTVSLSEGVN 531
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIY 638
++ LLS VG+ + GA++E+ AG R +VK+ K G+ + + W YQVGL GE QI+
Sbjct: 532 NVSLLSVMVGMPDSGAYMERRAAGLR-KVKIQE-KEGNKEFTNYSWGYQVGLLGEKLQIF 589
Query: 639 SIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
+ + ++ +W + +++ + + TWYKT FDAP PVAL+LGSMGKG+AWVNG IGRY
Sbjct: 590 TDQGSSQVQWANFSKNAL-NPLTWYKTLFDAPLEDAPVALNLGSMGKGEAWVNGQSIGRY 648
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTW----YHVPRSWLQASNNLLVIFEE 753
W YR + S + N + Y+VPRS+L+ NLLV+ EE
Sbjct: 649 WP------------SYRASDGSSQIWYAYFNTGAIFRAVRYNVPRSFLKPKGNLLVVLEE 696
Query: 754 TGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGY 813
+GGNP +ISV S +C V+ SH P V WS + D S+ + P + L C
Sbjct: 697 SGGNPLQISVDTASISKICSHVTASHLPLVSSWSKRTNTDNNNSL-QARPRVKLDCPSNT 755
Query: 814 IISSIEFASYGTPQGRC-QKFSRGNCHAPMSLSVVSE 849
IS+I FASYGTP+G C ++ G CH+ S ++V +
Sbjct: 756 KISNILFASYGTPEGTCGDAYAVGMCHSSSSEAIVQK 792
>gi|108707234|gb|ABF95029.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|108707235|gb|ABF95030.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 702
Score = 689 bits (1779), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/685 (49%), Positives = 455/685 (66%), Gaps = 24/685 (3%)
Query: 171 MQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLG 230
MQRF +K+VD M+ L++ QGGPII+ QIENEYGN++S+YG GK Y++WAA MA+ L
Sbjct: 1 MQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWAAGMAVSLD 60
Query: 231 AGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPV 290
GVPWVMC+Q+DAP+ +I+ CNG+YCD + PNS +KP +WTENW GW+ ++GG +P+RP
Sbjct: 61 TGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLSFGGAVPYRPA 120
Query: 291 EDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWG 350
EDLAFAVARF+QRGG+F NYYMY GGTNFGR++GGPF TSYDYDAPIDEYG++ +PKWG
Sbjct: 121 EDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEYGMVRQPKWG 180
Query: 351 HLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAA 410
HL+D+H AIKLCEPAL+AA+ + Y LGQN EA VY+ S C+AFLAN+D +
Sbjct: 181 HLRDVHKAIKLCEPALIAAEPS-YSSLGQNTEATVYQT---ADNSICAAFLANVDAQSDK 236
Query: 411 SVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIE 470
+V F G +Y LP WSVSILPDC+N V NTA+++SQ + + L +I S+I
Sbjct: 237 TVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMR---SLGSSIQDTDDSLIT 293
Query: 471 SKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFW 530
+L++ W EP+G+ EN T G++E +N T D SD+LW+ T I V D+
Sbjct: 294 PELATA--GWSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVKGDEPYL- 350
Query: 531 KTNEVRPTVTIDSMRDVLRVFINGQL----TGSVIGHWVKVVQPVEFQSGYNDLILLSQT 586
N + + ++S+ VL+++ING+L GS + + PV G N + LLS T
Sbjct: 351 --NGSQSNLLVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLSTT 408
Query: 587 VGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAE 646
VGL NYGAF + GAG G VKL+G NG ++LS WTYQ+GL+GE +Y+ E E
Sbjct: 409 VGLSNYGAFFDLVGAGVTGPVKLSG-PNGALNLSSTDWTYQIGLRGEDLHLYNPSEASPE 467
Query: 647 WTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKG 705
W WYKT F AP G DPVA+D MGKG+AWVNG IGRYW T +AP+
Sbjct: 468 WVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQS 527
Query: 706 GCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKL 765
GC ++C+YRGAY+S+KC CG P+QT YHVPRS+LQ +N LV+FE+ GG+P IS
Sbjct: 528 GCVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMISFTT 587
Query: 766 RSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHC-QDGYIISSIEFASYG 824
R T +C VSE H + W + + + P + L C ++G +IS+I+FAS+G
Sbjct: 588 RQTSSICAHVSEMHPAQIDSW-----ISPQQTSQTQGPALRLECPREGQVISNIKFASFG 642
Query: 825 TPQGRCQKFSRGNCHAPMSLSVVSE 849
TP G C ++ G C + +L+VV E
Sbjct: 643 TPSGTCGNYNHGECSSSQALAVVQE 667
>gi|358348424|ref|XP_003638247.1| hypothetical protein MTR_122s1070, partial [Medicago truncatula]
gi|355504182|gb|AES85385.1| hypothetical protein MTR_122s1070, partial [Medicago truncatula]
Length = 771
Score = 686 bits (1769), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/740 (49%), Positives = 453/740 (61%), Gaps = 61/740 (8%)
Query: 63 LISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVK 122
LISA IHYPR+ P MWP LI +KEGG DVIETYVFWN HE G Y F G+ D+V+F K
Sbjct: 1 LISASIHYPRSVP-MWPALIQTAKEGGIDVIETYVFWNGHELSPGNYYFGGRFDLVQFAK 59
Query: 123 LVGSSGLYLQLRIGPYVCAEWNFGG---------------------------------FP 149
+V +G+YL LRIGP+V AEWNFGG P
Sbjct: 60 VVQDAGMYLILRIGPFVAAEWNFGGEKNGVLICEDGEERGYRERADKNNQGNSRVLCGVP 119
Query: 150 VWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMES 209
VWL IPG FRT N PF M++F IV+LM++E LF+ QGGPII+ QIENEYG E+
Sbjct: 120 VWLHYIPGTVFRTYNQPFMHHMEKFTTYIVNLMKKEKLFASQGGPIILSQIENEYGYYEN 179
Query: 210 SYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTL 269
Y + GK Y WAA MA+ VPW+MC+Q DAP+ +ID CN +YCD + P S +P +
Sbjct: 180 YYKEDGKKYALWAAKMAVSQNTSVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPKRPKM 239
Query: 270 WTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYI 329
WTENW GW+ T+GGR PHRPVED+AF+VARFFQ+GGS NYYMY GGTNFGRT+GGPF
Sbjct: 240 WTENWPGWFKTFGGRDPHRPVEDVAFSVARFFQKGGSLNNYYMYHGGTNFGRTAGGPFIT 299
Query: 330 TSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRAN 389
TSYDYDAPIDEYGL PKWGHLK+LH AIKLCE L+ S I LG + EA +Y
Sbjct: 300 TSYDYDAPIDEYGLPRLPKWGHLKELHKAIKLCEHVLLYGKSVN-ISLGPSVEADIYT-- 356
Query: 390 RYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIK 449
S C+AF++N+D+ V F SY LP WSVSILPDC+N VFNTAKVSS T+I
Sbjct: 357 --DSSGACAAFISNVDDKNDKKVVFRNASYHLPAWSVSILPDCKNVVFNTAKVSSPTNIV 414
Query: 450 TVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKD 509
+ +P+ K T K W KE G+W + +F G ++H+N TKD
Sbjct: 415 AM-----------IPEHLQQSDKGQKTLK-WDVFKENPGIWGKADFVKNGFVDHINTTKD 462
Query: 510 YSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVV- 568
+DYLWH T I + D + F K +P + I+S L F+N + G+ G+
Sbjct: 463 TTDYLWHTTSILI-DANEEFLKKGS-KPALLIESKGHTLHAFVNQKYQGTGTGNGSHSAF 520
Query: 569 ---QPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWT 625
P+ ++G N++ +LS TVGLQ G F + GAG VK+ G N IDLS W
Sbjct: 521 TFKNPISLRAGKNEIAILSLTVGLQTAGPFYDFIGAGVT-SVKIIGLNNRTIDLSSNAWA 579
Query: 626 YQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGK 684
Y++G+ GE IY E N +WT + TWYK DAP G +PV LD+ MGK
Sbjct: 580 YKIGVLGEHLSIYQGEGMNSVKWTSTSEPPKGQALTWYKAIVDAPSGDEPVGLDMLYMGK 639
Query: 685 GQAWVNGHHIGRYWTVVA--PKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQ 742
G AW+NG IGRYW ++ K C CDYRG +N DKC T CG P+Q WYHVPRSW +
Sbjct: 640 GLAWLNGEEIGRYWPRISEFKKEDCVQECDYRGKFNPDKCDTGCGEPSQKWYHVPRSWFK 699
Query: 743 ASNNLLVIFEETGGNPFEIS 762
S N+LVIFEE GG+P +I+
Sbjct: 700 PSGNVLVIFEEKGGDPTKIT 719
>gi|449435864|ref|XP_004135714.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cucumis
sativus]
Length = 712
Score = 685 bits (1767), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/756 (48%), Positives = 474/756 (62%), Gaps = 53/756 (7%)
Query: 20 PMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWP 79
P +++ + L+ V S+ + V+YD +AIII+ RR+LIS IHYPR+TP+MWP
Sbjct: 2 PKTVLLFLSLLTWVGSTIGA-------VTYDEKAIIINDQRRILISGSIHYPRSTPQMWP 54
Query: 80 DLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYV 139
DLI K+K+GG D+IETYVFWN HE G+ ++ D + + +++ + ++ L P
Sbjct: 55 DLIQKAKDGGLDIIETYVFWNGHEPSEGKVTWE---DFL-YEQILYINCFHVALFXFPPY 110
Query: 140 CAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQ 199
F GFP+WL+ +PGI FRT+N PFK MQ+FV KIVD+M+ E L+ QGGPII+ Q
Sbjct: 111 FXFQKFSGFPIWLKFVPGIAFRTDNEPFKAAMQKFVTKIVDMMKLEKLYHTQGGPIILSQ 170
Query: 200 IENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGY 259
IENEYG +E G GK Y KW A MA+ L GVPWVMCKQ DAP+ +ID CNG+YC+ +
Sbjct: 171 IENEYGPVEWQIGAPGKSYTKWFAQMAVDLKTGVPWVMCKQEDAPDPLIDTCNGFYCENF 230
Query: 260 KPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNF 319
KPN KP +WTENW GWYT +GG P+RP ED+AF+VARF Q GS +NYY+Y GGTNF
Sbjct: 231 KPNQIYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQNNGSLVNYYVYHGGTNF 290
Query: 320 GRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQ 379
GRTS G F TSYD+DAPIDEYGL+ EPKWGHL+DLH AIKLCEPALV+AD LG+
Sbjct: 291 GRTS-GLFIATSYDFDAPIDEYGLIREPKWGHLRDLHKAIKLCEPALVSADPTS-TWLGK 348
Query: 380 NQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNT 439
NQEA V++ S S C+AFLAN D + V F Y LPPWS+SILPDC+ FNT
Sbjct: 349 NQEARVFK-----SSSACAAFLANYDTSASVKVNFWNNPYDLPPWSISILPDCKTVTFNT 403
Query: 440 AKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVK-EPIGVWSENNFTVQ 498
A Q +K+ E + + +S W++ K EP ++++ T
Sbjct: 404 A----QIGVKSYEAKM-----------------MPISSFGWLSYKEEPASAYAKDTTTKD 442
Query: 499 GILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG 558
G++E ++VT D +DYLW++ I + D F K+ + P ++++S +L VFINGQL+G
Sbjct: 443 GLVEQVSVTWDTTDYLWYMQDISI-DSTEGFLKSGK-WPLLSVNSAGHLLHVFINGQLSG 500
Query: 559 SVIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKN 614
SV G + + V + G N L +LS TVGL N G + AG G V L G
Sbjct: 501 SVYGSLEDPRITFSKYVNLKQGVNKLSMLSVTVGLPNVGLHFDTWNAGVLGPVTLKGLNE 560
Query: 615 GDIDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWT--DLTRDGIPSTFTWYKTYFDAPDG 671
G D+SK W+Y+VGL GE +YS + N +WT LT+ TWYKT F P G
Sbjct: 561 GTRDMSKYKWSYKVGLSGESLNLYSDKGSNSVQWTKGSLTQK---QPLTWYKTTFKTPAG 617
Query: 672 IDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQ 731
+P+ LD+ SM KGQ WVNG IGRY+ G C D C Y G + KC NCG P+Q
Sbjct: 618 NEPLGLDMSSMSKGQIWVNGRSIGRYFPGYIANGKC-DKCSYAGLFTEKKCLGNCGEPSQ 676
Query: 732 TWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRS 767
WYH+PR WL S+NLLVIFEE GG+P IS+ R+
Sbjct: 677 KWYHIPRDWLSPSDNLLVIFEEIGGSPDGISLVKRT 712
>gi|302141788|emb|CBI18991.3| unnamed protein product [Vitis vinifera]
Length = 821
Score = 682 bits (1761), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/813 (45%), Positives = 486/813 (59%), Gaps = 67/813 (8%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YD R++II+G RR+L S IHYPR+TPEMWP LI+K+KEGG DVIETY FWN HE
Sbjct: 31 SVTYDGRSLIINGQRRLLFSGSIHYPRSTPEMWPSLISKAKEGGIDVIETYAFWNQHEPK 90
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+GQY+F G+ DIVKF K V + GLY LRIGP++ +EWN+GG P WL D+PGI +R++N
Sbjct: 91 QGQYDFSGRLDIVKFFKEVQAQGLYACLRIGPFIESEWNYGGLPFWLHDVPGIIYRSDNE 150
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ F KIV+LM+ E L++ QGGPII+ QIENEY N+E+++ ++G YV+WAA M
Sbjct: 151 PFKFYMQNFTTKIVNLMKSENLYASQGGPIILSQIENEYKNVEAAFHEKGPPYVRWAAKM 210
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDG--YKPNSYNKPTLWTENWDGWYTTWGG 283
A+ L GVPWVMCKQ DAP+ +I+ACNG C PN NKP +WTENW Y +G
Sbjct: 211 AVDLQTGVPWVMCKQDDAPDPVINACNGMKCGETFAGPNKPNKPAIWTENWTSVYEVYGE 270
Query: 284 RLPHRPVEDLAFAVARFF-QRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYG 342
R EDLAF VA F ++ GSF+NYYMY GGTNFGRTS + +T+Y AP+DEYG
Sbjct: 271 DKRGRAAEDLAFQVALFIAKKNGSFINYYMYHGGTNFGRTSSS-YVLTAYYDQAPLDEYG 329
Query: 343 LLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLA 402
L+ +PKWGHLK+LHA IKLC L+ Y LGQ QEA++++ C+AFL
Sbjct: 330 LIRQPKWGHLKELHAVIKLCSDTLLHGVQYNY-SLGQLQEAYLFKR----PSGQCAAFLV 384
Query: 403 NIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNIS 462
N D+ +V F +Y L S+SILPDC+ FNTAKVS+Q + ++V+
Sbjct: 385 NNDKRRNVTVLFQNTNYELAANSISILPDCKKIAFNTAKVSTQFNTRSVQ---------- 434
Query: 463 VPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
+ ST K W +E I + +LEH+ TKD SDYLW+ +
Sbjct: 435 ------TRATFGST-KQWSEYREGIPSFGGTPLKASMLLEHMGTTKDASDYLWYTLRF-- 485
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYN 578
++ +P + +DS+ VL F+NG+ S G +V V SG N
Sbjct: 486 ------IQNSSNAQPVLRVDSLAHVLHAFVNGKYIASAHGSHQNGSFSLVNKVPLNSGLN 539
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI-DLSKILWTYQVGLKGEFQQI 637
+ LLS VGL + G +LE AG R +V++ GD D SK W YQVGL GE QI
Sbjct: 540 RISLLSVMVGLPDAGPYLEHKVAGIR-RVEIQ--DGGDSKDFSKHPWGYQVGLMGEKSQI 596
Query: 638 Y-SIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGR 696
Y S + +W L G TWYKT FDAP G DPV L GSMGKG+AWVNG IGR
Sbjct: 597 YTSPGSQKVQWHGLGSHGR-GPLTWYKTLFDAPPGNDPVVLFFGSMGKGEAWVNGQSIGR 655
Query: 697 YWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG 756
YW T G P+QTWY+VPR++L NLLV+ EE G
Sbjct: 656 YWV---------------------SYLTPSGEPSQTWYNVPRAFLNPKGNLLVVQEEESG 694
Query: 757 NPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIIS 816
+P +IS+ S VC V++SH PP+ W+ S DG S + P++ L C IS
Sbjct: 695 DPLKISIGTVSVTNVCGHVTDSHPPPIISWTT--SDDGNESHHGKIPKVQLRCPPSSNIS 752
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
I FAS+GTP G C+ ++ G+CH+P SL+V +
Sbjct: 753 KITFASFGTPVGGCESYAIGSCHSPNSLAVAEK 785
>gi|225459613|ref|XP_002284529.1| PREDICTED: beta-galactosidase 16-like [Vitis vinifera]
Length = 813
Score = 682 bits (1759), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/813 (45%), Positives = 486/813 (59%), Gaps = 67/813 (8%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YD R++II+G RR+L S IHYPR+TPEMWP LI+K+KEGG DVIETY FWN HE
Sbjct: 23 SVTYDGRSLIINGQRRLLFSGSIHYPRSTPEMWPSLISKAKEGGIDVIETYAFWNQHEPK 82
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+GQY+F G+ DIVKF K V + GLY LRIGP++ +EWN+GG P WL D+PGI +R++N
Sbjct: 83 QGQYDFSGRLDIVKFFKEVQAQGLYACLRIGPFIESEWNYGGLPFWLHDVPGIIYRSDNE 142
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ F KIV+LM+ E L++ QGGPII+ QIENEY N+E+++ ++G YV+WAA M
Sbjct: 143 PFKFYMQNFTTKIVNLMKSENLYASQGGPIILSQIENEYKNVEAAFHEKGPPYVRWAAKM 202
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDG--YKPNSYNKPTLWTENWDGWYTTWGG 283
A+ L GVPWVMCKQ DAP+ +I+ACNG C PN NKP +WTENW Y +G
Sbjct: 203 AVDLQTGVPWVMCKQDDAPDPVINACNGMKCGETFAGPNKPNKPAIWTENWTSVYEVYGE 262
Query: 284 RLPHRPVEDLAFAVARFF-QRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYG 342
R EDLAF VA F ++ GSF+NYYMY GGTNFGRTS + +T+Y AP+DEYG
Sbjct: 263 DKRGRAAEDLAFQVALFIAKKNGSFINYYMYHGGTNFGRTSSS-YVLTAYYDQAPLDEYG 321
Query: 343 LLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLA 402
L+ +PKWGHLK+LHA IKLC L+ Y LGQ QEA++++ C+AFL
Sbjct: 322 LIRQPKWGHLKELHAVIKLCSDTLLHGVQYNY-SLGQLQEAYLFKR----PSGQCAAFLV 376
Query: 403 NIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNIS 462
N D+ +V F +Y L S+SILPDC+ FNTAKVS+Q + ++V+
Sbjct: 377 NNDKRRNVTVLFQNTNYELAANSISILPDCKKIAFNTAKVSTQFNTRSVQ---------- 426
Query: 463 VPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
+ ST K W +E I + +LEH+ TKD SDYLW+ +
Sbjct: 427 ------TRATFGST-KQWSEYREGIPSFGGTPLKASMLLEHMGTTKDASDYLWYTLRF-- 477
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYN 578
++ +P + +DS+ VL F+NG+ S G +V V SG N
Sbjct: 478 ------IQNSSNAQPVLRVDSLAHVLHAFVNGKYIASAHGSHQNGSFSLVNKVPLNSGLN 531
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI-DLSKILWTYQVGLKGEFQQI 637
+ LLS VGL + G +LE AG R +V++ GD D SK W YQVGL GE QI
Sbjct: 532 RISLLSVMVGLPDAGPYLEHKVAGIR-RVEIQ--DGGDSKDFSKHPWGYQVGLMGEKSQI 588
Query: 638 Y-SIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGR 696
Y S + +W L G TWYKT FDAP G DPV L GSMGKG+AWVNG IGR
Sbjct: 589 YTSPGSQKVQWHGLGSHGR-GPLTWYKTLFDAPPGNDPVVLFFGSMGKGEAWVNGQSIGR 647
Query: 697 YWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG 756
YW T G P+QTWY+VPR++L NLLV+ EE G
Sbjct: 648 YWV---------------------SYLTPSGEPSQTWYNVPRAFLNPKGNLLVVQEEESG 686
Query: 757 NPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIIS 816
+P +IS+ S VC V++SH PP+ W+ S DG S + P++ L C IS
Sbjct: 687 DPLKISIGTVSVTNVCGHVTDSHPPPIISWTT--SDDGNESHHGKIPKVQLRCPPSSNIS 744
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
I FAS+GTP G C+ ++ G+CH+P SL+V +
Sbjct: 745 KITFASFGTPVGGCESYAIGSCHSPNSLAVAEK 777
>gi|413957070|gb|AFW89719.1| hypothetical protein ZEAMMB73_400203 [Zea mays]
Length = 809
Score = 677 bits (1747), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/801 (46%), Positives = 475/801 (59%), Gaps = 96/801 (11%)
Query: 30 LSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATP-------------- 75
L C + S V+YD +A++IDG RR+L S IHYPR+TP
Sbjct: 10 LGCAVAVSVLVAAVECAVTYDKKAVLIDGQRRILFSGSIHYPRSTPDVTAFYKISSPPTI 69
Query: 76 ------------EMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKL 123
EMW LI K+K+GG DVI+TYVFWN HE G ND
Sbjct: 70 PWRGLWLRIYGSEMWEGLIQKAKDGGLDVIQTYVFWNGHEPTPG-------ND------- 115
Query: 124 VGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMR 183
S G++ R Y E GFPVWL+ +PGI FRT+N PFK MQ F +KIV +M+
Sbjct: 116 --SDGIFF--RFEQYYFEE---SGFPVWLKYVPGISFRTDNEPFKTAMQGFTEKIVGMMK 168
Query: 184 EEMLFSWQGGPIIMLQ---------IENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
E LF+ QGGPII+ Q IENEYG +G G+ Y+ WAA MA+GLG GVP
Sbjct: 169 SENLFASQGGPIILSQASIIFSLDLIENEYGPEGREFGAAGQAYINWAAKMAVGLGTGVP 228
Query: 235 WVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLA 294
WVMCK+ DAP+ +I+ACNG+YCD + PN KPT+WTE W GW+T +GG + RPVEDLA
Sbjct: 229 WVMCKEEDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLA 288
Query: 295 FAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKD 354
FAVARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGL+ EPK HLK+
Sbjct: 289 FAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLVREPKHSHLKE 348
Query: 355 LHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTF 414
LH A+KLCE ALV+ D A LG QEA V++ S S C+AFLAN + ++ A V F
Sbjct: 349 LHRAVKLCEQALVSVDPA-ITTLGTMQEARVFQ-----SPSGCAAFLANYNSNSYAKVVF 402
Query: 415 LGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLS 474
+ Y+LPPWS+SILPDC+N VFN+A V QTS Q M
Sbjct: 403 NNEQYSLPPWSISILPDCKNVVFNSATVGVQTS-----------------QMQMWGD--G 443
Query: 475 STSKSWMTVKEPI-GVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTN 533
++S +W E + + + T G+LE LNVT+D SDYLW+IT + +S + +F +
Sbjct: 444 ASSMTWERYDEEVDSLAAAPLLTTTGLLEQLNVTRDSSDYLWYITSVDISSSE-NFLQGG 502
Query: 534 EVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLILLSQTVGL 589
++++ S L VF+NGQL GS G +K ++G N + LLS GL
Sbjct: 503 GKPLSLSVQSAGHALHVFVNGQLQGSAYGTREDRRIKYNGNASLRAGTNKIALLSVACGL 562
Query: 590 QNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWT 648
N G E G G V L G G DL+ W+YQVGLKGE + SIE + EW
Sbjct: 563 PNVGVHYETWNTGVGGPVVLHGLDEGSRDLTWQTWSYQVGLKGEQMNLNSIEGSSSVEWM 622
Query: 649 D---LTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKG 705
+ ++ P WY+ YF+ P G +P+ALD+GSMGKGQ W+NG IGRYWT A G
Sbjct: 623 QGSLIAQNQQP--LAWYRAYFETPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAYA-DG 679
Query: 706 GCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKL 765
C++ C Y G + + KC + CG PTQ WYHVP+SWLQ + NLLV+FEE GG+ +I++
Sbjct: 680 DCKE-CSYTGTFRAPKCQSGCGQPTQRWYHVPKSWLQPTRNLLVVFEELGGDSSKIALVK 738
Query: 766 RSTRIVCEQVSESHYPPVRKW 786
RS VC VSE H P ++ W
Sbjct: 739 RSVSSVCADVSEDH-PNIKNW 758
>gi|224066807|ref|XP_002302225.1| predicted protein [Populus trichocarpa]
gi|222843951|gb|EEE81498.1| predicted protein [Populus trichocarpa]
Length = 798
Score = 676 bits (1745), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/813 (44%), Positives = 490/813 (60%), Gaps = 66/813 (8%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YD R+++I+G +++ S IHYPR+TP+MWP LI+K++ GG D I+TYVFWN HE
Sbjct: 7 NVTYDSRSLVINGKHKIIFSGSIHYPRSTPQMWPYLISKARAGGLDAIDTYVFWNLHEPQ 66
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+GQY+F G+ D+V+F+K V + GLY+ LRIGP++ +EW +GG P WL D+PGI FR++N
Sbjct: 67 QGQYDFSGRKDLVRFIKEVHAQGLYVCLRIGPFIESEWTYGGLPFWLHDVPGIVFRSDNK 126
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK M+R+ K IV +++ E L++ QGGPII+ QIENEYGN+E+++ ++G YVKWAA M
Sbjct: 127 PFKYHMERYAKMIVKMLKAEKLYASQGGPIILSQIENEYGNVEAAFHEKGPPYVKWAAKM 186
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGY--KPNSYNKPTLWTENWDGWYTTWGG 283
A+GL GVPWVMCKQ DAP+ +I+ACNG C PNS KP +WTENW Y T+G
Sbjct: 187 AVGLHTGVPWVMCKQDDAPDPVINACNGLRCGETFSGPNSPRKPAIWTENWTSVYQTYGK 246
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
R ED+AF A F +GGSF+NYYMY GGTNFGRT+ + TSY AP+DEYGL
Sbjct: 247 ETRSRSAEDIAFHAALFIAKGGSFVNYYMYHGGTNFGRTA-AEYVPTSYYDQAPLDEYGL 305
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
L +PK GHLK+LHAAIKLC L++ + LGQ QEA + N C+AFL N
Sbjct: 306 LRQPKHGHLKELHAAIKLCRKPLLSRKWINF-SLGQLQEAFAFERN----SDECAAFLVN 360
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISV 463
D + A+V F G SY LPP S+SILP C+ FNTA+VS+Q +
Sbjct: 361 HDGRSNATVHFKGSSYKLPPKSISILPHCKTVAFNTAQVSTQYGTRL------------- 407
Query: 464 PQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
+ K S + W KE I + +++ +LEH+N TKD SDYLW+ + + +
Sbjct: 408 ---ATRRHKFDSIEQ-WKEYKEYIPSFDKSSLRANTLLEHMNTTKDSSDYLWYTFRFHQN 463
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYND 579
+ +T++S+ L F+NG+ GS G + + + + G N
Sbjct: 464 SSN--------AHSVLTVNSLGHNLHAFVNGEFIGSAHGSHDNKSFTLQRSLPLKRGTNY 515
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI-DLSKILWTYQVGLKGEFQQIY 638
+ LLS GL + GA+LE+ AG R ++T + ++ D + LW Y+VGL GE Q++
Sbjct: 516 VSLLSVMTGLPDAGAYLERRVAGLR---RVTIQRQHELHDFTTYLWGYKVGLSGENIQLH 572
Query: 639 SIEEN-EAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
+ +A W+ P TWYK+ FDAP G DPVAL+L SMGKG+AWVNG IGRY
Sbjct: 573 RNNASVKAYWSRYASSSRP--LTWYKSIFDAPAGNDPVALNLASMGKGEAWVNGRSIGRY 630
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
W +SD GNP QTW H+PRS+L+ S NLLVI EE GN
Sbjct: 631 WVSF---------------LDSD------GNPYQTWNHIPRSFLKPSGNLLVILEEERGN 669
Query: 758 PFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDG-KLSINKMAPEMHLHCQDGYIIS 816
P IS+ S VC VS SH PPV W ++G + P++ L C G IS
Sbjct: 670 PLGISLGTMSITKVCGHVSISHPPPVISWQGENQINGTRKRKYGRRPKVQLRCPRGRKIS 729
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
S+ F+S+GTP G C+ ++ G+CHA S + V +
Sbjct: 730 SVLFSSFGTPSGDCETYAIGSCHASNSRATVEK 762
>gi|30699255|ref|NP_177866.2| beta-galactosidase 16 [Arabidopsis thaliana]
gi|152013367|sp|Q8GX69.2|BGL16_ARATH RecName: Full=Beta-galactosidase 16; Short=Lactase 16; Flags:
Precursor
gi|332197854|gb|AEE35975.1| beta-galactosidase 16 [Arabidopsis thaliana]
Length = 815
Score = 674 bits (1739), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/823 (45%), Positives = 501/823 (60%), Gaps = 86/823 (10%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YD R++IIDG ++L S IHY R+TP+MWP LIAK+K GG DV++TYVFWN HE
Sbjct: 24 NVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVVDTYVFWNVHEPQ 83
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+GQ++F G DIVKF+K V + GLY+ LRIGP++ EW++GG P WL ++ GI FRT+N
Sbjct: 84 QGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGIVFRTDNE 143
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK M+R+ K IV LM+ E L++ QGGPII+ QIENEYG + ++ Q+GK YVKW A +
Sbjct: 144 PFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVGRAFRQEGKSYVKWTAKL 203
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGG 283
A+ L GVPWVMCKQ DAP+ +++ACNG C + +K PNS NKP +WTENW +Y T+G
Sbjct: 204 AVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSFYQTYGE 263
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
R ED+AF VA F + GSF+NYYMY GGTNFGR + F ITSY AP+DEYGL
Sbjct: 264 EPLIRSAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVITSYYDQAPLDEYGL 322
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSN-CSAFLA 402
L +PKWGHLK+LHAA+KLCE L++ I LG+ Q A V +G ++N C+A L
Sbjct: 323 LRQPKWGHLKELHAAVKLCEEPLLSGLQTT-ISLGKLQTAFV-----FGKKANLCAAILV 376
Query: 403 NIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNIS 462
N D+ ++V F SY L P SVS+LPDC+N FNTAKV++Q + +T + N+S
Sbjct: 377 NQDK-CESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYNTRTRK----ARQNLS 431
Query: 463 VPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
PQ W E + +SE + + +LEH+N T+D SDYLW T+
Sbjct: 432 SPQM-------------WEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQTTRFQQ 478
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYN 578
S+ S K N + L F+NG+ GS+ G H + + + +G N
Sbjct: 479 SEGAPSVLKVNH---------LGHALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTN 529
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDL--SKILWTYQVGLKGEFQQ 636
+L LLS VGL N GA LE+ G R VK+ NG L + W YQVGLKGE
Sbjct: 530 NLALLSVMVGLPNSGAHLERRVVGSR-SVKIW---NGRYQLYFNNYSWGYQVGLKGEKFH 585
Query: 637 IYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
+Y+ + + +W RD TWYK FD P+G DPVAL+LGSMGKG+AWVNG IG
Sbjct: 586 VYTEDGSAKVQWKQY-RDSKSQPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIG 644
Query: 696 RYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
RYW ++++ K GNP+Q WYH+PRS+L+ ++NLLVI EE
Sbjct: 645 RYWV----------------SFHTYK-----GNPSQIWYHIPRSFLKPNSNLLVILEEER 683
Query: 756 -GNPFEISVKLRSTRIVCEQVSESHYPPV----RKWSN----SYSVDGKLSINKMAPEMH 806
GNP I++ S VC VS ++ PV +K N +Y D K P++
Sbjct: 684 EGNPLGITIDTVSVTEVCGHVSNTNPHPVISPRKKGLNRKNLTYRYDRK-------PKVQ 736
Query: 807 LHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
L C G IS I FAS+GTP G C +S G+CH+P SL+VV +
Sbjct: 737 LQCPTGRKISKILFASFGTPNGSCGSYSIGSCHSPNSLAVVQK 779
>gi|26451843|dbj|BAC43014.1| unknown protein [Arabidopsis thaliana]
gi|29029060|gb|AAO64909.1| At1g77410 [Arabidopsis thaliana]
Length = 820
Score = 674 bits (1738), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/823 (45%), Positives = 501/823 (60%), Gaps = 86/823 (10%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YD R++IIDG ++L S IHY R+TP+MWP LIAK+K GG DV++TYVFWN HE
Sbjct: 24 NVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVVDTYVFWNVHEPQ 83
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+GQ++F G DIVKF+K V + GLY+ LRIGP++ EW++GG P WL ++ GI FRT+N
Sbjct: 84 QGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGIVFRTDNE 143
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK M+R+ K IV LM+ E L++ QGGPII+ QIENEYG + ++ Q+GK YVKW A +
Sbjct: 144 PFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVGRAFRQEGKSYVKWTAKL 203
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGG 283
A+ L GVPWVMCKQ DAP+ +++ACNG C + +K PNS NKP +WTENW +Y T+G
Sbjct: 204 AVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSFYQTYGE 263
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
R ED+AF VA F + GSF+NYYMY GGTNFGR + F ITSY AP+DEYGL
Sbjct: 264 EPLIRSAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVITSYYDQAPLDEYGL 322
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSN-CSAFLA 402
L +PKWGHLK+LHAA+KLCE L++ I LG+ Q A V +G ++N C+A L
Sbjct: 323 LRQPKWGHLKELHAAVKLCEEPLLSGLQTT-ISLGKLQTAFV-----FGKKANLCAAILV 376
Query: 403 NIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNIS 462
N D+ ++V F SY L P SVS+LPDC+N FNTAKV++Q + +T + N+S
Sbjct: 377 NQDK-CESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYNTRTRK----ARQNLS 431
Query: 463 VPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
PQ W E + +SE + + +LEH+N T+D SDYLW T+
Sbjct: 432 SPQM-------------WEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQTTRFQQ 478
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYN 578
S+ S K N + L F+NG+ GS+ G H + + + +G N
Sbjct: 479 SEGAPSVLKVNH---------LGHALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTN 529
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDL--SKILWTYQVGLKGEFQQ 636
+L LLS VGL N GA LE+ G R VK+ NG L + W YQVGLKGE
Sbjct: 530 NLALLSVMVGLPNSGAHLERRVVGSR-SVKIW---NGRYQLYFNNYSWGYQVGLKGEKFH 585
Query: 637 IYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
+Y+ + + +W RD TWYK FD P+G DPVAL+LGSMGKG+AWVNG IG
Sbjct: 586 VYTEDGSAKVQWKQY-RDSKSQPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIG 644
Query: 696 RYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
RYW ++++ K GNP+Q WYH+PRS+L+ ++NLLVI EE
Sbjct: 645 RYWV----------------SFHTYK-----GNPSQIWYHIPRSFLKPNSNLLVILEEER 683
Query: 756 -GNPFEISVKLRSTRIVCEQVSESHYPPV----RKWSN----SYSVDGKLSINKMAPEMH 806
GNP I++ S VC VS ++ PV +K N +Y D K P++
Sbjct: 684 EGNPLGITIDTVSVTEVCGHVSNTNPHPVISPRKKGLNRKNLTYRYDRK-------PKVQ 736
Query: 807 LHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
L C G IS I FAS+GTP G C +S G+CH+P SL+VV +
Sbjct: 737 LQCPTGRKISKILFASFGTPNGSCGSYSIGSCHSPNSLAVVQK 779
>gi|357455519|ref|XP_003598040.1| Beta-galactosidase [Medicago truncatula]
gi|355487088|gb|AES68291.1| Beta-galactosidase [Medicago truncatula]
Length = 812
Score = 674 bits (1738), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/845 (43%), Positives = 491/845 (58%), Gaps = 102/845 (12%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
M ++ L+ + + S++T V YD AII++G R+++IS IHYPR+T +MWPD
Sbjct: 5 MFGTFLIACLALLYTCSSAT-----TVEYDSSAIILNGERKLIISGAIHYPRSTSQMWPD 59
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
LI K+K+G D IETY+FW+ HE +R +Y+F G D +KF+K+ GLY+ LRIGPYVC
Sbjct: 60 LIMKAKDGDLDAIETYIFWDLHEPVRRKYDFSGNLDFIKFLKIAQEQGLYVVLRIGPYVC 119
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEWN+GGFP+WL ++PGI+ RT+NA FKEEM+ F KIV + +E LF+ QGGPII+ QI
Sbjct: 120 AEWNYGGFPMWLHNMPGIQLRTDNAVFKEEMKIFTTKIVTMCKEAGLFAPQGGPIILAQI 179
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYK 260
ENEYG++ S YG+ G Y+KW A MAL GVPW+MCKQ +AP IID CNGYYCD +K
Sbjct: 180 ENEYGDVISHYGEAGNSYIKWCAEMALAQNIGVPWIMCKQKNAPATIIDTCNGYYCDTFK 239
Query: 261 PNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG 320
PN+ P ++TENW GW+ WG R PHR ED AF+VARFFQ GG+ NYY+Y GGTNFG
Sbjct: 240 PNNPKSPKIFTENWVGWFQKWGERRPHRTAEDSAFSVARFFQNGGALQNYYLYHGGTNFG 299
Query: 321 RTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQN 380
RT+GGPF IT+YDYDAP+DEYG L EPK+GHLK LHAAIKL E L +A + G +
Sbjct: 300 RTAGGPFIITTYDYDAPLDEYGNLIEPKYGHLKRLHAAIKLGEKVLTNG-TATWESHGDS 358
Query: 381 QEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQ-SYTLPPWSVSILPDCRNTVFNT 439
Y N+ Q C FL+N A V Y +P WS+S+L DC V+NT
Sbjct: 359 LWMTTY-TNKGTGQKFC--FLSNSHTSKDAEVDLQQDGKYYVPAWSMSLLQDCNKEVYNT 415
Query: 440 AKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQG 499
AK +QT+I + L + P+ S + T + + FT
Sbjct: 416 AKTEAQTNIYMKQLDQKLG---NSPEWSWTSDPMEDTFQ------------GKGTFTASQ 460
Query: 500 ILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGS 559
+L+ +VT SDYLW++T++ V+D + W + V +++ +L +FING LTG+
Sbjct: 461 LLDQKSVTVGASDYLWYMTEVVVNDTNT--WG----KAKVQVNTTGHILYLFINGFLTGT 514
Query: 560 VIGHWVKVVQP-------VEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGF-RGQVKLTG 611
G V QP + G N + LLS TVG NYGAF + G G VKL
Sbjct: 515 QHG---TVSQPGFIHEGNISLNQGTNIISLLSVTVGHANYGAFFDMQETGIVGGPVKLFS 571
Query: 612 FKNGD--IDLSKILWTYQVGLKGEFQQIYSIEEN-EAEW-TDLTRDGIPSTFTWYKTYFD 667
+N + +DLSK W+Y+VG+ G ++ Y + +W T+ G+P TWYKT F
Sbjct: 572 IENPNNVLDLSKSTWSYKVGINGMTKKFYDPKTTIGVQWKTNNVSIGVP--MTWYKTTFK 629
Query: 668 APDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNC 726
PDG +PV LDL + KG+AWVNG IGRYW ++A GC DTCDYRG YN+DKC + C
Sbjct: 630 TPDGTNPVVLDLIGLQKGEAWVNGQSIGRYWPAMLAENKGCSDTCDYRGEYNADKCLSGC 689
Query: 727 GNPTQTWYHVPRSWLQASNNLLVIFEETG--GNPFEISVKLRSTRIVCEQVSESHYPPVR 784
G P+Q +YHVPRS+L N LV+FEE G PF
Sbjct: 690 GEPSQRFYHVPRSFLNNDVNTLVLFEEMGFDATPF------------------------- 724
Query: 785 KWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSL 844
+G +S I+FASYG P+G C F G + S
Sbjct: 725 --------------------------NGKTMSEIQFASYGDPEGSCGSFKIGEWESRYSK 758
Query: 845 SVVSE 849
+VV +
Sbjct: 759 TVVEK 763
>gi|297842521|ref|XP_002889142.1| hypothetical protein ARALYDRAFT_476906 [Arabidopsis lyrata subsp.
lyrata]
gi|297334983|gb|EFH65401.1| hypothetical protein ARALYDRAFT_476906 [Arabidopsis lyrata subsp.
lyrata]
Length = 818
Score = 671 bits (1730), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/819 (45%), Positives = 494/819 (60%), Gaps = 75/819 (9%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YD R++IIDG ++L S IHY R+TP+MWP LIAK+K GG DVI+TYVFWN HE
Sbjct: 24 NVTYDGRSLIIDGQHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVIDTYVFWNIHEPQ 83
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+GQ++F G+ DIVKF+K V + GLY+ LRIGP++ EW++GG P WL ++ GI FRT+N
Sbjct: 84 QGQFDFSGRRDIVKFIKEVKAHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGIVFRTDNE 143
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK M+R+ + IV LM+ E L++ QGGPII+ QIENEYG + ++ Q GK YVKWAA +
Sbjct: 144 PFKYHMKRYAQMIVKLMKSENLYASQGGPIILSQIENEYGMVARAFRQDGKSYVKWAAKL 203
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGG 283
A+ L GVPWVMCKQ DAP+ +++ACNG C + +K PNS NKP +WTENW +Y T+G
Sbjct: 204 AVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSFYQTYGE 263
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
R ED+AF VA F + GSF+NYYMY GGTNFGR + F ITSY AP+DEYGL
Sbjct: 264 EPLIRSAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVITSYYDQAPLDEYGL 322
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSN-CSAFLA 402
L +PKWGHLK+LHAA+KLCE L++ I LG+ Q A V +G ++N C+A L
Sbjct: 323 LRQPKWGHLKELHAAVKLCEEPLLSGLQTT-ISLGKLQTAFV-----FGKKANLCAALLV 376
Query: 403 NIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNIS 462
N D+ +V F SY L P S+S+LPDC+N FNTAKV++Q + +T +
Sbjct: 377 NQDK-CDCTVQFRNSSYRLSPKSISVLPDCKNVAFNTAKVNAQYNTRTRK---------- 425
Query: 463 VPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
P+Q++ S+ W E + +SE + + +LEH+N T+D SDYLW T+
Sbjct: 426 -PRQNL------SSPHMWEKFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQTTRFEQ 478
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYN 578
S+ S K N + VL F+N + GS+ G H + + + +G N
Sbjct: 479 SEGAPSVLKVNH---------LGHVLHAFVNERFIGSMHGTFKAHSFLLEKNMSLNNGTN 529
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDL--SKILWTYQVGLKGEFQQ 636
++ LLS VGL N GA LE+ G R NG L + W YQVGLKGE
Sbjct: 530 NMALLSVMVGLPNSGAHLERRVVGSRS----VNIWNGSYQLFFNNYSWGYQVGLKGEKYH 585
Query: 637 IYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
+Y+ + + +W RD TWYK FD P+G DPVAL+LGSMGKG+AWVNG IG
Sbjct: 586 VYTEDGAKKVQWKQY-RDSKSQPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIG 644
Query: 696 RYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
RYW T+ GNP+Q WYH+PRS+L+ ++NLLVI EE
Sbjct: 645 RYWVSF---------------------YTSKGNPSQIWYHIPRSFLKPNSNLLVILEEER 683
Query: 756 -GNPFEISVKLRSTRIVCEQVSESHYPPV---RKWSNSYSVDGKLSIN-KMAPEMHLHCQ 810
G P I++ S VC VS +H PV RK ++ + L P++ L C
Sbjct: 684 EGYPLGITIDTVSVTEVCGHVSNTHPHPVISPRKKGHNRNEQRHLKYRYDRKPKVQLQCP 743
Query: 811 DGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
G IS + FA++G P G C +S G+CH+P SL+VV +
Sbjct: 744 TGRKISKVLFATFGNPNGSCGSYSVGSCHSPNSLAVVQK 782
>gi|302141787|emb|CBI18990.3| unnamed protein product [Vitis vinifera]
Length = 817
Score = 668 bits (1724), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/818 (44%), Positives = 494/818 (60%), Gaps = 79/818 (9%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD R++II+G R++L S IHYPR+TPEMWP LI+++K+GG DVIETYVFWN HE
Sbjct: 28 VTYDGRSLIINGQRKILFSGSIHYPRSTPEMWPSLISQAKQGGIDVIETYVFWNQHEPKP 87
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY+F G+ DIV+F++ V + GLY LRIGP++ AEWN+GGFP WL D+PGI +RT+N P
Sbjct: 88 GQYDFSGRRDIVRFIREVQAQGLYACLRIGPFIQAEWNYGGFPFWLHDVPGIVYRTDNEP 147
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK M+ F KIV++M+ E L++ QGGPII+ QIENEY +E+++G+ GK YV WAA+MA
Sbjct: 148 FKFYMRNFTTKIVEIMKSENLYASQGGPIILQQIENEYKTVEANFGEAGKRYVLWAANMA 207
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDG--YKPNSYNKPTLWTENWDGWYTTWGGR 284
+GL GVPWVMCKQ DAP+ +I++CNG C PNS NKP +WTENW Y +G
Sbjct: 208 VGLETGVPWVMCKQDDAPDPVINSCNGRLCGETFAGPNSPNKPAIWTENWTSSYPLFGED 267
Query: 285 LPHRPVEDLAFAVARFFQR-GGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
RPVED+AF VA F + GSF+NYYMY GGTNFGRT+ + T+Y +AP+DEYGL
Sbjct: 268 ARPRPVEDIAFHVALFVAKMNGSFINYYMYHGGTNFGRTASA-YVQTAYYDEAPLDEYGL 326
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
+ +P WGHLK+LHAA+KLC L+ + + QEA+V+R G C+AFL N
Sbjct: 327 IQQPTWGHLKELHAAVKLCSETLLQGAQSNLSLGTKLQEAYVFR----GQSGKCAAFLVN 382
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISV 463
D T +V F SY LP S+SILPDC+N FNTAK S + + +++
Sbjct: 383 NDSRTDVTVVFQNTSYELPRKSISILPDCKNEAFNTAKASFRPGLISIQ----------- 431
Query: 464 PQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
+K +ST + W KE I + + + +LEH+N TKD SDYLW+ +
Sbjct: 432 -----TVTKFNSTEQ-WEEYKESILNFDDTSSRANTLLEHMNTTKDASDYLWY---TFRY 482
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVV----QPVEFQSGYND 579
++D S ++ ++ +S L FING+ TGS G + V F++G N+
Sbjct: 483 NNDPSNGQS-----VLSTNSRAHALHAFINGRHTGSQHGSSSNLSFSLDNTVSFRAGINN 537
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI-DLSKILWTYQVGLKGEFQQIY 638
+ LLS VGL + GA+LE+ AG R +V++ NG + D + W YQVGL GE QIY
Sbjct: 538 VSLLSVMVGLPDSGAYLERRVAGLR-RVRIQ--SNGSLKDFTNNPWGYQVGLLGEKLQIY 594
Query: 639 S-IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
+ + + +W+ TWYKT FDAP G +PVAL+L SM KG+ WVNG IGRY
Sbjct: 595 TDVGSQKVQWSKFG-SSTSGLLTWYKTVFDAPAGNEPVALNLVSMRKGEVWVNGQSIGRY 653
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
W T G P+Q WYH+PRS+L+ + NLLV+ EE G+
Sbjct: 654 WVSF---------------------LTPSGKPSQIWYHIPRSFLKPTGNLLVLLEEETGH 692
Query: 758 PFEISVKLRSTRIVCEQVSESHYPPV------RKWSNSYSVDGKLSINKMAPEMHLHCQD 811
P IS+ S +C VSESH PPV +K N + P++ L C
Sbjct: 693 PVGISIGKVSIPKICGHVSESHLPPVISRVIYKKHENHHG---------RRPKVQLRCPS 743
Query: 812 GYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
IS I FAS+GTP G CQ ++ G+CH+ S S V +
Sbjct: 744 NRNISRILFASFGTPSGDCQSYAVGSCHSSNSRSNVEK 781
>gi|297851602|ref|XP_002893682.1| Beta-galactosidase 15 precursor [Arabidopsis lyrata subsp. lyrata]
gi|297339524|gb|EFH69941.1| Beta-galactosidase 15 precursor [Arabidopsis lyrata subsp. lyrata]
Length = 780
Score = 668 bits (1723), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/840 (43%), Positives = 493/840 (58%), Gaps = 112/840 (13%)
Query: 22 MMMMMMIHLSC--VSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWP 79
MM+ + L C VSS + +T VS+D RAI IDG+RR+L+S IHYPR+T EMWP
Sbjct: 1 MMISLKFLLCCLLVSSCAYATI-----VSHDGRAITIDGHRRVLLSGSIHYPRSTTEMWP 55
Query: 80 DLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYV 139
DLI K KEGG D IETYVFWNAHE R QY+F G D+++F+K + G+Y LRIGPYV
Sbjct: 56 DLIKKGKEGGLDAIETYVFWNAHEPTRRQYDFSGNLDLIRFLKTIQDEGMYGVLRIGPYV 115
Query: 140 CAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQ 199
CAEWN+GGFPVWL ++PG+EFRT N F EMQ F IV+++++E LF+ QGGPII+ Q
Sbjct: 116 CAEWNYGGFPVWLHNMPGMEFRTTNTAFMNEMQNFTTMIVEMVKKEKLFASQGGPIILAQ 175
Query: 200 IENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGY 259
IENEYGN+ SYG+ GK Y+KW A+MA L GVPW+MC+Q DAP+ +++ CNGYYCD +
Sbjct: 176 IENEYGNVIGSYGEAGKAYIKWCANMANSLDVGVPWIMCQQDDAPQPMLNTCNGYYCDNF 235
Query: 260 KPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNF 319
PN+ N P +WTENW GWY WGG+ PHR ED+AFAVARFFQRGG+F NYYMY GGTNF
Sbjct: 236 TPNNPNTPKMWTENWTGWYKNWGGKDPHRTTEDVAFAVARFFQRGGTFQNYYMYHGGTNF 295
Query: 320 GRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQ 379
RT+GGP+ T+YDYDAP+DE+G L++PK+GHLK LH + E L + + + G
Sbjct: 296 DRTAGGPYITTTYDYDAPLDEFGNLNQPKYGHLKQLHDVLHAMEKTLTYGNIST-VDFGN 354
Query: 380 NQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNT 439
A VY+ ++ S F+ N++E + A + F G Y +P WSVSILPDC+ +NT
Sbjct: 355 LVTATVYK-----TEEGSSCFIGNVNETSDAKINFQGTFYDVPAWSVSILPDCKTETYNT 409
Query: 440 AKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVK-EPIGVWSENNFTVQ 498
AK+++QTS+ V + + E++ S+ SW + + + + T++
Sbjct: 410 AKINTQTSVM-------------VKKANEAENEPSTLKWSWRPENIDNVLLKGKGESTMR 456
Query: 499 GILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG 558
+ + V+ D SDYLW++T + + + D + K +R I+S VL F+NGQ G
Sbjct: 457 QLFDQKVVSNDESDYLWYMTTVNIKEQDPVWGKNMSLR----INSTAHVLHAFVNGQHIG 512
Query: 559 SVIG-----HWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFK 613
+ H+V Q +F G N + LLS TVGL NYGAF E AG G V + G +
Sbjct: 513 NYRAENGKFHYV-FEQDAKFNPGANVITLLSITVGLPNYGAFFENVPAGITGPVFIIG-R 570
Query: 614 NGD----IDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAP 669
NGD DLS W+Y+ GL G Q++S E PST++ AP
Sbjct: 571 NGDETIVKDLSTHKWSYKTGLSGFENQLFSSES-------------PSTWS-------AP 610
Query: 670 DGIDPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGN 728
G +PV +DL +GKG AW+NG++IGRYW +A GC
Sbjct: 611 LGSEPVVVDLLGLGKGTAWINGNNIGRYWPAFLADIDGCSAE------------------ 652
Query: 729 PTQTWYHVPRSWLQAS-NNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWS 787
YHVPRS+L + +N LV+FEE GGNP ++ + VC V E +
Sbjct: 653 -----YHVPRSFLNSDGDNTLVLFEEIGGNPSLVNFQTIGVGNVCANVYEKNV------- 700
Query: 788 NSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
+ L C +G ISSI+FAS+G P G C F +G C A + +
Sbjct: 701 -----------------LELSC-NGKPISSIKFASFGNPGGNCGSFEKGTCEASNDAAAI 742
>gi|183238712|gb|ACC60982.1| beta-galactosidase 2 precursor [Petunia x hybrida]
Length = 830
Score = 667 bits (1722), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/813 (43%), Positives = 494/813 (60%), Gaps = 62/813 (7%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD R++I++G R +L S IHYPR PEMWP++I K+KEGG +VI+TYVFWN HE ++
Sbjct: 28 VTYDGRSMIVNGERELLFSGSIHYPRMPPEMWPEIIRKAKEGGLNVIQTYVFWNIHEPVQ 87
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQ+NF+G D+VKF+K +G GLY+ LRIGPY+ AEWN GGFP WLR++P I FR+ N P
Sbjct: 88 GQFNFEGNYDLVKFIKAIGEQGLYVTLRIGPYIEAEWNQGGFPYWLREVPNITFRSYNEP 147
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
F M+++ + ++DL+++E LF+ QGGPIIM QIENEY N++ +Y GK Y++WAA+MA
Sbjct: 148 FIHHMKKYSEMVIDLVKKEKLFAPQGGPIIMAQIENEYNNVQLAYRDNGKKYIEWAANMA 207
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGGR 284
L GVPW+MCKQ DAP +I+ CNG +C D + PN NKP+LWTENW Y T+G
Sbjct: 208 TSLYNGVPWIMCKQKDAPPQVINTCNGRHCADTFTGPNGPNKPSLWTENWTAQYRTFGDP 267
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
R ED+AF+VARFF + G+ NYYMY+GGTN+GRTS F T Y +AP+DE+GL
Sbjct: 268 PSQRAAEDIAFSVARFFAKNGTLTNYYMYYGGTNYGRTSSS-FVTTRYYDEAPLDEFGLY 326
Query: 345 SEPKWGHLKDLHAAIKLCEPALV-AADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
EPKW HL+DLH A++L AL+ + Q I NQ+ + + GS ++C+AFL N
Sbjct: 327 REPKWSHLRDLHRALRLSRRALLWGTPTVQKI----NQDLEITVFEKPGS-TDCAAFLTN 381
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISV 463
+++ F G+ Y LP SVSILPDC+ V+NT + SQ +
Sbjct: 382 NHTTQPSTIKFRGKDYYLPEKSVSILPDCKTVVYNTQTIVSQHN---------------- 425
Query: 464 PQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
++ I S+ S K W +E + ++ + LE ++TKD SDY W+ T I +
Sbjct: 426 -SRNFITSEKSKNLK-WEMYQEKVPTIADLPLKNREPLELYSLTKDTSDYAWYSTSITLE 483
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYND 579
D+ ++ P + I SM L F+NG+ G G+ ++ +P+ + G N
Sbjct: 484 RHDLPM--RPDILPVLQIASMGHALAAFVNGEYVGFGHGNNIEKSFVFQKPIILKPGTNT 541
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS 639
+ +L++TVG N GA++EK AG RG V + G G +D+++ W ++VG+ GE Q++++
Sbjct: 542 ITILAETVGFPNSGAYMEKRFAGPRG-VTIQGLMAGTLDITQNNWGHEVGVFGEKQELFT 600
Query: 640 IE-ENEAEWTDLTRDGIPS-TFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
E + +WT +T G P TWYKTYFDAP+G +PVAL + M KG WVNG +GRY
Sbjct: 601 EEGAKKVQWTPVT--GPPKGAVTWYKTYFDAPEGNNPVALKMDKMEKGMMWVNGKSLGRY 658
Query: 698 WT-VVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG 756
WT ++P G PTQ YH+PR++L+ +NNLLVIFEETGG
Sbjct: 659 WTSFLSP----------------------LGQPTQAEYHIPRAYLKPTNNLLVIFEETGG 696
Query: 757 NPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIIS 816
+P I V+ + +C ++E H P V+ W S D + + HL C D II
Sbjct: 697 HPTNIEVQTVNRDTICSIITEYHPPHVKSWERS-GTDFVAVVEDLKSGAHLTCPDNKIIE 755
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+EFASYG P G C GNC++ SL VV +
Sbjct: 756 KVEFASYGNPDGACGNLFNGNCNSANSLKVVEQ 788
>gi|224142776|ref|XP_002324727.1| predicted protein [Populus trichocarpa]
gi|222866161|gb|EEF03292.1| predicted protein [Populus trichocarpa]
Length = 749
Score = 662 bits (1709), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/799 (43%), Positives = 469/799 (58%), Gaps = 81/799 (10%)
Query: 77 MWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIG 136
MWP+L K+KEGG D IETY+FW+ HE +R QY F G DIVKF KL +GL++ LRIG
Sbjct: 1 MWPELFQKAKEGGIDAIETYIFWDRHEPVRRQYYFSGNQDIVKFCKLAQEAGLHVILRIG 60
Query: 137 PYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPII 196
PYVCAEW++GGFP+WL +IPGIE RT+N +K EMQ F KIVD+ +E LF+ QGGPII
Sbjct: 61 PYVCAEWSYGGFPMWLHNIPGIELRTDNEIYKNEMQIFTTKIVDVCKEAKLFAPQGGPII 120
Query: 197 MLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYC 256
+ QIENEYGN+ YG G+ YV W A MA+G GVPW+MC+Q++AP+ +I+ CNG+YC
Sbjct: 121 LAQIENEYGNVMGPYGDAGRRYVNWCAQMAVGQNVGVPWIMCQQSNAPQPMINTCNGFYC 180
Query: 257 DGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGG 316
D +KPN+ P +WTENW GW+ WGGR P+R EDLAF+VARF Q GG +YYMY GG
Sbjct: 181 DQFKPNNPKSPKMWTENWSGWFKLWGGRDPYRTAEDLAFSVARFIQNGGVLNSYYMYHGG 240
Query: 317 TNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAAD-SAQYI 375
TNFGRT+GGP+ TSYDY+AP+DEYG L++PKWGHLK LH AIK E L +++
Sbjct: 241 TNFGRTAGGPYITTSYDYNAPLDEYGNLNQPKWGHLKQLHEAIKQGERILTNGTVTSKNF 300
Query: 376 KLGQNQEAHVYRAN--RYGSQSNCSAFLANIDEHTAASVTFLGQ--SYTLPPWSVSILPD 431
G +Q + + R+ SN + AN+D LGQ Y+LP WSV+IL D
Sbjct: 301 WGGVDQTTYTNQGTGERFCFLSNTNMEEANVD---------LGQDGKYSLPAWSVTILQD 351
Query: 432 CRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIG--V 489
C ++NTAKV++QTSI + P SW EP+ +
Sbjct: 352 CNKEIYNTAKVNTQTSIMVKKLHEEDKP----------------VQLSWTWAPEPMKGVL 395
Query: 490 WSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLR 549
+ F +LE T D +DYLW++T + +++ + W T+ + + L
Sbjct: 396 QGKGRFRATELLEQKETTVDTTDYLWYMTSVNLNETTLKKW----TNVTLRVGTRGHTLH 451
Query: 550 VFINGQLTGSVIGHWVKVVQ-------------PVEFQSGYNDLILLSQTVGLQNYGAFL 596
++N + G+ Q PV SG N + LLS TVGL NYG +
Sbjct: 452 AYVNKKEIGTQFSKQANAQQSVKGDDYSFLFEKPVTLTSGTNTISLLSATVGLANYGQYY 511
Query: 597 EKDGAGF-RGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLT-RDG 654
+K G G V+L +DL+ W+Y++GL GE ++ + N + T D
Sbjct: 512 DKKPVGIAEGPVQLVANGKPFMDLTSYQWSYKIGLSGEAKRYN--DPNSPHASKFTASDN 569
Query: 655 IPS--TFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTC 711
+P+ TWYKT F +P G +PV +DL MGKG AWVNG +GR+W T +A GC DTC
Sbjct: 570 LPTGRAMTWYKTTFASPSGTEPVVVDLLGMGKGHAWVNGKSLGRFWPTQIADAKGCPDTC 629
Query: 712 DYRGAYNSDKCTTNCGNPTQTWYHVPRSWL-QASNNLLVIFEETGGNPFEISVKLRSTRI 770
DYRG+YN DKC TNCGNP+Q WYH+PRS+L + N L++FEE GGNP +S ++ +
Sbjct: 630 DYRGSYNGDKCVTNCGNPSQRWYHIPRSYLNKDGQNTLILFEEVGGNPTNVSFQIVAVET 689
Query: 771 VCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRC 830
+C N+Y + L C+ G IS I+FASYG P+G C
Sbjct: 690 IC--------------GNAYE----------GSTLELSCEGGRTISDIQFASYGDPEGTC 725
Query: 831 QKFSRGNCHAPMSLSVVSE 849
F +G+ +A S +VV +
Sbjct: 726 GAFMKGSFYATRSAAVVEK 744
>gi|449464182|ref|XP_004149808.1| PREDICTED: beta-galactosidase 16-like [Cucumis sativus]
Length = 801
Score = 660 bits (1703), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/815 (44%), Positives = 489/815 (60%), Gaps = 79/815 (9%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+ +YD R++I++G ++L S IHYPR+TP+MWP LIAK+KEGG DVI+TYVFWN HE
Sbjct: 15 SATYDGRSLIVNGEHKLLFSGSIHYPRSTPDMWPSLIAKAKEGGIDVIQTYVFWNLHEPQ 74
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+G Y F G+ DIV+FVK + + GLY LRIGP++ AEW++GG P WL D+ GI +R++N
Sbjct: 75 QGTYEFSGRRDIVRFVKEIQAQGLYACLRIGPFIEAEWSYGGLPFWLHDVLGIVYRSDNE 134
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ F KIV++M+ E L++ QGGPII+ QIENEY +E+++G++G YV+WAA M
Sbjct: 135 PFKLHMQNFTTKIVNMMKSEGLYASQGGPIILSQIENEYTLVEAAFGEKGPPYVQWAAKM 194
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDG--YKPNSYNKPTLWTENWDGWYTTWGG 283
A+ L GVPW MCKQ DAP+ +I+ CNG C PNS NKP++WTENW +Y T+G
Sbjct: 195 AVSLQTGVPWSMCKQNDAPDPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGE 254
Query: 284 RLPHRPVEDLAFAVARFF-QRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYG 342
R E++AF VA F + G+++NYYMY GGTNFGR S F IT Y +P+DEYG
Sbjct: 255 EPYIRSAEEIAFHVALFIAAKNGTYVNYYMYHGGTNFGR-SASAFMITGYYDQSPLDEYG 313
Query: 343 LLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLA 402
L EPKWGHLK+LHAA+KLC L+ + + LGQ+ EA V++ + C+AFL
Sbjct: 314 LTREPKWGHLKELHAAVKLCSTPLLTGTKSNF-SLGQSVEAIVFKT----ESNECAAFLV 368
Query: 403 N---IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSP 459
N ID ++V F +Y LP S+SILPDC+N FNT +VS Q + +++
Sbjct: 369 NRGAID----SNVLFQNVTYELPLGSISILPDCKNVAFNTRRVSVQHNTRSM-------- 416
Query: 460 NISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQ 519
++V + ++E W KEPI + +LEH+ TKD SDYLW+ +
Sbjct: 417 -MAVQKFDLLE---------WEEFKEPIPNIDDTELRANELLEHMGTTKDRSDYLWYTFR 466
Query: 520 IYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQS 575
+ D + T+ +DS L F+NG GS G + + + + + ++
Sbjct: 467 VQQDSPD--------SQQTLEVDSRAHALHAFVNGDYAGSAHGIYKEKGFSLAKNITLRN 518
Query: 576 GYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQ 635
G N++ LLS VGL + GAFLE AG R +V + G D S+ W Y+VGL GE
Sbjct: 519 GINNISLLSVMVGLPDSGAFLETRVAGLR-RVGIQG-----EDFSEQHWGYKVGLSGEQS 572
Query: 636 QIY-SIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHI 694
QI+ + +W+ L P TWYKT FDAP G DP+AL+LGSMGKG WVNG I
Sbjct: 573 QIFLDTGSSNVQWSRLGNSSQP--LTWYKTQFDAPPGDDPIALNLGSMGKGAVWVNGRGI 630
Query: 695 GRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEE 753
GRYW + + PK G P+Q WY+VPRS+L+ ++N LVI EE
Sbjct: 631 GRYWVSFLTPK----------------------GEPSQKWYNVPRSFLKPTDNQLVILEE 668
Query: 754 TGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKM-APEMHLHCQDG 812
GNP EIS+ C QVSESHYP V W + + N+ P++ L C
Sbjct: 669 ETGNPVEISLDSVLITKTCGQVSESHYPLVASWMGAKKQKVRRVKNRTRRPKVQLSCPSK 728
Query: 813 YIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
IS+I FAS+GTP G CQ ++ G CH+P S ++V
Sbjct: 729 KKISNILFASFGTPSGDCQSYAIGLCHSPNSRAIV 763
>gi|75169194|sp|Q9C6W4.1|BGL15_ARATH RecName: Full=Beta-galactosidase 15; Short=Lactase 15; Flags:
Precursor
gi|12597826|gb|AAG60136.1|AC074360_1 hypothetical protein [Arabidopsis thaliana]
Length = 779
Score = 659 bits (1701), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/838 (42%), Positives = 492/838 (58%), Gaps = 108/838 (12%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
M+ + ++ VSS + +T VS+D RAI IDG+RR+L+S IHYPR+T EMWPD
Sbjct: 1 MVSLSFILCCVLVSSCAYATI-----VSHDGRAITIDGHRRVLLSGSIHYPRSTTEMWPD 55
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
LI K KEG D IETYVFWNAHE R QY+F G D+++F+K + + G+Y LRIGPYVC
Sbjct: 56 LIKKGKEGSLDAIETYVFWNAHEPTRRQYDFSGNLDLIRFLKTIQNEGMYGVLRIGPYVC 115
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEWN+GGFPVWL ++PG+EFRT N F EMQ F IV+++++E LF+ QGGPII+ QI
Sbjct: 116 AEWNYGGFPVWLHNMPGMEFRTTNTAFMNEMQNFTTMIVEMVKKEKLFASQGGPIILAQI 175
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYK 260
ENEYGN+ SYG+ GK Y++W A+MA L GVPW+MC+Q DAP+ +++ CNGYYCD +
Sbjct: 176 ENEYGNVIGSYGEAGKAYIQWCANMANSLDVGVPWIMCQQDDAPQPMLNTCNGYYCDNFS 235
Query: 261 PNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG 320
PN+ N P +WTENW GWY WGG+ PHR ED+AFAVARFFQ+ G+F NYYMY GGTNF
Sbjct: 236 PNNPNTPKMWTENWTGWYKNWGGKDPHRTTEDVAFAVARFFQKEGTFQNYYMYHGGTNFD 295
Query: 321 RTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQN 380
RT+GGP+ T+YDYDAP+DE+G L++PK+GHLK LH + E L + + + G
Sbjct: 296 RTAGGPYITTTYDYDAPLDEFGNLNQPKYGHLKQLHDVLHAMEKTLTYGNIST-VDFGNL 354
Query: 381 QEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTA 440
A VY+ ++ S F+ N++E + A + F G SY +P WSVSILPDC+ +NTA
Sbjct: 355 VTATVYQ-----TEEGSSCFIGNVNETSDAKINFQGTSYDVPAWSVSILPDCKTETYNTA 409
Query: 441 KVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVK-EPIGVWSENNFTVQG 499
K+++QTS+ V + + E++ S+ SW + + + + T++
Sbjct: 410 KINTQTSVM-------------VKKANEAENEPSTLKWSWRPENIDSVLLKGKGESTMRQ 456
Query: 500 ILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGS 559
+ + V+ D SDYLW++T + + + D K +R I+S VL F+NGQ G+
Sbjct: 457 LFDQKVVSNDESDYLWYMTTVNLKEQDPVLGKNMSLR----INSTAHVLHAFVNGQHIGN 512
Query: 560 V-----IGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKN 614
H+V Q +F G N + LLS TVGL NYGAF E AG G V + G +N
Sbjct: 513 YRVENGKFHYV-FEQDAKFNPGANVITLLSITVGLPNYGAFFENFSAGITGPVFIIG-RN 570
Query: 615 GD----IDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPD 670
GD DLS W+Y+ GL G Q++S E PST++ AP
Sbjct: 571 GDETIVKDLSTHKWSYKTGLSGFENQLFSSES-------------PSTWS-------APL 610
Query: 671 GIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPT 730
G +PV +DL +GKG AW+NG++IGRYW + D C+
Sbjct: 611 GSEPVVVDLLGLGKGTAWINGNNIGRYWPAFLS--------------DIDGCSAE----- 651
Query: 731 QTWYHVPRSWLQAS-NNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNS 789
YHVPRS+L + +N LV+FEE GGNP ++ + VC V E +
Sbjct: 652 ---YHVPRSFLNSEGDNTLVLFEEIGGNPSLVNFQTIGVGSVCANVYEKNV--------- 699
Query: 790 YSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
+ L C +G IS+I+FAS+G P G C F +G C A + + +
Sbjct: 700 ---------------LELSC-NGKPISAIKFASFGNPGGDCGSFEKGTCEASNNAAAI 741
>gi|125556152|gb|EAZ01758.1| hypothetical protein OsI_23787 [Oryza sativa Indica Group]
Length = 828
Score = 659 bits (1700), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/822 (43%), Positives = 483/822 (58%), Gaps = 75/822 (9%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+Y+ R+++IDG RR++IS IHYPR+TPEMWPDLI K+KEGG D IETYVFWN HE R
Sbjct: 31 VTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
QYNF G DIV+F K + ++GLY LRIGPY+C EWN+GG P WLRDIPG++FR +NAP
Sbjct: 91 RQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNAP 150
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVKWAAS 224
F+ EM+ F IV+ M++ +F+ QGGPII+ QIENEYGN+ + + Q +Y+ W A
Sbjct: 151 FENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEYIHWCAD 210
Query: 225 MALGLGAGVPWVMCKQ-TDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGG 283
MA GVPW+MC+Q +D P N+++ CNG+YC + PN P +WTENW GW+ W
Sbjct: 211 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 270
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
HR ED+AFAVA FFQ+ GS NYYMY GGTNFGRTSGGP+ TSYDYDAP+DEYG
Sbjct: 271 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 330
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
L +PK+GHLKDLH+ IK E LV +Y+ + + V +Y S + F+ N
Sbjct: 331 LRQPKYGHLKDLHSVIKSIEKILV---HGEYVDTNYSDKVTV---TKYTLDSTSACFINN 384
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISV 463
+++ +VT G ++ LP WSVSILPDC+ FN+AK+ +QT++ V
Sbjct: 385 RNDNMDVNVTLDGTTHLLPAWSVSILPDCKTVAFNSAKIKAQTTVM-------------V 431
Query: 464 PQQSMIESKLSSTSKSWMTVK-EPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
+ +M+E + S SWM P + ++ +LE + + D SDYLW+ T I
Sbjct: 432 NKANMVEKEPESLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSIN- 490
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG---SVIGHWV-KVVQPVEFQSGYN 578
E T+ +++ L F+NG L G S GH+V ++ P + G N
Sbjct: 491 --------HKGEASYTLFVNTTGHELYAFVNGMLVGQNHSPNGHFVFQLESPAKLHDGKN 542
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGF-RGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI 637
+ LLS T+GL+NYG EK AG G VKL IDLS W+Y+ GL GE++QI
Sbjct: 543 YISLLSATIGLKNYGPLFEKMPAGIVGGPVKLIDNNGKGIDLSNSSWSYKAGLAGEYRQI 602
Query: 638 YSIEENEAEWTDLTRDGIP--STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
+ +++ W D +P FTWYKT F AP G D V +DL + KG AWVNG+++G
Sbjct: 603 H-LDKPGCTW-DNNNGTVPINKPFTWYKTTFQAPAGEDTVVVDLLGLNKGVAWVNGNNLG 660
Query: 696 RYWT--VVAPKGGCQDTCDYRGAYNSD----KCTTNCGNPTQTWYHVPRSWLQASN-NLL 748
RYW A GGC CDYRG + ++ KC T CG P+Q +YHVPRS+L+ N L
Sbjct: 661 RYWPSYTAAEMGGCHH-CDYRGVFQAEGDGQKCLTGCGEPSQRFYHVPRSFLKNGEPNTL 719
Query: 749 VIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLH 808
++FEE GG+P +S + + VC ++ + L
Sbjct: 720 ILFEEAGGDPSHVSFRTVAAGSVCASA------------------------EVGDTITLS 755
Query: 809 C-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
C Q IS+I S+G +G+C + +G C + + +E
Sbjct: 756 CGQHSKTISAINMTSFGVARGQCGAY-KGGCESKAAYKAFTE 796
>gi|238009208|gb|ACR35639.1| unknown [Zea mays]
Length = 677
Score = 658 bits (1698), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 330/657 (50%), Positives = 435/657 (66%), Gaps = 27/657 (4%)
Query: 199 QIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDG 258
+IENEYGN++S+YG GK Y++WAA MA+ L GVPWVMC+Q DAP+ +I+ CNG+YCD
Sbjct: 7 KIENEYGNIDSAYGAPGKAYMRWAAGMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCDQ 66
Query: 259 YKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTN 318
+ PNS KP +WTENW GW+ ++GG +P+RPVEDLAFAVARF+QRGG+F NYYMY GGTN
Sbjct: 67 FTPNSAAKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTN 126
Query: 319 FGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLG 378
R+SGGPF TSYDYDAPIDEYGL+ +PKWGHL+D+H AIKLCEPAL+A D + Y LG
Sbjct: 127 LDRSSGGPFIATSYDYDAPIDEYGLVRQPKWGHLRDVHKAIKLCEPALIATDPS-YTSLG 185
Query: 379 QNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFN 438
N EA VY+ S C+AFLANID + +VTF G+ Y LP WSVSILPDC+N V N
Sbjct: 186 PNVEAAVYKVG-----SVCAAFLANIDGQSDKTVTFNGKMYRLPAWSVSILPDCKNVVLN 240
Query: 439 TAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQ 498
TA+++SQT+ + + L + S + +L+ + W EP+G+ +N T
Sbjct: 241 TAQINSQTTGSEMRY---LESSNVASDGSFVTPELAVS--DWSYAIEPVGITKDNALTKA 295
Query: 499 GILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG 558
G++E +N T D SD+LW+ T I V D+ N + + ++S+ VL+V+ING++ G
Sbjct: 296 GLMEQINTTADASDFLWYSTSITVKGDEPYL---NGSQSNLAVNSLGHVLQVYINGKIAG 352
Query: 559 SVIGHWVKVV----QPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKN 614
S G + +P+E G N + LLS TVGL NYGAF + GAG G VKL+G N
Sbjct: 353 SAQGSASSSLISWQKPIELVPGKNKIDLLSATVGLSNYGAFFDLVGAGITGPVKLSGL-N 411
Query: 615 GDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDP 674
G +DLS WTYQ+GL+GE +Y E EW I WYKT F P G DP
Sbjct: 412 GALDLSSAEWTYQIGLRGEDLHLYDPSEASPEWVSANAYPINHPLIWYKTKFTPPAGDDP 471
Query: 675 VALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTW 733
VA+D MGKG+AWVNG IGRYW T +AP+ GC ++C+YRGAY+S KC CG P+QT
Sbjct: 472 VAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGAYSSSKCLKKCGQPSQTL 531
Query: 734 YHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVD 793
YHVPRS+LQ +N LV+FE GG+P +IS +R T VC QVSE+H + WS+
Sbjct: 532 YHVPRSFLQPGSNDLVLFEHFGGDPSKISFVMRQTGSVCAQVSEAHPAQIDSWSS----- 586
Query: 794 GKLSINKMAPEMHLHC-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+ + + P + L C ++G +ISS++FAS+GTP G C +S G C + +LS+V E
Sbjct: 587 -QQPMQRYGPALRLECPKEGQVISSVKFASFGTPSGTCGSYSHGECSSTQALSIVQE 642
>gi|449433325|ref|XP_004134448.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 7-like [Cucumis
sativus]
Length = 803
Score = 657 bits (1696), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/822 (42%), Positives = 484/822 (58%), Gaps = 81/822 (9%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
K +V+YD R++ I+G R+++IS IHYPR++P MWP L+ K+K GG + IETYVFWNAH
Sbjct: 12 KKISVTYDGRSLKINGERKIIISGAIHYPRSSPGMWPMLMKKAKNGGLNAIETYVFWNAH 71
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E RGQY+F G ND+V+F+K V LY LRIGPYVCAEWN+GGFPVWL ++PGI+FRT
Sbjct: 72 EPQRGQYDFSGNNDLVQFIKAVQKERLYAILRIGPYVCAEWNYGGFPVWLHNLPGIKFRT 131
Query: 163 NNAPFKEEMQRF-----VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKD 217
NN +K F +KKI ++ + IENE+GN+E SYGQ+GK+
Sbjct: 132 NNQVYKVTFXFFFLTKNLKKINNMFLKNX-------------IENEFGNVEGSYGQEGKE 178
Query: 218 YVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGW 277
YVKW A +A PW+MC+Q DAP+ I+ CN CD +KPN+ N P +WTE+W GW
Sbjct: 179 YVKWCAELAQSYNLSEPWIMCQQGDAPQPIV--CN---CDQFKPNNKNSPKMWTESWAGW 233
Query: 278 YTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAP 337
+ WG R P+R EDLAFAVARFFQ GGS NYYMY GGTNFGR++GGP+ TSYDY+AP
Sbjct: 234 FKGWGERDPYRTAEDLAFAVARFFQYGGSLHNYYMYHGGTNFGRSAGGPYITTSYDYNAP 293
Query: 338 IDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNC 397
+DEYG +++PKWGHLK LH I+ E L D ++I G + A Y Y +S+C
Sbjct: 294 LDEYGNMNQPKWGHLKQLHELIRSMEKVLTYGD-VKHIDTGHSTTATSY---TYKGKSSC 349
Query: 398 SAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPL 457
F N E++ +TF + YT+P WSV++LPDC+ V+NTAKV++QT+I+ +
Sbjct: 350 --FFGN-PENSDREITFQERKYTVPGWSVTVLPDCKTEVYNTAKVNTQTTIRE------M 400
Query: 458 SPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHI 517
P++ + ++ + + +T + I S + T +++ VT D SDYLW++
Sbjct: 401 VPSLVGKHKKPLKWQWRNEKIEHLTHEGDI---SGSAITANSLIDQKMVTNDSSDYLWYL 457
Query: 518 TQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVE----- 572
T +++ +D F K R T+ + + +L F+N + G+ G + K +E
Sbjct: 458 TGFHLNGNDPLFGK----RVTLRVKTRGHILHAFVNNKHIGTQFGPYGKYSFTLEKKVRN 513
Query: 573 FQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKG 632
+ G+N + LLS TVGL NYGA+ E G G V+L DLS W Y+VGL G
Sbjct: 514 LRHGFNQIALLSATVGLPNYGAYYENVEVGIYGPVELIADGKTIRDLSTNEWIYKVGLDG 573
Query: 633 EFQQIYSIEEN-EAEWTDLTRDGIP--STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWV 689
E + + + W + +P FTWYKT F P G + V +DL MGKGQAWV
Sbjct: 574 EKYEFFDPDHKFRKPWLS---NNLPLNQNFTWYKTSFSTPKGREGVVVDLMGMGKGQAWV 630
Query: 690 NGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQ-ASNNL 747
NG IGRYW + +A + GC +CDYRGAY KC TNCG PTQ WYH+PRS++ N
Sbjct: 631 NGKSIGRYWPSYLATENGCSSSCDYRGAYYGSKCATNCGKPTQRWYHIPRSYMNDGKENT 690
Query: 748 LVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHL 807
L++FEE GG P I +K + VC +V + ++ L
Sbjct: 691 LILFEEFGGMPLNIEIKTTRVKKVCAKVD------------------------LGSKLEL 726
Query: 808 HCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
C D + I F +G P+G C F +G+CH+ + SV+ +
Sbjct: 727 TCHD-RTVKRIIFVGFGNPKGNCNNFHKGSCHSSEAFSVIEK 767
>gi|45758292|gb|AAS76480.1| beta-galactosidase [Gossypium hirsutum]
Length = 843
Score = 655 bits (1691), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/816 (41%), Positives = 490/816 (60%), Gaps = 64/816 (7%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
K V+YD R++II+G R +L S IHYPR+TP+MWPDLI K+K+GG + IETYVFWN H
Sbjct: 45 KALGVTYDARSLIINGKRELLFSGAIHYPRSTPDMWPDLIKKAKQGGINAIETYVFWNGH 104
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E + GQYNF+G+ D+VKF+KL+ LY +R+GP++ AEWN GG P WLR++PGI FR+
Sbjct: 105 EPVEGQYNFEGEFDLVKFIKLIHEHKLYAVVRVGPFIQAEWNHGGLPYWLREVPGIIFRS 164
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWA 222
+N PFK+ M+RFV IVD +++E LF+ QGGPII+ QIENEY ++ ++ ++G YV+WA
Sbjct: 165 DNEPFKKHMKRFVTLIVDKLKQEKLFAPQGGPIILAQIENEYNTIQRAFREKGDSYVQWA 224
Query: 223 ASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDG--YKPNSYNKPTLWTENWDGWYTT 280
+AL L A VPW+MCKQ DAP+ II+ CNG +C Y PN NKP LWTENW Y
Sbjct: 225 GKLALSLNANVPWIMCKQRDAPDPIINTCNGRHCGDTFYGPNKRNKPALWTENWTAQYRV 284
Query: 281 WGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDE 340
+G R EDLA++VARFF + GS +NYYM++GGTNFGRTS F T Y + P+DE
Sbjct: 285 FGDPPSQRSAEDLAYSVARFFSKNGSMVNYYMHYGGTNFGRTSAS-FTTTRYYDEGPLDE 343
Query: 341 YGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAF 400
+GL EPKWGHLKD+H A+ LC+ AL +KLG +Q+A V++ S C+AF
Sbjct: 344 FGLQREPKWGHLKDVHRALSLCKRALFWGFPTT-LKLGPDQQAIVWQQP---GTSACAAF 399
Query: 401 LANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPN 460
LAN + A V F GQ LP S+S+LPDC+ VFNT V++Q +
Sbjct: 400 LANNNTRLAQHVNFRGQDIRLPARSISVLPDCKTVVFNTQLVTTQHN------------- 446
Query: 461 ISVPQQSMIESKLSSTSKSWMTVKE--PIGVWSENNFTVQGILEHLNVTKDYSDYLWHIT 518
++ + S++++ + +W +E P+G+ F E ++TKD +DY W+ T
Sbjct: 447 ----SRNFVRSEIANKNFNWEMCREVPPVGL----GFKFDVPRELFHLTKDTTDYAWYTT 498
Query: 519 QIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQ 574
+ + D+ K VRP + + S+ + ++NG+ GS G V+ + + V +
Sbjct: 499 SLLLGRRDLPMKKN--VRPVLRVASLGHGIHAYVNGEYAGSAHGSKVEKSFVLQRAVSLK 556
Query: 575 SGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEF 634
G N + LL VGL + GA++EK AG R + + G G +D+S+ W +QVG+ GE
Sbjct: 557 EGENHIALLGYLVGLPDSGAYMEKRFAGPR-SITILGLNTGTLDISQNGWGHQVGIDGEK 615
Query: 635 QQIYSIEENEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHH 693
+++++ E +++ +WT + G TWYK YFDAP+G +PVA+ + MGKG WVNG
Sbjct: 616 KKLFTEEGSKSVQWTKPDQGG---PLTWYKGYFDAPEGDNPVAIVMTGMGKGMVWVNGRS 672
Query: 694 IGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEE 753
IGRYW + + PTQ+ YH+PR++L+ NL+V+ EE
Sbjct: 673 IGRYW---------------------NNYLSPLKKPTQSEYHIPRAYLKPK-NLIVLLEE 710
Query: 754 TGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGY 813
GGNP ++ + + +C VSE H PP + + + + +N + P L C
Sbjct: 711 EGGNPKDVHIVTVNRDTICSAVSEIH-PPSPRLFETKNGSLQAKVNDLKPRAELKCPGKK 769
Query: 814 IISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
I ++EFASYG P G C + GNC AP S VV +
Sbjct: 770 QIVAVEFASYGDPFGACGAYFIGNCTAPESKQVVEK 805
>gi|255575455|ref|XP_002528629.1| beta-galactosidase, putative [Ricinus communis]
gi|223531918|gb|EEF33732.1| beta-galactosidase, putative [Ricinus communis]
Length = 822
Score = 654 bits (1686), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/825 (44%), Positives = 482/825 (58%), Gaps = 67/825 (8%)
Query: 45 FNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHES 104
+ V+YD+RAI IDG R++++S IHYPR+TPEMWP LI K+KEGG + IETYVFWNAHE
Sbjct: 5 YEVTYDNRAIKIDGARKLILSGSIHYPRSTPEMWPQLIRKAKEGGLNTIETYVFWNAHEP 64
Query: 105 IRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNN 164
+ QY+F G D+++F+K + GLY LRIGPYVCAEWN+GGFPVWL ++PGI+ RTNN
Sbjct: 65 HQRQYDFSGNLDLIRFIKTIRDEGLYAILRIGPYVCAEWNYGGFPVWLHNLPGIQIRTNN 124
Query: 165 APFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAAS 224
+K EM+ F IV++M++ LF+ QGGPII+ QIENEYGN++SSYG +GK+YVKW A+
Sbjct: 125 EVYKNEMEIFTTLIVNMMKDGKLFASQGGPIILSQIENEYGNVQSSYGDEGKEYVKWCAN 184
Query: 225 MALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGR 284
+A GVPW+MC+Q+DAP +ID+CNG+YCD Y N+ + P +WTENW GW+ WG +
Sbjct: 185 LAESFKVGVPWIMCQQSDAPSPMIDSCNGFYCDQYYSNNKSLPKIWTENWTGWFQDWGQK 244
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
PHR ED+AFAVARFFQ GGS MNYYMY GGTNFG T GGP+ SYDYDAP+DEYG L
Sbjct: 245 NPHRSAEDVAFAVARFFQLGGSVMNYYMYHGGTNFGTTGGGPYITASYDYDAPLDEYGNL 304
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
+PKWGHL+DLH+ + E L +S K + + + Q S F ++I
Sbjct: 305 RQPKWGHLRDLHSVLNSMEQTLTYGES----KNSNYPDNNNIFITIFAYQGKRSCFFSSI 360
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
D + +++F G Y LP WSVSILPDC V+NTA V+ QTSI
Sbjct: 361 D-YKDQTISFEGTDYFLPAWSVSILPDCFTEVYNTATVNVQTSIME-------------N 406
Query: 465 QQSMIESKLSSTSKSWMTVKEPI------GVWSENNFTVQGILEHLNVTKDYSDYLWHIT 518
+ + +S S W E I G + N +++ VT SDYLW +T
Sbjct: 407 KANAADSFREPNSLQWKWRPEKIRGLSLQGDFVGNTLVANELMDQKAVTNGTSDYLWIMT 466
Query: 519 QIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSV-----IGHWVKVVQ-PVE 572
Y + + S W + + + + V+ F+NG+ GS G + V + ++
Sbjct: 467 N-YDHNMNDSLWGAGK-DIILQVHTNGHVVHAFVNGKHVGSQSASIESGRFDFVFESKIK 524
Query: 573 FQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGF-KNGD-----IDLSKILWTY 626
+ G N + L+S +VGLQNYGA + G G + + G K G+ +D+S W Y
Sbjct: 525 LKRGINRISLVSVSVGLQNYGANFDTAPTGINGPITIIGRSKLGNQPDVTVDISSNRWVY 584
Query: 627 QVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQ 686
+ GL GE Q ++ I F WYKT F+AP G DPV +DL +GKG
Sbjct: 585 KTGLHGEDQGFQAVRPRHRRQFYTKHVLINQPFVWYKTSFNAPLGQDPVVVDLLGLGKGT 644
Query: 687 AWVNGHHIGRYW-TVVAPKGG-CQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQAS 744
AWVNG +IGR+W +AP G C C Y G Y +C T CG PTQ +YH+PR WL+
Sbjct: 645 AWVNGRNIGRFWPKALAPDDGTCNAPCSYIGTYEPKQCVTGCGEPTQRYYHIPRDWLKPE 704
Query: 745 NNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPE 804
+N LV+FEE GG P +SV+ + VC E H
Sbjct: 705 DNKLVLFEELGGTPDFVSVQTVTVGKVCVHGYEGH------------------------T 740
Query: 805 MHLHCQDGYIISSIEFASYGTPQGRCQKFSRGN---CHAPMSLSV 846
+ L CQ G S I FAS+G PQG+C F+ N CHA +S V
Sbjct: 741 VELSCQHGRKFSKITFASFGLPQGKCGSFTPSNNHDCHADVSTIV 785
>gi|224103199|ref|XP_002312963.1| predicted protein [Populus trichocarpa]
gi|222849371|gb|EEE86918.1| predicted protein [Populus trichocarpa]
Length = 835
Score = 654 bits (1686), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/823 (42%), Positives = 491/823 (59%), Gaps = 77/823 (9%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
K V+YD R++II+G R +L S IHYPR+TP+MWP+LI K+K GG +VI+TYVFWN H
Sbjct: 27 KQVGVTYDERSLIINGKRELLFSGSIHYPRSTPDMWPELILKAKRGGLNVIQTYVFWNIH 86
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E +G++NF+G D+VKF+K +G +G++ LR+GP++ AEWN GG P WLR+IP I FR+
Sbjct: 87 EPEQGKFNFEGPYDLVKFIKTIGENGMFATLRLGPFIQAEWNHGGLPYWLREIPDIIFRS 146
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWA 222
+NAPFK M++FV KI+D+M+EE LF+ QGGPII+ QIENEY ++ +Y G Y++WA
Sbjct: 147 DNAPFKHHMEKFVTKIIDMMKEEKLFASQGGPIILSQIENEYNTVQLAYKNLGVSYIQWA 206
Query: 223 ASMALGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTT 280
+MALGL GVPWVMCKQ DAP +I+ CNG +C D + PN NKP+LWTENW +
Sbjct: 207 GNMALGLNTGVPWVMCKQKDAPGPVINTCNGRHCGDTFTGPNKPNKPSLWTENWTAQFRV 266
Query: 281 WGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDE 340
+G R ED AF+VAR+F + GS +NYYMY GGTNF RT+ F T Y +AP+DE
Sbjct: 267 FGDPPSQRSAEDTAFSVARWFSKNGSLVNYYMYHGGTNFDRTAAS-FVTTRYYDEAPLDE 325
Query: 341 YGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAF 400
YGL EPKWGHLKDLH A+ LC+ AL+ + KL + EA Y + G++ C+AF
Sbjct: 326 YGLQREPKWGHLKDLHRALNLCKKALLWGN-PNVQKLSADVEARFYE--QPGTKV-CAAF 381
Query: 401 LANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSP- 459
LA+ + A +V F GQ Y LP S+SILPDC+ V+NT V SQ + + S +
Sbjct: 382 LASNNSKEAETVKFRGQEYYLPARSISILPDCKTVVYNTMTVVSQHNSRNFVKSRKTNKL 441
Query: 460 -----NISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYL 514
+ ++P Q ++S L E N+TKD +DY+
Sbjct: 442 EWNMYSETIPAQLQVDSSLPK--------------------------ELYNLTKDKTDYV 475
Query: 515 WHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK---VVQ-P 570
W T I V D++ + + P + + S+ + F+NG+ GS G ++ V+Q
Sbjct: 476 WFTTTINVDRRDMN--ERKRINPVLRVASLGHAMVAFVNGEFIGSAHGSQIEKSFVLQHS 533
Query: 571 VEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGL 630
V+ + G N + LL VGL + GA++E AG RG V + G G +DL+ W +QVGL
Sbjct: 534 VDLKPGINFVTLLGTLVGLPDSGAYMEHRYAGPRG-VSILGLNTGTLDLTSNGWGHQVGL 592
Query: 631 KGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWV 689
GE ++++ E + WT + + G P TWYKT+FDAP+G PVA+ + M KG W+
Sbjct: 593 SGETAKLFTKEGGGKVTWTKVQKAGPP--VTWYKTHFDAPEGKSPVAVRMTGMNKGMIWI 650
Query: 690 NGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLL 748
NG IGRYW T V+P G PTQ+ YH+PRS+L+ ++NL+
Sbjct: 651 NGKSIGRYWMTYVSP----------------------LGEPTQSEYHIPRSYLKPTDNLM 688
Query: 749 VIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLS--INKMAPEMH 806
VIFEE NP +I + + +C V+E H P V+ W + K + ++ P H
Sbjct: 689 VIFEEEEANPEKIEILTVNRDTICSYVTEYHPPSVKSWERK---NNKFTPVVDNAKPAAH 745
Query: 807 LHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
L C + I +++FAS+G P G C ++ G CH+ +S VV E
Sbjct: 746 LKCPNQKKIIAVQFASFGDPLGTCGDYAVGTCHSLVSKQVVEE 788
>gi|224080622|ref|XP_002306183.1| predicted protein [Populus trichocarpa]
gi|222849147|gb|EEE86694.1| predicted protein [Populus trichocarpa]
Length = 838
Score = 652 bits (1681), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/813 (42%), Positives = 485/813 (59%), Gaps = 64/813 (7%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD R++II+G R +L S IHYPR+TPEMWP+LI K+K GG +VI+TYVFWN HE +
Sbjct: 31 VTYDGRSLIINGKRELLFSGSIHYPRSTPEMWPELIQKAKRGGLNVIQTYVFWNIHEPEQ 90
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G++NF+G D+VKF+K +G +G+ +R+GP++ AEWN GG P WLR+IP I FR++NAP
Sbjct: 91 GKFNFEGSYDLVKFIKTIGENGMSATIRLGPFIQAEWNHGGLPYWLREIPDIIFRSDNAP 150
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK M+RFV I++ ++EE LF+ QGGPII+ QIENEY ++ +Y G YV+WA +MA
Sbjct: 151 FKLHMERFVTMIINKLKEEKLFASQGGPIILAQIENEYNTVQLAYRNLGVSYVQWAGNMA 210
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGGR 284
LGL GVPWVMCKQ DAP +I+ CNG +C D + PNS +KP+LWTENW + +G
Sbjct: 211 LGLKTGVPWVMCKQKDAPGPVINTCNGRHCGDTFTGPNSPDKPSLWTENWTAQFRVFGDP 270
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
R ED AF+VAR+F + GS +NYYMY GGTNF RT+ F T Y +AP+DEYGL
Sbjct: 271 PSQRSAEDTAFSVARWFSKNGSLVNYYMYHGGTNFDRTAAS-FVTTRYYDEAPLDEYGLQ 329
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
EPKWGHLKDLH A+ LC+ AL+ + +L + EA + R ++C+AFLAN
Sbjct: 330 REPKWGHLKDLHRALNLCKKALLWG-TPNVQRLSADVEARFFEQPR---TNDCAAFLANN 385
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
+ +VTF G+ Y LP S+SILPDC+ V+NT V SQ +
Sbjct: 386 NTKDPETVTFRGKKYYLPAKSISILPDCKTVVYNTMTVVSQHN----------------- 428
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGIL--EHLNVTKDYSDYLWHITQIYV 522
++ ++S+ + W E I +N V + E N+TKD +DY W T I V
Sbjct: 429 SRNFVKSRKTDGKLEWKMFSETI----PSNLLVDSRIPRELYNLTKDKTDYAWFTTTINV 484
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK---VVQ-PVEFQSGYN 578
+D+S K ++ P + + S+ + FING+ GS G ++ V+Q V+ + G N
Sbjct: 485 DRNDLSARK--DINPVLRVASLGHAMVAFINGEFIGSAHGSQIEKSFVLQHSVKLKPGIN 542
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIY 638
+ LL VGL + GA++E AG RG V + G G +DLS W +QV L GE +++
Sbjct: 543 FVTLLGSLVGLPDSGAYMEHRYAGPRG-VSILGLNTGTLDLSSNGWGHQVALSGETAKVF 601
Query: 639 SIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
+ E + WT + +DG P TWYKT FDAP+G PVA+ + M KG W+NG IGRY
Sbjct: 602 TKEGGRKVTWTKVNKDGPP--VTWYKTRFDAPEGKSPVAVRMTGMKKGMIWINGKSIGRY 659
Query: 698 W-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG 756
W ++P G PTQ+ YH+PRS+L+ +NNL+VI EE G
Sbjct: 660 WMNYISP----------------------LGEPTQSEYHIPRSYLKPTNNLMVILEEEGA 697
Query: 757 NPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIIS 816
+P +I + + +C V+E H P VR W ++ + P L C + I
Sbjct: 698 SPEKIEILTVNRDTICSYVTEYHPPNVRSWERKNKKFTPVA-DDAKPAARLKCPNKKKIV 756
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+++FAS+G P G C F+ G C +P+S VV +
Sbjct: 757 AVQFASFGDPSGTCGNFAVGTCDSPISKQVVEQ 789
>gi|320170852|gb|EFW47751.1| beta-galactosidase [Capsaspora owczarzaki ATCC 30864]
Length = 851
Score = 649 bits (1675), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/837 (41%), Positives = 486/837 (58%), Gaps = 58/837 (6%)
Query: 22 MMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDL 81
++M+ + + C + A NV+YD RA+++DG RR+LI+ IHYPR+TPEMWP+L
Sbjct: 25 VLMVAAVAMCCSAILVALPSTSAMNVTYDSRALLLDGQRRLLIAGCIHYPRSTPEMWPEL 84
Query: 82 IAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCA 141
A++K G DVI+TY+FW+ ++ G++ + D V+F+KL +GL + RIGPYVCA
Sbjct: 85 FARAKANGLDVIQTYLFWDVNQPTPGEFVMTDRFDYVRFIKLAQQAGLMVNFRIGPYVCA 144
Query: 142 EWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIE 201
EWN+GGFP WLR I GI FR N+ P+ + + ++ K V ++++ L + GGP+I+LQIE
Sbjct: 145 EWNYGGFPAWLRQISGIVFRDNDKPWLDVVGPYITKTVQVLKDNKLLAADGGPVILLQIE 204
Query: 202 NEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKP 261
NEYGN+E SY G YV+W +A L AG W+MC+Q DAP N I CNG+YCD Y P
Sbjct: 205 NEYGNIEDSYA-GGPAYVQWCGQLAASLNAGAQWIMCQQDDAPANTIATCNGFYCDNYVP 263
Query: 262 NSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGR 321
+ +P +WTENW GW+ TWG PHRP +D+AFA ARF+ +GG++M+YYMY GGTNFGR
Sbjct: 264 HK-GQPMMWTENWPGWFQTWGQPSPHRPAQDVAFAAARFYAKGGTYMSYYMYHGGTNFGR 322
Query: 322 TSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQ 381
T+GGP TSYDYD +DEYG+ SEPK+ HL LHA + E +++ + I LG+N
Sbjct: 323 TAGGPGITTSYDYDVALDEYGMPSEPKYSHLGSLHAVLHANEHIIMSMNVPAPISLGKNL 382
Query: 382 EAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAK 441
EAHV+ S S C AFL+NID A V F G+++ LP WSVSIL +C ++NTA
Sbjct: 383 EAHVFN-----SSSGCVAFLSNIDSSVDAEVQFNGRTFELPAWSVSILHNCAFAIYNTAA 437
Query: 442 VSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSK---------SWMTVKEPIGVWSE 492
VS+ + + + PL + + + S + ++ + E IG +E
Sbjct: 438 VSAPLNARRMT---PLVVHEDAVSDAADHRRSLSKGEGQERVGAFSTFASYAETIGRRAE 494
Query: 493 NNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFI 552
E +N T D +DYLW+ T + + V V + R F+
Sbjct: 495 EAVYFTSPQEQINTTNDTTDYLWYTTTYNSASATSQVLSISNVNDVVYVYVNRQ----FV 550
Query: 553 NGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGF 612
+GS V + V +G N + +LS T GLQNYG FLE+ G +G VKL
Sbjct: 551 TMSWSGS-------VNKAVPLMAGTNVIDVLSTTFGLQNYGTFLEQVTRGIQGTVKL--- 600
Query: 613 KNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGI 672
G DL++ W +QVGL GE I+ + +N + T TWY++ FD P
Sbjct: 601 --GSTDLTQNGWWHQVGLLGEELGIF-LPQNASNVPWATPATTNRGLTWYRSSFDLPQSS 657
Query: 673 D-PVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPT 730
P+ALD+ MGKG WVNGH++GRYW + +A C D CDYRGAY+ +C C P+
Sbjct: 658 QAPLALDMTGMGKGFVWVNGHNLGRYWPSRIADSMACDD-CDYRGAYDDSRCRQGCNIPS 716
Query: 731 QTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSY 790
Q +YHVPR WLQ +NNL+V+ EE GGNP IS+ R I C V E YP ++
Sbjct: 717 QRYYHVPREWLQPTNNLIVMLEEIGGNPALISLVEREEDISCGAVGED-YP-----ADDL 770
Query: 791 SVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
SV L C I +EFAS+GTP G C++FS G+C+A S ++V
Sbjct: 771 SV-------------VLGCGLHQTIRRVEFASFGTPVGTCRQFSLGSCNAANSTAIV 814
>gi|156106159|gb|ABU49386.1| beta-galactosidase 15 [Oryza sativa Indica Group]
Length = 828
Score = 649 bits (1675), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/822 (43%), Positives = 481/822 (58%), Gaps = 75/822 (9%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+Y+ R+++IDG RR++IS IHYPR+TPEMWPDLI K+KEGG D IETYVFWN HE R
Sbjct: 31 VTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
QYNF G DIV+F K + ++GLY LRIGPY+C EWN+GG P WLRDIPG++FR +NAP
Sbjct: 91 RQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNAP 150
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVKWAAS 224
F+ EM+ F IV+ M++ +F+ QGGPII+ QIENEYGN+ + + Q +Y+ W A
Sbjct: 151 FENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEYIHWCAD 210
Query: 225 MALGLGAGVPWVMCKQ-TDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGG 283
MA GVPW+MC+Q +D P N+++ CNG+YC + PN P +WTENW GW+ W
Sbjct: 211 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 270
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
HR ED+AFAVA FFQ+ GS NYYMY GGTNFGRTSGGP+ TSYDYDAP+DEYG
Sbjct: 271 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 330
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
L +PK+GHLKDLH+ IK E LV +Y+ + V + GS S C F+ N
Sbjct: 331 LRQPKYGHLKDLHSVIKSIEKILV---HGEYVDTNYSDNVTVTKYT-LGSTSAC--FINN 384
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISV 463
+++ +VT G ++ LP WSVSILPDC+ FN+AK+ +QT+I V
Sbjct: 385 RNDNKDLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTTIM-------------V 431
Query: 464 PQQSMIESKLSSTSKSWMTVK-EPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
+ +M+E + + SWM P + ++ +LE + + D SDYLW+ T
Sbjct: 432 KKANMVEKEPENLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRT---- 487
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG---SVIGHWV-KVVQPVEFQSGYN 578
S E T+ +++ L F+NG L G S GH+V ++ V+ G N
Sbjct: 488 -----SLDHKGEASYTLFVNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKN 542
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGF-RGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI 637
+ LLS T+GL+NYG EK AG G VKL IDLS W+Y+ GL GE++QI
Sbjct: 543 YISLLSATIGLKNYGPLFEKMPAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQI 602
Query: 638 YSIEENEAEWTDLTRDGIP--STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
+ +++ W D +P FTWYKT F AP G D V +DL + KG AWVNG+++G
Sbjct: 603 H-LDKPGYRW-DNNNGTVPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLG 660
Query: 696 RYWT--VVAPKGGCQDTCDYRGAYNSD----KCTTNCGNPTQTWYHVPRSWLQASN-NLL 748
RYW A GGC CDYRG + ++ KC T CG P+Q +YHVPRS+L+ N L
Sbjct: 661 RYWPSYTAAEMGGCHH-CDYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTL 719
Query: 749 VIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLH 808
++FEE GG+P ++ VC ++ + L
Sbjct: 720 ILFEEAGGDPSQVIFHSVVAGSVCVSA------------------------EVGDAITLS 755
Query: 809 C-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
C Q IS+I+ S+G +G+C + G C + + +E
Sbjct: 756 CGQHSKTISTIDVTSFGVARGQCGAY-EGGCESKAAYKAFTE 796
>gi|115481546|ref|NP_001064366.1| Os10g0330600 [Oryza sativa Japonica Group]
gi|122249227|sp|Q7G3T8.1|BGL13_ORYSJ RecName: Full=Beta-galactosidase 13; Short=Lactase 13; Flags:
Precursor
gi|110288895|gb|AAP53027.2| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113638975|dbj|BAF26280.1| Os10g0330600 [Oryza sativa Japonica Group]
Length = 828
Score = 649 bits (1674), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/822 (42%), Positives = 481/822 (58%), Gaps = 75/822 (9%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+Y+ R+++IDG RR++IS IHYPR+TPEMWPDLI K+KEGG D IETYVFWN HE R
Sbjct: 31 VAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
QYNF+G DI++F K + ++GLY LRIGPY+C EWN+GG P WLRDIP ++FR +NAP
Sbjct: 91 RQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQFRMHNAP 150
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVKWAAS 224
F+ EM+ F I++ M++ +F+ QGGPII+ QIENEYGN+ + + Q +Y+ W A
Sbjct: 151 FENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEYIHWCAD 210
Query: 225 MALGLGAGVPWVMCKQ-TDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGG 283
MA GVPW+MC+Q +D P N+++ CNG+YC + PN P +WTENW GW+ W
Sbjct: 211 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 270
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
HR ED+AFAVA FFQ+ GS NYYMY GGTNFGRTSGGP+ TSYDYDAP+DEYG
Sbjct: 271 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 330
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
L +PK+GHLKDLH+ IK E LV +Y+ + V + GS S C F+ N
Sbjct: 331 LRQPKYGHLKDLHSVIKSIEKILV---HGEYVDANYSDNVTVTKYT-LGSTSAC--FINN 384
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISV 463
+++ +VT G ++ LP WSVSILPDC+ FN+AK+ +QT+I V
Sbjct: 385 RNDNKDLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTTIM-------------V 431
Query: 464 PQQSMIESKLSSTSKSWMTVK-EPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
+ +M+E + S SWM P + ++ +LE + + D SDYLW+ T
Sbjct: 432 KKANMVEKEPESLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRT---- 487
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG---SVIGHWV-KVVQPVEFQSGYN 578
S E T+ +++ L F+NG L G S GH+V ++ V+ G N
Sbjct: 488 -----SLDHKGEASYTLFVNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKN 542
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGF-RGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI 637
+ LLS T+GL+NYG EK AG G VKL IDLS W+Y+ GL GE++QI
Sbjct: 543 YISLLSATIGLKNYGPLFEKMPAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQI 602
Query: 638 YSIEENEAEWTDLTRDGIP--STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
+ +++ W D +P FTWYKT F AP G D V +DL + KG AWVNG+++G
Sbjct: 603 H-LDKPGYRW-DNNNGTVPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLG 660
Query: 696 RYWT--VVAPKGGCQDTCDYRGAYNSD----KCTTNCGNPTQTWYHVPRSWLQASN-NLL 748
RYW A GGC CDYRG + ++ KC T CG P+Q +YHVPRS+L+ N L
Sbjct: 661 RYWPSYTAAEMGGCHH-CDYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTL 719
Query: 749 VIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLH 808
++FEE GG+P ++ VC ++ + L
Sbjct: 720 ILFEEAGGDPSQVIFHSVVAGSVCVSA------------------------EVGDAITLS 755
Query: 809 C-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
C Q IS+I+ S+G +G+C + G C + + +E
Sbjct: 756 CGQHSKTISTIDVTSFGVARGQCGAY-EGGCESKAAYKAFTE 796
>gi|16905220|gb|AAL31090.1|AC091749_19 putative beta-galactosidase [Oryza sativa Japonica Group]
gi|22655745|gb|AAN04162.1| Putative beta-galactosidase [Oryza sativa Japonica Group]
Length = 824
Score = 649 bits (1673), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/822 (42%), Positives = 481/822 (58%), Gaps = 75/822 (9%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+Y+ R+++IDG RR++IS IHYPR+TPEMWPDLI K+KEGG D IETYVFWN HE R
Sbjct: 27 VAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 86
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
QYNF+G DI++F K + ++GLY LRIGPY+C EWN+GG P WLRDIP ++FR +NAP
Sbjct: 87 RQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQFRMHNAP 146
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVKWAAS 224
F+ EM+ F I++ M++ +F+ QGGPII+ QIENEYGN+ + + Q +Y+ W A
Sbjct: 147 FENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEYIHWCAD 206
Query: 225 MALGLGAGVPWVMCKQ-TDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGG 283
MA GVPW+MC+Q +D P N+++ CNG+YC + PN P +WTENW GW+ W
Sbjct: 207 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 266
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
HR ED+AFAVA FFQ+ GS NYYMY GGTNFGRTSGGP+ TSYDYDAP+DEYG
Sbjct: 267 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 326
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
L +PK+GHLKDLH+ IK E LV +Y+ + V + GS S C F+ N
Sbjct: 327 LRQPKYGHLKDLHSVIKSIEKILV---HGEYVDANYSDNVTVTKYT-LGSTSAC--FINN 380
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISV 463
+++ +VT G ++ LP WSVSILPDC+ FN+AK+ +QT+I V
Sbjct: 381 RNDNKDLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTTIM-------------V 427
Query: 464 PQQSMIESKLSSTSKSWMTVK-EPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
+ +M+E + S SWM P + ++ +LE + + D SDYLW+ T
Sbjct: 428 KKANMVEKEPESLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRT---- 483
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG---SVIGHWV-KVVQPVEFQSGYN 578
S E T+ +++ L F+NG L G S GH+V ++ V+ G N
Sbjct: 484 -----SLDHKGEASYTLFVNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKN 538
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGF-RGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI 637
+ LLS T+GL+NYG EK AG G VKL IDLS W+Y+ GL GE++QI
Sbjct: 539 YISLLSATIGLKNYGPLFEKMPAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQI 598
Query: 638 YSIEENEAEWTDLTRDGIP--STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
+ +++ W D +P FTWYKT F AP G D V +DL + KG AWVNG+++G
Sbjct: 599 H-LDKPGYRW-DNNNGTVPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLG 656
Query: 696 RYWT--VVAPKGGCQDTCDYRGAYNSD----KCTTNCGNPTQTWYHVPRSWLQASN-NLL 748
RYW A GGC CDYRG + ++ KC T CG P+Q +YHVPRS+L+ N L
Sbjct: 657 RYWPSYTAAEMGGCHH-CDYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTL 715
Query: 749 VIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLH 808
++FEE GG+P ++ VC ++ + L
Sbjct: 716 ILFEEAGGDPSQVIFHSVVAGSVCVSA------------------------EVGDAITLS 751
Query: 809 C-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
C Q IS+I+ S+G +G+C + G C + + +E
Sbjct: 752 CGQHSKTISTIDVTSFGVARGQCGAY-EGGCESKAAYKAFTE 792
>gi|125574401|gb|EAZ15685.1| hypothetical protein OsJ_31098 [Oryza sativa Japonica Group]
Length = 824
Score = 648 bits (1671), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/822 (42%), Positives = 480/822 (58%), Gaps = 75/822 (9%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+Y+ R+++IDG RR++IS IHYPR+TPEMWPDLI K+KEGG D IETYVFWN HE R
Sbjct: 27 VAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 86
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
QYNF+G DI++F K + ++GLY LRIGPY+C EWN+GG P WLRDIP ++FR +NAP
Sbjct: 87 RQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQFRMHNAP 146
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVKWAAS 224
F+ EM+ F I++ M++ +F+ QGGPII+ QIENEYGN+ + + Q +Y+ W A
Sbjct: 147 FENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEYIHWCAD 206
Query: 225 MALGLGAGVPWVMCKQ-TDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGG 283
MA GVPW+MC+Q +D P N+++ CNG+YC + PN P +WTENW GW+ W
Sbjct: 207 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 266
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
HR ED+AFAVA FFQ+ GS NYYMY GGTNFGRTSGGP+ TSYDYDAP+DEYG
Sbjct: 267 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 326
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
L +PK+GHLKDLH+ IK E LV +Y+ + V +Y S + F+ N
Sbjct: 327 LRQPKYGHLKDLHSVIKSIEKILV---HGEYVDANYSDNVTV---TKYTLGSTSACFINN 380
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISV 463
+++ +VT G ++ LP WSVSILPDC+ FN+AK+ +QT+I V
Sbjct: 381 RNDNKDLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTTIM-------------V 427
Query: 464 PQQSMIESKLSSTSKSWMTVK-EPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
+ +M+E + S SWM P + ++ +LE + + D SDYLW+ T
Sbjct: 428 KKANMVEKEPESLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRT---- 483
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG---SVIGHWV-KVVQPVEFQSGYN 578
S E T+ +++ L F+NG L G S GH+V ++ V+ G N
Sbjct: 484 -----SLDHKGEASYTLFVNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKN 538
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGF-RGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI 637
+ LLS T+GL+NYG EK AG G VKL IDLS W+Y+ GL GE++QI
Sbjct: 539 YISLLSATIGLKNYGPLFEKMPAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQI 598
Query: 638 YSIEENEAEWTDLTRDGIP--STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
+ +++ W D +P FTWYKT F AP G D V +DL + KG AWVNG+++G
Sbjct: 599 H-LDKPGYRW-DNNNGTVPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLG 656
Query: 696 RYWT--VVAPKGGCQDTCDYRGAYNSD----KCTTNCGNPTQTWYHVPRSWLQASN-NLL 748
RYW A GGC CDYRG + ++ KC T CG P+Q +YHVPRS+L+ N L
Sbjct: 657 RYWPSYTAAEMGGCHH-CDYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTL 715
Query: 749 VIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLH 808
++FEE GG+P ++ VC ++ + L
Sbjct: 716 ILFEEAGGDPSQVIFHSVVAGSVCVSA------------------------EVGDAITLS 751
Query: 809 C-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
C Q IS+I+ S+G +G+C + G C + + +E
Sbjct: 752 CGQHSKTISTIDVTSFGVARGQCGAY-EGGCESKAAYKAFTE 792
>gi|330689960|gb|AEC33272.1| beta-galactosidase [Ziziphus jujuba]
Length = 730
Score = 647 bits (1669), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/713 (49%), Positives = 442/713 (61%), Gaps = 44/713 (6%)
Query: 146 GGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYG 205
GGFPVWL+ +PGI FRT+N PFK MQ F +KIV +++ E LF+ QGGPII+ QIENEYG
Sbjct: 1 GGFPVWLKYVPGISFRTDNGPFKTAMQGFTQKIVQMLKSENLFASQGGPIILSQIENEYG 60
Query: 206 NMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYN 265
+ G G+ Y+ WAA MA+GL GVPWVMCK+ DAP+ +I+ACNG+YCDG+ PN
Sbjct: 61 PESKALGAAGRSYINWAAKMAVGLNTGVPWVMCKEDDAPDPVINACNGFYCDGFSPNKPY 120
Query: 266 KPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG 325
KP LWTE W GW+T +GG + RPV+DLAFAVARF Q+GGS+ NYYMY GGTNFGRT+GG
Sbjct: 121 KPILWTEAWSGWFTEFGGTVHQRPVQDLAFAVARFIQKGGSYFNYYMYHGGTNFGRTAGG 180
Query: 326 PFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHV 385
PF TSYDYDAPIDEYGL EPK+ HLK+LH AIKL E ALV+A LG ++A++
Sbjct: 181 PFVTTSYDYDAPIDEYGLTREPKYSHLKELHKAIKLSEDALVSA-GPTITSLGTYEQAYI 239
Query: 386 YRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQ 445
Y + C+AFLAN + +AA V F + Y LPPWS+SILPDCRN +NTA V Q
Sbjct: 240 YNSG----PRKCAAFLANYNSKSAARVLFNNRHYNLPPWSISILPDCRNVAYNTALVGVQ 295
Query: 446 TSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSEN-NFTVQGILEHL 504
TS + LP ++ SW T E I E T G+LE +
Sbjct: 296 TSHVHM---LPTGTSL----------------LSWETYDEVISSLDERARMTAVGLLEQI 336
Query: 505 NVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG-- 562
NVT+D SDYLW++T + +S + SF + + +PT+ + S +RVFINGQ +GS G
Sbjct: 337 NVTRDTSDYLWYMTSVDISSSE-SFLRGGQ-KPTLNVQSAGHAVRVFINGQFSGSAFGTR 394
Query: 563 --HWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLS 620
PV ++G N + LLS VGL N G E G G V L G NG DL+
Sbjct: 395 EHRQFTFTGPVNLRAGSNKISLLSIAVGLPNVGFHYELWETGVLGPVFLNGLDNGKRDLT 454
Query: 621 KILWTYQVGLKGEFQQIYSIE-ENEAEWTD---LTRDGIPSTFTWYKTYFDAPDGIDPVA 676
W+YQVGLKGE + + E + A+W R P TWYK YF+AP+G +P+A
Sbjct: 455 WQKWSYQVGLKGEAMNLVTPEGASSADWVRGSLAARSVQP--LTWYKAYFNAPNGNEPLA 512
Query: 677 LDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHV 736
LDL SMGKGQ +NG IGRYWT A KG C+ C Y G +PTQ WYHV
Sbjct: 513 LDLRSMGKGQVRINGQSIGRYWTAYA-KGDCE-ACSYTGHSGRQNVNLVVASPTQRWYHV 570
Query: 737 PRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKL 796
PRSWL+ NLLVIFEE GG+ +I++ RS VC E+H P + K+S S S DG
Sbjct: 571 PRSWLKPKQNLLVIFEELGGDASKIALLRRSLTNVCANAFENH-PSMAKYSTS-SQDG-- 626
Query: 797 SINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
S K A ++L C G IS+IEFAS+GTP G C F G CHAP S S++ +
Sbjct: 627 SKVKEA-TVNLQCGPGQSISAIEFASFGTPSGTCGSFHIGTCHAPNSRSIIEK 678
>gi|218184317|gb|EEC66744.1| hypothetical protein OsI_33101 [Oryza sativa Indica Group]
Length = 824
Score = 646 bits (1667), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/822 (42%), Positives = 481/822 (58%), Gaps = 75/822 (9%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+Y+ R+++IDG RR++IS IHYPR+TPEMWPDLI K+KEGG D IETYVFWN HE R
Sbjct: 27 VAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 86
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
QYNF+G DI++F K + ++GLY LRIGPY+C EWN+GG P WLRDIP ++FR +NAP
Sbjct: 87 RQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQFRMHNAP 146
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVKWAAS 224
F+ EM+ F I++ M++ +F+ QGGPII+ QIENEYGN+ + + Q +Y+ W A
Sbjct: 147 FENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEYIHWCAD 206
Query: 225 MALGLGAGVPWVMCKQ-TDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGG 283
MA GVPW+MC+Q +D P N+++ CNG+YC + PN P +WTENW GW+ W
Sbjct: 207 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 266
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
HR ED+AFAVA FFQ+ GS NYYMY GGTNFGRTSGGP+ TSYDYDAP+DEYG
Sbjct: 267 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 326
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
L +PK+GHLKDLH+ IK E LV +Y+ + V + GS S C F+ N
Sbjct: 327 LRQPKYGHLKDLHSVIKSIEKILV---HGEYVDTNYSDNVTVTKYT-LGSTSAC--FINN 380
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISV 463
+++ +VT G ++ LP WSVSILPDC+ FN+AK+ +QT+I V
Sbjct: 381 RNDNKDLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTTIM-------------V 427
Query: 464 PQQSMIESKLSSTSKSWMTVK-EPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
+ +M+E + + SWM P + ++ +LE + + D SDYLW+ T
Sbjct: 428 KKANMVEKEPENLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRT---- 483
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG---SVIGHWV-KVVQPVEFQSGYN 578
S E T+ +++ L F+NG L G S GH+V ++ V+ G N
Sbjct: 484 -----SLDHKGEASYTLFVNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKN 538
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGF-RGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI 637
+ LLS T+GL+NYG EK AG G VKL IDLS W+Y+ GL GE++QI
Sbjct: 539 YISLLSATIGLKNYGPLFEKMPAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQI 598
Query: 638 YSIEENEAEWTDLTRDGIP--STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
+ +++ W D +P FTWYKT F AP G D V +DL + KG AWVNG+++G
Sbjct: 599 H-LDKPGYRW-DNNNGTVPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLG 656
Query: 696 RYWT--VVAPKGGCQDTCDYRGAYNSD----KCTTNCGNPTQTWYHVPRSWLQASN-NLL 748
RYW A GGC CDYRG + ++ KC T CG P+Q +YHVPRS+L+ N L
Sbjct: 657 RYWPSYTAAEMGGCHH-CDYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTL 715
Query: 749 VIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLH 808
++FEE GG+P ++ VC ++ + L
Sbjct: 716 ILFEEAGGDPSQVIFHSVVAGSVCVSA------------------------EVGDAITLS 751
Query: 809 C-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
C Q IS+I+ S+G +G+C + G C + + +E
Sbjct: 752 CGQHSKTISTIDVTSFGVARGQCGAY-EGGCESKAAYKAFTE 792
>gi|357130214|ref|XP_003566745.1| PREDICTED: beta-galactosidase 13-like [Brachypodium distachyon]
Length = 829
Score = 646 bits (1667), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/847 (41%), Positives = 493/847 (58%), Gaps = 79/847 (9%)
Query: 22 MMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDL 81
+ +++++ + V +++ +T V+Y+ RA++IDG RR+++S IHYPR+TPEMWPDL
Sbjct: 11 LALVLLLITAAVGAANCTT------VAYNDRALVIDGQRRIVLSGSIHYPRSTPEMWPDL 64
Query: 82 IAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCA 141
I K+KEGG D IETYVFWN HE QYNF G DIV+F K + ++G+Y LRIGPY+C
Sbjct: 65 IKKAKEGGLDAIETYVFWNGHEPRPRQYNFAGNYDIVRFFKEIQNAGMYAILRIGPYICG 124
Query: 142 EWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIE 201
EWN+GG P WLRDIPG++FR +N PF+ EM+ F IV+ +++ +F+ QGGPII+ QIE
Sbjct: 125 EWNYGGLPAWLRDIPGMQFRMHNQPFEHEMETFTTLIVNKLKDANMFAGQGGPIILSQIE 184
Query: 202 NEYGNMESSY--GQQGKDYVKWAASMALGLGAGVPWVMCKQ-TDAPENIIDACNGYYCDG 258
NEYGN+ ++ Q +Y+ W A+MA GVPW+MC+Q D P N+I+ CNG+YC
Sbjct: 185 NEYGNIMANLTDAQSASEYIHWCAAMANKQNVGVPWIMCQQDADVPPNVINTCNGFYCHD 244
Query: 259 YKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTN 318
+ P + P +WTENW GW+ W HR +D+AFAVA FFQ+ GS NYYMY GGTN
Sbjct: 245 WFPKRTDIPKIWTENWTGWFKAWDKPDFHRSAQDIAFAVAMFFQKRGSLQNYYMYHGGTN 304
Query: 319 FGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLG 378
FGRT+GGP+ TSYDYDAP+DEYG + EPK+GHLKDLHA +K E LV D + I G
Sbjct: 305 FGRTAGGPYITTSYDYDAPLDEYGNIREPKYGHLKDLHAVLKSMEKILVHGDFSD-INYG 363
Query: 379 QNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFN 438
+N +Y + F++N + A+ T G ++ +P WSVS+LPDC+ +N
Sbjct: 364 RN-----VTVTKYTLDGSSVCFISNQFDDRDANATIDGTTHVVPAWSVSVLPDCKAVAYN 418
Query: 439 TAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVK-EPIGVWSENNFTV 497
TAK+ +QTS+ V + + +E + + SWM +P + +F
Sbjct: 419 TAKIKAQTSVM-------------VKKPNTVEQEPENLKWSWMPEHLKPFMTDEKGSFRK 465
Query: 498 QGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLT 557
+LE + + D SDYLW+ T SF E + +++++ + F+NG+L
Sbjct: 466 NELLEQITTSTDQSDYLWYRT---------SFEHKGEAKYKLSVNTTGHQIYAFVNGKLA 516
Query: 558 G---SVIGHWV-KVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGF-RGQVKLTGF 612
G S G ++ ++ PV+ G N L LLS T+GL+NYGA E AG G VKL
Sbjct: 517 GRQHSPNGAFIFQLESPVKLHDGKNYLSLLSATMGLKNYGALFELMPAGIVGGPVKLVDN 576
Query: 613 KNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEW-TDLTRDGIPSTFTWYKTYFDAPDG 671
IDLS W+Y+ GL GE +QI+ +++ +W D I FTWYK F AP G
Sbjct: 577 NGSTIDLSNSSWSYKAGLAGEHRQIH-LDKPGYKWHGDNGTIPINRAFTWYKATFQAPAG 635
Query: 672 IDPVALDLGSMGKGQAWVNGHHIGRYWT--VVAPKGGCQDTCDYRGAYNSD----KCTTN 725
+ V DL + KG AWVNG+++GRYW V A GGC CDYRGA+ ++ KC T
Sbjct: 636 EEAVVADLMGLNKGVAWVNGNNLGRYWPSYVAAEMGGCHH-CDYRGAFKAEGDGLKCLTG 694
Query: 726 CGNPTQTWYHVPRSWLQASN-NLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVR 784
C P Q +YHVPR +L+A N +V+FEE GG+P + + VC + +E
Sbjct: 695 CNEPAQRFYHVPRVFLRAGEPNTVVLFEEAGGDPSRVGFHTVAVGPVCVEAAEK------ 748
Query: 785 KWSNSYSVDGKLSINKMAPEMHLHC--QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPM 842
+ L C G ISS++ ASYG +G+C + +G C +
Sbjct: 749 -----------------GDNVTLSCGQHKGRTISSVDLASYGVTRGQCGAY-QGGCESKA 790
Query: 843 SLSVVSE 849
+ +E
Sbjct: 791 AYEAFAE 797
>gi|357133576|ref|XP_003568400.1| PREDICTED: beta-galactosidase 7-like [Brachypodium distachyon]
Length = 821
Score = 644 bits (1662), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/816 (43%), Positives = 485/816 (59%), Gaps = 82/816 (10%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD RA++++G RRML S +HY R+TPEMWP +IAK+++GG DVI+TYVFWN HE ++
Sbjct: 39 VTYDGRALLLNGTRRMLFSGEMHYTRSTPEMWPKIIAKARKGGIDVIQTYVFWNVHEPVQ 98
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G+YNF+G+ +IVKF++ + + GLY+ LRIGP++ AEW +GGFP WL ++P I FRT+N P
Sbjct: 99 GKYNFEGRYNIVKFIREIQAQGLYVSLRIGPFIEAEWKYGGFPFWLHEVPNITFRTDNEP 158
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK+ MQ FV +V++M+ E L+ QGGPII+ QIENEY +E ++G G YV+WAAS+A
Sbjct: 159 FKQHMQGFVTHMVNMMKNEGLYYPQGGPIIISQIENEYQMVEPAFGPGGPRYVQWAASLA 218
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDG--YKPNSYNKPTLWTENWDGWYTTWGGR 284
+GL GVPW+MCKQ DAP+ II+ CNG C PNS NKP LWTENW Y +G
Sbjct: 219 VGLQTGVPWMMCKQNDAPDPIINTCNGLICGETFVGPNSPNKPALWTENWTTRYPIYGND 278
Query: 285 LPHRPVEDLAFAVARFFQR-GGSFMNYYMYFGGTNFGRTSGGPFYITSYDYD-APIDEYG 342
R D+ FAVA F R GGSF++YYMY GGTNFGR + Y+T+ YD AP+DEYG
Sbjct: 279 TKLRSTGDITFAVALFIARKGGSFVSYYMYHGGTNFGRFASS--YVTTSYYDGAPLDEYG 336
Query: 343 LLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLA 402
L+ +P WGHLK+LHAA+KL L+ + + LG++QEAHV+ ++ C AFL
Sbjct: 337 LIWQPTWGHLKELHAAVKLSSEPLLYGTYSNF-SLGEDQEAHVFE-----TKLKCVAFLV 390
Query: 403 NIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNIS 462
N D+H +V F S L P S+SIL DCR VF T KV++Q +T E L
Sbjct: 391 NFDKHQRPTVIFRNISLQLAPKSISILSDCRTVVFETGKVNAQHGSRTAEVVQSL----- 445
Query: 463 VPQQSMIESKLSSTSKSWMTVKEPIGV-WSENNFTVQGILEHLNVTKDYSDYLWHITQI- 520
+ + +W KE I S+ +T + + EHL+ TKD +DYLW+I
Sbjct: 446 ------------NDTHTWKAFKESIPQDISKAAYTGKQLFEHLSTTKDETDYLWYIASYE 493
Query: 521 YVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGH-----WVKVVQPVEFQS 575
Y DD N ++S +L F+NG+ GSV G ++ + + +
Sbjct: 494 YRPSDDSHLVLLN-------VESQAHILHAFVNGEFVGSVHGSHGARGYIILNMTISLKE 546
Query: 576 GYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQ 635
G N + LL+ VG + GA +E+ G +V + ++ L+ LW YQVGL GE
Sbjct: 547 GQNTISLLNVMVGSPDSGAHMERRSFGIH-KVSIQQGQHALHLLNNELWGYQVGLFGEGN 605
Query: 636 QIYSIE-ENEAEWTDLTR-DGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHH 693
+IY+ E + EWTD+ +P TWY+T F P G D V L+L SMGKG+ W+NG
Sbjct: 606 RIYTQEGSHSVEWTDVNNLTYLP--LTWYQTTFATPMGNDAVTLNLTSMGKGEVWINGES 663
Query: 694 IGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEE 753
IGRYW T G P+Q+ YH+P+ +L+ ++NLLV+ EE
Sbjct: 664 IGRYWV---------------------SFKTPSGQPSQSLYHIPQHFLKNTDNLLVLVEE 702
Query: 754 TGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGY 813
GGNP +I+V S VC V+E PPV+ GK PE+ L CQ G
Sbjct: 703 MGGNPLQITVNTVSITTVCSSVNELSAPPVQS-------QGK------DPEVRLRCQKGK 749
Query: 814 IISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
IS++EFASYG P G C+ F+ G+CHA S SVV +
Sbjct: 750 HISAVEFASYGNPAGDCRTFTIGSCHAESSESVVKQ 785
>gi|224135691|ref|XP_002327281.1| predicted protein [Populus trichocarpa]
gi|222835651|gb|EEE74086.1| predicted protein [Populus trichocarpa]
Length = 788
Score = 643 bits (1658), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/840 (43%), Positives = 482/840 (57%), Gaps = 103/840 (12%)
Query: 22 MMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDL 81
++ ++ L+ + S SA + +V+YD R++IIDG R+++ S IHYPR+TPEMWP L
Sbjct: 4 VLFLVAAVLAVIGSGSA---VRGGDVTYDGRSLIIDGQRKIVFSGSIHYPRSTPEMWPSL 60
Query: 82 IAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCA 141
IAK+KEGG D IETYVFWN HE G Y+F G +DIV+F+K V + GLY LRIGP++ +
Sbjct: 61 IAKAKEGGLDAIETYVFWNVHEPQPGHYDFSGGHDIVRFIKEVQAQGLYACLRIGPFIQS 120
Query: 142 EWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIE 201
EW++GG P WL DIPGI FR++N PFK MQ F K+V +M+ E L++ QGGPII+ QIE
Sbjct: 121 EWSYGGLPFWLHDIPGIVFRSDNEPFKVYMQNFTAKVVSMMQSENLYASQGGPIILSQIE 180
Query: 202 NEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDG--Y 259
NEYG ++ +YGQ+G YV+WAA MA GL GVPWVMCKQ +AP ++I++CNG C
Sbjct: 181 NEYGTVQKAYGQEGLAYVQWAAQMAEGLQTGVPWVMCKQNNAPGHVINSCNGMKCGQTFV 240
Query: 260 KPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFF-QRGGSFMNYYMYFGGTN 318
PNS NKP++WTENW TT + ED+AF V F + GSF+NYYMY GGTN
Sbjct: 241 GPNSPNKPSIWTENW----TT-------QSAEDIAFHVTLFIAAKKGSFVNYYMYHGGTN 289
Query: 319 FGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLG 378
FGRT+ F TSY AP+DEYGL ++PKWGHLK+LHAAIKLC L++ + LG
Sbjct: 290 FGRTASA-FVTTSYYDQAPLDEYGLTTQPKWGHLKELHAAIKLCSTPLLSGVQVN-LYLG 347
Query: 379 QNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFN 438
Q+A+++ A C+AFL N D AASV F SY LPP S+SILPDC+N
Sbjct: 348 PQQQAYIFNA----VSGECAAFLINNDSSNAASVPFRNASYDLPPMSISILPDCKN---- 399
Query: 439 TAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQ 498
VS+Q + +T M ++ + W E I + + +
Sbjct: 400 ---VSTQYTTRT-----------------MGRGEVLDAADVWQEFTEAIPNFDSTSTRSE 439
Query: 499 GILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG 558
+LE +N TKD SDYLW+ + D + + + S+ L F+NGQ G
Sbjct: 440 TLLEQMNTTKDSSDYLWYTFRFQHESSD--------TQAILDVSSLGHALHAFVNGQAVG 491
Query: 559 SVIGH----WVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKN 614
SV G K V G N++ LLS VG+ + GAFLE AG R + K
Sbjct: 492 SVQGSRKNPRFKFETSVSLSKGINNVSLLSVMVGMPDSGAFLENRAAGLR--TVMIRDKQ 549
Query: 615 GDIDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGID 673
+ D + W YQ+GL+GE QIY+ + ++ +W + G P TWYKT DAP G
Sbjct: 550 DNNDFTNYSWGYQIGLQGETLQIYTEQGSSQVQWKKFSNAGNP--LTWYKTQVDAPPGDV 607
Query: 674 PVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTW 733
PV L+L SMGKG+AWVNG IGRYW P+
Sbjct: 608 PVGLNLASMGKGEAWVNGQSIGRYW------------------------------PS--- 634
Query: 734 YHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKW---SNSY 790
YHVPRS+L+ + NLLV+ EE GGNP ++S+ + VC V+ SH PV W + Y
Sbjct: 635 YHVPRSFLKPTGNLLVLQEEEGGNPLQVSLDTVTISQVCGHVTASHLAPVSSWIEHNQRY 694
Query: 791 SVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQK-FSRGNCHAPMSLSVVSE 849
K+S + P++ L C IS I FASYGTP G C+ + G CH+ S +VV E
Sbjct: 695 KNPAKVSGRR--PKVLLACPSKSKISRISFASYGTPLGNCRNSMAVGTCHSQNSKAVVEE 752
>gi|22329897|ref|NP_683341.1| beta-galactosidase 15 [Arabidopsis thaliana]
gi|332193266|gb|AEE31387.1| beta-galactosidase 15 [Arabidopsis thaliana]
Length = 786
Score = 642 bits (1657), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/841 (42%), Positives = 484/841 (57%), Gaps = 124/841 (14%)
Query: 19 YPMMMMMMMIHLSCV--SSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPE 76
Y M+ + L CV SS + +T VS+D RAI IDG+RR+L+S IHYPR+T E
Sbjct: 20 YITTMVSLSFILCCVLVSSCAYATI-----VSHDGRAITIDGHRRVLLSGSIHYPRSTTE 74
Query: 77 MWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIG 136
MWPDLI K KEG D IETYVFWNAHE R QY+F G D+++F+K + + G+Y LRIG
Sbjct: 75 MWPDLIKKGKEGSLDAIETYVFWNAHEPTRRQYDFSGNLDLIRFLKTIQNEGMYGVLRIG 134
Query: 137 PYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPII 196
PYVCAEWN+GGFPVWL ++PG+EFRT N F EMQ F IV+++++E LF+ QGGPII
Sbjct: 135 PYVCAEWNYGGFPVWLHNMPGMEFRTTNTAFMNEMQNFTTMIVEMVKKEKLFASQGGPII 194
Query: 197 MLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYC 256
+ QIENEYGN+ SYG+ GK Y++W A+MA L GVPW+MC+Q DAP+ +++ CNGYYC
Sbjct: 195 LAQIENEYGNVIGSYGEAGKAYIQWCANMANSLDVGVPWIMCQQDDAPQPMLNTCNGYYC 254
Query: 257 DGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGG 316
D + PN+ N P +WTENW GWY WGG+ PHR ED+AFAVARFFQ+ G+F NYYMY GG
Sbjct: 255 DNFSPNNPNTPKMWTENWTGWYKNWGGKDPHRTTEDVAFAVARFFQKEGTFQNYYMYHGG 314
Query: 317 TNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIK 376
TNF RT+GGP+ T+YDYDAP+DE+G L++PK+GHLK LH + E L + + +
Sbjct: 315 TNFDRTAGGPYITTTYDYDAPLDEFGNLNQPKYGHLKQLHDVLHAMEKTLTYGNIST-VD 373
Query: 377 LGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTV 436
G A VY+ ++ S F+ N++E + A + F G SY +P WSVSILPDC+
Sbjct: 374 FGNLVTATVYQ-----TEEGSSCFIGNVNETSDAKINFQGTSYDVPAWSVSILPDCKTET 428
Query: 437 FNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVK-EPIGVWSENNF 495
+NTAK+++QTS+ V + + E++ S+ SW + + + +
Sbjct: 429 YNTAKINTQTSVM-------------VKKANEAENEPSTLKWSWRPENIDSVLLKGKGES 475
Query: 496 TVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQ 555
T++ + + V+ D SDYLW++T + + + D K +R I+S VL F+NGQ
Sbjct: 476 TMRQLFDQKVVSNDESDYLWYMTTVNLKEQDPVLGKNMSLR----INSTAHVLHAFVNGQ 531
Query: 556 LTGSV-----IGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLT 610
G+ H+V Q +F G N + LLS TVGL NYGAF E AG G V +
Sbjct: 532 HIGNYRVENGKFHYV-FEQDAKFNPGANVITLLSITVGLPNYGAFFENFSAGITGPVFII 590
Query: 611 GFKNGD----IDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYF 666
G +NGD DLS W+Y+ GL G Q++S E PST++
Sbjct: 591 G-RNGDETIVKDLSTHKWSYKTGLSGFENQLFSSES-------------PSTWS------ 630
Query: 667 DAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNC 726
AP G +PV +DL +GKG AW+NG++IGRYW A+ SD
Sbjct: 631 -APLGSEPVVVDLLGLGKGTAWINGNNIGRYWP----------------AFLSDI----- 668
Query: 727 GNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKW 786
+N LV+FEE GGNP ++ + VC V E +
Sbjct: 669 ----------------DGDNTLVLFEEIGGNPSLVNFQTIGVGSVCANVYEKNV------ 706
Query: 787 SNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSV 846
+ L C +G IS+I+FAS+G P G C F +G C A + +
Sbjct: 707 ------------------LELSC-NGKPISAIKFASFGNPGGDCGSFEKGTCEASNNAAA 747
Query: 847 V 847
+
Sbjct: 748 I 748
>gi|224082320|ref|XP_002306647.1| predicted protein [Populus trichocarpa]
gi|222856096|gb|EEE93643.1| predicted protein [Populus trichocarpa]
Length = 764
Score = 642 bits (1655), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/813 (43%), Positives = 479/813 (58%), Gaps = 94/813 (11%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YD R++II+G ++L S IHYPR+TP+MW LI+K+K GG DVI+TYVFWN HE
Sbjct: 1 NVTYDGRSLIINGQHKILFSGSIHYPRSTPDMWSSLISKAKAGGIDVIQTYVFWNLHEPQ 60
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+GQ+ F G+ D+V+FVK + + GLY LRIGP++ +EW +GG P WL DIPG+ +R++N
Sbjct: 61 QGQFYFNGRADLVRFVKEIQAQGLYACLRIGPFIESEWTYGGLPFWLHDIPGMVYRSDNQ 120
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK M+RFV +IV +M+ E L++ QGGPII+ Q+ENEY N+E+++ ++G YV+WAA M
Sbjct: 121 PFKYHMKRFVSRIVSMMKSEKLYASQGGPIILSQVENEYKNVEAAFHEKGPSYVRWAALM 180
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDG--YKPNSYNKPTLWTENWDGWYTTWGG 283
A+ L GVPWVMCKQ DAP+ +I++CNG C PNS NKP++WTE+W +Y +G
Sbjct: 181 AVNLQTGVPWVMCKQDDAPDPVINSCNGMRCGETFAGPNSPNKPSIWTEDWTSFYQVYGE 240
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
R +D+AF VA F + GS++NYYMY GGTNFGRT+ F ITSY AP+DEYGL
Sbjct: 241 ETYMRSAQDIAFHVALFIAKTGSYVNYYMYHGGTNFGRTASA-FTITSYYDQAPLDEYGL 299
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
+ +PKWGHLK+LHAAIK C L+ + + LG Q+A+V++ G+ C+AFL N
Sbjct: 300 IRQPKWGHLKELHAAIKSCSKLLLHG-AHKTFSLGPLQQAYVFQ----GNSGQCAAFLVN 354
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISV 463
D V F SY LP S+SILPDC+ FNTAKV++Q + ++ + PN
Sbjct: 355 NDGKQEVEVLFQSNSYKLPQKSISILPDCKTMTFNTAKVNAQYTTRS------MKPN--- 405
Query: 464 PQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
K +S K W EPI + + + +LEH++ TKD SDYLW+ + +
Sbjct: 406 -------QKFNSVGK-WEEYNEPIPEFDKTSLRANRLLEHMSTTKDTSDYLWYTFRFQQN 457
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW------VKVVQPVEFQSGY 577
+ + S VL ++NG G GH + V ++G
Sbjct: 458 LPN--------AQSVFNAQSHGHVLHAYVNGVHAG--FGHGSHQNTSFSLQTTVRLKNGT 507
Query: 578 NDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI 637
N + LLS TVGL + GA+LE+ AG R +V++ + D + W YQVGL GE QI
Sbjct: 508 NSVALLSATVGLPDSGAYLERRVAGLR-RVRIQ-----NKDFTTYTWGYQVGLLGERLQI 561
Query: 638 YSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGR 696
Y+ N+ +W L G WYKT FDAP G DPVAL+LGSMGKG+AWVNG IGR
Sbjct: 562 YTENGSNKVKWNKL---GTNRPLMWYKTLFDAPAGNDPVALNLGSMGKGEAWVNGQSIGR 618
Query: 697 YWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG 756
YW T+ G+P+QTWY++PR++L+ + NLLV+ EE G
Sbjct: 619 YWVSFH---------------------TSQGSPSQTWYNIPRAFLKPTGNLLVLLEEEKG 657
Query: 757 NPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIIS 816
P I+V S VC SESH V+ L C IS
Sbjct: 658 YPPGITVDTVSVTKVCGYASESHLSAVQ----------------------LSCPLKRNIS 695
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
SI FAS+GTP G C+ ++ GNCH+ S + V +
Sbjct: 696 SIIFASFGTPSGNCESYAIGNCHSSSSKANVEK 728
>gi|115437264|ref|NP_001043252.1| Os01g0533400 [Oryza sativa Japonica Group]
gi|75158475|sp|Q8RUV9.1|BGAL1_ORYSJ RecName: Full=Beta-galactosidase 1; Short=Lactase 1; Flags:
Precursor
gi|20146357|dbj|BAB89138.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|20161405|dbj|BAB90329.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113532783|dbj|BAF05166.1| Os01g0533400 [Oryza sativa Japonica Group]
gi|215767421|dbj|BAG99649.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 827
Score = 641 bits (1653), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/809 (42%), Positives = 472/809 (58%), Gaps = 73/809 (9%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYD R+++IDG RR+++S IHYPR+TPEMWPDLI K+KEGG D IETY+FWN HE
Sbjct: 30 SVSYDDRSLVIDGQRRIILSGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYIFWNGHEPH 89
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
R QYNF+G D+V+F K + ++G+Y LRIGPY+C EWN+GG P WLRDIPG++FR +N
Sbjct: 90 RRQYNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNE 149
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVKWAA 223
PF+ EM+ F IV+ M++ +F+ QGGPII+ QIENEYGN+ + + Q +Y+ W A
Sbjct: 150 PFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIHWCA 209
Query: 224 SMALGLGAGVPWVMCKQ-TDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWG 282
MA GVPW+MC+Q D P N+++ CNG+YC + PN P +WTENW GW+ W
Sbjct: 210 DMANKQNVGVPWIMCQQDDDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWD 269
Query: 283 GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYG 342
HR ED+AFAVA FFQ+ GS NYYMY GGTNFGRTSGGP+ TSYDYDAP+DEYG
Sbjct: 270 KPDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYG 329
Query: 343 LLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLA 402
L +PK+GHLK+LH+ +K E LV +Y V +Y S+ + F+
Sbjct: 330 NLRQPKYGHLKELHSVLKSMEKTLV---HGEYFDTNYGDNITV---TKYTLDSSSACFIN 383
Query: 403 NIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNIS 462
N + +VT G ++ LP WSVSILPDC+ FN+AK+ +QTS+
Sbjct: 384 NRFDDKDVNVTLDGATHLLPAWSVSILPDCKTVAFNSAKIKTQTSVM------------- 430
Query: 463 VPQQSMIESKLSSTSKSWMTVK-EPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIY 521
V + + E + S SWM P + NF +LE + + D SDYLW+ T
Sbjct: 431 VKKPNTAEQEQESLKWSWMPENLSPFMTDEKGNFRKNELLEQIVTSTDQSDYLWYRT--- 487
Query: 522 VSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG---SVIGHWV-KVVQPVEFQSGY 577
S E + +++ L F+NG+L G S G +V ++ PV+ G
Sbjct: 488 ------SLNHKGEGSYKLYVNTTGHELYAFVNGKLIGKNHSADGDFVFQLESPVKLHDGK 541
Query: 578 NDLILLSQTVGLQNYGAFLEKDGAGF-RGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQ 636
N + LLS TVGL+NYG EK G G VKL IDLS W+Y+ GL E++Q
Sbjct: 542 NYISLLSATVGLKNYGPSFEKMPTGIVGGPVKLIDSNGTAIDLSNSSWSYKAGLASEYRQ 601
Query: 637 IYSIEENEAEWTDLTRDGIP--STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHI 694
I+ +++ +W IP FTWYK F+AP G D V +DL + KG AWVNG+++
Sbjct: 602 IH-LDKPGYKWNG-NNGTIPINRPFTWYKATFEAPSGEDAVVVDLLGLNKGVAWVNGNNL 659
Query: 695 GRYWT--VVAPKGGCQDTCDYRGAYNSD----KCTTNCGNPTQTWYHVPRSWLQASN-NL 747
GRYW A GC CDYRGA+ ++ +C T CG P+Q +YHVPRS+L A N
Sbjct: 660 GRYWPSYTAAEMAGCH-RCDYRGAFQAEGDGTRCLTGCGEPSQRYYHVPRSFLAAGEPNT 718
Query: 748 LVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHL 807
L++FEE GG+P ++++ VC + + + L
Sbjct: 719 LLLFEEAGGDPSGVALRTVVPGAVC------------------------TSGEAGDAVTL 754
Query: 808 HCQDGYIISSIEFASYGTPQGRCQKFSRG 836
C G+ +SS++ AS+G +GRC + G
Sbjct: 755 SCGGGHAVSSVDVASFGVGRGRCGGYEGG 783
>gi|222424809|dbj|BAH20357.1| AT5G56870 [Arabidopsis thaliana]
Length = 620
Score = 637 bits (1644), Expect = e-180, Method: Compositional matrix adjust.
Identities = 326/651 (50%), Positives = 430/651 (66%), Gaps = 37/651 (5%)
Query: 123 LVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLM 182
LV +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PG+ FRT+N PFK M++F +KIV +M
Sbjct: 1 LVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKFVPGMAFRTDNEPFKAAMKKFTEKIVWMM 60
Query: 183 REEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTD 242
+ E LF QGGPII+ QIENEYG +E G GK Y KW A MALGL GVPW+MCKQ D
Sbjct: 61 KAEKLFQTQGGPIILAQIENEYGPVEWEIGAPGKAYTKWVAQMALGLSTGVPWIMCKQED 120
Query: 243 APENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQ 302
AP IID CNGYYC+ +KPNS NKP +WTENW GWYT +GG +P+RPVED+A++VARF Q
Sbjct: 121 APGPIIDTCNGYYCEDFKPNSINKPKMWTENWTGWYTNFGGAVPYRPVEDIAYSVARFIQ 180
Query: 303 RGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLC 362
+GGS +NYYMY GGTNF RT+ G F +SYDYDAP+DEYGL EPK+ HLK LH AIKL
Sbjct: 181 KGGSLVNYYMYHGGTNFDRTA-GEFMASSYDYDAPLDEYGLPREPKYSHLKALHKAIKLS 239
Query: 363 EPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLP 422
EPAL++AD A LG QEA+V+ S+S+C+AFL+N DE++AA V F G Y LP
Sbjct: 240 EPALLSAD-ATVTSLGAKQEAYVFW-----SKSSCAAFLSNKDENSAARVLFRGFPYDLP 293
Query: 423 PWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMT 482
PWSVSILPDC+ V+NTAKV+ +P++ ++M+ + T SW +
Sbjct: 294 PWSVSILPDCKTEVYNTAKVN--------------APSV---HRNMVP---TGTKFSWGS 333
Query: 483 VKEPIGVWSE-NNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTI 541
E +E F G++E +++T D SDY W+IT I + + +F KT + P +T+
Sbjct: 334 FNEATPTANEAGTFARNGLVEQISMTWDKSDYFWYITDITIGSGE-TFLKTGD-SPLLTV 391
Query: 542 DSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLE 597
S L VF+NGQL+G+ G + Q ++ +G N + LLS VGL N G E
Sbjct: 392 MSAGHALHVFVNGQLSGTAYGGLDHPKLTFSQKIKLHAGVNKIALLSVAVGLPNVGTHFE 451
Query: 598 KDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS-IEENEAEWTDLTRDGIP 656
+ G G V L G +G D+SK W+Y++G+KGE +++ E + WT +
Sbjct: 452 QWNKGVLGPVTLKGVNSGTWDMSKWKWSYKIGVKGEALSLHTNTESSGVRWTQGSFVAKK 511
Query: 657 STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGA 716
TWYK+ F P G +P+ALD+ +MGKGQ W+NG +IGR+W +G C C+Y G
Sbjct: 512 QPLTWYKSTFATPAGNEPLALDMNTMGKGQVWINGRNIGRHWPAYKAQGSC-GRCNYAGT 570
Query: 717 YNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRS 767
+++ KC +NCG +Q WYHVPRSWL+ S NL+V+FEE GG+P IS+ R+
Sbjct: 571 FDAKKCLSNCGEASQRWYHVPRSWLK-SQNLIVVFEELGGDPNGISLVKRT 620
>gi|22328945|ref|NP_194344.2| beta-galactosidase 12 [Arabidopsis thaliana]
gi|20466292|gb|AAM20463.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|23198118|gb|AAN15586.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332659763|gb|AEE85163.1| beta-galactosidase 12 [Arabidopsis thaliana]
Length = 636
Score = 637 bits (1643), Expect = e-180, Method: Compositional matrix adjust.
Identities = 319/623 (51%), Positives = 414/623 (66%), Gaps = 36/623 (5%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD +A+II+G RR+L+S IHYPR+TPEMWPDLI K+K+GG DVI+TYVFWN HE
Sbjct: 29 VTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPSP 88
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY F+ + D+VKF+K+V +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PG+ FRT+N P
Sbjct: 89 GQYYFEDRYDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGMVFRTDNEP 148
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ+F +KIV +M+EE LF QGGPII+ QIENEYG +E G GK Y KW A MA
Sbjct: 149 FKAAMQKFTEKIVRMMKEEKLFETQGGPIILSQIENEYGPIEWEIGAPGKAYTKWVAEMA 208
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
GL GVPW+MCKQ DAP +II+ CNG+YC+ +KPNS NKP +WTENW GW+T +GG +P
Sbjct: 209 QGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNSDNKPKMWTENWTGWFTEFGGAVP 268
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
+RP ED+A +VARF Q GGSF+NYYMY GGTNF RT+ G F TSYDYDAP+DEYGL E
Sbjct: 269 YRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDRTA-GEFIATSYDYDAPLDEYGLPRE 327
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PK+ HLK LH IKLCEPALV+AD LG QEAHV++ S+S+C+AFL+N +
Sbjct: 328 PKYSHLKRLHKVIKLCEPALVSADPT-VTSLGDKQEAHVFK-----SKSSCAAFLSNYNT 381
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+AA V F G +Y LPPWSVSILPDC+ +NTAKV +T + + P
Sbjct: 382 SSAARVLFGGSTYDLPPWSVSILPDCKTEYYNTAKV------RTSSIHMKMVP------- 428
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSEN-NFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
++T SW + E I ++N F+ G++E +++T+D +DY W++T I +S D
Sbjct: 429 -------TNTPFSWGSYNEEIPSANDNGTFSQDGLVEQISITRDKTDYFWYLTDITISPD 481
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYNDLI 581
+ T E P +TI S L VF+NGQL G+ G K Q ++ +G N L
Sbjct: 482 EKFL--TGE-DPLLTIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVNKLA 538
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS GL N G E G G V L G +G D++K W+Y++G KGE ++++
Sbjct: 539 LLSTAAGLPNVGVHYETWNTGVLGPVTLNGVNSGTWDMTKWKWSYKIGTKGEALSVHTLA 598
Query: 642 -ENEAEWTDLTRDGIPSTFTWYK 663
+ EW + + TWYK
Sbjct: 599 GSSTVEWKEGSLVAKKQPLTWYK 621
>gi|357473809|ref|XP_003607189.1| Beta-galactosidase [Medicago truncatula]
gi|355508244|gb|AES89386.1| Beta-galactosidase [Medicago truncatula]
Length = 825
Score = 636 bits (1640), Expect = e-179, Method: Compositional matrix adjust.
Identities = 338/838 (40%), Positives = 499/838 (59%), Gaps = 70/838 (8%)
Query: 22 MMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDL 81
+ + +I + C +++ + ++YD R++++DG + S IHYPR+TP+MWPD+
Sbjct: 10 ITLFSIITIVCAQNAAQT-------ITYDGRSLLLDGKGELFFSGSIHYPRSTPDMWPDI 62
Query: 82 IAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCA 141
+ K++ GG ++I+TYVFWN HE + + NF+G+ D+VKF+KLV G+Y+ LRIGP++ A
Sbjct: 63 LDKARRGGLNLIQTYVFWNGHEPEKDKVNFEGRYDLVKFLKLVQEKGMYVTLRIGPFIQA 122
Query: 142 EWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIE 201
EWN GG P WLR++P I FR+NN PFK+ M+ +V +++ M+EE LF+ QGGPII+ QIE
Sbjct: 123 EWNHGGLPYWLREVPDIIFRSNNEPFKKYMKEYVSIVINRMKEEKLFAPQGGPIILAQIE 182
Query: 202 NEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK 260
NEY +++ +Y G +YV+WAA MA+ L GVPWVMCKQ DAP+ +I+ACNG +C D +
Sbjct: 183 NEYNHIQLAYEADGDNYVQWAAKMAVSLYNGVPWVMCKQKDAPDPVINACNGRHCGDTFT 242
Query: 261 -PNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNF 319
PN KP +WTENW Y +G R ED+AF+VARFF + GS +NYYMY GGTNF
Sbjct: 243 GPNKPYKPFIWTENWTAQYRVFGDPPSQRSAEDIAFSVARFFSKHGSLVNYYMYHGGTNF 302
Query: 320 GRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALV-AADSAQYIKLG 378
GRT+ F T Y +AP+DE+GL EPKW HL+D H A+ LC+ +L+ + Q K+
Sbjct: 303 GRTTSA-FTTTRYYDEAPLDEFGLQREPKWSHLRDAHKAVNLCKKSLLNGVPTTQ--KIS 359
Query: 379 QNQEAHVYRANRYGSQSN-CSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVF 437
Q E VY +SN C+AF+ N TA +++F G Y LPP S+SILPDC+ VF
Sbjct: 360 QYHEVIVYEKK----ESNLCAAFITNNHTQTAKTLSFRGSDYFLPPRSISILPDCKTVVF 415
Query: 438 NTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTV 497
NT ++SQ S + E +SK + K W EPI E
Sbjct: 416 NTQNIASQHSSRHFE-----------------KSKTGNDFK-WEVFSEPIPSAKELPSKQ 457
Query: 498 QGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLT 557
+ E ++ KD +DY W+ T + + +DI K ++V P + I S+ L+ F+NG+
Sbjct: 458 KLPAELYSLLKDKTDYGWYTTSVELGPEDIP--KKSDVAPVLRILSLGHSLQAFVNGEYI 515
Query: 558 GSVIG----HWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFK 613
GS G + +PV F+ G N + +L+ VGL + GA++E AG + + + G
Sbjct: 516 GSKHGSHEEKGFEFQKPVNFKVGVNQIAILANLVGLPDSGAYMEHRYAGPK-TITILGLM 574
Query: 614 NGDIDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGI 672
+G IDL+ W +QVGL+GE I++ + + EW D G ST +WYKT FD P+G
Sbjct: 575 SGTIDLTSNGWGHQVGLQGENDSIFTEKGSKKVEWKDGKGKG--STISWYKTNFDTPEGT 632
Query: 673 DPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQ 731
+PVA+ + M KG WVNG IGR+W + ++P G PTQ
Sbjct: 633 NPVAIGMEGMAKGMIWVNGESIGRHWMSYLSP----------------------LGKPTQ 670
Query: 732 TWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYS 791
+ YH+PRS+L+ +NLLVIFEE +P +I++ + +C ++E+H P +R +++
Sbjct: 671 SEYHIPRSFLKPKDNLLVIFEEEAISPDKIAILTVNRDTICSFITENHPPNIRSFASKNQ 730
Query: 792 VDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
++ N + PE + C D I+++EFAS+G P G C F G C+AP S +V +
Sbjct: 731 KLERVGEN-LTPEAFITCPDQKKITAVEFASFGDPSGFCGSFIMGKCNAPSSKKIVEQ 787
>gi|293332691|ref|NP_001168270.1| beta-galactosidase precursor [Zea mays]
gi|223947135|gb|ACN27651.1| unknown [Zea mays]
gi|414880417|tpg|DAA57548.1| TPA: beta-galactosidase [Zea mays]
Length = 822
Score = 634 bits (1634), Expect = e-179, Method: Compositional matrix adjust.
Identities = 352/847 (41%), Positives = 491/847 (57%), Gaps = 81/847 (9%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
+ +++ L V+ +++T V+Y+ RA++IDG RR+++S IHYPR+TP+MWPDLI
Sbjct: 4 LQFLLLALVAVTQVASAT-----TVTYNDRALVIDGQRRIILSGSIHYPRSTPQMWPDLI 58
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K+KEGG + IETYVFWN HE R QYNF+G DI++F K + ++G++ LRIGPY+C E
Sbjct: 59 NKAKEGGLNTIETYVFWNGHEPRRRQYNFEGSYDIIRFFKEIQNAGMHAILRIGPYICGE 118
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
WN+GG P WLRDIPG++FR +NAPF+ EM+ F IV+ M++ +F+ QGGPII+ QIEN
Sbjct: 119 WNYGGLPAWLRDIPGMQFRLHNAPFEREMETFTTLIVNKMKDVNMFAGQGGPIILAQIEN 178
Query: 203 EYGNM--ESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQ-TDAPENIIDACNGYYCDGY 259
EYGN+ + Q Y+ W A MA GVPW+MC+Q D P N+I+ CNG+YC +
Sbjct: 179 EYGNIMGQLKNNQSASQYIHWCADMANKQEVGVPWIMCQQDNDVPHNVINTCNGFYCHDW 238
Query: 260 KPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNF 319
PN P +WTENW GW+ W HR ED+AFAVA FFQ+ GS NYYMY GGTNF
Sbjct: 239 FPNRTGIPKIWTENWTGWFKAWDKPDFHRSAEDIAFAVAMFFQKRGSVHNYYMYHGGTNF 298
Query: 320 GRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQ 379
GRTSGGP+ TSYDYDAP+DEYG + +PK+GHLKDLH I+ E LV G+
Sbjct: 299 GRTSGGPYITTSYDYDAPLDEYGNIRQPKYGHLKDLHDLIRSMEKILVHGKYND-TSYGK 357
Query: 380 NQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNT 439
N Y YG S C F+ N VT G+++ +P WSVSILP+C+ +NT
Sbjct: 358 NVTVTKY---MYGGSSVC--FINNQFVDRDMKVTLGGETHLVPAWSVSILPNCKTVAYNT 412
Query: 440 AKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVK-EPIGVWSENNFTVQ 498
AK+ +QTS+ V + + +E + + SWM +P +F
Sbjct: 413 AKIKTQTSVM-------------VKKANSVEKEPETMRWSWMPENLKPFMTDHRGSFRQS 459
Query: 499 GILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG 558
+LE + + D SDYLW+ T S E T+ +++ + F+NG+L G
Sbjct: 460 QLLEQIATSTDQSDYLWYRT---------SLEHKGEGSYTLYVNTSGHEMYAFVNGRLVG 510
Query: 559 ---SVIGHWVKVVQ-PVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFR-GQVKLTGFK 613
S G +V +Q PV+ SG N + LLS TVGL+NYG E AG G VKL G
Sbjct: 511 QNHSADGAFVFQLQSPVKLHSGKNYVSLLSGTVGLKNYGPSFELVPAGIAGGPVKLVGTN 570
Query: 614 NGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDG-IPST--FTWYKTYFDAPD 670
IDL+K W+Y+ GL GE +QI+ +++ +W + +G IP FTWYKT F+AP
Sbjct: 571 GTAIDLTKSSWSYKSGLAGELRQIH-LDKPGYKWQ--SHNGTIPVNRPFTWYKTTFEAPA 627
Query: 671 GIDPVALDLGSMGKGQAWVNGHHIGRYWT--VVAPKGGCQDTCDYRGAYNSD----KCTT 724
G + V +DL + KG AWVNG+ +GRYW A GC CDYRG + ++ +C T
Sbjct: 628 GEEAVVVDLLGLNKGVAWVNGNSLGRYWPSYTAAEMPGCH-VCDYRGKFIAEGDGIRCLT 686
Query: 725 NCGNPTQTWYHVPRSWLQASN-NLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPV 783
CG P Q +YHVPRS+L+A N L++FEE GG+P + + VC E
Sbjct: 687 GCGEPAQRFYHVPRSFLRAGEPNTLILFEEAGGDPTRAAFHTVAVGPVCVAAVE------ 740
Query: 784 RKWSNSYSVDGKLSINKMAPEMHLHC-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPM 842
+ ++ L C G +++S++ AS+G +G C + +G C +
Sbjct: 741 -----------------LGDDVTLSCGGHGRVVASVDVASFGVARGSCGAY-KGGCESKA 782
Query: 843 SLSVVSE 849
+L ++
Sbjct: 783 ALKAFTD 789
>gi|218184335|gb|EEC66762.1| hypothetical protein OsI_33138 [Oryza sativa Indica Group]
Length = 828
Score = 633 bits (1633), Expect = e-178, Method: Compositional matrix adjust.
Identities = 345/807 (42%), Positives = 459/807 (56%), Gaps = 70/807 (8%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYD R++I+DG RR++IS IHYPR+TPEMWPDLI K+KEGG + IETYVFWN HE R
Sbjct: 31 VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
++NF+G D+V+F K + ++G+Y LRIGPY+C EWN+GG PVWLRDIPGI+FR +N P
Sbjct: 91 REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPVWLRDIPGIKFRLHNKP 150
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYG--NMESSYGQQGKDYVKWAAS 224
F+ EM+ F IV M++ +F+ QGGPII+ QIENEYG ++ Q +Y+ W A
Sbjct: 151 FENEMEAFTTLIVKKMKDANMFAGQGGPIILAQIENEYGYTMLQPENIQSAHEYIHWCAD 210
Query: 225 MALGLGAGVPWVMCKQ-TDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGG 283
MA GVPW+MC+Q D P N+++ CNG+YC + N + P +WTENW GWY W
Sbjct: 211 MANKQNVGVPWIMCQQDNDVPPNVVNTCNGFYCHEWFSNRTSIPKMWTENWTGWYRDWDQ 270
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
RP ED+AFAVA FFQ GS NYYMY GGTNFGRT+GGP+ TSYDYDAP+DEYG
Sbjct: 271 PEFRRPTEDIAFAVAMFFQMRGSLQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYGN 330
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
L +PK+GHLK+LH+ + E L+ D YI V +Y + + F+ N
Sbjct: 331 LRQPKYGHLKELHSVLMSMEKILLHGD---YIDTNYGDNVTV---TKYTLNATSACFINN 384
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISV 463
+ +VT G ++ LP WSVSILPDC+ FN+AK+ +QT++ V
Sbjct: 385 RFDDRDVNVTLDGTTHFLPAWSVSILPDCKTVAFNSAKIKTQTTVM-------------V 431
Query: 464 PQQSMIESKLSSTSKSWMTVK-EPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
+ SM+E + SWM P + NF +LE + T D SDYLW+ T
Sbjct: 432 NKTSMVEQQTEHFKWSWMPENLRPFMTDEKGNFRKNELLEQIVTTTDQSDYLWYRT---- 487
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVI----GHWVKVVQPVEFQSGYN 578
S E + +++ L F+NG+L G ++ PV+ G N
Sbjct: 488 -----SLEHKGEGSYVLYVNTTGHELYAFVNGKLVGQQYSPNENFTFQLKSPVKLHDGKN 542
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGF-RGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI 637
+ LLS TVGL+NYG E AG G VKL IDLS W+Y+ GL GE+++I
Sbjct: 543 YISLLSGTVGLRNYGGSFELLPAGIVGGPVKLIDSSGSAIDLSNNSWSYKAGLAGEYRKI 602
Query: 638 YSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
Y + + + I FTWYKT F AP G D V +DL + KG AWVNG+ +GRY
Sbjct: 603 YLDKPGNKWRSHNSTIPINRPFTWYKTTFQAPAGEDSVVVDLHGLNKGVAWVNGNSLGRY 662
Query: 698 WT--VVAPKGGCQDTCDYRGAYNSD----KCTTNCGNPTQTWYHVPRSWLQASN-NLLVI 750
W V A GC CDYRG + ++ KC T CG P+Q YHVPRS+L N L++
Sbjct: 663 WPSYVAADMPGCHH-CDYRGVFKAEVEAQKCLTGCGEPSQQLYHVPRSFLHKGEPNTLIL 721
Query: 751 FEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHC- 809
FEE GG+P E++V+ VC ++ + L C
Sbjct: 722 FEEAGGDPSEVAVRTVVEGSVCASA------------------------ELGDTVTLSCG 757
Query: 810 QDGYIISSIEFASYGTPQGRCQKFSRG 836
G ISS++ AS+G +GRC + G
Sbjct: 758 AHGRTISSVDVASFGVARGRCGSYDGG 784
>gi|11079481|gb|AAG29193.1|AC078898_3 beta-galactosidase, putative [Arabidopsis thaliana]
Length = 780
Score = 633 bits (1632), Expect = e-178, Method: Compositional matrix adjust.
Identities = 364/823 (44%), Positives = 482/823 (58%), Gaps = 108/823 (13%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YD R++IIDG ++L S IHY R+TP+MWP LIAK+K GG DV++TYVFWN HE
Sbjct: 11 NVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVVDTYVFWNVHEPQ 70
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+GQ++F G DIVKF+K V + GLY+ LRIGP++ EW++GG P WL ++ GI FRT+N
Sbjct: 71 QGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGIVFRTDNE 130
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK M+R+ K IV LM+ E L++ QGGPII+ QIENEYG + ++ Q+GK YVKW A +
Sbjct: 131 PFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVGRAFRQEGKSYVKWTAKL 190
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGG 283
A+ L GVPWVMCKQ DAP+ +++ACNG C + +K PNS NKP +WTENW
Sbjct: 191 AVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSL------ 244
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
ED+AF VA F + GSF+NYYMY GGTNFGR + F ITSY AP+DEYGL
Sbjct: 245 -----SAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVITSYYDQAPLDEYGL 298
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSN-CSAFLA 402
L +PKWGHLK+LHAA+KLCE L++ I LG+ Q A V +G ++N C+A L
Sbjct: 299 LRQPKWGHLKELHAAVKLCEEPLLSGLQTT-ISLGKLQTAFV-----FGKKANLCAAILV 352
Query: 403 NIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNIS 462
N D+ ++V F SY L P SVS+LPDC+N FNTAKV++Q + +T + N+S
Sbjct: 353 NQDK-CESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYNTRTRK----ARQNLS 407
Query: 463 VPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
PQ W E + +SE + + +LEH+N T+D SDYLW T+
Sbjct: 408 SPQM-------------WEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQTTRFQQ 454
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYN 578
S+ S K N + L F+NG+ GS+ G H + + + +G N
Sbjct: 455 SEGAPSVLKVNH---------LGHALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTN 505
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDL--SKILWTYQVGLKGEFQQ 636
+L LLS VGL N GA LE+ G R VK+ NG L + W YQVGLKGE
Sbjct: 506 NLALLSVMVGLPNSGAHLERRVVGSR-SVKIW---NGRYQLYFNNYSWGYQVGLKGEKFH 561
Query: 637 IYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
+Y+ + + +W RD TWYK FD P+G DPVAL+LGSMGKG+AWVNG I
Sbjct: 562 VYTEDGSAKVQWKQY-RDSKSQPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIA 620
Query: 696 RYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
+ + YH+PRS+L+ ++NLLVI EE
Sbjct: 621 MF--------------------------------SYFRYHIPRSFLKPNSNLLVILEEER 648
Query: 756 -GNPFEISVKLRSTRIVCEQVSESHYPPV----RKWSN----SYSVDGKLSINKMAPEMH 806
GNP I++ S VC VS ++ PV +K N +Y D K P++
Sbjct: 649 EGNPLGITIDTVSVTEVCGHVSNTNPHPVISPRKKGLNRKNLTYRYDRK-------PKVQ 701
Query: 807 LHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
L C G IS I FAS+GTP G C +S G+CH+P SL+VV +
Sbjct: 702 LQCPTGRKISKILFASFGTPNGSCGSYSIGSCHSPNSLAVVQK 744
>gi|183604889|gb|ACC64531.1| beta-galactosidase 6 [Oryza sativa Indica Group]
Length = 811
Score = 633 bits (1632), Expect = e-178, Method: Compositional matrix adjust.
Identities = 355/818 (43%), Positives = 472/818 (57%), Gaps = 86/818 (10%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
++YD RA+++ G RRM S +HY R+TPEMWP LIAK+K GG DVI+TYVFWN HE I+
Sbjct: 29 ITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVHEPIQ 88
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQYNF+G+ D+VKF++ + + GLY+ LRIGP+V AEW +GGFP WL D+P I FR++N P
Sbjct: 89 GQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRSDNEP 148
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK+ MQ FV KIV +M+ E L+ QGGPII+ QIENEY +E ++G G YV+WAA+MA
Sbjct: 149 FKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWAAAMA 208
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDG--YKPNSYNKPTLWTENWDGWYTTWGGR 284
+GL GVPW+MCKQ DAP+ +I+ CNG C PNS NKP LWTENW Y +G
Sbjct: 209 VGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRYPIYGND 268
Query: 285 LPHRPVEDLAFAVARFFQR-GGSFMNYYMYFGGTNFGRTSGGPFYITSYDYD-APIDEYG 342
R ED+AFAVA + R GSF++YYMY GGTNFGR + Y+T+ YD AP+DEYG
Sbjct: 269 TKLRDPEDIAFAVALYIARKKGSFVSYYMYHGGTNFGRFAAS--YVTTSYYDGAPLDEYG 326
Query: 343 LLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLA 402
L+ +P WGHL++LH A+K L+ + + LGQ QEAHV+ + C AFL
Sbjct: 327 LIWQPTWGHLRELHCAVKQSSEPLLFGSYSNF-SLGQQQEAHVFETD-----FKCVAFLV 380
Query: 403 NIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNIS 462
N D+H V F S L P S+S+L DCRN VF TAKV++Q +T L
Sbjct: 381 NFDQHNTPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNAQHGSRTANAVQSL----- 435
Query: 463 VPQQSMIESKLSSTSKSWMTVKEPIGV-WSENNFTVQGILEHLNVTKDYSDYLWHITQIY 521
+ +W EP+ S++ +T + E L TKD +DYLW+I
Sbjct: 436 ------------NDINNWKAFIEPVPQDLSKSTYTGNQLFEQLPTTKDETDYLWYIVSYK 483
Query: 522 VSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW-----VKVVQPVEFQSG 576
D N++ + + S+ +L F+N + GSV G + + + + G
Sbjct: 484 NRASD-----GNQI-ARLYVKSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEG 537
Query: 577 YNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDID---LSKILWTYQVGLKGE 633
N + LLS VG + GA++E+ G ++ G + G L+ LW YQVGL GE
Sbjct: 538 DNTISLLSVMVGSPDSGAYMERRTFG----IQTVGIQQGQQPMHLLNNDLWGYQVGLFGE 593
Query: 634 FQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGH 692
IY+ E N W D+ + I TWYKT F P G D V L+L SMGKG+ WVNG
Sbjct: 594 KDSIYTQEGPNSVRWMDIN-NLIYHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEVWVNGE 652
Query: 693 HIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIF 751
IGRYW + AP G P+Q+ YH+PR +L +NLLV+
Sbjct: 653 SIGRYWVSFKAPS----------------------GQPSQSLYHIPRGFLTPKDNLLVLV 690
Query: 752 EETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQD 811
EE GG+P +I+V S VC V E PP++ GK+ P++ + CQ
Sbjct: 691 EEMGGDPLQITVNTMSVTTVCGNVDEFSVPPLQS-------RGKV------PKVRIWCQG 737
Query: 812 GYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
G ISSIEFASYG P G C+ F G+CHA S SVV +
Sbjct: 738 GKRISSIEFASYGNPVGDCRSFRIGSCHAESSESVVKQ 775
>gi|357142911|ref|XP_003572734.1| PREDICTED: beta-galactosidase 1-like [Brachypodium distachyon]
Length = 831
Score = 632 bits (1629), Expect = e-178, Method: Compositional matrix adjust.
Identities = 350/827 (42%), Positives = 479/827 (57%), Gaps = 88/827 (10%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYD RA++IDG RR+++S IHYPR+TPEMWPDLI K+K+GG + IETYVFWN HE
Sbjct: 33 VSYDERALVIDGQRRIILSGSIHYPRSTPEMWPDLIQKAKDGGLNTIETYVFWNGHEPRP 92
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
QYNF+G DI++F K V +G+Y LRIGPY+C EWN+GG P WLRDIP ++FR +N P
Sbjct: 93 RQYNFEGNYDIMRFFKEVQKAGMYAILRIGPYICGEWNYGGLPAWLRDIPDMQFRLHNEP 152
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQ--GKDYVKWAAS 224
F+ EM+ F IV+ M++ +F+ QGGPII+ QIENEYGN++S+ Q Y+ W A
Sbjct: 153 FEREMETFTTLIVNKMKDANMFAGQGGPIILTQIENEYGNVQSNLPDQESATKYIHWCAD 212
Query: 225 MALGLGAGVPWVMCKQT-DAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGG 283
MA GVPW+MC+Q+ D P N+I+ CNG+YC +KP N P +WTENW GW+ W
Sbjct: 213 MANKQNVGVPWIMCQQSNDVPPNVIETCNGFYCHDFKPKGSNMPKIWTENWTGWFKAWDK 272
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
HRP ED+A+AVA FFQ GS NYYMY GGTNFGRTSGGP+ T+YDYDAP+DEYG
Sbjct: 273 PDYHRPAEDVAYAVAMFFQNRGSVQNYYMYHGGTNFGRTSGGPYITTTYDYDAPLDEYGN 332
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHV---YRANRYGSQSNCSA- 399
+ +PK+GHLK LH + E LV GQ E ++ +A +Y SA
Sbjct: 333 IRQPKYGHLKALHTVLTSMEKHLV---------YGQQNETNLDDKVKATKYTLDDGSSAC 383
Query: 400 FLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSP 459
F++N ++ +VTF G +Y +P WSVS+LPDC+ +NTAKV +QTS+
Sbjct: 384 FISNSHDNKDVNVTFEGSAYQVPAWSVSVLPDCKTVAYNTAKVKTQTSVM---------- 433
Query: 460 NISVPQQSMIESKLSSTSKSWM-TVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHIT 518
V ++S + L SW+ P S +F +LE + D SDYLW+ T
Sbjct: 434 ---VKKESAAKGGLKW---SWLPEFLRPSFTDSYGSFKSNELLEQIVTGADESDYLWYKT 487
Query: 519 QIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG---SVIGHWV-KVVQPVEFQ 574
S + + + T+ +++ L F+NG+L G +V G ++ + PV +
Sbjct: 488 ---------SLTRGPKEQFTLYVNTTGHELYAFVNGELAGYKHAVNGPYLFQFEAPVTLK 538
Query: 575 SGYNDLILLSQTVGLQNYGAFLEKDGAGF-RGQVKLTGFKNGDIDLSKILWTYQVGLKGE 633
G N + LLS TVGL+NYGA E AG G VKL IDLS WTY+ GL GE
Sbjct: 539 PGKNYISLLSATVGLKNYGASFELMPAGIVGGPVKLVSAHGNTIDLSNNTWTYKTGLFGE 598
Query: 634 FQQIYSIEENEAEWTDLTRDGIPST--FTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNG 691
+QI+ +++ W+ +P+ FTWYK F AP G + V +DL + KG +VNG
Sbjct: 599 QKQIH-LDKPGLRWSPF---AVPTNRPFTWYKATFQAPAGTEAVVVDLVGLNKGVVYVNG 654
Query: 692 HHIGRYWT--VVAPKGGCQDTCDYRGAY----NSDKCTTNCGNPTQTWYHVPRSWLQASN 745
H++GRYW V GC CDYRG Y N +KC T CG Q +YHVPRS+L A++
Sbjct: 655 HNLGRYWPSYVAGDMDGCH-RCDYRGEYVTWNNQEKCLTGCGEVGQRFYHVPRSFLNAAH 713
Query: 746 ---NLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMA 802
N +V+FEE GG+P +++ + + VC +
Sbjct: 714 GAPNTVVLFEEAGGDPAKVNFRTVAVGPVCADAEKGD----------------------- 750
Query: 803 PEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGN-CHAPMSLSVVS 848
+ L C G ISS++ AS+G G+C + G+ C + +L ++
Sbjct: 751 -AVTLACAHGRTISSVDTASFGVSGGQCGAYEGGSGCESKPALEAIT 796
>gi|242057631|ref|XP_002457961.1| hypothetical protein SORBIDRAFT_03g023500 [Sorghum bicolor]
gi|241929936|gb|EES03081.1| hypothetical protein SORBIDRAFT_03g023500 [Sorghum bicolor]
Length = 830
Score = 632 bits (1629), Expect = e-178, Method: Compositional matrix adjust.
Identities = 350/823 (42%), Positives = 478/823 (58%), Gaps = 76/823 (9%)
Query: 49 YDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQ 108
Y+ RA++IDG RR+++S IHYPR+TP+MWPDLI K+KEGG + IETYVFWN HE R Q
Sbjct: 30 YNDRAVVIDGQRRIILSGSIHYPRSTPQMWPDLINKAKEGGLNTIETYVFWNGHEPRRRQ 89
Query: 109 YNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFK 168
YNF+G DIV+F K + ++G++ LRIGPY+C EWN+GG P WLRDIPG++FR +N PF+
Sbjct: 90 YNFEGNYDIVRFFKEIQNAGMHAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNDPFE 149
Query: 169 EEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSY--GQQGKDYVKWAASMA 226
EM+ F IV+ M++ +F+ QGGPII+ QIENEYGN+ Q Y+ W A MA
Sbjct: 150 REMETFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGKLENNQSASQYIHWCADMA 209
Query: 227 LGLGAGVPWVMCKQ-TDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
GVPW+MC+Q D P N+I+ CNG+YC + PN P +WTENW GW+ W
Sbjct: 210 NKQKIGVPWIMCQQDNDVPHNVINTCNGFYCYDWFPNRTGIPKIWTENWTGWFKAWDKPD 269
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
HR ED+AFAVA FFQ+ GS NYYMY GGTNFGRTSGGP+ TSYDYDAP+DEYG +
Sbjct: 270 FHRSAEDIAFAVAMFFQKRGSVHNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNIR 329
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PK+GHLKDLH +K E LV + G+N Y YG S C F++N
Sbjct: 330 QPKYGHLKDLHNLLKSMEKILVHGEYKD-TSHGKNVTVTKY---TYGGSSVC--FISNQF 383
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ +VT G ++ +P WSVSILPDC+ +NTAK+ +QTS+ V +
Sbjct: 384 DDRDVNVTLAG-THLVPAWSVSILPDCKTVAYNTAKIKTQTSVM-------------VKK 429
Query: 466 QSMIESKLSSTSKSWMTVK-EPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
+ +E + + SWM +P +F +LE + + D SDYLW+ T
Sbjct: 430 ANSVEKEPEALRWSWMPENLKPFMTDDHGSFRQSRLLEQIATSTDQSDYLWYRT------ 483
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG---SVIGHWVKVVQ-PVEFQSGYNDL 580
S E T+ +++ + F+NG+L G S G +V +Q PV+ SG N +
Sbjct: 484 ---SLEHKGEGSYTLYVNTTGHKIYAFVNGKLVGQNQSSNGAFVFQLQSPVKLHSGKNYV 540
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFR-GQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS 639
LLS TVGL+NYG E AG G VKL G + IDL+ W+Y+ GL GE +QI+
Sbjct: 541 SLLSGTVGLKNYGPLFELVPAGIAGGPVKLVGANDTAIDLTHSSWSYKSGLAGEHRQIH- 599
Query: 640 IEENEAEWTDLTRDG-IPST--FTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGR 696
+++ +W G IP FTWYKT F AP G + V +DL + KG AWVNG+ +GR
Sbjct: 600 LDKPGYKWRSHNGSGSIPVNRPFTWYKTTFAAPAGDEAVVVDLLGLNKGAAWVNGNSLGR 659
Query: 697 YWT--VVAPKGGCQDTCDYRGAYNSD----KCTTNCGNPTQTWYHVPRSWLQASN-NLLV 749
YW A GGC CDYRG + ++ +C T CG P+Q +YHVPRS+L+A N LV
Sbjct: 660 YWPSYTAAEMGGCHGACDYRGKFKAEGDGIRCLTGCGEPSQRFYHVPRSFLRAGEPNTLV 719
Query: 750 IFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHC 809
+FEE GG+P + + VC +E + ++ L C
Sbjct: 720 LFEEAGGDPARAAFHTVAVGHVCVAAAE-----------------------VGDDVTLSC 756
Query: 810 ---QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
G +++S++ AS+G +G C + +G C + +L +
Sbjct: 757 GGGLGGGVVASVDVASFGVTRGGCGDY-QGGCESKAALKAFRD 798
>gi|357453875|ref|XP_003597218.1| Beta-galactosidase [Medicago truncatula]
gi|355486266|gb|AES67469.1| Beta-galactosidase [Medicago truncatula]
Length = 2260
Score = 631 bits (1628), Expect = e-178, Method: Compositional matrix adjust.
Identities = 296/512 (57%), Positives = 373/512 (72%), Gaps = 24/512 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV YDHRA++IDG RR+LIS IHYPR+TP+MWPDLI KSK+GG DVIETYVFWN HE +
Sbjct: 21 NVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSKDGGLDVIETYVFWNLHEPV 80
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+GQY+F G+ D+VKFVK V +GLY+ LRIGPYVC+EWN+GGFP+WL IPGI+FRT+N
Sbjct: 81 KGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCSEWNYGGFPLWLHFIPGIKFRTDNE 140
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK EM+RF KIVDLM++E L++ QGGPII+ QIENEYG+++S+YG GK Y+ WAA M
Sbjct: 141 PFKVEMKRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGDIDSAYGSAGKSYINWAAKM 200
Query: 226 ALGLGAGVPWVMCKQTDAPENI-IDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGR 284
A L GVPWVMC+Q DAP+ I I+ CNG+YCD + PNS KP LWTENW WY +GG
Sbjct: 201 ATSLDTGVPWVMCQQADAPDPIVINTCNGFYCDQFTPNSKTKPKLWTENWSAWYLLFGGG 260
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
PHRPVEDLAFAVARFFQRGG+F NYYMY GGTNF R++GGPF TSYD+DAPIDEYG++
Sbjct: 261 FPHRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRSTGGPFIATSYDFDAPIDEYGVI 320
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
+PKWGHLKD+H AIKLCE AL+AA+ + LG N EA VY+ S C+AFLAN+
Sbjct: 321 RQPKWGHLKDVHKAIKLCEEALIAAE-PKITYLGPNLEAAVYKTG-----SVCAAFLANV 374
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
D + +V F G SY LP WSVSILPDC+N V NTAK++S ++I ++
Sbjct: 375 DAKSDKTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKINSASTISNF---------VTES 425
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
+ I S +S SK W + EP+G+ ++ + G+LE +N+T D SDYLW+ + + D
Sbjct: 426 LKEDISSSETSRSK-WSWINEPVGISKDDILSKTGLLEQINITADRSDYLWYSLSVDLKD 484
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQL 556
D S + + I+S+ L FING+L
Sbjct: 485 DPGS-------QTVLHIESLGHALHAFINGKL 509
Score = 254 bits (649), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 136/284 (47%), Positives = 172/284 (60%), Gaps = 11/284 (3%)
Query: 570 PVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGD--IDLSKILWTYQ 627
P+ SG N + LLS TVGLQNYGAF + GAG G V L G KNG+ +DLS WTYQ
Sbjct: 1949 PITVLSGKNKIDLLSLTVGLQNYGAFFDTWGAGITGPVILKGLKNGNKTLDLSSRKWTYQ 2008
Query: 628 VGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQA 687
VGLKGE + S + W T WYKT FDAP G +PV +D MGKG+A
Sbjct: 2009 VGLKGEDLGLSS--GSSGAWNSKTTFPKKQPLIWYKTNFDAPSGSNPVVIDFTGMGKGEA 2066
Query: 688 WVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNN 746
WVNG IGRYW T VA C D+C+YRG + KC NCG P+QT YHVP+S+L+ + N
Sbjct: 2067 WVNGQSIGRYWPTYVASNVDCTDSCNYRGPFTQTKCHMNCGKPSQTLYHVPQSFLKPNGN 2126
Query: 747 LLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMH 806
LV+FEE+GG+P +IS + VC VS+SH P + W+ GK+ P +
Sbjct: 2127 TLVLFEESGGDPTQISFATKQIGSVCAHVSDSHPPQIDLWNQDTESGGKV-----GPALL 2181
Query: 807 LHCQD-GYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
L+C + +ISSI+FASYGTP G C F RG C + +LS+V +
Sbjct: 2182 LNCPNHNQVISSIKFASYGTPLGTCGNFYRGRCSSNKTLSIVKK 2225
>gi|356532710|ref|XP_003534914.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 650
Score = 630 bits (1625), Expect = e-177, Method: Compositional matrix adjust.
Identities = 330/647 (51%), Positives = 412/647 (63%), Gaps = 43/647 (6%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
+++MM+ + + V++S V+YDH+AI++DG RR+LIS IHYPR+TP+MWPD
Sbjct: 9 VVLMMLCLWVCGVTAS----------VTYDHKAIVVDGKRRILISGSIHYPRSTPQMWPD 58
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
LI K+K+GG DVI+TYVFWN HE GQY F+ + D+VKFVKL +GLY+ LRIGPY+C
Sbjct: 59 LIQKAKDGGLDVIQTYVFWNGHEPSPGQYYFEDRFDLVKFVKLAQQAGLYVHLRIGPYIC 118
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEWN GGFPVWL+ +PGI FRT+N PFK MQ+F KIV LM+E LF QGGPII+ QI
Sbjct: 119 AEWNLGGFPVWLKYVPGIAFRTDNEPFKAAMQKFTAKIVSLMKENRLFQSQGGPIILSQI 178
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYK 260
ENEYG +E G GK Y KWAA MA+GL GVPWVMCKQ DAP+ +ID CNG+YC+ +K
Sbjct: 179 ENEYGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGFYCENFK 238
Query: 261 PNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG 320
PN KP +WTENW GWYT +GG +P RP EDLAF+VARF Q GGSF+NYYMY GGTNFG
Sbjct: 239 PNKNTKPKMWTENWTGWYTDFGGAVPRRPAEDLAFSVARFIQNGGSFVNYYMYHGGTNFG 298
Query: 321 RTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQN 380
RTSGG F TSYDYDAP+DEYGL +EPK+ HL+ LH AIK EPALVA D + LG N
Sbjct: 299 RTSGGLFIATSYDYDAPLDEYGLENEPKYEHLRALHKAIKQSEPALVATDP-KVQSLGYN 357
Query: 381 QEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTA 440
EAHV+ A C+AF+AN D + A F Y LPPWS+SILPDC+ V+NTA
Sbjct: 358 LEAHVFSA-----PGACAAFIANYDTKSYAKAKFGNGQYDLPPWSISILPDCKTVVYNTA 412
Query: 441 KVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGI 500
KV K N + QS E SS+ ++ +
Sbjct: 413 KVGYGWLKKMTPV------NSAFAWQSYNEEPASSSQA--------------DSIAAYAL 452
Query: 501 LEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSV 560
E +NVT+D SDYLW++T + V+ ++ F K N P +T+ S VL VFINGQL G+V
Sbjct: 453 WEQVNVTRDSSDYLWYMTDVNVNANE-GFLK-NGQSPLLTVMSAGHVLHVFINGQLAGTV 510
Query: 561 IGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGD 616
G + V+ ++G N L LLS VGL N G E AG G V L G G
Sbjct: 511 WGGLGNPKLTFSDNVKLRAGNNKLSLLSVAVGLPNVGVHFETWNAGVLGPVTLKGLNEGT 570
Query: 617 IDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWY 662
DLS+ W+Y+VGLKGE +++ + EW + TWY
Sbjct: 571 RDLSRQKWSYKVGLKGESLSLHTESGSSSVEWIQGSLVAKKQPLTWY 617
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 21/36 (58%), Positives = 27/36 (75%)
Query: 732 TWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRS 767
TWYHVPRSWL + N LV+FEE GG+P I++ R+
Sbjct: 615 TWYHVPRSWLSSGGNSLVVFEEWGGDPNGIALVKRT 650
>gi|326520505|dbj|BAK07511.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 830
Score = 628 bits (1620), Expect = e-177, Method: Compositional matrix adjust.
Identities = 344/816 (42%), Positives = 471/816 (57%), Gaps = 75/816 (9%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V YD RA++IDG RR+LIS IHYPR+TPEMWPDLI K+KEGG D IETYVFWN HE R
Sbjct: 26 VGYDDRALVIDGERRLLISGSIHYPRSTPEMWPDLIRKAKEGGLDAIETYVFWNGHEPRR 85
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
QYNF+G DIV+F K V +G+Y LRIGPY+C EWN+GG P WLRDI G++FR +N P
Sbjct: 86 RQYNFEGSYDIVRFFKEVQDAGMYAILRIGPYICGEWNYGGLPAWLRDISGMQFRMHNHP 145
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVKWAAS 224
F++EM+ F IVD ++E +F+ QGGPII+ QIENEYGN+ + + + +Y+ W A+
Sbjct: 146 FEQEMETFTTLIVDKLKEAKMFAGQGGPIILSQIENEYGNIMGKLNNNESASEYIHWCAA 205
Query: 225 MALGLGAGVPWVMCKQ-TDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGG 283
MA GVPW+MC+Q D P N+I+ NG+YC + P + P +WTENW GW+ W
Sbjct: 206 MANKQNVGVPWIMCQQDDDVPSNVINTWNGFYCHDWFPKRTDIPKIWTENWTGWFKAWDK 265
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
HR ED+AF+VA FFQ GS NYYMY GGTNFGRTSGGP+ TSYDYDAP+DEYG
Sbjct: 266 PDFHRSAEDIAFSVAMFFQTRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 325
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
+ +PK+GHLKDLH +K E L+ D N +Y ++ + F++N
Sbjct: 326 IRQPKYGHLKDLHNVLKSMEKILLHGDYKDTTMGNTN-----VTVTKYTLDNSSACFISN 380
Query: 404 IDEHTAASVTF-LGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNIS 462
+ +VT G ++T+P WSVSILPDC+ +N+AK+ +QTS+
Sbjct: 381 KFDDKEVNVTLDNGATHTVPAWSVSILPDCKTVAYNSAKIKTQTSV-------------- 426
Query: 463 VPQQSMIESKLSSTSKSWMTVK-EPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIY 521
+ ++ E+ + SWM +P + NF +LE + + D SDYLW+ T
Sbjct: 427 MVKRPGAETVTDGLAWSWMPENLQPFMTDEKGNFRKNELLEQIATSGDQSDYLWYRT--- 483
Query: 522 VSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVI----GHWVKVVQPVEFQSGY 577
SF E + +++ L F+NG+L G G ++ PV+ SG
Sbjct: 484 ------SFEHKGESNYKLHVNTTGHELYAFVNGKLVGRHYSPNGGFAFQMETPVKLHSGK 537
Query: 578 NDLILLSQTVGLQNYGAFLEKDGAGF-RGQVKL--TGFKNGDIDLSKILWTYQVGLKGEF 634
N + LLS T+GL+NYGA E AG G VKL T DLS W+Y+ GL GE+
Sbjct: 538 NYISLLSATIGLKNYGALFEMMPAGIVGGPVKLVDTVTNTTAYDLSNSSWSYKAGLAGEY 597
Query: 635 QQIYSIEENE-AEWTDLTRDGIP--STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNG 691
++ + + N+ ++W+ IP FTWYK F+AP G +PV DL +GKG WVNG
Sbjct: 598 RETHLDKANDRSQWSGGLNGTIPVHRPFTWYKATFEAPAGEEPVVADLLGLGKGVVWVNG 657
Query: 692 HHIGRYWT--VVAPKGGCQDTCDYRGAYNSD----KCTTNCGNPTQTWYHVPRSWLQASN 745
+++GRYW V A GCQ CDYRG + ++ KC T C P+Q +YHVPRS+++A
Sbjct: 658 NNLGRYWPSYVAADMDGCQ-RCDYRGTFKAEGDGQKCLTGCNEPSQRFYHVPRSFIKAGE 716
Query: 746 -NLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPE 804
N +V+FEE GG+P +S + ++ E
Sbjct: 717 PNTMVLFEEAGGDPTRVSFHTVAVGAA-----------------------CAEAAEVGDE 753
Query: 805 MHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHA 840
+ L C G ISS++ AS G +G+C + +G C +
Sbjct: 754 VALACSHGRTISSVDVASLGVARGKCGAY-QGGCES 788
>gi|320170654|gb|EFW47553.1| beta-D-galactosidase [Capsaspora owczarzaki ATCC 30864]
Length = 830
Score = 628 bits (1620), Expect = e-177, Method: Compositional matrix adjust.
Identities = 334/819 (40%), Positives = 467/819 (57%), Gaps = 67/819 (8%)
Query: 45 FNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHES 104
NV+YD RA++IDG RR+L+S IHYPR+TP+MWP+L A++K G DVI+TY+FWN +
Sbjct: 25 MNVTYDSRALLIDGRRRLLVSGSIHYPRSTPDMWPELFARAKANGIDVIQTYLFWNTNVP 84
Query: 105 IRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNN 164
G++ + D V+FV+L +GLY+ RIGP+VCAEW +GG P WLR IP I FR +
Sbjct: 85 TPGEFVMSDRFDYVRFVQLAQEAGLYVNFRIGPFVCAEWTYGGLPAWLRQIPDIMFRDYD 144
Query: 165 APFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAAS 224
P+ + ++ K V ++++ L + QGGPII+LQIENEYG ES Y G YV+W
Sbjct: 145 QPWLQVAGEYITKTVQILKDNRLLAGQGGPIILLQIENEYGGTESRYA-GGPQYVEWCGQ 203
Query: 225 MALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGR 284
+A L W+MC Q DAP NII CN +YCD + P+ +P++WTENW GW+ WG
Sbjct: 204 LAANLTDAAQWIMCSQPDAPANIIATCNAFYCDDFVPHP-GQPSMWTENWPGWFQKWGDP 262
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
PHRP +D+A+AV R++ +GGS+MNYYMY GGTNF RT+GGPF T+YDYDA +DEYG+
Sbjct: 263 TPHRPAQDVAYAVTRYYIKGGSYMNYYMYHGGTNFERTAGGPFITTNYDYDASLDEYGMP 322
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
+EPK+ HL +HA + E ++A + + I LG N EAH+Y S C AFL+N
Sbjct: 323 NEPKYSHLGSMHAVLHDNEAIMMAVPAPKPISLGTNLEAHIYN-----SSVGCVAFLSNN 377
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNT------------AKVSSQTSIKTVE 452
+ T V F G++Y LP WSVS+L C ++NT A ++ S + +
Sbjct: 378 NNKTDVEVQFNGRTYELPAWSVSVLHGCVTAIYNTAVCRAHQRAPHDAACCARESRRVCD 437
Query: 453 FSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSD 512
PL P P QS L + + + P + + LE ++ T D++D
Sbjct: 438 RLPPLRPKARAPCQSGRIRHLCLVVLTSIGPQAP-----ATKYWNKTPLEQIDQTLDHTD 492
Query: 513 YLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWV-KVVQPV 571
YLW+ T S+ ++ +++ + DV V++NG+ V W V V
Sbjct: 493 YLWYST---------SYVSSSATYAQLSLPQITDVAYVYVNGKF---VTVSWSGNVSATV 540
Query: 572 EFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLK 631
+G N + +LS T+GL N G L + G G V L G ++L++ W +Q G+
Sbjct: 541 SLVAGPNTIDILSLTMGLDNGGDILSEYNCGLLGGVYL-----GSVNLTENGWWHQTGVV 595
Query: 632 GEFQQIYSIEE-NEAEWTDLTRDGIPSTFTWYKTYFDAP-DGIDPVALDLGSMGKGQAWV 689
GE I+ E + WT T + + TWYK+ FD P D P+ALDL MGKG WV
Sbjct: 596 GERNAIFLPENLKKVAWT--TPAVLNTGLTWYKSSFDVPRDSQAPLALDLTGMGKGYVWV 653
Query: 690 NGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLL 748
NGH++GRYW T++A C D CDYRG Y++ C C P+QT YHVPR WLQA NN+L
Sbjct: 654 NGHNLGRYWPTILATNWPC-DVCDYRGTYDAPHCKQGCNMPSQTHYHVPREWLQAENNVL 712
Query: 749 VIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLH 808
V+ EE GGNP +I++ R + C V E Y D + L
Sbjct: 713 VLLEEMGGNPSKIALVEREEYVSCGVVGE-----------DYPADDLAVV--------LG 753
Query: 809 CQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
C I+ ++FASYGTP G C+ + +G+CHA S +V
Sbjct: 754 CGTHQTIAGVDFASYGTPMGSCRSYQQGSCHASNSTEIV 792
>gi|222612650|gb|EEE50782.1| hypothetical protein OsJ_31141 [Oryza sativa Japonica Group]
Length = 828
Score = 628 bits (1619), Expect = e-177, Method: Compositional matrix adjust.
Identities = 344/814 (42%), Positives = 463/814 (56%), Gaps = 71/814 (8%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYD R++I+DG RR++IS IHYPR+TPEMWPDLI K+KEGG + IETYVFWN HE R
Sbjct: 31 VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
++NF+G D+V+F K + ++G+Y LRIGPY+C EWN+GG PVWLRDIPGI+FR +N P
Sbjct: 91 REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPVWLRDIPGIKFRLHNKP 150
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYG--NMESSYGQQGKDYVKWAAS 224
F+ M+ F IV M++ +F+ QGGPII+ QIENEYG ++ Q +Y+ W A
Sbjct: 151 FENGMEAFTTLIVKKMKDANMFAGQGGPIILAQIENEYGYTMLQPENIQSAHEYIHWCAD 210
Query: 225 MALGLGAGVPWVMCKQ-TDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGG 283
MA GVPW+MC+Q D P N+++ CNG+YC + N + P +WTENW GWY W
Sbjct: 211 MANKQNVGVPWIMCQQDNDVPPNVVNTCNGFYCHEWFSNRTSIPKMWTENWTGWYRDWDQ 270
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
RP ED+AFAVA FFQ GS NYYMY GGTNFGRT+GGP+ TSYDYDAP+DEYG
Sbjct: 271 PEFRRPTEDIAFAVAMFFQMRGSLQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYGN 330
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
L +PK+GHLK+LH+ + E L+ D YI V +Y + + F+ N
Sbjct: 331 LRQPKYGHLKELHSVLMSMEKILLHGD---YIDTNYGDNVTV---TKYTLNATSACFINN 384
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISV 463
+ +VT G ++ LP WSVSILP+C+ FN+AK+ +QT++ V
Sbjct: 385 RFDDRDVNVTLDGTTHFLPAWSVSILPNCKTVAFNSAKIKTQTTVM-------------V 431
Query: 464 PQQSMIESKLSSTSKSWMTVK-EPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
+ SM+E + SWM P + NF +LE + T D SDYLW+ T
Sbjct: 432 NKTSMVEQQTEHFKWSWMPENLRPFMTDEKGNFRKNELLEQIVTTTDQSDYLWYRT---- 487
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVI----GHWVKVVQPVEFQSGYN 578
S E + +++ L F+NG+L G ++ PV+ G N
Sbjct: 488 -----SLEHKGEGSYVLYVNTTGHELYAFVNGKLVGQQYSPNENFTFQLKSPVKLHDGKN 542
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGF-RGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI 637
+ LLS TVGL+NYG E AG G VKL IDLS W+Y+ GL GE+++I
Sbjct: 543 YISLLSGTVGLRNYGGSFELLPAGIVGGPVKLIDSSGSAIDLSNNSWSYKAGLAGEYRKI 602
Query: 638 YSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
Y + + + I FTWYKT F AP G D V +DL + KG AWVNG+ +GRY
Sbjct: 603 YLDKPGNKWRSHNSTIPINRPFTWYKTTFQAPAGEDSVVVDLHGLNKGVAWVNGNSLGRY 662
Query: 698 WT--VVAPKGGCQDTCDYRGAYNSD----KCTTNCGNPTQTWYHVPRSWL-QASNNLLVI 750
W V A GC CDYRG + ++ KC T CG P+Q YHVPRS+L + N L++
Sbjct: 663 WPSYVAADMPGCHH-CDYRGVFKAEVEAQKCLTGCGEPSQQLYHVPRSFLNKGEPNTLIL 721
Query: 751 FEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHC- 809
FEE GG+P E++V+ VC ++ + L C
Sbjct: 722 FEEAGGDPSEVAVRTVVEGSVCASA------------------------EVGDTVTLSCG 757
Query: 810 QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMS 843
G ISS++ AS+G +GRC + G C + ++
Sbjct: 758 AHGRTISSVDVASFGVARGRCGSYD-GGCESKVA 790
>gi|218188392|gb|EEC70819.1| hypothetical protein OsI_02284 [Oryza sativa Indica Group]
Length = 837
Score = 627 bits (1618), Expect = e-177, Method: Compositional matrix adjust.
Identities = 334/745 (44%), Positives = 450/745 (60%), Gaps = 49/745 (6%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYD R+++IDG RR+++S IHYPR+TPEMWPDLI K+KEGG D IETY+FWN HE
Sbjct: 30 SVSYDDRSLVIDGQRRIILSGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYIFWNGHEPH 89
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
R QYNF+G D+V+F K + ++G+Y LRIGPY+C EWN+GG P WLRDIPG++FR +N
Sbjct: 90 RRQYNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNE 149
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVKWAA 223
PF+ EM+ F IV+ M++ +F+ QGGPII+ QIENEYGN+ + + Q +Y+ W A
Sbjct: 150 PFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIHWCA 209
Query: 224 SMALGLGAGVPWVMCKQ-TDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWG 282
MA GVPW+MC+Q D P N+++ CNG+YC + PN P +WTENW GW+ W
Sbjct: 210 DMANKQNVGVPWIMCQQDDDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWD 269
Query: 283 GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYG 342
HR ED+AFAVA FFQ+ GS NYYMY GGTNFGRTSGGP+ TSYDYDAP+DEYG
Sbjct: 270 KPDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYG 329
Query: 343 LLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLA 402
L +PK+GHLK+LH+ +K E LV +Y V +Y S+ + F+
Sbjct: 330 NLRQPKYGHLKELHSVLKSMEKTLV---HGEYFDTNYGDNITV---TKYTLDSSSACFIN 383
Query: 403 NIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNIS 462
N + +VT G ++ LP WSVSILPDC+ FN+AK+ +QTS+
Sbjct: 384 NRFDDKDVNVTLDGATHLLPAWSVSILPDCKTVAFNSAKIKTQTSVM------------- 430
Query: 463 VPQQSMIESKLSSTSKSWMTVK-EPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIY 521
V + + E + S SWM P + NF +LE + + D SDYLW+ T
Sbjct: 431 VKKPNTAEQEQESLKWSWMPENLSPFMTDEKGNFRKNELLEQIVTSTDQSDYLWYRT--- 487
Query: 522 VSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG---SVIGHWV-KVVQPVEFQSGY 577
S E + +++ L F+NG+L G S G +V ++ PV+ G
Sbjct: 488 ------SLNHKGEGSYKLYVNTTGHELYAFVNGKLIGKNHSADGDFVFQLESPVKLHDGK 541
Query: 578 NDLILLSQTVGLQNYGAFLEKDGAGF-RGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQ 636
N + LLS TVGL+NYG EK G G VKL IDLS W+Y+ GL E++Q
Sbjct: 542 NYISLLSATVGLKNYGPSFEKMPTGIVGGPVKLIDSNGTAIDLSNSSWSYKAGLASEYRQ 601
Query: 637 IYSIEENEAEWTDLTRDGIP--STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHI 694
I+ +++ +W IP FTWYK F+AP G D V +DL + KG AWVNG+++
Sbjct: 602 IH-LDKPGYKWNG-NNGTIPINRPFTWYKATFEAPSGEDAVVVDLLGLNKGVAWVNGNNL 659
Query: 695 GRYWT--VVAPKGGCQDTCDYRGAYNSD----KCTTNCGNPTQTWYHVPRSWLQASN-NL 747
GRYW A GC CDYRGA+ ++ +C T CG P+Q +YHVPRS+L A N
Sbjct: 660 GRYWPSYTAAEMAGCH-RCDYRGAFQAEGDGTRCLTGCGEPSQRYYHVPRSFLAAGEPNT 718
Query: 748 LVIFEETGGNPFEISVKLRSTRIVC 772
L++FEE GG+P ++++ VC
Sbjct: 719 LLLFEEAGGDPSGVALRTVVPGPVC 743
>gi|24417238|gb|AAN60229.1| unknown [Arabidopsis thaliana]
Length = 569
Score = 624 bits (1609), Expect = e-176, Method: Compositional matrix adjust.
Identities = 323/593 (54%), Positives = 395/593 (66%), Gaps = 41/593 (6%)
Query: 26 MMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKS 85
+ + + C SS ST V+YDH+A+II+G RR+LIS IHYPR+TPEMWPDLI K+
Sbjct: 11 IFLAILCFSSLIHST---EAVVTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKA 67
Query: 86 KEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNF 145
KEGG DVI+TYVFWN HE G Y F+ + D+VKF KLV +GLYL LRIGPYVCAEWNF
Sbjct: 68 KEGGLDVIQTYVFWNGHEPSPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNF 127
Query: 146 GGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYG 205
GGFPVWL+ +PG+ FRT+N PFK MQ+F KKIVD+M+EE LF QGGPII+ QIENEYG
Sbjct: 128 GGFPVWLKYVPGMVFRTDNEPFKIAMQKFTKKIVDMMKEEKLFETQGGPIILSQIENEYG 187
Query: 206 NMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYN 265
M+ G GK Y KW A MALGL GVPW+MCKQ DAP IID CNG+YC+G+KPNS N
Sbjct: 188 PMQWEMGAAGKAYSKWTAEMALGLSTGVPWIMCKQEDAPYPIIDTCNGFYCEGFKPNSDN 247
Query: 266 KPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG 325
KP LWTENW GW+T +GG +P+RPVED+AF+VARF Q GGSFMNYYMY GGTNF RT+ G
Sbjct: 248 KPKLWTENWTGWFTEFGGAIPNRPVEDIAFSVARFIQNGGSFMNYYMYXGGTNFDRTA-G 306
Query: 326 PFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHV 385
F TSYDYDAPIDEYGLL EPK+ HLK+LH IKLCEPALV+ D LG QE HV
Sbjct: 307 VFIATSYDYDAPIDEYGLLREPKYSHLKELHKVIKLCEPALVSVDPT-ITSLGDKQEIHV 365
Query: 386 YRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQ 445
++ S+++C+AFL+N D +AA V F G Y LPPWSVSILPDC+ +NTAK+ +
Sbjct: 366 FK-----SKTSCAAFLSNYDTSSAARVMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRAP 420
Query: 446 TSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSEN---NFTVQGILE 502
T + + +P +ST SW + E G S N F G++E
Sbjct: 421 TILMKM-----------IP---------TSTKFSWESYNE--GSPSSNEAGTFVKDGLVE 458
Query: 503 HLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG 562
+++T+D +DY W+ T I + D+ SF KT + P +TI S L VF+NG L G+ G
Sbjct: 459 QISMTRDKTDYFWYFTDITIGSDE-SFLKTGD-NPLLTIFSAGHALHVFVNGLLAGTSYG 516
Query: 563 HW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTG 611
+ Q ++ G N L LLS VGL N G E G G V L G
Sbjct: 517 ALSNSKLTFSQNIKLSVGINKLALLSTAVGLPNAGVHYETWNTGILGPVTLKG 569
>gi|222635782|gb|EEE65914.1| hypothetical protein OsJ_21762 [Oryza sativa Japonica Group]
Length = 579
Score = 624 bits (1608), Expect = e-176, Method: Compositional matrix adjust.
Identities = 309/585 (52%), Positives = 391/585 (66%), Gaps = 32/585 (5%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
+YDHR++ I+G RR+LIS IHYPR+TPEMWPDLI K+K+GG DVI+TYVFWN HE ++G
Sbjct: 23 TYDHRSLTINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQG 82
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
QY F + D+V+FVKLV +GLY+ LRIGPYVCAEWN+GGFPVWL+ +PGI FRT+N PF
Sbjct: 83 QYYFSDRYDLVRFVKLVKQAGLYVNLRIGPYVCAEWNYGGFPVWLKYVPGISFRTDNGPF 142
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
K MQ FV+KIV +M+ E LF WQGGPII+ Q+ENEYG MES G K YV WAA MA+
Sbjct: 143 KAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGSGAKSYVDWAAKMAV 202
Query: 228 GLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPH 287
AGVPW+MCKQ DAP+ +I+ CNG+YCD + PNS NKP++WTE W GW+T +GG +P
Sbjct: 203 ATNAGVPWIMCKQDDAPDPVINTCNGFYCDDFTPNSKNKPSMWTEAWSGWFTAFGGTVPQ 262
Query: 288 RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEP 347
RPVEDLAFAVARF Q+GGSF+NYYMY GGTNF RT+GGPF TSYDYDAPIDEYGLL +P
Sbjct: 263 RPVEDLAFAVARFIQKGGSFINYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQP 322
Query: 348 KWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEH 407
KWGHL +LH AIK E ALVA D +G ++A+V+R+ S +C+AFL+N
Sbjct: 323 KWGHLTNLHKAIKQAETALVAGDPT-VQNIGNYEKAYVFRS----SSGDCAAFLSNFHTS 377
Query: 408 TAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQS 467
AA V F G+ Y LP WS+S+LPDCR V+NTA V++ +S P N
Sbjct: 378 AAARVAFNGRRYDLPAWSISVLPDCRTAVYNTATVTAASS--------PAKMN------- 422
Query: 468 MIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDI 527
+ +W + E E FT G++E L++T D SDYLW+ T + + D
Sbjct: 423 ------PAGGFTWQSYGEATNSLDETAFTKDGLVEQLSMTWDKSDYLWYTTYVNI-DSGE 475
Query: 528 SFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLILL 583
F K+ + P +T+ S ++VF+NGQ G+ G + + V+ G N + +L
Sbjct: 476 QFLKSGQ-WPQLTVYSAGHSVQVFVNGQYFGNAYGGYDGPKLTYSGYVKMWQGSNKISIL 534
Query: 584 SQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQV 628
S VGL N G E G G V L+G G DLSK WTYQV
Sbjct: 535 SSAVGLPNVGTHYETWNIGVLGPVTLSGLNEGKRDLSKQKWTYQV 579
>gi|356518798|ref|XP_003528064.1| PREDICTED: beta-galactosidase 6-like [Glycine max]
Length = 717
Score = 624 bits (1608), Expect = e-176, Method: Compositional matrix adjust.
Identities = 337/743 (45%), Positives = 450/743 (60%), Gaps = 63/743 (8%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD R++IIDG R++L S IHYPR+TP+MWPDLIAK+K+GG DVI+TYVFWN HE
Sbjct: 27 VTYDGRSLIIDGQRKILFSGSIHYPRSTPQMWPDLIAKAKQGGLDVIQTYVFWNLHEPQP 86
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G Y+F G+ D+V F+K + + GLY+ LRIGP++ +EW +GGFP WL D+PGI +RT+N P
Sbjct: 87 GMYDFSGRYDLVGFIKEIQAQGLYVCLRIGPFIESEWTYGGFPFWLHDVPGIVYRTDNEP 146
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ F KIV++M+EE L++ QGGPII+ QIENEY N++ ++G G YV+WAA MA
Sbjct: 147 FKFYMQNFTTKIVNMMKEEGLYASQGGPIILSQIENEYQNIQKAFGTAGSQYVQWAAKMA 206
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDG--YKPNSYNKPTLWTENWDGWYTTWGGR 284
+GL GVPW+MCKQTDAP+ +I+ CNG C PNS NKP LWTENW +Y +GG
Sbjct: 207 VGLDTGVPWIMCKQTDAPDPVINTCNGMRCGETFTGPNSPNKPALWTENWTSFYQVYGG- 265
Query: 285 LPH-RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
LP+ R ED+AF V F R GS++NYYMY GGTNFGRT G + IT Y AP+DEYGL
Sbjct: 266 LPYIRSAEDIAFHVTLFIARNGSYVNYYMYHGGTNFGRT-GSAYVITGYYDQAPLDEYGL 324
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
L +PKWGHLK LH IK C L+ + LGQ E +V+ + C AFL N
Sbjct: 325 LRQPKWGHLKQLHEVIKSCSTTLLQGVQRNFT-LGQLLEVYVFEEEK----GECVAFLIN 379
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISV 463
D A+V F SY L P S+SILPDC+N F+TA V++ ++ + +
Sbjct: 380 NDRDNKATVQFRNSSYELLPKSISILPDCQNVTFSTANVNTTSNRRIIS----------- 428
Query: 464 PQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
P+Q+ S+ W ++ I + + +LE +N TKD SDYLW+ +
Sbjct: 429 PKQNF------SSVDDWQQFQDVISNFDNTSLKSDSLLEQMNTTKDKSDYLWYTLRFE-- 480
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG-HWVK---VVQPVEFQSGYND 579
+ + +PT+++ S V F+N G G H VK + PV G N+
Sbjct: 481 ------YNLSCSKPTLSVQSAAHVAHAFVNNTYIGGEHGNHDVKSFTLELPVTVNQGTNN 534
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS 639
L +LS VGL + GAFLE+ AG V+L + ++L+ W YQVGL GE Q+Y
Sbjct: 535 LSILSVMVGLPDSGAFLERRFAGLI-SVELQCSEQESLNLTNSTWGYQVGLMGEQLQVYK 593
Query: 640 IEEN-EAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
+ N + W+ L + + T WYKT FD P+G DPV LDL SMGKG+AWVNG IGRYW
Sbjct: 594 EQNNSDTGWSQLG-NVMEQTLFWYKTTFDTPEGDDPVVLDLSSMGKGEAWVNGESIGRYW 652
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
+ ++ K GNP+Q+ YHVPRS+L+ S N+LV+ EE GGNP
Sbjct: 653 IL----------------FHDSK-----GNPSQSLYHVPRSFLKDSGNVLVLLEEGGGNP 691
Query: 759 FEISVKLRSTRIVCEQVSESHYP 781
IS+ S + + S+ P
Sbjct: 692 LGISLDTVSVTDLQQNFSKLSLP 714
>gi|449529068|ref|XP_004171523.1| PREDICTED: beta-galactosidase 16-like [Cucumis sativus]
Length = 756
Score = 622 bits (1605), Expect = e-175, Method: Compositional matrix adjust.
Identities = 345/784 (44%), Positives = 465/784 (59%), Gaps = 79/784 (10%)
Query: 77 MWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIG 136
MWP LIAK+KEGG DVI+TYVFWN HE +G Y F G+ DIV+FVK + + GLY LRIG
Sbjct: 1 MWPSLIAKAKEGGIDVIQTYVFWNLHEPQQGTYEFSGRRDIVRFVKEIQAQGLYACLRIG 60
Query: 137 PYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPII 196
P++ AEW++GG P WL D+ GI +R++N PFK MQ F KIV++M+ E L++ QGGPII
Sbjct: 61 PFIEAEWSYGGLPFWLHDVLGIVYRSDNEPFKLHMQNFTTKIVNMMKSEGLYASQGGPII 120
Query: 197 MLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYC 256
+ QIENEY +E+++G++G YV+WAA MA+ L GVPW MCKQ DAP+ +I+ CNG C
Sbjct: 121 LSQIENEYTLVEAAFGEKGPPYVQWAAKMAVSLQTGVPWSMCKQNDAPDPVINTCNGMRC 180
Query: 257 DG--YKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFF-QRGGSFMNYYMY 313
PNS NKP++WTENW +Y T+G R E++AF VA F + G+++NYYMY
Sbjct: 181 GETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIAAKNGTYVNYYMY 240
Query: 314 FGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQ 373
GGTNFGR S F IT Y +P+DEYGL EPKWGHLK+LHAA+KLC L+ +
Sbjct: 241 HGGTNFGR-SASAFMITGYYDQSPLDEYGLTREPKWGHLKELHAAVKLCSTPLLTGTKSN 299
Query: 374 YIKLGQNQEAHVYRANRYGSQSNCSAFLAN---IDEHTAASVTFLGQSYTLPPWSVSILP 430
+ LGQ+ EA V++ + C+AFL N ID ++V F +Y LP S+SILP
Sbjct: 300 F-SLGQSVEAIVFKT----ESNECAAFLVNRGAID----SNVLFQNVTYELPLGSISILP 350
Query: 431 DCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVW 490
DC+N FNT +VS Q + +++ ++V + ++E W KEPI
Sbjct: 351 DCKNVAFNTRRVSVQHNTRSM---------MAVQKFDLLE---------WEEFKEPIPNI 392
Query: 491 SENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRV 550
+ +LEH+ TKD SDYLW+ ++ D + T+ +DS L
Sbjct: 393 DDTELRANELLEHMGTTKDRSDYLWYTFRVQQDSPD--------SQQTLEVDSRAHALHA 444
Query: 551 FINGQLTGSVIGHWVK----VVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQ 606
F+NG GS G + + + + + ++G N++ LLS VGL + GAFLE AG R +
Sbjct: 445 FVNGDYAGSAHGIYKEKGFSLAKNITLRNGINNISLLSVMVGLPDSGAFLETRVAGLR-R 503
Query: 607 VKLTGFKNGDIDLSKILWTYQVGLKGEFQQIY-SIEENEAEWTDLTRDGIPSTFTWYKTY 665
V + G D S+ W Y+VGL GE QI+ + +W+ L P TWYKT
Sbjct: 504 VGIQG-----EDFSEQHWGYKVGLSGEQSQIFLDTGSSNVQWSRLGNSSQP--LTWYKTQ 556
Query: 666 FDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTT 724
FDAP G DP+AL+LGSMGKG WVNG IGRYW + + PK
Sbjct: 557 FDAPPGDDPIALNLGSMGKGAVWVNGRGIGRYWVSFLTPK-------------------- 596
Query: 725 NCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVR 784
G P+Q WY+VPRS+L+ ++N LVI EE GNP EIS+ C QVSESHYP V
Sbjct: 597 --GEPSQKWYNVPRSFLKPTDNQLVILEEETGNPVEISLDSVLITKTCGQVSESHYPLVA 654
Query: 785 KWSNSYSVDGKLSINKM-APEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMS 843
W + + N+ P++ L C IS+I FAS+GTP G CQ ++ G CH+P S
Sbjct: 655 SWMGAKKQKVRRVKNRTRRPKVQLSCPSKKKISNILFASFGTPSGDCQSYAIGLCHSPNS 714
Query: 844 LSVV 847
++V
Sbjct: 715 RAIV 718
>gi|357464801|ref|XP_003602682.1| Beta-galactosidase [Medicago truncatula]
gi|355491730|gb|AES72933.1| Beta-galactosidase [Medicago truncatula]
Length = 719
Score = 622 bits (1605), Expect = e-175, Method: Compositional matrix adjust.
Identities = 339/770 (44%), Positives = 463/770 (60%), Gaps = 70/770 (9%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
MMM++ ++ LS + V+YD R++II+G R +L S IHYPR+TP+MWP
Sbjct: 8 MMMLVAILELSFGVKGAEE-------VTYDGRSLIINGQRNILFSGSIHYPRSTPQMWPG 60
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
LIAK+K+GG DVI+TYVFWN HE G+Y+F G+ND+V F+K + + GLY+ LRIGP++
Sbjct: 61 LIAKAKQGGLDVIQTYVFWNLHEPQPGKYDFSGRNDLVGFIKEIHAQGLYVSLRIGPFIE 120
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
+EWN+GGFP WL D+PGI +RT+N PFK MQ F KIV++M+EE L++ QGGPII+ QI
Sbjct: 121 SEWNYGGFPFWLHDVPGIVYRTDNEPFKFYMQNFTTKIVNMMKEEGLYASQGGPIILSQI 180
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDG-- 258
ENEYGN++ ++G G YV+WAA MA+GL GVPWVMCKQ DAP+ +I+ CNG C
Sbjct: 181 ENEYGNIQKAFGTAGSQYVEWAAKMAVGLNTGVPWVMCKQPDAPDPVINTCNGMRCGETF 240
Query: 259 YKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTN 318
PNS NKP +WTENW +Y +GG R ED+AF V F R GSF+NYYMY GGTN
Sbjct: 241 TGPNSPNKPAMWTENWTSFYQVYGGVPYIRSAEDIAFHVTLFVARNGSFVNYYMYHGGTN 300
Query: 319 FGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLG 378
FGRTS + IT Y AP+DEYGL +PKWGHLK+LHAAIK C L+ + LG
Sbjct: 301 FGRTSSA-YMITGYYDQAPLDEYGLFRQPKWGHLKELHAAIKSCSTTLLQGVQRNF-SLG 358
Query: 379 QNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFN 438
+ QE +V+ C+AFL N D+ +V F SY L P S+SILPDC+N FN
Sbjct: 359 ELQEGYVFEE----ENGKCAAFLINNDKGNTVTVQFNNSSYKLLPKSISILPDCQNVAFN 414
Query: 439 TAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQ 498
TA +++ ++ + + +Q+ S+ W ++ I + + +
Sbjct: 415 TAHLNTTSNRRII-----------TSRQNF------SSVDDWKQFQDVIPNFDDTSLRSD 457
Query: 499 GILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG 558
+LE +N TKD SDYLW+ ++ ++++S N+ P + + S V F+N G
Sbjct: 458 SLLEQMNTTKDKSDYLWYTLRL---ENNLS---CND--PILHVQSSAHVAYAFVNNTYIG 509
Query: 559 SVIG-HWVK---VVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKN 614
G H VK + P+ N++ +LS VGL + GAFLEK AG V+L +
Sbjct: 510 GEHGNHDVKSFTLELPITLNERTNNISILSGMVGLPDSGAFLEKRFAGLN-NVELQCSEQ 568
Query: 615 GDIDLSKILWTYQVGLKGEFQQIYSIEEN--EAEWTDLTRDGIPS-TFTWYKTYFDAPDG 671
++L+ W YQVGL GE ++Y+ E+N + +WT L I T TWYKT FD P G
Sbjct: 569 ESLNLNNSTWGYQVGLLGEQLKVYT-EQNSTDIKWTQLGNITIDEVTLTWYKTTFDTPKG 627
Query: 672 IDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQ 731
DP+ALDL SM KG+AWVNG IGRYW + GNP+Q
Sbjct: 628 DDPIALDLSSMAKGEAWVNGQSIGRYWILFLDSK---------------------GNPSQ 666
Query: 732 TWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYP 781
+ YHVPRS+L+ S N LV+ +E GGNP +IS+ S + + S+ +P
Sbjct: 667 SLYHVPRSFLKDSENSLVLLDEGGGNPLDISLNTVSVTDLQDNFSKLPFP 716
>gi|227053532|gb|ACP18874.1| beta-galactosidase pBG(b) [Carica papaya]
Length = 514
Score = 622 bits (1604), Expect = e-175, Method: Compositional matrix adjust.
Identities = 301/512 (58%), Positives = 377/512 (73%), Gaps = 24/512 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYDH+AI I+G RR+L+S IHYPR+TPEMWPDLI K+KEGG DVI+TYVFWN HE
Sbjct: 20 SVSYDHKAITINGKRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPS 79
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+Y F G D+V+F+KLV +GLY+ LRIGPYVCAEWNFGGFPVWL+ IPGI FRTNN
Sbjct: 80 PGKYYFGGNYDLVRFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGIAFRTNNG 139
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQRF KKIVD+M+ E LF QGGPII+ QIENEYG ME G G+ Y +WAA M
Sbjct: 140 PFKAYMQRFTKKIVDMMKAEGLFESQGGPIILSQIENEYGPMEYELGAAGRAYSQWAAQM 199
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GLG GVPWVMCKQ DAP+ II++CNG+YCD + PN KP +WTE W GW+T +GG +
Sbjct: 200 AVGLGTGVPWVMCKQDDAPDPIINSCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGAV 259
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RPVEDLAF+VARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL+
Sbjct: 260 PYRPVEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLVR 319
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
+PKWGHLKDLH AIKLCEPALV+ D + + LG+ QEAHV+++ +YG +C+AFLAN +
Sbjct: 320 QPKWGHLKDLHRAIKLCEPALVSGDPS-VMPLGRFQEAHVFKS-KYG---HCAAFLANYN 374
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ A V F Y LPPWS+SILPDC+NTV+NTA+V +Q++ + +P+ + +
Sbjct: 375 PRSFAKVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQSARMKM---VPVPIHGAFSW 431
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
Q+ E SS E +FT G++E +N T+D SDYLW+ T + + D
Sbjct: 432 QAYNEEAPSSN--------------GERSFTTVGLVEQINTTRDVSDYLWYSTDVKI-DP 476
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLT 557
D F KT + PT+T+ S L VF+N QL+
Sbjct: 477 DEGFLKTGKY-PTLTVLSAGHALHVFVNDQLS 507
>gi|326520333|dbj|BAK07425.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 841
Score = 622 bits (1603), Expect = e-175, Method: Compositional matrix adjust.
Identities = 329/811 (40%), Positives = 466/811 (57%), Gaps = 58/811 (7%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
++YD R+++IDG R + S IHYPR+ WPDLIA++KEGG +VIE+YVFWN HE
Sbjct: 36 ITYDRRSLMIDGRREIFFSGSIHYPRSPFHEWPDLIARAKEGGLNVIESYVFWNIHEPEM 95
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G YNF+G+ D++KF KL+ ++ +RIGP+V AEWN GG P WLR++P I FRT+N P
Sbjct: 96 GVYNFEGRYDMIKFFKLIQEHEMFAMVRIGPFVQAEWNHGGLPYWLREVPDIVFRTDNEP 155
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
+K+ MQ+FV +V+ +++ LF+ QGGPII+ QIENEY +ME+++ + G Y+ WAA MA
Sbjct: 156 YKKLMQKFVTLVVNKLKDAKLFASQGGPIILAQIENEYQHMEAAFKENGTRYIDWAAKMA 215
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGY--KPNSYNKPTLWTENWDGWYTTWGGR 284
+ GVPW+MCKQT AP +I CNG +C P NKP LWTENW Y +G
Sbjct: 216 ISTSTGVPWIMCKQTKAPAEVIPTCNGRHCGDTWPGPTDKNKPLLWTENWTAQYRVFGDP 275
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
R ED+AFAVARFF GGS +NYYMY GGTNFGRT G F + Y +AP+DE+G+
Sbjct: 276 PSQRSAEDIAFAVARFFSVGGSMVNYYMYHGGTNFGRT-GASFVMPRYYDEAPLDEFGMY 334
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
EPKWGHL+DLH A++LC+ AL+ + + LG+ EA ++ Q C AFL+N
Sbjct: 335 KEPKWGHLRDLHHALRLCKKALLRGNPSTQ-PLGKLYEARLFEIP---EQKVCVAFLSNH 390
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
+ +VTF GQ Y +P SVSIL DC+ VF+T V++Q + +T +
Sbjct: 391 NTKEDGTVTFRGQQYFVPRRSVSILADCKTVVFSTQHVNAQHNQRTFHLT---------- 440
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
+ L + T + + + + LE N+TKD +DYLW+ T +
Sbjct: 441 -----DQTLQNNVWEMYTEGDKVPTYKFTTDRSEKPLEAYNMTKDKTDYLWYTTSFKLEA 495
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYNDL 580
+D+ F +++P + S + F+NG+L G+ G + + +P+E ++G N +
Sbjct: 496 EDLPF--RQDIKPVLEASSHGHAMVAFVNGKLVGAAHGTKMNKAFSLEKPIEVRAGINHV 553
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
+LS T+GLQ+ GA+LE AG V + G G +DLS W + VGL GE +Q +
Sbjct: 554 SILSSTLGLQDSGAYLEHRQAGVH-SVTIQGLNTGTLDLSSNGWGHIVGLDGERKQAHMD 612
Query: 641 EENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ E +W D +P TWY+ FD P G DPV +DL MGKG +VNG +GRYW+
Sbjct: 613 KGGEVQWKPAVFD-LP--LTWYRRRFDMPSGEDPVVIDLNPMGKGILFVNGEGLGRYWS- 668
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
Y+ A G P+Q YHVPR +L+ + N+L IFEE GG P
Sbjct: 669 -----------SYKHA---------LGRPSQYLYHVPRCFLKPTGNVLTIFEEEGGRPDA 708
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSI--NKMAPEMHLHCQDGYIISSI 818
I + +C +SE + VR W D +L++ + + P L C + I +
Sbjct: 709 IMILTVKRDNICSFISEKNPGHVRSWERK---DSQLTVVADDLKPRAVLTCPEKKTIQQV 765
Query: 819 EFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
FASYG P G C ++ GNCH P + VV +
Sbjct: 766 VFASYGNPLGICGNYTVGNCHTPKAKEVVEK 796
>gi|357142200|ref|XP_003572492.1| PREDICTED: beta-galactosidase 11-like [Brachypodium distachyon]
Length = 823
Score = 620 bits (1600), Expect = e-175, Method: Compositional matrix adjust.
Identities = 327/813 (40%), Positives = 469/813 (57%), Gaps = 59/813 (7%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
+++D R++++DG R + S IHYPR+ P MWPDLIA++KEGG +VIE+YVFWN HE
Sbjct: 15 ITFDRRSLMVDGRRDLFFSGSIHYPRSPPHMWPDLIARAKEGGLNVIESYVFWNGHEPEM 74
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G YNF+G+ D++KF KLV ++ +RIGP+V AEWN GG P WLR++P I FRTNN P
Sbjct: 75 GVYNFEGRYDMIKFFKLVQEHEMFAMVRIGPFVQAEWNHGGLPYWLREVPDIIFRTNNEP 134
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK+ MQ+FV IV+ +++ LF+ QGGPII+ QIENEY ++E+++ + G Y+ WAA MA
Sbjct: 135 FKKHMQKFVTMIVNKLKDAKLFASQGGPIILAQIENEYQHLEAAFKENGTTYIHWAAKMA 194
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGY--KPNSYNKPTLWTENWDGWYTTWGGR 284
L GVPW+MCKQT AP +I CNG +C P NKP LWTENW Y +G
Sbjct: 195 SDLNIGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPTDKNKPLLWTENWTAQYRVFGDP 254
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
R ED+AFAVARF+ GG+ +NYYMY GGTNFGRT G F + Y +AP+DE+GL
Sbjct: 255 PSQRSAEDIAFAVARFYSVGGTMVNYYMYHGGTNFGRT-GASFVMPRYYDEAPLDEFGLY 313
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
EPKWGHL+DLH A++LC+ A++ + + LG+ EA ++ Q C AFL+N
Sbjct: 314 KEPKWGHLRDLHHALRLCKKAILWGNPSNQ-PLGKLYEARLFEIP---EQKICVAFLSNH 369
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
+ +VTF GQ Y +P SVSIL DC+ VF+T V+SQ + +T FS
Sbjct: 370 NTKEDGTVTFRGQQYFVPRRSVSILADCKTVVFSTQHVNSQHNQRTFHFS---------- 419
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
+ + T + + + N Q LE N+TKD +DY+W+ T +
Sbjct: 420 -----DQTVQGNVWEMYTESDKVPTYKFTNIRTQKPLEAYNLTKDKTDYVWYTTSFKLEA 474
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKV------VQPVEFQSGYN 578
+D+ F K ++ P + + S + F+NG+ G+ GH K+ +P+E ++G N
Sbjct: 475 EDLPFRK--DIWPVLEVSSHGHAMVAFVNGKYVGA--GHGTKINKAFTMEKPIEVRTGIN 530
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIY 638
+ +LS T+G+Q+ G +LE AG G V + G G +DL+ W + VGL+GE + +
Sbjct: 531 HVSILSTTLGMQDSGVYLEHRQAGIDG-VTIQGLNTGTLDLTSNGWGHLVGLEGERRNAH 589
Query: 639 SIEENEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
+ + + +W D TWY+ FD P G DPV +D+ MGKG +VNG +GRY
Sbjct: 590 TEKGGDGVQWVPAVFD---RPLTWYRRRFDIPTGDDPVVIDMSPMGKGVLYVNGEGLGRY 646
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEET-GG 756
W+ Y+ A G P+Q YHVPR +L+ + N++ IFEE GG
Sbjct: 647 WS------------SYKHA---------LGRPSQYLYHVPRCFLKPTGNVMTIFEEEGGG 685
Query: 757 NPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIIS 816
P I + +C +SE + V+ W S ++ + P+ L C + +I
Sbjct: 686 QPDGIMILTVKRDNICSFISEKNPAHVKSWERKDSHLKSVADADLKPQAVLSCPEKKLIQ 745
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+ FASYG P G C ++ GNCHAP + +V +
Sbjct: 746 QVVFASYGNPLGICGNYTVGNCHAPKAKEIVEK 778
>gi|356541034|ref|XP_003538988.1| PREDICTED: beta-galactosidase 13-like, partial [Glycine max]
Length = 806
Score = 618 bits (1593), Expect = e-174, Method: Compositional matrix adjust.
Identities = 315/811 (38%), Positives = 478/811 (58%), Gaps = 59/811 (7%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD R++II+G R +L S IHYPR+TPE W ++ K+++GG +V++TYVFWN HE+ +
Sbjct: 9 VTYDGRSLIINGRRELLFSGSIHYPRSTPEEWAGILDKARQGGINVVQTYVFWNIHETEK 68
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G+Y+ + + D +KF+KL+ G+Y+ LR+GP++ AEWN GG P WLR++P I FR+NN P
Sbjct: 69 GKYSIEPQYDYIKFIKLIQKKGMYVTLRVGPFIQAEWNHGGLPYWLREVPEIIFRSNNEP 128
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK+ M+++V ++ +++ LF+ QGGPII+ QIENEY +++ ++ ++G +YV+WAA MA
Sbjct: 129 FKKHMKKYVSTVIKTVKDANLFAPQGGPIILAQIENEYNHIQRAFREEGDNYVQWAAKMA 188
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGGR 284
+ L GVPW+MCKQTDAP+ +I+ACNG +C D + PN KP +WTENW Y +G
Sbjct: 189 VSLDIGVPWIMCKQTDAPDPVINACNGRHCGDTFSGPNKPYKPAIWTENWTAQYRVFGDP 248
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
R ED+AF+VARFF + GS +NYYMY GGTNFGRTS F T Y +AP+DEYG+
Sbjct: 249 PSQRSAEDIAFSVARFFSKNGSLVNYYMYHGGTNFGRTSSA-FTTTRYYDEAPLDEYGMQ 307
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
EPKW HL+D+H A+ LC+ AL S K+ Q+ E V+ + GS C+AF+ N
Sbjct: 308 REPKWSHLRDVHRALSLCKRALFNGAST-VTKMSQHHEVIVFE--KPGSNL-CAAFITNN 363
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
+++F G Y +PP S+SILPDC+ VFNT ++SQ S + + S
Sbjct: 364 HTKVPTTISFRGTDYYMPPRSISILPDCKTVVFNTQCIASQHSSRNFKRS---------- 413
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
+++ W E I + + +E ++ KD SDY W+ T + +
Sbjct: 414 --------MAANDHKWEVYSETIPTTKQIPTHEKNPIELYSLLKDTSDYAWYTTSVELRP 465
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDL 580
+D+ K N++ + I S+ L F+NG+ GS G + +PV + G N +
Sbjct: 466 EDLP--KKNDIPTILRIMSLGHSLLAFVNGEFIGSNHGSHEEKGFEFQKPVTLKVGVNQI 523
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
+L+ TVGL + GA++E AG + + + G +G +DL+ W ++VG+KGE I++
Sbjct: 524 AILASTVGLPDSGAYMEHRFAGPK-SIFILGLNSGKMDLTSNGWGHEVGIKGEKLGIFTE 582
Query: 641 E-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW- 698
E + +W + G +WYKT F P+G DPVA+ + MGKG W+NG IGR+W
Sbjct: 583 EGSKKVQWKEAKGPG--PAVSWYKTNFATPEGTDPVAIRMTGMGKGMVWINGKSIGRHWM 640
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
+ ++P G PTQ+ YH+PR++ +NLLV+FEE NP
Sbjct: 641 SYLSP----------------------LGQPTQSEYHIPRTYFNPKDNLLVVFEEEIANP 678
Query: 759 FEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSI 818
++ + + +C V+E+H P V+ W+ S + +N + P L C I ++
Sbjct: 679 EKVEILTVNRDTICSFVTENHPPNVKSWAIK-SEKFQAVVNDLVPSASLKCPHQRTIKAV 737
Query: 819 EFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
EFAS+G P G C F+ G C+AP +V +
Sbjct: 738 EFASFGDPAGACGAFALGKCNAPAIKQIVEK 768
>gi|356507439|ref|XP_003522474.1| PREDICTED: beta-galactosidase 6-like [Glycine max]
Length = 717
Score = 616 bits (1589), Expect = e-173, Method: Compositional matrix adjust.
Identities = 336/747 (44%), Positives = 450/747 (60%), Gaps = 63/747 (8%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
K V+YD R++IIDG R++L S IHYPR+TP+MWPDLIAK+K+GG DVI+TYVFWN H
Sbjct: 23 KAEEVTYDGRSLIIDGQRKILFSGLIHYPRSTPQMWPDLIAKAKQGGLDVIQTYVFWNLH 82
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E G Y+F+G+ D+V F+K + + GLY+ LRIGP++ +EW +GGFP WL D+PGI +RT
Sbjct: 83 EPQPGMYDFRGRYDLVGFIKEIQAQGLYVCLRIGPFIQSEWKYGGFPFWLHDVPGIVYRT 142
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWA 222
+N FK MQ F KIV++M+EE L++ QGGPII+ QIENEY N++ ++G G YV+WA
Sbjct: 143 DNESFKFYMQNFTTKIVNMMKEEGLYASQGGPIILSQIENEYQNIQKAFGTAGSQYVQWA 202
Query: 223 ASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDG--YKPNSYNKPTLWTENWDGWYTT 280
A MA+GL GVPWVMCKQTDAP+ +I+ CNG C PNS NKP LWTENW +Y
Sbjct: 203 AKMAVGLNTGVPWVMCKQTDAPDPVINTCNGMRCGETFTGPNSPNKPALWTENWTSFYQV 262
Query: 281 WGGRLPH-RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPID 339
+GG LP+ R ED+AF V F R GS++NYYMY GGTNFGRT+ + IT Y AP+D
Sbjct: 263 YGG-LPYIRSAEDIAFHVTLFIARNGSYVNYYMYHGGTNFGRTASA-YVITGYYDQAPLD 320
Query: 340 EYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSA 399
EYGLL +PKWGHLK LH IK C L+ + LGQ QE +V+ + C A
Sbjct: 321 EYGLLRQPKWGHLKQLHEVIKSCSTTLLQGVQRNF-SLGQLQEGYVFEEEK----GECVA 375
Query: 400 FLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSP 459
FL N D +V F +SY L P S+SILPDC+N FNTA V++ ++ + +
Sbjct: 376 FLKNNDRDNKVTVQFRNRSYELLPRSISILPDCQNVAFNTANVNTTSNRRIIS------- 428
Query: 460 NISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQ 519
P+Q+ S+ W ++ I + + +LE +N TKD SDYLW+ +
Sbjct: 429 ----PKQNF------SSLDDWKQFQDVIPYFDNTSLRSDSLLEQMNTTKDKSDYLWYTLR 478
Query: 520 IYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG-HWVK---VVQPVEFQS 575
+ + +PT+++ S V FIN G G H VK + PV
Sbjct: 479 FE--------YNLSCRKPTLSVQSAAHVAHAFINNTYIGGEHGNHDVKSFTLELPVTVNQ 530
Query: 576 GYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQ 635
G N+L +LS VGL + GAFLE+ AG V+L + ++L+ W YQVGL GE
Sbjct: 531 GTNNLSILSAMVGLPDSGAFLERRFAGLIS-VELQCSEQESLNLTNSTWGYQVGLLGEQL 589
Query: 636 QIYSIEEN-EAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHI 694
Q+Y + N + W+ L + + WYKT FD P+G DPV LDL SMGKG+AWVN I
Sbjct: 590 QVYKKQNNSDIGWSQLG-NIMEQLLIWYKTTFDTPEGDDPVVLDLSSMGKGEAWVNEQSI 648
Query: 695 GRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEET 754
GRYW + ++ K GNP+Q+ YHVPRS+L+ + N+LV+ EE
Sbjct: 649 GRYWIL----------------FHDSK-----GNPSQSLYHVPRSFLKDTGNVLVLVEEG 687
Query: 755 GGNPFEISVKLRSTRIVCEQVSESHYP 781
GGNP IS+ S + + S+ P
Sbjct: 688 GGNPLGISLDTVSVIDLQQNFSKLTLP 714
>gi|296082606|emb|CBI21611.3| unnamed protein product [Vitis vinifera]
Length = 729
Score = 615 bits (1586), Expect = e-173, Method: Compositional matrix adjust.
Identities = 337/750 (44%), Positives = 455/750 (60%), Gaps = 62/750 (8%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD R++IIDG+R++L S IHYPR+TP+MW LIAK+KEGG DVI+TYVFWN HE
Sbjct: 26 VTYDGRSLIIDGHRKILFSGSIHYPRSTPQMWASLIAKAKEGGVDVIQTYVFWNRHEPQP 85
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY+F G+ D+ KF+K + + GLY LRIGP++ +EW++GG P WL D+ GI +RT+N P
Sbjct: 86 GQYDFNGRYDLAKFIKEIQAQGLYACLRIGPFIESEWSYGGLPFWLHDVHGIVYRTDNEP 145
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ F KIV+LM+ E L++ QGGPII+ QIENEY N+E+++ ++G YV+WAA MA
Sbjct: 146 FKFYMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYQNIEAAFNEKGPSYVRWAAKMA 205
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDG--YKPNSYNKPTLWTENWDGWYTTWGGR 284
+ L GVPWVMCKQ+DAP+ +I+ CNG C PNS NKP++WTENW +Y +GG
Sbjct: 206 VELQTGVPWVMCKQSDAPDPVINTCNGMRCGQTFTGPNSPNKPSMWTENWTSFYEVFGGE 265
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
R ED+AF VA F R GS++NYYMY GGTNFGR S + TSY AP+DEYGL+
Sbjct: 266 TYLRSAEDIAFHVALFIARNGSYVNYYMYHGGTNFGRASSA-YIKTSYYDQAPLDEYGLI 324
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
+PKWGHLK+LHAAI LC L+ + I LGQ QEA+V++ G C AFL N
Sbjct: 325 RQPKWGHLKELHAAITLCSTPLLNGVQSN-ISLGQLQEAYVFQEEMGG----CVAFLVNN 379
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKV---SSQTSIKTVEFSLPLSPNI 461
DE ++V F S L P S+SILPDC+N +FNTAKV S Q++ K E S
Sbjct: 380 DEGNNSTVLFQNVSIELLPKSISILPDCKNVIFNTAKVCSSSRQSAYKIQELS------- 432
Query: 462 SVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIY 521
+S I+S W K+ I + + + ILEH+N+TKD SDYLW+ +
Sbjct: 433 ----RSCIQS--FDAVDRWEEYKDAIPNFLDTSLKSNMILEHMNMTKDESDYLWYTFRFQ 486
Query: 522 VSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG-HWVK---VVQPVEFQSGY 577
+ ++ P + I+S+ + F+N G+ G H +K P+ +
Sbjct: 487 PN--------SSCTEPLLHIESLAHAVHAFVNNIYVGATHGSHDMKGFTFKSPISLNNEM 538
Query: 578 NDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI 637
N++ +LS VG + GA+LE AG +V++ + G D + W YQVGL GE I
Sbjct: 539 NNISILSVMVGFPDSGAYLESRFAGLT-RVEIQCTEKGIYDFANYTWGYQVGLSGEKLHI 597
Query: 638 YSIEEN--EAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
Y EEN EW T TWYK F+ P G DPVAL+L +MGKG+AWVNG IG
Sbjct: 598 YK-EENLSNVEWRK-TEISTNQPLTWYKIVFNTPSGDDPVALNLSTMGKGEAWVNGQSIG 655
Query: 696 RYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
RYW ++++ K G+P+QT YHVPR++L+ S NLLV+ EE
Sbjct: 656 RYWV----------------SFHNSK-----GDPSQTLYHVPRAFLKTSENLLVLLEEAN 694
Query: 756 GNPFEISVKLRSTRIVCEQVSESHYPPVRK 785
G+P IS++ S + + V H P ++
Sbjct: 695 GDPLHISLETISRTDLPDHVLYHHLPQEKQ 724
>gi|413925747|gb|AFW65679.1| hypothetical protein ZEAMMB73_601729 [Zea mays]
Length = 846
Score = 615 bits (1585), Expect = e-173, Method: Compositional matrix adjust.
Identities = 322/815 (39%), Positives = 471/815 (57%), Gaps = 66/815 (8%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYD R++IIDG R + S IHYPR+ P+MWP+LIAK+KEGG + IETY+FWN HE +
Sbjct: 41 VSYDRRSLIIDGRREIFFSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYIFWNIHEPEK 100
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQ++F+G+ DIV+F KL+ +Y +R+GP++ AEWN GG P WLR+IP I FRTNN P
Sbjct: 101 GQFDFEGRYDIVRFFKLIQEHNMYAMVRLGPFIQAEWNHGGLPYWLREIPDIVFRTNNEP 160
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
+K M+ FVK I+ +++ LF+ QGGPII+ QIENEY ++E+++ G Y+KWAA+MA
Sbjct: 161 YKMHMETFVKIIIKRLKDANLFASQGGPIILAQIENEYQHLEAAFKNDGTKYIKWAANMA 220
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNK--PTLWTENWDGWYTTWGGR 284
+ G+PW+MCKQT AP ++I CNG C P NK P LWTENW Y +G
Sbjct: 221 ISTNVGIPWIMCKQTKAPSDVIPTCNGRNCGDTWPGPMNKSMPLLWTENWTAQYRVFGDP 280
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
R ED+AFAVARFF GG+ NYYMY GGTNFGRTS F + Y +AP+DE+GL
Sbjct: 281 PSQRSAEDIAFAVARFFSVGGTMTNYYMYHGGTNFGRTSAA-FVMPKYYDEAPLDEFGLY 339
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
EPKWGHL+DLH A+KLC+ AL+ ++ KLG+ EA V+ Q C AFL+N
Sbjct: 340 KEPKWGHLRDLHLALKLCKKALLWGKTSTE-KLGKQFEARVFEI---PEQKVCVAFLSNH 395
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
+ ++TF GQSY +P S+SIL DC+ VF T V++Q + +T F
Sbjct: 396 NTKDDVTLTFRGQSYFVPRHSISILADCKTVVFGTQHVNAQHNQRTFHF----------- 444
Query: 465 QQSMIESKLSSTSKSW-MTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
+ ++ + W M +E + + ++ ++ + N+TKD +DY+W+ + +
Sbjct: 445 ------ADQTTQNNVWQMFDEEKVPKYKQSKIRLRKAGDLYNLTKDKTDYVWYTSSFKLE 498
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK------VVQPVEFQSGY 577
DD+ + +++ + ++S F+N + G GH K + +P++ + G
Sbjct: 499 ADDMPIRR--DIKTVLEVNSHGHASVAFVNTKFVGC--GHGTKMNKAFTLEKPMDLKKGV 554
Query: 578 NDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI 637
N + +L+ T+G+ + GA+LE AG +V++ G G +DL+ W + VGL GE +QI
Sbjct: 555 NHVAVLASTMGMMDSGAYLEHRLAGV-DRVQIKGLNAGTLDLTNNGWGHIVGLVGEQKQI 613
Query: 638 YSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGR 696
Y+ + W D TWYK +FD P G DP+ LD+ +MGKG +VNG IGR
Sbjct: 614 YTDKGMGSVTWKPAVND---RPLTWYKRHFDMPSGEDPIVLDMSTMGKGLMFVNGQGIGR 670
Query: 697 YWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG 756
YW Y+ A G P+Q YH+PRS+L+ +N+LV+FEE G
Sbjct: 671 YW------------ISYKHA---------LGRPSQQLYHIPRSFLRQKDNVLVLFEEEFG 709
Query: 757 NPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSIN--KMAPEMHLHCQDGYI 814
P I + +C +SE + ++ W D ++++ + P L C +
Sbjct: 710 RPDAIMILTVKRDNICTFISERNPAHIKSWERK---DSQITVTAADLKPRATLTCSPKKL 766
Query: 815 ISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
I + FASYG P G C ++ G+CH P + +V +
Sbjct: 767 IQQVVFASYGNPMGICGNYTIGSCHTPRAKELVEK 801
>gi|75116245|sp|Q67VU7.1|BGL10_ORYSJ RecName: Full=Putative beta-galactosidase 10; Short=Lactase 10;
Flags: Precursor
gi|51535501|dbj|BAD37397.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|51535704|dbj|BAD37722.1| putative beta-galactosidase [Oryza sativa Japonica Group]
Length = 809
Score = 613 bits (1581), Expect = e-172, Method: Compositional matrix adjust.
Identities = 337/822 (40%), Positives = 466/822 (56%), Gaps = 94/822 (11%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+Y+ R+++IDG RR++IS IHYPR+TPEMWPDLI K+KEGG D IETYVFWN HE R
Sbjct: 31 VTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
QYNF G DIV+F K + ++GLY LRIGPY+C EWN+GG P WLRDIPG++FR +NAP
Sbjct: 91 RQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNAP 150
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVKWAAS 224
F+ EM+ F IV+ M++ +F+ QGGPII+ QIENEYGN+ + + Q +Y+ W A
Sbjct: 151 FENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEYIHWCAD 210
Query: 225 MALGLGAGVPWVMCKQ-TDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGG 283
MA GVPW+MC+Q +D P N+++ CNG+YC + PN P +WTENW GW+ W
Sbjct: 211 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 270
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
HR ED+AFAVA FFQ+ GGP+ TSYDYDAP+DEYG
Sbjct: 271 PDFHRSAEDIAFAVAMFFQK-------------------RGGPYITTSYDYDAPLDEYGN 311
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
L +PK+GHLKDLH+ IK E LV +Y+ + + V +Y S + F+ N
Sbjct: 312 LRQPKYGHLKDLHSVIKSIEKILV---HGEYVDTNYSDKVTV---TKYTLDSTSACFINN 365
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISV 463
+++ +VT G ++ LP WSVSILPDC+ FN+AK+ +QT++ V
Sbjct: 366 RNDNMDVNVTLDGTTHLLPAWSVSILPDCKTVAFNSAKIKAQTTVM-------------V 412
Query: 464 PQQSMIESKLSSTSKSWMTVK-EPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
+ M+E + S SWM P + ++ +LE + + D SDYLW+ T I
Sbjct: 413 NKAKMVEKEPESLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSIN- 471
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG---SVIGHWV-KVVQPVEFQSGYN 578
E T+ +++ L F+NG L G S GH+V ++ P + G N
Sbjct: 472 --------HKGEASYTLFVNTTGHELYAFVNGMLVGQNHSPNGHFVFQLESPAKLHDGKN 523
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGF-RGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI 637
+ LLS T+GL+NYG EK AG G VKL IDLS W+Y+ GL GE++QI
Sbjct: 524 YISLLSATIGLKNYGPLFEKMPAGIVGGPVKLIDNNGKGIDLSNSSWSYKAGLAGEYRQI 583
Query: 638 YSIEENEAEWTDLTRDGIP--STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
+ +++ W D +P FTWYKT F AP G D V +DL + KG AWVNG+++G
Sbjct: 584 H-LDKPGCTW-DNNNGTVPINKPFTWYKTTFQAPAGEDTVVVDLLGLNKGVAWVNGNNLG 641
Query: 696 RYWT--VVAPKGGCQDTCDYRGAYNSD----KCTTNCGNPTQTWYHVPRSWLQASN-NLL 748
RYW A GGC CDYRG + ++ KC T CG P+Q +YHVPRS+L+ N +
Sbjct: 642 RYWPSYTAAEMGGCHH-CDYRGVFQAEGDGQKCLTGCGEPSQRFYHVPRSFLKNGEPNTV 700
Query: 749 VIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLH 808
++FEE GG+P +S + + VC ++ + L
Sbjct: 701 ILFEEAGGDPSHVSFRTVAAGSVCASA------------------------EVGDTITLS 736
Query: 809 C-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
C Q IS+I S+G +G+C + +G C + + +E
Sbjct: 737 CGQHSKTISAINVTSFGVARGQCGAY-KGGCESKAAYKAFTE 777
>gi|356518551|ref|XP_003527942.1| PREDICTED: beta-galactosidase 16-like [Glycine max]
Length = 697
Score = 612 bits (1579), Expect = e-172, Method: Compositional matrix adjust.
Identities = 324/725 (44%), Positives = 447/725 (61%), Gaps = 69/725 (9%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YD R++IIDG ++L S IHYPR+TP+MWP+LIAK+KEGG DVI+TYVFWN HE
Sbjct: 27 NVTYDGRSLIIDGQHKILFSGSIHYPRSTPQMWPNLIAKAKEGGLDVIQTYVFWNLHEPQ 86
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+GQY+F+G +IV+F+K + + GLY+ LRIGPY+ +E +GG P+WL DIPGI FR++N
Sbjct: 87 QGQYDFRGMRNIVRFIKEIQAQGLYVTLRIGPYIESECTYGGLPLWLHDIPGIVFRSDNE 146
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
FK MQ+F KIV+LM+ LF+ QGGPII+ QIENEYGN+E ++ ++G Y++WAA M
Sbjct: 147 QFKFHMQKFSAKIVNLMKSANLFASQGGPIILSQIENEYGNVEGAFHEKGLSYIRWAAQM 206
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGG 283
A+GL GVPWVMCKQ +AP+ +I+ CNG C +K PNS NKP+LWTENW +Y +G
Sbjct: 207 AVGLQTGVPWVMCKQDNAPDPVINTCNGMQCGKTFKGPNSPNKPSLWTENWTSFYQVFGE 266
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
R ED+A+ VA F + GS++NYYMY GGTNF R + F IT+Y +AP+DEYGL
Sbjct: 267 VPYIRSAEDIAYNVALFIAKRGSYVNYYMYHGGTNFDRIASA-FVITAYYDEAPLDEYGL 325
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
+ EPKWGHLK+LHAAIK C +++ + LG Q A+V++ S C+AFL N
Sbjct: 326 VREPKWGHLKELHAAIKSCSNSILHGTQTSF-SLGTQQNAYVFKR----SSIECAAFLEN 380
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISV 463
++ + ++ F Y LPP S+SILPDC+N FNTAKVS Q +
Sbjct: 381 TEDQS-VTIQFQNIPYQLPPNSISILPDCKNVAFNTAKVSIQNA---------------- 423
Query: 464 PQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
++M +++++W KE I + + + +L+ ++ TKD SDYLW+ ++Y +
Sbjct: 424 --RAMKSQLEFNSAETWKVYKEAIPSFGDTSLRANTLLDQISTTKDTSDYLWYTFRLYDN 481
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQ----SGYND 579
+ + ++ S VL F+NG L GS+ G + +E + +G N+
Sbjct: 482 SPN--------AQSILSAYSHGHVLHAFVNGNLVGSIHGSHKNLSFVMENKLNLINGMNN 533
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS 639
+ LS TVGL N GA+LE+ AG R +K+ G D + W YQ+GL GE QIY+
Sbjct: 534 ISFLSATVGLPNSGAYLERRVAGLR-SLKVQGR-----DFTNQAWGYQIGLLGEKLQIYT 587
Query: 640 IE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
++ +W P TWYKT FDAP G DPV L+LGSMGKG W+NG IGRYW
Sbjct: 588 ASGSSKVQWESFQSSTKP--LTWYKTTFDAPVGNDPVVLNLGSMGKGYTWINGQGIGRYW 645
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
T G P+Q WYH+PRS L+++ NLLV+ EE GNP
Sbjct: 646 V---------------------SFHTPQGTPSQKWYHIPRSLLKSTGNLLVLLEEETGNP 684
Query: 759 FEISV 763
I++
Sbjct: 685 LGITL 689
>gi|357449773|ref|XP_003595163.1| Beta-galactosidase [Medicago truncatula]
gi|355484211|gb|AES65414.1| Beta-galactosidase [Medicago truncatula]
Length = 607
Score = 611 bits (1575), Expect = e-172, Method: Compositional matrix adjust.
Identities = 316/588 (53%), Positives = 387/588 (65%), Gaps = 32/588 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YDH+AI+I+G RR+LIS IHYPR+TP+MWPDLI K+K+GG DVIETYVFWN HE
Sbjct: 27 SVTYDHKAIVINGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGVDVIETYVFWNGHEPS 86
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+G+Y F+ + D+VKF+K+V +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PG+ FRT+N
Sbjct: 87 QGKYYFEDRFDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVAFRTDNE 146
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ+F KIV +M+ E LF QGGPII+ QIENEYG +E G GK Y KW + M
Sbjct: 147 PFKAAMQKFTTKIVSIMKSENLFQSQGGPIILSQIENEYGPVEWEIGAPGKSYTKWFSQM 206
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+GL GVPWVMCKQ DAP+ IID CNGYYC+ + PN KP +WTENW GWYT +G +
Sbjct: 207 AVGLNTGVPWVMCKQEDAPDPIIDTCNGYYCENFSPNKNYKPKMWTENWTGWYTDFGTAV 266
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
P+RP EDLAF+VARF Q GS++NYYMY GGTNFGRTS G F TSYDYDAPIDEYGL+S
Sbjct: 267 PYRPAEDLAFSVARFVQNRGSYVNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYGLIS 326
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPKWGHL+DLH AIK CE ALV+ D G+N E H+Y+ S C+AFLAN D
Sbjct: 327 EPKWGHLRDLHKAIKQCESALVSVDPTVSWP-GKNLEVHLYKT----SFGACAAFLANYD 381
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ A V F Y LPPWS+SILPDC+ VFNTAKV + +++ N +
Sbjct: 382 TGSWAKVAFGNGHYDLPPWSISILPDCKTEVFNTAKVRAPRVHRSMT-----PANSAFNW 436
Query: 466 QSMIES-KLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
QS E S S SW T G+LE L+ T D SDYLW++T + +S
Sbjct: 437 QSYNEQPAFSGESGSW---------------TANGLLEQLSQTWDKSDYLWYMTDVNISP 481
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDL 580
++ F K N P +T S VL VFINGQ G+ G + V+ + G N +
Sbjct: 482 NE-GFIK-NGQNPVLTAMSAGHVLHVFINGQFWGTAYGSLDNPKLTFSNSVKLRVGNNKI 539
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQV 628
LLS VGL N G EK G G V L G G DLSK W+Y+V
Sbjct: 540 SLLSVAVGLSNVGVHYEKWNVGVLGPVTLKGLNEGTRDLSKQKWSYKV 587
>gi|225438369|ref|XP_002274012.1| PREDICTED: beta-galactosidase 6-like [Vitis vinifera]
Length = 758
Score = 610 bits (1574), Expect = e-172, Method: Compositional matrix adjust.
Identities = 331/747 (44%), Positives = 451/747 (60%), Gaps = 63/747 (8%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD R++IIDG+R++L S IHYPR+TP+MW LIAK+KEGG DVI+TYVFWN HE
Sbjct: 62 VTYDGRSLIIDGHRKILFSGSIHYPRSTPQMWASLIAKAKEGGVDVIQTYVFWNRHEPQP 121
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY+F G+ D+ KF+K + + GLY LRIGP++ +EW++GG P WL D+ GI +RT+N P
Sbjct: 122 GQYDFNGRYDLAKFIKEIQAQGLYACLRIGPFIESEWSYGGLPFWLHDVHGIVYRTDNEP 181
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ F KIV+LM+ E L++ QGGPII+ QIENEY N+E+++ ++G YV+WAA MA
Sbjct: 182 FKFYMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYQNIEAAFNEKGPSYVRWAAKMA 241
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDG--YKPNSYNKPTLWTENWDGWYTTWGGR 284
+ L GVPWVMCKQ+DAP+ +I+ CNG C PNS NKP++WTENW +Y +GG
Sbjct: 242 VELQTGVPWVMCKQSDAPDPVINTCNGMRCGQTFTGPNSPNKPSMWTENWTSFYEVFGGE 301
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
R ED+AF VA F R GS++NYYMY GGTNFGR S + TSY AP+DEYGL+
Sbjct: 302 TYLRSAEDIAFHVALFIARNGSYVNYYMYHGGTNFGRASSA-YIKTSYYDQAPLDEYGLI 360
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
+PKWGHLK+LHAAI LC L+ + I LGQ QEA+V++ G C AFL N
Sbjct: 361 RQPKWGHLKELHAAITLCSTPLLNGVQSN-ISLGQLQEAYVFQEEMGG----CVAFLVNN 415
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
DE ++V F S L P S+SILPDC+N +FNTAK+++ + + I+
Sbjct: 416 DEGNNSTVLFQNVSIELLPKSISILPDCKNVIFNTAKINTGYNER-----------IATS 464
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
QS W K+ I + + + ILEH+N+TKD SDYLW+ + +
Sbjct: 465 SQSF------DAVDRWEEYKDAIPNFLDTSLKSNMILEHMNMTKDESDYLWYTFRFQPN- 517
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG-HWVK---VVQPVEFQSGYNDL 580
++ P + I+S+ + F+N G+ G H +K P+ + N++
Sbjct: 518 -------SSCTEPLLHIESLAHAVHAFVNNIYVGATHGSHDMKGFTFKSPISLNNEMNNI 570
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
+LS VG + GA+LE AG +V++ + G D + W YQVGL GE IY
Sbjct: 571 SILSVMVGFPDSGAYLESRFAGLT-RVEIQCTEKGIYDFANYTWGYQVGLSGEKLHIYK- 628
Query: 641 EEN--EAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
EEN EW T TWYK F+ P G DPVAL+L +MGKG+AWVNG IGRYW
Sbjct: 629 EENLSNVEWRK-TEISTNQPLTWYKIVFNTPSGDDPVALNLSTMGKGEAWVNGQSIGRYW 687
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
++++ K G+P+QT YHVPR++L+ S NLLV+ EE G+P
Sbjct: 688 V----------------SFHNSK-----GDPSQTLYHVPRAFLKTSENLLVLLEEANGDP 726
Query: 759 FEISVKLRSTRIVCEQVSESHYPPVRK 785
IS++ S + + V H P ++
Sbjct: 727 LHISLETISRTDLPDHVLYHHLPQEKQ 753
>gi|356507642|ref|XP_003522573.1| PREDICTED: beta-galactosidase 16-like [Glycine max]
Length = 696
Score = 610 bits (1574), Expect = e-172, Method: Compositional matrix adjust.
Identities = 326/725 (44%), Positives = 443/725 (61%), Gaps = 69/725 (9%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YD R++IIDG ++L S IHYPR+TP+MWP+LIAK+KEGG DVI+TYVFWN HE
Sbjct: 26 NVTYDGRSLIIDGQHKILFSGSIHYPRSTPQMWPNLIAKAKEGGLDVIQTYVFWNLHEPQ 85
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+GQY+F+G +IV+F+K + + GLY+ LRIGPY+ +E +GG P+WL DIPGI FR++N
Sbjct: 86 QGQYDFRGMRNIVRFIKEIQAQGLYVTLRIGPYIESECTYGGLPLWLHDIPGIVFRSDNE 145
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
FK MQRF KIV+LM+ LF+ QGGPII+ QIENEYGN+E ++ ++G Y++WAA M
Sbjct: 146 QFKFHMQRFTAKIVNLMKSANLFASQGGPIILSQIENEYGNVEGAFHEKGLSYIRWAAQM 205
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGG 283
A+GL GVPWVMCKQ +AP+ +I+ CNG C +K PNS NKP+LWTENW +Y +G
Sbjct: 206 AVGLQTGVPWVMCKQDNAPDPVINTCNGMQCGKTFKGPNSPNKPSLWTENWTSFYQVFGE 265
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
R ED+A+ VA F + GS++NYYMY GGTNF R + F +T+Y +AP+DEYGL
Sbjct: 266 VPYIRSAEDIAYNVALFIAKRGSYVNYYMYHGGTNFDRIASA-FVVTAYYDEAPLDEYGL 324
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
+ EPKWGHLK+LH AIK C +L+ + LG Q A+V+R S C+AFL N
Sbjct: 325 VREPKWGHLKELHEAIKSCSNSLLYGTQTSF-SLGTQQNAYVFRR----SSIECAAFLEN 379
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISV 463
++ + ++ F Y LPP S+SILPDC+N FNTAKV +Q +
Sbjct: 380 TEDRS-VTIQFQNIPYQLPPNSISILPDCKNVAFNTAKVRAQNA---------------- 422
Query: 464 PQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
++M ++++ W +E I +++ + +L+ ++ KD SDYLW+ ++Y +
Sbjct: 423 --RAMKSQLQFNSAEKWKVYREAIPSFADTSLRANTLLDQISTAKDTSDYLWYTFRLYDN 480
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQ----SGYND 579
+ + ++ S VL F+NG L GS G V +E + SG N+
Sbjct: 481 SAN--------AQSILSAYSHGHVLHAFVNGNLVGSKHGSHKNVSFVMENKLNLISGMNN 532
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS 639
+ LS TVGL N GA+LE AG R +K+ G D + W YQVGL GE QIY+
Sbjct: 533 ISFLSATVGLPNSGAYLEGRVAGLR-SLKVQG-----RDFTNQAWGYQVGLLGEKLQIYT 586
Query: 640 IE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
++ +W P TWYKT FDAP G DPV L+LGSMGKG WVNG IGRYW
Sbjct: 587 ASGSSKVKWESFLSSTKP--LTWYKTTFDAPVGNDPVVLNLGSMGKGYTWVNGQGIGRYW 644
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
T G P+Q WYH+PRS L+++ NLLV+ EE GNP
Sbjct: 645 V---------------------SFHTPQGTPSQKWYHIPRSLLKSTGNLLVLLEEETGNP 683
Query: 759 FEISV 763
I++
Sbjct: 684 LGITL 688
>gi|219887949|gb|ACL54349.1| unknown [Zea mays]
gi|414870186|tpg|DAA48743.1| TPA: beta-galactosidase [Zea mays]
Length = 850
Score = 610 bits (1573), Expect = e-171, Method: Compositional matrix adjust.
Identities = 320/814 (39%), Positives = 467/814 (57%), Gaps = 62/814 (7%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYD R+++ DG+R + +S IHYPR+ P+MWP+LIAK+KEGG + IETYVFWN HE +
Sbjct: 43 VSYDRRSLMFDGHREIFLSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYVFWNIHEPEK 102
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G++NF+G+ND+V+F +L+ +Y +R+GP++ AEWN GG P WLR+IP I FRTNN P
Sbjct: 103 GEFNFEGQNDVVRFFQLIQEHDMYAMVRLGPFIQAEWNHGGLPYWLREIPDIVFRTNNEP 162
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
+K M+ FVK I+ +++ LF+ QGGPII+ QIENEY +ME+++ +G Y+ WAA MA
Sbjct: 163 YKMHMETFVKIIIKRLKDANLFASQGGPIILAQIENEYQHMEAAFKDEGTKYINWAAKMA 222
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNK--PTLWTENWDGWYTTWGGR 284
+ G+PW+MCKQT AP ++I CNG C P NK P LWTENW Y +G
Sbjct: 223 ISTNIGIPWIMCKQTKAPSDVIPTCNGRNCGDTWPGPTNKSMPLLWTENWTAQYRVFGDP 282
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
R ED+AFAVARFF GG+ NYYMY GGTNFGRTS F + Y +AP+DE+GL
Sbjct: 283 PSQRSAEDIAFAVARFFSVGGTLANYYMYHGGTNFGRTSAA-FVMPKYYDEAPLDEFGLY 341
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
EPKWGHL+DLH A+KLC+ AL+ + KLG+ EA V+ Q C AFL+N
Sbjct: 342 KEPKWGHLRDLHQALKLCKKALLWGTPSTE-KLGKQLEARVF---EMPEQKVCVAFLSNH 397
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
+ A++TF G+ Y +P S+S+L DC VF T V++Q + +T F
Sbjct: 398 NTKDDATMTFRGRPYFVPRHSISVLADCETVVFGTQHVNAQHNQRTFHF----------- 446
Query: 465 QQSMIESKLSSTSKSW-MTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
+ ++ + W M E + + + ++ + N+TKD +DY+W+ + +
Sbjct: 447 ------ADQTAQNNVWEMFDGENVPKYKQAKIRLRKAGDLYNLTKDKTDYVWYTSSFKLE 500
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK------VVQPVEFQSGY 577
DD+ ++++ + ++S F+N + G GH K + +P++ + G
Sbjct: 501 ADDMPI--RSDIKTVLEVNSHGHASVAFVNNKFVGC--GHGTKMNKAFTLEKPMDLKKGV 556
Query: 578 NDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI 637
N + +L+ ++G+ + GA++E AG +V++TG G +DL+ W + VGL GE +QI
Sbjct: 557 NHVAVLASSMGMTDSGAYMEHRLAGV-DRVQITGLNAGTLDLTNNGWGHIVGLVGERKQI 615
Query: 638 YSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGR 696
Y+ + W D TWYK +FD P G DPV LD+ +MGKG +VNG IGR
Sbjct: 616 YTDKGMGSVTWKPAMND---RPLTWYKRHFDMPSGEDPVVLDMSTMGKGMMFVNGQGIGR 672
Query: 697 YWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG 756
YW Y+ A G P+Q YHVPRS+L+ +N+LV+FEE G
Sbjct: 673 YW------------ISYKHA---------LGRPSQQLYHVPRSFLRQKDNMLVLFEEEFG 711
Query: 757 NPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYS-VDGKLSINKMAPEMHLHCQDGYII 815
P I + +C +SE + + W S + K + + + L C +I
Sbjct: 712 RPDAIMILTVKRDNICTFISERNPAHIMSWERKDSQITAKANADDLRARAALACPPKKLI 771
Query: 816 SSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+ FASYG P G C ++ G+CH P + VV +
Sbjct: 772 QQVVFASYGNPAGICGNYTVGSCHTPRAKEVVEK 805
>gi|147843477|emb|CAN82062.1| hypothetical protein VITISV_016430 [Vitis vinifera]
Length = 773
Score = 610 bits (1573), Expect = e-171, Method: Compositional matrix adjust.
Identities = 342/825 (41%), Positives = 459/825 (55%), Gaps = 134/825 (16%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
++ D R I+I+G R++LIS +HYPR+TPEMWPDLI KSK+GG + I+TYVFW+ HE R
Sbjct: 26 ITSDARGIMINGERKILISGSVHYPRSTPEMWPDLIQKSKDGGLNTIDTYVFWDLHEPQR 85
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
QY+F G D+V+F+K + + GLY LRIGPYVCAEW +GGFPVWL + P I+ RTNN
Sbjct: 86 RQYDFTGNKDLVRFIKAIQAQGLYAVLRIGPYVCAEWTYGGFPVWLHNQPSIQLRTNNTV 145
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
+ IENEYGN+ +Y G Y+ W A MA
Sbjct: 146 Y-------------------------------MIENEYGNVMRAYHDAGVQYINWCAQMA 174
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
L GVPW+MC+Q +AP+ +I+ CNGYYCD + PN+ N P +WTENW GWY WGG P
Sbjct: 175 AALDTGVPWIMCQQDNAPQPMINTCNGYYCDQFTPNNPNSPKMWTENWSGWYKNWGGSDP 234
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
HR EDLAF+VARF+Q GG+F NYYMY GGTNFGRT+GGP+ TSYDYDAP++EYG ++
Sbjct: 235 HRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLNEYGNKNQ 294
Query: 347 PKWGHLKDLHAAIKLCEPALVAAD--SAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
PKWGHL+DLH + E AL D + Y L A Y Q S F N
Sbjct: 295 PKWGHLRDLHLLLLSMEKALTYGDVKNVDYETLTS--------ATIYSYQGKSSCFFGNS 346
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
+ ++ + G +YT+P WSVSILPDC N V+NTAKV+SQ S V
Sbjct: 347 NADRDVTINYGGVNYTIPAWSVSILPDCSNEVYNTAKVNSQYS-------------TFVK 393
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
+ S E++ +S W+ T+Q I + +S+
Sbjct: 394 KGSEAENEPNSLQ------------WTWRGETIQYITP---------------GSVDISN 426
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG---SVIGHW-VKVVQPVEFQSGYNDL 580
DD W + T+++++ +L F+NG+ G +++G + + + + Q G N++
Sbjct: 427 DD-PIWGKDL---TLSVNTSGHILHAFVNGEHIGYQYALLGQFEFQFRRSITLQLGKNEI 482
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKIL-----WTYQVGLKGEFQ 635
LLS TVGL NYG + G G V++ NG D+ K L W Y+ GL GE +
Sbjct: 483 TLLSVTVGLTNYGPDFDMVNQGIHGPVQIIA-SNGSADIIKDLSNNNQWAYKAGLNGEDK 541
Query: 636 QIYSIEENEAEWTDLTRDGIP--STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHH 693
+I+ +W D +P +F WYK FDAP G DPV +DL +GKG+AWVNGH
Sbjct: 542 KIFLGRARYNQWKS---DNLPVNRSFVWYKATFDAPPGEDPVVVDLMGLGKGEAWVNGHS 598
Query: 694 IGRYWTVVAPKG-GCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
+GRYW +G GC CDYRG Y ++KC TNCGNP+Q WYHVPRS+L +++N LV+FE
Sbjct: 599 LGRYWPSYIARGEGCSPECDYRGPYKAEKCNTNCGNPSQRWYHVPRSFLASTDNRLVLFE 658
Query: 753 ETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDG 812
E GNP ++ + + C E + + L CQ G
Sbjct: 659 EFXGNPSSVTFQTVTVGNACANAREGY------------------------TLELSCQ-G 693
Query: 813 YIISSIEFASYGTPQGRC--------QKFSRGNCHAPMSLSVVSE 849
IS I+FAS+G PQG C Q F +G C A SLS++ +
Sbjct: 694 RAISXIKFASFGDPQGTCGKPFATGSQVFEKGTCEAADSLSIIQK 738
>gi|356509519|ref|XP_003523495.1| PREDICTED: beta-galactosidase 13-like [Glycine max]
Length = 844
Score = 610 bits (1572), Expect = e-171, Method: Compositional matrix adjust.
Identities = 325/813 (39%), Positives = 476/813 (58%), Gaps = 61/813 (7%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YD +++ I+G R +L S +HY R+TP+MWPD++ K++ GG +VI+TYVFWNAHE
Sbjct: 45 NVTYDGKSLFINGRREILFSGSVHYTRSTPDMWPDILDKARRGGLNVIQTYVFWNAHEPE 104
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G++NF+G D+VKF++LV + G+++ LR+GP++ AEWN GG P WLR++PGI FR++N
Sbjct: 105 PGKFNFQGNYDLVKFIRLVQAKGMFVTLRVGPFIQAEWNHGGLPYWLREVPGIIFRSDNE 164
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
P+K M+ FV KI+ +M++E LF+ QGGPII+ QIENEY +++ +Y ++G YV+WAA+M
Sbjct: 165 PYKFHMKAFVSKIIQMMKDEKLFAPQGGPIILAQIENEYNHIQLAYEEKGDSYVQWAANM 224
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGG 283
A+ GVPW+MCKQ DAP+ +I+ACNG +C D + PN KP +WTENW Y G
Sbjct: 225 AVATDIGVPWLMCKQRDAPDPVINACNGRHCGDTFAGPNKPYKPAIWTENWTAQYRVHGD 284
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
R ED+AF+VARFF + G+ +NYYMY GGTNFGRTS F T Y +AP+DEYGL
Sbjct: 285 PPSQRSAEDIAFSVARFFSKNGNLVNYYMYHGGTNFGRTS-SVFSTTRYYDEAPLDEYGL 343
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAA-DSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLA 402
EPKW HL+D+H A+ LC A++ S Q KL E + R G+ C+AF+
Sbjct: 344 PREPKWSHLRDVHKALLLCRRAILGGVPSVQ--KLNHFHEVRTFE--RVGTNM-CAAFIT 398
Query: 403 NIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNIS 462
N A++ F G +Y LPP S+SILPDC+ VFNT ++ SQ + + E S P + N
Sbjct: 399 NNHTMEPATINFRGTNYFLPPHSISILPDCKTVVFNTQQIVSQHNSRNYERS-PAANNF- 456
Query: 463 VPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
M + + K + + P ++S + KD +DY W+ T +
Sbjct: 457 --HWEMFNEAIPTAKKMPINLPVPAELYS--------------LLKDTTDYAWYTTSFEL 500
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG-HWVKVVQ---PVEFQSGYN 578
S +D+S V P + + S+ + F+NG + G+ G H K + PV + G N
Sbjct: 501 SQEDMSM--KPGVLPVLRVMSLGHSMVAFVNGDIVGTAHGTHEEKSFEFQTPVLLRVGTN 558
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIY 638
+ LLS TVGL + GA++E AG + + + G G +DL++ W ++VGLKGE ++++
Sbjct: 559 YISLLSSTVGLPDSGAYMEHRYAGPK-SINILGLNRGTLDLTRNGWGHRVGLKGEGKKVF 617
Query: 639 SIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
S E +W L +P +WY+T F P+G PVA+ + M KG WVNG++IGRY
Sbjct: 618 SEEGSTSVKWKPL--GAVPRALSWYRTRFGTPEGTGPVAIRMSGMAKGMVWVNGNNIGRY 675
Query: 698 W-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG 756
W + ++P G PTQ+ YH+PRS+L +NLLVIFEE
Sbjct: 676 WMSYLSP----------------------LGKPTQSEYHIPRSFLNPQDNLLVIFEEEAR 713
Query: 757 NPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIIS 816
P ++ + + +C V E V W S + + + + C G I
Sbjct: 714 VPAQVEILNVNRDTICSVVGERDPANVNSWV-SRRGNFHPVVKSVGAAASMACATGKRIV 772
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
++EFAS+G P G C F+ G+C+A S +V
Sbjct: 773 AVEFASFGNPSGYCGDFAMGSCNAAASKQIVER 805
>gi|242081931|ref|XP_002445734.1| hypothetical protein SORBIDRAFT_07g024870 [Sorghum bicolor]
gi|241942084|gb|EES15229.1| hypothetical protein SORBIDRAFT_07g024870 [Sorghum bicolor]
Length = 844
Score = 610 bits (1572), Expect = e-171, Method: Compositional matrix adjust.
Identities = 320/814 (39%), Positives = 468/814 (57%), Gaps = 64/814 (7%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
+SYD R++++DG R + S IHYPR+ P+MWP+LIAK+KEGG + IETYVFWN HE +
Sbjct: 38 ISYDRRSLMVDGRREIFFSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYVFWNIHEPEK 97
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQ+NF+G+ D+VKF KL+ ++ +R+GP++ AEWN GG P WLR+IP I FRTNN P
Sbjct: 98 GQFNFEGRYDMVKFFKLIQEHDMFAMVRLGPFIQAEWNHGGLPYWLREIPDIVFRTNNEP 157
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
+K M+ FVK ++ +++ LF+ QGGPII+ QIENEY ++E+++ ++G Y+ WAA MA
Sbjct: 158 YKMHMETFVKIVIKRLKDANLFASQGGPIILAQIENEYQHLEAAFKEEGTKYIHWAAQMA 217
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNK--PTLWTENWDGWYTTWGGR 284
+G G+PW+MCKQT AP ++I CNG C P NK P LWTENW Y +G
Sbjct: 218 IGTNIGIPWIMCKQTKAPGDVIPTCNGRNCGDTWPGPMNKTMPLLWTENWTAQYRVFGDP 277
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
R ED+AFAVARFF GG+ NYYMY GGTNFGRT+ F + Y +AP+DE+GL
Sbjct: 278 PSQRSAEDIAFAVARFFSVGGTMTNYYMYHGGTNFGRTAAA-FVMPKYYDEAPLDEFGLY 336
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
EPKWGHL+DLH A+KLC+ AL+ + KLG+ EA V+ Q C AFL+N
Sbjct: 337 KEPKWGHLRDLHLALKLCKKALLWGKPSTE-KLGKQLEARVF---EIPEQKVCVAFLSNH 392
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
+ ++TF GQ Y +P S+SIL DC+ VF T V++Q + +T F
Sbjct: 393 NTKDDVTLTFRGQPYFVPRHSISILADCKTVVFGTQHVNAQHNQRTFHF----------- 441
Query: 465 QQSMIESKLSSTSKSW-MTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
+ ++ + W M +E + + + + + N+TKD +DY+W+ + +
Sbjct: 442 ------ADQTNQNNVWQMFDEEKVPKYKQAKIRTRKAADLYNLTKDKTDYVWYTSSFKLE 495
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK------VVQPVEFQSGY 577
DD+ + +++ V ++S F+N + G GH K + +P+E + G
Sbjct: 496 PDDMPIRR--DIKTVVEVNSHGHASVAFVNNKFAGC--GHGTKMNKAFTLEKPMELKKGV 551
Query: 578 NDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI 637
N + +L+ ++G+ + GA+LE AG +V++TG G +DL+ W + VGL GE ++I
Sbjct: 552 NHVAVLASSMGMMDSGAYLEHRLAGV-DRVQITGLNAGTLDLTNNGWGHIVGLVGEQKEI 610
Query: 638 YSIEENEAE--WTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
Y+ E+ A W D TWYK +FD P G DP+ LD+ +MGKG +VNG IG
Sbjct: 611 YT-EKGMASVTWKPAVND---KPLTWYKRHFDMPSGEDPIVLDMSTMGKGMMYVNGQGIG 666
Query: 696 RYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
RYW Y+ A G P+Q YH+PRS+L+ +N+LV+FEE
Sbjct: 667 RYWM------------SYKHA---------LGRPSQQLYHIPRSFLRPKDNVLVLFEEEF 705
Query: 756 GNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYII 815
G P I + +C +SE + ++ W S + + + L C +I
Sbjct: 706 GRPDAIMILTVKRDNICTYISERNPAHIKSWERKDS-QITATADDLKARATLTCPPKKLI 764
Query: 816 SSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+ FASYG P G C ++ G+CH P + VV +
Sbjct: 765 QQVVFASYGNPVGICGNYTIGSCHTPRAKEVVEK 798
>gi|356527530|ref|XP_003532362.1| PREDICTED: beta-galactosidase 6-like [Glycine max]
Length = 673
Score = 609 bits (1571), Expect = e-171, Method: Compositional matrix adjust.
Identities = 336/733 (45%), Positives = 441/733 (60%), Gaps = 73/733 (9%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD R++IIDG R++L S IHYPR+TP+MWP LI+K+KEGG DVI+TYVFWN HE
Sbjct: 4 VTYDGRSLIIDGQRKILFSGSIHYPRSTPQMWPALISKAKEGGLDVIQTYVFWNLHEPQF 63
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY+F G+ D+V+F+K + GLY+ LRIGPY+ +EW +GGFP WL D+P I +RT+N P
Sbjct: 64 GQYDFSGRYDLVRFIKEIQVQGLYVCLRIGPYIESEWTYGGFPFWLHDVPAIVYRTDNQP 123
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ F KIV +M+ E L++ QGGPII+ QIENEY N+E ++G+ G YV+WAA MA
Sbjct: 124 FKLYMQNFTTKIVSMMQSEGLYASQGGPIILSQIENEYQNVEKAFGEDGSRYVQWAAEMA 183
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDG--YKPNSYNKPTLWTENWDGWYTTWGGR 284
+GL GVPW+MCKQTDAP+ +I+ CNG C PNS NKP WTENW +Y +GG
Sbjct: 184 VGLKTGVPWLMCKQTDAPDPLINTCNGMRCGETFTGPNSPNKPAFWTENWTSFYQVYGGE 243
Query: 285 LPHRPVEDLAFAVARFFQR-GGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
R ED+AF V F R GS++NYYMY GGTN GRTS + ITSY AP+DEYGL
Sbjct: 244 PYIRSAEDIAFHVTLFIARKNGSYVNYYMYHGGTNLGRTSSS-YVITSYYDQAPLDEYGL 302
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
L +PKWGHLK+LHAAIK C L+ + + LGQ QE +V+ + C AFL N
Sbjct: 303 LRQPKWGHLKELHAAIKSCSTTLLEGKQSNF-SLGQLQEGYVFE-----EEGKCVAFLVN 356
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISV 463
D +V F +SY LP S+SILPDC+N FNTA V+++++ +
Sbjct: 357 NDHVKMFTVQFRNRSYELPSKSISILPDCQNVTFNTATVNTKSNRRMT------------ 404
Query: 464 PQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
S I++ S++ W ++ I + + +LE +NVTKD SDYLW+
Sbjct: 405 ---STIQT--FSSADKWEQFQDVIPNFDQTTLISNSLLEQMNVTKDKSDYLWYTLS---- 455
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG-HWVKVVQ---PVEFQSGYND 579
+T S V F +G G G H VK P++ G N+
Sbjct: 456 ------------ESKLTAQSAAHVTHAFADGTYLGGAHGSHDVKSFTTQVPLKLNEGTNN 503
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS 639
+ +LS VGL + GAFLE+ AG V++ + DL+ W YQVGL GE +IY
Sbjct: 504 ISILSVMVGLPDAGAFLERRFAGLTA-VEIQCSEE-SYDLTNSTWGYQVGLLGEQLEIYE 561
Query: 640 IEENEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
+ N + +W+ L + T TWYKT FD+P G +PVAL+L SMGKGQAWVNG IGRYW
Sbjct: 562 EKSNSSIQWSPLG-NTCNQTLTWYKTAFDSPKGDEPVALNLESMGKGQAWVNGESIGRYW 620
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
+++ K G P+QT YHVPRS+L+ N LV+FEE GGNP
Sbjct: 621 I----------------SFHDSK-----GQPSQTLYHVPRSFLKDIGNSLVLFEEEGGNP 659
Query: 759 FEISVK-LRSTRI 770
IS+ + ST I
Sbjct: 660 LHISLDTISSTNI 672
>gi|224083510|ref|XP_002307056.1| predicted protein [Populus trichocarpa]
gi|222856505|gb|EEE94052.1| predicted protein [Populus trichocarpa]
Length = 715
Score = 609 bits (1571), Expect = e-171, Method: Compositional matrix adjust.
Identities = 322/751 (42%), Positives = 445/751 (59%), Gaps = 75/751 (9%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YD R++IIDG R++L S IHYPR+TPEMWP L+AK++EGG DVI+TYVFWN HE
Sbjct: 24 DVTYDGRSLIIDGQRKILFSGSIHYPRSTPEMWPSLVAKAREGGVDVIQTYVFWNLHEPR 83
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+Y+F G+ND+V+F+K + + GLY+ LRIGP++ +EW +GGFP WL D+P I +R++N
Sbjct: 84 PGEYDFSGRNDLVRFIKEIQAQGLYVCLRIGPFIESEWTYGGFPFWLHDVPDIVYRSDNE 143
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ F KIV++M+ E L++ QGGPII+ QIENEY N+E+++ +G YV WAA M
Sbjct: 144 PFKFYMQNFTTKIVNMMKSEGLYASQGGPIILSQIENEYQNVEAAFRDKGPPYVIWAAKM 203
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGY--KPNSYNKPTLWTENWDGWYTTWGG 283
A+ L GVPWVMCKQTDAP+ +I+ CNG C PNS KP+LWTENW +Y +GG
Sbjct: 204 AVELQTGVPWVMCKQTDAPDPVINTCNGMRCGETFGGPNSPTKPSLWTENWTSFYQVYGG 263
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
R ED+AF V F + GS++NYYM+ GGTNFGRT+ + ITSY AP+DEYGL
Sbjct: 264 EPYIRSAEDIAFHVTLFIAKNGSYINYYMFHGGTNFGRTASA-YVITSYYDQAPLDEYGL 322
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
+ +PKWGHLK+LHAAIK C ++ + + LGQ Q+A+++ G C+AFL N
Sbjct: 323 IRQPKWGHLKELHAAIKSCSSTILEGVQSNF-SLGQLQQAYIFEEEGAG----CAAFLVN 377
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISV 463
D+ A+V F ++ L P S+S+LPDC N +FNTAKV+++ +
Sbjct: 378 NDQKNNATVEFRNITFELLPKSISVLPDCENIIFNTAKVNAKGN---------------- 421
Query: 464 PQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
+ + S+L + W + I +++ N +LEH+N TKD SDYLW+
Sbjct: 422 -EITRTSSQLFDDADRWEAYTDVIPNFADTNLKSDTLLEHMNTTKDKSDYLWYT------ 474
Query: 524 DDDISFWKTNE-VRPTVTIDSMRDVLRVFINGQLTGSVIGHW-----VKVVQPVEFQSGY 577
SF + P + ++S+ V F+N + GS G + P+
Sbjct: 475 ---FSFLPNSSCTEPILHVESLAHVASAFVNNKYAGSAHGSKDAKGPFTMEAPIVLNDQM 531
Query: 578 NDLILLSQTVGLQNYGAFLEKDGAGFR------GQVKLTGFKNGDIDLSKILWTYQVGLK 631
N + +LS VGLQ+ GAFLE+ AG Q ++ F N W YQ GL
Sbjct: 532 NTISILSTMVGLQDSGAFLERRYAGLTRVEIRCAQQEIYNFTN------NYEWGYQAGLS 585
Query: 632 GEFQQIYSIEE-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVN 690
GE IY E + EW+++ +W+K FDAP G DPV L+L +MGKG+AWVN
Sbjct: 586 GESLNIYMREHLDNIEWSEVV-SATDQPLSWFKIEFDAPTGNDPVVLNLSTMGKGEAWVN 644
Query: 691 GHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVI 750
G IGRYW T+ G P+QT YH+PR++L +S NLLV+
Sbjct: 645 GQSIGRYWL---------------------SFLTSKGQPSQTLYHIPRAFLNSSGNLLVL 683
Query: 751 FEETGGNPFEISVKLRSTRIVCEQVSESHYP 781
EE+GG+P IS+ S + E S H P
Sbjct: 684 LEESGGDPLHISLDTVSRTGLQEHASRYHPP 714
>gi|10862896|emb|CAC13966.1| putative beta-galactosidase [Nicotiana tabacum]
Length = 715
Score = 608 bits (1568), Expect = e-171, Method: Compositional matrix adjust.
Identities = 321/727 (44%), Positives = 449/727 (61%), Gaps = 58/727 (7%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD R++I++G R +L S IHYPR PEMWPD+I K+KEGG ++I+TYVFWN HE ++
Sbjct: 28 VTYDGRSMIVNGERELLFSGSIHYPRMPPEMWPDIIRKAKEGGLNLIQTYVFWNIHEPVQ 87
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQ+NF+G D+VKF+K +G GLY+ LRIGPY+ AEWN GGFP WLR++P I FR+ N P
Sbjct: 88 GQFNFEGNYDVVKFIKTIGEQGLYVTLRIGPYIEAEWNQGGFPYWLREVPNITFRSYNEP 147
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
F M+++ + ++DLM++E LF+ QGGPIIM QIENEY N++ +Y GK YV+WAA+MA
Sbjct: 148 FIHHMKKYSEMVIDLMKKEKLFAPQGGPIIMAQIENEYNNVQLAYRDNGKKYVEWAANMA 207
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGGR 284
GL GVPW+MCKQ DAP +I+ CNG +C D + PN NKP+LWTENW Y T+G
Sbjct: 208 TGLYNGVPWIMCKQKDAPAQVINTCNGRHCADTFTGPNGPNKPSLWTENWTAQYRTFGDP 267
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
R ED+AF+VARFF + G+ NYYMY+GGTN+GRT G F T Y +AP+DE+GL
Sbjct: 268 PSQRAAEDIAFSVARFFAKNGTLTNYYMYYGGTNYGRT-GSSFVTTRYYDEAPLDEFGLY 326
Query: 345 SEPKWGHLKDLHAAIKLCEPALV-AADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
EPKW HL+DLH A++L AL+ S Q K+ Q+ E VY ++C+AFL N
Sbjct: 327 REPKWSHLRDLHRALRLSRRALLWGTPSVQ--KINQHLEITVYEK----PGTDCAAFLTN 380
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISV 463
A++ F G+ Y LP SVSILPDC+ NT + SQ + + LP S
Sbjct: 381 NHTTLPATIKFRGREYYLPEKSVSILPDCKLLSTNTQTIVSQHNSRNF---LP-SEKAKN 436
Query: 464 PQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
+ M + K+ + S + +EP+ ++S +TKD SDY W+ T I
Sbjct: 437 LKWEMYQEKVPTISDLSLKNREPLELYS--------------LTKDTSDYAWYSTSINFD 482
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYND 579
D+ ++ P + I SM L F+NG+ G G+ ++ +PV + G N
Sbjct: 483 RHDLPM--RPDILPVLQIASMGHALSAFVNGEFVGFGHGNNIEKSFVFQKPVILKPGTNT 540
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS 639
+ +L++TVG N GA++EK AG RG + + G G +D+++ W ++VG+ GE +Q+++
Sbjct: 541 ISILAETVGFPNSGAYMEKRFAGPRG-ITVQGLMAGTLDITQNNWGHEVGVFGEKEQLFT 599
Query: 640 IE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
E + +WT + TWYKTYFDAP+G +PVAL + M KG WVNG+ +GRYW
Sbjct: 600 EEGAKKVKWTPVN-GPTKGAVTWYKTYFDAPEGNNPVALKMDKMQKGMMWVNGNSLGRYW 658
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
+ + G PTQ YH+PR++L+ +NNLLVIFEETGG+P
Sbjct: 659 S---------------------SFLSPLGQPTQFEYHIPRAFLKPTNNLLVIFEETGGHP 697
Query: 759 FEISVKL 765
I V++
Sbjct: 698 ETIEVQI 704
>gi|357467507|ref|XP_003604038.1| Beta-galactosidase [Medicago truncatula]
gi|355493086|gb|AES74289.1| Beta-galactosidase [Medicago truncatula]
Length = 847
Score = 608 bits (1567), Expect = e-171, Method: Compositional matrix adjust.
Identities = 320/815 (39%), Positives = 481/815 (59%), Gaps = 49/815 (6%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YD +++ ++G R +L S IHY R+TP+ WPD++ K++ GG +VI+TYVFWNAHE
Sbjct: 34 NVTYDGKSLFVNGRRELLFSGSIHYTRSTPDAWPDILDKARHGGLNVIQTYVFWNAHEPE 93
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+G++NF+G ND+VKF++LV S G+Y+ LR+GP++ AEWN GG P WLR++PGI FR++N
Sbjct: 94 QGKFNFEGNNDLVKFIRLVQSKGMYVTLRVGPFIQAEWNHGGLPYWLREVPGIIFRSDNE 153
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
P+K+ M+ +V KI+ +M++E LF+ QGGPII+ QIENEY +++ +Y ++G YV+WAA+M
Sbjct: 154 PYKKYMKAYVSKIIQMMKDEKLFAPQGGPIILAQIENEYNHIQLAYEEKGDSYVQWAANM 213
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGG 283
A+ L GVPW+MCKQ DAP+ +I+ACNG +C D + PN KP+LWTENW Y +G
Sbjct: 214 AVALDIGVPWIMCKQKDAPDPVINACNGRHCGDTFSGPNKPYKPSLWTENWTAQYRVFGD 273
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
+ R ED+AF+VARFF + G+ +NYYMY GGTNFGRT+ F T Y +AP+DEYG+
Sbjct: 274 PVSQRSAEDIAFSVARFFSKNGNLVNYYMYHGGTNFGRTTSA-FTTTRYYDEAPLDEYGM 332
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
+PKW HL+D H A+ LC A++ KL E ++ S CSAF+ N
Sbjct: 333 ERQPKWSHLRDAHKALLLCRKAILGG-VPTVQKLNDYHEVRIFEK---PGTSTCSAFITN 388
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQ-TSIKTVEFSLPLSPNIS 462
+ AA+++F G +Y LP S+S+LPDC+ V+NT V +Q K + L + +S
Sbjct: 389 NHTNQAATISFRGSNYFLPAHSISVLPDCKTVVYNTQNVMNQLVYYKLISSHLIIKLIVS 448
Query: 463 VPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
+ + + W E I + + LE + KD +DY W+ T +
Sbjct: 449 QHNKRNFVKSAVANNLKWELFLEAIPSSKKLESNQKIPLELYTLLKDTTDYGWYTTSFEL 508
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG-HWVKVV---QPVEFQSGYN 578
+D+ + + I S+ L F+NGQ G+ G H K QP F+ G N
Sbjct: 509 GPEDLP-----KKSAILRIMSLGHTLSAFVNGQYIGTDHGTHEEKSFEFEQPANFKVGTN 563
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIY 638
+ +L+ TVGL + GA++E AG + + + G G ++L+K W ++VGL+GE +++
Sbjct: 564 YISILATTVGLPDSGAYMEHRYAGPK-SISILGLNKGKLELTKNGWGHRVGLRGEQLKVF 622
Query: 639 SIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
+ E + +W +T G +W KT F P+G PVA+ + MGKG WVNG IGR+
Sbjct: 623 TEEGSKKVQWDPVT--GETRALSWLKTRFATPEGRGPVAIRMTGMGKGMIWVNGKSIGRH 680
Query: 698 W-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG 756
W + ++P G P+Q YH+PR +L A +NLLV+ EE G
Sbjct: 681 WMSFLSP----------------------LGQPSQEEYHIPRDYLNAKDNLLVVLEEEKG 718
Query: 757 NPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKL-SINKMA-PEMHLHCQDGYI 814
+P +I + + +C ++E+ V W S +G+ S+ K + P+ L C G
Sbjct: 719 SPEKIEIMIVDRDTICSYITENSPANVNSWG---SKNGEFRSVGKNSGPQASLKCPSGKK 775
Query: 815 ISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
I ++EFAS+G P G C F+ GNC+ + VV +
Sbjct: 776 IVAVEFASFGNPSGYCGDFALGNCNGGAAKGVVEK 810
>gi|357464799|ref|XP_003602681.1| Beta-galactosidase [Medicago truncatula]
gi|355491729|gb|AES72932.1| Beta-galactosidase [Medicago truncatula]
Length = 628
Score = 606 bits (1563), Expect = e-170, Method: Compositional matrix adjust.
Identities = 317/621 (51%), Positives = 397/621 (63%), Gaps = 25/621 (4%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NVSYD R++IIDG R++LISA IHYPR+ P MWP LI +KEGG DVIETYVFWN HE
Sbjct: 26 NVSYDGRSLIIDGQRKLLISASIHYPRSVPAMWPALIQTAKEGGIDVIETYVFWNGHELS 85
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G Y F G+ D+V+F K+V +G+YL LRIGP+V AEWNFGG PVWL IPG FRT N
Sbjct: 86 PGNYYFGGRFDLVQFAKVVQDAGMYLILRIGPFVAAEWNFGGVPVWLHYIPGTVFRTYNQ 145
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PF M++F IV+LM++E LF+ QGGPII+ QIENEYG E+ Y + GK Y WAA M
Sbjct: 146 PFMHHMEKFTTYIVNLMKKEKLFASQGGPIILSQIENEYGYYENYYKEDGKKYALWAAKM 205
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ VPW+MC+Q DAP+ +ID CN +YCD + P S +P +WTENW GW+ T+GGR
Sbjct: 206 AVSQNTSVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPKRPKMWTENWPGWFKTFGGRD 265
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRPVED+AF+VARFFQ+GGS NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGL
Sbjct: 266 PHRPVEDVAFSVARFFQKGGSLNNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLPR 325
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
PKWGHLK+LH AIKLCE L+ S I LG + EA +Y S C+AF++N+D
Sbjct: 326 LPKWGHLKELHKAIKLCEHVLLYGKSVN-ISLGPSVEADIYT----DSSGACAAFISNVD 380
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ V F SY LP WSVSILPDC+N VFNTAKVSS T+I + +P+
Sbjct: 381 DKNDKKVVFRNASYHLPAWSVSILPDCKNVVFNTAKVSSPTNIVAM-----------IPE 429
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
K T K W KE G+W + +F G ++H+N TKD +DYLWH T I + D
Sbjct: 430 HLQQSDKGQKTLK-WDVFKENPGIWGKADFVKNGFVDHINTTKDTTDYLWHTTSILI-DA 487
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGH----WVKVVQPVEFQSGYNDLI 581
+ F K +P + I+S L F+N + G+ G+ P+ ++G N++
Sbjct: 488 NEEFLKKGS-KPALLIESKGHTLHAFVNQKYQGTGTGNGSHSAFTFKNPISLRAGKNEIA 546
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
+LS TVGLQ G F + GAG VK+ G N IDLS W Y++G+ GE IY E
Sbjct: 547 ILSLTVGLQTAGPFYDFIGAGVT-SVKIIGLNNRTIDLSSNAWAYKIGVLGEHLSIYQGE 605
Query: 642 -ENEAEWTDLTRDGIPSTFTW 661
N +WT + TW
Sbjct: 606 GMNSVKWTSTSEPPKGQALTW 626
>gi|413926110|gb|AFW66042.1| hypothetical protein ZEAMMB73_706783 [Zea mays]
Length = 700
Score = 606 bits (1562), Expect = e-170, Method: Compositional matrix adjust.
Identities = 313/672 (46%), Positives = 411/672 (61%), Gaps = 83/672 (12%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYDHR+++I+G RR+LIS IHYPR+ PEMWP LI K+K+GG DV++TYVFWN HE +
Sbjct: 40 VSYDHRSLVINGRRRILISGSIHYPRSAPEMWPGLIQKAKDGGLDVVQTYVFWNGHEPAQ 99
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY F + D+V+FVKLV +GLY+ LR+GPYVCAEWNFGGFPVWL+ +PGI FRT+N P
Sbjct: 100 GQYYFADRYDLVRFVKLVRQAGLYVHLRVGPYVCAEWNFGGFPVWLKYVPGIRFRTDNGP 159
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ+FV+KIV +M+ E LF WQGGPIIM Q+ENE+G MES G GK Y WAA MA
Sbjct: 160 FKAAMQKFVEKIVSMMKSEGLFEWQGGPIIMAQVENEFGPMESVVGSGGKPYAHWAAQMA 219
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+G AGVPWVMCKQ DAP+ +I+ CNG+YCD + PN+ +KPT+WTE W GW+T +GG P
Sbjct: 220 VGTNAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNNKHKPTMWTEAWTGWFTKFGGAAP 279
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEY----- 341
HRPVEDLAFAVARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDE+
Sbjct: 280 HRPVEDLAFAVARFVQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGMQWL 339
Query: 342 --------------------------------------------GLLSEPKWGHLKDLHA 357
GLL +PKWGHL+++H
Sbjct: 340 LPSLINLNSHRLPRDICRKSSQCGFYLSVVHTWNFWGGGWVYIAGLLRQPKWGHLRNMHR 399
Query: 358 AIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQ 417
AIK EPALV+ D +G ++A+V+++ C+AFL+N +A + F G+
Sbjct: 400 AIKQAEPALVSGDPT-IRSIGNYEKAYVFKSK----NGACAAFLSNYHVKSAVRIRFDGR 454
Query: 418 SYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTS 477
Y LP WS+SILPDC+ VFNTA V T + P+ S + +
Sbjct: 455 HYDLPAWSISILPDCKTAVFNTATVKEPTLL---------------PKMSPVMHRF---- 495
Query: 478 KSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRP 537
+W + E ++ F G++E L++T D SDYLW+ T + + ++ F K+ + P
Sbjct: 496 -AWQSYSEDTNSLDDSAFARDGLIEQLSLTWDKSDYLWYTTHVNIGSNE-RFLKSGQ-WP 552
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYG 593
+++ S ++VF+NG+ GSV G + + V+ G N + +LS VGL N G
Sbjct: 553 QLSVYSAGHSMQVFVNGRSYGSVYGGYDNPKLTFSGYVKMWQGSNKISILSSAVGLPNNG 612
Query: 594 AFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEA-EWTDLTR 652
E G G V L+G G DLS W YQVGLKGE ++++ + A EW
Sbjct: 613 DHFELWNVGVLGPVTLSGLNEGKRDLSHQRWIYQVGLKGESLGLHTVTGSSAVEWAG--P 670
Query: 653 DGIPSTFTWYKT 664
G TW+K
Sbjct: 671 GGGTQPLTWHKV 682
>gi|152013366|sp|Q9SCU8.2|BGL14_ARATH RecName: Full=Beta-galactosidase 14; Short=Lactase 14; Flags:
Precursor
Length = 887
Score = 605 bits (1561), Expect = e-170, Method: Compositional matrix adjust.
Identities = 335/849 (39%), Positives = 499/849 (58%), Gaps = 82/849 (9%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
++ ++++I L C +SS K V+YD ++II+G R +L S +HYPR+TP MWP
Sbjct: 16 LIAILLVISL-CSKASSHDDEKKKKGVTYDGTSLIINGKRELLFSGSVHYPRSTPHMWPS 74
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
+I K++ GG + I+TYVFWN HE +G+Y+FKG+ D+VKF+KL+ GLY+ LR+GP++
Sbjct: 75 IIDKARIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHEKGLYVTLRLGPFIQ 134
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEWN GG P WLR++P + FRTNN PFKE +R+V+KI+ +M+EE LF+ QGGPII+ QI
Sbjct: 135 AEWNHGGLPYWLREVPDVYFRTNNEPFKEHTERYVRKILGMMKEEKLFASQGGPIILGQI 194
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGY 259
ENEY ++ +Y + G+ Y+KWAA++ + G+PWVMCKQ DAP N+I+ACNG +C D +
Sbjct: 195 ENEYNAVQLAYKENGEKYIKWAANLVESMNLGIPWVMCKQNDAPGNLINACNGRHCGDTF 254
Query: 260 K-PNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTN 318
PN ++KP+LWTENW + +G R VED+AF+VAR+F + GS +NYYMY GGTN
Sbjct: 255 PGPNRHDKPSLWTENWTTQFRVFGDPPTQRTVEDIAFSVARYFSKNGSHVNYYMYHGGTN 314
Query: 319 FGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALV-AADSAQYIKL 377
FGRTS F T Y DAP+DE+GL PK+GHLK +H A++LC+ AL AQ L
Sbjct: 315 FGRTSAH-FVTTRYYDDAPLDEFGLEKAPKYGHLKHVHRALRLCKKALFWGQLRAQ--TL 371
Query: 378 GQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVF 437
G + E Y + G++ C+AFL+N + ++ F GQ Y LP S+SILPDC+ V+
Sbjct: 372 GPDTEVRYY--EQPGTKV-CAAFLSNNNTRDTNTIKFKGQDYVLPSRSISILPDCKTVVY 428
Query: 438 NTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTV 497
NTA++ +Q S + ++S+ +S + E I + + +
Sbjct: 429 NTAQIVAQHSWR-----------------DFVKSEKTSKGLKFEMFSENIPSLLDGDSLI 471
Query: 498 QGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLT 557
G L +L TKD +DY W+ T + + +DD F ++ + + S+ L V++NG+
Sbjct: 472 PGELYYL--TKDKTDYAWYTTSVKIDEDD--FPDQKGLKTILRVASLGHALIVYVNGEYA 527
Query: 558 GSVIG-HWVK---VVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFK 613
G G H +K +PV F++G N + +L GL + G+++E AG R + + G K
Sbjct: 528 GKAHGRHEMKSFEFAKPVNFKTGDNRISILGVLTGLPDSGSYMEHRFAGPRA-ISIIGLK 586
Query: 614 NGDIDLSK-ILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDG 671
+G DL++ W + GL+GE +++Y+ E + +W +DG TWYKTYF+ P+G
Sbjct: 587 SGTRDLTENNEWGHLAGLEGEKKEVYTEEGSKKVKW---EKDGKRKPLTWYKTYFETPEG 643
Query: 672 IDPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPT 730
++ VA+ + +MGKG WVNG +GRYW + ++P G PT
Sbjct: 644 VNAVAIRMKAMGKGLIWVNGIGVGRYWMSFLSP----------------------LGEPT 681
Query: 731 QTWYHVPRSWLQA--SNNLLVIFEETGGNPFE-ISVKLRSTRIVCEQVSESHYPPVRKWS 787
QT YH+PRS+++ N+LVI EE G E I L + +C V E + V+ W
Sbjct: 682 QTEYHIPRSFMKGEKKKNMLVILEEEPGVKLESIDFVLVNRDTICSNVGEDYPVSVKSWK 741
Query: 788 N------SYSVDGKL-SINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHA 840
S S D +L ++ + PE + ++FAS+G P G C F+ G C A
Sbjct: 742 REGPKIVSRSKDMRLKAVMRCPPEKQM--------VEVQFASFGDPTGTCGNFTMGKCSA 793
Query: 841 PMSLSVVSE 849
S VV +
Sbjct: 794 SKSKEVVEK 802
>gi|75141878|sp|Q7XFK2.1|BGL14_ORYSJ RecName: Full=Beta-galactosidase 14; Short=Lactase 14; Flags:
Precursor
gi|15451595|gb|AAK98719.1|AC090483_9 Putative beta-galactosidase [Oryza sativa Japonica Group]
gi|31431327|gb|AAP53122.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 808
Score = 604 bits (1557), Expect = e-170, Method: Compositional matrix adjust.
Identities = 335/810 (41%), Positives = 451/810 (55%), Gaps = 83/810 (10%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYD R++I+DG RR++IS IHYPR+TPEMWPDLI K+KEGG + IETYVFWN HE R
Sbjct: 31 VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
++NF+G D+V+F K + ++G+Y LRIGPY+C EWN+GG PVWLRDIPGI+FR +N P
Sbjct: 91 REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPVWLRDIPGIKFRLHNKP 150
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYG--NMESSYGQQGKDYVKWAAS 224
F+ M+ F IV M++ +F+ QGGPII+ QIENEYG ++ Q +Y+ W A
Sbjct: 151 FENGMEAFTTLIVKKMKDANMFAGQGGPIILAQIENEYGYTMLQPENIQSAHEYIHWCAD 210
Query: 225 MALGLGAGVPWVMCKQ-TDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGG 283
MA GVPW+MC+Q D P N+++ CNG+YC + N + P +WTENW GWY W
Sbjct: 211 MANKQNVGVPWIMCQQDNDVPPNVVNTCNGFYCHEWFSNRTSIPKMWTENWTGWYRDWDQ 270
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
RP ED+AFAVA FFQ GS NYYMY GGTNFGRT+GGP+ TSYDYDAP+DEYG
Sbjct: 271 PEFRRPTEDIAFAVAMFFQMRGSLQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYGN 330
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
L +PK+GHLK+LH+ + E L+ D YI V +Y + + F+ N
Sbjct: 331 LRQPKYGHLKELHSVLMSMEKILLHGD---YIDTNYGDNVTV---TKYTLNATSACFINN 384
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISV 463
+ +VT G ++ LP WSVSILP+C+ FN+AK+ +QT++ V
Sbjct: 385 RFDDRDVNVTLDGTTHFLPAWSVSILPNCKTVAFNSAKIKTQTTVM-------------V 431
Query: 464 PQQSMIESKLSSTSKSWMTVK-EPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
+ SM+E + SWM P + NF +LE + T D SDYLW+ T
Sbjct: 432 NKTSMVEQQTEHFKWSWMPENLRPFMTDEKGNFRKNELLEQIVTTTDQSDYLWYRT---- 487
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLIL 582
S E + +++ L F+NG+L G ++ N
Sbjct: 488 -----SLEHKGEGSYVLYVNTTGHELYAFVNGKLVGQ------------QYSPNENFTFQ 530
Query: 583 LSQTVGLQNYGAFLEKDGAGF-RGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
L NYG E AG G VKL IDLS W+Y+ GL GE+++IY +
Sbjct: 531 LKSP----NYGGSFELLPAGIVGGPVKLIDSSGSAIDLSNNSWSYKAGLAGEYRKIYLDK 586
Query: 642 ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT-- 699
+ + I FTWYKT F AP G D V +DL + KG AWVNG+ +GRYW
Sbjct: 587 PGNKWRSHNSTIPINRPFTWYKTTFQAPAGEDSVVVDLHGLNKGVAWVNGNSLGRYWPSY 646
Query: 700 VVAPKGGCQDTCDYRGAYNSD----KCTTNCGNPTQTWYHVPRSWL-QASNNLLVIFEET 754
V A GC CDYRG + ++ KC T CG P+Q YHVPRS+L + N L++FEE
Sbjct: 647 VAADMPGCHH-CDYRGVFKAEVEAQKCLTGCGEPSQQLYHVPRSFLNKGEPNTLILFEEA 705
Query: 755 GGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHC-QDGY 813
GG+P E++V+ VC ++ + L C G
Sbjct: 706 GGDPSEVAVRTVVEGSVCASA------------------------EVGDTVTLSCGAHGR 741
Query: 814 IISSIEFASYGTPQGRCQKFSRGNCHAPMS 843
ISS++ AS+G +GRC + G C + ++
Sbjct: 742 TISSVDVASFGVARGRCGSYD-GGCESKVA 770
>gi|110739416|dbj|BAF01618.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 718
Score = 603 bits (1555), Expect = e-169, Method: Compositional matrix adjust.
Identities = 327/767 (42%), Positives = 459/767 (59%), Gaps = 68/767 (8%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
+ +++I + + S +T K V+YD R++IIDG R++L S IHYPR+TPEMWP LI
Sbjct: 10 LCLILIVGTFLEFSGGATAAK--GVTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPSLI 67
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K+KEGG DVI+TYVFWN HE GQY+F G+ND+VKF+K + S GLY+ LRIGP++ AE
Sbjct: 68 KKAKEGGIDVIQTYVFWNLHEPKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIEAE 127
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
WN+GG P WLRD+PG+ +RT+N PFK MQ+F KIVDLM+ E L++ QGGPII+ QIEN
Sbjct: 128 WNYGGLPFWLRDVPGMVYRTDNEPFKFHMQKFTAKIVDLMKSEGLYASQGGPIILSQIEN 187
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGY--K 260
EY N+E ++ ++G Y+KWA MA+GL GVPW+MCK DAP+ +I+ CNG C
Sbjct: 188 EYANVEGAFHEKGASYIKWAGQMAVGLKTGVPWIMCKSPDAPDPVINTCNGMKCGETFPG 247
Query: 261 PNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG 320
PNS NKP +WTE+W ++ +G R ED+AF A F + GS++NYYMY GGTNFG
Sbjct: 248 PNSPNKPKMWTEDWTSFFQVYGKEPYIRSAEDIAFHAALFVAKNGSYINYYMYHGGTNFG 307
Query: 321 RTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQN 380
RTS ++IT Y AP+DEYGLL +PK+GHLK+LHAAIK L+ + LG
Sbjct: 308 RTSSS-YFITGYYDQAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQT-ILSLGPM 365
Query: 381 QEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTA 440
Q+A+V+ G C AFL N D A+ + F +Y+L P S+ IL +C+N ++ TA
Sbjct: 366 QQAYVFEDANNG----CVAFLVNNDAK-ASQIQFRNNAYSLSPKSIGILQNCKNLIYETA 420
Query: 441 KVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGI 500
KV+ + + + ++ P Q + + +W +E I + + +
Sbjct: 421 KVNVKMNTR-----------VTTPVQ------VFNVPDNWNLFRETIPAFPGTSLKTNAL 463
Query: 501 LEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSV 560
LEH N+TKD +DYLW Y S + TN P++ +S V+ VF+N L GS
Sbjct: 464 LEHTNLTKDKTDYLW-----YTSSFKLDSPCTN---PSIYTESSGHVVHVFVNNALAGSG 515
Query: 561 IG----HWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGD 616
G VK+ PV +G N++ +LS VGL + GA++E+ G +V+++
Sbjct: 516 HGSRDIRVVKLQAPVSLINGQNNISILSGMVGLPDSGAYMERRSYGLT-KVQISCGGTKP 574
Query: 617 IDLSKILWTYQVGLKGEFQQIYSIEE-NEAEWTDLTRDGIPST--FTWYKTYFDAPDGID 673
IDLS+ W Y VGL GE ++Y + N +W+ + + G+ WYKT FD P+G
Sbjct: 575 IDLSRSQWGYSVGLLGEKVRLYQWKNLNRVKWS-MNKAGLIKNRPLAWYKTTFDGPNGDG 633
Query: 674 PVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTW 733
PV L + SMGKG+ WVNG IGRYW T G P+Q+
Sbjct: 634 PVGLHMSSMGKGEIWVNGESIGRYWV---------------------SFLTPAGQPSQSI 672
Query: 734 YHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHY 780
YH+PR++L+ S NLLV+FEE GG+P IS L + +V ++S +
Sbjct: 673 YHIPRAFLKPSGNLLVVFEEEGGDPLGIS--LNTISVVGSSQAQSQF 717
>gi|30697899|ref|NP_568978.2| beta-galactosidase 6 [Arabidopsis thaliana]
gi|75170268|sp|Q9FFN4.1|BGAL6_ARATH RecName: Full=Beta-galactosidase 6; Short=Lactase 6; Flags:
Precursor
gi|10177061|dbj|BAB10473.1| beta-galactosidase [Arabidopsis thaliana]
gi|332010416|gb|AED97799.1| beta-galactosidase 6 [Arabidopsis thaliana]
Length = 718
Score = 603 bits (1555), Expect = e-169, Method: Compositional matrix adjust.
Identities = 327/767 (42%), Positives = 459/767 (59%), Gaps = 68/767 (8%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
+ +++I + + S +T K V+YD R++IIDG R++L S IHYPR+TPEMWP LI
Sbjct: 10 LCLILIVGTFLEFSGGATAAK--GVTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPSLI 67
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K+KEGG DVI+TYVFWN HE GQY+F G+ND+VKF+K + S GLY+ LRIGP++ AE
Sbjct: 68 KKTKEGGIDVIQTYVFWNLHEPKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIEAE 127
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
WN+GG P WLRD+PG+ +RT+N PFK MQ+F KIVDLM+ E L++ QGGPII+ QIEN
Sbjct: 128 WNYGGLPFWLRDVPGMVYRTDNEPFKFHMQKFTAKIVDLMKSEGLYASQGGPIILSQIEN 187
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGY--K 260
EY N+E ++ ++G Y+KWA MA+GL GVPW+MCK DAP+ +I+ CNG C
Sbjct: 188 EYANVEGAFHEKGASYIKWAGQMAVGLKTGVPWIMCKSPDAPDPVINTCNGMKCGETFPG 247
Query: 261 PNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG 320
PNS NKP +WTE+W ++ +G R ED+AF A F + GS++NYYMY GGTNFG
Sbjct: 248 PNSPNKPKMWTEDWTSFFQVYGKEPYIRSAEDIAFHAALFVAKNGSYINYYMYHGGTNFG 307
Query: 321 RTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQN 380
RTS ++IT Y AP+DEYGLL +PK+GHLK+LHAAIK L+ + LG
Sbjct: 308 RTSSS-YFITGYYDQAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQT-ILSLGPM 365
Query: 381 QEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTA 440
Q+A+V+ G C AFL N D A+ + F +Y+L P S+ IL +C+N ++ TA
Sbjct: 366 QQAYVFEDANNG----CVAFLVNNDAK-ASQIQFRNNAYSLSPKSIGILQNCKNLIYETA 420
Query: 441 KVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGI 500
KV+ + + + ++ P Q + + +W +E I + + +
Sbjct: 421 KVNVKMNTR-----------VTTPVQ------VFNVPDNWNLFRETIPAFPGTSLKTNAL 463
Query: 501 LEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSV 560
LEH N+TKD +DYLW Y S + TN P++ +S V+ VF+N L GS
Sbjct: 464 LEHTNLTKDKTDYLW-----YTSSFKLDSPCTN---PSIYTESSGHVVHVFVNNALAGSG 515
Query: 561 IG----HWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGD 616
G VK+ PV +G N++ +LS VGL + GA++E+ G +V+++
Sbjct: 516 HGSRDIRVVKLQAPVSLINGQNNISILSGMVGLPDSGAYMERRSYGLT-KVQISCGGTKP 574
Query: 617 IDLSKILWTYQVGLKGEFQQIYSIEE-NEAEWTDLTRDGIPST--FTWYKTYFDAPDGID 673
IDLS+ W Y VGL GE ++Y + N +W+ + + G+ WYKT FD P+G
Sbjct: 575 IDLSRSQWGYSVGLLGEKVRLYQWKNLNRVKWS-MNKAGLIKNRPLAWYKTTFDGPNGDG 633
Query: 674 PVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTW 733
PV L + SMGKG+ WVNG IGRYW T G P+Q+
Sbjct: 634 PVGLHMSSMGKGEIWVNGESIGRYWV---------------------SFLTPAGQPSQSI 672
Query: 734 YHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHY 780
YH+PR++L+ S NLLV+FEE GG+P IS L + +V ++S +
Sbjct: 673 YHIPRAFLKPSGNLLVVFEEEGGDPLGIS--LNTISVVGSSQAQSQF 717
>gi|125597922|gb|EAZ37702.1| hypothetical protein OsJ_22044 [Oryza sativa Japonica Group]
Length = 811
Score = 603 bits (1554), Expect = e-169, Method: Compositional matrix adjust.
Identities = 333/823 (40%), Positives = 462/823 (56%), Gaps = 94/823 (11%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+Y+ R+++IDG RR++IS IHYPR+TPEMWPDLI K+KEGG D IETYVFWN HE R
Sbjct: 31 VTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
QYNF G DIV+F K + ++GLY LRIGPY+C EWN+GG P WLRDIPG++FR +NAP
Sbjct: 91 RQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNAP 150
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVKWAAS 224
F+ EM+ F IV+ M++ +F+ QGGPII+ QIENEYGN+ + + Q +Y+ W A
Sbjct: 151 FENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEYIHWCAD 210
Query: 225 MALGLGAGVPWVMCKQ-TDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGG 283
MA GVPW+MC+Q +D P N+++ CNG+YC + PN P +WTENW GW+ W
Sbjct: 211 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 270
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
HR ED+AFAVA FFQ+ GGP+ TSYDYDAP+DEYG
Sbjct: 271 PDFHRSAEDIAFAVAMFFQK-------------------RGGPYITTSYDYDAPLDEYGN 311
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
L +PK+GHLKDLH+ IK E LV +Y+ + + V +Y S + F+ N
Sbjct: 312 LRQPKYGHLKDLHSVIKSIEKILV---HGEYVDTNYSDKVTV---TKYTLDSTSACFINN 365
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISV 463
+++ +VT G ++ LP WSVSILPDC+ FN+AK+ +QT++ V
Sbjct: 366 RNDNMDVNVTLDGTTHLLPAWSVSILPDCKTVAFNSAKIKAQTTVM-------------V 412
Query: 464 PQQSMIESKLSSTSKSWMTVK-EPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
+ M+E + S SWM P + ++ +LE + + D SDYLW+ T I
Sbjct: 413 NKAKMVEKEPESLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSIN- 471
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG---SVIGHWV-KVVQPVEFQSGYN 578
E T+ +++ L F+NG L G S GH+V ++ P + G N
Sbjct: 472 --------HKGEASYTLFVNTTGHELYAFVNGMLVGQNHSPNGHFVFQLESPAKLHDGKN 523
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGF-RGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI 637
+ LLS T+GL+NYG EK AG G VKL IDLS W+Y+ GL GE++QI
Sbjct: 524 YISLLSATIGLKNYGPLFEKMPAGIVGGPVKLIDNNGKGIDLSNSSWSYKAGLAGEYRQI 583
Query: 638 YSIEENEAEWTDLTRDGIP--STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
+ +++ W D +P FTWYKT F AP G D V +DL + KG AWVNG+++G
Sbjct: 584 H-LDKPGCTW-DNNNGTVPINKPFTWYKTTFQAPAGEDTVVVDLLGLNKGVAWVNGNNLG 641
Query: 696 RYW---TVVAPKGGCQDTCDYRGAYNSD----KCTTNCGNPTQTWYHVPRSWLQASN-NL 747
RYW T T YRG + ++ KC T CG P+Q +YHVPRS+L+ N
Sbjct: 642 RYWPSYTAARSMRRLPTTAHYRGVFQAEGDGQKCLTGCGEPSQRFYHVPRSFLKNGEPNT 701
Query: 748 LVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHL 807
+++FEE GG+P +S + + VC ++ + L
Sbjct: 702 VILFEEAGGDPSHVSFRTVAAGSVCASA------------------------EVGDTITL 737
Query: 808 HC-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
C Q IS+I S+G +G+C + +G C + + +E
Sbjct: 738 SCGQHSKTISAINVTSFGVARGQCGAY-KGGCESKAAYKAFTE 779
>gi|297798422|ref|XP_002867095.1| beta-galactosidase 11 [Arabidopsis lyrata subsp. lyrata]
gi|297312931|gb|EFH43354.1| beta-galactosidase 11 [Arabidopsis lyrata subsp. lyrata]
Length = 844
Score = 602 bits (1553), Expect = e-169, Method: Compositional matrix adjust.
Identities = 328/813 (40%), Positives = 483/813 (59%), Gaps = 62/813 (7%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD ++IIDG R +L S IHYPR+TPEMWP +I ++K+GG + I+TYVFWN HE +
Sbjct: 40 VTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHEPQQ 99
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G++NF G+ D+VKF+KL+ +G+Y+ LR+GP++ AEW GG P WLR++PGI FRT+N P
Sbjct: 100 GKFNFSGRADLVKFIKLIEKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNKP 159
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FKE +R+V+ I+D M+EE LF+ QGGPII+ QIENEY ++ +Y Q G +Y+KWA+ +
Sbjct: 160 FKEHTERYVRMILDKMKEERLFASQGGPIILGQIENEYSAVQRAYKQDGLNYIKWASKLV 219
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGGR 284
+ G+PWVMCKQ DAP+ +I+ACNG +C D + PN NKP+LWTENW + +G
Sbjct: 220 DSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKENKPSLWTENWTTQFRVFGDP 279
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
R VED+A++VARFF + GS +NYYMY GGTNFGRTS + T Y DAP+DEYGL
Sbjct: 280 PTQRSVEDIAYSVARFFSKNGSHVNYYMYHGGTNFGRTSAH-YVTTRYYDDAPLDEYGLE 338
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
EPK+GHLK LH+A+ LC+ L+ + K G++ E Y + G+++ C+AFLAN
Sbjct: 339 REPKYGHLKHLHSALNLCKKPLLWGQ-PKTEKPGKDTEIRYYE--QPGTKT-CAAFLANN 394
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
+ A ++ F G+ Y + P S+SILPDC+ V+NTA++ SQ +
Sbjct: 395 NTEAAETIKFKGREYVIAPRSISILPDCKTVVYNTAQIVSQHT----------------- 437
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
++ ++SK ++ + E + E N + +E +TKD +DY W+ T V
Sbjct: 438 SRNFMKSKKANKKFDFKVFTETLPSKLEGNSYIP--VELYGLTKDKTDYGWYTTSFKVHK 495
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG-HWVKVV---QPVEFQSGYNDL 580
+ + K V+ V I S+ L +++NG+ GS G H K + V ++G N L
Sbjct: 496 NHLPTKKG--VKTFVRIASLGHALHIWLNGEYLGSGHGSHEEKSFVFQKQVTLKAGENHL 553
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSK-ILWTYQVGLKGEFQQIYS 639
I+L G + G+++E G RG V + G +G +DL++ W ++G++GE I++
Sbjct: 554 IMLGVLTGFPDSGSYMEHRYTGPRG-VSILGLTSGTLDLTESSKWGNKIGMEGEKLGIHT 612
Query: 640 IEE-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
E + EW T G TWY+ YFDAP+ ++ A+ + MGKG WVNG +GRYW
Sbjct: 613 EEGLKKVEWKKFT--GKAPGLTWYQAYFDAPESLNAAAIRMNGMGKGLIWVNGEGVGRYW 670
Query: 699 -TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG- 756
+ ++P G PTQ YH+PRS+L+ NLLVIFEE
Sbjct: 671 QSFLSP----------------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNV 708
Query: 757 NPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIIS 816
P + + + VC V E++ P VR W+ ++ N ++ L C I+
Sbjct: 709 KPELMDFVIVNRDTVCSYVGENYTPSVRHWTRKQDQVQAITDN-VSLTATLKCSGTKKIA 767
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
++EFAS+G P G C F+ G C+AP+S V+ +
Sbjct: 768 AVEFASFGNPIGVCGNFTLGTCNAPVSKQVIEK 800
>gi|6686900|emb|CAB64750.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 887
Score = 602 bits (1551), Expect = e-169, Method: Compositional matrix adjust.
Identities = 333/849 (39%), Positives = 497/849 (58%), Gaps = 82/849 (9%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
++ ++++I L C +SS K V+YD ++II+G R + S +HYPR+TP+MWP
Sbjct: 16 LIAILLVISL-CSKASSHDDEKKKKGVTYDGTSLIINGKRELFFSGSVHYPRSTPDMWPS 74
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
+I K++ GG + I+TYVFWN HE +G+Y+FKG+ D+VKF+KL+ GLY+ LR+GP++
Sbjct: 75 IIDKARIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHEKGLYVTLRLGPFIQ 134
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEWN GG P WLR++P + FRTNN PFKE +R+V+KI+ +M+EE LF+ QGGPII+ QI
Sbjct: 135 AEWNHGGLPYWLREVPDVYFRTNNEPFKEHTERYVRKILGMMKEEKLFASQGGPIILGQI 194
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGY 259
ENEY ++ +Y + G+ Y+KWAA++ + G+PWVMCKQ DAP N+I+ACNG +C D +
Sbjct: 195 ENEYNAVQLAYKENGEKYIKWAANLVESMNLGIPWVMCKQNDAPGNLINACNGRHCGDTF 254
Query: 260 K-PNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTN 318
PN ++KP+LWTENW + +G R ED+AF+VAR+F + GS +NYYMY GGTN
Sbjct: 255 PGPNRHDKPSLWTENWTTQFRVFGDPPTQRTAEDIAFSVARYFSKNGSHVNYYMYHGGTN 314
Query: 319 FGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALV-AADSAQYIKL 377
FGRTS F T Y DAP+DE+GL PK+GHLK +H A++LC+ AL AQ L
Sbjct: 315 FGRTSAH-FVTTRYYDDAPLDEFGLEKAPKYGHLKHVHRALRLCKKALFWGQLRAQ--TL 371
Query: 378 GQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVF 437
G + E Y + G++ C+AFL+N + ++ F GQ Y LP S+SILPDC+ V+
Sbjct: 372 GPDTEVRYY--EQPGTKV-CAAFLSNNNTRDTNTIKFKGQDYVLPSRSISILPDCKTVVY 428
Query: 438 NTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTV 497
NTA++ +Q S + ++S+ +S + E I + + +
Sbjct: 429 NTAQIVAQHSWR-----------------DFVKSEKTSKGLKFEMFSENIPSLLDGDSLI 471
Query: 498 QGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLT 557
G L +L TKD +DY W+ T + + +DD F ++ + + S+ L V++NG+
Sbjct: 472 PGELYYL--TKDKTDYAWYTTSVKIDEDD--FPDQKGLKTILRVASLGHALIVYVNGEYA 527
Query: 558 GSVIG-HWVK---VVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFK 613
G G H +K +PV F++G N + +L GL + G+++E AG R + + G K
Sbjct: 528 GKAHGRHEMKSFEFAKPVNFKTGDNRISILGVLTGLPDSGSYMEHRFAGPRA-ISIIGLK 586
Query: 614 NGDIDLSK-ILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDG 671
+G DL++ W + GL+GE +++Y+ E + +W +DG TWYKTYF+ P+G
Sbjct: 587 SGTRDLTENNEWGHLAGLEGEKKEVYTEEGSKKVKW---EKDGERKPLTWYKTYFETPEG 643
Query: 672 IDPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPT 730
++ VA+ + MGKG WVNG +GRYW + ++P G PT
Sbjct: 644 VNAVAIRMKGMGKGLIWVNGIGVGRYWMSFLSP----------------------LGEPT 681
Query: 731 QTWYHVPRSWLQA--SNNLLVIFEETGGNPFE-ISVKLRSTRIVCEQVSESHYPPVRKWS 787
QT YH+PRS+++ N+LVI EE G E I L + +C V E + V+ W
Sbjct: 682 QTEYHIPRSFMKGEKKKNMLVILEEEPGVKLESIDFVLVNRDTICSNVGEDYPVSVKSWK 741
Query: 788 N------SYSVDGKL-SINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHA 840
S S D +L ++ + PE + ++FAS+G P G C F+ G C A
Sbjct: 742 REGPKIVSRSKDMRLKAVMRCPPEKQM--------VEVQFASFGDPTGTCGNFTMGKCSA 793
Query: 841 PMSLSVVSE 849
S VV +
Sbjct: 794 SKSKEVVEK 802
>gi|6686884|emb|CAB64742.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 718
Score = 601 bits (1549), Expect = e-169, Method: Compositional matrix adjust.
Identities = 327/767 (42%), Positives = 458/767 (59%), Gaps = 68/767 (8%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
+ +++I + + S +T K V+YD R++IIDG R++L S IHYPR+TPEMWP LI
Sbjct: 10 LCLILIVGTFLEFSGGATAAK--GVTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPSLI 67
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K+KEGG DVI+TYVFWN HE GQY+F G+ND+VKF+K + S GLY+ LRIGP++ AE
Sbjct: 68 KKTKEGGIDVIQTYVFWNLHEPKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIEAE 127
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
WN+GG P WLRD+PG+ +RT+N PFK MQ+F KIVDLM+ E L++ QGGPII+ QIEN
Sbjct: 128 WNYGGLPFWLRDVPGMVYRTDNEPFKFHMQKFTAKIVDLMKSEGLYASQGGPIILSQIEN 187
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGY--K 260
EY N+E ++ ++G Y+KWA MA+GL GVPW+MCK DAP+ +I+ CNG C
Sbjct: 188 EYANVEGAFHEKGASYIKWAGQMAVGLKTGVPWIMCKSPDAPDPVINTCNGMKCGETFPG 247
Query: 261 PNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG 320
PNS NKP +WTE+W ++ +G R ED+AF A F + GS++NYYMY GGTNFG
Sbjct: 248 PNSPNKPKMWTEDWTSFFQVYGKEPYIRSAEDIAFHAALFVAKNGSYINYYMYHGGTNFG 307
Query: 321 RTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQN 380
RTS ++IT Y AP+DEYGLL +PK+GHLK+LHAAIK L+ + LG
Sbjct: 308 RTSSS-YFITGYYDQAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQT-ILSLGPM 365
Query: 381 QEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTA 440
Q+A+V+ G C AFL N D A+ + F +Y+L P S+ IL +C+N ++ TA
Sbjct: 366 QQAYVFEDANNG----CVAFLVNNDAK-ASQIQFRNNAYSLSPKSIGILQNCKNLIYETA 420
Query: 441 KVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGI 500
KV+ + + + ++ P Q + + +W +E I + +
Sbjct: 421 KVNVKMNTR-----------VTTPVQ------VFNVPDNWNLFRETIPASQAHLLKTNAL 463
Query: 501 LEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSV 560
LEH N+TKD +DYLW Y S + TN P++ +S V+ VF+N L GS
Sbjct: 464 LEHTNLTKDKTDYLW-----YTSSFKLDSPCTN---PSIYTESSGHVVHVFVNNALAGSG 515
Query: 561 IG----HWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGD 616
G VK+ PV +G N++ +LS VGL + GA++E+ G +V+++
Sbjct: 516 HGSRDIRVVKLQAPVSLINGQNNISILSGMVGLPDSGAYMERRSYGLT-KVQISCGGTKP 574
Query: 617 IDLSKILWTYQVGLKGEFQQIYSIEE-NEAEWTDLTRDGIPST--FTWYKTYFDAPDGID 673
IDLS+ W Y VGL GE ++Y + N +W+ + + G+ WYKT FD P+G
Sbjct: 575 IDLSRSQWGYSVGLLGEKVRLYQWKNLNRVKWS-MNKAGLIKNRPLAWYKTTFDGPNGDG 633
Query: 674 PVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTW 733
PV L + SMGKG+ WVNG IGRYW T G P+Q+
Sbjct: 634 PVGLHMSSMGKGEIWVNGESIGRYWV---------------------SFLTPAGQPSQSI 672
Query: 734 YHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHY 780
YH+PR++L+ S NLLV+FEE GG+P IS L + +V ++S +
Sbjct: 673 YHIPRAFLKPSGNLLVVFEEEGGDPLGIS--LNTISVVGSSQAQSQF 717
>gi|357463559|ref|XP_003602061.1| Beta-galactosidase [Medicago truncatula]
gi|355491109|gb|AES72312.1| Beta-galactosidase [Medicago truncatula]
Length = 694
Score = 600 bits (1548), Expect = e-169, Method: Compositional matrix adjust.
Identities = 323/725 (44%), Positives = 443/725 (61%), Gaps = 69/725 (9%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YD +++I+G+ ++L S IHYPR+TP+MWPDLI+K+KEGG DVI+TYVFWN HE
Sbjct: 25 NVTYDRTSLVINGHHKILFSGSIHYPRSTPQMWPDLISKAKEGGLDVIQTYVFWNLHEPQ 84
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+GQY F G+ D+V F+K + + GLY+ LRIGPY+ +E +GG P+WL D+PGI FRT+N
Sbjct: 85 QGQYEFNGRFDLVGFIKEIQAQGLYVTLRIGPYIESECTYGGLPLWLHDVPGIVFRTDND 144
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
FK MQRF KIV++M+ LF+ QGGPII+ QIENEYG+++S + G Y+ WAA M
Sbjct: 145 QFKFHMQRFTTKIVNMMKSANLFASQGGPIILSQIENEYGSIQSKFRANGLPYIHWAAQM 204
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCD-GYK-PNSYNKPTLWTENWDGWYTTWGG 283
A+GL GVPW+MCKQ DAP+ +I+ACNG C +K PNS NKP+LWTENW + +GG
Sbjct: 205 AVGLQTGVPWMMCKQDDAPDPVINACNGMQCGRNFKGPNSPNKPSLWTENWTSFLQAFGG 264
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
R D+A+ VA F + GS++NYYMY GGTNF R + F IT+Y +AP+DEYGL
Sbjct: 265 APYMRSASDIAYNVALFIAKKGSYVNYYMYHGGTNFDRLASA-FIITAYYDEAPLDEYGL 323
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
+ +PKWGHLK+LHA+IK C L+ + LG Q+A+V+R S + C+AFL N
Sbjct: 324 VRQPKWGHLKELHASIKSCSQPLLDGTQTTF-SLGSEQQAYVFR-----SSTECAAFLEN 377
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISV 463
++ F SY LP S+SILP C+N VFNT KVS Q +++ ++ L
Sbjct: 378 SGPRD-VTIQFQNISYELPGKSISILPGCKNVVFNTGKVSIQNNVRAMKPRLQF------ 430
Query: 464 PQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
+++++W E I ++ + +L+ ++ KD SDY+W+ +
Sbjct: 431 -----------NSAENWKVYTEAIPNFAHTSKRADTLLDQISTAKDTSDYMWYTFRFNN- 478
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGH----WVKVVQPVEFQSGYND 579
K+ + ++I S DVL FING LTGS G V + + V +G N+
Sbjct: 479 -------KSPNAKSVLSIYSQGDVLHSFINGVLTGSAHGSRNNTQVTMKKNVNLINGMNN 531
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS 639
+ +LS TVGL N GAFLE AG R +V++ G D S W YQVGL GE QI++
Sbjct: 532 ISILSATVGLPNSGAFLESRVAGLR-KVEVQGR-----DFSSYSWGYQVGLLGEKLQIFT 585
Query: 640 IE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
+ ++ +W P TWY+T F AP G DPV ++LGSMGKG AWVNG IGRYW
Sbjct: 586 VSGSSKVQWKSFQSSTKP--LTWYQTTFHAPAGNDPVVVNLGSMGKGLAWVNGQGIGRYW 643
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
+ D G P+Q WYH+PRS+L+++ NLLVI EE GNP
Sbjct: 644 VSF---------------HKPD------GTPSQQWYHIPRSFLKSTGNLLVILEEETGNP 682
Query: 759 FEISV 763
I++
Sbjct: 683 LGITL 687
>gi|297836382|ref|XP_002886073.1| beta-galactosidase 13 [Arabidopsis lyrata subsp. lyrata]
gi|297331913|gb|EFH62332.1| beta-galactosidase 13 [Arabidopsis lyrata subsp. lyrata]
Length = 848
Score = 600 bits (1546), Expect = e-168, Method: Compositional matrix adjust.
Identities = 325/817 (39%), Positives = 480/817 (58%), Gaps = 70/817 (8%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD ++II+GNR +L S IHYPR+TPEMWP++I ++K+GG + I+TYVFWN HE +
Sbjct: 44 VTYDGTSLIINGNRELLYSGSIHYPRSTPEMWPNIIKRAKQGGLNTIQTYVFWNVHEPEQ 103
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G++NF G+ D+VKF+KL+ +G+Y+ LR+GP++ AEW GG P WLR++PGI FRT+N P
Sbjct: 104 GKFNFSGRADLVKFIKLIEKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNTP 163
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FKE +R+VK I+D M+EE LF+ QGGPII+ QIENEY ++ +Y + G +Y+KWA+ +
Sbjct: 164 FKEHTERYVKVILDKMKEEKLFASQGGPIILGQIENEYSAVQRAYKEDGLNYIKWASKLV 223
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGGR 284
+ G+PWVMCKQ DAP+ +I+ACNG +C D + PN NKP+LWTENW + +G
Sbjct: 224 HSMDLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKENKPSLWTENWTTQFRVYGDP 283
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
R VED+A++VARFF + G+ +NYYMY GGTNFGRTS + T Y DAP+DEYGL
Sbjct: 284 PAQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYDDAPLDEYGLE 342
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
EPK+GHLK LH A+ LC+ AL+ + K E Y + G++ C+AFLAN
Sbjct: 343 REPKYGHLKHLHNALNLCKKALLWG-QPRVEKPSNETEIRYYE--QPGTKV-CAAFLANN 398
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
+ +A + F G+ Y +P S+SILPDC+ V+NT ++ S +
Sbjct: 399 NTESAEKIKFKGKEYIIPHRSISILPDCKTVVYNTGEIISHHT----------------- 441
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
++ ++SK ++ + + E + + + + +E +TKD +DY W+ T + D
Sbjct: 442 SRNFMKSKKANKNFDFKVFTETVPSKIKGDSYIP--VELYGLTKDETDYGWYTTSFKIDD 499
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG-HWVKVV---QPVEFQSGYNDL 580
+D+S K +PT+ I S+ L V++NG+ G+ G H K +P+ + G N L
Sbjct: 500 NDLS--KKKGSKPTLRIASLGHALHVWLNGEYLGNGHGSHEEKSFVFQKPISLKEGENHL 557
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKI-LWTYQVGLKGEFQQIYS 639
+L G + G+++E G R V + G +G +DL++ W +VG++GE I++
Sbjct: 558 TMLGVLTGFPDSGSYMEHRYTGPR-SVSILGLGSGTLDLTEENKWGNKVGMEGEKLGIHA 616
Query: 640 IEE-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
E + +W + G TWY+TYFDAP+ A+ + MGKG WVNG +GRYW
Sbjct: 617 EEGLKKVKWQKFS--GKEPGLTWYQTYFDAPESQSAAAIRMNGMGKGLIWVNGEGVGRYW 674
Query: 699 -TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG- 756
+ ++P G PTQ YH+PRS+L+ NLLVIFEE
Sbjct: 675 MSFLSP----------------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNV 712
Query: 757 NPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMH----LHCQDG 812
P I + + VC + E++ P VR W+ + + ++H L C
Sbjct: 713 KPELIDFVIINRDTVCSHIGENYTPSVRHWTRKND-----QVQAITDDVHLTASLKCSGT 767
Query: 813 YIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
IS +EFAS+G P G C F+ G C+AP+S VV +
Sbjct: 768 KKISEVEFASFGNPNGTCGNFTLGTCNAPVSKKVVEK 804
>gi|147819335|emb|CAN64508.1| hypothetical protein VITISV_004610 [Vitis vinifera]
Length = 766
Score = 599 bits (1545), Expect = e-168, Method: Compositional matrix adjust.
Identities = 343/811 (42%), Positives = 455/811 (56%), Gaps = 110/811 (13%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YD R++II+G RR+L S IHYPR+TPEMWP LI+K+KEGG DVIETY FWN HE
Sbjct: 23 SVTYDGRSLIINGQRRLLFSGSIHYPRSTPEMWPSLISKAKEGGIDVIETYAFWNQHEPK 82
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+GQY+F G+ DIVKF K V + GLY LRIGP++ +EWN+GG P WL D+PGI +R++N
Sbjct: 83 QGQYDFSGRLDIVKFFKEVQAQGLYACLRIGPFIESEWNYGGLPFWLHDVPGIIYRSDNE 142
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ F KIV+LM+ E L++ QGGPII+ QIENEY N+E+++ ++G YV+WAA M
Sbjct: 143 PFKFYMQNFTTKIVNLMKSENLYASQGGPIILSQIENEYKNVEAAFHEKGPPYVRWAAKM 202
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A+ L + + + E+ G
Sbjct: 203 AVDLQTAMRY----------------------------------YGEDKRG--------- 219
Query: 286 PHRPVEDLAFAVARFF-QRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
R EDLAF VA F ++ GSF+NYYMY GGTNFGRTS + +T+Y AP+DEYGL+
Sbjct: 220 --RAAEDLAFQVALFIAKKNGSFINYYMYHGGTNFGRTSSS-YVLTAYYDQAPLDEYGLI 276
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
+PKWGHLK+LHA IKLC L+ Y LGQ QEA++++ C+AFL N
Sbjct: 277 RQPKWGHLKELHAVIKLCSDTLLXGVQYNY-SLGQLQEAYLFKR----PSGQCAAFLVNN 331
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
D+ +V F +Y L S+SILPDC+ FNTAKVS+Q + ++V+
Sbjct: 332 DKRRNVTVLFQNTNYELAANSISILPDCKKIAFNTAKVSTQFNTRSVQ------------ 379
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
+ ST K W +E I + +LEH+ TKD SDYLW+ +
Sbjct: 380 ----TRATFGST-KQWSEYREGIPSFGGTPLKASMLLEHMGTTKDASDYLWYTLRF---- 430
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDL 580
++ +P + +DS+ VL F+NG+ S G +V V SG N +
Sbjct: 431 ----IHNSSNAQPVLRVDSLAHVLLAFVNGKYIASAHGSHQNGSFSLVNKVPLNSGLNRI 486
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRG-QVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIY- 638
LLS VGL + G +LE AG R +++ G D SK W YQVGL GE QIY
Sbjct: 487 SLLSVMVGLPDAGPYLEHKVAGIRRVEIQDGGXSK---DFSKHPWGYQVGLMGEKLQIYT 543
Query: 639 SIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
S + +W L G TWYKT FDAP G DPV L GSMGKG+AWVNG IGRYW
Sbjct: 544 SPGSQKVQWYGLGSHG-RGPLTWYKTLFDAPRGNDPVVLFFGSMGKGEAWVNGQSIGRYW 602
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
T G P+QTWY+VPR++L NLLV+ EE G+P
Sbjct: 603 V---------------------SYLTPSGEPSQTWYNVPRAFLNPKGNLLVVQEEESGDP 641
Query: 759 FEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSI 818
+IS+ S VC V++SH PP+ W+ S DG S + P++ L C IS I
Sbjct: 642 LKISIGTVSVTNVCGHVTDSHPPPIISWTT--SDDGNESHHGKIPKVQLRCPPSSNISKI 699
Query: 819 EFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
FAS+GTP G C+ ++ G+CH+P SL+V +
Sbjct: 700 TFASFGTPVGGCESYAIGSCHSPNSLAVAEK 730
>gi|115477689|ref|NP_001062440.1| Os08g0549200 [Oryza sativa Japonica Group]
gi|75136208|sp|Q6ZJJ0.1|BGL11_ORYSJ RecName: Full=Beta-galactosidase 11; AltName: Full=Lactase 115;
Flags: Precursor
gi|42407808|dbj|BAD08952.1| putative glycosyl hydrolase family 35 (beta-galactosidase) [Oryza
sativa Japonica Group]
gi|113624409|dbj|BAF24354.1| Os08g0549200 [Oryza sativa Japonica Group]
Length = 848
Score = 599 bits (1544), Expect = e-168, Method: Compositional matrix adjust.
Identities = 332/817 (40%), Positives = 460/817 (56%), Gaps = 63/817 (7%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
++YD R++IIDG+R + S IHYPR+ P+ WPDLI+K+KEGG +VIE+YVFWN HE +
Sbjct: 33 ITYDRRSLIIDGHREIFFSGSIHYPRSPPDTWPDLISKAKEGGLNVIESYVFWNGHEPEQ 92
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G YNF+G+ D++KF KL+ +Y +RIGP+V AEWN GG P WLR+IP I FRTNN P
Sbjct: 93 GVYNFEGRYDLIKFFKLIQEKEMYAIVRIGPFVQAEWNHGGLPYWLREIPDIIFRTNNEP 152
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK+ M++FV IV+ ++E LF+ QGGPII+ QIENEY ++E ++ + G Y+ WAA MA
Sbjct: 153 FKKYMKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWAAKMA 212
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGY--KPNSYNKPTLWTENWDGWYTTWGGR 284
+ GVPW+MCKQT AP +I CNG +C P KP LWTENW Y +G
Sbjct: 213 IATNTGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPADKKKPLLWTENWTAQYRVFGDP 272
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
R ED+AF+VARFF GG+ NYYMY GGTNFGR +G F + Y +AP+DE+GL
Sbjct: 273 PSQRSAEDIAFSVARFFSVGGTMANYYMYHGGTNFGR-NGAAFVMPRYYDEAPLDEFGLY 331
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
EPKWGHL+DLH A++ C+ AL+ + + LG+ EA V+ ++ C AFL+N
Sbjct: 332 KEPKWGHLRDLHHALRHCKKALLWGNPS-VQPLGKLYEARVFEMK---EKNVCVAFLSNH 387
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
+ +VTF GQ Y + S+SIL DC+ VF+T V+SQ + +T F+
Sbjct: 388 NTKEDGTVTFRGQKYFVARRSISILADCKTVVFSTQHVNSQHNQRTFHFA------DQTV 441
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
Q ++ E M +E I +S+ + Q LE N TKD +DYLW+ T +
Sbjct: 442 QDNVWE----------MYSEEKIPRYSKTSIRTQRPLEQYNQTKDKTDYLWYTTSFRLET 491
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKV------VQPVEFQSGYN 578
DD+ + K EV+P + + S + F+N G GH K+ + ++ + G N
Sbjct: 492 DDLPYRK--EVKPVLEVSSHGHAIVAFVNDAFVG--CGHGTKINKAFTMEKAMDLKVGVN 547
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIY 638
+ +LS T+GL + G++LE AG V + G G +DL+ W + VGL GE ++++
Sbjct: 548 HVAILSSTLGLMDSGSYLEHRMAGVY-TVTIRGLNTGTLDLTTNGWGHVVGLDGERRRVH 606
Query: 639 SIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
S + A +D P TWY+ FD P G DPV +DL MGKG +VNG +GRYW
Sbjct: 607 SEQGMGAVAWKPGKDNQP--LTWYRRRFDPPSGTDPVVIDLTPMGKGFLFVNGEGLGRYW 664
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
Y A G P+Q YHVPRS L+ N L+ FEE GG P
Sbjct: 665 V------------SYHHA---------LGKPSQYLYHVPRSLLRPKGNTLMFFEEEGGKP 703
Query: 759 FEISVKLRSTRIVCEQVSESHYPPVR-KWSNSYS-----VDGKLSINKMAPEMHLHCQDG 812
I + +C ++E + VR W + S + P L C
Sbjct: 704 DAIMILTVKRDNICTFMTEKNPAHVRWSWESKDSQPKAVAGAGAGAGGLKPTAVLSCPTK 763
Query: 813 YIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
I S+ FASYG P G C ++ G+CHAP + VV +
Sbjct: 764 KTIQSVVFASYGNPLGICGNYTVGSCHAPRTKEVVEK 800
>gi|414888321|tpg|DAA64335.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
Length = 837
Score = 598 bits (1542), Expect = e-168, Method: Compositional matrix adjust.
Identities = 319/812 (39%), Positives = 464/812 (57%), Gaps = 62/812 (7%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD R+++IDG R + S IHYPR+ PE+WP LI ++KEGG + IETY+FWNAHE
Sbjct: 36 VTYDGRSLMIDGKRDLFFSGAIHYPRSPPEVWPKLIERAKEGGLNTIETYIFWNAHEPEP 95
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G+YNF+G+ D++K++K++ +Y +RIGP++ AEWN GG P WLR+I I FR NN P
Sbjct: 96 GKYNFEGRFDLIKYLKMIQEHDMYAIVRIGPFIQAEWNHGGLPYWLREIDHIIFRANNDP 155
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
+K+EM++FV+ IV +++ LF+ QGGPII+ QIENEYGN++ + G Y++WAA MA
Sbjct: 156 YKKEMEKFVRFIVQKLKDAELFASQGGPIILTQIENEYGNIKKDHATDGDKYLEWAAQMA 215
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
L GVPW+MCKQ+ AP +I CNG +C D + NKP LWTENW + +G ++
Sbjct: 216 LSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWTLRDKNKPMLWTENWTQQFRAYGDQV 275
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
R ED+A+AV RFF +GGS +NYYMY GGTNFGRT G + +T Y +AP+DEYG+
Sbjct: 276 AMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMYK 334
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPK+GHL+DLH I+ + A + + I LG EAH++ ++ C +FL+N +
Sbjct: 335 EPKFGHLRDLHNVIRSYQKAFLLGKHSSEI-LGHGYEAHIFELP---EENLCLSFLSNNN 390
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+V F G+ + +P SVSIL C+N V+NT +V Q + +
Sbjct: 391 TGEDGTVIFRGEKHYVPSRSVSILAGCKNVVYNTKRVFVQHN-----------------E 433
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
+S S+++S + W E I + + ++ LE N TKD SDYLW+ T + D
Sbjct: 434 RSYHTSEVTSKNNQWEMYSEKIPKYRDTKVRMKEPLEQFNQTKDASDYLWYTTSFRLESD 493
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG-HWVK---VVQPVEFQSGYNDLI 581
D+ F N++RP + + S + F N G G VK +PV+ + G N ++
Sbjct: 494 DLPF--RNDIRPVLQVKSSAHSMMGFANDAFVGCARGSKQVKGFMFEKPVDLKVGVNHVV 551
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS T+G+++ G L + +G + + + G G +DL W ++ L+GE ++IYS +
Sbjct: 552 LLSSTMGMKDSGGELAEVKSGIQ-ECLIQGLNTGTLDLQVNGWGHKAALEGEDKEIYSEK 610
Query: 642 E-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ +W TWYK YFD PDG DPV LD+ SM KG +VNG +GRYW
Sbjct: 611 GVGKVQWKPAENG---RAATWYKRYFDEPDGDDPVVLDMSSMDKGMIFVNGEGVGRYWV- 666
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
YR T G P+Q YH+PR +L++ +NLLV+FEE G P
Sbjct: 667 -----------SYR---------TLAGTPSQALYHIPRPFLKSKDNLLVVFEEEMGKPDG 706
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDG---KLSINKMAPEMHLHCQDGYIISS 817
I V+ + +C +SE + ++ W DG KL + L C I
Sbjct: 707 ILVQTVTRDDICLFISEHNPGQIKTW----DTDGDKIKLIAEDHSRRGTLMCPPEKTIQE 762
Query: 818 IEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+ FAS+G P+G C F+ G CH P + +V +
Sbjct: 763 VVFASFGNPEGMCGNFTVGTCHTPNAKQIVEK 794
>gi|255558624|ref|XP_002520337.1| beta-galactosidase, putative [Ricinus communis]
gi|223540556|gb|EEF42123.1| beta-galactosidase, putative [Ricinus communis]
Length = 771
Score = 598 bits (1541), Expect = e-168, Method: Compositional matrix adjust.
Identities = 342/838 (40%), Positives = 467/838 (55%), Gaps = 125/838 (14%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
M+ M+ + + +S + + K NV+YD R++II+G R+L S IHYPR+TPE
Sbjct: 14 MVCMLFWLGFAFLSMAIITVQGKAGNVTYDGRSLIINGEHRILFSGSIHYPRSTPE---- 69
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
Y+F G+ D+VKF+ V + GLY LRIGP++
Sbjct: 70 ----------------------------YDFDGRKDLVKFLLEVQAQGLYAALRIGPFIE 101
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
EW +GG P WL D+ GI FR++N PFK+ MQRFV KIV++M+ L++ QGGPII+ QI
Sbjct: 102 GEWTYGGLPFWLHDVSGIVFRSDNEPFKKHMQRFVTKIVNMMKYNQLYASQGGPIIISQI 161
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGY- 259
ENEY N+E+++ ++G YV WAA+MA+ L GVPWVMCKQTDAP+ +I+ CNG C
Sbjct: 162 ENEYQNVETAFHEKGSRYVHWAANMAVRLNTGVPWVMCKQTDAPDPVINTCNGMRCGETF 221
Query: 260 -KPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTN 318
PNS NKP++WTENW +Y +GG R ED+AF VA F R GS++NYYMY GGTN
Sbjct: 222 AGPNSPNKPSMWTENWTSFYQVFGGEPYIRTAEDIAFHVALFIARNGSYVNYYMYHGGTN 281
Query: 319 FGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLG 378
FGRT G F TSY AP+DEYGL+ +PKWGHLKDLHA IK C L+ + Q LG
Sbjct: 282 FGRT-GSAFVTTSYYDQAPLDEYGLIRQPKWGHLKDLHAKIKSCSKTLIRG-THQTFPLG 339
Query: 379 QNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFN 438
+ QEA+V+R +C AFL N D +V F +SY LP S+SILPDC++ FN
Sbjct: 340 RLQEAYVFREK----SGDCVAFLVNNDGRRDVTVRFQNRSYELPHKSISILPDCKSITFN 395
Query: 439 TAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQ 498
TAKV++Q + ++ S + SS K W KE + + + +
Sbjct: 396 TAKVNTQYATRSATLS----------------QEFSSVGK-WEEYKETVATFDSTSLRAK 438
Query: 499 GILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTID--SMRDVLRVFINGQL 556
+L+HL+ TKD SDYLW+ + ++ + RP T+ S VL ++NG
Sbjct: 439 TLLDHLSTTKDTSDYLWYTFR----------FQNHFSRPQSTLRAYSRGHVLHAYVNGVY 488
Query: 557 TGSVIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGF 612
GS G + V ++G N++ LLS TVGL + GA+LE+ AG +V++
Sbjct: 489 AGSAHGSHESTSFTLENSVRLKNGTNNVALLSVTVGLPDSGAYLERRVAGLH-RVRIQ-- 545
Query: 613 KNGDIDLSKILWTYQVGLKGEFQQIYSIEE-NEAEWTDLTRDGIPSTFTWYKTYFDAPDG 671
+ D + W YQVGL GE QIY+ N+ W + G TWYKT FDAP G
Sbjct: 546 ---NKDFTTYSWGYQVGLLGEKLQIYTDNGLNKVSWNEF--RGTTQPLTWYKTQFDAPAG 600
Query: 672 IDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQ 731
DP+AL+L SMGKG+AWVNG IGRYW + T+ GNP+Q
Sbjct: 601 SDPIALNLHSMGKGEAWVNGQSIGRYWVSFS---------------------TSKGNPSQ 639
Query: 732 TWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYS 791
T YH+P+S+++ + NLLV+ EE G P I+V S VC VSESH V+
Sbjct: 640 TRYHIPQSFVKPTGNLLVLLEEEKGYPPGITVDSISISKVCGHVSESHKSVVQ------- 692
Query: 792 VDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
L C IS I F+S+GTP+G C +++ G CH+ S ++V +
Sbjct: 693 ---------------LSCPPNRNISRILFSSFGTPEGNCNQYAIGKCHSSNSRAIVEK 735
>gi|222640983|gb|EEE69115.1| hypothetical protein OsJ_28192 [Oryza sativa Japonica Group]
Length = 848
Score = 597 bits (1540), Expect = e-168, Method: Compositional matrix adjust.
Identities = 332/817 (40%), Positives = 458/817 (56%), Gaps = 63/817 (7%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
++YD R++IIDG+R + S IHYPR+ P+ WPDLI+K+KEGG +VIE+YVFWN HE +
Sbjct: 33 ITYDRRSLIIDGHREIFFSGSIHYPRSPPDTWPDLISKAKEGGLNVIESYVFWNGHEPEQ 92
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G YNF+G+ D++KF KL+ +Y +RIGP+V AEWN GG P WLR+IP I FRTNN P
Sbjct: 93 GVYNFEGRYDLIKFFKLIQEKEMYAIVRIGPFVQAEWNHGGLPYWLREIPDIIFRTNNEP 152
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK+ M++FV IV+ ++E LF+ QGGPII+ QIENEY ++E ++ + G Y+ WAA MA
Sbjct: 153 FKKYMKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWAAKMA 212
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGY--KPNSYNKPTLWTENWDGWYTTWGGR 284
+ GVPW+MCKQT AP +I CNG +C P KP LWTENW Y +G
Sbjct: 213 IATNTGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPADKKKPLLWTENWTAQYRVFGDP 272
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
R ED+AF+VARFF GG+ NYYMY GGTNFGR +G F + Y +AP DE+GL
Sbjct: 273 PSQRSAEDIAFSVARFFSVGGTMANYYMYHGGTNFGR-NGAAFVMPRYYDEAPFDEFGLY 331
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
EPKWGHL+DLH A++ C+ AL+ + + LG+ EA V+ ++ C AFL+N
Sbjct: 332 KEPKWGHLRDLHHALRHCKKALLWGNPS-VQPLGKLYEARVFEMK---EKNVCVAFLSNH 387
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
+ +VTF GQ Y + S+SIL DC+ VF+T V+SQ + +T F+
Sbjct: 388 NTKEDGTVTFRGQKYFVARRSISILADCKTVVFSTQHVNSQHNQRTFHFA------DQTV 441
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
Q ++ E M +E I +S+ + Q LE N TKD +DYLW+ T +
Sbjct: 442 QDNVWE----------MYSEEKIPRYSKTSIRTQRPLEQYNQTKDKTDYLWYTTSFRLET 491
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKV------VQPVEFQSGYN 578
DD+ + K EV+P + + S + F+N G GH K+ + ++ + G N
Sbjct: 492 DDLPYRK--EVKPVLEVSSHGHAIVAFVNDAFVG--CGHGTKINKAFTMEKAMDLKVGVN 547
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIY 638
+ +LS T+GL + G++LE AG V + G G +DL+ W + VGL GE ++++
Sbjct: 548 HVAILSSTLGLMDSGSYLEHRMAGVY-TVTIRGLNTGTLDLTTNGWGHVVGLDGERRRVH 606
Query: 639 SIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
S + A +D P TWY+ FD P G DPV +DL MGKG +VNG +GRYW
Sbjct: 607 SEQGMGAVAWKPGKDNQP--LTWYRRRFDPPSGTDPVVIDLTPMGKGFLFVNGEGLGRYW 664
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
Y A G P+Q YHVPRS L+ N L+ FEE GG P
Sbjct: 665 V------------SYHHA---------LGKPSQYLYHVPRSLLRPKGNTLMFFEEEGGKP 703
Query: 759 FEISVKLRSTRIVCEQVSESHYPPVR-KWSNSYS-----VDGKLSINKMAPEMHLHCQDG 812
I + +C ++E + VR W + S P L C
Sbjct: 704 DAIMILTVKRDNICTFMTEKNPAHVRWSWESKDSQPKAVAGAGAGAGGFKPTAVLSCPTK 763
Query: 813 YIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
I S+ FASYG P G C ++ G+CHAP + VV +
Sbjct: 764 KTIQSVVFASYGNPLGICGNYTVGSCHAPRTKEVVEK 800
>gi|4581116|gb|AAD24606.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 832
Score = 597 bits (1539), Expect = e-168, Method: Compositional matrix adjust.
Identities = 325/824 (39%), Positives = 484/824 (58%), Gaps = 70/824 (8%)
Query: 40 TFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFW 99
+F +++YD ++II+GNR +L S IHYPR+TPEMWP++I ++K+GG + I+TYVFW
Sbjct: 21 SFSGALSITYDGTSLIINGNRELLYSGSIHYPRSTPEMWPNIIKRAKQGGLNTIQTYVFW 80
Query: 100 NAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIE 159
N HE +G++NF G+ D+VKF+KL+ +GLY+ LR+GP++ AEW GG P WLR++PGI
Sbjct: 81 NVHEPEQGKFNFSGRADLVKFIKLIEKNGLYVTLRLGPFIQAEWTHGGLPYWLREVPGIF 140
Query: 160 FRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYV 219
FRT+N PFKE +R+VK ++D+M+EE LF+ QGGPII+ QIENEY ++ +Y + G +Y+
Sbjct: 141 FRTDNEPFKEHTERYVKVVLDMMKEEKLFASQGGPIILGQIENEYSAVQRAYKEDGLNYI 200
Query: 220 KWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGW 277
KWA+ + + G+PWVMCKQ DAP+ +I+ACNG +C D + PN NKP+LWTENW
Sbjct: 201 KWASKLVHSMDLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKDNKPSLWTENWTTQ 260
Query: 278 YTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAP 337
+ +G R VED+A++VARFF + G+ +NYYMY GGTNFGRTS + T Y DAP
Sbjct: 261 FRVFGDPPAQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYDDAP 319
Query: 338 IDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNC 397
+DE+GL EPK+GHLK LH A+ LC+ AL+ + K E Y + G++ C
Sbjct: 320 LDEFGLEREPKYGHLKHLHNALNLCKKALLWG-QPRVEKPSNETEIRYYE--QPGTKV-C 375
Query: 398 SAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPL 457
+AFLAN + A + F G+ Y +P S+SILPDC+ V+NT ++ S +
Sbjct: 376 AAFLANNNTEAAEKIKFRGKEYLIPHRSISILPDCKTVVYNTGEIISHHT---------- 425
Query: 458 SPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHI 517
++ ++SK ++ + + E + + + + +E +TKD SDY W+
Sbjct: 426 -------SRNFMKSKKANKNFDFKVFTESVPSKIKGDSFIP--VELYGLTKDESDYGWYT 476
Query: 518 TQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG-HWVKVV---QPVEF 573
T + D+D+S K +P + I S+ L V++NG+ G+ G H K +PV
Sbjct: 477 TSFKIDDNDLS--KKKGGKPNLRIASLGHALHVWLNGEYLGNGHGSHEEKSFVFQKPVTL 534
Query: 574 QSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKI-LWTYQVGLKG 632
+ G N L +L G + G+++E G R V + G +G +DL++ W +VG++G
Sbjct: 535 KEGENHLTMLGVLTGFPDSGSYMEHRYTGPR-SVSILGLGSGTLDLTEENKWGNKVGMEG 593
Query: 633 EFQQIYSIEE-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNG 691
E I++ E + +W + G TWY+TYFDAP+ A+ + MGKG WVNG
Sbjct: 594 ERLGIHAEEGLKKVKWEKAS--GKEPGMTWYQTYFDAPESQSAAAIRMNGMGKGLIWVNG 651
Query: 692 HHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVI 750
+GRYW + ++P G PTQ YH+PRS+L+ NLLVI
Sbjct: 652 EGVGRYWMSFLSP----------------------LGQPTQIEYHIPRSFLKPKKNLLVI 689
Query: 751 FEETGG-NPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMH--- 806
FEE P I + + VC + E++ P VR W+ + + ++H
Sbjct: 690 FEEEPNVKPELIDFVIVNRDTVCSYIGENYTPSVRHWTRKND-----QVQAITDDVHLTA 744
Query: 807 -LHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
L C IS++EFAS+G P G C F+ G+C+AP+S VV +
Sbjct: 745 NLKCSGTKKISAVEFASFGNPNGTCGNFTLGSCNAPVSKKVVEK 788
>gi|357154419|ref|XP_003576777.1| PREDICTED: beta-galactosidase 12-like [Brachypodium distachyon]
Length = 835
Score = 597 bits (1539), Expect = e-168, Method: Compositional matrix adjust.
Identities = 321/812 (39%), Positives = 459/812 (56%), Gaps = 62/812 (7%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYD R+++IDG R + S IHYPR+ PEMWP L+ ++K+GG + IETYVFWNAHE
Sbjct: 33 VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWPKLLDRAKDGGLNTIETYVFWNAHEPEP 92
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G+YNF+G+ D++KF+KL+ + +Y +RIGP++ AEWN GG P WLR+IP I FR NN P
Sbjct: 93 GKYNFEGRCDLIKFLKLIQDNDMYAVIRIGPFIQAEWNHGGLPYWLREIPHIIFRANNEP 152
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
+K+EM++FV+ IV +++ +F+ QGGPII+ QIENEYGN++ + G Y++WAA MA
Sbjct: 153 YKKEMEKFVRFIVQKLKDADMFASQGGPIILAQIENEYGNIKKDHITDGDKYLEWAAEMA 212
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
L G+PW+MCKQT AP +I CNG +C D + NKP LWTENW + +G +
Sbjct: 213 LSTNIGIPWIMCKQTTAPGVVIPTCNGRHCGDTWTLRDKNKPRLWTENWTAQFRAFGDQA 272
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
R ED+A++V RFF +GG+ +NYYMY+GGTNFGRT G + +T Y +APIDEYGL
Sbjct: 273 AVRSAEDIAYSVLRFFAKGGTLVNYYMYYGGTNFGRT-GASYVLTGYYDEAPIDEYGLNK 331
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPK+GHL+DLH IK A + + + LG EAH Y ++ C AF++N +
Sbjct: 332 EPKFGHLRDLHKLIKSYHKAFLVGKQS-FELLGHGYEAHNY---ELPEENLCLAFISNNN 387
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+V F G+ Y +P SVSIL DC + V+NT +V Q S +
Sbjct: 388 TGEDGTVMFRGKKYYIPSRSVSILADCNHVVYNTKRVFVQHS-----------------E 430
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
+S + S+ + W EPI + + + LE N+TKD SDYLW+ T + D
Sbjct: 431 RSFHTADESTKNNVWEMYSEPIPRYKVTSVRTKEPLEQYNLTKDKSDYLWYTTSFRLEAD 490
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLI 581
D+ F + ++RP V + S + F+N GS G +P++ + G N L
Sbjct: 491 DLPFRR--DIRPVVQVKSSAHAMMGFVNDAFAGSGRGSKKDKGFLFEKPIDLRIGINHLA 548
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS ++G+++ G L + G + + G G +DL W +++ L GE ++IY+ +
Sbjct: 549 LLSSSMGMKDSGGELVEVKGGIQ-DCMIQGLNTGTLDLQGNGWGHKINLDGEDKEIYTEK 607
Query: 642 -ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+W TWY+ YFD PDG DPV LD+ SM KG +VNG +GRYWT
Sbjct: 608 GMGTVKWKPAENG---HAVTWYRRYFDEPDGDDPVVLDMSSMSKGMIFVNGEGVGRYWT- 663
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
Y+ T G P+Q+ YH+PR +L++ NLLV+FEE G P
Sbjct: 664 -----------SYK---------TIAGLPSQSLYHIPRPFLKSKKNLLVVFEEEIGKPEG 703
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDG---KLSINKMAPEMHLHCQDGYIISS 817
I ++ +C +SE + V+ W DG KL + L C I
Sbjct: 704 ILIQTVRRDDICFLMSEHNPAQVKTW----DADGGQIKLIAEDHSSRGILTCPHKKTIEE 759
Query: 818 IEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+ FAS+G P+G C F+ G CH P + V++
Sbjct: 760 VVFASFGNPEGACGNFTAGTCHTPNAKEFVAK 791
>gi|30679742|ref|NP_179264.2| beta-galactosidase 13 [Arabidopsis thaliana]
gi|75265629|sp|Q9SCU9.1|BGL13_ARATH RecName: Full=Beta-galactosidase 13; Short=Lactase 13; Flags:
Precursor
gi|6686898|emb|CAB64749.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|330251438|gb|AEC06532.1| beta-galactosidase 13 [Arabidopsis thaliana]
Length = 848
Score = 597 bits (1538), Expect = e-167, Method: Compositional matrix adjust.
Identities = 325/817 (39%), Positives = 481/817 (58%), Gaps = 70/817 (8%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD ++II+GNR +L S IHYPR+TPEMWP++I ++K+GG + I+TYVFWN HE +
Sbjct: 44 VTYDGTSLIINGNRELLYSGSIHYPRSTPEMWPNIIKRAKQGGLNTIQTYVFWNVHEPEQ 103
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G++NF G+ D+VKF+KL+ +GLY+ LR+GP++ AEW GG P WLR++PGI FRT+N P
Sbjct: 104 GKFNFSGRADLVKFIKLIEKNGLYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNEP 163
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FKE +R+VK ++D+M+EE LF+ QGGPII+ QIENEY ++ +Y + G +Y+KWA+ +
Sbjct: 164 FKEHTERYVKVVLDMMKEEKLFASQGGPIILGQIENEYSAVQRAYKEDGLNYIKWASKLV 223
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGGR 284
+ G+PWVMCKQ DAP+ +I+ACNG +C D + PN NKP+LWTENW + +G
Sbjct: 224 HSMDLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKDNKPSLWTENWTTQFRVFGDP 283
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
R VED+A++VARFF + G+ +NYYMY GGTNFGRTS + T Y DAP+DE+GL
Sbjct: 284 PAQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYDDAPLDEFGLE 342
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
EPK+GHLK LH A+ LC+ AL+ + K E Y + G++ C+AFLAN
Sbjct: 343 REPKYGHLKHLHNALNLCKKALLWG-QPRVEKPSNETEIRYYE--QPGTKV-CAAFLANN 398
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
+ A + F G+ Y +P S+SILPDC+ V+NT ++ S +
Sbjct: 399 NTEAAEKIKFRGKEYLIPHRSISILPDCKTVVYNTGEIISHHT----------------- 441
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
++ ++SK ++ + + E + + + + +E +TKD SDY W+ T + D
Sbjct: 442 SRNFMKSKKANKNFDFKVFTESVPSKIKGDSFIP--VELYGLTKDESDYGWYTTSFKIDD 499
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG-HWVKVV---QPVEFQSGYNDL 580
+D+S K +P + I S+ L V++NG+ G+ G H K +PV + G N L
Sbjct: 500 NDLS--KKKGGKPNLRIASLGHALHVWLNGEYLGNGHGSHEEKSFVFQKPVTLKEGENHL 557
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKI-LWTYQVGLKGEFQQIYS 639
+L G + G+++E G R V + G +G +DL++ W +VG++GE I++
Sbjct: 558 TMLGVLTGFPDSGSYMEHRYTGPR-SVSILGLGSGTLDLTEENKWGNKVGMEGERLGIHA 616
Query: 640 IEE-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
E + +W + G TWY+TYFDAP+ A+ + MGKG WVNG +GRYW
Sbjct: 617 EEGLKKVKWEKAS--GKEPGMTWYQTYFDAPESQSAAAIRMNGMGKGLIWVNGEGVGRYW 674
Query: 699 -TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG- 756
+ ++P G PTQ YH+PRS+L+ NLLVIFEE
Sbjct: 675 MSFLSP----------------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNV 712
Query: 757 NPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMH----LHCQDG 812
P I + + VC + E++ P VR W+ + + ++H L C
Sbjct: 713 KPELIDFVIVNRDTVCSYIGENYTPSVRHWTRKND-----QVQAITDDVHLTANLKCSGT 767
Query: 813 YIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
IS++EFAS+G P G C F+ G+C+AP+S VV +
Sbjct: 768 KKISAVEFASFGNPNGTCGNFTLGSCNAPVSKKVVEK 804
>gi|297793965|ref|XP_002864867.1| beta-galactosidase 6 [Arabidopsis lyrata subsp. lyrata]
gi|297310702|gb|EFH41126.1| beta-galactosidase 6 [Arabidopsis lyrata subsp. lyrata]
Length = 716
Score = 597 bits (1538), Expect = e-167, Method: Compositional matrix adjust.
Identities = 324/762 (42%), Positives = 457/762 (59%), Gaps = 77/762 (10%)
Query: 15 ALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRAT 74
A V+ + ++++ + L ++A+ V+YD R++IIDG R++L S IHYPR+T
Sbjct: 3 AARVFGLCLILVGMFLVFPGGATAAK-----GVTYDGRSLIIDGQRKLLFSGSIHYPRST 57
Query: 75 PEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLR 134
PEMWP LI K+KEGG DVI+TYVFWN HE GQY+F G+ND+VKF+K + S GLY+ LR
Sbjct: 58 PEMWPSLIKKTKEGGIDVIQTYVFWNLHEPKLGQYDFSGRNDLVKFIKEIRSQGLYVCLR 117
Query: 135 IGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGP 194
IGP++ AEWN+GG P WLRD+PG+ +RT+N PFK MQ+F KIV+LM+ E L++ QGGP
Sbjct: 118 IGPFIEAEWNYGGLPFWLRDVPGMVYRTDNEPFKFHMQKFTTKIVNLMKSEGLYASQGGP 177
Query: 195 IIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGY 254
II+ QIENEY N+E+++ ++G Y+KWA MA+GL GVPW+MCK DAP+ +I+ CNG
Sbjct: 178 IILSQIENEYANVEAAFHEKGASYIKWAGQMAVGLKTGVPWIMCKSPDAPDPVINTCNGM 237
Query: 255 YCDGY--KPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYM 312
C PNS NKP +WTE+W ++ +G R ED+AF F + GS++NYYM
Sbjct: 238 RCGETFPGPNSPNKPKMWTEDWTSFFQVYGTEPYIRSAEDIAFHAVLFIAKNGSYINYYM 297
Query: 313 YFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSA 372
Y GGTNFGRTS ++IT Y AP+DEYGLL +PK+GHLK+LHAAIK L+
Sbjct: 298 YHGGTNFGRTSSS-YFITGYYDQAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQT 356
Query: 373 QYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDC 432
+ LG Q+A+V+ + S C AFL N D + + F SY+L P S+ IL +C
Sbjct: 357 -ILSLGPMQQAYVFED----ASSGCVAFLVNNDAKV-SQIQFRKSSYSLSPKSIGILQNC 410
Query: 433 RNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSE 492
+N ++ TAKV+ + + + ++ P Q + + + W +E I +S
Sbjct: 411 KNLIYETAKVNVEKNKR-----------VTTPVQ------VFNVPEKWEGFRETIPAFSG 453
Query: 493 NNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFI 552
+ +LEH N+TKD +DYLW+ T + D + P++ I+S V+ VF+
Sbjct: 454 TSLKANALLEHTNLTKDKTDYLWY-TSSFKPDSPCT-------NPSIYIESSGHVVHVFV 505
Query: 553 NGQLTGSVIGHW---VKVVQ---PVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQ 606
N L GS GH +KVV+ P +G N + +LS VGL + GA++E+ G +
Sbjct: 506 NNALAGS--GHGSRDIKVVKLQVPASLTNGQNSISILSGMVGLPDSGAYMERKSYGLT-K 562
Query: 607 VKLTGFKNGDIDLSKILWTYQVGLKGE---FQQIYSIEENEAEWTDLTRDGIPST--FTW 661
V+++ IDLS W Y VGL GE QQ ++ N +W+ + G+ W
Sbjct: 563 VQISCGGTKPIDLSGSQWGYSVGLLGEKVRLQQWRNL--NRVKWS-MNNAGLIKNRPLIW 619
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDK 721
YKT FD P+G PV L++ SMGKG+ WVNG IGRYW
Sbjct: 620 YKTIFDGPNGDGPVGLNMSSMGKGEIWVNGESIGRYWV---------------------S 658
Query: 722 CTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
T G+P+Q+ YH+PR +L+ S NLLV+FEE GG+P IS+
Sbjct: 659 FLTPSGHPSQSIYHIPREFLKPSGNLLVVFEEEGGDPLGISL 700
>gi|449454199|ref|XP_004144843.1| PREDICTED: beta-galactosidase 13-like [Cucumis sativus]
gi|449506996|ref|XP_004162905.1| PREDICTED: beta-galactosidase 13-like [Cucumis sativus]
Length = 766
Score = 596 bits (1536), Expect = e-167, Method: Compositional matrix adjust.
Identities = 323/773 (41%), Positives = 465/773 (60%), Gaps = 61/773 (7%)
Query: 77 MWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIG 136
MW D++ K++ GG +VI+TYVFWN HE + GQ+NF+G D+VKF+KL+G +Y+ LR+G
Sbjct: 1 MWSDILDKARRGGLNVIQTYVFWNIHEPVEGQFNFEGNYDLVKFIKLIGEKQMYVTLRVG 60
Query: 137 PYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPII 196
P++ AEWN GG P WLR+ P I FR+ N+ FK M+++V IVD+M+E LF+ QGGPI+
Sbjct: 61 PFIQAEWNHGGLPYWLREKPNIIFRSYNSQFKHYMKKYVAMIVDMMKENKLFASQGGPIV 120
Query: 197 MLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYC 256
+ QIENEY +++ +Y + G YV+WAA+MA+GLG GVPW+MCKQ DAP+ +I+ CNG +C
Sbjct: 121 LAQIENEYNHVQLAYDELGVQYVQWAANMAVGLGVGVPWIMCKQKDAPDPVINTCNGRHC 180
Query: 257 -DGYK-PNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYF 314
D + PN KP LWTENW Y +G R ED+AF+VARFF + GS +NYYMY
Sbjct: 181 GDTFTGPNKPYKPALWTENWTAQYRVFGDPPSQRAAEDIAFSVARFFSKNGSLVNYYMYH 240
Query: 315 GGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCE-PALVAADSAQ 373
GGTNFGRTS F T Y +AP+DE+GL EPKWGHL+D+H A+ LC+ P L Q
Sbjct: 241 GGTNFGRTSA-VFTTTRYYDEAPLDEFGLQREPKWGHLRDVHKALNLCKKPLLWGTPGIQ 299
Query: 374 YIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCR 433
I G+ EA Y + C+AFLAN D +A ++ F G+ + LPP S+SILPDC+
Sbjct: 300 VI--GKGLEARFYEK---PGTNICAAFLANNDTKSAQTINFRGREFLLPPRSISILPDCK 354
Query: 434 NTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKE-PIGVWSE 492
VFNT + SQ + + +P ++ + K + +S TV++ P+
Sbjct: 355 TVVFNTETIVSQHNARNF-----------IPSKNANKLKWKMSPESIPTVEQVPV----- 398
Query: 493 NNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFI 552
NN + LE ++ KD +DY W+ T I + +D+S K ++ P + I S+ + VF+
Sbjct: 399 NN---KIPLELYSLLKDTTDYGWYTTSIELDKEDVS--KRPDILPVLRIASLGHAMLVFV 453
Query: 553 NGQLTGSVIG-HWVK--VVQ-PVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVK 608
NG+ G+ G H K V Q V F++G N++ LL VGL + GA++E AG R +
Sbjct: 454 NGEYIGTAHGSHEEKNFVFQGSVPFKAGVNNIALLGILVGLPDSGAYMEHRFAGPR-SIT 512
Query: 609 LTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFD 667
+ G G +D+SK W +QV L+GE ++++ + +W+++ + S TWYKTYFD
Sbjct: 513 ILGLNTGTLDISKNGWGHQVALQGEKVKVFTQGGSHRVDWSEIKEE--KSALTWYKTYFD 570
Query: 668 APDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCG 727
AP+G DPVA+ + MGKGQ WVNG IGRYW +Y S +
Sbjct: 571 APEGNDPVAIRMNGMGKGQIWVNGKSIGRYWM----------------SYLSPLKLS--- 611
Query: 728 NPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWS 787
TQ+ YH+PRS+++ S NLLVI EE P ++ + L + +C +++ H P V+ W
Sbjct: 612 --TQSEYHIPRSFIKPSENLLVILEEENVTPEKVEILLVNRDTICSFITQYHPPNVKSWE 669
Query: 788 NSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHA 840
+ ++ + HL C I++IEFAS+G P G C F G CH+
Sbjct: 670 RK-DKQFRAVVDDVKTGAHLRCPHDKKITNIEFASFGDPSGVCGNFEHGKCHS 721
>gi|18418558|ref|NP_567973.1| beta-galactosidase 11 [Arabidopsis thaliana]
gi|75202765|sp|Q9SCV1.1|BGL11_ARATH RecName: Full=Beta-galactosidase 11; Short=Lactase 11; Flags:
Precursor
gi|6686894|emb|CAB64747.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332661046|gb|AEE86446.1| beta-galactosidase 11 [Arabidopsis thaliana]
Length = 845
Score = 596 bits (1536), Expect = e-167, Method: Compositional matrix adjust.
Identities = 324/813 (39%), Positives = 480/813 (59%), Gaps = 62/813 (7%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD ++IIDG R +L S IHYPR+TPEMWP +I ++K+GG + I+TYVFWN HE +
Sbjct: 41 VTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHEPQQ 100
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G++NF G+ D+VKF+KL+ +G+Y+ LR+GP++ AEW GG P WLR++PGI FRT+N
Sbjct: 101 GKFNFSGRADLVKFIKLIQKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNKQ 160
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FKE +R+V+ I+D M+EE LF+ QGGPII+ QIENEY ++ +Y Q G +Y+KWA+++
Sbjct: 161 FKEHTERYVRMILDKMKEERLFASQGGPIILGQIENEYSAVQRAYKQDGLNYIKWASNLV 220
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGGR 284
+ G+PWVMCKQ DAP+ +I+ACNG +C D + PN NKP+LWTENW + +G
Sbjct: 221 DSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNRENKPSLWTENWTTQFRVFGDP 280
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
R VED+A++VARFF + G+ +NYYMY GGTNFGRTS + T Y DAP+DEYGL
Sbjct: 281 PTQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYDDAPLDEYGLE 339
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
EPK+GHLK LH A+ LC+ L+ + K G++ E Y + G+++ C+AFLAN
Sbjct: 340 KEPKYGHLKHLHNALNLCKKPLLWGQ-PKTEKPGKDTEIRYYE--QPGTKT-CAAFLANN 395
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
+ A ++ F G+ Y + P S+SILPDC+ V+NTA++ SQ +
Sbjct: 396 NTEAAETIKFKGREYVIAPRSISILPDCKTVVYNTAQIVSQHT----------------- 438
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
++ ++SK ++ + E + E N + +E +TKD +DY W+ T V
Sbjct: 439 SRNFMKSKKANKKFDFKVFTETLPSKLEGNSYIP--VELYGLTKDKTDYGWYTTSFKVHK 496
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG-HWVKVV---QPVEFQSGYNDL 580
+ + K V+ V I S+ L ++NG+ GS G H K + V ++G N L
Sbjct: 497 NHLPTKKG--VKTFVRIASLGHALHAWLNGEYLGSGHGSHEEKSFVFQKQVTLKAGENHL 554
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSK-ILWTYQVGLKGEFQQIYS 639
++L G + G+++E G RG + + G +G +DL++ W ++G++GE I++
Sbjct: 555 VMLGVLTGFPDSGSYMEHRYTGPRG-ISILGLTSGTLDLTESSKWGNKIGMEGEKLGIHT 613
Query: 640 IEE-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
E + EW T G TWY+TYFDAP+ + + + MGKG WVNG +GRYW
Sbjct: 614 EEGLKKVEWKKFT--GKAPGLTWYQTYFDAPESVSAATIRMHGMGKGLIWVNGEGVGRYW 671
Query: 699 -TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG- 756
+ ++P G PTQ YH+PRS+L+ NLLVIFEE
Sbjct: 672 QSFLSP----------------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNV 709
Query: 757 NPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIIS 816
P + + + VC V E++ P VR W+ ++ N ++ L C I+
Sbjct: 710 KPELMDFAIVNRDTVCSYVGENYTPSVRHWTRKKDQVQAITDN-VSLTATLKCSGTKKIA 768
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
++EFAS+G P G C F+ G C+AP+S V+ +
Sbjct: 769 AVEFASFGNPIGVCGNFTLGTCNAPVSKQVIEK 801
>gi|357437611|ref|XP_003589081.1| Beta-galactosidase [Medicago truncatula]
gi|355478129|gb|AES59332.1| Beta-galactosidase [Medicago truncatula]
Length = 589
Score = 595 bits (1535), Expect = e-167, Method: Compositional matrix adjust.
Identities = 312/615 (50%), Positives = 389/615 (63%), Gaps = 31/615 (5%)
Query: 158 IEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKD 217
+ FRT+N PFK MQ+F KIV +M+ E LF QGGPIIM QIENEYG +E G GK
Sbjct: 1 MAFRTDNEPFKAAMQKFTTKIVTMMKAESLFQTQGGPIIMSQIENEYGPVEWEIGAPGKA 60
Query: 218 YVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGW 277
Y KWAA MA+GL GVPW MCKQ DAP+ +ID CNGYYC+ + PN KP +WTENW GW
Sbjct: 61 YTKWAAQMAVGLDTGVPWDMCKQEDAPDPVIDTCNGYYCENFTPNENFKPKMWTENWSGW 120
Query: 278 YTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAP 337
YT +GG + HRP EDLA++VA F Q GSF+NYYMY GGTNFGRTS G F TSYDYDAP
Sbjct: 121 YTDFGGAISHRPTEDLAYSVATFIQNRGSFVNYYMYHGGTNFGRTSSGLFIATSYDYDAP 180
Query: 338 IDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNC 397
IDEYGL +EPKW HLK+LH AIK CEPAL++ D +N EAHVY N S C
Sbjct: 181 IDEYGLPNEPKWSHLKNLHKAIKQCEPALISVDPTVTWLGNKNLEAHVYYVN----TSIC 236
Query: 398 SAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPL 457
+AFLAN D +AA+VTF Y LPPWSVSILPDC+ VFNTA V+ + F +
Sbjct: 237 AAFLANYDTKSAATVTFGNGQYDLPPWSVSILPDCKTVVFNTATVNGHS------FHKRM 290
Query: 458 SPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHI 517
+P +E+ S S +EP +++ + E +NVT+D SDYLW++
Sbjct: 291 TP---------VETTFDWQSYS----EEPAYSSDDDSIIANALWEQINVTRDSSDYLWYL 337
Query: 518 TQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEF 573
T + +S + SF K + PT+TI+S VL VF+NGQL+G+V G V + V
Sbjct: 338 TDVNISPSE-SFIKNGQF-PTLTINSAGHVLHVFVNGQLSGTVYGGLDNPKVTFSESVNL 395
Query: 574 QSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE 633
+ G N + LLS VGL N G E G G V+L G G DLS W+Y+VGLKGE
Sbjct: 396 KVGNNKISLLSVAVGLPNVGLHFETWNVGVLGPVRLKGLDEGTRDLSWQKWSYKVGLKGE 455
Query: 634 FQQIYSIEENEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGH 692
+++I + + +WT + TWYKT FDAP G DPVALD+ SMGKG+ W+N
Sbjct: 456 SLSLHTITGSSSIDWTQGSSLAKKQPLTWYKTTFDAPSGNDPVALDMSSMGKGEIWINDQ 515
Query: 693 HIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
IGR+W G C D C+Y G + + KC TNCG PTQ WYH+PRSWL +S N+LV+ E
Sbjct: 516 SIGRHWPAYIAHGNC-DECNYAGTFTNPKCRTNCGEPTQKWYHIPRSWLSSSGNVLVVLE 574
Query: 753 ETGGNPFEISVKLRS 767
E GG+P IS+ R+
Sbjct: 575 EWGGDPTGISLVKRT 589
>gi|290782382|gb|ADD62393.1| beta-galactosidase 3 [Prunus persica]
Length = 683
Score = 593 bits (1530), Expect = e-166, Method: Compositional matrix adjust.
Identities = 314/671 (46%), Positives = 405/671 (60%), Gaps = 46/671 (6%)
Query: 188 FSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENI 247
F+ QGGPII+ QIENEYG + G G Y+ WAA MA+ L GVPWVMCK+ DAP+ +
Sbjct: 2 FASQGGPIILSQIENEYGPESKALGAAGHAYINWAAKMAVALDTGVPWVMCKEDDAPDPM 61
Query: 248 IDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
I+ACNG+YCDG+ PN KPT+WTE W GW+T +GG + HRPV+DLAF+VARF Q+GGS+
Sbjct: 62 INACNGFYCDGFSPNKPYKPTMWTEAWSGWFTEFGGTIHHRPVQDLAFSVARFIQKGGSY 121
Query: 308 MNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALV 367
+NYYMY GGTNFGRT+GGPF TSYDYD PIDEYGL+ +PK+GHLK+LH AIKLCE ALV
Sbjct: 122 INYYMYHGGTNFGRTAGGPFITTSYDYDVPIDEYGLIRQPKYGHLKELHKAIKLCEHALV 181
Query: 368 AADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVS 427
++D LG Q+A+V+ + C+AFL+N T A +TF Y LP WS+S
Sbjct: 182 SSDPT-VTSLGAYQQAYVFNSG----PRRCAAFLSNFHS-TGARMTFNNMHYDLPAWSIS 235
Query: 428 ILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPI 487
ILPDCRN VFNTAKV QTS + +P S + SW T E +
Sbjct: 236 ILPDCRNVVFNTAKVGVQTSRVQM-----------IPTNSRL--------FSWQTYDEDV 276
Query: 488 GVWSE-NNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRD 546
E ++ G+LE +NVT+D SDYLW++T + +S ++ K +PT+T+ S
Sbjct: 277 SSLHERSSIAAGGLLEQINVTRDTSDYLWYMTNVDISSSELRGGK----KPTLTVQSAGH 332
Query: 547 VLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAG 602
L VF+NGQ +GS G +PV ++G N + LLS VGL N G E G
Sbjct: 333 ALHVFVNGQFSGSAFGTREHRQFTFAKPVHLRAGINKIALLSIAVGLPNVGLHYESWKTG 392
Query: 603 FRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPS----T 658
G V L G G DL+ W +VGLKGE + + N D R + + T
Sbjct: 393 ILGPVFLDGLGQGRKDLTMQKWFNKVGLKGEAMDL--VSPNGGSSVDWIRGSLATQTKQT 450
Query: 659 FTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYN 718
WYK YF+AP G +P+ALD+ SMGKGQ W+NG IG+YW A G C C Y G +
Sbjct: 451 LKWYKAYFNAPGGDEPLALDMRSMGKGQVWINGQSIGKYWMAYA-NGDC-SLCSYIGTFR 508
Query: 719 SDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSES 778
KC CG PTQ WYHVPRSWL+ + NL+V+FEE GG+P +I++ RS VC + E
Sbjct: 509 PTKCQLGCGQPTQRWYHVPRSWLKPTQNLVVVFEELGGDPSKITLVKRSVAGVCADLQE- 567
Query: 779 HYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNC 838
H+P K + K ++HL C G ISSI+FAS+GTP G C F +G C
Sbjct: 568 HHPNAEKLDIDSHEESK---TLHQAQVHLQCVPGQSISSIKFASFGTPTGTCGSFQQGTC 624
Query: 839 HAPMSLSVVSE 849
HA S ++V +
Sbjct: 625 HATNSHAIVEK 635
>gi|413949218|gb|AFW81867.1| hypothetical protein ZEAMMB73_495459 [Zea mays]
Length = 759
Score = 588 bits (1515), Expect = e-165, Method: Compositional matrix adjust.
Identities = 341/810 (42%), Positives = 459/810 (56%), Gaps = 111/810 (13%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+Y+ RA+++DG RRML + +HYPR+TPEMWP LIAK+KEGG DVI+TYVFWN HE I+
Sbjct: 18 VTYEQRALVLDGARRMLFAGEMHYPRSTPEMWPKLIAKAKEGGLDVIQTYVFWNVHEPIQ 77
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQYNF+G+ D+V+F+K + + GLY+ LRIGP++ +EW +GGFP WL D+P I FR++N P
Sbjct: 78 GQYNFEGRYDLVRFIKEIQAQGLYVSLRIGPFIESEWKYGGFPFWLHDVPNITFRSDNEP 137
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK+ MQRFV IV++M+ E L+ QGGPII QIENEY +E ++G G+ YV WAA+MA
Sbjct: 138 FKQHMQRFVTDIVNMMKHEGLYYPQGGPIITSQIENEYQMVEPAFGSSGQRYVSWAAAMA 197
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+ L GVPW MCKQ DAP+ ++ +SY P + +N Y +G
Sbjct: 198 VDLQTGVPWTMCKQNDAPDPVVGI-----------HSYTIPVNF-QNDSRNYLIYGNDTK 245
Query: 287 HRPVEDLAFAVARFFQR-GGSFMNYYMYFGGTNFGRTSGGPFYITSYDYD-APIDEYGLL 344
R +D+ FAVA F R GS+++YYMY GGTNFGR + Y+T+ YD AP+DEYGL+
Sbjct: 246 LRSPQDITFAVALFIARKNGSYVSYYMYHGGTNFGRFASS--YVTTSYYDGAPLDEYGLI 303
Query: 345 SEPKWGHLKDLHAAIKL-CEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
+P WGHL++LHAA+K EP L S + +GQ QEAH++ +++ C AFL N
Sbjct: 304 WQPTWGHLRELHAAVKQSSEPLLFGTYSN--LSIGQEQEAHIFE-----TETQCVAFLVN 356
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISV 463
D+H + V F S L P S+SIL DC+ VF TAKV++Q +T E
Sbjct: 357 FDQHHISEVVFRNISLELAPKSISILLDCKQVVFETAKVNAQHGSRTAE----------- 405
Query: 464 PQQSMIESKLSSTSKSWMTVKEPIGV-WSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
E + S +W KEPI S++ ++ + EHL+ TKD +DYLW+I +++
Sbjct: 406 ------EVQSFSDISTWKAFKEPIPQDVSKSAYSGNRLFEHLSTTKDATDYLWYIVGLFL 459
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQP-VEFQSGYNDLI 581
+ I G++ GS G + + Q G N +
Sbjct: 460 N----------------------------ILGRIHGSHGGPANIIFSTNISLQEGPNTIS 491
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS VG + GA +E+ G R +V + + + L+ LW YQVGL GE IY+ +
Sbjct: 492 LLSAMVGSPDSGAHMERRVFGIR-KVSIQQGQEPENLLNNELWGYQVGLFGERNNIYTQD 550
Query: 642 ENEAEWTDLTRDGIP-STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-T 699
EWT T D + S TWYKT F P G D V L+L MGKG+ WVNG IGRYW +
Sbjct: 551 SKITEWT--TIDNLTYSPLTWYKTTFSTPVGNDAVTLNLTGMGKGEVWVNGESIGRYWVS 608
Query: 700 VVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPF 759
AP GNP+Q+ YH+PR +L +N LV+FEE GGNP
Sbjct: 609 FKAP----------------------SGNPSQSLYHIPREFLNPQDNTLVLFEEMGGNPQ 646
Query: 760 EISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIE 819
I+V S VC V+E P L P + L C +G IS+IE
Sbjct: 647 LITVNTMSVSRVCGNVNELSAP-------------SLQYKDKEPAVDLWCPEGKHISAIE 693
Query: 820 FASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
FASYG P G C+KF G CHA S SVV +
Sbjct: 694 FASYGGPTGDCKKFGFGRCHAGSSESVVKQ 723
>gi|414888322|tpg|DAA64336.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
Length = 822
Score = 587 bits (1513), Expect = e-165, Method: Compositional matrix adjust.
Identities = 317/813 (38%), Positives = 463/813 (56%), Gaps = 66/813 (8%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD R+++IDG R + S IHYPR+ PE+WP LI ++KEGG + IETY+FWNAHE
Sbjct: 36 VTYDGRSLMIDGKRDLFFSGAIHYPRSPPEVWPKLIERAKEGGLNTIETYIFWNAHEPEP 95
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G+YNF+G+ D++K++K++ +Y +RIGP++ AEWN GG P WLR+I I FR NN P
Sbjct: 96 GKYNFEGRFDLIKYLKMIQEHDMYAIVRIGPFIQAEWNHGGLPYWLREIDHIIFRANNDP 155
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
+K+EM++FV+ IV +++ LF+ QGGPII+ QIENEYGN++ + G Y++WAA MA
Sbjct: 156 YKKEMEKFVRFIVQKLKDAELFASQGGPIILTQIENEYGNIKKDHATDGDKYLEWAAQMA 215
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
L GVPW+MCKQ+ AP +I CNG +C D + NKP LWTENW + +G ++
Sbjct: 216 LSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWTLRDKNKPMLWTENWTQQFRAYGDQV 275
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
R ED+A+AV RFF +GGS +NYYMY GGTNFGRT G + +T Y +AP+DEYG+
Sbjct: 276 AMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMYK 334
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPK+GHL+DLH I+ + A + + I LG EAH++ ++ C +FL+N +
Sbjct: 335 EPKFGHLRDLHNVIRSYQKAFLLGKHSSEI-LGHGYEAHIFELP---EENLCLSFLSNNN 390
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+V F G+ + +P SVSIL C+N V+NT +V Q + +
Sbjct: 391 TGEDGTVIFRGEKHYVPSRSVSILAGCKNVVYNTKRVFVQHN-----------------E 433
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
+S S+++S + W E I + + ++ LE N TKD SDYLW+ T + D
Sbjct: 434 RSYHTSEVTSKNNQWEMYSEKIPKYRDTKVRMKEPLEQFNQTKDASDYLWYTTSFRLESD 493
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG-HWVK---VVQPVEFQSGYNDLI 581
D+ F N++RP + + S + F N G G VK +PV+ + G N ++
Sbjct: 494 DLPF--RNDIRPVLQVKSSAHSMMGFANDAFVGCARGSKQVKGFMFEKPVDLKVGVNHVV 551
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS T+G+++ G L + +G + + + G G +DL W ++ L+GE ++IYS +
Sbjct: 552 LLSSTMGMKDSGGELAEVKSGIQ-ECLIQGLNTGTLDLQVNGWGHKAALEGEDKEIYSEK 610
Query: 642 E-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ +W TWYK YFD PDG DPV LD+ SM KG +VNG +GRYW
Sbjct: 611 GVGKVQWKPAENG---RAATWYKRYFDEPDGDDPVVLDMSSMDKGMIFVNGEGVGRYWV- 666
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
YR T G P+Q YH+PR +L++ +NLLV+FEE G P
Sbjct: 667 -----------SYR---------TLAGTPSQALYHIPRPFLKSKDNLLVVFEEEMGKPDG 706
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDG---KLSINKMAPEMHLHCQDGYIISS 817
I V+ + +C +SE + ++ W DG KL + L C I
Sbjct: 707 ILVQTVTRDDICLFISEHNPGQIKTW----DTDGDKIKLIAEDHSRRGTLMCPPEKTIQE 762
Query: 818 IEFASYGTPQGRCQKFS----RGNCHAPMSLSV 846
+ FAS+G P+G C F+ + +C P+ +V
Sbjct: 763 VVFASFGNPEGMCGNFTECLGKPSCMLPVDHTV 795
>gi|242090613|ref|XP_002441139.1| hypothetical protein SORBIDRAFT_09g021140 [Sorghum bicolor]
gi|241946424|gb|EES19569.1| hypothetical protein SORBIDRAFT_09g021140 [Sorghum bicolor]
Length = 784
Score = 584 bits (1505), Expect = e-164, Method: Compositional matrix adjust.
Identities = 339/813 (41%), Positives = 459/813 (56%), Gaps = 110/813 (13%)
Query: 44 PFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHE 103
P VS D RA+++DG RR+L + +HY R+TPEMWP LIAK+KEGG D+I+TYVFWN HE
Sbjct: 39 PRQVSLDARALVVDGTRRLLFAGEMHYTRSTPEMWPKLIAKAKEGGLDMIQTYVFWNVHE 98
Query: 104 SIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTN 163
++GQYNF+G+ D+V+F+K + + GLY+ LRIGP++ +EW +GGFP WL D+P I FR++
Sbjct: 99 PVQGQYNFEGRYDLVRFIKEIQAQGLYVSLRIGPFIESEWKYGGFPFWLHDVPNITFRSD 158
Query: 164 NAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAA 223
N PFK+ MQRFV IV++M+ E L+ QGGPII QIENEY +E ++G G+ YV WAA
Sbjct: 159 NEPFKQHMQRFVTDIVNMMKHEGLYYPQGGPIITSQIENEYQMVEHAFGSSGQRYVSWAA 218
Query: 224 SMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGG 283
+MA+ GVPW MCKQ DAP+ ++ +S+ P L N Y +G
Sbjct: 219 AMAVDRQTGVPWTMCKQNDAPDPVVGI-----------HSHTIP-LDFPNASRNYLIYGN 266
Query: 284 RLPHRPVEDLAFAVARFFQR-GGSFMNYYMYFGGTNFGRTSGGPFYITSYDYD-APIDEY 341
R ED+AFAV F R GS+++YYMY GGTNFGR + Y+T+ YD AP+DEY
Sbjct: 267 DTKLRSPEDIAFAVVYFIARKNGSYVSYYMYHGGTNFGRFASS--YVTTSYYDAAPLDEY 324
Query: 342 GLLSEPKWGHLKDLHAAIKL-CEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAF 400
GL+ +P WGHL++LHAA+K EP L S Y+ LGQ QEAH++ ++S C AF
Sbjct: 325 GLIWQPTWGHLRELHAAVKQSSEPLLFGTYS--YLSLGQEQEAHIFE-----TESQCVAF 377
Query: 401 LANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPN 460
L N D H + V F S L P S+SIL DC+ VF TAKV++Q +T E
Sbjct: 378 LVNFDRHHISEVVFRNISLELAPKSISILSDCKRVVFETAKVTAQHGSRTAE-------- 429
Query: 461 ISVPQQSMIESKLSSTSKSWMTVKEPIGV-WSENNFTVQGILEHLNVTKDYSDYLWHITQ 519
E + S +W KEPI S+ ++ + EHL+ TKD +DYLW+I
Sbjct: 430 ---------EVQSFSDINTWTAFKEPIPQDVSKAMYSGNRLFEHLSTTKDDTDYLWYIVG 480
Query: 520 IYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQP-VEFQSGYN 578
++ + I G++ GS G ++ + + G N
Sbjct: 481 LFHN----------------------------ILGRIHGSHGGPANIILNTNISLKEGPN 512
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIY 638
+ LLS VG + GA +E+ G + +V + + + L+ LW YQVGL GE IY
Sbjct: 513 TISLLSAMVGSPDSGAHMERRVFGLQ-KVSIQQGQEPENLLNNELWGYQVGLFGERNSIY 571
Query: 639 SIEENEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
+ E +++ EWT + S TWYKT F P G D V L+L MGKG+ WVNG IGRY
Sbjct: 572 TQEGSKSVEWTTIYNLAY-SPLTWYKTTFSTPAGNDAVTLNLTGMGKGEVWVNGESIGRY 630
Query: 698 W-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGG 756
W + AP GNP+Q+ YH+PR +L +N+LV+FEE GG
Sbjct: 631 WVSFKAPS----------------------GNPSQSLYHIPRQFLNPQDNILVLFEEMGG 668
Query: 757 NPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIIS 816
NP +I+V S VC V+E P L P + L CQ+G IS
Sbjct: 669 NPQQITVNTVSVTRVCVNVNELSAP-------------SLQYKNKEPAVDLRCQEGKQIS 715
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+IEFASYG P G C+K G+CHA S SVV +
Sbjct: 716 AIEFASYGNPIGDCKKIRFGSCHAGSSESVVKQ 748
>gi|152013365|sp|Q0IZZ8.2|BGL12_ORYSJ RecName: Full=Beta-galactosidase 12; Short=Lactase 12; Flags:
Precursor
Length = 911
Score = 580 bits (1495), Expect = e-162, Method: Compositional matrix adjust.
Identities = 316/813 (38%), Positives = 457/813 (56%), Gaps = 64/813 (7%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYD R+++IDG R + S IHYPR+ PEMW L+ +K GG + IETYVFWN HE
Sbjct: 36 VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWNGHEPEP 95
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G+Y F+G+ D+++F+ ++ + +Y +RIGP++ AEWN GG P WLR+I I FR NN P
Sbjct: 96 GKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHIIFRANNEP 155
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK EM++FV+ IV +++ +F+ QGGPII+ QIENEYGN++ +G Y++WAA MA
Sbjct: 156 FKREMEKFVRFIVQKLKDAEMFAPQGGPIILSQIENEYGNIKKDRKVEGDKYLEWAAEMA 215
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
+ G GVPWVMCKQ+ AP +I CNG +C D + NKP LWTENW + T+G +L
Sbjct: 216 ISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDKNKPRLWTENWTAQFRTFGDQL 275
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
R ED+A+AV RFF +GG+ +NYYMY GGTNFGRT G + +T Y +AP+DEYG+
Sbjct: 276 AQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMCK 334
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPK+GHL+DLH IK A + + I LG EAH Y C +FL+N +
Sbjct: 335 EPKFGHLRDLHNVIKSYHKAFLWGKQSFEI-LGHGYEAHNY---ELPEDKLCLSFLSNNN 390
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+V F G+ + +P SVSIL DC+ V+NT +V Q S +
Sbjct: 391 TGEDGTVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHS-----------------E 433
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
+S + +S + W E I + + + LE N TKD SDYLW+ T + D
Sbjct: 434 RSFHTTDETSKNNVWEMYSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESD 493
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYNDLI 581
D+ F + ++RP + I S + F N G+ G + +P++ + G N +
Sbjct: 494 DLPFRR--DIRPVIQIKSTAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIA 551
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
+LS ++G+++ G L + G + V + G G +DL W ++ L+GE ++IY+ E
Sbjct: 552 MLSSSMGMKDSGGELVEVKGGIQDCV-VQGLNTGTLDLQGNGWGHKARLEGEDKEIYT-E 609
Query: 642 ENEA--EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
+ A +W D +P TWYK YFD PDG DP+ +D+ SM KG +VNG IGRYWT
Sbjct: 610 KGMAQFQWKPAEND-LP--ITWYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWT 666
Query: 700 VVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPF 759
T G+P+Q+ YH+PR++L+ NLL+IFEE G P
Sbjct: 667 SF---------------------ITLAGHPSQSVYHIPRAFLKPKGNLLIIFEEELGKPG 705
Query: 760 EISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDG---KLSINKMAPEMHLHCQDGYIIS 816
I ++ +C +SE + ++ W + DG KL + L+C I
Sbjct: 706 GILIQTVRRDDICVFISEHNPAQIKTWES----DGGQIKLIAEDTSTRGTLNCPPKRTIQ 761
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+ FAS+G P+G C F+ G CH P + ++V +
Sbjct: 762 EVVFASFGNPEGACGNFTAGTCHTPDAKAIVEK 794
>gi|222642000|gb|EEE70132.1| hypothetical protein OsJ_30164 [Oryza sativa Japonica Group]
Length = 838
Score = 580 bits (1495), Expect = e-162, Method: Compositional matrix adjust.
Identities = 316/813 (38%), Positives = 457/813 (56%), Gaps = 64/813 (7%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYD R+++IDG R + S IHYPR+ PEMW L+ +K GG + IETYVFWN HE
Sbjct: 36 VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWNGHEPEP 95
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G+Y F+G+ D+++F+ ++ + +Y +RIGP++ AEWN GG P WLR+I I FR NN P
Sbjct: 96 GKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHIIFRANNEP 155
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK EM++FV+ IV +++ +F+ QGGPII+ QIENEYGN++ +G Y++WAA MA
Sbjct: 156 FKREMEKFVRFIVQKLKDAEMFAPQGGPIILSQIENEYGNIKKDRKVEGDKYLEWAAEMA 215
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
+ G GVPWVMCKQ+ AP +I CNG +C D + NKP LWTENW + T+G +L
Sbjct: 216 ISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDKNKPRLWTENWTAQFRTFGDQL 275
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
R ED+A+AV RFF +GG+ +NYYMY GGTNFGRT G + +T Y +AP+DEYG+
Sbjct: 276 AQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMCK 334
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPK+GHL+DLH IK A + + I LG EAH Y C +FL+N +
Sbjct: 335 EPKFGHLRDLHNVIKSYHKAFLWGKQSFEI-LGHGYEAHNYELP---EDKLCLSFLSNNN 390
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+V F G+ + +P SVSIL DC+ V+NT +V Q S +
Sbjct: 391 TGEDGTVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHS-----------------E 433
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
+S + +S + W E I + + + LE N TKD SDYLW+ T + D
Sbjct: 434 RSFHTTDETSKNNVWEMYSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESD 493
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYNDLI 581
D+ F + ++RP + I S + F N G+ G + +P++ + G N +
Sbjct: 494 DLPFRR--DIRPVIQIKSTAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIA 551
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
+LS ++G+++ G L + G + V + G G +DL W ++ L+GE ++IY+ E
Sbjct: 552 MLSSSMGMKDSGGELVEVKGGIQDCV-VQGLNTGTLDLQGNGWGHKARLEGEDKEIYT-E 609
Query: 642 ENEA--EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
+ A +W D +P TWYK YFD PDG DP+ +D+ SM KG +VNG IGRYWT
Sbjct: 610 KGMAQFQWKPAEND-LP--ITWYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWT 666
Query: 700 VVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPF 759
T G+P+Q+ YH+PR++L+ NLL+IFEE G P
Sbjct: 667 SF---------------------ITLAGHPSQSVYHIPRAFLKPKGNLLIIFEEELGKPG 705
Query: 760 EISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDG---KLSINKMAPEMHLHCQDGYIIS 816
I ++ +C +SE + ++ W + DG KL + L+C I
Sbjct: 706 GILIQTVRRDDICVFISEHNPAQIKTWES----DGGQIKLIAEDTSTRGTLNCPPKRTIQ 761
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+ FAS+G P+G C F+ G CH P + ++V +
Sbjct: 762 EVVFASFGNPEGACGNFTAGTCHTPDAKAIVEK 794
>gi|357520325|ref|XP_003630451.1| Beta-galactosidase [Medicago truncatula]
gi|355524473|gb|AET04927.1| Beta-galactosidase [Medicago truncatula]
Length = 706
Score = 580 bits (1494), Expect = e-162, Method: Compositional matrix adjust.
Identities = 316/731 (43%), Positives = 435/731 (59%), Gaps = 69/731 (9%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YD +++I+G+ ++L S IHYPR+TP+MWPDLI+K+KEGG DVI+TYVFWN HE
Sbjct: 25 NVTYDRTSLVINGHHKILFSGSIHYPRSTPQMWPDLISKAKEGGLDVIQTYVFWNLHEPQ 84
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+GQY F G+ D+V F+K + + GLY+ LRIGPY+ +E +GG P+WL D+PGI FRT+N
Sbjct: 85 QGQYEFNGRFDLVGFIKEIQAQGLYVTLRIGPYIESECTYGGLPLWLHDVPGIVFRTDND 144
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
FK MQRF KIV++M+ LF+ QGGPII+ QIENEYG+++S + G Y+ WAA M
Sbjct: 145 QFKFHMQRFTTKIVNMMKSANLFASQGGPIILSQIENEYGSIQSKFRANGLPYIHWAAQM 204
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCD-GYK-PNSYNKPTLWTENWDGWYTTWGG 283
A+GL GVPW+MCKQ DAP+ +I+ACNG C +K PNS NKP+LWTENW + +GG
Sbjct: 205 AVGLQTGVPWMMCKQDDAPDPVINACNGMQCGRNFKGPNSPNKPSLWTENWTSFLQAFGG 264
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
R D+A+ VA F + GS++NYYMY GGTNF R + F IT+Y +AP+DEYGL
Sbjct: 265 APYMRSASDIAYNVALFIAKKGSYVNYYMYHGGTNFDRLASA-FIITAYYDEAPLDEYGL 323
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLG------QNQEAHVYRANRYGSQSNC 397
+ +PKWGHLK+LHA+IK C L+ + LG +N+ + Y +
Sbjct: 324 VRQPKWGHLKELHASIKSCSQPLLDGTQTTF-SLGSEQQVIKNESSWTYFPLMFSEVPQN 382
Query: 398 SAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPL 457
I ++ F SY LP S+SILP C+N VFNT KVS Q +++ ++ L
Sbjct: 383 VLLSWKISGPRDVTIQFQNISYELPGKSISILPGCKNVVFNTGKVSIQNNVRAMKPRLQF 442
Query: 458 SPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHI 517
+++++W E I ++ + +L+ ++ KD SDY+W+
Sbjct: 443 -----------------NSAENWKVYTEAIPNFAHTSKRADTLLDQISTAKDTSDYMWYT 485
Query: 518 TQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGH----WVKVVQPVEF 573
+ K+ + ++I S DVL FING LTGS G V + + V
Sbjct: 486 FRFNN--------KSPNAKSVLSIYSQGDVLHSFINGVLTGSAHGSRNNTQVTMKKNVNL 537
Query: 574 QSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE 633
+G N++ +LS TVGL N GAFLE AG R +V++ G D S W YQVGL GE
Sbjct: 538 INGMNNISILSATVGLPNSGAFLESRVAGLR-KVEVQGR-----DFSSYSWGYQVGLLGE 591
Query: 634 FQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGH 692
QI+++ ++ +W P TWY+T F AP G DPV ++LGSMGKG AWVNG
Sbjct: 592 KLQIFTVSGSSKVQWKSFQSSTKP--LTWYQTTFHAPAGNDPVVVNLGSMGKGLAWVNGQ 649
Query: 693 HIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
IGRYW + D G P+Q WYH+PRS+L+++ NLLVI E
Sbjct: 650 GIGRYWVSF---------------HKPD------GTPSQQWYHIPRSFLKSTGNLLVILE 688
Query: 753 ETGGNPFEISV 763
E GNP I++
Sbjct: 689 EETGNPLGITL 699
>gi|222631666|gb|EEE63798.1| hypothetical protein OsJ_18622 [Oryza sativa Japonica Group]
Length = 765
Score = 577 bits (1487), Expect = e-162, Method: Compositional matrix adjust.
Identities = 336/818 (41%), Positives = 442/818 (54%), Gaps = 132/818 (16%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
++YD RA+++ G RRM S +HY R+TPEMWP LIAK+K GG DVI+TYVFWN HE I+
Sbjct: 29 ITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVHEPIQ 88
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQYNF+G+ D+VKF++ + + GLY+ LRIGP+V AEW +GGFP WL D+P I FR++N P
Sbjct: 89 GQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRSDNEP 148
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK+ MQ FV KIV +M+ E L+ QGGPII+ QIENEY +E ++G G YV+WAA+MA
Sbjct: 149 FKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWAAAMA 208
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDG--YKPNSYNKPTLWTENWDGWYTTWGGR 284
+GL GVPW+MCKQ DAP+ +I+ CNG C PNS NKP LWTENW Y +G
Sbjct: 209 VGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRYPIYGND 268
Query: 285 LPHRPVEDLAFAVARFFQR-GGSFMNYYMYFGGTNFGRTSGGPFYITSYDYD-APIDEYG 342
R ED+AFAVA F R GSF++YYMY GGTNFGR + Y+T+ YD AP+DEY
Sbjct: 269 TKLRAPEDIAFAVALFIARKKGSFVSYYMYHGGTNFGRFAAS--YVTTSYYDGAPLDEYD 326
Query: 343 LLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLA 402
C AFL
Sbjct: 327 F----------------------------------------------------KCVAFLV 334
Query: 403 NIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNIS 462
N D+H V F S L P S+S+L DCRN VF TAKV++Q +T L
Sbjct: 335 NFDQHNTPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNAQHGSRTANAVQSL----- 389
Query: 463 VPQQSMIESKLSSTSKSWMTVKEPIGV-WSENNFTVQGILEHLNVTKDYSDYLWHITQIY 521
+ +W EP+ S++ +T + E L TKD +DYLW+I
Sbjct: 390 ------------NDINNWKAFIEPVPQDLSKSTYTGNQLFEQLTTTKDETDYLWYIVSYK 437
Query: 522 VSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW-----VKVVQPVEFQSG 576
D N++ + + S+ +L F+N + GSV G + + + + G
Sbjct: 438 NRASD-----GNQI-AHLYVKSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEG 491
Query: 577 YNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDID---LSKILWTYQVGLKGE 633
N + LLS VG + GA++E+ G ++ G + G L+ LW YQVGL GE
Sbjct: 492 DNTISLLSVMVGSPDSGAYMERRTFG----IQTVGIQQGQQPMHLLNNDLWGYQVGLFGE 547
Query: 634 FQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGH 692
IY+ E N W D+ + I TWYKT F P G D V L+L SMGKG+ WVNG
Sbjct: 548 KDSIYTQEGTNSVRWMDIN-NLIYHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEVWVNGE 606
Query: 693 HIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIF 751
IGRYW + AP G P+Q+ YH+PR +L +NLLV+
Sbjct: 607 SIGRYWVSFKAPS----------------------GQPSQSLYHIPRGFLTPKDNLLVLV 644
Query: 752 EETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQD 811
EE GG+P +I+V S VC V E PP++ GK+ P++ + CQ
Sbjct: 645 EEMGGDPLQITVNTMSVTTVCGNVDEFSVPPLQS-------RGKV------PKVRIWCQG 691
Query: 812 GYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
G ISSIEFASYG P G C+ F G+CHA S SVV +
Sbjct: 692 GNRISSIEFASYGNPVGDCRSFRIGSCHAESSESVVKQ 729
>gi|57283683|emb|CAG30731.1| beta-galactosidase precursor [Triticum monococcum]
Length = 839
Score = 577 bits (1486), Expect = e-161, Method: Compositional matrix adjust.
Identities = 310/811 (38%), Positives = 451/811 (55%), Gaps = 60/811 (7%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD +++IDG R + S IHYPR+ +MWP L+ +KEGG + IETYVFWNAHE
Sbjct: 38 VTYDKYSLMIDGRRELFFSGAIHYPRSPTQMWPKLLKTAKEGGLNTIETYVFWNAHEPEP 97
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G++NF+G+ND++KF+KL+ S G+Y +RIGP++ EWN G P WLR+IP I FR NN P
Sbjct: 98 GKFNFEGRNDMIKFLKLIQSFGMYAIVRIGPFIQGEWNHGALPYWLREIPHIIFRANNEP 157
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
+K EM++FV+ IV ++++E LF+ QGG +I+ QIENEYGN++ + +G Y++WAA MA
Sbjct: 158 YKREMEKFVRFIVQMLKDENLFASQGGNVILAQIENEYGNIKKDHITEGDKYLEWAAEMA 217
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
+ GVPW+MCKQ+ AP +I CNG +C D + NKP LWTENW + +G L
Sbjct: 218 ISTNIGVPWIMCKQSTAPGVVIPTCNGRHCGDTWIMKDENKPHLWTENWTAQFRAFGNDL 277
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
R ED+A++V RFF +GG+ +NYYMY+GGTNFGRT G + +T Y + PIDEYG+
Sbjct: 278 AQRSAEDIAYSVLRFFAKGGTLVNYYMYYGGTNFGRT-GASYVLTGYYDEGPIDEYGMPK 336
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
PK+GHL+DLH IK A + + + LGQ EA R + C AF++N +
Sbjct: 337 APKYGHLRDLHNVIKSYSRAFLEGKQS-FELLGQGYEA---RNFEIPEEKLCLAFISNNN 392
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+V F G Y +P SVSIL DC++ V+NT +V Q S +
Sbjct: 393 TGEDGTVIFRGDKYYIPSRSVSILADCKHVVYNTKRVFVQHS-----------------E 435
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
+S +++ ++ + W E I + + + LE N TKD SDYLW+ T + D
Sbjct: 436 RSFHKAEKATKNNVWEMFSELIPRYKQTTIRNKEPLEQYNQTKDQSDYLWYTTSFRLEAD 495
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQ------PVEFQSGYND 579
D+ ++RP + + S + F+N G+ GH K + P+ + G N
Sbjct: 496 DLPI--RGDIRPVIAVKSTAHAMVGFVNDAFAGN--GHGSKKEKFFTFETPISLRLGVNH 551
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS 639
L LLS ++G+++ G L + G + + G G +DL W ++ L+GE ++IY+
Sbjct: 552 LALLSSSMGMKDSGGELVELKGGIQ-DCTIQGLNTGTLDLQINGWGHKAKLEGEVKEIYT 610
Query: 640 IEENEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
+ A +W TWYK YFD PDG DPV LD+ SM KG +VNG +GRYW
Sbjct: 611 EKGMGAVKWVPAVSG---QAVTWYKRYFDEPDGDDPVVLDMTSMCKGMIFVNGEGMGRYW 667
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
T G +Q YH+PR++L++ NNLLV+FEE G P
Sbjct: 668 TSYKTPGKVA---------------------SQAVYHIPRTFLKSKNNLLVVFEEELGKP 706
Query: 759 FEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSI 818
I ++ +C +SE + ++ W + + KL L+C II +
Sbjct: 707 EGILIQTVRRDDICVFISEHNPAQIKPW-DEHGGQIKLIAEDHNTRGFLNCPPKKIIQEV 765
Query: 819 EFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
FAS+G P G C F+ G CH P + +V +
Sbjct: 766 VFASFGNPVGSCANFTVGTCHTPNAKEIVEK 796
>gi|218196839|gb|EEC79266.1| hypothetical protein OsI_20049 [Oryza sativa Indica Group]
Length = 761
Score = 576 bits (1484), Expect = e-161, Method: Compositional matrix adjust.
Identities = 335/818 (40%), Positives = 442/818 (54%), Gaps = 132/818 (16%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
++YD RA+++ G RRM S +HY R+TPEMWP LIAK+K GG DVI+TYVFWN HE I+
Sbjct: 25 ITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVHEPIQ 84
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQYNF+G+ D+VKF++ + + GLY+ LRIGP+V AEW +GGFP WL D+P I FR++N P
Sbjct: 85 GQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRSDNEP 144
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK+ MQ FV KIV +M+ E L+ QGGPII+ QIENEY +E ++G G YV+WAA+MA
Sbjct: 145 FKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWAAAMA 204
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDG--YKPNSYNKPTLWTENWDGWYTTWGGR 284
+GL GVPW+MCKQ DAP+ +I+ CNG C PNS NKP LWTENW Y +G
Sbjct: 205 VGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRYPIYGND 264
Query: 285 LPHRPVEDLAFAVARFFQR-GGSFMNYYMYFGGTNFGRTSGGPFYITSYDYD-APIDEYG 342
R ED+AFAVA + R GSF++YYMY GGTNFGR + Y+T+ YD AP+DEY
Sbjct: 265 TKLRDPEDIAFAVALYIARKKGSFVSYYMYHGGTNFGRFAAS--YVTTSYYDGAPLDEYD 322
Query: 343 LLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLA 402
C AFL
Sbjct: 323 F----------------------------------------------------KCVAFLV 330
Query: 403 NIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNIS 462
N D+H V F S L P S+S+L DCRN VF TAKV++Q +T L
Sbjct: 331 NFDQHNTPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNAQHGSRTANAVQSL----- 385
Query: 463 VPQQSMIESKLSSTSKSWMTVKEPIGV-WSENNFTVQGILEHLNVTKDYSDYLWHITQIY 521
+ +W EP+ S++ +T + E L TKD +DYLW+I
Sbjct: 386 ------------NDINNWKAFIEPVPQDLSKSTYTGNQLFEQLTTTKDETDYLWYIVSYK 433
Query: 522 VSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW-----VKVVQPVEFQSG 576
D N++ + + S+ +L F+N + GSV G + + + + G
Sbjct: 434 NRASD-----GNQI-ARLYVKSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEG 487
Query: 577 YNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDID---LSKILWTYQVGLKGE 633
N + LLS VG + GA++E+ G ++ G + G L+ LW YQVGL GE
Sbjct: 488 DNTISLLSVMVGSPDSGAYMERRTFG----IQTVGIQQGQQPMHLLNNDLWGYQVGLFGE 543
Query: 634 FQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGH 692
IY+ E N W D+ + I TWYKT F P G D V L+L SMGKG+ WVNG
Sbjct: 544 KDSIYTQEGPNSVRWMDIN-NLIYHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEVWVNGE 602
Query: 693 HIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIF 751
IGRYW + AP G P+Q+ YH+PR +L +NLLV+
Sbjct: 603 SIGRYWVSFKAPS----------------------GQPSQSLYHIPRGFLTPKDNLLVLV 640
Query: 752 EETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQD 811
EE GG+P +I+V S VC V E PP++ GK+ P++ + CQ
Sbjct: 641 EEMGGDPLQITVNTMSVTTVCGNVDEFSVPPLQS-------RGKV------PKVRIWCQG 687
Query: 812 GYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
G ISSIEFASYG P G C+ F G+CHA S SVV +
Sbjct: 688 GKRISSIEFASYGNPVGDCRSFRIGSCHAESSESVVKQ 725
>gi|12323389|gb|AAG51670.1|AC010704_14 putative beta-galactosidase, 3' partial; 3669-1 [Arabidopsis
thaliana]
Length = 636
Score = 571 bits (1471), Expect = e-160, Method: Compositional matrix adjust.
Identities = 310/652 (47%), Positives = 409/652 (62%), Gaps = 49/652 (7%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YD R++IIDG ++L S IHY R+TP+MWP LIAK+K GG DV++TYVFWN HE
Sbjct: 24 NVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVVDTYVFWNVHEPQ 83
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+GQ++F G DIVKF+K V + GLY+ LRIGP++ EW++GG P WL ++ GI FRT+N
Sbjct: 84 QGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGIVFRTDNE 143
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK M+R+ K IV LM+ E L++ QGGPII+ QIENEYG + ++ Q+GK YVKW A +
Sbjct: 144 PFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVGRAFRQEGKSYVKWTAKL 203
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGG 283
A+ L GVPWVMCKQ DAP+ +++ACNG C + +K PNS NKP +WTENW +Y T+G
Sbjct: 204 AVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSFYQTYGE 263
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
R ED+AF VA F + GSF+NYYMY GGTNFGR + F ITSY AP+DEYGL
Sbjct: 264 EPLIRSAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVITSYYDQAPLDEYGL 322
Query: 344 LSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSN-CSAFLA 402
L +PKWGHLK+LHAA+KLCE L++ I LG+ Q A V +G ++N C+A L
Sbjct: 323 LRQPKWGHLKELHAAVKLCEEPLLSGLQTT-ISLGKLQTAFV-----FGKKANLCAAILV 376
Query: 403 NIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNIS 462
N D+ ++V F SY L P SVS+LPDC+N FNTAKV++Q + +T + N+S
Sbjct: 377 NQDK-CESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYNTRTRK----ARQNLS 431
Query: 463 VPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
PQ W E + +SE + + +LEH+N T+D SDYLW T+
Sbjct: 432 SPQM-------------WEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQTTRFQQ 478
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYN 578
S+ S K N + L F+NG+ GS+ G H + + + +G N
Sbjct: 479 SEGAPSVLKVNH---------LGHALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTN 529
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDL--SKILWTYQVGLKGEFQQ 636
+L LLS VGL N GA LE+ G R VK+ NG L + W YQVGLKGE
Sbjct: 530 NLALLSVMVGLPNSGAHLERRVVGSRS-VKIW---NGRYQLYFNNYSWGYQVGLKGEKFH 585
Query: 637 IYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQA 687
+Y+ + + +W RD TWYK FD P+G DPVAL+LGSMGKG+A
Sbjct: 586 VYTEDGSAKVQWKQY-RDSKSQPLTWYKASFDTPEGEDPVALNLGSMGKGEA 636
>gi|297724143|ref|NP_001174435.1| Os05g0428100 [Oryza sativa Japonica Group]
gi|75137607|sp|Q75HQ3.1|BGAL7_ORYSJ RecName: Full=Beta-galactosidase 7; Short=Lactase 7; Flags:
Precursor
gi|46391137|gb|AAS90664.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|53981746|gb|AAV25023.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|255676388|dbj|BAH93163.1| Os05g0428100 [Oryza sativa Japonica Group]
Length = 775
Score = 570 bits (1469), Expect = e-159, Method: Compositional matrix adjust.
Identities = 336/828 (40%), Positives = 442/828 (53%), Gaps = 142/828 (17%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
++YD RA+++ G RRM S +HY R+TPEMWP LIAK+K GG DVI+TYVFWN HE I+
Sbjct: 29 ITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVHEPIQ 88
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQYNF+G+ D+VKF++ + + GLY+ LRIGP+V AEW +GGFP WL D+P I FR++N P
Sbjct: 89 GQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRSDNEP 148
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK+ MQ FV KIV +M+ E L+ QGGPII+ QIENEY +E ++G G YV+WAA+MA
Sbjct: 149 FKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWAAAMA 208
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDG--YKPNSYNKPTLWTENWDGW------- 277
+GL GVPW+MCKQ DAP+ +I+ CNG C PNS NKP LWTENW
Sbjct: 209 VGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRSNGQNNS 268
Query: 278 ---YTTWGGRLPHRPVEDLAFAVARFFQR-GGSFMNYYMYFGGTNFGRTSGGPFYITSYD 333
Y +G R ED+AFAVA F R GSF++YYMY GGTNFGR + Y+T+
Sbjct: 269 AFSYPIYGNDTKLRAPEDIAFAVALFIARKKGSFVSYYMYHGGTNFGRFAAS--YVTTSY 326
Query: 334 YD-APIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYG 392
YD AP+DEY
Sbjct: 327 YDGAPLDEYDF------------------------------------------------- 337
Query: 393 SQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVE 452
C AFL N D+H V F S L P S+S+L DCRN VF TAKV++Q +T
Sbjct: 338 ---KCVAFLVNFDQHNTPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNAQHGSRTAN 394
Query: 453 FSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGV-WSENNFTVQGILEHLNVTKDYS 511
L + +W EP+ S++ +T + E L TKD +
Sbjct: 395 AVQSL-----------------NDINNWKAFIEPVPQDLSKSTYTGNQLFEQLTTTKDET 437
Query: 512 DYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW-----VK 566
DYLW+I D N++ + + S+ +L F+N + GSV G +
Sbjct: 438 DYLWYIVSYKNRASD-----GNQI-AHLYVKSLAHILHAFVNNEYVGSVHGSHDGPRNIV 491
Query: 567 VVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDID---LSKIL 623
+ + + G N + LLS VG + GA++E+ G ++ G + G L+ L
Sbjct: 492 LNTHMSLKEGDNTISLLSVMVGSPDSGAYMERRTFG----IQTVGIQQGQQPMHLLNNDL 547
Query: 624 WTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSM 682
W YQVGL GE IY+ E N W D+ + I TWYKT F P G D V L+L SM
Sbjct: 548 WGYQVGLFGEKDSIYTQEGTNSVRWMDIN-NLIYHPLTWYKTTFSTPPGNDAVTLNLTSM 606
Query: 683 GKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWL 741
GKG+ WVNG IGRYW + AP G P+Q+ YH+PR +L
Sbjct: 607 GKGEVWVNGESIGRYWVSFKAPS----------------------GQPSQSLYHIPRGFL 644
Query: 742 QASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKM 801
+NLLV+ EE GG+P +I+V S VC V E PP++ GK+
Sbjct: 645 TPKDNLLVLVEEMGGDPLQITVNTMSVTTVCGNVDEFSVPPLQS-------RGKV----- 692
Query: 802 APEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
P++ + CQ G ISSIEFASYG P G C+ F G+CHA S SVV +
Sbjct: 693 -PKVRIWCQGGNRISSIEFASYGNPVGDCRSFRIGSCHAESSESVVKQ 739
>gi|147768425|emb|CAN73625.1| hypothetical protein VITISV_026637 [Vitis vinifera]
Length = 767
Score = 569 bits (1466), Expect = e-159, Method: Compositional matrix adjust.
Identities = 320/816 (39%), Positives = 450/816 (55%), Gaps = 134/816 (16%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD R++I++G R +L S IHYPR+TPE
Sbjct: 32 VTYDGRSLIVNGRRELLFSGSIHYPRSTPE------------------------------ 61
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
+NF+G D+VKF+KL+G GLY LRIGP++ AEWN GGFP WLR++P I FR+ N P
Sbjct: 62 --FNFEGNYDLVKFIKLIGDYGLYATLRIGPFIEAEWNHGGFPYWLREVPDIIFRSYNEP 119
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK M+++ + I+++M+E LF+ QGGPII+ QIENEY +++ +Y + G YV+WA MA
Sbjct: 120 FKYHMEKYSRMIIEMMKEAKLFAPQGGPIILAQIENEYNSIQLAYKELGVQYVQWAGKMA 179
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGGR 284
+GLGAGVPW+MCKQ DAP+ +I+ CNG +C D + PN NKP+LWTENW Y +G
Sbjct: 180 VGLGAGVPWIMCKQKDAPDPVINTCNGRHCGDTFTGPNRPNKPSLWTENWTAQYRVFGDP 239
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
R EDLAF+VARF + G+ NYYMY GGTNFGRT G F T Y +AP+DEYGL
Sbjct: 240 PSQRAAEDLAFSVARFISKNGTLANYYMYHGGTNFGRT-GSSFVTTRYYDEAPLDEYGLQ 298
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
EPKWGHLKDLH+A++LC+ AL S KLG+++E Y + G+ C+AFL N
Sbjct: 299 REPKWGHLKDLHSALRLCKKALFTG-SPGVEKLGKDKEVRFYE--KPGTHI-CAAFLTNN 354
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
AA++TF G+ Y LPP S+SILPDC+ V+NT +V +Q + +
Sbjct: 355 HSREAATLTFRGEEYFLPPHSISILPDCKTVVYNTQRVVAQHNAR--------------- 399
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
+ ++SK+++ + W +EPI V ++ + +E KD SDY W +T I +S+
Sbjct: 400 --NFVKSKIANKNLKWEMSQEPIPVMTDMKILTKSPMELYXFLKDRSDYAWFVTSIELSN 457
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYNDL 580
D+ K ++ P + I ++ + F+NG GS G V+ +PV+FQ G N L
Sbjct: 458 YDLPMKK--DIIPVLQISNLGHAMLAFVNGNFIGSAHGSNVEKNFVFRKPVKFQ-GRNKL 514
Query: 581 ----ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQ 636
+ S T G+ + V++ G G +D++ W QVG+ GE +
Sbjct: 515 HCPAVYDSGTTGIHS---------------VQILGLNTGTLDITNNGWGQQVGVNGEHVK 559
Query: 637 IYSI-EENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
Y+ + +WT G TWYKTYFD P+G DPV L + SM KG +H+
Sbjct: 560 AYTQGGSHRVQWTAAKGKG--PAMTWYKTYFDMPEGNDPVILRMTSMAKGNGL--EYHV- 614
Query: 696 RYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
PR+WL+ S+NLLVIFEETG
Sbjct: 615 -----------------------------------------PRAWLKPSDNLLVIFEETG 633
Query: 756 GNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLS--INKMAPEMHLHCQDGY 813
GNP EI +L + +C V+E H P V+ W D K+ ++++ P+ HL C +
Sbjct: 634 GNPEEIEXELVNRDTICSIVTEYHPPHVKSWQRH---DSKIRAVVDEVKPKGHLKCPNYK 690
Query: 814 IISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+I ++FAS+G P G C F GNC AP S VV +
Sbjct: 691 VIVKVDFASFGNPLGACGDFEMGNCTAPNSKKVVEQ 726
>gi|57283676|emb|CAG30724.1| putative beta-galactosidase precursor [Hordeum vulgare]
Length = 833
Score = 568 bits (1464), Expect = e-159, Method: Compositional matrix adjust.
Identities = 315/813 (38%), Positives = 453/813 (55%), Gaps = 68/813 (8%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYD R+++IDG R + S IHYPR+ P+MW L+ +K+GG + IETYVFWNAHE
Sbjct: 35 VSYDERSLLIDGKRDLFFSGAIHYPRSPPDMWHKLLKTAKDGGLNTIETYVFWNAHEPEP 94
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G+YNF+G+ND++KF+KL+ S +Y +RIGP++ AEWN GG P WLR+IP I FR NN P
Sbjct: 95 GKYNFEGRNDLIKFLKLIQSHDMYALVRIGPFIQAEWNHGGLPYWLREIPHIIFRANNEP 154
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
+K+EM++FV+ IV +++ +F+ QGGP+I+ QIENEYGN++ + +G Y++WAA MA
Sbjct: 155 YKKEMEKFVRFIVQKLKDAEMFASQGGPVILAQIENEYGNIKKDHIVEGDKYLEWAAQMA 214
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
+ GVPW+MCKQ+ AP +I CNG +C D + NKP LWTENW + +G +L
Sbjct: 215 ISTNTGVPWIMCKQSTAPGEVIPTCNGRHCGDTWTLKDKNKPRLWTENWTAQFRAFGDQL 274
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYM-YFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
R ED+A++V RFF +GG+ +NYYM Y+GGTNFGRT G + +T Y + P+DE +
Sbjct: 275 ALRSAEDIAYSVLRFFAKGGTLVNYYMQYYGGTNFGRT-GASYVLTGYYDEGPVDEC-MP 332
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
PK+GHL+DLH IK A + + + L EAH + + C AF++N
Sbjct: 333 KAPKYGHLRDLHNLIKSYSRAFLEGKQS-FELLAHGYEAHNFEIP---EEKLCLAFISNN 388
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
+ +V F G Y +P SVSIL DC++ V+NT +V Q S
Sbjct: 389 NTGEDGTVNFRGDKYYIPSRSVSILADCKHVVYNTKRVFVQHS----------------- 431
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
++S ++ + S +W EPI + + + +E N+TKD SDYL +
Sbjct: 432 ERSFHTAQKLAKSNAWEMYSEPIPRYKLTSIRNKEPMEQYNLTKDDSDYLC----FRLEA 487
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYNDL 580
DD+ F ++RP V + S L F+N G+ G + P+ + G N L
Sbjct: 488 DDLPF--RGDIRPVVQVKSTSHALMGFVNDAFAGNGRGSKKEKGFMFETPINLRIGINHL 545
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS ++G+++ G L + G + + G G +DL W ++V L+GE ++IY+
Sbjct: 546 ALLSSSMGMKDSGGELVEVKGGIQ-DCTIQGLNTGTLDLQVNGWGHKVKLEGEVKEIYTE 604
Query: 641 EENEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
+ A +W T TWYK YFD PDG DPV LD+ SMGKG +VNG +GRYW
Sbjct: 605 KGMGAVKWVPATTG---RAVTWYKRYFDEPDGEDPVVLDMTSMGKGMIFVNGEGMGRYWP 661
Query: 700 VVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPF 759
YR T G P+Q YH+PR +L+ NNLLVIFEE G P
Sbjct: 662 ------------SYR---------TVGGVPSQAMYHIPRPFLKPKNNLLVIFEEELGKPE 700
Query: 760 EISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDG---KLSINKMAPEMHLHCQDGYIIS 816
I ++ +C +SE + ++ W DG KL + L C I
Sbjct: 701 GILIQTVRRDDICVFISEHNPAQIKTWDK----DGGQIKLIAEDHSTRGILKCPPKKTIQ 756
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+ FAS+G P+G C F+ G CH P + +V++
Sbjct: 757 EVVFASFGNPEGSCANFTAGTCHTPNAKDIVAK 789
>gi|449451942|ref|XP_004143719.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 613
Score = 566 bits (1458), Expect = e-158, Method: Compositional matrix adjust.
Identities = 297/634 (46%), Positives = 392/634 (61%), Gaps = 42/634 (6%)
Query: 77 MWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIG 136
MWPDLI K+K+GG D IETY+FW+ HE R +Y+F G+ D +KF +L+ +GLY+ +RIG
Sbjct: 1 MWPDLIQKAKDGGLDAIETYIFWDRHEPQRRKYDFSGRLDFIKFFQLIQDAGLYVVMRIG 60
Query: 137 PYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPII 196
PYVCAEWN+GGFPVWL ++PGI+ RTNN +K EMQ F KIV++ ++ LF+ QGGPII
Sbjct: 61 PYVCAEWNYGGFPVWLHNMPGIQLRTNNQVYKNEMQTFTTKIVNMCKQANLFASQGGPII 120
Query: 197 MLQIENEYGN-MESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYY 255
+ QIENEYGN M +YG GK Y+ W A MA L GVPW+MC+Q+DAP+ +I+ CNG+Y
Sbjct: 121 LAQIENEYGNVMTPAYGDAGKAYINWCAQMAESLNIGVPWIMCQQSDAPQPMINTCNGFY 180
Query: 256 CDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFG 315
CD + PN+ P ++TENW GW+ WG + P+R ED+AF+VARFFQ GG F NYYMY G
Sbjct: 181 CDNFTPNNPKSPKMFTENWVGWFKKWGDKDPYRTAEDVAFSVARFFQSGGVFNNYYMYHG 240
Query: 316 GTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYI 375
GTNFGRTSGGPF TSYDY+AP+DEYG L++PKWGHLK LHA+IKL E L + +
Sbjct: 241 GTNFGRTSGGPFITTSYDYNAPLDEYGNLNQPKWGHLKQLHASIKLGEKILTNSTRSN-- 298
Query: 376 KLGQNQEAHVYR---ANRYGSQSNCSAFLANIDEHTAASVTFLGQ-SYTLPPWSVSILPD 431
QN + V +N + C FL+N D A++ Y +P WSVSIL
Sbjct: 299 ---QNFGSSVTLTKFSNPTTGERFC--FLSNTDGKNDATIDLQEDGKYFVPAWSVSILDG 353
Query: 432 CRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWS 491
C V+NTAKV+SQTS+ E +++ + SW EP+
Sbjct: 354 CNKEVYNTAKVNSQTSMFVKE-----------------QNEKENAQLSWAWAPEPMKDTL 396
Query: 492 ENN--FTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLR 549
+ N F +LE VT D+SDY W++T++ + ++ T+ +++ VL
Sbjct: 397 QGNGKFAANLLLEQKRVTVDFSDYFWYMTKVDTNG------TSSLQNVTLQVNTKGHVLH 450
Query: 550 VFINGQLTGS---VIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFR-G 605
F+N + GS G +P+ +SG N + LLS TVGL+NY AF + G G
Sbjct: 451 AFVNKRYIGSKWGSNGQSFVFEKPILLKSGINTITLLSATVGLKNYDAFYDMVPTGIDGG 510
Query: 606 QVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS-IEENEAEWTDLTRDGIPSTFTWYKT 664
+ L G N DLS LW+Y+VGL GE +QIY+ + W L + I TWYKT
Sbjct: 511 PIYLIGDGNVTTDLSSNLWSYKVGLNGEMKQIYNPVFSQRTNWIPLNQKSIGRRMTWYKT 570
Query: 665 YFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
F P GIDPV LD+ MGKGQAWVNG IGR+W
Sbjct: 571 SFKTPAGIDPVVLDMQGMGKGQAWVNGQSIGRFW 604
>gi|326496501|dbj|BAJ94712.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 672
Score = 565 bits (1456), Expect = e-158, Method: Compositional matrix adjust.
Identities = 306/665 (46%), Positives = 411/665 (61%), Gaps = 46/665 (6%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD RA++++G RRML S +HY R+TPEMWP LIA +K+GG DVI+TYVFWN HE ++
Sbjct: 40 VTYDGRALVVNGTRRMLFSGEMHYTRSTPEMWPKLIANAKKGGLDVIQTYVFWNVHEPVQ 99
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQYNF+G+ D+VKF++ + + GLY+ LRIGP++ AEW +GGFP WL D+P I FRT+N P
Sbjct: 100 GQYNFQGRYDLVKFIREIQTQGLYVSLRIGPFIEAEWKYGGFPFWLHDVPNITFRTDNEP 159
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK+ MQRFV +IV++M+ E L+ QGGPII+ QIENEY +E ++G G YV+WAA MA
Sbjct: 160 FKQHMQRFVTQIVNMMKHEGLYYPQGGPIIISQIENEYQMVEPAFGSGGPRYVRWAAEMA 219
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDG--YKPNSYNKPTLWTENWDGWYTTWGGR 284
+GL GVPW+MCKQ DAP+ II+ CNG C PNS KP LWTENW Y +G
Sbjct: 220 VGLQTGVPWMMCKQNDAPDPIINTCNGLICGETFVGPNSPTKPALWTENWTTRYPIYGND 279
Query: 285 LPHRPVEDLAFAVARFFQR-GGSFMNYYMYFGGTNFGRTSGGPFYITSYDYD-APIDEYG 342
R ED+AFAVA F R GSF++YYMY GGTNFGR + Y+T+ YD AP+DEYG
Sbjct: 280 TKLRSTEDIAFAVALFIARKKGSFVSYYMYHGGTNFGRFASS--YVTTSYYDGAPLDEYG 337
Query: 343 LLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLA 402
L+ P WGHL++LHAA+KL AL+ + + LG QEAH++ ++ C AFL
Sbjct: 338 LIWRPTWGHLRELHAAVKLSSEALLFGRYSNF-SLGPEQEAHIFE-----TELKCVAFLV 391
Query: 403 NIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNIS 462
N D+H +V F + L P S+S+L +CR VF TA+V++Q +T E
Sbjct: 392 NFDKHQTPTVVFRNIYFQLAPKSISVLSECRTVVFETARVNAQYGSRTAE---------- 441
Query: 463 VPQQSMIESKLSSTSKSWMTVKEPIGV-WSENNFTVQGILEHLNVTKDYSDYLWHITQI- 520
++ES + +W KEPI S+ +T + EHL++TKD +DYLW+I
Sbjct: 442 -----VVESL--NDIHTWKAFKEPIPEDISKAVYTGNQLFEHLSMTKDETDYLWYIVSYE 494
Query: 521 YVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW-----VKVVQPVEFQS 575
Y+ DD N ++S VL F+N + GSV G + + +
Sbjct: 495 YIPSDDGQLVLLN-------VESRAHVLHAFVNTEYAGSVHGSHDGPGNIILNTNISLNE 547
Query: 576 GYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQ 635
G N + LLS VG + GA +E+ G +V + + L+ LW YQVGL GE
Sbjct: 548 GQNTISLLSVMVGSPDSGAHMERRSFGIH-KVSIQQGQQPLHLLNNELWAYQVGLYGEAN 606
Query: 636 QIYSIEE-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHI 694
+IY+ EE + AEWT++ + FTWYKT F P G D VAL+L SMGKG+ WVNG +
Sbjct: 607 RIYTQEESSSAEWTEIN-NLTYHPFTWYKTTFATPVGNDVVALNLTSMGKGEVWVNGESL 665
Query: 695 GRYWT 699
GRYW
Sbjct: 666 GRYWV 670
>gi|242045426|ref|XP_002460584.1| hypothetical protein SORBIDRAFT_02g031260 [Sorghum bicolor]
gi|241923961|gb|EER97105.1| hypothetical protein SORBIDRAFT_02g031260 [Sorghum bicolor]
Length = 803
Score = 564 bits (1454), Expect = e-158, Method: Compositional matrix adjust.
Identities = 305/808 (37%), Positives = 447/808 (55%), Gaps = 88/808 (10%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD R+++IDG R + S IHYPR+ PE+WP L+ ++KEGG + IETY+FWNAHE
Sbjct: 36 VTYDARSLLIDGKRDLFFSGAIHYPRSPPEVWPKLLDRAKEGGLNTIETYIFWNAHEPEP 95
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G+YNF+G+ D+VKF+K++ G+Y +RIGP++ AEWN GG P WLR+I I FR NN P
Sbjct: 96 GKYNFEGRLDLVKFLKMIQEHGMYAIVRIGPFIQAEWNHGGLPYWLREIDHIIFRANNDP 155
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
+K+EM+++ + +V +++ LF+ QGGP+I+ QIENEYGN++ + +G Y++WAA MA
Sbjct: 156 YKKEMEKWTRFVVQKLKDAELFASQGGPVILTQIENEYGNIKKDHKIEGDKYLEWAAQMA 215
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
L GVPW+MCKQ+ AP +I CNG +C D + NKP LWTENW + +G +L
Sbjct: 216 LSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWTLRDKNKPMLWTENWTQQFRAYGDQL 275
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
R ED+A+AV RFF +GGS +NYYMY GGTNFGRTS + +T Y +AP+DEYG+
Sbjct: 276 AMRSAEDIAYAVLRFFAKGGSMVNYYMYHGGTNFGRTSAS-YVLTGYYDEAPLDEYGMYK 334
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPK+GHL+DLH I+ + A ++ + I LG EA ++ ++ C +FL+N +
Sbjct: 335 EPKFGHLRDLHNVIRSYQKAFLSGKHSSEI-LGHGYEAQIFELPE---ENLCLSFLSNNN 390
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+V F G + +P SVSIL C++ V+NT +V Q S +
Sbjct: 391 TGEDGTVIFRGVKHYVPSRSVSILAGCKDVVYNTKRVFVQHS-----------------E 433
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
+S S+++S + W E + + + + LE N TKD SDYLW+ T + D
Sbjct: 434 RSYHTSEVTSKNNQWEMYSEMVPKYKDTKIRTKEPLEQYNQTKDASDYLWYTTSFRLESD 493
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGH-WVK---VVQPVEFQSGYNDLI 581
D+ F ++RP + + S + F N GS G+ VK +PV+ ++G N ++
Sbjct: 494 DLPF--RGDIRPVLQVKSSAHSMIGFANDAFVGSARGNKQVKGFMFEKPVDLKAGVNHVV 551
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
LLS T+G+++ G L + G + + + G G +DL W
Sbjct: 552 LLSSTMGMKDSGGELAEVKGGIQ-ECLIQGLNTGTLDLQVNGWG---------------- 594
Query: 642 ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVV 701
+K YFD PDG DP+ LD+ SM KG +VNG IGRYW
Sbjct: 595 --------------------HKRYFDEPDGDDPIVLDMSSMSKGMIFVNGEGIGRYWV-- 632
Query: 702 APKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEI 761
+R T G P+Q YH+PR +L+ +NLLV+FEE G P I
Sbjct: 633 ----------SFR---------TLAGTPSQAVYHIPRPFLKPKDNLLVVFEEEMGKPDGI 673
Query: 762 SVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFA 821
V+ + +C +SE + ++ W ++ V KL + L C II + FA
Sbjct: 674 LVQTVTRDDICLLISEHNPGQIKTW-DTDGVKIKLIAEDHSVRGTLMCPPEKIIQEVVFA 732
Query: 822 SYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
S+G P G C F+ G CH P + +V +
Sbjct: 733 SFGNPDGMCGNFTVGTCHTPNAKQIVEK 760
>gi|22329242|ref|NP_195571.2| beta-galactosidase 14 [Arabidopsis thaliana]
gi|332661551|gb|AEE86951.1| beta-galactosidase 14 [Arabidopsis thaliana]
Length = 988
Score = 563 bits (1451), Expect = e-157, Method: Compositional matrix adjust.
Identities = 312/792 (39%), Positives = 463/792 (58%), Gaps = 79/792 (9%)
Query: 77 MWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIG 136
MWP +I K++ GG + I+TYVFWN HE +G+Y+FKG+ D+VKF+KL+ GLY+ LR+G
Sbjct: 1 MWPSIIDKARIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHEKGLYVTLRLG 60
Query: 137 PYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPII 196
P++ AEWN GG P WLR++P + FRTNN PFKE +R+V+KI+ +M+EE LF+ QGGPII
Sbjct: 61 PFIQAEWNHGGLPYWLREVPDVYFRTNNEPFKEHTERYVRKILGMMKEEKLFASQGGPII 120
Query: 197 MLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYC 256
+ QIENEY ++ +Y + G+ Y+KWAA++ + G+PWVMCKQ DAP N+I+ACNG +C
Sbjct: 121 LGQIENEYNAVQLAYKENGEKYIKWAANLVESMNLGIPWVMCKQNDAPGNLINACNGRHC 180
Query: 257 -DGYK-PNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYF 314
D + PN ++KP+LWTENW + +G R VED+AF+VAR+F + GS +NYYMY
Sbjct: 181 GDTFPGPNRHDKPSLWTENWTTQFRVFGDPPTQRTVEDIAFSVARYFSKNGSHVNYYMYH 240
Query: 315 GGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQY 374
GGTNFGRTS F T Y DAP+DE+GL PK+GHLK +H A++LC+ AL +
Sbjct: 241 GGTNFGRTSAH-FVTTRYYDDAPLDEFGLEKAPKYGHLKHVHRALRLCKKALFWG-QLRA 298
Query: 375 IKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRN 434
LG + E Y + G++ C+AFL+N + ++ F GQ Y LP S+SILPDC+
Sbjct: 299 QTLGPDTEVRYYE--QPGTKV-CAAFLSNNNTRDTNTIKFKGQDYVLPSRSISILPDCKT 355
Query: 435 TVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENN 494
V+NTA++ +Q S + ++S+ +S + E I + +
Sbjct: 356 VVYNTAQIVAQHSWR-----------------DFVKSEKTSKGLKFEMFSENIPSLLDGD 398
Query: 495 FTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFING 554
+ G L +L TKD +DY W+ T + + +DD F ++ + + S+ L V++NG
Sbjct: 399 SLIPGELYYL--TKDKTDYAWYTTSVKIDEDD--FPDQKGLKTILRVASLGHALIVYVNG 454
Query: 555 QLTGSVIG-HWVK---VVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLT 610
+ G G H +K +PV F++G N + +L GL + G+++E AG R + +
Sbjct: 455 EYAGKAHGRHEMKSFEFAKPVNFKTGDNRISILGVLTGLPDSGSYMEHRFAGPRA-ISII 513
Query: 611 GFKNGDIDLSK-ILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDA 668
G K+G DL++ W + GL+GE +++Y+ E + +W +DG TWYKTYF+
Sbjct: 514 GLKSGTRDLTENNEWGHLAGLEGEKKEVYTEEGSKKVKW---EKDGKRKPLTWYKTYFET 570
Query: 669 PDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCG 727
P+G++ VA+ + +MGKG WVNG +GRYW + ++P G
Sbjct: 571 PEGVNAVAIRMKAMGKGLIWVNGIGVGRYWMSFLSP----------------------LG 608
Query: 728 NPTQTWYHVPRSWLQAS--NNLLVIFEETGGNPFE-ISVKLRSTRIVCEQVSESHYPPVR 784
PTQT YH+PRS+++ N+LVI EE G E I L + +C V E + V+
Sbjct: 609 EPTQTEYHIPRSFMKGEKKKNMLVILEEEPGVKLESIDFVLVNRDTICSNVGEDYPVSVK 668
Query: 785 KWSN------SYSVDGKL-SINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGN 837
W S S D +L ++ + PE + ++FAS+G P G C F+ G
Sbjct: 669 SWKREGPKIVSRSKDMRLKAVMRCPPEKQM--------VEVQFASFGDPTGTCGNFTMGK 720
Query: 838 CHAPMSLSVVSE 849
C A S VV +
Sbjct: 721 CSASKSKEVVEK 732
>gi|147843186|emb|CAN82672.1| hypothetical protein VITISV_014349 [Vitis vinifera]
Length = 710
Score = 560 bits (1443), Expect = e-156, Method: Compositional matrix adjust.
Identities = 310/729 (42%), Positives = 424/729 (58%), Gaps = 90/729 (12%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD R++IIDG+R++L S IHYPR+TP+MW LIAK+KEGG DVI+TYVFWN HE
Sbjct: 26 VTYDGRSLIIDGHRKILFSGSIHYPRSTPQMWASLIAKAKEGGVDVIQTYVFWNRHEPQP 85
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY+F G+ D+ KF+K + + GLY LRIGP++ +EW++GG P WL D+ GI +RT+N P
Sbjct: 86 GQYDFNGRYDLXKFIKEIQAQGLYACLRIGPFIESEWSYGGLPFWLHDVHGIVYRTDNEP 145
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ F KIV+LM+ E L++ QGGPII+ QIENEY N+E+++ ++G YV+WAA MA
Sbjct: 146 FKFYMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYQNIEAAFNEKGPSYVRWAAKMA 205
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGY--KPNSYNKPTLWTENWDGWYTTWGGR 284
+ L GVPWVMCKQ+DAP+ +I+ CNG C PNS NKP++WTENW +Y +GG
Sbjct: 206 VELQTGVPWVMCKQSDAPDPVINTCNGMRCGQTFTGPNSPNKPSMWTENWTSFYEVFGGE 265
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
R ED+AF VA F R GS++NYYM L+
Sbjct: 266 TYLRSAEDIAFHVALFIARNGSYVNYYMV----------------------------SLI 297
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
+PKWGHLK+LHAAI LC L+ + I LGQ QEA+V++ G C AFL N
Sbjct: 298 RQPKWGHLKELHAAITLCSTPLLNGVQSN-ISLGQLQEAYVFQEEMGG----CVAFLVNN 352
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
DE ++V F S L P S+SILPDC+N +FNTAK+++ + + I+
Sbjct: 353 DEGNNSTVLFQNVSIELLPKSISILPDCKNVIFNTAKINTGYNER-----------ITTS 401
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
QS W K+ I + + + ILEH+N+TKD SDYLW+ + +
Sbjct: 402 SQSF------DAVDRWEEYKDAIPNFLDTSLKSNMILEHMNMTKDESDYLWYTFRFQPN- 454
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG-HWVK---VVQPVEFQSGYNDL 580
++ P + I+S+ + F+N G+ G H +K P+ + N++
Sbjct: 455 -------SSCTEPLLHIESLAHAVHAFVNNIYVGATHGSHDMKGFTFKSPISLNNEMNNI 507
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
+LS VG + GA+LE AG +V++ + G D + W YQVGL GE IY
Sbjct: 508 SILSVMVGFPDSGAYLESRFAGLT-RVEIQCTEKGIYDFANYTWGYQVGLSGEKLHIYK- 565
Query: 641 EEN--EAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
EEN EW T TWYK F+ P G DPVAL+L +MGKG+AWVNG IGRYW
Sbjct: 566 EENLSNVEWRK-TEISTNQPLTWYKIVFNTPSGDDPVALNLSTMGKGEAWVNGQSIGRYW 624
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
++++ K G+P+QT YHVPR++L+ S NLLV+ EE G+P
Sbjct: 625 V----------------SFHNSK-----GDPSQTLYHVPRAFLKTSENLLVLLEEANGDP 663
Query: 759 FEISVKLRS 767
IS++ S
Sbjct: 664 LHISLETIS 672
>gi|222618606|gb|EEE54738.1| hypothetical protein OsJ_02090 [Oryza sativa Japonica Group]
Length = 713
Score = 556 bits (1433), Expect = e-155, Method: Compositional matrix adjust.
Identities = 301/693 (43%), Positives = 399/693 (57%), Gaps = 76/693 (10%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYD R+++IDG RR+++S IHYPR+TPEMWPDLI K+KEGG D IETY+FWN HE
Sbjct: 30 SVSYDDRSLVIDGQRRIILSGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYIFWNGHEPH 89
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
R QYNF+G D+V+F K + ++G+Y LRIGPY+C EWN+GG P WLRDIPG++FR +N
Sbjct: 90 RRQYNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNE 149
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVKWAA 223
PF+ EM+ F IV+ M++ +F+ QGGPII+ QIENEYGN+ + + Q +Y+ W A
Sbjct: 150 PFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIHWCA 209
Query: 224 SMALGLGAGVPWVMCKQ-TDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWG 282
MA GVPW+MC+Q D P N+++ CNG+YC + PN P +WTENW GW+ W
Sbjct: 210 DMANKQNVGVPWIMCQQDDDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWD 269
Query: 283 GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYG 342
HR ED+AFAVA FFQ+ GS NYYMY GGTNFGRTSGGP+ TSYDYDAP+DEYG
Sbjct: 270 KPDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYG 329
Query: 343 LLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLA 402
L +PK+GHLK+LH+ +K E LV +Y V +Y S+ + F+
Sbjct: 330 NLRQPKYGHLKELHSVLKSMEKTLV---HGEYFDTNYGDNITV---TKYTLDSSSACFIN 383
Query: 403 NIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNIS 462
N + +VT G ++ LP WSVSILPDC+ FN+AK+ +QTS+
Sbjct: 384 NRFDDKDVNVTLDGATHLLPAWSVSILPDCKTVAFNSAKIKTQTSVM------------- 430
Query: 463 VPQQSMIESKLSSTSKSWMTVK-EPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIY 521
V + + E + S SWM P + NF +LE + + D SDYLW+ T
Sbjct: 431 VKKPNTAEQEQESLKWSWMPENLSPFMTDEKGNFRKNELLEQIVTSTDQSDYLWYRT--- 487
Query: 522 VSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG---SVIGHWV-KVVQPVEFQSGY 577
S E + +++ L F+NG+L G S G +V ++ PV+ G
Sbjct: 488 ------SLNHKGEGSYKLYVNTTGHELYAFVNGKLIGKNHSADGDFVFQLESPVKLHDGK 541
Query: 578 NDLILLSQTVGLQNYGAFLEKDGAGF-RGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQ 636
N + LLS TVGL+NYG EK G G VKL IDLS W+
Sbjct: 542 NYISLLSATVGLKNYGPSFEKMPTGIVGGPVKLIDSNGTAIDLSNSSWS----------- 590
Query: 637 IYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGR 696
YK F+AP G DPV +DL + KG AWVNG+++GR
Sbjct: 591 -------------------------YKATFEAPSGEDPVVVDLLGLNKGVAWVNGNNLGR 625
Query: 697 YWT--VVAPKGGCQDTCDYRGAYNSDKCTTNCG 727
YW A GC CDYRGA+ ++ T+ G
Sbjct: 626 YWPSYTAAEMAGCH-RCDYRGAFQAEGDGTSFG 657
>gi|222424922|dbj|BAH20412.1| AT3G13750 [Arabidopsis thaliana]
Length = 625
Score = 554 bits (1428), Expect = e-155, Method: Compositional matrix adjust.
Identities = 286/611 (46%), Positives = 378/611 (61%), Gaps = 37/611 (6%)
Query: 236 VMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAF 295
V+CKQ DAP+ II+ACNG+YCD + PN KP +WTE W GW+T +GG +P+RP ED+AF
Sbjct: 1 VLCKQDDAPDPIINACNGFYCDYFSPNKAYKPKMWTEAWTGWFTKFGGPVPYRPAEDMAF 60
Query: 296 AVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+VARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEYGL +PKWGHLKDL
Sbjct: 61 SVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLERQPKWGHLKDL 120
Query: 356 HAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFL 415
H AIKLCEPALV+ + + + LG QEAHVY++ CSAFLAN + + A V+F
Sbjct: 121 HRAIKLCEPALVSGEPTR-MPLGNYQEAHVYKSK----SGACSAFLANYNPKSYAKVSFG 175
Query: 416 GQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSS 475
Y LPPWS+SILPDC+NTV+NTA+V +QTS + + VP +
Sbjct: 176 NNHYNLPPWSISILPDCKNTVYNTARVGAQTSRMKM---------VRVPVHGGL------ 220
Query: 476 TSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEV 535
SW E + + +FT+ G++E +N T+D SDYLW++T + V D + F + ++
Sbjct: 221 ---SWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWYMTDVKV-DANEGFLRNGDL 276
Query: 536 RPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQN 591
PT+T+ S + VFINGQL+GS G + + V ++G+N + +LS VGL N
Sbjct: 277 -PTLTVLSAGHAMHVFINGQLSGSAYGSLDSPKLTFRKGVNLRAGFNKIAILSIAVGLPN 335
Query: 592 YGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE-FQQIYSIEENEAEWTDL 650
G E AG G V L G G DLS WTY+VGLKGE + EW +
Sbjct: 336 VGPHFETWNAGVLGPVSLNGLNGGRRDLSWQKWTYKVGLKGESLSLHSLSGSSSVEWAEG 395
Query: 651 TRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDT 710
TWYKT F AP G P+A+D+GSMGKGQ W+NG +GR+W G C +
Sbjct: 396 AFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQSLGRHWPAYKAVGSCSE- 454
Query: 711 CDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRI 770
C Y G + DKC NCG +Q WYHVPRSWL+ S NLLV+FEE GG+P I++ R
Sbjct: 455 CSYTGTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVFEEWGGDPNGITLVRREVDS 514
Query: 771 VCEQVSESHYPPVRKWSNSYSVDGKLSINK-MAPEMHLHCQDGYIISSIEFASYGTPQGR 829
VC + E V +Y + +NK + P+ HL C G I++++FAS+GTP+G
Sbjct: 515 VCADIYEWQSTLV-----NYQLHASGKVNKPLHPKAHLQCGPGQKITTVKFASFGTPEGT 569
Query: 830 CQKFSRGNCHA 840
C + +G+CHA
Sbjct: 570 CGSYRQGSCHA 580
>gi|238481152|ref|NP_001154292.1| beta-galactosidase 14 [Arabidopsis thaliana]
gi|332661552|gb|AEE86952.1| beta-galactosidase 14 [Arabidopsis thaliana]
Length = 1052
Score = 551 bits (1419), Expect = e-154, Method: Compositional matrix adjust.
Identities = 310/792 (39%), Positives = 460/792 (58%), Gaps = 83/792 (10%)
Query: 77 MWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIG 136
MWP +I K++ GG + I+TYVFWN HE +G+Y+FKG+ D+VKF+KL+ GLY+ LR+G
Sbjct: 69 MWPSIIDKARIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHEKGLYVTLRLG 128
Query: 137 PYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPII 196
P++ AEWN GG P WLR++P + FRTNN PFKE +R+V+KI+ +M+EE LF+ QGGPII
Sbjct: 129 PFIQAEWNHGGLPYWLREVPDVYFRTNNEPFKEHTERYVRKILGMMKEEKLFASQGGPII 188
Query: 197 MLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYC 256
+ QIENEY ++ +Y + G+ Y+KWAA++ + G+PWVMCKQ DAP N+I+ACNG +C
Sbjct: 189 LGQIENEYNAVQLAYKENGEKYIKWAANLVESMNLGIPWVMCKQNDAPGNLINACNGRHC 248
Query: 257 -DGYK-PNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYF 314
D + PN ++KP+LWTENW + +G R VED+AF+VAR+F + GS +NYYMY
Sbjct: 249 GDTFPGPNRHDKPSLWTENWTTQFRVFGDPPTQRTVEDIAFSVARYFSKNGSHVNYYMYH 308
Query: 315 GGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQY 374
GGTNFGRTS F T Y DAP+DE+GL PK+GHLK +H A++LC+ AL +
Sbjct: 309 GGTNFGRTSAH-FVTTRYYDDAPLDEFGLEKAPKYGHLKHVHRALRLCKKALFWG-QLRA 366
Query: 375 IKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRN 434
LG + E Y + G++ C+AFL+N + ++ F GQ Y LP S+SILPDC+
Sbjct: 367 QTLGPDTEVRYYE--QPGTKV-CAAFLSNNNTRDTNTIKFKGQDYVLPSRSISILPDCKT 423
Query: 435 TVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENN 494
V+NTA++ +Q S + ++S+ +S + E I + +
Sbjct: 424 VVYNTAQIVAQHSWR-----------------DFVKSEKTSKGLKFEMFSENIPSLLDGD 466
Query: 495 FTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFING 554
+ G L +L TKD +DY + + +DD F ++ + + S+ L V++NG
Sbjct: 467 SLIPGELYYL--TKDKTDY----ACVKIDEDD--FPDQKGLKTILRVASLGHALIVYVNG 518
Query: 555 QLTGSVIG-HWVK---VVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLT 610
+ G G H +K +PV F++G N + +L GL + G+++E AG R + +
Sbjct: 519 EYAGKAHGRHEMKSFEFAKPVNFKTGDNRISILGVLTGLPDSGSYMEHRFAGPRA-ISII 577
Query: 611 GFKNGDIDLSK-ILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDA 668
G K+G DL++ W + GL+GE +++Y+ E + +W +DG TWYKTYF+
Sbjct: 578 GLKSGTRDLTENNEWGHLAGLEGEKKEVYTEEGSKKVKW---EKDGKRKPLTWYKTYFET 634
Query: 669 PDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCG 727
P+G++ VA+ + +MGKG WVNG +GRYW + ++P G
Sbjct: 635 PEGVNAVAIRMKAMGKGLIWVNGIGVGRYWMSFLSP----------------------LG 672
Query: 728 NPTQTWYHVPRSWLQA--SNNLLVIFEETGGNPFE-ISVKLRSTRIVCEQVSESHYPPVR 784
PTQT YH+PRS+++ N+LVI EE G E I L + +C V E + V+
Sbjct: 673 EPTQTEYHIPRSFMKGEKKKNMLVILEEEPGVKLESIDFVLVNRDTICSNVGEDYPVSVK 732
Query: 785 KWSN------SYSVDGKL-SINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGN 837
W S S D +L ++ + PE + ++FAS+G P G C F+ G
Sbjct: 733 SWKREGPKIVSRSKDMRLKAVMRCPPEKQM--------VEVQFASFGDPTGTCGNFTMGK 784
Query: 838 CHAPMSLSVVSE 849
C A S VV +
Sbjct: 785 CSASKSKEVVEK 796
>gi|449519864|ref|XP_004166954.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 3-like, partial
[Cucumis sativus]
Length = 635
Score = 545 bits (1403), Expect = e-152, Method: Compositional matrix adjust.
Identities = 291/624 (46%), Positives = 383/624 (61%), Gaps = 38/624 (6%)
Query: 233 VPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVED 292
VPWVMCKQ DAP+ +I+ CNG+YCD + PN KP WTE W W+ +GG RPVED
Sbjct: 3 VPWVMCKQDDAPDPMINTCNGFYCDYFSPNKPYKPNFWTEAWTAWFNNFGGPNHKRPVED 62
Query: 293 LAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHL 352
LAF VARF Q+GGS +NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGL+ +PK+GHL
Sbjct: 63 LAFGVARFIQKGGSLVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIRQPKFGHL 122
Query: 353 KDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASV 412
K LH A+KLCE AL+ + Y L Q+A V+ + S +C+AFL+N + A V
Sbjct: 123 KRLHDAVKLCEKALLTGEPHDYT-LATYQKAKVFSS----SSGDCAAFLSNYHSNNTARV 177
Query: 413 TFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESK 472
TF G+ YTLPPWS+SILPDC++ ++NTA+V QT+ Q S + +K
Sbjct: 178 TFNGRHYTLPPWSISILPDCKSVIYNTAQVQVQTN-----------------QLSFLPTK 220
Query: 473 LSSTSKSWMTVKEPIGVWSEN-NFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWK 531
+ S SW T E I E+ + + G+LE L +TKD SDYLW+ T + V D + S+ +
Sbjct: 221 VESF--SWETYNENISSIEEDSSMSYDGLLEQLTITKDNSDYLWYTTSVNV-DPNESYLR 277
Query: 532 TNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLILLSQTV 587
+ PT+T S + VFING+L GS G + Q+G N + LLS
Sbjct: 278 GGKF-PTLTATSKGHGMHVFINGKLAGSSFGTHDNSKFTFTGRINLQAGVNKVSLLSIAG 336
Query: 588 GLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEA-E 646
GL N G E+ G G V + G G +DLS+ W+Y+VGLKGE + S +A +
Sbjct: 337 GLPNNGPHYEEREMGVLGPVAIHGLDXGKMDLSRQKWSYKVGLKGENMNLGSPSSVQAVD 396
Query: 647 WT-DLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKG 705
W D + TWYK YFDAP+G +P+ALD+GSM KGQ W+NG ++GRYWT+ A G
Sbjct: 397 WAKDSLKQENAQPLTWYKAYFDAPEGDEPLALDMGSMQKGQVWINGQNVGRYWTITA-NG 455
Query: 706 GCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKL 765
C D C Y G Y KC CG PTQ WYHVPRSWL + NL+V+FEE GGNP IS+
Sbjct: 456 NCTD-CSYSGTYRPRKCQFGCGQPTQQWYHVPRSWLMPTKNLIVVFEEVGGNPSRISLVK 514
Query: 766 RSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGT 825
RS +C + S+ Y PV K + + +G+L+ + +++LHC G IS+I+FAS+GT
Sbjct: 515 RSVTSICTEASQ--YRPVIKNVHMHQNNGELNEQNVL-KINLHCAAGQFISAIKFASFGT 571
Query: 826 PQGRCQKFSRGNCHAPMSLSVVSE 849
P G C +G CH+P S V+ +
Sbjct: 572 PSGACGSHKQGTCHSPKSDYVLQK 595
>gi|414870185|tpg|DAA48742.1| TPA: hypothetical protein ZEAMMB73_126543 [Zea mays]
Length = 706
Score = 541 bits (1393), Expect = e-151, Method: Compositional matrix adjust.
Identities = 276/662 (41%), Positives = 400/662 (60%), Gaps = 40/662 (6%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYD R+++ DG+R + +S IHYPR+ P+MWP+LIAK+KEGG + IETYVFWN HE +
Sbjct: 43 VSYDRRSLMFDGHREIFLSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYVFWNIHEPEK 102
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G++NF+G+ND+V+F +L+ +Y +R+GP++ AEWN GG P WLR+IP I FRTNN P
Sbjct: 103 GEFNFEGQNDVVRFFQLIQEHDMYAMVRLGPFIQAEWNHGGLPYWLREIPDIVFRTNNEP 162
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
+K M+ FVK I+ +++ LF+ QGGPII+ QIENEY +ME+++ +G Y+ WAA MA
Sbjct: 163 YKMHMETFVKIIIKRLKDANLFASQGGPIILAQIENEYQHMEAAFKDEGTKYINWAAKMA 222
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNK--PTLWTENWDGWYTTWGGR 284
+ G+PW+MCKQT AP ++I CNG C P NK P LWTENW Y +G
Sbjct: 223 ISTNIGIPWIMCKQTKAPSDVIPTCNGRNCGDTWPGPTNKSMPLLWTENWTAQYRVFGDP 282
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
R ED+AFAVARFF GG+ NYYMY GGTNFGRTS F + Y +AP+DE+GL
Sbjct: 283 PSQRSAEDIAFAVARFFSVGGTLANYYMYHGGTNFGRTSAA-FVMPKYYDEAPLDEFGLY 341
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
EPKWGHL+DLH A+KLC+ AL+ + KLG+ EA V+ Q C AFL+N
Sbjct: 342 KEPKWGHLRDLHQALKLCKKALLWGTPSTE-KLGKQLEARVFEMP---EQKVCVAFLSNH 397
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
+ A++TF G+ Y +P S+S+L DC VF T V++Q + +T F
Sbjct: 398 NTKDDATMTFRGRPYFVPRHSISVLADCETVVFGTQHVNAQHNQRTFHF----------- 446
Query: 465 QQSMIESKLSSTSKSW-MTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVS 523
+ ++ + W M E + + + ++ + N+TKD +DY+W+ + +
Sbjct: 447 ------ADQTAQNNVWEMFDGENVPKYKQAKIRLRKAGDLYNLTKDKTDYVWYTSSFKLE 500
Query: 524 DDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK------VVQPVEFQSGY 577
DD+ ++++ + ++S F+N + G GH K + +P++ + G
Sbjct: 501 ADDMPI--RSDIKTVLEVNSHGHASVAFVNNKFVG--CGHGTKMNKAFTLEKPMDLKKGV 556
Query: 578 NDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI 637
N + +L+ ++G+ + GA++E AG +V++TG G +DL+ W + VGL GE +QI
Sbjct: 557 NHVAVLASSMGMTDSGAYMEHRLAGV-DRVQITGLNAGTLDLTNNGWGHIVGLVGERKQI 615
Query: 638 YSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGR 696
Y+ + W D TWYK +FD P G DPV LD+ +MGKG +VNG IGR
Sbjct: 616 YTDKGMGSVTWKPAMND---RPLTWYKRHFDMPSGEDPVVLDMSTMGKGMMFVNGQGIGR 672
Query: 697 YW 698
YW
Sbjct: 673 YW 674
>gi|351722837|ref|NP_001235722.1| lectin [Glycine max]
gi|217314871|gb|ACK36970.1| lectin [Glycine max]
Length = 447
Score = 535 bits (1378), Expect = e-149, Method: Compositional matrix adjust.
Identities = 256/385 (66%), Positives = 300/385 (77%), Gaps = 8/385 (2%)
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
QQ ++ SKSWMT KEP+ +WS+++FTV+GI EHLNVTKD SDYLW+ T++YVSD
Sbjct: 20 QQLRHQNDFYYISKSWMTTKEPLNIWSKSSFTVEGIWEHLNVTKDQSDYLWYSTRVYVSD 79
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLS 584
DI FW+ N+V P +TID +RD+LRVFINGQL V K V V G ND S
Sbjct: 80 SDILFWEENDVHPKLTIDGVRDILRVFINGQLI--VKDEQFKAVISVSI--GKNDCTAGS 135
Query: 585 QTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENE 644
+ NYGAFLEKDGAG RG++K+TGF+NGDIDLSK LWTYQVGL+GEF + YS E
Sbjct: 136 ----INNYGAFLEKDGAGIRGKIKITGFENGDIDLSKSLWTYQVGLQGEFLKFYSEENEN 191
Query: 645 AEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPK 704
+EW +LT D IPSTFTWYKTYFD P GIDPVALD SMGKGQAWVNG HIGRYWT V+PK
Sbjct: 192 SEWVELTPDAIPSTFTWYKTYFDVPGGIDPVALDFKSMGKGQAWVNGQHIGRYWTRVSPK 251
Query: 705 GGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVK 764
GCQ CDYRGAYNSDKC+TNCG PTQT YHVPRSWL+A+NNLLVI EETGGNPFEISVK
Sbjct: 252 SGCQQVCDYRGAYNSDKCSTNCGKPTQTLYHVPRSWLKATNNLLVILEETGGNPFEISVK 311
Query: 765 LRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYG 824
L S+RI+C QVSES+YPP++K N+ + ++S N M PE+HLHCQ G+ ISS+ FAS+G
Sbjct: 312 LHSSRIICAQVSESNYPPLQKLVNADLIGEEVSANNMIPELHLHCQQGHTISSVAFASFG 371
Query: 825 TPQGRCQKFSRGNCHAPMSLSVVSE 849
TP G CQ FSRGNCHAP S+S+VSE
Sbjct: 372 TPGGSCQNFSRGNCHAPSSMSIVSE 396
>gi|218202538|gb|EEC84965.1| hypothetical protein OsI_32205 [Oryza sativa Indica Group]
Length = 807
Score = 528 bits (1360), Expect = e-147, Method: Compositional matrix adjust.
Identities = 302/813 (37%), Positives = 433/813 (53%), Gaps = 95/813 (11%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYD R+++IDG R + S IHYPR+ PEMW L+ +K GG + IETYVFWN HE
Sbjct: 36 VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWNGHEPEP 95
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G+Y F+G+ D+++F+ ++ + +Y +RIGP++ AEWN GG P WLR+I I FR NN P
Sbjct: 96 GKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHIIFRANNEP 155
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK IENEYGN++ +G Y++WAA MA
Sbjct: 156 FK-------------------------------IENEYGNIKKDRKVEGDKYLEWAAEMA 184
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
+ G GVPWVMCKQ+ AP +I CNG +C D + NKP LWTENW + T+G +L
Sbjct: 185 ISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDKNKPRLWTENWTAQFRTFGDQL 244
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
R ED+A+AV RFF +GG+ +NYYMY GGTNFGRT G + +T Y +AP+DEYG+
Sbjct: 245 AQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMCK 303
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPK+GHL+DLH IK A + + I LG EAH Y C +FL+N +
Sbjct: 304 EPKFGHLRDLHNVIKSYHKAFLWGKQSFEI-LGHGYEAHNYELP---EDKLCLSFLSNNN 359
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+V F G+ + +P SVSIL DC+ V+NT +V Q S +
Sbjct: 360 TGEDGTVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHS-----------------E 402
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
+S + +S + W E I + + + LE N TKD SDYLW+ T + D
Sbjct: 403 RSFHTTDETSKNNVWEMYSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESD 462
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYNDLI 581
D+ F + ++RP + I S + F N G+ G + +P++ + G N +
Sbjct: 463 DLPFRR--DIRPVIQIKSTAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIA 520
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
+LS ++G+++ G L + G + V + G G +DL ++ L+GE ++IY+ E
Sbjct: 521 MLSSSMGMKDSGGELVEVKGGIQDCV-VQGLNTGTLDLQGNGRGHKARLEGEDKEIYT-E 578
Query: 642 ENEA--EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
+ A +W D +P TWYK YFD PDG DP+ +D+ SM KG +VNG IGRYWT
Sbjct: 579 KGMAQFQWKPAEND-LP--ITWYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWT 635
Query: 700 VVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPF 759
T G+P+Q+ YH+PR++L+ NLL+IFEE G P
Sbjct: 636 SF---------------------ITLAGHPSQSVYHIPRAFLKPKGNLLIIFEEELGKPG 674
Query: 760 EISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDG---KLSINKMAPEMHLHCQDGYIIS 816
I ++ +C +SE + ++ W + DG KL + L+C I
Sbjct: 675 GILIQTVRRDDICVFISEHNPAQIKTWES----DGGQIKLIAEDTSTRGTLNCPPQRTIQ 730
Query: 817 SIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+ FAS+G P+G C F+ G CH P + +VV +
Sbjct: 731 EVVFASFGNPEGACGNFTAGTCHTPDAKAVVEK 763
>gi|4467146|emb|CAB37515.1| galactosidase like protein [Arabidopsis thaliana]
gi|7270842|emb|CAB80523.1| galactosidase like protein [Arabidopsis thaliana]
Length = 1036
Score = 526 bits (1355), Expect = e-146, Method: Compositional matrix adjust.
Identities = 298/762 (39%), Positives = 441/762 (57%), Gaps = 81/762 (10%)
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
QY+FKG+ D+VKF+KL+ GLY+ LR+GP++ AEWN GG P WLR++P + FRTNN PF
Sbjct: 80 QYDFKGRFDLVKFIKLIHEKGLYVTLRLGPFIQAEWNHGGLPYWLREVPDVYFRTNNEPF 139
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
KE +R+V+KI+ +M+EE LF+ QGGPII+ QIENEY ++ +Y + G+ Y+KWAA++
Sbjct: 140 KEHTERYVRKILGMMKEEKLFASQGGPIILGQIENEYNAVQLAYKENGEKYIKWAANLVE 199
Query: 228 GLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGGRL 285
+ G+PWVMCKQ DAP N+I+ACNG +C D + PN ++KP+LWTENW + +G
Sbjct: 200 SMNLGIPWVMCKQNDAPGNLINACNGRHCGDTFPGPNRHDKPSLWTENWTTQFRVFGDPP 259
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
R VED+AF+VAR+F + GS +NYYMY GGTNFGRTS F T Y DAP+DE+GL
Sbjct: 260 TQRTVEDIAFSVARYFSKNGSHVNYYMYHGGTNFGRTSAH-FVTTRYYDDAPLDEFGLEK 318
Query: 346 EPKWGHLKDLHAAIKLCEPALV-AADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
PK+GHLK +H A++LC+ AL AQ LG + E Y + G++ C+AFL+N
Sbjct: 319 APKYGHLKHVHRALRLCKKALFWGQLRAQ--TLGPDTEVRYYE--QPGTKV-CAAFLSNN 373
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
+ ++ F GQ Y LP S+SILPDC+ V+NTA++ +Q S +
Sbjct: 374 NTRDTNTIKFKGQDYVLPSRSISILPDCKTVVYNTAQIVAQHSWR--------------- 418
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
++S+ +S + E I + + + G L +L TKD +DY W+ T + + +
Sbjct: 419 --DFVKSEKTSKGLKFEMFSENIPSLLDGDSLIPGELYYL--TKDKTDYAWYTTSVKIDE 474
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG-HWVK---VVQPVEFQSGYNDL 580
DD F ++ + + S+ L V++NG+ G G H +K +PV F++G N +
Sbjct: 475 DD--FPDQKGLKTILRVASLGHALIVYVNGEYAGKAHGRHEMKSFEFAKPVNFKTGDNRI 532
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSK-ILWTYQVGLKGEFQQIYS 639
+L GL + G+++E AG R + + G K+G DL++ W + GL+GE +++Y+
Sbjct: 533 SILGVLTGLPDSGSYMEHRFAGPRA-ISIIGLKSGTRDLTENNEWGHLAGLEGEKKEVYT 591
Query: 640 IE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
E + +W +DG TWYKTYF+ P+G++ VA+ + +MGKG WVNG +GRYW
Sbjct: 592 EEGSKKVKW---EKDGKRKPLTWYKTYFETPEGVNAVAIRMKAMGKGLIWVNGIGVGRYW 648
Query: 699 -TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQA--SNNLLVIFEETG 755
+ ++P G PTQT YH+PRS+++ N+LVI EE
Sbjct: 649 MSFLSP----------------------LGEPTQTEYHIPRSFMKGEKKKNMLVILEEEP 686
Query: 756 GNPFE-ISVKLRSTRIVCEQVSESHYPPVRKWSN------SYSVDGKL-SINKMAPEMHL 807
G E I L + +C V E + V+ W S S D +L ++ + PE +
Sbjct: 687 GVKLESIDFVLVNRDTICSNVGEDYPVSVKSWKREGPKIVSRSKDMRLKAVMRCPPEKQM 746
Query: 808 HCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
++FAS+G P G C F+ G C A S VV +
Sbjct: 747 --------VEVQFASFGDPTGTCGNFTMGKCSASKSKEVVEK 780
>gi|110737487|dbj|BAF00686.1| beta-galactosidase [Arabidopsis thaliana]
Length = 532
Score = 514 bits (1324), Expect = e-143, Method: Compositional matrix adjust.
Identities = 266/556 (47%), Positives = 347/556 (62%), Gaps = 33/556 (5%)
Query: 225 MALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGR 284
MA+ GVPW+MC+Q DAP +I CNG+YCD + PN+ +KP +WTENW GW+ T+GGR
Sbjct: 1 MAVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQFTPNTPDKPKIWTENWPGWFKTFGGR 60
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
PHRP ED+A++VARFF +GGS NYYMY GGTNFGRTSGGPF TSYDY+APIDEYGL
Sbjct: 61 DPHRPAEDVAYSVARFFGKGGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGLP 120
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
PKWGHLKDLH AI L E L++ + Q LG + EA VY S C+AFL+N+
Sbjct: 121 RLPKWGHLKDLHKAIMLSENLLISGEH-QNFTLGHSLEADVYT----DSSGTCAAFLSNL 175
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
D+ +V F SY LP WSVSILPDC+ VFNTAKV+S++S VE LP
Sbjct: 176 DDKNDKAVMFRNTSYHLPAWSVSILPDCKTEVFNTAKVTSKSS--KVEM-LP-------- 224
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
E SS+ W E G+W +F +++H+N TKD +DYLW+ T I VS+
Sbjct: 225 -----EDLKSSSGLKWEVFSEKPGIWGAADFVKNELVDHINTTKDTTDYLWYTTSITVSE 279
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWV----KVVQPVEFQSGYNDL 580
++ +F K P + I+S L VFIN + G+ G+ K+ +PV ++G N++
Sbjct: 280 NE-AFLKKGS-SPVLFIESKGHTLHVFINKEYLGTATGNGTHVPFKLKKPVALKAGENNI 337
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LLS TVGL N G+F E GAG V + GF G ++L+ W+Y++G++GE +++
Sbjct: 338 DLLSMTVGLANAGSFYEWVGAGLT-SVSIKGFNKGTLNLTNSKWSYKLGVEGEHLELFKP 396
Query: 641 EENEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
+ A +WT T+ TWYK + P G +PV LD+ SMGKG AW+NG IGRYW
Sbjct: 397 GNSGAVKWTVTTKPPKKQPLTWYKVVIEPPSGSEPVGLDMISMGKGMAWLNGEEIGRYWP 456
Query: 700 VVA----PKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
+A P C CDYRG + DKC T CG P+Q WYHVPRSW ++S N LVIFEE G
Sbjct: 457 RIARKNSPNDECVKECDYRGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGNELVIFEEKG 516
Query: 756 GNPFEISVKLRSTRIV 771
GNP +I + R +V
Sbjct: 517 GNPMKIKLSKRKVSVV 532
>gi|326500386|dbj|BAK06282.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 846
Score = 514 bits (1324), Expect = e-143, Method: Compositional matrix adjust.
Identities = 283/751 (37%), Positives = 413/751 (54%), Gaps = 62/751 (8%)
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
Q F+G+ND++KF+KL+ S +Y +RIGP++ AEWN GG P WLR+IP I FR NN P+
Sbjct: 105 QVQFEGRNDLIKFLKLIQSHDMYALVRIGPFIQAEWNHGGLPYWLREIPHIIFRANNEPY 164
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
K+EM++FV+ IV +++ +F+ QGGP+I+ QIENEYGN++ + +G Y++WAA MA+
Sbjct: 165 KKEMEKFVRFIVQKLKDAEMFASQGGPVILAQIENEYGNIKKDHIVEGDKYLEWAAQMAI 224
Query: 228 GLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
GVPW+MCKQ+ AP +I CNG +C D + NKP LWTENW + +G +L
Sbjct: 225 STNTGVPWIMCKQSTAPGEVIPTCNGRHCGDTWTLKDKNKPRLWTENWTAQFRAFGDQLA 284
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
R ED+A++V RFF +GG+ +NYYMY+GGTNFGRT G + +T Y + P+DEYG+
Sbjct: 285 LRSAEDIAYSVLRFFAKGGTLVNYYMYYGGTNFGRT-GASYVLTGYYDEGPVDEYGMPKA 343
Query: 347 PKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDE 406
PK+GHL+DLH IK A + + + L EAH + + C AF++N +
Sbjct: 344 PKYGHLRDLHNLIKSYSRAFLEGKQS-FELLAHGYEAHNFEIP---EEKLCLAFISNNNT 399
Query: 407 HTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQ 466
+V F G Y +P SVSIL DC++ V+NT +V Q S ++
Sbjct: 400 GEDGTVNFRGDKYYIPSRSVSILADCKHVVYNTKRVFVQHS-----------------ER 442
Query: 467 SMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDD 526
S ++ + S +W EPI + + + +E N+TKD SDYLW+ T + DD
Sbjct: 443 SFHTAQKLAKSNAWEMYSEPIPRYKLTSIRNKEPMEQYNLTKDDSDYLWYTTSFRLEADD 502
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYNDLIL 582
+ F ++RP V + S L F+N G+ G + P+ + G N L L
Sbjct: 503 LPF--RGDIRPVVQVKSTSHALMGFVNDAFAGNGRGSKKEKGFMFETPINLRIGINHLAL 560
Query: 583 LSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEE 642
LS ++G+++ G L + G + + G G +DL W ++V L+GE ++IY+ +
Sbjct: 561 LSSSMGMKDSGGELVEVKGGIQ-DCTIQGLNTGTLDLQVNGWGHKVKLEGEVKEIYTEKG 619
Query: 643 NEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVV 701
A +W T TWYK YFD PDG DPV LD+ SMGKG +VNG +GRYW
Sbjct: 620 MGAVKWVPATTG---RAVTWYKRYFDEPDGEDPVVLDMTSMGKGMIFVNGEGMGRYWP-- 674
Query: 702 APKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEI 761
YR T G P+Q YH+PR +L+ NNLLVIFEE G P I
Sbjct: 675 ----------SYR---------TVGGVPSQAMYHIPRPFLKPKNNLLVIFEEELGKPEGI 715
Query: 762 SVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDG---KLSINKMAPEMHLHCQDGYIISSI 818
++ +C +SE + ++ W DG K+ + L C I +
Sbjct: 716 LIQTVRRDDICVFISEHNPAQIKTWDK----DGGQIKVIAEDHSTRGILKCPPKKTIQEV 771
Query: 819 EFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
FAS+G P+G C F+ G+CH P + +V++
Sbjct: 772 VFASFGNPEGSCANFTAGSCHTPNAKDIVAK 802
>gi|110741385|dbj|BAF02242.1| putative galactosidase [Arabidopsis thaliana]
Length = 592
Score = 509 bits (1312), Expect = e-141, Method: Compositional matrix adjust.
Identities = 266/578 (46%), Positives = 353/578 (61%), Gaps = 37/578 (6%)
Query: 269 LWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY 328
+WTE W GW+T +GG +P+RP ED+AF+VARF Q+GGSF+NYYMY GGTNFGRT+GGPF
Sbjct: 1 MWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFI 60
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRA 388
TSYDYDAP+DEYGL +PKWGHLKDLH AIKLCEPALV+ + + + LG QEAHVY++
Sbjct: 61 ATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTR-MPLGNYQEAHVYKS 119
Query: 389 NRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSI 448
CSAFLAN + + A V+F Y LPPWS+SILPDC+NTV+NTA+V +QTS
Sbjct: 120 K----SGACSAFLANYNPKSYAKVSFGNNHYNLPPWSISILPDCKNTVYNTARVGAQTSR 175
Query: 449 KTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTK 508
+ + VP + SW E + + +FT+ G++E +N T+
Sbjct: 176 MKM---------VRVPVHGGL---------SWQAYNEDPSTYIDESFTMVGLVEQINTTR 217
Query: 509 DYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW---- 564
D SDYLW++T + V D + F + ++ PT+T+ S + VFINGQL+GS G
Sbjct: 218 DTSDYLWYMTDVKV-DANEGFLRNGDL-PTLTVLSAGHAMHVFINGQLSGSAYGSLDSPK 275
Query: 565 VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILW 624
+ + V ++G+N + +LS VGL N G E AG G V L G G DLS W
Sbjct: 276 LTFRKGVNLRAGFNKIAILSIAVGLPNVGPHFETWNAGVLGPVSLNGLNGGRRDLSWQKW 335
Query: 625 TYQVGLKGE-FQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMG 683
TY+VGLKGE + EW + TWYKT F AP G P+A+D+GSMG
Sbjct: 336 TYKVGLKGESLSLHSLSGSSSVEWAEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMG 395
Query: 684 KGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQA 743
KGQ W+NG +GR+W G C + C Y G + DKC NCG +Q WYHVPRSWL+
Sbjct: 396 KGQIWINGQSLGRHWPAYKAVGSCSE-CSYTGTFREDKCLRNCGEASQRWYHVPRSWLKP 454
Query: 744 SNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINK-MA 802
S NLLV+FEE GG+P I++ R VC + E V +Y + +NK +
Sbjct: 455 SGNLLVVFEEWGGDPNGITLVRREVDSVCADIYEWQSTLV-----NYQLHASGKVNKPLH 509
Query: 803 PEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHA 840
P+ HL C G I++++FAS+GTP+G C + +G+CHA
Sbjct: 510 PKAHLQCGPGQKITTVKFASFGTPEGTCGSYRQGSCHA 547
>gi|449526237|ref|XP_004170120.1| PREDICTED: beta-galactosidase 7-like, partial [Cucumis sativus]
Length = 706
Score = 509 bits (1311), Expect = e-141, Method: Compositional matrix adjust.
Identities = 272/660 (41%), Positives = 381/660 (57%), Gaps = 58/660 (8%)
Query: 200 IENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGY 259
IENE+GN+E SYGQ+GK+YVKW A +A PW+MC+Q DAP+ II+ CNG+YCD +
Sbjct: 1 IENEFGNVEGSYGQEGKEYVKWCAELAQSYNLSEPWIMCQQGDAPQPIINTCNGFYCDQF 60
Query: 260 KPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNF 319
KPN+ N P +WTE+W GW+ WG R P+R EDLAFAVARFFQ GGS NYYMY GGTNF
Sbjct: 61 KPNNKNSPKMWTESWAGWFKGWGERDPYRTAEDLAFAVARFFQYGGSLHNYYMYHGGTNF 120
Query: 320 GRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQ 379
GR++GGP+ TSYDY+AP+DEYG +++PKWGHLK LH I+ E L D ++I G
Sbjct: 121 GRSAGGPYITTSYDYNAPLDEYGNMNQPKWGHLKQLHELIRSMEKVLTYGD-VKHIDTGH 179
Query: 380 NQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNT 439
+ A Y Y +S+C F N E++ +TF + YT+P WSV++LPDC+ V+NT
Sbjct: 180 STTATSY---TYKGKSSC--FFGN-PENSDREITFQERKYTVPGWSVTVLPDCKTEVYNT 233
Query: 440 AKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQG 499
AKV++QT+I+ + P++ + ++ + + +T + I S + T
Sbjct: 234 AKVNTQTTIRE------MVPSLVGKHKKPLKWQWRNEKIEHLTHEGDI---SGSAITANS 284
Query: 500 ILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGS 559
+++ VT D SDYLW++T +++ +D F K R T+ + + +L F+N + G+
Sbjct: 285 LIDQKMVTNDSSDYLWYLTGFHLNGNDPLFGK----RVTLRVKTRGHILHAFVNNKHIGT 340
Query: 560 VIGHWVKVVQPVE-----FQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKN 614
G + K +E + G+N + LLS TVGL NYGA+ E G G V+L
Sbjct: 341 QFGPYGKYSFTLEKKVRNLRHGFNQIALLSATVGLPNYGAYYENVEVGIYGPVELIADGK 400
Query: 615 GDIDLSKILWTYQVGLKGEFQQIYSIEEN-EAEWTDLTRDGIP--STFTWYKTYFDAPDG 671
DLS W Y+VGL GE + + + W + +P FTWYKT F P G
Sbjct: 401 TIRDLSTNEWIYKVGLDGEKYEFFDPDHKFRKPWLS---NNLPLNQNFTWYKTSFSTPKG 457
Query: 672 IDPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPT 730
+ V +DL MGKGQAWVNG IGRYW + +A + GC +CDYRGAY KC TNCG PT
Sbjct: 458 REGVVVDLMGMGKGQAWVNGKSIGRYWPSYLATENGCSSSCDYRGAYYGSKCATNCGKPT 517
Query: 731 QTWYHVPRSWLQ-ASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNS 789
Q WYH+PRS++ N L++FEE GG P I +K + VC +V
Sbjct: 518 QRWYHIPRSYMNDGKENTLILFEEFGGMPLNIEIKTTRVKKVCAKVD------------- 564
Query: 790 YSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+ ++ L C D + I F +G P+G C F +G+CH+ + SV+ +
Sbjct: 565 -----------LGSKLELTCHD-RTVKRIIFVGFGNPKGNCNNFHKGSCHSSEAFSVIEK 612
>gi|281205901|gb|EFA80090.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
PN500]
Length = 727
Score = 508 bits (1307), Expect = e-141, Method: Compositional matrix adjust.
Identities = 288/741 (38%), Positives = 415/741 (56%), Gaps = 68/741 (9%)
Query: 38 ASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYV 97
A+ F P NVSYDHR++II+G R++L+SA IHYPRATP MW ++ +K G D+IETY
Sbjct: 34 AAKFGVPLNVSYDHRSLIINGERKLLLSASIHYPRATPSMWRPVLEATKAAGIDLIETYT 93
Query: 98 FWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPG 157
FWN HE G YNF+G ++ F+ + GLY+ +R GPYVCAEWN+GGFP WL++I G
Sbjct: 94 FWNLHEPTPGTYNFEGNANVTAFLDICAELGLYVTVRFGPYVCAEWNYGGFPFWLKEIDG 153
Query: 158 IEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKD 217
I FR N PF ++M ++ IV+ +R ++ GGPII+ Q+ENEYG +E++YG G
Sbjct: 154 IVFRDYNQPFMDQMSNWMTYIVNYLRP--YYASNGGPIILAQVENEYGWLEAAYGASGTK 211
Query: 218 YVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYC----DGYKPNSYNKPTLWTEN 273
Y WAA A L G+PW+MC Q D +I+ CNG+YC D + N+P WTEN
Sbjct: 212 YALWAAQFANSLDIGIPWIMCSQDDI-ATVINTCNGFYCHDWIDVHWTAYPNQPAFWTEN 270
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYD 333
W GW+ W G +PHRPV+D+ ++VAR+ GGS MNYYM+FGGT FGR +GGPF TSYD
Sbjct: 271 WPGWFQNWEGGVPHRPVQDVLYSVARWIAYGGSMMNYYMWFGGTTFGRWTGGPFITTSYD 330
Query: 334 YDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQE-AHVYRANRYG 392
YD IDEYG EPK+ + H I E +++ + + I LG+N E +H Y
Sbjct: 331 YDGAIDEYGYPYEPKYSQSLEFHTIIHAYEHIILSMNPPKPILLGENVEISHFYSVETGE 390
Query: 393 SQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVE 452
S S FLAN +V + G ++ + PWSV +L + ++F+T+ + +
Sbjct: 391 SFS----FLANFGATGVQTVQWNGITFKVQPWSVQLLYN-NVSIFDTSATPIGSPV---- 441
Query: 453 FSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSEN---NFT--VQGILEHLNVT 507
P P +S E IG WSE+ FT + +E L++T
Sbjct: 442 ------PKQFTPIKSF----------------ENIGQWSESFDLTFTNYSETPMEQLSLT 479
Query: 508 KDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKV 567
+D +DYLW++T+I + N V +++ ++ D++ VF++ Q + G +
Sbjct: 480 RDQTDYLWYVTKI----------EVNRVGAQLSLPNISDMVHVFVDNQYIATGRGP-TNI 528
Query: 568 VQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQ 627
G + L +L VGL NY +E AG V L +D+S W+ +
Sbjct: 529 TLNSTIGVGGHTLQVLHTKVGLVNYAEHMEATVAGIFEPVTLD-----SVDISSNGWSMK 583
Query: 628 VGLKGEFQQIYSIEEN-EAEWTDLTRDGIPSTFTWYKTYFDAPDGID-PVALDLGSMGKG 685
++GE Q+Y+ + +WT++T + TWYK F+ + +ALD+ M KG
Sbjct: 584 PFVQGETLQLYNPNHSGSVQWTNVTGN---PPLTWYKFNFNLELSSNMSLALDMLGMTKG 640
Query: 686 QAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASN 745
+VNG++IGRYW +A GC + C Y+G Y+ C CG P+Q +YHVP WL
Sbjct: 641 MIFVNGYNIGRYWLALA--YGC-NPCTYQGGYSPSMCQLGCGEPSQQYYHVPTDWLMNGE 697
Query: 746 NLLVIFEETGGNPFEISVKLR 766
N +VIFEE GNP I++ R
Sbjct: 698 NEIVIFEEVYGNPEAITLVQR 718
>gi|2924512|emb|CAA17766.1| beta-galactosidase-like protein [Arabidopsis thaliana]
gi|7270452|emb|CAB80218.1| beta-galactosidase-like protein [Arabidopsis thaliana]
Length = 831
Score = 498 bits (1283), Expect = e-138, Method: Compositional matrix adjust.
Identities = 296/823 (35%), Positives = 444/823 (53%), Gaps = 109/823 (13%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD ++IIDG R +L S IHYPR+TPEMWP +I ++K+GG + I+TYVFWN HE +
Sbjct: 54 VTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHEPQQ 113
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G++NF G+ D+VKF+KL+ +G+Y+ LR+GP++ AEW G +
Sbjct: 114 GKFNFSGRADLVKFIKLIQKNGMYVTLRLGPFIQAEWTHGYITRYDH------------- 160
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
K I R +IENEY ++ +Y Q G +Y+KWA+++
Sbjct: 161 ---------KNIAGAYR---------------KIENEYSAVQRAYKQDGLNYIKWASNLV 196
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGGR 284
+ G+PWVMCKQ DAP+ +I+ACNG +C D + PN NKP+LWTENW + +G
Sbjct: 197 DSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNRENKPSLWTENWTTQFRVFGDP 256
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
R VED+A++VARFF + G+ +NYYMY GGTNFGRTS + T Y DAP+DEYGL
Sbjct: 257 PTQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYDDAPLDEYGLE 315
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
EPK+GHLK LH A+ LC+ L+ + K G++ E Y + G+++ C+AFLAN
Sbjct: 316 KEPKYGHLKHLHNALNLCKKPLLWGQ-PKTEKPGKDTEIRYYE--QPGTKT-CAAFLANN 371
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
+ A ++ F G+ Y + P S+SILPDC+ V+NTA++ SQ +
Sbjct: 372 NTEAAETIKFKGREYVIAPRSISILPDCKTVVYNTAQIVSQHT----------------- 414
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
++ ++SK ++ + E + E N + +E +TKD +DY W+ T V
Sbjct: 415 SRNFMKSKKANKKFDFKVFTETLPSKLEGNSYIP--VELYGLTKDKTDYGWYTTSFKVHK 472
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG-HWVKVV---QPVEFQSGYNDL 580
+ + K V+ V I S+ L ++NG+ GS G H K + V ++G N L
Sbjct: 473 NHLPTKKG--VKTFVRIASLGHALHAWLNGEYLGSGHGSHEEKSFVFQKQVTLKAGENHL 530
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSK-ILWTYQVGLKGEFQQIYS 639
++L G + G+++E G RG + + G +G +DL++ W ++G++GE I++
Sbjct: 531 VMLGVLTGFPDSGSYMEHRYTGPRG-ISILGLTSGTLDLTESSKWGNKIGMEGEKLGIHT 589
Query: 640 IEE-NEAEWTDLTRDGIPSTFTWY----------KTYFDAPDGIDPVALDLGSMGKGQAW 688
E + EW T G TWY +TYFDAP+ + + + MGKG W
Sbjct: 590 EEGLKKVEWKKFT--GKAPGLTWYQKFSKECETLQTYFDAPESVSAATIRMHGMGKGLIW 647
Query: 689 VNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNL 747
VNG +GRYW + ++P G PTQ YH+PRS+L+ NL
Sbjct: 648 VNGEGVGRYWQSFLSP----------------------LGQPTQIEYHIPRSFLKPKKNL 685
Query: 748 LVIFEETGG-NPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMH 806
LVIFEE P + + + VC V E++ P VR W+ ++ N ++
Sbjct: 686 LVIFEEEPNVKPELMDFAIVNRDTVCSYVGENYTPSVRHWTRKKDQVQAITDN-VSLTAT 744
Query: 807 LHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
L C I+++EFAS+G P G C F+ G C+AP+S V+ +
Sbjct: 745 LKCSGTKKIAAVEFASFGNPIGVCGNFTLGTCNAPVSKQVIEK 787
>gi|110739914|dbj|BAF01862.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 578
Score = 496 bits (1276), Expect = e-137, Method: Compositional matrix adjust.
Identities = 271/562 (48%), Positives = 338/562 (60%), Gaps = 39/562 (6%)
Query: 293 LAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHL 352
LAF VARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGL+ +PK+GHL
Sbjct: 1 LAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYGLIRQPKYGHL 60
Query: 353 KDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASV 412
K+LH AIK+CE ALV+AD +G Q+AHVY A +CSAFLAN D +AA V
Sbjct: 61 KELHRAIKMCEKALVSADPV-VTSIGNKQQAHVYSAE----SGDCSAFLANYDTESAARV 115
Query: 413 TFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESK 472
F Y LPPWS+SILPDCRN VFNTAKV QTS Q M+ +
Sbjct: 116 LFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQTS-----------------QMEMLPT- 157
Query: 473 LSSTSKSWMTVKEPIGVWSENN-FTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWK 531
+ + W + E + +++ FT G+LE +NVT+D SDYLW++T + + D + SF
Sbjct: 158 -DTKNFQWESYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSVDIGDSE-SFLH 215
Query: 532 TNEVRPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLILLSQTV 587
E+ PT+ I S + +F+NGQL+GS G + SG N + LLS V
Sbjct: 216 GGEL-PTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSVAV 274
Query: 588 GLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI-YSIEENEAE 646
GL N G E G G V L G G +DLS WTYQVGLKGE + +
Sbjct: 275 GLPNVGGHFESWNTGILGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMNLAFPTNTPSIG 334
Query: 647 WTDLTRD-GIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKG 705
W D + P TW+KTYFDAP+G +P+ALD+ MGKGQ WVNG IGRYWT A G
Sbjct: 335 WMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAFA-TG 393
Query: 706 GCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKL 765
C C Y G Y +KC T CG PTQ WYHVPR+WL+ S NLLVIFEE GGNP +S+
Sbjct: 394 DCSH-CSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPSTVSLVK 452
Query: 766 RSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGT 825
RS VC +VSE H P ++ W G+ P++HL C G I+SI+FAS+GT
Sbjct: 453 RSVSGVCAEVSEYH-PNIKNWQIESYGKGQ---TFHRPKVHLKCSPGQAIASIKFASFGT 508
Query: 826 PQGRCQKFSRGNCHAPMSLSVV 847
P G C + +G CHA S +++
Sbjct: 509 PLGTCGSYQQGECHAATSYAIL 530
>gi|449445172|ref|XP_004140347.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 493
Score = 494 bits (1273), Expect = e-137, Method: Compositional matrix adjust.
Identities = 246/496 (49%), Positives = 319/496 (64%), Gaps = 27/496 (5%)
Query: 33 VSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADV 92
V++ + TF NVSYD AIII+G RR++ S IHYPR+T MWPDLI K+K+GG D
Sbjct: 8 VATLACLTFCLGDNVSYDSNAIIINGERRIIFSGSIHYPRSTEAMWPDLIQKAKDGGLDA 67
Query: 93 IETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWL 152
IETY+FW+ HE R +Y+F G+ D +KF +L+ +GLY+ +RIGPYVCAEWN+GGFPVWL
Sbjct: 68 IETYIFWDRHEPQRRKYDFSGRLDFIKFFQLIQDAGLYVVMRIGPYVCAEWNYGGFPVWL 127
Query: 153 RDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGN-MESSY 211
++PGI+ RTNN +K EMQ F KIV++ ++ LF+ QGGPII+ QIENEYGN M +Y
Sbjct: 128 HNMPGIQLRTNNQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPAY 187
Query: 212 GQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWT 271
G GK Y+ W A MA L GVPW+MC+Q+DAP+ II+ CNG+YCD + PN+ P ++T
Sbjct: 188 GDAGKAYINWCAQMAESLNIGVPWIMCQQSDAPQPIINTCNGFYCDNFTPNNPKSPKMFT 247
Query: 272 ENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITS 331
ENW GW+ WG + P+R ED+AF+VARFFQ GG F NYYMY GGTNFGRTSGGPF TS
Sbjct: 248 ENWVGWFKKWGDKDPYRTAEDVAFSVARFFQSGGVFNNYYMYHGGTNFGRTSGGPFITTS 307
Query: 332 YDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRY 391
YDY+AP+DEYG L++PKWGHLK LHA+IKL E L QN + V +
Sbjct: 308 YDYNAPLDEYGNLNQPKWGHLKQLHASIKLGEKILTNGTHTN-----QNFGSSVTLTKFF 362
Query: 392 GSQSNCS-AFLANIDEHTAASVTFLGQ-SYTLPPWSVSILPDCRNTVFNTAKVSSQTSIK 449
+ FL+N D A++ Y +P WSVSIL C V+NTAKV+SQTS+
Sbjct: 363 NPTTGERFCFLSNTDGKNDATIDLQADGKYFVPAWSVSILDGCNKEVYNTAKVNSQTSMF 422
Query: 450 TVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENN--FTVQGILEHLNVT 507
E +++ + SW EP+ + N F LE VT
Sbjct: 423 VKE-----------------QNEKENAQLSWAWAPEPMKDTLQGNGKFAANLFLEQKRVT 465
Query: 508 KDYSDYLWHITQIYVS 523
D+SDY W++T + S
Sbjct: 466 ADFSDYFWYMTNVDTS 481
>gi|33521216|gb|AAQ21370.1| beta-galactosidase [Sandersonia aurantiaca]
Length = 568
Score = 492 bits (1267), Expect = e-136, Method: Compositional matrix adjust.
Identities = 258/563 (45%), Positives = 344/563 (61%), Gaps = 47/563 (8%)
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
PHRP ED+AFAVARF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYGLL
Sbjct: 1 PHRPAEDIAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLR 60
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPKWGHL+DLH AIKLCEPALV+ D +G Q++HV+R+ C+AFL+N D
Sbjct: 61 EPKWGHLRDLHRAIKLCEPALVSGDPT-VTSIGHYQQSHVFRSK----AGACAAFLSNYD 115
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+ A V F G Y +PPWS+SILPDC+ TVFNTA++ +QTS +E++
Sbjct: 116 SGSYARVVFNGIHYDIPPWSISILPDCKTTVFNTARIGAQTSQLKMEWAGKF-------- 167
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
SW + E + + +FT G++E +++T+D +DYLW+ T + + ++
Sbjct: 168 -------------SWESYNEDTNSFDDRSFTKVGLVEQISMTRDNTDYLWYTTYVNIGEN 214
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLI 581
+ F K N P +T++S + ++INGQLTG++ G + V+ +G N +
Sbjct: 215 E-GFLK-NGHYPVLTVNSAGHSMHIYINGQLTGTIYGALENPKLTYTGSVKLWAGSNKIS 272
Query: 582 LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE 641
+LS VGL N G E G G V L+G G DLS W YQ+GLKGE ++++
Sbjct: 273 ILSVAVGLPNIGGHFETWNTGVLGPVTLSGLNEGKRDLSWQKWIYQIGLKGEALNLHTLS 332
Query: 642 -ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ EW ++ + TWYKT F+AP G DP+ALD+GSMGKGQ W+NG +GRYW
Sbjct: 333 GSSSVEWGGPSQK---QSLTWYKTSFNAPAGNDPLALDMGSMGKGQVWINGQSVGRYWPA 389
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFE 760
G C CDYRG YN KC +NCG TQ WYHVPRSWL + NLLV+FEE GG+P
Sbjct: 390 YKASGSC-GGCDYRGTYNEKKCQSNCGESTQRWYHVPRSWLNPTGNLLVVFEEWGGDPSG 448
Query: 761 ISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEF 820
IS+ R VC +++E + P ++D + N + HL C G +++I+F
Sbjct: 449 ISMVRRKVESVCAEIAE--WQP--------NMDNVHTGNYGRSKAHLSCAPGQKMTNIKF 498
Query: 821 ASYGTPQGRCQKFSRGNCHAPMS 843
AS+GTPQG C FS G CHA S
Sbjct: 499 ASFGTPQGTCGAFSEGTCHAHKS 521
>gi|115445061|ref|NP_001046310.1| Os02g0219200 [Oryza sativa Japonica Group]
gi|113535841|dbj|BAF08224.1| Os02g0219200, partial [Oryza sativa Japonica Group]
Length = 500
Score = 482 bits (1241), Expect = e-133, Method: Compositional matrix adjust.
Identities = 254/535 (47%), Positives = 335/535 (62%), Gaps = 40/535 (7%)
Query: 239 KQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVA 298
KQ DAP+ +I+ CNG+YCD + PN KP++WTE W GW+T++GG +PHRPVEDLAFAVA
Sbjct: 1 KQDDAPDPVINTCNGFYCDYFSPNKNYKPSMWTEAWTGWFTSFGGGVPHRPVEDLAFAVA 60
Query: 299 RFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAA 358
RF Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDE+GLL +PKWGHL+DLH A
Sbjct: 61 RFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLLRQPKWGHLRDLHRA 120
Query: 359 IKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQS 418
IK EP LV+AD +G ++A+V++A C+AFL+N +TA V F GQ
Sbjct: 121 IKQAEPVLVSADPT-IESIGSYEKAYVFKA----KNGACAAFLSNYHMNTAVKVRFNGQQ 175
Query: 419 YTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSK 478
Y LP WS+SILPDC+ VFNTA V T + ++P +
Sbjct: 176 YNLPAWSISILPDCKTAVFNTATVKEPTLMPK------MNPVVRF--------------- 214
Query: 479 SWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPT 538
+W + E S++ FT G++E L++T D SDYLW+ T + + +D+ ++ P
Sbjct: 215 AWQSYSEDTNSLSDSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTNDLRSGQS----PQ 270
Query: 539 VTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGA 594
+T+ S ++VF+NG+ GSV G + + V+ G N + +LS VGL N G
Sbjct: 271 LTVYSAGHSMQVFVNGKSYGSVYGGYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGN 330
Query: 595 FLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEA-EWTDLTRD 653
E G G V L+ G DLS WTYQVGLKGE ++++ + A EW
Sbjct: 331 HFENWNVGVLGPVTLSSLNGGTKDLSHQKWTYQVGLKGETLGLHTVTGSSAVEWGG---P 387
Query: 654 GIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDY 713
G TW+K +F+AP G DPVALD+GSMGKGQ WVNGHH+GRYW+ A GGC C Y
Sbjct: 388 GGYQPLTWHKAFFNAPAGNDPVALDMGSMGKGQLWVNGHHVGRYWSYKA-SGGCGG-CSY 445
Query: 714 RGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRST 768
G Y+ DKC +NCG+ +Q WYHVPRSWL+ NLLV+ EE GG+ +S+ R+T
Sbjct: 446 AGTYHEDKCRSNCGDLSQRWYHVPRSWLKPGGNLLVVLEEYGGDLAGVSLATRTT 500
>gi|359477955|ref|XP_003632046.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 10-like [Vitis
vinifera]
Length = 563
Score = 471 bits (1211), Expect = e-130, Method: Compositional matrix adjust.
Identities = 253/555 (45%), Positives = 336/555 (60%), Gaps = 22/555 (3%)
Query: 77 MWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIG 136
MW L+ +KEGG DVIETYVF N HE Y F G D++KFVK+V +G+YL L IG
Sbjct: 1 MWSGLVKTAKEGGIDVIETYVFQNGHELSPSNYYFGGWYDLLKFVKIVQQAGMYLILHIG 60
Query: 137 PYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPII 196
P+V EWNFGG P+WL +P F+TN+ PFK MQ+F+ IV++M+++ LF+ QGGPII
Sbjct: 61 PFVATEWNFGGVPIWLHYVPRTIFQTNSKPFKYHMQKFMTLIVNIMKKDKLFASQGGPII 120
Query: 197 MLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYC 256
+ Q+ENEYG+ + Y GK YV WAA+M L GVPW+MC+ + + +I+ CN +YC
Sbjct: 121 LTQVENEYGDTKRIYEDGGKPYVMWAANMVLSHNIGVPWIMCQXYASSDPMINTCNSFYC 180
Query: 257 DGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGG 316
D + PNS +K +WTENW W+ T+G HR ED+AF+VA FF NYYMY GG
Sbjct: 181 DQFTPNSPSKAQMWTENWPRWFKTFGASNSHRLHEDIAFSVALFFFPKSX--NYYMYHGG 238
Query: 317 TNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIK 376
TNFG TSGGPF T+Y+Y+APIDEYGL PK GHLK+L AIK CE L+ + +
Sbjct: 239 TNFGCTSGGPFITTTYNYNAPIDEYGLARLPKCGHLKELRRAIKSCEHVLLYGEPIN-LX 297
Query: 377 LGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTV 436
LG +QE VY S +AF++N+DE + F SY +P WSVSILPDC+N V
Sbjct: 298 LGPSQEVDVYA----DSLGGYAAFISNVDEKEDKMIVFQNXSYHVPAWSVSILPDCKNVV 353
Query: 437 FNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFT 496
FNTAKV SQ I VE L Q S++ S W T E G+W E +F
Sbjct: 354 FNTAKVVSQ--ISQVEMVL------EDLQPSLVPSNKDLKGLXWKTFVEKAGIWGEADFV 405
Query: 497 VQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQL 556
G ++H+N TKD +D LW+ I V + + +F K +P + ++S L F+N +L
Sbjct: 406 KNGFVDHINTTKDTTDXLWYTVSITVGESE-NFLKEIS-QPILLVESKGHALHAFVNQKL 463
Query: 557 TGSVIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGF 612
GS G+ K P+ ++G N++++LS TVGLQN F E GA VK+ G
Sbjct: 464 QGSASGNGSHSPFKFECPISLKAGKNEIVVLSMTVGLQNEIPFYEWVGARLT-SVKIKGL 522
Query: 613 KNGDIDLSKILWTYQ 627
NG +DLS W Y+
Sbjct: 523 NNGIMDLSTYPWIYK 537
>gi|15027869|gb|AAK76465.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 621
Score = 471 bits (1211), Expect = e-129, Method: Compositional matrix adjust.
Identities = 256/629 (40%), Positives = 360/629 (57%), Gaps = 67/629 (10%)
Query: 225 MALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGR 284
MA L GVPW+MC+Q +AP+ +++ CNG+YCD Y+P + + P +WTENW GW+ WGG+
Sbjct: 1 MANSLDIGVPWLMCQQPNAPQPMLETCNGFYCDQYEPTNPSTPKMWTENWTGWFKNWGGK 60
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
P+R EDLAF+VARFFQ GG+F NYYMY GGTNFGR +GGP+ TSYDY AP+DE+G L
Sbjct: 61 HPYRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTSYDYHAPLDEFGNL 120
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
++PKWGHLK LH +K E +L + ++ I LG + +A +Y ++ S F+ N+
Sbjct: 121 NQPKWGHLKQLHTVLKSMEKSLTYGNISR-IDLGNSIKATIYT-----TKEGSSCFIGNV 174
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
+ A V F G+ Y +P WSVS+LPDC +NTAKV++QTSI T + S P
Sbjct: 175 NATADALVNFKGKDYHVPAWSVSVLPDCDKEAYNTAKVNTQTSIMTEDSSKP-------- 226
Query: 465 QQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSD 524
+E S M +K + +G+++ +VT D SDYLW++T++++
Sbjct: 227 --ERLEWTWRPESAQKMILK------GSGDLIAKGLVDQKDVTNDASDYLWYMTRLHLDK 278
Query: 525 DDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQ-----SGYND 579
D W N T+ + S VL ++NG+ G+ K E + G N
Sbjct: 279 KD-PLWSRNM---TLRVHSNAHVLHAYVNGKYVGNQFVKDGKFDYRFERKVNHLVHGTNH 334
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI---DLSKILWTYQVGLKGEFQQ 636
+ LLS +VGLQNYG F E G G V L G+K + DLS+ W Y++GL G +
Sbjct: 335 ISLLSVSVGLQNYGPFFESGPTGINGPVSLVGYKGEETIEKDLSQHQWDYKIGLNGYNDK 394
Query: 637 IYSIEE-NEAEWTDLTRDGIPS--TFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHH 693
++SI+ +W + + +P+ TWYK F AP G +PV +DL +GKG+AW+NG
Sbjct: 395 LFSIKSVGHQKWAN---EKLPTGRMLTWYKAKFKAPLGKEPVIVDLNGLGKGEAWINGQS 451
Query: 694 IGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQAS-NNLLVIF 751
IGRYW + + GC+D CDYRGAY SDKC CG PTQ WYHVPRS+L AS +N + +F
Sbjct: 452 IGRYWPSFNSSDDGCKDKCDYRGAYGSDKCAFMCGKPTQRWYHVPRSFLNASGHNTITLF 511
Query: 752 EETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQD 811
EE GGNP ++ K VC + E + ++ L C +
Sbjct: 512 EEMGGNPSMVNFKTVVVGTVCARAHEHN------------------------KVELSCHN 547
Query: 812 GYIISSIEFASYGTPQGRCQKFSRGNCHA 840
IS+++FAS+G P G C F+ G C
Sbjct: 548 -RPISAVKFASFGNPLGHCGSFAVGTCQG 575
>gi|414888319|tpg|DAA64333.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
gi|414888320|tpg|DAA64334.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
Length = 592
Score = 466 bits (1198), Expect = e-128, Method: Compositional matrix adjust.
Identities = 232/549 (42%), Positives = 337/549 (61%), Gaps = 29/549 (5%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD R+++IDG R + S IHYPR+ PE+WP LI ++KEGG + IETY+FWNAHE
Sbjct: 36 VTYDGRSLMIDGKRDLFFSGAIHYPRSPPEVWPKLIERAKEGGLNTIETYIFWNAHEPEP 95
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G+YNF+G+ D++K++K++ +Y +RIGP++ AEWN GG P WLR+I I FR NN P
Sbjct: 96 GKYNFEGRFDLIKYLKMIQEHDMYAIVRIGPFIQAEWNHGGLPYWLREIDHIIFRANNDP 155
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
+K+EM++FV+ IV +++ LF+ QGGPII+ QIENEYGN++ + G Y++WAA MA
Sbjct: 156 YKKEMEKFVRFIVQKLKDAELFASQGGPIILTQIENEYGNIKKDHATDGDKYLEWAAQMA 215
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
L GVPW+MCKQ+ AP +I CNG +C D + NKP LWTENW + +G ++
Sbjct: 216 LSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWTLRDKNKPMLWTENWTQQFRAYGDQV 275
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
R ED+A+AV RFF +GGS +NYYMY GGTNFGRT G + +T Y +AP+DEYG+
Sbjct: 276 AMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMYK 334
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPK+GHL+DLH I+ + A + + I LG EAH++ ++ C +FL+N +
Sbjct: 335 EPKFGHLRDLHNVIRSYQKAFLLGKHSSEI-LGHGYEAHIFEL---PEENLCLSFLSNNN 390
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQ 465
+V F G+ + +P SVSIL C+N V+NT +V Q + +
Sbjct: 391 TGEDGTVIFRGEKHYVPSRSVSILAGCKNVVYNTKRVFVQHN-----------------E 433
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDD 525
+S S+++S + W E I + + ++ LE N TKD SDYLW+ T + D
Sbjct: 434 RSYHTSEVTSKNNQWEMYSEKIPKYRDTKVRMKEPLEQFNQTKDASDYLWYTTSFRLESD 493
Query: 526 DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG-HWVK---VVQPVEFQSGYNDLI 581
D+ F N++RP + + S + F N G G VK +PV+ + G N ++
Sbjct: 494 DLPF--RNDIRPVLQVKSSAHSMMGFANDAFVGCARGSKQVKGFMFEKPVDLKVGVNHVV 551
Query: 582 LLSQTVGLQ 590
LLS T+G++
Sbjct: 552 LLSSTMGMK 560
>gi|330804272|ref|XP_003290121.1| hypothetical protein DICPUDRAFT_48969 [Dictyostelium purpureum]
gi|325079786|gb|EGC33370.1| hypothetical protein DICPUDRAFT_48969 [Dictyostelium purpureum]
Length = 735
Score = 462 bits (1189), Expect = e-127, Method: Compositional matrix adjust.
Identities = 268/739 (36%), Positives = 402/739 (54%), Gaps = 66/739 (8%)
Query: 45 FNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHE- 103
N++YDHR++II+G R++L+S +HYPRA+ W +++ SK G D+IETY+FWN H+
Sbjct: 40 LNITYDHRSLIINGERKLLVSGSVHYPRASVSKWNEILKSSKLAGVDIIETYIFWNVHQP 99
Query: 104 SIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTN 163
+ ++ + +I F+ L + L++ LRIGPYVCAEWN+GGFP+WL++I GI FR
Sbjct: 100 NTPNEFYLEDNANITLFLDLCKENELFVNLRIGPYVCAEWNYGGFPIWLKNIEGIVFRDY 159
Query: 164 NAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAA 223
N PF + M +V +VD +++ F+ GGPII+ QIENEYG +E+ YG G++Y WA
Sbjct: 160 NQPFMDAMSTWVTMVVDKLQD--YFAPNGGPIIIAQIENEYGWLENEYGASGREYALWAI 217
Query: 224 SMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYN----KPTLWTENWDGWYT 279
+ A L G+PW+MC Q D ++ I+ CNG+YC + +N +P WTENW GW+
Sbjct: 218 NFAKSLNIGIPWIMCAQEDI-DSAINTCNGFYCHDWIDRHWNAFPDQPAFWTENWVGWFE 276
Query: 280 TWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPID 339
WG +P RPV+D+ F+ ARF GGS NYYM+FGGTNFGR+ GGP+ ITSY+YDAP+D
Sbjct: 277 NWGQAVPKRPVQDMLFSSARFIAYGGSLFNYYMWFGGTNFGRSVGGPWIITSYEYDAPLD 336
Query: 340 EYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSA 399
E+G +EPK+ H I E ++ D + L EAH Y +
Sbjct: 337 EFGFPNEPKYSMSTQFHFVIHKYESIIMGMDPPTPVPLSNISEAHPYGEDL--------V 388
Query: 400 FLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSP 459
FL N + + G +YTL PWSV I+ + VF+T+ V + +
Sbjct: 389 FLTNFG-LVIDYIQWQGTNYTLQPWSVVIVY-SGSVVFDTSYVPDEYIKPSTRDQFK--- 443
Query: 460 NISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQ 519
VP +S LS + + + NN + LE +N+T D +DYLW+ T
Sbjct: 444 --DVPNAINYDSILSFSEWGQSDIINDCII---NN---ESPLEQINLTNDTTDYLWYTTN 495
Query: 520 IYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKV----VQPVEFQS 575
I +++ T+TI++M D VF+NG G+ W V ++P
Sbjct: 496 ITLNE-----------TTTLTIENMYDFCHVFLNGAYQGN---GWSPVAYITLEPTNGNI 541
Query: 576 GYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQ 635
Y L +L+ T+GL+NY A +E G G + L G +++ W+ + G+ GE
Sbjct: 542 NY-QLQILTMTMGLENYAAHMESYSRGLLGSISL-----GQTNITNNQWSMKPGILGEKL 595
Query: 636 QIYS-IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGI--DPVA----LDLGSMGKGQAW 688
QIY+ ++ W S TWY+ + + DG+ DP + L++ SM KG +
Sbjct: 596 QIYNEYSSSKVNWQPYNPSATQS-MTWYQ-FNISLDGLSSDPSSNAYVLNMTSMNKGFVY 653
Query: 689 VNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNN-- 746
VNG +IGRY+ + A + C DY G Y +C P+Q+ YH+P WL +
Sbjct: 654 VNGFNIGRYFLMEATQSNCTLKQDYIGIYTPSNNRIDCNEPSQSLYHIPLDWLFLQQDKQ 713
Query: 747 --LLVIFEETGGNPFEISV 763
+++FEE G+P +I +
Sbjct: 714 YATVILFEEVNGDPTKIQL 732
>gi|413922056|gb|AFW61988.1| hypothetical protein ZEAMMB73_453254 [Zea mays]
Length = 326
Score = 461 bits (1186), Expect = e-127, Method: Compositional matrix adjust.
Identities = 204/296 (68%), Positives = 242/296 (81%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYDHRA++I+G RR+LIS IHYPR+TPEMWP L+ K+K+GG DV++TYVFWN HE +R
Sbjct: 28 VSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTYVFWNGHEPVR 87
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQY F + D+V+FVKL +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N P
Sbjct: 88 GQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGP 147
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK MQ FV+KIV +M+ E LF WQGGPII+ Q+ENEYG MES G K Y WAA MA
Sbjct: 148 FKAAMQAFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGAGAKPYANWAAKMA 207
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+ GAGVPWVMCKQ DAP+ +I+ CNG+YCD + PNS +KPT+WTE W GW+T +GG +P
Sbjct: 208 VATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNSNSKPTMWTEAWTGWFTAFGGAVP 267
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYG 342
HRPVED+AFAVARF Q+GGSF+NYYMY GGTNF RTSGGPF TSYDYDAPIDEYG
Sbjct: 268 HRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYG 323
>gi|66808929|ref|XP_638187.1| glycoside hydrolase family 35 protein [Dictyostelium discoideum
AX4]
gi|74853739|sp|Q54MV6.1|BGAL2_DICDI RecName: Full=Probable beta-galactosidase 2; Short=Lactase 2;
Flags: Precursor
gi|60466604|gb|EAL64656.1| glycoside hydrolase family 35 protein [Dictyostelium discoideum
AX4]
Length = 761
Score = 459 bits (1182), Expect = e-126, Method: Compositional matrix adjust.
Identities = 274/754 (36%), Positives = 405/754 (53%), Gaps = 62/754 (8%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHE-SI 105
V+YD R++II+G R++L S IHYPR + EMWP ++ +SK+ G D+I+TY+FWN H+ +
Sbjct: 40 VTYDGRSLIINGERKLLFSGSIHYPRTSEEMWPIILKQSKDAGIDIIDTYIFWNIHQPNS 99
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+Y F G +I KF+ L LY+ LRIGPYVCAEW +GGFP+WL++IP I +R N
Sbjct: 100 PSEYYFDGNANITKFLDLCKEFDLYVNLRIGPYVCAEWTYGGFPIWLKEIPNIVYRDYNQ 159
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
+ EM +++ +V + + F+ GGPII+ Q+ENEYG +E YG G +Y KW+
Sbjct: 160 QWMNEMSIWMEFVVKYL--DNYFAPNGGPIILAQVENEYGWLEQEYGINGTEYAKWSIDF 217
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSY----NKPTLWTENWDGWYTTW 281
A L G+PW+MC+Q D E+ I+ CNGYYC + + + N+P+ WTENW GW+ W
Sbjct: 218 AKSLNIGIPWIMCQQNDI-ESAINTCNGYYCHDWISSHWEQFPNQPSFWTENWIGWFENW 276
Query: 282 GGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEY 341
G P RPV+D+ ++ ARF GGS +NYYM+FGGTNFGRTSGGP+ ITSYDYDAP+DE+
Sbjct: 277 GQAKPKRPVQDILYSNARFIAYGGSLINYYMWFGGTNFGRTSGGPWIITSYDYDAPLDEF 336
Query: 342 GLLSEPKWGHLKDLHAAIKLCEPALVAADSAQY-IKLGQNQEAHVYRANRYGSQSNCSAF 400
G +EPK+ H + E L+ + L Q E H Y N +F
Sbjct: 337 GQPNEPKFSLSSKFHQVLHAIESDLLNNQPPKSPTFLSQFIEVHQYGINL--------SF 388
Query: 401 LANIDEHTAASVT-FLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSI--KTVEFSLPL 457
+ N T + ++ Q+YT+ PWSV I+ + +F+T+ + T T+ P+
Sbjct: 389 ITNYGTSTTPKIIQWMNQTYTIQPWSVLIIYN-NEILFDTSFIPPNTLFNNNTINNFKPI 447
Query: 458 SPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHI 517
+ NI + + L+S + S +E L +TKD SDY W+
Sbjct: 448 NQNIIQSIFQISDFNLNSGGGGGDGDGNSVNSVSP--------IEQLLITKDTSDYCWYS 499
Query: 518 TQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQ--PVEFQS 575
T V+ +S+ + + T+T D + +FI+ + GS + +Q P+ +
Sbjct: 500 TN--VTTTSLSYNEKGNIFLTIT--EFYDYVHIFIDNEYQGSAFSPSLCQLQLNPINNST 555
Query: 576 GYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQ 635
+ L +LS T+GL+NY + +E G G + + G +L+ W + GL GE
Sbjct: 556 TF-QLQILSMTIGLENYASHMENYTRGILGSILI-----GSQNLTNNQWLMKSGLIGENI 609
Query: 636 QIYSIEENEAEWTDLTRDG----IPSTFTWYK---TYFDAPDGIDPV--ALDLGSMGKGQ 686
+I++ +N W I TWYK + P I ALD+ SM KG
Sbjct: 610 KIFN-NDNTINWQTSPSSSSSSLIQKPLTWYKLNISLVGLPIDISSTVYALDMSSMNKGM 668
Query: 687 AWVNGHHIGRYWTVVAPKGGCQDTC----DYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQ 742
WVNG+ IGRYW + A + C + Y G Y+ +C P+Q+ Y VP WL
Sbjct: 669 IWVNGYSIGRYWLIEATQSICNQSAIENYSYIGEYDPSNYRIDCNKPSQSIYSVPIDWLF 728
Query: 743 ASN-----NLLVIFEETGGNPFEISVKLRSTRIV 771
+N ++I EE GNP EI +L S +I+
Sbjct: 729 NNNYNNQYATIIIIEELNGNPNEI--QLLSNKII 760
>gi|255550371|ref|XP_002516236.1| beta-galactosidase, putative [Ricinus communis]
gi|223544722|gb|EEF46238.1| beta-galactosidase, putative [Ricinus communis]
Length = 775
Score = 449 bits (1155), Expect = e-123, Method: Compositional matrix adjust.
Identities = 250/611 (40%), Positives = 340/611 (55%), Gaps = 59/611 (9%)
Query: 248 IDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
I+ CNGYYCD +KPN+ P ++TENW GWY WGG+ +R ED+AF+VARF Q GG F
Sbjct: 164 INTCNGYYCDTFKPNNPKSPKMFTENWSGWYKLWGGKTSYRTAEDMAFSVARFVQAGGVF 223
Query: 308 MNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALV 367
NYYMY+GGTNFGRT+GGP+ SYDYD+P+DEYG L++PKWGHLK LHA+IKL E +
Sbjct: 224 NNYYMYYGGTNFGRTAGGPYITASYDYDSPLDEYGNLNQPKWGHLKQLHASIKLGEKIIT 283
Query: 368 AAD-SAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSV 426
+ + + G + A+ A R + C FL+NI+ A +YT+P WSV
Sbjct: 284 NGTVTIKNFQAGVDLTAYTNNATR---ERFC--FLSNINIADAHIDLQQDGNYTIPAWSV 338
Query: 427 SILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEP 486
SIL +C +FNTAKV++QTS+ + P T+ SW+ EP
Sbjct: 339 SILQNCSKEIFNTAKVNTQTSLMVKKLYENDKP----------------TNLSWVWAPEP 382
Query: 487 IG--VWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSM 544
+ + + F +L+ T D SDYLW++T ++ + + + T+ + S
Sbjct: 383 MKDTLLGKGRFRTSQLLDQKETTVDASDYLWYMTSFDMNKNTLQW-----TNVTLRVTSR 437
Query: 545 RDVLRVFINGQL-TGS--VIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGA 601
VL ++N +L GS VI +PV + G N + LLS TVGL NYG+F +K
Sbjct: 438 GHVLHAYVNKKLIVGSQLVIQGEFTFEKPVTLKPGNNVISLLSATVGLANYGSFFDKTPV 497
Query: 602 GF-RGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
G G V+L +DLS LW+Y++GL GE ++ Y +W+ T
Sbjct: 498 GIVDGPVQLMANGKPVMDLSSNLWSYKIGLNGEAKRFYDPTSRHNKWSAANGVSTARPMT 557
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNS 719
WYKT F +P G DPV +DL MGKG AW NG +GRYW + +A GC TCDYRG YN+
Sbjct: 558 WYKTTFSSPSGTDPVVVDLQGMGKGHAWANGKSLGRYWPSQIANANGCSGTCDYRGPYNA 617
Query: 720 DKCTTNCGNPTQTWYHVPRSWLQAS-NNLLVIFEETGGNPFEISVKLRSTRIVCEQVSES 778
KCT NCG PTQ WYHVPRS+L ++ N L++FEE GG+P IS ++ +T +C E
Sbjct: 618 GKCTRNCGIPTQRWYHVPRSFLNSNGKNTLILFEEVGGDPSGISFQIVTTETICGNAYE- 676
Query: 779 HYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNC 838
+ L CQ G IS I+FASYG PQG C F +G+
Sbjct: 677 -----------------------GSTLELSCQGGRTISEIQFASYGNPQGTCSSFKKGSF 713
Query: 839 HAPMSLSVVSE 849
A S+ +V +
Sbjct: 714 DAMNSVQMVQK 724
Score = 181 bits (458), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 80/151 (52%), Positives = 111/151 (73%), Gaps = 6/151 (3%)
Query: 24 MMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIA 83
++++ L+ +S SA+T V YD A+II+G R+++ S IHYPR+TPEMWP+LI
Sbjct: 8 IVLISTLALLSLCSATT------VEYDSNALIINGERKIIFSGAIHYPRSTPEMWPELIN 61
Query: 84 KSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEW 143
K+K+GG D IETYVFW+ HE +R QY+F G DIVKF +++ +GLY+ LRIGPYVCAEW
Sbjct: 62 KAKDGGLDAIETYVFWDRHEPVRRQYDFSGNLDIVKFFRVIQEAGLYVILRIGPYVCAEW 121
Query: 144 NFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
N+GGFP+WL + PG+E RT+N +K + F
Sbjct: 122 NYGGFPMWLHNTPGVELRTDNEIYKVPLLIF 152
>gi|328873276|gb|EGG21643.1| hypothetical protein DFA_01529 [Dictyostelium fasciculatum]
Length = 827
Score = 448 bits (1153), Expect = e-123, Method: Compositional matrix adjust.
Identities = 265/769 (34%), Positives = 417/769 (54%), Gaps = 50/769 (6%)
Query: 11 LQCLALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHY 70
L+ ++ +Y + ++++I + V S + VSYD+RAIII+G R++L SA IHY
Sbjct: 3 LKISSIFLYISIFLILLIFPNYVLSDKLT-------VSYDNRAIIINGERKLLYSASIHY 55
Query: 71 PRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLY 130
PR+T MWPD++ ++K G + IETY+FWN H+ Y+F+G +D+ F+ L G +
Sbjct: 56 PRSTRTMWPDILKRTKAAGINTIETYIFWNLHQPTPDTYDFEGSSDVKHFLDLCKEEGFH 115
Query: 131 LQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSW 190
+ +R GPYVCAEWN GG P WL+ +PGI +RT+N PF EM++++ IV + + ++
Sbjct: 116 VIVRFGPYVCAEWNNGGLPSWLKAVPGIVYRTHNEPFMREMKKWMDYIVHYLSD--YYAP 173
Query: 191 QGGPIIMLQIENEYGNMESSYGQQ-GKDYVKWAASMALGLGAGVPWVMCKQTDAPENIID 249
GGPIIM QIENEYG +E Y +Q G +YV WA +A G+PW+MC+Q + ++I+
Sbjct: 174 NGGPIIMAQIENEYGWLEYEYREQGGPEYVDWAVKLAKSYNTGIPWIMCQQ-NTRSDVIN 232
Query: 250 ACNGYYCDG---YKPNSY-NKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGG 305
CNG+YC Y ++ ++P +TE W GW + P RP D+ ++ ARF+ RGG
Sbjct: 233 TCNGFYCHDWLQYHQRTFPDQPAFFTELWTGWPQYFEEGFPTRPTVDVLYSAARFYSRGG 292
Query: 306 SFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPA 365
+NYYM+ GGT FGR + PF TSYDYDAP+DEYG EPK+ L LH ++
Sbjct: 293 GMVNYYMWHGGTTFGRFT-SPFLTTSYDYDAPLDEYGFPQEPKYSMLTKLHVTLEKYSSV 351
Query: 366 LVAADSA--QYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPP 423
++ + Y+ E Y+ + + FL N D+ A V G++ +
Sbjct: 352 ILHDPNVPPPYVFPDNTVEMIEYKKD-----AESVVFLVNWDDTFAKQVDMNGKNVKINQ 406
Query: 424 WSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTV 483
WSV I + VF+T ++ + + F ++ + + L + SW
Sbjct: 407 WSVQIYYN-NELVFDTFEIPANLTRPNPPFKPIAKTSLDATAAATSRTGLVNLVSSW--- 462
Query: 484 KEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDS 543
EP + N + Q L +T D SDY+W+ T+I ++ KT+E+ + +
Sbjct: 463 NEPFSFLTYNA-SSQTPTAQLKLTGDNSDYIWYETEIDLT-------KTDEI---LYLYK 511
Query: 544 MRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGF 603
D VF++GQ G ++ +F G + L +L +G+ +YGA +E+ G
Sbjct: 512 SYDFSYVFVDGQFLYWHRGSPIQAYFNGKFPVGKHTLQILCAAMGVPSYGAHIEQHERGL 571
Query: 604 RGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYK 663
G + L G +++ W + L GE +++ + +W+ +++ S TWYK
Sbjct: 572 TGDIFL-----GSKNITDNGWKMRPFLSGELLGLHA-SPSTVKWSPVSKGTAGSGVTWYK 625
Query: 664 TYFDAPDGID--PVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDK 721
P D ALDL SM KG +VNG+ IGRYW KG C++ C+ G Y++
Sbjct: 626 FNVKTPSFEDGPAFALDLKSMWKGLVFVNGNSIGRYWVA---KGWCEEKCNQTGLYDNYG 682
Query: 722 CTTNCGNPTQTWYHVPRSWL-QASNNLLVIFEETGGNPFEISVKLRSTR 769
C NCG +Q +YHVP+ +L ++S+N ++IFEE G+P+ I + R+T
Sbjct: 683 CRENCGESSQRYYHVPKDFLKESSDNEVIIFEELQGDPYSIELVQRNTE 731
>gi|449436076|ref|XP_004135820.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 486
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 212/343 (61%), Positives = 260/343 (75%), Gaps = 10/343 (2%)
Query: 20 PMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWP 79
P +++ + L+ V S+ S V+YDH+AIII+G RR+LIS IHYPR+TP+MWP
Sbjct: 2 PKTVLLFLCLLTWVCSTIGS-------VTYDHKAIIINGRRRILISGSIHYPRSTPQMWP 54
Query: 80 DLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYV 139
DLI K+K+GG D+IETYVFWN HE G+Y F+ + D+V+F+KLV +GLY+ LRIGPYV
Sbjct: 55 DLIQKAKDGGLDIIETYVFWNGHEPSPGKYYFEERYDLVRFIKLVQQAGLYVHLRIGPYV 114
Query: 140 CAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQ 199
CAEWN+GGFP+WL+ +PGI FRT+NAPFK MQ+FV KIVD+M+ E LF QGGPII+ Q
Sbjct: 115 CAEWNYGGFPIWLKFVPGIAFRTDNAPFKAAMQKFVYKIVDMMKWEKLFHTQGGPIILSQ 174
Query: 200 IENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGY 259
IENEYG +E G GK Y KWAA MA+GL GVPWVMCKQ DAP+ +ID CNG+YC+ +
Sbjct: 175 IENEYGPVEWEIGAPGKSYTKWAAQMAVGLKTGVPWVMCKQEDAPDPLIDTCNGFYCENF 234
Query: 260 KPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNF 319
KPN KP +WTENW GWYT +GG P+RP ED+AF+VARF Q GGS +NYYMY GGTNF
Sbjct: 235 KPNQIYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQNGGSLVNYYMYHGGTNF 294
Query: 320 GRTSGGPFYITSYDYDAPIDEYGLLSEPKWG--HLKDLHAAIK 360
GRTS G F TSYD+DAPIDEYGLL EP G LK L+ +
Sbjct: 295 GRTS-GLFVTTSYDFDAPIDEYGLLREPILGPVTLKGLNEGTR 336
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 4/165 (2%)
Query: 605 GQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYK 663
G V L G G D+SK W+Y+VGL+GE +YS++ N +W + P TWYK
Sbjct: 324 GPVTLKGLNEGTRDMSKYKWSYKVGLRGEILNLYSVKGSNSVQWMKGSFQKQP--LTWYK 381
Query: 664 TYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCT 723
T F+ P G +P+ALD+ SM KGQ WVNG IGRY+ +G C + C Y G + KC
Sbjct: 382 TTFNTPAGNEPLALDMSSMSKGQIWVNGRSIGRYFPGYIARGKC-NKCSYTGFFTEKKCL 440
Query: 724 TNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRST 768
NCG P+Q WYH+PR WL + NLL+I EE GGNP IS+ R+
Sbjct: 441 WNCGGPSQKWYHIPRDWLSPNGNLLIILEEIGGNPQGISLVKRTA 485
>gi|328872959|gb|EGG21326.1| glycoside hydrolase family 35 protein [Dictyostelium fasciculatum]
Length = 759
Score = 445 bits (1145), Expect = e-122, Method: Compositional matrix adjust.
Identities = 283/754 (37%), Positives = 397/754 (52%), Gaps = 80/754 (10%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V YD R++ I+G R+++IS IHYPR+TP MWP LI KSK+ G ++IETYVFWN H+
Sbjct: 46 VEYDQRSLKINGERKLMISGSIHYPRSTPSMWPSLIKKSKDAGINMIETYVFWNLHQPNN 105
Query: 107 GQ-YNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
Q YNF+G +I F+ L GLY+ LRIGPYVCAEWN+GG P WLR+IPGI FR N
Sbjct: 106 SQEYNFEGNANITHFLDLCQQEGLYVHLRIGPYVCAEWNYGGIPSWLRNIPGIVFRDYNQ 165
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
P+ EM ++ IV+ ++ F+ GGPII+ Q+ENEYG +E+ YG GK Y +WA S
Sbjct: 166 PWMTEMASWMTFIVNYLKP--YFASNGGPIILAQVENEYGWLENEYGDSGKLYAEWAISF 223
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSY----NKPTLWTENWDGWYTTW 281
A L G+PW MC+Q D ++ I+ CNG+YC + + N+P +TENW GW +
Sbjct: 224 AKSLNIGIPWTMCQQNDI-DDAINTCNGFYCHDWIQYHFQVYPNQPAFFTENWAGWIQYY 282
Query: 282 GGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEY 341
+PHRP EDL ++VAR+F RGGS MNYYM+ GGT F R S F SYDYDA +DEY
Sbjct: 283 SEGVPHRPTEDLLYSVARWFSRGGSLMNYYMWHGGTTFARYS-STFLTNSYDYDAALDEY 341
Query: 342 GLLSEPKWGHLKDLHAAIKLCEPALVAA-DSAQYIKLGQNQEAHVYRANRYGSQSNCS-- 398
G +EPK+ L LH+ + L+++ + A+ + + + +Y + N +
Sbjct: 342 GYEAEPKYSALAQLHSVLSQYSYILLSSGEVARPVNISNITTCNTIEIIQYNTTINGTLE 401
Query: 399 --AFLANIDEHTAASV--TFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFS 454
F+ N ++A V + GQ+ T+ PWSV IL + TV +T+ V Q S + EF
Sbjct: 402 TITFVTNFGVSSSAPVQLNWNGQTITVNPWSVLILYN-NQTVIDTSYVKQQYSAQK-EFY 459
Query: 455 LPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGI-LEHLNVTKDYSDY 513
Q +++ L S SW EPIGV + +N + E L++T D +DY
Sbjct: 460 ----------QSKRVKNVLVS---SW---TEPIGVGNYSNVVTANLPSEQLDLTLDQTDY 503
Query: 514 LWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEF 573
L + D++ ++I+G+ G V +F
Sbjct: 504 LCN---------------------------ADDMIYIYIDGEYQSWSRGSPAHFVLDTKF 536
Query: 574 QSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE 633
G + L +LS T+GL +YG+ E G G V L G D++ W+ + L GE
Sbjct: 537 GIGTHKLSILSLTMGLISYGSHFESYKRGLNGTVTL-----GTQDITNNGWSMRPYLVGE 591
Query: 634 FQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPV---ALDLGSMGKGQAWVN 690
Q I S + W+ I TWYK I ALD+ M KG VN
Sbjct: 592 MQGIQS-NPHLTSWSINNELSINQPLTWYKLNLIIQSEIQDTSSFALDMIGMNKGFIIVN 650
Query: 691 GHHIGRYWTVVAPKGGCQDTCDYRG-AYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLL- 748
G+ IGRYW + GC C+Y G Y C T CG P++ +YHVP +L N L
Sbjct: 651 GNSIGRYWLTLG--WGCGSGCNYTGDGYQGYLCRTGCGEPSERYYHVPNDYLYLEPNQLN 708
Query: 749 --VIFEETGGNPFEISVKLRSTRIVCEQVSESHY 780
++FEE G+P I + R V Q+ ++ Y
Sbjct: 709 EIIVFEELSGDPNSIQL---VQRYVPYQLDQTDY 739
>gi|226532830|ref|NP_001140495.1| uncharacterized protein LOC100272556 precursor [Zea mays]
gi|194699714|gb|ACF83941.1| unknown [Zea mays]
gi|195659509|gb|ACG49222.1| hypothetical protein [Zea mays]
gi|414881558|tpg|DAA58689.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 346
Score = 441 bits (1133), Expect = e-120, Method: Compositional matrix adjust.
Identities = 199/320 (62%), Positives = 247/320 (77%), Gaps = 3/320 (0%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
+YD +A++++G RR+L+S IHYPR+ PEMWPDLI K+K+GG DV++TYVFWN HE R
Sbjct: 30 TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
QY F+G+ D+V F+KLV +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N PF
Sbjct: 90 QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPF 149
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
K EMQ F KIVD+M+ E LF WQGGPII+ QIENE+G +E G+ K Y WAA+MA+
Sbjct: 150 KAEMQNFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 209
Query: 228 GLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPH 287
L VPWVMCK+ DAP+ II+ CNG+YCD + PN +KPT+WTE W WYT +G +PH
Sbjct: 210 ALNTSVPWVMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPH 269
Query: 288 RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEP 347
RPVEDLA+ VA+F Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYG L+
Sbjct: 270 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGELNTF 329
Query: 348 KWGHLKDLHAAIKLCEPALV 367
+G HA L +P L+
Sbjct: 330 YFG---KRHALYSLHQPPLM 346
>gi|449468694|ref|XP_004152056.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 338
Score = 439 bits (1130), Expect = e-120, Method: Compositional matrix adjust.
Identities = 196/331 (59%), Positives = 252/331 (76%), Gaps = 1/331 (0%)
Query: 33 VSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADV 92
V++ + TF NVSYD A+II+G RR++ S IHYPR+T MWPDLI K+K+GG D
Sbjct: 8 VATLACLTFCIGDNVSYDSNALIINGERRIIFSGSIHYPRSTEAMWPDLIQKAKDGGLDA 67
Query: 93 IETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWL 152
IETY+FW+ HE R +Y+F G+ D +KF +L+ +GLY+ +RIGPYVCAEWN+GGFPVWL
Sbjct: 68 IETYIFWDRHEPQRRKYDFSGRLDFIKFFQLIQDAGLYVVMRIGPYVCAEWNYGGFPVWL 127
Query: 153 RDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGN-MESSY 211
++PGI+ RTNN +K EMQ F KIV++ ++ LF+ QGGPII+ QIENEYGN M +Y
Sbjct: 128 HNMPGIQLRTNNQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPAY 187
Query: 212 GQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWT 271
G GK Y+ W A MA L GVPW+MC+Q+DAP+ +I+ CNG+YCD + PN+ P ++T
Sbjct: 188 GDAGKAYINWCAQMAESLNIGVPWIMCQQSDAPQPMINTCNGFYCDNFTPNNPKSPKMFT 247
Query: 272 ENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITS 331
ENW GW+ WG + P+R ED+AF+VARFFQ GG F NYYMY GGTNFGRTSGGPF TS
Sbjct: 248 ENWVGWFKKWGDKDPYRTAEDVAFSVARFFQSGGVFNNYYMYHGGTNFGRTSGGPFITTS 307
Query: 332 YDYDAPIDEYGLLSEPKWGHLKDLHAAIKLC 362
YDY+AP+DEYG L++PKWGHLK LHA+I +C
Sbjct: 308 YDYNAPLDEYGNLNQPKWGHLKQLHASIXIC 338
>gi|238009746|gb|ACR35908.1| unknown [Zea mays]
Length = 346
Score = 438 bits (1127), Expect = e-120, Method: Compositional matrix adjust.
Identities = 198/320 (61%), Positives = 246/320 (76%), Gaps = 3/320 (0%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
+YD +A++++G RR+L+S IHYPR+ PEMWPDLI K+K+GG DV++TYVFWN HE R
Sbjct: 30 TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
QY F+G+ D+V F+KLV +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI RT+N PF
Sbjct: 90 QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISLRTDNEPF 149
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
K EMQ F KIVD+M+ E LF WQGGPII+ QIENE+G +E G+ K Y WAA+MA+
Sbjct: 150 KAEMQNFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 209
Query: 228 GLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPH 287
L VPWVMCK+ DAP+ II+ CNG+YCD + PN +KPT+WTE W WYT +G +PH
Sbjct: 210 ALNTSVPWVMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPH 269
Query: 288 RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEP 347
RPVEDLA+ VA+F Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYG L+
Sbjct: 270 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGELNTF 329
Query: 348 KWGHLKDLHAAIKLCEPALV 367
+G HA L +P L+
Sbjct: 330 YFG---KRHALYSLHQPPLM 346
>gi|413954365|gb|AFW87014.1| beta-galactosidase [Zea mays]
Length = 473
Score = 433 bits (1114), Expect = e-118, Method: Compositional matrix adjust.
Identities = 237/505 (46%), Positives = 299/505 (59%), Gaps = 37/505 (7%)
Query: 269 LWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY 328
+WTE W GW+T +GG +PHRPVED+AFAVARF Q+GGSF+NYYMY GGTNF RTSGGPF
Sbjct: 1 MWTEAWTGWFTAFGGAVPHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFI 60
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRA 388
TSYDYDAPIDEYGLL +PKWGHL+DLH AIK EPALV+ D LG ++A+V+++
Sbjct: 61 ATSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKQAEPALVSGDPT-IQSLGNYEKAYVFKS 119
Query: 389 NRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSI 448
S C+AFL+N AA V F G+ Y LP WS+S+LPDC+ VFNTA VS
Sbjct: 120 ----SGGACAAFLSNYHTSAAARVVFNGRRYDLPAWSISVLPDCKAAVFNTATVSE---- 171
Query: 449 KTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTK 508
P +P P SW + E FT G++E L++T
Sbjct: 172 -------PSAPARMSPAGGF----------SWQSYSEATNSLDGRAFTKDGLVEQLSMTW 214
Query: 509 DYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW---- 564
D SDYLW+ T + ++ ++ F K+ + P +TI S L+VF+NGQ G+V G +
Sbjct: 215 DKSDYLWYTTYVNINSNE-QFLKSGQ-WPQLTIYSAGHSLQVFVNGQSYGAVYGGYDSPK 272
Query: 565 VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILW 624
+ V+ G N + +LS VGL N G E G G V L+G G DLS W
Sbjct: 273 LTYSGYVKMWQGSNKISILSAAVGLPNQGTHYETWNVGVLGPVTLSGLNEGKRDLSDQKW 332
Query: 625 TYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMG 683
TYQ+GL GE + S+ + EW TW+K YF AP G PVALD+GSMG
Sbjct: 333 TYQIGLHGESLGVQSVAGSSSVEWGSAAGK---QPLTWHKAYFSAPSGDAPVALDMGSMG 389
Query: 684 KGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQA 743
KGQAWVNG HIGRYW+ A GC C Y G Y+ KC T CG+ +Q +YHVPRSWL
Sbjct: 390 KGQAWVNGRHIGRYWSYKASSSGC-GGCSYAGTYSETKCQTGCGDVSQRYYHVPRSWLNP 448
Query: 744 SNNLLVIFEETGGNPFEISVKLRST 768
S NLLV+ EE GG+ + + R+
Sbjct: 449 SGNLLVMLEEFGGDLSGVKLVTRTA 473
>gi|414881559|tpg|DAA58690.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 342
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 196/320 (61%), Positives = 244/320 (76%), Gaps = 7/320 (2%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
+YD +A++++G RR+L+S IHYPR+ PEMWPDLI K+K+GG DV++TYVFWN HE R
Sbjct: 30 TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
QY F+G+ D+V F+KLV +GLY+ LRIGPYVCAEWNFGGFPVWL+ +PGI FRT+N PF
Sbjct: 90 QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPF 149
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
K F KIVD+M+ E LF WQGGPII+ QIENE+G +E G+ K Y WAA+MA+
Sbjct: 150 KN----FTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 205
Query: 228 GLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPH 287
L VPWVMCK+ DAP+ II+ CNG+YCD + PN +KPT+WTE W WYT +G +PH
Sbjct: 206 ALNTSVPWVMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPH 265
Query: 288 RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEP 347
RPVEDLA+ VA+F Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYG L+
Sbjct: 266 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGELNTF 325
Query: 348 KWGHLKDLHAAIKLCEPALV 367
+G HA L +P L+
Sbjct: 326 YFG---KRHALYSLHQPPLM 342
>gi|3850659|emb|CAA10064.1| beta galactosidase [Carica papaya]
Length = 347
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 210/373 (56%), Positives = 261/373 (69%), Gaps = 27/373 (7%)
Query: 146 GGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYG 205
GGFPVWL+ +PGI FRT+N PFK MQ+F +KIV +M+ E LF QGGPII+ QIENE+G
Sbjct: 1 GGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQTQGGPIILSQIENEFG 60
Query: 206 NMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYN 265
+E G GK Y KWAA MA+GL GVPW+MCKQ DAP+ +ID CNG+YC+ +KPN
Sbjct: 61 PVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDY 120
Query: 266 KPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG 325
KP +WTE W GWYT +GG +P RP ED+AF+VARF Q GGSF+NYYMY GGTNFGRT+GG
Sbjct: 121 KPKMWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQGGGSFLNYYMYHGGTNFGRTAGG 180
Query: 326 PFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHV 385
PF TSYDYDAP+DEYGL EPKWGHL+DLH AIK CE ALV+ D + KLG NQEAHV
Sbjct: 181 PFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKSCESALVSVDPS-VTKLGSNQEAHV 239
Query: 386 YRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQ 445
++ S+S+C+AFLAN D + V+F G Y LPPWS+SILPDC+ V+NTAKV SQ
Sbjct: 240 FK-----SESDCAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGSQ 294
Query: 446 TSIKTVEFSLPLSP-NISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHL 504
+S + ++P + P QS IE SS T+ G+ E +
Sbjct: 295 SS------QVQMTPVHSGFPWQSFIEETTSSDETD--------------TTTLDGLYEQI 334
Query: 505 NVTKDYSDYLWHI 517
N+T+D +DYLW++
Sbjct: 335 NITRDTTDYLWYM 347
>gi|297789001|ref|XP_002862517.1| hypothetical protein ARALYDRAFT_333310 [Arabidopsis lyrata subsp.
lyrata]
gi|297308086|gb|EFH38775.1| hypothetical protein ARALYDRAFT_333310 [Arabidopsis lyrata subsp.
lyrata]
Length = 534
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 232/517 (44%), Positives = 311/517 (60%), Gaps = 35/517 (6%)
Query: 342 GLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFL 401
GLL +PKWGHL+DLH AIKLCE AL+A D LG N EA VY+ + +C+AFL
Sbjct: 9 GLLRQPKWGHLRDLHKAIKLCEDALIATD-PTISSLGSNLEAAVYKT----ASGSCAAFL 63
Query: 402 ANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNI 461
AN+ + A+V+F G+SY LP WSVSILPDC+N FNTAK++S T
Sbjct: 64 ANVGTKSDATVSFNGESYHLPAWSVSILPDCKNVAFNTAKINSATE------------PT 111
Query: 462 SVPQQSMIESKLSSTS--KSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQ 519
+ +QS+ SS W +KEPIG+ + F G+LE +N T D SDYLW+ +
Sbjct: 112 AFARQSLKPDGGSSAELGSEWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLR 171
Query: 520 IYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQ---PVEFQSG 576
+ + D+ +F + + I+S+ V+ FING+L GS GH + + P+ +G
Sbjct: 172 MDIKGDE-TFLDEGS-KAVLHIESLGQVVYAFINGKLAGS--GHGKQKISLDIPINLVAG 227
Query: 577 YNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGD-IDLSKILWTYQVGLKGEFQ 635
N + LLS TVGL NYGAF + GAG G V L K G IDL+ WTYQVGLKGE
Sbjct: 228 KNTVDLLSVTVGLANYGAFFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDT 287
Query: 636 QIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
+ +++ +EW + WYKT FDAP G +PVA+D KG AWVNG IG
Sbjct: 288 GLGAVDS--SEWVSKSPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTVKGIAWVNGQSIG 345
Query: 696 RYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEET 754
RYW T +A GGC D+CDYRG+Y ++KC NCG P+QT YHVPRSWL+ S N LV+FEE
Sbjct: 346 RYWPTSIAGNGGCTDSCDYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNTLVLFEEM 405
Query: 755 GGNPFEISVKLRST-RIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQ-DG 812
GG+P +IS + T +C VS+SH PPV W++ + + N+ P + L C
Sbjct: 406 GGDPTQISFGTKQTGSNLCLTVSQSHPPPVDTWTSDSKISNR---NRTRPVLSLQCPVST 462
Query: 813 YIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+ISSI+FAS+GTP+G C F+ G+C++ SLS+V +
Sbjct: 463 QVISSIKFASFGTPKGTCGSFTSGSCNSSRSLSLVQK 499
>gi|218201568|gb|EEC83995.1| hypothetical protein OsI_30162 [Oryza sativa Indica Group]
Length = 1078
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 250/687 (36%), Positives = 347/687 (50%), Gaps = 90/687 (13%)
Query: 171 MQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLG 230
M++FV IV+ ++E LF+ QGGPII+ QIENEY ++E ++ + G Y+ WAA MA+
Sbjct: 426 MKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWAAKMAIATN 485
Query: 231 AGVPWVMCKQTDAPENIIDACNGYYCDGY--KPNSYNKPTLWTENWDGWYTTWGGRLPHR 288
GVPW+MCKQT AP +I CNG +C P KP LWTENW Y +G R
Sbjct: 486 TGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPADKKKPLLWTENWTAQYRVFGDPPSQR 545
Query: 289 PVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPK 348
ED+AF+VARFF GG+ NYYMY GGTNFGR +G F + Y +AP+DE+GL EPK
Sbjct: 546 SAEDIAFSVARFFSVGGTMANYYMYHGGTNFGR-NGAAFVMPRYYDEAPLDEFGLYKEPK 604
Query: 349 WGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHT 408
WGHL+DLH A++ C+ AL+ + + LG+ EA V+ ++ C AFL+N +
Sbjct: 605 WGHLRDLHHALRHCKKALLWGNPS-VQPLGKLYEARVFEMK---EKNVCVAFLSNHNTKE 660
Query: 409 AASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSM 468
+VTF GQ Y + S+SIL DC+ VF+T V+SQ + +T F+ Q ++
Sbjct: 661 DGTVTFRGQKYFVARRSISILADCKTVVFSTQHVNSQHNQRTFHFA------DQTVQDNV 714
Query: 469 IESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDIS 528
E M +E I +S+ + Q LE N TKD +DYLW+ T + DD+
Sbjct: 715 WE----------MYSEEKIPRYSKTSIRTQRPLEQYNQTKDKTDYLWYTTSFRLETDDLP 764
Query: 529 FWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVG 588
+ K EV+P + G TG + + ++ + G N + +LS T+G
Sbjct: 765 YRK--EVKPV-------------LEGAGTGRRSTRSFTMEKAMDLKVGVNHVAILSSTLG 809
Query: 589 LQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWT 648
L + G++LE AG V + G G +DL+ W + G
Sbjct: 810 LMDSGSYLEHRMAGVY-TVTIRGLNTGTLDLTTNGWGHVPG------------------- 849
Query: 649 DLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQ 708
+D P TWY+ FD P G DPV +DL MGKG +VNG +GRYW
Sbjct: 850 ---KDNQP--LTWYRRRFDPPSGTDPVVIDLTPMGKGFLFVNGEGLGRYWV--------- 895
Query: 709 DTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRST 768
Y A G P+Q YHVPRS L+ N L+ FEE GG P I +
Sbjct: 896 ---SYHHA---------LGKPSQYLYHVPRSLLRPKGNTLMFFEEEGGKPDAIMILTVKR 943
Query: 769 RIVCEQVSESHYPPVR-KWSNSYS-----VDGKLSINKMAPEMHLHCQDGYIISSIEFAS 822
+C ++E + VR W + S + P L C I S+ FAS
Sbjct: 944 DNICTFMTEKNPAHVRWSWESKDSQPKAVAGAGAGAGGLKPTAVLSCPTKKTIQSVVFAS 1003
Query: 823 YGTPQGRCQKFSRGNCHAPMSLSVVSE 849
YG P G C ++ G+CHAP + VV +
Sbjct: 1004 YGNPLGICGNYTVGSCHAPRTKEVVEK 1030
Score = 347 bits (889), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 176/380 (46%), Positives = 235/380 (61%), Gaps = 40/380 (10%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
++YD R++IIDG+R + S IHYPR+ P+ WPDLI+K+KEGG +VIE+YVFWN HE +
Sbjct: 33 ITYDRRSLIIDGHREIFFSGSIHYPRSPPDTWPDLISKAKEGGLNVIESYVFWNGHEPEQ 92
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGF-PVWLRDIPGIEFRTNNA 165
G YNF+G+ D++KF KL+ +Y +RIGP+V AEWN G + +IP I FRTNN
Sbjct: 93 GVYNFEGRYDLIKFFKLIQEKEMYAIVRIGPFVQAEWNHGFVCHIGSGEIPDIIFRTNNE 152
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK+ M++FV IV+ ++E LF+ QGGPII+ QIENEY ++E ++ + G Y+ WAA M
Sbjct: 153 PFKKYMKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWAAKM 212
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGY--KPNSYNKPTLWTENWDGWYTTWGG 283
A+ GVPW+MCKQT AP +I CNG +C P KP LWTENW Y +G
Sbjct: 213 AIATNTGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPADKKKPLLWTENWTAQYRVFGD 272
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYM------------------------------- 312
R ED+AF+VARFF GG+ NYYM
Sbjct: 273 PPSQRSAEDIAFSVARFFSVGGTMANYYMVVLNSNSNLFLTKKRDEISDRTDTGGFTCVN 332
Query: 313 ---YFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAA 369
Y GGTNFGR +G F + Y +AP+DE+GL EPKWGHL+DLH A++ C+ AL+
Sbjct: 333 NQQYHGGTNFGR-NGAAFVMPRYYDEAPLDEFGLYKEPKWGHLRDLHHALRHCKKALLWG 391
Query: 370 D-SAQYI-KLGQNQEAHVYR 387
+ S Q + KL + Q+ V R
Sbjct: 392 NPSVQPLGKLTRGQKYFVAR 411
>gi|115480419|ref|NP_001063803.1| Os09g0539200 [Oryza sativa Japonica Group]
gi|113632036|dbj|BAF25717.1| Os09g0539200 [Oryza sativa Japonica Group]
Length = 446
Score = 399 bits (1026), Expect = e-108, Method: Compositional matrix adjust.
Identities = 192/397 (48%), Positives = 261/397 (65%), Gaps = 6/397 (1%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYD R+++IDG R + S IHYPR+ PEMW L+ +K GG + IETYVFWN HE
Sbjct: 36 VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWNGHEPEP 95
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G+Y F+G+ D+++F+ ++ + +Y +RIGP++ AEWN GG P WLR+I I FR NN P
Sbjct: 96 GKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHIIFRANNEP 155
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FK EM++FV+ IV +++ +F+ QGGPII+ QIENEYGN++ +G Y++WAA MA
Sbjct: 156 FKREMEKFVRFIVQKLKDAEMFAPQGGPIILSQIENEYGNIKKDRKVEGDKYLEWAAEMA 215
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
+ G GVPWVMCKQ+ AP +I CNG +C D + NKP LWTENW + T+G +L
Sbjct: 216 ISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDKNKPRLWTENWTAQFRTFGDQL 275
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 345
R ED+A+AV RFF +GG+ +NYYMY GGTNFGRT G + +T Y +AP+DEYG+
Sbjct: 276 AQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMCK 334
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
EPK+GHL+DLH IK A + + I LG EAH Y C +FL+N +
Sbjct: 335 EPKFGHLRDLHNVIKSYHKAFLWGKQSFEI-LGHGYEAHNYELP---EDKLCLSFLSNNN 390
Query: 406 EHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKV 442
+V F G+ + +P SVSIL DC+ V+NT +V
Sbjct: 391 TGEDGTVVFRGEKFYVPSRSVSILADCKTVVYNTKRV 427
>gi|34481839|emb|CAD44519.1| putative beta-galactosidase [Carica papaya]
Length = 285
Score = 397 bits (1021), Expect = e-107, Method: Compositional matrix adjust.
Identities = 184/291 (63%), Positives = 224/291 (76%), Gaps = 6/291 (2%)
Query: 142 EWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIE 201
EWNFGGFPVWL+ +PGI+FRT+N PFK +MQ+F +KIV++M+ E LF Q GPIIM QIE
Sbjct: 1 EWNFGGFPVWLKYVPGIQFRTDNGPFKAQMQKFTEKIVNMMKAEKLFEPQEGPIIMSQIE 60
Query: 202 NEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKP 261
NEYG +E G GK Y KWAA MA+GLG GVPW+MCKQ DAP+ IID CNG+YC+ + P
Sbjct: 61 NEYGPIEWEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQEDAPDPIIDTCNGFYCENFMP 120
Query: 262 NSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGR 321
N+ KP ++TE W GWYT +GG +P+RP ED+A++VARF Q GSF+NYYMY GGTNFGR
Sbjct: 121 NANYKPKMFTEAWTGWYTEFGGPVPYRPAEDMAYSVARFIQNRGSFINYYMYHGGTNFGR 180
Query: 322 TSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQ 381
T+GGPF TSYDYDAP+DEYGL EPKWGHL+DLH IKLCEP+LV+ D + LG NQ
Sbjct: 181 TAGGPFIATSYDYDAPLDEYGLGREPKWGHLRDLHKTIKLCEPSLVSVDP-KVTSLGSNQ 239
Query: 382 EAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDC 432
EAHV+ ++++C+AFLAN D + VTF Y LPPWSVSILPDC
Sbjct: 240 EAHVFW-----TKTSCAAFLANYDLKYSVRVTFQNLPYDLPPWSVSILPDC 285
>gi|281209972|gb|EFA84140.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
PN500]
Length = 707
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 223/632 (35%), Positives = 344/632 (54%), Gaps = 58/632 (9%)
Query: 45 FNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHES 104
+ V+YD R+++I+G R++ +S +HYPR+TP +W ++A SK G ++I+TYVFW+ HE
Sbjct: 106 YKVTYDGRSLLINGERKLFVSGSVHYPRSTPTIWKKVLALSKNSGINMIDTYVFWDLHEP 165
Query: 105 IRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNN 164
RG YNF+G ++ F+ L +GL++ LRIGPY+CAEWN+GG P+WL+DIPGI+ R N
Sbjct: 166 QRGVYNFEGNANLKHFLDLCQQNGLFVNLRIGPYICAEWNYGGLPIWLKDIPGIKMRDFN 225
Query: 165 APFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAAS 224
+ EE++R++K IVD + F+ QGGPI++ QIENEY ++ Y + G+ + W A
Sbjct: 226 TQYMEEVERWMKFIVDYLHG--YFAPQGGPIVLAQIENEYNWVQWRYQESGRKFAHWCAD 283
Query: 225 MALGLGAGVPWVMCKQTDAPENIIDACNGYYC----DGYKPNSYNKPTLWTENWDGWYTT 280
+A L G+PW+MC+Q D P +I+ CNGYYC + + N ++P L+TENW GW+
Sbjct: 284 LANRLDIGIPWIMCQQDDIP-TVINTCNGYYCHEWINFHWNNFKDQPPLFTENWSGWFNN 342
Query: 281 WGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDE 340
W + HRPV DL ++ AR+F GG+ MNYYM+ GGTNFGR S GP SYDYDAP++E
Sbjct: 343 WVNAVRHRPVADLLYSAARWFASGGALMNYYMWHGGTNFGRKS-GPMIALSYDYDAPLNE 401
Query: 341 YGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAF 400
YG PK+ +D + I E L++ I L N YR + +N ++F
Sbjct: 402 YGNPRNPKYSQTRDFNKLILSLEDILLSQYPPTPIFLANNISVIHYR-----NGNNSASF 456
Query: 401 LANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPN 460
+ N +E+ + V F G+SY +SV IL + +VF++++ + VE PN
Sbjct: 457 IINSNENGNSKVMFEGRSYFSYAYSVQILKNYV-SVFDSSQNPRNYTDTVVE----SEPN 511
Query: 461 ISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQI 520
I S+I + E + ++E LN+TKD +DY+W+ T I
Sbjct: 512 IPF-ANSIISKHVERFD-------------FEESLYDNRLMEQLNLTKDETDYIWYTTMI 557
Query: 521 YVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDL 580
D + + + D++ VF++ G+++ + + G + L
Sbjct: 558 NHDQDG----------EILKVINKTDIVHVFVDSYYVGTIMSDSLAITG---VPLGPSTL 604
Query: 581 ILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
LL +G+Q+Y +E AG G V GDI+++ +W + + E I
Sbjct: 605 QLLHTKMGIQHYELHMENTKAGILGPVYY-----GDIEITNQMWGSKPFVSSEKVITDPI 659
Query: 641 EENEAEWTDLTRD------GIPSTFTWYKTYF 666
+ W+ L R +P TWYK F
Sbjct: 660 QSKFVRWSPLDRKPNEVFYSVP--LTWYKFIF 689
>gi|195615772|gb|ACG29716.1| beta-galactosidase precursor [Zea mays]
Length = 450
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 219/481 (45%), Positives = 278/481 (57%), Gaps = 36/481 (7%)
Query: 293 LAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHL 352
+AFAVARF Q+GGSF+NYYMY GGTNF RTSGGPF TSYDYDAPIDEYGLL +PKWGHL
Sbjct: 1 MAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGLLRQPKWGHL 60
Query: 353 KDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASV 412
+DLH AIK EPALV+ D LG ++A+V+++ S C+AFL+N AA V
Sbjct: 61 RDLHKAIKQAEPALVSGDPT-IQSLGNYEKAYVFKS----SGGACAAFLSNYHTSAAARV 115
Query: 413 TFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESK 472
F G+ Y LP WS+S+LPDC+ VFNTA VS P +P P
Sbjct: 116 VFNGRRYDLPAWSISVLPDCKAAVFNTATVSE-----------PSAPARMSPAGGF---- 160
Query: 473 LSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKT 532
SW + E FT G++E L++T D SDYLW+ T + ++ ++ F K+
Sbjct: 161 ------SWQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNE-QFLKS 213
Query: 533 NEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLILLSQTVG 588
+ P +T+ S L+VF+NGQ G+V G + + V+ G N + +LS VG
Sbjct: 214 GQ-WPQLTVYSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVG 272
Query: 589 LQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE-ENEAEW 647
L N G E G G V L+G G DLS WTYQ+GL GE + S+ + EW
Sbjct: 273 LPNQGTHYETWNVGVLGPVTLSGLNEGKRDLSNQKWTYQIGLHGESLGVQSVAGSSSVEW 332
Query: 648 TDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGC 707
TW+K YF AP G PVALD+GSMGKGQAWVNG HIGRYW+ A G
Sbjct: 333 GSAAGK---QPLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYWSYKASSSGG 389
Query: 708 QDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRS 767
C Y G Y+ KC T CG+ +Q +YHVPRSWL S NLLV+ EE GG+ + + R+
Sbjct: 390 CGGCSYAGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVLLEEFGGDLPGVKLVTRT 449
Query: 768 T 768
Sbjct: 450 A 450
>gi|147778844|emb|CAN67049.1| hypothetical protein VITISV_001154 [Vitis vinifera]
Length = 317
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 187/264 (70%), Positives = 214/264 (81%), Gaps = 15/264 (5%)
Query: 587 VGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENE-A 645
+ NYGAFLEKDGAGF+GQVKLTGFKNG+IDLS+ WTYQVGL+GEFQ+IY I+E+E A
Sbjct: 22 IAAGNYGAFLEKDGAGFKGQVKLTGFKNGEIDLSEYSWTYQVGLRGEFQKIYMIDESEKA 81
Query: 646 EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKG 705
EWTDLT D PSTFTWYKT+FDAP+G +PVALDLGSMGKGQAWVNGHHIGRYWT VAPK
Sbjct: 82 EWTDLTPDASPSTFTWYKTFFDAPNGENPVALDLGSMGKGQAWVNGHHIGRYWTRVAPKD 141
Query: 706 GCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKL 765
GC CDYRG Y++ K YH+PRSWLQASNNLLV+FEETGG PFEISVK
Sbjct: 142 GC-GKCDYRGHYHTSK------------YHIPRSWLQASNNLLVLFEETGGKPFEISVKS 188
Query: 766 RSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGT 825
RST+ +C +VSESHYP ++ WS S +D + S NKM PEMHL C DG+ ISSIEFASYGT
Sbjct: 189 RSTQTICAEVSESHYPSLQNWSPSDFID-QNSKNKMTPEMHLQCDDGHTISSIEFASYGT 247
Query: 826 PQGRCQKFSRGNCHAPMSLSVVSE 849
PQG CQ FS+G CHAP SL++VS+
Sbjct: 248 PQGSCQMFSQGQCHAPNSLALVSK 271
>gi|217075793|gb|ACJ86256.1| unknown [Medicago truncatula]
Length = 268
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 167/248 (67%), Positives = 203/248 (81%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV YDHRA++IDG RR+LIS IHYPR+TP+MWPDLI KSK+GG DVIETYVFWN HE +
Sbjct: 21 NVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSKDGGLDVIETYVFWNLHEPV 80
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+GQY+F G+ D+VKFVK V +GLY+ LRIGPYVCAEWN+GGFP+WL IPGI+FRT+N
Sbjct: 81 KGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTDNE 140
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK EM+RF KIVDLM++E L++ QGGPII+ QIENEYGN++S YG GK Y+ WAA M
Sbjct: 141 PFKAEMKRFTAKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSHYGSAGKSYINWAAKM 200
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRL 285
A L GVPWVMC+Q DAP+ II+ CNG+YCD + PNS KP +WTENW GW+ ++GG +
Sbjct: 201 ATSLDTGVPWVMCQQGDAPDPIINTCNGFYCDQFTPNSNTKPKMWTENWSGWFLSFGGAV 260
Query: 286 PHRPVEDL 293
PHRPVE L
Sbjct: 261 PHRPVEIL 268
>gi|414881560|tpg|DAA58691.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 655
Score = 380 bits (975), Expect = e-102, Method: Compositional matrix adjust.
Identities = 222/523 (42%), Positives = 284/523 (54%), Gaps = 53/523 (10%)
Query: 332 YDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRY 391
Y D + GLL EPKWGHLK+LH AIKLCEPALVA D LG Q+A V+R+
Sbjct: 139 YRLDHILVADGLLREPKWGHLKELHKAIKLCEPALVAGDPI-VTSLGNAQQASVFRS--- 194
Query: 392 GSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTV 451
S C AFL N D+ + A V+F G Y LPPWS+SILPDC+ TV+NTA V SQ S +
Sbjct: 195 -STDACVAFLENKDKVSYARVSFNGMHYDLPPWSISILPDCKTTVYNTASVGSQISQMKM 253
Query: 452 EFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYS 511
E++ + W + E I + +F G+LE +NVT+D +
Sbjct: 254 EWAGGFT---------------------WQSYNEDINSLGDESFATVGLLEQINVTRDNT 292
Query: 512 DYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQP- 570
DYLW+ T + ++ D+ +N P +T+ S L +F+NGQLTG+V G V P
Sbjct: 293 DYLWYTTYVDIAQDEQFL--SNGKNPMLTVMSAGHALHIFVNGQLTGTVYG---SVEDPK 347
Query: 571 ------VEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILW 624
V+ SG N + LS VGL N G E AG G V L G G DL+ W
Sbjct: 348 LTYSGNVKLWSGSNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNEGRRDLTWQKW 407
Query: 625 TYQVGLKGE-FQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMG 683
TY+VGLKGE + EW + + +WYK +F+APDG +P+ALD+ SMG
Sbjct: 408 TYKVGLKGEALSLHSLSGSSSVEWGEPVQK---QPLSWYKAFFNAPDGDEPLALDMSSMG 464
Query: 684 KGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQA 743
KGQ W+NG IGRYW G C CDYRG Y+ KC TNCG+ +Q WYHVPRSWL
Sbjct: 465 KGQIWINGQGIGRYWPGYKASGTC-GICDYRGEYDEKKCQTNCGDSSQRWYHVPRSWLNP 523
Query: 744 SNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAP 803
+ NLLVIFEE GG+P IS+ R +C VSE P + W K+
Sbjct: 524 TGNLLVIFEEWGGDPTGISMVKRIAGSICADVSEWQ-PSMANWRTKGYEKAKV------- 575
Query: 804 EMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSV 846
HL C G ++ I+FAS+GTPQG C +S G CHA S +
Sbjct: 576 --HLQCDHGRKMTHIKFASFGTPQGSCGSYSEGGCHAHKSYDI 616
>gi|34481809|emb|CAD44190.1| putative beta-galactosidase [Mangifera indica]
gi|34481811|emb|CAD44191.1| putative beta-galactosidase [Mangifera indica]
Length = 286
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 176/291 (60%), Positives = 216/291 (74%), Gaps = 5/291 (1%)
Query: 142 EWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIE 201
EWNFGGFPVWL+ +PGI FRT+N PFK MQ F +KIV +M++E LF QGGPII+ QIE
Sbjct: 1 EWNFGGFPVWLKFVPGISFRTDNEPFKRAMQNFTQKIVQMMKDEKLFESQGGPIILSQIE 60
Query: 202 NEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKP 261
NEY +G G+ Y+ WAA MA GL GVPWVMCK+ DAP+ +I+ CNG+YCD + P
Sbjct: 61 NEYEPERMKFGSAGEAYMNWAAQMATGLNTGVPWVMCKEYDAPDPVINTCNGFYCDKFSP 120
Query: 262 NSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGR 321
N KP LWTE W GW+T +GG + RPVEDLAFAVARF Q GGSF+NYYMY GGTNFGR
Sbjct: 121 NKPFKPKLWTEAWTGWFTEFGGPIYQRPVEDLAFAVARFIQAGGSFVNYYMYHGGTNFGR 180
Query: 322 TSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQ 381
T+GGPF TSYDYDAPIDEYGL+ PK+ HLK+LH A+KLCE AL+ AD + LG +
Sbjct: 181 TAGGPFITTSYDYDAPIDEYGLIRRPKYDHLKELHQAVKLCETALLYAD-PYVMSLGNYE 239
Query: 382 EAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDC 432
+AHV+ + G C+AFL+N + ++A VTF + + LPPWS+SILPDC
Sbjct: 240 QAHVFSSTSGG----CAAFLSNFNSKSSARVTFNRKHFYLPPWSISILPDC 286
>gi|373853838|ref|ZP_09596637.1| glycoside hydrolase family 35 [Opitutaceae bacterium TAV5]
gi|372473365|gb|EHP33376.1| glycoside hydrolase family 35 [Opitutaceae bacterium TAV5]
Length = 744
Score = 377 bits (967), Expect = e-101, Method: Compositional matrix adjust.
Identities = 246/798 (30%), Positives = 381/798 (47%), Gaps = 131/798 (16%)
Query: 45 FNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHES 104
VS+DHRA+++DG R +++S +HYPR+TP MWP ++ ++ G + +ETY+FWN HE
Sbjct: 1 MTVSFDHRALLLDGRRTLVLSGAVHYPRSTPAMWPRILRHMRQSGLNTVETYIFWNLHER 60
Query: 105 IRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNN 164
RG +F G+ D+V+F +L + GL + LRIGPY+CAE N+GG P WLRD+P I RT+N
Sbjct: 61 RRGVLDFSGRLDLVRFCRLAQAEGLNVILRIGPYICAETNYGGLPGWLRDVPDIRMRTDN 120
Query: 165 APFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAAS 224
FK E R+V+ + +++R L + GGP+I+ QIENEY N+ ++YG+ G+ Y++W+
Sbjct: 121 EAFKREKARWVRLVAEVIRP--LCAPNGGPVILAQIENEYDNIAATYGEDGRRYLRWSVE 178
Query: 225 MALGLGAGVPWVMC-----------KQTDAPENIIDACNGYYCDGYKPNSYN----KPTL 269
+A LG G+PWV C + + ++ N + + +P L
Sbjct: 179 LAQSLGLGIPWVTCAAGRAAEAGEKDAVASAGDSLETLNAFRAHEIIGQHFREHPEQPAL 238
Query: 270 WTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYI 329
WTENW GWY TWGG LP R E+LA+A ARFF GGS +NY+++ GGTNFGR G
Sbjct: 239 WTENWAGWYQTWGGVLPKREPEELAYATARFFAAGGSGVNYFLWHGGTNFGR-DGMYLLT 297
Query: 330 TSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRAN 389
T+Y++ P+DEYGL + K HL L+ A+ C ++A++ + I +N
Sbjct: 298 TAYEFGGPLDEYGLPTT-KARHLARLNKALAACADKILASERPRAITGERNGLL------ 350
Query: 390 RYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIK 449
+ S+ L + A +V +G++ +L D V + + ++
Sbjct: 351 ----KFQYSSGLTFWCDDVARTVRIVGKNG-------EVLYDSSARVAPVRRTWKASGVR 399
Query: 450 TVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKD 509
+ P + + + ++S +T ++P LE L +TKD
Sbjct: 400 FAPWGWRAEP---------LPAAWPAEAQSAVTARKP--------------LEQLLLTKD 436
Query: 510 YSDYLWHITQIYV--SDDDISFWKTNE----------------VRPTV----------TI 541
+DY W+ T I V S D + + RP++ T+
Sbjct: 437 ETDYCWYETAIVVEGSGDVLVAGRDGSPAGLERGALARVGRRGRRPSIAGLASEVPANTV 496
Query: 542 DSMR-----DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFL 596
+++R D++ VFI+G + P+ + G D L +QT L +
Sbjct: 497 NTLRLTRVADIVHVFIDGTFVAT-------TPTPLRERRGKMDAGLFTQTFELDLKALRI 549
Query: 597 EKDGAGFR------GQVK---LTGFKNGDIDLSKIL-------------WTYQVGLKGEF 634
G +K + G++N ++ + W +Q GL GE
Sbjct: 550 TPGKHRLSLLCCALGLIKGDWMIGYENMALEKKGLWAPVFWNGKKLEGEWRHQPGLLGER 609
Query: 635 QQIYSIEENE-AEWTD---LTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVN 690
W T G W++T F P G P ALDLG MGKG AW+N
Sbjct: 610 CGFADPAAGSLLAWKTAKAATGRGARRPLRWWRTTFTRPKGHGPWALDLGGMGKGMAWIN 669
Query: 691 GHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASN--NLL 748
GH IGRYW + +G+ + + PTQ +YHVP WL+ + L
Sbjct: 670 GHCIGRYWLLADTDPMGPWMAWMKGSLTAAPSS----GPTQRYYHVPDDWLRTDGGPDTL 725
Query: 749 VIFEETGGNPFEISVKLR 766
V+FEE GG+P + + R
Sbjct: 726 VLFEELGGDPATVRLVRR 743
>gi|323371174|gb|ADX59436.1| beta-galactosidase [Coffea arabica]
Length = 338
Score = 370 bits (949), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 184/369 (49%), Positives = 239/369 (64%), Gaps = 38/369 (10%)
Query: 16 LSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATP 75
LS + ++M+M V VSYD R++II+G R++L S IHYPR+TP
Sbjct: 6 LSCFGLLMVMWTTTRGGVEGG---------QVSYDGRSLIIEGQRKLLFSGSIHYPRSTP 56
Query: 76 EMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRI 135
+MWP LI+K+K GG DVIETYVFWN HE GQY+FKG+++IV+F++ + + GLY +RI
Sbjct: 57 DMWPSLISKAKHGGLDVIETYVFWNLHEPRHGQYDFKGRHNIVRFIREIQAHGLYAFIRI 116
Query: 136 GPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPI 195
GP++ AEW +GG P WL D+PGI +R++N PFK MQ F KIV+L + E L++ QGGPI
Sbjct: 117 GPFIEAEWTYGGLPFWLHDVPGIVYRSDNEPFKYHMQNFTTKIVNLFKSEGLYAPQGGPI 176
Query: 196 IMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYY 255
I+ QIENEY N E ++ ++G YV+WAA+MA+GL GVPWVMCKQ DAP+ +I+ CNG
Sbjct: 177 ILQQIENEYKNAERAFHEKGPPYVQWAAAMAVGLQTGVPWVMCKQDDAPDPVINTCNGRT 236
Query: 256 CDG--YKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMY 313
C PNS NKP +WT+NW + GSF+NYYMY
Sbjct: 237 CGETFVGPNSPNKPAIWTDNWTS-------------------------LKNGSFVNYYMY 271
Query: 314 FGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQ 373
GGTNFGRT G F +TSY +APIDEYGL+ +PKWGHLK LH+ IK C L+
Sbjct: 272 HGGTNFGRT-GSAFVLTSYYDEAPIDEYGLIRQPKWGHLKQLHSVIKSCSQTLLHG-VIS 329
Query: 374 YIKLGQNQE 382
LGQ QE
Sbjct: 330 VSPLGQQQE 338
>gi|188501572|gb|ACD54699.1| beta-D-galactosidase [Adineta vaga]
Length = 735
Score = 367 bits (942), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 253/745 (33%), Positives = 378/745 (50%), Gaps = 87/745 (11%)
Query: 44 PFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHE 103
P++VSYDHRAI I+GNR +L S IHYPR+TP MWP L++K+KE G + I+TYVFWN HE
Sbjct: 31 PYHVSYDHRAITINGNRTLLFSGVIHYPRSTPAMWPYLMSKAKEQGLNTIQTYVFWNMHE 90
Query: 104 SIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTN 163
RG Y+F G+ ++ F++ ++GL++ LR+GPYVCAEW++G PVWL +IP I FR++
Sbjct: 91 QKRGTYDFSGRANLSLFLQEAANAGLFVNLRLGPYVCAEWDYGALPVWLNNIPNIAFRSS 150
Query: 164 NAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAA 223
N +K EM+RF+ I+ + + + GGPII+ QIENEYG + + YV W
Sbjct: 151 NDAWKSEMKRFLSDII--VYVDGFLAKNGGPIILAQIENEYGGND-------RAYVDWCG 201
Query: 224 SMALGLGAG--VPWVMCKQTDAPENIIDACNGYYC------DGYKPNSYNKPTLWTENWD 275
S+ A +PW+MC A + I+ CNG C D ++ N+P L+TENW
Sbjct: 202 SLVSNDFASTQIPWIMCNGL-AANSTIETCNGCNCFDDGWMDRHRRTYPNQPLLFTENW- 259
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYD 335
GW+ WG L R EDLA++VA +F GG++ YYM+ GG ++GRT GG T+Y D
Sbjct: 260 GWFQGWGEGLGIRTPEDLAYSVAEWFANGGAYHAYYMWHGGNHYGRT-GGSGLTTAYSDD 318
Query: 336 APIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQY-IKLGQNQEAHVYRANRYGSQ 394
+ G +EPK+ HL L + L++ DSA+ I ++ V S
Sbjct: 319 VILRADGTPNEPKFTHLNRLQRLLASQAQVLLSQDSARLPIPYWDGKQWSVGTQQMVYSY 378
Query: 395 SNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFS 454
F+ N + V F Q+ ++ SV I + + ++N+A VS T
Sbjct: 379 PPSIQFVIN-QAAFSLFVLFNKQNISIAGQSVQIYDNNEHLLWNSADVSGIFRNNTF--- 434
Query: 455 LPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYL 514
L P + P + S+ + + P LE LN+T D + YL
Sbjct: 435 --LVPIVVGPLDWQVYSEPFLSDLPVIVASTP--------------LEQLNLTNDETIYL 478
Query: 515 WHITQIYVSDDDISFWKTNEVRPTVTIDSMR-DVLRVFINGQLTGSVIGHW-------VK 566
W+ + +S + V + + R + L F++ Q G H V
Sbjct: 479 WYRRNVSLSQP--------SAQTIVQVQTRRANSLIFFMDRQFVGYFDDHSHAQGTINVN 530
Query: 567 VVQPV-EFQSGYNDLI-LLSQTVGLQNY----GAFLEKDGAGFRGQVKLTGFKNGDIDLS 620
+ + +F L +LS ++G+ N+ G+F K G G V L G +
Sbjct: 531 ITLNLSQFLPNQQYLFEILSVSLGIDNFNIGPGSFEYK---GIVGNVSLGG--QSLVGDE 585
Query: 621 KILWTYQVGLKGEFQQIYSIEENE-AEWTDLTRDGIPSTFTWYKTYFDAPDGI------D 673
+W +Q GL GE QIY+ + ++ EW I + TW++T FD + +
Sbjct: 586 ASIWEHQKGLFGEAYQIYTEQGSKTVEWNPRWTTAINKSVTWFQTRFDLNHLVREDLNAN 645
Query: 674 PVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQD--TCDYRGAYNSDKCTTNCGNPTQ 731
PV LD + +G A+VNG+ IG YW + +G CQ+ C + TNC P+Q
Sbjct: 646 PVLLDAFGLNRGHAFVNGNDIGLYWLI---EGTCQNKLCCCLQNQ-------TNCQQPSQ 695
Query: 732 TWYHVPRSWLQASNNLLVIFEETGG 756
+YH+P WL+ +NNLL +FEE G
Sbjct: 696 RYYHIPSDWLKPTNNLLTVFEEIGA 720
>gi|391229102|ref|ZP_10265308.1| beta-galactosidase [Opitutaceae bacterium TAV1]
gi|391218763|gb|EIP97183.1| beta-galactosidase [Opitutaceae bacterium TAV1]
Length = 743
Score = 363 bits (933), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 248/800 (31%), Positives = 378/800 (47%), Gaps = 136/800 (17%)
Query: 45 FNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHES 104
VS+DHRA+++DG R +++S +HYPR+TP MWP ++ ++ G + +ETY+FWN HE
Sbjct: 1 MTVSFDHRALLLDGRRTLVLSGAVHYPRSTPAMWPRILRHMRQSGLNTVETYIFWNLHER 60
Query: 105 IRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNN 164
RG +F G+ D+V+F +L + GL + LRIGPY+CAE N+GG P WLRD+P I RT+N
Sbjct: 61 RRGVLDFSGRLDLVRFCRLAQAEGLNVILRIGPYICAETNYGGLPGWLRDVPDIRMRTDN 120
Query: 165 APFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAAS 224
FK E R+V+ + +++R L + GGP+I+ QIENEY N+ ++YG+ G+ Y++W+
Sbjct: 121 EAFKREKARWVRLVAEVIRP--LCAPNGGPVILAQIENEYDNIAATYGEDGRRYLRWSVE 178
Query: 225 MALGLGAGVPWVMC-----------KQTDAPENIIDACNGYYCDGYKPNSYN----KPTL 269
+A LG G+PWV C + + ++ N + + +P L
Sbjct: 179 LAQSLGLGIPWVTCAAGRAAEAGEKDAVASAGDSLETLNAFRAHEIIGQHFREHPEQPAL 238
Query: 270 WTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYI 329
WTENW GWY TWGG LP R E+LA+A ARFF GGS +NY+++ GGTNFGR G
Sbjct: 239 WTENWAGWYQTWGGVLPKREPEELAYATARFFAAGGSGVNYFLWHGGTNFGR-DGMYLLT 297
Query: 330 TSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRAN 389
T+Y++ P+DEYGL + K HL L+AA+ C L+A++ ++ + Y +
Sbjct: 298 TAYEFGGPLDEYGLPTT-KARHLARLNAALAACAGELLASERPGVVEKSSGVVEYHYDSG 356
Query: 390 RYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIK 449
L + + TA +V + +S +L D V + + ++
Sbjct: 357 -----------LVFVCDDTARAVRIVKKSG-------EVLYDSSVRVAPVRRAWKSSGVR 398
Query: 450 TVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKD 509
+ P + + + ++S +T ++P LE L TKD
Sbjct: 399 FAPWGWRAEP---------LPAAWPAEAQSAVTARKP--------------LEQLLPTKD 435
Query: 510 YSDYLWHITQIYV--SDDDISFWKTNE----------------VRPTV----------TI 541
+DY W+ T I V S D + + RP++ T+
Sbjct: 436 ETDYCWYETAIVVEGSGDVLVAGRDGSPAGLERGALARVGRRGRRPSIAGLASEVPANTV 495
Query: 542 DSMR-----DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFL 596
+++R D++ VFI+G + P+ + G D L +QT L +
Sbjct: 496 NTLRLTRVADIVHVFIDGTFVAT-------TPTPLRERRGKMDAGLFTQTFELDLKALRI 548
Query: 597 EKDGAGFR------GQVK---LTGFKNGDIDLSKIL-------------WTYQVGLKGEF 634
G +K + G++N ++ + W +Q GL GE
Sbjct: 549 TPGKHRLSLLCCALGLIKGDWMIGYENMALEKKGLWAPVFWNGKKLEGEWRHQPGLLGER 608
Query: 635 QQIYSIEENE-AEWTD---LTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVN 690
W T G W++T F P G P ALDLG MGKG W+N
Sbjct: 609 CGFADPAAGSLLAWKTAKAATGRGARRPLNWWRTTFTRPKGHGPWALDLGGMGKGFCWIN 668
Query: 691 GHHIGRYWTV--VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASN--N 746
GH IGRYW + P G + G PTQ +YHVP WL+ +
Sbjct: 669 GHCIGRYWLLPDTDPMG------PWMAWMKGSLTAAPSGGPTQRYYHVPDDWLRTDGGPD 722
Query: 747 LLVIFEETGGNPFEISVKLR 766
LV+FEE GG+P + + R
Sbjct: 723 TLVLFEELGGDPATVRLVRR 742
>gi|357483613|ref|XP_003612093.1| Beta-galactosidase [Medicago truncatula]
gi|355513428|gb|AES95051.1| Beta-galactosidase [Medicago truncatula]
Length = 504
Score = 362 bits (928), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 216/498 (43%), Positives = 286/498 (57%), Gaps = 44/498 (8%)
Query: 361 LCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYT 420
+CE AL++ D LG Q+A+VY +CSAFL+N D ++A V F Y
Sbjct: 1 MCEKALISTDPV-VTSLGNFQQAYVYTTE----SGDCSAFLSNYDSKSSARVMFNNMHYN 55
Query: 421 LPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSW 480
LPPWSVSILPDCRN VFNTAKV QTS Q M+ + +S SW
Sbjct: 56 LPPWSVSILPDCRNAVFNTAKVGVQTS-----------------QMQMLPT--NSERFSW 96
Query: 481 MTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVT 540
+ +E S T G+LE +NVT+D SDYLW+IT + V + SF ++ P++
Sbjct: 97 ESFEEDTSSSSATTITASGLLEQINVTRDTSDYLWYITSVDVGSSE-SFLHGGKL-PSLI 154
Query: 541 IDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFL 596
+ S + VFING+L+GS G + V ++G N + LLS VGL N G
Sbjct: 155 VQSTGHAVHVFINGRLSGSAYGTREDRRFRYTGDVNLRAGTNTIALLSVAVGLPNVGGHF 214
Query: 597 EKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEE-NEAEWTD---LTR 652
E G G V + G G +DLS WTYQVGLKGE + S + + EW + +
Sbjct: 215 ETWNTGILGPVVIHGLDKGKLDLSWQKWTYQVGLKGEAMNLASPDGISSVEWMQSAVVVQ 274
Query: 653 DGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCD 712
P TW+KT+FDAP+G +P+ALD+ MGKGQ W+NG IGRYWT +A G C D C+
Sbjct: 275 RNQP--LTWHKTFFDAPEGEEPLALDMDGMGKGQIWINGISIGRYWTAIA-TGSCND-CN 330
Query: 713 YRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVC 772
Y G++ KC CG PTQ WYHVPRSWL+ ++NLLV+FEE GG+P +IS+ RS VC
Sbjct: 331 YAGSFRPPKCQLGCGQPTQRWYHVPRSWLKQNHNLLVVFEELGGDPSKISLAKRSVSSVC 390
Query: 773 EQVSESHYPPVRKWS-NSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQ 831
VSE H P ++ W +SY GK S N P++HLHC G ISSI+FAS+GTP G C
Sbjct: 391 ADVSEYH-PNLKNWHIDSY---GK-SENFRPPKVHLHCNPGQAISSIKFASFGTPLGTCG 445
Query: 832 KFSRGNCHAPMSLSVVSE 849
+ +G CH+ S ++ +
Sbjct: 446 SYEQGACHSSSSYDILEQ 463
>gi|188501582|gb|ACD54708.1| beta-D-galactosidase-like protein [Adineta vaga]
Length = 735
Score = 360 bits (924), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 249/745 (33%), Positives = 372/745 (49%), Gaps = 87/745 (11%)
Query: 44 PFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHE 103
P+ VSYDHRAI I+GNR +L S IHYPR+TP MWP L++K+KE G + I+TYVFWN HE
Sbjct: 31 PYRVSYDHRAITINGNRTLLFSGVIHYPRSTPAMWPYLMSKAKEQGLNTIQTYVFWNIHE 90
Query: 104 SIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTN 163
RG Y+F G+ ++ F++ ++GL++ LR+GPYVCAEW++G PVWL +IP I FR++
Sbjct: 91 QKRGTYDFSGRANLSLFLQEAANAGLFVNLRLGPYVCAEWDYGALPVWLNNIPNIAFRSS 150
Query: 164 NAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAA 223
N +K EM+RF+ I+ + + + GGPII+ QIENEYG + + YV W
Sbjct: 151 NDAWKSEMKRFLSDII--VYVDGFLAKNGGPIILAQIENEYGGND-------RAYVDWCG 201
Query: 224 SMALGLGAG--VPWVMCKQTDAPENIIDACNGYYC------DGYKPNSYNKPTLWTENWD 275
S+ A +PW+MC A + I+ CNG C D ++ N+P L+TENW
Sbjct: 202 SLVSNDFASTQIPWIMCNGL-AANSTIETCNGCNCFDDGWMDRHRRTYPNQPLLFTENW- 259
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYD 335
GW+ WG L R EDLA++VA +F GG++ YYM+ GG ++GRT GG T+Y D
Sbjct: 260 GWFQGWGEGLGIRTPEDLAYSVAEWFANGGAYHAYYMWHGGNHYGRT-GGSGLTTAYSDD 318
Query: 336 APIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQY-IKLGQNQEAHVYRANRYGSQ 394
+ G +EPK+ HL L + L++ DS + I ++ V S
Sbjct: 319 VILRADGTPNEPKFTHLNRLQRLLASQAQVLLSQDSNRLSIPYWNGKQWTVGTQQMVYSY 378
Query: 395 SNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFS 454
F+ N + V F Q+ ++ SV I + ++N+A VS + T
Sbjct: 379 PPSVQFVIN-QAAFSLFVLFNKQNISIAGQSVQIYDYNEHLLWNSADVSGISRNNTF--- 434
Query: 455 LPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYL 514
L P + P + S+ ++ + P LE LN+T D + YL
Sbjct: 435 --LVPIVVGPLDWQVYSEPFTSDLPVIVASTP--------------LEQLNLTNDETIYL 478
Query: 515 WHITQIYVSDDDISFWKTNEVRPTVTIDSMR-DVLRVFINGQLTGSVIGH-WVKVVQPVE 572
W+ + +S V+ V + + R + L F++ Q G H + V
Sbjct: 479 WYRRNVSLSQP--------SVQTIVQVQTRRANSLLFFMDRQFVGYFDDHSHTQGTINVN 530
Query: 573 FQSGYNDLI--------LLSQTVGLQNY----GAFLEKDGAGFRGQVKLTGFKNGDIDLS 620
+ + +LS ++G+ N+ G+F K G G V L G +
Sbjct: 531 ITLNLSQFLPNQQYIFEILSVSLGIDNFNIGPGSFEYK---GIVGNVSLGG--QSLVGDE 585
Query: 621 KILWTYQVGLKGEFQQIYSIEENE-AEWTDLTRDGIPSTFTWYKTYFDAPD------GID 673
+W +Q GL GE QIY+ + ++ EW I TW++T FD +
Sbjct: 586 ASIWEHQKGLFGEAHQIYTEQGSKTVEWNPKWTTVINKPVTWFQTRFDLNHLAREDLNAN 645
Query: 674 PVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDT--CDYRGAYNSDKCTTNCGNPTQ 731
P+ LD +G A+VNG+ IG YW + +G CQ+ C + TNC P+Q
Sbjct: 646 PILLDAFGFNRGHAFVNGNDIGLYWLI---EGTCQNNLCCCLQNQ-------TNCQQPSQ 695
Query: 732 TWYHVPRSWLQASNNLLVIFEETGG 756
+YH+ WL+ +NNLL +FEE G
Sbjct: 696 RYYHISSDWLKPTNNLLTVFEEIGA 720
>gi|84468366|dbj|BAE71266.1| putative beta-galactosidase [Trifolium pratense]
Length = 425
Score = 357 bits (916), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 190/440 (43%), Positives = 257/440 (58%), Gaps = 27/440 (6%)
Query: 334 YDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGS 393
YDAP+DEYGL PKWGHLKDLH AIKLCE L+ S + LG + EA VY S
Sbjct: 1 YDAPVDEYGLPRLPKWGHLKDLHKAIKLCEHVLLYGKSVN-VSLGPSVEADVYT----DS 55
Query: 394 QSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEF 453
C+AF+AN+D+ +V F SY +P WSVSILPDC+N V+NTAKV++QT+
Sbjct: 56 SGACAAFIANVDDKNDKTVEFRNASYHIPAWSVSILPDCKNVVYNTAKVTTQTN------ 109
Query: 454 SLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDY 513
I++ + + +S + W KE G+W + +F + G ++H+N TKD +DY
Sbjct: 110 ------KIAMIPEKLQQSDKGQKTFKWDVWKENPGIWGKPDFVINGFVDHINTTKDTTDY 163
Query: 514 LWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVV----Q 569
LWH T I + +++ K + +P + I+S L F+N + G+ G+
Sbjct: 164 LWHTTSISIDENEELLKKGS--KPVLVIESKGHALHAFVNQKYQGTAYGNGSHSAFTFKN 221
Query: 570 PVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVG 629
P+ ++G N++ LLS TVGLQ G F + GAG VK+ G N IDLS WTY++G
Sbjct: 222 PISLKAGKNEIALLSLTVGLQTAGPFYDFVGAGVT-SVKIKGLNNKTIDLSSNAWTYKIG 280
Query: 630 LKGEFQQIYSIEE-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAW 688
++GE +IY N WT + T TWYK DAP G +PV LD+ MGKG AW
Sbjct: 281 VQGEHLKIYQGNGLNSVSWTSTSEPPKGQTLTWYKAIVDAPPGDEPVGLDMLYMGKGFAW 340
Query: 689 VNGHHIGRYWTVVA--PKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNN 746
+NG IGRYW ++ K C + CDYRG +N DKC T CG P+Q WYHVPRSW + S N
Sbjct: 341 LNGEGIGRYWPRISEFKKEDCVEECDYRGKFNPDKCDTGCGEPSQKWYHVPRSWFKPSGN 400
Query: 747 LLVIFEETGGNPFEISVKLR 766
+LV FEE GG+P +I+ R
Sbjct: 401 VLVFFEEKGGDPTKITFVRR 420
>gi|19386854|dbj|BAB86232.1| putative beta-D-galactosidase [Oryza sativa Japonica Group]
Length = 774
Score = 357 bits (916), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 160/287 (55%), Positives = 202/287 (70%), Gaps = 20/287 (6%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YDHR++II G RR+LIS IHYPR+ PEMWP L+A++K+GGAD +ETYVFWN HE
Sbjct: 37 SVTYDHRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHEPA 96
Query: 106 RGQ--------------------YNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNF 145
+GQ Y F+ + D+V+F K+V +GLY+ LRIGP+V AEW F
Sbjct: 97 QGQVRAASPKFVMDLACSIRDKPYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWTF 156
Query: 146 GGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYG 205
GG PVWL PG FRTNN PFK M+RF IVD+M++E F+ QGG II+ Q+ENEYG
Sbjct: 157 GGVPVWLHYAPGTVFRTNNEPFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEYG 216
Query: 206 NMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYN 265
+ME +YG K Y WAASMAL GVPW+MC+Q DAP+ +I+ CN +YCD +KPNS
Sbjct: 217 DMEQAYGAGAKPYAMWAASMALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQFKPNSPT 276
Query: 266 KPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYM 312
KP WTENW GW+ T+G PHRP ED+AF+VARFF +GGS NYY+
Sbjct: 277 KPKFWTENWPGWFQTFGESNPHRPPEDVAFSVARFFGKGGSLQNYYV 323
Score = 316 bits (809), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 193/495 (38%), Positives = 257/495 (51%), Gaps = 75/495 (15%)
Query: 364 PALVAADSAQYIKLGQNQEAHVYRANRYGSQSN-CSAFLANIDEHTAASVTFLGQSYTLP 422
P VA A++ G + + + Y A+ Y QS C AFL+N+D VTF +SY LP
Sbjct: 301 PEDVAFSVARFFGKGGSLQNY-YVADVYTDQSGGCVAFLSNVDSEKDKVVTFQSRSYDLP 359
Query: 423 PWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKL-SSTSKSWM 481
WSVSILPDC+N FNTAKV SQT + M+ + L SS W
Sbjct: 360 AWSVSILPDCKNVAFNTAKVRSQTLM-----------------MDMVPANLESSKVDGWS 402
Query: 482 TVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTI 541
+E G+W + G ++H+N TKD +DYLW+ T V ++ N V + I
Sbjct: 403 IFREKYGIWGNIDLVRNGFVDHINTTKDSTDYLWYTTSFDVDGSHLA--GGNHV---LHI 457
Query: 542 DSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYNDLILLSQTVGLQNYGAFLE 597
+S ++ F+N +L GS G+ K V PV ++G N L LLS TVGLQN G E
Sbjct: 458 ESKGHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAGKNKLSLLSMTVGLQNGGPMYE 517
Query: 598 KDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPS 657
GAG VK++G +N IDLS W Y+V +
Sbjct: 518 WAGAGIT-SVKISGMENRIIDLSSNKWEYKVNV--------------------------- 549
Query: 658 TFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAP-KGGCQDTCDYRGA 716
D P G DPV LD+ SMGKG AW+NG+ IGRYW ++P C +CDYRG
Sbjct: 550 ---------DVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWPRISPVSDRCTSSCDYRGT 600
Query: 717 YNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVS 776
++ +KC CG PTQ WYHVPRSW S N LVIFEE GG+P +I+ R+ VC VS
Sbjct: 601 FSPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDPTKITFSRRTVASVCSFVS 660
Query: 777 ESHYPPV--RKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFS 834
E HYP + W + DG + A ++ L C G ISS++F S+G P G C+ +
Sbjct: 661 E-HYPSIDLESWDRNTQNDG-----RDAAKVQLSCPKGKSISSVKFVSFGNPSGTCRSYQ 714
Query: 835 RGNCHAPMSLSVVSE 849
+G+CH P S+SVV +
Sbjct: 715 QGSCHHPNSISVVEK 729
>gi|62869849|gb|AAY18075.1| beta-galactosidase, partial [Carica papaya]
Length = 263
Score = 355 bits (910), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 164/269 (60%), Positives = 201/269 (74%), Gaps = 6/269 (2%)
Query: 162 TNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKW 221
T+N PFK MQ+F +KIV +M+ E LF QGGPII+ QIENE+G +E G GK Y KW
Sbjct: 1 TDNEPFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKW 60
Query: 222 AASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTW 281
AA MA+GL GVPW+MCKQ DAP+ +ID CNG+YC+ + PN KP +WTE W GWYT +
Sbjct: 61 AARMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCENFTPNKNYKPKMWTEVWTGWYTEF 120
Query: 282 GGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEY 341
GG +P RP EDLAF++AR Q+GGSF+NYYMY GGTNFGRT+GGPF TSYDYDAP+DEY
Sbjct: 121 GGAVPTRPAEDLAFSIARLIQKGGSFVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEY 180
Query: 342 GLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFL 401
GL EPKWGHL+DLH AIK E ALV+A+ + LG +QEAHV++ S+S C+AFL
Sbjct: 181 GLPREPKWGHLRDLHKAIKSSESALVSAEPS-VTSLGNSQEAHVFK-----SKSGCAAFL 234
Query: 402 ANIDEHTAASVTFLGQSYTLPPWSVSILP 430
AN D ++A V+F Y LPPWS+SILP
Sbjct: 235 ANYDTKSSAKVSFGNGQYELPPWSISILP 263
>gi|62869847|gb|AAY18074.1| beta-galactosidase [Carica papaya]
Length = 263
Score = 350 bits (899), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 162/269 (60%), Positives = 200/269 (74%), Gaps = 6/269 (2%)
Query: 162 TNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKW 221
T+N PFK MQ+F +KIV +M+ E LF QGGPII+ QIENE+G +E G GK Y KW
Sbjct: 1 TDNEPFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKW 60
Query: 222 AASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTW 281
AA MA+GL GVPW+MCKQ DAP+ +ID CNG+YC+ + PN KP +WTE W GWYT +
Sbjct: 61 AARMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCENFTPNKNYKPKMWTEVWTGWYTEF 120
Query: 282 GGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEY 341
GG +P RP EDLAF++ARF Q+GGS +NYYMY GGTNFGRT+GGPF TSYDYDAP+DEY
Sbjct: 121 GGAVPTRPAEDLAFSIARFIQKGGSSVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEY 180
Query: 342 GLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFL 401
GL EPKWGHL++LH AIK E ALV+A+ + LG +QEAH ++ S+S C+AFL
Sbjct: 181 GLPREPKWGHLRNLHKAIKSSESALVSAEPS-VTSLGNSQEAHAFK-----SKSGCAAFL 234
Query: 402 ANIDEHTAASVTFLGQSYTLPPWSVSILP 430
AN D ++A V+F Y LPPWS+SILP
Sbjct: 235 ANYDTKSSAKVSFGNGQYELPPWSISILP 263
>gi|227204157|dbj|BAH56930.1| AT4G35010 [Arabidopsis thaliana]
Length = 377
Score = 346 bits (887), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 155/291 (53%), Positives = 216/291 (74%), Gaps = 4/291 (1%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD ++IIDG R +L S IHYPR+TPEMWP +I ++K+GG + I+TYVFWN HE +
Sbjct: 41 VTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHEPQQ 100
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G++NF G+ D+VKF+KL+ +G+Y+ LR+GP++ AEW GG P WLR++PGI FRT+N
Sbjct: 101 GKFNFSGRADLVKFIKLIQKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNKQ 160
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FKE +R+V+ I+D M+EE LF+ QGGPII+ QIENEY ++ +Y Q G +Y+KWA+++
Sbjct: 161 FKEHTERYVRMILDKMKEERLFASQGGPIILGQIENEYSAVQRAYKQDGLNYIKWASNLV 220
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGGR 284
+ G+PWVMCKQ DAP+ +I+ACNG +C D + PN NKP+LWTENW + +G
Sbjct: 221 DSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNRENKPSLWTENWTTQFRVFGDP 280
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYD 335
R VED+A++VARFF + G+ +NYYMY GGTNFGRTS Y+T+ Y+
Sbjct: 281 PTQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSA--HYVTTRYYE 329
>gi|414590082|tpg|DAA40653.1| TPA: hypothetical protein ZEAMMB73_851266 [Zea mays]
Length = 580
Score = 343 bits (881), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 207/589 (35%), Positives = 302/589 (51%), Gaps = 61/589 (10%)
Query: 269 LWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY 328
LWTENW + +G ++ R ED+A+AV RFF +GGS +NYYMY GGTNFGRT G +
Sbjct: 2 LWTENWTQQFRAYGDQVAMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYV 60
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRA 388
+T Y +AP+DEYG+ EPK+GHL+DLH I+ + A + + I LG EAH++
Sbjct: 61 LTGYYDEAPMDEYGMYKEPKFGHLRDLHNVIRSYQKAFLWGQHSSEI-LGHGYEAHIFEL 119
Query: 389 NRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSI 448
+ C +FL+N + +V F G + +P SVSIL C+N V+NT +V Q S
Sbjct: 120 PE---EKLCLSFLSNNNTGEDGTVIFRGDKHYVPSRSVSILAGCKNVVYNTKRVFVQHS- 175
Query: 449 KTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTK 508
++S S ++S + W E I + + + LE N TK
Sbjct: 176 ----------------ERSFHTSDVTSKNNQWEMFSETIPKYRDTKVRTKEPLEQYNQTK 219
Query: 509 DYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGH-WVK- 566
D +DYLW+ T + DD+ F N++RP + + S + F N G G+ VK
Sbjct: 220 DDTDYLWYTTSFRLESDDLPF--RNDIRPVLQVKSSAHAMMGFANDAFVGCARGNKQVKG 277
Query: 567 --VVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILW 624
+PV+ + G N ++LLS T+G+++ G L + G + + + G G +DL W
Sbjct: 278 FMFEKPVDLKVGVNHVVLLSSTMGMKDSGGELAEVKGGIQ-ECLIQGLNTGTLDLQVNGW 336
Query: 625 TYQVGLKGEFQQIYSIEE-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMG 683
++ L+GE+++IYS + + +W D TWYK YFD PDG DPV LD+ SM
Sbjct: 337 GHKAALEGEYKEIYSEKGLGKVQWKPAEND---RAATWYKRYFDEPDGDDPVVLDMSSMS 393
Query: 684 KGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQA 743
KG +VNG +GRYW YR T G P+Q YH+PR +L++
Sbjct: 394 KGMIFVNGEGVGRYWV------------SYR---------TLAGTPSQAVYHIPRPFLKS 432
Query: 744 SNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDG---KLSINK 800
+NLLVIFEE G P I V+ + +C +SE + ++ W DG KL
Sbjct: 433 KDNLLVIFEEEMGKPDGILVQTVTRDDICLFISEHNPGQIKTW----DTDGDKIKLIAED 488
Query: 801 MAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+ L C I + FAS+G P G C F+ G CH P + +V +
Sbjct: 489 HSRRGTLTCPPEKTIQEVVFASFGNPDGMCGNFTVGTCHTPNAKQIVEK 537
>gi|293331757|ref|NP_001169479.1| uncharacterized protein LOC100383352 [Zea mays]
gi|224029591|gb|ACN33871.1| unknown [Zea mays]
Length = 580
Score = 343 bits (879), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 207/589 (35%), Positives = 302/589 (51%), Gaps = 61/589 (10%)
Query: 269 LWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY 328
LWTENW + +G ++ R ED+A+AV RFF +GGS +NYYMY GGTNFGRT G +
Sbjct: 2 LWTENWTQQFRAYGDQVAMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYV 60
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRA 388
+T Y +AP+DEYG+ EPK+GHL+DLH I+ + A + + I LG EAH++
Sbjct: 61 LTGYYDEAPMDEYGMYKEPKFGHLRDLHNVIRSYQKAFLWGQHSSEI-LGHGYEAHIFEL 119
Query: 389 NRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSI 448
+ C +FL+N + +V F G + +P SVSIL C+N V+NT +V Q S
Sbjct: 120 PE---EKLCLSFLSNNNTGEDGTVIFRGDKHYVPSRSVSILAGCKNVVYNTKRVFVQHS- 175
Query: 449 KTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTK 508
++S S ++S + W E I + + + LE N TK
Sbjct: 176 ----------------ERSFHTSDVTSKNNQWEMSSETIPKYRDTKVRTKEPLEQYNQTK 219
Query: 509 DYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGH-WVK- 566
D +DYLW+ T + DD+ F N++RP + + S + F N G G+ VK
Sbjct: 220 DDTDYLWYTTSFRLESDDLPF--RNDIRPVLQVKSSAHAMMGFANDAFVGCARGNKQVKG 277
Query: 567 --VVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILW 624
+PV+ + G N ++LLS T+G+++ G L + G + + + G G +DL W
Sbjct: 278 FMFEKPVDLKVGVNHVVLLSSTMGMKDSGGELAEVKGGIQ-ECLIQGLNTGTLDLQVNGW 336
Query: 625 TYQVGLKGEFQQIYSIEE-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMG 683
++ L+GE+++IYS + + +W D TWYK YFD PDG DPV LD+ SM
Sbjct: 337 GHKAALEGEYKEIYSEKGLGKVQWKPAEND---RAATWYKRYFDEPDGDDPVVLDMSSMS 393
Query: 684 KGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQA 743
KG +VNG +GRYW YR T G P+Q YH+PR +L++
Sbjct: 394 KGMIFVNGEGVGRYWV------------SYR---------TLAGTPSQAVYHIPRPFLKS 432
Query: 744 SNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDG---KLSINK 800
+NLLVIFEE G P I V+ + +C +SE + ++ W DG KL
Sbjct: 433 KDNLLVIFEEEMGKPDGILVQTVTRDDICLFISEHNPGQIKTW----DTDGDKIKLIAED 488
Query: 801 MAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+ L C I + FAS+G P G C F+ G CH P + +V +
Sbjct: 489 HSRRGTLTCPPEKTIQEVVFASFGNPDGMCGNFTVGTCHTPNAKQIVEK 537
>gi|56550179|emb|CAE51355.1| putative beta-galactosidase [Musa acuminata]
Length = 281
Score = 341 bits (874), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 166/291 (57%), Positives = 205/291 (70%), Gaps = 10/291 (3%)
Query: 142 EWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIE 201
EWNFGGFPVWL+ +PGI FRT+N PFK M +F +KIV +M+ E LF QGGPII+ QIE
Sbjct: 1 EWNFGGFPVWLKYVPGINFRTDNGPFKAAMAKFTEKIVAMMKSEGLFESQGGPIILSQIE 60
Query: 202 NEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKP 261
NEYG +E G K+Y+ WAA MA+GL VPWVMCKQ DAP+ +I+ACNG+YCD + P
Sbjct: 61 NEYGPVEYYGGTAAKNYLSWAAQMAVGLNTRVPWVMCKQDDAPDPVINACNGFYCDYFSP 120
Query: 262 NSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGR 321
N KPT+WTE W GW+T + G + + A V R + + + + GTNFGR
Sbjct: 121 NKPYKPTMWTEAWTGWFTGFRGPVLTDCEDCFAVQVIRRWILVTTIVPW-----GTNFGR 175
Query: 322 TSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQ 381
T+GGPF TSYDYDAPIDEYGLL +PKWGHL+DLH AIK+CEPALV+ D KLG Q
Sbjct: 176 TAGGPFISTSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKMCEPALVSGDPT-VTKLGNYQ 234
Query: 382 EAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDC 432
EAHVYR+ +C+AFL+N + H+ ASVTF G Y +P WS+SILPDC
Sbjct: 235 EAHVYRSK----SGSCAAFLSNFNPHSYASVTFNGMKYNIPSWSISILPDC 281
>gi|359476803|ref|XP_003631891.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 11-like [Vitis
vinifera]
Length = 722
Score = 341 bits (874), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 231/697 (33%), Positives = 336/697 (48%), Gaps = 141/697 (20%)
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
P ++ M+RF + I+D+M +E + QGGPII+ +++ ++ + G V WA +M
Sbjct: 113 PVQDHMKRFTRMIIDMMSKEKXIASQGGPIILALVDSAI-----AFKEMGTRCVHWAGTM 167
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGG 283
A+GL G+P VMCKQ DAP+ +I+ C G C D + PN NK ++ + + G Y +G
Sbjct: 168 AVGLKTGIPXVMCKQKDAPDPVINTCKGRNCGDTFTGPNRPNKRSV-SNHXLGMYRVFGD 226
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGL 343
R EDLAF+ F + G+ NYYMY+ TNFGRT+ F T Y +AP+DEYGL
Sbjct: 227 PPSQRAAEDLAFSX--FISKNGTLANYYMYYSVTNFGRTTSS-FATTCYYDEAPLDEYGL 283
Query: 344 LSEPKWGHLKDLHAAIKLCEPALV-AADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLA 402
E KWGHL+DLHAA++L + AL+ SAQ KLG++ EA +Y + GS C+ FL
Sbjct: 284 PRETKWGHLRDLHAALRLSKKALLWGVTSAQ--KLGEDLEARIYE--KPGSNI-CATFLL 338
Query: 403 NIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNIS 462
N T + T G Y LP S+S LPDC+ VFNT V SQ S+
Sbjct: 339 NNITRTPTTTTLRGSKYYLPQHSISNLPDCKTVVFNTQTVVSQYSV-------------- 384
Query: 463 VPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
+ + W ++ + + E + +E + +TKD +DYLW+ T I +
Sbjct: 385 ------------NKNLQWXMSQDALPTYEECPTKTKSPVELMTMTKDTTDYLWYTTNIEL 432
Query: 523 SDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQ-----LTGSVIGHWVK----VVQPVEF 573
+ + F K +V + ++ V+ F+NG+ LTG+ G V+ +P+
Sbjct: 433 ARTGLPFRK--DVLRVPQVSNLGHVMHAFLNGEYMEFYLTGTRHGSNVEKSFVFNKPITL 490
Query: 574 QSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGE 633
++G N + L TVGL + G+++E AG V + G IDL K W
Sbjct: 491 KAGLNQIAPLGATVGLPDSGSYMEHRLAGVH-NVAIQGLNTRTIDLPKNGWG-------- 541
Query: 634 FQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHH 693
+K YFDAP+G PVAL+L +M KG AW+NG
Sbjct: 542 ----------------------------HKAYFDAPEGDVPVALELSTMAKGMAWINGKS 573
Query: 694 IGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
I YW + ++P G P+Q+ YHVPR++L+ S+NLLV+FE
Sbjct: 574 IDXYWVSYLSP----------------------LGKPSQSVYHVPRAFLKTSDNLLVLFE 611
Query: 753 ETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDG 812
ETG NP I + + +C +SE H VR W S
Sbjct: 612 ETGRNPDGIEILTLNRDTICCYISEHHPTHVRSWKREAS--------------------- 650
Query: 813 YIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+ +G P G C +F GNC AP S VV +
Sbjct: 651 ------DIQIFGDPTGTCXEFIPGNCAAPNSXKVVEK 681
Score = 86.3 bits (212), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 34/60 (56%), Positives = 45/60 (75%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYD R +I++G R +L S IHYPR+ PEMWPD+I K++ GG +VI TY FWN HE ++
Sbjct: 56 VSYDGRPLIVNGKRELLFSGSIHYPRSIPEMWPDIIXKARHGGLNVIHTYAFWNLHEPVQ 115
>gi|281202334|gb|EFA76539.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
PN500]
Length = 611
Score = 339 bits (869), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 228/670 (34%), Positives = 346/670 (51%), Gaps = 79/670 (11%)
Query: 169 EEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALG 228
E RF+ K + E F+ GGPIIM Q+ENEYG ++ YG+ G Y +W+A +A
Sbjct: 2 ESWMRFITKYL-----ERHFAANGGPIIMSQVENEYGWVQERYGESGTKYAQWSARLAQS 56
Query: 229 LGAGVPWVMCKQTDAPENIIDACNGYYC----DGYKPNSYNKPTLWTENWDGWYTTWGGR 284
L GVPW+MC+Q D +++I+ CNG+YC +G+ N+P +TENW GW+ W
Sbjct: 57 LNVGVPWIMCQQDDI-DSVINTCNGFYCHDWIEGHWARYPNQPAFFTENWPGWFQQWKQS 115
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
PHRPVED+ +AV +F RGGS MNYYM+ GGTNFGRTS P + SYDYDA +DEYG
Sbjct: 116 TPHRPVEDVLYAVGNWFARGGSLMNYYMWHGGTNFGRTS-SPMVVNSYDYDAALDEYGNP 174
Query: 345 SEPKWGHLKDLHAAI-KLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
SEPK+ H + + K L A + + LG + + Y +G +S +FL N
Sbjct: 175 SEPKYSHAAKFNNLLQKYSHIFLNAPEIPRSEYLGGSSSIYHY---TFGGES--LSFLIN 229
Query: 404 IDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVS--SQTSIKTVEFSLPLSPNI 461
E + + GQ++ + PWSV +L + +TVF++A S+ ++ + FS S N
Sbjct: 230 NHESALNDIVWNGQNHIIKPWSVHLLYN-NHTVFDSAATPEVSKLAMTSKRFSPVNSFNN 288
Query: 462 SVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIY 521
+ Q + E ++ ++ S +P LE L++T D +DYLW++T+I
Sbjct: 289 AYISQWVEEIDMTDSTWS----SKP--------------LEQLSLTHDKTDYLWYVTEIN 330
Query: 522 VSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQS----GY 577
+ + TN + DVL +I+G+ ++ P +S G+
Sbjct: 331 LQVRGAEVFTTN----------VSDVLHAYIDGKYQSTIWS-----ANPFNIKSDIPLGW 375
Query: 578 NDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI 637
+ L +L+ +G+Q+Y +EK G G + + G D++ W+ + + GE I
Sbjct: 376 HKLQILNSKLGVQHYTVDMEKVTGGLLGNIWV-----GGTDITNNGWSMKPYVNGERLAI 430
Query: 638 YSIEE-NEAEWTDLTRDGIPSTFTWYKTYF---DAPDGIDPVALDLGSMGKGQAWVNGHH 693
Y+ + +W+ + G+ TWYK F +P+ +L++ M KG W+NG H
Sbjct: 431 YNPNNIFKVDWSSFS--GVQQPLTWYKINFLHELSPN--KHYSLNMSGMNKGMIWLNGKH 486
Query: 694 IGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEE 753
+ RYW KG + C Y+G Y C+TNCG P+Q YH+P+ WL NLLVIFEE
Sbjct: 487 VARYWIT---KGWGCNGCSYQGGYTDQLCSTNCGEPSQINYHLPQDWLIEGANLLVIFEE 543
Query: 754 TGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHLHCQDGY 813
GGNP S+KL + + P + N +DG+ S+N A M+ H Y
Sbjct: 544 VGGNP--KSIKLEEKESAYQYKNRKGDPNFQ---NGMPIDGESSMND-ARSMYTHITLIY 597
Query: 814 IISSIEFASY 823
+ +I SY
Sbjct: 598 VYCAIIAISY 607
>gi|356503083|ref|XP_003520341.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 15-like [Glycine
max]
Length = 482
Score = 338 bits (866), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 151/315 (47%), Positives = 205/315 (65%), Gaps = 5/315 (1%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYD + II+ + ++ S +HYP +T ++WP + + K GG D IE+Y+FW+ HE +R
Sbjct: 9 VSYDAHSHIINEEKHIIFSGVVHYPXSTVDLWPAIFKRXKYGGLDAIESYIFWDRHEPVR 68
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
+Y+ G D + F+KL+ + LY LRIGPYVC WNFGGF +WL ++P IE R +N
Sbjct: 69 REYDCSGNLDFIDFLKLIQEAELYFILRIGPYVCEXWNFGGFSLWLHNMPEIELRIDNPI 128
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
K EMQ F KIV++ +E LF+ GGPII+ IENEYGN+ + Y + K Y+KW A MA
Sbjct: 129 XKNEMQIFTTKIVNMAKEAKLFAPXGGPIILTPIENEYGNIMTDYREARKPYIKWCAQMA 188
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
L GVPW+MC DAP+ +I+ CNG+YCD + PN+ ++ + WG R+P
Sbjct: 189 LTQNIGVPWIMCXXRDAPQPMINTCNGHYCDSFXPNNPKSSKMFRX-----FQKWGERVP 243
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSE 346
H+ E+ F+VARFFQ GG NYYMY GGTNFG GGP+ SY+YDAP+DEYG L++
Sbjct: 244 HKSAEESTFSVARFFQSGGILNNYYMYHGGTNFGHMVGGPYMTASYEYDAPLDEYGNLNK 303
Query: 347 PKWGHLKDLHAAIKL 361
PKW H K LH +
Sbjct: 304 PKWEHFKQLHKELTF 318
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 1/46 (2%)
Query: 666 FDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV-VAPKGGCQDT 710
F+AP GIDP+ +DL GK QAWVNG IG YW+ + GC+ T
Sbjct: 363 FEAPFGIDPMVMDLQDSGKRQAWVNGKSIGCYWSSWITNTNGCKIT 408
Score = 40.0 bits (92), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 20/39 (51%), Positives = 23/39 (58%)
Query: 809 CQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
CQ G IS I+FAS+G P+G C F G A S SVV
Sbjct: 425 CQIGKTISQIQFASFGNPEGNCGSFKGGTWEATDSQSVV 463
>gi|255563859|ref|XP_002522930.1| beta-galactosidase, putative [Ricinus communis]
gi|223537857|gb|EEF39473.1| beta-galactosidase, putative [Ricinus communis]
Length = 450
Score = 329 bits (844), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 195/501 (38%), Positives = 270/501 (53%), Gaps = 67/501 (13%)
Query: 200 IENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGY 259
IENEYGN+E+++ ++G YV WAA MA+ L GVPW+MCKQ DAP+ +I+ CNG C
Sbjct: 1 IENEYGNIEAAFHEKGSSYVHWAAKMAVDLQTGVPWIMCKQIDAPDPVINTCNGMKCGET 60
Query: 260 --KPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGT 317
PNS NKP+LWTENW +Y +GG R +D+AF VA F + GS++NYYMY GGT
Sbjct: 61 FGGPNSPNKPSLWTENWTSFYQVYGGEPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGT 120
Query: 318 NFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKL 377
NFGRT+ + IT Y AP+DEYGL+ +PKWGHLK+LHA IK C L+ + +
Sbjct: 121 NFGRTAAA-YVITGYYDQAPLDEYGLIRQPKWGHLKELHAVIKSCSTTLLEGVQTN-LSV 178
Query: 378 GQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVF 437
GQ Q+A+++ A G C AFL N D A+V F +S+ L P S+SILPDC N +F
Sbjct: 179 GQLQQAYMFEAQGGG----CVAFLVNNDS-VNATVGFRNKSFELLPKSISILPDCDNIIF 233
Query: 438 NTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTV 497
NTAKV++ ++ + S L+ +W + I +S++
Sbjct: 234 NTAKVNAGSNRRITTSSKKLN--------------------TWEKYIDVIPNYSDSTIKS 273
Query: 498 QGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLT 557
+LEH+N TKD SDYLW+ + ++S +P + ++S+ V F+N + +
Sbjct: 274 DTLLEHMNTTKDKSDYLWY---TFSFQPNLSC-----TKPLLHVESLAHVAYAFVNNKYS 325
Query: 558 GSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI 617
GS G ++G I+ V L+ DG + +I
Sbjct: 326 GSAHGS----------KNGKVPFIMEVPIV--------LDDDGL------------SNNI 355
Query: 618 DLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVAL 677
+ +L VGL GE Q+Y E E I TW+K FD P G DPV L
Sbjct: 356 SILSVLVGLSVGLLGETLQLYGKEHLEMVKWSKADISIAQPLTWFKLEFDTPKGNDPVVL 415
Query: 678 DLGSMGKGQAWVNGHHIGRYW 698
+L +M KG+AWVNG IGRYW
Sbjct: 416 NLATMSKGEAWVNGQSIGRYW 436
>gi|348687417|gb|EGZ27231.1| hypothetical protein PHYSODRAFT_553859 [Phytophthora sojae]
Length = 825
Score = 328 bits (842), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 255/781 (32%), Positives = 371/781 (47%), Gaps = 109/781 (13%)
Query: 45 FNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHES 104
++VSY R IDG R +L+ IHYPR++ W L+ +K G + IE YVFWN HE
Sbjct: 85 YSVSYSARGFEIDGRRTLLLGGSIHYPRSSEGEWETLLRAAKRDGLNHIEMYVFWNLHEQ 144
Query: 105 IRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNN 164
RG +NF G + +F +L GL+L +R GPYVCAEW+ GG P+WL IPG++ R++N
Sbjct: 145 ERGVFNFAGNANATRFYELAAEVGLFLHVRFGPYVCAEWSNGGLPLWLNWIPGMKVRSSN 204
Query: 165 APFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAAS 224
AP++ EM+RFV +V+L R + GGPIIM QIENE + +YV+W
Sbjct: 205 APWQWEMERFVTYMVELSRP--FLAKNGGPIIMAQIENE-------FAMHDPEYVEWCGD 255
Query: 225 MALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNK----PTLWTENWDGWYTT 280
+ L +PWVMC +A EN I +CNG C + + P +WTE+ +GW+ T
Sbjct: 256 LVKRLDTSIPWVMC-YANAAENTILSCNGNDCVDFAVKHVKERPSDPLVWTED-EGWFQT 313
Query: 281 WG----GRLPH--RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDY 334
W LP+ R ED+A+AVAR+F GG+ NYYMY GG NFGR + T Y
Sbjct: 314 WAKDKKNPLPNDQRTAEDMAYAVARWFAVGGAAHNYYMYHGGNNFGRAASAGV-TTKYAD 372
Query: 335 DAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIK-------LGQNQEAHVY- 386
+ GL +EPK HL+ LH A+ C L+ D Q + G+ EA
Sbjct: 373 GVNLHSDGLSNEPKRSHLRKLHEALIDCNDILMRNDR-QLLHPHELAPTHGETAEASSLQ 431
Query: 387 -RANRYGSQS--NCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVS 443
RA YG++ N AFL N +V F Y L P S+ I+ D +FNTA V
Sbjct: 432 QRAFIYGAEDGPNQVAFLEN-QADKKVTVVFRDNKYELAPTSMMIIKDG-ALLFNTADVR 489
Query: 444 SQTSIKTVEFSLPLSPNISVPQQSMIESKLSS-TSKSWMTVKEPIGVWSENNFTVQGILE 502
P+ ++ ++ E +SS T + + + P+ E
Sbjct: 490 KSFPGTVHRAYTPIVQAATLQWETWSELNVSSLTPRRRVVAERPV--------------E 535
Query: 503 HLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG 562
L +T D SDYL + T V D ++ + F++G L G
Sbjct: 536 QLRLTADRSDYLTYETTFTVDPADTPIDIDSDASTVKVTSCEASSIIAFVDGWLIGERNL 595
Query: 563 HWVKVVQPVEFQ---------SGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFK 613
+ EF+ + + L L+S ++G+ + G+ K G G+V++ G K
Sbjct: 596 AYPGGNCSKEFRFSLPTNIDVTRQHSLKLVSVSLGIYSLGSNHTK---GLTGKVRV-GRK 651
Query: 614 NGDIDLSK-ILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTR--DGIPSTFTWYKT----- 664
N L+K W L GE +IY E + WT + R +WY T
Sbjct: 652 N----LAKGHQWEMYPTLVGEQLEIYRPEWLSSVPWTPVPRVVASGRQLMSWYWTSFSYP 707
Query: 665 YFDAPDGIDPVA------LDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYN 718
F+ P DPV+ LD + +G+A++NGH +GRYW V
Sbjct: 708 AFELPAEADPVSEPFSILLDCIGLTRGRAYINGHDLGRYWLV------------------ 749
Query: 719 SDKCTTNCGNPTQTWYHVPRSWL-QASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSE 777
+ G Q +YHVPR WL + N+LV+F+E GG+ + V+L S+ +V + V +
Sbjct: 750 -----NDEGEFVQRYYHVPRDWLVKDQANVLVVFDELGGSVAD--VRLVSSSMVPDAVGD 802
Query: 778 S 778
+
Sbjct: 803 A 803
>gi|56550181|emb|CAE51356.1| putative beta-galactosidase [Musa AAB Group]
Length = 282
Score = 325 bits (834), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 165/291 (56%), Positives = 199/291 (68%), Gaps = 9/291 (3%)
Query: 142 EWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIE 201
EWNFGGFPVWL+ +PGI FRT+N PFK M +F +KIV +M+ E LF QGGPII+ QIE
Sbjct: 1 EWNFGGFPVWLKYVPGINFRTDNGPFKAAMAKFTEKIVAMMKSEGLFESQGGPIILSQIE 60
Query: 202 NEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKP 261
NEYG +E G K+Y+ WAA MA+GL GVPWVMCKQ DAP+ +I+A NG+YCD + P
Sbjct: 61 NEYGPVEYYGGAAAKNYLSWAAQMAVGLNTGVPWVMCKQDDAPDPVINAGNGFYCDYFSP 120
Query: 262 NSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGR 321
NS T + W G + V F V + + G F NYYMY GGTNFGR
Sbjct: 121 NSLK--TFFGGLKLDWLVPVSGSSSSQTVRT-GFCV-QVYTEGWIFRNYYMYHGGTNFGR 176
Query: 322 TSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQ 381
T+GG F TSYDYDAPIDEY LL +PKWGHL+DLH AIK+CEPALV+ D KLG Q
Sbjct: 177 TAGGLFISTSYDYDAPIDEYVLLRQPKWGHLRDLHKAIKMCEPALVSGDPT-VTKLGNYQ 235
Query: 382 EAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDC 432
EAHVYR+ +C+AFL+N + H+ ASVTF G Y +P WS+SILPDC
Sbjct: 236 EAHVYRSK----SGSCAAFLSNFNPHSYASVTFNGMKYNIPSWSISILPDC 282
>gi|217070894|gb|ACJ83807.1| unknown [Medicago truncatula]
Length = 283
Score = 325 bits (833), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 168/302 (55%), Positives = 206/302 (68%), Gaps = 21/302 (6%)
Query: 225 MALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGR 284
MA L GVPW+MC+Q +AP+ II+ CN +YCD + PNS NKP +WTENW GW+ +GG
Sbjct: 1 MATSLDTGVPWIMCQQANAPDPIINTCNSFYCDQFTPNSDNKPKMWTENWSGWFLAFGGA 60
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
+P+RPVEDLAFAVARFFQRGG+F NYYMY GGTNFGRT+GGPF TSYDYDAPIDEYG +
Sbjct: 61 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFGRTTGGPFISTSYDYDAPIDEYGDI 120
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANI 404
+PKWGHLKDLH AIKLCE AL+A+D G N E VY+ + + CSAFLANI
Sbjct: 121 RQPKWGHLKDLHKAIKLCEEALIASDPT-ITSPGPNLETAVYK-----TGAVCSAFLANI 174
Query: 405 DEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVP 464
+ A+VTF G SY LP WSVSILPDC+N V NTAKV++ + I S
Sbjct: 175 G-MSDATVTFNGNSYHLPGWSVSILPDCKNVVLNTAKVNTASMIS------------SFA 221
Query: 465 QQSMIES--KLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYV 522
+S+ E L S+S W + EP+G+ + + FT G+LE +N T D SDYLW+ I
Sbjct: 222 TESLKEKVDSLDSSSSGWSWISEPVGISTPDAFTKSGLLEQINTTADRSDYLWYSLSIVY 281
Query: 523 SD 524
D
Sbjct: 282 ED 283
>gi|325183103|emb|CCA17560.1| betagalactosidase putative [Albugo laibachii Nc14]
Length = 811
Score = 325 bits (832), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 243/766 (31%), Positives = 356/766 (46%), Gaps = 111/766 (14%)
Query: 45 FNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHES 104
++V Y R +IDG +L+ IHY R+TP+ W L+AK+KE G ++++ Y+FWN HE
Sbjct: 97 YDVKYTKRGFVIDGKASILLGGSIHYARSTPDTWDSLLAKAKEDGLNLVQLYIFWNFHEP 156
Query: 105 IRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNN 164
RG + F + ++ F + V + GL++ LR GPYVCAEWN GG P+WL IPG++ R+N+
Sbjct: 157 RRGSFYFADRGNLTHFFERVVAHGLFVHLRFGPYVCAEWNRGGLPLWLDRIPGMKVRSNS 216
Query: 165 APFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAAS 224
+++EM R + +++L R FS GGPIIM QIENEY + +Y V W +
Sbjct: 217 ESWRQEMNRIILIMINLARP--YFSVNGGPIIMAQIENEYNGHDPTY-------VAWLSQ 267
Query: 225 MALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSY----NKPTLWTENWDGWYTT 280
+ LG G+PW MC A N I CN C + + ++P +WTEN + WY
Sbjct: 268 LVRKLGIGIPWTMCNGASAV-NTISTCNDNDCFQFAEKNAKVFPSQPLVWTEN-EAWYEK 325
Query: 281 WG-------GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYD 333
W G+ R E +A+ VAR+F GG+ NYYMY GG NFGRT+ T Y
Sbjct: 326 WATKNIAQDGQNDQRSPEQVAYVVARWFAVGGAMHNYYMYHGGNNFGRTASAGV-TTMYA 384
Query: 334 YDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSA--QYIKLG-QNQEAHVYRANR 390
A + GL +EPK HL+ LH + C AL++ + LG + + A+ RA
Sbjct: 385 DGAILHHDGLDNEPKRSHLRKLHHTLIRCNKALLSNERQLNHAKPLGPEGKNAYTQRAYI 444
Query: 391 YGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKT 450
YG NCS FL N A + + Y LPP ++ IL D N ++NT+ VS ++
Sbjct: 445 YG---NCS-FLENTHAIHRACFRYQLKEYCLPPQTIVIL-DHNNVLYNTSDVSGTLGSRS 499
Query: 451 VEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQ------GILEHL 504
PL + W E W N V+ LE L
Sbjct: 500 TRSFSPL---------------IRFRKSDWKIWSE----WDVNPHNVRDQIVNDSPLEQL 540
Query: 505 NVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVT--IDSMRDVLRVFINGQLTGS--- 559
VT+D +DYL + ++ + + N+++ ++ I + VFING+ G
Sbjct: 541 LVTQDTTDYLMYQNEVRWGSNGPT---KNKMKSSILKFISCDANSFLVFINGEFIGEQHL 597
Query: 560 ------VIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFK 613
+ + P+ L +LS ++G+ + G EK G V++ +
Sbjct: 598 AYPGDDCSNIFRFDLGPLGKYGANLTLSILSISLGIHSLG---EKHQKGIVSDVQID--E 652
Query: 614 NGDIDLSKILWTYQVGLKGEFQQIYS-IEENEAEWTDL-TRDGIPSTFTWYKTYFDAP-- 669
+ W GL GE ++Y + N W +L + T WY T F
Sbjct: 653 RSLVYGPHERWVMFSGLIGELLKLYDPMWSNSVPWRNLNVQTDRKRTSKWYMTKFVLKQL 712
Query: 670 --DGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCG 727
D V LD M +G+ ++NGH +GRYW + GAY
Sbjct: 713 DWDTETSVLLDCKGMNRGRIYLNGHDLGRYWLIRRSD----------GAY---------- 752
Query: 728 NPTQTWYHVPRSWLQASN--NLLVIFEETGGNPFEISVKLRSTRIV 771
Q +Y +P +WL A+N N LVIFEE E S RIV
Sbjct: 753 --VQRYYTIPVAWLHAANKSNYLVIFEELRNETIE------SMRIV 790
>gi|300121971|emb|CBK22545.2| unnamed protein product [Blastocystis hominis]
Length = 721
Score = 319 bits (818), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 230/761 (30%), Positives = 363/761 (47%), Gaps = 118/761 (15%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
KP+ V+YD R+ +DG R + ++ +HYPRATPEMW ++ ++ E G ++I+ Y FWN H
Sbjct: 31 KPYKVTYDERSFFLDGKRSIFLAGSVHYPRATPEMWDTILDQAVEDGLNLIQIYTFWNLH 90
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E ++GQYN++G DI F++ GL++ +RIGPYVCAEW+ GG PVW+ + G+ R
Sbjct: 91 EPVKGQYNWEGIADIRLFLQKCADRGLFVNMRIGPYVCAEWDNGGIPVWVNYLDGVRLRA 150
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWA 222
NN +K+EM ++K + D R+ F+ +GGPII QIENE +G ++Y+ W
Sbjct: 151 NNDVWKKEMGDWMKVLTDYTRD--FFADRGGPIIFSQIENEL------WG-GAREYIDWC 201
Query: 223 ASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNS-------YNKPTLWTENWD 275
A L VPW+MC D E I+ACNG C Y + ++P WTEN +
Sbjct: 202 GEFAESLELNVPWMMC-NGDTSEKTINACNGNDCSSYLESHGQSGRILVDQPGCWTEN-E 259
Query: 276 GWYTTWGGRLPH---------RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGP 326
GW+ G R ED F V +F RGGS+ NYYM+FGG ++G+ +G
Sbjct: 260 GWFQIHGAASAERDDYEGWDARSAEDYTFNVLKFMDRGGSYHNYYMWFGGNHYGKWAGNG 319
Query: 327 FYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVY 386
Y I L +EPK H +H + L+ D AQ N + H+
Sbjct: 320 M-TNWYTNGVMIHSDTLPNEPKHSHTAKMHRMLANIAEVLL-NDKAQV-----NNQKHLN 372
Query: 387 RAN------RYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTA 440
N RYG + +F+ N ++ +A V + Y LP WS+ +L + N +F T
Sbjct: 373 CDNCNAFEYRYGDR--LVSFVEN-NKGSADKVIYRDIVYELPAWSMIVLDEYDNVLFETN 429
Query: 441 KVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGI 500
V P++ + + +E + + S ++ + P V S
Sbjct: 430 NVK------------PVNKHRVYHCEEKLEFEYWNEPVSTLSQEAPRVVVSPK------A 471
Query: 501 LEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMR-DVLRVFINGQLTGS 559
E LN+T+D +++L++ T++ D+ T++I + +++ GS
Sbjct: 472 NEQLNMTRDLTEFLYYETEVEFPQDEC----------TLSIGGTDANAFVAYVDDHFVGS 521
Query: 560 VIGH-----WVKVVQPVEFQSGYNDLILLSQTVGLQN-YGAFLEKDGAGFR-----GQVK 608
H W + ++ G + L+LLS+++G+ N + L+ A R G +K
Sbjct: 522 DDEHTHHDGWHTMNINMKSGKGKHKLVLLSESLGVSNGMDSNLDPSWASSRLKGICGWIK 581
Query: 609 LTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFD 667
L G D+ W + GL GE +Q+++ E W + WY++ F
Sbjct: 582 LCGN-----DIFNQEWKHYPGLVGEAKQVFTDEGMKTVTWKSDVENA--DNLAWYRSTFK 634
Query: 668 APDGID---PVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTT 724
P G+ V L M +GQA+VNGH+IGRYW + D G Y
Sbjct: 635 TPQGLKRGIEVLLRPEGMNRGQAYVNGHNIGRYWMIK----------DGNGEY------- 677
Query: 725 NCGNPTQTWYHVPRSWL--QASNNLLVIFEETGGNPFEISV 763
TQ +YH+P+ WL + N+LV+ E G + +++
Sbjct: 678 -----TQGYYHIPKDWLKGEGEENVLVLGETLGASDPSVTI 713
>gi|320129049|gb|ADW19770.1| beta-galactosidase [Fragaria chiloensis]
Length = 219
Score = 317 bits (813), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 142/219 (64%), Positives = 172/219 (78%)
Query: 76 EMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRI 135
EMWPDLI ++K+GG DVI+TYVFWN HE G+Y F+ D+VKF+KLV +GLY+ LRI
Sbjct: 1 EMWPDLIQRAKDGGLDVIQTYVFWNGHEPSPGKYYFEDNYDLVKFIKLVQQAGLYVHLRI 60
Query: 136 GPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPI 195
GPYVCAEWNFGGFPVWL+ IPGI+FRT+N PFK++MQRF KIV++M+ E LF GGPI
Sbjct: 61 GPYVCAEWNFGGFPVWLKYIPGIQFRTDNGPFKDQMQRFTTKIVNMMKAERLFESHGGPI 120
Query: 196 IMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYY 255
I+ QIENEYG ME G GK Y WAA MA+GLG GVPWVMCKQ DAP+ +I+ACNG+Y
Sbjct: 121 ILSQIENEYGPMEYEIGAPGKAYTDWAAQMAVGLGTGVPWVMCKQDDAPDPVINACNGFY 180
Query: 256 CDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLA 294
CD + PN KP +WTE W GW+T +GG +P+RP EDLA
Sbjct: 181 CDYFSPNKAYKPKMWTEAWTGWFTEFGGAVPYRPAEDLA 219
>gi|356554933|ref|XP_003545795.1| PREDICTED: beta-galactosidase 15-like [Glycine max]
Length = 288
Score = 310 bits (794), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 145/196 (73%), Positives = 156/196 (79%)
Query: 205 GNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSY 264
G +E+ YG+ GK+Y KWAA AL LG GVPWVMC+Q DAP +IID CN YYCDG+KPNS+
Sbjct: 42 GAIENEYGKGGKEYRKWAAKKALSLGVGVPWVMCRQQDAPYDIIDTCNAYYCDGFKPNSH 101
Query: 265 NKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG 324
NKPT+WTENWDGWYT WG RLPHRPVEDLAFAVA FFQRGGSF NYYMYFG TNFGRT+G
Sbjct: 102 NKPTMWTENWDGWYTQWGERLPHRPVEDLAFAVACFFQRGGSFQNYYMYFGRTNFGRTAG 161
Query: 325 GPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAH 384
GP ITSYDY A IDEYG L EPKWGHLKDLHAA+KLCEPALVA DS YIKLG NQE
Sbjct: 162 GPLQITSYDYVASIDEYGQLREPKWGHLKDLHAALKLCEPALVATDSPTYIKLGPNQEIG 221
Query: 385 VYRANRYGSQSNCSAF 400
R QS AF
Sbjct: 222 TLSMLRSRFQSLPGAF 237
>gi|452821358|gb|EME28389.1| beta-galactosidase [Galdieria sulphuraria]
Length = 1171
Score = 303 bits (777), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 181/506 (35%), Positives = 263/506 (51%), Gaps = 25/506 (4%)
Query: 61 RMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKF 120
R+L A IHYPR P W LI +KE G + IETYVFWN HE +G Y+F G+ D+ F
Sbjct: 476 RILFPASIHYPRCQPSDWQQLIEFAKEAGINCIETYVFWNQHEKEKGVYDFSGRLDLFGF 535
Query: 121 VKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVD 180
++ + +GLY LRIGPY+CAE +FGGFP WLRDI GIEFRT N PF+ E R+V+ +V+
Sbjct: 536 IRTIAKAGLYALLRIGPYICAETHFGGFPHWLRDIDGIEFRTQNEPFQRESSRWVRFLVE 595
Query: 181 LMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQ 240
+ F QGGPI+M+Q ENEY + +YG+ G +Y+KW + +A L VP MCK
Sbjct: 596 KLNSNNCFYSQGGPIVMVQFENEYKLIGQNYGEAGLNYLKWCSELAKDLQLPVPLFMCK- 654
Query: 241 TDAPENIIDACNGYYCDGYKPNSY----NKPTLWTENWDGWYTTWGGRLPHRPVEDLAFA 296
+ EN+++ N +Y N + N+P +WTE W GWY WG RP +DL +A
Sbjct: 655 -GSIENVLETINDFYGHQEMENHHREYPNQPAIWTECWTGWYDVWGSAHHIRPCKDLFYA 713
Query: 297 VARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLH 356
V RFF +GG +NYYM+ GGTN+ + + TSYDYDAPIDEYG ++ +G L+ +H
Sbjct: 714 VLRFFAQGGKGINYYMFHGGTNYDQLAMY-LQTTSYDYDAPIDEYGRKTKKYFG-LQYIH 771
Query: 357 AAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLG 416
++ +L A ++ ++ G SNC F N + V +
Sbjct: 772 RQLEQHFASLALKLEAPIAHSYEDNYVWIFIWEEQG--SNC-IFFCNDHPTSTKQVQWKE 828
Query: 417 QSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSST 476
Q Y L P SV ++ D + + ++ + E P+S + + +T
Sbjct: 829 QEYCLAPLSVQMVVDHHRLILKSDQLFVDEELIQKELK-PISVTTEEWTWQYYKENIPTT 887
Query: 477 SKSWMTVKEPIGVWSENNFTV--QGILEHLNVTKDYSDYLWHIT------QIYVSDDDIS 528
+ + +N + Q +E L T +DY W+I QI + DD
Sbjct: 888 DITSSASQSSSISSLSSNTEIETQVPVEMLRYTGTATDYAWYIAHYQIDPQIEWTSDDAL 947
Query: 529 FWKTNEVRPTVTIDSMRDVLRVFING 554
W +V D ++V++NG
Sbjct: 948 EWVGGQVDLEAA-----DYVQVYVNG 968
>gi|452819191|gb|EME26260.1| beta-galactosidase [Galdieria sulphuraria]
Length = 652
Score = 301 bits (771), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 175/487 (35%), Positives = 266/487 (54%), Gaps = 46/487 (9%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
V++D RA++IDG R +L HYP+ E WP + +K+ G + +E Y+FWN HE
Sbjct: 5 QVTFDKRAVVIDGKRTILYCGSYHYPKIHYEHWPQALELAKDCGLNCLEVYIFWNVHEKK 64
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+G Y+F+ + +I +F++L GL + LR+GPY+CAE ++GGFP WLR+IPGIEFRT N
Sbjct: 65 KGVYHFEREGNIFRFLQLAQERGLKVILRMGPYICAETSYGGFPYWLREIPGIEFRTYNE 124
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PF +EM+R++ I +++E L+ +GGPII++QIENEY + S YG G+ Y+ W
Sbjct: 125 PFMKEMKRWLTDINRMLKENKLYHQKGGPIILVQIENEYDIVSSIYGAAGQKYLHWC--Y 182
Query: 226 ALGLGAGVPWVMCKQTD-----APENIIDACNGYY----CDGYKPNSYNKPTLWTENWDG 276
L W+ K ++ + + I+ N +Y D K ++P LWTE W G
Sbjct: 183 ELYKEGASEWLTSKDSEYFRVASIDKSIETINDFYGHRRIDSLKALKPHQPLLWTEFWIG 242
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY--ITSYDY 334
WY W G RPV+D+ +A ARF +GGS MNYYM+ GGT+FG + Y T YD+
Sbjct: 243 WYNIWRGAQRQRPVDDVIYAAARFIAQGGSGMNYYMFHGGTHFGNLA---MYGQTTGYDF 299
Query: 335 DAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQ 394
DAP+D YG +E K+ LK L+ + E L++ D + KL N +VYR S
Sbjct: 300 DAPVDSYGRPTE-KFERLKQLNHCLSNLEYILLSQDEPEVQKLTPN--VNVYRWKDIESG 356
Query: 395 SNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFS 454
CS F+ N D+ + + V ++ L P SV I + VF++++ S S K+
Sbjct: 357 DECS-FVCN-DQRSQSYVIVAERAVCLKPLSVKIYLN-HEEVFDSSQNSYNVSQKSYH-- 411
Query: 455 LPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENN-----FTVQGILEHLNVTKD 509
+L W T++ PI + + F+ I + L++T+D
Sbjct: 412 -----------------RLDYVCNEWKTMQIPIPSKEKKDKEHFEFSFPHIPDMLHITQD 454
Query: 510 YSDYLWH 516
+DY+W+
Sbjct: 455 ETDYMWY 461
>gi|68161830|emb|CAJ09952.1| beta-galactosidase [Mangifera indica]
Length = 362
Score = 292 bits (748), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 166/384 (43%), Positives = 220/384 (57%), Gaps = 32/384 (8%)
Query: 377 LGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTV 436
LG NQE HV+ + GS C+AFLAN D ++A V F Y LPPWS+SILPDC+ V
Sbjct: 5 LGNNQEVHVFNP-KSGS---CAAFLANYDTTSSAKVNFQNMQYELPPWSISILPDCKTAV 60
Query: 437 FNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFT 496
FNTA++ +Q+S+K ++P + QS IE SS+ + FT
Sbjct: 61 FNTARLGAQSSLKQ------MTPVSTFSWQSYIEESASSS--------------DDKTFT 100
Query: 497 VQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQL 556
G+ E LNVT+D SDYLW++T I + D + F K N P +TI S L VFINGQL
Sbjct: 101 TDGLWEQLNVTRDASDYLWYMTNINI-DSNEGFLK-NGQDPLLTIWSAGHALHVFINGQL 158
Query: 557 TGSVIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGF 612
+G+V G + Q V+ + G N L LLS +VGLQN G E+ G G V L G
Sbjct: 159 SGTVYGGVDNPKLTFSQNVKMRVGVNQLSLLSISVGLQNVGTHFEQWNTGVLGPVTLRGL 218
Query: 613 KNGDIDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDG 671
G DLSK W+Y++GLKGE ++++ + EW + + TWYKT F+AP G
Sbjct: 219 NEGTRDLSKQQWSYKIGLKGEDLSLHTVSGSSSVEWVEGSSLAQKQPLTWYKTTFNAPAG 278
Query: 672 IDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQ 731
+P+ALD+ +MGKG W+N IGR+W G C + C+Y G Y KC TNCG P+Q
Sbjct: 279 NEPLALDMSTMGKGLIWINSQSIGRHWPGYIAHGSCGE-CNYAGTYTDKKCHTNCGQPSQ 337
Query: 732 TWYHVPRSWLQASNNLLVIFEETG 755
WYHVPRSWL + NLLV+ + G
Sbjct: 338 RWYHVPRSWLNPTGNLLVVLKRVG 361
>gi|3388167|gb|AAC28739.1| beta-galactosidase [Carica papaya]
Length = 203
Score = 283 bits (725), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 129/204 (63%), Positives = 157/204 (76%), Gaps = 1/204 (0%)
Query: 71 PRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLY 130
PR+TPEMWPDLI +KEGG DVI+TYVFWN HE G Y F+ + D VKF+KLV +GLY
Sbjct: 1 PRSTPEMWPDLIQNAKEGGLDVIQTYVFWNGHEPSPGNYYFEDRYDPVKFIKLVHQAGLY 60
Query: 131 LQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSW 190
+ LRIGPY+C EWNFGGFPVWL+ +PGI+FRT+N PFK +MQ+F +KIV++M+ E LF
Sbjct: 61 VHLRIGPYICGEWNFGGFPVWLKYVPGIQFRTDNGPFKAQMQKFTEKIVNMMKAEKLFEP 120
Query: 191 QGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDA 250
QGGP IM QIE EYG + G GK Y KWAA MA+GLG GVPW+MCKQ DAP+ IID
Sbjct: 121 QGGP-IMSQIEIEYGPIGWEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQEDAPDPIIDT 179
Query: 251 CNGYYCDGYKPNSYNKPTLWTENW 274
CNG+YC+ + PN+ KP +WTE W
Sbjct: 180 CNGFYCENFMPNANYKPKMWTEAW 203
>gi|414879451|tpg|DAA56582.1| TPA: hypothetical protein ZEAMMB73_811947 [Zea mays]
Length = 249
Score = 282 bits (722), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 124/203 (61%), Positives = 163/203 (80%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
V+YD RA+I+DG RRML S +HYPR+TPEMWPDLIAK+K+GG DVI+TYVFWNAHE +
Sbjct: 37 EVTYDGRALILDGARRMLFSGDMHYPRSTPEMWPDLIAKAKKGGLDVIQTYVFWNAHEPV 96
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+GQ+NF+G+ D+VKF++ + + GLY+ LRIGP+V +EW +GG P WLR IP I FR++N
Sbjct: 97 QGQFNFEGRYDLVKFIREIHAQGLYVSLRIGPFVESEWKYGGLPFWLRGIPNITFRSDNE 156
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK MQ+FV KIV+LM++E LF QGGPII+ QIENEY +E+++ +G YV WAA+M
Sbjct: 157 PFKRHMQKFVTKIVNLMKDERLFYPQGGPIIISQIENEYKLVEAAFHSKGSSYVHWAAAM 216
Query: 226 ALGLGAGVPWVMCKQTDAPENII 248
A+ L GVPW+MCKQ DAP+ I+
Sbjct: 217 AVNLQTGVPWMMCKQDDAPDPIV 239
>gi|217075721|gb|ACJ86220.1| unknown [Medicago truncatula]
Length = 208
Score = 279 bits (713), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 122/184 (66%), Positives = 154/184 (83%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NV+YDH+A++IDG RR+L+S IHYPR+TP+MWPDLI KSK+GG DVIETYVFWN HE +
Sbjct: 25 NVTYDHKALVIDGKRRVLMSGSIHYPRSTPQMWPDLIQKSKDGGIDVIETYVFWNLHEPV 84
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
RGQYNF+G+ D+V FVK+V ++GLY+ LRIGPYVCAEWN+GGFP+WL I GI+FRTNN
Sbjct: 85 RGQYNFEGRGDLVGFVKVVAAAGLYVHLRIGPYVCAEWNYGGFPLWLHFIAGIKFRTNNE 144
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK EM+RF KIVD+M++E L++ QGGPII+ QIENEYGN+++ + K Y+ WAASM
Sbjct: 145 PFKAEMKRFTAKIVDMMKQENLYASQGGPIILSQIENEYGNIDTHDARAAKSYIDWAASM 204
Query: 226 ALGL 229
A L
Sbjct: 205 ATSL 208
>gi|388518087|gb|AFK47105.1| unknown [Lotus japonicus]
Length = 220
Score = 275 bits (704), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 124/168 (73%), Positives = 140/168 (83%)
Query: 682 MGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWL 741
MGKGQAWVNGHHIGRYWT V+PK GC+ CDYRGAYNSDKCTTNCG PTQT YHVPRSWL
Sbjct: 1 MGKGQAWVNGHHIGRYWTRVSPKSGCEQVCDYRGAYNSDKCTTNCGKPTQTLYHVPRSWL 60
Query: 742 QASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKM 801
+AS+NLLVIFEETGGNPF ISVKL S RIVC +VSESHY P+ K N+ + ++S N M
Sbjct: 61 KASDNLLVIFEETGGNPFRISVKLHSARIVCAKVSESHYQPLHKLMNADLIGHEVSANSM 120
Query: 802 APEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
PE+HL CQDG IISSI FASYG P+G CQ FSRGNCHAP S+++VS+
Sbjct: 121 IPELHLRCQDGRIISSITFASYGNPEGSCQSFSRGNCHAPSSMAIVSK 168
>gi|452825532|gb|EME32528.1| beta-galactosidase [Galdieria sulphuraria]
Length = 752
Score = 275 bits (704), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 222/797 (27%), Positives = 347/797 (43%), Gaps = 137/797 (17%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
S+D RAI ++G R +L+ + YP+ W + + +KE G + ++ YVFWN HE RG
Sbjct: 8 SFDSRAITLNGKRTLLLGGSLQYPKIHHTQWNNTLKLAKECGLNFLDIYVFWNVHEKKRG 67
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
+ F + DI +F+++ GL + LR+GPY+CAE ++GGFP WLR+IPGI+FRT N PF
Sbjct: 68 IFTFTEEADIFRFLQMAHQHGLLVMLRLGPYICAETSYGGFPCWLREIPGIQFRTYNDPF 127
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
E++R++ I L++E+ LF QGGPI+++Q+ENEY + +G+ Y+ W +
Sbjct: 128 MREVKRWLFYITTLLKEKRLFFPQGGPIVLVQLENEYDLVSKIQLSKGEQYLNWYNELYR 187
Query: 228 GLGAGVPWVMCKQTD-------------------APENIIDACNGYY----CDGYKPNSY 264
L VP +MC+ + + E I+ N +Y +
Sbjct: 188 ELAFDVPLIMCRSSPEEVGEFCSCSKEPELSTIASVETCIETFNSFYGHKKIADLRRRKP 247
Query: 265 NKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG 324
++P LWTE W GWY W R ED+ +A RF +GG+ +YYM+ GGT+F +
Sbjct: 248 HQPILWTEFWIGWYDIWTSAPRKRSTEDVIYAALRFIAQGGAGFSYYMFHGGTHFNNLAM 307
Query: 325 GPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAH 384
TSY +D+PIDEYG S + LK ++ + L++ D Q + L A
Sbjct: 308 YS-QTTSYYFDSPIDEYGRPSF-LFYMLKRINHILHQFSSHLLSQDHPQVLHLLPQVVAF 365
Query: 385 VYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTV-------- 436
+++ + SQ + S FL N D A + F + P SV++ +
Sbjct: 366 IWQ--EHSSQQSLS-FLCN-DSEQIAYIMFQQSMMKMNPLSVAVFLENELLFDSSSGYDW 421
Query: 437 ------FNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVW 490
F + + +KT + +P+ P LSS+
Sbjct: 422 QIPFRDFKPLERAYFRELKTFQLDIPIPP-------------LSSSCD------------ 456
Query: 491 SENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRV 550
+ + L+VT+D +DY+W+I+ + F + M D++ +
Sbjct: 457 ------FSQLPDMLSVTQDETDYMWYISSATLPVSSKEF----TCEKVLLQIEMADLIHL 506
Query: 551 FINGQLTGSVIGHWVKVVQPVEFQSGYNDL---ILLSQTVGLQNYGAFLEKDGAGFRGQV 607
FIN Q GS W+K + F +G N I +V Q F V
Sbjct: 507 FINQQYMGS---SWIK-IDDERFANGKNGFRFSIEFENSVYPQ--PVFSSNSKLYVSILV 560
Query: 608 KLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT------- 660
G G+ L K T + KG F+Q I + ++L + IP +FT
Sbjct: 561 CSLGLIKGEFQLWKGA-TMEKEKKGLFKQ--PIIHFVVKHSELETETIPLSFTSSWAMMP 617
Query: 661 ----------WYKTY----FDAPDGIDP--------------------VALDLGSMGKGQ 686
+ K Y D P + P + +D SM KG
Sbjct: 618 LSIMKDHQSAFVKEYNIKNVDKPLSLGPTYYKQTVIINKAMIDALKWGLVIDFSSMTKGI 677
Query: 687 AWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNN 746
N GRY+++ G +D NS + TQ +YH+P+ LQ N
Sbjct: 678 FRWNSFCCGRYYSIQV-LGKERDP----SLRNSPVQEDHLFKSTQRYYHIPKGVLQERNE 732
Query: 747 LLVIFEETGGNPFEISV 763
L V FEE GGN ++ +
Sbjct: 733 LEV-FEEIGGNFMQLRI 748
>gi|449018329|dbj|BAM81731.1| probable beta-galactosidase [Cyanidioschyzon merolae strain 10D]
Length = 777
Score = 275 bits (702), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 225/802 (28%), Positives = 351/802 (43%), Gaps = 116/802 (14%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHE-- 103
++YD R++ I+G +S +HY R+ P WP + + G + +ETYVFW HE
Sbjct: 9 EITYDSRSLRINGKPFFCLSGAVHYVRSHPSAWPQIFRCMRRDGLNTVETYVFWGDHEFE 68
Query: 104 -----SIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDI--- 155
+ +F G D+V+F++ GL LR+GPYVCAE N+GGFP WLR +
Sbjct: 69 PPEMPDAEPRADFSGPRDLVRFLRCAKLHGLNAILRLGPYVCAEVNYGGFPWWLRQVCEK 128
Query: 156 ---PGIEFRTNNAPFKEEMQRFVKKIVD-LMREEMLFSWQGGPIIMLQIENEYGNMESSY 211
+ FRT + + +++R++K +VD +++ +F+ QGGP+I+ QIENEY + SY
Sbjct: 129 GSSKPVRFRTWDPAYCAQVERWLKYLVDHVLKPARVFAPQGGPVILAQIENEYAMIAESY 188
Query: 212 GQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPEN--IIDACNGYYCDGY------KPNS 263
G G+ Y+ W AS+A L GVP VMC E+ +I+ N +Y + +
Sbjct: 189 GPDGQQYLDWIASLANQLALGVPLVMCYGASQRESGRVIETINAFYAHEHVESLRRAQGA 248
Query: 264 YNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTS 323
+P LWTE W GWY WG R DLA+AV RF GG+ +NYYMYFGGTN+ R +
Sbjct: 249 NPQPLLWTECWTGWYDVWGAPHHRRDAADLAYAVLRFLAAGGAGINYYMYFGGTNWRREN 308
Query: 324 GGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEA 383
TSYDYDAP++EY ++ K HL+ LH +I +P L D + E
Sbjct: 309 TMYLQATSYDYDAPLNEY-VMETTKSRHLRRLHESI---QPFLSDRDGVLDMS---RLEL 361
Query: 384 HVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVS 443
V+ R S + D + SV + S + V + + R + N A
Sbjct: 362 KVFEGERRAILYERSTVSGDADHRSEESVRCVFDSADI---RVHLALELREIIVNAA--- 415
Query: 444 SQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEH 503
S+ + + + + + +P+ + + LS TS + T+ + +
Sbjct: 416 SRDTGQDLRWRM-------LPEPPPLRAALSDTSATLATIPDLV---------------- 452
Query: 504 LNVTKDYSDYLWHITQIYVSDD----DISFWKTNEVRPTVTIDSMRDVLRVFINGQLTG- 558
+ T SDY W+I + + + V +D D R + G
Sbjct: 453 -DATAGTSDYAWYILRCPTAQGSGLLQLEVADFGRVWRRKAVDQGDDAERQPLEWAAAGP 511
Query: 559 --SVIGHWVKVVQPVEFQSG---------YNDLILLSQTVGL--------QNYGAFLEKD 599
V + E+ G + + ++L ++G+ YG E+
Sbjct: 512 EPPVEDRFPNAWNSTEYGYGIVEVGAIDCHEEYVVLVSSLGMVKGDWQLPPGYGMARERK 571
Query: 600 G---AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAE-----WTDLT 651
G A +R V + D ++ + GL+GE +I S+ E +A+ WT
Sbjct: 572 GLLRASYRSDVTFADDEWRD----ALVVGFAAGLRGE--RIRSVIEGDADAYPYLWTPQK 625
Query: 652 RDGIPSTFT---WYKTYFDAP----DGIDPVALDLGSMGKGQAWV--NGHHIGRYWTV-- 700
F+ WY+ P D + + LDL G + W+ NG GR+W V
Sbjct: 626 AALSGRRFSWPRWYRASLAIPPPNADETEGIILDLYESGVEKGWIYMNGEPCGRHWRVHG 685
Query: 701 VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASN--NLLVIFEETGGNP 758
PK G D G PTQ ++++P L A + LVIF+E
Sbjct: 686 TMPKNGFLRQGDQEAPIEQ----VGHGQPTQRYFYIPPWHLHAKGRPSTLVIFDEHANGE 741
Query: 759 FE--ISVKLRSTRIVCEQVSES 778
+ +LR R V V +
Sbjct: 742 YREFEPHRLRVYRAVLRVVEST 763
>gi|183604891|gb|ACC64532.1| beta-galactosidase 6 inactive isoform [Oryza sativa Indica Group]
Length = 244
Score = 273 bits (697), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 119/202 (58%), Positives = 156/202 (77%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
++YD RA+++ G RRM S +HY R+TPEMWP LIAK+K GG DVI+TYVFWN HE I
Sbjct: 28 EITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVHEPI 87
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+GQYNF+G+ D+VKF++ + + GLY+ LRIGP+V AEW +GGFP WL D+P I FR++N
Sbjct: 88 QGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRSDNE 147
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
PFK+ MQ FV KIV +M+ E L+ QGGPII+ QIENEY +E ++G G YV+WAA+M
Sbjct: 148 PFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWAAAM 207
Query: 226 ALGLGAGVPWVMCKQTDAPENI 247
A+GL GVPW+MCKQ DAP+ +
Sbjct: 208 AVGLQTGVPWMMCKQNDAPDPV 229
>gi|301123859|ref|XP_002909656.1| beta-galactosidase, putative [Phytophthora infestans T30-4]
gi|262100418|gb|EEY58470.1| beta-galactosidase, putative [Phytophthora infestans T30-4]
Length = 706
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 204/664 (30%), Positives = 300/664 (45%), Gaps = 102/664 (15%)
Query: 20 PMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWP 79
P+M+ A+ ++V+Y R IDG + +L+ IHYPR++P W
Sbjct: 58 PLMLKSSNFQYLTYDGIDATKRQSGYSVTYSPRGFEIDGKQTLLLGGSIHYPRSSPGEWE 117
Query: 80 DLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYV 139
L+ ++K G + IE YVFWN HE RG +NF G +I +F +L GL+L +R GPYV
Sbjct: 118 QLLREAKRDGLNHIEMYVFWNLHEQERGVFNFAGNANITRFYELAAEVGLFLHVRFGPYV 177
Query: 140 CAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQ 199
CAEWN GG P+WL IPG+E R++NAP++ EM+RF++ +V+L R + GGPIIM Q
Sbjct: 178 CAEWNNGGLPLWLNWIPGMEVRSSNAPWQREMERFIRYMVELSRP--FLAKNGGPIIMAQ 235
Query: 200 IENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGY 259
IENE + +Y+ W ++ L +PWVMC +A EN I +CN C +
Sbjct: 236 IENE-------FAWHDPEYIAWCGNLVKQLDTSIPWVMC-YANAAENTILSCNDDDCVDF 287
Query: 260 KPNSYNK----PTLWTENWDGWYTTW----GGRLPH--RPVEDLAFAVARFFQRGGSFMN 309
+ P +WTE+ +GW+ TW LP+ R ED+A+AVAR+F GG+ N
Sbjct: 288 AVKHVKERPSDPLVWTED-EGWFQTWQKDKKNPLPNDQRSPEDVAYAVARWFAVGGAAHN 346
Query: 310 YYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAA 369
YYMY GG N+GR + T Y + GL +EPK HL+ LH A+ C L+
Sbjct: 347 YYMYHGGNNYGRAASAGV-TTMYADGVNLHSDGLSNEPKRTHLRKLHEALIECNDVLLRN 405
Query: 370 D------------SAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQ 417
D Q +K Q A VY +Q
Sbjct: 406 DRQVLNPRELPLVDEQTVKASSQQRAFVYGPEAEPNQDGA-------------------- 445
Query: 418 SYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTS 477
+F+TA V + PL ++ ++ E +SST+
Sbjct: 446 -----------------ILFDTADVRKSFPGRQHRTYTPLVKASALAWKAWSELNVSSTT 488
Query: 478 -KSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHIT-----QIYVSDDDISFWK 531
+ + +PI E L +T D SDYL + T Q+ DDD+ K
Sbjct: 489 PRRRVVADQPI--------------EQLRLTADQSDYLTYETTFTPKQLSDVDDDMWTVK 534
Query: 532 TNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGY-NDLILLSQTVGLQ 590
+ I + L N G P + G +DL L+S ++G+
Sbjct: 535 VTSCEASSIIALVDGWLIGERNLAYPGGNCSKEFSFHLPASIEVGRQHDLKLVSVSLGIY 594
Query: 591 NYGAFLEKDGAGFRGQVKLTGFKNGDIDLSK-ILWTYQVGLKGEFQQIYSIEENEA-EWT 648
+ G+ K G G V++ G DL++ W L GE +IY + +A WT
Sbjct: 595 SLGSNHSK---GVTGSVRI-----GHKDLARGQRWEMYPSLIGEQLEIYRSQWIDAVPWT 646
Query: 649 DLTR 652
++R
Sbjct: 647 PVSR 650
>gi|16649045|gb|AAL24374.1| beta-galactosidase [Arabidopsis thaliana]
gi|20260008|gb|AAM13351.1| beta-galactosidase [Arabidopsis thaliana]
Length = 420
Score = 268 bits (684), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 174/477 (36%), Positives = 251/477 (52%), Gaps = 66/477 (13%)
Query: 312 MYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADS 371
MY GGTNFGRTS ++IT Y AP+DEYGLL +PK+GHLK+LHAAIK L+
Sbjct: 1 MYHGGTNFGRTSSS-YFITGYYDQAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQ 59
Query: 372 AQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPD 431
+ LG Q+A+V+ G C AFL N D A+ + F +Y+L P S+ IL +
Sbjct: 60 T-ILSLGPMQQAYVFEDANNG----CVAFLVNNDAK-ASQIQFRNNAYSLSPKSIGILQN 113
Query: 432 CRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWS 491
C+N ++ TAKV+ + + + ++ P Q + + +W +E I +
Sbjct: 114 CKNLIYETAKVNVKMNTR-----------VTTPVQ------VFNVPDNWNLFRETIPAFP 156
Query: 492 ENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVF 551
+ +LEH N+TKD +DYLW Y S + TN P++ +S V+ VF
Sbjct: 157 GTSLKTNALLEHTNLTKDKTDYLW-----YTSSFKLDSPCTN---PSIYTESSGHVVHVF 208
Query: 552 INGQLTGSVIG----HWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGF-RGQ 606
+N L GS G VK+ PV +G N++ +LS VGL + GA++E+ G + Q
Sbjct: 209 VNNALAGSGHGSRDIRVVKLQAPVSLINGQNNISILSGMVGLPDSGAYMERRSYGLTKVQ 268
Query: 607 VKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEE-NEAEWTDLTRDGIPST--FTWYK 663
+ G K IDLS+ W Y VGL GE ++Y + N +W+ + + G+ WYK
Sbjct: 269 ISCGGTK--PIDLSRSQWGYSVGLLGEKVRLYQWKNLNRVKWS-MNKAGLIKNRPLAWYK 325
Query: 664 TYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCT 723
T FD P+G PV L + SMGKG+ WVNG IGRYW
Sbjct: 326 TTFDGPNGDGPVGLHMSSMGKGEIWVNGESIGRYWV---------------------SFL 364
Query: 724 TNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHY 780
T G P+Q+ YH+PR++L+ S NLLV+FEE GG+P IS L + +V ++S +
Sbjct: 365 TPAGQPSQSIYHIPRAFLKPSGNLLVVFEEEGGDPLGIS--LNTISVVGSSQAQSQF 419
>gi|116782829|gb|ABK22678.1| unknown [Picea sitchensis]
Length = 317
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 137/282 (48%), Positives = 173/282 (61%), Gaps = 7/282 (2%)
Query: 570 PVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVG 629
P+ G ND+ LLS VGL N G E+ AG V L GFK+G DLS+ LWTYQ+G
Sbjct: 5 PISLIPGTNDIALLSVMVGLPNSGGHFERKIAGIS-TVTLRGFKDGTRDLSQELWTYQIG 63
Query: 630 LKGEFQQIYS-IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAW 688
L GE IYS + WT + P TWYK D PDG +PV LDL SMGKGQAW
Sbjct: 64 LLGEMSTIYSDVGFISVNWTSSSTPNPP--LTWYKAVIDVPDGDEPVILDLSSMGKGQAW 121
Query: 689 VNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNL 747
+NG HIGRYW + +AP G C CDYRG Y+ KC TNCG P+QT YHVPRSWL+ + NL
Sbjct: 122 INGEHIGRYWISFLAPLGDCSK-CDYRGNYSLHKCATNCGQPSQTLYHVPRSWLRPTGNL 180
Query: 748 LVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHL 807
LV+FEETGG+P ++S+ RS VC E+H P ++ W + V+ ++ + P + L
Sbjct: 181 LVLFEETGGDPSKVSLLTRSIDSVCAHAFETHPPSIQSWQKT-KVNSEVLRENVEPSLQL 239
Query: 808 HCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
C G ISSI+FAS+G P+G C F +G CH+ S V +
Sbjct: 240 DCSVGRRISSIKFASFGNPKGVCGNFMKGTCHSVESEKAVEK 281
>gi|297797852|ref|XP_002866810.1| hypothetical protein ARALYDRAFT_912308 [Arabidopsis lyrata subsp.
lyrata]
gi|297312646|gb|EFH43069.1| hypothetical protein ARALYDRAFT_912308 [Arabidopsis lyrata subsp.
lyrata]
Length = 448
Score = 255 bits (651), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 117/266 (43%), Positives = 172/266 (64%), Gaps = 22/266 (8%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V+YD ++II+G R +L S +HYPR+TP+MWP +I K++ GG + I+TYVFWN HE
Sbjct: 42 VTYDGTSLIINGKRELLFSVSVHYPRSTPDMWPSIIDKARIGGLNTIQTYVFWNVHEPEH 101
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
+Y+FKG+ D+V F+KL+ GLY+ LR+GP++ AEWN GG P WLR++P + FRT+N P
Sbjct: 102 RKYDFKGRFDLVTFIKLIQEKGLYVTLRLGPFIQAEWNHGGLPYWLREVPEVYFRTDNEP 161
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
FKE +R+V+KI+ +M+EE L + Q L ENE ++ +Y + G+ Y+KWAA++
Sbjct: 162 FKEHTERYVRKILGMMKEEKLLASQRRS-HHLGTENECNAVQLAYKENGERYIKWAANLV 220
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
+ G+PWVMCKQ +A +N+I+ACNG +C + G
Sbjct: 221 ESMKLGIPWVMCKQNNASDNLINACNGRHC---------------------FEFLGILQL 259
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYM 312
ED+AF+VAR+F + GS +NYYM
Sbjct: 260 IEQSEDIAFSVARYFSKNGSHVNYYM 285
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 60/126 (47%), Gaps = 18/126 (14%)
Query: 734 YHVPRSWL--QASNNLLVIFEETGGNPFE-ISVKLRSTRIVCEQVSESHYPPVRKWSN-- 788
YH+PRS++ + N+LVI EE G E I L + +C V E + V+ W
Sbjct: 290 YHIPRSFMKEEKKKNMLVILEEEPGVKLEAIDFVLVNRDTICSYVGEDYPVSVKSWKRER 349
Query: 789 ----SYSVDGKL-SINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMS 843
S S D +L ++ K PE + ++EFAS+G P G C F+ G C A S
Sbjct: 350 PKIASRSKDMRLKAVMKCPPEKQM--------VAVEFASFGDPTGTCGNFTMGKCSASKS 401
Query: 844 LSVVSE 849
VV +
Sbjct: 402 KEVVEK 407
>gi|294948459|ref|XP_002785761.1| beta-galactosidase, putative [Perkinsus marinus ATCC 50983]
gi|239899809|gb|EER17557.1| beta-galactosidase, putative [Perkinsus marinus ATCC 50983]
Length = 770
Score = 252 bits (643), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 213/777 (27%), Positives = 345/777 (44%), Gaps = 147/777 (18%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
+P++V+YD RA IDG R +L+ IHYPR + W ++ + G + ++ YVFWN H
Sbjct: 47 RPYSVTYDSRAFKIDGVRTLLLGGSIHYPRVAVDEWEPMLEEMGRDGLNHVQLYVFWNYH 106
Query: 103 E-----------SIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVW 151
E + +Y+F G+ D++ F++ L++ LRIGPYVCAEW FGG P+W
Sbjct: 107 EPRPPRYDQLKDRLEHKYDFSGRGDLLGFIRAAAKKDLFVSLRIGPYVCAEWAFGGLPLW 166
Query: 152 LRDIPGIEFRT--------------------NNAPFKEEMQRFVKKIVDLMREEMLFSWQ 191
LRD+ G+ FR+ + P+++ M FV +I +++E L + Q
Sbjct: 167 LRDVEGMCFRSICGYNGSPGKCKPWEGGKFRSCDPWRKYMADFVMEIGRMVKEANLMAAQ 226
Query: 192 GGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDAC 251
GGP+I+ Q+ENEYG+ + G+ Y+ W ++ GLG VPWVMC A ++ C
Sbjct: 227 GGPVILGQLENEYGH----HSDAGRAYIDWVGELSFGLGLDVPWVMCNGISA-NGTLNVC 281
Query: 252 NGYYC-DGYKPNSYNK----PTLWTENWDGWYTTWGGRLPH--RPVEDLAFAVARFFQRG 304
NG C D YK + + P WTEN +GW+ TWGG + + R E++A+ +A++ G
Sbjct: 282 NGDDCADEYKTDHDKRWPDEPLGWTEN-EGWFDTWGGAVGNSKRSAEEMAYVLAKWVAVG 340
Query: 305 GSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEP 364
GS NYYM++GG + + G +Y GL +EPK HL+ LH +
Sbjct: 341 GSHHNYYMWYGGNHLAQW-GAASLTNAYADGVNFHSNGLPNEPKRSHLQRLHEVLGKLNG 399
Query: 365 ALVAAD---SAQYIKLGQNQEAHVYRAN-RYGSQSNCSAFLANIDEHTAASVTFLGQSYT 420
L+ + S ++L E + + A + + CS + V + +Y+
Sbjct: 400 ELMQVEDRHSVMPVQLENGVEVYEWTAGLAFLHRPACSG--------SPVEVHYAKATYS 451
Query: 421 LPPWSVSIL-PDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSS-TSK 478
+ V ++ P +F TA V P ++ +++ T+
Sbjct: 452 IACREVLVVDPSSSTVLFATASVE--------------------PPPELVRRVVATLTAD 491
Query: 479 SWMTVKEPIGVWSENNFTVQGI--LEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVR 536
W KE + TV+G +EHL V+ +DY+ + T + ++ +
Sbjct: 492 RWSMRKEEL---LHGMATVEGREPVEHLRVSGLDTDYVTYKTTVTATEGVTNV------- 541
Query: 537 PTVTIDS-MRDVLRVFIN--GQLTGSVIG------HWVKVVQPVEFQSGYN-DLILLSQT 586
++ IDS + V V ++ L +V+ W V Q +G DL +LS++
Sbjct: 542 -SLEIDSRISQVFHVSVDNASSLAATVMDVNKGNTEWTAVAQLHNLTAGRTYDLWILSES 600
Query: 587 VGLQN---YGAFLEKDGA---GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI 640
+G++N YGA + + G G ++L + + K W+ GL GE
Sbjct: 601 LGVENGMLYGAPAATEPSLQKGIFGDIRLN-----EKSIRKGRWSMVKGLDGEVDG---- 651
Query: 641 EENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLG--SMGKGQAWVNGHHIGRYW 698
+ +AE G P+ F T + L LG G W+NG IGR+
Sbjct: 652 GQGKAELPCCDSLG-PAWFVAGFTLHSVRSKSISLTLPLGLPQQAGGHIWLNGVDIGRWR 710
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
V GG Q + Y +P L+ +N L +F TG
Sbjct: 711 AV----GGRQAS-----------------------YRLPSDVLKRGSNRLAVFSATG 740
>gi|222616996|gb|EEE53128.1| hypothetical protein OsJ_35926 [Oryza sativa Japonica Group]
Length = 314
Score = 248 bits (632), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 109/187 (58%), Positives = 139/187 (74%), Gaps = 2/187 (1%)
Query: 663 KTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKC 722
+T F P G DPVA+DLGSMGKGQAWVNGH IGRYW++VAP+ GC +C Y GAYN KC
Sbjct: 83 ETMFSTPKGTDPVAIDLGSMGKGQAWVNGHLIGRYWSLVAPESGCSSSCYYPGAYNERKC 142
Query: 723 TTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPP 782
+NCG PTQ WYH+PR WL+ S+NLLV+FEETGG+P IS++ + VC ++SE++YPP
Sbjct: 143 QSNCGMPTQNWYHIPREWLKESDNLLVLFEETGGDPSLISLEAHYAKTVCSRISENYYPP 202
Query: 783 VRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPM 842
+ WS+ S G+ S+N PE+ L C DG++IS I FASYGTP G C FS+GNCHA
Sbjct: 203 LSAWSHLSS--GRASVNAATPELRLQCDDGHVISEITFASYGTPSGGCLNFSKGNCHASS 260
Query: 843 SLSVVSE 849
+L +V+E
Sbjct: 261 TLDLVTE 267
>gi|77554857|gb|ABA97653.1| Galactose binding lectin domain containing protein, expressed
[Oryza sativa Japonica Group]
Length = 317
Score = 248 bits (632), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 109/187 (58%), Positives = 139/187 (74%), Gaps = 2/187 (1%)
Query: 663 KTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKC 722
+T F P G DPVA+DLGSMGKGQAWVNGH IGRYW++VAP+ GC +C Y GAYN KC
Sbjct: 83 ETMFSTPKGTDPVAIDLGSMGKGQAWVNGHLIGRYWSLVAPESGCSSSCYYPGAYNERKC 142
Query: 723 TTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPP 782
+NCG PTQ WYH+PR WL+ S+NLLV+FEETGG+P IS++ + VC ++SE++YPP
Sbjct: 143 QSNCGMPTQNWYHIPREWLKESDNLLVLFEETGGDPSLISLEAHYAKTVCSRISENYYPP 202
Query: 783 VRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPM 842
+ WS+ S G+ S+N PE+ L C DG++IS I FASYGTP G C FS+GNCHA
Sbjct: 203 LSAWSHLSS--GRASVNAATPELRLQCDDGHVISEITFASYGTPSGGCLNFSKGNCHASS 260
Query: 843 SLSVVSE 849
+L +V+E
Sbjct: 261 TLDLVTE 267
>gi|125536445|gb|EAY82933.1| hypothetical protein OsI_38150 [Oryza sativa Indica Group]
Length = 314
Score = 247 bits (631), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 109/187 (58%), Positives = 139/187 (74%), Gaps = 2/187 (1%)
Query: 663 KTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKC 722
+T F P G DPVA+DLGSMGKGQAWVNGH IGRYW++VAP+ GC +C Y GAYN KC
Sbjct: 83 ETMFSTPKGTDPVAIDLGSMGKGQAWVNGHLIGRYWSLVAPESGCSSSCYYPGAYNERKC 142
Query: 723 TTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPP 782
+NCG PTQ WYH+PR WL+ S+NLLV+FEETGG+P IS++ + VC ++SE++YPP
Sbjct: 143 QSNCGMPTQNWYHIPREWLKESDNLLVLFEETGGDPSLISLEAHYAKAVCSRISENYYPP 202
Query: 783 VRKWSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPM 842
+ WS+ S G+ S+N PE+ L C DG++IS I FASYGTP G C FS+GNCHA
Sbjct: 203 LSAWSHLSS--GRASVNAATPELRLQCDDGHVISEITFASYGTPSGGCLNFSKGNCHASS 260
Query: 843 SLSVVSE 849
+L +V+E
Sbjct: 261 TLDLVTE 267
>gi|62529271|gb|AAX84941.1| beta-galactosidase [Prunus persica]
Length = 287
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 144/314 (45%), Positives = 189/314 (60%), Gaps = 33/314 (10%)
Query: 325 GPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAH 384
GPF TSYDYDAP+DEYGL EPKWGHL+DLH AIK E ALV+A+ + LG QEAH
Sbjct: 1 GPFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKSSESALVSAEPS-VTSLGNGQEAH 59
Query: 385 VYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSS 444
V++ S+S C+AFLAN D ++A V+F Y LPPWS+SILPDC+ V+NTA++ S
Sbjct: 60 VFK-----SKSGCAAFLANYDTKSSAKVSFGNGQYELPPWSISILPDCKTAVYNTARLGS 114
Query: 445 QTSIKTVEFSLPLSP-NISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEH 503
Q+S + ++P ++P QS +E SS + T+ G+ E
Sbjct: 115 QSS------QMKMTPVKSALPWQSFVEESASSD--------------ESDTTTLDGLWEQ 154
Query: 504 LNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGH 563
+NVT+D +DYLW++T I +S D+ F K E P +TI S L VFINGQL+G+V G
Sbjct: 155 INVTRDTTDYLWYMTDITISPDE-GFIKRGE-SPLLTIYSAGHALHVFINGQLSGTVYGA 212
Query: 564 W----VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDL 619
+ Q V+ +SG N L LLS +VGL N G E AG G V L G +G D+
Sbjct: 213 LENPKLTFSQNVKLRSGINKLALLSISVGLPNVGLHFETWNAGVLGPVTLKGLNSGTWDM 272
Query: 620 SKILWTYQVGLKGE 633
S+ WTY+ GLKGE
Sbjct: 273 SRWKWTYKTGLKGE 286
>gi|218117866|dbj|BAH03318.1| beta-galactosidase [Cucumis melo var. cantalupensis]
Length = 164
Score = 243 bits (621), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 114/164 (69%), Positives = 136/164 (82%), Gaps = 1/164 (0%)
Query: 519 QIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYN 578
+I+VS+DDI FWK + PTV IDS+RDV RV +NG++ GS IG WVK VQPV+F GYN
Sbjct: 1 RIHVSNDDIKFWKERNISPTVMIDSVRDVFRVSVNGKIAGSAIGQWVKFVQPVQFLEGYN 60
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIY 638
DL+LLSQ +GLQN GAF+EKDGAG RG++KLTGFKNGDIDLS+ LWTYQVGLKGEF Y
Sbjct: 61 DLLLLSQAMGLQNSGAFIEKDGAGIRGRIKLTGFKNGDIDLSESLWTYQVGLKGEFLNFY 120
Query: 639 SIEENE-AEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGS 681
S+EE+E A+WT L+ D IPSTFTWYK YF +PDG DPVA++LGS
Sbjct: 121 SLEESEKADWTKLSVDAIPSTFTWYKAYFSSPDGTDPVAINLGS 164
>gi|10047451|gb|AAG12249.1|AF184080_1 beta-galactosidase [Prunus armeniaca]
Length = 376
Score = 243 bits (621), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 137/323 (42%), Positives = 178/323 (55%), Gaps = 18/323 (5%)
Query: 536 RPTVTIDSMRDVLRVFINGQLTGSVIG----HWVKVVQPVEFQSGYNDLILLSQTVGLQN 591
+PT+T+ S L VF+NGQ +GS G +PV ++G N + LLS VGL N
Sbjct: 15 KPTLTVQSAGHALHVFVNGQFSGSAFGTREQRQFTFAKPVHLRAGINKIALLSIAVGLPN 74
Query: 592 YGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLT 651
G E G G V L G G DL+ W +VGLKGE + S N D
Sbjct: 75 VGLHYESWKTGILGPVFLDGLGQGRKDLTMQKWFNKVGLKGEAMDLVS--PNGGSSVDWI 132
Query: 652 RDGIPS----TFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGC 707
R + + T WYK YF+AP G +P+ALD+ SMGKGQ W+NG IGRYW A G C
Sbjct: 133 RGSLATQTKQTLKWYKAYFNAPGGDEPLALDMRSMGKGQVWINGQSIGRYWMAYA-NGDC 191
Query: 708 QDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRS 767
C Y G + KC CG PTQ WYHVPRSWL+ + NL+V+FEE GG+P +I++ RS
Sbjct: 192 -SLCSYIGTFRPTKCQLGCGQPTQRWYHVPRSWLKPTKNLMVMFEELGGDPSKITLVKRS 250
Query: 768 TRIVCEQVSESHYPPVRKWSNSYSVDG-KLSINKMAPEMHLHCQDGYIISSIEFASYGTP 826
VC + E H+P K + +D + S ++HL C G ISSI+FAS+GTP
Sbjct: 251 VAGVCADLQE-HHPNAEK----FDIDSHEESKTLHQAQVHLQCVPGQSISSIKFASFGTP 305
Query: 827 QGRCQKFSRGNCHAPMSLSVVSE 849
G C F +G CHA S ++V +
Sbjct: 306 TGTCGSFQQGTCHATNSHAIVEK 328
>gi|356544613|ref|XP_003540743.1| PREDICTED: beta-galactosidase 8-like [Glycine max]
Length = 288
Score = 243 bits (620), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 141/309 (45%), Positives = 186/309 (60%), Gaps = 34/309 (11%)
Query: 278 YTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAP 337
+ ++G +PHRPVEDLAFAVARF+QRGG+F NYYM+ GGTNFGRT+GGPF TSYD+D P
Sbjct: 6 FVSFGDVVPHRPVEDLAFAVARFYQRGGTFQNYYMFHGGTNFGRTTGGPFISTSYDFDTP 65
Query: 338 IDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNC 397
IDEYG++ +PKW HLK++H AIKLCE AL+A LG N EA VY G+ S
Sbjct: 66 IDEYGIIRQPKWDHLKNVHKAIKLCEKALLAT-GPTITYLGPNIEAAVY---NIGAVS-- 119
Query: 398 SAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPL 457
+AFLANI + T A V+F G SY LP W VS LPDC++ V NTAK++S + I
Sbjct: 120 AAFLANIAK-TDAKVSFNGNSYHLPAWYVSTLPDCKSVVLNTAKINSASMIS-------- 170
Query: 458 SPNISVPQQSMIES--KLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLW 515
S +S+ E L + W + EPIG+ ++F+ +LE +N T D SDYLW
Sbjct: 171 ----SFTTESLKEEVGSLDDSGSGWSWISEPIGISKAHSFSKFWLLEQINTTADRSDYLW 226
Query: 516 HITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPV 571
+ + I D D + + I+S+ L F+NG+L GS G+ VKV P+
Sbjct: 227 YSSSI---DLDAA------TETVLHIESLGHALHAFVNGKLAGSGTGNHEKVSVKVDIPI 277
Query: 572 EFQSGYNDL 580
G N +
Sbjct: 278 TLVYGKNTI 286
>gi|343963202|gb|AEM72517.1| beta-galactosidase [Diospyros kaki]
Length = 172
Score = 242 bits (618), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 110/172 (63%), Positives = 131/172 (76%)
Query: 146 GGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYG 205
GGFPVWL+ +PGI FRT+N PFK MQ F +KIV+LM+ E LF QGGPII+ QIENEYG
Sbjct: 1 GGFPVWLKYVPGISFRTDNEPFKNAMQGFTEKIVNLMKSENLFESQGGPIILSQIENEYG 60
Query: 206 NMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYN 265
G G YV WAA+MA+GLG GVPWVMCK+ DAP+ +I+ CNG+YCD + PN
Sbjct: 61 PQGKILGDAGHKYVTWAANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDSFSPNRPY 120
Query: 266 KPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGT 317
KPT+WTE W GW+T +GG + RPV+DLAFAVARF Q+GGSF NYYMY GGT
Sbjct: 121 KPTIWTEAWSGWFTEFGGPIHERPVQDLAFAVARFIQKGGSFFNYYMYHGGT 172
>gi|16973314|emb|CAC84109.1| putative galactosidae, partial [Gossypium hirsutum]
Length = 383
Score = 241 bits (615), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 148/433 (34%), Positives = 230/433 (53%), Gaps = 60/433 (13%)
Query: 336 APIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQS 395
P+DE+GL EPKWGHLKD+H A+ LC+ AL +KLG +Q+A V++ S
Sbjct: 4 GPLDEFGLQREPKWGHLKDVHRALSLCKRALFWGFPTT-LKLGPDQQAIVWQQP---GTS 59
Query: 396 NCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSL 455
C+A LAN + A V F GQ LP S+S+LPDC+ VFNT V++Q +
Sbjct: 60 ACAALLANNNTRLAQHVNFRGQDIRLPARSISVLPDCKTVVFNTQLVTTQHN-------- 111
Query: 456 PLSPNISVPQQSMIESKLSSTSKSWMTVKE--PIGVWSENNFTVQGILEHLNVTKDYSDY 513
++ + S++++ + +W +E P+G+ F E ++TKD +DY
Sbjct: 112 ---------SRNFVRSEIANKNFNWEMYREVPPVGL----GFKFDVPRELFHLTKDTTDY 158
Query: 514 LWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQ 569
W+ T + + D+ K VRP + + S+ + ++NG+ GS G V+ +
Sbjct: 159 AWYTTSLLLGRRDLPMKKN--VRPVLRVASLGHGIHAYVNGEYAGSAHGSKVEKSFVCRE 216
Query: 570 PVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVG 629
+ G N + LL VGL + GA++EK AG R + + G G +D+S+ W +QVG
Sbjct: 217 LSSLKEGENHIALLGYLVGLPDSGAYMEKRFAGPR-SITILGLNTGTLDISQNGWGHQVG 275
Query: 630 LKGEFQQIYSIEENEA-EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAW 688
GE +++++ E +++ +WT + G TWYK YFDAP+G +PVA+ + MGKG W
Sbjct: 276 TDGEKKKLFTEEGSKSVQWTKPDQGG---PLTWYKGYFDAPEGDNPVAIVMTGMGKGMVW 332
Query: 689 VNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLL 748
VNG IGRYW + + PTQ+ YH+PR++L+ NL+
Sbjct: 333 VNGRSIGRYW---------------------NNYLSPLKKPTQSEYHIPRAYLKPK-NLI 370
Query: 749 VIFEETGGNPFEI 761
V+ EE GGNP ++
Sbjct: 371 VLLEEEGGNPKDV 383
>gi|217075791|gb|ACJ86255.1| unknown [Medicago truncatula]
Length = 267
Score = 240 bits (613), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 133/294 (45%), Positives = 180/294 (61%), Gaps = 31/294 (10%)
Query: 312 MYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADS 371
MY GGTNF R++GGPF TSYDYDAPIDEYG++ + KWGHLKD++ AIKLCE AL+ D
Sbjct: 1 MYHGGTNFDRSTGGPFIATSYDYDAPIDEYGIIRQQKWGHLKDVYKAIKLCEEALITTD- 59
Query: 372 AQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPD 431
+ LGQN EA VY+ + S C+AFLAN+D +V F G SY LP WSVS+LPD
Sbjct: 60 PKISSLGQNLEAAVYK-----TGSVCAAFLANVDTKNDKTVNFSGNSYHLPAWSVSMLPD 114
Query: 432 CRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWS 491
C+N V NTAK++S ++I ++ +I S L ++S W + EP+G+
Sbjct: 115 CKNVVLNTAKINSASAISNF-----VTEDI---------SSLETSSSKWSWINEPVGISK 160
Query: 492 ENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVF 551
++ + G+LE +N T D SDYLW+ + ++DD S + + I+S+ L F
Sbjct: 161 DDILSKTGLLEQINTTADRSDYLWYSLSLDLADDPGS-------QTVLHIESLGHTLHAF 213
Query: 552 INGQLTGSVIGHWVK----VVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGA 601
ING+L G+ G+ K V P+ SG N + LLS TVGLQNYGAF + GA
Sbjct: 214 INGKLAGNQAGNSDKSKLNVDIPIALVSGKNKIDLLSLTVGLQNYGAFFDTVGA 267
>gi|357483853|ref|XP_003612213.1| Beta-galactosidase [Medicago truncatula]
gi|355513548|gb|AES95171.1| Beta-galactosidase [Medicago truncatula]
Length = 418
Score = 239 bits (610), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 125/302 (41%), Positives = 174/302 (57%), Gaps = 39/302 (12%)
Query: 68 IHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSS 127
+HYPR PEMWPD+ K+K Q+NF+G D++KF+K++G
Sbjct: 13 VHYPRCPPEMWPDIFKKAK---------------------QFNFEGNYDLIKFIKMIGIM 51
Query: 128 GLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEML 187
L + + P+WLR+IP I FR++N PF M++F K I+ MR+E
Sbjct: 52 ICMQHLEL------VHSLKELPIWLREIPNIIFRSDNQPFMYHMEQFTKMIIKKMRDEKF 105
Query: 188 FSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENI 247
F + QIENE+ ++ +Y + G YV+W +MA+GL GVPW+MCKQ +A +
Sbjct: 106 FPRK-------QIENEHTAVQQAYKEHGMRYVQWEGNMAVGLDTGVPWIMCKQVNALGPV 158
Query: 248 IDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGG 305
++ CNG YC D + PN + + ++ Y +G R ED+A AVARFF + G
Sbjct: 159 MNTCNGRYCGDTFSGPNKNSHLNIHLRHYR--YRAFGDPPSERTAEDIAIAVARFFSKKG 216
Query: 306 SFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPA 365
+ NYYMY+GGTNFGRTS F T Y +API EYGL EPKWGH +DLH A+KLC+ A
Sbjct: 217 TMANYYMYYGGTNFGRTSSS-FVTTQYYDEAPIVEYGLPREPKWGHFRDLHDALKLCQKA 275
Query: 366 LV 367
L+
Sbjct: 276 LL 277
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 45/123 (36%), Positives = 63/123 (51%), Gaps = 3/123 (2%)
Query: 727 GNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPP-VRK 785
G+ YH PR+ LQ NN LV+ EE GG I + + +C ++ HYPP V
Sbjct: 298 GSYVSMLYHTPRAILQPKNNFLVVLEEMGGKLDGIEILTVNRDTICS-IAGEHYPPNVET 356
Query: 786 WSNSYSVDGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLS 845
WS V + +++ P +L C D I+ ++FASYG P G C F G C+AP S
Sbjct: 357 WSRYKGVI-RTNVDTPKPAANLVCLDNKTITQVDFASYGDPVGNCGHFILGKCNAPNSQK 415
Query: 846 VVS 848
+V
Sbjct: 416 IVE 418
>gi|413925746|gb|AFW65678.1| hypothetical protein ZEAMMB73_601729 [Zea mays]
Length = 402
Score = 238 bits (608), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 143/397 (36%), Positives = 219/397 (55%), Gaps = 36/397 (9%)
Query: 309 NYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVA 368
NYYMY GGTNFGRTS F + Y +AP+DE+GL EPKWGHL+DLH A+KLC+ AL+
Sbjct: 3 NYYMYHGGTNFGRTSAA-FVMPKYYDEAPLDEFGLYKEPKWGHLRDLHLALKLCKKALLW 61
Query: 369 ADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSI 428
++ KLG+ EA V+ Q C AFL+N + ++TF GQSY +P S+SI
Sbjct: 62 GKTSTE-KLGKQFEARVFEIP---EQKVCVAFLSNHNTKDDVTLTFRGQSYFVPRHSISI 117
Query: 429 LPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIG 488
L DC+ VF T V++Q + +T F+ + N +V Q M +E +
Sbjct: 118 LADCKTVVFGTQHVNAQHNQRTFHFADQTTQN-NVWQ---------------MFDEEKVP 161
Query: 489 VWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVL 548
+ ++ ++ + N+TKD +DY+W+ + + DD+ + +++ + ++S
Sbjct: 162 KYKQSKIRLRKAGDLYNLTKDKTDYVWYTSSFKLEADDMPIRR--DIKTVLEVNSHGHAS 219
Query: 549 RVFINGQLTGSVIGHWVK------VVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAG 602
F+N + G GH K + +P++ + G N + +L+ T+G+ + GA+LE AG
Sbjct: 220 VAFVNTKFVG--CGHGTKMNKAFTLEKPMDLKKGVNHVAVLASTMGMMDSGAYLEHRLAG 277
Query: 603 FRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTW 661
+V++ G G +DL+ W + VGL GE +QIY+ + W D TW
Sbjct: 278 V-DRVQIKGLNAGTLDLTNNGWGHIVGLVGEQKQIYTDKGMGSVTWKPAVND---RPLTW 333
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
YK +FD P G DP+ LD+ +MGKG +VNG IGRYW
Sbjct: 334 YKRHFDMPSGEDPIVLDMSTMGKGLMFVNGQGIGRYW 370
>gi|343963204|gb|AEM72518.1| beta-galactosidase [Diospyros kaki]
Length = 173
Score = 235 bits (600), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 106/169 (62%), Positives = 128/169 (75%)
Query: 147 GFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGN 206
GF + +PGI FRT+N PFK MQ+F +KIV++M+ E LF QGGPIIM QIENEYG
Sbjct: 3 GFSCLAQYVPGIAFRTDNGPFKAAMQKFTEKIVNMMKSEKLFEPQGGPIIMSQIENEYGP 62
Query: 207 MESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNK 266
+E G GK Y KWAA MA+GL GVPW+MCKQ DAP+ +ID CNG+YC+G++PN K
Sbjct: 63 VEWEIGAPGKSYTKWAAQMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKNYK 122
Query: 267 PTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFG 315
P +WTENW GWYT +GG P+RPVEDLAF+VARF Q GSF+NYYMY G
Sbjct: 123 PKMWTENWTGWYTKFGGPAPYRPVEDLAFSVARFIQNNGSFVNYYMYHG 171
>gi|62319263|dbj|BAD94489.1| beta-galactosidase [Arabidopsis thaliana]
Length = 172
Score = 230 bits (587), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 107/161 (66%), Positives = 126/161 (78%), Gaps = 2/161 (1%)
Query: 225 MALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGR 284
MALGL GVPW+MCKQ DAP IID CNGYYC+ +KPNS NKP +WTENW GWYT +GG
Sbjct: 1 MALGLSTGVPWIMCKQEDAPGPIIDTCNGYYCEDFKPNSINKPKMWTENWTGWYTDFGGA 60
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLL 344
+P+RPVED+A++VARF Q+GGS +NYYMY GGTNF RT+ G F +SYDYDAP+DEYGL
Sbjct: 61 VPYRPVEDIAYSVARFIQKGGSLVNYYMYHGGTNFDRTA-GEFMASSYDYDAPLDEYGLP 119
Query: 345 SEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHV 385
EPK+ HLK LH AIKL EPAL++AD A LG QE +
Sbjct: 120 REPKYSHLKALHKAIKLSEPALLSAD-ATVTSLGAKQEVTI 159
>gi|302144233|emb|CBI23471.3| unnamed protein product [Vitis vinifera]
Length = 315
Score = 229 bits (585), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 106/190 (55%), Positives = 138/190 (72%), Gaps = 9/190 (4%)
Query: 14 LALSVYPMM----MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIH 69
L + +P+M ++++++ CV V+YDHRA++IDG RR+L S IH
Sbjct: 128 LGIGFFPIMGNKDLVLLVLIAVCVFEGCYCK-----TVTYDHRALVIDGKRRVLQSGSIH 182
Query: 70 YPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGL 129
YPR+ PE+WP++I KSKEGG DVIETYVFWN HE +RG+Y F+G+ D+V+FVK V +GL
Sbjct: 183 YPRSMPEVWPEIIRKSKEGGLDVIETYVFWNNHEPVRGEYYFEGRFDLVRFVKTVQEAGL 242
Query: 130 YLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFS 189
+ LRIGPY CAEWN+GGFPVWL IPGI+FRT N FK EM+RF+ KIV LM+E LF+
Sbjct: 243 LVHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNDLFKNEMKRFLAKIVSLMKEANLFA 302
Query: 190 WQGGPIIMLQ 199
QGGPII+ Q
Sbjct: 303 PQGGPIILAQ 312
>gi|449534351|ref|XP_004174126.1| PREDICTED: beta-galactosidase-like, partial [Cucumis sativus]
Length = 154
Score = 228 bits (581), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 101/154 (65%), Positives = 128/154 (83%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V+YDH+AIII+G RR+LIS IHYPR+TP+MWPDLI K+K+GG D+IETYVFWN HE
Sbjct: 1 SVTYDHKAIIINGQRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDIIETYVFWNGHEPS 60
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+Y F+ + D+V+F+KLV +GLY+ LRIGPYVCAEWN+GGFP+WL+ +PGI FRT+NA
Sbjct: 61 PDKYYFEERYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYGGFPLWLKFVPGIAFRTDNA 120
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQ 199
PFK MQ+FV KIVD+M+ E LF QGGPII+ Q
Sbjct: 121 PFKAAMQKFVYKIVDMMKWEKLFHTQGGPIILSQ 154
>gi|218188529|gb|EEC70956.1| hypothetical protein OsI_02569 [Oryza sativa Indica Group]
Length = 480
Score = 227 bits (578), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 129/294 (43%), Positives = 163/294 (55%), Gaps = 18/294 (6%)
Query: 554 GQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFK 613
G + GSV + V+ +G N + LS VGL N G E AG G V L G
Sbjct: 165 GTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLN 224
Query: 614 NGDIDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGI 672
G DL+ WTYQVGLKGE ++S+ + EW + ++ F F+APDG
Sbjct: 225 EGRRDLTWQKWTYQVGLKGESTTLHSLSGSSTVEWGEPVQNASNMAF------FNAPDGD 278
Query: 673 DPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQT 732
+P+ALD+ SMGKGQ W+NG IGRYW G C TCDYRG Y+ KC TNCG+ +Q
Sbjct: 279 EPLALDMSSMGKGQIWINGQGIGRYWPGYKASGNC-GTCDYRGEYDETKCQTNCGDSSQR 337
Query: 733 WYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSV 792
WYHVPRSWL + NLLVIFEE GG+P IS+ RS VC VSE P ++ W
Sbjct: 338 WYHVPRSWLSPTGNLLVIFEEWGGDPTGISMVKRSIGSVCADVSEWQ-PSMKNWHTKDYE 396
Query: 793 DGKLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSV 846
K+ HL C +G I+ I+FAS+GTPQG C ++ G CHA S +
Sbjct: 397 KAKV---------HLQCDNGQKITEIKFASFGTPQGSCGSYTEGGCHAHKSYDI 441
Score = 196 bits (499), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 85/151 (56%), Positives = 111/151 (73%)
Query: 171 MQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLG 230
MQ+F KIV++M+ E LF WQGGPII+ QIENE+G +E G+ K Y WAA+MA+ L
Sbjct: 1 MQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAVALN 60
Query: 231 AGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPV 290
VPW+MCK+ DAP+ II+ CNG+YCD + PN +KPT+WTE W WYT +G +PHRPV
Sbjct: 61 TSVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTAWYTGFGIPVPHRPV 120
Query: 291 EDLAFAVARFFQRGGSFMNYYMYFGGTNFGR 321
EDLA+ VA+F Q+GGSF+NYYM+ F +
Sbjct: 121 EDLAYGVAKFIQKGGSFVNYYMFLNLRGFTK 151
>gi|359496728|ref|XP_002268994.2| PREDICTED: beta-galactosidase 6-like, partial [Vitis vinifera]
Length = 177
Score = 225 bits (573), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 103/178 (57%), Positives = 133/178 (74%), Gaps = 6/178 (3%)
Query: 22 MMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDL 81
++++++I + T V+YDHRA++IDG RR+L S IHYPR+ PE+WP++
Sbjct: 6 LVLLVLIAVCVFEGCYCKT------VTYDHRALVIDGKRRVLQSGSIHYPRSMPEVWPEI 59
Query: 82 IAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCA 141
I KSKEGG DVIETYVFWN HE +RG+Y F+G+ D+V+FVK V +GL + LRIGPY CA
Sbjct: 60 IRKSKEGGLDVIETYVFWNNHEPVRGEYYFEGRFDLVRFVKTVQEAGLLVHLRIGPYACA 119
Query: 142 EWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQ 199
EWN+GGFPVWL IPGI+FRT N FK EM+RF+ KIV LM+E LF+ QGGPII+ Q
Sbjct: 120 EWNYGGFPVWLHFIPGIQFRTTNDLFKNEMKRFLAKIVSLMKEANLFAPQGGPIILAQ 177
>gi|300122832|emb|CBK23839.2| unnamed protein product [Blastocystis hominis]
Length = 601
Score = 224 bits (570), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 191/671 (28%), Positives = 300/671 (44%), Gaps = 118/671 (17%)
Query: 133 LRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQG 192
+RIGPYVCAEW+ GG PVW+ + G+ R NN +K+EM ++K + D R+ F+ +G
Sbjct: 1 MRIGPYVCAEWDNGGIPVWVNYLDGVRLRANNDVWKKEMGDWMKVLTDYTRD--FFADRG 58
Query: 193 GPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACN 252
GPII QIENE +G ++Y+ W A L VPW+MC D E I+ACN
Sbjct: 59 GPIIFSQIENEL------WG-GAREYIDWCGEFAESLELNVPWMMC-NGDTSEKTINACN 110
Query: 253 GYYCDGYKPNS-------YNKPTLWTENWDGWYTTWGGRLPH---------RPVEDLAFA 296
G C Y + ++P WTEN +GW+ G R ED F
Sbjct: 111 GNDCSSYLESHGQSGRILVDQPGCWTEN-EGWFQIHGAASAERDDYEGWDARSAEDYTFN 169
Query: 297 VARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLH 356
V +F RGGS+ NYYM+FGG ++G+ +G Y I L +EPK H +H
Sbjct: 170 VLKFMDRGGSYHNYYMWFGGNHYGKWAGNGM-TNWYTNGVMIHSDTLPNEPKHSHTAKMH 228
Query: 357 AAIKLCEPALVAADSAQYIKLGQNQEAHVYRAN------RYGSQSNCSAFLANIDEHTAA 410
+ L+ D AQ N + H+ N RYG + +F+ N + +A
Sbjct: 229 RMLANIAEVLL-NDKAQV-----NNQKHLNCDNCNAFEYRYGDR--LVSFVEN-SKGSAD 279
Query: 411 SVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIE 470
V + Y LP WS+ +L + N +F T V P++ + + +E
Sbjct: 280 KVIYRDIVYELPAWSMIVLDEYDNVLFETNNVK------------PVNKHRVYHCEEKLE 327
Query: 471 SKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFW 530
+ + S ++ + P V S E LN+T+D +++L++ T++ D+
Sbjct: 328 FEYWNEPVSTLSQEAPRVVVSPK------ANEQLNMTRDLTEFLYYETEVEFPQDEC--- 378
Query: 531 KTNEVRPTVTIDSMR-DVLRVFINGQLTGSVIGH-----WVKVVQPVEFQSGYNDLILLS 584
T++I + +++ GS H W + ++ G + L+LLS
Sbjct: 379 -------TLSIGGTDANAFVAYVDDHFVGSDDEHTHHDGWHTMNINMKSGKGKHKLVLLS 431
Query: 585 QTVGLQN-YGAFLEKDGAGFR-----GQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIY 638
+++G+ N + L+ A R G +KL G D+ W + GL GE +Q++
Sbjct: 432 ESLGVSNGMDSNLDPSWASSRLKGICGWIKLCGN-----DIFNQEWKHYPGLVGEAKQVF 486
Query: 639 SIEE-NEAEWTDLTRDGIPSTFTWYKTYFDAPDGID---PVALDLGSMGKGQAWVNGHHI 694
+ E W + WY++ F P G+ V L M +GQA+ NGH+I
Sbjct: 487 TDEGMKTVTWKSDVENA--DNLAWYRSTFKTPQGLKRGIEVLLRPEGMNRGQAYANGHNI 544
Query: 695 GRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWL--QASNNLLVIFE 752
GRYW + D G Y TQ +YH+P+ WL + N+LV+ E
Sbjct: 545 GRYWMIK----------DGNGEY------------TQGFYHIPKDWLKGEGEENVLVLGE 582
Query: 753 ETGGNPFEISV 763
G + +++
Sbjct: 583 TLGASDPSVTI 593
>gi|212723424|ref|NP_001132807.1| uncharacterized protein LOC100194296 [Zea mays]
gi|194695440|gb|ACF81804.1| unknown [Zea mays]
Length = 467
Score = 223 bits (569), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 140/465 (30%), Positives = 226/465 (48%), Gaps = 55/465 (11%)
Query: 394 QSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEF 453
Q C AFL+N + A++TF G+ Y +P S+S+L DC VF T V++Q + +T F
Sbjct: 4 QKVCVAFLSNHNTKDDATMTFRGRPYFVPRHSISVLADCETVVFGTQHVNAQHNQRTFHF 63
Query: 454 SLPLSPNISVPQQSMIESKLSSTSKSW-MTVKEPIGVWSENNFTVQGILEHLNVTKDYSD 512
+ + N W M E + + + ++ + N+TKD +D
Sbjct: 64 ADQTAQN-----------------NVWEMFDGENVPKYKQAKIRLRKAGDLYNLTKDKTD 106
Query: 513 YLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK------ 566
Y+W+ + + DD+ ++++ + ++S F+N + G GH K
Sbjct: 107 YVWYTSSFKLEADDMPI--RSDIKTVLEVNSHGHASVAFVNNKFVG--CGHGTKMNKAFT 162
Query: 567 VVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTY 626
+ +P++ + G N + +L+ ++G+ + GA++E AG +V++TG G +DL+ W +
Sbjct: 163 LEKPMDLKKGVNHVAVLASSMGMTDSGAYMEHRLAGV-DRVQITGLNAGTLDLTNNGWGH 221
Query: 627 QVGLKGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKG 685
VGL GE +QIY+ + W D TWYK +FD P G DPV LD+ +MGKG
Sbjct: 222 IVGLVGERKQIYTDKGMGSVTWKPAMND---RPLTWYKRHFDMPSGEDPVVLDMSTMGKG 278
Query: 686 QAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASN 745
+VNG IGRYW Y+ A G P+Q YHVPRS+L+ +
Sbjct: 279 MMFVNGQGIGRYWI------------SYKHAL---------GRPSQQLYHVPRSFLRQKD 317
Query: 746 NLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYS-VDGKLSINKMAPE 804
N+LV+FEE G P I + +C +SE + + W S + K + + +
Sbjct: 318 NMLVLFEEEFGRPDAIMILTVKRDNICTFISERNPAHIMSWERKDSQITAKANADDLRAR 377
Query: 805 MHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
L C +I + FASYG P G C ++ G+CH P + VV +
Sbjct: 378 AALACPPKKLIQQVVFASYGNPAGICGNYTVGSCHTPRAKEVVEK 422
>gi|320536152|ref|ZP_08036203.1| glycosyl hydrolase family 35 [Treponema phagedenis F0421]
gi|320147005|gb|EFW38570.1| glycosyl hydrolase family 35 [Treponema phagedenis F0421]
Length = 857
Score = 221 bits (562), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 124/334 (37%), Positives = 183/334 (54%), Gaps = 21/334 (6%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
+ +D + IIDG R+ +ISA +HY R W +I K++ GG + IETY+ WN HE+
Sbjct: 2 IQFDSNSWIIDGKRKFIISAAVHYFRLPRAEWAAVIRKARLGGCNAIETYIAWNYHETAE 61
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
Q++F G D+ F + G+Y+ +R GPY+CAEW+FGG P +L + GIE+R +NA
Sbjct: 62 EQWDFSGDKDLAAFFAICHDEGMYVIVRPGPYICAEWDFGGLPYYLNNTDGIEYRCSNAA 121
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
+++ ++R+ ++I+ ++R L S GG IIM+QIENEY ++G++ ++++ +
Sbjct: 122 YEQAVRRYFERIMPIIRRYQLGS--GGSIIMVQIENEY----HAFGKKDLAHIRFLEELT 175
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGY-----YCDGYKPNSYNKPTLWTENWDGWYTTW 281
G G VP V C A N ++ N + + +P E W GW W
Sbjct: 176 RGFGITVPLVSC--YGAGRNTVEMRNFWSGAERAAAVLRERQSGQPLGIMEFWIGWVEHW 233
Query: 282 GGR-LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNF----GRTSGGP--FYITSYDY 334
GG H+P E + + G F NYYMYFGG+NF GRT G F SYDY
Sbjct: 234 GGEPQKHKPAEAVLSHCFEALKSGFVFFNYYMYFGGSNFGSWGGRTIGAHKIFMTQSYDY 293
Query: 335 DAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVA 368
DAP+DE+G +E K+ L LH I E L A
Sbjct: 294 DAPLDEFGFETE-KYRLLAVLHTFIAWLENDLTA 326
>gi|62321607|dbj|BAD95183.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 275
Score = 218 bits (554), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 114/233 (48%), Positives = 142/233 (60%), Gaps = 8/233 (3%)
Query: 617 IDLSKILWTYQVGLKGEFQQI-YSIEENEAEWTDLTRD-GIPSTFTWYKTYFDAPDGIDP 674
+DLS WTYQVGLKGE + + W D + P TW+KTYFDAP+G +P
Sbjct: 1 MDLSWQKWTYQVGLKGEAMNLAFPTNTPSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEP 60
Query: 675 VALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWY 734
+ALD+ MGKGQ WVNG IGRYWT A G C C Y G Y +KC T CG PTQ WY
Sbjct: 61 LALDMEGMGKGQIWVNGESIGRYWTAFA-TGDCSH-CSYTGTYKPNKCQTGCGQPTQRWY 118
Query: 735 HVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDG 794
HVPR+WL+ S NLLVIFEE GGNP +S+ RS VC +VSE H P ++ W G
Sbjct: 119 HVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSGVCAEVSEYH-PNIKNWQIESYGKG 177
Query: 795 KLSINKMAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVV 847
+ P++HL C G I+SI+FAS+GTP G C + +G CHA S +++
Sbjct: 178 Q---TFHRPKVHLKCSPGQAIASIKFASFGTPLGTCGSYQQGECHAATSYAIL 227
>gi|3021342|emb|CAA06310.1| beta-galactosidase [Cicer arietinum]
Length = 307
Score = 217 bits (553), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 121/299 (40%), Positives = 167/299 (55%), Gaps = 9/299 (3%)
Query: 475 STSKSWMTVKE-PIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTN 533
S++ W + E P +++ T +LE + VT+D SDYLW++T + +S ++ F K N
Sbjct: 12 SSAFDWQSYNEAPASSGIDDSTTANALLEQIKVTRDSSDYLWYMTDVNISPNE-GFIK-N 69
Query: 534 EVRPTVTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLILLSQTVGL 589
P +T S VL VF+NGQ +G+ G + V+ + G N + LLS VGL
Sbjct: 70 GQYPVLTAMSAGHVLHVFVNGQFSGTAYGGLENPKLTFSNSVKLRVGNNKISLLSVAVGL 129
Query: 590 QNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS-IEENEAEWT 648
N G E G G V L G G DLS W+Y++GLKGE +++ I + +WT
Sbjct: 130 SNVGLHYETWNVGVLGPVTLKGLNEGTRDLSGQKWSYKIGLKGETLNLHTLIGSSSVQWT 189
Query: 649 DLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQ 708
+ TWYK FDAP G DP+ALD+ SMGKG+ WVNG IGR+W +G C
Sbjct: 190 KGSSLVEKQPLTWYKATFDAPAGNDPLALDMSSMGKGEIWVNGESIGRHWPAYIARGSCG 249
Query: 709 DTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRS 767
C+Y G + KC T+CG PTQ WYH+PRSW+ N LV+ EE GG+P IS+ R+
Sbjct: 250 G-CNYAGTFTDKKCRTSCGQPTQKWYHIPRSWVNPRGNFLVVLEEWGGDPSGISLVKRT 307
>gi|183604893|gb|ACC64533.1| beta-galactosidase 11 [Oryza sativa Indica Group]
Length = 446
Score = 216 bits (549), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 139/449 (30%), Positives = 218/449 (48%), Gaps = 58/449 (12%)
Query: 410 ASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMI 469
+V F G+ + +P SVSIL DC+ V+NT +V Q S ++S
Sbjct: 3 GTVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHS-----------------ERSFH 45
Query: 470 ESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISF 529
+ +S + W E I + + + LE N TKD SDYLW+ T + DD+ F
Sbjct: 46 TTDETSKNNVWEMYSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPF 105
Query: 530 WKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK----VVQPVEFQSGYNDLILLSQ 585
+ ++RP + I S + F N G+ G + +P++ + G N + +LS
Sbjct: 106 RR--DIRPVIQIKSTAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSS 163
Query: 586 TVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEA 645
++G+++ G L + G + V + G G +DL W ++ L+GE ++IY+ E+ A
Sbjct: 164 SMGMKDSGGELVEVKGGIQDCV-VQGLNTGTLDLQGNGWGHKARLEGEDKEIYT-EKGMA 221
Query: 646 E--WTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAP 703
+ W D +P TWYK YFD PDG DP+ +D+ SM KG +VNG IGRYWT
Sbjct: 222 QFQWKPAEND-LP--ITWYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWTSF-- 276
Query: 704 KGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
T G+P+Q+ YH+PR++L+ NLL+IFEE G P I +
Sbjct: 277 -------------------ITLAGHPSQSVYHIPRAFLKPKGNLLIIFEEELGKPGGILI 317
Query: 764 KLRSTRIVCEQVSESHYPPVRKWSNSYSVDG---KLSINKMAPEMHLHCQDGYIISSIEF 820
+ +C +SE + ++ W + DG KL + L+C I + F
Sbjct: 318 QTVRRDDICVFISEHNPAQIKTWES----DGGQIKLIAEDTSTRGTLNCPPKRTIQEVVF 373
Query: 821 ASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
AS+G P+G C F+ G CH P + ++V +
Sbjct: 374 ASFGNPEGACGNFTAGTCHTPDAKAIVEK 402
>gi|223945899|gb|ACN27033.1| unknown [Zea mays]
Length = 296
Score = 215 bits (548), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 124/294 (42%), Positives = 163/294 (55%), Gaps = 11/294 (3%)
Query: 479 SWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPT 538
SW + E FT G++E L++T D SDYLW+ T + ++ ++ F K+ + P
Sbjct: 8 SWQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNE-QFLKSGQ-WPQ 65
Query: 539 VTIDSMRDVLRVFINGQLTGSVIGHW----VKVVQPVEFQSGYNDLILLSQTVGLQNYGA 594
+TI S L+VF+NGQ G+V G + + V+ G N + +LS VGL N G
Sbjct: 66 LTIYSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGT 125
Query: 595 FLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE-ENEAEWTDLTRD 653
E G G V L+G G DLS WTYQ+GL GE + S+ + EW
Sbjct: 126 HYETWNVGVLGPVTLSGLNEGKRDLSDQKWTYQIGLHGESLGVQSVAGSSSVEWGSAAGK 185
Query: 654 GIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDY 713
TW+K YF AP G PVALD+GSMGKGQAWVNG HIGRYW+ A GC C Y
Sbjct: 186 ---QPLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYWSYKASSSGC-GGCSY 241
Query: 714 RGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRS 767
G Y+ KC T CG+ +Q +YHVPRSWL S NLLV+ EE GG+ + + R+
Sbjct: 242 AGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVMLEEFGGDLSGVKLVTRT 295
>gi|298205259|emb|CBI17318.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 211 bits (536), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 126/293 (43%), Positives = 162/293 (55%), Gaps = 37/293 (12%)
Query: 77 MWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIG 136
MW L+ +KEGG DVIETYVF N HE Y F G D++KFVK+V +G+YL L IG
Sbjct: 1 MWSGLVKTAKEGGIDVIETYVFQNGHELSPSNYYFGGWYDLLKFVKIVQQAGMYLILHIG 60
Query: 137 PYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPII 196
P+V EWNFG F+TN+ PFK MQ+F+ IV++M+++ LF+ QGGPII
Sbjct: 61 PFVATEWNFGTI-----------FQTNSKPFKYHMQKFMTLIVNIMKKDKLFASQGGPII 109
Query: 197 MLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENI-IDACNGYY 255
+ Q +NEYG+ + Y GK YV WAA+M L GVPW+MC+ + I I G Y
Sbjct: 110 LTQAKNEYGDTKRIYEDGGKPYVMWAANMVLSHNIGVPWIMCQYSYVDIYIYIVKKEGLY 169
Query: 256 CDGYK--------------PNSYN----KPTLWTE-NWDGWYTTWGGRLPHRPVED-LAF 295
Y+ NS+ KP + DG L HR + D +
Sbjct: 170 SLSYQYALILSTLVTHSIVTNSHQILQAKPKCGLKIGLDGL-----KHLGHRILTDYMKI 224
Query: 296 AVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPK 348
+ +NYYMY GGTNFG TSGGPF T+Y+Y+APIDEYGL PK
Sbjct: 225 LLFLLLFFFFQKVNYYMYHGGTNFGCTSGGPFITTTYNYNAPIDEYGLARLPK 277
>gi|147838572|emb|CAN74312.1| hypothetical protein VITISV_037520 [Vitis vinifera]
Length = 1915
Score = 210 bits (535), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 102/147 (69%), Positives = 119/147 (80%), Gaps = 6/147 (4%)
Query: 383 AHVYR------ANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTV 436
AHVYR + + G+ S+CSAFLANIDEH ASVTFLGQ Y LPPWSVSILPDCR TV
Sbjct: 125 AHVYRVKESLYSTQSGNGSSCSAFLANIDEHKTASVTFLGQIYKLPPWSVSILPDCRTTV 184
Query: 437 FNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFT 496
FNTAKV +QTSIKTVEF LPL NISV Q M+++K+S K+WMT+KEPI VWSENNFT
Sbjct: 185 FNTAKVGAQTSIKTVEFDLPLVRNISVTQPLMVQNKISYVPKTWMTLKEPISVWSENNFT 244
Query: 497 VQGILEHLNVTKDYSDYLWHITQIYVS 523
+QG+LEHLNVTKD+SDYLW IT+ ++
Sbjct: 245 IQGVLEHLNVTKDHSDYLWRITRYIIT 271
>gi|229084352|ref|ZP_04216632.1| Beta-galactosidase [Bacillus cereus Rock3-44]
gi|228698892|gb|EEL51597.1| Beta-galactosidase [Bacillus cereus Rock3-44]
Length = 867
Score = 205 bits (522), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 137/418 (32%), Positives = 210/418 (50%), Gaps = 32/418 (7%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
++YD ++ I R ++SA IHY R W D++ K+K GG + IETY+ WN HE
Sbjct: 2 ITYDKKSWKIHNKRIFILSAAIHYFRLPKAEWDDVLEKAKAGGCNTIETYIPWNFHEMKE 61
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G+++F G D+ F++L + GLY+ R GPY+CAEW+FGGFP WL I++R+
Sbjct: 62 GEWDFSGDKDLAHFLQLCANKGLYVIARPGPYICAEWDFGGFPWWLSTKKDIQYRSAQPS 121
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
F + ++ +++ ++ E L + G +IM+QIENE+ +YG+ K Y+++
Sbjct: 122 FLHYVDQYFDQVISIIDEYQLT--KNGSVIMVQIENEF----QAYGKPDKKYMEYLRDGM 175
Query: 227 LGLGAGVPWVMC-KQTDAPENIIDACNG--YYCDGYKPNSYNKPTLWTENWDGWYTTWGG 283
+ G VP+V C D + +G + ++P E W GW+ WGG
Sbjct: 176 IARGIEVPFVTCYGAVDGAVEFRNFWSGANRAAEILDERFADQPKGVMEFWIGWFEHWGG 235
Query: 284 -RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNF----GRT-SGGPFYITSYDYDAP 337
+ + E L + + G + +NYYMYFGGTNF GRT S F T+YDYD
Sbjct: 236 NKANQKTPEQLERECYQLLRNGFTTINYYMYFGGTNFDHWGGRTVSEQVFCTTTYDYDVA 295
Query: 338 IDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQY-IKLGQNQEAHVYRANRYGSQSN 396
IDEY L K+ LK H +K EP A+ A +KL + ++ R S
Sbjct: 296 IDEY-LQPTRKYEVLKRYHLFVKWLEPLFTNAEQANSDVKLSSD-----LKSGRIVSPHG 349
Query: 397 CSAFLANIDEHTAASVTFLGQSYTLPPWSV---SILPDCRNTVFNTAKVSSQTSIKTV 451
F+ N S G L P+++ ++LP RN KV ++ +IKT+
Sbjct: 350 EVLFIENNRNERIQSHVKHGNE--LVPFTIEANAVLPIVRN-----VKVGNRFTIKTL 400
Score = 39.3 bits (90), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 47/115 (40%), Gaps = 30/115 (26%)
Query: 661 WYKTYFD-APDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNS 719
W+K+ F PD V + L + KG WVNG +GRYW + G Q+
Sbjct: 770 WHKSRFTWNPDNGSIVKVRLNQLSKGCFWVNGQCLGRYWNI-----GPQED--------- 815
Query: 720 DKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQ 774
Y +P S L+ N +VIF+E G P + + S + E+
Sbjct: 816 --------------YKIPASLLKEQNE-IVIFDEEGVVPDHVVIHSYSPFVQSEK 855
>gi|166092020|gb|ABY82047.1| beta-galactosidase [Hymenaea courbaril var. stilbocarpa]
Length = 138
Score = 196 bits (497), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 90/138 (65%), Positives = 103/138 (74%)
Query: 199 QIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDG 258
QIENEYG +E GK Y WAA MA+GL GVPWVMCKQ DAP+ +ID CNGYYC+
Sbjct: 1 QIENEYGPVEWEIRAPGKAYTAWAAKMAVGLNTGVPWVMCKQDDAPDPVIDTCNGYYCEN 60
Query: 259 YKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTN 318
+ PN KP +WTENW GWYT +GG +P RPVED+A++V RF Q GGSF+NYYMY GGTN
Sbjct: 61 FTPNKNYKPKMWTENWSGWYTEYGGAVPKRPVEDIAYSVTRFIQNGGSFVNYYMYHGGTN 120
Query: 319 FGRTSGGPFYITSYDYDA 336
FGRT G F TSYDYDA
Sbjct: 121 FGRTYSGLFIATSYDYDA 138
>gi|326331074|ref|ZP_08197372.1| beta-galactosidase [Nocardioidaceae bacterium Broad-1]
gi|325951115|gb|EGD43157.1| beta-galactosidase [Nocardioidaceae bacterium Broad-1]
Length = 586
Score = 196 bits (497), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 121/350 (34%), Positives = 184/350 (52%), Gaps = 36/350 (10%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
++DG ++S +HY R P+ W D I K++ G + IETYV WNAH G ++ G
Sbjct: 11 FLLDGEPFRILSGALHYFRVHPDQWADRIEKARLMGLNTIETYVPWNAHSPRPGVFDTDG 70
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F++LV +G+Y +R GP++CAEW+ GG P WL PG+ R + F +E+++
Sbjct: 71 ILDLPRFLRLVKDAGMYAIVRPGPFICAEWDNGGLPPWLFREPGVGIRRHEPRFLDEVEK 130
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
++ +++ L+R + GGP++++Q+ENEYG +YG +DY++ A M G G V
Sbjct: 131 YLHQVLALVRPHQVD--LGGPVLLVQVENEYG----AYGDD-RDYLQAVADMIRGAGIDV 183
Query: 234 PWVMCKQTDAPENIIDACNG----YYCDGYKPNSYNK-----------PTLWTENWDGWY 278
P V Q P + + A G + +S N+ P + E WDGW+
Sbjct: 184 PLVTVDQ---PVDAMLAAGGLDGVLRTSSFGSDSANRLRTLRDHQPTGPLMCMEFWDGWF 240
Query: 279 TTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG---PFY---ITSY 332
WGGR PVE A + G S +N YM+ GGTNFG TSG Y +TSY
Sbjct: 241 DHWGGRHHTTPVEQAAEELDALLAAGAS-VNVYMFHGGTNFGLTSGANDKGIYRPTVTSY 299
Query: 333 DYDAPIDEYGLLSEPKWGHLKDL---HAAIKLCEPALVAADSAQYIKLGQ 379
DYDAP+DE G + K+ +D+ +A + P Q + LG+
Sbjct: 300 DYDAPLDEAGNPTA-KYFAFRDVIARYAPVPAEVPVATGPAPEQTVALGE 348
Score = 43.1 bits (100), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 45/167 (26%), Positives = 70/167 (41%), Gaps = 30/167 (17%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLE 597
+T+ +RD VF++G G + + + + G L L+ + G NYGA +
Sbjct: 396 VLTLGEVRDRAAVFLDGAPVGVLEREHAE--RAIALPRGRGRLELVVEDQGRVNYGARIG 453
Query: 598 KD----GAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRD 653
+ G V LT ++ IDL ++ ++ G D
Sbjct: 454 EHKGLVGPALLDGVPLTDWEILAIDLDRVPRLWESG---------------------PPD 492
Query: 654 GIPSTF--TWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
G S T ++ +F A +D V LD + GKG AWVNG +GRYW
Sbjct: 493 GHGSGVGPTAWRAHFAAEPDVD-VFLDTSAWGKGIAWVNGFCLGRYW 538
>gi|334134215|ref|ZP_08507725.1| putative beta-galactosidase [Paenibacillus sp. HGF7]
gi|333608023|gb|EGL19327.1| putative beta-galactosidase [Paenibacillus sp. HGF7]
Length = 940
Score = 196 bits (497), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 121/350 (34%), Positives = 182/350 (52%), Gaps = 21/350 (6%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
V YD + IIDG R ++SA +HY R W +++ KSKE G + IETYV WN HE
Sbjct: 5 RVQYDRNSWIIDGRRVFILSAAVHYFRLPRAEWAEVLDKSKEAGCNCIETYVPWNWHEEE 64
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
GQ++F G D+ F+ L GLY+ +R GPY+CAEW+ GG P WL P +++R +
Sbjct: 65 EGQWDFSGDKDLGAFLDLCAERGLYVIVRPGPYICAEWDMGGLPYWLERKPDMQYRKFHR 124
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
F + + ++V ++ +L + G +IM+Q+ENE+ + G+ K Y+++
Sbjct: 125 EFLHYVDLYWDRLVPVVLPRLLSN--SGTVIMVQVENEF----QALGKPDKAYMEYLRDG 178
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGY-----YCDGYKPNSYNKPTLWTENWDGWYTT 280
+ G VP V C A + ++ N + + + ++P E W GW+
Sbjct: 179 LIERGIDVPLVTC--YGAVDGAVEFRNFWSHAEEHARTLEERFADQPKGVLEFWIGWFEQ 236
Query: 281 WGG-RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNF----GRTSGG-PFYITSYDY 334
WGG R + + + G + +NYYM+FGGTNF GRT G F TSYDY
Sbjct: 237 WGGPRANQKTASQVERKTYELIREGFTAINYYMFFGGTNFGHWGGRTIGEHTFMTTSYDY 296
Query: 335 DAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALV-AADSAQYIKLGQNQEA 383
DA +DEY L K+ LK +H ++ EP L S +I LG++ A
Sbjct: 297 DAALDEY-LRPTAKYKALKLVHDFVRWMEPLLTETTGSTAFIPLGKHSSA 345
>gi|284030079|ref|YP_003380010.1| beta-galactosidase [Kribbella flavida DSM 17836]
gi|283809372|gb|ADB31211.1| Beta-galactosidase [Kribbella flavida DSM 17836]
Length = 582
Score = 194 bits (493), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 113/307 (36%), Positives = 164/307 (53%), Gaps = 26/307 (8%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
++DG ++S +HY R P++W D I K++ G + IETYV WNAH RG ++ G
Sbjct: 11 FLLDGEPFRILSGALHYFRVHPDLWADRIDKARRMGLNTIETYVPWNAHSPRRGVFDTDG 70
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F++ V ++GLY +R GPY+CAEW+ GG P WL PG+ R F +++
Sbjct: 71 MLDLGRFLEQVAAAGLYAIVRPGPYICAEWDNGGLPAWLFQEPGVGVRRYEPRFLAAVEQ 130
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
++++++DL+R L QGGP+++LQ+ENEYG ++G +Y++ A M G V
Sbjct: 131 YLEQVLDLVRP--LQVDQGGPVLLLQVENEYG----AFGND-PEYLEAVAGMIRKAGITV 183
Query: 234 PWVMCKQTDAPENIIDACNGYYCDG------------YKPNSYNKPTLWTENWDGWYTTW 281
P V Q +G G + + P + E WDGW+ W
Sbjct: 184 PLVTVDQPTGEMLAAGGLDGVLRTGSFGSRSAERLATLREHQPTGPLMCMEFWDGWFDHW 243
Query: 282 GGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG----GPFY--ITSYDYD 335
GG VED A + G S +N YM+ GGTNFG TSG G F +TSYDYD
Sbjct: 244 GGPHHTTSVEDAARELDALLAAGAS-VNIYMFHGGTNFGLTSGADDKGVFRPTVTSYDYD 302
Query: 336 APIDEYG 342
AP+DE G
Sbjct: 303 APLDEAG 309
>gi|410456453|ref|ZP_11310314.1| beta-galactosidase [Bacillus bataviensis LMG 21833]
gi|409928122|gb|EKN65245.1| beta-galactosidase [Bacillus bataviensis LMG 21833]
Length = 867
Score = 193 bits (491), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 133/421 (31%), Positives = 209/421 (49%), Gaps = 38/421 (9%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
++YD ++ I R ++SA IHY R W +++ K+K GG + IETY+ WN HE
Sbjct: 2 ITYDKKSWKIHNERVFILSAAIHYFRLPRAEWNEVLDKAKAGGCNTIETYIPWNFHEMNE 61
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G+++F G D+ F +L LY+ R GPY+CAEW+FGGFP WL I++R+
Sbjct: 62 GEWDFSGDKDLAHFFQLCADKELYVIARPGPYICAEWDFGGFPWWLSTKKDIQYRSAQPA 121
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
F + ++ +++ ++ E L + G +IM+Q+ENE+ +YG+ K Y+++
Sbjct: 122 FLHYVDQYFDRVIPIIDEYQLT--KNGTVIMVQVENEF----QAYGKPDKPYMEYIRDGM 175
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSY-----NKPTLWTENWDGWYTTW 281
G VP V C A E ++ N + + ++P E W GW+ W
Sbjct: 176 KARGIDVPLVTC--YGAVEGAVEFRNFWSHSKHAAAILDERFPDQPKGVMEFWIGWFEQW 233
Query: 282 GG-RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNF----GRTSG-GPFYITSYDYD 335
GG + + E L + G + +NYYMYFGGTNF GRT G T+YDYD
Sbjct: 234 GGNKADQKTPEQLERECYQLLSNGFTAINYYMYFGGTNFDHWGGRTVGEQTLCTTTYDYD 293
Query: 336 APIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADS-AQYIKLGQNQEAHVYRANRYGS- 393
IDEY L K+ LK H+ +K EP A+ A +KL + ++ A+ YG
Sbjct: 294 VAIDEY-LQPTRKYEVLKRYHSFVKWLEPLFTDAEKVASDMKLPSDLKSERI-ASPYGEV 351
Query: 394 ---QSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKT 450
++N + + + +H + F ++ T +LP RN KV + +IKT
Sbjct: 352 IFIENNRNERIQSHVKHGYDQILFTIEANT-------VLPIVRN-----VKVGNHFTIKT 399
Query: 451 V 451
+
Sbjct: 400 L 400
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 45/104 (43%), Gaps = 30/104 (28%)
Query: 661 WYKTYFD-APDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNS 719
WYK++F PD V + L + KG WVNG +GRYW + G Q+
Sbjct: 770 WYKSHFTWNPDNGSIVKVRLNHLSKGCFWVNGECLGRYWNI-----GPQED--------- 815
Query: 720 DKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
Y +P S L+ N +VIF+E G P ++ +
Sbjct: 816 --------------YKIPVSLLKDQNE-IVIFDEEGYAPDDVVI 844
>gi|2289790|dbj|BAA21669.1| beta-galactosidase [Bacillus circulans]
Length = 586
Score = 192 bits (489), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 120/340 (35%), Positives = 171/340 (50%), Gaps = 24/340 (7%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
++YD + ++DG L+S +HY R PE W D + K K G + +ETYV WN HE
Sbjct: 3 QLTYDD-SFLLDGKEIRLLSGAMHYFRTVPEYWEDRLLKLKACGFNTVETYVAWNLHEPE 61
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
GQ+ F+G DIV+F+K GL++ +R GP++CAEW FGGFP WL +P I+ R N
Sbjct: 62 EGQFVFEGIADIVRFIKTAEKVGLHVIVRPGPFICAEWEFGGFPYWLLTVPNIKLRCFNQ 121
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVKWAA 223
P+ E++ + + + +R L S GGPII LQIENEYG+ + Y Q +D +K
Sbjct: 122 PYLEKVDAYFDVLFERLRP--LLSSNGGPIIALQIENEYGSFGNDQKYLQYLRDGIKKRV 179
Query: 224 SMALGLGAGVPWVMCKQTDAPENIIDACN-GYYCDG-------YKPNSYNKPTLWTENWD 275
L + P E I + N G + Y+PN+ P + E W
Sbjct: 180 GNELLFTSDGPEPSMLSGGMIEGIFETVNFGSRAESAFAQLKQYQPNA---PLMCMEFWH 236
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-------Y 328
GW+ WG R E + + ++ GS +N+YM GGTNFG +G
Sbjct: 237 GWFDHWGEEHHTRSAESVVETLEEILKQNGS-VNFYMAHGGTNFGFYNGANHNETDYQPT 295
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVA 368
ITSYDYD + E G ++E + K + L E L A
Sbjct: 296 ITSYDYDGLLTESGDVTEKFYAVRKVFEKYVDLPELNLPA 335
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 48/217 (22%), Positives = 81/217 (37%), Gaps = 47/217 (21%)
Query: 536 RPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAF 595
+ +T+ + D +V++NG+ G V + VE + L ++ + +G NYG F
Sbjct: 394 KQALTVQDIHDRGQVYVNGEYVGIVERNRGCSRLVVELTEEESKLQIIVENMGRINYGPF 453
Query: 596 LEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGI 655
+ V G G ++ L+ + V Y + + + T D +
Sbjct: 454 V----------VDYKGITEGVRLGNQFLFDWTV---------YPLPLKDLSSLEFTADEV 494
Query: 656 PSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRG 715
F ++ D +DL KG +VNGHH+GRYW +
Sbjct: 495 KENFPYFHKGILTVDKAADTFIDLSEWTKGVVFVNGHHLGRYWEI--------------- 539
Query: 716 AYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
P QT Y VP +LQ N +++ E
Sbjct: 540 ------------GPQQTLY-VPAPFLQEGENEIILLE 563
>gi|62321782|dbj|BAD95407.1| galactosidase [Arabidopsis thaliana]
Length = 270
Score = 192 bits (488), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 101/234 (43%), Positives = 136/234 (58%), Gaps = 8/234 (3%)
Query: 612 FKNGDIDLSKILWTYQVGLKGEFQQIYSIEENE-AEWTDLTRDGIPSTFTWYKTYFDAPD 670
G DLS WTY+VGLKGE ++S+ + EW + TWYKT F AP
Sbjct: 1 LNGGRRDLSWQKWTYKVGLKGESLSLHSLSGSSSVEWAEGAFVAQKQPLTWYKTTFSAPA 60
Query: 671 GIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPT 730
G P+A+D+GSMGKGQ W+NG +GR+W G C + C Y G + DKC NCG +
Sbjct: 61 GDSPLAVDMGSMGKGQIWINGQSLGRHWPAYKAVGSCSE-CSYTGTFREDKCLRNCGEAS 119
Query: 731 QTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSY 790
Q WYHVPRSWL+ S NLLV+FEE GG+P I++ R VC + E V +Y
Sbjct: 120 QRWYHVPRSWLKPSGNLLVVFEEWGGDPNGITLVRREVDSVCADIYEWQSTLV-----NY 174
Query: 791 SVDGKLSINK-MAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMS 843
+ +NK + P+ HL C G I++++FAS+GTP+G C + +G+CHA S
Sbjct: 175 QLHASGKVNKPLHPKAHLQCGPGQKITTVKFASFGTPEGTCGSYRQGSCHAHHS 228
>gi|340370414|ref|XP_003383741.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Amphimedon
queenslandica]
Length = 689
Score = 191 bits (485), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 127/386 (32%), Positives = 199/386 (51%), Gaps = 45/386 (11%)
Query: 7 NRALLQCLALSVYPMMMMMMMIHLSCVSSSSASTFF----------KPFN--VSYDHRAI 54
NRAL+ + L+V ++ +++ S + S KP + +S D +
Sbjct: 19 NRALVIIIILAVCFILYLLLPTSSSREDEKAPSNSLLDRLKPNVGRKPADPALSLDEDSF 78
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
I G + ++S IHY R P+ W D + K K G + ++TYV WN HE + G+++F G
Sbjct: 79 YIRGKKTHILSGSIHYFRVVPDYWTDRLKKLKAMGLNTVDTYVSWNLHEPMPGEFDFSGL 138
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
+I +F+K+ S L + +R GPY+C+EW+ GG P WL P ++ R+N P+++ ++RF
Sbjct: 139 LNIHEFIKIAHSLELNVIVRPGPYICSEWDNGGLPAWLLHDPNMKIRSNYKPYQDAVKRF 198
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQ---GKDYVKWAASMALGLGA 231
K+ +++ L S GGPII Q+ENEY ++YG + G+ ++++ A++ LGA
Sbjct: 199 FTKLFEILTP--LQSSYGGPIIAFQVENEY----AAYGPRNATGRHHMQYLANLMRSLGA 252
Query: 232 GVPWVMCK-QTD-------APENIIDACNGYYCDGYKPNSY-----NKPTLWTENWDGWY 278
++ Q D AP N + N N NKP L E W GW+
Sbjct: 253 VELFITSDGQNDIKASSDMAPNNALLTVNFQNDPSEALNKLLLVQPNKPPLVMEYWTGWF 312
Query: 279 TTWGGRLPHRPV--EDLAFAVARFFQRGGSFMNYYMYFGGTNFG-----RTSGGPFY--I 329
WG R R + L + Q GGSF N YM+ GGTNFG GG + +
Sbjct: 313 DHWGRRHLERTLSPSQLIVNIGTILQMGGSF-NLYMFHGGTNFGFMNGANIEGGEYRPDV 371
Query: 330 TSYDYDAPIDEYGLLSEPKWGHLKDL 355
TSYDYDAP+ E G +++ K+ L++L
Sbjct: 372 TSYDYDAPLSEAGDITK-KYTLLREL 396
>gi|297734971|emb|CBI17333.3| unnamed protein product [Vitis vinifera]
Length = 447
Score = 191 bits (484), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 148/478 (30%), Positives = 230/478 (48%), Gaps = 80/478 (16%)
Query: 237 MCKQTDAPENIIDACNGYYC-DGYK-PNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLA 294
MCKQ DAP+ +I+ C G C D + PN NK ++ TE + + ++ H
Sbjct: 1 MCKQKDAPDPVINTCKGRNCGDTFTGPNRPNKRSVSTEYLETPHLKGQQKILH------- 53
Query: 295 FAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKD 354
+ F + G+ NYYMY+ TNFGRT+ F T Y +AP+DEYGL E KWGHL+D
Sbjct: 54 ---SLFISKNGTLANYYMYYSVTNFGRTTSS-FATTCYYDEAPLDEYGLPRETKWGHLRD 109
Query: 355 LHAAIKLCEPALV-AADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVT 413
LHAA++L + AL+ SAQ KLG++ EA +Y + GS C+ FL N T + T
Sbjct: 110 LHAALRLSKKALLWGVTSAQ--KLGEDLEARIYE--KPGSNI-CATFLLNNITRTPTTTT 164
Query: 414 FLGQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKL 473
G Y LP S+S LPDC+ VFNT V+S I P S S+ + +M L
Sbjct: 165 LRGSKYYLPQHSISNLPDCKTVVFNTQTVASNYLI------FPFSMFDSLNEPNMKTDAL 218
Query: 474 SSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTN 533
+ + K P+ E + +TKD +DYLW+ T+
Sbjct: 219 PTYEECPTKTKSPV--------------ELMTMTKDTTDYLWYTTK-------------K 251
Query: 534 EVRPTVTIDSMRDVLRVFINGQ------LTGSVIGHWVK----VVQPVEFQSGYNDLILL 583
+V + ++ V+ F+NG+ LTG+ G V+ +P+ ++G N + L
Sbjct: 252 DVLRVPQVSNLGHVMHAFLNGEYVMEFYLTGTRHGSNVEKSFVFNKPITLKAGLNQIAPL 311
Query: 584 SQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEEN 643
TVGL + G+++E AG V + G IDL K W ++VGL G+ +++ +
Sbjct: 312 GATVGLPDSGSYMEHRLAGVH-NVAIQGLNTRTIDLPKNGWGHKVGLNGDKLHLFTQPPS 370
Query: 644 EAEWTDLTRDGIPSTF--------TWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHH 693
++ + +P F ++ PDGI+ + L+ ++ +++ HH
Sbjct: 371 QSVY------HVPRAFLKTSDNLLVLFEETGRNPDGIEILTLNRDTIC---CYISEHH 419
>gi|443684013|gb|ELT88070.1| hypothetical protein CAPTEDRAFT_181391 [Capitella teleta]
Length = 655
Score = 188 bits (478), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 114/341 (33%), Positives = 182/341 (53%), Gaps = 39/341 (11%)
Query: 53 AIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFK 112
A ++G + +L+S +HY R PE W D + K K G + +ETYV WNAHE++RG ++F
Sbjct: 10 AFFLNGKKTLLLSGAVHYFRVVPEYWRDRLLKVKAAGLNCVETYVAWNAHEAVRGTFDFS 69
Query: 113 GKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQ 172
G D+ +F+++ GLY+ LR GPY+C+EW+FGG P WL P ++ RT+ P+ E +
Sbjct: 70 GILDLRRFIQIAQDVGLYVLLRPGPYICSEWDFGGLPSWLLHDPEMKVRTSYPPYLEAVD 129
Query: 173 RFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGN----------MESSYGQQGKDYVKWA 222
++ KI+ L+ + L +GGPII +Q+ENEYG+ +++ + + G + + +
Sbjct: 130 AYLAKILPLVND--LQMSKGGPIIAVQLENEYGSYGDDLDYKLFLKNQFIKYGIEELLFT 187
Query: 223 ASMALGLGAG-VPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNK--PTLWTENWDGWYT 279
+ G+ G +P V+ A N + GY Y N P + E W GW+
Sbjct: 188 SDNGTGIQNGPIPGVL-----ATTNFQEQEQGYLMFEYLRNIKQPGLPMMVMEFWSGWFD 242
Query: 280 TWGGRLPHRPVEDLAFA-VARFFQRGGSFMNYYMYFGGTNFGRTSGG------------- 325
WG + H F V ++ GS +N+YM+ GGTNFG +G
Sbjct: 243 HWGEQ--HNLCHHAEFIDVFKWILLEGSSVNFYMFHGGTNFGFMAGANEDFGATNEGGGE 300
Query: 326 PFY--ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEP 364
P+ TSYDYD P+ E G L+E K+ ++++ + +K P
Sbjct: 301 PYAADTTSYDYDCPVSESGQLNE-KFYEIRNILSEMKTLLP 340
>gi|219117911|ref|XP_002179741.1| beta-galactosidase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408794|gb|EEC48727.1| beta-galactosidase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 951
Score = 186 bits (472), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 207/836 (24%), Positives = 344/836 (41%), Gaps = 150/836 (17%)
Query: 45 FNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHES 104
+VSYD RAI I+ R +L+S +H RAT W + ++ G ++I Y+FW AH+S
Sbjct: 148 LSVSYDERAIRINDKRVLLLSGSMHPVRATRGTWEHALDEAVYNGLNMITVYIFWGAHQS 207
Query: 105 IRGQ---YNFKGKN--------DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWL- 152
R + ++ G + ++ ++ + GL++ +RIGPY C E+ +GG P WL
Sbjct: 208 FRDEPLNWSLDGSSIGPKESQWELADALRSAANRGLFIHVRIGPYACGEYTYGGIPEWLP 267
Query: 153 RDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENE--------- 203
+ R N P+ + M+ FV + + L++ QGGPI++ QIENE
Sbjct: 268 LQSSTMRMRRLNRPWLDAMEGFVAATITYLSSFNLWAHQGGPILIAQIENELGSGVDGSA 327
Query: 204 ---------------------------YGNMESSYGQQG----------KDYVKWAASMA 226
YG++ + +G +DY W ++
Sbjct: 328 AANYVVLERDEFNDDKHEDSHLLQLDRYGHILENASSRGMDSELRNATVQDYADWCGNLV 387
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNG--------YYCDGYKPNSYNKPTLWTENWDGWY 278
L V W MC A EN I NG Y D + ++P +WTE+ +G +
Sbjct: 388 ARLAPNVIWTMCNGLSA-ENTISTFNGNNGIDWLEKYGDSGRIQ-VDQPAIWTED-EGGF 444
Query: 279 TTWGGRLPHRPVE--------DLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFYIT 330
WG + P +P + +A ++F RGG+ +NYYM++GG N GR+S +
Sbjct: 445 QLWGDQ-PSKPSDYFWGRTSRAMATDALQWFARGGTHLNYYMWWGGYNRGRSSAAGI-MN 502
Query: 331 SYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANR 390
+Y DA + G PK+ H LH I L+ A ++ L +N + +
Sbjct: 503 AYATDAFLCSSGQRRHPKYDHFLALHLVIADIAAILLHAPTS----LLKNASVEIMDGDD 558
Query: 391 YGSQSNCSAFLANI-DEHTAASVTFL-GQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSI 448
+ N FL + D H + V FL + T ++ + VF SSQ I
Sbjct: 559 WIVGDNQRQFLYQVLDTHDSKQVIFLENDANTTEMARLTGAKADDSLVFVMKPYSSQIVI 618
Query: 449 KTV---EFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPI-GVWSENNFTVQG-ILEH 503
+ + S + +S + E + SW EPI G ++ N V LE
Sbjct: 619 DGIVAFDSSTISTKAMSFRRTLHYEPAVLLHLTSW---SEPIAGADTDQNAHVSTEPLEQ 675
Query: 504 LNVTKDY---SDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSV 560
N+ SDY W+ T + + D+ ++V+ + + L VFI+G G
Sbjct: 676 TNLNSKASISSDYAWYGTDVKI---DVVL---SQVKLYIGTEKAT-ALAVFIDGAFIGEA 728
Query: 561 IGHW------VKVVQPVEFQSGYNDLILLSQTVGLQN----YGAFLEKDGAGFRGQVKL- 609
H V ++ +G + L +L +++G N +GA G G V +
Sbjct: 729 NNHQHAEGPTVLSIEIESLAAGTHRLAILCESLGYHNLIGRWGAITTAKPKGITGNVLIG 788
Query: 610 TGFKNGDIDL--SKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFD 667
+ + +I L + +W GL E + E+ + D + W F
Sbjct: 789 SPLLSENISLVDGRQMWWSLPGLSVERKAARHGLRRES-FEDAAQAEAGLHPLWSSVLFT 847
Query: 668 APD---GIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTT 724
+P + + LDL S G+G W+NG +GRYW + +G + DY
Sbjct: 848 SPQFDSTVHSLFLDLTS-GRGHLWLNGKDLGRYWNIT--RGNSWN--DY----------- 891
Query: 725 NCGNPTQTWYHVPRSWLQASNNL--LVIFEETGGNPFEISVKLRSTRIVCEQVSES 778
+Q +Y +P +L L L++F+ GG+ + R++ + ES
Sbjct: 892 -----SQRYYFLPADFLHLDGQLNELILFDMLGGDH-------SAARLLLSSIEES 935
>gi|414160019|ref|ZP_11416290.1| hypothetical protein HMPREF9310_00664 [Staphylococcus simulans
ACS-120-V-Sch1]
gi|410878669|gb|EKS26539.1| hypothetical protein HMPREF9310_00664 [Staphylococcus simulans
ACS-120-V-Sch1]
Length = 597
Score = 185 bits (470), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 116/355 (32%), Positives = 175/355 (49%), Gaps = 43/355 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
++DG ++S IHY R PE W + K G + +ETYV WN HE++ G+++F G
Sbjct: 10 FMLDGKPLKILSGAIHYFRVLPEDWEHSLYNLKALGFNAVETYVPWNFHETVEGEFDFSG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
DI +F+ + GLY+ +R PY+CAEW FGG P WL P + R+ + F E ++R
Sbjct: 70 TKDIKRFIHTAEAIGLYVIIRPSPYICAEWEFGGLPAWLLTKPNLRVRSRDPQFLEYVER 129
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ ++ +++ + GPI+M+Q+ENEYG SYG+ K Y+ A M G V
Sbjct: 130 YYDRLFEILTPLQID--HHGPILMMQVENEYG----SYGED-KTYLSALARMMRDRGVTV 182
Query: 234 P-------WVMCKQTD--APENIIDACNGYYCDGYKPNSYNK---------PTLWTENWD 275
P W C + A +II N + ++ +K P + E WD
Sbjct: 183 PLFTSDGSWQQCLEAGSLAEADIIPTGNFGSKSQKRLDNLHKFHQQFGKTWPLMSMEFWD 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------- 328
GW+ WG R+ R ++L + +RG +N YM+ GGTNFG +G
Sbjct: 243 GWFNRWGDRIITRQSDELIDEIGEVLKRGS--INLYMFHGGTNFGFWNGCSARGRIDLPQ 300
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLK------DLHAAIKLCEPALVAADSAQYIKL 377
+TSYDYDAP+DE G P + K LH I+ P + + ++I L
Sbjct: 301 VTSYDYDAPLDEAG---NPTVKYYKIQQLVHKLHPEIQQTTPKVKPLMAKEHITL 352
>gi|251795198|ref|YP_003009929.1| beta-galactosidase [Paenibacillus sp. JDR-2]
gi|247542824|gb|ACS99842.1| Beta-galactosidase [Paenibacillus sp. JDR-2]
Length = 584
Score = 185 bits (470), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 122/383 (31%), Positives = 183/383 (47%), Gaps = 60/383 (15%)
Query: 52 RAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNF 111
+ ++++ +I+ IHY R PE W D + K K G + +ETYV WN HE G++ F
Sbjct: 9 KQLMLNDRPFRIIAGAIHYFRVVPEYWRDRLLKLKACGFNTVETYVPWNFHEPEEGRFVF 68
Query: 112 KGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEM 171
+G D+ KF+ L G GLY +R PY+CAEW FGG P WL PG+ R + PF ++
Sbjct: 69 EGMADLEKFIALAGELGLYAIVRPSPYICAEWEFGGLPAWLLKDPGMRLRCSYKPFLDKA 128
Query: 172 QRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGA 231
+ +++ R S +GGP+I +QIENEYG SYG K Y+ + + G
Sbjct: 129 DAYYDELIP--RLTPFLSTKGGPLIAMQIENEYG----SYGND-KTYLNYLKEALVKRGV 181
Query: 232 GVPWVMCKQTDAPENIIDACNGYYCDG--------------------YKPNSYNKPTLWT 271
V+ +D PE+ + G +G Y+P ++P +
Sbjct: 182 D---VLLFTSDGPEDFM--LQGGMVEGVWETVNFGSRSAEAFAKLQEYQP---DQPLMCM 233
Query: 272 ENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF---- 327
E W+GW+ WG R D+A + G S +N+YM+ GGTNFG SG +
Sbjct: 234 EFWNGWFDHWGETHHTRGAADVALVLDEMLAAGAS-VNFYMFHGGTNFGFFSGANYTDRL 292
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHV 385
+TSYDYD+P+ E G L+E + V A+Y +LG +
Sbjct: 293 LPTVTSYDYDSPLSESGELTEKYYA----------------VREVIAKYAELGPLELPAQ 336
Query: 386 YRANRYGS--QSNCSAFLANIDE 406
A +GS + + LA++DE
Sbjct: 337 IVAKSFGSVRMTGQARLLASLDE 359
>gi|323358527|ref|YP_004224923.1| beta-galactosidase [Microbacterium testaceum StLB037]
gi|323274898|dbj|BAJ75043.1| beta-galactosidase [Microbacterium testaceum StLB037]
Length = 574
Score = 184 bits (467), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 111/310 (35%), Positives = 163/310 (52%), Gaps = 34/310 (10%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
++DG +IS +HY R PE W D I +K G + IETYV WNAHE +RG+++ G
Sbjct: 12 LLDGRPHQVISGTLHYFRIHPEHWADRIRTAKAMGLNTIETYVAWNAHEPVRGEWDATGW 71
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
ND+ +F+ L+ + GL+ +R GPY+CAEW+ GG PVWL PGI R + F E + +
Sbjct: 72 NDLGRFLDLIAAEGLHAIVRPGPYICAEWHNGGLPVWLTSTPGIGIRRSEPQFVEAVSEY 131
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
++++ +++ + +GG ++++QIENEYG +YG K+Y++ + G VP
Sbjct: 132 LRRVYEIVAPRQID--RGGNVVLVQIENEYG----AYGSD-KEYLRELVRVTKDAGITVP 184
Query: 235 WV--------MCKQTDAPENIIDACNGY----YCDGYKPNSYNKPTLWTENWDGWYTTWG 282
M + PE + G + + P + +E WDGW+ WG
Sbjct: 185 LTTVDQPMPWMLEAGSLPELHLTGSFGSRSAERLATLREHQPTGPLMCSEFWDGWFDWWG 244
Query: 283 G----RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG----GPF--YITSY 332
P DL +A G+ +N YM GGTNFG T+G G F +TSY
Sbjct: 245 SIHHTTDPAASAHDLDVLLA-----AGASVNIYMVHGGTNFGTTNGANDKGRFDPIVTSY 299
Query: 333 DYDAPIDEYG 342
DYDAPIDE G
Sbjct: 300 DYDAPIDESG 309
>gi|156376589|ref|XP_001630442.1| predicted protein [Nematostella vectensis]
gi|156217463|gb|EDO38379.1| predicted protein [Nematostella vectensis]
Length = 570
Score = 183 bits (465), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 128/411 (31%), Positives = 187/411 (45%), Gaps = 52/411 (12%)
Query: 75 PEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLR 134
PE W D + K K G + +ETYV WN HE ++ + FK + DIVKFVKL GLY+ +R
Sbjct: 2 PEYWKDRLVKLKAMGLNTVETYVAWNLHEQVQDNFKFKDELDIVKFVKLAQRLGLYVIIR 61
Query: 135 IGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGP 194
GPY+CAEW+ GG P WL P ++ RT+ PF E + R+ +K+ L+ L QGGP
Sbjct: 62 PGPYICAEWDLGGLPSWLLSDPEMKLRTSYGPFMEAVDRYFQKLFPLLTP--LQYCQGGP 119
Query: 195 IIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVM---------------CK 239
II QIENEY + + Y++ M + G +M K
Sbjct: 120 IIAWQIENEYSSFDKKVDMT---YMELLQKMMVKNGVTEMLLMSDNLFSMKTHPINLVLK 176
Query: 240 QTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVAR 299
+ +N+ DA K +KP + TE W GW+ WG + P E L +
Sbjct: 177 TINLQKNVKDALL-----QLKEIQPDKPLMVTEFWPGWFDVWGAKHHILPTEKLIKEIKD 231
Query: 300 FFQRGGSFMNYYMYFGGTNFGRTSGGPFY--------------ITSYDYDAPIDEYGLLS 345
F G S +N+YM+ GGTNFG +G F ITSYDYDAP+ E G ++
Sbjct: 232 LFSLGAS-INFYMFHGGTNFGFMNGASFTPSGVSVLEGDYQPDITSYDYDAPLSESGDIT 290
Query: 346 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLANID 405
PK+ K L I+ P + K + +++ N +++ F I
Sbjct: 291 -PKY---KALRKFIREHAPNPFPDIPSNLYKGAYGKTMYLF--NFLIEETDQKIFDQAIV 344
Query: 406 EHTAASVTFL------GQSYTLPPWSVSILPDCRNTVFNTAKVSSQTSIKT 450
T V FL GQ Y + ++ D ++ V + + + +
Sbjct: 345 SDTVKPVEFLPINNHGGQGYGFVIYQTALKHDAKSLVVEIVRDRAHVMVDS 395
>gi|361068121|gb|AEW08372.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
gi|383128330|gb|AFG44821.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
gi|383128334|gb|AFG44823.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
Length = 157
Score = 181 bits (458), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 85/159 (53%), Positives = 111/159 (69%), Gaps = 4/159 (2%)
Query: 689 VNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNL 747
VNG IGRYW + +A +GGC D+CDYRGAY+S KC TNCG P+Q YHVPRSW+Q + N+
Sbjct: 1 VNGKSIGRYWPSYIASQGGCTDSCDYRGAYSSSKCLTNCGQPSQKLYHVPRSWIQPTGNV 60
Query: 748 LVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHL 807
LV+FEE GG+P +IS RS VC +VSE+H PPV W + S L +NK E+ L
Sbjct: 61 LVLFEELGGDPTQISFMARSVGTVCARVSETHLPPVGSWKS--SATSGLKVNKPKAELQL 118
Query: 808 HC-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLS 845
HC G++I SI+FAS+GTP GRC F+ G+C+ ++S
Sbjct: 119 HCPSSGHLIKSIKFASFGTPTGRCGSFTYGHCNTNSTMS 157
>gi|319934802|ref|ZP_08009247.1| beta-galactosidase [Coprobacillus sp. 29_1]
gi|319810179|gb|EFW06541.1| beta-galactosidase [Coprobacillus sp. 29_1]
Length = 589
Score = 180 bits (456), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 118/381 (30%), Positives = 184/381 (48%), Gaps = 39/381 (10%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
I+DG ++S IHY R P+ W D + K G + +ETY+ WN HE G+++F+G
Sbjct: 10 FIVDGKPIKILSGAIHYFRIVPKHWEDSLYNLKALGFNTVETYIPWNLHEPKEGEFDFQG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+V F+K L + +R PY+CAEW FGG P WL + R++ + E+++
Sbjct: 70 IKDVVSFIKKAQEMELMVIVRPSPYICAEWEFGGLPAWLLTYDNLHLRSDCPRYLEKVKN 129
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ + ++ ++ L S QGGPIIM+Q+ENE+G+ ++ K Y+K + L LG V
Sbjct: 130 YYEVLLPMLTS--LQSTQGGPIIMMQVENEFGSFSNN-----KTYLKKLKKIMLDLGVEV 182
Query: 234 PWVMC----KQTDAPENIIDAC-------------NGYYCDGYKPNSYNK-PTLWTENWD 275
P +Q ++ID N + + N K P + E WD
Sbjct: 183 PLFTSDGSWQQALESGSLIDDDVLVTANFGSHSHENLDVLEQFMANHQKKWPLMSMEFWD 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-------Y 328
GW+ WG + R +DLA V RG +N YM+ GGTNFG +G
Sbjct: 243 GWFNRWGEEIITRDAQDLANCVKELLTRGS--INLYMFHGGTNFGFMNGCSARGQKDLPQ 300
Query: 329 ITSYDYDAPIDEYGLLSEPKW---GHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHV 385
+TSYDYDA + E G ++E +K+L I+ EP + S I L N++ +
Sbjct: 301 VTSYDYDALLTEAGDITEKYQCVKKVMKELFPDIQQMEPRMREKKSYGTIPL--NRKVSL 358
Query: 386 YRANRYGSQSNCSAFLANIDE 406
+ S+ S F +++
Sbjct: 359 FETLEDISECQRSVFPQTLEQ 379
>gi|300726558|ref|ZP_07060002.1| beta-galactosidase [Prevotella bryantii B14]
gi|299776172|gb|EFI72738.1| beta-galactosidase [Prevotella bryantii B14]
Length = 781
Score = 179 bits (455), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 113/310 (36%), Positives = 157/310 (50%), Gaps = 26/310 (8%)
Query: 51 HRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYN 110
H+ +++G + +A +HYPR W I K G + I YVFWN HE G++N
Sbjct: 35 HKTFLLNGKPFTVKAAELHYPRIPRPYWEHRIKMCKALGMNAICIYVFWNIHEQKEGEFN 94
Query: 111 FKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEE 170
F G ND+ +F +L +G+Y+ +R GPYVCAEW GG P WL I+ R + F E
Sbjct: 95 FTGNNDVAEFCRLAQKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIKLRERDPYFMER 154
Query: 171 MQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL-GL 229
++ F K+ + + L +GGPIIM+Q+ENEYG SYG K YV M G
Sbjct: 155 VKIFEDKVAEQLAP--LTIQRGGPIIMVQVENEYG----SYGID-KQYVGEIRDMLRQGW 207
Query: 230 GAGVPWVMCK-QTDAPENIIDACNGYYCDGYKPNSYNK-----------PTLWTENWDGW 277
G V C ++ N +D G N N+ P + +E W GW
Sbjct: 208 GNDVKMFQCDWSSNFTHNGLDDLIWTMNFGTGANIDNQFKKLKSLRPDAPLMCSEFWSGW 267
Query: 278 YTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITSY 332
+ WG R RP +D+ + +G SF + YM GGT+FG +G P + +TSY
Sbjct: 268 FDKWGARHETRPAQDMVNNIDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFQPDVTSY 326
Query: 333 DYDAPIDEYG 342
DYDAPI+EYG
Sbjct: 327 DYDAPINEYG 336
Score = 45.1 bits (105), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 51/220 (23%), Positives = 90/220 (40%), Gaps = 35/220 (15%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQS--GYNDLILLSQTVGLQNYGAF 595
T+ ++ + D +++I+ Q G V V+ + +E + + L +L + +G N+G
Sbjct: 423 TLHLNDIHDYGQIWIDNQYIGKV--DRVRNEKSIELPAVKAGSTLTILVEAMGRINFGRA 480
Query: 596 LEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGI 655
+ KD G V +T + G ++ L +Q + + + A I
Sbjct: 481 I-KDFKGITNDVTITT-QQGKHEIQYTLKGWQSSYIDDSYETAVRALSAARKHTEKDTPI 538
Query: 656 PSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRG 715
+Y+ YF+ + LD + GKGQ +VNGH +GR W++
Sbjct: 539 MGKRGYYRGYFNLKK-VGDTFLDFETWGKGQVYVNGHAMGRIWSI--------------- 582
Query: 716 AYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P QT Y VP WL+ N +V+ + TG
Sbjct: 583 ------------GPQQTLY-VPGCWLKKGRNEVVVLDITG 609
>gi|329960238|ref|ZP_08298680.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
gi|328532911|gb|EGF59688.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
Length = 778
Score = 179 bits (455), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 124/383 (32%), Positives = 186/383 (48%), Gaps = 42/383 (10%)
Query: 11 LQCLALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHY 70
+ L ++ + +++ I + C +S+ + TF ++ ++DG ++ +A +HY
Sbjct: 1 MNLLKPCIFGVAVLITAIFMGCSTSNKSQTF------EVGNQTFLLDGKPFIIKAAEMHY 54
Query: 71 PRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLY 130
R E W I K G + I Y FWN HE G+++FKG+NDI +F +L +G+Y
Sbjct: 55 TRIPAEYWEHRIQMCKALGMNTICIYAFWNIHEQRPGEFDFKGQNDIAEFCRLAQKNGMY 114
Query: 131 LQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSW 190
+ LR GPYVC+EW GG P WL I+ RTN+ F E + F+ +I + + L +
Sbjct: 115 IMLRPGPYVCSEWEMGGLPWWLLKKKDIQLRTNDPYFLERTKLFMNEIGKQLAD--LQAP 172
Query: 191 QGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLG-AGVPWVMCK-----QTDAP 244
+GG IIM+Q+ENEYG + K+Y+ + G G VP C Q +
Sbjct: 173 RGGNIIMVQVENEYGGYAVN-----KEYIANVRDIVRGAGFTDVPLFQCDWSSTFQLNGL 227
Query: 245 ENIIDACN---GYYCDG----YKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAV 297
++++ N G D K + P + +E W GW+ WG + R E + +
Sbjct: 228 DDLLWTINFGTGANIDAQFKSLKEARPDAPLMCSEFWSGWFDHWGRKHETRDAETMVSGL 287
Query: 298 ARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHL 352
R SF + YM GGT FG G P Y +SYDYDAPI E G + PK+
Sbjct: 288 KDMLDRNISF-SLYMAHGGTTFGHWGGANCPPYSAMCSSYDYDAPISEAGWAT-PKY--- 342
Query: 353 KDLHAAIKLCEPALVAADSAQYI 375
KL E + ADSAQ I
Sbjct: 343 ------YKLREMLMQYADSAQVI 359
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 52/227 (22%), Positives = 89/227 (39%), Gaps = 46/227 (20%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLE 597
+ ID + D +V+ +G+L G + + + L +L + +G N+ +
Sbjct: 422 VLLIDEVHDWAQVYADGKLLGRLDRRRSENSLTLPALKAGTQLDILVEAMGRVNFDYAIH 481
Query: 598 KDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSI--EENEAEWTDLTRDGI 655
D G +V+L ++ + LKG Q+YS + + A D +
Sbjct: 482 -DRKGITEKVELLTEES------------RKELKG--WQVYSFPTDADFAAQKDFRKGNK 526
Query: 656 PSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRG 715
+Y+ F+ + D V LD+ + GKG WVNG IGR+W +
Sbjct: 527 AEGPAYYRASFNLKETGD-VFLDMQTWGKGMVWVNGKAIGRFWEI--------------- 570
Query: 716 AYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEIS 762
P QT Y +P WL+ N +V+ + G + EI
Sbjct: 571 ------------GPQQTLY-MPGCWLKKGKNEIVVLDLLGPDKAEIK 604
>gi|365876141|ref|ZP_09415664.1| beta-galactosidase [Elizabethkingia anophelis Ag1]
gi|442588464|ref|ZP_21007275.1| putative exported beta-galactosidase [Elizabethkingia anophelis
R26]
gi|365756153|gb|EHM98069.1| beta-galactosidase [Elizabethkingia anophelis Ag1]
gi|442561698|gb|ELR78922.1| putative exported beta-galactosidase [Elizabethkingia anophelis
R26]
Length = 628
Score = 179 bits (455), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 128/399 (32%), Positives = 194/399 (48%), Gaps = 63/399 (15%)
Query: 23 MMMMMIHLSC----VSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMW 78
++++ I +C S S STF + H +++G + S +HYPR E W
Sbjct: 7 LLVLFILFACNVLIFSQSRKSTF----EIKNGH--FLLNGKLFSIHSGEMHYPRIPQEYW 60
Query: 79 PDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPY 138
+ K G + + TYVFWN HE G++N+ G+ D+ KF+K GLY+ +R GPY
Sbjct: 61 KHRLQMMKAMGLNAVTTYVFWNYHEENPGKWNWSGEKDLKKFIKTAQEVGLYVIIRPGPY 120
Query: 139 VCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIML 198
VCAEW FGG+P WL++I G++ R +N F E Q+++ ++ + +++ + + GGP+IM+
Sbjct: 121 VCAEWEFGGYPWWLQNIKGLKIREDNNLFLAETQKYITQLYNQVKDLQITN--GGPVIMV 178
Query: 199 QIENEYGNMESSYGQQGKD-----YVKWAASMALGL---GAGVP-------WVM------ 237
Q ENE+G S+ Q KD + + A + L G VP W+
Sbjct: 179 QAENEFG----SFVAQRKDIPLASHRTYNAKIVKQLKDAGFSVPMFTSDGSWLFEGGSVV 234
Query: 238 -----CKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVED 292
D EN+ N Y N+ P + E + GW W + P
Sbjct: 235 GALPTANGEDNIENLKKIVNQY-------NNNQGPYMVAEFYPGWLAHWAEKFPRVDAGT 287
Query: 293 LAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF--------YITSYDYDAPIDEYGLL 344
+A ++ + SF NYYM GGTNFG T+G + +TSYDYDAPI E G
Sbjct: 288 VARQTDKYLKNDVSF-NYYMVHGGTNFGFTNGANYDKNHDIQPDLTSYDYDAPITEAGWR 346
Query: 345 SEPKWGHLKDL---HAAIKLCE-PALVAADSAQYIKLGQ 379
+ PK+ L+ + H KL E PA + + IKL +
Sbjct: 347 T-PKYDSLRAVISKHTKAKLPEVPAPIKVIDIKDIKLSK 384
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 49/216 (22%), Positives = 92/216 (42%), Gaps = 39/216 (18%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLE 597
T+ + +RD ++ING+ G + ++ P++ + L +L + G NYG+ +
Sbjct: 429 TLDLKGLRDYATIYINGEKVGELNRYYNHYTMPIDIPFN-STLEILVENWGRINYGSRIN 487
Query: 598 KDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPS 657
++ G VK+ GD +++ ++ +F + + +D +PS
Sbjct: 488 ENTKGIISAVKI-----GDTEITGNWEMTKLPFPDQFASTIKAKPIDTGKQAQLKD-VPS 541
Query: 658 TFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAY 717
Y+ F+ + D +D+ S GKG +VNG +IGR+W V
Sbjct: 542 L---YQGEFELTETGD-TFIDMQSWGKGVIFVNGRNIGRFWKV----------------- 580
Query: 718 NSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEE 753
P QT Y +P WL+ N ++IF++
Sbjct: 581 ----------GPQQTLY-IPGVWLKKGKNEIIIFDQ 605
>gi|376338078|gb|AFB33584.1| hypothetical protein 2_7725_01, partial [Pinus mugo]
Length = 157
Score = 179 bits (454), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 84/159 (52%), Positives = 111/159 (69%), Gaps = 4/159 (2%)
Query: 689 VNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNL 747
VNG IGRYW + +A + GC D+CDYRGAY+S KC TNCG P+Q YHVPRSW+Q++ N+
Sbjct: 1 VNGKSIGRYWPSYIASQSGCTDSCDYRGAYSSSKCLTNCGQPSQKLYHVPRSWIQSTGNV 60
Query: 748 LVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHL 807
LV+FEE GG+P +IS RS VC +VSE+H PPV W + S L +NK E+ L
Sbjct: 61 LVLFEELGGDPTQISFMARSVGTVCARVSETHLPPVGSWKS--SATSGLKVNKPKAELQL 118
Query: 808 HC-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLS 845
HC G++I SI+FAS+GTP GRC F+ G+C+ ++S
Sbjct: 119 HCPSSGHLIKSIKFASFGTPTGRCGSFTYGHCNTNSTMS 157
>gi|376338072|gb|AFB33581.1| hypothetical protein 2_7725_01, partial [Pinus mugo]
gi|376338074|gb|AFB33582.1| hypothetical protein 2_7725_01, partial [Pinus mugo]
Length = 157
Score = 179 bits (453), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 84/159 (52%), Positives = 111/159 (69%), Gaps = 4/159 (2%)
Query: 689 VNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNL 747
VNG IGRYW + +A + GC D+CDYRGAY+S KC TNCG P+Q YHVPRSW+Q++ N+
Sbjct: 1 VNGKSIGRYWPSYIASQSGCTDSCDYRGAYSSSKCLTNCGQPSQKLYHVPRSWIQSTGNV 60
Query: 748 LVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHL 807
LV+FEE GG+P +IS RS VC +VSE+H PPV W + S L +NK E+ L
Sbjct: 61 LVLFEELGGDPTQISFMARSVGTVCARVSETHLPPVGSWKS--SATSVLKVNKPKAELQL 118
Query: 808 HC-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLS 845
HC G++I SI+FAS+GTP GRC F+ G+C+ ++S
Sbjct: 119 HCPSSGHLIKSIKFASFGTPTGRCGSFTYGHCNTNSTMS 157
>gi|376338076|gb|AFB33583.1| hypothetical protein 2_7725_01, partial [Pinus mugo]
Length = 157
Score = 179 bits (453), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 84/159 (52%), Positives = 110/159 (69%), Gaps = 4/159 (2%)
Query: 689 VNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNL 747
VNG IGRYW + +A + GC D+CDYRGAY+S KC TNCG P+Q YHVPRSW+Q++ N+
Sbjct: 1 VNGKSIGRYWPSYIASQSGCTDSCDYRGAYSSSKCLTNCGQPSQKLYHVPRSWIQSTGNV 60
Query: 748 LVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHL 807
LV+FEE GG+P +IS RS VC +VSE+H PPV W + S L +NK E+ L
Sbjct: 61 LVLFEELGGDPTQISFMARSVGTVCARVSETHLPPVGSWKS--SATSGLKVNKPKAELQL 118
Query: 808 HC-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLS 845
HC G++I SI+FAS+GTP GRC F+ G+C ++S
Sbjct: 119 HCPSSGHLIKSIKFASFGTPTGRCGSFTYGHCXXXSTMS 157
>gi|383128332|gb|AFG44822.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
Length = 157
Score = 179 bits (453), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 84/159 (52%), Positives = 110/159 (69%), Gaps = 4/159 (2%)
Query: 689 VNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNL 747
VNG IGRYW + +A +GGC D+CDYRGAY+S KC TNCG P+Q YHVPRSW+Q + N+
Sbjct: 1 VNGKSIGRYWPSYIASQGGCTDSCDYRGAYSSSKCLTNCGQPSQKLYHVPRSWIQPTGNV 60
Query: 748 LVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHL 807
LV+FEE GG+P +IS RS VC +VSE+H PPV W + S L +NK E+ L
Sbjct: 61 LVLFEELGGDPTQISFMARSVGTVCARVSETHLPPVGSWKS--SATSGLKVNKPKAELQL 118
Query: 808 HC-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLS 845
HC G++I SI+F S+GTP GRC F+ G+C+ ++S
Sbjct: 119 HCPSSGHLIKSIKFVSFGTPTGRCGSFTYGHCNTNSTMS 157
>gi|429739263|ref|ZP_19273023.1| glycosyl hydrolase family 35 [Prevotella saccharolytica F0055]
gi|429157228|gb|EKX99829.1| glycosyl hydrolase family 35 [Prevotella saccharolytica F0055]
Length = 786
Score = 178 bits (452), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 113/324 (34%), Positives = 167/324 (51%), Gaps = 31/324 (9%)
Query: 52 RAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNF 111
+ +++G ++ +A +HYPR W I K G + I YVFWN HE G++NF
Sbjct: 35 KTFLLNGKPFVIKAAELHYPRIPRPYWEHRIRMCKALGMNTICLYVFWNIHEQQEGKFNF 94
Query: 112 KGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEM 171
G ND+ F +L GLY+ +R GPYVCAEW GG P WL I R + F E +
Sbjct: 95 TGNNDVAAFCRLAQKHGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLRERDPYFMERV 154
Query: 172 QRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGN--MESSYGQQGKDYVK--------- 220
+ F +++ + + L +GGPIIM+Q+ENEYG+ ++ Y Q +D V+
Sbjct: 155 KVFEQQVGNQLAP--LTIDKGGPIIMVQVENEYGSYGVDKEYVSQIRDIVRSSGFDKVAL 212
Query: 221 ----WAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDG 276
WA++ + W M T A NI + +P S P + +E W G
Sbjct: 213 FQCDWASNFEKNGLDDLIWTMNFGTGA--NIDEQFK--RLGELRPQS---PKMCSEFWSG 265
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITS 331
W+ WG R RP +++ + +G SF + YM GGT+FG +G P + +TS
Sbjct: 266 WFDKWGARHETRPAKNMVAGIDEMLTKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTS 324
Query: 332 YDYDAPIDEYGLLSEPKWGHLKDL 355
YDYDAPI+EYG L+ PK+ L+ +
Sbjct: 325 YDYDAPINEYG-LATPKYYELRAM 347
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/229 (23%), Positives = 100/229 (43%), Gaps = 42/229 (18%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLE 597
T+TI+ D ++VF++ QL G + + P+ L +L + +G N+G +
Sbjct: 421 TLTINDPHDYVQVFLDNQLIGRIDRVKNEKTLPMPAIRKGQRLSILVEAMGRINFGRAI- 479
Query: 598 KDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI-YSIEE----NEAEWTD-LT 651
KD G V L+G + ++I + + ++ + ++++ E W+ +
Sbjct: 480 KDHKGITDNVTLSGETDNLQWEARITDWKMLPIPDDYATVRWAVDALTRMKEIVWSKTIP 539
Query: 652 RDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTC 711
+D I +Y+ YF+ + L++ + GKGQ ++NG+ IGR+W +
Sbjct: 540 QDKI----GYYRGYFNLKK-VGDTFLNMEAFGKGQVYINGYAIGRFWNI----------- 583
Query: 712 DYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG--GNP 758
P QT Y VP WL+ N +++ + G GNP
Sbjct: 584 ----------------GPQQTLY-VPGCWLKKGQNEVIVLDMVGPKGNP 615
>gi|449532986|ref|XP_004173458.1| PREDICTED: beta-galactosidase-like, partial [Cucumis sativus]
Length = 213
Score = 178 bits (452), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 92/199 (46%), Positives = 116/199 (58%), Gaps = 4/199 (2%)
Query: 571 VEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGL 630
V + G N L +LS TVGL N G + AG G V L G G D+SK W+Y+VGL
Sbjct: 17 VNLKQGVNKLSMLSVTVGLPNVGLHFDTWNAGVLGPVTLKGLNEGTRDMSKYKWSYKVGL 76
Query: 631 KGEFQQIYSIE-ENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWV 689
KGE +YS++ N +W + P TWYKT F+ P G +P+ALD+ SM KGQ WV
Sbjct: 77 KGEILNLYSVKGSNSVQWMKGSFQKQP--LTWYKTTFNTPAGNEPLALDMSSMSKGQIWV 134
Query: 690 NGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLV 749
NG IGRY+ G C + C Y G + KC NCG P+Q WYH+PR WL + NLL+
Sbjct: 135 NGRSIGRYFPGYIASGKC-NKCSYTGFFTEKKCLWNCGGPSQKWYHIPRDWLSPNGNLLI 193
Query: 750 IFEETGGNPFEISVKLRST 768
I EE GGNP IS+ R+
Sbjct: 194 ILEEIGGNPQGISLVKRTV 212
>gi|228918502|ref|ZP_04081945.1| Beta-galactosidase [Bacillus thuringiensis serovar pulsiensis BGSC
4CC1]
gi|228841118|gb|EEM86317.1| Beta-galactosidase [Bacillus thuringiensis serovar pulsiensis BGSC
4CC1]
Length = 591
Score = 178 bits (452), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 123/366 (33%), Positives = 178/366 (48%), Gaps = 42/366 (11%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
K F + D ++DG +IS +HY R PE W + K G + +ETYV WN H
Sbjct: 2 KSFEIGKD---FMLDGEPIKIISGALHYFRIVPEYWDHSLYNLKALGCNTVETYVPWNMH 58
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E G +NF+G D+VK+V+L GL + LR PY+CAEW FGG P WL I R+
Sbjct: 59 EPKEGVFNFEGIADLVKYVQLAQKYGLMVILRPTPYICAEWEFGGLPAWLLKYRDIRVRS 118
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWA 222
N F +++ F K ++ L+ L GGPIIM+Q+ENEYG S+G K+YV+
Sbjct: 119 NTNLFLNKVENFYKVLLPLVTS--LQVENGGPIIMMQVENEYG----SFGND-KEYVRSI 171
Query: 223 ASMALGLGAGVPWVMC----KQTDAPENIID---ACNGYYCDG-----------YKPNSY 264
+ LG VP ++ ++ID G + K N
Sbjct: 172 KKLMRDLGVTVPLFTSDGAWQEALESGSLIDDDVLVTGNFGSRSNENLNALESFIKENKK 231
Query: 265 NKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG 324
P + E WDGW+ WG + R +LA V +R +N+YM+ GGTNFG +G
Sbjct: 232 EWPLMCMEFWDGWFNRWGMEIIRRDSSELAEEVKELLKRAS--INFYMFQGGTNFGFMNG 289
Query: 325 GPFY-------ITSYDYDAPIDEYGLLSEPKW----GHLKDLHAAIKLCEPALVAADSAQ 373
ITSYDYDA + E+G + PK+ +K++ + + EP ++ +
Sbjct: 290 CSSRENVDLPQITSYDYDALLTEWGEPT-PKYYAVQRAIKEVCSDVDQFEPRILPRANYG 348
Query: 374 YIKLGQ 379
IKL +
Sbjct: 349 EIKLSR 354
Score = 46.2 bits (108), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 51/229 (22%), Positives = 90/229 (39%), Gaps = 54/229 (23%)
Query: 545 RDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFR 604
RD + +F+N QL + + ++ N L +L + +G NYGA L
Sbjct: 408 RDRVHLFLNEQLIDTQYRDEIGREVSLDLTKEENTLDILVENMGRVNYGARL-------L 460
Query: 605 GQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKT 664
Q + G +G + + L+ ++ Y++E + + D P+T ++Y+
Sbjct: 461 SQTQRKGISSGVM--------IDIHLQSNWEH-YALEFDNLDEIDFNGQWEPNTPSFYEY 511
Query: 665 YFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTT 724
F+ + D LD +GKG +NG ++G+YW V P G
Sbjct: 512 TFNVQELKDTF-LDCSKLGKGFVVLNGFNLGKYWD-VGPTG------------------- 550
Query: 725 NCGNPTQTWYHVPRSWLQASNNLLVIFEETG---------GNPFEISVK 764
+ ++P L N L++FE G GNP + +K
Sbjct: 551 --------YLYIPAPLLIKGENKLIVFETEGNYEEELYLRGNPIYLDIK 591
>gi|383128326|gb|AFG44819.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
gi|383128328|gb|AFG44820.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
gi|383128336|gb|AFG44824.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
gi|383128338|gb|AFG44825.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
Length = 157
Score = 178 bits (451), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 84/159 (52%), Positives = 110/159 (69%), Gaps = 4/159 (2%)
Query: 689 VNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNL 747
VNG IGRYW + +A +GGC D+CDYRGAY+S KC TNCG P+Q YHVPRSW+Q + N+
Sbjct: 1 VNGKSIGRYWPSYIASQGGCTDSCDYRGAYSSSKCLTNCGQPSQKLYHVPRSWIQPTGNV 60
Query: 748 LVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHL 807
LV+FEE GG+P +IS RS VC +VSE+H PPV W + S L +NK E+ L
Sbjct: 61 LVLFEELGGDPTQISFMARSVGTVCARVSETHLPPVGSWKS--SATSGLKVNKPKAELQL 118
Query: 808 HC-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLS 845
HC G++I SI+FAS+GTP G C F+ G+C+ ++S
Sbjct: 119 HCPSSGHLIKSIKFASFGTPTGHCGSFTYGHCNTNSTMS 157
>gi|167856235|ref|ZP_02478970.1| beta-galactosidase [Haemophilus parasuis 29755]
gi|167852655|gb|EDS23934.1| beta-galactosidase [Haemophilus parasuis 29755]
Length = 596
Score = 178 bits (451), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 113/331 (34%), Positives = 166/331 (50%), Gaps = 42/331 (12%)
Query: 44 PFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHE 103
PF + + +++G ++S +HY R PE W + K G + +ETYV WN H+
Sbjct: 2 PFQIG--EKDFLLNGKPFKILSGAVHYFRIVPEYWYKTLYNLKAMGCNTVETYVPWNLHQ 59
Query: 104 SIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTN 163
Q+NF + D+VKF++ GLY+ LR PY+CAEW FGG P WL +IP I R N
Sbjct: 60 PQPDQFNFSKRADLVKFLQTAKDLGLYVILRPTPYICAEWEFGGLPAWLLNIPNIRLRQN 119
Query: 164 NAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAA 223
+ F E+ R+ ++++ + + QGG I+M+QIENEYG S+G K+Y++
Sbjct: 120 DPLFIAEIDRYFQELLPRIAPYQI--TQGGNILMMQIENEYG----SFGND-KNYLRAIL 172
Query: 224 SMALGLGAGVP-------WVMCKQTDA--PENIIDACN------------GYYCDGYKPN 262
++ L G VP W + A ++I+ N Y D +
Sbjct: 173 ALMLIHGVNVPLFTSDGAWQNALEAGALIEDDILPTGNFGSRSNENLDELQRYIDKHG-K 231
Query: 263 SYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-- 320
SY P + E WDGW+ W + R +DLA +R +N+YM+ GGTNFG
Sbjct: 232 SY--PLMCMEFWDGWFNRWKEPVIRRDAQDLADCTKELLERAS--INFYMFQGGTNFGFW 287
Query: 321 -----RTSGGPFYITSYDYDAPIDEYGLLSE 346
R +TSYDYDAP+ E+G SE
Sbjct: 288 NGCSARLDTDLPQVTSYDYDAPVHEWGEPSE 318
>gi|219870459|ref|YP_002474834.1| beta-galactosidase [Haemophilus parasuis SH0165]
gi|219690663|gb|ACL31886.1| beta-galactosidase, glucosyl hydrolase family protein [Haemophilus
parasuis SH0165]
Length = 596
Score = 178 bits (451), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 113/331 (34%), Positives = 166/331 (50%), Gaps = 42/331 (12%)
Query: 44 PFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHE 103
PF + + +++G ++S +HY R PE W + K G + +ETYV WN H+
Sbjct: 2 PFQIG--EKDFLLNGKPFKILSGAVHYFRIVPEYWYKTLYNLKAMGCNTVETYVPWNLHQ 59
Query: 104 SIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTN 163
Q+NF + D+VKF++ GLY+ LR PY+CAEW FGG P WL +IP I R N
Sbjct: 60 PQPDQFNFSKRADLVKFLQTAKDLGLYVILRPTPYICAEWEFGGLPAWLLNIPNIRLRQN 119
Query: 164 NAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAA 223
+ F E+ R+ ++++ + + QGG I+M+QIENEYG S+G K+Y++
Sbjct: 120 DPLFIAEIDRYFQELLPRIAPYQIT--QGGNILMMQIENEYG----SFGND-KNYLRAIR 172
Query: 224 SMALGLGAGVP-------WVMCKQTDA--PENIIDACN------------GYYCDGYKPN 262
++ L G VP W + A ++I+ N Y D +
Sbjct: 173 ALMLIHGVNVPLFTSDGAWQNALEAGALIEDDILPTGNFGSRSNENLDELQRYIDKHG-K 231
Query: 263 SYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-- 320
SY P + E WDGW+ W + R +DLA +R +N+YM+ GGTNFG
Sbjct: 232 SY--PLMCMEFWDGWFNRWKEPVIRRDAQDLANCTKELLERAS--INFYMFQGGTNFGFW 287
Query: 321 -----RTSGGPFYITSYDYDAPIDEYGLLSE 346
R +TSYDYDAP+ E+G SE
Sbjct: 288 NGCSARLDTDLPQVTSYDYDAPVHEWGEPSE 318
>gi|383128340|gb|AFG44826.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
Length = 157
Score = 177 bits (450), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 84/159 (52%), Positives = 110/159 (69%), Gaps = 4/159 (2%)
Query: 689 VNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNL 747
VNG IGRYW + +A +GGC D+CDYRGAY+S KC TNCG P+Q YHVPRSW+Q + N+
Sbjct: 1 VNGKSIGRYWPSYIASQGGCTDSCDYRGAYSSSKCLTNCGKPSQKLYHVPRSWIQPTGNV 60
Query: 748 LVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINKMAPEMHL 807
LV+FEE GG+P +IS RS VC +VSE+H PPV W + S L +NK E+ L
Sbjct: 61 LVLFEELGGDPTQISFMARSVGTVCARVSETHLPPVGSWKS--SATSGLKVNKPKGELQL 118
Query: 808 HC-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLS 845
HC G++I SI+FAS+GTP G C F+ G+C+ ++S
Sbjct: 119 HCPSSGHLIKSIKFASFGTPTGHCGSFTYGHCNTNSTMS 157
>gi|387791561|ref|YP_006256626.1| beta-galactosidase [Solitalea canadensis DSM 3403]
gi|379654394|gb|AFD07450.1| beta-galactosidase [Solitalea canadensis DSM 3403]
Length = 619
Score = 177 bits (449), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 122/364 (33%), Positives = 176/364 (48%), Gaps = 44/364 (12%)
Query: 50 DHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQY 109
++ A + DG + S +H+ R E W + K G + + TYVFWN HE+ G +
Sbjct: 29 ENGAFVYDGKPVQIHSGEMHFARVPQEYWRHRLKMMKAMGLNSVATYVFWNYHETAPGVW 88
Query: 110 NFK-GKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFK 168
+FK G +I +F+K+ G GL + LR GPY CAEW +GG+P +L+++ G+E R NN F
Sbjct: 89 DFKTGNKNISEFIKIAGEEGLMVILRPGPYACAEWEYGGYPWFLQNVEGLEVRRNNPKFL 148
Query: 169 EEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKD--------YVK 220
+ ++ + ++ + + +GGPIIM+Q ENE+G SY Q KD Y
Sbjct: 149 AACKEYIDHLAKEVKNQQIT--KGGPIIMVQAENEFG----SYVAQRKDIPLAEHKAYSS 202
Query: 221 WAASMALGLGAGVPWVMCK-----QTDAPENIIDACNG--------YYCDGYKPNSYNKP 267
+ L G VP + + EN + NG D Y N P
Sbjct: 203 AIKAQLLAAGFDVPLFTSDGSWLFEGGSIENCLPTANGEDNIENLKKVVDQY--NGGKGP 260
Query: 268 TLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF 327
+ E + GW W P P ED+ ++ Q SF NYYM GGTNFG TSG +
Sbjct: 261 YMVAEFYPGWLDHWAEPFPKVPTEDVVKQTEKYLQNNVSF-NYYMVHGGTNFGYTSGANY 319
Query: 328 --------YITSYDYDAPIDEYGLLSEPKWGHLKDL---HAAIKLCE-PALVAADSAQYI 375
+TSYDYDAPI E G + PK+ +++L H + K+ E P + I
Sbjct: 320 DKNHDIQPDMTSYDYDAPISEAGWAT-PKYIAIRELMKKHVSYKIPEVPQPLPVIEIPEI 378
Query: 376 KLGQ 379
KL Q
Sbjct: 379 KLTQ 382
Score = 46.6 bits (109), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 50/215 (23%), Positives = 89/215 (41%), Gaps = 48/215 (22%)
Query: 539 VTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYN-DLILLSQTVGLQNYGAFLE 597
+ ++ +RD V++NG+ + ++ E +N L + + +G NYGA +
Sbjct: 428 LELNGLRDYALVYVNGEKVAELNRYYKNY--SCEIDVPFNATLDIFVENMGRINYGAKIT 485
Query: 598 KDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPS 657
++ G V + G ++S Y++ L+ + +++ SI+ E + + G
Sbjct: 486 ENNKGIISPVVING-----TEISGNWKMYKMPLEKQ-EEVASIKAKEVKSQPVVLKG--- 536
Query: 658 TFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAY 717
TF +T LD+ + GKG +VNG+H+GRYW V
Sbjct: 537 TFNLTET--------GDTFLDMEAWGKGIVFVNGYHLGRYWNV----------------- 571
Query: 718 NSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
P QT Y +P WL+ N + I E
Sbjct: 572 ----------GPQQTLY-LPGCWLKKGANEITIVE 595
>gi|340346435|ref|ZP_08669560.1| family 35 glycosyl hydrolase [Prevotella dentalis DSM 3688]
gi|339611892|gb|EGQ16709.1| family 35 glycosyl hydrolase [Prevotella dentalis DSM 3688]
Length = 859
Score = 177 bits (448), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 117/354 (33%), Positives = 177/354 (50%), Gaps = 46/354 (12%)
Query: 32 CVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGAD 91
C ++ A+ +P + + +++G ++ +A +HYPR W I K G +
Sbjct: 81 CPATVQAAA--RPGDFTTGKGTFLLNGKPFVVKAAEVHYPRIPRPYWEQRIKMCKALGMN 138
Query: 92 VIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVW 151
+ YVFWN HE GQ++F G+ND+ F +L +G+Y+ +R GPYVCAEW GG P W
Sbjct: 139 TLCLYVFWNIHEQREGQFDFTGQNDVAAFCRLAQQNGMYVIVRPGPYVCAEWEMGGLPWW 198
Query: 152 LRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ES 209
L I R + F E ++ F +K+ + + L +GGPIIM+Q+ENEYG+ +
Sbjct: 199 LLKKKDIRLREQDPYFMERVELFEQKVAEQLAP--LTIRRGGPIIMVQVENEYGSYGEDK 256
Query: 210 SYGQQGKDYVK--WAASMALGLGAG-----------------------VPWVMCKQTDAP 244
+Y Q +D ++ W+ S G G G + W M T A
Sbjct: 257 AYVSQIRDVLRRYWSLS-PTGEGRGEAASPLMFQCDWSSNFTRNGLDDLVWTMNFGTGA- 314
Query: 245 ENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRG 304
NI D +P++ P + +E W GW+ WG R RP D+ + +G
Sbjct: 315 -NINDQFR--RLGELRPDA---PKMCSEFWSGWFDKWGARHETRPARDMVAGIDEMLSKG 368
Query: 305 GSFMNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLK 353
SF + YM GGT+FG +G P + +TSYDYDAPI+EYG + PK+ L+
Sbjct: 369 ISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYG-QATPKFWELR 420
>gi|433651261|ref|YP_007277640.1| beta-galactosidase [Prevotella dentalis DSM 3688]
gi|433301794|gb|AGB27610.1| beta-galactosidase [Prevotella dentalis DSM 3688]
Length = 797
Score = 177 bits (448), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 117/354 (33%), Positives = 177/354 (50%), Gaps = 46/354 (12%)
Query: 32 CVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGAD 91
C ++ A+ +P + + +++G ++ +A +HYPR W I K G +
Sbjct: 19 CPATVQAAA--RPGDFTTGKGTFLLNGKPFVVKAAEVHYPRIPRPYWEQRIKMCKALGMN 76
Query: 92 VIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVW 151
+ YVFWN HE GQ++F G+ND+ F +L +G+Y+ +R GPYVCAEW GG P W
Sbjct: 77 TLCLYVFWNIHEQREGQFDFTGQNDVAAFCRLAQQNGMYVIVRPGPYVCAEWEMGGLPWW 136
Query: 152 LRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ES 209
L I R + F E ++ F +K+ + + L +GGPIIM+Q+ENEYG+ +
Sbjct: 137 LLKKKDIRLREQDPYFMERVELFEQKVAEQLAP--LTIRRGGPIIMVQVENEYGSYGEDK 194
Query: 210 SYGQQGKDYVK--WAASMALGLGAG-----------------------VPWVMCKQTDAP 244
+Y Q +D ++ W+ S G G G + W M T A
Sbjct: 195 AYVSQIRDVLRRYWSLS-PTGEGRGEAASPLMFQCDWSSNFTRNGLDDLVWTMNFGTGA- 252
Query: 245 ENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRG 304
NI D +P++ P + +E W GW+ WG R RP D+ + +G
Sbjct: 253 -NINDQFR--RLGELRPDA---PKMCSEFWSGWFDKWGARHETRPARDMVAGIDEMLSKG 306
Query: 305 GSFMNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLK 353
SF + YM GGT+FG +G P + +TSYDYDAPI+EYG + PK+ L+
Sbjct: 307 ISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYGQAT-PKFWELR 358
>gi|62321383|dbj|BAD94714.1| beta-galactosidase [Arabidopsis thaliana]
Length = 199
Score = 177 bits (448), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 86/200 (43%), Positives = 123/200 (61%), Gaps = 3/200 (1%)
Query: 569 QPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQV 628
Q ++ +G N + LLS VGL N G E+ G G V L G +G D+SK W+Y++
Sbjct: 2 QKIKLHAGVNKIALLSVAVGLPNVGTHFEQWNKGALGPVTLKGVNSGTWDMSKWKWSYKI 61
Query: 629 GLKGEFQQIYS-IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQA 687
G+KGE +++ E + WT + TWYK+ F P G +P+ALD+ +MGKGQ
Sbjct: 62 GVKGEALSLHTNTESSGVRWTQGSFVAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQV 121
Query: 688 WVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNL 747
W+NG +IGR+W +G C C+Y G +++ KC +NCG +Q WYHVPRSWL+ S NL
Sbjct: 122 WINGRNIGRHWPAYKAQGSC-GRCNYAGTFDAKKCLSNCGEASQRWYHVPRSWLK-SQNL 179
Query: 748 LVIFEETGGNPFEISVKLRS 767
+V+FEE GG+P IS+ R+
Sbjct: 180 IVVFEELGGDPNGISLVKRT 199
>gi|67078211|ref|YP_245831.1| beta-galactosidase [Bacillus cereus E33L]
gi|66970517|gb|AAY60493.1| beta-galactosidase [Bacillus cereus E33L]
Length = 598
Score = 176 bits (447), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 120/365 (32%), Positives = 177/365 (48%), Gaps = 40/365 (10%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
K F + D ++DG +IS +HY R PE W + K G + +ETYV WN H
Sbjct: 2 KSFEIGKD---FMLDGEPIKIISGALHYFRIVPEYWDHSLYNLKALGCNTVETYVPWNMH 58
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E G +NF+G D+VK+V+L GL + LR PY+CAEW FGG P WL I R+
Sbjct: 59 EPKEGIFNFEGIADLVKYVQLAQKYGLMVILRPTPYICAEWEFGGLPAWLLKYKDIRVRS 118
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWA 222
N F +++ F K ++ ++ L GGPIIM+Q+ENEYG S+G K+YV+
Sbjct: 119 NTNLFLNKVENFYKVLLPMVTP--LQVENGGPIIMMQVENEYG----SFGND-KEYVRNI 171
Query: 223 ASMALGLGAGVPWVMC----KQTDAPENIID---ACNGYYCDG-----------YKPNSY 264
+ LG VP ++ ++ID G + K N
Sbjct: 172 KKLMRDLGVTVPLFTSDGAWQEALESGSLIDDDVLVTGNFGSRSNENLNELESFIKENKK 231
Query: 265 NKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG 324
P + E WDGW+ WG + R +LA V +R +N+YM+ GGTNFG +G
Sbjct: 232 EWPLMCMEFWDGWFNRWGMEIIRRDGSELAEEVKELLKRAS--INFYMFQGGTNFGFMNG 289
Query: 325 GPFY-------ITSYDYDAPIDEYGLLSEPKWG---HLKDLHAAIKLCEPALVAADSAQY 374
ITSYDYDA + E+G + + +K++ + ++ EP ++ +
Sbjct: 290 CSSRENVDLPQITSYDYDALLTEWGEPTSKYYAVQRAIKEVCSDVEQFEPRILPRANYGE 349
Query: 375 IKLGQ 379
IKL +
Sbjct: 350 IKLNR 354
Score = 43.9 bits (102), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 52/241 (21%), Positives = 96/241 (39%), Gaps = 51/241 (21%)
Query: 545 RDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFR 604
RD + +F+N QL + + ++ N L +L + +G NYGA L
Sbjct: 408 RDRVHLFLNEQLVDTQYRDEIGREVSLDLTKEENTLDILVENMGRVNYGARL-------L 460
Query: 605 GQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKT 664
+ G +G + + L+ ++ Y++E + + D P+T ++Y+
Sbjct: 461 SPTQRKGISSGVM--------IDIHLQSNWEH-YALEFDNLDEIDFNGQWEPNTPSFYEY 511
Query: 665 YFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTT 724
F+ + ++ LD +GKG +NG ++G+YW V P G
Sbjct: 512 TFNVQE-LNDTFLDCSKLGKGFVVLNGFNLGKYWD-VGPTG------------------- 550
Query: 725 NCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVR 784
+ ++P L N L++FE G +E + LR I + +Y P+
Sbjct: 551 --------YLYIPAPLLIKGENNLIVFETEGN--YEEELYLRENPIYL----DVNYSPLS 596
Query: 785 K 785
K
Sbjct: 597 K 597
>gi|443697452|gb|ELT97928.1| hypothetical protein CAPTEDRAFT_112460 [Capitella teleta]
Length = 651
Score = 176 bits (447), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 106/305 (34%), Positives = 159/305 (52%), Gaps = 23/305 (7%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+D ++S +HY R PE W D + + K G + +ETYV WN HE I G++ F G
Sbjct: 63 FFLDNKELRILSGAMHYFRIVPEYWLDRLTRMKAAGLNTVETYVPWNLHEEIHGEFVFTG 122
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
DI +FV + GL + LR GP++C+EW FGG P WL P ++ R+ PF + +
Sbjct: 123 MLDIRRFVAIAEKVGLLVILRPGPFICSEWEFGGLPSWLLRDPQMDVRSTYRPFMDAARS 182
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVKWAASMAL---- 227
+++ ++ + E+M + + GGPII +QIENEYG+ + +Y Q+ K+ + + + +
Sbjct: 183 YMRSLISEL-EDMQYQY-GGPIIAMQIENEYGSYSDDVNYMQELKNIMTDSGVIEILFTS 240
Query: 228 ----GLGAG-VPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWG 282
GL G VP V N G D KP + E W GW+ W
Sbjct: 241 DNKHGLQPGRVPGVFMTTNFKNTN----EGGRMFDKLHELQPGKPLMVMEFWSGWFDHWE 296
Query: 283 GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG---PFY--ITSYDYDAP 337
+ +E+ A AV Q+G S +N YM+ GGTNFG +G P+ +TSYDYD+P
Sbjct: 297 EKHHTMSLEEYASAVEYILQQGSS-INLYMFHGGTNFGFLNGANTEPYLPTVTSYDYDSP 355
Query: 338 IDEYG 342
+ E G
Sbjct: 356 LSEAG 360
>gi|29345700|ref|NP_809203.1| beta-galactosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|383123143|ref|ZP_09943828.1| hypothetical protein BSIG_0114 [Bacteroides sp. 1_1_6]
gi|29337593|gb|AAO75397.1| beta-galactosidase precursor [Bacteroides thetaiotaomicron
VPI-5482]
gi|251841761|gb|EES69841.1| hypothetical protein BSIG_0114 [Bacteroides sp. 1_1_6]
Length = 779
Score = 176 bits (447), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 118/357 (33%), Positives = 176/357 (49%), Gaps = 36/357 (10%)
Query: 18 VYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEM 77
+Y +++++ ++ SC S SS TF +++G ++ +A IHYPR E
Sbjct: 6 LYLLILVVAVLGSSC-SQSSEGTF------EVGKNTFLLNGEPFVVKAAEIHYPRIPKEY 58
Query: 78 WPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGP 137
W I K G + I YVFWN HE G+Y+F G+ DI F +L +G+Y+ +R GP
Sbjct: 59 WEHRIKMCKALGMNTICLYVFWNFHEPEEGRYDFAGQKDIAAFCRLAQENGMYVIVRPGP 118
Query: 138 YVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIM 197
YVCAEW GG P WL I+ R + + E ++ F+ ++ + + + +GG IIM
Sbjct: 119 YVCAEWEMGGLPWWLLKKKDIKLREQDPYYMERVKLFLNEVGKQLADLQIS--KGGNIIM 176
Query: 198 LQIENEYG--NMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPEN--------I 247
+Q+ENEYG ++ Y + +D VK A GVP C EN
Sbjct: 177 VQVENEYGAFGIDKPYISEIRDMVKQAGF------TGVPLFQCDWNSNFENNALDDLLWT 230
Query: 248 IDACNGYYCDG----YKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQR 303
I+ G D K + P + +E W GW+ WG + R E+L + R
Sbjct: 231 INFGTGANIDEQFKRLKELRPDTPLMCSEFWSGWFDHWGAKHETRSAEELVKGMKEMLDR 290
Query: 304 GGSFMNYYMYFGGTNFGRTSGGPF-----YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
SF + YM GGT+FG G F TSYDYDAPI+E G ++ PK+ +++L
Sbjct: 291 NISF-SLYMTHGGTSFGHWGGANFPNFSPTCTSYDYDAPINESGKVT-PKYLEVRNL 345
Score = 45.1 bits (105), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 53/221 (23%), Positives = 93/221 (42%), Gaps = 47/221 (21%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSV---IGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGA 594
T+ I D +VF+NG+ ++ G V + P+ + G + L +L + +G N+G
Sbjct: 419 TLLITEAHDWAQVFLNGKKLATLSRLKGEGVVKLPPL--KEG-DRLDILVEAMGRMNFGK 475
Query: 595 FLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDG 654
+ D G +V+L K ++L K Y + + F + ++ E +
Sbjct: 476 GI-YDWKGITEKVELQSDKG--VELVKDWQVYTIPVDYSFARDKQYKQQE------NAEN 526
Query: 655 IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYR 714
P+ +Y++ F+ + + L++ + KG WVNGH IGRYW +
Sbjct: 527 QPA---YYRSTFNLNE-LGDTFLNMMNWSKGMVWVNGHAIGRYWEI-------------- 568
Query: 715 GAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P QT Y VP WL+ N ++I + G
Sbjct: 569 -------------GPQQTLY-VPGCWLKKGENEIIILDMAG 595
>gi|404372285|ref|ZP_10977584.1| hypothetical protein CSBG_00400 [Clostridium sp. 7_2_43FAA]
gi|226911573|gb|EEH96774.1| hypothetical protein CSBG_00400 [Clostridium sp. 7_2_43FAA]
Length = 593
Score = 176 bits (447), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 113/337 (33%), Positives = 173/337 (51%), Gaps = 38/337 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
ID N+ ++S +HY R P W D + K G + +ETY+ WN HE G+++F+G
Sbjct: 12 IDDNKFKILSGAVHYFRIHPSQWGDTLFNLKALGFNTVETYIPWNIHEPYEGKFDFEGIK 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
DI KF+K+ GLY+ LR PY+CAEW FGG P WL I+ R+++ F E+++ +
Sbjct: 72 DIEKFIKISEKLGLYVILRPTPYICAEWEFGGLPAWLLKDKEIKLRSSDDNFIEKLRNYY 131
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP- 234
++ + + + +GGP++M+Q+ENEYG SYG + K+Y++ AS+ G VP
Sbjct: 132 NDLLPRLVKYQV--TKGGPVLMMQVENEYG----SYGNE-KEYLRIVASIMKENGVDVPL 184
Query: 235 ------WVMCKQTDA--PENIIDACN-----GYYCDGYK----PNSYNKPTLWTENWDGW 277
W+ + + ++I + N CD K N P + E WDGW
Sbjct: 185 FTSDGTWIEALECGSLIEDDIFVSGNFGSKSKENCDMLKDFILKNGKEWPIMCMEYWDGW 244
Query: 278 YTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY-------IT 330
+ WG + R DLA V + G +N YM+ GGTNFG +G +T
Sbjct: 245 FNRWGEDIIRRDSIDLAEDVKEMLKIGS--INLYMFRGGTNFGFMNGCSARGNNDLPQVT 302
Query: 331 SYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALV 367
SYDYDA + E+G S+ + +L +K P +V
Sbjct: 303 SYDYDAILTEWGNPSDKYY----ELQKVMKSLFPNIV 335
>gi|402813167|ref|ZP_10862762.1| beta-galactosidase Bga [Paenibacillus alvei DSM 29]
gi|402509110|gb|EJW19630.1| beta-galactosidase Bga [Paenibacillus alvei DSM 29]
Length = 580
Score = 176 bits (446), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 114/329 (34%), Positives = 167/329 (50%), Gaps = 44/329 (13%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
+SY+ + +++G LIS +HY R PE W D + K K G + +ETY+ WN HE
Sbjct: 4 LSYEDQHFMLEGKPIQLISGAVHYFRIVPEYWEDRLRKVKAMGCNCVETYIAWNVHEPRD 63
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
GQ+NF G D+V+F+++ L + +R PY+CAEW FGG P WL I R ++
Sbjct: 64 GQFNFDGIADVVEFIRIAQRVDLLVIVRPSPYICAEWEFGGMPAWLLK-EDIRLRCSDPR 122
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
F E++ + ++ ++ L S GGPII +QIENEYG SYG + Y++ +M
Sbjct: 123 FLEKVSAYYDALIPQLKP--LLSTSGGPIIAVQIENEYG----SYGND-QAYLQALRNML 175
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDG--------------------YKPNSYNK 266
+ G V+ +D P + D G +G Y+PN+
Sbjct: 176 VERGID---VLLFTSDGPAD--DMLQGGMTEGVLATVNFGSRPKEAFGKLEEYQPNA--- 227
Query: 267 PTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG-- 324
P + E W+GW+ W R ED A + G S +N+YM GGTNFG +SG
Sbjct: 228 PLMCMEYWNGWFDHWFEEHHTRSAEDAAQVLDEMLSMGAS-VNFYMLHGGTNFGFSSGAN 286
Query: 325 -GPFY---ITSYDYDAPIDEYGLLSEPKW 349
G Y +TSYDYD+ I E G ++ PK+
Sbjct: 287 HGGRYKPTVTSYDYDSAISEAGDIT-PKY 314
Score = 43.5 bits (101), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 60/223 (26%), Positives = 83/223 (37%), Gaps = 63/223 (28%)
Query: 539 VTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEK 598
+TI +RD VF+N +L G V+ W + G L +L + +G NYG+ L
Sbjct: 396 LTIQDVRDRAHVFLNRKLVG-VVERWDPQQLSIMIPEGGAQLDILVENMGRINYGSEL-L 453
Query: 599 DGAGFRGQVKLTG-------FKNGDIDLSKILW--TYQVGLKGEFQQIYSIEENEAEWTD 649
D G V L G +N ++D L GL+GE Q ++ E
Sbjct: 454 DRKGITHGVCLNGQFLFHWEVRNLELDTLDGLHFEATNTGLEGE-QPVF------CEAAL 506
Query: 650 LTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQD 709
L +DG TF L L KG +VNG ++GRYW V
Sbjct: 507 LIQDGPQDTF-----------------LRLDGWKKGVVFVNGFNLGRYWEV--------- 540
Query: 710 TCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
P QT Y VP L+ N +V+ E
Sbjct: 541 ------------------GPQQTLY-VPAPILRQGENHIVVLE 564
>gi|402304595|ref|ZP_10823662.1| glycosyl hydrolase family 35 [Prevotella sp. MSX73]
gi|400380871|gb|EJP33679.1| glycosyl hydrolase family 35 [Prevotella sp. MSX73]
Length = 778
Score = 176 bits (446), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 116/348 (33%), Positives = 173/348 (49%), Gaps = 33/348 (9%)
Query: 27 MIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSK 86
M L+ + AST K + + +++G ++ +A +HYPR W I K
Sbjct: 1 MALLATTMLTPASTAQKGGTFTVGDKTFLLNGKPFVVKAAELHYPRIPRPYWEHRIKMCK 60
Query: 87 EGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFG 146
G + + YVFWN HE G+++F G ND+ +F +L +GLY+ +R GPYVCAEW G
Sbjct: 61 ALGMNTVCLYVFWNIHEQQEGKFDFTGNNDVAEFCRLAQRNGLYVIVRPGPYVCAEWEMG 120
Query: 147 GFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGN 206
G P WL I R + F E ++ F +K+ + + L GGPIIM+Q+ENEYG+
Sbjct: 121 GLPWWLLKKKDIRLREPDPYFMERVKLFERKVGEQLAS--LTIQNGGPIIMVQVENEYGS 178
Query: 207 --MESSYGQQGKDYVK-------------WAASMALGLGAGVPWVMCKQTDAPENIIDAC 251
+Y +D V+ WA++ + W M T A D
Sbjct: 179 YGKNKAYVSAIRDIVRRSGFDKVTLFQCDWASNFEKNGLDDLVWTMNFGTGA-----DID 233
Query: 252 NGYYCDG-YKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNY 310
+ G +PN+ P + +E W GW+ WG R RP + + + +G SF +
Sbjct: 234 QQFRRLGELRPNA---PQMCSEFWSGWFDKWGARHETRPAKAMVEGIDEMLSKGISF-SL 289
Query: 311 YMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLK 353
YM GGT+FG +G P + +TSYDYDAPI+EYG + PK+ L+
Sbjct: 290 YMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYGQAT-PKYWELR 336
Score = 44.3 bits (103), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 52/232 (22%), Positives = 94/232 (40%), Gaps = 50/232 (21%)
Query: 535 VRPTVTIDSMRDVLRVFINGQLTGS---VIGHWVKVVQPVEFQSGYNDLILLSQTVGLQN 591
V +T++ D +VF++G+ G V ++ PVE + +L + + +G N
Sbjct: 409 VESMLTLNEPHDFAQVFVDGKYIGKIDRVKNEKTLMLPPVEKGA---ELCIRIEAMGRIN 465
Query: 592 YGAFLEKDGAGFRGQVKLTGFKNG-DIDLSKILWT-------YQVGLKGEFQQIYSIEEN 643
+G + KD G +V ++ +G + + WT Y+ +K + +
Sbjct: 466 FGRAI-KDYKGITKEVTISTEMDGHEASWNLKNWTIVPIPDNYETAVKALSVGTETSKRT 524
Query: 644 EAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAP 703
LT+ G +Y+ +F D L++ + GKGQ +VNGH IGR+W +
Sbjct: 525 RQHANLLTKAG------YYRGHFTLRKPGD-TFLNMEAFGKGQVYVNGHAIGRFWNI--- 574
Query: 704 KGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P QT Y +P WL+ N +++ + G
Sbjct: 575 ------------------------GPQQTLY-LPGCWLKQGRNEVIVLDVVG 601
>gi|255691973|ref|ZP_05415648.1| glycosyl hydrolase [Bacteroides finegoldii DSM 17565]
gi|260622382|gb|EEX45253.1| glycosyl hydrolase family 35 [Bacteroides finegoldii DSM 17565]
Length = 782
Score = 176 bits (446), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 119/347 (34%), Positives = 171/347 (49%), Gaps = 42/347 (12%)
Query: 31 SCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGA 90
SC S SS TF + +++GN ++ +A IHYPR E W I K G
Sbjct: 19 SC-SQSSKETF------EIGDKTFLLNGNPFVVKAAEIHYPRIPKEYWEHRIKMCKALGM 71
Query: 91 DVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPV 150
+ I YVFWN HE G+Y+F G+ DI F +L +G+Y+ +R GPYVCAEW GG P
Sbjct: 72 NTICLYVFWNFHEPEEGKYDFTGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPW 131
Query: 151 WLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGN--ME 208
WL I+ R + + E ++ F+ ++ + + + +GG IIM+Q+ENEYG+ ++
Sbjct: 132 WLLKKKDIKLREQDPYYMERVKLFMNEVGKQLTDLQIS--KGGNIIMVQVENEYGSFGID 189
Query: 209 SSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPEN--------IIDACNGYYCDG-- 258
Y + +D VK A GVP C EN I+ G D
Sbjct: 190 KPYIAEIRDIVKQAGF------TGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDDQF 243
Query: 259 -----YKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMY 313
+P+ P + +E W GW+ WG + R EDL + R SF + YM
Sbjct: 244 KRLQELRPDI---PLMCSEFWSGWFDHWGAKHETRSAEDLVKGMKEMLDRNISF-SLYMT 299
Query: 314 FGGTNFGRTSGGPF-----YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
GGT+FG G F TSYDYDAPI+E G ++ PK+ +++L
Sbjct: 300 HGGTSFGHWGGANFPNFSPTCTSYDYDAPINESGKVT-PKYFEVRNL 345
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 54/223 (24%), Positives = 90/223 (40%), Gaps = 50/223 (22%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSV---IGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGA 594
T+TI D +VF++G+ ++ G ++ P++ + L +L + +G N+G
Sbjct: 419 TLTITEAHDWAQVFLDGRKLATLSRLKGEGTVILPPMKEGA---QLDILVEAMGRMNFGK 475
Query: 595 FLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQ--IYSIEENEAEWTDLTR 652
+ D G +V++ NG I K Y + + F Q + ++N ++ R
Sbjct: 476 GI-YDWKGITEKVEVQS-NNGVITSLKNWKVYNIPVDYAFAQNKKFVKQDNPQKYPAYYR 533
Query: 653 DGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCD 712
TFT KT L++ + KG WVNG+ IGRYW +
Sbjct: 534 ----GTFTLDKT--------GDTFLNMTTWSKGMVWVNGYAIGRYWEI------------ 569
Query: 713 YRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P QT Y VP WL+ N ++I + G
Sbjct: 570 ---------------GPQQTLY-VPGCWLKKGENEVIILDMAG 596
>gi|380694789|ref|ZP_09859648.1| beta-galactosidase [Bacteroides faecis MAJ27]
Length = 781
Score = 176 bits (446), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 117/344 (34%), Positives = 169/344 (49%), Gaps = 36/344 (10%)
Query: 31 SCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGA 90
SC SS + F V + +++G ++ +A IHYPR E W I SK G
Sbjct: 19 SCTQSSKGT-----FEVG--DKTFLLNGEPFVVKAAEIHYPRIPKEYWEHRIKMSKALGM 71
Query: 91 DVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPV 150
+ I YVFWN HE G+Y+F G+ DI F ++ +G+Y+ +R GPYVCAEW GG P
Sbjct: 72 NTICLYVFWNFHEPEEGKYDFTGQKDIAAFCRMAQENGMYVIVRPGPYVCAEWEMGGLPW 131
Query: 151 WLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGN--ME 208
WL I+ R + + E ++ F+ ++ + + + +GG IIM+Q+ENEYG+ ++
Sbjct: 132 WLLKKEDIKLREQDPYYMERVKLFMNEVGKQLADLQIS--KGGNIIMVQVENEYGSFGID 189
Query: 209 SSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPEN--------IIDACNGYYCDG-- 258
Y +D VK A GVP C EN ++ G D
Sbjct: 190 KPYIAAIRDMVKQAGF------TGVPLFQCDWNSNFENNALDDLLWTVNFGTGANIDQQF 243
Query: 259 --YKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGG 316
K N P + +E W GW+ WG + R E+L + R SF + YM GG
Sbjct: 244 ERLKELRPNTPLMCSEFWSGWFDHWGAKHETRSAEELVKGMKEMLDRNISF-SLYMTHGG 302
Query: 317 TNFGRTSGGPF-----YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
T+FG G F TSYDYDAPI+E G ++ PK+ ++DL
Sbjct: 303 TSFGHWGGANFPNFSPTCTSYDYDAPINESGKVT-PKFLEVRDL 345
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/221 (24%), Positives = 90/221 (40%), Gaps = 46/221 (20%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSV---IGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGA 594
T+ I D +VF+NG+ ++ G ++ P++ +S L +L + +G N+G
Sbjct: 419 TLIITEAHDWAQVFLNGKKLATLSRLKGEGTVILPPMKEES---RLDILVEAMGRMNFGK 475
Query: 595 FLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDG 654
+ D G +V+L +G+I K Y + + F Q E+ RD
Sbjct: 476 GI-YDWKGITEKVELQS-NDGNITSLKDWQVYNIPVDYSFAQNKKYEK---------RDN 524
Query: 655 IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYR 714
+Y+ F D + L++ + KG W+NGH +GRYW +
Sbjct: 525 TEKYPAYYRGTFTL-DKVGDTFLNMMNWSKGMVWINGHAVGRYWEI-------------- 569
Query: 715 GAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P QT Y VP WL+ +N +VI + G
Sbjct: 570 -------------GPQQTLY-VPGCWLKEGDNEVVILDMAG 596
>gi|374606374|ref|ZP_09679251.1| beta-galactosidase [Paenibacillus dendritiformis C454]
gi|374388019|gb|EHQ59464.1| beta-galactosidase [Paenibacillus dendritiformis C454]
Length = 583
Score = 176 bits (446), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 118/336 (35%), Positives = 166/336 (49%), Gaps = 44/336 (13%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+SYD + LIS IHY R P W D + K K G + IETYV WN HE
Sbjct: 3 TLSYDQGQFTMGDRPIQLISGAIHYFRVVPAYWEDRLRKIKAMGCNCIETYVAWNLHEPR 62
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+++F+G +D+ +FV+L G GLY+ +R PY+CAEW FGG P WL + R N+
Sbjct: 63 EGEFHFEGMSDVAEFVRLAGELGLYVIVRPSPYICAEWEFGGLPAWLLK-DDMRLRCNDP 121
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
F E++ + ++ + L + +GGPII +QIENEYG SY G D A
Sbjct: 122 RFLEKVAAYYDALLPQLTP--LLATKGGPIIAVQIENEYG----SY---GNDQAYLQAQR 172
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDG--------------------YKPNSYN 265
A+ + GV V+ +D P++ D G +G Y+P+
Sbjct: 173 AMLIERGVD-VLLFTSDGPQD--DMLQGGMAEGVLATVNFGSRPKEAFDKLKEYQPDG-- 227
Query: 266 KPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG 325
P + E W+GW+ W + R ED A + G S +N+YM GGTNFG SG
Sbjct: 228 -PLMCMEYWNGWFDHWFEQHHTRDAEDAARVLDDMLGMGAS-VNFYMVHGGTNFGFGSGA 285
Query: 326 PF------YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+TSYDYDA I E G L+ PK+ +++
Sbjct: 286 NHSDKYEPTVTSYDYDAAISEAGDLT-PKYHAFREV 320
Score = 40.4 bits (93), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 55/215 (25%), Positives = 82/215 (38%), Gaps = 46/215 (21%)
Query: 539 VTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEK 598
+TI +RD VF++ +L G V+ W PV G L +L + +G NYG L
Sbjct: 396 LTIQDVRDRALVFLDRKLVG-VVERWNPQSIPVTIPEGGAQLDILIENMGRVNYGPQL-Y 453
Query: 599 DGAGFRGQVKLTGFKNGDIDLSKILWTYQV-GLKGEFQQIYSIEENEAEWTDLTRDGIPS 657
D G V+L G + L+ +QV L+ E S + A + + G
Sbjct: 454 DRKGITHGVRLNG---------QFLFHWQVRSLELETLAGLSFDAAAAAAWEEEQPG--- 501
Query: 658 TFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAY 717
+Y+ D L L KG ++NG ++GRYW V
Sbjct: 502 ---FYEAKLVIEDEPKDTFLRLDGWKKGVVFMNGFNLGRYWEV----------------- 541
Query: 718 NSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
P Q Y VP L+ N +++FE
Sbjct: 542 ----------GPQQALY-VPAPVLRQGENEIIVFE 565
>gi|319893645|ref|YP_004150520.1| beta-galactosidase 3 [Staphylococcus pseudintermedius HKU10-03]
gi|386318129|ref|YP_006014292.1| glycosyl hydrolase [Staphylococcus pseudintermedius ED99]
gi|317163341|gb|ADV06884.1| Beta-galactosidase 3 [Staphylococcus pseudintermedius HKU10-03]
gi|323463300|gb|ADX75453.1| glycosyl hydrolase, family 35 [Staphylococcus pseudintermedius
ED99]
Length = 590
Score = 176 bits (446), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 116/354 (32%), Positives = 176/354 (49%), Gaps = 37/354 (10%)
Query: 53 AIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFK 112
++D ++S IHY R + W D + K G + +ETYV WN HE+I +Y+FK
Sbjct: 9 TFLLDDKPIKILSGAIHYFRIPKDDWEDSLYNLKALGFNTVETYVPWNFHETIENEYDFK 68
Query: 113 GKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQ 172
G D+ F++L GLY+ +R PY+CAEW FGGFP WL + + R+ + + E+++
Sbjct: 69 GHKDLKHFIELAAKLGLYVIVRPSPYICAEWEFGGFPAWLLNDRTMRIRSRDEKYLEKVK 128
Query: 173 RFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAG 232
++ ++ ++ + QGGPIIM+Q+ENEYG S+GQ DY++ A M G
Sbjct: 129 KYYHELFKILTPLQID--QGGPIIMMQVENEYG----SFGQD-HDYLRSLAHMMREEGVT 181
Query: 233 VP-------WVMCKQTDA--PENIIDACN-------GYYCDGYKPNSYNK--PTLWTENW 274
VP W C + + ++I+ N + ++K P + E W
Sbjct: 182 VPFFTSDGAWDQCLRAGSLIEDDILPTGNFGSRTVQNFENLKTFQQEFSKKWPLMCMEFW 241
Query: 275 DGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSGGPF 327
DGW+ WG + R +DLA V + G +N YM+ GGTNFG R +
Sbjct: 242 DGWFNRWGEPVIKRDSDDLAEEVRDAVKLGS--LNLYMFHGGTNFGFWNGCSARGTKDLP 299
Query: 328 YITSYDYDAPIDEYGLLSEPKWG---HLKDLHAAIKLCEPALVAADSAQYIKLG 378
+TSYDY AP+DE G +E + LK+ I+ EP S + I L
Sbjct: 300 QVTSYDYHAPLDEAGNPTEKYFALQEMLKEEMPDIEQHEPRTKTFMSMKAIPLA 353
Score = 40.8 bits (94), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 26/104 (25%), Positives = 45/104 (43%), Gaps = 29/104 (27%)
Query: 660 TWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNS 719
+YK FD + + +D+ GKG VNG +IGRYW +
Sbjct: 507 AFYKYTFDLAES-NNTHIDVSGFGKGVVLVNGFNIGRYWEI------------------- 546
Query: 720 DKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
P+Q+ Y +P+++L+ N +++F+ G P I +
Sbjct: 547 --------GPSQSLY-IPKAFLKQGQNEIIVFDSEGKYPESIQL 581
>gi|225407896|ref|ZP_03761085.1| hypothetical protein CLOSTASPAR_05117 [Clostridium asparagiforme
DSM 15981]
gi|225042575|gb|EEG52821.1| hypothetical protein CLOSTASPAR_05117 [Clostridium asparagiforme
DSM 15981]
Length = 590
Score = 176 bits (445), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 115/347 (33%), Positives = 177/347 (51%), Gaps = 39/347 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG L+S +HY R PE W D + K G + +ETY+ WN HE G+++F G
Sbjct: 12 LDGRPVKLLSGAVHYFRLMPEYWEDCLYNLKAMGFNTVETYIPWNIHEPEEGEFDFSGSR 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ FV+L GS GL++ LR P++CAEW GG P WL P ++ RTN F +++ +
Sbjct: 72 DVEAFVRLAGSMGLHVILRPSPFICAEWEMGGLPAWLLRYPDMKVRTNTPLFLVKVEAYY 131
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
+++ + + L +GGP+I++Q+ENEYG S+G K+Y++ S+ GA VP+
Sbjct: 132 RELFRHIAD--LQITRGGPVILMQVENEYG----SFGND-KEYLRRIKSLMERFGAEVPF 184
Query: 236 VMCKQT-DAP--------ENIIDACN-GYYCDG--------YKPNSYNKPTLWTENWDGW 277
+ DA + ++ N G D +K + P + E WDGW
Sbjct: 185 FTSDGSWDAALEAGSLIEDGVLATANFGSRSDENLDVLEAFFKRHGRKWPLMCMEFWDGW 244
Query: 278 YTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY-------IT 330
+ W ++ R EDLA V + +R +N YM+ GGTNFG +G IT
Sbjct: 245 FNRWREKIITRDAEDLAMEVRQLLERAS--INLYMFQGGTNFGFYNGCSARGYTDLPQIT 302
Query: 331 SYDYDAPIDEYGLLSEPKWG---HLKDLHAAIKLCEPALVAADSAQY 374
SY+YDA + E+G +E + +++L I EP A + A Y
Sbjct: 303 SYNYDAILTEWGQPTEKFYQVREVIRELFPEIPTGEPR--AHERAAY 347
>gi|294633111|ref|ZP_06711670.1| beta-galactosidase [Streptomyces sp. e14]
gi|292830892|gb|EFF89242.1| beta-galactosidase [Streptomyces sp. e14]
Length = 606
Score = 175 bits (444), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 111/327 (33%), Positives = 160/327 (48%), Gaps = 33/327 (10%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
++ G ++S +HY R P W D +A+ G + ++TYV WN HE G F G
Sbjct: 24 LLRAGRPHRILSGSLHYFRVHPGQWADRLARLAALGLNTVDTYVPWNFHERTPGDVRFDG 83
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +FV+L +GL + +R GPY+CAEW+ GG P WL PG+ RT++ PF + R
Sbjct: 84 WRDLDRFVRLAQETGLDVIVRPGPYICAEWDNGGLPAWLTGTPGMRPRTSHPPFLAAVAR 143
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ R L + +GGP++ +QIENEYG SYG G DYV+W G
Sbjct: 144 WFDQLIP--RIAALQAGRGGPVVAVQIENEYG----SYGDDG-DYVRWVRDALTARGVT- 195
Query: 234 PWVMCKQTDAPENII---DACNGYYCD---GYKPNSYNK---------PTLWTENWDGWY 278
+ D P ++ A G G +P + P E W+GW+
Sbjct: 196 --ELLYTADGPTELMLDAGAVEGELAAATFGSRPEQAARLLRSRRPEEPFFCAEFWNGWF 253
Query: 279 TTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-------YITS 331
WG + RP A V R GGS ++ YM GGTNFG +G +TS
Sbjct: 254 DHWGEQHHVRPARSAADDVGRILGAGGS-LSLYMAHGGTNFGLWAGANHDGDRLQPTVTS 312
Query: 332 YDYDAPIDEYGLLSEPKWGHLKDLHAA 358
YD DAP+ E+G L+E + +L AA
Sbjct: 313 YDSDAPVAEHGALTEKFFALRDELTAA 339
>gi|224542300|ref|ZP_03682839.1| hypothetical protein CATMIT_01478 [Catenibacterium mitsuokai DSM
15897]
gi|224524842|gb|EEF93947.1| glycosyl hydrolase family 35 [Catenibacterium mitsuokai DSM 15897]
Length = 577
Score = 175 bits (444), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 105/322 (32%), Positives = 155/322 (48%), Gaps = 44/322 (13%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
IIDG + +IS +HY R PE W D + K+ G + +ETY+ WN HE +G+++F G+
Sbjct: 11 IIDGQKTKIISGAVHYFRIVPEYWEDTLLDLKDMGCNAVETYIPWNLHEPYKGKFDFDGQ 70
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
D+ F++L GLY+ +R PY+C+EW GG P WL I RTN++ + + ++ +
Sbjct: 71 KDVCAFLELAKKLGLYVIIRPSPYICSEWELGGLPAWLLKDSDIRLRTNDSVYMKHLEEY 130
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
++ ++ + + + G II+ Q+ENEYG SY Q KDY+K M G VP
Sbjct: 131 YAVLLPMIAKYQIN--REGTIILAQLENEYG----SYNQD-KDYLKALLKMMREYGIEVP 183
Query: 235 WVMCKQT-----------------------DAPENIIDACNGYYCDGYKPNSYNKPTLWT 271
T +A ENI + K + P +
Sbjct: 184 IFTADGTWEEALEAGSLFEEDVFPTGNFGSNAKENI-----AVLKEFMKKHQIVAPIMCM 238
Query: 272 ENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSG 324
E WDGW+ W + R E+L + G +N+YM+ GGTNFG R
Sbjct: 239 EFWDGWFNRWNMEIVKRDPEELVQSAKEMIDLGS--INFYMFHGGTNFGWMNGCSARKEH 296
Query: 325 GPFYITSYDYDAPIDEYGLLSE 346
ITSYDYDA + EYG +E
Sbjct: 297 DLPQITSYDYDAILTEYGAKTE 318
Score = 39.7 bits (91), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 42/167 (25%), Positives = 75/167 (44%), Gaps = 28/167 (16%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYN---DLILLSQTVGLQNYGA 594
T+ I RD +++++ + + + ++ PV + + L +L + +G NYG+
Sbjct: 397 TMRIIDARDRAQIYLDKEYVAT--QYQEEIGDPVTLHTKKDCKHSLGILLENMGRVNYGS 454
Query: 595 FLEKDGAGFRGQVKLTGFKNG---DIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLT 651
L+ D + G +NG DI +K Y + +F ++ N+ EW +
Sbjct: 455 KLQAD-------TQRKGIRNGVMLDIHFTKNWKQYCI----DFTKV-----NQIEWDNAN 498
Query: 652 RDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
G TF Y FD D +DL + GKG VNG ++GR++
Sbjct: 499 IGG--PTFNEY--VFDIQDTPKETFIDLSAFGKGIVIVNGFNLGRFY 541
>gi|399022099|ref|ZP_10724178.1| beta-galactosidase [Chryseobacterium sp. CF314]
gi|398085466|gb|EJL76124.1| beta-galactosidase [Chryseobacterium sp. CF314]
Length = 618
Score = 175 bits (444), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 111/328 (33%), Positives = 165/328 (50%), Gaps = 27/328 (8%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
++ G + S +HYPR E W + K G + + TYVFWN HE G++NF G+
Sbjct: 35 LLSGKPFTIYSGEMHYPRVPSEYWKHRLQMMKSMGLNTVTTYVFWNYHEEEPGKWNFSGE 94
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
D+ KF+K +GLY+ +R GPYVCAEW FGG+P WL+ +E RT+N F ++ + +
Sbjct: 95 KDLKKFIKTAQEAGLYVIIRPGPYVCAEWEFGGYPWWLQKDKNLEIRTDNKAFLKQCENY 154
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYG----QQGKDYVKWAASMALGLG 230
+ ++ + + + GGP+IM+Q ENE+G+ + +Q K Y + G
Sbjct: 155 INELAKQIIPLQINN--GGPVIMVQAENEFGSYVAQRKDISLEQHKKYSHKIKDFLVKSG 212
Query: 231 AGVPWVMCK-----QTDAPENIIDACNGY-YCDGY--KPNSYNK---PTLWTENWDGWYT 279
VP+ + + E + NG D K N +N P + E + GW
Sbjct: 213 ITVPFFTSDGSWLFKEGSIEGALPTANGEGDVDNLRKKINEFNNGKGPYMVAEYYPGWLD 272
Query: 280 TWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF--------YITS 331
W ED+ + + G SF NYYM GGTNFG TSG + +TS
Sbjct: 273 HWAEPFVKVSTEDVVKQTELYIKNGISF-NYYMIHGGTNFGFTSGANYDKNHDIQPDLTS 331
Query: 332 YDYDAPIDEYGLLSEPKWGHLKDLHAAI 359
YDYDAPI+E G ++ PK+ L+D+ I
Sbjct: 332 YDYDAPINEAGWVT-PKFNALRDIFQKI 358
>gi|256831356|ref|YP_003160083.1| beta-galactosidase [Jonesia denitrificans DSM 20603]
gi|256684887|gb|ACV07780.1| Beta-galactosidase [Jonesia denitrificans DSM 20603]
Length = 584
Score = 175 bits (444), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 107/318 (33%), Positives = 158/318 (49%), Gaps = 28/318 (8%)
Query: 52 RAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNF 111
R +DG +IS IHY R P+ W D I K++ G + IETYV WN H R +++
Sbjct: 9 RDFTLDGEPFQIISGAIHYFRVHPDSWRDRIRKARLMGLNTIETYVAWNFHAPSRDEFHT 68
Query: 112 KGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEM 171
G D+ +F+ ++ GL +R GPY+CAEW+ GG P WL P I R+++ + E+
Sbjct: 69 DGARDLGRFLDIIQEEGLRAIVRPGPYICAEWDNGGLPTWLTATPDIVVRSSDPTYLTEV 128
Query: 172 QRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGA 231
+R+++ + ++ + GGPII++Q+ENEYG +YG + Y+ ++ LG
Sbjct: 129 ERYLEHLAPIVEPRQIN--HGGPIILMQVENEYG----AYGND-RAYLTHLTNVYRNLGF 181
Query: 232 GVPWV--------MCKQTDAPENIIDACNGYYCD----GYKPNSYNKPTLWTENWDGWYT 279
VP M P+ G D + + P + +E W GW+
Sbjct: 182 VVPLTTVDQPMDDMLAHGTLPDLHTTGSFGSRIDERLATLREHQTTGPLMCSEFWIGWFD 241
Query: 280 TWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG-------PFYITSY 332
WG V D A A+ R G S +N YM+ GGTNFG T+G P +TSY
Sbjct: 242 HWGAHHHTTDVADAANALDRLLGAGAS-VNIYMFHGGTNFGFTNGANDKGVYQPL-VTSY 299
Query: 333 DYDAPIDEYGLLSEPKWG 350
DYDAP+ E G +E W
Sbjct: 300 DYDAPLAEDGYPTEKYWA 317
>gi|336319932|ref|YP_004599900.1| Beta-galactosidase [[Cellvibrio] gilvus ATCC 13127]
gi|336103513|gb|AEI11332.1| Beta-galactosidase [[Cellvibrio] gilvus ATCC 13127]
Length = 586
Score = 175 bits (444), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 106/306 (34%), Positives = 157/306 (51%), Gaps = 26/306 (8%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
++DG ++S +HY R P++W D I K++ G + IETYV WNAH RG ++ G
Sbjct: 12 LLDGEPLQILSGALHYFRVHPDLWADRIRKARLMGLNTIETYVAWNAHAPERGVFDLTGN 71
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
D+ +F+ LV + GL+ +R GPY+CAEW+ GG P WL PG+ RT + E + +
Sbjct: 72 LDLGRFLDLVAAEGLHAIVRPGPYICAEWDNGGLPAWLMATPGVGVRTAEPQYLEAIAGY 131
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
+I+ ++ + +GGP++M+Q+ENEYG +YG DY++ +M G VP
Sbjct: 132 YDEILAVVAPRQVT--RGGPVLMVQVENEYG----AYGDD-ADYLRALVTMMRERGIEVP 184
Query: 235 WVMCKQTD--------APENIIDACNGY----YCDGYKPNSYNKPTLWTENWDGWYTTWG 282
C Q + PE A G + + + P + E WDGW+ +WG
Sbjct: 185 LTTCDQANDEMLGRGGLPELHKTATFGSRSPERLETLRRHQPTGPLMCMEYWDGWFDSWG 244
Query: 283 GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG----GPFY--ITSYDYDA 336
+ H A A G+ N YM+ GGTN G T+G G + TSYDYDA
Sbjct: 245 EQH-HTTDAAEAAADLDLLLSQGASANLYMFHGGTNLGFTNGANDKGTYLPITTSYDYDA 303
Query: 337 PIDEYG 342
P+ E G
Sbjct: 304 PLAEDG 309
>gi|228950355|ref|ZP_04112522.1| Beta-galactosidase [Bacillus thuringiensis serovar monterrey BGSC
4AJ1]
gi|228809313|gb|EEM55767.1| Beta-galactosidase [Bacillus thuringiensis serovar monterrey BGSC
4AJ1]
Length = 591
Score = 175 bits (443), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 121/366 (33%), Positives = 179/366 (48%), Gaps = 42/366 (11%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
K F + D ++DG +IS +HY R PE W + K G + +ETYV WN H
Sbjct: 2 KSFEIGKD---FMLDGEPIKIISGALHYFRIVPEYWDHSLYNLKALGCNTVETYVPWNIH 58
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E G +NF+G D+VK+V+L GL + LR PY+CAEW FGG P WL I R+
Sbjct: 59 EPKEGVFNFEGIADLVKYVQLAQKYGLMVILRPTPYICAEWEFGGLPAWLLKYKDIRVRS 118
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWA 222
N F ++++ F K ++ ++ L GGPIIM+Q+ENEYG S+G K+YV+
Sbjct: 119 NTNLFLDKVENFYKVLLPMVTP--LQVENGGPIIMMQVENEYG----SFGND-KEYVRSI 171
Query: 223 ASMALGLGAGVPWVMC----KQTDAPENIID---ACNGYYCDG-----------YKPNSY 264
+ L VP ++ ++ID G + K N
Sbjct: 172 KKIMRDLDVTVPLFTSDGAWQEALESGSLIDDDVLVTGNFGSRSNENLNELESFIKENKK 231
Query: 265 NKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG 324
P + E WDGW+ WG + R +LA V +R +N+YM+ GGTNFG +G
Sbjct: 232 EWPLMCMEFWDGWFNRWGMEIIRRDGSELAEEVKELLKRAS--INFYMFQGGTNFGFMNG 289
Query: 325 GPFY-------ITSYDYDAPIDEYGLLSEPKWGH----LKDLHAAIKLCEPALVAADSAQ 373
ITSYDYDA + E+G + PK+ +K++ + ++ EP ++ +
Sbjct: 290 CSSRENVDLPQITSYDYDALLTEWGEPT-PKYYAVQRVIKEVCSDVEQFEPRILPRANYG 348
Query: 374 YIKLGQ 379
IKL +
Sbjct: 349 EIKLNR 354
Score = 43.9 bits (102), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 50/229 (21%), Positives = 91/229 (39%), Gaps = 47/229 (20%)
Query: 545 RDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFR 604
RD + +F+N QL + + ++ N L +L + +G NYGA L
Sbjct: 408 RDRVHMFLNEQLIDTQYRDEIGREVSLDLTKEENTLDILVENMGRVNYGARL-------L 460
Query: 605 GQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKT 664
+ G +G + + L+ ++ Y++E + + D P+T ++Y+
Sbjct: 461 SPTQRKGISSGVM--------IDIHLQSNWEH-YALEFDNLDEIDFNGQWEPNTPSFYEY 511
Query: 665 YFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTT 724
F+ + ++ LD +GKG +NG ++G+YW V P G
Sbjct: 512 TFNVQE-LNDTFLDCSKLGKGFVVLNGFNLGKYWD-VGPTG------------------- 550
Query: 725 NCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCE 773
+ ++P L N L++FE G +E + LR I E
Sbjct: 551 --------YLYIPAPLLIKGENNLIVFETEGN--YEEELYLRENPIYLE 589
>gi|393780989|ref|ZP_10369190.1| hypothetical protein HMPREF1071_00058 [Bacteroides salyersiae
CL02T12C01]
gi|392677324|gb|EIY70741.1| hypothetical protein HMPREF1071_00058 [Bacteroides salyersiae
CL02T12C01]
Length = 776
Score = 175 bits (443), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 120/351 (34%), Positives = 173/351 (49%), Gaps = 29/351 (8%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
++ +++ +C++ + FK F V + +++G ++ +A +HY R W I
Sbjct: 5 IIYLLLFCTCLALPGQAQQFKTFEVG--KKTFLLNGEPFIVKAAELHYTRIPQPYWEHRI 62
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K G + I YVFWN HE GQ++F G+NDI F +L G+Y+ +R GPYVCAE
Sbjct: 63 KMCKALGMNTICLYVFWNIHEQEEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAE 122
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
W GG P WL I RT + + E + F+KK+ + + + +GG IIM+Q+EN
Sbjct: 123 WEMGGLPWWLLKKKDIALRTLDPYYMERVGIFMKKVGEQLVPLQIT--RGGNIIMVQVEN 180
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLG-AGVPWVMCK-----QTDAPENIIDACN---G 253
EYG SYG K YV M G G VP C +A ++++ N G
Sbjct: 181 EYG----SYGTD-KPYVSAIRDMVRGAGFTEVPLFQCDWSSNFTNNALDDLLWTVNFGTG 235
Query: 254 YYCD----GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMN 309
D K P + +E W GW+ WG + RP +D+ + R SF +
Sbjct: 236 ANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGLKDMLDRNISF-S 294
Query: 310 YYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
YM GGT FG G P Y +SYDYDAPI E G +E K+ L+DL
Sbjct: 295 LYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KYFLLRDL 344
Score = 40.4 bits (93), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 33/144 (22%), Positives = 57/144 (39%), Gaps = 34/144 (23%)
Query: 660 TWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNS 719
+YK F D LD+ + GKG WVNGH +GR+W +
Sbjct: 529 AYYKATFKL-SKTDDTFLDMSTWGKGMVWVNGHAMGRFWEI------------------- 568
Query: 720 DKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEIS-VKLRSTRIVCEQVSES 778
P QT + +P WL+ N +++ + G + +K ++ E+ E+
Sbjct: 569 --------GPQQTLF-MPGCWLKKGVNEIIVLDLKGPEKAMVKGLKKPILDVLREKAPET 619
Query: 779 HYPPVRKWSNSYSVDGKLSINKMA 802
H RK ++ + ++K A
Sbjct: 620 H----RKEGEHLNLSAETPVHKGA 639
>gi|223942939|gb|ACN25553.1| unknown [Zea mays]
Length = 199
Score = 174 bits (442), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 84/170 (49%), Positives = 115/170 (67%), Gaps = 8/170 (4%)
Query: 682 MGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSW 740
MGKG+AWVNG IGRYW T +AP+ GC ++C+YRGAY+S KC CG P+QT YHVPRS+
Sbjct: 1 MGKGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGAYSSSKCLKKCGQPSQTLYHVPRSF 60
Query: 741 LQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINK 800
LQ +N LV+FE GG+P +IS +R T VC QVSE+H + WS+ + + +
Sbjct: 61 LQPGSNDLVLFEHFGGDPSKISFVMRQTGSVCAQVSEAHPAQIDSWSS------QQPMQR 114
Query: 801 MAPEMHLHC-QDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
P + L C ++G +ISS++FAS+GTP G C +S G C + +LS+V E
Sbjct: 115 YGPALRLECPKEGQVISSVKFASFGTPSGTCGSYSHGECSSTQALSIVQE 164
>gi|261880887|ref|ZP_06007314.1| family 35 glycosyl hydrolase [Prevotella bergensis DSM 17361]
gi|270332394|gb|EFA43180.1| family 35 glycosyl hydrolase [Prevotella bergensis DSM 17361]
Length = 789
Score = 174 bits (442), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 111/320 (34%), Positives = 167/320 (52%), Gaps = 33/320 (10%)
Query: 59 NRRMLISAG-IHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDI 117
NR ++ A +HYPR W I K G + I YVFWN HE G+++F G +D+
Sbjct: 43 NRPFVVKAAELHYPRIPRAYWDHRIKMCKALGMNTICLYVFWNIHEQREGEFDFSGNSDV 102
Query: 118 VKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKK 177
F +L +G+Y+ +R GPYVCAEW GG P WL I R ++ F E ++ F +K
Sbjct: 103 AAFCRLTQKNGMYIIVRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFMERVEIFEQK 162
Query: 178 IVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVK--WAASMALGLGAGV 233
+ + + L GGPIIM+Q+ENEYG+ + Y Q +D ++ W + G G +
Sbjct: 163 VAEQLAP--LTIQNGGPIIMVQVENEYGSYGEDKKYVGQIRDVLRKYWYTN---GRGPAL 217
Query: 234 ---PWVMCKQTDAPENIIDACN---GYYCDG-------YKPNSYNKPTLWTENWDGWYTT 280
W + + E++I N G D +P++ P + +E W GW+
Sbjct: 218 FQCDWASNFEKNGLEDLIWTMNFGTGANIDAQFMRLGELRPDA---PKMCSEFWSGWFDK 274
Query: 281 WGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITSYDYD 335
WG R RP +D+ + +G SF + YM GGT+FG +G P + +TSYDYD
Sbjct: 275 WGARHETRPAKDMVAGIDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYDYD 333
Query: 336 APIDEYGLLSEPKWGHLKDL 355
API+EYG ++ PK+ L+ +
Sbjct: 334 APINEYGQVT-PKFWELRKM 352
>gi|300775043|ref|ZP_07084906.1| beta-galactosidase [Chryseobacterium gleum ATCC 35910]
gi|300506858|gb|EFK37993.1| beta-galactosidase [Chryseobacterium gleum ATCC 35910]
Length = 621
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 121/364 (33%), Positives = 172/364 (47%), Gaps = 42/364 (11%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
++++ L+ V S F + H +++G + S IHYPR W +
Sbjct: 14 IILLFFSLNTVFSQKGK-----FEIRDGH--FLLNGKPFTIYSGEIHYPRVPSAYWKHRL 66
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K G + + TYVFWN HE G++NF G+ D+ KF+K +GLY+ +R GPYVCAE
Sbjct: 67 EMMKAMGLNTVTTYVFWNYHEEAPGKWNFSGEKDLQKFIKTAQETGLYVIIRPGPYVCAE 126
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
W FGG+P WL+ +E R +N F EE +++ ++ + + + GGP+IM+Q EN
Sbjct: 127 WEFGGYPWWLQKNKELEIRRDNKAFSEECWKYISQLAKQITPMQITN--GGPVIMVQAEN 184
Query: 203 EYGNMESSYGQQGKD--------YVKWAASMALGLGAGVPWVMCKQTD-----APENIID 249
E+G SY Q KD Y M L G VP + + E +
Sbjct: 185 EFG----SYVAQRKDIPLEEHRKYSHKIKEMLLKSGISVPLFTSDGSSLFKGGSVEGALP 240
Query: 250 ACNGYY-CDGYKP--NSYN---KPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQR 303
NG D K N YN P + E + GW W E++ + +
Sbjct: 241 TANGESDIDVLKKSINEYNGGKGPYMIAEYYPGWLDHWAEPFVKVSTEEVVKQTNLYIEN 300
Query: 304 GGSFMNYYMYFGGTNFGRTSGGPF--------YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
G SF NYYM GGTNFG TSG + +TSYDYDAPI E G + PK+ L+ +
Sbjct: 301 GVSF-NYYMIHGGTNFGFTSGANYDKDHDIQPDLTSYDYDAPISEAGWAT-PKYNALRKI 358
Query: 356 HAAI 359
I
Sbjct: 359 FQKI 362
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 58/221 (26%), Positives = 96/221 (43%), Gaps = 57/221 (25%)
Query: 539 VTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEK 598
+ + +RD V+ING+ G + K +E +SG + L +L + +G NYGA +
Sbjct: 432 LEVKGLRDYANVYINGKWKGELNRVNKKYDLDIEIKSG-DRLEILVENMGRINYGAEIVH 490
Query: 599 DGAGFRGQVKLTGFK-NGDIDLSKILWTYQVGLKGEFQQIYSIEEN-----EAEWTDLTR 652
+ G VK+ G + +G+ ++ + + K F+ +IE++ EAE+T L
Sbjct: 491 NLKGIISPVKINGTEVSGNWEMLPL--PFDTFPKHHFKN-KNIEDHSPVIQEAEFT-LNE 546
Query: 653 DGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCD 712
G TF LD+ + GKG ++NG + GRYW+ V P+
Sbjct: 547 TG--DTF-----------------LDMRNFGKGIVFINGRNAGRYWSTVGPQ-------- 579
Query: 713 YRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEE 753
QT Y +P WL+ N + IFE+
Sbjct: 580 ------------------QTLY-IPGVWLKKGRNKIQIFEQ 601
>gi|334330512|ref|XP_001374407.2| PREDICTED: beta-galactosidase-1-like protein 2 [Monodelphis
domestica]
Length = 673
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 165/324 (50%), Gaps = 29/324 (8%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G+R + IHY R E W D + K K G + + TY+ WN HE RG++NF G
Sbjct: 90 FLLEGSRFRIFGGSIHYFRVPREYWKDRLLKLKACGLNTLTTYIPWNLHEPERGKFNFSG 149
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ FV++ GL++ LR GPY+C+EW+ GG P WL +E RT F + +
Sbjct: 150 NLDVEAFVQMAADIGLWVILRPGPYICSEWDLGGLPSWLLQDSSMELRTTYVGFIKAVDL 209
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ R L QGGPII +Q+ENEYG+ + + +Y+ + L G
Sbjct: 210 YFNQLIP--RVVPLQYTQGGPIIAVQVENEYGSYD-----KDPNYMPYIKMALLKRGIVE 262
Query: 234 PWVMCKQTDA-----PENIIDACNGYYCDGYKPNSY-----NKPTLWTENWDGWYTTWGG 283
+ D E ++ N D N NKPT+ TE W GW+ TWGG
Sbjct: 263 LLMTSDNKDGLSGGYVEGVLATINLKNVDSIIFNYLQSFQDNKPTMVTEFWTGWFDTWGG 322
Query: 284 RLPHRPV--EDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYD 335
PH V +D+ +V+ Q G S +N YM+ GGTNFG +G + +TSYDYD
Sbjct: 323 --PHHIVDADDVMVSVSSIIQMGAS-LNLYMFHGGTNFGFMNGAQHFTDYQADVTSYDYD 379
Query: 336 APIDEYGLLSEPKWGHLKDLHAAI 359
A + E G + PK+ L++ + +
Sbjct: 380 AILTEAGDYT-PKFFKLREYFSTL 402
Score = 46.6 bits (109), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 49/214 (22%), Positives = 86/214 (40%), Gaps = 45/214 (21%)
Query: 542 DSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGA 601
D +RD +VF+N G I + V+ + + G+ L +L + G NYG L K
Sbjct: 481 DHIRDRAQVFVNKIYIG-YIDYLVEGLT-IPRGQGHRKLSILVENCGRVNYGLMLNKQRK 538
Query: 602 GFRGQVKL--TGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTF 659
G G + L + +N I Y + +K +F Q Y + + W+ + + F
Sbjct: 539 GLIGDIYLNDSPLRNFKI--------YSLEMKADFFQRYVLS---STWSPVPEEATGPAF 587
Query: 660 TWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNS 719
++ + L L KG ++NG ++GR+W++ G Q+T
Sbjct: 588 --FRGTLHVGFIVLDTFLKLEGWVKGVVFINGQNLGRFWSI-----GPQETL-------- 632
Query: 720 DKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEE 753
++P WL N +++FEE
Sbjct: 633 ---------------YLPGPWLHPGENEIIVFEE 651
>gi|163790001|ref|ZP_02184436.1| glycosyl hydrolase, family 35 [Carnobacterium sp. AT7]
gi|159874701|gb|EDP68770.1| glycosyl hydrolase, family 35 [Carnobacterium sp. AT7]
Length = 595
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 106/316 (33%), Positives = 157/316 (49%), Gaps = 34/316 (10%)
Query: 52 RAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNF 111
+++G +IS IHY R PE W + K G + +ETY+ WN HE+ +Y+F
Sbjct: 8 EEFLLNGEPFKIISGAIHYFRILPEDWYHSLYNLKALGFNTVETYIPWNVHETKEREYDF 67
Query: 112 KGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEM 171
G+ DI +FV+ GL++ LR PY+CAEW FGG P WL + R+++ F E++
Sbjct: 68 SGQLDIQRFVQTAKELGLFVILRPSPYICAEWEFGGLPAWLLTYKNMRIRSSDPQFIEKV 127
Query: 172 QRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGA 231
+ KK+ + + + S GGP+IM+Q+ENEYG SYG+ K+Y+K + L LG
Sbjct: 128 SSYYKKLFEQIVPLQVTS--GGPVIMMQLENEYG----SYGED-KEYLKTLYELMLELGV 180
Query: 232 GVP-------WVMCKQTDAPENIIDACNGYYCDGYKPNSYNK-----------PTLWTEN 273
VP W ++ ++ G + K N N P + E
Sbjct: 181 TVPIFTSDGAWKATQEAGTMTDLDILTTGNFGSQSKENFKNLKEFHESKGKNWPLMCMEY 240
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------ 327
W GW+ W + R +DL V + G +N YM+ GGTNFG +G
Sbjct: 241 WGGWFNRWNDPIIKRDAQDLTNDVKEALKIGS--LNLYMFHGGTNFGFMNGCSARLGKDL 298
Query: 328 -YITSYDYDAPIDEYG 342
+TSYDYDAP++E G
Sbjct: 299 PQLTSYDYDAPLNEQG 314
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 75/182 (41%), Gaps = 47/182 (25%)
Query: 575 SGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI-DLSKILWTYQVGLKGE 633
+G N L +L + +G NYG L D + G + G + DL I Q L +
Sbjct: 438 AGSNQLDVLVENMGRVNYGHKLLAD-------TQQKGIRRGVMSDLHFITDWEQYSL--D 488
Query: 634 FQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHH 693
F + +I+ NE EW ++ PS F YK D P+ +++ GKG VNG +
Sbjct: 489 FLKPLTIDFNE-EW----KENAPS-FYQYKVTIDTPED---TFINMELFGKGIVLVNGFN 539
Query: 694 IGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEE 753
IGR+W V PT + Y P+S + N +++FE
Sbjct: 540 IGRFWNV---------------------------GPTLSLY-APKSLFKKGENEIIVFET 571
Query: 754 TG 755
G
Sbjct: 572 EG 573
>gi|193695178|ref|XP_001948549.1| PREDICTED: beta-galactosidase-like [Acyrthosiphon pisum]
Length = 640
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 119/349 (34%), Positives = 173/349 (49%), Gaps = 38/349 (10%)
Query: 26 MMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKS 85
+ + + C +S+S + + F V Y+ + DG +S +HY R W D I K
Sbjct: 12 LFVFVLCDTSNSTNN--RTFIVDYEKNEFLKDGEVFRYVSGDLHYFRVPKSYWKDRIQKI 69
Query: 86 KEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNF 145
K G + I TYV W+ HE G YNF+G D+ F+KL+ G+YL LR GPY+CAE +F
Sbjct: 70 KAAGLNAITTYVEWSLHEPFPGTYNFEGMADLEYFIKLIQDEGMYLLLRPGPYICAERDF 129
Query: 146 GGFPVWLRDI-PGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEY 204
GGFP WL ++ P RTN++ +K+ + ++ ++ M+ + GG IIM+Q+ENEY
Sbjct: 130 GGFPYWLLNVTPKGSLRTNDSSYKKYVSQWFSVLMKKMQPHLY--GNGGNIIMVQVENEY 187
Query: 205 GNMESSYGQQGKDYVKWAASMALGLGAGVPWV----MCKQTD-----APE--NIID---A 250
G SY DY W + G + +C+Q D PE +D +
Sbjct: 188 G----SYYACDSDYKLWLRDLLKGYVEDKALLYTIDICRQRDFDCGPIPEVYATVDFGIS 243
Query: 251 CNGYYCDGYKPNSYNK--PTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFM 308
N C + N Y K P++ +E + GW W P +D+ + SF
Sbjct: 244 VNAATCFDFLKN-YQKGGPSVNSEFYPGWLAHWQEPHPKVNSDDVVNHMKSMLSLNASF- 301
Query: 309 NYYMYFGGTNFGRTSGGPF-----------YITSYDYDAPIDEYGLLSE 346
++YM+ GGTNFG TSG +TSYDYDAPI E G L+E
Sbjct: 302 SFYMFHGGTNFGFTSGANTNESDANIGYLPQLTSYDYDAPITEAGDLTE 350
>gi|336417631|ref|ZP_08597952.1| hypothetical protein HMPREF1017_05060 [Bacteroides ovatus
3_8_47FAA]
gi|335935372|gb|EGM97326.1| hypothetical protein HMPREF1017_05060 [Bacteroides ovatus
3_8_47FAA]
Length = 782
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 118/347 (34%), Positives = 170/347 (48%), Gaps = 42/347 (12%)
Query: 31 SCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGA 90
SC S SS TF + +++G ++ +A IHYPR E W I K G
Sbjct: 19 SC-SQSSKETF------EIGDKTFLLNGKPFVVKAAEIHYPRIPKEYWEHRIKMCKALGM 71
Query: 91 DVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPV 150
+ I YVFWN HE G+Y+F G+ DI F +L +G+Y+ +R GPYVCAEW GG P
Sbjct: 72 NTICLYVFWNFHEPEEGKYDFTGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPW 131
Query: 151 WLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGN--ME 208
WL I+ R + + E ++ F+ ++ + + + +GG IIM+Q+ENEYG+ ++
Sbjct: 132 WLLKKKDIKLREQDPYYMERVKLFMNEVGKQLTDLQIN--KGGNIIMVQVENEYGSFGID 189
Query: 209 SSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPEN--------IIDACNGYYCDG-- 258
Y + +D VK A GVP C EN I+ G D
Sbjct: 190 KPYIAEIRDIVKQAGF------TGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDDQF 243
Query: 259 -----YKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMY 313
+P+ P + +E W GW+ WG + R EDL + R SF + YM
Sbjct: 244 KRLQELRPDI---PLMCSEFWSGWFDHWGAKHETRSAEDLVKGMKEMLDRNISF-SLYMT 299
Query: 314 FGGTNFGRTSGGPF-----YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
GGT+FG G F TSYDYDAPI+E G ++ PK+ +++L
Sbjct: 300 HGGTSFGHWGGANFPNFSPTCTSYDYDAPINESGKVT-PKYFEVRNL 345
Score = 40.8 bits (94), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 51/221 (23%), Positives = 86/221 (38%), Gaps = 46/221 (20%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSV---IGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGA 594
T+TI D +VF++G+ ++ G ++ P++ + L +L + +G N+G
Sbjct: 419 TLTITEAHDWAQVFLDGKKLATLSRLKGEGTVILPPMKEGA---QLDILVEAMGRMNFGK 475
Query: 595 FLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDG 654
+ D G +V++ NG I K Y + + F Q + +D
Sbjct: 476 GI-YDWKGITEKVEIQS-NNGVITSLKNWKVYNIPVDYAFAQNKEF---------MKQDN 524
Query: 655 IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYR 714
+Y+ F D L++ + KG WVNG+ IGRYW +
Sbjct: 525 PLKYPAYYRGTF-MLDKTGDTFLNMTNWSKGMVWVNGYAIGRYWEI-------------- 569
Query: 715 GAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P QT Y VP WL+ N ++I + G
Sbjct: 570 -------------GPQQTLY-VPGCWLKKGENEVIILDMAG 596
>gi|423295816|ref|ZP_17273943.1| hypothetical protein HMPREF1070_02608 [Bacteroides ovatus
CL03T12C18]
gi|392671544|gb|EIY65016.1| hypothetical protein HMPREF1070_02608 [Bacteroides ovatus
CL03T12C18]
Length = 782
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 118/347 (34%), Positives = 170/347 (48%), Gaps = 42/347 (12%)
Query: 31 SCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGA 90
SC S SS TF + +++G ++ +A IHYPR E W I K G
Sbjct: 19 SC-SQSSKETF------EIGDKTFLLNGKPFVVKAAEIHYPRIPKEYWEHRIKMCKALGM 71
Query: 91 DVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPV 150
+ I YVFWN HE G+Y+F G+ DI F +L +G+Y+ +R GPYVCAEW GG P
Sbjct: 72 NTICLYVFWNFHEPEEGKYDFTGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPW 131
Query: 151 WLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGN--ME 208
WL I+ R + + E ++ F+ ++ + + + +GG IIM+Q+ENEYG+ ++
Sbjct: 132 WLLKKKDIKLREQDPYYMERVKLFMNEVGKQLADLQIS--KGGNIIMVQVENEYGSFGID 189
Query: 209 SSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPEN--------IIDACNGYYCDG-- 258
Y + +D VK A GVP C EN I+ G D
Sbjct: 190 KPYIAEIRDIVKQAGF------TGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDDQF 243
Query: 259 -----YKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMY 313
+P+ P + +E W GW+ WG + R EDL + R SF + YM
Sbjct: 244 KRLQELRPDI---PLMCSEFWSGWFDHWGAKHETRSAEDLVKGMKEMLDRNISF-SLYMT 299
Query: 314 FGGTNFGRTSGGPF-----YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
GGT+FG G F TSYDYDAPI+E G ++ PK+ +++L
Sbjct: 300 HGGTSFGHWGGANFPNFSPTCTSYDYDAPINESGKVT-PKYFEVRNL 345
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 54/223 (24%), Positives = 90/223 (40%), Gaps = 50/223 (22%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSV---IGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGA 594
T+TI D +VF++G+ ++ G ++ P++ + L +L + +G N+G
Sbjct: 419 TLTITEAHDWAQVFLDGRKLATLSRLKGEGTVILPPMKEGA---QLDILVEAMGRMNFGK 475
Query: 595 FLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQ--IYSIEENEAEWTDLTR 652
+ D G +V++ NG I K Y + + F Q + ++N ++ R
Sbjct: 476 GI-YDWKGITEKVEVQS-NNGVITSLKNWKVYNIPVDYAFAQNKKFVKQDNPQKYPAYYR 533
Query: 653 DGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCD 712
TFT KT L++ + KG WVNG+ IGRYW +
Sbjct: 534 ----GTFTLDKT--------GDTFLNMTNWSKGMVWVNGYAIGRYWEI------------ 569
Query: 713 YRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P QT Y VP WL+ N ++I + G
Sbjct: 570 ---------------GPQQTLY-VPGCWLKKGENEVIILDMAG 596
>gi|15228075|ref|NP_178493.1| glycosyl hydrolase family 35 protein [Arabidopsis thaliana]
gi|20198172|gb|AAM15443.1| predicted protein [Arabidopsis thaliana]
gi|330250699|gb|AEC05793.1| glycosyl hydrolase family 35 protein [Arabidopsis thaliana]
Length = 469
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 128/392 (32%), Positives = 175/392 (44%), Gaps = 104/392 (26%)
Query: 312 MYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADS 371
MY G TNF RT+GGPF T+YDYDAP+DE+G L++PK+GHLK LH E L
Sbjct: 23 MYHGHTNFDRTAGGPFITTTYDYDAPLDEFGNLNQPKYGHLKQLHDVFHAMEKTLT---- 78
Query: 372 AQYIKLGQNQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPD 431
Y + ++ Y ++ S F+ N++ A + F G SY +P W VSILPD
Sbjct: 79 --YGNISTADFGNLVMTTVYQTEEGSSCFIGNVN----AKINFQGTSYDVPAWYVSILPD 132
Query: 432 CRNTVFNTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWS 491
C+ +NTAK KL ++
Sbjct: 133 CKTESYNTAK----------------------------RMKLRTS--------------- 149
Query: 492 ENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVF 551
L NV+ D SD+LW++T + + + D ++ K +R I+S VL F
Sbjct: 150 ---------LRFKNVSNDESDFLWYMTTVNLKEQDPAWGKNMSLR----INSTAHVLHGF 196
Query: 552 INGQLTGSV-----IGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQ 606
+NGQ TG+ H+V Q +F G N + LLS TV L NYGAF E AG G
Sbjct: 197 VNGQHTGNYRVENGKFHYV-FEQDAKFNPGVNVITLLSVTVDLPNYGAFFENVPAGITGP 255
Query: 607 VKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYF 666
V + G +NGD + K L T+ K T F
Sbjct: 256 VFIIG-RNGDETVVKYLSTHNGATK-------------------------------LTIF 283
Query: 667 DAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
AP G +PV +DL GKG+A +N ++ GRYW
Sbjct: 284 KAPLGSEPVVVDLLGFGKGKASINENYTGRYW 315
>gi|16611713|gb|AAL27306.1|AF376481_1 BgaC [Carnobacterium maltaromaticum]
Length = 586
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 110/328 (33%), Positives = 166/328 (50%), Gaps = 39/328 (11%)
Query: 63 LISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVK 122
+IS IHY R PE W + K G + +ETYV WN HE +GQY F D+ +F++
Sbjct: 19 IISGAIHYFRVVPEYWEHRLKLLKNMGCNTVETYVAWNQHEPKKGQYVFSDALDLRRFIQ 78
Query: 123 LVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLM 182
L S GL + LR PY+CAE+ FGG P WL + R+ PF E ++ + +++ +
Sbjct: 79 LADSLGLKVILRPSPYICAEFEFGGLPAWLLKDRHMRVRSTYPPFMERVRLYYRELFKEV 138
Query: 183 REEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWV------ 236
+ + S GGPII++Q+ENEYG YG + K Y++ +M G VP V
Sbjct: 139 IDLQITS--GGPIILMQVENEYG----GYGSE-KKYLQELVTMMKENGVTVPLVTSDGPW 191
Query: 237 --MCKQTDAPENIIDACNGYYCDGYKPNSYNK---------PTLWTENWDGWYTTWGGRL 285
M + E+ + N C P +++ P + E W GW+ W +
Sbjct: 192 GDMLENGSLQESALPTVN---CGSAIPEHFDRLAAFKQKKGPLMVMEYWIGWFDAWQDKK 248
Query: 286 PHRP-VEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAPI 338
H V+ ++ +RG +N+YM+ GGTNFG +G +Y TSYDYDAP+
Sbjct: 249 HHTTDVKSSVESLEEILKRGS--VNFYMFHGGTNFGFMNGANYYGKLLPDTTSYDYDAPL 306
Query: 339 DEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
+EYG +E K+ K++ A + +P L
Sbjct: 307 NEYGEQTE-KYKAFKEVIA--RYSDPIL 331
Score = 47.8 bits (112), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 53/228 (23%), Positives = 85/228 (37%), Gaps = 48/228 (21%)
Query: 528 SFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTV 587
S EV ID + D ++FIN L + ++ N L +L + +
Sbjct: 388 SIGAAREVNDFRLID-VADRAQIFINQTLIATKYDQEMEGNIVFTLTEPKNQLGILVENM 446
Query: 588 GLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEW 647
G NY +++ G G + + G + + Q YS+ +
Sbjct: 447 GRVNYSVTMDQQRKGISGGIVVNG-----------------AFQTNWSQ-YSLSLEDPTL 488
Query: 648 TDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGC 707
D +R+ IP T T+ + F+ + D +D+ GKG +VNG ++GRYW V
Sbjct: 489 IDFSREWIPDTPTFSRFVFELEESGD-TFIDMSKWGKGVVFVNGFNLGRYWNV------- 540
Query: 708 QDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P Q Y +P L+ N L+IFE G
Sbjct: 541 --------------------RPQQKLY-IPGPKLKVGVNELIIFETEG 567
>gi|71275091|ref|ZP_00651378.1| Beta-galactosidase [Xylella fastidiosa Dixon]
gi|170731075|ref|YP_001776508.1| beta-galactosidase [Xylella fastidiosa M12]
gi|71163900|gb|EAO13615.1| Beta-galactosidase [Xylella fastidiosa Dixon]
gi|71730559|gb|EAO32637.1| Beta-galactosidase [Xylella fastidiosa Ann-1]
gi|167965868|gb|ACA12878.1| Beta-galactosidase [Xylella fastidiosa M12]
Length = 612
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 119/350 (34%), Positives = 171/350 (48%), Gaps = 40/350 (11%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
I DG LIS IH+ R W D + K++ G + +ETYVFWN E GQ++F G
Sbjct: 35 FIRDGRPYQLISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVELREGQFDFTG 94
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
NDI FV+ S GL + LR GPYVCAEW GGFP WL P + R+ + F + QR
Sbjct: 95 NNDIGAFVREAASQGLNVILRPGPYVCAEWEAGGFPAWLFADPTLRVRSQDPRFLDASQR 154
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+++ + +R L + GGPII +Q+ENEYG SY G D+ A AL + AG+
Sbjct: 155 YLEALGTQVRP--LLNSNGGPIIAMQVENEYG----SY---GDDHGYLQAVRALFIKAGL 205
Query: 234 PWVMCKQTDAPE--------NIIDACNGYYCDGYKPNSYNK--------PTLWTENWDGW 277
+ +D + +++ A N G + +K P L E W GW
Sbjct: 206 GGALLFTSDGAQMLGNGTLPDVLAAVN--VAPGEAKQALDKLATFHPGQPQLVGEYWAGW 263
Query: 278 YTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF---------- 327
+ WG + A + ++G S +N YM+ GGT+FG +G F
Sbjct: 264 FDQWGKPHAQTDAKQQADEIEWMLRQGHS-INLYMFVGGTSFGFMNGANFQGGPGDHYSP 322
Query: 328 YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKL 377
TSYDYDA +DE G PK+ +D+ + +P + A + ++I L
Sbjct: 323 QTTSYDYDAALDEAG-RPMPKFALFRDVITGVTGLQPPPLPA-ATRFIDL 370
>gi|421766812|ref|ZP_16203581.1| Beta-galactosidase 3 [Lactococcus garvieae DCC43]
gi|407624838|gb|EKF51571.1| Beta-galactosidase 3 [Lactococcus garvieae DCC43]
Length = 597
Score = 174 bits (440), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 114/334 (34%), Positives = 159/334 (47%), Gaps = 37/334 (11%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
++D +IS IHY R W D + K GA+ +ETY+ WN HE G ++F+G
Sbjct: 10 FMLDNQPVKIISGAIHYFRIPQSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVFDFEG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
DI FVKL S GL + LR Y+CAEW FGG P WL P + R+ ++ F +++
Sbjct: 70 MKDIHTFVKLAESLGLMVILRPSVYICAEWEFGGLPAWLLKGPEMRLRSTDSRFMTKVEN 129
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ K ++ + + + GGP+IM+Q+ENEYG SYG + KDY++ S+ G V
Sbjct: 130 YFKVLLPYISSLQITA--GGPVIMMQVENEYG----SYGME-KDYLRQTMSLMEKYGINV 182
Query: 234 PWVMCK-----QTDAPENIIDAC-------------NGYYCDGYKPNSYNKPTLWTENWD 275
P DA I D D K + P + E WD
Sbjct: 183 PLFTSDGAWQAALDAGSLIEDDVLVTGNFGSRSKENAAVLADFMKEHGKKWPLMCMEYWD 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSGGPFY 328
GW+ WG + R +DLA V + G +N YM+ GGTNFG R +G
Sbjct: 243 GWFNRWGEPIIKREPQDLADEVKAMLEIGS--LNLYMFHGGTNFGFYNGCSARDTGNLPQ 300
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLC 362
ITSYDYDA + E G EP + A ++C
Sbjct: 301 ITSYDYDALLTEAG---EPTAKYYAVQKAIKEVC 331
>gi|357014284|ref|ZP_09079283.1| beta-galactosidase [Paenibacillus elgii B69]
Length = 591
Score = 174 bits (440), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 112/313 (35%), Positives = 159/313 (50%), Gaps = 40/313 (12%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG L+S IHY R PE W D + K K G + +ETY+ WN HE GQ+ F G
Sbjct: 13 LDGESIRLVSGAIHYFRVVPEYWRDRLLKLKACGFNTVETYIPWNLHEPKPGQFRFDGLA 72
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+V+FV++ G GL++ +R PY+CAEW FGG P WL PG+ R + P+ + + +
Sbjct: 73 DVVRFVEIAGEVGLHVIVRPSPYICAEWEFGGLPAWLLADPGMRVRCMHRPYLDRVDAYY 132
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
V L + L GGPII +QIENEYG SYG + Y+ + L G
Sbjct: 133 D--VLLPLLKPLLCTNGGPIIAMQIENEYG----SYGND-RAYLVYLKDAMLQRGMD--- 182
Query: 236 VMCKQTDAPEN----------IIDACN-GYYCD-------GYKPNSYNKPTLWTENWDGW 277
V+ +D PE+ +++ N G + Y+P+ P + E W+GW
Sbjct: 183 VLLFTSDGPEHFMLQGGMIPGVLETVNFGSRAEEAFEMLRKYQPDG---PIMCMEYWNGW 239
Query: 278 YTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--------PFYI 329
+ WG + R +D+A + G S +N+YM+ GGTNFG SG I
Sbjct: 240 FDHWGEQHHTRDAKDVADVFDDMLRLGAS-VNFYMFHGGTNFGYMSGANCPQRDHYEPTI 298
Query: 330 TSYDYDAPIDEYG 342
TSYDYD P++E G
Sbjct: 299 TSYDYDVPLNESG 311
>gi|347967091|ref|XP_001689312.2| AGAP002056-PA [Anopheles gambiae str. PEST]
gi|333469762|gb|EDO63217.2| AGAP002056-PA [Anopheles gambiae str. PEST]
Length = 629
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 116/351 (33%), Positives = 171/351 (48%), Gaps = 29/351 (8%)
Query: 27 MIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSK 86
+ L + ++ S + F++ YD+ ++DG ++ HY RA PE WP ++ +
Sbjct: 8 LFALVFLFAAPRSVDMRLFSIDYDNDTFVMDGKPFQYVAGSFHYFRALPESWPSILRSMR 67
Query: 87 EGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFG 146
G + I TYV W+ H YN++G DI F++L S+GLY+ LR GPY+CAE + G
Sbjct: 68 AAGLNAITTYVEWSLHNPKEDVYNWQGMADIEHFLELADSAGLYVILRPGPYICAERDMG 127
Query: 147 GFPVW-LRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYG 205
GFP W L P I RTN+ + E++ + ++ L R + QGGPIIM+Q+ENEYG
Sbjct: 128 GFPSWLLHKYPDILLRTNDLRYLREVRTWYAQL--LSRVQRFLVGQGGPIIMVQVENEYG 185
Query: 206 NMESS-------YGQQGKDYVKWAASMALGLGAGVPWV-----MCKQTDAPENIIDACNG 253
+ + + + YV A + G G+ + D D NG
Sbjct: 186 SFYACDHKYLNWLRDETERYVMGNAVLFTNNGPGLEGCGAIEHVLSSLDFGPGTEDEING 245
Query: 254 YYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAV--ARFFQRGGSFMNYY 311
++ K P + E + GW T W + PH D V F R +N Y
Sbjct: 246 FWSTLRKTQP-KGPLVNAEYYPGWLTHW--QEPHMARTDTKPVVDSLDFMLRNKVNVNIY 302
Query: 312 MYFGGTNFGRTSGG--------PFYITSYDYDAPIDEYGLLSEPKWGHLKD 354
M+FGGTN+G T+G +TSYDYDAP+DE G PK+ L+D
Sbjct: 303 MFFGGTNYGFTAGANNMGAGGYAADLTSYDYDAPLDESG-DPTPKYFALRD 352
>gi|156382804|ref|XP_001632742.1| predicted protein [Nematostella vectensis]
gi|156219802|gb|EDO40679.1| predicted protein [Nematostella vectensis]
Length = 612
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 121/372 (32%), Positives = 174/372 (46%), Gaps = 34/372 (9%)
Query: 1 MHSKKNNRALLQCLALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNR 60
+H + +L LA+ V M SS+ + + + R +DG
Sbjct: 5 IHFHRKTIVILSALAILVVLWM---------AFGSSNKRVVVRSKGLVANGRHFTMDGKP 55
Query: 61 RMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKF 120
++S +HY R P+ W D I K K G + +ETYV WN HE I+G +NFK DIV+F
Sbjct: 56 FTILSGAMHYFRIPPQYWEDRIVKLKAMGLNTVETYVSWNLHEEIQGDFNFKDGLDIVEF 115
Query: 121 VKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVD 180
+K LY+ +R GPY+CAEW+ GG P WL P I R+ + F + RF +++
Sbjct: 116 IKTAQKHDLYVIMRPGPYICAEWDLGGLPSWLLHNPNIYLRSLDPIFMKATLRFFDELI- 174
Query: 181 LMREEMLFSWQ---GGPIIMLQIENEYGNMESSYGQQGK---DYVKWAASMALGLGAGVP 234
L +Q GGPII QIENEY + ++S K + V L G+
Sbjct: 175 ----PRLIDYQYSNGGPIIAWQIENEYLSYDNSSAYMRKLQQEMVIRGVKELLFTSDGIW 230
Query: 235 WVMCKQTDAPENIIDACN-----GYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRP 289
+ ++ + ++ N G + N P + TE W GW+ WG
Sbjct: 231 QMQIEKKYSLPGVLKTVNFQRNETNILKGLRKLQPNMPLMVTEFWSGWFDHWGEDKHVLT 290
Query: 290 VEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG-----GPFY--ITSYDYDAPIDEYG 342
VE A + S +NYYM GGTNFG +G G + ITSYDYDAPI E G
Sbjct: 291 VEKAAERTKNILKMESS-INYYMLHGGTNFGFMNGANAENGKYKPTITSYDYDAPISESG 349
Query: 343 LLSEPKWGHLKD 354
++ PK+ L++
Sbjct: 350 DIT-PKYRELRE 360
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 41/167 (24%), Positives = 73/167 (43%), Gaps = 18/167 (10%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIG-HWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFL 596
++ I+S RD +V ++ + S G + +K + N L +L + +G N+ +
Sbjct: 439 SIKIESYRDRAQVLVDNRTYFSAFGKNKLKSIPFGRKTPNSNKLQILVENMGRVNFKQEI 498
Query: 597 EKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWT---DLTRD 653
G G V F +GD +S ++ EF+Q + N+AEW+ + R
Sbjct: 499 NNQRKGILGDV----FVDGDRHMSWKIYPL------EFKQDFMESLNKAEWSVRSERRRR 548
Query: 654 GIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
G P ++ F D + + KG +VNG ++GRYW +
Sbjct: 549 G-PGM---HRGSFSIDDSPKDTFVLMSGWTKGVCFVNGRNLGRYWNI 591
>gi|84494646|ref|ZP_00993765.1| beta-galactosidase [Janibacter sp. HTCC2649]
gi|84384139|gb|EAQ00019.1| beta-galactosidase [Janibacter sp. HTCC2649]
Length = 592
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 117/342 (34%), Positives = 173/342 (50%), Gaps = 45/342 (13%)
Query: 63 LISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVK 122
++S IHY R P++W D + + G + +ETYV WN HE +RG+ +F G D+ +F+
Sbjct: 26 VLSGAIHYFRIHPDLWEDRLRRLAAMGLNTVETYVAWNFHERVRGEIDFTGPRDLARFIS 85
Query: 123 LVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLM 182
L G GL + +R GPY+CAEW+FGG P WL PGI RT++ F + + +V ++
Sbjct: 86 LAGDLGLDVIVRPGPYICAEWDFGGLPAWLMTEPGIALRTSDPAFLAAVDDWFDAVVPVI 145
Query: 183 REEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTD 242
R L + GGP++ +Q+ENEYG SYG Y++ L G V+ +D
Sbjct: 146 RP--LLTTAGGPVVAVQVENEYG----SYGDDAA-YLEHCRKGLLDRGID---VLLFTSD 195
Query: 243 AP----------ENIIDACN-GYYCD----GYKPNSYNKPTLWTENWDGWYTTWGGRLPH 287
P ++ N G D + P + E W+GW+ WG PH
Sbjct: 196 GPGPDWLDNGTIPGVLATVNFGSRTDEAFAELRKVQPAGPDMVMEYWNGWFDHWG--EPH 253
Query: 288 --RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-------YITSYDYDAPI 338
R V+D A + + GGS +N+YM GGTNFG SG +TSYDYDA +
Sbjct: 254 HVRDVDDAAGVLDDVLRAGGS-VNFYMAHGGTNFGLWSGANVEDGKLQPTVTSYDYDAAV 312
Query: 339 DEYGLLSEPKWGHLKDL---HAAIKLCE----PALVAADSAQ 373
E G L+ PK+ +++ +A L E PA +A +A+
Sbjct: 313 GEAGELT-PKFHAFREVISRYAVTALPELPPLPARLAPQTAE 353
>gi|223982755|ref|ZP_03632983.1| hypothetical protein HOLDEFILI_00257 [Holdemania filiformis DSM
12042]
gi|223965255|gb|EEF69539.1| hypothetical protein HOLDEFILI_00257 [Holdemania filiformis DSM
12042]
Length = 592
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 127/453 (28%), Positives = 205/453 (45%), Gaps = 54/453 (11%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
++DG LIS +HY R PE W D + K K G + +ETY+ WN HE +GQ++F G
Sbjct: 10 FMLDGQPVKLISGALHYFRIVPEYWQDRLEKLKNMGCNCVETYIPWNYHEPKKGQFDFSG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+ D+ +FV+ + GL++ LR PY+CAEW FGG P WL + R+ P+ + +
Sbjct: 70 RKDVARFVRKAQALGLWVILRPTPYICAEWEFGGLPAWLLADDSMRVRSTYQPYLDAVDA 129
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ ++ ++R LF GGP++M QIENEYG S+G K Y+K + G V
Sbjct: 130 YYAELFKVIRP--LFFTHGGPVLMCQIENEYG----SFGND-KQYLKAIKRLMEKHGCDV 182
Query: 234 PW---------VMCKQTDAPENIIDACN---------GYYCDGYKPNSYNKPTLWTENWD 275
P V+ T E ++ N G N + P + E W
Sbjct: 183 PMFTSDGGWREVLDAGTLLNEGVLPTANFGSRTDEQIGALRQFMNDNDIHGPLMCMEFWI 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------I 329
GW+ WG L R ++ A + ++G +N YM+ GGTN +G ++ I
Sbjct: 243 GWFNNWGSPLKTRDAKEAADELDAMLRQGS--VNIYMFHGGTNPEFYNGCSYHNGMDPQI 300
Query: 330 TSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEAHVYRAN 389
TSYDY AP+ E+G +E K+ +++ A P ++ + + N
Sbjct: 301 TSYDYAAPLTEWGTEAE-KYAAFREVIAKYNPITPVPLST------PITFKSYGELRCEN 353
Query: 390 RYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSI----------LPDC--RNTVF 437
+ + S+ I+ + LGQ Y + + L DC R VF
Sbjct: 354 KVSLFNTLSSLAQPIETDIPQPMEKLGQGYGYILYRAHVGKARELAKAKLADCDDRAQVF 413
Query: 438 NTAKVSSQTSIKTVEFSLPLSPNISVPQQSMIE 470
K+ + +T+ ++PL+ + P ++I+
Sbjct: 414 VNQKLIATQYKETMGSNIPLT--LDHPTDNVID 444
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 51/218 (23%), Positives = 87/218 (39%), Gaps = 60/218 (27%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLI-LLSQTVGLQNYGAFL--EKDGAG 602
D +VF+N +L + + P+ +++I +L + +G NYGA L G
Sbjct: 408 DRAQVFVNQKLIATQYKETMGSNIPLTLDHPTDNVIDILVENLGRINYGASLVSPHQRKG 467
Query: 603 FRGQVKL-----TGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPS 657
+G L TG++ ++L + QV GE+Q+ G+P+
Sbjct: 468 IKGGFMLDLHFHTGWQQYCLELDNV---DQVDFTGEYQE-----------------GVPA 507
Query: 658 TFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAY 717
+Y+ D + D L+L GKG A++NG ++GR+W +
Sbjct: 508 ---FYQFTVDIEEPAD-TFLNLNGWGKGAAFLNGENLGRFWEL----------------- 546
Query: 718 NSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
PT Y +P L+ N +V+FE G
Sbjct: 547 ----------GPTHYLY-IPAPLLKKGKNTIVLFETEG 573
>gi|257899628|ref|ZP_05679281.1| glycosyl hydrolase [Enterococcus faecium Com15]
gi|257837540|gb|EEV62614.1| glycosyl hydrolase [Enterococcus faecium Com15]
Length = 595
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 114/319 (35%), Positives = 157/319 (49%), Gaps = 38/319 (11%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
++DG +IS IHY R P W + K GA+ +ETY+ WN HE G ++F G
Sbjct: 11 LVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGF 70
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
DIV+FVK+ L + LR Y+CAEW FGG P WL P I R+ + F E+++ +
Sbjct: 71 KDIVQFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDIRVRSTDPRFMEKLKNY 130
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
+ V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + L VP
Sbjct: 131 YQ--VLLPKLAPLQITQGGPVIMMQLENEYG----SYGME-KSYLRQTKELMLAHSIDVP 183
Query: 235 W---------VMCKQTDAPENIIDACNGYYCDGYKPNSY-----------NKPTLWTENW 274
V+ T E+I G + K N+ N P + E W
Sbjct: 184 LFTSDGAWLEVLDAGTLIDEDIF--VTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYW 241
Query: 275 DGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------- 327
DGW+ WG + R E+LA V + G +N YM+ GGTNFG +G
Sbjct: 242 DGWFNRWGEPIITRDPEELATEVKEMLEIGS--LNLYMFHGGTNFGFYNGCSARGNTDLP 299
Query: 328 YITSYDYDAPIDEYGLLSE 346
ITSYDYDA ++E G +E
Sbjct: 300 QITSYDYDALLNEAGQPTE 318
>gi|28199702|ref|NP_780016.1| beta-galactosidase [Xylella fastidiosa Temecula1]
gi|182682446|ref|YP_001830606.1| beta-galactosidase [Xylella fastidiosa M23]
gi|386083781|ref|YP_006000063.1| Beta-galactosidase [Xylella fastidiosa subsp. fastidiosa GB514]
gi|417557800|ref|ZP_12208811.1| Beta-galactosidase [Xylella fastidiosa EB92.1]
gi|28057823|gb|AAO29665.1| beta-galactosidase [Xylella fastidiosa Temecula1]
gi|182632556|gb|ACB93332.1| Beta-galactosidase [Xylella fastidiosa M23]
gi|307578728|gb|ADN62697.1| Beta-galactosidase [Xylella fastidiosa subsp. fastidiosa GB514]
gi|338179583|gb|EGO82518.1| Beta-galactosidase [Xylella fastidiosa EB92.1]
Length = 612
Score = 173 bits (438), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 120/350 (34%), Positives = 170/350 (48%), Gaps = 40/350 (11%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
I DG LIS IH+ R W D + K++ G + +ETYVFWN E GQ++F G
Sbjct: 35 FIRDGRPYQLISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVELREGQFDFTG 94
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
NDI FV+ S GL + LR GPYVCAEW GGFP WL P + R+ + F + QR
Sbjct: 95 NNDIGAFVREAASQGLNVILRPGPYVCAEWEAGGFPAWLFADPTLRVRSQDPRFLDASQR 154
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+++ + +R L + GGPII +Q+ENEYG SY G D+ A AL + AG+
Sbjct: 155 YLEALGTQVRP--LLNGNGGPIIAVQVENEYG----SY---GDDHGYLQAVRALFIKAGL 205
Query: 234 PWVMCKQTDAPE--------NIIDACNGYYCDGYKPNSYNK--------PTLWTENWDGW 277
+ D + +++ A N G + +K P L E W GW
Sbjct: 206 GGALLFTADGAQMLGNGTLPDVLAAVN--VAPGEAKQALDKLATFHPGQPQLVGEYWAGW 263
Query: 278 YTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF---------- 327
+ WG + A + ++G S +N YM+ GGT+FG +G F
Sbjct: 264 FDQWGKPHAQTDAKQQADEIEWMLRQGHS-INLYMFVGGTSFGFMNGANFQGGPSDHYSP 322
Query: 328 YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKL 377
TSYDYDA +DE G PK+ +D+ + +P + A S ++I L
Sbjct: 323 QTTSYDYDAVLDEAG-RPMPKFALFRDVITRVTGLQPPPLPAAS-RFIDL 370
>gi|126347898|emb|CAJ89618.1| putative beta-galactosidase [Streptomyces ambofaciens ATCC 23877]
Length = 615
Score = 173 bits (438), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 109/333 (32%), Positives = 163/333 (48%), Gaps = 28/333 (8%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+++ H A + G ++S +HY R PE W D + + G + ++TYV WN HE
Sbjct: 24 TLTHTHGAFLRRGRPHRVLSGSLHYFRVHPEQWADRLDRLAALGLNTVDTYVPWNFHERR 83
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+ F G D+ +FV+L +GL + +R GPY+CAEW+ GG P WL PG+ R +
Sbjct: 84 PGEARFDGWRDLARFVRLAQRAGLDVMVRPGPYICAEWDNGGLPAWLTGTPGMRLRAGHQ 143
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
P+ + + R+ +V + E L + GGP++ +QIENEYG SYG YV+W
Sbjct: 144 PYLDAVARWFDALVPRVAE--LQAVHGGPVVAVQIENEYG----SYGDD-HAYVRWVRDA 196
Query: 226 ALGLGA--------GVPWVMCKQTDAPENIIDACNGYYCDG----YKPNSYNKPTLWTEN 273
+ G G +M P + A G + +P L E
Sbjct: 197 LVDRGITELLYTADGPTPLMLDGGTVPGELAAATFGSRAAEAAALLRSRRPGEPFLCAEF 256
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------ 327
W+GW+ WG + R + A V GGS ++ YM GGTNFG +G
Sbjct: 257 WNGWFDHWGEKHHVRSRDGAAQEVEEILDAGGS-VSLYMAHGGTNFGLWAGANHDGGVLR 315
Query: 328 -YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAI 359
+TSYD DAP+ E+G L+ PK+ L++ AA+
Sbjct: 316 PTVTSYDSDAPVSEHGALT-PKFHALRERFAAL 347
>gi|383112460|ref|ZP_09933253.1| hypothetical protein BSGG_0667 [Bacteroides sp. D2]
gi|313693132|gb|EFS29967.1| hypothetical protein BSGG_0667 [Bacteroides sp. D2]
Length = 782
Score = 173 bits (438), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 114/348 (32%), Positives = 173/348 (49%), Gaps = 37/348 (10%)
Query: 30 LSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGG 89
++ S+S + + + F + + +++G ++ +A IHYPR E W I K G
Sbjct: 13 VTVFSTSCSQSSKEIFEIG--DKTFLLNGKPFVVKAAEIHYPRIPKEYWEHRIKMCKALG 70
Query: 90 ADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFP 149
+ I YVFWN HE G+Y+F G+ DI F +L +G+Y+ +R GPYVCAEW GG P
Sbjct: 71 MNTICLYVFWNFHEPEEGKYDFTGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLP 130
Query: 150 VWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGN--M 207
WL I+ R + + E ++ F+ ++ + + + +GG IIM+Q+ENEYG+ +
Sbjct: 131 WWLLKKKDIKLREQDPYYMERVKLFMNEVGKQLTDLQIS--KGGNIIMVQVENEYGSFGI 188
Query: 208 ESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPEN--------IIDACNGYYCDG- 258
+ Y + +D VK A GVP C EN I+ G D
Sbjct: 189 DKPYIAEIRDIVKQAGF------TGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDDQ 242
Query: 259 ------YKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYM 312
+P+ P + +E W GW+ WG + R EDL + R SF + YM
Sbjct: 243 FKRLQELRPDI---PLMCSEFWSGWFDHWGAKHETRSAEDLVKGMKEMLDRNISF-SLYM 298
Query: 313 YFGGTNFGRTSGGPF-----YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
GGT+FG G F TSYDYDAPI+E G ++ PK+ +++L
Sbjct: 299 THGGTSFGHWGGANFPNFSPTCTSYDYDAPINESGKVT-PKYFEVRNL 345
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 54/223 (24%), Positives = 90/223 (40%), Gaps = 50/223 (22%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSV---IGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGA 594
T+TI D +VF++G+ ++ G ++ P++ + L +L + +G N+G
Sbjct: 419 TLTITEAHDWAQVFLDGKKLATLSRLKGEGTVILPPMKEGA---QLDILVEAMGRMNFGK 475
Query: 595 FLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQ--IYSIEENEAEWTDLTR 652
+ D G +V++ NG I K Y + + F Q + ++N ++ R
Sbjct: 476 GI-YDWKGITEKVEVQS-NNGVITSLKNWKVYNIPVDYAFAQNKKFVKQDNPQKYPAYYR 533
Query: 653 DGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCD 712
TFT KT L++ + KG WVNG+ IGRYW +
Sbjct: 534 ----GTFTLDKT--------GDTFLNMTNWSKGMVWVNGYAIGRYWEI------------ 569
Query: 713 YRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P QT Y VP WL+ N ++I + G
Sbjct: 570 ---------------GPQQTLY-VPGCWLKKGENEVIILDMAG 596
>gi|1669595|dbj|BAA13685.1| AR782 [Arabidopsis thaliana]
Length = 206
Score = 173 bits (438), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 84/170 (49%), Positives = 116/170 (68%), Gaps = 6/170 (3%)
Query: 683 GKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWL 741
GKG AWVNG IGRYW T +A GGC ++CDYRG+Y ++KC NCG P+QT YHVPRSWL
Sbjct: 5 GKGIAWVNGQSIGRYWPTSIAGNGGCTESCDYRGSYRANKCLKNCGKPSQTLYHVPRSWL 64
Query: 742 QASNNLLVIFEETGGNPFEISVKLRST-RIVCEQVSESHYPPVRKWSNSYSVDGKLSINK 800
+ S N+LV+FEE GG+P +IS + T +C VS+SH PPV W++ + + N+
Sbjct: 65 KPSGNILVLFEEMGGDPTQISFATKQTGSNLCLTVSQSHPPPVDTWTSDSKISNR---NR 121
Query: 801 MAPEMHLHCQ-DGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
P + L C +I SI+FAS+GTP+G C F++G+C++ SLS+V +
Sbjct: 122 TRPVLSLKCPISTQVIFSIKFASFGTPKGTCGSFTQGHCNSSRSLSLVQK 171
>gi|373460889|ref|ZP_09552639.1| hypothetical protein HMPREF9944_00903 [Prevotella maculosa OT 289]
gi|371954714|gb|EHO72523.1| hypothetical protein HMPREF9944_00903 [Prevotella maculosa OT 289]
Length = 780
Score = 173 bits (438), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 109/324 (33%), Positives = 165/324 (50%), Gaps = 31/324 (9%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++ +A +HYPR W I K G + + YVFWN HE GQ++F G
Sbjct: 35 FLLNGRPFVIKAAELHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQREGQFDFTG 94
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
ND+ F +L +G+Y+ +R GPYVCAEW GG P WL + R ++ F ++
Sbjct: 95 NNDVAAFCRLAHKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDVRLREDDPYFMARVKA 154
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGN--MESSYGQQGKDYVK----------- 220
F ++ + L GGPIIM+Q+ENEYG+ + Y + +D VK
Sbjct: 155 FEAEVGRQLAP--LTIQNGGPIIMVQVENEYGSYGINKKYVSEIRDIVKASGFDKVTLFQ 212
Query: 221 --WAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWY 278
WA++ + W M T A NI + +P + P + +E W GW+
Sbjct: 213 CDWASNFEHNGLDDLVWTMNFGTGA--NIDEQFR--RLKQLRPEA---PLMCSEFWSGWF 265
Query: 279 TTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITSYD 333
WG R RP +D+ + ++G SF + YM GGT+FG +G P + +TSYD
Sbjct: 266 DKWGARHETRPAKDMVEGIDEMLRKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYD 324
Query: 334 YDAPIDEYGLLSEPKWGHLKDLHA 357
YDAPI+EYG+ + PK+ L++ A
Sbjct: 325 YDAPINEYGMPT-PKFFALRNTMA 347
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 40/95 (42%), Gaps = 29/95 (30%)
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ YFD D L+L GKGQ +VNGH +GR+W +
Sbjct: 540 YYRGYFDLKKTGD-TFLNLEQWGKGQVYVNGHALGRFWHI-------------------- 578
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P QT Y +P WL+ N +++ + G
Sbjct: 579 -------GPQQTLY-LPGCWLKKGRNEIIVLDVVG 605
>gi|281422858|ref|ZP_06253857.1| beta-galactosidase [Prevotella copri DSM 18205]
gi|281403124|gb|EFB33804.1| beta-galactosidase [Prevotella copri DSM 18205]
Length = 788
Score = 172 bits (437), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 113/328 (34%), Positives = 167/328 (50%), Gaps = 35/328 (10%)
Query: 52 RAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNF 111
+ +++G ++ +A +HYPR W I K G + + YVFWN HE G+++F
Sbjct: 37 KTFLLNGKPFVVKAAELHYPRIPRAYWEHRIKMCKALGMNTVCLYVFWNIHEQEEGKFDF 96
Query: 112 KGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEM 171
G ND+ F +L +G+Y+ +R GPYVCAEW GG P WL I R + F + +
Sbjct: 97 TGNNDVAAFCRLAQKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREQDPYFMQRV 156
Query: 172 QRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGA 231
+ F K++ + L GGPIIM+Q+ENEYG SYG+ K YV +A + +
Sbjct: 157 EIFEKEVGKQLAP--LTIQNGGPIIMVQVENEYG----SYGKD-KPYV--SAIRDIVRKS 207
Query: 232 GVPWVMCKQTDAPENIIDACNGY--------YCDGYKPNSY---------NKPTLWTENW 274
G V Q D N ++ NG + G + N P + +E W
Sbjct: 208 GFDKVSLFQCDWSSNFLN--NGLDDLTWTMNFGTGANIDQQFKRLGEVRPNAPKMCSEFW 265
Query: 275 DGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---I 329
GW+ WG R RP +D+ + +G SF + YM GGT+FG +G P + +
Sbjct: 266 SGWFDKWGARHETRPAKDMVEGMDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFQPDV 324
Query: 330 TSYDYDAPIDEYGLLSEPKWGHLKDLHA 357
TSYDYDAPI+E+GL + PK+ L+ + A
Sbjct: 325 TSYDYDAPINEWGLAT-PKFYELQKMMA 351
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 51/220 (23%), Positives = 92/220 (41%), Gaps = 54/220 (24%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYN--DLILLSQTVGLQNYGAFLEKDGAGF 603
D +V++NG+ G + VK + +E + L ++ + +G N+G + KD G
Sbjct: 430 DFAQVYVNGKYVGKI--DRVKNEKSLELPAMPQGAQLTIVVEGMGRINFGRAI-KDYKGI 486
Query: 604 RGQVKLTGFK-NGDIDLSKILWT-------YQVGLKGEFQQIYSIEENEAEWTDLTRDGI 655
G V LT K N ++ L+ W YQ +K ++ N+ G+
Sbjct: 487 IGNVTLTTQKENCELALTPTRWNNSSIADDYQTAVKA-----LAMPTNKMR-------GL 534
Query: 656 PSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRG 715
+ +Y+ YF+ + +++ + GKGQ +VNGH +GR+W +
Sbjct: 535 QTKAGYYRGYFNIKK-VGDTFINMEAFGKGQVYVNGHALGRFWQI--------------- 578
Query: 716 AYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P QT Y +P WL+ N +++ + G
Sbjct: 579 ------------GPQQTLY-LPGCWLKKGKNEVIVLDVVG 605
>gi|71731106|gb|EAO33173.1| Beta-galactosidase [Xylella fastidiosa subsp. sandyi Ann-1]
Length = 612
Score = 172 bits (437), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 119/350 (34%), Positives = 170/350 (48%), Gaps = 40/350 (11%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
I DG LIS IH+ R W D + K++ G + +ETYVFWN E GQ++F G
Sbjct: 35 FIRDGRPYQLISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVELREGQFDFTG 94
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
NDI FV+ S GL + LR GPYVCAEW GGFP WL P + R+ + F + QR
Sbjct: 95 NNDIGAFVREAASQGLNVILRPGPYVCAEWEAGGFPAWLFADPTLRVRSQDPRFLDASQR 154
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+++ + +R L + GGPII +Q+ENEYG SY G D+ A AL + AG+
Sbjct: 155 YLEALGTQVRP--LLNGNGGPIIAVQVENEYG----SY---GDDHGYLQAVHALFIKAGL 205
Query: 234 PWVMCKQTDAPE--------NIIDACNGYYCDGYKPNSYNK--------PTLWTENWDGW 277
+ D + +++ A N + G + +K P L E W GW
Sbjct: 206 GGALLFTADGAQMLGNGTLPDVLAAVN--FAPGEAKQALDKLATFHPGQPQLVGEYWAGW 263
Query: 278 YTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF---------- 327
+ WG + A + ++G S +N YM+ GGT+FG +G F
Sbjct: 264 FDQWGKPHAQTDAKQQADEIEWMLRQGHS-INLYMFVGGTSFGFMNGANFQGGPGDHYSP 322
Query: 328 YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKL 377
TSYDYDA +DE G PK+ +D+ + +P + S ++I L
Sbjct: 323 QTTSYDYDAVLDEAG-RPMPKFALFRDVITRVTGLQPPPLPGAS-RFIDL 370
>gi|339640120|ref|ZP_08661564.1| glycosyl hydrolase family 35 [Streptococcus sp. oral taxon 056 str.
F0418]
gi|339453389|gb|EGP66004.1| glycosyl hydrolase family 35 [Streptococcus sp. oral taxon 056 str.
F0418]
Length = 595
Score = 172 bits (437), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 112/319 (35%), Positives = 163/319 (51%), Gaps = 40/319 (12%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG ++S I Y R P+ W + + K G + +ETY+ W+ HE GQ+ G
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRETLHNLKALGYNTVETYIPWSLHEPQEGQFVTDGLL 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D + LV GL+L +R PY+CAE++FGG P WL + PG+ FR N+A F E++ RF
Sbjct: 72 DFEAYFDLVQEMGLHLIVRPTPYICAEFDFGGMPPWLLNYPGMRFRVNDALFLEKVSRF- 130
Query: 176 KKIVDLMREEML---FSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAG 232
D + ++L F+ +GGPI+M+Q+ENEYG SY + K+Y++ A M G
Sbjct: 131 ---YDWLFPKLLPYQFT-EGGPILMMQVENEYG----SYAED-KEYMRNIAKMMRDRGVS 181
Query: 233 VP-------WVMCKQTDAPENIIDACNGYYCDGYKPNSYNK-----------PTLWTENW 274
VP W+ ++ G + K N+ N P + TE W
Sbjct: 182 VPLFTSDGTWIEALESGTLIEDDIFVTGNFGSQAKENTDNLRAFMERHGKKWPLMCTEFW 241
Query: 275 DGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ 328
DGW++ WG + R EDLA V + G MN ++ GGTNFG SG
Sbjct: 242 DGWFSRWGEEIVRRDAEDLAQDVKEMMRIGS--MNLFLLRGGTNFGFISGCSARKTRDLP 299
Query: 329 -ITSYDYDAPIDEYGLLSE 346
ITSYD+DAP+ E+G+ +E
Sbjct: 300 QITSYDFDAPVTEWGVPTE 318
>gi|357626884|gb|EHJ76789.1| putative carbamoyl-phosphate synthase large chain [Danaus
plexippus]
Length = 2861
Score = 172 bits (437), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 112/351 (31%), Positives = 176/351 (50%), Gaps = 51/351 (14%)
Query: 37 SASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETY 96
S F N+S ++DG ++S +HY R E W D + K + G + + TY
Sbjct: 45 SQKDFQNARNISIVGDDFMLDGKPLRIVSGSVHYYRLPAEYWRDRLRKIRAAGLNAVSTY 104
Query: 97 VFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVW-LRDI 155
V W++HE G Y+F+G DI +F+K+ LY+ LR GPY+CAE + GG P W L
Sbjct: 105 VEWSSHEEEEGAYSFEGDKDIARFLKIAAEENLYVLLRPGPYICAERDLGGLPYWLLSKY 164
Query: 156 PGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESS--YGQ 213
P I+ RT + F E ++++ K+ + ++ +L GGPII++Q+ENEYG+ +S Y +
Sbjct: 165 PDIKLRTTDGNFIAETKKWMAKLFEEVKPFLL--GNGGPIILVQVENEYGSYGASKEYMK 222
Query: 214 QGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDG----------YKP-- 261
Q +D +K A + TD P Y+ DG + P
Sbjct: 223 QIRDIIKSHVEDA---------ALLYTTDGP------YRSYFIDGSISGTLTTIDFGPTT 267
Query: 262 ---NSYNK--------PTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNY 310
N++ + P + +E + GW T W + + + F + + + +N+
Sbjct: 268 SVINTFKELRAYMPVGPLMNSEFYPGWLTHWSEHIQQVSTDRVTFTLRDMLENKIN-LNF 326
Query: 311 YMYFGGTNFGRTSG---GPFY---ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
Y++FGGTNF TSG G FY ITSYDYDAP+ E G +E K+ ++D+
Sbjct: 327 YVFFGGTNFEFTSGANYGRFYQPDITSYDYDAPLSEAGDPTE-KYYAIRDV 376
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 45/169 (26%), Positives = 78/169 (46%), Gaps = 13/169 (7%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLE 597
+ I RD + V+++ +L G + + + + G + L LL + G N+G +
Sbjct: 455 VLNIKKPRDFIFVYVDKKLQGVISRMMMLYSLSINSKPG-STLSLLVENQGRINFGNRIH 513
Query: 598 KDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPS 657
D G G V L N ++ + Y + +K ++ S + A D DG P
Sbjct: 514 -DFKGILGSVLLN---NKTLEGPWSVTGYSLDVKK--SKLLSDDNISAFTEDALSDG-PM 566
Query: 658 TFTWYKTYFDAPDGIDPVA--LDLGSMGKGQAWVNGHHIGRYWTVVAPK 704
F + F P+G +P+ +D + GKG +VNG+++GRYW V P+
Sbjct: 567 MF---EGQFVIPEGEEPLDTFIDTTNWGKGYIFVNGYNLGRYWPKVGPQ 612
>gi|427385726|ref|ZP_18882033.1| hypothetical protein HMPREF9447_03066 [Bacteroides oleiciplenus YIT
12058]
gi|425726765|gb|EKU89628.1| hypothetical protein HMPREF9447_03066 [Bacteroides oleiciplenus YIT
12058]
Length = 1106
Score = 172 bits (437), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 117/352 (33%), Positives = 173/352 (49%), Gaps = 36/352 (10%)
Query: 53 AIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFK 112
+ +++G ++ +A +HYPR W I K G + + YVFWN+HE G Y+F
Sbjct: 356 SFLLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGTYDFT 415
Query: 113 GKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQ 172
+ND+ +F +L + +Y+ LR GPYVCAEW GG P WL I R ++ F E +
Sbjct: 416 EQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFIERVN 475
Query: 173 RFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGN--MESSYGQQGKDYVK---------- 220
F + + +++ + + GGPIIM+Q+ENEYG+ + Y Q +D V+
Sbjct: 476 LFEEAVAKQVKDLTIAN--GGPIIMVQVENEYGSYGADKGYVSQIRDIVRTHFGNDIALF 533
Query: 221 ---WAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGW 277
WA++ L + W M T A + A +PNS P + +E W GW
Sbjct: 534 QCDWASNFTLNGLDDLIWTMNFGTGANVDQQFA----KLKKLRPNS---PLMCSEFWSGW 586
Query: 278 YTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITSY 332
+ WG RP ED+ + RG SF + YM GGTN+G +G P + +TSY
Sbjct: 587 FDKWGANHETRPAEDMIKGIDDMLSRGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSY 645
Query: 333 DYDAPIDEYGLLSEPKWGHLKDLHAAIKLCE-----PALVAADSAQYIKLGQ 379
DYDAPI E G + PK+ L++ A E PAL+ S K +
Sbjct: 646 DYDAPISESGQTT-PKYWKLREAMAKYMDGEKQAKVPALIKPISIPAFKFTE 696
Score = 44.3 bits (103), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 55/237 (23%), Positives = 92/237 (38%), Gaps = 70/237 (29%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLIL-----------LSQT 586
T+T+ D ++F++G+ G + + ++G L+L L +
Sbjct: 740 TLTVSDAHDYAQIFVDGKYIGKL-----------DRRNGEKQLVLPGCPKGAQLDILVEA 788
Query: 587 VGLQNYGAFLEKDGAGFRGQVKLTGFKNG--------DIDLSKILWTYQVGLKGEFQQIY 638
+G N+G + KD G VKL+ NG + ++ + TY+ +FQ I
Sbjct: 789 MGRINFGRAI-KDFKGITKNVKLSTDINGYPFTCDLKNWEVYNLEDTYEFYQGMKFQPIQ 847
Query: 639 SIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
S+ +N + IP Y+ F D L+ + GKG +VNG+ +GR W
Sbjct: 848 SLTDNLGQ-------RIPGV---YRAKFQVKKPSD-TFLNFETWGKGLVYVNGYALGRIW 896
Query: 699 TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
+ P QT Y VP WL+ N +V+F+ G
Sbjct: 897 EI---------------------------GPQQTLY-VPGCWLKKGENEIVVFDIVG 925
>gi|300770171|ref|ZP_07080050.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33861]
gi|300762647|gb|EFK59464.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33861]
Length = 638
Score = 172 bits (437), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 127/362 (35%), Positives = 170/362 (46%), Gaps = 51/362 (14%)
Query: 24 MMMMIHLSCVSSSSASTF-FKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
M I V + TF K N YD +A I +S +HY R + W +
Sbjct: 17 FMSTIAFQDVQAQKKHTFEIKDGNFVYDGKATRI-------LSGEMHYARIPHQYWKHRL 69
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K G + + TYVFWN HE G +NF+G +D+ F+K G GL++ LR GPY CAE
Sbjct: 70 QMVKSMGLNTVATYVFWNFHEESPGNWNFEGDHDLAAFIKTAGEVGLHVILRPGPYACAE 129
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEM--LFSWQGGPIIMLQI 200
W+FGG+P WL+ I G+E R +NA F E + KK +D + +E+ L GGPIIM+Q
Sbjct: 130 WDFGGYPWWLQKIDGLEIRRDNAKFLE----YTKKYIDRLAKEVGSLQITNGGPIIMVQA 185
Query: 201 ENEYGNMESSYGQQGKD-----YVKWAASMALGL---GAGVPWVMCK-----QTDAPENI 247
ENE+G SY Q KD + + A + L G VP + A
Sbjct: 186 ENEFG----SYVSQRKDIPLEEHKAYNAKIKKQLEEAGFNVPLFTSDGSWLFEGGAIPGA 241
Query: 248 IDACNG--------YYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVAR 299
+ NG D Y N+ P + E + GW W +A +
Sbjct: 242 LPTANGENNISNLKKVVDQY--NNNQGPYMVAEFYPGWLDHWAEPFAKVDAGRIARQTEK 299
Query: 300 FFQRGGSFMNYYMYFGGTNFGRTSGGPFY--------ITSYDYDAPIDEYGLLSEPKWGH 351
+ Q SF NYYM GGTNFG TSG + ITSYDYDAPI E G + PK+
Sbjct: 300 YLQNDISF-NYYMVHGGTNFGFTSGANYNNKSDIQPDITSYDYDAPISEAGWTT-PKYDS 357
Query: 352 LK 353
++
Sbjct: 358 IR 359
Score = 43.1 bits (100), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 55/237 (23%), Positives = 100/237 (42%), Gaps = 46/237 (19%)
Query: 539 VTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGY---NDLILLSQTVGLQNYGAF 595
+ ID +RD V+I+G + +G +V + E + L +L + +G NYG+
Sbjct: 435 LKIDGLRDFAVVYIDG----TKVGELNRVFKNYEMDIDIPFNSTLQILVENMGRINYGSE 490
Query: 596 LEKDGAGFRGQVKLTGFK-NGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDL-TRD 653
+ + G V + + GD + ++ L G +Q +I+ + + + T
Sbjct: 491 IIHNHKGIISPVLINDMEITGDWTMQQLPMDKVPDLAG--KQTATIQNTKVNTSKIATLK 548
Query: 654 GIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDY 713
G P Y+ FD + I +D+ GKG ++NG +IGRYW K G Q T
Sbjct: 549 GQP---VLYQGTFDLKE-IGDTFIDMEKWGKGIVFINGINIGRYW-----KTGPQHTL-- 597
Query: 714 RGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRI 770
++P +L+ +N +VIFE+ EI ++ + ++
Sbjct: 598 ---------------------YIPGPYLKKGSNSIVIFEQLND---EIKTEVSTVKV 630
>gi|330997880|ref|ZP_08321714.1| putative beta-galactosidase [Paraprevotella xylaniphila YIT 11841]
gi|329569484|gb|EGG51254.1| putative beta-galactosidase [Paraprevotella xylaniphila YIT 11841]
Length = 786
Score = 172 bits (437), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 118/367 (32%), Positives = 184/367 (50%), Gaps = 36/367 (9%)
Query: 7 NRALLQCLALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISA 66
NR++ L S ++ L + + A+ + F V ++ +++G ++ +A
Sbjct: 5 NRSISHVLKAS-------LLTAGLFLFTPTEAAAKTETFGVG--NKTFLLNGKPFIIKAA 55
Query: 67 GIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGS 126
+HYPR W I K G + + YVFWN HE G+++F G ND+ +F++L
Sbjct: 56 EVHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQEEGKFDFTGNNDVAEFIRLAQE 115
Query: 127 SGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEM 186
+GLY+ +R GPYVCAEW GG P WL I R + F E + F KK+ + + +
Sbjct: 116 NGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREQDPYFMERYRIFAKKLGEQIGD-- 173
Query: 187 LFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLG-AGVPWVMCK-QTDAP 244
L +GGPIIM+Q+ENEYG SYG+ K YV + G V C ++
Sbjct: 174 LTIEKGGPIIMVQVENEYG----SYGED-KPYVSGIRDIIRDSGFDKVTLFQCDWSSNFT 228
Query: 245 ENIIDACNGYYCDGYKPNSYNK-----------PTLWTENWDGWYTTWGGRLPHRPVEDL 293
+N +D G N N+ P + +E W GW+ WGGR R +++
Sbjct: 229 KNGLDDLVWTMNFGTGANIENEFKKLGELRPESPQMCSEFWSGWFDKWGGRHETRGSKEM 288
Query: 294 AFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPK 348
+ +G SF + YM GGT++G +G P + +TSYDYDAPI+E G ++ PK
Sbjct: 289 VGGLKEMLDKGISF-SLYMTHGGTSWGHWAGANSPGFSPDVTSYDYDAPINEAGQVT-PK 346
Query: 349 WGHLKDL 355
+ L+++
Sbjct: 347 YMELREM 353
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 60/235 (25%), Positives = 102/235 (43%), Gaps = 43/235 (18%)
Query: 539 VTIDSMRDVLRVFINGQLTGSV--IGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFL 596
+TI D +VFING+L GS+ H ++ P + + L +L + +G N+G +
Sbjct: 426 LTITDAHDFAQVFINGKLIGSIDRRNHEKTMLLPAMKEG--DQLDILVEAMGRINFGRAI 483
Query: 597 EKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIP 656
KD G +V+L+ N ++ L +Q+ + Q+ + + ++ L +P
Sbjct: 484 -KDFKGITEKVELSYTMNTGSQVTVNLKNWQIYTLSDSYQV----QKDMKYVPLKDQKVP 538
Query: 657 STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGA 716
Y+ F+ D L+L + GKGQ +VNGH IGR+W +
Sbjct: 539 GC---YRATFNLKKTGD-TFLNLETWGKGQVYVNGHAIGRFWKI---------------- 578
Query: 717 YNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIV 771
P QT Y +P WL+ N +++ + G P E V+ S I+
Sbjct: 579 -----------GPQQTLY-MPGCWLKKGENEIIVQDIVG--PQETVVEGLSKPII 619
>gi|325261840|ref|ZP_08128578.1| glycosyl hydrolase, family 35 [Clostridium sp. D5]
gi|324033294|gb|EGB94571.1| glycosyl hydrolase, family 35 [Clostridium sp. D5]
Length = 581
Score = 172 bits (437), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 108/319 (33%), Positives = 164/319 (51%), Gaps = 29/319 (9%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
ID + +IS G+HY R E W D + K K G + +ETY+ WN HE +G++ F+G
Sbjct: 12 IDNQKVKIISGGVHYFRIMAEYWKDCLLKLKAFGCNTVETYIPWNLHEKEKGEFCFEGNL 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
DI KFV + GLY+ LR PY+CAEW FGG P WL G+ R + PF + ++ +
Sbjct: 72 DITKFVHIAKDLGLYVILRPSPYICAEWEFGGLPYWLLKEDGMRLRCSYKPFLKHVEEYY 131
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYG--NMESSYGQQGKDY-VKWAASMALGLGAG 232
++ +++ L +GGP+IM+Q+ENEYG ++ Y + +D+ V + + L G
Sbjct: 132 HRLFEVIAP--LQYTKGGPVIMMQVENEYGYYGNDTLYLKTLQDFMVSYGCEVPLVTSDG 189
Query: 233 VPWVMCKQTDAPENIIDACN-----GYYCDGYKPNSYNKPTLWTENWDGWYTTWG----- 282
PW E ++ N + NKP + E W GW+ +WG
Sbjct: 190 -PWGDAFDCGKLEGVLQTGNFGSKSRQQLQIMRDKIGNKPLMCMEFWVGWFDSWGQTEHK 248
Query: 283 GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDA 336
P++ E+L + +N YM+ GGTNFG +G +Y +TSYDYDA
Sbjct: 249 QEDPNKNAENLDEILE------SGHVNIYMFMGGTNFGFMNGSNYYDVLTPDVTSYDYDA 302
Query: 337 PIDEYGLLSEPKWGHLKDL 355
+ E G L+ PK+ LK++
Sbjct: 303 LLTEAGDLT-PKYELLKNV 320
>gi|332879232|ref|ZP_08446929.1| putative beta-galactosidase [Capnocytophaga sp. oral taxon 329 str.
F0087]
gi|357048073|ref|ZP_09109651.1| putative beta-galactosidase [Paraprevotella clara YIT 11840]
gi|332682652|gb|EGJ55552.1| putative beta-galactosidase [Capnocytophaga sp. oral taxon 329 str.
F0087]
gi|355529138|gb|EHG98592.1| putative beta-galactosidase [Paraprevotella clara YIT 11840]
Length = 786
Score = 172 bits (437), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 120/372 (32%), Positives = 186/372 (50%), Gaps = 40/372 (10%)
Query: 7 NRALLQCLALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISA 66
NR++ L S ++ L + + A+ + F V ++ +++G ++ +A
Sbjct: 5 NRSISHVLKAS-------LLTAGLFLFTPTEAAAKTETFGVG--NKTFLLNGKPFIIKAA 55
Query: 67 GIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGS 126
+HYPR W I K G + + YVFWN HE G+++F G ND+ +F++L
Sbjct: 56 EVHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQEEGKFDFTGNNDVAEFIRLAQE 115
Query: 127 SGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEM 186
+GLY+ +R GPYVCAEW GG P WL I R + F E + F +K+ + + +
Sbjct: 116 NGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREQDPYFMERYRIFAQKLGEQIGD-- 173
Query: 187 LFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPEN 246
L +GGPIIM+Q+ENEYG SYG+ K YV +A + +G V Q D N
Sbjct: 174 LTIEKGGPIIMVQVENEYG----SYGED-KPYV--SAIRDIIRDSGFDKVTLFQCDWSSN 226
Query: 247 I----IDACNGYYCDGYKPNSYNK-----------PTLWTENWDGWYTTWGGRLPHRPVE 291
+D G N N+ P + +E W GW+ WGGR R +
Sbjct: 227 FTKNGLDDLVWTMNFGTGANIENEFKKLGELRPESPQMCSEFWSGWFDKWGGRHETRGSK 286
Query: 292 DLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSE 346
++ + +G SF + YM GGT++G +G P + +TSYDYDAPI+E G ++
Sbjct: 287 EMVGGLKEMLDKGISF-SLYMTHGGTSWGHWAGANSPGFSPDVTSYDYDAPINEAGQVT- 344
Query: 347 PKWGHLKDLHAA 358
PK+ L+++ A
Sbjct: 345 PKYMELREMLAG 356
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 60/236 (25%), Positives = 102/236 (43%), Gaps = 43/236 (18%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSV--IGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAF 595
+TI D +VFING+L GS+ H ++ P + + L +L + +G N+G
Sbjct: 425 VLTITDAHDFAQVFINGKLIGSIDRRNHEKTMLLPAMKEG--DQLDILVEAMGRINFGRA 482
Query: 596 LEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGI 655
+ KD G +V+L+ N ++ L +Q+ + Q+ + + ++ L +
Sbjct: 483 I-KDFKGITEKVELSYTMNTGSQVTVNLKNWQIYTLSDSYQV----QKDMKYVPLKDQKV 537
Query: 656 PSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRG 715
P Y+ F+ D L+L + GKGQ +VNGH IGR+W +
Sbjct: 538 PGC---YRATFNLKKTGD-TFLNLETWGKGQVYVNGHAIGRFWKI--------------- 578
Query: 716 AYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIV 771
P QT Y +P WL+ N +++ + G P E V+ S I+
Sbjct: 579 ------------GPQQTLY-MPGCWLKKGENEIIVQDIVG--PQETVVEGLSKPII 619
>gi|425056292|ref|ZP_18459750.1| putative beta-galactosidase [Enterococcus faecium 505]
gi|403032128|gb|EJY43702.1| putative beta-galactosidase [Enterococcus faecium 505]
Length = 595
Score = 172 bits (436), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 113/319 (35%), Positives = 157/319 (49%), Gaps = 38/319 (11%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
++DG +IS IHY R P W + K GA+ +ETY+ WN HE G ++F G
Sbjct: 11 LVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGF 70
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
D+V+FVK+ L + LR Y+CAEW FGG P WL P I R+ + F E+++ +
Sbjct: 71 KDVVQFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLKNY 130
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
+ V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + L VP
Sbjct: 131 YQ--VLLPKLAPLQITQGGPVIMMQLENEYG----SYGME-KSYLRQTKELMLAHSIDVP 183
Query: 235 W---------VMCKQTDAPENIIDACNGYYCDGYKPNSY-----------NKPTLWTENW 274
V+ T E+I G + K N+ N P + E W
Sbjct: 184 LFTSDGAWLEVLDAGTLIDEDIF--VTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYW 241
Query: 275 DGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------- 327
DGW+ WG + R E+LA V + G +N YM+ GGTNFG +G
Sbjct: 242 DGWFNRWGEPIITRDPEELATEVKEMLEIGS--LNLYMFHGGTNFGFYNGCSARGNTDLP 299
Query: 328 YITSYDYDAPIDEYGLLSE 346
ITSYDYDA ++E G +E
Sbjct: 300 QITSYDYDALLNEAGQPTE 318
>gi|354581347|ref|ZP_09000251.1| Beta-galactosidase [Paenibacillus lactis 154]
gi|353201675|gb|EHB67128.1| Beta-galactosidase [Paenibacillus lactis 154]
Length = 587
Score = 172 bits (436), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 124/392 (31%), Positives = 184/392 (46%), Gaps = 46/392 (11%)
Query: 63 LISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVK 122
++S IHY R PE W D + K K G + +ETY+ WN HE G++NF G DI F+
Sbjct: 20 ILSGAIHYFRVVPEYWEDRLLKLKACGLNTVETYIPWNWHEPDEGRFNFSGMADIEAFIT 79
Query: 123 LVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLM 182
L G GL++ +R PY+CAEW FGG P WL P ++ R + F +++ + +++
Sbjct: 80 LAGKLGLHVIVRPSPYICAEWEFGGLPAWLLQDPHMQLRCLDPKFLKKVDAYYDELIP-- 137
Query: 183 REEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTD 242
R L S GGPII +QIENEYG SYG Y+++ + G V+ +D
Sbjct: 138 RLVPLLSTNGGPIIAVQIENEYG----SYGNDTA-YLQYLQEALIARGVD---VLLFTSD 189
Query: 243 APEN------IIDACNGYYCDGYKPNSY---------NKPTLWTENWDGWYTTWGGRLPH 287
P + + G +P+ P + E W+GW+ W
Sbjct: 190 GPTDGMLQGGTVPGVTATVNFGSRPSEAFAKLREYRSEDPLMCMEYWNGWFDHWMKPHHT 249
Query: 288 RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAPIDEY 341
R ED A A G S +N+YM+ GGTNFG +G ++ ITSYDYDAP+ E
Sbjct: 250 RDSEDAASVFAEMLALGAS-VNFYMFHGGTNFGFYNGANYHDKYEPTITSYDYDAPLSEC 308
Query: 342 GLLSEPKWGHLKDL---HAAIKLCE-PALVAADSAQYIKLGQNQEAHVYRANRYGSQSNC 397
G ++ K+ ++ + H ++L + PAL D + G Y
Sbjct: 309 GDVTT-KYEAVRQVIAKHQGVELGDLPAL--PDPVRKKAYG------TVSMTSYADLLEN 359
Query: 398 SAFLANIDEH-TAASVTFLGQSYTLPPWSVSI 428
LA+ ++H T + LGQ+Y +S I
Sbjct: 360 LPVLASSEKHRTPVPMELLGQNYGFIVYSTKI 391
>gi|227538632|ref|ZP_03968681.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33300]
gi|227241551|gb|EEI91566.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33300]
Length = 638
Score = 172 bits (436), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 118/330 (35%), Positives = 161/330 (48%), Gaps = 43/330 (13%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
+ DG ++S +HY R + W + K G + + TYVFWN HE G +NF+G
Sbjct: 42 VYDGKTTRILSGEMHYARIPHQYWKHRLQMVKSMGLNTVATYVFWNFHEESPGNWNFEGD 101
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
+D+ F+K G GL++ LR GPY CAEW+FGG+P WL+ I G+E R +NA F E +
Sbjct: 102 HDLAAFIKTAGEVGLHVILRPGPYACAEWDFGGYPWWLQKIDGLEIRRDNAKFLE----Y 157
Query: 175 VKKIVDLMREEM--LFSWQGGPIIMLQIENEYGNMESSYGQQGKD-----YVKWAASMAL 227
KK +D + +E+ L GGPIIM+Q ENE+G SY Q KD + + A +
Sbjct: 158 TKKYIDRLAKEVGSLQITNGGPIIMVQAENEFG----SYVSQRKDIPLEEHKAYNAKIKK 213
Query: 228 GL---GAGVPWVMCK-----QTDAPENIIDACNG--------YYCDGYKPNSYNKPTLWT 271
L G VP + A + NG D Y N+ P +
Sbjct: 214 QLEEAGFNVPLFTSDGSWLFEGGAIPGALPTANGENNISNLKKVVDQY--NNNQGPYMVA 271
Query: 272 ENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY--- 328
E + GW W +A ++ Q SF NYYM GGTNFG TSG +
Sbjct: 272 EFYPGWLDHWAEPFAKVDAGRIARQTEKYLQNDISF-NYYMVHGGTNFGFTSGANYNNKS 330
Query: 329 -----ITSYDYDAPIDEYGLLSEPKWGHLK 353
ITSYDYDAPI E G + PK+ ++
Sbjct: 331 DIQPDITSYDYDAPISEAGWAT-PKYDSIR 359
Score = 43.1 bits (100), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 56/237 (23%), Positives = 100/237 (42%), Gaps = 46/237 (19%)
Query: 539 VTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGY---NDLILLSQTVGLQNYGAF 595
+ ID +RD V+I+G + +G +V + E + L +L + +G NYG+
Sbjct: 435 LKIDGLRDFAVVYIDG----TKVGELNRVFKNYEMDIDIPFNSTLQILVENMGRINYGSE 490
Query: 596 LEKDGAGFRGQVKLTGFK-NGDIDLSKILWTYQVGLKG-EFQQIYSIEENEAEWTDLTRD 653
+ + G V + + GD + ++ L G + I + + N ++ LT
Sbjct: 491 MIHNHKGIISPVLINDMEITGDWTMQQLPMDKVPDLAGKQTAAIQNTKTNASKIAALT-- 548
Query: 654 GIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDY 713
G P Y+ FD + I +D+ GKG ++NG +IGRYW K G Q T
Sbjct: 549 GQP---VLYQGTFDLKE-IGDTFIDMEKWGKGIVFINGINIGRYW-----KTGPQHTL-- 597
Query: 714 RGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRI 770
++P +L+ +N +VIFE+ EI ++ + ++
Sbjct: 598 ---------------------YIPAPYLKKGSNSIVIFEQLND---EIKTEVSTVKV 630
>gi|334338180|ref|YP_004543332.1| glycoside hydrolase family protein [Isoptericola variabilis 225]
gi|334108548|gb|AEG45438.1| glycoside hydrolase family 35 [Isoptericola variabilis 225]
Length = 603
Score = 172 bits (436), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 102/307 (33%), Positives = 150/307 (48%), Gaps = 23/307 (7%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
++DG ++S +HY R P+ W D I K++ G + +ETYV WN H RG ++ G+
Sbjct: 12 LLDGRSLQIVSGALHYFRVHPDQWADRIRKARLLGLNTVETYVAWNVHSPERGVFDTSGR 71
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
D+ +F+ LV + GL+ +R GPY+CAEW GG P WL P + R F E + +
Sbjct: 72 RDLARFLDLVAAEGLHAIVRPGPYICAEWTGGGLPAWLFADPEVGVRRAEPRFLEAIGEY 131
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
++ ++ E + +GGP++M+Q+ENEYG + + Y++ A M G VP
Sbjct: 132 YAALLPIVAERQVT--RGGPVLMVQVENEYGAYGDDPPVERERYLRALADMIRAQGIDVP 189
Query: 235 WVMCKQTD--------APENIIDACNGYYCDG----YKPNSYNKPTLWTENWDGWYTTWG 282
Q + PE + A G + + P + E WDGW+ + G
Sbjct: 190 LFTSDQANDHHLSRGSLPELLTTANFGSRATERLAILRKHQPTGPLMCMEFWDGWFDSAG 249
Query: 283 GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG-------PFYITSYDYD 335
P E A + G S +N YM GGTNFG TSG P TSYDYD
Sbjct: 250 LHHHTTPPEANARDLDDLLAAGAS-VNLYMLHGGTNFGLTSGANDKGVYRPI-TTSYDYD 307
Query: 336 APIDEYG 342
AP+ E+G
Sbjct: 308 APLSEHG 314
>gi|15837442|ref|NP_298130.1| beta-galactosidase [Xylella fastidiosa 9a5c]
gi|9105744|gb|AAF83650.1|AE003923_8 beta-galactosidase [Xylella fastidiosa 9a5c]
Length = 612
Score = 172 bits (436), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 119/350 (34%), Positives = 170/350 (48%), Gaps = 40/350 (11%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
I DG LIS IH+ R W D + K++ G + +ETYVFWN E GQ++F G
Sbjct: 35 FIRDGRPYQLISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVELREGQFDFTG 94
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
NDI FV+ S GL + LR GPYVCAEW GGFP WL P + R+ + F + QR
Sbjct: 95 NNDISAFVREAASQGLNVILRPGPYVCAEWEAGGFPAWLFADPTLRVRSQDPRFLDASQR 154
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+++ + +R L + GGPII +Q+ENEYG SY G D+ A AL + AG+
Sbjct: 155 YLEALGTQVRP--LLNGNGGPIIAVQVENEYG----SY---GDDHGYLQAVRALFIKAGL 205
Query: 234 PWVMCKQTDAPE--------NIIDACNGYYCDGYKPNSYNK--------PTLWTENWDGW 277
+ D + +++ A N G + +K P L E W GW
Sbjct: 206 GGALLFTADGAQMLGNGTLPDVLAAVN--VAPGEAKQALDKLATFHPGQPQLVGEYWAGW 263
Query: 278 YTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF---------- 327
+ WG + A + ++G S +N YM+ GGT+FG +G F
Sbjct: 264 FDQWGKPHAQTDAKQQADEIEWMLRQGHS-INLYMFVGGTSFGFMNGANFQGGPSDHYSP 322
Query: 328 YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKL 377
TSYDYDA +DE G PK+ +D+ + +P + A + ++I L
Sbjct: 323 QTTSYDYDAALDEAG-RPMPKFVLFRDVITRVTGLQPPPLPA-ATRFIDL 370
>gi|218260271|ref|ZP_03475643.1| hypothetical protein PRABACTJOHN_01305, partial [Parabacteroides
johnsonii DSM 18315]
gi|218224641|gb|EEC97291.1| hypothetical protein PRABACTJOHN_01305 [Parabacteroides johnsonii
DSM 18315]
Length = 539
Score = 172 bits (436), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 113/325 (34%), Positives = 162/325 (49%), Gaps = 27/325 (8%)
Query: 51 HRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYN 110
++ ++DG ++ +A IHY R E W I K G + I Y FWN HE G+++
Sbjct: 36 NKTFLLDGKPFVIKAAEIHYTRIPAEYWEHRIQLCKALGMNTICIYAFWNIHEQKPGEFD 95
Query: 111 FKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEE 170
F G+NDI F +L +Y+ LR GPYVC+EW GG P WL I+ RTN+ F E
Sbjct: 96 FSGQNDIAAFCRLAQKYDMYIMLRPGPYVCSEWEMGGLPWWLLKKDDIKLRTNDPYFLER 155
Query: 171 MQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLG 230
+ F+ +I + + + +GG IIM+Q+ENEYG+ + K+Y+ + G G
Sbjct: 156 TKLFMNEIGKQLADLQIT--KGGNIIMVQVENEYGSYATD-----KEYIANIRDIVKGAG 208
Query: 231 -AGVPWVMCK-----QTDAPENIIDACN---GYYCD----GYKPNSYNKPTLWTENWDGW 277
VP C Q +A ++++ N G D K N P + +E W GW
Sbjct: 209 FTDVPLFQCDWSSNFQNNALDDLVWTINFGTGANIDEQFKKLKEVRPNTPLMCSEFWSGW 268
Query: 278 YTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITSY 332
+ WG + R E + + RG SF + YM GGT FG G P Y +SY
Sbjct: 269 FDHWGRKHETRDAETMVSGLKDMLDRGISF-SLYMTHGGTTFGHWGGANSPAYSAMCSSY 327
Query: 333 DYDAPIDEYGLLSEPKWGHLKDLHA 357
DYDAPI E G + PK+ L++L A
Sbjct: 328 DYDAPISEAG-WTTPKYFKLRELLA 351
>gi|315606512|ref|ZP_07881527.1| family 35 glycosyl hydrolase [Prevotella buccae ATCC 33574]
gi|315251918|gb|EFU31892.1| family 35 glycosyl hydrolase [Prevotella buccae ATCC 33574]
Length = 787
Score = 172 bits (435), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 114/327 (34%), Positives = 164/327 (50%), Gaps = 41/327 (12%)
Query: 52 RAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNF 111
+ +++G ++ +A +HYPR W I K G + + YVFWN HE G+++F
Sbjct: 35 KTFLLNGKPFVVKAAELHYPRIPRPYWEHRIKMCKALGMNTVCLYVFWNIHEQQEGRFDF 94
Query: 112 KGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEM 171
G ND+ +F +L +GLY+ +R GPYVCAEW GG P WL I R + F E +
Sbjct: 95 TGNNDVAEFCRLAQRNGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREPDPYFMERV 154
Query: 172 QRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYG--------------QQGKD 217
+ F +K+ + + L GGPIIM+Q+ENEYG SYG Q G D
Sbjct: 155 KLFERKVGEQLAS--LTIQNGGPIIMVQVENEYG----SYGENKAYVSAIRDIVRQSGFD 208
Query: 218 YV-----KWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDG-YKPNSYNKPTLWT 271
V WA++ + W M T A D + G +PN+ P + +
Sbjct: 209 KVTLFQCDWASNFEKNGLDDLVWTMNFGTGA-----DIDQQFRRLGELRPNA---PQMCS 260
Query: 272 ENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY- 328
E W GW+ WG R RP + + + +G SF + YM GGT+FG +G P +
Sbjct: 261 EFWSGWFDKWGARHETRPAKAMVEGIDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFA 319
Query: 329 --ITSYDYDAPIDEYGLLSEPKWGHLK 353
+TSYDYDAPI+EYG + PK+ L+
Sbjct: 320 PDVTSYDYDAPINEYGQAT-PKYWELR 345
Score = 43.5 bits (101), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 51/228 (22%), Positives = 93/228 (40%), Gaps = 50/228 (21%)
Query: 539 VTIDSMRDVLRVFINGQLTGS---VIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAF 595
+T++ D +VF++G+ G V ++ PVE + +L + + +G N+G
Sbjct: 422 LTLNEPHDFAQVFVDGKYIGKIDRVKNEKTLMLPPVEKGT---ELCIRIEAMGRINFGRA 478
Query: 596 LEKDGAGFRGQVKLTGFKNG-DIDLSKILWT-------YQVGLKGEFQQIYSIEENEAEW 647
+ KD G +V ++ +G + + WT Y+ +K + +
Sbjct: 479 I-KDYKGITKEVTISAEMDGHEASWNLKNWTIVPIPDNYETAVKALSVGTETSKRTRQHA 537
Query: 648 TDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGC 707
LT+ G +Y+ +F D L++ + GKGQ +VNGH IGR+W +
Sbjct: 538 KLLTKAG------YYRGHFMLRKPGD-TFLNMEAFGKGQVYVNGHAIGRFWNI------- 583
Query: 708 QDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P QT Y +P WL+ N +++ + G
Sbjct: 584 --------------------GPQQTLY-LPGCWLKQGRNEVIVLDVVG 610
>gi|21224660|ref|NP_630439.1| beta-galactosidase [Streptomyces coelicolor A3(2)]
gi|3367753|emb|CAA20078.1| beta-galactosidase [Streptomyces coelicolor A3(2)]
Length = 595
Score = 172 bits (435), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 120/404 (29%), Positives = 190/404 (47%), Gaps = 31/404 (7%)
Query: 42 FKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNA 101
K +SY ++ +G L++ +HY R P W D + + G + ++TYV WN
Sbjct: 1 MKRSTLSYTDGTLLRNGRPHRLLAGSLHYFRVHPGHWADRLRRLAALGLNAVDTYVPWNF 60
Query: 102 HESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFR 161
HE G F G D+ +F++L GL + +R GPY+CAEW+ GG P WL PG+ R
Sbjct: 61 HERTAGDIRFDGPRDLARFIRLAQEEGLDVVVRPGPYICAEWDNGGLPAWLTGTPGMRLR 120
Query: 162 TNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYV 219
T++ P+ E + R+ +V + E L + +GGP++ +QIENEYG+ + +Y + +D +
Sbjct: 121 TSHGPYLEAVDRWFDALVPRIAE--LQAGRGGPVVAVQIENEYGSYGDDRAYVRHIRDAL 178
Query: 220 KWAASMALGLGAGVPWVMCKQTDA-PENIIDACNGYYCDG----YKPNSYNKPTLWTENW 274
L A P + + A P + A G D + +P E W
Sbjct: 179 VARGITELLYTADGPTPLMQDGGALPGELAAATFGSRPDRAAALLRSRRPAEPFFCAEFW 238
Query: 275 DGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------- 327
+GW+ WG + RP A + GGS ++ YM GGTNFG +G
Sbjct: 239 NGWFDHWGDKHHVRPAPSAAEDLGGILDEGGS-VSLYMAHGGTNFGLWAGANHEGGTIRP 297
Query: 328 YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKL--------CEPALVAADSAQYIKLGQ 379
+TSYD DAPI E G L+ PK+ L+D A+ +P L+A ++
Sbjct: 298 TVTSYDSDAPIAENGALT-PKFFALRDRLTALGTAATRRPLPADPPLLAPRDLPVLR--- 353
Query: 380 NQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPP 423
+A + A R ++ + + +E + AS L ++ L P
Sbjct: 354 --QAALLDALRATAEPVTAPLPLSFEELSLASGLVLYEAEPLLP 395
>gi|288926246|ref|ZP_06420171.1| beta-galactosidase (Lactase) [Prevotella buccae D17]
gi|288336937|gb|EFC75298.1| beta-galactosidase (Lactase) [Prevotella buccae D17]
Length = 791
Score = 172 bits (435), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 118/357 (33%), Positives = 175/357 (49%), Gaps = 46/357 (12%)
Query: 22 MMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDL 81
+ +++ LS VS++ F + + +++G ++ +A +HYPR W
Sbjct: 14 VALLVTAMLSPVSAARKGGTF-----TVGDKTFLLNGKPFVVKAAELHYPRIPRPYWEHR 68
Query: 82 IAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCA 141
I K G + + YVFWN HE G+++F ND+ +F +L +GLY+ +R GPYVCA
Sbjct: 69 IKMCKALGMNTVCLYVFWNIHEQQEGKFDFTDNNDVAEFCRLAQRNGLYVIVRPGPYVCA 128
Query: 142 EWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIE 201
EW GG P WL I R + F E ++ F +K+ + + L GGPIIM+Q+E
Sbjct: 129 EWEMGGLPWWLLKKKDIRLREPDPYFMERVKLFERKVGEQLAS--LTIQNGGPIIMVQVE 186
Query: 202 NEYGNMESSYG--------------QQGKDYV-----KWAASMALGLGAGVPWVMCKQTD 242
NEYG SYG Q G D V WA++ + W M T
Sbjct: 187 NEYG----SYGENKAYVSAIRDIVRQSGFDKVTLFQCDWASNFEKNGLDDLVWTMNFGTG 242
Query: 243 APENIIDACNGYYCDG-YKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFF 301
A D + G +PN+ P + +E W GW+ WG R RP + + +
Sbjct: 243 A-----DIDQQFRRLGELRPNA---PQMCSEFWSGWFDKWGARHETRPAKTMVEGIDEML 294
Query: 302 QRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLK 353
+G SF + YM GGT+FG +G P + +TSYDYDAPI+EYG + PK+ L+
Sbjct: 295 SKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYGQAT-PKYWELR 349
Score = 43.1 bits (100), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 51/232 (21%), Positives = 93/232 (40%), Gaps = 50/232 (21%)
Query: 535 VRPTVTIDSMRDVLRVFINGQLTGS---VIGHWVKVVQPVEFQSGYNDLILLSQTVGLQN 591
V +T++ D +VF++G+ G V ++ PVE + +L + + +G N
Sbjct: 422 VESMLTLNEPHDFAQVFVDGKYIGKIDRVKNEKTLMLPPVEKGA---ELCIRIEAMGRIN 478
Query: 592 YGAFLEKDGAGFRGQVKLTGFKNG-DIDLSKILWT-------YQVGLKGEFQQIYSIEEN 643
+G + KD G +V ++ +G + + WT Y+ +K + +
Sbjct: 479 FGRAI-KDYKGITKEVTISAEMDGHEASWNLKNWTIVPIPDNYETAVKALSVGTETSKRT 537
Query: 644 EAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAP 703
LT+ G +Y+ +F D L++ + KGQ +VNGH IGR+W +
Sbjct: 538 RQHAKLLTKAG------YYRGHFTLRKPGD-TFLNMEAFAKGQVYVNGHAIGRFWNI--- 587
Query: 704 KGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P QT Y +P WL+ N +++ + G
Sbjct: 588 ------------------------GPQQTLY-LPGCWLKQGRNEVIVLDVVG 614
>gi|289768016|ref|ZP_06527394.1| beta-galactosidase [Streptomyces lividans TK24]
gi|289698215|gb|EFD65644.1| beta-galactosidase [Streptomyces lividans TK24]
Length = 595
Score = 172 bits (435), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 120/404 (29%), Positives = 190/404 (47%), Gaps = 31/404 (7%)
Query: 42 FKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNA 101
K +SY ++ +G L++ +HY R P W D + + G + ++TYV WN
Sbjct: 1 MKRSTLSYTDGTLLRNGRPHRLLAGSLHYFRVHPGHWADRLRRLAALGLNAVDTYVPWNF 60
Query: 102 HESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFR 161
HE G F G D+ +F++L GL + +R GPY+CAEW+ GG P WL PG+ R
Sbjct: 61 HERTAGDIRFDGPRDLARFIRLAQEEGLDVVVRPGPYICAEWDNGGLPAWLTGTPGMRLR 120
Query: 162 TNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYV 219
T++ P+ E + R+ +V + E L + +GGP++ +QIENEYG+ + +Y + +D +
Sbjct: 121 TSHGPYLEAVDRWFDALVPRIAE--LQAGRGGPVVAVQIENEYGSYGDDRAYVRHIRDAL 178
Query: 220 KWAASMALGLGAGVPWVMCKQTDA-PENIIDACNGYYCDG----YKPNSYNKPTLWTENW 274
L A P + + A P + A G D + +P E W
Sbjct: 179 VARGITELLYTADGPTPLMQDGGALPGELAAATFGSRPDRAAALLRSRRPAEPFFCAEFW 238
Query: 275 DGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------- 327
+GW+ WG + RP A + GGS ++ YM GGTNFG +G
Sbjct: 239 NGWFDHWGDKHHVRPAPSAAEDLGGILDEGGS-VSLYMAHGGTNFGLWAGANHEGGTIRP 297
Query: 328 YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKL--------CEPALVAADSAQYIKLGQ 379
+TSYD DAPI E G L+ PK+ L+D A+ +P L+A ++
Sbjct: 298 TVTSYDSDAPIAENGALT-PKFFALRDRLTALGTVAARRPLPADPPLLAPRDLPVLR--- 353
Query: 380 NQEAHVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPP 423
+A + A R ++ + + +E + AS L ++ L P
Sbjct: 354 --QAALLDALRATAEPVTAPLPLSFEELSLASGLVLYEAEPLLP 395
>gi|335430223|ref|ZP_08557118.1| beta-galactosidase Bga35A [Haloplasma contractile SSD-17B]
gi|334888639|gb|EGM26936.1| beta-galactosidase Bga35A [Haloplasma contractile SSD-17B]
Length = 587
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 109/326 (33%), Positives = 158/326 (48%), Gaps = 30/326 (9%)
Query: 63 LISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVK 122
+I+ G+HY R + W D + K K G + +ETYV WN HE+ +G Y F G DI F++
Sbjct: 20 IIAGGMHYFRTMKDSWKDRLIKLKAMGCNTVETYVPWNMHEAKKGVYAFNGNLDIKAFIE 79
Query: 123 LVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLM 182
L S L++ +R PY+CAEW FGG P WL PG++ RT PF + ++ + + + ++
Sbjct: 80 LAQSLELFVIVRPSPYICAEWEFGGLPAWLLKDPGMKVRTVYKPFMKHVKEYFEVLFKIL 139
Query: 183 REEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMC---- 238
L Q GPII++QIENEYG Y K+Y+ + G VP V
Sbjct: 140 AP--LQIDQDGPIILMQIENEYG-----YYGNDKEYLSTLLKIMRDFGTTVPVVTSDGPW 192
Query: 239 -KQTDAPENIIDAC---------NGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPH- 287
+ DA + D + + +K NKP + E W GW+ WG H
Sbjct: 193 GEALDAGSLLADVSLPTMNFGTGAKEHIENFKEKYVNKPVMCMEFWVGWFDAWGDDRHHT 252
Query: 288 RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAPIDEY 341
R D A + G +N YM+ GGTNFG +G +TSYDYDA + E
Sbjct: 253 RDASDAANELRDILNEGS--VNIYMFHGGTNFGFMNGANDLEELKPDVTSYDYDAILTEC 310
Query: 342 GLLSEPKWGHLKDLHAAIKLCEPALV 367
G L+E + K + ++ E L+
Sbjct: 311 GDLTEKYYEFKKVISEFTEIKEVELL 336
>gi|420143773|ref|ZP_14651269.1| Beta-galactosidase 3 [Lactococcus garvieae IPLA 31405]
gi|391856250|gb|EIT66791.1| Beta-galactosidase 3 [Lactococcus garvieae IPLA 31405]
Length = 597
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 113/334 (33%), Positives = 160/334 (47%), Gaps = 37/334 (11%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
++D +IS IHY R W D + K GA+ +ETY+ WN HE G ++F+G
Sbjct: 10 FMLDNEPVKIISGAIHYFRIPQSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVFDFEG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
DI FVKL S GL + LR Y+CAEW FGG P WL P + R+ ++ F +++
Sbjct: 70 MKDIRAFVKLAESLGLMVILRPSVYICAEWEFGGLPAWLLKEPEMRLRSTDSRFMTKVEN 129
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ K ++ + + + GGP+IM+Q+ENEYG SYG + K+Y++ ++ G V
Sbjct: 130 YFKVLLPYISPLQITA--GGPVIMMQVENEYG----SYGME-KEYLRQTMALMKKYGINV 182
Query: 234 PWVMCK-----QTDAPENIIDAC------------NGYYCDGYKPNSYNK-PTLWTENWD 275
P DA I D N G+ K P + E WD
Sbjct: 183 PLFTSDGAWQAALDAGSLIEDDVLVTGNFGSRSKENAAVLAGFMKEHGKKWPLMCMEYWD 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSGGPFY 328
GW+ WG + R +DLA V + G +N YM+ GGTNFG R +G
Sbjct: 243 GWFNRWGEPIIKREPQDLADEVKTMLELGS--LNLYMFHGGTNFGFYNGCSARDTGNLPQ 300
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLC 362
ITSYDYDA + E G EP + A ++C
Sbjct: 301 ITSYDYDALLTEAG---EPTAKYYAVQKAIKEVC 331
>gi|251798103|ref|YP_003012834.1| beta-galactosidase [Paenibacillus sp. JDR-2]
gi|247545729|gb|ACT02748.1| Beta-galactosidase [Paenibacillus sp. JDR-2]
Length = 919
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 107/337 (31%), Positives = 173/337 (51%), Gaps = 23/337 (6%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
V Y+ + I+G + L SA IHY R E W +++ K+K G + ++TY WN HE
Sbjct: 18 VQYNAFSYNINGEQVFLNSAAIHYFRMPKEEWREVLVKAKLAGMNCVDTYFAWNVHEPEE 77
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G++NF+G ND F+ L GL++ R GP++CAEW+FGGFP WL ++FR +
Sbjct: 78 GEWNFEGDNDCGAFLDLCHELGLWVIARPGPFICAEWDFGGFPYWLNTKKDMKFRAFDMQ 137
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
+ + R++ +I+ ++R+ + + GG +I++Q+ENEYG + S + +DY+ +
Sbjct: 138 YLTYVDRYMDRIIPIIRDREINA--GGSVILVQVENEYGYLASD--EVARDYMLHLRDVM 193
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSY-----NKPTLWTENWDGWYTTW 281
L G VP + C E ++ N + + N+ + P + TE W GW+ W
Sbjct: 194 LDRGVMVPLITC--VGGAEGTVEGANFWSGADHHYNNLVQKQPDTPKIVTEFWTGWFEHW 251
Query: 282 GGRLPHRPVEDLAFAVARFFQ--RGG-----SFMNYYMYFGGTNFGRTSGGP--FYITSY 332
G P + A R + R G +M + G GRT G F +TSY
Sbjct: 252 GA--PAATQKTAALYEKRMLESLRAGFTGVSHYMFFGGTNFGGYGGRTVGASDIFMVTSY 309
Query: 333 DYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAA 369
DYDAP+ EYG +++ K+ K + ++ E L+ A
Sbjct: 310 DYDAPLSEYGRVTD-KYNTAKRMSYFVQATESVLLNA 345
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 36/135 (26%), Positives = 56/135 (41%), Gaps = 43/135 (31%)
Query: 654 GIPSTFTWYKTYFDAP----DGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQD 709
G+P W+ FD P D + L L M KG W+NG +GRYW V G Q+
Sbjct: 823 GVP---VWHTVQFDKPELPADVNAKLKLRLTGMSKGTLWLNGIDLGRYWQV-----GPQE 874
Query: 710 TCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTR 769
Y +P +WL+ N LV+F+E G +P ++ R
Sbjct: 875 D-----------------------YKIPMAWLKDRNE-LVLFDENGASPSKV-------R 903
Query: 770 IVCEQVSESHYPPVR 784
++ +Q S + ++
Sbjct: 904 LLYDQASSKRWISIQ 918
>gi|328958462|ref|YP_004375848.1| beta-galactosidase [Carnobacterium sp. 17-4]
gi|328674786|gb|AEB30832.1| beta-galactosidase [Carnobacterium sp. 17-4]
Length = 589
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 110/327 (33%), Positives = 165/327 (50%), Gaps = 35/327 (10%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G + S +HY R PE W + K G + +ETY+ WN HE G+Y F G
Sbjct: 10 FLLNGEPFKITSGAVHYFRVLPEDWYHSLYNLKALGFNTVETYIPWNVHEPKEGEYQFSG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+ DI KFV+L GL++ LR PY+CAEW FGG P WL + R+++ F E++ R
Sbjct: 70 QWDIKKFVQLAEELGLFVILRPSPYICAEWEFGGLPAWLLTYKDMLIRSSDPVFIEKVSR 129
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ K+++ + L GGP+IM+Q+ENEYG SYG+ K+Y++ + L LG +
Sbjct: 130 YYKELLKQITP--LQVDHGGPVIMMQLENEYG----SYGED-KEYLRTLYELMLKLGVTI 182
Query: 234 P-------WVMCKQTDAPENIIDACNGYYCDGYKPN---------SYNK--PTLWTENWD 275
P W ++ ++ G + K N S K P + E WD
Sbjct: 183 PIFTSDGAWRATQEAGTMTDLDILTTGNFGSRSKENFKELKEFHESKGKKWPLMCMEYWD 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------- 328
GW+ W + R +L V + G +N YM+ GGTNFG +G
Sbjct: 243 GWFNRWNDPIIKRDALELTQDVKEALEIGS--LNLYMFHGGTNFGFMNGCSARLRKDLPQ 300
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+TSYDYDAP++E G +E K+ LK++
Sbjct: 301 VTSYDYDAPLNEQGNPTE-KYFALKNM 326
Score = 40.4 bits (93), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 51/182 (28%), Positives = 75/182 (41%), Gaps = 47/182 (25%)
Query: 575 SGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDI-DLSKILWTYQVGLKGE 633
SG N L +L + +G NYG L D + G + G + DL I Q L +
Sbjct: 438 SGSNQLDVLVENMGRVNYGHKLLAD-------TQQKGIRRGVMSDLHFITNWEQYSL--D 488
Query: 634 FQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHH 693
F + SI+ ++ EW ++ PS F YK DAP+ +++ GKG VNG +
Sbjct: 489 FSEPLSIDFDK-EW----KENSPS-FYQYKVTIDAPED---TFINMELFGKGIVLVNGFN 539
Query: 694 IGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEE 753
IGR+W V PT + Y P S + N +++FE
Sbjct: 540 IGRFWNV---------------------------GPTLSLY-APMSLFRKGENEIIVFET 571
Query: 754 TG 755
G
Sbjct: 572 EG 573
>gi|423342145|ref|ZP_17319860.1| hypothetical protein HMPREF1077_01290 [Parabacteroides johnsonii
CL02T12C29]
gi|409219016|gb|EKN11981.1| hypothetical protein HMPREF1077_01290 [Parabacteroides johnsonii
CL02T12C29]
Length = 779
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 113/325 (34%), Positives = 162/325 (49%), Gaps = 27/325 (8%)
Query: 51 HRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYN 110
++ ++DG ++ +A IHY R E W I K G + I Y FWN HE G+++
Sbjct: 36 NKTFLLDGKPFVIKAAEIHYTRIPAEYWEHRIQLCKALGMNTICIYAFWNIHEQKPGEFD 95
Query: 111 FKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEE 170
F G+NDI F +L +Y+ LR GPYVC+EW GG P WL I+ RTN+ F E
Sbjct: 96 FSGQNDIAAFCRLAQKYDMYIMLRPGPYVCSEWEMGGLPWWLLKKDDIKLRTNDPYFLER 155
Query: 171 MQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLG 230
+ F+ +I + + + +GG IIM+Q+ENEYG+ + K+Y+ + G G
Sbjct: 156 TKLFMNEIGKQLADLQIT--KGGNIIMVQVENEYGSYATD-----KEYIANIRDIVKGAG 208
Query: 231 -AGVPWVMCK-----QTDAPENIIDACN---GYYCD----GYKPNSYNKPTLWTENWDGW 277
VP C Q +A ++++ N G D K N P + +E W GW
Sbjct: 209 FTDVPLFQCDWSSNFQNNALDDLVWTINFGTGANIDEQFKKLKEVRPNTPLMCSEFWSGW 268
Query: 278 YTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITSY 332
+ WG + R E + + RG SF + YM GGT FG G P Y +SY
Sbjct: 269 FDHWGRKHETRDAETMVSGLKDMLDRGISF-SLYMTHGGTTFGHWGGANSPAYSAMCSSY 327
Query: 333 DYDAPIDEYGLLSEPKWGHLKDLHA 357
DYDAPI E G + PK+ L++L A
Sbjct: 328 DYDAPISEAGWTT-PKYFKLRELLA 351
>gi|298384202|ref|ZP_06993762.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
gi|383123627|ref|ZP_09944306.1| hypothetical protein BSIG_3219 [Bacteroides sp. 1_1_6]
gi|251839745|gb|EES67828.1| hypothetical protein BSIG_3219 [Bacteroides sp. 1_1_6]
gi|298262481|gb|EFI05345.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
Length = 624
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 113/334 (33%), Positives = 163/334 (48%), Gaps = 53/334 (15%)
Query: 58 GNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDI 117
G ++S +HY R + W + K G + + TYVFWN HE G+++F G ++
Sbjct: 35 GEEIPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94
Query: 118 VKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKK 177
+++++ G G+ + LR GPYVCAEW FGG+P WL++IPG+E R +N E ++ KK
Sbjct: 95 AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNT----EFLKYTKK 150
Query: 178 IVDLMREEM--LFSWQGGPIIMLQIENEYGNMESSYGQQGKD--------YVKWAASMAL 227
+D + EE+ L +GGPIIM+Q ENE+G SY Q KD Y
Sbjct: 151 YIDRLYEEVGDLQCTKGGPIIMVQCENEFG----SYVSQRKDIPLEEHRSYNAKIKGQLA 206
Query: 228 GLGAGVP-------WVM---CKQTDAP--------ENIIDACNGYYCDGYKPNSYNKPTL 269
G +P W+ C P N+ N Y+ D P +
Sbjct: 207 DAGFTIPLFTSDGSWLFEGGCVAGALPTANGESDIANLKKVVNQYHGD-------KGPYM 259
Query: 270 WTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-- 327
E + GW + WG P ++A + Q SF N+YM GGTNFG TSG +
Sbjct: 260 VAEFYSGWLSHWGEPFPQVSASEIARQTEAYLQNDVSF-NFYMVHGGTNFGFTSGANYDK 318
Query: 328 ------YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+TSYDYDAPI E G L+ PK+ ++ +
Sbjct: 319 KRDIQPDLTSYDYDAPISEAGWLT-PKYDSIRSV 351
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 35/82 (42%), Gaps = 28/82 (34%)
Query: 677 LDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHV 736
+D+ + GKG ++NG HIGRYW V P QT Y +
Sbjct: 555 IDMRAWGKGIIFINGKHIGRYWKV---------------------------GPQQTLY-I 586
Query: 737 PRSWLQASNNLLVIFEETGGNP 758
P WL+ N +VIFE+ P
Sbjct: 587 PGVWLRKGKNKIVIFEQLNEIP 608
>gi|29376389|ref|NP_815543.1| glycosyl hydrolase [Enterococcus faecalis V583]
gi|227519038|ref|ZP_03949087.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|227553661|ref|ZP_03983710.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|256961654|ref|ZP_05565825.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|293383358|ref|ZP_06629271.1| beta-galactosidase [Enterococcus faecalis R712]
gi|293388990|ref|ZP_06633475.1| beta-galactosidase [Enterococcus faecalis S613]
gi|312907816|ref|ZP_07766806.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|312910433|ref|ZP_07769280.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|422714340|ref|ZP_16771066.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|422715597|ref|ZP_16772313.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|424676484|ref|ZP_18113355.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|424681702|ref|ZP_18118489.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|424685588|ref|ZP_18122282.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|424686206|ref|ZP_18122874.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|424690524|ref|ZP_18127059.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|424694932|ref|ZP_18131318.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|424696643|ref|ZP_18132984.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|424700339|ref|ZP_18136532.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|424703758|ref|ZP_18139884.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|424712611|ref|ZP_18144783.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|424718249|ref|ZP_18147501.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|424721894|ref|ZP_18150963.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|424723972|ref|ZP_18152924.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|424733572|ref|ZP_18162127.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|424741709|ref|ZP_18170052.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|424751990|ref|ZP_18179997.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
gi|29343852|gb|AAO81613.1| glycosyl hydrolase, family 35 [Enterococcus faecalis V583]
gi|227073538|gb|EEI11501.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|227177203|gb|EEI58175.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|256952150|gb|EEU68782.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|291079149|gb|EFE16513.1| beta-galactosidase [Enterococcus faecalis R712]
gi|291081771|gb|EFE18734.1| beta-galactosidase [Enterococcus faecalis S613]
gi|310626177|gb|EFQ09460.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|311289706|gb|EFQ68262.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|315575942|gb|EFU88133.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|315580774|gb|EFU92965.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|402350621|gb|EJU85522.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|402356496|gb|EJU91227.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|402358329|gb|EJU93003.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|402364102|gb|EJU98549.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|402367740|gb|EJV02077.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|402369105|gb|EJV03397.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|402374029|gb|EJV08075.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|402377412|gb|EJV11319.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|402379869|gb|EJV13650.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|402382152|gb|EJV15835.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|402384002|gb|EJV17579.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|402390099|gb|EJV23464.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|402391584|gb|EJV24885.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|402396442|gb|EJV29504.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|402401146|gb|EJV33935.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|402404973|gb|EJV37581.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
Length = 611
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 121/343 (35%), Positives = 169/343 (49%), Gaps = 43/343 (12%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
++DG LIS IHY R TP W D + K GA+ IETY+ WN HE + G Y+F+G
Sbjct: 11 LVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEGM 70
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
DIV FV L GL + LR Y+CAEW FGG P WL + R+ + F +++ +
Sbjct: 71 KDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWLLK-EHVRLRSTDPRFIAKVRTY 129
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
V L + L GGP+IM+Q+ENEYG SYG + K+Y++ + G VP
Sbjct: 130 FS--VLLPKLVPLQVTHGGPVIMMQVENEYG----SYGME-KEYLRQTKQVMEEFGIDVP 182
Query: 235 WVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTENW 274
+ A E ++D G + K N+ ++K P + E W
Sbjct: 183 --LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEYW 240
Query: 275 DGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ 328
DGW+ WG + R +DLA V G +N YM+ GGTNFG +G
Sbjct: 241 DGWFNRWGEPIIKRDGQDLANEVKDMLALGS--LNLYMFHGGTNFGFYNGCSARGVLDLP 298
Query: 329 -ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAAD 370
+TSYDYDA + E G +E K+ H++ AIK P + A+
Sbjct: 299 QVTSYDYDALLTEAGEPTE-KYFHVQ---RAIKEVCPEVWQAE 337
Score = 40.0 bits (92), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 45/180 (25%), Positives = 70/180 (38%), Gaps = 49/180 (27%)
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG---DIDLSKILWTYQVGLKGEFQ 635
+L +L + +G NYG L G ++ G + G DI + G Q
Sbjct: 443 ELDVLVENLGRVNYGFKL-------NGPTQVKGIRGGIMQDIHFHQ----------GYRQ 485
Query: 636 QIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
++ ++ + D T P+ ++Y+ F D D +D S GKG VNG ++G
Sbjct: 486 YALTLSADQLKKIDYTAGKNPAQPSFYQAEFTLTDLADTF-IDCRSYGKGVVIVNGINLG 544
Query: 696 RYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
RYW RG +S C P+ +L+ N +VIFE G
Sbjct: 545 RYWQ--------------RGPIHSLYC--------------PKEFLKKGTNEIVIFETEG 576
>gi|421514041|ref|ZP_15960756.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
gi|401672838|gb|EJS79281.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
Length = 611
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 121/343 (35%), Positives = 169/343 (49%), Gaps = 43/343 (12%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
++DG LIS IHY R TP W D + K GA+ IETY+ WN HE + G Y+F+G
Sbjct: 11 LVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEGM 70
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
DIV FV L GL + LR Y+CAEW FGG P WL + R+ + F +++ +
Sbjct: 71 KDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWLLK-EHVRLRSTDPRFIAKVRTY 129
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
V L + L GGP+IM+Q+ENEYG SYG + K+Y++ + G VP
Sbjct: 130 FS--VLLPKLVPLQVTHGGPVIMMQVENEYG----SYGME-KEYLRQTKQVMEEFGIDVP 182
Query: 235 WVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTENW 274
+ A E ++D G + K N+ ++K P + E W
Sbjct: 183 --LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEYW 240
Query: 275 DGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------- 327
DGW+ WG + R +DLA V G +N YM+ GGTNFG +G
Sbjct: 241 DGWFNRWGEPIIKRDGQDLANEVKDMLALGS--LNLYMFHGGTNFGFYNGCSARGVLDLP 298
Query: 328 YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAAD 370
+TSYDYDA + E G +E K+ H++ AIK P + A+
Sbjct: 299 QVTSYDYDALLTEAGEPTE-KYFHVQ---RAIKEVCPEVWQAE 337
Score = 40.0 bits (92), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 45/180 (25%), Positives = 70/180 (38%), Gaps = 49/180 (27%)
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG---DIDLSKILWTYQVGLKGEFQ 635
+L +L + +G NYG L G ++ G + G DI + G Q
Sbjct: 443 ELDVLVENLGRVNYGFKL-------NGPTQVKGIRGGIMQDIHFHQ----------GYRQ 485
Query: 636 QIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
++ ++ + D T P+ ++Y+ F D D +D S GKG VNG ++G
Sbjct: 486 YALTLSADQLKKIDYTAGKNPAQPSFYQAEFTLTDLADTF-IDCRSYGKGVVIVNGINLG 544
Query: 696 RYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
RYW RG +S C P+ +L+ N +VIFE G
Sbjct: 545 RYWQ--------------RGPIHSLYC--------------PKEFLKKGTNEIVIFETEG 576
>gi|229545563|ref|ZP_04434288.1| possible beta-galactosidase [Enterococcus faecalis TX1322]
gi|256619317|ref|ZP_05476163.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|256853375|ref|ZP_05558745.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
gi|256964870|ref|ZP_05569041.1| beta-galactosidase [Enterococcus faecalis HIP11704]
gi|257090147|ref|ZP_05584508.1| beta-galactosidase [Enterococcus faecalis CH188]
gi|294614275|ref|ZP_06694194.1| glycosyl hydrolase, family 35 [Enterococcus faecium E1636]
gi|307272958|ref|ZP_07554205.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
gi|307277803|ref|ZP_07558888.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|307291733|ref|ZP_07571605.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
gi|384518848|ref|YP_005706153.1| beta-galactosidase [Enterococcus faecalis 62]
gi|422685728|ref|ZP_16743941.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
gi|422689100|ref|ZP_16747212.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
gi|422720655|ref|ZP_16777264.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|422731066|ref|ZP_16787446.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
gi|422739263|ref|ZP_16794446.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
gi|430849460|ref|ZP_19467237.1| glycosyl hydrolase [Enterococcus faecium E1185]
gi|229309303|gb|EEN75290.1| possible beta-galactosidase [Enterococcus faecalis TX1322]
gi|256598844|gb|EEU18020.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|256711834|gb|EEU26872.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
gi|256955366|gb|EEU71998.1| beta-galactosidase [Enterococcus faecalis HIP11704]
gi|256998959|gb|EEU85479.1| beta-galactosidase [Enterococcus faecalis CH188]
gi|291592934|gb|EFF24524.1| glycosyl hydrolase, family 35 [Enterococcus faecium E1636]
gi|306497185|gb|EFM66730.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
gi|306505543|gb|EFM74728.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|306510572|gb|EFM79595.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
gi|315029440|gb|EFT41372.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
gi|315032046|gb|EFT43978.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|315144925|gb|EFT88941.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
gi|315162898|gb|EFU06915.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
gi|315577862|gb|EFU90053.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
gi|323480981|gb|ADX80420.1| beta-galactosidase [Enterococcus faecalis 62]
gi|430537598|gb|ELA77922.1| glycosyl hydrolase [Enterococcus faecium E1185]
Length = 611
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 121/343 (35%), Positives = 169/343 (49%), Gaps = 43/343 (12%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
++DG LIS IHY R TP W D + K GA+ IETY+ WN HE + G Y+F+G
Sbjct: 11 LVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEGM 70
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
DIV FV L GL + LR Y+CAEW FGG P WL + R+ + F +++ +
Sbjct: 71 KDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWLLK-EHVRLRSTDPRFIAKVRTY 129
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
V L + L GGP+IM+Q+ENEYG SYG + K+Y++ + G VP
Sbjct: 130 FS--VLLPKLVPLQVTHGGPVIMMQVENEYG----SYGME-KEYLRQTKQVMEEFGIDVP 182
Query: 235 WVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTENW 274
+ A E ++D G + K N+ ++K P + E W
Sbjct: 183 --LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEYW 240
Query: 275 DGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------- 327
DGW+ WG + R +DLA V G +N YM+ GGTNFG +G
Sbjct: 241 DGWFNRWGEPIIKRDGQDLANEVKDMLALGS--LNLYMFHGGTNFGFYNGCSARGVLDLP 298
Query: 328 YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAAD 370
+TSYDYDA + E G +E K+ H++ AIK P + A+
Sbjct: 299 QVTSYDYDALLTEAGEPTE-KYFHVQ---RAIKEVCPEVWQAE 337
Score = 40.0 bits (92), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 45/180 (25%), Positives = 70/180 (38%), Gaps = 49/180 (27%)
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG---DIDLSKILWTYQVGLKGEFQ 635
+L +L + +G NYG L G ++ G + G DI + G Q
Sbjct: 443 ELDVLVENLGRVNYGFKL-------NGPTQVKGIRGGIMQDIHFHQ----------GYRQ 485
Query: 636 QIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
++ ++ + D T P+ ++Y+ F D D +D S GKG VNG ++G
Sbjct: 486 YALTLSADQLKKIDYTAGKNPAQPSFYQAEFTLTDLADTF-IDCRSYGKGVVIVNGINLG 544
Query: 696 RYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
RYW RG +S C P+ +L+ N +VIFE G
Sbjct: 545 RYWQ--------------RGPIHSLYC--------------PKEFLKKGTNEIVIFETEG 576
>gi|312903586|ref|ZP_07762766.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
gi|310633462|gb|EFQ16745.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
Length = 611
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 121/343 (35%), Positives = 169/343 (49%), Gaps = 43/343 (12%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
++DG LIS IHY R TP W D + K GA+ IETY+ WN HE + G Y+F+G
Sbjct: 11 LVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEGM 70
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
DIV FV L GL + LR Y+CAEW FGG P WL + R+ + F +++ +
Sbjct: 71 KDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWLLK-EHVRLRSTDPRFIAKVRTY 129
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
V L + L GGP+IM+Q+ENEYG SYG + K+Y++ + G VP
Sbjct: 130 FS--VLLPKLVPLQVTHGGPVIMMQVENEYG----SYGME-KEYLRQTKQVMEEFGIDVP 182
Query: 235 WVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTENW 274
+ A E ++D G + K N+ ++K P + E W
Sbjct: 183 --LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEYW 240
Query: 275 DGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------- 327
DGW+ WG + R +DLA V G +N YM+ GGTNFG +G
Sbjct: 241 DGWFNRWGEPIIKRDGQDLANEVKDMLALGS--LNLYMFHGGTNFGFYNGCSARGVLDLP 298
Query: 328 YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAAD 370
+TSYDYDA + E G +E K+ H++ AIK P + A+
Sbjct: 299 QVTSYDYDALLTEAGEPTE-KYFHVQ---RAIKEVCPEVWQAE 337
>gi|164519028|ref|NP_001106794.1| beta-galactosidase-1-like protein 3 precursor [Mus musculus]
Length = 662
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 106/325 (32%), Positives = 167/325 (51%), Gaps = 37/325 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G++ M++ IHY R E W D + K + G + + TY+ WN HE RG+++F
Sbjct: 71 LEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEIL 130
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF- 174
D+ +V L + GL++ LR GPY+CAE + GG P WL P + RT N F E + ++
Sbjct: 131 DLEAYVLLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDKYF 190
Query: 175 ---VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGA 231
+ KI+ L GGP+I +Q+ENEYG+ Q+ ++Y+ + L
Sbjct: 191 DHLIPKILPLQYR------HGGPVIAVQVENEYGSF-----QKDRNYMNYLKKAL--LKR 237
Query: 232 GVPWVMCKQTDAPENIIDACNGYYC----DGYKPNSY--------NKPTLWTENWDGWYT 279
G+ ++ D I + NG + + +S+ +KP + E W GWY
Sbjct: 238 GIVELLLTSDDKDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYD 297
Query: 280 TWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------YITSYD 333
+WG + + E++ V +F G SF N YM+ GGTNFG +GG + +TSYD
Sbjct: 298 SWGSKHIEKSAEEIRHTVYKFISYGLSF-NMYMFHGGTNFGFINGGRYENHHISVVTSYD 356
Query: 334 YDAPIDEYGLLSEPKWGHLKDLHAA 358
YDA + E G +E K+ L+ L A+
Sbjct: 357 YDAVLSEAGDYTE-KYFKLRKLFAS 380
>gi|307275710|ref|ZP_07556850.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
gi|306507586|gb|EFM76716.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
Length = 611
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 121/343 (35%), Positives = 169/343 (49%), Gaps = 43/343 (12%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
++DG LIS IHY R TP W D + K GA+ IETY+ WN HE + G Y+F+G
Sbjct: 11 LVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEGM 70
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
DIV FV L GL + LR Y+CAEW FGG P WL + R+ + F +++ +
Sbjct: 71 KDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWLLK-EHVRLRSTDPRFIAKVRTY 129
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
V L + L GGP+IM+Q+ENEYG SYG + K+Y++ + G VP
Sbjct: 130 FS--VLLPKLVPLQVTHGGPVIMMQVENEYG----SYGME-KEYLRQTKQVMEEFGIDVP 182
Query: 235 WVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTENW 274
+ A E ++D G + K N+ ++K P + E W
Sbjct: 183 --LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEYW 240
Query: 275 DGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------- 327
DGW+ WG + R +DLA V G +N YM+ GGTNFG +G
Sbjct: 241 DGWFNRWGEPIIKRDGQDLANEVKDMLALGS--LNLYMFHGGTNFGFYNGCSARGVLDLP 298
Query: 328 YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAAD 370
+TSYDYDA + E G +E K+ H++ AIK P + A+
Sbjct: 299 QVTSYDYDALLTEAGEPTE-KYFHVQ---RAIKEVCPEVWQAE 337
Score = 40.0 bits (92), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 45/180 (25%), Positives = 70/180 (38%), Gaps = 49/180 (27%)
Query: 579 DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG---DIDLSKILWTYQVGLKGEFQ 635
+L +L + +G NYG L G ++ G + G DI + G Q
Sbjct: 443 ELDVLVENLGRVNYGFKL-------NGPTQVKGIRGGIMQDIHFHQ----------GYRQ 485
Query: 636 QIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIG 695
++ ++ + D T P+ ++Y+ F D D +D S GKG VNG ++G
Sbjct: 486 YALTLSADQLKKIDYTAGKNPAQPSFYQAEFTLTDLADTF-IDCRSYGKGVVIVNGINLG 544
Query: 696 RYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
RYW RG +S C P+ +L+ N +VIFE G
Sbjct: 545 RYWQ--------------RGPIHSLYC--------------PKEFLKKGTNEIVIFETEG 576
>gi|269794634|ref|YP_003314089.1| beta-galactosidase [Sanguibacter keddieii DSM 10542]
gi|269096819|gb|ACZ21255.1| beta-galactosidase [Sanguibacter keddieii DSM 10542]
Length = 586
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 104/313 (33%), Positives = 156/313 (49%), Gaps = 32/313 (10%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
++DG ++S +HY R P++W D I K++ G + IETYV WNAH RG++ G
Sbjct: 9 LLDGKPFRILSGALHYFRVHPDLWADRIHKARLMGLNTIETYVPWNAHAPQRGEFRTDGA 68
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
D+ +F++LV + G+ +R GPY+CAEW+ GG P WL P + R + + E + +
Sbjct: 69 LDLERFLRLVEAEGMLAIVRPGPYICAEWDNGGLPGWLFRDPAVGVRRDEPLYMEAVSEY 128
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
+ ++DL+ + +GGP++++Q+ENE YG G D+V MAL G+
Sbjct: 129 LGTVLDLVAPFQVD--RGGPVVLVQVENE-------YGAYGSDHVYLEKLMALTRSHGI- 178
Query: 235 WVMCKQTDAPENIIDA---CNGYYCDG------------YKPNSYNKPTLWTENWDGWYT 279
V D P + A +G + G + + P + E WDGW+
Sbjct: 179 TVPLTSIDQPSGTMLADGSIDGLHRTGSFGSRSAERLATLREHQPTGPLMCAEFWDGWFD 238
Query: 280 TWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG------PFYITSYD 333
WG +D A + G S +N YM+ GGTNFG TSG TSYD
Sbjct: 239 HWGAHHHTTSAQDAARELDELLAAGAS-VNIYMFHGGTNFGFTSGANDKGVYQPTTTSYD 297
Query: 334 YDAPIDEYGLLSE 346
YDAP+ E G +E
Sbjct: 298 YDAPLAEDGYPTE 310
>gi|143955283|sp|A2RSQ1.1|GLBL3_MOUSE RecName: Full=Beta-galactosidase-1-like protein 3
gi|124297651|gb|AAI32201.1| Glb1l3 protein [Mus musculus]
gi|124297899|gb|AAI32203.1| Glb1l3 protein [Mus musculus]
Length = 649
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 106/325 (32%), Positives = 167/325 (51%), Gaps = 37/325 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G++ M++ IHY R E W D + K + G + + TY+ WN HE RG+++F
Sbjct: 58 LEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEIL 117
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF- 174
D+ +V L + GL++ LR GPY+CAE + GG P WL P + RT N F E + ++
Sbjct: 118 DLEAYVLLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDKYF 177
Query: 175 ---VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGA 231
+ KI+ L GGP+I +Q+ENEYG+ Q+ ++Y+ + L
Sbjct: 178 DHLIPKILPLQYR------HGGPVIAVQVENEYGSF-----QKDRNYMNYLKKAL--LKR 224
Query: 232 GVPWVMCKQTDAPENIIDACNGYYC----DGYKPNSY--------NKPTLWTENWDGWYT 279
G+ ++ D I + NG + + +S+ +KP + E W GWY
Sbjct: 225 GIVELLLTSDDKDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYD 284
Query: 280 TWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------YITSYD 333
+WG + + E++ V +F G SF N YM+ GGTNFG +GG + +TSYD
Sbjct: 285 SWGSKHIEKSAEEIRHTVYKFISYGLSF-NMYMFHGGTNFGFINGGRYENHHISVVTSYD 343
Query: 334 YDAPIDEYGLLSEPKWGHLKDLHAA 358
YDA + E G +E K+ L+ L A+
Sbjct: 344 YDAVLSEAGDYTE-KYFKLRKLFAS 367
>gi|148693363|gb|EDL25310.1| mCG125130, isoform CRA_b [Mus musculus]
Length = 688
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 106/325 (32%), Positives = 167/325 (51%), Gaps = 37/325 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G++ M++ IHY R E W D + K + G + + TY+ WN HE RG+++F
Sbjct: 97 LEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEIL 156
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF- 174
D+ +V L + GL++ LR GPY+CAE + GG P WL P + RT N F E + ++
Sbjct: 157 DLEAYVLLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDKYF 216
Query: 175 ---VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGA 231
+ KI+ L GGP+I +Q+ENEYG+ Q+ ++Y+ + L
Sbjct: 217 DHLIPKILPLQYR------HGGPVIAVQVENEYGSF-----QKDRNYMNYLKKAL--LKR 263
Query: 232 GVPWVMCKQTDAPENIIDACNGYYC----DGYKPNSY--------NKPTLWTENWDGWYT 279
G+ ++ D I + NG + + +S+ +KP + E W GWY
Sbjct: 264 GIVELLLTSDDKDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYD 323
Query: 280 TWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------YITSYD 333
+WG + + E++ V +F G SF N YM+ GGTNFG +GG + +TSYD
Sbjct: 324 SWGSKHIEKSAEEIRHTVYKFISYGLSF-NMYMFHGGTNFGFINGGRYENHHISVVTSYD 382
Query: 334 YDAPIDEYGLLSEPKWGHLKDLHAA 358
YDA + E G +E K+ L+ L A+
Sbjct: 383 YDAVLSEAGDYTE-KYFKLRKLFAS 406
>gi|395520729|ref|XP_003764476.1| PREDICTED: beta-galactosidase-1-like protein 2 [Sarcophilus
harrisii]
Length = 704
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 164/324 (50%), Gaps = 29/324 (8%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G+ + IHY R E W D + K K G + + TY+ WN HE RG++NF G
Sbjct: 122 FLLEGSHFQIFGGSIHYFRVPREYWRDRLLKLKACGLNTLTTYIPWNLHEPERGKFNFSG 181
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ FV++ GL++ LR GPY+C+EW+ GG P WL +E RT A F + + R
Sbjct: 182 NLDVEAFVQMAADIGLWVILRPGPYICSEWDLGGLPSWLLQDSSMELRTTYAGFLKAVDR 241
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ ++ R L QGGPII +Q+ENEYG+ + + +Y+ + + G
Sbjct: 242 YFNHLIP--RVVPLQYKQGGPIIAVQVENEYGSYD-----KDSNYMPYIKKALMSRGINE 294
Query: 234 PWVMCKQTDA-----PENIIDACNGYYCDGYKPN-----SYNKPTLWTENWDGWYTTWGG 283
+ D E ++ N + D N NKPT+ TE W GW+ TWGG
Sbjct: 295 LLMTSDNKDGLSGGYLEGVLATVNLKHVDSMIFNYLHSFQENKPTMVTEYWTGWFDTWGG 354
Query: 284 RLPHR--PVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYD 335
PH +D+ V+ Q G S +N YM+ GGTNFG +G + +TSYDYD
Sbjct: 355 --PHNIVDADDVVVTVSSIIQMGAS-LNLYMFHGGTNFGFMNGAQHFGEYLADVTSYDYD 411
Query: 336 APIDEYGLLSEPKWGHLKDLHAAI 359
A + E G + PK+ L++ + I
Sbjct: 412 AILTEAGDYT-PKFFKLREFFSTI 434
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 54/218 (24%), Positives = 89/218 (40%), Gaps = 54/218 (24%)
Query: 542 DSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGA 601
D +RD +VF+N GS + + ++ + E+Q G+ L +L + G NYG L +
Sbjct: 513 DHIRDRAQVFVNRIYVGS-MDYEIEGLPIPEYQ-GHRKLSILVENRGRVNYGQKLNEQRK 570
Query: 602 GFRGQVKL--TGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTF 659
G G + L + +N I Y + +K F Q S +W + + F
Sbjct: 571 GLIGDIYLNESPLRNFKI--------YSLEMKENFFQSLS----SIKWNQVPEEATGPAF 618
Query: 660 TWYKTYFDAPDGIDPVALD----LGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRG 715
F ID + LD L KG ++NG ++GR+W + G Q+T
Sbjct: 619 ------FRGTLHIDSIVLDTFLKLEGWFKGVVFINGQNLGRFWNI-----GPQETL---- 663
Query: 716 AYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEE 753
++P WL+ NN +++FEE
Sbjct: 664 -------------------YLPGPWLRPGNNEIIVFEE 682
>gi|227552575|ref|ZP_03982624.1| possible beta-galactosidase [Enterococcus faecium TX1330]
gi|257896912|ref|ZP_05676565.1| glycosyl hydrolase [Enterococcus faecium Com12]
gi|293379016|ref|ZP_06625170.1| glycosyl hydrolase family 35 [Enterococcus faecium PC4.1]
gi|431750982|ref|ZP_19539676.1| beta-galactosidase [Enterococcus faecium E2620]
gi|227178324|gb|EEI59296.1| possible beta-galactosidase [Enterococcus faecium TX1330]
gi|257833477|gb|EEV59898.1| glycosyl hydrolase [Enterococcus faecium Com12]
gi|292642358|gb|EFF60514.1| glycosyl hydrolase family 35 [Enterococcus faecium PC4.1]
gi|430616240|gb|ELB53164.1| beta-galactosidase [Enterococcus faecium E2620]
Length = 595
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 112/319 (35%), Positives = 157/319 (49%), Gaps = 38/319 (11%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
++DG +IS IHY R P W + K GA+ +ETY+ WN HE G ++F G
Sbjct: 11 LVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGF 70
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
++V+FVK+ L + LR Y+CAEW FGG P WL P I R+ + F E+++ +
Sbjct: 71 KNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLKNY 130
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
+ V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + L VP
Sbjct: 131 YQ--VLLPKLAPLQITQGGPVIMMQLENEYG----SYGME-KSYLRQTKELMLAHSIDVP 183
Query: 235 W---------VMCKQTDAPENIIDACNGYYCDGYKPNSY-----------NKPTLWTENW 274
V+ T E+I G + K N+ N P + E W
Sbjct: 184 LFTSDGAWLEVLDAGTLIDEDIF--VTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYW 241
Query: 275 DGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------- 327
DGW+ WG + R E+LA V + G +N YM+ GGTNFG +G
Sbjct: 242 DGWFNRWGEPIITRDPEELATEVKEMLEIGS--LNLYMFHGGTNFGFYNGCSARGNTDLP 299
Query: 328 YITSYDYDAPIDEYGLLSE 346
ITSYDYDA ++E G +E
Sbjct: 300 QITSYDYDALLNEAGQPTE 318
>gi|359496328|ref|XP_003635211.1| PREDICTED: beta-galactosidase 6-like [Vitis vinifera]
gi|296080974|emb|CBI18606.3| unnamed protein product [Vitis vinifera]
Length = 198
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 84/169 (49%), Positives = 109/169 (64%), Gaps = 8/169 (4%)
Query: 682 MGKGQAWVNGHHIGRYWTV-VAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSW 740
MGKGQAWVNG IGRYW +AP GC CDYRGAY++ KC NCG P QT YH+PR+W
Sbjct: 1 MGKGQAWVNGQSIGRYWPAYLAPSTGCTTNCDYRGAYDASKCLRNCGQPAQTLYHIPRTW 60
Query: 741 LQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINK 800
+ + NLLV+ EE GG+P +IS+ R+ + VC VSE+ PP W + L
Sbjct: 61 VHSGKNLLVLHEELGGDPSKISLLTRTGQEVCAHVSEADPPPADSWQPN------LEFMS 114
Query: 801 MAPEMHLHCQDGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+ ++ L C+ G+ IS I FAS+GTP+G C F+ GNCHA + LSVV +
Sbjct: 115 QSSQVRLTCEQGWHISMINFASFGTPRGHCGTFNPGNCHANV-LSVVQQ 162
>gi|431741495|ref|ZP_19530400.1| beta-galactosidase [Enterococcus faecium E2039]
gi|430601673|gb|ELB39267.1| beta-galactosidase [Enterococcus faecium E2039]
Length = 595
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 112/319 (35%), Positives = 157/319 (49%), Gaps = 38/319 (11%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
++DG +IS IHY R P W + K GA+ +ETY+ WN HE G ++F G
Sbjct: 11 LVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGF 70
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
++V+FVK+ L + LR Y+CAEW FGG P WL P I R+ + F E+++ +
Sbjct: 71 KNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDIRVRSTDPRFMEKLKNY 130
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
+ V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + L VP
Sbjct: 131 YQ--VLLPKLAPLQITQGGPVIMMQLENEYG----SYGME-KSYLRQTKELMLAHSIDVP 183
Query: 235 W---------VMCKQTDAPENIIDACNGYYCDGYKPNSY-----------NKPTLWTENW 274
V+ T E+I G + K N+ N P + E W
Sbjct: 184 LFTSDGAWLEVLDAGTLIDEDIF--VTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYW 241
Query: 275 DGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------- 327
DGW+ WG + R E+LA V + G +N YM+ GGTNFG +G
Sbjct: 242 DGWFNRWGEPIITRDPEELATEVKEMLEIGS--LNLYMFHGGTNFGFYNGCSARGNTDLP 299
Query: 328 YITSYDYDAPIDEYGLLSE 346
ITSYDYDA ++E G +E
Sbjct: 300 QITSYDYDALLNEAGQPTE 318
>gi|257888197|ref|ZP_05667850.1| glycosyl hydrolase [Enterococcus faecium 1,141,733]
gi|431040248|ref|ZP_19492755.1| beta-galactosidase [Enterococcus faecium E1590]
gi|431763679|ref|ZP_19552228.1| beta-galactosidase [Enterococcus faecium E3548]
gi|257824251|gb|EEV51183.1| glycosyl hydrolase [Enterococcus faecium 1,141,733]
gi|430562100|gb|ELB01353.1| beta-galactosidase [Enterococcus faecium E1590]
gi|430622052|gb|ELB58793.1| beta-galactosidase [Enterococcus faecium E3548]
Length = 595
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 112/319 (35%), Positives = 157/319 (49%), Gaps = 38/319 (11%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
++DG +IS IHY R P W + K GA+ +ETY+ WN HE G ++F G
Sbjct: 11 LVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGF 70
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
++V+FVK+ L + LR Y+CAEW FGG P WL P I R+ + F E+++ +
Sbjct: 71 KNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLKNY 130
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
+ V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + L VP
Sbjct: 131 YQ--VLLPKLAPLQITQGGPVIMMQLENEYG----SYGME-KSYLRQTKELMLAHSIDVP 183
Query: 235 W---------VMCKQTDAPENIIDACNGYYCDGYKPNSY-----------NKPTLWTENW 274
V+ T E+I G + K N+ N P + E W
Sbjct: 184 LFTSDGAWLEVLDAGTLIDEDIF--VTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYW 241
Query: 275 DGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------- 327
DGW+ WG + R E+LA V + G +N YM+ GGTNFG +G
Sbjct: 242 DGWFNRWGEPIITRDPEELATEVKEMLEIGS--LNLYMFHGGTNFGFYNGCSARGNTDLP 299
Query: 328 YITSYDYDAPIDEYGLLSE 346
ITSYDYDA ++E G +E
Sbjct: 300 QITSYDYDALLNEAGQPTE 318
>gi|293570811|ref|ZP_06681858.1| beta-galactosidase [Enterococcus faecium E980]
gi|430840422|ref|ZP_19458347.1| beta-galactosidase [Enterococcus faecium E1007]
gi|431064256|ref|ZP_19493603.1| beta-galactosidase [Enterococcus faecium E1604]
gi|431124630|ref|ZP_19498626.1| beta-galactosidase [Enterococcus faecium E1613]
gi|431738579|ref|ZP_19527522.1| beta-galactosidase [Enterococcus faecium E1972]
gi|291609079|gb|EFF38354.1| beta-galactosidase [Enterococcus faecium E980]
gi|430495187|gb|ELA71394.1| beta-galactosidase [Enterococcus faecium E1007]
gi|430566915|gb|ELB06003.1| beta-galactosidase [Enterococcus faecium E1613]
gi|430568897|gb|ELB07927.1| beta-galactosidase [Enterococcus faecium E1604]
gi|430597307|gb|ELB35110.1| beta-galactosidase [Enterococcus faecium E1972]
Length = 595
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 112/319 (35%), Positives = 157/319 (49%), Gaps = 38/319 (11%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
++DG +IS IHY R P W + K GA+ +ETY+ WN HE G ++F G
Sbjct: 11 LVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGF 70
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
++V+FVK+ L + LR Y+CAEW FGG P WL P I R+ + F E+++ +
Sbjct: 71 KNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDIRVRSTDPRFMEKLKNY 130
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
+ V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + L VP
Sbjct: 131 YQ--VLLPKLAPLQITQGGPVIMMQLENEYG----SYGME-KSYLRQTKELMLAHSIDVP 183
Query: 235 W---------VMCKQTDAPENIIDACNGYYCDGYKPNSY-----------NKPTLWTENW 274
V+ T E+I G + K N+ N P + E W
Sbjct: 184 LFTSDGAWLEVLDAGTLIDEDIF--VTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYW 241
Query: 275 DGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------- 327
DGW+ WG + R E+LA V + G +N YM+ GGTNFG +G
Sbjct: 242 DGWFNRWGEPIITRDPEELATEVKEMLEIGS--LNLYMFHGGTNFGFYNGCSARGNTDLP 299
Query: 328 YITSYDYDAPIDEYGLLSE 346
ITSYDYDA ++E G +E
Sbjct: 300 QITSYDYDALLNEAGQPTE 318
>gi|293334807|ref|NP_001170541.1| uncharacterized protein LOC100384558 [Zea mays]
gi|238005922|gb|ACR33996.1| unknown [Zea mays]
Length = 345
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 101/325 (31%), Positives = 162/325 (49%), Gaps = 39/325 (12%)
Query: 534 EVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVK------VVQPVEFQSGYNDLILLSQTV 587
+++ + ++S F+N + G GH K + +P++ + G N + +L+ T+
Sbjct: 6 DIKTVLEVNSHGHASVAFVNTKFVGC--GHGTKMNKAFTLEKPMDLKKGVNHVAVLASTM 63
Query: 588 GLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIE-ENEAE 646
G+ + GA+LE AG +V++ G G +DL+ W + VGL GE +QIY+ +
Sbjct: 64 GMMDSGAYLEHRLAGV-DRVQIKGLNAGTLDLTNNGWGHIVGLVGEQKQIYTDKGMGSVT 122
Query: 647 WTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGG 706
W D TWYK +FD P G DP+ LD+ +MGKG +VNG IGRYW
Sbjct: 123 WKPAVND---RPLTWYKRHFDMPSGEDPIVLDMSTMGKGLMFVNGQGIGRYWI------- 172
Query: 707 CQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLR 766
Y+ A G P+Q YH+PRS+L+ +N+LV+FEE G P I +
Sbjct: 173 -----SYKHAL---------GRPSQQLYHIPRSFLRQKDNVLVLFEEEFGRPDAIMILTV 218
Query: 767 STRIVCEQVSESHYPPVRKWSNSYSVDGKLSINK--MAPEMHLHCQDGYIISSIEFASYG 824
+C +SE + ++ W D ++++ + P L C +I + FASYG
Sbjct: 219 KRDNICTFISERNPAHIKSWERK---DSQITVTAADLKPRATLTCSPKKLIQQVVFASYG 275
Query: 825 TPQGRCQKFSRGNCHAPMSLSVVSE 849
P G C ++ G+CH P + +V +
Sbjct: 276 NPMGICGNYTIGSCHTPRAKELVEK 300
>gi|424764212|ref|ZP_18191655.1| putative beta-galactosidase [Enterococcus faecium TX1337RF]
gi|402420907|gb|EJV53177.1| putative beta-galactosidase [Enterococcus faecium TX1337RF]
Length = 595
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 112/319 (35%), Positives = 157/319 (49%), Gaps = 38/319 (11%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
++DG +IS IHY R P W + K GA+ +ETY+ WN HE G ++F G
Sbjct: 11 LVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGF 70
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
++V+FVK+ L + LR Y+CAEW FGG P WL P I R+ + F E+++ +
Sbjct: 71 KNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLKNY 130
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
+ V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + L VP
Sbjct: 131 YQ--VLLPKLAPLQITQGGPVIMMQLENEYG----SYGME-KSYLRQTKELMLAHSIDVP 183
Query: 235 W---------VMCKQTDAPENIIDACNGYYCDGYKPNSY-----------NKPTLWTENW 274
V+ T E+I G + K N+ N P + E W
Sbjct: 184 LFTSDGAWLEVLDAGTLIDEDIF--VTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYW 241
Query: 275 DGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------- 327
DGW+ WG + R E+LA V + G +N YM+ GGTNFG +G
Sbjct: 242 DGWFNRWGEPIITRDPEELATEVKEMLEIGS--LNLYMFHGGTNFGFYNGCSARGNTDLP 299
Query: 328 YITSYDYDAPIDEYGLLSE 346
ITSYDYDA ++E G +E
Sbjct: 300 QITSYDYDALLNEAGQPTE 318
>gi|167755577|ref|ZP_02427704.1| hypothetical protein CLORAM_01091 [Clostridium ramosum DSM 1402]
gi|167704516|gb|EDS19095.1| glycosyl hydrolase family 35 [Clostridium ramosum DSM 1402]
Length = 584
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 107/326 (32%), Positives = 155/326 (47%), Gaps = 44/326 (13%)
Query: 51 HRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYN 110
++ I+GN+ +IS +HY R PE W D + K G + +ETYV WN HE +G+Y+
Sbjct: 7 NKEFFINGNKVKIISGAVHYFRIVPEYWRDTLLDLKAMGCNTVETYVPWNLHEPYQGKYD 66
Query: 111 FKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEE 170
F G DI F+KL L++ LR PY+CAEW GG P WL P I RTN+ + +
Sbjct: 67 FSGIKDIETFLKLAEELELFVILRASPYICAEWEMGGLPAWLLKYPRIRLRTNDKQYLKC 126
Query: 171 MQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLG 230
+ ++ ++ + + + Q GPII+ Q+ENEYG SYG+ K+Y+ M G
Sbjct: 127 LDQYFSILLPKLSKYQIT--QNGPIILAQLENEYG----SYGED-KEYLLAVYQMMRKYG 179
Query: 231 AGVPWVMCKQT-----------------------DAPENIIDACNGYYCDGYKPNSYNKP 267
VP T A ENI + + Y+ + P
Sbjct: 180 IEVPLFTADGTWHEALNAGSLLEKKVFPTGNFGSQAKENI--TVLKKFMESYQITA---P 234
Query: 268 TLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF 327
+ E WDGW+ W + R ++ + G +N+YM+ GGTNFG +G
Sbjct: 235 LMCMEFWDGWFNRWNQEIIKRDPQEFVNSAQEMLSLGS--VNFYMFQGGTNFGWMNGCSA 292
Query: 328 Y-------ITSYDYDAPIDEYGLLSE 346
ITSYDYDA + EYG +E
Sbjct: 293 RKEHDLPQITSYDYDAILTEYGAKTE 318
Score = 40.0 bits (92), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 45/183 (24%), Positives = 72/183 (39%), Gaps = 56/183 (30%)
Query: 578 NDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG---DIDLSKILWTYQVGLKGEF 634
+DL +L + +G NYG+ L+ + + G +NG DI +K Y +
Sbjct: 438 HDLKILMENMGRVNYGSKLQ-------AETQQKGIRNGVILDIHFTKKWKHYCLNF---- 486
Query: 635 QQIYSIEENEAEWTDLT--RDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGH 692
E DL +G S +++ F+A D + +DL GKG +VNGH
Sbjct: 487 -----------EHLDLLNWENGYQSGPGFHEYIFEA-DEVKETFIDLEGFGKGVVFVNGH 534
Query: 693 HIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
H GR++ PT + Y +P +L+ N ++IFE
Sbjct: 535 HCGRFYE---------------------------AGPTLSLY-IPGPFLKKGINQIIIFE 566
Query: 753 ETG 755
G
Sbjct: 567 TEG 569
>gi|329927236|ref|ZP_08281534.1| beta-galactosidase [Paenibacillus sp. HGF5]
gi|328938636|gb|EGG35019.1| beta-galactosidase [Paenibacillus sp. HGF5]
Length = 587
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 101/293 (34%), Positives = 150/293 (51%), Gaps = 16/293 (5%)
Query: 63 LISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVK 122
++S IHY R PE W D + K + G + +ETY+ WN HE GQ+ F G D+ +FV+
Sbjct: 21 ILSGAIHYFRVVPEYWEDRLMKLRSCGLNTVETYIPWNLHEPKEGQFVFDGIADLERFVR 80
Query: 123 LVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLM 182
+ G GL++ LR PY+CAEW FGG P WL P I+ R + + E++ ++ +++
Sbjct: 81 IAGDLGLHVILRPSPYICAEWEFGGLPSWLLQNPDIQLRCMDPVYLEKVDQYYDELIP-- 138
Query: 183 REEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKD-YVKWAASMALGLGAGVPWVMCK 239
R L + +GGP+I +QIENEYG+ +++Y + KD +K + L G M +
Sbjct: 139 RLVPLLTSKGGPVIAMQIENEYGSYGNDTAYLEYLKDGLIKRGVDVLLFTSDGPTDGMLQ 198
Query: 240 QTDAPENIIDACNGYYC----DGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAF 295
P + G D + P + E W+GW+ W R ED A
Sbjct: 199 GGAVPGVLATVNFGSRTKEAFDKLREYRPEDPLMCMEYWNGWFDHWLKPHHTRDAEDAAA 258
Query: 296 AVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAPIDEYG 342
S +N+YM+ GGTNFG +G F+ +TSYDYDAP+ E G
Sbjct: 259 VFKEMLDLNAS-VNFYMFHGGTNFGFYNGANFHEKYEPTLTSYDYDAPLSECG 310
>gi|294672870|ref|YP_003573486.1| beta-galactosidase [Prevotella ruminicola 23]
gi|294473700|gb|ADE83089.1| putative beta-galactosidase [Prevotella ruminicola 23]
Length = 787
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 116/364 (31%), Positives = 178/364 (48%), Gaps = 32/364 (8%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
+++ + L+ +SA F + ++ +++G ++ +A +HYPR W I
Sbjct: 6 LLITALLLTFAQFASAGDF------TVGNKTFLLNGEPFVVKAAEVHYPRIPRPYWEHRI 59
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K G + + YVFWN HE GQ++F ND+ +F +L +G+Y+ +R GPYVCAE
Sbjct: 60 KMCKALGMNTLCIYVFWNIHEQREGQFDFTDNNDVAEFCRLAQKNGMYVIVRPGPYVCAE 119
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
W GG P WL I R + F E ++ F +K+ + + L GGPIIM+Q+EN
Sbjct: 120 WEMGGLPWWLLKKKDIRLRERDPYFLERVKIFEQKVGEQLAP--LTIQNGGPIIMVQVEN 177
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGL-GAGVPWVMCK-----QTDAPENIIDACN---G 253
EYG SYG+ K YV G+ G + C + + ++++ N G
Sbjct: 178 EYG----SYGED-KPYVSEIRDCLRGIYGEKLTLFQCDWSSNFERNGLDDLVWTMNFGTG 232
Query: 254 YYCD----GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMN 309
D K N P + +E W GW+ WG RP +D+ + + SF +
Sbjct: 233 ANIDHEFARLKQLRPNAPLMCSEFWSGWFDKWGANHETRPAKDMVDGMDEMLSKNISF-S 291
Query: 310 YYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEP 364
YM GGT+FG +G P + +TSYDYDAPI+EYG +E + K + K P
Sbjct: 292 LYMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYGGTTEKFFQLRKMMQKYSKTPLP 351
Query: 365 ALVA 368
A+ A
Sbjct: 352 AIPA 355
Score = 46.2 bits (108), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 51/228 (22%), Positives = 94/228 (41%), Gaps = 52/228 (22%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSV--IGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAF 595
+T++ D +VFI+ G + + + ++ P + +L +L + +G N+G
Sbjct: 414 VLTLNDGHDFAQVFIDSTYIGKIDRVRNEKSLLLPAVKKG--QELKILIEAMGRINFGRA 471
Query: 596 LEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGI 655
+ KD G V L+ K+G +++W + I++I ++ A
Sbjct: 472 I-KDYKGITESVTLSTDKDG----HELIWNLKR------WDIFTIPDSYAAAKKALDTAK 520
Query: 656 PSTFT--------WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGC 707
+ T +Y+ YF+ + L++ + GKGQ +VNGH IGR+W++
Sbjct: 521 RDSLTKMVFKGSGYYRGYFNLKR-VGDTFLNMENWGKGQVYVNGHAIGRFWSI------- 572
Query: 708 QDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P QT Y VP WL+ N +V+ + G
Sbjct: 573 --------------------GPQQTLY-VPGCWLKKGKNEVVVLDVVG 599
>gi|431758215|ref|ZP_19546843.1| beta-galactosidase [Enterococcus faecium E3083]
gi|430617878|gb|ELB54742.1| beta-galactosidase [Enterococcus faecium E3083]
Length = 595
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 115/316 (36%), Positives = 163/316 (51%), Gaps = 32/316 (10%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
++DG +IS IHY R P W + K GA+ +ETY+ WN HE G ++F G
Sbjct: 11 LVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGF 70
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
++V+FVK+ L + LR Y+CAEW FGG P WL P I R+ + F E+++ +
Sbjct: 71 KNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLKNY 130
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGN--MESSYGQQGKDYVKWAASMALGL--- 229
+ V L + L QGGP+IM+Q+ENEYG+ ME SY +Q K+ + A S+ + L
Sbjct: 131 YQ--VLLPKLAPLQITQGGPVIMMQLENEYGSYGMEKSYLRQTKE-LMLAHSIDIPLFTS 187
Query: 230 -GAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSY-----------NKPTLWTENWDGW 277
GA + V+ T E+I G + K N+ N P + E WDGW
Sbjct: 188 DGAWLE-VLDAGTLIDEDIF--VTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWDGW 244
Query: 278 YTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-------YIT 330
+ WG + R E+LA V + G +N YM+ GGTNFG +G IT
Sbjct: 245 FNRWGEPIITRDPEELATEVKEMLEIGS--LNLYMFHGGTNFGFYNGCSARGNTDLPQIT 302
Query: 331 SYDYDAPIDEYGLLSE 346
SYDYDA ++E G +E
Sbjct: 303 SYDYDALLNEAGQPTE 318
>gi|345003968|ref|YP_004806822.1| glycoside hydrolase family protein [Streptomyces sp. SirexAA-E]
gi|344319594|gb|AEN14282.1| glycoside hydrolase family 35 [Streptomyces sp. SirexAA-E]
Length = 602
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 111/344 (32%), Positives = 168/344 (48%), Gaps = 37/344 (10%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
++Y ++ G +++ +HY R P+ W D + + G + ++TY+ WN HE
Sbjct: 9 LTYSEGTLLRAGRPHQVLAGTLHYFRVHPDQWHDRLERLAAMGLNTVDTYIAWNFHERRT 68
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G++ F G DI +FV+ +GL + +R GPY+CAEW+ GG P WL D PG+ R++ AP
Sbjct: 69 GEHRFDGWRDIERFVRTAQRTGLDVIVRPGPYICAEWDNGGLPAWLTDRPGMRPRSSYAP 128
Query: 167 FKEEMQR----FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWA 222
+ +E+ R + +I DL + +GGP++ +Q+ENEYG SYG Y++W
Sbjct: 129 YLDEVARWFDVLIPRIADLQ------AARGGPVVAVQVENEYG----SYGDD-HAYMRWV 177
Query: 223 ASMALGLGA--------GVPWVMCKQTDAPENIIDACNGYYCDG----YKPNSYNKPTLW 270
G G G +M P + A G D + +P L
Sbjct: 178 HDALAGRGVTELLYTADGPTELMLDGGSLPGVLATATLGSRADQAAQLLRTRRSGEPFLC 237
Query: 271 TENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY-- 328
E W+GW+ WG + R V A A+ +GGS ++ Y GGTNFG +G
Sbjct: 238 AEFWNGWFDHWGEKHHTRSVGSAAAALDEILAKGGS-VSLYPAHGGTNFGLWAGANHADG 296
Query: 329 -----ITSYDYDAPIDEYGLLSEPKWGHLKD-LHAAIKLCEPAL 366
+TSYD DAPI E+G + PK+ +D L AA E L
Sbjct: 297 ALQPTVTSYDSDAPIAEHGAPT-PKFHAFRDRLLAATGAAEREL 339
>gi|422824944|ref|ZP_16873129.1| beta-galactosidase [Streptococcus sanguinis SK405]
gi|422827211|ref|ZP_16875390.1| beta-galactosidase [Streptococcus sanguinis SK678]
gi|422857055|ref|ZP_16903709.1| beta-galactosidase [Streptococcus sanguinis SK1]
gi|324992224|gb|EGC24146.1| beta-galactosidase [Streptococcus sanguinis SK405]
gi|324994315|gb|EGC26229.1| beta-galactosidase [Streptococcus sanguinis SK678]
gi|327459541|gb|EGF05887.1| beta-galactosidase [Streptococcus sanguinis SK1]
Length = 592
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 115/335 (34%), Positives = 160/335 (47%), Gaps = 38/335 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG ++S I Y R P+ W D + K G + +ETY+ W HE GQ+ +G
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFKAEGML 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D + KLV +GLYL +R PY+CAE++FGG P WL P + R N+ F E++ F
Sbjct: 72 DFEAYFKLVKETGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHF- 130
Query: 176 KKIVDLMREEML--FSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
D + ++L S QGGPI+M+Q+ENEYG SY + K Y++ A M G V
Sbjct: 131 ---YDWLFPKLLPYQSDQGGPILMMQVENEYG----SYAED-KAYMRSIAQMMKVRGVTV 182
Query: 234 P-------WVMCKQTDAPENIIDACNGYYCDGYKPNSYNK-----------PTLWTENWD 275
P W+ ++ G + K N+ N P + TE WD
Sbjct: 183 PLFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------- 328
GW++ W + R EDLA V Q G MN ++ GGTNFG SG
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGS--MNLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCE 363
ITSYD+DAPI E+G +E + + H E
Sbjct: 301 ITSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELE 335
>gi|422864131|ref|ZP_16910760.1| beta-galactosidase [Streptococcus sanguinis SK408]
gi|327472954|gb|EGF18381.1| beta-galactosidase [Streptococcus sanguinis SK408]
Length = 592
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 115/335 (34%), Positives = 160/335 (47%), Gaps = 38/335 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG ++S I Y R P+ W D + K G + +ETY+ W HE GQ+ +G
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFKAEGML 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D + KLV +GLYL +R PY+CAE++FGG P WL P + R N+ F E++ F
Sbjct: 72 DFEAYFKLVKETGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHF- 130
Query: 176 KKIVDLMREEML--FSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
D + ++L S QGGPI+M+Q+ENEYG SY + K Y++ A M G V
Sbjct: 131 ---YDWLFPKLLPYQSDQGGPILMMQVENEYG----SYAED-KAYMRSIAQMMKVRGVTV 182
Query: 234 P-------WVMCKQTDAPENIIDACNGYYCDGYKPNSYNK-----------PTLWTENWD 275
P W+ ++ G + K N+ N P + TE WD
Sbjct: 183 PLFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------- 328
GW++ W + R EDLA V Q G MN ++ GGTNFG SG
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGS--MNLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCE 363
ITSYD+DAPI E+G +E + + H E
Sbjct: 301 ITSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELE 335
>gi|354466872|ref|XP_003495895.1| PREDICTED: beta-galactosidase-1-like protein 3-like [Cricetulus
griseus]
Length = 761
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 108/323 (33%), Positives = 169/323 (52%), Gaps = 33/323 (10%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG++ M++ IHY R E W D + K + G + + TY+ WN HE RG ++F
Sbjct: 188 LDGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQNRGTFDFSEIL 247
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ +V L + GL++ LR GPY+CAE + GG P WL P ++ RT F + + ++
Sbjct: 248 DLEAYVSLAATLGLWVILRPGPYICAEVDLGGLPSWLLGYPELQLRTTQQEFLDAVDKYF 307
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
++ R L +GGP+I +QIENEYG S+ + G DY+++ AL V
Sbjct: 308 DHLIP--RILPLQYLRGGPVIAVQIENEYG----SFSKDG-DYMEYIKE-ALQKRGIVEL 359
Query: 236 VMCK------QTDAPENIIDACNGYYCDGYKPNSY--------NKPTLWTENWDGWYTTW 281
++ QT + + + N ++ +S+ +KP + E W GW+ TW
Sbjct: 360 LLTSDNHKGIQTGSVKGALTTIN---MASFEKDSFIKLLQMQNDKPIMVMEYWTGWFDTW 416
Query: 282 GGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYD 335
G + E++ + V+RF + G SF N YM+ GGTNFG +G Y +TSYDYD
Sbjct: 417 GREHNVKSAEEIRYTVSRFIKYGISF-NMYMFHGGTNFGFINGAFHYDKHSSVVTSYDYD 475
Query: 336 APIDEYGLLSEPKWGHLKDLHAA 358
A + E G +E K+ L+ L A+
Sbjct: 476 AVLTEAGDYTE-KYFKLRKLFAS 497
>gi|431593417|ref|ZP_19521746.1| beta-galactosidase [Enterococcus faecium E1861]
gi|430591294|gb|ELB29332.1| beta-galactosidase [Enterococcus faecium E1861]
Length = 595
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 111/319 (34%), Positives = 157/319 (49%), Gaps = 38/319 (11%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
++DG +IS IHY R P W + K GA+ +ETY+ WN HE G ++F G
Sbjct: 11 LVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGF 70
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
++V+FVK+ L + LR Y+CAEW FGG P WL P I R+ + F E+++ +
Sbjct: 71 KNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDIRVRSTDPRFMEKLKNY 130
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
+ V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + L VP
Sbjct: 131 YQ--VLLPKLAPLQITQGGPVIMMQLENEYG----SYGME-KSYLRQTKELMLAHSIDVP 183
Query: 235 WVMCKQTDAPENIIDA---------CNGYYCDGYKPNSY-----------NKPTLWTENW 274
+ A ++DA G + K N+ N P + E W
Sbjct: 184 --LFTSDGAWLEVLDAGILIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYW 241
Query: 275 DGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------- 327
DGW+ WG + R E+LA V + G +N YM+ GGTNFG +G
Sbjct: 242 DGWFNRWGEPIITRDPEELATEVKEMLEIGS--LNLYMFHGGTNFGFYNGCSARGNTDLP 299
Query: 328 YITSYDYDAPIDEYGLLSE 346
ITSYDYDA ++E G +E
Sbjct: 300 QITSYDYDALLNEAGQPTE 318
>gi|410100792|ref|ZP_11295748.1| hypothetical protein HMPREF1076_04926 [Parabacteroides goldsteinii
CL02T12C30]
gi|409214073|gb|EKN07084.1| hypothetical protein HMPREF1076_04926 [Parabacteroides goldsteinii
CL02T12C30]
Length = 779
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 114/355 (32%), Positives = 177/355 (49%), Gaps = 24/355 (6%)
Query: 17 SVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPE 76
+V +M+M++ + C + S ++ F+ + +++G ++ +A IHY R E
Sbjct: 7 TVSLLMVMLICVLSGCKNQSGSNGTFE-----IGDKTFLLNGKPFIIKAAEIHYTRIPVE 61
Query: 77 MWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIG 136
W I K G + I Y FWN HE G+++F G+NDI F +L +G+Y+ LR G
Sbjct: 62 YWEHRIQMCKALGMNTICIYAFWNIHEQKPGEFDFSGQNDIAAFCRLAQKNGMYIMLRPG 121
Query: 137 PYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPII 196
PYVC+EW GG P WL I+ RTN+ F E + ++ +I + + + +GG II
Sbjct: 122 PYVCSEWEMGGLPWWLLKKEDIQLRTNDPYFIERTRIYMNEIGKQLADRQIT--RGGNII 179
Query: 197 MLQIENEYGN--MESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACN-- 252
M+Q+ENEYG+ + SY + +D ++ A + L W +A ++++ N
Sbjct: 180 MVQVENEYGSYATDKSYIAKNRDILRDAGFTDVPLFQ-CDWSSNFLNNALDDLVWTVNFG 238
Query: 253 -GYYCD----GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
G D K N P + +E W GW+ WG + R E + + R SF
Sbjct: 239 TGANIDEQFKKLKEVRPNTPLMCSEFWSGWFDHWGRKHETRDAETMIAGLRDMLDRNISF 298
Query: 308 MNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLKDLHA 357
+ YM GGT FG G P Y +SYDYDAPI E G + PK+ L++ A
Sbjct: 299 -SLYMTHGGTTFGHWGGANSPAYSAMCSSYDYDAPISEAGWAT-PKYHKLREFMA 351
Score = 44.3 bits (103), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 54/234 (23%), Positives = 94/234 (40%), Gaps = 44/234 (18%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLE 597
T+ ID + D +VFI+G+L G + + + + L +L + +G N+ +
Sbjct: 423 TLLIDEVHDWAQVFIDGKLIGRLDRRRGEFTIKLPATAAGARLDILIEAMGRVNFDKAIH 482
Query: 598 KDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPS 657
D G +V L D K Y + + F + + ++T + P+
Sbjct: 483 -DRKGITNKVVL--ITESSSDELKDWQVYNLPVDYSFVK-------DKKYTPGKKIEAPA 532
Query: 658 TFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAY 717
+Y+ F+ D V LD+ + GKG WVNG +GR+W +
Sbjct: 533 ---YYRATFNLETPGD-VFLDMQTWGKGMVWVNGKAMGRFWEI----------------- 571
Query: 718 NSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIV 771
P QT + +P WL+ N +++ + G P + SVK T I+
Sbjct: 572 ----------GPQQTLF-MPGCWLKKGENEIIVLDLKG--PEKASVKGLKTPIL 612
>gi|254384398|ref|ZP_04999740.1| beta-galactosidase [Streptomyces sp. Mg1]
gi|194343285|gb|EDX24251.1| beta-galactosidase [Streptomyces sp. Mg1]
Length = 588
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 100/308 (32%), Positives = 156/308 (50%), Gaps = 26/308 (8%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG ++S G+HY R P +W D + K++ G + +ETYV WN H+ ++ G
Sbjct: 18 LDGEPFRILSGGLHYFRVHPGLWRDRLHKARLMGLNTVETYVPWNLHQPRPDEFRMDGGL 77
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ +F+ L + GL++ LR GPY+CAEW GG P WL P + R+ + F + +
Sbjct: 78 DLPRFLDLAAAEGLHVLLRPGPYICAEWEGGGLPSWLLADPAMRLRSRDPNFLAAVDDYF 137
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
++++ + + + + +GGP++ +Q+ENEYG +YG Y++ A G VP
Sbjct: 138 RRLLPPLHDRL--ASRGGPVLAVQVENEYG----AYGDD-TAYLEHLADSLRRHGVDVPL 190
Query: 236 VMCKQ-TDAPENIIDACNGYYCDGYKPNSY---------NKPTLWTENWDGWYTTWGGRL 285
C Q D + G +P ++ + P L TE W GW+ WGG
Sbjct: 191 FTCDQPADLERGALAGVLATANFGSRPAAHLATLRTARPSAPLLCTEFWIGWFDRWGGNH 250
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG-------PFYITSYDYDAPI 338
R E + + G S +N+YM+ GGTNFG +G P +TSYDYDAP+
Sbjct: 251 VVRDAEQASQELDELLATGAS-VNFYMFHGGTNFGFMNGANDKHTYRP-TVTSYDYDAPL 308
Query: 339 DEYGLLSE 346
DE G +E
Sbjct: 309 DEAGDPTE 316
>gi|261407762|ref|YP_003244003.1| beta-galactosidase [Paenibacillus sp. Y412MC10]
gi|261284225|gb|ACX66196.1| Beta-galactosidase [Paenibacillus sp. Y412MC10]
Length = 587
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 101/293 (34%), Positives = 150/293 (51%), Gaps = 16/293 (5%)
Query: 63 LISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVK 122
++S IHY R PE W D + K + G + +ETY+ WN HE GQ+ F G D+ +FV+
Sbjct: 21 ILSGAIHYFRVVPEYWEDRLMKLRSCGLNTVETYIPWNLHEPKEGQFVFDGIADLERFVR 80
Query: 123 LVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLM 182
+ G GL++ LR PY+CAEW FGG P WL P I+ R + + E++ ++ +++
Sbjct: 81 IAGDLGLHVILRPSPYICAEWEFGGLPSWLLQNPDIQLRCMDPVYLEKVDQYYDELIP-- 138
Query: 183 REEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKD-YVKWAASMALGLGAGVPWVMCK 239
R L + +GGP+I +QIENEYG+ +++Y + KD +K + L G M +
Sbjct: 139 RLVPLLTSKGGPVIAMQIENEYGSYGNDTAYLEYLKDGLIKRGVDVLLFTSDGPTDGMLQ 198
Query: 240 QTDAPENIIDACNGYYC----DGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAF 295
P + G D + P + E W+GW+ W R ED A
Sbjct: 199 GGAVPGVLATVNFGSRTKEAFDKLREYRPEDPLMCMEYWNGWFDHWLKPHHTRDAEDAAA 258
Query: 296 AVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAPIDEYG 342
S +N+YM+ GGTNFG +G F+ +TSYDYDAP+ E G
Sbjct: 259 VFKEMLDLNAS-VNFYMFHGGTNFGFYNGANFHEKYEPTLTSYDYDAPLSECG 310
>gi|217070908|gb|ACJ83814.1| unknown [Medicago truncatula]
Length = 200
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 85/170 (50%), Positives = 108/170 (63%), Gaps = 7/170 (4%)
Query: 682 MGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSW 740
MGKG+AWVNG IGRYW T VA GC D+C+YRG Y S KC NCG P+QT YHVPRS+
Sbjct: 1 MGKGEAWVNGQSIGRYWPTYVASNAGCTDSCNYRGPYTSSKCRKNCGKPSQTLYHVPRSF 60
Query: 741 LQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINK 800
L+ + N LV+FEE GG+P +IS + VC VS+SH P + W+ GK+
Sbjct: 61 LKPNGNTLVLFEENGGDPTQISFATKQLESVCSHVSDSHPPQIDLWNQDTESGGKV---- 116
Query: 801 MAPEMHLHCQD-GYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
P + L C + +ISSI+FASYGTP G C F RG C + +LS+V +
Sbjct: 117 -GPALLLSCPNHNQVISSIKFASYGTPLGTCGNFYRGRCSSNKALSIVKK 165
>gi|422694237|ref|ZP_16752232.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
gi|315148319|gb|EFT92335.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
Length = 593
Score = 170 bits (430), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 117/340 (34%), Positives = 165/340 (48%), Gaps = 42/340 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G +IS IHY R TP W D + K GA+ +ETY+ WN HE G Y+F+G
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+I FV+L L + LR Y+CAEW FGG P WL G+ R+ + F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ + V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + LG V
Sbjct: 131 YFQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEV 183
Query: 234 PWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTEN 273
P + A E ++DA G + K N+ + K P + E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------ 327
WDGW+ WG + HR DLA V G +N YM+ GGTNFG +G
Sbjct: 242 WDGWFNRWGEPVIHREGTDLAKEVKDMLAVGS--LNLYMFHGGTNFGFYNGCSARGEKDL 299
Query: 328 -YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
+TSYDYDA + E G +E + + AIK P +
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEV 335
Score = 41.2 bits (95), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 54/216 (25%), Positives = 81/216 (37%), Gaps = 53/216 (24%)
Query: 546 DVLRVFINGQLTGS----VIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEK--D 599
D L ++++G L + +G + ++ E + D+ L + +G NYG L
Sbjct: 410 DRLHIYVDGDLAATQYQETVGEELLILGQTEKDTLALDI--LVENLGRVNYGFKLNNPTQ 467
Query: 600 GAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTF 659
G RG V DI + Y + E Q+ I D T P
Sbjct: 468 SKGIRGGVM------QDIHFHQGYQHYPLTFSQE--QLAKI--------DYTAGKNPLQP 511
Query: 660 TWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNS 719
++Y+ F+ D +D GKG VNGHH+GRYW + G +S
Sbjct: 512 SFYQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------GPIHS 556
Query: 720 DKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
C P+ +LQ N +VIFE G
Sbjct: 557 LYC--------------PKEFLQQGQNEVVIFETEG 578
>gi|164519029|ref|NP_001019529.2| beta-galactosidase-1-like protein 3 precursor [Rattus norvegicus]
Length = 644
Score = 170 bits (430), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 106/324 (32%), Positives = 165/324 (50%), Gaps = 35/324 (10%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G++ M++ IHY R E W D + K + G + + TY+ WN HE RG+++F
Sbjct: 71 LEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEIL 130
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF- 174
D+ +V L + GL++ LR GPY+CAE + GG P WL PG RT N F E + ++
Sbjct: 131 DLEAYVLLAKTLGLWVILRPGPYICAEVDLGGLPSWLLRNPGSNLRTTNKDFIEAVDKYF 190
Query: 175 ---VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLG- 230
+ KI+ L +GGP+I +Q+ENEYG+ + K+Y+++ L G
Sbjct: 191 DHLIPKILPLQYR------RGGPVIAVQVENEYGSFRND-----KNYMEYIKKALLNRGI 239
Query: 231 ----------AGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTT 280
+G+ K A N+ + ++ + +KP + E W GWY +
Sbjct: 240 VELLLTSDNESGIRIGSVKGALATINVNSFIKDSFVKLHRMQN-DKPIMIMEYWTGWYDS 298
Query: 281 WGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------YITSYDY 334
WG + + ++ + RFF G SF N YM+ GGTNFG +GG +TSYDY
Sbjct: 299 WGSKHTEKSANEIRRTIYRFFSYGLSF-NVYMFHGGTNFGFINGGYHENGHTNVVTSYDY 357
Query: 335 DAPIDEYGLLSEPKWGHLKDLHAA 358
DA + E G +E K+ L+ L A+
Sbjct: 358 DAVLSEAGDYTE-KYFKLRKLFAS 380
>gi|384209874|ref|YP_005595594.1| beta-galactosidase [Brachyspira intermedia PWS/A]
gi|343387524|gb|AEM23014.1| beta-galactosidase [Brachyspira intermedia PWS/A]
Length = 592
Score = 170 bits (430), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 113/343 (32%), Positives = 162/343 (47%), Gaps = 45/343 (13%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
I++G L+S IHY R E W D + K G + +ETY+ WN HE G ++F G
Sbjct: 10 FILNGKPIKLLSGAIHYFRFVEEYWEDCLYNLKAAGFNTVETYIPWNIHEIDEGVFDFSG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
DI F+KL L + LR PY+CAEW FGG P WL ++ RTN F ++
Sbjct: 70 NKDIASFIKLAQKMDLLVILRPTPYICAEWEFGGLPAWLLRYDNMKVRTNTELFLSKVDA 129
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ K++ + + L + GP+IM+QIENEYG S+G K+Y+K ++ + GA V
Sbjct: 130 YYKELFKQIAD--LQITRNGPVIMMQIENEYG----SFGND-KEYLKALKNLMVKHGAEV 182
Query: 234 PW---------VMCKQTDAPENII-------------DACNGYYCDGYKPNSYNKPTLWT 271
P V+ T + I+ DA ++ + P +
Sbjct: 183 PLFTSDGAWDAVLEAGTLVDDGILATVNFGSQAKESFDATEKFF----ERKGIKNPLMCM 238
Query: 272 ENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF---- 327
E WDGW+ W + R +D V +RG +N YM+ GGTNFG +G
Sbjct: 239 EFWDGWFNLWKEPIIKRDADDFIMEVKEIIKRGS--INLYMFIGGTNFGFYNGTSVTGYT 296
Query: 328 ---YITSYDYDAPIDEYGLLSEPKWGHLK---DLHAAIKLCEP 364
ITSYDYDA + E+G +E + K +L IK EP
Sbjct: 297 DFPQITSYDYDAVLTEWGEPTEKFYKLQKLINELFPEIKTFEP 339
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 81/156 (51%), Gaps = 21/156 (13%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVE--FQSGYNDLILLSQTVGLQNYGAFLEKDGAGF 603
D + ++NG+ G + + ++++P+E F +G N L LL + VG NYG L++
Sbjct: 409 DRVHFYLNGEYKG--VKYQDELIEPIEMHFNNGDNVLELLVENVGRVNYGYKLQE----- 461
Query: 604 RGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYK 663
QVK G + G + + + ++Q Y++ + + D + I +T ++Y+
Sbjct: 462 CSQVK--GIRIGVMA--------DIHFETGWEQ-YALPLDNIKDVDFSSKWIENTPSFYR 510
Query: 664 TYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
FD + D LD +GKG A++NG ++GRYW+
Sbjct: 511 YEFDVKEPADTF-LDCSKLGKGAAFINGFNLGRYWS 545
>gi|422852505|ref|ZP_16899175.1| beta-galactosidase [Streptococcus sanguinis SK150]
gi|325693831|gb|EGD35750.1| beta-galactosidase [Streptococcus sanguinis SK150]
Length = 592
Score = 170 bits (430), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 115/335 (34%), Positives = 159/335 (47%), Gaps = 38/335 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG ++S I Y R P+ W D + K G + +ETY+ W HE GQ+ +G
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D + KLV GLYL +R PY+CAE++FGG P WL P + R N+ F E++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHF- 130
Query: 176 KKIVDLMREEML--FSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
D + ++L S QGGPI+M+Q+ENEYG SY + K Y++ A M G V
Sbjct: 131 ---YDWLFPKLLPYQSDQGGPILMMQVENEYG----SYAED-KAYMRSIAQMMKVRGVTV 182
Query: 234 P-------WVMCKQTDAPENIIDACNGYYCDGYKPNSYNK-----------PTLWTENWD 275
P W+ ++ G + K N+ N P + TE WD
Sbjct: 183 PLFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMECYGKKWPLMCTEFWD 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------- 328
GW++ W + R EDLA V Q G MN ++ GGTNFG SG
Sbjct: 243 GWFSRWSEEIVRREAEDLAQGVKEMLQLGS--MNLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCE 363
ITSYD+DAPI E+G +E + + H E
Sbjct: 301 ITSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELE 335
>gi|81889875|sp|Q5XIL5.1|GLBL3_RAT RecName: Full=Beta-galactosidase-1-like protein 3
gi|53734228|gb|AAH83665.1| Galactosidase, beta 1-like 3 [Rattus norvegicus]
Length = 631
Score = 170 bits (430), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 106/324 (32%), Positives = 165/324 (50%), Gaps = 35/324 (10%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G++ M++ IHY R E W D + K + G + + TY+ WN HE RG+++F
Sbjct: 58 LEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEIL 117
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF- 174
D+ +V L + GL++ LR GPY+CAE + GG P WL PG RT N F E + ++
Sbjct: 118 DLEAYVLLAKTLGLWVILRPGPYICAEVDLGGLPSWLLRNPGSNLRTTNKDFIEAVDKYF 177
Query: 175 ---VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLG- 230
+ KI+ L +GGP+I +Q+ENEYG+ + K+Y+++ L G
Sbjct: 178 DHLIPKILPLQYR------RGGPVIAVQVENEYGSFRND-----KNYMEYIKKALLNRGI 226
Query: 231 ----------AGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTT 280
+G+ K A N+ + ++ + +KP + E W GWY +
Sbjct: 227 VELLLTSDNESGIRIGSVKGALATINVNSFIKDSFVKLHRMQN-DKPIMIMEYWTGWYDS 285
Query: 281 WGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------YITSYDY 334
WG + + ++ + RFF G SF N YM+ GGTNFG +GG +TSYDY
Sbjct: 286 WGSKHTEKSANEIRRTIYRFFSYGLSF-NVYMFHGGTNFGFINGGYHENGHTNVVTSYDY 344
Query: 335 DAPIDEYGLLSEPKWGHLKDLHAA 358
DA + E G +E K+ L+ L A+
Sbjct: 345 DAVLSEAGDYTE-KYFKLRKLFAS 367
>gi|422852902|ref|ZP_16899566.1| beta-galactosidase [Streptococcus sanguinis SK160]
gi|325697836|gb|EGD39720.1| beta-galactosidase [Streptococcus sanguinis SK160]
Length = 592
Score = 170 bits (430), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 114/335 (34%), Positives = 160/335 (47%), Gaps = 38/335 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG ++S I Y R P+ W D + K G + +ETY+ W HE GQ+ +G
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFKAEGML 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D + KLV +GLYL +R PY+CAE++FGG P WL P + R N+ F E++ F
Sbjct: 72 DFEAYFKLVKETGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHF- 130
Query: 176 KKIVDLMREEML--FSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
D + ++L S QGGPI+M+Q+ENEYG SY + K Y++ A M G +
Sbjct: 131 ---YDWLFPKLLPYQSDQGGPILMMQVENEYG----SYAED-KAYMRSIAQMMKVRGVTI 182
Query: 234 P-------WVMCKQTDAPENIIDACNGYYCDGYKPNSYNK-----------PTLWTENWD 275
P W+ ++ G + K N+ N P + TE WD
Sbjct: 183 PLFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------- 328
GW++ W + R EDLA V Q G MN ++ GGTNFG SG
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGS--MNLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCE 363
ITSYD+DAPI E+G +E + + H E
Sbjct: 301 ITSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELE 335
>gi|410865123|ref|YP_006979734.1| Beta-galactosidase [Propionibacterium acidipropionici ATCC 4875]
gi|410821764|gb|AFV88379.1| Beta-galactosidase [Propionibacterium acidipropionici ATCC 4875]
Length = 591
Score = 169 bits (429), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 105/314 (33%), Positives = 158/314 (50%), Gaps = 28/314 (8%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
++DG ++S IHY R P+ W D I K++ G + IETYV WNAHE + GQ++++G
Sbjct: 12 LLDGRPHRILSGAIHYFRIHPDQWADRIHKARLMGLNTIETYVAWNAHEPVEGQWSWEGG 71
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
D+ F+K V G++ +R PY+CAEW+ GG P WL R + F +Q +
Sbjct: 72 LDLAAFLKAVADEGMHAIVRPAPYICAEWDNGGLPAWLFGEKAAGVRRDEPVFMAAVQAY 131
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
++++ +++ E L GGP+I++QIENEYG +YG +Y++ + G VP
Sbjct: 132 LRRVYEVI--EPLQIHHGGPVILVQIENEYG----AYGSD-PEYLRKLVDITSSAGITVP 184
Query: 235 WVMCKQTDAPENIIDACNGYYCDG------------YKPNSYNKPTLWTENWDGWYTTWG 282
Q + + G G + + P + E W+GW+ WG
Sbjct: 185 LTTVDQPEDGMLAAGSLPGLLRTGSFGSRSPERLATLRRHQPTGPLMCMEYWNGWFDDWG 244
Query: 283 GRLPHRPVEDLAFAVARFFQRG-GSFMNYYMYFGGTNFGRTSG----GPF--YITSYDYD 335
PH + A A G G+ +N YM GGTNFG T+G G + +TSYDYD
Sbjct: 245 --TPHHTTDAEASAADLDALLGSGASVNLYMLCGGTNFGLTNGANDKGTYEPIVTSYDYD 302
Query: 336 APIDEYGLLSEPKW 349
AP+DE G + W
Sbjct: 303 APLDEAGHPTAKYW 316
>gi|422871792|ref|ZP_16918285.1| beta-galactosidase [Streptococcus sanguinis SK1087]
gi|328945306|gb|EGG39459.1| beta-galactosidase [Streptococcus sanguinis SK1087]
Length = 592
Score = 169 bits (429), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 117/339 (34%), Positives = 162/339 (47%), Gaps = 41/339 (12%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG ++S I Y R P+ W D + K G + +ETY+ W HE GQ+ +G
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D + KLV GLYL +R PY+CAE++FGG P WL P + R N+ F E++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHF- 130
Query: 176 KKIVDLMREEML--FSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
D + ++L S QGGPI+M+Q+ENEYG SY + K Y++ A M G V
Sbjct: 131 ---YDWLFPKLLPYQSDQGGPILMMQVENEYG----SYAED-KAYMRSIAQMMKVRGVTV 182
Query: 234 P-------WVMCKQTDAPENIIDACNGYYCDGYKPNSYNK-----------PTLWTENWD 275
P W+ ++ G + K N+ N P + TE WD
Sbjct: 183 PLFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------- 328
GW++ W + R EDLA V Q G MN ++ GGTNFG SG
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGS--MNLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLHAA---IKLCEP 364
ITSYD+DAPI E+G +E + + H +K EP
Sbjct: 301 ITSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELKQMEP 339
>gi|430368510|ref|ZP_19428251.1| beta-galactosidase [Enterococcus faecalis M7]
gi|429516266|gb|ELA05760.1| beta-galactosidase [Enterococcus faecalis M7]
Length = 594
Score = 169 bits (429), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 114/343 (33%), Positives = 171/343 (49%), Gaps = 29/343 (8%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVKWAASMALGLGA 231
+ +++ + L + GG I+M+QIENEYG+ E +Y + +D + AL +
Sbjct: 129 YYDVLMEKIVPHQLVN--GGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTALFFTS 186
Query: 232 GVPWVMCKQTDA--PENIIDACN---------GYYCDGYKPNSYNKPTLWTENWDGWYTT 280
PW + + ++I+ N G ++ + P + E WDGW+
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246
Query: 281 WGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-------YITSYD 333
W + R ++LA +V G +N YM+ GGTNFG +G ITSYD
Sbjct: 247 WKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 304
Query: 334 YDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
YDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 305 YDAPLDEQGNPTEKYFALQKMLHEEYPALSQAEP-LVKESFAQ 346
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 54/223 (24%), Positives = 90/223 (40%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 407 RDRLQLFVNQVHQATQYQTEIGEDIYVTLPQE----NNQIDILMENMGRVNYGHKLFAD- 461
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + ++QQ Y + E D +R+ P +
Sbjct: 462 ------TQKKGIRTGVMA--------DLHFMTQWQQ-YCLPMTSCEQVDYSREWQPDQPS 506
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 507 FYQYHVELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 545
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 546 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 580
>gi|422861007|ref|ZP_16907651.1| beta-galactosidase [Streptococcus sanguinis SK330]
gi|327468658|gb|EGF14137.1| beta-galactosidase [Streptococcus sanguinis SK330]
Length = 592
Score = 169 bits (429), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 115/335 (34%), Positives = 159/335 (47%), Gaps = 38/335 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG ++S I Y R P+ W D + K G + +ETY+ W HE GQ+ +G
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D + KLV GLYL +R PY+CAE++FGG P WL P + R N+ F E++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHF- 130
Query: 176 KKIVDLMREEML--FSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
D + ++L S QGGPI+M+Q+ENEYG SY + K Y++ A M G V
Sbjct: 131 ---YDWLFPKLLPYQSDQGGPILMMQVENEYG----SYAED-KAYMRSIAQMMKVRGVSV 182
Query: 234 P-------WVMCKQTDAPENIIDACNGYYCDGYKPNSYNK-----------PTLWTENWD 275
P W+ ++ G + K N+ N P + TE WD
Sbjct: 183 PLFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------- 328
GW++ W + R EDLA V Q G MN ++ GGTNFG SG
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGS--MNLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCE 363
ITSYD+DAPI E+G +E + + H E
Sbjct: 301 ITSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELE 335
>gi|125526285|gb|EAY74399.1| hypothetical protein OsI_02287 [Oryza sativa Indica Group]
Length = 255
Score = 169 bits (429), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 117/206 (56%), Gaps = 49/206 (23%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+VSYD R+++IDG RR+++S IHYPR+TPE
Sbjct: 29 SVSYDDRSLVIDGQRRIILSGSIHYPRSTPEE---------------------------- 60
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+ ++G+Y LRIGPY+C EWN+GG P WLRDIPG++FR +N
Sbjct: 61 ------------------IQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNE 102
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVKWAA 223
PF+ EM+ F IV+ M++ +F+ QGGPII+ QIENEYGN+ + + Q +Y+ W A
Sbjct: 103 PFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIHWCA 162
Query: 224 SMALGLGAGVPWVMCKQ-TDAPENII 248
MA GVPW+MC+Q D P N++
Sbjct: 163 DMANKQNVGVPWIMCQQDDDVPHNVL 188
>gi|125717147|ref|YP_001034280.1| glycosyl hydrolase family protein [Streptococcus sanguinis SK36]
gi|125497064|gb|ABN43730.1| Glycosylhydrolase, family 35, putative [Streptococcus sanguinis
SK36]
Length = 592
Score = 169 bits (429), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 115/335 (34%), Positives = 159/335 (47%), Gaps = 38/335 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG ++S I Y R P+ W D + K G + +ETY+ W HE GQ+ +G
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFKAEGML 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D + KLV GLYL +R PY+CAE++FGG P WL P + R N+ F E++ F
Sbjct: 72 DFEAYFKLVKEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHF- 130
Query: 176 KKIVDLMREEML--FSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
D + ++L S QGGPI+M+Q+ENEYG SY + K Y++ A M G V
Sbjct: 131 ---YDWLFPKLLPYQSDQGGPILMMQVENEYG----SYAED-KAYMRSIAQMMKVRGVTV 182
Query: 234 P-------WVMCKQTDAPENIIDACNGYYCDGYKPNSYNK-----------PTLWTENWD 275
P W+ ++ G + K N+ N P + TE WD
Sbjct: 183 PLFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------- 328
GW++ W + R EDLA V Q G MN ++ GGTNFG SG
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGS--MNLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCE 363
ITSYD+DAPI E+G +E + + H E
Sbjct: 301 ITSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELE 335
>gi|422845798|ref|ZP_16892481.1| beta-galactosidase [Streptococcus sanguinis SK72]
gi|325688586|gb|EGD30603.1| beta-galactosidase [Streptococcus sanguinis SK72]
Length = 592
Score = 169 bits (429), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 115/335 (34%), Positives = 159/335 (47%), Gaps = 38/335 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG ++S I Y R P+ W D + K G + +ETY+ W HE GQ+ +G
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D + KLV GLYL +R PY+CAE++FGG P WL P + R N+ F E++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHF- 130
Query: 176 KKIVDLMREEML--FSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
D + ++L S QGGPI+M+Q+ENEYG SY + K Y++ A M G V
Sbjct: 131 ---YDWLFPKLLPYQSDQGGPILMMQVENEYG----SYAED-KAYMRSIAQMMKVRGVTV 182
Query: 234 P-------WVMCKQTDAPENIIDACNGYYCDGYKPNSYNK-----------PTLWTENWD 275
P W+ ++ G + K N+ N P + TE WD
Sbjct: 183 PLFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------- 328
GW++ W + R EDLA V Q G MN ++ GGTNFG SG
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGS--MNLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCE 363
ITSYD+DAPI E+G +E + + H E
Sbjct: 301 ITSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELE 335
>gi|260912222|ref|ZP_05918774.1| beta-galactosidase [Prevotella sp. oral taxon 472 str. F0295]
gi|260633656|gb|EEX51794.1| beta-galactosidase [Prevotella sp. oral taxon 472 str. F0295]
Length = 627
Score = 169 bits (428), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 124/387 (32%), Positives = 182/387 (47%), Gaps = 46/387 (11%)
Query: 11 LQCLALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHY 70
L+CLA++ M+++ + T F ++ + +G L S +HY
Sbjct: 4 LKCLAMAT---MLLLTATTAEAKQNKQTKTTRNTFAITDGQ--FVYNGKPMQLHSGEMHY 58
Query: 71 PRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFK-GKNDIVKFVKLVGSSGL 129
R W + K G + + TYVFWN HE+ G++++K G ++ +FVK G+
Sbjct: 59 ARVPAPYWRHRMKMMKAMGLNAVATYVFWNYHETEPGKWDWKTGNRNLRQFVKTAAEEGM 118
Query: 130 YLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFS 189
+ LR GPY CAEW+FGG+P WL G+ R +N PF + + ++ ++ MR+ +
Sbjct: 119 LVILRPGPYCCAEWDFGGYPWWLSKAKGLVIRADNQPFLDSCRVYINQLASQMRDLQIT- 177
Query: 190 WQGGPIIMLQIENEYGNMESSYGQQGKD--------YVKWAASMALGLGAGVP------- 234
+GGPIIM+Q ENE+G SY Q KD Y + G VP
Sbjct: 178 -KGGPIIMVQAENEFG----SYVAQRKDVPLESHRAYSAKIKQQLIDAGFDVPLFTSDGS 232
Query: 235 WVMCKQTDAPENIIDACNGYY-CDGYKP--NSYN---KPTLWTENWDGWYTTWGGRLPHR 288
W+ T E + NG + K N YN P + E + GW + W P
Sbjct: 233 WLFKGGTI--EGALPTANGENDIEKLKKVVNEYNGGKGPYMVAEFYPGWLSHWAEPFPQV 290
Query: 289 PVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY--------ITSYDYDAPIDE 340
E + A++ + G SF NYYM GGTNFG TSG + +TSYDYDAPI E
Sbjct: 291 STESIVKQTAKYLENGVSF-NYYMVHGGTNFGFTSGANYTTATNLQSDLTSYDYDAPISE 349
Query: 341 YGLLSEPKWGHLKDLHAA-IKLCEPAL 366
G + PK+ L+ L +K PA+
Sbjct: 350 AG-WNTPKYDALRALMIKNVKYNVPAV 375
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 74/311 (23%), Positives = 124/311 (39%), Gaps = 75/311 (24%)
Query: 448 IKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVT 507
IK V++++P P +P ++ KLS ++ + + V ++ T E LN
Sbjct: 365 IKNVKYNVPAVPQ-RIPVIAIPNIKLSKSADVLNLLTKGKAVENDTPLT----FEDLNQG 419
Query: 508 KDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKV 567
Y Y H Q + T+ I + D V++NGQ G + V
Sbjct: 420 HGYVLYRRHFNQ--------------PISGTMKIAGLADYALVYVNGQKVGEL--DRVSD 463
Query: 568 VQPVEFQSGYNDLI-LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTY 626
V +E +N ++ +L + +G NYGA + + G G V + G +
Sbjct: 464 VDSIEINMPFNGVLDILVENMGRINYGARIPQSIKGINGPVVIDGNE------------- 510
Query: 627 QVGLKGEFQQIYSIEENEAEWTDL----TRDGIPSTFTWYKTYFDAPDGIDPVALDLGSM 682
+ G +Q +Y + NEA + G+P T Y F+ D L++ +
Sbjct: 511 ---ITGNWQ-MYKLPMNEAPDVNALPTANNKGLP---TLYSGTFNL-DTTGDTFLNMETW 562
Query: 683 GKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQ 742
GKG ++NG ++GRYW RG P QT Y +P +L+
Sbjct: 563 GKGIVFINGFNLGRYWK--------------RG-------------PQQTLY-LPGCFLK 594
Query: 743 ASNNLLVIFEE 753
N +V+FE+
Sbjct: 595 KGENKIVVFEQ 605
>gi|257875465|ref|ZP_05655118.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC20]
gi|257809631|gb|EEV38451.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC20]
Length = 585
Score = 169 bits (428), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 109/308 (35%), Positives = 153/308 (49%), Gaps = 30/308 (9%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+D +IS IHY R PE W D + K + G + +ETYV WN HE+ G Y F G
Sbjct: 12 LDNKPFKVISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFDGIL 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ +F++ GLY+ LR PY+CAEW FGG P WL P ++ R + PF E++ R+
Sbjct: 72 DLRRFIQTAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYF 131
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
+ +R+ L QGGPIIM+Q+ENEYG SY K+Y++ + G P
Sbjct: 132 AHLFPQVRD--LQITQGGPIIMMQVENEYG----SYAND-KEYLRKMVAAMRQHGVETPL 184
Query: 236 VMCKQT--DAPEN--IIDA------CNGYYCDGY----KPNSYNKPTLWTENWDGWYTTW 281
V D EN I D C + + K + +P + E W GW+ W
Sbjct: 185 VTSDGPWHDMLENGSIKDLALPTINCGSNIKENFEKLRKFHGEKRPLMVMEFWIGWFDAW 244
Query: 282 GGRLPH-RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDY 334
G H ++D + G +N YM+ GGTNFG +G +Y +TSYDY
Sbjct: 245 GDDQHHTTSIQDAVKELQDCLALGS--VNIYMFHGGTNFGFMNGSNYYERLAPDVTSYDY 302
Query: 335 DAPIDEYG 342
DA + E+G
Sbjct: 303 DALLTEWG 310
>gi|325297293|ref|YP_004257210.1| glycoside hydrolase family protein [Bacteroides salanitronis DSM
18170]
gi|324316846|gb|ADY34737.1| glycoside hydrolase family 35 [Bacteroides salanitronis DSM 18170]
Length = 784
Score = 169 bits (428), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 114/344 (33%), Positives = 171/344 (49%), Gaps = 32/344 (9%)
Query: 31 SCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGA 90
SC + + TF +++G ++ +A +HYPR W I + K G
Sbjct: 23 SCSPKTESGTF------EAGKGTFLLNGEPFVVKAAELHYPRIPRAYWEHRIKQCKALGM 76
Query: 91 DVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPV 150
+ I YVFWN HE G+++F G+ D+ +F +L + +Y+ LR GPYVCAEW GG P
Sbjct: 77 NTICLYVFWNFHEEKPGEFDFTGQKDLAEFCRLCQKNDMYVILRPGPYVCAEWEMGGLPW 136
Query: 151 WLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESS 210
WL I R ++ F E + F K++ + + L +GGPIIM+Q+ENEYG S
Sbjct: 137 WLLKKKDIRLREDDPYFLERVAIFEKEVANQV--AGLTIQKGGPIIMVQVENEYG----S 190
Query: 211 YGQQGKDYVKWAASMALGLGAGVPWVMCK-----QTDAPENIIDACN---GYYCDG---- 258
YG+ K+YV + G V C Q +A ++++ N G D
Sbjct: 191 YGES-KEYVAKIRDIVRGNFGDVTLFQCDWASNFQLNALDDLVWTMNFGTGANIDEQFAP 249
Query: 259 YKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTN 318
K + P + +E W GW+ WG R +D+ + +G SF + YM GGTN
Sbjct: 250 LKKVRPDSPLMCSEFWSGWFDKWGANHETRAADDMIAGIDEMLSKGISF-SLYMTHGGTN 308
Query: 319 FGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLKDLHA 357
+G +G P + +TSYDYDAPI E G ++ PK+ L++ A
Sbjct: 309 WGHWAGANSPGFAPDVTSYDYDAPISESGKIT-PKYEKLRETLA 351
>gi|422864548|ref|ZP_16911173.1| beta-galactosidase [Streptococcus sanguinis SK1058]
gi|327490742|gb|EGF22523.1| beta-galactosidase [Streptococcus sanguinis SK1058]
Length = 592
Score = 169 bits (428), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 115/335 (34%), Positives = 159/335 (47%), Gaps = 38/335 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG ++S I Y R P+ W D + K G + +ETY+ W HE GQ+ +G
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D + KLV GLYL +R PY+CAE++FGG P WL P + R N+ F E++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHF- 130
Query: 176 KKIVDLMREEML--FSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
D + ++L S QGGPI+M+Q+ENEYG SY + K Y++ A M G V
Sbjct: 131 ---YDWLFPKLLPYQSDQGGPILMMQVENEYG----SYAED-KAYMRSIAQMMKVRGVTV 182
Query: 234 P-------WVMCKQTDAPENIIDACNGYYCDGYKPNSYNK-----------PTLWTENWD 275
P W+ ++ G + K N+ N P + TE WD
Sbjct: 183 PLFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------- 328
GW++ W + R EDLA V Q G MN ++ GGTNFG SG
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGS--MNLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCE 363
ITSYD+DAPI E+G +E + + H E
Sbjct: 301 ITSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELE 335
>gi|401681814|ref|ZP_10813709.1| glycosyl hydrolase family 35 [Streptococcus sp. AS14]
gi|400185120|gb|EJO19350.1| glycosyl hydrolase family 35 [Streptococcus sp. AS14]
Length = 592
Score = 169 bits (428), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 115/335 (34%), Positives = 159/335 (47%), Gaps = 38/335 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG ++S I Y R P+ W D + K G + +ETY+ W HE GQ+ +G
Sbjct: 12 LDGQPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D + KLV GLYL +R PY+CAE++FGG P WL P + R N+ F E++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHF- 130
Query: 176 KKIVDLMREEML--FSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
D + ++L S QGGPI+M+Q+ENEYG SY + K Y++ A M G V
Sbjct: 131 ---YDWLFPKLLPYQSDQGGPILMMQVENEYG----SYAED-KAYMRSIAQMMKVRGVTV 182
Query: 234 P-------WVMCKQTDAPENIIDACNGYYCDGYKPNSYNK-----------PTLWTENWD 275
P W+ ++ G + K N+ N P + TE WD
Sbjct: 183 PLFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRSFMERYGKKWPLMCTEFWD 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------- 328
GW++ W + R EDLA V Q G MN ++ GGTNFG SG
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGS--MNLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCE 363
ITSYD+DAPI E+G +E + + H E
Sbjct: 301 ITSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELE 335
>gi|256376699|ref|YP_003100359.1| beta-galactosidase [Actinosynnema mirum DSM 43827]
gi|255921002|gb|ACU36513.1| Beta-galactosidase [Actinosynnema mirum DSM 43827]
Length = 579
Score = 169 bits (428), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 103/307 (33%), Positives = 159/307 (51%), Gaps = 26/307 (8%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
++DG +++ +HY R P++W D I K++ G + IETY WN HE + G Y+F G
Sbjct: 11 FLLDGRPHRVLAGALHYFRVHPDLWADRIEKARLMGLNTIETYTPWNLHEPVEGAYDFTG 70
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F++LV +G++ +R GPY+CAEW+ GG P WL P + R + + +
Sbjct: 71 MLDLERFLRLVADAGMHAIVRPGPYICAEWDNGGLPAWLYRDPEVGVRRSEPRYLGAVSA 130
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+++++ D++ + +GGP++++QIENEYG +YG K Y++ + G V
Sbjct: 131 YLRRVYDVVTPLQID--RGGPVVLVQIENEYG----AYGSD-KFYLRHLVDLTRECGITV 183
Query: 234 PWVMCKQ-TDA--PENIIDACNGYYCDGYKPNSY---------NKPTLWTENWDGWYTTW 281
P Q TD + +D + G + P + +E W+GW+ W
Sbjct: 184 PLTTVDQPTDEMLSQGSLDCLHRTGSFGSRATERLATLRRHQPTGPLMCSEFWNGWFDHW 243
Query: 282 GGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG------PFYITSYDYD 335
G R ED A + G S +N YM+ GGTNFG TSG ITSYDYD
Sbjct: 244 GDRHHTTSAEDSAAELDALLAAGAS-VNIYMFHGGTNFGLTSGANDKGVYQPTITSYDYD 302
Query: 336 APIDEYG 342
AP+DE G
Sbjct: 303 APLDEAG 309
>gi|422880263|ref|ZP_16926727.1| beta-galactosidase [Streptococcus sanguinis SK1059]
gi|422930132|ref|ZP_16963071.1| beta-galactosidase [Streptococcus sanguinis ATCC 29667]
gi|422930724|ref|ZP_16963655.1| beta-galactosidase [Streptococcus sanguinis SK340]
gi|332364839|gb|EGJ42608.1| beta-galactosidase [Streptococcus sanguinis SK1059]
gi|339614112|gb|EGQ18823.1| beta-galactosidase [Streptococcus sanguinis ATCC 29667]
gi|339620700|gb|EGQ25268.1| beta-galactosidase [Streptococcus sanguinis SK340]
Length = 592
Score = 169 bits (428), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 115/335 (34%), Positives = 159/335 (47%), Gaps = 38/335 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG ++S I Y R P+ W D + K G + +ETY+ W HE GQ+ +G
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D + KLV GLYL +R PY+CAE++FGG P WL P + R N+ F E++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHF- 130
Query: 176 KKIVDLMREEML--FSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
D + ++L S QGGPI+M+Q+ENEYG SY + K Y++ A M G V
Sbjct: 131 ---YDWLFPKLLPYQSDQGGPILMMQVENEYG----SYAED-KAYMRSIAQMMKVRGVTV 182
Query: 234 P-------WVMCKQTDAPENIIDACNGYYCDGYKPNSYNK-----------PTLWTENWD 275
P W+ ++ G + K N+ N P + TE WD
Sbjct: 183 PLFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------- 328
GW++ W + R EDLA V Q G MN ++ GGTNFG SG
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGS--MNLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCE 363
ITSYD+DAPI E+G +E + + H E
Sbjct: 301 ITSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELE 335
>gi|384513478|ref|YP_005708571.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|430361754|ref|ZP_19426831.1| putative beta-galactosidase [Enterococcus faecalis OG1X]
gi|327535367|gb|AEA94201.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|429512307|gb|ELA01915.1| putative beta-galactosidase [Enterococcus faecalis OG1X]
Length = 604
Score = 169 bits (428), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 114/343 (33%), Positives = 171/343 (49%), Gaps = 29/343 (8%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVKWAASMALGLGA 231
+ +++ + L + GG I+M+QIENEYG+ E +Y + +D + AL +
Sbjct: 139 YYDVLMEKIVPHQLVN--GGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTALFFTS 196
Query: 232 GVPWVMCKQTDA--PENIIDACN---------GYYCDGYKPNSYNKPTLWTENWDGWYTT 280
PW + + ++I+ N G ++ + P + E WDGW+
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256
Query: 281 WGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-------YITSYD 333
W + R ++LA +V G +N YM+ GGTNFG +G ITSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314
Query: 334 YDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
YDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHEEYPALSQAEP-LVKESFAQ 356
Score = 41.6 bits (96), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 90/223 (40%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 417 RDRLQLFVNQVHQATQYQTEIGEDIYVTLPQE----NNQIDILMENMGRVNYGHKLFAD- 471
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + ++QQ Y + E D +R+ P +
Sbjct: 472 ------TQKKGIRTGVMA--------DLHFMTQWQQ-YCLPMTSCEQVDYSREWQPDQPS 516
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + + +D+ GKG +VN ++GR+W V
Sbjct: 517 FYQYHVELAE-VKDTFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 555
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 556 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 590
>gi|422849537|ref|ZP_16896213.1| beta-galactosidase [Streptococcus sanguinis SK115]
gi|325689511|gb|EGD31516.1| beta-galactosidase [Streptococcus sanguinis SK115]
Length = 592
Score = 169 bits (428), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 115/335 (34%), Positives = 159/335 (47%), Gaps = 38/335 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG ++S I Y R P+ W D + K G + +ETY+ W HE GQ+ +G
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D + KLV GLYL +R PY+CAE++FGG P WL P + R N+ F E++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHF- 130
Query: 176 KKIVDLMREEML--FSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
D + ++L S QGGPI+M+Q+ENEYG SY + K Y++ A M G V
Sbjct: 131 ---YDWLFPKLLPYQSDQGGPILMMQVENEYG----SYAED-KAYMRSIAQMMKVRGVTV 182
Query: 234 P-------WVMCKQTDAPENIIDACNGYYCDGYKPNSYNK-----------PTLWTENWD 275
P W+ ++ G + K N+ N P + TE WD
Sbjct: 183 PLFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------- 328
GW++ W + R EDLA V Q G MN ++ GGTNFG SG
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGS--MNLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCE 363
ITSYD+DAPI E+G +E + + H E
Sbjct: 301 ITSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELE 335
>gi|392950288|ref|ZP_10315845.1| Beta-galactosidase 3 [Lactobacillus pentosus KCA1]
gi|392434570|gb|EIW12537.1| Beta-galactosidase 3 [Lactobacillus pentosus KCA1]
Length = 588
Score = 169 bits (427), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 106/320 (33%), Positives = 154/320 (48%), Gaps = 33/320 (10%)
Query: 52 RAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNF 111
+ +++G + S +HY R P W D + K K G + +ETY+ WN HE GQ+ F
Sbjct: 10 KEFLLNGQPFKIYSGAVHYFRIAPSEWRDTLEKLKAAGLNTVETYIPWNVHEPQEGQFVF 69
Query: 112 KGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEM 171
+ + DI KFVKL S GLY+ LR PY+CAEW FGG P WL P + R+N F E++
Sbjct: 70 EDRYDIGKFVKLAQSIGLYVILRPSPYICAEWEFGGLPAWLLRYPDMVVRSNTPRFMEKV 129
Query: 172 QRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGA 231
+ + + ++ + GGP++M+Q+ENEYG S+G K Y++ S+ G
Sbjct: 130 ANYYEALFKVLVPLQIT--HGGPVLMMQVENEYG----SFGND-KAYLRHVKSLMETNGV 182
Query: 232 GVPWVMC----KQTDAPENIIDA---CNGYYCDGYKPN-----------SYNKPTLWTEN 273
VP +Q ++I+ + + N N P + E
Sbjct: 183 DVPLFTADGSWQQALKAGSLIEDDVFVTANFGSKSRENLAELRQFMLMHHKNWPLMCMEF 242
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY----- 328
WDGW+ W + R + +A + SF N YM+ GGTNFG +G
Sbjct: 243 WDGWFNRWQEEIVTRSADSFQTDLAELVKEQASF-NLYMFRGGTNFGFFNGCSSRQNVDY 301
Query: 329 --ITSYDYDAPIDEYGLLSE 346
ITSYDYDA + E G SE
Sbjct: 302 PQITSYDYDAVLHEDGRPSE 321
>gi|449664450|ref|XP_002165261.2| PREDICTED: beta-galactosidase-like [Hydra magnipapillata]
Length = 589
Score = 169 bits (427), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 107/344 (31%), Positives = 171/344 (49%), Gaps = 34/344 (9%)
Query: 22 MMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDL 81
+ + ++I + +SSS + F + Y++ + DG IS IHY R + W D
Sbjct: 5 VFICLLIVFAKISSSE-----RTFKIDYENNKFLKDGTEFRYISGSIHYMRVPEDYWEDR 59
Query: 82 IAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCA 141
++K ++ G + I+TY+ WN HE G + F G+ ++ KF+KL L + LR GPY+CA
Sbjct: 60 LSKIRKAGLNAIQTYIPWNFHEPTEGNFQFGGQQNVFKFLKLAQKYDLLVILRPGPYICA 119
Query: 142 EWNFGGFPVWLRDIPG---IEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIML 198
EW FGGFP WL G ++ RT++ + ++++ ++ ++ +R + GGPII +
Sbjct: 120 EWEFGGFPYWLLKKVGNKTMQLRTSDNLYLQKVENYMSVLLSGLRPYLY--ENGGPIITV 177
Query: 199 QIENEYG----NMESSYGQQG--KDYVKWAASMALGLGAGVPWVMCKQTDAPENIID--- 249
Q+ENEYG + E Y + + Y+ + GAG ++ C +D
Sbjct: 178 QVENEYGSYGCDHEYMYKLESIFRKYLGENVILFTTDGAGDSYLKCGTIKPLFATVDFGP 237
Query: 250 -ACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFM 308
A Y D + P + +E + GW WGG+ H +ED+ + + S +
Sbjct: 238 TAEPKLYFDIQRKYQPLGPLVNSEFYTGWLDHWGGQHAHTSLEDVTDTLDKMLSLNAS-V 296
Query: 309 NYYMYFGGTNFGRTSGG----------PFYITSYDYDAPIDEYG 342
N YM+ GGTNFG +G P TSYDYDAP+ E G
Sbjct: 297 NMYMFEGGTNFGFMNGANQDSNSLQPQP---TSYDYDAPLSEAG 337
>gi|422859360|ref|ZP_16906010.1| beta-galactosidase [Streptococcus sanguinis SK1057]
gi|327459140|gb|EGF05488.1| beta-galactosidase [Streptococcus sanguinis SK1057]
Length = 592
Score = 169 bits (427), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 113/318 (35%), Positives = 155/318 (48%), Gaps = 38/318 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG ++S I Y R P+ W D + K G + +ETY+ W HE GQ+ +G
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D + KLV GLYL +R PY+CAE++FGG P WL P + R N+ F E++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHF- 130
Query: 176 KKIVDLMREEML--FSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
D + ++L S QGGPI+M+Q+ENEYG SY + K Y++ A M G V
Sbjct: 131 ---YDWLFPKLLPYQSDQGGPILMMQVENEYG----SYAED-KAYMRSIAQMMKVRGVTV 182
Query: 234 P-------WVMCKQTDAPENIIDACNGYYCDGYKPNSYNK-----------PTLWTENWD 275
P W+ ++ G + K N+ N P + TE WD
Sbjct: 183 PLFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKEWPLMCTEFWD 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------- 328
GW++ W + R EDLA V Q G MN ++ GGTNFG SG
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGS--MNLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 329 ITSYDYDAPIDEYGLLSE 346
ITSYD+DAPI E+G +E
Sbjct: 301 ITSYDFDAPITEWGQPTE 318
>gi|237734327|ref|ZP_04564808.1| beta-galactosidase [Mollicutes bacterium D7]
gi|365831197|ref|ZP_09372750.1| hypothetical protein HMPREF1021_01514 [Coprobacillus sp. 3_3_56FAA]
gi|374624872|ref|ZP_09697289.1| hypothetical protein HMPREF0978_00609 [Coprobacillus sp.
8_2_54BFAA]
gi|229382557|gb|EEO32648.1| beta-galactosidase [Coprobacillus sp. D7]
gi|365262188|gb|EHM92085.1| hypothetical protein HMPREF1021_01514 [Coprobacillus sp. 3_3_56FAA]
gi|373916155|gb|EHQ47903.1| hypothetical protein HMPREF0978_00609 [Coprobacillus sp.
8_2_54BFAA]
Length = 584
Score = 169 bits (427), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 106/326 (32%), Positives = 155/326 (47%), Gaps = 44/326 (13%)
Query: 51 HRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYN 110
++ I+GN+ +IS +HY R PE W D + K G + +ETYV WN HE +G+Y+
Sbjct: 7 NKEFFINGNKVKIISGAVHYFRIVPEYWRDTLLDLKAMGCNTVETYVPWNLHEPYQGKYD 66
Query: 111 FKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEE 170
F G DI F+KL L++ LR PY+CAEW GG P WL P I RTN+ + +
Sbjct: 67 FSGIKDIETFLKLAEELELFVILRASPYICAEWEMGGLPAWLLKYPRIRLRTNDKQYLKC 126
Query: 171 MQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLG 230
+ ++ ++ + + + Q GPII+ Q+ENEYG SYG+ K+Y+ M G
Sbjct: 127 LDQYFSILLPKLSKYQIT--QNGPIILAQLENEYG----SYGED-KEYLLAVYQMMRKYG 179
Query: 231 AGVPWVMCKQT-----------------------DAPENIIDACNGYYCDGYKPNSYNKP 267
VP T A ENI + + ++ + P
Sbjct: 180 IEVPLFTADGTWHEALNAGSLLEKKVFPTGNFGSQAKENI--TVLKKFMESHQITA---P 234
Query: 268 TLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF 327
+ E WDGW+ W + R ++ + G +N+YM+ GGTNFG +G
Sbjct: 235 LMCMEFWDGWFNRWNQEIIKRDPQEFVNSAQEMLSLGS--VNFYMFQGGTNFGWMNGCSA 292
Query: 328 Y-------ITSYDYDAPIDEYGLLSE 346
ITSYDYDA + EYG +E
Sbjct: 293 RKEHDLPQITSYDYDAILTEYGAKTE 318
Score = 40.0 bits (92), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 45/183 (24%), Positives = 72/183 (39%), Gaps = 56/183 (30%)
Query: 578 NDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG---DIDLSKILWTYQVGLKGEF 634
+DL +L + +G NYG+ L+ + + G +NG DI +K Y +
Sbjct: 438 HDLKILMENMGRVNYGSKLQ-------AETQQKGIRNGVILDIHFTKKWKHYCLNF---- 486
Query: 635 QQIYSIEENEAEWTDLT--RDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGH 692
E DL +G S +++ F+A D + +DL GKG +VNGH
Sbjct: 487 -----------EHLDLLNWENGYQSGPGFHEYIFEA-DEVKETFIDLEGFGKGVVFVNGH 534
Query: 693 HIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
H GR++ PT + Y +P +L+ N ++IFE
Sbjct: 535 HCGRFYE---------------------------AGPTLSLY-IPGPFLKKGINQIIIFE 566
Query: 753 ETG 755
G
Sbjct: 567 TEG 569
>gi|257869131|ref|ZP_05648784.1| 35 glycosylhydrolase [Enterococcus gallinarum EG2]
gi|257803295|gb|EEV32117.1| 35 glycosylhydrolase [Enterococcus gallinarum EG2]
Length = 584
Score = 169 bits (427), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 105/318 (33%), Positives = 163/318 (51%), Gaps = 31/318 (9%)
Query: 63 LISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVK 122
+IS IHY R P W D + K + G + +ETYV WN HE G+++F D+ +F++
Sbjct: 19 IISGSIHYFRVVPAYWRDRLEKLRLMGCNTVETYVPWNMHEPQEGKFDFSDNLDLRRFIQ 78
Query: 123 LVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLM 182
L GLY+ LR PY+CAEW FGG P WL P ++ R + PF E++ R+ ++ +
Sbjct: 79 LAQEVGLYVILRPAPYICAEWEFGGLPYWLLKDPFMKIRFDYPPFMEKIARYFTQLFSQV 138
Query: 183 REEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV-------PW 235
+ + Q GPI+M+Q+ENEYG SYG K Y++ +A + G V PW
Sbjct: 139 SDLQIT--QEGPILMMQVENEYG----SYGND-KSYLRKSAELMRHNGIDVSLFTSDGPW 191
Query: 236 VMCKQTDAPENI---IDACNGYYCDGYKP----NSYNKPTLWTENWDGWYTTWGGRLPH- 287
+ + + ++I C + ++ + +P + E W GW+ WG H
Sbjct: 192 LDMLENGSIKDIALPTINCGSDIQENFRKLQEFHGKKQPLMVMEFWIGWFDAWGDDKHHT 251
Query: 288 RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAPIDEY 341
V D A + + G +N YM+ GGTNFG +G +Y +TSYDYDA + E+
Sbjct: 252 TSVTDAANELRDCLEAGS--VNIYMFHGGTNFGFMNGANYYEKLSPDVTSYDYDALLSEW 309
Query: 342 GLLSEPKWGHLKDLHAAI 359
G ++ PK+ + + I
Sbjct: 310 GDVT-PKYEAFQQVIGEI 326
>gi|257143787|emb|CAZ44333.1| beta-D-galactosidase [Paenibacillus thiaminolyticus]
Length = 583
Score = 169 bits (427), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 116/336 (34%), Positives = 163/336 (48%), Gaps = 44/336 (13%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+SYD + LIS IHY R P W D + K K G + IETYV WN HE
Sbjct: 3 TLSYDEGQFKMGDRPIQLISGAIHYFRIVPAYWEDRLRKIKAMGCNCIETYVAWNVHEPR 62
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+++F+ D+ +FV+L G GLY+ +R PY+CAEW FGG P WL + R N+
Sbjct: 63 EGEFHFERMADVAEFVRLAGELGLYVIVRPSPYICAEWEFGGLPAWLLK-DDMRLRCNDP 121
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
F E++ + ++ + L + +GGPII +QIENEYG SYG D A
Sbjct: 122 RFLEKVSAYYDALLPQLTP--LLATKGGPIIAVQIENEYG----SYGN---DQAYLQAQR 172
Query: 226 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDG--------------------YKPNSYN 265
A+ + GV V+ +D P++ D G +G Y+P+
Sbjct: 173 AMLIERGVD-VLLFTSDGPQD--DMLQGGMAEGVLATVNFGSRPKEAFDKLKEYQPDG-- 227
Query: 266 KPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG 325
P + E W+GW+ W R +D A + G S +N+YM GGTNFG SG
Sbjct: 228 -PLMCMEYWNGWFDHWFEPHHTRDAKDAARVLDDMLGMGAS-VNFYMVHGGTNFGFGSGA 285
Query: 326 PF------YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+TSYDYDA I E G L+ PK+ +++
Sbjct: 286 NHSDKYEPTVTSYDYDAAISEAGDLT-PKYHAFREV 320
Score = 39.3 bits (90), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 55/215 (25%), Positives = 81/215 (37%), Gaps = 46/215 (21%)
Query: 539 VTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEK 598
+TI +RD VF++ +L G V+ W PV L +L + +G NYG L
Sbjct: 396 LTIQDVRDRALVFLDRKLIG-VVERWNPQSLPVTVPEDGAQLDILVENMGRVNYGPQL-Y 453
Query: 599 DGAGFRGQVKLTGFKNGDIDLSKILWTYQV-GLKGEFQQIYSIEENEAEWTDLTRDGIPS 657
D G V+L G + L+ ++V L+ E S + A + + G
Sbjct: 454 DRKGITHGVRLNG---------QFLFHWEVRSLELETLAGLSFDTAGARAWEEEQPG--- 501
Query: 658 TFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAY 717
+Y+ D L L KG +VNG ++GRYW V
Sbjct: 502 ---FYEAKLVIEDEPKDTFLRLEGWKKGVVFVNGFNLGRYWEV----------------- 541
Query: 718 NSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
P Q Y VP L+ N +V+FE
Sbjct: 542 ----------GPQQALY-VPAPVLRQGENEIVVFE 565
>gi|357050010|ref|ZP_09111224.1| hypothetical protein HMPREF9478_01207 [Enterococcus saccharolyticus
30_1]
gi|355382493|gb|EHG29591.1| hypothetical protein HMPREF9478_01207 [Enterococcus saccharolyticus
30_1]
Length = 584
Score = 169 bits (427), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 106/318 (33%), Positives = 163/318 (51%), Gaps = 31/318 (9%)
Query: 63 LISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVK 122
+IS IHY R P W D + K + G + +ETYV WN HE G+++F D+ +F++
Sbjct: 19 IISGSIHYFRVVPAYWRDRLEKLRLMGCNTVETYVPWNMHEPQEGKFDFSDNLDLRRFIQ 78
Query: 123 LVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLM 182
L GLY+ LR PY+CAEW FGG P WL P ++ R + PF E++ R+ ++ +
Sbjct: 79 LAQEVGLYVILRPAPYICAEWEFGGLPYWLLKDPFMKIRFDYPPFMEKIARYFTQLFSQV 138
Query: 183 REEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV-------PW 235
+ L Q GPI+M+Q+ENEYG SYG K Y++ +A + G V PW
Sbjct: 139 SD--LQITQEGPILMMQVENEYG----SYGND-KSYLRKSAELMRHNGIDVPLFTSDGPW 191
Query: 236 VMCKQTDAPENI---IDACNGYYCDGYKP----NSYNKPTLWTENWDGWYTTWGGRLPH- 287
+ + + ++I C + ++ + +P + E W GW+ WG H
Sbjct: 192 LDMLENGSIKDIALPTINCGSDIQENFRKLQEFHGKKQPLMVMEFWIGWFDAWGDDKHHT 251
Query: 288 RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAPIDEY 341
V D A + + G +N YM+ GGTNFG +G +Y +TSYDYDA + E+
Sbjct: 252 TSVTDAANELRDCLEAGS--VNIYMFHGGTNFGFMNGANYYEKLLPDVTSYDYDALLSEW 309
Query: 342 GLLSEPKWGHLKDLHAAI 359
G ++ PK+ + + I
Sbjct: 310 GDVT-PKYEAFQQVIGEI 326
>gi|333377694|ref|ZP_08469427.1| hypothetical protein HMPREF9456_01022 [Dysgonomonas mossii DSM
22836]
gi|332883714|gb|EGK03994.1| hypothetical protein HMPREF9456_01022 [Dysgonomonas mossii DSM
22836]
Length = 630
Score = 169 bits (427), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 116/364 (31%), Positives = 173/364 (47%), Gaps = 51/364 (14%)
Query: 26 MMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKS 85
I L S SS S F + + + DG +IS +HYPR + W +
Sbjct: 9 FFILLFVFSISSFSQKKHTFEIK--NGDFVYDGKPVRIISGEMHYPRIPHQYWRHRMQML 66
Query: 86 KEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNF 145
K G + + TYVFWN HE G+++F G ++ +++K+ G GL + LR GPYVCAEW F
Sbjct: 67 KAMGLNAVATYVFWNIHEPEPGKWDFTGDKNLAEYIKIAGEEGLMVILRPGPYVCAEWEF 126
Query: 146 GGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYG 205
GG+P WL+++ G+E R +N F + Q ++ ++ + + +GGPI+M+Q ENE+G
Sbjct: 127 GGYPWWLQNVEGLELRRDNEQFLKYTQLYINRLYKEVGNLQIT--KGGPIVMVQAENEFG 184
Query: 206 NMESSYGQQGKD-----YVKWAASMALGL-------------------GAGVPWVMCKQT 241
SY Q KD + ++ A + L G VP +
Sbjct: 185 ----SYVSQRKDIPLEEHRRYNAKIVQQLKDAGFDVPSFTSDGSWLFEGGAVPGALPTAN 240
Query: 242 DAP--ENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVAR 299
EN+ A + Y N P + E + GW W P +A +
Sbjct: 241 GESNIENLKKAVDKY-------NGGQGPYMVAEFYPGWLAHWLEPHPQISATSIARQTEK 293
Query: 300 FFQRGGSFMNYYMYFGGTNFGRTSGGPF--------YITSYDYDAPIDEYGLLSEPKWGH 351
+ Q S +NYYM GGTNFG TSG + +TSYDYDAPI E G ++ PK+
Sbjct: 294 YLQNNVS-INYYMVHGGTNFGFTSGANYDKKHDIQPDLTSYDYDAPISEAGWVT-PKYDS 351
Query: 352 LKDL 355
L+++
Sbjct: 352 LRNV 355
Score = 44.3 bits (103), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 55/231 (23%), Positives = 93/231 (40%), Gaps = 65/231 (28%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYND-LILLSQTVGLQNYGAFL 596
T+ I+ +RD ++ N + G + ++ + ++ +N L +L + +G NYG+ +
Sbjct: 426 TLKINGLRDYAIIYANDEKVGELNRYFNQ--DSIDVDIPFNSTLEILVENMGRINYGSEI 483
Query: 597 EKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIP 656
+ G V + G + ++G++Q +Y I +EA D ++
Sbjct: 484 VHNTKGIISPVIINGME----------------IEGDWQ-MYQIPMDEA--PDFSKMQKN 524
Query: 657 STF--------------TWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVA 702
S F YK F+ + D LD+ GKG ++NG +IGRYW V
Sbjct: 525 SVFGNTESAAKRLLGAPALYKGTFNLTETGD-TFLDMEDWGKGIVFINGKNIGRYWHV-- 581
Query: 703 PKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEE 753
P QT Y VP WL+ N +VIFE+
Sbjct: 582 -------------------------GPQQTLY-VPGVWLKKGQNEIVIFEQ 606
>gi|224027078|ref|ZP_03645444.1| hypothetical protein BACCOPRO_03839 [Bacteroides coprophilus DSM
18228]
gi|224020314|gb|EEF78312.1| hypothetical protein BACCOPRO_03839 [Bacteroides coprophilus DSM
18228]
Length = 783
Score = 169 bits (427), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 109/331 (32%), Positives = 165/331 (49%), Gaps = 27/331 (8%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
+P ++ +++G ++ +A IHY R E W I K G + I Y FWN H
Sbjct: 29 EPQTFEIGNKEFLLNGKPFLIKAAEIHYTRIPAEYWEHRIEMCKALGMNTICIYAFWNIH 88
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E G+++F+G+ND+ +F +L G+Y+ LR GPYVC+EW GG P WL I RT
Sbjct: 89 EQRPGEFDFEGQNDVARFCRLAQKHGMYIMLRPGPYVCSEWEMGGLPWWLLKKKDIALRT 148
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWA 222
++ F E + F+ ++ + + L + +GG IIM+Q+ENEYG +Y + K+Y+
Sbjct: 149 SDPYFLERTKIFMNELGKQLAD--LQAPRGGNIIMVQVENEYG----AYAED-KEYIASI 201
Query: 223 ASMALGLG-AGVPWVMCK-----QTDAPENIIDACN-GYYCD------GYKPNSYNKPTL 269
+ G G VP C Q + ++++ N G D + P +
Sbjct: 202 RDIVRGAGFTDVPLFQCDWASTFQRNGLDDLLWTINFGTGADIDQQFKALREARPETPLM 261
Query: 270 WTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PF 327
+E W GW+ WG + RP + + + R SF + YM GGT FG G P
Sbjct: 262 CSEYWSGWFDHWGRKHETRPADVMVKGIKDMMDRNISF-SLYMTHGGTTFGHWGGANSPS 320
Query: 328 Y---ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
Y +SYDYDAPI E G + PK+ L+DL
Sbjct: 321 YSAMCSSYDYDAPISEAGWAT-PKYYQLRDL 350
Score = 43.5 bits (101), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 51/228 (22%), Positives = 88/228 (38%), Gaps = 47/228 (20%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLE 597
+ +D D +V++NGQL G + + + + L +L + +G N+ +
Sbjct: 425 VLLVDEPHDWAQVYLNGQLLGRLDRRRGENILSLPDVKAGTRLDILVEAMGRVNFDRAIH 484
Query: 598 KDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPS 657
D G +V+L G + W Q+YS +A++ + S
Sbjct: 485 -DRKGITDKVQL--LNEGCEPQTLTGW-----------QVYSFP-TDAKFAADKQFAKGS 529
Query: 658 TF---TWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYR 714
F +Y+T F D LD+ + GKG WVNGH +GR+W +
Sbjct: 530 KFDGPAYYRTTFTL-DKTGDTFLDMSTWGKGMVWVNGHAMGRFWKI-------------- 574
Query: 715 GAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEIS 762
P QT + +P WL+ N +V+ + G + +I
Sbjct: 575 -------------GPQQTLF-MPGCWLKKGKNEIVVLDLLGPDETKIE 608
>gi|320162379|ref|YP_004175604.1| beta-galactosidase [Anaerolinea thermophila UNI-1]
gi|319996233|dbj|BAJ65004.1| beta-galactosidase [Anaerolinea thermophila UNI-1]
Length = 583
Score = 168 bits (426), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 108/321 (33%), Positives = 164/321 (51%), Gaps = 31/321 (9%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG +++ +HY R P W D + K K G + +ETYV WN HE G+++F
Sbjct: 13 LDGEPFRILAGAMHYFRVHPAYWKDRLLKLKAMGLNTVETYVAWNLHEPHEGEFHFGDWL 72
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
+I ++++L G GLY+ +R GPY+CAEW GG P WL P ++ R P+ + + +
Sbjct: 73 NIERYIELAGELGLYVIVRPGPYICAEWEMGGLPAWLLKDPQMKLRCMYQPYLDAVGEYF 132
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGA---- 231
++ + R L S +GGPII +Q+ENEYG SYG + Y+K+ + G
Sbjct: 133 SQL--MHRLVPLQSTRGGPIIAMQVENEYG----SYGNDTR-YLKYLEELLRQCGVDVLL 185
Query: 232 ----GVPWVMCKQTDAPENIIDACN-----GYYCDGYKPNSYNKPTLWTENWDGWYTTWG 282
GV M + P ++ A N G + + P L E WDGW+ WG
Sbjct: 186 FTADGVADEMMQYGSLP-HLFKAVNFGNRPGDAFEKLREYQTGGPLLVAEFWDGWFDHWG 244
Query: 283 GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG-----PFY---ITSYDY 334
R R ++A + G S +N YM+ GGTNFG +G P Y +TSYDY
Sbjct: 245 ERHHTRSAGEVARVLDDLLSEGAS-VNLYMFHGGTNFGFMNGANAFPSPHYTPTVTSYDY 303
Query: 335 DAPIDEYGLLSEPKWGHLKDL 355
DAP+ E G ++ PK+ ++++
Sbjct: 304 DAPLSECGNIT-PKYEAMREV 323
>gi|198433885|ref|XP_002127100.1| PREDICTED: similar to galactosidase, beta 1-like 2 [Ciona
intestinalis]
Length = 658
Score = 168 bits (426), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 106/308 (34%), Positives = 159/308 (51%), Gaps = 18/308 (5%)
Query: 52 RAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNF 111
+ +DG +IS +HY R E W D + K K G + IETYV WN HE I G+YNF
Sbjct: 63 KTFKLDGKPMTIISGAVHYFRMPREYWRDRLMKMKACGLNTIETYVPWNLHEPIPGKYNF 122
Query: 112 KGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEM 171
G D+V F+ L Y+ LR GPY+C+EW FGG P WL P ++ RT P+ +
Sbjct: 123 TGDLDLVHFILLAHKLEFYVLLRPGPYICSEWEFGGLPSWLLRDPKMKVRTMYPPYIAAV 182
Query: 172 QRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVKWAASMALGL 229
++ ++ ++ L GGPII Q++NEYG+ ++ Y K++++ + L
Sbjct: 183 TKYFNYLLPFVKP--LQYQYGGPIIAFQLDNEYGSYFKDADYLPYLKEFLQNKGIIELLF 240
Query: 230 GAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYN----KPTLWTENWDGWYTTWGGRL 285
+ + +QT ++ N + + + N P + E W GW+ WG +
Sbjct: 241 ISDSIEGLRQQTIP--GVLKTVNFKRMENHFTDLSNMQPDAPLMVMEFWTGWFDWWGEKH 298
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG-----PFY--ITSYDYDAPI 338
V++ + F +GGS +N+YM+FGGTNFG +G F+ ITSYDYDA I
Sbjct: 299 HILTVQEFGETLNEIFSQGGS-VNFYMFFGGTNFGFMNGAYKDGTGFHADITSYDYDALI 357
Query: 339 DEYGLLSE 346
E G L+E
Sbjct: 358 AENGDLTE 365
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 35/78 (44%), Gaps = 28/78 (35%)
Query: 677 LDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHV 736
LD+ S GKG +VNG ++GRYW++ P QT + +
Sbjct: 589 LDMSSWGKGVVFVNGRNLGRYWSI---------------------------GPQQTLF-L 620
Query: 737 PRSWLQASNNLLVIFEET 754
P WL N ++IFEET
Sbjct: 621 PGPWLHKGANEIIIFEET 638
>gi|445062232|ref|ZP_21374649.1| beta-galactosidase [Brachyspira hampsonii 30599]
gi|444506390|gb|ELV06735.1| beta-galactosidase [Brachyspira hampsonii 30599]
Length = 592
Score = 168 bits (426), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 112/339 (33%), Positives = 161/339 (47%), Gaps = 37/339 (10%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
I++G ++S IHY R E W D + K G + +ETY+ WN HE G ++F G
Sbjct: 10 FILNGKPIKILSGAIHYFRFVREYWEDCLYNLKAAGFNTVETYIPWNIHEIDEGFFDFSG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
DI F+K L + LR PY+CAEW FGG P WL I+ RTN F ++
Sbjct: 70 NKDIASFIKTAQKLDLLVILRPTPYICAEWEFGGLPAWLLRYDNIKVRTNTQLFLSKVDA 129
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ K++ + + + + GP+IM+QIENEYG S+G K+Y++ ++ + GA V
Sbjct: 130 YYKELFKHIDDLQI--TRNGPVIMMQIENEYG----SFGND-KEYLRALKNLMIKHGAEV 182
Query: 234 PW---------VMCKQTDAPENIIDACN------GYYCDGYK---PNSYNKPTLWTENWD 275
P V+ T + I+ N + D K KP + E WD
Sbjct: 183 PLFTSDGAWDAVLEAGTLIDDGILATVNFGSKAKESFDDTEKFFARKGIKKPLMCMEFWD 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-------Y 328
GW+ W + R +D V +RG +N YM+ GGTNFG +G
Sbjct: 243 GWFNLWKDPIIKRDADDFIMEVKEILKRGS--INLYMFIGGTNFGFYNGTSVTGYTDFPQ 300
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLK---DLHAAIKLCEP 364
ITSYDYDA + E+G +E + K +L IK EP
Sbjct: 301 ITSYDYDAVLTEWGEPTEKFYKLQKLINELFPEIKTFEP 339
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 81/156 (51%), Gaps = 21/156 (13%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVE--FQSGYNDLILLSQTVGLQNYGAFLEKDGAGF 603
D + ++NG+ G + + ++++P+E F G N L LL + VG NYG L++
Sbjct: 409 DRVHFYLNGEYKG--VKYQDELIEPIEMHFNDGDNILELLVENVGRVNYGYKLQE----- 461
Query: 604 RGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYK 663
QVK G + G + + + F+Q Y++ + E D + D I +T ++Y+
Sbjct: 462 CSQVK--GIRIGVMA--------DIHFETGFEQ-YALSLDNIEDVDFSADWIENTPSFYR 510
Query: 664 TYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWT 699
F+ + D LD +GKG A++NG ++GRYW+
Sbjct: 511 YEFEVKEAADTF-LDCSKLGKGVAFINGFNLGRYWS 545
>gi|297842039|ref|XP_002888901.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297334742|gb|EFH65160.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 686
Score = 168 bits (426), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 115/350 (32%), Positives = 171/350 (48%), Gaps = 45/350 (12%)
Query: 57 DGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKND 116
DGN +I +HY R PE W D + ++K G + I+ YV WN HE G+ F+G D
Sbjct: 72 DGNHFQIIGGDLHYFRVLPEYWEDRLLRAKALGLNTIQVYVPWNLHEPKPGKMVFEGIGD 131
Query: 117 IVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDI-PGIEFRTNNAPFKEEMQR-- 173
+V F+KL + LR GPY+C EW+ GGFP WL + P ++ RT++ + + ++R
Sbjct: 132 LVSFLKLCDKLDFMVMLRAGPYICGEWDLGGFPAWLLSVKPRLQLRTSDPAYLKLVERWW 191
Query: 174 --FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALG-LG 230
+ KI L+ GGP+IM+QIENEYG SYG K Y++ SMA G LG
Sbjct: 192 GVLLPKIFPLIYS------NGGPVIMVQIENEYG----SYGND-KAYLRKLVSMARGHLG 240
Query: 231 ---------AGVPWVMCKQTDAPENIIDACNGYYCDGYKP-----NSYN----KPTLWTE 272
G + K T +++ A + D P +N P L +E
Sbjct: 241 DDIIVYTTDGGTKETLEKGTVPVDDVYSAVDFTTGDDPWPIFELQKKFNAPGSSPPLSSE 300
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
+ GW T WG ++ E A ++ + R GS + YM GGTNFG +G
Sbjct: 301 FYTGWLTHWGEKIAKTDAEFTATSLEKILSRNGSAV-LYMVHGGTNFGFYNGANTGSEES 359
Query: 328 ----YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQ 373
+TSYDYDAPI E G + PK+ L+ + + +++ ++ +
Sbjct: 360 DYKPDLTSYDYDAPIKESGDIDNPKFRALQRVIKKYNVASHSIIPSNKQR 409
>gi|332030018|gb|EGI69843.1| Beta-galactosidase [Acromyrmex echinatior]
Length = 594
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 112/333 (33%), Positives = 166/333 (49%), Gaps = 31/333 (9%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
+V Y++ ++DG +S HY R + W D + K + G + I TYV W+ HE
Sbjct: 1 DVDYENNQFLLDGKPFQYVSGSFHYFRTPRQYWRDRLRKMRAAGLNAISTYVEWSLHEPE 60
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVW-LRDIPGIEFRTNN 164
GQ+N+ G D+V F+ + L++ LR GPY+CAE + GG P W LR++P I RT +
Sbjct: 61 PGQFNWTGDADLVNFLNIAQEEDLFVLLRPGPYICAERDMGGLPYWLLREVPNINLRTKD 120
Query: 165 APFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM---ESSYGQQGKD-YVK 220
A F ++ +I+ +R L GGPIIM+QIENEYG+ + Y K+ +VK
Sbjct: 121 ADFVRYATLYLNEILSKIRP--LLRGNGGPIIMVQIENEYGSYYACDIEYMDMLKEVFVK 178
Query: 221 WAASMALGL---GAGVPWVMCKQTDAPENIID------ACNGYYC-DGYKPNSYNKPTLW 270
+ AL GA + C +D N + Y+P P +
Sbjct: 179 KVGNKALLYTTDGAAASLLRCGFISGAYATVDFGTASNVTNSFLSMRLYQPRG---PLVN 235
Query: 271 TENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG----- 325
+E + GW T WG E + ++ G S +N+YM++GGTNFG TSG
Sbjct: 236 SEFYPGWLTHWGEPFQRTKTEAIVKSLEEMLALGAS-VNFYMFYGGTNFGFTSGANGGAG 294
Query: 326 ---PFYITSYDYDAPIDEYGLLSEPKWGHLKDL 355
P +TSYDYDAP+ E G PK+ ++D+
Sbjct: 295 VYNP-QLTSYDYDAPLTEAG-DPTPKYFAIRDV 325
Score = 40.4 bits (93), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 32/76 (42%), Gaps = 27/76 (35%)
Query: 677 LDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHV 736
LD GKG A+VNGH++GRYW +V P Q +V
Sbjct: 525 LDTTGWGKGVAFVNGHNLGRYWPLVGP---------------------------QITLYV 557
Query: 737 PRSWLQASNNLLVIFE 752
P +L+ N L+I E
Sbjct: 558 PAPYLREGENELIILE 573
>gi|257865837|ref|ZP_05645490.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC30]
gi|257872172|ref|ZP_05651825.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC10]
gi|257799771|gb|EEV28823.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC30]
gi|257806336|gb|EEV35158.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC10]
Length = 585
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 109/307 (35%), Positives = 152/307 (49%), Gaps = 28/307 (9%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+D +IS IHY R PE W D + K + G + +ETYV WN HE+ G Y F G
Sbjct: 12 LDNKPLKVISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFDGIL 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ +F++ GLY+ LR PY+CAEW FGG P WL P ++ R + PF E++ R+
Sbjct: 72 DLRRFIQTAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYF 131
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
+ +R+ L QGGPIIM+Q+ENEYG SY K+Y++ + G P
Sbjct: 132 AHLFPQVRD--LQITQGGPIIMMQVENEYG----SYAND-KEYLRKMVAAMRQHGVETPL 184
Query: 236 VMCKQT--DAPEN--IIDA------CNGYYCDGYKP----NSYNKPTLWTENWDGWYTTW 281
V D EN I D C + ++ + +P + E W GW+ W
Sbjct: 185 VTSDGPWHDMLENGSIKDLALPTINCGSNIKENFEKLRRFHGEKRPLMVMEFWIGWFDAW 244
Query: 282 GGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYD 335
G H A + GS +N YM+ GGTNFG +G +Y +TSYDYD
Sbjct: 245 GDDQHHTTSTQDAVKELQDCLALGS-VNIYMFHGGTNFGFMNGSNYYERLAPDVTSYDYD 303
Query: 336 APIDEYG 342
A + E+G
Sbjct: 304 ALLTEWG 310
>gi|257876100|ref|ZP_05655753.1| glycosyl hydrolase [Enterococcus casseliflavus EC20]
gi|257810266|gb|EEV39086.1| glycosyl hydrolase [Enterococcus casseliflavus EC20]
Length = 591
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 115/328 (35%), Positives = 160/328 (48%), Gaps = 42/328 (12%)
Query: 42 FKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNA 101
+ F + D ++DG LIS IHY R TP W D + K GA+ +ETY+ WN
Sbjct: 1 MRTFEIKED---FLLDGKPIKLISGAIHYFRMTPAQWTDSLYNLKALGANTVETYIPWNL 57
Query: 102 HESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFR 161
HE G Y+F+G DI FVK + GL + LR Y+CAEW FGG P WL + P + R
Sbjct: 58 HEPREGVYDFEGMKDICAFVKQAQALGLMVILRPSVYICAEWEFGGLPAWLLNEP-MRLR 116
Query: 162 TNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKW 221
+ + F +++ + + V L + L GGP+IM+Q+ENEYG SYG + K Y++
Sbjct: 117 STDPRFMAKVRNYFQ--VLLPKLVPLQITHGGPVIMMQVENEYG----SYGME-KAYLRQ 169
Query: 222 AASMALGLGAGVPWVMCKQTDAPENIIDA---------CNGYYCDGYKPNSY-------- 264
+ G VP + A E ++DA G + K N+
Sbjct: 170 TKELMEEYGIDVP--LFTSDGAWEEVLDAGTLIEDDVFVTGNFGSRSKENAAVMKEFMAK 227
Query: 265 ---NKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGR 321
N P + E WDGW+ WG + R +DLA V G +N YM+ GGTNFG
Sbjct: 228 HGKNWPIMCMEYWDGWFNRWGEPIIKRAGQDLANEVKEMLAVGS--LNLYMFHGGTNFGF 285
Query: 322 TSG----GPF---YITSYDYDAPIDEYG 342
+G G ++SYDYDA + E G
Sbjct: 286 YNGCSARGALDLPQVSSYDYDALLTEAG 313
>gi|423331257|ref|ZP_17309041.1| hypothetical protein HMPREF1075_01054 [Parabacteroides distasonis
CL03T12C09]
gi|409230553|gb|EKN23415.1| hypothetical protein HMPREF1075_01054 [Parabacteroides distasonis
CL03T12C09]
Length = 768
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 108/327 (33%), Positives = 170/327 (51%), Gaps = 35/327 (10%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G ++S +HYPR + W + + G + + TYVFWN HE+ G+++F+G
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
++ +++++ G GL + LR GPYVCAEW FGG+P WL++IPG+E R +N F + + ++
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKD-----YVKWAASMALGL- 229
K+ + + + L +GGPIIM+Q ENE+G SY Q KD + ++ A + L
Sbjct: 159 DKLYEQVGD--LQVSKGGPIIMVQAENEFG----SYVAQRKDIPLEEHRRYNAKIKRQLA 212
Query: 230 --GAGVPWV------MCKQTDAPENIIDACNGYYCDGYKP--NSYN---KPTLWTENWDG 276
G VP + + P + A + K N Y+ P + E + G
Sbjct: 213 DAGFNVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPG 272
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF--------Y 328
W W P +A + Q SF N+YM GGTNFG TSG +
Sbjct: 273 WLMHWAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHDIQPD 331
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+TSYDYDAPI E G ++ PK+ ++++
Sbjct: 332 LTSYDYDAPISEAGWVT-PKFDSIRNV 357
>gi|256840666|ref|ZP_05546174.1| glycoside hydrolase, family 35 [Parabacteroides sp. D13]
gi|256737938|gb|EEU51264.1| glycoside hydrolase, family 35 [Parabacteroides sp. D13]
Length = 768
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 108/327 (33%), Positives = 170/327 (51%), Gaps = 35/327 (10%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G ++S +HYPR + W + + G + + TYVFWN HE+ G+++F+G
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
++ +++++ G GL + LR GPYVCAEW FGG+P WL++IPG+E R +N F + + ++
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKD-----YVKWAASMALGL- 229
K+ + + + L +GGPIIM+Q ENE+G SY Q KD + ++ A + L
Sbjct: 159 DKLYEQVGD--LQVSKGGPIIMVQAENEFG----SYVAQRKDIPLEEHRRYNAKIKRQLA 212
Query: 230 --GAGVPWV------MCKQTDAPENIIDACNGYYCDGYKP--NSYN---KPTLWTENWDG 276
G VP + + P + A + K N Y+ P + E + G
Sbjct: 213 DAGFNVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPG 272
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF--------Y 328
W W P +A + Q SF N+YM GGTNFG TSG +
Sbjct: 273 WLMHWAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHDIQPD 331
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+TSYDYDAPI E G ++ PK+ ++++
Sbjct: 332 LTSYDYDAPISEAGWVT-PKFDSIRNV 357
>gi|298376422|ref|ZP_06986377.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_19]
gi|298266300|gb|EFI07958.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_19]
Length = 768
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 108/327 (33%), Positives = 170/327 (51%), Gaps = 35/327 (10%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G ++S +HYPR + W + + G + + TYVFWN HE+ G+++F+G
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
++ +++++ G GL + LR GPYVCAEW FGG+P WL++IPG+E R +N F + + ++
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKD-----YVKWAASMALGL- 229
K+ + + + L +GGPIIM+Q ENE+G SY Q KD + ++ A + L
Sbjct: 159 DKLYEQVGD--LQVSKGGPIIMVQAENEFG----SYVAQRKDIPLEEHRRYNAKIKRQLA 212
Query: 230 --GAGVPWV------MCKQTDAPENIIDACNGYYCDGYKP--NSYN---KPTLWTENWDG 276
G VP + + P + A + K N Y+ P + E + G
Sbjct: 213 DAGFNVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPG 272
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF--------Y 328
W W P +A + Q SF N+YM GGTNFG TSG +
Sbjct: 273 WLMHWAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHDIQPD 331
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+TSYDYDAPI E G ++ PK+ ++++
Sbjct: 332 LTSYDYDAPISEAGWVT-PKFDSIRNV 357
>gi|346320352|gb|EGX89953.1| beta-calactosidase, putative [Cordyceps militaris CM01]
Length = 633
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 112/347 (32%), Positives = 171/347 (49%), Gaps = 42/347 (12%)
Query: 27 MIHLSCVSSS-SASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKS 85
++ LS +S++ +A T P + SY+ +++G +I + R PE W + +
Sbjct: 8 LVALSALSATLAAETTHAPGSFSYNRTDFLLNGQPFQIIGGQMDPQRILPEYWTHRLKMA 67
Query: 86 KEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNF 145
+ G + I +Y++WN HE G ++F G+ND+ +F +L GL + LR GPY+C E ++
Sbjct: 68 RAMGLNTIFSYLYWNLHEPRPGAWDFSGRNDVARFFRLAQQEGLRVVLRPGPYICGERDW 127
Query: 146 GGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYG 205
GGFP WL +PG+ R NN PF + + ++ ++ + + + QGGPI+M Q+ENEYG
Sbjct: 128 GGFPAWLSQVPGMAVRQNNRPFLDAAKSYIDRLGKELGQLQIT--QGGPILMAQLENEYG 185
Query: 206 NMESSYGQQGKDYVKWAASMA---------LGLGAGVPWVMCKQTDAPENIID------- 249
+ + K Y+ A+M G G ++ Q +ID
Sbjct: 186 SFGTD-----KTYLAALAAMLRENFDVFLYTNDGGGQSYLEGGQLHGVLAVIDGDSQSGF 240
Query: 250 -ACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPV----EDLAFAVARF--FQ 302
A + Y D P S P L E + W WG PH+ + D+A AVA
Sbjct: 241 AARDKYVTD---PTSLG-PQLNGEYYISWIDQWGSDYPHQQIAGSQADVAKAVADLDWTL 296
Query: 303 RGGSFMNYYMYFGGTNFGRTSG-----GPF--YITSYDYDAPIDEYG 342
GG + YM+ GGTNFG +G GP TSYDY AP+DE G
Sbjct: 297 AGGYSFSIYMFHGGTNFGFENGGIRDDGPLAAMTTSYDYGAPLDESG 343
>gi|157106609|ref|XP_001649402.1| beta-galactosidase [Aedes aegypti]
gi|108879821|gb|EAT44046.1| AAEL004575-PA [Aedes aegypti]
Length = 648
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 168/324 (51%), Gaps = 30/324 (9%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
+ F + Y++ ++DG I+ HY RA P+ W ++ + G + + TYV W+ H
Sbjct: 32 RTFTIDYENNTFLLDGAPFQYIAGSFHYFRALPQAWGPILKSMRAAGLNAVTTYVEWSLH 91
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRD-IPGIEFR 161
+G YN+ G DI +FV+L + L + LR GPY+CAE + GGFP WL + PGI+ R
Sbjct: 92 NPKKGVYNWDGMADIERFVQLAQNEDLLVILRPGPYICAERDMGGFPYWLLNKYPGIQLR 151
Query: 162 TNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM---ESSYGQQGKD- 217
T + + E++ + ++ R E F GGPIIM+Q+ENEYG+ + Y + +D
Sbjct: 152 TADVAYLREVRTWYAELFS--RLEPYFYGNGGPIIMVQVENEYGSFFACDYKYMKWLRDE 209
Query: 218 ---YVKWAASMALGLGAGVPWVMCKQTDAPENIID-------ACNGYYCDGYKPNSYNKP 267
YV+ A + G G+ C D + +D +GY+ D K P
Sbjct: 210 TERYVRGKAVLFTNNGPGL--TQCGGIDGVLSTLDFGPGTALEIDGYWKDLRKLQP-KGP 266
Query: 268 TLWTENWDGWYTTWG-GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG-- 324
+ E + GW T W ++ P+E + ++ R+ +N YM++GGTNFG T+G
Sbjct: 267 LVNAEYYPGWLTHWQEQQMARSPIEPVVTSL-RYMLSSKVNVNIYMFYGGTNFGFTAGAN 325
Query: 325 ----GPFY--ITSYDYDAPIDEYG 342
G F ITSYDYDAP+DE G
Sbjct: 326 EQGPGRFIPDITSYDYDAPLDESG 349
>gi|229548754|ref|ZP_04437479.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|257421063|ref|ZP_05598053.1| glycosyl hydrolase [Enterococcus faecalis X98]
gi|312951816|ref|ZP_07770707.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|422691033|ref|ZP_16749073.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|422707894|ref|ZP_16765431.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
gi|229306094|gb|EEN72090.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|257162887|gb|EEU92847.1| glycosyl hydrolase [Enterococcus faecalis X98]
gi|310630219|gb|EFQ13502.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|315154243|gb|EFT98259.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|315154885|gb|EFT98901.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
Length = 593
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 116/340 (34%), Positives = 164/340 (48%), Gaps = 42/340 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G +IS IHY R TP W D + K GA+ +ETY+ WN HE G Y+F+G
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+I FV+L L + LR Y+CAEW FGG P WL G+ R+ + F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ + V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + LG V
Sbjct: 131 YFQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEV 183
Query: 234 PWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTEN 273
P + A E ++DA G + K N+ + K P + E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSGGP 326
WDGW+ WG + R DLA V G +N YM+ GGTNFG R +
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGS--LNLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 327 FYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
+TSYDYDA + E G +E + + AIK P +
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEV 335
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 79/214 (36%), Gaps = 49/214 (22%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLIL--LSQTVGLQNYGAFLEK--DGA 601
D L ++++G L + V + Q+ + L L L + +G NYG L
Sbjct: 410 DRLHIYVDGDLAATQYQETVGEELLISGQTEKDTLTLDILVENLGRVNYGFKLNNPTQSK 469
Query: 602 GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTW 661
G RG V DI + Y + E Q+ I D T P ++
Sbjct: 470 GIRGGVM------QDIHFHQGYQHYPLTFSQE--QLAKI--------DYTAGKNPLQPSF 513
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDK 721
Y+ F+ D +D GKG VNGHH+GRYW + G +S
Sbjct: 514 YQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------GPIHSLY 558
Query: 722 CTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
C P+ +LQ N +VIFE G
Sbjct: 559 C--------------PKEFLQQGQNEVVIFETEG 578
>gi|424760912|ref|ZP_18188500.1| putative beta-galactosidase [Enterococcus faecalis R508]
gi|402402633|gb|EJV35336.1| putative beta-galactosidase [Enterococcus faecalis R508]
Length = 593
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 116/340 (34%), Positives = 164/340 (48%), Gaps = 42/340 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G +IS IHY R TP W D + K GA+ +ETY+ WN HE G Y+F+G
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+I FV+L L + LR Y+CAEW FGG P WL G+ R+ + F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ + V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + LG V
Sbjct: 131 YFQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEV 183
Query: 234 PWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTEN 273
P + A E ++DA G + K N+ + K P + E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSGGP 326
WDGW+ WG + R DLA V G +N YM+ GGTNFG R +
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLTVGS--LNLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 327 FYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
+TSYDYDA + E G +E + + AIK P +
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEV 335
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 79/214 (36%), Gaps = 49/214 (22%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLIL--LSQTVGLQNYGAFLEK--DGA 601
D L ++++G L + V + Q+ + L L L + +G NYG L
Sbjct: 410 DRLHIYVDGDLAATQYQETVGEELLISGQTEKDTLALDILVENLGRVNYGFKLNNPTQSK 469
Query: 602 GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTW 661
G RG V DI + Y + E Q+ I D T P ++
Sbjct: 470 GIRGGVM------QDIHFHQGYQHYPLTFSQE--QLAKI--------DYTAGKNPLQPSF 513
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDK 721
Y+ F+ D +D GKG VNGHH+GRYW + G +S
Sbjct: 514 YQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------GPIHSLY 558
Query: 722 CTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
C P+ +LQ N +VIFE G
Sbjct: 559 C--------------PKEFLQQGQNEVVIFETEG 578
>gi|153806012|ref|ZP_01958680.1| hypothetical protein BACCAC_00257 [Bacteroides caccae ATCC 43185]
gi|149130689|gb|EDM21895.1| glycosyl hydrolase family 35 [Bacteroides caccae ATCC 43185]
Length = 774
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 112/349 (32%), Positives = 166/349 (47%), Gaps = 46/349 (13%)
Query: 16 LSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATP 75
L + + ++M + L+C S FNV DG LI +HY R
Sbjct: 7 LYKFILGLLMPFLFLACSSKERIKIDGGTFNV---------DGKDVQLICGEMHYARIPH 57
Query: 76 EMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRI 135
E W D + +++ G + I YVFWN HE G+++F G+ D+ +FV+L GLY+ LR
Sbjct: 58 EYWRDRLKRARAMGLNTISVYVFWNFHERQPGEFDFSGQADVAEFVRLAQEEGLYVILRP 117
Query: 136 GPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPI 195
GPY CAEW+FGG+P WL + +R+ + F E +R++K + + L GG I
Sbjct: 118 GPYACAEWDFGGYPSWLLKEKDMVYRSKDPRFLEYCERYIKALGKQLAP--LTVNNGGNI 175
Query: 196 IMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCK---QTDAP--ENIIDA 250
+M+Q+ENEYG+ + K+Y+ M G VP C Q +A + +
Sbjct: 176 LMVQVENEYGSYAAD-----KEYLAALRDMIKDAGFNVPLFTCDGGGQVEAGHIDGALPT 230
Query: 251 CNGYY-------CDGYKPNSYNKPTLWTENWDGWYTTWGGRLP----HRPVEDLAFAVAR 299
NG + D Y P P E + W+ WG R RP E L + + +
Sbjct: 231 LNGVFSEDIFKIIDKYHPGG---PYFVAEFYPAWFDVWGQRHSTVDYKRPAEQLDWMLGQ 287
Query: 300 FFQRGGSFMNYYMYFGGTNF----GRTSGGPFY--ITSYDYDAPIDEYG 342
G ++ YM+ GGTNF G + G + TSYDYDAP+ E+G
Sbjct: 288 -----GVSVSMYMFHGGTNFWYMNGANTAGGYRPQPTSYDYDAPLGEWG 331
Score = 42.7 bits (99), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 51/228 (22%), Positives = 85/228 (37%), Gaps = 57/228 (25%)
Query: 536 RPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAF 595
+ + I +RD + ++G+ S+ + + ++ Q L +L + G NYG
Sbjct: 414 KQKLIIQDLRDYAVILVDGKQVASLDRRYNQNNVMLDIQKAPATLEILVENTGRVNYGPD 473
Query: 596 LEKDGAGFRGQV-----KLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDL 650
+ + G QV KLTG+ + L K +++ + E+
Sbjct: 474 ILFNRKGITNQVLCGDEKLTGWSITPLPLYK-------------EKVSEMNFGES----- 515
Query: 651 TRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDT 710
I ++K F D +D+ GKG WVNG +GR+W +
Sbjct: 516 ----IQGKPAFHKGIFTVRQKGD-CFVDMSRWGKGAVWVNGKSLGRFWNI---------- 560
Query: 711 CDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE-ETGGN 757
P QT Y +P WL+ N +V+FE E GN
Sbjct: 561 -----------------GPQQTLY-LPAPWLKEGENEIVVFEMEDTGN 590
>gi|423219555|ref|ZP_17206051.1| hypothetical protein HMPREF1061_02824 [Bacteroides caccae
CL03T12C61]
gi|392624760|gb|EIY18838.1| hypothetical protein HMPREF1061_02824 [Bacteroides caccae
CL03T12C61]
Length = 774
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 112/349 (32%), Positives = 166/349 (47%), Gaps = 46/349 (13%)
Query: 16 LSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATP 75
L + + ++M + L+C S FNV DG LI +HY R
Sbjct: 7 LYKFILGLLMPFLFLACSSKERIKIDGGTFNV---------DGKDVQLICGEMHYARIPH 57
Query: 76 EMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRI 135
E W D + +++ G + I YVFWN HE G+++F G+ D+ +FV+L GLY+ LR
Sbjct: 58 EYWRDRLKRARAMGLNTISVYVFWNFHERQPGEFDFSGQADVAEFVRLAQEEGLYVILRP 117
Query: 136 GPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPI 195
GPY CAEW+FGG+P WL + +R+ + F E +R++K + + L GG I
Sbjct: 118 GPYACAEWDFGGYPSWLLKEKDMVYRSKDPRFLEYCERYIKALGKQLAP--LTVNNGGNI 175
Query: 196 IMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCK---QTDAP--ENIIDA 250
+M+Q+ENEYG+ + K+Y+ M G VP C Q +A + +
Sbjct: 176 LMVQVENEYGSYAAD-----KEYLAALRDMIKDAGFNVPLFTCDGGGQVEAGHIDGALPT 230
Query: 251 CNGYY-------CDGYKPNSYNKPTLWTENWDGWYTTWGGRLP----HRPVEDLAFAVAR 299
NG + D Y P P E + W+ WG R RP E L + + +
Sbjct: 231 LNGVFSEDIFKIIDKYHPGG---PYFVAEFYPAWFDVWGQRHSTVDYKRPAEQLDWMLGQ 287
Query: 300 FFQRGGSFMNYYMYFGGTNF----GRTSGGPFY--ITSYDYDAPIDEYG 342
G ++ YM+ GGTNF G + G + TSYDYDAP+ E+G
Sbjct: 288 -----GVSVSMYMFHGGTNFWYMNGANTAGGYRPQPTSYDYDAPLGEWG 331
Score = 42.7 bits (99), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 51/228 (22%), Positives = 85/228 (37%), Gaps = 57/228 (25%)
Query: 536 RPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAF 595
+ + I +RD + ++G+ S+ + + ++ Q L +L + G NYG
Sbjct: 414 KQKLIIQDLRDYAVILVDGKQVASLDRRYNQNNVMLDIQKAPATLEILVENTGRVNYGPD 473
Query: 596 LEKDGAGFRGQV-----KLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDL 650
+ + G QV KLTG+ + L K +++ + E+
Sbjct: 474 ILFNRKGITNQVLCGDEKLTGWSITPLPLYK-------------EKVSEMNFGES----- 515
Query: 651 TRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDT 710
I ++K F D +D+ GKG WVNG +GR+W +
Sbjct: 516 ----IQGKPAFHKGIFTVRQKGD-CFVDMSRWGKGAVWVNGKSLGRFWNI---------- 560
Query: 711 CDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE-ETGGN 757
P QT Y +P WL+ N +V+FE E GN
Sbjct: 561 -----------------GPQQTLY-LPAPWLKEGENEIVVFEMEDTGN 590
>gi|29376349|ref|NP_815503.1| glycosyl hydrolase [Enterococcus faecalis V583]
gi|256961697|ref|ZP_05565868.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|257419527|ref|ZP_05596521.1| beta-galactosidase [Enterococcus faecalis T11]
gi|29343812|gb|AAO81573.1| glycosyl hydrolase, family 35 [Enterococcus faecalis V583]
gi|256952193|gb|EEU68825.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|257161355|gb|EEU91315.1| beta-galactosidase [Enterococcus faecalis T11]
Length = 594
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 10 FLLNGQSFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 129 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 181
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 182 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 238
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 239 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 296
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 297 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKESFAQ 346
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 54/223 (24%), Positives = 90/223 (40%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 407 RDRLQLFVNQVHQATQYQTEIGEDIYVTLPQE----NNQIDILMENMGRVNYGHKLFAD- 461
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + ++QQ Y + E D +R+ P +
Sbjct: 462 ------TQKKGIRTGVMA--------DLHFMTQWQQ-YCLPMTSCEQVDYSREWQPDQPS 506
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 507 FYQYHVELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 545
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 546 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 580
>gi|384512509|ref|YP_005707602.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|430358961|ref|ZP_19425649.1| beta-galactosidase [Enterococcus faecalis OG1X]
gi|327534398|gb|AEA93232.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|429513519|gb|ELA03099.1| beta-galactosidase [Enterococcus faecalis OG1X]
Length = 592
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 116/340 (34%), Positives = 164/340 (48%), Gaps = 42/340 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G +IS IHY R TP W D + K GA+ +ETY+ WN HE G Y+F+G
Sbjct: 10 FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+I FV+L L + LR Y+CAEW FGG P WL G+ R+ + F +++
Sbjct: 70 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKKKGVRLRSTDPIFMTKVRN 129
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ + V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + LG V
Sbjct: 130 YFQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEV 182
Query: 234 PWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTEN 273
P + A E ++DA G + K N+ + K P + E
Sbjct: 183 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 240
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSGGP 326
WDGW+ WG + R DLA V G +N YM+ GGTNFG R +
Sbjct: 241 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGS--LNLYMFHGGTNFGFYNGCSARGAKDL 298
Query: 327 FYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
+TSYDYDA + E G +E + + AIK P +
Sbjct: 299 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEV 334
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 79/214 (36%), Gaps = 49/214 (22%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLIL--LSQTVGLQNYGAFLEK--DGA 601
D L ++++G L + V + Q+ + L L L + +G NYG L
Sbjct: 409 DRLHIYVDGDLAATQYQETVGEELLISGQTEKDTLALDILVENLGRVNYGFKLNNPTQSK 468
Query: 602 GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTW 661
G RG V DI + Y + E Q+ I D T P ++
Sbjct: 469 GIRGGVM------QDIHFHQGCQHYPLTFSQE--QLAKI--------DYTAGKNPLQPSF 512
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDK 721
Y+ F+ D +D GKG VNGHH+GRYW + G +S
Sbjct: 513 YQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------GPIHSLY 557
Query: 722 CTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
C P+ +LQ N +VIFE G
Sbjct: 558 C--------------PKEFLQQGQNEVVIFETEG 577
>gi|383114571|ref|ZP_09935333.1| hypothetical protein BSGG_1258 [Bacteroides sp. D2]
gi|382948460|gb|EFS30558.2| hypothetical protein BSGG_1258 [Bacteroides sp. D2]
Length = 775
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 113/343 (32%), Positives = 174/343 (50%), Gaps = 34/343 (9%)
Query: 19 YPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMW 78
+ +++++ I +S + SS +S V + I+G LI +HYPR E W
Sbjct: 5 HKTVLVILNIIVSFLISSCSSP---KEQVRIGNGTFTIEGKDIQLICGEMHYPRIPHEYW 61
Query: 79 PDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPY 138
D + +++ G + + YVFWN HE G+++F G+ DI +F++ GLY+ LR GPY
Sbjct: 62 RDRLKRARAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGPY 121
Query: 139 VCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIML 198
VCAEW+FGG+P WL + +R+ + F +R++K++ + L GG IIM+
Sbjct: 122 VCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCERYIKELGKQLSP--LTINNGGNIIMV 179
Query: 199 QIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCK---QTDAP--ENIIDACNG 253
Q+ENEYG+ + K+Y+ M G VP C Q +A E + NG
Sbjct: 180 QVENEYGSYAAD-----KEYLAAIRDMIKEAGFNVPLFTCDGGGQVEAGHVEGALPTLNG 234
Query: 254 YYC-DGYK-PNSYNK--PTLWTENWDGWYTTWGGRLP----HRPVEDLAFAVARFFQRGG 305
+ D +K + Y K P E + W+ WG R RP E L + ++ G
Sbjct: 235 VFGEDIFKVVDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLSH-----G 289
Query: 306 SFMNYYMYFGGTNF----GRTSGGPF--YITSYDYDAPIDEYG 342
++ YM+ GGTNF G +GG + TSYDYDAP+ E+G
Sbjct: 290 VSVSMYMFHGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWG 332
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 51/222 (22%), Positives = 84/222 (37%), Gaps = 56/222 (25%)
Query: 536 RPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAF 595
+ + I +RD + I+G+ S+ + + + L +L + G NYG
Sbjct: 415 KQKLVIQDLRDYAVILIDGKQVASLDRRYNQNSVTLNVSKTPATLEILVENTGRVNYGPD 474
Query: 596 LEKDGAGFRGQV-----KLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDL 650
+ + G QV KLTG+ + L K +++ +E E
Sbjct: 475 ILFNRKGITSQVLWGNEKLTGWSITPLPLYK-------------EKVSEMEFGE------ 515
Query: 651 TRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDT 710
T G+P+ ++K F D +D+ GKG WVNG +GR+W +
Sbjct: 516 TIKGVPA---FHKGTFTVEKKGD-CFVDMSQWGKGAVWVNGKSLGRFWNI---------- 561
Query: 711 CDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
P QT Y +P WL+ N +V+FE
Sbjct: 562 -----------------GPQQTLY-LPAPWLKEGENEIVVFE 585
>gi|153807689|ref|ZP_01960357.1| hypothetical protein BACCAC_01971 [Bacteroides caccae ATCC 43185]
gi|149130051|gb|EDM21263.1| glycosyl hydrolase family 35 [Bacteroides caccae ATCC 43185]
Length = 775
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 111/348 (31%), Positives = 168/348 (48%), Gaps = 44/348 (12%)
Query: 17 SVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPE 76
+V+ M+ +++ +S SS V ++ I+G LI +HYPR E
Sbjct: 7 NVFIMLNLIVSFFISACSSPREQ-------VKIENGTFNINGKDVQLICGEMHYPRIPHE 59
Query: 77 MWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIG 136
W D + +++ G + + YVFWN HE G ++F G+ DI +FV++ GLY+ LR G
Sbjct: 60 YWRDRLHRARAMGLNTVSAYVFWNFHERQPGVFDFSGQADIAEFVRIAQEEGLYVILRPG 119
Query: 137 PYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPII 196
PYVCAEW+FGG+P WL + +R+ + F +R++K++ + + + GG II
Sbjct: 120 PYVCAEWDFGGYPSWLLKEKDLTYRSKDPRFMSYCERYIKELGKQLAPLTINN--GGNII 177
Query: 197 MLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCK---QTDAPE--NIIDAC 251
M+Q+ENEYG+ + K+Y+ M G VP C Q +A +
Sbjct: 178 MVQVENEYGSYAAD-----KEYLAAIRDMLQEAGFNVPLFTCDGGGQVEAGHIAGALPTL 232
Query: 252 NGYY-------CDGYKPNSYNKPTLWTENWDGWYTTWGGRLP----HRPVEDLAFAVARF 300
NG + D Y P P E + W+ WG R RP E L + +
Sbjct: 233 NGVFGEDIFKIVDKYHPGG---PYFVAEFYPAWFDEWGKRHSSVAYERPAEQLDWMLGH- 288
Query: 301 FQRGGSFMNYYMYFGGTNF----GRTSGGPFY--ITSYDYDAPIDEYG 342
G ++ YM+ GGTNF G + G F TSYDYDAP+ E+G
Sbjct: 289 ----GVSVSMYMFHGGTNFWYMNGANTSGGFRPQPTSYDYDAPLGEWG 332
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 47/222 (21%), Positives = 84/222 (37%), Gaps = 56/222 (25%)
Query: 536 RPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAF 595
+ + I +RD + ++G+ S+ + + ++ L +L + G NYG
Sbjct: 415 KQKLIIQDLRDYAVILVDGKQVASLDRRYNQNSTTLDIHKVPATLEILVENTGRVNYGPD 474
Query: 596 LEKDGAGFRGQV-----KLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDL 650
+ + G QV KLTG+ + L K +++ S+ +
Sbjct: 475 ILFNRKGITSQVLWGNEKLTGWSITPLPLYK-------------EEVSSLSFGQE----- 516
Query: 651 TRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDT 710
G+P+ +++ F D +D+ GKG WVNG +GR+W +
Sbjct: 517 -IKGVPA---FHRGTFIIEQQGD-CFVDMSQWGKGAVWVNGKSLGRFWNI---------- 561
Query: 711 CDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
P QT Y +P WL+ N +V+FE
Sbjct: 562 -----------------GPQQTLY-IPAPWLKKGENEIVVFE 585
>gi|422877900|ref|ZP_16924370.1| beta-galactosidase [Streptococcus sanguinis SK1056]
gi|332358593|gb|EGJ36417.1| beta-galactosidase [Streptococcus sanguinis SK1056]
Length = 592
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 115/335 (34%), Positives = 159/335 (47%), Gaps = 38/335 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG ++S I Y R P+ W D + K G + +ETY+ W HE GQ+ +G
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D + KLV GLYL +R PY+CAE++FGG P WL P + R N+ F E++ F
Sbjct: 72 DFEAYFKLVKEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHF- 130
Query: 176 KKIVDLMREEML--FSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
D + ++L S QGGPI+M+Q+ENEYG SY + K Y++ A M G V
Sbjct: 131 ---YDWLFPKLLPYQSDQGGPILMMQVENEYG----SYAED-KAYMRSIAQMMKVRGVTV 182
Query: 234 P-------WVMCKQTDAPENIIDACNGYYCDGYKPNSYNK-----------PTLWTENWD 275
P W+ ++ G + K N+ N P + TE WD
Sbjct: 183 PLFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------- 328
GW++ W + R EDLA V Q G MN ++ GGTNFG SG
Sbjct: 243 GWFSRWSEEIVWREAEDLAQDVKEMLQLGS--MNLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCE 363
ITSYD+DAPI E+G +E + + H E
Sbjct: 301 ITSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELE 335
>gi|227554928|ref|ZP_03984975.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|422713751|ref|ZP_16770500.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|422716430|ref|ZP_16773136.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|227175936|gb|EEI56908.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|315575268|gb|EFU87459.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|315581351|gb|EFU93542.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
Length = 593
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 116/340 (34%), Positives = 164/340 (48%), Gaps = 42/340 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G +IS IHY R TP W D + K GA+ +ETY+ WN HE G Y+F+G
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+I FV+L L + LR Y+CAEW FGG P WL G+ R+ + F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ + V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + LG V
Sbjct: 131 YFQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEV 183
Query: 234 PWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTEN 273
P + A E ++DA G + K N+ + K P + E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSGGP 326
WDGW+ WG + R DLA V G +N YM+ GGTNFG R +
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGS--LNLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 327 FYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
+TSYDYDA + E G +E + + AIK P +
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEV 335
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 79/214 (36%), Gaps = 49/214 (22%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLIL--LSQTVGLQNYGAFLEK--DGA 601
D L ++++G L + V + Q+ + L L L + +G NYG L
Sbjct: 410 DRLHIYVDGDLAATQYQETVGEELLISGQTEKDTLALDILVENLGRVNYGFKLNNPTQSK 469
Query: 602 GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTW 661
G RG V DI + Y + E Q+ I D T P ++
Sbjct: 470 GIRGGVM------QDIHFHQGYQHYPLTFSQE--QLAKI--------DYTAGKNPLQPSF 513
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDK 721
Y+ F+ D +D GKG VNGHH+GRYW + G +S
Sbjct: 514 YQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------GPIHSLY 558
Query: 722 CTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
C P+ +LQ N +VIFE G
Sbjct: 559 C--------------PKEFLQQGQNEVVIFETEG 578
>gi|227517783|ref|ZP_03947832.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|424678087|ref|ZP_18114931.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|424681129|ref|ZP_18117923.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|424685648|ref|ZP_18122340.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|424689662|ref|ZP_18126226.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|424693525|ref|ZP_18129955.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|424698239|ref|ZP_18134537.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|424701365|ref|ZP_18137539.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|424702750|ref|ZP_18138894.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|424711867|ref|ZP_18144074.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|424717978|ref|ZP_18147248.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|424722429|ref|ZP_18151489.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|424723619|ref|ZP_18152577.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|424733091|ref|ZP_18161660.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|424746203|ref|ZP_18174452.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|424755204|ref|ZP_18183090.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
gi|227074744|gb|EEI12707.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|402351976|gb|EJU86842.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|402352513|gb|EJU87362.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|402358223|gb|EJU92905.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|402367111|gb|EJV01460.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|402371797|gb|EJV05943.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|402373001|gb|EJV07093.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|402373959|gb|EJV08006.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|402382684|gb|EJV16335.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|402383232|gb|EJV16843.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|402386182|gb|EJV19689.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|402388743|gb|EJV22170.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|402392403|gb|EJV25665.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|402397550|gb|EJV30559.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|402397571|gb|EJV30579.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|402401167|gb|EJV33955.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
Length = 593
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 116/340 (34%), Positives = 164/340 (48%), Gaps = 42/340 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G +IS IHY R TP W D + K GA+ +ETY+ WN HE G Y+F+G
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+I FV+L L + LR Y+CAEW FGG P WL G+ R+ + F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ + V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + LG V
Sbjct: 131 YFQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEV 183
Query: 234 PWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTEN 273
P + A E ++DA G + K N+ + K P + E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSGGP 326
WDGW+ WG + R DLA V G +N YM+ GGTNFG R +
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLTVGS--LNLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 327 FYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
+TSYDYDA + E G +E + + AIK P +
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEV 335
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 79/214 (36%), Gaps = 49/214 (22%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLIL--LSQTVGLQNYGAFLEK--DGA 601
D L ++++G L + V + Q+ + L L L + +G NYG L
Sbjct: 410 DRLHIYVDGDLAATQYQETVGEELLISGQTEKDTLALDILVENLGRVNYGFKLNNPTQSK 469
Query: 602 GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTW 661
G RG V DI + Y + E Q+ I D T P ++
Sbjct: 470 GIRGGVM------QDIHFHQGYQHYPLTFSQE--QLAKI--------DYTAGKNPLQPSF 513
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDK 721
Y+ F+ D +D GKG VNGHH+GRYW + G +S
Sbjct: 514 YQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------GPIHSLY 558
Query: 722 CTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
C P+ +LQ N +VIFE G
Sbjct: 559 C--------------PKEFLQQGQNEVVIFETEG 578
>gi|424687003|ref|ZP_18123658.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|402366194|gb|EJV00591.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
Length = 593
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 116/340 (34%), Positives = 164/340 (48%), Gaps = 42/340 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G +IS IHY R TP W D + K GA+ +ETY+ WN HE G Y+F+G
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+I FV+L L + LR Y+CAEW FGG P WL G+ R+ + F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ + V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + LG V
Sbjct: 131 YFQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEV 183
Query: 234 PWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTEN 273
P + A E ++DA G + K N+ + K P + E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSGGP 326
WDGW+ WG + R DLA V G +N YM+ GGTNFG R +
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLTVGS--LNLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 327 FYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
+TSYDYDA + E G +E + + AIK P +
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEV 335
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 79/214 (36%), Gaps = 49/214 (22%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLIL--LSQTVGLQNYGAFLEK--DGA 601
D L ++++G L + V + Q+ + L L L + +G NYG L
Sbjct: 410 DRLHIYVDGDLAATQYQETVGEELLISGQTEKDTLALDILVENLGRVNYGFKLNNPTQSK 469
Query: 602 GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTW 661
G RG V DI + Y + E Q+ I D T P ++
Sbjct: 470 GIRGGVM------QDIHFHQGYQHYPLTFSQE--QLAKI--------DYTAGKNPLQPSF 513
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDK 721
Y+ F+ D +D GKG VNGHH+GRYW + G +S
Sbjct: 514 YQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------GPIHSLY 558
Query: 722 CTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
C P+ +LQ N +VIFE G
Sbjct: 559 C--------------PKEFLQQGQNEVVIFETEG 578
>gi|217075719|gb|ACJ86219.1| unknown [Medicago truncatula]
Length = 200
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 79/170 (46%), Positives = 112/170 (65%), Gaps = 7/170 (4%)
Query: 682 MGKGQAWVNGHHIGRYW-TVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSW 740
MGKG+AWVNG IGRYW T ++P GC D+C+YRG Y++ KC NCG P+QT YHVPR+W
Sbjct: 1 MGKGEAWVNGQSIGRYWPTYISPNSGCTDSCNYRGTYSASKCLKNCGKPSQTLYHVPRAW 60
Query: 741 LQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSESHYPPVRKWSNSYSVDGKLSINK 800
L+ +N V+FEE+GG+P +IS + VC V+ESH PPV W+++ + K
Sbjct: 61 LKPDSNTFVLFEESGGDPTKISFGTKQIESVCSHVTESHPPPVDTWNSNAESE-----RK 115
Query: 801 MAPEMHLHCQ-DGYIISSIEFASYGTPQGRCQKFSRGNCHAPMSLSVVSE 849
+ P + L C ISSI+FAS+GTP+ C ++ G+C + +LS+V +
Sbjct: 116 VGPVLSLECPYPNQAISSIKFASFGTPRRTCGNYNHGSCSSNRALSIVQK 165
>gi|325567414|ref|ZP_08144081.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
gi|325158847|gb|EGC70993.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
Length = 591
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 115/328 (35%), Positives = 160/328 (48%), Gaps = 42/328 (12%)
Query: 42 FKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNA 101
+ F + D ++DG LIS IHY R TP W D + K GA+ +ETY+ WN
Sbjct: 1 MRTFEIKED---FLLDGKPIKLISGAIHYFRMTPAQWTDSLYNLKALGANTVETYIPWNL 57
Query: 102 HESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFR 161
HE G Y+F+G DI FVK + GL + LR Y+CAEW FGG P WL + P + R
Sbjct: 58 HEPREGVYDFEGMKDICAFVKQAQTLGLMVILRPSVYICAEWEFGGLPAWLLNEP-MRLR 116
Query: 162 TNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKW 221
+ + F +++ + + V L + L GGP+IM+Q+ENEYG SYG + K Y++
Sbjct: 117 STDPRFMAKVRNYFQ--VLLPKLVPLQITHGGPVIMMQVENEYG----SYGME-KAYLRQ 169
Query: 222 AASMALGLGAGVPWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS--------- 263
+ G VP + A E ++DA G + K N+
Sbjct: 170 TKELMEEYGIDVP--LFTSDGAWEEVLDAGTLIEDDIFVTGNFGSRSKENAAVMKEFMAK 227
Query: 264 --YNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGR 321
N P + E WDGW+ WG + R +DLA V G +N YM+ GGTNFG
Sbjct: 228 HGKNWPIMCMEYWDGWFNRWGEPIIKRDGQDLANEVKEMLAVGS--LNLYMFHGGTNFGF 285
Query: 322 TSG----GPF---YITSYDYDAPIDEYG 342
+G G ++SYDYDA + E G
Sbjct: 286 YNGCSARGALDLPQVSSYDYDALLTEAG 313
>gi|301309736|ref|ZP_07215675.1| beta-galactosidase (Lactase) [Bacteroides sp. 20_3]
gi|423340209|ref|ZP_17317948.1| hypothetical protein HMPREF1059_03873 [Parabacteroides distasonis
CL09T03C24]
gi|300831310|gb|EFK61941.1| beta-galactosidase (Lactase) [Bacteroides sp. 20_3]
gi|409227644|gb|EKN20540.1| hypothetical protein HMPREF1059_03873 [Parabacteroides distasonis
CL09T03C24]
Length = 765
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 108/327 (33%), Positives = 170/327 (51%), Gaps = 35/327 (10%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G ++S +HYPR + W + + G + + TYVFWN HE+ G+++F+G
Sbjct: 36 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 95
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
++ +++++ G GL + LR GPYVCAEW FGG+P WL++IPG+E R +N F + + ++
Sbjct: 96 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 155
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKD-----YVKWAASMALGL- 229
K+ + + + L +GGPIIM+Q ENE+G SY Q KD + ++ A + L
Sbjct: 156 DKLYEQVGD--LQVSKGGPIIMVQAENEFG----SYVAQRKDIPLEEHRRYNAKIKRQLA 209
Query: 230 --GAGVPWV------MCKQTDAPENIIDACNGYYCDGYKP--NSYN---KPTLWTENWDG 276
G VP + + P + A + K N Y+ P + E + G
Sbjct: 210 DAGFNVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPG 269
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF--------Y 328
W W P +A + Q SF N+YM GGTNFG TSG +
Sbjct: 270 WLMHWAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHDIQPD 328
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+TSYDYDAPI E G ++ PK+ ++++
Sbjct: 329 LTSYDYDAPISEAGWVT-PKFDSIRNV 354
>gi|256761574|ref|ZP_05502154.1| beta-galactosidase [Enterococcus faecalis T3]
gi|422736227|ref|ZP_16792491.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
gi|256682825|gb|EEU22520.1| beta-galactosidase [Enterococcus faecalis T3]
gi|315166978|gb|EFU10995.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
Length = 593
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 116/340 (34%), Positives = 164/340 (48%), Gaps = 42/340 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G +IS IHY R TP W D + K GA+ +ETY+ WN HE G Y+F+G
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+I FV+L L + LR Y+CAEW FGG P WL G+ R+ + F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ + V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + LG V
Sbjct: 131 YFQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEV 183
Query: 234 PWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTEN 273
P + A E ++DA G + K N+ + K P + E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSGGP 326
WDGW+ WG + R DLA V G +N YM+ GGTNFG R +
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGS--LNLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 327 FYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
+TSYDYDA + E G +E + + AIK P +
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEV 335
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 79/214 (36%), Gaps = 49/214 (22%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLIL--LSQTVGLQNYGAFLEK--DGA 601
D L ++++G L + V + Q+ + L L L + +G NYG L
Sbjct: 410 DRLHIYVDGDLAATQYQETVGEELLISGQTEKDTLALDILVENLGRVNYGFKLNNPTQSK 469
Query: 602 GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTW 661
G RG V DI + Y + E Q+ I D T P ++
Sbjct: 470 GIRGGVM------QDIHFHQGYQHYPLTFSQE--QLAKI--------DYTAGKNPLQPSF 513
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDK 721
Y+ F+ D +D GKG VNGHH+GRYW + G +S
Sbjct: 514 YQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------GPIHSLY 558
Query: 722 CTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
C P+ +LQ N +VIFE G
Sbjct: 559 C--------------PKEFLQQGQNEVVIFETEG 578
>gi|422727867|ref|ZP_16784288.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
gi|315151617|gb|EFT95633.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
Length = 593
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 116/340 (34%), Positives = 164/340 (48%), Gaps = 42/340 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G +IS IHY R TP W D + K GA+ +ETY+ WN HE G Y+F+G
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+I FV+L L + LR Y+CAEW FGG P WL G+ R+ + F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ + V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + LG V
Sbjct: 131 YFQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEV 183
Query: 234 PWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTEN 273
P + A E ++DA G + K N+ + K P + E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSGGP 326
WDGW+ WG + R DLA V G +N YM+ GGTNFG R +
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGS--LNLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 327 FYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
+TSYDYDA + E G +E + + AIK P +
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEV 335
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 55/218 (25%), Positives = 82/218 (37%), Gaps = 53/218 (24%)
Query: 546 DVLRVFINGQLTGS----VIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEK--D 599
D L ++++G L + +G + + E + + L +L + +G NYG L
Sbjct: 410 DRLHIYVDGDLAATQYQETVGEELLISGQTEKDT--HALDILVENLGRVNYGFKLNNPTQ 467
Query: 600 GAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTF 659
G RG V DI + Y + E Q+ I D T P
Sbjct: 468 SKGIRGGVM------QDIHFHQGYQHYPLTFSQE--QLAKI--------DYTAGKNPLQP 511
Query: 660 TWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNS 719
++Y+ F+ D +D GKG VNGHH+GRYW + G +S
Sbjct: 512 SFYQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------GPIHS 556
Query: 720 DKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
C P+ +LQ N +VIFE G N
Sbjct: 557 LYC--------------PKEFLQQGQNEVVIFETEGIN 580
>gi|256964894|ref|ZP_05569065.1| beta-galactosidase [Enterococcus faecalis HIP11704]
gi|256955390|gb|EEU72022.1| beta-galactosidase [Enterococcus faecalis HIP11704]
Length = 594
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 129 YYDVLMEKIVPHQLVN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 181
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 182 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 238
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 239 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 296
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 297 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALSQAEP-LVKDSFAQ 346
>gi|257415380|ref|ZP_05592374.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
gi|257157208|gb|EEU87168.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
Length = 593
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 116/340 (34%), Positives = 164/340 (48%), Gaps = 42/340 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G +IS IHY R TP W D + K GA+ +ETY+ WN HE G Y+F+G
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+I FV+L L + LR Y+CAEW FGG P WL G+ R+ + F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ + V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + LG V
Sbjct: 131 YFQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEV 183
Query: 234 PWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTEN 273
P + A E ++DA G + K N+ + K P + E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSGGP 326
WDGW+ WG + R DLA V G +N YM+ GGTNFG R +
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGS--LNLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 327 FYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
+TSYDYDA + E G +E + + AIK P +
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEV 335
Score = 42.7 bits (99), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 56/216 (25%), Positives = 80/216 (37%), Gaps = 49/216 (22%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLIL--LSQTVGLQNYGAFLEK--DGA 601
D L ++++G L + V + Q+ + L L L + +G NYG L
Sbjct: 410 DRLHIYVDGDLAATQYQETVGEELLISGQTEKDTLALDILVENLGRVNYGFKLNNPTQSK 469
Query: 602 GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTW 661
G RG V DI + Y + E Q+ I D T P ++
Sbjct: 470 GIRGGVM------QDIHFHQGYQHYPLTFSQE--QLAKI--------DYTAGKNPLQPSF 513
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDK 721
Y+ F+ D +D GKG VNGHH+GRYW + G +S
Sbjct: 514 YQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------GPIHSLY 558
Query: 722 CTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
C P+ +LQ N +VIFE G N
Sbjct: 559 C--------------PKEFLQQGQNEVVIFETEGIN 580
>gi|312901648|ref|ZP_07760918.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
gi|311291259|gb|EFQ69815.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
Length = 593
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 116/340 (34%), Positives = 164/340 (48%), Gaps = 42/340 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G +IS IHY R TP W D + K GA+ +ETY+ WN HE G Y+F+G
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+I FV+L L + LR Y+CAEW FGG P WL G+ R+ + F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ + V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + LG V
Sbjct: 131 YFQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEV 183
Query: 234 PWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTEN 273
P + A E ++DA G + K N+ + K P + E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSGGP 326
WDGW+ WG + R DLA V G +N YM+ GGTNFG R +
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGS--LNLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 327 FYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
+TSYDYDA + E G +E + + AIK P +
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEV 335
Score = 42.7 bits (99), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 56/216 (25%), Positives = 80/216 (37%), Gaps = 49/216 (22%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLIL--LSQTVGLQNYGAFLEK--DGA 601
D L ++++G L + V + Q+ + L L L + +G NYG L
Sbjct: 410 DRLHIYVDGDLAATQYQETVGEELLISGQTEKDTLALDILVENLGRVNYGFKLNNPTQSK 469
Query: 602 GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTW 661
G RG V DI + Y + E Q+ I D T P ++
Sbjct: 470 GIRGGVM------QDIHFHQGYQHYPLTFSQE--QLAKI--------DYTAGKNPLQPSF 513
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDK 721
Y+ F+ D +D GKG VNGHH+GRYW + G +S
Sbjct: 514 YQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------GPIHSLY 558
Query: 722 CTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
C P+ +LQ N +VIFE G N
Sbjct: 559 C--------------PKEFLQQGQNEVVIFETEGIN 580
>gi|150008152|ref|YP_001302895.1| beta-glycosidase [Parabacteroides distasonis ATCC 8503]
gi|149936576|gb|ABR43273.1| glycoside hydrolase family 35, candidate beta-glycosidase
[Parabacteroides distasonis ATCC 8503]
Length = 768
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 108/327 (33%), Positives = 170/327 (51%), Gaps = 35/327 (10%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G ++S +HYPR + W + + G + + TYVFWN HE+ G+++F+G
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
++ +++++ G GL + LR GPYVCAEW FGG+P WL++IPG+E R +N F + + ++
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKD-----YVKWAASMALGL- 229
K+ + + + L +GGPIIM+Q ENE+G SY Q KD + ++ A + L
Sbjct: 159 DKLYEQVGD--LQVSKGGPIIMVQAENEFG----SYVAQRKDIPLEEHRRYNAKIKRQLA 212
Query: 230 --GAGVPWV------MCKQTDAPENIIDACNGYYCDGYKP--NSYN---KPTLWTENWDG 276
G VP + + P + A + K N Y+ P + E + G
Sbjct: 213 DAGFNVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPG 272
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF--------Y 328
W W P +A + Q SF N+YM GGTNFG TSG +
Sbjct: 273 WLMHWAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHDIQPD 331
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+TSYDYDAPI E G ++ PK+ ++++
Sbjct: 332 LTSYDYDAPISEAGWVT-PKFDSIRNV 357
Score = 43.9 bits (102), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 70/325 (21%), Positives = 119/325 (36%), Gaps = 57/325 (17%)
Query: 449 KTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTK 508
K V + +P +P +P + L+ + KE V S T E LN
Sbjct: 360 KYVTYDVPEAP-APIPLIEIPSISLTKVADVLALAKEGEPVASPTPLT----FEQLNQGY 414
Query: 509 DYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVV 568
Y Y H Q ++ + I +RD ++++G+ G + + +
Sbjct: 415 GYVLYSTHFNQ--------------PLKGRLEIPGLRDYATIYVDGERVGELNRCFNQYA 460
Query: 569 QPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQV 628
++ L +L + +G NYG + ++ G VK+ G + D + K+
Sbjct: 461 MEIDIPFNAT-LDILVENMGRINYGEEIVRNTKGIISSVKINGSEISDWKMYKLPMDRMP 519
Query: 629 GLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAW 688
L +Y E + + Y+ F D D +D+ GKG +
Sbjct: 520 ALVSGEPYVYKNGSPEVA-------ALGNKPVLYEGTFHLSDTGD-TFIDMEDWGKGIIF 571
Query: 689 VNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLL 748
+NG +IGRYW Y G P QT Y +P WL N +
Sbjct: 572 INGVNIGRYW--------------YAG-------------PQQTLY-IPGVWLNKGENKI 603
Query: 749 VIFEETGGNPFEISVKLRSTRIVCE 773
VI+E+ N + SV+ T ++ +
Sbjct: 604 VIYEQL-NNDRKSSVRTVKTPVLTK 627
>gi|257083732|ref|ZP_05578093.1| beta-galactosidase [Enterococcus faecalis Fly1]
gi|256991762|gb|EEU79064.1| beta-galactosidase [Enterococcus faecalis Fly1]
Length = 593
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 117/342 (34%), Positives = 165/342 (48%), Gaps = 46/342 (13%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G +IS IHY R TP W D + K GA+ +ETY+ WN HE G Y+F+G
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+I FV+L L + LR Y+CAEW FGG P WL G+ R+ + F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGN--MESSYGQQGKDYVKWAASMALGLGA 231
+ + V L + L QGGP+IM+Q+ENEYG+ ME +Y QQ K ++ LG
Sbjct: 131 YFQ--VLLPKLSPLQITQGGPVIMMQVENEYGSYGMEKAYLQQTKQIME-------ELGI 181
Query: 232 GVPWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWT 271
VP + A E ++DA G + K N+ + K P +
Sbjct: 182 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLRKFMTRHGKKWPLMCM 239
Query: 272 ENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSG 324
E WDGW+ WG + R DLA V G +N YM+ GGTNFG R +
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLAVGS--LNLYMFHGGTNFGFYNGCSARGAK 297
Query: 325 GPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
+TSYDYDA + E G +E + + AIK P +
Sbjct: 298 DLPQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEV 335
Score = 42.7 bits (99), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 56/216 (25%), Positives = 80/216 (37%), Gaps = 49/216 (22%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLIL--LSQTVGLQNYGAFLEK--DGA 601
D L ++++G L + V + Q+ + L L L + +G NYG L
Sbjct: 410 DRLHIYVDGDLAATQYQETVGEELLISGQTEKDTLALDILVENLGRVNYGFKLNNPTQSK 469
Query: 602 GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTW 661
G RG V DI + Y + E Q+ I D T P ++
Sbjct: 470 GIRGGVM------QDIHFHQGYQHYPLTFSQE--QLAKI--------DYTAGKNPLQPSF 513
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDK 721
Y+ F+ D +D GKG VNGHH+GRYW + G +S
Sbjct: 514 YQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------GPIHSLY 558
Query: 722 CTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
C P+ +LQ N +VIFE G N
Sbjct: 559 C--------------PKEFLQQGQNEVVIFETEGIN 580
>gi|29375402|ref|NP_814556.1| glycosyl hydrolase [Enterococcus faecalis V583]
gi|29342862|gb|AAO80626.1| glycosyl hydrolase, family 35 [Enterococcus faecalis V583]
Length = 592
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 116/340 (34%), Positives = 164/340 (48%), Gaps = 42/340 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G +IS IHY R TP W D + K GA+ +ETY+ WN HE G Y+F+G
Sbjct: 10 FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+I FV+L L + LR Y+CAEW FGG P WL G+ R+ + F +++
Sbjct: 70 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 129
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ + V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + LG V
Sbjct: 130 YFQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEV 182
Query: 234 PWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTEN 273
P + A E ++DA G + K N+ + K P + E
Sbjct: 183 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 240
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSGGP 326
WDGW+ WG + R DLA V G +N YM+ GGTNFG R +
Sbjct: 241 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGS--LNLYMFHGGTNFGFYNGCSARGAKDL 298
Query: 327 FYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
+TSYDYDA + E G +E + + AIK P +
Sbjct: 299 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEV 334
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 79/214 (36%), Gaps = 49/214 (22%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLIL--LSQTVGLQNYGAFLEK--DGA 601
D L ++++G L + V + Q+ + L L L + +G NYG L
Sbjct: 409 DRLHIYVDGDLAATQYQETVGEELLISGQTEKDTLALDILVENLGRVNYGFKLNNPTQSK 468
Query: 602 GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTW 661
G RG V DI + Y + E Q+ I D T P ++
Sbjct: 469 GIRGGVM------QDIHFHQGYQHYPLTFSQE--QLAKI--------DYTAGKNPLQPSF 512
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDK 721
Y+ F+ D +D GKG VNGHH+GRYW + G +S
Sbjct: 513 YQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------GPIHSLY 557
Query: 722 CTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
C P+ +LQ N +VIFE G
Sbjct: 558 C--------------PKEFLQQGQNEVVIFETEG 577
>gi|257418414|ref|ZP_05595408.1| beta-galactosidase [Enterococcus faecalis T11]
gi|257160242|gb|EEU90202.1| beta-galactosidase [Enterococcus faecalis T11]
Length = 592
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 116/340 (34%), Positives = 164/340 (48%), Gaps = 42/340 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G +IS IHY R TP W D + K GA+ +ETY+ WN HE G Y+F+G
Sbjct: 10 FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+I FV+L L + LR Y+CAEW FGG P WL G+ R+ + F +++
Sbjct: 70 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 129
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ + V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + LG V
Sbjct: 130 YFQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEV 182
Query: 234 PWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTEN 273
P + A E ++DA G + K N+ + K P + E
Sbjct: 183 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 240
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSGGP 326
WDGW+ WG + R DLA V G +N YM+ GGTNFG R +
Sbjct: 241 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGS--LNLYMFHGGTNFGFYNGCSARGAKDL 298
Query: 327 FYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
+TSYDYDA + E G +E + + AIK P +
Sbjct: 299 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEV 334
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 79/214 (36%), Gaps = 49/214 (22%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLIL--LSQTVGLQNYGAFLEK--DGA 601
D L ++++G L + V + Q+ + L L L + +G NYG L
Sbjct: 409 DRLHIYVDGDLAATQYQETVGEELLISGQTEKDTLALDILVENLGRVNYGFKLNNPTQSK 468
Query: 602 GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTW 661
G RG V DI + Y + E Q+ I D T P ++
Sbjct: 469 GIRGGVM------QDIHFHQGYQHYPLTFSQE--QLAKI--------DYTAGKNPLQPSF 512
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDK 721
Y+ F+ D +D GKG VNGHH+GRYW + G +S
Sbjct: 513 YQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------GPIHSLY 557
Query: 722 CTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
C P+ +LQ N +VIFE G
Sbjct: 558 C--------------PKEFLQQGQNEVVIFETEG 577
>gi|255015104|ref|ZP_05287230.1| beta-glycosidase [Bacteroides sp. 2_1_7]
gi|410104527|ref|ZP_11299440.1| hypothetical protein HMPREF0999_03212 [Parabacteroides sp. D25]
gi|409234336|gb|EKN27166.1| hypothetical protein HMPREF0999_03212 [Parabacteroides sp. D25]
Length = 768
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 108/327 (33%), Positives = 170/327 (51%), Gaps = 35/327 (10%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G ++S +HYPR + W + + G + + TYVFWN HE+ G+++F+G
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
++ +++++ G GL + LR GPYVCAEW FGG+P WL++IPG+E R +N F + + ++
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKD-----YVKWAASMALGL- 229
K+ + + + L +GGPIIM+Q ENE+G SY Q KD + ++ A + L
Sbjct: 159 DKLYEQVGD--LQVSKGGPIIMVQAENEFG----SYVAQRKDIPLEEHRRYNAKIKRQLA 212
Query: 230 --GAGVPWV------MCKQTDAPENIIDACNGYYCDGYKP--NSYN---KPTLWTENWDG 276
G VP + + P + A + K N Y+ P + E + G
Sbjct: 213 DAGFNVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPG 272
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF--------Y 328
W W P +A + Q SF N+YM GGTNFG TSG +
Sbjct: 273 WLMHWAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHDIQPD 331
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+TSYDYDAPI E G ++ PK+ ++++
Sbjct: 332 LTSYDYDAPISEAGWVT-PKFDSIRNV 357
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 70/325 (21%), Positives = 120/325 (36%), Gaps = 57/325 (17%)
Query: 449 KTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTK 508
K V + +P +P +P + L+ + KE V S T E LN
Sbjct: 360 KYVTYDVPEAP-APIPLIEIPSISLTKVADVLALAKEGEPVASPTPLT----FEQLNQGY 414
Query: 509 DYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVV 568
Y Y H Q ++ + I +RD ++++G+ G + + +
Sbjct: 415 GYVLYSTHFNQ--------------PLKGRLEIPGLRDYATIYVDGERVGELNRCFNQYA 460
Query: 569 QPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQV 628
++ L +L + +G NYG + ++ G VK+ G + D + K+
Sbjct: 461 MEIDIPFNAT-LDILVENMGRINYGEEIVRNTKGIISSVKINGSEISDWKMYKLPMDRMP 519
Query: 629 GLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAW 688
L + +Y E + + Y+ F D D +D+ GKG +
Sbjct: 520 ALVSDEPYVYKNGSPEVA-------ALGNKPVLYEGTFHLSDTGD-TFIDMEDWGKGIIF 571
Query: 689 VNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLL 748
+NG +IGRYW Y G P QT Y +P WL N +
Sbjct: 572 INGVNIGRYW--------------YAG-------------PQQTLY-IPGVWLNKGENKI 603
Query: 749 VIFEETGGNPFEISVKLRSTRIVCE 773
VI+E+ N + SV+ T ++ +
Sbjct: 604 VIYEQL-NNDRKSSVRTVKTPVLTK 627
>gi|189463987|ref|ZP_03012772.1| hypothetical protein BACINT_00322 [Bacteroides intestinalis DSM 17393]
gi|189438560|gb|EDV07545.1| glycosyl hydrolase family 35 [Bacteroides intestinalis DSM 17393]
Length = 1106
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 109/324 (33%), Positives = 163/324 (50%), Gaps = 31/324 (9%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++ +A +HYPR W I K G + + YVFWN+HE G Y+F
Sbjct: 357 FLLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGVYDFTE 416
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+ND+ +F +L + +Y+ LR GPYVCAEW GG P WL + R ++ F E +
Sbjct: 417 QNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDVRLRESDPYFIERVAL 476
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVK----------- 220
F + + +++ + + GGPIIM+Q+ENEYG+ + Y Q +D V+
Sbjct: 477 FEEAVAKQVKDLTIAN--GGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANFGNDIALFQ 534
Query: 221 --WAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWY 278
WA++ L + W M T A + A +PNS P + +E W GW+
Sbjct: 535 CDWASNFTLNGLDDLIWTMNFGTGANVDQQFA----KLKQLRPNS---PLMCSEFWSGWF 587
Query: 279 TTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITSYD 333
WG RP D+ + RG SF + YM GGTN+G +G P + +TSYD
Sbjct: 588 DKWGANHETRPAADMIKGIDDMLSRGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYD 646
Query: 334 YDAPIDEYGLLSEPKWGHLKDLHA 357
YDAPI E G + PK+ L++ A
Sbjct: 647 YDAPISESGQTT-PKYWALREAMA 669
Score = 45.8 bits (107), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 56/228 (24%), Positives = 93/228 (40%), Gaps = 52/228 (22%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGH--WVKVVQPVEFQSGYNDLILLSQTVGLQNYGAF 595
T+T++ D +VF++G+ G + ++V P + D+++ + +G N+G
Sbjct: 740 TLTVNDAHDYAQVFVDGKYIGKLDRRNGEKQLVLPACVKGSRLDILV--EAMGRINFGRA 797
Query: 596 LEKDGAGFRGQVKLTGFKNG--------DIDLSKILWTYQVGLKGEFQQIYSIEENEAEW 647
+ KD G V+L+ NG + ++ I TY+ +FQ I S+
Sbjct: 798 I-KDFKGITKNVELSMDINGYPFVCDLKNWEVFNIEDTYEFYQGMKFQPIESL------- 849
Query: 648 TDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGC 707
TD IP Y+ F D L+ + GKG +VNG+ +GR W +
Sbjct: 850 TDRLGQRIPGV---YRAKFQVKKPSD-TFLNFETWGKGLVYVNGYALGRIWEI------- 898
Query: 708 QDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P QT Y VP WL+ N +V+F+ G
Sbjct: 899 --------------------GPQQTLY-VPGCWLKKGENEIVVFDIVG 925
>gi|255973889|ref|ZP_05424475.1| beta-galactosidase [Enterococcus faecalis T2]
gi|307284354|ref|ZP_07564519.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|255966761|gb|EET97383.1| beta-galactosidase [Enterococcus faecalis T2]
gi|306503294|gb|EFM72546.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
Length = 593
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 116/340 (34%), Positives = 164/340 (48%), Gaps = 42/340 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G +IS IHY R TP W D + K GA+ +ETY+ WN HE G Y+F+G
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+I FV+L L + LR Y+CAEW FGG P WL G+ R+ + F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ + V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + LG V
Sbjct: 131 YFQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTRQIMEELGIEV 183
Query: 234 PWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTEN 273
P + A E ++DA G + K N+ + K P + E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSGGP 326
WDGW+ WG + R DLA V G +N YM+ GGTNFG R +
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLTVGS--LNLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 327 FYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
+TSYDYDA + E G +E + + AIK P +
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEV 335
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 79/214 (36%), Gaps = 49/214 (22%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLIL--LSQTVGLQNYGAFLEK--DGA 601
D L ++++G L + V + Q+ + L L L + +G NYG L
Sbjct: 410 DRLHIYVDGDLAATQYQETVGEELLISGQTEKDTLALDILVENLGRVNYGFKLNNPTQSK 469
Query: 602 GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTW 661
G RG V DI + Y + E Q+ I D T P ++
Sbjct: 470 GIRGGVM------QDIHFHQGYQHYPLTFSQE--QLAKI--------DYTAGKNPLQPSF 513
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDK 721
Y+ F+ D +D GKG VNGHH+GRYW + G +S
Sbjct: 514 YQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------GPIHSLY 558
Query: 722 CTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
C P+ +LQ N +VIFE G
Sbjct: 559 C--------------PKEFLQQGQNEVVIFETEG 578
>gi|307272985|ref|ZP_07554232.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
gi|306510599|gb|EFM79622.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
Length = 604
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 139 YYDVLMEKIVPHQLVN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 191
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 192 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 248
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 249 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 306
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 307 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALSQAEP-LVKDSFAQ 356
>gi|227518994|ref|ZP_03949043.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|227553614|ref|ZP_03983663.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|293383402|ref|ZP_06629315.1| beta-galactosidase [Enterococcus faecalis R712]
gi|293388945|ref|ZP_06633430.1| beta-galactosidase [Enterococcus faecalis S613]
gi|312907770|ref|ZP_07766761.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|312910388|ref|ZP_07769235.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|422714384|ref|ZP_16771110.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|422715641|ref|ZP_16772357.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|424676529|ref|ZP_18113400.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|424681657|ref|ZP_18118444.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|424683847|ref|ZP_18120597.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|424686250|ref|ZP_18122918.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|424690479|ref|ZP_18127014.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|424695572|ref|ZP_18131955.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|424696689|ref|ZP_18133030.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|424699924|ref|ZP_18136135.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|424703062|ref|ZP_18139196.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|424707441|ref|ZP_18143425.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|424716899|ref|ZP_18146197.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|424720477|ref|ZP_18149578.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|424724025|ref|ZP_18152974.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|424733616|ref|ZP_18162171.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|424744084|ref|ZP_18172389.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|424750408|ref|ZP_18178472.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
gi|227073566|gb|EEI11529.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|227177262|gb|EEI58234.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|291079193|gb|EFE16557.1| beta-galactosidase [Enterococcus faecalis R712]
gi|291081726|gb|EFE18689.1| beta-galactosidase [Enterococcus faecalis S613]
gi|310626798|gb|EFQ10081.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|311289661|gb|EFQ68217.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|315575986|gb|EFU88177.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|315580706|gb|EFU92897.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|402350756|gb|EJU85654.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|402356541|gb|EJU91272.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|402364212|gb|EJU98655.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|402364322|gb|EJU98764.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|402367784|gb|EJV02121.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|402368267|gb|EJV02587.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|402375423|gb|EJV09410.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|402377018|gb|EJV10929.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|402385039|gb|EJV18580.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|402385067|gb|EJV18607.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|402386247|gb|EJV19753.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|402391229|gb|EJV24540.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|402392948|gb|EJV26178.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|402396006|gb|EJV29081.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|402399507|gb|EJV32379.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|402406707|gb|EJV39253.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
Length = 604
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 20 FLLNGQSFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 139 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 191
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 192 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 248
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 249 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 306
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 307 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKESFAQ 356
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 88/223 (39%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 417 RDRLQLFVNQVHQATQYQTEIGEDIYVTLPQE----NNQIDILMENMGRVNYGHKLFAD- 471
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + +Q Y + E D +R+ P +
Sbjct: 472 ------TQKKGIRTGVMADLHFMTQWQQ---------YCLPMTSCEQVDYSREWQPDQPS 516
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 517 FYQYHVELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 555
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 556 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 590
>gi|410972395|ref|XP_003992645.1| PREDICTED: beta-galactosidase-1-like protein 3 [Felis catus]
Length = 664
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 105/322 (32%), Positives = 162/322 (50%), Gaps = 29/322 (9%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+ G++ ++ IHY R E W D + K K G + + TYV WN HE RG+++F G
Sbjct: 93 LGGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTLTTYVPWNLHEPQRGKFDFSGNL 152
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ FV + GL++ LR GPY+C+E + GG P WL P + RT F E + ++
Sbjct: 153 DLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPKMILRTTYKGFVEAVNKYF 212
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
++ R L + GPII +Q+ENEYG+ + KDY+ + L G+
Sbjct: 213 DHLIS--RVVPLQYRKRGPIIAVQVENEYGSFA-----EDKDYMPYIQKAL--LERGIVE 263
Query: 236 VMCKQTDAPENIIDACNGYYC----DGYKPNSY--------NKPTLWTENWDGWYTTWGG 283
++ DA + G + ++ N + NKP + E W GW+ TWGG
Sbjct: 264 LLMTSDDAKHMLKGYIEGVLATINMNTFQINDFKQLSQVQRNKPIMVMEFWVGWFDTWGG 323
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAP 337
+ + ED+ V++F SF N YM+ GGTNFG +G ++ +TSYDYDA
Sbjct: 324 KHMIKNAEDVEDTVSKFITSEISF-NVYMFHGGTNFGFMNGATYFGKHRGVVTSYDYDAV 382
Query: 338 IDEYGLLSEPKWGHLKDLHAAI 359
+ E G +E K+ L+ L ++
Sbjct: 383 LTEAGDYTE-KYFKLRKLFGSV 403
>gi|255971270|ref|ZP_05421856.1| beta-galactosidase [Enterococcus faecalis T1]
gi|255962288|gb|EET94764.1| beta-galactosidase [Enterococcus faecalis T1]
Length = 593
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 116/340 (34%), Positives = 164/340 (48%), Gaps = 42/340 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G +IS IHY R TP W D + K GA+ +ETY+ WN HE G Y+F+G
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+I FV+L L + LR Y+CAEW FGG P WL G+ R+ + F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ + V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + LG V
Sbjct: 131 YFQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTRQIMEELGIEV 183
Query: 234 PWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTEN 273
P + A E ++DA G + K N+ + K P + E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSGGP 326
WDGW+ WG + R DLA V G +N YM+ GGTNFG R +
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLTVGS--LNLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 327 FYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
+TSYDYDA + E G +E + + AIK P +
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEV 335
Score = 42.7 bits (99), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 56/216 (25%), Positives = 80/216 (37%), Gaps = 49/216 (22%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLIL--LSQTVGLQNYGAFLEK--DGA 601
D L ++++G L + V + Q+ + L L L + +G NYG L
Sbjct: 410 DRLHIYVDGDLAATQYQETVGEELLISGQTEKDTLALDILVENLGRVNYGFKLNNPTQSK 469
Query: 602 GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTW 661
G RG V DI + Y + E Q+ I D T P ++
Sbjct: 470 GIRGGVM------QDIHFHQGYQHYPLTFSQE--QLAKI--------DYTAGKNPLQPSF 513
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDK 721
Y+ F+ D +D GKG VNGHH+GRYW + G +S
Sbjct: 514 YQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------GPIHSLY 558
Query: 722 CTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
C P+ +LQ N +VIFE G N
Sbjct: 559 C--------------PKEFLQQGQNEVVIFETEGIN 580
>gi|299148656|ref|ZP_07041718.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
gi|298513417|gb|EFI37304.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
Length = 778
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 113/343 (32%), Positives = 173/343 (50%), Gaps = 34/343 (9%)
Query: 19 YPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMW 78
+ +++++ I +S + SS +S V + I+G LI +HYPR E W
Sbjct: 7 HKTVLVILNIIVSFLISSCSSP---KEQVRIGNGTFTIEGKDIQLICGEMHYPRIPHEYW 63
Query: 79 PDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPY 138
D + +++ G + + YVFWN HE G+++F G+ DI +F++ GLY+ LR GPY
Sbjct: 64 RDRLKRARAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGPY 123
Query: 139 VCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIML 198
VCAEW+FGG+P WL + +R+ + F +R++K++ + L GG IIM+
Sbjct: 124 VCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCERYIKELGKQLSP--LTINNGGNIIMV 181
Query: 199 QIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCK---QTDA--PENIIDACNG 253
Q+ENEYG+ + G Y+ M G VP C Q +A E + NG
Sbjct: 182 QVENEYGSYAADKG-----YLAAIRDMIKEAGFNVPLFTCDGGGQVEAGHTEGALPTLNG 236
Query: 254 YYC-DGYKP-NSYNK--PTLWTENWDGWYTTWGGRLP----HRPVEDLAFAVARFFQRGG 305
+ D +K + Y K P E + W+ WG R RP E L + ++ G
Sbjct: 237 VFGEDIFKVIDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLSH-----G 291
Query: 306 SFMNYYMYFGGTNF----GRTSGGPF--YITSYDYDAPIDEYG 342
++ YM+ GGTNF G +GG + TSYDYDAP+ E+G
Sbjct: 292 VSVSMYMFHGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWG 334
Score = 46.2 bits (108), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 51/222 (22%), Positives = 84/222 (37%), Gaps = 56/222 (25%)
Query: 536 RPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAF 595
+ + I +RD + I+G+ S+ + + + L +L + G NYG
Sbjct: 417 KQKLVIQDLRDYAVILIDGKQVASLDRRYNQNSMTLNVSKTPATLEILVENTGRVNYGPD 476
Query: 596 LEKDGAGFRGQV-----KLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDL 650
+ + G QV KLTG+ + L K +++ +E E
Sbjct: 477 ILFNRKGITSQVLWGNEKLTGWSITPLPLYK-------------EKVSEMEFGE------ 517
Query: 651 TRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDT 710
T G+P+ ++K F D +D+ GKG WVNG +GR+W +
Sbjct: 518 TIKGVPA---FHKGTFTVEKKGD-CFVDMSQWGKGAVWVNGKSLGRFWNI---------- 563
Query: 711 CDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
P QT Y +P WL+ N +V+FE
Sbjct: 564 -----------------GPQQTLY-LPAPWLKEGENEIVVFE 587
>gi|397699203|ref|YP_006536991.1| beta-galactosidase [Enterococcus faecalis D32]
gi|397335842|gb|AFO43514.1| beta-galactosidase [Enterococcus faecalis D32]
Length = 593
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 116/340 (34%), Positives = 164/340 (48%), Gaps = 42/340 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G +IS IHY R TP W D + K GA+ +ETY+ WN HE G Y+F+G
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+I FV+L L + LR Y+CAEW FGG P WL G+ R+ + F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ + V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + LG V
Sbjct: 131 YFQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEV 183
Query: 234 PWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTEN 273
P + A E ++DA G + K N+ + K P + E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLRKFMTRHGKKWPLMCMEY 241
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSGGP 326
WDGW+ WG + R DLA V G +N YM+ GGTNFG R +
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGS--LNLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 327 FYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
+TSYDYDA + E G +E + + AIK P +
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEV 335
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 79/214 (36%), Gaps = 49/214 (22%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLIL--LSQTVGLQNYGAFLEK--DGA 601
D L ++++G L + V + Q+ + L L L + +G NYG L
Sbjct: 410 DRLHIYVDGDLAATQYQETVGEELLISGQTEKDTLALDILVENLGRVNYGFKLNNPTQSK 469
Query: 602 GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTW 661
G RG V DI + Y + E Q+ I D T P ++
Sbjct: 470 GIRGGVM------QDIHFHQGYQHYPLTFSQE--QLAKI--------DYTAGKNPLQPSF 513
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDK 721
Y+ F+ D +D GKG VNGHH+GRYW + G +S
Sbjct: 514 YQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------GPIHSLY 558
Query: 722 CTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
C P+ +LQ N +VIFE G
Sbjct: 559 C--------------PKEFLQQGQNEVVIFETEG 578
>gi|237721434|ref|ZP_04551915.1| beta-galactosidase [Bacteroides sp. 2_2_4]
gi|293370839|ref|ZP_06617384.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
gi|229449230|gb|EEO55021.1| beta-galactosidase [Bacteroides sp. 2_2_4]
gi|292634055|gb|EFF52599.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
Length = 777
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 113/343 (32%), Positives = 173/343 (50%), Gaps = 34/343 (9%)
Query: 19 YPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMW 78
+ +++++ I +S + SS +S V + I+G LI +HYPR E W
Sbjct: 7 HKTVLVILNIIVSFLISSCSSP---KEQVRIGNGTFTIEGKDIQLICGEMHYPRIPHEYW 63
Query: 79 PDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPY 138
D + +++ G + + YVFWN HE G+++F G+ DI +F++ GLY+ LR GPY
Sbjct: 64 RDRLKRARAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGPY 123
Query: 139 VCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIML 198
VCAEW+FGG+P WL + +R+ + F +R++K++ + L GG IIM+
Sbjct: 124 VCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCERYIKELGKQLSP--LTINNGGNIIMV 181
Query: 199 QIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCK---QTDA--PENIIDACNG 253
Q+ENEYG+ + G Y+ M G VP C Q +A E + NG
Sbjct: 182 QVENEYGSYAADKG-----YLAAIRDMIKEAGFNVPLFTCDGGGQVEAGHTEGALPTLNG 236
Query: 254 YYC-DGYKP-NSYNK--PTLWTENWDGWYTTWGGRLP----HRPVEDLAFAVARFFQRGG 305
+ D +K + Y K P E + W+ WG R RP E L + ++ G
Sbjct: 237 VFGEDIFKVIDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLSH-----G 291
Query: 306 SFMNYYMYFGGTNF----GRTSGGPF--YITSYDYDAPIDEYG 342
++ YM+ GGTNF G +GG + TSYDYDAP+ E+G
Sbjct: 292 VSVSMYMFHGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWG 334
Score = 46.2 bits (108), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 51/222 (22%), Positives = 84/222 (37%), Gaps = 56/222 (25%)
Query: 536 RPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAF 595
+ + I +RD + I+G+ S+ + + + L +L + G NYG
Sbjct: 417 KQKLVIQDLRDYAVILIDGKQVASLDRRYNQNSMTLNVSKTPATLEILVENTGRVNYGPD 476
Query: 596 LEKDGAGFRGQV-----KLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDL 650
+ + G QV KLTG+ + L K +++ +E E
Sbjct: 477 ILFNRKGITSQVLWGNEKLTGWSITPLPLYK-------------EKVSEMEFGE------ 517
Query: 651 TRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDT 710
T G+P+ ++K F D +D+ GKG WVNG +GR+W +
Sbjct: 518 TIKGVPA---FHKGTFTVEKKGD-CFVDMSQWGKGAVWVNGKSLGRFWNI---------- 563
Query: 711 CDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
P QT Y +P WL+ N +V+FE
Sbjct: 564 -----------------GPQQTLY-LPAPWLKEGENEIVVFE 587
>gi|384108880|ref|ZP_10009768.1| Beta-galactosidase [Treponema sp. JC4]
gi|383869584|gb|EID85195.1| Beta-galactosidase [Treponema sp. JC4]
Length = 592
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 108/361 (29%), Positives = 168/361 (46%), Gaps = 49/361 (13%)
Query: 53 AIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFK 112
++DG +IS IHY R PE W D + K K G + +ETY+ WN E +G++ F
Sbjct: 9 TFLLDGKPFQIISGSIHYFRVVPEYWQDRLEKLKNMGCNTVETYIPWNITEPRKGEFCFD 68
Query: 113 GKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQ 172
G D KF+ L GLY +R PY+CAEW GG P W+ +PG+E R N P+ + ++
Sbjct: 69 GLCDFEKFLDLAQKLGLYAIVRPSPYICAEWELGGLPSWIFTVPGLEPRCKNEPYYQNVR 128
Query: 173 RFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAG 232
+ K ++ + + +GG II++QIENEYG Y + Y+ + + G
Sbjct: 129 DYYKVLLPRLVNHQID--KGGNIILMQIENEYG-----YYGKDMSYMHFLEGLMREGGIT 181
Query: 233 VPWVMCKQTDAPENIIDACNGYYCDG----------------YKPNSYNKPTLWTENWDG 276
VP+V I C+G G K P + E W G
Sbjct: 182 VPFVTSDGPWGKMFIHGQCDGALPTGNFGSHARPLFANMKRMMKKTGNRGPLMCMEFWIG 241
Query: 277 WYTTWGGR-----LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY--- 328
W+ WG + R ++DL + + + +N+YM+ GGTNFG +G ++
Sbjct: 242 WFDAWGNKEHKTSKLKRNIKDLNYMLKK------GNVNFYMFHGGTNFGFMNGSNYFTKL 295
Query: 329 ---ITSYDYDAPIDEYGLLSE---------PKWGHLKDLHAAIKLCEPALVAADSAQYIK 376
TSYDYDAP+ E G ++E K+ +++ + K+ + A + + IK
Sbjct: 296 TPDTTSYDYDAPLSEDGKITEKYRTFQSIIKKYRDFEEMPLSTKIEQKAYGKVKAGKSIK 355
Query: 377 L 377
L
Sbjct: 356 L 356
>gi|256959941|ref|ZP_05564112.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|293384307|ref|ZP_06630193.1| beta-galactosidase [Enterococcus faecalis R712]
gi|293388457|ref|ZP_06632963.1| beta-galactosidase [Enterococcus faecalis S613]
gi|312907112|ref|ZP_07766105.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|312979309|ref|ZP_07791007.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|256950437|gb|EEU67069.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|291078380|gb|EFE15744.1| beta-galactosidase [Enterococcus faecalis R712]
gi|291082147|gb|EFE19110.1| beta-galactosidase [Enterococcus faecalis S613]
gi|310626889|gb|EFQ10172.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|311287903|gb|EFQ66459.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
Length = 593
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 117/342 (34%), Positives = 165/342 (48%), Gaps = 46/342 (13%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G +IS IHY R TP W D + K GA+ +ETY+ WN HE G Y+F+G
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+I FV+L L + LR Y+CAEW FGG P WL G+ R+ + F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGN--MESSYGQQGKDYVKWAASMALGLGA 231
+ + V L + L QGGP+IM+Q+ENEYG+ ME +Y QQ K ++ LG
Sbjct: 131 YFQ--VLLPKLAPLQITQGGPVIMMQVENEYGSYGMEKAYLQQTKQIME-------ELGI 181
Query: 232 GVPWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWT 271
VP + A E ++DA G + K N+ + K P +
Sbjct: 182 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLRKFMTRHGKKWPLMCM 239
Query: 272 ENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSG 324
E WDGW+ WG + R DLA V G +N YM+ GGTNFG R +
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLAVGS--LNLYMFHGGTNFGFYNGCSARGAK 297
Query: 325 GPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
+TSYDYDA + E G +E + + AIK P +
Sbjct: 298 DLPQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEV 335
Score = 42.7 bits (99), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 56/216 (25%), Positives = 80/216 (37%), Gaps = 49/216 (22%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLIL--LSQTVGLQNYGAFLEK--DGA 601
D L ++++G L + V + Q+ + L L L + +G NYG L
Sbjct: 410 DRLHIYVDGDLAATQYQETVGEELLISGQTEKDTLALDILVENLGRVNYGFKLNNPTQSK 469
Query: 602 GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTW 661
G RG V DI + Y + E Q+ I D T P ++
Sbjct: 470 GIRGGVM------QDIHFHQGYQHYPLTFSQE--QLAKI--------DYTAGKNPLQPSF 513
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDK 721
Y+ F+ D +D GKG VNGHH+GRYW + G +S
Sbjct: 514 YQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------GPIHSLY 558
Query: 722 CTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
C P+ +LQ N +VIFE G N
Sbjct: 559 C--------------PKEFLQQGQNEVVIFETEGIN 580
>gi|323449959|gb|EGB05843.1| hypothetical protein AURANDRAFT_66064 [Aureococcus anophagefferens]
Length = 1630
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 133/466 (28%), Positives = 204/466 (43%), Gaps = 84/466 (18%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
+P++++ D R+++++G+R +L+S IHYPR+TP MWP L A+++ G + IE+Y FWN H
Sbjct: 1034 RPYSIARDGRSLLVNGSRVLLLSGSIHYPRSTPAMWPKLFAEARANGLNAIESYAFWNKH 1093
Query: 103 ESIR-GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFP------------ 149
+ R G Y++ D+ F+ L L++ R GPYVCAEW GG P
Sbjct: 1094 SATRYGAYDYGFNGDVDLFLSLAAEHDLFVLWRFGPYVCAEWPAGGIPARAPRRAVFASN 1153
Query: 150 VWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMES 209
W+ D+PG++ RTNN + E R+++ ++ E S G +IENEYG +S
Sbjct: 1154 AWIHDVPGMKTRTNNTAWLNETGRWMRDHFAVI--EPHLSRNGA---SNRIENEYGGSKS 1208
Query: 210 SYGQQGKD--------------------YVKWAASMALGLGAGVPWVMCKQTDAPENIID 249
+V A AL G G P Q A +++
Sbjct: 1209 DAAAVAYVDALDALADAVAPELVWMMCGFVSLVAPDALHTGNGCPH---DQGPASAHVV- 1264
Query: 250 ACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMN 309
P P +TE+ + WY WG RP D+A+ VA + GG+ N
Sbjct: 1265 ---------VPPAPGADPAWYTED-ELWYDAWGLPSLARPPADVAYGVASYVATGGAMHN 1314
Query: 310 YYMYFGGTNFGRTS------GG------PFYITSYDYDAPIDEYGLLSEPKWGHLKDLHA 357
+YM+ GG ++G S GG P Y AP+ G EP + HL +H
Sbjct: 1315 FYMWHGGNHYGNWSTATPDLGGASSPEPPASQVRYANAAPLRSDGSRHEPLFSHLAAVHG 1374
Query: 358 AI-------------KLCEPALVAA-DSAQYIKLGQNQEAHVYRANRYGSQSNCSAFLAN 403
+ L P+ VAA A ++K + + V+ + + C A
Sbjct: 1375 TLDAYAEVLLGATPEALATPSCVAACPHAYFLKFANDTASVVFGVHACAQWNACDANATA 1434
Query: 404 IDEHTAASVTFL-----GQSYTLPPWSVSILPDCRNTV-FNTAKVS 443
+ A++ T L + LP SV L V FNT+ V+
Sbjct: 1435 AVDVRASNATGLFPAGAPPALVLPFGSVVALDGASGAVLFNTSDVA 1480
>gi|336063700|ref|YP_004558559.1| beta-galactosidase [Streptococcus pasteurianus ATCC 43144]
gi|334281900|dbj|BAK29473.1| beta-galactosidase precursor [Streptococcus pasteurianus ATCC
43144]
Length = 595
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 117/356 (32%), Positives = 172/356 (48%), Gaps = 48/356 (13%)
Query: 52 RAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNF 111
+ +DG ++S IHY R P+ W + K G + +ETYV WN HE G+++F
Sbjct: 8 ESFFLDGKPFKILSGSIHYFRIHPDDWYQSLYNLKALGFNTVETYVPWNLHEPREGEFDF 67
Query: 112 KGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEM 171
G D+ +F+ + GLY +R PY+CAEW FGG P WL + G+ R+ + F + +
Sbjct: 68 TGILDLERFLTIAQELGLYAIVRPSPYICAEWEFGGLPAWLLE-KGVRVRSQDKDFLQVV 126
Query: 172 QRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGA 231
+R+ + ++ + + L QGG I+M Q+ENEYG SYG+ K Y++ M L LG
Sbjct: 127 KRYYEALIPRLIKHQLD--QGGNILMFQVENEYG----SYGED-KVYLRELKQMMLELGL 179
Query: 232 GVPWVMCKQTDAP-------ENIID---ACNGYYCDGYKPN---------SYNK--PTLW 270
P+ +D P ++I+ G + K N Y K P +
Sbjct: 180 EEPFFT---SDGPWHTALRAGSLIEDDVLVTGNFGSKAKENFASMEMFFQQYGKKWPLMC 236
Query: 271 TENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY-- 328
E WDGW+ WG + R E+LA AV + G +N YM+ GGTNFG +G
Sbjct: 237 MEFWDGWFNRWGEPVIKRDPEELADAVMEAIEIGS--INLYMFHGGTNFGFMNGCSARKQ 294
Query: 329 -----ITSYDYDAPIDEYG-------LLSEPKWGHLKDLHAAIKLCEPALVAADSA 372
+TSYDYDA +DE G +L +LH A L +P + D A
Sbjct: 295 TDLPQVTSYDYDAILDEAGNPTKKFYILQHRLKNKYPELHYAAPLVKPTMAIKDIA 350
Score = 43.1 bits (100), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 56/222 (25%), Positives = 88/222 (39%), Gaps = 55/222 (24%)
Query: 539 VTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFL-- 596
V + RD +VF+NG + + V F S + L +L + +G NYG L
Sbjct: 402 VRLIDTRDRAQVFLNGNHIVTQYQEEIGDDIQVNFTSEESQLDILVENMGRVNYGHKLTA 461
Query: 597 --EKDGAGFRGQVKLTGFKNGDIDLSKILW-TYQVGLKGEFQQIYSIEENEAEWTDLTRD 653
+ G G RG + F N W TY + + YS + W R+
Sbjct: 462 PSQHKGIG-RGVMLDLHFVNQ--------WETYPLSMNSIKNLKYS-----SPW----RE 503
Query: 654 GIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDY 713
G+PS F +K + P+ +D+ GKG A++NG+++GR+W +
Sbjct: 504 GVPS-FYEFKFHCLNPED---TYMDMSGFGKGVAFINGYNLGRFWNI------------- 546
Query: 714 RGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
PT + Y +PR + N + IFE G
Sbjct: 547 --------------GPTLSLY-IPRGMMVCGENTITIFETEG 573
>gi|354585216|ref|ZP_09004105.1| glycoside hydrolase family 35 [Paenibacillus lactis 154]
gi|353188942|gb|EHB54457.1| glycoside hydrolase family 35 [Paenibacillus lactis 154]
Length = 619
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 103/313 (32%), Positives = 157/313 (50%), Gaps = 32/313 (10%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
++DG +IS +HY R PE W D + K K G + +ETY+ WN HE G++NF G
Sbjct: 12 LLDGQPYRIISGAVHYFRVVPEYWEDRLLKLKACGFNTVETYIAWNVHEPTEGEFNFSGM 71
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
D+ F++L G GL++ +R P++CAEW FGG P WL I R ++ + ++ +
Sbjct: 72 ADVGSFIELAGKLGLHVIVRPSPFICAEWEFGGLPGWLLGYGEIRLRCSDPLYLSKVDHY 131
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
+++ M L S GGPI+ +Q+ENEYG SYG Y+++ + + G V
Sbjct: 132 YDELIPRMVP--LLSSNGGPILAVQVENEYG----SYGND-HAYLEYLRAGLVRRGVDV- 183
Query: 235 WVMCKQTDAPEN------IIDACNGYYCDG---------YKPNSYNKPTLWTENWDGWYT 279
+ +D P + ID + G Y+ ++P + E W+GW+
Sbjct: 184 --LLFTSDGPTDEMLLGGSIDHVHATVNFGSRVEESFGKYREYRTDEPLMVMEFWNGWFD 241
Query: 280 TWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYD 333
W R D+A + ++G S +N YM+ GGTNFG SG TSYD
Sbjct: 242 HWMEDHHVRDAADVAGVLDEMLEKGSS-INMYMFHGGTNFGFYSGANHIKTYEPTTTSYD 300
Query: 334 YDAPIDEYGLLSE 346
YDAP+ E+G +E
Sbjct: 301 YDAPLTEWGDKTE 313
Score = 40.8 bits (94), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 61/239 (25%), Positives = 98/239 (41%), Gaps = 60/239 (25%)
Query: 541 IDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQ--SGYNDLILLSQTVGLQNYGAFLEK 598
I +RD +VF++G+ G VI W +QP++ + L +L + +G NYG +
Sbjct: 401 IQEVRDRAQVFLDGRPLG-VIERWN--LQPLDITVPATGARLDILVENMGRINYGPLIH- 456
Query: 599 DGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYS-------IEENEAEWTDLT 651
D G V++ ++ L+ + V Q+ S +++ +AE +L+
Sbjct: 457 DPKGITEGVRID---------NQFLYNWTVRTLPLASQMLSSLSYKPVMDKGQAEHEELS 507
Query: 652 RD-----GIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGG 706
G+P +Y+ F D I L KG AW+NG ++GRYW
Sbjct: 508 TSTSEDTGLPG---FYRGSFQVED-IGDTFLRFDGWTKGVAWINGFNLGRYW-------- 555
Query: 707 CQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKL 765
N G P + Y +P L+ N LV+FE GG P V+L
Sbjct: 556 ------------------NAG-PQKALY-IPGPLLRKGENELVLFELHGG-PESCEVEL 593
>gi|160890905|ref|ZP_02071908.1| hypothetical protein BACUNI_03350 [Bacteroides uniformis ATCC 8492]
gi|156859904|gb|EDO53335.1| glycosyl hydrolase family 35 [Bacteroides uniformis ATCC 8492]
Length = 1106
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 113/335 (33%), Positives = 164/335 (48%), Gaps = 29/335 (8%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++ +A +HYPR W I K G + I YVFWN+HES G ++F G
Sbjct: 358 FLLNGKPFVIKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHESQPGVFDFTG 417
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+ND+ +F +L + +Y+ LR GPYVCAEW GG P WL I R ++ F E +
Sbjct: 418 QNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFMERVGI 477
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVK----------- 220
F K + + + + + GGPIIM+Q+ENEYG+ + Y Q +D V+
Sbjct: 478 FEKAVAEQVAGMTIQN--GGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVALFQC 535
Query: 221 -WAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYT 279
WA++ + W M T A NI +P+S P + +E W GW+
Sbjct: 536 DWASNFTKNGLHDLVWTMNFGTGA--NIDQQFAP--LKKLRPDS---PLMCSEFWSGWFD 588
Query: 280 TWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITSYDY 334
WG RP D+ + +G SF + YM GGTN+G +G P + +TSYDY
Sbjct: 589 KWGANHETRPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDY 647
Query: 335 DAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAA 369
DAPI E G + W K L + + A V A
Sbjct: 648 DAPISESGQTTPKYWELRKALSKYMNGEKQAKVPA 682
Score = 40.8 bits (94), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 53/227 (23%), Positives = 91/227 (40%), Gaps = 52/227 (22%)
Query: 539 VTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYN--DLILLSQTVGLQNYGAFL 596
+T++ D +VF++G+ G + + +EF + L +L + +G N+G +
Sbjct: 741 LTVNDAHDYAQVFLDGKYIGKLDRR--NGEKQLEFPACPKGARLDILVEAMGRINFGRAI 798
Query: 597 EKDGAGFRGQVKLTGFKNG--------DIDLSKILWTYQVGLKGEFQQIYSIEENEAEWT 648
KD G V+LT +G D ++ + TY +FQ I S+++ +
Sbjct: 799 -KDFKGITQSVELTVDIDGRPFTCNLKDWEVYNLEDTYDFYKNMKFQPIGSLKDELGQ-- 855
Query: 649 DLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQ 708
IP Y+ F D L+ + GKG +VNGH +GR W +
Sbjct: 856 -----RIPGC---YRATFKVNKPSD-TFLNFETWGKGLVYVNGHAMGRIWEI-------- 898
Query: 709 DTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P QT Y +P WL+ N +++F+ G
Sbjct: 899 -------------------GPQQTLY-IPGCWLKKGENEVIVFDIIG 925
>gi|420262409|ref|ZP_14765050.1| beta-galactosidase [Enterococcus sp. C1]
gi|394770166|gb|EJF49970.1| beta-galactosidase [Enterococcus sp. C1]
Length = 585
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 108/307 (35%), Positives = 153/307 (49%), Gaps = 28/307 (9%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+D +IS IHY R PE W D + K + G + +ETYV WN HE+ G Y F+G
Sbjct: 12 LDNKPFKVISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFEGIL 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ +F++ GLY+ LR PY+CAEW FGG P WL P ++ R + PF E++ R+
Sbjct: 72 DLRRFIQTAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYF 131
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
+ +R+ L QGGPI+M+Q+ENEYG SY K+Y++ + G P
Sbjct: 132 AHLFPQVRD--LQITQGGPILMMQVENEYG----SYAND-KEYLRKMVAAMRQQGVETPL 184
Query: 236 VMCKQT--DAPEN--IIDA------CNGYYCDGYKP----NSYNKPTLWTENWDGWYTTW 281
V D EN I D C + ++ + +P + E W GW+ W
Sbjct: 185 VTSDGPWHDMLENGSIKDLALPTINCGSNIKENFEKLRRFHGEKRPLMVMEFWIGWFDAW 244
Query: 282 GGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYD 335
G H A + GS +N YM+ GGTNFG +G +Y +TSYDYD
Sbjct: 245 GDDHHHTTSTADAVKELQDCLAEGS-VNIYMFHGGTNFGFMNGSNYYERLAPDVTSYDYD 303
Query: 336 APIDEYG 342
A + E+G
Sbjct: 304 ALLTEWG 310
>gi|420261585|ref|ZP_14764229.1| glycosyl hydrolase [Enterococcus sp. C1]
gi|394771519|gb|EJF51280.1| glycosyl hydrolase [Enterococcus sp. C1]
Length = 591
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/328 (35%), Positives = 160/328 (48%), Gaps = 42/328 (12%)
Query: 42 FKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNA 101
+ F + D ++DG LIS IHY R TP W D + K GA+ +ETY+ WN
Sbjct: 1 MRTFEIKED---FLLDGKPIKLISGAIHYFRMTPVQWTDSLYNLKALGANTVETYIPWNL 57
Query: 102 HESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFR 161
HE G Y+F+G DI FVK + GL + LR Y+CAEW FGG P WL + P + R
Sbjct: 58 HEPREGVYDFEGMKDICAFVKQAQTIGLMVILRPSVYICAEWEFGGLPAWLLNEP-MRLR 116
Query: 162 TNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKW 221
+ + F +++ + + V L + L GGP+IM+Q+ENEYG SYG + K Y++
Sbjct: 117 STDPRFMAKVRNYFQ--VLLPKLVPLQITHGGPVIMMQVENEYG----SYGME-KAYLRQ 169
Query: 222 AASMALGLGAGVPWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS--------- 263
+ G VP + A E ++DA G + K N+
Sbjct: 170 TKELMEEYGIDVP--LFTSDGAWEEVLDAGTLIEDDIFVTGNFGSRSKENAAVMKEFMAK 227
Query: 264 --YNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGR 321
N P + E WDGW+ WG + R +DLA V G +N YM+ GGTNFG
Sbjct: 228 HGKNWPIMCMEYWDGWFNRWGEPIIKRDGQDLANEVKEMLAVGS--LNLYMFHGGTNFGF 285
Query: 322 TSG----GPF---YITSYDYDAPIDEYG 342
+G G ++SYDYDA + E G
Sbjct: 286 YNGCSARGALDLPQVSSYDYDALLTEAG 313
>gi|423303842|ref|ZP_17281841.1| hypothetical protein HMPREF1072_00781 [Bacteroides uniformis
CL03T00C23]
gi|423307438|ref|ZP_17285428.1| hypothetical protein HMPREF1073_00178 [Bacteroides uniformis
CL03T12C37]
gi|392687173|gb|EIY80470.1| hypothetical protein HMPREF1072_00781 [Bacteroides uniformis
CL03T00C23]
gi|392690047|gb|EIY83318.1| hypothetical protein HMPREF1073_00178 [Bacteroides uniformis
CL03T12C37]
Length = 1106
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 113/335 (33%), Positives = 164/335 (48%), Gaps = 29/335 (8%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++ +A +HYPR W I K G + I YVFWN+HES G ++F G
Sbjct: 358 FLLNGKPFVIKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHESQPGVFDFTG 417
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+ND+ +F +L + +Y+ LR GPYVCAEW GG P WL I R ++ F E +
Sbjct: 418 QNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFMERVGI 477
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVK----------- 220
F K + + + + + GGPIIM+Q+ENEYG+ + Y Q +D V+
Sbjct: 478 FEKAVAEQVAGMTIQN--GGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVALFQC 535
Query: 221 -WAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYT 279
WA++ + W M T A NI +P+S P + +E W GW+
Sbjct: 536 DWASNFTKNGLHDLVWTMNFGTGA--NIDQQFAP--LKKLRPDS---PLMCSEFWSGWFD 588
Query: 280 TWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITSYDY 334
WG RP D+ + +G SF + YM GGTN+G +G P + +TSYDY
Sbjct: 589 KWGANHETRPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDY 647
Query: 335 DAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAA 369
DAPI E G + W K L + + A V A
Sbjct: 648 DAPISESGQTTPKYWELRKALSKYMNGEKQAKVPA 682
Score = 40.8 bits (94), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 53/227 (23%), Positives = 91/227 (40%), Gaps = 52/227 (22%)
Query: 539 VTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYN--DLILLSQTVGLQNYGAFL 596
+T++ D +VF++G+ G + + +EF + L +L + +G N+G +
Sbjct: 741 LTVNDAHDYAQVFLDGKYIGKLDRR--NGEKQLEFPACPKGARLDILVEAMGRINFGRAI 798
Query: 597 EKDGAGFRGQVKLTGFKNG--------DIDLSKILWTYQVGLKGEFQQIYSIEENEAEWT 648
KD G V+LT +G D ++ + TY +FQ I S+++ +
Sbjct: 799 -KDFKGITQSVELTVDIDGRPFTCNLKDWEVYNLEDTYDFYKNMKFQPIGSLKDELGQ-- 855
Query: 649 DLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQ 708
IP Y+ F D L+ + GKG +VNGH +GR W +
Sbjct: 856 -----RIPGC---YRATFKVNKPSD-TFLNFETWGKGLVYVNGHAMGRIWEI-------- 898
Query: 709 DTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P QT Y +P WL+ N +++F+ G
Sbjct: 899 -------------------GPQQTLY-IPGCWLKKGENEVIVFDIIG 925
>gi|317479674|ref|ZP_07938798.1| glycosyl hydrolase family 35 [Bacteroides sp. 4_1_36]
gi|316904175|gb|EFV26005.1| glycosyl hydrolase family 35 [Bacteroides sp. 4_1_36]
Length = 1106
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 113/335 (33%), Positives = 164/335 (48%), Gaps = 29/335 (8%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++ +A +HYPR W I K G + I YVFWN+HES G ++F G
Sbjct: 358 FLLNGKPFVIKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHESQPGVFDFTG 417
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+ND+ +F +L + +Y+ LR GPYVCAEW GG P WL I R ++ F E +
Sbjct: 418 QNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFMERVGI 477
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVK----------- 220
F K + + + + + GGPIIM+Q+ENEYG+ + Y Q +D V+
Sbjct: 478 FEKAVAEQVAGMTIQN--GGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVALFQC 535
Query: 221 -WAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYT 279
WA++ + W M T A NI +P+S P + +E W GW+
Sbjct: 536 DWASNFTKNGLHDLVWTMNFGTGA--NIDQQFAP--LKKLRPDS---PLMCSEFWSGWFD 588
Query: 280 TWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITSYDY 334
WG RP D+ + +G SF + YM GGTN+G +G P + +TSYDY
Sbjct: 589 KWGANHETRPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDY 647
Query: 335 DAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAA 369
DAPI E G + W K L + + A V A
Sbjct: 648 DAPISESGQTTPKYWELRKALSKYMNGEKQAKVPA 682
Score = 40.8 bits (94), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 53/227 (23%), Positives = 91/227 (40%), Gaps = 52/227 (22%)
Query: 539 VTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYND--LILLSQTVGLQNYGAFL 596
+T++ D +VF++G+ G + + +EF + L +L + +G N+G +
Sbjct: 741 LTVNDAHDYAQVFLDGKYIGKLDRR--NGEKQLEFPACPKGARLDILVEAMGRINFGRAI 798
Query: 597 EKDGAGFRGQVKLTGFKNG--------DIDLSKILWTYQVGLKGEFQQIYSIEENEAEWT 648
KD G V+LT +G D ++ + TY +FQ I S+++ +
Sbjct: 799 -KDFKGITQSVELTVDIDGRPFTCNLKDWEVYNLEDTYDFYKNMKFQPIGSLKDELGQ-- 855
Query: 649 DLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQ 708
IP Y+ F D L+ + GKG +VNGH +GR W +
Sbjct: 856 -----RIPGC---YRATFKVNKPSD-TFLNFETWGKGLVYVNGHAMGRIWEI-------- 898
Query: 709 DTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P QT Y +P WL+ N +++F+ G
Sbjct: 899 -------------------GPQQTLY-IPGCWLKKGENEVIVFDIIG 925
>gi|300861196|ref|ZP_07107283.1| putative beta-galactosidase [Enterococcus faecalis TUSoD Ef11]
gi|428767294|ref|YP_007153405.1| beta-galactosidase [Enterococcus faecalis str. Symbioflor 1]
gi|300850235|gb|EFK77985.1| putative beta-galactosidase [Enterococcus faecalis TUSoD Ef11]
gi|427185467|emb|CCO72691.1| beta-galactosidase [Enterococcus faecalis str. Symbioflor 1]
Length = 594
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 129 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 181
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 182 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 238
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 239 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 296
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 297 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALSQAEP-LVKDSFAQ 346
>gi|139439964|ref|ZP_01773301.1| Hypothetical protein COLAER_02339 [Collinsella aerofaciens ATCC
25986]
gi|133774730|gb|EBA38550.1| glycosyl hydrolase family 35 [Collinsella aerofaciens ATCC 25986]
Length = 598
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 106/328 (32%), Positives = 160/328 (48%), Gaps = 33/328 (10%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
++D ++S IHY R P W + K G + +ETYV WN HE G ++F G
Sbjct: 10 FLLDDEPFTILSGAIHYMRVHPSDWHHSLYNLKALGFNTVETYVPWNLHEPKPGVFDFSG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ F+ S GLY +R P++CAEW FGG P WL + R+++ F + +
Sbjct: 70 SIDLAAFLDEAASLGLYAIVRPSPFICAEWEFGGMPAWLLREHDMRPRSSDPKFLAHVAQ 129
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ ++ ++ + +GG IIM+Q+ENEYG SY + KDY++ + + G V
Sbjct: 130 YYDHLMPILVSRQID--KGGNIIMMQVENEYG----SYCED-KDYLRAIRRLMVERGVSV 182
Query: 234 -------PWVMCKQTDAPENIIDACNGYYCDGYKPN-----SYNK------PTLWTENWD 275
PW C + + C G + K N +++K P + E WD
Sbjct: 183 PLCTSDGPWRGCLRAGTLIDDDVLCTGNFGSHAKENFEALSAFHKEHGKQWPLMCMELWD 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSGGPFY 328
GW+ +G + R EDLA V + GGS +N YM+ GGTNFG R +
Sbjct: 243 GWFNRYGENVIRRDPEDLASCVREVLELGGS-LNLYMFHGGTNFGFMNGCSARHTHDLHQ 301
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLH 356
+TSYDYDAP+DE G +E + + +H
Sbjct: 302 VTSYDYDAPLDEQGNPTEKYFAIQRTVH 329
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 67/248 (27%), Positives = 104/248 (41%), Gaps = 63/248 (25%)
Query: 531 KTNEVRPTVTIDSMRDVLRVFINGQLTGSV----IGHWVKVVQPVEFQSGYNDLILLSQT 586
+ +E R V ID+ RD ++F+NG + IG + V P E +N L +L++
Sbjct: 398 RADEERIRV-IDA-RDRAQMFVNGDKVATQYQEHIGEDIHCVLPCE----HNRLDVLTED 451
Query: 587 VGLQNYGAFLEKDGAGFRGQVKLTGFKNGD-IDLSKILWTYQVGLKGEFQQIYSIE--EN 643
+G NYG L D + G + G +DL + G + + +I+ +
Sbjct: 452 MGRVNYGHKLLAD-------TQHKGIRTGVCVDLH-----FVTGWEMRCLPLDNIDNLDY 499
Query: 644 EAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAP 703
A W + G PS +Y+ FD + D +D GKG A+VNG ++GR+W
Sbjct: 500 SAGWVE----GQPS---FYRAKFDISEPAD-TFIDTTGFGKGVAFVNGTNVGRFW----- 546
Query: 704 KGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
DK P T Y VP L N LV+FE G ++ +
Sbjct: 547 ----------------DK------GPIMTLY-VPHGLLHPGTNELVMFETEG--VYDAKI 581
Query: 764 KLRSTRIV 771
LRS ++
Sbjct: 582 SLRSEPVI 589
>gi|270295887|ref|ZP_06202087.1| beta-galactosidase [Bacteroides sp. D20]
gi|270273291|gb|EFA19153.1| beta-galactosidase [Bacteroides sp. D20]
Length = 1106
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 113/335 (33%), Positives = 164/335 (48%), Gaps = 29/335 (8%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++ +A +HYPR W I K G + I YVFWN+HES G ++F G
Sbjct: 358 FLLNGKPFVIKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHESQPGVFDFTG 417
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+ND+ +F +L + +Y+ LR GPYVCAEW GG P WL I R ++ F E +
Sbjct: 418 QNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFMERVGI 477
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVK----------- 220
F K + + + + + GGPIIM+Q+ENEYG+ + Y Q +D V+
Sbjct: 478 FEKAVAEQVAGMTIQN--GGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVALFQC 535
Query: 221 -WAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYT 279
WA++ + W M T A NI +P+S P + +E W GW+
Sbjct: 536 DWASNFTKNGLHDLVWTMNFGTGA--NIDQQFAP--LKKLRPDS---PLMCSEFWSGWFD 588
Query: 280 TWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITSYDY 334
WG RP D+ + +G SF + YM GGTN+G +G P + +TSYDY
Sbjct: 589 KWGANHETRPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDY 647
Query: 335 DAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAA 369
DAPI E G + W K L + + A V A
Sbjct: 648 DAPISESGQTTPKYWELRKALSKYMNGEKQAKVPA 682
>gi|423226297|ref|ZP_17212763.1| hypothetical protein HMPREF1062_04949 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392629725|gb|EIY23731.1| hypothetical protein HMPREF1062_04949 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 1106
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 109/324 (33%), Positives = 163/324 (50%), Gaps = 31/324 (9%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++ +A +HYPR W I K G + + YVFWN+HE G Y+F
Sbjct: 357 FLLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGVYDFTE 416
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+ND+ +F +L + +Y+ LR GPYVCAEW GG P WL + R ++ F E +
Sbjct: 417 QNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDVRLRESDPYFIERVAL 476
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVK----------- 220
F + + +++ + + GGPIIM+Q+ENEYG+ + Y Q +D V+
Sbjct: 477 FEEAVAKQVKDLTIAN--GGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANFGNGIALFQ 534
Query: 221 --WAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWY 278
WA++ L + W M T A + A +PNS P + +E W GW+
Sbjct: 535 CDWASNFTLNGLDDLIWTMNFGTGANVDQQFA----KLKQLRPNS---PLMCSEFWSGWF 587
Query: 279 TTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITSYD 333
WG RP D+ + RG SF + YM GGTN+G +G P + +TSYD
Sbjct: 588 DKWGANHETRPAADMIKGIDDMLSRGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYD 646
Query: 334 YDAPIDEYGLLSEPKWGHLKDLHA 357
YDAPI E G + PK+ L++ A
Sbjct: 647 YDAPISESGQTT-PKYWALREAMA 669
Score = 45.8 bits (107), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 56/228 (24%), Positives = 93/228 (40%), Gaps = 52/228 (22%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGH--WVKVVQPVEFQSGYNDLILLSQTVGLQNYGAF 595
T+T++ D +VF++G+ G + ++V P + D+++ + +G N+G
Sbjct: 740 TLTVNDAHDYAQVFVDGKYIGKLDRRNGEKQLVLPACVKGSRLDILV--EAMGRINFGRA 797
Query: 596 LEKDGAGFRGQVKLTGFKNG--------DIDLSKILWTYQVGLKGEFQQIYSIEENEAEW 647
+ KD G V+L+ NG + ++ I TY+ +FQ I S+
Sbjct: 798 I-KDFKGITKNVELSMDINGYPFVCDLKNWEVFNIEDTYEFYQGMKFQPIESL------- 849
Query: 648 TDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGC 707
TD IP Y+ F D L+ + GKG +VNG+ +GR W +
Sbjct: 850 TDRLGQRIPGV---YRAKFQVKKPSDTF-LNFETWGKGLVYVNGYALGRIWEI------- 898
Query: 708 QDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P QT Y VP WL+ N +V+F+ G
Sbjct: 899 --------------------GPQQTLY-VPGCWLKKGENEIVVFDIVG 925
>gi|422708708|ref|ZP_16766236.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
gi|315036693|gb|EFT48625.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
Length = 604
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 139 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 191
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 192 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 248
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 249 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 306
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 307 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALSQAEP-LVKDSFAQ 356
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 52/223 (23%), Positives = 88/223 (39%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 417 RDRLQLFVNQVHQATQYQTEIGEDIYVTLPQE----NNQIDVLMENMGRVNYGHKLFAD- 471
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + +Q Y + E D +R+ P +
Sbjct: 472 ------TQKKGIRTGVMADLHFMTQWQQ---------YCLPMTSCEQVDYSREWQPDQPS 516
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + + +D+ GKG +VN ++GR+W V
Sbjct: 517 FYQYHMELAE-VKDTFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 555
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 556 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 590
>gi|256959208|ref|ZP_05563379.1| beta-galactosidase [Enterococcus faecalis DS5]
gi|256949704|gb|EEU66336.1| beta-galactosidase [Enterococcus faecalis DS5]
Length = 594
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 129 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 181
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 182 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 238
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 239 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 296
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 297 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALSQAEP-LVKDSFAQ 346
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 54/223 (24%), Positives = 90/223 (40%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 407 RDRLQLFVNQVHQATQYQTEIGEDIYVTLPQE----NNQIDVLMENMGRVNYGHKLFAD- 461
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + ++QQ Y + E D +R+ P +
Sbjct: 462 ------TQKKGIRTGVMA--------DLHFMTQWQQ-YCLPMTSCEQVDYSREWQPDQPS 506
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 507 FYQYHMELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 545
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 546 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 580
>gi|12852936|dbj|BAB29584.1| unnamed protein product [Mus musculus]
Length = 586
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 105/319 (32%), Positives = 162/319 (50%), Gaps = 37/319 (11%)
Query: 62 MLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFV 121
M++ IHY R E W D + K + G + + TY+ WN HE RG+++F D+ +V
Sbjct: 1 MIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEILDLEAYV 60
Query: 122 KLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF----VKK 177
L + GL++ LR GPY+CAE + GG P WL P + RT N F E + ++ + K
Sbjct: 61 LLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDKYFDHLIPK 120
Query: 178 IVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVM 237
I+ L GGP+I +Q+ENEYG+ Q+ ++Y+ + L G+ ++
Sbjct: 121 ILPLQYR------HGGPVIAVQVENEYGSF-----QKDRNYMNYLKKAL--LKRGIVELL 167
Query: 238 CKQTDAPENIIDACNGYYC----DGYKPNSY--------NKPTLWTENWDGWYTTWGGRL 285
D I + NG + + +S+ +KP + E W GWY +WG +
Sbjct: 168 LTSDDKDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYDSWGSKH 227
Query: 286 PHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------YITSYDYDAPID 339
+ E++ V +F G SF N YM+ GGTNFG +GG + +TSYDYDA +
Sbjct: 228 IEKSAEEIRHTVYKFISYGLSF-NMYMFHGGTNFGFINGGRYENHHISVVTSYDYDAVLS 286
Query: 340 EYGLLSEPKWGHLKDLHAA 358
E G +E K+ L+ L A+
Sbjct: 287 EAGDYTE-KYFKLRKLFAS 304
>gi|320106923|ref|YP_004182513.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
gi|319925444|gb|ADV82519.1| glycoside hydrolase family 35 [Terriglobus saanensis SP1PR4]
Length = 633
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 118/381 (30%), Positives = 177/381 (46%), Gaps = 45/381 (11%)
Query: 17 SVYPMMMMMMMIHLSCVSSS-SASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATP 75
+VY ++ M +S ++ A + F V+ DH ++G L+S +HY R
Sbjct: 13 AVYAAALLFMACTISAQTAKMPAGSVTHTFRVAGDH--FELNGEPVQLLSGEMHYARIPR 70
Query: 76 EMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRI 135
E W + +K G + + TY+FWN HE G Y+F G +D+ FVK+ GL + LR
Sbjct: 71 EYWRARLQMAKAMGLNTVATYIFWNVHEPKPGVYDFSGNHDVAAFVKMAQEEGLNVILRA 130
Query: 136 GPYVCAEWNFGGFPVWLRDIP--GIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGG 193
GPY CAEW FGG+P WL P G R+N+ + ++R++K++ M L GG
Sbjct: 131 GPYACAEWEFGGYPSWLMKDPKMGSALRSNDEVYMAPVERWIKRLGQEMVP--LLISNGG 188
Query: 194 PIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNG 253
PI+ +Q+ENEYG+ G D A + + AG D + +++
Sbjct: 189 PIVAVQVENEYGDF-------GGDKKYLAHMLEIFQNAGFKDSFLYTVDPSKALVNGSLE 241
Query: 254 YYCDGYKPNSYN--------------KPTLWTENWDGWYTTWGGRLPHRPV----EDLAF 295
G N +P +E W GW+ WG RP+ +D+A+
Sbjct: 242 GLPSGVNFGVGNAERGLTALAHLRPGQPLFASEYWPGWFDHWGHPHETRPIPPQLKDIAY 301
Query: 296 AVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY-------ITSYDYDAPIDEYGLLSEPK 348
+ S +N YM+ GGT+FG SG + +TSYDYDAP+DE G + PK
Sbjct: 302 TLDH-----KSSINIYMFHGGTSFGFMSGASWTGGEYLPDVTSYDYDAPLDEAGHPT-PK 355
Query: 349 WGHLKDLHAAIKLCEPALVAA 369
+ +DL A LV A
Sbjct: 356 FYAYRDLMAKYVKTPLPLVPA 376
Score = 40.8 bits (94), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 67/305 (21%), Positives = 116/305 (38%), Gaps = 70/305 (22%)
Query: 449 KTVEFSLPLSPNISVPQQSMI-ESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVT 507
K V+ LPL P +VP+ + E + S W + P+ V SE T +E ++ +
Sbjct: 365 KYVKTPLPLVP--AVPEVIAVPEFTVGRASSLWDHL--PVPVKSEKPLT----MEAMDQS 416
Query: 508 KDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKV 567
Y+ Y +++ V+ + +D++ D V++NG+L GS I +K
Sbjct: 417 YGYALYRKQLSE--------------PVKGELVLDAVHDYALVYLNGKLIGS-IDRRLKQ 461
Query: 568 VQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQ 627
+ L +L + G N + + G V L G + L +++
Sbjct: 462 DRITLATDKPARLDILVENSGRINSTKMMRGETKGITRGVTLAG---------RPLTSWE 512
Query: 628 VGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQA 687
+ +A T G +F +K + LD+ ++GKG
Sbjct: 513 ----DYSLPMLDAGTMKASSTKRQVSGPHFSFGSFKV-----AKVGDTFLDVRALGKGAL 563
Query: 688 WVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNL 747
W+NGH +GR+W V G Q+T VP WL+ N
Sbjct: 564 WINGHAMGRFWNV-----GPQETL-----------------------FVPGPWLKRGRND 595
Query: 748 LVIFE 752
+V+F+
Sbjct: 596 VVVFD 600
>gi|325569852|ref|ZP_08145846.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
gi|325156975|gb|EGC69143.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
Length = 585
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 108/307 (35%), Positives = 153/307 (49%), Gaps = 28/307 (9%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+D +IS IHY R PE W D + K + G + +ETYV WN HE+ G Y F+G
Sbjct: 12 LDKKPFKVISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFEGIL 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ +F++ GLY+ LR PY+CAEW FGG P WL P ++ R + PF E++ R+
Sbjct: 72 DLRRFIQTAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYF 131
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
+ +R+ L QGGPI+M+Q+ENEYG SY K+Y++ + G P
Sbjct: 132 AHLFPQVRD--LQITQGGPILMMQVENEYG----SYAND-KEYLRKMVAAMRQQGVETPL 184
Query: 236 VMCKQT--DAPEN--IIDA------CNGYYCDGYKP----NSYNKPTLWTENWDGWYTTW 281
V D EN I D C + ++ + +P + E W GW+ W
Sbjct: 185 VTSDGPWHDMLENGTIKDLALPTINCGSNIKENFEKLRRFHGEKRPLMVMEFWIGWFDAW 244
Query: 282 GGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYD 335
G H A + GS +N YM+ GGTNFG +G +Y +TSYDYD
Sbjct: 245 GDDHHHTTSTADAVKELQDCLAEGS-VNIYMFHGGTNFGFMNGSNYYERLAPDVTSYDYD 303
Query: 336 APIDEYG 342
A + E+G
Sbjct: 304 ALLTEWG 310
>gi|160887166|ref|ZP_02068169.1| hypothetical protein BACOVA_05182 [Bacteroides ovatus ATCC 8483]
gi|156107577|gb|EDO09322.1| glycosyl hydrolase family 35 [Bacteroides ovatus ATCC 8483]
Length = 777
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 113/343 (32%), Positives = 173/343 (50%), Gaps = 34/343 (9%)
Query: 19 YPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMW 78
+ +++++ I +S + SS +S V + I+G LI +HYPR E W
Sbjct: 7 HKTVLVILNIIVSFLISSCSSP---KEQVRIGNGTFTIEGKDIQLICGEMHYPRIPHEYW 63
Query: 79 PDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPY 138
D + ++ G + + YVFWN HE G+++F G+ DI +F++ GLY+ LR GPY
Sbjct: 64 RDRLKRASAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGPY 123
Query: 139 VCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIML 198
VCAEW+FGG+P WL + +R+ + F +R++K++ + L GG IIM+
Sbjct: 124 VCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCERYIKELGKQLSP--LTINNGGNIIMV 181
Query: 199 QIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCK---QTDA--PENIIDACNG 253
Q+ENEYG+ + K+Y+ M G VP C Q +A E + NG
Sbjct: 182 QVENEYGSYAAD-----KEYLAAIRDMIKEAGFNVPLFTCDGGGQVEAGHVEGALPTLNG 236
Query: 254 YYC-DGYK-PNSYNK--PTLWTENWDGWYTTWGGRLP----HRPVEDLAFAVARFFQRGG 305
+ D +K + Y K P E + W+ WG R RP E L + ++ G
Sbjct: 237 VFGEDIFKVVDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLSH-----G 291
Query: 306 SFMNYYMYFGGTNF----GRTSGGPF--YITSYDYDAPIDEYG 342
++ YM+ GGTNF G +GG + TSYDYDAP+ E+G
Sbjct: 292 VSVSMYMFHGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWG 334
Score = 43.9 bits (102), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 50/222 (22%), Positives = 83/222 (37%), Gaps = 56/222 (25%)
Query: 536 RPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAF 595
+ + I +RD + I+G+ S+ + + + L +L + G NYG
Sbjct: 417 KQKLVIQDLRDYAVILIDGKQVASLDRRYNQNSVTLNVSKTPATLEILVENTGRVNYGPD 476
Query: 596 LEKDGAGFRGQV-----KLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDL 650
+ + G QV KL G+ + L K +++ +E E
Sbjct: 477 ILFNRKGITSQVLWGNEKLAGWSITPLPLYK-------------EKVSEMEFGE------ 517
Query: 651 TRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDT 710
T G+P+ ++K F D +D+ GKG WVNG +GR+W +
Sbjct: 518 TIKGVPA---FHKGTFTVEKKGD-CFVDMSQWGKGAVWVNGKSLGRFWNI---------- 563
Query: 711 CDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
P QT Y +P WL+ N +V+FE
Sbjct: 564 -----------------GPQQTLY-LPAPWLKEGENEIVVFE 587
>gi|423295092|ref|ZP_17273219.1| hypothetical protein HMPREF1070_01884 [Bacteroides ovatus
CL03T12C18]
gi|392673998|gb|EIY67449.1| hypothetical protein HMPREF1070_01884 [Bacteroides ovatus
CL03T12C18]
Length = 775
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 113/343 (32%), Positives = 173/343 (50%), Gaps = 34/343 (9%)
Query: 19 YPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMW 78
+ +++++ I +S + SS +S V + I+G LI +HYPR E W
Sbjct: 5 HKTVLVILNIIVSFLISSCSSP---KEQVRIGNGTFTIEGKDIQLICGEMHYPRIPHEYW 61
Query: 79 PDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPY 138
D + ++ G + + YVFWN HE G+++F G+ DI +F++ GLY+ LR GPY
Sbjct: 62 RDRLKRASAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGPY 121
Query: 139 VCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIML 198
VCAEW+FGG+P WL + +R+ + F +R++K++ + L GG IIM+
Sbjct: 122 VCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCERYIKELGKQLSP--LTINNGGNIIMV 179
Query: 199 QIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCK---QTDA--PENIIDACNG 253
Q+ENEYG+ + K+Y+ M G VP C Q +A E + NG
Sbjct: 180 QVENEYGSYAAD-----KEYLAAIRDMIKEAGFNVPLFTCDGGGQVEAGHVEGALPTLNG 234
Query: 254 YYC-DGYK-PNSYNK--PTLWTENWDGWYTTWGGRLP----HRPVEDLAFAVARFFQRGG 305
+ D +K + Y K P E + W+ WG R RP E L + ++ G
Sbjct: 235 VFGEDIFKVVDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLSH-----G 289
Query: 306 SFMNYYMYFGGTNF----GRTSGGPF--YITSYDYDAPIDEYG 342
++ YM+ GGTNF G +GG + TSYDYDAP+ E+G
Sbjct: 290 VSVSMYMFHGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWG 332
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 50/222 (22%), Positives = 83/222 (37%), Gaps = 56/222 (25%)
Query: 536 RPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAF 595
+ + I +RD + I+G+ S+ + + + L +L + G NYG
Sbjct: 415 KQKLVIQDLRDYAVILIDGKQVASLDRRYNQNSVTLNVSKTPATLEILVENTGRVNYGPD 474
Query: 596 LEKDGAGFRGQV-----KLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDL 650
+ + G QV KL G+ + L K +++ +E E
Sbjct: 475 ILFNRKGITSQVLWGNEKLAGWSITPLPLYK-------------EKVSEMEFGE------ 515
Query: 651 TRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDT 710
T G+P+ ++K F D +D+ GKG WVNG +GR+W +
Sbjct: 516 TIKGVPA---FHKGTFTVEKKGD-CFVDMSQWGKGAVWVNGKSLGRFWNI---------- 561
Query: 711 CDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
P QT Y +P WL+ N +V+FE
Sbjct: 562 -----------------GPQQTLY-LPAPWLKEGENEIVVFE 585
>gi|424665378|ref|ZP_18102414.1| hypothetical protein HMPREF1205_01253 [Bacteroides fragilis HMW
616]
gi|404574622|gb|EKA79370.1| hypothetical protein HMPREF1205_01253 [Bacteroides fragilis HMW
616]
Length = 624
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 111/334 (33%), Positives = 162/334 (48%), Gaps = 53/334 (15%)
Query: 58 GNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDI 117
G ++S +HY R + W + K G + + TYVFWN HE G+++F G ++
Sbjct: 35 GETTPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94
Query: 118 VKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKK 177
+++++ G G+ + LR GPYVCAEW FGG+P WL++IPG+E R +N E ++ KK
Sbjct: 95 AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNT----EFLKYTKK 150
Query: 178 IVDLMREEM--LFSWQGGPIIMLQIENEYGNMESSYGQQGKD--------YVKWAASMAL 227
+D + +E+ L +GGPIIM+Q ENE+G SY Q KD Y
Sbjct: 151 YIDRLYQEVGPLQCTKGGPIIMVQCENEFG----SYVSQRKDISFEEHRSYNAKIKGQLA 206
Query: 228 GLGAGVP-------WVM---CKQTDAP--------ENIIDACNGYYCDGYKPNSYNKPTL 269
G VP W+ C P N+ N Y+ P +
Sbjct: 207 DAGFTVPLFTSDGSWLFEGGCVAGALPTANGESDIANLKKVVNQYHGG-------KGPYM 259
Query: 270 WTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-- 327
E + GW + WG P ++A + Q SF N+YM GGTNFG TSG +
Sbjct: 260 VAEFYPGWLSHWGEPFPQVSASEIARQTEAYLQNNVSF-NFYMVHGGTNFGFTSGANYDK 318
Query: 328 ------YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+TSYDYDAPI E G ++ PK+ ++ +
Sbjct: 319 KRDIQPDLTSYDYDAPISEAGWIT-PKYDSIRSV 351
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 34/77 (44%), Gaps = 28/77 (36%)
Query: 677 LDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHV 736
+D+ + GKG ++NG HIGRYW V P QT Y +
Sbjct: 555 IDMRAWGKGVIFINGKHIGRYWKV---------------------------GPQQTLY-I 586
Query: 737 PRSWLQASNNLLVIFEE 753
P WL+ N +VIFE+
Sbjct: 587 PGVWLRKGENKIVIFEQ 603
>gi|423217397|ref|ZP_17203893.1| hypothetical protein HMPREF1061_00666 [Bacteroides caccae
CL03T12C61]
gi|392628556|gb|EIY22582.1| hypothetical protein HMPREF1061_00666 [Bacteroides caccae
CL03T12C61]
Length = 775
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 111/348 (31%), Positives = 167/348 (47%), Gaps = 44/348 (12%)
Query: 17 SVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPE 76
+V+ M+ +++ +S SS V ++ I+G LI +HYPR E
Sbjct: 7 NVFIMLNLIVSFFISACSSPRE-------QVKIENGTFNINGKDVQLICGEMHYPRIPHE 59
Query: 77 MWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIG 136
W D + ++ G + + YVFWN HE G ++F G+ DI +FV++ GLY+ LR G
Sbjct: 60 YWRDRLHRAHAMGLNTVSAYVFWNFHERQPGVFDFSGQADIAEFVRIAQEEGLYVILRPG 119
Query: 137 PYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPII 196
PYVCAEW+FGG+P WL + +R+ + F +R++K++ + + + GG II
Sbjct: 120 PYVCAEWDFGGYPSWLLKEKDLTYRSKDPRFMSYCERYIKELGKQLAPLTINN--GGNII 177
Query: 197 MLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCK---QTDAPE--NIIDAC 251
M+Q+ENEYG+ + K+Y+ M G VP C Q +A +
Sbjct: 178 MVQVENEYGSYAAD-----KEYLAAIRDMLQEAGFNVPLFTCDGGGQVEAGHIAGALPTL 232
Query: 252 NGYY-------CDGYKPNSYNKPTLWTENWDGWYTTWGGRLP----HRPVEDLAFAVARF 300
NG + D Y P P E + W+ WG R RP E L + +
Sbjct: 233 NGVFGEDIFKIVDKYHPGG---PYFVAEFYPAWFDEWGKRHSSVAYERPAEQLDWMLGH- 288
Query: 301 FQRGGSFMNYYMYFGGTNF----GRTSGGPFY--ITSYDYDAPIDEYG 342
G ++ YM+ GGTNF G + G F TSYDYDAP+ E+G
Sbjct: 289 ----GVSVSMYMFHGGTNFWYMNGANTSGGFRPQPTSYDYDAPLGEWG 332
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 47/222 (21%), Positives = 84/222 (37%), Gaps = 56/222 (25%)
Query: 536 RPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAF 595
+ + I +RD + ++G+ S+ + + ++ L +L + G NYG
Sbjct: 415 KQKLIIQDLRDYAVILVDGKQVASLDRRYNQNSTTLDIHKVPATLEILVENTGRVNYGPD 474
Query: 596 LEKDGAGFRGQV-----KLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDL 650
+ + G QV KLTG+ + L K +++ S+ +
Sbjct: 475 ILFNRKGITSQVLWGNEKLTGWSITPLPLYK-------------EEVSSLSFGQE----- 516
Query: 651 TRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDT 710
G+P+ +++ F D +D+ GKG WVNG +GR+W +
Sbjct: 517 -IKGVPA---FHRGTFIIEQQGD-CFVDMSQWGKGAVWVNGKSLGRFWNI---------- 561
Query: 711 CDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
P QT Y +P WL+ N +V+FE
Sbjct: 562 -----------------GPQQTLY-IPAPWLKKGENEIVVFE 585
>gi|380512533|ref|ZP_09855940.1| beta-galactosidase [Xanthomonas sacchari NCPPB 4393]
Length = 616
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 117/364 (32%), Positives = 169/364 (46%), Gaps = 48/364 (13%)
Query: 33 VSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADV 92
V ++ + T + F DH I DG +IS IH+ R W D + K++ G +
Sbjct: 22 VRAADSGTAWPAFATQGDH--FIRDGKPYQVISGAIHFQRIPRAYWKDRLQKARAMGLNT 79
Query: 93 IETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWL 152
+ETYVFWN E GQ++F G NDI FV + GL + LR GPYVCAEW GG+P WL
Sbjct: 80 VETYVFWNLVEPRPGQFDFSGNNDIAAFVDEAAAQGLNVILRPGPYVCAEWEAGGYPAWL 139
Query: 153 RDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYG 212
PG+ R+ + F Q ++ + ++ + + GGPI+ +Q+ENEYG SYG
Sbjct: 140 FAEPGMRVRSQDPRFLAASQAYLDALAAQVKPRL--NGNGGPIVAVQVENEYG----SYG 193
Query: 213 QQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCD---------GYKPNS 263
D+ + A+ + AG + D P+ + NG D G N+
Sbjct: 194 D---DHAYMRLNRAMFVQAGFDKALLFTADGPDVL---ANGTLPDTLAVVNFAPGDAKNA 247
Query: 264 YN--------KPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARF--FQRGGSFMNYYMY 313
+ +P + E W GW+ WG + D + F R G N YM+
Sbjct: 248 FETLAKFRPGQPQMVGEYWAGWFDQWGEK---HAATDATKQASEFEWILRQGHSANIYMF 304
Query: 314 FGGTNFGRTSGGPF----------YITSYDYDAPIDEYGLLSEPKWGHLKD-LHAAIKLC 362
GGT+FG +G F TSYDYDA +DE G + PK+ +D + +
Sbjct: 305 VGGTSFGFMNGANFQKNPSDHYAPQTTSYDYDAVLDEAGRPT-PKFTLFRDAIQRVTGIA 363
Query: 363 EPAL 366
PAL
Sbjct: 364 PPAL 367
>gi|422700666|ref|ZP_16758509.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
gi|315170851|gb|EFU14868.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
Length = 593
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 114/342 (33%), Positives = 165/342 (48%), Gaps = 46/342 (13%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G +IS IHY R TP W D + K GA+ +ETY+ WN HE G Y+F+G
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+I FV+L L + LR Y+CAEW FGG P WL G+ R+ + F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGN--MESSYGQQGKDYVKWAASMALGLGA 231
+ + ++ + + QGGP+IM+Q+ENEYG+ ME +Y QQ K ++ LG
Sbjct: 131 YFQVLLPKLAPMQIT--QGGPVIMMQVENEYGSYGMEKAYLQQTKQIME-------ELGI 181
Query: 232 GVPWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWT 271
VP + A E ++DA G + K N+ + K P +
Sbjct: 182 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 272 ENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSG 324
E WDGW+ WG + R DLA V G +N YM+ GGTNFG R +
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLAVGS--LNLYMFHGGTNFGFYNGCSARGAK 297
Query: 325 GPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
+TSYDYDA + E G +E + + AIK P +
Sbjct: 298 DLPQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEV 335
Score = 40.8 bits (94), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 54/214 (25%), Positives = 78/214 (36%), Gaps = 49/214 (22%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLIL--LSQTVGLQNYGAFLEK--DGA 601
D L ++++G L + V + Q+ + L L L + +G NYG L
Sbjct: 410 DRLHIYVDGDLAATQYQETVGEELLISGQTEKDTLALDILVENLGRVNYGFKLNNPTQSK 469
Query: 602 GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTW 661
G RG V DI + Y + E Q+ I D T P ++
Sbjct: 470 GIRGGVM------QDIHFHQGYQHYPLTFSQE--QLAKI--------DYTAGKNPLQPSF 513
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDK 721
Y+ F+ D +D GKG VNGHH+GRYW + G +S
Sbjct: 514 YQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------GPIHSLY 558
Query: 722 CTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
C P+ + Q N +VIFE G
Sbjct: 559 C--------------PKEFFQQGQNEVVIFETEG 578
>gi|445497922|ref|ZP_21464777.1| beta-galactosidase BgaC [Janthinobacterium sp. HH01]
gi|444787917|gb|ELX09465.1| beta-galactosidase BgaC [Janthinobacterium sp. HH01]
Length = 624
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 111/342 (32%), Positives = 169/342 (49%), Gaps = 40/342 (11%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
+P + + D +DG ++ S +HYPR W + + ++ G + + TY FW+ H
Sbjct: 29 RPPHFAIDGAHFKLDGQPFVIRSGEMHYPRIPRAAWRERLRMARAMGLNTVTTYAFWSQH 88
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E GQ++F G+ND+ F+K GL + LR GPYVCAE +FGGFP WL G+ R+
Sbjct: 89 EPEPGQWSFSGQNDLRTFIKTAAEEGLNVVLRPGPYVCAEVDFGGFPAWLMRTQGLRVRS 148
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWA 222
+A + R+ K++ + + L S +GGPI+MLQ+ENEYG SYG+ DY++
Sbjct: 149 MDARYLAASARYFKRLAQEVAD--LQSSRGGPILMLQLENEYG----SYGRD-HDYLRAV 201
Query: 223 ASMALGLGAGVPWVMCKQ-----------TDAPENIIDACNG--------YYCDGYKPNS 263
+ G P D P +++ G ++P+
Sbjct: 202 RTQMRQAGFDAPLFTSDGGAGRLFEGGTLADVPA-VVNFGGGADDAQASVQELAAWRPHG 260
Query: 264 YNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTS 323
P + E W GW+ WG + + E+ A V R +G SF N YM+ GGT+FG +
Sbjct: 261 ---PRMAGEYWAGWFDHWGEQHHTQSPEEAARTVERMLSQGVSF-NLYMFHGGTSFGWLA 316
Query: 324 GGPFY--------ITSYDYDAPIDEYGLLSEPKWGHLKDLHA 357
G + TSYDYDA +DE G + PK+ L+D+ A
Sbjct: 317 GANYSGSEPYQPDTTSYDYDAALDEAGRPT-PKYFALRDVIA 357
Score = 46.2 bits (108), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 54/219 (24%), Positives = 82/219 (37%), Gaps = 54/219 (24%)
Query: 539 VTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEK 598
+ +D + D V +G++ G + + V+ +G L LL + +G +GA L
Sbjct: 430 LVLDGLHDHATVLADGRVIGRLDRRLGESTLVVDLPAGVQ-LDLLVEAMGRIGFGAKLVD 488
Query: 599 DGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDL------TR 652
D G VKL GD +L WT +Y + + A L
Sbjct: 489 DTKGITRAVKL-----GDDELEG--WT-----------VYPLPLDAAALKRLPAGAAGEV 530
Query: 653 DGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCD 712
G +++ +D LD GKGQ WVNG H+GRYW +
Sbjct: 531 AGAAGAPGFWRGTLTLSKPVDTF-LDTRGWGKGQVWVNGRHLGRYWHI------------ 577
Query: 713 YRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIF 751
P QT Y +P SWL+ N +++F
Sbjct: 578 ---------------GPQQTLY-LPASWLKEGANEVLVF 600
>gi|257079244|ref|ZP_05573605.1| beta-galactosidase [Enterococcus faecalis JH1]
gi|294780244|ref|ZP_06745615.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
gi|397700110|ref|YP_006537898.1| beta-galactosidase [Enterococcus faecalis D32]
gi|256987274|gb|EEU74576.1| beta-galactosidase [Enterococcus faecalis JH1]
gi|294452672|gb|EFG21103.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
gi|397336749|gb|AFO44421.1| beta-galactosidase [Enterococcus faecalis D32]
Length = 594
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 129 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 181
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 182 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 238
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 239 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 296
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 297 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKESFAQ 346
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 54/223 (24%), Positives = 90/223 (40%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 407 RDRLQLFVNQVHQATQYQTEIGEDIYVTLPQE----NNQIDVLIENMGRVNYGHKLFAD- 461
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + ++QQ Y + E D +R+ P +
Sbjct: 462 ------TQKKGIRTGVMA--------DLHFMTQWQQ-YCLPMTSCEQVDYSREWQPDQPS 506
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 507 FYQYHVELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 545
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 546 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 580
>gi|299142590|ref|ZP_07035721.1| beta-galactosidase (Lactase) [Prevotella oris C735]
gi|298576025|gb|EFI47900.1| beta-galactosidase (Lactase) [Prevotella oris C735]
Length = 823
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 159/315 (50%), Gaps = 19/315 (6%)
Query: 53 AIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFK 112
+++G ++ +A +HYPR W I K G + + YVFWN HE G+++F
Sbjct: 75 TFLLNGQPFVVKAAELHYPRIPRPYWEQRIKMCKSLGMNTVCLYVFWNIHEQQEGKFDFT 134
Query: 113 GKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQ 172
G ND+ F +L +G+Y+ +R GPYVCAEW GG P WL I R ++ F ++
Sbjct: 135 GNNDVAAFCRLAQKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREDDPYFMARVK 194
Query: 173 RFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGN--MESSYGQQGKDYVKWAASMALGLG 230
F ++ + L GGPIIM+Q+ENEYG+ + Y Q +D VK + + L
Sbjct: 195 AFEAEVGRQLAP--LTIQNGGPIIMVQVENEYGSYGVNKKYVSQIRDIVKASGFDKVTLF 252
Query: 231 AGVPWVMCKQTDAPENIIDACN---GYYCDG----YKPNSYNKPTLWTENWDGWYTTWGG 283
W + + ++++ N G D K + P + +E W GW+ WG
Sbjct: 253 Q-CDWASNFENNGLDDLVWTMNFGTGSNIDAQFKRLKQLRPDAPLMCSEFWSGWFDKWGA 311
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPI 338
R RP + + + + SF + YM GGT+FG +G P + +TSYDYDAPI
Sbjct: 312 RHETRPAKAMVEGIDEMLSKNISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPI 370
Query: 339 DEYGLLSEPKWGHLK 353
+EYG + PK+ L+
Sbjct: 371 NEYGHAT-PKFWELR 384
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 51/225 (22%), Positives = 94/225 (41%), Gaps = 44/225 (19%)
Query: 539 VTIDSMRDVLRVFINGQLTGSV---IGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAF 595
++++ D ++FI+ +L G++ ++ PV+ + N LI + +G N+G
Sbjct: 461 LSLNDAHDYAQIFIDNKLIGTIDRTKNEKSIMLPPVKKGTKLNILI---EAMGRINFGRA 517
Query: 596 LEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGI 655
+ KD G V + NG +L+ L + + + Q + + + + R
Sbjct: 518 V-KDFKGITESVIIHTEMNGH-ELTYNLKNWVIAPIPDSYQ--NAQHAFDKLNETYRCFS 573
Query: 656 PSTFT-----WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDT 710
P F+ +Y+ YFD + L+L GKGQ +VNGH +GR+W +
Sbjct: 574 PINFSSQSIGYYRGYFDLKK-VGDTFLNLEQWGKGQVYVNGHALGRFWRI---------- 622
Query: 711 CDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P QT Y +P WL+ N +++ + G
Sbjct: 623 -----------------GPQQTLY-LPGCWLKKGRNEIIVMDIVG 649
>gi|255975619|ref|ZP_05426205.1| beta-galactosidase [Enterococcus faecalis T2]
gi|256619294|ref|ZP_05476140.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|256853354|ref|ZP_05558724.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
gi|421514060|ref|ZP_15960775.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
gi|255968491|gb|EET99113.1| beta-galactosidase [Enterococcus faecalis T2]
gi|256598821|gb|EEU17997.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|256711813|gb|EEU26851.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
gi|401672857|gb|EJS79300.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
Length = 594
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 129 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 181
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 182 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 238
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 239 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 296
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 297 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKDSFAQ 346
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 54/223 (24%), Positives = 90/223 (40%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 407 RDRLQLFVNQVHQATQYQTEIGEDIYVTLPQE----NNQIDVLIENMGRVNYGHKLFAD- 461
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + ++QQ Y + E D +R+ P +
Sbjct: 462 ------TQKKGIRTGVMA--------DLHFMTQWQQ-YCLPMTSCEQVDYSREWQPDQPS 506
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 507 FYQYHMELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 545
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 546 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYRPEIQL 580
>gi|323353539|ref|ZP_08088072.1| beta-galactosidase [Streptococcus sanguinis VMC66]
gi|322121485|gb|EFX93248.1| beta-galactosidase [Streptococcus sanguinis VMC66]
Length = 592
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 114/335 (34%), Positives = 159/335 (47%), Gaps = 38/335 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG ++S I Y R P+ W D + K G + +ETY+ W HE GQ+ +G
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D + KLV GLYL +R PY+CAE++FGG P WL P + R N+ F E++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHF- 130
Query: 176 KKIVDLMREEML--FSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
D + ++L S QGG I+M+Q+ENEYG SY + K Y++ A M G V
Sbjct: 131 ---YDWLFPKLLPYQSDQGGTILMMQVENEYG----SYAED-KAYMRSIAQMMKVRGVTV 182
Query: 234 P-------WVMCKQTDAPENIIDACNGYYCDGYKPNSYNK-----------PTLWTENWD 275
P W+ ++ G + K N+ N P + TE WD
Sbjct: 183 PLFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------- 328
GW++ W + R EDLA V + Q G MN ++ GGTNFG SG
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKKMLQLGS--MNLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCE 363
ITSYD+DAPI E+G +E + + H E
Sbjct: 301 ITSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELE 335
>gi|257084951|ref|ZP_05579312.1| beta-galactosidase [Enterococcus faecalis Fly1]
gi|256992981|gb|EEU80283.1| beta-galactosidase [Enterococcus faecalis Fly1]
Length = 594
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 129 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 181
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 182 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 238
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 239 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 296
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 297 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKESFAQ 346
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 54/223 (24%), Positives = 90/223 (40%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 407 RDRLQLFVNQIHQATQYQTEIGEDIYVTLPQE----NNQIDVLMENMGRVNYGHKLFAD- 461
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + ++QQ Y + E D +R+ P +
Sbjct: 462 ------TQKKGIRTGVMA--------DLHFMTQWQQ-YCLPMTSCEQVDYSREWQPDQPS 506
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 507 FYQYHVELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 545
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 546 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 580
>gi|255972505|ref|ZP_05423091.1| beta-galactosidase [Enterococcus faecalis T1]
gi|257422333|ref|ZP_05599323.1| glycosyl hydrolase [Enterococcus faecalis X98]
gi|255963523|gb|EET95999.1| beta-galactosidase [Enterococcus faecalis T1]
gi|257164157|gb|EEU94117.1| glycosyl hydrolase [Enterococcus faecalis X98]
Length = 594
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 129 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 181
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 182 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 238
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 239 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 296
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 297 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKDSFAQ 346
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 54/223 (24%), Positives = 90/223 (40%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 407 RDRLQLFVNQVHQATQYQTEIGEDIYVTLPQE----NNQIDVLMENMGRVNYGHKLFAD- 461
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + ++QQ Y + E D +R+ P +
Sbjct: 462 ------TQKKGIRTGVMA--------DLHFMTQWQQ-YCLPMTSCEQVDYSREWQPDQPS 506
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 507 FYQYHMELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 545
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 546 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 580
>gi|307275736|ref|ZP_07556876.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
gi|307277830|ref|ZP_07558914.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|307291757|ref|ZP_07571629.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
gi|422685752|ref|ZP_16743965.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
gi|422720681|ref|ZP_16777290.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|422739238|ref|ZP_16794421.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
gi|306497209|gb|EFM66754.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
gi|306505227|gb|EFM74413.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|306507612|gb|EFM76742.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
gi|315029464|gb|EFT41396.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
gi|315032072|gb|EFT44004.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|315144900|gb|EFT88916.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
Length = 604
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 139 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 191
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 192 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 248
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY---- 328
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 249 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 306
Query: 329 ---ITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 307 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKDSFAQ 356
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 88/223 (39%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 417 RDRLQLFVNQVHQATQYQTEIGEDIYVTLPQE----NNQIDVLIENMGRVNYGHKLFAD- 471
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + +Q Y + E D +R+ P +
Sbjct: 472 ------TQKKGIRTGVMADLHFMTQWQQ---------YCLPMTSCEQVDYSREWQPDQPS 516
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 517 FYQYHMELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 555
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 556 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYRPEIQL 590
>gi|156552637|ref|XP_001603160.1| PREDICTED: beta-galactosidase-like [Nasonia vitripennis]
Length = 629
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 115/359 (32%), Positives = 176/359 (49%), Gaps = 22/359 (6%)
Query: 17 SVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPE 76
+V + + ++ + SS + F + Y++ ++DG +S HY R +
Sbjct: 3 TVVGLFITYLLAFSNLAESSEHNIKNYSFAIDYENDQFLLDGKPFRYVSGSFHYFRTPRQ 62
Query: 77 MWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIG 136
W ++ K + GG + + TYV W+ HE Q+ + G DIV+F+K+ L++ LR G
Sbjct: 63 HWRGILRKMRAGGLNAVSTYVEWSMHEPEFDQWVWDGDADIVEFIKIAQEEDLFVILRPG 122
Query: 137 PYVCAEWNFGGFPVWLRD-IPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPI 195
PY+CAE +FGGFP WL +P I+ RT + + +RF+ +I L R + L GGPI
Sbjct: 123 PYICAERDFGGFPYWLLSRVPDIKLRTKDERYVFYAERFLNEI--LRRTKPLLRGNGGPI 180
Query: 196 IMLQIENEYGNMESSYGQ-QGKDY------VKWAASMALGLGAGVPWVMCKQTDAPENII 248
IM+Q+ENEYG+ + Q + K Y VK A + G+ + C I
Sbjct: 181 IMVQVENEYGSFYACDDQYKSKMYEIFHRHVKNDAVLFTTDGSARSMLKCGSIPGVYATI 240
Query: 249 DACNG----YYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRG 304
D NG + + S P + +E + GW T WG ++A +
Sbjct: 241 DFGNGANVPFNYKIMREFSPKGPLVNSEYYPGWLTHWGESFQRVNSHNVAKTLDEMLAYN 300
Query: 305 GSFMNYYMYFGGTNFGRTSGGPF------YITSYDYDAPIDEYGLLSEPKWGHLKDLHA 357
S +N YMY+GGTNF TSG +TSYDYDAP+ E G + PK+ L+D+ A
Sbjct: 301 VS-VNIYMYYGGTNFAFTSGANINEHYWPQLTSYDYDAPLTEAGDPT-PKYFELRDVIA 357
>gi|256762786|ref|ZP_05503366.1| beta-galactosidase [Enterococcus faecalis T3]
gi|256684037|gb|EEU23732.1| beta-galactosidase [Enterococcus faecalis T3]
Length = 594
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 129 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 181
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 182 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 238
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 239 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 296
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 297 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKESFAQ 346
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 54/223 (24%), Positives = 90/223 (40%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 407 RDRLQLFVNQVHQATQYQTEIGEDIYVTLPQE----NNQIDILMENMGRVNYGHKLFAD- 461
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + ++QQ Y + E D +R+ P +
Sbjct: 462 ------TQKKGIRTGVMA--------DLHFMTQWQQ-YCLPMTSCEQVDYSREWQPDQPS 506
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 507 FYQYHVELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 545
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 546 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 580
>gi|53715536|ref|YP_101528.1| beta-galactosidase [Bacteroides fragilis YCH46]
gi|60683489|ref|YP_213633.1| beta-galactosidase [Bacteroides fragilis NCTC 9343]
gi|375360299|ref|YP_005113071.1| putative beta-galactosidase [Bacteroides fragilis 638R]
gi|423280737|ref|ZP_17259649.1| hypothetical protein HMPREF1203_03866 [Bacteroides fragilis HMW
610]
gi|52218401|dbj|BAD50994.1| beta-galactosidase precursor [Bacteroides fragilis YCH46]
gi|60494923|emb|CAH09735.1| putative beta-galactosidase [Bacteroides fragilis NCTC 9343]
gi|301164980|emb|CBW24544.1| putative beta-galactosidase [Bacteroides fragilis 638R]
gi|404583944|gb|EKA88617.1| hypothetical protein HMPREF1203_03866 [Bacteroides fragilis HMW
610]
Length = 624
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 111/334 (33%), Positives = 162/334 (48%), Gaps = 53/334 (15%)
Query: 58 GNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDI 117
G ++S +HY R + W + K G + + TYVFWN HE G+++F G ++
Sbjct: 35 GETTPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94
Query: 118 VKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKK 177
+++++ G G+ + LR GPYVCAEW FGG+P WL++IPG+E R +N E ++ KK
Sbjct: 95 AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNT----EFLKYTKK 150
Query: 178 IVDLMREEM--LFSWQGGPIIMLQIENEYGNMESSYGQQGKD--------YVKWAASMAL 227
+D + +E+ L +GGPIIM+Q ENE+G SY Q KD Y
Sbjct: 151 YIDRLYQEVGPLQCTKGGPIIMVQCENEFG----SYVSQRKDISFEEHRSYNAKIKGQLA 206
Query: 228 GLGAGVP-------WVM---CKQTDAP--------ENIIDACNGYYCDGYKPNSYNKPTL 269
G VP W+ C P N+ N Y+ P +
Sbjct: 207 DAGFTVPLFTSDGSWLFEGGCVAGALPTANGESDIANLKKVVNQYHGG-------KGPYM 259
Query: 270 WTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-- 327
E + GW + WG P ++A + Q SF N+YM GGTNFG TSG +
Sbjct: 260 VAEFYPGWLSHWGEPFPQVSASEIARQTEAYLQNDVSF-NFYMVHGGTNFGFTSGANYDK 318
Query: 328 ------YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+TSYDYDAPI E G ++ PK+ ++ +
Sbjct: 319 KRDIQPDLTSYDYDAPISEAGWIT-PKYDSIRSV 351
Score = 41.6 bits (96), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 34/77 (44%), Gaps = 28/77 (36%)
Query: 677 LDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHV 736
+D+ + GKG ++NG HIGRYW V P QT Y +
Sbjct: 555 IDMRAWGKGVIFINGKHIGRYWKV---------------------------GPQQTLY-I 586
Query: 737 PRSWLQASNNLLVIFEE 753
P WL+ N +VIFE+
Sbjct: 587 PGVWLRKGENKIVIFEQ 603
>gi|306832839|ref|ZP_07465973.1| beta-galactosidase [Streptococcus bovis ATCC 700338]
gi|304424978|gb|EFM28110.1| beta-galactosidase [Streptococcus bovis ATCC 700338]
Length = 595
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 117/356 (32%), Positives = 172/356 (48%), Gaps = 48/356 (13%)
Query: 52 RAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNF 111
+ +DG ++S IHY R P+ W + K G + +ETYV WN HE G+++F
Sbjct: 8 ESFFLDGKPFKILSGSIHYFRIHPDDWYQSLYNLKALGFNTVETYVPWNLHEPREGEFDF 67
Query: 112 KGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEM 171
G D+ +F+ + GLY +R PY+CAEW FGG P WL + G+ R+ + F + +
Sbjct: 68 TGILDLERFLTIAQELGLYAIVRPSPYICAEWEFGGLPAWLLE-KGVRVRSQDKGFLQVV 126
Query: 172 QRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGA 231
+R+ + ++ + + L QGG I+M Q+ENEYG SYG+ K Y++ M L LG
Sbjct: 127 KRYYEVLIPRLIKHQLD--QGGNILMFQVENEYG----SYGED-KVYLRELKQMMLELGL 179
Query: 232 GVPWVMCKQTDAP-------ENIID---ACNGYYCDGYKPN---------SYNK--PTLW 270
P+ +D P ++I+ G + K N Y K P +
Sbjct: 180 EEPFFT---SDGPWHTALRAGSLIEDDVLVTGNFGSKAKENFASMEMFFQQYGKKWPLMC 236
Query: 271 TENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY-- 328
E WDGW+ WG + R E+LA AV + G +N YM+ GGTNFG +G
Sbjct: 237 MEFWDGWFNRWGEPVIKRDPEELADAVMEAIEIGS--INLYMFHGGTNFGFMNGCSARKQ 294
Query: 329 -----ITSYDYDAPIDEYG-------LLSEPKWGHLKDLHAAIKLCEPALVAADSA 372
+TSYDYDA +DE G +L +LH A L +P + D A
Sbjct: 295 TDLPQVTSYDYDAILDEAGNPTKKFYILQHRLKNKYPELHYATPLVKPTMAIKDIA 350
Score = 42.7 bits (99), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 56/222 (25%), Positives = 88/222 (39%), Gaps = 55/222 (24%)
Query: 539 VTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFL-- 596
V + RD +VF+NG + + V F S + L +L + +G NYG L
Sbjct: 402 VRLIDTRDRAQVFLNGNHIVTQYQEEIGDDIQVNFTSEESQLDILVENMGRVNYGHKLTA 461
Query: 597 --EKDGAGFRGQVKLTGFKNGDIDLSKILW-TYQVGLKGEFQQIYSIEENEAEWTDLTRD 653
+ G G RG + F N W TY + + YS + W R+
Sbjct: 462 PSQHKGIG-RGVMLDLHFVNQ--------WETYPLSMNSIKNLKYS-----SPW----RE 503
Query: 654 GIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDY 713
G+PS F +K + P+ +D+ GKG A++NG+++GR+W +
Sbjct: 504 GVPS-FYEFKFHCLNPED---TYMDMSGFGKGVAFINGYNLGRFWNI------------- 546
Query: 714 RGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
PT + Y +PR + N + IFE G
Sbjct: 547 --------------GPTLSLY-IPRGMMVCGENTITIFETEG 573
>gi|224536014|ref|ZP_03676553.1| hypothetical protein BACCELL_00878 [Bacteroides cellulosilyticus
DSM 14838]
gi|224522370|gb|EEF91475.1| hypothetical protein BACCELL_00878 [Bacteroides cellulosilyticus
DSM 14838]
Length = 1106
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 109/324 (33%), Positives = 162/324 (50%), Gaps = 31/324 (9%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++ +A +HYPR W I K G + + YVFWN+HE G Y+F
Sbjct: 357 FLLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGVYDFTE 416
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+ND+ +F +L + +Y+ LR GPYVCAEW GG P WL + R ++ F E +
Sbjct: 417 QNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDVRLRESDPYFIERVAL 476
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVK----------- 220
F + + ++ + + GGPIIM+Q+ENEYG+ + Y Q +D V+
Sbjct: 477 FEEAVAKQVKNLTIAN--GGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANFGNDIALFQ 534
Query: 221 --WAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWY 278
WA++ L + W M T A + A +PNS P + +E W GW+
Sbjct: 535 CDWASNFTLNGLDDLIWTMNFGTGANVDQQFA----KLKQLRPNS---PLMCSEFWSGWF 587
Query: 279 TTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITSYD 333
WG RP D+ + RG SF + YM GGTN+G +G P + +TSYD
Sbjct: 588 DKWGANHETRPAADMIKGIDDMLSRGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYD 646
Query: 334 YDAPIDEYGLLSEPKWGHLKDLHA 357
YDAPI E G + PK+ L++ A
Sbjct: 647 YDAPISESGQTT-PKYWALREAMA 669
Score = 45.8 bits (107), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 56/228 (24%), Positives = 93/228 (40%), Gaps = 52/228 (22%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGH--WVKVVQPVEFQSGYNDLILLSQTVGLQNYGAF 595
T+T++ D +VF++G+ G + ++V P + D+++ + +G N+G
Sbjct: 740 TLTVNDAHDYAQVFVDGKYIGKLDRRNGEKQLVLPACVKGSRLDILV--EAMGRINFGRA 797
Query: 596 LEKDGAGFRGQVKLTGFKNG--------DIDLSKILWTYQVGLKGEFQQIYSIEENEAEW 647
+ KD G V+L+ NG + ++ I TY+ +FQ I S+
Sbjct: 798 I-KDFKGITKNVELSMDINGYPFVCDLKNWEVFNIEDTYEFYQGMKFQPIESL------- 849
Query: 648 TDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGC 707
TD IP Y+ F D L+ + GKG +VNG+ +GR W +
Sbjct: 850 TDRLGQRIPGV---YRAKFQVKKPSDTF-LNFETWGKGLVYVNGYALGRIWEI------- 898
Query: 708 QDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P QT Y VP WL+ N +V+F+ G
Sbjct: 899 --------------------GPQQTLY-VPGCWLKKGENEIVVFDIVG 925
>gi|255652865|ref|NP_001157373.1| beta-galactosidase [Bombyx mori]
gi|239938036|gb|ACS36117.1| beta-galactosidase [Bombyx mori]
Length = 606
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 109/327 (33%), Positives = 162/327 (49%), Gaps = 39/327 (11%)
Query: 42 FKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNA 101
K N+S +IDG +IS +HY R W D + K K G + + TYV W+
Sbjct: 1 MKGHNISIVGDKFMIDGKPLHIISGSLHYFRVPAVYWRDRLHKFKAAGLNTVATYVEWSY 60
Query: 102 HESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLR-DIPGIEF 160
HE QYNF+G D+V+FV+ GL++ LR+GPY+CAE + GG P WL P I+
Sbjct: 61 HEPEEKQYNFEGDRDLVRFVQTAAEVGLHVLLRVGPYICAERDLGGLPYWLLGKYPNIKL 120
Query: 161 RTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMES--SYGQQGKDY 218
RT + F E ++KK+ + + +LF GGPII++Q+ENEYG+ +S +Y ++ +D
Sbjct: 121 RTTDKDFIAESDIWLKKLFEQV-SHLLFG-NGGPIILVQVENEYGSYDSDLAYKEKMRDL 178
Query: 219 VKW-----------------AASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKP 261
+ A M G+ A + + + Q P D+ +
Sbjct: 179 ISAHVGDKALLYTTDGPSLVGAGMIPGVHATIDFGVTSQ---PTEQFDSL-------FHL 228
Query: 262 NSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGR 321
P + +E + GW T WG R+ D+ + R +N+Y++FGG+NF
Sbjct: 229 RPAPGPLMNSEFYPGWLTHWGERMARVGTNDIVLTL-RNMIVNKIHVNFYVFFGGSNFEF 287
Query: 322 TSGGPF------YITSYDYDAPIDEYG 342
TSG F ITSYDYDAP+ E G
Sbjct: 288 TSGANFDGTYQPDITSYDYDAPLSEAG 314
Score = 48.1 bits (113), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 48/177 (27%), Positives = 84/177 (47%), Gaps = 28/177 (15%)
Query: 533 NEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNY 592
NE + ++ RD++ VF++G+ G + K + +G + L LL + G NY
Sbjct: 400 NETEGVLVLNKPRDLVFVFVDGKPQGVLSRMHKKYHLRISSTAG-SKLSLLVENQGRINY 458
Query: 593 GAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQI-YSIE--ENEAEWTD 649
G L D G LS++++ +V + G++ Y +E + + ++
Sbjct: 459 GTLLH-DRKGI---------------LSEVIYNNKV-IGGKWSITGYPLETVQFNSSVSE 501
Query: 650 LTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMG--KGQAWVNGHHIGRYWTVVAPK 704
+T+ T+Y+ F P+G P+ L + G KG WVNGH++GRYW V P+
Sbjct: 502 VTQGP-----TFYEGTFVLPEGQKPLDTFLDTTGWDKGYVWVNGHNLGRYWPGVGPQ 553
>gi|260813304|ref|XP_002601358.1| hypothetical protein BRAFLDRAFT_114709 [Branchiostoma floridae]
gi|229286653|gb|EEN57370.1| hypothetical protein BRAFLDRAFT_114709 [Branchiostoma floridae]
Length = 638
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 103/323 (31%), Positives = 164/323 (50%), Gaps = 25/323 (7%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG ++S IHY R E W D + K K G + +ETYV WN HE +G+++F G
Sbjct: 20 LDGKPVQILSGAIHYFRVPREYWRDRMLKLKACGLNTLETYVCWNLHEPEKGKFDFTGML 79
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
DI +++ + GL++ R GPY+CAEW++GG P WL P ++ RT P+ E ++RF
Sbjct: 80 DIAAYLREAANLGLWVIFRPGPYICAEWDYGGLPSWLLRDPNMQVRTTYQPYMEAVERFF 139
Query: 176 KKIVDLMREEMLFSW-QGGPIIMLQIENEYGN--MESSYGQQGKDYVKWAASMALGLGAG 232
++ +++ F + +GGPII +Q+ENEYG+ + Y K ++ L L +
Sbjct: 140 DALLPIVKP---FQYKEGGPIIAMQVENEYGSYARDDKYLTAVKQAIQKRGIEELLLTSD 196
Query: 233 VPWVMCKQTDAPENIIDACNGYY-----CDGYKPNSYNKPTLWTENWDGWYTTWGG---R 284
+ + ++ N + K N+P + E W GW+ WG +
Sbjct: 197 GGQIERLERGCIPGVLMTANFNFNPKKQLGALKKLQPNRPQMVMEFWSGWFDHWGRDHHK 256
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAPI 338
L E L + RF S +N+YM+ GGTNFG +G + +TSYDYDAP+
Sbjct: 257 LHVEKFEQLLGDILRF----PSSVNFYMFHGGTNFGFMNGANYINGYKPDVTSYDYDAPL 312
Query: 339 DEYGLLSEPKWGHLKDLHAAIKL 361
E G + PK+ ++L + +
Sbjct: 313 SEAGDPT-PKYYKTRELLKTLAM 334
Score = 40.8 bits (94), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 62/248 (25%), Positives = 107/248 (43%), Gaps = 52/248 (20%)
Query: 543 SMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLI-LLSQTVGLQNYG-------A 594
+RD ++F+NG+ +G + +W +V + ND++ +L + G N+
Sbjct: 420 DVRDRAQIFVNGEESGML--NW-RVGEIAMSGLKENDILDILVENQGRVNFAQTMDGVKK 476
Query: 595 FLEKDGAGF-RGQVKLTGFKN--GDIDLS----KILWTYQVGLKGEFQQIYSIEENEAEW 647
F+ + AG RG L K G++ L+ K + + LK EFQ + E
Sbjct: 477 FVLESVAGVNRGDALLDQRKGLVGEVLLNTTPLKTWEIFPLELKPEFQTRLVESPDWQEP 536
Query: 648 TDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGS-MGKGQAWVNGHHIGRYWTVVAPKGG 706
TD T P+ ++ F+ P+ LD+ GKG A +NG ++GRYW + G
Sbjct: 537 TDATEVPFPA---FHLVNFNIPEEPKDTFLDMKKGWGKGVAILNGFNLGRYWHI-----G 588
Query: 707 CQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLR 766
Q+T +VP +L+ +N L++FE+ PF+ V
Sbjct: 589 PQETL-----------------------YVPAPFLKKGDNQLLLFEQH--IPFKEVVFTD 623
Query: 767 STRIVCEQ 774
+ R+ E+
Sbjct: 624 TPRLGKEK 631
>gi|229549776|ref|ZP_04438501.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|312950913|ref|ZP_07769823.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|422692785|ref|ZP_16750800.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|422706430|ref|ZP_16764128.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
gi|422727290|ref|ZP_16783733.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0312]
gi|229305045|gb|EEN71041.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|310631062|gb|EFQ14345.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|315152244|gb|EFT96260.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|315156045|gb|EFU00062.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
gi|315157806|gb|EFU01823.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0312]
Length = 604
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 139 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 191
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 192 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 248
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 249 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 306
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 307 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKDSFAQ 356
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 88/223 (39%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 417 RDRLQLFVNQVHQATQYQTEIGEDIYVTLPQE----NNQIDVLMENMGRVNYGHKLFAD- 471
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + +Q Y + E D +R+ P +
Sbjct: 472 ------TQKKGIRTGVMADLHFMTQWQQ---------YCLPMTSCEQVDYSREWQPDQPS 516
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 517 FYQYHMELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 555
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 556 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 590
>gi|423260402|ref|ZP_17241324.1| hypothetical protein HMPREF1055_03601 [Bacteroides fragilis
CL07T00C01]
gi|423266536|ref|ZP_17245538.1| hypothetical protein HMPREF1056_03225 [Bacteroides fragilis
CL07T12C05]
gi|387774956|gb|EIK37065.1| hypothetical protein HMPREF1055_03601 [Bacteroides fragilis
CL07T00C01]
gi|392699768|gb|EIY92937.1| hypothetical protein HMPREF1056_03225 [Bacteroides fragilis
CL07T12C05]
Length = 624
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 111/334 (33%), Positives = 162/334 (48%), Gaps = 53/334 (15%)
Query: 58 GNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDI 117
G ++S +HY R + W + K G + + TYVFWN HE G+++F G ++
Sbjct: 35 GETTPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94
Query: 118 VKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKK 177
+++++ G G+ + LR GPYVCAEW FGG+P WL++IPG+E R +N E ++ KK
Sbjct: 95 AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNT----EFLKYTKK 150
Query: 178 IVDLMREEM--LFSWQGGPIIMLQIENEYGNMESSYGQQGKD--------YVKWAASMAL 227
+D + +E+ L +GGPIIM+Q ENE+G SY Q KD Y
Sbjct: 151 YIDRLYQEVGPLQCTKGGPIIMVQCENEFG----SYVSQRKDISFEEHRSYNAKIKGQLA 206
Query: 228 GLGAGVP-------WVM---CKQTDAP--------ENIIDACNGYYCDGYKPNSYNKPTL 269
G VP W+ C P N+ N Y+ P +
Sbjct: 207 DAGFTVPLFTSDGSWLFEGGCVAGALPTANGESDIANLKKVVNQYHGG-------KGPYM 259
Query: 270 WTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-- 327
E + GW + WG P ++A + Q SF N+YM GGTNFG TSG +
Sbjct: 260 VAEFYPGWLSHWGEPFPQVSASEIARQTEAYLQNDVSF-NFYMVHGGTNFGFTSGANYDK 318
Query: 328 ------YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+TSYDYDAPI E G ++ PK+ ++ +
Sbjct: 319 KRDIQPDLTSYDYDAPISEAGWIT-PKYDSIRSV 351
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 35/82 (42%), Gaps = 28/82 (34%)
Query: 677 LDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHV 736
+D+ + GKG ++NG HIGRYW V P QT Y +
Sbjct: 555 IDMRAWGKGVIFINGKHIGRYWKV---------------------------GPQQTLY-I 586
Query: 737 PRSWLQASNNLLVIFEETGGNP 758
P WL+ N +VIFE+ P
Sbjct: 587 PGVWLRKGENKIVIFEQLNEVP 608
>gi|422881390|ref|ZP_16927846.1| beta-galactosidase [Streptococcus sanguinis SK355]
gi|332364328|gb|EGJ42102.1| beta-galactosidase [Streptococcus sanguinis SK355]
Length = 592
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 114/335 (34%), Positives = 158/335 (47%), Gaps = 38/335 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG ++S I Y R P+ W D + K G + +ETY+ W HE GQ+ +G
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D + KLV GLYL +R PY+CAE++FGG P WL P + R N+ F E++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHF- 130
Query: 176 KKIVDLMREEML--FSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
D + ++L S Q GPI+M+Q+ENEYG SY + K Y++ A M G V
Sbjct: 131 ---YDWLFPKLLPYQSDQDGPILMMQVENEYG----SYAED-KAYMRSIAQMMKVRGVTV 182
Query: 234 P-------WVMCKQTDAPENIIDACNGYYCDGYKPNSYNK-----------PTLWTENWD 275
P W+ ++ G + K N+ N P + TE WD
Sbjct: 183 PLFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------- 328
GW++ W + R EDLA V Q G MN ++ GGTNFG SG
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGS--MNLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCE 363
ITSYD+DAPI E+G +E + + H E
Sbjct: 301 ITSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELE 335
>gi|327282153|ref|XP_003225808.1| PREDICTED: beta-galactosidase-like [Anolis carolinensis]
Length = 649
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 115/350 (32%), Positives = 170/350 (48%), Gaps = 32/350 (9%)
Query: 22 MMMMMMIHLSCVSSSSAS------TFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATP 75
M + ++ L C + +S+S T + F + Y H + DG IS IHY R
Sbjct: 1 MFALFLLSLLCPALASSSSSSSVITSQRTFGIDYGHNCFLKDGQPFRYISGSIHYSRIPR 60
Query: 76 EMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRI 135
W D + K K G D I+TYV WN HE RG YNF G D+ F++L GL + LR
Sbjct: 61 YYWKDRLLKMKMAGLDAIQTYVPWNFHEPERGVYNFTGDRDLEYFLQLAQEVGLLVILRA 120
Query: 136 GPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPI 195
GPY+CAEW+ GG P WL + I R+++ + + ++ + M+ + GGPI
Sbjct: 121 GPYICAEWDMGGLPAWLLEKESIVLRSSDPDYLTAVGSWMGIFLPKMKPHLY--QNGGPI 178
Query: 196 IMLQIENEYGNMESSYGQQGKDYVKWAASM---ALG--------LGAGVPWVMCKQTDAP 244
IM+Q+ENEYG SY DY+++ ++ LG GA + ++ C
Sbjct: 179 IMVQVENEYG----SYFACDFDYLRYLQNLFRQYLGDEVVLFTTDGASMFYLRCGALQGL 234
Query: 245 ENIIDACNGYYCDG----YKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARF 300
+ +D G + P + +E + GW WG R P +A +++
Sbjct: 235 YSTVDFGPGRNVTAAFSTQRHTEPKGPLVNSEFYTGWLDHWGHRHITVPASIVAKSLSEI 294
Query: 301 FQRGGSFMNYYMYFGGTNFGRTSGG--PFYI--TSYDYDAPIDEYGLLSE 346
G + +N YM+ GGTNFG +G P+ TSYDYDAP+ E G L+E
Sbjct: 295 LASGAN-VNMYMFIGGTNFGYWNGANMPYMAQPTSYDYDAPLSEAGDLTE 343
Score = 43.5 bits (101), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 49/180 (27%), Positives = 75/180 (41%), Gaps = 28/180 (15%)
Query: 532 TNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQN 591
T E + T D + D V ++G G + +K + + +G DL LL + +G N
Sbjct: 425 TEETPLSSTFDGIHDRAYVSVDGVRQGILERSSLKKLN-ITGNAG-ADLDLLVENMGRVN 482
Query: 592 YGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLT 651
+G F D G V L G+ L+ W +IY ++ + A L
Sbjct: 483 FGRF-NNDFKGLISNVTL-----GEEILTD--W-----------EIYPLDIDRAVAEGLE 523
Query: 652 RDGIPSTF---TWYKTYFDAPDGIDPVALD----LGSMGKGQAWVNGHHIGRYWTVVAPK 704
G S++ +Y F P GI + D KGQ W+NG ++GRYW V P+
Sbjct: 524 NKGNASSYEVPAFYIGSFSIPSGIPDLPQDTYLTFPGWTKGQVWINGFNLGRYWPVAGPQ 583
>gi|291535092|emb|CBL08204.1| Beta-galactosidase [Roseburia intestinalis M50/1]
gi|291539606|emb|CBL12717.1| Beta-galactosidase [Roseburia intestinalis XB6B4]
Length = 581
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 113/333 (33%), Positives = 165/333 (49%), Gaps = 47/333 (14%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG +IS IHY R PE W D + K K G + +ETY+ WN HE +G+++F+G
Sbjct: 12 LDGKPFQIISGAIHYFRIVPEYWQDRLEKLKAMGCNTVETYIPWNMHEPKKGEFHFEGML 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF- 174
DI +FVK GLY+ LR PY+CAEW FGG P WL G++ R + PF + +Q +
Sbjct: 72 DIERFVKTAQELGLYVILRPSPYICAEWEFGGLPAWLLAEDGMKLRVSYPPFLKHVQDYY 131
Query: 175 ---VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGA 231
+KKIV GGP+I++Q+ENEYG Y ++Y+ G
Sbjct: 132 DVLLKKIVPYQIN------YGGPVILMQVENEYG-----YYANDREYLLAMRDKMQKGGV 180
Query: 232 GVPWVMCKQTDAPENIIDACNGYYCDGYKPN-----------------SYNKPTLWTENW 274
VP V +D P + NG + +G P + P + TE W
Sbjct: 181 VVPLV---TSDGP--FEENLNGGHLEGALPTGNFGSKTEERFEVLKKYTDGGPLMCTEFW 235
Query: 275 DGWYTTWG-GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY----- 328
GW+ WG G +E+ + + + G +N YM+ GGTNFG +G +Y
Sbjct: 236 VGWFDHWGNGGHMTGNLEESVKDLDKMLELG--HVNIYMFEGGTNFGFMNGSNYYDELTP 293
Query: 329 -ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIK 360
+TSYDYDA + E G ++E K+ +D+ A +
Sbjct: 294 DVTSYDYDALLTEDGQITE-KYRRYRDVIAKYR 325
>gi|307269354|ref|ZP_07550702.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
gi|306514322|gb|EFM82889.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
Length = 604
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 139 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 191
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 192 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 248
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 249 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 306
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 307 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKESFAQ 356
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 88/223 (39%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 417 RDRLQLFVNQVHQATQYQTEIGEDIYVTLPQE----NNQIDVLIENMGRVNYGHKLFAD- 471
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + +Q Y + E D +R+ P +
Sbjct: 472 ------TQKKGIRTGVMADLHFMTQWQQ---------YCLPMTSCEQVDYSREWQPDQPS 516
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 517 FYQYHVELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 555
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 556 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 590
>gi|322437493|ref|YP_004219583.1| glycoside hydrolase family protein [Granulicella tundricola
MP5ACTX9]
gi|321165386|gb|ADW71089.1| glycoside hydrolase family 35 [Granulicella tundricola MP5ACTX9]
Length = 607
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 108/322 (33%), Positives = 157/322 (48%), Gaps = 32/322 (9%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
++ D + ++DG LIS +HYPR W D + K++ G + + Y FWN HE
Sbjct: 26 LTTDPQHFLLDGQPFQLISGEMHYPRIPRAAWRDRLRKARAMGLNAVTVYAFWNFHEEEE 85
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G ++F G+ DI +FV++ GL++ LR GPYVCAEW+ GG+P WL P + R+ ++
Sbjct: 86 GHFDFTGQRDIAEFVRIAQQEGLFVILRPGPYVCAEWDLGGYPSWLLKSPAVNLRSLDSR 145
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
+ +++K + + L + +GGPI+ +Q+ENEYG+ S + Y+ M
Sbjct: 146 YIAAADKWMKALGQQLAP--LQAAKGGPILAVQVENEYGSFPDSAQPNAQAYLDRVHQMV 203
Query: 227 L-----------GLGAGVPWVMCKQTDAPENI-IDACNGYYCDG---YKPNSYNKPTLWT 271
L G GA V + + T A ID G YK N
Sbjct: 204 LDAGFKDSLLYTGDGADV---LARGTFADLTAGIDYGTGDSARSIALYKKFRPNTNIYTA 260
Query: 272 ENWDGWYTTWGGRLPHRPVEDLAF--AVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY- 328
E WDGW+ WG + H V+ V GGS ++ YM GGT+FG +G
Sbjct: 261 EYWDGWFDHWGAK--HEVVDASIHLKEVHDVLTSGGS-ISLYMLHGGTSFGWMNGANIDH 317
Query: 329 ------ITSYDYDAPIDEYGLL 344
+TSYDYDAPIDE G L
Sbjct: 318 NHYEPDVTSYDYDAPIDEAGQL 339
Score = 47.0 bits (110), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 50/230 (21%), Positives = 87/230 (37%), Gaps = 61/230 (26%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGHW------VKVVQPVEFQSGYNDLILLSQTVGLQN 591
T+ +D + R++++G+L G++ +++ +P + L +L + G N
Sbjct: 420 TLKLDRLHSYARIYLDGKLVGTLDRRLDQDHIDLQINKPTQ-------LDILVENTGRVN 472
Query: 592 YGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLT 651
+ + + AG QV L G E QIYS+ T +
Sbjct: 473 FTEAIRTEQAGITHQVLLNG------------------TPVENWQIYSLPFESIPTTGFS 514
Query: 652 RDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTC 711
++ T F+ +D LD+ ++ KG WVNGH++GR+W +
Sbjct: 515 TKPCEGPCLYHAT-FNLTTPVD-TYLDVHTLSKGNVWVNGHNLGRFWKI----------- 561
Query: 712 DYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEI 761
P T Y +P SWL+ N + + E G EI
Sbjct: 562 ----------------GPLGTLY-LPSSWLKPGPNKIEVLELDGKPSLEI 594
>gi|256424388|ref|YP_003125041.1| beta-galactosidase [Chitinophaga pinensis DSM 2588]
gi|256039296|gb|ACU62840.1| Beta-galactosidase [Chitinophaga pinensis DSM 2588]
Length = 586
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 114/346 (32%), Positives = 174/346 (50%), Gaps = 36/346 (10%)
Query: 30 LSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGG 89
+ V++ S TF + + ++D +IS +H R E W I +K G
Sbjct: 1 MGSVNAQSKHTF------ALSKKDFLLDSKPYQIISGEMHPARIPKEYWRHRIQMAKAMG 54
Query: 90 ADVIETYVFWNAHESIRGQYNFKGKN-DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGF 148
+ I YVFWN HE G+++F +N DIV F+K+V G+++ LR GPYVCAEW FGG
Sbjct: 55 CNTIAAYVFWNYHEQEEGKFDFTSENRDIVAFIKMVQEEGMWVMLRPGPYVCAEWEFGGL 114
Query: 149 PVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNME 208
P +L IP I+ R + + +R++K + + ++ + + GGPI+M+Q+ENEYG
Sbjct: 115 PPYLLRIPDIKVRCMDPRYIAATERYIKALSEEVKPLQITN--GGPIVMVQVENEYG--- 169
Query: 209 SSYGQQGKDYVKWAASMALGLGAGVPW--------VMCKQTDAPENII----DACNGYYC 256
S+G ++Y+ M + G VP+ + + P I + G +
Sbjct: 170 -SFGND-REYMLKVKDMWVQNGINVPFYTADGPVSALLEAGSVPGAAIGLDSGSSEGDFA 227
Query: 257 DGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGG 316
K N + P+ +E++ GW T WG + RP + +F N Y+ GG
Sbjct: 228 AAEKQNP-DVPSFSSESYPGWLTHWGEKW-ARPDKAGIVKEVKFLMDTKRSFNLYVIHGG 285
Query: 317 TNFGRT----SGGPFY---ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
TNFG T SGG Y +TSYDYDAPI+E G + K+ L+DL
Sbjct: 286 TNFGFTAGANSGGKGYEPDLTSYDYDAPINEQGDTTA-KYNALRDL 330
>gi|422695218|ref|ZP_16753206.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
gi|315147501|gb|EFT91517.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
Length = 604
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 139 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 191
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 192 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 248
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 249 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 306
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 307 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKESFAQ 356
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 88/223 (39%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 417 RDRLQLFVNQVHQATQYQTEIGEDIYVTLPQE----NNQIDVLIENMGRVNYGHKLFAD- 471
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + +Q Y + E D +R+ P +
Sbjct: 472 ------TQKKGIRTGVMADLHFMTQWQQ---------YCLPMTSCEQVDYSREWQPDQPS 516
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 517 FYQYHMELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 555
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 556 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 590
>gi|307289344|ref|ZP_07569299.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
gi|422704713|ref|ZP_16762523.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
gi|306499711|gb|EFM69073.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
gi|315163744|gb|EFU07761.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
Length = 604
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 139 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 191
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 192 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 248
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 249 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 306
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 307 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKESFAQ 356
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 88/223 (39%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 417 RDRLQLFVNQIHQATQYQTEIGEDIYVTLPQE----NNQIDVLMENMGRVNYGHKLFAD- 471
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + +Q Y + E D +R+ P +
Sbjct: 472 ------TQKKGIRTGVMADLHFMTQWQQ---------YCLPMTSCEQVDYSREWQPDQPS 516
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 517 FYQYHMELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 555
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 556 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 590
>gi|422866702|ref|ZP_16913314.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
gi|329578150|gb|EGG59560.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
Length = 604
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 139 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 191
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 192 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 248
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 249 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 306
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 307 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKESFAQ 356
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 89/223 (39%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q+T + IG + V P E N + +L + +G NYG L D
Sbjct: 417 RDRLQLFVNQVHQVTQYQTEIGEDIYVTLPQE----NNQIDVLIENMGRVNYGHKLFAD- 471
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + +Q Y + E D +R+ P +
Sbjct: 472 ------TQKKGIRTGVMADLHFMTQWQQ---------YCLPMTSCEQVDYSREWQPDQPS 516
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 517 FYQYHVELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 555
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 556 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 590
>gi|386725149|ref|YP_006191475.1| beta-galactosidase [Paenibacillus mucilaginosus K02]
gi|384092274|gb|AFH63710.1| beta-galactosidase [Paenibacillus mucilaginosus K02]
Length = 591
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 110/329 (33%), Positives = 163/329 (49%), Gaps = 28/329 (8%)
Query: 57 DGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKND 116
DG L S IHY R PE W D + K K G + +ETYV WN HE G++ F+G D
Sbjct: 14 DGEELRLYSGAIHYFRIVPEYWEDRLRKLKACGFNTVETYVPWNLHEPQEGRFVFEGMAD 73
Query: 117 IVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVK 176
+ +F++L G GL++ +R PY+CAEW FGG P WL PG++ R + + ++ +
Sbjct: 74 LERFIRLAGRLGLHVIVRPSPYICAEWEFGGLPAWLLAEPGMKLRCADPLYLSKVDAYYD 133
Query: 177 KIVDLMREEMLFSWQGGPIIMLQIENEYGNMES--SYGQQGKD-YVKWAASMALGLGAGV 233
+++ R L GGP+I++Q+ENEYG+ S +Y + +D V+ + L G
Sbjct: 134 ELIP--RLVPLLCTSGGPVILVQVENEYGSYGSDKAYLEHLRDGLVRRGIDVPLFTSDGP 191
Query: 234 PWVMCKQTDAPENIIDACNGYY-------CDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
M + P + G Y+P P + E W+GW+ W
Sbjct: 192 TDAMLQGGSLPGVLATVNFGSRTAESFAKLREYQPQG---PLMCMEYWNGWFDHWMEEHH 248
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAPIDE 340
R D A + G S +N+YM+ GGTNFG +G ITSYDYD+P+ E
Sbjct: 249 QRDAADAARVFGEMLEAGAS-VNFYMFHGGTNFGFYNGANHIKTYEPTITSYDYDSPLTE 307
Query: 341 YGLLSEP--KWGHLKDLHAA-IKLCEPAL 366
+G EP K+ ++D+ A + L P L
Sbjct: 308 WG---EPTAKYDAVRDVLAKHLPLGAPEL 333
Score = 46.2 bits (108), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 63/230 (27%), Positives = 91/230 (39%), Gaps = 51/230 (22%)
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQT 586
IS +T +V + + +RD +VF++G G V+ W PV G L +L +
Sbjct: 388 ISGPRTGQV---LHVQEVRDRAQVFLDGTPAG-VVERWDPKGLPVTVPEGGAALDILVEN 443
Query: 587 VGLQNYGAFLEKDGAGFRGQVKLTG-FKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEA 645
+G NYG L D G V+L F+ G WT GL + S+E
Sbjct: 444 MGRINYGPLL-SDAKGITCGVRLDNQFQYG--------WT-MYGLP-----LDSLEGGAP 488
Query: 646 EWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKG 705
E L P +Y+ F+ + D + L KG ++NG H+GRYW
Sbjct: 489 E--PLAEGEAPGGPAFYRAAFEVDEPAD-TFVRLDGWTKGVVFINGFHLGRYWE------ 539
Query: 706 GCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
RG P +T Y +P L+ N LV+FE G
Sbjct: 540 --------RG-------------PQKTLY-LPGPLLRRGTNELVVFELHG 567
>gi|257866484|ref|ZP_05646137.1| glycosyl hydrolase [Enterococcus casseliflavus EC30]
gi|257873001|ref|ZP_05652654.1| glycosyl hydrolase [Enterococcus casseliflavus EC10]
gi|257800442|gb|EEV29470.1| glycosyl hydrolase [Enterococcus casseliflavus EC30]
gi|257807165|gb|EEV35987.1| glycosyl hydrolase [Enterococcus casseliflavus EC10]
Length = 591
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 114/328 (34%), Positives = 160/328 (48%), Gaps = 42/328 (12%)
Query: 42 FKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNA 101
+ F + D ++DG LIS IHY R T W D + K GA+ +ETY+ WN
Sbjct: 1 MRTFEIKED---FLLDGKPIKLISGAIHYFRMTSAQWADSLYNLKALGANTVETYIPWNL 57
Query: 102 HESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFR 161
HE G Y+F+G DI FVK + GL + LR Y+CAEW FGG P WL + P + R
Sbjct: 58 HEPREGVYDFEGMKDIFAFVKQAQALGLMVILRPSVYICAEWEFGGLPAWLLNEP-MRLR 116
Query: 162 TNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKW 221
+ + F +++ + + V L + L GGP+IM+Q+ENEYG SYG + K Y++
Sbjct: 117 STDPRFMAKVRNYFQ--VLLPKLVPLQITHGGPVIMMQVENEYG----SYGME-KAYLRQ 169
Query: 222 AASMALGLGAGVPWVMCKQTDAPENIIDA---------CNGYYCDGYKPNSY-------- 264
+ G VP + A E ++DA G + K N+
Sbjct: 170 TKELMEECGIDVP--LFTSDGAWEEVLDAGTLIEDDVFVTGNFGSRSKENAAVMKEFMAK 227
Query: 265 ---NKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGR 321
N P + E WDGW+ WG + R +DLA V G +N YM+ GGTNFG
Sbjct: 228 HGKNWPIMCMEYWDGWFNRWGEPIIKRDGQDLANEVKEMLAVGS--LNLYMFHGGTNFGF 285
Query: 322 TSG----GPF---YITSYDYDAPIDEYG 342
++G G ++SYDYDA + E G
Sbjct: 286 SNGCSARGALDLPQVSSYDYDALLTEAG 313
>gi|257082326|ref|ZP_05576687.1| beta-galactosidase [Enterococcus faecalis E1Sol]
gi|256990356|gb|EEU77658.1| beta-galactosidase [Enterococcus faecalis E1Sol]
Length = 594
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 129 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 181
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 182 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 238
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 239 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 296
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 297 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKDSFAQ 346
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 52/215 (24%), Positives = 87/215 (40%), Gaps = 53/215 (24%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 407 RDRLQLFVNQVHQATQYQTEIGEDIYVTLPQE----NNQIDILMENMGRVNYGHKLFAD- 461
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + ++QQ Y + E D +R+ P +
Sbjct: 462 ------TQKKGIRTGVMA--------DLHFMTQWQQ-YCLPMTSCEQVDYSREWQPDQPS 506
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 507 FYQYHVELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 545
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
PT + Y +P+ L+ N +VIFE G
Sbjct: 546 -------GPTLSLY-IPKGLLKKGQNEIVIFETEG 572
>gi|257416321|ref|ZP_05593315.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
gi|257158149|gb|EEU88109.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
Length = 594
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 129 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 181
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 182 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 238
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 239 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 296
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 297 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKDSFAQ 346
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 54/223 (24%), Positives = 90/223 (40%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 407 RDRLQLFVNQVHQATQYQTEIGEDIYVTLPQE----NNQIDVLMENMGRVNYGHKLFAD- 461
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + ++QQ Y + E D +R+ P +
Sbjct: 462 ------TQKKGIRTGVMA--------DLHFMTQWQQ-YCLPMTSCEQVDYSREWQPDQPS 506
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 507 FYQYHVELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 545
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 546 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 580
>gi|329960218|ref|ZP_08298660.1| beta-galactosidase domain protein [Bacteroides fluxus YIT 12057]
gi|328532891|gb|EGF59668.1| beta-galactosidase domain protein [Bacteroides fluxus YIT 12057]
Length = 1104
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 112/335 (33%), Positives = 164/335 (48%), Gaps = 29/335 (8%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++ +A +HYPR W I K G + I YVFWN+HE G ++F G
Sbjct: 356 FLLNGKPFVVKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHEPQPGVFDFTG 415
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+ND+ +F +L + +Y+ LR GPYVCAEW GG P WL I R ++ F E +
Sbjct: 416 QNDLAEFCRLCRQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFIERVGI 475
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVK----------- 220
F K + + + + + + GGPIIM+Q+ENEYG+ + Y Q +D V+
Sbjct: 476 FEKAVAEQVADMTIQN--GGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVTLFQC 533
Query: 221 -WAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYT 279
WA++ + W M T A NI +P+S P + +E W GW+
Sbjct: 534 DWASNFTKNGLHDLVWTMNFGTGA--NIDQQFAP--LKKLRPDS---PLMCSEFWSGWFD 586
Query: 280 TWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITSYDY 334
WG RP D+ + +G SF + YM GGTN+G +G P + +TSYDY
Sbjct: 587 KWGANHETRPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDY 645
Query: 335 DAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAA 369
DAPI E G + W K L + + A V A
Sbjct: 646 DAPISESGQTTPKYWELRKTLSKYMDGEKQAKVPA 680
Score = 39.3 bits (90), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 50/225 (22%), Positives = 87/225 (38%), Gaps = 48/225 (21%)
Query: 539 VTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEK 598
+T++ D ++F+NG+ G + + L +L + +G N+G + K
Sbjct: 739 LTVNDAHDYAQIFLNGKYIGKLDRRNGEKQLAFPACPKGARLDILVEAMGRINFGRAI-K 797
Query: 599 DGAGFRGQVKLTGFKNG--------DIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDL 650
D G V+LT +G D ++ + TY +F+ I S+++ +
Sbjct: 798 DFKGITRSVELTVDIDGHPFTCDLKDWEVYNLEDTYDFYKNMKFRPIGSLKDESGQ---- 853
Query: 651 TRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDT 710
IP Y+ F D L+ + GKG +VNGH +GR W +
Sbjct: 854 ---RIPGC---YRATFKVNKPSD-TFLNFETWGKGLVYVNGHAMGRIWEI---------- 896
Query: 711 CDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P QT Y +P WL+ N +++F+ G
Sbjct: 897 -----------------GPQQTLY-IPGCWLKKGENEVMVFDIIG 923
>gi|326922161|ref|XP_003207320.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Meleagris
gallopavo]
Length = 643
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 112/351 (31%), Positives = 173/351 (49%), Gaps = 27/351 (7%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
+ F + YD + DG IS IHY R W D + K K G D I+TYV WN H
Sbjct: 14 RTFGIDYDCNCFVKDGRPFRYISGSIHYSRVPRYYWKDRLLKMKMAGLDAIQTYVPWNYH 73
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E+ G Y+F G D+ F++L +GL + LR GPY+CAEW+ GG P WL + I R+
Sbjct: 74 ETQMGVYDFSGDRDLEYFLQLASETGLLVILRAGPYICAEWDMGGLPAWLLEKESIVLRS 133
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWA 222
+++ + +++++ ++ M+ + GGPIIM+Q+ENEYG SY DY++
Sbjct: 134 SDSDYLTAVEKWMGVLLPKMKPHLY--QNGGPIIMVQVENEYG----SYFACDYDYLRSL 187
Query: 223 ASM---ALG--------LGAGVPWVMCKQTDAPENIID-ACNGYYCDGYKPNSYNKPT-- 268
+ LG GA + C +D A G + ++PT
Sbjct: 188 LKIFRQHLGDEVVLFTTDGASQFHLKCGALQGLYATVDFAPGGNVTAAFLAQRSSEPTGP 247
Query: 269 -LWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG-- 325
+ +E + GW WG R P + +A + RG + +N YM+ GGTNF +G
Sbjct: 248 LVNSEFYTGWLDHWGHRHAVVPSQTIAKTLNEILARGAN-VNLYMFIGGTNFAYWNGANM 306
Query: 326 PFY--ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQY 374
P+ TSYDYDAP+ E G L+E K+ L+++ L+ ++++
Sbjct: 307 PYMSQPTSYDYDAPLSEAGDLTE-KYFALREVIGMYNQLPEGLIPPTTSKF 356
>gi|257087085|ref|ZP_05581446.1| beta-galactosidase [Enterococcus faecalis D6]
gi|256995115|gb|EEU82417.1| beta-galactosidase [Enterococcus faecalis D6]
Length = 594
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 129 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 181
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 182 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 238
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 239 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 296
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 297 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKDSFAQ 346
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 54/223 (24%), Positives = 90/223 (40%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 407 RDRLQLFVNQVHQATQYQTEIGEDIYVTLPQE----NNQIDVLMENMGRVNYGHKLFAD- 461
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + ++QQ Y + E D +R+ P +
Sbjct: 462 ------TQKKGIRTGVMA--------DLHFMTQWQQ-YCLPMTSCEQVDYSREWQPDQPS 506
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 507 FYQYHMELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 545
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 546 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 580
>gi|224152391|ref|XP_002337230.1| predicted protein [Populus trichocarpa]
gi|222838524|gb|EEE76889.1| predicted protein [Populus trichocarpa]
Length = 144
Score = 166 bits (420), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 74/124 (59%), Positives = 95/124 (76%), Gaps = 1/124 (0%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
NVSYD R++II+G R++LISA IHYPR+ P MWP+L+ +KEGG DVIETYVFWN H+
Sbjct: 20 NVSYDSRSLIINGERKLLISAAIHYPRSVPAMWPELVKTAKEGGVDVIETYVFWNVHQPT 79
Query: 106 R-GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNN 164
+Y+F G+ D+VKF+ +V +G+YL LRIGP+V AEWNFGG PVWL + G FRT+N
Sbjct: 80 SPSEYHFDGRFDLVKFINIVQEAGMYLILRIGPFVAAEWNFGGIPVWLHYVNGTVFRTDN 139
Query: 165 APFK 168
FK
Sbjct: 140 YNFK 143
>gi|384518826|ref|YP_005706131.1| beta-galactosidase [Enterococcus faecalis 62]
gi|323480959|gb|ADX80398.1| beta-galactosidase [Enterococcus faecalis 62]
Length = 594
Score = 166 bits (420), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 129 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 181
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 182 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 238
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 239 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 296
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 297 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKDSFAQ 346
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 52/223 (23%), Positives = 88/223 (39%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFINGQLTGSV----IGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N + IG + V P E N + +L + +G NYG L D
Sbjct: 407 RDRLQLFVNQVYQATQYQTEIGEDIYVTLPQE----NNQIDILMENMGRVNYGHKLFAD- 461
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + ++QQ Y + E D +R+ P +
Sbjct: 462 ------TQKKGIRTGVMA--------DLHFMTQWQQ-YCLPMTSCEQVDYSREWQPDQPS 506
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 507 FYQYHVELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 545
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 546 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 580
>gi|307188518|gb|EFN73255.1| Beta-galactosidase [Camponotus floridanus]
Length = 624
Score = 166 bits (420), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 110/337 (32%), Positives = 163/337 (48%), Gaps = 36/337 (10%)
Query: 32 CVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGAD 91
V+ S T+ F V Y++ ++DG +S HY RA + W D + K + G +
Sbjct: 19 TVNLPSNDTWQYSFGVDYENNQFLLDGKPFRYVSGSFHYFRAPRQYWRDRLRKMRAAGLN 78
Query: 92 VIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVW 151
+ TYV W+ HE GQ+N+ G D+++F+ + L++ LR GPY+CAE + GG P W
Sbjct: 79 AVSTYVEWSLHEPEPGQFNWAGDADLIEFLNIAQEEDLFVLLRPGPYICAERDLGGLPYW 138
Query: 152 -LRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESS 210
LR+ P I+ RT +A F + ++ ++++ ++ L GGPIIM+QIENEYG S
Sbjct: 139 LLREAPDIKLRTKDAAFMKYATAYLNQVLEKVKP--LLRGNGGPIIMVQIENEYG----S 192
Query: 211 YGQQGKDYVKWAASMALGL-----------GAGVPWVMCKQTDAPENIID------ACNG 253
Y +Y + +G GA + C ID N
Sbjct: 193 YNACDTEYTDMLKEIIVGKVGSKALLYTTDGASASLLRCGFVPGAYATIDFGTSVNVTNS 252
Query: 254 YYC-DGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYM 312
+ Y+P P + +E + GW T WG E + + G S +N YM
Sbjct: 253 FQSMRLYQPRG---PLVNSEFYPGWLTHWGETFQRVKTEAVTKTLREMLALGAS-VNIYM 308
Query: 313 YFGGTNFGRTSG-----GPF--YITSYDYDAPIDEYG 342
++GGTNFG TSG G + ITSYDYDAP+ E G
Sbjct: 309 FYGGTNFGFTSGANGGVGAYSPQITSYDYDAPLTEAG 345
Score = 39.3 bits (90), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 16/28 (57%), Positives = 21/28 (75%)
Query: 677 LDLGSMGKGQAWVNGHHIGRYWTVVAPK 704
LD GKG A+VNGH++GRYW +V P+
Sbjct: 557 LDPTGWGKGVAFVNGHNLGRYWPLVGPQ 584
>gi|422822094|ref|ZP_16870287.1| beta-galactosidase [Streptococcus sanguinis SK353]
gi|324990399|gb|EGC22337.1| beta-galactosidase [Streptococcus sanguinis SK353]
Length = 592
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 114/335 (34%), Positives = 158/335 (47%), Gaps = 38/335 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG ++S I Y R P+ W D + K G + +ETY+ W HE GQ+ +
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEEML 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D + KLV GLYL +R PY+CAE++FGG P WL P + R N+ F E++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHF- 130
Query: 176 KKIVDLMREEML--FSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
D + ++L S QGGPI+M+Q+ENEYG SY + K Y++ A M G V
Sbjct: 131 ---YDWLFPKLLPYQSDQGGPILMMQVENEYG----SYAED-KAYMRSIAQMMKVRGVTV 182
Query: 234 P-------WVMCKQTDAPENIIDACNGYYCDGYKPNSYNK-----------PTLWTENWD 275
P W+ ++ G + K N+ N P + TE WD
Sbjct: 183 PLFTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------- 328
GW++ W + R EDLA V Q G MN ++ GGTNFG SG
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGS--MNLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCE 363
ITSYD+DAPI E+G +E + + H E
Sbjct: 301 ITSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELE 335
>gi|298481696|ref|ZP_06999887.1| beta-galactosidase (Lactase) [Bacteroides sp. D22]
gi|298272237|gb|EFI13807.1| beta-galactosidase (Lactase) [Bacteroides sp. D22]
Length = 778
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 118/353 (33%), Positives = 170/353 (48%), Gaps = 29/353 (8%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
++ ++++ + SS+ A T F + ++DG ++ +A +HY R W
Sbjct: 5 IIALLVLFTVILFSSAQAQTTAHKFEAGKN--TFLLDGKPFVVKAAELHYTRIPQAYWSH 62
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
I K G + I Y+FWN HE G+++F G+NDI F KL G+Y+ +R GPYVC
Sbjct: 63 RIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCKLAQQHGMYVIVRPGPYVC 122
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEW GG P WL + RT + + E + F+K++ + L +GG IIM+Q+
Sbjct: 123 AEWEMGGLPWWLLKKKDVALRTLDPYYMERVGIFMKEVGKQLAP--LQVNKGGNIIMVQV 180
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLG-AGVPWVMCK-----QTDAPENIIDACN-- 252
ENEYG SYG K YV + G VP C +A +++I N
Sbjct: 181 ENEYG----SYGTD-KPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNFG 235
Query: 253 -GYYCD----GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
G D K P + +E W GW+ WG + RP +D+ + R SF
Sbjct: 236 TGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF 295
Query: 308 MNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+ YM GGT FG G P Y +SYDYDAPI E G +E K+ L+DL
Sbjct: 296 -SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KFFLLRDL 346
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 57/128 (44%), Gaps = 34/128 (26%)
Query: 655 IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYR 714
+P+ +YK+ F D + LD+ + GKG WVNGH +GR+W +
Sbjct: 526 LPAMPAYYKSTFKL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------- 570
Query: 715 GAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVK-LRS--TRIV 771
P QT + +P WL+ N +++ + G P + S+K L+ ++
Sbjct: 571 -------------GPQQTLF-MPGCWLKEGENEILVLDLKG--PAKASIKGLKKPILDVL 614
Query: 772 CEQVSESH 779
E+ E+H
Sbjct: 615 REKAPETH 622
>gi|422722062|ref|ZP_16778639.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2137]
gi|424672983|ref|ZP_18109926.1| putative beta-galactosidase [Enterococcus faecalis 599]
gi|315027959|gb|EFT39891.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2137]
gi|402352793|gb|EJU87629.1| putative beta-galactosidase [Enterococcus faecalis 599]
Length = 604
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 139 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 191
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 192 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 248
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 249 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 306
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 307 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKDSFAQ 356
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 88/223 (39%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 417 RDRLQLFVNQVHQATQYQTEIGEDIYVTLPQE----NNQIDVLMENMGRVNYGHKLFAD- 471
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + +Q Y + E D +R+ P +
Sbjct: 472 ------TQKKGIRTGVMADLHFMTQWQQ---------YCLPMTSCEQVDYSREWQPDQPS 516
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 517 FYQYHMELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 555
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 556 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 590
>gi|312901788|ref|ZP_07761056.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
gi|311291123|gb|EFQ69679.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
Length = 604
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 139 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 191
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 192 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 248
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 249 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 306
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 307 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKDSFAQ 356
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 54/223 (24%), Positives = 90/223 (40%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 417 RDRLQLFVNQVHQATQYQTEIGEVIYVTLPQE----NNQIDVLIENMGRVNYGHKLFAD- 471
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + ++QQ Y + E D +R+ P +
Sbjct: 472 ------TQKKGIRTGVMA--------DLHFMTQWQQ-YCLPMTSCEQVDYSREWQPDQPS 516
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 517 FYQYHMELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 555
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 556 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 590
>gi|71896501|ref|NP_001026163.1| beta-galactosidase precursor [Gallus gallus]
gi|53129216|emb|CAG31369.1| hypothetical protein RCJMB04_5i4 [Gallus gallus]
Length = 385
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 111/338 (32%), Positives = 170/338 (50%), Gaps = 27/338 (7%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
+ F + YD + DG+ IS IHY R W D + K K G + I+TYV WN H
Sbjct: 23 RTFGIDYDCNCFVKDGHPFRYISGSIHYSRVPRYYWKDRLLKMKMAGLNAIQTYVPWNYH 82
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E G Y+F G D+ F++L +GL + LR GPY+CAEW+ GG P WL + I R+
Sbjct: 83 EPQMGVYDFSGDRDLEYFLQLASETGLLVILRAGPYICAEWDMGGLPAWLLEKESIVLRS 142
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWA 222
+++ + +++++ ++ M+ + + GGPIIM+Q+ENEYG SY DY++
Sbjct: 143 SDSDYLTAVEKWMGVLLPKMKPHLYHN--GGPIIMVQVENEYG----SYFACDYDYLRSL 196
Query: 223 ASM---ALG--------LGAGVPWVMCKQTDAPENIID-ACNGYYCDGYKPNSYNKPT-- 268
+ LG GA + C +D A G + ++PT
Sbjct: 197 LKIFRQHLGDEVVLFTTDGASQFHLKCGALQGLYATVDFAPGGNVTAAFLAQRSSEPTGP 256
Query: 269 -LWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG-- 325
+ +E + GW WG R P E +A + RG + +N YM+ GGTNF +G
Sbjct: 257 LVNSEFYTGWLDHWGHRHIVVPSETIAKTLNEILARGAN-VNLYMFIGGTNFAYWNGANM 315
Query: 326 PFYI--TSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKL 361
P+ TSYDYDAP+ E G L+E K+ L+++ + +
Sbjct: 316 PYMSQPTSYDYDAPLSEAGDLTE-KYFALREVIGMVSI 352
>gi|257413247|ref|ZP_04742461.2| beta-galactosidase [Roseburia intestinalis L1-82]
gi|257204151|gb|EEV02436.1| beta-galactosidase [Roseburia intestinalis L1-82]
Length = 588
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 113/333 (33%), Positives = 165/333 (49%), Gaps = 47/333 (14%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG +IS IHY R PE W D + K K G + +ETY+ WN HE +G+++F+G
Sbjct: 19 LDGKPFQIISGAIHYFRIVPEYWQDRLEKLKAMGCNTVETYIPWNMHEPKKGEFHFEGML 78
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF- 174
DI +FVK GLY+ LR PY+CAEW FGG P WL G++ R + PF + +Q +
Sbjct: 79 DIERFVKTAQELGLYVILRPSPYICAEWEFGGLPAWLLAEDGMKLRVSYPPFLKHVQDYY 138
Query: 175 ---VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGA 231
+KKIV GGP+I++Q+ENEYG Y ++Y+ G
Sbjct: 139 DVLLKKIVPYQIN------YGGPVILMQVENEYG-----YYANDREYLLAMRDKMQKGGV 187
Query: 232 GVPWVMCKQTDAPENIIDACNGYYCDGYKPN-----------------SYNKPTLWTENW 274
VP V +D P + NG + +G P + P + TE W
Sbjct: 188 VVPLV---TSDGP--FEENLNGGHLEGALPTGNFGSKTEERFEVLKKYTDGGPLMCTEFW 242
Query: 275 DGWYTTWG-GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY----- 328
GW+ WG G +E+ + + + G +N YM+ GGTNFG +G +Y
Sbjct: 243 VGWFDHWGNGGHMTGNLEESVKDLDKMLELG--HVNIYMFEGGTNFGFMNGSNYYDELTP 300
Query: 329 -ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIK 360
+TSYDYDA + E G ++E K+ +D+ A +
Sbjct: 301 DVTSYDYDALLTEDGQITE-KYRRYRDVIAKYR 332
>gi|300789308|ref|YP_003769599.1| beta-galactosidase [Amycolatopsis mediterranei U32]
gi|384152800|ref|YP_005535616.1| beta-galactosidase [Amycolatopsis mediterranei S699]
gi|399541188|ref|YP_006553850.1| beta-galactosidase [Amycolatopsis mediterranei S699]
gi|299798822|gb|ADJ49197.1| beta-galactosidase [Amycolatopsis mediterranei U32]
gi|340530954|gb|AEK46159.1| beta-galactosidase [Amycolatopsis mediterranei S699]
gi|398321958|gb|AFO80905.1| beta-galactosidase [Amycolatopsis mediterranei S699]
Length = 584
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 106/324 (32%), Positives = 166/324 (51%), Gaps = 31/324 (9%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
++DG ++S +HY R P++W D I K++ G + IETYV WNAH G ++ G
Sbjct: 11 FLLDGRPFRILSGALHYFRVHPDLWADRIDKARRMGLNTIETYVAWNAHAPEPGTFDLSG 70
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F++LV +G+Y +R GPY+CAEW+ GG P WL P + R + + ++
Sbjct: 71 GLDLDRFLRLVADAGMYAIVRPGPYICAEWDNGGLPAWLFRDPSVGVRRYEPKYLDAVRE 130
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
++ K+ +++ + +GGP++++Q+ENEYG ++G K Y+K A G V
Sbjct: 131 YLTKVYEVVVPHQID--RGGPVLLVQVENEYG----AFGDD-KRYLKALAEHTREAGVTV 183
Query: 234 PWVMCKQTDAPENI----IDACNGYYCDG---------YKPNSYNKPTLWTENWDGWYTT 280
P Q PE + +D + G + + P + +E W+GW+
Sbjct: 184 PLTTVDQP-TPEMLEAGSLDGLHRTASFGSGAEARLAILRAHQPTGPLMCSEFWNGWFDH 242
Query: 281 WGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG-------PFYITSYD 333
WG D A + G S +N YM+ GGTNFG T+G P ITSYD
Sbjct: 243 WGAHHHTTSAADSAAELDALLAAGAS-VNLYMFHGGTNFGLTNGANDKGVYQPL-ITSYD 300
Query: 334 YDAPIDEYGLLSEPKWGHLKDLHA 357
YDAP+DE G + PK+ +D+ A
Sbjct: 301 YDAPLDEAGDPT-PKYHAFRDVIA 323
>gi|423215069|ref|ZP_17201597.1| hypothetical protein HMPREF1074_03129 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692332|gb|EIY85570.1| hypothetical protein HMPREF1074_03129 [Bacteroides xylanisolvens
CL03T12C04]
Length = 778
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 118/353 (33%), Positives = 170/353 (48%), Gaps = 29/353 (8%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
++ ++++ + SS+ A T F + ++DG ++ +A +HY R W
Sbjct: 5 IIALLVLFTVILFSSAQAQTTAHKFEAGKN--TFLLDGKPFVVKAAELHYTRIPQAYWSH 62
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
I K G + I Y+FWN HE G+++F G+NDI F KL G+Y+ +R GPYVC
Sbjct: 63 RIEMCKALGMNTICIYIFWNIHEQEEGKFDFAGQNDIAAFCKLAQQHGMYVIVRPGPYVC 122
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEW GG P WL + RT + + E + F+K++ + L +GG IIM+Q+
Sbjct: 123 AEWEMGGLPWWLLKKKDVALRTLDPYYMERVGIFMKEVGKQLAP--LQVDKGGNIIMVQV 180
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLG-AGVPWVMCK-----QTDAPENIIDACN-- 252
ENEYG SYG K YV + G VP C +A +++I N
Sbjct: 181 ENEYG----SYGTD-KPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNFG 235
Query: 253 -GYYCD----GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
G D K P + +E W GW+ WG + RP +D+ + R SF
Sbjct: 236 TGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF 295
Query: 308 MNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+ YM GGT FG G P Y +SYDYDAPI E G +E K+ L+DL
Sbjct: 296 -SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KFFLLRDL 346
Score = 46.2 bits (108), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 66/151 (43%), Gaps = 38/151 (25%)
Query: 655 IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYR 714
+PS +YK+ F D + LD+ + GKG WVNGH +GR+W +
Sbjct: 526 LPSMPAYYKSTFKL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------- 570
Query: 715 GAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVK-LRS--TRIV 771
P QT + +P WL+ N +++ + G P + S+K L+ ++
Sbjct: 571 -------------GPQQTLF-MPGCWLKEGENEILVLDLKG--PAKASIKGLKKPILDVL 614
Query: 772 CEQVSESHYPPVRKWSNSYSVDGKLSINKMA 802
E+ E+H RK + G+ I++ A
Sbjct: 615 REKAPETH----RKDGEKLKLTGEKVIHEGA 641
>gi|336404675|ref|ZP_08585368.1| hypothetical protein HMPREF0127_02681 [Bacteroides sp. 1_1_30]
gi|335941579|gb|EGN03432.1| hypothetical protein HMPREF0127_02681 [Bacteroides sp. 1_1_30]
Length = 778
Score = 166 bits (420), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 118/353 (33%), Positives = 170/353 (48%), Gaps = 29/353 (8%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
++ ++++ + SS+ A T F + ++DG ++ +A +HY R W
Sbjct: 5 IIALLVLFTVILFSSAQAQTTAHKFEAGKN--TFLLDGKPFVVKAAELHYTRIPQAYWSH 62
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
I K G + I Y+FWN HE G+++F G+NDI F KL G+Y+ +R GPYVC
Sbjct: 63 RIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCKLAQQHGMYVIVRPGPYVC 122
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEW GG P WL + RT + + E + F+K++ + L +GG IIM+Q+
Sbjct: 123 AEWEMGGLPWWLLKKKDVALRTLDPYYMERVGIFMKEVGKQLAP--LQVDKGGNIIMVQV 180
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLG-AGVPWVMCK-----QTDAPENIIDACN-- 252
ENEYG SYG K YV + G VP C +A +++I N
Sbjct: 181 ENEYG----SYGTD-KPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNFG 235
Query: 253 -GYYCD----GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
G D K P + +E W GW+ WG + RP +D+ + R SF
Sbjct: 236 TGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF 295
Query: 308 MNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+ YM GGT FG G P Y +SYDYDAPI E G +E K+ L+DL
Sbjct: 296 -SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KFFLLRDL 346
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 57/128 (44%), Gaps = 34/128 (26%)
Query: 655 IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYR 714
+P+ +YK+ F D + LD+ + GKG WVNGH +GR+W +
Sbjct: 526 LPAMPAYYKSTFKL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------- 570
Query: 715 GAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVK-LRS--TRIV 771
P QT + +P WL+ N +++ + G P + S+K L+ ++
Sbjct: 571 -------------GPQQTLF-MPGCWLKEGENEILVLDLKG--PAKASIKGLKKPILDVL 614
Query: 772 CEQVSESH 779
E+ E+H
Sbjct: 615 REKAPETH 622
>gi|18410234|ref|NP_565051.1| beta-galactosidase 17 [Arabidopsis thaliana]
gi|75163694|sp|Q93Z24.1|BGL17_ARATH RecName: Full=Beta-galactosidase 17; Short=Lactase 17; Flags:
Precursor
gi|16648842|gb|AAL25611.1| At1g72990/F3N23_19 [Arabidopsis thaliana]
gi|22655360|gb|AAM98272.1| At1g72990/F3N23_19 [Arabidopsis thaliana]
gi|332197279|gb|AEE35400.1| beta-galactosidase 17 [Arabidopsis thaliana]
Length = 697
Score = 166 bits (419), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 115/326 (35%), Positives = 162/326 (49%), Gaps = 37/326 (11%)
Query: 57 DGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKND 116
DGNR +I +HY R PE W D + ++ G + I+ YV WN HE G+ F+G D
Sbjct: 73 DGNRFQIIGGDLHYFRVLPEYWEDRLLRANALGLNTIQVYVPWNLHEPKPGKMVFEGIGD 132
Query: 117 IVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDI-PGIEFRTNNAPFKEEMQRFV 175
+V F+KL + LR GPY+C EW+ GGFP WL + P ++ RT++ + + ++R+
Sbjct: 133 LVSFLKLCEKLDFLVMLRAGPYICGEWDLGGFPAWLLAVKPRLQLRTSDPVYLKLVERWW 192
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALG-LG---- 230
V L + L GGP+IM+QIENEYG SYG K Y++ SMA G LG
Sbjct: 193 D--VLLPKVFPLLYSNGGPVIMVQIENEYG----SYGND-KAYLRKLVSMARGHLGDDII 245
Query: 231 -----AGVPWVMCKQTDAPENIIDACNGYYCDGYKP-----NSYN----KPTLWTENWDG 276
G + K T ++ A + D P +N P L +E + G
Sbjct: 246 VYTTDGGTKETLDKGTVPVADVYSAVDFSTGDDPWPIFKLQKKFNAPGRSPPLSSEFYTG 305
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF--------- 327
W T WG ++ E A ++ + R GS + YM GGTNFG +G
Sbjct: 306 WLTHWGEKITKTDAEFTAASLEKILSRNGSAV-LYMVHGGTNFGFYNGANTGSEESDYKP 364
Query: 328 YITSYDYDAPIDEYGLLSEPKWGHLK 353
+TSYDYDAPI E G + PK+ L+
Sbjct: 365 DLTSYDYDAPIKESGDIDNPKFQALQ 390
>gi|422701998|ref|ZP_16759838.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
gi|315169479|gb|EFU13496.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
Length = 604
Score = 166 bits (419), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 139 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 191
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 192 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQVFFEEHGKKWPLMCME 248
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 249 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 306
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 307 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKDSFAQ 356
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 54/223 (24%), Positives = 90/223 (40%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 417 RDRLQLFVNQIHQATQYQTEIGEDIYVTLPQE----NNQIDVLMENMGRVNYGHKLFAD- 471
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + ++QQ Y + E D +R+ P +
Sbjct: 472 ------TQKKGIRTGVMA--------DLHFMTQWQQ-YCLPMTSCEQVDYSREWQPDQPS 516
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 517 FYQYHMELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 555
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 556 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 590
>gi|422698394|ref|ZP_16756303.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1346]
gi|315173078|gb|EFU17095.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1346]
Length = 604
Score = 166 bits (419), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 116/348 (33%), Positives = 173/348 (49%), Gaps = 39/348 (11%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 139 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 191
Query: 234 PWVMC----KQTDAPENIID---ACNGYYCDGYKPN---------SYNK--PTLWTENWD 275
P+ + T ++I+ G + K N + K P + E WD
Sbjct: 192 PFFTSDGPWRATLRAGSMIEDDILVTGNFGSKAKENFDMMQAFFEEHGKKWPLMCMEFWD 251
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------- 328
GW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 252 GWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTIDLPQ 309
Query: 329 ITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 310 ITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKDSFAQ 356
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 88/223 (39%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 417 RDRLQLFVNQIHQATQYQTEIGEDIYVTLPQE----NNQIDVLMENMGRVNYGHKLFAD- 471
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + +Q Y + E D +R+ P +
Sbjct: 472 ------TQKKGIRTGVMADLHFMTQWQQ---------YCLPMTSCEQVDYSREWQPDQPS 516
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 517 FYQYHVELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 555
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 556 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIRL 590
>gi|365118603|ref|ZP_09337115.1| hypothetical protein HMPREF1033_00461 [Tannerella sp.
6_1_58FAA_CT1]
gi|363649320|gb|EHL88436.1| hypothetical protein HMPREF1033_00461 [Tannerella sp.
6_1_58FAA_CT1]
Length = 823
Score = 166 bits (419), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 110/321 (34%), Positives = 156/321 (48%), Gaps = 27/321 (8%)
Query: 53 AIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFK 112
+++G ++ +A +HYPR W I K G + I YVFWN HE G+++F
Sbjct: 74 TFLLNGKPFIIRAAELHYPRIPKPYWEQRIKLCKALGMNTICLYVFWNLHEPRPGEFDFT 133
Query: 113 GKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQ 172
G+ND+ F +L + +Y+ LR GPYVCAEW GG P WL I R + F E +
Sbjct: 134 GQNDLAAFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLREADPYFIERVN 193
Query: 173 RFVKKIVDLMREEMLFSWQ-GGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGA 231
F +++ R+ + Q GGPIIM+Q+ENEYG SYG+ K+YV +
Sbjct: 194 IFEQEVA---RQVGGLTIQNGGPIIMVQVENEYG----SYGES-KEYVSLIRDIVRTNFG 245
Query: 232 GVPWVMCK------QTDAPENI--IDACNGYYCD----GYKPNSYNKPTLWTENWDGWYT 279
V C + P+ + I+ G D G K + P + +E W GW+
Sbjct: 246 DVTLFQCDWASNFTKNALPDLLWTINFGTGANIDQQFAGLKKLRPDSPLMCSEFWSGWFD 305
Query: 280 TWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITSYDY 334
WG RP D+ + +G SF + YM GGTN+G +G P + +TSYDY
Sbjct: 306 KWGANHETRPASDMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDY 364
Query: 335 DAPIDEYGLLSEPKWGHLKDL 355
DAPI E G + W K L
Sbjct: 365 DAPISESGQTTPKYWALRKTL 385
>gi|261406481|ref|YP_003242722.1| beta-galactosidase [Paenibacillus sp. Y412MC10]
gi|261282944|gb|ACX64915.1| Beta-galactosidase [Paenibacillus sp. Y412MC10]
Length = 619
Score = 166 bits (419), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 103/322 (31%), Positives = 158/322 (49%), Gaps = 34/322 (10%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
+++++ ++DG +IS IHY R PE W D + K K G + +ETY+ WN HE
Sbjct: 4 LTWENGQYLLDGQPYRIISGAIHYFRVVPEYWEDRLLKLKACGFNTVETYIAWNVHEPQE 63
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G++NF G D+ F++L G GL++ +R P++CAEW FGG P WL I R ++
Sbjct: 64 GEFNFSGMADVASFIELAGKLGLHVIVRPSPFICAEWEFGGLPGWLLGYGEIRLRCSDPL 123
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
+ ++ + +++ + L S GGPI+ +Q+ENEYG SYG Y+++
Sbjct: 124 YLSKVDHYYDELIPQLVP--LLSTHGGPILAVQVENEYG----SYGND-HAYLEYLREGL 176
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCD----------------GYKPNSYNKPTLW 270
+ G V+ +D P + + G D Y+ +P +
Sbjct: 177 VRRGVD---VLLFTSDGPTDEM-LLGGTLSDVHATVNFGSRVEESFRKYREYRAEEPLMV 232
Query: 271 TENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY-- 328
E W+GW+ W R D+A + + G S MN YM+ GGTNFG SG
Sbjct: 233 MEFWNGWFDHWMEDHHVRDAADVAGVLDEMLEMGSS-MNMYMFHGGTNFGFYSGANHIQA 291
Query: 329 ----ITSYDYDAPIDEYGLLSE 346
TSYDYDAP+ E+G +E
Sbjct: 292 YEPTTTSYDYDAPLTEWGDKTE 313
>gi|296399420|gb|ADH10537.1| galactosidase, beta 1, 5 prime [Zonotrichia albicollis]
Length = 571
Score = 166 bits (419), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 111/323 (34%), Positives = 162/323 (50%), Gaps = 26/323 (8%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
+ F + Y+ + + DG IS IHY R W D + K K G D I+TYV WN H
Sbjct: 5 RSFGIDYESNSFVKDGKPFRYISGSIHYSRVPSYYWKDRLLKMKMAGLDAIQTYVPWNYH 64
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E G Y+F G D+ F++L +GL + LR GPY+CAEW+ GG P WL + I R+
Sbjct: 65 EPRMGTYDFFGGKDLEYFLQLANDTGLLVILRAGPYICAEWDMGGLPAWLLEKKSIVLRS 124
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWA 222
+++ + E ++R++ ++ MR + GGPIIM+Q+ENEYG SY DY+++
Sbjct: 125 SDSDYLEAVERWMGVLLPKMRPYLY--QNGGPIIMVQVENEYG----SYFACDYDYLRFL 178
Query: 223 ASMA---LGL--------GAGVPWVMCKQTDAPENIID-ACNGYYCDGYKPNSYNKPT-- 268
+ LG GA + C +D A G + ++P
Sbjct: 179 LKLFRLHLGHEVVLFTTDGASQFHLKCGALQGLYATVDFAPGGNVTAAFLAQRSSEPMGP 238
Query: 269 -LWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG-- 325
+ +E + GW WG R P E +A + RG + +N YM+ GGTNF +G
Sbjct: 239 LVNSEFYTGWLDHWGHRHSVVPAETVAKTLNEILARGAN-VNLYMFIGGTNFAYWNGANM 297
Query: 326 PFY--ITSYDYDAPIDEYGLLSE 346
P+ TSYDYDAP+ E G L+E
Sbjct: 298 PYMPQPTSYDYDAPLSEAGDLTE 320
>gi|295086466|emb|CBK67989.1| Beta-galactosidase [Bacteroides xylanisolvens XB1A]
Length = 778
Score = 166 bits (419), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 118/353 (33%), Positives = 170/353 (48%), Gaps = 29/353 (8%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
++ ++++ + SS+ A T F + ++DG ++ +A +HY R W
Sbjct: 5 IIALLVLFTVILFSSAQAQTTAHKFEAGKN--TFLLDGKPFVVKAAELHYTRIPQAYWSH 62
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
I K G + I Y+FWN HE G+++F G+NDI F KL G+Y+ +R GPYVC
Sbjct: 63 RIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCKLAQQHGMYVIVRPGPYVC 122
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEW GG P WL + RT + + E + F+K++ + L +GG IIM+Q+
Sbjct: 123 AEWEMGGLPWWLLKKKDVALRTLDPYYMERVGIFMKEVGKQLAP--LQVDKGGNIIMVQV 180
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLG-AGVPWVMCK-----QTDAPENIIDACN-- 252
ENEYG SYG K YV + G VP C +A +++I N
Sbjct: 181 ENEYG----SYGTD-KPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNFG 235
Query: 253 -GYYCD----GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
G D K P + +E W GW+ WG + RP +D+ + R SF
Sbjct: 236 TGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF 295
Query: 308 MNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+ YM GGT FG G P Y +SYDYDAPI E G +E K+ L+DL
Sbjct: 296 -SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KFFLLRDL 346
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 54/128 (42%), Gaps = 34/128 (26%)
Query: 655 IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYR 714
+P +YK+ F D + LD+ + GKG WVNGH +GR+W +
Sbjct: 526 LPFMPAYYKSTFKL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------- 570
Query: 715 GAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRI---V 771
P QT + +P WL+ N +++ + G P + S+K I +
Sbjct: 571 -------------GPQQTLF-MPGCWLKEGENEILVLDLKG--PAKASIKGLKKPILDML 614
Query: 772 CEQVSESH 779
E+ E+H
Sbjct: 615 REKAPETH 622
>gi|423346501|ref|ZP_17324189.1| hypothetical protein HMPREF1060_01861 [Parabacteroides merdae
CL03T12C32]
gi|409219652|gb|EKN12612.1| hypothetical protein HMPREF1060_01861 [Parabacteroides merdae
CL03T12C32]
Length = 780
Score = 165 bits (418), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 111/316 (35%), Positives = 157/316 (49%), Gaps = 19/316 (6%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
++DG ++ +A IHY R E W I K G + I Y FWN HE G+++FKG
Sbjct: 40 FLLDGKPFVIKAAEIHYTRIPAEYWQHRIQMCKALGMNTICIYAFWNIHEQKPGEFDFKG 99
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+NDI F +L G+Y+ LR GPYVC+EW GG P WL I+ RTN+ F E +
Sbjct: 100 QNDIAAFCRLAQKEGMYIMLRPGPYVCSEWEMGGLPWWLLKKEDIKLRTNDPYFLERTKL 159
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYG--NMESSYGQQGKDYVKWAASMALGLGA 231
F+ +I + + L +GG IIM+Q+ENEYG + +Y +D VK A + L
Sbjct: 160 FMNEIGKQLAD--LQVTRGGNIIMVQVENEYGAYATDKAYIANIRDAVKAAGFTDVPLFQ 217
Query: 232 GVPWVMCKQTDAPENIIDACN---GYYCDG----YKPNSYNKPTLWTENWDGWYTTWGGR 284
W Q + ++++ N G D K + P + +E W GW+ WG +
Sbjct: 218 -CDWSSTFQLNGLDDLVWTINFGTGANIDAQFKKLKEARPDAPLMCSEFWSGWFDHWGRK 276
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPID 339
R + + R SF + YM GGT FG G P Y +SYDYDAPI
Sbjct: 277 HETRDAGVMVSGIKDMLDRHISF-SLYMAHGGTTFGHWGGANSPAYSAMCSSYDYDAPIS 335
Query: 340 EYGLLSEPKWGHLKDL 355
E G + PK+ L++L
Sbjct: 336 EAGWAT-PKYYKLREL 350
Score = 43.9 bits (102), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 56/234 (23%), Positives = 97/234 (41%), Gaps = 44/234 (18%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLE 597
T+ ID + D +VF +G+L G + + + + L +L + +G N+ +
Sbjct: 424 TLLIDEVHDWAQVFADGKLLGRLDRRRGESTVVLPALAAGTRLDILVEAMGRVNFDVAIH 483
Query: 598 KDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPS 657
D G +V+L G +L +QV F Y+ +++ DG P+
Sbjct: 484 -DRKGITDKVELIS-DTGRQELED----WQVY---SFPVDYAFVQDKKYAAGDKLDG-PA 533
Query: 658 TFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAY 717
+Y+T F+ D + V LD+ + GKG WVNG +GR+W +
Sbjct: 534 ---YYRTTFEL-DEVGDVFLDMQTWGKGMVWVNGKAMGRFWEI----------------- 572
Query: 718 NSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIV 771
P QT + +P WL+ N ++I + G P + V+ R I+
Sbjct: 573 ----------GPQQTLF-MPGCWLKKGKNEIIILDLLG--PEKAVVEGRKEPIL 613
>gi|307289489|ref|ZP_07569436.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
gi|422703871|ref|ZP_16761687.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
gi|306499556|gb|EFM68926.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
gi|315164595|gb|EFU08612.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
Length = 593
Score = 165 bits (418), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 115/340 (33%), Positives = 163/340 (47%), Gaps = 42/340 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G +IS IHY R TP W D + K GA+ +ETY+ WN HE G Y+F+G
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+I FV+L L + LR Y+CAEW FGG P WL + R+ + F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKSVRLRSTDPIFMTKVRN 130
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ + V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + LG V
Sbjct: 131 YFQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEV 183
Query: 234 PWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTEN 273
P + A E ++DA G + K N+ + K P + E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSGGP 326
WDGW+ WG + R DLA V G +N YM+ GGTNFG R +
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGS--LNLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 327 FYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
+TSYDYDA + E G +E + + AIK P +
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEV 335
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 79/214 (36%), Gaps = 49/214 (22%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLIL--LSQTVGLQNYGAFLEK--DGA 601
D L ++++G L + V + Q+ + L L L + +G NYG L
Sbjct: 410 DRLHIYVDGDLAATQYQETVGEELLISGQTEKDTLALDILVENLGRVNYGFKLNNPTQSK 469
Query: 602 GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTW 661
G RG V DI + Y + E Q+ I D T P ++
Sbjct: 470 GIRGGVM------QDIHFHQGYQHYPLTFSQE--QLAKI--------DYTAGKNPLQPSF 513
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDK 721
Y+ F+ D +D GKG VNGHH+GRYW + G +S
Sbjct: 514 YQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------GPIHSLY 558
Query: 722 CTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
C P+ +LQ N +VIFE G
Sbjct: 559 C--------------PKEFLQQGQNEVVIFETEG 578
>gi|449493221|ref|XP_002196735.2| PREDICTED: beta-galactosidase [Taeniopygia guttata]
Length = 636
Score = 165 bits (418), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 108/347 (31%), Positives = 164/347 (47%), Gaps = 19/347 (5%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
+ F + YD + DG IS IHY R P W D + K K G D I+TYV WN H
Sbjct: 7 RSFGIDYDSNCFVKDGKPFRYISGSIHYSRVPPYYWKDRLLKMKMAGLDAIQTYVPWNYH 66
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E G Y+F G D+ F++L +GL + LR GPY+CAEW+ GG P WL + I R+
Sbjct: 67 EPQMGTYDFFGGKDLQYFLQLANDTGLLVILRAGPYICAEWDMGGLPAWLLEKKSIVLRS 126
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYG-------NMESSYGQQG 215
+++ + E ++R++ ++ MR + GGPIIM+Q+ENEYG N +
Sbjct: 127 SDSDYLEAVERWMGVLLPKMRPYLY--QNGGPIIMVQVENEYGSYFACDYNYLRFLLKLF 184
Query: 216 KDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCD----GYKPNSYNKPTLWT 271
+ ++ + GA + C +D G + + P + +
Sbjct: 185 RLHLGDEVVLFTTDGASQFHLKCGALQGLYATVDFAPGANVTAAFLAQRSSEPKGPLVNS 244
Query: 272 ENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY- 328
E + GW WG P + +A + G + +N YM+ GGTNF +G P+
Sbjct: 245 EFYTGWLDHWGHHHSVVPAQTIAKTLNEILASGAN-VNLYMFIGGTNFAYWNGANMPYMP 303
Query: 329 -ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQY 374
TSYDYDAP+ E G L+E K+ L+ + K L + ++
Sbjct: 304 QPTSYDYDAPLSEAGDLTE-KYFALRKVIGMYKQLPEGLTPPTTPKF 349
>gi|296399387|gb|ADH10509.1| galactosidase, beta 1, 5 prime [Zonotrichia albicollis]
Length = 571
Score = 165 bits (418), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 109/323 (33%), Positives = 157/323 (48%), Gaps = 26/323 (8%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
+ F + Y+ + + DG IS IHY R W D + K K G D I+TYV WN H
Sbjct: 5 RSFGIDYESNSFVKDGKPFRYISGSIHYSRVPSYYWKDRLLKMKMAGLDAIQTYVPWNYH 64
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E G Y+F G D+ F++L +GL + LR GPY+CAEW+ GG P WL + I R+
Sbjct: 65 EPRMGTYDFFGGKDLEYFLQLANDTGLLVILRAGPYICAEWDMGGLPAWLLEKKSIVLRS 124
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWA 222
+++ + E ++R++ ++ MR + GGPIIM+Q+ENEYG SY DY+++
Sbjct: 125 SDSDYLEAVERWMGVLLPKMRPYLY--QNGGPIIMVQVENEYG----SYFACDYDYLRFL 178
Query: 223 ASMALG-LGAGVPWVMCKQTDAPENIIDACNGYYCD--------------GYKPNSYNKP 267
+ LG V A G Y + + P
Sbjct: 179 LKLFRLHLGDEVVLFTTDGASQFHLKCGALQGLYATVDFAPGGNVTAAFLAQRSSEPMGP 238
Query: 268 TLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG-- 325
+ +E + GW WG R P E +A + RG + +N YM+ GGTNF +G
Sbjct: 239 LVNSEFYTGWLDHWGHRHSVVPAETVAKTLNEILARGAN-VNLYMFIGGTNFAYWNGANM 297
Query: 326 PFY--ITSYDYDAPIDEYGLLSE 346
P+ TSYDYDAP+ E G L+E
Sbjct: 298 PYMPQPTSYDYDAPLSEAGDLTE 320
>gi|294779195|ref|ZP_06744602.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
gi|294453706|gb|EFG22101.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
Length = 592
Score = 165 bits (418), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 115/340 (33%), Positives = 163/340 (47%), Gaps = 42/340 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G +IS IHY R TP W D + K GA+ +ETY+ WN HE G Y+F+G
Sbjct: 10 FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+I FV+L L + LR Y+CAEW FGG P WL + R+ + F +++
Sbjct: 70 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKDVRLRSTDPIFMTKVRN 129
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ + V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + LG V
Sbjct: 130 YFQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEV 182
Query: 234 PWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTEN 273
P + A E ++DA G + K N+ + K P + E
Sbjct: 183 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 240
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSGGP 326
WDGW+ WG + R DLA V G +N YM+ GGTNFG R +
Sbjct: 241 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGS--LNLYMFHGGTNFGFYNGCSARGAKDL 298
Query: 327 FYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
+TSYDYDA + E G +E + + AIK P +
Sbjct: 299 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEV 334
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 79/214 (36%), Gaps = 49/214 (22%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLIL--LSQTVGLQNYGAFLEK--DGA 601
D L ++++G L + V + Q+ + L L L + +G NYG L
Sbjct: 409 DRLHIYVDGDLAATQYQETVGEELLISGQTEKDTLALDILVENLGRVNYGFKLNNPTQSK 468
Query: 602 GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTW 661
G RG V DI + Y + E Q+ I D T P ++
Sbjct: 469 GIRGGVM------QDIHFHQGYQHYPLTFSQE--QLAKI--------DYTAGKNPLQPSF 512
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDK 721
Y+ F+ D +D GKG VNGHH+GRYW + G +S
Sbjct: 513 YQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------GPIHSLY 557
Query: 722 CTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
C P+ +LQ N +VIFE G
Sbjct: 558 C--------------PKEFLQQGQNEVVIFETEG 577
>gi|256957323|ref|ZP_05561494.1| beta-galactosidase [Enterococcus faecalis DS5]
gi|257077681|ref|ZP_05572042.1| beta-galactosidase [Enterococcus faecalis JH1]
gi|307270129|ref|ZP_07551446.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
gi|422710565|ref|ZP_16767610.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
gi|422721468|ref|ZP_16778057.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|422867159|ref|ZP_16913760.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
gi|256947819|gb|EEU64451.1| beta-galactosidase [Enterococcus faecalis DS5]
gi|256985711|gb|EEU73013.1| beta-galactosidase [Enterococcus faecalis JH1]
gi|306513498|gb|EFM82113.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
gi|315031294|gb|EFT43226.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|315035298|gb|EFT47230.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
gi|329577710|gb|EGG59137.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
Length = 593
Score = 165 bits (418), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 115/340 (33%), Positives = 163/340 (47%), Gaps = 42/340 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G +IS IHY R TP W D + K GA+ +ETY+ WN HE G Y+F+G
Sbjct: 11 FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+I FV+L L + LR Y+CAEW FGG P WL + R+ + F +++
Sbjct: 71 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKDVRLRSTDPIFMTKVRN 130
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ + V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + LG V
Sbjct: 131 YFQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEV 183
Query: 234 PWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTEN 273
P + A E ++DA G + K N+ + K P + E
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSGGP 326
WDGW+ WG + R DLA V G +N YM+ GGTNFG R +
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGS--LNLYMFHGGTNFGFYNGCSARGAKDL 299
Query: 327 FYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
+TSYDYDA + E G +E + + AIK P +
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEV 335
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 79/214 (36%), Gaps = 49/214 (22%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLIL--LSQTVGLQNYGAFLEK--DGA 601
D L ++++G L + V + Q+ + L L L + +G NYG L
Sbjct: 410 DRLHIYVDGDLAATQYQETVGEELLISGQTEKDTLALDILVENLGRVNYGFKLNNPTQSK 469
Query: 602 GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTW 661
G RG V DI + Y + E Q+ I D T P ++
Sbjct: 470 GIRGGVM------QDIHFHQGYQHYPLTFSQE--QLAKI--------DYTAGKNPLQPSF 513
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDK 721
Y+ F+ D +D GKG VNGHH+GRYW + G +S
Sbjct: 514 YQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------GPIHSLY 558
Query: 722 CTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
C P+ +LQ N +VIFE G
Sbjct: 559 C--------------PKEFLQQGQNEVVIFETEG 578
>gi|154490061|ref|ZP_02030322.1| hypothetical protein PARMER_00290 [Parabacteroides merdae ATCC
43184]
gi|423723056|ref|ZP_17697209.1| hypothetical protein HMPREF1078_01269 [Parabacteroides merdae
CL09T00C40]
gi|154089210|gb|EDN88254.1| glycosyl hydrolase family 35 [Parabacteroides merdae ATCC 43184]
gi|409241481|gb|EKN34249.1| hypothetical protein HMPREF1078_01269 [Parabacteroides merdae
CL09T00C40]
Length = 780
Score = 165 bits (418), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 111/316 (35%), Positives = 157/316 (49%), Gaps = 19/316 (6%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
++DG ++ +A IHY R E W I K G + I Y FWN HE G+++FKG
Sbjct: 40 FLLDGKPFVIKAAEIHYTRIPAEYWQHRIQMCKALGMNTICIYAFWNIHEQKPGEFDFKG 99
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+NDI F +L G+Y+ LR GPYVC+EW GG P WL I+ RTN+ F E +
Sbjct: 100 QNDIAAFCRLAQKEGMYIMLRPGPYVCSEWEMGGLPWWLLKKEDIKLRTNDPYFLERTKL 159
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYG--NMESSYGQQGKDYVKWAASMALGLGA 231
F+ +I + + L +GG IIM+Q+ENEYG + +Y +D VK A + L
Sbjct: 160 FMNEIGKQLAD--LQVTRGGNIIMVQVENEYGAYATDKAYIANIRDAVKAAGFTDVPLFQ 217
Query: 232 GVPWVMCKQTDAPENIIDACN---GYYCDG----YKPNSYNKPTLWTENWDGWYTTWGGR 284
W Q + ++++ N G D K + P + +E W GW+ WG +
Sbjct: 218 -CDWSSTFQLNGLDDLVWTINFGTGANIDAQFKKLKEARPDAPLMCSEFWSGWFDHWGRK 276
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPID 339
R + + R SF + YM GGT FG G P Y +SYDYDAPI
Sbjct: 277 HETRDAGVMVSGIKDMLDRHISF-SLYMAHGGTTFGHWGGANSPAYSAMCSSYDYDAPIS 335
Query: 340 EYGLLSEPKWGHLKDL 355
E G + PK+ L++L
Sbjct: 336 EAG-WATPKYYKLREL 350
Score = 43.1 bits (100), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 56/234 (23%), Positives = 97/234 (41%), Gaps = 44/234 (18%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLE 597
T+ ID + D +VF +G+L G + + + + L +L + +G N+ +
Sbjct: 424 TLLIDEVHDWAQVFADGKLLGRLDRRRGENTVVLPALAAGTRLDILVEAMGRVNFDVAIH 483
Query: 598 KDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPS 657
D G +V+L G +L +QV F Y+ +++ DG P+
Sbjct: 484 -DRKGITDKVELIS-DTGRQELED----WQVY---SFPVDYAFVQDKKYAAGDKLDG-PA 533
Query: 658 TFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAY 717
+Y+T F+ D + V LD+ + GKG WVNG +GR+W +
Sbjct: 534 ---YYRTTFEL-DEVGDVFLDMQTWGKGMVWVNGKAMGRFWEI----------------- 572
Query: 718 NSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIV 771
P QT + +P WL+ N ++I + G P + V+ R I+
Sbjct: 573 ----------GPQQTLF-MPGCWLKKGKNEIIILDLLG--PEKAVVEGRKEPIL 613
>gi|295113973|emb|CBL32610.1| Beta-galactosidase [Enterococcus sp. 7L76]
Length = 592
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 115/340 (33%), Positives = 163/340 (47%), Gaps = 42/340 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G +IS IHY R TP W D + K GA+ +ETY+ WN HE G Y+F+G
Sbjct: 10 FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+I FV+L L + LR Y+CAEW FGG P WL + R+ + F +++
Sbjct: 70 MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKDVRLRSTDPIFMTKVRN 129
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ + V L + L QGGP+IM+Q+ENEYG SYG + K Y++ + LG V
Sbjct: 130 YFQ--VLLPKLAPLQITQGGPVIMIQVENEYG----SYGME-KAYLRQTKQIMEELGIEV 182
Query: 234 PWVMCKQTDAPENIIDA---------CNGYYCDGYKPNS---------YNK--PTLWTEN 273
P + A E ++DA G + K N+ + K P + E
Sbjct: 183 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 240
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSGGP 326
WDGW+ WG + R DLA V G +N YM+ GGTNFG R +
Sbjct: 241 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGS--LNLYMFHGGTNFGFYNGCSARGAKDL 298
Query: 327 FYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPAL 366
+TSYDYDA + E G +E + + AIK P +
Sbjct: 299 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEV 334
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 79/214 (36%), Gaps = 49/214 (22%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLIL--LSQTVGLQNYGAFLEK--DGA 601
D L ++++G L + V + Q+ + L L L + +G NYG L
Sbjct: 409 DRLHIYVDGDLAATQYQETVGEELLISGQTEKDTLALDILVENLGRVNYGFKLNNPTQSK 468
Query: 602 GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTW 661
G RG V DI + Y + E Q+ I D T P ++
Sbjct: 469 GIRGGVM------QDIHFHQGYQHYPLTFSQE--QLAKI--------DYTAGKNPLQPSF 512
Query: 662 YKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDK 721
Y+ F+ D +D GKG VNGHH+GRYW + G +S
Sbjct: 513 YQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------GPIHSLY 557
Query: 722 CTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
C P+ +LQ N +VIFE G
Sbjct: 558 C--------------PKEFLQQGQNEVVIFETEG 577
>gi|256423546|ref|YP_003124199.1| beta-galactosidase [Chitinophaga pinensis DSM 2588]
gi|256038454|gb|ACU61998.1| Beta-galactosidase [Chitinophaga pinensis DSM 2588]
Length = 610
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 108/313 (34%), Positives = 151/313 (48%), Gaps = 27/313 (8%)
Query: 53 AIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFK 112
A ++DG +IS IHYPR E W D + +K G + I TYVFWN HE +GQY+F
Sbjct: 32 AFLLDGKPLQMISGEIHYPRVPRECWRDRMKMAKAMGLNTIGTYVFWNVHEPEKGQYDFS 91
Query: 113 GKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQ 172
G NDI FVK+ L++ LR PYVCAEW FGG+P WL++I G++ R+ + E +
Sbjct: 92 GNNDIAAFVKMAKEEDLWVVLRPSPYVCAEWEFGGYPYWLQEIKGLKVRSKEPQYLEAYR 151
Query: 173 RFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAG 232
++ + + L GG I+M+QIENEYG+ KDY+ M + G
Sbjct: 152 NYIMAVGKQLSP--LLVTHGGNILMVQIENEYGSYSDD-----KDYLDINRKMFVEAGFD 204
Query: 233 VPWVMCKQTDAPEN-----IIDACNGYYCDGYKPNSYNK------PTLWTENWDGWYTTW 281
C A +N ++ A NG N+ P E + W+ W
Sbjct: 205 GLLYTCDPKAAIKNGHLPGLLPAINGVDDPLQVKQLINENHSGKGPYYIAEWYPAWFDWW 264
Query: 282 GGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG------PF--YITSYD 333
G + P + G S +N YM+ GGT G +G P+ I+SYD
Sbjct: 265 GTKHHTVPYRQYLGKLDSVLAAGIS-INMYMFHGGTTRGFMNGANANDADPYEPQISSYD 323
Query: 334 YDAPIDEYGLLSE 346
YDAP+DE G +E
Sbjct: 324 YDAPLDEAGNATE 336
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 83/219 (37%), Gaps = 56/219 (25%)
Query: 539 VTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEK 598
+ + +RD V +NG+ G + + ++ +G L LL + +G N+G +L
Sbjct: 418 LQLKELRDYCVVMVNGKRAGVLDRRSKRDSIALDLPAGKVKLDLLVENLGRINFGPYLLS 477
Query: 599 DGAGFRGQV-----KLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRD 653
+ G +V +L G++ + K+ G+K R
Sbjct: 478 NRKGITEKVLFDRQELKGWQQYGLPFDKLPAVAAKGIKAGAN------------VPTYRQ 525
Query: 654 GIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDY 713
G TFT KT LD+ + GKG W+NGHH+GRYW V
Sbjct: 526 G---TFTLDKT--------GDTWLDMSNWGKGAVWINGHHLGRYWQV------------- 561
Query: 714 RGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
P QT Y VP WL+ N +VI E
Sbjct: 562 --------------GPQQTIY-VPAEWLKKGMNDIVIME 585
>gi|379722393|ref|YP_005314524.1| beta-galactosidase [Paenibacillus mucilaginosus 3016]
gi|378571065|gb|AFC31375.1| beta-galactosidase [Paenibacillus mucilaginosus 3016]
Length = 591
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 110/329 (33%), Positives = 163/329 (49%), Gaps = 28/329 (8%)
Query: 57 DGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKND 116
DG L S IHY R PE W D + K K G + +ETYV WN HE G++ F+G D
Sbjct: 14 DGEEIRLYSGAIHYFRIVPEYWEDRLRKLKACGFNTVETYVPWNLHEPQEGRFVFEGMAD 73
Query: 117 IVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVK 176
+ +F++L G GL++ +R PY+CAEW FGG P WL PG++ R + + ++ +
Sbjct: 74 LERFIRLAGRLGLHVIVRPSPYICAEWEFGGLPAWLLAEPGMKLRCADPLYLSKVDAYYD 133
Query: 177 KIVDLMREEMLFSWQGGPIIMLQIENEYGNMES--SYGQQGKD-YVKWAASMALGLGAGV 233
+++ R L GGP+I++Q+ENEYG+ S +Y + +D V+ + L G
Sbjct: 134 ELIP--RLVPLLCTSGGPVILVQVENEYGSYGSDKAYLEHLRDGLVRRGIDVPLFTSDGP 191
Query: 234 PWVMCKQTDAPENIIDACNG-------YYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
M + P + G Y+P P + E W+GW+ W
Sbjct: 192 TDSMLQGGSLPGVLATVNFGSRTAESFAKLREYQPQG---PLMCMEYWNGWFDHWMEEHH 248
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAPIDE 340
R D A + G S +N+YM+ GGTNFG +G ITSYDYD+P+ E
Sbjct: 249 QRDAADAARVFGEMLEAGAS-VNFYMFHGGTNFGFHNGANHIKTYEPTITSYDYDSPLTE 307
Query: 341 YGLLSEP--KWGHLKDLHAA-IKLCEPAL 366
+G EP K+ ++D+ A + L P L
Sbjct: 308 WG---EPTAKYYAVRDVLAEHLPLGAPEL 333
Score = 45.8 bits (107), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 64/257 (24%), Positives = 99/257 (38%), Gaps = 60/257 (23%)
Query: 506 VTKDYSDYLWHITQIY---VSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIG 562
VT+ + + + Q Y + IS +T +V + + +RD +VF++G G V+
Sbjct: 364 VTRPCPETMERLGQAYGFVLYRTRISGPRTGQV---LHVQEVRDRAQVFLDGTPAG-VVE 419
Query: 563 HWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTG-FKNGDIDLSK 621
W PV G L +L + +G NYG L D G V+L F+ G
Sbjct: 420 RWDPQGLPVTVPEGGAALDILVENMGRINYGPLL-SDAKGITCGVRLDNQFQYG------ 472
Query: 622 ILWTYQVGLKGEFQQIYSIEENEAE---WTDLTRDGIPSTFTWYKTYFDAPDGIDPVALD 678
WT +Y + + E + L P +Y+ F+ + D +
Sbjct: 473 --WT-----------MYGLPLDSLEGVAYEPLAEGEAPGGPAFYRAAFEVDEPAD-TFVR 518
Query: 679 LGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPR 738
L KG ++NG H+GRYW RG P +T Y +P
Sbjct: 519 LDGWTKGVVFINGFHLGRYWE--------------RG-------------PQKTLY-LPG 550
Query: 739 SWLQASNNLLVIFEETG 755
L+ N LV+FE G
Sbjct: 551 PLLRRGTNELVVFELHG 567
>gi|350588684|ref|XP_003130139.3| PREDICTED: galactosidase, beta 1-like 3 [Sus scrofa]
Length = 656
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 110/329 (33%), Positives = 165/329 (50%), Gaps = 28/329 (8%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G+ +++ IHY R E W D + K K G + + TYV WN HE RG+++F G
Sbjct: 84 LEGHEFLILGGSIHYFRVPRESWRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSGNL 143
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ F+ L GL++ LR GPY+C+E + GG P L P + RT N F E + ++
Sbjct: 144 DMEAFILLAAEVGLWVILRPGPYICSEIDLGGLPSRLLQDPTSQLRTTNHSFIEAVDEYL 203
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
++ R L +GGPII +Q+ENEYG+ + + Y+ + L G
Sbjct: 204 DHLI--ARVVPLQYRKGGPIIAVQVENEYGSF-----HKDEAYMPYLHKALLKRGIVELL 256
Query: 236 VMCKQTD-----------APENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGR 284
+ T+ A N+ G + D Y+ S NKP L E W GW+ TWG +
Sbjct: 257 LTSDNTNEVLKGHIKGVLATVNMKSFKEGEFKDLYQVQS-NKPILIMEFWVGWFDTWGNK 315
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAPI 338
R D+ + F + SF N YM+ GGTNFG +G ++ +TSYDYDA +
Sbjct: 316 HAVRDAIDVENTIFDFIRLEISF-NVYMFHGGTNFGFMNGATYFEQHRGVVTSYDYDAVL 374
Query: 339 DEYGLLSEPKWGHLKDLHAAIKLCE-PAL 366
E G + PK+ L++L +I + PAL
Sbjct: 375 TEAGDYT-PKFFKLRELFKSIFVTPLPAL 402
>gi|317504905|ref|ZP_07962857.1| family 35 glycosyl hydrolase [Prevotella salivae DSM 15606]
gi|315663982|gb|EFV03697.1| family 35 glycosyl hydrolase [Prevotella salivae DSM 15606]
Length = 784
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 111/350 (31%), Positives = 168/350 (48%), Gaps = 31/350 (8%)
Query: 24 MMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIA 83
+ +I S S+ + + + + +++G ++ +A +HYPR W I
Sbjct: 7 LKTIITTLLFSLSTLTALARGGDFTAGKNTFLLNGQPFVVKAAELHYPRIPRPYWDQRIK 66
Query: 84 KSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEW 143
K G + I YVFWN HE +Y+F G ND+ F +L +G+Y+ +R GPYVCAEW
Sbjct: 67 MCKALGMNTICLYVFWNIHEQQESKYDFTGNNDVAAFCRLAQKNGMYVIVRPGPYVCAEW 126
Query: 144 NFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENE 203
GG P WL I R ++ F ++ F ++ + L GGPIIM+Q+ENE
Sbjct: 127 EMGGLPWWLLKKKDIRLREDDPYFLARVKAFEAEVGRQLAP--LTIQNGGPIIMVQVENE 184
Query: 204 YGN--MESSYGQQGKDYVK-------------WAASMALGLGAGVPWVMCKQTDAPENII 248
YG+ + Y Q +D VK WA++ + W M T + I
Sbjct: 185 YGSYGVNKQYVSQIRDIVKASGFDKVTLFQCDWASNFEKNGLDDLLWTMNFGTGSN---I 241
Query: 249 DACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFM 308
DA +P + P + +E W GW+ WG R RP + + + + SF
Sbjct: 242 DA-QFKRLKQLRPET---PLMCSEFWSGWFDKWGARHETRPAKAMVEGINEMLSKNISF- 296
Query: 309 NYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLK 353
+ YM GGT+FG +G P + +TSYDYDAPI+EYG + PK+ L+
Sbjct: 297 SLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYG-HATPKFWELR 345
Score = 42.4 bits (98), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 52/224 (23%), Positives = 93/224 (41%), Gaps = 42/224 (18%)
Query: 539 VTIDSMRDVLRVFINGQLTGSV-IGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLE 597
++++ D ++FI+ +L G++ K ++ + G L +L + +G N+G +
Sbjct: 422 LSLNEAHDYAQIFIDNKLIGTIDRTKNEKSIKLPPVKQGAT-LTILIEAMGRINFGRAV- 479
Query: 598 KDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLT-RDGIP 656
KD G V + NG D+S L + + Y ++ + D T R P
Sbjct: 480 KDFKGITESVTIDTEMNGH-DVSYHLKNWVIA---PIPDSYQTAQHAFDKLDETNRCFSP 535
Query: 657 STFT-----WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTC 711
F+ +Y+ YF+ + L+L GKGQ +VNGH +GR+W +
Sbjct: 536 INFSSPSIGYYRGYFNLKK-VGDTFLNLEQWGKGQVYVNGHALGRFWRI----------- 583
Query: 712 DYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P QT Y +P WL+ N +++ + G
Sbjct: 584 ----------------GPQQTLY-LPGCWLKKGRNEIIVMDIVG 610
>gi|423252157|ref|ZP_17233159.1| hypothetical protein HMPREF1066_04169 [Bacteroides fragilis
CL03T00C08]
gi|423252477|ref|ZP_17233408.1| hypothetical protein HMPREF1067_00052 [Bacteroides fragilis
CL03T12C07]
gi|392647903|gb|EIY41596.1| hypothetical protein HMPREF1066_04169 [Bacteroides fragilis
CL03T00C08]
gi|392660553|gb|EIY54162.1| hypothetical protein HMPREF1067_00052 [Bacteroides fragilis
CL03T12C07]
Length = 628
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 117/359 (32%), Positives = 173/359 (48%), Gaps = 57/359 (15%)
Query: 32 CVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGAD 91
CV S S STF + H +G ++S +HY R + W + K G +
Sbjct: 18 CVFSQSKSTF----EIKNGH--FYRNGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLN 71
Query: 92 VIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVW 151
+ TYVFWN HE G+++F G ++ +F+K+ G G+ + LR GPYVCAEW FGG+P W
Sbjct: 72 TVATYVFWNLHEPEPGKWDFTGDKNLAEFIKIAGEEGMMVILRPGPYVCAEWEFGGYPWW 131
Query: 152 LRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEM--LFSWQGGPIIMLQIENEYGNMES 209
L+++ G+E R +N E ++ K +D + +E+ L +GGPI+M+Q ENE+G
Sbjct: 132 LQNVKGMEIRRDNP----EFLKYTKAYIDRLYKEVGSLQCTKGGPIVMVQCENEFG---- 183
Query: 210 SYGQQGKD-----YVKWAASMALGL---GAGVPWVMCK-----QTDAPENIIDACNG--- 253
SY Q KD + + A + L G VP + A + NG
Sbjct: 184 SYVAQRKDIPLEEHRAYNAKIKQQLADAGFNVPLFTSDGSWLFEGGATPGALPTANGESD 243
Query: 254 ---------YYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRG 304
Y DG P + E + GW + W P +A ++ Q
Sbjct: 244 IENLKKVVDQYHDG------KGPYMVAEFYPGWLSHWAEPFPQIGASGIARQTEKYLQND 297
Query: 305 GSFMNYYMYFGGTNFGRTSGGPF--------YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
SF N+YM GGTNFG TSG + +TSYDYDAPI E G ++ PK+ ++++
Sbjct: 298 VSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDMTSYDYDAPISEAGWVT-PKYDSIRNV 354
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 78/333 (23%), Positives = 133/333 (39%), Gaps = 72/333 (21%)
Query: 449 KTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTK 508
K V++++P +P + P + +L+ + ++ V S+ T E LN
Sbjct: 357 KYVKYTIPEAPAPN-PVIEIPSIQLNKVADVLAFAEKQKPVSSDTPLT----FEQLNQGY 411
Query: 509 DYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVV 568
Y Y H Q + T+ I +RD V+++G+ G V+ K
Sbjct: 412 GYVLYTRHFNQ--------------PISGTLEIPGLRDYAVVYVDGEQVG-VLNRNTKTY 456
Query: 569 QPVEFQSGYN-DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFK-NGDIDLSKILWTY 626
+E + +N L +L + +G NYG+ + + G V++ G + G D+ Y
Sbjct: 457 S-MEIEVPFNATLQILVENMGRINYGSEIVHNTKGIISPVQIAGKEIVGGWDM------Y 509
Query: 627 QVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYK---TYFDAPDGIDPVA---LDLG 680
Q+ + E + ++ + T +PS K ++ +D V +D+
Sbjct: 510 QLPMD-EMPDLTKLKAD-------THKNVPSEVAKLKGCPVLYEGTFTLDKVGDTFMDME 561
Query: 681 SMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSW 740
S GKG +VNG +IGRYW V P QT Y VP W
Sbjct: 562 SWGKGIVFVNGVNIGRYWKV---------------------------GPQQTLY-VPGVW 593
Query: 741 LQASNNLLVIFEETGGNPFEISVKLRSTRIVCE 773
L+ N +VIFE+ P + VK T ++ +
Sbjct: 594 LKKGENKIVIFEQLNETP-QTEVKTVKTPVLMK 625
>gi|424759896|ref|ZP_18187551.1| putative beta-galactosidase [Enterococcus faecalis R508]
gi|402403967|gb|EJV36601.1| putative beta-galactosidase [Enterococcus faecalis R508]
Length = 604
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 114/342 (33%), Positives = 169/342 (49%), Gaps = 45/342 (13%)
Query: 63 LISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVK 122
++S IHY R P W + K G + +ETYV WN HE +G ++F+G D+ +F+K
Sbjct: 29 ILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLK 88
Query: 123 LVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLM 182
L GLY +R PY+CAEW FGGFP WL + PG R+NN + + + + +++ +
Sbjct: 89 LAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEYYDVLMEKI 147
Query: 183 REEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTD 242
L + GG I+M+QIENEYG S+G++ K Y++ + + G P+ +D
Sbjct: 148 VPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTAPFFT---SD 197
Query: 243 AP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTENWDGWYTTW 281
P ++I+ N G ++ + P + E WDGW+ W
Sbjct: 198 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 257
Query: 282 GGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-------YITSYDY 334
+ R ++LA +V G +N YM+ GGTNFG +G ITSYDY
Sbjct: 258 KEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDY 315
Query: 335 DAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
DAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 316 DAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKDSFAQ 356
Score = 39.3 bits (90), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 90/223 (40%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V+ E N + +L + +G NYG L D
Sbjct: 417 RDRLQLFVNQIHQATQYQTEIGEDIYVILSQE----NNQIDVLMENMGRVNYGHKLFAD- 471
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + ++QQ Y + E D +R+ P +
Sbjct: 472 ------TQKKGIRTGVMA--------DLHFMTQWQQ-YCLPMTSCEQVDYSREWQPDQPS 516
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 517 FYQYHVELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 555
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 556 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 590
>gi|337749468|ref|YP_004643630.1| beta-galactosidase [Paenibacillus mucilaginosus KNP414]
gi|336300657|gb|AEI43760.1| Beta-galactosidase [Paenibacillus mucilaginosus KNP414]
Length = 591
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 110/329 (33%), Positives = 163/329 (49%), Gaps = 28/329 (8%)
Query: 57 DGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKND 116
DG L S IHY R PE W D + K K G + +ETYV WN HE G++ F+G D
Sbjct: 14 DGEEIRLYSGAIHYFRIVPEYWEDRLRKLKACGFNTVETYVPWNLHEPQEGRFVFEGMAD 73
Query: 117 IVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVK 176
+ +F++L G GL++ +R PY+CAEW FGG P WL PG++ R + + ++ +
Sbjct: 74 LERFIRLAGRLGLHVIVRPSPYICAEWEFGGLPAWLLAEPGMKLRCADPLYLSKVDAYYD 133
Query: 177 KIVDLMREEMLFSWQGGPIIMLQIENEYGNMES--SYGQQGKD-YVKWAASMALGLGAGV 233
+++ R L GGP+I++Q+ENEYG+ S +Y + +D V+ + L G
Sbjct: 134 ELIP--RLVPLLCTSGGPVILVQVENEYGSYGSDKAYLEHLRDGLVRRGIDVPLFTSDGP 191
Query: 234 PWVMCKQTDAPENIIDACNGYY-------CDGYKPNSYNKPTLWTENWDGWYTTWGGRLP 286
M + P + G Y+P P + E W+GW+ W
Sbjct: 192 TDSMLQGGSLPGVLATVNFGSRTAESFAKLREYQPQG---PLMCMEYWNGWFDHWMEEHH 248
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAPIDE 340
R D A + G S +N+YM+ GGTNFG +G ITSYDYD+P+ E
Sbjct: 249 QRDAADAARVFGEMLEAGAS-VNFYMFHGGTNFGFYNGANHIKTYEPTITSYDYDSPLTE 307
Query: 341 YGLLSEP--KWGHLKDLHAA-IKLCEPAL 366
+G EP K+ ++D+ A + L P L
Sbjct: 308 WG---EPTAKYYAVRDVLAEHLPLGAPEL 333
Score = 45.4 bits (106), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 60/233 (25%), Positives = 90/233 (38%), Gaps = 57/233 (24%)
Query: 527 ISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQT 586
IS +T +V + + +RD +VF++G G V+ W PV G L +L +
Sbjct: 388 ISGPRTGQV---LHVQEVRDRAQVFLDGTPAG-VVERWDPQGLPVTVPEGGAALDILVEN 443
Query: 587 VGLQNYGAFLEKDGAGFRGQVKLTG-FKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEA 645
+G NYG L D G V+L F+ G WT +Y + +
Sbjct: 444 MGRINYGPLL-SDAKGITCGVRLDNQFQYG--------WT-----------MYGLPLDSL 483
Query: 646 E---WTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVA 702
E + L P +Y+ F+ + D + L KG ++NG H+GRYW
Sbjct: 484 EGVAYEPLAEGEAPGGPAFYRAAFEVDEPAD-TFVRLDGWTKGVVFINGFHLGRYWE--- 539
Query: 703 PKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
RG P +T Y +P L+ N LV+FE G
Sbjct: 540 -----------RG-------------PQKTLY-LPGPLLRRGTNELVVFELHG 567
>gi|257090118|ref|ZP_05584479.1| beta-galactosidase [Enterococcus faecalis CH188]
gi|256998930|gb|EEU85450.1| beta-galactosidase [Enterococcus faecalis CH188]
Length = 594
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV W+ HE +G ++F+G
Sbjct: 10 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWDLHEPQKGTFHFEG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 70 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 129 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 181
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 182 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 238
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 239 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 296
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 297 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKDSFAQ 346
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 54/223 (24%), Positives = 90/223 (40%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 407 RDRLQLFVNQVHQATQYQTEIGEDIYVTLPQE----NNQIDVLMENMGRVNYGHKLFAD- 461
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + ++QQ Y + E D +R+ P +
Sbjct: 462 ------TQKKGIRTGVMA--------DLHFMTQWQQ-YCLPMTSCEQVDYSREWQPDQPS 506
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 507 FYQYHMELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 545
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 546 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 580
>gi|288928311|ref|ZP_06422158.1| beta-galactosidase (Lactase) [Prevotella sp. oral taxon 317 str.
F0108]
gi|288331145|gb|EFC69729.1| beta-galactosidase (Lactase) [Prevotella sp. oral taxon 317 str.
F0108]
Length = 674
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 118/343 (34%), Positives = 165/343 (48%), Gaps = 41/343 (11%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFK-G 113
+ +G L S +HY R W + K G + + TYVFWN HE+ G++++K G
Sbjct: 90 VYNGKPMQLHSGEMHYARVPAPYWRHRMKMMKAMGLNAVATYVFWNYHETEPGKWDWKTG 149
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
++ +FVK G+ + LR GPY CAEW FGG+P WL G+ R +N PF + +
Sbjct: 150 NRNLRQFVKTAAEEGMLVILRPGPYCCAEWEFGGYPWWLSKAKGLVIRADNQPFLDSCRV 209
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKD--------YVKWAASM 225
++ ++ MR+ + +GGPIIM+Q ENE+G SY Q KD Y
Sbjct: 210 YINQLASQMRDLQIT--KGGPIIMVQAENEFG----SYVAQRKDIPLETHRAYSAKIKQQ 263
Query: 226 ALGLGAGVP-------WVMCKQTDAPENIIDACNGYY-CDGYKP--NSYN---KPTLWTE 272
L G VP W+ T E + NG + K N YN P + E
Sbjct: 264 LLDAGFDVPLFTSDGSWLFKGGTI--EGALPTANGESDIEKLKKVVNEYNGGKGPYMVAE 321
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY---- 328
+ GW + W P E + A++ + G SF NYYM GGTNFG TSG +
Sbjct: 322 FYPGWLSHWAEPFPQVSTESIVKQTAKYLENGISF-NYYMVHGGTNFGFTSGANYTTATN 380
Query: 329 ----ITSYDYDAPIDEYGLLSEPKWGHLKDLHAA-IKLCEPAL 366
+TSYDYDAPI E G + PK+ L+ L +K PA+
Sbjct: 381 LQPDLTSYDYDAPISEAG-WNTPKYDALRALMIKNVKYNVPAV 422
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 78/334 (23%), Positives = 133/334 (39%), Gaps = 76/334 (22%)
Query: 448 IKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVT 507
IK V++++P P +P ++ KL+ ++ + + V S+ T E LN
Sbjct: 412 IKNVKYNVPAVPQ-RIPVIAIPNIKLNKSADVLNLLTKGKAVESDKPLT----FEDLNQG 466
Query: 508 KDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKV 567
Y Y H Q + + + + D V++NGQ G + V
Sbjct: 467 HGYVLYRRHFNQ--------------PIGGMLKVAGLADYALVYVNGQKVGEL--DRVSD 510
Query: 568 VQPVEFQSGYNDLI-LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTY 626
V +E +N ++ +L + +G NYGA + + G G V + G +
Sbjct: 511 VDSIEINVPFNGVLDILVENMGRINYGARITQSIKGINGPVVIDGNE------------- 557
Query: 627 QVGLKGEFQQIYSIEENEAEWTDL----TRDGIPSTFTWYKTYFDAPDGIDPVALDLGSM 682
+ G +Q +Y + NE + G+P T Y F+ D L++ +
Sbjct: 558 ---ITGNWQ-MYKLPMNEVPDVNALPTANNKGLP---TLYSGTFNL-DTTGDTFLNMETW 609
Query: 683 GKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQ 742
GKG +VNG ++GRYW RG P QT Y +P +L+
Sbjct: 610 GKGIVFVNGINLGRYWK--------------RG-------------PQQTLY-LPGCFLK 641
Query: 743 ASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVS 776
N +V+FE+ P + SV ++T I+ + V
Sbjct: 642 KGENKIVVFEQQNDTP-QTSVAGQTTPILQKLVK 674
>gi|255550379|ref|XP_002516240.1| beta-galactosidase, putative [Ricinus communis]
gi|223544726|gb|EEF46242.1| beta-galactosidase, putative [Ricinus communis]
Length = 216
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 73/128 (57%), Positives = 93/128 (72%), Gaps = 5/128 (3%)
Query: 215 GKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENW 274
GK Y+ W + MA L GVPW++C+Q DAP+ +I+ C G+YCD + PN+ N P WTENW
Sbjct: 57 GKAYLDWCSDMAESLDIGVPWIICQQRDAPQPMINTCYGWYCDQFTPNTANSPKKWTENW 116
Query: 275 DGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-YITSYD 333
GW+ +WG + PHR E +AFAVARFFQ F N YMY GGTNFGRT+GGP+ TS+D
Sbjct: 117 TGWFKSWGDKDPHRTAEGVAFAVARFFQ----FQNCYMYHGGTNFGRTAGGPYSTTTSHD 172
Query: 334 YDAPIDEY 341
YDAP+DE+
Sbjct: 173 YDAPLDEH 180
>gi|332187631|ref|ZP_08389367.1| glycosyl hydrolases 35 family protein [Sphingomonas sp. S17]
gi|332012379|gb|EGI54448.1| glycosyl hydrolases 35 family protein [Sphingomonas sp. S17]
Length = 613
Score = 164 bits (416), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 113/361 (31%), Positives = 176/361 (48%), Gaps = 57/361 (15%)
Query: 22 MMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDL 81
MM+ + ++S+ +T F V + + DG +ISA +HY R W D
Sbjct: 7 MMVAASALVPTIASAQGTTPAHSFTVQGN--GFLKDGKPYQVISAEMHYTRIPRAYWRDR 64
Query: 82 IAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCA 141
+ K+K G + I TY FWNAHE G Y+F G+NDI F++ + GL + LR GPYVCA
Sbjct: 65 LRKAKAMGLNTITTYSFWNAHEPRPGTYDFTGQNDIAAFIRDAQAEGLDVILRPGPYVCA 124
Query: 142 EWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIE 201
EW GG+P WL + R+ + + + R++ ++ ++ +L + GGPI+ +Q+E
Sbjct: 125 EWELGGYPSWLLKDRNLLLRSTDPKYTAAVDRWLARLGQEVKPLLLRN--GGPIVAIQLE 182
Query: 202 NEYG----------NMESSYGQQG-KDYVKWAASMALGLGAG----VPWVMCKQTDAPEN 246
NEYG +++SY + G D V + ++ A L G VP V+ + +N
Sbjct: 183 NEYGAFGSDKAYLEGLKASYQRAGLADGVLFTSNQAGDLAKGSLPEVPSVVNFGSGGAQN 242
Query: 247 IIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWG-------GRLPHRPVEDLAFAVAR 299
+ + DG + + E W GW+ WG G+ + E+L F + R
Sbjct: 243 AVAKLEAFRPDGLR--------MVGEYWAGWFDKWGEDHHETDGK---KEAEELGFMLKR 291
Query: 300 FFQRGGSFMNYYMYFGGTNFGRTSGGPFY--------ITSYDYDAPIDE-------YGLL 344
G ++ YM+ GGT FG +G + TSYDY+AP+DE YGLL
Sbjct: 292 -----GYSVSLYMFHGGTTFGWMNGADSHTGTDYHPDTTSYDYNAPLDEAGNPRYKYGLL 346
Query: 345 S 345
+
Sbjct: 347 A 347
>gi|312903555|ref|ZP_07762735.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
gi|422689128|ref|ZP_16747240.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
gi|422731840|ref|ZP_16788189.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
gi|310633431|gb|EFQ16714.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
gi|315162138|gb|EFU06155.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
gi|315577890|gb|EFU90081.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
Length = 604
Score = 164 bits (416), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 173/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV W+ HE +G ++F+G
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWDLHEPQKGTFHFEG 79
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 139 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 191
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 192 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 248
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
WDGW+ W + R ++LA +V G +N YM+ GGTNFG +G
Sbjct: 249 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFGFMNGCSARGTID 306
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 307 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKDSFAQ 356
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 88/223 (39%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 417 RDRLQLFVNQVHQATQYQTEIGEDIYVTLPQE----NNQIDVLMENMGRVNYGHKLFAD- 471
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + +Q Y + E D +R+ P +
Sbjct: 472 ------TQKKGIRTGVMADLHFMTQWQQ---------YCLPMTSCEQVDYSREWQPDQPS 516
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 517 FYQYHMELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 555
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 556 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 590
>gi|313149603|ref|ZP_07811796.1| glycoside hydrolase family 35 [Bacteroides fragilis 3_1_12]
gi|313138370|gb|EFR55730.1| glycoside hydrolase family 35 [Bacteroides fragilis 3_1_12]
Length = 628
Score = 164 bits (416), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 118/360 (32%), Positives = 174/360 (48%), Gaps = 59/360 (16%)
Query: 32 CVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGAD 91
CV S S STF + H +G ++S +HY R + W + K G +
Sbjct: 18 CVFSQSKSTF----EIKDGH--FYRNGKVTPVLSGEMHYARIPHQYWRHRLQMMKGMGLN 71
Query: 92 VIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVW 151
+ TYVFWN HE G+++F G ++ +F+K G G+ + LR GPYVCAEW FGG+P W
Sbjct: 72 TVATYVFWNLHEPEPGKWDFTGDKNLAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWW 131
Query: 152 LRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEM--LFSWQGGPIIMLQIENEYGNMES 209
L+++ G+E R +N E ++ K +D + +E+ L +GGPI+M+Q ENE+G
Sbjct: 132 LQNVKGMEIRRDNP----EFLKYTKAYIDRLYKEVGNLQCTKGGPIVMVQCENEFG---- 183
Query: 210 SYGQQGKD-----YVKWAASMALGL---GAGVP-------WVM-----------CKQTDA 243
SY Q KD + + A + L G VP W+
Sbjct: 184 SYVAQRKDIPLEEHRAYNAKIKQQLADAGFNVPLFTSDGSWLFEGGATPGALPTANGESD 243
Query: 244 PENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQR 303
EN+ N Y+ DG P + E + GW + W P +A ++ Q
Sbjct: 244 IENLKKVVNQYH-DG------KGPYMVAEFYPGWLSHWAEPFPQVGASGIARQTEKYLQN 296
Query: 304 GGSFMNYYMYFGGTNFGRTSGGPF--------YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
SF N+YM GGTNFG TSG + +TSYDYDAPI E G ++ PK+ ++++
Sbjct: 297 DVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDLTSYDYDAPISEAGWVT-PKYDSIRNV 354
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 76/330 (23%), Positives = 132/330 (40%), Gaps = 66/330 (20%)
Query: 449 KTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTK 508
K V++++P +P + P + KL+ + ++ V ++ T E LN
Sbjct: 357 KYVKYTVPEAPAPN-PVIEIPSIKLTKVADVLAFAEKQKPVSADTPLT----FEQLNQGY 411
Query: 509 DYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVV 568
Y Y H Q + T+ I +RD V+++G+ G V+ K
Sbjct: 412 GYVLYTRHFNQ--------------PISGTLEIPGLRDYAVVYVDGEQVG-VLNRNTKTY 456
Query: 569 QPVEFQSGYN-DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFK-NGDIDLSKILWTY 626
+E + +N L +L + +G NYG+ + + G VK+ G + G+ D+ ++ +
Sbjct: 457 S-MEIEVPFNATLQILVENMGRINYGSEIVHNTKGIISPVKIAGKEITGEWDMYQLPMSE 515
Query: 627 Q---VGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMG 683
LK + E + + + +G TFT D + +D+ + G
Sbjct: 516 MPDLAKLKADAHANVPAEAAKLKGCPVLYEG---TFTL--------DNVGDTFIDMENWG 564
Query: 684 KGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQA 743
KG +VNG +IGRYW V P QT Y +P WL+
Sbjct: 565 KGIIFVNGVNIGRYWKV---------------------------GPQQTLY-IPGVWLKK 596
Query: 744 SNNLLVIFEETGGNPFEISVKLRSTRIVCE 773
N +VIFE+ P + VK T ++ +
Sbjct: 597 GTNKIVIFEQLNEVP-QAEVKTVKTPVLMK 625
>gi|311281324|ref|YP_003943555.1| glycoside hydrolase [Enterobacter cloacae SCF1]
gi|308750519|gb|ADO50271.1| glycoside hydrolase family 35 [Enterobacter cloacae SCF1]
Length = 591
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 100/316 (31%), Positives = 154/316 (48%), Gaps = 36/316 (11%)
Query: 51 HRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYN 110
+ ++ DG LIS IHY R P+ W + K GA+ +ETY+ WN H+ ++
Sbjct: 7 EKNLLQDGKPVQLISGAIHYFRLVPQYWEHSLNNLKALGANCVETYLPWNIHQPDPERFC 66
Query: 111 FKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEE 170
F G D+ +F+ L GL++ LR PY+CAEW FGG P WL P + R++ F +
Sbjct: 67 FTGMADVERFIALAQRKGLFVILRPSPYICAEWEFGGLPAWLLRDPSMRVRSSQPAFLQA 126
Query: 171 MQRFVKKIVDLMREEMLFSWQ---GGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
++R+ +++ L WQ GGP++M+Q+ENEYG S+G K Y++ A+M
Sbjct: 127 VERYYAELL-----PRLAPWQYDRGGPVVMMQLENEYG----SFGND-KAYLRTLAAMMR 176
Query: 228 GLGAGVP-------WVMCKQTDA--PENIIDACN-----GYYCDGYKPNSYNKPTLWTEN 273
G VP W Q + +N++ N D +P + E
Sbjct: 177 RYGVSVPLFTSDGAWQEALQAGSLCEDNVLATANFGSRSAESLDNLAAFQPERPLMCLEF 236
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY----- 328
W+GW+ +G + R +D+ + R +N YM+ GGTNFG +G
Sbjct: 237 WNGWFNRYGDAIIRRDADDVGQEIRTLLTRAS--INIYMFQGGTNFGFMNGCSVRGDKDL 294
Query: 329 --ITSYDYDAPIDEYG 342
+TSYDYDA + E+G
Sbjct: 295 PQVTSYDYDALLSEWG 310
Score = 43.9 bits (102), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 54/235 (22%), Positives = 86/235 (36%), Gaps = 52/235 (22%)
Query: 546 DVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFL--EKDGAGF 603
D ++ + NG+ + + P + N L LL + +G NYG L G
Sbjct: 405 DRVQFYCNGEHLATQYHEQIGEQIPFALREADNVLDLLIENMGRVNYGPRLLAPTQRKGL 464
Query: 604 RGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYK 663
RG + + D D I+ + + + D + P +Y+
Sbjct: 465 RGGLVIDLHLETDWD------------------IFPLPLDNIDDVDFSAGWQPQQPAFYE 506
Query: 664 TYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCT 723
F A D LD S+GKG A++NG ++GRYW YRG
Sbjct: 507 YCF-AIDSPADTFLDTRSLGKGVAFINGFNLGRYW--------------YRGPLG----- 546
Query: 724 TNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIVCEQVSES 778
+ ++P L+ N L+IFE G E+ + V +V+ES
Sbjct: 547 ---------YLYIPAPLLKQGENRLIIFETEG---VEVGALALLNKPVYIEVTES 589
>gi|422735885|ref|ZP_16792151.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
gi|315167420|gb|EFU11437.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
Length = 604
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 172/351 (49%), Gaps = 45/351 (12%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+++G ++S IHY R P W + K G + +ETYV WN HE +G ++F+G
Sbjct: 20 FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+ +F+KL GLY +R PY+CAEW FGGFP WL + PG R+NN + + +
Sbjct: 80 ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ +++ + L + GG I+M+QIENEYG S+G++ K Y++ + + G
Sbjct: 139 YYDVLMEKIVPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTA 191
Query: 234 PWVMCKQTDAP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTE 272
P+ +D P ++I+ N G ++ + P + E
Sbjct: 192 PFFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCME 248
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY---- 328
WDGW+ W + R ++LA +V G +N YM+ GG NFG +G
Sbjct: 249 FWDGWFNRWKEPIIKRDPQELAESVREALALGS--INLYMFHGGINFGFMNGCSARGTID 306
Query: 329 ---ITSYDYDAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
ITSYDYDAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 307 LPQITSYDYDAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKDSFAQ 356
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 54/223 (24%), Positives = 91/223 (40%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V P E N + +L + +G NYG L D
Sbjct: 417 RDRLQLFVNQVHQATQYQTEIGEDIYVTLPQE----NNQIDVLIENMGRVNYGHKLFAD- 471
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + ++QQ Y + E D +R+ P +
Sbjct: 472 ------TQKKGIRTGVMA--------DLHFMTQWQQ-YCLPMTSCEQVDYSREWQPDQPS 516
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ +GKG +VN ++GR+W V
Sbjct: 517 FYQYHVELAEVKD-TFIDVSKLGKGIVFVNQTNLGRFWNV-------------------- 555
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 556 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 590
>gi|53715303|ref|YP_101295.1| beta-galactosidase [Bacteroides fragilis YCH46]
gi|52218168|dbj|BAD50761.1| beta-galactosidase precursor [Bacteroides fragilis YCH46]
Length = 628
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 116/359 (32%), Positives = 169/359 (47%), Gaps = 57/359 (15%)
Query: 32 CVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGAD 91
CV S S STF + H +G ++S +HY R + W + K G +
Sbjct: 18 CVFSQSKSTF----EIKNGH--FYRNGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLN 71
Query: 92 VIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVW 151
+ TYVFWN HE G+++F G ++ +F+K G G+ + LR GPYVCAEW FGG+P W
Sbjct: 72 TVATYVFWNLHEPEPGKWDFTGDKNLAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWW 131
Query: 152 LRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEM--LFSWQGGPIIMLQIENEYGNMES 209
L+++ G+E R +N E ++ K +D + +E+ L +GGPI+M+Q ENE+G
Sbjct: 132 LQNVKGMEIRRDNP----EFLKYTKAYIDRLYKEVGSLQCTKGGPIVMVQCENEFG---- 183
Query: 210 SYGQQGKD--------YVKWAASMALGLGAGVPWVMCK-----QTDAPENIIDACNG--- 253
SY Q KD Y +G VP + A + NG
Sbjct: 184 SYVAQRKDIPLEEHRAYNAKIKQQLADVGFNVPLFTSDGSWLFEGGATPGALPTANGESD 243
Query: 254 ---------YYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRG 304
Y DG P + E + GW + W P +A ++ Q
Sbjct: 244 IENLKKVVDQYHDG------KGPYMVAEFYPGWLSHWAEPFPQIGASGIARQTEKYLQND 297
Query: 305 GSFMNYYMYFGGTNFGRTSGGPF--------YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
SF N+YM GGTNFG TSG + +TSYDYDAPI E G ++ PK+ ++++
Sbjct: 298 VSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDMTSYDYDAPISEAGWVT-PKYDSIRNV 354
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 78/333 (23%), Positives = 133/333 (39%), Gaps = 72/333 (21%)
Query: 449 KTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTK 508
K V++++P +P + P + +L+ + ++ V S+ T E LN
Sbjct: 357 KYVKYTIPEAPAPN-PVIEIPSIQLNKVADVLAFAEKQKPVSSDTPLT----FEQLNQGY 411
Query: 509 DYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVV 568
Y Y H Q + T+ I +RD V+++G+ G V+ K
Sbjct: 412 GYVLYTRHFNQ--------------PISGTLEIPGLRDYAVVYVDGEQVG-VLNRNTKTY 456
Query: 569 QPVEFQSGYN-DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFK-NGDIDLSKILWTY 626
+E + +N L +L + +G NYG+ + + G V++ G + G D+ Y
Sbjct: 457 S-MEIEVPFNATLQILVENMGRINYGSEIVHNTKGIISPVQIAGKEIVGGWDM------Y 509
Query: 627 QVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYK---TYFDAPDGIDPVA---LDLG 680
Q+ + E + ++ + T +PS K ++ +D V +D+
Sbjct: 510 QLPMD-EMPDLTKLKAD-------THKNVPSEVAKLKGCPVLYEGTFTLDKVGDTFMDME 561
Query: 681 SMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSW 740
S GKG +VNG +IGRYW V P QT Y VP W
Sbjct: 562 SWGKGIVFVNGVNIGRYWKV---------------------------GPQQTLY-VPGVW 593
Query: 741 LQASNNLLVIFEETGGNPFEISVKLRSTRIVCE 773
L+ N +VIFE+ P + VK T ++ +
Sbjct: 594 LKKGENKIVIFEQLNETP-QTEVKTVKTPVLMK 625
>gi|355690250|gb|AER99094.1| galactosidase, beta 1 [Mustela putorius furo]
Length = 648
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 116/334 (34%), Positives = 164/334 (49%), Gaps = 31/334 (9%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
+ F + Y H + DG IS IHY R W D + K K G + I+TYV WN H
Sbjct: 19 RTFKIDYHHNRFLKDGQPFRYISGSIHYSRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFH 78
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E GQY F G+ D+ F+KL GL + LR GPY+CAEW+ GG P WL I R+
Sbjct: 79 EPQPGQYKFSGEQDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRS 138
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWA 222
++ + + +++ V L R + L GGPII +Q+ENEYG SY DY+++
Sbjct: 139 SDPDYLAAVDKWLG--VLLPRMKPLLYQNGGPIITVQVENEYG----SYFTCDYDYLRFL 192
Query: 223 ASM---ALGL--------GAGVPWVMCKQTDAPENIIDACNGYYCDG----YKPNSYNKP 267
+ LG GA P++ C +D G + + P
Sbjct: 193 QKLFHYHLGKDVLLFTTDGALEPFLQCGALQGLYATVDFGPGANITAAFEVQRKSEPKGP 252
Query: 268 TLWTENWDGWYTTWGGRLPHRPV--EDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG 325
+ +E + GW WG PH V E +A ++ RG + +N YM+ GGTNF +G
Sbjct: 253 LVNSEFYTGWLDHWGQ--PHSTVKTEVVASSLHDILARGAN-VNLYMFIGGTNFAYWNGA 309
Query: 326 --PFYI--TSYDYDAPIDEYGLLSEPKWGHLKDL 355
P+ TSYDYDAP+ E G L+E K+ L+D+
Sbjct: 310 NMPYKAQPTSYDYDAPLSEAGDLTE-KYFALRDV 342
>gi|256393561|ref|YP_003115125.1| beta-galactosidase [Catenulispora acidiphila DSM 44928]
gi|256359787|gb|ACU73284.1| Beta-galactosidase [Catenulispora acidiphila DSM 44928]
Length = 584
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 102/310 (32%), Positives = 157/310 (50%), Gaps = 30/310 (9%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG ++S G+HY R P W D + K++ G + I+TY+ WN HE G ++F G
Sbjct: 13 LDGQPFRIVSGGLHYFRVHPAQWSDRLRKARLMGLNTIDTYIPWNLHERRPGTFDFGGIL 72
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ F+ + GL++ LR GPY+C EW GG P WL P + R+ + F + ++ ++
Sbjct: 73 DLAAFLDAAAAEGLHVLLRPGPYICGEWEGGGLPSWLLADPDLALRSTDPAFLQAVEAYL 132
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
I+ ++ + +GGP+I +Q+ENEYG +YG Y++ G VP+
Sbjct: 133 DAIMPIVLPRL--GTRGGPVIAVQVENEYG----AYGSD-TAYMERLYEALTSRGIDVPF 185
Query: 236 VMCKQTDAPENIID-ACNGYYCD------------GYKPNSYNKPTLWTENWDGWYTTWG 282
+D P ++ D A G + P + E W+GW+ WG
Sbjct: 186 FT---SDQPNDLADGALPGVLATANFGGKVTASLAALRAQQPTGPLMCAEFWNGWFDYWG 242
Query: 283 GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG----GPFY--ITSYDYDA 336
G R ED A+ Q G S +N+YM+ GGTNFG T+G G + +TSYDYD+
Sbjct: 243 GTHAQRSAEDAGAALEEMLQAGAS-VNFYMFHGGTNFGFTNGANDKGTYRATVTSYDYDS 301
Query: 337 PIDEYGLLSE 346
P+DE G +E
Sbjct: 302 PLDEAGDPTE 311
>gi|196002910|ref|XP_002111322.1| hypothetical protein TRIADDRAFT_1215 [Trichoplax adhaerens]
gi|190585221|gb|EDV25289.1| hypothetical protein TRIADDRAFT_1215, partial [Trichoplax
adhaerens]
Length = 543
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 104/312 (33%), Positives = 158/312 (50%), Gaps = 32/312 (10%)
Query: 65 SAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLV 124
S IHY R PE W D + K K G + +ETYV WN HE + GQ+++ G ++ KF+ L
Sbjct: 15 SGAIHYFRVVPEYWRDRLLKMKAFGLNTVETYVPWNLHEPVPGQFDYTGILNVRKFILLA 74
Query: 125 GSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMRE 184
G Y+ LR GPY+CAEW FGG P WL ++ R+ PFK+ + RF + ++
Sbjct: 75 QELGFYVILRPGPYICAEWEFGGMPSWLLSDKNMQVRSTYKPFKDAVNRFFDGFIPEIKS 134
Query: 185 EMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAP 244
L + +GGPII +Q+ENEYG SYG ++Y+++ + G V ++
Sbjct: 135 --LQASKGGPIIAVQVENEYG----SYGSD-EEYMQFIRDALINRGIVELLVTSDNSEGI 187
Query: 245 EN--IIDACNGYYCDGYKPNSY-------NKPTLWTENWDGWYTTWGGRLPHRPVEDLAF 295
++ Y G+ + + P++ E W GW+ WG + + V +A
Sbjct: 188 KHGGAPGVLKTYNFQGHAKSHLSILERLQDAPSIVMEFWSGWFDHWGEK--NHQVHTIAH 245
Query: 296 AVARF---FQRGGSFMNYYMYFGGTNFGRTSGGPFY---------ITSYDYDAPIDEYGL 343
F SF N+Y++ GGTNFG +G F +TSYDYDAP+ E G
Sbjct: 246 VTNTFKDILDCDASF-NFYVFHGGTNFGFMNGANFIDFFSYYLPTVTSYDYDAPLSEAGD 304
Query: 344 LSEPKWGHLKDL 355
++E K+ L+ +
Sbjct: 305 ITE-KYMELRKI 315
>gi|329927841|ref|ZP_08281902.1| beta-galactosidase [Paenibacillus sp. HGF5]
gi|328938242|gb|EGG34637.1| beta-galactosidase [Paenibacillus sp. HGF5]
Length = 619
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 105/316 (33%), Positives = 156/316 (49%), Gaps = 38/316 (12%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
++DG +IS IHY R PE W D + K K G + +ETY+ WN HE G+++F G
Sbjct: 12 LLDGQPYRIISGAIHYFRVVPEYWEDRLLKLKACGFNTVETYIAWNVHEPQEGKFSFSGM 71
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
D+ F++L G GL++ +R P++CAEW FGG P WL I R ++ + ++ +
Sbjct: 72 ADVASFIELAGKLGLHVIVRPSPFICAEWEFGGLPGWLLGYGEIRLRCSDPLYLSKVDHY 131
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGK--DYVKWAASMALGLGAG 232
+++ R L S GGPI+ +Q+ENEYG SYG DY++ A + G+
Sbjct: 132 YDELIP--RLVPLLSSNGGPILAVQVENEYG----SYGNDHAYLDYLR-AGLVRRGID-- 182
Query: 233 VPWVMCKQTDAPENIIDACNGYYCD----------------GYKPNSYNKPTLWTENWDG 276
V+ +D P + + G D Y+ +P + E W+G
Sbjct: 183 ---VLLFTSDGPTDEM-LLGGTLNDVHATVNFGSRVEESFRKYREYRTEEPLMVMEFWNG 238
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------IT 330
W+ W R D+A + ++G S MN YM+ GGTNFG SG T
Sbjct: 239 WFDHWMEDHHVRDAADVAGVLDEMLEKGSS-MNMYMFHGGTNFGFYSGANHIQTYEPTTT 297
Query: 331 SYDYDAPIDEYGLLSE 346
SYDYDAP+ E+G +E
Sbjct: 298 SYDYDAPLTEWGDKTE 313
>gi|380693434|ref|ZP_09858293.1| beta-galactosidase [Bacteroides faecis MAJ27]
Length = 778
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 116/353 (32%), Positives = 170/353 (48%), Gaps = 29/353 (8%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
++ ++++ + SS+ A T + F + ++DG ++ +A +HY R W
Sbjct: 5 LIALLVLFTVIFFSSAEAQTTARKFEAGKN--TFLLDGKPFVVKAAELHYTRIPQAYWDH 62
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
I K G + I Y+FWN HE G+++F G+NDI F + G+Y+ +R GPYVC
Sbjct: 63 RIEMCKALGMNTICIYIFWNIHEQEEGKFDFTGQNDIAAFCRAAQKHGMYVIVRPGPYVC 122
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEW GG P WL + RT + + E + F+K++ + L +GG IIM+Q+
Sbjct: 123 AEWEMGGLPWWLLKKKDVALRTLDPYYMERVGIFMKEVGKQLAP--LQVNKGGNIIMVQV 180
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLG-AGVPWVMCK-----QTDAPENIIDACN-- 252
ENEYG SYG K YV + G VP C +A +++I N
Sbjct: 181 ENEYG----SYGTD-KPYVSAVRDLVRESGFTDVPLFQCDWSSNFTRNALDDLIWTINFG 235
Query: 253 -GYYCD----GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
G D K P + +E W GW+ WG + RP +D+ + R SF
Sbjct: 236 TGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKEMLDRNISF 295
Query: 308 MNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+ YM GGT FG G P Y +SYDYDAPI E G +E K+ L+DL
Sbjct: 296 -SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KYFLLRDL 346
Score = 45.1 bits (105), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 55/245 (22%), Positives = 101/245 (41%), Gaps = 46/245 (18%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLE 597
T+ I + D ++F +G+L + + + L +L + +G N+ +
Sbjct: 421 TLKITEVHDWAQIFADGKLLARLDRRKGEFTTTLPALKKGTQLDILVEAMGRVNFDKSIH 480
Query: 598 KDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPS 657
D G +V+L +GD WT F YS +N+ ++ D +P+
Sbjct: 481 -DRKGITEKVELL---SGDRTKELKNWTVY-----NFPVDYSFIKNK-KYKDTKI--LPT 528
Query: 658 TFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAY 717
+Y++ F D + LD+ + GKG WVNGH +GR+W +
Sbjct: 529 MPAYYQSSFKL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI----------------- 570
Query: 718 NSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVK-LRS--TRIVCEQ 774
P QT + +P WL+ N +++ + G P + S+K L+ ++ E+
Sbjct: 571 ----------GPQQTLF-IPGCWLKEGENEILVLDLKG--PAKASMKGLKKPILDVLREK 617
Query: 775 VSESH 779
E+H
Sbjct: 618 APETH 622
>gi|332264040|ref|XP_003281056.1| PREDICTED: beta-galactosidase-1-like protein 3 [Nomascus
leucogenys]
Length = 655
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 105/321 (32%), Positives = 162/321 (50%), Gaps = 27/321 (8%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G++ ++ IHY R E W D + K K G + + TYV WN HE RG+++F G
Sbjct: 82 LEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSGNM 141
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ FV + GL++ LR GPY+C+E + GG P WL P + RT N F E ++++
Sbjct: 142 DLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPQLLLRTTNKGFIEAVEKYF 201
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGA---- 231
++ R L QGGP+I +Q+ENEYG+ + K Y+ + L G
Sbjct: 202 DHLIP--RVIPLQYRQGGPVIAVQVENEYGSF-----NKDKTYMPYLHKALLRRGIVELL 254
Query: 232 ----GVPWVMCKQTD---APENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGR 284
G V+ T A N+ + +K +KP L E W GW+ WG +
Sbjct: 255 LTSDGEKHVLSGHTKGVLAAINLQKLHQNTFSQLHKVQR-DKPLLIMEYWVGWFDRWGDK 313
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAPI 338
+ +++ AV+ F + SF N YM+ GGTNFG +G ++ +TSYDYDA +
Sbjct: 314 HHVKDAKEVEHAVSEFIKYEISF-NVYMFHGGTNFGFMNGATYFGKHTGIVTSYDYDAVL 372
Query: 339 DEYGLLSEPKWGHLKDLHAAI 359
E G +E K+ L+ L ++
Sbjct: 373 TEAGDYTE-KYFKLQKLFESV 392
>gi|445495533|ref|ZP_21462577.1| beta-galactosidase Bga [Janthinobacterium sp. HH01]
gi|444791694|gb|ELX13241.1| beta-galactosidase Bga [Janthinobacterium sp. HH01]
Length = 586
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 104/312 (33%), Positives = 159/312 (50%), Gaps = 37/312 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G ++S +HY R PE+W D + K K G + +ETYV WN HE GQ+ ++G
Sbjct: 17 LNGQPFRVLSGALHYFRVLPELWEDRLLKLKAMGLNTVETYVAWNLHEPAAGQFRYEGGL 76
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ F++L S GLY+ +R GP++CAEW FGG P WL P +E R P+ E ++RF
Sbjct: 77 DLAAFIRLAESLGLYVIVRPGPFICAEWEFGGLPAWLLADPYMEVRCCYQPYLEAVRRFY 136
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
++ + + +GGPI+ +Q+ENEYG SYG + Y+ W + L GV
Sbjct: 137 DDLLPRLLPLQI--QRGGPILAMQVENEYG----SYGSD-QLYLTWLRRLM--LDGGVET 187
Query: 236 VMCKQTDAPENIIDACNGYYCDGYKPNSY----------------NKPTLWTENWDGWYT 279
++ A ++++ +G +K ++ + P + E W+GW+
Sbjct: 188 LLFTSDGATDHMLK--HGTLAQVWKSANFGSRAEEEFAKLREYQPDGPLMCMEFWNGWFD 245
Query: 280 TWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF---------YIT 330
WG R D A A+ R G+ +N YM+ GGTNFG +G +
Sbjct: 246 HWGEPHHTRDAADAADALERIMA-CGAHVNVYMFHGGTNFGFMNGANTDLLTRDYQPTVN 304
Query: 331 SYDYDAPIDEYG 342
SYDYDAP+DE G
Sbjct: 305 SYDYDAPLDETG 316
>gi|390336578|ref|XP_792349.2| PREDICTED: beta-galactosidase-like [Strongylocentrotus purpuratus]
Length = 671
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 106/325 (32%), Positives = 159/325 (48%), Gaps = 37/325 (11%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
+ F + YD + DG +S HY R W D + K K G + ++TYV WN H
Sbjct: 27 RSFTIDYDSNTFLKDGQPFRYVSGSFHYSRVPAFYWQDRLDKMKMAGLNAVQTYVIWNFH 86
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E G++NF G +DI+ F+K +GL + LR GPY+C EW+ GG P WL +IPGI R+
Sbjct: 87 ELKPGEFNFDGDHDILSFLKKANDTGLAVILRPGPYICGEWDLGGLPAWLLNIPGIVLRS 146
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQ-QGKDYVKW 221
+N + + ++ + +R + + GGPIIM+Q+ENEYG+ ++ Q Q + Y +
Sbjct: 147 SNDLYMAHVTEWMNFFLPKLRPYLYVN--GGPIIMVQVENEYGSYQTCDHQYQRQLYHLF 204
Query: 222 AASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNK--------------- 266
A++ P V+ TD P + + C G D Y +
Sbjct: 205 RANLG-------PDVVLFTTDGPGDHLLQC-GTLQDMYATIDFGAGSNSTGMFQEMRKFE 256
Query: 267 ---PTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRG-GSFMNYYMYFGGTNFGRT 322
P + +E + GW W PH+ V+ A + G+ +N YM+ GGTNFG
Sbjct: 257 PKGPLVNSEYYTGWLDHW--EHPHQTVKTAAVCTSLDQMLALGANVNMYMFEGGTNFGFW 314
Query: 323 SGGPFYI-----TSYDYDAPIDEYG 342
+G + TSYDYDAP+ E G
Sbjct: 315 NGANYPTFNPQPTSYDYDAPLTEAG 339
>gi|301065438|ref|YP_003787461.1| glycosyl hydrolase, family 35 [Lactobacillus casei str. Zhang]
gi|300437845|gb|ADK17611.1| glycosyl hydrolase, family 35 [Lactobacillus casei str. Zhang]
Length = 598
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 115/362 (31%), Positives = 172/362 (47%), Gaps = 43/362 (11%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
S DH ++DG ++S IHY R P W + K G + +ETYV WN HE G
Sbjct: 5 SIDHE-FMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYNEG 63
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
++F G DI +F+ GLY +R PY+CAEW FGGFP WL + RT+++ +
Sbjct: 64 DFDFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWLL-TKKMRLRTDDSAY 122
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
+ + R+ ++ + + GG +IM+Q+ENEYG SYG+ KDY+ A +
Sbjct: 123 LQAIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYG----SYGED-KDYLAAVAELMK 175
Query: 228 GLGAGVPW---------VMCKQTDAPENIIDACN-GYYCDGY--------KPNSYNKPTL 269
G VP + + A I+ N G D + + ++ P +
Sbjct: 176 KHGVDVPLFTSDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLM 235
Query: 270 WTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-- 327
E WDGW+ WG + R E+ A + QRG +N YM+ GGTNFG +G
Sbjct: 236 CMEFWDGWFNRWGEPIIRRDPEETAEDLRAVIQRGS--VNLYMFHGGTNFGFMNGTSARK 293
Query: 328 -----YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAI-------KLCEPALVAADSAQYI 375
+TSYDYDAP++E G + + K +H + L +PA+ AD+
Sbjct: 294 DHDLPQVTSYDYDAPLNEQGNPTPKYFAIQKMIHEVLPSQAQTTPLVKPAMTQADNPLTA 353
Query: 376 KL 377
K+
Sbjct: 354 KV 355
>gi|443689405|gb|ELT91801.1| hypothetical protein CAPTEDRAFT_23316, partial [Capitella teleta]
Length = 596
Score = 164 bits (414), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 105/331 (31%), Positives = 161/331 (48%), Gaps = 36/331 (10%)
Query: 53 AIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFK 112
+ +DG R + S HY R P +W D + + K G + + TYV WN HE +GQ+
Sbjct: 8 SFYLDGRRFKIFSGSFHYFRTHPLLWGDRLLRMKAAGLNTVMTYVPWNFHEPRKGQFTLG 67
Query: 113 GKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNN-APFKEEM 171
G D+V F++ V GLYL +R GPY+CAEW FGGFP WL P + RT++ P+ E+
Sbjct: 68 GLYDLVSFMEQVQKVGLYLIVRPGPYICAEWEFGGFPSWLLRDPKMNLRTSSYTPYLNEV 127
Query: 172 QRFVKKIVDLMREEMLFSWQ-GGPIIMLQIENEYGNM---ESSYGQ-QGKDYVKWAASMA 226
++++ ++ ++ + F+++ GGPII Q+ENE+G+ + Y Q Y W +
Sbjct: 128 KQYLSQLFAVLTK---FTYKHGGPIIAFQVENEFGSKGVHDPEYLQFLVTQYSSWNLNEL 184
Query: 227 LGLGAGVPWV---MCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGG 283
L G ++ A N+ D + K +P + TE W GW+ WG
Sbjct: 185 LFTSDGKKYLSNGTLPDVLATINLNDHAKE-DLEELKEFQPERPLMVTEFWAGWFDHWGE 243
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY--------------I 329
H +L + S +N+YM+ GGTNFG +G + +
Sbjct: 244 EHHHYGTTELERELEAILSLNAS-VNFYMFIGGTNFGFWNGANYLSYNKDKEASLLGPTV 302
Query: 330 TSYDYDAPIDEYGLLSEPKWGHLKDLHAAIK 360
TSYDYDA + E WGH+K + I+
Sbjct: 303 TSYDYDAAVSE--------WGHVKPKYNVIR 325
>gi|336428330|ref|ZP_08608312.1| hypothetical protein HMPREF0994_04318 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336005980|gb|EGN36021.1| hypothetical protein HMPREF0994_04318 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 583
Score = 164 bits (414), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 108/309 (34%), Positives = 155/309 (50%), Gaps = 34/309 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG +IS +HY R PE W D + K K GA+ +ETYV WN HE +G++ F+G
Sbjct: 14 LDGKPFKIISGAVHYFRIVPEYWRDRLEKLKAMGANTVETYVPWNMHEPQKGKFVFEGML 73
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
DI +F+ L GLY+ +R PY+CAEW FGG P WL G+ R PF E ++ +
Sbjct: 74 DISRFILLAQELGLYVIVRPSPYICAEWEFGGLPAWLLKEDGMRLRGCYEPFLEAVREYY 133
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
+ ++ L GGP+I++Q+ENEYG YG + Y++ + L GA VP
Sbjct: 134 SVLFPILVP--LQIHHGGPVILMQVENEYG----YYGDDTR-YMETMKQLMLDNGAEVPL 186
Query: 236 VMCKQTDAPENIIDACN---GYYCDG------------YKPNSYNKPTLWTENWDGWYTT 280
V +D P + +C G G K + P + TE W GW+
Sbjct: 187 V---TSDGPMDESLSCGRLPGVLPTGNFGSKTEERFEVLKKYTEGGPLMCTEFWVGWFDH 243
Query: 281 WG-GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYD 333
WG G +E+ + + + G +N YM+ GGTNFG +G +Y +TSYD
Sbjct: 244 WGNGGHMRGNLEESTKDLDKMLEMG--HVNIYMFEGGTNFGFMNGSNYYDELTPDVTSYD 301
Query: 334 YDAPIDEYG 342
YDA + E G
Sbjct: 302 YDAVLTEAG 310
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 48/194 (24%), Positives = 80/194 (41%), Gaps = 48/194 (24%)
Query: 571 VEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGL 630
+F+SG L +L + +G N+G +E G G V+L G + + W
Sbjct: 432 ADFESG-ALLDILVENMGRVNFGPLMESQRKGIAGCVQLNGHMHYN-------W------ 477
Query: 631 KGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVN 690
++Y++ N E D ++ T +YK F+ + D LD G GKG A++N
Sbjct: 478 -----EMYTLPLNNLEKLDFSKGYEEGTPGFYKFVFEVEEAGD-TFLDFGGWGKGCAFLN 531
Query: 691 GHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVI 750
G ++GR+W + P + Y +P L+ N +++
Sbjct: 532 GFNLGRFWEI---------------------------GPQKRLY-IPGPLLKEGRNEIIL 563
Query: 751 FEETGGNPFEISVK 764
FE G EIS+K
Sbjct: 564 FETDGKTAPEISLK 577
>gi|336412039|ref|ZP_08592497.1| hypothetical protein HMPREF1018_04515 [Bacteroides sp. 2_1_56FAA]
gi|423261296|ref|ZP_17242197.1| hypothetical protein HMPREF1055_04474 [Bacteroides fragilis
CL07T00C01]
gi|423267821|ref|ZP_17246801.1| hypothetical protein HMPREF1056_04488 [Bacteroides fragilis
CL07T12C05]
gi|423272270|ref|ZP_17251238.1| hypothetical protein HMPREF1079_04320 [Bacteroides fragilis
CL05T00C42]
gi|423276726|ref|ZP_17255658.1| hypothetical protein HMPREF1080_04311 [Bacteroides fragilis
CL05T12C13]
gi|423283105|ref|ZP_17261990.1| hypothetical protein HMPREF1204_01528 [Bacteroides fragilis HMW
615]
gi|335939211|gb|EGN01088.1| hypothetical protein HMPREF1018_04515 [Bacteroides sp. 2_1_56FAA]
gi|387774329|gb|EIK36442.1| hypothetical protein HMPREF1055_04474 [Bacteroides fragilis
CL07T00C01]
gi|392695462|gb|EIY88674.1| hypothetical protein HMPREF1079_04320 [Bacteroides fragilis
CL05T00C42]
gi|392695591|gb|EIY88799.1| hypothetical protein HMPREF1056_04488 [Bacteroides fragilis
CL07T12C05]
gi|392696055|gb|EIY89256.1| hypothetical protein HMPREF1080_04311 [Bacteroides fragilis
CL05T12C13]
gi|404581379|gb|EKA86078.1| hypothetical protein HMPREF1204_01528 [Bacteroides fragilis HMW
615]
Length = 628
Score = 164 bits (414), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 117/359 (32%), Positives = 172/359 (47%), Gaps = 57/359 (15%)
Query: 32 CVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGAD 91
CV S S STF + H +G ++S +HY R + W + K G +
Sbjct: 18 CVFSQSKSTF----EIKNGH--FYRNGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLN 71
Query: 92 VIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVW 151
+ TYVFWN HE G+++F G ++ +F+K G G+ + LR GPYVCAEW FGG+P W
Sbjct: 72 TVATYVFWNLHEPEPGKWDFTGDKNLAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWW 131
Query: 152 LRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEM--LFSWQGGPIIMLQIENEYGNMES 209
L+++ G+E R +N E ++ K +D + +E+ L +GGPI+M+Q ENE+G
Sbjct: 132 LQNVKGMEIRRDNP----EFLKYTKAYIDRLYKEVGSLQCTKGGPIVMVQCENEFG---- 183
Query: 210 SYGQQGKD-----YVKWAASMALGL---GAGVPWVMCK-----QTDAPENIIDACNG--- 253
SY Q KD + + A + L G VP + A + NG
Sbjct: 184 SYVAQRKDIPLEEHRAYNAKIKQQLADAGFNVPLFTSDGSWLFEGGATPGALPTANGESD 243
Query: 254 ---------YYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRG 304
Y DG P + E + GW + W P +A ++ Q
Sbjct: 244 IENLKKVVDQYHDG------KGPYMVAEFYPGWLSHWAEPFPQIGASGIARQTEKYLQND 297
Query: 305 GSFMNYYMYFGGTNFGRTSGGPF--------YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
SF N+YM GGTNFG TSG + +TSYDYDAPI E G ++ PK+ ++++
Sbjct: 298 VSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDMTSYDYDAPISEAGWVT-PKYDSIRNV 354
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 77/333 (23%), Positives = 133/333 (39%), Gaps = 72/333 (21%)
Query: 449 KTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTK 508
K V++++P +P + P + +L+ + ++ V S+ T E LN
Sbjct: 357 KYVKYTIPEAPAPN-PVIEIPSIQLNKVADVLAFAEKQKPVSSDTPLT----FEQLNQGY 411
Query: 509 DYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVV 568
Y Y H Q + T+ I +RD V+++G+ G V+ K
Sbjct: 412 GYVLYTRHFNQ--------------PISGTLEIPGLRDYAVVYVDGEQVG-VLNRNTKTY 456
Query: 569 QPVEFQSGYN-DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFK-NGDIDLSKILWTY 626
+E + +N L +L + +G NYG+ + + G V++ G + G D+ Y
Sbjct: 457 S-MEIEVPFNATLQILVENMGRINYGSEIVHNTKGIISPVQIAGKEIVGGWDM------Y 509
Query: 627 QVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYK---TYFDAPDGIDPVA---LDLG 680
Q+ + E + ++ + T +PS K ++ +D V +D+
Sbjct: 510 QLPMD-EMPDLTKLKAD-------THKNVPSEVAKLKGCPVLYEGTFTLDKVGDTFMDME 561
Query: 681 SMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSW 740
S GKG +VNG +IGRYW V P QT Y +P W
Sbjct: 562 SWGKGIVFVNGVNIGRYWKV---------------------------GPQQTLY-IPGVW 593
Query: 741 LQASNNLLVIFEETGGNPFEISVKLRSTRIVCE 773
L+ N +VIFE+ P + VK T ++ +
Sbjct: 594 LKKGENKIVIFEQLNETP-QTEVKTVKTPVLMK 625
>gi|375360076|ref|YP_005112848.1| putative exported beta-galactosidase [Bacteroides fragilis 638R]
gi|383119863|ref|ZP_09940600.1| hypothetical protein BSHG_4164 [Bacteroides sp. 3_2_5]
gi|251944025|gb|EES84544.1| hypothetical protein BSHG_4164 [Bacteroides sp. 3_2_5]
gi|301164757|emb|CBW24316.1| putative exported beta-galactosidase [Bacteroides fragilis 638R]
Length = 628
Score = 164 bits (414), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 117/359 (32%), Positives = 172/359 (47%), Gaps = 57/359 (15%)
Query: 32 CVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGAD 91
CV S S STF + H +G ++S +HY R + W + K G +
Sbjct: 18 CVFSQSKSTF----EIKNGH--FYRNGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLN 71
Query: 92 VIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVW 151
+ TYVFWN HE G+++F G ++ +F+K G G+ + LR GPYVCAEW FGG+P W
Sbjct: 72 TVATYVFWNLHEPEPGKWDFTGDKNLAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWW 131
Query: 152 LRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEM--LFSWQGGPIIMLQIENEYGNMES 209
L+++ G+E R +N E ++ K +D + +E+ L +GGPI+M+Q ENE+G
Sbjct: 132 LQNVKGMEIRRDNP----EFLKYTKAYIDRLYKEVGSLQCTKGGPIVMVQCENEFG---- 183
Query: 210 SYGQQGKD-----YVKWAASMALGL---GAGVPWVMCK-----QTDAPENIIDACNG--- 253
SY Q KD + + A + L G VP + A + NG
Sbjct: 184 SYVAQRKDIPLEEHRAYNAKIKQQLADAGFNVPLFTSDGSWLFEGGATPGALPTANGESD 243
Query: 254 ---------YYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRG 304
Y DG P + E + GW + W P +A ++ Q
Sbjct: 244 IENLKKVVDQYHDG------KGPYMVAEFYPGWLSHWAEPFPQIGASGIARQTEKYLQND 297
Query: 305 GSFMNYYMYFGGTNFGRTSGGPF--------YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
SF N+YM GGTNFG TSG + +TSYDYDAPI E G ++ PK+ ++++
Sbjct: 298 VSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDMTSYDYDAPISEAGWVT-PKYDSIRNV 354
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 78/333 (23%), Positives = 133/333 (39%), Gaps = 72/333 (21%)
Query: 449 KTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTK 508
K V++++P +P + P + +L+ + ++ V S+ T E LN
Sbjct: 357 KYVKYTIPEAPAPN-PVIEIPSIQLNKVADVLAFAEKQKPVSSDTPLT----FEQLNQGY 411
Query: 509 DYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVV 568
Y Y H Q + T+ I +RD V+++G+ G V+ K
Sbjct: 412 GYVLYTRHFNQ--------------PISGTLEIPGLRDYAVVYVDGEQVG-VLNRNTKTY 456
Query: 569 QPVEFQSGYN-DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFK-NGDIDLSKILWTY 626
+E + +N L +L + +G NYG+ + + G V++ G + G D+ Y
Sbjct: 457 S-MEIEVPFNATLQILVENMGRINYGSEIVHNTKGIISPVQIAGKEIVGGWDM------Y 509
Query: 627 QVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYK---TYFDAPDGIDPVA---LDLG 680
Q+ + E + ++ + T +PS K ++ +D V +D+
Sbjct: 510 QLPMD-EMPDLTKLKAD-------THKNVPSEVAKLKGCPVLYEGTFTLDKVGDTFMDME 561
Query: 681 SMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSW 740
S GKG +VNG +IGRYW V P QT Y VP W
Sbjct: 562 SWGKGIVFVNGVNIGRYWKV---------------------------GPQQTLY-VPGVW 593
Query: 741 LQASNNLLVIFEETGGNPFEISVKLRSTRIVCE 773
L+ N +VIFE+ P + VK T ++ +
Sbjct: 594 LKKGENKIVIFEQLNETP-QTEVKTVKTPVLMK 625
>gi|265767790|ref|ZP_06095322.1| beta-galactosidase [Bacteroides sp. 2_1_16]
gi|263252462|gb|EEZ23990.1| beta-galactosidase [Bacteroides sp. 2_1_16]
Length = 628
Score = 164 bits (414), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 117/359 (32%), Positives = 172/359 (47%), Gaps = 57/359 (15%)
Query: 32 CVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGAD 91
CV S S STF + H +G ++S +HY R + W + K G +
Sbjct: 18 CVFSQSKSTF----EIKNGH--FYRNGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLN 71
Query: 92 VIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVW 151
+ TYVFWN HE G+++F G ++ +F+K G G+ + LR GPYVCAEW FGG+P W
Sbjct: 72 TVATYVFWNLHEPEPGKWDFTGDKNLAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWW 131
Query: 152 LRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEM--LFSWQGGPIIMLQIENEYGNMES 209
L+++ G+E R +N E ++ K +D + +E+ L +GGPI+M+Q ENE+G
Sbjct: 132 LQNVKGMEIRRDNP----EFLKYTKAYIDRLYKEVGSLQCTKGGPIVMVQCENEFG---- 183
Query: 210 SYGQQGKD-----YVKWAASMALGL---GAGVPWVMCK-----QTDAPENIIDACNG--- 253
SY Q KD + + A + L G VP + A + NG
Sbjct: 184 SYVAQRKDIPLEEHRAYNAKIKQQLADAGFNVPLFTSDGSWLFEGGATPGALPTANGESD 243
Query: 254 ---------YYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRG 304
Y DG P + E + GW + W P +A ++ Q
Sbjct: 244 IENLKKVVDQYHDG------KGPYMVAEFYPGWLSHWAEPFPQIGASGIARQTEKYLQND 297
Query: 305 GSFMNYYMYFGGTNFGRTSGGPF--------YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
SF N+YM GGTNFG TSG + +TSYDYDAPI E G ++ PK+ ++++
Sbjct: 298 VSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDMTSYDYDAPISEAGWVT-PKYDSIRNV 354
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 63/244 (25%), Positives = 103/244 (42%), Gaps = 53/244 (21%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYN-DLILLSQTVGLQNYGAFL 596
T+ I +RD V+++G+ G V+ K VE + +N L +L + +G NYG+ +
Sbjct: 427 TLEIPGLRDYAVVYVDGEQVG-VLNRNTKTYS-VEIEVPFNATLQILVENMGRINYGSEI 484
Query: 597 EKDGAGFRGQVKLTGFK-NGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGI 655
+ G V++ G + G D+ YQ+ + E + ++ + T +
Sbjct: 485 VHNTKGIISPVQIAGKEIVGGWDM------YQLPMD-EMPDLTKLKAD-------THKNV 530
Query: 656 PSTFTWYK---TYFDAPDGIDPVA---LDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQD 709
PS K ++ +D V +D+ S GKG +VNG +IGRYW V
Sbjct: 531 PSEVAKLKGCPVLYEGTFTLDKVGDTFMDMESWGKGIVFVNGVNIGRYWKV--------- 581
Query: 710 TCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTR 769
P QT Y VP WL+ N +VIFE+ P + VK T
Sbjct: 582 ------------------GPQQTLY-VPGVWLKKGENKIVIFEQLNETP-QTEVKTVKTP 621
Query: 770 IVCE 773
++ +
Sbjct: 622 VLMK 625
>gi|348529664|ref|XP_003452333.1| PREDICTED: beta-galactosidase-like [Oreochromis niloticus]
Length = 651
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 108/341 (31%), Positives = 164/341 (48%), Gaps = 25/341 (7%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
+++++M+ S S S F V Y + DG + IS IHY R W D
Sbjct: 9 VLLLLMLFGRSLGESPS-------FTVDYQNDCFRKDGEKFQYISGSIHYNRIPRVYWKD 61
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
+ K G + I+TYV WN HE + G YNF G D+ F+KL GL + LR GPY+C
Sbjct: 62 RLLKMYMAGLNAIQTYVPWNYHEEVPGLYNFSGDRDLEHFLKLAQDVGLLVILRPGPYIC 121
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEW+ GG P WL I R+ + + + +++ K++ +++ + GGPII +Q+
Sbjct: 122 AEWDMGGLPAWLLKKKDIVLRSTDPDYIAAVDKWMGKLLPMIKPYLY--QNGGPIITVQV 179
Query: 201 ENEYG-------NMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNG 253
ENEYG N + + Y+ + GAG+ ++ C +D G
Sbjct: 180 ENEYGSYFACDYNYMRHLSKLFRSYLGDEVVLFTTDGAGLGYLKCGSIQDLYATVDFGPG 239
Query: 254 Y-YCDGYKPNSY---NKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMN 309
++P + P + +E + GW WG R +A A++ G + +N
Sbjct: 240 ANVTAAFEPQRQVQPHGPLVNSEFYTGWLDHWGSRHSVVSPTQVAKALSEMLLMGAN-VN 298
Query: 310 YYMYFGGTNFGRTSGG--PFYI--TSYDYDAPIDEYGLLSE 346
YM+ GGTNFG +G P+ TSYDYDAP+ E G L+E
Sbjct: 299 LYMFIGGTNFGYWNGANTPYAAQPTSYDYDAPLTEAGDLTE 339
Score = 39.7 bits (91), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 32/69 (46%), Gaps = 4/69 (5%)
Query: 640 IEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALD----LGSMGKGQAWVNGHHIG 695
+ E E D + S +Y F PDGI + D L S KGQ W+NG ++G
Sbjct: 519 LGEKELALFDPPQPADLSPPAFYGGSFVIPDGIPDLPQDTYIKLSSWRKGQIWINGFNVG 578
Query: 696 RYWTVVAPK 704
RYW P+
Sbjct: 579 RYWPTRGPQ 587
>gi|60683238|ref|YP_213382.1| beta-galactosidase [Bacteroides fragilis NCTC 9343]
gi|60494672|emb|CAH09473.1| putative exported beta-galactosidase [Bacteroides fragilis NCTC
9343]
Length = 628
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 117/359 (32%), Positives = 172/359 (47%), Gaps = 57/359 (15%)
Query: 32 CVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGAD 91
CV S S STF + H +G ++S +HY R + W + K G +
Sbjct: 18 CVFSQSKSTF----EIKNGH--FYRNGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLN 71
Query: 92 VIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVW 151
+ TYVFWN HE G+++F G ++ +F+K G G+ + LR GPYVCAEW FGG+P W
Sbjct: 72 TVATYVFWNLHEPEPGKWDFTGDKNLAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWW 131
Query: 152 LRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEM--LFSWQGGPIIMLQIENEYGNMES 209
L+++ G+E R +N E ++ K +D + +E+ L +GGPI+M+Q ENE+G
Sbjct: 132 LQNVKGMEIRRDNP----EFLKYTKAYIDRLYKEVGSLQCTKGGPIVMVQCENEFG---- 183
Query: 210 SYGQQGKD-----YVKWAASMALGL---GAGVPWVMCK-----QTDAPENIIDACNG--- 253
SY Q KD + + A + L G VP + A + NG
Sbjct: 184 SYVAQRKDIPLEEHRAYNAKIKQQLADAGFNVPLFTSDGSWLFEGGATPGALPTANGESD 243
Query: 254 ---------YYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRG 304
Y DG P + E + GW + W P +A ++ Q
Sbjct: 244 IENLKKVVDQYHDG------KGPYMVAEFYPGWLSHWAEPFPQIGASGIARQTEKYLQND 297
Query: 305 GSFMNYYMYFGGTNFGRTSGGPF--------YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
SF N+YM GGTNFG TSG + +TSYDYDAPI E G ++ PK+ ++++
Sbjct: 298 VSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDMTSYDYDAPISEAGWVT-PKYDSIRNV 354
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 78/333 (23%), Positives = 133/333 (39%), Gaps = 72/333 (21%)
Query: 449 KTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTK 508
K V++++P +P + P + +L+ + ++ V S+ T E LN
Sbjct: 357 KYVKYTIPEAPAPN-PVIEIPSIQLNKVADVLAFAEKQKPVSSDTPLT----FEQLNQGY 411
Query: 509 DYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVV 568
Y Y H Q + T+ I +RD V+++G+ G V+ K
Sbjct: 412 GYVLYTRHFNQ--------------PISGTLEIPGLRDYAVVYVDGEQVG-VLNRNTKTY 456
Query: 569 QPVEFQSGYN-DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFK-NGDIDLSKILWTY 626
+E + +N L +L + +G NYG+ + + G V++ G + G D+ Y
Sbjct: 457 S-MEIEVPFNATLQILVENMGRINYGSEIVHNTKGIISPVQIAGKEIVGGWDM------Y 509
Query: 627 QVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYK---TYFDAPDGIDPVA---LDLG 680
Q+ + E + ++ + T +PS K ++ +D V +D+
Sbjct: 510 QLPMD-EMPDLTKLKAD-------THKNVPSEVAKLKGCPVLYEGTFTLDKVGDTFMDME 561
Query: 681 SMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSW 740
S GKG +VNG +IGRYW V P QT Y VP W
Sbjct: 562 SWGKGIVFVNGVNIGRYWKV---------------------------GPQQTLY-VPGVW 593
Query: 741 LQASNNLLVIFEETGGNPFEISVKLRSTRIVCE 773
L+ N +VIFE+ P + VK T ++ +
Sbjct: 594 LKKGENKIVIFEQLNETP-QTEVKTVKTPVLMK 625
>gi|251799202|ref|YP_003013933.1| beta-galactosidase [Paenibacillus sp. JDR-2]
gi|247546828|gb|ACT03847.1| Beta-galactosidase [Paenibacillus sp. JDR-2]
Length = 604
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 107/314 (34%), Positives = 158/314 (50%), Gaps = 37/314 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG ++S IHY R PE W D + K K G + +ETY+ WN HE G + F G
Sbjct: 13 LDGEEFRILSGAIHYFRVVPEYWEDRLLKLKACGFNTVETYIPWNLHEPREGSFRFDGFA 72
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ +F++ G GL++ +R PY+CAEW FGG P WL + R + + E++ R+
Sbjct: 73 DVARFIETAGRLGLHVIVRPSPYICAEWEFGGLPAWLLK-SSMGLRCMDNEYLEKVDRYY 131
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
+++ R L +GGPII +Q+ENEYG SYG + A + GL
Sbjct: 132 DELIP--RLLPLLDSRGGPIIAVQVENEYG----SYGND----TAYLAYLRDGLIRRGVD 181
Query: 236 VMCKQTDAPEN------IIDACNGYYCDG---------YKPNSYNKPTLWTENWDGWYTT 280
+ +D P + ++ + G Y+ ++P + E W GW+
Sbjct: 182 CLLFTSDGPTDEMLLGGTVEGLHATVNFGSRVAESLAKYREYRQDEPLMVMEYWLGWFDH 241
Query: 281 WGGRLPH--RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG---GPFY---ITSY 332
W R PH R D+A + ++G S +N YM+ GGTNFG SG G Y ITSY
Sbjct: 242 W--RKPHHVREAGDVANVLDEMLEQGAS-VNLYMFHGGTNFGFYSGANYGEHYEPTITSY 298
Query: 333 DYDAPIDEYGLLSE 346
DYDAP+ E+G ++E
Sbjct: 299 DYDAPLTEWGDITE 312
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 46/171 (26%), Positives = 74/171 (43%), Gaps = 19/171 (11%)
Query: 531 KTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYN--DLILLSQTVG 588
K R + + +RD +VF++G+L G V+ W QP+E L +L + +G
Sbjct: 389 KGPRTRQKLHLREVRDRAQVFLDGKLIG-VVERWNP--QPIEIAVPREGARLDVLVENMG 445
Query: 589 LQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWT 648
NYG +L G + F++ WT + L E +Q+ + E T
Sbjct: 446 RVNYGPYLRDHKGITEGILIDNQFQSN--------WTVTL-LPLESEQLARVRYESVEVT 496
Query: 649 D-LTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
DG P+ +Y+ + + + D L KG AW+NG +GRYW
Sbjct: 497 GGQQHDGRPA---FYRGFVEVDEPAD-TFLRFDGWQKGIAWINGFQLGRYW 543
>gi|398787680|ref|ZP_10550020.1| beta-galactosidase [Streptomyces auratus AGR0001]
gi|396992782|gb|EJJ03876.1| beta-galactosidase [Streptomyces auratus AGR0001]
Length = 603
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 114/340 (33%), Positives = 171/340 (50%), Gaps = 46/340 (13%)
Query: 44 PFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHE 103
P ++ + ++DG ++S HY R P+ W D + + + G + +ETYV WN H+
Sbjct: 24 PGGLTIRGKEFLLDGKPFRILSGAFHYFRTHPQDWRDRLMRMRAMGLNTVETYVAWNFHQ 83
Query: 104 SIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWL---------RD 154
+ +F G D+V FV+ GL + +R GPY+CAEW+FGG P WL R
Sbjct: 84 PDEKEADFTGWRDVVAFVRTADEVGLKVIVRPGPYICAEWDFGGLPAWLLKDKDAPLRRS 143
Query: 155 IPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYG 212
P E R +A F E + RFV L + +GGPII +Q+ENEYG+ + +Y
Sbjct: 144 DPAFE-RAVDAWFAELLPRFVD----------LQATRGGPIIAMQVENEYGSYGDDHAYL 192
Query: 213 QQGKDYVKWAASMALGLGA-GVPWVMCKQTDAPE-----NIIDACNGYYCD--GYKPNSY 264
+ +D ++ L + G K P+ N G + + ++P
Sbjct: 193 EHLRDTMRAQGIDGLLFCSNGATQEALKAGSLPDLLSTVNFGGDPTGPFAELRAFQP--- 249
Query: 265 NKPTLWTENWDGWYTTWGGRLPHRPVE--DLAFAVARFFQRGGSFMNYYMYFGGTNFGRT 322
+KP TE WDGW+ WG R HR + A V + + G S +N+YM GGTNFG +
Sbjct: 250 DKPLFCTEFWDGWFDHWGER--HRTTDPAQTAADVEKMLEAGAS-INFYMAVGGTNFGWS 306
Query: 323 SG----GPFY---ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+G G Y +TSYDYD+PI E G L+E K+ ++D+
Sbjct: 307 AGANLSGSGYQPTVTSYDYDSPISESGELTE-KFHKVRDV 345
>gi|298386767|ref|ZP_06996322.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
gi|298260441|gb|EFI03310.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
Length = 778
Score = 163 bits (412), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 116/353 (32%), Positives = 169/353 (47%), Gaps = 29/353 (8%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
+ ++++ + SS+ A T + F + ++DG ++ +A +HY R W
Sbjct: 5 FIALLVLFTVIFFSSAEAQTTARKFEAGKN--TFLLDGKPFVVKAAELHYTRIPQAYWDH 62
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
I K G + I Y+FWN HE G+++F G+NDI F + G+Y+ +R GPYVC
Sbjct: 63 RIEMCKALGMNTICIYIFWNIHEQEEGKFDFTGQNDIAAFCRAAQKHGMYVIVRPGPYVC 122
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEW GG P WL + RT + + E + F+K++ + L +GG IIM+Q+
Sbjct: 123 AEWEMGGLPWWLLKKKDVALRTLDPYYMERVGIFMKEVGKQLAP--LQVNKGGNIIMVQV 180
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLG-AGVPWVMCK-----QTDAPENIIDACN-- 252
ENEYG SYG K YV + G VP C +A +++I N
Sbjct: 181 ENEYG----SYGTD-KPYVSAVRDLVRESGFTDVPLFQCDWSSNFTRNALDDLIWTINFG 235
Query: 253 -GYYCD----GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
G D K P + +E W GW+ WG + RP +D+ + R SF
Sbjct: 236 TGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKEMLDRNISF 295
Query: 308 MNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+ YM GGT FG G P Y +SYDYDAPI E G +E K+ L+DL
Sbjct: 296 -SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KYYLLRDL 346
Score = 45.1 bits (105), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 30/126 (23%), Positives = 53/126 (42%), Gaps = 30/126 (23%)
Query: 655 IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYR 714
+P+ +Y++ F D + LD+ + GKG WVNGH +GR+W +
Sbjct: 526 LPTMPAYYRSSFKL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------- 570
Query: 715 GAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEIS-VKLRSTRIVCE 773
P QT + +P WL+ N +++ + G I +K ++ E
Sbjct: 571 -------------GPQQTLF-IPGCWLKEGENEILVLDLKGPTKSSIKGLKKPILDVLRE 616
Query: 774 QVSESH 779
+ E+H
Sbjct: 617 KAPETH 622
>gi|153808925|ref|ZP_01961593.1| hypothetical protein BACCAC_03226 [Bacteroides caccae ATCC 43185]
gi|149128258|gb|EDM19477.1| glycosyl hydrolase family 35 [Bacteroides caccae ATCC 43185]
Length = 778
Score = 163 bits (412), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 115/353 (32%), Positives = 169/353 (47%), Gaps = 29/353 (8%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
++ ++++ + SS+ A T + F + ++DG ++ +A +HY R W
Sbjct: 5 LIALLVLFTVVIFSSAQAQTTARKFEAGKN--TFLLDGEPFVVKAAELHYTRIPQAYWEH 62
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
I K G + I Y+FWN HE G+++F G+NDI F + G+Y+ +R GPYVC
Sbjct: 63 RIEMCKTLGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGPYVC 122
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEW GG P WL + RT + + E + F+K++ + L +GG IIM+Q+
Sbjct: 123 AEWEMGGLPWWLLKKKDVALRTLDPYYMERVGIFMKEVGKQLAP--LQVNKGGNIIMVQV 180
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLG-AGVPWVMCK-----QTDAPENIIDACN-- 252
ENEY SSY K YV + G VP C +A E+++ N
Sbjct: 181 ENEY----SSYATD-KPYVAAVRDLVRESGFTDVPLFQCDWSSNFTNNALEDLLWTVNFG 235
Query: 253 -GYYCD----GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
G D K P + +E W GW+ WG + RP +D+ + R SF
Sbjct: 236 TGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF 295
Query: 308 MNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+ YM GGT FG G P Y +SYDYDAPI E G +E K+ L+DL
Sbjct: 296 -SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KYFLLRDL 346
Score = 47.0 bits (110), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 57/128 (44%), Gaps = 34/128 (26%)
Query: 655 IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYR 714
+P+ +YKT F D + LD+ + GKG WVNGH +GR+W +
Sbjct: 526 LPAMPAYYKTTFKL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------- 570
Query: 715 GAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVK-LRS--TRIV 771
P QT + +P WL+ N +++ + G P + S+K L+ ++
Sbjct: 571 -------------GPQQTLF-MPGCWLKEGENEILVLDLKG--PVKASIKGLKKPLLDVL 614
Query: 772 CEQVSESH 779
E+ E+H
Sbjct: 615 REKAPETH 622
>gi|29349062|ref|NP_812565.1| beta-galactosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|383124327|ref|ZP_09944991.1| hypothetical protein BSIG_3645 [Bacteroides sp. 1_1_6]
gi|29340969|gb|AAO78759.1| beta-galactosidase precursor [Bacteroides thetaiotaomicron
VPI-5482]
gi|251839176|gb|EES67260.1| hypothetical protein BSIG_3645 [Bacteroides sp. 1_1_6]
Length = 778
Score = 163 bits (412), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 116/353 (32%), Positives = 169/353 (47%), Gaps = 29/353 (8%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
+ ++++ + SS+ A T + F + ++DG ++ +A +HY R W
Sbjct: 5 FIALLVLFTVIFFSSAEAQTTARKFEAGKN--TFLLDGKPFVVKAAELHYTRIPQAYWDH 62
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
I K G + I Y+FWN HE G+++F G+NDI F + G+Y+ +R GPYVC
Sbjct: 63 RIEMCKALGMNTICIYIFWNIHEQEEGKFDFTGQNDIAAFCRAAQKHGMYVIVRPGPYVC 122
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEW GG P WL + RT + + E + F+K++ + L +GG IIM+Q+
Sbjct: 123 AEWEMGGLPWWLLKKKDVALRTLDPYYMERVGIFMKEVGKQLAP--LQVNKGGNIIMVQV 180
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLG-AGVPWVMCK-----QTDAPENIIDACN-- 252
ENEYG SYG K YV + G VP C +A +++I N
Sbjct: 181 ENEYG----SYGTD-KPYVSAVRDLVRESGFTDVPLFQCDWSSNFTRNALDDLIWTINFG 235
Query: 253 -GYYCD----GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
G D K P + +E W GW+ WG + RP +D+ + R SF
Sbjct: 236 TGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKEMLDRNISF 295
Query: 308 MNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+ YM GGT FG G P Y +SYDYDAPI E G +E K+ L+DL
Sbjct: 296 -SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KYYLLRDL 346
Score = 43.5 bits (101), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 30/126 (23%), Positives = 52/126 (41%), Gaps = 30/126 (23%)
Query: 655 IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYR 714
+P +Y++ F D + LD+ + GKG WVNGH +GR+W +
Sbjct: 526 LPIMPAYYRSSFKL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------- 570
Query: 715 GAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEIS-VKLRSTRIVCE 773
P QT + +P WL+ N +++ + G I +K ++ E
Sbjct: 571 -------------GPQQTLF-IPGCWLKEGENEILVLDLKGPTKSSIKGLKKPILDVLRE 616
Query: 774 QVSESH 779
+ E+H
Sbjct: 617 KAPETH 622
>gi|397498227|ref|XP_003819886.1| PREDICTED: beta-galactosidase-1-like protein 3 [Pan paniscus]
Length = 653
Score = 163 bits (412), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 105/321 (32%), Positives = 162/321 (50%), Gaps = 27/321 (8%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G++ ++ IHY R E W D + K K G + + TYV WN HE RG+++F G
Sbjct: 82 LEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSGNL 141
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ FV + GL++ LR GPY+C+E + GG P WL P + RT N F E ++++
Sbjct: 142 DLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPRLLLRTTNKSFIEAVEKYF 201
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGA---- 231
++ R L QGGP+I +Q+ENEYG+ + K Y+ + L G
Sbjct: 202 DHLIP--RVIPLQYRQGGPVIAVQVENEYGSF-----NKDKTYMPYLHKALLRRGIVELL 254
Query: 232 ----GVPWVMCKQTD---APENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGR 284
G V+ T A N+ + +K +KP L E W GW+ WG +
Sbjct: 255 LTSDGEKHVLSGHTKGVLAAINLQKLHQDTFNQLHKIQR-DKPLLIMEYWVGWFDRWGDK 313
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAPI 338
+ +++ AV+ F + SF N YM+ GGTNFG +G ++ +TSYDYDA +
Sbjct: 314 HHVKDAKEVEHAVSEFIKYEISF-NVYMFHGGTNFGFMNGATYFGKHSGIVTSYDYDAVL 372
Query: 339 DEYGLLSEPKWGHLKDLHAAI 359
E G +E K+ L+ L ++
Sbjct: 373 TEAGDYTE-KYLKLQKLFQSV 392
>gi|417991864|ref|ZP_12632235.1| beta-galactosidase 3 [Lactobacillus casei CRF28]
gi|410534805|gb|EKQ09440.1| beta-galactosidase 3 [Lactobacillus casei CRF28]
Length = 598
Score = 163 bits (412), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 115/362 (31%), Positives = 172/362 (47%), Gaps = 43/362 (11%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
S DH ++DG ++S IHY R P W + K G + +ETYV WN HE G
Sbjct: 5 SIDHE-FMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYNEG 63
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
++F G DI F+ GLY +R PY+CAEW FGGFP WL + RT+++ +
Sbjct: 64 DFDFSGILDIEHFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWLL-TKKMRLRTDDSAY 122
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
+ + R+ ++ + + GG +IM+Q+ENEYG SYG+ KDY+ A +
Sbjct: 123 LQAIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYG----SYGED-KDYLAAVAELMK 175
Query: 228 GLGAGVPW---------VMCKQTDAPENIIDACN-GYYCDGY--------KPNSYNKPTL 269
G VP + + A I+ N G + D + + ++ P +
Sbjct: 176 KHGVDVPLFTSDGPWPATLNAGSMADAGILTTGNFGSHADMNFDRLAAFNQAHGHDWPLM 235
Query: 270 WTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-- 327
E WDGW+ WG + R E+ A + QRG +N YM+ GGTNFG +G
Sbjct: 236 CMEFWDGWFNRWGEPIIRRDPEETAEDLRAVIQRGS--VNLYMFHGGTNFGFMNGTSARK 293
Query: 328 -----YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAI-------KLCEPALVAADSAQYI 375
+TSYDYDAP++E G + + K +H + L +PA+ AD+
Sbjct: 294 DHDLPQVTSYDYDAPLNEQGNPTPKYFAIQKMIHEVLPSQAQTTPLVKPAMRQADNPLTA 353
Query: 376 KL 377
K+
Sbjct: 354 KV 355
>gi|423301385|ref|ZP_17279409.1| hypothetical protein HMPREF1057_02550 [Bacteroides finegoldii
CL09T03C10]
gi|408471986|gb|EKJ90515.1| hypothetical protein HMPREF1057_02550 [Bacteroides finegoldii
CL09T03C10]
Length = 779
Score = 163 bits (412), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 118/353 (33%), Positives = 169/353 (47%), Gaps = 29/353 (8%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
+ ++++ VSS+ A T + F + ++DG ++ +A +HY R W
Sbjct: 6 IALLVLFTVTFFVSSAQAQTTARKFEAGKN--TFLLDGKPFVVKAAELHYTRIPQAYWEH 63
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
I K G + I Y+FWN HE G+++F G+NDI F + G+Y+ +R GPYVC
Sbjct: 64 RIEMCKALGMNTICIYIFWNIHEQEEGKFDFTGQNDIAAFCRAAQKHGMYVIVRPGPYVC 123
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEW GG P WL I RT + + E + F+K++ + L +GG IIM+Q+
Sbjct: 124 AEWEMGGLPWWLLKKKDIALRTLDPYYMERVGIFMKEVGKQLAP--LQVNKGGNIIMVQV 181
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLG-AGVPWVMCK-----QTDAPENIIDACN-- 252
ENEYG SYG K YV + G VP C +A +++I N
Sbjct: 182 ENEYG----SYGIN-KPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNFG 236
Query: 253 -GYYCD----GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
G D K P + +E W GW+ WG + RP +D+ + R SF
Sbjct: 237 TGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF 296
Query: 308 MNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+ YM GGT FG G P Y +SYDYDAPI E G +E K+ L+DL
Sbjct: 297 -SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KYFLLRDL 347
Score = 43.9 bits (102), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 56/128 (43%), Gaps = 34/128 (26%)
Query: 655 IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYR 714
+P+ +YK F D + LD+ + GKG WVNGH +GR+W +
Sbjct: 527 LPTMPAYYKGTFKL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------- 571
Query: 715 GAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVK-LRS--TRIV 771
P QT + +P WL+ N +++ + G P + S+K L+ ++
Sbjct: 572 -------------GPQQTLF-MPGCWLKKGENEILVLDLKG--PAKASIKGLKKPILDVL 615
Query: 772 CEQVSESH 779
E+ E+H
Sbjct: 616 REKAPETH 623
>gi|333384209|ref|ZP_08475850.1| hypothetical protein HMPREF9455_04016 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826788|gb|EGJ99602.1| hypothetical protein HMPREF9455_04016 [Dysgonomonas gadei ATCC
BAA-286]
Length = 632
Score = 163 bits (412), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 112/330 (33%), Positives = 167/330 (50%), Gaps = 39/330 (11%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
+ DG +IS +HYPR + W + K G + + TYVFWNAHE G+++F
Sbjct: 38 VYDGKPVRIISGEMHYPRIPHQYWRHRMQMLKAMGLNAVATYVFWNAHEPEPGKWDFTED 97
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
++ +++K+ G GL + LR GPYVCAEW FGG+P WL+++ +E R +N F + Q +
Sbjct: 98 KNLAEYIKIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNVEEMELRRDNEQFLKYTQLY 157
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKD-----YVKWAASMALGL 229
+ ++ + L +GGPIIM+Q ENE+G SY Q KD + ++ A + L
Sbjct: 158 INRLYQEVGN--LQITKGGPIIMVQAENEFG----SYVSQRKDIPLEEHRRYNAKIVQQL 211
Query: 230 ---GAGVP-------WVMCKQTDAPENIIDACNGYY-CDGYKP--NSYNK---PTLWTEN 273
G +P W+ + A + NG D K N YN P + E
Sbjct: 212 KTAGFDIPSFTSDGSWLF--EGGAVPGALPTANGESNIDNLKKVVNRYNGGQGPYMVAEF 269
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------ 327
+ GW W P +A ++ Q S +NYYM GGTNFG TSG +
Sbjct: 270 YPGWLAHWVEPHPQVSATSVARQTEKYLQNDVS-INYYMVHGGTNFGFTSGANYDKKHDI 328
Query: 328 --YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+TSYDYDAP+ E G ++ PK+ L+++
Sbjct: 329 QPDLTSYDYDAPVSEAGWVT-PKFDSLRNV 357
Score = 44.7 bits (104), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 73/331 (22%), Positives = 130/331 (39%), Gaps = 77/331 (23%)
Query: 449 KTVEFSLPLSPN----ISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHL 504
K V+++LP +P+ I +P S+ K+++ M K +ENN + E L
Sbjct: 360 KYVDYTLPEAPSAIDLIEIP--SIRLDKVATLEG--MDFKT-----TENNTPL--TFEQL 408
Query: 505 NVTKDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHW 564
N Y Y H Q + T+ I +RD V+ N + G + ++
Sbjct: 409 NQGYGYVLYRKHFNQ--------------PISGTLEIKGLRDYATVYTNDEKAGELNRYF 454
Query: 565 VKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILW 624
K ++ + L +L + +G NYG+ + + G V++ D+++
Sbjct: 455 NKYTMDIDVPFN-STLEILVENMGRINYGSEIIHNTKGIISPVRIN-----DMEIEGGWQ 508
Query: 625 TYQVGLK-----GEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDL 679
+ + + Q + NE+ L G P YK F+ + D +++
Sbjct: 509 MISIPMDKAPDFSKMDQASVYDNNESAIKSLA--GKP---VLYKGTFNLTETGD-TFINM 562
Query: 680 GSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRS 739
GKG ++NG +IGRYW V P QT Y +P
Sbjct: 563 EDWGKGIIFINGKNIGRYWYV---------------------------GPQQTLY-IPGV 594
Query: 740 WLQASNNLLVIFEETGGNPFEISVKLRSTRI 770
WL+ N ++IFE+ P ++R+T++
Sbjct: 595 WLKKGENKIIIFEQLNDKP---HTEVRTTKV 622
>gi|160885481|ref|ZP_02066484.1| hypothetical protein BACOVA_03481 [Bacteroides ovatus ATCC 8483]
gi|423290348|ref|ZP_17269197.1| hypothetical protein HMPREF1069_04240 [Bacteroides ovatus
CL02T12C04]
gi|156109103|gb|EDO10848.1| glycosyl hydrolase family 35 [Bacteroides ovatus ATCC 8483]
gi|392665735|gb|EIY59258.1| hypothetical protein HMPREF1069_04240 [Bacteroides ovatus
CL02T12C04]
Length = 778
Score = 163 bits (412), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 115/353 (32%), Positives = 170/353 (48%), Gaps = 29/353 (8%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
++ ++++ + S++ A T + F + ++DG ++ +A +HY R W
Sbjct: 5 LIALLVLFTVIFFSTAQAQTTARKFEAGKN--TFLLDGKPFVVKAAELHYTRIPQAYWEH 62
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
I K G + I Y+FWN HE G+++F G+NDI F + G+Y+ +R GPYVC
Sbjct: 63 RIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGPYVC 122
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEW GG P WL + RT + + E + F+K++ + L +GG IIM+Q+
Sbjct: 123 AEWEMGGLPWWLLKKKDVALRTLDPYYMERVGIFMKEVGKQLAP--LQVNKGGNIIMVQV 180
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLG-AGVPWVMCK-----QTDAPENIIDACN-- 252
ENEYG SYG K YV + G VP C +A +++I N
Sbjct: 181 ENEYG----SYGTD-KPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNFG 235
Query: 253 -GYYCD----GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
G D K P + +E W GW+ WG + RP +D+ + R SF
Sbjct: 236 TGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF 295
Query: 308 MNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+ YM GGT FG G P Y +SYDYDAPI E G +E K+ L+DL
Sbjct: 296 -SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KFFLLRDL 346
Score = 44.7 bits (104), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 31/126 (24%), Positives = 53/126 (42%), Gaps = 30/126 (23%)
Query: 655 IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYR 714
+P+ +YK+ F D + LD+ + GKG WVNGH +GR+W +
Sbjct: 526 LPTMPAYYKSTFTL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------- 570
Query: 715 GAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEIS-VKLRSTRIVCE 773
P QT + +P WL+ N +++ + G I +K ++ E
Sbjct: 571 -------------GPQQTLF-MPGCWLKEGENEILVLDLKGPTRASIKGLKKPILDVLRE 616
Query: 774 QVSESH 779
+ E+H
Sbjct: 617 KAPETH 622
>gi|148273884|ref|YP_001223445.1| putative beta-galactosidase [Clavibacter michiganensis subsp.
michiganensis NCPPB 382]
gi|147831814|emb|CAN02784.1| putative beta-galactosidase [Clavibacter michiganensis subsp.
michiganensis NCPPB 382]
Length = 599
Score = 163 bits (412), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 102/305 (33%), Positives = 153/305 (50%), Gaps = 26/305 (8%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG +I+ +HY R P+ W D I K++ G D IETYV WNAH RG ++
Sbjct: 20 LDGRPHRVIAGALHYFRVHPDQWADRIRKARLMGLDTIETYVAWNAHSPERGAFDTSAGL 79
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ +F+ LV + G++ +R GPY+CAEW+ GG P WL + P + R + + + F+
Sbjct: 80 DLGRFLDLVHAEGMHAIVRPGPYICAEWDGGGLPGWLFEDPAVGVRRSEPLYLAAVDEFL 139
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
+++ +++ + GGP+I++QIENEYG +YG DY++ + G VP
Sbjct: 140 RRVYEIVAPRQID--MGGPVILVQIENEYG----AYGDDA-DYLRHLVDLTRESGIIVPL 192
Query: 236 VMCKQ-TDA--PENIIDACNGYYCDGYKPNSY---------NKPTLWTENWDGWYTTWGG 283
Q TD +D + G + P + +E WDGW+ WG
Sbjct: 193 TTVDQPTDEMLSRGSLDELHRTGSFGSRATERLATLRRHQPTGPLMCSEFWDGWFDHWGE 252
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------YITSYDYDAP 337
H A A G+ +N YM+ GGTNFG T+G ++TSYDYDAP
Sbjct: 253 HH-HTTSAADAAAELDALLAAGASVNIYMFHGGTNFGFTNGANHKGTYQSHVTSYDYDAP 311
Query: 338 IDEYG 342
+DE G
Sbjct: 312 LDETG 316
>gi|322390566|ref|ZP_08064082.1| beta-galactosidase [Streptococcus parasanguinis ATCC 903]
gi|321142719|gb|EFX38181.1| beta-galactosidase [Streptococcus parasanguinis ATCC 903]
Length = 595
Score = 163 bits (412), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 107/318 (33%), Positives = 161/318 (50%), Gaps = 41/318 (12%)
Query: 53 AIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFK 112
A + G ++S IHY R P W + K G + +ETYV WNAHE +GQ++F
Sbjct: 9 AFYLKGQPFKILSGAIHYFRIDPADWYHSLYNLKALGFNTVETYVPWNAHEPRKGQFDFS 68
Query: 113 GKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQ 172
G+ D+ +F++ S GLY+ +R P++CAEW FGG P WL + + R+++ F E +
Sbjct: 69 GRLDLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWLLE-EDLRIRSSDPAFIEAVD 127
Query: 173 RFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAG 232
R+ +++ L+ + QGGPI+M+Q+ENEYG SYG+ KDY++ + G
Sbjct: 128 RYYDRLLGLLTPYQVD--QGGPILMMQVENEYG----SYGED-KDYLRAIRDLMKEKGVT 180
Query: 233 VPWVMCKQTDAP------------ENIIDACN----GYYCDGYKPNSYNK-----PTLWT 271
P +D P E++ N Y G +++ P +
Sbjct: 181 CPLFT---SDGPWRATLRAGTLIEEDLFVTGNFGSKAAYNFGQMKEFFDEYGKRWPLMCM 237
Query: 272 ENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG----GPF 327
E WDGW+T W + R E+LA AV + G +N YM+ GGTNFG +G G
Sbjct: 238 EFWDGWFTRWKEPVIQRDPEELAEAVHEVLELGS--INLYMFHGGTNFGFMNGCSARGTL 295
Query: 328 ---YITSYDYDAPIDEYG 342
+TSYDY A ++E G
Sbjct: 296 DLPQVTSYDYGALLNEQG 313
Score = 41.2 bits (95), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 29/118 (24%), Positives = 48/118 (40%), Gaps = 29/118 (24%)
Query: 638 YSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
Y ++ + D T++ +Y+ F +D LD+ GKG ++NGH++GR+
Sbjct: 485 YPLDLQDLSQLDFTKEWQAGAPAFYRYDFQLDHTLD-TYLDMTGFGKGVVFINGHNLGRF 543
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
W V PT + Y VP +L+ N L++FE G
Sbjct: 544 WEV---------------------------GPTTSLY-VPHGFLKEGANSLIVFETEG 573
>gi|332838248|ref|XP_001156615.2| PREDICTED: galactosidase, beta 1-like 3 [Pan troglodytes]
Length = 653
Score = 163 bits (412), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 105/321 (32%), Positives = 162/321 (50%), Gaps = 27/321 (8%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G++ ++ IHY R E W D + K K G + + TYV WN HE RG+++F G
Sbjct: 82 LEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSGNL 141
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ FV + GL++ LR GPY+C+E + GG P WL P + RT N F E ++++
Sbjct: 142 DLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPRLLLRTTNKSFIEAVEKYF 201
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGA---- 231
++ R L QGGP+I +Q+ENEYG+ + K Y+ + L G
Sbjct: 202 DHLIP--RVIPLQYRQGGPVIAVQVENEYGSF-----NKDKTYMPYLHKALLRRGIVELL 254
Query: 232 ----GVPWVMCKQTD---APENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGR 284
G V+ T A N+ + +K +KP L E W GW+ WG +
Sbjct: 255 LTSDGEKHVLSGHTKGVLAAINLQKLHQDTFNQLHKVQR-DKPLLIMEYWVGWFDRWGDK 313
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAPI 338
+ +++ AV+ F + SF N YM+ GGTNFG +G ++ +TSYDYDA +
Sbjct: 314 HHVKDAKEVEHAVSEFIKYEISF-NVYMFHGGTNFGFMNGATYFGKHSGIVTSYDYDAVL 372
Query: 339 DEYGLLSEPKWGHLKDLHAAI 359
E G +E K+ L+ L ++
Sbjct: 373 TEAGDYTE-KYLKLQKLFQSV 392
>gi|418004004|ref|ZP_12644053.1| beta-galactosidase 3 [Lactobacillus casei UW1]
gi|410551057|gb|EKQ25134.1| beta-galactosidase 3 [Lactobacillus casei UW1]
Length = 598
Score = 163 bits (412), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 115/362 (31%), Positives = 172/362 (47%), Gaps = 43/362 (11%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
S DH ++DG ++S IHY R P W + K G + +ETYV WN HE G
Sbjct: 5 SIDHE-FMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYNEG 63
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
++F G DI +F+ GLY +R PY+CAEW FGGFP WL + RT+++ +
Sbjct: 64 DFDFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWLL-TKKMRLRTDDSAY 122
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
+ + R+ ++ + + GG +IM+Q+ENEYG SYG+ KDY+ A +
Sbjct: 123 LQAIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYG----SYGED-KDYLAAVAELMK 175
Query: 228 GLGAGVPW---------VMCKQTDAPENIIDACN-GYYCDGY--------KPNSYNKPTL 269
G VP + + A I+ N G D + + ++ P +
Sbjct: 176 KHGVDVPLFTSDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLM 235
Query: 270 WTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-- 327
E WDGW+ WG + R E+ A + QRG +N YM+ GGTNFG +G
Sbjct: 236 CMEFWDGWFNRWGEPIIRRDPEETAEDLRAVIQRGS--VNLYMFHGGTNFGFMNGTSARK 293
Query: 328 -----YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAI-------KLCEPALVAADSAQYI 375
+TSYDYDAP++E G + + K +H + L +PA+ AD+
Sbjct: 294 DHDLPQVTSYDYDAPLNEQGNPTPKYFAIQKMIHEVLPSQAQTTPLVKPAMRQADNPLTA 353
Query: 376 KL 377
K+
Sbjct: 354 KV 355
>gi|449672638|ref|XP_002158331.2| PREDICTED: beta-galactosidase-1-like protein 2-like [Hydra
magnipapillata]
Length = 476
Score = 163 bits (412), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 113/357 (31%), Positives = 173/357 (48%), Gaps = 34/357 (9%)
Query: 22 MMMMMMIHLSCVSS----SSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEM 77
++M + +L SS S A+ P + + R + + ++S +HY R
Sbjct: 16 ILMCVFAYLFLFSSFEMTSDANRIQAPEGLKVNGRNFTLKREKFRIMSGSMHYFRIPFRK 75
Query: 78 WPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN-DIVKFVKLVGSSGLYLQLRIG 136
W D + K K G + ++ Y+ WN HE G ++F ++ +F+ L+ GLY +R G
Sbjct: 76 WSDRLLKLKAMGLNTVDIYIPWNLHEPEPGHFDFSSDQLNLSEFLYLLQGYGLYAVIRPG 135
Query: 137 PYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPII 196
PY+CAE + GG P WL ++ R+ F E ++R+ K++ ++ + FS+ GGPII
Sbjct: 136 PYICAELDLGGLPSWLLRDKNMKLRSLYPGFIEPVERYFKQLFAIL-QPFQFSY-GGPII 193
Query: 197 MLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDA-----PENIIDAC 251
QIENEYG + Q +Y+K+ + + G + +C E ++
Sbjct: 194 AFQIENEYGVYD-----QDVNYMKYLKEIYISNGLSELFFVCDNKQGLGKYKLEGVLQTI 248
Query: 252 NGYYCDG------YKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGG 305
N + D + +KP TE WDGW+ WG D A A+ +RG
Sbjct: 249 NFMWLDAKGMIDKLEAVQPDKPVFVTELWDGWFDHWGENHHIVKTADAALALEYVIKRGA 308
Query: 306 SFMNYYMYFGGTNFGRTSG------GPFY---ITSYDYDAPIDEYGLLSEPKWGHLK 353
SF N YM+ GGTNFG +G G Y ITSYDYDAP+ E G LS+ K+ LK
Sbjct: 309 SF-NLYMFHGGTNFGFINGANANNDGSNYQSTITSYDYDAPVSETGHLSQ-KFDELK 363
>gi|69247392|ref|ZP_00604336.1| Beta-galactosidase [Enterococcus faecium DO]
gi|256619331|ref|ZP_05476177.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|384518861|ref|YP_005706166.1| beta-galactosidase [Enterococcus faecalis 62]
gi|389870025|ref|YP_006377575.1| beta-galactosidase [Enterococcus faecium DO]
gi|68194864|gb|EAN09337.1| Beta-galactosidase [Enterococcus faecium DO]
gi|256598858|gb|EEU18034.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|309385841|gb|ADO66768.1| beta-galactosidase [Enterococcus faecium]
gi|323480994|gb|ADX80433.1| beta-galactosidase [Enterococcus faecalis 62]
gi|388535404|gb|AFK60593.1| beta-galactosidase [Enterococcus faecium DO]
Length = 592
Score = 163 bits (412), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 113/351 (32%), Positives = 172/351 (49%), Gaps = 38/351 (10%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
++ G ++S IHY R P W + K G + +ETYV WN HE +G+++F+G
Sbjct: 11 LLKGKTFKILSGAIHYFRIPPCDWEHSLYNLKALGFNTVETYVPWNLHEPQKGEFHFEGI 70
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
D+ +F+ + GLY +R PY+CAEW FGGFP WL P I R N + E + +
Sbjct: 71 LDLERFLTIAQDLGLYAIVRPSPYICAEWEFGGFPSWLLREP-IHIRRNEIAYLEHVADY 129
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
++ + L + GG I+M+QIENEYG S+G++ K+Y++ + + G VP
Sbjct: 130 YDVLMKRIVPHQLNN--GGNILMIQIENEYG----SFGEE-KEYLRAIRDLMIKRGVTVP 182
Query: 235 WVMC----KQTDAPENIID---ACNGYYCDGYKPN---------SYNK--PTLWTENWDG 276
+ + T ++I+ G + K N Y+K P + E WDG
Sbjct: 183 FFTSDGPWRATLRAGSMIEDDILVTGNFGSKAKDNFNSMKQFFKEYDKNWPLMCMEFWDG 242
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY-------I 329
W+ W + R ++LA AV ++G +N YM+ GGTNFG +G I
Sbjct: 243 WFNRWKEPIIQRDPQELAEAVKEVLEQGS--INLYMFHGGTNFGFMNGCSARGVIDLPQI 300
Query: 330 TSYDYDAPIDEYGLLSEPKWGHLKDLH---AAIKLCEPALVAADSAQYIKL 377
TSYDY AP+DE G +E + K +H IK +P + + I L
Sbjct: 301 TSYDYGAPLDEQGNPTEKYYALRKMIHDNYPEIKQLDPVIKPTIEKKKISL 351
>gi|423220237|ref|ZP_17206732.1| hypothetical protein HMPREF1061_03505 [Bacteroides caccae
CL03T12C61]
gi|392623314|gb|EIY17417.1| hypothetical protein HMPREF1061_03505 [Bacteroides caccae
CL03T12C61]
Length = 778
Score = 162 bits (411), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 115/353 (32%), Positives = 169/353 (47%), Gaps = 29/353 (8%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
++ ++++ + SS+ A T + F + ++DG ++ +A +HY R W
Sbjct: 5 LIALLVLFTVVIFSSAQAQTTARKFEAGKN--TFLLDGEPFVVKAAELHYTRIPQAYWEH 62
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
I K G + I Y+FWN HE G+++F G+NDI F + G+Y+ +R GPYVC
Sbjct: 63 RIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGPYVC 122
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEW GG P WL + RT + + E + F+K++ + L +GG IIM+Q+
Sbjct: 123 AEWEMGGLPWWLLKKKDVALRTLDPYYMERVGIFMKEVGKQLAP--LQVNKGGNIIMVQV 180
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLG-AGVPWVMCK-----QTDAPENIIDACN-- 252
ENEY SSY K YV + G VP C +A E+++ N
Sbjct: 181 ENEY----SSYATD-KPYVAAVRDLVRESGFTDVPLFQCDWSSNFTNNALEDLLWTVNFG 235
Query: 253 -GYYCD----GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
G D K P + +E W GW+ WG + RP +D+ + R SF
Sbjct: 236 TGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF 295
Query: 308 MNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+ YM GGT FG G P Y +SYDYDAPI E G +E K+ L+DL
Sbjct: 296 -SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KYFLLRDL 346
Score = 46.6 bits (109), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 57/128 (44%), Gaps = 34/128 (26%)
Query: 655 IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYR 714
+P+ +YKT F D + LD+ + GKG WVNGH +GR+W +
Sbjct: 526 LPAMPAYYKTTFKL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------- 570
Query: 715 GAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVK-LRS--TRIV 771
P QT + +P WL+ N +++ + G P + S+K L+ ++
Sbjct: 571 -------------GPQQTLF-MPGCWLKEGENEILVLDLKG--PAKASIKGLKKPLLDVL 614
Query: 772 CEQVSESH 779
E+ E+H
Sbjct: 615 REKAPETH 622
>gi|255692586|ref|ZP_05416261.1| beta-galactosidase [Bacteroides finegoldii DSM 17565]
gi|260621643|gb|EEX44514.1| glycosyl hydrolase family 35 [Bacteroides finegoldii DSM 17565]
Length = 779
Score = 162 bits (411), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 118/353 (33%), Positives = 169/353 (47%), Gaps = 29/353 (8%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
+ ++++ VSS+ A T + F + ++DG ++ +A +HY R W
Sbjct: 6 IALLVLFTVTFFVSSAQAQTTARKFEAGKN--TFLLDGKPFVVKAAELHYTRIPQAYWEH 63
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
I K G + I Y+FWN HE G+++F G+NDI F + G+Y+ +R GPYVC
Sbjct: 64 RIEMCKALGMNTICIYIFWNIHEQEEGKFDFTGQNDIAAFCRAAQKHGMYVIVRPGPYVC 123
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEW GG P WL I RT + + E + F+K++ + L +GG IIM+Q+
Sbjct: 124 AEWEMGGLPWWLLKKRDIALRTLDPYYMERVGIFMKEVGKQLAP--LQVNKGGNIIMVQV 181
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLG-AGVPWVMCK-----QTDAPENIIDACN-- 252
ENEYG SYG K YV + G VP C +A +++I N
Sbjct: 182 ENEYG----SYGIN-KPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNFG 236
Query: 253 -GYYCD----GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
G D K P + +E W GW+ WG + RP +D+ + R SF
Sbjct: 237 TGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF 296
Query: 308 MNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+ YM GGT FG G P Y +SYDYDAPI E G +E K+ L+DL
Sbjct: 297 -SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KYFLLRDL 347
Score = 43.9 bits (102), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 56/128 (43%), Gaps = 34/128 (26%)
Query: 655 IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYR 714
+P+ +YK F D + LD+ + GKG WVNGH +GR+W +
Sbjct: 527 LPTMPAYYKGTFKL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------- 571
Query: 715 GAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVK-LRS--TRIV 771
P QT + +P WL+ N +++ + G P + S+K L+ ++
Sbjct: 572 -------------GPQQTLF-MPGCWLKKGENEILVLDLKG--PAKASIKGLKKPILDVL 615
Query: 772 CEQVSESH 779
E+ E+H
Sbjct: 616 REKAPETH 623
>gi|158301280|ref|XP_550752.3| AGAP002055-PA [Anopheles gambiae str. PEST]
gi|157012394|gb|EAL38488.3| AGAP002055-PA [Anopheles gambiae str. PEST]
Length = 657
Score = 162 bits (411), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 110/330 (33%), Positives = 165/330 (50%), Gaps = 41/330 (12%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
+ F + Y+ ++DG ++ HY RA PE W + + GG + ++ YV W+ H
Sbjct: 41 RSFKIDYERDTFVMDGKDFRYVAGSFHYFRALPETWRTKLRTLRAGGLNAVDLYVQWSLH 100
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWL-RDIPGIEFR 161
G YN++G ++ ++ LY+ LR GPY+CAE + GG P WL PGI R
Sbjct: 101 NPRDGVYNWEGIANVTDIIEAAIEEDLYVILRPGPYICAEIDNGGLPYWLFNKYPGIAVR 160
Query: 162 TNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYV-- 219
T++A + EE++++ +++ M M + GGPIIM+QIENEYG ++G+ K Y+
Sbjct: 161 TSDANYLEEVRKWYGELMSRMEPYMYGN--GGPIIMVQIENEYG----AFGKCDKPYLNF 214
Query: 220 ------KWAASMALGLGAGVPW---VMCKQTDA----------PENIIDACNGYYCDGYK 260
++ A+ P+ + C Q D E +D + Y+
Sbjct: 215 LKQQTERYVQDKAVLFTVDRPYDDEIGCGQIDGVFITTDFGLMTEEEVD-THAAKVRSYQ 273
Query: 261 PNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG 320
P P + TE + GW T W RP + LA A R R G +++YMYFGGTNFG
Sbjct: 274 PKG---PLVNTEFYTGWLTHWQESNQRRPAQPLA-ATLRKMLRDGWNVDFYMYFGGTNFG 329
Query: 321 RTSG------GPFY--ITSYDYDAPIDEYG 342
+G G + ITSYDYDAP+DE G
Sbjct: 330 FWAGANDWGLGKYMADITSYDYDAPMDEAG 359
>gi|381169756|ref|ZP_09878919.1| beta-galactosidase [Xanthomonas citri pv. mangiferaeindicae LMG
941]
gi|380689774|emb|CCG35406.1| beta-galactosidase [Xanthomonas citri pv. mangiferaeindicae LMG
941]
Length = 613
Score = 162 bits (411), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 116/383 (30%), Positives = 179/383 (46%), Gaps = 47/383 (12%)
Query: 20 PMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWP 79
P+ +++ + + + +A+ + N DG L+S IH+ R W
Sbjct: 5 PLAPLVLALAFALPITGTAAETERWPNFGTQGTQFARDGKPYQLLSGAIHFQRIPRAYWK 64
Query: 80 DLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYV 139
D + K++ G + +ETYVFWN E +GQ++F G ND+ FV+ + GL + LR GPY
Sbjct: 65 DRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGHNDVAAFVREAAAQGLNVILRPGPYA 124
Query: 140 CAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQ 199
CAEW GG+P WL I R+ + F Q ++ + + ++ L + GGPII +Q
Sbjct: 125 CAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALANQVQP--LLNHNGGPIIAVQ 182
Query: 200 IENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCD-- 257
+ENEYG SY D+ A + A+ + AG + +D + + NG D
Sbjct: 183 VENEYG----SY---ADDHAYMADNRAMYVKAGFDKALLFTSDGADML---ANGTLPDTL 232
Query: 258 -------GYKPNSYNK--------PTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQ 302
G ++++K P + E W GW+ WG PH + A A F+
Sbjct: 233 AVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGK--PHAATD--ARQQAEEFE 288
Query: 303 ---RGGSFMNYYMYFGGTNFGRTSGGPF----------YITSYDYDAPIDEYGLLSEPKW 349
R G N YM+ GGT+FG +G F TSYDYDA +DE G + PK+
Sbjct: 289 WILRQGHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHPT-PKF 347
Query: 350 GHLKDLHAAIKLCEPALVAADSA 372
++D A + +P + A A
Sbjct: 348 ALMRDAIARVTGVQPPALPAPIA 370
>gi|418518035|ref|ZP_13084189.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB1386]
gi|410705285|gb|EKQ63761.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB1386]
Length = 613
Score = 162 bits (411), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 117/380 (30%), Positives = 179/380 (47%), Gaps = 48/380 (12%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
+++ + ++ ++A T P N + DG L+S IH+ R W D +
Sbjct: 9 LVLALAFALPITGTAAETERWP-NFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAYWKDRL 67
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K++ G + +ETYVFWN E +GQ++F G ND+ FV+ + GL + LR GPY CAE
Sbjct: 68 QKARALGLNTVETYVFWNLVEPQQGQFDFSGHNDVAAFVREAAAQGLNVILRPGPYACAE 127
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
W GG+P WL I R+ + F Q ++ + + ++ L + GGPII +Q+EN
Sbjct: 128 WEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALANQVQP--LLNHNGGPIIAVQVEN 185
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCD----- 257
EYG SY D+ A + A+ + AG + +D + + NG D
Sbjct: 186 EYG----SY---ADDHAYMADNRAMYVKAGFDKALLFTSDGADML---ANGTLPDTLAVV 235
Query: 258 ----GYKPNSYNK--------PTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQ--- 302
G ++++K P + E W GW+ WG PH + A A F+
Sbjct: 236 NFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGK--PHAATD--ARQQAEEFEWIL 291
Query: 303 RGGSFMNYYMYFGGTNFGRTSGGPF----------YITSYDYDAPIDEYGLLSEPKWGHL 352
R G N YM+ GGT+FG +G F TSYDYDA +DE G + PK+ +
Sbjct: 292 RQGHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHPT-PKFALM 350
Query: 353 KDLHAAIKLCEPALVAADSA 372
+D A + +P + A A
Sbjct: 351 RDAIARVTGVQPPALPAPIA 370
>gi|414156558|ref|ZP_11412859.1| hypothetical protein HMPREF9186_01279 [Streptococcus sp. F0442]
gi|410869551|gb|EKS17511.1| hypothetical protein HMPREF9186_01279 [Streptococcus sp. F0442]
Length = 595
Score = 162 bits (411), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 106/312 (33%), Positives = 160/312 (51%), Gaps = 35/312 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+ G ++S IHY R P W + K G + +ETYV WNAHE +GQ++F G+
Sbjct: 12 LKGQPFKILSGAIHYFRIDPTDWYHSLYNLKALGFNTVETYVPWNAHEPKKGQFDFSGRL 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ +F++ S GLY+ +R P++CAEW FGG P WL + + R+++ F E + R+
Sbjct: 72 DLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWLLE-EDLRIRSSDPAFIEAIDRYY 130
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV-- 233
+++ L+ + +GGPI+M+Q+ENEYG SYG+ KDY++ + G
Sbjct: 131 DRLLGLLTPYQVD--RGGPILMMQVENEYG----SYGED-KDYLRAIRDLMKEKGVTCPL 183
Query: 234 -----PWVMCKQTDA--PENIIDACN----GYYCDGYKPNSYNK-----PTLWTENWDGW 277
PW +T E++ N Y G +N+ P + E WDGW
Sbjct: 184 FTSDGPWRATLRTGTLIEEDLFVTGNFGSKAAYNFGQMKEFFNEYGKKWPLMCMEFWDGW 243
Query: 278 YTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG----GPF---YIT 330
+T W + R E+LA AV + G +N YM+ GGTNFG +G G +T
Sbjct: 244 FTRWKEPVIQRDPEELAEAVHEVLELGS--INLYMFHGGTNFGFMNGCSARGTLDLPQVT 301
Query: 331 SYDYDAPIDEYG 342
SYDY A ++E G
Sbjct: 302 SYDYGALLNEQG 313
Score = 40.4 bits (93), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 29/118 (24%), Positives = 48/118 (40%), Gaps = 29/118 (24%)
Query: 638 YSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
Y ++ + D +++ +Y+ F +D LD+ GKG +VNGH++GR+
Sbjct: 485 YPLDLQDLSQLDFSKEWQAGAPAFYRYDFQLDHTLD-TYLDMTGFGKGVVFVNGHNLGRF 543
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
W V PT + Y VP +L+ N L++FE G
Sbjct: 544 WEV---------------------------GPTTSLY-VPHGFLKEGANSLIVFETEG 573
>gi|156375241|ref|XP_001629990.1| predicted protein [Nematostella vectensis]
gi|156217002|gb|EDO37927.1| predicted protein [Nematostella vectensis]
Length = 578
Score = 162 bits (411), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 100/299 (33%), Positives = 152/299 (50%), Gaps = 24/299 (8%)
Query: 75 PEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLR 134
PE W D + K K G + +ETYV WN HE ++ + FK + DIVKFV L GL++ +R
Sbjct: 2 PEYWADRLKKLKAMGLNTVETYVAWNLHEQVKENFKFKDEVDIVKFVNLAQELGLHVIIR 61
Query: 135 IGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGP 194
GPY+C+EW+ GG P WL + P + R+ PF E ++++ K+ L+ + FS +GGP
Sbjct: 62 PGPYICSEWDLGGLPSWLLNDPNMRLRSTYGPFMEAVEKYFSKLFALLT-PLQFS-RGGP 119
Query: 195 IIMLQIENEYGNMESSYGQQ-----GKDYVKWAASMALGLGAGVPWVMCKQTDAPENIID 249
II Q+ENEY +++ K +K A+ L V +
Sbjct: 120 IIAWQVENEYASVQEEVDNHYMELLHKLMLKNGATELLFTSDDVGYTKRYPIKLDGGKYM 179
Query: 250 ACNGYYC--DGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
+ N ++C ++P +KP + TE W GW+ WG + E + G+
Sbjct: 180 SFNKWFCLFLHFQP---DKPIMVTEYWSGWFDHWGEKHHVLNTERKMINEVKDILDMGAS 236
Query: 308 MNYYMYFGGTNFGRTSGGPFY-----------ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+N+YM+ GGTNFG +G +TSYDYDAP+ E G ++ PK+ L+ L
Sbjct: 237 INFYMFHGGTNFGFMNGANTAGNRIDDGYQPDVTSYDYDAPLSEAGDIT-PKYKALRKL 294
>gi|21243811|ref|NP_643393.1| beta-galactosidase [Xanthomonas axonopodis pv. citri str. 306]
gi|390989312|ref|ZP_10259611.1| beta-galactosidase [Xanthomonas axonopodis pv. punicae str. LMG
859]
gi|21109406|gb|AAM37929.1| beta-galactosidase [Xanthomonas axonopodis pv. citri str. 306]
gi|372556070|emb|CCF66586.1| beta-galactosidase [Xanthomonas axonopodis pv. punicae str. LMG
859]
Length = 613
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 117/380 (30%), Positives = 179/380 (47%), Gaps = 48/380 (12%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
+++ + ++ ++A T P N + DG L+S IH+ R W D +
Sbjct: 9 LVLALAFALPITGTAAETERWP-NFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAYWKDRL 67
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K++ G + +ETYVFWN E +GQ++F G ND+ FV+ + GL + LR GPY CAE
Sbjct: 68 QKARALGLNTVETYVFWNLVEPQQGQFDFSGHNDVAAFVREAAAQGLNVILRPGPYACAE 127
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
W GG+P WL I R+ + F Q ++ + + ++ L + GGPII +Q+EN
Sbjct: 128 WEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALANQVQP--LLNHNGGPIIAVQVEN 185
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCD----- 257
EYG SY D+ A + A+ + AG + +D + + NG D
Sbjct: 186 EYG----SY---ADDHAYMADNRAMYVKAGFDKALLFTSDGADML---ANGTLPDTLAVV 235
Query: 258 ----GYKPNSYNK--------PTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQ--- 302
G ++++K P + E W GW+ WG PH + A A F+
Sbjct: 236 NFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGK--PHAATD--ARQQAEEFEWIL 291
Query: 303 RGGSFMNYYMYFGGTNFGRTSGGPF----------YITSYDYDAPIDEYGLLSEPKWGHL 352
R G N YM+ GGT+FG +G F TSYDYDA +DE G + PK+ +
Sbjct: 292 RQGHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHPT-PKFALM 350
Query: 353 KDLHAAIKLCEPALVAADSA 372
+D A + +P + A A
Sbjct: 351 RDAIARVTGVQPPALPAPIA 370
>gi|294627330|ref|ZP_06705916.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 11122]
gi|292598412|gb|EFF42563.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 11122]
Length = 613
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 117/380 (30%), Positives = 179/380 (47%), Gaps = 48/380 (12%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
+++ + ++ ++A T P N + DG L+S IH+ R W D +
Sbjct: 9 LVLALAFALPITGAAADTERWP-NFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAYWKDRL 67
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K++ G + +ETYVFWN E +GQ++F G ND+ FV+ + GL + LR GPY CAE
Sbjct: 68 QKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVREAAAQGLNVILRPGPYACAE 127
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
W GG+P WL I R+ + F Q ++ + + ++ L + GGPII +Q+EN
Sbjct: 128 WEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALANQVQP--LLNHNGGPIIAVQVEN 185
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCD----- 257
EYG SY D+ A + A+ + AG + +D + + NG D
Sbjct: 186 EYG----SY---ADDHAYMADNRAMYVKAGFDKALLFTSDGADML---ANGTLPDTLAVV 235
Query: 258 ----GYKPNSYNK--------PTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQ--- 302
G ++++K P + E W GW+ WG PH + A A F+
Sbjct: 236 NFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGK--PHAATD--ARQQAEEFEWIL 291
Query: 303 RGGSFMNYYMYFGGTNFGRTSGGPF----------YITSYDYDAPIDEYGLLSEPKWGHL 352
R G N YM+ GGT+FG +G F TSYDYDA +DE G + PK+ +
Sbjct: 292 RQGHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHPT-PKFALM 350
Query: 353 KDLHAAIKLCEPALVAADSA 372
+D A + +P + A A
Sbjct: 351 RDAIARVTGIQPPALPATIA 370
>gi|329962091|ref|ZP_08300102.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
gi|328530739|gb|EGF57597.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
Length = 632
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 118/378 (31%), Positives = 177/378 (46%), Gaps = 66/378 (17%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
+ DG +IS +HY R + W + K G + + TYVFWN HE G+++F G
Sbjct: 35 VYDGKAIRIISGEMHYARIPHQYWRHRMKMLKAMGLNAVATYVFWNLHEPEPGKWDFSGD 94
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
++ +++++ G GL + LR GPYVCAEW FGG+P WL+++ G+E R +N E+ ++
Sbjct: 95 RNLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNVEGMELRRDN----EQFLKY 150
Query: 175 VKKIVDLMREEM--LFSWQGGPIIMLQIENEYGNMESSYGQQGKD------------YVK 220
K ++ + +E+ L QGGPIIM+Q ENE+G SY Q KD +K
Sbjct: 151 TKLYLERLYKEVGKLQITQGGPIIMVQGENEFG----SYVSQRKDITLEEHRAYNAKIIK 206
Query: 221 WAASMALGL------------GAGVPWVM--CKQTDAPENIIDACNGYYCDGYKPNSYNK 266
+ + G VP + + EN+ N Y N
Sbjct: 207 QLKEVGFDVPMFTSDGSWLFEGGYVPGALPTANGENNIENLKKVVNQY-------NGGQG 259
Query: 267 PTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGP 326
P + E + GW W P +A ++ G SF NYYM GGTNFG TSG
Sbjct: 260 PYMVAEFYPGWLAHWCEPHPQVKASTIARQTEKYLANGVSF-NYYMVHGGTNFGFTSGAN 318
Query: 327 F--------YITSYDYDAPIDEYGLLSEPKWGHLKDL------------HAAIKLCE-PA 365
+ +TSYDYDAPI E G ++ PK+ ++++ A L E P+
Sbjct: 319 YDKKHDIQPDLTSYDYDAPISEAGWVT-PKFDSIRNVIKRYVDYPLPEAPKAFPLIEIPS 377
Query: 366 LVAADSAQYIKLGQNQEA 383
+ A + + + QEA
Sbjct: 378 IELQQVADLLAITETQEA 395
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 58/217 (26%), Positives = 92/217 (42%), Gaps = 39/217 (17%)
Query: 539 VTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEK 598
+TI+ +RD V+++G+ G + + K +E N L +L + +G NYG+ +
Sbjct: 428 LTIEGLRDYATVYVDGEFVGRLNRYNKKYSMDIEIPFNGN-LEILVENMGRINYGSEIVH 486
Query: 599 DGAGFRGQVKLT-GFKNGDIDLSKILWTYQVGL-KGEFQQIYSIEENEAEWTDLTRDGIP 656
+ G VK+ F G+ +++K+ + K + SI + A G P
Sbjct: 487 NNKGIISPVKIDDNFIEGEWEMTKLPMSEVPAFEKMPANTVTSIMGSSAN----ALVGKP 542
Query: 657 STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGA 716
S YK F + D LD+ GKG +VNG +IGRYW V
Sbjct: 543 SL---YKGTFTLQETGD-TFLDMKDWGKGIVFVNGINIGRYWQV---------------- 582
Query: 717 YNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEE 753
P QT + VP WL+ N +VIF++
Sbjct: 583 -----------GPQQTLF-VPGVWLKKGINEIVIFDQ 607
>gi|315647882|ref|ZP_07900983.1| Beta-galactosidase [Paenibacillus vortex V453]
gi|315276528|gb|EFU39871.1| Beta-galactosidase [Paenibacillus vortex V453]
Length = 587
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 98/293 (33%), Positives = 145/293 (49%), Gaps = 16/293 (5%)
Query: 63 LISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVK 122
++S +HY R PE W D + K K G + +ETY+ WN HE GQ+ F G D+ FV+
Sbjct: 21 ILSGAVHYFRIVPEYWEDRLMKLKACGFNTVETYIPWNLHEPKEGQFTFDGIADLEGFVQ 80
Query: 123 LVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLM 182
G GL++ LR PY+CAEW FGG P WL P I R + + E++ + +++
Sbjct: 81 KAGHLGLHVILRPSPYICAEWEFGGLPAWLLQYPDIHLRCMDPVYLEKVDHYYDELIP-- 138
Query: 183 REEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQ 240
R L + +GGP+I +QIENEYG+ +++Y + KD + L + P Q
Sbjct: 139 RIVPLLTSKGGPVIAIQIENEYGSYGNDTAYLEYLKDGLSARGVDVLLFTSDGPTDGMLQ 198
Query: 241 TDAPENIIDACN-----GYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAF 295
N++ N G + P + E W+GW+ W R E++A
Sbjct: 199 GGTVPNVLATVNFGSRPGEAFAKLREYRTEDPLMCMEYWNGWFDHWLKPHHTRSSEEVAQ 258
Query: 296 AVARFFQRGGSFMNYYMYFGGTNFGRTSGG------PFYITSYDYDAPIDEYG 342
+ S +N+YM+ GGTNFG +G +TSYDYDAP+ E G
Sbjct: 259 VFEEMLRLNAS-VNFYMFHGGTNFGFYNGANDQEKYEPTVTSYDYDAPLSECG 310
>gi|417988603|ref|ZP_12629136.1| beta-galactosidase 3 [Lactobacillus casei A2-362]
gi|417997907|ref|ZP_12638140.1| beta-galactosidase 3 [Lactobacillus casei T71499]
gi|418015108|ref|ZP_12654689.1| beta-galactosidase 3 [Lactobacillus casei Lpc-37]
gi|410541233|gb|EKQ15720.1| beta-galactosidase 3 [Lactobacillus casei A2-362]
gi|410542248|gb|EKQ16704.1| beta-galactosidase 3 [Lactobacillus casei T71499]
gi|410552187|gb|EKQ26219.1| beta-galactosidase 3 [Lactobacillus casei Lpc-37]
Length = 598
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 115/362 (31%), Positives = 171/362 (47%), Gaps = 43/362 (11%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
S DH ++DG ++S IHY R P W + K G + +ETYV WN HE G
Sbjct: 5 SIDHE-FMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYSEG 63
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
++F G DI +F+ GLY +R PY+CAEW FGGFP WL + RT++ +
Sbjct: 64 DFDFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWLL-TKKMRLRTDDPAY 122
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
+ + R+ ++ + + GG +IM+Q+ENEYG SYG+ KDY+ A +
Sbjct: 123 LQAIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYG----SYGED-KDYLAAVAELMK 175
Query: 228 GLGAGVPW---------VMCKQTDAPENIIDACN-GYYCDGY--------KPNSYNKPTL 269
G VP + + A I+ N G D + + ++ P +
Sbjct: 176 KHGVDVPLFTSDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLM 235
Query: 270 WTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-- 327
E WDGW+ WG + R E+ A + QRG +N YM+ GGTNFG +G
Sbjct: 236 CMEFWDGWFNRWGEPIIRRDPEETAENLRAVIQRGS--VNLYMFHGGTNFGFMNGTSARK 293
Query: 328 -----YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAI-------KLCEPALVAADSAQYI 375
+TSYDYDAP++E G + + K +H + L +PA+ AD+
Sbjct: 294 DHDLPQVTSYDYDAPLNEQGNPTPKYFAIQKMIHEVLPSQAQTTPLVKPAMRQADNPLTA 353
Query: 376 KL 377
K+
Sbjct: 354 KV 355
>gi|355567243|gb|EHH23622.1| hypothetical protein EGK_07120 [Macaca mulatta]
Length = 653
Score = 162 bits (410), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 103/323 (31%), Positives = 162/323 (50%), Gaps = 31/323 (9%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G+R ++ IHY R E W D + K + G + + TYV WN HE RG+++F G
Sbjct: 82 LEGHRFLICGGSIHYFRVPREYWRDRLLKLRACGFNTVTTYVPWNLHEPERGKFDFSGNL 141
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ FV + GL++ LR GPY+C+E + GG P WL P + RT N F E ++++
Sbjct: 142 DLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPRLLLRTTNKGFTEAVEKYF 201
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
++ R L QGGP+I +Q+ENEYG+ + K Y+ + L G
Sbjct: 202 DHLIP--RVIPLQYRQGGPVIAVQVENEYGSF-----NKDKTYMPYLHKALLRRGI---V 251
Query: 236 VMCKQTDAPENIIDA-----CNGYYCDGYKPNSYN--------KPTLWTENWDGWYTTWG 282
+ +D +N++ + N++N KP L E W GW+ WG
Sbjct: 252 ELLLTSDGEKNVLSGHTKGVLAAINLQKVQRNTFNQLHKVQRDKPLLVMEYWVGWFDRWG 311
Query: 283 GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDA 336
+ + +++ AV+ F + SF N YM+ GGTNFG +G + +TSYDYDA
Sbjct: 312 DKHHVKDAKEVEHAVSEFIKYEISF-NVYMFHGGTNFGFMNGATNFGKHTGIVTSYDYDA 370
Query: 337 PIDEYGLLSEPKWGHLKDLHAAI 359
+ E G +E K+ L+ L ++
Sbjct: 371 VLTEAGDYTE-KYFKLQKLLESV 392
>gi|319900291|ref|YP_004160019.1| Beta-galactosidase [Bacteroides helcogenes P 36-108]
gi|319415322|gb|ADV42433.1| Beta-galactosidase [Bacteroides helcogenes P 36-108]
Length = 629
Score = 162 bits (410), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 106/337 (31%), Positives = 166/337 (49%), Gaps = 37/337 (10%)
Query: 44 PFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHE 103
PF + H ++G + ++S +HY R + W + K G + + TYVFWN HE
Sbjct: 28 PFEIKDGH--FYLNGKQTPILSGEMHYARIPHQYWRHRLQMMKGMGLNAVATYVFWNHHE 85
Query: 104 SIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTN 163
+ G+++F G ++ +++K G G+ + LR GPYVCAEW FGG+P WL+++PG+E R +
Sbjct: 86 TEPGKWDFTGDKNLAEYIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVPGMEIRRD 145
Query: 164 NAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKD-----Y 218
N F + + +++++ + L +GGPI+M+Q ENE+G SY Q KD +
Sbjct: 146 NPQFLKHTEAYIQRLYKEVGH--LQCTKGGPIVMVQCENEFG----SYVAQRKDITLQEH 199
Query: 219 VKWAASMALGL---GAGVPWVMCK-----QTDAPENIIDACNGYYCDGYKPNSYNK---- 266
+ A + L G VP + + E + NG N+
Sbjct: 200 RAYNAKIKQQLADAGFDVPLFTSDGSWLFEGGSTEGALPTANGETDIANLKKVVNQYHGG 259
Query: 267 --PTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG 324
P + E + GW + W P +A + + SF N YM GGTNFG TSG
Sbjct: 260 QGPYMVAEFYPGWLSHWAEPFPQVSASSVARTTESYLKNDVSF-NVYMVHGGTNFGFTSG 318
Query: 325 GPF--------YITSYDYDAPIDEYGLLSEPKWGHLK 353
+ +TSYDYDAPI E G ++ PK+ ++
Sbjct: 319 ANYDKKRDIQPDLTSYDYDAPISEAGWVT-PKYDSIR 354
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 86/223 (38%), Gaps = 40/223 (17%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYN-DLILLSQTVGLQNYGAFL 596
T+ ID +RD V+I+G+ G V+ + +E +N L +L + +G NYG+ +
Sbjct: 429 TLQIDGLRDYAVVYIDGEKAG-VLNRNTQTYS-MEIDVPFNATLQILVENMGRINYGSEI 486
Query: 597 EKDGAGFRGQVKLTGFK-NGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGI 655
+ G V + G + G ++ + + + Y +A +
Sbjct: 487 VHNTKGIISPVTIGGKEITGGWNMYPLPMSKAPEAAKAGRNAYPNTSAQAGKLKGSPVAY 546
Query: 656 PSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRG 715
TFT +T +D+ GKG +VNG +IGRYW
Sbjct: 547 EGTFTLNRT--------GDTFIDMEDWGKGIIFVNGINIGRYWQA--------------- 583
Query: 716 AYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNP 758
P QT Y +P WL+ N +VIFE+ P
Sbjct: 584 ------------GPQQTLY-IPGVWLKKGENKIVIFEQLNEKP 613
>gi|384939972|gb|AFI33591.1| beta-galactosidase-1-like protein 3 [Macaca mulatta]
gi|387541294|gb|AFJ71274.1| beta-galactosidase-1-like protein 3 [Macaca mulatta]
Length = 653
Score = 162 bits (410), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 103/323 (31%), Positives = 162/323 (50%), Gaps = 31/323 (9%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G+R ++ IHY R E W D + K + G + + TYV WN HE RG+++F G
Sbjct: 82 LEGHRFLICGGSIHYFRVPREYWRDRLLKLRACGFNTVTTYVPWNLHEPERGKFDFSGNL 141
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ FV + GL++ LR GPY+C+E + GG P WL P + RT N F E ++++
Sbjct: 142 DLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPRLLLRTTNKGFTEAVEKYF 201
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
++ R L QGGP+I +Q+ENEYG+ + K Y+ + L G
Sbjct: 202 DHLIP--RVIPLQYRQGGPVIAVQVENEYGSF-----NKDKTYMPYLHKALLRRGI---V 251
Query: 236 VMCKQTDAPENIIDA-----CNGYYCDGYKPNSYN--------KPTLWTENWDGWYTTWG 282
+ +D +N++ + N++N KP L E W GW+ WG
Sbjct: 252 ELLLTSDGEKNVLSGHTKGVLAAINLQKVQRNTFNQLHKVQRDKPLLVMEYWVGWFDRWG 311
Query: 283 GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDA 336
+ + +++ AV+ F + SF N YM+ GGTNFG +G + +TSYDYDA
Sbjct: 312 DKHHVKDAKEVEHAVSEFIKYEISF-NVYMFHGGTNFGFMNGATNFGKHTGIVTSYDYDA 370
Query: 337 PIDEYGLLSEPKWGHLKDLHAAI 359
+ E G +E K+ L+ L ++
Sbjct: 371 VLTEAGDYTE-KYFKLQKLLESV 392
>gi|417994975|ref|ZP_12635282.1| beta-galactosidase 3 [Lactobacillus casei M36]
gi|410539221|gb|EKQ13758.1| beta-galactosidase 3 [Lactobacillus casei M36]
Length = 598
Score = 162 bits (410), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 115/362 (31%), Positives = 171/362 (47%), Gaps = 43/362 (11%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
S DH ++DG ++S IHY R P W + K G + +ETYV WN HE G
Sbjct: 5 SIDHE-FMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYSEG 63
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
++F G DI +F+ GLY +R PY+CAEW FGGFP WL + RT++ +
Sbjct: 64 DFDFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWLL-TKKMRLRTDDPAY 122
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
+ + R+ ++ + + GG +IM+Q+ENEYG SYG+ KDY+ A +
Sbjct: 123 LQAIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYG----SYGED-KDYLAAVAELMK 175
Query: 228 GLGAGVPW---------VMCKQTDAPENIIDACN-GYYCDGY--------KPNSYNKPTL 269
G VP + + A I+ N G D + + ++ P +
Sbjct: 176 KHGVDVPLFTSDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLM 235
Query: 270 WTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-- 327
E WDGW+ WG + R E+ A + QRG +N YM+ GGTNFG +G
Sbjct: 236 CMEFWDGWFNRWGEPIIRRDPEETAENLRAVIQRGS--VNLYMFHGGTNFGFMNGTSARK 293
Query: 328 -----YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAI-------KLCEPALVAADSAQYI 375
+TSYDYDAP++E G + + K +H + L +PA+ AD+
Sbjct: 294 DHDLPQVTSYDYDAPLNEQGNPTPKYFAIQKMIHEVLPSQAQTTPLVKPAIRQADNPLTA 353
Query: 376 KL 377
K+
Sbjct: 354 KV 355
>gi|422729668|ref|ZP_16786066.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
gi|315149788|gb|EFT93804.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
Length = 604
Score = 162 bits (410), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 114/342 (33%), Positives = 170/342 (49%), Gaps = 45/342 (13%)
Query: 63 LISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVK 122
++S IHY R P W + K G + +ETYV WN HE +G ++F+G D+ +F+K
Sbjct: 29 ILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLK 88
Query: 123 LVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLM 182
L GLY +R PY+CAEW FGGFP WL + PG R+NN + + + + +++ +
Sbjct: 89 LAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEYYDVLMEKI 147
Query: 183 REEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTD 242
L + GG I+M+QIENEYG S+G++ K Y++ + + G P+ +D
Sbjct: 148 VPHQLAN--GGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGVTAPFFT---SD 197
Query: 243 AP------------ENIIDACN---------GYYCDGYKPNSYNKPTLWTENWDGWYTTW 281
P ++I+ N G ++ + P + E WDGW+ W
Sbjct: 198 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 257
Query: 282 GGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNF----GRTSGGPF---YITSYDY 334
+ R ++LA +V G +N YM+ GGTNF G ++ G ITSYDY
Sbjct: 258 KEPIIKRDPQELAESVREALALGS--INLYMFHGGTNFEFMNGCSARGTIDLPQITSYDY 315
Query: 335 DAPIDEYGLLSEPKWGHLKDLHA---AIKLCEPALVAADSAQ 373
DAP+DE G +E + K LH A+ EP LV AQ
Sbjct: 316 DAPLDEQGNPTEKYFALQKMLHEEYPALPQAEP-LVKDSFAQ 356
Score = 40.0 bits (92), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 90/223 (40%), Gaps = 53/223 (23%)
Query: 545 RDVLRVFING--QLTG--SVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD L++F+N Q T + IG + V+ E N + +L + +G NYG L D
Sbjct: 417 RDRLQLFVNQIHQATQYQTEIGEDIYVILSQE----NNQIDVLMENMGRVNYGHKLFAD- 471
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
+ G + G + + ++QQ Y + E D +R+ P +
Sbjct: 472 ------TQKKGIRTGVMA--------DLHFMTQWQQ-YCLPMTSCEQVDYSREWQPDQPS 516
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ + + + D +D+ GKG +VN ++GR+W V
Sbjct: 517 FYQYHLELAEVKD-TFIDVSKFGKGIVFVNQTNLGRFWNV-------------------- 555
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+ L+ N +VIFE G EI +
Sbjct: 556 -------GPTLSLY-IPKGLLKEGQNEIVIFETEGTYQPEIQL 590
>gi|417985674|ref|ZP_12626256.1| beta-galactosidase 3 [Lactobacillus casei 32G]
gi|410527574|gb|EKQ02437.1| beta-galactosidase 3 [Lactobacillus casei 32G]
Length = 598
Score = 162 bits (410), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 115/362 (31%), Positives = 171/362 (47%), Gaps = 43/362 (11%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
S DH ++DG ++S IHY R P W + K G + +ETYV WN HE G
Sbjct: 5 SIDHE-FMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYNEG 63
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
++F G DI +F+ GLY +R PY+CAEW FGGFP WL + RT++ +
Sbjct: 64 DFDFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWLL-TKKMRLRTDDPAY 122
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
+ + R+ ++ + + GG +IM+Q+ENEYG SYG+ KDY+ A +
Sbjct: 123 LQAIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYG----SYGED-KDYLAAVAELMK 175
Query: 228 GLGAGVPW---------VMCKQTDAPENIIDACN-GYYCDGY--------KPNSYNKPTL 269
G VP + + A I+ N G D + + ++ P +
Sbjct: 176 KHGVDVPLFTSDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLM 235
Query: 270 WTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-- 327
E WDGW+ WG + R E+ A + QRG +N YM+ GGTNFG +G
Sbjct: 236 CMEFWDGWFNRWGEPIIRRDPEETAEDLRAVIQRGS--VNLYMFHGGTNFGFMNGTSARK 293
Query: 328 -----YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAI-------KLCEPALVAADSAQYI 375
+TSYDYDAP++E G + + K +H + L +PA+ AD+
Sbjct: 294 DHDLPQVTSYDYDAPLNEQGNPTPKYFAIQKMIHEVLPSQAQTTPLVKPAMRQADNPLTA 353
Query: 376 KL 377
K+
Sbjct: 354 KV 355
>gi|334138027|ref|ZP_08511451.1| beta-galactosidase [Paenibacillus sp. HGF7]
gi|333604560|gb|EGL15950.1| beta-galactosidase [Paenibacillus sp. HGF7]
Length = 601
Score = 162 bits (410), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 106/305 (34%), Positives = 151/305 (49%), Gaps = 34/305 (11%)
Query: 63 LISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVK 122
+IS +HY R PE W D + K K G + +ETYV WN HE G+++F G D++ FV+
Sbjct: 20 IISGALHYFRVVPEYWRDRLLKMKACGCNTVETYVAWNVHEPEEGKFDFGGIADVIAFVE 79
Query: 123 LVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLM 182
L G GL++ +R PY+CAEW FGG P WL ++ R ++ F ++ + V L
Sbjct: 80 LAGELGLHVIVRPSPYICAEWEFGGLPAWLLKDSEMQLRCSDPKFLAKVDAYYD--VLLP 137
Query: 183 REEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTD 242
+ L GGPII +Q+ENEYG SYG K Y+ + + G V + +D
Sbjct: 138 KFVPLLCTNGGPIIAMQVENEYG----SYGND-KAYLGYLRDGMIARGIDV---LLFTSD 189
Query: 243 APENIIDACNGYYCD-------GYKPNSY---------NKPTLWTENWDGWYTTWGGRLP 286
P + + G D G +P ++P + E W+GW+ W
Sbjct: 190 GPTDEM-LQGGTLPDVLATVNFGSRPEESFAKFREYRPDEPLMCMEFWNGWFDHWMEEHH 248
Query: 287 HRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAPIDE 340
R ED A + G S +N+YM+ GGTNFG SG +TSYDYDAP+ E
Sbjct: 249 TRDGEDAARVLDDMLGAGAS-VNFYMFHGGTNFGFYSGANHIKTYEPTVTSYDYDAPLTE 307
Query: 341 YGLLS 345
G L+
Sbjct: 308 RGDLT 312
Score = 43.9 bits (102), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 60/225 (26%), Positives = 93/225 (41%), Gaps = 44/225 (19%)
Query: 541 IDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
I +RD +VF++G G+V W ++ +G L +L + +G NYG L +D
Sbjct: 399 IQDVRDRAQVFLDGSYIGAV-ERWDVRPLKLDVPAGGARLDILVENMGRVNYGPLL-RDH 456
Query: 601 AGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFT 660
G V+L D+ + GL EF E+ D+T +
Sbjct: 457 KGITEGVRLDNQFQYGWDIYPLPLDSLEGL--EFGTAAGPED-----ADVTGE----RPA 505
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+Y+ +F+A + D L L KG A+VNG ++GRYW RG
Sbjct: 506 FYRGFFEAEEAAD-TFLRLEGWTKGVAYVNGFNLGRYWE--------------RG----- 545
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKL 765
P ++ Y VP L+ N +V+FE G +SV+L
Sbjct: 546 --------PQKSLY-VPGPLLRKGTNEIVLFELHGTK--RLSVRL 579
>gi|418000981|ref|ZP_12641151.1| beta-galactosidase 3 [Lactobacillus casei UCD174]
gi|418009807|ref|ZP_12649594.1| beta-galactosidase 3 [Lactobacillus casei Lc-10]
gi|410548851|gb|EKQ23035.1| beta-galactosidase 3 [Lactobacillus casei UCD174]
gi|410554934|gb|EKQ28899.1| beta-galactosidase 3 [Lactobacillus casei Lc-10]
Length = 598
Score = 162 bits (409), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 115/362 (31%), Positives = 171/362 (47%), Gaps = 43/362 (11%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
S DH ++DG ++S IHY R P W + K G + +ETYV WN HE G
Sbjct: 5 SIDHE-FMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYNEG 63
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
++F G DI +F+ GLY +R PY+CAEW FGGFP WL + RT++ +
Sbjct: 64 DFDFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWLL-TKKMRLRTDDPAY 122
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
+ + R+ ++ + + GG +IM+Q+ENEYG SYG+ KDY+ A +
Sbjct: 123 LQAIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYG----SYGED-KDYLAAVAELMK 175
Query: 228 GLGAGVPW---------VMCKQTDAPENIIDACN-GYYCDGY--------KPNSYNKPTL 269
G VP + + A I+ N G D + + ++ P +
Sbjct: 176 KHGVDVPLFTSDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLM 235
Query: 270 WTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-- 327
E WDGW+ WG + R E+ A + QRG +N YM+ GGTNFG +G
Sbjct: 236 CMEFWDGWFNRWGEPIIRRDPEETAEDLRAVIQRGS--VNLYMFHGGTNFGFMNGTSARK 293
Query: 328 -----YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAI-------KLCEPALVAADSAQYI 375
+TSYDYDAP++E G + + K +H + L +PA+ AD+
Sbjct: 294 DHDLPQVTSYDYDAPLNEQGNPTPKYFAIQKMIHEVLPSQAQTTPLVKPAMRQADNPLTA 353
Query: 376 KL 377
K+
Sbjct: 354 KV 355
>gi|1352080|sp|P48982.1|BGAL_XANMN RecName: Full=Beta-galactosidase; Short=Lactase; Flags: Precursor
gi|1045034|gb|AAC41485.1| beta-galactosidase [Xanthomonas axonopodis pv. manihotis]
Length = 598
Score = 162 bits (409), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 111/346 (32%), Positives = 164/346 (47%), Gaps = 47/346 (13%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+ DG L+S IH+ R W D + K++ G + +ETYVFWN E +GQ++F G
Sbjct: 37 FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 96
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
ND+ FVK + GL + LR GPY CAEW GG+P WL I R+ + F Q
Sbjct: 97 NNDVAAFVKEAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQA 156
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
++ + ++ L + GGPII +Q+ENEYG+ D+ A + A+ + AG
Sbjct: 157 YLDALAKQVQP--LLNHNGGPIIAVQVENEYGSY-------ADDHAYMADNRAMYVKAGF 207
Query: 234 PWVMCKQTDAPENIIDACNGYYCD---------GYKPNSYNK--------PTLWTENWDG 276
+ +D + + NG D G ++++K P + E W G
Sbjct: 208 DKALLFTSDGADML---ANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAG 264
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQ---RGGSFMNYYMYFGGTNFGRTSGGPF------ 327
W+ WG PH + A A F+ R G N YM+ GGT+FG +G F
Sbjct: 265 WFDHWGK--PHAATD--ARQQAEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQNNPSD 320
Query: 328 ----YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAA 369
TSYDYDA +DE G + PK+ ++D A + +P + A
Sbjct: 321 HYAPQTTSYDYDAILDEAGHPT-PKFALMRDAIARVTGVQPPALPA 365
>gi|239629323|ref|ZP_04672354.1| glycosyl hydrolase [Lactobacillus paracasei subsp. paracasei
8700:2]
gi|417979668|ref|ZP_12620358.1| beta-galactosidase 3 [Lactobacillus casei 12A]
gi|417982493|ref|ZP_12623148.1| beta-galactosidase 3 [Lactobacillus casei 21/1]
gi|239528009|gb|EEQ67010.1| glycosyl hydrolase [Lactobacillus paracasei subsp. paracasei
8700:2]
gi|410526941|gb|EKQ01818.1| beta-galactosidase 3 [Lactobacillus casei 12A]
gi|410529717|gb|EKQ04508.1| beta-galactosidase 3 [Lactobacillus casei 21/1]
Length = 598
Score = 162 bits (409), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 115/362 (31%), Positives = 171/362 (47%), Gaps = 43/362 (11%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
S DH ++DG ++S IHY R P W + K G + +ETYV WN HE G
Sbjct: 5 SIDHE-FMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYNEG 63
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
++F G DI +F+ GLY +R PY+CAEW FGGFP WL + RT++ +
Sbjct: 64 DFDFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWLL-TKKMRLRTDDPAY 122
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
+ + R+ ++ + + GG +IM+Q+ENEYG SYG+ KDY+ A +
Sbjct: 123 LQAIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYG----SYGED-KDYLAAVAELMK 175
Query: 228 GLGAGVPW---------VMCKQTDAPENIIDACN-GYYCDGY--------KPNSYNKPTL 269
G VP + + A I+ N G D + + ++ P +
Sbjct: 176 KHGVDVPLFTSDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLM 235
Query: 270 WTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY- 328
E WDGW+ WG + R E+ A + QRG +N YM+ GGTNFG +G
Sbjct: 236 CMEFWDGWFNRWGEPIIRRDPEETAEDLRAVIQRGS--VNLYMFHGGTNFGFMNGTSARK 293
Query: 329 ------ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAI-------KLCEPALVAADSAQYI 375
+TSYDYDAP++E G + + K +H + L +PA+ AD+
Sbjct: 294 DHDLPQVTSYDYDAPLNEQGNPTPKYFAIQKMIHEVLPSQPQTAPLVKPAMRQADNPLTA 353
Query: 376 KL 377
K+
Sbjct: 354 KV 355
>gi|423212381|ref|ZP_17198910.1| hypothetical protein HMPREF1074_00442 [Bacteroides xylanisolvens
CL03T12C04]
gi|392694827|gb|EIY88053.1| hypothetical protein HMPREF1074_00442 [Bacteroides xylanisolvens
CL03T12C04]
Length = 725
Score = 162 bits (409), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 103/294 (35%), Positives = 156/294 (53%), Gaps = 31/294 (10%)
Query: 68 IHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSS 127
+HYPR E W D + +++ G + + YVFWN HE G+++F G+ DI +FV+
Sbjct: 1 MHYPRIPHEYWRDRLKRARAMGLNTVSAYVFWNFHERQPGEFDFTGQADIAEFVRTAQEE 60
Query: 128 GLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEML 187
GLY+ LR GPYVCAEW+FGG+P WL + +R+ + F +R++K++ + +
Sbjct: 61 GLYVILRPGPYVCAEWDFGGYPSWLLKEKDMIYRSKDPRFLSYCERYIKELGKQLSSLTI 120
Query: 188 FSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCK---QTDAP 244
+ GG IIM+Q+ENEYG+ + K+Y+ M G VP C Q +A
Sbjct: 121 NN--GGNIIMVQVENEYGSYAAD-----KEYLAAIRDMIKEAGFNVPLFTCDGGGQVEAG 173
Query: 245 --ENIIDACNGYYC-DGYK-PNSYNK--PTLWTENWDGWYTTWGGRLP----HRPVEDLA 294
E + NG + D +K ++Y+K P E + W+ WG R RP E L
Sbjct: 174 HIEGALPTLNGVFGEDIFKVVDNYHKGGPYFVAEFYPAWFDEWGKRHSSVAYERPAEQLD 233
Query: 295 FAVARFFQRGGSFMNYYMYFGGTNF----GRTSGGPF--YITSYDYDAPIDEYG 342
+ ++ G ++ YM+ GGTNF G +GG + TSYDYDAP+ E+G
Sbjct: 234 WMLSH-----GVSVSMYMFHGGTNFWYTNGANTGGGYQPQPTSYDYDAPLGEWG 282
Score = 39.7 bits (91), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 50/228 (21%), Positives = 82/228 (35%), Gaps = 57/228 (25%)
Query: 536 RPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAF 595
+ + I +RD + I+G+ S+ + + + L +L + G NYG
Sbjct: 365 KQKLVIQDLRDYAVILIDGKQVASLDRRYNQNSVTLNVAKTPASLEILVENTGRVNYGPD 424
Query: 596 LEKDGAGFRGQV-----KLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDL 650
+ + G QV KLTG+ + L K + + G + + + +
Sbjct: 425 ILFNRKGITNQVLWGDEKLTGWSITPLPLYKENVS-DINFGGTIKDVPAFHK-------- 475
Query: 651 TRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDT 710
TFT K +D+ GKG WVNG +GR+W +
Sbjct: 476 ------GTFTIQKK--------GDCFVDMSRWGKGAVWVNGKSLGRFWNI---------- 511
Query: 711 CDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE-ETGGN 757
P QT Y +P WL+ N +++FE E GN
Sbjct: 512 -----------------GPQQTLY-LPAPWLKEGENEIIVFEMEDTGN 541
>gi|21232326|ref|NP_638243.1| beta-galactosidase [Xanthomonas campestris pv. campestris str. ATCC
33913]
gi|21114096|gb|AAM42167.1| beta-galactosidase [Xanthomonas campestris pv. campestris str. ATCC
33913]
Length = 613
Score = 162 bits (409), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 110/384 (28%), Positives = 179/384 (46%), Gaps = 55/384 (14%)
Query: 22 MMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDL 81
+++ + I L +++++ + F + DG ++S IH+ R W D
Sbjct: 9 LVLALSIALPITATAASDDQWPTFATQGTQ--FVRDGKPYQVLSGAIHFQRIPRTYWKDR 66
Query: 82 IAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCA 141
+ K++ G + +ETYVFWN E +GQ++F ND+ FV+ + GL + LR GPY CA
Sbjct: 67 LQKARALGLNTVETYVFWNLVEPQQGQFDFNANNDVAAFVREAAAQGLNVILRPGPYACA 126
Query: 142 EWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIE 201
EW GG+P WL I R+ + F Q ++ + +R L + GGPII +Q+E
Sbjct: 127 EWEAGGYPAWLFGKDNIRIRSRDPRFLAASQSYLDAVAQQVRP--LLNHNGGPIIAVQVE 184
Query: 202 NEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGY------- 254
NEYG+ + D+ A + A+ + AG + +D + + NG
Sbjct: 185 NEYGSYDD-------DHAYMADNRAMFVKAGFDKALLFTSDGADML---ANGTLPGTLAV 234
Query: 255 --YCDGYKPNSYNK--------PTLWTENWDGWYTTWGGRLPH------RPVEDLAFAVA 298
+ G ++++K P + E W GW+ WG PH + E+L + +
Sbjct: 235 VNFAPGEAKSAFDKLIKFQPDQPRMVGEYWAGWFDHWG--TPHASTNAKQQTEELEWIL- 291
Query: 299 RFFQRGGSFMNYYMYFGGTNFGRTSGGPF----------YITSYDYDAPIDEYGLLSEPK 348
R G N YM+ GGT+FG +G F TSYDYDA +DE G + PK
Sbjct: 292 ----RQGHSANLYMFIGGTSFGFMNGANFQGNPSDHYAPQTTSYDYDAILDEAGRPT-PK 346
Query: 349 WGHLKDLHAAIKLCEPALVAADSA 372
+ ++D+ + +P + A A
Sbjct: 347 FALMRDVITRVTGVQPPALPAPIA 370
>gi|375146511|ref|YP_005008952.1| glycoside hydrolase family protein [Niastella koreensis GR20-10]
gi|361060557|gb|AEV99548.1| glycoside hydrolase family 35 [Niastella koreensis GR20-10]
Length = 920
Score = 162 bits (409), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 104/310 (33%), Positives = 153/310 (49%), Gaps = 28/310 (9%)
Query: 53 AIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFK 112
A ++DG +IS +HYPR E W D + K+K G + I TYVFWN HE +G+Y+F
Sbjct: 346 AFLLDGQPFQIISGEMHYPRVPREAWRDRMRKAKAMGLNTIGTYVFWNLHEPQKGKYDFS 405
Query: 113 GKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQ 172
G NDI FVK GL++ LR PYVCAEW FGG+P WL++I G+E R+ + + +
Sbjct: 406 GNNDIAAFVKTAQEEGLWVILRPSPYVCAEWEFGGYPYWLQNIKGLEVRSKEPQYLQAYK 465
Query: 173 RFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAG 232
++ ++ + L GG I+M+Q+ENEYG +YG ++Y+ + + G
Sbjct: 466 NYIMQVGKQLAP--LQVNHGGNILMVQVENEYG----AYGSD-REYLDINRRLFIEAGFD 518
Query: 233 VPWVMCK------QTDAPENIIDACNGYYCDG-----YKPNSYNK-PTLWTENWDGWYTT 280
C + + P + + NG K N+ K P E + W+
Sbjct: 519 GLLYTCDPEPFLAKGNLPGKLFTSINGLDKPARIKQLIKQNNEGKGPYFVAEWYPAWFDW 578
Query: 281 WGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF--------YITSY 332
WG + P E + G S +N YM+ GGT +G + I+SY
Sbjct: 579 WGTQHHKVPAEKYTPGLDSVLSAGMS-VNMYMFHGGTTRDFMNGANYNDQNPYEPQISSY 637
Query: 333 DYDAPIDEYG 342
DYDAP+DE G
Sbjct: 638 DYDAPLDEAG 647
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 58/220 (26%), Positives = 87/220 (39%), Gaps = 56/220 (25%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQP-VEFQSGYNDLILLSQTVGLQNYGAFL 596
+ I +RD VFING+ SV+ +K ++ L +L + +G NYG +L
Sbjct: 732 ALKIKDLRDYGLVFINGKRI-SVLDRRLKQDSIWLKLPDEKIQLDILVENLGRINYGPYL 790
Query: 597 EKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIP 656
K+ G V G K L +Q+ K F + S+ ++ T G P
Sbjct: 791 LKNKKGITEGVSFNG---------KELTGWQM-FKLPFNDLNSVALKNSK----TLSGAP 836
Query: 657 ----STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCD 712
TF+ + L+LG+ GKG WVNGH++GRYW +
Sbjct: 837 VLKKGTFSL--------QTVGDTYLNLGNWGKGVVWVNGHNLGRYWNI------------ 876
Query: 713 YRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
P QT Y VP WL+ N +++ E
Sbjct: 877 ---------------GPQQTLY-VPVEWLKKGGNEIIVLE 900
>gi|384428898|ref|YP_005638258.1| beta-galactosidase [Xanthomonas campestris pv. raphani 756C]
gi|341938001|gb|AEL08140.1| beta-galactosidase [Xanthomonas campestris pv. raphani 756C]
Length = 613
Score = 162 bits (409), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 110/384 (28%), Positives = 179/384 (46%), Gaps = 55/384 (14%)
Query: 22 MMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDL 81
+++ + I L +++++ + F + DG ++S IH+ R W D
Sbjct: 9 LVLALAIALPITATAASDDQWPTFATQGTQ--FVRDGKPYQVLSGAIHFQRIPRAYWKDR 66
Query: 82 IAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCA 141
+ K++ G + +ETYVFWN E +GQ++F ND+ FV+ + GL + LR GPY CA
Sbjct: 67 LQKARALGLNTVETYVFWNLVEPQQGQFDFNANNDVAAFVREAAAQGLNVILRPGPYACA 126
Query: 142 EWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIE 201
EW GG+P WL I R+ + F Q ++ + +R L + GGPII +Q+E
Sbjct: 127 EWEAGGYPAWLFGKDNIRVRSRDPRFLAASQSYLDAVAQQVRP--LLNHNGGPIIAVQVE 184
Query: 202 NEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGY------- 254
NEYG+ + D+ A + A+ + AG + +D + + NG
Sbjct: 185 NEYGSYDD-------DHAYMADNRAMFVKAGFDKALLFTSDGADML---ANGTLPGTLAV 234
Query: 255 --YCDGYKPNSYNK--------PTLWTENWDGWYTTWGGRLPH------RPVEDLAFAVA 298
+ G ++++K P + E W GW+ WG PH + E+L + +
Sbjct: 235 VNFAPGEAKSAFDKLIKFQPDQPRMVGEYWAGWFDHWG--TPHASTNAKQQTEELEWIL- 291
Query: 299 RFFQRGGSFMNYYMYFGGTNFGRTSGGPF----------YITSYDYDAPIDEYGLLSEPK 348
R G N YM+ GGT+FG +G F TSYDYDA +DE G + PK
Sbjct: 292 ----RQGHSANLYMFIGGTSFGFMNGANFQGNPSDHYAPQTTSYDYDAILDEAGRPT-PK 346
Query: 349 WGHLKDLHAAIKLCEPALVAADSA 372
+ ++D+ + +P + A A
Sbjct: 347 FALMRDVITRVTGVQPPALPAPIA 370
>gi|418519416|ref|ZP_13085468.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB2388]
gi|410704860|gb|EKQ63339.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB2388]
Length = 613
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 115/372 (30%), Positives = 176/372 (47%), Gaps = 48/372 (12%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
+++ + ++ ++A T P N + DG L+S IH+ R W D +
Sbjct: 9 LVLALAFALPITGTAAETERWP-NFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAYWKDRL 67
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K++ G + +ETYVFWN E +GQ++F G ND+ FV+ + GL + LR GPY CAE
Sbjct: 68 QKARALGLNTVETYVFWNLVEPQQGQFDFSGHNDVAAFVREAAAQGLNVILRPGPYACAE 127
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
W GG+P WL I R+ + F Q ++ + + ++ L + GGPII +Q+EN
Sbjct: 128 WEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALANQVQP--LLNHNGGPIIAVQVEN 185
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCD----- 257
EYG SY D+ A + A+ + AG + +D + + NG D
Sbjct: 186 EYG----SY---ADDHAYMADNRAMYVKAGFDKALLFTSDGADML---ANGTLPDTLAVV 235
Query: 258 ----GYKPNSYNK--------PTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQ--- 302
G ++++K P + E W GW+ WG PH + A A F+
Sbjct: 236 NFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGK--PHAATD--ARQQAEEFEWIL 291
Query: 303 RGGSFMNYYMYFGGTNFGRTSGGPF----------YITSYDYDAPIDEYGLLSEPKWGHL 352
R G N YM+ GGT+FG +G F TSYDYDA +DE G + PK+ +
Sbjct: 292 RQGHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHPT-PKFALM 350
Query: 353 KDLHAAIKLCEP 364
+D A + +P
Sbjct: 351 RDAIARVTGVQP 362
>gi|78048770|ref|YP_364945.1| beta-galactosidase [Xanthomonas campestris pv. vesicatoria str.
85-10]
gi|78037200|emb|CAJ24945.1| beta-galactosidase [Xanthomonas campestris pv. vesicatoria str.
85-10]
Length = 650
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 113/349 (32%), Positives = 166/349 (47%), Gaps = 47/349 (13%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+ DG L+S IH+ R W D + K++ G + +ETYVFWN E +GQ++F G
Sbjct: 76 FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 135
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
ND+ FV+ + GL + LR GPY CAEW GG+P WL I R+ + F Q
Sbjct: 136 NNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQS 195
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
++ + + + L + GGPII +Q+ENEYG SY D+ A + A+ + AG
Sbjct: 196 YLDALAKQV--QPLLNHNGGPIIAVQVENEYG----SY---ADDHAYMADNRAMYVKAGF 246
Query: 234 PWVMCKQTDAPENIIDACNGYYCD---------GYKPNSYNK--------PTLWTENWDG 276
+ +D + + NG D G ++++K P + E W G
Sbjct: 247 DKALLFTSDGADML---ANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAG 303
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQ---RGGSFMNYYMYFGGTNFGRTSGGPF------ 327
W+ WG PH + A A F+ R G N YM+ GGT+FG +G F
Sbjct: 304 WFDHWGK--PHAATD--ARQQAEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQNNPSD 359
Query: 328 ----YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSA 372
TSYDYDA +DE G + PK+ ++D A + +P + A A
Sbjct: 360 HYAPQTTSYDYDAILDEAGHPT-PKFALMRDAIARVTGVQPPALPAPIA 407
>gi|156408171|ref|XP_001641730.1| predicted protein [Nematostella vectensis]
gi|156228870|gb|EDO49667.1| predicted protein [Nematostella vectensis]
Length = 647
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 108/345 (31%), Positives = 169/345 (48%), Gaps = 29/345 (8%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
M + + C+ + + S F++ YD+ + DG IS G+HY R W D +
Sbjct: 1 MAFFLFFICCLPTLAISL---SFSIDYDNNCFMKDGKPFRYISGGMHYFRVPQYYWKDRL 57
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K K G + ++TYV WN HE I QYNF G ++ F+++ S L + LR GPY+CAE
Sbjct: 58 LKLKASGMNTVQTYVPWNLHEPIPKQYNFAGNANLTSFLEIAQSLDLLVILRPGPYICAE 117
Query: 143 WNFGGFPVWLRDIPGIEFRTNNA-PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIE 201
W+FGG P WL P I R++ + E + ++ ++ L++ GGP+IM+Q+E
Sbjct: 118 WDFGGLPGWLLKDPSIVIRSSQGKAYMEAVDAWMSVLLPLVKP--FLYENGGPVIMVQVE 175
Query: 202 NEYGNM---ESSYGQQGKDYVKWAASMALGL-----GAGVPWVMCKQTDAPENIIDACNG 253
NEYG+ + Y + ++ + + L G+ + + C + +D G
Sbjct: 176 NEYGDYIHCDHQYMLHLQQLFRYHLTDDIILFTTDDGSNLTAIECGTLPSLYTTVDF--G 233
Query: 254 YYCDGYKPNSYNK------PTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
D P + + P + +E + GW WG R + +A A+ + S
Sbjct: 234 ANTDPSIPFANQRKLQQKGPLVNSEFYTGWLDYWGTPHQTRTSKVVADALDKILALNAS- 292
Query: 308 MNYYMYFGGTNFGRTSGGPFY------ITSYDYDAPIDEYGLLSE 346
+N YM+ GGTNFG SG F+ TSYDYDAP+ E G L+E
Sbjct: 293 VNLYMFEGGTNFGFWSGADFHGQYQPVPTSYDYDAPLTEAGDLTE 337
>gi|227533108|ref|ZP_03963157.1| beta-galactosidase 3, partial [Lactobacillus paracasei subsp.
paracasei ATCC 25302]
gi|227189289|gb|EEI69356.1| beta-galactosidase 3 [Lactobacillus paracasei subsp. paracasei ATCC
25302]
Length = 578
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 115/362 (31%), Positives = 171/362 (47%), Gaps = 43/362 (11%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
S DH ++DG ++S IHY R P W + K G + +ETYV WN HE G
Sbjct: 12 SIDHE-FMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYNEG 70
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
++F G DI +F+ GLY +R PY+CAEW FGGFP WL + RT++ +
Sbjct: 71 DFDFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWLL-TKKMRLRTDDPAY 129
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
+ + R+ ++ + + GG +IM+Q+ENEYG SYG+ KDY+ A +
Sbjct: 130 LQAIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYG----SYGED-KDYLAAVAELMK 182
Query: 228 GLGAGVPW---------VMCKQTDAPENIIDACN-GYYCDGY--------KPNSYNKPTL 269
G VP + + A I+ N G D + + ++ P +
Sbjct: 183 KHGVDVPLFTSDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLM 242
Query: 270 WTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY- 328
E WDGW+ WG + R E+ A + QRG +N YM+ GGTNFG +G
Sbjct: 243 CMEFWDGWFNRWGEPIIRRDPEETAEDLRAVIQRGS--VNLYMFHGGTNFGFMNGTSARK 300
Query: 329 ------ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAI-------KLCEPALVAADSAQYI 375
+TSYDYDAP++E G + + K +H + L +PA+ AD+
Sbjct: 301 DHDLPQVTSYDYDAPLNEQGNPTPKYFAIQKMIHEVLPSQPQTAPLVKPAMRQADNPLTA 360
Query: 376 KL 377
K+
Sbjct: 361 KV 362
>gi|66767541|ref|YP_242303.1| beta-galactosidase [Xanthomonas campestris pv. campestris str.
8004]
gi|66572873|gb|AAY48283.1| beta-galactosidase [Xanthomonas campestris pv. campestris str.
8004]
Length = 613
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 110/384 (28%), Positives = 179/384 (46%), Gaps = 55/384 (14%)
Query: 22 MMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDL 81
+++ + I L +++++ + F + DG ++S IH+ R W D
Sbjct: 9 LVLALSIALPITATAASDDQWPTFATQGTQ--FVRDGKPYQVLSGAIHFQRIPRTYWKDR 66
Query: 82 IAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCA 141
+ K++ G + +ETYVFWN E +GQ++F ND+ FV+ + GL + LR GPY CA
Sbjct: 67 LQKARALGLNTVETYVFWNLVEPQQGQFDFNANNDVAAFVREAAAQGLNVILRPGPYACA 126
Query: 142 EWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIE 201
EW GG+P WL I R+ + F Q ++ + +R L + GGPII +Q+E
Sbjct: 127 EWEAGGYPAWLFGKDNIRIRSRDPRFLAASQSYLDAVAQQVRP--LLNHNGGPIIAVQVE 184
Query: 202 NEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGY------- 254
NEYG+ + D+ A + A+ + AG + +D + + NG
Sbjct: 185 NEYGSYDD-------DHAYIADNRAMFVKAGFDKALLFTSDGADML---ANGTLPGTLAV 234
Query: 255 --YCDGYKPNSYNK--------PTLWTENWDGWYTTWGGRLPH------RPVEDLAFAVA 298
+ G ++++K P + E W GW+ WG PH + E+L + +
Sbjct: 235 VNFAPGEAKSAFDKLIKFQPDQPRMVGEYWAGWFDHWG--TPHASTNAKQQTEELEWIL- 291
Query: 299 RFFQRGGSFMNYYMYFGGTNFGRTSGGPF----------YITSYDYDAPIDEYGLLSEPK 348
R G N YM+ GGT+FG +G F TSYDYDA +DE G + PK
Sbjct: 292 ----RQGHSANLYMFIGGTSFGFMNGANFQGNPSDHYAPQTTSYDYDAILDEAGRPT-PK 346
Query: 349 WGHLKDLHAAIKLCEPALVAADSA 372
+ ++D+ + +P + A A
Sbjct: 347 FALMRDVITRVTGVQPPALPAPIA 370
>gi|440800373|gb|ELR21412.1| lysosomal betagalactosidase, partial [Acanthamoeba castellanii str.
Neff]
Length = 604
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 100/312 (32%), Positives = 154/312 (49%), Gaps = 36/312 (11%)
Query: 57 DGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKND 116
DG ++S IHY R+ PE WP + + G + + TYV WN HE GQY+F G+ D
Sbjct: 36 DGQEFRIVSGSIHYFRSLPEQWPARLRTLRSCGLNTVTTYVPWNLHEPTPGQYDFSGRLD 95
Query: 117 IVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVK 176
IV+F++ G + +R PY+CAE FGG P WL + G++ R ++ + + + F+
Sbjct: 96 IVRFIEAAQQEGFLVIVRPPPYICAELEFGGLPAWLLNEEGLQLRCSDPKYLKRVDSFLD 155
Query: 177 KIVDLMREEMLFSWQ---GGPIIMLQIENEYG----------NMESSYGQQGKDYVKWAA 223
+ ML ++Q GGPII +Q+ENEYG ++E + Q D + +++
Sbjct: 156 HFL-----PMLATYQYSRGGPIIAMQVENEYGSYGNDHLYLRHLELKFRQHQIDAILFSS 210
Query: 224 SMA---LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTT 280
+ A + +G +P ++ ++ N Y+P+ P TE WDGW+
Sbjct: 211 NGAGDQMFVGGALPSLLRTVNFGTGADVEG-NLKVLRKYQPSG---PLFVTEFWDGWFDH 266
Query: 281 WGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--------PFY--IT 330
WG H + + +N YM FGGTNFG T+G P+ T
Sbjct: 267 WGEEH-HTTTPTQSMKTLEAILSNNASVNLYMAFGGTNFGFTNGANKGYGETDPYQPTTT 325
Query: 331 SYDYDAPIDEYG 342
SYDYDAP++E G
Sbjct: 326 SYDYDAPVNESG 337
>gi|395803570|ref|ZP_10482814.1| beta-galactosidase [Flavobacterium sp. F52]
gi|395434124|gb|EJG00074.1| beta-galactosidase [Flavobacterium sp. F52]
Length = 617
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 110/332 (33%), Positives = 164/332 (49%), Gaps = 46/332 (13%)
Query: 57 DGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFK-GKN 115
DG + S +HY R E W + K G + + TYVFWN HE G ++FK G
Sbjct: 37 DGKIIKIHSGEMHYERIPKEYWRHRLQMLKAMGLNTVATYVFWNYHEIEPGVWDFKTGNR 96
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ +F+++ S GLY+ LR GPY C EW FGG+P WL++ P + RTNN F + + ++
Sbjct: 97 DLAEFLRIAKSEGLYVILRPGPYACGEWEFGGYPWWLQNNPDLVIRTNNKAFLDACKTYL 156
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQ-QGKDYVKWAASMALGLGAGVP 234
+ + +++ F+ QGGPIIM+Q ENE+G+ S +D+ + ++
Sbjct: 157 EHLYAVVKGN--FANQGGPIIMVQAENEFGSYVSQRTDISAEDHKAYKTAI--------- 205
Query: 235 WVMCKQTDAPENIIDA-----CNGYYCDGYKP---------------NSYNK---PTLWT 271
+ + K+T PE + G +G P + Y+K P +
Sbjct: 206 YNILKETGFPEPFFTSDGSWLFEGGMVEGVLPTANGESNIENLKKQVDKYHKGQGPYMVA 265
Query: 272 ENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY--- 328
E + GW W E++A ++ G SF NYYM GGTNFG TSG +
Sbjct: 266 EFYPGWLDHWAEPFVKIGSEEIASQTKKYLDAGVSF-NYYMAHGGTNFGFTSGANYNEES 324
Query: 329 -----ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
ITSYDYDAPI E G + PK+ ++D+
Sbjct: 325 DIQPDITSYDYDAPISEAGWAT-PKFMAIRDV 355
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 57/227 (25%), Positives = 93/227 (40%), Gaps = 64/227 (28%)
Query: 539 VTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSG--YNDLI-LLSQTVGLQNYGAF 595
+ I+ +RD V++NG +G +V + E +N ++ +L + +G NYGA
Sbjct: 430 LKIEGLRDFATVYVNG----VKVGELNRVFKNYELTLSIPFNGILEILVENMGRINYGAE 485
Query: 596 LEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEW--TDLTRD 653
+ + G V + ++ + G ++ +Y + NE T+ +
Sbjct: 486 IVHNTKGIISPVFINEYE----------------ITGGWE-MYKMPMNEVPVLKTETVKS 528
Query: 654 GIPSTFTWYKTYFDAPDGIDPVA---LDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDT 710
G P ++A ID A LD+ + GKG +VNGH++GRYW V
Sbjct: 529 GRP-------VLYEAAVNIDKPADTFLDMTNWGKGIVFVNGHNLGRYWKV---------- 571
Query: 711 CDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGN 757
P QT Y VP WL+A N V+FE+ N
Sbjct: 572 -----------------GPQQTLY-VPGCWLKAGENKFVVFEQLNEN 600
>gi|191637109|ref|YP_001986275.1| beta-galactosidase 3 [Lactobacillus casei BL23]
gi|385818812|ref|YP_005855199.1| galactosidase, beta 1-like protein [Lactobacillus casei LC2W]
gi|385821988|ref|YP_005858330.1| galactosidase, beta 1-like protein [Lactobacillus casei BD-II]
gi|409995961|ref|YP_006750362.1| beta-galactosidase 17 [Lactobacillus casei W56]
gi|190711411|emb|CAQ65417.1| Beta-galactosidase 3 [Lactobacillus casei BL23]
gi|327381139|gb|AEA52615.1| galactosidase, beta 1-like protein [Lactobacillus casei LC2W]
gi|327384315|gb|AEA55789.1| galactosidase, beta 1-like protein [Lactobacillus casei BD-II]
gi|406356973|emb|CCK21243.1| Beta-galactosidase 17 [Lactobacillus casei W56]
Length = 598
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 115/362 (31%), Positives = 171/362 (47%), Gaps = 43/362 (11%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
S DH ++DG ++S IHY R P W + K G + +ETYV WN HE G
Sbjct: 5 SIDHE-FMLDGQPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYNEG 63
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
++F G DI +F+ GLY +R PY+CAEW FGGFP WL + RT++ +
Sbjct: 64 DFDFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWLL-TKKMRLRTDDPAY 122
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
+ + R+ ++ + + GG +IM+Q+ENEYG SYG+ KDY+ A +
Sbjct: 123 LQAIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYG----SYGED-KDYLAAVAELMK 175
Query: 228 GLGAGVPW---------VMCKQTDAPENIIDACN-GYYCDGY--------KPNSYNKPTL 269
G VP + + A I+ N G D + + ++ P +
Sbjct: 176 KHGVDVPLFTSDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLM 235
Query: 270 WTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-- 327
E WDGW+ WG + R E+ A + QRG +N YM+ GGTNFG +G
Sbjct: 236 CMEFWDGWFNRWGEPIIRRDPEETAEDLRAVIQRGS--VNLYMFHGGTNFGFMNGTSARK 293
Query: 328 -----YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAI-------KLCEPALVAADSAQYI 375
+TSYDYDAP++E G + + K +H + L +PA+ AD+
Sbjct: 294 DHDLPQVTSYDYDAPLNEQGNPTPKYFTIQKMIHEVLPSQAQTTPLVKPAMRQADNPLTA 353
Query: 376 KL 377
K+
Sbjct: 354 KV 355
>gi|423280524|ref|ZP_17259436.1| hypothetical protein HMPREF1203_03653 [Bacteroides fragilis HMW
610]
gi|404583731|gb|EKA88404.1| hypothetical protein HMPREF1203_03653 [Bacteroides fragilis HMW
610]
Length = 628
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 110/335 (32%), Positives = 165/335 (49%), Gaps = 53/335 (15%)
Query: 57 DGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKND 116
+G ++S +HY R + W + K G + + TYVFWN HE G+++F G +
Sbjct: 37 NGKVTPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 117 IVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVK 176
+ +F+K G G+ + LR GPYVCAEW FGG+P WL+++ G+E R +N E ++ K
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNP----EFLKYTK 152
Query: 177 KIVDLMREEM--LFSWQGGPIIMLQIENEYGNMESSYGQQGKD-----YVKWAASMALGL 229
+D + +E+ L +GGPI+M+Q ENE+G SY Q KD + + A + L
Sbjct: 153 AYIDRLYKEVGDLQCTKGGPIVMVQCENEFG----SYVAQRKDIPLEEHRAYNAKIKQQL 208
Query: 230 ---GAGVP-------WVM-----------CKQTDAPENIIDACNGYYCDGYKPNSYNKPT 268
G VP W+ EN+ N Y+ DG P
Sbjct: 209 ADAGFNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVNQYH-DG------KGPY 261
Query: 269 LWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF- 327
+ E + GW + W P +A ++ Q SF N+YM GGTNFG TSG +
Sbjct: 262 MVAEFYPGWLSHWAEPFPQVGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYD 320
Query: 328 -------YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+TSYDYDAPI E G ++ PK+ ++++
Sbjct: 321 KKRDIQPDLTSYDYDAPISEAGWVT-PKYDSIRNV 354
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 76/330 (23%), Positives = 132/330 (40%), Gaps = 66/330 (20%)
Query: 449 KTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTK 508
K V++++P +P + P + KL+ + ++ V ++ T E LN
Sbjct: 357 KYVKYTVPEAPAPN-PVIEIPSIKLTKVADVLAFAEKQKPVSADTPLT----FEQLNQGY 411
Query: 509 DYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVV 568
Y Y H Q + T+ I +RD V+++G+ G V+ K
Sbjct: 412 GYVLYTRHFNQ--------------PISGTLEIPGLRDYAVVYVDGEQVG-VLNRNTKTY 456
Query: 569 QPVEFQSGYN-DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFK-NGDIDLSKILWTY 626
+E + +N L +L + +G NYG+ + + G VK+ G + G+ D+ ++ +
Sbjct: 457 S-MEIEVPFNATLQILVENMGRINYGSEIVHNTKGIISPVKIAGKEITGEWDMYQLPMSE 515
Query: 627 Q---VGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMG 683
LK + E + + + +G TFT D + +D+ + G
Sbjct: 516 MPDLAKLKADAHANVPAEAAKLKGCPVLYEG---TFTL--------DNVGDTFIDMENWG 564
Query: 684 KGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQA 743
KG +VNG +IGRYW V P QT Y +P WL+
Sbjct: 565 KGIIFVNGVNIGRYWKV---------------------------GPQQTLY-IPGVWLKK 596
Query: 744 SNNLLVIFEETGGNPFEISVKLRSTRIVCE 773
N +VIFE+ P + VK T ++ +
Sbjct: 597 GTNKIVIFEQLNEVP-QAEVKTVKTPVLMK 625
>gi|424665121|ref|ZP_18102157.1| hypothetical protein HMPREF1205_00996 [Bacteroides fragilis HMW
616]
gi|404574985|gb|EKA79730.1| hypothetical protein HMPREF1205_00996 [Bacteroides fragilis HMW
616]
Length = 628
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 110/335 (32%), Positives = 165/335 (49%), Gaps = 53/335 (15%)
Query: 57 DGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKND 116
+G ++S +HY R + W + K G + + TYVFWN HE G+++F G +
Sbjct: 37 NGKVTPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 117 IVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVK 176
+ +F+K G G+ + LR GPYVCAEW FGG+P WL+++ G+E R +N E ++ K
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNP----EFLKYTK 152
Query: 177 KIVDLMREEM--LFSWQGGPIIMLQIENEYGNMESSYGQQGKD-----YVKWAASMALGL 229
+D + +E+ L +GGPI+M+Q ENE+G SY Q KD + + A + L
Sbjct: 153 AYIDRLYKEVGDLQCTKGGPIVMVQCENEFG----SYVAQRKDIPLEEHRAYNAKIKQQL 208
Query: 230 ---GAGVP-------WVM-----------CKQTDAPENIIDACNGYYCDGYKPNSYNKPT 268
G VP W+ EN+ N Y+ DG P
Sbjct: 209 ADAGFNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVNQYH-DG------KGPY 261
Query: 269 LWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF- 327
+ E + GW + W P +A ++ Q SF N+YM GGTNFG TSG +
Sbjct: 262 MVAEFYPGWLSHWAEPFPQVGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYD 320
Query: 328 -------YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+TSYDYDAPI E G ++ PK+ ++++
Sbjct: 321 KKRDIQPDLTSYDYDAPISEAGWVT-PKYDSIRNV 354
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 77/330 (23%), Positives = 133/330 (40%), Gaps = 66/330 (20%)
Query: 449 KTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTK 508
K V++++P +P + P + KL+ + ++ V ++ FT E LN
Sbjct: 357 KYVKYTVPEAPAPN-PVIEIPSIKLTKVADVLAFAEKQKPVSADTPFT----FEQLNQGY 411
Query: 509 DYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVV 568
Y Y H Q + T+ I +RD V+++G+ G V+ K
Sbjct: 412 GYVLYTRHFNQ--------------PISGTLEIPGLRDYAVVYVDGEQVG-VLNRNTKTY 456
Query: 569 QPVEFQSGYN-DLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFK-NGDIDLSKILWTY 626
+E + +N L +L + +G NYG+ + + G VK+ G + G+ D+ ++ +
Sbjct: 457 S-MEIEVPFNATLQILVENMGRINYGSEIVHNTKGIISPVKIAGKEITGEWDMYQLPMSE 515
Query: 627 Q---VGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMG 683
LK + E + + + +G TFT D + +D+ + G
Sbjct: 516 MPDLAKLKADAHANVPAEAAKLKGCPVLYEG---TFTL--------DNVGDTFIDMENWG 564
Query: 684 KGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQA 743
KG +VNG +IGRYW V P QT Y +P WL+
Sbjct: 565 KGIIFVNGVNIGRYWKV---------------------------GPQQTLY-IPGVWLKK 596
Query: 744 SNNLLVIFEETGGNPFEISVKLRSTRIVCE 773
N +VIFE+ P + VK T ++ +
Sbjct: 597 GTNKIVIFEQLNEVP-QAEVKTVKTPVLMK 625
>gi|384420175|ref|YP_005629535.1| beta-galactosidase [Xanthomonas oryzae pv. oryzicola BLS256]
gi|353463088|gb|AEQ97367.1| beta-galactosidase [Xanthomonas oryzae pv. oryzicola BLS256]
Length = 613
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 113/349 (32%), Positives = 165/349 (47%), Gaps = 47/349 (13%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+ DG L+S IH+ R W D + K++ G + +ETYVFWN E +GQ++F G
Sbjct: 39 FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 98
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
ND+ FV+ + GL + LR GPY CAEW GG+P WL I R+ + F Q
Sbjct: 99 NNDVAAFVQEAAAQGLNVILRPGPYACAEWEAGGYPAWLFGQGNIRVRSRDPRFLAASQA 158
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
++ + ++ L + GGPII +Q+ENEYG SY D+ A + A+ + AG
Sbjct: 159 YLDAVAKQVQP--LLNHNGGPIIAVQVENEYG----SY---ADDHAYMADNRAMYVKAGF 209
Query: 234 PWVMCKQTDAPENIIDACNGYYCD---------GYKPNSYNK--------PTLWTENWDG 276
+ +D E + NG D G ++++K P + E W G
Sbjct: 210 DKALLFTSDGAEML---ANGTLPDTLAVVNFAPGEAKSAFDKLIAFRPDQPRMVGEYWAG 266
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQ---RGGSFMNYYMYFGGTNFGRTSGGPF------ 327
W+ WG PH + A A F+ R G N YM+ GGT+FG +G F
Sbjct: 267 WFDHWGK--PHAATD--ATQQAEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQNNPSD 322
Query: 328 ----YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSA 372
TSYDYDA +DE G + K+ ++D A + +P + A A
Sbjct: 323 HYAPQTTSYDYDAIVDEAGRPTA-KFALMRDAIARVTGVQPPALPAPIA 370
>gi|62955063|ref|NP_001017547.1| beta-galactosidase precursor [Danio rerio]
gi|62089564|gb|AAH92166.1| Galactosidase, beta 1 [Danio rerio]
gi|182890870|gb|AAI65636.1| Glb1 protein [Danio rerio]
Length = 651
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 101/318 (31%), Positives = 156/318 (49%), Gaps = 20/318 (6%)
Query: 45 FNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHES 104
F+V Y + DG IS IHY R W D + K G + I+TYV WN HE+
Sbjct: 26 FSVDYHRNCFLKDGEPFRYISGSIHYSRIPRVYWKDRLLKMYMAGLNAIQTYVPWNFHEA 85
Query: 105 IRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNN 164
+ GQY+F G D+ +F++L GL + +R GPY+CAEW+ GG P WL I R+++
Sbjct: 86 VPGQYDFSGDRDLEQFLQLCQDIGLLVIMRPGPYICAEWDMGGLPAWLLKKKDIVLRSSD 145
Query: 165 APFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYG-------NMESSYGQQGKD 217
+ + +++ K++ +++ + GGPII +Q+ENEYG N Q +
Sbjct: 146 PDYLAAVDKWMGKLLPIIKRYLY--QNGGPIITVQVENEYGSYFACDFNYMRHLSQLFRF 203
Query: 218 YVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYC----DGYKPNSYNKPTLWTEN 273
Y+ A + GAG+ ++ C +D G + + P + +E
Sbjct: 204 YLGEEAVLFTTDGAGLGYLKCGSLQGLYATVDFGPGANVTAAFEAQRHVEPRGPLVNSEF 263
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG-----GPFY 328
+ GW WG + P + + + G + +N YM+ GGTNFG +G GP
Sbjct: 264 YPGWLDHWGEKHSVVPTSAVVKTLNEILEIGAN-VNLYMFIGGTNFGYWNGANTPYGP-Q 321
Query: 329 ITSYDYDAPIDEYGLLSE 346
TSYDYD+P+ E G L+E
Sbjct: 322 PTSYDYDSPLTEAGDLTE 339
>gi|336424850|ref|ZP_08604882.1| hypothetical protein HMPREF0994_00888 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336013315|gb|EGN43197.1| hypothetical protein HMPREF0994_00888 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 596
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 106/321 (33%), Positives = 158/321 (49%), Gaps = 45/321 (14%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G +IS GIHY R PE W D + K KE G + +ETY+ WN HE ++G+++F G++
Sbjct: 16 LNGEPFQIISGGIHYFRILPEYWEDRLQKLKELGCNTVETYIPWNMHEPVKGKFDFYGEH 75
Query: 116 -----DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEE 170
D+V FV+ GL++ LR PY+CAEW+FGG P WL ++ RT++ +
Sbjct: 76 VHGMLDVVSFVRTAQRLGLWVILRPSPYICAEWDFGGLPFWLMAGEEMDLRTSDERYLRH 135
Query: 171 MQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLG 230
++ + +++ L+ L QGGP++MLQ+ENEYG S+G K Y++ M G
Sbjct: 136 VRDYYDRLMPLLAP--LQIDQGGPVLMLQVENEYG----SFGND-KKYLESLRDMMRERG 188
Query: 231 AGVPWVMCKQTD-------APENIIDACN------------GYYCDGYKPNSYNKPTLWT 271
VP D E I N Y DG P + T
Sbjct: 189 ITVPLFASDGPDHNMLANTKTEGIFPTANFGSGASKAFSILEEYTDG-------GPCMCT 241
Query: 272 ENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY--- 328
E W GW+ W + H + A G+ +N YM+ GGTNFG +G +
Sbjct: 242 EFWIGWFDAWHDEVHHEGDTETAVKELENILELGN-VNIYMFEGGTNFGFMNGSNYSDHL 300
Query: 329 ---ITSYDYDAPIDEYGLLSE 346
+TSYDYDA + E G +++
Sbjct: 301 TADVTSYDYDALLTEDGQITD 321
>gi|119588243|gb|EAW67839.1| hCG1729998, isoform CRA_d [Homo sapiens]
Length = 653
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 104/321 (32%), Positives = 161/321 (50%), Gaps = 27/321 (8%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G++ ++ IHY R E W D + K K G + + TYV WN HE RG+++F G
Sbjct: 82 LEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSGNL 141
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ FV + GL++ LR GPY+C+E + GG P WL P + RT N F E ++++
Sbjct: 142 DLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPRLLLRTTNKSFIEAVEKYF 201
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGA---- 231
++ R L Q GP+I +Q+ENEYG+ + K Y+ + L G
Sbjct: 202 DHLIP--RVIPLQYRQAGPVIAVQVENEYGSF-----NKDKTYMPYLHKALLRRGIVELL 254
Query: 232 ----GVPWVMCKQTD---APENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGR 284
G V+ T A N+ + +K +KP L E W GW+ WG +
Sbjct: 255 LTSDGEKHVLSGHTKGVLAAINLQKLHQDTFNQLHKVQR-DKPLLIMEYWVGWFDRWGDK 313
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAPI 338
+ +++ AV+ F + SF N YM+ GGTNFG +G ++ +TSYDYDA +
Sbjct: 314 HHVKDAKEVEHAVSEFIKYEISF-NVYMFHGGTNFGFMNGATYFGKHSGIVTSYDYDAVL 372
Query: 339 DEYGLLSEPKWGHLKDLHAAI 359
E G +E K+ L+ L ++
Sbjct: 373 TEAGDYTE-KYLKLQKLFQSV 392
>gi|299147339|ref|ZP_07040404.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
gi|298514617|gb|EFI38501.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
Length = 778
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 115/353 (32%), Positives = 171/353 (48%), Gaps = 29/353 (8%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
++ ++++ + S++ A T + F + ++DG ++ +A +HY R W
Sbjct: 5 LIALLVLFTVIFFSTAQAQTTARKFEAGKN--TFLLDGKPFVVKAAELHYTRIPQAYWEH 62
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
I K G + I Y+FWN HE G+++F G+NDI F + G+Y+ +R GPYVC
Sbjct: 63 RIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGPYVC 122
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEW GG P WL I RT + + E + F+K++ + L +GG IIM+Q+
Sbjct: 123 AEWEMGGLPWWLLKKKDIALRTLDPYYMERVGIFMKEVGKQLAP--LQVNKGGNIIMVQV 180
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLG-AGVPWVMCK-----QTDAPENIIDACN-- 252
ENEYG SYG K YV + G + VP C +A +++I N
Sbjct: 181 ENEYG----SYGID-KPYVSAVRDLVRESGFSDVPLFQCDWSSNFTNNALDDLIWTVNFG 235
Query: 253 -GYYCD----GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
G D K P + +E W GW+ WG + RP +D+ + R SF
Sbjct: 236 TGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF 295
Query: 308 MNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+ YM GGT FG G P Y +SYDYDAPI E G ++ K+ L+DL
Sbjct: 296 -SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEPGWTTD-KFFLLRDL 346
Score = 44.7 bits (104), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 31/126 (24%), Positives = 53/126 (42%), Gaps = 30/126 (23%)
Query: 655 IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYR 714
+P+ +YK+ F D + LD+ + GKG WVNGH +GR+W +
Sbjct: 526 LPTMPAYYKSTFTL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------- 570
Query: 715 GAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEIS-VKLRSTRIVCE 773
P QT + +P WL+ N +++ + G I +K ++ E
Sbjct: 571 -------------GPQQTLF-MPGCWLKEGENEILVLDLKGPTRASIKGLKKPILDVLRE 616
Query: 774 QVSESH 779
+ E+H
Sbjct: 617 KAPETH 622
>gi|293370654|ref|ZP_06617206.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
gi|292634388|gb|EFF52925.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
Length = 778
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 115/353 (32%), Positives = 171/353 (48%), Gaps = 29/353 (8%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
++ ++++ + S++ A T + F + ++DG ++ +A +HY R W
Sbjct: 5 LIALLVLFTVIFFSTAQAQTTARKFEAGKN--TFLLDGKPFVVKAAELHYTRIPQAYWEH 62
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
I K G + I Y+FWN HE G+++F G+NDI F + G+Y+ +R GPYVC
Sbjct: 63 RIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIATFCRAAQKHGMYVIVRPGPYVC 122
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEW GG P WL I RT + + E + F+K++ + L +GG IIM+Q+
Sbjct: 123 AEWEMGGLPWWLLKKKDIALRTLDPYYMERVGIFMKEVGKQLAP--LQVNKGGNIIMVQV 180
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLG-AGVPWVMCK-----QTDAPENIIDACN-- 252
ENEYG SYG K YV + G + VP C +A +++I N
Sbjct: 181 ENEYG----SYGID-KPYVSAVRDLVRESGFSDVPLFQCDWSSNFTNNALDDLIWTVNFG 235
Query: 253 -GYYCDG----YKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
G D K P + +E W GW+ WG + RP +D+ + R SF
Sbjct: 236 TGANIDQQFKRLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF 295
Query: 308 MNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+ YM GGT FG G P Y +SYDYDAPI E G ++ K+ L+DL
Sbjct: 296 -SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEPGWTTD-KFFLLRDL 346
Score = 44.7 bits (104), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 31/126 (24%), Positives = 53/126 (42%), Gaps = 30/126 (23%)
Query: 655 IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYR 714
+P+ +YK+ F D + LD+ + GKG WVNGH +GR+W +
Sbjct: 526 LPTMPAYYKSTFTL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------- 570
Query: 715 GAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEIS-VKLRSTRIVCE 773
P QT + +P WL+ N +++ + G I +K ++ E
Sbjct: 571 -------------GPQQTLF-MPGCWLKEGENEILVLDLKGPTRASIKGLKKPILDVLRE 616
Query: 774 QVSESH 779
+ E+H
Sbjct: 617 KAPETH 622
>gi|237719727|ref|ZP_04550208.1| beta-galactosidase [Bacteroides sp. 2_2_4]
gi|229450996|gb|EEO56787.1| beta-galactosidase [Bacteroides sp. 2_2_4]
Length = 778
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 115/353 (32%), Positives = 170/353 (48%), Gaps = 29/353 (8%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
++ ++++ + S++ A T + F + ++DG ++ +A +HY R W
Sbjct: 5 LIALLVLFTVIFFSTAQAQTTARKFEAGKN--TFLLDGKPFVVKAAELHYTRIPQAYWEH 62
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
I K G + I Y+FWN HE G+++F G+NDI F + G+Y+ +R GPYVC
Sbjct: 63 RIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIATFCRAAQKHGMYVIVRPGPYVC 122
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEW GG P WL I RT + + E + F+K++ + L +GG IIM+Q+
Sbjct: 123 AEWEMGGLPWWLLKKKDIALRTLDPYYMERVGIFMKEVGKQLAP--LQVNKGGNIIMVQV 180
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLG-AGVPWVMCK-----QTDAPENIIDACN-- 252
ENEYG SYG K YV + G VP C +A +++I N
Sbjct: 181 ENEYG----SYGID-KPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNFG 235
Query: 253 -GYYCD----GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
G D K P + +E W GW+ WG + RP +D+ + R SF
Sbjct: 236 TGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF 295
Query: 308 MNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+ YM GGT FG G P Y +SYDYDAPI E G ++ K+ L+DL
Sbjct: 296 -SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEPGWTTD-KFFLLRDL 346
Score = 44.7 bits (104), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 31/126 (24%), Positives = 53/126 (42%), Gaps = 30/126 (23%)
Query: 655 IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYR 714
+P+ +YK+ F D + LD+ + GKG WVNGH +GR+W +
Sbjct: 526 LPTMPAYYKSTFTL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------- 570
Query: 715 GAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEIS-VKLRSTRIVCE 773
P QT + +P WL+ N +++ + G I +K ++ E
Sbjct: 571 -------------GPQQTLF-MPGCWLKEGENEILVLDLKGPTRASIKGLKKPILDVLRE 616
Query: 774 QVSESH 779
+ E+H
Sbjct: 617 KAPETH 622
>gi|170782982|ref|YP_001711316.1| beta-galactosidase [Clavibacter michiganensis subsp. sepedonicus]
gi|169157552|emb|CAQ02748.1| beta-galactosidase [Clavibacter michiganensis subsp. sepedonicus]
Length = 615
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 104/320 (32%), Positives = 161/320 (50%), Gaps = 27/320 (8%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG +I+ +HY R P+ W D I K++ G D IETYV WNAH RG ++
Sbjct: 37 LDGRPHRVIAGALHYFRVHPDQWADRIRKARLMGLDTIETYVAWNAHSPERGTFDTSAGL 96
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ +F+ LV + G++ +R GPY+CAEW+ GG P WL P + R + + + F+
Sbjct: 97 DLGRFLDLVHAEGMHAIVRPGPYICAEWDGGGLPGWLFGDPAVGVRRSEPLYLAAVDEFL 156
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
+++ +++ + GGP+I++QIENEYG +YG + Y++ + G VP
Sbjct: 157 RRVYEIVAPRQID--MGGPVILVQIENEYG----AYGDDAE-YLRHLVDLTRESGIIVPL 209
Query: 236 VMCKQ-TDA--PENIIDACN---------GYYCDGYKPNSYNKPTLWTENWDGWYTTWGG 283
Q TD +D + + + + P + +E WDGW+ WG
Sbjct: 210 TTVDQPTDEMLSRGSLDELHRTGSFGSRAAERLETLRRHQRTGPLMCSEFWDGWFDHWGE 269
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF------YITSYDYDAP 337
H A A G+ +N YM+ GGTNFG T+G ++TSYDYDAP
Sbjct: 270 HH-HTTSAADAAAELDALLAAGASVNIYMFHGGTNFGFTNGANHKGTYQSHVTSYDYDAP 328
Query: 338 IDEYGLLSEPKWGHLKDLHA 357
+DE G +E K+ +D+ A
Sbjct: 329 LDETGSPTE-KYFAFRDVIA 347
>gi|423294349|ref|ZP_17272476.1| hypothetical protein HMPREF1070_01141 [Bacteroides ovatus
CL03T12C18]
gi|392675540|gb|EIY68981.1| hypothetical protein HMPREF1070_01141 [Bacteroides ovatus
CL03T12C18]
Length = 778
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 115/353 (32%), Positives = 170/353 (48%), Gaps = 29/353 (8%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
++ ++++ + S++ A T + F + ++DG ++ +A +HY R W
Sbjct: 5 LIALLVLFTVIFFSTAQAQTTARKFEAGKN--TFLLDGKPFVVKAAELHYTRIPQAYWEH 62
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
I K G + I Y+FWN HE G+++F G+NDI F + G+Y+ +R GPYVC
Sbjct: 63 RIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGPYVC 122
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEW GG P WL I RT + + E + F+K++ + L +GG IIM+Q+
Sbjct: 123 AEWEMGGLPWWLLKKKDIALRTLDPYYMERVGIFMKEVGKQLAP--LQVNKGGNIIMVQV 180
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLG-AGVPWVMCK-----QTDAPENIIDACN-- 252
ENEYG SYG K YV + G VP C +A +++I N
Sbjct: 181 ENEYG----SYGID-KPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNFG 235
Query: 253 -GYYCD----GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
G D K P + +E W GW+ WG + RP +D+ + R SF
Sbjct: 236 TGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF 295
Query: 308 MNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+ YM GGT FG G P Y +SYDYDAPI E G ++ K+ L+DL
Sbjct: 296 -SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEPGWTTD-KFFLLRDL 346
Score = 44.7 bits (104), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 31/126 (24%), Positives = 53/126 (42%), Gaps = 30/126 (23%)
Query: 655 IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYR 714
+P+ +YK+ F D + LD+ + GKG WVNGH +GR+W +
Sbjct: 526 LPTMPAYYKSTFTL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------- 570
Query: 715 GAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEIS-VKLRSTRIVCE 773
P QT + +P WL+ N +++ + G I +K ++ E
Sbjct: 571 -------------GPQQTLF-MPGCWLKEGENEILVLDLKGPTRASIKGLKKPILDVLRE 616
Query: 774 QVSESH 779
+ E+H
Sbjct: 617 KAPETH 622
>gi|225872227|ref|YP_002753682.1| glycosyl hydrolase [Acidobacterium capsulatum ATCC 51196]
gi|225791474|gb|ACO31564.1| glycosyl hydrolase, family 35 [Acidobacterium capsulatum ATCC
51196]
Length = 664
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 119/355 (33%), Positives = 173/355 (48%), Gaps = 42/355 (11%)
Query: 13 CLALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPR 72
LAL + P+ +M + + S F+ N + ++DG +IS +HY R
Sbjct: 1 MLALFLLPVSVMAAARRGNSSALSDQRGSFRVENGKF-----VLDGQPFQIISGEMHYER 55
Query: 73 ATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQ 132
W + +K G + I TYVFWN HE G+++F G D+ +F++ +GL +
Sbjct: 56 IPRAYWKARLQMAKAMGLNTIATYVFWNLHEPEPGKFDFSGNADLAQFIRDAQQTGLKVL 115
Query: 133 LRIGPYVCAEWNFGGFPVWLRDIPGIE--FRTNNAPFKEEMQRFVKKIVDLMRE-EMLFS 189
LR GPY CAEW FGGFP WL P ++ R+N+ F + +++ I+ L RE L
Sbjct: 116 LRAGPYSCAEWEFGGFPAWLMKNPKMQTALRSNDPEFMKPAEQW---ILRLGREVAPLQV 172
Query: 190 WQGGPIIMLQIENEYGNM--ESSYGQQGKDYVKWAA-----------SMALGLGAGVPWV 236
GGPII +QIENEYG+ +++Y + K A S AL G+ +P V
Sbjct: 173 GYGGPIIGVQIENEYGDFGGDAAYLEHLKKIFLKAGFTQSLLYTANPSRALVRGS-IPGV 231
Query: 237 MCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFA 296
AP + A D +P L +E W GW+ WG PH+ + L+
Sbjct: 232 YSAVNFAPGHAAQA-----LDSLAQLRAGQPLLSSEYWTGWFDHWGE--PHQ-SKPLSLQ 283
Query: 297 VARF--FQRGGSFMNYYMYFGGTNFGRTSGGPFY-------ITSYDYDAPIDEYG 342
V F R G+ +N YM+ GGT+FG SG + +TSYDY AP+DE G
Sbjct: 284 VKDFNYILRHGAGVNLYMFHGGTSFGMMSGSSWTKHQFLPDVTSYDYGAPLDEAG 338
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 47/184 (25%), Positives = 71/184 (38%), Gaps = 29/184 (15%)
Query: 539 VTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEK 598
+ ++ M D V++NG+L G++ + S L +L + G N +
Sbjct: 422 LVLNGMNDYALVYLNGKLQGTLNRTCNDSTLMLHSNSAKTRLDILVENSGRINSTRMMLH 481
Query: 599 DGAGFRGQVKLTGFKNGDIDLSKIL--W-TYQVGLK--------GEFQQIYSIEENEAEW 647
G G V L G + L W TY++ +K G Q+ + E++
Sbjct: 482 ANKGLMGPVMLAG---------RALHGWKTYRLPMKPDTIADPLGMPQETHFNEKSTPAQ 532
Query: 648 TDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGC 707
TF PD LD+ +GKG W++GH IGRYW V G
Sbjct: 533 AMSGPAFYRGTFRVETKSKQIPDTF----LDIRGLGKGAVWIDGHPIGRYWNV-----GP 583
Query: 708 QDTC 711
QDT
Sbjct: 584 QDTL 587
>gi|189096261|pdb|3D3A|A Chain A, Crystal Structure Of A Beta-Galactosidase From Bacteroides
Thetaiotaomicron
Length = 612
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 108/322 (33%), Positives = 155/322 (48%), Gaps = 29/322 (9%)
Query: 53 AIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFK 112
+++G ++ +A IHYPR E W I K G + I YVFWN HE G+Y+F
Sbjct: 14 TFLLNGEPFVVKAAEIHYPRIPKEYWEHRIKXCKALGXNTICLYVFWNFHEPEEGRYDFA 73
Query: 113 GKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQ 172
G+ DI F +L +G Y+ +R GPYVCAEW GG P WL I+ R + + E ++
Sbjct: 74 GQKDIAAFCRLAQENGXYVIVRPGPYVCAEWEXGGLPWWLLKKKDIKLREQDPYYXERVK 133
Query: 173 RFVKKIVDLMREEMLFSWQGGPIIMLQIENEYG--NMESSYGQQGKDYVKWAASMALGLG 230
F+ ++ + + + +GG II +Q+ENEYG ++ Y + +D VK A
Sbjct: 134 LFLNEVGKQLADLQIS--KGGNIIXVQVENEYGAFGIDKPYISEIRDXVKQAGF------ 185
Query: 231 AGVPWVMCKQTDAPEN--------IIDACNGYYCDG----YKPNSYNKPTLWTENWDGWY 278
GVP C EN I+ G D K + P +E W GW+
Sbjct: 186 TGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDEQFKRLKELRPDTPLXCSEFWSGWF 245
Query: 279 TTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-----YITSYD 333
WG + R E+L R SF + Y GGT+FG G F TSYD
Sbjct: 246 DHWGAKHETRSAEELVKGXKEXLDRNISF-SLYXTHGGTSFGHWGGANFPNFSPTCTSYD 304
Query: 334 YDAPIDEYGLLSEPKWGHLKDL 355
YDAPI+E G ++ PK+ +++L
Sbjct: 305 YDAPINESGKVT-PKYLEVRNL 325
Score = 43.5 bits (101), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 53/221 (23%), Positives = 91/221 (41%), Gaps = 47/221 (21%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSV---IGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGA 594
T+ I D +VF+NG+ ++ G V + P+ + G + L +L + G N+G
Sbjct: 399 TLLITEAHDWAQVFLNGKKLATLSRLKGEGVVKLPPL--KEG-DRLDILVEAXGRXNFGK 455
Query: 595 FLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDG 654
+ D G +V+L K ++L K Y + + F + ++ E +
Sbjct: 456 GI-YDWKGITEKVELQSDKG--VELVKDWQVYTIPVDYSFARDKQYKQQE------NAEN 506
Query: 655 IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYR 714
P+ +Y++ F+ + + L+ + KG WVNGH IGRYW +
Sbjct: 507 QPA---YYRSTFNLNE-LGDTFLNXXNWSKGXVWVNGHAIGRYWEI-------------- 548
Query: 715 GAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P QT Y VP WL+ N ++I + G
Sbjct: 549 -------------GPQQTLY-VPGCWLKKGENEIIILDXAG 575
>gi|312866933|ref|ZP_07727144.1| putative beta-galactosidase [Streptococcus parasanguinis F0405]
gi|311097415|gb|EFQ55648.1| putative beta-galactosidase [Streptococcus parasanguinis F0405]
Length = 595
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 160/315 (50%), Gaps = 35/315 (11%)
Query: 53 AIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFK 112
A + G ++S IHY R P W + K G + +ETY+ WNAHE +GQ++F
Sbjct: 9 AFYLKGQPFKILSGAIHYFRIDPADWYHSLYNLKALGFNTVETYIPWNAHEPRKGQFDFS 68
Query: 113 GKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQ 172
G+ D+ +F++ S GLY+ +R P++CAEW FGG P WL + + R+++ F E +
Sbjct: 69 GRLDLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWLLE-EDLRIRSSDPAFIEAVD 127
Query: 173 RFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAG 232
R+ +++ L+ + +GGPI+M+Q+ENEYG SYG+ KDY++ + G
Sbjct: 128 RYYDRLLGLLTPYQVD--RGGPILMMQVENEYG----SYGED-KDYLRAIRDLMKEKGVT 180
Query: 233 VPW---------VMCKQTDAPENIIDACN----GYYCDGYKPNSYNK-----PTLWTENW 274
P + T E++ N Y G +++ P + E W
Sbjct: 181 CPLFTSDGPWRATLRAGTLIEEDLFVTGNFGSKAAYNFGQMKEFFDEYGKRWPLMCMEFW 240
Query: 275 DGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG----GPF--- 327
DGW+T W + R E+LA AV + G +N YM+ GGTNFG +G G
Sbjct: 241 DGWFTRWKEPVIQRDPEELAEAVHEVLELGS--INLYMFHGGTNFGFMNGCSARGTLDLP 298
Query: 328 YITSYDYDAPIDEYG 342
+TSYDY A ++E G
Sbjct: 299 QVTSYDYGALLNEQG 313
Score = 40.4 bits (93), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 29/118 (24%), Positives = 48/118 (40%), Gaps = 29/118 (24%)
Query: 638 YSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
Y ++ + D +++ +Y+ F +D LD+ GKG +VNGH++GR+
Sbjct: 485 YPLDLQDISQLDFSKEWQAGAPAFYRYDFQLDHTLD-TYLDMTGFGKGVVFVNGHNLGRF 543
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
W V PT + Y VP +L+ N L++FE G
Sbjct: 544 WEV---------------------------GPTTSLY-VPHGFLKEGANSLIVFETEG 573
>gi|383110805|ref|ZP_09931623.1| hypothetical protein BSGG_1915 [Bacteroides sp. D2]
gi|313694380|gb|EFS31215.1| hypothetical protein BSGG_1915 [Bacteroides sp. D2]
Length = 778
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 115/353 (32%), Positives = 170/353 (48%), Gaps = 29/353 (8%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
++ ++++ + S++ A T + F + ++DG ++ +A +HY R W
Sbjct: 5 LIALLVLFTVIFFSTAQAQTTARKFEAGKN--TFLLDGKPFVVKAAELHYTRIPQAYWEH 62
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
I K G + I Y+FWN HE G+++F G+NDI F + G+Y+ +R GPYVC
Sbjct: 63 RIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGPYVC 122
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEW GG P WL I RT + + E + F+K++ + L +GG IIM+Q+
Sbjct: 123 AEWEMGGLPWWLLKKKDIALRTLDPYYMERVGIFMKEVGKQLAP--LQVNKGGNIIMVQV 180
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLG-AGVPWVMCK-----QTDAPENIIDACN-- 252
ENEYG SYG K YV + G VP C +A +++I N
Sbjct: 181 ENEYG----SYGID-KPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNFG 235
Query: 253 -GYYCD----GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
G D K P + +E W GW+ WG + RP +D+ + R SF
Sbjct: 236 TGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNISF 295
Query: 308 MNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+ YM GGT FG G P Y +SYDYDAPI E G ++ K+ L+DL
Sbjct: 296 -SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEPGWTTD-KFFLLRDL 346
Score = 44.7 bits (104), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 31/126 (24%), Positives = 53/126 (42%), Gaps = 30/126 (23%)
Query: 655 IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYR 714
+P+ +YK+ F D + LD+ + GKG WVNGH +GR+W +
Sbjct: 526 LPTMPAYYKSTFTL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------- 570
Query: 715 GAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEIS-VKLRSTRIVCE 773
P QT + +P WL+ N +++ + G I +K ++ E
Sbjct: 571 -------------GPQQTLF-MPGCWLKEGENEILVLDLKGPTRASIKGLKKPILDVLRE 616
Query: 774 QVSESH 779
+ E+H
Sbjct: 617 KAPETH 622
>gi|220914306|ref|YP_002489615.1| beta-galactosidase [Arthrobacter chlorophenolicus A6]
gi|219861184|gb|ACL41526.1| Beta-galactosidase [Arthrobacter chlorophenolicus A6]
Length = 586
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 105/311 (33%), Positives = 155/311 (49%), Gaps = 30/311 (9%)
Query: 52 RAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNF 111
R ++DG ++S IHY R P++W D I K++ G + IETYV WN H S G +
Sbjct: 9 RDFLLDGEPFRILSGAIHYFRVHPDLWADRIRKARLMGLNTIETYVPWNEHSSTPGAFRT 68
Query: 112 KGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEM 171
G D+ +F+ LV + G+ +R GPY+CAEW+ GG P WL P I R++ + +
Sbjct: 69 DGGLDLGRFLDLVAAEGMQGIVRPGPYICAEWDNGGLPAWLFTDPSIGVRSSEPGYLAAV 128
Query: 172 QRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGA 231
F+ +++ ++ E + +GGP+I+ QIENEYG +YG K Y++ A G
Sbjct: 129 DGFMDRLLPIVVERQI--TRGGPVILFQIENEYG----AYGSD-KAYLQHLVDTATRAGV 181
Query: 232 GVPWVMCKQTDAPENIID--ACNGYYCDG------------YKPNSYNKPTLWTENWDGW 277
VP C Q E +I+ + G + G + + P + E W+GW
Sbjct: 182 EVPLFTCDQPF--ETMIEDGSLPGLHKTGTFGSRADERLAFLRERQPDGPLMCAEFWNGW 239
Query: 278 YTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG------PFYITS 331
+ WG H + A G+ +N YM+ GGTNFG T+G ITS
Sbjct: 240 FDNWGTHH-HTTDAAASAAELDALLAAGASVNIYMFHGGTNFGFTNGANDKGIYEPTITS 298
Query: 332 YDYDAPIDEYG 342
YDYDAP+ E G
Sbjct: 299 YDYDAPLSEDG 309
>gi|417918764|ref|ZP_12562312.1| glycosyl hydrolase family 35 [Streptococcus parasanguinis SK236]
gi|342827747|gb|EGU62128.1| glycosyl hydrolase family 35 [Streptococcus parasanguinis SK236]
Length = 595
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 105/315 (33%), Positives = 160/315 (50%), Gaps = 35/315 (11%)
Query: 53 AIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFK 112
A + G ++S IHY R P W + K G + +ETYV WNAHE +GQ++F
Sbjct: 9 AFYLKGQPFKILSGAIHYFRIDPADWYHSLYNLKALGFNTVETYVPWNAHEPRKGQFDFS 68
Query: 113 GKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQ 172
G+ D+ +F++ S GLY+ +R P++CAEW FGG P WL + + R+++ F E +
Sbjct: 69 GRLDLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWLLE-EDLRIRSSDPVFIEAVD 127
Query: 173 RFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAG 232
R+ +++ L+ + +GGPI+M+Q+ENEYG SYG+ KDY++ + G
Sbjct: 128 RYYDRLLGLLTPYQVD--RGGPILMMQVENEYG----SYGED-KDYLRAIRDLMKEKGVT 180
Query: 233 VPW---------VMCKQTDAPENIIDACN----GYYCDGYKPNSYNK-----PTLWTENW 274
P + T E++ N Y G +++ P + E W
Sbjct: 181 CPLFTSDGPWRATLRAGTLIEEDLFVTGNFGSKATYNFGQMKEFFDEYGKRWPLMCMEFW 240
Query: 275 DGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG----GPF--- 327
DGW+T W + R E+LA AV + G +N YM+ GGTNFG +G G
Sbjct: 241 DGWFTRWKEPVIQRDPEELAEAVHEVLELGS--INLYMFHGGTNFGFMNGCSARGTLDLP 298
Query: 328 YITSYDYDAPIDEYG 342
+TSYDY A ++E G
Sbjct: 299 QVTSYDYGALLNEQG 313
Score = 40.4 bits (93), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 29/118 (24%), Positives = 48/118 (40%), Gaps = 29/118 (24%)
Query: 638 YSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
Y ++ + D +++ +Y+ F +D LD+ GKG +VNGH++GR+
Sbjct: 485 YPLDLQDLSQLDFSKEWQAGAPAFYRYDFQLDHTLD-TYLDMTGFGKGVVFVNGHNLGRF 543
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
W V PT + Y VP +L+ N L++FE G
Sbjct: 544 WEV---------------------------GPTTSLY-VPHGFLKEGANSLIVFETEG 573
>gi|395775444|ref|ZP_10455959.1| glycosyl hydrolase family 42 [Streptomyces acidiscabies 84-104]
Length = 587
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 98/305 (32%), Positives = 149/305 (48%), Gaps = 26/305 (8%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G +IS +HY R P+ W D + K++ G + +ETYV WN H+ G G
Sbjct: 13 LNGEPFRIISGALHYFRVHPDQWADRLRKARLMGLNTVETYVPWNLHQPEPGTLVLDGLL 72
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ +F++L + GL + LR GPY+CAEW+ GG P WL ++ R+++ F + R++
Sbjct: 73 DLPRFLRLAHAEGLKVLLRPGPYICAEWDGGGLPHWLMSESDVQLRSSDPKFTAIIDRYL 132
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
++ + M S GGP+I +Q+ENEYG +YG +Y+K+ G
Sbjct: 133 DLLLPPLLPHMAES--GGPVIAVQVENEYG----AYGNDA-EYLKYLVEAFRSRGIEELL 185
Query: 236 VMCKQTDAPENIIDACNGYYCDG------------YKPNSYNKPTLWTENWDGWYTTWGG 283
C Q + + G G + + P + E W GW+ WGG
Sbjct: 186 FTCDQVNPEHQQAGSIPGVLSTGTFGGKIETALATLRAHQPEGPLMCAEFWIGWFDHWGG 245
Query: 284 RLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAP 337
R D+A + + G S +N YM+ GGTNFG T+G + ITSYDYDAP
Sbjct: 246 PHHTRDTADVAADLDKLLAAGAS-VNIYMFHGGTNFGLTNGANHHHTYAPTITSYDYDAP 304
Query: 338 IDEYG 342
+ E G
Sbjct: 305 LTENG 309
>gi|58581392|ref|YP_200408.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae KACC 10331]
gi|58425986|gb|AAW75023.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae KACC 10331]
Length = 651
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 112/349 (32%), Positives = 165/349 (47%), Gaps = 47/349 (13%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+ DG L+S IH+ R W D + K++ G + +ETYVFWN E +GQ++F G
Sbjct: 77 FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 136
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
ND+ FV+ + GL + LR GPY CAEW GG+P WL I R+ + F Q
Sbjct: 137 NNDVAAFVQEAAAQGLNVILRPGPYACAEWEAGGYPAWLFGQGNIRVRSRDPRFLAASQA 196
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
++ + + + L + GGPII +Q+ENEYG SY D+ A + A+ + AG
Sbjct: 197 YLDAVAKQV--QPLLNHNGGPIIAVQVENEYG----SY---ADDHAYMADNRAMYVKAGF 247
Query: 234 PWVMCKQTDAPENIIDACNGYYCD---------GYKPNSYNK--------PTLWTENWDG 276
+ +D + + NG D G ++++K P + E W G
Sbjct: 248 DKALLFTSDGADML---ANGTLPDTLAVVNFAPGEAKSAFDKLIAFRPDQPRMVGEYWAG 304
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQ---RGGSFMNYYMYFGGTNFGRTSGGPF------ 327
W+ WG PH + A A F+ R G N YM+ GGT+FG +G F
Sbjct: 305 WFDHWGK--PHAATD--ATQQAEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQNNPSD 360
Query: 328 ----YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSA 372
TSYDYDA +DE G + K+ ++D A + +P + A A
Sbjct: 361 HYAPQTTSYDYDAIVDEAGRPTA-KFALMRDAIARVTGVQPPALPAPIA 408
>gi|442626280|ref|NP_001260120.1| beta galactosidase, isoform B [Drosophila melanogaster]
gi|440213416|gb|AGB92656.1| beta galactosidase, isoform B [Drosophila melanogaster]
Length = 670
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 116/376 (30%), Positives = 183/376 (48%), Gaps = 50/376 (13%)
Query: 1 MHSKKNNRALLQCLALSVYPMMMMMMMIHLSCVSSSSASTFFKP-FNVSYDHRAIIIDGN 59
M + NR L +A+S ++ +M + CV S + +P F + ++ ++DG
Sbjct: 1 MSGCRRNRKL--TMAVSGCLIIAVMALTVGLCVGLSGDTDAPEPRFTIDHEANTFMLDGQ 58
Query: 60 RRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVK 119
+S HY RA PE W + + G + ++TYV W+ H G+YN++G D+VK
Sbjct: 59 PFRYVSGSFHYFRAVPESWRSRLRTMRASGLNALDTYVEWSLHNPHDGEYNWEGIADVVK 118
Query: 120 FVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWL-RDIPGIEFRTNNAPFKEEMQRFVKKI 178
F+++ Y+ LR GPY+CAE + GG P WL P I+ RTN+ + E+ ++ ++
Sbjct: 119 FLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFTKYPSIKMRTNDPNYISEVGKWYAEL 178
Query: 179 VDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKW--------AASMALGLG 230
+ R + LF GG IIM+Q+ENEYG+ + DY+ W + AL
Sbjct: 179 --MPRLQHLFVGNGGKIIMVQVENEYGDYACDH-----DYLNWLRDETEKYVSGKALLFT 231
Query: 231 AGVP--WVMCKQTDAPENIIDACNGYYCDGYKPNSYNK------------PTLWTENWDG 276
+P + C + EN+ A + D + N +K P + +E + G
Sbjct: 232 VDIPNEKMSCGKI---ENVF-ATTDFGID--RINEIDKIWAMLRALQPTGPLVNSEFYPG 285
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY-------- 328
W T W + R +++A A+ S +N YM+FGGTNFG T+G +
Sbjct: 286 WLTHWQEQNQRRDGQEVANALRTILSYNAS-VNLYMFFGGTNFGFTAGANYNLDGGIGYA 344
Query: 329 --ITSYDYDAPIDEYG 342
ITSYDYDA +DE G
Sbjct: 345 ADITSYDYDAVMDEAG 360
Score = 43.5 bits (101), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 76/315 (24%), Positives = 123/315 (39%), Gaps = 62/315 (19%)
Query: 455 LPLSPNISV-PQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGI----LEHLNVTKD 509
LPL P I++ P + + ++ T K + E S+ + V+ I E L++
Sbjct: 377 LPL-PEITLNPAKRLAYGRVELTPKLTLLSTEGRAALSKGD-PVESIKPKTFEELDL--- 431
Query: 510 YSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQ 569
YS + + T++ D D + K ID + D VF++ +L G++
Sbjct: 432 YSGLVLYETELPSMDLDPALLK---------IDQINDRAHVFVDQELVGTLSREAQIYSL 482
Query: 570 PVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILW-TYQV 628
P+ G + L LL + G N+ ++ D G G+V L G + L W +
Sbjct: 483 PLSKGWG-STLQLLVENQGRVNF--YISNDTKGIFGEVSLQLHNGGYLPLEN--WRSTAF 537
Query: 629 GLKGEFQQIYSIEENEAEWTD--LTRDGI----PSTFTWYKTYFDAPDGIDPVALDLGSM 682
L+ +++ E + + D L R I P +T T + D L++
Sbjct: 538 PLEQSAVELWRREHTDEKALDPLLARQRILRNGPILYTGSLTVTEVGD----TYLNMAGW 593
Query: 683 GKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQ 742
GKG A+VNG ++GRYW V P Q +VP L+
Sbjct: 594 GKGVAYVNGFNLGRYWPVAGP---------------------------QVTLYVPNEILK 626
Query: 743 ASNNLLVIFEETGGN 757
N LVI E N
Sbjct: 627 VGENSLVILEYQRAN 641
>gi|325925751|ref|ZP_08187124.1| beta-galactosidase [Xanthomonas perforans 91-118]
gi|325543808|gb|EGD15218.1| beta-galactosidase [Xanthomonas perforans 91-118]
Length = 611
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 112/349 (32%), Positives = 166/349 (47%), Gaps = 47/349 (13%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+ DG ++S IH+ R W D + K++ G + +ETYVFWN E +GQ++F G
Sbjct: 37 FVRDGKPYQVLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 96
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
ND+ FV+ + GL + LR GPY CAEW GG+P WL I R+ + F Q
Sbjct: 97 NNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQS 156
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
++ + ++ L + GGPII +Q+ENEYG SY D+ A + A+ + AG
Sbjct: 157 YLDALAKQVQP--LLNHNGGPIIAVQVENEYG----SY---ADDHAYMADNRAMYVKAGF 207
Query: 234 PWVMCKQTDAPENIIDACNGYYCD---------GYKPNSYNK--------PTLWTENWDG 276
+ +D + + NG D G ++++K P + E W G
Sbjct: 208 DKALLFTSDGADML---ANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAG 264
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQ---RGGSFMNYYMYFGGTNFGRTSGGPF------ 327
W+ WG PH + A A F+ R G N YM+ GGT+FG +G F
Sbjct: 265 WFDHWGK--PHAATD--ARQQAEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQNNPSD 320
Query: 328 ----YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSA 372
TSYDYDA +DE G + PK+ ++D A + +P + A A
Sbjct: 321 HYAPQTTSYDYDAILDEAGHPT-PKFALMRDAIARVTGVQPPALPAPIA 368
>gi|340722578|ref|XP_003399681.1| PREDICTED: beta-galactosidase-like [Bombus terrestris]
Length = 646
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 110/357 (30%), Positives = 177/357 (49%), Gaps = 38/357 (10%)
Query: 10 LLQCLALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIH 69
L L L+V M ++ IH++ + S S F V Y++ ++DG IS H
Sbjct: 2 LTVLLTLAVTSAMGEVVNIHVNNDTQSKFS-----FEVDYENNQFLLDGKPFRYISGSFH 56
Query: 70 YPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGL 129
Y R + W D + K + G + + TYV W+ H+ ++++ G D+++F+ + GL
Sbjct: 57 YFRTPRQYWRDRLRKMRAAGLNAVSTYVEWSLHQPTENEWHWTGDADVIEFINIAQEEGL 116
Query: 130 YLQLRIGPYVCAEWNFGGFPVW-LRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLF 188
++ LR GPY+CAE +FGG P W L +P I+ RTN++ + + ++ ++ +I+D + +
Sbjct: 117 FVLLRPGPYICAERDFGGLPYWLLARVPDIKLRTNDSRYMKYVEIYLNEILD--KVQPYL 174
Query: 189 SWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVKWAASMALGL----GAGVPWVMC---- 238
GGPIIM+Q+ENEYG+ + Y + +D ++ L GA + C
Sbjct: 175 RGNGGPIIMVQVENEYGSYACDREYLSRLRDIMRQKIGTKALLYSTDGANANMLRCGFIP 234
Query: 239 ---KQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVE--DL 293
D N N Y+P P + +E + GW T W R P + V+ +
Sbjct: 235 EVYATVDFGPNTNVTKNFEIMRMYQPRG---PLVNSEFYPGWLTHW--REPFQRVQTATV 289
Query: 294 AFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--------PFYITSYDYDAPIDEYG 342
+ G S +N YM++GGTNFG T+G P +TSYDYDAP+ E G
Sbjct: 290 TKTLDEMLSLGAS-VNIYMFYGGTNFGYTAGANGGHNAYNP-QLTSYDYDAPLTEAG 344
>gi|313231409|emb|CBY08524.1| unnamed protein product [Oikopleura dioica]
Length = 493
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 101/321 (31%), Positives = 149/321 (46%), Gaps = 30/321 (9%)
Query: 53 AIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFK 112
A +DG + L+S IHY R E W D + K K G + +E YV WN HE G++NF
Sbjct: 62 AFWLDGEKITLVSGSIHYFRVPNEYWLDRLTKLKYAGLNTVELYVSWNLHEPYSGEFNFS 121
Query: 113 GKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQ 172
G D+V+F+++ G GL++ R GPY+CAEW +GG P WL ++ RT + E ++
Sbjct: 122 GDLDVVRFIEMAGELGLHVLFRPGPYICAEWEWGGHPYWLLHDTDMKVRTTYPGYLEAVE 181
Query: 173 RFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSY--GQQGKDYVKWAASM----- 225
+F ++ R L GGPII +QIENEY ++ G ++ W
Sbjct: 182 KFYSELFG--RVNHLMYRNGGPIIAVQIENEYAGFADAFEIGPLDPGFLTWLRQTIKDQQ 239
Query: 226 --ALGLGAGVPWVMCKQTDAPE----NIIDACNG-YYCDGYKPNSYNKPTLWTENWDGWY 278
L + W K + N D Y+ + + N KP + E W GW+
Sbjct: 240 CEELLFTSDGGWDFYKYELEGDPYGLNFDDVLRANYWLNILENNQPGKPKMVMEWWSGWF 299
Query: 279 TTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----------- 327
WG + + + S +NYYM+ GGTNFG +G F
Sbjct: 300 DFWGYHHQGTTADSFEENLRAILSQNAS-VNYYMFHGGTNFGYMNGANFNTNDQTNDLEY 358
Query: 328 --YITSYDYDAPIDEYGLLSE 346
+TSYDYD P+ E G +++
Sbjct: 359 QPVVTSYDYDCPLSEEGRITK 379
>gi|301763006|ref|XP_002916929.1| PREDICTED: beta-galactosidase-1-like protein 3-like [Ailuropoda
melanoleuca]
Length = 1209
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 101/308 (32%), Positives = 154/308 (50%), Gaps = 34/308 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+ G++ ++ IHY R E W D + K K G + + TYV WN HE RG+++F
Sbjct: 499 LGGHKFLIFGGSIHYFRVPREYWRDRLMKLKACGFNTLTTYVPWNLHEPERGKFDFSENL 558
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ FV + GL++ LR GPY+C+E + GG P WL P + RT F E + ++
Sbjct: 559 DLEAFVLMAAEIGLWVILRPGPYICSEIDLGGLPSWLLQDPEMILRTTYKGFVEAVDKYF 618
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
++ R L +GGPII +Q+ENEYG+ KDY+ + L G+
Sbjct: 619 DHLIS--RVVPLQYHKGGPIIAVQVENEYGSFAVD-----KDYMPYVRKAL--LERGIVE 669
Query: 236 VMCKQTDAP-------ENIIDACNGYYCDGYKPNSY--------NKPTLWTENWDGWYTT 280
++ DA E ++ N + ++ +++ NKP + E W GW+ T
Sbjct: 670 LLVTSDDAENLQKGYLEGVLATIN---MNTFEKSAFEQLSQLQRNKPIMVMEYWVGWFDT 726
Query: 281 WGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDY 334
WGG+ ED+ V++F SF N YM+ GGTNFG +G ++ +TSYDY
Sbjct: 727 WGGKHMVNNAEDVEETVSKFITSEISF-NVYMFHGGTNFGFMNGATYFGIHRAVVTSYDY 785
Query: 335 DAPIDEYG 342
DA + E G
Sbjct: 786 DALLTEAG 793
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/174 (29%), Positives = 81/174 (46%), Gaps = 30/174 (17%)
Query: 53 AIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFK 112
+ +DG+ ++I+ IHY R E W D + K K G + + T
Sbjct: 55 SFTLDGSPFLIIAGTIHYFRVPREYWRDRLMKLKACGFNTVTT----------------- 97
Query: 113 GKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQ 172
FV + GL++ L GPY+ ++ + GG P WL P ++ RT F + +
Sbjct: 98 ------AFVAMASDVGLWVILCPGPYIGSDLDLGGLPSWLLRDPKMKLRTTYRGFTKAVN 151
Query: 173 RFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
+ KI+ + + L +GGPII LQ+ENEYG SY Q K Y+ + +A
Sbjct: 152 LYFDKIIPKIVQ--LQYGKGGPIIALQVENEYG----SY-HQDKRYMPYIKKLA 198
>gi|413922057|gb|AFW61989.1| hypothetical protein ZEAMMB73_453254 [Zea mays]
Length = 139
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 68/100 (68%), Positives = 84/100 (84%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
VSYDHRA++I+G RR+LIS IHYPR+TPEMWP L+ K+K+GG DV++TYVFWN HE +R
Sbjct: 28 VSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTYVFWNGHEPVR 87
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFG 146
GQY F + D+V+FVKL +GLY+ LRIGPYVCAEWNFG
Sbjct: 88 GQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFG 127
>gi|291557570|emb|CBL34687.1| Beta-galactosidase [Eubacterium siraeum V10Sc8a]
Length = 579
Score = 160 bits (404), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 104/318 (32%), Positives = 155/318 (48%), Gaps = 44/318 (13%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG +IS IHY R PE W D + K G + +ETY+ WN HE+ +G +N+ G +
Sbjct: 12 LDGKPFKVISGSIHYFRTVPEYWQDRLEKLVNIGCNTVETYIPWNFHETEKGNFNWNGMH 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
DI +F++L GLY+ +R PY+C+EW FGG P WL + R + P+ + +
Sbjct: 72 DICRFIELADKLGLYMIIRPSPYICSEWEFGGLPAWLLKDRSMRLRCSYKPYLNAVDSYY 131
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
++ + + + GG IIM+QIENEYG Y Y+++ G VP+
Sbjct: 132 SVLMPKLAPYQIDN--GGNIIMMQIENEYG-----YYGNDTSYLEFLRDTMRKYGITVPF 184
Query: 236 VMCKQTDAPENIIDACNGYYCDGYKPN------------------SYNKPTLWTENWDGW 277
V +D P + +G DG P +KP + E W+GW
Sbjct: 185 V---TSDGPWSEFVFKSG-MVDGALPTGNFGSSAEWQFGEMRRFIGEDKPLMCMEFWNGW 240
Query: 278 YTTWGGR----LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG-----GPFY 328
+ WG P + ++L + + GS MN+YM+ GGTNFG SG
Sbjct: 241 FDVWGEEHNITAPEKAAQELDILL-----KNGS-MNFYMFEGGTNFGFMSGKNNEKKTGI 294
Query: 329 ITSYDYDAPIDEYGLLSE 346
+TSYDYDAP+ E G ++E
Sbjct: 295 VTSYDYDAPLTEDGRITE 312
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 51/236 (21%), Positives = 92/236 (38%), Gaps = 51/236 (21%)
Query: 531 KTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQ 590
+ NE TV ++ D ++ F NG+ + + + +S LL + +G
Sbjct: 388 RENETVSTVRCENTADRVQGFRNGKYAFTAFAETIDEQFELAEKSAGGTTDLLVENIGRV 447
Query: 591 NYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDL 650
N+G LE G G +++ + ++ + ++EN+ D
Sbjct: 448 NFGTGLECQHKGVLGGIRINDHRQYGFEMFTL----------------PLDENQLGRIDY 491
Query: 651 TR---DGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGC 707
R DG+P+ +YK F+ + D LD GKG A++NG ++GR+W +
Sbjct: 492 NRGYNDGVPA---FYKFEFEISEVADTF-LDTDGFGKGVAFINGFNLGRFWNI------- 540
Query: 708 QDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
P + Y +P L+ N +VIFE G + I++
Sbjct: 541 --------------------GPQKKLY-IPAPLLKKGKNEIVIFETEGNSADSITL 575
>gi|167750408|ref|ZP_02422535.1| hypothetical protein EUBSIR_01382 [Eubacterium siraeum DSM 15702]
gi|167656559|gb|EDS00689.1| glycosyl hydrolase family 35 [Eubacterium siraeum DSM 15702]
Length = 579
Score = 160 bits (404), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 104/318 (32%), Positives = 155/318 (48%), Gaps = 44/318 (13%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG +IS IHY R PE W D + K G + +ETY+ WN HE+ +G +N+ G +
Sbjct: 12 LDGKPFKVISGSIHYFRTVPEYWQDRLEKLVNIGCNTVETYIPWNFHETEKGNFNWNGMH 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
DI +F++L GLY+ +R PY+C+EW FGG P WL + R + P+ + +
Sbjct: 72 DICRFIELADKLGLYMIIRPSPYICSEWEFGGLPAWLLKDRSMRLRCSYKPYLNAVDSYY 131
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
++ + + + GG IIM+QIENEYG Y Y+++ G VP+
Sbjct: 132 SVLMPKLAPYQIDN--GGNIIMMQIENEYG-----YYGNDTSYLEFLRDTMRKYGITVPF 184
Query: 236 VMCKQTDAPENIIDACNGYYCDGYKPN------------------SYNKPTLWTENWDGW 277
V +D P + +G DG P +KP + E W+GW
Sbjct: 185 V---TSDGPWSEFVFKSG-MVDGALPTGNFGSSAEWQFGEMRRFIGEDKPLMCMEFWNGW 240
Query: 278 YTTWGGR----LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG-----GPFY 328
+ WG P + ++L + + GS MN+YM+ GGTNFG SG
Sbjct: 241 FDVWGEEHNITAPEKAAQELDILL-----KNGS-MNFYMFEGGTNFGFMSGKNNEKKTGI 294
Query: 329 ITSYDYDAPIDEYGLLSE 346
+TSYDYDAP+ E G ++E
Sbjct: 295 VTSYDYDAPLTEDGRITE 312
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 50/236 (21%), Positives = 92/236 (38%), Gaps = 51/236 (21%)
Query: 531 KTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQ 590
+ NE TV ++ D ++ F NG+ + + + +S LL + +G
Sbjct: 388 RENETVSTVRCENAADRVQGFRNGKYAFTAFAETIDEQFELAEKSAGGTTDLLVENIGRV 447
Query: 591 NYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDL 650
N+G LE G G +++ + ++ + ++EN+ + D
Sbjct: 448 NFGTGLECQHKGVLGGIRINDHRQYGFEMFTL----------------PLDENQLDRIDY 491
Query: 651 TR---DGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGC 707
R DG+P+ +YK F+ + D LD KG A++NG ++GR+W +
Sbjct: 492 NRGYNDGVPA---FYKFEFEISETADTF-LDTDGFRKGVAFINGFNLGRFWNI------- 540
Query: 708 QDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
P + Y +P L+ N +VIFE G + I++
Sbjct: 541 --------------------GPQKKLY-IPAPLLKKGKNEIVIFETEGNSADSITL 575
>gi|324507659|gb|ADY43243.1| Beta-galactosidase [Ascaris suum]
Length = 655
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 104/344 (30%), Positives = 173/344 (50%), Gaps = 36/344 (10%)
Query: 22 MMMMMMIHLSC---VSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMW 78
+++++ + +C + S SA +F ++ + ++DG IS IHY R P+ W
Sbjct: 10 LIIILSLLFNCGAVIDSHSAPSF----SIDPQNNVFLLDGRSFRYISGSIHYFRVHPDQW 65
Query: 79 PDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPY 138
D +++ + G + I+ Y+ WN HE G++ F G +I F++L + LY +RIGPY
Sbjct: 66 NDRLSRMRAAGLNAIQFYIPWNFHEIYEGKHRFDGSRNITHFLQLAMQNELYALVRIGPY 125
Query: 139 VCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIML 198
+CAEW GG P WL I+ RT++ F + ++R+ ++ +++ + GGPI+ML
Sbjct: 126 ICAEWENGGAPWWLLKYKDIKMRTSDKRFLDAVKRWFDVLLPILKPNL--RKNGGPILML 183
Query: 199 QIENEYGNMESSYGQQGKDYVKWAASMALG---------------LGAG-VPWVMCKQTD 242
Q+ENEYG+ + + +++ A G L G +P V
Sbjct: 184 QLENEYGSFDGGCDRNYTIFLRDLARRHFGDDVVLYTTDGGDDFYLKCGTIPGVYATVDF 243
Query: 243 APEN--IIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGR-LPHRPVEDLAFAVAR 299
P + ID C Y+P+ P + +E + GW+ TW + +PV ++
Sbjct: 244 GPASSEAIDHCFASQRQ-YEPHG---PLVNSEFYPGWFLTWSQKERGDQPVHNVINGSKY 299
Query: 300 FFQRGGSFMNYYMYFGGTNFGRTSGGP---FYITSYDYDAPIDE 340
F++G +F NYYM+ GGTNF +GG TSYDY AP+ E
Sbjct: 300 MFEKGANF-NYYMFHGGTNFAFWNGGATKTAITTSYDYFAPLSE 342
>gi|258507331|ref|YP_003170082.1| beta-galactosidase (GH35) [Lactobacillus rhamnosus GG]
gi|385827042|ref|YP_005864814.1| beta-galactosidase [Lactobacillus rhamnosus GG]
gi|257147258|emb|CAR86231.1| Beta-galactosidase (GH35) [Lactobacillus rhamnosus GG]
gi|259648687|dbj|BAI40849.1| beta-galactosidase [Lactobacillus rhamnosus GG]
Length = 593
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 113/358 (31%), Positives = 172/358 (48%), Gaps = 51/358 (14%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
S DH ++DG ++S IHY R P W + K G + +ETYV WN HE G
Sbjct: 5 SIDHE-FMLDGKPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYREG 63
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
+++F G DI +F+K GLY +R PY+CAEW FGGFP WL + RT++ +
Sbjct: 64 EFDFSGILDIERFLKTAEDLGLYAIVRPSPYICAEWEFGGFPAWLL-TKKMRLRTDDPAY 122
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
+ R+ ++ + + + GG +IM+Q+ENEYG SYG+ +DY+ A +
Sbjct: 123 LAAIDRYYTALMPHLVDHQVT--HGGNVIMMQVENEYG----SYGED-QDYLAAVAKLMQ 175
Query: 228 GLGAGVPWVMCKQTDAP-------ENIIDA---CNGYYCDG-----------YKPNSYNK 266
G VP +D P ++IDA G + ++ + +
Sbjct: 176 QHGVDVPLFT---SDGPWPATLNAGSMIDAGILATGNFGSAADKNFDRLAAFHQEHGRDW 232
Query: 267 PTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGP 326
P + E WDGW+ WG + R ++ A + +RG +N YM+ GGTNFG +G
Sbjct: 233 PLMCVEFWDGWFNRWGEPIIRRDPDETAEDLRAVIKRGS--VNLYMFHGGTNFGFMNGTS 290
Query: 327 FY-------ITSYDYDAPIDEYGLLSEPKW--------GHLKDLHAAIKLCEPALVAA 369
+TSYDYDAP++E G + PK+ L ++ A L +P + A
Sbjct: 291 ARKDHDLPQVTSYDYDAPLNEQGNPT-PKYFAIQKMIHEELPEVQQAKPLVKPTMAPA 347
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 52/182 (28%), Positives = 73/182 (40%), Gaps = 47/182 (25%)
Query: 575 SGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGD-IDLSKILWTYQVGLKGE 633
G++ L LL + + NYG+ +E + G + G +DL I KG
Sbjct: 441 EGHHQLDLLVENMSRVNYGSKIE-------AITQFKGIRTGVMVDLHFI--------KG- 484
Query: 634 FQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHH 693
+QQ Y ++ N A T P+T +YK FD D LD GKG VNG +
Sbjct: 485 YQQ-YPLDLNRASRLTFTEGWQPATPAFYKYTFDLTAPQD-TYLDCRGFGKGVMLVNGVN 542
Query: 694 IGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEE 753
+GR+W KG PT + Y VP L A N +++FE
Sbjct: 543 VGRFWE----KG-----------------------PTLSLY-VPAGLLHAGKNDVIVFET 574
Query: 754 TG 755
G
Sbjct: 575 EG 576
>gi|345800024|ref|XP_546385.3| PREDICTED: galactosidase, beta 1-like 3 [Canis lupus familiaris]
Length = 808
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 110/350 (31%), Positives = 168/350 (48%), Gaps = 31/350 (8%)
Query: 29 HLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEG 88
HL+ + + +P + + G++ + IHY R W D + K K
Sbjct: 210 HLTPLELEDRAAGLEPQSPGGRKPCFTLGGHKFQVFGGSIHYFRVPRAYWGDRLRKLKAC 269
Query: 89 GADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGF 148
G + + TYV WN HE RG+++F G D+ FV L GL++ LR GPY+C+E + GG
Sbjct: 270 GFNTVTTYVPWNLHEPERGKFDFSGNLDMEAFVLLAAEMGLWVILRPGPYICSEIDLGGL 329
Query: 149 PVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNME 208
P WL P + RT + F + + ++ ++ R L +GGPII +Q+ENEYG+
Sbjct: 330 PSWLLQDPKMVLRTTYSGFVKAVDKYFDHLIS--RVVPLQYRRGGPIIAVQVENEYGSFA 387
Query: 209 SSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDA----------CNGYYCDG 258
G Y++ A L G+ ++ DA EN++ N +
Sbjct: 388 EDRGYM--PYLQKAL-----LERGIVELLVTSDDA-ENLLKGHIKGVLATINMNSFQESD 439
Query: 259 YKPNSY---NKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFG 315
+K SY NKP + E W GW+ TWG + +D+ V +F SF N YM+ G
Sbjct: 440 FKLLSYVQSNKPIMVMEFWVGWFDTWGSEHKVKNPKDVEETVTKFIASEISF-NVYMFHG 498
Query: 316 GTNFGRTSGGPFY------ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAI 359
GTNFG +G + +TSYDYDA + E G +E K+ L+ L ++
Sbjct: 499 GTNFGFMNGATDFGIHRGVVTSYDYDAVLTEAGDYTE-KYFKLRRLFGSV 547
>gi|393782614|ref|ZP_10370797.1| hypothetical protein HMPREF1071_01665 [Bacteroides salyersiae
CL02T12C01]
gi|392672841|gb|EIY66307.1| hypothetical protein HMPREF1071_01665 [Bacteroides salyersiae
CL02T12C01]
Length = 605
Score = 159 bits (403), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 109/321 (33%), Positives = 161/321 (50%), Gaps = 31/321 (9%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFK-GK 114
+D +IS IH R E W I K G + + Y+ WN HES G ++F+ G
Sbjct: 41 LDDKPFQIISGEIHPSRIPAEYWKQRIQMIKAMGCNTVACYIMWNYHESEPGVFDFQTGN 100
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
++ KF++ V G++L R GPYVC EW+FGG P +L IP I+ R + + ++R+
Sbjct: 101 KNLEKFIQTVQDEGMFLLFRPGPYVCGEWDFGGLPPYLLSIPDIKIRCMDTRYTAAVERY 160
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
V KI ++++ + + GGPIIM+Q+ENEYG SYG + Y+KW + G VP
Sbjct: 161 VDKIAPIIKKYEITN--GGPIIMVQVENEYG----SYGND-RIYMKWMHDLWRDKGIEVP 213
Query: 235 W--------VMCKQTDAPENIID----ACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWG 282
+ M + P I A + + K + + +E + GW T W
Sbjct: 214 FYTADGATPYMLEAGTLPGVAIGLDPAASKAEFDEALKVHP-DASVFCSELYPGWLTHWR 272
Query: 283 GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG------GPFY--ITSYDY 334
H +E + V G SF NYY+ GGTNFG +G G + +TSYDY
Sbjct: 273 EEWQHPSIEKITTDVKWLLDNGKSF-NYYVIHGGTNFGFWAGANSPQPGTYQPDVTSYDY 331
Query: 335 DAPIDEYGLLSEPKWGHLKDL 355
DAPI+E G + PK+ L++L
Sbjct: 332 DAPINEMG-QATPKYMALREL 351
Score = 40.4 bits (93), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 77/164 (46%), Gaps = 23/164 (14%)
Query: 539 VTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLI-LLSQTVGLQNYGAFLE 597
+ +D + D VF+NG+ GS+ + + + N ++ +L +++G N+ A +
Sbjct: 423 LRVDEVHDYATVFLNGRYIGSIDRTLGQHTIDLPVSNVENPVLDILVESMGRINFAAQM- 481
Query: 598 KDGAGFRGQVKLTGFKNGDIDLSKILW-TYQVGLKGEFQQIYSIEENEAEWTDLTRDGIP 656
D G +V L G ++ + W + + + E+ + +++E +D R G+
Sbjct: 482 IDRKGITDRVTLNG-------MTLMNWEAFNIPMSSEY--VSNLKE-----SDTVRPGM- 526
Query: 657 STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
++KT D +DL KG +VNGH++GR+W V
Sbjct: 527 ----FFKTTLQL-DKAGDCYIDLKDFTKGLVYVNGHNLGRFWNV 565
>gi|291530918|emb|CBK96503.1| Beta-galactosidase [Eubacterium siraeum 70/3]
Length = 579
Score = 159 bits (403), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 103/314 (32%), Positives = 150/314 (47%), Gaps = 36/314 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG +IS IHY R PE W D + K G + +ETY+ WN HE+ +G +N+ G +
Sbjct: 12 LDGKPFKVISGSIHYFRTVPEYWQDRLEKLVNIGCNTVETYIPWNFHETEKGNFNWDGMH 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
DI +F++L GLY+ +R PY+C+EW FGG P WL + R + P+ + +
Sbjct: 72 DICRFIELADKLGLYMIIRPSPYICSEWEFGGLPAWLLKDRSMRLRCSYKPYLNAVDNYY 131
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
++ + + + GG IIM+QIENEYG Y Y+++ G VP+
Sbjct: 132 SVLMPKLAPYQIDN--GGNIIMMQIENEYG-----YYGNDTSYLEFLRDTMRKYGITVPF 184
Query: 236 VMCKQTDAPENIIDACNGYYCDGYKPN------------------SYNKPTLWTENWDGW 277
V +D P + +G DG P KP + E W+GW
Sbjct: 185 V---TSDGPWSEFVFKSG-MVDGALPTGNFGSSAEWQLGEMRRFIGEGKPLMCMEFWNGW 240
Query: 278 YTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG-----GPFYITSY 332
+ WG E A + + G MN+YM+ GGTNFG SG +TSY
Sbjct: 241 FDVWGEEHNITAPEKAAQELDTLLKNGS--MNFYMFEGGTNFGFMSGKNNEKKTGIVTSY 298
Query: 333 DYDAPIDEYGLLSE 346
DYDAP+ E G ++E
Sbjct: 299 DYDAPLTEDGRITE 312
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 50/228 (21%), Positives = 89/228 (39%), Gaps = 51/228 (22%)
Query: 531 KTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQ 590
+ NE+ TV ++ D ++ F NG+ + + + +S LL + +G
Sbjct: 388 RENEIVSTVRCENTADRVQGFRNGKYAFTAFAETIDEQFELAEKSAGGTTDLLVENIGRV 447
Query: 591 NYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDL 650
N+G LE G G +++ + ++ + ++EN+ D
Sbjct: 448 NFGTGLECQHKGVLGGIRINDHRQYGFEMFTL----------------PLDENQLGRIDY 491
Query: 651 TR---DGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGC 707
R DG+P+ +YK F+ + D LD GKG A++NG ++GR+W +
Sbjct: 492 NRGYNDGVPA---FYKFEFEISEVADTF-LDTDGFGKGVAFINGFNLGRFWNI------- 540
Query: 708 QDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
P + Y +P L+ N +VIFE G
Sbjct: 541 --------------------GPQKKLY-IPAPLLKKGKNEIVIFETEG 567
>gi|325914137|ref|ZP_08176490.1| beta-galactosidase [Xanthomonas vesicatoria ATCC 35937]
gi|325539640|gb|EGD11283.1| beta-galactosidase [Xanthomonas vesicatoria ATCC 35937]
Length = 635
Score = 159 bits (403), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 107/352 (30%), Positives = 166/352 (47%), Gaps = 53/352 (15%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+ DG ++S IH+ R W D + K++ G + +ETYVFWN E +GQ++F
Sbjct: 61 FVRDGKPYQILSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSA 120
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
ND+ FV+ + GL + LR GPY CAEW GG+P WL I R+ + F Q
Sbjct: 121 NNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKDNIRVRSRDPRFLAASQA 180
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
++ + + + L + GGPII +Q+ENEYG+ + D+ A + A+ + AG
Sbjct: 181 YLDAVAKQV--QPLLNHNGGPIIAVQVENEYGSYDD-------DHAYMADNRAMFVKAGF 231
Query: 234 PWVMCKQTDAPENIIDACNGY---------YCDGYKPNSYNK--------PTLWTENWDG 276
+ +D + + NG + G ++++K P + E W G
Sbjct: 232 DKALLFTSDGADML---ANGTLPGTLAVVNFAPGEAKSAFDKLIKFRPEQPRMVGEYWAG 288
Query: 277 WYTTWGGRLPH------RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF--- 327
W+ WG PH + E+L + + R G N YM+ GGT+FG +G F
Sbjct: 289 WFDHWG--TPHASTDAKQQTEELEWIL-----RQGHSANLYMFIGGTSFGFMNGANFQGN 341
Query: 328 -------YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSA 372
TSYDYDA +DE G + PK+ ++D A + +P + A A
Sbjct: 342 PSDHYAPQTTSYDYDAILDEAGHPT-PKFALMRDAIARVTGTQPPALPAPIA 392
>gi|258538519|ref|YP_003173018.1| beta-galactosidase [Lactobacillus rhamnosus Lc 705]
gi|385834266|ref|YP_005872040.1| beta-galactosidase family protein [Lactobacillus rhamnosus ATCC
8530]
gi|257150195|emb|CAR89167.1| Beta-galactosidase (GH35) [Lactobacillus rhamnosus Lc 705]
gi|355393757|gb|AER63187.1| beta-galactosidase family protein [Lactobacillus rhamnosus ATCC
8530]
Length = 593
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 113/358 (31%), Positives = 172/358 (48%), Gaps = 51/358 (14%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
S DH ++DG ++S IHY R P W + K G + +ETYV WN HE G
Sbjct: 5 SIDHE-FMLDGKPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYREG 63
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
+++F G DI +F+K GLY +R PY+CAEW FGGFP WL + RT++ +
Sbjct: 64 EFDFSGILDIERFLKTAEDLGLYAIVRPSPYICAEWEFGGFPAWLL-TKKMRLRTDDPAY 122
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
+ R+ ++ + + + GG +IM+Q+ENEYG SYG+ +DY+ A +
Sbjct: 123 LAAIDRYYTALMPHLVDHQVT--HGGNVIMMQVENEYG----SYGED-QDYLAAVAKLMQ 175
Query: 228 GLGAGVPWVMCKQTDAP-------ENIIDA---CNGYYCDG-----------YKPNSYNK 266
G VP +D P ++IDA G + ++ + +
Sbjct: 176 QHGVDVPLFT---SDGPWPATLNAGSMIDAGILATGNFGSAADKNFDRLAAFHQEHGRDW 232
Query: 267 PTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGP 326
P + E WDGW+ WG + R ++ A + +RG +N YM+ GGTNFG +G
Sbjct: 233 PLMCMEFWDGWFNRWGEPIIRRDPDETAEDLRAVIKRGS--VNLYMFHGGTNFGFMNGTS 290
Query: 327 FY-------ITSYDYDAPIDEYGLLSEPKW--------GHLKDLHAAIKLCEPALVAA 369
+TSYDYDAP++E G + PK+ L ++ A L +P + A
Sbjct: 291 ARKDHDLPQVTSYDYDAPLNEQGNPT-PKYFAIQKMIHEELPEVQQAKPLVKPTMAPA 347
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 52/182 (28%), Positives = 73/182 (40%), Gaps = 47/182 (25%)
Query: 575 SGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGD-IDLSKILWTYQVGLKGE 633
G++ L LL + + NYG+ +E + G + G +DL I KG
Sbjct: 441 EGHHQLDLLVENMSRVNYGSKIE-------AITQFKGIRTGVMVDLHFI--------KG- 484
Query: 634 FQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHH 693
+QQ Y ++ N A T P+T +YK FD D LD GKG VNG +
Sbjct: 485 YQQ-YPLDLNRASRLTFTEGWQPATPAFYKYTFDLTAPQD-TYLDCHGFGKGVMLVNGVN 542
Query: 694 IGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEE 753
+GR+W KG PT + Y VP L A N +++FE
Sbjct: 543 VGRFWE----KG-----------------------PTLSLY-VPAGLLHAGKNDVIVFET 574
Query: 754 TG 755
G
Sbjct: 575 EG 576
>gi|84623327|ref|YP_450699.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae MAFF 311018]
gi|188577369|ref|YP_001914298.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae PXO99A]
gi|84367267|dbj|BAE68425.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae MAFF 311018]
gi|188521821|gb|ACD59766.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae PXO99A]
Length = 613
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 112/349 (32%), Positives = 165/349 (47%), Gaps = 47/349 (13%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+ DG L+S IH+ R W D + K++ G + +ETYVFWN E +GQ++F G
Sbjct: 39 FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 98
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
ND+ FV+ + GL + LR GPY CAEW GG+P WL I R+ + F Q
Sbjct: 99 NNDVAAFVQEAAAQGLNVILRPGPYACAEWEAGGYPAWLFGQGNIRVRSRDPRFLAASQA 158
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
++ + ++ L + GGPII +Q+ENEYG SY D+ A + A+ + AG
Sbjct: 159 YLDAVAKQVQP--LLNHNGGPIIAVQVENEYG----SY---ADDHAYMADNRAMYVKAGF 209
Query: 234 PWVMCKQTDAPENIIDACNGYYCD---------GYKPNSYNK--------PTLWTENWDG 276
+ +D + + NG D G ++++K P + E W G
Sbjct: 210 DKALLFTSDGADML---ANGTLPDTLAVVNFAPGEAKSAFDKLIAFRPDQPRMVGEYWAG 266
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQ---RGGSFMNYYMYFGGTNFGRTSGGPF------ 327
W+ WG PH + A A F+ R G N YM+ GGT+FG +G F
Sbjct: 267 WFDHWGK--PHAATD--ATQQAEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQNNPSD 322
Query: 328 ----YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSA 372
TSYDYDA +DE G + K+ ++D A + +P + A A
Sbjct: 323 HYAPQTTSYDYDAIVDEAGRPTA-KFALMRDAIARVTGVQPPALPAPIA 370
>gi|313241555|emb|CBY33800.1| unnamed protein product [Oikopleura dioica]
Length = 571
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 103/320 (32%), Positives = 157/320 (49%), Gaps = 25/320 (7%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
++ D +DG ++S IHY R + W + + G + I+ Y+ WN HE R
Sbjct: 8 LTADGDTFKLDGKDFRILSGAIHYFRIPKQSWKHRLQSVVDCGLNTIDVYIPWNLHEKER 67
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G ++F G+ D+V+F + GL + R GPY+C+EW++GG P WL P + R+N
Sbjct: 68 GNFDFGGELDLVEFFTIAAEMGLKVLCRPGPYICSEWDWGGLPSWLLKDPKMHIRSNYCG 127
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
++ + + K++ L+ L GGPII Q+ENEYG+ Y + +++ W A +
Sbjct: 128 YQAAVSSYFSKLLPLLAP--LQHSNGGPIIAFQVENEYGD----YVDKDNEHLPWLADLM 181
Query: 227 LGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSY-----NKPTLWTENWDGWYTTW 281
G + + +D I A N P S NKP L TE W GW+ W
Sbjct: 182 KSHGLFELFFI---SDGGHTIRKA-NMLKLTKSTPISLKSLQPNKPMLVTEFWAGWFDYW 237
Query: 282 GGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG-----PFY---ITSYD 333
G + + +RG S +N+YM+ GGTNFG +G +Y +TSYD
Sbjct: 238 GHGRNLLNNDVFEKTLKEILKRGAS-VNFYMFHGGTNFGFMNGAIELEKGYYTADVTSYD 296
Query: 334 YDAPIDEYGLLSEPKWGHLK 353
YD P+DE G +E KW +K
Sbjct: 297 YDCPVDESGNRTE-KWEIIK 315
>gi|260592848|ref|ZP_05858306.1| beta-galactosidase [Prevotella veroralis F0319]
gi|260535218|gb|EEX17835.1| beta-galactosidase [Prevotella veroralis F0319]
Length = 621
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 113/331 (34%), Positives = 163/331 (49%), Gaps = 36/331 (10%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFK-G 113
I DG + S +HY R W + K G + + TY+FWN HE+ G +++ G
Sbjct: 37 IYDGKPIQIHSGEMHYARVPAPYWRHRMKMMKAMGLNAVATYIFWNHHETSPGVWDWTTG 96
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+++ +F+K G GL + LR GPY CAEW FGG+P WL + RT+N PF + +
Sbjct: 97 THNLRQFIKTAGEEGLMVILRPGPYCCAEWEFGGYPWWLPKAKDLVIRTDNKPFLDSCRV 156
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKD-----YVKWAASM--- 225
++ ++ + + L QGGP+IM+Q ENE+G SY Q KD + ++AA +
Sbjct: 157 YINQLAKQVLD--LQVTQGGPVIMVQAENEFG----SYVAQRKDIPLETHKRYAAQIRQQ 210
Query: 226 ALGLGAGVPWVMCK-----QTDAPENIIDACNGY-YCDGYKP--NSYN---KPTLWTENW 274
L G VP + A E + NG D K N Y+ P + E +
Sbjct: 211 LLDAGFTVPMFTSDGSWLFKGGAIEGALPTANGEGDIDKLKKVVNEYHGGVGPYMVAEFY 270
Query: 275 DGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ 328
GW + W P E + ++ G SF NYYM GGTNFG ++G +
Sbjct: 271 PGWLSHWAEPFPRVSTESVVKQTKKYLDNGISF-NYYMVHGGTNFGFSAGANYSNATNIQ 329
Query: 329 --ITSYDYDAPIDEYGLLSEPKWGHLKDLHA 357
+TSYDYDAPI E G + PK+ L+DL A
Sbjct: 330 PDMTSYDYDAPISEAG-WATPKYNALRDLIA 359
>gi|421767985|ref|ZP_16204697.1| Beta-galactosidase 3 [Lactobacillus rhamnosus LRHMDP2]
gi|421773235|ref|ZP_16209883.1| Beta-galactosidase 3 [Lactobacillus rhamnosus LRHMDP3]
gi|411182327|gb|EKS49478.1| Beta-galactosidase 3 [Lactobacillus rhamnosus LRHMDP3]
gi|411186672|gb|EKS53794.1| Beta-galactosidase 3 [Lactobacillus rhamnosus LRHMDP2]
Length = 656
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 113/358 (31%), Positives = 172/358 (48%), Gaps = 51/358 (14%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
S DH ++DG ++S IHY R P W + K G + +ETYV WN HE G
Sbjct: 68 SIDHE-FMLDGKPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYREG 126
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
+++F G DI +F+K GLY +R PY+CAEW FGGFP WL + RT++ +
Sbjct: 127 EFDFSGILDIERFLKTAEDLGLYAIVRPSPYICAEWEFGGFPAWLL-TKKMRLRTDDPAY 185
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
+ R+ ++ + + + GG +IM+Q+ENEYG SYG+ +DY+ A +
Sbjct: 186 LVAIDRYYTALMPHLVDHQV--THGGNVIMMQVENEYG----SYGED-QDYLAAVAKLMQ 238
Query: 228 GLGAGVPWVMCKQTDAP-------ENIIDA---CNGYYCDG-----------YKPNSYNK 266
G VP +D P ++IDA G + ++ + +
Sbjct: 239 QHGVDVPLFT---SDGPWPATLNAGSMIDAGILATGNFGSAADKNFDRLAAFHQEHGRDW 295
Query: 267 PTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGP 326
P + E WDGW+ WG + R ++ A + +RG +N YM+ GGTNFG +G
Sbjct: 296 PLMCMEFWDGWFNRWGEPIIRRDPDETAEDLRAVIKRGS--VNLYMFHGGTNFGFMNGTS 353
Query: 327 FY-------ITSYDYDAPIDEYGLLSEPKW--------GHLKDLHAAIKLCEPALVAA 369
+TSYDYDAP++E G + PK+ L ++ A L +P + A
Sbjct: 354 ARKDHDLPQVTSYDYDAPLNEQGNPT-PKYFAIQKMIHEELPEVQQAKPLVKPTMAPA 410
Score = 43.1 bits (100), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 52/182 (28%), Positives = 73/182 (40%), Gaps = 47/182 (25%)
Query: 575 SGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGD-IDLSKILWTYQVGLKGE 633
G++ L LL + + NYG+ +E + G + G +DL I KG
Sbjct: 504 EGHHQLDLLVENMSRVNYGSKIE-------AITQFKGIRTGVMVDLHFI--------KG- 547
Query: 634 FQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHH 693
+QQ Y ++ N A T P+T +YK FD D LD GKG VNG +
Sbjct: 548 YQQ-YPLDLNRASQLTFTEGWQPATPAFYKYTFDLTAPQD-TYLDCRGFGKGVMLVNGVN 605
Query: 694 IGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEE 753
+GR+W KG PT + Y VP L A N +++FE
Sbjct: 606 VGRFWE----KG-----------------------PTLSLY-VPAGLLHAGKNDVIVFET 637
Query: 754 TG 755
G
Sbjct: 638 EG 639
>gi|294665218|ref|ZP_06730516.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 10535]
gi|292605006|gb|EFF48359.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 10535]
Length = 613
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 117/377 (31%), Positives = 179/377 (47%), Gaps = 49/377 (12%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
+++ + ++ ++A T P N + DG L+S IH+ R W D +
Sbjct: 9 LVLALAFALPITGAAADTERWP-NFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAYWKDRL 67
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAE 142
K++ G + +ETYVFWN E +GQ++F G ND+ FV+ + GL + LR GPY CAE
Sbjct: 68 QKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVREAAAQGLNIILRPGPYACAE 127
Query: 143 WNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIEN 202
W GG+P WL I R+ + F Q ++ + + ++ L + GGPII +Q+EN
Sbjct: 128 WEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALANQVQP--LLNHNGGPIIAVQVEN 185
Query: 203 EYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCD----- 257
EYG SY D+ A + A+ + AG + +D + + NG D
Sbjct: 186 EYG----SY---ADDHAYMADNRAMYVKAGFDKALLFTSDGADML---ANGTLPDTLAVV 235
Query: 258 ----GYKPNSYNK--------PTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQ--- 302
G ++++K P + E W GW+ WG PH + A A F+
Sbjct: 236 NFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGK--PHAATD--ARQQAEEFEWIL 291
Query: 303 RGGSFMNYYMYFGGTNFGRTSGGPF----------YITSYDYDAPIDEYGLLSEPKWGHL 352
R G + YM+ GGT+FG +G F TSYDYDA +DE G + PK+ +
Sbjct: 292 RQGHSASLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHPT-PKFALM 350
Query: 353 KDLHAAIKLCE-PALVA 368
+D A + + PAL A
Sbjct: 351 RDAIARVTGVQTPALPA 367
>gi|321461520|gb|EFX72551.1| hypothetical protein DAPPUDRAFT_326098 [Daphnia pulex]
Length = 673
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 110/355 (30%), Positives = 168/355 (47%), Gaps = 37/355 (10%)
Query: 53 AIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNF- 111
+++G +IS +HY R P W D + K + GA+ +ETY+ WN HE RG Y+F
Sbjct: 44 GFLLNGKPFHIISGAVHYFRIHPTQWRDRLRKLRAVGANTVETYMPWNLHEPRRGDYDFS 103
Query: 112 KGKND------IVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
+G+ND + FV++ L++ LR GP++C+EW FGG P WL P ++ RT+
Sbjct: 104 EGQNDFSSFLNVTAFVEMAQEEDLFVILRPGPFICSEWEFGGLPSWLLRDPDMKVRTSYP 163
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASM 225
+ + ++ ++ + F GPII Q+ENEYG + +Y+
Sbjct: 164 GYLQVADDYLTQVFSRVV-NFQFQKGDGPIIAFQVENEYGAFGVRDEPRDTEYLIHLRDK 222
Query: 226 ALGLGAGVPWVMCKQTDAPENIID--ACNGYY------------CDGYKPNSYNKPTLWT 271
+ LGA M +D P D A G D +KP +
Sbjct: 223 MIALGAT---EMFFTSDTPTKNADLGAVPGELQTANFQNNADPEFDALDILQPDKPYMVA 279
Query: 272 ENWDGWYTTWG-GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG----- 325
E W GW+ WG G +E+ ++ + R F R S +N+YM+ GGT+FG +G
Sbjct: 280 EFWSGWFDHWGQGYHGGSSLEEFSYTLERIFTRNSS-VNFYMFIGGTSFGFMNGANQLPV 338
Query: 326 -PFY---ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIK 376
PFY I+SYDYDAP+ E G ++ K+ KDL A + A + +K
Sbjct: 339 FPFYAADISSYDYDAPLTEAGDYTD-KYYAAKDLIAQFNRVPNIYMPALPVESVK 392
>gi|383812458|ref|ZP_09967896.1| glycosyl hydrolase family 35 [Prevotella sp. oral taxon 306 str.
F0472]
gi|383355018|gb|EID32564.1| glycosyl hydrolase family 35 [Prevotella sp. oral taxon 306 str.
F0472]
Length = 608
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 111/329 (33%), Positives = 158/329 (48%), Gaps = 36/329 (10%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFK-G 113
I DG + S +HY R W + K G + + TY+FWN HE+ G +++ G
Sbjct: 29 IYDGKPTQIHSGEMHYARVPAPYWRHRMKMMKAMGLNAVATYIFWNHHETSPGVWDWSTG 88
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
+++ +F+K G GL + LR GPY CAEW FGG+P WL + RT+N PF + +
Sbjct: 89 THNLRQFIKTAGEEGLMVILRPGPYCCAEWEFGGYPWWLPKNKDLVIRTDNKPFLDSCRV 148
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKD--------YVKWAASM 225
++ ++ + + L QGGP+IM+Q ENE+G SY Q KD Y +
Sbjct: 149 YINQLAKQVLD--LQVTQGGPVIMVQAENEFG----SYVAQRKDIPLETHKRYAAQIRQL 202
Query: 226 ALGLGAGVPWVMCK-----QTDAPENIIDACNGY-YCDGYKP--NSYN---KPTLWTENW 274
L G VP + A E + NG D K N Y+ P + E +
Sbjct: 203 LLDAGFTVPMFTSDGSWLFKGGAIEGALPTANGEGDIDKLKKVVNEYHGGVGPYMVAEFY 262
Query: 275 DGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ 328
GW + W P E + ++ G SF NYYM GGTNFG ++G +
Sbjct: 263 PGWLSHWAEPFPRVSTESVVKQTKKYLDNGISF-NYYMVHGGTNFGFSAGANYSNATNIQ 321
Query: 329 --ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+TSYDYDAPI E G + PK+ L+DL
Sbjct: 322 PDMTSYDYDAPISEAG-WATPKYNALRDL 349
>gi|260804659|ref|XP_002597205.1| hypothetical protein BRAFLDRAFT_203307 [Branchiostoma floridae]
gi|229282468|gb|EEN53217.1| hypothetical protein BRAFLDRAFT_203307 [Branchiostoma floridae]
Length = 608
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 112/329 (34%), Positives = 166/329 (50%), Gaps = 34/329 (10%)
Query: 50 DHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQY 109
D IDG L+S +HY R PE W D + K K G + +ETYV WN HE + Y
Sbjct: 26 DGANFTIDGKPVRLLSGAMHYFRVVPEYWRDRMLKMKAAGLNTLETYVPWNLHEPEKYTY 85
Query: 110 NFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKE 169
NF+G D+ +++ + GL++ LR GPY+CAEW FGG P WL + RT F +
Sbjct: 86 NFEGILDLGRYLDIAHEVGLWVILRPGPYICAEWEFGGIPGWLAYVKE-HVRTTRPMFID 144
Query: 170 EMQR-FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESS--YGQQGKDYVKWAASMA 226
++ F + + +++ + GGPII +QIENEYG +S Y ++ K ++ +
Sbjct: 145 PVEVWFGRLLAEVVPRQYT---NGGPIIAVQIENEYGGFSNSTEYMERLKKILESRGIVE 201
Query: 227 L-----GLGA----GVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGW 277
L G GA G+P V+ K + N D K ++P + E W GW
Sbjct: 202 LLFTSDGKGALISGGIPGVL-KTVNFQNNASDKLQ-----KLKEIQPDRPMMVMEYWTGW 255
Query: 278 YTTWGGRLPHRPVEDLAFAVARFF-QRGGSFMNYYMYFGGTNFGRTSGGPFY-------- 328
+ WG +E +F + F+ G+ +N+YM+ GGTNFG +G
Sbjct: 256 FDHWGEDHHLYRLESESFVHSVFYILDAGASVNFYMFHGGTNFGFMNGANTRYKSGGRTL 315
Query: 329 --ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
ITSYDYDAPI E G L+ PK+ ++++
Sbjct: 316 PTITSYDYDAPISETGDLT-PKYFKIREI 343
>gi|346725882|ref|YP_004852551.1| beta-galactosidase [Xanthomonas axonopodis pv. citrumelo F1]
gi|346650629|gb|AEO43253.1| beta-galactosidase [Xanthomonas axonopodis pv. citrumelo F1]
Length = 611
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 112/345 (32%), Positives = 164/345 (47%), Gaps = 47/345 (13%)
Query: 58 GNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDI 117
G L+S IH+ R W D + K++ G + +ETYVFWN E +GQ++F G ND+
Sbjct: 41 GKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDV 100
Query: 118 VKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKK 177
FV+ + GL + LR GPY CAEW GG+P WL I R+ + F Q ++
Sbjct: 101 AAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQSYLDA 160
Query: 178 IVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVM 237
+ + + L + GGPII +Q+ENEYG SY D+ A + A+ + AG +
Sbjct: 161 LAKQV--QPLLNHNGGPIIAVQVENEYG----SY---ADDHAYMADNRAMYVKAGFDKAL 211
Query: 238 CKQTDAPENIIDACNGYYCD---------GYKPNSYNK--------PTLWTENWDGWYTT 280
+D + + NG D G ++++K P + E W GW+
Sbjct: 212 LFTSDGADML---ANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDH 268
Query: 281 WGGRLPHRPVEDLAFAVARFFQ---RGGSFMNYYMYFGGTNFGRTSGGPF---------- 327
WG PH + A A F+ R G N YM+ GGT+FG +G F
Sbjct: 269 WGK--PHAATD--ARQQAEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQNNPSDHYAP 324
Query: 328 YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSA 372
TSYDYDA +DE G + PK+ ++D A + +P + A A
Sbjct: 325 QTTSYDYDAILDEAGHPT-PKFALMRDAIARVTGVQPPALPAPIA 368
>gi|199599299|ref|ZP_03212698.1| glycosyl hydrolase, family 35 [Lactobacillus rhamnosus HN001]
gi|199589801|gb|EDY97908.1| glycosyl hydrolase, family 35 [Lactobacillus rhamnosus HN001]
Length = 593
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 113/358 (31%), Positives = 172/358 (48%), Gaps = 51/358 (14%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
S DH ++DG ++S IHY R P W + K G + +ETYV WN HE G
Sbjct: 5 SIDHE-FMLDGKPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYREG 63
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
+++F G DI +F+K GLY +R PY+CAEW FGGFP WL + RT++ +
Sbjct: 64 EFDFSGILDIERFLKTAEELGLYAIVRPSPYICAEWEFGGFPAWLL-TKKMRLRTDDPTY 122
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
+ R+ ++ + + + GG +IM+Q+ENEYG SYG+ +DY+ A +
Sbjct: 123 LAAIDRYYTALMPHLVDHQVT--HGGNVIMMQVENEYG----SYGED-QDYLAVVAKLMQ 175
Query: 228 GLGAGVPWVMCKQTDAP-------ENIIDA---CNGYYCDG-----------YKPNSYNK 266
G VP +D P ++IDA G + ++ + +
Sbjct: 176 QHGVDVPLFT---SDGPWPATLNAGSMIDAGILATGNFGSAADKNFDRLAAFHQEHGRDW 232
Query: 267 PTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGP 326
P + E WDGW+ WG + R ++ A + +RG +N YM+ GGTNFG +G
Sbjct: 233 PLMCMEFWDGWFNRWGEPIIRRDPDETAEDLRAVIKRGS--VNLYMFHGGTNFGFMNGTS 290
Query: 327 FY-------ITSYDYDAPIDEYGLLSEPKW--------GHLKDLHAAIKLCEPALVAA 369
+TSYDYDAP++E G + PK+ L ++ A L +P + A
Sbjct: 291 ARKDHDLPQVTSYDYDAPLNEQGNPT-PKYFAIQKMIHEELPEVQQAKPLVKPTMAPA 347
>gi|402895880|ref|XP_003911040.1| PREDICTED: beta-galactosidase-1-like protein 3 [Papio anubis]
Length = 653
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 102/323 (31%), Positives = 160/323 (49%), Gaps = 31/323 (9%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G R ++ IHY R W D + K + G + + TYV WN HE RG+++F G
Sbjct: 82 LEGRRFLICGGSIHYFRVPRAYWRDRLLKLRACGFNTVTTYVPWNLHEPERGKFDFSGNL 141
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ FV + GL++ LR GPY+C+E + GG P WL P + RT N F E ++++
Sbjct: 142 DLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPRLLLRTTNKGFTEAVEKYF 201
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
++ R L QGGP+I +Q+ENEYG+ + K Y+ + L G
Sbjct: 202 DHLIP--RVIPLQYRQGGPVIAVQVENEYGSF-----NKDKTYMPYLHKALLRRGI---V 251
Query: 236 VMCKQTDAPENIIDA-----CNGYYCDGYKPNSYN--------KPTLWTENWDGWYTTWG 282
+ +D +N++ + N++N KP L E W GW+ WG
Sbjct: 252 ELLLTSDGEKNVLSGHTKGVLAAINLQKVQRNTFNQLHKVQRDKPLLVMEYWVGWFDRWG 311
Query: 283 GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDA 336
+ + +++ AV+ F + SF N YM+ GGTNFG +G + +TSYDYDA
Sbjct: 312 DKHHVKDAKEVERAVSEFIKYEISF-NVYMFHGGTNFGFMNGATNFGKHTGIVTSYDYDA 370
Query: 337 PIDEYGLLSEPKWGHLKDLHAAI 359
+ E G +E K+ L+ L ++
Sbjct: 371 VLTEAGDYTE-KYFKLQKLLESV 392
>gi|295689222|ref|YP_003592915.1| beta-galactosidase [Caulobacter segnis ATCC 21756]
gi|295431125|gb|ADG10297.1| Beta-galactosidase [Caulobacter segnis ATCC 21756]
Length = 617
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 111/369 (30%), Positives = 174/369 (47%), Gaps = 55/369 (14%)
Query: 14 LALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRA 73
+ L V M+ + + + S A + + DG +ISA +HY R
Sbjct: 1 MKLFVRSMLYATLTMSALAILPSDARSAAPAHRFEVSGAGFLKDGAPHQVISAEMHYVRI 60
Query: 74 TPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQL 133
W D + K+K G + I TY FWN HE G Y+F G+ND+ F++ + GL + L
Sbjct: 61 PRAYWRDRLQKAKTMGLNTITTYAFWNVHEPRPGVYDFTGQNDLAAFIRAAQAEGLDVIL 120
Query: 134 RIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGG 193
R GPYVC+EW GG+P WL + R+ + ++R++ ++ ++ +L + GG
Sbjct: 121 RPGPYVCSEWELGGYPSWLLKDRNVLLRSTEPQYAAAVERWMARLGREVKPLLLKN--GG 178
Query: 194 PIIMLQIENEYG----------NMESSYGQQG-KDYVKWAASMALGLGAG----VPWVMC 238
PI+ +Q+ENEYG +E++Y + G D V + ++ A L G +P ++
Sbjct: 179 PIVAIQLENEYGAFGDDKAYLEGLEATYRRAGLADGVLFTSNQASDLAKGSLPHLPSMVN 238
Query: 239 KQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWG-------GRLPHRPVE 291
+ E + + DG + + E W GW+ WG GR + E
Sbjct: 239 FGSGGAEKSVAQLETFRPDGLR--------MVGEYWAGWFDKWGEEHHETDGR---KEAE 287
Query: 292 DLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY--------ITSYDYDAPIDE--- 340
+L F QRG S ++ YM+ GGT+FG +G + TSYDYDAP+DE
Sbjct: 288 ELRF----MLQRGYS-VSLYMFHGGTSFGWMNGADSHTGKDYHPDTTSYDYDAPLDEAGA 342
Query: 341 ----YGLLS 345
YGLL+
Sbjct: 343 PRYKYGLLA 351
>gi|345880280|ref|ZP_08831835.1| hypothetical protein HMPREF9431_00499 [Prevotella oulorum F0390]
gi|343923634|gb|EGV34320.1| hypothetical protein HMPREF9431_00499 [Prevotella oulorum F0390]
Length = 621
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 106/320 (33%), Positives = 160/320 (50%), Gaps = 35/320 (10%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFK-G 113
+ DG + S +HY R W + K G + + +YVFWN HE+ G ++++ G
Sbjct: 37 LYDGKPTQIHSGELHYARVPAPYWRHRLQMMKAMGLNAVTSYVFWNHHETSPGVWDWQTG 96
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
++I F+K+ G GL + LR GPY CAEW FGG+P WL G+ RT+N PF + +
Sbjct: 97 NHNIRNFIKIAGEEGLMVILRPGPYCCAEWEFGGYPWWLPKAKGLVIRTDNKPFLDSCRV 156
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKD-----YVKWAASM--- 225
++ ++ + +R+ + +GGP++M+Q ENE+G SY Q KD + K+AA +
Sbjct: 157 YINQLANQVRDLQIT--KGGPVVMVQAENEFG----SYVAQRKDIPLEVHKKYAAQIRQQ 210
Query: 226 ALGLGAGVPWVMCK-----QTDAPENIIDACNGY-YCDGYKP--NSYN---KPTLWTENW 274
L G +P + + E + NG + K N Y+ P + E +
Sbjct: 211 LLDAGFDIPMFTSDGSWLFKGGSIEGALPTANGEGNIEKLKQVVNEYHGGVGPYMVAEFY 270
Query: 275 DGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ 328
GW + W P E + ++ G SF NYYM GGTNFG T+G +
Sbjct: 271 PGWLSHWAEPFPRVSTESVVKQTKKYLDNGVSF-NYYMVHGGTNFGFTTGANYSNATNLQ 329
Query: 329 --ITSYDYDAPIDEYGLLSE 346
+TSYDYDAPI E G +E
Sbjct: 330 PDMTSYDYDAPISEAGWATE 349
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 71/311 (22%), Positives = 120/311 (38%), Gaps = 75/311 (24%)
Query: 448 IKTVEFSLPLSPNISVPQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVT 507
+K+V + +P P +P ++ + L+ T V +G+ + N T E LN
Sbjct: 359 MKSVAYKVPAVP-ARIPVIAIPKISLNKTVDVMTMV---VGMKAVENDTPM-TFEDLNQG 413
Query: 508 KDYSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKV 567
Y Y H Q + + I + D V++NG G + V
Sbjct: 414 MGYVLYRRHFNQ--------------PISGMMRIKGLADYAVVYVNGTKVGEL--SRVTD 457
Query: 568 VQPVEFQSGYNDLI-LLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTY 626
V +E +N ++ +L + +G NYGA + + F+G K + +I + W
Sbjct: 458 VDSMEVNVPFNGVLDILVENMGRINYGARIVES---FKGITKPVTIEGNEITGN---W-- 509
Query: 627 QVGLKGEFQQIYSIEENEA-EWTDLT---RDGIPSTFTWYKTYFDAPDGIDPVALDLGSM 682
Q+YS+ + + T L + G+P + T D + LD+
Sbjct: 510 ---------QMYSLPMDRMPDMTKLAAGYKAGMPVLYGGSFTL----DKVGDTFLDMAKW 556
Query: 683 GKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQ 742
GKG +VNG ++GRYW V P QT Y +P +L+
Sbjct: 557 GKGIVFVNGINLGRYWKV---------------------------GPQQTLY-LPGCFLK 588
Query: 743 ASNNLLVIFEE 753
N +V+FE+
Sbjct: 589 KGKNDIVVFEQ 599
>gi|189217683|ref|NP_001121284.1| galactosidase, beta 1-like precursor [Xenopus laevis]
gi|115527881|gb|AAI24928.1| LOC100158367 protein [Xenopus laevis]
Length = 645
Score = 159 bits (402), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 112/349 (32%), Positives = 164/349 (46%), Gaps = 38/349 (10%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
+ + +M++ VS++S+ TF + ++H DG IS IHY R W D
Sbjct: 8 LRIFLMVVVYGSVSTTSSRTF----EIDFEHNCFRKDGQPFHYISGSIHYSRIPQFYWKD 63
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
+ K K G D I TYV WN HE+ G YNF G +DI F+KL GL + LR GPY+C
Sbjct: 64 RLLKMKMAGLDAIYTYVPWNFHETKPGVYNFSGDHDIESFLKLANEIGLLVILRAGPYIC 123
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEW+ GG P WL I R+++ + + + ++ + M+ L GGPII +Q+
Sbjct: 124 AEWDMGGLPAWLLAKESIVLRSSDPDYLQAVDNWMGVFLPKMKP--LLYHNGGPIISVQV 181
Query: 201 ENEYG-------NMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNG 253
ENEYG N Q + ++ + G+ + V C +D G
Sbjct: 182 ENEYGSYFTCDYNYLRHLLQLFRHHLGDEVILFTTDGSALQLVRCGTIQGLYTTVDFGPG 241
Query: 254 ----------YYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPV--EDLAFAVARFF 301
+C+ P + +E + GW WG PH V E + ++
Sbjct: 242 SNITETFLVQRHCEP------KGPLINSEFYTGWLDHWGE--PHSVVATERVTKSLDEIL 293
Query: 302 QRGGSFMNYYMYFGGTNFGRTSGG--PF--YITSYDYDAPIDEYGLLSE 346
G S +N YM+ GGTNFG +G P+ TSYDYDAP+ E G L++
Sbjct: 294 AIGAS-VNMYMFIGGTNFGYWNGANTPYAPQPTSYDYDAPLSEAGDLTD 341
Score = 39.7 bits (91), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 47/180 (26%), Positives = 74/180 (41%), Gaps = 25/180 (13%)
Query: 532 TNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQN 591
+N T + +RD V +NG G V+ + V +G +L LL +++G N
Sbjct: 423 SNPTTLTTLFNGVRDRAYVMVNGVPQG-VLERDKQTAINVTGAAG-AELDLLVESMGRVN 480
Query: 592 YGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLT 651
+G + D G V L NG+ ++ ++ +G + +I
Sbjct: 481 FGRY-NNDFKGLLTNVTL----NGETLVNWTMYPLDIGSAINSGLLSTIHSPYT------ 529
Query: 652 RDGIPSTF---TWYKTYFDAPDGIDPVALD----LGSMGKGQAWVNGHHIGRYWTVVAPK 704
STF T+YK P GI + D KGQ W+NG ++GRYW V P+
Sbjct: 530 -----STFSAPTFYKGSLIIPTGIPQLPQDTFIQFPGWTKGQIWINGFNLGRYWPVRGPQ 584
>gi|328721397|ref|XP_003247292.1| PREDICTED: beta-galactosidase-like [Acyrthosiphon pisum]
Length = 628
Score = 159 bits (402), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 114/360 (31%), Positives = 167/360 (46%), Gaps = 37/360 (10%)
Query: 45 FNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHES 104
F V Y+ + DG +S +HY R W D I K K G + I TYV W+ HE
Sbjct: 15 FTVDYERNEFLKDGQVFRYVSGSLHYFRVPKPYWKDRIQKMKAAGLNAISTYVEWSLHEP 74
Query: 105 IRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRD-IPGIEFRTN 163
G+YNF D+ F++LV G+YL LR GPY+CAE +FGGFP WL + +P RTN
Sbjct: 75 YPGEYNFDDIADLEYFLQLVKDEGMYLLLRPGPYICAERDFGGFPFWLLNVVPKKRLRTN 134
Query: 164 NAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQG-------K 216
+ +K + ++ V + + + GG IIM+Q+ENEYG+ + + K
Sbjct: 135 DPSYKHYVTKWFN--VLMPKIDRFLYGNGGNIIMVQVENEYGSYNACDQEYMLWLRDLYK 192
Query: 217 DYVKWAASMALGLGAGVPWVMC-------KQTDAPENIIDACNGYYCDGYKPNSYNK-PT 268
YV + A + G G + C D ++ D C Y + + P
Sbjct: 193 RYVGYKALLYTTDGCGYSYFTCGAIPDVYATVDFGASVKDVSQ---CFKYMRTTQKRGPL 249
Query: 269 LWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY 328
+ +E + GW + W P ++ + S +N+YM+ GGTNFG TSG Y
Sbjct: 250 VNSEYYAGWLSHWREPSPVISSYEVVETMKDMLALNAS-INFYMFHGGTNFGFTSGANKY 308
Query: 329 -----------ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLC---EPALVAADSAQY 374
+TSYDY++P+DE G +E K+ +K L E + VAA Y
Sbjct: 309 ESLKNPDYLPQLTSYDYNSPLDEAGDPTE-KYFKIKKLLEGTNFIVSNEISPVAAPKGDY 367
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 47/172 (27%), Positives = 70/172 (40%), Gaps = 27/172 (15%)
Query: 539 VTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEK 598
+TI ++RD +F++ V + + S L +L + G N+G+FLE
Sbjct: 426 LTISTIRDQATIFLDQAQIKVVPRKYENTPISLNINSTVQKLSILIENQGRINFGSFLE- 484
Query: 599 DGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEW---TDLTRDGI 655
D G V L G W ++ + NE W + +D +
Sbjct: 485 DRKGIFEPVLLGRHVLGP-------W-----------KMIAYPLNETSWFSTIEPQKDAV 526
Query: 656 PSTFTWYKTYFDAPDGIDP---VALDLGSMGKGQAWVNGHHIGRYWTVVAPK 704
F YKT F PDG+ LD+ KG A+VNG +IGRYW P+
Sbjct: 527 LPAF--YKTQFKLPDGLTKPLDTYLDVTGWKKGVAFVNGINIGRYWPSAGPQ 576
>gi|433461907|ref|ZP_20419504.1| beta-galactosidase [Halobacillus sp. BAB-2008]
gi|432189486|gb|ELK46587.1| beta-galactosidase [Halobacillus sp. BAB-2008]
Length = 579
Score = 159 bits (402), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 101/311 (32%), Positives = 149/311 (47%), Gaps = 25/311 (8%)
Query: 63 LISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVK 122
++S IHY R PE W D + K K G + +ETYV WN HE RG++ F G DI F++
Sbjct: 18 ILSGAIHYFRTVPEHWEDRLEKLKALGLNTVETYVPWNLHEPRRGEFEFSGLADIEGFIQ 77
Query: 123 LVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLM 182
GLY+ +R PY+CAEW GG P WL + R+++ + ++ + K+++
Sbjct: 78 TAADLGLYVIVRPAPYICAEWEMGGLPSWLLKDKDVVMRSSDPVYLSYVESYYKELLPKF 137
Query: 183 REEMLFSWQGGPIIMLQIENEYGN----------MESSYGQQGKDYVKWAASMALGLGAG 232
+ GGPII +QIENEYG ++ Y Q G D + + + G
Sbjct: 138 VPHLY--QNGGPIIAMQIENEYGAYGNDQKYLTFLKKQYEQHGLDTFLFTSDGPDFIEQG 195
Query: 233 VPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVED 292
+ + + A D +K S P + E W GW+ W G R D
Sbjct: 196 SLPDVTTTLNFGSKVEQAFER--LDAFKTGS---PKMVAEFWIGWFDYWTGEHHTRDAGD 250
Query: 293 LAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAPIDEYGLLSE 346
A +R S +N+YM+ GGTNFG +G Y ITSYDYD+ + E G ++E
Sbjct: 251 AAAVFRELMERKAS-VNFYMFHGGTNFGFMNGANHYDVYYPTITSYDYDSLLTESGAITE 309
Query: 347 PKWGHLKDLHA 357
K+ +K + A
Sbjct: 310 -KYNAVKSILA 319
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 41/177 (23%), Positives = 77/177 (43%), Gaps = 41/177 (23%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLE 597
++ I+++ D +++NG ++ + + + F N L +L + +G NYG LE
Sbjct: 390 SMGIEAVHDRAFIYVNGTYQKTIYINDEQKKTTLVFPEKINTLEILVENMGRANYGEHLE 449
Query: 598 KDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQ-QIYSIEENEAEWTDLTRDGIP 656
D G L+K +W +G + F+ ++Y++E D +P
Sbjct: 450 -DRKG----------------LTKNIW---LGEQYFFEWEMYAVE----------LDILP 479
Query: 657 STFT---------WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPK 704
++ +++ FDAP G +D KG +VNG ++GRYW P+
Sbjct: 480 ESYAKQEDSRYPKFFRGTFDAP-GRHDTYIDSEGFTKGNLFVNGFNLGRYWNTAGPQ 535
>gi|426249767|ref|XP_004018620.1| PREDICTED: beta-galactosidase [Ovis aries]
Length = 634
Score = 159 bits (401), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 104/331 (31%), Positives = 160/331 (48%), Gaps = 25/331 (7%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
+ F + Y + DG IS IHY R W D + K K G + I+TYV WN H
Sbjct: 17 RTFQIDYRRNRFLKDGQPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQTYVAWNFH 76
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E G+YNF G +D+ F++L GL + LR GPY+CAEW+ GG P WL + I R+
Sbjct: 77 ELQPGRYNFSGDHDVEHFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKKSIVLRS 136
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESS-------YGQQG 215
++ + + +++ ++ MR L GGPII +Q+ENEYG+ S ++
Sbjct: 137 SDPDYLAAVDKWLGVLLPKMRP--LLYKNGGPIITVQVENEYGSYYSCDYDYLRFLQKRF 194
Query: 216 KDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDG-------YKPNSYNKPT 268
+D++ + G ++ C +D G ++P P
Sbjct: 195 QDHLGEDVLLFTTDGVNEEFLQCGALQGLYATVDFSTGSNLTAAFMLQRKFEPRG---PL 251
Query: 269 LWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--P 326
+ +E + GW WG R + +AF + G + +N YM+ GG+NF +G P
Sbjct: 252 INSEFYTGWLDHWGQRHSTVSSKAVAFTLHDMLALGAN-VNMYMFIGGSNFAYWNGANTP 310
Query: 327 F--YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+ TSYDYDAP+ E G L+E K+ L+D+
Sbjct: 311 YQPQPTSYDYDAPLSEAGDLTE-KYFALRDI 340
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 63/136 (46%), Gaps = 21/136 (15%)
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQV-GLKGE---FQ 635
L LL + +G NYG+++ F+G V N +D SKIL +++ L E
Sbjct: 460 LDLLVENMGRVNYGSYIND----FKGLVS-----NLTLD-SKILTNWEIFPLDMENAVLS 509
Query: 636 QIYSIEENEAEWTDLTRDGIPSTF---TWYKTYFDAPDGIDPVA----LDLGSMGKGQAW 688
+ + ++ + + R P T+ T+Y F P GI + L KGQ W
Sbjct: 510 HLGTGGGSDRRYHNKARAHSPPTYALPTFYVGNFTIPSGISDLPQDTFLQFPGWTKGQVW 569
Query: 689 VNGHHIGRYWTVVAPK 704
+NG ++GRYW V P+
Sbjct: 570 INGFNLGRYWPVQGPQ 585
>gi|301767332|ref|XP_002919083.1| PREDICTED: beta-galactosidase-like [Ailuropoda melanoleuca]
Length = 668
Score = 159 bits (401), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 110/335 (32%), Positives = 161/335 (48%), Gaps = 33/335 (9%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
+ F + Y H + DG IS IHY R W D + K K G + I++YV WN H
Sbjct: 31 RTFKIDYSHNRFLKDGRPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQSYVPWNFH 90
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E GQY F G++D+ F+KL GL + LR GPY+CAEW+ GG P WL I R+
Sbjct: 91 EPQPGQYQFSGEHDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRS 150
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWA 222
++ + + +++ ++ M+ L GGPII +Q+ENEYG SY D++++
Sbjct: 151 SDPDYLAAVDKWLGVLLPKMKP--LLYQNGGPIITVQVENEYG----SYFSCDYDHLRFL 204
Query: 223 ASM-ALGLGAGVPWVMCKQTDAPENIIDACNG----YYCDGYKPNSY------------- 264
+ LG V+ TD + C Y + P +
Sbjct: 205 QKLFHYHLGND---VLLFTTDGAHEMFLKCGALQGLYATVDFGPGANITAAFEIQRKSEP 261
Query: 265 NKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG 324
P + +E + GW WG E +A A+ RG + +N YM+ GGTNF +G
Sbjct: 262 RGPLVNSEFYTGWLDHWGQPHSTAKTEVVASALHEILSRGAN-VNLYMFIGGTNFAYWNG 320
Query: 325 G--PFYI--TSYDYDAPIDEYGLLSEPKWGHLKDL 355
P+ TSYDYDAP+ E G L+E K+ L+D+
Sbjct: 321 ANMPYQAQPTSYDYDAPLSEAGDLTE-KYFALRDV 354
>gi|281352249|gb|EFB27833.1| hypothetical protein PANDA_007660 [Ailuropoda melanoleuca]
Length = 626
Score = 159 bits (401), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 110/335 (32%), Positives = 161/335 (48%), Gaps = 33/335 (9%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
+ F + Y H + DG IS IHY R W D + K K G + I++YV WN H
Sbjct: 4 RTFKIDYSHNRFLKDGRPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQSYVPWNFH 63
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E GQY F G++D+ F+KL GL + LR GPY+CAEW+ GG P WL I R+
Sbjct: 64 EPQPGQYQFSGEHDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRS 123
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWA 222
++ + + +++ ++ M+ L GGPII +Q+ENEYG SY D++++
Sbjct: 124 SDPDYLAAVDKWLGVLLPKMKP--LLYQNGGPIITVQVENEYG----SYFSCDYDHLRFL 177
Query: 223 ASM-ALGLGAGVPWVMCKQTDAPENIIDACNG----YYCDGYKPNSY------------- 264
+ LG V+ TD + C Y + P +
Sbjct: 178 QKLFHYHLGND---VLLFTTDGAHEMFLKCGALQGLYATVDFGPGANITAAFEIQRKSEP 234
Query: 265 NKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG 324
P + +E + GW WG E +A A+ RG + +N YM+ GGTNF +G
Sbjct: 235 RGPLVNSEFYTGWLDHWGQPHSTAKTEVVASALHEILSRGAN-VNLYMFIGGTNFAYWNG 293
Query: 325 G--PFYI--TSYDYDAPIDEYGLLSEPKWGHLKDL 355
P+ TSYDYDAP+ E G L+E K+ L+D+
Sbjct: 294 ANMPYQAQPTSYDYDAPLSEAGDLTE-KYFALRDV 327
>gi|427392896|ref|ZP_18886799.1| hypothetical protein HMPREF9698_00605 [Alloiococcus otitis ATCC
51267]
gi|425730982|gb|EKU93810.1| hypothetical protein HMPREF9698_00605 [Alloiococcus otitis ATCC
51267]
Length = 597
Score = 159 bits (401), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 126/409 (30%), Positives = 187/409 (45%), Gaps = 54/409 (13%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG +S IHY R W + K G + +ETYV WN HE G ++F G
Sbjct: 12 LDGEPFQFLSGAIHYFRIPRADWHHSLYNLKALGFNTVETYVPWNVHEPEPGHFDFSGNL 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ F+K GLY+ LR PY+CAEW +GG P W+ + + R+++ F E + +F
Sbjct: 72 DVKAFIKEAEELGLYVILRPSPYICAEWEYGGLPGWIIN-EDLHPRSSDPAFLELVDKFF 130
Query: 176 KKIVDLMRE--EMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ L +E ++ F+ GGPI+M+QIENEYG SYG+ KDY+K GA V
Sbjct: 131 AR---LFKEVGDLQFT-HGGPILMMQIENEYG----SYGED-KDYLKGVYDSMKAHGADV 181
Query: 234 P-------WVMCKQ----TDAPENIIDACN---------GYYCDGYKPNSYNKPTLWTEN 273
P W+ + TD E+I+ N G D + P + E
Sbjct: 182 PLCTSDGAWLATLRAGTLTDIDEDILITGNFGSKAKENFGNLKDFHDKIGKEWPLMVMEF 241
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG-------RTSGGP 326
W GW+ WG + R ++L A+ Q G +N YM+ GGTNFG R +
Sbjct: 242 WCGWFNRWGEPIVTRETDELVEALREAVQLGS--VNLYMFQGGTNFGFMNGCSARGTHDL 299
Query: 327 FYITSYDYDAPIDEYGLLSEPKWG---HLKDLHAAIKLCEPALVAADSAQYIKLGQNQEA 383
ITSYDY AP+DE G +E + +K+ I EP + + + + ++L EA
Sbjct: 300 HQITSYDYGAPLDEQGNPTEKYYAIQKMIKEEFPDIDQAEPLVKESTAQENVQL----EA 355
Query: 384 HVYRANRYGSQSNCSAFLANIDEHTAASVTFLGQSYTLPPWSVSILPDC 432
V + ++ +D S+ LGQ Y + + D
Sbjct: 356 KVNLVDSLDQVAD------RVDSLYTRSMDELGQHYGYILYQTDFVKDV 398
Score = 47.4 bits (111), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 76/323 (23%), Positives = 127/323 (39%), Gaps = 85/323 (26%)
Query: 466 QSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQG---ILEHLNVTKDYSDYLW------- 515
Q MI+ + ++ VKE ++ N ++ +++ L+ D D L+
Sbjct: 325 QKMIKEEFPDIDQAEPLVKEST---AQENVQLEAKVNLVDSLDQVADRVDSLYTRSMDEL 381
Query: 516 --HITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSV----IGHWVKVVQ 569
H I D F K + + + RD VF+N Q + IG +
Sbjct: 382 GQHYGYILYQTD---FVKDVDEEERLRVIDGRDRAHVFLNDQHLATQYQEEIGEDI-TTG 437
Query: 570 PVEFQSGYNDLILLSQTVGLQNYGAFLEKDG--AGFRGQVK-----LTGFKNGDIDLSKI 622
P+E N L +L + +G NYG L D G R V LT ++ ID ++
Sbjct: 438 PLEES---NKLDVLVENMGRVNYGHKLLADTQEKGIRQGVTSDLHFLTNWRQYLIDFDRV 494
Query: 623 LWTYQVGLKGEFQQI-YSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGS 681
QI YS+E++ ++G+PS + + T+ D ++ +DL
Sbjct: 495 ------------DQIDYSLEKD-------FKEGLPSFYKFNVTF----DDLEDTYIDLSD 531
Query: 682 MGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWL 741
GKG VNGH++GR+W + PT + Y +P+++L
Sbjct: 532 FGKGIVLVNGHNLGRFWDL---------------------------GPTLSLY-LPKAFL 563
Query: 742 QASNNLLVIFEETGGNPFEISVK 764
+ N + IFE G +S K
Sbjct: 564 KEGVNEVTIFETEGKYAPNLSFK 586
>gi|195342884|ref|XP_002038028.1| GM17976 [Drosophila sechellia]
gi|194132878|gb|EDW54446.1| GM17976 [Drosophila sechellia]
Length = 672
Score = 159 bits (401), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 115/379 (30%), Positives = 182/379 (48%), Gaps = 54/379 (14%)
Query: 1 MHSKKNNRALLQ----CLALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIII 56
M S + NR L CL ++V M + + + + + A + F + ++ ++
Sbjct: 1 MTSCRRNRKLTMAVSGCLIIAV---MALTVGLCVGLGGDTDAPEEQQRFTIDHEANTFLL 57
Query: 57 DGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKND 116
DG +S HY RA PE W + + G + ++TYV W+ H G+YN++G D
Sbjct: 58 DGQPFRYVSGSFHYFRAVPESWRSRLRTMRASGLNALDTYVEWSLHNPHDGEYNWEGIAD 117
Query: 117 IVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWL-RDIPGIEFRTNNAPFKEEMQRFV 175
+VKF+++ Y+ LR GPY+CAE + GG P WL P I+ RTN+ + E+ ++
Sbjct: 118 LVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFTKYPSIKMRTNDPNYISEVGKWY 177
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKW--------AASMAL 227
++ + R + LF GG IIM+Q+ENEYG+ + DY+ W + AL
Sbjct: 178 AEL--MPRLQHLFVGNGGKIIMVQVENEYGDYACDH-----DYLNWLRDETEKYVSGKAL 230
Query: 228 GLGAGVP--WVMCKQTDAPENIIDACNGYYCDGYKPNSYNK------------PTLWTEN 273
+P + C + EN+ A + D + N +K P + +E
Sbjct: 231 LFTVDIPNEKMSCGKI---ENVF-ATTDFGID--RINEIDKIWAMLRALQPTGPLVNSEF 284
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY----- 328
+ GW T W + R +++A A+ S +N YM+FGGTNFG T+G +
Sbjct: 285 YPGWLTHWQEQNQRRDGQEVANALRTILSYNAS-VNLYMFFGGTNFGFTAGANYNLDGGI 343
Query: 329 -----ITSYDYDAPIDEYG 342
ITSYDYDA +DE G
Sbjct: 344 GYAADITSYDYDAVMDEAG 362
Score = 43.5 bits (101), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 58/239 (24%), Positives = 101/239 (42%), Gaps = 34/239 (14%)
Query: 473 LSSTSKSWMTVKEPIGVWSENNFTVQGILEHLNVTKDYSDYLWHITQIYVSDDDISFWKT 532
LS+ ++ ++ +P+ F E L++ YS + + T++ D D + K
Sbjct: 406 LSTEGRAALSKGDPVEAIKPKTF------EELDL---YSGLVLYETELPSMDLDPALLK- 455
Query: 533 NEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNY 592
ID + D VF++ +L G++ P+ G + L LL + G N+
Sbjct: 456 --------IDQINDRAHVFVDQELVGTLSREAQIYSLPLSKGWG-STLQLLVENQGRVNF 506
Query: 593 GAFLEKDGAGFRGQVKLTGFKNGDIDLSKILW-TYQVGLKGEFQQIYSIEENEAEWTD-- 649
++ D G G+V L G + L W + L+ +++ E + + D
Sbjct: 507 --YISNDTKGIFGEVSLQLHNGGYLPLEN--WRSTAYPLEQSAVELWRREHTDQKALDPL 562
Query: 650 LTRDGI----PSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPK 704
L R I P +T T + D L++ GKG A+VNG ++GRYW V P+
Sbjct: 563 LARQRILRNGPILYTGSLTVAEVGD----TYLNMAGWGKGVAYVNGFNLGRYWPVAGPQ 617
>gi|345487997|ref|XP_001602984.2| PREDICTED: beta-galactosidase-like [Nasonia vitripennis]
Length = 638
Score = 159 bits (401), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 107/358 (29%), Positives = 177/358 (49%), Gaps = 36/358 (10%)
Query: 24 MMMMIHLSCVSSSSAS--TFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDL 81
++ + +SC S++ T F + +++ ++DG +S HY R + W D
Sbjct: 7 LITTLVISCAVSATKDQVTNRTSFAIDFENNQFLLDGKPFRYVSGSFHYFRTPKQYWRDR 66
Query: 82 IAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCA 141
+ K + G + + TYV W+ H+ ++ + G D+VKF++L L++ LR GPY+CA
Sbjct: 67 LRKMRAAGLNALSTYVEWSLHQPEPNKWVWDGDADLVKFLQLAQEEDLFVLLRPGPYICA 126
Query: 142 EWNFGGFPVWLRD-IPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
E FGGFP WL + +PGI+ RTN+ + E + ++ ++ L R + L GGPIIM+Q+
Sbjct: 127 EREFGGFPYWLLNLVPGIKLRTNDTRYLEYAEEYLNQV--LTRVKPLLRGNGGPIIMVQV 184
Query: 201 ENEYGNM---ESSYGQQGK----DYVKWAASMALGLGAGVPWVMCKQTDAPENIID---- 249
ENEYG+ + Y + K ++V A + G+ + C ID
Sbjct: 185 ENEYGSFHACDKDYMTKLKNIIQNHVGTDALLYTTDGSYRQALRCGPVSGAYATIDFGTS 244
Query: 250 ---ACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRG-- 304
N ++P P + +E + GW + W P VE F + +
Sbjct: 245 SNVTQNFNLMREFEPKG---PLVNSEFYPGWLSHW--EEPFERVE--TFKITKMLDEMLS 297
Query: 305 -GSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
G+ +N YM++GGTNF +SG + +TSYDYDAP+ E G L+ K+ +K +
Sbjct: 298 LGASVNMYMFYGGTNFAFSSGANIFDNYTPDLTSYDYDAPLSEAGDLTA-KYHEIKKI 354
>gi|293376766|ref|ZP_06622988.1| glycosyl hydrolase family 35 [Turicibacter sanguinis PC909]
gi|292644632|gb|EFF62720.1| glycosyl hydrolase family 35 [Turicibacter sanguinis PC909]
Length = 589
Score = 159 bits (401), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 100/314 (31%), Positives = 152/314 (48%), Gaps = 34/314 (10%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
++DG ++S IHY R P+ W + K G + +ETYV WN HE GQ++F G
Sbjct: 10 FLVDGKPTRIMSGAIHYFRIMPDHWEHSLYNLKALGFNTVETYVPWNLHEMREGQFDFTG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+V FVK GL + LR GPY+CAEW GG P WL + ++ R ++ F E+++
Sbjct: 70 GKDLVSFVKKAEEIGLMVILRPGPYICAEWENGGLPAWLLNYHDMKIRCDDELFLEKVEN 129
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ K ++ L+ L +GGP+IM+Q+ENEYG+ + K Y++ M G V
Sbjct: 130 YFKVLLPLIVP--LQVTKGGPVIMVQVENEYGSFSND-----KLYLRALKKMIEDAGIDV 182
Query: 234 PW----------VMCKQTDAPENIIDACNGYYCDG--------YKPNSYNKPTLWTENWD 275
P +M E ++ A G + + + P + E W
Sbjct: 183 PLFTSDGAWEQALMSGTLIEEEVLVTANFGSRGNENFDVLQSFMEKHDKKWPLMCMEFWC 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-------Y 328
GW+ W + R +++ + QRG +N YM+ GGTNFG +G
Sbjct: 243 GWFNRWNEDIILRDADEVMTCMKELLQRGS--LNLYMFHGGTNFGFMNGSCAGKIGNLPQ 300
Query: 329 ITSYDYDAPIDEYG 342
+TSYDYDA + E+G
Sbjct: 301 VTSYDYDAFLTEWG 314
Score = 45.8 bits (107), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 52/192 (27%), Positives = 77/192 (40%), Gaps = 49/192 (25%)
Query: 575 SGYNDLILLSQTVGLQNYGAFL--EKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKG 632
G ++L LL + +G NYGA L G RG V + T V
Sbjct: 437 QGEHELSLLVENMGRNNYGARLLAPTQRKGIRGGVMV----------DHHFETEWVQYAL 486
Query: 633 EFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGH 692
F+ I + D T+ IP+T +Y+ F+A + D LD ++GKG A++N
Sbjct: 487 SFETIGDV--------DFTKGWIPNTPAFYEYEFEAHECEDTF-LDCSTLGKGVAFINDF 537
Query: 693 HIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
++GRYW+V P Q Y +P L+ N LV+FE
Sbjct: 538 NLGRYWSV---------------------------GPIQYLY-IPGPLLKVGINKLVLFE 569
Query: 753 ETGGNPFEISVK 764
G I++K
Sbjct: 570 TEGVVAERIALK 581
>gi|325845662|ref|ZP_08168945.1| putative beta-galactosidase [Turicibacter sp. HGF1]
gi|325488263|gb|EGC90689.1| putative beta-galactosidase [Turicibacter sp. HGF1]
Length = 589
Score = 159 bits (401), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 100/314 (31%), Positives = 152/314 (48%), Gaps = 34/314 (10%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
++DG ++S IHY R P+ W + K G + +ETYV WN HE GQ++F G
Sbjct: 10 FLVDGKPTRIMSGAIHYFRIMPDHWEHSLYNLKALGFNTVETYVPWNLHEMREGQFDFTG 69
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
D+V FVK GL + LR GPY+CAEW GG P WL + ++ R ++ F E+++
Sbjct: 70 GKDLVSFVKKAEEIGLMVILRPGPYICAEWENGGLPAWLLNYHDMKIRCDDELFLEKVEN 129
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
+ K ++ L+ L +GGP+IM+Q+ENEYG+ + K Y++ M G V
Sbjct: 130 YFKVLLPLIVP--LQVTKGGPVIMVQVENEYGSFSND-----KLYLRALKKMIEDAGIDV 182
Query: 234 PW----------VMCKQTDAPENIIDACNGYYCDG--------YKPNSYNKPTLWTENWD 275
P +M E ++ A G + + + P + E W
Sbjct: 183 PLFTSDGAWEQALMSGTLIEEEVLVTANFGSRGNENFDVLQSFMEKHDKKWPLMCMEFWC 242
Query: 276 GWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-------Y 328
GW+ W + R +++ + QRG +N YM+ GGTNFG +G
Sbjct: 243 GWFNRWNEDIILRDADEVMTCMKELLQRGS--LNLYMFHGGTNFGFMNGSCAGKIGNLPQ 300
Query: 329 ITSYDYDAPIDEYG 342
+TSYDYDA + E+G
Sbjct: 301 VTSYDYDAFLTEWG 314
Score = 44.3 bits (103), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 51/192 (26%), Positives = 76/192 (39%), Gaps = 49/192 (25%)
Query: 575 SGYNDLILLSQTVGLQNYGAFL--EKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKG 632
G ++L LL + +G NYGA L G RG V + T V
Sbjct: 437 QGEHELSLLVENMGRNNYGARLLAPTQRKGIRGGVMV----------DHHFETEWVQYAL 486
Query: 633 EFQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGH 692
F+ I + D + IP+T +Y+ F+A + D LD ++GKG A++N
Sbjct: 487 SFETIGDV--------DFAKGWIPNTPAFYEYEFEAHECEDTF-LDCSTLGKGVAFINDF 537
Query: 693 HIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
++GRYW+V P Q Y +P L+ N LV+FE
Sbjct: 538 NLGRYWSV---------------------------GPIQYLY-IPGPLLKVGINKLVLFE 569
Query: 753 ETGGNPFEISVK 764
G I++K
Sbjct: 570 TEGVVAERIALK 581
>gi|312378199|gb|EFR24839.1| hypothetical protein AND_10320 [Anopheles darlingi]
Length = 639
Score = 159 bits (401), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 110/349 (31%), Positives = 171/349 (48%), Gaps = 37/349 (10%)
Query: 22 MMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDL 81
M+ ++ C++ + + + F + Y+ ++DG ++ HY RA P+ W
Sbjct: 1 MLRHPIVLAVCLAIAGLAEAQRSFTIDYERDTFVMDGKDFRYVAGSFHYFRALPQTWRTK 60
Query: 82 IAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCA 141
+ + GG + ++ YV W+ H G Y+++G ++ ++ LY+ LR GPY+CA
Sbjct: 61 LRTLRAGGLNAVDLYVQWSLHNPRDGVYSWEGIANVTDIIEAAIEEDLYVILRPGPYICA 120
Query: 142 EWNFGGFPVWL-RDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
E + GG P WL PGI+ RT++A + E++++ +++ M M GGPIIM+QI
Sbjct: 121 EIDNGGLPYWLFNKYPGIQVRTSDANYLAEVKKWYGELMSRMEPYMY--GNGGPIIMVQI 178
Query: 201 ENEYGNMESSYGQQGKDYV--------KWAASMALGLGAGVPW---VMCKQTDAPENIID 249
ENEYG ++G+ K Y+ ++ A+ P+ + C Q D I
Sbjct: 179 ENEYG----AFGKCDKPYLNFLKEETNRYVQDKAVLFTVDRPYDDEIGCGQIDGV--FIT 232
Query: 250 ACNGYYCD------GYKPNSYNK--PTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFF 301
G D K SY P + TE + GW T W RP LA A R
Sbjct: 233 TDFGLMTDEEVDTHAAKVRSYQPKGPLVNTEFYTGWLTHWQESNQRRPAGPLA-ATLRKM 291
Query: 302 QRGGSFMNYYMYFGGTNFGRTSG------GPFY--ITSYDYDAPIDEYG 342
+ G +++YMYFGGTNFG +G G + ITSYDYDAP+DE G
Sbjct: 292 LKDGWNVDFYMYFGGTNFGFWAGANDWGLGKYMADITSYDYDAPMDEAG 340
>gi|195977873|ref|YP_002123117.1| beta-galactosidase precursor Bga [Streptococcus equi subsp.
zooepidemicus MGCS10565]
gi|195974578|gb|ACG62104.1| beta-galactosidase precursor Bga [Streptococcus equi subsp.
zooepidemicus MGCS10565]
Length = 594
Score = 159 bits (401), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 114/377 (30%), Positives = 174/377 (46%), Gaps = 53/377 (14%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+DG ++S IHY R P+ WP ++ + K G + +ETY+ WN HE +GQ+ F+G
Sbjct: 12 LDGKPFKILSGAIHYFRIAPDSWPRVLYQLKALGFNTVETYIPWNMHEPRKGQFTFEGIA 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ F+ L GLY +R PY+CAEW FGG P WL R+++ F + + +
Sbjct: 72 DVEAFLDLAQEYGLYAIVRPSPYICAEWEFGGLPAWLL-TENCRVRSSDEVFLKHVSDYY 130
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
++ + + L + GG I+M Q+ENEYG SYG++ KDY++ + L G P
Sbjct: 131 DVLLPKLVKRQLDN--GGNILMFQLENEYG----SYGEE-KDYLRKLKELMLAKGISAPL 183
Query: 236 VMCK----QTDAPENIID------------ACNGYYC--DGYKPNSYNKPTLWTENWDGW 277
T A ++ID A + D ++ + P + E W GW
Sbjct: 184 FTSDGPWLATLASGSLIDDDVFVTGNFGSNASKQFASMQDFFQAHQKQWPLMCMEFWLGW 243
Query: 278 YTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY-------IT 330
+ W + R ++ A+ + G +N YM+ GGTNFG +G IT
Sbjct: 244 FNRWNEPIIRRDPKEAVDAIMEAIELGS--INLYMFCGGTNFGFMNGSSARLQKDLPQIT 301
Query: 331 SYDYDAPIDEYG-------LLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGQNQEA 383
SYDYDA +DE G LL E LK+ + + EP + + IKL
Sbjct: 302 SYDYDALLDEAGNPTKKYILLQE----RLKERYPQLSFAEPMTSPTMALESIKLSA---- 353
Query: 384 HVYRANRYGSQSNCSAF 400
R + + + N SA
Sbjct: 354 ---RVSLFKTIKNVSAL 367
Score = 44.7 bits (104), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 56/136 (41%), Gaps = 37/136 (27%)
Query: 638 YSIEENEAEWTDLT---RDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHI 694
+ ++ E +W D + DG+P +Y FD D LDL GKG A VNG ++
Sbjct: 484 FLLDFQELDWIDFSAGWTDGVPG---FYAYDFDCQQPAD-TYLDLSQFGKGIALVNGVNL 539
Query: 695 GRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEET 754
GR+W V PT + Y +P+ L+ N L+IFE
Sbjct: 540 GRFWKV---------------------------GPTLSLY-IPKGLLKQGQNRLLIFETE 571
Query: 755 GGNPFEISVKLRSTRI 770
G F S++L I
Sbjct: 572 G--QFSESIRLTKEPI 585
>gi|373953405|ref|ZP_09613365.1| glycoside hydrolase family 35 [Mucilaginibacter paludis DSM 18603]
gi|373890005|gb|EHQ25902.1| glycoside hydrolase family 35 [Mucilaginibacter paludis DSM 18603]
Length = 608
Score = 158 bits (400), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 118/353 (33%), Positives = 173/353 (49%), Gaps = 38/353 (10%)
Query: 52 RAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNF 111
A ++DG +IS +HYPR E W + +K G + I TYVFWN HE +G+++F
Sbjct: 32 EAFLLDGKPFQMISGEMHYPRVPRESWRARMKMAKAMGLNTIGTYVFWNLHEPQKGKFDF 91
Query: 112 KGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEM 171
G ND+ +FV++ GL++ LR PYVCAEW FGG+P WL++ G+ R+ A + +E
Sbjct: 92 TGNNDVAEFVRIAKQEGLWVILRPSPYVCAEWEFGGYPYWLQNEKGLVVRSKEAQYLKEY 151
Query: 172 QRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGA 231
+ ++K++ + + GG I+M+QIENEYG SYG KDY+ A + L A
Sbjct: 152 ESYIKEVGKQLAPLQIN--HGGNILMVQIENEYG----SYGSD-KDYL--AINQKLFKEA 202
Query: 232 GVPWVMCKQTDAPE-------NIIDACNGY-YCDGYK----PNSYNKPTLWTENW-DGWY 278
G ++ A + ++ A NG D K N K + W W+
Sbjct: 203 GFDGLLYTCDPAADLVNGHLPGLLPAVNGIDNPDKVKQIISQNHNGKGPYYIAEWYPAWF 262
Query: 279 TTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF--------YIT 330
WG + P + + G S +N YM+ GGT G +G + ++
Sbjct: 263 DWWGTKHHTVPAAEYTGRLDSVLAAGIS-INMYMFHGGTTRGFMNGANYKDTSPYEPQVS 321
Query: 331 SYDYDAPIDEYGLLSEPKWGHL-----KDLHAAIKLCE-PALVAADSAQYIKL 377
SYDYDAP+DE G + PK+ K L A + L PA A S IKL
Sbjct: 322 SYDYDAPLDEAG-NATPKFMAFRSVIEKHLPAGVTLPPVPAAKPAISVAAIKL 373
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 54/216 (25%), Positives = 84/216 (38%), Gaps = 49/216 (22%)
Query: 539 VTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEK 598
+ I +RD V +NG+ G++ + ++ G L +L + +G N+G +L +
Sbjct: 419 LKIKELRDYAVVMLNGKTVGTLDRRLNQDSLQIKLPVGAVVLDILVENLGRINFGKYLLQ 478
Query: 599 DGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDG--IP 656
+ G +V + T QV Q+YS+ N AE +L +
Sbjct: 479 NKKGITEKV--------------LFNTQQV----NNWQMYSLPFNHAEAINLKSGSSTMG 520
Query: 657 STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGA 716
+ YF+ D LD+ GKG WVNGH++GRYW V
Sbjct: 521 TAPVIKSGYFNLQKTGD-TYLDMRKWGKGLVWVNGHNLGRYWQV---------------- 563
Query: 717 YNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFE 752
P QT Y VP WL+ N + + E
Sbjct: 564 -----------GPQQTLY-VPAEWLKKGQNEVRVLE 587
>gi|426371159|ref|XP_004052521.1| PREDICTED: beta-galactosidase-1-like protein 3 [Gorilla gorilla
gorilla]
Length = 653
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 104/321 (32%), Positives = 162/321 (50%), Gaps = 27/321 (8%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G++ ++ IH R E W D + K K G + + TYV WN HE RG+++F G
Sbjct: 82 LEGHKFLIFGGSIHCFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSGNL 141
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ FV + GL++ LR GPY+C+E + GG P WL P + RT N F E ++++
Sbjct: 142 DLEAFVLMGAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPRLLLRTTNKSFIEAVEKYF 201
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGA---- 231
++ R L QGGP+I +Q+ENEYG+ ++ K Y+ + L G
Sbjct: 202 DHLIP--RVIPLQYRQGGPVIAVQVENEYGSF-----KKDKTYMLYLHKALLRRGIVELL 254
Query: 232 ----GVPWVMCKQTD---APENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGR 284
G V+ T A N+ + +K +KP L E W GW+ WG +
Sbjct: 255 LTSDGEKHVLSGHTKGVLAAINLQKLHQDTFNQLHKVQR-DKPLLIMEYWVGWFDRWGDK 313
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAPI 338
+ +++ AV+ F + SF N YM+ GGTNFG +G ++ +TSYDYDA +
Sbjct: 314 HHVKDAKEVEHAVSEFIKYEISF-NVYMFHGGTNFGFMNGATYFGKHSGIVTSYDYDAVL 372
Query: 339 DEYGLLSEPKWGHLKDLHAAI 359
E G +E K+ L+ L ++
Sbjct: 373 TEAGDYTE-KYLKLQKLFQSV 392
>gi|395541292|ref|XP_003772579.1| PREDICTED: beta-galactosidase [Sarcophilus harrisii]
Length = 673
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 115/363 (31%), Positives = 173/363 (47%), Gaps = 35/363 (9%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
+ F + Y+ + DG IS IHY R W D + K K G + IETYV WN H
Sbjct: 59 RTFTIDYEGDQFLKDGKPFRYISGSIHYSRIPRFYWKDRLFKMKMAGLNAIETYVPWNFH 118
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E GQY F G+ D+ F++LV GL + LR GPY+CAEW+ GG PVWL + I R+
Sbjct: 119 EPFPGQYQFSGEQDLEYFLQLVHEVGLLVILRPGPYICAEWDMGGLPVWLLEKKSIFLRS 178
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWA 222
++ + + + ++++ ++ M+ + GGPII +Q+ENEYG SY +Y+++
Sbjct: 179 SDPDYLKAVDKWLEVLLPKMKPYLY--QNGGPIITVQVENEYG----SYFACDYNYLRFL 232
Query: 223 ASM-ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNK--------------- 266
+ LG V V+ A EN + G D Y +
Sbjct: 233 LKVFRQHLGEEV--VLFTTDGAGENYLKC--GTLQDLYATVDFGTSSNITQAFMIQRKVE 288
Query: 267 ---PTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTS 323
P + +E + GW WG +++ ++ RG + +N YM+ GGTNFG +
Sbjct: 289 PKGPLVNSEFYTGWLDHWGESHQTVSTKNIVASLTDMLSRGAN-VNLYMFIGGTNFGFWN 347
Query: 324 GG--PFY--ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCE-PALVAADSAQYIKLG 378
G P+ TSYDYDAP+ E G L+E + + + KL E P + Y K+
Sbjct: 348 GANMPYLPQPTSYDYDAPLSEAGDLTEKYYAVREAIGKFEKLPEGPIPPSTPKFAYGKVA 407
Query: 379 QNQ 381
Q
Sbjct: 408 MKQ 410
>gi|289670687|ref|ZP_06491762.1| beta-galactosidase [Xanthomonas campestris pv. musacearum NCPPB
4381]
Length = 612
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 110/346 (31%), Positives = 165/346 (47%), Gaps = 47/346 (13%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+ DG L+S +H+ R W D + K++ G + +ETYVFWN E +GQ++F G
Sbjct: 38 FVRDGKPYQLLSGAVHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 97
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
ND+ FV+ + GL + LR GPY CAEW GG+P WL I R+ + F Q
Sbjct: 98 NNDVAAFVREAAALGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQA 157
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
++ + ++ L + GGPII +Q+ENEYG SY D+ A + A+ + AG
Sbjct: 158 YLDALAKQVQP--LLNHNGGPIIAVQVENEYG----SY---ADDHAYMAENRAMYVKAGF 208
Query: 234 PWVMCKQTDAPENIIDACNGYYCD---------GYKPNSYNK--------PTLWTENWDG 276
+ +D + + NG D G ++++K P + E W G
Sbjct: 209 DKALLFTSDGADML---ANGTLPDTLAVVNFAPGEAKSAFDKLIKFRSDQPRMVGEYWAG 265
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQ---RGGSFMNYYMYFGGTNFGRTSGGPF------ 327
W+ WG PH + A A F+ R G N YM+ GGT+FG +G +
Sbjct: 266 WFDHWGK--PHAATD--ARQQADEFEWILRQGHSANLYMFIGGTSFGFMNGANYQNNPSD 321
Query: 328 ----YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAA 369
TSYDYDA +DE G + PK+ ++D A + +P + A
Sbjct: 322 HYAPQTTSYDYDAILDEAGHPT-PKFALMRDAIARVTGVQPPALPA 366
>gi|324509196|gb|ADY43870.1| Beta-galactosidase [Ascaris suum]
Length = 639
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 102/324 (31%), Positives = 161/324 (49%), Gaps = 29/324 (8%)
Query: 45 FNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHES 104
F++ Y ++ ++DG IS IHY R P+ W D +++ + G + I+ Y+ WN HE
Sbjct: 27 FSIDYVNKRFLLDGQPFRYISGSIHYFRVHPDQWNDRLSRMRAAGLNAIQFYIPWNFHEI 86
Query: 105 IRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNN 164
G F G +I +F+ L + LY +RIGPY+C EW GG P WL I+ RT++
Sbjct: 87 YEGVIGFDGGRNITRFLSLAAQNELYALVRIGPYICGEWENGGLPWWLLKYDDIKMRTSD 146
Query: 165 APFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQ---------- 214
F ++R+ ++ +++ + GGPI+M+Q+ENEYG+ ++
Sbjct: 147 KRFIRAVERWFGVLLPILKPSL--RKNGGPILMIQVENEYGSFTEGCDRKYTTFLRDLTI 204
Query: 215 ---GKDYVKW----AASMALGLGAGVPWVMCKQTDAP--ENIIDACNGYYCDGYKPNSYN 265
G D V + A + +L G+ +P V P E ID N Y+PN
Sbjct: 205 KHLGDDVVLYTTDGANNQSLKCGS-IPGVFATVDFGPNSEEQIDK-NFATQRSYEPNG-- 260
Query: 266 KPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG 325
P + +E + GW TW + P D +++ + G+ NYYM++GGTNF +G
Sbjct: 261 -PLVNSEFYPGWIVTWSQKGRIDPSVDEIINGSKYMFKLGASFNYYMFYGGTNFAFWNGA 319
Query: 326 ---PFYITSYDYDAPIDEYGLLSE 346
ITSYDY AP+ E ++E
Sbjct: 320 ETTSAVITSYDYFAPLTEAADINE 343
>gi|289664883|ref|ZP_06486464.1| beta-galactosidase [Xanthomonas campestris pv. vasculorum NCPPB
702]
Length = 582
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 110/346 (31%), Positives = 165/346 (47%), Gaps = 47/346 (13%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+ DG L+S +H+ R W D + K++ G + +ETYVFWN E +GQ++F G
Sbjct: 8 FVRDGKPYQLLSGAVHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 67
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
ND+ FV+ + GL + LR GPY CAEW GG+P WL I R+ + F Q
Sbjct: 68 NNDVAAFVREAAALGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQA 127
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
++ + ++ L + GGPII +Q+ENEYG SY D+ A + A+ + AG
Sbjct: 128 YLDALAKQVQP--LLNHNGGPIIAVQVENEYG----SY---ADDHAYMAENRAMYVKAGF 178
Query: 234 PWVMCKQTDAPENIIDACNGYYCD---------GYKPNSYNK--------PTLWTENWDG 276
+ +D + + NG D G ++++K P + E W G
Sbjct: 179 DKALLFTSDGADML---ANGTLPDTLAVVNFAPGEAKSAFDKLIKFRSDQPRMVGEYWAG 235
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQ---RGGSFMNYYMYFGGTNFGRTSGGPF------ 327
W+ WG PH + A A F+ R G N YM+ GGT+FG +G +
Sbjct: 236 WFDHWGK--PHAATD--ARQQADEFEWILRQGHSANLYMFIGGTSFGFMNGANYQNNPSD 291
Query: 328 ----YITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAA 369
TSYDYDA +DE G + PK+ ++D A + +P + A
Sbjct: 292 HYAPQTTSYDYDAILDEAGHPT-PKFALMRDAIARVTGVQPPALPA 336
>gi|296081427|emb|CBI16778.3| unnamed protein product [Vitis vinifera]
Length = 242
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 74/123 (60%), Positives = 88/123 (71%), Gaps = 4/123 (3%)
Query: 248 IDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
I+ CN +YCD + PNS NKP +WTENW GW T+G PH P ED+ F+VARFF +
Sbjct: 120 INTCNSFYCDQFTPNSPNKPKMWTENWPGWSKTFGALDPHGPREDIVFSVARFFWK---- 175
Query: 308 MNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALV 367
+NYYM GGTNFGRTSGGPF T+YDY+APIDEYGL PK GHLK+L AIK CE L+
Sbjct: 176 VNYYMDHGGTNFGRTSGGPFITTTYDYNAPIDEYGLARLPKCGHLKELRRAIKSCEHVLL 235
Query: 368 AAD 370
+
Sbjct: 236 YGE 238
>gi|167524869|ref|XP_001746770.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775040|gb|EDQ88666.1| predicted protein [Monosiga brevicollis MX1]
Length = 600
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 112/360 (31%), Positives = 173/360 (48%), Gaps = 33/360 (9%)
Query: 14 LALSVYPMMMMMMMIHLSCVSSSSASTFF--KPFNVSYDHRAIIIDGNRRMLI------- 64
+AL + ++ + V S S + F K V + A+ + N +L
Sbjct: 10 VALGILAAALVFLAFSHQHVQSRSQARHFHNKNVQVRSNRAALAVSSNGFLLYGHPFDIW 69
Query: 65 SAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK-NDIVKFVKL 123
S +HY R E W D + +K G + I TYV WN HE G ++F+ +D+ +F+ L
Sbjct: 70 SGSLHYFRIPAEYWLDRLEMAKHMGLNTISTYVPWNFHEVGPGSFDFETHAHDLARFLNL 129
Query: 124 VGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMR 183
GL + +R PY+CAEW+FGG P L P +E R++N F +E++R+ ++ ++R
Sbjct: 130 AHEVGLRVLIRPSPYICAEWDFGGLPARLMANPDLELRSSNDAFLDEVERYYDALMPILR 189
Query: 184 EEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTD- 242
L + GGPII +ENEYG SYG +DY++ +M G C
Sbjct: 190 P--LQASNGGPIIAFYVENEYG----SYGAD-RDYLQALVAMMRDRGIVEQMFTCDNAQG 242
Query: 243 ----APENIIDACN-----GYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDL 293
A + N + D ++P + +E W GW+ G EDL
Sbjct: 243 LSRGALPGALQTINFQDNVERHLDQLAHFQPDQPLMVSEYWTGWFDHDGEEHHTFDSEDL 302
Query: 294 AFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY--ITSYDYDAPIDEYGLLSEPKW 349
+ + RG SF N Y++ GGT+FG +G P+ ITSYDYDAP+ E+G ++ PK+
Sbjct: 303 VEGLQKILDRGASF-NLYVFHGGTSFGWNAGANSPYAPDITSYDYDAPLSEHGQVT-PKY 360
>gi|158455090|gb|AAI40686.2| Galactosidase, beta 1 [Bos taurus]
Length = 653
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 110/335 (32%), Positives = 160/335 (47%), Gaps = 33/335 (9%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
+ F + Y + DG IS IHY R W D + K K G + I+TYV WN H
Sbjct: 29 RTFQIDYRRNRFLKDGQPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQTYVAWNFH 88
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E G+YNF G +D+ F++L GL + LR GPY+CAEW+ GG P WL + I R+
Sbjct: 89 ELQPGRYNFSGDHDVEHFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKKSIVLRS 148
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKW- 221
++ + + +++ ++ MR L GGPII +Q+ENEYG SY DY+++
Sbjct: 149 SDPDYLAAVDKWLGVLLPKMRP--LLYKNGGPIITVQVENEYG----SYLSCDYDYLRFL 202
Query: 222 AASMALGLGAGVPWVMCKQTDAPENIIDACNG----YYCDGYKPNSY------------- 264
LG V+ TD + C Y + P +
Sbjct: 203 QKRFHDHLGED---VLLFTTDGVNERLLQCGALQGLYATLDFSPGTNLTAAFMLQRKFEP 259
Query: 265 NKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG 324
P + +E + GW WG R + +AF + G + +N YM+ GGTNF +G
Sbjct: 260 TGPLVNSEFYTGWLDHWGQRHSTVSSKAVAFTLHDMLALGAN-VNMYMFIGGTNFAYWNG 318
Query: 325 G--PF--YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
P+ TSYDYDAP+ E G L+E K+ L+D+
Sbjct: 319 ANIPYQPQPTSYDYDAPLSEAGDLTE-KYFALRDI 352
>gi|296475022|tpg|DAA17137.1| TPA: galactosidase, beta 1 precursor [Bos taurus]
Length = 653
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 110/335 (32%), Positives = 160/335 (47%), Gaps = 33/335 (9%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
+ F + Y + DG IS IHY R W D + K K G + I+TYV WN H
Sbjct: 29 RTFQIDYRRNRFLKDGQPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQTYVAWNFH 88
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E G+YNF G +D+ F++L GL + LR GPY+CAEW+ GG P WL + I R+
Sbjct: 89 ELQPGRYNFSGDHDVEHFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKKSIVLRS 148
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKW- 221
++ + + +++ ++ MR L GGPII +Q+ENEYG SY DY+++
Sbjct: 149 SDPDYLAAVDKWLGVLLPKMRP--LLYKNGGPIITVQVENEYG----SYLSCDYDYLRFL 202
Query: 222 AASMALGLGAGVPWVMCKQTDAPENIIDACNG----YYCDGYKPNSY------------- 264
LG V+ TD + C Y + P +
Sbjct: 203 QKRFHDHLGED---VLLFTTDGVNERLLQCGALQGLYATVDFSPGTNLTAAFMLQRKFEP 259
Query: 265 NKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG 324
P + +E + GW WG R + +AF + G + +N YM+ GGTNF +G
Sbjct: 260 TGPLVNSEFYTGWLDHWGQRHSTVSSKAVAFTLHDMLALGAN-VNMYMFIGGTNFAYWNG 318
Query: 325 G--PF--YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
P+ TSYDYDAP+ E G L+E K+ L+D+
Sbjct: 319 ANIPYQPQPTSYDYDAPLSEAGDLTE-KYFALRDI 352
>gi|419799561|ref|ZP_14324899.1| glycosyl hydrolase family 35 [Streptococcus parasanguinis F0449]
gi|385697826|gb|EIG28233.1| glycosyl hydrolase family 35 [Streptococcus parasanguinis F0449]
Length = 595
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 101/310 (32%), Positives = 157/310 (50%), Gaps = 31/310 (10%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+ G ++S IHY R P W + K G + +ETYV WN HE +GQ++F G+
Sbjct: 12 LKGQPFKILSGAIHYFRIDPADWYHSLFNLKALGFNTVETYVPWNVHEPRKGQFDFSGRL 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ +F++ S GLY+ +R P++CAEW FGG P WL + + R+++ F E + R+
Sbjct: 72 DLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWLLE-EDMRIRSSDPVFIEAVDRYY 130
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVKWAASMALGLGAGV 233
++ L+ + QGGPI+M+Q+ENEYG+ + +Y + +D +K +
Sbjct: 131 DHLLGLLTRYQVD--QGGPILMMQVENEYGSYGEDKAYLRAIRDLMKEKGVTCPLFTSDG 188
Query: 234 PWVMCKQTDAPENIID---------ACNGYYCDGYKPNSYNK-----PTLWTENWDGWYT 279
PW + T N+I+ Y G +++ P + E WDGW+T
Sbjct: 189 PW---RATLRAGNLIEDDLFVTGNFGSKAAYNFGQMQEFFDEYGKKWPLMCMEFWDGWFT 245
Query: 280 TWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG----GPF---YITSY 332
W + R E+LA AV + G +N YM+ GGTNFG +G G +TSY
Sbjct: 246 RWKEPVIQREPEELAEAVHEVLELGS--INLYMFHGGTNFGFMNGCSARGTLDLPQVTSY 303
Query: 333 DYDAPIDEYG 342
DY A ++E G
Sbjct: 304 DYGALLNEQG 313
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 30/118 (25%), Positives = 49/118 (41%), Gaps = 29/118 (24%)
Query: 638 YSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
Y ++ + D +++ +Y+ F +D LD+ GKG A+VNGH++GR+
Sbjct: 485 YPLDLQDLSQLDFSKEWQAGAPAFYRYDFQLDQTLD-TYLDMTGFGKGVAFVNGHNLGRF 543
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
W V PT + Y VP +L+ N L++FE G
Sbjct: 544 WEV---------------------------GPTTSLY-VPHGFLKEGANSLIVFETEG 573
>gi|325922356|ref|ZP_08184130.1| beta-galactosidase [Xanthomonas gardneri ATCC 19865]
gi|325547138|gb|EGD18218.1| beta-galactosidase [Xanthomonas gardneri ATCC 19865]
Length = 613
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 107/330 (32%), Positives = 154/330 (46%), Gaps = 45/330 (13%)
Query: 54 IIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKG 113
+ DG L+S IH+ R E W D + K++ G + +ETYVFWN E +GQ++F G
Sbjct: 39 FVRDGKPYQLLSGAIHFQRIPREYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFAG 98
Query: 114 KNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQR 173
ND+ FV+ + GL + LR GPY CAEW GG+P WL I R+ + F Q
Sbjct: 99 NNDVAAFVREAAAQGLNVILRPGPYTCAEWEAGGYPAWLFGKDNIRVRSRDPRFLAASQA 158
Query: 174 FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGV 233
++ + + L + GGPII +Q+ENEYG+ + D+ A + A+ + AG
Sbjct: 159 YLDAVSKQVHP--LLNHNGGPIIAVQVENEYGSYDD-------DHAYMADNRAMYVKAGF 209
Query: 234 PWVMCKQTDAPENIIDACNGYYCD---------GYKPNSYNK--------PTLWTENWDG 276
+ +D + + NG D G ++ K P + E W G
Sbjct: 210 DDALLFTSDGADML---ANGTLPDTLAVVNFAPGEAKTAFEKLIKFRPEQPRMVGEYWAG 266
Query: 277 WYTTWGGRLPHRPVEDLAFAVARF--FQRGGSFMNYYMYFGGTNFGRTSGGPF------- 327
W+ WG PH D F R G N YM+ GGT+FG +G F
Sbjct: 267 WFDHWGK--PHAST-DAKQQTEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQGNPSDH 323
Query: 328 ---YITSYDYDAPIDEYGLLSEPKWGHLKD 354
TSYDYDA +DE G + PK+ ++D
Sbjct: 324 YAPQTTSYDYDAILDEAGRPT-PKFALMRD 352
>gi|168039839|ref|XP_001772404.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162676391|gb|EDQ62875.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 615
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 119/351 (33%), Positives = 176/351 (50%), Gaps = 50/351 (14%)
Query: 57 DGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKND 116
DG ++ +HY R +W D + + K G + I+TYV WN HE G NF G D
Sbjct: 11 DGIPFRILGGELHYFR----LWEDRLLRVKSLGLNTIQTYVPWNLHEPRPGHLNFNGSAD 66
Query: 117 IVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDI-PGIEFRTNNAPFKEEMQRFV 175
++ F+KL L + LRIGPY+C EW+ GG P WL ++ P + R+++A + + +
Sbjct: 67 LLSFLKLAHRLDLLVMLRIGPYMCGEWDLGGLPAWLLELKPSVRLRSSDAQYLARVDNWW 126
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA-LGLGAGVP 234
K+++ ++ E+ + GGP+IM+QIENEYG +YG K Y+K+ S A L LG +
Sbjct: 127 KELLPMVAPELFLA--GGPVIMVQIENEYG----TYGSD-KLYLKFLQSQARLHLGDDI- 178
Query: 235 WVMCKQTDAPENIID----------ACNGYYCDGYKPNS-------YN----KPTLWTEN 273
+ ENI D A N + G P S +N P L TE
Sbjct: 179 IIYTTDGAVEENIRDGSLPEAGVLAAIN--FQTGSDPASAFALQKRHNPPGMSPPLATEF 236
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG-----GPF- 327
+ GW + WG +L + A A+ + S + YM GGTNFG SG GP
Sbjct: 237 YTGWLSHWGEKLAKTDAKSTAEALDNILRLNASVV-LYMVHGGTNFGFFSGANTGTGPSD 295
Query: 328 ---YITSYDYDAPIDEYGLLSEPKWGHLKDL---HAAIKLCEPALVAADSA 372
ITSYDYDAPI E G + K+ ++++ +AA +L +P + +A
Sbjct: 296 FQPDITSYDYDAPIGEAGDVGGVKYQEIRNVLSKYAAGRLPDPPPLPQRTA 346
>gi|423259078|ref|ZP_17240001.1| hypothetical protein HMPREF1055_02278 [Bacteroides fragilis
CL07T00C01]
gi|423263951|ref|ZP_17242954.1| hypothetical protein HMPREF1056_00641 [Bacteroides fragilis
CL07T12C05]
gi|387776658|gb|EIK38758.1| hypothetical protein HMPREF1055_02278 [Bacteroides fragilis
CL07T00C01]
gi|392706217|gb|EIY99340.1| hypothetical protein HMPREF1056_00641 [Bacteroides fragilis
CL07T12C05]
Length = 773
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 107/324 (33%), Positives = 158/324 (48%), Gaps = 31/324 (9%)
Query: 52 RAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNF 111
R +++GN ++ +A +HY R W I K G + I Y+FWN HE G+++F
Sbjct: 31 RTFLLNGNPFVVKAAELHYARIPEPYWEHRILMCKALGMNTICLYMFWNYHEQQEGKFDF 90
Query: 112 KGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEM 171
G+ ++ KF KL G+Y+ LR GPYVCAEW GG P WL ++ R+ N F E
Sbjct: 91 SGEKNVAKFCKLAQKHGMYIILRPGPYVCAEWEMGGLPWWLLKEKDMKVRSLNPYFMERT 150
Query: 172 QRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYG--NMESSYGQQGKDYV---------- 219
+ F+K++ + L + GG IIM+Q+ENE+G ++ Y +D V
Sbjct: 151 EIFMKELGKQLAPLQLAN--GGNIIMVQVENEFGGYGVDKPYMTAIRDIVCRAGFDKSVL 208
Query: 220 ---KWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDG 276
W ++ L + W + T A NI +P++ P + +E W G
Sbjct: 209 FQCDWDSTFELNALDDLLWTLNFGTGA--NIDKEFKK--LSTVRPDT---PLMCSEFWSG 261
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITS 331
W+ WG + RP E + + R SF + YM GGT FG G P Y +S
Sbjct: 262 WFDHWGRKHETRPAEKMVEGIKDMLDRNISF-SLYMTHGGTTFGHWGGANSPTYSAMCSS 320
Query: 332 YDYDAPIDEYGLLSEPKWGHLKDL 355
YDYDAPI E G + PK+ L++L
Sbjct: 321 YDYDAPISEAGWTT-PKYYLLQEL 343
Score = 41.2 bits (95), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 30/116 (25%), Positives = 47/116 (40%), Gaps = 31/116 (26%)
Query: 656 PSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRG 715
P +YK F+ D +D+ + GKG WVNGH +GR+W +
Sbjct: 523 PEGPAYYKATFNLTKTGD-TFIDMSTWGKGMVWVNGHALGRFWEI--------------- 566
Query: 716 AYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIV 771
P QT + +P WL+ N +++ + G P E VK T ++
Sbjct: 567 ------------GPQQTLF-LPGCWLKKGKNEIIVLDLKG--PSEAVVKGLKTPVL 607
>gi|392987629|ref|YP_006486222.1| glucosyl hydrolase family protein [Enterococcus hirae ATCC 9790]
gi|392335049|gb|AFM69331.1| glucosyl hydrolase family protein [Enterococcus hirae ATCC 9790]
Length = 592
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 110/338 (32%), Positives = 164/338 (48%), Gaps = 38/338 (11%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
+++G ++S IHY R W + K G + +ETYV WN HE +G ++F+G
Sbjct: 11 LLNGKPFKILSGAIHYFRVDSADWYHSLYNLKALGFNTVETYVPWNLHEPKKGDFHFEGI 70
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
D+ F+ + GLY +R PY+CAEW FGGFP WL + G RTN + + +
Sbjct: 71 LDLEHFLSIAEELGLYAIVRPSPYICAEWEFGGFPAWLLN-EGTRIRTNETVYLNHVADY 129
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
++ + L + GG I+M+QIENEYG SYG++ KDY++ + L G VP
Sbjct: 130 YDVLIKKIVPHQLTN--GGNILMIQIENEYG----SYGEE-KDYLRSIRDLMLDRGITVP 182
Query: 235 WVMC----KQTDAPENIIDA---CNGYYCDGYKPN------SYNK-----PTLWTENWDG 276
+ + T ++ID G + + N +N+ P + E WDG
Sbjct: 183 FFTSDGPWRATLRAGSMIDEDILVTGNFGSKAEENFSSMEAFFNEHGKKWPLMCMEFWDG 242
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY-------I 329
W+ W + R ++LA A+ RG +N YM+ GGTNFG +G I
Sbjct: 243 WFNRWKEPIVQRDAKELAEAIKEVVLRGS--INLYMFHGGTNFGFMNGCSARGVIDLPQI 300
Query: 330 TSYDYDAPIDEYGLLSEPKWGHLKDLHAA---IKLCEP 364
TSYDY AP+DE G +E + +H I+ EP
Sbjct: 301 TSYDYGAPLDEQGNPTEKYYAIQTMIHETFPDIQQMEP 338
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 58/224 (25%), Positives = 90/224 (40%), Gaps = 55/224 (24%)
Query: 545 RDVLRVFINGQLTGSV----IGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDG 600
RD ++F+N +L + IG + V P E N L +L + +G NYG L D
Sbjct: 407 RDRSQLFLNQKLQATQYQTEIGEDIIVPMPQE----DNQLDILIENMGRVNYGHKLLAD- 461
Query: 601 AGFRGQVKLTGFKNGDI-DLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTF 659
+ G + G + DL I Q Y + E D +++ P+
Sbjct: 462 ------TQKKGIRTGVMADLHFITDWNQ----------YCLPLESCEKVDFSKEWHPNQP 505
Query: 660 TWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNS 719
++Y+ Y D ++ +DL GKG +VN +IGR+W V
Sbjct: 506 SFYR-YEVTLDEVEDSFIDLSKFGKGVVFVNQTNIGRFWEV------------------- 545
Query: 720 DKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISV 763
PT + Y +P+S + NN +VIFE G EI +
Sbjct: 546 --------GPTLSLY-IPKSLFKKGNNEIVIFETEGTFQPEIQL 580
>gi|337283005|ref|YP_004622476.1| beta-galactosidase [Streptococcus parasanguinis ATCC 15912]
gi|335370598|gb|AEH56548.1| beta-galactosidase [Streptococcus parasanguinis ATCC 15912]
Length = 595
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 101/307 (32%), Positives = 157/307 (51%), Gaps = 25/307 (8%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+ G ++S IHY R P W + K G + +ETYV WN HE +GQ++F G+
Sbjct: 12 LKGQPFKILSGAIHYFRIDPADWYHSLFNLKALGFNTVETYVPWNVHEPRKGQFDFSGRL 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ +F++ S GLY+ +R P++CAEW FGG P WL + + R+++ F E + R+
Sbjct: 72 DLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWLLE-EDMRIRSSDPAFIEAVDRYY 130
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVKWAASMALGLGAGV 233
++ L+ + QGGPI+M+Q+ENEYG+ + +Y + +D +K +
Sbjct: 131 DHLLGLLTPYQVD--QGGPILMMQVENEYGSYGEDKAYLRAIRDLMKKKGVTCPLFTSDG 188
Query: 234 PW--VMCKQTDAPENIIDACN----GYYCDGYKPNSYNK-----PTLWTENWDGWYTTWG 282
PW + T E++ N Y G +++ P + E WDGW+T W
Sbjct: 189 PWRAALRAGTLIEEDLFVTGNFGSKAAYNFGQMQEFFDEYGKKWPLMCMEFWDGWFTRWK 248
Query: 283 GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG----GPF---YITSYDYD 335
+ R E+LA AV + G +N YM+ GGTNFG +G G +TSYDY
Sbjct: 249 EPVIQREPEELAEAVHEVLELGS--INLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYG 306
Query: 336 APIDEYG 342
A ++E G
Sbjct: 307 ALLNEQG 313
Score = 40.4 bits (93), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 29/118 (24%), Positives = 48/118 (40%), Gaps = 29/118 (24%)
Query: 638 YSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
Y ++ + D +++ +Y+ F +D LD+ GKG +VNGH++GR+
Sbjct: 485 YPLDLQDLSQLDFSKEWQAGAPAFYRYDFQLDHTLD-TYLDMTGFGKGVVFVNGHNLGRF 543
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
W V PT + Y VP +L+ N L++FE G
Sbjct: 544 WEV---------------------------GPTTSLY-VPHGFLKEGANSLIVFETEG 573
>gi|194213011|ref|XP_001503026.2| PREDICTED: beta-galactosidase-1-like protein 3-like [Equus
caballus]
Length = 880
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 102/326 (31%), Positives = 161/326 (49%), Gaps = 37/326 (11%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G++ ++ IHY R E W D + K K G + + TYV WN HE RG+++F G
Sbjct: 250 LEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPERGRFDFSGNL 309
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ FV GL++ LR GPY+C+E + GG P L P + RT + F E + ++
Sbjct: 310 DLEAFVLTAAEIGLWVILRPGPYICSEIDLGGLPSRLLQDPQVNLRTTDKGFVEAVDKYF 369
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
++ R L +GGPII +Q+ENEYG+ + KDY+ + L G
Sbjct: 370 DHLIS--RVVHLQYRKGGPIIAVQVENEYGSF-----YKDKDYMPYLQQALLKRG----- 417
Query: 236 VMCKQTDAPENIIDACNGYY--------CDGYKPNSY--------NKPTLWTENWDGWYT 279
+ + +N+ D GY ++ +++ +KP + E W GW+
Sbjct: 418 -IVELLLTSDNVDDVLKGYIKGVLATINMKKFRKDAFQHLYKVQRDKPIMIMEYWVGWFD 476
Query: 280 TWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYD 333
TWG + + D+ V+ F + SF N YM+ GGTNFG +G + +TSYD
Sbjct: 477 TWGSKHEVKDAGDVKNTVSEFIKFEISF-NVYMFHGGTNFGFINGAINFVKHAGVVTSYD 535
Query: 334 YDAPIDEYGLLSEPKWGHLKDLHAAI 359
YDA + E G ++ K+ L+ L +I
Sbjct: 536 YDAVLTEAGDYTK-KYFKLRKLFGSI 560
>gi|24582088|ref|NP_608978.2| beta galactosidase, isoform A [Drosophila melanogaster]
gi|21430516|gb|AAM50936.1| LP09580p [Drosophila melanogaster]
gi|22945722|gb|AAF52321.2| beta galactosidase, isoform A [Drosophila melanogaster]
Length = 672
Score = 158 bits (399), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 112/374 (29%), Positives = 180/374 (48%), Gaps = 50/374 (13%)
Query: 2 HSKKNNRALLQCLALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRR 61
++K A+ CL ++V M + + + + + A F + ++ ++DG
Sbjct: 6 RNRKLTMAVSGCLIIAV---MALTVGLCVGLSGDTDAPEEQPRFTIDHEANTFMLDGQPF 62
Query: 62 MLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFV 121
+S HY RA PE W + + G + ++TYV W+ H G+YN++G D+VKF+
Sbjct: 63 RYVSGSFHYFRAVPESWRSRLRTMRASGLNALDTYVEWSLHNPHDGEYNWEGIADVVKFL 122
Query: 122 KLVGSSGLYLQLRIGPYVCAEWNFGGFPVWL-RDIPGIEFRTNNAPFKEEMQRFVKKIVD 180
++ Y+ LR GPY+CAE + GG P WL P I+ RTN+ + E+ ++ ++
Sbjct: 123 EIAQEEDFYIILRPGPYICAERDNGGLPHWLFTKYPSIKMRTNDPNYISEVGKWYAEL-- 180
Query: 181 LMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKW--------AASMALGLGAG 232
+ R + LF GG IIM+Q+ENEYG+ + DY+ W + AL
Sbjct: 181 MPRLQHLFVGNGGKIIMVQVENEYGDYACDH-----DYLNWLRDETEKYVSGKALLFTVD 235
Query: 233 VP--WVMCKQTDAPENIIDACNGYYCDGYKPNSYNK------------PTLWTENWDGWY 278
+P + C + EN+ A + D + N +K P + +E + GW
Sbjct: 236 IPNEKMSCGKI---ENVF-ATTDFGID--RINEIDKIWAMLRALQPTGPLVNSEFYPGWL 289
Query: 279 TTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY---------- 328
T W + R +++A A+ S +N YM+FGGTNFG T+G +
Sbjct: 290 THWQEQNQRRDGQEVANALRTILSYNAS-VNLYMFFGGTNFGFTAGANYNLDGGIGYAAD 348
Query: 329 ITSYDYDAPIDEYG 342
ITSYDYDA +DE G
Sbjct: 349 ITSYDYDAVMDEAG 362
Score = 43.5 bits (101), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 76/315 (24%), Positives = 123/315 (39%), Gaps = 62/315 (19%)
Query: 455 LPLSPNISV-PQQSMIESKLSSTSKSWMTVKEPIGVWSENNFTVQGI----LEHLNVTKD 509
LPL P I++ P + + ++ T K + E S+ + V+ I E L++
Sbjct: 379 LPL-PEITLNPAKRLAYGRVELTPKLTLLSTEGRAALSKGD-PVESIKPKTFEELDL--- 433
Query: 510 YSDYLWHITQIYVSDDDISFWKTNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQ 569
YS + + T++ D D + K ID + D VF++ +L G++
Sbjct: 434 YSGLVLYETELPSMDLDPALLK---------IDQINDRAHVFVDQELVGTLSREAQIYSL 484
Query: 570 PVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILW-TYQV 628
P+ G + L LL + G N+ ++ D G G+V L G + L W +
Sbjct: 485 PLSKGWG-STLQLLVENQGRVNF--YISNDTKGIFGEVSLQLHNGGYLPLEN--WRSTAF 539
Query: 629 GLKGEFQQIYSIEENEAEWTD--LTRDGI----PSTFTWYKTYFDAPDGIDPVALDLGSM 682
L+ +++ E + + D L R I P +T T + D L++
Sbjct: 540 PLEQSAVELWRREHTDEKALDPLLARQRILRNGPILYTGSLTVTEVGD----TYLNMAGW 595
Query: 683 GKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQ 742
GKG A+VNG ++GRYW V P Q +VP L+
Sbjct: 596 GKGVAYVNGFNLGRYWPVAGP---------------------------QVTLYVPNEILK 628
Query: 743 ASNNLLVIFEETGGN 757
N LVI E N
Sbjct: 629 VGENSLVILEYQRAN 643
>gi|3025876|gb|AAC12775.1| lysosomal beta-galactosidase [Canis lupus familiaris]
Length = 662
Score = 158 bits (399), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 109/325 (33%), Positives = 156/325 (48%), Gaps = 30/325 (9%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
+ F + Y H + DG IS IHY R W D + K K G + I+TYV WN H
Sbjct: 25 RTFTIDYSHNRFLKDGQPFRYISGSIHYSRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFH 84
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E GQY F G+ D+ F+KL GL + LR GPY+CAEW+ GG P WL I R+
Sbjct: 85 EPQPGQYQFSGEQDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRS 144
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWA 222
++ + + +++ ++ M+ L GGPII +Q+ENEYG SY DY+++
Sbjct: 145 SDPDYLAAVDKWLGVLLPKMKP--LLYQNGGPIITMQVENEYG----SYFTCDYDYLRFL 198
Query: 223 ASM---ALGL--------GAGVPWVMCKQTDAPENIIDACNGYYCDG----YKPNSYNKP 267
+ LG GA ++ C +D G + + P
Sbjct: 199 QKLFHHHLGNDVLLFTTDGANEKFLQCGALQGLYATVDFGPGANITAAFQIQRKSEPKGP 258
Query: 268 TLWTENWDGWYTTWGGRLPHRPV--EDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG 325
+ +E + GW WG PH V E +A ++ G + +N YM+ GGTNF +G
Sbjct: 259 LVNSEFYTGWLDHWGQ--PHSTVRTEVVASSLHDILAHGAN-VNLYMFIGGTNFAYWNGA 315
Query: 326 --PFYI--TSYDYDAPIDEYGLLSE 346
P+ TSYDYDAP+ E G L+E
Sbjct: 316 NMPYQAQPTSYDYDAPLSEAGDLTE 340
>gi|78042544|ref|NP_001030215.1| beta-galactosidase precursor [Bos taurus]
gi|75057630|sp|Q58D55.1|BGAL_BOVIN RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; Flags: Precursor
gi|61554628|gb|AAX46589.1| galactosidase, beta 1 [Bos taurus]
gi|148839051|dbj|BAF64285.1| galactosidase, beta 1 [Bos taurus]
Length = 653
Score = 158 bits (399), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 110/335 (32%), Positives = 160/335 (47%), Gaps = 33/335 (9%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
+ F + Y + DG IS IHY R W D + K K G + I+TYV WN H
Sbjct: 29 RTFQIDYRRNRFLKDGQPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQTYVAWNFH 88
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E G+YNF G +D+ F++L GL + LR GPY+CAEW+ GG P WL + I R+
Sbjct: 89 ELQPGRYNFSGDHDVEHFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKKSIVLRS 148
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKW- 221
++ + + +++ ++ MR L GGPII +Q+ENEYG SY DY+++
Sbjct: 149 SDPDYLAAVDKWLGVLLPKMRP--LLYKNGGPIITVQVENEYG----SYLSCDYDYLRFL 202
Query: 222 AASMALGLGAGVPWVMCKQTDAPENIIDACNG----YYCDGYKPNSY------------- 264
LG V+ TD + C Y + P +
Sbjct: 203 QKRFHDHLGED---VLLFTTDGVNERLLQCGALQGLYATVDFSPGTNLTAAFMLQRKFEP 259
Query: 265 NKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG 324
P + +E + GW WG R + +AF + G + +N YM+ GGTNF +G
Sbjct: 260 TGPLVNSEFYTGWLDHWGQRHSTVSSKAVAFTLHDMLALGAN-VNMYMFIGGTNFAYWNG 318
Query: 325 G--PF--YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
P+ TSYDYDAP+ E G L+E K+ L+D+
Sbjct: 319 ANIPYQPQPTSYDYDAPLSEAGDLTE-KYFALRDI 352
>gi|313237463|emb|CBY12650.1| unnamed protein product [Oikopleura dioica]
Length = 583
Score = 158 bits (399), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 100/327 (30%), Positives = 155/327 (47%), Gaps = 35/327 (10%)
Query: 47 VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIR 106
++ D +DG ++S IHY R + W + + G + I+ Y+ WN HE R
Sbjct: 8 LTADGDTFKLDGKDFRILSGAIHYFRIPKQSWKHRLQSVVDCGLNTIDVYIPWNLHEKER 67
Query: 107 GQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAP 166
G ++F G+ D+V+F + GL + R GPY+C+EW++GG P WL P + R+N
Sbjct: 68 GNFDFAGELDLVEFFTIAAEMGLKVLCRPGPYICSEWDWGGLPSWLLKDPKMHIRSNYCG 127
Query: 167 FKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA 226
++ + + K++ L+ L GGPII Q+ENEYG+ Y + +++ W A +
Sbjct: 128 YQAAVSSYFSKLLPLLAP--LQHSNGGPIIAFQVENEYGD----YVDKDNEHLPWLADLM 181
Query: 227 LGLGAGVPWVMCK--QTDAPENIIDA--------------CNGYYCDGYKPNSYNKPTLW 270
G + + T N++ + +P NKP L
Sbjct: 182 KSHGLFELFFISDGGHTIRKANMLKVRSTAQLNSGSFQLLAKAFSLKSLQP---NKPMLV 238
Query: 271 TENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG----- 325
TE W GW+ WG E + +RG S +N+YM+ GGTNFG +G
Sbjct: 239 TEFWAGWFDYWGHGRNLLNNEVFEKTLKEILKRGAS-VNFYMFHGGTNFGFMNGAIELEK 297
Query: 326 PFY---ITSYDYDAPIDEYGLLSEPKW 349
+Y +TSYDYD P+DE G +E KW
Sbjct: 298 GYYTADVTSYDYDCPVDESGNRTE-KW 323
>gi|193690496|ref|XP_001952133.1| PREDICTED: beta-galactosidase-like [Acyrthosiphon pisum]
Length = 635
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 107/337 (31%), Positives = 165/337 (48%), Gaps = 26/337 (7%)
Query: 45 FNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHES 104
F + Y++ + DG +S +HY R W D I K K G + I TYV W+ HE
Sbjct: 25 FTIDYENNEFLKDGKVFRYVSGSLHYFRIPQLYWKDRIQKMKAAGLNTITTYVEWSLHEP 84
Query: 105 IRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDI-PGIEFRTN 163
G Y+F+G D+ F++L+ + +YL LR GPY+CAE +FGGFP WL ++ P RTN
Sbjct: 85 FPGVYDFEGIADLEYFIELIKNENMYLILRPGPYICAERDFGGFPYWLLNVTPKRSLRTN 144
Query: 164 NAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNM---ESSYGQQGKD--- 217
N+ +K+ + ++ ++ +++ + GG II++Q+ENEYG+ +S Y +D
Sbjct: 145 NSSYKKYVSKWFSVLMPIIQPHLY--GNGGNIILVQVENEYGSYYACDSEYKLWIRDLFR 202
Query: 218 -YVKWAASMALGLGAGVPWVMCKQTDAPENIID---ACNGYYC-DGYKPNSYNKPTLWTE 272
YV+ A + G G + C +D + N C D + P + +E
Sbjct: 203 SYVENKAVLFTIDGCGQSYFDCGVIPEVYATVDFGISSNASQCFDFMRKVQKGGPLVNSE 262
Query: 273 NWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----- 327
+ GW T W D+ + SF ++YM+ GGTNFG TSG
Sbjct: 263 FYPGWLTHWQESESIVNTTDVVKQMKVMLAMNASF-SFYMFHGGTNFGFTSGANTNDTKE 321
Query: 328 ------YITSYDYDAPIDEYGLLSEPKWGHLKDLHAA 358
+TSYDY+AP+DE G +E + + L A
Sbjct: 322 SIGYLPQLTSYDYNAPLDEAGDPTEKYFKIKQTLEEA 358
>gi|83415088|ref|NP_001032730.1| beta-galactosidase precursor [Canis lupus familiaris]
gi|94730362|sp|Q9TRY9.3|BGAL_CANFA RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; Flags: Precursor
gi|76470548|gb|ABA43388.1| lysosomal beta-galactosidase [Canis lupus familiaris]
Length = 668
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 109/325 (33%), Positives = 156/325 (48%), Gaps = 30/325 (9%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
+ F + Y H + DG IS IHY R W D + K K G + I+TYV WN H
Sbjct: 31 RTFTIDYSHNRFLKDGQPFRYISGSIHYSRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFH 90
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E GQY F G+ D+ F+KL GL + LR GPY+CAEW+ GG P WL I R+
Sbjct: 91 EPQPGQYQFSGEQDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRS 150
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWA 222
++ + + +++ ++ M+ L GGPII +Q+ENEYG SY DY+++
Sbjct: 151 SDPDYLAAVDKWLGVLLPKMKP--LLYQNGGPIITMQVENEYG----SYFTCDYDYLRFL 204
Query: 223 ASM---ALGL--------GAGVPWVMCKQTDAPENIIDACNGYYCDG----YKPNSYNKP 267
+ LG GA ++ C +D G + + P
Sbjct: 205 QKLFHHHLGNDVLLFTTDGANEKFLQCGALQGLYATVDFGPGANITAAFQIQRKSEPKGP 264
Query: 268 TLWTENWDGWYTTWGGRLPHRPV--EDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG 325
+ +E + GW WG PH V E +A ++ G + +N YM+ GGTNF +G
Sbjct: 265 LVNSEFYTGWLDHWGQ--PHSTVRTEVVASSLHDILAHGAN-VNLYMFIGGTNFAYWNGA 321
Query: 326 --PFYI--TSYDYDAPIDEYGLLSE 346
P+ TSYDYDAP+ E G L+E
Sbjct: 322 NMPYQAQPTSYDYDAPLSEAGDLTE 346
>gi|433679946|ref|ZP_20511609.1| beta-galactosidase [Xanthomonas translucens pv. translucens DSM
18974]
gi|430814938|emb|CCP42238.1| beta-galactosidase [Xanthomonas translucens pv. translucens DSM
18974]
Length = 615
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 105/335 (31%), Positives = 159/335 (47%), Gaps = 41/335 (12%)
Query: 57 DGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKND 116
DG +IS IH+ R W D + K++ G + +ETYVFWN E +GQ++F G ND
Sbjct: 43 DGKPYQIISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVEPRQGQFDFSGNND 102
Query: 117 IVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVK 176
+ F+ + GL + LR GPYVCAEW GG+P WL PG+ R+ + F Q ++
Sbjct: 103 LAAFIDAAAAQGLNVILRPGPYVCAEWEAGGYPAWLFAQPGLRVRSQDPRFLAASQAYLD 162
Query: 177 KIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWV 236
+ ++ ++ + GGP+I +Q+ENEYG+ + D+V A+ + + AG
Sbjct: 163 AVAAQVKPKL--NRNGGPVIAVQVENEYGSYDD-------DHVYMQANRTMFVKAGFDKA 213
Query: 237 MCKQTDAPENIIDACNGYYCD-----GYKPNSYNK------------PTLWTENWDGWYT 279
+ D + + NG D + P K P + E W GW+
Sbjct: 214 LLFTADGADVL---ANGTLPDTLAVVNFGPGDAEKAFQTLSKFRPGQPQMVGEYWAGWFD 270
Query: 280 TWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF----------YI 329
WG + + + A ++G S N YM+ GGT+FG +G F
Sbjct: 271 QWGDKHANTDAKKQASEFEWILRQGHS-ANIYMFVGGTSFGFMNGANFQKNASDHYAPQT 329
Query: 330 TSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEP 364
TSYDYDA +DE G + PK+ +D A I +P
Sbjct: 330 TSYDYDAVLDEAGRPT-PKFALFRDAIARITGVQP 363
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 44/169 (26%), Positives = 67/169 (39%), Gaps = 19/169 (11%)
Query: 532 TNEVRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQN 591
T + ++ + +RD RV+++ L GS +V V+ +G + + +L + G N
Sbjct: 420 TGPRKGSLYLGDVRDYARVYVDRSLAGSAERRLQQVAVDVDIPAGPHTVDVLVENGGRIN 479
Query: 592 YGAFLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLT 651
YG L AG V L G K L +Q F WT
Sbjct: 480 YGTHLPDGRAGLVDPVLLNG---------KPLTGWQT-----FSLPMDDPSKLTGWTTAK 525
Query: 652 RDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+G F P LD+ + GKG AW NGH++GR+W +
Sbjct: 526 VEG--PAFHRGTVKIATPTD---TFLDMQAFGKGVAWANGHNLGRHWNI 569
>gi|229553373|ref|ZP_04442098.1| beta-galactosidase [Lactobacillus rhamnosus LMS2-1]
gi|229313254|gb|EEN79227.1| beta-galactosidase [Lactobacillus rhamnosus LMS2-1]
Length = 583
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 110/351 (31%), Positives = 169/351 (48%), Gaps = 50/351 (14%)
Query: 55 IIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGK 114
++DG ++S IHY R P W + K G + +ETYV WN HE G+++F G
Sbjct: 1 MLDGKPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYREGEFDFSGI 60
Query: 115 NDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRF 174
DI +F+K GLY +R PY+CAEW FGGFP WL + RT++ + + R+
Sbjct: 61 LDIERFLKTAEDLGLYAIVRPSPYICAEWEFGGFPAWLL-TKKMRLRTDDPAYLAAIDRY 119
Query: 175 VKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
++ + + + GG +IM+Q+ENEYG SYG+ +DY+ A + G VP
Sbjct: 120 YTALMPHLVDHQVT--HGGNVIMMQVENEYG----SYGED-QDYLAAVAKLMQQHGVDVP 172
Query: 235 WVMCKQTDAP-------ENIIDA---CNGYYCDG-----------YKPNSYNKPTLWTEN 273
+D P ++IDA G + ++ + + P + E
Sbjct: 173 LFT---SDGPWPATLNAGSMIDAGILATGNFGSAADKNFDRLAAFHQEHGRDWPLMCMEF 229
Query: 274 WDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY----- 328
WDGW+ WG + R ++ A + +RG +N YM+ GGTNFG +G
Sbjct: 230 WDGWFNRWGEPIIRRDPDETAEDLRAVIKRGS--VNLYMFHGGTNFGFMNGTSARKDHDL 287
Query: 329 --ITSYDYDAPIDEYGLLSEPKW--------GHLKDLHAAIKLCEPALVAA 369
+TSYDYDAP++E G + PK+ L ++ A L +P + A
Sbjct: 288 PQVTSYDYDAPLNEQGNPT-PKYFAIQKMIHEELPEVQQAKPLVKPTMAPA 337
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 52/182 (28%), Positives = 73/182 (40%), Gaps = 47/182 (25%)
Query: 575 SGYNDLILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGFKNG-DIDLSKILWTYQVGLKGE 633
G++ L LL + + NYG+ +E + G + G +DL I KG
Sbjct: 431 EGHHQLDLLVENMSRVNYGSKIE-------AITQFKGIRTGVMVDLHFI--------KG- 474
Query: 634 FQQIYSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHH 693
+QQ Y ++ N A T P+T +YK FD D LD GKG VNG +
Sbjct: 475 YQQ-YPLDLNRASRLTFTEGWQPATPAFYKYTFDLTAPQD-TYLDCHGFGKGVMLVNGVN 532
Query: 694 IGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEE 753
+GR+W KG PT + Y VP L A N +++FE
Sbjct: 533 VGRFWE----KG-----------------------PTLSLY-VPAGLLHAGKNDVIVFET 564
Query: 754 TG 755
G
Sbjct: 565 EG 566
>gi|440904150|gb|ELR54700.1| Beta-galactosidase, partial [Bos grunniens mutus]
Length = 659
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 110/335 (32%), Positives = 160/335 (47%), Gaps = 33/335 (9%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
+ F + Y + DG IS IHY R W D + K K G + I+TYV WN H
Sbjct: 35 RTFQIDYRRNRFLKDGQPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQTYVAWNFH 94
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E G+YNF G +D+ F++L GL + LR GPY+CAEW+ GG P WL + I R+
Sbjct: 95 ELQPGRYNFSGDHDVEHFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKKSIVLRS 154
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKW- 221
++ + + +++ ++ MR L GGPII +Q+ENEYG SY DY+++
Sbjct: 155 SDPDYLAAVDKWLGVLLPKMRP--LLYKNGGPIITVQVENEYG----SYLSCDYDYLRFL 208
Query: 222 AASMALGLGAGVPWVMCKQTDAPENIIDACNG----YYCDGYKPNSY------------- 264
LG V+ TD + C Y + P +
Sbjct: 209 QKRFHDHLGED---VLLFTTDGVNERLLQCGALQGLYATVDFSPGTNLTAAFMLQRKFEP 265
Query: 265 NKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG 324
P + +E + GW WG R + +AF + G + +N YM+ GGTNF +G
Sbjct: 266 TGPLVNSEFYTGWLDHWGQRHSTVSSKAVAFTLHDMLALGAN-VNMYMFIGGTNFAYWNG 324
Query: 325 G--PF--YITSYDYDAPIDEYGLLSEPKWGHLKDL 355
P+ TSYDYDAP+ E G L+E K+ L+D+
Sbjct: 325 ANIPYQPQPTSYDYDAPLSEAGDLTE-KYFALRDI 358
>gi|423071595|ref|ZP_17060369.1| hypothetical protein HMPREF9177_01686 [Streptococcus intermedius
F0413]
gi|355364069|gb|EHG11804.1| hypothetical protein HMPREF9177_01686 [Streptococcus intermedius
F0413]
Length = 609
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 103/305 (33%), Positives = 156/305 (51%), Gaps = 35/305 (11%)
Query: 63 LISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVK 122
++S IHY R P+ W + K G + +ETY+ WNAHE ++GQ++F+G D+ KF++
Sbjct: 33 ILSGAIHYFRIQPDDWYHSLYNLKALGFNTVETYIPWNAHEPMKGQFDFEGILDVEKFLQ 92
Query: 123 LVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLM 182
GLY+ LR PY+CAEW FGG P WL + + R+++ + + + +++ +
Sbjct: 93 TAQDLGLYVLLRSSPYICAEWEFGGLPAWLLE-ENMRIRSSDPAYLAAVANYYDELLPRL 151
Query: 183 REEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMC---- 238
+L + GG I+M+Q+ENEYG SYG+ K+Y++ M L G P
Sbjct: 152 VPHLLGN--GGNILMMQVENEYG----SYGED-KEYLRAVRDMMLERGVTCPLFTSDGPW 204
Query: 239 KQTDAPENIIDA---CNGYYCDGYKPN---------SYNK--PTLWTENWDGWYTTWGGR 284
+ T +I+ G + K N Y K P + E WDGW+ W
Sbjct: 205 RGTLRAGTLIEDDVFVTGNFGSKAKENFAQMQEFFDEYGKKWPLICMEFWDGWFNRWKEP 264
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY-------ITSYDYDAP 337
+ R E+LA AV Q+G +N YM+ GGTNFG +G +TSYDY+A
Sbjct: 265 VITRDPEELATAVHEVLQQGS--INLYMFHGGTNFGFMNGCSARGNIDLPQVTSYDYEAL 322
Query: 338 IDEYG 342
+DE G
Sbjct: 323 LDEQG 327
>gi|164519026|ref|NP_001073876.2| beta-galactosidase-1-like protein 3 [Homo sapiens]
gi|269849685|sp|Q8NCI6.3|GLBL3_HUMAN RecName: Full=Beta-galactosidase-1-like protein 3
Length = 653
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 103/321 (32%), Positives = 160/321 (49%), Gaps = 27/321 (8%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G++ ++ IHY R E W D + K K G + + TYV WN HE RG+++F G
Sbjct: 82 LEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSGNL 141
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ FV + GL++ LR G Y+C+E + GG P WL P + RT N F E ++++
Sbjct: 142 DLEAFVLMAAEIGLWVILRPGRYICSEMDLGGLPSWLLQDPRLLLRTTNKSFIEAVEKYF 201
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGA---- 231
++ R L Q GP+I +Q+ENEYG+ + K Y+ + L G
Sbjct: 202 DHLIP--RVIPLQYRQAGPVIAVQVENEYGSF-----NKDKTYMPYLHKALLRRGIVELL 254
Query: 232 ----GVPWVMCKQTD---APENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGR 284
G V+ T A N+ + +K +KP L E W GW+ WG +
Sbjct: 255 LTSDGEKHVLSGHTKGVLAAINLQKLHQDTFNQLHKVQR-DKPLLIMEYWVGWFDRWGDK 313
Query: 285 LPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAPI 338
+ +++ AV+ F + SF N YM+ GGTNFG +G ++ +TSYDYDA +
Sbjct: 314 HHVKDAKEVEHAVSEFIKYEISF-NVYMFHGGTNFGFMNGATYFGKHSGIVTSYDYDAVL 372
Query: 339 DEYGLLSEPKWGHLKDLHAAI 359
E G +E K+ L+ L ++
Sbjct: 373 TEAGDYTE-KYLKLQKLFQSV 392
>gi|348508362|ref|XP_003441723.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Oreochromis
niloticus]
Length = 605
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 101/324 (31%), Positives = 154/324 (47%), Gaps = 25/324 (7%)
Query: 50 DHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQY 109
D ++G ++ +HY R W D + K K G + + TYV WN HE RG +
Sbjct: 10 DSSQFTLEGKPFRILGGSVHYFRVPRAYWEDRLLKMKACGLNTLTTYVPWNLHEPERGTF 69
Query: 110 NFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKE 169
NF+ + D+ +V L GL++ LR GPY+CAEW+ GG P WL ++ RT F
Sbjct: 70 NFQDQLDLKAYVSLAAQLGLWVILRPGPYICAEWDLGGLPSWLLQDEEMQLRTTYPGFVN 129
Query: 170 EMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGN----------MESSYGQQGKDYV 219
+ + K++ +++ M GGPII +Q+ENEYG+ +++ +G +
Sbjct: 130 AVNLYFDKLISVIKPLMFEG--GGPIIAVQVENEYGSFAKDDKYMPFIKNCLQSRGIKEL 187
Query: 220 KWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYT 279
+ GL G K + A + +P KP + E W GW+
Sbjct: 188 LMTSDNWEGLRCGGVEGALKTVNLQRLSFGAIQ--HLADIQP---QKPLMVMEYWSGWFD 242
Query: 280 TWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYD 333
WG ED+ V+ RG S +N YM+ GGT FG +G + +TSYD
Sbjct: 243 VWGEHHHVFYAEDMLAVVSEILDRGVS-INLYMFHGGTTFGFMNGAMDFGTYKSQVTSYD 301
Query: 334 YDAPIDEYGLLSEPKWGHLKDLHA 357
YDAP+ E G + PK+ HL++L +
Sbjct: 302 YDAPLSEAGDCT-PKYHHLRNLFS 324
Score = 46.6 bits (109), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 52/213 (24%), Positives = 84/213 (39%), Gaps = 43/213 (20%)
Query: 542 DSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGA 601
+++RD VF+N + G + +V P G L L + G NYG L++
Sbjct: 405 NNIRDRALVFVNRECVGCLDYKTHEVAIPD--GKGERTLSFLVENCGRVNYGKALDEQRK 462
Query: 602 GFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEW-TDLTRDGIPSTFT 660
G G + L +S + +K F + + N +W TD +P F
Sbjct: 463 GIVGDIVLNNTPLRGFSISCL------DMKPSFIKRLT---NSGQWKTDFKSHCVPGFFQ 513
Query: 661 WYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSD 720
+ D P V+L S GKG +VNG ++GRYW + P
Sbjct: 514 -ARLCVDGPPKDTFVSLR--SWGKGVIFVNGQNLGRYW-FIGP----------------- 552
Query: 721 KCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEE 753
Q + ++P WL++ N +++FEE
Sbjct: 553 ----------QHFLYLPAPWLRSGENEIIVFEE 575
>gi|387878583|ref|YP_006308886.1| Beta-galactosidase 3 [Streptococcus parasanguinis FW213]
gi|386792040|gb|AFJ25075.1| Beta-galactosidase 3 [Streptococcus parasanguinis FW213]
Length = 595
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 100/307 (32%), Positives = 157/307 (51%), Gaps = 25/307 (8%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
+ G ++S IHY R P W + K G + +ETYV WN HE +GQ++F G+
Sbjct: 12 LKGQPFKILSGAIHYFRIDPADWYHSLFNLKALGFNTVETYVPWNVHEPRKGQFDFSGRL 71
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ +F+++ S GLY+ +R P++CAEW FGG P WL + + R+++ F E + R+
Sbjct: 72 DLERFIQIAQSLGLYMIVRPSPFICAEWEFGGLPAWLLE-EDMRIRSSDPAFIEAVDRYY 130
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNM--ESSYGQQGKDYVKWAASMALGLGAGV 233
++ L+ + QGGPI+M+Q+ENEYG+ + Y + +D +K +
Sbjct: 131 DHLLGLLTRYQVD--QGGPILMMQVENEYGSYGEDKVYLRAIRDLMKKKGVTCPLFTSDG 188
Query: 234 PW--VMCKQTDAPENIIDACN----GYYCDGYKPNSYNK-----PTLWTENWDGWYTTWG 282
PW + T +++ N Y G +++ P + E WDGW+T W
Sbjct: 189 PWRATLRAGTLIEDDLFVTGNFGSKAAYNFGQMQEFFDEYGKKWPLMCMEFWDGWFTRWK 248
Query: 283 GRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSG----GPF---YITSYDYD 335
+ R E+LA AV + G +N YM+ GGTNFG +G G +TSYDY
Sbjct: 249 EPVIQREPEELAEAVHEVLELGS--INLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYG 306
Query: 336 APIDEYG 342
A ++E G
Sbjct: 307 ALLNEQG 313
Score = 42.4 bits (98), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 30/118 (25%), Positives = 49/118 (41%), Gaps = 29/118 (24%)
Query: 638 YSIEENEAEWTDLTRDGIPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRY 697
Y ++ + D +++ +Y+ F +D LD+ GKG A+VNGH++GR+
Sbjct: 485 YPLDLQDLSQLDFSKEWQAGAPAFYRYDFQLDQTLD-TYLDMTGFGKGVAFVNGHNLGRF 543
Query: 698 WTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETG 755
W V PT + Y VP +L+ N L++FE G
Sbjct: 544 WEV---------------------------GPTTSLY-VPHGFLKEGANSLIVFETEG 573
>gi|336415312|ref|ZP_08595652.1| hypothetical protein HMPREF1017_02760 [Bacteroides ovatus
3_8_47FAA]
gi|335940908|gb|EGN02770.1| hypothetical protein HMPREF1017_02760 [Bacteroides ovatus
3_8_47FAA]
Length = 778
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 114/353 (32%), Positives = 170/353 (48%), Gaps = 29/353 (8%)
Query: 21 MMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPD 80
++ ++++ + S++ A T + F + ++DG ++ +A +HY R W
Sbjct: 5 LIALLVLFTVIFFSTAQAQTTARKFEAGKN--TFLLDGKPFVVKAAELHYTRIPQAYWEH 62
Query: 81 LIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVC 140
I K G + I Y+FWN HE G+++F G+NDI F + G+Y+ +R GPYVC
Sbjct: 63 RIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGPYVC 122
Query: 141 AEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQI 200
AEW GG P WL I RT + + E + F+K++ + L +GG IIM+Q+
Sbjct: 123 AEWEMGGLPWWLLKKKDIALRTLDPYYMERVGIFMKEVGKQLAP--LQVNKGGNIIMVQV 180
Query: 201 ENEYGNMESSYGQQGKDYVKWAASMALGLG-AGVPWVMCK-----QTDAPENIIDACN-- 252
ENEYG SYG K YV + G + VP C +A +++I N
Sbjct: 181 ENEYG----SYGID-KPYVSAVRDLVRESGFSDVPLFQCDWSSNFTNNALDDLIWTVNFG 235
Query: 253 -GYYCD----GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSF 307
G D K P + +E W GW+ WG + R +D+ + R SF
Sbjct: 236 TGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRLAKDMVQGIKDMLDRNISF 295
Query: 308 MNYYMYFGGTNFGRTSGG--PFY---ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
+ YM GGT FG G P Y +SYDYDAPI E G ++ K+ L+DL
Sbjct: 296 -SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEPGWTTD-KFFLLRDL 346
Score = 44.7 bits (104), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 31/126 (24%), Positives = 53/126 (42%), Gaps = 30/126 (23%)
Query: 655 IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYR 714
+P+ +YK+ F D + LD+ + GKG WVNGH +GR+W +
Sbjct: 526 LPTMPAYYKSTFTL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------- 570
Query: 715 GAYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEIS-VKLRSTRIVCE 773
P QT + +P WL+ N +++ + G I +K ++ E
Sbjct: 571 -------------GPQQTLF-MPGCWLKEGENEILVLDLKGPTRASIKGLKKPILDVLRE 616
Query: 774 QVSESH 779
+ E+H
Sbjct: 617 KAPETH 622
>gi|188990653|ref|YP_001902663.1| beta-galactosidase [Xanthomonas campestris pv. campestris str.
B100]
gi|167732413|emb|CAP50607.1| exported beta-galactosidase [Xanthomonas campestris pv. campestris]
Length = 680
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 107/366 (29%), Positives = 171/366 (46%), Gaps = 55/366 (15%)
Query: 22 MMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDL 81
+++ + I L +++++ + F + DG ++S IH+ R W D
Sbjct: 76 LVLALAIALPITATAASDDQWPTFATQGTQ--FVRDGKPYQVLSGAIHFQRIPRAYWKDR 133
Query: 82 IAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCA 141
+ K++ G + +ETYVFWN E +GQ++F ND+ FV+ + GL + LR GPY CA
Sbjct: 134 LQKARALGLNTVETYVFWNLVEPQQGQFDFNANNDVAAFVREAAAQGLNVILRPGPYACA 193
Query: 142 EWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIE 201
EW GG+P WL I R+ + F Q ++ + + L + GGPII +Q+E
Sbjct: 194 EWETGGYPAWLFGKDNIRVRSRDPRFLAASQAYLDAVSKQVHP--LLNHNGGPIIAVQVE 251
Query: 202 NEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCD---- 257
NEYG+ + D+ A + A+ + AG + +D + + NG D
Sbjct: 252 NEYGSYDD-------DHAYMADNRAMYVKAGFDDALLFTSDGADML---ANGTLPDTLAV 301
Query: 258 -----GYKPNSYNK--------PTLWTENWDGWYTTWGGRLPH------RPVEDLAFAVA 298
G ++++K P + E W GW+ WG PH + E+L + +
Sbjct: 302 VNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGK--PHASTDAKQQTEELEWIL- 358
Query: 299 RFFQRGGSFMNYYMYFGGTNFGRTSGGPF----------YITSYDYDAPIDEYGLLSEPK 348
R G N YM+ GGT+FG +G F TSYDYDA +DE G + PK
Sbjct: 359 ----RQGHSANLYMFIGGTSFGFMNGANFQGNPSDHYAPQTTSYDYDAILDEAG-RATPK 413
Query: 349 WGHLKD 354
+ ++D
Sbjct: 414 FALMRD 419
>gi|187736173|ref|YP_001878285.1| beta-galactosidase [Akkermansia muciniphila ATCC BAA-835]
gi|187426225|gb|ACD05504.1| Beta-galactosidase [Akkermansia muciniphila ATCC BAA-835]
Length = 780
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 107/321 (33%), Positives = 160/321 (49%), Gaps = 34/321 (10%)
Query: 48 SYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRG 107
S + ++DG +IS +HYPR + W D + K G + + TY+FWN HE G
Sbjct: 35 STNQENFLMDGKPVKIISGEMHYPRVPRQHWKDRFQRIKAMGMNTVCTYLFWNVHEPEPG 94
Query: 108 QYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPF 167
+++F G D V+F+K +GL++ +R GPYVCAEW FGGFP WL ++ R+ + F
Sbjct: 95 KWDFSGNLDFVEFIKEAQKAGLWVIVRPGPYVCAEWEFGGFPGWLLKDEDLKVRSQDPRF 154
Query: 168 KEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMAL 227
E ++KK+ ++ E L +GGPIIM Q+ENEYG SYG KDYVK +
Sbjct: 155 LEPAMAYLKKVCSML--EPLQITKGGPIIMAQVENEYG----SYGSD-KDYVKKHLDV-- 205
Query: 228 GLGAGVPWVMCKQTDAPEN----------IIDACN------GYYCDGYKPNSYNKPTLWT 271
+ +P V+ +D P + ++ A N G + + K + P +
Sbjct: 206 -IRKELPGVVPFTSDGPNDWMIKNGTLPGVVPAMNFGGGAKGAFANLEK-HKGKTPRING 263
Query: 272 ENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY--- 328
E W GW+ WG E + + S N +M GGT+FG +G +
Sbjct: 264 EFWVGWFDHWGKPKNGGSTEGFNRDLKWMLENNVS-PNLFMAHGGTSFGFMNGANWEGAY 322
Query: 329 ---ITSYDYDAPIDEYGLLSE 346
+T+YDY API E G L++
Sbjct: 323 TPDVTNYDYGAPISENGTLTD 343
Score = 43.5 bits (101), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 72/166 (43%), Gaps = 17/166 (10%)
Query: 535 VRPTVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGA 594
V+ + +++M+D V+++G+ G+ + + + SG + + + + +G N+G
Sbjct: 421 VKGELKMNNMQDRAIVYVDGKRQGAADRRYKQDSCDIVIPSGLHTVDIFVENMGRINFGG 480
Query: 595 FLEKDGAGFRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDG 654
++ + G RG + L G K L L Y KG +S ++ + R G
Sbjct: 481 QIQGERKGIRGPITLDGKK-----LENFL-IYNFPCKGVELIPFSGKKPAGDQPVFHR-G 533
Query: 655 IPSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTV 700
+ TY D DG KG WVNG ++GR+W +
Sbjct: 534 YFNVSNPKDTYLDMRDGWK----------KGVVWVNGRNLGRFWFI 569
>gi|294633777|ref|ZP_06712335.1| beta-galactosidase [Streptomyces sp. e14]
gi|292830419|gb|EFF88770.1| beta-galactosidase [Streptomyces sp. e14]
Length = 591
Score = 157 bits (397), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 106/318 (33%), Positives = 148/318 (46%), Gaps = 37/318 (11%)
Query: 53 AIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFK 112
A + DG ++SA IHY R P++W D + + + G + +ETY+ WN HE G++ F
Sbjct: 12 AFLRDGEPHQIVSAAIHYFRVHPDLWADRLIRLRAMGVNTVETYIAWNFHEPRPGEFLFD 71
Query: 113 GKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQ 172
G DIVKF++ G GL + +R GPY+CAEW+ GG P WL G R + +
Sbjct: 72 GDRDIVKFIRTAGDLGLDVIVRPGPYICAEWDLGGLPSWLLADRGARLRRREPAYLAAVD 131
Query: 173 RFVKKIVDLMREEMLFSWQGGPIIMLQIENEYG----------NMESSYGQQGKD---YV 219
+ V R L + +GGP++ + IENEYG ++ ++G D +
Sbjct: 132 AWFD--VLFPRLIPLLASRGGPVVAMSIENEYGSFGTDTDYLEHLRKGMIERGADCLLFT 189
Query: 220 KWAASMALGLGAGVPWVMCKQT--DAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGW 277
A LG +P V+ T PE + + G P E W GW
Sbjct: 190 SDGAGDGFLLGGSIPGVLAAGTFGSRPEQSLATLRAHQPTG--------PLFCVEYWHGW 241
Query: 278 YTTWGGRLPH--RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-------Y 328
+ WG PH R D A + R G S +N YM GGTNFG SG
Sbjct: 242 FDHWGE--PHHVRDAADAADTLDRLLAAGAS-VNIYMGHGGTNFGWWSGANHDGLHHQPD 298
Query: 329 ITSYDYDAPIDEYGLLSE 346
+TSYDY AP+ E G L+E
Sbjct: 299 VTSYDYGAPVGEAGELTE 316
>gi|332672111|ref|YP_004455119.1| beta-galactosidase [Cellulomonas fimi ATCC 484]
gi|332341149|gb|AEE47732.1| Beta-galactosidase [Cellulomonas fimi ATCC 484]
Length = 583
Score = 157 bits (397), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 101/317 (31%), Positives = 155/317 (48%), Gaps = 39/317 (12%)
Query: 57 DGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKND 116
DG ++S +HY R P+ W D + +++E G + IETY+ WNAH RG++ G D
Sbjct: 14 DGTPVRILSGALHYFRHHPDQWRDRLTRARELGLNTIETYIPWNAHSPARGEFRTDGILD 73
Query: 117 IVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVK 176
+ +F+ V + G++ +R GPY+CAEW GG P WL G R + + +Q + +
Sbjct: 74 LGRFLDEVAAQGMWAIVRPGPYICAEWTGGGLPGWLF-TAGAAVRRHEPTYLAAIQDYYE 132
Query: 177 KIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWV 236
+ ++ + +GGP++++Q+ENEYG +YG KDY++ + G P
Sbjct: 133 AVAGIVAPRQVD--RGGPVVLVQVENEYG----AYGDD-KDYLRALVKLLRESGITTPLT 185
Query: 237 MCKQTDAPENIIDACNGYYCDGYKPNSYNK----------------PTLWTENWDGWYTT 280
Q PE + NG + +K S+ P + E WDGW+ +
Sbjct: 186 TIDQ---PEPWM-LENGSLPELHKTGSFGSRAAERLATLREHQPTGPLMCAEFWDGWFDS 241
Query: 281 WGGRLPHRPVEDLAFAVARF--FQRGGSFMNYYMYFGGTNFGRTSG----GPFY--ITSY 332
WG H D A + G+ +N YM GGTNFG T+G G + +TSY
Sbjct: 242 WG---LHHHTTDAAASAHELDTLLAAGASVNLYMVCGGTNFGFTNGANDKGTYVPIVTSY 298
Query: 333 DYDAPIDEYGLLSEPKW 349
DYDAP+DE G + W
Sbjct: 299 DYDAPLDEAGRPTAKYW 315
>gi|148231352|ref|NP_001080304.1| galactosidase, beta 1-like 2 [Xenopus laevis]
gi|28422231|gb|AAH46858.1| Loc89944-prov protein [Xenopus laevis]
Length = 634
Score = 157 bits (397), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 106/360 (29%), Positives = 175/360 (48%), Gaps = 32/360 (8%)
Query: 25 MMMIHLSCVSSSSASTFFKPFN-VSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIA 83
+M LS S ++ P + + + +++G ++ +HY R W D +
Sbjct: 21 FLMYRLSLPSHQNSFMMLSPNSGLLAEDSHFLLNGIPYRILGGSMHYFRVPMPYWRDRMK 80
Query: 84 KSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEW 143
K K G + + TYV WN HE +G+++F DI +F+ + GL++ LR GPY+CAEW
Sbjct: 81 KMKACGINTLTTYVPWNLHEPRKGKFDFSKDLDISEFLAIASEMGLWVILRPGPYICAEW 140
Query: 144 NFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENE 203
+ GG P WL ++ RT F E + ++ +++ + + + GGPII +Q+ENE
Sbjct: 141 DLGGLPSWLLRDKDMKLRTTYRGFTEATEAYLDELIPRIAKYQYSN--GGPIIAVQVENE 198
Query: 204 YGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTD-----APENIIDACNGYYCDG 258
YG SY + +Y+++ + + G + D + EN++ N
Sbjct: 199 YG----SYAKDA-NYMEFIKNALVEKGIVELLLTSDNKDGLSSGSLENVLATVN---FQK 250
Query: 259 YKPNSY--------NKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNY 310
+P + NKP + E W GW+ WGG+ V+++ V+ RG S +N
Sbjct: 251 IEPVLFSYLNSIQSNKPVMVMEFWTGWFDYWGGKHHIFDVDEMISTVSEVLNRGAS-INL 309
Query: 311 YMYFGGTNFGRTSGGPFY------ITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEP 364
YM+ GGTNFG +G + ITSYDYDAP+ E G + K+ L++L +P
Sbjct: 310 YMFHGGTNFGFMNGALHFHEYRPDITSYDYDAPLTEAGDYTS-KYFKLRELFGDYNAEKP 368
Score = 46.2 bits (108), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 55/211 (26%), Positives = 83/211 (39%), Gaps = 42/211 (19%)
Query: 543 SMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLEKDGAG 602
++RD +VF + Q G+V + K + Y L +L + G NYG ++K G
Sbjct: 444 NVRDRAQVFASSQSFGTV--DYKKENLHIPEIPAYRKLAILVENCGRVNYGPMIDKQHKG 501
Query: 603 FRGQVKLTGFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIPSTFTWY 662
G V L +N + K TY + + F I SI NE W+DL+ TF Y
Sbjct: 502 LVGDVYL---RNKPLRNFK---TYSLEMNSTF--ISSI--NEVHWSDLSDCKTGPTF--Y 549
Query: 663 KTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKC 722
+ + L + KG +VN ++GRYW + G Q+T
Sbjct: 550 QGALNVVGSPTDTFLRMKGWKKGVVFVNSKNLGRYWDI-----GPQETL----------- 593
Query: 723 TTNCGNPTQTWYHVPRSWLQASNNLLVIFEE 753
+P WL N + +FEE
Sbjct: 594 ------------FIPGPWLWPGVNEITLFEE 612
>gi|115465145|ref|NP_001056172.1| Os05g0539400 [Oryza sativa Japonica Group]
gi|122168850|sp|Q0DGD7.1|BGAL8_ORYSJ RecName: Full=Beta-galactosidase 8; Short=Lactase 8; Flags:
Precursor
gi|113579723|dbj|BAF18086.1| Os05g0539400 [Oryza sativa Japonica Group]
gi|215696978|dbj|BAG90972.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218197179|gb|EEC79606.1| hypothetical protein OsI_20800 [Oryza sativa Indica Group]
gi|222632392|gb|EEE64524.1| hypothetical protein OsJ_19375 [Oryza sativa Japonica Group]
Length = 673
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 112/332 (33%), Positives = 160/332 (48%), Gaps = 49/332 (14%)
Query: 57 DGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKND 116
DG ++ +HY R PE W D + ++K G + I+TYV WN HE + FKG D
Sbjct: 44 DGAPFQIVGGDVHYFRIVPEYWKDRLLRAKALGLNTIQTYVPWNLHEPKPLSWEFKGFTD 103
Query: 117 IVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDI-PGIEFRTNNAPFKEEMQR-- 173
I +++L + + LR+GPY+C EW+ GGFP WL I P IE R++++ + + R
Sbjct: 104 IESYLRLAHELDMLVMLRVGPYICGEWDLGGFPPWLLTIEPTIELRSSDSTYLSLVDRWW 163
Query: 174 --FVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMA---LG 228
+ KI L+ GGPIIM+QIENE+G S+G K+Y+ + +A LG
Sbjct: 164 GVLLPKIAPLLYS------NGGPIIMVQIENEFG----SFGDD-KNYLHYLVEVARRYLG 212
Query: 229 -------LGAGVPWVMCKQTDAPENIIDACNGYYCDGYKP-------NSYNKP----TLW 270
G + T +++ A + + G P YN P L
Sbjct: 213 NDIMLYTTDGGAIGNLKNGTILQDDVFAAVD--FDTGSNPWPIFQLQKEYNLPGKSAPLS 270
Query: 271 TENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG----- 325
+E + GW T WG R+ A A+ R R GS + YM GGTNFG +G
Sbjct: 271 SEFYTGWLTHWGERIATTDASSTAKALKRILCRNGSAV-LYMAHGGTNFGFYNGANTGQN 329
Query: 326 ----PFYITSYDYDAPIDEYGLLSEPKWGHLK 353
+TSYDYDAPI EYG + K+ L+
Sbjct: 330 ESDYKADLTSYDYDAPIREYGDVHNAKYKALR 361
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 51/211 (24%), Positives = 83/211 (39%), Gaps = 45/211 (21%)
Query: 580 LILLSQTVGLQNYGAFLEKDGAGFRGQVKLTGF-----------KNGDIDLSKILWTYQV 628
L +L + +G NYG ++ D G V++ G N +LSK+ Q+
Sbjct: 492 LYILVENMGRVNYGPYI-FDQKGILSSVEIDGIILRHWKMHPVSLNAVGNLSKLQLIMQM 550
Query: 629 -GLKGEFQQIYSIEENEAEWTDL-TRDGIPSTFTWYKTYF--DAPDGIDPVALDLGSMGK 684
+ IY EN+ + L +GI +Y+ +F D+ + K
Sbjct: 551 TDAEASKVSIYGDSENKLQDVSLYLNEGISEEPAFYEGHFHIDSESEKKDTFISFRGWNK 610
Query: 685 GQAWVNGHHIGRYWTVVAPKGGCQDTCDYRGAYNSDKCTTNCGNPTQTWYHVPRSWLQAS 744
G A+VN +IGR+W + P Q +VP L+
Sbjct: 611 GVAFVNNFNIGRFWPAIGP---------------------------QCALYVPAPILKPG 643
Query: 745 NNLLVIFEETGGNPFEISVKL-RSTRIVCEQ 774
+N++VIFE NP E+++KL + C Q
Sbjct: 644 DNVIVIFELHSPNP-ELTIKLVKDPDFTCGQ 673
>gi|297483826|ref|XP_002693891.1| PREDICTED: galactosidase, beta 1-like 3 [Bos taurus]
gi|296479482|tpg|DAA21597.1| TPA: galactosidase, beta 1-like [Bos taurus]
Length = 899
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 104/322 (32%), Positives = 162/322 (50%), Gaps = 29/322 (9%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G+ +++ +HY R W D + K + G + + TYV WN HE RG ++F G
Sbjct: 323 LEGHEFLILGGSVHYFRVPRASWRDRLLKLRACGFNTVTTYVPWNLHEPERGTFDFSGNL 382
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ F+ L GL++ LR GPY+C+E + GG P WL P + RT N F + ++
Sbjct: 383 DLEAFILLAEEVGLWVILRPGPYICSEMDLGGLPSWLLQDPTSQLRTTNRSFVNAVNKYF 442
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
++ R +L QGGPII +Q+ENEYG + + + Y+ + G G
Sbjct: 443 DHLIP--RVALLQYLQGGPIIAVQVENEYG-----FFYKDEAYMPYLLQALQQRGIGGLL 495
Query: 236 VMCKQTDAP-----ENIIDACN--GYYCDGYK---PNSYNKPTLWTENWDGWYTTWGGRL 285
+ T+ + ++ + N G+ D +K +KP L E W GW+ TWG +
Sbjct: 496 LTADSTEEVMRGHIKGVLASINMKGFKVDSFKHLYKLQRHKPILIMEFWVGWFDTWG--I 553
Query: 286 PHR--PVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAP 337
HR V ++ +V+ F + G SF N YM+ GGTNFG +G + TSYDYDA
Sbjct: 554 DHRVMGVNEVEKSVSEFIRYGISF-NVYMFHGGTNFGFMNGATSFEKHRGVTTSYDYDAV 612
Query: 338 IDEYGLLSEPKWGHLKDLHAAI 359
+ E G + K+ L+ L +I
Sbjct: 613 LTEAGDYTA-KYFMLRSLFESI 633
>gi|62859689|ref|NP_001015958.1| galactosidase, beta 1-like precursor [Xenopus (Silurana)
tropicalis]
gi|89271933|emb|CAJ82193.1| galactosidase, beta 1 [Xenopus (Silurana) tropicalis]
Length = 648
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 107/327 (32%), Positives = 155/327 (47%), Gaps = 34/327 (10%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
+ F + ++H DG IS IHY R W D + K K G D I TYV WN H
Sbjct: 28 RTFEIDFEHNCFRKDGQPFRYISGSIHYSRVPQYYWKDRLLKMKMAGLDAIYTYVPWNFH 87
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E+ G YNF G +DI F+KL GL + LR GPY+CAEW+ GG P WL I R+
Sbjct: 88 ETKPGVYNFSGDHDIESFLKLANEIGLLVILRAGPYICAEWDMGGLPAWLLAKESIVLRS 147
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYG-------NMESSYGQQG 215
++ + + + ++ + M+ + + GGPII +Q+ENEYG N Q
Sbjct: 148 SDPDYLQAVDNWMGVFLPKMKPFLYHN--GGPIISVQVENEYGSYFTCDYNYLRHLLQLF 205
Query: 216 KDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNG----------YYCDGYKPNSYN 265
+ ++ + G+G+ +V C +D G YC+
Sbjct: 206 RHHLGDEVVLFTTDGSGLQYVRCGTIQGLYTTVDFGPGSNVTETFSVQRYCEP------K 259
Query: 266 KPTLWTENWDGWYTTWGGRLPHRPV--EDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTS 323
P + +E + GW WG PH V E + ++ G + +N YM+ GGTNFG +
Sbjct: 260 GPLVNSEFYTGWLDHWGE--PHSVVATEMVTKSLDEILAHGAN-VNMYMFIGGTNFGYWN 316
Query: 324 GG--PF--YITSYDYDAPIDEYGLLSE 346
G P+ TSYDYDAP+ E G L++
Sbjct: 317 GANTPYAPQPTSYDYDAPLSEAGDLTD 343
>gi|134096920|ref|YP_001102581.1| beta-galactosidase [Saccharopolyspora erythraea NRRL 2338]
gi|291006638|ref|ZP_06564611.1| beta-galactosidase [Saccharopolyspora erythraea NRRL 2338]
gi|133909543|emb|CAL99655.1| beta-galactosidase [Saccharopolyspora erythraea NRRL 2338]
Length = 594
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 107/326 (32%), Positives = 159/326 (48%), Gaps = 31/326 (9%)
Query: 43 KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAH 102
KP ++ ++DG +I+ +HY R P+ W + + + + G + ++TYV WN H
Sbjct: 13 KPAGLTVRGNEFLLDGEPFRIIAGEMHYFRTHPDQWRNRLDRMRALGLNSVDTYVAWNFH 72
Query: 103 ESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRT 162
E RG+ +F G D+V+FV+ +GL + +R GPY+CAEW+FGG P WL + R
Sbjct: 73 EPRRGEVDFTGWRDVVRFVETAAEAGLKVIIRPGPYICAEWDFGGLPAWLLESGNPPLRC 132
Query: 163 NNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWA 222
++ + E R+ ++ L R L + +GGP++ Q+ENEYG SYG +
Sbjct: 133 SDPAYTELTLRWFDEL--LPRLAPLQATRGGPVLAFQVENEYG----SYGNDQTHLEQLR 186
Query: 223 ASM-------ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKP----NSYN-KPTLW 270
A M L G M + + P+ + A + D P Y + LW
Sbjct: 187 AGMLERGIDSLLFCSNGPSDYMLRGGNLPDTL--ATVNFAGDPTAPFEALREYQPEGPLW 244
Query: 271 -TENWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPF-- 327
TE WDGW+ WG + A V R G S ++ YM GGTNFG +G +
Sbjct: 245 CTEFWDGWFDHWGEEHHTTDPVETAGHVDRMLAAGAS-VSLYMAVGGTNFGWWAGANYDT 303
Query: 328 -------YITSYDYDAPIDEYGLLSE 346
ITSYDYD+PI E G L+E
Sbjct: 304 SKDQYQPTITSYDYDSPIGEAGELTE 329
>gi|395816938|ref|XP_003781939.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase [Otolemur
garnettii]
Length = 669
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 112/351 (31%), Positives = 168/351 (47%), Gaps = 34/351 (9%)
Query: 21 MMMMMMMIHLSCVSSSSASTFF----KPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPE 76
+++ M + L + +SA F K F + Y + DG IS IHY R
Sbjct: 4 LLVRMFSLLLVPLLLASADGLFNASLKTFKIDYSRDRFLKDGQPFRYISGSIHYSRLPRF 63
Query: 77 MWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIG 136
W D + K K G + I+TYV WN HE G+Y F +D+ F++L GL + LR G
Sbjct: 64 YWKDRLLKMKMAGLNAIQTYVPWNFHEPQPGKYQFSEDHDVEYFIQLAHELGLLVILRPG 123
Query: 137 PYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGGPII 196
PY+CAEW+ GG P WL + + R+++ + + +++ ++ M+ L GGPII
Sbjct: 124 PYICAEWDMGGLPAWLLEKESMILRSSDPDYLAAVDKWLGVLLPKMKP--LLYQNGGPII 181
Query: 197 MLQIENEYGNMESSYGQQGKDYVKW---------AASMALGLGAGV--PWVMCKQTDAPE 245
+Q+ENEYG SY DY+++ + L G+ ++ C
Sbjct: 182 SVQVENEYG----SYFTCDHDYMRFLLKRFRYYLGDDVVLFTTDGIFEKYLNCGALQGLY 237
Query: 246 NIIDACNGYYCDG----YKPNSYNKPTLWTENWDGWYTTWGGRLPHRPV--EDLAFAVAR 299
+D G + + P + +E + GW WG PH V ED+AF++
Sbjct: 238 ATVDFGTGVNITAAFKLQRKSEPKGPLINSEFYTGWLDHWGQ--PHSTVKTEDVAFSLFD 295
Query: 300 FFQRGGSFMNYYMYFGGTNFGRTSGG--PFYI--TSYDYDAPIDEYGLLSE 346
RG S +N YM+ GGTNF +G P+ TSYDYDAP+ E G L+E
Sbjct: 296 ILARGAS-VNLYMFTGGTNFAYWNGANIPYSAQPTSYDYDAPLSEAGDLTE 345
>gi|115361550|gb|ABI95864.1| beta-galactosidase [Planococcus sp. L4]
Length = 552
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 95/304 (31%), Positives = 152/304 (50%), Gaps = 25/304 (8%)
Query: 68 IHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSS 127
+HY R PE W D + K K G + +ETY+ WN HE +GQ++F G DI F++L
Sbjct: 1 MHYFRTVPEQWEDRLQKLKALGLNTVETYIPWNFHEPKKGQFHFSGMADIEGFIELAHRL 60
Query: 128 GLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEML 187
GLY+ LR PY+CAEW GG P WL + R+++ F ++ + +++ + +
Sbjct: 61 GLYVILRPAPYICAEWEMGGLPSWLMKDKNLVLRSSDPAFLGHVEDYFAELLPKFTKHLY 120
Query: 188 FSWQGGPIIMLQIENEYGN----------MESSYGQQGKDYVKWAASMALGLGAGVPWVM 237
GGP+I +QIENEYG ++ Y G + + + + G +
Sbjct: 121 --QNGGPVIAMQIENEYGAYGNDSAYLDFFKAQYEHHGLNTFLFTSDGPDFITQGSMPDV 178
Query: 238 CKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAV 297
+ + ++ D +KP+S P + E W GW+ W G R +D+A
Sbjct: 179 TTTLNFGSRVDESFQA--LDAFKPDS---PKMVAEFWIGWFDYWSGEHTVRSGDDVASVF 233
Query: 298 ARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAPIDEYGLLSEPKWGH 351
++ S +N+YM+ GGTNFG +G Y ITSYDYD+ + E G ++E K+
Sbjct: 234 KEIMEKNIS-VNFYMFHGGTNFGFMNGANHYDIYYPTITSYDYDSLLTEGGAITE-KYKA 291
Query: 352 LKDL 355
+K++
Sbjct: 292 VKEV 295
>gi|256396208|ref|YP_003117772.1| beta-galactosidase [Catenulispora acidiphila DSM 44928]
gi|256362434|gb|ACU75931.1| Beta-galactosidase [Catenulispora acidiphila DSM 44928]
Length = 625
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 101/330 (30%), Positives = 159/330 (48%), Gaps = 30/330 (9%)
Query: 52 RAIIIDGNRRM-------LISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHES 104
R + IDG R + ++SA IHY R P++W D + + + G + +E Y+ WN H+
Sbjct: 5 RVLTIDGGRFLRGGREHRIVSAAIHYFRIHPDLWRDRLQRLRAMGCNTVECYIAWNFHQP 64
Query: 105 IRGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNN 164
F G D+ FV+L G G + R GPY+CAEW+FGG P WL + RT +
Sbjct: 65 TPAAPRFDGWRDVAGFVRLAGELGFDVIARPGPYICAEWDFGGLPAWLLADENVRLRTTD 124
Query: 165 APFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESS---YGQQGKDYVKW 221
+ + + +++ ++ E L + +GGP++ +QIENEYG+ + K ++
Sbjct: 125 PVYLAAVDAWFDELIPVLAE--LQATRGGPVVAVQIENEYGSFGADPDYLDHLRKGLIER 182
Query: 222 AASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCD----GYKPNSYNKPTLWTENWDGW 277
L G +M P+ + G D + + P + E W+GW
Sbjct: 183 GVDTLLFTSDGPQELMLAGGTVPDVLATVNFGSRADEAFATLRRVRPDDPPVCMEFWNGW 242
Query: 278 YTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFG---------RTSGGPFY 328
+ +G R +D A ++ GGS +N+YM GGTNFG +G P Y
Sbjct: 243 FDHFGEPHHTRSAQDAARSLDEILAAGGS-VNFYMGHGGTNFGFWAGANHSGVGTGDPGY 301
Query: 329 ---ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
ITSYDYDAP+ E G L+ PK+ +++
Sbjct: 302 QPTITSYDYDAPVGEAGELT-PKFHLFREV 330
Score = 41.2 bits (95), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 44/162 (27%), Positives = 69/162 (42%), Gaps = 16/162 (9%)
Query: 538 TVTIDSMRDVLRVFINGQLTGSVIGHWVKVVQPVEFQSGYNDLILLSQTVGLQNYGAFLE 597
T+ I+ +RD +VF +G+L G V + ++ DL LL + +G NYG L
Sbjct: 406 TLRIEGVRDRAQVFADGKLLGMVERDIPERTLDLQIPDEGLDLELLVEPLGRVNYGPHL- 464
Query: 598 KDGAGFRGQVKLT-GFKNGDIDLSKILWTYQVGLKGEFQQIYSIEENEAEWTDLTRDGIP 656
D G G V+L F+ G W ++V + ++E EA + T P
Sbjct: 465 ADRKGLIGGVRLDHQFQFG--------WEHRVLPLDDPTGALALENQEAVTANQTAG--P 514
Query: 657 STFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYW 698
+ T + DG L + S + W+NG +GR W
Sbjct: 515 AFHRAAITVREPADGF----LAVPSTARSLVWLNGFLLGRLW 552
>gi|348573621|ref|XP_003472589.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
3-like [Cavia porcellus]
Length = 679
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 109/320 (34%), Positives = 156/320 (48%), Gaps = 33/320 (10%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G++ ++ IHY R E W D + K K G + + TY+ WN HE RG++ F G
Sbjct: 104 LEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYIPWNLHEPQRGKFVFSGNL 163
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ FV L GL++ LR GPY+CAE + GG P WL P + RT F + + +
Sbjct: 164 DLEAFVLLAAEIGLWVILRPGPYICAEIDLGGLPSWLLQNPKTQLRTTERTFVDAVDAYF 223
Query: 176 KKIVDLMREEMLFSW-QGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVP 234
LMR + + GGP+I +Q+ENEYG S+ + G+ Y+ + L G
Sbjct: 224 D---HLMRRMVPLQYHHGGPVIAVQVENEYG----SFNRDGQ-YMAYLKEALLKRGIVEL 275
Query: 235 WVMC-----------KQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDGWYTTWGG 283
C K A N+ + + S+ KP L E W GWY +WG
Sbjct: 276 LFTCDYYKDVVNGSLKGVLATVNLGSLGKNSFYQLLQVQSH-KPILIMEYWVGWYDSWG- 333
Query: 284 RLPH--RPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGR------TSGGPFYITSYDYD 335
LPH + ++A V+ F + G SF N YM+ GGTNFG G TSYDYD
Sbjct: 334 -LPHANKSAAEVAHTVSTFIKNGISF-NVYMFHGGTNFGFINAAGIVEGRRSVTTSYDYD 391
Query: 336 APIDEYGLLSEPKWGHLKDL 355
A + E G +E K+ L++L
Sbjct: 392 AVLSEAGDYTE-KYFKLREL 410
>gi|358415935|ref|XP_600640.6| PREDICTED: uncharacterized protein LOC522360 [Bos taurus]
Length = 1360
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 100/305 (32%), Positives = 154/305 (50%), Gaps = 28/305 (9%)
Query: 56 IDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKN 115
++G+ +++ +HY R W D + K + G + + TYV WN HE RG ++F G
Sbjct: 323 LEGHEFLILGGSVHYFRVPRASWRDRLLKLRACGFNTVTTYVPWNLHEPERGTFDFSGNL 382
Query: 116 DIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFV 175
D+ F+ L GL++ LR GPY+C+E + GG P WL P + RT N F + ++
Sbjct: 383 DLEAFILLAEEVGLWVILRPGPYICSEMDLGGLPSWLLQDPTSQLRTTNRSFVNAVNKYF 442
Query: 176 KKIVDLMREEMLFSWQGGPIIMLQIENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPW 235
++ R +L QGGPII +Q+ENEYG + + + Y+ + G G
Sbjct: 443 DHLIP--RVALLQYLQGGPIIAVQVENEYG-----FFYKDEAYMPYLLQALQQRGIGGLL 495
Query: 236 VMCKQTDAP-----ENIIDACN--GYYCDGYK---PNSYNKPTLWTENWDGWYTTWGGRL 285
+ T+ + ++ + N G+ D +K +KP L E W GW+ TWG +
Sbjct: 496 LTADSTEEVMRGHIKGVLASINMKGFKVDSFKHLYKLQRHKPILIMEFWVGWFDTWG--I 553
Query: 286 PHR--PVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY------ITSYDYDAP 337
HR V ++ +V+ F + G SF N YM+ GGTNFG +G + TSYDYDA
Sbjct: 554 DHRVMGVNEVEKSVSEFIRYGISF-NVYMFHGGTNFGFMNGATSFEKHRGVTTSYDYDAV 612
Query: 338 IDEYG 342
+ E G
Sbjct: 613 LTEAG 617
>gi|423278914|ref|ZP_17257828.1| hypothetical protein HMPREF1203_02045 [Bacteroides fragilis HMW
610]
gi|404585906|gb|EKA90510.1| hypothetical protein HMPREF1203_02045 [Bacteroides fragilis HMW
610]
Length = 769
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 105/324 (32%), Positives = 153/324 (47%), Gaps = 19/324 (5%)
Query: 46 NVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESI 105
N + +++G + +A +HY R W I K G + I YVFWN HE
Sbjct: 20 NFTIGKNTFLLNGKSFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHEQT 79
Query: 106 RGQYNFKGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 165
G+++F G+NDI F +L G+Y+ +R GPYVCAEW GG P WL I RT +
Sbjct: 80 EGKFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTLDP 139
Query: 166 PFKEEMQRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYG--NMESSYGQQGKDYVKWAA 223
F E F+K++ + + +GG IIM+Q+ENEYG ++ Y +D VK A
Sbjct: 140 YFMERTAIFMKEVGKQLAPLQIT--RGGNIIMVQVENEYGAYAVDKPYVSAIRDIVKSAG 197
Query: 224 SMALGLGAGVPWVMCKQTDAPENIIDACN-------GYYCDGYKPNSYNKPTLWTENWDG 276
+ L W + ++++ N K + P + +E W G
Sbjct: 198 FTEVPLFQ-CDWSSTFDRNGLDDLLWTINFGTGANIEQQFKRLKEARPDTPLMCSEFWSG 256
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITS 331
W+ WG + RP + + + R SF + YM GGT FG G P Y +S
Sbjct: 257 WFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNPAYSAMCSS 315
Query: 332 YDYDAPIDEYGLLSEPKWGHLKDL 355
YDYDAPI E G ++ K+ L+DL
Sbjct: 316 YDYDAPISEPGWATD-KYFQLRDL 338
>gi|373955175|ref|ZP_09615135.1| glycoside hydrolase family 35 [Mucilaginibacter paludis DSM 18603]
gi|373891775|gb|EHQ27672.1| glycoside hydrolase family 35 [Mucilaginibacter paludis DSM 18603]
Length = 600
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 110/362 (30%), Positives = 175/362 (48%), Gaps = 43/362 (11%)
Query: 23 MMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRATPEMWPDLI 82
M + +S + SS + + S A ++DG +IS +H R W I
Sbjct: 1 MKKFFLIISFILSSGLAISQQKHVFSLGKSAFLLDGKPFQIISGELHPARIPKMYWRHRI 60
Query: 83 AKSKEGGADVIETYVFWNAHESIRGQYNFKGKN-DIVKFVKLVGSSGLYLQLRIGPYVCA 141
+K G + I Y+FWN HE +G ++F +N +IV F+++ G+++ LR GPYVCA
Sbjct: 61 QMAKAMGCNTIAAYIFWNYHEQQKGVFDFTTENRNIVDFIRMCQEEGMWVLLRPGPYVCA 120
Query: 142 EWNFGGFPVWLRDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEM--LFSWQGGPIIMLQ 199
EW+FGG P +L IP I+ R + + E+ R+ VD++ +++ L GGPIIM+Q
Sbjct: 121 EWDFGGLPPYLLSIPDIKLRCMDPRYIAEVTRY----VDVLSQQVKNLQCTSGGPIIMVQ 176
Query: 200 IENEYGNMESSYGQQGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDG- 258
+ENEYG+ + ++Y+K + + G VP+ D P + G DG
Sbjct: 177 VENEYGSYAND-----REYIKTLRGLWVKNGINVPFYTA---DGPAAFMLEAGG--VDGA 226
Query: 259 ---------------YKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLAFAVARFFQR 303
+ + P+ +E++ GW T W + + + V +
Sbjct: 227 AIGLDSGSGDADFELAAKQNPDVPSFSSESYPGWLTHWKEKWQKPGTDGILKDVTYLLEH 286
Query: 304 GGSFMNYYMYFGGTNFGRTSGGPFY--------ITSYDYDAPIDEYGLLSEPKWGHLKDL 355
SF N Y+ GGTNFG +G + +TSYDYDAPI+E G + PK+ L++L
Sbjct: 287 QKSF-NLYVINGGTNFGYNAGANAFTPTQFQPDVTSYDYDAPINERGEPT-PKYYALRNL 344
Query: 356 HA 357
A
Sbjct: 345 IA 346
>gi|423248537|ref|ZP_17229553.1| hypothetical protein HMPREF1066_00563 [Bacteroides fragilis
CL03T00C08]
gi|423253485|ref|ZP_17234416.1| hypothetical protein HMPREF1067_01060 [Bacteroides fragilis
CL03T12C07]
gi|392657385|gb|EIY51022.1| hypothetical protein HMPREF1067_01060 [Bacteroides fragilis
CL03T12C07]
gi|392659750|gb|EIY53368.1| hypothetical protein HMPREF1066_00563 [Bacteroides fragilis
CL03T00C08]
Length = 773
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 106/324 (32%), Positives = 157/324 (48%), Gaps = 31/324 (9%)
Query: 52 RAIIIDGNRRMLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNF 111
R +++GN ++ +A +HY R W I K G + I Y+FWN HE G+++F
Sbjct: 31 RTFLLNGNPFVVKAAELHYARIPEPYWEHRILMCKALGMNTICLYMFWNYHEQQEGKFDF 90
Query: 112 KGKNDIVKFVKLVGSSGLYLQLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEM 171
G+ ++ KF KL G+Y+ LR GPY CAEW GG P WL ++ R+ N F E
Sbjct: 91 SGEKNVAKFCKLAQKHGMYIILRPGPYACAEWEMGGLPWWLLKEKDMKVRSLNPYFMERT 150
Query: 172 QRFVKKIVDLMREEMLFSWQGGPIIMLQIENEYG--NMESSYGQQGKDYV---------- 219
+ F+K++ + L + GG IIM+Q+ENE+G ++ Y +D V
Sbjct: 151 EIFMKELGKQLAPLQLAN--GGNIIMVQVENEFGGYGVDKPYMTAIRDIVCRAGFDKSVL 208
Query: 220 ---KWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGYKPNSYNKPTLWTENWDG 276
W ++ L + W + T A NI +P++ P + +E W G
Sbjct: 209 FQCDWDSTFELNALDDLLWTLNFGTGA--NIDKEFKK--LSTVRPDT---PLMCSEFWSG 261
Query: 277 WYTTWGGRLPHRPVEDLAFAVARFFQRGGSFMNYYMYFGGTNFGRTSGG--PFY---ITS 331
W+ WG + RP E + + R SF + YM GGT FG G P Y +S
Sbjct: 262 WFDHWGRKHETRPAEKMVEGIKDMLDRNISF-SLYMTHGGTTFGHWGGANSPTYSAMCSS 320
Query: 332 YDYDAPIDEYGLLSEPKWGHLKDL 355
YDYDAPI E G + PK+ L++L
Sbjct: 321 YDYDAPISEAGWTT-PKYYLLQEL 343
Score = 41.2 bits (95), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 30/116 (25%), Positives = 47/116 (40%), Gaps = 31/116 (26%)
Query: 656 PSTFTWYKTYFDAPDGIDPVALDLGSMGKGQAWVNGHHIGRYWTVVAPKGGCQDTCDYRG 715
P +YK F+ D +D+ + GKG WVNGH +GR+W +
Sbjct: 523 PEGPAYYKATFNLTKTGD-TFIDMSTWGKGMVWVNGHALGRFWEI--------------- 566
Query: 716 AYNSDKCTTNCGNPTQTWYHVPRSWLQASNNLLVIFEETGGNPFEISVKLRSTRIV 771
P QT + +P WL+ N +++ + G P E VK T ++
Sbjct: 567 ------------GPQQTLF-LPGCWLKKGKNEIIVLDLKG--PSEAVVKGLKTPVL 607
>gi|157106611|ref|XP_001649403.1| beta-galactosidase [Aedes aegypti]
gi|108879822|gb|EAT44047.1| AAEL004580-PA [Aedes aegypti]
Length = 656
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 109/356 (30%), Positives = 172/356 (48%), Gaps = 37/356 (10%)
Query: 15 ALSVYPMMMMMMMIHLSCVSSSSASTFFKPFNVSYDHRAIIIDGNRRMLISAGIHYPRAT 74
A+ V ++ + + + L S + + F + YD ++DG ++ HY RA
Sbjct: 13 AVIVVAIVGLTVGLVLGLDDSGVKNEEGRSFTIDYDRDTFVMDGKDFRYVAGSFHYFRAL 72
Query: 75 PEMWPDLIAKSKEGGADVIETYVFWNAHESIRGQYNFKGKNDIVKFVKLVGSSGLYLQLR 134
P+ W + + GG + ++ YV W+ H QY + G +I ++ + LY+ LR
Sbjct: 73 PQTWRTKLKTLRAGGLNAVDLYVQWSLHNPKENQYVWDGIANIKDVIEAAIEADLYVILR 132
Query: 135 IGPYVCAEWNFGGFPVWL-RDIPGIEFRTNNAPFKEEMQRFVKKIVDLMREEMLFSWQGG 193
GPY+CAE + GG P WL PGI+ RT++A + +E+ + +K++ + M + GG
Sbjct: 133 PGPYICAEIDNGGLPYWLFTKYPGIQVRTSDANYLKEVATWYEKLMSQLTPYMYGN--GG 190
Query: 194 PIIMLQIENEYGNMESSYGQQGKDYV--------KWAASMALGLGAGVPW---VMCKQTD 242
PIIM+Q+ENEYG ++G+ K Y+ K+ A+ P+ + C Q
Sbjct: 191 PIIMVQLENEYG----AFGKCDKPYLNFLKEETEKYTQGKAVLFTVDRPYGNEMECGQ-- 244
Query: 243 APENIIDACNGYYCD--------GYKPNSYNKPTLWTENWDGWYTTWGGRLPHRPVEDLA 294
P + G D + N P + TE + GW T W RP E LA
Sbjct: 245 VPGVFVTTDFGLMTDEEVDTHKAKLRSVQPNGPLVNTEFYTGWLTHWQESNQRRPAEPLA 304
Query: 295 FAVARFFQRGGSFMNYYMYFGGTNFGRTSGGPFY--------ITSYDYDAPIDEYG 342
+ + G + +++YMYFGGTNFG +G + ITSYDYDAP+DE G
Sbjct: 305 NTLRKMLHDGWN-VDFYMYFGGTNFGFWAGANDWGLGKYMADITSYDYDAPMDEAG 359
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.135 0.431
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,785,302,517
Number of Sequences: 23463169
Number of extensions: 677830159
Number of successful extensions: 1307633
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2162
Number of HSP's successfully gapped in prelim test: 149
Number of HSP's that attempted gapping in prelim test: 1293805
Number of HSP's gapped (non-prelim): 5086
length of query: 849
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 697
effective length of database: 8,792,793,679
effective search space: 6128577194263
effective search space used: 6128577194263
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 82 (36.2 bits)